Cp4.1LG06g05810 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG06g05810
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionTrypsin family protein
LocationCp4.1LG06 : 3521595 .. 3527871 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CACTGTGGACACGTCTCCCTACCATCGCTTAACCTTCGTTTGCTTAGCGAATTAAAAAATGCGCGCACTTACTCCCTTAACTTTACCTTTTTCGTGTAAAGTCTTCTCTTTCAAACCTTCACTTTTCAATGTTTATGCGCAGAAGAAGCCTCCGTCCTTGTCTTTTCTCTGCTCCAGCTGAGCTCAGCTGCAGTTTTCTGTCTATATTCCTGTCTTTGACCAAAATCCATGCTCTAATTTTGCTTAAAGCTTCAATGGGTTAGCCGTTTTTCAAACCATTTCACCCTAAACTCAGCTCCCTGTGCTATCCTCTTCAAATTCCTCATTGCTTGGAGTGAGTATCAGCTCTCCTTTTCCCTTTTGATTGTGATATCTAGTTTGCCCCTTTGTTGTTTACTTGGTTATGCGTTGATTTGATTGTTTCTGATGGTTTGATTGCTGGGTTATGCGTTGATTTAGGTATTATGATTGGTAGGATGAACTCCAGTTCTTAATTATTTCATCCTTGATTCGTTGGTGTTAATACTTTAGGTGTTGAACTTCATGAATGAAATTAACATATGAACTTACATGTGAGGTAGTGTTATAAAAGATTGTGTATGATTCATCGGTTAGATCACACGTCGGGTTGGAGAGGGGAACGGAGTATTCCTTGTAAGGGTGTGGAAACCTTTCCTTAATAGACGTGATTTAAAACCGTGAGGCTGATGGCGATACGTAAGAAGACAAAGCGGATATAGCTATTAGCGGTGGGTTTGGGTTGTTACAAATGGTATTAGAGGCAGACACTGGGCCCCAAAGGGGGTGGATTGTTAGATCCCATATTAGTTGTAGAGGAGAACGGAGCATTCCTTATAAGGGCGTGGACACCTTTTCTTAACAGACGCGTTTTAAAACCGTGAGGCTGATGACGATACGTAACAAGCCAAAGTGGATAATAGCTATTAGCGGTGGGTTTGGGTTGTTACAAATGGTGTTAGAGGCAGACACTGGCCCCAAAGGGGTGGATTGTTAGATCCCATATTAGTTGGAGAGGGGAACGGAGCATTCCTTATAAGGGTGTGGAAACCTCTCCCTAACAGGCACGTTTTAAAACCGTGTGGCTGATGACGATATGTAATTGGCCAAAGTGGATAATATTTACTAGCGGTGGGTTTGAGGTGTTATAAATGGTATTAGAGCCAGACACTGGGCGGTGTGCCAGCGAGGATGCTGGTCCCCAAGAGGAGTGGATTGTTAAATCCCACATTGGTTGTAGATGGGAATAGAGCATTTCTTATATAGGTGTGGAAATTTCTCCCTAATAGACGCGTTTTAAAACCATGAAGCTGACGACGATACGTAATAGACTAAAGCGGAAAATATCTACTAGTGGTGGATTTGGGCTGTTACAAATGGTATCAGAGGCAGACACTGAGCGGTGTGTTAGCGAGGACGTTGGCCCCCAAGAAGGTGGATTCTTAGATCCCACATCGATTGGAGAGGGGAACAGAGCATTCCTTATAAGGGTGTGAAAACCTCTCCCTAACAGACGCATTTTAAAATCATGAGGCTGACGATGATACATAACGGGCCAAAACGGACAATATCTACTAGCGGTGGGTTTGAGCTGTTACAAATGGTATCAGAGCCAAACACTGGGCGGTGTGCCAGCGTGGACGTTGGGCCCCCAAGGGGGTGGATTGTTAGATCCCACGTCGGTTGGAGAGGGGAACGGAGCATTTCTTATAAGGGTGTGGAAACCTCTCCCTTGATGATAGTTTTACGGGCTCTTCCTTGTCGTTCCATCAACCGGTTTGATTGGTTATTTTCTTTTCTTTTTCTATTTGTATCATTAGGCGATAATGGAGCAAACTAGACGCCATAACAGGATTAACTGCTCTGGTTCAACTCCATCAGAGGAATCAGCTCTGAATCTTGAAAGAAATGGCTGCAGTCACTCAAATTTACCTTCTTTCAGCTCACCTACACTCCAGCCATTTGCATCTGCTGGACAGCATTGTGAGAGCAATGCAGCTTACTTTTCATGGCCGACCCCTATCCGAATAAGCGTCGGTACGGAGGAGCGGGCAAACTATTTTGCAAATCTTCAAAAAGGAGTGCTACCCGATATCCTTCATCCGTTACCAAAAGGGCAGCGGGCAAACACGTTACTCGAGCTCATGACGATACGAGCCTTCCATAGCAAGATCCTGCGTTGTTACAGTCTCGGAACGGCTATTGGATTCCGTATTCGAAAGGGCGTGCTGACCGACATTCCTGCTATTCTTGTTTTTGTTTCCAGGAAAGTTCATAAGCAATGGCTTAGTCCTATTCAATGCCTGCCCACTGCCCTGGAGGTCTGAAGCTGAAGTCCTTTTACTTTGTATTCAAGTTTCTTCCTTACCTCATGGTTGCAAAGTAGGAGTGTTTTTAGTTTCACTTTGATCTTTAATGAGTATGCTGGATTCTTTTCAGGGGCCGGGCGGTGTGTGGTGTGATGTGGATGTTGTAGAATTTTCGTATTTCGGTGCACCGAACCCTGCTCCTAAAGAACAGTTGTATACCGAAATTGTCGATGATCTGCGTGGCTCTGATCCGTGCATTGGCTCGGGTTCACAGGTACTTTCGTGTTTCTTTGCAACTAGGGACCTAAAAACTTGCTATTATATGGAAAAAGACATCTAAACGGCTTAAGAAAAGGGGGAAAGTTTGAGAGAACGATTGGCAACGGGCACGAAATGTGAGATTTTCTACTAATCATGTTCGTTTCGACTTGAATCGTGAGCCCCTTTATTGGAAGCAACTAATATTTGATTTTCAAAATGCCGTATTTGTGAGATCCCACATTGGTTAGAGAGGGGAATGAAGCATTCCTTACAAGGGTGTGGAAACCTCTCCCCAACATATGCATTTTAAAACCGTGAGGTTGACGGCGATACATAACGGGTCAAGTATCTACTAGCGGTAGGCTTGGGTTGTTACAAATGGTATCAGAGCTGGTCAACGAGCGGTGTGCCAGCGAGGACACTGAGCCCCCAACGGGGGTGGATTGTGAGAAACTTCATTCGTTGGAGAGGGGAACGAAGCATTCCTTGTAAGGGTGTGGAAACCTCTCCCTAACATATGCGTATTATAATTGTGAGGCTGACGGTGATATGTAACGGGCTAAAGTGGATAATATCTACTAGCAGTGGACTTGGGCTGATGCAAATGGTATCAGAGCCAGTCACTGAGCGGTGTGCCAGCGTGGACACTGAGCCCTCAACGGGGTGGATTGTGAGATCCCTCATTGGTTGGAGAGGGGAACGAAACATTCCTTATAAGGGTGTGGAAACCTCTCCCTAACATATGGGCTTTAAAATTGTGAGGCTGATGGCGATATGTAACGGGCCAAAGCGGACAATATCTACTAGCGGTGGACTTGGGCTGATGCAAATGGTATCAGAGCCAGTCACCAACCGGTGTGCCAGCGAGGACACTGAGCCCCCAACGGGGTGGATTGTGAGATCCCTCATTGGTTGGAGAGGGGAACGAAGCATTCTTTATAAGGGTGTGGAAACCTCTCCCTAGCAGATGCATTTTAAAACCATGAAGCTGATGGTCATACGTAGCGAGCTAAAACGGACAATATCTACTAGCGTGGGCTTGGGCTGTTACAGCATCACACTAGGATTAAGAACACTTGTTGATACTTCCCGGTTATAATCGTTTCTAATGTTATTACATGAAGTAGCCCTATACTTGAAATTGAGGGATACGTCTTGAGGCACGGTGTGCCTCTTAGTATCACAGTCATGTTTGTCCGGGGCATGTTGCCTGTGGCTGCATGCCCATGTCATGCCAAACCCTTTATTTTCTTTCACAAAGTGCGATTGTGACACCTAGCAACCGTTTATTTGAATTTAATTCCACCTCATCAGGTGGCCAGCCAGGAGACTTACGGAACCTTGGGCGCTATAGTAAGGAGTCAAACGGGCAGTCGTCAAGTTGGTTTTCTCACAAACCGTCATGTCGCTGTTGATTTAGATTATCCGAACCAGAAGATGTTTCATCCTCTTCCACCGACACTCGGGCCCGGGGTGTATCTTGGTGCTGTAGAGAGAGCTACTTCGTTCATCACGGACGAGCTTTGGTACGGAATTTTTGCTGGCATAAATCCAGGTAATACAGTTCATGCCAAAACAAAAGATCATTAGATAGTGACAGTGTTATTGTTCCATTTTCATCTCGCCAAACGTTAATAGCTCGGGTGCACGTATTCTAATTTTGTTATATTGCAGAGACGTTTGTACGGGCAGATGGGTCGTTTATTCCTTTTGCTGATGATTTCGACATGTCCACTGTCACTACATCTGTAAAAGGTGTTGGAGAGATCGGTGACGTGAAGTTCATCGACTTGCAGTCGCCTATCAGTACGCTCATAGGGAAGCAGGTCGTGAAAGTTGGAAGAAGTTCTGGCTTGACGACAGGAACTGTGTTGGCCTATGCTCTCGAGTACAATGACGAGAAAGGAATATGCTTCTTAACCGATTTTCTCGTCGTCGGTGAGAATCAACAGACTTTTGATCTCGAAGGAGATAGCGGTAGCCTCATCATTTTAAAGGGTGAGAATAGAGAGAGTTTGAAACCGATCGGGATCATATGGGGTGGAACGGCTAACCGGGGTCGACTTAAGTTAAAAGTCGGGCAACCTCCTGAGAATTGGACGAGTGGGGTCGATCTCGGGCGCCTTCTCAACCTGCTTGAACTTGACCTGATCACAAGTGATGAAGGGCTCAAAGGTTAGGACTTTAAAAGCTTCTCCTATCCATTCTCTCAATGTCTTACAAATTGCTTTAATCTATTGTAGCATTTGTGTTTCTTTTCTTTCTTTTGAAGTGTTGTTGAAGACTTTCTTCCAATTATGTTAGGGATAAAGTACTGTGTGCCTACGTACACGTAACAATCCTCTCCGACCCTTAAACCACCCCTCAAACTCATTGGAGTCTTTCTCTGTAACAACTCAAGCCCACCGCTAACAGATATTGTCCGCTTTGGCCCGTTACATATCGCCATCAGCCTCGCGGTTTTAAAACGCGGATGTTAGGGAGAGGTTTCCACACCCTTGTAAGGAATGCTTCGTTCCCCTCTCTAACTAATGTGGGATCTCACAATCCACCCCCCTCGGGAGTCCAGCGTGCTCACTGGCACACCGTTCAATGTCTGTCTCTGATATCATTTGTAACAACCTAAGCCCACCGTTAGCAAATATTGTCGATTTTAGCCTGTTACGTATCATCGTCGGGCTCACTGTTTTAAAACGCGTCTACTAGTGAGAGGTTTCCACACCCTTATAAGGAATGTTTCGTTCCCCTCTCCAACCAATGTAGGATCTCACGTTCTCTTATCTGGAATGACTCATTTTCTGTCTAGTAGAATATGCGTTTGATAACACTTTAGTATGGATCACGACGTTCGATAAATGACTTTGAGATCTCACATCGGTTGGTAAGGAGAACGAAGCGTTCTTTATAAGGGTGTGGAAACCTCTCCCTACCAAACACGTTTTAAAAACCTTGAGGGGAAGCCCGGAAGGGAAATGCCAAAGAGGACAATATCTGCTAGCGGTAGGCTTGGGCTAGTTTCACCATTTTGGCCATGTTCTGATAAATATTTTTGTTATGTTTCTTCATTACTATATGCAGCGGCAGTGCAAGAGCAAAGAACCGTTTCAGCGACGGTTATCGGCTCGATTGTTGGAGACTCCTCTCCTCCCGACACAACGCTACCAAGGGAGAAGAGCGAAGAGAAGTTCGAGCCATTGGGTTTTCAGATCCAGCATATGCCTACAGAAGTAGAACCTTCTTCAGCTAAAGACCAATCGTCTCTCCTGGAGACCGAGTTTCATCTCGAAGCTGGAACGAACACGGCTCCCAGTGTAGAACATCAGTTCATTCCAAGCCTCTTCAGTTGTTCTCCCTCCCATCAAAACAGCTCTCTGGTTCATGCCGTTTCCCAGAACCTATCTTTGCTTCGGAACGACTGCGAAGATATTTGCGTCTCGTTGCAATTGGGCGACCACGAAGCTAAGAGACAGCGCTTGGATGGTTCTGTTTCCATGGAAGAACTGAAATAGATCGTCCATGTAAGAACTGTTGGTGTTTGTTAAGATTTTCAGTTCACTTGCTCAAACTGAAATAACTGTATTTTGAGCTGTATGCATTCCCTTATGTTCGAGTATATTAATACACGAGAACAGAGACATGTCGAGAATTTGAGTCATAATTTCAAAAATACCCTTTTAAGTTTCAAAAGT

mRNA sequence

CACTGTGGACACGTCTCCCTACCATCGCTTAACCTTCGTTTGCTTAGCGAATTAAAAAATGCGCGCACTTACTCCCTTAACTTTACCTTTTTCGTGTAAAGTCTTCTCTTTCAAACCTTCACTTTTCAATGTTTATGCGCAGAAGAAGCCTCCGTCCTTGTCTTTTCTCTGCTCCAGCTGAGCTCAGCTGCAGTTTTCTGTCTATATTCCTGTCTTTGACCAAAATCCATGCTCTAATTTTGCTTAAAGCTTCAATGGGTTAGCCGTTTTTCAAACCATTTCACCCTAAACTCAGCTCCCTGTGCTATCCTCTTCAAATTCCTCATTGCTTGGAGCGATAATGGAGCAAACTAGACGCCATAACAGGATTAACTGCTCTGGTTCAACTCCATCAGAGGAATCAGCTCTGAATCTTGAAAGAAATGGCTGCAGTCACTCAAATTTACCTTCTTTCAGCTCACCTACACTCCAGCCATTTGCATCTGCTGGACAGCATTGTGAGAGCAATGCAGCTTACTTTTCATGGCCGACCCCTATCCGAATAAGCGTCGGTACGGAGGAGCGGGCAAACTATTTTGCAAATCTTCAAAAAGGAGTGCTACCCGATATCCTTCATCCGTTACCAAAAGGGCAGCGGGCAAACACGTTACTCGAGCTCATGACGATACGAGCCTTCCATAGCAAGATCCTGCGTTGTTACAGTCTCGGAACGGCTATTGGATTCCGTATTCGAAAGGGCGTGCTGACCGACATTCCTGCTATTCTTGTTTTTGTTTCCAGGAAAGTTCATAAGCAATGGCTTAGTCCTATTCAATGCCTGCCCACTGCCCTGGAGGGGCCGGGCGGTGTGTGGTGTGATGTGGATGTTGTAGAATTTTCGTATTTCGGTGCACCGAACCCTGCTCCTAAAGAACAGTTGTATACCGAAATTGTCGATGATCTGCGTGGCTCTGATCCGTGCATTGGCTCGGGTTCACAGGTGGCCAGCCAGGAGACTTACGGAACCTTGGGCGCTATAGTAAGGAGTCAAACGGGCAGTCGTCAAGTTGGTTTTCTCACAAACCGTCATGTCGCTGTTGATTTAGATTATCCGAACCAGAAGATGTTTCATCCTCTTCCACCGACACTCGGGCCCGGGGTGTATCTTGGTGCTGTAGAGAGAGCTACTTCGTTCATCACGGACGAGCTTTGGTACGGAATTTTTGCTGGCATAAATCCAGAGACGTTTGTACGGGCAGATGGGTCGTTTATTCCTTTTGCTGATGATTTCGACATGTCCACTGTCACTACATCTGTAAAAGGTGTTGGAGAGATCGGTGACGTGAAGTTCATCGACTTGCAGTCGCCTATCAGTACGCTCATAGGGAAGCAGGTCGTGAAAGTTGGAAGAAGTTCTGGCTTGACGACAGGAACTGTGTTGGCCTATGCTCTCGAGTACAATGACGAGAAAGGAATATGCTTCTTAACCGATTTTCTCGTCGTCGGTGAGAATCAACAGACTTTTGATCTCGAAGGAGATAGCGGTAGCCTCATCATTTTAAAGGGTGAGAATAGAGAGAGTTTGAAACCGATCGGGATCATATGGGGTGGAACGGCTAACCGGGGTCGACTTAAGTTAAAAGTCGGGCAACCTCCTGAGAATTGGACGAGTGGGGTCGATCTCGGGCGCCTTCTCAACCTGCTTGAACTTGACCTGATCACAAGTGATGAAGGGCTCAAAGCGGCAGTGCAAGAGCAAAGAACCGTTTCAGCGACGGTTATCGGCTCGATTGTTGGAGACTCCTCTCCTCCCGACACAACGCTACCAAGGGAGAAGAGCGAAGAGAAGTTCGAGCCATTGGGTTTTCAGATCCAGCATATGCCTACAGAAGTAGAACCTTCTTCAGCTAAAGACCAATCGTCTCTCCTGGAGACCGAGTTTCATCTCGAAGCTGGAACGAACACGGCTCCCAGTGTAGAACATCAGTTCATTCCAAGCCTCTTCAGTTGTTCTCCCTCCCATCAAAACAGCTCTCTGGTTCATGCCGTTTCCCAGAACCTATCTTTGCTTCGGAACGACTGCGAAGATATTTGCGTCTCGTTGCAATTGGGCGACCACGAAGCTAAGAGACAGCGCTTGGATGGTTCTGTTTCCATGGAAGAACTGAAATAGATCGTCCATGTAAGAACTGTTGGTGTTTGTTAAGATTTTCAGTTCACTTGCTCAAACTGAAATAACTGTATTTTGAGCTGTATGCATTCCCTTATGTTCGAGTATATTAATACACGAGAACAGAGACATGTCGAGAATTTGAGTCATAATTTCAAAAATACCCTTTTAAGTTTCAAAAGT

Coding sequence (CDS)

ATGGAGCAAACTAGACGCCATAACAGGATTAACTGCTCTGGTTCAACTCCATCAGAGGAATCAGCTCTGAATCTTGAAAGAAATGGCTGCAGTCACTCAAATTTACCTTCTTTCAGCTCACCTACACTCCAGCCATTTGCATCTGCTGGACAGCATTGTGAGAGCAATGCAGCTTACTTTTCATGGCCGACCCCTATCCGAATAAGCGTCGGTACGGAGGAGCGGGCAAACTATTTTGCAAATCTTCAAAAAGGAGTGCTACCCGATATCCTTCATCCGTTACCAAAAGGGCAGCGGGCAAACACGTTACTCGAGCTCATGACGATACGAGCCTTCCATAGCAAGATCCTGCGTTGTTACAGTCTCGGAACGGCTATTGGATTCCGTATTCGAAAGGGCGTGCTGACCGACATTCCTGCTATTCTTGTTTTTGTTTCCAGGAAAGTTCATAAGCAATGGCTTAGTCCTATTCAATGCCTGCCCACTGCCCTGGAGGGGCCGGGCGGTGTGTGGTGTGATGTGGATGTTGTAGAATTTTCGTATTTCGGTGCACCGAACCCTGCTCCTAAAGAACAGTTGTATACCGAAATTGTCGATGATCTGCGTGGCTCTGATCCGTGCATTGGCTCGGGTTCACAGGTGGCCAGCCAGGAGACTTACGGAACCTTGGGCGCTATAGTAAGGAGTCAAACGGGCAGTCGTCAAGTTGGTTTTCTCACAAACCGTCATGTCGCTGTTGATTTAGATTATCCGAACCAGAAGATGTTTCATCCTCTTCCACCGACACTCGGGCCCGGGGTGTATCTTGGTGCTGTAGAGAGAGCTACTTCGTTCATCACGGACGAGCTTTGGTACGGAATTTTTGCTGGCATAAATCCAGAGACGTTTGTACGGGCAGATGGGTCGTTTATTCCTTTTGCTGATGATTTCGACATGTCCACTGTCACTACATCTGTAAAAGGTGTTGGAGAGATCGGTGACGTGAAGTTCATCGACTTGCAGTCGCCTATCAGTACGCTCATAGGGAAGCAGGTCGTGAAAGTTGGAAGAAGTTCTGGCTTGACGACAGGAACTGTGTTGGCCTATGCTCTCGAGTACAATGACGAGAAAGGAATATGCTTCTTAACCGATTTTCTCGTCGTCGGTGAGAATCAACAGACTTTTGATCTCGAAGGAGATAGCGGTAGCCTCATCATTTTAAAGGGTGAGAATAGAGAGAGTTTGAAACCGATCGGGATCATATGGGGTGGAACGGCTAACCGGGGTCGACTTAAGTTAAAAGTCGGGCAACCTCCTGAGAATTGGACGAGTGGGGTCGATCTCGGGCGCCTTCTCAACCTGCTTGAACTTGACCTGATCACAAGTGATGAAGGGCTCAAAGCGGCAGTGCAAGAGCAAAGAACCGTTTCAGCGACGGTTATCGGCTCGATTGTTGGAGACTCCTCTCCTCCCGACACAACGCTACCAAGGGAGAAGAGCGAAGAGAAGTTCGAGCCATTGGGTTTTCAGATCCAGCATATGCCTACAGAAGTAGAACCTTCTTCAGCTAAAGACCAATCGTCTCTCCTGGAGACCGAGTTTCATCTCGAAGCTGGAACGAACACGGCTCCCAGTGTAGAACATCAGTTCATTCCAAGCCTCTTCAGTTGTTCTCCCTCCCATCAAAACAGCTCTCTGGTTCATGCCGTTTCCCAGAACCTATCTTTGCTTCGGAACGACTGCGAAGATATTTGCGTCTCGTTGCAATTGGGCGACCACGAAGCTAAGAGACAGCGCTTGGATGGTTCTGTTTCCATGGAAGAACTGAAATAG

Protein sequence

MEQTRRHNRINCSGSTPSEESALNLERNGCSHSNLPSFSSPTLQPFASAGQHCESNAAYFSWPTPIRISVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGSFIPFADDFDMSTVTTSVKGVGEIGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRESLKPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPREKSEEKFEPLGFQIQHMPTEVEPSSAKDQSSLLETEFHLEAGTNTAPSVEHQFIPSLFSCSPSHQNSSLVHAVSQNLSLLRNDCEDICVSLQLGDHEAKRQRLDGSVSMEELK
BLAST of Cp4.1LG06g05810 vs. TrEMBL
Match: A0A0A0L2V0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G607040 PE=4 SV=1)

HSP 1 Score: 1131.3 bits (2925), Expect = 0.0e+00
Identity = 566/603 (93.86%), Postives = 582/603 (96.52%), Query Frame = 1

Query: 1   MEQTRRHNRINCSGSTPSEESALNLERNGCSHSNLPSFSSPTLQPFASAGQHCESNAAYF 60
           MEQTR + RINCSGSTPSEESAL+LERN CSHS+LPSFSSPTLQPFASAGQH   N AYF
Sbjct: 104 MEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYF 163

Query: 61  SWPTPIRISVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCY 120
           SWPTPIR+SVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCY
Sbjct: 164 SWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCY 223

Query: 121 SLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
           SLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS
Sbjct: 224 SLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 283

Query: 181 YFGAPNPAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLT 240
           YFGAPNPAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTG RQVGFLT
Sbjct: 284 YFGAPNPAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLT 343

Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD 300
           NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Sbjct: 344 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD 403

Query: 301 GSFIPFADDFDMSTVTTSVKGVGEIGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVL 360
           G+FIPFADDFDMSTVTTSVKGVG++GDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVL
Sbjct: 404 GAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVL 463

Query: 361 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRESLKPIGIIWGGTAN 420
           AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENR++L+PIGIIWGGTAN
Sbjct: 464 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRDTLQPIGIIWGGTAN 523

Query: 421 RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGD 480
           RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQ TVSATVIGSIVGD
Sbjct: 524 RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGD 583

Query: 481 SSPPDTTLPREKSEEKFEPLGFQIQHMPTEVEPSSAKDQSSLLETEFHLEAGTNTAPSVE 540
           SSPPDTTLP+EKSEEK E LGFQIQHMPTEVEP SAKD+  LLETEFHLE G N APSVE
Sbjct: 584 SSPPDTTLPKEKSEEKSEQLGFQIQHMPTEVEP-SAKDR-PLLETEFHLEPGMNRAPSVE 643

Query: 541 HQFIPSLFSCSPSHQNSSLVHAVSQNLSLLRNDCEDICVSLQLGDHEAKRQRLDGSVSME 600
           HQFIPSLFSCSPSHQNS+L  AVSQNLSLLR+DCED+CVSLQLGDHEAKR+R D SVSME
Sbjct: 644 HQFIPSLFSCSPSHQNSTLDRAVSQNLSLLRSDCEDLCVSLQLGDHEAKRRRSDASVSME 703

Query: 601 ELK 604
           ELK
Sbjct: 704 ELK 704

BLAST of Cp4.1LG06g05810 vs. TrEMBL
Match: M5X0K9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003117mg PE=4 SV=1)

HSP 1 Score: 1006.5 bits (2601), Expect = 1.4e-290
Identity = 504/604 (83.44%), Postives = 537/604 (88.91%), Query Frame = 1

Query: 1   MEQTRRHNRINCSGSTPSEESALNLERNGCSHSNLPSFSSPTLQPFASAGQHCESNAAYF 60
           ME+TR + R+ CSGSTPSEES L+LERN  SHSNLPS S PTLQP+ASAGQHCE++AAYF
Sbjct: 1   MERTRFNMRMRCSGSTPSEESVLDLERNCYSHSNLPSLSPPTLQPYASAGQHCETSAAYF 60

Query: 61  SWPTPIRISVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCY 120
           SWPT  R++   EERANYF NLQKGVLP+ L  LPKGQ+A TLLELMTIRAFHSKILRCY
Sbjct: 61  SWPTSSRLNDAAEERANYFTNLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCY 120

Query: 121 SLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
           SLGTAIGFRIR+GVLTDIPAILVFV+RKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS
Sbjct: 121 SLGTAIGFRIRRGVLTDIPAILVFVARKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180

Query: 181 YFGAPNPAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLT 240
           YFGAP PAPKEQLYTEIVDDLRG DPCIGSGSQVASQETYGTLGAIVRSQTG+RQVGFLT
Sbjct: 181 YFGAPEPAPKEQLYTEIVDDLRGGDPCIGSGSQVASQETYGTLGAIVRSQTGNRQVGFLT 240

Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD 300
           NRHVAVDLDYPNQKMFHPLPP+LGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Sbjct: 241 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD 300

Query: 301 GSFIPFADDFDMSTVTTSVKGVGEIGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVL 360
           G+FIPFADDFDM TV TSVKGVGEIG+VK IDLQSPISTLIGKQV+KVGRSSGLTTGTVL
Sbjct: 301 GAFIPFADDFDMCTVITSVKGVGEIGNVKIIDLQSPISTLIGKQVMKVGRSSGLTTGTVL 360

Query: 361 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRESLKPIGIIWGGTAN 420
           AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGEN E  +PIGIIWGGTAN
Sbjct: 361 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENGEKPRPIGIIWGGTAN 420

Query: 421 RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGD 480
           RGRLKLK+GQPPENWTSGVDLGRLL LLELDLIT+DEG+K AVQEQRT SAT IGS VGD
Sbjct: 421 RGRLKLKIGQPPENWTSGVDLGRLLKLLELDLITTDEGVKVAVQEQRTASATAIGSTVGD 480

Query: 481 SSPPDTTLPREKSEEKFEPLGFQIQHMPTEVEPSSAKDQSSLLETEFHLEAGTNTAPSVE 540
           SSPPD  LP+E+ EEKFE LG QIQH+P E EPSS+    SL+ETEFHLE G    PSVE
Sbjct: 481 SSPPDGMLPKERPEEKFESLGLQIQHIPLEAEPSSS---LSLVETEFHLEDGIKAVPSVE 540

Query: 541 HQFIPSLFSCSPSHQNSSLVHAVSQNLSLLRNDC-EDICVSLQLGDHEAKRQRLDGSVSM 600
           HQFIPS    SP H+ + +   VS+NLS LRN C EDIC SLQLGD+EAKR+R   S S 
Sbjct: 541 HQFIPSFLGGSPLHKKNQMGRTVSENLSSLRNGCDEDICFSLQLGDNEAKRRRSGASTSA 600

Query: 601 EELK 604
           EE K
Sbjct: 601 EEPK 601

BLAST of Cp4.1LG06g05810 vs. TrEMBL
Match: V4TLL7_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10019372mg PE=4 SV=1)

HSP 1 Score: 1001.9 bits (2589), Expect = 3.3e-289
Identity = 501/604 (82.95%), Postives = 534/604 (88.41%), Query Frame = 1

Query: 1   MEQTRRHNRINCSGSTPSEESALNLERNGCSHSNLPSFSSPTLQPFASAGQHCESNAAYF 60
           M++TR + R  CSGSTPSEESAL+ ERN CSH NLPS S PTLQPFASAGQHCESNAAYF
Sbjct: 1   MDRTRLNIRARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYF 60

Query: 61  SWPTPIRISVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCY 120
           SWPT  R+S   EERANYFANLQKGVLP+ L  LPKGQ+A TLLELMTIRAFHSKILRCY
Sbjct: 61  SWPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCY 120

Query: 121 SLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
           SLGTAIGFRI++GVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS
Sbjct: 121 SLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180

Query: 181 YFGAPNPAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLT 240
           YFGAP P PKEQLYT+IVDDLRG DP IGSGSQVASQETYGTLGAIV+SQTGSRQVGFLT
Sbjct: 181 YFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 240

Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD 300
           NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITD+LWYGIFAGIN ETFVRAD
Sbjct: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDDLWYGIFAGINAETFVRAD 300

Query: 301 GSFIPFADDFDMSTVTTSVKGVGEIGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVL 360
           G+FIPFADDFDMSTVTTSVKG+GEIGDVK +DLQSPIS+LIGKQVVKVGRSSGLTTGTVL
Sbjct: 301 GAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVL 360

Query: 361 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRESLKPIGIIWGGTAN 420
           AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI++KGEN E  +PIGIIWGGTAN
Sbjct: 361 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTAN 420

Query: 421 RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGD 480
           RGRLKLK+GQPPENWTSGVDLGRLLNLLELDLIT+DEGLK AVQEQR  SAT IGS VGD
Sbjct: 421 RGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATAIGSTVGD 480

Query: 481 SSPPDTTLPREKSEEKFEPLGFQIQHMPTEVEPSSAKDQSSLLETEFHLEAGTNTAPSVE 540
           SSPPD    ++K+E+KFEPLG QIQH+P EVE  S +   SL+ETEFHLE G    PSVE
Sbjct: 481 SSPPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVE 540

Query: 541 HQFIPSLFSCSPSHQNSSLVHAVSQNLSLLRNDC-EDICVSLQLGDHEAKRQRLDGSVSM 600
            QFIPS    SP HQN+    A S+NL+ L N C EDIC SLQLGD+EAKR+R D S S 
Sbjct: 541 LQFIPSFTGHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRRSDASTSK 600

Query: 601 EELK 604
           EE K
Sbjct: 601 EESK 604

BLAST of Cp4.1LG06g05810 vs. TrEMBL
Match: A0A0B2R4C0_GLYSO (Uncharacterized protein OS=Glycine soja GN=glysoja_011410 PE=4 SV=1)

HSP 1 Score: 999.6 bits (2583), Expect = 1.7e-288
Identity = 498/602 (82.72%), Postives = 539/602 (89.53%), Query Frame = 1

Query: 1   MEQTRRHNRINCSGSTPSEESALNLERNGCSHSNLPSFSSPTLQPFASAGQHCESNAAYF 60
           ME+ R + R +CSGSTPSEESAL+LERN CSHSNLPS S PTLQPFASAGQHCES+AAYF
Sbjct: 1   MERARLNMRGHCSGSTPSEESALDLERNCCSHSNLPSLSPPTLQPFASAGQHCESSAAYF 60

Query: 61  SWPTPIRISVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCY 120
           SWP+  R++   EERANYF NLQKGVLP+ L  LPKG +A TLLELMTIRAFHSKILRCY
Sbjct: 61  SWPS--RLNDAAEERANYFLNLQKGVLPETLGRLPKGHQATTLLELMTIRAFHSKILRCY 120

Query: 121 SLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
           SLGTAIGFRIR+GVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS
Sbjct: 121 SLGTAIGFRIRRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180

Query: 181 YFGAPNPAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLT 240
           YFGAP P PKEQLYTEIVDDLRG DPCIGSGSQVASQETYGTLGAIV+SQTGSRQVGFLT
Sbjct: 181 YFGAPEPVPKEQLYTEIVDDLRGGDPCIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 240

Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD 300
           NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAG+NPETFVRAD
Sbjct: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGMNPETFVRAD 300

Query: 301 GSFIPFADDFDMSTVTTSVKGVGEIGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVL 360
           G+FIPFADDFDMSTVTTSV+GVG+IGDVK IDLQ+PIS+LIGKQVVKVGRSSGLTTG VL
Sbjct: 301 GAFIPFADDFDMSTVTTSVRGVGDIGDVKIIDLQAPISSLIGKQVVKVGRSSGLTTGVVL 360

Query: 361 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRESLKPIGIIWGGTAN 420
           AYALEYNDEKGICFLTD LVVGENQQTFDLEGDSGSLI+LKG+N E  +PIGIIWGGTAN
Sbjct: 361 AYALEYNDEKGICFLTDLLVVGENQQTFDLEGDSGSLIMLKGDNGEKPRPIGIIWGGTAN 420

Query: 421 RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGD 480
           RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLIT+DEGL+ AVQEQR VSATVIGS VGD
Sbjct: 421 RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITTDEGLQVAVQEQRAVSATVIGSTVGD 480

Query: 481 SSPPDTTLPREKSEEKFEPLGFQIQHMPTEVEPSSAKDQSSLLETEFHLEAGTNTAPSVE 540
           SSPPD  LP++K+E+K+EPLG QIQ +P  V PSS   + S++ETEF LE G N  PS+E
Sbjct: 481 SSPPDGVLPKDKAEDKYEPLGLQIQSIPLGVVPSSQDMKPSIMETEFKLEDGINVGPSIE 540

Query: 541 HQFIPSLFSCSPSHQNSSLVHAVSQNLSLLRNDC-EDICVSLQLGDHEAKRQRLDGSVSM 600
           HQFIPS    SP H+NS      ++NLS LRN+C ED+CVSLQLGD+EAKR+R + S S 
Sbjct: 541 HQFIPSFIGRSPLHKNSIQDRTATENLSSLRNNCDEDLCVSLQLGDNEAKRRRSEASTST 600

Query: 601 EE 602
           EE
Sbjct: 601 EE 600

BLAST of Cp4.1LG06g05810 vs. TrEMBL
Match: I1KU44_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_08G171900 PE=4 SV=1)

HSP 1 Score: 997.3 bits (2577), Expect = 8.2e-288
Identity = 498/602 (82.72%), Postives = 538/602 (89.37%), Query Frame = 1

Query: 1   MEQTRRHNRINCSGSTPSEESALNLERNGCSHSNLPSFSSPTLQPFASAGQHCESNAAYF 60
           ME+ R + R +CSGSTPSEESAL+LERN CSHSNLPS S PTLQPFASAGQHCES+AAYF
Sbjct: 1   MERARLNMRGHCSGSTPSEESALDLERNCCSHSNLPSLSPPTLQPFASAGQHCESSAAYF 60

Query: 61  SWPTPIRISVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCY 120
           SWP+  R++   EERANYF NLQKGVLP+ L  LPKG +A TLLELMTIRAFHSKILRCY
Sbjct: 61  SWPS--RLNDAAEERANYFLNLQKGVLPETLGRLPKGHQATTLLELMTIRAFHSKILRCY 120

Query: 121 SLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
           SLGTAIGFRIR+GVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS
Sbjct: 121 SLGTAIGFRIRRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180

Query: 181 YFGAPNPAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLT 240
           YFGAP P PKEQLYTEIVDDLRG DPCIGSGSQVASQETYGTLGAIV+SQTGSRQVGFLT
Sbjct: 181 YFGAPEPVPKEQLYTEIVDDLRGGDPCIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 240

Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD 300
           NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Sbjct: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD 300

Query: 301 GSFIPFADDFDMSTVTTSVKGVGEIGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVL 360
           G+FIPFADDFDMSTVTTSV+GVG+IGDVK IDLQ+PIS+LIGKQVVKVGRSSGLTTG VL
Sbjct: 301 GAFIPFADDFDMSTVTTSVRGVGDIGDVKIIDLQAPISSLIGKQVVKVGRSSGLTTGVVL 360

Query: 361 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRESLKPIGIIWGGTAN 420
           AYALEYNDEKGICFLTD LVVGENQQTFDLEGDSGSLI+LKG+  E  +PIGIIWGGTAN
Sbjct: 361 AYALEYNDEKGICFLTDLLVVGENQQTFDLEGDSGSLIMLKGDIGEKPRPIGIIWGGTAN 420

Query: 421 RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGD 480
           RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLIT+DEGL+ AVQEQR VSATVIGS VGD
Sbjct: 421 RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITTDEGLQVAVQEQRAVSATVIGSTVGD 480

Query: 481 SSPPDTTLPREKSEEKFEPLGFQIQHMPTEVEPSSAKDQSSLLETEFHLEAGTNTAPSVE 540
           SSPPD  LP++K+E+K+EPLG QIQ +P  V PSS   + S++ETEF LE G N  PS+E
Sbjct: 481 SSPPDGVLPKDKAEDKYEPLGLQIQSIPLGVVPSSQDMKPSIMETEFKLEDGINVGPSIE 540

Query: 541 HQFIPSLFSCSPSHQNSSLVHAVSQNLSLLRNDC-EDICVSLQLGDHEAKRQRLDGSVSM 600
           HQFIPS    SP H+NS      ++NLS LRN+C ED+CVSLQLGD+EAKR+R + S S 
Sbjct: 541 HQFIPSFIGRSPLHKNSIQDRTATENLSSLRNNCDEDLCVSLQLGDNEAKRRRSEASTST 600

Query: 601 EE 602
           EE
Sbjct: 601 EE 600

BLAST of Cp4.1LG06g05810 vs. TAIR10
Match: AT2G35155.1 (AT2G35155.1 Trypsin family protein)

HSP 1 Score: 757.3 bits (1954), Expect = 7.2e-219
Identity = 388/589 (65.87%), Postives = 456/589 (77.42%), Query Frame = 1

Query: 10  INCSGSTPSEESALNLERNG-CSHSNLPSFSSPT-LQPFASAGQHCESNAAYFSWPTPIR 69
           I  + S+ SE+SAL+LERN  C+H +LPS SSP+ LQPF    QH ESNA YFSWPT  R
Sbjct: 11  IQAAASSESEDSALDLERNHHCNHLSLPSSSSPSPLQPFTLNIQHAESNAPYFSWPTLSR 70

Query: 70  ISVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCYSLGTAIG 129
           ++   E+RANYF NLQKGVLP+ +  LP GQ+A TLLELMTIRAFHSKILR +SLGTA+G
Sbjct: 71  LNDTVEDRANYFGNLQKGVLPETVGRLPSGQQATTLLELMTIRAFHSKILRRFSLGTAVG 130

Query: 130 FRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNP 189
           FRI +GVLT++PAILVFV+RKVH+QWL+P+QCLP+ALEGPGGVWCDVDVVEF Y+GAP  
Sbjct: 131 FRISRGVLTNVPAILVFVARKVHRQWLNPMQCLPSALEGPGGVWCDVDVVEFQYYGAPAA 190

Query: 190 APKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVD 249
            PKEQ+Y E+VD LRGSDPCIGSGSQVASQETYGTLGAIV+S+TG+ QVGFLTNRHVAVD
Sbjct: 191 TPKEQVYNELVDGLRGSDPCIGSGSQVASQETYGTLGAIVKSRTGNHQVGFLTNRHVAVD 250

Query: 250 LDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGSFIPFA 309
           LDYP+QKMFHPLPP+LGPGVYLGAVERATSFITD+ WYGIFAG NPETFVRADG+FIPFA
Sbjct: 251 LDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDQWYGIFAGTNPETFVRADGAFIPFA 310

Query: 310 DDFDMSTVTTSVKGVGEIGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYN 369
           +DF+ S VTT +KG+GEIGDV  IDLQSPI +LIGKQVVKVGRSSG TTGT++AYALEYN
Sbjct: 311 EDFNTSNVTTLIKGIGEIGDVHVIDLQSPIDSLIGKQVVKVGRSSGYTTGTIMAYALEYN 370

Query: 370 DEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRESLKPIGIIWGGTANRGRLKLK 429
           DEKGICFLTDFLV+GENQQTFDLEGDSGSLI+L G N +  +P+GIIWGGTANRGRLKL 
Sbjct: 371 DEKGICFLTDFLVIGENQQTFDLEGDSGSLILLTGPNGQKPRPVGIIWGGTANRGRLKLI 430

Query: 430 VGQPPENWTSGVDLGRLLNLLELDLITSDEGLK--AAVQEQRTVSATVIGSIVGDSSPPD 489
            GQ PENWTSGVDLGRLL+LLELDLITS+  L+  AA +E+R  S T + S V  SSPPD
Sbjct: 431 AGQEPENWTSGVDLGRLLDLLELDLITSNHELEAAAAAREERNTSVTALDSTVSQSSPPD 490

Query: 490 TTLPREKSEEKFEPLGFQIQHMPTEVEPSSAKDQSSLLETEFHLEAGTNTAPSV-EHQFI 549
                +K +E FEP                       +  EFH+E        V EH FI
Sbjct: 491 PVPSGDKQDESFEP----------------------FIPPEFHIEEAIKPTLEVEEHIFI 550

Query: 550 -PSLFSCSPSHQNSSLVHAVSQNLSLLRNDCEDICVSLQLGDHEAKRQR 593
            P   + S S      +  +   ++L  +  E++ +SL LG+ + K+ +
Sbjct: 551 APISVNESTSAIKGQEIPKLDNLMALKNSSEEEVNISLHLGEPKLKKPK 577

BLAST of Cp4.1LG06g05810 vs. TAIR10
Match: AT5G45030.1 (AT5G45030.1 Trypsin family protein)

HSP 1 Score: 752.3 bits (1941), Expect = 2.3e-217
Identity = 406/622 (65.27%), Postives = 478/622 (76.85%), Query Frame = 1

Query: 1   MEQTRRHNRINCSGSTPSEESA-LNLERNGCSHSNLPSFSSPTLQPFASAGQHCESNAA- 60
           ME  R   R + S S+ S ESA L+L++N  +H  L S SSP LQPF S  QH E++AA 
Sbjct: 1   MEGKRLDLRFHHSTSSQSVESAALDLDKNVYNHIKLAS-SSP-LQPFPSGAQHPETSAAA 60

Query: 61  -YFSWPTPIRISVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAFHSKIL 120
            YFSWPT  R++   E+RANYFANLQKGVLP+    LP G++A TLLELM IRAFHSK L
Sbjct: 61  AYFSWPTSSRLNDSAEDRANYFANLQKGVLPESFDGLPTGKKATTLLELMMIRAFHSKNL 120

Query: 121 RCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVV 180
           R +SLGTAIGFRIR+GVLT+I AILVFV+RKVHKQWL+P+QCLPTALEGPGGVWCDVDVV
Sbjct: 121 RRFSLGTAIGFRIRRGVLTNIAAILVFVARKVHKQWLNPLQCLPTALEGPGGVWCDVDVV 180

Query: 181 EFSYFGAPNPAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVG 240
           EF Y+GAP   PKEQ+YTE+VDDLRGS   IGSGSQVASQETYGTLGAIV+S+TG RQVG
Sbjct: 181 EFQYYGAPAQTPKEQVYTELVDDLRGSGSSIGSGSQVASQETYGTLGAIVKSKTGIRQVG 240

Query: 241 FLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFV 300
           FLTNRHVAVDLDYP+QKMFHPLPP+LGPGVYLGAVERATSFITD+LWYGIFAG NPETFV
Sbjct: 241 FLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFV 300

Query: 301 RADGSFIPFADDFDMSTVTTSVKGVGEIGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTG 360
           RADG+FIPFA+DF+ + VTT+VKG+GEIGD+   DLQSP+++LIG++VVKVGRSSGLTTG
Sbjct: 301 RADGAFIPFAEDFNTNNVTTTVKGIGEIGDIHATDLQSPVNSLIGRKVVKVGRSSGLTTG 360

Query: 361 TVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKG--ENRESLKPIGIIW 420
           T++AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI+L    E  E  +P+GIIW
Sbjct: 361 TIMAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILLAAGDEKNEKPRPVGIIW 420

Query: 421 GGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRT-VSATVI 480
           GGTANRGRLKLKVG+ PENWTSGVDLGR+LNLLELDLITS+EGL+AAV EQR  +    +
Sbjct: 421 GGTANRGRLKLKVGEQPENWTSGVDLGRVLNLLELDLITSNEGLQAAVLEQRNGIMCAAV 480

Query: 481 GSIVGDSSPPDTTLPREKSEEKFEPLGFQIQHMPTEVEPSSAKDQSSLLETEFHLEAGTN 540
            S V +SSP    + R K+ E FEP+   +Q +  E       D +S +  EF +E    
Sbjct: 481 DSTVVESSPGVCNISRCKTGENFEPINLNVQQVLIE-------DDNSNIHPEFQIEDVLE 540

Query: 541 TAPSV-EHQFIPSLFSCSPSHQNSSLVH--------AVSQNLSLLRNDC--EDICVSLQL 600
           +   + EHQFIPS      S  N S +H          S+NLS L+     ++I  SLQL
Sbjct: 541 SVAVIEEHQFIPS------SSNNGSALHQKPNGPENLESKNLSSLKTSSSGDEIGFSLQL 600

Query: 601 GDHEAKRQRL----DGSVSMEE 602
           G+ + K+++     DGS   EE
Sbjct: 601 GESDTKKRKRTDSPDGSQEDEE 607

BLAST of Cp4.1LG06g05810 vs. TAIR10
Match: AT3G12950.1 (AT3G12950.1 Trypsin family protein)

HSP 1 Score: 734.2 bits (1894), Expect = 6.6e-212
Identity = 392/568 (69.01%), Postives = 445/568 (78.35%), Query Frame = 1

Query: 46  FASAGQHCESNAA-YFSWPTPIRISVGTEERANYFANLQKG------VLPDILHPLPKGQ 105
           + S GQHCE  AA YFSWPT  R+S   EERANYF+NLQK       V P+ +   PKGQ
Sbjct: 4   YGSTGQHCEFTAASYFSWPTSSRLSNAAEERANYFSNLQKEEDDDDEVSPEPVSTEPKGQ 63

Query: 106 RANTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQ 165
           RA TLLELMTIRAFHSK+LRCYSLGTAIGFRIR+GVLTDIPAI+VFVSRKVHKQWLSP+Q
Sbjct: 64  RATTLLELMTIRAFHSKMLRCYSLGTAIGFRIRRGVLTDIPAIIVFVSRKVHKQWLSPLQ 123

Query: 166 CLPTALEGPGGVWCDVDVVEFSYFGAPN--PAPKEQLYTEIVDDLRGSDPCIGSGSQVAS 225
           CLPTALEG GG+WCDVDVVEFSYFG P+  P PK+   T+IVD L+GSDP IGSGSQVAS
Sbjct: 124 CLPTALEGAGGIWCDVDVVEFSYFGEPDHQPTPKQTFTTDIVDHLQGSDPFIGSGSQVAS 183

Query: 226 QETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERAT 285
           QET GTLGAIVRSQTG RQVGF+TNRHVAV+LDYP+QKMFHPLPP LGPGVYLGAVERAT
Sbjct: 184 QETCGTLGAIVRSQTGGRQVGFVTNRHVAVNLDYPSQKMFHPLPPALGPGVYLGAVERAT 243

Query: 286 SFITDELWYGIFAGINPETFVRADGSFIPFADDFDMSTVTTSVK-GVGEIGDVKFIDLQS 345
           SFITD+LW+GIFAG NPETFVRADG+FIPFADD+D+S VTTSVK GVGEIG+VK I+LQS
Sbjct: 244 SFITDDLWFGIFAGTNPETFVRADGAFIPFADDYDLSRVTTSVKGGVGEIGEVKAIELQS 303

Query: 346 PISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQT-FDLEGDS 405
           P+ +L+GKQVVKVGRSSGLTTGTVLAYALEYNDE+G+CFLTDFLVVGEN ++ FDLEGDS
Sbjct: 304 PVGSLVGKQVVKVGRSSGLTTGTVLAYALEYNDERGVCFLTDFLVVGENHRSPFDLEGDS 363

Query: 406 GSLIILKGENRESLKPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLIT 465
           GSLI++KGE  E  +PIGIIWGGT +RGRLKLKVG+ PE+WT+GVDLGRLL  L+LDLIT
Sbjct: 364 GSLIVMKGE--EKARPIGIIWGGTGSRGRLKLKVGECPESWTTGVDLGRLLTHLQLDLIT 423

Query: 466 SDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPREK--SEEKFE-PLG-FQIQHMPTE 525
           +DEGLKAAVQEQR  S T + S+V DSSPP   L +EK   EEK E  LG  Q+QH+   
Sbjct: 424 TDEGLKAAVQEQRAASTTGMSSMVADSSPPYVNLKKEKRSPEEKLEASLGPLQVQHI--- 483

Query: 526 VEPSSAKDQSSLLETEFHLEAGTNTAPSVEHQFIPSLF-SCS----PSHQNSSLVHAVSQ 585
                  D    +ET+         APSVEHQF+P+    CS    P      LV     
Sbjct: 484 -------DLEERIETK-------GGAPSVEHQFMPTFSGQCSASAWPETAREDLV----- 543

Query: 586 NLSLLRNDCE-DICVSLQLGDHEAKRQR 593
                   C+ D+CV L+LGD  AKR+R
Sbjct: 544 -AGFTNGSCDGDLCVGLRLGDDGAKRRR 546

BLAST of Cp4.1LG06g05810 vs. NCBI nr
Match: gi|449453788|ref|XP_004144638.1| (PREDICTED: uncharacterized protein LOC101217211 [Cucumis sativus])

HSP 1 Score: 1131.3 bits (2925), Expect = 0.0e+00
Identity = 566/603 (93.86%), Postives = 582/603 (96.52%), Query Frame = 1

Query: 1   MEQTRRHNRINCSGSTPSEESALNLERNGCSHSNLPSFSSPTLQPFASAGQHCESNAAYF 60
           MEQTR + RINCSGSTPSEESAL+LERN CSHS+LPSFSSPTLQPFASAGQH   N AYF
Sbjct: 1   MEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYF 60

Query: 61  SWPTPIRISVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCY 120
           SWPTPIR+SVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCY
Sbjct: 61  SWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCY 120

Query: 121 SLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
           SLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS
Sbjct: 121 SLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180

Query: 181 YFGAPNPAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLT 240
           YFGAPNPAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTG RQVGFLT
Sbjct: 181 YFGAPNPAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLT 240

Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD 300
           NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Sbjct: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD 300

Query: 301 GSFIPFADDFDMSTVTTSVKGVGEIGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVL 360
           G+FIPFADDFDMSTVTTSVKGVG++GDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVL
Sbjct: 301 GAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVL 360

Query: 361 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRESLKPIGIIWGGTAN 420
           AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENR++L+PIGIIWGGTAN
Sbjct: 361 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRDTLQPIGIIWGGTAN 420

Query: 421 RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGD 480
           RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQ TVSATVIGSIVGD
Sbjct: 421 RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGD 480

Query: 481 SSPPDTTLPREKSEEKFEPLGFQIQHMPTEVEPSSAKDQSSLLETEFHLEAGTNTAPSVE 540
           SSPPDTTLP+EKSEEK E LGFQIQHMPTEVEP SAKD+  LLETEFHLE G N APSVE
Sbjct: 481 SSPPDTTLPKEKSEEKSEQLGFQIQHMPTEVEP-SAKDR-PLLETEFHLEPGMNRAPSVE 540

Query: 541 HQFIPSLFSCSPSHQNSSLVHAVSQNLSLLRNDCEDICVSLQLGDHEAKRQRLDGSVSME 600
           HQFIPSLFSCSPSHQNS+L  AVSQNLSLLR+DCED+CVSLQLGDHEAKR+R D SVSME
Sbjct: 541 HQFIPSLFSCSPSHQNSTLDRAVSQNLSLLRSDCEDLCVSLQLGDHEAKRRRSDASVSME 600

Query: 601 ELK 604
           ELK
Sbjct: 601 ELK 601

BLAST of Cp4.1LG06g05810 vs. NCBI nr
Match: gi|700199769|gb|KGN54927.1| (hypothetical protein Csa_4G607040 [Cucumis sativus])

HSP 1 Score: 1131.3 bits (2925), Expect = 0.0e+00
Identity = 566/603 (93.86%), Postives = 582/603 (96.52%), Query Frame = 1

Query: 1   MEQTRRHNRINCSGSTPSEESALNLERNGCSHSNLPSFSSPTLQPFASAGQHCESNAAYF 60
           MEQTR + RINCSGSTPSEESAL+LERN CSHS+LPSFSSPTLQPFASAGQH   N AYF
Sbjct: 104 MEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYF 163

Query: 61  SWPTPIRISVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCY 120
           SWPTPIR+SVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCY
Sbjct: 164 SWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCY 223

Query: 121 SLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
           SLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS
Sbjct: 224 SLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 283

Query: 181 YFGAPNPAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLT 240
           YFGAPNPAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTG RQVGFLT
Sbjct: 284 YFGAPNPAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLT 343

Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD 300
           NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Sbjct: 344 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD 403

Query: 301 GSFIPFADDFDMSTVTTSVKGVGEIGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVL 360
           G+FIPFADDFDMSTVTTSVKGVG++GDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVL
Sbjct: 404 GAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVL 463

Query: 361 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRESLKPIGIIWGGTAN 420
           AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENR++L+PIGIIWGGTAN
Sbjct: 464 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRDTLQPIGIIWGGTAN 523

Query: 421 RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGD 480
           RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQ TVSATVIGSIVGD
Sbjct: 524 RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGD 583

Query: 481 SSPPDTTLPREKSEEKFEPLGFQIQHMPTEVEPSSAKDQSSLLETEFHLEAGTNTAPSVE 540
           SSPPDTTLP+EKSEEK E LGFQIQHMPTEVEP SAKD+  LLETEFHLE G N APSVE
Sbjct: 584 SSPPDTTLPKEKSEEKSEQLGFQIQHMPTEVEP-SAKDR-PLLETEFHLEPGMNRAPSVE 643

Query: 541 HQFIPSLFSCSPSHQNSSLVHAVSQNLSLLRNDCEDICVSLQLGDHEAKRQRLDGSVSME 600
           HQFIPSLFSCSPSHQNS+L  AVSQNLSLLR+DCED+CVSLQLGDHEAKR+R D SVSME
Sbjct: 644 HQFIPSLFSCSPSHQNSTLDRAVSQNLSLLRSDCEDLCVSLQLGDHEAKRRRSDASVSME 703

Query: 601 ELK 604
           ELK
Sbjct: 704 ELK 704

BLAST of Cp4.1LG06g05810 vs. NCBI nr
Match: gi|659130946|ref|XP_008465434.1| (PREDICTED: uncharacterized protein LOC103503046 [Cucumis melo])

HSP 1 Score: 1129.0 bits (2919), Expect = 0.0e+00
Identity = 564/603 (93.53%), Postives = 579/603 (96.02%), Query Frame = 1

Query: 1   MEQTRRHNRINCSGSTPSEESALNLERNGCSHSNLPSFSSPTLQPFASAGQHCESNAAYF 60
           MEQTR + RINCSGS PSEESAL+LERN CSHS+LPSFSSPTLQPFASAGQH   N AYF
Sbjct: 1   MEQTRHNRRINCSGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYF 60

Query: 61  SWPTPIRISVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCY 120
           SWPTPIR+SVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCY
Sbjct: 61  SWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCY 120

Query: 121 SLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
           SLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS
Sbjct: 121 SLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180

Query: 181 YFGAPNPAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLT 240
           YFGAPNPAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTG RQVGFLT
Sbjct: 181 YFGAPNPAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLT 240

Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD 300
           NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Sbjct: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD 300

Query: 301 GSFIPFADDFDMSTVTTSVKGVGEIGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVL 360
           G+FIPFADDFDMSTVTTSVKGVG++GDVKFIDLQS ISTLIGKQVVKVGRSSGLTTGTVL
Sbjct: 301 GAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVL 360

Query: 361 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRESLKPIGIIWGGTAN 420
           AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRE+L+PIGIIWGGTAN
Sbjct: 361 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRETLQPIGIIWGGTAN 420

Query: 421 RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGD 480
           RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQ TVSATVIGSIVGD
Sbjct: 421 RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGD 480

Query: 481 SSPPDTTLPREKSEEKFEPLGFQIQHMPTEVEPSSAKDQSSLLETEFHLEAGTNTAPSVE 540
           SSPPDTTLP+EKSEEK EPLGFQIQHMPTEVEPS+AKD+  LLETEFHLE G N APSVE
Sbjct: 481 SSPPDTTLPKEKSEEKSEPLGFQIQHMPTEVEPSTAKDR-PLLETEFHLEPGMNRAPSVE 540

Query: 541 HQFIPSLFSCSPSHQNSSLVHAVSQNLSLLRNDCEDICVSLQLGDHEAKRQRLDGSVSME 600
           HQFIPSLFSCSP HQNS+L  AVSQNLS LR+DCED CVSLQLGDHEAKR+R D SVSME
Sbjct: 541 HQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDASVSME 600

Query: 601 ELK 604
           ELK
Sbjct: 601 ELK 602

BLAST of Cp4.1LG06g05810 vs. NCBI nr
Match: gi|595862387|ref|XP_007211386.1| (hypothetical protein PRUPE_ppa003117mg [Prunus persica])

HSP 1 Score: 1006.5 bits (2601), Expect = 1.9e-290
Identity = 504/604 (83.44%), Postives = 537/604 (88.91%), Query Frame = 1

Query: 1   MEQTRRHNRINCSGSTPSEESALNLERNGCSHSNLPSFSSPTLQPFASAGQHCESNAAYF 60
           ME+TR + R+ CSGSTPSEES L+LERN  SHSNLPS S PTLQP+ASAGQHCE++AAYF
Sbjct: 1   MERTRFNMRMRCSGSTPSEESVLDLERNCYSHSNLPSLSPPTLQPYASAGQHCETSAAYF 60

Query: 61  SWPTPIRISVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCY 120
           SWPT  R++   EERANYF NLQKGVLP+ L  LPKGQ+A TLLELMTIRAFHSKILRCY
Sbjct: 61  SWPTSSRLNDAAEERANYFTNLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCY 120

Query: 121 SLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
           SLGTAIGFRIR+GVLTDIPAILVFV+RKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS
Sbjct: 121 SLGTAIGFRIRRGVLTDIPAILVFVARKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180

Query: 181 YFGAPNPAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLT 240
           YFGAP PAPKEQLYTEIVDDLRG DPCIGSGSQVASQETYGTLGAIVRSQTG+RQVGFLT
Sbjct: 181 YFGAPEPAPKEQLYTEIVDDLRGGDPCIGSGSQVASQETYGTLGAIVRSQTGNRQVGFLT 240

Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD 300
           NRHVAVDLDYPNQKMFHPLPP+LGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Sbjct: 241 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD 300

Query: 301 GSFIPFADDFDMSTVTTSVKGVGEIGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVL 360
           G+FIPFADDFDM TV TSVKGVGEIG+VK IDLQSPISTLIGKQV+KVGRSSGLTTGTVL
Sbjct: 301 GAFIPFADDFDMCTVITSVKGVGEIGNVKIIDLQSPISTLIGKQVMKVGRSSGLTTGTVL 360

Query: 361 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRESLKPIGIIWGGTAN 420
           AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGEN E  +PIGIIWGGTAN
Sbjct: 361 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENGEKPRPIGIIWGGTAN 420

Query: 421 RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGD 480
           RGRLKLK+GQPPENWTSGVDLGRLL LLELDLIT+DEG+K AVQEQRT SAT IGS VGD
Sbjct: 421 RGRLKLKIGQPPENWTSGVDLGRLLKLLELDLITTDEGVKVAVQEQRTASATAIGSTVGD 480

Query: 481 SSPPDTTLPREKSEEKFEPLGFQIQHMPTEVEPSSAKDQSSLLETEFHLEAGTNTAPSVE 540
           SSPPD  LP+E+ EEKFE LG QIQH+P E EPSS+    SL+ETEFHLE G    PSVE
Sbjct: 481 SSPPDGMLPKERPEEKFESLGLQIQHIPLEAEPSSS---LSLVETEFHLEDGIKAVPSVE 540

Query: 541 HQFIPSLFSCSPSHQNSSLVHAVSQNLSLLRNDC-EDICVSLQLGDHEAKRQRLDGSVSM 600
           HQFIPS    SP H+ + +   VS+NLS LRN C EDIC SLQLGD+EAKR+R   S S 
Sbjct: 541 HQFIPSFLGGSPLHKKNQMGRTVSENLSSLRNGCDEDICFSLQLGDNEAKRRRSGASTSA 600

Query: 601 EELK 604
           EE K
Sbjct: 601 EEPK 601

BLAST of Cp4.1LG06g05810 vs. NCBI nr
Match: gi|1009130088|ref|XP_015882108.1| (PREDICTED: uncharacterized protein LOC107417965 isoform X1 [Ziziphus jujuba])

HSP 1 Score: 1004.6 bits (2596), Expect = 7.4e-290
Identity = 505/599 (84.31%), Postives = 532/599 (88.81%), Query Frame = 1

Query: 1   MEQTRRHNRINCSGSTPSEESALNLERNGCSHSNLPSFSSPTLQPFASAGQHCESNAAYF 60
           ME++R   R  CSGSTPSEESAL+LERNGCSHSN PS S P LQPFASAGQHCESNAAYF
Sbjct: 1   MERSRLILRFRCSGSTPSEESALDLERNGCSHSNFPSSSPPALQPFASAGQHCESNAAYF 60

Query: 61  SWPTPIRISVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCY 120
           SWPT  R+    EERANYFANLQKGVLP+ L+ LPKGQ+A TLLELMTIRAFHSKILRCY
Sbjct: 61  SWPTSSRLINAAEERANYFANLQKGVLPETLNRLPKGQQATTLLELMTIRAFHSKILRCY 120

Query: 121 SLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
           SLGTAIGFRIR+GVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS
Sbjct: 121 SLGTAIGFRIRRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180

Query: 181 YFGAPNPAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLT 240
           YFGAP PAPKEQLYTEIVDDLRG D CIGSGSQVASQETYGTLGAIVRSQTGS+QVGFLT
Sbjct: 181 YFGAPEPAPKEQLYTEIVDDLRGGDLCIGSGSQVASQETYGTLGAIVRSQTGSQQVGFLT 240

Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD 300
           NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Sbjct: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD 300

Query: 301 GSFIPFADDFDMSTVTTSVKGVGEIGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVL 360
           G+FIPFADDFDM TVTTSVKGVGEIGDVK IDLQSPIS+LIGKQV+KVGRSSGLT GTVL
Sbjct: 301 GAFIPFADDFDMPTVTTSVKGVGEIGDVKIIDLQSPISSLIGKQVMKVGRSSGLTNGTVL 360

Query: 361 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRESLKPIGIIWGGTAN 420
           AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKG N E  +PIGIIWGGTAN
Sbjct: 361 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGVNGEKPRPIGIIWGGTAN 420

Query: 421 RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGD 480
           RGRLKLK+GQPPENWTSGVDLGRLLNLLELDLIT++EGL+ AVQEQR  SAT IGS VGD
Sbjct: 421 RGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTEEGLRVAVQEQRAASATAIGSTVGD 480

Query: 481 SSPPDTTLPREKSEEKFEPLGFQIQHMPTEVEPSSAKDQSSLLETEFHLEAGTNTAPSVE 540
           SSPPD   P+EK+ EKFEP+G QIQH+P EV+P S       +ETEFHLE G   APSVE
Sbjct: 481 SSPPDGIHPKEKT-EKFEPMGLQIQHIPLEVQPGSPAANPLSMETEFHLEDGIKVAPSVE 540

Query: 541 HQFIPSLFSCSPSHQNSSLVHAVSQNLSLLRNDC-EDICVSLQLGDHEAKRQRLDGSVS 599
           HQFIPS    SP HQ +     VS+NLSLLRN C EDICVSLQLGD+EAKR+R D S S
Sbjct: 541 HQFIPSFPRRSPLHQTNMKERLVSENLSLLRNGCDEDICVSLQLGDNEAKRRRSDASTS 598

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0L2V0_CUCSA0.0e+0093.86Uncharacterized protein OS=Cucumis sativus GN=Csa_4G607040 PE=4 SV=1[more]
M5X0K9_PRUPE1.4e-29083.44Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003117mg PE=4 SV=1[more]
V4TLL7_9ROSI3.3e-28982.95Uncharacterized protein OS=Citrus clementina GN=CICLE_v10019372mg PE=4 SV=1[more]
A0A0B2R4C0_GLYSO1.7e-28882.72Uncharacterized protein OS=Glycine soja GN=glysoja_011410 PE=4 SV=1[more]
I1KU44_SOYBN8.2e-28882.72Uncharacterized protein OS=Glycine max GN=GLYMA_08G171900 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G35155.17.2e-21965.87 Trypsin family protein[more]
AT5G45030.12.3e-21765.27 Trypsin family protein[more]
AT3G12950.16.6e-21269.01 Trypsin family protein[more]
Match NameE-valueIdentityDescription
gi|449453788|ref|XP_004144638.1|0.0e+0093.86PREDICTED: uncharacterized protein LOC101217211 [Cucumis sativus][more]
gi|700199769|gb|KGN54927.1|0.0e+0093.86hypothetical protein Csa_4G607040 [Cucumis sativus][more]
gi|659130946|ref|XP_008465434.1|0.0e+0093.53PREDICTED: uncharacterized protein LOC103503046 [Cucumis melo][more]
gi|595862387|ref|XP_007211386.1|1.9e-29083.44hypothetical protein PRUPE_ppa003117mg [Prunus persica][more]
gi|1009130088|ref|XP_015882108.1|7.4e-29084.31PREDICTED: uncharacterized protein LOC107417965 isoform X1 [Ziziphus jujuba][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR009003Peptidase_S1_PA
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG06g05810.1Cp4.1LG06g05810.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR009003Peptidase S1, PA clanunknownSSF50494Trypsin-like serine proteasescoord: 214..425
score: 3.85
NoneNo IPR availableGENE3DG3DSA:2.40.10.10coord: 329..429
score: 3.
NoneNo IPR availablePANTHERPTHR31521FAMILY NOT NAMEDcoord: 2..603
score:
NoneNo IPR availablePANTHERPTHR31521:SF3SUBFAMILY NOT NAMEDcoord: 2..603
score: