MS012500 (gene) Bitter gourd (TR) v1

Overview
Name: MS012500
Type: gene
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: glutelin type-A 2-like
Location: scaffold63: 1121111 .. 1124447 (+)
RNA-Seq Expression: MS012500
Synteny: MS012500
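
The Location value in the overview can be unpacked programmatically. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the page does not state this); `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re


def parse_location(text):
    """Split a 'scaffold: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "scaffold": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so span = end - start + 1.
        "span": end - start + 1,
    }


loc = parse_location("scaffold63: 1121111 .. 1124447 (+)")
print(loc["scaffold"], loc["strand"], loc["span"])
```

Under the inclusive-coordinate assumption, the span is the length of the gene sequence including its introns.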
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGCTGAATTTGGAGCCAATGAATCCGAAACCCTTCTTTGAGGGAGAGGGAGGATCGTTTCACAAATGGTTCCCTTCTGATTTTCCGATGATCGCTCAGACCAAAGTCGCCGCCGGCAGGCTTCTCCTCCACCCTCGCGGTTTCGCCATTCCTCACAACTCCGATTCCTCCAAAGTTGGCTATGTTCTTCAAGGTATCATAAAGCTTTTTTTGCTCATTCATTGTAATTTCTTCCTGTTTTATTGAAATAATTTTGGGTTATTAATACTTTTGCTAGCTGAATTCCCCATAACTCCGATTATTTTCTTTCCATTTTATAGTTTTTGTTGTTGGCTTAGATTTTGTTTGATAACTCAAATTAGGCTTGTTTTTTACGATTTGGTTAGTTAATTTTTATTTGAATTTTCAGCTAAAATAAAAAAAAAAAACCAAAAACAACCTTTTTTTAAATTATTTTTTCCTCAAATTTTGGCCTAAATTTTGAACCAAACAAAGAAATACATAGGGGAAGAAAGTGATGTTTCTAATTAGCTTAGATTTTAGAAATAGAATAAAAAAAACGAATTGATTATCGAATGGTTTAAATTCTATTTTGATTCGTGTTGATTTTATGTCTACTTCTTAATCTTTTATATATTAGTTTCAAAAGTTCAGGAAGAAAATAGTTATTTTTACGGGCTAATTAAAATCAAAGTTTTGATGATTATGCGTACTATATAAAATGTTTTGTTTCTTTTTAAGTTTAAAGAATCAAACTAGACATTTTAAAATTGGAAGAACTAATATAAAAACATATAATGTATATATATTATGTTGGAGGACGACCAAATCGTGATTTTAACAAAACATTTTTTGGTCTTTTGCTTTTTAATTTTTAGAAATTATATAATTGAAACATAACATAAATAATATATATTTTTTTTCCATTTACTTAGTCAATCTATACAATAGACCAATACTTTAAGTCACGTGTATTCAAGTGTCAATTTTTTAATATGGATTTAGATTTTTGTTCTATTATTAATGGAAAAATATATAACTAATTTTTATTTAAAATTTAGAAATTAAAATAAAATATTTTAAAAATTAAGGGTTTAAAACGAAATATTTTAAATGTTGATGATGAAAATAAAATATGTATAAAACTACATGAACTAATATAAGATTCGATTTAAAAGTAATATATTTGATATTTTAATGGGATCTAATTAAGGGATTTGGTTGGACTTTTTCAGTTGCTAACTTTTGTAAGAAAATAATTCAAGTCTTACACTCTTACTTTTCTTCTTTCTAATTAAATGACATTTCTACCTGTCGATAAAAAAAAAACAAAATGATGTTTCTACATTAAAAAAAAAAATGTCATTACCTGTCATGAATAACCCTTTTTTTTATGAAGGATATTGTGATGTCGAAAAAACGTCACAAAATCTTTTCGTGATTTCAAGGGATTGTTATCACCTCTTACCTAGTTTGTTCATATTATTTTAATAAATAATTAGATTGTCAACATTAACTATACCATAATTTTAGAAAATTAATTATATTTAAATTTAATTTTTCAAAATCATTCATTTTATTGTTACTTTCAAACCTATCGAAAACCTGCAGGCAATGGAATTGCCGGACTTTTATTTCCGGGCAAGTCCGGCGAATTTGTGGTGAAACTAGAGAAAGGAGACCTAATTCCGGTACCGGAAGGCGTCACCTCCTGGTGGTTCAACCCCGCCAATGACGACTCCGATTTCGAGATCCTCCTCATCGGCGACTCCTCACACGCCCTCATCCCCGGCGACGTCACATACGTCGTATTCGCGGGACGTCTCGGAATCCTCCAGAGTTTCCCGCCGGAGTACGTCGCCGGATCTTACTCCCTAAACGAAGAAGAATCCGCCGCTCTTCTCAGAAGCCAATCCAACGGCCTGATCTTCAAGCTCCGGCCGGACCAAACCCTACCCGAACCGGACGAAAGCAGCGGTCTGGTTTTTAACATATAC
GACGCCGTTCCGGACGCCCGATTGGAGGCCGGCGGGTCAGTGACGGCGGTGACAGAGGATAAATTTCCGTTCATTGGGAAGTCTGGGCTGACGGCGGTGCTCGAGAAGCTTGAGGCTAACGCCGTTAGGGCACCGGTCTACGTGGCGGACCCGTCGGTGCAACTTATTTATGTGAGTCGCGGGTCGGGTCGGATTCGGGTGGGCGGGTTTTTGGGGAAAATGGATTCGGAGGTGAAAGCGGGGCAGCTGGTTTTGGTTCCAAAGTACTTCGCCGTCGGGAAGGTCGCCGGCGATGAAGGAATGGAGTGCTTCTCCATTATCACAACTACACAGTAAGATAAATTTTTTATTTTTGTCTTTTATTTTTGTTTATTTATTTATTTTTCATAATCTTGACTTGGAGTTAAAAGAATGTTTTAGAAAAATTTTATAAAGTTTATAGGAAATGAAAGCCTTATACCTAACAAACTTGTGTTTTGAATGGACAATATAAACTTAATTATTTCCTGTTTTTTGAAATTTAGATTATATTTTATGGCTTTTTTGGTAAGTGAAAAACTACCACAAGGAAAATGTTGGTAATAGATTACAAACAAATTTCATTAATTTTTCAAACAAACAATCATTATCAAGCAAATTAAAGGTTTGGTTTTTCTAAAATTAAGCTAAATGTAAACTACAAAATAATTGTGGGATAAACTAAACCTAATTTTTAAAAACAAAAAAAAAAAACAATAAAGTTAAACAACCAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTAATTTTAAATGTTTTATTCTTATTCTTATATATATTTTTTAGGCTTATTTTAAATTTGGTCCAATGATTGATTTTTATAGTTGGGTCAAATCCGATCACAAATTAACCCTCATAGGTGGTTTATTGATCCAAATGGATTAGGGTTAATTAAGAAGAAATATGTTCAAACCTCATCTTGATCACCTACGAATAAAATTTAAAAAAAAAATTATCATAAGTTGTTTGAGTCATTACCATATTGAATATGTCTACTTGCATAGAAACGACCGTTGATGTTTTGTATTACTATCGATTATATATATATTGTTGTTATATATTTATTTATTTATTTTGATGAGAATGGTTAGTGAAAGCTGGGCTTGGTTTGTTAACAAATTGCAGCCCTCTAATAGAAGAATTGGGAGGGAAAGATTCAATTTTTGGGAGTTTATCAGCACAAGTTTTTCAAGTTTCATTCAATGTCACAGCTGAGTTTGAGAAGCTTCTCAGGTCAAAGATAACAAAAGCCTCACCCCTGGTTCCTCCCTCAAATCAT

mRNA sequence

ATGGAGCTGAATTTGGAGCCAATGAATCCGAAACCCTTCTTTGAGGGAGAGGGAGGATCGTTTCACAAATGGTTCCCTTCTGATTTTCCGATGATCGCTCAGACCAAAGTCGCCGCCGGCAGGCTTCTCCTCCACCCTCGCGGTTTCGCCATTCCTCACAACTCCGATTCCTCCAAAGTTGGCTATGTTCTTCAAGGCAATGGAATTGCCGGACTTTTATTTCCGGGCAAGTCCGGCGAATTTGTGGTGAAACTAGAGAAAGGAGACCTAATTCCGGTACCGGAAGGCGTCACCTCCTGGTGGTTCAACCCCGCCAATGACGACTCCGATTTCGAGATCCTCCTCATCGGCGACTCCTCACACGCCCTCATCCCCGGCGACGTCACATACGTCGTATTCGCGGGACGTCTCGGAATCCTCCAGAGTTTCCCGCCGGAGTACGTCGCCGGATCTTACTCCCTAAACGAAGAAGAATCCGCCGCTCTTCTCAGAAGCCAATCCAACGGCCTGATCTTCAAGCTCCGGCCGGACCAAACCCTACCCGAACCGGACGAAAGCAGCGGTCTGGTTTTTAACATATACGACGCCGTTCCGGACGCCCGATTGGAGGCCGGCGGGTCAGTGACGGCGGTGACAGAGGATAAATTTCCGTTCATTGGGAAGTCTGGGCTGACGGCGGTGCTCGAGAAGCTTGAGGCTAACGCCGTTAGGGCACCGGTCTACGTGGCGGACCCGTCGGTGCAACTTATTTATGTGAGTCGCGGGTCGGGTCGGATTCGGGTGGGCGGGTTTTTGGGGAAAATGGATTCGGAGGTGAAAGCGGGGCAGCTGGTTTTGGTTCCAAAGTACTTCGCCGTCGGGAAGGTCGCCGGCGATGAAGGAATGGAGTGCTTCTCCATTATCACAACTACACACCCTCTAATAGAAGAATTGGGAGGGAAAGATTCAATTTTTGGGAGTTTATCAGCACAAGTTTTTCAAGTTTCATTCAATGTCACAGCTGAGTTTGAGAAGCTTCTCAGGTCAAAGATAACAAAAGCCTCACCCCTGGTTCCTCCCTCAAATCAT

Coding sequence (CDS)

ATGGAGCTGAATTTGGAGCCAATGAATCCGAAACCCTTCTTTGAGGGAGAGGGAGGATCGTTTCACAAATGGTTCCCTTCTGATTTTCCGATGATCGCTCAGACCAAAGTCGCCGCCGGCAGGCTTCTCCTCCACCCTCGCGGTTTCGCCATTCCTCACAACTCCGATTCCTCCAAAGTTGGCTATGTTCTTCAAGGCAATGGAATTGCCGGACTTTTATTTCCGGGCAAGTCCGGCGAATTTGTGGTGAAACTAGAGAAAGGAGACCTAATTCCGGTACCGGAAGGCGTCACCTCCTGGTGGTTCAACCCCGCCAATGACGACTCCGATTTCGAGATCCTCCTCATCGGCGACTCCTCACACGCCCTCATCCCCGGCGACGTCACATACGTCGTATTCGCGGGACGTCTCGGAATCCTCCAGAGTTTCCCGCCGGAGTACGTCGCCGGATCTTACTCCCTAAACGAAGAAGAATCCGCCGCTCTTCTCAGAAGCCAATCCAACGGCCTGATCTTCAAGCTCCGGCCGGACCAAACCCTACCCGAACCGGACGAAAGCAGCGGTCTGGTTTTTAACATATACGACGCCGTTCCGGACGCCCGATTGGAGGCCGGCGGGTCAGTGACGGCGGTGACAGAGGATAAATTTCCGTTCATTGGGAAGTCTGGGCTGACGGCGGTGCTCGAGAAGCTTGAGGCTAACGCCGTTAGGGCACCGGTCTACGTGGCGGACCCGTCGGTGCAACTTATTTATGTGAGTCGCGGGTCGGGTCGGATTCGGGTGGGCGGGTTTTTGGGGAAAATGGATTCGGAGGTGAAAGCGGGGCAGCTGGTTTTGGTTCCAAAGTACTTCGCCGTCGGGAAGGTCGCCGGCGATGAAGGAATGGAGTGCTTCTCCATTATCACAACTACACACCCTCTAATAGAAGAATTGGGAGGGAAAGATTCAATTTTTGGGAGTTTATCAGCACAAGTTTTTCAAGTTTCATTCAATGTCACAGCTGAGTTTGAGAAGCTTCTCAGGTCAAAGATAACAAAAGCCTCACCCCTGGTTCCTCCCTCAAATCAT

Protein sequence

MELNLEPMNPKPFFEGEGGSFHKWFPSDFPMIAQTKVAAGRLLLHPRGFAIPHNSDSSKVGYVLQGNGIAGLLFPGKSGEFVVKLEKGDLIPVPEGVTSWWFNPANDDSDFEILLIGDSSHALIPGDVTYVVFAGRLGILQSFPPEYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEPDESSGLVFNIYDAVPDARLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGKMDSEVKAGQLVLVPKYFAVGKVAGDEGMECFSIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSNH
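
The relationship between the coding sequence and the protein sequence can be checked with a short script. This is a minimal sketch, assuming the standard nuclear genetic code applies; `translate` and `CODON_TABLE` are illustrative names, and only the first 36 bases of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0; codons with ambiguous bases become 'X'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First 12 codons of the CDS listed above.
cds_prefix = "ATGGAGCTGAATTTGGAGCCAATGAATCCGAAACCC"
print(translate(cds_prefix))
```

The printed peptide should match the first 12 residues of the protein sequence above (MELNLEPMNPKP). Passing the full CDS to `translate` would allow the same comparison over the whole protein.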
Homology
BLAST of MS012500 vs. NCBI nr
Match: XP_038879635.1 (legumin type B-like [Benincasa hispida])

HSP 1 Score: 553.1 bits (1424), Expect = 1.7e-153
Identity = 273/356 (76.69%), Positives = 313/356 (87.92%), Query Frame = 0

Query: 1   MELNLEPMNPKPFFEGEGGSFHKWFPSDFPMIAQTKVAAGRLLLHPRGFAIPHNSDSSKV 60
           M+LNL+PM+P  FF+GEGGSFHKWFPSDFP+IAQTKV AGRLLLHPRGFA+PHNSDSSKV
Sbjct: 1   MDLNLKPMDPTNFFKGEGGSFHKWFPSDFPIIAQTKVGAGRLLLHPRGFAVPHNSDSSKV 60

Query: 61  GYVLQGNGIAGLLFPGKSGEFVVKLEKGDLIPVPEGVTSWWFNPANDDSDFEILLIGDSS 120
           GYVLQG+G+AG+LFP KS E VV+L+KGDLIPVPEGVTSWWFN  + DSDFE+LL+GD+ 
Sbjct: 61  GYVLQGSGVAGILFPCKSEEAVVRLKKGDLIPVPEGVTSWWFN--DGDSDFEVLLVGDTR 120

Query: 121 HALIPGDVTYVVFAGRLGILQSFPPEYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTL 180
           +ALIPGD+TYVVFAG LGILQ F  +Y+   Y LNEEE   LL+SQ+NGLIFKL+ DQTL
Sbjct: 121 NALIPGDITYVVFAGPLGILQGFSSDYIQKVYDLNEEERDILLKSQTNGLIFKLQDDQTL 180

Query: 181 PEPDESSGLVFNIYDAVPDARLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPV 240
           PEP+  S LVFNIY A+PDA ++ GGSVT VT++KFPFIGKSGLTAVLEKLEANAVR+PV
Sbjct: 181 PEPNRHSHLVFNIYHALPDAVVKGGGSVTVVTDEKFPFIGKSGLTAVLEKLEANAVRSPV 240

Query: 241 YVADPSVQLIYVSRGSGRIR-VGGFLGK-MDSEVKAGQLVLVPKYFAVGKVAGDEGMECF 300
           YVADPSVQLIY++ GSGR++ V  FL K +D+EVKAGQLVLVPKYFAVGKVAG+EG+ECF
Sbjct: 241 YVADPSVQLIYIASGSGRVQIVETFLRKNIDAEVKAGQLVLVPKYFAVGKVAGEEGLECF 300

Query: 301 SIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPS 355
           +IITTTHPL+EELGG  SIFG+ S QVFQ SFNVTA FEKLLRSKITK S LVPPS
Sbjct: 301 TIITTTHPLLEELGGDTSIFGTFSPQVFQASFNVTARFEKLLRSKITKTSSLVPPS 354
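
The percentages in each HSP summary line are the match counts divided by the alignment length, which includes gap columns and can therefore differ from the number of query residues covered. The sketch below parses one summary line and recomputes the percentages; `parse_hsp_stats` is a hypothetical helper, and the regular expression is an assumption about this page's text layout rather than a general BLAST parser.

```python
import re

# Tolerates spelling variants of the "Positives" label.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w*tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(matches, length):
    """Percentage of alignment columns, rounded to two decimals."""
    return round(100 * matches / length, 2)


def parse_hsp_stats(line):
    """Extract identity and positive counts from an HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return {
        "identity": percent(int(ident), int(ident_len)),
        "positives": percent(int(pos), int(pos_len)),
        "alignment_length": int(ident_len),
    }


stats = parse_hsp_stats(
    "Identity = 273/356 (76.69%), Positives = 313/356 (87.92%), Query Frame = 0"
)
print(stats)
```

For the first NCBI nr hit, the recomputed values agree with the reported 76.69% identity and 87.92% positives over 356 alignment columns.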

BLAST of MS012500 vs. NCBI nr
Match: XP_008456077.1 (PREDICTED: glutelin type-B 5 [Cucumis melo] >TYJ99756.1 glutelin type-B 5 [Cucumis melo var. makuwa])

HSP 1 Score: 545.8 bits (1405), Expect = 2.7e-151
Identity = 266/358 (74.30%), Positives = 312/358 (87.15%), Query Frame = 0

Query: 1   MELNLEPMNPKPFFEGEGGSFHKWFPSDFPMIAQTKVAAGRLLLHPRGFAIPHNSDSSKV 60
           MEL+L+PM+P  FF GEGGSFHKWFPSD P+I QTKV AGRLLLHPRGFA+PHNSDSSKV
Sbjct: 1   MELDLKPMDPTNFFTGEGGSFHKWFPSDHPIIPQTKVGAGRLLLHPRGFAVPHNSDSSKV 60

Query: 61  GYVLQGNGIAGLLFPGKSGEFVVKLEKGDLIPVPEGVTSWWFNPANDDSDFEILLIGDSS 120
           GYVLQG+G+AG++FP KS E VV+L+KGD+IPVPEGVTSWWFN  + DSDFE+LL+GD+ 
Sbjct: 61  GYVLQGSGVAGIVFPCKSEEAVVRLKKGDVIPVPEGVTSWWFN--DGDSDFEVLLVGDTR 120

Query: 121 HALIPGDVTYVVFAGRLGILQSFPPEYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTL 180
           +ALIPGD+TYVVFAG LG+LQ F  +Y+   Y L EEE   LL+SQ NGLIFKL+ DQTL
Sbjct: 121 NALIPGDITYVVFAGPLGVLQGFSSDYIEKVYDLTEEEREVLLKSQPNGLIFKLKDDQTL 180

Query: 181 PEPDESSGLVFNIYDAVPDARLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPV 240
           PEPD  S LVFNIYDA PD+ ++ GG+VT +TE+KFPFIGKSGLTAVLEKLEANAVR+PV
Sbjct: 181 PEPDCHSDLVFNIYDAAPDSVVKGGGTVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPV 240

Query: 241 YVADPSVQLIYVSRGSGRIRVG-GFLGK-MDSEVKAGQLVLVPKYFAVGKVAGDEGMECF 300
           YVADPSVQLIYV+ GSGRI++   F+ K +D+EVKAGQL+LVPKYFAVGK+AG+EG+ECF
Sbjct: 241 YVADPSVQLIYVASGSGRIQIAETFMRKQIDAEVKAGQLILVPKYFAVGKMAGEEGLECF 300

Query: 301 SIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSNH 357
           +IITTTHPL+EELGGK SIFG+ S QVFQ SFNVTA FEKLL SKITK+SPLVPPS++
Sbjct: 301 TIITTTHPLLEELGGKSSIFGAFSPQVFQASFNVTAHFEKLLISKITKSSPLVPPSDN 356

BLAST of MS012500 vs. NCBI nr
Match: XP_011651276.2 (legumin J [Cucumis sativus] >KGN57581.1 hypothetical protein Csa_011641 [Cucumis sativus])

HSP 1 Score: 544.3 bits (1401), Expect = 8.0e-151
Identity = 262/357 (73.39%), Positives = 308/357 (86.27%), Query Frame = 0

Query: 1   MELNLEPMNPKPFFEGEGGSFHKWFPSDFPMIAQTKVAAGRLLLHPRGFAIPHNSDSSKV 60
           MELNL+PM+P  FF GEGGSFHKWFPSDFP+I+QTKV AGRLLLHPRGFA+PHNSDSSKV
Sbjct: 1   MELNLKPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKV 60

Query: 61  GYVLQGNGIAGLLFPGKSGEFVVKLEKGDLIPVPEGVTSWWFNPANDDSDFEILLIGDSS 120
           GYVLQG+G+AG++FP KS E  V+L+KGD+IPVPEGVTSWWFN  + DSDFE+LL+GD+ 
Sbjct: 61  GYVLQGSGVAGIIFPCKSEEAAVRLKKGDVIPVPEGVTSWWFN--DGDSDFEVLLVGDTR 120

Query: 121 HALIPGDVTYVVFAGRLGILQSFPPEYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTL 180
           +ALIPGD+TYVVFAG LG+LQ F  +Y+   Y L E+E   LL+SQ NGLIFKL+ DQTL
Sbjct: 121 NALIPGDITYVVFAGPLGVLQGFSSDYIEKVYDLTEKEREVLLKSQPNGLIFKLKDDQTL 180

Query: 181 PEPDESSGLVFNIYDAVPDARLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPV 240
           PEPD  S LVFNIY   PDA ++ GGSVT +TE+KFPFIGKSGLTAVLEKLEANAVR+PV
Sbjct: 181 PEPDCHSDLVFNIYHTAPDAVVKGGGSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPV 240

Query: 241 YVADPSVQLIYVSRGSGRIRVGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGMECF 300
           YVADPSVQLIYV+ GSGR+++     +  +D+EVKAGQLVLVPKYFAVGK+AG+EG+ECF
Sbjct: 241 YVADPSVQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECF 300

Query: 301 SIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN 356
           +IITTTHPL+EELGGK SIFG+ S QVF+ SFN+TA FEKL RSKITK+SPLVPPS+
Sbjct: 301 TIITTTHPLLEELGGKTSIFGAFSPQVFEASFNLTAHFEKLFRSKITKSSPLVPPSD 355

BLAST of MS012500 vs. NCBI nr
Match: KAA0039049.1 (glutelin type-B 5 [Cucumis melo var. makuwa])

HSP 1 Score: 542.0 bits (1395), Expect = 3.9e-150
Identity = 265/358 (74.02%), Positives = 311/358 (86.87%), Query Frame = 0

Query: 1   MELNLEPMNPKPFFEGEGGSFHKWFPSDFPMIAQTKVAAGRLLLHPRGFAIPHNSDSSKV 60
           MEL+L+PM+P  FF GEGGSFHKWFPSD  +I QTKV AGRLLLHPRGFA+PHNSDSSKV
Sbjct: 1   MELDLKPMDPTNFFTGEGGSFHKWFPSDHLIIPQTKVGAGRLLLHPRGFAVPHNSDSSKV 60

Query: 61  GYVLQGNGIAGLLFPGKSGEFVVKLEKGDLIPVPEGVTSWWFNPANDDSDFEILLIGDSS 120
           GYVLQG+G+AG++FP KS E VV+L+KGD+IPVPEGVTSWWFN  + DSDFE+LL+GD+ 
Sbjct: 61  GYVLQGSGVAGIVFPCKSEEAVVRLKKGDVIPVPEGVTSWWFN--DGDSDFEVLLVGDTR 120

Query: 121 HALIPGDVTYVVFAGRLGILQSFPPEYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTL 180
           +ALIPGD+TYVVFAG LG+LQ F  +Y+   Y L EEE   LL+SQ NGLIFKL+ DQTL
Sbjct: 121 NALIPGDITYVVFAGPLGVLQGFSSDYIEKVYDLTEEEREVLLKSQPNGLIFKLKDDQTL 180

Query: 181 PEPDESSGLVFNIYDAVPDARLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPV 240
           PEPD  S LVFNIYDA PD+ ++ GG+VT +TE+KFPFIGKSGLTAVLEKLEANAVR+PV
Sbjct: 181 PEPDCHSDLVFNIYDAAPDSVVKGGGTVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPV 240

Query: 241 YVADPSVQLIYVSRGSGRIRVG-GFLGK-MDSEVKAGQLVLVPKYFAVGKVAGDEGMECF 300
           YVADPSVQLIYV+ GSGRI++   F+ K +D+EVKAGQL+LVPKYFAVGK+AG+EG+ECF
Sbjct: 241 YVADPSVQLIYVASGSGRIQIAETFMRKQIDAEVKAGQLILVPKYFAVGKMAGEEGLECF 300

Query: 301 SIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSNH 357
           +IITTTHPL+EELGGK SIFG+ S QVFQ SFNVTA FEKLL SKITK+SPLVPPS++
Sbjct: 301 TIITTTHPLLEELGGKSSIFGAFSPQVFQASFNVTAHFEKLLISKITKSSPLVPPSDN 356

BLAST of MS012500 vs. NCBI nr
Match: KAG7014994.1 (12S seed storage protein CRD, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 492.3 bits (1266), Expect = 3.6e-135
Identity = 237/324 (73.15%), Positives = 275/324 (84.88%), Query Frame = 0

Query: 1   MELNLEPMNPKPFFEGEGGSFHKWFPSDFPMIAQTKVAAGRLLLHPRGFAIPHNSDSSKV 60
           MELNLEPM+PK FF GEGGSFHKW PSDFPMIA TKV AGRLLL PRGFA+PHNSDSSKV
Sbjct: 1   MELNLEPMSPKAFFHGEGGSFHKWLPSDFPMIAHTKVGAGRLLLRPRGFALPHNSDSSKV 60

Query: 61  GYVLQGNGIAGLLFPGKSGEFVVKLEKGDLIPVPEGVTSWWFNPANDDSDFEILLIGDSS 120
           GYVLQG G+AG+LFPG S E VV+L+KGDLIPVPEGVTSWWFN  + DSDFE+LL+GD+ 
Sbjct: 61  GYVLQGIGLAGILFPGSSDEAVVRLKKGDLIPVPEGVTSWWFN--DGDSDFEVLLVGDTR 120

Query: 121 HALIPGDVTYVVFAGRLGILQSFPPEYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTL 180
           +ALIPGD+TYVVF G LG+LQ F P+YV   Y+LN EE+ ALL+SQ+NGLIFKLR DQ +
Sbjct: 121 NALIPGDITYVVFTGPLGVLQGFSPDYVQKVYNLNGEETDALLKSQTNGLIFKLRQDQMM 180

Query: 181 PEPDESSGLVFNIYDAVPDARLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPV 240
           PEP+    LVFNIYD V  +R E  GSVT VTE +FPFIGKSGLTAVLEKLEAN  R+PV
Sbjct: 181 PEPNRHGDLVFNIYDVV--SRDEGNGSVTVVTEKEFPFIGKSGLTAVLEKLEANTARSPV 240

Query: 241 YVADPSVQLIYVSRGSGRIRVGGFLGKMDSEVKAGQLVLVPKYFAVGKVAGDEGMECFSI 300
           YVADPSVQL+Y++ GSGR+++ GFLGK+D+ VKAGQLVLVPKYFA GK+AG+EG+ECF+I
Sbjct: 241 YVADPSVQLVYIAIGSGRVQIVGFLGKIDTVVKAGQLVLVPKYFAAGKIAGEEGLECFTI 300

Query: 301 ITTTHPLIEELGGKDSIFGSLSAQ 325
           IT+T P +EELGGK SI G+ S Q
Sbjct: 301 ITSTSPKLEELGGKTSILGTFSPQ 320

BLAST of MS012500 vs. ExPASy Swiss-Prot
Match: A0A222NNM9 (Cocosin 1 OS=Cocos nucifera OX=13894 GN=COS-1 PE=1 SV=1)

HSP 1 Score: 105.5 bits (262), Expect = 1.2e-21
Identity = 97/407 (23.83%), Positives = 173/407 (42.51%), Query Frame = 0

Query: 5   LEPMNPKPFFEGEGGSFHKWFPSDFPMIAQTKVAAGRLLLHPRGFAIPHNSDSSKVGYVL 64
           L  + P      E G    +F  D        V+  R ++ PRG  +P  S++ ++ Y++
Sbjct: 50  LNALEPTRTVRSEAG-VTDYFDEDNEQFRCAGVSTIRRVIEPRGLLLPSMSNAPRLVYIV 109

Query: 65  QGNGIAGLLFPGKSGEF-----------------------VVKLEKGDLIPVPEGVTSWW 124
           QG GI GL+ PG    F                       V + ++GD++ VP G   W 
Sbjct: 110 QGRGIVGLVMPGCPETFQSFQRSEREEGERHRWSRDEHQKVYQFQEGDVLAVPNGFAYWC 169

Query: 125 FNPANDDSDFEILLIGDSSHALIPGDVTYVVF--AGRL---------------GILQSFP 184
           +N  N ++    + + D+S+     D ++  F  AGR                 IL+ F 
Sbjct: 170 YN--NGENPVVAITVLDTSNDANQLDRSHRQFLLAGRQEQGRQRYGREGSIKENILRGFS 229

Query: 185 PEYVAGSYSLNEEESAAL-LRSQSNGLIFK-------LRPDQTLPEPDESSGLVFNIYDA 244
            E +A ++ +N E +  L  R  + G I +       LRP   + E +   G   N ++ 
Sbjct: 230 TELLAAAFGVNMELARKLQCRDDTRGEIVRAENGLQVLRP-SGMEEEEREEGRSINGFEE 289

Query: 245 V---------------PDARLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVY 304
                            D     GG +T +  +K P +    ++A    L  NA+ +P +
Sbjct: 290 TYCSMKIKQNIGDPRRADVFNPRGGRITTLNSEKLPILRFIQMSAERVVLYRNAMVSPHW 349

Query: 305 VADPSVQLIYVSRGSGRIRVGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGMECFS 347
             + +  ++Y + G GR+ V    G+   D E++ GQL++VP+ FA+ + AG EG +  S
Sbjct: 350 NIN-AHSIMYCTGGRGRVEVADDRGETVFDGELRQGQLLIVPQNFAMLERAGSEGFQLVS 409

BLAST of MS012500 vs. ExPASy Swiss-Prot
Match: O23880 (13S globulin seed storage protein 2 OS=Fagopyrum esculentum OX=3617 GN=FA18 PE=2 SV=1)

HSP 1 Score: 98.6 bits (244), Expect = 1.5e-19
Identity = 97/424 (22.88%), Positives = 162/424 (38.21%), Query Frame = 0

Query: 5   LEPMNPKPFFEGEGGSFHKWFPSDFPMIAQTKVAAGRLLLHPRGFAIPHNSDSSKVGYVL 64
           L    P      E G    W   D P    T   A R+++ P G  +P  S++  + +V 
Sbjct: 51  LTASEPSRRVRSEAGVTEIW-DHDTPEFRCTGFVAVRVVIQPGGLLLPSYSNAPYITFVE 110

Query: 65  QGNGIAGLLFPG------KSGEF----------------------------VVKLEKGDL 124
           QG G+ G++ PG         EF                            + ++ +GD+
Sbjct: 111 QGRGVQGVVIPGCPETFQSDSEFEYPQSQRGRHSRQSESEEESSRGDQHQKIFRIREGDV 170

Query: 125 IPVPEGVTSWWFNPANDDSDFEILLIGDSSHALIPGDVTYVVFAGR-------------- 184
           IP P GV  W  N  NDD     LL  +S H  +  +V     AG+              
Sbjct: 171 IPSPAGVVQWTHNDGNDDLISVTLLDANSYHKQLDENVRSFFLAGQSQRETREEGSDRQS 230

Query: 185 -----------LGILQSFPPEYVAGSYSLNEEESAALLRSQSN--GLI-----FKLRPDQ 244
                        IL  F  E +   +   + E+ + LR +++  G I      KLR  Q
Sbjct: 231 RESDDDEALLGANILSGFQDEILHELFRDVDRETISKLRGENDQRGFIVQAQDLKLRVPQ 290

Query: 245 TLPEPDE-----------SSGLVFNIYDAVPDARLE--------------AGGSVTAVTE 304
              E  E            SG    +     + +                  G +  V  
Sbjct: 291 DFEEEYERERGDRRRGQGGSGRSNGVEQGFCNLKFRRNFNTPTNTYVFNPRAGRINTVNS 350

Query: 305 DKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGK--MDSE 336
           +  P +    L+A    L  NA+  P +  + +   +YV+RG GR++V G  GK   D +
Sbjct: 351 NSLPILEFLQLSAQHVVLYKNAIIGPRWNLN-AHSALYVTRGEGRVQVVGDEGKSVFDDK 410

BLAST of MS012500 vs. ExPASy Swiss-Prot
Match: Q8GZP6 (11S globulin seed storage protein Ana o 2.0101 (Fragment) OS=Anacardium occidentale OX=171929 PE=1 SV=1)

HSP 1 Score: 96.7 bits (239), Expect = 5.8e-19
Identity = 95/403 (23.57%), Positives = 155/403 (38.46%), Query Frame = 0

Query: 5   LEPMNPKPFFEGEGGSFHKWFPSDFPMIAQTKVAAGRLLLHPRGFAIPHNSDSSKVGYVL 64
           L+ + P    E E G+   W P +        VA  R  + P G  +P  S++ ++ YV+
Sbjct: 30  LDALEPDNRVEYEAGTVEAWDP-NHEQFRCAGVALVRHTIQPNGLLLPQYSNAPQLIYVV 89

Query: 65  QGNGIAGLLFP---------------GKSGEF------VVKLEKGDLIPVPEGVTSWWFN 124
           QG G+ G+ +P               G+SG F      + +  +GD+I +P GV  W +N
Sbjct: 90  QGEGMTGISYPGCPETYQAPQQGRQQGQSGRFQDRHQKIRRFRRGDIIAIPAGVAHWCYN 149

Query: 125 PANDDSDFEILLIGDSSHALIPGDVTYVVFAGR---------------LGILQSFPPEYV 184
             N       LL   +S   +         AG                  +   F  E +
Sbjct: 150 EGNSPVVTVTLLDVSNSQNQLDRTPRKFHLAGNPKDVFQQQQQHQSRGRNLFSGFDTELL 209

Query: 185 AGSYSLNEEESAALLRSQSNGLIFKLRPDQTL------------PEPDESS--------- 244
           A ++ ++E     L    + G I K++ D+               E +E S         
Sbjct: 210 AEAFQVDERLIKQLKSEDNRGGIVKVKDDELRVIRPSRSQSERGSESEEESEDEKRRWGQ 269

Query: 245 ------------GLVFNIYD-AVPDARLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEAN 304
                        L  NI D A  D      G +T +     P +    L+     L  N
Sbjct: 270 RDNGIEETICTMRLKENINDPARADIYTPEVGRLTTLNSLNLPILKWLQLSVEKGVLYKN 329

Query: 305 AVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGD 336
           A+  P +  + S  +IY  +G G+++V    G    D EV+ GQ+++VP+ FAV K A +
Sbjct: 330 ALVLPHWNLN-SHSIIYGCKGKGQVQVVDNFGNRVFDGEVREGQMLVVPQNFAVVKRARE 389

BLAST of MS012500 vs. ExPASy Swiss-Prot
Match: P04405 (Glycinin G2 OS=Glycine max OX=3847 GN=GY2 PE=1 SV=2)

HSP 1 Score: 96.3 bits (238), Expect = 7.5e-19
Identity = 106/449 (23.61%), Positives = 169/449 (37.64%), Query Frame = 0

Query: 5   LEPMNPKPFFEGEGGSFHKWFPSDFPMIAQTKVAAGRLLLHPRGFAIPHNSDSSKVGYVL 64
           L  + P    E EGG    W P++ P      VA  R  L+      P  ++  +  Y+ 
Sbjct: 33  LNALKPDNRIESEGGFIETWNPNNKPFQC-AGVALSRCTLNRNALRRPSYTNGPQEIYIQ 92

Query: 65  QGNGIAGLLFPGKSGEF---------------------VVKLEKGDLIPVPEGVTSWWFN 124
           QGNGI G++FPG    +                     V +  +GDLI VP GV  W +N
Sbjct: 93  QGNGIFGMIFPGCPSTYQEPQESQQRGRSQRPQDRHQKVHRFREGDLIAVPTGVAWWMYN 152

Query: 125 PANDDSDFEILLIGDSSHALIPGDVTYVVF--AGR------------------------- 184
             N+D+    + I D++      D     F  AG                          
Sbjct: 153 --NEDTPVVAVSIIDTNSLENQLDQMPRRFYLAGNQEQEFLKYQQQQQGGSQSQKGKQQE 212

Query: 185 -----LGILQSFPPEYVAGSYSL-----------NEEESAALLRSQSNGLIFKL----RP 244
                  IL  F PE++  ++ +           NEEE +  + +   GL        +P
Sbjct: 213 EENEGSNILSGFAPEFLKEAFGVNMQIVRNLQGENEEEDSGAIVTVKGGLRVTAPAMRKP 272

Query: 245 DQTLPEPDES--------------------SGLVFNI----------YDAVPDARLEAGG 304
            Q   + DE                     +G+   I           ++ PD      G
Sbjct: 273 QQEEDDDDEEEQPQCVETDKGCQRQSKRSRNGIDETICTMRLRQNIGQNSSPDIYNPQAG 332

Query: 305 SVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLG 354
           S+T  T   FP +    L+A    L  NA+  P Y  + +  +IY   G   ++V    G
Sbjct: 333 SITTATSLDFPALWLLKLSAQYGSLRKNAMFVPHYTLNAN-SIIYALNGRALVQVVNCNG 392

BLAST of MS012500 vs. ExPASy Swiss-Prot
Match: P11828 (Glycinin G3 OS=Glycine max OX=3847 GN=GY3 PE=1 SV=1)

HSP 1 Score: 95.5 bits (236), Expect = 1.3e-18
Identity = 106/442 (23.98%), Positives = 166/442 (37.56%), Query Frame = 0

Query: 5   LEPMNPKPFFEGEGGSFHKWFPSDFPMIAQTKVAAGRLLLHPRGFAIPHNSDSSKVGYVL 64
           L  + P    E EGG    W P++ P      VA  R  L+      P  +++ +  Y+ 
Sbjct: 36  LNALKPDNRIESEGGFIETWNPNNKPFQC-AGVALSRCTLNRNALRRPSYTNAPQEIYIQ 95

Query: 65  QGNGIAGLLFPGKSGEF------------------VVKLEKGDLIPVPEGVTSWWFNPAN 124
           QG+GI G++FPG    F                  +    +GDLI VP G   W +N  N
Sbjct: 96  QGSGIFGMIFPGCPSTFEEPQQKGQSSRPQDRHQKIYHFREGDLIAVPTGFAYWMYN--N 155

Query: 125 DDSDFEILLIGDSSHALIPGDVTYVVF--AGRL--------------------------- 184
           +D+    + + D++      D     F  AG                             
Sbjct: 156 EDTPVVAVSLIDTNSFQNQLDQMPRRFYLAGNQEQEFLQYQPQKQQGGTQSQKGKRQQEE 215

Query: 185 -----GILQSFPPEYVAGSYSL-----------NEEESAALLRSQSNGLIFKLRP---DQ 244
                 IL  F PE++  ++ +           NEEE    + +   GL     P    Q
Sbjct: 216 ENEGGSILSGFAPEFLEHAFVVDRQIVRKLQGENEEEEKGAIVTVKGGLSVISPPTEEQQ 275

Query: 245 TLPEPDE------------------------SSGLVFNI-YDAVPDARLEAGGSVTAVTE 304
             PE +E                        +  L  NI   + PD      GS+T  T 
Sbjct: 276 QRPEEEEKPDCDEKDKHCQSQSRNGIDETICTMRLRHNIGQTSSPDIFNPQAGSITTATS 335

Query: 305 DKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGK--MDSE 354
             FP +    L+A    L  NA+  P Y  + +  +IY   G   ++V    G+   D E
Sbjct: 336 LDFPALSWLKLSAQFGSLRKNAMFVPHYNLNAN-SIIYALNGRALVQVVNCNGERVFDGE 395

BLAST of MS012500 vs. ExPASy TrEMBL
Match: A0A5D3BKT3 (Glutelin type-B 5 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold252G00340 PE=4 SV=1)

HSP 1 Score: 545.8 bits (1405), Expect = 1.3e-151
Identity = 266/358 (74.30%), Positives = 312/358 (87.15%), Query Frame = 0

Query: 1   MELNLEPMNPKPFFEGEGGSFHKWFPSDFPMIAQTKVAAGRLLLHPRGFAIPHNSDSSKV 60
           MEL+L+PM+P  FF GEGGSFHKWFPSD P+I QTKV AGRLLLHPRGFA+PHNSDSSKV
Sbjct: 1   MELDLKPMDPTNFFTGEGGSFHKWFPSDHPIIPQTKVGAGRLLLHPRGFAVPHNSDSSKV 60

Query: 61  GYVLQGNGIAGLLFPGKSGEFVVKLEKGDLIPVPEGVTSWWFNPANDDSDFEILLIGDSS 120
           GYVLQG+G+AG++FP KS E VV+L+KGD+IPVPEGVTSWWFN  + DSDFE+LL+GD+ 
Sbjct: 61  GYVLQGSGVAGIVFPCKSEEAVVRLKKGDVIPVPEGVTSWWFN--DGDSDFEVLLVGDTR 120

Query: 121 HALIPGDVTYVVFAGRLGILQSFPPEYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTL 180
           +ALIPGD+TYVVFAG LG+LQ F  +Y+   Y L EEE   LL+SQ NGLIFKL+ DQTL
Sbjct: 121 NALIPGDITYVVFAGPLGVLQGFSSDYIEKVYDLTEEEREVLLKSQPNGLIFKLKDDQTL 180

Query: 181 PEPDESSGLVFNIYDAVPDARLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPV 240
           PEPD  S LVFNIYDA PD+ ++ GG+VT +TE+KFPFIGKSGLTAVLEKLEANAVR+PV
Sbjct: 181 PEPDCHSDLVFNIYDAAPDSVVKGGGTVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPV 240

Query: 241 YVADPSVQLIYVSRGSGRIRVG-GFLGK-MDSEVKAGQLVLVPKYFAVGKVAGDEGMECF 300
           YVADPSVQLIYV+ GSGRI++   F+ K +D+EVKAGQL+LVPKYFAVGK+AG+EG+ECF
Sbjct: 241 YVADPSVQLIYVASGSGRIQIAETFMRKQIDAEVKAGQLILVPKYFAVGKMAGEEGLECF 300

Query: 301 SIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSNH 357
           +IITTTHPL+EELGGK SIFG+ S QVFQ SFNVTA FEKLL SKITK+SPLVPPS++
Sbjct: 301 TIITTTHPLLEELGGKSSIFGAFSPQVFQASFNVTAHFEKLLISKITKSSPLVPPSDN 356

BLAST of MS012500 vs. ExPASy TrEMBL
Match: A0A1S3C332 (glutelin type-B 5 OS=Cucumis melo OX=3656 GN=LOC103496120 PE=4 SV=1)

HSP 1 Score: 545.8 bits (1405), Expect = 1.3e-151
Identity = 266/358 (74.30%), Positives = 312/358 (87.15%), Query Frame = 0

Query: 1   MELNLEPMNPKPFFEGEGGSFHKWFPSDFPMIAQTKVAAGRLLLHPRGFAIPHNSDSSKV 60
           MEL+L+PM+P  FF GEGGSFHKWFPSD P+I QTKV AGRLLLHPRGFA+PHNSDSSKV
Sbjct: 1   MELDLKPMDPTNFFTGEGGSFHKWFPSDHPIIPQTKVGAGRLLLHPRGFAVPHNSDSSKV 60

Query: 61  GYVLQGNGIAGLLFPGKSGEFVVKLEKGDLIPVPEGVTSWWFNPANDDSDFEILLIGDSS 120
           GYVLQG+G+AG++FP KS E VV+L+KGD+IPVPEGVTSWWFN  + DSDFE+LL+GD+ 
Sbjct: 61  GYVLQGSGVAGIVFPCKSEEAVVRLKKGDVIPVPEGVTSWWFN--DGDSDFEVLLVGDTR 120

Query: 121 HALIPGDVTYVVFAGRLGILQSFPPEYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTL 180
           +ALIPGD+TYVVFAG LG+LQ F  +Y+   Y L EEE   LL+SQ NGLIFKL+ DQTL
Sbjct: 121 NALIPGDITYVVFAGPLGVLQGFSSDYIEKVYDLTEEEREVLLKSQPNGLIFKLKDDQTL 180

Query: 181 PEPDESSGLVFNIYDAVPDARLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPV 240
           PEPD  S LVFNIYDA PD+ ++ GG+VT +TE+KFPFIGKSGLTAVLEKLEANAVR+PV
Sbjct: 181 PEPDCHSDLVFNIYDAAPDSVVKGGGTVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPV 240

Query: 241 YVADPSVQLIYVSRGSGRIRVG-GFLGK-MDSEVKAGQLVLVPKYFAVGKVAGDEGMECF 300
           YVADPSVQLIYV+ GSGRI++   F+ K +D+EVKAGQL+LVPKYFAVGK+AG+EG+ECF
Sbjct: 241 YVADPSVQLIYVASGSGRIQIAETFMRKQIDAEVKAGQLILVPKYFAVGKMAGEEGLECF 300

Query: 301 SIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSNH 357
           +IITTTHPL+EELGGK SIFG+ S QVFQ SFNVTA FEKLL SKITK+SPLVPPS++
Sbjct: 301 TIITTTHPLLEELGGKSSIFGAFSPQVFQASFNVTAHFEKLLISKITKSSPLVPPSDN 356

BLAST of MS012500 vs. ExPASy TrEMBL
Match: A0A0A0LC21 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G218170 PE=4 SV=1)

HSP 1 Score: 544.3 bits (1401), Expect = 3.9e-151
Identity = 262/357 (73.39%), Positives = 308/357 (86.27%), Query Frame = 0

Query: 1   MELNLEPMNPKPFFEGEGGSFHKWFPSDFPMIAQTKVAAGRLLLHPRGFAIPHNSDSSKV 60
           MELNL+PM+P  FF GEGGSFHKWFPSDFP+I+QTKV AGRLLLHPRGFA+PHNSDSSKV
Sbjct: 1   MELNLKPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKV 60

Query: 61  GYVLQGNGIAGLLFPGKSGEFVVKLEKGDLIPVPEGVTSWWFNPANDDSDFEILLIGDSS 120
           GYVLQG+G+AG++FP KS E  V+L+KGD+IPVPEGVTSWWFN  + DSDFE+LL+GD+ 
Sbjct: 61  GYVLQGSGVAGIIFPCKSEEAAVRLKKGDVIPVPEGVTSWWFN--DGDSDFEVLLVGDTR 120

Query: 121 HALIPGDVTYVVFAGRLGILQSFPPEYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTL 180
           +ALIPGD+TYVVFAG LG+LQ F  +Y+   Y L E+E   LL+SQ NGLIFKL+ DQTL
Sbjct: 121 NALIPGDITYVVFAGPLGVLQGFSSDYIEKVYDLTEKEREVLLKSQPNGLIFKLKDDQTL 180

Query: 181 PEPDESSGLVFNIYDAVPDARLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPV 240
           PEPD  S LVFNIY   PDA ++ GGSVT +TE+KFPFIGKSGLTAVLEKLEANAVR+PV
Sbjct: 181 PEPDCHSDLVFNIYHTAPDAVVKGGGSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPV 240

Query: 241 YVADPSVQLIYVSRGSGRIRVGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGMECF 300
           YVADPSVQLIYV+ GSGR+++     +  +D+EVKAGQLVLVPKYFAVGK+AG+EG+ECF
Sbjct: 241 YVADPSVQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECF 300

Query: 301 SIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN 356
           +IITTTHPL+EELGGK SIFG+ S QVF+ SFN+TA FEKL RSKITK+SPLVPPS+
Sbjct: 301 TIITTTHPLLEELGGKTSIFGAFSPQVFEASFNLTAHFEKLFRSKITKSSPLVPPSD 355

BLAST of MS012500 vs. ExPASy TrEMBL
Match: A0A5A7TCP0 (Glutelin type-B 5 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold84G001410 PE=4 SV=1)

HSP 1 Score: 542.0 bits (1395), Expect = 1.9e-150
Identity = 265/358 (74.02%), Positives = 311/358 (86.87%), Query Frame = 0

Query: 1   MELNLEPMNPKPFFEGEGGSFHKWFPSDFPMIAQTKVAAGRLLLHPRGFAIPHNSDSSKV 60
           MEL+L+PM+P  FF GEGGSFHKWFPSD  +I QTKV AGRLLLHPRGFA+PHNSDSSKV
Sbjct: 1   MELDLKPMDPTNFFTGEGGSFHKWFPSDHLIIPQTKVGAGRLLLHPRGFAVPHNSDSSKV 60

Query: 61  GYVLQGNGIAGLLFPGKSGEFVVKLEKGDLIPVPEGVTSWWFNPANDDSDFEILLIGDSS 120
           GYVLQG+G+AG++FP KS E VV+L+KGD+IPVPEGVTSWWFN  + DSDFE+LL+GD+ 
Sbjct: 61  GYVLQGSGVAGIVFPCKSEEAVVRLKKGDVIPVPEGVTSWWFN--DGDSDFEVLLVGDTR 120

Query: 121 HALIPGDVTYVVFAGRLGILQSFPPEYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTL 180
           +ALIPGD+TYVVFAG LG+LQ F  +Y+   Y L EEE   LL+SQ NGLIFKL+ DQTL
Sbjct: 121 NALIPGDITYVVFAGPLGVLQGFSSDYIEKVYDLTEEEREVLLKSQPNGLIFKLKDDQTL 180

Query: 181 PEPDESSGLVFNIYDAVPDARLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPV 240
           PEPD  S LVFNIYDA PD+ ++ GG+VT +TE+KFPFIGKSGLTAVLEKLEANAVR+PV
Sbjct: 181 PEPDCHSDLVFNIYDAAPDSVVKGGGTVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPV 240

Query: 241 YVADPSVQLIYVSRGSGRIRVG-GFLGK-MDSEVKAGQLVLVPKYFAVGKVAGDEGMECF 300
           YVADPSVQLIYV+ GSGRI++   F+ K +D+EVKAGQL+LVPKYFAVGK+AG+EG+ECF
Sbjct: 241 YVADPSVQLIYVASGSGRIQIAETFMRKQIDAEVKAGQLILVPKYFAVGKMAGEEGLECF 300

Query: 301 SIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSNH 357
           +IITTTHPL+EELGGK SIFG+ S QVFQ SFNVTA FEKLL SKITK+SPLVPPS++
Sbjct: 301 TIITTTHPLLEELGGKSSIFGAFSPQVFQASFNVTAHFEKLLISKITKSSPLVPPSDN 356

BLAST of MS012500 vs. ExPASy TrEMBL
Match: A0A6J1JDB2 (12S seed storage protein CRD-like OS=Cucurbita maxima OX=3661 GN=LOC111483370 PE=4 SV=1)

HSP 1 Score: 466.5 bits (1199), Expect = 1.0e-127
Identity = 223/352 (63.35%), Positives = 283/352 (80.40%), Query Frame = 0

Query: 6   EPMNPKPFFEGEGGSFHKWFPSDFPMIAQTKVAAGRLLLHPRGFAIPHNSDSSKVGYVLQ 65
           +PMNPKPF E E GS+HKW PS++P++A  KVAAGRLLL PRGF +PH +D SKVGYVLQ
Sbjct: 3   QPMNPKPFTEVEAGSYHKWLPSEYPLLAHNKVAAGRLLLRPRGFVVPHYADCSKVGYVLQ 62

Query: 66  G-NGIAGLLFPGKSGEFVVKLEKGDLIPVPEGVTSWWFNPANDDSDFEILLIGDSSHALI 125
           G NG+AGL+FP KS E VV L+KGDLIPVP GV+SWWFN  + DSD EI+ +G+S +A +
Sbjct: 63  GENGVAGLVFPSKSDEVVVNLKKGDLIPVPNGVSSWWFN--DGDSDLEIIFLGESKNAHV 122

Query: 126 PGDVTYVVFAGRLGILQSFPPEYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTLPEPD 185
           PGD++Y V +G L +L  F PEYV  +YSLN EE+   L+SQSN LIF ++  Q+LP+P 
Sbjct: 123 PGDISYFVLSGILSLLNGFSPEYVGETYSLNGEETTQFLKSQSNALIFSIQQTQSLPKPP 182

Query: 186 ESSGLVFNIYDAVPDARLEAG-GSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVA 245
           + S  V+NI  A PD R++ G G+VT VTE KFPFIG+SGLTA+LEKL+ANAVR+PVYVA
Sbjct: 183 KYSKFVYNIDAAAPDGRVKGGAGAVTTVTESKFPFIGQSGLTAILEKLDANAVRSPVYVA 242

Query: 246 DPSVQLIYVSRGSGRIRVGGFLGKMDSEVKAGQLVLVPKYFAVGKVAGDEGMECFSIITT 305
           +P  QLIYV++G G+I++ GF  K+D+EVK GQL+LVPK+FAVGK+AG++G+EC SIIT 
Sbjct: 243 EPYDQLIYVAKGRGKIQIVGFSSKIDAEVKMGQLILVPKFFAVGKIAGEDGLECISIITA 302

Query: 306 THPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN 356
           THP++EEL GK S+  +LS +VFQVSFNVTAEFEKLLRSKIT ASP++  S+
Sbjct: 303 THPVVEELAGKTSVLEALSPEVFQVSFNVTAEFEKLLRSKITNASPVIRSSD 352

BLAST of MS012500 vs. TAIR 10
Match: AT1G07750.1 (RmlC-like cupins superfamily protein)

HSP 1 Score: 266.2 bits (679), Expect = 3.9e-71
Identity = 139/359 (38.72%), Positives = 208/359 (57.94%), Query Frame = 0

Query: 1   MELNLEPMNPKPFFEGEGGSFHKWFPSDFPMIAQTKVAAGRLLLHPRGFAIPHNSDSSKV 60
           MEL+L P  PK  + G+GGS+  W P + PM+ Q  + A +L L   GFA+P  SDSSKV
Sbjct: 1   MELDLTPKLPKKVYGGDGGSYSAWCPEELPMLKQGNIGAAKLALEKNGFAVPRYSDSSKV 60

Query: 61  GYVLQGNGIAGLLFPGKSGEFVVKLEKGDLIPVPEGVTSWWFNPANDDSDFEILLIGDSS 120
            YVLQG+G AG++ P K  E V+ +++GD I +P GV +WWFN  N+D +  IL +G++ 
Sbjct: 61  AYVLQGSGTAGIVLPEKE-EKVIAIKQGDSIALPFGVVTWWFN--NEDPELVILFLGETH 120

Query: 121 HALIPGDVTYVVFAGRLGILQSFPPEYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTL 180
                G  T     G  GI   F  E+V  ++ L+E     L+ SQ+   I KL     +
Sbjct: 121 KGHKAGQFTEFYLTGTNGIFTGFSTEFVGRAWDLDENTVKKLVGSQTGNGIVKLDAGFKM 180

Query: 181 PEPDES--SGLVFNIYDAVPDARLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRA 240
           P+P E   +G V N  +A  D  ++ GG V  +     P +G+ G  A L +++A+++ +
Sbjct: 181 PQPKEENRAGFVLNCLEAPLDVDIKDGGRVVVLNTKNLPLVGEVGFGADLVRIDAHSMCS 240

Query: 241 PVYVADPSVQLIYVSRGSGRIRVGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGME 300
           P +  D ++Q+ Y+  GSGR++V G  GK  +++ +KAG L +VP++F V K+A  +GM 
Sbjct: 241 PGFSCDSALQVTYIVGGSGRVQVVGGDGKRVLETHIKAGSLFIVPRFFVVSKIADADGMS 300

Query: 301 CFSIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN 356
            FSI+TT  P+   L G  S++ SLS +V Q +F V  E EK  RS  T ++   PPSN
Sbjct: 301 WFSIVTTPDPIFTHLAGNTSVWKSLSPEVLQAAFKVAPEVEKSFRSTRTSSAIFFPPSN 356

BLAST of MS012500 vs. TAIR 10
Match: AT2G28680.1 (RmlC-like cupins superfamily protein)

HSP 1 Score: 253.4 bits (646), Expect = 2.6e-67
Identity = 132/359 (36.77%), Positives = 203/359 (56.55%), Query Frame = 0

Query: 1   MELNLEPMNPKPFFEGEGGSFHKWFPSDFPMIAQTKVAAGRLLLHPRGFAIPHNSDSSKV 60
           MEL+L P  PK  + G+GGS+  W P + PM+    + A +L L   G A+P  SDS KV
Sbjct: 1   MELDLSPRLPKKVYGGDGGSYFAWCPEELPMLRDGNIGASKLALEKYGLALPRYSDSPKV 60

Query: 61  GYVLQGNGIAGLLFPGKSGEFVVKLEKGDLIPVPEGVTSWWFNPANDDSDFEILLIGDSS 120
            YVLQG G AG++ P K  E V+ ++KGD I +P GV +WWFN  N+D++  +L +G++ 
Sbjct: 61  AYVLQGAGTAGIVLPEKE-EKVIAIKKGDSIALPFGVVTWWFN--NEDTELVVLFLGETH 120

Query: 121 HALIPGDVTYVVFAGRLGILQSFPPEYVAGSYSLNEEESAALLRSQSNGLIFKLRPDQTL 180
                G  T     G  GI   F  E+V  ++ L+E     L+ SQ+   I K+     +
Sbjct: 121 KGHKAGQFTDFYLTGSNGIFTGFSTEFVGRAWDLDETTVKKLVGSQTGNGIVKVDASLKM 180

Query: 181 PEP--DESSGLVFNIYDAVPDARLEAGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRA 240
           PEP   +  G V N  +A  D  ++ GG V  +     P +G+ G  A L +++ +++ +
Sbjct: 181 PEPKKGDRKGFVLNCLEAPLDVDIKDGGRVVVLNTKNLPLVGEVGFGADLVRIDGHSMCS 240

Query: 241 PVYVADPSVQLIYVSRGSGRIRVGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGME 300
           P +  D ++Q+ Y+  GSGR+++ G  GK  +++ VKAG L +VP++F V K+A  +G+ 
Sbjct: 241 PGFSCDSALQVTYIVGGSGRVQIVGADGKRVLETHVKAGVLFIVPRFFVVSKIADSDGLS 300

Query: 301 CFSIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEKLLRSKITKASPLVPPSN 356
            FSI+TT  P+   L G+ S++ +LS +V Q +F V  E EK  RSK T  +    PSN
Sbjct: 301 WFSIVTTPDPIFTHLAGRTSVWKALSPEVLQAAFKVDPEVEKAFRSKRTSDAIFFSPSN 356

BLAST of MS012500 vs. TAIR 10
Match: AT1G03890.1 (RmlC-like cupins superfamily protein)

HSP 1 Score: 84.3 bits (207), Expect = 2.1e-16
Identity = 86/401 (21.45%), Positives = 155/401 (38.65%), Query Frame = 0

Query: 17  EGGSFHKWFPSDFPMIAQTKVAAGRLLLHPRGFAIPHNSDSSKVGYVLQGNGIAGLL--- 76
           E G    W     P +    V   R+ L P    +P       + YV+QG G+ G +   
Sbjct: 53  EAGQMEVWDHMS-PELRCAGVTVARITLQPNSIFLPAFFSPPALAYVVQGEGVMGTIASG 112

Query: 77  -------FPGKSG------------EFVVKLE---KGDLIPVPEGVTSWWFNPANDDSDF 136
                    G SG            +   KLE   +GD+     GV+ WW+N    DSD 
Sbjct: 113 CPETFAEVEGSSGRGGGGDPGRRFEDMHQKLENFRRGDVFASLAGVSQWWYN--RGDSDA 172

Query: 137 EILLIGDSSHA-----------LIPGDVTY-----VVFAGRLGILQSFPPEYVAGSYSLN 196
            I+++ D ++             + G  T      + +         F P  +A ++ +N
Sbjct: 173 VIVIVLDVTNRENQLDQVPRMFQLAGSRTQEEEQPLTWPSGNNAFSGFDPNIIAEAFKIN 232

Query: 197 EEESAALLRSQSN---------GLIFKLRPDQTLPEPDESSGL-----VFNIYDAVPDAR 256
            E +  L   + N          L F + P +   +   ++G+        I++ + D  
Sbjct: 233 IETAKQLQNQKDNRGNIIRANGPLHFVIPPPREWQQDGIANGIEETYCTAKIHENIDDPE 292

Query: 257 LE-----AGGSVTAVTEDKFPFIGKSGLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGS 316
                    G ++ +     P +    L A+   L +  +  P + A+    ++YV+ G 
Sbjct: 293 RSDHFSTRAGRISTLNSLNLPVLRLVRLNALRGYLYSGGMVLPQWTANAHT-VLYVTGGQ 352

Query: 317 GRIRVGGFLGK--MDSEVKAGQLVLVPKYFAVGKVAGDEGMECFSIITTTHPLIEELGGK 356
            +I+V    G+   + +V  GQ++++P+ FAV K AG+ G E  S  T  +  I  L G+
Sbjct: 353 AKIQVVDDNGQSVFNEQVGQGQIIVIPQGFAVSKTAGETGFEWISFKTNDNAYINTLSGQ 412

BLAST of MS012500 vs. TAIR 10
Match: AT1G03880.1 (cruciferin 2)

HSP 1 Score: 67.8 bits (164), Expect = 2.0e-11
Identity = 73/315 (23.17%), Positives = 128/315 (40.63%), Query Frame = 0

Query: 82  VVKLEKGDLIPVPEGVTSWWFNPANDDSDFEILLIGDSSHALIPGDVTYVVFAG------ 141
           V  L  GD I  P GV  W++N  N+           S+   +  ++   + AG      
Sbjct: 135 VEHLRCGDTIATPSGVAQWFYNNGNEPLILVAAADLASNQNQLDRNLRPFLIAGNNPQGQ 194

Query: 142 ----------RLGILQSFPPEYVAGSYSLNEEESAALLRSQSN-GLIFK----------- 201
                     +  I   F PE +A ++ +N E +  L   Q N G I K           
Sbjct: 195 EWLQGRKQQKQNNIFNGFAPEILAQAFKINVETAQQLQNQQDNRGNIVKVNGPFGVIRPP 254

Query: 202 LRPDQTLPEPDE-SSGL---------VFNIYD-AVPDARLEAGGSVTAVTEDKFPFIGKS 261
           LR  +   +P E ++GL           N+ D +  D    + G ++ +     P +   
Sbjct: 255 LRRGEGGQQPHEIANGLEETLCTMRCTENLDDPSDADVYKPSLGYISTLNSYNLPILRLL 314

Query: 262 GLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGK--MDSEVKAGQLVLV 321
            L+A+   +  NA+  P +  + +  L YV+ G   I++    G+   D E+ +GQL++V
Sbjct: 315 RLSALRGSIRKNAMVLPQWNVNANAAL-YVTNGKAHIQMVNDNGERVFDQEISSGQLLVV 374

Query: 322 PKYFAVGKVAGDEGMECFSIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAEFEK-- 351
           P+ F+V K A  E  E     T  +  +  L G+ S+   L  +V    + ++ E  K  
Sbjct: 375 PQGFSVMKHAIGEQFEWIEFKTNENAQVNTLAGRTSVMRGLPLEVITNGYQISPEEAKRV 434

BLAST of MS012500 vs. TAIR 10
Match: AT4G28520.1 (cruciferin 3)

HSP 1 Score: 61.2 bits (147), Expect = 1.9e-09
Identity = 48/175 (27.43%), Positives = 81/175 (46.29%), Query Frame = 0

Query: 166 QSNGL---IFKLRPDQTLPEPDESSGLVFNIYDAVPDARLEAGGSVTAVTEDKFPFIGKS 225
           Q NGL   I  +R  + + +P            A  D    + G VT+V     P +   
Sbjct: 331 QGNGLEETICSMRSHENIDDP------------ARADVYKPSLGRVTSVNSYTLPILEYV 390

Query: 226 GLTAVLEKLEANAVRAPVYVADPSVQLIYVSRGSGRIRVGGFLGK--MDSEVKAGQLVLV 285
            L+A    L+ NA+  P Y  + + +++Y + G GRI+V    G+  +D +V+ GQLV++
Sbjct: 391 RLSATRGVLQGNAMVLPKYNMNAN-EILYCTGGQGRIQVVNDNGQNVLDQQVQKGQLVVI 450

Query: 286 PKYFAVGKVAGDEGMECFSIITTTHPLIEELGGKDSIFGSLSAQVFQVSFNVTAE 336
           P+ FA    +     E  S  T  + +I  L G+ S+  +L  +V    F ++ E
Sbjct: 451 PQGFAYVVQSHGNKFEWISFKTNENAMISTLAGRTSLLRALPLEVISNGFQISPE 492

The following BLAST results are available for this feature:
Match Name        E-value     Identity   Description
XP_038879635.1    1.7e-153    76.69      legumin type B-like [Benincasa hispida]
XP_008456077.1    2.7e-151    74.30      PREDICTED: glutelin type-B 5 [Cucumis melo] >TYJ99756.1 glutelin type-B 5 [Cucum...
XP_011651276.2    8.0e-151    73.39      legumin J [Cucumis sativus] >KGN57581.1 hypothetical protein Csa_011641 [Cucumis...
KAA0039049.1      3.9e-150    74.02      glutelin type-B 5 [Cucumis melo var. makuwa]
KAG7014994.1      3.6e-135    73.15      12S seed storage protein CRD, partial [Cucurbita argyrosperma subsp. argyrosperm...

Match Name        E-value     Identity   Description
A0A222NNM9        1.2e-21     23.83      Cocosin 1 OS=Cocos nucifera OX=13894 GN=COS-1 PE=1 SV=1
O23880            1.5e-19     22.88      13S globulin seed storage protein 2 OS=Fagopyrum esculentum OX=3617 GN=FA18 PE=2...
Q8GZP6            5.8e-19     23.57      11S globulin seed storage protein Ana o 2.0101 (Fragment) OS=Anacardium occident...
P04405            7.5e-19     23.61      Glycinin G2 OS=Glycine max OX=3847 GN=GY2 PE=1 SV=2
P11828            1.3e-18     23.98      Glycinin G3 OS=Glycine max OX=3847 GN=GY3 PE=1 SV=1

Match Name        E-value     Identity   Description
A0A5D3BKT3        1.3e-151    74.30      Glutelin type-B 5 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold252G00...
A0A1S3C332        1.3e-151    74.30      glutelin type-B 5 OS=Cucumis melo OX=3656 GN=LOC103496120 PE=4 SV=1
A0A0A0LC21        3.9e-151    73.39      Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G218170 PE=4 SV=1
A0A5A7TCP0        1.9e-150    74.02      Glutelin type-B 5 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold84G001...
A0A6J1JDB2        1.0e-127    63.35      12S seed storage protein CRD-like OS=Cucurbita maxima OX=3661 GN=LOC111483370 PE...

Match Name        E-value     Identity   Description
AT1G07750.1       3.9e-71     38.72      RmlC-like cupins superfamily protein
AT2G28680.1       2.6e-67     36.77      RmlC-like cupins superfamily protein
AT1G03890.1       2.1e-16     21.45      RmlC-like cupins superfamily protein
AT1G03880.1       2.0e-11     23.17      cruciferin 2
AT4G28520.1       1.9e-09     27.43      cruciferin 3
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
IPR Term    IPR Description                     Source        Source Term      Source Description                        Alignment
IPR006045   Cupin 1                             SMART         SM00835          Cupin_1_3                                 coord: 191..338; e-value: 1.1E-12; score: 58.1
IPR006045   Cupin 1                             SMART         SM00835          Cupin_1_3                                 coord: 3..160; e-value: 6.5E-32; score: 122.0
IPR006045   Cupin 1                             PFAM          PF00190          Cupin_1                                   coord: 204..336; e-value: 5.2E-12; score: 45.6
IPR006045   Cupin 1                             PFAM          PF00190          Cupin_1                                   coord: 9..158; e-value: 7.9E-21; score: 74.2
IPR014710   RmlC-like jelly roll fold           GENE3D        2.60.120.10      Jelly Rolls                               coord: 196..356; e-value: 8.0E-36; score: 124.8
IPR014710   RmlC-like jelly roll fold           GENE3D        2.60.120.10      Jelly Rolls                               coord: 9..180; e-value: 8.3E-30; score: 105.4
None        No IPR available                    PANTHER       PTHR31189:SF45   11S GLOBULIN SEED STORAGE PROTEIN 2-LIKE  coord: 3..346
None        No IPR available                    PANTHER       PTHR31189        OS03G0336100 PROTEIN-RELATED              coord: 3..346
None        No IPR available                    CDD           cd02243          cupin_11S_legumin_C                       coord: 205..352; e-value: 6.68125E-59; score: 185.755
None        No IPR available                    CDD           cd02242          cupin_11S_legumin_N                       coord: 4..176; e-value: 4.93049E-52; score: 169.688
IPR011051   RmlC-like cupin domain superfamily  SUPERFAMILY   51182            RmlC-like cupins                          coord: 6..341

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name    Type
MS012500.1      MS012500.1     mRNA