MC10g0916 (gene) Bitter gourd (Dali-11) v1

Overview
Name: MC10g0916
Type: gene
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: glutamic acid-rich protein isoform X1
Location: MC10: 7886492 .. 7889079 (+)
RNA-Seq Expression: MC10g0916
Synteny: MC10g0916
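
The Location field packs the chromosome, start, end, and strand into a single string. Below is a minimal Python sketch of how it could be parsed. It assumes the coordinates are 1-based and inclusive, which this page does not state, and `parse_location` is a hypothetical helper name.

```python
import re

def parse_location(text):
    """Split a location string such as 'MC10: 7886492 .. 7889079 (+)' into its parts."""
    match = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: 1-based inclusive coordinates, so both endpoints are counted.
        "length": end - start + 1,
    }

location = parse_location("MC10: 7886492 .. 7889079 (+)")
print(location["seqid"], location["strand"], location["length"])
```

Under that assumption the feature spans 2,588 bp on the forward strand of MC10.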
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, polypeptide, utr3
ATGGATTCTGACCGCCATTTTCGTACCACCAGCAACAGCACTGCCTCCAGCGAGCTTTTCATCTGCTTCACTTCTCGCTTGTCTTCTTCTTCCGCCATGAAGATCTCTTCCAAATCCATTCTCAGCCCCGGCCGCGCCAGAGAACCCTCCCAAATCTCCCTTTCTACTTCTCTCAGCCGCCGCCTCAAGACCAGCGGCAGCCTCAAGGGCGGCCAGGCCTCGCCGATGTTTCCCACCGGCGCCAAGAAGCGCGGCTGCGCGTTTGATAATCCCGAGCCCTCGTCGCCCAAGGTTACCTGCATTGGACAGGTTAGGGTTAAGACGAAGAAGCAGGGGAAGAAGATGAGGGCCAGATCGCAGAAGCGGAGGAGTAATTCCGAGGCGAGTTTTCGGAAATCGGAGCAGGTTCAACCGCAGACCAATGGCGGTGACCAGCAATTCGTCGCGAAACAATCGCATCACCATCTTCATCTTCATCGTCAGAATAGTAATAGCAGTGGGGGAAACGGTTTCCAGATTCAGAACTCGCAGCAGCAGGAGTGCTTGTCGCATCGGAACCAGCGATGGGTGCATTTGCCGTTCACGATTTGCGAGGCGCTTAGGGCTTTTGGAGCAGAGCTCAACTGCTTCTTGCCGTGCCATTCCTCGTGTACCAGCGACAGGGATAAGGAATCAAAGCCTGCCGCGAAGTCGTCGGAGAGCGAGAGTTCTTGCGGGACGGTGTTCGCGCGGTGGTTGGTGGCAGTGCAGGACGGCGACGGCCAGGGAAGGGAGATCGAGCTGGTGGTCGGAGACGAAGAAACTCGAATGGAGAAGGATAACGGGAGCCAGAGGCGGCATGTCTTCGAGGGAGTAGATTTCAAAGACAAGAGCGAAGTTGTGGAGGAGGAAGAAGAAGAATCCAGGATCAGCATTTGCATTCCTCCGAAGAACGCCTTGTTGCTAATGAGGTGCAGATCTGATCCAGTGAAAGTGGCAGAGCTGGCCAAACGATTCTGCGAATCTCCTGCGCCAAAAGTCGAAGAAGAAGAAGACGAGGAAGATGAACAAAAAGACAAGGAAGAGAAATCCAGACAAAAGGAAGCAGCAAAAATGGACGCGCCTCTCACTGTGATACTGAGTAACGATGAAGACGAAGAAGAAACGAAAGTTGAACTAAATGTCAAGCTTAAAAACGATGAAGAAATGAGTGAAGAATCTGTTTCTGATGGCGAAGAAGAAAATTATTTAGTTTTGCAGCAAGAAGAAGAACATAACGAGGAGGAAACCCTAGAAATAGCCACAGATAACGAAATCGATGTACAGAAATTAGACATTACTGTAAATCACCACAATCAAGAAGAACCAGCAGAGGACGAACAAGAAGAAGAAGAAGAACACGAGAACAACAACAACGACGAAGATAATCAGCCAGAGGAATTAGCAGAAGAAACAAGGGCGATTCCATCCCACTGTGATCCGGAGCTGGCTCAAGATGCAGAGAAAGTAGAATCAGCAGAAGAAGAAGACGAATCCAAATTTCTCCATGGAAACGAATCCATCAACGAAATAGAAGACGACGAAGAACAGACTGAAGAAGAAGGCGAAAATGGCGGAAATCCCGCATCGCCTTCGTTATCAGTTGAGACAGAAAGAGCAGCGGAAGACATGGAAGAAACTGAAGCTGATGTAAATTGGGAAGAAGAAGAAGAAGAAGAAGAAGAGACGATTCATTGGGAAGACAGAGAGAAAGCGACAGAGGAAGAAGGAATGAGACCCGACATCGGAGACGGCGGCGCAATGGAGGAGTCAAAGGAGCGAGAAACTCCGGCGGCGGAAGCGAAGAGAGAAGCAGAAACCGGCGTGCTCCCAGATTGCTTGCTTCTGATGATGTACGAACCGAAGCTATCAATGGAGGTTTCGAAGGAGACGTGGGTTTGCAGCACGGATTTCATCAGGTGCGTTCCGACGAGGGAGAAGAAGGCGGCGGCGGCAAAGAAGCGGGAGGCGAAGGCGGCGGA
GAACACGCAGCCGGCGGTGGTGCAGCCGGCGAGGTGGTCGTGTTCGTTTCCTGCGGCGGCGGCTGCGGCGGCGGCGATAGAGCAGAAGCTAGTGAGGGCGAAGGGGTACGAGCCGTTTGTGCTGACTCGGTGTAAGTCGGAGCCGATGAGGTCGTCGGCTAAGCTGGCGCCGGACGCTTGCTTTTGGAAGGACAGGAAGCTCGAGCCGCACCGCCCGGCTACGTTCGGCGTCGGCGCGGCTGGAGTTGGATTTTGACAATTTAACCCCCCAGGTAAAAAAAAAAAAATAGCGAAATGTAAAAATAGCGTGGTAGTAATTTGTATTTTGGTTAGAAAATTTGTAAGTGTAGCTACAATTGTGTTTTTCCATCTCTATCTCTTTTTCTTTTTGGTCTCAAATTTATTATGATAGAAAATTGAATGAAACAAATTTATATGATCGATAATATTTTGTCAATTTATTAAAGACTGAATTAAGAAAAATTTACGACGTGAAAATAGGTCAAGCATCTATTTTCAACCAAATAATTCTGGAATCCCTAAAACCAAAAAAATTGTGAAATCCCTAAATGTTATCGATTTTATTAG

mRNA sequence

ATGGATTCTGACCGCCATTTTCGTACCACCAGCAACAGCACTGCCTCCAGCGAGCTTTTCATCTGCTTCACTTCTCGCTTGTCTTCTTCTTCCGCCATGAAGATCTCTTCCAAATCCATTCTCAGCCCCGGCCGCGCCAGAGAACCCTCCCAAATCTCCCTTTCTACTTCTCTCAGCCGCCGCCTCAAGACCAGCGGCAGCCTCAAGGGCGGCCAGGCCTCGCCGATGTTTCCCACCGGCGCCAAGAAGCGCGGCTGCGCGTTTGATAATCCCGAGCCCTCGTCGCCCAAGGTTACCTGCATTGGACAGGTTAGGGTTAAGACGAAGAAGCAGGGGAAGAAGATGAGGGCCAGATCGCAGAAGCGGAGGAGTAATTCCGAGGCGAGTTTTCGGAAATCGGAGCAGGTTCAACCGCAGACCAATGGCGGTGACCAGCAATTCGTCGCGAAACAATCGCATCACCATCTTCATCTTCATCGTCAGAATAGTAATAGCAGTGGGGGAAACGGTTTCCAGATTCAGAACTCGCAGCAGCAGGAGTGCTTGTCGCATCGGAACCAGCGATGGGTGCATTTGCCGTTCACGATTTGCGAGGCGCTTAGGGCTTTTGGAGCAGAGCTCAACTGCTTCTTGCCGTGCCATTCCTCGTGTACCAGCGACAGGGATAAGGAATCAAAGCCTGCCGCGAAGTCGTCGGAGAGCGAGAGTTCTTGCGGGACGGTGTTCGCGCGGTGGTTGGTGGCAGTGCAGGACGGCGACGGCCAGGGAAGGGAGATCGAGCTGGTGGTCGGAGACGAAGAAACTCGAATGGAGAAGGATAACGGGAGCCAGAGGCGGCATGTCTTCGAGGGAGTAGATTTCAAAGACAAGAGCGAAGTTGTGGAGGAGGAAGAAGAAGAATCCAGGATCAGCATTTGCATTCCTCCGAAGAACGCCTTGTTGCTAATGAGGTGCAGATCTGATCCAGTGAAAGTGGCAGAGCTGGCCAAACGATTCTGCGAATCTCCTGCGCCAAAAGTCGAAGAAGAAGAAGACGAGGAAGATGAACAAAAAGACAAGGAAGAGAAATCCAGACAAAAGGAAGCAGCAAAAATGGACGCGCCTCTCACTGTGATACTGAGTAACGATGAAGACGAAGAAGAAACGAAAGTTGAACTAAATGTCAAGCTTAAAAACGATGAAGAAATGAGTGAAGAATCTGTTTCTGATGGCGAAGAAGAAAATTATTTAGTTTTGCAGCAAGAAGAAGAACATAACGAGGAGGAAACCCTAGAAATAGCCACAGATAACGAAATCGATGTACAGAAATTAGACATTACTGTAAATCACCACAATCAAGAAGAACCAGCAGAGGACGAACAAGAAGAAGAAGAAGAACACGAGAACAACAACAACGACGAAGATAATCAGCCAGAGGAATTAGCAGAAGAAACAAGGGCGATTCCATCCCACTGTGATCCGGAGCTGGCTCAAGATGCAGAGAAAGTAGAATCAGCAGAAGAAGAAGACGAATCCAAATTTCTCCATGGAAACGAATCCATCAACGAAATAGAAGACGACGAAGAACAGACTGAAGAAGAAGGCGAAAATGGCGGAAATCCCGCATCGCCTTCGTTATCAGTTGAGACAGAAAGAGCAGCGGAAGACATGGAAGAAACTGAAGCTGATGTAAATTGGGAAGAAGAAGAAGAAGAAGAAGAAGAGACGATTCATTGGGAAGACAGAGAGAAAGCGACAGAGGAAGAAGGAATGAGACCCGACATCGGAGACGGCGGCGCAATGGAGGAGTCAAAGGAGCGAGAAACTCCGGCGGCGGAAGCGAAGAGAGAAGCAGAAACCGGCGTGCTCCCAGATTGCTTGCTTCTGATGATGTACGAACCGAAGCTATCAATGGAGGTTTCGAAGGAGACGTGGGTTTGCAGCACGGATTTCATCAGGTGCGTTCCGACGAGGGAGAAGAAGGCGGCGGCGGCAAAGAAGCGGGAGGCGAAGGCGGCGGA
GAACACGCAGCCGGCGGTGGTGCAGCCGGCGAGGTGGTCGTGTTCGTTTCCTGCGGCGGCGGCTGCGGCGGCGGCGATAGAGCAGAAGCTAGTGAGGGCGAAGGGGTACGAGCCGTTTGTGCTGACTCGGTGTAAGTCGGAGCCGATGAGGTCGTCGGCTAAGCTGGCGCCGGACGCTTGCTTTTGGAAGGACAGGAAGCTCGAGCCGCACCGCCCGGCTACGTTCGGCGTCGGCGCGGCTGGAGTTGGATTTTGACAATTTAACCCCCCAGGTAAAAAAAAAAAAATAGCGAAATGTAAAAATAGCGTGGTAGTAATTTGTATTTTGGTTAGAAAATTTGTAAGTGTAGCTACAATTGTGTTTTTCCATCTCTATCTCTTTTTCTTTTTGGTCTCAAATTTATTATGATAGAAAATTGAATGAAACAAATTTATATGATCGATAATATTTTGTCAATTTATTAAAGACTGAATTAAGAAAAATTTACGACGTGAAAATAGGTCAAGCATCTATTTTCAACCAAATAATTCTGGAATCCCTAAAACCAAAAAAATTGTGAAATCCCTAAATGTTATCGATTTTATTAG

Coding sequence (CDS)

ATGGATTCTGACCGCCATTTTCGTACCACCAGCAACAGCACTGCCTCCAGCGAGCTTTTCATCTGCTTCACTTCTCGCTTGTCTTCTTCTTCCGCCATGAAGATCTCTTCCAAATCCATTCTCAGCCCCGGCCGCGCCAGAGAACCCTCCCAAATCTCCCTTTCTACTTCTCTCAGCCGCCGCCTCAAGACCAGCGGCAGCCTCAAGGGCGGCCAGGCCTCGCCGATGTTTCCCACCGGCGCCAAGAAGCGCGGCTGCGCGTTTGATAATCCCGAGCCCTCGTCGCCCAAGGTTACCTGCATTGGACAGGTTAGGGTTAAGACGAAGAAGCAGGGGAAGAAGATGAGGGCCAGATCGCAGAAGCGGAGGAGTAATTCCGAGGCGAGTTTTCGGAAATCGGAGCAGGTTCAACCGCAGACCAATGGCGGTGACCAGCAATTCGTCGCGAAACAATCGCATCACCATCTTCATCTTCATCGTCAGAATAGTAATAGCAGTGGGGGAAACGGTTTCCAGATTCAGAACTCGCAGCAGCAGGAGTGCTTGTCGCATCGGAACCAGCGATGGGTGCATTTGCCGTTCACGATTTGCGAGGCGCTTAGGGCTTTTGGAGCAGAGCTCAACTGCTTCTTGCCGTGCCATTCCTCGTGTACCAGCGACAGGGATAAGGAATCAAAGCCTGCCGCGAAGTCGTCGGAGAGCGAGAGTTCTTGCGGGACGGTGTTCGCGCGGTGGTTGGTGGCAGTGCAGGACGGCGACGGCCAGGGAAGGGAGATCGAGCTGGTGGTCGGAGACGAAGAAACTCGAATGGAGAAGGATAACGGGAGCCAGAGGCGGCATGTCTTCGAGGGAGTAGATTTCAAAGACAAGAGCGAAGTTGTGGAGGAGGAAGAAGAAGAATCCAGGATCAGCATTTGCATTCCTCCGAAGAACGCCTTGTTGCTAATGAGGTGCAGATCTGATCCAGTGAAAGTGGCAGAGCTGGCCAAACGATTCTGCGAATCTCCTGCGCCAAAAGTCGAAGAAGAAGAAGACGAGGAAGATGAACAAAAAGACAAGGAAGAGAAATCCAGACAAAAGGAAGCAGCAAAAATGGACGCGCCTCTCACTGTGATACTGAGTAACGATGAAGACGAAGAAGAAACGAAAGTTGAACTAAATGTCAAGCTTAAAAACGATGAAGAAATGAGTGAAGAATCTGTTTCTGATGGCGAAGAAGAAAATTATTTAGTTTTGCAGCAAGAAGAAGAACATAACGAGGAGGAAACCCTAGAAATAGCCACAGATAACGAAATCGATGTACAGAAATTAGACATTACTGTAAATCACCACAATCAAGAAGAACCAGCAGAGGACGAACAAGAAGAAGAAGAAGAACACGAGAACAACAACAACGACGAAGATAATCAGCCAGAGGAATTAGCAGAAGAAACAAGGGCGATTCCATCCCACTGTGATCCGGAGCTGGCTCAAGATGCAGAGAAAGTAGAATCAGCAGAAGAAGAAGACGAATCCAAATTTCTCCATGGAAACGAATCCATCAACGAAATAGAAGACGACGAAGAACAGACTGAAGAAGAAGGCGAAAATGGCGGAAATCCCGCATCGCCTTCGTTATCAGTTGAGACAGAAAGAGCAGCGGAAGACATGGAAGAAACTGAAGCTGATGTAAATTGGGAAGAAGAAGAAGAAGAAGAAGAAGAGACGATTCATTGGGAAGACAGAGAGAAAGCGACAGAGGAAGAAGGAATGAGACCCGACATCGGAGACGGCGGCGCAATGGAGGAGTCAAAGGAGCGAGAAACTCCGGCGGCGGAAGCGAAGAGAGAAGCAGAAACCGGCGTGCTCCCAGATTGCTTGCTTCTGATGATGTACGAACCGAAGCTATCAATGGAGGTTTCGAAGGAGACGTGGGTTTGCAGCACGGATTTCATCAGGTGCGTTCCGACGAGGGAGAAGAAGGCGGCGGCGGCAAAGAAGCGGGAGGCGAAGGCGGCGGA
GAACACGCAGCCGGCGGTGGTGCAGCCGGCGAGGTGGTCGTGTTCGTTTCCTGCGGCGGCGGCTGCGGCGGCGGCGATAGAGCAGAAGCTAGTGAGGGCGAAGGGGTACGAGCCGTTTGTGCTGACTCGGTGTAAGTCGGAGCCGATGAGGTCGTCGGCTAAGCTGGCGCCGGACGCTTGCTTTTGGAAGGACAGGAAGCTCGAGCCGCACCGCCCGGCTACGTTCGGCGTCGGCGCGGCTGGAGTTGGATTTTGA
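
The protein sequence listed on this page should follow from the coding sequence by codon-wise translation. The sketch below shows that step in Python. It assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene, and `translate` is a hypothetical helper, not part of any tool used to build this page.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1); codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks inside the sequence
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 24 bases of the CDS above.
print(translate("ATGGATTCTGACCGCCATTTTCGT"))  # MDSDRHFR
```

The eight residues printed match the start of the protein sequence on this page; pasting the full CDS (both lines) into `translate` is the natural next check.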

Protein sequence

MDSDRHFRTTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQGKKMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCTSDRDKESKPAAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEESVSDGEEENYLVLQQEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRAIPSHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAMEESKERETPAAEAKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAAKKREAKAAENTQPAVVQPARWSCSFPAAAAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFGVGAAGVGF
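
The description "glutamic acid-rich protein" can be checked against the sequence by measuring how often E occurs. The following Python sketch is one way to do that; `residue_fraction` is a hypothetical helper, and the fragment is residues 340-360 of the protein above, chosen only to keep the example short.

```python
from collections import Counter

def residue_fraction(protein, residue):
    """Return the fraction of positions in `protein` occupied by `residue`."""
    protein = "".join(protein.split()).upper()  # tolerate wrapped sequences
    if not protein:
        raise ValueError("empty sequence")
    return Counter(protein)[residue] / len(protein)

fragment = "VEEEEDEEDEQKDKEEKSRQK"  # residues 340-360 of the protein above
e_fraction = residue_fraction(fragment, "E")
print(f"E fraction in fragment: {e_fraction:.2f}")
```

Running the same function on the full protein sequence gives the overall glutamate content; no threshold for "rich" is implied here.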
Homology
BLAST of MC10g0916 vs. NCBI nr
Match: XP_022142926.1 (uncharacterized protein LOC111012918, partial [Momordica charantia])

HSP 1 Score: 1223 bits (3164), Expect = 0.0
Identity = 665/666 (99.85%), Positives = 665/666 (99.85%), Query Frame = 0

Query: 1   MDSDRHFRTTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSR 60
           MDSDRHFRTTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSR
Sbjct: 1   MDSDRHFRTTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSR 60

Query: 61  RLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQGKKMRARSQ 120
           RLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQGKKMRARSQ
Sbjct: 61  RLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQGKKMRARSQ 120

Query: 121 KRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQE 180
           KRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQE
Sbjct: 121 KRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQE 180

Query: 181 CLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCTSDRDKESKPAAKSSESESSCGT 240
           CLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCTSDRDKESKPAAKSSESESSCGT
Sbjct: 181 CLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCTSDRDKESKPAAKSSESESSCGT 240

Query: 241 VFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSEVVEEEEEE 300
           VFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSEVVEEEEEE
Sbjct: 241 VFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSEVVEEEEEE 300

Query: 301 SRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQK 360
           SRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQK
Sbjct: 301 SRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQK 360

Query: 361 EAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEESVSDGEEENYLVLQQEEEHNE 420
           EAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEESVSDGEEENYLVLQQEEEHNE
Sbjct: 361 EAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEESVSDGEEENYLVLQQEEEHNE 420

Query: 421 EETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRA 480
           EETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRA
Sbjct: 421 EETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRA 480

Query: 481 IPSHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLS 540
           IPSHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLS
Sbjct: 481 IPSHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLS 540

Query: 541 VETERAAEDMEETEADVNWEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAMEESKER 600
           VETERAAEDMEETEADVNWEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAMEESKER
Sbjct: 541 VETERAAEDMEETEADVNWEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAMEESKER 600

Query: 601 ETPAAEAKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAAKK 660
           ETPAAEAKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAAKK
Sbjct: 601 ETPAAEAKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAAKK 660

Query: 661 REAKAA 666
           REAK A
Sbjct: 661 REAKTA 666
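
Each HSP summary line reports identities and positives as a match count over the alignment length, followed by the percentage. The Python sketch below recomputes those percentages from the raw fractions. `parse_hsp_stats` is a hypothetical helper, and the regular expression only assumes the `Label = n/m` layout seen in the summary lines on this page.

```python
import re

def parse_hsp_stats(line):
    """Extract 'Label = matches/length' pairs from a BLAST HSP summary line."""
    stats = {}
    for label, matches, length in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        matches, length = int(matches), int(length)
        stats[label] = {
            "matches": matches,
            "length": length,
            "percent": round(100 * matches / length, 2),
        }
    return stats

line = "Identity = 665/666 (99.85%), Positives = 665/666 (99.85%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats["Identity"]["percent"])  # 99.85
```

Fields without a fraction, such as `Query Frame = 0`, are skipped by the pattern.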

BLAST of MC10g0916 vs. NCBI nr
Match: XP_038894264.1 (glutamic acid-rich protein [Benincasa hispida])

HSP 1 Score: 925 bits (2391), Expect = 0.0
Identity = 565/799 (70.71%), Positives = 633/799 (79.22%), Query Frame = 0

Query: 1   MDSDRHFRTTS-NSTAS-----SELFICFTSRLSSSSA---MKISSKSILSPGRAREPSQ 60
           MD+DRHFRTTS NST+S     SELFICFTSR SSSS+   MKISSKSILSPGRAREPSQ
Sbjct: 1   MDADRHFRTTSTNSTSSTAAPSSELFICFTSRFSSSSSSSSMKISSKSILSPGRAREPSQ 60

Query: 61  ISLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQ 120
           ISLSTSLSRRLK+SGSLKGGQASPMFPTG KKRGCAFDNPEPSSPKVTCIGQVRVKTKKQ
Sbjct: 61  ISLSTSLSRRLKSSGSLKGGQASPMFPTGGKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQ 120

Query: 121 GKKMRARSQKRRSNSEASFRKSEQV--QPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGN 180
           GKKMRARS KRRSNSEASFR+SE V    Q NG +QQF    SHH+ HL RQNSN++GGN
Sbjct: 121 GKKMRARSHKRRSNSEASFRRSESVVQSSQMNGNEQQFS---SHHNHHLLRQNSNNNGGN 180

Query: 181 GFQIQNSQQQECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKP 240
           GFQ      QECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSC+SDR+  KESKP
Sbjct: 181 GFQ------QECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSSDRENNKESKP 240

Query: 241 AAKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDF 300
           A +SSE+ESSCGTVFARWLVAVQDGDG+GREIELVVGDEETR EK+NGSQRRHVFEG+DF
Sbjct: 241 AVRSSETESSCGTVFARWLVAVQDGDGKGREIELVVGDEETRTEKENGSQRRHVFEGLDF 300

Query: 301 KDKSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEE 360
           KD++E+VE+EE  SRISICIPPKNALLLMRCRSDPVK+AELAKRFC+SPAPKV+EE+ EE
Sbjct: 301 KDENEIVEQEE--SRISICIPPKNALLLMRCRSDPVKMAELAKRFCDSPAPKVDEEDGEE 360

Query: 361 DEQKDKEEKSRQKEAAK-MDAPL------TVILSNDEDEEETKVELNVKLKNDEEMSEES 420
           +E++D E K++Q E  + +  PL      TV +S +++EEE KVEL VKL+NDEE +EES
Sbjct: 361 EEEEDNEAKNKQNEVKRDVSVPLPVPVSSTVTVSKEKEEEERKVELIVKLENDEETNEES 420

Query: 421 VSDGEEEN---YLVLQQE---EEHNEEETLEIATDNEIDVQKLDITV-NHHNQEEPAEDE 480
           V D E+EN    L LQ+E   EE N E T+E+AT NEID QKLDI V N  NQE+  E++
Sbjct: 421 VFDSEKENGQVNLFLQEEREEEEDNRERTIEMATVNEIDEQKLDIIVINQPNQEQAVEEK 480

Query: 481 QEEEEEHENNNNDEDNQPEELAEETRAIP------SHCDPELAQDAEKVESAEEEDESKF 540
           +EE++       D+DNQ     +ET AIP      +H +PE AQDAEK+ES EE DESK 
Sbjct: 481 EEEDK------IDQDNQ-----QETMAIPIPIPIETHYEPETAQDAEKLESVEE-DESKL 540

Query: 541 LHGNESINEIEDDEEQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEE 600
            H +E   + ED+  + EEE  NG NP SPS SVETE   ++ E TE D NWEEEEEEEE
Sbjct: 541 PHESEQDQKTEDENLREEEEPVNGENPTSPSFSVETEPVLDETE-TEVDWNWEEEEEEEE 600

Query: 601 ETIHWEDREKATEEEGMRPDIGD-----GGAMEESKERETPAAEAKR--EAETGVLPDCL 660
           E    E+ E+   +EG+ PD  +     G   ++SKERETP  E +R  + ET VLPDCL
Sbjct: 601 EEEEEEEEEEKATDEGIGPDAQNDDKLMGPEEDQSKERETPRPEPERKTQTETSVLPDCL 660

Query: 661 LLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAA--------KKREAKAAENTQPA 720
           LLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKK A          KKRE K A+ TQ A
Sbjct: 661 LLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKPACRNPPPPPPPKKRETKPADTTQAA 720

Query: 721 VVQPARWSCSFPAAAAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDR 751
           VVQPARWSCSFPAAAAAAA IEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDR
Sbjct: 721 VVQPARWSCSFPAAAAAAAMIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDR 775

BLAST of MC10g0916 vs. NCBI nr
Match: XP_008456014.1 (PREDICTED: glutamic acid-rich protein isoform X1 [Cucumis melo])

HSP 1 Score: 890 bits (2299), Expect = 1.97e-315
Identity = 555/813 (68.27%), Positives = 625/813 (76.88%), Query Frame = 0

Query: 1   MDSDRHFRTTS-NSTAS-----SELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISL 60
           MD DRHFRTTS NST+S     SELFICFTSR SSSS+MKISSKSILSPGR REPSQISL
Sbjct: 1   MDPDRHFRTTSTNSTSSTATPSSELFICFTSRFSSSSSMKISSKSILSPGRHREPSQISL 60

Query: 61  STSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQGKK 120
           STSLSRRLK+SGSLKGGQASPMFPTG KKRGCAFDNPEPSSPKVTCIGQVRVKTKKQGKK
Sbjct: 61  STSLSRRLKSSGSLKGGQASPMFPTGRKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQGKK 120

Query: 121 MRARSQKRRSNSEASFRKSEQV--QPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQ 180
           MRARSQKRR+NSEASFR+SE V    Q N  DQQF    SHH+ HL RQNSNS+ GNGFQ
Sbjct: 121 MRARSQKRRTNSEASFRRSESVVQSSQVNSNDQQFS---SHHNHHLLRQNSNSNAGNGFQ 180

Query: 181 IQNSQQQECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAK 240
                 QECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSC+ +R+  KE KPA +
Sbjct: 181 ------QECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSGNRENNKEPKPAER 240

Query: 241 SSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDK 300
           SSESESSCGTVFARWLVAVQDGDG+GREIELVVGDEETR EK+NGSQRRHVFEG+DFKDK
Sbjct: 241 SSESESSCGTVFARWLVAVQDGDGKGREIELVVGDEETRTEKENGSQRRHVFEGLDFKDK 300

Query: 301 SEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQ 360
           +E VEEEE  SRISICIPPKNALLLMRCRSDPVK+AELAKRFCE PAPKV+EE++EE E 
Sbjct: 301 NEAVEEEE--SRISICIPPKNALLLMRCRSDPVKMAELAKRFCEPPAPKVDEEDEEEGED 360

Query: 361 KDKEEKSRQKEAAK-----MDAPLTVILSNDEDEEETKVE-------LNVKLKNDEEMSE 420
           +D E K R+ E  +     + + +TV    +E+EEE K E         VKL+N+EE++E
Sbjct: 361 EDNEAKKRKNEVKRDVSVPVSSIITVNKEEEEEEEEEKEEDERKVEQFVVKLENEEEVNE 420

Query: 421 ESVSDGE---EENYLVLQQE---EEHNEEETLEIATDNEIDVQKLDITV-NHHNQEEPAE 480
           ESVSD +   EE  LVLQ+E   E+ NEEET+E+AT+N+ D QK DITV N  NQE+  E
Sbjct: 421 ESVSDEDKEKEEANLVLQEEQREEKDNEEETIEMATEND-DEQKQDITVVNQLNQEQALE 480

Query: 481 DEQEEEEEHENNNNDEDNQPEELAEETRAIP----SHCDPELAQDAEKVESAEEEDESKF 540
           +++E++        D+ NQ     +ET AIP    +HC+PE+AQDAEK+ES E+E ESK 
Sbjct: 481 EKEEDK-------TDQVNQ-----QETMAIPIPIQTHCEPEMAQDAEKLESVEKE-ESKL 540

Query: 541 LHGNESINEIEDDE---------EQTEEEGENGGNPASPSLSVETERAAEDMEETEADVN 600
            H +E   + E+DE         E+ EEEGENG NP SPSLSVET+     ++ETE +V+
Sbjct: 541 SHESEQDQKTEEDEILREEKEEEEEEEEEGENGENPTSPSLSVETKPV---LDETETEVD 600

Query: 601 WEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAM------EESKERETPAAE------ 660
            + EEEEEEE    E+ EKAT+E G+ PD  + GA+      ++SKERETP  E      
Sbjct: 601 GKREEEEEEE----EEEEKATDE-GIGPDDENNGALVGPEEEDQSKERETPPPEPEPEPE 660

Query: 661 AKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAA-------- 720
            K + ET VLPDCLLLMMYEPKLSMEVSKETWVCS DFIRCVPTREKK            
Sbjct: 661 GKTQTETSVLPDCLLLMMYEPKLSMEVSKETWVCSADFIRCVPTREKKTVGRDPPPPPPP 720

Query: 721 KKREAKAAENTQPAVVQPARWSCSFPAAAAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRS 751
           KKRE K  +  Q  VVQPARWSCSFPAAAAAAA IEQKL RAKGYEPFVLTRCKSEPMRS
Sbjct: 721 KKRETKPTDTMQTTVVQPARWSCSFPAAAAAAAMIEQKLARAKGYEPFVLTRCKSEPMRS 780

BLAST of MC10g0916 vs. NCBI nr
Match: KAG7031049.1 (hypothetical protein SDJN02_05088, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 887 bits (2291), Expect = 3.22e-314
Identity = 564/821 (68.70%), Positives = 627/821 (76.37%), Query Frame = 0

Query: 1   MDSDRHFRTTSNSTA----SSELFICFTSRLSSSSA--MKISSKSILSPGRAREPSQISL 60
           MDSDRHFR T++STA    SSELFICFTSR SSSS+  MKISSKSILSPGR REP+QISL
Sbjct: 1   MDSDRHFRNTTSSTATASASSELFICFTSRFSSSSSSSMKISSKSILSPGRPREPAQISL 60

Query: 61  STSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQGKK 120
           STSLSRRLK+SGSLKGGQASPMFPTG KKRGCAF+NPEPSSPKVTCIGQVRVKTKKQGKK
Sbjct: 61  STSLSRRLKSSGSLKGGQASPMFPTGGKKRGCAFENPEPSSPKVTCIGQVRVKTKKQGKK 120

Query: 121 MRARSQKRRSNSEASFRKSE--QVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQ 180
           MRARS KRRSNSEASFRKSE  QVQ Q NG D  FV + SH + HL RQNSN  GGNGFQ
Sbjct: 121 MRARSLKRRSNSEASFRKSESVQVQSQMNGND--FVNQSSHLNHHLLRQNSN--GGNGFQ 180

Query: 181 IQNSQQQECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAK 240
                 QECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSC+S+R+  KESK   +
Sbjct: 181 ------QECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSSNRENNKESKTTER 240

Query: 241 SSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDK 300
           SSESESSCGTVFARWLVAVQD DG+GREIELVVGDEE+R EK+NGSQRRHVFEG+DFK++
Sbjct: 241 SSESESSCGTVFARWLVAVQDNDGRGREIELVVGDEESRTEKENGSQRRHVFEGLDFKEE 300

Query: 301 SEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQ 360
            EVV+EEE  SRISICIPPKNALLLMRCRSDPVK+AELAKRFCESP PK+EEE++EE++ 
Sbjct: 301 KEVVQEEE--SRISICIPPKNALLLMRCRSDPVKMAELAKRFCESPVPKLEEEDEEEND- 360

Query: 361 KDKEEKSRQKEAAK-------MDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEESVSD 420
              EE S Q EA K       M   +T+I   +E+EEETKVELN KLKN+EEM EESVSD
Sbjct: 361 ---EENSNQNEAKKGTPLPVLMPLTVTLIKEEEEEEEETKVELNSKLKNEEEMIEESVSD 420

Query: 421 GEEENY---------LVLQQEEEHNE---EETLEIATDNEIDVQKLDITV-NHHNQEEPA 480
            EEE           +VLQ+EEE  E   EE++E+AT+NEIDVQKLDITV NH +QEE A
Sbjct: 421 AEEEEEEEEEEEEANVVLQEEEEEEEDSGEESIEMATENEIDVQKLDITVINHQDQEEEA 480

Query: 481 EDEQEEEEEHENNNN-DEDNQPEELAEETRA----IPSHCDPELAQDAEKVESAEEEDES 540
           E+++E+E E E  +  D+DNQ ++L EET A    I + C+PE+ QDAEK+ESAE  DE 
Sbjct: 481 EEDKEQEHEQEQEHRIDQDNQQQKLVEETMAFSIPISTQCEPEMVQDAEKLESAEG-DEF 540

Query: 541 KFLHGNESINEIEDDEEQT-----EEEGENGGNPASPSLSVETERAAEDMEETEADVNWE 600
           K  HGNE   E E+ EEQ      EE+ ENG NP SP LSVETE           D NWE
Sbjct: 541 KPFHGNEQDFETEEHEEQMKEFEEEEKSENGENPTSPPLSVETE----------VDGNWE 600

Query: 601 EEEEEEEETIHWEDREKATEEE-----------GMRPDIGDGGAM------EESKERETP 660
           EEEEEEE      +R ++TEEE           G+ P I +   M      ++SKERETP
Sbjct: 601 EEEEEEE------NRGRSTEEELKGTAATAMDEGIGPHIQNDDEMGLEEEEDQSKERETP 660

Query: 661 AAEAKREAET------GVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAA 720
             E +RE +T       VLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAA 
Sbjct: 661 PPEPERETQTQTKPEASVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAP 720

Query: 721 A---KKREAKAAE-NTQPAV-VQPARWSCSFPAAAAAAAAIEQKLVRAKG--YEPFVLTR 751
               KKRE K AE NTQ  V +QP RWSCSFPAAAAAAA IEQKL RAKG  YEPFVLTR
Sbjct: 721 PPPPKKREIKTAEKNTQTQVAIQPGRWSCSFPAAAAAAAMIEQKLERAKGGGYEPFVLTR 780

BLAST of MC10g0916 vs. NCBI nr
Match: XP_022941617.1 (glutamic acid-rich protein-like [Cucurbita moschata])

HSP 1 Score: 883 bits (2282), Expect = 4.09e-313
Identity = 563/810 (69.51%), Positives = 622/810 (76.79%), Query Frame = 0

Query: 1   MDSDRHFRTTSNSTA----SSELFICFTSRLSSSSA--MKISSKSILSPGRAREPSQISL 60
           MDSDRHFR T++STA    SSELFICFTSR SSSS+  MKISSKSILSPGR REP+QISL
Sbjct: 1   MDSDRHFRNTTSSTATASASSELFICFTSRFSSSSSSSMKISSKSILSPGRPREPAQISL 60

Query: 61  STSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQGKK 120
           STSLSRRLK+SGSLKGGQASPMFPTG KKRGCAF+NPEPSSPKVTCIGQVRVKTKKQGKK
Sbjct: 61  STSLSRRLKSSGSLKGGQASPMFPTGGKKRGCAFENPEPSSPKVTCIGQVRVKTKKQGKK 120

Query: 121 MRARSQKRRSNSEASFRKSE--QVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQ 180
           MRARS KRRSNSEASFRKSE  QVQ Q NG D  FV + SH + HL RQNSN  GGNGFQ
Sbjct: 121 MRARSLKRRSNSEASFRKSESVQVQSQMNGND--FVNQSSHLNHHLLRQNSN--GGNGFQ 180

Query: 181 IQNSQQQECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAK 240
                 QECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSC+S+R+  KESK   +
Sbjct: 181 ------QECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSSNRENNKESKTTER 240

Query: 241 SSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDK 300
           SSESESSCGTVFARWLVAVQD DG+GREIELVVGDEE+R EK+NGSQRRHVFEG+DFK++
Sbjct: 241 SSESESSCGTVFARWLVAVQDNDGRGREIELVVGDEESRTEKENGSQRRHVFEGLDFKEE 300

Query: 301 SEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQ 360
            EVV+EEE  SRISICIPPKNALLLMRCRSDPVK+AELAKRFCESP PK+EEE++EE++ 
Sbjct: 301 KEVVQEEE--SRISICIPPKNALLLMRCRSDPVKMAELAKRFCESPVPKLEEEDEEEND- 360

Query: 361 KDKEEKSRQKEAAK-----MDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEESVSDGE 420
              EE S Q EA K     +  PLTV L  +E+EEETKVELN KLKN+EEM EESVSD E
Sbjct: 361 ---EENSNQNEAQKGTPLPVLMPLTVTLIKEEEEEETKVELNSKLKNEEEMIEESVSDAE 420

Query: 421 EENY---LVLQQEEEHNE---EETLEIATDNEIDVQKLDITV-NHHNQEEPAED-EQEEE 480
           EE     +VLQ+EEE  E   EE++E+AT+NEIDVQKLDITV NH +QEE  ED EQE E
Sbjct: 421 EEEEEANVVLQEEEEEEEDNGEESIEMATENEIDVQKLDITVINHQDQEEAEEDKEQEHE 480

Query: 481 EEHENNNNDEDNQPEELAEETRA----IPSHCDPELAQDAEKVESAEEEDESKFLHGNES 540
           +EH     D+DNQ ++L EET A    I + C+PE+ QDAEK+ESAE  DE K  HGNE 
Sbjct: 481 QEH---RIDQDNQQQKLVEETMAFSIPISTQCEPEMVQDAEKLESAEG-DEFKPFHGNEQ 540

Query: 541 INEIEDDEEQTEEE--GENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEETIH 600
             E E+  ++ EEE   ENG NP SP LSVETE           D NWEEEEE       
Sbjct: 541 DFETEEQMKELEEEEKSENGENPTSPPLSVETE----------VDGNWEEEEE------- 600

Query: 601 WEDREKATEEE-----------GMRPDIGDGGAM------EESKERETPAAEAKREAET- 660
             +R ++TEEE           G+ P I +   M      ++SKERETP  E +RE +T 
Sbjct: 601 --NRGRSTEEELKGTAATAMDEGIGPHIQNDDEMGLEEEEDQSKERETPPPEPERETQTQ 660

Query: 661 -----GVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAA---KKREAKA 720
                 VLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAA     KKRE K 
Sbjct: 661 TKPEASVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAPPPPPKKREIKT 720

Query: 721 AE-NTQPAV-VQPARWSCSFPAAAAAAAAIEQKLVRAKG--YEPFVLTRCKSEPMRSSAK 751
           AE NTQ  V +QP RWSCSFPAAAAAAA IEQKL RAKG  YEPFVLTRCKSEPMRSSAK
Sbjct: 721 AEKNTQTQVAIQPGRWSCSFPAAAAAAAMIEQKLERAKGGGYEPFVLTRCKSEPMRSSAK 771

BLAST of MC10g0916 vs. ExPASy TrEMBL
Match: A0A6J1CNN3 (uncharacterized protein LOC111012918 OS=Momordica charantia OX=3673 GN=LOC111012918 PE=4 SV=1)

HSP 1 Score: 1223 bits (3164), Expect = 0.0
Identity = 665/666 (99.85%), Positives = 665/666 (99.85%), Query Frame = 0

Query: 1   MDSDRHFRTTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSR 60
           MDSDRHFRTTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSR
Sbjct: 1   MDSDRHFRTTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSR 60

Query: 61  RLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQGKKMRARSQ 120
           RLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQGKKMRARSQ
Sbjct: 61  RLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQGKKMRARSQ 120

Query: 121 KRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQE 180
           KRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQE
Sbjct: 121 KRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQE 180

Query: 181 CLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCTSDRDKESKPAAKSSESESSCGT 240
           CLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCTSDRDKESKPAAKSSESESSCGT
Sbjct: 181 CLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCTSDRDKESKPAAKSSESESSCGT 240

Query: 241 VFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSEVVEEEEEE 300
           VFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSEVVEEEEEE
Sbjct: 241 VFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSEVVEEEEEE 300

Query: 301 SRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQK 360
           SRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQK
Sbjct: 301 SRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSRQK 360

Query: 361 EAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEESVSDGEEENYLVLQQEEEHNE 420
           EAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEESVSDGEEENYLVLQQEEEHNE
Sbjct: 361 EAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEESVSDGEEENYLVLQQEEEHNE 420

Query: 421 EETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRA 480
           EETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRA
Sbjct: 421 EETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDNQPEELAEETRA 480

Query: 481 IPSHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLS 540
           IPSHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLS
Sbjct: 481 IPSHCDPELAQDAEKVESAEEEDESKFLHGNESINEIEDDEEQTEEEGENGGNPASPSLS 540

Query: 541 VETERAAEDMEETEADVNWEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAMEESKER 600
           VETERAAEDMEETEADVNWEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAMEESKER
Sbjct: 541 VETERAAEDMEETEADVNWEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAMEESKER 600

Query: 601 ETPAAEAKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAAKK 660
           ETPAAEAKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAAKK
Sbjct: 601 ETPAAEAKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAAKK 660

Query: 661 REAKAA 666
           REAK A
Sbjct: 661 REAKTA 666

BLAST of MC10g0916 vs. ExPASy TrEMBL
Match: A0A1S3C2C2 (glutamic acid-rich protein isoform X1 OS=Cucumis melo OX=3656 GN=LOC103496068 PE=4 SV=1)

HSP 1 Score: 890 bits (2299), Expect = 9.54e-316
Identity = 555/813 (68.27%), Positives = 625/813 (76.88%), Query Frame = 0

Query: 1   MDSDRHFRTTS-NSTAS-----SELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISL 60
           MD DRHFRTTS NST+S     SELFICFTSR SSSS+MKISSKSILSPGR REPSQISL
Sbjct: 1   MDPDRHFRTTSTNSTSSTATPSSELFICFTSRFSSSSSMKISSKSILSPGRHREPSQISL 60

Query: 61  STSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQGKK 120
           STSLSRRLK+SGSLKGGQASPMFPTG KKRGCAFDNPEPSSPKVTCIGQVRVKTKKQGKK
Sbjct: 61  STSLSRRLKSSGSLKGGQASPMFPTGRKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQGKK 120

Query: 121 MRARSQKRRSNSEASFRKSEQV--QPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQ 180
           MRARSQKRR+NSEASFR+SE V    Q N  DQQF    SHH+ HL RQNSNS+ GNGFQ
Sbjct: 121 MRARSQKRRTNSEASFRRSESVVQSSQVNSNDQQFS---SHHNHHLLRQNSNSNAGNGFQ 180

Query: 181 IQNSQQQECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAK 240
                 QECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSC+ +R+  KE KPA +
Sbjct: 181 ------QECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSGNRENNKEPKPAER 240

Query: 241 SSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDK 300
           SSESESSCGTVFARWLVAVQDGDG+GREIELVVGDEETR EK+NGSQRRHVFEG+DFKDK
Sbjct: 241 SSESESSCGTVFARWLVAVQDGDGKGREIELVVGDEETRTEKENGSQRRHVFEGLDFKDK 300

Query: 301 SEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQ 360
           +E VEEEE  SRISICIPPKNALLLMRCRSDPVK+AELAKRFCE PAPKV+EE++EE E 
Sbjct: 301 NEAVEEEE--SRISICIPPKNALLLMRCRSDPVKMAELAKRFCEPPAPKVDEEDEEEGED 360

Query: 361 KDKEEKSRQKEAAK-----MDAPLTVILSNDEDEEETKVE-------LNVKLKNDEEMSE 420
           +D E K R+ E  +     + + +TV    +E+EEE K E         VKL+N+EE++E
Sbjct: 361 EDNEAKKRKNEVKRDVSVPVSSIITVNKEEEEEEEEEKEEDERKVEQFVVKLENEEEVNE 420

Query: 421 ESVSDGE---EENYLVLQQE---EEHNEEETLEIATDNEIDVQKLDITV-NHHNQEEPAE 480
           ESVSD +   EE  LVLQ+E   E+ NEEET+E+AT+N+ D QK DITV N  NQE+  E
Sbjct: 421 ESVSDEDKEKEEANLVLQEEQREEKDNEEETIEMATEND-DEQKQDITVVNQLNQEQALE 480

Query: 481 DEQEEEEEHENNNNDEDNQPEELAEETRAIP----SHCDPELAQDAEKVESAEEEDESKF 540
           +++E++        D+ NQ     +ET AIP    +HC+PE+AQDAEK+ES E+E ESK 
Sbjct: 481 EKEEDK-------TDQVNQ-----QETMAIPIPIQTHCEPEMAQDAEKLESVEKE-ESKL 540

Query: 541 LHGNESINEIEDDE---------EQTEEEGENGGNPASPSLSVETERAAEDMEETEADVN 600
            H +E   + E+DE         E+ EEEGENG NP SPSLSVET+     ++ETE +V+
Sbjct: 541 SHESEQDQKTEEDEILREEKEEEEEEEEEGENGENPTSPSLSVETKPV---LDETETEVD 600

Query: 601 WEEEEEEEEETIHWEDREKATEEEGMRPDIGDGGAM------EESKERETPAAE------ 660
            + EEEEEEE    E+ EKAT+E G+ PD  + GA+      ++SKERETP  E      
Sbjct: 601 GKREEEEEEE----EEEEKATDE-GIGPDDENNGALVGPEEEDQSKERETPPPEPEPEPE 660

Query: 661 AKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAA-------- 720
            K + ET VLPDCLLLMMYEPKLSMEVSKETWVCS DFIRCVPTREKK            
Sbjct: 661 GKTQTETSVLPDCLLLMMYEPKLSMEVSKETWVCSADFIRCVPTREKKTVGRDPPPPPPP 720

Query: 721 KKREAKAAENTQPAVVQPARWSCSFPAAAAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRS 751
           KKRE K  +  Q  VVQPARWSCSFPAAAAAAA IEQKL RAKGYEPFVLTRCKSEPMRS
Sbjct: 721 KKRETKPTDTMQTTVVQPARWSCSFPAAAAAAAMIEQKLARAKGYEPFVLTRCKSEPMRS 780

BLAST of MC10g0916 vs. ExPASy TrEMBL
Match: A0A6J1FMZ8 (glutamic acid-rich protein-like OS=Cucurbita moschata OX=3662 GN=LOC111446922 PE=4 SV=1)

HSP 1 Score: 883 bits (2282), Expect = 1.98e-313
Identity = 563/810 (69.51%), Positives = 622/810 (76.79%), Query Frame = 0

Query: 1   MDSDRHFRTTSNSTA----SSELFICFTSRLSSSSA--MKISSKSILSPGRAREPSQISL 60
           MDSDRHFR T++STA    SSELFICFTSR SSSS+  MKISSKSILSPGR REP+QISL
Sbjct: 1   MDSDRHFRNTTSSTATASASSELFICFTSRFSSSSSSSMKISSKSILSPGRPREPAQISL 60

Query: 61  STSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQGKK 120
           STSLSRRLK+SGSLKGGQASPMFPTG KKRGCAF+NPEPSSPKVTCIGQVRVKTKKQGKK
Sbjct: 61  STSLSRRLKSSGSLKGGQASPMFPTGGKKRGCAFENPEPSSPKVTCIGQVRVKTKKQGKK 120

Query: 121 MRARSQKRRSNSEASFRKSE--QVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQ 180
           MRARS KRRSNSEASFRKSE  QVQ Q NG D  FV + SH + HL RQNSN  GGNGFQ
Sbjct: 121 MRARSLKRRSNSEASFRKSESVQVQSQMNGND--FVNQSSHLNHHLLRQNSN--GGNGFQ 180

Query: 181 IQNSQQQECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPAAK 240
                 QECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSC+S+R+  KESK   +
Sbjct: 181 ------QECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSSNRENNKESKTTER 240

Query: 241 SSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDK 300
           SSESESSCGTVFARWLVAVQD DG+GREIELVVGDEE+R EK+NGSQRRHVFEG+DFK++
Sbjct: 241 SSESESSCGTVFARWLVAVQDNDGRGREIELVVGDEESRTEKENGSQRRHVFEGLDFKEE 300

Query: 301 SEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQ 360
            EVV+EEE  SRISICIPPKNALLLMRCRSDPVK+AELAKRFCESP PK+EEE++EE++ 
Sbjct: 301 KEVVQEEE--SRISICIPPKNALLLMRCRSDPVKMAELAKRFCESPVPKLEEEDEEEND- 360

Query: 361 KDKEEKSRQKEAAK-----MDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEESVSDGE 420
              EE S Q EA K     +  PLTV L  +E+EEETKVELN KLKN+EEM EESVSD E
Sbjct: 361 ---EENSNQNEAQKGTPLPVLMPLTVTLIKEEEEEETKVELNSKLKNEEEMIEESVSDAE 420

Query: 421 EENY---LVLQQEEEHNE---EETLEIATDNEIDVQKLDITV-NHHNQEEPAED-EQEEE 480
           EE     +VLQ+EEE  E   EE++E+AT+NEIDVQKLDITV NH +QEE  ED EQE E
Sbjct: 421 EEEEEANVVLQEEEEEEEDNGEESIEMATENEIDVQKLDITVINHQDQEEAEEDKEQEHE 480

Query: 481 EEHENNNNDEDNQPEELAEETRA----IPSHCDPELAQDAEKVESAEEEDESKFLHGNES 540
           +EH     D+DNQ ++L EET A    I + C+PE+ QDAEK+ESAE  DE K  HGNE 
Sbjct: 481 QEH---RIDQDNQQQKLVEETMAFSIPISTQCEPEMVQDAEKLESAEG-DEFKPFHGNEQ 540

Query: 541 INEIEDDEEQTEEE--GENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEETIH 600
             E E+  ++ EEE   ENG NP SP LSVETE           D NWEEEEE       
Sbjct: 541 DFETEEQMKELEEEEKSENGENPTSPPLSVETE----------VDGNWEEEEE------- 600

Query: 601 WEDREKATEEE-----------GMRPDIGDGGAM------EESKERETPAAEAKREAET- 660
             +R ++TEEE           G+ P I +   M      ++SKERETP  E +RE +T 
Sbjct: 601 --NRGRSTEEELKGTAATAMDEGIGPHIQNDDEMGLEEEEDQSKERETPPPEPERETQTQ 660

Query: 661 -----GVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAA---KKREAKA 720
                 VLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAA     KKRE K 
Sbjct: 661 TKPEASVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAPPPPPKKREIKT 720

Query: 721 AE-NTQPAV-VQPARWSCSFPAAAAAAAAIEQKLVRAKG--YEPFVLTRCKSEPMRSSAK 751
           AE NTQ  V +QP RWSCSFPAAAAAAA IEQKL RAKG  YEPFVLTRCKSEPMRSSAK
Sbjct: 721 AEKNTQTQVAIQPGRWSCSFPAAAAAAAMIEQKLERAKGGGYEPFVLTRCKSEPMRSSAK 771

BLAST of MC10g0916 vs. ExPASy TrEMBL
Match: A0A6J1IN35 (calponin homology domain-containing protein DDB_G0272472-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111478932 PE=4 SV=1)

HSP 1 Score: 880 bits (2273), Expect = 3.85e-312
Identity = 557/801 (69.54%), Positives = 624/801 (77.90%), Query Frame = 0

Query: 1   MDSDRHFRTTSNSTAS----SELFICFTSRLSSSSA----MKISSKSILSPGRAREPSQI 60
           MDSDRHFR T++STAS    SELFICFTSR+SSSS+    MKISSKSILSPGR REP+QI
Sbjct: 1   MDSDRHFRNTTSSTASASASSELFICFTSRISSSSSSSSSMKISSKSILSPGRPREPAQI 60

Query: 61  SLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG 120
           SLSTSLSRRLK+SGSLKGGQASPMFPTG KKRGC F+NPEPSSPKVTCIGQVRVKTKKQG
Sbjct: 61  SLSTSLSRRLKSSGSLKGGQASPMFPTGGKKRGCGFENPEPSSPKVTCIGQVRVKTKKQG 120

Query: 121 KKMRARSQKRRSNSEASFRKSE--QVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNG 180
           KKMRARS KRRSNSEASFRKSE  QVQ Q NG D  F+ + SHH+ HL RQNSN  GGNG
Sbjct: 121 KKMRARSLKRRSNSEASFRKSESVQVQSQMNGND--FMNQSSHHNHHLLRQNSN--GGNG 180

Query: 181 FQIQNSQQQECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPA 240
           FQ      QECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSC+S+R+  KESK  
Sbjct: 181 FQ------QECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSSNRENNKESKTT 240

Query: 241 AKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFK 300
            +SSESESSCGTVFARWLVAVQD DG+GREIELVVGDEE+R EKDNGSQRRHVFEG+DFK
Sbjct: 241 ERSSESESSCGTVFARWLVAVQDNDGRGREIELVVGDEESRTEKDNGSQRRHVFEGLDFK 300

Query: 301 DKSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEED 360
           ++ EVV+EEE  SRISICIPPKNALLLMRCRSDPVK+AELAKRFCESP PK+EEEE+E+D
Sbjct: 301 EEKEVVQEEE--SRISICIPPKNALLLMRCRSDPVKMAELAKRFCESPVPKLEEEEEEDD 360

Query: 361 EQKDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEESVSDGEEEN 420
           E+   + ++++     +  PL+V L  +E EEETKVELN KLKN+EEM EESVSD EEE 
Sbjct: 361 EENSNQNEAKKGAPLPVLVPLSVTLIKEE-EEETKVELNSKLKNEEEMIEESVSDAEEEE 420

Query: 421 Y--LVLQQE--EEHNEEETLEIATDNEIDVQKLDITV-NHHNQEEPAEDEQEEEEEHENN 480
              LVLQ+E  EE N EE++E+AT+NEIDVQKLDITV NH +QEE AE+++E+E     +
Sbjct: 421 EANLVLQEEKEEEVNGEESIEMATENEIDVQKLDITVTNHQDQEEEAEEDKEQE-----H 480

Query: 481 NNDEDNQPEELAEETRA----IPSHCDPELAQDAEKVESAEEEDESKFLHGNESINEIED 540
             D+DNQ ++L EET A    I + C+PE+ QDAEK+ESAE  DE K  HGNE   E E+
Sbjct: 481 RIDQDNQQQKL-EETMAFSIPISTRCEPEMVQDAEKLESAEG-DEFKPSHGNEQDFETEE 540

Query: 541 DEEQTEEE--GENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEETIHWEDREK 600
             ++ EEE   ENG NP SP LSVET            D NWEEEEEEEEE      R +
Sbjct: 541 QMKELEEEEKSENGENPTSPPLSVETA----------VDGNWEEEEEEEEEN-----RGR 600

Query: 601 ATEE----------EGMRPDIGDGGAM-----EESKERETPAAEAKREAET------GVL 660
           +TEE          EG+ P I +   M     ++SKERETP  E +RE +T       VL
Sbjct: 601 STEELKGTTATAMDEGIGPHIQNDDEMGLEEEDQSKERETPPPEPERETQTQTKPEASVL 660

Query: 661 PDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAA---AAKKREAKAA-ENTQPA 720
           PDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAA   A KKRE K A ++TQ  
Sbjct: 661 PDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAPPPAPKKREIKTAGKHTQTQ 720

Query: 721 V-VQPARWSCSFPAAAAAAAAIEQKLVRAKG-YEPFVLTRCKSEPMRSSAKLAPDACFWK 751
           V +QP RWSCSFPAAAAAAA IEQKL RAKG YEPFVLTRCKSEPMRSSAKLAPD  F K
Sbjct: 721 VAIQPGRWSCSFPAAAAAAAMIEQKLERAKGGYEPFVLTRCKSEPMRSSAKLAPDTGFCK 766

BLAST of MC10g0916 vs. ExPASy TrEMBL
Match: A0A0A0L789 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G236550 PE=4 SV=1)

HSP 1 Score: 880 bits (2274), Expect = 4.64e-312
Identity = 559/816 (68.50%), Positives = 620/816 (75.98%), Query Frame = 0

Query: 1   MDSDRHFRTTS-NSTAS-----SELFICFTSRLSSSSA--MKISSKSILSPGRAREPSQI 60
           MDSD HFRTTS NST+S     SELFICFTSR SSSS+  MKISSKSILSPGR REPSQI
Sbjct: 1   MDSDPHFRTTSTNSTSSTATPSSELFICFTSRFSSSSSSSMKISSKSILSPGRPREPSQI 60

Query: 61  SLSTSLSRRLKTSGSLKGGQASPMFPTGAKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG 120
           SLSTSLSRRLK+SGSLKGGQASPMFPTG KKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG
Sbjct: 61  SLSTSLSRRLKSSGSLKGGQASPMFPTGRKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG 120

Query: 121 KKMRARSQKRRSNSEASFRKSEQV--QPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNG 180
           KKMRARSQKRR+NSEASFR+SE +    Q NG DQQF    SHH+ HL RQNSNS+ GNG
Sbjct: 121 KKMRARSQKRRTNSEASFRRSESLVQSSQGNGSDQQFS---SHHNHHLLRQNSNSNAGNG 180

Query: 181 FQIQNSQQQECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCTSDRD--KESKPA 240
           FQ      QECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSC+ +R+  KESKPA
Sbjct: 181 FQ------QECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSGNRENNKESKPA 240

Query: 241 AKSSESESSCGTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFK 300
            +SSESESSCGTVFARWLVAVQDGDG+GREIELVVGDEETR EK+NGSQRRHVFEG+DFK
Sbjct: 241 ERSSESESSCGTVFARWLVAVQDGDGKGREIELVVGDEETRTEKENGSQRRHVFEGLDFK 300

Query: 301 DKSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEED 360
           DK+E VEEEE  SRISICIPPKNALLLMRCRSDPVK+AELAKRFCE PAPKV+EE DEE 
Sbjct: 301 DKNEAVEEEE--SRISICIPPKNALLLMRCRSDPVKMAELAKRFCEPPAPKVDEE-DEEG 360

Query: 361 EQKDKEEKSRQKEAAK-MDAPLTVILSNDEDEEETKVE--------LNVKLKNDEEMSEE 420
           E +D E K RQ E  + +  P++ I++ +++EEE K E        L VKL+N+EEM+EE
Sbjct: 361 EDEDNEAKKRQNEVKRDVSVPVSSIVTVNKEEEEVKEEEDERKVEQLIVKLENEEEMNEE 420

Query: 421 SVSDGE---EENYLVLQQEE----EHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAED 480
            VSD +   EE  LVLQ+EE    E NEEET+E+AT+NEID QK    VN  NQE+  E+
Sbjct: 421 CVSDADKEKEEANLVLQEEEREEEEDNEEETIEMATENEIDEQKDITVVNQLNQEQALEE 480

Query: 481 EQEEEEEHENNNNDEDNQPEELAEETRAIP------SHCDPELAQDAEKVESAEEEDESK 540
           ++E++        D+ NQ     +ET AIP      +HC+PE+AQD EK+ES E+E E K
Sbjct: 481 KEEDK-------TDQVNQ-----QETMAIPIPLLIQTHCEPEMAQDVEKLESVEKE-EPK 540

Query: 541 FLHGNESINEIEDDE-------EQTEEEGENGGN---PASPSLSVETERAAEDMEETEAD 600
             H +E   + E+DE       E+ EEEGENG N     SPSLSVETE  +++ E TE D
Sbjct: 541 LSHESEQDQKTEEDENLREDKEEEEEEEGENGENGETTTSPSLSVETEPVSDETE-TEVD 600

Query: 601 VNWEEEEEEEEETIHWEDREKATEEEGMRPDIGDG---GAMEE--SKERETPAAEA---- 660
           VN EEEEEEEEE          T +EG+ PD  +    G  EE  SKE ETP  E     
Sbjct: 601 VNREEEEEEEEEK---------TTDEGIGPDDENDVLVGPEEEDQSKEGETPPPEPESEP 660

Query: 661 ----KREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCVPTREKKAAAA----- 720
               K + ET VLPDCLLLMMYEPKLSMEVSKETWVCS DFIRCVPTREKKA        
Sbjct: 661 KPERKTQTETSVLPDCLLLMMYEPKLSMEVSKETWVCSADFIRCVPTREKKAIGKDPPPP 720

Query: 721 ---KKREAKAAENTQPAVVQPARWSCSFPAAAAAAAAIEQKLVRAKGYEPFVLTRCKSEP 751
              KKRE K  + TQ AVVQPARWSCSFPAAAAAAA IEQKLVRAKGYEPFVLTRCKSEP
Sbjct: 721 PPPKKRETKPTDTTQTAVVQPARWSCSFPAAAAAAAMIEQKLVRAKGYEPFVLTRCKSEP 780

BLAST of MC10g0916 vs. TAIR 10
Match: AT3G15095.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; Has 9762 Blast hits to 6439 proteins in 764 species: Archae - 77; Bacteria - 1339; Metazoa - 3211; Fungi - 718; Plants - 437; Viruses - 131; Other Eukaryotes - 3849 (source: NCBI BLink). )

HSP 1 Score: 347.4 bits (890), Expect = 2.8e-95
Identity = 313/797 (39.27%), Positives = 407/797 (51.07%), Query Frame = 0

Query: 9   TTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGSL 68
           + +NS +S++LFICFTSR SSSS+M++SSKSI SP R+       L+TSLSRRL+TSGSL
Sbjct: 17  SNNNSGSSTDLFICFTSRFSSSSSMRLSSKSIHSPARS-----ACLTTSLSRRLRTSGSL 76

Query: 69  KGGQA----SPMFPT--GAKKRGCAFDNP--------EPSSPKVTCIGQVRVKTKKQ-GK 128
           K   A    SPMF    G K+ G  ++N         EPSSPKVTCIGQVRVKT+K   K
Sbjct: 77  KNASAGVLNSPMFGANGGRKRSGSGYENSNNNNNNNIEPSSPKVTCIGQVRVKTRKHVKK 136

Query: 129 KMRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQI 188
           KMRARS  RR   E SFR+S  V     GG  +F A ++                     
Sbjct: 137 KMRARS--RRKGGENSFRRS--VDQNDGGGGCRFKASEN--------------------- 196

Query: 189 QNSQQQECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCTSDRDKESKPAAKSSE 248
                         R VHLP TICE+LR+FG+ELNCF PC SSCT +   + + A  +++
Sbjct: 197 --------------RLVHLPVTICESLRSFGSELNCFFPCRSSCTENSHGDGRRAESNND 256

Query: 249 -------SESSCGTVFARWLVAVQD-GDGQGREIELVVGDEETRMEKDNGSQRRHVFEGV 308
                    +SCG VF RW VAV++   G+ REIELVVG E+   E    S+RRHVFEG+
Sbjct: 257 GCGGGGGGSNSCGAVFTRWFVAVEETSGGKRREIELVVGGEDEVEEDRRRSRRRHVFEGL 316

Query: 309 DFKD---KSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEE 368
           D  +   K+E  E  EE  R+SIC PPKNALLLMRCRSDPVKVA LA R           
Sbjct: 317 DLSEIEMKTEKKERGEEVGRMSICSPPKNALLLMRCRSDPVKVAALANRV---------- 376

Query: 369 EEDEEDEQKDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEESVS 428
                          R+++ +  D    V    +EDE   + EL ++ K   ++ E+ +S
Sbjct: 377 ---------------RERQLSLNDG---VYTEEEEDERRRRFELEIEDKKRIDLCEKWIS 436

Query: 429 DGEEENYLVLQQEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHEN 488
                                     +  ++ +++ + V        AE E E E    +
Sbjct: 437 G-------------------------ETTVETEEVSVAV------AEAEAEAEAEAPLPS 496

Query: 489 NNNDEDNQPEELAEETRAIPSHCDPELAQDAEKV-ESAEEEDESKFLHGNESINEIEDDE 548
           N   E+ +  ++ E++         E  Q+A K+ +S EEE E+  +       +IED+ 
Sbjct: 497 NPATEEEERVKVVEDSIV-------EEEQEASKILDSFEEEIEATIM------KKIEDEI 556

Query: 549 EQTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEETIHWEDREKATEE 608
               EE E         L+   E A   + ETE     E EE +E         E+ +E+
Sbjct: 557 RNAIEEEE--------KLAEMEELAVVAVAETE-----EVEESKEVVPDCIPQNEERSEQ 616

Query: 609 EGMRPDIGDGGAMEESKERETPAAEAKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCS 668
               PD      M  S + ET   E        VLPDCLLLMM EPKLSMEVSKETWVCS
Sbjct: 617 GNREPDPSPEVVMRRSLQEETTEKEKTTATPYKVLPDCLLLMMCEPKLSMEVSKETWVCS 676

Query: 669 TDFIRCVPTR--------------------EKKAAAAKKREAKAAENT---QPAVVQPAR 728
           TDF+RC+P R                    +K+   A    A +   +    P  +QP R
Sbjct: 677 TDFVRCLPGRPPAKKIPPEAVGDNHHHHQPKKRIVTAVDSNASSRRRSIDRPPLHLQPPR 684

Query: 729 WSCSFPAA----AAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKL 752
            SCS+PAA     AAAA  EQ++  A   +P VL RCKSEP +S++KLAP+ACFWK+RKL
Sbjct: 737 SSCSYPAAPPIITAAAAVGEQRVAGANKVQPPVLPRCKSEPRKSASKLAPEACFWKNRKL 684

BLAST of MC10g0916 vs. TAIR 10
Match: AT3G15095.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; Has 9396 Blast hits to 6248 proteins in 757 species: Archae - 72; Bacteria - 1337; Metazoa - 3078; Fungi - 696; Plants - 406; Viruses - 135; Other Eukaryotes - 3672 (source: NCBI BLink). )

HSP 1 Score: 258.1 bits (658), Expect = 2.2e-68
Identity = 246/676 (36.39%), Positives = 325/676 (48.08%), Query Frame = 0

Query: 115 MRARSQKRRSNSEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQ 174
           MRARS  RR   E SFR+S  V     GG  +F A ++                      
Sbjct: 1   MRARS--RRKGGENSFRRS--VDQNDGGGGCRFKASEN---------------------- 60

Query: 175 NSQQQECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCTSDRDKESKPAAKSSE- 234
                        R VHLP TICE+LR+FG+ELNCF PC SSCT +   + + A  +++ 
Sbjct: 61  -------------RLVHLPVTICESLRSFGSELNCFFPCRSSCTENSHGDGRRAESNNDG 120

Query: 235 ------SESSCGTVFARWLVAVQD-GDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVD 294
                   +SCG VF RW VAV++   G+ REIELVVG E+   E    S+RRHVFEG+D
Sbjct: 121 CGGGGGGSNSCGAVFTRWFVAVEETSGGKRREIELVVGGEDEVEEDRRRSRRRHVFEGLD 180

Query: 295 FKD---KSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEE 354
             +   K+E  E  EE  R+SIC PPKNALLLMRCRSDPVKVA LA R            
Sbjct: 181 LSEIEMKTEKKERGEEVGRMSICSPPKNALLLMRCRSDPVKVAALANRV----------- 240

Query: 355 EDEEDEQKDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEESVSD 414
                         R+++ +  D    V    +EDE   + EL ++ K   ++ E+ +S 
Sbjct: 241 --------------RERQLSLNDG---VYTEEEEDERRRRFELEIEDKKRIDLCEKWISG 300

Query: 415 GEEENYLVLQQEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENN 474
                                    +  ++ +++ + V        AE E E E    +N
Sbjct: 301 -------------------------ETTVETEEVSVAV------AEAEAEAEAEAPLPSN 360

Query: 475 NNDEDNQPEELAEETRAIPSHCDPELAQDAEKV-ESAEEEDESKFLHGNESINEIEDDEE 534
              E+ +  ++ E++         E  Q+A K+ +S EEE E+  +       +IED+  
Sbjct: 361 PATEEEERVKVVEDSIV-------EEEQEASKILDSFEEEIEATIM------KKIEDEIR 420

Query: 535 QTEEEGENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEETIHWEDREKATEEE 594
              EE E         L+   E A   + ETE     E EE +E         E+ +E+ 
Sbjct: 421 NAIEEEE--------KLAEMEELAVVAVAETE-----EVEESKEVVPDCIPQNEERSEQG 480

Query: 595 GMRPDIGDGGAMEESKERETPAAEAKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCST 654
              PD      M  S + ET   E        VLPDCLLLMM EPKLSMEVSKETWVCST
Sbjct: 481 NREPDPSPEVVMRRSLQEETTEKEKTTATPYKVLPDCLLLMMCEPKLSMEVSKETWVCST 540

Query: 655 DFIRCVPTR--------------------EKKAAAAKKREAKAAENT---QPAVVQPARW 714
           DF+RC+P R                    +K+   A    A +   +    P  +QP R 
Sbjct: 541 DFVRCLPGRPPAKKIPPEAVGDNHHHHQPKKRIVTAVDSNASSRRRSIDRPPLHLQPPRS 552

Query: 715 SCSFPAA----AAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLE 752
           SCS+PAA     AAAA  EQ++  A   +P VL RCKSEP +S++KLAP+ACFWK+RKLE
Sbjct: 601 SCSYPAAPPIITAAAAVGEQRVAGANKVQPPVLPRCKSEPRKSASKLAPEACFWKNRKLE 552

BLAST of MC10g0916 vs. TAIR 10
Match: AT3G15095.3 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 165.2 bits (417), Expect = 2.0e-40
Identity = 174/490 (35.51%), Positives = 232/490 (47.35%), Query Frame = 0

Query: 290 KSEVVEEEEEESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDE 349
           K+E  E  EE  R+SIC PPKNALLLMRCRSDPVKVA LA R                  
Sbjct: 2   KTEKKERGEEVGRMSICSPPKNALLLMRCRSDPVKVAALANRV----------------- 61

Query: 350 QKDKEEKSRQKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEESVSDGEEENY 409
                   R+++ +  D    V    +EDE   + EL ++ K   ++ E+ +S       
Sbjct: 62  --------RERQLSLNDG---VYTEEEEDERRRRFELEIEDKKRIDLCEKWISG------ 121

Query: 410 LVLQQEEEHNEEETLEIATDNEIDVQKLDITVNHHNQEEPAEDEQEEEEEHENNNNDEDN 469
                              +  ++ +++ + V        AE E E E    +N   E+ 
Sbjct: 122 -------------------ETTVETEEVSVAV------AEAEAEAEAEAPLPSNPATEEE 181

Query: 470 QPEELAEETRAIPSHCDPELAQDAEKV-ESAEEEDESKFLHGNESINEIEDDEEQTEEEG 529
           +  ++ E++         E  Q+A K+ +S EEE E+  +       +IED+     EE 
Sbjct: 182 ERVKVVEDSIV-------EEEQEASKILDSFEEEIEATIM------KKIEDEIRNAIEEE 241

Query: 530 ENGGNPASPSLSVETERAAEDMEETEADVNWEEEEEEEEETIHWEDREKATEEEGMRPDI 589
           E         L+   E A   + ETE     E EE +E         E+ +E+    PD 
Sbjct: 242 E--------KLAEMEELAVVAVAETE-----EVEESKEVVPDCIPQNEERSEQGNREPDP 301

Query: 590 GDGGAMEESKERETPAAEAKREAETGVLPDCLLLMMYEPKLSMEVSKETWVCSTDFIRCV 649
                M  S + ET   E        VLPDCLLLMM EPKLSMEVSKETWVCSTDF+RC+
Sbjct: 302 SPEVVMRRSLQEETTEKEKTTATPYKVLPDCLLLMMCEPKLSMEVSKETWVCSTDFVRCL 361

Query: 650 PTR--------------------EKKAAAAKKREAKAAENT---QPAVVQPARWSCSFPA 709
           P R                    +K+   A    A +   +    P  +QP R SCS+PA
Sbjct: 362 PGRPPAKKIPPEAVGDNHHHHQPKKRIVTAVDSNASSRRRSIDRPPLHLQPPRSSCSYPA 406

Query: 710 A----AAAAAAIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPAT 752
           A     AAAA  EQ++  A   +P VL RCKSEP +S++KLAP+ACFWK+RKLEPH PAT
Sbjct: 422 APPIITAAAAVGEQRVAGANKVQPPVLPRCKSEPRKSASKLAPEACFWKNRKLEPHPPAT 406

BLAST of MC10g0916 vs. TAIR 10
Match: AT1G78110.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 7 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G22230.1); Has 5452 Blast hits to 3541 proteins in 289 species: Archae - 4; Bacteria - 165; Metazoa - 1756; Fungi - 532; Plants - 205; Viruses - 141; Other Eukaryotes - 2649 (source: NCBI BLink). )

HSP 1 Score: 49.7 bits (117), Expect = 1.2e-05
Identity = 108/413 (26.15%), Positives = 171/413 (41.40%), Query Frame = 0

Query: 8   RTTSNSTASSELFICFTSRLSSSSAMKISSKSILSPGRAREPSQISLSTSLSRRLKTSGS 67
           R +S+S  S++L +CF SR    + + ++ K I SP R  + S         +  K SG 
Sbjct: 9   RGSSSSGYSADLLVCFPSR----THLALTPKPICSPSRPSDSSTNRRPHHRRQLSKLSGG 68

Query: 68  LKGGQASP-MFPTGAKKRGCAFDN-PEPSSPKVTCIGQVRVKTKKQGKKMRARSQKRRSN 127
             GG  SP ++   A  +    D   EP+SPKVTC GQ++V+  K G +           
Sbjct: 69  GGGGHGSPVLWAKQASSKNMGGDEIAEPTSPKVTCAGQIKVRPSKCGGR----------- 128

Query: 128 SEASFRKSEQVQPQTNGGDQQFVAKQSHHHLHLHRQNSNSSGGNGFQIQNSQQQECLSHR 187
                           G + Q V ++      + R + N S    F ++           
Sbjct: 129 ----------------GKNWQSVMEE------IERIHDNRSQSKFFGLKKDV-------- 188

Query: 188 NQRWVHLPFTICEALRAFGAELNCFLPC-HSSCTSDRDKE------SKPAAKSSESESSC 247
                 + F  C  LR    +  CF    H+  TSD D+E       +      E E + 
Sbjct: 189 ------MGFLTC--LRNIKFDFRCFGDFRHADVTSDDDEEEDDDDDEEEEVVEGEEEENS 248

Query: 248 GTVFARWLVAVQDGDGQGREIELVVGDEETRMEKDNGSQRRHVFEGVDFKDKSEVVEEEE 307
            TVF++W + +Q              +E+   + D  + +          D+   +E+ E
Sbjct: 249 KTVFSKWFMVLQ--------------EEQNNKDDDKNNNK---------CDEKRDLEDTE 308

Query: 308 EESRISICIPPKNALLLMRCRSDPVKVAELAKRFCESPAPKVEEEEDEEDEQKDKEEKSR 367
            E      +PP NALLLMRCRS P K + L +R       KV+ E+++ +EQK+++E   
Sbjct: 309 TEP----AVPPPNALLLMRCRSAPAK-SWLEERM------KVKTEQEKREEQKEEKETED 327

Query: 368 QKEAAKMDAPLTVILSNDEDEEETKVELNVKLKNDEEMSEESVSDGEEENYLV 412
           Q+ + K        L      EE K+EL V ++ D E    S SD  +E ++V
Sbjct: 369 QETSMKTKKKDLRSLM-----EEEKMEL-VLMRYDTEFYRLS-SDIAKETWVV 327

The following BLAST results are available for this feature:
Match Name      E-value    Identity  Description
XP_022142926.1  0.0        99.85     uncharacterized protein LOC111012918, partial [Momordica charantia]
XP_038894264.1  0.0        70.71     glutamic acid-rich protein [Benincasa hispida]
XP_008456014.1  1.97e-315  68.27     PREDICTED: glutamic acid-rich protein isoform X1 [Cucumis melo]
KAG7031049.1    3.22e-314  68.70     hypothetical protein SDJN02_05088, partial [Cucurbita argyrosperma subsp. argyro...
XP_022941617.1  4.09e-313  69.51     glutamic acid-rich protein-like [Cucurbita moschata]

Match Name      E-value    Identity  Description
A0A6J1CNN3      0.0        99.85     uncharacterized protein LOC111012918 OS=Momordica charantia OX=3673 GN=LOC111012...
A0A1S3C2C2      9.54e-316  68.27     glutamic acid-rich protein isoform X1 OS=Cucumis melo OX=3656 GN=LOC103496068 PE...
A0A6J1FMZ8      1.98e-313  69.51     glutamic acid-rich protein-like OS=Cucurbita moschata OX=3662 GN=LOC111446922 PE...
A0A6J1IN35      3.85e-312  69.54     calponin homology domain-containing protein DDB_G0272472-like isoform X2 OS=Cucu...
A0A0A0L789      4.64e-312  68.50     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G236550 PE=4 SV=1

Match Name      E-value    Identity  Description
AT3G15095.1     2.8e-95    39.27     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G15095.2     2.2e-68    36.39     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G15095.3     2.0e-40    35.51     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G78110.1     1.2e-05    26.15     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
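
The Identity column above repeats the percentage printed on each HSP statistics line (for example, 563 identical positions over an alignment length of 810). As a minimal sketch, assuming the statistics lines keep the "Identity = matches/length (percent%)" layout seen in this record, the reported value can be recomputed from the counts. The names IDENTITY_RE and check_identity are hypothetical helpers, not part of any BLAST tool.

```python
import re

# Matches the identity field of an HSP statistics line as printed in this record.
# The layout is an assumption taken from the lines shown on this page.
IDENTITY_RE = re.compile(r"Identity = (\d+)/(\d+) \((\d+(?:\.\d+)?)%\)")


def check_identity(line):
    """Return (matches, aligned_length, recomputed_percent, reported_percent)."""
    m = IDENTITY_RE.search(line)
    if m is None:
        raise ValueError("no identity field found in line")
    matches = int(m.group(1))
    length = int(m.group(2))
    reported = float(m.group(3))
    # Percent identity = identical positions / alignment length (gaps included).
    recomputed = round(100.0 * matches / length, 2)
    return matches, length, recomputed, reported


if __name__ == "__main__":
    # Counts copied from the A0A6J1FMZ8 hit in this record.
    print(check_identity("Identity = 563/810 (69.51%)"))
```

Note that the denominator is the alignment length including gap columns, which is why it can exceed the length of the query protein.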
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term    Source Description                  Alignment
None      No IPR available  COILS        Coil           Coil                                coord: 339..366
None      No IPR available  COILS        Coil           Coil                                coord: 543..567
None      No IPR available  COILS        Coil           Coil                                coord: 450..470
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                 coord: 38..172
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                 coord: 130..148
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                 coord: 382..397
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                 coord: 571..611
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                 coord: 547..570
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                 coord: 450..473
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                 coord: 429..449
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                 coord: 474..514
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                 coord: 336..611
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                 coord: 38..72
None      No IPR available  PANTHER      PTHR33448:SF4  CHLOROPLAST PROTEIN HCF243          coord: 1..751
None      No IPR available  PANTHER      PTHR33448      CHLOROPLAST PROTEIN HCF243-RELATED  coord: 1..751
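
Several of the mobidb-lite disorder_prediction coordinates listed above overlap or nest inside one another, so the rows cannot simply be summed to get the disordered portion of the protein. The sketch below, assuming the coordinates are 1-based and inclusive, merges them into non-overlapping regions and counts the covered residues. The function names are hypothetical and the interval list is copied by hand from the table.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive (start, end) intervals."""
    merged = []
    for start, end in sorted(intervals):
        # "+ 1" also joins intervals that touch, e.g. 429..449 and 450..473.
        if merged and start <= merged[-1][1] + 1:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(pair) for pair in merged]


def covered_residues(intervals):
    """Count residues covered by at least one inclusive interval."""
    return sum(end - start + 1 for start, end in merge_intervals(intervals))


# mobidb-lite coordinates as listed in the InterPro table of this record.
disorder = [
    (38, 172), (130, 148), (382, 397), (571, 611), (547, 570),
    (450, 473), (429, 449), (474, 514), (336, 611), (38, 72),
]

if __name__ == "__main__":
    print(merge_intervals(disorder))
    print(covered_residues(disorder))
```

With these inputs the ten rows collapse to two regions, 38..172 and 336..611, which together span 411 residues.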

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
MC10g0916.1   MC10g0916.1  mRNA