MC04g0352 (gene) Bitter gourd (Dali-11) v1

Overview
NameMC04g0352
Typegene
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionLEA_2 domain-containing protein
LocationMC04: 2809335 .. 2813059 (-)
RNA-Seq ExpressionMC04g0352
SyntenyMC04g0352
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: utr5polypeptideCDSutr3
Hold the cursor over a type above to highlight its positions in the sequence below.
ACAGGTCTTCTACTTACCAGCTCATCAGTTGAGTCAACAGTCACTCGAACCTAACTTCGTGGACTTGGTCAATTAAAAACTTAATCTATTTCATTTTATCAATCGAGTTGATTCCAATGTCGGCCTAGGTAAAAAACCAACCCGACCCAAAGATGGAGAGATCGAGAGCGAGTTGGCAGCTTTAATTGTGAATTAATATAGGGGACATAACAGGGGACAGCAGACCGGGAACTCTTCGACATTCTTGTGCTCCCACGGAAACAAAACTAGAAAAAAAGGAAAAAAAATGTCCACGCGCTCACGCGGCTGCTAAAGCGCGTGAGCTTGCTTTCATCGATCCCGTCCCCTCCCACGCGTCAAAAACAACCCCCATTTTTCCCTGTTTCAGAAAAACAGAGAGTCGCCGGTTTCAGAGTGAGGAGGCGACGATGCACGCGAAATCGTACTCGGAGGTGACGAGCGTGGAGCAGTCGTCGCCGGCGCGATCGCCGAGGCGGCCGCTCTACTACGTGCAGAGCCCGTCGAACCACGACGTGGAGAAAATGTCGTACGGGTCGAGCCCGATGGGGTCGCCGCCGCACCACTTCTACCACGCCTCCCCCATCCACCACTCCCGCGAGTCCTCCACCTCCCGCTTCTCCGCCTCCCTCAAGCCCAACCCCCGCAACCTCGCCGCCTGGAGGAAGCTCCACCGCCCCCTCGAATCCGACGCCGACGACGACGATGACGACGCCACCGCCGCCGCCGACGACCGCGATTCCGAATGGACCCGGAAGTTCCGGCTTTACTTGTTCTTGTTCGTTTTCTTCGTACTTCTCTTCACTGTGTTCTCCCTCATCCTCTGGGGCGCCAGCAGATCCTTCCACCCCCAAATCATCCTTCAGGTACACCCCATTGCAAATTTATTTTCATTGATTTTGGTTTTCATCTCTAATTTTGACTTTTTCATCTAGTTTTACACACATTAATCGCTTATTATTATTATTATTATTTTTTATTTGTTTTTTTTTTTTGTGTGAAAAGCACATTAATGGCTTTTAGTGTTTGTGTTAACAAGAAATTGTTTTGTAGCTGGTACATTAAGTGGGAAAAAATGTATACATATTATTCTAAATATATATATATACATACACATTTTCTTTTGAATTCTAAGTTTCTAACAACATGTTGGGGGATTGGAGCTCCAAACTAATTGAACTATAGAAAATATGAATAATATTTATGTATTGAATGGTAATGAATTTTTTTAGTACAATAAATCAGAGGTATTAAGTTTGACTTTTGATCCCAAGAAAAGGAGTATGAATCTCTCTACTTGTGGTTCTAACTTAAGATAATTATTTGAAATCTAAAAATTATTTTTTTAATAATAACTTAAATGTACTTATAAGCGGTAAAAATATACGCAAAAAATTACAAGTATAATATTTAAAAATAAAAAATAAAAAATTTAAAAGAATACATTTCATAGTTTGAAGAGAAATTACAGCCACTTATCCCCAACCGGAATTCGAATCACCAAACCCTATATTGTACTAGAAAAAGAAATTAAAAAAAAAAAAGGGAAATATACTCAAATGGGATTTAAATAGGTTAGTTTATTTATTCTTTGGCAATTTCAGTACCTGAAAAGGTGGAAATTGTTATTATTAAATATATATGAAAAGCCGAAAAGTAGCATGTTTTGAGTTGGGAAAAATAAAAAAGGTGATAATATTTATATTATATTTATTGTGGAACAGAGTATGGTATTCGAGAGGTTTAACGTACAAGCGGGGAGTGATGCGGGGGGAGTGGCAACTGATCTGATGTCATTAAATTCGACGGTCAGGATCAAGTACAGAAATCCGGCCACGTTTTTCGGTGTCCACGTCAGCTCTTCCCCCATCCAGCTCAACTATTTCCAGCTCCAAATCGCCTCCGGCCAGGTATTTTTATTTTTTAATTACATTTAGGCATGGCTGGCACGTCGTATCGTATCGTAGCTCGTAGGTTTGTGTGTTTTTGTTGCGGGTTTTGGCCCTAAGGTTTCGAATATTACAAAAGGGTGGTGGGGATCGGGGAATAGTGTAAATAGTGGTTTTTATGGGGGTGAGGAATGGTAATTTGAGGAGACTTTTCAGACATTGGGAGCTGGGACGATTATAATTCGCAATCAGAAGTCTGCAAAGTCTGAGCAGTAGTAGTCTTTTTCATACTGGCAGTCTCTTTTGTAATTTTCCCGTTGGAGATAAAAAGGACGATTTTTGGAAACTGTCCTATTTTATTCTTTTTTCACATTTTAAATTATCATAGGTTCAATCTTTTTTTTTTAATCATACAGATTATTGTACCATGAATTTGGACCTTAGAATTTTTATGAGGTTTGTTTCAAGCCTTTCAAAACTTTTTTATTAGTTGTTTAATAGGCGGATAATGTAGGTTTTTATTTATTTATTTTTTTTAGTTTTATGTATATTGATGTGCTGACGTGGGTGTGTATTAGATCGATGACATGATAAAAATGATCTAAAAAGTCAAGACTTTATAGTTTTTTTCCTTCTAATTTCTCCCTCAAATATTCAAAATTAAAAAATTTTGTATATAAAAAATTCAGATGTAAAATTTGGTCCAGCCGTCCTTCATAGTTAGATATTATGTGGTAGAATAATAATAAGTATTGAAGTTTTTTTTAGTACAGATTATCGGGATGAGGATTCGAACTCTCAACTTCATAAGTTAGATATATTTACTTTAATTCGTTGAACTATGCGTTGATGGTCTAAAATAGTTAATTTTTAAGGTTAGGAACTAAAGTAAATATAAATTTAGGAATGAAAATCAGATTTAAACTAAATGCGTTATTGGAGCTTTATCTCAAAATTTTCTCCTCTCTAAGCTATTGATCCTATCTTCTTCAATCTCTATTTAAATTAGGTATAATATGTTCGTATCCTTGTTCCAACTGAATATAGGTATATAGATGTTAATTACTAATCAAACATTTTCCTTTAAAATTTTGCGATAGAGGGAGTAAAGGAAATGAAACAAAGAATCAAATAATGCTTTAGCAATGGTACGATATAGGTGGAGATCAAACGTTTAACCTTACAAGAAAGAAGTAGAGATGTTTTAACAACTGAACTTATATATATTTAAATTAGATGATGGAGTTCTACGAAAAAAGGCAGAGCTCTCGGAGGGTGGCGACAGCGGTGGCAGGGCACCAGGTGCCACTGTACGGTGGGATCGCGGTGATCGGAAACTGGAGGGAGCAGCGACAGGAGGGGGTGGAGGTGCCGCTGAACCTGACGGTGGCGGTGAGGTCAAGAGCTTACATTCTGGGGAAGCTGGTGAAGTCCACATTCCACACCACAATTACTTGCTCACTCACTCTCAGAACTAAGAATCTTGGCAAATTCCACTCTCTCAACAATTCTTGCATTTACACTTAGAAATTTTTTAGTCCCAATTGGAATTTGGAGCTCTTCACCAATTTTGGTTCCTATGATTGTGCTTCATCTCTTTGTAATGTAAGTACCTCTTGTTTGTTATTGCCTTTCTTTTAAATTACCACTTCTGTCATCTTCCTCTCTTGTATCTAAACCTTCTTTTATGTTGTTGCCTTTTGAACTTTGTATGGGAGGAGCTCCCAAACTAGTTCATTTTTGGTTATTATGTTTTTGTAAAATATCAACATTAGCCTATTCAAATTATTATTTGTCGTATAACTCAATCAATTAAGG

mRNA sequence

ACAGGTCTTCTACTTACCAGCTCATCAGTTGAGTCAACAGTCACTCGAACCTAACTTCGTGGACTTGGTCAATTAAAAACTTAATCTATTTCATTTTATCAATCGAGTTGATTCCAATGTCGGCCTAGGTAAAAAACCAACCCGACCCAAAGATGGAGAGATCGAGAGCGAGTTGGCAGCTTTAATTGTGAATTAATATAGGGGACATAACAGGGGACAGCAGACCGGGAACTCTTCGACATTCTTGTGCTCCCACGGAAACAAAACTAGAAAAAAAGGAAAAAAAATGTCCACGCGCTCACGCGGCTGCTAAAGCGCGTGAGCTTGCTTTCATCGATCCCGTCCCCTCCCACGCGTCAAAAACAACCCCCATTTTTCCCTGTTTCAGAAAAACAGAGAGTCGCCGGTTTCAGAGTGAGGAGGCGACGATGCACGCGAAATCGTACTCGGAGGTGACGAGCGTGGAGCAGTCGTCGCCGGCGCGATCGCCGAGGCGGCCGCTCTACTACGTGCAGAGCCCGTCGAACCACGACGTGGAGAAAATGTCGTACGGGTCGAGCCCGATGGGGTCGCCGCCGCACCACTTCTACCACGCCTCCCCCATCCACCACTCCCGCGAGTCCTCCACCTCCCGCTTCTCCGCCTCCCTCAAGCCCAACCCCCGCAACCTCGCCGCCTGGAGGAAGCTCCACCGCCCCCTCGAATCCGACGCCGACGACGACGATGACGACGCCACCGCCGCCGCCGACGACCGCGATTCCGAATGGACCCGGAAGTTCCGGCTTTACTTGTTCTTGTTCGTTTTCTTCGTACTTCTCTTCACTGTGTTCTCCCTCATCCTCTGGGGCGCCAGCAGATCCTTCCACCCCCAAATCATCCTTCAGAGTATGGTATTCGAGAGGTTTAACGTACAAGCGGGGAGTGATGCGGGGGGAGTGGCAACTGATCTGATGTCATTAAATTCGACGGTCAGGATCAAGTACAGAAATCCGGCCACGTTTTTCGGTGTCCACGTCAGCTCTTCCCCCATCCAGCTCAACTATTTCCAGCTCCAAATCGCCTCCGGCCAGATGATGGAGTTCTACGAAAAAAGGCAGAGCTCTCGGAGGGTGGCGACAGCGGTGGCAGGGCACCAGGTGCCACTGTACGGTGGGATCGCGGTGATCGGAAACTGGAGGGAGCAGCGACAGGAGGGGGTGGAGGTGCCGCTGAACCTGACGGTGGCGGTGAGGTCAAGAGCTTACATTCTGGGGAAGCTGGTGAAGTCCACATTCCACACCACAATTACTTGCTCACTCACTCTCAGAACTAAGAATCTTGGCAAATTCCACTCTCTCAACAATTCTTGCATTTACACTTAGAAATTTTTTAGTCCCAATTGGAATTTGGAGCTCTTCACCAATTTTGGTTCCTATGATTGTGCTTCATCTCTTTGTAATGTAAGTACCTCTTGTTTGTTATTGCCTTTCTTTTAAATTACCACTTCTGTCATCTTCCTCTCTTGTATCTAAACCTTCTTTTATGTTGTTGCCTTTTGAACTTTGTATGGGAGGAGCTCCCAAACTAGTTCATTTTTGGTTATTATGTTTTTGTAAAATATCAACATTAGCCTATTCAAATTATTATTTGTCGTATAACTCAATCAATTAAGG

Coding sequence (CDS)

ATGCACGCGAAATCGTACTCGGAGGTGACGAGCGTGGAGCAGTCGTCGCCGGCGCGATCGCCGAGGCGGCCGCTCTACTACGTGCAGAGCCCGTCGAACCACGACGTGGAGAAAATGTCGTACGGGTCGAGCCCGATGGGGTCGCCGCCGCACCACTTCTACCACGCCTCCCCCATCCACCACTCCCGCGAGTCCTCCACCTCCCGCTTCTCCGCCTCCCTCAAGCCCAACCCCCGCAACCTCGCCGCCTGGAGGAAGCTCCACCGCCCCCTCGAATCCGACGCCGACGACGACGATGACGACGCCACCGCCGCCGCCGACGACCGCGATTCCGAATGGACCCGGAAGTTCCGGCTTTACTTGTTCTTGTTCGTTTTCTTCGTACTTCTCTTCACTGTGTTCTCCCTCATCCTCTGGGGCGCCAGCAGATCCTTCCACCCCCAAATCATCCTTCAGAGTATGGTATTCGAGAGGTTTAACGTACAAGCGGGGAGTGATGCGGGGGGAGTGGCAACTGATCTGATGTCATTAAATTCGACGGTCAGGATCAAGTACAGAAATCCGGCCACGTTTTTCGGTGTCCACGTCAGCTCTTCCCCCATCCAGCTCAACTATTTCCAGCTCCAAATCGCCTCCGGCCAGATGATGGAGTTCTACGAAAAAAGGCAGAGCTCTCGGAGGGTGGCGACAGCGGTGGCAGGGCACCAGGTGCCACTGTACGGTGGGATCGCGGTGATCGGAAACTGGAGGGAGCAGCGACAGGAGGGGGTGGAGGTGCCGCTGAACCTGACGGTGGCGGTGAGGTCAAGAGCTTACATTCTGGGGAAGCTGGTGAAGTCCACATTCCACACCACAATTACTTGCTCACTCACTCTCAGAACTAAGAATCTTGGCAAATTCCACTCTCTCAACAATTCTTGCATTTACACTTAG

Protein sequence

MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIHHSRESSTSRFSASLKPNPRNLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKFRLYLFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSLNSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQVPLYGGIAVIGNWREQRQEGVEVPLNLTVAVRSRAYILGKLVKSTFHTTITCSLTLRTKNLGKFHSLNNSCIYT
Homology
BLAST of MC04g0352 vs. NCBI nr
Match: XP_022135688.1 (uncharacterized protein LOC111007587 isoform X1 [Momordica charantia] >XP_022135689.1 uncharacterized protein LOC111007587 isoform X2 [Momordica charantia])

HSP 1 Score: 607 bits (1564), Expect = 1.21e-218
Identity = 308/310 (99.35%), Postives = 308/310 (99.35%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60
           MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH
Sbjct: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60

Query: 61  HSRESSTSRFSASLKPNPRNLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKFRLY 120
           HSRESSTSRFSASLKPN RNLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKFRLY
Sbjct: 61  HSRESSTSRFSASLKPNXRNLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKFRLY 120

Query: 121 LFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSLNST 180
           LFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSLNST
Sbjct: 121 LFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSLNST 180

Query: 181 VRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQVPLY 240
           VRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQVPLY
Sbjct: 181 VRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQVPLY 240

Query: 241 GGIAVIGNWREQRQEGVEVPLNLTVAVRSRAYILGKLVKSTFHTTITCSLTLRTKNLGKF 300
           GGIAVIGNWREQRQEGVEVPLNLTVAVRSRAYILGKLVKSTFH TITCSLTLRTKNLGKF
Sbjct: 241 GGIAVIGNWREQRQEGVEVPLNLTVAVRSRAYILGKLVKSTFHXTITCSLTLRTKNLGKF 300

Query: 301 HSLNNSCIYT 310
           HSLNNSCIYT
Sbjct: 301 HSLNNSCIYT 310

BLAST of MC04g0352 vs. NCBI nr
Match: XP_038888376.1 (uncharacterized protein LOC120078225 [Benincasa hispida])

HSP 1 Score: 519 bits (1336), Expect = 7.20e-184
Identity = 262/312 (83.97%), Postives = 284/312 (91.03%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60
           MHAKSYSEVTS++QSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH
Sbjct: 1   MHAKSYSEVTSMDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60

Query: 61  HSRESSTSRFSASLKPNPR---NLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKF 120
           HSRESSTSRFSASLK NP    NL+AWRKLHRP +SD DD++DD     DDRDS+W RKF
Sbjct: 61  HSRESSTSRFSASLKNNPNRNGNLSAWRKLHRPQDSD-DDEEDDEDEENDDRDSKWNRKF 120

Query: 121 RLYLFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSL 180
           RLYLFLF+ FVLLFTVFSLILWGASRSFHPQI++QSMVFE+FNVQAGSD GGVATDLMSL
Sbjct: 121 RLYLFLFLLFVLLFTVFSLILWGASRSFHPQILIQSMVFEKFNVQAGSDPGGVATDLMSL 180

Query: 181 NSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQV 240
           NSTVRI YRNPATFFGVHVSS+P  L Y+QLQIASGQM EFY+KRQSSRRV T+VAGHQ+
Sbjct: 181 NSTVRITYRNPATFFGVHVSSTPFHLQYYQLQIASGQMEEFYQKRQSSRRVKTSVAGHQI 240

Query: 241 PLYGGIAVIGNWREQRQEGV--EVPLNLTVAVRSRAYILGKLVKSTFHTTITCSLTLRTK 300
           PLYGGI+ IGNWR+QRQ+GV  E+PLNLTVAVRSRAYILG+LVKSTFHTTITC +TL TK
Sbjct: 241 PLYGGISAIGNWRDQRQDGVGVEIPLNLTVAVRSRAYILGRLVKSTFHTTITCPITLSTK 300

Query: 301 NLGKFHSLNNSC 307
            LGKFHS NNSC
Sbjct: 301 KLGKFHSFNNSC 311

BLAST of MC04g0352 vs. NCBI nr
Match: XP_008447896.1 (PREDICTED: uncharacterized protein LOC103490245 [Cucumis melo])

HSP 1 Score: 508 bits (1309), Expect = 1.04e-179
Identity = 257/314 (81.85%), Postives = 281/314 (89.49%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60
           MHAKSYSEVTSV+QSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH
Sbjct: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60

Query: 61  HSRESSTSRFSASLKPNPR---NLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKF 120
           HSRESSTSRFSASLK N     N++AWRKLH   +SD DD++DD     +DRDS+W RKF
Sbjct: 61  HSRESSTSRFSASLKSNQNRNGNVSAWRKLHLAEDSDDDDEEDDGDEENEDRDSKWNRKF 120

Query: 121 RLYLFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSL 180
           RLYL LF+FFVLLFTVFSLILWGAS+SFHPQI++QSMVF +FNVQAGSD GGVATDLMSL
Sbjct: 121 RLYLILFLFFVLLFTVFSLILWGASKSFHPQILIQSMVFSKFNVQAGSDPGGVATDLMSL 180

Query: 181 NSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQV 240
           NSTVRI YRNPATFFGVHVSS+P QL+YFQLQIASGQM EFY+KRQSSRR+ T+VAGHQV
Sbjct: 181 NSTVRISYRNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRRMVTSVAGHQV 240

Query: 241 PLYGGIAVIGNWREQRQEGV--EVPLNLTVAVRSRAYILGKLVKSTFHTTITCSLTLRTK 300
           PLYGGI+ IGNWR+QRQ+GV  EV LNLTVAVRSRAYILG+LVKSTFHTTITC +TL TK
Sbjct: 241 PLYGGISAIGNWRDQRQDGVGVEVSLNLTVAVRSRAYILGRLVKSTFHTTITCPITLSTK 300

Query: 301 NLGKFHSLNNSCIY 309
            LGK HS NN+C Y
Sbjct: 301 KLGKSHSFNNTCTY 314

BLAST of MC04g0352 vs. NCBI nr
Match: XP_004144875.1 (uncharacterized protein LOC101215215 [Cucumis sativus] >KGN43297.1 hypothetical protein Csa_020295 [Cucumis sativus])

HSP 1 Score: 508 bits (1307), Expect = 2.10e-179
Identity = 255/314 (81.21%), Postives = 281/314 (89.49%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60
           MHAKSYSEVTSV+QSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH
Sbjct: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60

Query: 61  HSRESSTSRFSASLKPNPR---NLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKF 120
           HSRESSTSRFSASLK N     N++AWRKLH   +SD DD++DD     +DRDS+W RKF
Sbjct: 61  HSRESSTSRFSASLKINQNRNGNVSAWRKLHHAQDSDGDDEEDDEEEENEDRDSKWNRKF 120

Query: 121 RLYLFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSL 180
           RLYL LF+FF+LLFTVFSLILWGAS+SFHPQI++QSMVF +FNVQAGSD GGVATDLMSL
Sbjct: 121 RLYLILFLFFILLFTVFSLILWGASKSFHPQILIQSMVFSKFNVQAGSDPGGVATDLMSL 180

Query: 181 NSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQV 240
           NSTVRI Y+NPATFFGVHVSS+PIQL+Y QLQ+ASGQM EFY+KRQSSRRV T+VAGHQV
Sbjct: 181 NSTVRISYKNPATFFGVHVSSTPIQLHYLQLQVASGQMEEFYQKRQSSRRVVTSVAGHQV 240

Query: 241 PLYGGIAVIGNWREQRQEG--VEVPLNLTVAVRSRAYILGKLVKSTFHTTITCSLTLRTK 300
           PLYGGI+ IGNWR+QRQ+G  VEV LNLTVAVRSRAYILG+LVKSTFHTTITC +TL T 
Sbjct: 241 PLYGGISAIGNWRDQRQDGAGVEVSLNLTVAVRSRAYILGRLVKSTFHTTITCPITLSTN 300

Query: 301 NLGKFHSLNNSCIY 309
            LGK HS NN+CIY
Sbjct: 301 KLGKSHSFNNTCIY 314

BLAST of MC04g0352 vs. NCBI nr
Match: XP_022928427.1 (uncharacterized protein LOC111435243 [Cucurbita moschata])

HSP 1 Score: 501 bits (1291), Expect = 5.33e-177
Identity = 255/312 (81.73%), Postives = 277/312 (88.78%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60
           MHAKSYSEVTSV+QSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPH FYHASPIH
Sbjct: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIH 60

Query: 61  HSRESSTSRFSASLKPNPR---NLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKF 120
           HSRESSTSRFSASLK N     NL+AWRKLHRP   D ++DDDD      DRDS+W RKF
Sbjct: 61  HSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEDDDDDGDNDGDRDSKWNRKF 120

Query: 121 RLYLFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSL 180
           RLYLFLFV FVLLFTVFSLILWGAS+SFHPQI++QSMVFE+FNVQAGSD GGVATDLMSL
Sbjct: 121 RLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSL 180

Query: 181 NSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQV 240
           NSTVRI Y+NPATFFGVHVSS+P QL+YFQLQIASGQM EFY+KRQSSR+V T+V+GHQV
Sbjct: 181 NSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQV 240

Query: 241 PLYGGIAVIGNWREQRQEGVEVPLNLTVAVRSRAYILGKLVKSTFHTTITCSLTLRTKNL 300
           PLYGGI+ IGNWR+QRQ+GVEV LNLTVAVRSRAYILG+LVKSTFHT ITC +TL  K L
Sbjct: 241 PLYGGISAIGNWRDQRQDGVEVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNKKL 300

Query: 301 GKFHSLNNSCIY 309
           GK HS N +C Y
Sbjct: 301 GKSHSFNKTCTY 312

BLAST of MC04g0352 vs. ExPASy TrEMBL
Match: A0A6J1C5K4 (uncharacterized protein LOC111007587 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111007587 PE=4 SV=1)

HSP 1 Score: 607 bits (1564), Expect = 5.84e-219
Identity = 308/310 (99.35%), Postives = 308/310 (99.35%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60
           MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH
Sbjct: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60

Query: 61  HSRESSTSRFSASLKPNPRNLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKFRLY 120
           HSRESSTSRFSASLKPN RNLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKFRLY
Sbjct: 61  HSRESSTSRFSASLKPNXRNLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKFRLY 120

Query: 121 LFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSLNST 180
           LFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSLNST
Sbjct: 121 LFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSLNST 180

Query: 181 VRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQVPLY 240
           VRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQVPLY
Sbjct: 181 VRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQVPLY 240

Query: 241 GGIAVIGNWREQRQEGVEVPLNLTVAVRSRAYILGKLVKSTFHTTITCSLTLRTKNLGKF 300
           GGIAVIGNWREQRQEGVEVPLNLTVAVRSRAYILGKLVKSTFH TITCSLTLRTKNLGKF
Sbjct: 241 GGIAVIGNWREQRQEGVEVPLNLTVAVRSRAYILGKLVKSTFHXTITCSLTLRTKNLGKF 300

Query: 301 HSLNNSCIYT 310
           HSLNNSCIYT
Sbjct: 301 HSLNNSCIYT 310

BLAST of MC04g0352 vs. ExPASy TrEMBL
Match: A0A1S3BJ42 (uncharacterized protein LOC103490245 OS=Cucumis melo OX=3656 GN=LOC103490245 PE=4 SV=1)

HSP 1 Score: 508 bits (1309), Expect = 5.04e-180
Identity = 257/314 (81.85%), Postives = 281/314 (89.49%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60
           MHAKSYSEVTSV+QSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH
Sbjct: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60

Query: 61  HSRESSTSRFSASLKPNPR---NLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKF 120
           HSRESSTSRFSASLK N     N++AWRKLH   +SD DD++DD     +DRDS+W RKF
Sbjct: 61  HSRESSTSRFSASLKSNQNRNGNVSAWRKLHLAEDSDDDDEEDDGDEENEDRDSKWNRKF 120

Query: 121 RLYLFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSL 180
           RLYL LF+FFVLLFTVFSLILWGAS+SFHPQI++QSMVF +FNVQAGSD GGVATDLMSL
Sbjct: 121 RLYLILFLFFVLLFTVFSLILWGASKSFHPQILIQSMVFSKFNVQAGSDPGGVATDLMSL 180

Query: 181 NSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQV 240
           NSTVRI YRNPATFFGVHVSS+P QL+YFQLQIASGQM EFY+KRQSSRR+ T+VAGHQV
Sbjct: 181 NSTVRISYRNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRRMVTSVAGHQV 240

Query: 241 PLYGGIAVIGNWREQRQEGV--EVPLNLTVAVRSRAYILGKLVKSTFHTTITCSLTLRTK 300
           PLYGGI+ IGNWR+QRQ+GV  EV LNLTVAVRSRAYILG+LVKSTFHTTITC +TL TK
Sbjct: 241 PLYGGISAIGNWRDQRQDGVGVEVSLNLTVAVRSRAYILGRLVKSTFHTTITCPITLSTK 300

Query: 301 NLGKFHSLNNSCIY 309
            LGK HS NN+C Y
Sbjct: 301 KLGKSHSFNNTCTY 314

BLAST of MC04g0352 vs. ExPASy TrEMBL
Match: A0A0A0K4T2 (LEA_2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G018790 PE=4 SV=1)

HSP 1 Score: 508 bits (1307), Expect = 1.02e-179
Identity = 255/314 (81.21%), Postives = 281/314 (89.49%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60
           MHAKSYSEVTSV+QSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH
Sbjct: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60

Query: 61  HSRESSTSRFSASLKPNPR---NLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKF 120
           HSRESSTSRFSASLK N     N++AWRKLH   +SD DD++DD     +DRDS+W RKF
Sbjct: 61  HSRESSTSRFSASLKINQNRNGNVSAWRKLHHAQDSDGDDEEDDEEEENEDRDSKWNRKF 120

Query: 121 RLYLFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSL 180
           RLYL LF+FF+LLFTVFSLILWGAS+SFHPQI++QSMVF +FNVQAGSD GGVATDLMSL
Sbjct: 121 RLYLILFLFFILLFTVFSLILWGASKSFHPQILIQSMVFSKFNVQAGSDPGGVATDLMSL 180

Query: 181 NSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQV 240
           NSTVRI Y+NPATFFGVHVSS+PIQL+Y QLQ+ASGQM EFY+KRQSSRRV T+VAGHQV
Sbjct: 181 NSTVRISYKNPATFFGVHVSSTPIQLHYLQLQVASGQMEEFYQKRQSSRRVVTSVAGHQV 240

Query: 241 PLYGGIAVIGNWREQRQEG--VEVPLNLTVAVRSRAYILGKLVKSTFHTTITCSLTLRTK 300
           PLYGGI+ IGNWR+QRQ+G  VEV LNLTVAVRSRAYILG+LVKSTFHTTITC +TL T 
Sbjct: 241 PLYGGISAIGNWRDQRQDGAGVEVSLNLTVAVRSRAYILGRLVKSTFHTTITCPITLSTN 300

Query: 301 NLGKFHSLNNSCIY 309
            LGK HS NN+CIY
Sbjct: 301 KLGKSHSFNNTCIY 314

BLAST of MC04g0352 vs. ExPASy TrEMBL
Match: A0A6J1EJW4 (uncharacterized protein LOC111435243 OS=Cucurbita moschata OX=3662 GN=LOC111435243 PE=4 SV=1)

HSP 1 Score: 501 bits (1291), Expect = 2.58e-177
Identity = 255/312 (81.73%), Postives = 277/312 (88.78%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60
           MHAKSYSEVTSV+QSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPH FYHASPIH
Sbjct: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIH 60

Query: 61  HSRESSTSRFSASLKPNPR---NLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKF 120
           HSRESSTSRFSASLK N     NL+AWRKLHRP   D ++DDDD      DRDS+W RKF
Sbjct: 61  HSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEDDDDDGDNDGDRDSKWNRKF 120

Query: 121 RLYLFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSL 180
           RLYLFLFV FVLLFTVFSLILWGAS+SFHPQI++QSMVFE+FNVQAGSD GGVATDLMSL
Sbjct: 121 RLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSL 180

Query: 181 NSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQV 240
           NSTVRI Y+NPATFFGVHVSS+P QL+YFQLQIASGQM EFY+KRQSSR+V T+V+GHQV
Sbjct: 181 NSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQV 240

Query: 241 PLYGGIAVIGNWREQRQEGVEVPLNLTVAVRSRAYILGKLVKSTFHTTITCSLTLRTKNL 300
           PLYGGI+ IGNWR+QRQ+GVEV LNLTVAVRSRAYILG+LVKSTFHT ITC +TL  K L
Sbjct: 241 PLYGGISAIGNWRDQRQDGVEVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNKKL 300

Query: 301 GKFHSLNNSCIY 309
           GK HS N +C Y
Sbjct: 301 GKSHSFNKTCTY 312

BLAST of MC04g0352 vs. ExPASy TrEMBL
Match: A0A6J1JK28 (uncharacterized protein LOC111486495 OS=Cucurbita maxima OX=3661 GN=LOC111486495 PE=4 SV=1)

HSP 1 Score: 498 bits (1283), Expect = 4.27e-176
Identity = 253/312 (81.09%), Postives = 277/312 (88.78%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60
           MHAKSYSEVTSV+QSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPH FYHASPIH
Sbjct: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIH 60

Query: 61  HSRESSTSRFSASLKPNPR---NLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKF 120
           HSRESSTSRFSASLK N     NL+AWRKLHRP   D ++++DD      DRDS+W RKF
Sbjct: 61  HSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEEEDDDGDNDGDRDSKWNRKF 120

Query: 121 RLYLFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSL 180
           RLYLFLFV FVLLFTVFSLILWGAS+SFHPQI++QSMVFE+FNVQAGSD GGVATDLMSL
Sbjct: 121 RLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSL 180

Query: 181 NSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQV 240
           NSTVRI Y+NPATFFGVHVSS+P QL+YFQLQIASGQM EFY+KRQSSR+V T+V+GHQV
Sbjct: 181 NSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQV 240

Query: 241 PLYGGIAVIGNWREQRQEGVEVPLNLTVAVRSRAYILGKLVKSTFHTTITCSLTLRTKNL 300
           PLYGGI+ IGNWR+QRQ+GVEV LNLTVAVRSRAYILG+LVKSTFHT ITC +TL  K L
Sbjct: 241 PLYGGISAIGNWRDQRQDGVEVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNKKL 300

Query: 301 GKFHSLNNSCIY 309
           GK HS N +C Y
Sbjct: 301 GKSHSFNKTCTY 312

BLAST of MC04g0352 vs. TAIR 10
Match: AT2G41990.1 (CONTAINS InterPro DOMAIN/s: Late embryogenesis abundant protein, group 2 (InterPro:IPR004864); BEST Arabidopsis thaliana protein match is: Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (TAIR:AT4G35170.1); Has 172 Blast hits to 168 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 172; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 276.2 bits (705), Expect = 3.3e-74
Identity = 162/313 (51.76%), Postives = 209/313 (66.77%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQS--SPARSPRRPLYYVQSPSNHDVEKMSYGS--SPMGSPPH-HFYH 60
           MHAK+ SE TS++ +  SP RS  RPLYYVQSPSNHDVEKMS+GS  S MGSP H H+YH
Sbjct: 1   MHAKTDSEATSIDAAALSPPRSAIRPLYYVQSPSNHDVEKMSFGSGCSLMGSPTHPHYYH 60

Query: 61  ASPIHHSRESSTSRFSASLKPNPRNLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTR 120
            SPIHHSRESSTSRFS       R L +++ +         +D DD T   DD D    R
Sbjct: 61  CSPIHHSRESSTSRFS------DRALLSYKSIRE--RRRYINDGDDKTDGGDDDDP--FR 120

Query: 121 KFRLYLFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLM 180
             RLY++L +  + LFTVFSLILWGAS+S+ P++ ++ M+    N+QAG+D  GV TD++
Sbjct: 121 NVRLYVWLLLSVIFLFTVFSLILWGASKSYPPKVTVKGMLVRDLNLQAGNDLSGVPTDML 180

Query: 181 SLNSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGH 240
           SLNSTVRI YRNP+TFF VHV++SP+ L+Y  L ++SG+M +F   R     V T V GH
Sbjct: 181 SLNSTVRIYYRNPSTFFAVHVTASPLLLHYSNLLLSSGEMNKFTVGRNGETNVVTVVQGH 240

Query: 241 QVPLYGGIAVIGNWREQRQEGVEVPLNLTVAVRSRAYILGKLVKSTFHTTITCSLTLRTK 300
           Q+PLYGG++          + + +PLNLT+ + S+AYILG+LV S F+T I CS TL   
Sbjct: 241 QIPLYGGVSF-------HLDTLSLPLNLTIVLHSKAYILGRLVTSKFYTRIICSFTLDAN 296

Query: 301 NLGKFHSLNNSCI 309
           +L K  SL  SCI
Sbjct: 301 HLPKSISLLRSCI 296

BLAST of MC04g0352 vs. TAIR 10
Match: AT1G45688.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G42860.1); Has 258 Blast hits to 242 proteins in 39 species: Archae - 0; Bacteria - 11; Metazoa - 10; Fungi - 14; Plants - 198; Viruses - 17; Other Eukaryotes - 8 (source: NCBI BLink). )

HSP 1 Score: 229.9 bits (585), Expect = 2.7e-60
Identity = 149/342 (43.57%), Postives = 206/342 (60.23%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPS--NHDVEK--MSYGS----SPMGSPPHH 60
           MHAK+ SEVTS+  SSPARSPRRP+YYVQSPS  +HD EK   S+ S    SPMGSPPH 
Sbjct: 1   MHAKTDSEVTSLAASSPARSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLSPMGSPPHS 60

Query: 61  FYHASPIHHSRESSTSRFSASLKPNPR----NLAAWRKLHRPLESDADDDDDDATAAADD 120
             H+S   HSRESS+SRFS SLKP  R    N  + RK H   +   +    +     DD
Sbjct: 61  --HSSMGRHSRESSSSRFSGSLKPGSRKVNPNDGSKRKGHGGEKQWKECAVIEEEGLLDD 120

Query: 121 RDSEWTRKFRLYLFLFVF-FVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDA 180
            D +     R Y+  F+  F +LF  FSLIL+GA++   P+I ++S+ FE   +QAG DA
Sbjct: 121 GDRDGGVPRRCYVLAFIVGFFILFGFFSLILYGAAKPMKPKITVKSITFETLKIQAGQDA 180

Query: 181 GGVATDLMSLNSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRR 240
           GGV TD++++N+T+R+ YRN  TFFGVHV+S+PI L++ Q++I SG + +FY+ R+S R 
Sbjct: 181 GGVGTDMITMNATLRMLYRNTGTFFGVHVTSTPIDLSFSQIKIGSGSVKKFYQGRKSERT 240

Query: 241 VATAVAGHQVPLYGGIAVI-------GNWREQRQEG------------VEVPLNLTVAVR 300
           V   V G ++PLYG  + +          + ++++G              VP+ L+  VR
Sbjct: 241 VLVHVIGEKIPLYGSGSTLLPPAPPAPLPKPKKKKGAPVPIPDPPAPPAPVPMTLSFVVR 300

Query: 301 SRAYILGKLVKSTFHTTITCSLTLRTKNLGKFHSLNNSCIYT 311
           SRAY+LGKLV+  F+  I C +    KNL K   +  +C  T
Sbjct: 301 SRAYVLGKLVQPKFYKKIECDINFEHKNLNKHIVITKNCTVT 340

BLAST of MC04g0352 vs. TAIR 10
Match: AT4G35170.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 224.2 bits (570), Expect = 1.5e-58
Identity = 137/296 (46.28%), Postives = 187/296 (63.18%), Query Frame = 0

Query: 14  QSSPARSPRRPLYYVQSPSNHDVEKMSYGS--SPMGSPPHHFYHASPIHHSRESSTSRFS 73
           +SSP ++ R+P+Y V SP N DV+K+S GS  SP GSP +     S   H   + +S + 
Sbjct: 7   RSSP-QNTRKPVYVVHSPPNTDVDKISTGSGFSPFGSPLNDQGQVSNFQHHSVAESSSYP 66

Query: 74  ASLKPNPRNLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKFRLYLFLFVFFVLLF 133
            S  P  RN  +  ++H       +D+D D     D++    TR +   LF     VL F
Sbjct: 67  RSSGP-LRNEYSSVQVHDLDRRTHEDEDYDEMDGPDEKRRRITRFYSCLLFT---LVLAF 126

Query: 134 TVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSLNSTVRIKYRNPATF 193
           T+F LILWG S+SF P   L+ MV E  NVQ+G+D  GV TD+++LNSTVRI YRNPATF
Sbjct: 127 TLFCLILWGVSKSFAPIATLKEMVLENLNVQSGNDQSGVLTDMLTLNSTVRILYRNPATF 186

Query: 194 FGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQVPLYGGIAVIGNWRE 253
           F VHV+S+P+QL+Y QL +ASGQM EF ++R+S R + T V G Q+PLYGG+  +   R 
Sbjct: 187 FTVHVTSAPLQLSYSQLILASGQMGEFSQRRKSERIIETKVFGDQIPLYGGVPALFGQRA 246

Query: 254 QRQEGVEVPLNLTVAVRSRAYILGKLVKSTFHTTITCSLTLRTKNLGKFHSLNNSC 308
           +  + V +PLNLT  +R+RAY+LG+LVK+TFH+ I CS+T     LGK   L+ SC
Sbjct: 247 EPDQ-VVLPLNLTFTLRARAYVLGRLVKTTFHSNIKCSITFYGDKLGKTLDLSKSC 296

BLAST of MC04g0352 vs. TAIR 10
Match: AT5G42860.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 19 plant structures; EXPRESSED DURING: 11 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G45688.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 206.1 bits (523), Expect = 4.2e-53
Identity = 137/338 (40.53%), Postives = 190/338 (56.21%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPS--NHDVEKMSYG-------SSPMGSPPH 60
           MHAK+ SEVTS+  SSP RSPRRP Y+VQSPS  +HD EK +         +SPMGSPPH
Sbjct: 1   MHAKTDSEVTSLSASSPTRSPRRPAYFVQSPSRDSHDGEKTATSFHSTPVLTSPMGSPPH 60

Query: 61  HFYHASPIHHSRESSTSRFSASLKPNPRNLAAWRKLHRPLESDADDDDDDATAAADDRDS 120
                        SS+SRFS  +  + R   A  K    +E +   DD D    A  R  
Sbjct: 61  -----------SHSSSSRFS-KINGSKRKGHAGEKQFAMIEEEGLLDDGDREQEALPR-- 120

Query: 121 EWTRKFRLYLFLFVF-FVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGV 180
                 R Y+  F+  F LLF  FSLIL+ A++   P+I ++S+ FE+  VQAG DAGG+
Sbjct: 121 ------RCYVLAFIVGFSLLFAFFSLILYAAAKPQKPKISVKSITFEQLKVQAGQDAGGI 180

Query: 181 ATDLMSLNSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVAT 240
            TD++++N+T+R+ YRN  TFFGVHV+SSPI L++ Q+ I SG + +FY+ R+S R V  
Sbjct: 181 GTDMITMNATLRMLYRNTGTFFGVHVTSSPIDLSFSQITIGSGSIKKFYQSRKSQRTVVV 240

Query: 241 AVAGHQVPLYGGIAVI-------GNWREQRQEG-----------VEVPLNLTVAVRSRAY 300
            V G ++PLYG  + +          + ++++G             VP+ L   VRSRAY
Sbjct: 241 NVLGDKIPLYGSGSTLVPPPPPAPIPKPKKKKGPIVIVEPPAPPAPVPMRLNFTVRSRAY 300

Query: 301 ILGKLVKSTFHTTITCSLTLRTKNLGKFHSLNNSCIYT 311
           +LGKLV+  F+  I C +    K L K   + N+C  T
Sbjct: 301 VLGKLVQPKFYKRIVCLINFEHKKLSKHIPITNNCTVT 318

BLAST of MC04g0352 vs. TAIR 10
Match: AT1G45688.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G42860.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 176.4 bits (446), Expect = 3.5e-44
Identity = 113/228 (49.56%), Postives = 150/228 (65.79%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPS--NHDVEK--MSYGS----SPMGSPPHH 60
           MHAK+ SEVTS+  SSPARSPRRP+YYVQSPS  +HD EK   S+ S    SPMGSPPH 
Sbjct: 1   MHAKTDSEVTSLAASSPARSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLSPMGSPPHS 60

Query: 61  FYHASPIHHSRESSTSRFSASLKPNPR----NLAAWRKLHRPLESDADDDDDDATAAADD 120
             H+S   HSRESS+SRFS SLKP  R    N  + RK H   +   +    +     DD
Sbjct: 61  --HSSMGRHSRESSSSRFSGSLKPGSRKVNPNDGSKRKGHGGEKQWKECAVIEEEGLLDD 120

Query: 121 RDSEWTRKFRLYLFLFVF-FVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDA 180
            D +     R Y+  F+  F +LF  FSLIL+GA++   P+I ++S+ FE   +QAG DA
Sbjct: 121 GDRDGGVPRRCYVLAFIVGFFILFGFFSLILYGAAKPMKPKITVKSITFETLKIQAGQDA 180

Query: 181 GGVATDLMSLNSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQM 216
           GGV TD++++N+T+R+ YRN  TFFGVHV+S+PI L++ Q++I SG +
Sbjct: 181 GGVGTDMITMNATLRMLYRNTGTFFGVHVTSTPIDLSFSQIKIGSGSV 226

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_022135688.11.21e-21899.35uncharacterized protein LOC111007587 isoform X1 [Momordica charantia] >XP_022135... [more]
XP_038888376.17.20e-18483.97uncharacterized protein LOC120078225 [Benincasa hispida][more]
XP_008447896.11.04e-17981.85PREDICTED: uncharacterized protein LOC103490245 [Cucumis melo][more]
XP_004144875.12.10e-17981.21uncharacterized protein LOC101215215 [Cucumis sativus] >KGN43297.1 hypothetical ... [more]
XP_022928427.15.33e-17781.73uncharacterized protein LOC111435243 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
A0A6J1C5K45.84e-21999.35uncharacterized protein LOC111007587 isoform X1 OS=Momordica charantia OX=3673 G... [more]
A0A1S3BJ425.04e-18081.85uncharacterized protein LOC103490245 OS=Cucumis melo OX=3656 GN=LOC103490245 PE=... [more]
A0A0A0K4T21.02e-17981.21LEA_2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G018790 PE=4 ... [more]
A0A6J1EJW42.58e-17781.73uncharacterized protein LOC111435243 OS=Cucurbita moschata OX=3662 GN=LOC1114352... [more]
A0A6J1JK284.27e-17681.09uncharacterized protein LOC111486495 OS=Cucurbita maxima OX=3661 GN=LOC111486495... [more]
Match NameE-valueIdentityDescription
AT2G41990.13.3e-7451.76CONTAINS InterPro DOMAIN/s: Late embryogenesis abundant protein, group 2 (InterP... [more]
AT1G45688.12.7e-6043.57unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT4G35170.11.5e-5846.28Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family [more]
AT5G42860.14.2e-5340.53unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT1G45688.23.5e-4449.56unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR004864Late embryogenesis abundant protein, LEA_2 subgroupPFAMPF03168LEA_2coord: 183..289
e-value: 3.5E-9
score: 37.1
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..57
NoneNo IPR availablePANTHERPTHR31852:SF175LATE EMBRYOGENESIS ABUNDANT PROTEINcoord: 52..309
NoneNo IPR availablePANTHERPTHR31852LATE EMBRYOGENESIS ABUNDANT (LEA) HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILYcoord: 52..309

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
MC04g0352.1MC04g0352.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane