MS023563 (gene) Bitter gourd (TR) v1

Overview
Name: MS023563
Type: gene
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: protein CHUP1, chloroplastic
Location: scaffold1258: 139071 .. 141618 (-)
RNA-Seq Expression: MS023563
Synteny: MS023563
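
The Location field packs scaffold, start, end and strand into one string. As a minimal sketch, it can be split and the genomic span computed as follows. The function name parse_location is hypothetical, and the length assumes 1-based inclusive coordinates, which is common for GFF-style annotation but is not stated on this page.

```python
import re

def parse_location(text):
    """Split 'scaffold: start .. end (strand)' into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    scaffold, start, end, strand = m.groups()
    start, end = int(start), int(end)
    return {
        "scaffold": scaffold,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("scaffold1258: 139071 .. 141618 (-)")
print(loc)
```

The "(-)" strand means the gene is annotated on the reverse strand of the scaffold, so a sequence cut from the scaffold at these coordinates would need to be reverse-complemented to match the gene sequence shown on this page.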
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCGCAGGAAGAAGATGAAGAATTGGCCATGGAGATCACCAGCTTGAGAAAAGAACTGCAAATTGCTGTGGACAAATCAGATTTTCTAGAGAAAGAAAATCAAGAACTCAGACAAGAATTGGGTCGTCTCAAATCGCAGATTCAGTCTCTCAAAGCTCACAACAATGACAGAAAATCCCTTCTCTGGAAGAAATTTTACAACTCCATGGATGCAGAGTCGCCGCCGGCGACTGACAAACGGGAGGCGACCAAATCATCGCCGAAACAGCCTGTTTGGGTCGCCGTGAAAGAGAGCCAGAGAATGCCGGAGGGGGCACCGGCTCCGGCTCCGGCGCCGCCGCCGCCGCCGCTTCCGACGAAGCTGCTCGCCGGATCAAAGGCAGTGCGGCGAGTACCGGAAGTGTTGGAGCTGTACCGCTCGCTGACGAAACGAGATGCGCAAAAGGAAAACAAGGCCGCCCACGGCGGATTTCCGGCGGTGGCATTCACCAAAAATATGATCGGAGAAATCGAAAACCGGTCAGCGTATCTCACTGCGGTAAGTATTTACTATTTATTCCCCCAATGTTTTAAAGATCAAGAAATTTATTCCTCAAAAAAATAAAAAATAAAAAAAAAACTCGAGAAAATTGGATATTTTTTTAATAATGCTTTAATTCAAACTTATTCCAAATATATATCAATCCCAAACTATTTTCAACGATAATGTTTGGAAAATTACCTAGAACTTATGGGTTAAAAAACAAGCTTACTACCACTGTATCAGAGTCGACGACGTGCCAGATATTTTTCGAACGTTATTCTTGGAAATAGTTTGAGGGTTATATTTGAAATAAGTTTGAATTAAAGCATTATTAAAAAAATCTCAAGAAAATTATACTAGATGCAAAACTATTATATGTATTTTTCAAAATGTCGTTACTTTTATTTTTAAAAAAATCACAAAAATAAGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAATTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNATGCTTGACATTTAAACATTTTTTTTTTAATTTTGAGTTCGTGCCTTCTCTATTGGTGGGCGTGATAACTAATATGGTATGATAATACCAAGTTGGAAAAAAAATTAGATGAAAGAAGTTTTCAACTCCAACGAATATAACACAATTGAGAAATTTTAAAACATAAGGAGCAAAATTAAAACATTTTAAATAATTGAGATGAAATTAGAAAAAAATTACTCAAAAGGAATTAAAAGATATATTAAGCCAAAACAAATTATCAGATAATTTTCGGTTTCGTGCTTATCCTTGATATTAATGAAAGTAATAATAAAAAAAGATATCTCCGACTAAAATATCAAAACCTTGCTCCCACTTATTACATTAAAAAAAACCAATTTTAGACATAATTAGTGAATCAGCATACTTGCAAGCATTAAACGACAAAATTAAGTACTCTCCGAAACAAAAAACTCGGGCAGATAAAATCAGAGGTGGAAACTCACGGAGAGTTCGTGAACTGGCTGATCAAGGAAGTGGAAGGGGCAGCACCAAGGGACATAACAGAGGTGGAGAGGTTCGTGAACTGGCTGGACAGAGAGCTGGGGTCGCTGGTAGATGAGAGGGCAGTGCTGAAGCACTTCCCACGGTGGCCTGAGGGGAAGGCGGATGCACTGCGGGAGGCAGCATTCAGTTACAGGGACCTGAAGAGCCTGGAGAGTGAAGTATGTTCCTTCAGAGACAATCCGAAGGAGGAGATGGGTGTGGTACTGAAGAGGGCTCAGGCGCTGCAAGACAGGCGAGAATGTACAATCTGTGGGGAGCCTATTTTTGGAGGTTGAGTGATAAGTGAGAAGTGATGAGAATTTT
TTCAAATTTTGCAGGCTGGAGCAGAGTGTGAGCAATGTGGAGAAGACGAGGGAGTTCAGTTGTAACAAGTACAGAAATTTTAGAATACCCTGCGAATGGATGTTCGAATCTGGACTTGTCGGTCAGGTGATTTAAACCTTCAACGATCCAAACTCCTGCATTTTTAATAACACTTCATATTGAAATGTAACCCATAATCTCCACTTCTATTAAAACAACTTTGTTAATTTACATGCACATAATGATTCTGCCTATTGGTTGCAGATGAAGTTAAGCTCATTGAGGCTGGCCAAGGAATACATGCGAAGGATAACAAGAGAACTCCAATCAATCGATAACACGCAACAAGCAGATAATCTTCTTCTTCAAGGGGTTCGATTTGCTTACAGGGTTCACCAGGTAATAGAATGGCAAACTAATCTAATCACATGTTAGTGCACAGTGACTAACACAACTCGGAACCTATTCAGTATGCAGGCGGTTTCGATTCAGAAGCTATAGCAGCATTTGAAGAACTGAAGAAAGTTGGGCTGAGTAGTCAAAGAAAA

mRNA sequence

ATGCCGCAGGAAGAAGATGAAGAATTGGCCATGGAGATCACCAGCTTGAGAAAAGAACTGCAAATTGCTGTGGACAAATCAGATTTTCTAGAGAAAGAAAATCAAGAACTCAGACAAGAATTGGGTCGTCTCAAATCGCAGATTCAGTCTCTCAAAGCTCACAACAATGACAGAAAATCCCTTCTCTGGAAGAAATTTTACAACTCCATGGATGCAGAGTCGCCGCCGGCGACTGACAAACGGGAGGCGACCAAATCATCGCCGAAACAGCCTGTTTGGGTCGCCGTGAAAGAGAGCCAGAGAATGCCGGAGGGGGCACCGGCTCCGGCTCCGGCGCCGCCGCCGCCGCCGCTTCCGACGAAGCTGCTCGCCGGATCAAAGGCAGTGCGGCGAGTACCGGAAGTGTTGGAGCTGTACCGCTCGCTGACGAAACGAGATGCGCAAAAGGAAAACAAGGCCGCCCACGGCGGATTTCCGGCGGTGGCATTCACCAAAAATATGATCGGAGAAATCGAAAACCGGTCAGCGTATCTCACTGCGATAAAATCAGAGGTGGAAACTCACGGAGAGTTCGTGAACTGGCTGATCAAGGAAGTGGAAGGGGCAGCACCAAGGGACATAACAGAGGTGGAGAGGTTCGTGAACTGGCTGGACAGAGAGCTGGGGTCGCTGGTAGATGAGAGGGCAGTGCTGAAGCACTTCCCACGGTGGCCTGAGGGGAAGGCGGATGCACTGCGGGAGGCAGCATTCAGTTACAGGGACCTGAAGAGCCTGGAGAGTGAAGTATGTTCCTTCAGAGACAATCCGAAGGAGGAGATGGGTGTGGTACTGAAGAGGGCTCAGGCGCTGCAAGACAGGCGAGAATGTACAATCTTGAGAAGTGATGAGAATTTTTTCAAATTTTGCAGGCTGGAGCAGAGTGTGAGCAATGTGGAGAAGACGAGGGAGTTCAGTTGTAACAAGTACAGAAATTTTAGAATACCCTGCGAATGGATGTTCGAATCTGGACTTGTCGGTCAGATGAAGTTAAGCTCATTGAGGCTGGCCAAGGAATACATGCGAAGGATAACAAGAGAACTCCAATCAATCGATAACACGCAACAAGCAGATAATCTTCTTCTTCAAGGGGTTCGATTTGCTTACAGGGTTCACCAGTATGCAGGCGGTTTCGATTCAGAAGCTATAGCAGCATTTGAAGAACTGAAGAAAGTTGGGCTGAGTAGTCAAAGAAAA

Coding sequence (CDS)

ATGCCGCAGGAAGAAGATGAAGAATTGGCCATGGAGATCACCAGCTTGAGAAAAGAACTGCAAATTGCTGTGGACAAATCAGATTTTCTAGAGAAAGAAAATCAAGAACTCAGACAAGAATTGGGTCGTCTCAAATCGCAGATTCAGTCTCTCAAAGCTCACAACAATGACAGAAAATCCCTTCTCTGGAAGAAATTTTACAACTCCATGGATGCAGAGTCGCCGCCGGCGACTGACAAACGGGAGGCGACCAAATCATCGCCGAAACAGCCTGTTTGGGTCGCCGTGAAAGAGAGCCAGAGAATGCCGGAGGGGGCACCGGCTCCGGCTCCGGCGCCGCCGCCGCCGCCGCTTCCGACGAAGCTGCTCGCCGGATCAAAGGCAGTGCGGCGAGTACCGGAAGTGTTGGAGCTGTACCGCTCGCTGACGAAACGAGATGCGCAAAAGGAAAACAAGGCCGCCCACGGCGGATTTCCGGCGGTGGCATTCACCAAAAATATGATCGGAGAAATCGAAAACCGGTCAGCGTATCTCACTGCGATAAAATCAGAGGTGGAAACTCACGGAGAGTTCGTGAACTGGCTGATCAAGGAAGTGGAAGGGGCAGCACCAAGGGACATAACAGAGGTGGAGAGGTTCGTGAACTGGCTGGACAGAGAGCTGGGGTCGCTGGTAGATGAGAGGGCAGTGCTGAAGCACTTCCCACGGTGGCCTGAGGGGAAGGCGGATGCACTGCGGGAGGCAGCATTCAGTTACAGGGACCTGAAGAGCCTGGAGAGTGAAGTATGTTCCTTCAGAGACAATCCGAAGGAGGAGATGGGTGTGGTACTGAAGAGGGCTCAGGCGCTGCAAGACAGGCGAGAATGTACAATCTTGAGAAGTGATGAGAATTTTTTCAAATTTTGCAGGCTGGAGCAGAGTGTGAGCAATGTGGAGAAGACGAGGGAGTTCAGTTGTAACAAGTACAGAAATTTTAGAATACCCTGCGAATGGATGTTCGAATCTGGACTTGTCGGTCAGATGAAGTTAAGCTCATTGAGGCTGGCCAAGGAATACATGCGAAGGATAACAAGAGAACTCCAATCAATCGATAACACGCAACAAGCAGATAATCTTCTTCTTCAAGGGGTTCGATTTGCTTACAGGGTTCACCAGTATGCAGGCGGTTTCGATTCAGAAGCTATAGCAGCATTTGAAGAACTGAAGAAAGTTGGGCTGAGTAGTCAAAGAAAA

Protein sequence

MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILRSDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSEAIAAFEELKKVGLSSQRK
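
The protein sequence above is the conceptual translation of the CDS. The sketch below shows how such a translation can be reproduced, assuming the standard nuclear genetic code; the function name translate is hypothetical. Only the first 30 bases of the CDS are used here to keep the example short.

```python
# Standard genetic code, with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon.

    Codons containing characters other than A/C/G/T (for example N)
    are reported as 'X'; a trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        protein.append(CODON_TABLE.get(cds[pos:pos + 3], "X"))
    return "".join(protein)

# First 30 bases of the CDS listed above.
cds_start = "ATGCCGCAGGAAGAAGATGAAGAATTGGCC"
peptide = translate(cds_start)
print(peptide)
```

Running the same function over the full CDS should give a protein of the same length as the one listed, since the CDS shown here ends on a lysine codon (AAA) rather than a stop codon.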
Homology
BLAST of MS023563 vs. NCBI nr
Match: XP_022150972.1 (protein CHUP1, chloroplastic [Momordica charantia])

HSP 1 Score: 751.5 bits (1939), Expect = 3.8e-213
Identity = 392/411 (95.38%), Positives = 393/411 (95.62%), Query Frame = 0

Query: 1   MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKS 60
           MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKS
Sbjct: 1   MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKS 60

Query: 61  LLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQRMPEGAPAPAPAPPPPPLPT 120
           LLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQRMPEGAPAPAPAPPPPPLPT
Sbjct: 61  LLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQRMPEGAPAPAPAPPPPPLPT 120

Query: 121 KLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTA 180
           KLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTA
Sbjct: 121 KLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTA 180

Query: 181 IKSEVETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEG 240
           IKSEVETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEG
Sbjct: 181 IKSEVETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEG 240

Query: 241 KADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILRSDENFFK 300
           KADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQD               
Sbjct: 241 KADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQD--------------- 300

Query: 301 FCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITREL 360
             RLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITREL
Sbjct: 301 --RLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITREL 360

Query: 361 QSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSEAIAAFEELKKVGLSSQRK 412
           QSIDNTQQADNLLLQGVRFAYRVHQYAGGFDS+AIAAFE LKKVGLSSQRK
Sbjct: 361 QSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLSSQRK 394
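
The percentages in each HSP header are the match counts divided by the alignment length, which includes gap columns. A minimal sketch of that arithmetic, using the header of the hit above, is given below. The function name hsp_percentages is hypothetical, and the regular expression accepts both "Positives" and the "Postives" spelling that some exports of this report use.

```python
import re

def hsp_percentages(header):
    """Recompute identity and positive percentages from an HSP header line."""
    ident = re.search(r"Identity = (\d+)/(\d+)", header)
    posit = re.search(r"Posi?tives = (\d+)/(\d+)", header)
    if ident is None or posit is None:
        raise ValueError("header does not contain Identity/Positives counts")
    id_n, id_len = map(int, ident.groups())
    pos_n, pos_len = map(int, posit.groups())
    return round(100 * id_n / id_len, 2), round(100 * pos_n / pos_len, 2)

header = "Identity = 392/411 (95.38%), Positives = 393/411 (95.62%), Query Frame = 0"
identity_pct, positives_pct = hsp_percentages(header)
print(identity_pct, positives_pct)
```

Because the denominator is the aligned length rather than the full query length, percentages from different hits are only comparable when the hits cover similar portions of the query.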

BLAST of MS023563 vs. NCBI nr
Match: XP_023523072.1 (protein CHUP1, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo] >XP_023523080.1 protein CHUP1, chloroplastic isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 584.7 bits (1506), Expect = 6.1e-163
Identity = 313/421 (74.35%), Positives = 348/421 (82.66%), Query Frame = 0

Query: 1   MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKS 60
           MP EEDEELAMEI +L++EL+I++ KS+FLEKENQEL+QEL R KS IQSLKAHNNDRKS
Sbjct: 1   MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHIQSLKAHNNDRKS 60

Query: 61  LLWKKFYNSMDA---------ESPPATDKREATKSSPKQPVWVAVKESQRMPEGAPAPAP 120
           +LWKKF+NSMD          +SPPATDK E T++  KQ  W  VKE+QRM   AP PAP
Sbjct: 61  ILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQ-KQSNWAVVKENQRMAAAAPTPAP 120

Query: 121 APPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEI 180
            PPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GGFPAVAFTKNMIGEI
Sbjct: 121 -PPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEI 180

Query: 181 ENRSAYLTAIKSEVETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVL 240
           ENRSAYL+AIKSEVETHGEFVN LI+EVE AAPRDI EVERFV WLD ELGSLVDERAVL
Sbjct: 181 ENRSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELGSLVDERAVL 240

Query: 241 KHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTI 300
           KHFPRWPEGKADALREAAFSY+DLKSLE+EVCSFR+NPKEE   +LKRAQALQD      
Sbjct: 241 KHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQD------ 300

Query: 301 LRSDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKE 360
                      RLEQSVSNVE+TREF+C KY  F+IPC+WM +SGL  QMKLSSLRL KE
Sbjct: 301 -----------RLEQSVSNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKE 360

Query: 361 YMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSEAIAAFEELKKVGLS-SQR 412
            MRRIT+E+Q ++ T Q +NL LQGVRFAYRVHQYAGGFDSEAI AFE +K+VGL  +QR
Sbjct: 361 CMRRITKEIQ-LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQR 401

BLAST of MS023563 vs. NCBI nr
Match: XP_022998607.1 (protein CHUP1, chloroplastic [Cucurbita maxima])

HSP 1 Score: 581.6 bits (1498), Expect = 5.2e-162
Identity = 312/421 (74.11%), Positives = 346/421 (82.19%), Query Frame = 0

Query: 1   MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKS 60
           MP EEDEELAMEI +L++EL+I++ KS+FLEKENQEL+QEL R KS +QSLK HNNDRKS
Sbjct: 1   MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKS 60

Query: 61  LLWKKFYNSMDA---------ESPPATDKREATKSSPKQPVWVAVKESQRMPEGAPAPAP 120
           +LWKKF+NSMD          +SPPATDK E T++  KQ  W  VKE+QRM   AP PAP
Sbjct: 61  ILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQ-KQSNWAVVKENQRMAAAAPTPAP 120

Query: 121 APPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEI 180
            PPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKA +GGFPAVAFTKNMIGEI
Sbjct: 121 -PPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEI 180

Query: 181 ENRSAYLTAIKSEVETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVL 240
           ENRSAYL+AIKSEVETHGEFVN LI+EVE AAPRDI EVERFV WLD EL SLVDERAVL
Sbjct: 181 ENRSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVL 240

Query: 241 KHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTI 300
           KHFPRWPEGKADALREAAFSY+DLKSLE+EVCSFR+NPKEE   +LKRAQALQD      
Sbjct: 241 KHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQD------ 300

Query: 301 LRSDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKE 360
                      RLEQSVSNVE+TREF+CNKY  F+IPC+WM +SGL  QMKLSSLRL KE
Sbjct: 301 -----------RLEQSVSNVERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKE 360

Query: 361 YMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSEAIAAFEELKKVG-LSSQR 412
            MRRIT+ELQ ++ T Q +NL LQGVRFAYRVHQYAGGFDSEAI AFE +K+VG L SQR
Sbjct: 361 CMRRITKELQ-LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLLLSQR 401

BLAST of MS023563 vs. NCBI nr
Match: KAG6607325.1 (Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 580.1 bits (1494), Expect = 1.5e-161
Identity = 310/419 (73.99%), Positives = 345/419 (82.34%), Query Frame = 0

Query: 1   MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKS 60
           MP EEDEELAMEI +L++EL+I++ KS+FLEKENQEL+QEL R KS +QSLK HNNDRKS
Sbjct: 1   MPMEEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKS 60

Query: 61  LLWKKFYNSMDA-------ESPPATDKREATKSSPKQPVWVAVKESQRMPEGAPAPAPAP 120
           +LWKKF+NSMD        +SPPATDK E T++  KQ  W  VKE+QRM   AP PAP P
Sbjct: 61  ILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQ-KQSNWAVVKENQRMAAAAPTPAP-P 120

Query: 121 PPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIEN 180
           PPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GGFPAVAFTKNMIGEIEN
Sbjct: 121 PPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIEN 180

Query: 181 RSAYLTAIKSEVETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKH 240
           RSAYL+AIKSEVETHGEFVN LI+EVE AAPRDI EVERFV WLD EL SLVDERAVLKH
Sbjct: 181 RSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKH 240

Query: 241 FPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILR 300
           FPRWPEGKADALREAAFSY+DLKSLE EVCSFR+NPKEE   +LKRAQALQD        
Sbjct: 241 FPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQD-------- 300

Query: 301 SDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYM 360
                    RLEQSVSNVE+TREF+C KY  F+IPC+WM +SGL  QMKLSSLRL KE M
Sbjct: 301 ---------RLEQSVSNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECM 360

Query: 361 RRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSEAIAAFEELKKVGLS-SQRK 412
           RRIT+E+Q ++ T Q +NL LQGVRFAYRVHQYAGGFDSEAI AFE +K+VGL  +QRK
Sbjct: 361 RRITKEVQ-LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK 399

BLAST of MS023563 vs. NCBI nr
Match: XP_022948306.1 (protein CHUP1, chloroplastic [Cucurbita moschata])

HSP 1 Score: 577.4 bits (1487), Expect = 9.8e-161
Identity = 311/419 (74.22%), Positives = 343/419 (81.86%), Query Frame = 0

Query: 1   MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKS 60
           MP EEDEELAMEI +L++EL+I++ KS FLEKENQEL+QEL R KS I SLKAHNNDRKS
Sbjct: 1   MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKS 60

Query: 61  LLWKKFYNSMDA-------ESPPATDKREATKSSPKQPVWVAVKESQRMPEGAPAPAPAP 120
           +LWKKF+NSMD        +SPPATDK E T++  KQ  W  VKE+QRM   AP PAP P
Sbjct: 61  ILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQ-KQSNWAVVKENQRMAAAAPTPAP-P 120

Query: 121 PPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIEN 180
           PPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GGFPAVAFTKNMIGEIEN
Sbjct: 121 PPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIEN 180

Query: 181 RSAYLTAIKSEVETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKH 240
           RSAYL+AIKSEVETHGEFVN LI+EVE AAPRDI EVERFV WLD EL SLVDERAVLKH
Sbjct: 181 RSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKH 240

Query: 241 FPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILR 300
           FPRWPEGKADALREAAFSY+DLKSLE EVCSFR+NPKEE   +LKRAQALQD        
Sbjct: 241 FPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQD-------- 300

Query: 301 SDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYM 360
                    RLEQSVSNVE+TREF+C KY  F+IPC+WM +SGL  QMKLSSLRL KE M
Sbjct: 301 ---------RLEQSVSNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECM 360

Query: 361 RRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSEAIAAFEELKKVGLS-SQRK 412
           RRIT+E Q ++ T Q +NL LQGVRFAYRVHQYAGGFDSEAI AFE +K+VGL  +QRK
Sbjct: 361 RRITKEKQ-LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK 399

BLAST of MS023563 vs. ExPASy Swiss-Prot
Match: Q9LI74 (Protein CHUP1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CHUP1 PE=1 SV=1)

HSP 1 Score: 260.0 bits (663), Expect = 4.5e-68
Identity = 148/312 (47.44%), Positives = 202/312 (64.74%), Query Frame = 0

Query: 103 PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKA 162
           P G P P P    PPPPP P  L     G   V R PE++E Y+SL KR+++KE   +  
Sbjct: 686 PGGGPPPPPGGGPPPPPPPPGALGRGAGGGNKVHRAPELVEFYQSLMKRESKKEGAPSLI 745

Query: 163 AHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVEGAAPRDITEVERF 222
           + G   + A   NMIGEIENRS +L A+K++VET G+FV  L  EV  ++  DI ++  F
Sbjct: 746 SSGTGNSSAARNNMIGEIENRSTFLLAVKADVETQGDFVQSLATEVRASSFTDIEDLLAF 805

Query: 223 VNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEM 282
           V+WLD EL  LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF D+P    
Sbjct: 806 VSWLDEELSFLVDERAVLKHFD-WPEGKADALREAAFEYQDLMKLEKQVTSFVDDPNLSC 865

Query: 283 GVVLKRAQALQDRRECTILRSDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMF 342
              LK+   L +                 ++EQSV  + +TR+ + ++Y+ F IP +W+ 
Sbjct: 866 EPALKKMYKLLE-----------------KVEQSVYALLRTRDMAISRYKEFGIPVDWLS 925

Query: 343 ESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGF 402
           ++G+VG++KLSS++LAK+YM+R+  EL S+  + +  N   LLLQGVRFA+RVHQ+AGGF
Sbjct: 926 DTGVVGKIKLSSVQLAKKYMKRVAYELDSVSGSDKDPNREFLLLQGVRFAFRVHQFAGGF 979

BLAST of MS023563 vs. ExPASy TrEMBL
Match: A0A6J1DC83 (protein CHUP1, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111018994 PE=4 SV=1)

HSP 1 Score: 751.5 bits (1939), Expect = 1.8e-213
Identity = 392/411 (95.38%), Positives = 393/411 (95.62%), Query Frame = 0

Query: 1   MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKS 60
           MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKS
Sbjct: 1   MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKS 60

Query: 61  LLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQRMPEGAPAPAPAPPPPPLPT 120
           LLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQRMPEGAPAPAPAPPPPPLPT
Sbjct: 61  LLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQRMPEGAPAPAPAPPPPPLPT 120

Query: 121 KLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTA 180
           KLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTA
Sbjct: 121 KLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTA 180

Query: 181 IKSEVETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEG 240
           IKSEVETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEG
Sbjct: 181 IKSEVETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEG 240

Query: 241 KADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILRSDENFFK 300
           KADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQD               
Sbjct: 241 KADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQD--------------- 300

Query: 301 FCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITREL 360
             RLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITREL
Sbjct: 301 --RLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITREL 360

Query: 361 QSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSEAIAAFEELKKVGLSSQRK 412
           QSIDNTQQADNLLLQGVRFAYRVHQYAGGFDS+AIAAFE LKKVGLSSQRK
Sbjct: 361 QSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLSSQRK 394

BLAST of MS023563 vs. ExPASy TrEMBL
Match: A0A6J1K8G4 (protein CHUP1, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111493194 PE=4 SV=1)

HSP 1 Score: 581.6 bits (1498), Expect = 2.5e-162
Identity = 312/421 (74.11%), Positives = 346/421 (82.19%), Query Frame = 0

Query: 1   MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKS 60
           MP EEDEELAMEI +L++EL+I++ KS+FLEKENQEL+QEL R KS +QSLK HNNDRKS
Sbjct: 1   MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKS 60

Query: 61  LLWKKFYNSMDA---------ESPPATDKREATKSSPKQPVWVAVKESQRMPEGAPAPAP 120
           +LWKKF+NSMD          +SPPATDK E T++  KQ  W  VKE+QRM   AP PAP
Sbjct: 61  ILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQ-KQSNWAVVKENQRMAAAAPTPAP 120

Query: 121 APPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEI 180
            PPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKA +GGFPAVAFTKNMIGEI
Sbjct: 121 -PPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEI 180

Query: 181 ENRSAYLTAIKSEVETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVL 240
           ENRSAYL+AIKSEVETHGEFVN LI+EVE AAPRDI EVERFV WLD EL SLVDERAVL
Sbjct: 181 ENRSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVL 240

Query: 241 KHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTI 300
           KHFPRWPEGKADALREAAFSY+DLKSLE+EVCSFR+NPKEE   +LKRAQALQD      
Sbjct: 241 KHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQD------ 300

Query: 301 LRSDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKE 360
                      RLEQSVSNVE+TREF+CNKY  F+IPC+WM +SGL  QMKLSSLRL KE
Sbjct: 301 -----------RLEQSVSNVERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKE 360

Query: 361 YMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSEAIAAFEELKKVG-LSSQR 412
            MRRIT+ELQ ++ T Q +NL LQGVRFAYRVHQYAGGFDSEAI AFE +K+VG L SQR
Sbjct: 361 CMRRITKELQ-LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLLLSQR 401

BLAST of MS023563 vs. ExPASy TrEMBL
Match: A0A6J1G8X0 (protein CHUP1, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111452021 PE=4 SV=1)

HSP 1 Score: 577.4 bits (1487), Expect = 4.7e-161
Identity = 311/419 (74.22%), Positives = 343/419 (81.86%), Query Frame = 0

Query: 1   MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKS 60
           MP EEDEELAMEI +L++EL+I++ KS FLEKENQEL+QEL R KS I SLKAHNNDRKS
Sbjct: 1   MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKS 60

Query: 61  LLWKKFYNSMDA-------ESPPATDKREATKSSPKQPVWVAVKESQRMPEGAPAPAPAP 120
           +LWKKF+NSMD        +SPPATDK E T++  KQ  W  VKE+QRM   AP PAP P
Sbjct: 61  ILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQ-KQSNWAVVKENQRMAAAAPTPAP-P 120

Query: 121 PPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIEN 180
           PPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GGFPAVAFTKNMIGEIEN
Sbjct: 121 PPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIEN 180

Query: 181 RSAYLTAIKSEVETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKH 240
           RSAYL+AIKSEVETHGEFVN LI+EVE AAPRDI EVERFV WLD EL SLVDERAVLKH
Sbjct: 181 RSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKH 240

Query: 241 FPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILR 300
           FPRWPEGKADALREAAFSY+DLKSLE EVCSFR+NPKEE   +LKRAQALQD        
Sbjct: 241 FPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQD-------- 300

Query: 301 SDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYM 360
                    RLEQSVSNVE+TREF+C KY  F+IPC+WM +SGL  QMKLSSLRL KE M
Sbjct: 301 ---------RLEQSVSNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECM 360

Query: 361 RRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSEAIAAFEELKKVGLS-SQRK 412
           RRIT+E Q ++ T Q +NL LQGVRFAYRVHQYAGGFDSEAI AFE +K+VGL  +QRK
Sbjct: 361 RRITKEKQ-LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK 399

BLAST of MS023563 vs. ExPASy TrEMBL
Match: A0A0A0LVK7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G532360 PE=4 SV=1)

HSP 1 Score: 573.9 bits (1478), Expect = 5.2e-160
Identity = 312/422 (73.93%), Positives = 345/422 (81.75%), Query Frame = 0

Query: 1   MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKS 60
           MP+EEDE LAMEI  L+KEL+I++ KS FLEKENQELRQEL RL+SQIQS KA NN+RKS
Sbjct: 1   MPKEEDEVLAMEINCLKKELEISLQKSIFLEKENQELRQELNRLRSQIQSFKAQNNERKS 60

Query: 61  LLWKKFYNSMD-----AESPP------ATDKREATKSSPKQPVWVAVKESQRMPEGAPAP 120
           +LWKKF++S+D     A+SPP      A DKRE+TK SPKQ  W  VKES RM  G PA 
Sbjct: 61  ILWKKFHSSIDISVAGADSPPLSPATVAGDKRESTK-SPKQSSWDDVKESHRM-TGVPAS 120

Query: 121 APAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIG 180
            P PPPPPLPTKLL GSKAVRRVPEVLELYR+LTKRDAQKENK AHGG PAVAFTKNMIG
Sbjct: 121 PPPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVAHGGAPAVAFTKNMIG 180

Query: 181 EIENRSAYLTAIKSEVETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERA 240
           EIENRSAYL+AIKSEVETHG+FVNWLIKEVE  APRDI+EVERFV WLD +L SLVDERA
Sbjct: 181 EIENRSAYLSAIKSEVETHGDFVNWLIKEVETIAPRDISEVERFVKWLDGKLASLVDERA 240

Query: 241 VLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRREC 300
           VLK+FPRWPE KADALREAAFSYRDLK LES+VC FRDNPKEEM VVLKRAQALQD    
Sbjct: 241 VLKYFPRWPEAKADALREAAFSYRDLKGLESKVCMFRDNPKEEMNVVLKRAQALQD---- 300

Query: 301 TILRSDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLA 360
                        R+EQSVSN+E+TREF+C KY+ F+IPC+WMF+S L  Q+K+S+LRLA
Sbjct: 301 -------------RVEQSVSNMERTREFNCRKYQAFQIPCQWMFDSALPTQIKMSTLRLA 360

Query: 361 KEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSEAIAAFEELKKVGLSSQ 412
           KEYM RITRELQS + T Q +NL LQG RFAYRVHQYAGGFDSE I AFE LKK GLSSQ
Sbjct: 361 KEYMIRITRELQSTE-TPQRENLFLQGARFAYRVHQYAGGFDSETIEAFEGLKKAGLSSQ 402

BLAST of MS023563 vs. ExPASy TrEMBL
Match: A0A1S3C4V9 (protein CHUP1, chloroplastic isoform X1 OS=Cucumis melo OX=3656 GN=LOC103497059 PE=4 SV=1)

HSP 1 Score: 572.8 bits (1475), Expect = 1.2e-159
Identity = 313/422 (74.17%), Positives = 346/422 (81.99%), Query Frame = 0

Query: 1   MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKS 60
           MP+E+DEELAMEI  L+K+L+I++ KS FLE+ENQELR EL RLKSQIQSLKA NN+RKS
Sbjct: 1   MPKEKDEELAMEIDCLKKDLEISLQKSIFLERENQELRLELNRLKSQIQSLKALNNERKS 60

Query: 61  LLWKKFYNSMD-----AESPP------ATDKREATKSSPKQPVWVAVKESQRMPEGAPAP 120
           +LWKKF++SMD     A+SPP      A DKRE TK  PKQ  W  VKESQRM    PA 
Sbjct: 61  ILWKKFHSSMDMAVAGADSPPLNPATAAGDKREVTK-FPKQSSWDDVKESQRM-TAVPAS 120

Query: 121 APAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIG 180
           AP PPPPPLP KLL GSKAVRRVPEVL+LYR+LTKRDAQKENK AHGG P VAFTKNMIG
Sbjct: 121 APPPPPPPLPKKLLGGSKAVRRVPEVLDLYRTLTKRDAQKENKVAHGGAPVVAFTKNMIG 180

Query: 181 EIENRSAYLTAIKSEVETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERA 240
           EIENRSAYL+AIKSEVETHGEFVNWLIKEVE  APRDI+E E+FV WLD +L SLVDERA
Sbjct: 181 EIENRSAYLSAIKSEVETHGEFVNWLIKEVEMIAPRDISEAEKFVKWLDVKLASLVDERA 240

Query: 241 VLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRREC 300
           VLKHFPRWPE KADALREAAFSYRDLKSLES+VC FRDNPKEEM VVLKRAQALQD    
Sbjct: 241 VLKHFPRWPEAKADALREAAFSYRDLKSLESKVCMFRDNPKEEMNVVLKRAQALQD---- 300

Query: 301 TILRSDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLA 360
                        R+EQSVSN+E+TREF+C KY+ F+IPC+WMF+S L  Q+KLS+LRLA
Sbjct: 301 -------------RVEQSVSNMERTREFNCKKYQAFQIPCQWMFDSALPTQIKLSTLRLA 360

Query: 361 KEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSEAIAAFEELKKVGLSSQ 412
           KEYM RITREL+S + T QA+NL LQGVRFAYRVHQYAGGFDSEAI AFE LKK GLSSQ
Sbjct: 361 KEYMIRITRELRSTE-TSQAENLFLQGVRFAYRVHQYAGGFDSEAIEAFEGLKKAGLSSQ 402

BLAST of MS023563 vs. TAIR 10
Match: AT1G07120.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast envelope; EXPRESSED IN: inflorescence meristem, petal, leaf whorl, flower; EXPRESSED DURING: 4 anthesis, petal differentiation and expansion stage; BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT4G18570.1); Has 288 Blast hits to 260 proteins in 50 species: Archae - 0; Bacteria - 8; Metazoa - 27; Fungi - 15; Plants - 163; Viruses - 0; Other Eukaryotes - 75 (source: NCBI BLink). )

HSP 1 Score: 362.5 bits (929), Expect = 4.6e-100
Identity = 199/414 (48.07%), Positives = 278/414 (67.15%), Query Frame = 0

Query: 1   MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKS 60
           +P  ED+    ++  L KELQ  + ++D LEKEN ELRQE+ RL++Q+ +LK+H N+RKS
Sbjct: 2   LPNGEDDS---DLLRLVKELQAYLVRNDKLEKENHELRQEVARLRAQVSNLKSHENERKS 61

Query: 61  LLWKKFYNSMDAESPPATDKR--EATKSSPKQPVWVAVKESQRMP--EGAPAPAPAPPPP 120
           +LWKK  +S D  +   ++ +  E+ KS+ K      V+     P  +G       PPPP
Sbjct: 62  MLWKKLQSSYDGSNTDGSNLKAPESVKSNTKGQ---EVRNPNPKPTIQGQSTATKPPPPP 121

Query: 121 PLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSA 180
           PLP+K   G ++VRR PEV+E YR+LTKR++   NK    G  + AF +NMIGEIENRS 
Sbjct: 122 PLPSKRTLGKRSVRRAPEVVEFYRALTKRESHMGNKINQNGVLSPAFNRNMIGEIENRSK 181

Query: 181 YLTAIKSEVETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPR 240
           YL+ IKS+ + H + ++ LI +VE A   DI+EVE FV W+D EL SLVDERAVLKHFP+
Sbjct: 182 YLSDIKSDTDRHRDHIHILISKVEAATFTDISEVETFVKWIDEELSSLVDERAVLKHFPK 241

Query: 241 WPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILRSDE 300
           WPE K D+LREAA +Y+  K+L +E+ SF+DNPK+ +   L+R Q+LQD           
Sbjct: 242 WPERKVDSLREAACNYKRPKNLGNEILSFKDNPKDSLTQALQRIQSLQD----------- 301

Query: 301 NFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRI 360
                 RLE+SV+N EK R+ +  +Y++F+IP EWM ++GL+GQ+K SSLRLA+EYM+RI
Sbjct: 302 ------RLEESVNNTEKMRDSTGKRYKDFQIPWEWMLDTGLIGQLKYSSLRLAQEYMKRI 361

Query: 361 TRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSEAIAAFEELKKVGLSSQR 411
            +EL+S + + +  NL+LQGVRFAY +HQ+AGGFD E ++ F ELKK+     R
Sbjct: 362 AKELES-NGSGKEGNLMLQGVRFAYTIHQFAGGFDGETLSIFHELKKITTGETR 391

BLAST of MS023563 vs. TAIR 10
Match: AT4G18570.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 271.9 bits (694), Expect = 8.2e-73
Identity = 157/339 (46.31%), Positives = 215/339 (63.42%), Query Frame = 0

Query: 72  AESPPATDKREATKSSPKQPVWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRR 131
           A+ PP    +++    P  P    +++    P  + AP P PPPPP P  L   S  VRR
Sbjct: 301 ADPPP----QKSIPPPPPPPPPPLLQQPPPPPSVSKAPPPPPPPPP-PKSLSIASAKVRR 360

Query: 132 VPEVLELYRSLTKRDAQKENKAAHGGFPAVA-------FTKNMIGEIENRSAYLTAIKSE 191
           VPEV+E Y SL +RD+    + + GG  A A         ++MIGEIENRS YL AIK++
Sbjct: 361 VPEVVEFYHSLMRRDSTNSRRDSTGGGNAAAEAILANSNARDMIGEIENRSVYLLAIKTD 420

Query: 192 VETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADA 251
           VET G+F+ +LIKEV  AA  DI +V  FV WLD EL  LVDERAVLKHF  WPE KADA
Sbjct: 421 VETQGDFIRFLIKEVGNAAFSDIEDVVPFVKWLDDELSYLVDERAVLKHF-EWPEQKADA 480

Query: 252 LREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILRSDENFFKFCRL 311
           LREAAF Y DLK L SE   FR++P++     LK+ QAL                 F +L
Sbjct: 481 LREAAFCYFDLKKLISEASRFREDPRQSSSSALKKMQAL-----------------FEKL 540

Query: 312 EQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSID 371
           E  V ++ + RE +  K+++F+IP +WM E+G+  Q+KL+S++LA +YM+R++ EL++I+
Sbjct: 541 EHGVYSLSRMRESAATKFKSFQIPVDWMLETGITSQIKLASVKLAMKYMKRVSAELEAIE 600

Query: 372 -NTQQADNLLLQGVRFAYRVHQYAGGFDSEAIAAFEELK 403
               + + L++QGVRFA+RVHQ+AGGFD+E + AFEEL+
Sbjct: 601 GGGPEEEELIVQGVRFAFRVHQFAGGFDAETMKAFEELR 616

BLAST of MS023563 vs. TAIR 10
Match: AT3G25690.1 (Hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 260.0 bits (663), Expect = 3.2e-69
Identity = 148/312 (47.44%), Positives = 202/312 (64.74%), Query Frame = 0

Query: 103 PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKA 162
           P G P P P    PPPPP P  L     G   V R PE++E Y+SL KR+++KE   +  
Sbjct: 686 PGGGPPPPPGGGPPPPPPPPGALGRGAGGGNKVHRAPELVEFYQSLMKRESKKEGAPSLI 745

Query: 163 AHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVEGAAPRDITEVERF 222
           + G   + A   NMIGEIENRS +L A+K++VET G+FV  L  EV  ++  DI ++  F
Sbjct: 746 SSGTGNSSAARNNMIGEIENRSTFLLAVKADVETQGDFVQSLATEVRASSFTDIEDLLAF 805

Query: 223 VNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEM 282
           V+WLD EL  LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF D+P    
Sbjct: 806 VSWLDEELSFLVDERAVLKHFD-WPEGKADALREAAFEYQDLMKLEKQVTSFVDDPNLSC 865

Query: 283 GVVLKRAQALQDRRECTILRSDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMF 342
              LK+   L +                 ++EQSV  + +TR+ + ++Y+ F IP +W+ 
Sbjct: 866 EPALKKMYKLLE-----------------KVEQSVYALLRTRDMAISRYKEFGIPVDWLS 925

Query: 343 ESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGF 402
           ++G+VG++KLSS++LAK+YM+R+  EL S+  + +  N   LLLQGVRFA+RVHQ+AGGF
Sbjct: 926 DTGVVGKIKLSSVQLAKKYMKRVAYELDSVSGSDKDPNREFLLLQGVRFAFRVHQFAGGF 979

BLAST of MS023563 vs. TAIR 10
Match: AT3G25690.2 (Hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 260.0 bits (663), Expect = 3.2e-69
Identity = 148/312 (47.44%), Positives = 202/312 (64.74%), Query Frame = 0

Query: 103 PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKA 162
           P G P P P    PPPPP P  L     G   V R PE++E Y+SL KR+++KE   +  
Sbjct: 686 PGGGPPPPPGGGPPPPPPPPGALGRGAGGGNKVHRAPELVEFYQSLMKRESKKEGAPSLI 745

Query: 163 AHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVEGAAPRDITEVERF 222
           + G   + A   NMIGEIENRS +L A+K++VET G+FV  L  EV  ++  DI ++  F
Sbjct: 746 SSGTGNSSAARNNMIGEIENRSTFLLAVKADVETQGDFVQSLATEVRASSFTDIEDLLAF 805

Query: 223 VNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEM 282
           V+WLD EL  LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF D+P    
Sbjct: 806 VSWLDEELSFLVDERAVLKHFD-WPEGKADALREAAFEYQDLMKLEKQVTSFVDDPNLSC 865

Query: 283 GVVLKRAQALQDRRECTILRSDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMF 342
              LK+   L +                 ++EQSV  + +TR+ + ++Y+ F IP +W+ 
Sbjct: 866 EPALKKMYKLLE-----------------KVEQSVYALLRTRDMAISRYKEFGIPVDWLS 925

Query: 343 ESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGF 402
           ++G+VG++KLSS++LAK+YM+R+  EL S+  + +  N   LLLQGVRFA+RVHQ+AGGF
Sbjct: 926 DTGVVGKIKLSSVQLAKKYMKRVAYELDSVSGSDKDPNREFLLLQGVRFAFRVHQFAGGF 979

BLAST of MS023563 vs. TAIR 10
Match: AT3G25690.3 (Hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 260.0 bits (663), Expect = 3.2e-69
Identity = 148/312 (47.44%), Positives = 202/312 (64.74%), Query Frame = 0

Query: 103 PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKA 162
           P G P P P    PPPPP P  L     G   V R PE++E Y+SL KR+++KE   +  
Sbjct: 545 PGGGPPPPPGGGPPPPPPPPGALGRGAGGGNKVHRAPELVEFYQSLMKRESKKEGAPSLI 604

Query: 163 AHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVEGAAPRDITEVERF 222
           + G   + A   NMIGEIENRS +L A+K++VET G+FV  L  EV  ++  DI ++  F
Sbjct: 605 SSGTGNSSAARNNMIGEIENRSTFLLAVKADVETQGDFVQSLATEVRASSFTDIEDLLAF 664

Query: 223 VNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEM 282
           V+WLD EL  LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF D+P    
Sbjct: 665 VSWLDEELSFLVDERAVLKHFD-WPEGKADALREAAFEYQDLMKLEKQVTSFVDDPNLSC 724

Query: 283 GVVLKRAQALQDRRECTILRSDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMF 342
              LK+   L +                 ++EQSV  + +TR+ + ++Y+ F IP +W+ 
Sbjct: 725 EPALKKMYKLLE-----------------KVEQSVYALLRTRDMAISRYKEFGIPVDWLS 784

Query: 343 ESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGF 402
           ++G+VG++KLSS++LAK+YM+R+  EL S+  + +  N   LLLQGVRFA+RVHQ+AGGF
Sbjct: 785 DTGVVGKIKLSSVQLAKKYMKRVAYELDSVSGSDKDPNREFLLLQGVRFAFRVHQFAGGF 838

The following BLAST results are available for this feature:

NCBI nr
Match Name       E-value    Identity (%)   Description
XP_022150972.1   3.8e-213   95.38          protein CHUP1, chloroplastic [Momordica charantia]
XP_023523072.1   6.1e-163   74.35          protein CHUP1, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo] >XP_0235230...
XP_022998607.1   5.2e-162   74.11          protein CHUP1, chloroplastic [Cucurbita maxima]
KAG6607325.1     1.5e-161   73.99          Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]
XP_022948306.1   9.8e-161   74.22          protein CHUP1, chloroplastic [Cucurbita moschata]

ExPASy Swiss-Prot
Match Name       E-value    Identity (%)   Description
Q9LI74           4.5e-68    47.44          Protein CHUP1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CHUP1 PE=1 SV=1

ExPASy TrEMBL
Match Name       E-value    Identity (%)   Description
A0A6J1DC83       1.8e-213   95.38          protein CHUP1, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111018994 PE=4...
A0A6J1K8G4       2.5e-162   74.11          protein CHUP1, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111493194 PE=4 SV...
A0A6J1G8X0       4.7e-161   74.22          protein CHUP1, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111452021 PE=4 ...
A0A0A0LVK7       5.2e-160   73.93          Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G532360 PE=4 SV=1
A0A1S3C4V9       1.2e-159   74.17          protein CHUP1, chloroplastic isoform X1 OS=Cucumis melo OX=3656 GN=LOC103497059 ...

TAIR 10
Match Name       E-value    Identity (%)   Description
AT1G07120.1      4.6e-100   48.07          FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
AT4G18570.1      8.2e-73    46.31          Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G25690.1      3.2e-69    47.44          Hydroxyproline-rich glycoprotein family protein
AT3G25690.2      3.2e-69    47.44          Hydroxyproline-rich glycoprotein family protein
AT3G25690.3      3.2e-69    47.44          Hydroxyproline-rich glycoprotein family protein

InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
IPR Term    IPR Description      Source        Source Term      Source Description             Alignment
None        No IPR available     COILS         Coil             Coil                           coord: 13..54
None        No IPR available     MOBIDB_LITE   mobidb-lite      disorder_prediction            coord: 71..118
None        No IPR available     PANTHER       PTHR31342:SF48   CHUP1-LIKE PROTEIN             coord: 4..405
IPR040265   Protein CHUP1-like   PANTHER       PTHR31342        PROTEIN CHUP1, CHLOROPLASTIC   coord: 4..405

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name   Type
MS023563.1     MS023563.1    mRNA