CmoCh01G007120 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh01G007120
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: protein CHUP1, chloroplastic
Location: Cmo_Chr01: 3654001 .. 3656554 (+)
RNA-Seq Expression: CmoCh01G007120
Synteny: CmoCh01G007120
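
The Location field packs the chromosome name, start and end coordinates, and strand into a single string. A minimal sketch of parsing it follows, assuming the `chrom: start .. end (strand)` layout seen in this record and assuming the coordinates are 1-based and inclusive. The names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re

# Matches strings such as "Cmo_Chr01: 3654001 .. 3656554 (+)".
# The layout is an assumption based on this one record.
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into (chromosome, start, end, strand)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of a 1-based, inclusive interval (assumed convention)."""
    return end - start + 1


if __name__ == "__main__":
    chrom, start, end, strand = parse_location("Cmo_Chr01: 3654001 .. 3656554 (+)")
    print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the interval for this gene spans 2554 bp on the plus strand.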
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GGAAGCAAATTATTGTTGGTTGGTTGGCTTCTCCTTCAGCCCATTTCAGAGAAAGACCACAAAATCAGAGTAGAATGCCAATGGAAGAAGATGAAGAATTGGCCATGGAGATCCACGCCTTGAAAAGGGAATTGGAAATTTCACTGCAAAAATCTATTTTTCTCGAGAAAGAAAATCAAGAACTCAAACAAGAATTGGCTCGATTCAAATCTCACATTCACTCTCTGAAAGCTCATAATAATGACAGAAAATCCATTCTTTGGAAGAAATTTCACAATTCCATGGATGTCGCCGGAAATGACTCCACGCCGCAGAGTCCACCGGCGACTGACAAATGGGAGACTACCAGAACACAGAAACAGAGTAATTGGGCTGTTGTGAAAGAGAACCAGAGAATGGCGGCGGCGGCACCGACCCCGGCTCCTCCACCGCCGCCGCCACTTCCGACGAAGCTTCTCGGGGGATCTAAGGCAGTGCGGCGAGTCCCGGAAGTGTTGGAGCTGTACCGTTTAGTGACGAAAAGGGATGCCCAGAAGGAAAATAAGGCCGCAAACGGAGGATTTCCGGCGGTGGCGTTCACCAAAAACATGATCGGCGAAATCGAAAACCGATCAGCCTATCTCTCAGCGGTACGTATTTAGGCTTTTTAAAATTTTGTCCAATTTTAATTAATTTAAAGTAGAATTAAGAATTAATCAAATTAAAAACTATTTTTGTATTTAATAAGTAAAAATACCCATACCCAATTTTAATTCGTAGCAAGTTTATATCGTTAAATTAGAGATAGAATTGTAAGAACATGGAAAGTTTTGGATGAGAAAGACGAAAAAGGCAAGTGAATGAAAGCATGGGCACAGATAAAATCGGAGGTGGAGACACATGGGGAGTTTGTGAACCGGCTGATCAGAGAAGTGGAAGCGGCAGCGCCAAGAGACATAGCGGAGGTGGAGAGGTTCGTGAAGTGGCTAGACGGGGAGCTGGCATCGCTCGTGGACGAGAGGGCGGTGCTCAAGCACTTCCCACGGTGGCCGGAGGGGAAGGCAGACGCACTGCGGGAGGCGGCATTCAGCTACAAGGATCTGAAGAGCTTGGAAGGTGAAGTGTGTTCGTTTAGAGAGAATCCAAAGGAGGAGACGAATGCAATGTTGAAGAGGGCTCAGGCCTTGCAAGACAGGCGAGCATGTACAATCTGTTGTATTTCATGTTACTAATAAGGTGAAGCAGAGTTGGGAAGTGACTGATGAACTAGAGAAATGCTTGGAACTGTGCAGGTTGGAGCAGAGCGTGAGCAATGTGGAGAGGACGAGGGAGTTCAACTGTAAGAAGTACAACAAGTTTCAAATCCCTTGCCAATGGATGCTGGACTCTGGCTTGCCAGCCCAGGTTAACTCTTAATCAAAGCTAGCTTCTAAACGAAAAACCATGAACACCACTTCCGTCCACTTCCGTCCAGTTAGGAGAGGAGTCTCTAAGAGATTGGTCATGGGTTTATAACTAAAAAACACTATCTCGTACTAAAAGCAAAATCATGAAAGCTTGAGACCTTTTAGATAAACTAAAAGCAAAATCATGAAAACTTATGCTCAATGTAGACAATATCGTACCATTATGAAGATTCATGGTTTCTAACCAAGTATCCGAGGCATGCCCTTCATTTTGTCATGTCAATAGAATCCTCAAATGTCCAACACAAAAAAGTTGTGAGCCTTAAAAGTGTAATCAAAAGTAACCAAGTGTAGAACAAAAAGTATACTTTGTTCGAAGGCTCAATAGGAGTCAAACCTTGATTAAGGGGAGGTAGCTCGAGAGCTCCATAGGCCTCAAGGGAAGTTCTATGGTGTACTTTGAGATATGCTTTTTTTTTCTATAAAAGGATTGTTGGGAGAGAAGTCCCACGTTCGATTAATAAGTCCCACATTCGGTTAATAAGGGGGTTGATCATGAGTTTATCAGTAAGAACCACTATATCTATTGGTACTAGACATTTAGGGTAAAACTAA
AAGCAAAATCATAAGCTGCTCAAAGTGCAGAATATCATACCATTGTAGAGGTAACCACGAACAATGTTTCACCTTCTAGACTAGAGAACATAAAAATGAAAAACTGTAAAACACAATTGGTTGCAGATGAAGCTGAGCTCATTGAGGCTAGTGAAGGAATGCATGCGGAGGATAACAAAAGAGAAACAATTGAACGAAACCCCACAAACAGAAAACCTTTTTCTTCAAGGGGTTCGCTTTGCTTACAGGGTGCACCAGGTAAGCTACAAAGACATACTAACTAATCACATTTTATGATACTCAACAATAACACTCGGTATCTTTTCAGTATGCAGGAGGTTTTGATTCGGAAGCTATAGTGGCTTTTGAAGGAATGAAGCAAGTTGGGCTGCAGCTTAATCAAAGAAAATAGGGTTCTTTGGTGATAAGTTATAGTTAACAGCACTTGTAAGAATCAACATTGCAGCAGACCACATTCAGAAAAGGGATGTAATATGAATGATTGAATGGGAAGTTCTATACACAATCAATCCTATGCTTATTGCAACTTATTC

mRNA sequence

GGAAGCAAATTATTGTTGGTTGGTTGGCTTCTCCTTCAGCCCATTTCAGAGAAAGACCACAAAATCAGAGTAGAATGCCAATGGAAGAAGATGAAGAATTGGCCATGGAGATCCACGCCTTGAAAAGGGAATTGGAAATTTCACTGCAAAAATCTATTTTTCTCGAGAAAGAAAATCAAGAACTCAAACAAGAATTGGCTCGATTCAAATCTCACATTCACTCTCTGAAAGCTCATAATAATGACAGAAAATCCATTCTTTGGAAGAAATTTCACAATTCCATGGATGTCGCCGGAAATGACTCCACGCCGCAGAGTCCACCGGCGACTGACAAATGGGAGACTACCAGAACACAGAAACAGAGTAATTGGGCTGTTGTGAAAGAGAACCAGAGAATGGCGGCGGCGGCACCGACCCCGGCTCCTCCACCGCCGCCGCCACTTCCGACGAAGCTTCTCGGGGGATCTAAGGCAGTGCGGCGAGTCCCGGAAGTGTTGGAGCTGTACCGTTTAGTGACGAAAAGGGATGCCCAGAAGGAAAATAAGGCCGCAAACGGAGGATTTCCGGCGGTGGCGTTCACCAAAAACATGATCGGCGAAATCGAAAACCGATCAGCCTATCTCTCAGCGATAAAATCGGAGGTGGAGACACATGGGGAGTTTGTGAACCGGCTGATCAGAGAAGTGGAAGCGGCAGCGCCAAGAGACATAGCGGAGGTGGAGAGGTTCGTGAAGTGGCTAGACGGGGAGCTGGCATCGCTCGTGGACGAGAGGGCGGTGCTCAAGCACTTCCCACGGTGGCCGGAGGGGAAGGCAGACGCACTGCGGGAGGCGGCATTCAGCTACAAGGATCTGAAGAGCTTGGAAGGTGAAGTGTGTTCGTTTAGAGAGAATCCAAAGGAGGAGACGAATGCAATGTTGAAGAGGGCTCAGGCCTTGCAAGACAGGTTGGAGCAGAGCGTGAGCAATGTGGAGAGGACGAGGGAGTTCAACTGTAAGAAGTACAACAAGTTTCAAATCCCTTGCCAATGGATGCTGGACTCTGGCTTGCCAGCCCAGATGAAGCTGAGCTCATTGAGGCTAGTGAAGGAATGCATGCGGAGGATAACAAAAGAGAAACAATTGAACGAAACCCCACAAACAGAAAACCTTTTTCTTCAAGGGGTTCGCTTTGCTTACAGGGTGCACCAGTATGCAGGAGGTTTTGATTCGGAAGCTATAGTGGCTTTTGAAGGAATGAAGCAAGTTGGGCTGCAGCTTAATCAAAGAAAATAGGGTTCTTTGGTGATAAGTTATAGTTAACAGCACTTGTAAGAATCAACATTGCAGCAGACCACATTCAGAAAAGGGATGTAATATGAATGATTGAATGGGAAGTTCTATACACAATCAATCCTATGCTTATTGCAACTTATTC

Coding sequence (CDS)

ATGCCAATGGAAGAAGATGAAGAATTGGCCATGGAGATCCACGCCTTGAAAAGGGAATTGGAAATTTCACTGCAAAAATCTATTTTTCTCGAGAAAGAAAATCAAGAACTCAAACAAGAATTGGCTCGATTCAAATCTCACATTCACTCTCTGAAAGCTCATAATAATGACAGAAAATCCATTCTTTGGAAGAAATTTCACAATTCCATGGATGTCGCCGGAAATGACTCCACGCCGCAGAGTCCACCGGCGACTGACAAATGGGAGACTACCAGAACACAGAAACAGAGTAATTGGGCTGTTGTGAAAGAGAACCAGAGAATGGCGGCGGCGGCACCGACCCCGGCTCCTCCACCGCCGCCGCCACTTCCGACGAAGCTTCTCGGGGGATCTAAGGCAGTGCGGCGAGTCCCGGAAGTGTTGGAGCTGTACCGTTTAGTGACGAAAAGGGATGCCCAGAAGGAAAATAAGGCCGCAAACGGAGGATTTCCGGCGGTGGCGTTCACCAAAAACATGATCGGCGAAATCGAAAACCGATCAGCCTATCTCTCAGCGATAAAATCGGAGGTGGAGACACATGGGGAGTTTGTGAACCGGCTGATCAGAGAAGTGGAAGCGGCAGCGCCAAGAGACATAGCGGAGGTGGAGAGGTTCGTGAAGTGGCTAGACGGGGAGCTGGCATCGCTCGTGGACGAGAGGGCGGTGCTCAAGCACTTCCCACGGTGGCCGGAGGGGAAGGCAGACGCACTGCGGGAGGCGGCATTCAGCTACAAGGATCTGAAGAGCTTGGAAGGTGAAGTGTGTTCGTTTAGAGAGAATCCAAAGGAGGAGACGAATGCAATGTTGAAGAGGGCTCAGGCCTTGCAAGACAGGTTGGAGCAGAGCGTGAGCAATGTGGAGAGGACGAGGGAGTTCAACTGTAAGAAGTACAACAAGTTTCAAATCCCTTGCCAATGGATGCTGGACTCTGGCTTGCCAGCCCAGATGAAGCTGAGCTCATTGAGGCTAGTGAAGGAATGCATGCGGAGGATAACAAAAGAGAAACAATTGAACGAAACCCCACAAACAGAAAACCTTTTTCTTCAAGGGGTTCGCTTTGCTTACAGGGTGCACCAGTATGCAGGAGGTTTTGATTCGGAAGCTATAGTGGCTTTTGAAGGAATGAAGCAAGTTGGGCTGCAGCTTAATCAAAGAAAATAG

Protein sequence

MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK
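
The protein sequence is the conceptual translation of the coding sequence. The sketch below shows one way to reproduce that step, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene. The function name `translate` is illustrative; this is not the pipeline the database used.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First seven codons of the CDS shown above.
    print(translate("ATGCCAATGGAAGAAGATGAA"))
```

Applied to the first 21 bases of the CDS, this yields `MPMEEDE`, which matches the start of the protein sequence. The final codons `CAAAGAAAATAG` translate to `QRK` followed by a stop, matching its end.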
Homology
BLAST of CmoCh01G007120 vs. ExPASy Swiss-Prot
Match: Q9LI74 (Protein CHUP1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CHUP1 PE=1 SV=1)

HSP 1 Score: 245.7 bits (626), Expect = 8.6e-64
Identity = 138/287 (48.08%), Positives = 190/287 (66.20%), Query Frame = 0

Query: 113 PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKAANGGFPAV 172
           P   PPPPPP P  L    GG   V R PE++E Y+ + KR+++KE   +  ++G   + 
Sbjct: 694 PGGGPPPPPPPPGALGRGAGGGNKVHRAPELVEFYQSLMKRESKKEGAPSLISSGTGNSS 753

Query: 173 AFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGEL 232
           A   NMIGEIENRS +L A+K++VET G+FV  L  EV A++  DI ++  FV WLD EL
Sbjct: 754 AARNNMIGEIENRSTFLLAVKADVETQGDFVQSLATEVRASSFTDIEDLLAFVSWLDEEL 813

Query: 233 ASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQ 292
           + LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF ++P       LK+  
Sbjct: 814 SFLVDERAVLKHFD-WPEGKADALREAAFEYQDLMKLEKQVTSFVDDPNLSCEPALKKMY 873

Query: 293 ALQDRLEQSVSNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITK 352
            L +++EQSV  + RTR+    +Y +F IP  W+ D+G+  ++KLSS++L K+ M+R+  
Sbjct: 874 KLLEKVEQSVYALLRTRDMAISRYKEFGIPVDWLSDTGVVGKIKLSSVQLAKKYMKRVAY 933

Query: 353 E----KQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK 390
           E       ++ P  E L LQGVRFA+RVHQ+AGGFD+E++ AFE ++
Sbjct: 934 ELDSVSGSDKDPNREFLLLQGVRFAFRVHQFAGGFDAESMKAFEELR 979
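
The identity and positives percentages reported for each HSP are the respective counts divided by the alignment length. In the middle line of an alignment, a letter marks an identical residue and `+` marks a conservative substitution. The sketch below illustrates that arithmetic. The helper names are hypothetical, and the reading of the middle line follows the usual BLAST pairwise text layout, which is assumed to apply here.

```python
def percent(count, length):
    """Percentage of `count` over `length`, rounded to two decimals."""
    return round(100.0 * count / length, 2)


def count_midline(midline):
    """Return (identities, positives) for one alignment middle line.

    Letters are identities; '+' marks a conservative substitution.
    Positives include identities.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    return identities, identities + conservative


if __name__ == "__main__":
    # Counts reported for the Swiss-Prot hit Q9LI74.
    print(percent(138, 287), percent(190, 287))
    # A short fragment of a middle line, for illustration only.
    print(count_midline("PE++E Y+"))
```

With the counts reported for Q9LI74 (138 and 190 out of 287), the function reproduces the 48.08% identity and 66.20% positives figures.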

BLAST of CmoCh01G007120 vs. ExPASy TrEMBL
Match: A0A6J1G8X0 (protein CHUP1, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111452021 PE=4 SV=1)

HSP 1 Score: 779.2 bits (2011), Expect = 8.0e-222
Identity = 399/399 (100.00%), Positives = 399/399 (100.00%), Query Frame = 0

Query: 1   MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKS 60
           MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKS
Sbjct: 1   MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKS 60

Query: 61  ILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVVKENQRMAAAAPTPAPPPP 120
           ILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVVKENQRMAAAAPTPAPPPP
Sbjct: 61  ILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVVKENQRMAAAAPTPAPPPP 120

Query: 121 PPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRS 180
           PPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRS
Sbjct: 121 PPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRS 180

Query: 181 AYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFP 240
           AYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFP
Sbjct: 181 AYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFP 240

Query: 241 RWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVE 300
           RWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVE
Sbjct: 241 RWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVE 300

Query: 301 RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLF 360
           RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLF
Sbjct: 301 RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLF 360

Query: 361 LQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK 400
           LQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK
Sbjct: 361 LQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK 399

BLAST of CmoCh01G007120 vs. ExPASy TrEMBL
Match: A0A6J1K8G4 (protein CHUP1, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111493194 PE=4 SV=1)

HSP 1 Score: 745.3 bits (1923), Expect = 1.3e-211
Identity = 386/401 (96.26%), Positives = 389/401 (97.01%), Query Frame = 0

Query: 1   MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKS 60
           MPMEEDEELAMEI ALKRELEISLQKS FLEKENQELKQELARFKSH+ SLK HNNDRKS
Sbjct: 1   MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKS 60

Query: 61  ILWKKFHNSMD--VAGNDSTPQSPPATDKWETTRTQKQSNWAVVKENQRMAAAAPTPAPP 120
           ILWKKFHNSMD  VAG DS+PQSPPATDKWETTRTQKQSNWAVVKENQRMAAAAPTPAPP
Sbjct: 61  ILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQKQSNWAVVKENQRMAAAAPTPAPP 120

Query: 121 PPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIEN 180
           PPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKA NGGFPAVAFTKNMIGEIEN
Sbjct: 121 PPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIEN 180

Query: 181 RSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKH 240
           RSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKH
Sbjct: 181 RSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKH 240

Query: 241 FPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN 300
           FPRWPEGKADALREAAFSYKDLKSLE EVCSFRENPKEETNAMLKRAQALQDRLEQSVSN
Sbjct: 241 FPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN 300

Query: 301 VERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTEN 360
           VERTREFNC KYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE QLNETPQTEN
Sbjct: 301 VERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNETPQTEN 360

Query: 361 LFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK 400
           LFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGL L+QRK
Sbjct: 361 LFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLLLSQRK 401

BLAST of CmoCh01G007120 vs. ExPASy TrEMBL
Match: A0A0A0LVK7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G532360 PE=4 SV=1)

HSP 1 Score: 589.3 bits (1518), Expect = 1.2e-164
Identity = 310/403 (76.92%), Positives = 344/403 (85.36%), Query Frame = 0

Query: 1   MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKS 60
           MP EEDE LAMEI+ LK+ELEISLQKSIFLEKENQEL+QEL R +S I S KA NN+RKS
Sbjct: 1   MPKEEDEVLAMEINCLKKELEISLQKSIFLEKENQELRQELNRLRSQIQSFKAQNNERKS 60

Query: 61  ILWKKFHNSMD--VAGNDSTPQSPP--ATDKWETTRTQKQSNWAVVKENQRMAAAAPTPA 120
           ILWKKFH+S+D  VAG DS P SP   A DK E+T++ KQS+W  VKE+ RM     +P 
Sbjct: 61  ILWKKFHSSIDISVAGADSPPLSPATVAGDKRESTKSPKQSSWDDVKESHRMTGVPASPP 120

Query: 121 PPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEI 180
           PPPPPPLPTKLLGGSKAVRRVPEVLELYR +TKRDAQKENK A+GG PAVAFTKNMIGEI
Sbjct: 121 PPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVAHGGAPAVAFTKNMIGEI 180

Query: 181 ENRSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVL 240
           ENRSAYLSAIKSEVETHG+FVN LI+EVE  APRDI+EVERFVKWLDG+LASLVDERAVL
Sbjct: 181 ENRSAYLSAIKSEVETHGDFVNWLIKEVETIAPRDISEVERFVKWLDGKLASLVDERAVL 240

Query: 241 KHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSV 300
           K+FPRWPE KADALREAAFSY+DLK LE +VC FR+NPKEE N +LKRAQALQDR+EQSV
Sbjct: 241 KYFPRWPEAKADALREAAFSYRDLKGLESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSV 300

Query: 301 SNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQT 360
           SN+ERTREFNC+KY  FQIPCQWM DS LP Q+K+S+LRL KE M RIT+E Q  ETPQ 
Sbjct: 301 SNMERTREFNCRKYQAFQIPCQWMFDSALPTQIKMSTLRLAKEYMIRITRELQSTETPQR 360

Query: 361 ENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK 400
           ENLFLQG RFAYRVHQYAGGFDSE I AFEG+K+ GL  +QRK
Sbjct: 361 ENLFLQGARFAYRVHQYAGGFDSETIEAFEGLKKAGLS-SQRK 402

BLAST of CmoCh01G007120 vs. ExPASy TrEMBL
Match: A0A6J1DC83 (protein CHUP1, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111018994 PE=4 SV=1)

HSP 1 Score: 587.4 bits (1513), Expect = 4.4e-164
Identity = 311/402 (77.36%), Positives = 344/402 (85.57%), Query Frame = 0

Query: 1   MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKS 60
           MP EEDEELAMEI +L++EL+I++ KS FLEKENQEL+QEL R KS I SLKAHNNDRKS
Sbjct: 1   MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKS 60

Query: 61  ILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQ-KQSNWAVVKENQRMAAAAPTPAP-P 120
           +LWKKF+NSMD        +SPPATDK E T++  KQ  W  VKE+QRM   AP PAP P
Sbjct: 61  LLWKKFYNSMDA-------ESPPATDKREATKSSPKQPVWVAVKESQRMPEGAPAPAPAP 120

Query: 121 PPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIEN 180
           PPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GGFPAVAFTKNMIGEIEN
Sbjct: 121 PPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIEN 180

Query: 181 RSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKH 240
           RSAYL+AIKSEVETHGEFVN LI+EVE AAPRDI EVERFV WLD EL SLVDERAVLKH
Sbjct: 181 RSAYLTAIKSEVETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKH 240

Query: 241 FPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN 300
           FPRWPEGKADALREAAFSY+DLKSLE EVCSFR+NPKEE   +LKRAQALQDRLEQSVSN
Sbjct: 241 FPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRLEQSVSN 300

Query: 301 VERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQ-LNETPQTE 360
           VE+TREF+C KY  F+IPC+WM +SGL  QMKLSSLRL KE MRRIT+E Q ++ T Q +
Sbjct: 301 VEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQAD 360

Query: 361 NLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK 400
           NL LQGVRFAYRVHQYAGGFDS+AI AFEG+K+VGL  +QRK
Sbjct: 361 NLLLQGVRFAYRVHQYAGGFDSDAIAAFEGLKKVGLS-SQRK 394

BLAST of CmoCh01G007120 vs. ExPASy TrEMBL
Match: A0A1S3C4V9 (protein CHUP1, chloroplastic isoform X1 OS=Cucumis melo OX=3656 GN=LOC103497059 PE=4 SV=1)

HSP 1 Score: 579.7 bits (1493), Expect = 9.3e-162
Identity = 309/403 (76.67%), Positives = 341/403 (84.62%), Query Frame = 0

Query: 1   MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKS 60
           MP E+DEELAMEI  LK++LEISLQKSIFLE+ENQEL+ EL R KS I SLKA NN+RKS
Sbjct: 1   MPKEKDEELAMEIDCLKKDLEISLQKSIFLERENQELRLELNRLKSQIQSLKALNNERKS 60

Query: 61  ILWKKFHNSMD--VAGNDSTPQSP--PATDKWETTRTQKQSNWAVVKENQRMAAAAPTPA 120
           ILWKKFH+SMD  VAG DS P +P   A DK E T+  KQS+W  VKE+QRM A   +  
Sbjct: 61  ILWKKFHSSMDMAVAGADSPPLNPATAAGDKREVTKFPKQSSWDDVKESQRMTAVPASAP 120

Query: 121 PPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEI 180
           PPPPPPLP KLLGGSKAVRRVPEVL+LYR +TKRDAQKENK A+GG P VAFTKNMIGEI
Sbjct: 121 PPPPPPLPKKLLGGSKAVRRVPEVLDLYRTLTKRDAQKENKVAHGGAPVVAFTKNMIGEI 180

Query: 181 ENRSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVL 240
           ENRSAYLSAIKSEVETHGEFVN LI+EVE  APRDI+E E+FVKWLD +LASLVDERAVL
Sbjct: 181 ENRSAYLSAIKSEVETHGEFVNWLIKEVEMIAPRDISEAEKFVKWLDVKLASLVDERAVL 240

Query: 241 KHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSV 300
           KHFPRWPE KADALREAAFSY+DLKSLE +VC FR+NPKEE N +LKRAQALQDR+EQSV
Sbjct: 241 KHFPRWPEAKADALREAAFSYRDLKSLESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSV 300

Query: 301 SNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQT 360
           SN+ERTREFNCKKY  FQIPCQWM DS LP Q+KLS+LRL KE M RIT+E +  ET Q 
Sbjct: 301 SNMERTREFNCKKYQAFQIPCQWMFDSALPTQIKLSTLRLAKEYMIRITRELRSTETSQA 360

Query: 361 ENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK 400
           ENLFLQGVRFAYRVHQYAGGFDSEAI AFEG+K+ GL  +QRK
Sbjct: 361 ENLFLQGVRFAYRVHQYAGGFDSEAIEAFEGLKKAGLS-SQRK 402

BLAST of CmoCh01G007120 vs. NCBI nr
Match: XP_022948306.1 (protein CHUP1, chloroplastic [Cucurbita moschata])

HSP 1 Score: 779.2 bits (2011), Expect = 1.6e-221
Identity = 399/399 (100.00%), Positives = 399/399 (100.00%), Query Frame = 0

Query: 1   MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKS 60
           MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKS
Sbjct: 1   MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKS 60

Query: 61  ILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVVKENQRMAAAAPTPAPPPP 120
           ILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVVKENQRMAAAAPTPAPPPP
Sbjct: 61  ILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVVKENQRMAAAAPTPAPPPP 120

Query: 121 PPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRS 180
           PPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRS
Sbjct: 121 PPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRS 180

Query: 181 AYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFP 240
           AYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFP
Sbjct: 181 AYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFP 240

Query: 241 RWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVE 300
           RWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVE
Sbjct: 241 RWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVE 300

Query: 301 RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLF 360
           RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLF
Sbjct: 301 RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLF 360

Query: 361 LQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK 400
           LQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK
Sbjct: 361 LQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK 399

BLAST of CmoCh01G007120 vs. NCBI nr
Match: KAG6607325.1 (Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 768.8 bits (1984), Expect = 2.2e-218
Identity = 394/399 (98.75%), Positives = 395/399 (99.00%), Query Frame = 0

Query: 1   MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKS 60
           MPMEEDEELAMEIHALKRELEISLQKS FLEKENQELKQELARFKSH+ SLK HNNDRKS
Sbjct: 1   MPMEEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKS 60

Query: 61  ILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVVKENQRMAAAAPTPAPPPP 120
           ILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVVKENQRMAAAAPTPAPPPP
Sbjct: 61  ILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVVKENQRMAAAAPTPAPPPP 120

Query: 121 PPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRS 180
           PPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRS
Sbjct: 121 PPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRS 180

Query: 181 AYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFP 240
           AYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFP
Sbjct: 181 AYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFP 240

Query: 241 RWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVE 300
           RWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVE
Sbjct: 241 RWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVE 300

Query: 301 RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLF 360
           RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE QLNETPQTENLF
Sbjct: 301 RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQLNETPQTENLF 360

Query: 361 LQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK 400
           LQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK
Sbjct: 361 LQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK 399

BLAST of CmoCh01G007120 vs. NCBI nr
Match: XP_023523072.1 (protein CHUP1, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo] >XP_023523080.1 protein CHUP1, chloroplastic isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 754.2 bits (1946), Expect = 5.7e-214
Identity = 391/401 (97.51%), Positives = 392/401 (97.76%), Query Frame = 0

Query: 1   MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKS 60
           MPMEEDEELAMEI ALKRELEISLQKS FLEKENQELKQELARFKSHI SLKAHNNDRKS
Sbjct: 1   MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHIQSLKAHNNDRKS 60

Query: 61  ILWKKFHNSMD--VAGNDSTPQSPPATDKWETTRTQKQSNWAVVKENQRMAAAAPTPAPP 120
           ILWKKFHNSMD  VAG DS+PQSPPATDKWETTRTQKQSNWAVVKENQRMAAAAPTPAPP
Sbjct: 61  ILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQKQSNWAVVKENQRMAAAAPTPAPP 120

Query: 121 PPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIEN 180
           PPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIEN
Sbjct: 121 PPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIEN 180

Query: 181 RSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKH 240
           RSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGEL SLVDERAVLKH
Sbjct: 181 RSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELGSLVDERAVLKH 240

Query: 241 FPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN 300
           FPRWPEGKADALREAAFSYKDLKSLE EVCSFRENPKEETNAMLKRAQALQDRLEQSVSN
Sbjct: 241 FPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN 300

Query: 301 VERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTEN 360
           VERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE QLNETPQTEN
Sbjct: 301 VERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEIQLNETPQTEN 360

Query: 361 LFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK 400
           LFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK
Sbjct: 361 LFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK 401

BLAST of CmoCh01G007120 vs. NCBI nr
Match: XP_022998607.1 (protein CHUP1, chloroplastic [Cucurbita maxima])

HSP 1 Score: 745.3 bits (1923), Expect = 2.6e-211
Identity = 386/401 (96.26%), Positives = 389/401 (97.01%), Query Frame = 0

Query: 1   MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKS 60
           MPMEEDEELAMEI ALKRELEISLQKS FLEKENQELKQELARFKSH+ SLK HNNDRKS
Sbjct: 1   MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKS 60

Query: 61  ILWKKFHNSMD--VAGNDSTPQSPPATDKWETTRTQKQSNWAVVKENQRMAAAAPTPAPP 120
           ILWKKFHNSMD  VAG DS+PQSPPATDKWETTRTQKQSNWAVVKENQRMAAAAPTPAPP
Sbjct: 61  ILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQKQSNWAVVKENQRMAAAAPTPAPP 120

Query: 121 PPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIEN 180
           PPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKA NGGFPAVAFTKNMIGEIEN
Sbjct: 121 PPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIEN 180

Query: 181 RSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKH 240
           RSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKH
Sbjct: 181 RSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKH 240

Query: 241 FPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN 300
           FPRWPEGKADALREAAFSYKDLKSLE EVCSFRENPKEETNAMLKRAQALQDRLEQSVSN
Sbjct: 241 FPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN 300

Query: 301 VERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTEN 360
           VERTREFNC KYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE QLNETPQTEN
Sbjct: 301 VERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNETPQTEN 360

Query: 361 LFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK 400
           LFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGL L+QRK
Sbjct: 361 LFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLLLSQRK 401

BLAST of CmoCh01G007120 vs. NCBI nr
Match: KAG7037002.1 (Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 705.7 bits (1820), Expect = 2.3e-199
Identity = 371/403 (92.06%), Positives = 376/403 (93.30%), Query Frame = 0

Query: 3   MEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSIL 62
           MEEDEELAMEIHALKRELEISLQKS FLEKENQELKQELARFKSH+ SLK HNNDRKSIL
Sbjct: 1   MEEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSIL 60

Query: 63  WKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVVKENQRMAAAAPTPAPPPPPP 122
           WKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVVKENQRMAAAAPTPAPPPPPP
Sbjct: 61  WKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVVKENQRMAAAAPTPAPPPPPP 120

Query: 123 LPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAY 182
           LPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAY
Sbjct: 121 LPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAY 180

Query: 183 LSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRW 242
           LSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRW
Sbjct: 181 LSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRW 240

Query: 243 PEGKADALREAAFSYKDLKSLEGE-----VCSFRENPKEETNAMLKRAQALQ-DRLEQSV 302
           PEGKADALREAAFSYKDLKSLEGE     V       K+      +  + L+  RLEQSV
Sbjct: 241 PEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQSV 300

Query: 303 SNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQT 362
           SNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE QLNETPQT
Sbjct: 301 SNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQLNETPQT 360

Query: 363 ENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK 400
           ENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK
Sbjct: 361 ENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK 403

BLAST of CmoCh01G007120 vs. TAIR 10
Match: AT1G07120.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast envelope; EXPRESSED IN: inflorescence meristem, petal, leaf whorl, flower; EXPRESSED DURING: 4 anthesis, petal differentiation and expansion stage; BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT4G18570.1); Has 288 Blast hits to 260 proteins in 50 species: Archae - 0; Bacteria - 8; Metazoa - 27; Fungi - 15; Plants - 163; Viruses - 0; Other Eukaryotes - 75 (source: NCBI BLink). )

HSP 1 Score: 351.7 bits (901), Expect = 7.9e-97
Identity = 192/391 (49.10%), Positives = 264/391 (67.52%), Query Frame = 0

Query: 1   MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKS 60
           +P  ED+    ++  L +EL+  L ++  LEKEN EL+QE+AR ++ + +LK+H N+RKS
Sbjct: 2   LPNGEDDS---DLLRLVKELQAYLVRNDKLEKENHELRQEVARLRAQVSNLKSHENERKS 61

Query: 61  ILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVVKENQRMAAAAPTPAPPPP 120
           +LWKK  +S D +  D +    P + K   T+ Q+  N         +   +    PPPP
Sbjct: 62  MLWKKLQSSYDGSNTDGSNLKAPESVK-SNTKGQEVRN---PNPKPTIQGQSTATKPPPP 121

Query: 121 PPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRS 180
           PPLP+K   G ++VRR PEV+E YR +TKR++   NK    G  + AF +NMIGEIENRS
Sbjct: 122 PPLPSKRTLGKRSVRRAPEVVEFYRALTKRESHMGNKINQNGVLSPAFNRNMIGEIENRS 181

Query: 181 AYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFP 240
            YLS IKS+ + H + ++ LI +VEAA   DI+EVE FVKW+D EL+SLVDERAVLKHFP
Sbjct: 182 KYLSDIKSDTDRHRDHIHILISKVEAATFTDISEVETFVKWIDEELSSLVDERAVLKHFP 241

Query: 241 RWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVE 300
           +WPE K D+LREAA +YK  K+L  E+ SF++NPK+     L+R Q+LQDRLE+SV+N E
Sbjct: 242 KWPERKVDSLREAACNYKRPKNLGNEILSFKDNPKDSLTQALQRIQSLQDRLEESVNNTE 301

Query: 301 RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLF 360
           + R+   K+Y  FQIP +WMLD+GL  Q+K SSLRL +E M+RI KE + N + +  NL 
Sbjct: 302 KMRDSTGKRYKDFQIPWEWMLDTGLIGQLKYSSLRLAQEYMKRIAKELESNGSGKEGNLM 361

Query: 361 LQGVRFAYRVHQYAGGFDSEAIVAFEGMKQV 392
           LQGVRFAY +HQ+AGGFD E +  F  +K++
Sbjct: 362 LQGVRFAYTIHQFAGGFDGETLSIFHELKKI 385

BLAST of CmoCh01G007120 vs. TAIR 10
Match: AT4G18570.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 267.3 bits (682), Expect = 2.0e-71
Identity = 146/289 (50.52%), Positives = 196/289 (67.82%), Query Frame = 0

Query: 110 AAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVA-- 169
           + +  P PPPPPP P  L   S  VRRVPEV+E Y  + +RD+    + + GG  A A  
Sbjct: 329 SVSKAPPPPPPPPPPKSLSIASAKVRRVPEVVEFYHSLMRRDSTNSRRDSTGGGNAAAEA 388

Query: 170 -----FTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWL 229
                  ++MIGEIENRS YL AIK++VET G+F+  LI+EV  AA  DI +V  FVKWL
Sbjct: 389 ILANSNARDMIGEIENRSVYLLAIKTDVETQGDFIRFLIKEVGNAAFSDIEDVVPFVKWL 448

Query: 230 DGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAML 289
           D EL+ LVDERAVLKHF  WPE KADALREAAF Y DLK L  E   FRE+P++ +++ L
Sbjct: 449 DDELSYLVDERAVLKHF-EWPEQKADALREAAFCYFDLKKLISEASRFREDPRQSSSSAL 508

Query: 290 KRAQALQDRLEQSVSNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMR 349
           K+ QAL ++LE  V ++ R RE    K+  FQIP  WML++G+ +Q+KL+S++L  + M+
Sbjct: 509 KKMQALFEKLEHGVYSLSRMRESAATKFKSFQIPVDWMLETGITSQIKLASVKLAMKYMK 568

Query: 350 RITKEKQLNE--TPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK 390
           R++ E +  E   P+ E L +QGVRFA+RVHQ+AGGFD+E + AFE ++
Sbjct: 569 RVSAELEAIEGGGPEEEELIVQGVRFAFRVHQFAGGFDAETMKAFEELR 616

BLAST of CmoCh01G007120 vs. TAIR 10
Match: AT3G25690.1 (Hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 245.7 bits (626), Expect = 6.1e-65
Identity = 138/287 (48.08%), Positives = 190/287 (66.20%), Query Frame = 0

Query: 113 PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKAANGGFPAV 172
           P   PPPPPP P  L    GG   V R PE++E Y+ + KR+++KE   +  ++G   + 
Sbjct: 694 PGGGPPPPPPPPGALGRGAGGGNKVHRAPELVEFYQSLMKRESKKEGAPSLISSGTGNSS 753

Query: 173 AFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGEL 232
           A   NMIGEIENRS +L A+K++VET G+FV  L  EV A++  DI ++  FV WLD EL
Sbjct: 754 AARNNMIGEIENRSTFLLAVKADVETQGDFVQSLATEVRASSFTDIEDLLAFVSWLDEEL 813

Query: 233 ASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQ 292
           + LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF ++P       LK+  
Sbjct: 814 SFLVDERAVLKHFD-WPEGKADALREAAFEYQDLMKLEKQVTSFVDDPNLSCEPALKKMY 873

Query: 293 ALQDRLEQSVSNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITK 352
            L +++EQSV  + RTR+    +Y +F IP  W+ D+G+  ++KLSS++L K+ M+R+  
Sbjct: 874 KLLEKVEQSVYALLRTRDMAISRYKEFGIPVDWLSDTGVVGKIKLSSVQLAKKYMKRVAY 933

Query: 353 E----KQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK 390
           E       ++ P  E L LQGVRFA+RVHQ+AGGFD+E++ AFE ++
Sbjct: 934 ELDSVSGSDKDPNREFLLLQGVRFAFRVHQFAGGFDAESMKAFEELR 979

BLAST of CmoCh01G007120 vs. TAIR 10
Match: AT3G25690.2 (Hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 245.7 bits (626), Expect = 6.1e-65
Identity = 138/287 (48.08%), Positives = 190/287 (66.20%), Query Frame = 0

Query: 113 PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKAANGGFPAV 172
           P   PPPPPP P  L    GG   V R PE++E Y+ + KR+++KE   +  ++G   + 
Sbjct: 694 PGGGPPPPPPPPGALGRGAGGGNKVHRAPELVEFYQSLMKRESKKEGAPSLISSGTGNSS 753

Query: 173 AFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGEL 232
           A   NMIGEIENRS +L A+K++VET G+FV  L  EV A++  DI ++  FV WLD EL
Sbjct: 754 AARNNMIGEIENRSTFLLAVKADVETQGDFVQSLATEVRASSFTDIEDLLAFVSWLDEEL 813

Query: 233 ASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQ 292
           + LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF ++P       LK+  
Sbjct: 814 SFLVDERAVLKHFD-WPEGKADALREAAFEYQDLMKLEKQVTSFVDDPNLSCEPALKKMY 873

Query: 293 ALQDRLEQSVSNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITK 352
            L +++EQSV  + RTR+    +Y +F IP  W+ D+G+  ++KLSS++L K+ M+R+  
Sbjct: 874 KLLEKVEQSVYALLRTRDMAISRYKEFGIPVDWLSDTGVVGKIKLSSVQLAKKYMKRVAY 933

Query: 353 E----KQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK 390
           E       ++ P  E L LQGVRFA+RVHQ+AGGFD+E++ AFE ++
Sbjct: 934 ELDSVSGSDKDPNREFLLLQGVRFAFRVHQFAGGFDAESMKAFEELR 979

BLAST of CmoCh01G007120 vs. TAIR 10
Match: AT3G25690.3 (Hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 245.7 bits (626), Expect = 6.1e-65
Identity = 138/287 (48.08%), Positives = 190/287 (66.20%), Query Frame = 0

Query: 113 PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKAANGGFPAV 172
           P   PPPPPP P  L    GG   V R PE++E Y+ + KR+++KE   +  ++G   + 
Sbjct: 553 PGGGPPPPPPPPGALGRGAGGGNKVHRAPELVEFYQSLMKRESKKEGAPSLISSGTGNSS 612

Query: 173 AFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGEL 232
           A   NMIGEIENRS +L A+K++VET G+FV  L  EV A++  DI ++  FV WLD EL
Sbjct: 613 AARNNMIGEIENRSTFLLAVKADVETQGDFVQSLATEVRASSFTDIEDLLAFVSWLDEEL 672

Query: 233 ASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQ 292
           + LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF ++P       LK+  
Sbjct: 673 SFLVDERAVLKHFD-WPEGKADALREAAFEYQDLMKLEKQVTSFVDDPNLSCEPALKKMY 732

Query: 293 ALQDRLEQSVSNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITK 352
            L +++EQSV  + RTR+    +Y +F IP  W+ D+G+  ++KLSS++L K+ M+R+  
Sbjct: 733 KLLEKVEQSVYALLRTRDMAISRYKEFGIPVDWLSDTGVVGKIKLSSVQLAKKYMKRVAY 792

Query: 353 E----KQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK 390
           E       ++ P  E L LQGVRFA+RVHQ+AGGFD+E++ AFE ++
Sbjct: 793 ELDSVSGSDKDPNREFLLLQGVRFAFRVHQFAGGFDAESMKAFEELR 838

The following BLAST results are available for this feature:

ExPASy Swiss-Prot
Match Name      E-value   Identity (%)  Description
Q9LI74          8.6e-64   48.08         Protein CHUP1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CHUP1 PE=1 SV=1

ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A6J1G8X0      8.0e-222  100.00        protein CHUP1, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111452021 PE=4 ...
A0A6J1K8G4      1.3e-211  96.26         protein CHUP1, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111493194 PE=4 SV...
A0A0A0LVK7      1.2e-164  76.92         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G532360 PE=4 SV=1
A0A6J1DC83      4.4e-164  77.36         protein CHUP1, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111018994 PE=4...
A0A1S3C4V9      9.3e-162  76.67         protein CHUP1, chloroplastic isoform X1 OS=Cucumis melo OX=3656 GN=LOC103497059 ...

NCBI nr
Match Name      E-value   Identity (%)  Description
XP_022948306.1  1.6e-221  100.00        protein CHUP1, chloroplastic [Cucurbita moschata]
KAG6607325.1    2.2e-218  98.75         Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]
XP_023523072.1  5.7e-214  97.51         protein CHUP1, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo] >XP_0235230...
XP_022998607.1  2.6e-211  96.26         protein CHUP1, chloroplastic [Cucurbita maxima]
KAG7037002.1    2.3e-199  92.06         Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperm...

TAIR 10
Match Name      E-value   Identity (%)  Description
AT1G07120.1     7.9e-97   49.10         FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
AT4G18570.1     2.0e-71   50.52         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G25690.1     6.1e-65   48.08         Hydroxyproline-rich glycoprotein family protein
AT3G25690.2     6.1e-65   48.08         Hydroxyproline-rich glycoprotein family protein
AT3G25690.3     6.1e-65   48.08         Hydroxyproline-rich glycoprotein family protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term   IPR Description     Source       Source Term     Source Description            Alignment
None       No IPR available    COILS        Coil            Coil                          coord: 20..54
None       No IPR available    COILS        Coil            Coil                          coord: 275..302
None       No IPR available    MOBIDB_LITE  mobidb-lite     disorder_prediction           coord: 72..97
None       No IPR available    PANTHER      PTHR31342:SF48  CHUP1-LIKE PROTEIN            coord: 4..392
IPR040265  Protein CHUP1-like  PANTHER      PTHR31342       PROTEIN CHUP1, CHLOROPLASTIC  coord: 4..392
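
The `coord: start..end` values locate each predicted feature on the protein sequence. A small sketch of pulling out the corresponding residues follows, assuming the coordinates are 1-based and inclusive, which is the common convention for such annotations but is not stated in this record. The names `parse_coord` and `slice_feature` are hypothetical.

```python
def parse_coord(text):
    """Parse a string like 'coord: 20..54' into (start, end) integers."""
    _, _, span = text.partition(":")
    start, _, end = span.strip().partition("..")
    return int(start), int(end)


def slice_feature(sequence, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]


if __name__ == "__main__":
    start, end = parse_coord("coord: 20..54")
    print(start, end, end - start + 1)
    # Toy sequence for illustration; substitute the protein sequence above.
    print(slice_feature("ABCDEFGHIJ", 3, 5))
```

Under that assumption, the first coiled-coil prediction (20..54) covers 35 residues.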

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh01G007120.1  CmoCh01G007120.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0009658      chloroplast organization
cellular_component  GO:0009707      chloroplast outer membrane