CsaV3_1G036390 (gene) Cucumber (Chinese Long) v3

Name: CsaV3_1G036390
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v3)
Description: protein CHUP1, chloroplastic isoform X1
Location: chr1 : 22315548 .. 22318805 (+)
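The Location field gives a chromosome, a start, an end and a strand. Assuming the coordinates are 1-based and inclusive (as in GFF3), the span length is end minus start plus one. The following is a minimal sketch of parsing that field; the function names `parse_location` and `span_length` are illustrative and not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'chr1 : 22315548 .. 22318805 (+)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates (GFF3-style).
    return end - start + 1

seqid, start, end, strand = parse_location("chr1 : 22315548 .. 22318805 (+)")
print(seqid, strand, span_length(start, end))  # chr1 + 3258
```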
The following sequences are available for this feature:

Gene sequence (with intron)

GAAATAAATTAAATGAGCCCAAAAAATAAAGCAAACTACAACAGTTTGGCTTCTCCATTAGCCCATTTATTAAAAATTAACTAAAAAAACAAAGGAGAATGCCAAAGGAAGAAGATGAAGTATTAGCTATGGAGATCAATTGCTTGAAAAAAGAATTGGAAATTTCTCTACAAAAATCAATTTTTCTCGAAAAAGAAAATCAAGAACTCAGACAGGAACTGAATCGATTGCGATCCCAAATTCAGTCATTCAAAGCTCAAAACAATGAGAGAAAATCCATTCTCTGGAAGAAATTTCATAGCTCCATCGACATTTCCGTAGCCGGAGCTGACTCGCCGCCGCTAAGCCCTGCCACGGTGGCGGGTGATAAACGGGAGTCGACCAAATCGCCGAAACAGAGTAGTTGGGATGATGTGAAAGAGAGTCATAGAATGACGGGGGTACCGGCATCGCCACCGCCACCGCCACCGCCGCCACTTCCGACGAAACTGCTCGGAGGATCAAAGGCAGTGCGGCGTGTTCCGGAAGTGTTGGAGTTGTATCGTACACTGACGAAAAGAGATGCACAGAAGGAAAATAAAGTGGCACACGGCGGAGCTCCGGCTGTGGCGTTCACTAAAAACATGATCGGCGAAATTGAAAACCGATCTGCCTATCTCTCGGCGGTATGTATTTATAATTAACAGATTTTTATGGGGTGGTGGCGTTCAAAAACTTTAAAAATAAAATTTAAAATGTTACTAAATTGAATTTTATATTCGACAGAGTTCTAAACCTTTTAATTTACTCTGTTAATAGAAATGAAATTTTATAACCTTTCCAGGTTGAACTCAAATTTAAATATTTTATAAGTTTACACTTCAAATTTAAGAGTTTTTCGATTTTTCTTATAGTATGTGTGGATAGAAGAATTAAATTATAATATCCAAATTGATATTACATAACATCCATTAAGCTAGGTTTATCTTAATTCAAATTCACTCGAGTTAGAGACTTTAGAATTTTTCTCTAAGGTAATCACTTTTTTCTTTTAAATTACGTGTTTGTTCGATTATACTACAAATAATTAAAATTTTGTTGAATATATAAAATTGAGAGGAAAGCTAATTAAATTAAATTAAAAAGTTCAAAAACCTCTCTTAGTTTGATGGTACTAGATTTAAAATCATCGAGGCAAAAAAGAAAAACTATGGTTTCACAGACATAAACTTGAATCTATGAAACTTAGGTAGAAGTCATACAAGGTAATGAAAGTTAGTCTCTTTGATTAGTTACTCCCATGTTAGTAACTCTAAAAAAACCTTCGAATTAGAAGACTCAGTTGAAATATTTTCATTTAAAATTTTAAAAGATAAAAAATCAAAGGTTGGATTGTCTAAAAAACTAGAGATGGAACACATGGCAGATAAAATCCGAGGTAGAAACACATGGAGATTTTGTGAATTGGTTGATCAAAGAAGTAGAAACGATAGCACCAAGAGACATATCAGAGGTGGAAAGGTTCGTGAAGTGGCTTGATGGGAAGTTGGCATCATTGGTGGACGAGAGAGCAGTGCTAAAGTACTTCCCACGGTGGCCAGAGGCAAAAGCAGATGCACTACGGGAGGCAGCATTCAGCTACCGAGACCTGAAGGGCTTAGAGAGCAAAGTGTGTATGTTTAGAGACAATCCAAAGGAAGAGATGAATGTTGTGTTGAAGAGAGCTCAAGCATTGCAAGACAGGCGAGAATGTACAATCAATTTGTTGTTTGTTGTGCAATGTTTCTGTTTTGGTAAGTTGATTTGAATGATGAACTTATGAGAATTTTGTTGTTTTAAAATGTGCAGGGTGGAGCAGAGTGTGAGTAATATGGAGAGGACGAGGGAGTTCAATTGTAGGAAGTATCAAGCTTTTCAAATTCCCTGCCAATGGATGTTCGATTCCGCTTTGCCTACTCAGGTATCCAACATTAACACATTATATAACACTAATTATCATTTCATTGGATCGAAATCGATTT
TCTAACATCTTACAATCTATGCTTGCAGTACTAAACGTAAATTCTATGAGATGTACAAGTCAAATCTATATATAATTAATCAATTAATTTAAAATTTTGAACCTACAAAGCTCAGCAATCTAGACATCAAGTTAAGGGGAAAACGAATTTCAGGGTTATAATCTTAAAACAGATTCGGCGAAACATTTTACAAAACTTTTTTAAGATTAAGAAACACAAATGAAATTGAAATCAATTTTCAATCGACTGTGTCTCAGGACAAGAACCAGGTTGATAGTGAGTTTTAACCACAAAAACTAGTTGAATTTTCTTGGATTATAATCTAATAGTCAAGTTTCAGAATCACATTAAAATAACTAATAATAATATGCTTAGACAAAAATAACTACAAACTATACGTGAACATTCATTGCACACACCAAAAGGAAAACCAAAACATCTTGAATCATCAAGAAAATGAGAAGAATAAAAAGCCCTAACTCAGTCTCTTGGATTGCAGATAAAGATGAGCACATTGAGGCTGGCGAAGGAATACATGATAAGGATAACAAGAGAACTACAATCAACCGAGACCCCACAAAGAGAAAACCTTTTCCTCCAAGGGGCTCGATTTGCGTACAGGGTTCATCAGGTAAACCAGAACGATTCACTAATACAAACACATTGCTTTGACAATGACTAACCAATCATAATCCTTTTTAGTATGCAGGTGGTTTCGATTCAGAGACTATAGAGGCTTTTGAAGGACTGAAGAAAGCTGGGCTGAGTAGTCAAAGAAAATAGGCTCGAACGTAAATTATAGGGAATGAATATTGTCGTCTGTTGCCAACTCAAATGCAGAGTACATTCAACTAGATGATGTAACACAACTGCTTGAACGGAATTTTATATATATAATCAAATCCTATTGCATCTTGTACAACCCATAGCAAGAAAATGGCATATCAGCATAACTTGACTTTGTTCTATTAAAATTTAGTAATTAACTAGATATAACACCAGGAGGAACGTCCTAAGAATATATGAAGGGCGAAGCTTTCAATAAAACAATGTTTCTTGATTGGACGACGTTAGAGAACTTATCCCAACAGTCTAAAACCCCATATTCCACCACTAAGATTTCTAATTGGGCAAGGTTTCCTTTCTCAAAAAAGAAGCAAAACTAGAAAGAGTCAACTCGATGTTCCAACAAAATCACAAAAGAAAATAGAACACTCATGAACAAGAACAGTGTTATAATCAAAATGCAGAGCAATGC

mRNA sequence

ATGCCAAAGGAAGAAGATGAAGTATTAGCTATGGAGATCAATTGCTTGAAAAAAGAATTGGAAATTTCTCTACAAAAATCAATTTTTCTCGAAAAAGAAAATCAAGAACTCAGACAGGAACTGAATCGATTGCGATCCCAAATTCAGTCATTCAAAGCTCAAAACAATGAGAGAAAATCCATTCTCTGGAAGAAATTTCATAGCTCCATCGACATTTCCGTAGCCGGAGCTGACTCGCCGCCGCTAAGCCCTGCCACGGTGGCGGGTGATAAACGGGAGTCGACCAAATCGCCGAAACAGAGTAGTTGGGATGATGTGAAAGAGAGTCATAGAATGACGGGGGTACCGGCATCGCCACCGCCACCGCCACCGCCGCCACTTCCGACGAAACTGCTCGGAGGATCAAAGGCAGTGCGGCGTGTTCCGGAAGTGTTGGAGTTGTATCGTACACTGACGAAAAGAGATGCACAGAAGGAAAATAAAGTGGCACACGGCGGAGCTCCGGCTGTGGCGTTCACTAAAAACATGATCGGCGAAATTGAAAACCGATCTGCCTATCTCTCGGCGATAAAATCCGAGGTAGAAACACATGGAGATTTTGTGAATTGGTTGATCAAAGAAGTAGAAACGATAGCACCAAGAGACATATCAGAGGTGGAAAGGTTCGTGAAGTGGCTTGATGGGAAGTTGGCATCATTGGTGGACGAGAGAGCAGTGCTAAAGTACTTCCCACGGTGGCCAGAGGCAAAAGCAGATGCACTACGGGAGGCAGCATTCAGCTACCGAGACCTGAAGGGCTTAGAGAGCAAAGTGTGTATGTTTAGAGACAATCCAAAGGAAGAGATGAATGTTGTGTTGAAGAGAGCTCAAGCATTGCAAGACAGGGTGGAGCAGAGTGTGAGTAATATGGAGAGGACGAGGGAGTTCAATTGTAGGAAGTATCAAGCTTTTCAAATTCCCTGCCAATGGATGTTCGATTCCGCTTTGCCTACTCAGATAAAGATGAGCACATTGAGGCTGGCGAAGGAATACATGATAAGGATAACAAGAGAACTACAATCAACCGAGACCCCACAAAGAGAAAACCTTTTCCTCCAAGGGGCTCGATTTGCGTACAGGGTTCATCAGTATGCAGGTGGTTTCGATTCAGAGACTATAGAGGCTTTTGAAGGACTGAAGAAAGCTGGGCTGAGTAGTCAAAGAAAATAG

Coding sequence (CDS)

ATGCCAAAGGAAGAAGATGAAGTATTAGCTATGGAGATCAATTGCTTGAAAAAAGAATTGGAAATTTCTCTACAAAAATCAATTTTTCTCGAAAAAGAAAATCAAGAACTCAGACAGGAACTGAATCGATTGCGATCCCAAATTCAGTCATTCAAAGCTCAAAACAATGAGAGAAAATCCATTCTCTGGAAGAAATTTCATAGCTCCATCGACATTTCCGTAGCCGGAGCTGACTCGCCGCCGCTAAGCCCTGCCACGGTGGCGGGTGATAAACGGGAGTCGACCAAATCGCCGAAACAGAGTAGTTGGGATGATGTGAAAGAGAGTCATAGAATGACGGGGGTACCGGCATCGCCACCGCCACCGCCACCGCCGCCACTTCCGACGAAACTGCTCGGAGGATCAAAGGCAGTGCGGCGTGTTCCGGAAGTGTTGGAGTTGTATCGTACACTGACGAAAAGAGATGCACAGAAGGAAAATAAAGTGGCACACGGCGGAGCTCCGGCTGTGGCGTTCACTAAAAACATGATCGGCGAAATTGAAAACCGATCTGCCTATCTCTCGGCGATAAAATCCGAGGTAGAAACACATGGAGATTTTGTGAATTGGTTGATCAAAGAAGTAGAAACGATAGCACCAAGAGACATATCAGAGGTGGAAAGGTTCGTGAAGTGGCTTGATGGGAAGTTGGCATCATTGGTGGACGAGAGAGCAGTGCTAAAGTACTTCCCACGGTGGCCAGAGGCAAAAGCAGATGCACTACGGGAGGCAGCATTCAGCTACCGAGACCTGAAGGGCTTAGAGAGCAAAGTGTGTATGTTTAGAGACAATCCAAAGGAAGAGATGAATGTTGTGTTGAAGAGAGCTCAAGCATTGCAAGACAGGGTGGAGCAGAGTGTGAGTAATATGGAGAGGACGAGGGAGTTCAATTGTAGGAAGTATCAAGCTTTTCAAATTCCCTGCCAATGGATGTTCGATTCCGCTTTGCCTACTCAGATAAAGATGAGCACATTGAGGCTGGCGAAGGAATACATGATAAGGATAACAAGAGAACTACAATCAACCGAGACCCCACAAAGAGAAAACCTTTTCCTCCAAGGGGCTCGATTTGCGTACAGGGTTCATCAGTATGCAGGTGGTTTCGATTCAGAGACTATAGAGGCTTTTGAAGGACTGAAGAAAGCTGGGCTGAGTAGTCAAAGAAAATAG

Protein sequence

MPKEEDEVLAMEINCLKKELEISLQKSIFLEKENQELRQELNRLRSQIQSFKAQNNERKSILWKKFHSSIDISVAGADSPPLSPATVAGDKRESTKSPKQSSWDDVKESHRMTGVPASPPPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVAHGGAPAVAFTKNMIGEIENRSAYLSAIKSEVETHGDFVNWLIKEVETIAPRDISEVERFVKWLDGKLASLVDERAVLKYFPRWPEAKADALREAAFSYRDLKGLESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSVSNMERTREFNCRKYQAFQIPCQWMFDSALPTQIKMSTLRLAKEYMIRITRELQSTETPQRENLFLQGARFAYRVHQYAGGFDSETIEAFEGLKKAGLSSQRK
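
The protein sequence above is consistent with translating the CDS using the standard genetic code. As a minimal sketch (the helper `translate` is illustrative, and only the first and last codons of the CDS are used here), the translation can be checked like this:

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, amino acids listed in TCAG codon order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

print(translate("ATGCCAAAGGAAGAAGATGAA"))  # first 7 codons of the CDS: MPKEEDE
print(translate("AGTCAAAGAAAATAG"))        # last 5 codons of the CDS: SQRK*
```

The trailing `*` is the stop codon (TAG), which is why the protein sequence ends at ...SSQRK.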
BLAST of CsaV3_1G036390 vs. NCBI nr
Match: XP_011658693.1 (PREDICTED: protein CHUP1, chloroplastic isoform X1 [Cucumis sativus] >KGN65828.1 hypothetical protein Csa_1G532360 [Cucumis sativus])

HSP 1 Score: 753.1 bits (1943), Expect = 5.0e-214
Identity = 402/402 (100.00%), Positives = 402/402 (100.00%), Query Frame = 0

Query: 1   MPKEEDEVLAMEINCLKKELEISLQKSIFLEKENQELRQELNRLRSQIQSFKAQNNERKS 60
           MPKEEDEVLAMEINCLKKELEISLQKSIFLEKENQELRQELNRLRSQIQSFKAQNNERKS
Sbjct: 1   MPKEEDEVLAMEINCLKKELEISLQKSIFLEKENQELRQELNRLRSQIQSFKAQNNERKS 60

Query: 61  ILWKKFHSSIDISVAGADSPPLSPATVAGDKRESTKSPKQSSWDDVKESHRMTGVPAXXX 120
           ILWKKFHSSIDISVAGADSPPLSPATVAGDKRESTKSPKQSSWDDVKESHRMTGVPAXXX
Sbjct: 61  ILWKKFHSSIDISVAGADSPPLSPATVAGDKRESTKSPKQSSWDDVKESHRMTGVPAXXX 120

Query: 121 XXXXXXXXXKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVAHGGAPAVAFTKNMIGEI 180
           XXXXXXXXXKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVAHGGAPAVAFTKNMIGEI
Sbjct: 121 XXXXXXXXXKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVAHGGAPAVAFTKNMIGEI 180

Query: 181 ENRSAYLSAIKSEVETHGDFVNWLIKEVETIAPRDISEVERFVKWLDGKLASLVDERAVL 240
           ENRSAYLSAIKSEVETHGDFVNWLIKEVETIAPRDISEVERFVKWLDGKLASLVDERAVL
Sbjct: 181 ENRSAYLSAIKSEVETHGDFVNWLIKEVETIAPRDISEVERFVKWLDGKLASLVDERAVL 240

Query: 241 KYFPRWPEAKADALREAAFSYRDLKGLESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSV 300
           KYFPRWPEAKADALREAAFSYRDLKGLESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSV
Sbjct: 241 KYFPRWPEAKADALREAAFSYRDLKGLESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSV 300

Query: 301 SNMERTREFNCRKYQAFQIPCQWMFDSALPTQIKMSTLRLAKEYMIRITRELQSTETPQR 360
           SNMERTREFNCRKYQAFQIPCQWMFDSALPTQIKMSTLRLAKEYMIRITRELQSTETPQR
Sbjct: 301 SNMERTREFNCRKYQAFQIPCQWMFDSALPTQIKMSTLRLAKEYMIRITRELQSTETPQR 360

Query: 361 ENLFLQGARFAYRVHQYAGGFDSETIEAFEGLKKAGLSSQRK 403
           ENLFLQGARFAYRVHQYAGGFDSETIEAFEGLKKAGLSSQRK
Sbjct: 361 ENLFLQGARFAYRVHQYAGGFDSETIEAFEGLKKAGLSSQRK 402

BLAST of CsaV3_1G036390 vs. NCBI nr
Match: XP_011658695.1 (PREDICTED: protein CHUP1, chloroplastic isoform X2 [Cucumis sativus])

HSP 1 Score: 702.6 bits (1812), Expect = 7.8e-199
Identity = 376/376 (100.00%), Positives = 376/376 (100.00%), Query Frame = 0

Query: 1   MPKEEDEVLAMEINCLKKELEISLQKSIFLEKENQELRQELNRLRSQIQSFKAQNNERKS 60
           MPKEEDEVLAMEINCLKKELEISLQKSIFLEKENQELRQELNRLRSQIQSFKAQNNERKS
Sbjct: 1   MPKEEDEVLAMEINCLKKELEISLQKSIFLEKENQELRQELNRLRSQIQSFKAQNNERKS 60

Query: 61  ILWKKFHSSIDISVAGADSPPLSPATVAGDKRESTKSPKQSSWDDVKESHRMTGVPAXXX 120
           ILWKKFHSSIDISVAGADSPPLSPATVAGDKRESTKSPKQSSWDDVKESHRMTGVPAXXX
Sbjct: 61  ILWKKFHSSIDISVAGADSPPLSPATVAGDKRESTKSPKQSSWDDVKESHRMTGVPAXXX 120

Query: 121 XXXXXXXXXKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVAHGGAPAVAFTKNMIGEI 180
           XXXXXXXXXKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVAHGGAPAVAFTKNMIGEI
Sbjct: 121 XXXXXXXXXKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVAHGGAPAVAFTKNMIGEI 180

Query: 181 ENRSAYLSAIKSEVETHGDFVNWLIKEVETIAPRDISEVERFVKWLDGKLASLVDERAVL 240
           ENRSAYLSAIKSEVETHGDFVNWLIKEVETIAPRDISEVERFVKWLDGKLASLVDERAVL
Sbjct: 181 ENRSAYLSAIKSEVETHGDFVNWLIKEVETIAPRDISEVERFVKWLDGKLASLVDERAVL 240

Query: 241 KYFPRWPEAKADALREAAFSYRDLKGLESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSV 300
           KYFPRWPEAKADALREAAFSYRDLKGLESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSV
Sbjct: 241 KYFPRWPEAKADALREAAFSYRDLKGLESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSV 300

Query: 301 SNMERTREFNCRKYQAFQIPCQWMFDSALPTQIKMSTLRLAKEYMIRITRELQSTETPQR 360
           SNMERTREFNCRKYQAFQIPCQWMFDSALPTQIKMSTLRLAKEYMIRITRELQSTETPQR
Sbjct: 301 SNMERTREFNCRKYQAFQIPCQWMFDSALPTQIKMSTLRLAKEYMIRITRELQSTETPQR 360

Query: 361 ENLFLQGARFAYRVHQ 377
           ENLFLQGARFAYRVHQ
Sbjct: 361 ENLFLQGARFAYRVHQ 376

BLAST of CsaV3_1G036390 vs. NCBI nr
Match: XP_008457349.1 (PREDICTED: protein CHUP1, chloroplastic isoform X1 [Cucumis melo])

HSP 1 Score: 674.9 bits (1740), Expect = 1.7e-190
Identity = 361/402 (89.80%), Positives = 377/402 (93.78%), Query Frame = 0

Query: 1   MPKEEDEVLAMEINCLKKELEISLQKSIFLEKENQELRQELNRLRSQIQSFKAQNNERKS 60
           MPKE+DE LAMEI+CLKK+LEISLQKSIFLE+ENQELR ELNRL+SQIQS KA NNERKS
Sbjct: 1   MPKEKDEELAMEIDCLKKDLEISLQKSIFLERENQELRLELNRLKSQIQSLKALNNERKS 60

Query: 61  ILWKKFHSSIDISVAGADSPPLSPATVAGDKRESTKSPKQSSWDDVKESHRMTGVPAXXX 120
           ILWKKFHSS+D++VAGADSPPL+PAT AGDKRE TK PKQSSWDDVKES RMT VP XXX
Sbjct: 61  ILWKKFHSSMDMAVAGADSPPLNPATAAGDKREVTKFPKQSSWDDVKESQRMTAVPXXXX 120

Query: 121 XXXXXXXXXKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVAHGGAPAVAFTKNMIGEI 180
           XXXXXXXXX      KAVRRVPEVL+LYRTLTKRDAQKENKVAHGGAP VAFTKNMIGEI
Sbjct: 121 XXXXXXXXXXXXXXXKAVRRVPEVLDLYRTLTKRDAQKENKVAHGGAPVVAFTKNMIGEI 180

Query: 181 ENRSAYLSAIKSEVETHGDFVNWLIKEVETIAPRDISEVERFVKWLDGKLASLVDERAVL 240
           ENRSAYLSAIKSEVETHG+FVNWLIKEVE IAPRDISE E+FVKWLD KLASLVDERAVL
Sbjct: 181 ENRSAYLSAIKSEVETHGEFVNWLIKEVEMIAPRDISEAEKFVKWLDVKLASLVDERAVL 240

Query: 241 KYFPRWPEAKADALREAAFSYRDLKGLESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSV 300
           K+FPRWPEAKADALREAAFSYRDLK LESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSV
Sbjct: 241 KHFPRWPEAKADALREAAFSYRDLKSLESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSV 300

Query: 301 SNMERTREFNCRKYQAFQIPCQWMFDSALPTQIKMSTLRLAKEYMIRITRELQSTETPQR 360
           SNMERTREFNC+KYQAFQIPCQWMFDSALPTQIK+STLRLAKEYMIRITREL+STET Q 
Sbjct: 301 SNMERTREFNCKKYQAFQIPCQWMFDSALPTQIKLSTLRLAKEYMIRITRELRSTETSQA 360

Query: 361 ENLFLQGARFAYRVHQYAGGFDSETIEAFEGLKKAGLSSQRK 403
           ENLFLQG RFAYRVHQYAGGFDSE IEAFEGLKKAGLSSQRK
Sbjct: 361 ENLFLQGVRFAYRVHQYAGGFDSEAIEAFEGLKKAGLSSQRK 402

BLAST of CsaV3_1G036390 vs. NCBI nr
Match: XP_008457350.1 (PREDICTED: protein CHUP1, chloroplastic isoform X2 [Cucumis melo])

HSP 1 Score: 626.3 bits (1614), Expect = 7.1e-176
Identity = 336/376 (89.36%), Positives = 352/376 (93.62%), Query Frame = 0

Query: 1   MPKEEDEVLAMEINCLKKELEISLQKSIFLEKENQELRQELNRLRSQIQSFKAQNNERKS 60
           MPKE+DE LAMEI+CLKK+LEISLQKSIFLE+ENQELR ELNRL+SQIQS KA NNERKS
Sbjct: 1   MPKEKDEELAMEIDCLKKDLEISLQKSIFLERENQELRLELNRLKSQIQSLKALNNERKS 60

Query: 61  ILWKKFHSSIDISVAGADSPPLSPATVAGDKRESTKSPKQSSWDDVKESHRMTGVPAXXX 120
           ILWKKFHSS+D++VAGADSPPL+PAT AGDKRE TK PKQSSWDDVKES RMT VP XXX
Sbjct: 61  ILWKKFHSSMDMAVAGADSPPLNPATAAGDKREVTKFPKQSSWDDVKESQRMTAVPXXXX 120

Query: 121 XXXXXXXXXKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVAHGGAPAVAFTKNMIGEI 180
           XXXXXXXXX      KAVRRVPEVL+LYRTLTKRDAQKENKVAHGGAP VAFTKNMIGEI
Sbjct: 121 XXXXXXXXXXXXXXXKAVRRVPEVLDLYRTLTKRDAQKENKVAHGGAPVVAFTKNMIGEI 180

Query: 181 ENRSAYLSAIKSEVETHGDFVNWLIKEVETIAPRDISEVERFVKWLDGKLASLVDERAVL 240
           ENRSAYLSAIKSEVETHG+FVNWLIKEVE IAPRDISE E+FVKWLD KLASLVDERAVL
Sbjct: 181 ENRSAYLSAIKSEVETHGEFVNWLIKEVEMIAPRDISEAEKFVKWLDVKLASLVDERAVL 240

Query: 241 KYFPRWPEAKADALREAAFSYRDLKGLESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSV 300
           K+FPRWPEAKADALREAAFSYRDLK LESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSV
Sbjct: 241 KHFPRWPEAKADALREAAFSYRDLKSLESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSV 300

Query: 301 SNMERTREFNCRKYQAFQIPCQWMFDSALPTQIKMSTLRLAKEYMIRITRELQSTETPQR 360
           SNMERTREFNC+KYQAFQIPCQWMFDSALPTQIK+STLRLAKEYMIRITREL+STET Q 
Sbjct: 301 SNMERTREFNCKKYQAFQIPCQWMFDSALPTQIKLSTLRLAKEYMIRITRELRSTETSQA 360

Query: 361 ENLFLQGARFAYRVHQ 377
           ENLFLQG RFAYRVHQ
Sbjct: 361 ENLFLQGVRFAYRVHQ 376

BLAST of CsaV3_1G036390 vs. NCBI nr
Match: XP_023523072.1 (protein CHUP1, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo] >XP_023523080.1 protein CHUP1, chloroplastic isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 567.0 bits (1460), Expect = 5.1e-158
Identity = 309/403 (76.67%), Positives = 346/403 (85.86%), Query Frame = 0

Query: 1   MPKEEDEVLAMEINCLKKELEISLQKSIFLEKENQELRQELNRLRSQIQSFKAQNNERKS 60
           MP EEDE LAMEI+ LK+ELEISLQKS FLEKENQEL+QEL R +S IQS KA NN+RKS
Sbjct: 1   MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHIQSLKAHNNDRKS 60

Query: 61  ILWKKFHSSIDISVAGADSPPLSPATVAGDKRESTKSPKQSSWDDVKESHRMTGVPAXXX 120
           ILWKKFH+S+D++VAG DS P SP   A DK E+T++ KQS+W  VKE+        XXX
Sbjct: 61  ILWKKFHNSMDVAVAGTDSSPQSPP--ATDKWETTRTQKQSNWAVVKENQXXXXXXXXXX 120

Query: 121 XXXXXXXXXKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVAHGGAPAVAFTKNMIGEI 180
           XXXXXXXXXKLLGGSKAVRRVPEVLELYR +TKRDAQKENK A+GG PAVAFTKNMIGEI
Sbjct: 121 XXXXXXXXXKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEI 180

Query: 181 ENRSAYLSAIKSEVETHGDFVNWLIKEVETIAPRDISEVERFVKWLDGKLASLVDERAVL 240
           ENRSAYLSAIKSEVETHG+FVN LI+EVE  APRDI+EVERFVKWLDG+L SLVDERAVL
Sbjct: 181 ENRSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWLDGELGSLVDERAVL 240

Query: 241 KYFPRWPEAKADALREAAFSYRDLKGLESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSV 300
           K+FPRWPE KADALREAAFSY+DLK LE++VC FR+NPKEE N +LKRAQALQDR+EQSV
Sbjct: 241 KHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSV 300

Query: 301 SNMERTREFNCRKYQAFQIPCQWMFDSALPTQIKMSTLRLAKEYMIRITRELQSTETPQR 360
           SN+ERTREFNC+KY  FQIPCQWM DS LP Q+K+S+LRL KE M RIT+E+Q  ETPQ 
Sbjct: 301 SNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEIQLNETPQT 360

Query: 361 ENLFLQGARFAYRVHQYAGGFDSETIEAFEGLKKAGLS-SQRK 403
           ENLFLQG RFAYRVHQYAGGFDSE I AFEG+K+ GL  +QRK
Sbjct: 361 ENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK 401

BLAST of CsaV3_1G036390 vs. TAIR10
Match: AT1G07120.1 (FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 289.3 bits (739), Expect = 3.7e-78
Identity = 180/394 (45.69%), Positives = 243/394 (61.68%), Query Frame = 0

Query: 1   MPKEEDEVLAMEINCLKKELEISLQKSIFLEKENQELRQELNRLRSQIQSFKAQNNERKS 60
           +P  ED+    ++  L KEL+  L ++  LEKE                        RKS
Sbjct: 2   LPNGEDD---SDLLRLVKELQAYLVRNDKLEKEXXXXXXXXXXXXXXXXXXXXXXXXRKS 61

Query: 61  ILWKKFHSSIDISVAGADSPPLSPATVAGDKRESTKSPKQSSWDDVKESHRMTGVPAXXX 120
           +LWKK  SS D S     +     +  +  K +  ++P                   XXX
Sbjct: 62  MLWKKLQSSYDGSNTDGSNLKAPESVKSNTKGQEVRNP--------XXXXXXXXXXXXXX 121

Query: 121 XXXXXXXXXKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVAHGGAPAVAFTKNMIGEI 180
           XXXXXXXXXK   G ++VRR PEV+E YR LTKR++   NK+   G  + AF +NMIGEI
Sbjct: 122 XXXXXXXXXKRTLGKRSVRRAPEVVEFYRALTKRESHMGNKINQNGVLSPAFNRNMIGEI 181

Query: 181 ENRSAYLSAIKSEVETHGDFVNWLIKEVETIAPRDISEVERFVKWLDGKLASLVDERAVL 240
           ENRS YLS IKS+ + H D ++ LI +VE     DISEVE FVKW+D +L+SLVDERAVL
Sbjct: 182 ENRSKYLSDIKSDTDRHRDHIHILISKVEAATFTDISEVETFVKWIDEELSSLVDERAVL 241

Query: 241 KYFPRWPEAKADALREAAFSYRDLKGLESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSV 300
           K+FP+WPE K D+LREAA +Y+  K L +++  F+DNPK+ +   L+R Q+LQDR+E+SV
Sbjct: 242 KHFPKWPERKVDSLREAACNYKRPKNLGNEILSFKDNPKDSLTQALQRIQSLQDRLEESV 301

Query: 301 SNMERTREFNCRKYQAFQIPCQWMFDSALPTQIKMSTLRLAKEYMIRITRELQSTETPQR 360
           +N E+ R+   ++Y+ FQIP +WM D+ L  Q+K S+LRLA+EYM RI +EL+S  + + 
Sbjct: 302 NNTEKMRDSTGKRYKDFQIPWEWMLDTGLIGQLKYSSLRLAQEYMKRIAKELESNGSGKE 361

Query: 361 ENLFLQGARFAYRVHQYAGGFDSETIEAFEGLKK 395
            NL LQG RFAY +HQ+AGGFD ET+  F  LKK
Sbjct: 362 GNLMLQGVRFAYTIHQFAGGFDGETLSIFHELKK 384

BLAST of CsaV3_1G036390 vs. TAIR10
Match: AT3G25690.1 (Hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 237.3 bits (604), Expect = 1.7e-62
Identity = 129/263 (49.05%), Positives = 178/263 (67.68%), Query Frame = 0

Query: 138 VRRVPEVLELYRTLTKRDAQKE---NKVAHGGAPAVAFTKNMIGEIENRSAYLSAIKSEV 197
           V R PE++E Y++L KR+++KE   + ++ G   + A   NMIGEIENRS +L A+K++V
Sbjct: 718 VHRAPELVEFYQSLMKRESKKEGAPSLISSGTGNSSAARNNMIGEIENRSTFLLAVKADV 777

Query: 198 ETHGDFVNWLIKEVETIAPRDISEVERFVKWLDGKLASLVDERAVLKYFPRWPEAKADAL 257
           ET GDFV  L  EV   +  DI ++  FV WLD +L+ LVDERAVLK+F  WPE KADAL
Sbjct: 778 ETQGDFVQSLATEVRASSFTDIEDLLAFVSWLDEELSFLVDERAVLKHFD-WPEGKADAL 837

Query: 258 REAAFSYRDLKGLESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSVSNMERTREFNCRKY 317
           REAAF Y+DL  LE +V  F D+P       LK+   L ++VEQSV  + RTR+    +Y
Sbjct: 838 REAAFEYQDLMKLEKQVTSFVDDPNLSCEPALKKMYKLLEKVEQSVYALLRTRDMAISRY 897

Query: 318 QAFQIPCQWMFDSALPTQIKMSTLRLAKEYMIRITRELQST----ETPQRENLFLQGARF 377
           + F IP  W+ D+ +  +IK+S+++LAK+YM R+  EL S     + P RE L LQG RF
Sbjct: 898 KEFGIPVDWLSDTGVVGKIKLSSVQLAKKYMKRVAYELDSVSGSDKDPNREFLLLQGVRF 957

Query: 378 AYRVHQYAGGFDSETIEAFEGLK 394
           A+RVHQ+AGGFD+E+++AFE L+
Sbjct: 958 AFRVHQFAGGFDAESMKAFEELR 979

BLAST of CsaV3_1G036390 vs. TAIR10
Match: AT4G18570.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 235.0 bits (598), Expect = 8.3e-62
Identity = 119/222 (53.60%), Positives = 166/222 (74.77%), Query Frame = 0

Query: 174 KNMIGEIENRSAYLSAIKSEVETHGDFVNWLIKEVETIAPRDISEVERFVKWLDGKLASL 233
           ++MIGEIENRS YL AIK++VET GDF+ +LIKEV   A  DI +V  FVKWLD +L+ L
Sbjct: 396 RDMIGEIENRSVYLLAIKTDVETQGDFIRFLIKEVGNAAFSDIEDVVPFVKWLDDELSYL 455

Query: 234 VDERAVLKYFPRWPEAKADALREAAFSYRDLKGLESKVCMFRDNPKEEMNVVLKRAQALQ 293
           VDERAVLK+F  WPE KADALREAAF Y DLK L S+   FR++P++  +  LK+ QAL 
Sbjct: 456 VDERAVLKHF-EWPEQKADALREAAFCYFDLKKLISEASRFREDPRQSSSSALKKMQALF 515

Query: 294 DRVEQSVSNMERTREFNCRKYQAFQIPCQWMFDSALPTQIKMSTLRLAKEYMIRITRELQ 353
           +++E  V ++ R RE    K+++FQIP  WM ++ + +QIK+++++LA +YM R++ EL+
Sbjct: 516 EKLEHGVYSLSRMRESAATKFKSFQIPVDWMLETGITSQIKLASVKLAMKYMKRVSAELE 575

Query: 354 STE--TPQRENLFLQGARFAYRVHQYAGGFDSETIEAFEGLK 394
           + E   P+ E L +QG RFA+RVHQ+AGGFD+ET++AFE L+
Sbjct: 576 AIEGGGPEEEELIVQGVRFAFRVHQFAGGFDAETMKAFEELR 616

BLAST of CsaV3_1G036390 vs. TAIR10
Match: AT1G48280.1 (hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 141.7 bits (356), Expect = 9.6e-34
Identity = 127/407 (31.20%), Positives = 189/407 (46.44%), Query Frame = 0

Query: 9   LAMEINCLKKELEISLQKSIFLEKENQELRQELNRLRSQIQSFKAQNNERKSILWKKFHS 68
           L +++  LK ELE +   ++ LE  N++L Q+L    ++I S  + +   K     +F  
Sbjct: 135 LQLQVLNLKTELEEARNSNVELELNNRKLSQDLVSAEAKISSLSSNDKPAKEHQNSRFKD 194

Query: 69  SIDISVAGADSPPLSPATVAGDKRESTKSPKQSSWDDVKESHRMTGVPA----------- 128
              +  +  + P +         R S  SP  S              PA           
Sbjct: 195 IQRLIASKLEQPKVKKEVAVESSRLSPPSPSPSRLPPXXXXXXFLVSPASSLGKRDXXXX 254

Query: 129 ----XXXXXXXXXXXXKLLGGSKAVRRVPEVLELYRTLTKRDAQKE-NKVAHGGAPAVAF 188
               XXXXXXXXXXXX                     L K+D  +  ++  +G    V  
Sbjct: 255 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLNKQDNSRNLSQSVNGNKSQVNS 314

Query: 189 TKN-MIGEIENRSAYLSAIKSEVETHGDFVNWLIKEVETIAPRDISEVERFVKWLDGKLA 248
             N ++GEI+NRSA+L AIK+++ET G+F+N LI++V T    D+ +V +FV WLD +LA
Sbjct: 315 AHNSIVGEIQNRSAHLIAIKADIETKGEFINDLIQKVLTTCFSDMEDVMKFVDWLDKELA 374

Query: 249 SLVDERAVLKYFPRWPEAKADALREAAFSYRDLKGLESKVCMFRDNPKEEMNVVLKRAQA 308
           +L                                    ++  + D+P     V LK+   
Sbjct: 375 TLA-XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXELSSYSDDPNIHYGVALKKMAN 434

Query: 309 LQDRVEQSVSNMERTREFNCRKYQAFQIPCQWMFDSALPTQIKMSTLRLAKEYMIRITRE 368
           L D+ EQ +  + R R  + R YQ F+IP +WM DS +  +IK ++++LAK YM R+  E
Sbjct: 435 LLDKSEQRIRRLVRLRGSSMRSYQDFKIPVEWMLDSGMICKIKRASIKLAKTYMNRVANE 494

Query: 369 LQSTETPQREN----LFLQGARFAYRVHQYAGGFDSETIEAFEGLKK 395
           LQS     RE+    L LQG RFAYR HQ+AGG D ET+ A E +K+
Sbjct: 495 LQSARNLDRESTKEALLLQGVRFAYRTHQFAGGLDPETLCALEEIKQ 540

BLAST of CsaV3_1G036390 vs. TAIR10
Match: AT1G11070.1 (BEST Arabidopsis thaliana protein match is: Hydroxyproline-rich glycoprotein family protein (TAIR:AT1G61080.1))

HSP 1 Score: 53.1 bits (126), Expect = 4.5e-07
Identity = 35/116 (30.17%), Positives = 57/116 (49.14%), Query Frame = 0

Query: 154 RDAQKENKVAHGGAPAVA--FTKNMIGEIENRSAYLSAIKSEVETHGDFVNWLIKEVETI 213
           R A   +K A G APA       + + EI  +S Y   I+ +V  +   +N L  ++   
Sbjct: 481 RGAGGGSKGATGSAPASGKQGMADALAEITKKSPYFQKIEEDVRMYMTSINELKTDITKF 540

Query: 214 APRDISEVERFVKWLDGKLASLVDERAVLKYFPRWPEAKADALREAAFSYRDLKGL 268
             +DI+E+++F   ++  L  L DE  VL     +P  K +A+R AA  Y  L+G+
Sbjct: 541 KNKDITELQKFHHRIESVLEKLEDETQVLARCEGFPHKKLEAIRMAAALYSKLEGM 596

BLAST of CsaV3_1G036390 vs. Swiss-Prot
Match: sp|Q9LI74|CHUP1_ARATH (Protein CHUP1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CHUP1 PE=1 SV=1)

HSP 1 Score: 237.3 bits (604), Expect = 3.0e-61
Identity = 129/263 (49.05%), Positives = 178/263 (67.68%), Query Frame = 0

Query: 138 VRRVPEVLELYRTLTKRDAQKE---NKVAHGGAPAVAFTKNMIGEIENRSAYLSAIKSEV 197
           V R PE++E Y++L KR+++KE   + ++ G   + A   NMIGEIENRS +L A+K++V
Sbjct: 718 VHRAPELVEFYQSLMKRESKKEGAPSLISSGTGNSSAARNNMIGEIENRSTFLLAVKADV 777

Query: 198 ETHGDFVNWLIKEVETIAPRDISEVERFVKWLDGKLASLVDERAVLKYFPRWPEAKADAL 257
           ET GDFV  L  EV   +  DI ++  FV WLD +L+ LVDERAVLK+F  WPE KADAL
Sbjct: 778 ETQGDFVQSLATEVRASSFTDIEDLLAFVSWLDEELSFLVDERAVLKHFD-WPEGKADAL 837

Query: 258 REAAFSYRDLKGLESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSVSNMERTREFNCRKY 317
           REAAF Y+DL  LE +V  F D+P       LK+   L ++VEQSV  + RTR+    +Y
Sbjct: 838 REAAFEYQDLMKLEKQVTSFVDDPNLSCEPALKKMYKLLEKVEQSVYALLRTRDMAISRY 897

Query: 318 QAFQIPCQWMFDSALPTQIKMSTLRLAKEYMIRITRELQST----ETPQRENLFLQGARF 377
           + F IP  W+ D+ +  +IK+S+++LAK+YM R+  EL S     + P RE L LQG RF
Sbjct: 898 KEFGIPVDWLSDTGVVGKIKLSSVQLAKKYMKRVAYELDSVSGSDKDPNREFLLLQGVRF 957

Query: 378 AYRVHQYAGGFDSETIEAFEGLK 394
           A+RVHQ+AGGFD+E+++AFE L+
Sbjct: 958 AFRVHQFAGGFDAESMKAFEELR 979

BLAST of CsaV3_1G036390 vs. TrEMBL
Match: tr|A0A0A0LVK7|A0A0A0LVK7_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G532360 PE=4 SV=1)

HSP 1 Score: 753.1 bits (1943), Expect = 3.3e-214
Identity = 402/402 (100.00%), Positives = 402/402 (100.00%), Query Frame = 0

Query: 1   MPKEEDEVLAMEINCLKKELEISLQKSIFLEKENQELRQELNRLRSQIQSFKAQNNERKS 60
           MPKEEDEVLAMEINCLKKELEISLQKSIFLEKENQELRQELNRLRSQIQSFKAQNNERKS
Sbjct: 1   MPKEEDEVLAMEINCLKKELEISLQKSIFLEKENQELRQELNRLRSQIQSFKAQNNERKS 60

Query: 61  ILWKKFHSSIDISVAGADSPPLSPATVAGDKRESTKSPKQSSWDDVKESHRMTGVPAXXX 120
           ILWKKFHSSIDISVAGADSPPLSPATVAGDKRESTKSPKQSSWDDVKESHRMTGVPAXXX
Sbjct: 61  ILWKKFHSSIDISVAGADSPPLSPATVAGDKRESTKSPKQSSWDDVKESHRMTGVPAXXX 120

Query: 121 XXXXXXXXXKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVAHGGAPAVAFTKNMIGEI 180
           XXXXXXXXXKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVAHGGAPAVAFTKNMIGEI
Sbjct: 121 XXXXXXXXXKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVAHGGAPAVAFTKNMIGEI 180

Query: 181 ENRSAYLSAIKSEVETHGDFVNWLIKEVETIAPRDISEVERFVKWLDGKLASLVDERAVL 240
           ENRSAYLSAIKSEVETHGDFVNWLIKEVETIAPRDISEVERFVKWLDGKLASLVDERAVL
Sbjct: 181 ENRSAYLSAIKSEVETHGDFVNWLIKEVETIAPRDISEVERFVKWLDGKLASLVDERAVL 240

Query: 241 KYFPRWPEAKADALREAAFSYRDLKGLESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSV 300
           KYFPRWPEAKADALREAAFSYRDLKGLESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSV
Sbjct: 241 KYFPRWPEAKADALREAAFSYRDLKGLESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSV 300

Query: 301 SNMERTREFNCRKYQAFQIPCQWMFDSALPTQIKMSTLRLAKEYMIRITRELQSTETPQR 360
           SNMERTREFNCRKYQAFQIPCQWMFDSALPTQIKMSTLRLAKEYMIRITRELQSTETPQR
Sbjct: 301 SNMERTREFNCRKYQAFQIPCQWMFDSALPTQIKMSTLRLAKEYMIRITRELQSTETPQR 360

Query: 361 ENLFLQGARFAYRVHQYAGGFDSETIEAFEGLKKAGLSSQRK 403
           ENLFLQGARFAYRVHQYAGGFDSETIEAFEGLKKAGLSSQRK
Sbjct: 361 ENLFLQGARFAYRVHQYAGGFDSETIEAFEGLKKAGLSSQRK 402

BLAST of CsaV3_1G036390 vs. TrEMBL
Match: tr|A0A1S3C4V9|A0A1S3C4V9_CUCME (protein CHUP1, chloroplastic isoform X1 OS=Cucumis melo OX=3656 GN=LOC103497059 PE=4 SV=1)

HSP 1 Score: 674.9 bits (1740), Expect = 1.2e-190
Identity = 361/402 (89.80%), Positives = 377/402 (93.78%), Query Frame = 0

Query: 1   MPKEEDEVLAMEINCLKKELEISLQKSIFLEKENQELRQELNRLRSQIQSFKAQNNERKS 60
           MPKE+DE LAMEI+CLKK+LEISLQKSIFLE+ENQELR ELNRL+SQIQS KA NNERKS
Sbjct: 1   MPKEKDEELAMEIDCLKKDLEISLQKSIFLERENQELRLELNRLKSQIQSLKALNNERKS 60

Query: 61  ILWKKFHSSIDISVAGADSPPLSPATVAGDKRESTKSPKQSSWDDVKESHRMTGVPAXXX 120
           ILWKKFHSS+D++VAGADSPPL+PAT AGDKRE TK PKQSSWDDVKES RMT VP XXX
Sbjct: 61  ILWKKFHSSMDMAVAGADSPPLNPATAAGDKREVTKFPKQSSWDDVKESQRMTAVPXXXX 120

Query: 121 XXXXXXXXXKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVAHGGAPAVAFTKNMIGEI 180
           XXXXXXXXX      KAVRRVPEVL+LYRTLTKRDAQKENKVAHGGAP VAFTKNMIGEI
Sbjct: 121 XXXXXXXXXXXXXXXKAVRRVPEVLDLYRTLTKRDAQKENKVAHGGAPVVAFTKNMIGEI 180

Query: 181 ENRSAYLSAIKSEVETHGDFVNWLIKEVETIAPRDISEVERFVKWLDGKLASLVDERAVL 240
           ENRSAYLSAIKSEVETHG+FVNWLIKEVE IAPRDISE E+FVKWLD KLASLVDERAVL
Sbjct: 181 ENRSAYLSAIKSEVETHGEFVNWLIKEVEMIAPRDISEAEKFVKWLDVKLASLVDERAVL 240

Query: 241 KYFPRWPEAKADALREAAFSYRDLKGLESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSV 300
           K+FPRWPEAKADALREAAFSYRDLK LESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSV
Sbjct: 241 KHFPRWPEAKADALREAAFSYRDLKSLESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSV 300

Query: 301 SNMERTREFNCRKYQAFQIPCQWMFDSALPTQIKMSTLRLAKEYMIRITRELQSTETPQR 360
           SNMERTREFNC+KYQAFQIPCQWMFDSALPTQIK+STLRLAKEYMIRITREL+STET Q 
Sbjct: 301 SNMERTREFNCKKYQAFQIPCQWMFDSALPTQIKLSTLRLAKEYMIRITRELRSTETSQA 360

Query: 361 ENLFLQGARFAYRVHQYAGGFDSETIEAFEGLKKAGLSSQRK 403
           ENLFLQG RFAYRVHQYAGGFDSE IEAFEGLKKAGLSSQRK
Sbjct: 361 ENLFLQGVRFAYRVHQYAGGFDSEAIEAFEGLKKAGLSSQRK 402

BLAST of CsaV3_1G036390 vs. TrEMBL
Match: tr|A0A1S3C5E9|A0A1S3C5E9_CUCME (protein CHUP1, chloroplastic isoform X2 OS=Cucumis melo OX=3656 GN=LOC103497059 PE=4 SV=1)

HSP 1 Score: 626.3 bits (1614), Expect = 4.7e-176
Identity = 336/376 (89.36%), Positives = 352/376 (93.62%), Query Frame = 0

Query: 1   MPKEEDEVLAMEINCLKKELEISLQKSIFLEKENQELRQELNRLRSQIQSFKAQNNERKS 60
           MPKE+DE LAMEI+CLKK+LEISLQKSIFLE+ENQELR ELNRL+SQIQS KA NNERKS
Sbjct: 1   MPKEKDEELAMEIDCLKKDLEISLQKSIFLERENQELRLELNRLKSQIQSLKALNNERKS 60

Query: 61  ILWKKFHSSIDISVAGADSPPLSPATVAGDKRESTKSPKQSSWDDVKESHRMTGVPAXXX 120
           ILWKKFHSS+D++VAGADSPPL+PAT AGDKRE TK PKQSSWDDVKES RMT VP XXX
Sbjct: 61  ILWKKFHSSMDMAVAGADSPPLNPATAAGDKREVTKFPKQSSWDDVKESQRMTAVPXXXX 120

Query: 121 XXXXXXXXXKLLGGSKAVRRVPEVLELYRTLTKRDAQKENKVAHGGAPAVAFTKNMIGEI 180
           XXXXXXXXX      KAVRRVPEVL+LYRTLTKRDAQKENKVAHGGAP VAFTKNMIGEI
Sbjct: 121 XXXXXXXXXXXXXXXKAVRRVPEVLDLYRTLTKRDAQKENKVAHGGAPVVAFTKNMIGEI 180

Query: 181 ENRSAYLSAIKSEVETHGDFVNWLIKEVETIAPRDISEVERFVKWLDGKLASLVDERAVL 240
           ENRSAYLSAIKSEVETHG+FVNWLIKEVE IAPRDISE E+FVKWLD KLASLVDERAVL
Sbjct: 181 ENRSAYLSAIKSEVETHGEFVNWLIKEVEMIAPRDISEAEKFVKWLDVKLASLVDERAVL 240

Query: 241 KYFPRWPEAKADALREAAFSYRDLKGLESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSV 300
           K+FPRWPEAKADALREAAFSYRDLK LESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSV
Sbjct: 241 KHFPRWPEAKADALREAAFSYRDLKSLESKVCMFRDNPKEEMNVVLKRAQALQDRVEQSV 300

Query: 301 SNMERTREFNCRKYQAFQIPCQWMFDSALPTQIKMSTLRLAKEYMIRITRELQSTETPQR 360
           SNMERTREFNC+KYQAFQIPCQWMFDSALPTQIK+STLRLAKEYMIRITREL+STET Q 
Sbjct: 301 SNMERTREFNCKKYQAFQIPCQWMFDSALPTQIKLSTLRLAKEYMIRITRELRSTETSQA 360

Query: 361 ENLFLQGARFAYRVHQ 377
           ENLFLQG RFAYRVHQ
Sbjct: 361 ENLFLQGVRFAYRVHQ 376

BLAST of CsaV3_1G036390 vs. TrEMBL
Match: tr|A0A2I4EIH7|A0A2I4EIH7_9ROSI (protein CHUP1, chloroplastic isoform X2 OS=Juglans regia OX=51240 GN=LOC108989893 PE=4 SV=1)

HSP 1 Score: 403.3 bits (1035), Expect = 6.5e-109
Identity = 241/427 (56.44%), Positives = 300/427 (70.26%), Query Frame = 0

Query: 1   MPKEEDEVLAMEINCLKKELEISLQKSIFLEKENQELRQELNRLRSQIQSFKAQNNERKS 60
           M +E+DE    EIN LKK+LE SL+K   LEKENQEL+QE NRL+ QI S +A NNERK+
Sbjct: 1   MLQEDDE---SEINLLKKKLEASLEKIDSLEKENQELKQEANRLKVQISSLRAYNNERKT 60

Query: 61  ILWKKFHSSIDISVAGADSPPLSPATVAGDKRESTKSPKQSSWDDVKES----------- 120
           ILWKK  SS+D +    D+P   P+T      +S ++ K  +  D  ES           
Sbjct: 61  ILWKKLQSSLDGNC--TDAPQHKPSTFVNLSEQSPEAGKSCTRTDFPESTAEKPMRIXXX 120

Query: 121 ---------------HRMTGVPAXXXXXXXXXXXXKLLGGSKAVRRVPEVLELYRTLTKR 180
                                  XXXXXXXXXXXXK L GS+AVRRVPEV+ELYR+LT+R
Sbjct: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKPLVGSRAVRRVPEVIELYRSLTRR 180

Query: 181 DAQKENKVAHGGAPAVAFTKNMIGEIENRSAYLSAIKSEVETHGDFVNWLIKEVETIAPR 240
           D +KEN+    G+P VA TKNMIGEIENRS YLSAIKS+VE  G+F+ +L KEVE+   R
Sbjct: 181 DPRKENRTNPTGSPLVASTKNMIGEIENRSRYLSAIKSDVERRGEFIKFLTKEVESATYR 240

Query: 241 DISEVERFVKWLDGKLASLVDERAVLKYFPRWPEAKADALREAAFSYRDLKGLESKVCMF 300
           D+S+VE FVKWLDG+L+SLVDERAVLK+FP+WPE KADALREAA +YRDLK LES+V  F
Sbjct: 241 DVSDVEAFVKWLDGELSSLVDERAVLKHFPQWPERKADALREAACTYRDLKSLESEVSRF 300

Query: 301 RDNPKEEMNVVLKRAQALQDRVEQSVSNMERTREFNCRKYQAFQIPCQWMFDSALPTQIK 360
            DNPKE +   L+R QALQDR+EQS+ N+ER RE   ++Y+  QIP +WM D+ L  Q+K
Sbjct: 301 VDNPKEPLTQALRRIQALQDRLEQSIDNIERMRESTIKRYKDLQIPWEWMLDTGLLGQMK 360

Query: 361 MSTLRLAKEYMIRITRELQSTETPQRENLFLQGARFAYRVHQYAGGFDSETIEAFEGLKK 402
           +S+L+LA+EYM RI  ELQ+ E    +NL LQG R+AYRVHQ+AGGFD+E I+AFE LKK
Sbjct: 361 LSSLKLAREYMKRIANELQADECSCEDNLKLQGVRYAYRVHQFAGGFDAEAIQAFEKLKK 420

BLAST of CsaV3_1G036390 vs. TrEMBL
Match: tr|A0A2I4EII3|A0A2I4EII3_9ROSI (protein CHUP1, chloroplastic isoform X1 OS=Juglans regia OX=51240 GN=LOC108989893 PE=4 SV=1)

HSP 1 Score: 399.4 bits (1025), Expect = 9.3e-108
Identity = 241/428 (56.31%), Positives = 300/428 (70.09%), Query Frame = 0

Query: 1   MPKEEDEVLAMEINCLKKELEISLQKSIFLEKENQELRQELNRLRSQIQSFKAQNNERKS 60
           M +E+DE    EIN LKK+LE SL+K   LEKENQEL+QE NRL+ QI S +A NNERK+
Sbjct: 1   MLQEDDE---SEINLLKKKLEASLEKIDSLEKENQELKQEANRLKVQISSLRAYNNERKT 60

Query: 61  ILWKKFHSSIDISVAGADSPPLSPATVAGDKRESTKSPKQSSWDDVKES----------- 120
           ILWKK  SS+D +    D+P   P+T      +S ++ K  +  D  ES           
Sbjct: 61  ILWKKLQSSLDGNC--TDAPQHKPSTFVNLSEQSPEAGKSCTRTDFPESTAEKPMRIXXX 120

Query: 121 ---------------HRMTGVPAXXXXXXXXXXXXKLLGGSKAVRRVPEVLELYRTLTKR 180
                                  XXXXXXXXXXXXK L GS+AVRRVPEV+ELYR+LT+R
Sbjct: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKPLVGSRAVRRVPEVIELYRSLTRR 180

Query: 181 DAQKENKVAHGGAPAVAFTKNMIGEIENRSAYLSAIKSEVETHGDFVNWLIKEVETIAPR 240
           D +KEN+    G+P VA TKNMIGEIENRS YLSAIKS+VE  G+F+ +L KEVE+   R
Sbjct: 181 DPRKENRTNPTGSPLVASTKNMIGEIENRSRYLSAIKSDVERRGEFIKFLTKEVESATYR 240

Query: 241 DISEVERFVKWLDGKLASLVDERAVLKYFPRWPEAKADALREAAFSYRDLKGLESKVCMF 300
           D+S+VE FVKWLDG+L+SLVDERAVLK+FP+WPE KADALREAA +YRDLK LES+V  F
Sbjct: 241 DVSDVEAFVKWLDGELSSLVDERAVLKHFPQWPERKADALREAACTYRDLKSLESEVSRF 300

Query: 301 RDNPKEEMNVVLKRAQALQDRVEQSVSNMERTREFNCRKYQAFQIPCQWMFDSA-LPTQI 360
            DNPKE +   L+R QALQDR+EQS+ N+ER RE   ++Y+  QIP +WM D+  L  Q+
Sbjct: 301 VDNPKEPLTQALRRIQALQDRLEQSIDNIERMRESTIKRYKDLQIPWEWMLDTGLLGQQM 360

Query: 361 KMSTLRLAKEYMIRITRELQSTETPQRENLFLQGARFAYRVHQYAGGFDSETIEAFEGLK 402
           K+S+L+LA+EYM RI  ELQ+ E    +NL LQG R+AYRVHQ+AGGFD+E I+AFE LK
Sbjct: 361 KLSSLKLAREYMKRIANELQADECSCEDNLKLQGVRYAYRVHQFAGGFDAEAIQAFEKLK 420

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name       E-value    Identity (%)   Description
XP_011658693.1   5.0e-214   100.00         PREDICTED: protein CHUP1, chloroplastic isoform X1 [Cucumis sativus] >KGN65828.1...
XP_011658695.1   7.8e-199   100.00         PREDICTED: protein CHUP1, chloroplastic isoform X2 [Cucumis sativus]
XP_008457349.1   1.7e-190   89.80          PREDICTED: protein CHUP1, chloroplastic isoform X1 [Cucumis melo]
XP_008457350.1   7.1e-176   89.36          PREDICTED: protein CHUP1, chloroplastic isoform X2 [Cucumis melo]
XP_023523072.1   5.1e-158   76.67          protein CHUP1, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo] >XP_0235230...

vs. TAIR10
Match Name    E-value   Identity (%)   Description
AT1G07120.1   3.7e-78   45.69          FUNCTIONS IN: molecular_function unknown
AT3G25690.1   1.7e-62   49.05          Hydroxyproline-rich glycoprotein family protein
AT4G18570.1   8.3e-62   53.60          Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G48280.1   9.6e-34   31.20          hydroxyproline-rich glycoprotein family protein
AT1G11070.1   4.5e-07   30.17          BEST Arabidopsis thaliana protein match is: Hydroxyproline-rich glycoprotein fam...

vs. Swiss-Prot
Match Name              E-value   Identity (%)   Description
sp|Q9LI74|CHUP1_ARATH   3.0e-61   49.05          Protein CHUP1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CHUP1 PE=1 SV=1

vs. TrEMBL
Match Name                       E-value    Identity (%)   Description
tr|A0A0A0LVK7|A0A0A0LVK7_CUCSA   3.3e-214   100.00         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G532360 PE=4 SV=1
tr|A0A1S3C4V9|A0A1S3C4V9_CUCME   1.2e-190   89.80          protein CHUP1, chloroplastic isoform X1 OS=Cucumis melo OX=3656 GN=LOC103497059 ...
tr|A0A1S3C5E9|A0A1S3C5E9_CUCME   4.7e-176   89.36          protein CHUP1, chloroplastic isoform X2 OS=Cucumis melo OX=3656 GN=LOC103497059 ...
tr|A0A2I4EIH7|A0A2I4EIH7_9ROSI   6.5e-109   56.44          protein CHUP1, chloroplastic isoform X2 OS=Juglans regia OX=51240 GN=LOC10898989...
tr|A0A2I4EII3|A0A2I4EII3_9ROSI   9.3e-108   56.31          protein CHUP1, chloroplastic isoform X1 OS=Juglans regia OX=51240 GN=LOC10898989...

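
The identity values in the BLAST summary agree with the `Identity = matches/length` lines of the corresponding HSPs: each is the number of identical positions divided by the alignment length, expressed as a percentage. A short sketch that recomputes a few of them from the counts in this record (the function name `percent` is illustrative):

```python
def percent(matches, length):
    """Return matches/length as a percentage string with two decimals."""
    return f"{100 * matches / length:.2f}"

# (match name, identical positions, alignment length) taken from the HSP lines
hsps = [
    ("XP_008457349.1", 361, 402),
    ("XP_023523072.1", 309, 403),
    ("AT1G07120.1", 180, 394),
]
for name, matches, length in hsps:
    print(name, percent(matches, length))
```

Note that the denominator is the alignment length (which includes gaps), not the query length, so the same query can have different denominators against different subjects.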
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0003674       molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
CsaV3_1G036390.1   CsaV3_1G036390.1   mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR Term   IPR Description    Source        Source Term      Source Description    Alignment
None       No IPR available   COILS         Coil             Coil                  coord: 286..306
None       No IPR available   COILS         Coil             Coil                  coord: 20..54
None       No IPR available   MOBIDB_LITE   mobidb-lite      disorder_prediction   coord: 92..111
None       No IPR available   MOBIDB_LITE   mobidb-lite      disorder_prediction   coord: 74..128
None       No IPR available   PANTHER       PTHR31342        FAMILY NOT NAMED      coord: 11..401
None       No IPR available   PANTHER       PTHR31342:SF30   F10K1.18 PROTEIN      coord: 11..401
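
The `coord:` values are residue ranges on the protein. Assuming they are 1-based and inclusive, a range maps to a Python slice as `seq[start - 1:end]`. A minimal sketch that extracts the predicted coiled-coil region 20..54 (the helper `region` is illustrative, and only the first 60 residues of the protein are included here):

```python
def region(seq, start, end):
    """Return residues start..end (1-based, inclusive) of seq."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates outside the sequence")
    return seq[start - 1:end]

# First 60 residues of the CsaV3_1G036390 protein sequence
n_term = "MPKEEDEVLAMEINCLKKELEISLQKSIFLEKENQELRQELNRLRSQIQSFKAQNNERKS"
coil = region(n_term, 20, 54)
print(coil, len(coil))
```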

The following gene(s) are paralogous to this gene:

None