Sgr019225 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr019225
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: protein CHUP1, chloroplastic
Location: tig00153293: 824775 .. 828190 (-)
RNA-Seq Expression: Sgr019225
Synteny: Sgr019225
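
The Location field can be read programmatically. The following is a minimal sketch, not part of the database itself: the names `Location`, `parse_location`, and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'contig: start .. end (strand)' string."""
    seqid: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a location string such as 'tig00153293: 824775 .. 828190 (-)'."""
    pattern = r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("tig00153293: 824775 .. 828190 (-)")
print(loc.seqid, loc.strand, span_length(loc))
```

Under that coordinate assumption the gene spans 3416 bp on the minus strand of tig00153293.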
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTAGCTGGGAAGGTGAAGGTCGCAATGGGGCTGCAGAAGTCCCCGGCGAGCAGAAAGGTTGAGAGCTCACCGAAACCGTCGACGCAGGCGCAGCCTTCTCCGAGCTCTGGTAAGGTTCCTCAGAAAACGGTGTTCTCCCGCTCGTTTGGGGTCTATTTTCCTCGCTCTTCTGCTCAGGTCCAGCCTCGACCGCCTGATGTGACGGAGCTTCTCCGCATGGTCGAGGAGTTGCGTGACAGAGAGGCGCGATTGAAGACTGACCTATTGGAGCACAAGCTTTTGAAGGAGTCTGTCGCCATTGTTCCTCTGCTTGAGAACGAGATCTGTACGAAGGATGCTGAGATTGAGAGAGCGTCTAAGCGGATACTGTTCTTGGAGGCAGAGAATGAGAGGTTGAGAGTTGAGGTGGAGGAACTTCAACTGAATATTGACGAACAGAGGAGAGAGAGTCAGGAAAGAATAAAGGCAATGGAAGGTGAAATCGCAGAGCTGAAGAAAATGGCGTTGGATCGAAACAGAATGGAGCTTAGTTTGGAGAACGACGACCTTTCGGCCTCCCAGAGGTTCCAGGGATTAATGGAGGTCTCGGGAAAGTCTAACCTAATCAGGAACTTGAAAAGAGCGACCAAATGTTCGGATGCTGTTGTTAACCACGACAATCATAAGGTTGACTATCATCCAGAGGCAAAGAAAGAAGAAGTTGAAACCGAGAGACCGAGACATTCGCGATGTAACTCTGAAGAACTCGCCGAGTCCACTCTCTCTAACCTGAAATCACGAATACCTAGGGTTCCAAAACCTCCTCCGAAACCTTCCTCATCTTCCTCTTCTTCTACCTCTTCCTCCTCCTCCTCTTCCTCACCAACTGGCTCTTGTGTTGACACAGAAAACGCGATCCCTCCCCCACCCCCTGTCCCAACAAAGCCAATGACGCCGCCTCCCCCGCCAAGTTCGAAGTCTGCTCCGCCTCCCCCTCCGCCGCCTCCCAAGGGTAAGAGGCTGATGCCAGCGAAGGTGCGGCGAATACCGGAGGTAGTTGAGTTCTATCATTCATTAATGCGGAGGGACTCCCGACGAGATCCTGGTTCCGGCATTACGGACCCGCCGTCGACCGTCAATGCTCGTGACATGATAGGAGAGATCGAGAACCGGTCCGCTCACTTGCTCGCTGTAAGTCACCGGTCACGTCCTCTCTCGGTCCCTTATTTTGTTTTTTCAGTTATCTAATTTACAATTATGCCATAGGCTGAGCTTTTTGTCAGTGACAAGTGACCGGTCCCTCAATTGTTTTTATTAGGTGCCCCTATTGAATTTCTTTGGATTTTATTGTCAAAATCGGAATATTGCTTCCTTTTGTTAGGAATTAGGATTGCCCTCTTGATTCACACTGTCTTTTCGCATATCCTTTTATTTCTCTAGACTCTGGCAATTTTGTCATTTCAAGTTGAAATTTTAACTCAATTCATGGCTTTTTGGAGGTCGAGTAATTCAAATTTACCCTTCCCCCTCTTTTCTCAAAGATGAACTAAACAAAGCGAAAAGGTAATCATAGAGAATAAATTTTCTTTGAAGCCAAATAAAGATTGATTTTGAGAAGATATCAACCTCTTTGATGATGGATTGCTTTAATATCAACTCCACCATCTAAAAAAAAAAGTTACTTTAATTATTATTCTCTTGGGGGTTTTGATTTGGTTATGATTCTCATGATTAATCTCTCATGTTACTTGCAGATAAAGACGGATGTAGAGACTCAAGGGGATTTCATAAAGTTGTTGATCAAAGAAGTTGAAAATGCTTCATTTACTGACATCGAGGACGTTGTGCCATTTGTCAAATGGTTGGATGATGAGCTCTCATATCTGGTAGATGAAAGAGCCGTGCTTAAACACTTCCAGTGGCCGGAGCAAAAGGCCGACGCTCTGCGTGAGGCTGCCTTCGGCTATTGCGATCTAAAGAAGCTGGAATCTGAAGCGTCATCGTTTCGTGATGCCCGC
CAGCCCTGTAGTTCAGCTCTCAAGAAGATGCAATCTTTGCTTGAAAAGTATTCAAAATTTTTCCTGTGCTTGCTTCTAACTGATTTTTTCCTTACTTTTAGTTTCATTTCTTAGTTGCTGTTCTTTGGTGCAACAGATTGGAGCATGGCGTATACAATCTGTCGAGAATGCGCGAATCTGCAACTAAGAGATACAAAGTGTTTAAAATTCCAGTGGAATGGATGCTTGATAGTGGAATTGTGAGTCAGGTAAATTCCATCTCTGTTTTTAATTAAACAAATATAGTTTAATTCTAGCTTAGGTCGGCTGGTGGGAAGGGTATCCAAGAAGGATGACTGCTGATTAATAATGATTTCCTTGACTTGAAATGCAACGATTATTAAAATAATTAAGTTGTTCCATTTAGGTTAGGTGTGTGTCCCTCAGGTGGTCCCTTTTAAAGGATCCTTTCTGGAGCGACAGGGAAATAACTCTGACACAGTCTACAGAACAATGTAATATAAATATGGAGAATGGTTAACAAGATGACAAAAAGGATGTTTTGATTTTGCCTTATTTTTCTCTGCCCCTTGCCTGCACGCGCACATGGGAATTCCCAGATTTCTAAACTACATCTTCTTGTCTGCTTTATGGCTGTTTTAAATTATGGGCTAATAAGAGAATCAGATTCTCCTTGTTTGCTGTATGCTAATTTGATTCTTTTTATAACTTTCTTTAGAAAGAGCACACTGGGATTTGTTTGTTAATAAATATTCATGCATTTGATAACTCAAATAAACCCTTGGAAGCTCCAGAATCCAACCTCTCGAGCATTGCATTGTTTAAAATGAGTAAATTAATCAACGTCGTTAAAAGGATAAATCGTTTGCCTTCAAAATATGGTCCCCTGGGCGGTCTAGTTGGCAGTTCTTTAATACGTTCACGTGGTATTAGTTAACTCGGCTGATGTGTTTCTGATCTAAGTCAATTATCTGCTTCAATTAATGTGTCACAGATCAAACTTGTGTCTGTAAAATTAGCAATGAAGTACATGAAGAGAGTATCCGCAGAGCTCGAAACAGTCGGTGGTGGACCTGAAGAAGAAGAGCTGATTGTCCAAGGCGTTAGGTTTGCCTTCCGTGTGCATCAGGTAATCTTGGGCTGCTCAACAACTTGCTTTTGGGCACTTACGGAAGGGGGAGGGTACCTCAACCCTCTTATTTGAAACTGGAAACTGTTGCAAATAACTTTGTTAATCCTCCATGATTATGTGGTAAACTTGGTTCTTCAACTTCTTATGTGCAGTTTGCAGGAGGGTTTGATGTGGAAACGATGAGGGCATTTCAAGAGCTGAGAGATAAAGCAAGTTCATGCCACGTACAATGCCAAAACCAGCAACATAAGTATGTGTGCAGTAGCAGGCCTACAACTTGTTAA

mRNA sequence

ATGGTAGCTGGGAAGGTGAAGGTCGCAATGGGGCTGCAGAAGTCCCCGGCGAGCAGAAAGGTTGAGAGCTCACCGAAACCGTCGACGCAGGCGCAGCCTTCTCCGAGCTCTGGTAAGGTTCCTCAGAAAACGGTGTTCTCCCGCTCGTTTGGGGTCTATTTTCCTCGCTCTTCTGCTCAGGTCCAGCCTCGACCGCCTGATGTGACGGAGCTTCTCCGCATGGTCGAGGAGTTGCGTGACAGAGAGGCGCGATTGAAGACTGACCTATTGGAGCACAAGCTTTTGAAGGAGTCTGTCGCCATTGTTCCTCTGCTTGAGAACGAGATCTGTACGAAGGATGCTGAGATTGAGAGAGCGTCTAAGCGGATACTGTTCTTGGAGGCAGAGAATGAGAGGTTGAGAGTTGAGGTGGAGGAACTTCAACTGAATATTGACGAACAGAGGAGAGAGAGTCAGGAAAGAATAAAGGCAATGGAAGGTGAAATCGCAGAGCTGAAGAAAATGGCGTTGGATCGAAACAGAATGGAGCTTAGTTTGGAGAACGACGACCTTTCGGCCTCCCAGAGGTTCCAGGGATTAATGGAGGTCTCGGGAAAGTCTAACCTAATCAGGAACTTGAAAAGAGCGACCAAATGTTCGGATGCTGTTGTTAACCACGACAATCATAAGGTTGACTATCATCCAGAGGCAAAGAAAGAAGAAGTTGAAACCGAGAGACCGAGACATTCGCGATGTAACTCTGAAGAACTCGCCGAGTCCACTCTCTCTAACCTGAAATCACGAATACCTAGGGTTCCAAAACCTCCTCCGAAACCTTCCTCATCTTCCTCTTCTTCTACCTCTTCCTCCTCCTCCTCTTCCTCACCAACTGGCTCTTGTGTTGACACAGAAAACGCGATCCCTCCCCCACCCCCTGTCCCAACAAAGCCAATGACGCCGCCTCCCCCGCCAAGTTCGAAGTCTGCTCCGCCTCCCCCTCCGCCGCCTCCCAAGGGTAAGAGGCTGATGCCAGCGAAGGTGCGGCGAATACCGGAGGTAGTTGAGTTCTATCATTCATTAATGCGGAGGGACTCCCGACGAGATCCTGGTTCCGGCATTACGGACCCGCCGTCGACCGTCAATGCTCGTGACATGATAGGAGAGATCGAGAACCGGTCCGCTCACTTGCTCGCTATAAAGACGGATGTAGAGACTCAAGGGGATTTCATAAAGTTGTTGATCAAAGAAGTTGAAAATGCTTCATTTACTGACATCGAGGACGTTGTGCCATTTGTCAAATGGTTGGATGATGAGCTCTCATATCTGGTAGATGAAAGAGCCGTGCTTAAACACTTCCAGTGGCCGGAGCAAAAGGCCGACGCTCTGCGTGAGGCTGCCTTCGGCTATTGCGATCTAAAGAAGCTGGAATCTGAAGCGTCATCGTTTCGTGATGCCCGCCAGCCCTGTAGTTCAGCTCTCAAGAAGATGCAATCTTTGCTTGAAAAATTGGAGCATGGCGTATACAATCTGTCGAGAATGCGCGAATCTGCAACTAAGAGATACAAAGTGTTTAAAATTCCAGTGGAATGGATGCTTGATAGTGGAATTGTGAGTCAGATCAAACTTGTGTCTGTAAAATTAGCAATGAAGTACATGAAGAGAGTATCCGCAGAGCTCGAAACAGTCGGTGGTGGACCTGAAGAAGAAGAGCTGATTGTCCAAGGCGTTAGGTTTGCCTTCCGTGTGCATCAGTTTGCAGGAGGGTTTGATGTGGAAACGATGAGGGCATTTCAAGAGCTGAGAGATAAAGCAAGTTCATGCCACGTACAATGCCAAAACCAGCAACATAAGTATGTGTGCAGTAGCAGGCCTACAACTTGTTAA

Coding sequence (CDS)

ATGGTAGCTGGGAAGGTGAAGGTCGCAATGGGGCTGCAGAAGTCCCCGGCGAGCAGAAAGGTTGAGAGCTCACCGAAACCGTCGACGCAGGCGCAGCCTTCTCCGAGCTCTGGTAAGGTTCCTCAGAAAACGGTGTTCTCCCGCTCGTTTGGGGTCTATTTTCCTCGCTCTTCTGCTCAGGTCCAGCCTCGACCGCCTGATGTGACGGAGCTTCTCCGCATGGTCGAGGAGTTGCGTGACAGAGAGGCGCGATTGAAGACTGACCTATTGGAGCACAAGCTTTTGAAGGAGTCTGTCGCCATTGTTCCTCTGCTTGAGAACGAGATCTGTACGAAGGATGCTGAGATTGAGAGAGCGTCTAAGCGGATACTGTTCTTGGAGGCAGAGAATGAGAGGTTGAGAGTTGAGGTGGAGGAACTTCAACTGAATATTGACGAACAGAGGAGAGAGAGTCAGGAAAGAATAAAGGCAATGGAAGGTGAAATCGCAGAGCTGAAGAAAATGGCGTTGGATCGAAACAGAATGGAGCTTAGTTTGGAGAACGACGACCTTTCGGCCTCCCAGAGGTTCCAGGGATTAATGGAGGTCTCGGGAAAGTCTAACCTAATCAGGAACTTGAAAAGAGCGACCAAATGTTCGGATGCTGTTGTTAACCACGACAATCATAAGGTTGACTATCATCCAGAGGCAAAGAAAGAAGAAGTTGAAACCGAGAGACCGAGACATTCGCGATGTAACTCTGAAGAACTCGCCGAGTCCACTCTCTCTAACCTGAAATCACGAATACCTAGGGTTCCAAAACCTCCTCCGAAACCTTCCTCATCTTCCTCTTCTTCTACCTCTTCCTCCTCCTCCTCTTCCTCACCAACTGGCTCTTGTGTTGACACAGAAAACGCGATCCCTCCCCCACCCCCTGTCCCAACAAAGCCAATGACGCCGCCTCCCCCGCCAAGTTCGAAGTCTGCTCCGCCTCCCCCTCCGCCGCCTCCCAAGGGTAAGAGGCTGATGCCAGCGAAGGTGCGGCGAATACCGGAGGTAGTTGAGTTCTATCATTCATTAATGCGGAGGGACTCCCGACGAGATCCTGGTTCCGGCATTACGGACCCGCCGTCGACCGTCAATGCTCGTGACATGATAGGAGAGATCGAGAACCGGTCCGCTCACTTGCTCGCTATAAAGACGGATGTAGAGACTCAAGGGGATTTCATAAAGTTGTTGATCAAAGAAGTTGAAAATGCTTCATTTACTGACATCGAGGACGTTGTGCCATTTGTCAAATGGTTGGATGATGAGCTCTCATATCTGGTAGATGAAAGAGCCGTGCTTAAACACTTCCAGTGGCCGGAGCAAAAGGCCGACGCTCTGCGTGAGGCTGCCTTCGGCTATTGCGATCTAAAGAAGCTGGAATCTGAAGCGTCATCGTTTCGTGATGCCCGCCAGCCCTGTAGTTCAGCTCTCAAGAAGATGCAATCTTTGCTTGAAAAATTGGAGCATGGCGTATACAATCTGTCGAGAATGCGCGAATCTGCAACTAAGAGATACAAAGTGTTTAAAATTCCAGTGGAATGGATGCTTGATAGTGGAATTGTGAGTCAGATCAAACTTGTGTCTGTAAAATTAGCAATGAAGTACATGAAGAGAGTATCCGCAGAGCTCGAAACAGTCGGTGGTGGACCTGAAGAAGAAGAGCTGATTGTCCAAGGCGTTAGGTTTGCCTTCCGTGTGCATCAGTTTGCAGGAGGGTTTGATGTGGAAACGATGAGGGCATTTCAAGAGCTGAGAGATAAAGCAAGTTCATGCCACGTACAATGCCAAAACCAGCAACATAAGTATGTGTGCAGTAGCAGGCCTACAACTTGTTAA

Protein sequence

MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPLLENEICTKDAEIERASKRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTENAIPPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDPGSGITDPPSTVNARDMIGEIENRSAHLLAIKTDVETQGDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRDARQPCSSALKKMQSLLEKLEHGVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQCQNQQHKYVCSSRPTTC
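
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch under stated assumptions: it uses the standard genetic code (NCBI translation table 1), the helper name `translate` is hypothetical, and only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), built in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    A trailing partial codon is ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 24 and last 18 nucleotides of the CDS listed above.
cds_start = "ATGGTAGCTGGGAAGGTGAAGGTC"
cds_end = "AGGCCTACAACTTGTTAA"
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first eight residues of the protein (MVAGKVKV) and to its last five residues followed by the stop codon (RPTTC*).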
Homology
BLAST of Sgr019225 vs. NCBI nr
Match: XP_008439756.1 (PREDICTED: protein CHUP1, chloroplastic [Cucumis melo])

HSP 1 Score: 1046.6 bits (2705), Expect = 8.6e-302
Identity = 573/622 (92.12%), Positives = 593/622 (95.34%), Query Frame = 0

Query: 1   MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQ 60
           MVAGKVKVAMGLQKSPASRKVESSPK ST AQPSPSSGKV QKTVFSRSFGVYFPRSSAQ
Sbjct: 1   MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQ 60

Query: 61  VQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPLLENEICTKDAEIERAS 120
           VQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVP+LENEI TKDAEIERAS
Sbjct: 61  VQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPVLENEISTKDAEIERAS 120

Query: 121 KRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLE 180
           KRILFLEAENERLRV+VEE++ +++E+RRESQERIKAMEGEI+ELKKMALDR+RMEL LE
Sbjct: 121 KRILFLEAENERLRVQVEEVKQSVEEERRESQERIKAMEGEISELKKMALDRSRMELILE 180

Query: 181 NDDLSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERP 240
           ND+LSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVN DNHKV+ HPE KKEEVETERP
Sbjct: 181 NDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNQDNHKVE-HPEVKKEEVETERP 240

Query: 241 RHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTENAI 300
           RHSRCNSEELAESTLSN+KSRIPRVP+PPPKPSSSSSSS ++SSSS   TGS  D E AI
Sbjct: 241 RHSRCNSEELAESTLSNIKSRIPRVPRPPPKPSSSSSSSATTSSSS---TGSSADIEKAI 300

Query: 301 PPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRR 360
           P PPPVPTKPM PPPPP SKSAPPPPPPPPKGKR MPAKVRRIPEVVEFYHSLMRRDSRR
Sbjct: 301 PAPPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRR 360

Query: 361 DPGSGITDPPSTVNARDMIGEIENRSAHLLAIKTDVETQGDFIKLLIKEVENASFTDIED 420
           D GSG+TDPPST NARDMIGEIENRSAHLLAIKTDVETQGDFI+LLIKEVENASFTDIED
Sbjct: 361 DSGSGVTDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRLLIKEVENASFTDIED 420

Query: 421 VVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DAR 480
           VVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR DAR
Sbjct: 421 VVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDAR 480

Query: 481 QPCSSALKKMQSLLEKLEHGVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVK 540
           QPC SALKKMQ+LLEKLEHGVYNLSRMRESA KRYK F+IPVEWMLDSGIVSQIKLVSVK
Sbjct: 481 QPCGSALKKMQALLEKLEHGVYNLSRMRESAAKRYKAFQIPVEWMLDSGIVSQIKLVSVK 540

Query: 541 LAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS 600
           LAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
Sbjct: 541 LAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS 600

Query: 601 CHVQCQN-QQHKYVCSSRPTTC 621
           CHVQCQN QQHKYV SSRPTTC
Sbjct: 601 CHVQCQNQQQHKYVWSSRPTTC 618
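
The per-HSP summary lines in these reports follow a regular layout, so the percentages can be recomputed from the raw counts. The sketch below is an illustration only: the function name `parse_hsp_summary` and the returned dictionary keys are hypothetical, and the regular expressions are written against the line layout shown in this record rather than any formal BLAST output specification. The "Positives" label is matched loosely so that a misspelled variant is also accepted.

```python
import re

SCORE_RE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
IDENTITY_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def parse_hsp_summary(score_line: str, identity_line: str) -> dict:
    """Extract score, E-value, and recomputed percentages for one HSP."""
    score = SCORE_RE.search(score_line)
    ident = IDENTITY_RE.search(identity_line)
    if score is None or ident is None:
        raise ValueError("lines do not match the expected HSP layout")
    identical, length, positives, pos_length = map(int, ident.groups())
    return {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(score.group(3)),
        "alignment_length": length,
        "pct_identity": round(100 * identical / length, 2),
        "pct_positives": round(100 * positives / pos_length, 2),
    }


hsp = parse_hsp_summary(
    "HSP 1 Score: 1046.6 bits (2705), Expect = 8.6e-302",
    "Identity = 573/622 (92.12%), Positives = 593/622 (95.34%), Query Frame = 0",
)
print(hsp["pct_identity"], hsp["pct_positives"], hsp["evalue"])
```

For the Cucumis melo hit, the recomputed values agree with the reported 92.12% identity and 95.34% positives over a 622-column alignment.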

BLAST of Sgr019225 vs. NCBI nr
Match: XP_004134665.1 (protein CHUP1, chloroplastic [Cucumis sativus] >KGN49295.1 hypothetical protein Csa_003180 [Cucumis sativus])

HSP 1 Score: 1044.3 bits (2699), Expect = 4.3e-301
Identity = 572/622 (91.96%), Positives = 593/622 (95.34%), Query Frame = 0

Query: 1   MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQ 60
           MVAGKVKVAMGLQKSPASRKVESSPK ST AQPSPSSGKV QKTVFSRSFGVYFPRSSAQ
Sbjct: 1   MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQ 60

Query: 61  VQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPLLENEICTKDAEIERAS 120
           VQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVP+LENEI TKDAEIERAS
Sbjct: 61  VQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPVLENEISTKDAEIERAS 120

Query: 121 KRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLE 180
           KRILFLEAENERLRV+VEE + +++E+RRESQERIKAMEGE+AELKKMALDR+RMEL LE
Sbjct: 121 KRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILE 180

Query: 181 NDDLSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERP 240
           ND+LSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVN DNHKV+ HPEAKKEEVETERP
Sbjct: 181 NDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNQDNHKVE-HPEAKKEEVETERP 240

Query: 241 RHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTENAI 300
           RHSRCNSEELAESTLSN+KSRIPRVPKPPPKPSSSSSSS ++S+SSSS TGS  D E AI
Sbjct: 241 RHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSS-TGSSADIEKAI 300

Query: 301 PPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRR 360
           P PPPVPTK M PPPPP SKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRR
Sbjct: 301 PAPPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRR 360

Query: 361 DPGSGITDPPSTVNARDMIGEIENRSAHLLAIKTDVETQGDFIKLLIKEVENASFTDIED 420
           D GSG+T+PPST NARDMIGEIENRSAHLLAIKTDVETQGDFI+ LIKEVENASFTDIED
Sbjct: 361 DSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIED 420

Query: 421 VVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DAR 480
           VVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR DAR
Sbjct: 421 VVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDAR 480

Query: 481 QPCSSALKKMQSLLEKLEHGVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVK 540
           QPC SALKKMQ+LLEKLEHGVYNLSRMRESA KRYK F+IPVEWMLD GIVSQIKLVSVK
Sbjct: 481 QPCGSALKKMQALLEKLEHGVYNLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVK 540

Query: 541 LAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS 600
           LAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
Sbjct: 541 LAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS 600

Query: 601 CHVQCQN-QQHKYVCSSRPTTC 621
           CHVQCQN QQHKYV SSRPTTC
Sbjct: 601 CHVQCQNQQQHKYVWSSRPTTC 620

BLAST of Sgr019225 vs. NCBI nr
Match: XP_038883847.1 (protein CHUP1, chloroplastic [Benincasa hispida])

HSP 1 Score: 1040.8 bits (2690), Expect = 4.7e-300
Identity = 569/621 (91.63%), Positives = 588/621 (94.69%), Query Frame = 0

Query: 1   MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQ 60
           MVAGKVKVAMGLQKSPASRKVESSPKPST AQPSPSSGKV QKTVFSRSFGVYFPRSSAQ
Sbjct: 1   MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQ 60

Query: 61  VQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPLLENEICTKDAEIERAS 120
           VQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVP+LENEI TKDAEIERAS
Sbjct: 61  VQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPVLENEISTKDAEIERAS 120

Query: 121 KRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLE 180
           KRILFLEAENERLRVEVEE++ +++E+RRESQERIKAME EIAELKKMALDR+RMEL LE
Sbjct: 121 KRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMECEIAELKKMALDRSRMELILE 180

Query: 181 NDDLSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERP 240
           ND+LSASQRFQGLMEVSGKSNLIRNLKR TKCS+AVVN DNHK + HPEAKKEEVETERP
Sbjct: 181 NDELSASQRFQGLMEVSGKSNLIRNLKRTTKCSEAVVNQDNHKFE-HPEAKKEEVETERP 240

Query: 241 RHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTENAI 300
           RHSRCNSEELAE TLSN+KSRIPRVPKPPPKPSSSSSSS ++SSSS   TGS  D E AI
Sbjct: 241 RHSRCNSEELAEPTLSNIKSRIPRVPKPPPKPSSSSSSSANTSSSS---TGSSGDLEKAI 300

Query: 301 PPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRR 360
           P PPPVPTKPM PPPPP SKSAPPPPPPPPKGKR MP KVRRIPEVVEFYHSLMRRDSRR
Sbjct: 301 PAPPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPTKVRRIPEVVEFYHSLMRRDSRR 360

Query: 361 DPGSGITDPPSTVNARDMIGEIENRSAHLLAIKTDVETQGDFIKLLIKEVENASFTDIED 420
           D GS +TDPPST NARDMIGEIENRSAHLLAIKTDVETQGDFI+ LIKEVENASFTDIED
Sbjct: 361 DSGSSVTDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIED 420

Query: 421 VVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DAR 480
           VVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR DAR
Sbjct: 421 VVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDAR 480

Query: 481 QPCSSALKKMQSLLEKLEHGVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVK 540
           QPC SALKKMQ+LLEKLEHGVYNLSRMRESATKRYK F+IPVEWMLDSGIV QIKLVSVK
Sbjct: 481 QPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVCQIKLVSVK 540

Query: 541 LAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS 600
           LAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
Sbjct: 541 LAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS 600

Query: 601 CHVQCQNQQHKYVCSSRPTTC 621
           CHVQCQNQQHKYV SSRPTTC
Sbjct: 601 CHVQCQNQQHKYVWSSRPTTC 617

BLAST of Sgr019225 vs. NCBI nr
Match: KAA0052630.1 (protein CHUP1 [Cucumis melo var. makuwa] >TYK13197.1 protein CHUP1 [Cucumis melo var. makuwa])

HSP 1 Score: 1029.6 bits (2661), Expect = 1.1e-296
Identity = 564/613 (92.01%), Positives = 585/613 (95.43%), Query Frame = 0

Query: 1   MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQ 60
           MVAGKVKVAMGLQKSPASRKVESSPK ST AQPSPSSGKV QKTVFSRSFGVYFPRSSAQ
Sbjct: 1   MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQ 60

Query: 61  VQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPLLENEICTKDAEIERAS 120
           VQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVP+LENEI TKDAEIERAS
Sbjct: 61  VQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPVLENEISTKDAEIERAS 120

Query: 121 KRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLE 180
           KRILFLEAENERLRV+VEE++ +++E+RRESQER+KAMEGEI+ELKKMALDR+RMEL LE
Sbjct: 121 KRILFLEAENERLRVQVEEVKQSVEEERRESQERMKAMEGEISELKKMALDRSRMELILE 180

Query: 181 NDDLSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERP 240
           ND+LSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVN DNHKV+ HPE KKEEVETERP
Sbjct: 181 NDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNQDNHKVE-HPEVKKEEVETERP 240

Query: 241 RHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTENAI 300
           RHSRCNSEELAESTLSN+KSRIPRVPKPPPKPSSSSSSS ++SSSSS  TGS  D E AI
Sbjct: 241 RHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSSSSS--TGSSADIEKAI 300

Query: 301 PPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRR 360
           P PPPVPTKPM PPPPP SKSAPPPPPPPPKGKR MPAKVRRIPEVVEFYHSLMRRDSRR
Sbjct: 301 PAPPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRR 360

Query: 361 DPGSGITDPPSTVNARDMIGEIENRSAHLLAIKTDVETQGDFIKLLIKEVENASFTDIED 420
           D GSG+TDPPST NARDMIGEIENRSAHLLAIKTDVETQGDFI+ LIKEVENASFTDIED
Sbjct: 361 DSGSGVTDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIED 420

Query: 421 VVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DAR 480
           VVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR DAR
Sbjct: 421 VVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDAR 480

Query: 481 QPCSSALKKMQSLLEKLEHGVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVK 540
           QPC SALKKMQ+LLEKLEHGVYNLSRMRESA KRYK F+IPVEWMLDSGIVSQIKLVSVK
Sbjct: 481 QPCGSALKKMQALLEKLEHGVYNLSRMRESAAKRYKAFQIPVEWMLDSGIVSQIKLVSVK 540

Query: 541 LAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS 600
           LAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
Sbjct: 541 LAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS 600

Query: 601 CHVQCQN-QQHKY 612
           CHVQCQN QQHK+
Sbjct: 601 CHVQCQNQQQHKF 610

BLAST of Sgr019225 vs. NCBI nr
Match: XP_022926872.1 (protein CHUP1, chloroplastic-like [Cucurbita moschata])

HSP 1 Score: 1014.2 bits (2621), Expect = 4.7e-292
Identity = 560/623 (89.89%), Positives = 586/623 (94.06%), Query Frame = 0

Query: 1   MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQ 60
           MVAGKVK+AMGLQKSPASRKVESSPKPST AQPSPSSGK+ QKTVFSRSFGVYFPRSSAQ
Sbjct: 1   MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQ 60

Query: 61  VQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPLLENEICTKDAEIERAS 120
           VQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVAIVP+LENEI TKDAEIERAS
Sbjct: 61  VQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVAIVPMLENEIATKDAEIERAS 120

Query: 121 KRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLE 180
           KRILFLEAENERLRVEVEE++ +++EQRRESQER+KAMEGEIAELKKMALDR RMEL LE
Sbjct: 121 KRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILE 180

Query: 181 NDDLSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERP 240
           ND+LSASQRFQGLMEVSGKSNLIR+LKR TK SD VV  DNHKV+  PEAKKEEVETERP
Sbjct: 181 NDELSASQRFQGLMEVSGKSNLIRSLKRPTKFSDTVVTQDNHKVE-QPEAKKEEVETERP 240

Query: 241 RHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTENAI 300
           RHSR NSEELAESTLSN+KSRIPRVPKPPPKPSSSSSSS +SSSSS+S TGS  D E  I
Sbjct: 241 RHSRSNSEELAESTLSNVKSRIPRVPKPPPKPSSSSSSSATSSSSSTS-TGSSGDAEKKI 300

Query: 301 PPPPPVPTKPM-TPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSR 360
           P PPPVPTKP   PPPPP SKSAPPPPPPPPKGKR  PAKVRRIPEVVEFYHSLMRRDSR
Sbjct: 301 PAPPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSR 360

Query: 361 RDPGSGITDPPSTVNARDMIGEIENRSAHLLAIKTDVETQGDFIKLLIKEVENASFTDIE 420
           R+ GSG+T+PPS+ NARDMIGEIENRS HLLAIKTDVETQGDFI+ LIKEVENASFTDIE
Sbjct: 361 RELGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIE 420

Query: 421 DVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DA 480
           DVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFR DA
Sbjct: 421 DVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDA 480

Query: 481 RQPCSSALKKMQSLLEKLEHGVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSV 540
           RQPC SALKKMQ+LLEKLEHGVYNLSRMRESATKRYK F+IPVEWMLDSGIVSQIKLVSV
Sbjct: 481 RQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSV 540

Query: 541 KLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKAS 600
           KLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKAS
Sbjct: 541 KLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKAS 600

Query: 601 SCHVQCQNQQHKYVCSS-RPTTC 621
           SCHVQCQNQQHKYVCSS RPTTC
Sbjct: 601 SCHVQCQNQQHKYVCSSNRPTTC 621

BLAST of Sgr019225 vs. ExPASy Swiss-Prot
Match: Q9LI74 (Protein CHUP1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CHUP1 PE=1 SV=1)

HSP 1 Score: 307.8 bits (787), Expect = 2.9e-82
Identity = 204/457 (44.64%), Positives = 273/457 (59.74%), Query Frame = 0

Query: 156 KAMEGEIAELKKMALDRNRMELSLEND-----DLSASQRFQGLMEVSGKSNLIRNLKRAT 215
           K+++  + E      DR+++ +  E       D + ++RF G + +  K   ++  +   
Sbjct: 548 KSVDNVLDEKYPAYKDRHKLAVEREKHIKHKADQARAERFGGNVALPPKLAQLKEKRVVV 607

Query: 216 KCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEELAESTLSNLKSRIPRVPKPPP 275
                     +++ +   E K  E           N+  + +  L +++ R PRVP+PPP
Sbjct: 608 PSVITATGDQSNESNESNEGKASE-----------NAATVTKMKLVDIEKRPPRVPRPPP 667

Query: 276 KPSSSSSSSTSSSSSSSSPTGSCVDTENAIPPPPPVPTKPMTPPPPPSSKSAPPPPPPPP 335
           + +    S+   S+    P G          PPPP P     PPPPP     PPPPPP  
Sbjct: 668 RSAGGGKSTNLPSARPPLPGGG---------PPPPPPPPGGGPPPPPGGGPPPPPPPPGA 727

Query: 336 KGKRLMPA-KVRRIPEVVEFYHSLMRRDSRRDPGSGITDP---PSTVNARDMIGEIENRS 395
            G+      KV R PE+VEFY SLM+R+S+++    +       S+    +MIGEIENRS
Sbjct: 728 LGRGAGGGNKVHRAPELVEFYQSLMKRESKKEGAPSLISSGTGNSSAARNNMIGEIENRS 787

Query: 396 AHLLAIKTDVETQGDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQ 455
             LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF 
Sbjct: 788 TFLLAVKADVETQGDFVQSLATEVRASSFTDIEDLLAFVSWLDEELSFLVDERAVLKHFD 847

Query: 456 WPEQKADALREAAFGYCDLKKLESEASSF-RDARQPCSSALKKMQSLLEKLEHGVYNLSR 515
           WPE KADALREAAF Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R
Sbjct: 848 WPEGKADALREAAFEYQDLMKLEKQVTSFVDDPNLSCEPALKKMYKLLEKVEQSVYALLR 907

Query: 516 MRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGG---PEEE 575
            R+ A  RYK F IPV+W+ D+G+V +IKL SV+LA KYMKRV+ EL++V G    P  E
Sbjct: 908 TRDMAISRYKEFGIPVDWLSDTGVVGKIKLSSVQLAKKYMKRVAYELDSVSGSDKDPNRE 967

Query: 576 ELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS 600
            L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Sbjct: 968 FLLLQGVRFAFRVHQFAGGFDAESMKAFEELRSRAKT 984


HSP 2 Score: 37.7 bits (86), Expect = 5.5e-01
Identity = 38/155 (24.52%), Positives = 75/155 (48.39%), Query Frame = 0

Query: 67  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPLLENEICTKDAEIERASKRILFL 126
           ++  L ++V+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K  EI+  +  I  L
Sbjct: 130 ELERLKQLVKELEEREVKLEGELLEYYGLKEQESDIVELQRQLKIKTVEIDMLNITINSL 189

Query: 127 EAENERL----------RVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRME 186
           +AE ++L          R E+E  +  I E +R+ Q      +G++  LK+        E
Sbjct: 190 QAERKKLQEELSQNGIVRKELEVARNKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKE 249

Query: 187 LSLENDDLSASQRFQGLMEVSGKSNLIRNLKRATK 212
               N D    ++ + + ++  +   +  LKR  +
Sbjct: 250 EEAMNKDTEVERKLKAVQDLEVQ---VMELKRKNR 281

BLAST of Sgr019225 vs. ExPASy TrEMBL
Match: A0A1S3AZH3 (protein CHUP1, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103484458 PE=4 SV=1)

HSP 1 Score: 1046.6 bits (2705), Expect = 4.2e-302
Identity = 573/622 (92.12%), Positives = 593/622 (95.34%), Query Frame = 0

Query: 1   MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQ 60
           MVAGKVKVAMGLQKSPASRKVESSPK ST AQPSPSSGKV QKTVFSRSFGVYFPRSSAQ
Sbjct: 1   MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQ 60

Query: 61  VQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPLLENEICTKDAEIERAS 120
           VQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVP+LENEI TKDAEIERAS
Sbjct: 61  VQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPVLENEISTKDAEIERAS 120

Query: 121 KRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLE 180
           KRILFLEAENERLRV+VEE++ +++E+RRESQERIKAMEGEI+ELKKMALDR+RMEL LE
Sbjct: 121 KRILFLEAENERLRVQVEEVKQSVEEERRESQERIKAMEGEISELKKMALDRSRMELILE 180

Query: 181 NDDLSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERP 240
           ND+LSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVN DNHKV+ HPE KKEEVETERP
Sbjct: 181 NDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNQDNHKVE-HPEVKKEEVETERP 240

Query: 241 RHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTENAI 300
           RHSRCNSEELAESTLSN+KSRIPRVP+PPPKPSSSSSSS ++SSSS   TGS  D E AI
Sbjct: 241 RHSRCNSEELAESTLSNIKSRIPRVPRPPPKPSSSSSSSATTSSSS---TGSSADIEKAI 300

Query: 301 PPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRR 360
           P PPPVPTKPM PPPPP SKSAPPPPPPPPKGKR MPAKVRRIPEVVEFYHSLMRRDSRR
Sbjct: 301 PAPPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRR 360

Query: 361 DPGSGITDPPSTVNARDMIGEIENRSAHLLAIKTDVETQGDFIKLLIKEVENASFTDIED 420
           D GSG+TDPPST NARDMIGEIENRSAHLLAIKTDVETQGDFI+LLIKEVENASFTDIED
Sbjct: 361 DSGSGVTDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRLLIKEVENASFTDIED 420

Query: 421 VVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DAR 480
           VVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR DAR
Sbjct: 421 VVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDAR 480

Query: 481 QPCSSALKKMQSLLEKLEHGVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVK 540
           QPC SALKKMQ+LLEKLEHGVYNLSRMRESA KRYK F+IPVEWMLDSGIVSQIKLVSVK
Sbjct: 481 QPCGSALKKMQALLEKLEHGVYNLSRMRESAAKRYKAFQIPVEWMLDSGIVSQIKLVSVK 540

Query: 541 LAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS 600
           LAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
Sbjct: 541 LAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS 600

Query: 601 CHVQCQN-QQHKYVCSSRPTTC 621
           CHVQCQN QQHKYV SSRPTTC
Sbjct: 601 CHVQCQNQQQHKYVWSSRPTTC 618

BLAST of Sgr019225 vs. ExPASy TrEMBL
Match: A0A0A0KHU8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G519480 PE=4 SV=1)

HSP 1 Score: 1044.3 bits (2699), Expect = 2.1e-301
Identity = 572/622 (91.96%), Positives = 593/622 (95.34%), Query Frame = 0

Query: 1   MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQ 60
           MVAGKVKVAMGLQKSPASRKVESSPK ST AQPSPSSGKV QKTVFSRSFGVYFPRSSAQ
Sbjct: 1   MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQ 60

Query: 61  VQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPLLENEICTKDAEIERAS 120
           VQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVP+LENEI TKDAEIERAS
Sbjct: 61  VQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPVLENEISTKDAEIERAS 120

Query: 121 KRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLE 180
           KRILFLEAENERLRV+VEE + +++E+RRESQERIKAMEGE+AELKKMALDR+RMEL LE
Sbjct: 121 KRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILE 180

Query: 181 NDDLSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERP 240
           ND+LSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVN DNHKV+ HPEAKKEEVETERP
Sbjct: 181 NDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNQDNHKVE-HPEAKKEEVETERP 240

Query: 241 RHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTENAI 300
           RHSRCNSEELAESTLSN+KSRIPRVPKPPPKPSSSSSSS ++S+SSSS TGS  D E AI
Sbjct: 241 RHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSS-TGSSADIEKAI 300

Query: 301 PPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRR 360
           P PPPVPTK M PPPPP SKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRR
Sbjct: 301 PAPPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRR 360

Query: 361 DPGSGITDPPSTVNARDMIGEIENRSAHLLAIKTDVETQGDFIKLLIKEVENASFTDIED 420
           D GSG+T+PPST NARDMIGEIENRSAHLLAIKTDVETQGDFI+ LIKEVENASFTDIED
Sbjct: 361 DSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIED 420

Query: 421 VVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DAR 480
           VVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR DAR
Sbjct: 421 VVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDAR 480

Query: 481 QPCSSALKKMQSLLEKLEHGVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVK 540
           QPC SALKKMQ+LLEKLEHGVYNLSRMRESA KRYK F+IPVEWMLD GIVSQIKLVSVK
Sbjct: 481 QPCGSALKKMQALLEKLEHGVYNLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVK 540

Query: 541 LAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS 600
           LAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
Sbjct: 541 LAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS 600

Query: 601 CHVQCQN-QQHKYVCSSRPTTC 621
           CHVQCQN QQHKYV SSRPTTC
Sbjct: 601 CHVQCQNQQQHKYVWSSRPTTC 620

BLAST of Sgr019225 vs. ExPASy TrEMBL
Match: A0A5D3CMM2 (Protein CHUP1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G008100 PE=4 SV=1)

HSP 1 Score: 1029.6 bits (2661), Expect = 5.3e-297
Identity = 564/613 (92.01%), Positives = 585/613 (95.43%), Query Frame = 0

Query: 1   MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQ 60
           MVAGKVKVAMGLQKSPASRKVESSPK ST AQPSPSSGKV QKTVFSRSFGVYFPRSSAQ
Sbjct: 1   MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQ 60

Query: 61  VQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPLLENEICTKDAEIERAS 120
           VQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVP+LENEI TKDAEIERAS
Sbjct: 61  VQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPVLENEISTKDAEIERAS 120

Query: 121 KRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLE 180
           KRILFLEAENERLRV+VEE++ +++E+RRESQER+KAMEGEI+ELKKMALDR+RMEL LE
Sbjct: 121 KRILFLEAENERLRVQVEEVKQSVEEERRESQERMKAMEGEISELKKMALDRSRMELILE 180

Query: 181 NDDLSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERP 240
           ND+LSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVN DNHKV+ HPE KKEEVETERP
Sbjct: 181 NDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNQDNHKVE-HPEVKKEEVETERP 240

Query: 241 RHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTENAI 300
           RHSRCNSEELAESTLSN+KSRIPRVPKPPPKPSSSSSSS ++SSSSS  TGS  D E AI
Sbjct: 241 RHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSSSSS--TGSSADIEKAI 300

Query: 301 PPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRR 360
           P PPPVPTKPM PPPPP SKSAPPPPPPPPKGKR MPAKVRRIPEVVEFYHSLMRRDSRR
Sbjct: 301 PAPPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRR 360

Query: 361 DPGSGITDPPSTVNARDMIGEIENRSAHLLAIKTDVETQGDFIKLLIKEVENASFTDIED 420
           D GSG+TDPPST NARDMIGEIENRSAHLLAIKTDVETQGDFI+ LIKEVENASFTDIED
Sbjct: 361 DSGSGVTDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIED 420

Query: 421 VVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DAR 480
           VVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR DAR
Sbjct: 421 VVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDAR 480

Query: 481 QPCSSALKKMQSLLEKLEHGVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVK 540
           QPC SALKKMQ+LLEKLEHGVYNLSRMRESA KRYK F+IPVEWMLDSGIVSQIKLVSVK
Sbjct: 481 QPCGSALKKMQALLEKLEHGVYNLSRMRESAAKRYKAFQIPVEWMLDSGIVSQIKLVSVK 540

Query: 541 LAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS 600
           LAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
Sbjct: 541 LAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS 600

Query: 601 CHVQCQN-QQHKY 612
           CHVQCQN QQHK+
Sbjct: 601 CHVQCQNQQQHKF 610

BLAST of Sgr019225 vs. ExPASy TrEMBL
Match: A0A6J1EFK1 (protein CHUP1, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111433856 PE=4 SV=1)

HSP 1 Score: 1014.2 bits (2621), Expect = 2.3e-292
Identity = 560/623 (89.89%), Positives = 586/623 (94.06%), Query Frame = 0

Query: 1   MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQ 60
           MVAGKVK+AMGLQKSPASRKVESSPKPST AQPSPSSGK+ QKTVFSRSFGVYFPRSSAQ
Sbjct: 1   MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQ 60

Query: 61  VQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPLLENEICTKDAEIERAS 120
           VQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVAIVP+LENEI TKDAEIERAS
Sbjct: 61  VQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVAIVPMLENEIATKDAEIERAS 120

Query: 121 KRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLE 180
           KRILFLEAENERLRVEVEE++ +++EQRRESQER+KAMEGEIAELKKMALDR RMEL LE
Sbjct: 121 KRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILE 180

Query: 181 NDDLSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERP 240
           ND+LSASQRFQGLMEVSGKSNLIR+LKR TK SD VV  DNHKV+  PEAKKEEVETERP
Sbjct: 181 NDELSASQRFQGLMEVSGKSNLIRSLKRPTKFSDTVVTQDNHKVE-QPEAKKEEVETERP 240

Query: 241 RHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTENAI 300
           RHSR NSEELAESTLSN+KSRIPRVPKPPPKPSSSSSSS +SSSSS+S TGS  D E  I
Sbjct: 241 RHSRSNSEELAESTLSNVKSRIPRVPKPPPKPSSSSSSSATSSSSSTS-TGSSGDAEKKI 300

Query: 301 PPPPPVPTKPM-TPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSR 360
           P PPPVPTKP   PPPPP SKSAPPPPPPPPKGKR  PAKVRRIPEVVEFYHSLMRRDSR
Sbjct: 301 PAPPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSR 360

Query: 361 RDPGSGITDPPSTVNARDMIGEIENRSAHLLAIKTDVETQGDFIKLLIKEVENASFTDIE 420
           R+ GSG+T+PPS+ NARDMIGEIENRS HLLAIKTDVETQGDFI+ LIKEVENASFTDIE
Sbjct: 361 RELGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIE 420

Query: 421 DVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DA 480
           DVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFR DA
Sbjct: 421 DVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDA 480

Query: 481 RQPCSSALKKMQSLLEKLEHGVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSV 540
           RQPC SALKKMQ+LLEKLEHGVYNLSRMRESATKRYK F+IPVEWMLDSGIVSQIKLVSV
Sbjct: 481 RQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSV 540

Query: 541 KLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKAS 600
           KLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKAS
Sbjct: 541 KLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKAS 600

Query: 601 SCHVQCQNQQHKYVCSS-RPTTC 621
           SCHVQCQNQQHKYVCSS RPTTC
Sbjct: 601 SCHVQCQNQQHKYVCSSNRPTTC 621

BLAST of Sgr019225 vs. ExPASy TrEMBL
Match: A0A6J1KWU6 (protein CHUP1, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC111497116 PE=4 SV=1)

HSP 1 Score: 1011.1 bits (2613), Expect = 1.9e-291
Identity = 559/623 (89.73%), Positives = 585/623 (93.90%), Query Frame = 0

Query: 1   MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQ 60
           MVAGKVK+AMGLQKSPA RKVESSPKPST AQPSPSSGK+ QKTVFSRSFGVYFPRSSAQ
Sbjct: 1   MVAGKVKLAMGLQKSPACRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQ 60

Query: 61  VQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPLLENEICTKDAEIERAS 120
           VQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVAIVP+LENEI TKDAEIERAS
Sbjct: 61  VQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVAIVPMLENEIATKDAEIERAS 120

Query: 121 KRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLE 180
           KRILFLEAENERLRVEVEE++ +++EQRRESQER+KAMEGEIAELKKMALDR RMEL LE
Sbjct: 121 KRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILE 180

Query: 181 NDDLSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERP 240
           ND+LSASQRFQGLMEVSGKSNLIR+LKR TK SD VV  DNHKV+  PEAKKEEVETERP
Sbjct: 181 NDELSASQRFQGLMEVSGKSNLIRSLKRPTKFSDTVVTQDNHKVE-QPEAKKEEVETERP 240

Query: 241 RHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTENAI 300
           RHSR NSEELAESTLSN+KSRIPRVPKPPPKPSSSSSSS +SSSSS+S TGS  D E  I
Sbjct: 241 RHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATSSSSSTS-TGSSGDAEKKI 300

Query: 301 PPPPPVPTKPM-TPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSR 360
           P PPPVPTKP   PPPPP SKSAPPPPPPPPKGKR   AKVRRIPEVVEFYHSLMRRDSR
Sbjct: 301 PAPPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTSAKVRRIPEVVEFYHSLMRRDSR 360

Query: 361 RDPGSGITDPPSTVNARDMIGEIENRSAHLLAIKTDVETQGDFIKLLIKEVENASFTDIE 420
           R+ GSG+T+PPS+ NARDMIGEIENRSAHLLAIKTDVETQGDFI+ LIKEVENASFTDIE
Sbjct: 361 RELGSGVTEPPSSANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIE 420

Query: 421 DVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DA 480
           DVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFR DA
Sbjct: 421 DVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDA 480

Query: 481 RQPCSSALKKMQSLLEKLEHGVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSV 540
           RQPC SALKKMQ+LLEKLEHGVYNLSRMRESATKRYK F+IPVEWMLDSGIVSQIKLVSV
Sbjct: 481 RQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSV 540

Query: 541 KLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKAS 600
           KLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKAS
Sbjct: 541 KLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKAS 600

Query: 601 SCHVQCQNQQHKYVCSS-RPTTC 621
           SCHVQCQNQQHKYVCSS RPTTC
Sbjct: 601 SCHVQCQNQQHKYVCSSNRPTTC 621

BLAST of Sgr019225 vs. TAIR 10
Match: AT4G18570.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 612.8 bits (1579), Expect = 3.0e-175
Identity = 395/666 (59.31%), Positives = 461/666 (69.22%), Query Frame = 0

Query: 1   MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQP------SPSSGKV-------PQKTVFS 60
           MVAGKV+V MG  KSP+++K +  P P     P       PSSG         P K  F+
Sbjct: 1   MVAGKVRVTMGFHKSPSTKKTKDMPSPLPLPPPPPPPLKPPSSGSATTKPPINPSKPGFT 60

Query: 61  RSFGVYFPRSSAQVQPRPPD------VTELLRMVEELRDREARLKTDLLEHKLLKESVAI 120
           RSFGVYFPR+SAQV            V+EL R VEELR+REA LKT+ LE KLL+ESV++
Sbjct: 61  RSFGVYFPRASAQVHATAAAASHNGVVSELRRQVEELREREALLKTENLEVKLLRESVSV 120

Query: 121 VPLLENEICTKDAEIERASKRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGE 180
           +PLLE++I  K+ EI+   K    L  +NERLR E +      +E RRE + R K ME E
Sbjct: 121 IPLLESQIADKNGEIDELRKETARLAEDNERLRREFD----RSEEMRRECETREKEMEAE 180

Query: 181 IAELKKMALDRNRMELSLENDD--LSASQRFQGLMEVSGKSNLIRNLKRA---TKCSDAV 240
           I EL+K+        +S E+DD  LS SQRFQGLM+VS KSNLIR+LKR        + +
Sbjct: 181 IVELRKL--------VSSESDDHALSVSQRFQGLMDVSAKSNLIRSLKRVGSLRNLPEPI 240

Query: 241 VNHDNHKVDYHPEA-------KKEEVETERPRHSR-CNSEELAE-STLSNLKSRIPRVPK 300
            N +N                +K+E+E+    +SR  NSEEL E S+LS ++SR+PRVPK
Sbjct: 241 TNQENTNKSISSSGDADGDIYRKDEIES----YSRSSNSEELTESSSLSTVRSRVPRVPK 300

Query: 301 PPPKPSSSSSSSTSSSSSSSSPTGSCVDTENAIPPPPPVPTKPM--TPPPPPSSKSAPPP 360
           PPPK S S   ST + +            + +IPPPPP P  P+   PPPPPS   APPP
Sbjct: 301 PPPKRSISLGDSTENRADP--------PPQKSIPPPPPPPPPPLLQQPPPPPSVSKAPPP 360

Query: 361 PPPPPKGKRL--MPAKVRRIPEVVEFYHSLMRRD---SRRDPGSGITDPPSTV----NAR 420
           PPPPP  K L    AKVRR+PEVVEFYHSLMRRD   SRRD   G       +    NAR
Sbjct: 361 PPPPPPPKSLSIASAKVRRVPEVVEFYHSLMRRDSTNSRRDSTGGGNAAAEAILANSNAR 420

Query: 421 DMIGEIENRSAHLLAIKTDVETQGDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLV 480
           DMIGEIENRS +LLAIKTDVETQGDFI+ LIKEV NA+F+DIEDVVPFVKWLDDELSYLV
Sbjct: 421 DMIGEIENRSVYLLAIKTDVETQGDFIRFLIKEVGNAAFSDIEDVVPFVKWLDDELSYLV 480

Query: 481 DERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DARQPCSSALKKMQSLLEK 540
           DERAVLKHF+WPEQKADALREAAF Y DLKKL SEAS FR D RQ  SSALKKMQ+L EK
Sbjct: 481 DERAVLKHFEWPEQKADALREAAFCYFDLKKLISEASRFREDPRQSSSSALKKMQALFEK 540

Query: 541 LEHGVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETV 600
           LEHGVY+LSRMRESA  ++K F+IPV+WML++GI SQIKL SVKLAMKYMKRVSAELE +
Sbjct: 541 LEHGVYSLSRMRESAATKFKSFQIPVDWMLETGITSQIKLASVKLAMKYMKRVSAELEAI 600

Query: 601 -GGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQCQNQQHKYVCS 621
            GGGPEEEELIVQGVRFAFRVHQFAGGFD ETM+AF+ELRDKA SCHVQCQ+Q H++   
Sbjct: 601 EGGGPEEEELIVQGVRFAFRVHQFAGGFDAETMKAFEELRDKARSCHVQCQSQTHQHKLC 642

BLAST of Sgr019225 vs. TAIR 10
Match: AT3G25690.1 (Hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 307.8 bits (787), Expect = 2.0e-83
Identity = 204/457 (44.64%), Positives = 273/457 (59.74%), Query Frame = 0

Query: 156 KAMEGEIAELKKMALDRNRMELSLEND-----DLSASQRFQGLMEVSGKSNLIRNLKRAT 215
           K+++  + E      DR+++ +  E       D + ++RF G + +  K   ++  +   
Sbjct: 548 KSVDNVLDEKYPAYKDRHKLAVEREKHIKHKADQARAERFGGNVALPPKLAQLKEKRVVV 607

Query: 216 KCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEELAESTLSNLKSRIPRVPKPPP 275
                     +++ +   E K  E           N+  + +  L +++ R PRVP+PPP
Sbjct: 608 PSVITATGDQSNESNESNEGKASE-----------NAATVTKMKLVDIEKRPPRVPRPPP 667

Query: 276 KPSSSSSSSTSSSSSSSSPTGSCVDTENAIPPPPPVPTKPMTPPPPPSSKSAPPPPPPPP 335
           + +    S+   S+    P G          PPPP P     PPPPP     PPPPPP  
Sbjct: 668 RSAGGGKSTNLPSARPPLPGGG---------PPPPPPPPGGGPPPPPGGGPPPPPPPPGA 727

Query: 336 KGKRLMPA-KVRRIPEVVEFYHSLMRRDSRRDPGSGITDP---PSTVNARDMIGEIENRS 395
            G+      KV R PE+VEFY SLM+R+S+++    +       S+    +MIGEIENRS
Sbjct: 728 LGRGAGGGNKVHRAPELVEFYQSLMKRESKKEGAPSLISSGTGNSSAARNNMIGEIENRS 787

Query: 396 AHLLAIKTDVETQGDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQ 455
             LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF 
Sbjct: 788 TFLLAVKADVETQGDFVQSLATEVRASSFTDIEDLLAFVSWLDEELSFLVDERAVLKHFD 847

Query: 456 WPEQKADALREAAFGYCDLKKLESEASSF-RDARQPCSSALKKMQSLLEKLEHGVYNLSR 515
           WPE KADALREAAF Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R
Sbjct: 848 WPEGKADALREAAFEYQDLMKLEKQVTSFVDDPNLSCEPALKKMYKLLEKVEQSVYALLR 907

Query: 516 MRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGG---PEEE 575
            R+ A  RYK F IPV+W+ D+G+V +IKL SV+LA KYMKRV+ EL++V G    P  E
Sbjct: 908 TRDMAISRYKEFGIPVDWLSDTGVVGKIKLSSVQLAKKYMKRVAYELDSVSGSDKDPNRE 967

Query: 576 ELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS 600
            L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Sbjct: 968 FLLLQGVRFAFRVHQFAGGFDAESMKAFEELRSRAKT 984


HSP 2 Score: 37.7 bits (86), Expect = 3.9e-02
Identity = 38/155 (24.52%), Positives = 75/155 (48.39%), Query Frame = 0

Query: 67  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPLLENEICTKDAEIERASKRILFL 126
           ++  L ++V+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K  EI+  +  I  L
Sbjct: 130 ELERLKQLVKELEEREVKLEGELLEYYGLKEQESDIVELQRQLKIKTVEIDMLNITINSL 189

Query: 127 EAENERL----------RVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRME 186
           +AE ++L          R E+E  +  I E +R+ Q      +G++  LK+        E
Sbjct: 190 QAERKKLQEELSQNGIVRKELEVARNKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKE 249

Query: 187 LSLENDDLSASQRFQGLMEVSGKSNLIRNLKRATK 212
               N D    ++ + + ++  +   +  LKR  +
Sbjct: 250 EEAMNKDTEVERKLKAVQDLEVQ---VMELKRKNR 281

BLAST of Sgr019225 vs. TAIR 10
Match: AT3G25690.2 (Hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 307.8 bits (787), Expect = 2.0e-83
Identity = 204/457 (44.64%), Positives = 273/457 (59.74%), Query Frame = 0

Query: 156 KAMEGEIAELKKMALDRNRMELSLEND-----DLSASQRFQGLMEVSGKSNLIRNLKRAT 215
           K+++  + E      DR+++ +  E       D + ++RF G + +  K   ++  +   
Sbjct: 548 KSVDNVLDEKYPAYKDRHKLAVEREKHIKHKADQARAERFGGNVALPPKLAQLKEKRVVV 607

Query: 216 KCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEELAESTLSNLKSRIPRVPKPPP 275
                     +++ +   E K  E           N+  + +  L +++ R PRVP+PPP
Sbjct: 608 PSVITATGDQSNESNESNEGKASE-----------NAATVTKMKLVDIEKRPPRVPRPPP 667

Query: 276 KPSSSSSSSTSSSSSSSSPTGSCVDTENAIPPPPPVPTKPMTPPPPPSSKSAPPPPPPPP 335
           + +    S+   S+    P G          PPPP P     PPPPP     PPPPPP  
Sbjct: 668 RSAGGGKSTNLPSARPPLPGGG---------PPPPPPPPGGGPPPPPGGGPPPPPPPPGA 727

Query: 336 KGKRLMPA-KVRRIPEVVEFYHSLMRRDSRRDPGSGITDP---PSTVNARDMIGEIENRS 395
            G+      KV R PE+VEFY SLM+R+S+++    +       S+    +MIGEIENRS
Sbjct: 728 LGRGAGGGNKVHRAPELVEFYQSLMKRESKKEGAPSLISSGTGNSSAARNNMIGEIENRS 787

Query: 396 AHLLAIKTDVETQGDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQ 455
             LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF 
Sbjct: 788 TFLLAVKADVETQGDFVQSLATEVRASSFTDIEDLLAFVSWLDEELSFLVDERAVLKHFD 847

Query: 456 WPEQKADALREAAFGYCDLKKLESEASSF-RDARQPCSSALKKMQSLLEKLEHGVYNLSR 515
           WPE KADALREAAF Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R
Sbjct: 848 WPEGKADALREAAFEYQDLMKLEKQVTSFVDDPNLSCEPALKKMYKLLEKVEQSVYALLR 907

Query: 516 MRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGG---PEEE 575
            R+ A  RYK F IPV+W+ D+G+V +IKL SV+LA KYMKRV+ EL++V G    P  E
Sbjct: 908 TRDMAISRYKEFGIPVDWLSDTGVVGKIKLSSVQLAKKYMKRVAYELDSVSGSDKDPNRE 967

Query: 576 ELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS 600
            L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Sbjct: 968 FLLLQGVRFAFRVHQFAGGFDAESMKAFEELRSRAKT 984


HSP 2 Score: 37.7 bits (86), Expect = 3.9e-02
Identity = 38/155 (24.52%), Positives = 75/155 (48.39%), Query Frame = 0

Query: 67  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPLLENEICTKDAEIERASKRILFL 126
           ++  L ++V+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K  EI+  +  I  L
Sbjct: 130 ELERLKQLVKELEEREVKLEGELLEYYGLKEQESDIVELQRQLKIKTVEIDMLNITINSL 189

Query: 127 EAENERL----------RVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRME 186
           +AE ++L          R E+E  +  I E +R+ Q      +G++  LK+        E
Sbjct: 190 QAERKKLQEELSQNGIVRKELEVARNKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKE 249

Query: 187 LSLENDDLSASQRFQGLMEVSGKSNLIRNLKRATK 212
               N D    ++ + + ++  +   +  LKR  +
Sbjct: 250 EEAMNKDTEVERKLKAVQDLEVQ---VMELKRKNR 281

BLAST of Sgr019225 vs. TAIR 10
Match: AT3G25690.3 (Hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 307.8 bits (787), Expect = 2.0e-83
Identity = 204/457 (44.64%), Positives = 273/457 (59.74%), Query Frame = 0

Query: 156 KAMEGEIAELKKMALDRNRMELSLEND-----DLSASQRFQGLMEVSGKSNLIRNLKRAT 215
           K+++  + E      DR+++ +  E       D + ++RF G + +  K   ++  +   
Sbjct: 407 KSVDNVLDEKYPAYKDRHKLAVEREKHIKHKADQARAERFGGNVALPPKLAQLKEKRVVV 466

Query: 216 KCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEELAESTLSNLKSRIPRVPKPPP 275
                     +++ +   E K  E           N+  + +  L +++ R PRVP+PPP
Sbjct: 467 PSVITATGDQSNESNESNEGKASE-----------NAATVTKMKLVDIEKRPPRVPRPPP 526

Query: 276 KPSSSSSSSTSSSSSSSSPTGSCVDTENAIPPPPPVPTKPMTPPPPPSSKSAPPPPPPPP 335
           + +    S+   S+    P G          PPPP P     PPPPP     PPPPPP  
Sbjct: 527 RSAGGGKSTNLPSARPPLPGGG---------PPPPPPPPGGGPPPPPGGGPPPPPPPPGA 586

Query: 336 KGKRLMPA-KVRRIPEVVEFYHSLMRRDSRRDPGSGITDP---PSTVNARDMIGEIENRS 395
            G+      KV R PE+VEFY SLM+R+S+++    +       S+    +MIGEIENRS
Sbjct: 587 LGRGAGGGNKVHRAPELVEFYQSLMKRESKKEGAPSLISSGTGNSSAARNNMIGEIENRS 646

Query: 396 AHLLAIKTDVETQGDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQ 455
             LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF 
Sbjct: 647 TFLLAVKADVETQGDFVQSLATEVRASSFTDIEDLLAFVSWLDEELSFLVDERAVLKHFD 706

Query: 456 WPEQKADALREAAFGYCDLKKLESEASSF-RDARQPCSSALKKMQSLLEKLEHGVYNLSR 515
           WPE KADALREAAF Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R
Sbjct: 707 WPEGKADALREAAFEYQDLMKLEKQVTSFVDDPNLSCEPALKKMYKLLEKVEQSVYALLR 766

Query: 516 MRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGG---PEEE 575
            R+ A  RYK F IPV+W+ D+G+V +IKL SV+LA KYMKRV+ EL++V G    P  E
Sbjct: 767 TRDMAISRYKEFGIPVDWLSDTGVVGKIKLSSVQLAKKYMKRVAYELDSVSGSDKDPNRE 826

Query: 576 ELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS 600
            L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Sbjct: 827 FLLLQGVRFAFRVHQFAGGFDAESMKAFEELRSRAKT 843

BLAST of Sgr019225 vs. TAIR 10
Match: AT1G07120.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast envelope; EXPRESSED IN: inflorescence meristem, petal, leaf whorl, flower; EXPRESSED DURING: 4 anthesis, petal differentiation and expansion stage; BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT4G18570.1); Has 288 Blast hits to 260 proteins in 50 species: Archae - 0; Bacteria - 8; Metazoa - 27; Fungi - 15; Plants - 163; Viruses - 0; Other Eukaryotes - 75 (source: NCBI BLink). )

HSP 1 Score: 232.6 bits (592), Expect = 8.3e-61
Identity = 164/361 (45.43%), Positives = 221/361 (61.22%), Query Frame = 0

Query: 238 ERPRHSRCNSEELAESTLSNLKS-RIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDT 297
           E+  H          + +SNLKS    R      K  SS   S +  S+  +P     +T
Sbjct: 29  EKENHELRQEVARLRAQVSNLKSHENERKSMLWKKLQSSYDGSNTDGSNLKAPESVKSNT 88

Query: 298 E-NAIPPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMR 357
           +   +  P P PT         S+ + PPPPPP P  + L    VRR PEVVEFY +L +
Sbjct: 89  KGQEVRNPNPKPT-----IQGQSTATKPPPPPPLPSKRTLGKRSVRRAPEVVEFYRALTK 148

Query: 358 RDSRRDPGSGITDPPSTVNARDMIGEIENRSAHLLAIKTDVETQGDFIKLLIKEVENASF 417
           R+S            S    R+MIGEIENRS +L  IK+D +   D I +LI +VE A+F
Sbjct: 149 RESHMGNKINQNGVLSPAFNRNMIGEIENRSKYLSDIKSDTDRHRDHIHILISKVEAATF 208

Query: 418 TDIEDVVPFVKWLDDELSYLVDERAVLKHF-QWPEQKADALREAAFGYCDLKKLESEASS 477
           TDI +V  FVKW+D+ELS LVDERAVLKHF +WPE+K D+LREAA  Y   K L +E  S
Sbjct: 209 TDISEVETFVKWIDEELSSLVDERAVLKHFPKWPERKVDSLREAACNYKRPKNLGNEILS 268

Query: 478 FRD-ARQPCSSALKKMQSLLEKLEHGVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQI 537
           F+D  +   + AL+++QSL ++LE  V N  +MR+S  KRYK F+IP EWMLD+G++ Q+
Sbjct: 269 FKDNPKDSLTQALQRIQSLQDRLEESVNNTEKMRDSTGKRYKDFQIPWEWMLDTGLIGQL 328

Query: 538 KLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQEL 595
           K  S++LA +YMKR++ ELE+ G G +E  L++QGVRFA+ +HQFAGGFD ET+  F EL
Sbjct: 329 KYSSLRLAQEYMKRIAKELESNGSG-KEGNLMLQGVRFAYTIHQFAGGFDGETLSIFHEL 383
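
The Identity and Positives percentages in the HSP summary lines above are the match counts divided by the alignment length (for example 560/623 for the Cucurbita moschata hit). The following is a minimal Python sketch, assuming the summary-line layout shown on this page; the function name `hsp_percentages` and the regular expression are illustrative and not part of any BLAST tool.

```python
import re

# Pattern for the HSP summary line as printed on this page (an assumption
# about the layout). It accepts both "Positives" and the "Postives" spelling
# that appears in some BLAST report pages.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def hsp_percentages(line):
    """Return (identity_pct, positives_pct) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return (
        round(100 * int(ident) / int(ident_len), 2),
        round(100 * int(pos) / int(pos_len), 2),
    )

line = "Identity = 560/623 (89.89%), Positives = 586/623 (94.06%), Query Frame = 0"
print(hsp_percentages(line))
```

Recomputing the percentages from the counts is a quick consistency check when these lines are copied or parsed by hand.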

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_008439756.1  8.6e-302  92.12         PREDICTED: protein CHUP1, chloroplastic [Cucumis melo]
XP_004134665.1  4.3e-301  91.96         protein CHUP1, chloroplastic [Cucumis sativus] >KGN49295.1 hypothetical protein ...
XP_038883847.1  4.7e-300  91.63         protein CHUP1, chloroplastic [Benincasa hispida]
KAA0052630.1    1.1e-296  92.01         protein CHUP1 [Cucumis melo var. makuwa] >TYK13197.1 protein CHUP1 [Cucumis melo...
XP_022926872.1  4.7e-292  89.89         protein CHUP1, chloroplastic-like [Cucurbita moschata]

Match Name      E-value   Identity (%)  Description
Q9LI74          2.9e-82   44.64         Protein CHUP1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CHUP1 PE=1 SV=1

Match Name      E-value   Identity (%)  Description
A0A1S3AZH3      4.2e-302  92.12         protein CHUP1, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103484458 PE=4 SV=1
A0A0A0KHU8      2.1e-301  91.96         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G519480 PE=4 SV=1
A0A5D3CMM2      5.3e-297  92.01         Protein CHUP1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G008100...
A0A6J1EFK1      2.3e-292  89.89         protein CHUP1, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111433856 ...
A0A6J1KWU6      1.9e-291  89.73         protein CHUP1, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC111497116 PE...

Match Name      E-value   Identity (%)  Description
AT4G18570.1     3.0e-175  59.31         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G25690.1     2.0e-83   44.64         Hydroxyproline-rich glycoprotein family protein
AT3G25690.2     2.0e-83   44.64         Hydroxyproline-rich glycoprotein family protein
AT3G25690.3     2.0e-83   44.64         Hydroxyproline-rich glycoprotein family protein
AT1G07120.1     8.3e-61   45.43         FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description     Source       Source Term     Source Description                 Alignment
None       No IPR available    COILS        Coil            Coil                               coord: 116..168
None       No IPR available    COILS        Coil            Coil                               coord: 68..88
None       No IPR available    MOBIDB_LITE  mobidb-lite     disorder_prediction                coord: 298..334
None       No IPR available    MOBIDB_LITE  mobidb-lite     disorder_prediction                coord: 274..295
None       No IPR available    MOBIDB_LITE  mobidb-lite     disorder_prediction                coord: 219..339
None       No IPR available    MOBIDB_LITE  mobidb-lite     disorder_prediction                coord: 219..248
None       No IPR available    MOBIDB_LITE  mobidb-lite     disorder_prediction                coord: 17..42
None       No IPR available    MOBIDB_LITE  mobidb-lite     disorder_prediction                coord: 1..42
None       No IPR available    PANTHER      PTHR31342:SF18  PROTEIN CHUP1, CHLOROPLASTIC-LIKE  coord: 1..615
IPR040265  Protein CHUP1-like  PANTHER      PTHR31342       PROTEIN CHUP1, CHLOROPLASTIC       coord: 1..615
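
The Alignment column above gives each predicted feature as a residue range in the form `coord: start..end`. The following is a minimal Python sketch for turning those ranges into feature lengths. It assumes the coordinates are 1-based and inclusive, which this page does not state explicitly; the helper name `feature_length` and the shortened example rows are illustrative only.

```python
import re

# "coord: start..end" as printed in the Alignment column of this page.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def feature_length(row):
    """Length in residues of the range in a row, assuming inclusive ends."""
    m = COORD.search(row)
    if m is None:
        raise ValueError("no coord field in row")
    start, end = int(m.group(1)), int(m.group(2))
    return end - start + 1  # assumption: both endpoints are counted

# Shortened copies of rows from the table above (hypothetical formatting).
rows = [
    "COILS Coil coord: 116..168",
    "MOBIDB_LITE mobidb-lite coord: 219..339",
    "PANTHER PTHR31342 coord: 1..615",
]
for row in rows:
    print(row, "->", feature_length(row), "residues")
```

Because the pattern only looks for the `coord:` field, it works whether or not the other columns are separated by whitespace.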

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr019225.1   Sgr019225.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0009658      chloroplast organization
cellular_component  GO:0009707      chloroplast outer membrane
molecular_function  GO:0005525      GTP binding