Sgr017704 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr017704
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Chlorophyll a-b binding protein, chloroplastic
Location: tig00153055: 153451 .. 160378 (-)
RNA-Seq Expression: Sgr017704
Synteny: Sgr017704
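
The Location field lists a contig name, start and end coordinates, and a strand sign. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, not something this page states), the span of the gene on the contig could be computed as below. The function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(location: str):
    """Parse a string such as 'tig00153055: 153451 .. 160378 (-)'.

    Returns (contig, start, end, strand, length). Coordinates are ASSUMED
    to be 1-based and inclusive, so length = end - start + 1.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, location)
    if m is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    return contig, start, end, strand, end - start + 1

print(parse_location("tig00153055: 153451 .. 160378 (-)"))
```

Under that assumption the feature spans 6928 bases on the minus strand of tig00153055.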
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCAATGGCGCTATGTCAGCCACACACTTCACCTTCTGCAATTTCATCTTCCTCCGCCTCATTCTACGGGAGTTCTCTTCAACAATTCCGTTCACGCACCACCGCCCTTACTTTTCGAGACAGCCACTCTCTCCCCTATCGTTCCACTTGCAGAGCTTCGTGGCAGGAGGTTTGTAACATAACAAGCTGTGTATCCAACTTGAGCTGATCGCTAAAGTCCTTTTTTGTTTTCTCCATCTGGCTGTAGCTTGCGGGAGTTCTGATATTCTCCGCAATCCCCTTCACGGCAGTGAAAGCCATCGCCAATAGTCCACTCGGAGAGTCGCTCCAGAGACAATTGGAAAAGAAGAAAAAAGCTGCCGTAGCCAACTCTTCCAAGTTCAAGGCTCTAGCTGAGGAGGCTAGAAAATATAGGTAATCGGGTAAGCTAATTTGTTGATGCCCAAATACCTCTATCTTATCGCCTTGTTCCCCGAATTATTTTCAAAACCCACTATATTTTCTGTTAGAATGGTCATTTGCACCGTCGAAAATTGGTCTAACGATACACGTGATTTGGCGACGATGACCCTAGATGCGTTAATCTCAGAAAGACTTCTTAATATTTTTCTCTATTCGGATGAAGAAAGCTATGGAAACTGACATTCTATTATTATCATGACAGTGTTGGTTTGGAAATAAAGAACTGCTTGGTCTGGACTCTGGATATTATATAGTTTAAGTTTGCAGTCCTTTACTCTAGGTGGGTATCCAACAAAAATTCAAGCAGATACCTCGGGTTCAAATTTTGTCCGGCAAGACACTAAAGTTGAGGCACAACACAGTACACCCAAAAACACGTGAATGGGAGGGGTCAACAAGTGAATCATAAAGTAACTCATAACTCATGAGGAGTGTGCTCATGTAGTAAGGGGGAAGAGAAGGGTTTATATAAATGAGAAAAGTTGCAGTTAGCACACTGTCACAAAAATCGTATTAGAACTCGAAACAATAGGGAAGGTGACATTAAGGAGGTGGTGGTATTTTCTTTATTTCAATCCAGTTTTTCTCAGGTCTTCATATGCACAAAAACTGATGTAATACACCATTATATGTCAATTCATGAGCATTATCTAAACGAAACTTTTTAACAACCTTTTGAATTTGAGTTTGAATCAATGAAAAGAGGGACGAGCAGCTGAACATCTGACCTTGACCGCACTAGAGAAATCCATGTGAGGCGTGTATAATCATTGACAAGAGTAAAATTAATCTAAAACCAAAATGTGCTGGAACGTGGTAAGGCCCCCAACCATCATAGTGGGTGACATCAAAAGTTGAGGATGGATGACAAATTTTTATTAAACACAAAAGACAGCCTCCTTGTCGAGCCAAGGGACAAATAGAAAAAAGGTCATCGTCTATGTCTATATTACATGAAAAGACTAGAGACGTCATGCCAAGTCTGACTAGTCATATTTTTAGAATCAATACTTACACTTTGAATTAAAGTCTGTGTCTAAGACATATAGGACTTCAAGCTGTTTCGTGTTGTCAATCATGTTCAAAGTGAGTTAGATCTTGAAACAACACAAATGTTGATTAGCTCCGTTGTTTGTAGATTTGTCTGAATTCTGACCTTTGTTTGGGCTTGTTGCTAGGTGGATGATAAGCATGTAGTTTATAACGCCTTTCCACCATCTGACCAGGATAGCTACAATGAGTGCAAATCAGTTACTCCTTTTCTGTCCTTTACGTGCATTTCGAGCATTCACTCTACTAGATTCATTTCCAACTAAGAATGCCATTTGGTAAACTCCATTACTTACAACATACAGTTTTCTCTAATGTTCCTATTGAACAACAAGACAAATGCCCTATCGAAGAAGGTATAGGATCCATTAATAACAACTGAGCTTGAACTTGAGCAAAGAAATCATTTTGACCTGAAAGTCGATAACATACTCCATCTTGTATATATCAGTTAATGCCTTTGTGCCTTGACATACACACTTCACA
TAAGTGTTGCTAGGCCTATAAATGCTCATCTCCCCTCGGATACTTTTCAATTCAGTAAAATAAGTACTCACCAACTTTTGCTTCCATGGAAGAGTCGTCAATTCCCATCATAACTGAAAGGTTTGAAGGGCATTCTTCTTTGCGAACCTCTCTTTAAGATCCTCTCATATTTCTTCAGCTGAATGTAGGTAGATAGCACTTGTTGCTGTCTTGTTTGATATTGATTTGGGAATCCGGGAGACTACAATGTTATTGCTATGGATCCACACATTTACCAAGTGATCATTTCTAGATGGACATGCCAAGATGCTATCAATAAATCCAACCGAGTTCTTCATGGAAAGTGACATATTCATGGCTTGACACCAGGAAATGAAACTTTTTCCTGTTAGTCGCTGGGAAACAAGCACCAAACCTAGACTGTCGGAATAGTAGATATAATAGGGATTAGGGATTATTGGAGGCATCATCCACCATATAGTTATTTGCAATTGAGTAGGATTCAATTGTAATTGGACTGATTTTCAGATGCCATTAGAACACCAGGAGTAAATCCCTGGAGAATCGTATCTCTTCAAGTTGGCTGGTCTAATCGGTGATGCGGCAGAGAAGGGCATGCTGCAATGAAAGAAAAGGAAGAACTACGATAGTGAGAACATGAAGTAGACAAGCACTGCTCTGATACCACATTTGGGTGAAGAAAGCTTTGGAAACTAACTTTCTATAATTATCATGACGGTGCGATGCAACAAAGTATGAATATAGCCAAATGTCTAGCCAGCTCATAACAGAATGAAGTACCTGACTTGACCAAGTAAAACAATCTAACAACAATTGGTTAATAAAAGCAGAATTATTTACTACTATAATACTTCTTCAAGTCAATTTATTGTTCTATATATTATTAGCTTACTATTCCAATTCAGATTTTGGATATCTGTTTTATTTTTAAAGTTGCATTGTAGATTTGGGCCAGTAGTATTTCTTAAATATTCATTTCATTTGCATTTCCTTTTAGATTTCCAATGTTCCTCTTGAATTGCATCCGTCGTTTATATAAACCCTGGTTCCTGACACAATTATATTACATAAGTATGTTCATCCGAGAATCAAGAAAGGATGTCCAACCAAGAATTGTGAAGTGATGACACAGAAATTCTCTGCAAATGGTTTCAAATTTTCATGCCATAAAATAGATTTTATTATTTAGTATCTTTCATGTTCCGCAAATTTTCTTCGTTGGAGTCAAGTACATGCCTGCGATCTTTTGCTTGTACTAGTTTGCATTGTATTGAACGTGGTATTATGAATTATTGTTTTCCTAACCATGAATGTCTAACTGCCAGGAAAAAATAAACAAATAGTACATTTGAATCTAATTGCATCATATTGTAAATACCATTCTATTGGCGAATAATTGCAGCTCATGGTACGGAGAAGAGCGTCCTCGCTGGCTTGGTCCAATACCATATGATTATCCAAGATATTTGACAGGTGAACTGCCAGGTGATTATGGATTCGATATTGCAGGTCTGAGCAAGGATCCCGTGGCTTTTCAGAAGTACTTCAAGTATATTAACCTTCTTTTAGCCTAAATTTCTCTTCAGCAATCCTGTTTTTCCCCAATATGTGGAAATCATAAGTAGATTTATTAGTGCACACCATAAGAGAATATGATTATTTGATTTGTGAAAGGCCAGTAGACAAAGGGACATCATAATGCTGATTATCAATGCAGATGATGATCAAAATATAGACATCACCTTAAAATGATCCCTGAAGATTTATCCACATATTTGTTAACTATGACACAAATAACTTGCACACTGCACATCTTGACTGGTGACCATAGCTGGAAACCATAAGTAGATTTATGAGTCCACACCATAAGAATATGATTATTTGATGTAGTCTGAGTCTTGACTTGTGAATGGCCGGTAGACAAAGGGACAACATAATGATGATTATCAACGCAGATAATGAGAGAAATGGAAGGATCC
TGCAGAACATAATTTAGCCATCACCTTAAAATGATCTCTGAAGATTCATCCACATATTTGTTAGCTATGATGCATAACTTGCACACTGTACAGCTTGCTATCCAACTTTGGAGATATTTGAGCTATTCCATAAGTAATATAACTTTGCAGTCCTAAACTATCTTAGACAAGCAACTCTTATAGAATATATATTTTTTTCTTTTTTCTTTTTGCATAAATAATTCATATATTTACTCTATCATATTTCAGCTTTGAAATACTGCATGCTCGTTGGGCTATGTTGGCATCCCTTGGTGCTTTAATTCCAGAGATCCTAGATATTTTTGGAGCTTTTCATTTCACTGAACCCATCTGGTGGCGAGTAGGCTATTCAAAGCTTAGGGTTTGTTTTCCATCTCCTGGCCAACCATCTTTCTGGATATTTTATTACATGGAATCTTTTTCGTGTGTCTGCATATTTTGTTTATTATATTTTCCCCCCTCTGTCTGGCTATATTTCTTACCGGTCTTAAATTGTTGTTTATCTTACTGCTTCTGTTCAATCATGTTTTATTTTAAAGTCTGCACACCCCTCTTCCTTACTCTTTTTAGTTTTTCACGTCTGCCTTTATTAATGATATCTAATACCATAATTTGCTCAGTGTCTCATTCTAGTGCCAGGAATTATGACTTCATGTCTTTCAGTGCCCTGATAAGCTTGAGTTTGGTTGCAGCCTTTTATTTCAATCAGTCGTTCGTTGTTGTTTCGTTCCAATAAATTGGCTATATGCATTCTTGTTGTGCCATAAAAAGTGTTGCTGGAGTTGATGTCATAACACTCAGTGCTCATATGTTTGTGTTGCCCCGAGTAATCACTGTTCTGATACATATTTACACAAATTTGGTAAAGAAGTAAGAATTCTTTAATGATACTTGAACACATAACTTGCCTGGCAATTTGGAGGTTTAGATTCTCCCACTCTTTCTCTAGAACTGTCCAGCCTTCTCCTTTACAATTTTCTTACTCTATTGAACAACTTCCTAAGTAGCCAAGTACCTACCTCAATAGTAGTTTCCTAATTGTTCACAAAATGAAAATCAAACTGATAACATGCTGAAAAATACAAAGAACATCTTAACACTGGAAAAATAGTTTTATAGAGTGAGCAAAAAATTTAATCTAAATTTAACTTGGAGGAGCAACCAATATCAAAACTAGTAGAATTTGAAAGGAGAAAAAGTTGGGATTTGAGTATTACACAATTTTTTAAAGGTAGATTCGCCTGGCATCAGTACTTGATATTGAGCTGATGTTTGCTTTTCAGGATATAGCAACAACTAATAGTGATCCAAAGAAATGAATTGTGACATAATTCTAACACATAGAACTTAACTCTAGAGAGAGATTGAAAGAGGAAGGAAATGAGAGAAGAGAGTATTCTGAGATGGTGTGTTGAAAAAGGGTGGAAGAGACTCCTCTATTTGGCTTCAGGAAATGAAAGATTATAAAACAAGTTGGAGTTATAAAATCTTTGAAATTTCGAAGTTAATTTTGATAATATTAATTAATTCTAGAATTATTTTCTAAAAGTGTAGGGACGAAAATTTTCATATTCTAACATTTCATATTTCAACTCCATGGATGTGTGTAGTTCCTCATTCACTCATACTAAATATCCAGCACTACTAACTGACTTTTTTCATCCTACTTTTTGCGTTCACTTTCCACCAGGGAGATACACTCGACTATCTTGGAATACCAGGACTTCATTTAGCTGGAAGCCAAGGAGTGATTGTCATTGCAATTTGCCAAGCCATTTTAATGGTAACCTATATCAAACAATCTTATTTTCTGCCTCCTGACATTTTCTTTTTCTCATATCTATTGGGTACTTTAATTAAATGTTGGGGGAAGCGATGACATTATAGCTTGCTCTGGGTTCTACCTATGCGAAGTTGGATTGGTCAACCTCAATCCAATTCCACCCTGTTCCCTTCAAAACACAACAAATGTTTGGCTC
TGCTAGCAGCTTCTTCGCCCTTGTTCTTGTTTAGGAATTAGGACTGAGATTTTGCCTTTTGGAGATTATAACAAGTTACAAACAATCAGATAGTTTATTGTTAAGTTGAGCAAATTAAACAATTACATGTCTGATTGTTCTTACAACTGTATGACGCTAATTAATTTGTAAATAGGTTGGACCCGAGTATGCAAGATATTGTGGCATAGAGGCATTGGAGCCTCTCGGAATATACCTGCCGGGGGATATTAATTATCCAGGTGGTGTGTTGTTTGATCCCTTAAACCTGTCCAAGGATGCTGCAGCTTTTGAGGAACTGAAGGTAAAAGAGATTAAAAATGGGCGGTTAGCCATGGTTGCCTGGCTAGGATTTTACAGTCAAGCAGCATTGACGGGTAAGGGACCAGTCCAAAACCTTCTTGAACACATTTCAGATCCTTTCCACAATAACTTTCTTTCCCTGCTCAACTCTTCATAAAGTAGTTGATAACTGGAATCTGGAGTATATAATATAGTTTCTTCAAATCTGTAAAGTGTTGATAAGGGATGAACAATCTATCATATCGATATGGGTGTTCTAGATGGGTGCTTATTGATCAATGAGTTGATACTTGATTGTGTGGTTATTATATCTATCTTATATCAAGTATCTTTGAGAAGTTAATGGTGTAGTGAAAATATATTTTAAACACAAATTTTAGATCAGGTAGAAAAATGTAAGATAATGATTCAAGTAAAAGGCCGTTAATTAACTTTCGTATGGAGCTAAAAGTGAGCAGTTGCACGAATTTCTGTCAGATCAGTGATTAGTGCTTGTGAACCTACGGCTAGCGTATTAGTAGAATGGATTACCAATTGTACTAACCCGAGACATTTGTGGAAGGAACCCACTGCCCACTTTGAGACTGGAGGACTTTGCGGAGCATGA

mRNA sequence

ATGGCAATGGCGCTATGTCAGCCACACACTTCACCTTCTGCAATTTCATCTTCCTCCGCCTCATTCTACGGGAGTTCTCTTCAACAATTCCGTTCACGCACCACCGCCCTTACTTTTCGAGACAGCCACTCTCTCCCCTATCGTTCCACTTGCAGAGCTTCGTGGCAGGAGCTTGCGGGAGTTCTGATATTCTCCGCAATCCCCTTCACGGCAGTGAAAGCCATCGCCAATAGTCCACTCGGAGAGTCGCTCCAGAGACAATTGGAAAAGAAGAAAAAAGCTGCCGTAGCCAACTCTTCCAAGTTCAAGGCTCTAGCTGAGGAGGCTAGAAAATATAGCTCATGGTACGGAGAAGAGCGTCCTCGCTGGCTTGGTCCAATACCATATGATTATCCAAGATATTTGACAGGTGAACTGCCAGGTGATTATGGATTCGATATTGCAGGTCTGAGCAAGGATCCCGTGGCTTTTCAGAAGTACTTCAACTTTGAAATACTGCATGCTCGTTGGGCTATGTTGGCATCCCTTGGTGCTTTAATTCCAGAGATCCTAGATATTTTTGGAGCTTTTCATTTCACTGAACCCATCTGGTGGCGAGTAGGCTATTCAAAGCTTAGGGGAGATACACTCGACTATCTTGGAATACCAGGACTTCATTTAGCTGGAAGCCAAGGAGTGATTGTCATTGCAATTTGCCAAGCCATTTTAATGGTTGGACCCGAGTATGCAAGATATTGTGGCATAGAGGCATTGGAGCCTCTCGGAATATACCTGCCGGGGGATATTAATTATCCAGGTGGTGTGTTGTTTGATCCCTTAAACCTGTCCAAGGATGCTGCAGCTTTTGAGGAACTGAAGGTAAAAGAGATTAAAAATGGGCGGTTAGCCATGGTTGCCTGGCTAGGATTTTACAGTCAAGCAGCATTGACGGTTGCACGAATTTCTGTCAGATCAGTGATTAGTGCTTGTGAACCTACGGCTAGCGTATTAGTAGAATGGATTACCAATTGTACTAACCCGAGACATTTGTGGAAGGAACCCACTGCCCACTTTGAGACTGGAGGACTTTGCGGAGCATGA

Coding sequence (CDS)

ATGGCAATGGCGCTATGTCAGCCACACACTTCACCTTCTGCAATTTCATCTTCCTCCGCCTCATTCTACGGGAGTTCTCTTCAACAATTCCGTTCACGCACCACCGCCCTTACTTTTCGAGACAGCCACTCTCTCCCCTATCGTTCCACTTGCAGAGCTTCGTGGCAGGAGCTTGCGGGAGTTCTGATATTCTCCGCAATCCCCTTCACGGCAGTGAAAGCCATCGCCAATAGTCCACTCGGAGAGTCGCTCCAGAGACAATTGGAAAAGAAGAAAAAAGCTGCCGTAGCCAACTCTTCCAAGTTCAAGGCTCTAGCTGAGGAGGCTAGAAAATATAGCTCATGGTACGGAGAAGAGCGTCCTCGCTGGCTTGGTCCAATACCATATGATTATCCAAGATATTTGACAGGTGAACTGCCAGGTGATTATGGATTCGATATTGCAGGTCTGAGCAAGGATCCCGTGGCTTTTCAGAAGTACTTCAACTTTGAAATACTGCATGCTCGTTGGGCTATGTTGGCATCCCTTGGTGCTTTAATTCCAGAGATCCTAGATATTTTTGGAGCTTTTCATTTCACTGAACCCATCTGGTGGCGAGTAGGCTATTCAAAGCTTAGGGGAGATACACTCGACTATCTTGGAATACCAGGACTTCATTTAGCTGGAAGCCAAGGAGTGATTGTCATTGCAATTTGCCAAGCCATTTTAATGGTTGGACCCGAGTATGCAAGATATTGTGGCATAGAGGCATTGGAGCCTCTCGGAATATACCTGCCGGGGGATATTAATTATCCAGGTGGTGTGTTGTTTGATCCCTTAAACCTGTCCAAGGATGCTGCAGCTTTTGAGGAACTGAAGGTAAAAGAGATTAAAAATGGGCGGTTAGCCATGGTTGCCTGGCTAGGATTTTACAGTCAAGCAGCATTGACGGTTGCACGAATTTCTGTCAGATCAGTGATTAGTGCTTGTGAACCTACGGCTAGCGTATTAGTAGAATGGATTACCAATTGTACTAACCCGAGACATTTGTGGAAGGAACCCACTGCCCACTTTGAGACTGGAGGACTTTGCGGAGCATGA

Protein sequence

MAMALCQPHTSPSAISSSSASFYGSSLQQFRSRTTALTFRDSHSLPYRSTCRASWQELAGVLIFSAIPFTAVKAIANSPLGESLQRQLEKKKKAAVANSSKFKALAEEARKYSSWYGEERPRWLGPIPYDYPRYLTGELPGDYGFDIAGLSKDPVAFQKYFNFEILHARWAMLASLGALIPEILDIFGAFHFTEPIWWRVGYSKLRGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAWLGFYSQAALTVARISVRSVISACEPTASVLVEWITNCTNPRHLWKEPTAHFETGGLCGA
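
The protein sequence above is the conceptual translation of the coding sequence. As a minimal sketch, assuming the standard nuclear genetic code applies, the following translates a CDS codon by codon and stops at the first stop codon. The function name `translate` and the short input (the first 30 bases of the CDS shown above) are chosen for illustration only.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T map to 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the CDS listed above.
print(translate("ATGGCAATGGCGCTATGTCAGCCACACACT"))  # MAMALCQPHT
```

The result matches the first ten residues of the protein sequence listed above.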
Homology
BLAST of Sgr017704 vs. NCBI nr
Match: XP_038898449.1 (chlorophyll a-b binding protein 7, chloroplastic isoform X1 [Benincasa hispida])

HSP 1 Score: 578.6 bits (1490), Expect = 3.8e-161
Identity = 285/310 (91.94%), Positives = 298/310 (96.13%), Query Frame = 0

Query: 1   MAMALCQPHTSPSAISSSSASFYGSSLQQFRSRTTALTFRDSHSLPYRSTCRASWQELAG 60
           MAMAL QPHT   AISSSSASFYGSSLQ+ RSRT  L  R+ HS+P+RSTCRASWQELAG
Sbjct: 13  MAMALVQPHT--PAISSSSASFYGSSLQRLRSRTPTLNLRNGHSIPHRSTCRASWQELAG 72

Query: 61  VLIFSAIPFTAVKAIANSPLGESLQRQLEKKKKAAVANSSKFKALAEEARKYSSWYGEER 120
           VLIFSAIPFTAVKAIANSPLGESLQRQLEKKKK+A+ANSSKFK LAEEARK SSWYGEER
Sbjct: 73  VLIFSAIPFTAVKAIANSPLGESLQRQLEKKKKSAIANSSKFKVLAEEARKDSSWYGEER 132

Query: 121 PRWLGPIPYDYPRYLTGELPGDYGFDIAGLSKDPVAFQKYFNFEILHARWAMLASLGALI 180
           PRWLGP+PYDYP+YLTGELPGDYGFDIAGLS+DPVAFQKYFNFEILHARWAMLASLGAL+
Sbjct: 133 PRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKYFNFEILHARWAMLASLGALV 192

Query: 181 PEILDIFGAFHFTEPIWWRVGYSKLRGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGP 240
           PEILDIFGAFHFTEPIWWRVGYSKL+GDTLDYLGIPGLHLAGSQGV+VIAICQAILMVGP
Sbjct: 193 PEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVVVIAICQAILMVGP 252

Query: 241 EYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAW 300
           EYARYCGIEALEPLGI+LPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAW
Sbjct: 253 EYARYCGIEALEPLGIFLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAW 312

Query: 301 LGFYSQAALT 311
           LGFYSQAALT
Sbjct: 313 LGFYSQAALT 320
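
Each HSP summary line reports identical and positive (conservatively substituted) positions as counts over the aligned length. As a minimal sketch, the percentages can be recomputed from those counts as below. The function name `parse_hsp_stats` is hypothetical, and the regular expression deliberately accepts both "Positives" and the "Postives" spelling seen in some report renderings.

```python
import re

def parse_hsp_stats(line: str) -> dict:
    """Extract identity and positive counts from a BLAST HSP summary line."""
    ident = re.search(r"Identity\s*=\s*(\d+)/(\d+)", line)
    # 'Posi?tives' matches both 'Positives' and the 'Postives' variant.
    pos = re.search(r"Posi?tives\s*=\s*(\d+)/(\d+)", line)
    if ident is None or pos is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    n_ident, n_aligned = map(int, ident.groups())
    n_pos, _ = map(int, pos.groups())
    return {
        "aligned_length": n_aligned,
        "identity_pct": round(100 * n_ident / n_aligned, 2),
        "positives_pct": round(100 * n_pos / n_aligned, 2),
    }

stats = parse_hsp_stats("Identity = 285/310 (91.94%), Positives = 298/310 (96.13%), Query Frame = 0")
print(stats)
```

For the Benincasa hispida hit above, this reproduces the reported 91.94% identity and 96.13% positives over 310 aligned positions.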

BLAST of Sgr017704 vs. NCBI nr
Match: XP_023521109.1 (chlorophyll a-b binding protein 7, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 577.8 bits (1488), Expect = 6.5e-161
Identity = 286/310 (92.26%), Positives = 297/310 (95.81%), Query Frame = 0

Query: 1   MAMALCQPHTSPSAISSSSASFYGSSLQQFRSRTTALTFRDSHSLPYRSTCRASWQELAG 60
           M MAL QPHTSPSAISSSSASFYGSSLQQ RSRT AL  R+S S+P+R TC ASWQELAG
Sbjct: 1   MVMALFQPHTSPSAISSSSASFYGSSLQQLRSRTIALNLRNSPSIPHRFTCTASWQELAG 60

Query: 61  VLIFSAIPFTAVKAIANSPLGESLQRQLEKKKKAAVANSSKFKALAEEARKYSSWYGEER 120
           VLIFSAIPFTAVKAIANSP GESLQRQLEKKK AAVA SSKFKALAEEARK SSWYGE+R
Sbjct: 61  VLIFSAIPFTAVKAIANSPFGESLQRQLEKKKNAAVAKSSKFKALAEEARKDSSWYGEKR 120

Query: 121 PRWLGPIPYDYPRYLTGELPGDYGFDIAGLSKDPVAFQKYFNFEILHARWAMLASLGALI 180
           PRWLGP+PY+YP+YLTGELPGDYGFDIAGLS+DPVAF+KYFNFEILHARWAMLASLGAL+
Sbjct: 121 PRWLGPLPYNYPKYLTGELPGDYGFDIAGLSEDPVAFRKYFNFEILHARWAMLASLGALV 180

Query: 181 PEILDIFGAFHFTEPIWWRVGYSKLRGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGP 240
           PEILDIFGAFHFTEPIWWRVGYSKL+GDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGP
Sbjct: 181 PEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGP 240

Query: 241 EYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAW 300
           EYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAW
Sbjct: 241 EYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAW 300

Query: 301 LGFYSQAALT 311
           LGFYSQAALT
Sbjct: 301 LGFYSQAALT 310

BLAST of Sgr017704 vs. NCBI nr
Match: XP_022936160.1 (chlorophyll a-b binding protein 7, chloroplastic [Cucurbita moschata])

HSP 1 Score: 576.6 bits (1485), Expect = 1.5e-160
Identity = 285/310 (91.94%), Positives = 297/310 (95.81%), Query Frame = 0

Query: 1   MAMALCQPHTSPSAISSSSASFYGSSLQQFRSRTTALTFRDSHSLPYRSTCRASWQELAG 60
           M MAL QPHTSPSAISSSSASFYGSSLQQ RSRT AL  R+S S+P+R TC ASWQELAG
Sbjct: 1   MVMALFQPHTSPSAISSSSASFYGSSLQQLRSRTIALNLRNSPSIPHRFTCTASWQELAG 60

Query: 61  VLIFSAIPFTAVKAIANSPLGESLQRQLEKKKKAAVANSSKFKALAEEARKYSSWYGEER 120
           VLIFSAIPFTAVKAIANSP GESLQRQLEKKK AAVA SSKFKALAE+ARK SSWYGE+R
Sbjct: 61  VLIFSAIPFTAVKAIANSPFGESLQRQLEKKKNAAVAKSSKFKALAEQARKDSSWYGEKR 120

Query: 121 PRWLGPIPYDYPRYLTGELPGDYGFDIAGLSKDPVAFQKYFNFEILHARWAMLASLGALI 180
           PRWLGP+PY+YP+YLTGELPGDYGFDIAGLS+DPVAF+KYFNFEILHARWAMLASLGAL+
Sbjct: 121 PRWLGPLPYNYPKYLTGELPGDYGFDIAGLSEDPVAFRKYFNFEILHARWAMLASLGALV 180

Query: 181 PEILDIFGAFHFTEPIWWRVGYSKLRGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGP 240
           PEILDIFGAFHFTEPIWWRVGYSKL+GDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGP
Sbjct: 181 PEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGP 240

Query: 241 EYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAW 300
           EYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAW
Sbjct: 241 EYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAW 300

Query: 301 LGFYSQAALT 311
           LGFYSQAALT
Sbjct: 301 LGFYSQAALT 310

BLAST of Sgr017704 vs. NCBI nr
Match: XP_008453032.2 (PREDICTED: chlorophyll a-b binding protein of LHCII type 1 isoform X1 [Cucumis melo])

HSP 1 Score: 576.2 bits (1484), Expect = 1.9e-160
Identity = 283/310 (91.29%), Positives = 296/310 (95.48%), Query Frame = 0

Query: 1   MAMALCQPHTSPSAISSSSASFYGSSLQQFRSRTTALTFRDSHSLPYRSTCRASWQELAG 60
           + MAL QPHT  S+ISSSSASFYG+ LQQ R  TT L  R +HS+ +RSTCRASWQELAG
Sbjct: 35  IVMALIQPHTPASSISSSSASFYGTYLQQLRPLTTTLNLRSNHSISHRSTCRASWQELAG 94

Query: 61  VLIFSAIPFTAVKAIANSPLGESLQRQLEKKKKAAVANSSKFKALAEEARKYSSWYGEER 120
           VLIFSA+PFTAVKAIANSPLGESLQRQLEKKKK+AVANSSKFKALAEEARK SSWYGEER
Sbjct: 95  VLIFSAVPFTAVKAIANSPLGESLQRQLEKKKKSAVANSSKFKALAEEARKDSSWYGEER 154

Query: 121 PRWLGPIPYDYPRYLTGELPGDYGFDIAGLSKDPVAFQKYFNFEILHARWAMLASLGALI 180
           PRWLGP+PYDYP+YLTGELPGDYGFDIAGLS+DPVAFQKYFNFEILHARWAMLASLGAL+
Sbjct: 155 PRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKYFNFEILHARWAMLASLGALV 214

Query: 181 PEILDIFGAFHFTEPIWWRVGYSKLRGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGP 240
           PEILDIFGAFHFTEPIWWRVGYSKL+GDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGP
Sbjct: 215 PEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGP 274

Query: 241 EYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAW 300
           EYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAW
Sbjct: 275 EYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAW 334

Query: 301 LGFYSQAALT 311
           LGFYSQAALT
Sbjct: 335 LGFYSQAALT 344

BLAST of Sgr017704 vs. NCBI nr
Match: KAG6591186.1 (Chlorophyll a-b binding protein 7, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 575.1 bits (1481), Expect = 4.2e-160
Identity = 284/308 (92.21%), Positives = 296/308 (96.10%), Query Frame = 0

Query: 3   MALCQPHTSPSAISSSSASFYGSSLQQFRSRTTALTFRDSHSLPYRSTCRASWQELAGVL 62
           MAL QPHTSPSAISSSSASFYGSSLQQ RSRT AL  R+S S+P+R TC ASWQELAGVL
Sbjct: 1   MALFQPHTSPSAISSSSASFYGSSLQQLRSRTIALNLRNSPSIPHRFTCTASWQELAGVL 60

Query: 63  IFSAIPFTAVKAIANSPLGESLQRQLEKKKKAAVANSSKFKALAEEARKYSSWYGEERPR 122
           IFSAIPFTAVKAIANSP GESLQRQLEKKK AAVA SSKFKALAE+ARK SSWYGE+RPR
Sbjct: 61  IFSAIPFTAVKAIANSPFGESLQRQLEKKKNAAVAKSSKFKALAEQARKDSSWYGEKRPR 120

Query: 123 WLGPIPYDYPRYLTGELPGDYGFDIAGLSKDPVAFQKYFNFEILHARWAMLASLGALIPE 182
           WLGP+PY+YP+YLTGELPGDYGFDIAGLS+DPVAF+KYFNFEILHARWAMLASLGAL+PE
Sbjct: 121 WLGPLPYNYPKYLTGELPGDYGFDIAGLSEDPVAFRKYFNFEILHARWAMLASLGALVPE 180

Query: 183 ILDIFGAFHFTEPIWWRVGYSKLRGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGPEY 242
           ILDIFGAFHFTEPIWWRVGYSKL+GDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGPEY
Sbjct: 181 ILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGPEY 240

Query: 243 ARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAWLG 302
           ARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAWLG
Sbjct: 241 ARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAWLG 300

Query: 303 FYSQAALT 311
           FYSQAALT
Sbjct: 301 FYSQAALT 308

BLAST of Sgr017704 vs. ExPASy Swiss-Prot
Match: Q9C9K1 (Chlorophyll a-b binding protein 7, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=LHCB7 PE=2 SV=1)

HSP 1 Score: 462.6 bits (1189), Expect = 4.0e-129
Identity = 217/263 (82.51%), Positives = 239/263 (90.87%), Query Frame = 0

Query: 48  RSTCRASWQELAGVLIFSAIPFTAVKAIANSPLGESLQRQLEKKKKAAVANSSKFKALAE 107
           RS CRASWQELAGVL+FSAIPFTAVKAIANS +G SL+R+LE+KKK AV NSS+FK+ A+
Sbjct: 39  RSICRASWQELAGVLVFSAIPFTAVKAIANSSIGVSLRRRLEEKKKEAVENSSRFKSKAQ 98

Query: 108 EARKYSSWYGEERPRWLGPIPYDYPRYLTGELPGDYGFDIAGLSKDPVAFQKYFNFEILH 167
           EAR  S WYG+ERPRW GPIPYDYP YLTGELPGDYGFDIAGL KD + F KYFNFEILH
Sbjct: 99  EARNDSKWYGKERPRWFGPIPYDYPPYLTGELPGDYGFDIAGLGKDRLTFDKYFNFEILH 158

Query: 168 ARWAMLASLGALIPEILDIFGAFHFTEPIWWRVGYSKLRGDTLDYLGIPGLHLAGSQGVI 227
           ARWAMLA+LGALIPE+ D+ G FHF EP+WWRVGYSKL+G+TL+YLGIPGLH+AGSQGVI
Sbjct: 159 ARWAMLAALGALIPEVFDLTGTFHFAEPVWWRVGYSKLQGETLEYLGIPGLHVAGSQGVI 218

Query: 228 VIAICQAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKV 287
           VIAICQ +LMVGPEYARYCGIEALEPLGIYLPGDINYPGG LFDPLNLS+D  AFE+LKV
Sbjct: 219 VIAICQVLLMVGPEYARYCGIEALEPLGIYLPGDINYPGGTLFDPLNLSEDPVAFEDLKV 278

Query: 288 KEIKNGRLAMVAWLGFYSQAALT 311
           KEIKNGRLAMVAWLGFY+QAA T
Sbjct: 279 KEIKNGRLAMVAWLGFYAQAAFT 301

BLAST of Sgr017704 vs. ExPASy Swiss-Prot
Match: P07370 (Chlorophyll a-b binding protein 1B, chloroplastic OS=Solanum lycopersicum OX=4081 GN=CAB1B PE=3 SV=1)

HSP 1 Score: 177.9 bits (450), Expect = 2.0e-43
Identity = 100/208 (48.08%), Positives = 125/208 (60.10%), Query Frame = 0

Query: 103 KALAEEARKYSSWYGEERPRWLGPIPYDYPRYLTGELPGDYGFDIAGLSKDPVAFQKYFN 162
           KA+A+ A   S WYG +R ++LGP   + P YLTGE PGDYG+D AGLS DP  F K   
Sbjct: 37  KAVAKSAPSSSPWYGPDRVKYLGPFSGESPSYLTGEFPGDYGWDTAGLSADPETFAKNRE 96

Query: 163 FEILHARWAMLASLGALIPEILDIFGAFHFTEPIWWRVGYSKLRGDTLDYLGIPGLHLAG 222
            E++H RWAML +LG + PE+L   G   F E +W++ G        LDYLG P   L  
Sbjct: 97  LEVIHCRWAMLGALGCVFPELLARNGV-KFGEAVWFKAGSQIFSEGGLDYLGNPS--LVH 156

Query: 223 SQGVIVIAICQAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAF 282
           +Q ++ I  CQ +LM   E  R  G     PLG  +  D  YPGG  FDPL L++D  AF
Sbjct: 157 AQSILAIWACQVVLMGAVEGYRIAG----GPLGEVV--DPLYPGG-SFDPLGLAEDPEAF 216

Query: 283 EELKVKEIKNGRLAMVAWLGFYSQAALT 311
            ELKVKEIKNGRLAM +  GF+ QA +T
Sbjct: 217 AELKVKEIKNGRLAMFSMFGFFVQAIVT 234

BLAST of Sgr017704 vs. ExPASy Swiss-Prot
Match: Q9XF87 (Chlorophyll a-b binding protein 2.4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=LHCB2.4 PE=1 SV=1)

HSP 1 Score: 177.6 bits (449), Expect = 2.6e-43
Identity = 99/198 (50.00%), Positives = 122/198 (61.62%), Query Frame = 0

Query: 113 SSWYGEERPRWLGPIPYDYPRYLTGELPGDYGFDIAGLSKDPVAFQKYFNFEILHARWAM 172
           S WYG +RP++LGP   + P YLTGE PGDYG+D AGLS DP  F K    E++H+RWAM
Sbjct: 48  SIWYGPDRPKYLGPFSENTPSYLTGEYPGDYGWDTAGLSADPETFAKNRELEVIHSRWAM 107

Query: 173 LASLGALIPEILDIFGAFHFTEPIWWRVGYSKLRGDTLDYLGIPGLHLAGSQGVIVIAIC 232
           L +LG   PEIL   G   F E +W++ G        LDYLG P  +L  +Q ++ I  C
Sbjct: 108 LGALGCTFPEILSKNGV-KFGEAVWFKAGSQIFSEGGLDYLGNP--NLIHAQSILAIWAC 167

Query: 233 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKN 292
           Q +LM   E  R  G     PLG  L  D  YPGG  FDPLNL++D  AF ELKVKE+KN
Sbjct: 168 QVVLMGFIEGYRIGG----GPLGEGL--DPLYPGGA-FDPLNLAEDPEAFSELKVKELKN 227

Query: 293 GRLAMVAWLGFYSQAALT 311
           GRLAM +  GF+ QA +T
Sbjct: 228 GRLAMFSMFGFFVQAIVT 235

BLAST of Sgr017704 vs. ExPASy Swiss-Prot
Match: P14278 (Chlorophyll a-b binding protein 4, chloroplastic OS=Solanum lycopersicum OX=4081 GN=CAB4 PE=2 SV=1)

HSP 1 Score: 177.2 bits (448), Expect = 3.4e-43
Identity = 99/198 (50.00%), Positives = 120/198 (60.61%), Query Frame = 0

Query: 113 SSWYGEERPRWLGPIPYDYPRYLTGELPGDYGFDIAGLSKDPVAFQKYFNFEILHARWAM 172
           S WYGE+RP++LGP     P YLTGE PGDYG+D AGLS DP  F +    E++H RWAM
Sbjct: 47  SIWYGEDRPKYLGPFSEQTPSYLTGEFPGDYGWDTAGLSADPETFARNRELEVIHCRWAM 106

Query: 173 LASLGALIPEILDIFGAFHFTEPIWWRVGYSKLRGDTLDYLGIPGLHLAGSQGVIVIAIC 232
           L +LG + PEIL   G   F E +W++ G        LDYLG P  +L  +Q ++ I  C
Sbjct: 107 LGALGCVFPEILSKNGV-KFGEAVWFKAGSQIFSEGGLDYLGNP--NLVHAQSILAIWAC 166

Query: 233 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKN 292
           Q +LM   E  R  G     PLG  L  D  YPGG  FDPL L+ D  AF ELKVKEIKN
Sbjct: 167 QVVLMGFVEGYRVGG----GPLGEGL--DKIYPGGA-FDPLGLADDPEAFAELKVKEIKN 226

Query: 293 GRLAMVAWLGFYSQAALT 311
           GRLAM +  GF+ QA +T
Sbjct: 227 GRLAMFSMFGFFVQAIVT 234

BLAST of Sgr017704 vs. ExPASy Swiss-Prot
Match: P12470 (Chlorophyll a-b binding protein E, chloroplastic OS=Nicotiana plumbaginifolia OX=4092 GN=CABE PE=3 SV=1)

HSP 1 Score: 175.6 bits (444), Expect = 9.8e-43
Identity = 97/198 (48.99%), Positives = 120/198 (60.61%), Query Frame = 0

Query: 113 SSWYGEERPRWLGPIPYDYPRYLTGELPGDYGFDIAGLSKDPVAFQKYFNFEILHARWAM 172
           S WYG +R ++LGP   + P YLTGE PGDYG+D AGLS DP  F K    E++H RWAM
Sbjct: 48  SPWYGPDRVKYLGPFSGESPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHCRWAM 107

Query: 173 LASLGALIPEILDIFGAFHFTEPIWWRVGYSKLRGDTLDYLGIPGLHLAGSQGVIVIAIC 232
           L +LG + PE+L   G   F E +W++ G        LDYLG P   L  +Q ++ I  C
Sbjct: 108 LGALGCVFPELLARNGV-KFGEAVWFKAGSQIFSEGGLDYLGNPS--LVHAQSILAIWAC 167

Query: 233 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKN 292
           Q +LM   E  R  G    EPLG  +  D  YPGG  FDPL L++D  AF ELKVKEIKN
Sbjct: 168 QVVLMGAVEGYRVAG----EPLGEVV--DPLYPGG-SFDPLGLAEDPEAFAELKVKEIKN 227

Query: 293 GRLAMVAWLGFYSQAALT 311
           GRLAM +  GF+ QA +T
Sbjct: 228 GRLAMFSMFGFFVQALVT 235

BLAST of Sgr017704 vs. ExPASy TrEMBL
Match: A0A6J1FCV0 (Chlorophyll a-b binding protein, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111442842 PE=3 SV=1)

HSP 1 Score: 576.6 bits (1485), Expect = 7.1e-161
Identity = 285/310 (91.94%), Positives = 297/310 (95.81%), Query Frame = 0

Query: 1   MAMALCQPHTSPSAISSSSASFYGSSLQQFRSRTTALTFRDSHSLPYRSTCRASWQELAG 60
           M MAL QPHTSPSAISSSSASFYGSSLQQ RSRT AL  R+S S+P+R TC ASWQELAG
Sbjct: 1   MVMALFQPHTSPSAISSSSASFYGSSLQQLRSRTIALNLRNSPSIPHRFTCTASWQELAG 60

Query: 61  VLIFSAIPFTAVKAIANSPLGESLQRQLEKKKKAAVANSSKFKALAEEARKYSSWYGEER 120
           VLIFSAIPFTAVKAIANSP GESLQRQLEKKK AAVA SSKFKALAE+ARK SSWYGE+R
Sbjct: 61  VLIFSAIPFTAVKAIANSPFGESLQRQLEKKKNAAVAKSSKFKALAEQARKDSSWYGEKR 120

Query: 121 PRWLGPIPYDYPRYLTGELPGDYGFDIAGLSKDPVAFQKYFNFEILHARWAMLASLGALI 180
           PRWLGP+PY+YP+YLTGELPGDYGFDIAGLS+DPVAF+KYFNFEILHARWAMLASLGAL+
Sbjct: 121 PRWLGPLPYNYPKYLTGELPGDYGFDIAGLSEDPVAFRKYFNFEILHARWAMLASLGALV 180

Query: 181 PEILDIFGAFHFTEPIWWRVGYSKLRGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGP 240
           PEILDIFGAFHFTEPIWWRVGYSKL+GDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGP
Sbjct: 181 PEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGP 240

Query: 241 EYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAW 300
           EYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAW
Sbjct: 241 EYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAW 300

Query: 301 LGFYSQAALT 311
           LGFYSQAALT
Sbjct: 301 LGFYSQAALT 310

BLAST of Sgr017704 vs. ExPASy TrEMBL
Match: A0A1S3BWE1 (Chlorophyll a-b binding protein, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103493859 PE=3 SV=1)

HSP 1 Score: 576.2 bits (1484), Expect = 9.2e-161
Identity = 283/310 (91.29%), Positives = 296/310 (95.48%), Query Frame = 0

Query: 1   MAMALCQPHTSPSAISSSSASFYGSSLQQFRSRTTALTFRDSHSLPYRSTCRASWQELAG 60
           + MAL QPHT  S+ISSSSASFYG+ LQQ R  TT L  R +HS+ +RSTCRASWQELAG
Sbjct: 35  IVMALIQPHTPASSISSSSASFYGTYLQQLRPLTTTLNLRSNHSISHRSTCRASWQELAG 94

Query: 61  VLIFSAIPFTAVKAIANSPLGESLQRQLEKKKKAAVANSSKFKALAEEARKYSSWYGEER 120
           VLIFSA+PFTAVKAIANSPLGESLQRQLEKKKK+AVANSSKFKALAEEARK SSWYGEER
Sbjct: 95  VLIFSAVPFTAVKAIANSPLGESLQRQLEKKKKSAVANSSKFKALAEEARKDSSWYGEER 154

Query: 121 PRWLGPIPYDYPRYLTGELPGDYGFDIAGLSKDPVAFQKYFNFEILHARWAMLASLGALI 180
           PRWLGP+PYDYP+YLTGELPGDYGFDIAGLS+DPVAFQKYFNFEILHARWAMLASLGAL+
Sbjct: 155 PRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKYFNFEILHARWAMLASLGALV 214

Query: 181 PEILDIFGAFHFTEPIWWRVGYSKLRGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGP 240
           PEILDIFGAFHFTEPIWWRVGYSKL+GDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGP
Sbjct: 215 PEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGP 274

Query: 241 EYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAW 300
           EYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAW
Sbjct: 275 EYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAW 334

Query: 301 LGFYSQAALT 311
           LGFYSQAALT
Sbjct: 335 LGFYSQAALT 344

BLAST of Sgr017704 vs. ExPASy TrEMBL
Match: A0A6J1IGJ6 (Chlorophyll a-b binding protein, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111474607 PE=3 SV=1)

HSP 1 Score: 572.4 bits (1474), Expect = 1.3e-159
Identity = 281/310 (90.65%), Positives = 296/310 (95.48%), Query Frame = 0

Query: 1   MAMALCQPHTSPSAISSSSASFYGSSLQQFRSRTTALTFRDSHSLPYRSTCRASWQELAG 60
           M MAL  PHTSPSA+SSSSASFYGSSLQQ RSRT AL  R+S S+P+R TC ASWQELAG
Sbjct: 1   MVMALFHPHTSPSAVSSSSASFYGSSLQQLRSRTIALNLRNSPSIPHRFTCTASWQELAG 60

Query: 61  VLIFSAIPFTAVKAIANSPLGESLQRQLEKKKKAAVANSSKFKALAEEARKYSSWYGEER 120
           +LIFSAIPFTAVKAIANSP GESLQRQLEKKK AAVA SSKFKALAE+ARK SSWYGE+R
Sbjct: 61  ILIFSAIPFTAVKAIANSPFGESLQRQLEKKKNAAVAKSSKFKALAEQARKDSSWYGEKR 120

Query: 121 PRWLGPIPYDYPRYLTGELPGDYGFDIAGLSKDPVAFQKYFNFEILHARWAMLASLGALI 180
           PRWLGP+PY+YP+YLTGELPGDYGFDIAGLS+DPVAF+KYFNFEILHARWAMLASLGAL+
Sbjct: 121 PRWLGPLPYNYPKYLTGELPGDYGFDIAGLSEDPVAFRKYFNFEILHARWAMLASLGALV 180

Query: 181 PEILDIFGAFHFTEPIWWRVGYSKLRGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGP 240
           PEILDIFGAFHFTEPIWWRVGYSKL+GDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGP
Sbjct: 181 PEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGP 240

Query: 241 EYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAW 300
           EYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLS+DAAAFEELKVKEIKNGRLAMVAW
Sbjct: 241 EYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSEDAAAFEELKVKEIKNGRLAMVAW 300

Query: 301 LGFYSQAALT 311
           LGFYSQAALT
Sbjct: 301 LGFYSQAALT 310

BLAST of Sgr017704 vs. ExPASy TrEMBL
Match: A0A5A7VCG1 (Chlorophyll a-b binding protein, chloroplastic OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold255G004020 PE=3 SV=1)

HSP 1 Score: 569.3 bits (1466), Expect = 1.1e-158
Identity = 281/308 (91.23%), Positives = 293/308 (95.13%), Query Frame = 0

Query: 3   MALCQPHTSPSAISSSSASFYGSSLQQFRSRTTALTFRDSHSLPYRSTCRASWQELAGVL 62
           MAL QPHT  S+ISSSSASFYG+ LQQ R  TT L  R +HS+ +RSTCRASWQELAGVL
Sbjct: 1   MALIQPHTPASSISSSSASFYGTYLQQLRPLTTTLNLRSNHSISHRSTCRASWQELAGVL 60

Query: 63  IFSAIPFTAVKAIANSPLGESLQRQLEKKKKAAVANSSKFKALAEEARKYSSWYGEERPR 122
           IFSA+PFTAVKAIANSPLGESLQRQLEKKKK+AVANSSKFKALAEEARK SSWYGEERPR
Sbjct: 61  IFSAVPFTAVKAIANSPLGESLQRQLEKKKKSAVANSSKFKALAEEARKDSSWYGEERPR 120

Query: 123 WLGPIPYDYPRYLTGELPGDYGFDIAGLSKDPVAFQKYFNFEILHARWAMLASLGALIPE 182
           WLGP+PYDYP+YLTGELPGDYGFDIAGLS+DPVAFQKYFNFEILHARWAMLASLGAL+PE
Sbjct: 121 WLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKYFNFEILHARWAMLASLGALVPE 180

Query: 183 ILDIFGAFHFTEPIWWRVGYSKLRGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGPEY 242
           ILDIFGAFHFTEPIWWRVGYSKL+  TLDYLGIPGLHLAGSQGVIVIAICQAILMVGPEY
Sbjct: 181 ILDIFGAFHFTEPIWWRVGYSKLKVYTLDYLGIPGLHLAGSQGVIVIAICQAILMVGPEY 240

Query: 243 ARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAWLG 302
           ARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAWLG
Sbjct: 241 ARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAWLG 300

Query: 303 FYSQAALT 311
           FYSQAALT
Sbjct: 301 FYSQAALT 308

BLAST of Sgr017704 vs. ExPASy TrEMBL
Match: A0A6J1CK83 (Chlorophyll a-b binding protein, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111012208 PE=3 SV=1)

HSP 1 Score: 567.0 bits (1460), Expect = 5.6e-158
Identity = 282/310 (90.97%), Positives = 294/310 (94.84%), Query Frame = 0

Query: 1   MAMALCQPHTSPSAISSSSASFYGSSLQQFRSRTTALTFRDSHSLPYRSTCRASWQELAG 60
           MAMA+ QPHTSPSAI SSSASFYGSSLQQ R        R+S S+PYRSTC+ASWQELAG
Sbjct: 4   MAMAILQPHTSPSAIPSSSASFYGSSLQQPR----LSNLRNSRSVPYRSTCKASWQELAG 63

Query: 61  VLIFSAIPFTAVKAIANSPLGESLQRQLEKKKKAAVANSSKFKALAEEARKYSSWYGEER 120
           VLIFSAIPFTAVKAIANSPLGESLQRQ+EKKK+AA+ANSSKFKALA EARK SSWYGEER
Sbjct: 64  VLIFSAIPFTAVKAIANSPLGESLQRQMEKKKEAAIANSSKFKALAREARKDSSWYGEER 123

Query: 121 PRWLGPIPYDYPRYLTGELPGDYGFDIAGLSKDPVAFQKYFNFEILHARWAMLASLGALI 180
           PRWLGPIPY+YP+YLTG+LPGDYGFDIAGLSKDPVAFQKYFNFEILHARWAMLASLGALI
Sbjct: 124 PRWLGPIPYEYPKYLTGDLPGDYGFDIAGLSKDPVAFQKYFNFEILHARWAMLASLGALI 183

Query: 181 PEILDIFGAFHFTEPIWWRVGYSKLRGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGP 240
           PEILDI GAFHF+EPIWWRVGYSKL+GDTLDYLGIPGLHLAGSQGVIVIAICQ ILMVGP
Sbjct: 184 PEILDISGAFHFSEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQVILMVGP 243

Query: 241 EYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAW 300
           EYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAW
Sbjct: 244 EYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAW 303

Query: 301 LGFYSQAALT 311
           LGFYSQAALT
Sbjct: 304 LGFYSQAALT 309

BLAST of Sgr017704 vs. TAIR 10
Match: AT1G76570.1 (Chlorophyll A-B binding family protein )

HSP 1 Score: 462.6 bits (1189), Expect = 2.9e-130
Identity = 217/263 (82.51%), Positives = 239/263 (90.87%), Query Frame = 0

Query: 48  RSTCRASWQELAGVLIFSAIPFTAVKAIANSPLGESLQRQLEKKKKAAVANSSKFKALAE 107
           RS CRASWQELAGVL+FSAIPFTAVKAIANS +G SL+R+LE+KKK AV NSS+FK+ A+
Sbjct: 39  RSICRASWQELAGVLVFSAIPFTAVKAIANSSIGVSLRRRLEEKKKEAVENSSRFKSKAQ 98

Query: 108 EARKYSSWYGEERPRWLGPIPYDYPRYLTGELPGDYGFDIAGLSKDPVAFQKYFNFEILH 167
           EAR  S WYG+ERPRW GPIPYDYP YLTGELPGDYGFDIAGL KD + F KYFNFEILH
Sbjct: 99  EARNDSKWYGKERPRWFGPIPYDYPPYLTGELPGDYGFDIAGLGKDRLTFDKYFNFEILH 158

Query: 168 ARWAMLASLGALIPEILDIFGAFHFTEPIWWRVGYSKLRGDTLDYLGIPGLHLAGSQGVI 227
           ARWAMLA+LGALIPE+ D+ G FHF EP+WWRVGYSKL+G+TL+YLGIPGLH+AGSQGVI
Sbjct: 159 ARWAMLAALGALIPEVFDLTGTFHFAEPVWWRVGYSKLQGETLEYLGIPGLHVAGSQGVI 218

Query: 228 VIAICQAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKV 287
           VIAICQ +LMVGPEYARYCGIEALEPLGIYLPGDINYPGG LFDPLNLS+D  AFE+LKV
Sbjct: 219 VIAICQVLLMVGPEYARYCGIEALEPLGIYLPGDINYPGGTLFDPLNLSEDPVAFEDLKV 278

Query: 288 KEIKNGRLAMVAWLGFYSQAALT 311
           KEIKNGRLAMVAWLGFY+QAA T
Sbjct: 279 KEIKNGRLAMVAWLGFYAQAAFT 301

BLAST of Sgr017704 vs. TAIR 10
Match: AT3G27690.1 (photosystem II light harvesting complex gene 2.3 )

HSP 1 Score: 177.6 bits (449), Expect = 1.8e-44
Identity = 99/198 (50.00%), Positives = 122/198 (61.62%), Query Frame = 0

Query: 113 SSWYGEERPRWLGPIPYDYPRYLTGELPGDYGFDIAGLSKDPVAFQKYFNFEILHARWAM 172
           S WYG +RP++LGP   + P YLTGE PGDYG+D AGLS DP  F K    E++H+RWAM
Sbjct: 48  SIWYGPDRPKYLGPFSENTPSYLTGEYPGDYGWDTAGLSADPETFAKNRELEVIHSRWAM 107

Query: 173 LASLGALIPEILDIFGAFHFTEPIWWRVGYSKLRGDTLDYLGIPGLHLAGSQGVIVIAIC 232
           L +LG   PEIL   G   F E +W++ G        LDYLG P  +L  +Q ++ I  C
Sbjct: 108 LGALGCTFPEILSKNGV-KFGEAVWFKAGSQIFSEGGLDYLGNP--NLIHAQSILAIWAC 167

Query: 233 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKN 292
           Q +LM   E  R  G     PLG  L  D  YPGG  FDPLNL++D  AF ELKVKE+KN
Sbjct: 168 QVVLMGFIEGYRIGG----GPLGEGL--DPLYPGGA-FDPLNLAEDPEAFSELKVKELKN 227

Query: 293 GRLAMVAWLGFYSQAALT 311
           GRLAM +  GF+ QA +T
Sbjct: 228 GRLAMFSMFGFFVQAIVT 235

BLAST of Sgr017704 vs. TAIR 10
Match: AT5G54270.1 (light-harvesting chlorophyll B-binding protein 3 )

HSP 1 Score: 174.1 bits (440), Expect = 2.0e-43
Identity = 90/196 (45.92%), Positives = 116/196 (59.18%), Query Frame = 0

Query: 115 WYGEERPRWLGPIPYDYPRYLTGELPGDYGFDIAGLSKDPVAFQKYFNFEILHARWAMLA 174
           WYG +R ++LGP     P YLTGE PGDYG+D AGLS DP AF K    E++H RWAML 
Sbjct: 47  WYGPDRVKYLGPFSVQTPSYLTGEFPGDYGWDTAGLSADPEAFAKNRALEVIHGRWAMLG 106

Query: 175 SLGALIPEILDIFGAFHFTEPIWWRVGYSKLRGDTLDYLGIPGLHLAGSQGVIVIAICQA 234
           + G + PE+L  +    F EP+W++ G        LDYLG P  +L  +Q ++ +   Q 
Sbjct: 107 AFGCITPEVLQKWVRVDFKEPVWFKAGSQIFSEGGLDYLGNP--NLVHAQSILAVLGFQV 166

Query: 235 ILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGR 294
           ILM   E  R  G++ +        G+  YPGG  FDPL L+ D   F ELKVKEIKNGR
Sbjct: 167 ILMGLVEGFRINGLDGVG------EGNDLYPGGQYFDPLGLADDPVTFAELKVKEIKNGR 226

Query: 295 LAMVAWLGFYSQAALT 311
           LAM +  GF+ QA +T
Sbjct: 227 LAMFSMFGFFVQAIVT 234

BLAST of Sgr017704 vs. TAIR 10
Match: AT2G05100.1 (photosystem II light harvesting complex gene 2.1 )

HSP 1 Score: 173.7 bits (439), Expect = 2.7e-43
Identity = 98/198 (49.49%), Positives = 121/198 (61.11%), Query Frame = 0

Query: 113 SSWYGEERPRWLGPIPYDYPRYLTGELPGDYGFDIAGLSKDPVAFQKYFNFEILHARWAM 172
           S WYG +RP++LGP   + P YLTGE PGDYG+D AGLS DP  F K    E++H+RWAM
Sbjct: 47  SIWYGPDRPKYLGPFSENTPSYLTGEYPGDYGWDTAGLSADPETFAKNRELEVIHSRWAM 106

Query: 173 LASLGALIPEILDIFGAFHFTEPIWWRVGYSKLRGDTLDYLGIPGLHLAGSQGVIVIAIC 232
           L +LG   PEIL   G   F E +W++ G        LDYLG P  +L  +Q ++ I   
Sbjct: 107 LGALGCTFPEILSKNGV-KFGEAVWFKAGSQIFSEGGLDYLGNP--NLIHAQSILAIWAV 166

Query: 233 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKN 292
           Q +LM   E  R  G     PLG  L  D  YPGG  FDPLNL++D  AF ELKVKE+KN
Sbjct: 167 QVVLMGFIEGYRIGG----GPLGEGL--DPLYPGGA-FDPLNLAEDPEAFSELKVKELKN 226

Query: 293 GRLAMVAWLGFYSQAALT 311
           GRLAM +  GF+ QA +T
Sbjct: 227 GRLAMFSMFGFFVQAIVT 234

BLAST of Sgr017704 vs. TAIR 10
Match: AT2G05070.1 (photosystem II light harvesting complex gene 2.2 )

HSP 1 Score: 173.7 bits (439), Expect = 2.7e-43
Identity = 98/198 (49.49%), Positives = 121/198 (61.11%), Query Frame = 0

Query: 113 SSWYGEERPRWLGPIPYDYPRYLTGELPGDYGFDIAGLSKDPVAFQKYFNFEILHARWAM 172
           S WYG +RP++LGP   + P YLTGE PGDYG+D AGLS DP  F K    E++H+RWAM
Sbjct: 47  SIWYGPDRPKYLGPFSENTPSYLTGEYPGDYGWDTAGLSADPETFAKNRELEVIHSRWAM 106

Query: 173 LASLGALIPEILDIFGAFHFTEPIWWRVGYSKLRGDTLDYLGIPGLHLAGSQGVIVIAIC 232
           L +LG   PEIL   G   F E +W++ G        LDYLG P  +L  +Q ++ I   
Sbjct: 107 LGALGCTFPEILSKNGV-KFGEAVWFKAGSQIFSEGGLDYLGNP--NLIHAQSILAIWAV 166

Query: 233 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKN 292
           Q +LM   E  R  G     PLG  L  D  YPGG  FDPLNL++D  AF ELKVKE+KN
Sbjct: 167 QVVLMGFIEGYRIGG----GPLGEGL--DPLYPGGA-FDPLNLAEDPEAFSELKVKELKN 226

Query: 293 GRLAMVAWLGFYSQAALT 311
           GRLAM +  GF+ QA +T
Sbjct: 227 GRLAMFSMFGFFVQAIVT 234
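
The percentages on each HSP summary line follow directly from the match counts and the aligned length (for example, 217 identical positions over 263 aligned columns gives 82.51%). The sketch below is one possible way to recompute them from the text of such a line. The function name parse_hsp_line and the regular expression are illustrative assumptions, not part of any BLAST or database tooling.

```python
import re

# Hypothetical pattern for an HSP summary line as printed in the alignments.
# It accepts the label spelled either "Positives" or "Postives".
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_line(line):
    """Return match counts, aligned length, and recomputed percentages."""
    m = HSP_LINE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: " + line)
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    ident, ident_len = int(ident), int(ident_len)
    pos, pos_len = int(pos), int(pos_len)
    return {
        "identities": ident,
        "positives": pos,
        "aligned_length": ident_len,
        # Percentages are recomputed from the counts, not copied from the text.
        "pct_identity": round(100.0 * ident / ident_len, 2),
        "pct_positives": round(100.0 * pos / pos_len, 2),
    }

line = "Identity = 217/263 (82.51%), Positives = 239/263 (90.87%), Query Frame = 0"
stats = parse_hsp_line(line)
print(stats["pct_identity"], stats["pct_positives"])
```

Recomputing the values rather than trusting the printed ones gives a quick consistency check when alignments are copied between reports.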

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_038898449.1  3.8e-161  91.94         chlorophyll a-b binding protein 7, chloroplastic isoform X1 [Benincasa hispida]
XP_023521109.1  6.5e-161  92.26         chlorophyll a-b binding protein 7, chloroplastic [Cucurbita pepo subsp. pepo]
XP_022936160.1  1.5e-160  91.94         chlorophyll a-b binding protein 7, chloroplastic [Cucurbita moschata]
XP_008453032.2  1.9e-160  91.29         PREDICTED: chlorophyll a-b binding protein of LHCII type 1 isoform X1 [Cucumis m...
KAG6591186.1    4.2e-160  92.21         Chlorophyll a-b binding protein 7, chloroplastic, partial [Cucurbita argyrosperm...

Match Name      E-value   Identity (%)  Description
Q9C9K1          4.0e-129  82.51         Chlorophyll a-b binding protein 7, chloroplastic OS=Arabidopsis thaliana OX=3702...
P07370          2.0e-43   48.08         Chlorophyll a-b binding protein 1B, chloroplastic OS=Solanum lycopersicum OX=408...
Q9XF87          2.6e-43   50.00         Chlorophyll a-b binding protein 2.4, chloroplastic OS=Arabidopsis thaliana OX=37...
P14278          3.4e-43   50.00         Chlorophyll a-b binding protein 4, chloroplastic OS=Solanum lycopersicum OX=4081...
P12470          9.8e-43   48.99         Chlorophyll a-b binding protein E, chloroplastic OS=Nicotiana plumbaginifolia OX...

Match Name      E-value   Identity (%)  Description
A0A6J1FCV0      7.1e-161  91.94         Chlorophyll a-b binding protein, chloroplastic OS=Cucurbita moschata OX=3662 GN=...
A0A1S3BWE1      9.2e-161  91.29         Chlorophyll a-b binding protein, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103...
A0A6J1IGJ6      1.3e-159  90.65         Chlorophyll a-b binding protein, chloroplastic OS=Cucurbita maxima OX=3661 GN=LO...
A0A5A7VCG1      1.1e-158  91.23         Chlorophyll a-b binding protein, chloroplastic OS=Cucumis melo var. makuwa OX=11...
A0A6J1CK83      5.6e-158  90.97         Chlorophyll a-b binding protein, chloroplastic OS=Momordica charantia OX=3673 GN...

Match Name      E-value   Identity (%)  Description
AT1G76570.1     2.9e-130  82.51         Chlorophyll A-B binding family protein
AT3G27690.1     1.8e-44   50.00         photosystem II light harvesting complex gene 2.3
AT5G54270.1     2.0e-43   45.92         light-harvesting chlorophyll B-binding protein 3
AT2G05100.1     2.7e-43   49.49         photosystem II light harvesting complex gene 2.1
AT2G05070.1     2.7e-43   49.49         photosystem II light harvesting complex gene 2.2

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                                       Source       Source Term     Source Description                                Alignment
IPR022796  Chlorophyll A-B binding protein                       PFAM         PF00504         Chloroa_b-bind                                    coord: 132..307; e-value: 2.1E-41; score: 142.1
IPR023329  Chlorophyll a/b binding domain superfamily            GENE3D       1.10.3460.10    Chlorophyll a/b binding protein domain            coord: 122..327; e-value: 5.1E-59; score: 201.5
None       No IPR available                                      PANTHER      PTHR21649:SF74  CHLOROPHYLL A-B BINDING PROTEIN 7, CHLOROPLASTIC  coord: 48..310
None       No IPR available                                      SUPERFAMILY  103511          Chlorophyll a-b binding protein                   coord: 114..310
IPR001344  Chlorophyll A-B binding protein, plant and chromista  PANTHER      PTHR21649       CHLOROPHYLL A/B BINDING PROTEIN                   coord: 48..310

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr017704.1   Sgr017704.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009768 photosynthesis, light harvesting in photosystem I
biological_process GO:0018298 protein-chromophore linkage
biological_process GO:0009416 response to light stimulus
biological_process GO:0009765 photosynthesis, light harvesting
cellular_component GO:0009535 chloroplast thylakoid membrane
cellular_component GO:0009522 photosystem I
cellular_component GO:0009523 photosystem II
cellular_component GO:0016020 membrane
molecular_function GO:0016168 chlorophyll binding
molecular_function GO:0046872 metal ion binding
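
For downstream use it can help to group the GO assignments by ontology category. The following is a minimal sketch, assuming each row has the form "category accession term name" separated by single spaces, as in the table above. The helper name group_go_terms is hypothetical.

```python
from collections import defaultdict

# GO assignments copied from the table above, one row per line.
GO_TABLE = """\
biological_process GO:0009768 photosynthesis, light harvesting in photosystem I
biological_process GO:0018298 protein-chromophore linkage
biological_process GO:0009416 response to light stimulus
biological_process GO:0009765 photosynthesis, light harvesting
cellular_component GO:0009535 chloroplast thylakoid membrane
cellular_component GO:0009522 photosystem I
cellular_component GO:0009523 photosystem II
cellular_component GO:0016020 membrane
molecular_function GO:0016168 chlorophyll binding
molecular_function GO:0046872 metal ion binding
"""

def group_go_terms(text):
    """Group (accession, term name) pairs by GO category."""
    grouped = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue
        # Split only twice so that spaces inside the term name are preserved.
        category, accession, name = line.split(" ", 2)
        grouped[category].append((accession, name))
    return dict(grouped)

terms = group_go_terms(GO_TABLE)
for category, entries in terms.items():
    print(category, len(entries))
```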