Csa4G664570.1 (mRNA) Cucumber (Chinese Long) v2

Name: Csa4G664570.1
Type: mRNA
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Chlorophyll a-b-binding protein 5, chloroplastic; contains IPR022796 (Chlorophyll A-B binding protein), IPR023329 (Chlorophyll a/b binding protein domain)
Location: Chr4 : 23196232 .. 23200240 (+)
Sequence length: 825
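
The location field can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this); `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a 'chrom : start .. end (strand)' string into its parts."""
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
        "span": end - start + 1,
    }

loc = parse_location("Chr4 : 23196232 .. 23200240 (+)")
print(loc["chrom"], loc["strand"], loc["span"])
```

Under that assumption the feature spans 4009 bp on the plus strand of Chr4, which is the genomic extent including introns, not the 825 nt sequence length listed above.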
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGTTTCTCCAGTTATTTGGCTGGGAACAAGCACCAAACCTTGATTGTCGGAATGGCGGATGTGAAGGGATTGTTGGAGGTGTCATCCACCAAATAGGTATTTGCAATTGAGTCGGGTTTAATTGTAATTGAACAAGATGTTTTTGAATATCGTTAGAATTATAGGAGTAAATATCAGGAGACTCTTATCTCTTCAAGTTGGAACTACTTCGGTGAGAACATGCAGTAGATGAGCACTTCTCTGACAACACATTCAGGTGAAGGAAGTTTGTAAACTAACTTTCTACTATTATGATGAAAGTCCCATACACAAATATGGATATAGCCAAACGTCTAGCCAACTCATAACAGAATGAGGAAGCTAACTATGACCAAGTAAAACAAACTAACAACATGTAGTTAATAAAAGTAAGACTTTTGTTATTTCTTTTTAATATTTGTGAATTCTAAACCCAACATCACAAATCTTGACTCTTCTCATAGAACAACCGTCTGACTCTACTATACTTGGGTTCTATTGGCTACAGTGAAATATTTGCTAATTGCTACTATAATTCTTCTTCTAGTTAACTTATCATGAATGTTCCAATTCAAAATTTGGATACCTGTTTAATTTTTTAAGTTTCATTGTAGAATTGGGTCAGTAGTATTCTTAAATATTTATTTCATTTGGTTTCATTTGATTGTCTAGCATCTTTCATGTTCTGCAAACTTTCTTCATTCTTGAAATTGAGTACATGGCTATTATCTTTTGATCGTAGTAATTTACATTGTATTGGATGTGGTATTATAAATTATTATTTTTCCAACTATGAATGTTTAACGGCGTAAACAAATAGTACATTTTAATCCAATTGGGTTGTGTTGTAAATACTATTCTATTGGTGAATAATTGCAGCTCGTGGTATGGAGAAGAGCGTCCTCGGTGGCTTGGTCCACTACCATATGATTATCCAAAATATCTGACAGGTGAACTGCCTGGTGATTATGGATTTGATATTGCAGGTTTGAGTGAGGATCCTGTGGCTTTCCAGAAATTCTTCAAGTATATTGAGCTTCTTTAGCCTGAATTTTTTTCTCAGCAGTTCCTTTTTTCCCCAATATCTGGAATCATATGTAGATGCAATAGTGCATATCATAAGAGAGTGATTATTTTGACTTGTGAATTGGCTGGTCAACAAAGTGATGACATAGTGATGATTAGGCAGATAAGCTGTGAGAGTAACAAAATGATTCTGCACAATATAATATAGACATCACCTGAAAATGATCCCTGAATAAATATCCACATATTTGTTAGCTATGATACAAAACTCGCATTCTGCACAGCTTGCTATCCAATTTTGACGATATTTGAGCTATTCCATATGCAATATAACTTTGCAGTCCTAATTATCTCCAATAATAGCAACTCTTGTTGAATGTATCTATTTCTTTTAATTGTTTTTTTCTTTTTGAATTTTTCATCAATAATTGATCTGTTTACTCAATCTTACTTCAGCTTTGAAATATTGCATGCTCGTTGGGCTATGTTAGCATCCCTAGGTGCTTTAGTTCCAGAGATCTTGGATATTTTTGGAGCTTTTCATTTCACTGAACCCATCTGGTGGCGAGTGGGCTATTCAAAACTTAAGGTTTGTTTTCTAGCACTTGGTCAACCATTTTTTTGGAGATTTATTTCATAAATTCTTGATCGTATGTTTGCTTGTTTTGTTTCCATGTTTTTACCCTTTTGTCTGGTTATATTTCCACTGCTATCAAATTGTTGTTTTGCTCTTGTTGAATCATGTTTTATTTCAAAGTCTAGTTTTGTCTGCCTTTATTAATGAAATGAAATAATAACGATTATGGCTTCATATATTTCAGAGCCGTGGTAAGCTTGAGTTGCTGCTGCCTTACTTCTATTTCATTTAATCTTCCATTACCGTGTGATCTCGGGAGAAATAGTGGTTCATCAATTTTAATTTCATTTTAACTATCTTGTGTACTATTCCCTG
TAAACAGAGGGCTCTCCCATGTATTTGAAGAGAACTGAATCATATTCCCTTTTTTTAGAAACTCCATGGGTGTGTGTAGTTCCTCAATCCACATTCATTACCTATGACTTTCCAGCACTTGTAACTGACTCTTTTTCATGTTACTGTTTGTGTTCTCTTTCCACCAGGGAGATACACTCGACTATCTTGGAATACCAGGGCTTCACTTAGCTGGAAGCCAAGGAGTGATTGTCATTGCAATTTGCCAAGCCATTTTGATGGTAACTTATGACAAACAATCTTATTTTCTGTGGTCTGGTATTTTACTTGTCATATCTTTTGGGCAGTTTAATTAATGGCATGAATGGTTTAGGAAGTGTGACGCTATAATCTGTCCTGTTTTGGGTTTAAACCAGTGTTAAGTTGAGTTGGTTAATCTCAATCAATTCTCATCCATTCCCACAAGAACACAAGTGTTGTGGCTCCCCATGCAGTCTCTACAACGAGCACTGGCCCTTGATAAGCTTGTTGAGGAACTTGGAATTTAAATTATTCTTTTGGAAATCGATCCTTTTGGAGATTATTCAAGTTAGAAAAAAGAGACTTTTGCTGAAGAAAAAAGTTCAAAGATATAAACCTCCAGATGGGGCGAAAGGGAAAACATAATGCCTAAAGAAAAAGGATTACAAGAAACCAAAACAAGCCAAACAAAAAGAAGGAACTAAAGAATCGCCAACAATCAACAAAAAATGCAAAACTAGAAGATCGAAGTGCCAACCTTTTAGATGGCTTTCAAATGAAAGCTAAAACATTAACGGGAATCTCCTCAAATCGCTGAAGCTTCAAATAGAAACCGCACCAACCCACTTGAAACATCACACCAGAAAAGGAAGAAAAGCAGCATCTGCCATGGAGAAATGCATGAACACCAAGGGGACGAACAACCAAAACCTCTATTAAGTGCAAATCTTAACTGCAAAAAACACTAGAAGAAGCATCAGCTTCACCCAAGAACTAAAAACTGAAGAGCAAAACAAAAAATATCCACCAAAGACAAAATCAATGCAAAAAGATAATGGAACCAAAGTCCAAAATCCCATCCTCATAGATTTAATGCCATATTTCTCTAGAAAGGAATGAAGTTTTTCCTTTTCATGGAATGTTTTGATGAGAGGGGGGAGATAAAGAACTATCTGACTGCTACTTTAAAGGTAAATTAATATTGTCTTCTTGAAAAATAGAACAAAGAGCTTCGTTAAAATCCTCGTCATCAGGGGCCTCGTCAGTCGTCAGTCGTAAGAAACCTTTTCATGCGATTAACAATTGGAAAATTGAACTCCAAGTCATAACTACTAAGACTTATTCGATTCCTCACTTTTCTTCATAAAAAAATAACTTTTTGGTGATCCCATTGAGTTATATAATATCCATAGAAAGAGTGTTATTAACTAAGGATGTTGAAGAATTAGATAGTCTATTGTTAAGAGAGCGTATTAAACTATTATATGTCTGATTGTTCTTACAACTGTGTGATGTAAGTTAAAATGTAAATAGGTTGGACCCGAATATGCAAGATATTGTGGCATAGAGGCATTGGAGCCTCTGGGAATATATCTGCCGGGGGATATAAATTATCCAGGTGGTGTGTTGTTTGATCCCTTGAATCTGTCCAAGGATGCTGCAGCCTTTGAGGAACTAAAAGTAAAAGAGATTAAAAATGGGCGTTTAGCCATGGTTGCCTGGCTAGGATTTTACAGTCAAGCTGCTTTGACGGGTAAGGGACCAGTCCAAAACCTTCTTGACCACATTGCAGATCCTTTCCACAATAACTTTCTTTCCCTGCTCAATTCTTCATTTAGTCGTTAATAACTGGAACCTGGATAAAGTATAGTGTCGTCAAATCTGTATAGTGTTCATAAGGGATGGATAATCTGTCATAGCTGTTTTAGACGTGAGCTAGTTGCTCAGTAAGTTAATACTTGTGATGTGGTATTATATCTTTCTATCTTATATCAAGTATC
TTTGAGAAC

mRNA sequence

ATGAAGTTTCTCCAGTTATTTGGCTGGGAACAAGCACCAAACCTTGATTGTCGGAATGGCGGATGTGAAGGGATTGTTGGAGGTGTCATCCACCAAATAGAACATGCAGTAGATGAGCACTTCTCTGACAACACATTCAGCTCGTGGTATGGAGAAGAGCGTCCTCGGTGGCTTGGTCCACTACCATATGATTATCCAAAATATCTGACAGGTGAACTGCCTGGTGATTATGGATTTGATATTGCAGGTTTGAGTGAGGATCCTGTGGCTTTCCAGAAATTCTTCAACTTTGAAATATTGCATGCTCGTTGGGCTATGTTAGCATCCCTAGGTGCTTTAGTTCCAGAGATCTTGGATATTTTTGGAGCTTTTCATTTCACTGAACCCATCTGGTGGCGAGTGGGCTATTCAAAACTTAAGGGAGATACACTCGACTATCTTGGAATACCAGGGCTTCACTTAGCTGGAAGCCAAGGAGTGATTGTCATTGCAATTTGCCAAGCCATTTTGATGGTTGGACCCGAATATGCAAGATATTGTGGCATAGAGGCATTGGAGCCTCTGGGAATATATCTGCCGGGGGATATAAATTATCCAGGTGGTGTGTTGTTTGATCCCTTGAATCTGTCCAAGGATGCTGCAGCCTTTGAGGAACTAAAAGTAAAAGAGATTAAAAATGGGCGTTTAGCCATGGTTGCCTGGCTAGGATTTTACAGTCAAGCTGCTTTGACGGGTAAGGGACCAGTCCAAAACCTTCTTGACCACATTGCAGATCCTTTCCACAATAACTTTCTTTCCCTGCTCAATTCTTCATTTAGTCGTTAA

Coding sequence (CDS)

ATGAAGTTTCTCCAGTTATTTGGCTGGGAACAAGCACCAAACCTTGATTGTCGGAATGGCGGATGTGAAGGGATTGTTGGAGGTGTCATCCACCAAATAGAACATGCAGTAGATGAGCACTTCTCTGACAACACATTCAGCTCGTGGTATGGAGAAGAGCGTCCTCGGTGGCTTGGTCCACTACCATATGATTATCCAAAATATCTGACAGGTGAACTGCCTGGTGATTATGGATTTGATATTGCAGGTTTGAGTGAGGATCCTGTGGCTTTCCAGAAATTCTTCAACTTTGAAATATTGCATGCTCGTTGGGCTATGTTAGCATCCCTAGGTGCTTTAGTTCCAGAGATCTTGGATATTTTTGGAGCTTTTCATTTCACTGAACCCATCTGGTGGCGAGTGGGCTATTCAAAACTTAAGGGAGATACACTCGACTATCTTGGAATACCAGGGCTTCACTTAGCTGGAAGCCAAGGAGTGATTGTCATTGCAATTTGCCAAGCCATTTTGATGGTTGGACCCGAATATGCAAGATATTGTGGCATAGAGGCATTGGAGCCTCTGGGAATATATCTGCCGGGGGATATAAATTATCCAGGTGGTGTGTTGTTTGATCCCTTGAATCTGTCCAAGGATGCTGCAGCCTTTGAGGAACTAAAAGTAAAAGAGATTAAAAATGGGCGTTTAGCCATGGTTGCCTGGCTAGGATTTTACAGTCAAGCTGCTTTGACGGGTAAGGGACCAGTCCAAAACCTTCTTGACCACATTGCAGATCCTTTCCACAATAACTTTCTTTCCCTGCTCAATTCTTCATTTAGTCGTTAA

Protein sequence

MKFLQLFGWEQAPNLDCRNGGCEGIVGGVIHQIEHAVDEHFSDNTFSSWYGEERPRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAMLASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAWLGFYSQAALTGKGPVQNLLDHIADPFHNNFLSLLNSSFSR*
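
The relationship between the CDS and the protein sequence can be checked with a short translation sketch. It assumes the standard genetic code and reading frame 1; `translate` is an illustrative helper, and only the first 30 and last 9 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

print(translate("ATGAAGTTTCTCCAGTTATTTGGCTGGGAA"))  # first 30 nt -> MKFLQLFGWE
print(translate("AGTCGTTAA"))                       # last 9 nt  -> SR*
print(825 // 3 - 1)                                 # 275 codons minus the stop -> 274
```

The 825 nt CDS therefore corresponds to 274 residues plus a stop codon, which agrees with the protein sequence above.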
BLAST of Csa4G664570.1 vs. Swiss-Prot
Match: CB2_CHLMO (Chlorophyll a-b binding protein of LHCII type I, chloroplastic OS=Chlamydomonas moewusii PE=2 SV=1)

HSP 1 Score: 203.4 bits (516), Expect = 3.3e-51
Identity = 105/215 (48.84%), Positives = 129/215 (60.00%), Query Frame = 1

Query: 49  WYGEERPRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAMLA 108
           WYG +R +WLGP   + P YLTGE PGDYG+D AGLS DP  F+K+   E++HARWA+L 
Sbjct: 39  WYGPDRAKWLGPFSTNTPAYLTGEFPGDYGWDTAGLSADPETFKKYRELEVIHARWALLG 98

Query: 109 SLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQA 168
           +LG L PE+L  +    F EP+W++ G        LDYLG P   L  +Q ++     Q 
Sbjct: 99  ALGILTPELLSTYAGVKFGEPVWFKAGAQIFSEGGLDYLGSPA--LIHAQNIVATLAVQV 158

Query: 169 ILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGR 228
           +LM   E  R  G  A E L    PG+        FDPL L+ D   F ELKVKEIKNGR
Sbjct: 159 VLMGLIEGYRVNGGPAGEGLDPLYPGE-------SFDPLGLADDPDTFAELKVKEIKNGR 218

Query: 229 LAMVAWLGFYSQAALTGKGPVQNLLDHIADPFHNN 264
           LAM +  GF+ QA +TGKGP+QNL DH+ADP  NN
Sbjct: 219 LAMFSCFGFFVQAIVTGKGPIQNLADHLADPGTNN 244
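
The percentages in each HSP header are match counts divided by the alignment length, gaps included. A minimal sketch of that arithmetic follows; both function names are illustrative and are not taken from any BLAST tool.

```python
def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage with 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)

def count_identities(query, sbjct):
    """Count identical, non-gap columns in two equal-length aligned strings."""
    if len(query) != len(sbjct):
        raise ValueError("aligned strings must have equal length")
    return sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")

# Figures from the CB2_CHLMO header above.
print(percent(105, 215))  # 48.84
print(percent(129, 215))  # 60.0

# First 12 aligned columns of the same HSP.
print(count_identities("WYGEERPRWLGP", "WYGPDRAKWLGP"))  # 8
```

The same calculation reproduces the identity column in the BLAST summary tables further down the record.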

BLAST of Csa4G664570.1 vs. Swiss-Prot
Match: CB25_NICPL (Chlorophyll a-b binding protein E, chloroplastic OS=Nicotiana plumbaginifolia GN=CABE PE=3 SV=1)

HSP 1 Score: 203.0 bits (515), Expect = 4.3e-51
Identity = 110/217 (50.69%), Positives = 135/217 (62.21%), Query Frame = 1

Query: 47  SSWYGEERPRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAM 106
           S WYG +R ++LGP   + P YLTGE PGDYG+D AGLS DP  F K    E++H RWAM
Sbjct: 48  SPWYGPDRVKYLGPFSGESPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHCRWAM 107

Query: 107 LASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC 166
           L +LG + PE+L   G   F E +W++ G        LDYLG P   L  +Q ++ I  C
Sbjct: 108 LGALGCVFPELLARNGV-KFGEAVWFKAGSQIFSEGGLDYLGNPS--LVHAQSILAIWAC 167

Query: 167 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKN 226
           Q +LM   E  R  G    EPLG  +  D  YPGG  FDPL L++D  AF ELKVKEIKN
Sbjct: 168 QVVLMGAVEGYRVAG----EPLGEVV--DPLYPGG-SFDPLGLAEDPEAFAELKVKEIKN 227

Query: 227 GRLAMVAWLGFYSQAALTGKGPVQNLLDHIADPFHNN 264
           GRLAM +  GF+ QA +TGKGP++NL DH+ADP +NN
Sbjct: 228 GRLAMFSMFGFFVQALVTGKGPLENLADHLADPVNNN 254

BLAST of Csa4G664570.1 vs. Swiss-Prot
Match: CB21_GOSHI (Chlorophyll a-b binding protein 151, chloroplastic OS=Gossypium hirsutum GN=CAB-151 PE=2 SV=2)

HSP 1 Score: 202.2 bits (513), Expect = 7.3e-51
Identity = 112/217 (51.61%), Positives = 133/217 (61.29%), Query Frame = 1

Query: 47  SSWYGEERPRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAM 106
           S WYG +RP++LGP     P YLTGE PGDYG+D AGLS DP  F K    E++H RWAM
Sbjct: 47  SIWYGPDRPKYLGPFSDQIPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHCRWAM 106

Query: 107 LASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC 166
           L +LG + PEIL   G   F E +W++ G        LDYLG P  +L  +Q ++ I  C
Sbjct: 107 LGALGCVFPEILSKNGV-KFGEAVWFKAGSQIFSEGGLDYLGNP--NLIHAQSILAIWAC 166

Query: 167 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKN 226
           Q +LM   E  R  G     PLG  L  D  YPGG  FDPL L+ D  AF ELKVKEIKN
Sbjct: 167 QVVLMGFVEGYRVGG----GPLGEGL--DPIYPGGA-FDPLGLADDPDAFAELKVKEIKN 226

Query: 227 GRLAMVAWLGFYSQAALTGKGPVQNLLDHIADPFHNN 264
           GRLAM +  GF+ QA +TGKGP++NL DH+ADP  NN
Sbjct: 227 GRLAMFSMFGFFVQAIVTGKGPIENLFDHLADPVANN 253

BLAST of Csa4G664570.1 vs. Swiss-Prot
Match: CB24_SOLLC (Chlorophyll a-b binding protein 4, chloroplastic OS=Solanum lycopersicum GN=CAB4 PE=2 SV=1)

HSP 1 Score: 201.8 bits (512), Expect = 9.5e-51
Identity = 112/217 (51.61%), Positives = 133/217 (61.29%), Query Frame = 1

Query: 47  SSWYGEERPRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAM 106
           S WYGE+RP++LGP     P YLTGE PGDYG+D AGLS DP  F +    E++H RWAM
Sbjct: 47  SIWYGEDRPKYLGPFSEQTPSYLTGEFPGDYGWDTAGLSADPETFARNRELEVIHCRWAM 106

Query: 107 LASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC 166
           L +LG + PEIL   G   F E +W++ G        LDYLG P  +L  +Q ++ I  C
Sbjct: 107 LGALGCVFPEILSKNGV-KFGEAVWFKAGSQIFSEGGLDYLGNP--NLVHAQSILAIWAC 166

Query: 167 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKN 226
           Q +LM   E  R  G     PLG  L  D  YPGG  FDPL L+ D  AF ELKVKEIKN
Sbjct: 167 QVVLMGFVEGYRVGG----GPLGEGL--DKIYPGGA-FDPLGLADDPEAFAELKVKEIKN 226

Query: 227 GRLAMVAWLGFYSQAALTGKGPVQNLLDHIADPFHNN 264
           GRLAM +  GF+ QA +TGKGP++NL DHI DP  NN
Sbjct: 227 GRLAMFSMFGFFVQAIVTGKGPIENLSDHINDPVANN 253

BLAST of Csa4G664570.1 vs. Swiss-Prot
Match: CB23_NICPL (Chlorophyll a-b binding protein C, chloroplastic OS=Nicotiana plumbaginifolia GN=CABC PE=3 SV=1)

HSP 1 Score: 201.1 bits (510), Expect = 1.6e-50
Identity = 110/217 (50.69%), Positives = 134/217 (61.75%), Query Frame = 1

Query: 47  SSWYGEERPRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAM 106
           S WYG  R ++LGP   + P YLTGE PGDYG+D AGLS DP  F K    E++H RWAM
Sbjct: 49  SPWYGPNRVKYLGPFSGESPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHCRWAM 108

Query: 107 LASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC 166
           L +LG + PE+L   G   F E +W++ G        LDYLG P   L  +Q ++ I  C
Sbjct: 109 LGALGCVFPELLARNGV-KFGEAVWFKGGSQIFSQGGLDYLGNPS--LVHAQSILAIWAC 168

Query: 167 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKN 226
           Q +LM   E  R  G    EPLG  +  D  YPGG  FDPL L++D  AF ELKVKEIKN
Sbjct: 169 QVVLMGAVEGYRVAG----EPLGEVV--DPLYPGG-SFDPLGLAEDPEAFAELKVKEIKN 228

Query: 227 GRLAMVAWLGFYSQAALTGKGPVQNLLDHIADPFHNN 264
           GRLAM +  GF+ QA +TGKGP++NL DH+ADP +NN
Sbjct: 229 GRLAMFSMFGFFVQAIVTGKGPLENLADHLADPVNNN 255

BLAST of Csa4G664570.1 vs. TrEMBL
Match: A0A0A0L0U5_CUCSA (Chlorophyll a-b binding protein, chloroplastic OS=Cucumis sativus GN=Csa_4G664570 PE=3 SV=1)

HSP 1 Score: 576.2 bits (1484), Expect = 2.1e-161
Identity = 274/274 (100.00%), Positives = 274/274 (100.00%), Query Frame = 1

Query: 1   MKFLQLFGWEQAPNLDCRNGGCEGIVGGVIHQIEHAVDEHFSDNTFSSWYGEERPRWLGP 60
           MKFLQLFGWEQAPNLDCRNGGCEGIVGGVIHQIEHAVDEHFSDNTFSSWYGEERPRWLGP
Sbjct: 1   MKFLQLFGWEQAPNLDCRNGGCEGIVGGVIHQIEHAVDEHFSDNTFSSWYGEERPRWLGP 60

Query: 61  LPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAMLASLGALVPEILDI 120
           LPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAMLASLGALVPEILDI
Sbjct: 61  LPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAMLASLGALVPEILDI 120

Query: 121 FGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGPEYARYC 180
           FGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGPEYARYC
Sbjct: 121 FGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGPEYARYC 180

Query: 181 GIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAWLGFYSQ 240
           GIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAWLGFYSQ
Sbjct: 181 GIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAWLGFYSQ 240

Query: 241 AALTGKGPVQNLLDHIADPFHNNFLSLLNSSFSR 275
           AALTGKGPVQNLLDHIADPFHNNFLSLLNSSFSR
Sbjct: 241 AALTGKGPVQNLLDHIADPFHNNFLSLLNSSFSR 274

BLAST of Csa4G664570.1 vs. TrEMBL
Match: M5WMP2_PRUPE (Chlorophyll a-b binding protein, chloroplastic OS=Prunus persica GN=PRUPE_ppa008546mg PE=3 SV=1)

HSP 1 Score: 429.9 bits (1104), Expect = 2.4e-117
Identity = 197/222 (88.74%), Positives = 211/222 (95.05%), Query Frame = 1

Query: 47  SSWYGEERPRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAM 106
           S WYGEERPRWLGP+PYDYP YLTGELPGDYGFD+ GLS DPVAFQK+FNFEILHARWAM
Sbjct: 103 SLWYGEERPRWLGPIPYDYPAYLTGELPGDYGFDVVGLSRDPVAFQKYFNFEILHARWAM 162

Query: 107 LASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC 166
           LA+LGAL+PEILD+ GAFHF EP+WWRVGYSKLKGDTLDYLGIPGLH+AGSQGVIVIAIC
Sbjct: 163 LAALGALIPEILDLLGAFHFVEPVWWRVGYSKLKGDTLDYLGIPGLHVAGSQGVIVIAIC 222

Query: 167 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKN 226
           QA+LMVGPEYARYCGIEALEPLGIYLPGD+NYPGG LFDPLNLS+D  AFEELKVKEIKN
Sbjct: 223 QALLMVGPEYARYCGIEALEPLGIYLPGDVNYPGGELFDPLNLSRDPVAFEELKVKEIKN 282

Query: 227 GRLAMVAWLGFYSQAALTGKGPVQNLLDHIADPFHNNFLSLL 269
           GRLAMVAWLGFYSQAALTGKGPVQNLL+HI+DP HNN LS+L
Sbjct: 283 GRLAMVAWLGFYSQAALTGKGPVQNLLEHISDPLHNNVLSVL 324

BLAST of Csa4G664570.1 vs. TrEMBL
Match: D7UDH1_VITVI (Chlorophyll a-b binding protein, chloroplastic OS=Vitis vinifera GN=VIT_18s0122g00430 PE=3 SV=1)

HSP 1 Score: 429.1 bits (1102), Expect = 4.1e-117
Identity = 197/224 (87.95%), Positives = 209/224 (93.30%), Query Frame = 1

Query: 47  SSWYGEERPRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAM 106
           S WYGEERPRWLGP+PYDYP YLTGELPGDYGFDIAGL +DPVAFQK+FNFEILHARWAM
Sbjct: 107 SLWYGEERPRWLGPIPYDYPAYLTGELPGDYGFDIAGLGKDPVAFQKYFNFEILHARWAM 166

Query: 107 LASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC 166
           LA+LGAL+PE+LD+ GAFHF EP+WWRVGYSKLKGDTLDYLGIPG H AGSQGVIVIAIC
Sbjct: 167 LAALGALLPELLDLLGAFHFVEPVWWRVGYSKLKGDTLDYLGIPGFHFAGSQGVIVIAIC 226

Query: 167 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKN 226
           QA+LMVGPEYARYCGIEALEPLGIYLPGDINYPGG LFDPLNLSKD  AFEELKVKEIKN
Sbjct: 227 QALLMVGPEYARYCGIEALEPLGIYLPGDINYPGGALFDPLNLSKDPVAFEELKVKEIKN 286

Query: 227 GRLAMVAWLGFYSQAALTGKGPVQNLLDHIADPFHNNFLSLLNS 271
           GRLAMVAWLGFY QAA TGKGPVQNLLDH+ADPFHNN LS+  +
Sbjct: 287 GRLAMVAWLGFYIQAAATGKGPVQNLLDHLADPFHNNLLSIFKA 330

BLAST of Csa4G664570.1 vs. TrEMBL
Match: A0A067KWW2_JATCU (Chlorophyll a-b binding protein, chloroplastic OS=Jatropha curcas GN=JCGZ_08016 PE=3 SV=1)

HSP 1 Score: 428.3 bits (1100), Expect = 6.9e-117
Identity = 197/222 (88.74%), Positives = 212/222 (95.50%), Query Frame = 1

Query: 47  SSWYGEERPRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAM 106
           S WYGEERP WLGP+PYDYP+YLTGELPGDYGFD+AGLS+DPVAFQ++FNFEILHARWAM
Sbjct: 113 SLWYGEERPHWLGPIPYDYPQYLTGELPGDYGFDVAGLSKDPVAFQRYFNFEILHARWAM 172

Query: 107 LASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC 166
           LA+LGALVPE+LD+ GAFHF EP+WWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC
Sbjct: 173 LAALGALVPELLDLSGAFHFIEPVWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC 232

Query: 167 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKN 226
           QA+LMVGPEYARYCGIEALEPLGIYLPGDINYPGG LFDPLNLS D  AFEELKVKEIKN
Sbjct: 233 QALLMVGPEYARYCGIEALEPLGIYLPGDINYPGGALFDPLNLSGDPVAFEELKVKEIKN 292

Query: 227 GRLAMVAWLGFYSQAALTGKGPVQNLLDHIADPFHNNFLSLL 269
           GRLAMVAWLGFY+QAALTGKGPVQNLL+HI+DPFHNN  S+L
Sbjct: 293 GRLAMVAWLGFYAQAALTGKGPVQNLLEHISDPFHNNLCSVL 334

BLAST of Csa4G664570.1 vs. TrEMBL
Match: A0A059BPY7_EUCGR (Chlorophyll a-b binding protein, chloroplastic OS=Eucalyptus grandis GN=EUGRSUZ_F01826 PE=3 SV=1)

HSP 1 Score: 424.5 bits (1090), Expect = 1.0e-115
Identity = 194/222 (87.39%), Positives = 209/222 (94.14%), Query Frame = 1

Query: 47  SSWYGEERPRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAM 106
           SSWYGE+RP+WLGP+PYDYP YLTGE PGDYGFDIAGL+ DP AF+K+FNFEILHARWAM
Sbjct: 26  SSWYGEDRPQWLGPIPYDYPAYLTGEYPGDYGFDIAGLARDPTAFEKYFNFEILHARWAM 85

Query: 107 LASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC 166
           LA+LGALVPE+LD+ GAFHF EP+WWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC
Sbjct: 86  LAALGALVPEVLDMVGAFHFVEPVWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC 145

Query: 167 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKN 226
           QA+LMVGPEYARYCGIEALEPLGIYLPGDINYPGG LFDPLNLSKD A FEELKVKEIKN
Sbjct: 146 QALLMVGPEYARYCGIEALEPLGIYLPGDINYPGGALFDPLNLSKDPATFEELKVKEIKN 205

Query: 227 GRLAMVAWLGFYSQAALTGKGPVQNLLDHIADPFHNNFLSLL 269
           GRLAMVAWLGFY QAALTGKGP+QNLL+HI+DP HNN  S+L
Sbjct: 206 GRLAMVAWLGFYVQAALTGKGPIQNLLEHISDPLHNNLFSVL 247

BLAST of Csa4G664570.1 vs. TAIR10
Match: AT1G76570.1 (AT1G76570.1 Chlorophyll A-B binding family protein)

HSP 1 Score: 406.0 bits (1042), Expect = 1.9e-113
Identity = 180/224 (80.36%), Positives = 203/224 (90.62%), Query Frame = 1

Query: 47  SSWYGEERPRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAM 106
           S WYG+ERPRW GP+PYDYP YLTGELPGDYGFDIAGL +D + F K+FNFEILHARWAM
Sbjct: 104 SKWYGKERPRWFGPIPYDYPPYLTGELPGDYGFDIAGLGKDRLTFDKYFNFEILHARWAM 163

Query: 107 LASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC 166
           LA+LGAL+PE+ D+ G FHF EP+WWRVGYSKL+G+TL+YLGIPGLH+AGSQGVIVIAIC
Sbjct: 164 LAALGALIPEVFDLTGTFHFAEPVWWRVGYSKLQGETLEYLGIPGLHVAGSQGVIVIAIC 223

Query: 167 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKN 226
           Q +LMVGPEYARYCGIEALEPLGIYLPGDINYPGG LFDPLNLS+D  AFE+LKVKEIKN
Sbjct: 224 QVLLMVGPEYARYCGIEALEPLGIYLPGDINYPGGTLFDPLNLSEDPVAFEDLKVKEIKN 283

Query: 227 GRLAMVAWLGFYSQAALTGKGPVQNLLDHIADPFHNNFLSLLNS 271
           GRLAMVAWLGFY+QAA TGKGPVQNL+DH++DP HNN +++L +
Sbjct: 284 GRLAMVAWLGFYAQAAFTGKGPVQNLVDHVSDPLHNNLIAMLQT 327

BLAST of Csa4G664570.1 vs. TAIR10
Match: AT3G27690.1 (AT3G27690.1 photosystem II light harvesting complex gene 2.3)

HSP 1 Score: 205.3 bits (521), Expect = 4.8e-53
Identity = 113/217 (52.07%), Positives = 136/217 (62.67%), Query Frame = 1

Query: 47  SSWYGEERPRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAM 106
           S WYG +RP++LGP   + P YLTGE PGDYG+D AGLS DP  F K    E++H+RWAM
Sbjct: 48  SIWYGPDRPKYLGPFSENTPSYLTGEYPGDYGWDTAGLSADPETFAKNRELEVIHSRWAM 107

Query: 107 LASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC 166
           L +LG   PEIL   G   F E +W++ G        LDYLG P  +L  +Q ++ I  C
Sbjct: 108 LGALGCTFPEILSKNGV-KFGEAVWFKAGSQIFSEGGLDYLGNP--NLIHAQSILAIWAC 167

Query: 167 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKN 226
           Q +LM   E  R  G     PLG  L  D  YPGG  FDPLNL++D  AF ELKVKE+KN
Sbjct: 168 QVVLMGFIEGYRIGG----GPLGEGL--DPLYPGGA-FDPLNLAEDPEAFSELKVKELKN 227

Query: 227 GRLAMVAWLGFYSQAALTGKGPVQNLLDHIADPFHNN 264
           GRLAM +  GF+ QA +TGKGP++NL DHIADP  NN
Sbjct: 228 GRLAMFSMFGFFVQAIVTGKGPIENLFDHIADPVANN 254

BLAST of Csa4G664570.1 vs. TAIR10
Match: AT2G05100.1 (AT2G05100.1 photosystem II light harvesting complex gene 2.1)

HSP 1 Score: 200.7 bits (509), Expect = 1.2e-51
Identity = 111/217 (51.15%), Positives = 135/217 (62.21%), Query Frame = 1

Query: 47  SSWYGEERPRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAM 106
           S WYG +RP++LGP   + P YLTGE PGDYG+D AGLS DP  F K    E++H+RWAM
Sbjct: 47  SIWYGPDRPKYLGPFSENTPSYLTGEYPGDYGWDTAGLSADPETFAKNRELEVIHSRWAM 106

Query: 107 LASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC 166
           L +LG   PEIL   G   F E +W++ G        LDYLG P  +L  +Q ++ I   
Sbjct: 107 LGALGCTFPEILSKNGV-KFGEAVWFKAGSQIFSEGGLDYLGNP--NLIHAQSILAIWAV 166

Query: 167 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKN 226
           Q +LM   E  R  G     PLG  L  D  YPGG  FDPLNL++D  AF ELKVKE+KN
Sbjct: 167 QVVLMGFIEGYRIGG----GPLGEGL--DPLYPGGA-FDPLNLAEDPEAFSELKVKELKN 226

Query: 227 GRLAMVAWLGFYSQAALTGKGPVQNLLDHIADPFHNN 264
           GRLAM +  GF+ QA +TGKGP++NL DH+ADP  NN
Sbjct: 227 GRLAMFSMFGFFVQAIVTGKGPIENLFDHLADPVANN 253

BLAST of Csa4G664570.1 vs. TAIR10
Match: AT2G05070.1 (AT2G05070.1 photosystem II light harvesting complex gene 2.2)

HSP 1 Score: 200.7 bits (509), Expect = 1.2e-51
Identity = 111/217 (51.15%), Positives = 135/217 (62.21%), Query Frame = 1

Query: 47  SSWYGEERPRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAM 106
           S WYG +RP++LGP   + P YLTGE PGDYG+D AGLS DP  F K    E++H+RWAM
Sbjct: 47  SIWYGPDRPKYLGPFSENTPSYLTGEYPGDYGWDTAGLSADPETFAKNRELEVIHSRWAM 106

Query: 107 LASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC 166
           L +LG   PEIL   G   F E +W++ G        LDYLG P  +L  +Q ++ I   
Sbjct: 107 LGALGCTFPEILSKNGV-KFGEAVWFKAGSQIFSEGGLDYLGNP--NLIHAQSILAIWAV 166

Query: 167 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKN 226
           Q +LM   E  R  G     PLG  L  D  YPGG  FDPLNL++D  AF ELKVKE+KN
Sbjct: 167 QVVLMGFIEGYRIGG----GPLGEGL--DPLYPGGA-FDPLNLAEDPEAFSELKVKELKN 226

Query: 227 GRLAMVAWLGFYSQAALTGKGPVQNLLDHIADPFHNN 264
           GRLAM +  GF+ QA +TGKGP++NL DH+ADP  NN
Sbjct: 227 GRLAMFSMFGFFVQAIVTGKGPIENLFDHLADPVANN 253

BLAST of Csa4G664570.1 vs. TAIR10
Match: AT5G54270.1 (AT5G54270.1 light-harvesting chlorophyll B-binding protein 3)

HSP 1 Score: 198.4 bits (503), Expect = 5.9e-51
Identity = 102/215 (47.44%), Positives = 130/215 (60.47%), Query Frame = 1

Query: 49  WYGEERPRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAMLA 108
           WYG +R ++LGP     P YLTGE PGDYG+D AGLS DP AF K    E++H RWAML 
Sbjct: 47  WYGPDRVKYLGPFSVQTPSYLTGEFPGDYGWDTAGLSADPEAFAKNRALEVIHGRWAMLG 106

Query: 109 SLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQA 168
           + G + PE+L  +    F EP+W++ G        LDYLG P  +L  +Q ++ +   Q 
Sbjct: 107 AFGCITPEVLQKWVRVDFKEPVWFKAGSQIFSEGGLDYLGNP--NLVHAQSILAVLGFQV 166

Query: 169 ILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGR 228
           ILM   E  R  G++ +        G+  YPGG  FDPL L+ D   F ELKVKEIKNGR
Sbjct: 167 ILMGLVEGFRINGLDGVG------EGNDLYPGGQYFDPLGLADDPVTFAELKVKEIKNGR 226

Query: 229 LAMVAWLGFYSQAALTGKGPVQNLLDHIADPFHNN 264
           LAM +  GF+ QA +TGKGP++NLLDH+ +P  NN
Sbjct: 227 LAMFSMFGFFVQAIVTGKGPLENLLDHLDNPVANN 253

BLAST of Csa4G664570.1 vs. NCBI nr
Match: gi|700200392|gb|KGN55550.1| (hypothetical protein Csa_4G664570 [Cucumis sativus])

HSP 1 Score: 576.2 bits (1484), Expect = 3.0e-161
Identity = 274/274 (100.00%), Positives = 274/274 (100.00%), Query Frame = 1

Query: 1   MKFLQLFGWEQAPNLDCRNGGCEGIVGGVIHQIEHAVDEHFSDNTFSSWYGEERPRWLGP 60
           MKFLQLFGWEQAPNLDCRNGGCEGIVGGVIHQIEHAVDEHFSDNTFSSWYGEERPRWLGP
Sbjct: 1   MKFLQLFGWEQAPNLDCRNGGCEGIVGGVIHQIEHAVDEHFSDNTFSSWYGEERPRWLGP 60

Query: 61  LPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAMLASLGALVPEILDI 120
           LPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAMLASLGALVPEILDI
Sbjct: 61  LPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAMLASLGALVPEILDI 120

Query: 121 FGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGPEYARYC 180
           FGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGPEYARYC
Sbjct: 121 FGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGPEYARYC 180

Query: 181 GIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAWLGFYSQ 240
           GIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAWLGFYSQ
Sbjct: 181 GIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAWLGFYSQ 240

Query: 241 AALTGKGPVQNLLDHIADPFHNNFLSLLNSSFSR 275
           AALTGKGPVQNLLDHIADPFHNNFLSLLNSSFSR
Sbjct: 241 AALTGKGPVQNLLDHIADPFHNNFLSLLNSSFSR 274

BLAST of Csa4G664570.1 vs. NCBI nr
Match: gi|449455655|ref|XP_004145567.1| (PREDICTED: chlorophyll a-b binding protein of LHCII type 1-like isoform X1 [Cucumis sativus])

HSP 1 Score: 476.5 bits (1225), Expect = 3.2e-131
Identity = 228/228 (100.00%), Positives = 228/228 (100.00%), Query Frame = 1

Query: 47  SSWYGEERPRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAM 106
           SSWYGEERPRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAM
Sbjct: 111 SSWYGEERPRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAM 170

Query: 107 LASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC 166
           LASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC
Sbjct: 171 LASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC 230

Query: 167 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKN 226
           QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKN
Sbjct: 231 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKN 290

Query: 227 GRLAMVAWLGFYSQAALTGKGPVQNLLDHIADPFHNNFLSLLNSSFSR 275
           GRLAMVAWLGFYSQAALTGKGPVQNLLDHIADPFHNNFLSLLNSSFSR
Sbjct: 291 GRLAMVAWLGFYSQAALTGKGPVQNLLDHIADPFHNNFLSLLNSSFSR 338

BLAST of Csa4G664570.1 vs. NCBI nr
Match: gi|659105211|ref|XP_008453032.1| (PREDICTED: chlorophyll a-b binding protein of LHCII type 1 isoform X1 [Cucumis melo])

HSP 1 Score: 468.4 bits (1204), Expect = 8.7e-129
Identity = 223/225 (99.11%), Positives = 225/225 (100.00%), Query Frame = 1

Query: 47  SSWYGEERPRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAM 106
           SSWYGEERPRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQK+FNFEILHARWAM
Sbjct: 120 SSWYGEERPRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKYFNFEILHARWAM 179

Query: 107 LASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC 166
           LASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC
Sbjct: 180 LASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC 239

Query: 167 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKN 226
           QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKN
Sbjct: 240 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKN 299

Query: 227 GRLAMVAWLGFYSQAALTGKGPVQNLLDHIADPFHNNFLSLLNSS 272
           GRLAMVAWLGFYSQAALTGKGPVQNLLDHI+DPFHNNFLSLLNSS
Sbjct: 300 GRLAMVAWLGFYSQAALTGKGPVQNLLDHISDPFHNNFLSLLNSS 344

BLAST of Csa4G664570.1 vs. NCBI nr
Match: gi|595847551|ref|XP_007209349.1| (hypothetical protein PRUPE_ppa008546mg [Prunus persica])

HSP 1 Score: 429.9 bits (1104), Expect = 3.4e-117
Identity = 197/222 (88.74%), Positives = 211/222 (95.05%), Query Frame = 1

Query: 47  SSWYGEERPRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAM 106
           S WYGEERPRWLGP+PYDYP YLTGELPGDYGFD+ GLS DPVAFQK+FNFEILHARWAM
Sbjct: 103 SLWYGEERPRWLGPIPYDYPAYLTGELPGDYGFDVVGLSRDPVAFQKYFNFEILHARWAM 162

Query: 107 LASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC 166
           LA+LGAL+PEILD+ GAFHF EP+WWRVGYSKLKGDTLDYLGIPGLH+AGSQGVIVIAIC
Sbjct: 163 LAALGALIPEILDLLGAFHFVEPVWWRVGYSKLKGDTLDYLGIPGLHVAGSQGVIVIAIC 222

Query: 167 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKN 226
           QA+LMVGPEYARYCGIEALEPLGIYLPGD+NYPGG LFDPLNLS+D  AFEELKVKEIKN
Sbjct: 223 QALLMVGPEYARYCGIEALEPLGIYLPGDVNYPGGELFDPLNLSRDPVAFEELKVKEIKN 282

Query: 227 GRLAMVAWLGFYSQAALTGKGPVQNLLDHIADPFHNNFLSLL 269
           GRLAMVAWLGFYSQAALTGKGPVQNLL+HI+DP HNN LS+L
Sbjct: 283 GRLAMVAWLGFYSQAALTGKGPVQNLLEHISDPLHNNVLSVL 324

BLAST of Csa4G664570.1 vs. NCBI nr
Match: gi|297745621|emb|CBI40786.3| (unnamed protein product [Vitis vinifera])

HSP 1 Score: 429.1 bits (1102), Expect = 5.8e-117
Identity = 197/224 (87.95%), Positives = 209/224 (93.30%), Query Frame = 1

Query: 47  SSWYGEERPRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAM 106
           S WYGEERPRWLGP+PYDYP YLTGELPGDYGFDIAGL +DPVAFQK+FNFEILHARWAM
Sbjct: 107 SLWYGEERPRWLGPIPYDYPAYLTGELPGDYGFDIAGLGKDPVAFQKYFNFEILHARWAM 166

Query: 107 LASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC 166
           LA+LGAL+PE+LD+ GAFHF EP+WWRVGYSKLKGDTLDYLGIPG H AGSQGVIVIAIC
Sbjct: 167 LAALGALLPELLDLLGAFHFVEPVWWRVGYSKLKGDTLDYLGIPGFHFAGSQGVIVIAIC 226

Query: 167 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKN 226
           QA+LMVGPEYARYCGIEALEPLGIYLPGDINYPGG LFDPLNLSKD  AFEELKVKEIKN
Sbjct: 227 QALLMVGPEYARYCGIEALEPLGIYLPGDINYPGGALFDPLNLSKDPVAFEELKVKEIKN 286

Query: 227 GRLAMVAWLGFYSQAALTGKGPVQNLLDHIADPFHNNFLSLLNS 271
           GRLAMVAWLGFY QAA TGKGPVQNLLDH+ADPFHNN LS+  +
Sbjct: 287 GRLAMVAWLGFYIQAAATGKGPVQNLLDHLADPFHNNLLSIFKA 330

The following BLAST results are available for this feature:
Match Name  E-value  Identity  Description
CB2_CHLMO   3.3e-51  48.84     Chlorophyll a-b binding protein of LHCII type I, chloroplastic OS=Chlamydomonas ...
CB25_NICPL  4.3e-51  50.69     Chlorophyll a-b binding protein E, chloroplastic OS=Nicotiana plumbaginifolia GN...
CB21_GOSHI  7.3e-51  51.61     Chlorophyll a-b binding protein 151, chloroplastic OS=Gossypium hirsutum GN=CAB-...
CB24_SOLLC  9.5e-51  51.61     Chlorophyll a-b binding protein 4, chloroplastic OS=Solanum lycopersicum GN=CAB4...
CB23_NICPL  1.6e-50  50.69     Chlorophyll a-b binding protein C, chloroplastic OS=Nicotiana plumbaginifolia GN...

Match Name        E-value   Identity  Description
A0A0A0L0U5_CUCSA  2.1e-161  100.00    Chlorophyll a-b binding protein, chloroplastic OS=Cucumis sativus GN=Csa_4G66457...
M5WMP2_PRUPE      2.4e-117  88.74     Chlorophyll a-b binding protein, chloroplastic OS=Prunus persica GN=PRUPE_ppa008...
D7UDH1_VITVI      4.1e-117  87.95     Chlorophyll a-b binding protein, chloroplastic OS=Vitis vinifera GN=VIT_18s0122g...
A0A067KWW2_JATCU  6.9e-117  88.74     Chlorophyll a-b binding protein, chloroplastic OS=Jatropha curcas GN=JCGZ_08016 ...
A0A059BPY7_EUCGR  1.0e-115  87.39     Chlorophyll a-b binding protein, chloroplastic OS=Eucalyptus grandis GN=EUGRSUZ_...

Match Name   E-value   Identity  Description
AT1G76570.1  1.9e-113  80.36     Chlorophyll A-B binding family protein
AT3G27690.1  4.8e-53   52.07     photosystem II light harvesting complex gene 2.3
AT2G05100.1  1.2e-51   51.15     photosystem II light harvesting complex gene 2.1
AT2G05070.1  1.2e-51   51.15     photosystem II light harvesting complex gene 2.2
AT5G54270.1  5.9e-51   47.44     light-harvesting chlorophyll B-binding protein 3

Match Name                        E-value   Identity  Description
gi|700200392|gb|KGN55550.1|       3.0e-161  100.00    hypothetical protein Csa_4G664570 [Cucumis sativus]
gi|449455655|ref|XP_004145567.1|  3.2e-131  100.00    PREDICTED: chlorophyll a-b binding protein of LHCII type 1-like isoform X1 [Cucu...
gi|659105211|ref|XP_008453032.1|  8.7e-129  99.11     PREDICTED: chlorophyll a-b binding protein of LHCII type 1 isoform X1 [Cucumis m...
gi|595847551|ref|XP_007209349.1|  3.4e-117  88.74     hypothetical protein PRUPE_ppa008546mg [Prunus persica]
gi|297745621|emb|CBI40786.3|      5.8e-117  87.95     unnamed protein product [Vitis vinifera]

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term       Definition
IPR001344  Chloro_AB-bd_pln
IPR022796  Chloroa_b-bind
IPR023329  Chlorophyll_a/b-bd_dom_sf
Vocabulary: Biological Process
Term        Definition
GO:0009765  photosynthesis, light harvesting
Vocabulary: Cellular Component
Term        Definition
GO:0016020  membrane
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009765 photosynthesis, light harvesting
biological_process GO:0018298 protein-chromophore linkage
cellular_component GO:0016020 membrane
cellular_component GO:0009507 chloroplast
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0009522 photosystem I
cellular_component GO:0009523 photosystem II
molecular_function GO:0016168 chlorophyll binding
molecular_function GO:0046872 metal ion binding

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name  Type
Csa4G664570   Csa4G664570  gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name   Unique Name            Type
Csa4G664570.1  Csa4G664570.1-protein  polypeptide


The following CDS feature(s) are a part of this mRNA:

Feature Name        Unique Name         Type
Csa4G664570.1.cds1  Csa4G664570.1.cds1  CDS
Csa4G664570.1.cds2  Csa4G664570.1.cds2  CDS
Csa4G664570.1.cds3  Csa4G664570.1.cds3  CDS
Csa4G664570.1.cds4  Csa4G664570.1.cds4  CDS
Csa4G664570.1.cds5  Csa4G664570.1.cds5  CDS
Csa4G664570.1.cds6  Csa4G664570.1.cds6  CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
Csa4G664570.1.utr3p1  Csa4G664570.1.utr3p1  three_prime_UTR


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term   IPR Description                         Source   Source Term         Source Description                      Alignment
IPR001344  Chlorophyll A-B binding protein, plant  PANTHER  PTHR21649           CHLOROPHYLL A/B BINDING PROTEIN         coord: 47..274; score: 9.8E
IPR022796  Chlorophyll A-B binding protein         PFAM     PF00504             Chloroa_b-bind                          coord: 66..241; score: 2.0
IPR023329  Chlorophyll a/b binding protein domain  GENE3D   G3DSA:1.10.3460.10                                          coord: 63..266; score: 4.6
IPR023329  Chlorophyll a/b binding protein domain  unknown  SSF103511           Chlorophyll a-b binding protein         coord: 48..270; score: 2.62
None       No IPR available                        PANTHER  PTHR21649:SF24      CHLOROPHYLL A-B BINDING FAMILY PROTEIN  coord: 47..274; score: 9.8E