CmaCh09G000120 (gene) Cucurbita maxima (Rimu)

Name: CmaCh09G000120
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Chlorophyll a-b binding protein 4, chloroplastic
Location: Cma_Chr09 : 81081 .. 86559 (-)
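The location field can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this); the function name `parse_location` and the returned keys are illustrative and not part of the database.

```python
import re

def parse_location(text):
    """Split a location such as 'Cma_Chr09 : 81081 .. 86559 (-)' into fields.

    Assumes 1-based, inclusive coordinates, so length = end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "seqid": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        "length": end - start + 1,  # valid only under the inclusive assumption
    }

location = parse_location("Cma_Chr09 : 81081 .. 86559 (-)")
print(location["seqid"], location["strand"], location["length"])
```

Under that assumption the feature spans 5479 positions on the minus strand of Cma_Chr09.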
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exon, five_prime_UTR, CDS, three_prime_UTR
TTTGACCAAAAAAAACATCTGATGTTTAAACGCGCCATAAAAGTGCGTTATAGCTCAGTGATGACTCTAATAGCATAGATGGTAATGGCGCTATTTCATCCACACACTTCGCCTTCTGCAGTTTCATCTTCCTCCGCCTCATTCTATGGGAGTTCTCTTCAACAACTCCGTTCGCGCACCATAGCCCTCAATCTTCGAAACAGCCCCTCCATCCCTCATCGTTTCACTTGCACAGCTTCATGGCAGGAGGTTTGTAAGAAAACAAGCTAGCTGTTTCCATTTTTCGCTGATTGCTGAGTCTTTTTTATTTTCTTCATCATGCTTTAGCTTGCGGGGATTTTGATATTCTCCGCAATTCCTTTCACGGCCGTGAAAGCCATCGCCAATAGCCCATTCGGAGAGTCGCTCCAGAGGCAATTGGAAAAGAAAAAAAATGCTGCGGTCGCCAAGTCTTCGAAGTTCAAGGCTCTTGCTGAGCAGGCTAGAAAAGATAGGTAATCGGGTAAGCTTATTTATTGATGCACGCGTGATTTGGTGACGCCAACGCCAAGTTGGTCTACTCGTATTTTTAGAATAATTACATACACGTTGAATTAAAGTGTATTTAGAACCCATAGAACTTCATGCATTGTGGTGTTATCAGTCATGTTCAAAGTGAGTAGATTGTGATCAAGACAGTGATTCTTAAAGAAGACGTAGAAACCAAAGGGTTCATGGTTAAAACTTATTGGAATCGTTTGGAACCTGGAATATCTTTAAGCTCAAACAGTTGAGCTTCACATTGCCATAAAGTACACAAACTAGCGTCTCTTCTGGGCAAAGCAATCGAGGCATTATCAACTGGTTTCATTGTCATGGATACTTGTTCTGTTGGAACGATATGACTAGTAGCGGCAGAGGCAAGTATCCAATATTTCTTTGAATTAAGCAATAAATTTGTAGCAACGAAATAACAAGTACCTGAAACATGAACTTGGGCACTTTAACCCCTGTTGCACCATTCTTTATCAACTAGCTGACAATTAAACTTAGTGATTCATTGCTGGCAATGTGTTTGGATTAAGAAACTGGATATTGCTCCCATAATTGCTCTGGCCTGAAGATAGTAGTCCGTCACAGCCTCCTCCATCCCCACCCCATTCTGCCCAGGCCTCGACCATAGCCTTCAGATATTCTCCTATTTGCATCATTCTCTCCATTCATCTTTCTTGCTCTCCTCTAATCCTTGTGCTAAATCTATGTTTCATTGCTGCCACCATTGGTCTTCGATATGCTAGATTAAAGATATCTGATTCATCTAAAAGGTTTTCTCACAAATTTTCTCAGAATTTACGGCTAGTATACCAACTTTTATTATTATGATGAAAGTGCCATACAACAAAGTATTAATATATCCAAATGTCTAGCTAAGTCGTAACAGAACGAGGTACTTGACTTGGACCAAGTAAAACAAACTAACAACATTTAGTTACTAAAAAGTAAAAATGTTTGCTACTGTATTCGGTTGCCAAGGAAACTTGTAAGATATTTAAATCTTTTGGCTATCATGACTTGAACTCATTTCCTTTAAATTCCTTTTTTTATTTATATTGGTTTCTATTGACTACCAGGCCAGTTCCTGGTGGAATGTATGCCACATAAATGTTCTTCTAGTTAAGTTATGGTGAATATTCCATTTCAAATTTTGGATATCTATTTAATTTTTGAAGTTTCATTGGGCCAGTAGTATTTCTTAAATATTTATTTCATTTTAATTTCTTTTTAGATTTCCTACGTTCCTCTTGAATTTTCCATCATTTTTATGTATCCTGGTTCCGGACATAAATAATATGTAACGTTCGCTAAAGAATCAACAAATTATGTCCAACTAATAATTTTGAAGGAATGACATGGACATTTTTTAAAAAGGGTTTCAAATTTTCATGCTATTAAAGGATTTCGGGAATTTGCTATCTTTATTTCATGTAGATTATTCTCTAGTATCTTTCATGCTTTGCAAA
TCTTCTTCATTCTTGAAATAGATTACATGGCTGCGATCTTTTGCTTGTACTAATTTACATTGGATTGGATGTGATCTTATGAATTATTGTTTATCCAACTATGAATGTTTTGACTGCCAGAAAACATATAAACGAATGGTACAATTGAATCAAATTGCATCGTGTTGTAAATACTATTCTATTGGTGAATAATTGCAGCTCGTGGTATGGAGAAAAGCGTCCTCGCTGGCTTGGTCCACTACCATACAATTATCCAAAATATCTGACAGGTGAACTGCCGGGTGATTATGGATTTGATATTGCAGGTCTGAGTGAGGATCCAGTGGCTTTTCGGAAGTACTTCAAGTATATTGAGCTTCTTTTAGCCTAAAATTTTCTTCAGCAGTTTCTTTTTTCTCCAATATCTGGAATCGTATGTAGATGTATTAGTGCACAGCATGAGAGAATATGATTCTTCAGCAGTTTCTTTTTTTCTCCAATATCTGGAATCATATGTAGATGTATTAGGGCACACCATGAGAGAATATGATTCTCAGTTTCTTTTTTCTCCAATATCTGGAATCATATGTAGATGTATTAGTGCACACCATGAGAGAATATGATTATTTGACTTGTGAGTGGCCGGTAGACTAAGGGACGACATGATGATTATCAATGCATATAAGCTGTGAGAGTAATGAAATGATCCGGCATAACATAATATAGACATCACCTTAAAACAGTCCCCGAATGATTATCCACATGTTTGTTTGCTTGATACAAAACTTGCGTACTGCACATCTTTCTATCCAACTTTGGAGATATTTGAGCTGTTCTGTACGTAGTATAACTTTACAGTCCTAAATTACCTTAGATAAGCAACTCTTATTGAATGTATACATTTCTTTTTTAAACCTCCATCAATAATTTTTATGTATGTATCATATTTCAGCTTTGAAATACTGCATGCTCGTTGGGCTATGTTAGCGTCCCTTGGTGCTTTAGTTCCAGAGATCTTAGATATTTTTGGAGCTTTTCATTTCACTGAACCCATCTGGTGGCGAGTGGGATATTCAAAACTTAAGGTTTGTTTTCCATCTCTTGGTTAACCATTTTTCTGGATATCCATCATATAGGATCTTTCCGTGTGTCTGCATGGTTTGTTTTTATATTTTTACGCTTGTTTGGTTATGTTTTTTTTCACTTTCAAATTACTGTTTAACTTATTGCTTCTGTCTAATCATGTTTTATTTTAAAGTCTAGTTTTTCATGTCATCCTTTATTAATGAAATCAAATAATGAGTTATGACTTCATGTCTTTCAGTGCCGTGATAAGCTTGAGTTCGGTTGCTGTCTTCTATTTCAATTACTCTTTCGCTACTTTGTTTCAATAAATTAGGGACAAACATTTTCAATTACAATATTTCATATTCCCTTCTATAAACTCCATGGGTGTGCGTAGCTCCCCATTCATTCATCCCTACTTTCCAGCACTTGTAATTGACTCTATCATGTTACTGTTTGTGTTCTCGTTCCACCAGGGAGATACACTCGACTATCTTGGAATTCCAGGGCTTCATTTAGCTGGAAGCCAAGGAGTGATCGTCATTGCAATTTGCCAAGCTATTTTGATGGTAACCTATATCAAACAATCTTATGTTCTGCCTTCTCACATTTTCTTTTTGTCATATCTTTTGGCACTTCAGTTAAATGGTTGAGGGAAGCATGACATTATAACGTGTGTTAACTTGAGTTGGTTAACCTCAATCAAATTTCACCCCATTCCCACCAAACACAATAAATGGCTTGGTTCCCATTGCAGTTTCTACAACCAGCCTTGGCCCTTGATTAGCTTGTTTAGGAATTTGGAATTAAAATTTTCCTTTCGGAGATTACACGTTAGAAAAAAAAATTTAAATAGATTTTTGTTTTTTGTTTTAACAAGTGTAACTTTTCATTGGTGTATGAAAAGGAATAAAAGAGAATTAAAATATATGAACCTCCGAATGGGGAGATAGGAA
AAGCATAACGCCCAAAGAAAAAAGAAAAAAGAAAAAAGAAAAAGATTACAAGAACCTGAAAAGAAACCAAAACAAACCAAACAAAAAAGAGGAACTAAAGAATTGCCAACGATAACAAAAAAAAAAAATAAATAATAAATAAATAAATAAATGCAAGACTAGTAGATCAAAAAGCTCTTAGAGAGCTTACAAACAAAAGCTAAACATTAATGTGAAGCTCCTCAAACCACCGCAGCTTCAAATAGAAACCGCACCAACCAGCTTGAAATATCACACCAGAAAAGGAAGAAAAGCAACAACTGCCATGGAGAAATGCACGAACACCACATGGCCGAACAACCAAAAGCTCTATCAGCAAAACTCAACTGCAAAGAAAACCAAAAGAAGCAGTAACTTCACCCAAGAACTAAAGGCCGAAGAGCAAATCAAAAAACATCAGCCAAAGACAAGATCATTCCAAGGAGAAAGAACGGAAGTACAGGCAAAAACCCCGTTCTCAAAGAAATAATCTCCATAGATTTAATGCCCCTTTGTTTATAGAGGAAGAAAGTTGATGAGAGGGGAAGATGGAGAATCATCTGACTGTGTTCTTTGAAGGGGAATTAATTTTGTCTTCTTGAAAAATAGAACAAAGAGTTTCTTCAAAACCTTCATCATCAAGTGCCTCATTAGAAACCTTTTCATGCCACTAACAATTTGAAAATTATACTCCAAGTCATAACTACTAAGACCTATCTGACTCCTCATTTCCTCCACAAAAACTAAACTTTTGATGATACCATTTGTTGAGTTAATATTAGAAAGGATGTTATTAACTAAAGATGTTGAAGAATTGAGATTGTCTATTGTTAAGAGGGCATATTAAACTATTATATGTCTGATTGTTCTTACAACCGTATGATGTAAGTTAAAATGCAAGTAGGTTGGACCTGAGTATGCAAGATATTGTGGCATAGAGGCATTGGAGCCTCTGGGAATATATCTGCCGGGGGATATAAATTATCCAGGTGGTGTGTTGTTTGATCCCTTGAACCTGTCCGAGGATGCTGCAGCTTTTGAGGAACTGAAAGTGAAAGAGATTAAAAATGGGCGGTTAGCCATGGTTGCTTGGCTAGGATTTTACAGTCAAGCTGCTTTGACGGGTAAGGGACCAATCCAAAACCTTCTTGAACACGTTTCAGACCCTTTCCACAATAACTTTCTTTCCCTGCTCAACTCATCATGAAGTAGCTTATAATTGGAATCTGGATTATAAAGTATAGTTTCCTCAAATCTATATAGTGTTCATAAGGGATGGACAAACAATCATGCTGTTCTAGATGGGAGATTGTTGCTTGATAAGTTAATACTTAATGGTGTGGTCATTATATCTATCTGTCGTACATCAAGCATCTTTGAGAAGCTAAAGCTTGGTAAAATTTGTTTTTGAAACAAATTTTACTCAACTTTAATAAAGTATCTTGAACCTTAAAAATGTTA

mRNA sequence

TTTGACCAAAAAAAACATCTGATGTTTAAACGCGCCATAAAAGTGCGTTATAGCTCAGTGATGACTCTAATAGCATAGATGGTAATGGCGCTATTTCATCCACACACTTCGCCTTCTGCAGTTTCATCTTCCTCCGCCTCATTCTATGGGAGTTCTCTTCAACAACTCCGTTCGCGCACCATAGCCCTCAATCTTCGAAACAGCCCCTCCATCCCTCATCGTTTCACTTGCACAGCTTCATGGCAGGAGCTTGCGGGGATTTTGATATTCTCCGCAATTCCTTTCACGGCCGTGAAAGCCATCGCCAATAGCCCATTCGGAGAGTCGCTCCAGAGGCAATTGGAAAAGAAAAAAAATGCTGCGGTCGCCAAGTCTTCGAAGTTCAAGGCTCTTGCTGAGCAGGCTAGAAAAGATAGCTCGTGGTATGGAGAAAAGCGTCCTCGCTGGCTTGGTCCACTACCATACAATTATCCAAAATATCTGACAGGTGAACTGCCGGGTGATTATGGATTTGATATTGCAGGTCTGAGTGAGGATCCAGTGGCTTTTCGGAAGTACTTCAACTTTGAAATACTGCATGCTCGTTGGGCTATGTTAGCGTCCCTTGGTGCTTTAGTTCCAGAGATCTTAGATATTTTTGGAGCTTTTCATTTCACTGAACCCATCTGGTGGCGAGTGGGATATTCAAAACTTAAGGGAGATACACTCGACTATCTTGGAATTCCAGGGCTTCATTTAGCTGGAAGCCAAGGAGTGATCGTCATTGCAATTTGCCAAGCTATTTTGATGGTTGGACCTGAGTATGCAAGATATTGTGGCATAGAGGCATTGGAGCCTCTGGGAATATATCTGCCGGGGGATATAAATTATCCAGGTGGTGTGTTGTTTGATCCCTTGAACCTGTCCGAGGATGCTGCAGCTTTTGAGGAACTGAAAGTGAAAGAGATTAAAAATGGGCGGTTAGCCATGGTTGCTTGGCTAGGATTTTACAGTCAAGCTGCTTTGACGGGTAAGGGACCAATCCAAAACCTTCTTGAACACGTTTCAGACCCTTTCCACAATAACTTTCTTTCCCTGCTCAACTCATCATGAAGTAGCTTATAATTGGAATCTGGATTATAAAGTATAGTTTCCTCAAATCTATATAGTGTTCATAAGGGATGGACAAACAATCATGCTGTTCTAGATGGGAGATTGTTGCTTGATAAGTTAATACTTAATGGTGTGGTCATTATATCTATCTGTCGTACATCAAGCATCTTTGAGAAGCTAAAGCTTGGTAAAATTTGTTTTTGAAACAAATTTTACTCAACTTTAATAAAGTATCTTGAACCTTAAAAATGTTA

Coding sequence (CDS)

ATGGTAATGGCGCTATTTCATCCACACACTTCGCCTTCTGCAGTTTCATCTTCCTCCGCCTCATTCTATGGGAGTTCTCTTCAACAACTCCGTTCGCGCACCATAGCCCTCAATCTTCGAAACAGCCCCTCCATCCCTCATCGTTTCACTTGCACAGCTTCATGGCAGGAGCTTGCGGGGATTTTGATATTCTCCGCAATTCCTTTCACGGCCGTGAAAGCCATCGCCAATAGCCCATTCGGAGAGTCGCTCCAGAGGCAATTGGAAAAGAAAAAAAATGCTGCGGTCGCCAAGTCTTCGAAGTTCAAGGCTCTTGCTGAGCAGGCTAGAAAAGATAGCTCGTGGTATGGAGAAAAGCGTCCTCGCTGGCTTGGTCCACTACCATACAATTATCCAAAATATCTGACAGGTGAACTGCCGGGTGATTATGGATTTGATATTGCAGGTCTGAGTGAGGATCCAGTGGCTTTTCGGAAGTACTTCAACTTTGAAATACTGCATGCTCGTTGGGCTATGTTAGCGTCCCTTGGTGCTTTAGTTCCAGAGATCTTAGATATTTTTGGAGCTTTTCATTTCACTGAACCCATCTGGTGGCGAGTGGGATATTCAAAACTTAAGGGAGATACACTCGACTATCTTGGAATTCCAGGGCTTCATTTAGCTGGAAGCCAAGGAGTGATCGTCATTGCAATTTGCCAAGCTATTTTGATGGTTGGACCTGAGTATGCAAGATATTGTGGCATAGAGGCATTGGAGCCTCTGGGAATATATCTGCCGGGGGATATAAATTATCCAGGTGGTGTGTTGTTTGATCCCTTGAACCTGTCCGAGGATGCTGCAGCTTTTGAGGAACTGAAAGTGAAAGAGATTAAAAATGGGCGGTTAGCCATGGTTGCTTGGCTAGGATTTTACAGTCAAGCTGCTTTGACGGGTAAGGGACCAATCCAAAACCTTCTTGAACACGTTTCAGACCCTTTCCACAATAACTTTCTTTCCCTGCTCAACTCATCATGA

Protein sequence

MVMALFHPHTSPSAVSSSSASFYGSSLQQLRSRTIALNLRNSPSIPHRFTCTASWQELAGILIFSAIPFTAVKAIANSPFGESLQRQLEKKKNAAVAKSSKFKALAEQARKDSSWYGEKRPRWLGPLPYNYPKYLTGELPGDYGFDIAGLSEDPVAFRKYFNFEILHARWAMLASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSEDAAAFEELKVKEIKNGRLAMVAWLGFYSQAALTGKGPIQNLLEHVSDPFHNNFLSLLNSS
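
The protein sequence can be checked against the coding sequence by translating the CDS codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and are not taken from the database.

```python
# Standard genetic code, laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds, to_stop=True):
    """Translate a coding sequence in frame 1.

    Codons containing anything other than A, C, G, T become 'X'.
    With to_stop=True, translation ends before the first stop codon.
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*" and to_stop:
            break
        protein.append(amino_acid)
    return "".join(protein)

# First ten codons of the CDS listed above.
print(translate("ATGGTAATGGCGCTATTTCATCCACACACT"))  # MVMALFHPHT
```

Passing the full CDS string to `translate` should give a sequence that can be compared directly with the protein listed above.
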
BLAST of CmaCh09G000120 vs. Swiss-Prot
Match: CB2_CHLMO (Chlorophyll a-b binding protein of LHCII type I, chloroplastic OS=Chlamydomonas moewusii PE=2 SV=1)

HSP 1 Score: 209.5 bits (532), Expect = 5.6e-53
Identity = 116/247 (46.96%), Positives = 147/247 (59.51%), Query Frame = 1

Query: 83  SLQRQLEKKKNAAVAKSSKFKALAEQARKDSSWYGEKRPRWLGPLPYNYPKYLTGELPGD 142
           +L  ++E  K AA  K +K  A A+ A  +  WYG  R +WLGP   N P YLTGE PGD
Sbjct: 10  ALTVKVEAAKKAAGTKQTK-AAPAKSAGIE--WYGPDRAKWLGPFSTNTPAYLTGEFPGD 69

Query: 143 YGFDIAGLSEDPVAFRKYFNFEILHARWAMLASLGALVPEILDIFGAFHFTEPIWWRVGY 202
           YG+D AGLS DP  F+KY   E++HARWA+L +LG L PE+L  +    F EP+W++ G 
Sbjct: 70  YGWDTAGLSADPETFKKYRELEVIHARWALLGALGILTPELLSTYAGVKFGEPVWFKAGA 129

Query: 203 SKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGPEYARYCGIEALEPLGIYLPGDI 262
                  LDYLG P   L  +Q ++     Q +LM   E  R  G  A E L    PG+ 
Sbjct: 130 QIFSEGGLDYLGSPA--LIHAQNIVATLAVQVVLMGLIEGYRVNGGPAGEGLDPLYPGE- 189

Query: 263 NYPGGVLFDPLNLSEDAAAFEELKVKEIKNGRLAMVAWLGFYSQAALTGKGPIQNLLEHV 322
                  FDPL L++D   F ELKVKEIKNGRLAM +  GF+ QA +TGKGPIQNL +H+
Sbjct: 190 ------SFDPLGLADDPDTFAELKVKEIKNGRLAMFSCFGFFVQAIVTGKGPIQNLADHL 244

Query: 323 SDPFHNN 330
           +DP  NN
Sbjct: 250 ADPGTNN 244
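
The percentages on each HSP summary line are the matching-column counts divided by the alignment length. The sketch below recomputes them from the summary text; the function name `hsp_percentages` is hypothetical, and the pattern is written to accept either spelling of the positives label found in reports of this kind.

```python
import re

def hsp_percentages(summary_line):
    """Recompute identity and positives percentages from an HSP summary line.

    Returns a (identity_pct, positives_pct) tuple rounded to two decimals.
    """
    identity = re.search(r"Identity = (\d+)/(\d+)", summary_line)
    # 'Posi?tives' matches both 'Positives' and the variant 'Postives'.
    positives = re.search(r"Posi?tives = (\d+)/(\d+)", summary_line)
    if identity is None or positives is None:
        raise ValueError("line does not look like an HSP summary")

    def percent(match):
        matched, aligned = int(match.group(1)), int(match.group(2))
        return round(100.0 * matched / aligned, 2)

    return percent(identity), percent(positives)

line = "Identity = 116/247 (46.96%), Positives = 147/247 (59.51%), Query Frame = 1"
print(hsp_percentages(line))  # (46.96, 59.51)
```
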

BLAST of CmaCh09G000120 vs. Swiss-Prot
Match: CB24_SOLLC (Chlorophyll a-b binding protein 4, chloroplastic OS=Solanum lycopersicum GN=CAB4 PE=2 SV=1)

HSP 1 Score: 208.0 bits (528), Expect = 1.6e-52
Identity = 111/217 (51.15%), Positives = 136/217 (62.67%), Query Frame = 1

Query: 113 SSWYGEKRPRWLGPLPYNYPKYLTGELPGDYGFDIAGLSEDPVAFRKYFNFEILHARWAM 172
           S WYGE RP++LGP     P YLTGE PGDYG+D AGLS DP  F +    E++H RWAM
Sbjct: 47  SIWYGEDRPKYLGPFSEQTPSYLTGEFPGDYGWDTAGLSADPETFARNRELEVIHCRWAM 106

Query: 173 LASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC 232
           L +LG + PEIL   G   F E +W++ G        LDYLG P  +L  +Q ++ I  C
Sbjct: 107 LGALGCVFPEILSKNGV-KFGEAVWFKAGSQIFSEGGLDYLGNP--NLVHAQSILAIWAC 166

Query: 233 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSEDAAAFEELKVKEIKN 292
           Q +LM   E  R  G     PLG  L  D  YPGG  FDPL L++D  AF ELKVKEIKN
Sbjct: 167 QVVLMGFVEGYRVGG----GPLGEGL--DKIYPGGA-FDPLGLADDPEAFAELKVKEIKN 226

Query: 293 GRLAMVAWLGFYSQAALTGKGPIQNLLEHVSDPFHNN 330
           GRLAM +  GF+ QA +TGKGPI+NL +H++DP  NN
Sbjct: 227 GRLAMFSMFGFFVQAIVTGKGPIENLSDHINDPVANN 253

BLAST of CmaCh09G000120 vs. Swiss-Prot
Match: CB2B_SOLLC (Chlorophyll a-b binding protein 1B, chloroplastic OS=Solanum lycopersicum GN=CAB1B PE=3 SV=1)

HSP 1 Score: 206.8 bits (525), Expect = 3.6e-52
Identity = 116/260 (44.62%), Positives = 155/260 (59.62%), Query Frame = 1

Query: 71  AVKAIANSPF-GESLQRQLEKKKNAAVAKSSKFKALAEQARKDSSWYGEKRPRWLGPLPY 130
           A  A+++  F G++++      + +   + +  KA+A+ A   S WYG  R ++LGP   
Sbjct: 4   ATMALSSPSFAGQAVKLSPSASEISGNGRITMRKAVAKSAPSSSPWYGPDRVKYLGPFSG 63

Query: 131 NYPKYLTGELPGDYGFDIAGLSEDPVAFRKYFNFEILHARWAMLASLGALVPEILDIFGA 190
             P YLTGE PGDYG+D AGLS DP  F K    E++H RWAML +LG + PE+L   G 
Sbjct: 64  ESPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHCRWAMLGALGCVFPELLARNGV 123

Query: 191 FHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGPEYARYCGIE 250
             F E +W++ G        LDYLG P   L  +Q ++ I  CQ +LM   E  R  G  
Sbjct: 124 -KFGEAVWFKAGSQIFSEGGLDYLGNPS--LVHAQSILAIWACQVVLMGAVEGYRIAG-- 183

Query: 251 ALEPLGIYLPGDINYPGGVLFDPLNLSEDAAAFEELKVKEIKNGRLAMVAWLGFYSQAAL 310
              PLG  +  D  YPGG  FDPL L+ED  AF ELKVKEIKNGRLAM +  GF+ QA +
Sbjct: 184 --GPLGEVV--DPLYPGG-SFDPLGLAEDPEAFAELKVKEIKNGRLAMFSMFGFFVQAIV 243

Query: 311 TGKGPIQNLLEHVSDPFHNN 330
           TGKGP++NL +H++DP +NN
Sbjct: 244 TGKGPLENLADHLADPVNNN 253

BLAST of CmaCh09G000120 vs. Swiss-Prot
Match: CB25_NICPL (Chlorophyll a-b binding protein E, chloroplastic OS=Nicotiana plumbaginifolia GN=CABE PE=3 SV=1)

HSP 1 Score: 206.1 bits (523), Expect = 6.2e-52
Identity = 110/224 (49.11%), Positives = 137/224 (61.16%), Query Frame = 1

Query: 106 AEQARKDSSWYGEKRPRWLGPLPYNYPKYLTGELPGDYGFDIAGLSEDPVAFRKYFNFEI 165
           A+     S WYG  R ++LGP     P YLTGE PGDYG+D AGLS DP  F K    E+
Sbjct: 41  AKPVSSGSPWYGPDRVKYLGPFSGESPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEV 100

Query: 166 LHARWAMLASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQG 225
           +H RWAML +LG + PE+L   G   F E +W++ G        LDYLG P   L  +Q 
Sbjct: 101 IHCRWAMLGALGCVFPELLARNGV-KFGEAVWFKAGSQIFSEGGLDYLGNPS--LVHAQS 160

Query: 226 VIVIAICQAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSEDAAAFEEL 285
           ++ I  CQ +LM   E  R  G    EPLG  +  D  YPGG  FDPL L+ED  AF EL
Sbjct: 161 ILAIWACQVVLMGAVEGYRVAG----EPLGEVV--DPLYPGG-SFDPLGLAEDPEAFAEL 220

Query: 286 KVKEIKNGRLAMVAWLGFYSQAALTGKGPIQNLLEHVSDPFHNN 330
           KVKEIKNGRLAM +  GF+ QA +TGKGP++NL +H++DP +NN
Sbjct: 221 KVKEIKNGRLAMFSMFGFFVQALVTGKGPLENLADHLADPVNNN 254

BLAST of CmaCh09G000120 vs. Swiss-Prot
Match: CB23_NICPL (Chlorophyll a-b binding protein C, chloroplastic OS=Nicotiana plumbaginifolia GN=CABC PE=3 SV=1)

HSP 1 Score: 205.7 bits (522), Expect = 8.0e-52
Identity = 111/229 (48.47%), Positives = 139/229 (60.70%), Query Frame = 1

Query: 101 KFKALAEQARKDSSWYGEKRPRWLGPLPYNYPKYLTGELPGDYGFDIAGLSEDPVAFRKY 160
           K  + A+     S WYG  R ++LGP     P YLTGE PGDYG+D AGLS DP  F K 
Sbjct: 37  KTASKAKPVSSSSPWYGPNRVKYLGPFSGESPSYLTGEFPGDYGWDTAGLSADPETFAKN 96

Query: 161 FNFEILHARWAMLASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHL 220
              E++H RWAML +LG + PE+L   G   F E +W++ G        LDYLG P   L
Sbjct: 97  RELEVIHCRWAMLGALGCVFPELLARNGV-KFGEAVWFKGGSQIFSQGGLDYLGNPS--L 156

Query: 221 AGSQGVIVIAICQAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSEDAA 280
             +Q ++ I  CQ +LM   E  R  G    EPLG  +  D  YPGG  FDPL L+ED  
Sbjct: 157 VHAQSILAIWACQVVLMGAVEGYRVAG----EPLGEVV--DPLYPGG-SFDPLGLAEDPE 216

Query: 281 AFEELKVKEIKNGRLAMVAWLGFYSQAALTGKGPIQNLLEHVSDPFHNN 330
           AF ELKVKEIKNGRLAM +  GF+ QA +TGKGP++NL +H++DP +NN
Sbjct: 217 AFAELKVKEIKNGRLAMFSMFGFFVQAIVTGKGPLENLADHLADPVNNN 255

BLAST of CmaCh09G000120 vs. TrEMBL
Match: A0A067KWW2_JATCU (Chlorophyll a-b binding protein, chloroplastic OS=Jatropha curcas GN=JCGZ_08016 PE=3 SV=1)

HSP 1 Score: 528.5 bits (1360), Expect = 6.0e-147
Identity = 260/338 (76.92%), Positives = 294/338 (86.98%), Query Frame = 1

Query: 1   MVMALFHPHTSPSA--VSSSSASFY--GSSLQQLRSRTIALNLRNSPSIPHRFTCTASWQ 60
           MV+ L HP  S S+  +SSSS+SF+  GS     + R I L    S S P    C ASWQ
Sbjct: 1   MVLQL-HPQASVSSPLLSSSSSSFFVQGSKPLAPKPRKILLKRIASSSTP---PCKASWQ 60

Query: 61  ELAGILIFSAIPFTAVKAIANSPFGESLQRQLEKKKNAAVAKSSKFKALAEQARKDSSWY 120
           ELAG+L+FSAIPFTAVKAIANSP GESLQR++E++K  A+ +SSK +ALA +ARK+S WY
Sbjct: 61  ELAGVLVFSAIPFTAVKAIANSPLGESLQRRMEERKKVAIQESSKLQALAAKARKESLWY 120

Query: 121 GEKRPRWLGPLPYNYPKYLTGELPGDYGFDIAGLSEDPVAFRKYFNFEILHARWAMLASL 180
           GE+RP WLGP+PY+YP+YLTGELPGDYGFD+AGLS+DPVAF++YFNFEILHARWAMLA+L
Sbjct: 121 GEERPHWLGPIPYDYPQYLTGELPGDYGFDVAGLSKDPVAFQRYFNFEILHARWAMLAAL 180

Query: 181 GALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAIL 240
           GALVPE+LD+ GAFHF EP+WWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQA+L
Sbjct: 181 GALVPELLDLSGAFHFIEPVWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQALL 240

Query: 241 MVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSEDAAAFEELKVKEIKNGRLA 300
           MVGPEYARYCGIEALEPLGIYLPGDINYPGG LFDPLNLS D  AFEELKVKEIKNGRLA
Sbjct: 241 MVGPEYARYCGIEALEPLGIYLPGDINYPGGALFDPLNLSGDPVAFEELKVKEIKNGRLA 300

Query: 301 MVAWLGFYSQAALTGKGPIQNLLEHVSDPFHNNFLSLL 335
           MVAWLGFY+QAALTGKGP+QNLLEH+SDPFHNN  S+L
Sbjct: 301 MVAWLGFYAQAALTGKGPVQNLLEHISDPFHNNLCSVL 334

BLAST of CmaCh09G000120 vs. TrEMBL
Match: D7UDH1_VITVI (Chlorophyll a-b binding protein, chloroplastic OS=Vitis vinifera GN=VIT_18s0122g00430 PE=3 SV=1)

HSP 1 Score: 528.1 bits (1359), Expect = 7.9e-147
Identity = 252/325 (77.54%), Positives = 282/325 (86.77%), Query Frame = 1

Query: 12  PSAVSSSSASFYGSSLQQLRSRTIALNLRNSPSIPHRFTCTASWQELAGILIFSAIPFTA 71
           P +VSSSS+SF+G    +L S+      RN  S      C ASWQELAG+LIFSAIPFTA
Sbjct: 6   PCSVSSSSSSFFGGYRHRLCSKISWPKSRNVNSTSTYSGCKASWQELAGVLIFSAIPFTA 65

Query: 72  VKAIANSPFGESLQRQLEKKKNAAVAKSSKFKALAEQARKDSSWYGEKRPRWLGPLPYNY 131
           VKAIANSP GESLQ ++E+ K AAV  SSKFKALAE+A+ +S WYGE+RPRWLGP+PY+Y
Sbjct: 66  VKAIANSPLGESLQSRMEENKKAAVKNSSKFKALAEKAKNESLWYGEERPRWLGPIPYDY 125

Query: 132 PKYLTGELPGDYGFDIAGLSEDPVAFRKYFNFEILHARWAMLASLGALVPEILDIFGAFH 191
           P YLTGELPGDYGFDIAGL +DPVAF+KYFNFEILHARWAMLA+LGAL+PE+LD+ GAFH
Sbjct: 126 PAYLTGELPGDYGFDIAGLGKDPVAFQKYFNFEILHARWAMLAALGALLPELLDLLGAFH 185

Query: 192 FTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGPEYARYCGIEAL 251
           F EP+WWRVGYSKLKGDTLDYLGIPG H AGSQGVIVIAICQA+LMVGPEYARYCGIEAL
Sbjct: 186 FVEPVWWRVGYSKLKGDTLDYLGIPGFHFAGSQGVIVIAICQALLMVGPEYARYCGIEAL 245

Query: 252 EPLGIYLPGDINYPGGVLFDPLNLSEDAAAFEELKVKEIKNGRLAMVAWLGFYSQAALTG 311
           EPLGIYLPGDINYPGG LFDPLNLS+D  AFEELKVKEIKNGRLAMVAWLGFY QAA TG
Sbjct: 246 EPLGIYLPGDINYPGGALFDPLNLSKDPVAFEELKVKEIKNGRLAMVAWLGFYIQAAATG 305

Query: 312 KGPIQNLLEHVSDPFHNNFLSLLNS 337
           KGP+QNLL+H++DPFHNN LS+  +
Sbjct: 306 KGPVQNLLDHLADPFHNNLLSIFKA 330

BLAST of CmaCh09G000120 vs. TrEMBL
Match: B9R8V1_RICCO (Chlorophyll a-b binding protein, chloroplastic OS=Ricinus communis GN=RCOM_1602420 PE=3 SV=1)

HSP 1 Score: 524.6 bits (1350), Expect = 8.7e-146
Identity = 255/330 (77.27%), Positives = 285/330 (86.36%), Query Frame = 1

Query: 5   LFHPHTSPSAVSSSSASFYGSSLQQLRSRTIALNLRNSPSIPHRFTCTASWQELAGILIF 64
           L  PH S S +SSS  S  GS L  L+ R I  N   +PS P    C ASWQELAG+L+F
Sbjct: 4   LLQPHPSFSWISSSLFS-QGSKLLALKPRKILFN--GNPSSP----CKASWQELAGVLVF 63

Query: 65  SAIPFTAVKAIANSPFGESLQRQLEKKKNAAVAKSSKFKALAEQARKDSSWYGEKRPRWL 124
           SAIPFTAVKA+ANSP GESLQR+LE+ K  AV +SSKF+AL  +ARK+S WYGE+RPRWL
Sbjct: 64  SAIPFTAVKALANSPLGESLQRRLEESKKVAVKESSKFQALTAKARKESLWYGEERPRWL 123

Query: 125 GPLPYNYPKYLTGELPGDYGFDIAGLSEDPVAFRKYFNFEILHARWAMLASLGALVPEIL 184
           GP+PY+YP YLTGELPGDYGFD+AGLS+D VAF++YFNFEILHARWAMLA+LGAL PE+L
Sbjct: 124 GPIPYDYPAYLTGELPGDYGFDVAGLSKDSVAFQRYFNFEILHARWAMLAALGALAPEVL 183

Query: 185 DIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGPEYAR 244
           D+ GAFHF EP+WWRVGYSKLKGDTLDYLGIPGLH AGSQGV+VIAICQA+LMVGPEYAR
Sbjct: 184 DLSGAFHFIEPVWWRVGYSKLKGDTLDYLGIPGLHFAGSQGVLVIAICQALLMVGPEYAR 243

Query: 245 YCGIEALEPLGIYLPGDINYPGGVLFDPLNLSEDAAAFEELKVKEIKNGRLAMVAWLGFY 304
           YCGIEALEPLGIYLPGDINYPGG LFDPLNLS DA A EELKVKEIKNGRLAMVAWLGFY
Sbjct: 244 YCGIEALEPLGIYLPGDINYPGGPLFDPLNLSSDAVALEELKVKEIKNGRLAMVAWLGFY 303

Query: 305 SQAALTGKGPIQNLLEHVSDPFHNNFLSLL 335
           +QAALTGKGP+QNL+EH+SDPFHNN L +L
Sbjct: 304 AQAALTGKGPVQNLVEHISDPFHNNLLCVL 326

BLAST of CmaCh09G000120 vs. TrEMBL
Match: A0A059BQZ6_EUCGR (Chlorophyll a-b binding protein, chloroplastic OS=Eucalyptus grandis GN=EUGRSUZ_F01826 PE=3 SV=1)

HSP 1 Score: 523.9 bits (1348), Expect = 1.5e-145
Identity = 255/332 (76.81%), Positives = 282/332 (84.94%), Query Frame = 1

Query: 3   MALFHPHTSPSAVSSSSASFYGSSLQQLRSRTIALNLRNSPSIPHRFTCTASWQELAGIL 62
           MA+     SPS+ SSS A    S L++L +R      R          C ASWQELAG+L
Sbjct: 1   MAMLPSPVSPSSPSSSFA--LASPLRRLPARPAPPRRRRRRLPKSASVCKASWQELAGVL 60

Query: 63  IFSAIPFTAVKAIANSPFGESLQRQLEKKKNAAVAKSSKFKALAEQARKDSSWYGEKRPR 122
           +FSAIPFTAVKAIANSP GESLQR++E++K  AV  SS+F+ALAE+ARK+SSWYGE RP+
Sbjct: 61  VFSAIPFTAVKAIANSPLGESLQRRMEERKKFAVENSSRFRALAEKARKESSWYGEDRPQ 120

Query: 123 WLGPLPYNYPKYLTGELPGDYGFDIAGLSEDPVAFRKYFNFEILHARWAMLASLGALVPE 182
           WLGP+PY+YP YLTGE PGDYGFDIAGL+ DP AF KYFNFEILHARWAMLA+LGALVPE
Sbjct: 121 WLGPIPYDYPAYLTGEYPGDYGFDIAGLARDPTAFEKYFNFEILHARWAMLAALGALVPE 180

Query: 183 ILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGPEY 242
           +LD+ GAFHF EP+WWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQA+LMVGPEY
Sbjct: 181 VLDMVGAFHFVEPVWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQALLMVGPEY 240

Query: 243 ARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSEDAAAFEELKVKEIKNGRLAMVAWLG 302
           ARYCGIEALEPLGIYLPGDINYPGG LFDPLNLS+D A FEELKVKEIKNGRLAMVAWLG
Sbjct: 241 ARYCGIEALEPLGIYLPGDINYPGGALFDPLNLSKDPATFEELKVKEIKNGRLAMVAWLG 300

Query: 303 FYSQAALTGKGPIQNLLEHVSDPFHNNFLSLL 335
           FY QAALTGKGPIQNLLEH+SDP HNN  S+L
Sbjct: 301 FYVQAALTGKGPIQNLLEHISDPLHNNLFSVL 330

BLAST of CmaCh09G000120 vs. TrEMBL
Match: B9H5X9_POPTR (Chlorophyll a-b binding protein, chloroplastic OS=Populus trichocarpa GN=POPTR_0005s28000g PE=3 SV=1)

HSP 1 Score: 523.1 bits (1346), Expect = 2.5e-145
Identity = 250/334 (74.85%), Positives = 287/334 (85.93%), Query Frame = 1

Query: 1   MVMALFHPHTSPSAVSSSSASFYGSSLQQLRSRTIALNLRNSPSIPHRFTCTASWQELAG 60
           + +A+ HPH S SA  SSS+ F    + ++       N  NS S+P    C ASWQELAG
Sbjct: 5   LAVAVVHPHVSFSA--SSSSFFAHQRVPRILLSKSRSNSSNSTSLPASI-CKASWQELAG 64

Query: 61  ILIFSAIPFTAVKAIANSPFGESLQRQLEKKKNAAVAKSSKFKALAEQARKDSSWYGEKR 120
           ++IFSAIPFTAVKAIANSP GESLQR+LE++K  AV +SSKFKALA++ARK+S WYGE+R
Sbjct: 65  VVIFSAIPFTAVKAIANSPLGESLQRRLEERKKLAVQQSSKFKALAQKARKESFWYGEER 124

Query: 121 PRWLGPLPYNYPKYLTGELPGDYGFDIAGLSEDPVAFRKYFNFEILHARWAMLASLGALV 180
           PRWLGP+ Y YP YL+GELPGDYGFD+AGL+EDPVAF++YFNFEILHARWAMLA+LGAL+
Sbjct: 125 PRWLGPISYQYPTYLSGELPGDYGFDVAGLAEDPVAFQRYFNFEILHARWAMLAALGALI 184

Query: 181 PEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGP 240
           PE+LD+ GAFHF EP+WWRVGYSKLKGDTLDYLGIPGLH AG QGV+VIA CQAILMVGP
Sbjct: 185 PEVLDLSGAFHFIEPVWWRVGYSKLKGDTLDYLGIPGLHFAGGQGVLVIAFCQAILMVGP 244

Query: 241 EYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSEDAAAFEELKVKEIKNGRLAMVAW 300
           EYARYCGIEALEPLGIYLPGDINYPGG+LFDPLNLS+D  +FEELKVKEIKNGRLAMVAW
Sbjct: 245 EYARYCGIEALEPLGIYLPGDINYPGGILFDPLNLSKDPVSFEELKVKEIKNGRLAMVAW 304

Query: 301 LGFYSQAALTGKGPIQNLLEHVSDPFHNNFLSLL 335
           LGFY QAALTGKGP++NL+EH+SDP HNN  S L
Sbjct: 305 LGFYIQAALTGKGPVENLVEHISDPLHNNLFSTL 335

BLAST of CmaCh09G000120 vs. TAIR10
Match: AT1G76570.1 (AT1G76570.1 Chlorophyll A-B binding family protein)

HSP 1 Score: 491.9 bits (1265), Expect = 3.2e-139
Identity = 238/335 (71.04%), Positives = 279/335 (83.28%), Query Frame = 1

Query: 3   MALFHPHTSPSAVSSSSASFYGSSLQQLRSRTIALNLRNSPSIPH-RFTCTASWQELAGI 62
           MALF        +SS S+S+  SS+  L  R +    RN  ++   R  C ASWQELAG+
Sbjct: 1   MALFQ-----EKLSSLSSSY--SSIHSL-PRILVSKPRNRIAVTKSRSICRASWQELAGV 60

Query: 63  LIFSAIPFTAVKAIANSPFGESLQRQLEKKKNAAVAKSSKFKALAEQARKDSSWYGEKRP 122
           L+FSAIPFTAVKAIANS  G SL+R+LE+KK  AV  SS+FK+ A++AR DS WYG++RP
Sbjct: 61  LVFSAIPFTAVKAIANSSIGVSLRRRLEEKKKEAVENSSRFKSKAQEARNDSKWYGKERP 120

Query: 123 RWLGPLPYNYPKYLTGELPGDYGFDIAGLSEDPVAFRKYFNFEILHARWAMLASLGALVP 182
           RW GP+PY+YP YLTGELPGDYGFDIAGL +D + F KYFNFEILHARWAMLA+LGAL+P
Sbjct: 121 RWFGPIPYDYPPYLTGELPGDYGFDIAGLGKDRLTFDKYFNFEILHARWAMLAALGALIP 180

Query: 183 EILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGPE 242
           E+ D+ G FHF EP+WWRVGYSKL+G+TL+YLGIPGLH+AGSQGVIVIAICQ +LMVGPE
Sbjct: 181 EVFDLTGTFHFAEPVWWRVGYSKLQGETLEYLGIPGLHVAGSQGVIVIAICQVLLMVGPE 240

Query: 243 YARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSEDAAAFEELKVKEIKNGRLAMVAWL 302
           YARYCGIEALEPLGIYLPGDINYPGG LFDPLNLSED  AFE+LKVKEIKNGRLAMVAWL
Sbjct: 241 YARYCGIEALEPLGIYLPGDINYPGGTLFDPLNLSEDPVAFEDLKVKEIKNGRLAMVAWL 300

Query: 303 GFYSQAALTGKGPIQNLLEHVSDPFHNNFLSLLNS 337
           GFY+QAA TGKGP+QNL++HVSDP HNN +++L +
Sbjct: 301 GFYAQAAFTGKGPVQNLVDHVSDPLHNNLIAMLQT 327

BLAST of CmaCh09G000120 vs. TAIR10
Match: AT3G27690.1 (AT3G27690.1 photosystem II light harvesting complex gene 2.3)

HSP 1 Score: 211.5 bits (537), Expect = 8.3e-55
Identity = 113/217 (52.07%), Positives = 137/217 (63.13%), Query Frame = 1

Query: 113 SSWYGEKRPRWLGPLPYNYPKYLTGELPGDYGFDIAGLSEDPVAFRKYFNFEILHARWAM 172
           S WYG  RP++LGP   N P YLTGE PGDYG+D AGLS DP  F K    E++H+RWAM
Sbjct: 48  SIWYGPDRPKYLGPFSENTPSYLTGEYPGDYGWDTAGLSADPETFAKNRELEVIHSRWAM 107

Query: 173 LASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC 232
           L +LG   PEIL   G   F E +W++ G        LDYLG P  +L  +Q ++ I  C
Sbjct: 108 LGALGCTFPEILSKNGV-KFGEAVWFKAGSQIFSEGGLDYLGNP--NLIHAQSILAIWAC 167

Query: 233 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSEDAAAFEELKVKEIKN 292
           Q +LM   E  R  G     PLG  L  D  YPGG  FDPLNL+ED  AF ELKVKE+KN
Sbjct: 168 QVVLMGFIEGYRIGG----GPLGEGL--DPLYPGGA-FDPLNLAEDPEAFSELKVKELKN 227

Query: 293 GRLAMVAWLGFYSQAALTGKGPIQNLLEHVSDPFHNN 330
           GRLAM +  GF+ QA +TGKGPI+NL +H++DP  NN
Sbjct: 228 GRLAMFSMFGFFVQAIVTGKGPIENLFDHIADPVANN 254

BLAST of CmaCh09G000120 vs. TAIR10
Match: AT2G05070.1 (AT2G05070.1 photosystem II light harvesting complex gene 2.2)

HSP 1 Score: 206.8 bits (525), Expect = 2.0e-53
Identity = 112/217 (51.61%), Positives = 136/217 (62.67%), Query Frame = 1

Query: 113 SSWYGEKRPRWLGPLPYNYPKYLTGELPGDYGFDIAGLSEDPVAFRKYFNFEILHARWAM 172
           S WYG  RP++LGP   N P YLTGE PGDYG+D AGLS DP  F K    E++H+RWAM
Sbjct: 47  SIWYGPDRPKYLGPFSENTPSYLTGEYPGDYGWDTAGLSADPETFAKNRELEVIHSRWAM 106

Query: 173 LASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC 232
           L +LG   PEIL   G   F E +W++ G        LDYLG P  +L  +Q ++ I   
Sbjct: 107 LGALGCTFPEILSKNGV-KFGEAVWFKAGSQIFSEGGLDYLGNP--NLIHAQSILAIWAV 166

Query: 233 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSEDAAAFEELKVKEIKN 292
           Q +LM   E  R  G     PLG  L  D  YPGG  FDPLNL+ED  AF ELKVKE+KN
Sbjct: 167 QVVLMGFIEGYRIGG----GPLGEGL--DPLYPGGA-FDPLNLAEDPEAFSELKVKELKN 226

Query: 293 GRLAMVAWLGFYSQAALTGKGPIQNLLEHVSDPFHNN 330
           GRLAM +  GF+ QA +TGKGPI+NL +H++DP  NN
Sbjct: 227 GRLAMFSMFGFFVQAIVTGKGPIENLFDHLADPVANN 253

BLAST of CmaCh09G000120 vs. TAIR10
Match: AT2G05100.1 (AT2G05100.1 photosystem II light harvesting complex gene 2.1)

HSP 1 Score: 206.8 bits (525), Expect = 2.0e-53
Identity = 112/217 (51.61%), Positives = 136/217 (62.67%), Query Frame = 1

Query: 113 SSWYGEKRPRWLGPLPYNYPKYLTGELPGDYGFDIAGLSEDPVAFRKYFNFEILHARWAM 172
           S WYG  RP++LGP   N P YLTGE PGDYG+D AGLS DP  F K    E++H+RWAM
Sbjct: 47  SIWYGPDRPKYLGPFSENTPSYLTGEYPGDYGWDTAGLSADPETFAKNRELEVIHSRWAM 106

Query: 173 LASLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAIC 232
           L +LG   PEIL   G   F E +W++ G        LDYLG P  +L  +Q ++ I   
Sbjct: 107 LGALGCTFPEILSKNGV-KFGEAVWFKAGSQIFSEGGLDYLGNP--NLIHAQSILAIWAV 166

Query: 233 QAILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSEDAAAFEELKVKEIKN 292
           Q +LM   E  R  G     PLG  L  D  YPGG  FDPLNL+ED  AF ELKVKE+KN
Sbjct: 167 QVVLMGFIEGYRIGG----GPLGEGL--DPLYPGGA-FDPLNLAEDPEAFSELKVKELKN 226

Query: 293 GRLAMVAWLGFYSQAALTGKGPIQNLLEHVSDPFHNN 330
           GRLAM +  GF+ QA +TGKGPI+NL +H++DP  NN
Sbjct: 227 GRLAMFSMFGFFVQAIVTGKGPIENLFDHLADPVANN 253

BLAST of CmaCh09G000120 vs. TAIR10
Match: AT5G54270.1 (AT5G54270.1 light-harvesting chlorophyll B-binding protein 3)

HSP 1 Score: 203.8 bits (517), Expect = 1.7e-52
Identity = 101/215 (46.98%), Positives = 132/215 (61.40%), Query Frame = 1

Query: 115 WYGEKRPRWLGPLPYNYPKYLTGELPGDYGFDIAGLSEDPVAFRKYFNFEILHARWAMLA 174
           WYG  R ++LGP     P YLTGE PGDYG+D AGLS DP AF K    E++H RWAML 
Sbjct: 47  WYGPDRVKYLGPFSVQTPSYLTGEFPGDYGWDTAGLSADPEAFAKNRALEVIHGRWAMLG 106

Query: 175 SLGALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQA 234
           + G + PE+L  +    F EP+W++ G        LDYLG P  +L  +Q ++ +   Q 
Sbjct: 107 AFGCITPEVLQKWVRVDFKEPVWFKAGSQIFSEGGLDYLGNP--NLVHAQSILAVLGFQV 166

Query: 235 ILMVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSEDAAAFEELKVKEIKNGR 294
           ILM   E  R  G++ +        G+  YPGG  FDPL L++D   F ELKVKEIKNGR
Sbjct: 167 ILMGLVEGFRINGLDGVG------EGNDLYPGGQYFDPLGLADDPVTFAELKVKEIKNGR 226

Query: 295 LAMVAWLGFYSQAALTGKGPIQNLLEHVSDPFHNN 330
           LAM +  GF+ QA +TGKGP++NLL+H+ +P  NN
Sbjct: 227 LAMFSMFGFFVQAIVTGKGPLENLLDHLDNPVANN 253

BLAST of CmaCh09G000120 vs. NCBI nr
Match: gi|659105211|ref|XP_008453032.1| (PREDICTED: chlorophyll a-b binding protein of LHCII type 1 isoform X1 [Cucumis melo])

HSP 1 Score: 625.2 bits (1611), Expect = 6.8e-176
Identity = 304/337 (90.21%), Positives = 321/337 (95.25%), Query Frame = 1

Query: 1   MVMALFHPHTSPSAVSSSSASFYGSSLQQLRSRTIALNLRNSPSIPHRFTCTASWQELAG 60
           +VMAL  PHT  S++SSSSASFYG+ LQQLR  T  LNLR++ SI HR TC ASWQELAG
Sbjct: 8   IVMALIQPHTPASSISSSSASFYGTYLQQLRPLTTTLNLRSNHSISHRSTCRASWQELAG 67

Query: 61  ILIFSAIPFTAVKAIANSPFGESLQRQLEKKKNAAVAKSSKFKALAEQARKDSSWYGEKR 120
           +LIFSA+PFTAVKAIANSP GESLQRQLEKKK +AVA SSKFKALAE+ARKDSSWYGE+R
Sbjct: 68  VLIFSAVPFTAVKAIANSPLGESLQRQLEKKKKSAVANSSKFKALAEEARKDSSWYGEER 127

Query: 121 PRWLGPLPYNYPKYLTGELPGDYGFDIAGLSEDPVAFRKYFNFEILHARWAMLASLGALV 180
           PRWLGPLPY+YPKYLTGELPGDYGFDIAGLSEDPVAF+KYFNFEILHARWAMLASLGALV
Sbjct: 128 PRWLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKYFNFEILHARWAMLASLGALV 187

Query: 181 PEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGP 240
           PEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGP
Sbjct: 188 PEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGP 247

Query: 241 EYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSEDAAAFEELKVKEIKNGRLAMVAW 300
           EYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLS+DAAAFEELKVKEIKNGRLAMVAW
Sbjct: 248 EYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAW 307

Query: 301 LGFYSQAALTGKGPIQNLLEHVSDPFHNNFLSLLNSS 338
           LGFYSQAALTGKGP+QNLL+H+SDPFHNNFLSLLNSS
Sbjct: 308 LGFYSQAALTGKGPVQNLLDHISDPFHNNFLSLLNSS 344

BLAST of CmaCh09G000120 vs. NCBI nr
Match: gi|449455655|ref|XP_004145567.1| (PREDICTED: chlorophyll a-b binding protein of LHCII type 1-like isoform X1 [Cucumis sativus])

HSP 1 Score: 614.8 bits (1584), Expect = 9.2e-173
Identity = 299/335 (89.25%), Positives = 317/335 (94.63%), Query Frame = 1

Query: 3   MALFHPHTSPSAVSSSSASFYGSSLQQLRSRTIALNLRNSPSIPHRFTCTASWQELAGIL 62
           MAL  PHT  S++SSSSAS +G+ LQQ R  T  LNLR++ SI HR TC ASWQELAG+L
Sbjct: 1   MALVQPHTPASSISSSSASLFGTYLQQFRPLTTTLNLRSNHSISHRSTCRASWQELAGVL 60

Query: 63  IFSAIPFTAVKAIANSPFGESLQRQLEKKKNAAVAKSSKFKALAEQARKDSSWYGEKRPR 122
           IFSAIPFTAVKAIANSP G SLQRQLEKKKN+AVA SSKFKALAE+ARKDSSWYGE+RPR
Sbjct: 61  IFSAIPFTAVKAIANSPLGGSLQRQLEKKKNSAVANSSKFKALAEEARKDSSWYGEERPR 120

Query: 123 WLGPLPYNYPKYLTGELPGDYGFDIAGLSEDPVAFRKYFNFEILHARWAMLASLGALVPE 182
           WLGPLPY+YPKYLTGELPGDYGFDIAGLSEDPVAF+K+FNFEILHARWAMLASLGALVPE
Sbjct: 121 WLGPLPYDYPKYLTGELPGDYGFDIAGLSEDPVAFQKFFNFEILHARWAMLASLGALVPE 180

Query: 183 ILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGPEY 242
           ILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGPEY
Sbjct: 181 ILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGPEY 240

Query: 243 ARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSEDAAAFEELKVKEIKNGRLAMVAWLG 302
           ARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLS+DAAAFEELKVKEIKNGRLAMVAWLG
Sbjct: 241 ARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSKDAAAFEELKVKEIKNGRLAMVAWLG 300

Query: 303 FYSQAALTGKGPIQNLLEHVSDPFHNNFLSLLNSS 338
           FYSQAALTGKGP+QNLL+H++DPFHNNFLSLLNSS
Sbjct: 301 FYSQAALTGKGPVQNLLDHIADPFHNNFLSLLNSS 335

BLAST of CmaCh09G000120 vs. NCBI nr
Match: gi|802604283|ref|XP_012073543.1| (PREDICTED: chlorophyll a-b binding protein of LHCII type 1 [Jatropha curcas])

HSP 1 Score: 528.5 bits (1360), Expect = 8.7e-147
Identity = 260/338 (76.92%), Positives = 294/338 (86.98%), Query Frame = 1

Query: 1   MVMALFHPHTSPSA--VSSSSASFY--GSSLQQLRSRTIALNLRNSPSIPHRFTCTASWQ 60
           MV+ L HP  S S+  +SSSS+SF+  GS     + R I L    S S P    C ASWQ
Sbjct: 1   MVLQL-HPQASVSSPLLSSSSSSFFVQGSKPLAPKPRKILLKRIASSSTP---PCKASWQ 60

Query: 61  ELAGILIFSAIPFTAVKAIANSPFGESLQRQLEKKKNAAVAKSSKFKALAEQARKDSSWY 120
           ELAG+L+FSAIPFTAVKAIANSP GESLQR++E++K  A+ +SSK +ALA +ARK+S WY
Sbjct: 61  ELAGVLVFSAIPFTAVKAIANSPLGESLQRRMEERKKVAIQESSKLQALAAKARKESLWY 120

Query: 121 GEKRPRWLGPLPYNYPKYLTGELPGDYGFDIAGLSEDPVAFRKYFNFEILHARWAMLASL 180
           GE+RP WLGP+PY+YP+YLTGELPGDYGFD+AGLS+DPVAF++YFNFEILHARWAMLA+L
Sbjct: 121 GEERPHWLGPIPYDYPQYLTGELPGDYGFDVAGLSKDPVAFQRYFNFEILHARWAMLAAL 180

Query: 181 GALVPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAIL 240
           GALVPE+LD+ GAFHF EP+WWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQA+L
Sbjct: 181 GALVPELLDLSGAFHFIEPVWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQALL 240

Query: 241 MVGPEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSEDAAAFEELKVKEIKNGRLA 300
           MVGPEYARYCGIEALEPLGIYLPGDINYPGG LFDPLNLS D  AFEELKVKEIKNGRLA
Sbjct: 241 MVGPEYARYCGIEALEPLGIYLPGDINYPGGALFDPLNLSGDPVAFEELKVKEIKNGRLA 300

Query: 301 MVAWLGFYSQAALTGKGPIQNLLEHVSDPFHNNFLSLL 335
           MVAWLGFY+QAALTGKGP+QNLLEH+SDPFHNN  S+L
Sbjct: 301 MVAWLGFYAQAALTGKGPVQNLLEHISDPFHNNLCSVL 334

BLAST of CmaCh09G000120 vs. NCBI nr
Match: gi|297745621|emb|CBI40786.3| (unnamed protein product [Vitis vinifera])

HSP 1 Score: 528.1 bits (1359), Expect = 1.1e-146
Identity = 252/325 (77.54%), Positives = 282/325 (86.77%), Query Frame = 1

Query: 12  PSAVSSSSASFYGSSLQQLRSRTIALNLRNSPSIPHRFTCTASWQELAGILIFSAIPFTA 71
           P +VSSSS+SF+G    +L S+      RN  S      C ASWQELAG+LIFSAIPFTA
Sbjct: 6   PCSVSSSSSSFFGGYRHRLCSKISWPKSRNVNSTSTYSGCKASWQELAGVLIFSAIPFTA 65

Query: 72  VKAIANSPFGESLQRQLEKKKNAAVAKSSKFKALAEQARKDSSWYGEKRPRWLGPLPYNY 131
           VKAIANSP GESLQ ++E+ K AAV  SSKFKALAE+A+ +S WYGE+RPRWLGP+PY+Y
Sbjct: 66  VKAIANSPLGESLQSRMEENKKAAVKNSSKFKALAEKAKNESLWYGEERPRWLGPIPYDY 125

Query: 132 PKYLTGELPGDYGFDIAGLSEDPVAFRKYFNFEILHARWAMLASLGALVPEILDIFGAFH 191
           P YLTGELPGDYGFDIAGL +DPVAF+KYFNFEILHARWAMLA+LGAL+PE+LD+ GAFH
Sbjct: 126 PAYLTGELPGDYGFDIAGLGKDPVAFQKYFNFEILHARWAMLAALGALLPELLDLLGAFH 185

Query: 192 FTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVGPEYARYCGIEAL 251
           F EP+WWRVGYSKLKGDTLDYLGIPG H AGSQGVIVIAICQA+LMVGPEYARYCGIEAL
Sbjct: 186 FVEPVWWRVGYSKLKGDTLDYLGIPGFHFAGSQGVIVIAICQALLMVGPEYARYCGIEAL 245

Query: 252 EPLGIYLPGDINYPGGVLFDPLNLSEDAAAFEELKVKEIKNGRLAMVAWLGFYSQAALTG 311
           EPLGIYLPGDINYPGG LFDPLNLS+D  AFEELKVKEIKNGRLAMVAWLGFY QAA TG
Sbjct: 246 EPLGIYLPGDINYPGGALFDPLNLSKDPVAFEELKVKEIKNGRLAMVAWLGFYIQAAATG 305

Query: 312 KGPIQNLLEHVSDPFHNNFLSLLNS 337
           KGP+QNLL+H++DPFHNN LS+  +
Sbjct: 306 KGPVQNLLDHLADPFHNNLLSIFKA 330

BLAST of CmaCh09G000120 vs. NCBI nr
Match: gi|950990885|ref|XP_014504405.1| (PREDICTED: chlorophyll a-b binding protein 5, chloroplastic isoform X1 [Vigna radiata var. radiata])

HSP 1 Score: 526.2 bits (1354), Expect = 4.3e-146
Identity = 255/336 (75.89%), Positives = 282/336 (83.93%), Query Frame = 1

Query: 3   MALFHPHTSPSAVSSSSASFYGSSLQQ--LRSRTIALNLRNSPSIPHR-FTCTASWQELA 62
           M   HP   P +VSS S+  +  SL     RS   A   RNS     R F C ASWQELA
Sbjct: 1   MVSLHPLVCPLSVSSCSSDIFRCSLWSSCYRSACFAPKRRNSSKTEQRSFICNASWQELA 60

Query: 63  GILIFSAIPFTAVKAIANSPFGESLQRQLEKKKNAAVAKSSKFKALAEQARKDSSWYGEK 122
           G+L+FSAIPFTAVKAIANSP GESLQR++E++K +A  KSSKFKALA++ARK+S WYGE 
Sbjct: 61  GVLLFSAIPFTAVKAIANSPLGESLQRKMEEEKKSAEKKSSKFKALADKARKESCWYGED 120

Query: 123 RPRWLGPLPYNYPKYLTGELPGDYGFDIAGLSEDPVAFRKYFNFEILHARWAMLASLGAL 182
           RPRWLGP+ Y YP YLTG+LPGDYGFDIAGL +DPVA RKYFNFEILHARWAMLAS+GAL
Sbjct: 121 RPRWLGPISYEYPSYLTGDLPGDYGFDIAGLGKDPVALRKYFNFEILHARWAMLASIGAL 180

Query: 183 VPEILDIFGAFHFTEPIWWRVGYSKLKGDTLDYLGIPGLHLAGSQGVIVIAICQAILMVG 242
           +PEILD+ GAFHF EP+WWRVGYSKLKGDTLDYLGI GLH AGSQGV+VIAICQA+LMVG
Sbjct: 181 IPEILDLLGAFHFVEPVWWRVGYSKLKGDTLDYLGIQGLHFAGSQGVVVIAICQALLMVG 240

Query: 243 PEYARYCGIEALEPLGIYLPGDINYPGGVLFDPLNLSEDAAAFEELKVKEIKNGRLAMVA 302
           PEYARYCG EALEPLGIYLPGDINYPGG LFDPLNLS D  AFEELKVKEIKNGRLAMVA
Sbjct: 241 PEYARYCGTEALEPLGIYLPGDINYPGGALFDPLNLSNDPEAFEELKVKEIKNGRLAMVA 300

Query: 303 WLGFYSQAALTGKGPIQNLLEHVSDPFHNNFLSLLN 336
           WLGFY QAALTGKGP+QNL++ +SDPFHNNFL+ LN
Sbjct: 301 WLGFYVQAALTGKGPVQNLIDFISDPFHNNFLNSLN 336
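
The percentages in the HSP summary lines above follow directly from the match counts, for example 260/338 = 76.92%. The following is a minimal sketch for re-deriving them, assuming the summary lines keep the layout shown above; the names `HSP_LINE` and `parse_hsp_summary` are illustrative and not part of any BLAST tool.

```python
import re

# Matches the summary line of an HSP. The pattern accepts "Positives" as well
# as the "Postives" spelling that some BLAST result pages emit.
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return recomputed and reported percentages for one HSP summary line."""
    match = HSP_LINE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        # Recomputed from the raw counts (matches / alignment length).
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    summary = "Identity = 260/338 (76.92%), Positives = 294/338 (86.98%), Query Frame = 1"
    print(parse_hsp_summary(summary))
```

Comparing the recomputed and reported values is a quick consistency check when alignment reports are copied between pages.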

The following BLAST results are available for this feature:
Match Name    E-value   Identity  Description
CB2_CHLMO     5.6e-53   46.96     Chlorophyll a-b binding protein of LHCII type I, chloroplastic OS=Chlamydomonas ...
CB24_SOLLC    1.6e-52   51.15     Chlorophyll a-b binding protein 4, chloroplastic OS=Solanum lycopersicum GN=CAB4...
CB2B_SOLLC    3.6e-52   44.62     Chlorophyll a-b binding protein 1B, chloroplastic OS=Solanum lycopersicum GN=CAB...
CB25_NICPL    6.2e-52   49.11     Chlorophyll a-b binding protein E, chloroplastic OS=Nicotiana plumbaginifolia GN...
CB23_NICPL    8.0e-52   48.47     Chlorophyll a-b binding protein C, chloroplastic OS=Nicotiana plumbaginifolia GN...

Match Name        E-value    Identity  Description
A0A067KWW2_JATCU  6.0e-147   76.92     Chlorophyll a-b binding protein, chloroplastic OS=Jatropha curcas GN=JCGZ_08016 ...
D7UDH1_VITVI      7.9e-147   77.54     Chlorophyll a-b binding protein, chloroplastic OS=Vitis vinifera GN=VIT_18s0122g...
B9R8V1_RICCO      8.7e-146   77.27     Chlorophyll a-b binding protein, chloroplastic OS=Ricinus communis GN=RCOM_16024...
A0A059BQZ6_EUCGR  1.5e-145   76.81     Chlorophyll a-b binding protein, chloroplastic OS=Eucalyptus grandis GN=EUGRSUZ_...
B9H5X9_POPTR      2.5e-145   74.85     Chlorophyll a-b binding protein, chloroplastic OS=Populus trichocarpa GN=POPTR_0...

Match Name   E-value    Identity  Description
AT1G76570.1  3.2e-139   71.04     Chlorophyll A-B binding family protein
AT3G27690.1  8.3e-55    52.07     photosystem II light harvesting complex gene 2.3
AT2G05070.1  2.0e-53    51.61     photosystem II light harvesting complex gene 2.2
AT2G05100.1  2.0e-53    51.61     photosystem II light harvesting complex gene 2.1
AT5G54270.1  1.7e-52    46.98     light-harvesting chlorophyll B-binding protein 3

Match Name                        E-value    Identity  Description
gi|659105211|ref|XP_008453032.1|  6.8e-176   90.21     PREDICTED: chlorophyll a-b binding protein of LHCII type 1 isoform X1 [Cucumis m...
gi|449455655|ref|XP_004145567.1|  9.2e-173   89.25     PREDICTED: chlorophyll a-b binding protein of LHCII type 1-like isoform X1 [Cucu...
gi|802604283|ref|XP_012073543.1|  8.7e-147   76.92     PREDICTED: chlorophyll a-b binding protein of LHCII type 1 [Jatropha curcas]
gi|297745621|emb|CBI40786.3|      1.1e-146   77.54     unnamed protein product [Vitis vinifera]
gi|950990885|ref|XP_014504405.1|  4.3e-146   75.89     PREDICTED: chlorophyll a-b binding protein 5, chloroplastic isoform X1 [Vigna ra...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR001344  Chloro_AB-bd_pln
IPR022796  Chloroa_b-bind
IPR023329  Chlorophyll_a/b-bd_dom_sf
Vocabulary: Biological Process
Term        Definition
GO:0009765  photosynthesis, light harvesting
Vocabulary: Cellular Component
Term        Definition
GO:0016020  membrane
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015996      chlorophyll catabolic process
biological_process  GO:0010264      myo-inositol hexakisphosphate biosynthetic process
biological_process  GO:0009765      photosynthesis, light harvesting
biological_process  GO:0018298      protein-chromophore linkage
cellular_component  GO:0009507      chloroplast
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0009522      photosystem I
cellular_component  GO:0009523      photosystem II
cellular_component  GO:0016020      membrane
cellular_component  GO:0009535      chloroplast thylakoid membrane
molecular_function  GO:0016168      chlorophyll binding
molecular_function  GO:0046872      metal ion binding
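
The GO assignments above can be grouped by category for downstream use. The following is a minimal sketch, assuming each row is whitespace-separated as category, accession, term name; `GO_ROWS` and `group_go_terms` are illustrative names, and the rows are copied from the table above.

```python
from collections import defaultdict

# Rows copied from the GO assignment table for CmaCh09G000120.
GO_ROWS = """\
biological_process GO:0015996 chlorophyll catabolic process
biological_process GO:0010264 myo-inositol hexakisphosphate biosynthetic process
biological_process GO:0009765 photosynthesis, light harvesting
biological_process GO:0018298 protein-chromophore linkage
cellular_component GO:0009507 chloroplast
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0009522 photosystem I
cellular_component GO:0009523 photosystem II
cellular_component GO:0016020 membrane
cellular_component GO:0009535 chloroplast thylakoid membrane
molecular_function GO:0016168 chlorophyll binding
molecular_function GO:0046872 metal ion binding
"""


def group_go_terms(text):
    """Group (accession, name) pairs by GO category."""
    grouped = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        # Split on the first two runs of whitespace only, so that term names
        # containing spaces or commas stay intact.
        category, accession, name = line.split(None, 2)
        grouped[category].append((accession, name))
    return dict(grouped)


if __name__ == "__main__":
    for category, terms in group_go_terms(GO_ROWS).items():
        print(category, len(terms))
```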

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh09G000120.1  CmaCh09G000120.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description                         Source   Source Term         Source Description                      Alignment
IPR001344  Chlorophyll A-B binding protein, plant  PANTHER  PTHR21649           CHLOROPHYLL A/B BINDING PROTEIN         coord: 55..336; score: 2.8E
IPR022796  Chlorophyll A-B binding protein         PFAM     PF00504             Chloroa_b-bind                          coord: 132..307; score: 4.8
IPR023329  Chlorophyll a/b binding protein domain  GENE3D   G3DSA:1.10.3460.10  -                                       coord: 129..332; score: 1.8
IPR023329  Chlorophyll a/b binding protein domain  unknown  SSF103511           Chlorophyll a-b binding protein         coord: 114..334; score: 1.05
None       No IPR available                        PANTHER  PTHR21649:SF24      CHLOROPHYLL A-B BINDING FAMILY PROTEIN  coord: 55..336; score: 2.8E
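
The `coord` values above give the stretch of the protein covered by each domain hit. The following is a minimal sketch for turning them into span lengths, assuming the coordinates are 1-based, inclusive amino-acid positions (an assumption; the page does not state the convention). `HITS`, `COORD` and `span_length` are illustrative names.

```python
import re

# Source term -> coordinate string, copied from the InterPro table above.
HITS = {
    "PTHR21649": "coord: 55..336",
    "PF00504": "coord: 132..307",
    "G3DSA:1.10.3460.10": "coord: 129..332",
    "SSF103511": "coord: 114..334",
}

COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(coord_text):
    """Length of a hit, assuming 1-based inclusive start..end coordinates."""
    match = COORD.search(coord_text)
    if match is None:
        raise ValueError("no coordinates in %r" % coord_text)
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1


if __name__ == "__main__":
    for source_term, coord_text in HITS.items():
        print(source_term, span_length(coord_text))
```

Under that assumption, the PANTHER family hit (55..336) spans 282 residues and the Pfam domain (132..307) spans 176.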