CmaCh16G011040 (gene) Cucurbita maxima (Rimu) v1.1

Overview
NameCmaCh16G011040
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Descriptionlight-harvesting complex-like protein 3 isotype 1, chloroplastic
LocationCma_Chr16: 8495173 .. 8498340 (-)
RNA-Seq ExpressionCmaCh16G011040
SyntenyCmaCh16G011040
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCGGAAAAGTCCCCGAAGACGCTGCCACCGGCGAAAGGGATGAGAAAATTAGTGGAAAGCTGCGCCGTTCTCAGAGAGGAGGAGGACAATGACAACGTCCGGTTCGATTATCAATGGCGATATTCTGTTGGAAAGCGCGAAGCTGGAGGCAGTGGTGGCGACCTCCATGGCTATAAAACCTTGAAGAAGCAGAACGATAACCAAGCTGGAGGAGCGAAAATGGGAGATTTGTGGGCTTTTGAAGGTAATGGCTTAGGCCCAGCCCAAATCATTTTTTTTTCAAATATAGATTTTGTACCTATAAAAAAATTTAGACTTAAAAATAAATAAATTAATATCTAAATGAACCTAGCTTACATTTGAATGGAACCGATCTTAAGGAAAAGATTAAAATAACGGAAAGTTTCTATTTATACCAACAAGGTACACCGGTTCAATCACAGAAACTTCATGATTAAACGTACTCAGACCAATTCTATATTAAGTGATCTCTTATGAGTTTTTTAAAGATGTATGTGAGTGATGACAAAACATACTAGAAGAATTCGTGTTAATTGTGGGAATAGTTTTCCCTTCAGGGTATTATTAAAAAACTTTAAAATTCTAGAAAAAAGTTTATAAGGGTATTTTATTATTTTACACAAAATTAAACAATTTAGCTTCATCTTAATAATCATTACATATGAATTAAAAAAACATATTAAACAAGCTTGTGTTTAACCTTTTAATATAAAATTTAAAGAATTTAATAATATATTCCTTTCCACTTATTAATTTTAAAAAATATTTATATTATATTGTTTAAAATAAAAATTATTATTAGATTAAAATATCATTTTAATTTTTATAGATATCTAATTTTAGTTAAATAAATCTAAATTCAGTTTTATTTGATTTTTTTAGTAAAAAAAGTTCTTTATTTAACATTTTTTTTTAAATATATTCACCTAAGTATTTTAAGACTATTATTATTATTTAATAAATTTTGAAATACTAGATTTATTAGAATTAAATTTAATAATATTATTATTTAATCAACTTTATATATATATATTTATTTATTTATTTTTTAAGCAAGCACATTTATTATGGGCGAAAGTGCAAAGCTTCCCTATCCATCATAATTACAAGGCTGAGGATAGAGCTTTGAAGAGGAGGAGCTCTCTCTTCTTCTTCCTTCCACACCAAAACAATGTCTTCCATGGCTTTGTTTTCTCCTCCCTCCCGTCTTTCCTCTCTCTCTCCTTCTTCACATCACCGAACCCACTTCCCCTCCAGACCCTTTTCCTCTCTCAGGTCCAGAAACCCTTCTTCTTGGGTCGCTGCTGACAATGGCGCCGGAATTTCAGGTGGTTCCGTCGCTGTAGAACCCGCCGTCGTACAGAAGGATCCGGAGCCGGCGGTGGAAAAGAAAGAGAGCCTTGCTGAAACCAATGGGTCGGTGGCGGCTGTGGAAGAAGTGGTGGTGGTTAGCAAATTTGAAGACCCTAAATGGGTTAATGGGACTTGGGATTTGAATCAGTTCCGGAAAGATGGAAGTACTGATTGGGATGCTGTTATTGATGCTGGTAATCTTCGTAAACCCTACATTTTCTGTTGATTTTGCTTCTAATTGCATTACGAAATCTGGATGTACATAGATACATGCAATGTCTGAGAAGCTTTTTTGGGGAAATCTTGATCTGCAAGTTTTGATGATGTTTAGATTGTAATGGATTTCTTGGCGATGTTTGAGCAATTACTTGATGTTTGAATCAGAGGCTAGGAGGAGGAAATGGCTGGAAAACAATCCAGAATCATCAAGTAATGAGGACCCTGTGCTGTTTGACACATCCATAGTTCCTTGGTGGGCTTGGATTAAGCGCTACCATCTGCCAGAGGCTGAGATTCTCAATGGTATAAAACATGTTTCTATGATTGCTTATCTTGTTTTCTTAAAGCTTGTCTGTGTGAAATAATGGCTAAATGTGAATGTTTGAAAATCGTGTTTAGAAGTGTAAAATTGAATATTAATTTGATTTTGAATGTTTAAACACATGTTCTTGAGTGATTATGAACGTGAATGACTGATTTTAACCATTTGAAAATCACTCAAACGTATTTCTTGTTCTTTCGTTCATGTCTGTGAATCATGATTTCTACATGTTCATTGTATTGGCTTCTCTGTATGAAAATCATGTTTAAAAGTGTAAAATCATACATGGAAGTTGATTTTGAATGTTTAAACACATGTTCTTGAGTGATTTTGAGCATGTTCTTGAGTGATTTTGAACGTGAAGGACCGATTTTAACCATTTGAAAATCACTCGATCGTATTTTTTGTTCTTTCTTTCACGTCTGTGGATCATGATTTCTACTTGTTCATAACATTGGCTTCTCCATATGAAAATCACGTTTAAAAGCGTAAAATCATACATTAAGATGATTTTGAATGTTCAAACGCATGTTCTTGACGTGAATGACTGATTTTAACCATTTGAAAATCACTCAAACGTATTTTTTGATGATTTTGAATGTTCAAACGCATGTTCTTGAGTGATTTTGAACGTGAATGACTGATTTTAACCATTTGAAAATCACTCGATCATACTTTGTTCTTGAGTATTTCTACGTATAACGTTGGCTTCTCTGTAACTTACAAGTAATATTGTGTCAAGTAAAGTATTGAATCTACTATAACATTGAAGTAAATTTGAGCGGTGCAATGGTTTCGATGCAGGCCGTGCAGCCATGGTGGGGTTTTTCATGGCTTACTTCGTCGATAGCTTGACGGGGGTAGGACTAGTAGGTCAAATGGGCAACTTCTTCTGCAAAACTTTGTTATTTGTTGCAGTGGTGGGAGTTCTTTTGATCAGAAAGAATGAAGATATAGAGACTCTAAAGAAGTTGATTGATGAGACGACATTTTATGATAAACAATGGCAAGCAACTTGGCAGGATGAAACCAAAGGCTCAGGCAAAGTGTAATGTGGCAAAGTTGCCTGTGTTCTTGTTTTGCTTTAATTATGTTTGCATCTCATAAGATTTGGGCTTAGTAGAGAAGGATAATGTGAGCTCTGAAAGATATTGAACGTTTAGGCCTCAGTTTGTTAATGTTTCTTGAAGTTGTTGAGGGTTTCAATGAAAAATCTCTCTCTAAAAGGTTGGGTTTTCTGAAAAAAAAAA

mRNA sequence

ATGCCGGAAAAGTCCCCGAAGACGCTGCCACCGGCGAAAGGGATGAGAAAATTAGTGGAAAGCTGCGCCGTTCTCAGAGAGGAGGAGGACAATGACAACGTCCGGTTCGATTATCAATGGCGATATTCTGTTGGAAAGCGCGAAGCTGGAGGCAGTGGTGGCGACCTCCATGGCTATAAAACCTTGAAGAAGCAGAACGATAACCAAGCTGGAGGAGCGAAAATGGGAGATTTGTGGGCTTTTGAAGGAGGAGCTCTCTCTTCTTCTTCCTTCCACACCAAAACAATGTCTTCCATGGCTTTGTTTTCTCCTCCCTCCCGTCTTTCCTCTCTCTCTCCTTCTTCACATCACCGAACCCACTTCCCCTCCAGACCCTTTTCCTCTCTCAGGTCCAGAAACCCTTCTTCTTGGGTCGCTGCTGACAATGGCGCCGGAATTTCAGGTGGTTCCGTCGCTGTAGAACCCGCCGTCGTACAGAAGGATCCGGAGCCGGCGGTGGAAAAGAAAGAGAGCCTTGCTGAAACCAATGGGTCGGTGGCGGCTGTGGAAGAAGTGGTGGTGGTTAGCAAATTTGAAGACCCTAAATGGGTTAATGGGACTTGGGATTTGAATCAGTTCCGGAAAGATGGAAGTACTGATTGGGATGCTGTTATTGATGCTGAGGCTAGGAGGAGGAAATGGCTGGAAAACAATCCAGAATCATCAAGTAATGAGGACCCTGTGCTGTTTGACACATCCATAGTTCCTTGGTGGGCTTGGATTAAGCGCTACCATCTGCCAGAGGCTGAGATTCTCAATGGCCGTGCAGCCATGGTGGGGTTTTTCATGGCTTACTTCGTCGATAGCTTGACGGGGGTAGGACTAGTAGGTCAAATGGGCAACTTCTTCTGCAAAACTTTGTTATTTGTTGCAGTGGTGGGAGTTCTTTTGATCAGAAAGAATGAAGATATAGAGACTCTAAAGAAGTTGATTGATGAGACGACATTTTATGATAAACAATGGCAAGCAACTTGGCAGGATGAAACCAAAGGCTCAGGCAAAGTGTAATGTGGCAAAGTTGCCTGTGTTCTTGTTTTGCTTTAATTATGTTTGCATCTCATAAGATTTGGGCTTAGTAGAGAAGGATAATGTGAGCTCTGAAAGATATTGAACGTTTAGGCCTCAGTTTGTTAATGTTTCTTGAAGTTGTTGAGGGTTTCAATGAAAAATCTCTCTCTAAAAGGTTGGGTTTTCTGAAAAAAAAAA

Coding sequence (CDS)

ATGCCGGAAAAGTCCCCGAAGACGCTGCCACCGGCGAAAGGGATGAGAAAATTAGTGGAAAGCTGCGCCGTTCTCAGAGAGGAGGAGGACAATGACAACGTCCGGTTCGATTATCAATGGCGATATTCTGTTGGAAAGCGCGAAGCTGGAGGCAGTGGTGGCGACCTCCATGGCTATAAAACCTTGAAGAAGCAGAACGATAACCAAGCTGGAGGAGCGAAAATGGGAGATTTGTGGGCTTTTGAAGGAGGAGCTCTCTCTTCTTCTTCCTTCCACACCAAAACAATGTCTTCCATGGCTTTGTTTTCTCCTCCCTCCCGTCTTTCCTCTCTCTCTCCTTCTTCACATCACCGAACCCACTTCCCCTCCAGACCCTTTTCCTCTCTCAGGTCCAGAAACCCTTCTTCTTGGGTCGCTGCTGACAATGGCGCCGGAATTTCAGGTGGTTCCGTCGCTGTAGAACCCGCCGTCGTACAGAAGGATCCGGAGCCGGCGGTGGAAAAGAAAGAGAGCCTTGCTGAAACCAATGGGTCGGTGGCGGCTGTGGAAGAAGTGGTGGTGGTTAGCAAATTTGAAGACCCTAAATGGGTTAATGGGACTTGGGATTTGAATCAGTTCCGGAAAGATGGAAGTACTGATTGGGATGCTGTTATTGATGCTGAGGCTAGGAGGAGGAAATGGCTGGAAAACAATCCAGAATCATCAAGTAATGAGGACCCTGTGCTGTTTGACACATCCATAGTTCCTTGGTGGGCTTGGATTAAGCGCTACCATCTGCCAGAGGCTGAGATTCTCAATGGCCGTGCAGCCATGGTGGGGTTTTTCATGGCTTACTTCGTCGATAGCTTGACGGGGGTAGGACTAGTAGGTCAAATGGGCAACTTCTTCTGCAAAACTTTGTTATTTGTTGCAGTGGTGGGAGTTCTTTTGATCAGAAAGAATGAAGATATAGAGACTCTAAAGAAGTTGATTGATGAGACGACATTTTATGATAAACAATGGCAAGCAACTTGGCAGGATGAAACCAAAGGCTCAGGCAAAGTGTAA

Protein sequence

MPEKSPKTLPPAKGMRKLVESCAVLREEEDNDNVRFDYQWRYSVGKREAGGSGGDLHGYKTLKKQNDNQAGGAKMGDLWAFEGGALSSSSFHTKTMSSMALFSPPSRLSSLSPSSHHRTHFPSRPFSSLRSRNPSSWVAADNGAGISGGSVAVEPAVVQKDPEPAVEKKESLAETNGSVAAVEEVVVVSKFEDPKWVNGTWDLNQFRKDGSTDWDAVIDAEARRRKWLENNPESSSNEDPVLFDTSIVPWWAWIKRYHLPEAEILNGRAAMVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLKKLIDETTFYDKQWQATWQDETKGSGKV
Homology
BLAST of CmaCh16G011040 vs. ExPASy Swiss-Prot
Match: Q9SYX1 (Light-harvesting complex-like protein 3 isotype 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=LIL3.1 PE=1 SV=1)

HSP 1 Score: 292.4 bits (747), Expect = 7.0e-78
Identity = 159/265 (60.00%), Postives = 189/265 (71.32%), Query Frame = 0

Query: 99  MALFSPPSRLSSLSPSSHHRTHFPSRPFSSLRSRNPS----SWVAADNGAGISGGSVAVE 158
           MALFSPP   SSL     +    P   FS L S   S    +  ++D+G+     +V+VE
Sbjct: 1   MALFSPPISSSSL----QNPNFIPKFSFSLLSSNRFSLLSVTRASSDSGSTSPTAAVSVE 60

Query: 159 P----AVVQKDP---EPAVEKKESLAETNGSVAAVEEVVV--VSKFEDPKWVNGTWDLNQ 218
                 V+ K+P    PAV+K+E+    N +V   E      V KF+D +W+NGTWDL Q
Sbjct: 61  APEPVEVIVKEPPQSTPAVKKEETATAKNVAVEGEEMKTTESVVKFQDARWINGTWDLKQ 120

Query: 219 FRKDGSTDWDAVIDAEARRRKWLENNPESSSNEDPVLFDTSIVPWWAWIKRYHLPEAEIL 278
           F KDG TDWD+VI AEA+RRKWLE NPE++SN++PVLFDTSI+PWWAWIKRYHLPEAE+L
Sbjct: 121 FEKDGKTDWDSVIVAEAKRRKWLEENPETTSNDEPVLFDTSIIPWWAWIKRYHLPEAELL 180

Query: 279 NGRAAMVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLKKLID 338
           NGRAAM+GFFMAYFVDSLTGVGLV QMGNFFCKTLLFVAV GVL IRKNED++ LK L D
Sbjct: 181 NGRAAMIGFFMAYFVDSLTGVGLVDQMGNFFCKTLLFVAVAGVLFIRKNEDVDKLKNLFD 240

Query: 339 ETTFYDKQWQATWQ---DETKGSGK 348
           ETT YDKQWQA W+   DE+ GS K
Sbjct: 241 ETTLYDKQWQAAWKNDDDESLGSKK 261

BLAST of CmaCh16G011040 vs. ExPASy Swiss-Prot
Match: Q6NKS4 (Light-harvesting complex-like protein 3 isotype 2, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=LIL3.2 PE=1 SV=1)

HSP 1 Score: 271.9 bits (694), Expect = 9.8e-72
Identity = 145/251 (57.77%), Postives = 177/251 (70.52%), Query Frame = 0

Query: 98  SMALFSPPSRLSSLSPSSHHRTHFPSRPFSSLRSRNPSSWVAADNGAGISGGSVAV---- 157
           SMALFSPP   S  +P      +   +  +SL S    S ++    +  +G +  V    
Sbjct: 4   SMALFSPPISSSLQNP------NLIPKISTSLLSTKRFSLISVPRASSDNGTTSPVVEIP 63

Query: 158 EPAVVQKDPEPAVEKKE-SLAETNGSV---AAVEEVVVVSKFEDPKWVNGTWDLNQFRKD 217
           +PA V  +  P     E S A  NG+V   A       V K+++ KWVNGTWDL QF KD
Sbjct: 64  KPASVAVEEVPVKSPAESSSASENGAVGGEATDSSTETVIKYQNAKWVNGTWDLKQFEKD 123

Query: 218 GSTDWDAVIDAEARRRKWLENNPESSSNEDPVLFDTSIVPWWAWIKRYHLPEAEILNGRA 277
           G TDWD+VI +EA+RRKWLE+NPE++SN++ V+FDTSI+PWWAW+KRYHLPEAE+LNGRA
Sbjct: 124 GKTDWDSVIVSEAKRRKWLEDNPETTSNDELVVFDTSIIPWWAWMKRYHLPEAELLNGRA 183

Query: 278 AMVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLKKLIDETTF 337
           AM+GFFMAYFVDSLTGVGLV QMGNFFCKTLLFVAV GVL IRKNED++ LK L DETT 
Sbjct: 184 AMIGFFMAYFVDSLTGVGLVDQMGNFFCKTLLFVAVAGVLFIRKNEDLDKLKDLFDETTL 243

Query: 338 YDKQWQATWQD 341
           YDKQWQA W++
Sbjct: 244 YDKQWQAAWKE 248

BLAST of CmaCh16G011040 vs. TAIR 10
Match: AT4G17600.1 (Chlorophyll A-B binding family protein )

HSP 1 Score: 292.4 bits (747), Expect = 5.0e-79
Identity = 159/265 (60.00%), Postives = 189/265 (71.32%), Query Frame = 0

Query: 99  MALFSPPSRLSSLSPSSHHRTHFPSRPFSSLRSRNPS----SWVAADNGAGISGGSVAVE 158
           MALFSPP   SSL     +    P   FS L S   S    +  ++D+G+     +V+VE
Sbjct: 1   MALFSPPISSSSL----QNPNFIPKFSFSLLSSNRFSLLSVTRASSDSGSTSPTAAVSVE 60

Query: 159 P----AVVQKDP---EPAVEKKESLAETNGSVAAVEEVVV--VSKFEDPKWVNGTWDLNQ 218
                 V+ K+P    PAV+K+E+    N +V   E      V KF+D +W+NGTWDL Q
Sbjct: 61  APEPVEVIVKEPPQSTPAVKKEETATAKNVAVEGEEMKTTESVVKFQDARWINGTWDLKQ 120

Query: 219 FRKDGSTDWDAVIDAEARRRKWLENNPESSSNEDPVLFDTSIVPWWAWIKRYHLPEAEIL 278
           F KDG TDWD+VI AEA+RRKWLE NPE++SN++PVLFDTSI+PWWAWIKRYHLPEAE+L
Sbjct: 121 FEKDGKTDWDSVIVAEAKRRKWLEENPETTSNDEPVLFDTSIIPWWAWIKRYHLPEAELL 180

Query: 279 NGRAAMVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLKKLID 338
           NGRAAM+GFFMAYFVDSLTGVGLV QMGNFFCKTLLFVAV GVL IRKNED++ LK L D
Sbjct: 181 NGRAAMIGFFMAYFVDSLTGVGLVDQMGNFFCKTLLFVAVAGVLFIRKNEDVDKLKNLFD 240

Query: 339 ETTFYDKQWQATWQ---DETKGSGK 348
           ETT YDKQWQA W+   DE+ GS K
Sbjct: 241 ETTLYDKQWQAAWKNDDDESLGSKK 261

BLAST of CmaCh16G011040 vs. TAIR 10
Match: AT5G47110.1 (Chlorophyll A-B binding family protein )

HSP 1 Score: 271.9 bits (694), Expect = 6.9e-73
Identity = 145/251 (57.77%), Postives = 177/251 (70.52%), Query Frame = 0

Query: 98  SMALFSPPSRLSSLSPSSHHRTHFPSRPFSSLRSRNPSSWVAADNGAGISGGSVAV---- 157
           SMALFSPP   S  +P      +   +  +SL S    S ++    +  +G +  V    
Sbjct: 4   SMALFSPPISSSLQNP------NLIPKISTSLLSTKRFSLISVPRASSDNGTTSPVVEIP 63

Query: 158 EPAVVQKDPEPAVEKKE-SLAETNGSV---AAVEEVVVVSKFEDPKWVNGTWDLNQFRKD 217
           +PA V  +  P     E S A  NG+V   A       V K+++ KWVNGTWDL QF KD
Sbjct: 64  KPASVAVEEVPVKSPAESSSASENGAVGGEATDSSTETVIKYQNAKWVNGTWDLKQFEKD 123

Query: 218 GSTDWDAVIDAEARRRKWLENNPESSSNEDPVLFDTSIVPWWAWIKRYHLPEAEILNGRA 277
           G TDWD+VI +EA+RRKWLE+NPE++SN++ V+FDTSI+PWWAW+KRYHLPEAE+LNGRA
Sbjct: 124 GKTDWDSVIVSEAKRRKWLEDNPETTSNDELVVFDTSIIPWWAWMKRYHLPEAELLNGRA 183

Query: 278 AMVGFFMAYFVDSLTGVGLVGQMGNFFCKTLLFVAVVGVLLIRKNEDIETLKKLIDETTF 337
           AM+GFFMAYFVDSLTGVGLV QMGNFFCKTLLFVAV GVL IRKNED++ LK L DETT 
Sbjct: 184 AMIGFFMAYFVDSLTGVGLVDQMGNFFCKTLLFVAVAGVLFIRKNEDLDKLKDLFDETTL 243

Query: 338 YDKQWQATWQD 341
           YDKQWQA W++
Sbjct: 244 YDKQWQAAWKE 248

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9SYX17.0e-7860.00Light-harvesting complex-like protein 3 isotype 1, chloroplastic OS=Arabidopsis ... [more]
Q6NKS49.8e-7257.77Light-harvesting complex-like protein 3 isotype 2, chloroplastic OS=Arabidopsis ... [more]
Match NameE-valueIdentityDescription
AT4G17600.15.0e-7960.00Chlorophyll A-B binding family protein [more]
AT5G47110.16.9e-7357.77Chlorophyll A-B binding family protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR14154:SF87LIGHT-HARVESTING COMPLEX-LIKE PROTEIN 3 ISOTYPE 1, CHLOROPLASTIC-RELATEDcoord: 125..346
NoneNo IPR availablePANTHERPTHR14154UPF0041 BRAIN PROTEIN 44-RELATEDcoord: 125..346
NoneNo IPR availableSUPERFAMILY103511Chlorophyll a-b binding proteincoord: 201..297

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh16G011040.1CmaCh16G011040.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0009535 chloroplast thylakoid membrane
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0046872 metal ion binding