CmaCh12G008770 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh12G008770
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Chlorophyll A-B binding protein
Location: Cma_Chr12: 6615419 .. 6619640 (-)
RNA-Seq Expression: CmaCh12G008770
Synteny: CmaCh12G008770
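
The Location field can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state explicitly). The helper names `parse_location`, `span_length`, and `reverse_complement` are hypothetical and are not part of any database API.

```python
import re

def parse_location(text):
    """Split a string like 'Cma_Chr12: 6615419 .. 6619640 (-)' into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

def reverse_complement(seq):
    """Reverse-complement a DNA string (useful for minus-strand features)."""
    return seq.translate(str.maketrans("ACGTacgt", "TGCAtgca"))[::-1]

seqid, start, end, strand = parse_location("Cma_Chr12: 6615419 .. 6619640 (-)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 4222 bp on the minus strand of Cma_Chr12. Whether the sequences shown on this page are already given in gene orientation is not stated here, so `reverse_complement` is only needed when extracting the region directly from the chromosome sequence.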
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AAAACAAGCTCTGCATCAGCGATGAGAGCCACTACAAAACCGTCCCTCTCTGCAAGATAAGTGTGGATGTGGGCAACTGAAATCGGTTACTAATTCCATATAAAAATGCTTTATCTCACCAATAGATAAGGGTTGATGTCAAGAAACGCTGGTCTTTGGCGATGGCTTCCACTGCGCTGATTCTCCCCATCCATGGAGGAAACCGTACGTCTTCCCAATGCCTTTCTTTCCGCCATACCCACTCTTCTGCAACATTTTCCAGGTAATTTCATCTTCTTCCTCAAATACTTGTTTATCTCATGCGATTCTATTAGTATAATCGGATGTTCTAAGAGATTAAGAGATGATTGACTGATTTCTTGCGCCGATCGTCGGTGTATCTTCTTTATTAGGTATGAATTTCTTTGTTGAATTGGGGCTTTGTGTCTAGACTTCGTATGTTGTAGATGGCTTCTGTAAAACTTTACGAGCTACAGAGTGATAAATAAAAGCTTCTATTTTCTTCCTTTGAGGTGATTAGAGTTGGAACTGCTTGTCGATGATAAGCACTAAGATTTTGCAAAAGCTGGAGGTCCTTCTAAAATCCTTGTAAAATTTCATAAATAAAGCTAACTTAACTAATGATTCCAAGCAAGGGCGGATTAGAAGATGAATAACTCTCTGATCGAACTTGGAATCTATGATGGTCACTCTTAGATCCCAAGACGATTCACTCCAGAAGTAGCTTGATCAGACTACTCCAATGAATTGATTGTACTTTGAGAAAGAAACACAGTTTTTGTATTCCACCATGATTTCAATCTTGAATACTACATTCGTTTAAATAAGCAAAAAAGAGAATCCAAATATAATGCTAATTCAGTCGTTCAACACCTAATCTTTATTGCCACTCCATTCTTTTCTTAATTTGATACTTAAATCTTCTTTACCTTGCTTCTTGTAATTGCTCCTTTGAGTACATGCAATTGATTAGTGGTTAGATTCATATCAGTTGATGTCTTAGAATTTGTGTGTGTCGTGTTCTGAACATTAGTTAAGACAATTAGATTTGGCGCGTTCTGATTTAGGTCATTATGCTGTGAGAAGTTTGGTGCGTACTTTCATGAGTTAGCGACCTGGCCTAATGCTTAAGCTTCTTGAATATCTGGTATTTTCTCTCTATATATGTCTGGTTGTGAAGTTTTCTGTTTAAAACCTGGTGATGGAGCTTCACTCCTTGCCATGAAAACTTGCTCCGGCCAAAAACAAAGGTTTTGAGGTAATCTCTAGATAGGGAAATTTTTTGGTTCTACGGGAGTACGGATACTCATGTTTGAAACATGCTTACTTGTTACGACGTTTTCCATTCAATCTTGATCTAGATTAGGTTGTTGTTGTACTTTATTAGAAAGAATATGATCTCATTGAATTGAGTTCATATAGGATCAAATACTAACCACCAAGAAACATATAAAAAAGATCTAAATTAGTGTTAAAAACCATTACTGCTAGCTTGGTCGTCTCGTGTTGTGGTTCTTCACAGCTCTGTCTTACTGTAACCCTTTATTTATTCATTTACTTCATACTCCTCCTTTTGCTCTAAGCTCATTGCTAGTAGGTATTGTCCTCTTTGGGGTTTCTCTTTTGGACTTTCCCTCAAGGTTTTTAAAACGCATTTGCTAGAGAGAGGTTTCCACCTCCTTATAAGGAAGGTTTCGTTCCCCTCTCCAACTGATGTGGGATCTCAAAATCCACTCTCTTGGGGCCAGCATCCTTGCTGACACACCGCCCGATGTCTGACTCTGATATCATTTGTAACAGCCCAAGTTCACCGCTAACCGATATTGTCCTTTTTGGACTTACCCTTATGGGCTTACCTTCAAGGTTTTAAAACGTGTCCACTAGAGAGGGGTTTCTACACCCTTACGAGAAATGTTTCGTTCTCATCTCCAATCAATGCGGGATCTCACAATATTTCTTCTGTAAAAGGATCCCTTTTGTTCTCTAGTCGGTGGTTATAA
ATATAAAGCATACTGATGAAGGCTCTCCAGAATTTCGAGCTGAACTCACACACGTCCTACACTATTCTTGTTGAAAAAGATGAGGGACTTTCCTTAAATTTTTCTTATAGCTAAGACCTATATAGGCAATGAAATGAGTTCTAAACTACTGTCTCATTTACTTTTGAGCCACGTCTCCAAAAATATGCCTATTAATATTTACTAGATGTTCCAAATTATAGCCTTAAAGCTGAATTTATATATTTTTTTGCTCTTCTCTGTTAGTTGATGACTAGCAAAAGAACTTAAACAGTTGGATACAATTTTCGACACATTACATGGTGCATGATATTGTAGTAATAGTTTCAGTCGTTGGAAATGTTAGAACTCTACCTTTTGAAGTATTATTTGACAATTAGTTAGAATCCAAGACCTAGAAAGGAAATGAAACAGTATTCAACAATTTTTTCCCCATCTTGATCCTTGACCTTGGGTGTATTCTTTTTGTCCCTGCTAATTGATACATCGTTCTTGATAGTTTCTTTTCCTTCCAGGTGGGGTTGGAGTAGGGATCGAGACGTCGGAAATAGTACACACAGAACGAGGGGTCAAGCATTCCGAATTTTGGCTAACCCTAATGTACGCTTCAACTCAAGGCTTCTGCATGTTTAATGGCAATATATTTGGTTTGAATATGGTCAAAGATTTTGTAGGTATGTTATTAGAAATTAATTAGTGGATGCATATATTCAGAGAATTCACACCTGCCCCCCATTCGGATTTGAATAGAACCAATCTTTCAGCATCTCTTTATGATTGAAATGAATTTGATTGTGTTGTCTGTCTGATTCATCATAATTATCTTGATTGGTTTTTGTTTTGTTCTCTCAATGAAAACCATATTCTAAGTTGTTTATGGGCTTTTGTTCTGCAACAAACATATATCATGTCAATTGTTCAGGTCTCTCCCGGGAAAGATGACTTAATTAAGAAGGTGATTATGGTTGATCCTTTGGAGGCCAAACGTTTGGCTGCAAAAGAAATGGAAAAAATCAAAGCAAAAGAGAAGTTCAAGGTATGGGAAGGGATCATCAGTGATGCAAAACTTGTTTAATAAACCATATGGATATGCAGATAATGTTGTGAAATGTGATTGCAGAGACGACGTCAAATAGAAGCGATAAATGGAGCATGGGCAATGATTGGTCTGACGGCAGGGCTCATTGTTGAAGGTCAAACTGGAAAAGGCATTCTAGCACAGGTGAGAAGTTATAGCGTTTTACGTTCTTTTGTATTCGTGCCACTTCTTTAACATTCTTTTGAACAAATTGATTGGTTTAGAACATTTGTTTATTCCTATATGCCTCAGAGTGATAGCTTAAGAAAGTTAGCCCCAATGACTGTCTGGAGCAGTTCTTTAACTGCTTGTTAAATATGACAAGAAGTCATCGGTTCGGCCGCTTCATTTAGATAGGTTATTAACTACCAAATTTTATAAGGTTAGAACGTAGTTAGTGAGCGAACGGACTTGATTTTTACGAAGAAAGATACTAAAATAAAAACGAAAATATAATGATATAGTACCTTACAGTATAGAAACGAGACGACAACCTAAACATAGTTGAGTGGATAAAAAGTAACCTTAACATTACTTTTTTTTGTGAGGTGTTTGGCTTGAGAGACAATATGATTTTTAGAATGGAGAGGTCGTGTGACGAGGGCTGAGAGGAGGTGAGGTTTAATGGCTCATTGTGAGCTTCGGTCGCTTAGCTTTTCTTTAGTTATGATCTTGGTTTGATTTTGTTGGATTCGAGTCCTTTTCGATAGTCTGTTCTTTGAGATTCTCTTCTATCGAGCTTGTTTTTTGTATGCACTTGTCGTTTTTTCTAGATGAAAGTTTAGTTTTTTACCGAACAAAAACCTCTTACTCAATGTTGTTTTTCACTTATTTTCCAGTTGGCCGGCTACTTGGCCGCGGTTGTGAACTTCTTTGTACGGTAGACATCTTCAAGGGCAAAAGG
ACTTCTCCTTTGAATGGAAGAATTATTCTCTTCTCGACCATTGAAAGTTCGGTATTGCTTGTTCATCCTTTTATACGTATCAAAAAGGATCTTTCCATCCTAAGAAACTGAAGTCTGATTCTTTCCTCTTTTCAAGACTGATTTCTTCACATTTCATGTACAAAATTAACAATGTATTGAGATGGTTTGTTGAAATCGTTATCTTACCATCGAGCTGACTCT

mRNA sequence

AAAACAAGCTCTGCATCAGCGATGAGAGCCACTACAAAACCGTCCCTCTCTGCAAGATAAGTGTGGATGTGGGCAACTGAAATCGGTTACTAATTCCATATAAAAATGCTTTATCTCACCAATAGATAAGGGTTGATGTCAAGAAACGCTGGTCTTTGGCGATGGCTTCCACTGCGCTGATTCTCCCCATCCATGGAGGAAACCGTACGTCTTCCCAATGCCTTTCTTTCCGCCATACCCACTCTTCTGCAACATTTTCCAGGTGGGGTTGGAGTAGGGATCGAGACGTCGGAAATAGTACACACAGAACGAGGGGTCAAGCATTCCGAATTTTGGCTAACCCTAATGTCTCTCCCGGGAAAGATGACTTAATTAAGAAGGTGATTATGGTTGATCCTTTGGAGGCCAAACGTTTGGCTGCAAAAGAAATGGAAAAAATCAAAGCAAAAGAGAAGTTCAAGAGACGACGTCAAATAGAAGCGATAAATGGAGCATGGGCAATGATTGGTCTGACGGCAGGGCTCATTGTTGAAGGTCAAACTGGAAAAGGCATTCTAGCACAGGTGAGAAGTTATAGCGTTTTACTTGGCCGGCTACTTGGCCGCGGTTGTGAACTTCTTTGTACGGTAGACATCTTCAAGGGCAAAAGGACTTCTCCTTTGAATGGAAGAATTATTCTCTTCTCGACCATTGAAAGTTCGGTATTGCTTGTTCATCCTTTTATACGTATCAAAAAGGATCTTTCCATCCTAAGAAACTGAAGTCTGATTCTTTCCTCTTTTCAAGACTGATTTCTTCACATTTCATGTACAAAATTAACAATGTATTGAGATGGTTTGTTGAAATCGTTATCTTACCATCGAGCTGACTCT

Coding sequence (CDS)

ATGGCTTCCACTGCGCTGATTCTCCCCATCCATGGAGGAAACCGTACGTCTTCCCAATGCCTTTCTTTCCGCCATACCCACTCTTCTGCAACATTTTCCAGGTGGGGTTGGAGTAGGGATCGAGACGTCGGAAATAGTACACACAGAACGAGGGGTCAAGCATTCCGAATTTTGGCTAACCCTAATGTCTCTCCCGGGAAAGATGACTTAATTAAGAAGGTGATTATGGTTGATCCTTTGGAGGCCAAACGTTTGGCTGCAAAAGAAATGGAAAAAATCAAAGCAAAAGAGAAGTTCAAGAGACGACGTCAAATAGAAGCGATAAATGGAGCATGGGCAATGATTGGTCTGACGGCAGGGCTCATTGTTGAAGGTCAAACTGGAAAAGGCATTCTAGCACAGGTGAGAAGTTATAGCGTTTTACTTGGCCGGCTACTTGGCCGCGGTTGTGAACTTCTTTGTACGGTAGACATCTTCAAGGGCAAAAGGACTTCTCCTTTGAATGGAAGAATTATTCTCTTCTCGACCATTGAAAGTTCGGTATTGCTTGTTCATCCTTTTATACGTATCAAAAAGGATCTTTCCATCCTAAGAAACTGA

Protein sequence

MASTALILPIHGGNRTSSQCLSFRHTHSSATFSRWGWSRDRDVGNSTHRTRGQAFRILANPNVSPGKDDLIKKVIMVDPLEAKRLAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLIVEGQTGKGILAQVRSYSVLLGRLLGRGCELLCTVDIFKGKRTSPLNGRIILFSTIESSVLLVHPFIRIKKDLSILRN
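
The relationship between the coding sequence and the protein sequence can be checked with a small translation routine. This is a sketch that assumes the standard nuclear genetic code (the page does not name the translation table used); the function name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Codons containing characters other than A/C/G/T are reported as 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First six codons and last four codons of the CDS listed above.
print(translate("ATGGCTTCCACTGCGCTG"))  # MASTAL
print(translate("CTAAGAAACTGA"))        # LRN
```

The two fragments reproduce the start (MASTAL) and the end (LRN) of the protein sequence shown above; passing the full CDS string to `translate` would allow the whole protein to be compared in the same way.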
Homology
BLAST of CmaCh12G008770 vs. TAIR 10
Match: AT4G28025.2 (unknown protein. )

HSP 1 Score: 119.4 bits (298), Expect = 3.3e-27
Identity = 67/114 (58.77%), Positives = 84/114 (73.68%), Query Frame = 0

Query: 29  SATFSRWGWSRDRDVGNSTHRTRGQAFRILANPNVS----PGKDDLIKKVIMVDPLEAKR 88
           S++  R G  R +D     +R R    R+LANPNVS    PGK  + K+VIMVDPLEAKR
Sbjct: 23  SSSNGRKGLRRHQDAKLVGNRARVGVVRVLANPNVSPPPPPGKAKVKKEVIMVDPLEAKR 82

Query: 89  LAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLIVEGQTGKGILAQVRSY 139
           LA+K+ME+IK +EK +RRR+IEAINGAWA+IGL  GL++E QTGKGILAQ+  Y
Sbjct: 83  LASKQMEEIKGREKQQRRREIEAINGAWAIIGLMIGLVIEAQTGKGILAQLAGY 136

BLAST of CmaCh12G008770 vs. TAIR 10
Match: AT4G28025.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast thylakoid membrane; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 117.9 bits (294), Expect = 9.6e-27
Identity = 65/107 (60.75%), Positives = 80/107 (74.77%), Query Frame = 0

Query: 36  GWSRDRDVGNSTHRTRGQAFRILANPNVS----PGKDDLIKKVIMVDPLEAKRLAAKEME 95
           G  R +D     +R R    R+LANPNVS    PGK  + K+VIMVDPLEAKRLA+K+ME
Sbjct: 39  GLRRHQDAKLVGNRARVGVVRVLANPNVSPPPPPGKAKVKKEVIMVDPLEAKRLASKQME 98

Query: 96  KIKAKEKFKRRRQIEAINGAWAMIGLTAGLIVEGQTGKGILAQVRSY 139
           +IK +EK +RRR+IEAINGAWA+IGL  GL++E QTGKGILAQ+  Y
Sbjct: 99  EIKGREKQQRRREIEAINGAWAIIGLMIGLVIEAQTGKGILAQLAGY 145
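
The percentages in the HSP summary lines are the match counts divided by the alignment length. The sketch below recomputes them from a summary line; the function name `hsp_percentages` is hypothetical, and the regular expression only targets the line layout shown on this page, not BLAST output in general.

```python
import re

def hsp_percentages(summary_line):
    """Return (identity %, positives %) as two-decimal strings for an HSP summary line."""
    identity = re.search(r"Identity\s*=\s*(\d+)/(\d+)", summary_line)
    # 'Posi?tives' also accepts the misspelling 'Postives' seen in some page dumps.
    positives = re.search(r"Posi?tives\s*=\s*(\d+)/(\d+)", summary_line)
    if identity is None or positives is None:
        raise ValueError("not an HSP summary line")

    def pct(match):
        matched, aligned = int(match.group(1)), int(match.group(2))
        return f"{100.0 * matched / aligned:.2f}"

    return pct(identity), pct(positives)

line = "Identity = 67/114 (58.77%), Positives = 84/114 (73.68%), Query Frame = 0"
print(hsp_percentages(line))  # ('58.77', '73.68')
```

Applied to the second HSP (65/107 identical, 80/107 positive) the same calculation gives 60.75 and 74.77, matching the values reported above.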

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
AT4G28025.2 | 3.3e-27 | 58.77 | unknown protein.
AT4G28025.1 | 9.6e-27 | 60.75 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | PANTHER | PTHR37752:SF1 | OS02G0610700 PROTEIN | coord: 10..140
None | No IPR available | PANTHER | PTHR37752 | OS02G0610700 PROTEIN | coord: 10..140
None | No IPR available | SUPERFAMILY | 103511 | Chlorophyll a-b binding protein | coord: 95..135

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmaCh12G008770.1 | CmaCh12G008770.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0009507 | chloroplast
cellular_component | GO:0016021 | integral component of membrane
cellular_component | GO:0009579 | thylakoid
molecular_function | GO:0046872 | metal ion binding