CmaCh06G010920 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh06G010920
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Pentatricopeptide repeat-containing protein
Location: Cma_Chr06: 7371601 .. 7375403 (+)
RNA-Seq Expression: CmaCh06G010920
Synteny: CmaCh06G010920
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TCTTCCATTCAAAGTGAAGGGGTACTGCGCGATTAAATTCTCATTCGCGGCAAATCATTCTTCCCGAAGGCAGGATTTGATCGGTACGCGGTTCATGTTTCATCTTGTTGCTGCACTTCTTGATTTTTATGCGCGCATGTATTTTGGTGTATGGTCTTGATTTTTAAACCATGCGCCGTTAGCTTAGCTTCACAATCTTAATTTTGGTTTGAACATTGATCATGGTGCTGTGTACCTGTGACTTAGGGGTCTGCAACTTCGAAGTTATTTAAAGGCCTTTTCATCTCGTTTAGCTGATTGTTATGTTGGTTGTGATTTCTGGTGTTAAGTCATTTGATGGGTTAATTGCGGTTTGTGTTGTTTGGTGGATTATTTTTCTTGGATTTGTTTACTGGGATTTCAGGCCTGTGCGCTTTGGAGTTTCTACTTCTTTCTTTTTCTGTGTGTTTCTTCTGTGTTTGGTTGGATACTTCTTTTATGTGCATGCAAATCTCTTGTCATACCTTAAGGAGCCACTTGGTGGCGATTTCCAAATGAAATCTGGTTGTAGAAAACTCTATTCTTTAGTTGGGATTGGTAGCACAATTGGATACTTGGAGATGATTGGTTTAATAGACTCTTTCTCAGTAAGCTCTTTCCAAGGGTTTTTCTTATTATGAACATGTTGAGGATTATTGAGAGGAATAGTTTCACATTGACTAATTAAGAGGAAGATCATGAGTTTATAAGTAAGGAACACTATCTCCATTGGAATGAGGCCTTTTGGAGAAACCAAAAGTAAAACTATGAGAGCCTGTCTCAAAGTGGACAATATCATACCATTGTGGAGGGTTGTGGTTCCTAACATGGTGTCGGATTCATGCTCTTAACTTAGTCATGCTAATAGAATTCTCAAAGGTTGAACAAAGAAGATGTGAGCCTTGAATGTGTAGTCAAATGTGACTCATGTGTCGAACAAAGGGTGTATTTTGTTCGAGGGCTCAAGAGAAGGAGTTGAGCATTGATTAAGGGGAAGTTGTTCGAGAGCTCCAGAGAAAGAGGCCTCAGGGGAGGCTTTATGGTGTACTTTGTTCGAGGGGAGGAATGTTGAGGATTATTGAGAGGAATAGTCCCACCTTAGCTAATTAAGAGGAAAATCATGAGTTCATAAGTAAGGAACACTATCTCCATTGGAATGAAACCTTTTGGGGAAACCAGAAGCAAAGCCATAAGAGCTTGTGTTCAAAGTGGACGATATCATACGATATCATACCATTGTGGAAGTTCACGGTTCCTAACAGAACAATCAACGGTTAACTATAGAAAATCGCCGGAATATGGTATTTTAATTTGAGGATAAATCTTTTGAGTTCAAACTCAACAATATGTAAGTATGGGGGTTCGAACTTTTGATTCGTTGATATTTAATGTCTTGATTAGTTGAGTTATGTTCAAGTTGGCACTTGAGGATAAGTCTTCATGAAAGAAAAAACGGACAAGATGCCAGAGACTGTTATTTAAGTCGAGAGAATGACAAAATGGCATGGCTGCTTAAAATGCAAGAATTTTTTTAAAATAAATATTATTATTTTATATAATCTTTTGACAAACCAAACCTATGCAAGCCGCTGTTTAGGCATTGAATACTTGTGAGTTTATACAAGAAATGAACCGGGGGTGGTCCTTATCCAAATTGGTGTCGTATGTGCAAAAGCTACATGGAGGACCTTAAACTCTCTTAGTCTATTTTAAGCAATTTGGAAGGAGTGACTGTGAAGAATCTCCTTGATGTTTAGTATATTGGGACGTGTTAATTATTCTTAAGGCTTCAAGTTGAAATGTCTTTTTAAAGATTTATTACCTGCTTGCTTCTTTAATTTTGTCCCATTAGAGAATGACTTTAGAATCTCCTTTGACTTCTTCAGACTACTTATCTCACGTATCTCACAAGTCTTCACCTAGAAAGAAATTAAGAATCACTTTCATCTTGTGTTACCAGATGCTAATGGAGAGGCAGGGTG
AAGATGACGTCCATGACTGCACAATTAAGCTGGTTCTTTCTTATTTCCGTTCTTAATTTCTTATTTAATTTTTGTAAACAAGTCAATGAAAGAATTGTTAGATCTAAGGTATTGGTGTTCAATGACAGAGAGTAAATCCTAAAAAACGGAGAGACAAGGTGTACATTGGTTGTGGAGCTGGATTTGGAGGAGATAGGCCAACTGCAGCTCTTAAATTGCTTCAGAGGGTTAAAGACCTAAACTATCTCGTGCTTGAATGCCTAGCAGAACGCACTCTTGCTGATCGGCATCAAGCTATGTCGTCTGGTGGCGATGGATATGATTCAAGGAGTATGTCGATGGTTAATAATCATTTTTGGCTTACAATTTTTTTCATCATTCCTTTCTCATATTCAAAATTTCTTGAAAGTTCCTAATATTGTTAGGATGGTAGTTTTTTGTTAAGAAGCTGGTAAATGGTTCCTTAGATTCGTTGGTAGTATATTAGTATTACTTAAAATATTGAAGCTCATATCTATCTATTACATTAGCAAACAGTGACTTTTGTAGAAAATTGTTCTCTTTTGCACAACAATATTGTTAGTCCATGCAACGTCCAGATTGCAACCTGACACCTCATTTTGCCCTGGTATCCAGTTCAAAGATTGCTCTGAATCAAAGTATTTTGAAATGATGCAAACGGTGATGGCATTTCATACACTGATTTTAAGACATACTCTCATCTCATTTAATTGCTGGATCTGATGATATAAACCAGTTACCAAGAAGTTAGTTGGTCTTGTATAATACTGGCTGAGTTTTGTTGTTTTTTAAGCTAATTACTCTGAAAATTATCATTAACAGATTCTGCAAACTATAATGGGTCGAGCAAAAGAGAAGTAATAAATTGAAATATACGAGAATTTTACCCATGAGGTTTCTTCATGATCATTTATTGAACTCAATCTCCGTTTACGTTTTCTTAAAATTCTTCTCCATTAGTACCTTTGACGTTAAGTCAGTTTTGCGGGCTGTTCAGTGTTTTGTTCTTGACTACTATTTGAATTACTCTGTAGTTGCAGATTGGATGAAATTGCTTCTTCCTTTGGCTGTGAAAAGAAATATTTGCATAATTACCAACATGGGTGCAAGTAAGATATCAAATTCTGAATAAGTTTCAAATTTTTATTATGCTTAATTTCGAATGGTGGTTTGGTCTGATGGTTGTTGGTTAATTACTTATCTATATCTCTCCATTTCTTTCCTGGGTAGTGGATCCCCCTGGGGCTCAGCAAAATGTTATAGAAATAGCAAGCAGTCTGGGGTTGAGTGTTTCGGTTGCAGTTGCTTACGAGGTTTCAGTAAAAGAATCAGGTAATTGTACTTTACTTTTTTTAGAGGTTTGACTGGATTATCTAATCAGTTAGAACAGTTTCCTTGTCATCCCAAATGTTGGCCAAGTATACAAGTAAGTTAAACAACTTACATGCCAAATTTTGAGCACCTCAGGTCATGATAGAACACTATTATCCTCAACTGAAGTGCATTTTTCTTTAGCTTTATTGTCAGTTCCAAATAATGAAATTCCCCTCCAGCATATTAGCAGCAAAATTAAGTTTTTTTCTTTTTTCCAGTCAATGTTGTATAACATGACTTCACAAATTGTATATGGCAGCCACAAGACACTTTTCTTTTCCTCTTTTTTGTTCCTTAACGGCGTCTTACTTCTAAAGGTGGAGCTGATCGTTGGTTTAATCAATGAAAGGAATTAGCACATATCTGGGAGCAGCTCCTATCGTCAAGTGTCTGGAAAAGTACCATCCA

mRNA sequence

TCTTCCATTCAAAGTGAAGGGGTACTGCGCGATTAAATTCTCATTCGCGGCAAATCATTCTTCCCGAAGGCAGGATTTGATCGATGCTAATGGAGAGGCAGGGTGAAGATGACGTCCATGACTGCACAATTAAGCTGAGAGTAAATCCTAAAAAACGGAGAGACAAGGTGTACATTGGTTGTGGAGCTGGATTTGGAGGAGATAGGCCAACTGCAGCTCTTAAATTGCTTCAGAGGGTTAAAGACCTAAACTATCTCGTGCTTGAATGCCTAGCAGAACGCACTCTTGCTGATCGGCATCAAGCTATGTCGTCTGGTGGCGATGGATATGATTCAAGGATTGCAGATTGGATGAAATTGCTTCTTCCTTTGGCTGTGAAAAGAAATATTTGCATAATTACCAACATGGGTGCAATGGATCCCCCTGGGGCTCAGCAAAATGTTATAGAAATAGCAAGCAGTCTGGGGTTGAGTGTTTCGGTTGCAGTTGCTTACGAGGTTTCAGTAAAAGAATCAGGTGGAGCTGATCGTTGGTTTAATCAATGAAAGGAATTAGCACATATCTGGGAGCAGCTCCTATCGTCAAGTGTCTGGAAAAGTACCATCCA

Coding sequence (CDS)

ATGCTAATGGAGAGGCAGGGTGAAGATGACGTCCATGACTGCACAATTAAGCTGAGAGTAAATCCTAAAAAACGGAGAGACAAGGTGTACATTGGTTGTGGAGCTGGATTTGGAGGAGATAGGCCAACTGCAGCTCTTAAATTGCTTCAGAGGGTTAAAGACCTAAACTATCTCGTGCTTGAATGCCTAGCAGAACGCACTCTTGCTGATCGGCATCAAGCTATGTCGTCTGGTGGCGATGGATATGATTCAAGGATTGCAGATTGGATGAAATTGCTTCTTCCTTTGGCTGTGAAAAGAAATATTTGCATAATTACCAACATGGGTGCAATGGATCCCCCTGGGGCTCAGCAAAATGTTATAGAAATAGCAAGCAGTCTGGGGTTGAGTGTTTCGGTTGCAGTTGCTTACGAGGTTTCAGTAAAAGAATCAGGTGGAGCTGATCGTTGGTTTAATCAATGA

Protein sequence

MLMERQGEDDVHDCTIKLRVNPKKRRDKVYIGCGAGFGGDRPTAALKLLQRVKDLNYLVLECLAERTLADRHQAMSSGGDGYDSRIADWMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIEIASSLGLSVSVAVAYEVSVKESGGADRWFNQ
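
The protein sequence above can be reproduced from the coding sequence by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the function name `translate` and the constants are illustrative and not part of the database. Only the first 36 nucleotides of the CDS are used in the example call.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First 12 codons of the CDS listed above.
print(translate("ATGCTAATGGAGAGGCAGGGTGAAGATGACGTCCAT"))  # MLMERQGEDDVH
```

Passing the full coding sequence to the same function should yield the complete polypeptide, with the terminal TGA stop codon left out.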
Homology
BLAST of CmaCh06G010920 vs. TAIR 10
Match: AT1G01770.1 (unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1446 (InterPro:IPR010839); Has 1597 Blast hits to 1509 proteins in 306 species: Archae - 4; Bacteria - 843; Metazoa - 22; Fungi - 131; Plants - 31; Viruses - 0; Other Eukaryotes - 566 (source: NCBI BLink). )

HSP 1 Score: 200.7 bits (509), Expect = 8.6e-52
Identity = 94/135 (69.63%), Positives = 115/135 (85.19%), Query Frame = 0

Query: 13  DCTIKLRVNPKKRRDKVYIGCGAGFGGDRPTAALKLLQRVKDLNYLVLECLAERTLADRH 72
           DC I LR NPK+RR+ VY+GCGAGFGGDRP AALKLLQRV++LNYLVLECLAERTLADR 
Sbjct: 12  DCVINLRENPKRRRETVYVGCGAGFGGDRPLAALKLLQRVEELNYLVLECLAERTLADRW 71

Query: 73  QAMSSGGDGYDSRIADWMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIEIASSLGLSVS 132
            +M+SGG GYD R+++WM+LLLPLAV+R  CIITNMGA+DP GAQ+ V+E+A  LGL++S
Sbjct: 72  LSMASGGLGYDPRVSEWMQLLLPLAVERGTCIITNMGAIDPSGAQKKVLEVAGELGLTIS 131

Query: 133 VAVAYEVSVKESGGA 147
           VAVA+EV  +   G+
Sbjct: 132 VAVAHEVHFETGSGS 146
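
The percentages reported for this HSP follow directly from the match counts and the alignment length. The sketch below recomputes them; the helper name `percent` is hypothetical, and the counts (94 identical and 115 positive positions over 135 aligned positions) are taken from the HSP summary above.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / alignment_length, 2)


alignment_length = 135  # aligned positions in HSP 1
identity = percent(94, alignment_length)    # identical residues
positives = percent(115, alignment_length)  # identical or conservative substitutions

print(identity, positives)  # 69.63 85.19
```

Positives count every identical position plus every substitution shown as `+` in the alignment midline, so the positives percentage is never lower than the identity percentage.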

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
AT1G01770.1 | 8.6e-52 | 69.63 | unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1446...
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR010839 | Acyclic terpene utilisation | PFAM | PF07287 | AtuA | coord: 31..136; e-value: 1.0E-25; score: 90.2
None | No IPR available | PANTHER | PTHR47472 | PROPIONYL-COA CARBOXYLASE | coord: 3..145

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmaCh06G010920.1 | CmaCh06G010920.1 | mRNA