CmaCh16G005350 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh16G005350
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: UvrABC system protein C
Location: Cma_Chr16: 2737738 .. 2738316 (-)
RNA-Seq Expression: CmaCh16G005350
Synteny: CmaCh16G005350
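
The location string above can be split into its parts and used to derive the feature length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, though it agrees with the 192-residue protein listed on this page plus a stop codon). The names `parse_location` and `feature_length` are hypothetical helpers, not part of any database API.

```python
import re

# Pattern for strings such as "Cma_Chr16: 2737738 .. 2738316 (-)".
LOCATION_RE = re.compile(
    r"(?P<seqid>\S+):\s*(?P<start>\d+)\s*\.\.\s*(?P<end>\d+)\s*\((?P<strand>[+-])\)"
)


def parse_location(text):
    """Return (seqid, start, end, strand) from a location string."""
    match = LOCATION_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return (
        match["seqid"],
        int(match["start"]),
        int(match["end"]),
        match["strand"],
    )


def feature_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cma_Chr16: 2737738 .. 2738316 (-)")
print(seqid, strand, feature_length(start, end))
```

Under that assumption the gene spans 579 bp, that is, 193 codons including the stop.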
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGAAACTGTTCTCTGAAAGGAGTGGCCGCCGATTGCGAAAAGCCCATCAGAATCTTAACCGATTCCGGCAACATAATCAACTTCCATGGCCCTAAACAAGTCGATCAAATCCTCAAGAACTATCCTCCCGGCGTCTATGGCGTTTTCCGGCGCCCCAATCTCTCTTCGCCCTTGCCCATTTCGGAGCCCCTCGACGCCGGAAAATCCTACTTTCTCCTCCCGCTTTCCCGAGCCGCGGAGAAAGAGAGGTCCGATGCGGCGGGGGATCTGAGAACTGGGTCGGGGCTAGAAGTGCTTCCGACAGGTGGCGATGGCATTTGGAGGGTCAAATTAGTGATCGATACGAAGCAGTTGGGGGAAATTTTGGCAGAGGATGGGAACACAGAGGCGTTGATTGAGAGGATGAGAGCGGCGGCGGCGACGGCGGCGGTGCAGAGTCCACGGCGGGAGAAGATCGGAGGGTGGAAGCCGACATGGGGGAATTGGTCGAAGTTTTTTCCAATTGATGTTGGAAACAATAATAAAGCACAAATGAAGGATTTTCATTCTGGAAATGGGTGTTTATATGCCACATAA

mRNA sequence

ATGGGAAACTGTTCTCTGAAAGGAGTGGCCGCCGATTGCGAAAAGCCCATCAGAATCTTAACCGATTCCGGCAACATAATCAACTTCCATGGCCCTAAACAAGTCGATCAAATCCTCAAGAACTATCCTCCCGGCGTCTATGGCGTTTTCCGGCGCCCCAATCTCTCTTCGCCCTTGCCCATTTCGGAGCCCCTCGACGCCGGAAAATCCTACTTTCTCCTCCCGCTTTCCCGAGCCGCGGAGAAAGAGAGGTCCGATGCGGCGGGGGATCTGAGAACTGGGTCGGGGCTAGAAGTGCTTCCGACAGGTGGCGATGGCATTTGGAGGGTCAAATTAGTGATCGATACGAAGCAGTTGGGGGAAATTTTGGCAGAGGATGGGAACACAGAGGCGTTGATTGAGAGGATGAGAGCGGCGGCGGCGACGGCGGCGGTGCAGAGTCCACGGCGGGAGAAGATCGGAGGGTGGAAGCCGACATGGGGGAATTGGTCGAAGTTTTTTCCAATTGATGTTGGAAACAATAATAAAGCACAAATGAAGGATTTTCATTCTGGAAATGGGTGTTTATATGCCACATAA

Coding sequence (CDS)

ATGGGAAACTGTTCTCTGAAAGGAGTGGCCGCCGATTGCGAAAAGCCCATCAGAATCTTAACCGATTCCGGCAACATAATCAACTTCCATGGCCCTAAACAAGTCGATCAAATCCTCAAGAACTATCCTCCCGGCGTCTATGGCGTTTTCCGGCGCCCCAATCTCTCTTCGCCCTTGCCCATTTCGGAGCCCCTCGACGCCGGAAAATCCTACTTTCTCCTCCCGCTTTCCCGAGCCGCGGAGAAAGAGAGGTCCGATGCGGCGGGGGATCTGAGAACTGGGTCGGGGCTAGAAGTGCTTCCGACAGGTGGCGATGGCATTTGGAGGGTCAAATTAGTGATCGATACGAAGCAGTTGGGGGAAATTTTGGCAGAGGATGGGAACACAGAGGCGTTGATTGAGAGGATGAGAGCGGCGGCGGCGACGGCGGCGGTGCAGAGTCCACGGCGGGAGAAGATCGGAGGGTGGAAGCCGACATGGGGGAATTGGTCGAAGTTTTTTCCAATTGATGTTGGAAACAATAATAAAGCACAAATGAAGGATTTTCATTCTGGAAATGGGTGTTTATATGCCACATAA

Protein sequence

MGNCSLKGVAADCEKPIRILTDSGNIINFHGPKQVDQILKNYPPGVYGVFRRPNLSSPLPISEPLDAGKSYFLLPLSRAAEKERSDAAGDLRTGSGLEVLPTGGDGIWRVKLVIDTKQLGEILAEDGNTEALIERMRAAAATAAVQSPRREKIGGWKPTWGNWSKFFPIDVGNNNKAQMKDFHSGNGCLYAT
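
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch that assumes the standard nuclear genetic code and a CDS already given in sense orientation (as it is displayed on this page, starting with ATG); `translate` is a hypothetical helper written for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons and last six codons of the CDS listed above.
print(translate("ATGGGAAACTGTTCTCTGAAAGGAGTG"))
print(translate("TGTTTATATGCCACATAA"))
```

Passing the full CDS string to `translate` should reproduce the protein sequence if the assumptions above hold.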
Homology
BLAST of CmaCh16G005350 vs. TAIR 10
Match: AT3G61920.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-terminal protein myristoylation; LOCATED IN: cellular_component unknown; EXPRESSED IN: shoot, hypocotyl, root, egg cell; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G64700.1); Has 77 Blast hits to 77 proteins in 8 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 77; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 79.3 bits (194), Expect = 3.6e-15
Identity = 55/156 (35.26%), Positives = 77/156 (49.36%), Query Frame = 0

Query: 1   MGNCSLKGVAAD------CEKPIRILTDSGNIINFHGPKQVDQILKNYPPGV-YGVFRRP 60
           MGNC  KG           +  I+++T +G ++  H P   + I   +P  V +      
Sbjct: 1   MGNCVFKGNGGSRKLYDKDDSLIKVVTPNGGVMELHPPIFAEFITNEFPGHVIHDSLSLR 60

Query: 61  NLSSPLPISEPLDAGKSYFLLPL-SRAAEKERSDAAGDLRTGSGLE--------VLPTGG 120
           + S PL   E L  G  Y+LLPL S AA   + D++  L T   +          L  GG
Sbjct: 61  HSSPPLLHGEELFPGNIYYLLPLSSSAAATAQLDSSDQLSTPYRMSFGKTPIMAALSGGG 120

Query: 121 DGIWRVKLVIDTKQLGEILAEDGNTEALIERMRAAA 141
            G+W+V+LVI  +QL EILAED  TEAL+E +R  A
Sbjct: 121 CGVWKVRLVISPEQLAEILAEDVETEALVESVRTVA 156

BLAST of CmaCh16G005350 vs. TAIR 10
Match: AT4G10910.1 (unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 58.5 bits (140), Expect = 6.7e-09
Identity = 47/147 (31.97%), Positives = 66/147 (44.90%), Query Frame = 0

Query: 1   MGNCSLKGVAADCEKPIRILTDSGNIINFHGPKQVDQILKNYPPGVYGVFRRPNLSSPLP 60
           MGNCS + V+        I T+S NI+                      + R    S LP
Sbjct: 1   MGNCSQRAVSDGGGSVTVIATNSRNILE--------------------EYYRSRRCSSLP 60

Query: 61  ISEPLDAGKSYFLLPLSRAAEKERSDAAGDLRTGSGLEVLPTGGDGIWRVKLVIDTKQLG 120
           I+     G             +  +   G L  G  ++V P   +G+W+ K+VI +KQL 
Sbjct: 61  ITREKTLG---------YLVRQGTTSPRGVL--GPRIQVSPQRRNGVWKAKVVIGSKQLE 116

Query: 121 EILAEDGNTEALIERMRAAAATAAVQS 148
           EILA +GNT ALI+++R AAA A V S
Sbjct: 121 EILAVEGNTHALIDQLRFAAAEALVSS 116

BLAST of CmaCh16G005350 vs. TAIR 10
Match: AT1G64700.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-terminal protein myristoylation; LOCATED IN: cellular_component unknown; EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 11 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G61920.1); Has 48 Blast hits to 47 proteins in 7 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 48; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 43.1 bits (100), Expect = 2.9e-04
Identity = 22/47 (46.81%), Positives = 33/47 (70.21%), Query Frame = 0

Query: 107 IWRVKLVIDTKQLGEILAEDGNTEALIERMRAAA--ATAAVQSPRRE 152
           IW+V L+I+T++L +IL+EDG T  LIE +RA A   T+++ S   E
Sbjct: 149 IWKVNLIINTEELLQILSEDGRTNELIESVRAVAKGETSSITSSSSE 195

The following BLAST results are available for this feature:
Match Name   E-value  Identity (%)  Description
AT3G61920.1  3.6e-15  35.26         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-termin...
AT4G10910.1  6.7e-09  31.97         unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae -...
AT1G64700.1  2.9e-04  46.81         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-termin...
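
The identity and positives percentages reported for each HSP are simple ratios of matching columns to alignment length. As a small illustration (the helper name `percent` is hypothetical), the reported figures can be recomputed from the counts given in the alignments above:

```python
def percent(matches, alignment_length):
    """Format matches / alignment_length as a percentage with two decimals."""
    return f"{100 * matches / alignment_length:.2f}"


# (identical, positive, alignment length) taken from the three HSPs above.
hsps = {
    "AT3G61920.1": (55, 77, 156),
    "AT4G10910.1": (47, 66, 147),
    "AT1G64700.1": (22, 33, 47),
}

for name, (identical, positive, length) in hsps.items():
    print(name, percent(identical, length), percent(positive, length))
```

Note that the highest identity (AT1G64700.1) comes from the shortest alignment, which is why its E-value is the weakest of the three.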
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term   IPR Description                             Source   Source Term     Source Description                         Alignment
IPR025322  Protein of unknown function DUF4228, plant  PFAM     PF14009         DUF4228                                    coord: 1..81; e-value: 2.4E-12; score: 47.6
None       No IPR available                            PANTHER  PTHR33148       PLASTID MOVEMENT IMPAIRED PROTEIN-RELATED  coord: 1..164
None       No IPR available                            PANTHER  PTHR33148:SF33  DUF4228 DOMAIN PROTEIN                     coord: 1..164

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh16G005350.1  CmaCh16G005350.1  mRNA