CmaCh16G006040.1 (mRNA) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh16G006040.1
Type: mRNA
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: DUF4228 domain-containing protein
Location: Cma_Chr16: 3114445 .. 3115737 (+)
Sequence length: 828
RNA-Seq Expression: CmaCh16G006040.1
Synteny: CmaCh16G006040.1
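As a quick consistency check on the overview fields, the genomic span can be compared with the reported sequence length. The sketch below assumes the location uses 1-based inclusive coordinates (GFF-style) and that the sequence length of 828 refers to the spliced transcript; both are assumptions rather than statements made by the record, and the function and variable names are illustrative.

```python
def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval (GFF-style coordinates assumed)."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


# Values copied from the overview fields of this record.
GENE_START, GENE_END = 3114445, 3115737
SPLICED_LENGTH = 828  # assumed to be the spliced mRNA length

gene_span = span_length(GENE_START, GENE_END)
non_exonic = gene_span - SPLICED_LENGTH  # bases in the span not in the transcript

print(f"genomic span: {gene_span} bp")
print(f"non-exonic bases within the span: {non_exonic} bp")
```

Under these assumptions the span is 1293 bp, which leaves 465 bp of intronic sequence between the two annotated exons.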
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCAAGCAAAAAGGTAACAACTTATAGCATATAACTATCGAGCAAGGAAAAGTTGACAATGACATAGATGAATAAGATATACCAATCCAAACTAAGAATTATTTATTTATTTTTTATCCATTCGGGAAGTAAAGATTATAGCATTTGAACTAAAAGTTATAAAAAGAAAGGAAATAAAATGAATTAAATAGTCTAAGACCTTAGGGTGGGGAAAAAAATATGCATGAAAACAGTTGGAGAAGCAAATAAAATAAATGTGAGTTTCCGCGTAAGAACATGAATGAACATGAAAGAACAAATCCAGAACAAATCCAGCCATCCATCGCCACTACAAAACACCAACATAATTTTACTCCAATCCCAATCCCACTCTTTGTTCTTCCTTTCTTCTTTTCGCATCTTTCTCTTTGTTTCCACTATCATAATCCTTATCTTCTACCTCTATAAATCCATCGCCATTCATTTTCTTAACGAAAAGGGGTAGAAACAACCTTCTCCACCGACACCCATTTTCCCGATTCCCCTTCCCTCCGATTTATACCCCAAAAAAGGCCTTCAATCTCACAGAACAGAGAAGAACAGAGCAGAACAGAGTAGAACAGAGCTTCTCTGTGTTTATCGGAATGGGGAACTGCTTATTCGGCGGTGGGTCTGGTGAGATTCAGGGGAAAATCAAGGTAATCACGTCCAACGGTGGGATTATGGAGTTGGGTTCTCCGATTACCGTCGGGTGCATCGCCGACGAGTTTCCGGGATATGGAATATTCAAAAGCCACGATCTTTTTTGGAACCCATTACCGCACAACGAGGAGCTGCTTCCGGGGAAATCCTACTACTTGCTTCAGAGAAACAGGGGAAGAAACAGAGGAGAGACAGAAGAAGGGGAAATGGGAATGATAAGGGCGCGTGAGGGGCACGTGAGGTCGAATAGTGTACCGGAGGCGACGGCGGCGGCGGCGGGTATGGCGTCGTATAGAATGTCGTTTGATTATCAGGGGGTTTTGAGGAGGTCGCAGACGGAGGTTTTTTCGAGGAGCAGTGAGAAGAACGGCGGGGTTTGGAAGGTGAAATTGGTGATTAGTCCAAAGCGATTAGTGGAGATTTTGGAGGAGGAAGGTCACACTCAGGAGTTGATTGAGAGCGTAAGGACTGTGGCTAAATGTGGAAGTACCAGCACGAGCAGTAGCTTTTCGTCGTCCATGGCGTTTTCCGATCACTGGAGTTTGTCCTCCACCACCGCCAATGCTACTCCGAGCGCTTCCGCCAAAAGTGGCGGCTTGCTGGAGATCTAA

mRNA sequence

ATGCAAGCAAAAAGGGGTAGAAACAACCTTCTCCACCGACACCCATTTTCCCGATTCCCCTTCCCTCCGATTTATACCCCAAAAAAGGCCTTCAATCTCACAGAACAGAGAAGAACAGAGCAGAACAGAGTAGAACAGAGCTTCTCTGTGTTTATCGGAATGGGGAACTGCTTATTCGGCGGTGGGTCTGGTGAGATTCAGGGGAAAATCAAGGTAATCACGTCCAACGGTGGGATTATGGAGTTGGGTTCTCCGATTACCGTCGGGTGCATCGCCGACGAGTTTCCGGGATATGGAATATTCAAAAGCCACGATCTTTTTTGGAACCCATTACCGCACAACGAGGAGCTGCTTCCGGGGAAATCCTACTACTTGCTTCAGAGAAACAGGGGAAGAAACAGAGGAGAGACAGAAGAAGGGGAAATGGGAATGATAAGGGCGCGTGAGGGGCACGTGAGGTCGAATAGTGTACCGGAGGCGACGGCGGCGGCGGCGGGTATGGCGTCGTATAGAATGTCGTTTGATTATCAGGGGGTTTTGAGGAGGTCGCAGACGGAGGTTTTTTCGAGGAGCAGTGAGAAGAACGGCGGGGTTTGGAAGGTGAAATTGGTGATTAGTCCAAAGCGATTAGTGGAGATTTTGGAGGAGGAAGGTCACACTCAGGAGTTGATTGAGAGCGTAAGGACTGTGGCTAAATGTGGAAGTACCAGCACGAGCAGTAGCTTTTCGTCGTCCATGGCGTTTTCCGATCACTGGAGTTTGTCCTCCACCACCGCCAATGCTACTCCGAGCGCTTCCGCCAAAAGTGGCGGCTTGCTGGAGATCTAA

Coding sequence (CDS)

ATGCAAGCAAAAAGGGGTAGAAACAACCTTCTCCACCGACACCCATTTTCCCGATTCCCCTTCCCTCCGATTTATACCCCAAAAAAGGCCTTCAATCTCACAGAACAGAGAAGAACAGAGCAGAACAGAGTAGAACAGAGCTTCTCTGTGTTTATCGGAATGGGGAACTGCTTATTCGGCGGTGGGTCTGGTGAGATTCAGGGGAAAATCAAGGTAATCACGTCCAACGGTGGGATTATGGAGTTGGGTTCTCCGATTACCGTCGGGTGCATCGCCGACGAGTTTCCGGGATATGGAATATTCAAAAGCCACGATCTTTTTTGGAACCCATTACCGCACAACGAGGAGCTGCTTCCGGGGAAATCCTACTACTTGCTTCAGAGAAACAGGGGAAGAAACAGAGGAGAGACAGAAGAAGGGGAAATGGGAATGATAAGGGCGCGTGAGGGGCACGTGAGGTCGAATAGTGTACCGGAGGCGACGGCGGCGGCGGCGGGTATGGCGTCGTATAGAATGTCGTTTGATTATCAGGGGGTTTTGAGGAGGTCGCAGACGGAGGTTTTTTCGAGGAGCAGTGAGAAGAACGGCGGGGTTTGGAAGGTGAAATTGGTGATTAGTCCAAAGCGATTAGTGGAGATTTTGGAGGAGGAAGGTCACACTCAGGAGTTGATTGAGAGCGTAAGGACTGTGGCTAAATGTGGAAGTACCAGCACGAGCAGTAGCTTTTCGTCGTCCATGGCGTTTTCCGATCACTGGAGTTTGTCCTCCACCACCGCCAATGCTACTCCGAGCGCTTCCGCCAAAAGTGGCGGCTTGCTGGAGATCTAA

Protein sequence

MQAKRGRNNLLHRHPFSRFPFPPIYTPKKAFNLTEQRRTEQNRVEQSFSVFIGMGNCLFGGGSGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLQRNRGRNRGETEEGEMGMIRAREGHVRSNSVPEATAAAAGMASYRMSFDYQGVLRRSQTEVFSRSSEKNGGVWKVKLVISPKRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDHWSLSSTTANATPSASAKSGGLLEI
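
The relationship between the coding sequence and the protein sequence above can be spot-checked with a small translation routine. This is a minimal sketch that assumes the standard nuclear genetic code; the `translate` helper is written for illustration and is not part of the database. It translates only the first 24 and last 12 bases of the CDS shown above, and checks that an 828-base CDS with one stop codon corresponds to 275 residues.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 24 and last 12 bases of the CDS listed in this record.
cds_start = "ATGCAAGCAAAAAGGGGTAGAAAC"
cds_end = "CTGGAGATCTAA"

print(translate(cds_start))  # N-terminal residues
print(translate(cds_end))    # C-terminal residues followed by the stop

# 828 bases = 276 codons; dropping the stop codon leaves the residue count.
CDS_LENGTH = 828
protein_length = CDS_LENGTH // 3 - 1
print(protein_length)
```

The two fragments translate to `MQAKRGRN` and `LEI*`, matching the start and end of the protein sequence, and the expected length of 275 residues agrees with the last coordinate reported in the InterPro section.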
Homology
BLAST of CmaCh16G006040.1 vs. TAIR 10
Match: AT1G64700.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-terminal protein myristoylation; LOCATED IN: cellular_component unknown; EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 11 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G61920.1); Has 48 Blast hits to 47 proteins in 7 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 48; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 135.2 bits (339), Expect = 8.0e-32
Identity = 91/209 (43.54%), Positives = 120/209 (57.42%), Query Frame = 0

Query: 54  MGNCLFGG-GSGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLP 113
           MGNCLFGG G  E    IKVI S+GG++E  SP+T G ++  F G+ +F + DL W PL 
Sbjct: 1   MGNCLFGGLGDEEEDLLIKVIKSDGGVLEFYSPVTAGFVSHGFSGHALFSAVDLLWKPLA 60

Query: 114 HNEELLPGKSYYLLQRNRGRNRGETEEGEMGMIRAREGHVRSNSVPEATAAAAGMASYRM 173
           H+  L+PG+SYYL   N   +  +T  G          HVRSNS      + + +  YRM
Sbjct: 61  HDHLLVPGQSYYLFP-NIVSDELKTFVGSC--------HVRSNS-----ESLSAITPYRM 120

Query: 174 SFDY-QGVLRRSQTEVFSRSS----------------EKNGGVWKVKLVISPKRLVEILE 233
           S DY   VL+RS T+VFSR+S                   G +WKV L+I+ + L++IL 
Sbjct: 121 SLDYNHRVLKRSYTDVFSRNSHIRTRQKEKKTRRRRTSSKGAIWKVNLIINTEELLQILS 180

Query: 234 EEGHTQELIESVRTVAKCGSTSTSSSFSS 245
           E+G T ELIESVR VAK G TS+ +S SS
Sbjct: 181 EDGRTNELIESVRAVAK-GETSSITSSSS 194

BLAST of CmaCh16G006040.1 vs. TAIR 10
Match: AT3G61920.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-terminal protein myristoylation; LOCATED IN: cellular_component unknown; EXPRESSED IN: shoot, hypocotyl, root, egg cell; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G64700.1); Has 77 Blast hits to 77 proteins in 8 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 77; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 105.9 bits (263), Expect = 5.2e-23
Identity = 84/214 (39.25%), Positives = 109/214 (50.93%), Query Frame = 0

Query: 54  MGNCLF--GGGSGEIQGK----IKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLF 113
           MGNC+F   GGS ++  K    IKV+T NGG+MEL  PI    I +EFPG+ I  S  L 
Sbjct: 1   MGNCVFKGNGGSRKLYDKDDSLIKVVTPNGGVMELHPPIFAEFITNEFPGHVIHDSLSLR 60

Query: 114 WN--PLPHNEELLPGKSYYLLQRNRGRNRGETEEGEMGMIRAREGHVRSNSVPEATAAAA 173
            +  PL H EEL PG  YYLL                         + S++   A   ++
Sbjct: 61  HSSPPLLHGEELFPGNIYYLLP------------------------LSSSAAATAQLDSS 120

Query: 174 GMAS--YRMSFDYQGVLRRSQTEVFSRSSEKNGGVWKVKLVISPKRLVEILEEEGHTQEL 233
              S  YRMSF         +T + +  S    GVWKV+LVISP++L EIL E+  T+ L
Sbjct: 121 DQLSTPYRMSF--------GKTPIMAALSGGGCGVWKVRLVISPEQLAEILAEDVETEAL 180

Query: 234 IESVRTVAKCGSTSTSSSFSSSMAFSDHWSLSST 258
           +ESVRTVAKCG          S A SD  S++S+
Sbjct: 181 VESVRTVAKCGGYGCGGGV-HSRANSDQLSVTSS 181
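
The percentages in the two HSP summaries above are the match counts divided by the alignment length (which includes gap columns). The following sketch recomputes them from the counts given in the record; the `percent` helper and the dictionary layout are illustrative choices, not part of any BLAST tool.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Share of alignment columns that match, as a percentage with 2 decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / alignment_length, 2)


# (identical, positive, alignment length) copied from the HSP summaries.
hsps = {
    "AT1G64700.1": (91, 120, 209),
    "AT3G61920.1": (84, 109, 214),
}

for name, (identical, positive, length) in hsps.items():
    print(
        f"{name}: identity {percent(identical, length)}%, "
        f"positives {percent(positive, length)}%"
    )
```

The recomputed values (43.54% and 57.42% for AT1G64700.1, 39.25% and 50.93% for AT3G61920.1) agree with the figures reported in the alignments.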

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
AT1G64700.1 | 8.0e-32 | 43.54 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-termin...
AT3G61920.1 | 5.2e-23 | 39.25 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-termin...
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR025322 | Protein of unknown function DUF4228, plant | PFAM | PF14009 | DUF4228 | coord: 54..216; e-value: 1.9E-16; score: 61.0
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 256..275
None | No IPR available | PANTHER | PTHR33148:SF48 | DUF4228 DOMAIN PROTEIN | coord: 54..259
None | No IPR available | PANTHER | PTHR33148 | PLASTID MOVEMENT IMPAIRED PROTEIN-RELATED | coord: 54..259

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name | Unique Name | Type
CmaCh16G006040 | CmaCh16G006040 | gene


The following exon feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CmaCh16G006040.1:exon:149 | CmaCh16G006040.1:exon:149 | exon
CmaCh16G006040.1:exon:150 | CmaCh16G006040.1:exon:150 | exon


The following CDS feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CmaCh16G006040.1:cds | CmaCh16G006040.1:cds | CDS
CmaCh16G006040.1:cds | CmaCh16G006040.1:cds_2 | CDS


The following polypeptide feature(s) derive from this mRNA:

Feature Name | Unique Name | Type
CmaCh16G006040.1 | CmaCh16G006040.1-protein | polypeptide