CmoCh16G006580.1 (mRNA) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh16G006580.1
Type: mRNA
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: DUF4228 domain-containing protein
Location: Cmo_Chr16: 3250160 .. 3250834 (+)
Sequence length: 675
RNA-Seq Expression: CmoCh16G006580.1
Synteny: CmoCh16G006580.1
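
The reported sequence length can be cross-checked against the genomic location. The sketch below assumes the coordinates are 1-based and inclusive at both ends; the helper name `feature_length` is hypothetical and is not part of the database.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1

# Coordinates taken from the Location field (Cmo_Chr16, + strand).
length = feature_length(3250160, 3250834)
codons = length // 3          # codon count, including the terminal stop codon
residues = codons - 1         # amino acids once the stop codon is excluded

print(length, codons, residues)  # prints: 675 225 224
```

Under that assumption, the 675 nt span corresponds to 225 codons, which agrees with the 224-residue protein reported in the BLAST alignments.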
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGGAACTGCTTATTCGGCGGTGGGTCTGGTGAGATTCAGGGGAAAATCAAGGTAATCACGTCCAATGGCGGGATTATGGAGTTGGGTTCTCCGATTACCGTCGGGTGCATCGCCGATGAGTTTCCTGGCTATGGAATATTCAAAAGCCAAGATCTTTTTTGGAACCCATTACCGCACAACGAGGAGCTGCTTCCGGGGAAATCCTACTACTTGCTTCAGAGAAACAGGGGAAGAAACAGAGGAGAGACAGAAGAAGGGGAAATGGGAATGATAAGGGCGCGTGAGGGGCACGTGAGGTCGAATAGTGTACCGGAGGCGACGGCGGCGGCGGCGGCGGGTATGGCGCCGTACAGAATGTCGTTTGATTATCAGGGGGTTTTGAGGAGGTCGCAGACGGAGGTTTTTTCGAGGAGCAGTGAGAAGAACGGCGGGGGGGTTTGGAAGGTGAAATTGGTGATTAGTCCGAAGCGATTGGTGGAGATTTTGGAGGAGGAAGGTCACACTCAGGAGTTGATTGAGAGCGTAAGGACTGTGGCTAAATGTGGAAGTACTAGCACCAGCAGCAGCTTTTCGTCGTCCATGGCGTTTTCCGATCACTGGAGTTTGTCCTCCACCACCGCCAATGCTACTCCGAGCGCTTCCTCCAAAAGTGGCGGCTTGCCGGAGATCTAA

mRNA sequence

ATGGGGAACTGCTTATTCGGCGGTGGGTCTGGTGAGATTCAGGGGAAAATCAAGGTAATCACGTCCAATGGCGGGATTATGGAGTTGGGTTCTCCGATTACCGTCGGGTGCATCGCCGATGAGTTTCCTGGCTATGGAATATTCAAAAGCCAAGATCTTTTTTGGAACCCATTACCGCACAACGAGGAGCTGCTTCCGGGGAAATCCTACTACTTGCTTCAGAGAAACAGGGGAAGAAACAGAGGAGAGACAGAAGAAGGGGAAATGGGAATGATAAGGGCGCGTGAGGGGCACGTGAGGTCGAATAGTGTACCGGAGGCGACGGCGGCGGCGGCGGCGGGTATGGCGCCGTACAGAATGTCGTTTGATTATCAGGGGGTTTTGAGGAGGTCGCAGACGGAGGTTTTTTCGAGGAGCAGTGAGAAGAACGGCGGGGGGGTTTGGAAGGTGAAATTGGTGATTAGTCCGAAGCGATTGGTGGAGATTTTGGAGGAGGAAGGTCACACTCAGGAGTTGATTGAGAGCGTAAGGACTGTGGCTAAATGTGGAAGTACTAGCACCAGCAGCAGCTTTTCGTCGTCCATGGCGTTTTCCGATCACTGGAGTTTGTCCTCCACCACCGCCAATGCTACTCCGAGCGCTTCCTCCAAAAGTGGCGGCTTGCCGGAGATCTAA

Coding sequence (CDS)

ATGGGGAACTGCTTATTCGGCGGTGGGTCTGGTGAGATTCAGGGGAAAATCAAGGTAATCACGTCCAATGGCGGGATTATGGAGTTGGGTTCTCCGATTACCGTCGGGTGCATCGCCGATGAGTTTCCTGGCTATGGAATATTCAAAAGCCAAGATCTTTTTTGGAACCCATTACCGCACAACGAGGAGCTGCTTCCGGGGAAATCCTACTACTTGCTTCAGAGAAACAGGGGAAGAAACAGAGGAGAGACAGAAGAAGGGGAAATGGGAATGATAAGGGCGCGTGAGGGGCACGTGAGGTCGAATAGTGTACCGGAGGCGACGGCGGCGGCGGCGGCGGGTATGGCGCCGTACAGAATGTCGTTTGATTATCAGGGGGTTTTGAGGAGGTCGCAGACGGAGGTTTTTTCGAGGAGCAGTGAGAAGAACGGCGGGGGGGTTTGGAAGGTGAAATTGGTGATTAGTCCGAAGCGATTGGTGGAGATTTTGGAGGAGGAAGGTCACACTCAGGAGTTGATTGAGAGCGTAAGGACTGTGGCTAAATGTGGAAGTACTAGCACCAGCAGCAGCTTTTCGTCGTCCATGGCGTTTTCCGATCACTGGAGTTTGTCCTCCACCACCGCCAATGCTACTCCGAGCGCTTCCTCCAAAAGTGGCGGCTTGCCGGAGATCTAA

Protein sequence

MGNCLFGGGSGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSQDLFWNPLPHNEELLPGKSYYLLQRNRGRNRGETEEGEMGMIRAREGHVRSNSVPEATAAAAAGMAPYRMSFDYQGVLRRSQTEVFSRSSEKNGGGVWKVKLVISPKRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDHWSLSSTTANATPSASSKSGGLPEI
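
The relation between the coding sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, assuming the standard nuclear genetic code; the function name `translate` and the two short fragments (the first 30 and last 27 nucleotides of the CDS above) are chosen here for illustration only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: nuclear code).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 30 and last 27 nucleotides of the CDS listed above.
cds_start = "ATGGGGAACTGCTTATTCGGCGGTGGGTCT"
cds_end = "AAAAGTGGCGGCTTGCCGGAGATCTAA"

print(translate(cds_start))  # prints: MGNCLFGGGS
print(translate(cds_end))    # prints: KSGGLPEI*
```

The two fragments reproduce the first ten and last eight residues of the protein sequence, followed by the stop codon.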
Homology
BLAST of CmoCh16G006580.1 vs. ExPASy TrEMBL
Match: A0A6J1ETY6 (uncharacterized protein LOC111437546 OS=Cucurbita moschata OX=3662 GN=LOC111437546 PE=4 SV=1)

HSP 1 Score: 437.2 bits (1123), Expect = 4.2e-119
Identity = 224/224 (100.00%), Positives = 224/224 (100.00%), Query Frame = 0

Query: 1   MGNCLFGGGSGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSQDLFWNPLPH 60
           MGNCLFGGGSGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSQDLFWNPLPH
Sbjct: 1   MGNCLFGGGSGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSQDLFWNPLPH 60

Query: 61  NEELLPGKSYYLLQRNRGRNRGETEEGEMGMIRAREGHVRSNSVPEATAAAAAGMAPYRM 120
           NEELLPGKSYYLLQRNRGRNRGETEEGEMGMIRAREGHVRSNSVPEATAAAAAGMAPYRM
Sbjct: 61  NEELLPGKSYYLLQRNRGRNRGETEEGEMGMIRAREGHVRSNSVPEATAAAAAGMAPYRM 120

Query: 121 SFDYQGVLRRSQTEVFSRSSEKNGGGVWKVKLVISPKRLVEILEEEGHTQELIESVRTVA 180
           SFDYQGVLRRSQTEVFSRSSEKNGGGVWKVKLVISPKRLVEILEEEGHTQELIESVRTVA
Sbjct: 121 SFDYQGVLRRSQTEVFSRSSEKNGGGVWKVKLVISPKRLVEILEEEGHTQELIESVRTVA 180

Query: 181 KCGSTSTSSSFSSSMAFSDHWSLSSTTANATPSASSKSGGLPEI 224
           KCGSTSTSSSFSSSMAFSDHWSLSSTTANATPSASSKSGGLPEI
Sbjct: 181 KCGSTSTSSSFSSSMAFSDHWSLSSTTANATPSASSKSGGLPEI 224
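
The percentages in each HSP header are the match count divided by the alignment length. A minimal sketch of that arithmetic follows; the helper name `percent` is hypothetical, and the rounding to two decimals is an assumption about how the values are displayed.

```python
def percent(matches: int, alignment_length: int) -> str:
    """Format matches/alignment_length as a percentage with two decimals."""
    return f"{100 * matches / alignment_length:.2f}"

# Values from the HSP header above: 224 identical positions over 224 aligned.
print(percent(224, 224))  # prints: 100.00
```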

BLAST of CmoCh16G006580.1 vs. ExPASy TrEMBL
Match: A0A6J1JCF7 (uncharacterized protein LOC111483212 OS=Cucurbita maxima OX=3661 GN=LOC111483212 PE=4 SV=1)

HSP 1 Score: 414.1 bits (1063), Expect = 3.8e-112
Identity = 218/224 (97.32%), Positives = 219/224 (97.77%), Query Frame = 0

Query: 1   MGNCLFGGGSGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSQDLFWNPLPH 60
           MGNCLFGGGSGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKS DLFWNPLPH
Sbjct: 1   MGNCLFGGGSGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPH 60

Query: 61  NEELLPGKSYYLLQRNRGRNRGETEEGEMGMIRAREGHVRSNSVPEATAAAAAGMAPYRM 120
           NEELLPGKSYYLLQRNRGRNRGETEEGEMGMIRAREGHVRSNSVPEAT AAAAGMA YRM
Sbjct: 61  NEELLPGKSYYLLQRNRGRNRGETEEGEMGMIRAREGHVRSNSVPEAT-AAAAGMASYRM 120

Query: 121 SFDYQGVLRRSQTEVFSRSSEKNGGGVWKVKLVISPKRLVEILEEEGHTQELIESVRTVA 180
           SFDYQGVLRRSQTEVFSRSSEKN GGVWKVKLVISPKRLVEILEEEGHTQELIESVRTVA
Sbjct: 121 SFDYQGVLRRSQTEVFSRSSEKN-GGVWKVKLVISPKRLVEILEEEGHTQELIESVRTVA 180

Query: 181 KCGSTSTSSSFSSSMAFSDHWSLSSTTANATPSASSKSGGLPEI 224
           KCGSTSTSSSFSSSMAFSDHWSLSSTTANATPSAS+KSGGL EI
Sbjct: 181 KCGSTSTSSSFSSSMAFSDHWSLSSTTANATPSASAKSGGLLEI 222

BLAST of CmoCh16G006580.1 vs. ExPASy TrEMBL
Match: A0A0A0KUU6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G047890 PE=4 SV=1)

HSP 1 Score: 384.0 bits (985), Expect = 4.2e-103
Identity = 204/224 (91.07%), Positives = 208/224 (92.86%), Query Frame = 0

Query: 1   MGNCLFGGGSGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSQDLFWNPLPH 60
           MGNCLF GG+GEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKS DLFWNPLPH
Sbjct: 1   MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPH 60

Query: 61  NEELLPGKSYYLLQRNRGRNRGETEEGEMGMIRAREGHVRSNSVPEATAAAAAGMAPYRM 120
           NEELLPGKSYYLL RNRGRNRG  +  EMG+IRAREGHVRSNSVPEA AAA A MAPYRM
Sbjct: 61  NEELLPGKSYYLLPRNRGRNRGGEDGVEMGIIRAREGHVRSNSVPEA-AAAMAAMAPYRM 120

Query: 121 SFDYQGVLRRSQTEVFSRSSEKNGGGVWKVKLVISPKRLVEILEEEGHTQELIESVRTVA 180
           SFDYQGVLRRSQTEVFSR SEKN GGVWKVKLVISPKRLVEILEEEGHTQELIESVRTVA
Sbjct: 121 SFDYQGVLRRSQTEVFSRYSEKN-GGVWKVKLVISPKRLVEILEEEGHTQELIESVRTVA 180

Query: 181 KCGSTSTSSSFSSSMAFSDHWSLSSTTANATPSASSKSGGLPEI 224
           KCGSTSTSSSFSSSMAFSD WSLS+ TANATPS SSKSGGL EI
Sbjct: 181 KCGSTSTSSSFSSSMAFSDQWSLSTATANATPSVSSKSGGLLEI 222

BLAST of CmoCh16G006580.1 vs. ExPASy TrEMBL
Match: A0A5D3D1S5 (DUF4228 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G004760 PE=4 SV=1)

HSP 1 Score: 383.6 bits (984), Expect = 5.5e-103
Identity = 203/224 (90.62%), Positives = 208/224 (92.86%), Query Frame = 0

Query: 1   MGNCLFGGGSGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSQDLFWNPLPH 60
           MGNCLF GG+GEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKS DLFWNPLPH
Sbjct: 1   MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPH 60

Query: 61  NEELLPGKSYYLLQRNRGRNRGETEEGEMGMIRAREGHVRSNSVPEATAAAAAGMAPYRM 120
           NEELLPGKSYYLL RNRGRNRG  +  EMG+IRAREGHVRSNSVPEA AAA A MAPYRM
Sbjct: 61  NEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHVRSNSVPEA-AAAMAAMAPYRM 120

Query: 121 SFDYQGVLRRSQTEVFSRSSEKNGGGVWKVKLVISPKRLVEILEEEGHTQELIESVRTVA 180
           SFDYQGVLRRSQTEVFSR SEKN GGVWKVKLVISP+RLVEILEEEGHTQELIESVRTVA
Sbjct: 121 SFDYQGVLRRSQTEVFSRCSEKN-GGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVA 180

Query: 181 KCGSTSTSSSFSSSMAFSDHWSLSSTTANATPSASSKSGGLPEI 224
           KCGSTSTSSSFSSSMAFSD WSLS+ TANATPS SSKSGGL EI
Sbjct: 181 KCGSTSTSSSFSSSMAFSDQWSLSTATANATPSVSSKSGGLLEI 222

BLAST of CmoCh16G006580.1 vs. ExPASy TrEMBL
Match: A0A1S3BSZ8 (uncharacterized protein LOC103493172 OS=Cucumis melo OX=3656 GN=LOC103493172 PE=4 SV=1)

HSP 1 Score: 383.6 bits (984), Expect = 5.5e-103
Identity = 203/224 (90.62%), Positives = 208/224 (92.86%), Query Frame = 0

Query: 1   MGNCLFGGGSGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSQDLFWNPLPH 60
           MGNCLF GG+GEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKS DLFWNPLPH
Sbjct: 1   MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPH 60

Query: 61  NEELLPGKSYYLLQRNRGRNRGETEEGEMGMIRAREGHVRSNSVPEATAAAAAGMAPYRM 120
           NEELLPGKSYYLL RNRGRNRG  +  EMG+IRAREGHVRSNSVPEA AAA A MAPYRM
Sbjct: 61  NEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHVRSNSVPEA-AAAMAAMAPYRM 120

Query: 121 SFDYQGVLRRSQTEVFSRSSEKNGGGVWKVKLVISPKRLVEILEEEGHTQELIESVRTVA 180
           SFDYQGVLRRSQTEVFSR SEKN GGVWKVKLVISP+RLVEILEEEGHTQELIESVRTVA
Sbjct: 121 SFDYQGVLRRSQTEVFSRCSEKN-GGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVA 180

Query: 181 KCGSTSTSSSFSSSMAFSDHWSLSSTTANATPSASSKSGGLPEI 224
           KCGSTSTSSSFSSSMAFSD WSLS+ TANATPS SSKSGGL EI
Sbjct: 181 KCGSTSTSSSFSSSMAFSDQWSLSTATANATPSVSSKSGGLLEI 222

BLAST of CmoCh16G006580.1 vs. TAIR 10
Match: AT1G64700.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-terminal protein myristoylation; LOCATED IN: cellular_component unknown; EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 11 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G61920.1); Has 48 Blast hits to 47 proteins in 7 species: Archaea - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 48; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).)

HSP 1 Score: 138.7 bits (348), Expect = 5.9e-33
Identity = 92/210 (43.81%), Positives = 122/210 (58.10%), Query Frame = 0

Query: 1   MGNCLFGG-GSGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSQDLFWNPLP 60
           MGNCLFGG G  E    IKVI S+GG++E  SP+T G ++  F G+ +F + DL W PL 
Sbjct: 1   MGNCLFGGLGDEEEDLLIKVIKSDGGVLEFYSPVTAGFVSHGFSGHALFSAVDLLWKPLA 60

Query: 61  HNEELLPGKSYYLLQRNRGRNRGETEEGEMGMIRAREGHVRSNSVPEATAAAAAGMAPYR 120
           H+  L+PG+SYYL   N   +  +T  G          HVRSNS       + + + PYR
Sbjct: 61  HDHLLVPGQSYYLFP-NIVSDELKTFVGSC--------HVRSNS------ESLSAITPYR 120

Query: 121 MSFDY-QGVLRRSQTEVFSRSSE---------------KNGGGVWKVKLVISPKRLVEIL 180
           MS DY   VL+RS T+VFSR+S                 + G +WKV L+I+ + L++IL
Sbjct: 121 MSLDYNHRVLKRSYTDVFSRNSHIRTRQKEKKTRRRRTSSKGAIWKVNLIINTEELLQIL 180

Query: 181 EEEGHTQELIESVRTVAKCGSTSTSSSFSS 194
            E+G T ELIESVR VAK G TS+ +S SS
Sbjct: 181 SEDGRTNELIESVRAVAK-GETSSITSSSS 194

BLAST of CmoCh16G006580.1 vs. TAIR 10
Match: AT3G61920.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-terminal protein myristoylation; LOCATED IN: cellular_component unknown; EXPRESSED IN: shoot, hypocotyl, root, egg cell; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G64700.1); Has 77 Blast hits to 77 proteins in 8 species: Archaea - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 77; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).)

HSP 1 Score: 107.1 bits (266), Expect = 1.9e-23
Identity = 87/222 (39.19%), Positives = 110/222 (49.55%), Query Frame = 0

Query: 1   MGNCLF--GGGSGEIQGK----IKVITSNGGIMELGSPITVGCIADEFPGYGIFKSQDLF 60
           MGNC+F   GGS ++  K    IKV+T NGG+MEL  PI    I +EFPG+ I  S  L 
Sbjct: 1   MGNCVFKGNGGSRKLYDKDDSLIKVVTPNGGVMELHPPIFAEFITNEFPGHVIHDSLSLR 60

Query: 61  WN--PLPHNEELLPGKSYYLLQRNRGRNRGETEEGEMGMIRAREGHVRSNSVPEATAAAA 120
            +  PL H EEL PG  YYLL                               P +++AAA
Sbjct: 61  HSSPPLLHGEELFPGNIYYLL-------------------------------PLSSSAAA 120

Query: 121 AGM--------APYRMSFDYQGVLRRSQTEVFSRSSEKNGGGVWKVKLVISPKRLVEILE 180
                       PYRMSF         +T + +  S   G GVWKV+LVISP++L EIL 
Sbjct: 121 TAQLDSSDQLSTPYRMSF--------GKTPIMAALS-GGGCGVWKVRLVISPEQLAEILA 180

Query: 181 EEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDHWSLSST 207
           E+  T+ L+ESVRTVAKCG          S A SD  S++S+
Sbjct: 181 EDVETEALVESVRTVAKCGGYGCGGGV-HSRANSDQLSVTSS 181

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
A0A6J1ETY6 | 4.2e-119 | 100.00 | uncharacterized protein LOC111437546 OS=Cucurbita moschata OX=3662 GN=LOC1114375...
A0A6J1JCF7 | 3.8e-112 | 97.32 | uncharacterized protein LOC111483212 OS=Cucurbita maxima OX=3661 GN=LOC111483212...
A0A0A0KUU6 | 4.2e-103 | 91.07 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G047890 PE=4 SV=1
A0A5D3D1S5 | 5.5e-103 | 90.63 | DUF4228 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A1S3BSZ8 | 5.5e-103 | 90.63 | uncharacterized protein LOC103493172 OS=Cucumis melo OX=3656 GN=LOC103493172 PE=...

Match Name | E-value | Identity | Description
AT1G64700.1 | 5.9e-33 | 43.81 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-termin...
AT3G61920.1 | 1.9e-23 | 39.19 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-termin...
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR025322 | Protein of unknown function DUF4228, plant | PFAM | PF14009 | DUF4228 | coord: 1..171; e-value: 5.2E-17; score: 62.8
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 205..224
None | No IPR available | PANTHER | PTHR33148:SF48 | DUF4228 DOMAIN PROTEIN | coord: 1..208
None | No IPR available | PANTHER | PTHR33148 | PLASTID MOVEMENT IMPAIRED PROTEIN-RELATED | coord: 1..208

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name | Unique Name | Type
CmoCh16G006580 | CmoCh16G006580 | gene


The following exon feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CmoCh16G006580.1:exon:10948 | CmoCh16G006580.1:exon:10948 | exon


The following CDS feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CmoCh16G006580.1:cds | CmoCh16G006580.1:cds | CDS


The following polypeptide feature(s) derive from this mRNA:

Feature Name | Unique Name | Type
CmoCh16G006580.1 | CmoCh16G006580.1-protein | polypeptide