CmoCh16G005880 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh16G005880
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: UvrABC system protein C
Location: Cmo_Chr16: 2839511 .. 2840089 (-)
RNA-Seq Expression: CmoCh16G005880
Synteny: CmoCh16G005880
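
The Location field can be checked against the sequence records. The minimal Python sketch below assumes the coordinates are 1-based and inclusive (an assumption, as the record does not state its convention); the helper name `span_length` is hypothetical and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Coordinates copied from the Location field (minus strand).
length = span_length(2839511, 2840089)

print(length)           # 579
print(length % 3 == 0)  # True: the span is a whole number of codons
```

Under that assumption the span is 579 bp, which equals 193 codons: the 192 residues reported in the BLAST alignments plus one stop codon.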
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGAAACTGTTCTCTGAAAGGAGTGGCCGCCGATTGCGAAAAGCCCATCAGAATCTTAACGGATTCCGGCAACATAATCAACTTCCATGGCCCTAAACAAGTCGATCAAATCCTCAAGAACTATCCTCCCGGCGTCTATGGCGTTTTCCGGCGCCCCAATCTCTCTTCGCCATTACCCATTTCGGAGCTCCTCGACGCCGGAAAATCCTACTTTCTCCTCCCGCTTTCCCGAGCCGTAGAGAAAGAGAGGTCCGATGCGGCCGAGGATCTGAGAAGTGGGTCGGGGCTGGAAGTGCTACCGACAGGTGGCGACGGCATTTGGAGGGTCAAATTGGTGATTGATACGAAACAGTTGGGGGAAATTTTGGCAGAGGAAGGGAACACAGAGGCGTTGATTGAGAGGATGAGAGCGGCGGCGGCGACGGCGGCGGTGCAGAGTCCACGGCGGGAGAAGATCGGAGGGTGGAAGCCGACATGGGGGAATTGGTCGAAGTTTCTTCCAATTGATGTTGGAAACAATAATAAAGCACAAATCAAGGATTTTCATTCTGGAAATGGGTGTTTATATGCTACATAA

mRNA sequence

ATGGGAAACTGTTCTCTGAAAGGAGTGGCCGCCGATTGCGAAAAGCCCATCAGAATCTTAACGGATTCCGGCAACATAATCAACTTCCATGGCCCTAAACAAGTCGATCAAATCCTCAAGAACTATCCTCCCGGCGTCTATGGCGTTTTCCGGCGCCCCAATCTCTCTTCGCCATTACCCATTTCGGAGCTCCTCGACGCCGGAAAATCCTACTTTCTCCTCCCGCTTTCCCGAGCCGTAGAGAAAGAGAGGTCCGATGCGGCCGAGGATCTGAGAAGTGGGTCGGGGCTGGAAGTGCTACCGACAGGTGGCGACGGCATTTGGAGGGTCAAATTGGTGATTGATACGAAACAGTTGGGGGAAATTTTGGCAGAGGAAGGGAACACAGAGGCGTTGATTGAGAGGATGAGAGCGGCGGCGGCGACGGCGGCGGTGCAGAGTCCACGGCGGGAGAAGATCGGAGGGTGGAAGCCGACATGGGGGAATTGGTCGAAGTTTCTTCCAATTGATGTTGGAAACAATAATAAAGCACAAATCAAGGATTTTCATTCTGGAAATGGGTGTTTATATGCTACATAA

Coding sequence (CDS)

ATGGGAAACTGTTCTCTGAAAGGAGTGGCCGCCGATTGCGAAAAGCCCATCAGAATCTTAACGGATTCCGGCAACATAATCAACTTCCATGGCCCTAAACAAGTCGATCAAATCCTCAAGAACTATCCTCCCGGCGTCTATGGCGTTTTCCGGCGCCCCAATCTCTCTTCGCCATTACCCATTTCGGAGCTCCTCGACGCCGGAAAATCCTACTTTCTCCTCCCGCTTTCCCGAGCCGTAGAGAAAGAGAGGTCCGATGCGGCCGAGGATCTGAGAAGTGGGTCGGGGCTGGAAGTGCTACCGACAGGTGGCGACGGCATTTGGAGGGTCAAATTGGTGATTGATACGAAACAGTTGGGGGAAATTTTGGCAGAGGAAGGGAACACAGAGGCGTTGATTGAGAGGATGAGAGCGGCGGCGGCGACGGCGGCGGTGCAGAGTCCACGGCGGGAGAAGATCGGAGGGTGGAAGCCGACATGGGGGAATTGGTCGAAGTTTCTTCCAATTGATGTTGGAAACAATAATAAAGCACAAATCAAGGATTTTCATTCTGGAAATGGGTGTTTATATGCTACATAA

Protein sequence

MGNCSLKGVAADCEKPIRILTDSGNIINFHGPKQVDQILKNYPPGVYGVFRRPNLSSPLPISELLDAGKSYFLLPLSRAVEKERSDAAEDLRSGSGLEVLPTGGDGIWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRREKIGGWKPTWGNWSKFLPIDVGNNNKAQIKDFHSGNGCLYAT
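
The relationship between the coding sequence and the protein sequence can be illustrated with a short Python sketch. It assumes the standard nuclear genetic code; the function name `translate` and the table construction are illustrative choices, not the pipeline used to build this record.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons and last seven codons of the CDS above.
print(translate("ATGGGAAACTGTTCTCTGAAAGGA"))  # MGNCSLKG
print(translate("GGGTGTTTATATGCTACATAA"))     # GCLYAT
```

Both fragments match the start and end of the protein sequence listed above; passing the full CDS string to `translate` would be the natural way to check the whole record.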
Homology
BLAST of CmoCh16G005880 vs. ExPASy TrEMBL
Match: A0A6J1EZE8 (uncharacterized protein LOC111437614 OS=Cucurbita moschata OX=3662 GN=LOC111437614 PE=4 SV=1)

HSP 1 Score: 397.5 bits (1020), Expect = 3.1e-107
Identity = 192/192 (100.00%), Positives = 192/192 (100.00%), Query Frame = 0

Query: 1   MGNCSLKGVAADCEKPIRILTDSGNIINFHGPKQVDQILKNYPPGVYGVFRRPNLSSPLP 60
           MGNCSLKGVAADCEKPIRILTDSGNIINFHGPKQVDQILKNYPPGVYGVFRRPNLSSPLP
Sbjct: 1   MGNCSLKGVAADCEKPIRILTDSGNIINFHGPKQVDQILKNYPPGVYGVFRRPNLSSPLP 60

Query: 61  ISELLDAGKSYFLLPLSRAVEKERSDAAEDLRSGSGLEVLPTGGDGIWRVKLVIDTKQLG 120
           ISELLDAGKSYFLLPLSRAVEKERSDAAEDLRSGSGLEVLPTGGDGIWRVKLVIDTKQLG
Sbjct: 61  ISELLDAGKSYFLLPLSRAVEKERSDAAEDLRSGSGLEVLPTGGDGIWRVKLVIDTKQLG 120

Query: 121 EILAEEGNTEALIERMRAAAATAAVQSPRREKIGGWKPTWGNWSKFLPIDVGNNNKAQIK 180
           EILAEEGNTEALIERMRAAAATAAVQSPRREKIGGWKPTWGNWSKFLPIDVGNNNKAQIK
Sbjct: 121 EILAEEGNTEALIERMRAAAATAAVQSPRREKIGGWKPTWGNWSKFLPIDVGNNNKAQIK 180

Query: 181 DFHSGNGCLYAT 193
           DFHSGNGCLYAT
Sbjct: 181 DFHSGNGCLYAT 192

BLAST of CmoCh16G005880 vs. ExPASy TrEMBL
Match: A0A6J1J4V0 (uncharacterized protein LOC111483432 OS=Cucurbita maxima OX=3661 GN=LOC111483432 PE=4 SV=1)

HSP 1 Score: 385.6 bits (989), Expect = 1.2e-103
Identity = 185/192 (96.35%), Positives = 188/192 (97.92%), Query Frame = 0

Query: 1   MGNCSLKGVAADCEKPIRILTDSGNIINFHGPKQVDQILKNYPPGVYGVFRRPNLSSPLP 60
           MGNCSLKGVAADCEKPIRILTDSGNIINFHGPKQVDQILKNYPPGVYGVFRRPNLSSPLP
Sbjct: 1   MGNCSLKGVAADCEKPIRILTDSGNIINFHGPKQVDQILKNYPPGVYGVFRRPNLSSPLP 60

Query: 61  ISELLDAGKSYFLLPLSRAVEKERSDAAEDLRSGSGLEVLPTGGDGIWRVKLVIDTKQLG 120
           ISE LDAGKSYFLLPLSRA EKERSDAA DLR+GSGLEVLPTGGDGIWRVKLVIDTKQLG
Sbjct: 61  ISEPLDAGKSYFLLPLSRAAEKERSDAAGDLRTGSGLEVLPTGGDGIWRVKLVIDTKQLG 120

Query: 121 EILAEEGNTEALIERMRAAAATAAVQSPRREKIGGWKPTWGNWSKFLPIDVGNNNKAQIK 180
           EILAE+GNTEALIERMRAAAATAAVQSPRREKIGGWKPTWGNWSKF PIDVGNNNKAQ+K
Sbjct: 121 EILAEDGNTEALIERMRAAAATAAVQSPRREKIGGWKPTWGNWSKFFPIDVGNNNKAQMK 180

Query: 181 DFHSGNGCLYAT 193
           DFHSGNGCLYAT
Sbjct: 181 DFHSGNGCLYAT 192

BLAST of CmoCh16G005880 vs. ExPASy TrEMBL
Match: A0A0A0KYF7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G052720 PE=4 SV=1)

HSP 1 Score: 308.9 bits (790), Expect = 1.5e-80
Identity = 150/188 (79.79%), Positives = 164/188 (87.23%), Query Frame = 0

Query: 1   MGNCSLKGVAADCEKPIRILTDSGNIINFHGPKQVDQILKNYPPGVYGVFRRPNLSSPLP 60
           MGNCSLKG+A DCEKPIRILTDSG+IINFHGPKQV QIL NYPPG+YGVFRRPNLSSPLP
Sbjct: 17  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLP 76

Query: 61  ISELLDAGKSYFLLPLSRAVEKERS-----DAAEDLRSGSGLEVLPTGGDGIWRVKLVID 120
           +SE LDAGKSYFLLPLS++     S       ++D+ S SGLEVLP GG+G+WRVKLVID
Sbjct: 77  VSEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSKDVGSESGLEVLPAGGNGVWRVKLVID 136

Query: 121 TKQLGEILAEEGNTEALIERMRAAAATAAVQSPRREKIGGWKPTWGNWSKFLPIDVGNNN 180
           TKQLGEILAEEGNTEALIERMRAAAATAAVQSPRR KIGGWKP WGNW KF PIDVGN+N
Sbjct: 137 TKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSN 196

Query: 181 KAQIKDFH 184
           KAQ+K F+
Sbjct: 197 KAQMKVFN 204

BLAST of CmoCh16G005880 vs. ExPASy TrEMBL
Match: A0A5A7TTS6 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G003550 PE=4 SV=1)

HSP 1 Score: 302.8 bits (774), Expect = 1.1e-78
Identity = 148/188 (78.72%), Positives = 160/188 (85.11%), Query Frame = 0

Query: 1   MGNCSLKGVAADCEKPIRILTDSGNIINFHGPKQVDQILKNYPPGVYGVFRRPNLSSPLP 60
           MGNCSLKG+A DC KPIRILTDSG+IINFHGPKQV QIL NYPPG+YGVFRRPNLSSPLP
Sbjct: 1   MGNCSLKGMAVDCVKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLP 60

Query: 61  ISELLDAGKSYFLLPLSRAVEKERSD----AAEDLRSGSGLEVLPTGGDGIWRVKLVIDT 120
            SE LDAGKSYFLLPLS+      S      ++DL S SGLEVLP  G+G+WRVKLVIDT
Sbjct: 61  FSEPLDAGKSYFLLPLSQPTNDTESSPPPLPSKDLGSESGLEVLPASGNGVWRVKLVIDT 120

Query: 121 KQLGEILAEEGNTEALIERMRAAAATAAVQSPRREKIGGWKPTWGNWSKFLPIDVGNNNK 180
           KQLGEILAEEGNTEALIER+RAAAATAAVQSPRR KI GWKP WGNW KF P+D GNNNK
Sbjct: 121 KQLGEILAEEGNTEALIERIRAAAATAAVQSPRRGKIVGWKPMWGNWLKFFPMDFGNNNK 180

Query: 181 AQIKDFHS 185
           AQIK+F+S
Sbjct: 181 AQIKEFNS 188

BLAST of CmoCh16G005880 vs. ExPASy TrEMBL
Match: A0A1S3BTS2 (uncharacterized protein LOC103493080 OS=Cucumis melo OX=3656 GN=LOC103493080 PE=4 SV=1)

HSP 1 Score: 302.8 bits (774), Expect = 1.1e-78
Identity = 148/188 (78.72%), Positives = 160/188 (85.11%), Query Frame = 0

Query: 1   MGNCSLKGVAADCEKPIRILTDSGNIINFHGPKQVDQILKNYPPGVYGVFRRPNLSSPLP 60
           MGNCSLKG+A DC KPIRILTDSG+IINFHGPKQV QIL NYPPG+YGVFRRPNLSSPLP
Sbjct: 1   MGNCSLKGMAVDCVKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLP 60

Query: 61  ISELLDAGKSYFLLPLSRAVEKERSD----AAEDLRSGSGLEVLPTGGDGIWRVKLVIDT 120
            SE LDAGKSYFLLPLS+      S      ++DL S SGLEVLP  G+G+WRVKLVIDT
Sbjct: 61  FSEPLDAGKSYFLLPLSQPTNDTESSPPPLPSKDLGSESGLEVLPASGNGVWRVKLVIDT 120

Query: 121 KQLGEILAEEGNTEALIERMRAAAATAAVQSPRREKIGGWKPTWGNWSKFLPIDVGNNNK 180
           KQLGEILAEEGNTEALIER+RAAAATAAVQSPRR KI GWKP WGNW KF P+D GNNNK
Sbjct: 121 KQLGEILAEEGNTEALIERIRAAAATAAVQSPRRGKIVGWKPMWGNWLKFFPMDFGNNNK 180

Query: 181 AQIKDFHS 185
           AQIK+F+S
Sbjct: 181 AQIKEFNS 188

BLAST of CmoCh16G005880 vs. TAIR 10
Match: AT3G61920.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-terminal protein myristoylation; LOCATED IN: cellular_component unknown; EXPRESSED IN: shoot, hypocotyl, root, egg cell; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G64700.1); Has 77 Blast hits to 77 proteins in 8 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 77; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 76.3 bits (186), Expect = 3.1e-14
Identity = 54/156 (34.62%), Positives = 78/156 (50.00%), Query Frame = 0

Query: 1   MGNCSLKGVAAD------CEKPIRILTDSGNIINFHGPKQVDQILKNYPPGV-YGVFRRP 60
           MGNC  KG           +  I+++T +G ++  H P   + I   +P  V +      
Sbjct: 1   MGNCVFKGNGGSRKLYDKDDSLIKVVTPNGGVMELHPPIFAEFITNEFPGHVIHDSLSLR 60

Query: 61  NLSSPLPISELLDAGKSYFLLPL-SRAVEKERSDAAEDL----RSGSG----LEVLPTGG 120
           + S PL   E L  G  Y+LLPL S A    + D+++ L    R   G    +  L  GG
Sbjct: 61  HSSPPLLHGEELFPGNIYYLLPLSSSAAATAQLDSSDQLSTPYRMSFGKTPIMAALSGGG 120

Query: 121 DGIWRVKLVIDTKQLGEILAEEGNTEALIERMRAAA 141
            G+W+V+LVI  +QL EILAE+  TEAL+E +R  A
Sbjct: 121 CGVWKVRLVISPEQLAEILAEDVETEALVESVRTVA 156

BLAST of CmoCh16G005880 vs. TAIR 10
Match: AT4G10910.1 (unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 57.8 bits (138), Expect = 1.1e-08
Identity = 29/54 (53.70%), Positives = 39/54 (72.22%), Query Frame = 0

Query: 94  GSGLEVLPTGGDGIWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQS 148
           G  ++V P   +G+W+ K+VI +KQL EILA EGNT ALI+++R AAA A V S
Sbjct: 63  GPRIQVSPQRRNGVWKAKVVIGSKQLEEILAVEGNTHALIDQLRFAAAEALVSS 116

BLAST of CmoCh16G005880 vs. TAIR 10
Match: AT1G64700.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-terminal protein myristoylation; LOCATED IN: cellular_component unknown; EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 11 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G61920.1); Has 48 Blast hits to 47 proteins in 7 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 48; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 47.4 bits (111), Expect = 1.5e-05
Identity = 49/197 (24.87%), Positives = 81/197 (41.12%), Query Frame = 0

Query: 1   MGNCSLKGVAADCEK-PIRILTDSGNIINFHGPKQVDQILKNYPPGVYGVFRRPNLS-SP 60
           MGNC   G+  + E   I+++   G ++ F+ P     +   +    + +F   +L   P
Sbjct: 1   MGNCLFGGLGDEEEDLLIKVIKSDGGVLEFYSPVTAGFVSHGF--SGHALFSAVDLLWKP 60

Query: 61  LPISELLDAGKSYFLLPLSRAVEKERSDAAEDLRSGS-GLEVLP---------------- 120
           L    LL  G+SY+L P   + E +    +  +RS S  L  +                 
Sbjct: 61  LAHDHLLVPGQSYYLFPNIVSDELKTFVGSCHVRSNSESLSAITPYRMSLDYNHRVLKRS 120

Query: 121 ------------------------TGGDG-IWRVKLVIDTKQLGEILAEEGNTEALIERM 152
                                   T   G IW+V L+I+T++L +IL+E+G T  LIE +
Sbjct: 121 YTDVFSRNSHIRTRQKEKKTRRRRTSSKGAIWKVNLIINTEELLQILSEDGRTNELIESV 180

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
A0A6J1EZE8   3.1e-107  100.00        uncharacterized protein LOC111437614 OS=Cucurbita moschata OX=3662 GN=LOC1114376...
A0A6J1J4V0   1.2e-103  96.35         uncharacterized protein LOC111483432 OS=Cucurbita maxima OX=3661 GN=LOC111483432...
A0A0A0KYF7   1.5e-80   79.79         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G052720 PE=4 SV=1
A0A5A7TTS6   1.1e-78   78.72         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3BTS2   1.1e-78   78.72         uncharacterized protein LOC103493080 OS=Cucumis melo OX=3656 GN=LOC103493080 PE=...

Match Name   E-value   Identity (%)  Description
AT3G61920.1  3.1e-14   34.62         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-termin...
AT4G10910.1  1.1e-08   53.70         unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae -...
AT1G64700.1  1.5e-05   24.87         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-termin...
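
The identity values in this summary correspond to the `Identity = matches/aligned` lines of the HSP reports above. A minimal Python sketch of that arithmetic follows; the helper name `percent` is hypothetical, and two-decimal rounding is assumed from the figures shown.

```python
def percent(matches: int, aligned: int) -> str:
    """Share of aligned columns that match, as a percentage with two decimals."""
    return f"{100 * matches / aligned:.2f}"


# Counts copied from the HSP reports above.
print(percent(185, 192))  # 96.35 (identity, A0A6J1J4V0)
print(percent(188, 192))  # 97.92 (positives, A0A6J1J4V0)
print(percent(150, 188))  # 79.79 (identity, A0A0A0KYF7)
print(percent(54, 156))   # 34.62 (identity, AT3G61920.1)
```

Note that the denominator is the alignment length including gap columns, so hits with different alignment lengths are not directly comparable on percentage alone.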
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                             Source   Source Term     Source Description                         Alignment
IPR025322  Protein of unknown function DUF4228, plant  PFAM     PF14009         DUF4228                                    coord: 1..87; e-value: 3.0E-12; score: 47.3
None       No IPR available                            PANTHER  PTHR33148:SF33  DUF4228 DOMAIN PROTEIN                     coord: 1..164
None       No IPR available                            PANTHER  PTHR33148       PLASTID MOVEMENT IMPAIRED PROTEIN-RELATED  coord: 1..164

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh16G005880.1  CmoCh16G005880.1  mRNA