CmoCh17G008950.1 (mRNA) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh17G008950.1
Type: mRNA
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: Clp R domain-containing protein
Location: Cmo_Chr17: 8106117 .. 8106380 (+)
Sequence length: 264
RNA-Seq Expression: CmoCh17G008950.1
Synteny: CmoCh17G008950.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATGATGGAGATTAGGCATTTGGAAGGAATGGCTGAGACTGCAGCCGTGTTCCTCCTTGTGCAGATCCTGGTTTATCTAATTCTCTCTAAATCTTCCCATGTCTTCTCTAAGAACACGAAGAGGTCTCTTAGTTTCAGACAGGTCCGATCTCTGAGTATCCGTCGGATTATGGCAGCTCTCTCCGATTTCCCCTCCGGCACCGGCGACCCAGACGAAAGTCGTGGGGAATGTGGAACCATGTTTGATGAAGGCAGTTCTTAA

mRNA sequence

ATGATGATGGAGATTAGGCATTTGGAAGGAATGGCTGAGACTGCAGCCGTGTTCCTCCTTGTGCAGATCCTGGTTTATCTAATTCTCTCTAAATCTTCCCATGTCTTCTCTAAGAACACGAAGAGGTCTCTTAGTTTCAGACAGGTCCGATCTCTGAGTATCCGTCGGATTATGGCAGCTCTCTCCGATTTCCCCTCCGGCACCGGCGACCCAGACGAAAGTCGTGGGGAATGTGGAACCATGTTTGATGAAGGCAGTTCTTAA

Coding sequence (CDS)

ATGATGATGGAGATTAGGCATTTGGAAGGAATGGCTGAGACTGCAGCCGTGTTCCTCCTTGTGCAGATCCTGGTTTATCTAATTCTCTCTAAATCTTCCCATGTCTTCTCTAAGAACACGAAGAGGTCTCTTAGTTTCAGACAGGTCCGATCTCTGAGTATCCGTCGGATTATGGCAGCTCTCTCCGATTTCCCCTCCGGCACCGGCGACCCAGACGAAAGTCGTGGGGAATGTGGAACCATGTTTGATGAAGGCAGTTCTTAA

Protein sequence

MMMEIRHLEGMAETAAVFLLVQILVYLILSKSSHVFSKNTKRSLSFRQVRSLSIRRIMAALSDFPSGTGDPDESRGECGTMFDEGSS
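
The sequences above can be cross-checked against each other. The sketch below is illustrative only and is not part of the database pipeline. It assumes the standard genetic code (NCBI translation table 1) and 1-based inclusive coordinates; the names CODON_TABLE and translate are hypothetical helpers introduced here. It translates the CDS and compares the result, and the feature length, with the values listed on this page.

```python
from itertools import product

# Assumption: standard genetic code (NCBI translation table 1),
# with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# CDS and protein copied from this record (CDS split into 66-nt chunks).
CDS = (
    "ATGATGATGGAGATTAGGCATTTGGAAGGAATGGCTGAGACTGCAGCCGTGTTCCTCCTTGTGCAG"
    "ATCCTGGTTTATCTAATTCTCTCTAAATCTTCCCATGTCTTCTCTAAGAACACGAAGAGGTCTCTT"
    "AGTTTCAGACAGGTCCGATCTCTGAGTATCCGTCGGATTATGGCAGCTCTCTCCGATTTCCCCTCC"
    "GGCACCGGCGACCCAGACGAAAGTCGTGGGGAATGTGGAACCATGTTTGATGAAGGCAGTTCTTAA"
)
PROTEIN = (
    "MMMEIRHLEGMAETAAVFLLVQILVYLILSKSSHVFSKNTKRSLSFRQVRSLSIRRIMAA"
    "LSDFPSGTGDPDESRGECGTMFDEGSS"
)

# Location from the Overview, treated as 1-based inclusive coordinates.
START, END = 8106117, 8106380

translated = translate(CDS)
print("feature length matches location:", END - START + 1 == len(CDS))
print("translation matches protein:", translated == PROTEIN + "*")
```

Under these assumptions the 264-nt CDS corresponds to 88 codons: 87 amino acids plus the terminal stop codon.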
Homology
BLAST of CmoCh17G008950.1 vs. ExPASy TrEMBL
Match: A0A6J1GRF7 (uncharacterized protein LOC111456452 OS=Cucurbita moschata OX=3662 GN=LOC111456452 PE=4 SV=1)

HSP 1 Score: 163.7 bits (413), Expect = 3.5e-37
Identity = 87/87 (100.00%), Positives = 87/87 (100.00%), Query Frame = 0

Query: 1  MMMEIRHLEGMAETAAVFLLVQILVYLILSKSSHVFSKNTKRSLSFRQVRSLSIRRIMAA 60
          MMMEIRHLEGMAETAAVFLLVQILVYLILSKSSHVFSKNTKRSLSFRQVRSLSIRRIMAA
Sbjct: 1  MMMEIRHLEGMAETAAVFLLVQILVYLILSKSSHVFSKNTKRSLSFRQVRSLSIRRIMAA 60

Query: 61 LSDFPSGTGDPDESRGECGTMFDEGSS 88
          LSDFPSGTGDPDESRGECGTMFDEGSS
Sbjct: 61 LSDFPSGTGDPDESRGECGTMFDEGSS 87

BLAST of CmoCh17G008950.1 vs. ExPASy TrEMBL
Match: A0A6J1JXQ0 (uncharacterized protein LOC111488440 OS=Cucurbita maxima OX=3661 GN=LOC111488440 PE=4 SV=1)

HSP 1 Score: 139.4 bits (350), Expect = 7.0e-30
Identity = 75/76 (98.68%), Positives = 75/76 (98.68%), Query Frame = 0

Query: 1  MMMEIRHLEGMAETAAVFLLVQILVYLILSKSSHVFSKNTKRSLSFRQVRSLSIRRIMAA 60
          MMMEIRHLEGMAETAAVFLLVQILVYLILSKSSHVFSKNTKRSLSFRQVRSLSIRRIMAA
Sbjct: 1  MMMEIRHLEGMAETAAVFLLVQILVYLILSKSSHVFSKNTKRSLSFRQVRSLSIRRIMAA 60

Query: 61 LSDFPSGTGDPDESRG 77
          LSDFPSG GDPDESRG
Sbjct: 61 LSDFPSGAGDPDESRG 76
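
The identity figure reported for an HSP is the number of identical aligned positions divided by the alignment length. The following is a minimal sketch of that arithmetic for the Cucurbita maxima hit above, not the BLAST implementation itself; percent_identity is a hypothetical helper, and the two strings are the aligned Query and Sbjct rows copied from the report.

```python
def percent_identity(query: str, subject: str) -> tuple[int, int, float]:
    """Return (identical positions, alignment length, percent identity).

    Both strings must be rows of the same alignment, so they have equal
    length; a gap character ('-') never counts as an identity.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    matches = sum(q == s and q != "-" for q, s in zip(query, subject))
    length = len(query)
    return matches, length, round(100 * matches / length, 2)

# Aligned rows of the HSP against A0A6J1JXQ0 (Cucurbita maxima).
query = (
    "MMMEIRHLEGMAETAAVFLLVQILVYLILSKSSHVFSKNTKRSLSFRQVRSLSIRRIMAA"
    "LSDFPSGTGDPDESRG"
)
subject = (
    "MMMEIRHLEGMAETAAVFLLVQILVYLILSKSSHVFSKNTKRSLSFRQVRSLSIRRIMAA"
    "LSDFPSGAGDPDESRG"
)

matches, length, pct = percent_identity(query, subject)
print(f"Identity = {matches}/{length} ({pct:.2f}%)")
```

The single mismatch (T in the query, A in the subject) accounts for the reported 75/76 identities.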

BLAST of CmoCh17G008950.1 vs. ExPASy TrEMBL
Match: A0A6J1D0M4 (uncharacterized protein LOC111016209 OS=Momordica charantia OX=3673 GN=LOC111016209 PE=4 SV=1)

HSP 1 Score: 106.3 bits (264), Expect = 6.6e-20
Identity = 67/95 (70.53%), Positives = 71/95 (74.74%), Query Frame = 0

Query: 1  MMMEIRHLEGMAETAAVFLLVQILVYLILSKSSHVFSKNTKRSLSFRQVRSLSIRRIMAA 60
          M MEI  LEG+  TAAVF LVQILVYLILSKSS VFSKNTKRSLSFR VRSLSIRRI+AA
Sbjct: 2  MTMEIIPLEGIVATAAVFFLVQILVYLILSKSSDVFSKNTKRSLSFRHVRSLSIRRILAA 61

Query: 61 LSDFPSG--TGDPDES------RGECGTMFDEGSS 88
          LSD PSG     P+ES       GEC  MF +GSS
Sbjct: 62 LSDVPSGGEPSPPNESPSPFSIGGECAAMFRDGSS 96

BLAST of CmoCh17G008950.1 vs. ExPASy TrEMBL
Match: A0A1U7YVL7 (uncharacterized protein LOC104587448 OS=Nelumbo nucifera OX=4432 GN=LOC104587448 PE=4 SV=1)

HSP 1 Score: 75.9 bits (185), Expect = 9.5e-11
Identity = 44/69 (63.77%), Positives = 51/69 (73.91%), Query Frame = 0

Query: 8  LEGMAETAAVFLLVQILVYLILSKSSHVFSKNTKRSLSFRQVRSLSIRRIMAALSDFPSG 67
          LE M     +FLLVQ LVYLILSKSS++FSKN  RS+SFR  RS SIRR +AALSD P+G
Sbjct: 7  LEDMLLKVGMFLLVQALVYLILSKSSNIFSKNVSRSVSFRTARSASIRRFLAALSDLPAG 66

Query: 68 TGDPDESRG 77
          +  P  SRG
Sbjct: 67 SESP--SRG 73

BLAST of CmoCh17G008950.1 vs. ExPASy TrEMBL
Match: A0A660KP35 (Uncharacterized protein OS=Carpinus fangiana OX=176857 GN=FH972_010708 PE=4 SV=1)

HSP 1 Score: 73.9 bits (180), Expect = 3.6e-10
Identity = 42/68 (61.76%), Positives = 52/68 (76.47%), Query Frame = 0

Query: 1  MMMEIRHL-EGMAETAAVFLLVQILVYLILSKSSHVFSKNTKRSLSFRQVRSLSIRRIMA 60
          M+MEI  L + M    A+FLLVQ+LVYLILS SS +FSKN +RSLSFR  RS+SI R++A
Sbjct: 1  MVMEISSLADNMVLKVAMFLLVQVLVYLILSNSSGIFSKNARRSLSFRPARSVSINRVLA 60

Query: 61 ALSDFPSG 68
           LSD P+G
Sbjct: 61 FLSDMPAG 68

BLAST of CmoCh17G008950.1 vs. NCBI nr
Match: XP_022954080.1 (uncharacterized protein LOC111456452 [Cucurbita moschata])

HSP 1 Score: 163.7 bits (413), Expect = 7.1e-37
Identity = 87/87 (100.00%), Positives = 87/87 (100.00%), Query Frame = 0

Query: 1  MMMEIRHLEGMAETAAVFLLVQILVYLILSKSSHVFSKNTKRSLSFRQVRSLSIRRIMAA 60
          MMMEIRHLEGMAETAAVFLLVQILVYLILSKSSHVFSKNTKRSLSFRQVRSLSIRRIMAA
Sbjct: 1  MMMEIRHLEGMAETAAVFLLVQILVYLILSKSSHVFSKNTKRSLSFRQVRSLSIRRIMAA 60

Query: 61 LSDFPSGTGDPDESRGECGTMFDEGSS 88
          LSDFPSGTGDPDESRGECGTMFDEGSS
Sbjct: 61 LSDFPSGTGDPDESRGECGTMFDEGSS 87

BLAST of CmoCh17G008950.1 vs. NCBI nr
Match: XP_023548863.1 (uncharacterized protein LOC111807384 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 162.5 bits (410), Expect = 1.6e-36
Identity = 86/87 (98.85%), Positives = 87/87 (100.00%), Query Frame = 0

Query: 1  MMMEIRHLEGMAETAAVFLLVQILVYLILSKSSHVFSKNTKRSLSFRQVRSLSIRRIMAA 60
          MMMEIRHLEGMAETAAVFLLVQILVYLILSKSSHVFSKNTKRSLSFRQVRSL+IRRIMAA
Sbjct: 1  MMMEIRHLEGMAETAAVFLLVQILVYLILSKSSHVFSKNTKRSLSFRQVRSLTIRRIMAA 60

Query: 61 LSDFPSGTGDPDESRGECGTMFDEGSS 88
          LSDFPSGTGDPDESRGECGTMFDEGSS
Sbjct: 61 LSDFPSGTGDPDESRGECGTMFDEGSS 87

BLAST of CmoCh17G008950.1 vs. NCBI nr
Match: KAG6575656.1 (hypothetical protein SDJN03_26295, partial [Cucurbita argyrosperma subsp. sororia] >KAG7014206.1 hypothetical protein SDJN02_24381, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 160.2 bits (404), Expect = 7.9e-36
Identity = 85/87 (97.70%), Positives = 86/87 (98.85%), Query Frame = 0

Query: 1  MMMEIRHLEGMAETAAVFLLVQILVYLILSKSSHVFSKNTKRSLSFRQVRSLSIRRIMAA 60
          MMMEIRHLEGMAETAAVFLLVQILVYLILSKSSHVFSKNTKRSLSFRQVRSL+IRRIMAA
Sbjct: 1  MMMEIRHLEGMAETAAVFLLVQILVYLILSKSSHVFSKNTKRSLSFRQVRSLTIRRIMAA 60

Query: 61 LSDFPSGTGDPDESRGECGTMFDEGSS 88
          LSDFPSG GDPDESRGECGTMFDEGSS
Sbjct: 61 LSDFPSGDGDPDESRGECGTMFDEGSS 87

BLAST of CmoCh17G008950.1 vs. NCBI nr
Match: XP_022991948.1 (uncharacterized protein LOC111488440 [Cucurbita maxima])

HSP 1 Score: 139.4 bits (350), Expect = 1.4e-29
Identity = 75/76 (98.68%), Positives = 75/76 (98.68%), Query Frame = 0

Query: 1  MMMEIRHLEGMAETAAVFLLVQILVYLILSKSSHVFSKNTKRSLSFRQVRSLSIRRIMAA 60
          MMMEIRHLEGMAETAAVFLLVQILVYLILSKSSHVFSKNTKRSLSFRQVRSLSIRRIMAA
Sbjct: 1  MMMEIRHLEGMAETAAVFLLVQILVYLILSKSSHVFSKNTKRSLSFRQVRSLSIRRIMAA 60

Query: 61 LSDFPSGTGDPDESRG 77
          LSDFPSG GDPDESRG
Sbjct: 61 LSDFPSGAGDPDESRG 76

BLAST of CmoCh17G008950.1 vs. NCBI nr
Match: XP_022147203.1 (uncharacterized protein LOC111016209 [Momordica charantia])

HSP 1 Score: 106.3 bits (264), Expect = 1.4e-19
Identity = 67/95 (70.53%), Positives = 71/95 (74.74%), Query Frame = 0

Query: 1  MMMEIRHLEGMAETAAVFLLVQILVYLILSKSSHVFSKNTKRSLSFRQVRSLSIRRIMAA 60
          M MEI  LEG+  TAAVF LVQILVYLILSKSS VFSKNTKRSLSFR VRSLSIRRI+AA
Sbjct: 2  MTMEIIPLEGIVATAAVFFLVQILVYLILSKSSDVFSKNTKRSLSFRHVRSLSIRRILAA 61

Query: 61 LSDFPSG--TGDPDES------RGECGTMFDEGSS 88
          LSD PSG     P+ES       GEC  MF +GSS
Sbjct: 62 LSDVPSGGEPSPPNESPSPFSIGGECAAMFRDGSS 96

BLAST of CmoCh17G008950.1 vs. TAIR 10
Match: AT1G05575.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: anaerobic respiration; LOCATED IN: endomembrane system; EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 9 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G31945.1); Has 63 Blast hits to 63 proteins in 10 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 63; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 65.1 bits (157), Expect = 3.2e-11
Identity = 37/61 (60.66%), Positives = 44/61 (72.13%), Query Frame = 0

Query: 9  EGMAETAAVFLLVQILVYLILSKSSHVF--SKNTKRSLSFRQVRSLSIRRIMAALSDFPS 68
          EGM    ++F L+Q LVYLILSKSS VF  SK  KR+ SFR  RS+SIRRI+A L D P+
Sbjct: 6  EGMLMKVSIFALIQGLVYLILSKSSSVFSTSKTMKRAYSFRSARSISIRRILAVLQDMPA 65

BLAST of CmoCh17G008950.1 vs. TAIR 10
Match: AT2G31945.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 8 plant structures; EXPRESSED DURING: 4 anthesis, petal differentiation and expansion stage; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G05575.1); Has 61 Blast hits to 61 proteins in 10 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 61; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 61.2 bits (147), Expect = 4.7e-10
Identity = 40/79 (50.63%), Positives = 50/79 (63.29%), Query Frame = 0

Query: 11 MAETAAVFLLVQILVYLILSKSSHVF--SKNTKRSLSFRQVRSLSIRRIMAALSDFPSGT 70
          M    A+F LVQ LVYLIL KSS VF  S + KR+ SFR +RS+SIRRI+AAL D P+G 
Sbjct: 8  MLMKVALFALVQGLVYLILLKSSRVFARSNSLKRAYSFRPMRSVSIRRILAALQDIPAGD 67

Query: 71 GDPDESRGECGTMFDEGSS 88
               S G   +  DE ++
Sbjct: 68 DMSPSSNGSSSSSQDEAAT 86

The following BLAST results are available for this feature:

ExPASy TrEMBL
Match Name        E-value   Identity (%)   Description
A0A6J1GRF7        3.5e-37   100.00         uncharacterized protein LOC111456452 OS=Cucurbita moschata OX=3662 GN=LOC1114564...
A0A6J1JXQ0        7.0e-30   98.68          uncharacterized protein LOC111488440 OS=Cucurbita maxima OX=3661 GN=LOC111488440...
A0A6J1D0M4        6.6e-20   70.53          uncharacterized protein LOC111016209 OS=Momordica charantia OX=3673 GN=LOC111016...
A0A1U7YVL7        9.5e-11   63.77          uncharacterized protein LOC104587448 OS=Nelumbo nucifera OX=4432 GN=LOC104587448...
A0A660KP35        3.6e-10   61.76          Uncharacterized protein OS=Carpinus fangiana OX=176857 GN=FH972_010708 PE=4 SV=1

NCBI nr
Match Name        E-value   Identity (%)   Description
XP_022954080.1    7.1e-37   100.00         uncharacterized protein LOC111456452 [Cucurbita moschata]
XP_023548863.1    1.6e-36   98.85          uncharacterized protein LOC111807384 [Cucurbita pepo subsp. pepo]
KAG6575656.1      7.9e-36   97.70          hypothetical protein SDJN03_26295, partial [Cucurbita argyrosperma subsp. sorori...
XP_022991948.1    1.4e-29   98.68          uncharacterized protein LOC111488440 [Cucurbita maxima]
XP_022147203.1    1.4e-19   70.53          uncharacterized protein LOC111016209 [Momordica charantia]

TAIR 10
Match Name        E-value   Identity (%)   Description
AT1G05575.1       3.2e-11   60.66          unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: anaerobi...
AT2G31945.1       4.7e-10   50.63          unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term   IPR Description    Source        Source Term      Source Description     Alignment
None       No IPR available   MOBIDB_LITE   mobidb-lite      disorder_prediction    coord: 63..87
None       No IPR available   PANTHER       PTHR34268:SF16   OS02G0774200 PROTEIN   coord: 10..75
None       No IPR available   PANTHER       PTHR34268        OS01G0321850 PROTEIN   coord: 10..75

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name     Unique Name      Type
CmoCh17G008950   CmoCh17G008950   gene


The following exon feature(s) are a part of this mRNA:

Feature Name                 Unique Name                  Type
CmoCh17G008950.1:exon:2969   CmoCh17G008950.1:exon:2969   exon


The following CDS feature(s) are a part of this mRNA:

Feature Name           Unique Name            Type
CmoCh17G008950.1:cds   CmoCh17G008950.1:cds   CDS


The following polypeptide feature(s) derive from this mRNA:

Feature Name       Unique Name                Type
CmoCh17G008950.1   CmoCh17G008950.1-protein   polypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category             Term Accession   Term Name
cellular_component   GO:0016021       integral component of membrane