CmoCh04G020840 (gene) Cucurbita moschata (Rifu)

Name: CmoCh04G020840
Type: gene
Organism: Cucurbita moschata (Rifu)
Description: Eukaryotic aspartyl protease family protein
Location: Cmo_Chr04 : 12539028 .. 12539655 (+)
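
The location string above can be split into its parts and turned into a span length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF3; the page itself does not state the convention). The function names `parse_location` and `span_length` are hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cmo_Chr04 : 12539028 .. 12539655 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 628 bp on the plus strand of Cmo_Chr04.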
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTGGGACCTCCGGCCAAGGGGAACGTGGTTCTCGATATCGGCACGCCGCCGACTTTCCTCCCCAAAGAATTGTACGGACGATTGGCTGCCAAAGTTCGGCAGCATATCCCGTCGAAGCCCATTGATGATACTCTTTGCTACAAATATAATTTGGGGGATTTGGTGATGACTCTACACTTCGACTGCGGTGTGGATTTGCGATTGAGTACGGTTCAGACTTTCAATAAGATGCCGGATGGGTCCTTTTACTTCACCGCGATGGGCGTTGACGAGAAGGACGCACTCATTGAAAATAGTATGATGGCGAATTTTTTGGTTGGGTATGATATTGACAATATGACGGTGTCATTTAAGCCCACTGATTGCACCAAAATTGGTTGAGATTTTTTGGGCCTATGAAAACAATATGCTATTTTATAACTATTATTACTTTTCTCTTAACACTTTTGATGCAACGTTAAGCTTGTTTTATTCATAAAATTTGGAAAGCATTTATTAATATATGCACAACTATATTCTACTTTATTCAGTATTCTTTGATTTCTTTATAGTAATTTTCGGTAACAAATTTTAAGGTAAAATTAGACATTAATTTTTAATTATTTTTCTTTAAGGTGTGAAATAA

mRNA sequence

ATGTTGGGACCTCCGGCCAAGGGGAACGTGGTTCTCGATATCGGCACGCCGCCGACTTTCCTCCCCAAAGAATTGTACGGACGATTGGCTGCCAAAGTTCGGCAGCATATCCCGTCGAAGCCCATTGATGATACTCTTTGCTACAAATATAATTTGGGGGATTTGGTGATGACTCTACACTTCGACTGCGGTGTGGATTTGCGATTGAGTACGGTTCAGACTTTCAATAAGATGCCGGATGGGTCCTTTTACTTCACCGCGATGGGCGTTGACGAGAAGGACGCACTCATTGAAAATAGTGTGAAATAA

Coding sequence (CDS)

ATGTTGGGACCTCCGGCCAAGGGGAACGTGGTTCTCGATATCGGCACGCCGCCGACTTTCCTCCCCAAAGAATTGTACGGACGATTGGCTGCCAAAGTTCGGCAGCATATCCCGTCGAAGCCCATTGATGATACTCTTTGCTACAAATATAATTTGGGGGATTTGGTGATGACTCTACACTTCGACTGCGGTGTGGATTTGCGATTGAGTACGGTTCAGACTTTCAATAAGATGCCGGATGGGTCCTTTTACTTCACCGCGATGGGCGTTGACGAGAAGGACGCACTCATTGAAAATAGTGTGAAATAA
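
The coding sequence can be checked against the protein-level BLAST alignments by translating it. The sketch below uses the standard genetic code (NCBI translation table 1), which is an assumption for this nuclear gene; `translate` is a hypothetical helper, and only the first 60 and last 18 nucleotides of the CDS above are used to keep the example short.

```python
# Standard genetic code (NCBI table 1), codons ordered by T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 60 and last 18 nucleotides of the CDS listed above.
cds_start = "ATGTTGGGACCTCCGGCCAAGGGGAACGTGGTTCTCGATATCGGCACGCCGCCGACTTTC"
cds_end = "GAAAATAGTGTGAAATAA"

print(translate(cds_start))
print(translate(cds_end))
```

The translated start contains the residues `GPPAKGNVVLDIGTPPTF` that open the query lines of the BLAST alignments for this gene, and the translated end finishes with a stop codon.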
BLAST of CmoCh04G020840 vs. Swiss-Prot
Match: CDR1_ARATH (Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1)

HSP 1 Score: 51.2 bits (121), Expect = 7.7e-06
Identity = 31/92 (33.70%), Positives = 47/92 (51.09%), Query Frame = 1

Query: 6   AKGNVVLDIGTPPTFLPKELYGRLAAKVRQHIPSKPIDD-----TLCYKYNLGDL---VM 65
           ++GN+++D GT  T LP E Y  L   V   I ++   D     +LCY    GDL   V+
Sbjct: 312 SEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSAT-GDLKVPVI 371

Query: 66  TLHFDCGVDLRLSTVQTFNKMPDGSFYFTAMG 90
           T+HFD G D++L +   F ++ +    F   G
Sbjct: 372 TMHFD-GADVKLDSSNAFVQVSEDLVCFAFRG 401

BLAST of CmoCh04G020840 vs. TrEMBL
Match: M5WJE5_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa022155mg PE=3 SV=1)

HSP 1 Score: 77.0 bits (188), Expect = 1.5e-11
Identity = 41/105 (39.05%), Positives = 59/105 (56.19%), Query Frame = 1

Query: 3   GPPAKGNVVLDIGTPPTFLPKELYGRLAAKVRQHIPSKPIDD------TLCY--KYNLGD 62
           G  +KGN+ +D GTPPT LP++ Y RL A+V+  IP  PI++       LCY  K NL  
Sbjct: 215 GKVSKGNMFMDTGTPPTLLPQDFYDRLVAEVKNQIPMAPIENDPSLATQLCYNSKTNLEG 274

Query: 63  LVMTLHFDCGVDLRLSTVQTFNKMPDGSFYFTAMGVDEKDALIEN 100
            ++T+HF+ G D++L+  QTF    D  F  +A  V     +  N
Sbjct: 275 PILTVHFE-GADVKLTPTQTFISPRDEVFCLSAQNVTSDGGIYGN 318

BLAST of CmoCh04G020840 vs. TrEMBL
Match: A0A0B2R0K5_GLYSO (Putative aspartic protease OS=Glycine soja GN=glysoja_018172 PE=3 SV=1)

HSP 1 Score: 74.7 bits (182), Expect = 7.2e-11
Identity = 40/85 (47.06%), Positives = 51/85 (60.00%), Query Frame = 1

Query: 7   KGNVVLDIGTPPTFLPKELYGRLAAKVRQHIPSKPIDD------TLCY--KYNLGDLVMT 66
           KGN+ LD GTPPT LP +LY ++ A+VR  +  KP+ D       LCY  K NL   V+T
Sbjct: 277 KGNMFLDSGTPPTILPTQLYDQVVAQVRSEVAMKPVTDDPDLGPQLCYRTKNNLRGPVLT 336

Query: 67  LHFDCGVDLRLSTVQTFNKMPDGSF 84
            HF+ G D++LS  QTF    DG F
Sbjct: 337 AHFE-GADVKLSPTQTFISPKDGVF 360

BLAST of CmoCh04G020840 vs. TrEMBL
Match: K7K5E1_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_01G230100 PE=3 SV=1)

HSP 1 Score: 74.7 bits (182), Expect = 7.2e-11
Identity = 40/85 (47.06%), Positives = 51/85 (60.00%), Query Frame = 1

Query: 7   KGNVVLDIGTPPTFLPKELYGRLAAKVRQHIPSKPIDD------TLCY--KYNLGDLVMT 66
           KGN+ LD GTPPT LP +LY ++ A+VR  +  KP+ D       LCY  K NL   V+T
Sbjct: 295 KGNMFLDSGTPPTILPTQLYDQVVAQVRSEVAMKPVTDDPDLGPQLCYRTKNNLRGPVLT 354

Query: 67  LHFDCGVDLRLSTVQTFNKMPDGSF 84
            HF+ G D++LS  QTF    DG F
Sbjct: 355 AHFE-GADVKLSPTQTFISPKDGVF 378

BLAST of CmoCh04G020840 vs. TrEMBL
Match: A0A151SD18_CAJCA (Aspartic proteinase nepenthesin-1 OS=Cajanus cajan GN=KK1_025419 PE=3 SV=1)

HSP 1 Score: 74.7 bits (182), Expect = 7.2e-11
Identity = 39/102 (38.24%), Positives = 57/102 (55.88%), Query Frame = 1

Query: 6   AKGNVVLDIGTPPTFLPKELYGRLAAKVRQHIPSKPIDD------TLCYK--YNLGDLVM 65
           +KGN+++D GTP T+LP+E Y RL  +++      PIDD       LCY+   NL   ++
Sbjct: 259 SKGNIMIDSGTPATYLPQEFYDRLVEELKVQSSLLPIDDDPDLGTQLCYRSETNLEGPIL 318

Query: 66  TLHFDCGVDLRLSTVQTFNKMPDGSFYFTAMGVDEKDALIEN 100
           T HF+ G D++L  +QTF    DG F F   G  + D +  N
Sbjct: 319 TAHFE-GADVQLMPIQTFIPPKDGVFCFAMAGTTDGDYIFGN 359

BLAST of CmoCh04G020840 vs. TrEMBL
Match: A5BCG4_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_006732 PE=3 SV=1)

HSP 1 Score: 73.9 bits (180), Expect = 1.2e-10
Identity = 37/103 (35.92%), Positives = 56/103 (54.37%), Query Frame = 1

Query: 7   KGNVVLDIGTPPTFLPKELYGRLAAKVRQHIPSKPIDD-----TLCYKYN--LGDLVMTL 66
           KGNV +D GTPPT LP++ Y RL   V++ IP +P+ D      LCY+    +   ++T 
Sbjct: 314 KGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYRSATLIDGPILTA 373

Query: 67  HFDCGVDLRLSTVQTFNKMPDGSFYFTAMGVDEKDALIENSVK 103
           HFD G D++L  + TF    +G + F    +D    +  N V+
Sbjct: 374 HFD-GADVQLKPLNTFISPKEGVYCFAMQPIDGDTGIFGNFVQ 415

BLAST of CmoCh04G020840 vs. TAIR10
Match: AT5G33340.1 (AT5G33340.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 51.2 bits (121), Expect = 4.3e-07
Identity = 31/92 (33.70%), Positives = 47/92 (51.09%), Query Frame = 1

Query: 6   AKGNVVLDIGTPPTFLPKELYGRLAAKVRQHIPSKPIDD-----TLCYKYNLGDL---VM 65
           ++GN+++D GT  T LP E Y  L   V   I ++   D     +LCY    GDL   V+
Sbjct: 312 SEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSAT-GDLKVPVI 371

Query: 66  TLHFDCGVDLRLSTVQTFNKMPDGSFYFTAMG 90
           T+HFD G D++L +   F ++ +    F   G
Sbjct: 372 TMHFD-GADVKLDSSNAFVQVSEDLVCFAFRG 401

BLAST of CmoCh04G020840 vs. NCBI nr
Match: gi|764596024|ref|XP_011465972.1| (PREDICTED: aspartic proteinase CDR1-like [Fragaria vesca subsp. vesca])

HSP 1 Score: 80.1 bits (196), Expect = 2.5e-12
Identity = 42/105 (40.00%), Positives = 58/105 (55.24%), Query Frame = 1

Query: 3   GPPAKGNVVLDIGTPPTFLPKELYGRLAAKVRQHIPSKPI------DDTLCYK--YNLGD 62
           G   +GN+ LD GTPPT +P++ Y RLAA+V+  IP  PI         LCYK   NL  
Sbjct: 300 GQVLEGNMFLDSGTPPTLIPQDFYNRLAAEVKNQIPMAPIVGDPSLGSQLCYKTPTNLKG 359

Query: 63  LVMTLHFDCGVDLRLSTVQTFNKMPDGSFYFTAMGVDEKDALIEN 100
            ++T+HF+   ++ L+ +QTF    DG F F   GV     +I N
Sbjct: 360 PILTVHFNGSANIVLTPIQTFIPPKDGVFCFAMQGVASDGGIIGN 404

BLAST of CmoCh04G020840 vs. NCBI nr
Match: gi|470140940|ref|XP_004306194.1| (PREDICTED: aspartic proteinase CDR1-like [Fragaria vesca subsp. vesca])

HSP 1 Score: 80.1 bits (196), Expect = 2.5e-12
Identity = 41/97 (42.27%), Positives = 58/97 (59.79%), Query Frame = 1

Query: 3   GPPAKGNVVLDIGTPPTFLPKELYGRLAAKVRQHIPSKPIDD------TLCY--KYNLGD 62
           G   KGN+ LD GTPPT++P +LY RL A++ + IP  PI D       LCY  K NL  
Sbjct: 289 GKVEKGNIFLDSGTPPTYIPTDLYERLVAELGKQIPMAPIKDDPDLGNQLCYKTKTNLKG 348

Query: 63  LVMTLHFDCGVDLRLSTVQTFNKMPDGSFYFTAMGVD 92
            ++T+HF+ G +++L++ QTF    D  F F  +G D
Sbjct: 349 PILTVHFEGGTNVKLTSTQTFIPPKDEVFCFAMIGDD 385

BLAST of CmoCh04G020840 vs. NCBI nr
Match: gi|694393472|ref|XP_009372173.1| (PREDICTED: aspartic proteinase CDR1-like [Pyrus x bretschneideri])

HSP 1 Score: 79.7 bits (195), Expect = 3.2e-12
Identity = 42/96 (43.75%), Positives = 57/96 (59.38%), Query Frame = 1

Query: 3   GPPAKGNVVLDIGTPPTFLPKELYGRLAAKVRQHIPSKPIDD------TLCY--KYNLGD 62
           G  +KGN+ +D GTPPT +P++ Y RL A+VR  IP  PI D       LCY  K NL  
Sbjct: 295 GEVSKGNMFMDTGTPPTLIPQDFYDRLVAEVRSQIPMTPIGDDPSLGTQLCYKSKTNLKG 354

Query: 63  LVMTLHFDCGVDLRLSTVQTFNKMPDGSFYFTAMGV 91
            ++T+HF+ G D++L+T+QTF    D  F F    V
Sbjct: 355 PILTVHFE-GADVKLTTIQTFVPPKDEVFCFAMQTV 389

BLAST of CmoCh04G020840 vs. NCBI nr
Match: gi|1012218313|ref|XP_015935066.1| (PREDICTED: aspartic proteinase CDR1-like [Arachis duranensis])

HSP 1 Score: 77.4 bits (189), Expect = 1.6e-11
Identity = 42/86 (48.84%), Positives = 53/86 (61.63%), Query Frame = 1

Query: 6   AKGNVVLDIGTPPTFLPKELYGRLAAKVRQHIPSKPIDD------TLCY--KYNLGDLVM 65
           AKGN+ LD GTPPT LP++LY RL A+VR  +  +PI D       LCY  K NL   ++
Sbjct: 288 AKGNMFLDSGTPPTILPQDLYDRLVAEVRNQVKMEPIKDDPQLGNQLCYKTKTNLRGPML 347

Query: 66  TLHFDCGVDLRLSTVQTFNKMPDGSF 84
           T HF   V+L+LS +QTF    DG F
Sbjct: 348 TAHFFDHVNLQLSPIQTFIPPKDGVF 373

BLAST of CmoCh04G020840 vs. NCBI nr
Match: gi|595841136|ref|XP_007208154.1| (hypothetical protein PRUPE_ppa022155mg [Prunus persica])

HSP 1 Score: 77.0 bits (188), Expect = 2.1e-11
Identity = 41/105 (39.05%), Positives = 59/105 (56.19%), Query Frame = 1

Query: 3   GPPAKGNVVLDIGTPPTFLPKELYGRLAAKVRQHIPSKPIDD------TLCY--KYNLGD 62
           G  +KGN+ +D GTPPT LP++ Y RL A+V+  IP  PI++       LCY  K NL  
Sbjct: 215 GKVSKGNMFMDTGTPPTLLPQDFYDRLVAEVKNQIPMAPIENDPSLATQLCYNSKTNLEG 274

Query: 63  LVMTLHFDCGVDLRLSTVQTFNKMPDGSFYFTAMGVDEKDALIEN 100
            ++T+HF+ G D++L+  QTF    D  F  +A  V     +  N
Sbjct: 275 PILTVHFE-GADVKLTPTQTFISPRDEVFCLSAQNVTSDGGIYGN 318

The following BLAST results are available for this feature:
Database    Match Name                         E-value  Identity (%)  Description
Swiss-Prot  CDR1_ARATH                         7.7e-06  33.70         Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1
TrEMBL      M5WJE5_PRUPE                       1.5e-11  39.05         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa022155mg PE=3 SV=1
TrEMBL      A0A0B2R0K5_GLYSO                   7.2e-11  47.06         Putative aspartic protease OS=Glycine soja GN=glysoja_018172 PE=3 SV=1
TrEMBL      K7K5E1_SOYBN                       7.2e-11  47.06         Uncharacterized protein OS=Glycine max GN=GLYMA_01G230100 PE=3 SV=1
TrEMBL      A0A151SD18_CAJCA                   7.2e-11  38.24         Aspartic proteinase nepenthesin-1 OS=Cajanus cajan GN=KK1_025419 PE=3 SV=1
TrEMBL      A5BCG4_VITVI                       1.2e-10  35.92         Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_006732 PE=3 SV=1
TAIR10      AT5G33340.1                        4.3e-07  33.70         Eukaryotic aspartyl protease family protein
NCBI nr     gi|764596024|ref|XP_011465972.1|   2.5e-12  40.00         PREDICTED: aspartic proteinase CDR1-like [Fragaria vesca subsp. vesca]
NCBI nr     gi|470140940|ref|XP_004306194.1|   2.5e-12  42.27         PREDICTED: aspartic proteinase CDR1-like [Fragaria vesca subsp. vesca]
NCBI nr     gi|694393472|ref|XP_009372173.1|   3.2e-12  43.75         PREDICTED: aspartic proteinase CDR1-like [Pyrus x bretschneideri]
NCBI nr     gi|1012218313|ref|XP_015935066.1|  1.6e-11  48.84         PREDICTED: aspartic proteinase CDR1-like [Arachis duranensis]
NCBI nr     gi|595841136|ref|XP_007208154.1|   2.1e-11  39.05         hypothetical protein PRUPE_ppa022155mg [Prunus persica]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR001461   Aspartic_peptidase_A1
IPR021109   Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0004190  aspartic-type endopeptidase activity
Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0004190      aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh04G020840.1  CmoCh04G020840.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description               Source   Source Term       Source Description  Alignment
IPR001461  Aspartic peptidase A1 family  PANTHER  PTHR13683         ASPARTYL PROTEASES  coord: 7..90, score: 1.1
IPR021109  Aspartic peptidase domain     GENE3D   G3DSA:2.40.70.10                      coord: 7..92, score: 7.
IPR021109  Aspartic peptidase domain     unknown  SSF50630          Acid proteases      coord: 7..92, score: 1.
None       No IPR available              PANTHER  PTHR13683:SF309   SUBFAMILY NOT NAMED coord: 7..90, score: 1.1

The following gene(s) are orthologous to this gene:
Gene            Orthologue       Organism                   Block
CmoCh04G020840  Cp4.1LG01g17390  Cucurbita pepo (Zucchini)  cmocpeB673
CmoCh04G020840  Cp4.1LG01g17240  Cucurbita pepo (Zucchini)  cmocpeB680
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene            Organism                   Block
CmoCh04G020840  Cucurbita maxima (Rimu)    cmacmoB741
CmoCh04G020840  Cucurbita pepo (Zucchini)  cmocpeB682