Cp4.1LG01g01350 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g01350
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Eukaryotic aspartyl protease family protein
Location: Cp4.1LG01 : 3043619 .. 3043927 (-)
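As a quick consistency check on the location, the span can be converted to a length and compared with the sequences listed in this record. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state; the function name is illustrative only.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


length = feature_length(3043619, 3043927)
print(length)           # 309
print(length % 3 == 0)  # True: a whole number of codons
```

Under that assumption the span is 309 nt, that is 103 codons, which is consistent with a 102-residue protein followed by a stop codon.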
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCCGTTGCAAACAAGCGATTTAAGGCCACGAACGACATGCCGGTCGCCGCAAAACAAGGAAATATCCTTATCAATTTTGGTACAACATTGACGATTATGCCCCCAAATTTGTACAAACGTGTCACTTTGACATTGGCCCATGTTGGTAAAGCGAAGCGAGTGCATGATTCGATTAGGGTTTTGGATCTCTGCTTCGCTGCGTGCAGCGTTGATCATTTGAATATTCCGGTCATTACGACACATTTTGCTGGCGGCAACGACGTGAAATTGTTATCGTTGAATATATTTGCAATGGTGGCAAAATAA

mRNA sequence

ATGTCCGTTGCAAACAAGCGATTTAAGGCCACGAACGACATGCCGGTCGCCGCAAAACAAGGAAATATCCTTATCAATTTTGGTACAACATTGACGATTATGCCCCCAAATTTGTACAAACGTGTCACTTTGACATTGGCCCATGTTGGTAAAGCGAAGCGAGTGCATGATTCGATTAGGGTTTTGGATCTCTGCTTCGCTGCGTGCAGCGTTGATCATTTGAATATTCCGGTCATTACGACACATTTTGCTGGCGGCAACGACGTGAAATTGTTATCGTTGAATATATTTGCAATGGTGGCAAAATAA

Coding sequence (CDS)

ATGTCCGTTGCAAACAAGCGATTTAAGGCCACGAACGACATGCCGGTCGCCGCAAAACAAGGAAATATCCTTATCAATTTTGGTACAACATTGACGATTATGCCCCCAAATTTGTACAAACGTGTCACTTTGACATTGGCCCATGTTGGTAAAGCGAAGCGAGTGCATGATTCGATTAGGGTTTTGGATCTCTGCTTCGCTGCGTGCAGCGTTGATCATTTGAATATTCCGGTCATTACGACACATTTTGCTGGCGGCAACGACGTGAAATTGTTATCGTTGAATATATTTGCAATGGTGGCAAAATAA

Protein sequence

MSVANKRFKATNDMPVAAKQGNILINFGTTLTIMPPNLYKRVTLTLAHVGKAKRVHDSIRVLDLCFAACSVDHLNIPVITTHFAGGNDVKLLSLNIFAMVAK
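The protein sequence can be reproduced from the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code; the helper name `translate` is hypothetical, and only the first six codons of the CDS (plus a stop codon) are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS above, followed by a stop codon.
print(translate("ATGTCCGTTGCAAACAAGTAA"))  # MSVANK
```

Passing the full CDS string to the same function should give the protein sequence listed above, provided the standard code applies.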
BLAST of Cp4.1LG01g01350 vs. Swiss-Prot
Match: CDR1_ARATH (Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1)

HSP 1 Score: 57.4 bits (137), Expect = 1.1e-07
Identity = 34/85 (40.00%), Positives = 49/85 (57.65%), Query Frame = 1

Query: 18  AKQGNILINFGTTLTIMPPNLYKRVTLTLAHVGKAKRVHDSIRVLDLCFAACSVDHLNIP 77
           + +GNI+I+ GTTLT++P   Y  +   +A    A++  D    L LC++A     L +P
Sbjct: 311 SSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSA--TGDLKVP 370

Query: 78  VITTHFAGGNDVKLLSLNIFAMVAK 103
           VIT HF G  DVKL S N F  V++
Sbjct: 371 VITMHFDGA-DVKLDSSNAFVQVSE 392
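The percentages on each HSP statistics line are the identical (or positive-scoring) positions divided by the alignment length. A small sketch for pulling those numbers out of such a line and recomputing the percentages is shown below; the function name and the returned keys are illustrative, not part of any BLAST tool.

```python
import re

# "Posi?tives" accepts the label with or without the first "i".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts from an HSP statistics line and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    identical, aligned, _, positives, _, _ = match.groups()
    identical, aligned, positives = int(identical), int(aligned), int(positives)
    return {
        "identical": identical,
        "positives": positives,
        "aligned": aligned,
        "identity_pct": round(100 * identical / aligned, 2),
        "positives_pct": round(100 * positives / aligned, 2),
    }


stats = parse_hsp_stats(
    "Identity = 34/85 (40.00%), Positives = 49/85 (57.65%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positives_pct"])  # 40.0 57.65
```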

BLAST of Cp4.1LG01g01350 vs. TrEMBL
Match: A0A0A0KZZ3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055410 PE=3 SV=1)

HSP 1 Score: 111.3 bits (277), Expect = 7.0e-22
Identity = 58/101 (57.43%), Positives = 71/101 (70.30%), Query Frame = 1

Query: 1   MSVANKRFKATNDMPVAAKQGNILINFGTTLTIMPPNLYKRVTLTLAHVGKAKRVHDSIR 60
           +SV  KRFKA N +      GNI+I+ GTTLT++P +LY  V  TLA V KAKRV D   
Sbjct: 290 ISVGKKRFKAANGISAMTNHGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSG 349

Query: 61  VLDLCFAACSVDHLNIPVITTHFAGGNDVKLLSLNIFAMVA 102
           +L+LC++A  VD LNIP+IT HFAGG DVKLL +N FA VA
Sbjct: 350 ILELCYSAGQVDDLNIPIITAHFAGGADVKLLPVNTFAPVA 390

BLAST of Cp4.1LG01g01350 vs. TrEMBL
Match: A0A0A0KV20_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055400 PE=3 SV=1)

HSP 1 Score: 82.0 bits (201), Expect = 4.5e-13
Identity = 53/106 (50.00%), Positives = 64/106 (60.38%), Query Frame = 1

Query: 1   MSVANKRFKATNDMPVAAKQGNILINFGTTLTIMPPNLYKRVTLTLAHVGKAKRVHDSIR 60
           +S+ N+R  A       AKQGN++I+ GTTLTI+P  LY  V  +L  V KAKRV D   
Sbjct: 287 ISIGNERHMAF------AKQGNVIIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKDPHG 346

Query: 61  VLDLCF-----AACSVDHLNIPVITTHFAGGNDVKLLSLNIFAMVA 102
            LDLCF     AA S   L IPVIT HF+GG +V LL +N F  VA
Sbjct: 347 SLDLCFDDGINAAAS---LGIPVITAHFSGGANVNLLPINTFRKVA 383

BLAST of Cp4.1LG01g01350 vs. TrEMBL
Match: M5WRG3_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025167mg PE=3 SV=1)

HSP 1 Score: 79.7 bits (195), Expect = 2.2e-12
Identity = 46/104 (44.23%), Positives = 61/104 (58.65%), Query Frame = 1

Query: 1   MSVANKRFKATNDMP------VAAKQGNILINFGTTLTIMPPNLYKRVTLTLAHVGKAKR 60
           +SV  KR       P      VAA +GNI+I+ GTTLT++PP  +  +   L     A+R
Sbjct: 306 ISVGEKRLAYKTKSPDCEKAAVAANEGNIIIDSGTTLTLLPPGFHDDLVSALETAINAER 365

Query: 61  VHDSIRVLDLCFAACSVDHLNIPVITTHFAGGNDVKLLSLNIFA 99
           V D   +L LCF + S D + +PVIT HF+GG DVKL +LN FA
Sbjct: 366 VSDPRGILSLCFKSKS-DDIGVPVITVHFSGGADVKLQALNTFA 408

BLAST of Cp4.1LG01g01350 vs. TrEMBL
Match: A0A0L9V285_PHAAN (Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan08g006700 PE=3 SV=1)

HSP 1 Score: 75.1 bits (183), Expect = 5.5e-11
Identity = 43/99 (43.43%), Positives = 59/99 (59.60%), Query Frame = 1

Query: 2   SVANKRFKATNDMPVAAKQGNILINFGTTLTIMPPNLYKRVTLTLAHVGKAKRVHDSIRV 61
           SV NKR K  +    +   GNI+I  G+TLT++P ++Y ++   +AH  K KRV D  + 
Sbjct: 281 SVGNKRIKFGSSSSESDVDGNIIIGSGSTLTLLPDDVYSKLESAVAHEVKLKRVKDPSKQ 340

Query: 62  LDLCFAACSVDHLNIPVITTHFAGGNDVKLLSLNIFAMV 101
           L LC+ + S D LN PVI  HF G  DVKL ++N F  V
Sbjct: 341 LSLCYES-SFDDLNAPVIVAHFRGA-DVKLNAVNTFVEV 377

BLAST of Cp4.1LG01g01350 vs. TrEMBL
Match: A0A151TPR6_CAJCA (Aspartic proteinase nepenthesin-1 OS=Cajanus cajan GN=KK1_022702 PE=3 SV=1)

HSP 1 Score: 74.7 bits (182), Expect = 7.2e-11
Identity = 39/82 (47.56%), Positives = 55/82 (67.07%), Query Frame = 1

Query: 20  QGNILINFGTTLTIMPPNLYKRVTLTLAHVGKAKRVHDSIRVLDLCFAACSVDHLNIPVI 79
           +GNI+I+ GTTLT +PP++Y +    +A V K KRV D  ++  LC+ A ++D+LN P+I
Sbjct: 127 EGNIIIDSGTTLTFLPPDVYSKFESAVAQVVKLKRVQDPTQLFSLCYKA-TLDNLNAPMI 186

Query: 80  TTHFAGGNDVKLLSLNIFAMVA 102
           T HF G  DV L S+N F  VA
Sbjct: 187 TAHFRGA-DVGLNSINTFTQVA 206

BLAST of Cp4.1LG01g01350 vs. TAIR10
Match: AT1G64830.1 (AT1G64830.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 68.2 bits (165), Expect = 3.4e-12
Identity = 41/102 (40.20%), Positives = 61/102 (59.80%), Query Frame = 1

Query: 1   MSVANKRFKATNDMPVAAKQGNILINFGTTLTIMPPNLYKRVTLTLAHVGKAKRVHDSIR 60
           +SV +K+ + T+ +     +GNI+I+ GTTLT++P N Y  +   +A   KA+RV D   
Sbjct: 289 ISVGSKKIQFTSTI-FGTGEGNIVIDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDG 348

Query: 61  VLDLCFAACSVDHLNIPVITTHFAGGNDVKLLSLNIFAMVAK 103
           +L LC+   S     +P IT HF GG DVKL +LN F  V++
Sbjct: 349 ILSLCYRDSS--SFKVPDITVHFKGG-DVKLGNLNTFVAVSE 386

BLAST of Cp4.1LG01g01350 vs. TAIR10
Match: AT5G33340.1 (AT5G33340.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 57.4 bits (137), Expect = 6.0e-09
Identity = 34/85 (40.00%), Positives = 49/85 (57.65%), Query Frame = 1

Query: 18  AKQGNILINFGTTLTIMPPNLYKRVTLTLAHVGKAKRVHDSIRVLDLCFAACSVDHLNIP 77
           + +GNI+I+ GTTLT++P   Y  +   +A    A++  D    L LC++A     L +P
Sbjct: 311 SSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSA--TGDLKVP 370

Query: 78  VITTHFAGGNDVKLLSLNIFAMVAK 103
           VIT HF G  DVKL S N F  V++
Sbjct: 371 VITMHFDGA-DVKLDSSNAFVQVSE 392

BLAST of Cp4.1LG01g01350 vs. TAIR10
Match: AT2G28010.1 (AT2G28010.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 51.6 bits (122), Expect = 3.3e-07
Identity = 36/97 (37.11%), Positives = 51/97 (52.58%), Query Frame = 1

Query: 1   MSVANKRFKATNDMPVAAKQGNILINFGTTLTIMPPNLYKRVTLTLAHVGKAKRVHDSIR 60
           +SV N R + T      A +GNI+I+ GTTLT  P +    V   + HV  A R  D   
Sbjct: 248 VSVGNTRIE-TMGTTFHALEGNIVIDSGTTLTYFPVSYCNLVRQAVEHVVTAVRAADPTG 307

Query: 61  VLDLCFAACSVDHLNIPVITTHFAGGNDVKLLSLNIF 98
              LC+ + ++D    PVIT HF+GG D+ L   N++
Sbjct: 308 NDMLCYNSDTID--IFPVITMHFSGGVDLVLDKYNMY 341

BLAST of Cp4.1LG01g01350 vs. TAIR10
Match: AT2G35615.1 (AT2G35615.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 50.4 bits (119), Expect = 7.4e-07
Identity = 31/96 (32.29%), Positives = 53/96 (55.21%), Query Frame = 1

Query: 8   FKATNDMPVAAKQGNILINFGTTLTIMPPNLYKRVTLTLAH-VGKAKRVHDSIRVLDLCF 67
           +   +D  ++   GNI+I+ GTTLT++    + + +  +   V  AKRV D   +L  CF
Sbjct: 308 YNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGLLSHCF 367

Query: 68  AACSVDHLNIPVITTHFAGGNDVKLLSLNIFAMVAK 103
            + S + + +P IT HF G  DV+L  +N F  +++
Sbjct: 368 KSGSAE-IGLPEITVHFTGA-DVRLSPINAFVKLSE 401

BLAST of Cp4.1LG01g01350 vs. TAIR10
Match: AT2G28220.1 (AT2G28220.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 49.3 bits (116), Expect = 1.6e-06
Identity = 33/88 (37.50%), Positives = 46/88 (52.27%), Query Frame = 1

Query: 10  ATNDMPVAAKQGNILINFGTTLTIMPPNLYKRVTLTLAHVGKAKRVHDSIRVLDLCFAAC 69
           AT   P  A+ GNI I+ GTTLT  P +    V   +  V  A +V D      LC+ + 
Sbjct: 616 ATLGTPFHAEDGNIFIDSGTTLTYFPMSYCNLVREAVEQVVTAVKVPDMGSDNLLCYYSD 675

Query: 70  SVDHLNIPVITTHFAGGNDVKLLSLNIF 98
           ++D    PVIT HF+GG D+ L   N++
Sbjct: 676 TID--IFPVITMHFSGGADLVLDKYNMY 701

BLAST of Cp4.1LG01g01350 vs. NCBI nr
Match: gi|449462551|ref|XP_004149004.1| (PREDICTED: probable aspartic protease At2g35615 [Cucumis sativus])

HSP 1 Score: 111.3 bits (277), Expect = 1.0e-21
Identity = 58/101 (57.43%), Positives = 71/101 (70.30%), Query Frame = 1

Query: 1   MSVANKRFKATNDMPVAAKQGNILINFGTTLTIMPPNLYKRVTLTLAHVGKAKRVHDSIR 60
           +SV  KRFKA N +      GNI+I+ GTTLT++P +LY  V  TLA V KAKRV D   
Sbjct: 290 ISVGKKRFKAANGISAMTNHGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSG 349

Query: 61  VLDLCFAACSVDHLNIPVITTHFAGGNDVKLLSLNIFAMVA 102
           +L+LC++A  VD LNIP+IT HFAGG DVKLL +N FA VA
Sbjct: 350 ILELCYSAGQVDDLNIPIITAHFAGGADVKLLPVNTFAPVA 390

BLAST of Cp4.1LG01g01350 vs. NCBI nr
Match: gi|659102472|ref|XP_008452150.1| (PREDICTED: probable aspartic protease At2g35615 [Cucumis melo])

HSP 1 Score: 110.2 bits (274), Expect = 2.2e-21
Identity = 56/101 (55.45%), Positives = 71/101 (70.30%), Query Frame = 1

Query: 1   MSVANKRFKATNDMPVAAKQGNILINFGTTLTIMPPNLYKRVTLTLAHVGKAKRVHDSIR 60
           +SV NKRFKA  DM     QGNI+I+ GTTLT++P +LY  V  TLA V K KRV D   
Sbjct: 290 ISVGNKRFKAAKDMSAMTNQGNIIIDSGTTLTLLPRSLYDGVVSTLARVIKTKRVDDPSG 349

Query: 61  VLDLCFAACSVDHLNIPVITTHFAGGNDVKLLSLNIFAMVA 102
           +L+LC++A  ++ LNIP+IT HF+G  DVKLL +N FA VA
Sbjct: 350 ILELCYSAGQLEDLNIPIITAHFSGRADVKLLPVNTFAPVA 390

BLAST of Cp4.1LG01g01350 vs. NCBI nr
Match: gi|470131788|ref|XP_004301773.1| (PREDICTED: aspartic proteinase CDR1 [Fragaria vesca subsp. vesca])

HSP 1 Score: 82.8 bits (203), Expect = 3.8e-13
Identity = 46/102 (45.10%), Positives = 64/102 (62.75%), Query Frame = 1

Query: 1   MSVANKR--FKATNDMPVAAKQGNILINFGTTLTIMPPNLYKRVTLTLAHVGKAKRVHDS 60
           +SV  K+  +K+ ++  VA  +GNI+I+ GTTLT++PP  +  V   L     A+RV D 
Sbjct: 286 ISVGEKKVLYKSQSNKAVAGSEGNIIIDSGTTLTLLPPGFHDDVVAALEAAINAERVSDP 345

Query: 61  IRVLDLCFAACSVDHLNIPVITTHFAGGNDVKLLSLNIFAMV 101
             VL LCF +   D + +PVIT HF+GG DVKL +LN FA V
Sbjct: 346 RGVLSLCFKS-KKDDIGVPVITAHFSGGADVKLNALNTFARV 386

BLAST of Cp4.1LG01g01350 vs. NCBI nr
Match: gi|778697533|ref|XP_004149005.2| (PREDICTED: probable aspartic protease At2g35615 [Cucumis sativus])

HSP 1 Score: 82.0 bits (201), Expect = 6.5e-13
Identity = 53/106 (50.00%), Positives = 64/106 (60.38%), Query Frame = 1

Query: 1   MSVANKRFKATNDMPVAAKQGNILINFGTTLTIMPPNLYKRVTLTLAHVGKAKRVHDSIR 60
           +S+ N+R  A       AKQGN++I+ GTTLTI+P  LY  V  +L  V KAKRV D   
Sbjct: 287 ISIGNERHMAF------AKQGNVIIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKDPHG 346

Query: 61  VLDLCF-----AACSVDHLNIPVITTHFAGGNDVKLLSLNIFAMVA 102
            LDLCF     AA S   L IPVIT HF+GG +V LL +N F  VA
Sbjct: 347 SLDLCFDDGINAAAS---LGIPVITAHFSGGANVNLLPINTFRKVA 383

BLAST of Cp4.1LG01g01350 vs. NCBI nr
Match: gi|645265299|ref|XP_008238084.1| (PREDICTED: aspartic proteinase CDR1-like [Prunus mume])

HSP 1 Score: 79.7 bits (195), Expect = 3.2e-12
Identity = 46/104 (44.23%), Positives = 61/104 (58.65%), Query Frame = 1

Query: 1   MSVANKRFKATNDMP------VAAKQGNILINFGTTLTIMPPNLYKRVTLTLAHVGKAKR 60
           +SV  KR       P      VAA +GNI+I+ GTTLT++PP  +  +   L     A+R
Sbjct: 307 ISVGEKRLAYKTKSPDCEEAAVAANEGNIIIDSGTTLTLLPPGFHDDLVSALETAINAER 366

Query: 61  VHDSIRVLDLCFAACSVDHLNIPVITTHFAGGNDVKLLSLNIFA 99
           V D   +L LCF + S D + +PVIT HF+GG DVKL +LN FA
Sbjct: 367 VSDPRGILSLCFKSKS-DDIGVPVITAHFSGGADVKLQALNTFA 409

The following BLAST results are available for this feature:
Swiss-Prot
Match Name  E-value  Identity  Description
CDR1_ARATH  1.1e-07  40.00     Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1

TrEMBL
Match Name        E-value  Identity  Description
A0A0A0KZZ3_CUCSA  7.0e-22  57.43     Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055410 PE=3 SV=1
A0A0A0KV20_CUCSA  4.5e-13  50.00     Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055400 PE=3 SV=1
M5WRG3_PRUPE      2.2e-12  44.23     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025167mg PE=3 SV=1
A0A0L9V285_PHAAN  5.5e-11  43.43     Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan08g006700 PE=3 SV=1
A0A151TPR6_CAJCA  7.2e-11  47.56     Aspartic proteinase nepenthesin-1 OS=Cajanus cajan GN=KK1_022702 PE=3 SV=1

TAIR10
Match Name   E-value  Identity  Description
AT1G64830.1  3.4e-12  40.20     Eukaryotic aspartyl protease family protein
AT5G33340.1  6.0e-09  40.00     Eukaryotic aspartyl protease family protein
AT2G28010.1  3.3e-07  37.11     Eukaryotic aspartyl protease family protein
AT2G35615.1  7.4e-07  32.29     Eukaryotic aspartyl protease family protein
AT2G28220.1  1.6e-06  37.50     Eukaryotic aspartyl protease family protein

NCBI nr
Match Name                        E-value  Identity  Description
gi|449462551|ref|XP_004149004.1|  1.0e-21  57.43     PREDICTED: probable aspartic protease At2g35615 [Cucumis sativus]
gi|659102472|ref|XP_008452150.1|  2.2e-21  55.45     PREDICTED: probable aspartic protease At2g35615 [Cucumis melo]
gi|470131788|ref|XP_004301773.1|  3.8e-13  45.10     PREDICTED: aspartic proteinase CDR1 [Fragaria vesca subsp. vesca]
gi|778697533|ref|XP_004149005.2|  6.5e-13  50.00     PREDICTED: probable aspartic protease At2g35615 [Cucumis sativus]
gi|645265299|ref|XP_008238084.1|  3.2e-12  44.23     PREDICTED: aspartic proteinase CDR1-like [Prunus mume]

The following terms have been associated with this gene:
Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis

Vocabulary: Molecular Function
Term        Definition
GO:0004190  aspartic-type endopeptidase activity

Vocabulary: INTERPRO
Term       Definition
IPR021109  Peptidase_aspartic_dom_sf
IPR001461  Aspartic_peptidase_A1

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0004190      aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g01350.1  Cp4.1LG01g01350.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description               Source   Source Term       Source Description                Alignment
IPR001461  Aspartic peptidase A1 family  PANTHER  PTHR13683         ASPARTYL PROTEASES                coord: 1..100, score: 3.4
IPR021109  Aspartic peptidase domain     GENE3D   G3DSA:2.40.70.10                                    coord: 2..100, score: 2.
IPR021109  Aspartic peptidase domain     unknown  SSF50630          Acid proteases                    coord: 15..100, score: 2.1
None       No IPR available              PANTHER  PTHR13683:SF298   ASPARTIC PROTEINASE CDR1-RELATED  coord: 1..100, score: 3.4

The following gene(s) are orthologous to this gene:
Gene             Orthologue      Organism                   Block
Cp4.1LG01g01350  CmaCh04G005890  Cucurbita maxima (Rimu)    cmacpeB720
Cp4.1LG01g01350  CmoCh04G006280  Cucurbita moschata (Rifu)  cmocpeB673

The following gene(s) are paralogous to this gene:

None