CmaCh04G005890 (gene) Cucurbita maxima (Rimu)

Name: CmaCh04G005890
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Eukaryotic aspartyl protease family protein
Location: Cma_Chr04 : 2981869 .. 2982428 (-)
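
The location field gives a chromosome, a start and end coordinate, and a strand. A minimal sketch of reading it, assuming the coordinates are 1-based and inclusive (an assumption, not stated on this page); `parse_location` is a hypothetical helper, not part of any database API.

```python
import re

def parse_location(text):
    """Split a string like 'Cma_Chr04 : 2981869 .. 2982428 (-)' into its parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("Cma_Chr04 : 2981869 .. 2982428 (-)")
length = end - start + 1  # inclusive interval (assumed convention)
print(chrom, strand, length)
```

Under that convention the feature spans 560 bp on the minus strand.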
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTAAAAGAACCTAATACCTTCTATTACCCAACTCTCGAAACAATGTCCATTGCAAACAAGCGGTTTAAGGCCGAGAACGACATGTCGGTCGCTGTAGAACAAGAGAATATCCTTATCAATTTCGGTACAACATTGACGCCGAATTTGTACAAACGTGTCACTTTGACATTGGCGCATGTTGTAAAGCGAAACGAGTGCATGATTCGACTGGAGTTTTGGATCTCTGCTTCGCCACGTGCAGCATTAATCATTTGAATATTCCGGTAATTACGGAACATTTTGCCGGAGGTGCCGACGTGAAATTGTTATCGTTGAATATATTTGTAATGGTGGCAGATAATGTGGCTTGTTTGGCCTTGGCGCCATCGACGAATTTTGCCATTTTTGAAAACTTGGCAAAGGTGAACTTTTTGGTCAGATACGATCTCGAGCGCAAGAGATTGTCGTTCAAATACAACGTTTGTGCTTAAAGACTACGAGTCATTTCTCTATTGTTTGTTTTTTTATTATAGTTATTTCTCTTCCCAAACCACATGATTTGGCAGAATAAAATGTGA

mRNA sequence

ATGTTAAAAGAACCTAATACCTTCTATTACCCAACTCTCGAAACAATGTCCATTGCAAACAAGCGGTTTAAGGCCGAGAACGACATGTCGGTCGCTGTAGAACAAGAGAATATCCTTATCAATTTCGGTACAACATTGACGCCGAATTTGTACAAACCGAAACGAGTGCATGATTCGACTGGAGTTTTGGATCTCTGCTTCGCCACGTGCAGCATTAATCATTTGAATATTCCGGTAATTACGGAACATTTTGCCGGAGGTGCCGACGTGAAATTGTTATCGTTGAATATATTTGTAATGGTGGCAGATAATGTGGCTTGTTTGGCCTTGGCGCCATCGACGAATTTTGCCATTTTTGAAAACTTGGCAAAGGTGAACTTTTTGGTCAGATACGATCTCGAGCGCAAGAGATTGTCGTTCAAATACAACTTATTTCTCTTCCCAAACCACATGATTTGGCAGAATAAAATGTGA

Coding sequence (CDS)

ATGTTAAAAGAACCTAATACCTTCTATTACCCAACTCTCGAAACAATGTCCATTGCAAACAAGCGGTTTAAGGCCGAGAACGACATGTCGGTCGCTGTAGAACAAGAGAATATCCTTATCAATTTCGGTACAACATTGACGCCGAATTTGTACAAACCGAAACGAGTGCATGATTCGACTGGAGTTTTGGATCTCTGCTTCGCCACGTGCAGCATTAATCATTTGAATATTCCGGTAATTACGGAACATTTTGCCGGAGGTGCCGACGTGAAATTGTTATCGTTGAATATATTTGTAATGGTGGCAGATAATGTGGCTTGTTTGGCCTTGGCGCCATCGACGAATTTTGCCATTTTTGAAAACTTGGCAAAGGTGAACTTTTTGGTCAGATACGATCTCGAGCGCAAGAGATTGTCGTTCAAATACAACTTATTTCTCTTCCCAAACCACATGATTTGGCAGAATAAAATGTGA

Protein sequence

MLKEPNTFYYPTLETMSIANKRFKAENDMSVAVEQENILINFGTTLTPNLYKPKRVHDSTGVLDLCFATCSINHLNIPVITEHFAGGADVKLLSLNIFVMVADNVACLALAPSTNFAIFENLAKVNFLVRYDLERKRLSFKYNLFLFPNHMIWQNKM
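
The protein sequence above is the conceptual translation of the CDS. A minimal sketch of that step, assuming the standard genetic code (NCBI translation table 1); `translate` is an illustrative helper, and only the first 21 bases of the CDS are used as input here.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

prefix = "ATGTTAAAAGAACCTAATACC"  # first seven codons of the CDS above
print(translate(prefix))  # MLKEPNT
```

The output matches the first seven residues of the protein sequence; passing the full CDS string to `translate` is the same operation.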
BLAST of CmaCh04G005890 vs. Swiss-Prot
Match: ASPR1_ARATH (Probable aspartic protease At2g35615 OS=Arabidopsis thaliana GN=At2g35615 PE=3 SV=1)

HSP 1 Score: 98.6 bits (244), Expect = 6.5e-20
Identity = 59/162 (36.42%), Positives = 93/162 (57.41%), Query Frame = 1

Query: 3   KEPNTFYYPTLETMSIANKR-------FKAENDMSVAVEQENILINFGTTLT-------- 62
           KEP T+YY TLE +S+  K+       +   +D  ++    NI+I+ GTTLT        
Sbjct: 281 KEPLTYYYLTLEAISVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFD 340

Query: 63  -------PNLYKPKRVHDSTGVLDLCFATCSINHLNIPVITEHFAGGADVKLLSLNIFVM 122
                   ++   KRV D  G+L  CF + S   + +P IT HF  GADV+L  +N FV 
Sbjct: 341 KFSSAVEESVTGAKRVSDPQGLLSHCFKSGSAE-IGLPEITVHFT-GADVRLSPINAFVK 400

Query: 123 VADNVACLALAPSTNFAIFENLAKVNFLVRYDLERKRLSFKY 143
           +++++ CL++ P+T  AI+ N A+++FLV YDLE + +SF++
Sbjct: 401 LSEDMVCLSMVPTTEVAIYGNFAQMDFLVGYDLETRTVSFQH 440
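
The identity and positives percentages in each HSP header are counts divided by the alignment length (gaps included). A minimal sketch that recomputes them from a header line; `hsp_percentages` is a hypothetical helper, and the pattern is written to accept both the "Positives" and "Postives" spellings.

```python
import re

# Matches e.g. "Identity = 59/162 (36.42%), Positives = 93/162 (57.41%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+)"
)

def hsp_percentages(line):
    """Return (percent identity, percent positives) recomputed from the counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, length, _, pos, _ = m.groups()
    ident, length, pos = int(ident), int(length), int(pos)
    return round(100 * ident / length, 2), round(100 * pos / length, 2)

line = "Identity = 59/162 (36.42%), Positives = 93/162 (57.41%), Query Frame = 1"
print(hsp_percentages(line))  # (36.42, 57.41)
```

The recomputed values agree with the percentages printed for the ASPR1_ARATH hit.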

BLAST of CmaCh04G005890 vs. Swiss-Prot
Match: CDR1_ARATH (Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1)

HSP 1 Score: 88.6 bits (218), Expect = 6.7e-17
Identity = 57/149 (38.26%), Positives = 85/149 (57.05%), Query Frame = 1

Query: 7   TFYYPTLETMSIANKRFKAENDMSVAVEQENILINFGTTLT--------------PNLYK 66
           TFYY TL+++S+ +K+ +     S + E  NI+I+ GTTLT               +   
Sbjct: 286 TFYYLTLKSISVGSKQIQYSGSDSESSEG-NIIIDSGTTLTLLPTEFYSELEDAVASSID 345

Query: 67  PKRVHDSTGVLDLCFATCSINHLNIPVITEHFAGGADVKLLSLNIFVMVADNVACLALAP 126
            ++  D    L LC++  +   L +PVIT HF  GADVKL S N FV V++++ C A   
Sbjct: 346 AEKKQDPQSGLSLCYS--ATGDLKVPVITMHF-DGADVKLDSSNAFVQVSEDLVCFAFRG 405

Query: 127 STNFAIFENLAKVNFLVRYDLERKRLSFK 142
           S +F+I+ N+A++NFLV YD   K +SFK
Sbjct: 406 SPSFSIYGNVAQMNFLVGYDTVSKTVSFK 430

BLAST of CmaCh04G005890 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 67.0 bits (162), Expect = 2.1e-10
Identity = 54/154 (35.06%), Positives = 79/154 (51.30%), Query Frame = 1

Query: 7   TFYYPTLETMSIANKRFKAENDMSVAVEQEN----ILINFGTTLT---PNLYKPKR---- 66
           TFYY TL  +S+ + R   +   + A+   N    I+I+ GTTLT    N Y+  R    
Sbjct: 278 TFYYITLNGLSVGSTRLPIDPS-AFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFI 337

Query: 67  -------VHDSTGVLDLCFATCSI-NHLNIPVITEHFAGGADVKLLSLNIFVMVADNVAC 126
                  V+ S+   DLCF T S  ++L IP    HF GG D++L S N F+  ++ + C
Sbjct: 338 SQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGG-DLELPSENYFISPSNGLIC 397

Query: 127 LALAPST-NFAIFENLAKVNFLVRYDLERKRLSF 141
           LA+  S+   +IF N+ + N LV YD     +SF
Sbjct: 398 LAMGSSSQGMSIFGNIQQQNMLVVYDTGNSVVSF 429

BLAST of CmaCh04G005890 vs. TrEMBL
Match: A0A0A0KZZ3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055410 PE=3 SV=1)

HSP 1 Score: 152.1 bits (383), Expect = 5.5e-34
Identity = 81/156 (51.92%), Positives = 103/156 (66.03%), Query Frame = 1

Query: 3   KEPNTFYYPTLETMSIANKRFKAENDMSVAVEQENILINFGTTLT---PNLY-------- 62
           + P+TFY+ TLE +S+  KRFKA N +S      NI+I+ GTTLT    +LY        
Sbjct: 277 RSPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIIDSGTTLTLLPRSLYYGVFSTLA 336

Query: 63  ---KPKRVHDSTGVLDLCFATCSINHLNIPVITEHFAGGADVKLLSLNIFVMVADNVACL 122
              K KRV D +G+L+LC++   ++ LNIP+IT HFAGGADVKLL +N F  VADNV CL
Sbjct: 337 RVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCL 396

Query: 123 ALAPSTNFAIFENLAKVNFLVRYDLERKRLSFKYNL 145
             AP+T  AIF NLA++NF V YDL  KRLSF+  L
Sbjct: 397 TFAPATQVAIFGNLAQINFEVGYDLGNKRLSFEPKL 432

BLAST of CmaCh04G005890 vs. TrEMBL
Match: M5WRG3_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025167mg PE=3 SV=1)

HSP 1 Score: 120.6 bits (301), Expect = 1.8e-24
Identity = 70/159 (44.03%), Positives = 99/159 (62.26%), Query Frame = 1

Query: 3   KEPNTFYYPTLETMSIANKR--FKAEND----MSVAVEQENILINFGTTLT---PNLY-- 62
           K P+TFYY TLE +S+  KR  +K ++      +VA  + NI+I+ GTTLT   P  +  
Sbjct: 293 KNPDTFYYLTLEAISVGEKRLAYKTKSPDCEKAAVAANEGNIIIDSGTTLTLLPPGFHDD 352

Query: 63  ---------KPKRVHDSTGVLDLCFATCSINHLNIPVITEHFAGGADVKLLSLNIFVMVA 122
                      +RV D  G+L LCF + S + + +PVIT HF+GGADVKL +LN F  + 
Sbjct: 353 LVSALETAINAERVSDPRGILSLCFKSKS-DDIGVPVITVHFSGGADVKLQALNTFARMD 412

Query: 123 DNVACLALAPSTNFAIFENLAKVNFLVRYDLERKRLSFK 142
           D++ C  + PS++ AIF NLA++NFLV YDLE + +SFK
Sbjct: 413 DDMICFTMIPSSDVAIFGNLAQMNFLVGYDLEERSVSFK 450

BLAST of CmaCh04G005890 vs. TrEMBL
Match: A0A0A0KV20_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055400 PE=3 SV=1)

HSP 1 Score: 118.6 bits (296), Expect = 6.7e-24
Identity = 79/159 (49.69%), Positives = 93/159 (58.49%), Query Frame = 1

Query: 3   KEPNTFYYPTLETMSIANKRFKAENDMSVAVEQENILINFGTTLT---PNLY-------- 62
           K   T+YY TLE +SI N+R  A        +Q N++I+ GTTLT     LY        
Sbjct: 274 KNTVTYYYITLEAISIGNERHMA------FAKQGNVIIDSGTTLTILPKELYDGVVSSLL 333

Query: 63  ---KPKRVHDSTGVLDLCFATCSIN---HLNIPVITEHFAGGADVKLLSLNIFVMVADNV 122
              K KRV D  G LDLCF    IN    L IPVIT HF+GGA+V LL +N F  VADNV
Sbjct: 334 KVVKAKRVKDPHGSLDLCFDD-GINAAASLGIPVITAHFSGGANVNLLPINTFRKVADNV 393

Query: 123 ACLAL---APSTNFAIFENLAKVNFLVRYDLERKRLSFK 142
            CL L   +P+T F I  NLA+ NFL+ YDLE KRLSFK
Sbjct: 394 NCLTLKAASPTTEFGIIGNLAQANFLIGYDLEAKRLSFK 425

BLAST of CmaCh04G005890 vs. TrEMBL
Match: A0A067KX75_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_02613 PE=3 SV=1)

HSP 1 Score: 115.9 bits (289), Expect = 4.4e-23
Identity = 68/153 (44.44%), Positives = 95/153 (62.09%), Query Frame = 1

Query: 3   KEPNTFYYPTLETMSIANKRFKAENDMSVAVEQENILINFGTTLT---PNLY-------- 62
           K P+TFY+ TLE +S+ NKR + E   S    + NI+I+ GTTLT   P+ +        
Sbjct: 278 KNPDTFYFLTLEAISVGNKRIEFEGS-SFGTTEGNIIIDSGTTLTLVPPDFFSELSSAID 337

Query: 63  ---KPKRVHDSTGVLDLCFATCSINHLNIPVITEHFAGGADVKLLSLNIFVMVADNVACL 122
              K  RV D  G+L LC++  SI+ +++P +T HF+ GADVKL  LN FV V+D V CL
Sbjct: 338 DVVKGTRVDDPNGILSLCYS--SISEVSLPTLTVHFS-GADVKLNPLNTFVQVSDGVVCL 397

Query: 123 ALAPSTNFAIFENLAKVNFLVRYDLERKRLSFK 142
           A     + AI+ NL+++NFL+ YDLE K +SFK
Sbjct: 398 AFGSIESGAIYGNLSQMNFLIGYDLEEKTVSFK 426

BLAST of CmaCh04G005890 vs. TrEMBL
Match: B9RV48_RICCO (Aspartic proteinase nepenthesin-2, putative OS=Ricinus communis GN=RCOM_0899040 PE=3 SV=1)

HSP 1 Score: 110.2 bits (274), Expect = 2.4e-21
Identity = 63/152 (41.45%), Positives = 89/152 (58.55%), Query Frame = 1

Query: 3   KEPNTFYYPTLETMSIANKRFKAENDMSVAVEQENILINFGTTLT--------------P 62
           K+P T+YY TLE +S+ NKR    N  +  VE+ NI+I+ GTTLT               
Sbjct: 293 KKPETYYYLTLEAISVENKRLPYTNLWNGEVEKGNIIIDSGTTLTFLDSEFFNNLDSAVE 352

Query: 63  NLYKPKRVHDSTGVLDLCFATCSINHLNIPVITEHFAGGADVKLLSLNIFVMVADNVACL 122
              K +RV D  G+ ++CF       + +P+IT HF G ADV+L  +N F  V +++ C 
Sbjct: 353 EAVKGERVSDPHGLFNICFK--DEKAIELPIITAHFTG-ADVELQPVNTFAKVEEDLLCF 412

Query: 123 ALAPSTNFAIFENLAKVNFLVRYDLERKRLSF 141
            + PS + AIF NLA++NFLV YDLE+K +SF
Sbjct: 413 TMIPSNDIAIFGNLAQMNFLVGYDLEKKAVSF 441

BLAST of CmaCh04G005890 vs. TAIR10
Match: AT2G35615.1 (AT2G35615.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 98.6 bits (244), Expect = 3.6e-21
Identity = 59/162 (36.42%), Positives = 93/162 (57.41%), Query Frame = 1

Query: 3   KEPNTFYYPTLETMSIANKR-------FKAENDMSVAVEQENILINFGTTLT-------- 62
           KEP T+YY TLE +S+  K+       +   +D  ++    NI+I+ GTTLT        
Sbjct: 281 KEPLTYYYLTLEAISVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFD 340

Query: 63  -------PNLYKPKRVHDSTGVLDLCFATCSINHLNIPVITEHFAGGADVKLLSLNIFVM 122
                   ++   KRV D  G+L  CF + S   + +P IT HF  GADV+L  +N FV 
Sbjct: 341 KFSSAVEESVTGAKRVSDPQGLLSHCFKSGSAE-IGLPEITVHFT-GADVRLSPINAFVK 400

Query: 123 VADNVACLALAPSTNFAIFENLAKVNFLVRYDLERKRLSFKY 143
           +++++ CL++ P+T  AI+ N A+++FLV YDLE + +SF++
Sbjct: 401 LSEDMVCLSMVPTTEVAIYGNFAQMDFLVGYDLETRTVSFQH 440

BLAST of CmaCh04G005890 vs. TAIR10
Match: AT1G64830.1 (AT1G64830.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 96.7 bits (239), Expect = 1.4e-20
Identity = 61/153 (39.87%), Positives = 87/153 (56.86%), Query Frame = 1

Query: 3   KEPNTFYYPTLETMSIANKRFKAENDMSVAVEQENILINFGTTLT---PNLY-------- 62
           K+P T+Y+  LE +S+ +K+ +  + +     + NI+I+ GTTLT    N Y        
Sbjct: 276 KDPATYYFLNLEAISVGSKKIQFTSTI-FGTGEGNIVIDSGTTLTLLPSNFYYELESVVA 335

Query: 63  ---KPKRVHDSTGVLDLCFATCSINHLNIPVITEHFAGGADVKLLSLNIFVMVADNVACL 122
              K +RV D  G+L LC+   S     +P IT HF GG DVKL +LN FV V+++V+C 
Sbjct: 336 STIKAERVQDPDGILSLCYRDSS--SFKVPDITVHFKGG-DVKLGNLNTFVAVSEDVSCF 395

Query: 123 ALAPSTNFAIFENLAKVNFLVRYDLERKRLSFK 142
           A A +    IF NLA++NFLV YD     +SFK
Sbjct: 396 AFAANEQLTIFGNLAQMNFLVGYDTVSGTVSFK 424

BLAST of CmaCh04G005890 vs. TAIR10
Match: AT5G33340.1 (AT5G33340.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 88.6 bits (218), Expect = 3.8e-18
Identity = 57/149 (38.26%), Positives = 85/149 (57.05%), Query Frame = 1

Query: 7   TFYYPTLETMSIANKRFKAENDMSVAVEQENILINFGTTLT--------------PNLYK 66
           TFYY TL+++S+ +K+ +     S + E  NI+I+ GTTLT               +   
Sbjct: 286 TFYYLTLKSISVGSKQIQYSGSDSESSEG-NIIIDSGTTLTLLPTEFYSELEDAVASSID 345

Query: 67  PKRVHDSTGVLDLCFATCSINHLNIPVITEHFAGGADVKLLSLNIFVMVADNVACLALAP 126
            ++  D    L LC++  +   L +PVIT HF  GADVKL S N FV V++++ C A   
Sbjct: 346 AEKKQDPQSGLSLCYS--ATGDLKVPVITMHF-DGADVKLDSSNAFVQVSEDLVCFAFRG 405

Query: 127 STNFAIFENLAKVNFLVRYDLERKRLSFK 142
           S +F+I+ N+A++NFLV YD   K +SFK
Sbjct: 406 SPSFSIYGNVAQMNFLVGYDTVSKTVSFK 430

BLAST of CmaCh04G005890 vs. TAIR10
Match: AT1G31450.1 (AT1G31450.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 85.1 bits (209), Expect = 4.2e-17
Identity = 53/159 (33.33%), Positives = 84/159 (52.83%), Query Frame = 1

Query: 3   KEPNTFYYPTLETMSIANKRFKAEN-----DMSVAVEQENILINFGTTLT---------- 62
           K+P T+Y+ TLE +++   +          +   +    NI+I+ GTTLT          
Sbjct: 281 KDPETYYFLTLEAVTVGKTKLPYTGGGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDF 340

Query: 63  -----PNLYKPKRVHDSTGVLDLCFATCSINHLNIPVITEHFAGGADVKLLSLNIFVMVA 122
                 ++   KRV D  G+L  CF +     + +P IT HF   ADVKL  +N FV + 
Sbjct: 341 GTAVEESVTGAKRVSDPQGLLTHCFKSGD-KEIGLPAITMHFTN-ADVKLSPINAFVKLN 400

Query: 123 DNVACLALAPSTNFAIFENLAKVNFLVRYDLERKRLSFK 142
           ++  CL++ P+T  AI+ N+ +++FLV YDLE K +SF+
Sbjct: 401 EDTVCLSMIPTTEVAIYGNMVQMDFLVGYDLETKTVSFQ 437

BLAST of CmaCh04G005890 vs. TAIR10
Match: AT2G28010.1 (AT2G28010.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 64.7 bits (156), Expect = 5.8e-11
Identity = 55/151 (36.42%), Positives = 77/151 (50.99%), Query Frame = 1

Query: 8   FYYPTLETMSIANKRFKAENDMSVAVEQENILINFGTTLT--------------PNLYKP 67
           FYY  L+ +S+ N R +       A+E  NI+I+ GTTLT               ++   
Sbjct: 240 FYYLNLDAVSVGNTRIETMGTTFHALEG-NIVIDSGTTLTYFPVSYCNLVRQAVEHVVTA 299

Query: 68  KRVHDSTGVLDLCFATCSINHLNIPVITEHFAGGADVKLLSLNIFVMVADN--VACLALA 127
            R  D TG   LC+ + +I+    PVIT HF+GG D+ L   N++ M ++N  V CLA+ 
Sbjct: 300 VRAADPTGNDMLCYNSDTIDIF--PVITMHFSGGVDLVLDKYNMY-MESNNGGVFCLAII 359

Query: 128 --PSTNFAIFENLAKVNFLVRYDLERKRLSF 141
               T  AIF N A+ NFLV YD     +SF
Sbjct: 360 CNSPTQEAIFGNRAQNNFLVGYDSSSLLVSF 386

BLAST of CmaCh04G005890 vs. NCBI nr
Match: gi|659102472|ref|XP_008452150.1| (PREDICTED: probable aspartic protease At2g35615 [Cucumis melo])

HSP 1 Score: 157.9 bits (398), Expect = 1.4e-35
Identity = 84/153 (54.90%), Positives = 104/153 (67.97%), Query Frame = 1

Query: 3   KEPNTFYYPTLETMSIANKRFKAENDMSVAVEQENILINFGTTLT---PNLY-------- 62
           + P+TFY+ TLE +S+ NKRFKA  DMS    Q NI+I+ GTTLT    +LY        
Sbjct: 277 RSPDTFYFLTLEAISVGNKRFKAAKDMSAMTNQGNIIIDSGTTLTLLPRSLYDGVVSTLA 336

Query: 63  ---KPKRVHDSTGVLDLCFATCSINHLNIPVITEHFAGGADVKLLSLNIFVMVADNVACL 122
              K KRV D +G+L+LC++   +  LNIP+IT HF+G ADVKLL +N F  VADNV CL
Sbjct: 337 RVIKTKRVDDPSGILELCYSAGQLEDLNIPIITAHFSGRADVKLLPVNTFAPVADNVICL 396

Query: 123 ALAPSTNFAIFENLAKVNFLVRYDLERKRLSFK 142
            LAP+TN AIF NLA++NF V YDL  KRLSFK
Sbjct: 397 TLAPATNVAIFGNLAQINFEVGYDLGNKRLSFK 429

BLAST of CmaCh04G005890 vs. NCBI nr
Match: gi|449462551|ref|XP_004149004.1| (PREDICTED: probable aspartic protease At2g35615 [Cucumis sativus])

HSP 1 Score: 152.1 bits (383), Expect = 7.9e-34
Identity = 81/156 (51.92%), Positives = 103/156 (66.03%), Query Frame = 1

Query: 3   KEPNTFYYPTLETMSIANKRFKAENDMSVAVEQENILINFGTTLT---PNLY-------- 62
           + P+TFY+ TLE +S+  KRFKA N +S      NI+I+ GTTLT    +LY        
Sbjct: 277 RSPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIIDSGTTLTLLPRSLYYGVFSTLA 336

Query: 63  ---KPKRVHDSTGVLDLCFATCSINHLNIPVITEHFAGGADVKLLSLNIFVMVADNVACL 122
              K KRV D +G+L+LC++   ++ LNIP+IT HFAGGADVKLL +N F  VADNV CL
Sbjct: 337 RVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCL 396

Query: 123 ALAPSTNFAIFENLAKVNFLVRYDLERKRLSFKYNL 145
             AP+T  AIF NLA++NF V YDL  KRLSF+  L
Sbjct: 397 TFAPATQVAIFGNLAQINFEVGYDLGNKRLSFEPKL 432

BLAST of CmaCh04G005890 vs. NCBI nr
Match: gi|470131788|ref|XP_004301773.1| (PREDICTED: aspartic proteinase CDR1 [Fragaria vesca subsp. vesca])

HSP 1 Score: 125.2 bits (313), Expect = 1.0e-25
Identity = 69/155 (44.52%), Positives = 99/155 (63.87%), Query Frame = 1

Query: 3   KEPNTFYYPTLETMSIANKR--FKAENDMSVAVEQENILINFGTTLT---PNLY------ 62
           K+PNTFYY TLE +S+  K+  +K++++ +VA  + NI+I+ GTTLT   P  +      
Sbjct: 273 KQPNTFYYLTLEAISVGEKKVLYKSQSNKAVAGSEGNIIIDSGTTLTLLPPGFHDDVVAA 332

Query: 63  -----KPKRVHDSTGVLDLCFATCSINHLNIPVITEHFAGGADVKLLSLNIFVMVADNVA 122
                  +RV D  GVL LCF +   + + +PVIT HF+GGADVKL +LN F  V D++ 
Sbjct: 333 LEAAINAERVSDPRGVLSLCFKSKK-DDIGVPVITAHFSGGADVKLNALNTFARVEDDMV 392

Query: 123 CLALAPSTNFAIFENLAKVNFLVRYDLERKRLSFK 142
           C  + P+ + AIF NLA++NFLV YDL+   +SFK
Sbjct: 393 CFTMIPADDVAIFGNLAQINFLVGYDLDEGTVSFK 426

BLAST of CmaCh04G005890 vs. NCBI nr
Match: gi|657964106|ref|XP_008373676.1| (PREDICTED: aspartic proteinase CDR1-like [Malus domestica])

HSP 1 Score: 123.2 bits (308), Expect = 3.9e-25
Identity = 71/161 (44.10%), Positives = 100/161 (62.11%), Query Frame = 1

Query: 3   KEPNTFYYPTLETMSIANKRF--------KAENDMSVAVEQENILINFGTTLT---PNLY 62
           K+P+TFYY TLE +S+  KR           E+D++VA  + NI+I+ GTTLT   P  Y
Sbjct: 299 KQPDTFYYLTLEAISVGEKRLAYKTKSSPNFEDDVAVAANEGNIIIDSGTTLTLLPPGFY 358

Query: 63  KP-----------KRVHDSTGVLDLCFATCSINHLNIPVITEHFAGGADVKLLSLNIFVM 122
           +            +RV D  G+L LCF + S + + +PVIT HF G ADVKL ++N F  
Sbjct: 359 EDLESALEVAINAERVSDPKGILSLCFRSES-DDIGVPVITAHFKG-ADVKLQAVNTFAR 418

Query: 123 VADNVACLALAPSTNFAIFENLAKVNFLVRYDLERKRLSFK 142
           + D++ CL + PS++ AIF NLA++NFLV YDLE + +SFK
Sbjct: 419 IEDDLVCLTMIPSSDVAIFGNLAQINFLVGYDLEERTVSFK 457

BLAST of CmaCh04G005890 vs. NCBI nr
Match: gi|645265299|ref|XP_008238084.1| (PREDICTED: aspartic proteinase CDR1-like [Prunus mume])

HSP 1 Score: 122.1 bits (305), Expect = 8.7e-25
Identity = 70/159 (44.03%), Positives = 100/159 (62.89%), Query Frame = 1

Query: 3   KEPNTFYYPTLETMSIANKR--FKAEN----DMSVAVEQENILINFGTTLT---PNLY-- 62
           K P+TFYY TLE +S+  KR  +K ++    + +VA  + NI+I+ GTTLT   P  +  
Sbjct: 294 KNPDTFYYLTLEAISVGEKRLAYKTKSPDCEEAAVAANEGNIIIDSGTTLTLLPPGFHDD 353

Query: 63  ---------KPKRVHDSTGVLDLCFATCSINHLNIPVITEHFAGGADVKLLSLNIFVMVA 122
                      +RV D  G+L LCF + S + + +PVIT HF+GGADVKL +LN F  + 
Sbjct: 354 LVSALETAINAERVSDPRGILSLCFKSKS-DDIGVPVITAHFSGGADVKLQALNTFARMD 413

Query: 123 DNVACLALAPSTNFAIFENLAKVNFLVRYDLERKRLSFK 142
           D++ C  + PS++ AIF NLA++NFLV YDLE + +SFK
Sbjct: 414 DDMICFTMIPSSDVAIFGNLAQMNFLVGYDLEERSVSFK 451

The following BLAST results are available for this feature:

vs. Swiss-Prot
Match Name    E-value   Identity (%)   Description
ASPR1_ARATH   6.5e-20   36.42          Probable aspartic protease At2g35615 OS=Arabidopsis thaliana GN=At2g35615 PE=3 S...
CDR1_ARATH    6.7e-17   38.26          Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1
NEP1_NEPGR    2.1e-10   35.06          Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1

vs. TrEMBL
Match Name         E-value   Identity (%)   Description
A0A0A0KZZ3_CUCSA   5.5e-34   51.92          Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055410 PE=3 SV=1
M5WRG3_PRUPE       1.8e-24   44.03          Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025167mg PE=3 SV=1
A0A0A0KV20_CUCSA   6.7e-24   49.69          Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055400 PE=3 SV=1
A0A067KX75_JATCU   4.4e-23   44.44          Uncharacterized protein OS=Jatropha curcas GN=JCGZ_02613 PE=3 SV=1
B9RV48_RICCO       2.4e-21   41.45          Aspartic proteinase nepenthesin-2, putative OS=Ricinus communis GN=RCOM_0899040 ...

vs. TAIR10
Match Name    E-value   Identity (%)   Description
AT2G35615.1   3.6e-21   36.42          Eukaryotic aspartyl protease family protein
AT1G64830.1   1.4e-20   39.87          Eukaryotic aspartyl protease family protein
AT5G33340.1   3.8e-18   38.26          Eukaryotic aspartyl protease family protein
AT1G31450.1   4.2e-17   33.33          Eukaryotic aspartyl protease family protein
AT2G28010.1   5.8e-11   36.42          Eukaryotic aspartyl protease family protein

vs. NCBI nr
Match Name                         E-value   Identity (%)   Description
gi|659102472|ref|XP_008452150.1|   1.4e-35   54.90          PREDICTED: probable aspartic protease At2g35615 [Cucumis melo]
gi|449462551|ref|XP_004149004.1|   7.9e-34   51.92          PREDICTED: probable aspartic protease At2g35615 [Cucumis sativus]
gi|470131788|ref|XP_004301773.1|   1.0e-25   44.52          PREDICTED: aspartic proteinase CDR1 [Fragaria vesca subsp. vesca]
gi|657964106|ref|XP_008373676.1|   3.9e-25   44.10          PREDICTED: aspartic proteinase CDR1-like [Malus domestica]
gi|645265299|ref|XP_008238084.1|   8.7e-25   44.03          PREDICTED: aspartic proteinase CDR1-like [Prunus mume]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR001461   Aspartic_peptidase_A1
IPR021109   Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
Term         Definition
GO:0004190   aspartic-type endopeptidase activity
Vocabulary: Biological Process
Term         Definition
GO:0006508   proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0006508       proteolysis
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0004190       aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
CmaCh04G005890.1   CmaCh04G005890.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term    IPR Description                Source    Source Term        Source Description                 Alignment
IPR001461   Aspartic peptidase A1 family   PANTHER   PTHR13683          ASPARTYL PROTEASES                 coord: 3..141, score: 1.9
IPR021109   Aspartic peptidase domain      GENE3D    G3DSA:2.40.70.10                                      coord: 58..142, score: 4.9
IPR021109   Aspartic peptidase domain      unknown   SSF50630           Acid proteases                     coord: 5..143, score: 5.63
None        No IPR available               PANTHER   PTHR13683:SF298    ASPARTIC PROTEINASE CDR1-RELATED   coord: 3..141, score: 1.9

The following gene(s) are orthologous to this gene:
Gene             Orthologue        Organism                    Block
CmaCh04G005890   Cp4.1LG01g01350   Cucurbita pepo (Zucchini)   cmacpeB720
The following gene(s) are paralogous to this gene:

None