Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptide, exon, CDS
ATGACTGGTTATCGTATTATCTTCAATCGTGACAAGATGGTGTTGGGCTGGAGTCCTTCAGATTGTGAGAATCACTTCACCTGAAAACTCTTCTTCGTTTCGCTAGTTTTTTTTTTTCCTGCAGTGTCATTGAAGCTTGTTTTCTTAAATCCAGGTTACGACAATGGCGACGGCGCTCCCTCCGGCAATACTCCTCCATCTGACTCTCCTCCGGCCGACTCCCCTCCGGCCGACAACTCTCCACCGTCCGACGCCTCTCCTCCGGCCGACAACTCTCCACCGTCCGACGCCTCTCCTCCGGCCGACAACTCTCCACCGTCCGACGCCTCTCCTCCATCCGGCGACTCTCCTCCATCCGACGACTCTCCTCCGGCACCTTCTACCCCAGGAGGAAGCACCGGTTTTCCGAAACTTGGGGAGGGTGATGCATCGCGGTTGAATCCATTGGCCTCTTTATTTGTTGCTGTTCTTGTAATTTTGGCTGCTGTTTGA
mRNA sequence
ATGACTGGTTATCGTATTATCTTCAATCGTGACAAGATGGTGTTGGGCTGGAGTCCTTCAGATTGTTACGACAATGGCGACGGCGCTCCCTCCGGCAATACTCCTCCATCTGACTCTCCTCCGGCCGACTCCCCTCCGGCCGACAACTCTCCACCGTCCGACGCCTCTCCTCCGGCCGACAACTCTCCACCGTCCGACGCCTCTCCTCCGGCCGACAACTCTCCACCGTCCGACGCCTCTCCTCCATCCGGCGACTCTCCTCCATCCGACGACTCTCCTCCGGCACCTTCTACCCCAGGAGGAAGCACCGGTTTTCCGAAACTTGGGGAGGGTGATGCATCGCGGTTGAATCCATTGGCCTCTTTATTTGTTGCTGTTCTTGTAATTTTGGCTGCTGTTTGA
Coding sequence (CDS)
ATGACTGGTTATCGTATTATCTTCAATCGTGACAAGATGGTGTTGGGCTGGAGTCCTTCAGATTGTTACGACAATGGCGACGGCGCTCCCTCCGGCAATACTCCTCCATCTGACTCTCCTCCGGCCGACTCCCCTCCGGCCGACAACTCTCCACCGTCCGACGCCTCTCCTCCGGCCGACAACTCTCCACCGTCCGACGCCTCTCCTCCGGCCGACAACTCTCCACCGTCCGACGCCTCTCCTCCATCCGGCGACTCTCCTCCATCCGACGACTCTCCTCCGGCACCTTCTACCCCAGGAGGAAGCACCGGTTTTCCGAAACTTGGGGAGGGTGATGCATCGCGGTTGAATCCATTGGCCTCTTTATTTGTTGCTGTTCTTGTAATTTTGGCTGCTGTTTGA
Protein sequence
MTGYRIIFNRDKMVLGWSPSDCYDNGDGAPSGNTPPSDSPPADSPPADNSPPSDASPPADNSPPSDASPPADNSPPSDASPPSGDSPPSDDSPPAPSTPGGSTGFPKLGEGDASRLNPLASLFVAVLVILAAV
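The protein sequence above should correspond to a codon-by-codon translation of the CDS. A minimal sketch of that check follows, assuming the standard nuclear genetic code (NCBI translation table 1); the function name `translate` and the short test fragments are illustrative and not part of the database record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in TCAG order for the first, second and third base.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First 13 codons and last 4 codons of the CDS listed above.
cds_start = "ATGACTGGTTATCGTATTATCTTCAATCGTGACAAGATG"
cds_end = "GCTGCTGTTTGA"
print(translate(cds_start))
print(translate(cds_end))
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence is a quick way to confirm that the two records agree.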
Homology
BLAST of Cla97C01G020020 vs. NCBI nr
Match:
XP_038882816.1 (aspartyl protease family protein 1-like [Benincasa hispida])
HSP 1 Score: 174.1 bits (440), Expect = 8.1e-40
Identity = 105/133 (78.95%), Positives = 117/133 (87.97%), Query Frame = 0
Query: 1 MTGYRIIFNRDKMVLGWSPSDCYDNGDGAPSGNTPPSDSPPADSPPADNSPPSDASPPAD 60
MTGYRIIFNR+KMVLGWS SDCYDNGDG PSG+TPPSDSPP+DSPPAD SPP+D SPPAD
Sbjct: 427 MTGYRIIFNREKMVLGWSLSDCYDNGDGTPSGDTPPSDSPPSDSPPAD-SPPTDDSPPAD 486
Query: 61 NSPPSDASPPADNSPPSDASPPSGDSPPSDDSPPAPSTPGGSTGFPKLGEGDASRLNPLA 120
SPP+D SPP+++SPPS+ SPPS DSPPS+DSPPAPSTPGG TG P GDA+RLNPLA
Sbjct: 487 -SPPTDDSPPSEHSPPSEDSPPSEDSPPSEDSPPAPSTPGGRTGLPNF--GDATRLNPLA 546
Query: 121 SLFVAVLVILAAV 134
S+FVAVL ILA V
Sbjct: 547 SVFVAVLAILAVV 555
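The percentages in each HSP summary line are the match counts divided by the alignment length (gaps included). The sketch below parses one such line and recomputes both values; the helper name `parse_hsp_stats` and the returned dictionary keys are assumptions made for illustration, not part of any BLAST tool.

```python
import re

# Matches the "Identity = a/n (x%), Positives = b/n (y%)" summary line.
# The optional "i" also tolerates the misspelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract the counts from an HSP summary line and recompute the percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    identical, length, _, positive, _, _ = match.groups()
    identical, length, positive = int(identical), int(length), int(positive)
    return {
        "alignment_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positive / length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 105/133 (78.95%), Positives = 117/133 (87.97%), Query Frame = 0"
)
print(stats)
```

For the first hit this gives 78.95 and 87.97, the same values the report prints.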
BLAST of Cla97C01G020020 vs. NCBI nr
Match:
KAA0025642.1 (aspartyl protease family protein 1-like [Cucumis melo var. makuwa] >TYK12515.1 aspartyl protease family protein 1-like [Cucumis melo var. makuwa])
HSP 1 Score: 152.9 bits (385), Expect = 1.9e-33
Identity = 95/137 (69.34%), Positives = 109/137 (79.56%), Query Frame = 0
Query: 1 MTGYRIIFNRDKMVLGWSPSDCYDNGDGAPS----GNTPPSDSPPADSPPADNSPPSDAS 60
MTGYRIIFNR +MVLGWSPSDCYDNGD PS ++PPSDSPP DSPP+D SPP+D +
Sbjct: 424 MTGYRIIFNRGEMVLGWSPSDCYDNGDSTPSDSPPADSPPSDSPPTDSPPSD-SPPTDDT 483
Query: 61 PPADNSPPSDASPPADNSPPSDASPPSGDSPPSDDSPPAPSTPGGSTGFPKLGEGDASRL 120
PP+ +SPPS SPP+ +SPPS SPPS DSPPS DSPPAPSTPGG G P++G A+RL
Sbjct: 484 PPSGDSPPSGDSPPSGDSPPSGDSPPSDDSPPSGDSPPAPSTPGGGNGLPRIGA--AARL 543
Query: 121 NPLASLFVAVLVILAAV 134
NPL S+FVAVL ILA V
Sbjct: 544 NPLGSVFVAVLAILAVV 557
BLAST of Cla97C01G020020 vs. NCBI nr
Match:
XP_023518254.1 (aspartyl protease family protein 1-like [Cucurbita pepo subsp. pepo])
HSP 1 Score: 151.0 bits (380), Expect = 7.3e-33
Identity = 91/134 (67.91%), Positives = 108/134 (80.60%), Query Frame = 0
Query: 1 MTGYRIIFNRDKMVLGWSPSDCYDNGDGAPSGNTPPSDSPPA-DSPPADNSPPSDASPPA 60
MTGYRIIF+R+KMVLGWSPSDCYD+ G PSG+TPP+DSPPA DSPPAD+SPP++ SPPA
Sbjct: 427 MTGYRIIFDREKMVLGWSPSDCYDSDAGTPSGDTPPADSPPAKDSPPADDSPPAEDSPPA 486
Query: 61 DNSPPSDASPPADNSPPSDASPPSGDSPPSDDSPPAPSTPGGSTGFPKLGEGDASRLNPL 120
++SPP++ SPPA+ DSPP+DDSP PS PGGSTG PK+G GDA+RLNPL
Sbjct: 487 EDSPPAEDSPPAE------------DSPPTDDSPSPPSAPGGSTGLPKIG-GDATRLNPL 546
Query: 121 ASLFVAVLVILAAV 134
S+FVAVL ILA V
Sbjct: 547 TSVFVAVLAILAVV 547
BLAST of Cla97C01G020020 vs. NCBI nr
Match:
XP_031742572.1 (aspartyl protease family protein 1 [Cucumis sativus] >KGN48951.1 hypothetical protein Csa_003569 [Cucumis sativus])
HSP 1 Score: 145.6 bits (366), Expect = 3.1e-31
Identity = 94/146 (64.38%), Positives = 110/146 (75.34%), Query Frame = 0
Query: 1 MTGYRIIFNRDKMVLGWSPSDCYDNGDGAPSGNTP----PSDSPPADSPPA--------- 60
MTGYRI FNRD+MVLGWS SDCYDNG G PSG+TP PSDSPP DSPP+
Sbjct: 426 MTGYRITFNRDQMVLGWSSSDCYDNGVGTPSGDTPPADSPSDSPPTDSPPSVSPPTDSPP 485
Query: 61 DNSPPSDASPPADNSPPSDASPPADNSPPSDASPPSGDSPPSDDSPPAPSTPGGSTGFPK 120
+SPP+D +PP+D+SPPS+ SPP+++SPPS+ SPPS DSPPS DSPPAPSTPGG G P
Sbjct: 486 SDSPPTDDTPPSDDSPPSEDSPPSEDSPPSEDSPPSDDSPPSGDSPPAPSTPGGRPGLP- 545
Query: 121 LGEGDASRLNPLASLFVAVLVILAAV 134
G G A++LNPL +F AVL ILA V
Sbjct: 546 -GLGGAAQLNPLGFVFGAVLAILALV 569
BLAST of Cla97C01G020020 vs. NCBI nr
Match:
KAG6594755.1 (Aspartyl protease family protein 1, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 142.9 bits (359), Expect = 2.0e-30
Identity = 87/133 (65.41%), Positives = 102/133 (76.69%), Query Frame = 0
Query: 1 MTGYRIIFNRDKMVLGWSPSDCYDNGDGAPSGNTPPSDSPPADSPPADNSPPSDASPPAD 60
MTGYRIIF+R+KM LGWSPSDCYD+ G PSG+TPP+ DSPPAD+SPP+D SPPAD
Sbjct: 427 MTGYRIIFDREKMALGWSPSDCYDSDAGTPSGDTPPA----KDSPPADDSPPADDSPPAD 486
Query: 61 NSPPSDASPPADNSPPSDASPPSGDSPPSDDSPPAPSTPGGSTGFPKLGEGDASRLNPLA 120
+SPP++ SPPA+ DSPP+DDSP PS PGGSTG PK+G GDA+RLNPL
Sbjct: 487 DSPPAEDSPPAE------------DSPPTDDSPSPPSAPGGSTGLPKIG-GDATRLNPLT 542
Query: 121 SLFVAVLVILAAV 134
S+FVAVL ILA V
Sbjct: 547 SVFVAVLAILAVV 542
BLAST of Cla97C01G020020 vs. ExPASy TrEMBL
Match:
A0A5A7SKI7 (Aspartyl protease family protein 1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G00960 PE=3 SV=1)
HSP 1 Score: 152.9 bits (385), Expect = 9.3e-34
Identity = 95/137 (69.34%), Positives = 109/137 (79.56%), Query Frame = 0
Query: 1 MTGYRIIFNRDKMVLGWSPSDCYDNGDGAPS----GNTPPSDSPPADSPPADNSPPSDAS 60
MTGYRIIFNR +MVLGWSPSDCYDNGD PS ++PPSDSPP DSPP+D SPP+D +
Sbjct: 424 MTGYRIIFNRGEMVLGWSPSDCYDNGDSTPSDSPPADSPPSDSPPTDSPPSD-SPPTDDT 483
Query: 61 PPADNSPPSDASPPADNSPPSDASPPSGDSPPSDDSPPAPSTPGGSTGFPKLGEGDASRL 120
PP+ +SPPS SPP+ +SPPS SPPS DSPPS DSPPAPSTPGG G P++G A+RL
Sbjct: 484 PPSGDSPPSGDSPPSGDSPPSGDSPPSDDSPPSGDSPPAPSTPGGGNGLPRIGA--AARL 543
Query: 121 NPLASLFVAVLVILAAV 134
NPL S+FVAVL ILA V
Sbjct: 544 NPLGSVFVAVLAILAVV 557
BLAST of Cla97C01G020020 vs. ExPASy TrEMBL
Match:
A0A0A0KML0 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G507250 PE=3 SV=1)
HSP 1 Score: 145.6 bits (366), Expect = 1.5e-31
Identity = 94/146 (64.38%), Positives = 110/146 (75.34%), Query Frame = 0
Query: 1 MTGYRIIFNRDKMVLGWSPSDCYDNGDGAPSGNTP----PSDSPPADSPPA--------- 60
MTGYRI FNRD+MVLGWS SDCYDNG G PSG+TP PSDSPP DSPP+
Sbjct: 426 MTGYRITFNRDQMVLGWSSSDCYDNGVGTPSGDTPPADSPSDSPPTDSPPSVSPPTDSPP 485
Query: 61 DNSPPSDASPPADNSPPSDASPPADNSPPSDASPPSGDSPPSDDSPPAPSTPGGSTGFPK 120
+SPP+D +PP+D+SPPS+ SPP+++SPPS+ SPPS DSPPS DSPPAPSTPGG G P
Sbjct: 486 SDSPPTDDTPPSDDSPPSEDSPPSEDSPPSEDSPPSDDSPPSGDSPPAPSTPGGRPGLP- 545
Query: 121 LGEGDASRLNPLASLFVAVLVILAAV 134
G G A++LNPL +F AVL ILA V
Sbjct: 546 -GLGGAAQLNPLGFVFGAVLAILALV 569
BLAST of Cla97C01G020020 vs. ExPASy TrEMBL
Match:
A0A1S4DT41 (aspartyl protease family protein 1-like OS=Cucumis melo OX=3656 GN=LOC103485161 PE=3 SV=1)
HSP 1 Score: 138.7 bits (348), Expect = 1.8e-29
Identity = 92/133 (69.17%), Positives = 102/133 (76.69%), Query Frame = 0
Query: 1 MTGYRIIFNRDKMVLGWSPSDCYDNGDGAPSGNTPPSDSPPADSPPADNSPPSDASPPAD 60
MTGYRIIFNR +MVLGWSPSDCYDNGD + PSDSPPADSPP+D SPP+D
Sbjct: 424 MTGYRIIFNRGEMVLGWSPSDCYDNGD------STPSDSPPADSPPSD-SPPTD------ 483
Query: 61 NSPPSDASPPADNSPPSDASPPSGDSPPSDDSPPAPSTPGGSTGFPKLGEGDASRLNPLA 120
SPPSD SPP D++PPS SPPS DSPPS DSPPAPSTPGG G P++G A+RLNPL
Sbjct: 484 -SPPSD-SPPTDDTPPSGDSPPSDDSPPSGDSPPAPSTPGGGNGLPRIGA--AARLNPLG 539
Query: 121 SLFVAVLVILAAV 134
S+FVAVL ILA V
Sbjct: 544 SVFVAVLAILAVV 539
BLAST of Cla97C01G020020 vs. ExPASy TrEMBL
Match:
A0A6J1BTF7 (aspartyl protease family protein 1-like OS=Momordica charantia OX=3673 GN=LOC111005407 PE=3 SV=1)
HSP 1 Score: 136.3 bits (342), Expect = 9.0e-29
Identity = 88/136 (64.71%), Positives = 98/136 (72.06%), Query Frame = 0
Query: 1 MTGYRIIFNRDKMVLGWSPSDCYDNGDGAPSGNTPPSDSPPADSPPADNSPPSDASPPAD 60
MTGYRIIFNR+KMVLGW+ SDCYDNG PS N SPPADNSPPSD
Sbjct: 430 MTGYRIIFNREKMVLGWTESDCYDNGAATPSDN----------SPPADNSPPSD------ 489
Query: 61 NSPPSDASPPADNSPPSDASPPSGDSPPSDDSPPAPSTPGGSTG---FPKLGEGDASRLN 120
SPP+D+SPP D+SPPSD SP SGDSPPS DSPPAPST GGS G P++G GDA+RLN
Sbjct: 490 -SPPTDSSPPTDSSPPSD-SPQSGDSPPSGDSPPAPSTAGGSNGTQPLPRIGAGDATRLN 547
Query: 121 PLASLFVAVLVILAAV 134
PL +FVA+L IL V
Sbjct: 550 PLGCVFVAILAILVVV 547
BLAST of Cla97C01G020020 vs. ExPASy TrEMBL
Match:
A0A6J1EEK3 (aspartyl protease family protein 1-like OS=Cucurbita moschata OX=3662 GN=LOC111433625 PE=3 SV=1)
HSP 1 Score: 132.9 bits (333), Expect = 1.0e-27
Identity = 81/133 (60.90%), Positives = 98/133 (73.68%), Query Frame = 0
Query: 1 MTGYRIIFNRDKMVLGWSPSDCYDNGDGAPSGNTPPSDSPPADSPPADNSPPSDASPPAD 60
MTGYRIIF+R+KM LGWSPSDCYD+ G PSG+TPP+ DSPPAD+SPP++ SPPA+
Sbjct: 427 MTGYRIIFDREKMALGWSPSDCYDSDAGTPSGDTPPA----KDSPPADDSPPAEDSPPAE 486
Query: 61 NSPPSDASPPADNSPPSDASPPSGDSPPSDDSPPAPSTPGGSTGFPKLGEGDASRLNPLA 120
+SPP++ DSPP+DDSP PS+PGGSTG PK+G GDA+RLNPL
Sbjct: 487 DSPPAE------------------DSPPTDDSPSPPSSPGGSTGLPKIG-GDATRLNPLT 536
Query: 121 SLFVAVLVILAAV 134
S+FVAVL ILA V
Sbjct: 547 SVFVAVLAILAVV 536
BLAST of Cla97C01G020020 vs. TAIR 10
Match:
AT3G51350.1 (Eukaryotic aspartyl protease family protein)
HSP 1 Score: 42.4 bits (98), Expect = 3.4e-04
Identity = 44/131 (33.59%), Positives = 61/131 (46.56%), Query Frame = 0
Query: 1 MTGYRIIFNRDKMVLGWSPSDCYDNGDGAPSGNTPPSDSPPADSPPADNSPPSDASPPAD 60
+ GYRI+F+R++M+LGW S C++ D + TPP PP PA
Sbjct: 431 VAGYRIVFDRERMILGWKQSLCFE--DESLESTTPP---PPEVEAPA------------- 490
Query: 61 NSPPSDASPPADNSPPSDASPPSGDSPPSDDSPPAPSTPGGSTGFPKLGEGDASRLNPLA 120
PS ++PP PP+ + P P P STG P G G A+ L PLA
Sbjct: 491 ---------------PSVSAPPPRSLPPTVSATPPPINPRNSTGNP--GTGGAANLIPLA 526
Query: 121 SLFVAVLVILA 132
S + +L +LA
Sbjct: 551 SQLLLLLPLLA 526
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038882816.1 | 8.1e-40 | 78.95 | aspartyl protease family protein 1-like [Benincasa hispida]
KAA0025642.1 | 1.9e-33 | 69.34 | aspartyl protease family protein 1-like [Cucumis melo var. makuwa] >TYK12515.1 a...
XP_023518254.1 | 7.3e-33 | 67.91 | aspartyl protease family protein 1-like [Cucurbita pepo subsp. pepo]
XP_031742572.1 | 3.1e-31 | 64.38 | aspartyl protease family protein 1 [Cucumis sativus] >KGN48951.1 hypothetical pr...
KAG6594755.1 | 2.0e-30 | 65.41 | Aspartyl protease family protein 1, partial [Cucurbita argyrosperma subsp. soror...
Match Name | E-value | Identity (%) | Description
A0A5A7SKI7 | 9.3e-34 | 69.34 | Aspartyl protease family protein 1-like OS=Cucumis melo var. makuwa OX=1194695 G...
A0A0A0KML0 | 1.5e-31 | 64.38 | Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G50725...
A0A1S4DT41 | 1.8e-29 | 69.17 | aspartyl protease family protein 1-like OS=Cucumis melo OX=3656 GN=LOC103485161 ...
A0A6J1BTF7 | 9.0e-29 | 64.71 | aspartyl protease family protein 1-like OS=Momordica charantia OX=3673 GN=LOC111...
A0A6J1EEK3 | 1.0e-27 | 60.90 | aspartyl protease family protein 1-like OS=Cucurbita moschata OX=3662 GN=LOC1114...
Match Name | E-value | Identity (%) | Description
AT3G51350.1 | 3.4e-04 | 33.59 | Eukaryotic aspartyl protease family protein