Cla97C01G020020 (gene) Watermelon (97103) v2.5

Overview
NameCla97C01G020020
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionAspartyl protease family protein 1-like
LocationCla97Chr01: 32860175 .. 32860666 (-)
RNA-Seq ExpressionCla97C01G020020
SyntenyCla97C01G020020
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACTGGTTATCGTATTATCTTCAATCGTGACAAGATGGTGTTGGGCTGGAGTCCTTCAGATTGTGAGAATCACTTCACCTGAAAACTCTTCTTCGTTTCGCTAGTTTTTTTTTTTCCTGCAGTGTCATTGAAGCTTGTTTTCTTAAATCCAGGTTACGACAATGGCGACGGCGCTCCCTCCGGCAATACTCCTCCATCTGACTCTCCTCCGGCCGACTCCCCTCCGGCCGACAACTCTCCACCGTCCGACGCCTCTCCTCCGGCCGACAACTCTCCACCGTCCGACGCCTCTCCTCCGGCCGACAACTCTCCACCGTCCGACGCCTCTCCTCCATCCGGCGACTCTCCTCCATCCGACGACTCTCCTCCGGCACCTTCTACCCCAGGAGGAAGCACCGGTTTTCCGAAACTTGGGGAGGGTGATGCATCGCGGTTGAATCCATTGGCCTCTTTATTTGTTGCTGTTCTTGTAATTTTGGCTGCTGTTTGA

mRNA sequence

ATGACTGGTTATCGTATTATCTTCAATCGTGACAAGATGGTGTTGGGCTGGAGTCCTTCAGATTGTTACGACAATGGCGACGGCGCTCCCTCCGGCAATACTCCTCCATCTGACTCTCCTCCGGCCGACTCCCCTCCGGCCGACAACTCTCCACCGTCCGACGCCTCTCCTCCGGCCGACAACTCTCCACCGTCCGACGCCTCTCCTCCGGCCGACAACTCTCCACCGTCCGACGCCTCTCCTCCATCCGGCGACTCTCCTCCATCCGACGACTCTCCTCCGGCACCTTCTACCCCAGGAGGAAGCACCGGTTTTCCGAAACTTGGGGAGGGTGATGCATCGCGGTTGAATCCATTGGCCTCTTTATTTGTTGCTGTTCTTGTAATTTTGGCTGCTGTTTGA

Coding sequence (CDS)

ATGACTGGTTATCGTATTATCTTCAATCGTGACAAGATGGTGTTGGGCTGGAGTCCTTCAGATTGTTACGACAATGGCGACGGCGCTCCCTCCGGCAATACTCCTCCATCTGACTCTCCTCCGGCCGACTCCCCTCCGGCCGACAACTCTCCACCGTCCGACGCCTCTCCTCCGGCCGACAACTCTCCACCGTCCGACGCCTCTCCTCCGGCCGACAACTCTCCACCGTCCGACGCCTCTCCTCCATCCGGCGACTCTCCTCCATCCGACGACTCTCCTCCGGCACCTTCTACCCCAGGAGGAAGCACCGGTTTTCCGAAACTTGGGGAGGGTGATGCATCGCGGTTGAATCCATTGGCCTCTTTATTTGTTGCTGTTCTTGTAATTTTGGCTGCTGTTTGA

Protein sequence

MTGYRIIFNRDKMVLGWSPSDCYDNGDGAPSGNTPPSDSPPADSPPADNSPPSDASPPADNSPPSDASPPADNSPPSDASPPSGDSPPSDDSPPAPSTPGGSTGFPKLGEGDASRLNPLASLFVAVLVILAAV
Homology
BLAST of Cla97C01G020020 vs. NCBI nr
Match: XP_038882816.1 (aspartyl protease family protein 1-like [Benincasa hispida])

HSP 1 Score: 174.1 bits (440), Expect = 8.1e-40
Identity = 105/133 (78.95%), Postives = 117/133 (87.97%), Query Frame = 0

Query: 1   MTGYRIIFNRDKMVLGWSPSDCYDNGDGAPSGNTPPSDSPPADSPPADNSPPSDASPPAD 60
           MTGYRIIFNR+KMVLGWS SDCYDNGDG PSG+TPPSDSPP+DSPPAD SPP+D SPPAD
Sbjct: 427 MTGYRIIFNREKMVLGWSLSDCYDNGDGTPSGDTPPSDSPPSDSPPAD-SPPTDDSPPAD 486

Query: 61  NSPPSDASPPADNSPPSDASPPSGDSPPSDDSPPAPSTPGGSTGFPKLGEGDASRLNPLA 120
            SPP+D SPP+++SPPS+ SPPS DSPPS+DSPPAPSTPGG TG P    GDA+RLNPLA
Sbjct: 487 -SPPTDDSPPSEHSPPSEDSPPSEDSPPSEDSPPAPSTPGGRTGLPNF--GDATRLNPLA 546

Query: 121 SLFVAVLVILAAV 134
           S+FVAVL ILA V
Sbjct: 547 SVFVAVLAILAVV 555

BLAST of Cla97C01G020020 vs. NCBI nr
Match: KAA0025642.1 (aspartyl protease family protein 1-like [Cucumis melo var. makuwa] >TYK12515.1 aspartyl protease family protein 1-like [Cucumis melo var. makuwa])

HSP 1 Score: 152.9 bits (385), Expect = 1.9e-33
Identity = 95/137 (69.34%), Postives = 109/137 (79.56%), Query Frame = 0

Query: 1   MTGYRIIFNRDKMVLGWSPSDCYDNGDGAPS----GNTPPSDSPPADSPPADNSPPSDAS 60
           MTGYRIIFNR +MVLGWSPSDCYDNGD  PS     ++PPSDSPP DSPP+D SPP+D +
Sbjct: 424 MTGYRIIFNRGEMVLGWSPSDCYDNGDSTPSDSPPADSPPSDSPPTDSPPSD-SPPTDDT 483

Query: 61  PPADNSPPSDASPPADNSPPSDASPPSGDSPPSDDSPPAPSTPGGSTGFPKLGEGDASRL 120
           PP+ +SPPS  SPP+ +SPPS  SPPS DSPPS DSPPAPSTPGG  G P++G   A+RL
Sbjct: 484 PPSGDSPPSGDSPPSGDSPPSGDSPPSDDSPPSGDSPPAPSTPGGGNGLPRIGA--AARL 543

Query: 121 NPLASLFVAVLVILAAV 134
           NPL S+FVAVL ILA V
Sbjct: 544 NPLGSVFVAVLAILAVV 557

BLAST of Cla97C01G020020 vs. NCBI nr
Match: XP_023518254.1 (aspartyl protease family protein 1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 151.0 bits (380), Expect = 7.3e-33
Identity = 91/134 (67.91%), Postives = 108/134 (80.60%), Query Frame = 0

Query: 1   MTGYRIIFNRDKMVLGWSPSDCYDNGDGAPSGNTPPSDSPPA-DSPPADNSPPSDASPPA 60
           MTGYRIIF+R+KMVLGWSPSDCYD+  G PSG+TPP+DSPPA DSPPAD+SPP++ SPPA
Sbjct: 427 MTGYRIIFDREKMVLGWSPSDCYDSDAGTPSGDTPPADSPPAKDSPPADDSPPAEDSPPA 486

Query: 61  DNSPPSDASPPADNSPPSDASPPSGDSPPSDDSPPAPSTPGGSTGFPKLGEGDASRLNPL 120
           ++SPP++ SPPA+            DSPP+DDSP  PS PGGSTG PK+G GDA+RLNPL
Sbjct: 487 EDSPPAEDSPPAE------------DSPPTDDSPSPPSAPGGSTGLPKIG-GDATRLNPL 546

Query: 121 ASLFVAVLVILAAV 134
            S+FVAVL ILA V
Sbjct: 547 TSVFVAVLAILAVV 547

BLAST of Cla97C01G020020 vs. NCBI nr
Match: XP_031742572.1 (aspartyl protease family protein 1 [Cucumis sativus] >KGN48951.1 hypothetical protein Csa_003569 [Cucumis sativus])

HSP 1 Score: 145.6 bits (366), Expect = 3.1e-31
Identity = 94/146 (64.38%), Postives = 110/146 (75.34%), Query Frame = 0

Query: 1   MTGYRIIFNRDKMVLGWSPSDCYDNGDGAPSGNTP----PSDSPPADSPPA--------- 60
           MTGYRI FNRD+MVLGWS SDCYDNG G PSG+TP    PSDSPP DSPP+         
Sbjct: 426 MTGYRITFNRDQMVLGWSSSDCYDNGVGTPSGDTPPADSPSDSPPTDSPPSVSPPTDSPP 485

Query: 61  DNSPPSDASPPADNSPPSDASPPADNSPPSDASPPSGDSPPSDDSPPAPSTPGGSTGFPK 120
            +SPP+D +PP+D+SPPS+ SPP+++SPPS+ SPPS DSPPS DSPPAPSTPGG  G P 
Sbjct: 486 SDSPPTDDTPPSDDSPPSEDSPPSEDSPPSEDSPPSDDSPPSGDSPPAPSTPGGRPGLP- 545

Query: 121 LGEGDASRLNPLASLFVAVLVILAAV 134
            G G A++LNPL  +F AVL ILA V
Sbjct: 546 -GLGGAAQLNPLGFVFGAVLAILALV 569

BLAST of Cla97C01G020020 vs. NCBI nr
Match: KAG6594755.1 (Aspartyl protease family protein 1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 142.9 bits (359), Expect = 2.0e-30
Identity = 87/133 (65.41%), Postives = 102/133 (76.69%), Query Frame = 0

Query: 1   MTGYRIIFNRDKMVLGWSPSDCYDNGDGAPSGNTPPSDSPPADSPPADNSPPSDASPPAD 60
           MTGYRIIF+R+KM LGWSPSDCYD+  G PSG+TPP+     DSPPAD+SPP+D SPPAD
Sbjct: 427 MTGYRIIFDREKMALGWSPSDCYDSDAGTPSGDTPPA----KDSPPADDSPPADDSPPAD 486

Query: 61  NSPPSDASPPADNSPPSDASPPSGDSPPSDDSPPAPSTPGGSTGFPKLGEGDASRLNPLA 120
           +SPP++ SPPA+            DSPP+DDSP  PS PGGSTG PK+G GDA+RLNPL 
Sbjct: 487 DSPPAEDSPPAE------------DSPPTDDSPSPPSAPGGSTGLPKIG-GDATRLNPLT 542

Query: 121 SLFVAVLVILAAV 134
           S+FVAVL ILA V
Sbjct: 547 SVFVAVLAILAVV 542

BLAST of Cla97C01G020020 vs. ExPASy TrEMBL
Match: A0A5A7SKI7 (Aspartyl protease family protein 1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G00960 PE=3 SV=1)

HSP 1 Score: 152.9 bits (385), Expect = 9.3e-34
Identity = 95/137 (69.34%), Postives = 109/137 (79.56%), Query Frame = 0

Query: 1   MTGYRIIFNRDKMVLGWSPSDCYDNGDGAPS----GNTPPSDSPPADSPPADNSPPSDAS 60
           MTGYRIIFNR +MVLGWSPSDCYDNGD  PS     ++PPSDSPP DSPP+D SPP+D +
Sbjct: 424 MTGYRIIFNRGEMVLGWSPSDCYDNGDSTPSDSPPADSPPSDSPPTDSPPSD-SPPTDDT 483

Query: 61  PPADNSPPSDASPPADNSPPSDASPPSGDSPPSDDSPPAPSTPGGSTGFPKLGEGDASRL 120
           PP+ +SPPS  SPP+ +SPPS  SPPS DSPPS DSPPAPSTPGG  G P++G   A+RL
Sbjct: 484 PPSGDSPPSGDSPPSGDSPPSGDSPPSDDSPPSGDSPPAPSTPGGGNGLPRIGA--AARL 543

Query: 121 NPLASLFVAVLVILAAV 134
           NPL S+FVAVL ILA V
Sbjct: 544 NPLGSVFVAVLAILAVV 557

BLAST of Cla97C01G020020 vs. ExPASy TrEMBL
Match: A0A0A0KML0 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G507250 PE=3 SV=1)

HSP 1 Score: 145.6 bits (366), Expect = 1.5e-31
Identity = 94/146 (64.38%), Postives = 110/146 (75.34%), Query Frame = 0

Query: 1   MTGYRIIFNRDKMVLGWSPSDCYDNGDGAPSGNTP----PSDSPPADSPPA--------- 60
           MTGYRI FNRD+MVLGWS SDCYDNG G PSG+TP    PSDSPP DSPP+         
Sbjct: 426 MTGYRITFNRDQMVLGWSSSDCYDNGVGTPSGDTPPADSPSDSPPTDSPPSVSPPTDSPP 485

Query: 61  DNSPPSDASPPADNSPPSDASPPADNSPPSDASPPSGDSPPSDDSPPAPSTPGGSTGFPK 120
            +SPP+D +PP+D+SPPS+ SPP+++SPPS+ SPPS DSPPS DSPPAPSTPGG  G P 
Sbjct: 486 SDSPPTDDTPPSDDSPPSEDSPPSEDSPPSEDSPPSDDSPPSGDSPPAPSTPGGRPGLP- 545

Query: 121 LGEGDASRLNPLASLFVAVLVILAAV 134
            G G A++LNPL  +F AVL ILA V
Sbjct: 546 -GLGGAAQLNPLGFVFGAVLAILALV 569

BLAST of Cla97C01G020020 vs. ExPASy TrEMBL
Match: A0A1S4DT41 (aspartyl protease family protein 1-like OS=Cucumis melo OX=3656 GN=LOC103485161 PE=3 SV=1)

HSP 1 Score: 138.7 bits (348), Expect = 1.8e-29
Identity = 92/133 (69.17%), Postives = 102/133 (76.69%), Query Frame = 0

Query: 1   MTGYRIIFNRDKMVLGWSPSDCYDNGDGAPSGNTPPSDSPPADSPPADNSPPSDASPPAD 60
           MTGYRIIFNR +MVLGWSPSDCYDNGD      + PSDSPPADSPP+D SPP+D      
Sbjct: 424 MTGYRIIFNRGEMVLGWSPSDCYDNGD------STPSDSPPADSPPSD-SPPTD------ 483

Query: 61  NSPPSDASPPADNSPPSDASPPSGDSPPSDDSPPAPSTPGGSTGFPKLGEGDASRLNPLA 120
            SPPSD SPP D++PPS  SPPS DSPPS DSPPAPSTPGG  G P++G   A+RLNPL 
Sbjct: 484 -SPPSD-SPPTDDTPPSGDSPPSDDSPPSGDSPPAPSTPGGGNGLPRIGA--AARLNPLG 539

Query: 121 SLFVAVLVILAAV 134
           S+FVAVL ILA V
Sbjct: 544 SVFVAVLAILAVV 539

BLAST of Cla97C01G020020 vs. ExPASy TrEMBL
Match: A0A6J1BTF7 (aspartyl protease family protein 1-like OS=Momordica charantia OX=3673 GN=LOC111005407 PE=3 SV=1)

HSP 1 Score: 136.3 bits (342), Expect = 9.0e-29
Identity = 88/136 (64.71%), Postives = 98/136 (72.06%), Query Frame = 0

Query: 1   MTGYRIIFNRDKMVLGWSPSDCYDNGDGAPSGNTPPSDSPPADSPPADNSPPSDASPPAD 60
           MTGYRIIFNR+KMVLGW+ SDCYDNG   PS N          SPPADNSPPSD      
Sbjct: 430 MTGYRIIFNREKMVLGWTESDCYDNGAATPSDN----------SPPADNSPPSD------ 489

Query: 61  NSPPSDASPPADNSPPSDASPPSGDSPPSDDSPPAPSTPGGSTG---FPKLGEGDASRLN 120
            SPP+D+SPP D+SPPSD SP SGDSPPS DSPPAPST GGS G    P++G GDA+RLN
Sbjct: 490 -SPPTDSSPPTDSSPPSD-SPQSGDSPPSGDSPPAPSTAGGSNGTQPLPRIGAGDATRLN 547

Query: 121 PLASLFVAVLVILAAV 134
           PL  +FVA+L IL  V
Sbjct: 550 PLGCVFVAILAILVVV 547

BLAST of Cla97C01G020020 vs. ExPASy TrEMBL
Match: A0A6J1EEK3 (aspartyl protease family protein 1-like OS=Cucurbita moschata OX=3662 GN=LOC111433625 PE=3 SV=1)

HSP 1 Score: 132.9 bits (333), Expect = 1.0e-27
Identity = 81/133 (60.90%), Postives = 98/133 (73.68%), Query Frame = 0

Query: 1   MTGYRIIFNRDKMVLGWSPSDCYDNGDGAPSGNTPPSDSPPADSPPADNSPPSDASPPAD 60
           MTGYRIIF+R+KM LGWSPSDCYD+  G PSG+TPP+     DSPPAD+SPP++ SPPA+
Sbjct: 427 MTGYRIIFDREKMALGWSPSDCYDSDAGTPSGDTPPA----KDSPPADDSPPAEDSPPAE 486

Query: 61  NSPPSDASPPADNSPPSDASPPSGDSPPSDDSPPAPSTPGGSTGFPKLGEGDASRLNPLA 120
           +SPP++                  DSPP+DDSP  PS+PGGSTG PK+G GDA+RLNPL 
Sbjct: 487 DSPPAE------------------DSPPTDDSPSPPSSPGGSTGLPKIG-GDATRLNPLT 536

Query: 121 SLFVAVLVILAAV 134
           S+FVAVL ILA V
Sbjct: 547 SVFVAVLAILAVV 536

BLAST of Cla97C01G020020 vs. TAIR 10
Match: AT3G51350.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 42.4 bits (98), Expect = 3.4e-04
Identity = 44/131 (33.59%), Postives = 61/131 (46.56%), Query Frame = 0

Query: 1   MTGYRIIFNRDKMVLGWSPSDCYDNGDGAPSGNTPPSDSPPADSPPADNSPPSDASPPAD 60
           + GYRI+F+R++M+LGW  S C++  D +    TPP   PP    PA             
Sbjct: 431 VAGYRIVFDRERMILGWKQSLCFE--DESLESTTPP---PPEVEAPA------------- 490

Query: 61  NSPPSDASPPADNSPPSDASPPSGDSPPSDDSPPAPSTPGGSTGFPKLGEGDASRLNPLA 120
                          PS ++PP    PP+  + P P  P  STG P  G G A+ L PLA
Sbjct: 491 ---------------PSVSAPPPRSLPPTVSATPPPINPRNSTGNP--GTGGAANLIPLA 526

Query: 121 SLFVAVLVILA 132
           S  + +L +LA
Sbjct: 551 SQLLLLLPLLA 526

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038882816.18.1e-4078.95aspartyl protease family protein 1-like [Benincasa hispida][more]
KAA0025642.11.9e-3369.34aspartyl protease family protein 1-like [Cucumis melo var. makuwa] >TYK12515.1 a... [more]
XP_023518254.17.3e-3367.91aspartyl protease family protein 1-like [Cucurbita pepo subsp. pepo][more]
XP_031742572.13.1e-3164.38aspartyl protease family protein 1 [Cucumis sativus] >KGN48951.1 hypothetical pr... [more]
KAG6594755.12.0e-3065.41Aspartyl protease family protein 1, partial [Cucurbita argyrosperma subsp. soror... [more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A5A7SKI79.3e-3469.34Aspartyl protease family protein 1-like OS=Cucumis melo var. makuwa OX=1194695 G... [more]
A0A0A0KML01.5e-3164.38Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G50725... [more]
A0A1S4DT411.8e-2969.17aspartyl protease family protein 1-like OS=Cucumis melo OX=3656 GN=LOC103485161 ... [more]
A0A6J1BTF79.0e-2964.71aspartyl protease family protein 1-like OS=Momordica charantia OX=3673 GN=LOC111... [more]
A0A6J1EEK31.0e-2760.90aspartyl protease family protein 1-like OS=Cucurbita moschata OX=3662 GN=LOC1114... [more]
Match NameE-valueIdentityDescription
AT3G51350.13.4e-0433.59Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 10..114
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 34..95

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C01G020020.1Cla97C01G020020.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
molecular_function GO:0004190 aspartic-type endopeptidase activity