Cla97C07G138030 (gene) Watermelon (97103) v2

Name: Cla97C07G138030
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: aspartic proteinase CDR1-like
Location: Cla97Chr07 : 25686623 .. 25687927 (+)
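
The location gives a start and an end coordinate on the plus strand of Cla97Chr07. If these are 1-based and inclusive (an assumption here, though it fits a coding region whose length is a multiple of three), the feature length is end - start + 1. A minimal Python sketch; the function name `feature_length` is illustrative and not part of any database API:

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field of this record.
length = feature_length(25686623, 25687927)
print(length)  # 1305
```

A length of 1305 bp corresponds to 435 codons, including the stop codon.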
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTGCCATTTCAATCTTCTTCTATTTCCTCCTCTTCTTCTTCTCGGAAGCAACCACCAATGGCGGTAGCGGCAATGGCTTCACCACCTCTCTTTTCCACCGCGATACCCTTCTCTCTCCTCTCCACAACCCATCTCTCTCCCGCTACGACCGCATTACCAATGCCTTCCGTCGCTCCTTCTCCCGCTCCGCCACCCTCTTCAAGCATGTCACTACCGTCTCCACTGCCTGCATCCAATCTCCGATCATCCCCGACAGCGGTGAGTTCCTAATGTCTGTCTCTATTGGGACCCCGCCGGTTGATTTCATAGCCATCGCGGATACTGGCAGCGATCTGACGTGGACCCAATGCTTGCCATGTCAGGAATGCTTCAACCAATCACGTCGCATTTTTAATCCACGTCGATCATCTTCCTACCGTAACGTGTCTTGCACGTCTGATACTTGTCGCTCCCTCGACAGTTACCATTGTGGGCCCGACCTCCAAACCTGTAGCTATGGCTACAGCTATGGAGACCGATCCTTTACGTATGGTGACCTAGCATCTGATAAAATTACCATCGGGTCCTTCAAACTCTCCAAGACCCTCATTGGATGCGGCCACCAAAATGGTGGCACTTTCGGCGGAGTTACCTCGGGGATCATCGGACTTGGCGGTGGCGCTCTCTCTTTGGTGTCGCAAATGAACACAATCGCTGCCATCAAACGGCAATTCTCATATTGTTTGCCAACTTTCTTCAGTAACGCAAATATCACAGGTAAGATAAGCTTTGGCCAAAATGCCGTCGTTTCAGGGCCTAAAGTCATTTCTACCCCTCTCGTAGCGAAATCTCCCGATACCTTCTATTTCTTAACTCTTGAAGCAATCTCTGTCGCAAACAAGCGGTTTGAAGCTACAAACGACATGTCGGCCATGACCAAACGAGGGAATATTATTATCGATTCTGGTACGACATTGACGTTTCTGCCTCGAAATCTATACGCCGGTGTTGTTTCGACTTTGGCGAGTGTTATTAAAGCAAAGCGAGTGGATGATCCAGCTGGGATTTTGGAACTCTGTTACACTGCGGGCAGCGTTGAGGATTTGAATATTCCAGTCATTGCGGCACATTTTGGCGGTGGCGCCGACGTCAAATTGCTACCGTTGAACACATTTGCGTTGGTGGCTGAGAATGTGAGTTGTTTGACTTTGGCGCCGGCATCGGATTTGGCCATTTTTGGGAACTTGGCGCAAATTAACTTTGTAGTCGGATATGATCTTGAGCAGAAGAGATTGTCGTTTAAACCTACTGTTTGTGCTTAG

mRNA sequence

ATGGCTGCCATTTCAATCTTCTTCTATTTCCTCCTCTTCTTCTTCTCGGAAGCAACCACCAATGGCGGTAGCGGCAATGGCTTCACCACCTCTCTTTTCCACCGCGATACCCTTCTCTCTCCTCTCCACAACCCATCTCTCTCCCGCTACGACCGCATTACCAATGCCTTCCGTCGCTCCTTCTCCCGCTCCGCCACCCTCTTCAAGCATGTCACTACCGTCTCCACTGCCTGCATCCAATCTCCGATCATCCCCGACAGCGGTGAGTTCCTAATGTCTGTCTCTATTGGGACCCCGCCGGTTGATTTCATAGCCATCGCGGATACTGGCAGCGATCTGACGTGGACCCAATGCTTGCCATGTCAGGAATGCTTCAACCAATCACGTCGCATTTTTAATCCACGTCGATCATCTTCCTACCGTAACGTGTCTTGCACGTCTGATACTTGTCGCTCCCTCGACAGTTACCATTGTGGGCCCGACCTCCAAACCTGTAGCTATGGCTACAGCTATGGAGACCGATCCTTTACGTATGGTGACCTAGCATCTGATAAAATTACCATCGGGTCCTTCAAACTCTCCAAGACCCTCATTGGATGCGGCCACCAAAATGGTGGCACTTTCGGCGGAGTTACCTCGGGGATCATCGGACTTGGCGGTGGCGCTCTCTCTTTGGTGTCGCAAATGAACACAATCGCTGCCATCAAACGGCAATTCTCATATTGTTTGCCAACTTTCTTCAGTAACGCAAATATCACAGGTAAGATAAGCTTTGGCCAAAATGCCGTCGTTTCAGGGCCTAAAGTCATTTCTACCCCTCTCGTAGCGAAATCTCCCGATACCTTCTATTTCTTAACTCTTGAAGCAATCTCTGTCGCAAACAAGCGGTTTGAAGCTACAAACGACATGTCGGCCATGACCAAACGAGGGAATATTATTATCGATTCTGGTACGACATTGACGTTTCTGCCTCGAAATCTATACGCCGGTGTTGTTTCGACTTTGGCGAGTGTTATTAAAGCAAAGCGAGTGGATGATCCAGCTGGGATTTTGGAACTCTGTTACACTGCGGGCAGCGTTGAGGATTTGAATATTCCAGTCATTGCGGCACATTTTGGCGGTGGCGCCGACGTCAAATTGCTACCGTTGAACACATTTGCGTTGGTGGCTGAGAATGTGAGTTGTTTGACTTTGGCGCCGGCATCGGATTTGGCCATTTTTGGGAACTTGGCGCAAATTAACTTTGTAGTCGGATATGATCTTGAGCAGAAGAGATTGTCGTTTAAACCTACTGTTTGTGCTTAG

Coding sequence (CDS)

ATGGCTGCCATTTCAATCTTCTTCTATTTCCTCCTCTTCTTCTTCTCGGAAGCAACCACCAATGGCGGTAGCGGCAATGGCTTCACCACCTCTCTTTTCCACCGCGATACCCTTCTCTCTCCTCTCCACAACCCATCTCTCTCCCGCTACGACCGCATTACCAATGCCTTCCGTCGCTCCTTCTCCCGCTCCGCCACCCTCTTCAAGCATGTCACTACCGTCTCCACTGCCTGCATCCAATCTCCGATCATCCCCGACAGCGGTGAGTTCCTAATGTCTGTCTCTATTGGGACCCCGCCGGTTGATTTCATAGCCATCGCGGATACTGGCAGCGATCTGACGTGGACCCAATGCTTGCCATGTCAGGAATGCTTCAACCAATCACGTCGCATTTTTAATCCACGTCGATCATCTTCCTACCGTAACGTGTCTTGCACGTCTGATACTTGTCGCTCCCTCGACAGTTACCATTGTGGGCCCGACCTCCAAACCTGTAGCTATGGCTACAGCTATGGAGACCGATCCTTTACGTATGGTGACCTAGCATCTGATAAAATTACCATCGGGTCCTTCAAACTCTCCAAGACCCTCATTGGATGCGGCCACCAAAATGGTGGCACTTTCGGCGGAGTTACCTCGGGGATCATCGGACTTGGCGGTGGCGCTCTCTCTTTGGTGTCGCAAATGAACACAATCGCTGCCATCAAACGGCAATTCTCATATTGTTTGCCAACTTTCTTCAGTAACGCAAATATCACAGGTAAGATAAGCTTTGGCCAAAATGCCGTCGTTTCAGGGCCTAAAGTCATTTCTACCCCTCTCGTAGCGAAATCTCCCGATACCTTCTATTTCTTAACTCTTGAAGCAATCTCTGTCGCAAACAAGCGGTTTGAAGCTACAAACGACATGTCGGCCATGACCAAACGAGGGAATATTATTATCGATTCTGGTACGACATTGACGTTTCTGCCTCGAAATCTATACGCCGGTGTTGTTTCGACTTTGGCGAGTGTTATTAAAGCAAAGCGAGTGGATGATCCAGCTGGGATTTTGGAACTCTGTTACACTGCGGGCAGCGTTGAGGATTTGAATATTCCAGTCATTGCGGCACATTTTGGCGGTGGCGCCGACGTCAAATTGCTACCGTTGAACACATTTGCGTTGGTGGCTGAGAATGTGAGTTGTTTGACTTTGGCGCCGGCATCGGATTTGGCCATTTTTGGGAACTTGGCGCAAATTAACTTTGTAGTCGGATATGATCTTGAGCAGAAGAGATTGTCGTTTAAACCTACTGTTTGTGCTTAG

Protein sequence

MAAISIFFYFLLFFFSEATTNGGSGNGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGCGHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEATNDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAPASDLAIFGNLAQINFVVGYDLEQKRLSFKPTVCA
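
The protein sequence above should be recoverable from the CDS by translating it in frame 0. The following is a small sketch of that check, assuming the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are illustrative, and only the first 18 nucleotides of the CDS are used in the example.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore a trailing partial codon, if any.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # "X" for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 nt of the CDS listed above.
print(translate("ATGGCTGCCATTTCAATC"))  # MAAISI
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence would extend the check to the whole record.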
BLAST of Cla97C07G138030 vs. NCBI nr
Match: XP_008452150.1 (PREDICTED: probable aspartic protease At2g35615 [Cucumis melo])

HSP 1 Score: 674.5 bits (1739), Expect = 2.4e-190
Identity = 334/405 (82.47%), Positives = 369/405 (91.11%), Query Frame = 0

Query: 30  TSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGE 89
           TSL+HRD+LLSPLHNPSLSRYD +  +FRRSFSRSATL  H+T+VSTACI+SPIIPDSGE
Sbjct: 30  TSLYHRDSLLSPLHNPSLSRYDSLVESFRRSFSRSATLLNHLTSVSTACIRSPIIPDSGE 89

Query: 90  FLMSVSIGTPPVDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDT 149
           FLMS+ IGTP V+FIAIADTGSDLTWTQCLPC+ECFNQS+ IFNPRRSSSYR VSC+SDT
Sbjct: 90  FLMSIFIGTPRVNFIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCSSDT 149

Query: 150 CRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGCGHQNGGTFG 209
           CRSL+S HCG DL++CSYGYSYGDRSFTYGDLASDKITIGSFKL KT+IGCGHQNGGTFG
Sbjct: 150 CRSLESSHCGLDLKSCSYGYSYGDRSFTYGDLASDKITIGSFKLPKTVIGCGHQNGGTFG 209

Query: 210 GVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKV 269
           GVTSGIIGLGGG+LSLVSQM+TIA +K QFSYCLPTFFSN NITGKISFG+ AVVSG +V
Sbjct: 210 GVTSGIIGLGGGSLSLVSQMSTIAGVKPQFSYCLPTFFSNENITGKISFGRKAVVSGRQV 269

Query: 270 ISTPLVAKSPDTFYFLTLEAISVANKRFEATNDMSAMTKRGNIIIDSGTTLTFLPRNLYA 329
           +STPLV +SPDTFYFLTLEAISV NKRF+A  DMSAMT +GNIIIDSGTTLT LPR+LY 
Sbjct: 270 VSTPLVPRSPDTFYFLTLEAISVGNKRFKAAKDMSAMTNQGNIIIDSGTTLTLLPRSLYD 329

Query: 330 GVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALV 389
           GVVSTLA VIK KRVDDP+GILELCY+AG +EDLNIP+I AHF G ADVKLLP+NTFA V
Sbjct: 330 GVVSTLARVIKTKRVDDPSGILELCYSAGQLEDLNIPIITAHFSGRADVKLLPVNTFAPV 389

Query: 390 AENVSCLTLAPASDLAIFGNLAQINFVVGYDLEQKRLSFKPTVCA 435
           A+NV CLTLAPA+++AIFGNLAQINF VGYDL  KRLSFKPT CA
Sbjct: 390 ADNVICLTLAPATNVAIFGNLAQINFEVGYDLGNKRLSFKPTRCA 434
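
Each hit in this record starts with two summary lines: one with the bit score, raw score and E-value, and one with the identity and positives counts. The sketch below shows one way to pull those numbers out of the text and recompute the percentages. It assumes the line layout seen in this record; the regular expressions and the function name `parse_hsp` are hypothetical and do not belong to any BLAST tool.

```python
import re

# Patterns written for the summary lines as they appear in this record.
HSP_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+) bits \((?P<raw>\d+)\), Expect = (?P<evalue>\S+)"
)
# "Posi?tives" accepts both the spelling "Positives" and the variant "Postives".
IDENT_RE = re.compile(
    r"Identity = (?P<ident>\d+)/(?P<length>\d+).*?Posi?tives = (?P<pos>\d+)/\d+"
)


def parse_hsp(score_line: str, identity_line: str) -> dict:
    """Extract scores and recompute percentages from two HSP summary lines."""
    s = HSP_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("lines do not look like an HSP summary")
    length = int(i["length"])
    return {
        "bits": float(s["bits"]),
        "raw_score": int(s["raw"]),
        "evalue": float(s["evalue"]),
        "identity_pct": round(100 * int(i["ident"]) / length, 2),
        "positives_pct": round(100 * int(i["pos"]) / length, 2),
    }


hsp = parse_hsp(
    "HSP 1 Score: 674.5 bits (1739), Expect = 2.4e-190",
    "Identity = 334/405 (82.47%), Positives = 369/405 (91.11%), Query Frame = 0",
)
print(hsp["identity_pct"], hsp["positives_pct"])  # 82.47 91.11
```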

BLAST of Cla97C07G138030 vs. NCBI nr
Match: XP_004149004.1 (PREDICTED: probable aspartic protease At2g35615 [Cucumis sativus] >KGN53446.1 hypothetical protein Csa_4G055410 [Cucumis sativus])

HSP 1 Score: 674.1 bits (1738), Expect = 3.2e-190
Identity = 358/434 (82.49%), Positives = 393/434 (90.55%), Query Frame = 0

Query: 1   MAAISIXXXXXXXXXXXXXXXXXXXXGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRS 60
           MAAISIXXXXXXXXXXXXXXXXXXXX  TTSLF RD+ LSPLHNPSLSRYD + +AFRRS
Sbjct: 1   MAAISIXXXXXXXXXXXXXXXXXXXXXXTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRS 60

Query: 61  FSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLP 120
           FSRSATL  H+T+VSTACI+SPIIPDSGEFLMS+ IGTPPV+ IAIADTGSDLTWTQCLP
Sbjct: 61  FSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLP 120

Query: 121 CQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGD 180
           C+ECFNQS+ IFNPRRSSSYR VSC SDTCRSL+SYHCGPDLQ+CSYGYSYGDRSFTYGD
Sbjct: 121 CRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGD 180

Query: 181 LASDKITIGSFKLSKTLIGCGHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFS 240
           LASD+ITIGSFKL KT+IGCGHQNGGTFGGVTSGIIGLGGG+LSLVSQM TIA +K +FS
Sbjct: 181 LASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFS 240

Query: 241 YCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEAT 300
           YCLPTFFSNANITG ISFG+ AVVSG +V+STPLV +SPDTFYFLTLEAISV  KRF+A 
Sbjct: 241 YCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAA 300

Query: 301 NDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSV 360
           N +SAMT  GNIIIDSGTTLT LPR+LY GV STLA VIKAKRVDDP+GILELCY+AG V
Sbjct: 301 NGISAMTNHGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQV 360

Query: 361 EDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAPASDLAIFGNLAQINFVVGYD 420
           +DLNIP+I AHF GGADVKLLP+NTFA VA+NV+CLT APA+ +AIFGNLAQINF VGYD
Sbjct: 361 DDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYD 420

Query: 421 LEQKRLSFKPTVCA 435
           L  KRLSF+P +CA
Sbjct: 421 LGNKRLSFEPKLCA 434

BLAST of Cla97C07G138030 vs. NCBI nr
Match: XP_023552860.1 (aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 629.4 bits (1622), Expect = 9.0e-177
Identity = 322/433 (74.36%), Positives = 364/433 (84.06%), Query Frame = 0

Query: 1   MAAISIXXXXXXXXXXXXXXXXXXXXGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRS 60
           MAAISI      XXXXXXXXX     GFTTS+ HRD+LLSPLHNPS+S Y+R+T AF RS
Sbjct: 1   MAAISIFFYFLLXXXXXXXXXRGGGNGFTTSIIHRDSLLSPLHNPSVSHYERLTGAFNRS 60

Query: 61  FSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLP 120
           FSRS TL     TVST  + SP+IPDSGEFL+S+SIGTPPVDF AIADTGSDLTWTQCLP
Sbjct: 61  FSRSTTLTNRAATVSTGGVHSPLIPDSGEFLISLSIGTPPVDFTAIADTGSDLTWTQCLP 120

Query: 121 CQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGD 180
           C +CFNQS  IFNP RSSSY NVSCTSDTC S+ S+ CGPDL+TC+YGYSYGD+SFTYGD
Sbjct: 121 CVKCFNQSSPIFNPHRSSSYSNVSCTSDTCNSIVSHRCGPDLKTCTYGYSYGDQSFTYGD 180

Query: 181 LASDKITIGSFKLSKTLIGCGHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFS 240
           LA +KITIGSFKL+K +IGCGH+NGGTF G TSGI+GLGGG LSLVSQ+NTIAA+KRQFS
Sbjct: 181 LAYEKITIGSFKLNKVVIGCGHENGGTFLGETSGIVGLGGGPLSLVSQLNTIAAVKRQFS 240

Query: 241 YCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEAT 300
           YCLPTFFS+ NITGKISFG+ A VSG KV+STPLV K P T+YFLTLEA+SVANKRFE  
Sbjct: 241 YCLPTFFSDGNITGKISFGEEAAVSGRKVVSTPLVQKHPYTYYFLTLEAVSVANKRFEVA 300

Query: 301 NDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSV 360
           N+MS+    GNIIIDSGTTLT LP NLY G+VSTLA V+KAKRV+DP+GILELCY   S+
Sbjct: 301 NNMSSAVVEGNIIIDSGTTLTLLPPNLYDGIVSTLARVVKAKRVNDPSGILELCYGVSSI 360

Query: 361 EDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAPASDLAIFGNLAQINFVVGYD 420
           +DLNIP+I AHF GGA V+L P NTFALV E+V+CLTLAPA   AIFGNLAQ+NF+VGYD
Sbjct: 361 DDLNIPIITAHFAGGAAVELQPENTFALVNEDVACLTLAPAKKFAIFGNLAQVNFLVGYD 420

Query: 421 LEQKRLSFKPTVC 434
           LEQK +SFK TVC
Sbjct: 421 LEQKTVSFKRTVC 433

BLAST of Cla97C07G138030 vs. NCBI nr
Match: XP_023543528.1 (aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 624.8 bits (1610), Expect = 2.2e-175
Identity = 305/408 (74.75%), Positives = 350/408 (85.78%), Query Frame = 0

Query: 27  GFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPD 86
           GFTTSLFHRD+ LSPL+NPSLS YDR+TNAFRRSFSRS TL      VST  I S IIPD
Sbjct: 30  GFTTSLFHRDSFLSPLYNPSLSHYDRLTNAFRRSFSRSDTLLNRAAAVSTTGIHSRIIPD 89

Query: 87  SGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCT 146
            GEFLMS+SIGTP V  +AIADTGSDLTWTQC+PC +CFNQS  IFNPRRS SYR+VSCT
Sbjct: 90  DGEFLMSISIGTPRVKIMAIADTGSDLTWTQCMPCHKCFNQSFPIFNPRRSFSYRHVSCT 149

Query: 147 SDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGCGHQNGG 206
           S+ CRSLD Y CGPD +TCSYGYSYGD+SFTYGDLAS+KIT+GSFKL KT+IGCGH NGG
Sbjct: 150 SNACRSLDDYRCGPDNRTCSYGYSYGDQSFTYGDLASEKITVGSFKLYKTVIGCGHVNGG 209

Query: 207 TFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSG 266
           TF G TSGIIGLGGG LSL+SQM  IAA+KR+FSYCLPTFFS+ N+TGKISFG+ A+VSG
Sbjct: 210 TFSGDTSGIIGLGGGPLSLISQMRKIAAVKRRFSYCLPTFFSDKNVTGKISFGKKAIVSG 269

Query: 267 PKVISTPLVAKSPDTFYFLTLEAISVANKRFEATNDMSAMTKRGNIIIDSGTTLTFLPRN 326
            KVISTPLV K P+TFY++TL+A+SVANKRF+A N+MSA  +RGNI+IDSGTTLT LP N
Sbjct: 270 RKVISTPLVLKEPNTFYYVTLKAMSVANKRFKAANNMSAAVERGNILIDSGTTLTILPPN 329

Query: 327 LYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTF 386
           LY GV STLA V+KAKRV+DP G+L+LC+   SV+ LNIPVI AHF GGADVKLLPLNTF
Sbjct: 330 LYKGVASTLAHVVKAKRVNDPTGVLDLCFATRSVDHLNIPVITAHFAGGADVKLLPLNTF 389

Query: 387 ALVAENVSCLTLAPASDLAIFGNLAQINFVVGYDLEQKRLSFKPTVCA 435
           A+VA+NV+CL   P+++ AIFGNLAQ+NF+VGYDLE+KRLSFK  VCA
Sbjct: 390 AMVADNVACLAFVPSANFAIFGNLAQVNFLVGYDLERKRLSFKYNVCA 437

BLAST of Cla97C07G138030 vs. NCBI nr
Match: XP_022931430.1 (aspartic proteinase CDR1-like [Cucurbita moschata])

HSP 1 Score: 622.5 bits (1604), Expect = 1.1e-174
Identity = 320/433 (73.90%), Positives = 362/433 (83.60%), Query Frame = 0

Query: 1   MAAISIXXXXXXXXXXXXXXXXXXXXGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRS 60
           MAAISI      XXXXXXXXX     GFTTS+ HRD+LLSPLHNPS+SRY R+T AF RS
Sbjct: 1   MAAISIFFYFLLXXXXXXXXXRGGGNGFTTSIIHRDSLLSPLHNPSVSRYQRLTGAFNRS 60

Query: 61  FSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLP 120
           FSRS TL    TTVST  + SP+IPDSGEFL+S+SIGTPPVDF AIADTGSDLTWTQCLP
Sbjct: 61  FSRSTTLTNRATTVSTGGVHSPLIPDSGEFLISLSIGTPPVDFTAIADTGSDLTWTQCLP 120

Query: 121 CQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGD 180
           C +CFNQS  IFNP RSSSY NVSCTSDTC S+ S+ CGPDL+TC+YGYSYGD+SFTYGD
Sbjct: 121 CVKCFNQSNPIFNPHRSSSYSNVSCTSDTCNSIVSHRCGPDLKTCTYGYSYGDQSFTYGD 180

Query: 181 LASDKITIGSFKLSKTLIGCGHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFS 240
           LA +KITIGSFKL+K +IGCGH+NGGTF G TSGI+GLGGG LSLVSQ+NTIA +KRQFS
Sbjct: 181 LAYEKITIGSFKLNKVVIGCGHENGGTFLGETSGIVGLGGGPLSLVSQLNTIATVKRQFS 240

Query: 241 YCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEAT 300
           YCLPTFFS+ NITGKISFG+ A VSG KV+STPLV K P T+YFLTLEAISVANKRFE  
Sbjct: 241 YCLPTFFSDGNITGKISFGEEAAVSGRKVVSTPLVQKHPYTYYFLTLEAISVANKRFEVA 300

Query: 301 NDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSV 360
           ++MS+    GNIIIDSGTTLT LP N+Y GVVS LA V+KAKRV+DP+GILELCY   S+
Sbjct: 301 DNMSSAVVEGNIIIDSGTTLTLLPPNMYDGVVSALARVVKAKRVNDPSGILELCYGVSSI 360

Query: 361 EDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAPASDLAIFGNLAQINFVVGYD 420
           +DLNIP+I AHF GGA V+L   NTFALV E+V+CLTLAPA   AIFGNLAQ+NF+VGYD
Sbjct: 361 DDLNIPIITAHFAGGAAVELQLENTFALVNEDVACLTLAPAKKFAIFGNLAQVNFLVGYD 420

Query: 421 LEQKRLSFKPTVC 434
           LEQK +SFK T+C
Sbjct: 421 LEQKTVSFKRTLC 433

BLAST of Cla97C07G138030 vs. TrEMBL
Match: tr|A0A1S3BT75|A0A1S3BT75_CUCME (probable aspartic protease At2g35615 OS=Cucumis melo OX=3656 GN=LOC103493255 PE=3 SV=1)

HSP 1 Score: 674.5 bits (1739), Expect = 1.6e-190
Identity = 334/405 (82.47%), Positives = 369/405 (91.11%), Query Frame = 0

Query: 30  TSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGE 89
           TSL+HRD+LLSPLHNPSLSRYD +  +FRRSFSRSATL  H+T+VSTACI+SPIIPDSGE
Sbjct: 30  TSLYHRDSLLSPLHNPSLSRYDSLVESFRRSFSRSATLLNHLTSVSTACIRSPIIPDSGE 89

Query: 90  FLMSVSIGTPPVDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDT 149
           FLMS+ IGTP V+FIAIADTGSDLTWTQCLPC+ECFNQS+ IFNPRRSSSYR VSC+SDT
Sbjct: 90  FLMSIFIGTPRVNFIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCSSDT 149

Query: 150 CRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGCGHQNGGTFG 209
           CRSL+S HCG DL++CSYGYSYGDRSFTYGDLASDKITIGSFKL KT+IGCGHQNGGTFG
Sbjct: 150 CRSLESSHCGLDLKSCSYGYSYGDRSFTYGDLASDKITIGSFKLPKTVIGCGHQNGGTFG 209

Query: 210 GVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKV 269
           GVTSGIIGLGGG+LSLVSQM+TIA +K QFSYCLPTFFSN NITGKISFG+ AVVSG +V
Sbjct: 210 GVTSGIIGLGGGSLSLVSQMSTIAGVKPQFSYCLPTFFSNENITGKISFGRKAVVSGRQV 269

Query: 270 ISTPLVAKSPDTFYFLTLEAISVANKRFEATNDMSAMTKRGNIIIDSGTTLTFLPRNLYA 329
           +STPLV +SPDTFYFLTLEAISV NKRF+A  DMSAMT +GNIIIDSGTTLT LPR+LY 
Sbjct: 270 VSTPLVPRSPDTFYFLTLEAISVGNKRFKAAKDMSAMTNQGNIIIDSGTTLTLLPRSLYD 329

Query: 330 GVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALV 389
           GVVSTLA VIK KRVDDP+GILELCY+AG +EDLNIP+I AHF G ADVKLLP+NTFA V
Sbjct: 330 GVVSTLARVIKTKRVDDPSGILELCYSAGQLEDLNIPIITAHFSGRADVKLLPVNTFAPV 389

Query: 390 AENVSCLTLAPASDLAIFGNLAQINFVVGYDLEQKRLSFKPTVCA 435
           A+NV CLTLAPA+++AIFGNLAQINF VGYDL  KRLSFKPT CA
Sbjct: 390 ADNVICLTLAPATNVAIFGNLAQINFEVGYDLGNKRLSFKPTRCA 434

BLAST of Cla97C07G138030 vs. TrEMBL
Match: tr|A0A0A0KZZ3|A0A0A0KZZ3_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G055410 PE=3 SV=1)

HSP 1 Score: 674.1 bits (1738), Expect = 2.1e-190
Identity = 358/434 (82.49%), Positives = 393/434 (90.55%), Query Frame = 0

Query: 1   MAAISIXXXXXXXXXXXXXXXXXXXXGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRS 60
           MAAISIXXXXXXXXXXXXXXXXXXXX  TTSLF RD+ LSPLHNPSLSRYD + +AFRRS
Sbjct: 1   MAAISIXXXXXXXXXXXXXXXXXXXXXXTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRS 60

Query: 61  FSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLP 120
           FSRSATL  H+T+VSTACI+SPIIPDSGEFLMS+ IGTPPV+ IAIADTGSDLTWTQCLP
Sbjct: 61  FSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLP 120

Query: 121 CQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGD 180
           C+ECFNQS+ IFNPRRSSSYR VSC SDTCRSL+SYHCGPDLQ+CSYGYSYGDRSFTYGD
Sbjct: 121 CRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGD 180

Query: 181 LASDKITIGSFKLSKTLIGCGHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFS 240
           LASD+ITIGSFKL KT+IGCGHQNGGTFGGVTSGIIGLGGG+LSLVSQM TIA +K +FS
Sbjct: 181 LASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFS 240

Query: 241 YCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEAT 300
           YCLPTFFSNANITG ISFG+ AVVSG +V+STPLV +SPDTFYFLTLEAISV  KRF+A 
Sbjct: 241 YCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAA 300

Query: 301 NDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSV 360
           N +SAMT  GNIIIDSGTTLT LPR+LY GV STLA VIKAKRVDDP+GILELCY+AG V
Sbjct: 301 NGISAMTNHGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQV 360

Query: 361 EDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAPASDLAIFGNLAQINFVVGYD 420
           +DLNIP+I AHF GGADVKLLP+NTFA VA+NV+CLT APA+ +AIFGNLAQINF VGYD
Sbjct: 361 DDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYD 420

Query: 421 LEQKRLSFKPTVCA 435
           L  KRLSF+P +CA
Sbjct: 421 LGNKRLSFEPKLCA 434

BLAST of Cla97C07G138030 vs. TrEMBL
Match: tr|A0A1S3BUB0|A0A1S3BUB0_CUCME (probable aspartic protease At2g35615 OS=Cucumis melo OX=3656 GN=LOC103493257 PE=3 SV=1)

HSP 1 Score: 486.9 bits (1252), Expect = 4.8e-134
Identity = 251/412 (60.92%), Positives = 306/412 (74.27%), Query Frame = 0

Query: 27  GFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPD 86
           GFTTSLFHRD+LLSPL   SLS YDR++NAFRRS SRSA L     T     +QSPI P 
Sbjct: 27  GFTTSLFHRDSLLSPLEFSSLSHYDRLSNAFRRSLSRSAALLNRAATSGAVGLQSPIAPG 86

Query: 87  SGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCT 146
           SGE+LMSVSIGTPPVD+I +ADTGSDLTW QCLPC +CF QSR IFNP +S+S+ +V C 
Sbjct: 87  SGEYLMSVSIGTPPVDYIGLADTGSDLTWAQCLPCVKCFKQSRPIFNPLKSTSFSHVPCN 146

Query: 147 SDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGCGHQNGG 206
           S  C+++D  HCG     C Y Y+YGD+++T GDL  +KITIGS  + K++IGCGH++GG
Sbjct: 147 SQICQAIDDAHCGVQ-GVCDYSYTYGDQTYTKGDLGLEKITIGSSSV-KSVIGCGHESGG 206

Query: 207 TFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSG 266
            F G  SG+IGLGGG LSLVSQM+  + I R+FSYCLPT  S+AN  GKI+FGQNAVVSG
Sbjct: 207 GF-GFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHAN--GKINFGQNAVVSG 266

Query: 267 PKVISTPLVAKSPDTFYFLTLEAISVANKRFEATNDMSAMTKRGNIIIDSGTTLTFLPRN 326
           P V+STPL++K P T+Y++TLEAIS+ N+R  A+       K+GN+IIDSGTTLT LP+ 
Sbjct: 267 PGVVSTPLISKDPVTYYYITLEAISIGNERHMAS------AKQGNVIIDSGTTLTVLPKE 326

Query: 327 LYAGVVSTLASVIKAKRVDDPAGILELCYTAG--SVEDLNIPVIAAHFGGGADVKLLPLN 386
           LY GVVS+L  V+KAKRV DP    +LC+  G        IP+I AHF GGA+V LLP+N
Sbjct: 327 LYDGVVSSLLKVVKAKRVKDPGSFWDLCFDDGINVAASSGIPIITAHFSGGANVNLLPVN 386

Query: 387 TFALVAENVSCLTL---APASDLAIFGNLAQINFVVGYDLEQKRLSFKPTVC 434
           TF  VA NV+CLTL   +P  +  I GNLAQ NF++GYDLE KRLSFKPTVC
Sbjct: 387 TFQKVANNVNCLTLTAASPTDEFGIIGNLAQANFLIGYDLEAKRLSFKPTVC 427

BLAST of Cla97C07G138030 vs. TrEMBL
Match: tr|A0A0A0KV20|A0A0A0KV20_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G055400 PE=3 SV=1)

HSP 1 Score: 482.3 bits (1240), Expect = 1.2e-132
Identity = 250/413 (60.53%), Positives = 307/413 (74.33%), Query Frame = 0

Query: 27  GFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPD 86
           GFTTSLFHRD+LLSPL   SLS YDR+ NAFRRS SRSA L     T     +QS I P 
Sbjct: 29  GFTTSLFHRDSLLSPLEFSSLSHYDRLANAFRRSLSRSAALLNRAATSGAVGLQSSIGPG 88

Query: 87  SGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCT 146
           SGE+LMSVSIGTPPVD++ IADTGSDLTW QCLPC +C+ Q R IFNP +S+S+ +V C 
Sbjct: 89  SGEYLMSVSIGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPCN 148

Query: 147 SDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGCGHQNGG 206
           + TC ++D  HCG     C Y Y+YGDR+++ GDL  +KITIGS  + K++IGCGH + G
Sbjct: 149 TQTCHAVDDGHCGVQ-GVCDYSYTYGDRTYSKGDLGFEKITIGSSSV-KSVIGCGHASSG 208

Query: 207 TFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSG 266
            F G  SG+IGLGGG LSLVSQM+  + I R+FSYCLPT  S+AN  GKI+FG+NAVVSG
Sbjct: 209 GF-GFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHAN--GKINFGENAVVSG 268

Query: 267 PKVISTPLVAKSPDTFYFLTLEAISVANKRFEATNDMSAMTKRGNIIIDSGTTLTFLPRN 326
           P V+STPL++K+  T+Y++TLEAIS+ N+R        A  K+GN+IIDSGTTLT LP+ 
Sbjct: 269 PGVVSTPLISKNTVTYYYITLEAISIGNERH------MAFAKQGNVIIDSGTTLTILPKE 328

Query: 327 LYAGVVSTLASVIKAKRVDDPAGILELCYTAG--SVEDLNIPVIAAHFGGGADVKLLPLN 386
           LY GVVS+L  V+KAKRV DP G L+LC+  G  +   L IPVI AHF GGA+V LLP+N
Sbjct: 329 LYDGVVSSLLKVVKAKRVKDPHGSLDLCFDDGINAAASLGIPVITAHFSGGANVNLLPIN 388

Query: 387 TFALVAENVSCLTL---APASDLAIFGNLAQINFVVGYDLEQKRLSFKPTVCA 435
           TF  VA+NV+CLTL   +P ++  I GNLAQ NF++GYDLE KRLSFKPTVCA
Sbjct: 389 TFRKVADNVNCLTLKAASPTTEFGIIGNLAQANFLIGYDLEAKRLSFKPTVCA 430

BLAST of Cla97C07G138030 vs. TrEMBL
Match: tr|A0A1S3BTZ4|A0A1S3BTZ4_CUCME (probable aspartic protease At2g35615 OS=Cucumis melo OX=3656 GN=LOC103493256 PE=3 SV=1)

HSP 1 Score: 459.9 bits (1182), Expect = 6.3e-126
Identity = 240/412 (58.25%), Positives = 294/412 (71.36%), Query Frame = 0

Query: 27  GFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPD 86
           GFTTSLFHRD+LLSPL   +LS YDR++NAFRRS SRSA L     T     +QSPI P 
Sbjct: 29  GFTTSLFHRDSLLSPLEFSTLSHYDRLSNAFRRSLSRSAALLNRTATSGAVGLQSPIAPG 88

Query: 87  SGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCT 146
           SGE+LM VSIGTPPVD+I + DTGSDLTW QCLPC++CF Q R IFNP +S+S+ +V C 
Sbjct: 89  SGEYLMYVSIGTPPVDYIGMIDTGSDLTWAQCLPCRKCFLQLRPIFNPLKSTSFSHVPCN 148

Query: 147 SDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGCGHQNGG 206
           S  C+++D  HCG     C Y Y+YGD+          KITIGS  + K++IGCGH++GG
Sbjct: 149 SQICQAIDDAHCGVQ-GVCDYSYTYGDQXXXXXXXXXXKITIGSSSV-KSVIGCGHESGG 208

Query: 207 TFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSG 266
            F G  SG+IGLGGG LSLVSQM+  + I R+FSYCLP    +AN  GKI+F QNAVVSG
Sbjct: 209 GF-GFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPPLLGHAN--GKINFAQNAVVSG 268

Query: 267 PKVISTPLVAKSPDTFYFLTLEAISVANKRFEATNDMSAMTKRGNIIIDSGTTLTFLPRN 326
           P V+STPL++K P T+Y++TLEAIS+ N+R  A+       K+GN+IIDSGTTLT LP+ 
Sbjct: 269 PGVVSTPLISKDPVTYYYITLEAISIGNERHMAS------AKQGNVIIDSGTTLTVLPKE 328

Query: 327 LYAGVVSTLASVIKAKRVDDPAGILELCYTAG--SVEDLNIPVIAAHFGGGADVKLLPLN 386
           LY GVVS+L  V+KAKRV DP    +LC+  G        IP+I AHF GGA+V LLP+N
Sbjct: 329 LYDGVVSSLLKVVKAKRVKDPGSFWDLCFDDGINVAASSGIPIITAHFSGGANVNLLPVN 388

Query: 387 TFALVAENVSCLTL---APASDLAIFGNLAQINFVVGYDLEQKRLSFKPTVC 434
           TF  VA NV+CLTL   +P  +  I GNLAQ NF++GYDLE KRLSFKPTVC
Sbjct: 389 TFQKVANNVNCLTLTAASPTDEFGIIGNLAQANFLIGYDLEAKRLSFKPTVC 429

BLAST of Cla97C07G138030 vs. Swiss-Prot
Match: sp|Q6XBF8|CDR1_ARATH (Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1)

HSP 1 Score: 356.7 bits (914), Expect = 3.7e-97
Identity = 197/415 (47.47%), Positives = 266/415 (64.10%), Query Frame = 0

Query: 27  GFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPD 86
           GFT  L HRD+  SP +NP  +   R+ NA  RS +R   +F      +T   Q  +  +
Sbjct: 30  GFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNR---VFHFTEKDNTPQPQIDLTSN 89

Query: 87  SGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCT 146
           SGE+LM+VSIGTPP   +AIADTGSDL WTQC PC +C+ Q   +F+P+ SS+Y++VSC+
Sbjct: 90  SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCS 149

Query: 147 SDTCRSLDSY-HCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGS-----FKLSKTLIGC 206
           S  C +L++   C  +  TCSY  SYGD S+T G++A D +T+GS      +L   +IGC
Sbjct: 150 SSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGC 209

Query: 207 GHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQ 266
           GH N GTF    SGI+GLGGG +SL+ Q+    +I  +FSYCL    S  + T KI+FG 
Sbjct: 210 GHNNAGTFNKKGSGIVGLGGGPVSLIKQLGD--SIDGKFSYCLVPLTSKKDQTSKINFGT 269

Query: 267 NAVVSGPKVISTPLVAK-SPDTFYFLTLEAISVANKRFEATNDMSAMTKRGNIIIDSGTT 326
           NA+VSG  V+STPL+AK S +TFY+LTL++ISV +K+ + +   S  +  GNIIIDSGTT
Sbjct: 270 NAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSE-SSEGNIIIDSGTT 329

Query: 327 LTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVK 386
           LT LP   Y+ +   +AS I A++  DP   L LCY+A    DL +PVI  HF  GADVK
Sbjct: 330 LTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSA--TGDLKVPVITMHF-DGADVK 389

Query: 387 LLPLNTFALVAENVSCLTLAPASDLAIFGNLAQINFVVGYDLEQKRLSFKPTVCA 435
           L   N F  V+E++ C     +   +I+GN+AQ+NF+VGYD   K +SFKPT CA
Sbjct: 390 LDSSNAFVQVSEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435

BLAST of Cla97C07G138030 vs. Swiss-Prot
Match: sp|Q3EBM5|ASPR1_ARATH (Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g35615 PE=3 SV=1)

HSP 1 Score: 340.5 bits (872), Expect = 2.7e-92
Identity = 195/426 (45.77%), Positives = 272/426 (63.85%), Query Frame = 0

Query: 28  FTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDS 87
           F+  L HRD+ LSP++NP ++  DR+  AF RS SRS   F H   +S   +QS +I   
Sbjct: 26  FSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRSRR-FNH--QLSQTDLQSGLIGAD 85

Query: 88  GEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTS 147
           GEF MS++IGTPP+   AIADTGSDLTW QC PCQ+C+ ++  IF+ ++SS+Y++  C S
Sbjct: 86  GEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDS 145

Query: 148 DTCRSLDSYH--CGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSK-----TLIGC 207
             C++L S    C      C Y YSYGD+SF+ GD+A++ ++I S   S      T+ GC
Sbjct: 146 RNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSASGSPVSFPGTVFGC 205

Query: 208 GHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQ 267
           G+ NGGTF    SGIIGLGGG LSL+SQ+   ++I ++FSYCL    +  N T  I+ G 
Sbjct: 206 GYNNGGTFDETGSGIIGLGGGHLSLISQLG--SSISKKFSYCLSHKSATTNGTSVINLGT 265

Query: 268 NAVVSG----PKVISTPLVAKSPDTFYFLTLEAISVANKRFEAT------NDMSAMTK-R 327
           N++ S       V+STPLV K P T+Y+LTLEAISV  K+   T      ND   +++  
Sbjct: 266 NSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPYTGSSYNPNDDGILSETS 325

Query: 328 GNIIIDSGTTLTFLPRNLYAGVVSTL-ASVIKAKRVDDPAGILELCYTAGSVEDLNIPVI 387
           GNIIIDSGTTLT L    +    S +  SV  AKRV DP G+L  C+ +GS E + +P I
Sbjct: 326 GNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGLLSHCFKSGSAE-IGLPEI 385

Query: 388 AAHFGGGADVKLLPLNTFALVAENVSCLTLAPASDLAIFGNLAQINFVVGYDLEQKRLSF 435
             HF  GADV+L P+N F  ++E++ CL++ P +++AI+GN AQ++F+VGYDLE + +SF
Sbjct: 386 TVHF-TGADVRLSPINAFVKLSEDMVCLSMVPTTEVAIYGNFAQMDFLVGYDLETRTVSF 444

BLAST of Cla97C07G138030 vs. Swiss-Prot
Match: sp|Q766C3|NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 221.1 bits (562), Expect = 2.4e-56
Identity = 146/413 (35.35%), Positives = 209/413 (50.61%), Query Frame = 0

Query: 27  GFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPD 86
           GF   L H D+        +L+++  +  A  R   R   L   +   S   +++ +   
Sbjct: 40  GFQIMLEHVDS------GKNLTKFQLLERAIERGSRRLQRLEAMLNGPSG--VETSVYAG 99

Query: 87  SGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCT 146
            GE+LM++SIGTP   F AI DTGSDL WTQC PC +CFNQS  IFNP+ SSS+  + C+
Sbjct: 100 DGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCS 159

Query: 147 SDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGCGHQNGG 206
           S  C++L S  C  +   C Y Y YGD S T G + ++ +T GS  +     GCG  N G
Sbjct: 160 SQLCQALSSPTCSNNF--CQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENNQG 219

Query: 207 TFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSG 266
              G  +G++G+G G LSL SQ++       +FSYC+    S+      +    N+V +G
Sbjct: 220 FGQGNGAGLVGMGRGPLSLPSQLDV-----TKFSYCMTPIGSSTPSNLLLGSLANSVTAG 279

Query: 267 PKVISTPLVAKSP-DTFYFLTLEAISVANKRFEATNDMSAMTKR---GNIIIDSGTTLTF 326
               +T L+  S   TFY++TL  +SV + R        A+      G IIIDSGTTLT+
Sbjct: 280 SP--NTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTY 339

Query: 327 LPRNLYAGVVSTLASVIKAKRVDDPAGILELCY-TAGSVEDLNIPVIAAHFGGGADVKLL 386
              N Y  V     S I    V+  +   +LC+ T     +L IP    HF GG D++L 
Sbjct: 340 FVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGG-DLELP 399

Query: 387 PLNTFALVAENVSCLTLAPASD-LAIFGNLAQINFVVGYDLEQKRLSFKPTVC 434
             N F   +  + CL +  +S  ++IFGN+ Q N +V YD     +SF    C
Sbjct: 400 SENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of Cla97C07G138030 vs. Swiss-Prot
Match: sp|Q766C2|NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 219.9 bits (559), Expect = 5.4e-56
Identity = 143/394 (36.29%), Positives = 208/394 (52.79%), Query Frame = 0

Query: 46  SLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPPVDFIA 105
           +L++Y+ I  A +R   R  ++  +    S++ I++P+    GE+LM+V+IGTP   F A
Sbjct: 54  NLTKYELIKRAIKRGERRMRSI--NAMLQSSSGIETPVYAGDGEYLMNVAIGTPDSSFSA 113

Query: 106 IADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTC 165
           I DTGSDL WTQC PC +CF+Q   IFNP+ SSS+  + C S  C+ L S  C  +   C
Sbjct: 114 IMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSETCNNN--EC 173

Query: 166 SYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGCGHQNGGTFGGVTSGIIGLGGGALSL 225
            Y Y YGD S T G +A++  T  +  +     GCG  N G   G  +G+IG+G G LSL
Sbjct: 174 QYTYGYGDGSTTQGYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSL 233

Query: 226 VSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPD-TFYF 285
            SQ+        QFSYC+ ++ S++  T  +    + V  G    ST L+  S + T+Y+
Sbjct: 234 PSQLGV-----GQFSYCMTSYGSSSPSTLALGSAASGVPEGSP--STTLIHSSLNPTYYY 293

Query: 286 LTLEAISVANKRFEATNDMSAMTK--RGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAK 345
           +TL+ I+V        +    +     G +IIDSGTTLT+LP++ Y  V       I   
Sbjct: 294 ITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLP 353

Query: 346 RVDDPAGILELCYTAGS-VEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAPA 405
            VD+ +  L  C+   S    + +P I+  F GG  + L   N     AE V CL +  +
Sbjct: 354 TVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGV-LNLGEQNILISPAEGVICLAMGSS 413

Query: 406 SDL--AIFGNLAQINFVVGYDLEQKRLSFKPTVC 434
           S L  +IFGN+ Q    V YDL+   +SF PT C
Sbjct: 414 SQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of Cla97C07G138030 vs. Swiss-Prot
Match: sp|Q8S9J6|ASPA_ARATH (Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana OX=3702 GN=At5g10770 PE=2 SV=1)

HSP 1 Score: 200.7 bits (509), Expect = 3.4e-50
Identity = 137/357 (38.38%), Positives = 186/357 (52.10%), Query Frame = 0

Query: 87  SGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPC-QECFNQSRRIFNPRRSSSYRNVSC 146
           SG ++++V +GTP  D   I DTGSDLTWTQC PC + C++Q   IFNP +S+SY NVSC
Sbjct: 129 SGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSC 188

Query: 147 TSDTCRSLDSY--HCGP-DLQTCSYGYSYGDRSFTYGDLASDKITI-GSFKLSKTLIGCG 206
           +S  C SL S   + G      C YG  YGD+SF+ G LA +K T+  S        GCG
Sbjct: 189 SSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCG 248

Query: 207 HQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQN 266
             N G F GV +G++GLG   LS  SQ  T  A  + FSYCLP   S+A+ TG ++FG  
Sbjct: 249 ENNQGLFTGV-AGLLGLGRDKLSFPSQ--TATAYNKIFSYCLP---SSASYTGHLTFGSA 308

Query: 267 AVVSGPKVISTPLVAKSPDT-FYFLTLEAISVANKRFEATNDMSAMTKRGNIIIDSGTTL 326
            +    K   TP+   +  T FY L + AI+V  ++       S +      +IDSGT +
Sbjct: 309 GISRSVKF--TPISTITDGTSFYGLNIVAITVGGQKLPIP---STVFSTPGALIDSGTVI 368

Query: 327 TFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKL 386
           T LP   YA + S+  + +          IL+ C+     + + IP +A  F GGA V+L
Sbjct: 369 TRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVEL 428

Query: 387 LPLNTFALVAENVSCLTLAPASD---LAIFGNLAQINFVVGYDLEQKRLSFKPTVCA 435
                F +   +  CL  A  SD    AIFGN+ Q    V YD    R+ F P  C+
Sbjct: 429 GSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 474
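The percentages in each HSP summary line follow from the counts: identical (or positive-scoring) positions divided by the alignment length. The following is a minimal Python sketch, assuming the summary line has the layout shown in the HSP headers on this page. The function name `parse_hsp_summary` and the returned dictionary keys are hypothetical, chosen for illustration only.

```python
import re

# Pattern for the "Identity = a/b (x%), Positives = c/d (y%)" summary line.
# "Posi?tives" also accepts the misspelling "Postives" seen in some exports.
HSP_SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return counts and recomputed percentages from one HSP summary line."""
    match = HSP_SUMMARY.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, length, _, positives, _, _ = match.groups()
    identical, length, positives = int(identical), int(length), int(positives)
    return {
        "identical": identical,
        "positives": positives,
        "length": length,
        # Recompute the percentages instead of trusting the printed values.
        "identity_pct": round(100.0 * identical / length, 2),
        "positives_pct": round(100.0 * positives / length, 2),
    }


# Summary line of the Swiss-Prot hit sp|Q8S9J6|ASPA_ARATH.
line = "Identity = 137/357 (38.38%), Positives = 186/357 (52.10%), Query Frame = 0"
hit = parse_hsp_summary(line)
print(hit["identity_pct"], hit["positives_pct"])
```

Recomputing the percentages from the counts is a cheap consistency check when these lines are copied between tools.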

BLAST of Cla97C07G138030 vs. TAIR10
Match: AT1G64830.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 386.0 bits (990), Expect = 3.2e-107
Identity = 213/413 (51.57%), Positives = 269/413 (65.13%), Query Frame = 0

Query: 27  GFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPD 86
           GFT  L HRD+  SP +N + +   R+ NA RRS +RS   F +    S    QS I  +
Sbjct: 25  GFTIDLIHRDSPKSPFYNSAETSSQRMRNAIRRS-ARSTLQFSN-DDASPNSPQSFITSN 84

Query: 87  SGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCT 146
            GE+LM++SIGTPPV  +AIADTGSDL WTQC PC++C+ Q+  +F+P+ SS+YR VSC+
Sbjct: 85  RGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSCS 144

Query: 147 SDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGS-----FKLSKTLIGCG 206
           S  CR+L+   C  D  TCSY  +YGD S+T GD+A D +T+GS       L   +IGCG
Sbjct: 145 SSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMIIGCG 204

Query: 207 HQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQN 266
           H+N GTF    SGIIGLGGG+ SLVSQ+    +I  +FSYCL  F S   +T KI+FG N
Sbjct: 205 HENTGTFDPAGSGIIGLGGGSTSLVSQLR--KSINGKFSYCLVPFTSETGLTSKINFGTN 264

Query: 267 AVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEATNDMSAMTKRGNIIIDSGTTLT 326
            +VSG  V+ST +V K P T+YFL LEAISV +K+ + T+ +   T  GNI+IDSGTTLT
Sbjct: 265 GIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFG-TGEGNIVIDSGTTLT 324

Query: 327 FLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLL 386
            LP N Y  + S +AS IKA+RV DP GIL LCY   S     +P I  HF GG DVKL 
Sbjct: 325 LLPSNFYYELESVVASTIKAERVQDPDGILSLCYRDSS--SFKVPDITVHFKGG-DVKLG 384

Query: 387 PLNTFALVAENVSCLTLAPASDLAIFGNLAQINFVVGYDLEQKRLSFKPTVCA 435
            LNTF  V+E+VSC   A    L IFGNLAQ+NF+VGYD     +SFK T C+
Sbjct: 385 NLNTFVAVSEDVSCFAFAANEQLTIFGNLAQMNFLVGYDTVSGTVSFKKTDCS 429

BLAST of Cla97C07G138030 vs. TAIR10
Match: AT5G33340.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 356.7 bits (914), Expect = 2.0e-98
Identity = 197/415 (47.47%), Positives = 266/415 (64.10%), Query Frame = 0

Query: 27  GFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPD 86
           GFT  L HRD+  SP +NP  +   R+ NA  RS +R   +F      +T   Q  +  +
Sbjct: 30  GFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNR---VFHFTEKDNTPQPQIDLTSN 89

Query: 87  SGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCT 146
           SGE+LM+VSIGTPP   +AIADTGSDL WTQC PC +C+ Q   +F+P+ SS+Y++VSC+
Sbjct: 90  SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCS 149

Query: 147 SDTCRSLDSY-HCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGS-----FKLSKTLIGC 206
           S  C +L++   C  +  TCSY  SYGD S+T G++A D +T+GS      +L   +IGC
Sbjct: 150 SSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGC 209

Query: 207 GHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQ 266
           GH N GTF    SGI+GLGGG +SL+ Q+    +I  +FSYCL    S  + T KI+FG 
Sbjct: 210 GHNNAGTFNKKGSGIVGLGGGPVSLIKQLGD--SIDGKFSYCLVPLTSKKDQTSKINFGT 269

Query: 267 NAVVSGPKVISTPLVAK-SPDTFYFLTLEAISVANKRFEATNDMSAMTKRGNIIIDSGTT 326
           NA+VSG  V+STPL+AK S +TFY+LTL++ISV +K+ + +   S  +  GNIIIDSGTT
Sbjct: 270 NAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSE-SSEGNIIIDSGTT 329

Query: 327 LTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVK 386
           LT LP   Y+ +   +AS I A++  DP   L LCY+A    DL +PVI  HF  GADVK
Sbjct: 330 LTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSA--TGDLKVPVITMHF-DGADVK 389

Query: 387 LLPLNTFALVAENVSCLTLAPASDLAIFGNLAQINFVVGYDLEQKRLSFKPTVCA 435
           L   N F  V+E++ C     +   +I+GN+AQ+NF+VGYD   K +SFKPT CA
Sbjct: 390 LDSSNAFVQVSEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435

BLAST of Cla97C07G138030 vs. TAIR10
Match: AT2G35615.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 340.5 bits (872), Expect = 1.5e-93
Identity = 195/426 (45.77%), Positives = 272/426 (63.85%), Query Frame = 0

Query: 28  FTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDS 87
           F+  L HRD+ LSP++NP ++  DR+  AF RS SRS   F H   +S   +QS +I   
Sbjct: 26  FSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRSRR-FNH--QLSQTDLQSGLIGAD 85

Query: 88  GEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTS 147
           GEF MS++IGTPP+   AIADTGSDLTW QC PCQ+C+ ++  IF+ ++SS+Y++  C S
Sbjct: 86  GEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDS 145

Query: 148 DTCRSLDSYH--CGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSK-----TLIGC 207
             C++L S    C      C Y YSYGD+SF+ GD+A++ ++I S   S      T+ GC
Sbjct: 146 RNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSASGSPVSFPGTVFGC 205

Query: 208 GHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQ 267
           G+ NGGTF    SGIIGLGGG LSL+SQ+   ++I ++FSYCL    +  N T  I+ G 
Sbjct: 206 GYNNGGTFDETGSGIIGLGGGHLSLISQLG--SSISKKFSYCLSHKSATTNGTSVINLGT 265

Query: 268 NAVVSG----PKVISTPLVAKSPDTFYFLTLEAISVANKRFEAT------NDMSAMTK-R 327
           N++ S       V+STPLV K P T+Y+LTLEAISV  K+   T      ND   +++  
Sbjct: 266 NSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPYTGSSYNPNDDGILSETS 325

Query: 328 GNIIIDSGTTLTFLPRNLYAGVVSTL-ASVIKAKRVDDPAGILELCYTAGSVEDLNIPVI 387
           GNIIIDSGTTLT L    +    S +  SV  AKRV DP G+L  C+ +GS E + +P I
Sbjct: 326 GNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGLLSHCFKSGSAE-IGLPEI 385

Query: 388 AAHFGGGADVKLLPLNTFALVAENVSCLTLAPASDLAIFGNLAQINFVVGYDLEQKRLSF 435
             HF  GADV+L P+N F  ++E++ CL++ P +++AI+GN AQ++F+VGYDLE + +SF
Sbjct: 386 TVHF-TGADVRLSPINAFVKLSEDMVCLSMVPTTEVAIYGNFAQMDFLVGYDLETRTVSF 444

BLAST of Cla97C07G138030 vs. TAIR10
Match: AT1G31450.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 336.3 bits (861), Expect = 2.9e-92
Identity = 190/423 (44.92%), Positives = 264/423 (62.41%), Query Frame = 0

Query: 29  TTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSG 88
           T  L HRD+  SPL+NP  +  DR+  AF RS SRS    +  TT +   +QS +I + G
Sbjct: 30  TVELIHRDSPHSPLYNPHHTVSDRLNAAFLRSISRS----RRFTTKTD--LQSGLISNGG 89

Query: 89  EFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSD 148
           E+ MS+SIGTPP    AIADTGSDLTW QC PCQ+C+ Q+  +F+ ++SS+Y+  SC S 
Sbjct: 90  EYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPLFDKKKSSTYKTESCDSK 149

Query: 149 TCRSLDSYH--CGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSK-----TLIGCG 208
           TC++L  +   C      C Y YSYGD SFT GD+A++ I+I S   S      T+ GCG
Sbjct: 150 TCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISIDSSSGSSVSFPGTVFGCG 209

Query: 209 HQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQN 268
           + NGGTF    SGIIGLGGG LSLVSQ+   ++I ++FSYCL    +  N T  I+ G N
Sbjct: 210 YNNGGTFEETGSGIIGLGGGPLSLVSQLG--SSIGKKFSYCLSHTAATTNGTSVINLGTN 269

Query: 269 AVVSGPK----VISTPLVAKSPDTFYFLTLEAISVANKRFEATN-----DMSAMTKRGNI 328
           ++ S P      ++TPL+ K P+T+YFLTLEA++V   +   T      +  +  + GNI
Sbjct: 270 SIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVGKTKLPYTGGGYGLNGKSSKRTGNI 329

Query: 329 IIDSGTTLTFLPRNLYAGVVSTL-ASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAH 388
           IIDSGTTLT L    Y    + +  SV  AKRV DP G+L  C+ +G  +++ +P I  H
Sbjct: 330 IIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQGLLTHCFKSGD-KEIGLPAITMH 389

Query: 389 FGGGADVKLLPLNTFALVAENVSCLTLAPASDLAIFGNLAQINFVVGYDLEQKRLSFKPT 435
           F   ADVKL P+N F  + E+  CL++ P +++AI+GN+ Q++F+VGYDLE K +SF+  
Sbjct: 390 F-TNADVKLSPINAFVKLNEDTVCLSMIPTTEVAIYGNMVQMDFLVGYDLETKTVSFQRM 442

BLAST of Cla97C07G138030 vs. TAIR10
Match: AT2G28010.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 220.3 bits (560), Expect = 2.3e-57
Identity = 152/409 (37.16%), Positives = 215/409 (52.57%), Query Frame = 0

Query: 37  TLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSI 96
           T  SP H  ++    R +NA  R  +  +    +  TV           D+  +LM + +
Sbjct: 22  TTASPPHGFTMDLIHRRSNASSRVSNTQSGSSPYANTVF----------DNSVYLMKLQV 81

Query: 97  GTPPVDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSY 156
           GTPP +  AI DTGS++TWTQCLPC  C+ Q+  IF+P +SS+++   C           
Sbjct: 82  GTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNAPIFDPSKSSTFKEKRC----------- 141

Query: 157 HCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGS-----FKLSKTLIGCGHQNGGTFGGV 216
               D  +C Y   Y D ++T G LA++ IT+ S     F + +T+IGCGH N   F   
Sbjct: 142 ----DGHSCPYEVDYFDHTYTMGTLATETITLHSTSGEPFVMPETIIGCGH-NNSWFKPS 201

Query: 217 TSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVIS 276
            SG++GL  G  SL++QM          SYC    FS    T KI+FG NA+V+G  V+S
Sbjct: 202 FSGMVGLNWGPSSLITQMG--GEYPGLMSYC----FSGQG-TSKINFGANAIVAGDGVVS 261

Query: 277 TPL-VAKSPDTFYFLTLEAISVANKRFEATNDMSAMTKRGNIIIDSGTTLTFLPRNLYAG 336
           T + +  +   FY+L L+A+SV N R E T   +     GNI+IDSGTTLT+ P +    
Sbjct: 262 TTMFMTTAKPGFYYLNLDAVSVGNTRIE-TMGTTFHALEGNIVIDSGTTLTYFPVSYCNL 321

Query: 337 VVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVA 396
           V   +  V+ A R  DP G   LCY + +++    PVI  HF GG D+ L   N + + +
Sbjct: 322 VRQAVEHVVTAVRAADPTGNDMLCYNSDTID--IFPVITMHFSGGVDLVLDKYNMY-MES 381

Query: 397 EN--VSCLTL---APASDLAIFGNLAQINFVVGYDLEQKRLSFKPTVCA 435
            N  V CL +   +P  + AIFGN AQ NF+VGYD     +SF PT C+
Sbjct: 382 NNGGVFCLAIICNSPTQE-AIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 392

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_008452150.1	2.4e-190	82.47	PREDICTED: probable aspartic protease At2g35615 [Cucumis melo]
XP_004149004.1	3.2e-190	82.49	PREDICTED: probable aspartic protease At2g35615 [Cucumis sativus] >KGN53446.1 hy...
XP_023552860.1	9.0e-177	74.36	aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo]
XP_023543528.1	2.2e-175	74.75	aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo]
XP_022931430.1	1.1e-174	73.90	aspartic proteinase CDR1-like [Cucurbita moschata]

Match Name	E-value	Identity	Description
tr|A0A1S3BT75|A0A1S3BT75_CUCME	1.6e-190	82.47	probable aspartic protease At2g35615 OS=Cucumis melo OX=3656 GN=LOC103493255 PE=...
tr|A0A0A0KZZ3|A0A0A0KZZ3_CUCSA	2.1e-190	82.49	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G055410 PE=3 SV=1
tr|A0A1S3BUB0|A0A1S3BUB0_CUCME	4.8e-134	60.92	probable aspartic protease At2g35615 OS=Cucumis melo OX=3656 GN=LOC103493257 PE=...
tr|A0A0A0KV20|A0A0A0KV20_CUCSA	1.2e-132	60.53	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G055400 PE=3 SV=1
tr|A0A1S3BTZ4|A0A1S3BTZ4_CUCME	6.3e-126	58.25	probable aspartic protease At2g35615 OS=Cucumis melo OX=3656 GN=LOC103493256 PE=...

Match Name	E-value	Identity	Description
sp|Q6XBF8|CDR1_ARATH	3.7e-97	47.47	Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1
sp|Q3EBM5|ASPR1_ARATH	2.7e-92	45.77	Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g3561...
sp|Q766C3|NEP1_NEPGR	2.4e-56	35.35	Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S...
sp|Q766C2|NEP2_NEPGR	5.4e-56	36.29	Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S...
sp|Q8S9J6|ASPA_ARATH	3.4e-50	38.38	Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana OX=3702 GN=At...

Match Name	E-value	Identity	Description
AT1G64830.1	3.2e-107	51.57	Eukaryotic aspartyl protease family protein
AT5G33340.1	2.0e-98	47.47	Eukaryotic aspartyl protease family protein
AT2G35615.1	1.5e-93	45.77	Eukaryotic aspartyl protease family protein
AT1G31450.1	2.9e-92	44.92	Eukaryotic aspartyl protease family protein
AT2G28010.1	2.3e-57	37.16	Eukaryotic aspartyl protease family protein

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term	Definition
GO:0004190	aspartic-type endopeptidase activity
Vocabulary: Biological Process
Term	Definition
GO:0006508	proteolysis
Vocabulary: INTERPRO
Term	Definition
IPR034161	Pepsin-like_plant
IPR033121	PEPTIDASE_A1
IPR001461	Aspartic_peptidase_A1
IPR032799	TAXi_C
IPR021109	Peptidase_aspartic_dom_sf
IPR032861	TAXi_N
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006508	proteolysis
biological_process	GO:0030163	protein catabolic process
cellular_component	GO:0005575	cellular_component
molecular_function	GO:0004190	aspartic-type endopeptidase activity
molecular_function	GO:0016787	hydrolase activity
molecular_function	GO:0008233	peptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cla97C07G138030.1	Cla97C07G138030.1	mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR032861	Xylanase inhibitor, N-terminal	PFAM	PF14543	TAXi_N	coord: 90..260; e-value: 4.4E-51; score: 173.6
IPR021109	Aspartic peptidase domain superfamily	GENE3D	G3DSA:2.40.70.10	-	coord: 264..434; e-value: 2.2E-41; score: 143.5
IPR021109	Aspartic peptidase domain superfamily	GENE3D	G3DSA:2.40.70.10	-	coord: 68..260; e-value: 1.8E-46; score: 160.7
IPR021109	Aspartic peptidase domain superfamily	SUPERFAMILY	SSF50630	Acid proteases	coord: 84..433
IPR032799	Xylanase inhibitor, C-terminal	PFAM	PF14541	TAXi_C	coord: 283..429; e-value: 9.4E-25; score: 87.2
None	No IPR available	PANTHER	PTHR13683:SF524	ASPARTIC PROTEINASE CDR1-RELATED	coord: 7..433
IPR001461	Aspartic peptidase A1 family	PANTHER	PTHR13683	ASPARTYL PROTEASES	coord: 7..433
IPR033121	Peptidase family A1 domain	PROSITE	PS51767	PEPTIDASE_A1	coord: 90..429; score: 43.169
IPR034161	Pepsin-like domain, plant	CDD	cd05476	pepsin_A_like_plant	coord: 89..433; e-value: 4.85272E-70; score: 224.064
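
The `coord: start..end` values above give the span of each domain hit on the protein. The following is a minimal Python sketch for turning such a string into a span length. It assumes the coordinates are 1-based and inclusive, which the page does not state; the helper name `coord_span` is hypothetical.

```python
import re

# Matches the "coord: start..end" notation used in the Alignment column.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def coord_span(text):
    """Return (start, end, length) for a 'coord: start..end' string.

    Assumption: coordinates are 1-based and inclusive, so the length
    is end - start + 1.
    """
    match = COORD.search(text)
    if match is None:
        raise ValueError("no coordinates found in %r" % text)
    start, end = int(match.group(1)), int(match.group(2))
    if end < start:
        raise ValueError("end precedes start in %r" % text)
    return start, end, end - start + 1


# The two Pfam hits reported for this gene.
taxi_n = coord_span("coord: 90..260")   # PF14543, TAXi_N
taxi_c = coord_span("coord: 283..429")  # PF14541, TAXi_C
print(taxi_n, taxi_c)
```

Under that assumption, the TAXi_N hit covers 171 residues and the TAXi_C hit covers 147.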