CSPI03G02720 (gene) Cucumber (PI 183967) v1

Overview
NameCSPI03G02720
Typegene
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionEukaryotic aspartyl protease family protein
LocationChr3: 2104705 .. 2106775 (+)
RNA-Seq ExpressionCSPI03G02720
SyntenyCSPI03G02720
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAATATGAACCAAACTTTTAAATTTTTTTAAGGAAAACTAAGCTTATAATTTAAGAAGATAAAGAAAGAAGAAAGAAGAAAGAAGAAATAAGAAAGAGGGTGTAGTTGTAATGGTCACCACTTTTGAAACCCATACAAAAACCCCATCTTTCTCGTCTTCTAACCAGAACCTTCTTCTTCTTCTTCTATTCTCAATCCCCATTTCTTTAAATTCCCCAGTTTCTATCTCTAATTTCATTTCAAAAACCCCAATTTCTCTATCTAAAGTACTCCATTAATGGCGGTTTCCCCTGTTTTCATCTTCCTCCTTTGTTTTCTCCTCTCCTCCCCTGTTTTCTCCTCACAAATTTTCCTTCTACCTCTCTCCCATTCCTTATCATCCTCAATCTCCGATTTCAACAACACCCACAATCTTCTCAAATCCACTGCCACCCGCTCCTCCGCCCGATTCCACCGCCACCGCCATAACCACCTCTCTCTGCCCCTCTCCCCTGGCGGCGATTACACTCTCTCCTTCAACCTCGGCTCTGAGTCTCACAAAATTTCCCTCTATATGGACACTGGCAGCGACCTCGTTTGGTTCCCTTGTTCCCCGTTTGAGTGTATTCTTTGTGAAGGTAAACCAAAAATTCAATCCCCTTTGCCCAAAATCGCAAATAACAAATCAGTTTCCTGCAGCGCCGCCGCCTGCTCCGCCGCTCACGGTGGCTCCCTCTCCGCTTCCCACCTCTGTGCAATTTCTCGATGTCCACTTGAATCCATTGAAATTTCTGAGTGCTCCTCTTTTTCCTGTCCTCCGTTTTATTATGCTTATGGCGATGGGAGTTTGGTTGCTCGGCTTTATAGAGATAGCCTCAGCTTGCCAACGCCGGCGCCATCTCCGCCTATTAATGTTCGGAATTTTACTTTTGGATGTGCCCACACGACGCTTGGCGAGCCGGTTGGGGTTGCCGGATTCGGCCGGGGGGTGTTGTCAATGCCCAGTCAACTCGCTACTTTCTCCCCTCAACTCGGGAACCGGTTTTCTTATTGTTTGGTTTCTCACTCGTTTGCGGCAGACCGAGTTCGCCGCCCGAGTCCACTGATTCTCGGGCGGTACTACACCGGGGAGACGGAGTTCATTTACACTTCCTTGCTTGAGAATCCAAAACATCCTTACTTTTACTCCGTTGGGTTGGCCGGAATATCAGTTGGGAATGTAAGGATTCCAGCGCCGGAGTTTCTGACAAAAGTGGATGAGGGTGGGAGTGGCGGCGTTGTGGTGGATTCCGGCACTACTTTTACTATGCTCCCGGCAGGTTTGTATGAATCGGTGGTGGCCGAGTTCGAAAACCGTACCGGAAAGGTTGCAAACCGGGCGAGACGGATTGAAGAAAATACCGGGTTGAGCCCTTGCTATTACTACGAGAACTCAGTTGGGGTGCCACGTGTCGTGCTACATTTTGTTGGGGAAAAATCCAATGTGGTGCTTCCTAGGAAGAACTATTTCTACGAGTTTTTGGACGGCGGAGATGGGGTGGTGGGGAGGAAGAGAAAAGTTGGATGTTTGATGTTGATGAACGGTGGAGATGAGGCTGAGCTGGCAGGTGGGCCTGGTGCGACGCTTGGGAACTACCAACAACAAGGTTTTGAGGTGGTTTATGATTTGGAAAAGAACCGGGTTGGATTCGCTCGGCGGCAATGCTCCACTCTTTGGGACAATTTGAACCGGAGTAAGTGAAAGTGTGAACCCGGTTGAGGAAGTGAGAGTTGATCTTTGACATTGTGAGTATTGTCAACGGTGAAGGGTATGAGGGTAAATAAGGTAATTTTAGGTTTCAAGGGTTTATTTATTTATTTTATTCTATTTTTTGTTGTAAATTCCTTGGGCACTTCACTTCTTGCTTTAATCTTATTTTTACTTGAAATTTGTATATTAAAAGTGTTCAAAAAAAAAATGTATAGCTCAAATTTTATTGCATGAAGCAAGGAATGAAGTGGTTGCTTTTATCAAGTTAAGAATAAAATAGGGGCAATAAATTAAGGATGTTAAAAAACAAATCATAAATCAAATTTACTTGTAAAACTTTT

mRNA sequence

AAAATATGAACCAAACTTTTAAATTTTTTTAAGGAAAACTAAGCTTATAATTTAAGAAGATAAAGAAAGAAGAAAGAAGAAAGAAGAAATAAGAAAGAGGGTGTAGTTGTAATGGTCACCACTTTTGAAACCCATACAAAAACCCCATCTTTCTCGTCTTCTAACCAGAACCTTCTTCTTCTTCTTCTATTCTCAATCCCCATTTCTTTAAATTCCCCAGTTTCTATCTCTAATTTCATTTCAAAAACCCCAATTTCTCTATCTAAAGTACTCCATTAATGGCGGTTTCCCCTGTTTTCATCTTCCTCCTTTGTTTTCTCCTCTCCTCCCCTGTTTTCTCCTCACAAATTTTCCTTCTACCTCTCTCCCATTCCTTATCATCCTCAATCTCCGATTTCAACAACACCCACAATCTTCTCAAATCCACTGCCACCCGCTCCTCCGCCCGATTCCACCGCCACCGCCATAACCACCTCTCTCTGCCCCTCTCCCCTGGCGGCGATTACACTCTCTCCTTCAACCTCGGCTCTGAGTCTCACAAAATTTCCCTCTATATGGACACTGGCAGCGACCTCGTTTGGTTCCCTTGTTCCCCGTTTGAGTGTATTCTTTGTGAAGGTAAACCAAAAATTCAATCCCCTTTGCCCAAAATCGCAAATAACAAATCAGTTTCCTGCAGCGCCGCCGCCTGCTCCGCCGCTCACGGTGGCTCCCTCTCCGCTTCCCACCTCTGTGCAATTTCTCGATGTCCACTTGAATCCATTGAAATTTCTGAGTGCTCCTCTTTTTCCTGTCCTCCGTTTTATTATGCTTATGGCGATGGGAGTTTGGTTGCTCGGCTTTATAGAGATAGCCTCAGCTTGCCAACGCCGGCGCCATCTCCGCCTATTAATGTTCGGAATTTTACTTTTGGATGTGCCCACACGACGCTTGGCGAGCCGGTTGGGGTTGCCGGATTCGGCCGGGGGGTGTTGTCAATGCCCAGTCAACTCGCTACTTTCTCCCCTCAACTCGGGAACCGGTTTTCTTATTGTTTGGTTTCTCACTCGTTTGCGGCAGACCGAGTTCGCCGCCCGAGTCCACTGATTCTCGGGCGGTACTACACCGGGGAGACGGAGTTCATTTACACTTCCTTGCTTGAGAATCCAAAACATCCTTACTTTTACTCCGTTGGGTTGGCCGGAATATCAGTTGGGAATGTAAGGATTCCAGCGCCGGAGTTTCTGACAAAAGTGGATGAGGGTGGGAGTGGCGGCGTTGTGGTGGATTCCGGCACTACTTTTACTATGCTCCCGGCAGGTTTGTATGAATCGGTGGTGGCCGAGTTCGAAAACCGTACCGGAAAGGTTGCAAACCGGGCGAGACGGATTGAAGAAAATACCGGGTTGAGCCCTTGCTATTACTACGAGAACTCAGTTGGGGTGCCACGTGTCGTGCTACATTTTGTTGGGGAAAAATCCAATGTGGTGCTTCCTAGGAAGAACTATTTCTACGAGTTTTTGGACGGCGGAGATGGGGTGGTGGGGAGGAAGAGAAAAGTTGGATGTTTGATGTTGATGAACGGTGGAGATGAGGCTGAGCTGGCAGGTGGGCCTGGTGCGACGCTTGGGAACTACCAACAACAAGGTTTTGAGGTGGTTTATGATTTGGAAAAGAACCGGGTTGGATTCGCTCGGCGGCAATGCTCCACTCTTTGGGACAATTTGAACCGGAGTAAGTGAAAGTGTGAACCCGGTTGAGGAAGTGAGAGTTGATCTTTGACATTGTGAGTATTGTCAACGGTGAAGGGTATGAGGGTAAATAAGGTAATTTTAGGTTTCAAGGGTTTATTTATTTATTTTATTCTATTTTTTGTTGTAAATTCCTTGGGCACTTCACTTCTTGCTTTAATCTTATTTTTACTTGAAATTTGTATATTAAAAGTGTTCAAAAAAAAAATGTATAGCTCAAATTTTATTGCATGAAGCAAGGAATGAAGTGGTTGCTTTTATCAAGTTAAGAATAAAATAGGGGCAATAAATTAAGGATGTTAAAAAACAAATCATAAATCAAATTTACTTGTAAAACTTTT

Coding sequence (CDS)

ATGGCGGTTTCCCCTGTTTTCATCTTCCTCCTTTGTTTTCTCCTCTCCTCCCCTGTTTTCTCCTCACAAATTTTCCTTCTACCTCTCTCCCATTCCTTATCATCCTCAATCTCCGATTTCAACAACACCCACAATCTTCTCAAATCCACTGCCACCCGCTCCTCCGCCCGATTCCACCGCCACCGCCATAACCACCTCTCTCTGCCCCTCTCCCCTGGCGGCGATTACACTCTCTCCTTCAACCTCGGCTCTGAGTCTCACAAAATTTCCCTCTATATGGACACTGGCAGCGACCTCGTTTGGTTCCCTTGTTCCCCGTTTGAGTGTATTCTTTGTGAAGGTAAACCAAAAATTCAATCCCCTTTGCCCAAAATCGCAAATAACAAATCAGTTTCCTGCAGCGCCGCCGCCTGCTCCGCCGCTCACGGTGGCTCCCTCTCCGCTTCCCACCTCTGTGCAATTTCTCGATGTCCACTTGAATCCATTGAAATTTCTGAGTGCTCCTCTTTTTCCTGTCCTCCGTTTTATTATGCTTATGGCGATGGGAGTTTGGTTGCTCGGCTTTATAGAGATAGCCTCAGCTTGCCAACGCCGGCGCCATCTCCGCCTATTAATGTTCGGAATTTTACTTTTGGATGTGCCCACACGACGCTTGGCGAGCCGGTTGGGGTTGCCGGATTCGGCCGGGGGGTGTTGTCAATGCCCAGTCAACTCGCTACTTTCTCCCCTCAACTCGGGAACCGGTTTTCTTATTGTTTGGTTTCTCACTCGTTTGCGGCAGACCGAGTTCGCCGCCCGAGTCCACTGATTCTCGGGCGGTACTACACCGGGGAGACGGAGTTCATTTACACTTCCTTGCTTGAGAATCCAAAACATCCTTACTTTTACTCCGTTGGGTTGGCCGGAATATCAGTTGGGAATGTAAGGATTCCAGCGCCGGAGTTTCTGACAAAAGTGGATGAGGGTGGGAGTGGCGGCGTTGTGGTGGATTCCGGCACTACTTTTACTATGCTCCCGGCAGGTTTGTATGAATCGGTGGTGGCCGAGTTCGAAAACCGTACCGGAAAGGTTGCAAACCGGGCGAGACGGATTGAAGAAAATACCGGGTTGAGCCCTTGCTATTACTACGAGAACTCAGTTGGGGTGCCACGTGTCGTGCTACATTTTGTTGGGGAAAAATCCAATGTGGTGCTTCCTAGGAAGAACTATTTCTACGAGTTTTTGGACGGCGGAGATGGGGTGGTGGGGAGGAAGAGAAAAGTTGGATGTTTGATGTTGATGAACGGTGGAGATGAGGCTGAGCTGGCAGGTGGGCCTGGTGCGACGCTTGGGAACTACCAACAACAAGGTTTTGAGGTGGTTTATGATTTGGAAAAGAACCGGGTTGGATTCGCTCGGCGGCAATGCTCCACTCTTTGGGACAATTTGAACCGGAGTAAGTGA

Protein sequence

MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHRHRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGLAGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRSK*
Homology
BLAST of CSPI03G02720 vs. ExPASy Swiss-Prot
Match: Q940R4 (Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g16563 PE=2 SV=1)

HSP 1 Score: 581.6 bits (1498), Expect = 8.0e-165
Identity = 298/478 (62.34%), Postives = 356/478 (74.48%), Query Frame = 0

Query: 26  LLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHRHRH----NHLSLPLSPGGDYTLSFN 85
           LL LSHSLS+S    +  H LLKS+++RSSARF RH H      LSLP+S G DY +S +
Sbjct: 30  LLHLSHSLSTSKHSSSPLH-LLKSSSSRSSARFRRHHHKQQQQQLSLPISSGSDYLISLS 89

Query: 86  LGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIANN-KSVSCSAAACSA 145
           +GS S  +SLY+DTGSDLVWFPC PF CILCE KP   SP   ++++  +VSCS+ +CSA
Sbjct: 90  VGSSSSAVSLYLDTGSDLVWFPCRPFTCILCESKPLPPSPPSSLSSSATTVSCSSPSCSA 149

Query: 146 AHGGSLSASHLCAISRCPLESIEISEC--SSFSCPPFYYAYGDGSLVARLYRDSLSLPTP 205
           AH  SL +S LCAIS CPL+ IE  +C  SS+ CPPFYYAYGDGSLVA+LY DSLSLP+ 
Sbjct: 150 AH-SSLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSLVAKLYSDSLSLPS- 209

Query: 206 APSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSF 265
                ++V NFTFGCAHTTL EP+GVAGFGRG LS+P+QLA  SP LGN FSYCLVSHSF
Sbjct: 210 -----VSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSF 269

Query: 266 AADRVRRPSPLILGRYY------TGET--------------EFIYTSLLENPKHPYFYSV 325
            +DRVRRPSPLILGR+        G T              EF++T +LENPKHPYFYSV
Sbjct: 270 DSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPKHPYFYSV 329

Query: 326 GLAGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVA 385
            L GIS+G   IPAP  L ++D+ G GGVVVDSGTTFTMLPA  Y SVV EF++R G+V 
Sbjct: 330 SLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVH 389

Query: 386 NRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRK 445
            RA R+E ++G+SPCYY   +V VP +VLHF G +S+V LPR+NYFYEF+DGGDG    K
Sbjct: 390 ERADRVEPSSGMSPCYYLNQTVKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDG-KEEK 449

Query: 446 RKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNL 477
           RK+GCLMLMNGGDE+EL GG GA LGNYQQQGFEVVYDL   RVGFA+R+C++LWD+L
Sbjct: 450 RKIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCASLWDSL 498

BLAST of CSPI03G02720 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 152.1 bits (383), Expect = 1.6e-35
Identity = 133/409 (32.52%), Postives = 183/409 (44.74%), Query Frame = 0

Query: 70  LSPG-GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIANN 129
           LS G G+Y     +G+ +  + + +DTGSD+VW  C+P  C  C  +       P     
Sbjct: 135 LSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAP--CRRCYSQSD-----PIFDPR 194

Query: 130 KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSL-VAR 189
           KS + +   CS+ H   L ++  C   R          C       +  +YGDGS  V  
Sbjct: 195 KSKTYATIPCSSPHCRRLDSAG-CNTRR--------KTCL------YQVSYGDGSFTVGD 254

Query: 190 LYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVA---GFGRGVLSMPSQLATFSPQ 249
              ++L+           V+    GC H   G  VG A   G G+G LS P Q      +
Sbjct: 255 FSTETLTFRRN------RVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQT---GHR 314

Query: 250 LGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGLAGIS 309
              +FSYCLV  S ++    +PS ++ G          +T LL NPK   FY VGL GIS
Sbjct: 315 FNQKFSYCLVDRSASS----KPSSVVFGNAAVSRIA-RFTPLLSNPKLDTFYYVGLLGIS 374

Query: 310 VGNVRIP-APEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANRARR 369
           VG  R+P     L K+D+ G+GGV++DSGT+ T L    Y ++   F  R G  A   +R
Sbjct: 375 VGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAF--RVG--AKTLKR 434

Query: 370 IEENTGLSPCYYYE--NSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRKV 429
             + +    C+     N V VP VVLHF G  ++V LP  NY       G         +
Sbjct: 435 APDFSLFDTCFDLSNMNEVKVPTVVLHFRG--ADVSLPATNYLIPVDTNGKFCFAFAGTM 485

Query: 430 GCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCS 471
           G L +                +GN QQQGF VVYDL  +RVGFA   C+
Sbjct: 495 GGLSI----------------IGNIQQQGFRVVYDLASSRVGFAPGGCA 485

BLAST of CSPI03G02720 vs. ExPASy Swiss-Prot
Match: Q9LS40 (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 144.8 bits (364), Expect = 2.5e-33
Identity = 124/421 (29.45%), Postives = 173/421 (41.09%), Query Frame = 0

Query: 60  RHRHNHLSLPLSPG-----GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEG 119
           R++   L+ P+  G     G+Y     +G+ + ++ L +DTGSD+ W  C P  C  C  
Sbjct: 141 RYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEP--CADCYQ 200

Query: 120 KPKIQSPLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPP 179
           +          +  KS++CSA  CS                      +E S C S  C  
Sbjct: 201 QSDPVFNPTSSSTYKSLTCSAPQCSL---------------------LETSACRSNKC-L 260

Query: 180 FYYAYGDGSL-VARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLG---EPVGVAGFGRG 239
           +  +YGDGS  V  L  D+++           + N   GC H   G      G+ G G G
Sbjct: 261 YQVSYGDGSFTVGELATDTVTFGNSG-----KINNVALGCGHDNEGLFTGAAGLLGLGGG 320

Query: 240 VLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENP 299
           VLS+ +Q+   S      FSYCLV          + S L       G  +     LL N 
Sbjct: 321 VLSITNQMKATS------FSYCLVDRDSG-----KSSSLDFNSVQLGGGD-ATAPLLRNK 380

Query: 300 KHPYFYSVGLAGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEF 359
           K   FY VGL+G SVG  ++  P+ +  VD  GSGGV++D GT  T L    Y S+   F
Sbjct: 381 KIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAF 440

Query: 360 ENRTGKVANRARRIEENTGLSPCYYYE--NSVGVPRVVLHFVGEKSNVVLPRKNYFYEFL 419
              T    N  +     +    CY +   ++V VP V  HF G KS + LP KNY     
Sbjct: 441 LKLT---VNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKS-LDLPAKNYLIPVD 500

Query: 420 DGGDGVVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQ 470
           D G           C           +       +GN QQQG  + YDL KN +G +  +
Sbjct: 501 DSG---------TFCFAFAPTSSSLSI-------IGNVQQQGTRITYDLSKNVIGLSGNK 500

BLAST of CSPI03G02720 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 144.8 bits (364), Expect = 2.5e-33
Identity = 121/406 (29.80%), Postives = 181/406 (44.58%), Query Frame = 0

Query: 74  GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIANNKSVSC 133
           G+Y ++ ++G+ +   S  MDTGSDL+W  C P  C  C          P      S S 
Sbjct: 93  GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQP--CTQC-----FNQSTPIFNPQGSSSF 152

Query: 134 SAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSL 193
           S   CS         S LC       +++    CS+  C  + Y YGDGS      + S+
Sbjct: 153 STLPCS---------SQLC-------QALSSPTCSNNFC-QYTYGYGDGSET----QGSM 212

Query: 194 SLPTPAPSPPINVRNFTFGCAHTT----LGEPVGVAGFGRGVLSMPSQLATFSPQLGNRF 253
              T      +++ N TFGC         G   G+ G GRG LS+PSQL         +F
Sbjct: 213 GTET-LTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDV------TKF 272

Query: 254 SYCLVSHSFAADRVRRPSPLILGRYYTGETE-FIYTSLLENPKHPYFYSVGLAGISVGNV 313
           SYC+     +      PS L+LG      T     T+L+++ + P FY + L G+SVG+ 
Sbjct: 273 SYCMTPIGSST-----PSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGST 332

Query: 314 RIPA-PEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTG-KVANRARRIEE 373
           R+P  P         G+GG+++DSGTT T      Y+SV  EF ++    V N +     
Sbjct: 333 RLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGS----- 392

Query: 374 NTGLSPCYYY---ENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRKVGC 433
           ++G   C+      +++ +P  V+HF G   ++ LP +NY   F+   +G++       C
Sbjct: 393 SSGFDLCFQTPSDPSNLQIPTFVMHFDG--GDLELPSENY---FISPSNGLI-------C 434

Query: 434 LMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQC 470
           L + +      +        GN QQQ   VVYD   + V FA  QC
Sbjct: 453 LAMGSSSQGMSI-------FGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CSPI03G02720 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 144.8 bits (364), Expect = 2.5e-33
Identity = 119/405 (29.38%), Postives = 179/405 (44.20%), Query Frame = 0

Query: 74  GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIANNKSVSC 133
           G+Y ++  +G+     S  MDTGSDL+W  C P  C  C        P P      S S 
Sbjct: 94  GEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEP--CTQC-----FSQPTPIFNPQDSSSF 153

Query: 134 SAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLV-ARLYRDS 193
           S   C + +   L           P E+   +EC       + Y YGDGS     +  ++
Sbjct: 154 STLPCESQYCQDL-----------PSETCNNNECQ------YTYGYGDGSTTQGYMATET 213

Query: 194 LSLPTPAPSPPINVRNFTFGCAHTT----LGEPVGVAGFGRGVLSMPSQLATFSPQLGNR 253
            +  T       +V N  FGC         G   G+ G G G LS+PSQL         +
Sbjct: 214 FTFETS------SVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGV------GQ 273

Query: 254 FSYCLVSHSFAADRVRRPSPLILGRYYTGETE-FIYTSLLENPKHPYFYSVGLAGISVGN 313
           FSYC+ S+  ++     PS L LG   +G  E    T+L+ +  +P +Y + L GI+VG 
Sbjct: 274 FSYCMTSYGSSS-----PSTLALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGG 333

Query: 314 VRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANRARRIEEN 373
             +  P    ++ + G+GG+++DSGTT T LP   Y +V   F ++     N     E +
Sbjct: 334 DNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQ----INLPTVDESS 393

Query: 374 TGLSPCYYYE---NSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRKVGCL 433
           +GLS C+      ++V VP + + F G   N  L  +N     +   +GV+       CL
Sbjct: 394 SGLSTCFQQPSDGSTVQVPEISMQFDGGVLN--LGEQNI---LISPAEGVI-------CL 435

Query: 434 MLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQC 470
            +   G  ++L     +  GN QQQ  +V+YDL+   V F   QC
Sbjct: 454 AM---GSSSQLG---ISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of CSPI03G02720 vs. ExPASy TrEMBL
Match: A0A0A0L5I7 (Pepsin A OS=Cucumis sativus OX=3659 GN=Csa_3G020060 PE=3 SV=1)

HSP 1 Score: 972.6 bits (2513), Expect = 5.9e-280
Identity = 479/480 (99.79%), Postives = 480/480 (100.00%), Query Frame = 0

Query: 1   MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHR 60
           MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHR
Sbjct: 1   MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHR 60

Query: 61  HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS 120
           HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS
Sbjct: 61  HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS 120

Query: 121 PLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG 180
           PLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG
Sbjct: 121 PLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG 180

Query: 181 DGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLAT 240
           DGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLAT
Sbjct: 181 DGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLAT 240

Query: 241 FSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGL 300
           FSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGL
Sbjct: 241 FSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGL 300

Query: 301 AGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANR 360
           AGISVGN+RIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANR
Sbjct: 301 AGISVGNIRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANR 360

Query: 361 ARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRK 420
           ARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRK
Sbjct: 361 ARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRK 420

Query: 421 VGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRSK 480
           VGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRSK
Sbjct: 421 VGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRSK 480

BLAST of CSPI03G02720 vs. ExPASy TrEMBL
Match: A0A1S3BK28 (aspartic proteinase nepenthesin-1 OS=Cucumis melo OX=3656 GN=LOC103490888 PE=3 SV=1)

HSP 1 Score: 953.4 bits (2463), Expect = 3.7e-274
Identity = 472/481 (98.13%), Postives = 477/481 (99.17%), Query Frame = 0

Query: 1   MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHR 60
           MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFN+THNLLKSTATRSSARFHR
Sbjct: 1   MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHR 60

Query: 61  HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS 120
           HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS
Sbjct: 61  HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS 120

Query: 121 PLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG 180
           PLPKI+NNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG
Sbjct: 121 PLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG 180

Query: 181 DGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLAT 240
           DGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLAT
Sbjct: 181 DGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLAT 240

Query: 241 FSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGL 300
           FSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY+TGETEFIYTSLLENPKHPYFYSVGL
Sbjct: 241 FSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSVGL 300

Query: 301 AGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANR 360
           AGISVGNVRIPAPEFL KVDE GSGGVVVDSGTTFTMLP+GLYESVVAEFENRTGKVANR
Sbjct: 301 AGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANR 360

Query: 361 ARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGV--VGRK 420
           ARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKS+VVLPRKNYFYEFLDGGDGV  VGRK
Sbjct: 361 ARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEVGRK 420

Query: 421 RKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNR 480
           RKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNR
Sbjct: 421 RKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNR 480

BLAST of CSPI03G02720 vs. ExPASy TrEMBL
Match: A0A5D3CP11 (Aspartic proteinase nepenthesin-1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1017G00280 PE=3 SV=1)

HSP 1 Score: 947.2 bits (2447), Expect = 2.7e-272
Identity = 471/483 (97.52%), Postives = 477/483 (98.76%), Query Frame = 0

Query: 1   MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHR 60
           MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFN+THNLLKSTATRSSARFHR
Sbjct: 1   MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHR 60

Query: 61  HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS 120
           HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS
Sbjct: 61  HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS 120

Query: 121 PLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG 180
           PLPKI+NNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG
Sbjct: 121 PLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG 180

Query: 181 DGSLVARLYRDSLSLPT--PAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQL 240
           DGSLVARLYRDSLSLPT  PAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQL
Sbjct: 181 DGSLVARLYRDSLSLPTPAPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQL 240

Query: 241 ATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSV 300
           ATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY+TGETEFIYTSLLENPKHPYFYSV
Sbjct: 241 ATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSV 300

Query: 301 GLAGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVA 360
           GLAGISVGNVRIPAPEFL KVDE GSGGVVVDSGTTFTMLP+GLYESVVAEFENRTGKVA
Sbjct: 301 GLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVA 360

Query: 361 NRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGV--VG 420
           NRARRIEENTGLSPCYYY+NSVGVPRVVLHFVGEKS+VVLPRKNYFYEFLDGGDGV  VG
Sbjct: 361 NRARRIEENTGLSPCYYYQNSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEVG 420

Query: 421 RKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNL 480
           RKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNL
Sbjct: 421 RKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNL 480

BLAST of CSPI03G02720 vs. ExPASy TrEMBL
Match: A0A6J1L3Z9 (probable aspartyl protease At4g16563 OS=Cucurbita maxima OX=3661 GN=LOC111500303 PE=3 SV=1)

HSP 1 Score: 869.4 bits (2245), Expect = 7.1e-249
Identity = 432/480 (90.00%), Postives = 450/480 (93.75%), Query Frame = 0

Query: 4   SPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHR--- 63
           SPVF+FLLCFLL SPVFSSQI LLPLS+SLSSS SDFNNTHNLLKSTA RSSARFH    
Sbjct: 3   SPVFLFLLCFLLPSPVFSSQILLLPLSNSLSSS-SDFNNTHNLLKSTAARSSARFHHRRR 62

Query: 64  -HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ 123
            H  +HLSLPLSPGGDYTLSFNLGSES KISLYMDTGSDLVWFPCSPFECILCEGKPKIQ
Sbjct: 63  THHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ 122

Query: 124 SPLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAY 183
           SPLPKI+N KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIE+SECSSFSCPPFYYAY
Sbjct: 123 SPLPKISNQKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYAY 182

Query: 184 GDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLA 243
           GDGSL+ RLYRDSLSLP PAPSP INVRNFTFGCAH+ LGEP+GVAGFGRG+LSMP QLA
Sbjct: 183 GDGSLIGRLYRDSLSLPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPIQLA 242

Query: 244 TFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVG 303
           TFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYY  ETEFIYTS+LENPKHPYFYSVG
Sbjct: 243 TFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSMLENPKHPYFYSVG 302

Query: 304 LAGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVAN 363
           LAGISVG+V IPAPEFL KVDEGGSGGVVVDSGTTFTMLPAGLY SVVA+FENRTG+VA+
Sbjct: 303 LAGISVGSVMIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRVAS 362

Query: 364 RARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKR 423
           RA +IEENTGLSPCYYYE SV VPRVVLHFVGEKS+V+LPRKNYFYEFLDGGDG VGRK 
Sbjct: 363 RASQIEENTGLSPCYYYEKSVEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDG-VGRKI 422

Query: 424 KVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS 480
           KVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWD+LNRS
Sbjct: 423 KVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLNRS 480

BLAST of CSPI03G02720 vs. ExPASy TrEMBL
Match: A0A6J1EC44 (probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC111431201 PE=3 SV=1)

HSP 1 Score: 867.5 bits (2240), Expect = 2.7e-248
Identity = 433/482 (89.83%), Postives = 451/482 (93.57%), Query Frame = 0

Query: 4   SPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHR--- 63
           SPVF+FLLCFL SSPVFSSQ+ LLPLS+SLSSS SDFNNTHNLLKSTA RSSARFH    
Sbjct: 3   SPVFLFLLCFLFSSPVFSSQLLLLPLSNSLSSS-SDFNNTHNLLKSTAARSSARFHHRRR 62

Query: 64  -HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ 123
            H  +HLSLPLSPGGDYTLSFNLGSES KISLYMDTGSDLVWFPCSPFECILCEGKPKIQ
Sbjct: 63  THHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ 122

Query: 124 SPLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAY 183
           SPLPKI+N KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIE+SECSSFSCPPFYYAY
Sbjct: 123 SPLPKISNQKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYAY 182

Query: 184 GDGSLVARLYRDSLSL--PTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQ 243
           GDGSL+ RLYRDSLSL  P PAPSP INVRNFTFGCAH+ LGEP+GVAGFGRG+LSMPSQ
Sbjct: 183 GDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPSQ 242

Query: 244 LATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYS 303
           LATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYY  ETEFIYTS+LENPKHPYFYS
Sbjct: 243 LATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSMLENPKHPYFYS 302

Query: 304 VGLAGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKV 363
           VGLAGISVG+VRIPAPEFL +VDEGGSGGVVVDSGTTFTMLPAGLY SVVA+FENRTG+V
Sbjct: 303 VGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRV 362

Query: 364 ANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGR 423
           A+RA RIEENTGLSPCY YE SV VPRVVLHFVGEKS+V LPRKNYFYEFLDGGDG VGR
Sbjct: 363 ASRASRIEENTGLSPCYSYEKSVEVPRVVLHFVGEKSSVELPRKNYFYEFLDGGDG-VGR 422

Query: 424 KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLN 480
           KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWD+LN
Sbjct: 423 KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLN 482

BLAST of CSPI03G02720 vs. NCBI nr
Match: XP_004147205.1 (probable aspartyl protease At4g16563 [Cucumis sativus])

HSP 1 Score: 972.6 bits (2513), Expect = 1.2e-279
Identity = 479/480 (99.79%), Postives = 480/480 (100.00%), Query Frame = 0

Query: 1   MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHR 60
           MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHR
Sbjct: 1   MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHR 60

Query: 61  HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS 120
           HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS
Sbjct: 61  HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS 120

Query: 121 PLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG 180
           PLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG
Sbjct: 121 PLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG 180

Query: 181 DGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLAT 240
           DGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLAT
Sbjct: 181 DGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLAT 240

Query: 241 FSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGL 300
           FSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGL
Sbjct: 241 FSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGL 300

Query: 301 AGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANR 360
           AGISVGN+RIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANR
Sbjct: 301 AGISVGNIRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANR 360

Query: 361 ARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRK 420
           ARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRK
Sbjct: 361 ARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRK 420

Query: 421 VGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRSK 480
           VGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRSK
Sbjct: 421 VGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRSK 480

BLAST of CSPI03G02720 vs. NCBI nr
Match: XP_008448851.1 (PREDICTED: aspartic proteinase nepenthesin-1 [Cucumis melo])

HSP 1 Score: 953.4 bits (2463), Expect = 7.7e-274
Identity = 472/481 (98.13%), Postives = 477/481 (99.17%), Query Frame = 0

Query: 1   MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHR 60
           MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFN+THNLLKSTATRSSARFHR
Sbjct: 1   MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHR 60

Query: 61  HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS 120
           HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS
Sbjct: 61  HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS 120

Query: 121 PLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG 180
           PLPKI+NNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG
Sbjct: 121 PLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG 180

Query: 181 DGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLAT 240
           DGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLAT
Sbjct: 181 DGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLAT 240

Query: 241 FSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGL 300
           FSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY+TGETEFIYTSLLENPKHPYFYSVGL
Sbjct: 241 FSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSVGL 300

Query: 301 AGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANR 360
           AGISVGNVRIPAPEFL KVDE GSGGVVVDSGTTFTMLP+GLYESVVAEFENRTGKVANR
Sbjct: 301 AGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANR 360

Query: 361 ARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGV--VGRK 420
           ARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKS+VVLPRKNYFYEFLDGGDGV  VGRK
Sbjct: 361 ARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEVGRK 420

Query: 421 RKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNR 480
           RKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNR
Sbjct: 421 RKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNR 480

BLAST of CSPI03G02720 vs. NCBI nr
Match: TYK12019.1 (aspartic proteinase nepenthesin-1 [Cucumis melo var. makuwa])

HSP 1 Score: 947.2 bits (2447), Expect = 5.5e-272
Identity = 471/483 (97.52%), Postives = 477/483 (98.76%), Query Frame = 0

Query: 1   MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHR 60
           MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFN+THNLLKSTATRSSARFHR
Sbjct: 1   MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHR 60

Query: 61  HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS 120
           HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS
Sbjct: 61  HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS 120

Query: 121 PLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG 180
           PLPKI+NNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG
Sbjct: 121 PLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG 180

Query: 181 DGSLVARLYRDSLSLPT--PAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQL 240
           DGSLVARLYRDSLSLPT  PAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQL
Sbjct: 181 DGSLVARLYRDSLSLPTPAPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQL 240

Query: 241 ATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSV 300
           ATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY+TGETEFIYTSLLENPKHPYFYSV
Sbjct: 241 ATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSV 300

Query: 301 GLAGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVA 360
           GLAGISVGNVRIPAPEFL KVDE GSGGVVVDSGTTFTMLP+GLYESVVAEFENRTGKVA
Sbjct: 301 GLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVA 360

Query: 361 NRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGV--VG 420
           NRARRIEENTGLSPCYYY+NSVGVPRVVLHFVGEKS+VVLPRKNYFYEFLDGGDGV  VG
Sbjct: 361 NRARRIEENTGLSPCYYYQNSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEVG 420

Query: 421 RKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNL 480
           RKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNL
Sbjct: 421 RKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNL 480

BLAST of CSPI03G02720 vs. NCBI nr
Match: XP_038905814.1 (probable aspartyl protease At4g16563 [Benincasa hispida])

HSP 1 Score: 894.4 bits (2310), Expect = 4.2e-256
Identity = 441/480 (91.88%), Postives = 456/480 (95.00%), Query Frame = 0

Query: 4   SPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHRHR- 63
           S VF+ LLCFLLSSPVFSSQ+ LLPLSHSLSSSISDFNNTHNLLKSTA RSSARFH  R 
Sbjct: 3   SSVFVLLLCFLLSSPVFSSQLLLLPLSHSLSSSISDFNNTHNLLKSTAARSSARFHHRRR 62

Query: 64  ---HNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ 123
              HNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPK+Q
Sbjct: 63  TQHHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKVQ 122

Query: 124 SPLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAY 183
           SPLPKI+NNKSVSCSA ACSAAHGGSLSASHLCAIS+CPLESIEISECSSFSCPPFYYAY
Sbjct: 123 SPLPKISNNKSVSCSAPACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYYAY 182

Query: 184 GDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLA 243
           GDGSL+ARLYRDSLSLP PAPSP INVRNFTFGCAHT LGEPVGVAGFGRG LSMPSQLA
Sbjct: 183 GDGSLIARLYRDSLSLPAPAPSPAINVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLA 242

Query: 244 TFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVG 303
           TFSPQLGNRFSYCLVSHSFAA+RVRRPSPLILGRYY GETEFIYTSLLENPKHPYFYSVG
Sbjct: 243 TFSPQLGNRFSYCLVSHSFAAERVRRPSPLILGRYYGGETEFIYTSLLENPKHPYFYSVG 302

Query: 304 LAGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVAN 363
           L GISVGN+ IPAPEFL KVDEGGSGGVVVDSGTTFTMLPAGLY+SVVA FENRTG+VAN
Sbjct: 303 LTGISVGNMMIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAAFENRTGRVAN 362

Query: 364 RARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKR 423
           RARRIEENTGLSPCYYYENSV VPRVVLHFVGEKS+V+LP+KNYFYEFLDGGDG VG+KR
Sbjct: 363 RARRIEENTGLSPCYYYENSVEVPRVVLHFVGEKSSVLLPKKNYFYEFLDGGDG-VGKKR 422

Query: 424 KVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS 480
           KVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEV YDL KNRVGFARRQCSTLWD+LNRS
Sbjct: 423 KVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLAKNRVGFARRQCSTLWDSLNRS 481

BLAST of CSPI03G02720 vs. NCBI nr
Match: KAE8650060.1 (hypothetical protein Csa_011084 [Cucumis sativus])

HSP 1 Score: 883.6 bits (2282), Expect = 7.5e-253
Identity = 437/438 (99.77%), Postives = 438/438 (100.00%), Query Frame = 0

Query: 1   MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHR 60
           MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHR
Sbjct: 1   MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHR 60

Query: 61  HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS 120
           HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS
Sbjct: 61  HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS 120

Query: 121 PLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG 180
           PLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG
Sbjct: 121 PLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG 180

Query: 181 DGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLAT 240
           DGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLAT
Sbjct: 181 DGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLAT 240

Query: 241 FSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGL 300
           FSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGL
Sbjct: 241 FSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGL 300

Query: 301 AGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANR 360
           AGISVGN+RIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANR
Sbjct: 301 AGISVGNIRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANR 360

Query: 361 ARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRK 420
           ARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRK
Sbjct: 361 ARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRK 420

Query: 421 VGCLMLMNGGDEAELAGG 439
           VGCLMLMNGGDEAELAGG
Sbjct: 421 VGCLMLMNGGDEAELAGG 438

BLAST of CSPI03G02720 vs. TAIR 10
Match: AT4G16563.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 581.6 bits (1498), Expect = 5.7e-166
Identity = 298/478 (62.34%), Postives = 356/478 (74.48%), Query Frame = 0

Query: 26  LLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHRHRH----NHLSLPLSPGGDYTLSFN 85
           LL LSHSLS+S    +  H LLKS+++RSSARF RH H      LSLP+S G DY +S +
Sbjct: 30  LLHLSHSLSTSKHSSSPLH-LLKSSSSRSSARFRRHHHKQQQQQLSLPISSGSDYLISLS 89

Query: 86  LGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIANN-KSVSCSAAACSA 145
           +GS S  +SLY+DTGSDLVWFPC PF CILCE KP   SP   ++++  +VSCS+ +CSA
Sbjct: 90  VGSSSSAVSLYLDTGSDLVWFPCRPFTCILCESKPLPPSPPSSLSSSATTVSCSSPSCSA 149

Query: 146 AHGGSLSASHLCAISRCPLESIEISEC--SSFSCPPFYYAYGDGSLVARLYRDSLSLPTP 205
           AH  SL +S LCAIS CPL+ IE  +C  SS+ CPPFYYAYGDGSLVA+LY DSLSLP+ 
Sbjct: 150 AH-SSLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSLVAKLYSDSLSLPS- 209

Query: 206 APSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSF 265
                ++V NFTFGCAHTTL EP+GVAGFGRG LS+P+QLA  SP LGN FSYCLVSHSF
Sbjct: 210 -----VSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSF 269

Query: 266 AADRVRRPSPLILGRYY------TGET--------------EFIYTSLLENPKHPYFYSV 325
            +DRVRRPSPLILGR+        G T              EF++T +LENPKHPYFYSV
Sbjct: 270 DSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPKHPYFYSV 329

Query: 326 GLAGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVA 385
            L GIS+G   IPAP  L ++D+ G GGVVVDSGTTFTMLPA  Y SVV EF++R G+V 
Sbjct: 330 SLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVH 389

Query: 386 NRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRK 445
            RA R+E ++G+SPCYY   +V VP +VLHF G +S+V LPR+NYFYEF+DGGDG    K
Sbjct: 390 ERADRVEPSSGMSPCYYLNQTVKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDG-KEEK 449

Query: 446 RKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNL 477
           RK+GCLMLMNGGDE+EL GG GA LGNYQQQGFEVVYDL   RVGFA+R+C++LWD+L
Sbjct: 450 RKIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCASLWDSL 498

BLAST of CSPI03G02720 vs. TAIR 10
Match: AT5G45120.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 208.4 bits (529), Expect = 1.3e-53
Identity = 145/421 (34.44%), Postives = 204/421 (48.46%), Query Frame = 0

Query: 76  YTLSFNLGSESHKISLYMDTGSDLVWFPCS--PFECILCEG-------KPKIQSPLPKIA 135
           Y ++ N+G+    + +Y+DTGSDL W PC    F+CI C          P + SPL    
Sbjct: 83  YLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNNDLKSPSVFSPLHSST 142

Query: 136 NNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVA 195
           + +  SC+++ C   H  S +    CA++ C +  +  S C    CP F Y YG+G L++
Sbjct: 143 SFRD-SCASSFCVEIH-SSDNPFDPCAVAGCSVSMLLKSTCVR-PCPSFAYTYGEGGLIS 202

Query: 196 R-LYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQL 255
             L RD L   T       +V  F+FGC  +T  EP+G+AGFGRG+LS+PSQL      L
Sbjct: 203 GILTRDILKARTR------DVPRFSFGCVTSTYREPIGIAGFGRGLLSLPSQLGF----L 262

Query: 256 GNRFSYCLVSHSFAADRVRRPSPLILGRYYTG---ETEFIYTSLLENPKHPYFYSVGLAG 315
              FS+C +   F  +     SPLILG             +T +L  P +P  Y +GL  
Sbjct: 263 EKGFSHCFLPFKF-VNNPNISSPLILGASALSINLTDSLQFTPMLNTPMYPNSYYIGLES 322

Query: 316 ISVGNVRIP--APEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANR 375
           I++G    P   P  L + D  G+GG++VDSGTT+T LP   Y  ++   ++       R
Sbjct: 323 ITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLTTLQSTI--TYPR 382

Query: 376 ARRIEENTGLSPCY----------YYENSVGV--PRVVLHFVGEKSNVVLPRKNYFYEFL 435
           A   E  TG   CY            EN V +  P +  HF+   + ++LP+ N FY   
Sbjct: 383 ATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFL-NNATLLLPQGNSFYAMS 442

Query: 436 DGGDGVVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQ 470
              DG V     V CL+  N  D      GP    G++QQQ  +VVYDLEK R+GF    
Sbjct: 443 APSDGSV-----VQCLLFQNMEDGDY---GPAGVFGSFQQQNVKVVYDLEKERIGFQAMD 478

BLAST of CSPI03G02720 vs. TAIR 10
Match: AT3G52500.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 178.3 bits (451), Expect = 1.4e-44
Identity = 151/505 (29.90%), Postives = 226/505 (44.75%), Query Frame = 0

Query: 4   SPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHRHRH 63
           S +F F L FL  S V + ++ L P SHS  S    + +   L +S    S AR H+ +H
Sbjct: 3   SSIFFFFLIFL--SVVSAVKLPLSPFSHSDQSPKDPYLSLRRLAES----SIARAHKLKH 62

Query: 64  NH-------------------LSLPLSPG--GDYTLSFNLGSESHKISLYMDTGSDLVWF 123
                                +  PLS    G Y++S + G+ S  I    DTGS LVW 
Sbjct: 63  GTSIKPDEDALSSTTTASATVVKSPLSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWL 122

Query: 124 PC-SPFECILCEGKPKIQSPLPKI-----ANNKSVSCSAAACSAAHGGSLSASHLCAISR 183
           PC S + C  C+      + +P+      +++K + C +  C   +G ++         +
Sbjct: 123 PCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSKIIGCQSPKCQFLYGPNV---------Q 182

Query: 184 CPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHT 243
           C         C +  CPP+   YG GS    L  + L        P + V +F  GC+  
Sbjct: 183 CRGCDPNTRNC-TVGCPPYILQYGLGSTAGVLITEKLDF------PDLTVPDFVVGCSII 242

Query: 244 TLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYT 303
           +  +P G+AGFGRG +S+PSQ+         RFS+CLVS  F    V     L  G  + 
Sbjct: 243 STRQPAGIAGFGRGPVSLPSQMNL------KRFSHCLVSRRFDDTNVTTDLDLDTGSGHN 302

Query: 304 GETE---FIYTSLLENPK-----HPYFYSVGLAGISVGNVRIPAPEFLTKVDEGGSGGVV 363
             ++     YT   +NP         +Y + L  I VG   +  P         G GG +
Sbjct: 303 SGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVKIPYKYLAPGTNGDGGSI 362

Query: 364 VDSGTTFTMLPAGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYY--ENSVGVPRVV 423
           VDSG+TFT +   ++E V  EF ++      R + +E+ TGL PC+    +  V VP ++
Sbjct: 363 VDSGSTFTFMERPVFELVAEEFASQMSNY-TREKDLEKETGLGPCFNISGKGDVTVPELI 422

Query: 424 LHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRKVGCLMLMNGGDEAELAG-GPGATLGN 471
             F G  + + LP  NYF  F+   D V        CL +++        G GP   LG+
Sbjct: 423 FEFKG-GAKLELPLSNYF-TFVGNTDTV--------CLTVVSDKTVNPSGGTGPAIILGS 468

BLAST of CSPI03G02720 vs. TAIR 10
Match: AT3G61820.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 152.5 bits (384), Expect = 8.5e-37
Identity = 131/407 (32.19%), Postives = 175/407 (43.00%), Query Frame = 0

Query: 70  LSPG-GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIANN 129
           LS G G+Y +   +G+ +  + + +DTGSD+VW  CSP  C  C  +        K    
Sbjct: 128 LSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSP--CKACYNQTDAIFDPKKSKTF 187

Query: 130 KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARL 189
            +V C +  C       L  S  C   R            S +C  +  +YGDGS     
Sbjct: 188 ATVPCGSRLCR-----RLDDSSECVTRR------------SKTC-LYQVSYGDGSFT--- 247

Query: 190 YRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVA---GFGRGVLSMPSQLATFSPQL 249
                S  T        V +   GC H   G  VG A   G GRG LS PSQ      + 
Sbjct: 248 -EGDFSTET-LTFHGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKN---RY 307

Query: 250 GNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGLAGISV 309
             +FSYCLV  + +    + PS ++ G     +T  ++T LL NPK   FY + L GISV
Sbjct: 308 NGKFSYCLVDRTSSGSSSKPPSTIVFGNAAVPKTS-VFTPLLTNPKLDTFYYLQLLGISV 367

Query: 310 GNVRIP-APEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANRARRI 369
           G  R+P   E   K+D  G+GGV++DSGT+ T L    Y ++   F  R G  A + +R 
Sbjct: 368 GGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAF--RLG--ATKLKRA 427

Query: 370 EENTGLSPCYYYE--NSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRKVG 429
              +    C+      +V VP VV HF G    V LP  NY       G         +G
Sbjct: 428 PSYSLFDTCFDLSGMTTVKVPTVVFHFGG--GEVSLPASNYLIPVNTEGRFCFAFAGTMG 483

Query: 430 CLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQC 470
            L +                +GN QQQGF V YDL  +RVGF  R C
Sbjct: 488 SLSI----------------IGNIQQQGFRVAYDLVGSRVGFLSRAC 483

BLAST of CSPI03G02720 vs. TAIR 10
Match: AT1G01300.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 152.1 bits (383), Expect = 1.1e-36
Identity = 133/409 (32.52%), Postives = 183/409 (44.74%), Query Frame = 0

Query: 70  LSPG-GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIANN 129
           LS G G+Y     +G+ +  + + +DTGSD+VW  C+P  C  C  +       P     
Sbjct: 135 LSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAP--CRRCYSQSD-----PIFDPR 194

Query: 130 KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSL-VAR 189
           KS + +   CS+ H   L ++  C   R          C       +  +YGDGS  V  
Sbjct: 195 KSKTYATIPCSSPHCRRLDSAG-CNTRR--------KTCL------YQVSYGDGSFTVGD 254

Query: 190 LYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVA---GFGRGVLSMPSQLATFSPQ 249
              ++L+           V+    GC H   G  VG A   G G+G LS P Q      +
Sbjct: 255 FSTETLTFRRN------RVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQT---GHR 314

Query: 250 LGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGLAGIS 309
              +FSYCLV  S ++    +PS ++ G          +T LL NPK   FY VGL GIS
Sbjct: 315 FNQKFSYCLVDRSASS----KPSSVVFGNAAVSRIA-RFTPLLSNPKLDTFYYVGLLGIS 374

Query: 310 VGNVRIP-APEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANRARR 369
           VG  R+P     L K+D+ G+GGV++DSGT+ T L    Y ++   F  R G  A   +R
Sbjct: 375 VGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAF--RVG--AKTLKR 434

Query: 370 IEENTGLSPCYYYE--NSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRKV 429
             + +    C+     N V VP VVLHF G  ++V LP  NY       G         +
Sbjct: 435 APDFSLFDTCFDLSNMNEVKVPTVVLHFRG--ADVSLPATNYLIPVDTNGKFCFAFAGTM 485

Query: 430 GCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCS 471
           G L +                +GN QQQGF VVYDL  +RVGFA   C+
Sbjct: 495 GGLSI----------------IGNIQQQGFRVVYDLASSRVGFAPGGCA 485

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q940R48.0e-16562.34Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g1656... [more]
Q9LNJ31.6e-3532.52Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
Q9LS402.5e-3329.45Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
Q766C32.5e-3329.80Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
Q766C22.5e-3329.38Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
Match NameE-valueIdentityDescription
A0A0A0L5I75.9e-28099.79Pepsin A OS=Cucumis sativus OX=3659 GN=Csa_3G020060 PE=3 SV=1[more]
A0A1S3BK283.7e-27498.13aspartic proteinase nepenthesin-1 OS=Cucumis melo OX=3656 GN=LOC103490888 PE=3 S... [more]
A0A5D3CP112.7e-27297.52Aspartic proteinase nepenthesin-1 OS=Cucumis melo var. makuwa OX=1194695 GN=E567... [more]
A0A6J1L3Z97.1e-24990.00probable aspartyl protease At4g16563 OS=Cucurbita maxima OX=3661 GN=LOC111500303... [more]
A0A6J1EC442.7e-24889.83probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC1114312... [more]
Match NameE-valueIdentityDescription
XP_004147205.11.2e-27999.79probable aspartyl protease At4g16563 [Cucumis sativus][more]
XP_008448851.17.7e-27498.13PREDICTED: aspartic proteinase nepenthesin-1 [Cucumis melo][more]
TYK12019.15.5e-27297.52aspartic proteinase nepenthesin-1 [Cucumis melo var. makuwa][more]
XP_038905814.14.2e-25691.88probable aspartyl protease At4g16563 [Benincasa hispida][more]
KAE8650060.17.5e-25399.77hypothetical protein Csa_011084 [Cucumis sativus][more]
Match NameE-valueIdentityDescription
AT4G16563.15.7e-16662.34Eukaryotic aspartyl protease family protein [more]
AT5G45120.11.3e-5334.44Eukaryotic aspartyl protease family protein [more]
AT3G52500.11.4e-4429.90Eukaryotic aspartyl protease family protein [more]
AT3G61820.18.5e-3732.19Eukaryotic aspartyl protease family protein [more]
AT1G01300.11.1e-3632.52Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 295..465
e-value: 1.9E-28
score: 99.3
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 54..261
e-value: 7.3E-34
score: 119.4
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 272..475
e-value: 4.3E-49
score: 168.7
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 71..474
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 76..259
e-value: 1.8E-28
score: 100.0
NoneNo IPR availablePANTHERPTHR47967OS07G0603500 PROTEIN-RELATEDcoord: 1..474
NoneNo IPR availablePANTHERPTHR47967:SF26BNAA01G17170D PROTEINcoord: 1..474
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 327..338
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 76..465
score: 29.87694
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 76..469
e-value: 1.83697E-71
score: 225.605

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI03G02720.1CSPI03G02720.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005576 extracellular region
molecular_function GO:0004190 aspartic-type endopeptidase activity