CSPI03G02720 (gene) Wild cucumber (PI 183967)

Name: CSPI03G02720
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Eukaryotic aspartyl protease family protein
Location: Chr3 : 2104705 .. 2106775 (+)
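
The location field can be turned into a span length with a few lines of Python. This is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are illustrative, and the calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(text):
    """Parse a string such as 'Chr3 : 2104705 .. 2106775 (+)'.

    Returns (chromosome, start, end, strand).
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


chrom, start, end, strand = parse_location("Chr3 : 2104705 .. 2106775 (+)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the feature spans 2071 bases on the plus strand of Chr3.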
The following sequences are available for this feature:

Gene sequence (with intron)

AAAATATGAACCAAACTTTTAAATTTTTTTAAGGAAAACTAAGCTTATAATTTAAGAAGATAAAGAAAGAAGAAAGAAGAAAGAAGAAATAAGAAAGAGGGTGTAGTTGTAATGGTCACCACTTTTGAAACCCATACAAAAACCCCATCTTTCTCGTCTTCTAACCAGAACCTTCTTCTTCTTCTTCTATTCTCAATCCCCATTTCTTTAAATTCCCCAGTTTCTATCTCTAATTTCATTTCAAAAACCCCAATTTCTCTATCTAAAGTACTCCATTAATGGCGGTTTCCCCTGTTTTCATCTTCCTCCTTTGTTTTCTCCTCTCCTCCCCTGTTTTCTCCTCACAAATTTTCCTTCTACCTCTCTCCCATTCCTTATCATCCTCAATCTCCGATTTCAACAACACCCACAATCTTCTCAAATCCACTGCCACCCGCTCCTCCGCCCGATTCCACCGCCACCGCCATAACCACCTCTCTCTGCCCCTCTCCCCTGGCGGCGATTACACTCTCTCCTTCAACCTCGGCTCTGAGTCTCACAAAATTTCCCTCTATATGGACACTGGCAGCGACCTCGTTTGGTTCCCTTGTTCCCCGTTTGAGTGTATTCTTTGTGAAGGTAAACCAAAAATTCAATCCCCTTTGCCCAAAATCGCAAATAACAAATCAGTTTCCTGCAGCGCCGCCGCCTGCTCCGCCGCTCACGGTGGCTCCCTCTCCGCTTCCCACCTCTGTGCAATTTCTCGATGTCCACTTGAATCCATTGAAATTTCTGAGTGCTCCTCTTTTTCCTGTCCTCCGTTTTATTATGCTTATGGCGATGGGAGTTTGGTTGCTCGGCTTTATAGAGATAGCCTCAGCTTGCCAACGCCGGCGCCATCTCCGCCTATTAATGTTCGGAATTTTACTTTTGGATGTGCCCACACGACGCTTGGCGAGCCGGTTGGGGTTGCCGGATTCGGCCGGGGGGTGTTGTCAATGCCCAGTCAACTCGCTACTTTCTCCCCTCAACTCGGGAACCGGTTTTCTTATTGTTTGGTTTCTCACTCGTTTGCGGCAGACCGAGTTCGCCGCCCGAGTCCACTGATTCTCGGGCGGTACTACACCGGGGAGACGGAGTTCATTTACACTTCCTTGCTTGAGAATCCAAAACATCCTTACTTTTACTCCGTTGGGTTGGCCGGAATATCAGTTGGGAATGTAAGGATTCCAGCGCCGGAGTTTCTGACAAAAGTGGATGAGGGTGGGAGTGGCGGCGTTGTGGTGGATTCCGGCACTACTTTTACTATGCTCCCGGCAGGTTTGTATGAATCGGTGGTGGCCGAGTTCGAAAACCGTACCGGAAAGGTTGCAAACCGGGCGAGACGGATTGAAGAAAATACCGGGTTGAGCCCTTGCTATTACTACGAGAACTCAGTTGGGGTGCCACGTGTCGTGCTACATTTTGTTGGGGAAAAATCCAATGTGGTGCTTCCTAGGAAGAACTATTTCTACGAGTTTTTGGACGGCGGAGATGGGGTGGTGGGGAGGAAGAGAAAAGTTGGATGTTTGATGTTGATGAACGGTGGAGATGAGGCTGAGCTGGCAGGTGGGCCTGGTGCGACGCTTGGGAACTACCAACAACAAGGTTTTGAGGTGGTTTATGATTTGGAAAAGAACCGGGTTGGATTCGCTCGGCGGCAATGCTCCACTCTTTGGGACAATTTGAACCGGAGTAAGTGAAAGTGTGAACCCGGTTGAGGAAGTGAGAGTTGATCTTTGACATTGTGAGTATTGTCAACGGTGAAGGGTATGAGGGTAAATAAGGTAATTTTAGGTTTCAAGGGTTTATTTATTTATTTTATTCTATTTTTTGTTGTAAATTCCTTGGGCACTTCACTTCTTGCTTTAATCTTATTTTTACTTGAAATTTGTATATTAAAAGTGTTCAAAAAAAAAATGTATAGCTCAAATTTTATTGCATGAAGCAAGGAATGAAGTGGTTGCTTTTATCAAGTTAAG
AATAAAATAGGGGCAATAAATTAAGGATGTTAAAAAACAAATCATAAATCAAATTTACTTGTAAAACTTTT

mRNA sequence

ATGGCGGTTTCCCCTGTTTTCATCTTCCTCCTTTGTTTTCTCCTCTCCTCCCCTGTTTTCTCCTCACAAATTTTCCTTCTACCTCTCTCCCATTCCTTATCATCCTCAATCTCCGATTTCAACAACACCCACAATCTTCTCAAATCCACTGCCACCCGCTCCTCCGCCCGATTCCACCGCCACCGCCATAACCACCTCTCTCTGCCCCTCTCCCCTGGCGGCGATTACACTCTCTCCTTCAACCTCGGCTCTGAGTCTCACAAAATTTCCCTCTATATGGACACTGGCAGCGACCTCGTTTGGTTCCCTTGTTCCCCGTTTGAGTGTATTCTTTGTGAAGGTAAACCAAAAATTCAATCCCCTTTGCCCAAAATCGCAAATAACAAATCAGTTTCCTGCAGCGCCGCCGCCTGCTCCGCCGCTCACGGTGGCTCCCTCTCCGCTTCCCACCTCTGTGCAATTTCTCGATGTCCACTTGAATCCATTGAAATTTCTGAGTGCTCCTCTTTTTCCTGTCCTCCGTTTTATTATGCTTATGGCGATGGGAGTTTGGTTGCTCGGCTTTATAGAGATAGCCTCAGCTTGCCAACGCCGGCGCCATCTCCGCCTATTAATGTTCGGAATTTTACTTTTGGATGTGCCCACACGACGCTTGGCGAGCCGGTTGGGGTTGCCGGATTCGGCCGGGGGGTGTTGTCAATGCCCAGTCAACTCGCTACTTTCTCCCCTCAACTCGGGAACCGGTTTTCTTATTGTTTGGTTTCTCACTCGTTTGCGGCAGACCGAGTTCGCCGCCCGAGTCCACTGATTCTCGGGCGGTACTACACCGGGGAGACGGAGTTCATTTACACTTCCTTGCTTGAGAATCCAAAACATCCTTACTTTTACTCCGTTGGGTTGGCCGGAATATCAGTTGGGAATGTAAGGATTCCAGCGCCGGAGTTTCTGACAAAAGTGGATGAGGGTGGGAGTGGCGGCGTTGTGGTGGATTCCGGCACTACTTTTACTATGCTCCCGGCAGGTTTGTATGAATCGGTGGTGGCCGAGTTCGAAAACCGTACCGGAAAGGTTGCAAACCGGGCGAGACGGATTGAAGAAAATACCGGGTTGAGCCCTTGCTATTACTACGAGAACTCAGTTGGGGTGCCACGTGTCGTGCTACATTTTGTTGGGGAAAAATCCAATGTGGTGCTTCCTAGGAAGAACTATTTCTACGAGTTTTTGGACGGCGGAGATGGGGTGGTGGGGAGGAAGAGAAAAGTTGGATGTTTGATGTTGATGAACGGTGGAGATGAGGCTGAGCTGGCAGGTGGGCCTGGTGCGACGCTTGGGAACTACCAACAACAAGGTTTTGAGGTGGTTTATGATTTGGAAAAGAACCGGGTTGGATTCGCTCGGCGGCAATGCTCCACTCTTTGGGACAATTTGAACCGGAGTAAGTGA

Coding sequence (CDS)

ATGGCGGTTTCCCCTGTTTTCATCTTCCTCCTTTGTTTTCTCCTCTCCTCCCCTGTTTTCTCCTCACAAATTTTCCTTCTACCTCTCTCCCATTCCTTATCATCCTCAATCTCCGATTTCAACAACACCCACAATCTTCTCAAATCCACTGCCACCCGCTCCTCCGCCCGATTCCACCGCCACCGCCATAACCACCTCTCTCTGCCCCTCTCCCCTGGCGGCGATTACACTCTCTCCTTCAACCTCGGCTCTGAGTCTCACAAAATTTCCCTCTATATGGACACTGGCAGCGACCTCGTTTGGTTCCCTTGTTCCCCGTTTGAGTGTATTCTTTGTGAAGGTAAACCAAAAATTCAATCCCCTTTGCCCAAAATCGCAAATAACAAATCAGTTTCCTGCAGCGCCGCCGCCTGCTCCGCCGCTCACGGTGGCTCCCTCTCCGCTTCCCACCTCTGTGCAATTTCTCGATGTCCACTTGAATCCATTGAAATTTCTGAGTGCTCCTCTTTTTCCTGTCCTCCGTTTTATTATGCTTATGGCGATGGGAGTTTGGTTGCTCGGCTTTATAGAGATAGCCTCAGCTTGCCAACGCCGGCGCCATCTCCGCCTATTAATGTTCGGAATTTTACTTTTGGATGTGCCCACACGACGCTTGGCGAGCCGGTTGGGGTTGCCGGATTCGGCCGGGGGGTGTTGTCAATGCCCAGTCAACTCGCTACTTTCTCCCCTCAACTCGGGAACCGGTTTTCTTATTGTTTGGTTTCTCACTCGTTTGCGGCAGACCGAGTTCGCCGCCCGAGTCCACTGATTCTCGGGCGGTACTACACCGGGGAGACGGAGTTCATTTACACTTCCTTGCTTGAGAATCCAAAACATCCTTACTTTTACTCCGTTGGGTTGGCCGGAATATCAGTTGGGAATGTAAGGATTCCAGCGCCGGAGTTTCTGACAAAAGTGGATGAGGGTGGGAGTGGCGGCGTTGTGGTGGATTCCGGCACTACTTTTACTATGCTCCCGGCAGGTTTGTATGAATCGGTGGTGGCCGAGTTCGAAAACCGTACCGGAAAGGTTGCAAACCGGGCGAGACGGATTGAAGAAAATACCGGGTTGAGCCCTTGCTATTACTACGAGAACTCAGTTGGGGTGCCACGTGTCGTGCTACATTTTGTTGGGGAAAAATCCAATGTGGTGCTTCCTAGGAAGAACTATTTCTACGAGTTTTTGGACGGCGGAGATGGGGTGGTGGGGAGGAAGAGAAAAGTTGGATGTTTGATGTTGATGAACGGTGGAGATGAGGCTGAGCTGGCAGGTGGGCCTGGTGCGACGCTTGGGAACTACCAACAACAAGGTTTTGAGGTGGTTTATGATTTGGAAAAGAACCGGGTTGGATTCGCTCGGCGGCAATGCTCCACTCTTTGGGACAATTTGAACCGGAGTAAGTGA
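
As a sanity check, the coding sequence above can be translated and compared with the query protein shown in the BLAST alignments. The sketch below assumes the standard genetic code (NCBI translation table 1); `translate` is an illustrative helper, not a function from any particular library. The usage example translates only the first 42 bases of the CDS.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 42 bases of the CDS listed above.
print(translate("ATGGCGGTTTCCCCTGTTTTCATCTTCCTCCTTTGTTTTCTC"))
```

The translated prefix should agree with the start of the query sequence in the alignments (MAVSPVFIFLLCFL...).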
BLAST of CSPI03G02720 vs. Swiss-Prot
Match: ASP63_ARATH (Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana GN=At4g16563 PE=2 SV=1)

HSP 1 Score: 581.6 bits (1498), Expect = 7.7e-165
Identity = 298/478 (62.34%), Positives = 356/478 (74.48%), Query Frame = 1

Query: 26  LLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHRHRHNH----LSLPLSPGGDYTLSFN 85
           LL LSHSLS+S    +  H LLKS+++RSSARF RH H      LSLP+S G DY +S +
Sbjct: 30  LLHLSHSLSTSKHSSSPLH-LLKSSSSRSSARFRRHHHKQQQQQLSLPISSGSDYLISLS 89

Query: 86  LGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIANNKS-VSCSAAACSA 145
           +GS S  +SLY+DTGSDLVWFPC PF CILCE KP   SP   ++++ + VSCS+ +CSA
Sbjct: 90  VGSSSSAVSLYLDTGSDLVWFPCRPFTCILCESKPLPPSPPSSLSSSATTVSCSSPSCSA 149

Query: 146 AHGGSLSASHLCAISRCPLESIEISEC--SSFSCPPFYYAYGDGSLVARLYRDSLSLPTP 205
           AH  SL +S LCAIS CPL+ IE  +C  SS+ CPPFYYAYGDGSLVA+LY DSLSLP+ 
Sbjct: 150 AHS-SLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSLVAKLYSDSLSLPS- 209

Query: 206 APSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSF 265
                ++V NFTFGCAHTTL EP+GVAGFGRG LS+P+QLA  SP LGN FSYCLVSHSF
Sbjct: 210 -----VSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSF 269

Query: 266 AADRVRRPSPLILGRYY------TGET--------------EFIYTSLLENPKHPYFYSV 325
            +DRVRRPSPLILGR+        G T              EF++T +LENPKHPYFYSV
Sbjct: 270 DSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPKHPYFYSV 329

Query: 326 GLAGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVA 385
            L GIS+G   IPAP  L ++D+ G GGVVVDSGTTFTMLPA  Y SVV EF++R G+V 
Sbjct: 330 SLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVH 389

Query: 386 NRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRK 445
            RA R+E ++G+SPCYY   +V VP +VLHF G +S+V LPR+NYFYEF+DGGDG    K
Sbjct: 390 ERADRVEPSSGMSPCYYLNQTVKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDG-KEEK 449

Query: 446 RKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNL 477
           RK+GCLMLMNGGDE+EL GG GA LGNYQQQGFEVVYDL   RVGFA+R+C++LWD+L
Sbjct: 450 RKIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCASLWDSL 498
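
The percentages on each HSP summary line are simply the matched-column counts divided by the alignment length. A small sketch for recomputing them from the line text follows; `hsp_percentages` is an illustrative name, and the regular expression only relies on the `n/m` fractions, so it does not depend on how the field labels are spelled.

```python
import re


def hsp_percentages(stats_line):
    """Return the percentage for every 'n/m' fraction found in an HSP stats line."""
    fractions = re.findall(r"(\d+)/(\d+)", stats_line)
    return [round(100 * int(num) / int(den), 2) for num, den in fractions]


line = "Identity = 298/478 (62.34%), Positives = 356/478 (74.48%), Query Frame = 1"
identity_pct, positives_pct = hsp_percentages(line)
print(identity_pct, positives_pct)
```

For the ASP63_ARATH hit this reproduces the reported 62.34% identity and 74.48% positives over 478 aligned columns.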

BLAST of CSPI03G02720 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 152.5 bits (384), Expect = 1.2e-35
Identity = 131/409 (32.03%), Positives = 182/409 (44.50%), Query Frame = 1

Query: 70  LSPG-GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIANN 129
           LS G G+Y     +G+ +  + + +DTGSD+VW  C+P  C  C  +       P     
Sbjct: 135 LSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAP--CRRCYSQSD-----PIFDPR 194

Query: 130 KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSL-VAR 189
           KS + +   CS+ H   L ++      +  L  +               +YGDGS  V  
Sbjct: 195 KSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQV---------------SYGDGSFTVGD 254

Query: 190 LYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGF---GRGVLSMPSQLATFSPQ 249
              ++L+           V+    GC H   G  VG AG    G+G LS P Q      +
Sbjct: 255 FSTETLTFRRN------RVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGH---R 314

Query: 250 LGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGLAGIS 309
              +FSYCLV  S ++    +PS ++ G          +T LL NPK   FY VGL GIS
Sbjct: 315 FNQKFSYCLVDRSASS----KPSSVVFGNAAVSRIAR-FTPLLSNPKLDTFYYVGLLGIS 374

Query: 310 VGNVRIPA-PEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANRARR 369
           VG  R+P     L K+D+ G+GGV++DSGT+ T L    Y ++   F  R G  A   +R
Sbjct: 375 VGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAF--RVG--AKTLKR 434

Query: 370 IEENTGLSPCYYYEN--SVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRKV 429
             + +    C+   N   V VP VVLHF G  ++V LP  NY       G         +
Sbjct: 435 APDFSLFDTCFDLSNMNEVKVPTVVLHFRG--ADVSLPATNYLIPVDTNGKFCFAFAGTM 485

Query: 430 GCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCS 471
           G L +                +GN QQQGF VVYDL  +RVGFA   C+
Sbjct: 495 GGLSI----------------IGNIQQQGFRVVYDLASSRVGFAPGGCA 485

BLAST of CSPI03G02720 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 145.6 bits (366), Expect = 1.4e-33
Identity = 120/407 (29.48%), Positives = 183/407 (44.96%), Query Frame = 1

Query: 74  GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIANNKSVSC 133
           G+Y ++ ++G+ +   S  MDTGSDL+W  C P  C  C          P      S S 
Sbjct: 93  GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQP--CTQC-----FNQSTPIFNPQGSSSF 152

Query: 134 SAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLV-ARLYRDS 193
           S   CS         S LC       +++    CS+  C  + Y YGDGS     +  ++
Sbjct: 153 STLPCS---------SQLC-------QALSSPTCSNNFCQ-YTYGYGDGSETQGSMGTET 212

Query: 194 LSLPTPAPSPPINVRNFTFGCAHTTLG----EPVGVAGFGRGVLSMPSQLATFSPQLGNR 253
           L+  +      +++ N TFGC     G       G+ G GRG LS+PSQL         +
Sbjct: 213 LTFGS------VSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDV------TK 272

Query: 254 FSYCLVSHSFAADRVRRPSPLILGRYYTGETEFI-YTSLLENPKHPYFYSVGLAGISVGN 313
           FSYC+     +      PS L+LG      T     T+L+++ + P FY + L G+SVG+
Sbjct: 273 FSYCMTPIGSST-----PSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGS 332

Query: 314 VRIPA-PEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTG-KVANRARRIE 373
            R+P  P         G+GG+++DSGTT T      Y+SV  EF ++    V N +    
Sbjct: 333 TRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGS---- 392

Query: 374 ENTGLSPCYYY---ENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRKVG 433
            ++G   C+      +++ +P  V+HF G   ++ LP +NY   F+   +G++       
Sbjct: 393 -SSGFDLCFQTPSDPSNLQIPTFVMHFDG--GDLELPSENY---FISPSNGLI------- 434

Query: 434 CLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQC 470
           CL + +      +        GN QQQ   VVYD   + V FA  QC
Sbjct: 453 CLAMGSSSQGMSI-------FGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CSPI03G02720 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 145.2 bits (365), Expect = 1.8e-33
Identity = 125/421 (29.69%), Positives = 173/421 (41.09%), Query Frame = 1

Query: 60  RHRHNHLSLPLSPG-----GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEG 119
           R++   L+ P+  G     G+Y     +G+ + ++ L +DTGSD+ W  C P  C  C  
Sbjct: 141 RYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEP--CADCYQ 200

Query: 120 KPKIQSPLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPP 179
           +          +  KS++CSA  CS                      +E S C S  C  
Sbjct: 201 QSDPVFNPTSSSTYKSLTCSAPQCSL---------------------LETSACRSNKCL- 260

Query: 180 FYYAYGDGSL-VARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGF---GRG 239
           +  +YGDGS  V  L  D+++           + N   GC H   G   G AG    G G
Sbjct: 261 YQVSYGDGSFTVGELATDTVTFGNSG-----KINNVALGCGHDNEGLFTGAAGLLGLGGG 320

Query: 240 VLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENP 299
           VLS+ +Q+   S      FSYCLV          + S L       G  +     LL N 
Sbjct: 321 VLSITNQMKATS------FSYCLVDRDSG-----KSSSLDFNSVQLGGGDAT-APLLRNK 380

Query: 300 KHPYFYSVGLAGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEF 359
           K   FY VGL+G SVG  ++  P+ +  VD  GSGGV++D GT  T L    Y S+   F
Sbjct: 381 KIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAF 440

Query: 360 ENRTGKVANRARRIEENTGLSPCYYYE--NSVGVPRVVLHFVGEKSNVVLPRKNYFYEFL 419
              T    N  +     +    CY +   ++V VP V  HF G KS + LP KNY     
Sbjct: 441 LKLT---VNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKS-LDLPAKNYLIPVD 500

Query: 420 DGGDGVVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQ 470
           D G           C           +       +GN QQQG  + YDL KN +G +  +
Sbjct: 501 DSG---------TFCFAFAPTSSSLSI-------IGNVQQQGTRITYDLSKNVIGLSGNK 500

BLAST of CSPI03G02720 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 144.8 bits (364), Expect = 2.4e-33
Identity = 119/405 (29.38%), Positives = 179/405 (44.20%), Query Frame = 1

Query: 74  GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIANNKSVSC 133
           G+Y ++  +G+     S  MDTGSDL+W  C P  C  C        P P      S S 
Sbjct: 94  GEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEP--CTQC-----FSQPTPIFNPQDSSSF 153

Query: 134 SAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLV-ARLYRDS 193
           S   C + +   L           P E+   +EC       + Y YGDGS     +  ++
Sbjct: 154 STLPCESQYCQDL-----------PSETCNNNECQ------YTYGYGDGSTTQGYMATET 213

Query: 194 LSLPTPAPSPPINVRNFTFGCAHTTLG----EPVGVAGFGRGVLSMPSQLATFSPQLGNR 253
            +  T +      V N  FGC     G       G+ G G G LS+PSQL         +
Sbjct: 214 FTFETSS------VPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGV------GQ 273

Query: 254 FSYCLVSHSFAADRVRRPSPLILGRYYTGETEFI-YTSLLENPKHPYFYSVGLAGISVGN 313
           FSYC+ S+  ++     PS L LG   +G  E    T+L+ +  +P +Y + L GI+VG 
Sbjct: 274 FSYCMTSYGSSS-----PSTLALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGG 333

Query: 314 VRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANRARRIEEN 373
             +  P    ++ + G+GG+++DSGTT T LP   Y +V   F ++     N     E +
Sbjct: 334 DNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQ----INLPTVDESS 393

Query: 374 TGLSPCYYYE---NSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRKVGCL 433
           +GLS C+      ++V VP + + F G   N  L  +N     +   +GV+       CL
Sbjct: 394 SGLSTCFQQPSDGSTVQVPEISMQFDGGVLN--LGEQNI---LISPAEGVI-------CL 435

Query: 434 MLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQC 470
            +   G  ++L     +  GN QQQ  +V+YDL+   V F   QC
Sbjct: 454 AM---GSSSQLG---ISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of CSPI03G02720 vs. TrEMBL
Match: A0A0A0L5I7_CUCSA (Pepsin A OS=Cucumis sativus GN=Csa_3G020060 PE=3 SV=1)

HSP 1 Score: 972.6 bits (2513), Expect = 1.7e-280
Identity = 479/480 (99.79%), Positives = 480/480 (100.00%), Query Frame = 1

Query: 1   MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHR 60
           MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHR
Sbjct: 1   MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHR 60

Query: 61  HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS 120
           HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS
Sbjct: 61  HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS 120

Query: 121 PLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG 180
           PLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG
Sbjct: 121 PLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG 180

Query: 181 DGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLAT 240
           DGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLAT
Sbjct: 181 DGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLAT 240

Query: 241 FSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGL 300
           FSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGL
Sbjct: 241 FSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGL 300

Query: 301 AGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANR 360
           AGISVGN+RIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANR
Sbjct: 301 AGISVGNIRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANR 360

Query: 361 ARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRK 420
           ARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRK
Sbjct: 361 ARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRK 420

Query: 421 VGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRSK 480
           VGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRSK
Sbjct: 421 VGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRSK 480

BLAST of CSPI03G02720 vs. TrEMBL
Match: B9GYA7_POPTR (Aspartyl protease family protein OS=Populus trichocarpa GN=POPTR_0003s07390g PE=3 SV=1)

HSP 1 Score: 648.7 bits (1672), Expect = 5.7e-183
Identity = 337/496 (67.94%), Positives = 388/496 (78.23%), Query Frame = 1

Query: 10  LLCFLLSSP---VFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHRHRHN-- 69
           LLCF+L      + +SQ   LPL HSLS +   F +TH+LLKST+TRS+ RFH H HN  
Sbjct: 8   LLCFILCFTHIFISTSQTLFLPLIHSLSKT--QFTSTHHLLKSTSTRSTTRFHHHHHNKN 67

Query: 70  -----HLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPK-- 129
                 +SLPLSPG DYTLSF + S+   ISLY+DTGSDLVWFPC PFECILCEGK +  
Sbjct: 68  SHNHRQVSLPLSPGSDYTLSFTINSQ--PISLYLDTGSDLVWFPCQPFECILCEGKAENA 127

Query: 130 --IQSPLPKIANNKS-VSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPP 189
               +P PK++   + VSC ++ACSA H  +L +S LCAIS CPLESIEIS+C   SCP 
Sbjct: 128 SLASTPPPKLSKTATPVSCKSSACSAVH-SNLPSSDLCAISNCPLESIEISDCRKHSCPQ 187

Query: 190 FYYAYGDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSM 249
           FYYAYGDGSL+ARLYRDS+ LP    +  I   NFTFGCAHTTL EP+GVAGFGRGVLS+
Sbjct: 188 FYYAYGDGSLIARLYRDSIRLPLSNQTNLI-FNNFTFGCAHTTLAEPIGVAGFGRGVLSL 247

Query: 250 PSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETE----------FIYT 309
           P+QLAT SPQLGN+FSYCLVSHSF +DRVRRPSPLILGRY   E E          F+YT
Sbjct: 248 PAQLATLSPQLGNQFSYCLVSHSFDSDRVRRPSPLILGRYDHDEKERRVNGVKKPSFVYT 307

Query: 310 SLLENPKHPYFYSVGLAGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYE 369
           S+L+NP+HPYFY VGL GIS+G  +IPAP+FL KVD  GSGGVVVDSGTTFTMLPA LY+
Sbjct: 308 SMLDNPRHPYFYCVGLEGISIGRKKIPAPDFLRKVDRKGSGGVVVDSGTTFTMLPASLYD 367

Query: 370 SVVAEFENRTGKVANRARRIEENTGLSPCYYYENS-VGVPRVVLHFVGEKSNVVLPRKNY 429
            VVAEFENR G+V  RA  IEENTGLSPCYY++N+ V VPRVVLHFVG  S+VVLPR+NY
Sbjct: 368 FVVAEFENRVGRVNERASVIEENTGLSPCYYFDNNVVNVPRVVLHFVGNGSSVVLPRRNY 427

Query: 430 FYEFLDGGDGVVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVG 480
           FYEFLDGG G  G+KRKVGCLMLMNGGDEAEL+GGPGATLGNYQQQGFEVVYDLE  RVG
Sbjct: 428 FYEFLDGGHG-KGKKRKVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLENRRVG 487

BLAST of CSPI03G02720 vs. TrEMBL
Match: B9SSF8_RICCO (Pepsin A, putative OS=Ricinus communis GN=RCOM_1061010 PE=3 SV=1)

HSP 1 Score: 639.4 bits (1648), Expect = 3.5e-180
Identity = 324/489 (66.26%), Positives = 391/489 (79.96%), Query Frame = 1

Query: 9   FLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARF-HRHRHNHL- 68
           F+LCF   S V  S+I  LPL+HSLS++   F +TH+LLKST++RS++RF H+H+  HL 
Sbjct: 11  FILCFSCIS-VSISEILYLPLTHSLSNT--QFTSTHHLLKSTSSRSASRFQHQHQKRHLR 70

Query: 69  -----SLPLSPGGDYTLSFNLGSESHK-ISLYMDTGSDLVWFPCSPFECILCEGKPK--- 128
                SLPLSPG DYTLSF L S   + +SLY+DTGSDLVWFPC PFECILCEGK +   
Sbjct: 71  NRHQVSLPLSPGSDYTLSFTLNSNPPQHVSLYLDTGSDLVWFPCKPFECILCEGKAENTT 130

Query: 129 IQSPLPKIANN-KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFY 188
             +P P++++  +SV C ++ACSAAH  +L  S LCAI+ CPLESIE S+C SFSCP FY
Sbjct: 131 ASTPPPRLSSTARSVHCKSSACSAAHS-NLPTSDLCAIADCPLESIETSDCHSFSCPSFY 190

Query: 189 YAYGDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPS 248
           YAYGDGSLVARLY DS+ LP   PS  +++ NFTFGCAHT L EPVGVAGFGRGVLS+P+
Sbjct: 191 YAYGDGSLVARLYHDSIKLPLATPS--LSLHNFTFGCAHTALAEPVGVAGFGRGVLSLPA 250

Query: 249 QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILG-------RYYTGETEFIYTSLLEN 308
           QLA+F+PQLGNRFSYCLVSHSF +DR+R PSPLILG       R    + +F+YTS+L+N
Sbjct: 251 QLASFAPQLGNRFSYCLVSHSFNSDRLRLPSPLILGHSDDKEKRVNKDDVQFVYTSMLDN 310

Query: 309 PKHPYFYSVGLAGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAE 368
           PKHPYFY VGL GIS+G  +IPAPEFL +VD  GSGGVVVDSGTTFTMLPA LY SVVAE
Sbjct: 311 PKHPYFYCVGLEGISIGKKKIPAPEFLKRVDREGSGGVVVDSGTTFTMLPASLYNSVVAE 370

Query: 369 FENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLD 428
           F+NR G+V  RA+ +E+ TGL PCYYY+  V +P +VLHFVG +S+VVLP+KNYFY+FLD
Sbjct: 371 FDNRVGRVYERAKEVEDKTGLGPCYYYDTVVNIPSLVLHFVGNESSVVLPKKNYFYDFLD 430

Query: 429 GGDGVVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQC 479
           GGDG V RKR+VGCLMLMNGG+EAEL GGPGATLGNYQQ GFEVVYDLE+ RVGFARR+C
Sbjct: 431 GGDG-VRRKRRVGCLMLMNGGEEAELTGGPGATLGNYQQHGFEVVYDLEQRRVGFARRKC 490

BLAST of CSPI03G02720 vs. TrEMBL
Match: B9NGC6_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s15870g PE=3 SV=1)

HSP 1 Score: 639.4 bits (1648), Expect = 3.5e-180
Identity = 339/498 (68.07%), Positives = 387/498 (77.71%), Query Frame = 1

Query: 7   FIFLLCFLLSSPVF---SSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHRHRH 66
           +  LLCF L    F   +SQ   LPL+HSLS +   F +TH+L+KST+T S  RF RH H
Sbjct: 5   YSLLLCFSLCFSHFFISTSQTLFLPLTHSLSKT--QFTSTHHLIKSTSTSSITRFRRHHH 64

Query: 67  -----NH--LSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKP 126
                NH  +SLPLSPG DYTLSF L  +S  I LY+DTGSDLVWFPC PFECILCEGK 
Sbjct: 65  QKNTHNHRQVSLPLSPGSDYTLSFTL--DSQPIFLYLDTGSDLVWFPCQPFECILCEGKA 124

Query: 127 KIQS----PLPKIANNKS-VSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFS 186
           +  S    P PK++   + VSC ++ACSAAH  +L +S LCAIS CPLESIE S+C   S
Sbjct: 125 ENTSLASTPPPKLSKTATPVSCKSSACSAAHS-NLPSSDLCAISNCPLESIETSDCQKHS 184

Query: 187 CPPFYYAYGDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGV 246
           CP FYYAYGDGSL+ARLYRDS+SLP   P+  I V NFTFGCAHT L EP+GVAGFGRGV
Sbjct: 185 CPQFYYAYGDGSLIARLYRDSISLPLSNPTNLI-VNNFTFGCAHTALAEPIGVAGFGRGV 244

Query: 247 LSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETE----------F 306
           LS+P+QLAT SPQLGN+FSYCLVSHSF +DR+RRPSPLILGRY   E E          F
Sbjct: 245 LSLPAQLATLSPQLGNQFSYCLVSHSFDSDRLRRPSPLILGRYDHDEKERRVNGVNKPRF 304

Query: 307 IYTSLLENPKHPYFYSVGLAGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAG 366
           +YTS+L+N +HPYFY VGL GIS+G  +IPAP FL KVD  GSGG+VVDSGTTFTMLPA 
Sbjct: 305 VYTSMLDNLEHPYFYCVGLEGISIGRKKIPAPGFLRKVDGEGSGGLVVDSGTTFTMLPAS 364

Query: 367 LYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENS-VGVPRVVLHFVGEKSNVVLPR 426
           LY SVVAEFENR G+V  RAR IEE+TGLSPCYY++N+ V VP VVLHFVG  S+VVLPR
Sbjct: 365 LYGSVVAEFENRVGRVNERARVIEEDTGLSPCYYFDNNVVNVPSVVLHFVGNGSSVVLPR 424

Query: 427 KNYFYEFLDGGDGVVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKN 479
           +NYFYEFLDGGDG  G+KRKVGCLMLMNGGDEAEL+GGPGATLGNYQQQGFEVVYDLE  
Sbjct: 425 RNYFYEFLDGGDG-KGKKRKVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLENK 484

BLAST of CSPI03G02720 vs. TrEMBL
Match: G7JW26_MEDTR (Eukaryotic aspartyl protease family protein OS=Medicago truncatula GN=MTR_5g012490 PE=3 SV=1)

HSP 1 Score: 634.0 bits (1634), Expect = 1.5e-178
Identity = 323/490 (65.92%), Positives = 377/490 (76.94%), Query Frame = 1

Query: 4   SPVFIFLLCFLLS-SPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHRHR 63
           SP+F+ LLCF+L  SP  SSQ  LLPL+HS+S +   FN+TH+LLKST+TRS ARFH   
Sbjct: 3   SPIFLVLLCFILCFSP--SSQTILLPLTHSISKT--KFNSTHHLLKSTSTRSKARFHHQH 62

Query: 64  HNH---LSLPLSPGGDYTLSFNLGSESHK-ISLYMDTGSDLVWFPCSPFECILCEGKPKI 123
           H H   +SLPL+PG DYTLSFNLGS   + I+LYMDTGSDLVWFPCSPFECILCEGKP+ 
Sbjct: 63  HKHQTQVSLPLAPGSDYTLSFNLGSNPPQLITLYMDTGSDLVWFPCSPFECILCEGKPQT 122

Query: 124 QSPLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYA 183
             P        SVSC + ACSAAH  S+S+S+LCAISRCPL+ IE S+CSSFSCPPFYYA
Sbjct: 123 TKPANITKQTHSVSCQSPACSAAHA-SMSSSNLCAISRCPLDYIETSDCSSFSCPPFYYA 182

Query: 184 YGDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQL 243
           YGDGS VA LY+ +LSL +      ++++NFTFGCAHT L EP GVAGFGRG+LS+P+QL
Sbjct: 183 YGDGSFVANLYQQTLSLSS------LHLQNFTFGCAHTALAEPTGVAGFGRGILSLPAQL 242

Query: 244 ATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY---YTG-----ETEFIYTSLLENP 303
           +T SP LGNRFSYCLVSHSF  DR+RRPSPLILGR+    TG       EF+YTS+L NP
Sbjct: 243 STLSPHLGNRFSYCLVSHSFDGDRLRRPSPLILGRHNDTITGAGDGESVEFVYTSMLSNP 302

Query: 304 KHPYFYSVGLAGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEF 363
           KHPY+Y VGLAGISVG   +PAPE L +VDE G+GG+VVDSGTTFTMLP   Y +VV EF
Sbjct: 303 KHPYYYCVGLAGISVGKRTVPAPEILKRVDEKGNGGMVVDSGTTFTMLPESFYNAVVNEF 362

Query: 364 ENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDG 423
           + R  +   RA  IE  TGL PCYY      +P + LHFVG  S+VVLPRKNYFYEF+DG
Sbjct: 363 DKRVNRFHKRASEIETKTGLGPCYYLNGLSQIPVLKLHFVGNNSDVVLPRKNYFYEFMDG 422

Query: 424 GDGVVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCS 481
           GDG + RK KVGC+MLMNG DE EL GGPGATLGNYQQQGFEVVYDLEK RVGFA+++C+
Sbjct: 423 GDG-IRRKGKVGCMMLMNGEDETELDGGPGATLGNYQQQGFEVVYDLEKERVGFAKKECA 480

BLAST of CSPI03G02720 vs. TAIR10
Match: AT4G16563.1 (AT4G16563.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 581.6 bits (1498), Expect = 4.3e-166
Identity = 298/478 (62.34%), Positives = 356/478 (74.48%), Query Frame = 1

Query: 26  LLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHRHRHNH----LSLPLSPGGDYTLSFN 85
           LL LSHSLS+S    +  H LLKS+++RSSARF RH H      LSLP+S G DY +S +
Sbjct: 30  LLHLSHSLSTSKHSSSPLH-LLKSSSSRSSARFRRHHHKQQQQQLSLPISSGSDYLISLS 89

Query: 86  LGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIANNKS-VSCSAAACSA 145
           +GS S  +SLY+DTGSDLVWFPC PF CILCE KP   SP   ++++ + VSCS+ +CSA
Sbjct: 90  VGSSSSAVSLYLDTGSDLVWFPCRPFTCILCESKPLPPSPPSSLSSSATTVSCSSPSCSA 149

Query: 146 AHGGSLSASHLCAISRCPLESIEISEC--SSFSCPPFYYAYGDGSLVARLYRDSLSLPTP 205
           AH  SL +S LCAIS CPL+ IE  +C  SS+ CPPFYYAYGDGSLVA+LY DSLSLP+ 
Sbjct: 150 AHS-SLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSLVAKLYSDSLSLPS- 209

Query: 206 APSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSF 265
                ++V NFTFGCAHTTL EP+GVAGFGRG LS+P+QLA  SP LGN FSYCLVSHSF
Sbjct: 210 -----VSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSF 269

Query: 266 AADRVRRPSPLILGRYY------TGET--------------EFIYTSLLENPKHPYFYSV 325
            +DRVRRPSPLILGR+        G T              EF++T +LENPKHPYFYSV
Sbjct: 270 DSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPKHPYFYSV 329

Query: 326 GLAGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVA 385
            L GIS+G   IPAP  L ++D+ G GGVVVDSGTTFTMLPA  Y SVV EF++R G+V 
Sbjct: 330 SLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVH 389

Query: 386 NRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRK 445
            RA R+E ++G+SPCYY   +V VP +VLHF G +S+V LPR+NYFYEF+DGGDG    K
Sbjct: 390 ERADRVEPSSGMSPCYYLNQTVKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDG-KEEK 449

Query: 446 RKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNL 477
           RK+GCLMLMNGGDE+EL GG GA LGNYQQQGFEVVYDL   RVGFA+R+C++LWD+L
Sbjct: 450 RKIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCASLWDSL 498

BLAST of CSPI03G02720 vs. TAIR10
Match: AT5G45120.1 (AT5G45120.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 208.0 bits (528), Expect = 1.3e-53
Identity = 144/421 (34.20%), Positives = 203/421 (48.22%), Query Frame = 1

Query: 76  YTLSFNLGSESHKISLYMDTGSDLVWFPCS--PFECILCEG-------KPKIQSPLPKIA 135
           Y ++ N+G+    + +Y+DTGSDL W PC    F+CI C          P + SPL    
Sbjct: 83  YLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNNDLKSPSVFSPLHSST 142

Query: 136 NNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVA 195
           + +  SC+++ C   H    +    CA++ C +  +  S C    CP F Y YG+G L++
Sbjct: 143 SFRD-SCASSFCVEIHSSD-NPFDPCAVAGCSVSMLLKSTCVR-PCPSFAYTYGEGGLIS 202

Query: 196 R-LYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQL 255
             L RD L   T       +V  F+FGC  +T  EP+G+AGFGRG+LS+PSQL      L
Sbjct: 203 GILTRDILKARTR------DVPRFSFGCVTSTYREPIGIAGFGRGLLSLPSQLGF----L 262

Query: 256 GNRFSYCLVSHSFAADRVRRPSPLILGRYYTG---ETEFIYTSLLENPKHPYFYSVGLAG 315
              FS+C +   F  +     SPLILG             +T +L  P +P  Y +GL  
Sbjct: 263 EKGFSHCFLPFKF-VNNPNISSPLILGASALSINLTDSLQFTPMLNTPMYPNSYYIGLES 322

Query: 316 ISVGNVRIP--APEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANR 375
           I++G    P   P  L + D  G+GG++VDSGTT+T LP   Y  ++   ++       R
Sbjct: 323 ITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLTTLQSTI--TYPR 382

Query: 376 ARRIEENTGLSPCY----------YYENSVGV--PRVVLHFVGEKSNVVLPRKNYFYEFL 435
           A   E  TG   CY            EN V +  P +  HF+   + ++LP+ N FY   
Sbjct: 383 ATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFL-NNATLLLPQGNSFYAMS 442

Query: 436 DGGDGVVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQ 470
              DG V     V CL+  N  D      GP    G++QQQ  +VVYDLEK R+GF    
Sbjct: 443 APSDGSV-----VQCLLFQNMEDGDY---GPAGVFGSFQQQNVKVVYDLEKERIGFQAMD 478

BLAST of CSPI03G02720 vs. TAIR10
Match: AT3G52500.1 (AT3G52500.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 176.8 bits (447), Expect = 3.2e-44
Identity = 151/505 (29.90%), Positives = 226/505 (44.75%), Query Frame = 1

Query: 4   SPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHRHRH 63
           S +F F L FL  S V + ++ L P SHS  S    + +   L +S    S AR H+ +H
Sbjct: 3   SSIFFFFLIFL--SVVSAVKLPLSPFSHSDQSPKDPYLSLRRLAES----SIARAHKLKH 62

Query: 64  NH-------------------LSLPLSPG--GDYTLSFNLGSESHKISLYMDTGSDLVWF 123
                                +  PLS    G Y++S + G+ S  I    DTGS LVW 
Sbjct: 63  GTSIKPDEDALSSTTTASATVVKSPLSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWL 122

Query: 124 PC-SPFECILCEGKPKIQSPLPKI-----ANNKSVSCSAAACSAAHGGSLSASHLCAISR 183
           PC S + C  C+      + +P+      +++K + C +  C   +G ++        +R
Sbjct: 123 PCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSKIIGCQSPKCQFLYGPNVQCRGCDPNTR 182

Query: 184 CPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHT 243
                     C+   CPP+   YG GS    L  + L  P       + V +F  GC+  
Sbjct: 183 ---------NCT-VGCPPYILQYGLGSTAGVLITEKLDFPD------LTVPDFVVGCSII 242

Query: 244 TLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYT 303
           +  +P G+AGFGRG +S+PSQ+         RFS+CLVS  F    V     L  G  + 
Sbjct: 243 STRQPAGIAGFGRGPVSLPSQMNL------KRFSHCLVSRRFDDTNVTTDLDLDTGSGHN 302

Query: 304 GETE---FIYTSLLENPKHPY-----FYSVGLAGISVGNVRIPAPEFLTKVDEGGSGGVV 363
             ++     YT   +NP         +Y + L  I VG   +  P         G GG +
Sbjct: 303 SGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVKIPYKYLAPGTNGDGGSI 362

Query: 364 VDSGTTFTMLPAGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYY--ENSVGVPRVV 423
           VDSG+TFT +   ++E V  EF ++      R + +E+ TGL PC+    +  V VP ++
Sbjct: 363 VDSGSTFTFMERPVFELVAEEFASQMSNYT-REKDLEKETGLGPCFNISGKGDVTVPELI 422

Query: 424 LHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRKVGCLMLMNGGDEAELAG-GPGATLGN 471
             F G  + + LP  NYF  F+   D V        CL +++        G GP   LG+
Sbjct: 423 FEFKGG-AKLELPLSNYF-TFVGNTDTV--------CLTVVSDKTVNPSGGTGPAIILGS 468

BLAST of CSPI03G02720 vs. TAIR10
Match: AT3G61820.1 (AT3G61820.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 153.7 bits (387), Expect = 2.9e-37
Identity = 129/407 (31.70%), Positives = 173/407 (42.51%), Query Frame = 1

Query: 70  LSPG-GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIANN 129
           LS G G+Y +   +G+ +  + + +DTGSD+VW  CSP  C  C  +        K    
Sbjct: 128 LSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSP--CKACYNQTDAIFDPKKSKTF 187

Query: 130 KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARL 189
            +V C +  C       L  S  C   R      ++S             YGDGS     
Sbjct: 188 ATVPCGSRLCRR-----LDDSSECVTRRSKTCLYQVS-------------YGDGSFTEGD 247

Query: 190 YRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGF---GRGVLSMPSQLATFSPQL 249
           +         A      V +   GC H   G  VG AG    GRG LS PSQ      + 
Sbjct: 248 FSTETLTFHGA-----RVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKN---RY 307

Query: 250 GNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGLAGISV 309
             +FSYCLV  + +    + PS ++ G     +T  ++T LL NPK   FY + L GISV
Sbjct: 308 NGKFSYCLVDRTSSGSSSKPPSTIVFGNAAVPKTS-VFTPLLTNPKLDTFYYLQLLGISV 367

Query: 310 GNVRIPA-PEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANRARRI 369
           G  R+P   E   K+D  G+GGV++DSGT+ T L    Y ++   F  R G  A + +R 
Sbjct: 368 GGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAF--RLG--ATKLKRA 427

Query: 370 EENTGLSPCYYYEN--SVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRKVG 429
              +    C+      +V VP VV HF G    V LP  NY       G         +G
Sbjct: 428 PSYSLFDTCFDLSGMTTVKVPTVVFHFGG--GEVSLPASNYLIPVNTEGRFCFAFAGTMG 483

Query: 430 CLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQC 470
            L +                +GN QQQGF V YDL  +RVGF  R C
Sbjct: 488 SLSI----------------IGNIQQQGFRVAYDLVGSRVGFLSRAC 483

BLAST of CSPI03G02720 vs. TAIR10
Match: AT1G01300.1 (AT1G01300.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 152.5 bits (384), Expect = 6.5e-37
Identity = 131/409 (32.03%), Positives = 182/409 (44.50%), Query Frame = 1

Query: 70  LSPG-GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIANN 129
           LS G G+Y     +G+ +  + + +DTGSD+VW  C+P  C  C  +       P     
Sbjct: 135 LSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAP--CRRCYSQSD-----PIFDPR 194

Query: 130 KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSL-VAR 189
           KS + +   CS+ H   L ++      +  L  +               +YGDGS  V  
Sbjct: 195 KSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQV---------------SYGDGSFTVGD 254

Query: 190 LYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGF---GRGVLSMPSQLATFSPQ 249
              ++L+           V+    GC H   G  VG AG    G+G LS P Q      +
Sbjct: 255 FSTETLTFRRN------RVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGH---R 314

Query: 250 LGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGLAGIS 309
              +FSYCLV  S ++    +PS ++ G          +T LL NPK   FY VGL GIS
Sbjct: 315 FNQKFSYCLVDRSASS----KPSSVVFGNAAVSRIAR-FTPLLSNPKLDTFYYVGLLGIS 374

Query: 310 VGNVRIPA-PEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANRARR 369
           VG  R+P     L K+D+ G+GGV++DSGT+ T L    Y ++   F  R G  A   +R
Sbjct: 375 VGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAF--RVG--AKTLKR 434

Query: 370 IEENTGLSPCYYYEN--SVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRKV 429
             + +    C+   N   V VP VVLHF G  ++V LP  NY       G         +
Sbjct: 435 APDFSLFDTCFDLSNMNEVKVPTVVLHFRG--ADVSLPATNYLIPVDTNGKFCFAFAGTM 485

Query: 430 GCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCS 471
           G L +                +GN QQQGF VVYDL  +RVGFA   C+
Sbjct: 495 GGLSI----------------IGNIQQQGFRVVYDLASSRVGFAPGGCA 485
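
The percentages in each HSP summary line follow from the counts printed next to them: matches divided by alignment length. A minimal Python sketch of that arithmetic is given here, using the AT1G01300.1 summary line as input. The function names `parse_hsp` and `percent` are hypothetical helpers for illustration and are not part of any BLAST tool.

```python
import re

# Pattern for the HSP summary line as it is printed on this page.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, length):
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


def parse_hsp(line):
    """Extract the raw counts and the reported percentages from one HSP line."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: " + line)
    ident, ident_len, ident_pct, pos, _pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


hsp = parse_hsp(
    "Identity = 131/409 (32.03%), Postives = 182/409 (44.50%), Query Frame = 1"
)
# Recompute each percentage from the counts and print it beside the reported value.
print(percent(hsp["identities"], hsp["aligned_length"]), hsp["identity_pct"])
print(percent(hsp["positives"], hsp["aligned_length"]), hsp["positive_pct"])
```

The denominator is the alignment length including gap columns, which is why it can differ from the length of the query span.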

BLAST of CSPI03G02720 vs. NCBI nr
Match: gi|449458942|ref|XP_004147205.1| (PREDICTED: aspartic proteinase nepenthesin-1 [Cucumis sativus])

HSP 1 Score: 972.6 bits (2513), Expect = 2.5e-280
Identity = 479/480 (99.79%), Positives = 480/480 (100.00%), Query Frame = 1

Query: 1   MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHR 60
           MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHR
Sbjct: 1   MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHR 60

Query: 61  HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS 120
           HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS
Sbjct: 61  HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS 120

Query: 121 PLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG 180
           PLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG
Sbjct: 121 PLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG 180

Query: 181 DGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLAT 240
           DGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLAT
Sbjct: 181 DGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLAT 240

Query: 241 FSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGL 300
           FSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGL
Sbjct: 241 FSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGL 300

Query: 301 AGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANR 360
           AGISVGN+RIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANR
Sbjct: 301 AGISVGNIRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANR 360

Query: 361 ARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRK 420
           ARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRK
Sbjct: 361 ARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRK 420

Query: 421 VGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRSK 480
           VGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRSK
Sbjct: 421 VGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRSK 480

BLAST of CSPI03G02720 vs. NCBI nr
Match: gi|659095959|ref|XP_008448851.1| (PREDICTED: aspartic proteinase nepenthesin-1 [Cucumis melo])

HSP 1 Score: 953.4 bits (2463), Expect = 1.6e-274
Identity = 472/481 (98.13%), Positives = 477/481 (99.17%), Query Frame = 1

Query: 1   MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHR 60
           MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFN+THNLLKSTATRSSARFHR
Sbjct: 1   MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHR 60

Query: 61  HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS 120
           HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS
Sbjct: 61  HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQS 120

Query: 121 PLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG 180
           PLPKI+NNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG
Sbjct: 121 PLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYG 180

Query: 181 DGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLAT 240
           DGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLAT
Sbjct: 181 DGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLAT 240

Query: 241 FSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGL 300
           FSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY+TGETEFIYTSLLENPKHPYFYSVGL
Sbjct: 241 FSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSVGL 300

Query: 301 AGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANR 360
           AGISVGNVRIPAPEFL KVDE GSGGVVVDSGTTFTMLP+GLYESVVAEFENRTGKVANR
Sbjct: 301 AGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANR 360

Query: 361 ARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGV--VGRK 420
           ARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKS+VVLPRKNYFYEFLDGGDGV  VGRK
Sbjct: 361 ARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEVGRK 420

Query: 421 RKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNR 480
           RKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNR
Sbjct: 421 RKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNR 480

BLAST of CSPI03G02720 vs. NCBI nr
Match: gi|470130620|ref|XP_004301201.1| (PREDICTED: aspartic proteinase nepenthesin-2 [Fragaria vesca subsp. vesca])

HSP 1 Score: 655.6 bits (1690), Expect = 6.7e-185
Identity = 331/484 (68.39%), Positives = 383/484 (79.13%), Query Frame = 1

Query: 4   SPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHRHRH 63
           SP+F+ +LCF   S  FS  +FL PL+HSLS +   FN TH+LLK+TATRS+ RFHRHRH
Sbjct: 5   SPLFL-ILCFTYLSVSFSQTLFL-PLTHSLSQT--QFNTTHHLLKATATRSATRFHRHRH 64

Query: 64  N----HLSLPLSPGGDYTLSFNLGSES-HKISLYMDTGSDLVWFPCSPFECILCEGKPKI 123
                 +SLPLSPG DYTLSF LGS     ISLYMDTGSDLVWFPCSPFECILCEGKP  
Sbjct: 65  RKTTQQVSLPLSPGSDYTLSFTLGSSPPQSISLYMDTGSDLVWFPCSPFECILCEGKPNS 124

Query: 124 QSPLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYA 183
             P PKI  N +VSC + +CSAAH  +LS+  LCAI+ CPL+SIE+SECSSF CPPFYYA
Sbjct: 125 TFPPPKIPQNAAVSCDSHSCSAAHS-ALSSRSLCAIANCPLDSIELSECSSFKCPPFYYA 184

Query: 184 YGDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQL 243
           YGDGSL+++L+R SLS+P   PS  + + NFTFGC+H+ LGEP+GVAGFGRG+LS+P+QL
Sbjct: 185 YGDGSLISKLFRYSLSIPMSTPS--LLLPNFTFGCSHSALGEPIGVAGFGRGLLSLPAQL 244

Query: 244 ATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYT----GETEFIYTSLLENPKHPY 303
           A  SP LGN+FSYCLVSHSF  +RV RPSPLILGRY      G  E+ YTS+L NPKHPY
Sbjct: 245 ARSSPHLGNQFSYCLVSHSFDQERVGRPSPLILGRYDQNSAHGADEYTYTSMLYNPKHPY 304

Query: 304 FYSVGLAGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRT 363
           FY VGLAGIS+G   +PAPEFL +VDE G+GGVVVDSGTTFTMLP   Y S+VAEF+ R 
Sbjct: 305 FYCVGLAGISIGKRVVPAPEFLKRVDEKGNGGVVVDSGTTFTMLPQRFYNSLVAEFDRRV 364

Query: 364 GKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGV 423
           G+V  RA ++E+ TGL PCYYY+  + VP V LHFVGEKS+VVLPRKNYFYEF DGGDG 
Sbjct: 365 GRVHKRATQVEDGTGLGPCYYYDGVMEVPAVTLHFVGEKSSVVLPRKNYFYEFTDGGDG- 424

Query: 424 VGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWD 479
            G+KRKVGC MLMNGGDE E  GGPGA  GNYQQQGFEVVYDLEK+RVGFA+RQCS LWD
Sbjct: 425 TGKKRKVGCWMLMNGGDEKESGGGPGAIFGNYQQQGFEVVYDLEKHRVGFAKRQCSLLWD 480

BLAST of CSPI03G02720 vs. NCBI nr
Match: gi|1009157128|ref|XP_015896606.1| (PREDICTED: aspartic proteinase nepenthesin-2 [Ziziphus jujuba])

HSP 1 Score: 649.4 bits (1674), Expect = 4.8e-183
Identity = 332/489 (67.89%), Positives = 390/489 (79.75%), Query Frame = 1

Query: 6   VFIFLLCFLLSS-PVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHR---- 65
           ++  +LCF      V  SQI LLPL+HSLS +   FN+T +LLKSTATRS+ARFHR    
Sbjct: 8   LYYIILCFSFECLSVSYSQILLLPLTHSLSQN--QFNSTQHLLKSTATRSAARFHRSRSD 67

Query: 66  -HRHNHLSLPLSPGGDYTLSFNLGSES-HKISLYMDTGSDLVWFPCSPFECILCEGK--P 125
            +RH+ +SLPLS G DYTLS  +G+     ISLYMDTGSDLVWFPCSPFECILCEGK  P
Sbjct: 68  RNRHSQVSLPLSSGSDYTLSLTVGTNPPQSISLYMDTGSDLVWFPCSPFECILCEGKYDP 127

Query: 126 KIQSPLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFY 185
           K  +   KI  N +VSC + ACSAAH  SLS+S+LCAI+RCPLESIEIS+CSSFSCPPFY
Sbjct: 128 KTTNKPLKIPPNATVSCKSPACSAAHS-SLSSSNLCAIARCPLESIEISDCSSFSCPPFY 187

Query: 186 YAYGDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPS 245
           YAY DGSL+ARL++  LS+P  +PS  + + NFTFGCAH+ LGEP+GVAGFGRG+LS+P+
Sbjct: 188 YAYADGSLIARLHKYRLSIPMSSPS--LVLHNFTFGCAHSALGEPIGVAGFGRGLLSLPA 247

Query: 246 QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGE-------TEFIYTSLLEN 305
           QL++FSPQLGNRFSYCLVSHSF +DRVRRPSPLILGRY   E        +F+YTS+L+N
Sbjct: 248 QLSSFSPQLGNRFSYCLVSHSFDSDRVRRPSPLILGRYEEKEKRVGDDGAQFVYTSMLDN 307

Query: 306 PKHPYFYSVGLAGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAE 365
           PKHPYFYSVGL GISVG   I APEFL  VD  G+GG+VVDSGTTFTMLP+ LY S+VAE
Sbjct: 308 PKHPYFYSVGLVGISVGKKNILAPEFLHGVDATGNGGMVVDSGTTFTMLPSSLYNSLVAE 367

Query: 366 FENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLD 425
           F+ R G+V  RAR IE+ TGLSPCYYY   + +P + LHFVG +S V+LPR+NYFYEFLD
Sbjct: 368 FDQRVGRVHERARDIEDKTGLSPCYYYNKVIQIPNLTLHFVGNESGVLLPRRNYFYEFLD 427

Query: 426 GGDGVVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQC 479
           GGDG  G+KR VGCLMLMNGGDE EL GGPGATLGNYQQQGFEVVYDL K RVGFARR+C
Sbjct: 428 GGDG-SGKKRNVGCLMLMNGGDEKELTGGPGATLGNYQQQGFEVVYDLAKRRVGFARREC 487

BLAST of CSPI03G02720 vs. NCBI nr
Match: gi|743864222|ref|XP_011031864.1| (PREDICTED: aspartic proteinase nepenthesin-1 [Populus euphratica])

HSP 1 Score: 649.0 bits (1673), Expect = 6.3e-183
Identity = 337/499 (67.54%), Positives = 389/499 (77.96%), Query Frame = 1

Query: 10  LLCFLLSSP---VFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHRHRHNH- 69
           LLCF+L      + +SQ   LPL HSLS +   F +TH+LLKST+TRS+ARFH H H+H 
Sbjct: 8   LLCFILCFTHVFISTSQTLFLPLIHSLSKT--QFTSTHHLLKSTSTRSTARFHHHHHHHH 67

Query: 70  ----------LSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGK 129
                     +SLPLSPG DYTLSF + S+   ISLY+DTGSDLVWFPC PFECILCEGK
Sbjct: 68  NNKNSHKHRQVSLPLSPGSDYTLSFTINSQ--PISLYLDTGSDLVWFPCQPFECILCEGK 127

Query: 130 PK----IQSPLPKIANNKS-VSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSF 189
            +      +P PK++   + VSC ++ACSA H  +L +S LCAIS CPLESIEIS+C   
Sbjct: 128 AENASLASTPPPKLSKTATPVSCKSSACSAVH-SNLPSSDLCAISNCPLESIEISDCRKH 187

Query: 190 SCPPFYYAYGDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRG 249
           SCP FYYAYGDGSL+ARLYRDS+ LP    +  I   NFTFGCAHTTL EP+GVAGFGRG
Sbjct: 188 SCPQFYYAYGDGSLIARLYRDSIRLPLSNQTNLI-FNNFTFGCAHTTLAEPIGVAGFGRG 247

Query: 250 VLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETE---------- 309
           VLS+P+QLAT SPQLGN+FSYCLVSHSF +D VRRPSPLILGRY   E E          
Sbjct: 248 VLSLPAQLATLSPQLGNQFSYCLVSHSFDSDGVRRPSPLILGRYDHDEKERRVNGVKKPS 307

Query: 310 FIYTSLLENPKHPYFYSVGLAGISVGNVRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPA 369
           F+YTS+L+NP+HPYFY VGL GIS+G  +IPAP+FL KVD  GSGGVVVDSGTTFTMLPA
Sbjct: 308 FVYTSMLDNPRHPYFYCVGLEGISIGRKKIPAPDFLRKVDGEGSGGVVVDSGTTFTMLPA 367

Query: 370 GLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENS-VGVPRVVLHFVGEKSNVVLP 429
            LY+ +VAEFENR G+V  RA  IEENTGLSPCYY++N+ V VPRVVLHFVG  S+VVLP
Sbjct: 368 SLYDFIVAEFENRVGRVNERASVIEENTGLSPCYYFDNNVVNVPRVVLHFVGNGSSVVLP 427

Query: 430 RKNYFYEFLDGGDGVVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEK 479
           R+NYFYEFLDGGDG  G+KRKVGCLMLMNGGDEAEL+GGPGATLGNYQQQGFEVVYDLE 
Sbjct: 428 RRNYFYEFLDGGDG-KGKKRKVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLEN 487

The following BLAST results are available for this feature:
Match Name    E-value   Identity (%)  Description
ASP63_ARATH   7.7e-165  62.34         Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana GN=At4g16563 PE=2 S...
APF2_ARATH    1.2e-35   32.03         Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1
NEP1_NEPGR    1.4e-33   29.48         Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1
ASPG1_ARATH   1.8e-33   29.69         Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ...
NEP2_NEPGR    2.4e-33   29.38         Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1
Match Name         E-value   Identity (%)  Description
A0A0A0L5I7_CUCSA   1.7e-280  99.79         Pepsin A OS=Cucumis sativus GN=Csa_3G020060 PE=3 SV=1
B9GYA7_POPTR       5.7e-183  67.94         Aspartyl protease family protein OS=Populus trichocarpa GN=POPTR_0003s07390g PE=...
B9SSF8_RICCO       3.5e-180  66.26         Pepsin A, putative OS=Ricinus communis GN=RCOM_1061010 PE=3 SV=1
B9NGC6_POPTR       3.5e-180  68.07         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s15870g PE=3 SV=1
G7JW26_MEDTR       1.5e-178  65.92         Eukaryotic aspartyl protease family protein OS=Medicago truncatula GN=MTR_5g0124...
Match Name    E-value   Identity (%)  Description
AT4G16563.1   4.3e-166  62.34         Eukaryotic aspartyl protease family protein
AT5G45120.1   1.3e-53   34.20         Eukaryotic aspartyl protease family protein
AT3G52500.1   3.2e-44   29.90         Eukaryotic aspartyl protease family protein
AT3G61820.1   2.9e-37   31.70         Eukaryotic aspartyl protease family protein
AT1G01300.1   6.5e-37   32.03         Eukaryotic aspartyl protease family protein
Match Name                           E-value   Identity (%)  Description
gi|449458942|ref|XP_004147205.1|     2.5e-280  99.79         PREDICTED: aspartic proteinase nepenthesin-1 [Cucumis sativus]
gi|659095959|ref|XP_008448851.1|     1.6e-274  98.13         PREDICTED: aspartic proteinase nepenthesin-1 [Cucumis melo]
gi|470130620|ref|XP_004301201.1|     6.7e-185  68.39         PREDICTED: aspartic proteinase nepenthesin-2 [Fragaria vesca subsp. vesca]
gi|1009157128|ref|XP_015896606.1|    4.8e-183  67.89         PREDICTED: aspartic proteinase nepenthesin-2 [Ziziphus jujuba]
gi|743864222|ref|XP_011031864.1|     6.3e-183  67.54         PREDICTED: aspartic proteinase nepenthesin-1 [Populus euphratica]
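
Hit tables like these are often screened by E-value and percent identity before further analysis. The Python sketch below applies such a screen to the five TAIR10 hits listed above. The cutoffs (`max_evalue=1e-50`, `min_identity=30.0`) and the helper name `filter_hits` are illustrative assumptions, not thresholds used by this database.

```python
# TAIR10 hits copied from the summary table: (match name, E-value, identity %).
TAIR10_HITS = [
    ("AT4G16563.1", 4.3e-166, 62.34),
    ("AT5G45120.1", 1.3e-53, 34.20),
    ("AT3G52500.1", 3.2e-44, 29.90),
    ("AT3G61820.1", 2.9e-37, 31.70),
    ("AT1G01300.1", 6.5e-37, 32.03),
]


def filter_hits(hits, max_evalue=1e-50, min_identity=30.0):
    """Keep hits at or below the E-value cutoff and at or above the identity cutoff.

    Both cutoffs are hypothetical defaults chosen only for illustration.
    """
    return [
        name
        for name, evalue, identity in hits
        if evalue <= max_evalue and identity >= min_identity
    ]


# Rank by E-value (smaller is more significant), then apply the screen.
ranked = sorted(TAIR10_HITS, key=lambda hit: hit[1])
print(filter_hits(ranked))
```

E-values as small as 1e-280 are still representable as Python floats, so no special handling is needed for the values on this page.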
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR001461   Aspartic_peptidase_A1
IPR001969   Aspartic_peptidase_AS
IPR021109   Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0004190  aspartic-type endopeptidase activity
Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
biological_process GO:0030163 protein catabolic process
cellular_component GO:0005575 cellular_component
cellular_component GO:0009505 plant-type cell wall
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI03G02720.1  CSPI03G02720.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                  Source   Source Term       Source Description   Alignment
IPR001461  Aspartic peptidase A1 family     PANTHER  PTHR13683         ASPARTYL PROTEASES   coord: 163..477, score: 6.3E-243; coord: 2..138, score: 6.3E
IPR001969  Aspartic peptidase, active site  PROSITE  PS00141           ASP_PROTEASE         coord: 327..338, scor
IPR021109  Aspartic peptidase domain        GENE3D   G3DSA:2.40.70.10                       coord: 279..473, score: 2.1E-44; coord: 73..272, score: 1.4
IPR021109  Aspartic peptidase domain        unknown  SSF50630          Acid proteases       coord: 71..474, score: 6.28
None       No IPR available                 PANTHER  PTHR13683:SF276   SUBFAMILY NOT NAMED  coord: 2..138, score: 6.3E-243; coord: 163..477, score: 6.3E
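
The `coord: start..end` entries in the Alignment column give the span of each domain match on the protein. The Python sketch below pulls those spans out of a cell and computes their lengths. It assumes the coordinates are 1-based, inclusive amino-acid positions, which the page does not state, and `segment_lengths` is a hypothetical helper name.

```python
import re

# Matches "coord: 163..477" with optional whitespace after the colon.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def segment_lengths(alignment_text):
    """Return (start, end, length) for every coord entry in the text.

    Length is end - start + 1, assuming 1-based inclusive coordinates.
    """
    segments = []
    for start, end in COORD_RE.findall(alignment_text):
        start, end = int(start), int(end)
        segments.append((start, end, end - start + 1))
    return segments


# The PANTHER PTHR13683 match is reported as two segments.
for start, end, length in segment_lengths("coord: 163..477 coord: 2..138"):
    print(start, end, length)
```

Because the pattern only looks at the `coord:` fields, it works whether or not the neighbouring score values are complete.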

The following gene(s) are paralogous to this gene:

None