CSPI03G35440 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI03G35440
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Eukaryotic aspartyl protease family protein
Location: Chr3: 30930376 .. 30932252 (+)
RNA-Seq Expression: CSPI03G35440
Synteny: CSPI03G35440
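The location string in the overview can be split into its parts programmatically. The sketch below is illustrative only: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Split a string such as 'Chr3: 30930376 .. 30932252 (+)'
    into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Chr3: 30930376 .. 30932252 (+)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 1877 bp on the plus strand of Chr3.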
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TACGTTCATTTGCATACCATTTTCTCTTCTCTAAAACCGCAGCTCTGCTTCCAGCTCACCCCTTTGTCCCTTCTTCCTCTCCATTTATACTTCTATTCCCCATTTCTCATCCCTTCTCATCAACATAACTCCACATTCCCTTCATTCCTTTCCAATCCATAATACCATCACACCATTTCGTTTACAACTAGGAGTAACCTTCTTCCATGGCTTCTCCTTCCCCTCTCTCTTTCTTCTACCTCCTCCTCTTCTCCTCTCTTTCCGCCATTGCCCACTCCAATCCCATCACTCTCCCTCTCAACTCTTTTCCCCACCTTTCTTCTCCAGATCCACTCCAGGCTCTCACTTTCCTCGCCTCTTCTTCCCAGACCAGAGCCCATCAAATCAAAACCCCCAAATCCAACTCTGTTTTCAAGTCCCCTCTCTCCCCCCATAGCTATGGAGCTTACTCCACTCCACTCAGCTTTGGTACTCCACAACAGACTCTGCATTTGATCTTCGATACAGGTAGTAGCCTCGTTTGGTTCCCTTGCACTTCCCGATATCTCTGTTCTGAATGTTCCTTCCCCAAAATAGATCCAACTGGAATCCCCAGATTTGTCCCCAAATTGTCCTCCTCTTCTAAGCTTGTCGGTTGCCAGAATCCCAAATGTGCCTGGATTTTTGGTCCCGATGTCAAATCTCAGTGCCGGAGTTGTAACCCCAAAACAGAGAACTGTACCCAAACTTGCCCTGCTTACGTTGTTCAATATGGTTCCGGTTCCACGGCTGGGCTTTTGCTATCAGAAACCCTTGATTTTCCTGATAAAAAAATCCCTAATTTTGTTGTTGGGTGTTCGTTTTTGTCGATCCATCAGCCCTCTGGAATCGCCGGATTTGGCCGAGGATCTGAATCGCTCCCCTCGCAAATGGGTCTCAAGAAATTCGCTTACTGCCTAGCGTCTCGGAAATTCGACGACTCGCCGCATTCCGGCCAGCTGATTTTAGATTCAACCGGCGTGAAGTCCAGCGGCCTTACCTACACCCCATTCCGGCAGAACCCCTCTGTTTCTAACAACGCTTATAAAGAATACTATTACTTAAACATACGGAAAATCATCGTCGGAAACCAGGCTGTGAAGGTTCCATACAAGTATCTAGTGCCGGGCCCCGATGGGAACGGCGGATCTATCATCGATTCCGGCTCCACCTTCACGTTTATGGACAAACCAGTCTTGGAGGTGGTGGCGCGAGAGTTCGAGAAGCAATTGGCAAATTGGACGAGAGCCACGGATGTGGAAACTCTCACTGGATTACGGCCGTGTTTCGACATTTCGAAGGAGAAATCGGTGAAGTTTCCGGAGTTGATTTTCCAATTTAAAGGTGGAGCGAAATGGGCTCTGCCGTTGAATAACTATTTCGCTTTAGTCAGTAGCTCCGGCGTGGCGTGTTTGACAGTAGTAACGCATCAGATGGAGGACGGCGGTGGCGGTGGGCCGTCTGTGATTTTAGGGGCTTTCCAGCAGCAGAATTTCTATGTGGAATATGACTTGGTGAATCAAAGATTGGGATTTCGGCAACAGACTTGCTCGTAGAAACGATGTCGTATTGGTGTATGCATGTTTATGTTTTCTAACTTTGGGTATTTGGTTGGTCGGTGGCCGGTTAAGCTACTGGTCGGAGCTACAACGGTGGTTTCAATTTGGCAGCGGCGGCGTTCAGATGAATTCAAAATCAATTTTTTTTTCACTTTCTTGTAATAATTATTCGTTGCTGTTGTAATGATATATCTCATATTTCATAATTTGAAGACAGAAAAGAATGTTAAAAAGTAATAGATGATCCCTAATAAGTTAACCTTTTTTCATTCATCATTATATATATAACGCACACACACA

mRNA sequence

TACGTTCATTTGCATACCATTTTCTCTTCTCTAAAACCGCAGCTCTGCTTCCAGCTCACCCCTTTGTCCCTTCTTCCTCTCCATTTATACTTCTATTCCCCATTTCTCATCCCTTCTCATCAACATAACTCCACATTCCCTTCATTCCTTTCCAATCCATAATACCATCACACCATTTCGTTTACAACTAGGAGTAACCTTCTTCCATGGCTTCTCCTTCCCCTCTCTCTTTCTTCTACCTCCTCCTCTTCTCCTCTCTTTCCGCCATTGCCCACTCCAATCCCATCACTCTCCCTCTCAACTCTTTTCCCCACCTTTCTTCTCCAGATCCACTCCAGGCTCTCACTTTCCTCGCCTCTTCTTCCCAGACCAGAGCCCATCAAATCAAAACCCCCAAATCCAACTCTGTTTTCAAGTCCCCTCTCTCCCCCCATAGCTATGGAGCTTACTCCACTCCACTCAGCTTTGGTACTCCACAACAGACTCTGCATTTGATCTTCGATACAGGTAGTAGCCTCGTTTGGTTCCCTTGCACTTCCCGATATCTCTGTTCTGAATGTTCCTTCCCCAAAATAGATCCAACTGGAATCCCCAGATTTGTCCCCAAATTGTCCTCCTCTTCTAAGCTTGTCGGTTGCCAGAATCCCAAATGTGCCTGGATTTTTGGTCCCGATGTCAAATCTCAGTGCCGGAGTTGTAACCCCAAAACAGAGAACTGTACCCAAACTTGCCCTGCTTACGTTGTTCAATATGGTTCCGGTTCCACGGCTGGGCTTTTGCTATCAGAAACCCTTGATTTTCCTGATAAAAAAATCCCTAATTTTGTTGTTGGGTGTTCGTTTTTGTCGATCCATCAGCCCTCTGGAATCGCCGGATTTGGCCGAGGATCTGAATCGCTCCCCTCGCAAATGGGTCTCAAGAAATTCGCTTACTGCCTAGCGTCTCGGAAATTCGACGACTCGCCGCATTCCGGCCAGCTGATTTTAGATTCAACCGGCGTGAAGTCCAGCGGCCTTACCTACACCCCATTCCGGCAGAACCCCTCTGTTTCTAACAACGCTTATAAAGAATACTATTACTTAAACATACGGAAAATCATCGTCGGAAACCAGGCTGTGAAGGTTCCATACAAGTATCTAGTGCCGGGCCCCGATGGGAACGGCGGATCTATCATCGATTCCGGCTCCACCTTCACGTTTATGGACAAACCAGTCTTGGAGGTGGTGGCGCGAGAGTTCGAGAAGCAATTGGCAAATTGGACGAGAGCCACGGATGTGGAAACTCTCACTGGATTACGGCCGTGTTTCGACATTTCGAAGGAGAAATCGGTGAAGTTTCCGGAGTTGATTTTCCAATTTAAAGGTGGAGCGAAATGGGCTCTGCCGTTGAATAACTATTTCGCTTTAGTCAGTAGCTCCGGCGTGGCGTGTTTGACAGTAGTAACGCATCAGATGGAGGACGGCGGTGGCGGTGGGCCGTCTGTGATTTTAGGGGCTTTCCAGCAGCAGAATTTCTATGTGGAATATGACTTGGTGAATCAAAGATTGGGATTTCGGCAACAGACTTGCTCGTAGAAACGATGTCGTATTGGTGTATGCATGTTTATGTTTTCTAACTTTGGGTATTTGGTTGGTCGGTGGCCGGTTAAGCTACTGGTCGGAGCTACAACGGTGGTTTCAATTTGGCAGCGGCGGCGTTCAGATGAATTCAAAATCAATTTTTTTTTCACTTTCTTGTAATAATTATTCGTTGCTGTTGTAATGATATATCTCATATTTCATAATTTGAAGACAGAAAAGAATGTTAAAAAGTAATAGATGATCCCTAATAAGTTAACCTTTTTTCATTCATCATTATATATATAACGCACACACACA

Coding sequence (CDS)

ATGGCTTCTCCTTCCCCTCTCTCTTTCTTCTACCTCCTCCTCTTCTCCTCTCTTTCCGCCATTGCCCACTCCAATCCCATCACTCTCCCTCTCAACTCTTTTCCCCACCTTTCTTCTCCAGATCCACTCCAGGCTCTCACTTTCCTCGCCTCTTCTTCCCAGACCAGAGCCCATCAAATCAAAACCCCCAAATCCAACTCTGTTTTCAAGTCCCCTCTCTCCCCCCATAGCTATGGAGCTTACTCCACTCCACTCAGCTTTGGTACTCCACAACAGACTCTGCATTTGATCTTCGATACAGGTAGTAGCCTCGTTTGGTTCCCTTGCACTTCCCGATATCTCTGTTCTGAATGTTCCTTCCCCAAAATAGATCCAACTGGAATCCCCAGATTTGTCCCCAAATTGTCCTCCTCTTCTAAGCTTGTCGGTTGCCAGAATCCCAAATGTGCCTGGATTTTTGGTCCCGATGTCAAATCTCAGTGCCGGAGTTGTAACCCCAAAACAGAGAACTGTACCCAAACTTGCCCTGCTTACGTTGTTCAATATGGTTCCGGTTCCACGGCTGGGCTTTTGCTATCAGAAACCCTTGATTTTCCTGATAAAAAAATCCCTAATTTTGTTGTTGGGTGTTCGTTTTTGTCGATCCATCAGCCCTCTGGAATCGCCGGATTTGGCCGAGGATCTGAATCGCTCCCCTCGCAAATGGGTCTCAAGAAATTCGCTTACTGCCTAGCGTCTCGGAAATTCGACGACTCGCCGCATTCCGGCCAGCTGATTTTAGATTCAACCGGCGTGAAGTCCAGCGGCCTTACCTACACCCCATTCCGGCAGAACCCCTCTGTTTCTAACAACGCTTATAAAGAATACTATTACTTAAACATACGGAAAATCATCGTCGGAAACCAGGCTGTGAAGGTTCCATACAAGTATCTAGTGCCGGGCCCCGATGGGAACGGCGGATCTATCATCGATTCCGGCTCCACCTTCACGTTTATGGACAAACCAGTCTTGGAGGTGGTGGCGCGAGAGTTCGAGAAGCAATTGGCAAATTGGACGAGAGCCACGGATGTGGAAACTCTCACTGGATTACGGCCGTGTTTCGACATTTCGAAGGAGAAATCGGTGAAGTTTCCGGAGTTGATTTTCCAATTTAAAGGTGGAGCGAAATGGGCTCTGCCGTTGAATAACTATTTCGCTTTAGTCAGTAGCTCCGGCGTGGCGTGTTTGACAGTAGTAACGCATCAGATGGAGGACGGCGGTGGCGGTGGGCCGTCTGTGATTTTAGGGGCTTTCCAGCAGCAGAATTTCTATGTGGAATATGACTTGGTGAATCAAAGATTGGGATTTCGGCAACAGACTTGCTCGTAG

Protein sequence

MASPSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQIKTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS*
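The protein sequence above should correspond to a translation of the coding sequence. As a rough consistency check, the sketch below translates a nucleotide string with the standard genetic code. The helper name `translate` is hypothetical, and the use of the standard (nuclear) code table is an assumption that seems reasonable for a plant nuclear gene.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0; unknown codons become 'X', stops become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )

# First five and last three codons of the CDS listed above.
print(translate("ATGGCTTCTCCTTCC"))  # MASPS
print(translate("TGCTCGTAG"))        # CS*
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence is one way to confirm that the two records agree.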
Homology
BLAST of CSPI03G35440 vs. ExPASy Swiss-Prot
Match: Q940R4 (Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g16563 PE=2 SV=1)

HSP 1 Score: 194.1 bits (492), Expect = 3.4e-48
Identity = 149/501 (29.74%), Positives = 224/501 (44.71%), Query Frame = 0

Query: 7   LSFFYLLLFSSLSA-----IAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQIK 66
           L +++    SSLS      ++HS      L++  H SSP  L   +   SS++ R H  K
Sbjct: 14  LQYYFHFSVSSLSTPLLLHLSHS------LSTSKHSSSPLHLLKSSSSRSSARFRRHHHK 73

Query: 67  TPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFP 126
             +       P+S  S   Y   LS G+    + L  DTGS LVWFPC   + C  C   
Sbjct: 74  --QQQQQLSLPIS--SGSDYLISLSVGSSSSAVSLYLDTGSDLVWFPCRP-FTCILCESK 133

Query: 127 KIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQ-CRSCN-----PKTENCTQT- 186
            + P+        LSSS+  V C +P C+        S  C   N      +T +C  + 
Sbjct: 134 PLPPSP----PSSLSSSATTVSCSSPSCSAAHSSLPSSDLCAISNCPLDFIETGDCNTSS 193

Query: 187 --CPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLP 246
             CP +   YG GS    L S++L  P   + NF  GC+  ++ +P G+AGFGRG  SLP
Sbjct: 194 YPCPPFYYAYGDGSLVAKLYSDSLSLPSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLP 253

Query: 247 SQMGL------KKFAYCLASRKFDDS--PHSGQLIL--------------------DSTG 306
           +Q+ +        F+YCL S  FD         LIL                    D   
Sbjct: 254 AQLAVHSPHLGNSFSYCLVSHSFDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEK 313

Query: 307 VKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKYLVPGPDGNGGSIID 366
            K +   +T   +NP      +  +Y ++++ I +G + +  P        +G GG ++D
Sbjct: 314 KKKNEFVFTEMLENPK-----HPYFYSVSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVD 373

Query: 367 SGSTFTFMDKPVLEVVAREFEKQLAN-WTRATDVETLTGLRPCFDISKEKSVKFPELIFQ 426
           SG+TFT +       V  EF+ ++     RA  VE  +G+ PC+ ++  ++VK P L+  
Sbjct: 374 SGTTFTMLPAKFYNSVVEEFDSRVGRVHERADRVEPSSGMSPCYYLN--QTVKVPALVLH 433

Query: 427 FKGG-AKWALPLNNYFALVSSSG--------VACLTVVTHQMEDGGGGGPSVILGAFQQQ 456
           F G  +   LP  NYF      G        + CL ++    E    GG   ILG +QQQ
Sbjct: 434 FAGNRSSVTLPRRNYFYEFMDGGDGKEEKRKIGCLMLMNGGDESELRGGTGAILGNYQQQ 492
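The percentages in an HSP summary line are the match counts divided by the alignment length. The sketch below recomputes them from a summary line. The function name `hsp_fractions` is hypothetical, and the parsing assumes the "label = matches/length" layout used on this page.

```python
import re

def hsp_fractions(summary):
    """Return a list of (matches, aligned_length, percent) tuples, one per
    'matches/length' pair found in an HSP summary line, in order."""
    results = []
    for matches, length in re.findall(r"=\s*(\d+)/(\d+)", summary):
        matches, length = int(matches), int(length)
        results.append((matches, length, round(100 * matches / length, 2)))
    return results

line = "Identity = 149/501 (29.74%), Positives = 224/501 (44.71%), Query Frame = 0"
for matches, length, percent in hsp_fractions(line):
    print(f"{matches}/{length} = {percent}%")
```

For the first Swiss-Prot hit this reproduces the reported 29.74% identity and 44.71% positives.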

BLAST of CSPI03G35440 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 159.1 bits (401), Expect = 1.2e-37
Identity = 143/465 (30.75%), Positives = 197/465 (42.37%), Query Frame = 0

Query: 15  FSSLSAIAHSNPITLPLNSFPHLSS---PDP-----LQALTFLASSSQTRAHQI------ 74
           F S S    S+ ITL L+    LSS   PD      LQ  +    S  T A QI      
Sbjct: 60  FESGSDSESSSSITLNLDHIDALSSNKTPDELFSSRLQRDSRRVKSIATLAAQIPGRNVT 119

Query: 75  KTPKSNSVFKSPLSPHSYGA--YSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSEC 134
             P+      S +S  S G+  Y T L  GTP + ++++ DTGS +VW  C     C  C
Sbjct: 120 HAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAP---CRRC 179

Query: 135 SFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY 194
            + + DP     F P+ S +   + C +P C        +     CN + + C      Y
Sbjct: 180 -YSQSDPI----FDPRKSKTYATIPCSSPHCR-------RLDSAGCNTRRKTC-----LY 239

Query: 195 VVQYGSGS-TAGLLLSETLDFPDKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQ 254
            V YG GS T G   +ETL F   ++    +GC   +       +G+ G G+G  S P Q
Sbjct: 240 QVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQ 299

Query: 255 MGLK---KFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYY 314
            G +   KF+YCL  R     P S   ++      S    +TP   NP +       +YY
Sbjct: 300 TGHRFNQKFSYCLVDRSASSKPSS---VVFGNAAVSRIARFTPLLSNPKLDT-----FYY 359

Query: 315 LNIRKIIVGNQAVK-VPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLAN 374
           + +  I VG   V  V          GNGG IIDSG++ T + +P    +   F      
Sbjct: 360 VGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKT 419

Query: 375 WTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLT 434
             RA D         CFD+S    VK P ++  F+ GA  +LP  NY   V ++G  C  
Sbjct: 420 LKRAPDFSLFD---TCFDLSNMNEVKVPTVVLHFR-GADVSLPATNYLIPVDTNGKFCFA 479

Query: 435 VVTHQMEDGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 456
                   G  GG S+I G  QQQ F V YDL + R+GF    C+
Sbjct: 480 FA------GTMGGLSII-GNIQQQGFRVVYDLASSRVGFAPGGCA 485

BLAST of CSPI03G35440 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 149.4 bits (376), Expect = 9.6e-35
Identity = 122/383 (31.85%), Positives = 169/383 (44.13%), Query Frame = 0

Query: 79  GAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSS 138
           G Y   ++ GTP  +   I DTGS L+W  C     C++C      PT  P F P+ SSS
Sbjct: 94  GEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEP---CTQCF---SQPT--PIFNPQDSSS 153

Query: 139 SKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTA-GLLLSETLD 198
              + C++  C      D+ S         E C      Y   YG GST  G + +ET  
Sbjct: 154 FSTLPCESQYC-----QDLPS---------ETCNNNECQYTYGYGDGSTTQGYMATETFT 213

Query: 199 FPDKKIPNFVVGC----SFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSP 258
           F    +PN   GC            +G+ G G G  SLPSQ+G+ +F+YC+ S     SP
Sbjct: 214 FETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYG-SSSP 273

Query: 259 HSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKYLVP 318
            +  L   ++GV     + T       + ++    YYY+ ++ I VG   + +P      
Sbjct: 274 STLALGSAASGVPEGSPSTT------LIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQL 333

Query: 319 GPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPCFDISKEK 378
             DG GG IIDSG+T T++ +     VA+ F  Q+      T  E+ +GL  CF    + 
Sbjct: 334 QDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQI---NLPTVDESSSGLSTCFQQPSDG 393

Query: 379 S-VKFPELIFQFKGGAKWALPLNNYFALVS-SSGVACLTVVTHQMEDGGGGGPSVILGAF 438
           S V+ PE+  QF GG    L L     L+S + GV CL      M      G S I G  
Sbjct: 394 STVQVPEISMQFDGG---VLNLGEQNILISPAEGVICLA-----MGSSSQLGIS-IFGNI 435

Query: 439 QQQNFYVEYDLVNQRLGFRQQTC 455
           QQQ   V YDL N  + F    C
Sbjct: 454 QQQETQVLYDLQNLAVSFVPTQC 435
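In these alignments, the middle line marks identical residues with the residue letter and similar residues with "+". Identities can also be counted directly from the Query and Sbjct rows. The sketch below does so for one pair of aligned strings. The function name `count_identities` is hypothetical, and it assumes both rows have equal length with "-" marking gaps.

```python
def count_identities(query_row, sbjct_row):
    """Return (identical_columns, total_columns) for two aligned rows.
    Columns containing a gap ('-') in either row never count as identical."""
    if len(query_row) != len(sbjct_row):
        raise ValueError("aligned rows must have equal length")
    identical = sum(
        1
        for q, s in zip(query_row, sbjct_row)
        if q == s and q != "-"
    )
    return identical, len(query_row)

# Last alignment block of the nepenthesin-2 hit above.
identical, columns = count_identities(
    "QQQNFYVEYDLVNQRLGFRQQTC",
    "QQQETQVLYDLQNLAVSFVPTQC",
)
print(identical, columns)
```

Summing these counts over all blocks of an HSP should give the numerator and denominator of the reported identity, provided the rows are copied without losing alignment whitespace.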

BLAST of CSPI03G35440 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 145.6 bits (366), Expect = 1.4e-33
Identity = 113/387 (29.20%), Positives = 160/387 (41.34%), Query Frame = 0

Query: 79  GAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSS 138
           G Y   LS GTP Q    I DTGS L+W  C     C   S P  +P G        SSS
Sbjct: 93  GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQG--------SSS 152

Query: 139 SKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TAGLLLSETLD 198
              + C +  C  +  P               C+     Y   YG GS T G + +ETL 
Sbjct: 153 FSTLPCSSQLCQALSSP--------------TCSNNFCQYTYGYGDGSETQGSMGTETLT 212

Query: 199 FPDKKIPNFVVGC----SFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSP 258
           F    IPN   GC            +G+ G GRG  SLPSQ+ + KF+YC+       S 
Sbjct: 213 FGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMT--PIGSST 272

Query: 259 HSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKV-PYKYLV 318
            S  L+       ++G   T   Q+  +       +YY+ +  + VG+  + + P  + +
Sbjct: 273 PSNLLLGSLANSVTAGSPNTTLIQSSQIPT-----FYYITLNGLSVGSTRLPIDPSAFAL 332

Query: 319 PGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPCFDI--- 378
              +G GG IIDSG+T T+      + V +EF  Q+       ++  + G    FD+   
Sbjct: 333 NSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQI-------NLPVVNGSSSGFDLCFQ 392

Query: 379 --SKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGGGGPSVI 438
             S   +++ P  +  F GG    LP  NYF +  S+G+ CL +       G       I
Sbjct: 393 TPSDPSNLQIPTFVMHFDGG-DLELPSENYF-ISPSNGLICLAM-------GSSSQGMSI 434

Query: 439 LGAFQQQNFYVEYDLVNQRLGFRQQTC 455
            G  QQQN  V YD  N  + F    C
Sbjct: 453 FGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CSPI03G35440 vs. ExPASy Swiss-Prot
Match: Q6F4N5 (Aspartyl protease 25 OS=Oryza sativa subsp. japonica OX=39947 GN=AP25 PE=2 SV=1)

HSP 1 Score: 139.8 bits (351), Expect = 7.6e-32
Identity = 127/468 (27.14%), Positives = 200/468 (42.74%), Query Frame = 0

Query: 1   MASPSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQI 60
           MA+ + +    LLL ++++A A      L +    H SSP PL+++  LA     R   +
Sbjct: 1   MAATTTIPLLLLLLAATVAAAA----AELSVYHNVHPSSPSPLESIIALARDDDARLLFL 60

Query: 61  KTPKSNS-VFKSPL-SPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSEC 120
            +  + + V  +P+ S  +  +Y      G+P Q L L  DT +   W  C+    C  C
Sbjct: 61  SSKAATAGVSSAPVASGQAPPSYVVRAGLGSPSQQLLLALDTSADATWAHCSP---CGTC 120

Query: 121 SFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY 180
               +       F P  SSS   + C +  C    G    +     +      T    A+
Sbjct: 121 PSSSL-------FAPANSSSYASLPCSSSWCPLFQGQACPAPQGGGDAAPPPATLPTCAF 180

Query: 181 VVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPS------GIAGFGRGSESLP 240
              +   S    L S+TL      IPN+  GC   S+  P+      G+ G GRG  +L 
Sbjct: 181 SKPFADASFQAALASDTLRLGKDAIPNYTFGC-VSSVTGPTTNMPRQGLLGLGRGPMALL 240

Query: 241 SQMGL---KKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEY 300
           SQ G      F+YCL S  +     SG L L + G +   + YTP  +NP  S+      
Sbjct: 241 SQAGSLYNGVFSYCLPS--YRSYYFSGSLRLGAGGGQPRSVRYTPMLRNPHRSS-----L 300

Query: 301 YYLNIRKIIVGNQAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLA 360
           YY+N+  + VG+  VKVP            G+++DSG+  T    PV   +  EF +Q+A
Sbjct: 301 YYVNVTGLSVGHAWVKVPAGSFAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVA 360

Query: 361 NWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACL 420
                +   +L     CF+  +  +   P +     GG   ALP+ N     S++ +ACL
Sbjct: 361 ---APSGYTSLGAFDTCFNTDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACL 420

Query: 421 TVVTHQMEDGGGGGPSV--ILGAFQQQNFYVEYDLVNQRLGFRQQTCS 456
                 M +      SV  ++   QQQN  V +D+ N R+GF +++C+
Sbjct: 421 A-----MAEAPQNVNSVVNVIANLQQQNIRVVFDVANSRVGFAKESCN 438

BLAST of CSPI03G35440 vs. ExPASy TrEMBL
Match: A0A0A0LBI9 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G778440 PE=3 SV=1)

HSP 1 Score: 926.4 bits (2393), Expect = 4.6e-266
Identity = 453/457 (99.12%), Positives = 455/457 (99.56%), Query Frame = 0

Query: 1   MASPSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQI 60
           MASPSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQI
Sbjct: 1   MASPSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQI 60

Query: 61  KTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF 120
           KTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF
Sbjct: 61  KTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF 120

Query: 121 PKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 180
           PKIDPTGIPRFVPKLSSSSKLVGCQNPKC+WIFGPDVKSQCRSCNPKTENCTQTCPAYVV
Sbjct: 121 PKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 180

Query: 181 QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 240
           QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF
Sbjct: 181 QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 240

Query: 241 AYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVG 300
           AYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVG
Sbjct: 241 AYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVG 300

Query: 301 NQAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETL 360
           NQAVKVPYK+LVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETL
Sbjct: 301 NQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETL 360

Query: 361 TGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMED-- 420
           TGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMED  
Sbjct: 361 TGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGG 420

Query: 421 GGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 456
           GGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS
Sbjct: 421 GGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457

BLAST of CSPI03G35440 vs. ExPASy TrEMBL
Match: A0A5A7TRK2 (Aspartic proteinase nepenthesin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G004410 PE=3 SV=1)

HSP 1 Score: 882.1 bits (2278), Expect = 1.0e-252
Identity = 428/455 (94.07%), Positives = 446/455 (98.02%), Query Frame = 0

Query: 1   MASPSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQI 60
           MASPSPLSFFY+LLFSSLSAI++SNPITLPLNS PHLSS DPLQALTFLAS+S+ RAH+I
Sbjct: 1   MASPSPLSFFYILLFSSLSAISNSNPITLPLNSSPHLSSSDPLQALTFLASASKNRAHRI 60

Query: 61  KTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF 120
           KTPKSNSV KSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLC+ECSF
Sbjct: 61  KTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCTECSF 120

Query: 121 PKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 180
           PKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVV
Sbjct: 121 PKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 180

Query: 181 QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 240
           QYGSGSTAGLLLSETLDFP+KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF
Sbjct: 181 QYGSGSTAGLLLSETLDFPNKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 240

Query: 241 AYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVG 300
           AYCLASRKFDDS HSGQLILDS+GVK+SGLTYT FRQNPSVSN+AYKEYYYLNIRKIIVG
Sbjct: 241 AYCLASRKFDDSAHSGQLILDSSGVKTSGLTYTSFRQNPSVSNHAYKEYYYLNIRKIIVG 300

Query: 301 NQAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETL 360
           NQAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVL+VVA+EFEKQLAN TRATDVETL
Sbjct: 301 NQAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVLDVVAQEFEKQLANRTRATDVETL 360

Query: 361 TGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGG 420
           TGLRPCFD+SKEKSV+FPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTH  EDGG
Sbjct: 361 TGLRPCFDVSKEKSVEFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHNTEDGG 420

Query: 421 GGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 456
           GGGPSVILGAFQQQNFYVEYDLVN+RLGFR+QTC+
Sbjct: 421 GGGPSVILGAFQQQNFYVEYDLVNERLGFRKQTCT 455

BLAST of CSPI03G35440 vs. ExPASy TrEMBL
Match: A0A1S3B6B5 (aspartic proteinase nepenthesin-2 OS=Cucumis melo OX=3656 GN=LOC103486666 PE=3 SV=1)

HSP 1 Score: 882.1 bits (2278), Expect = 1.0e-252
Identity = 428/455 (94.07%), Positives = 446/455 (98.02%), Query Frame = 0

Query: 1   MASPSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQI 60
           MASPSPLSFFY+LLFSSLSAI++SNPITLPLNS PHLSS DPLQALTFLAS+S+ RAH+I
Sbjct: 1   MASPSPLSFFYILLFSSLSAISNSNPITLPLNSSPHLSSSDPLQALTFLASASKNRAHRI 60

Query: 61  KTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF 120
           KTPKSNSV KSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLC+ECSF
Sbjct: 61  KTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCTECSF 120

Query: 121 PKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 180
           PKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVV
Sbjct: 121 PKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 180

Query: 181 QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 240
           QYGSGSTAGLLLSETLDFP+KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF
Sbjct: 181 QYGSGSTAGLLLSETLDFPNKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 240

Query: 241 AYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVG 300
           AYCLASRKFDDS HSGQLILDS+GVK+SGLTYT FRQNPSVSN+AYKEYYYLNIRKIIVG
Sbjct: 241 AYCLASRKFDDSAHSGQLILDSSGVKTSGLTYTSFRQNPSVSNHAYKEYYYLNIRKIIVG 300

Query: 301 NQAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETL 360
           NQAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVL+VVA+EFEKQLAN TRATDVETL
Sbjct: 301 NQAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVLDVVAQEFEKQLANRTRATDVETL 360

Query: 361 TGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGG 420
           TGLRPCFD+SKEKSV+FPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTH  EDGG
Sbjct: 361 TGLRPCFDVSKEKSVEFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHNTEDGG 420

Query: 421 GGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 456
           GGGPSVILGAFQQQNFYVEYDLVN+RLGFR+QTC+
Sbjct: 421 GGGPSVILGAFQQQNFYVEYDLVNERLGFRKQTCT 455

BLAST of CSPI03G35440 vs. ExPASy TrEMBL
Match: A0A6J1IXY3 (probable aspartyl protease At4g16563 OS=Cucurbita maxima OX=3661 GN=LOC111481640 PE=3 SV=1)

HSP 1 Score: 820.8 bits (2119), Expect = 2.7e-234
Identity = 398/457 (87.09%), Positives = 423/457 (92.56%), Query Frame = 0

Query: 1   MASPSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQI 60
           MA P PL FFY+LL SS+SAIA +NPIT+PL+SFPH SS DPLQ L FLAS+SQ RAHQI
Sbjct: 1   MAPPPPLCFFYILLVSSVSAIADTNPITIPLSSFPHHSSSDPLQTLNFLASASQNRAHQI 60

Query: 61  KTPK--SNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSEC 120
           K PK  SNSV KSPLSPHSYGAYSTPLSFGTP QTLHLIFDTGSSLVW PCTS+YLCSEC
Sbjct: 61  KAPKSESNSVSKSPLSPHSYGAYSTPLSFGTPPQTLHLIFDTGSSLVWLPCTSKYLCSEC 120

Query: 121 SFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY 180
           SFPKIDP GIPRF+PKLSS+SKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY
Sbjct: 121 SFPKIDPAGIPRFIPKLSSTSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY 180

Query: 181 VVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK 240
           VVQYGSGSTAGLLLSETLDFPDKK  NFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK
Sbjct: 181 VVQYGSGSTAGLLLSETLDFPDKKFTNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK 240

Query: 241 KFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKII 300
           KFAYCLASRKFDDSPH+G+LILDS+G K+SGL+YTPFRQNPSVSN+AYKEYYYL IRKI 
Sbjct: 241 KFAYCLASRKFDDSPHAGELILDSSGAKTSGLSYTPFRQNPSVSNHAYKEYYYLTIRKIF 300

Query: 301 VGNQAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVE 360
           VG +AVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPV E VA+E EKQLAN TRATDVE
Sbjct: 301 VGKKAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEIEKQLANRTRATDVE 360

Query: 361 TLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMED 420
           +LTGLRPCFDISK+KSV+FPEL FQ KGGAKW LPL+NYFALVSSSGVACLTVVTH+  D
Sbjct: 361 SLTGLRPCFDISKDKSVEFPELTFQLKGGAKWGLPLSNYFALVSSSGVACLTVVTHKTAD 420

Query: 421 GGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 456
             GGGPS+ILGAFQQQNFYVEYDLVNQ++GFRQQTCS
Sbjct: 421 -SGGGPSIILGAFQQQNFYVEYDLVNQKIGFRQQTCS 456

BLAST of CSPI03G35440 vs. ExPASy TrEMBL
Match: A0A6J1F3G5 (probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC111441834 PE=3 SV=1)

HSP 1 Score: 819.3 bits (2115), Expect = 7.9e-234
Identity = 399/457 (87.31%), Positives = 423/457 (92.56%), Query Frame = 0

Query: 1   MASPSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQI 60
           MA P PL FFY+LL SS+SAIA +NPITLPL+SFPH SS DPLQ L FLAS+SQ RAHQI
Sbjct: 1   MAPPPPLCFFYILLLSSVSAIADTNPITLPLSSFPHHSSSDPLQTLNFLASASQNRAHQI 60

Query: 61  KTP--KSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSEC 120
           K P  KSNSV KSPLSPHSYGAYSTPLSFGTP QTLHLIFDTGSSLVW PCTS+YLCSEC
Sbjct: 61  KAPKSKSNSVSKSPLSPHSYGAYSTPLSFGTPSQTLHLIFDTGSSLVWLPCTSKYLCSEC 120

Query: 121 SFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY 180
           SFPKIDP  IPRF+PKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY
Sbjct: 121 SFPKIDPARIPRFIPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY 180

Query: 181 VVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK 240
           VVQYGSGSTAGLLLSETLDFP+KKI NFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK
Sbjct: 181 VVQYGSGSTAGLLLSETLDFPNKKITNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK 240

Query: 241 KFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKII 300
           KFAYCLASRKFDDSPH+G+LILDS+G K+SGLTYTPFRQNPSVSN+AYKEYYYL IRKI 
Sbjct: 241 KFAYCLASRKFDDSPHAGELILDSSGAKTSGLTYTPFRQNPSVSNHAYKEYYYLTIRKIF 300

Query: 301 VGNQAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVE 360
           VGN+AVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPV E VA+E EKQLAN TRATDVE
Sbjct: 301 VGNKAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEIEKQLANRTRATDVE 360

Query: 361 TLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMED 420
           +LTGLRPCFDISK+KSV+FPEL F  KGGAKWA PL+NYFALVSSSGVACLTVVTH+  +
Sbjct: 361 SLTGLRPCFDISKDKSVEFPELTFHLKGGAKWAPPLSNYFALVSSSGVACLTVVTHKAAE 420

Query: 421 GGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 456
             GGGPS+ILGAFQQQNFYVEYDLVNQ++GFRQQTCS
Sbjct: 421 -SGGGPSIILGAFQQQNFYVEYDLVNQKIGFRQQTCS 456

BLAST of CSPI03G35440 vs. NCBI nr
Match: XP_004136706.1 (probable aspartyl protease At4g16563 [Cucumis sativus] >KGN59188.1 hypothetical protein Csa_002380 [Cucumis sativus])

HSP 1 Score: 926.4 bits (2393), Expect = 9.5e-266
Identity = 453/457 (99.12%), Positives = 455/457 (99.56%), Query Frame = 0

Query: 1   MASPSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQI 60
           MASPSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQI
Sbjct: 1   MASPSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQI 60

Query: 61  KTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF 120
           KTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF
Sbjct: 61  KTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF 120

Query: 121 PKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 180
           PKIDPTGIPRFVPKLSSSSKLVGCQNPKC+WIFGPDVKSQCRSCNPKTENCTQTCPAYVV
Sbjct: 121 PKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 180

Query: 181 QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 240
           QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF
Sbjct: 181 QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 240

Query: 241 AYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVG 300
           AYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVG
Sbjct: 241 AYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVG 300

Query: 301 NQAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETL 360
           NQAVKVPYK+LVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETL
Sbjct: 301 NQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETL 360

Query: 361 TGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMED-- 420
           TGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMED  
Sbjct: 361 TGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGG 420

Query: 421 GGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 456
           GGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS
Sbjct: 421 GGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457

BLAST of CSPI03G35440 vs. NCBI nr
Match: XP_008442902.1 (PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo] >KAA0043829.1 aspartic proteinase nepenthesin-2 [Cucumis melo var. makuwa] >TYK25303.1 aspartic proteinase nepenthesin-2 [Cucumis melo var. makuwa])

HSP 1 Score: 882.1 bits (2278), Expect = 2.1e-252
Identity = 428/455 (94.07%), Positives = 446/455 (98.02%), Query Frame = 0

Query: 1   MASPSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQI 60
           MASPSPLSFFY+LLFSSLSAI++SNPITLPLNS PHLSS DPLQALTFLAS+S+ RAH+I
Sbjct: 1   MASPSPLSFFYILLFSSLSAISNSNPITLPLNSSPHLSSSDPLQALTFLASASKNRAHRI 60

Query: 61  KTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF 120
           KTPKSNSV KSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLC+ECSF
Sbjct: 61  KTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCTECSF 120

Query: 121 PKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 180
           PKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVV
Sbjct: 121 PKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 180

Query: 181 QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 240
           QYGSGSTAGLLLSETLDFP+KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF
Sbjct: 181 QYGSGSTAGLLLSETLDFPNKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 240

Query: 241 AYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVG 300
           AYCLASRKFDDS HSGQLILDS+GVK+SGLTYT FRQNPSVSN+AYKEYYYLNIRKIIVG
Sbjct: 241 AYCLASRKFDDSAHSGQLILDSSGVKTSGLTYTSFRQNPSVSNHAYKEYYYLNIRKIIVG 300

Query: 301 NQAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETL 360
           NQAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVL+VVA+EFEKQLAN TRATDVETL
Sbjct: 301 NQAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVLDVVAQEFEKQLANRTRATDVETL 360

Query: 361 TGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGG 420
           TGLRPCFD+SKEKSV+FPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTH  EDGG
Sbjct: 361 TGLRPCFDVSKEKSVEFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHNTEDGG 420

Query: 421 GGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 456
           GGGPSVILGAFQQQNFYVEYDLVN+RLGFR+QTC+
Sbjct: 421 GGGPSVILGAFQQQNFYVEYDLVNERLGFRKQTCT 455

BLAST of CSPI03G35440 vs. NCBI nr
Match: XP_038905730.1 (probable aspartyl protease At4g16563 [Benincasa hispida])

HSP 1 Score: 870.5 bits (2248), Expect = 6.2e-249
Identity = 420/455 (92.31%), Positives = 441/455 (96.92%), Query Frame = 0

Query: 1   MASPSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQI 60
           MA PS LSFFY+LLFSS+SAIA++NPITLPLN+FPHLSS DPLQ LTFLAS+SQ RAHQI
Sbjct: 1   MAPPSSLSFFYILLFSSVSAIANTNPITLPLNAFPHLSSSDPLQTLTFLASASQNRAHQI 60

Query: 61  KTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF 120
           KTPKSNSV KSPL PHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF
Sbjct: 61  KTPKSNSVSKSPLFPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF 120

Query: 121 PKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 180
           PKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGP+VKSQCRSCNPKTENCTQTCPAYVV
Sbjct: 121 PKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPEVKSQCRSCNPKTENCTQTCPAYVV 180

Query: 181 QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 240
           QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF
Sbjct: 181 QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 240

Query: 241 AYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVG 300
           AYCLASRKFDDSPHSG+LILDSTGVK+SGL+YTPFRQNPSVSN+AYKEYYYLNIRKI VG
Sbjct: 241 AYCLASRKFDDSPHSGELILDSTGVKTSGLSYTPFRQNPSVSNHAYKEYYYLNIRKIFVG 300

Query: 301 NQAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETL 360
           NQAVKVPYK+LVPGPDGNGGSIIDSGSTFTFMDKPV E VA+EFEKQLAN TRATDVE+L
Sbjct: 301 NQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESL 360

Query: 361 TGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGG 420
           TGLRPCFDISK+KSV+FPELIFQFKGGAKWALPL+NYFALVSSSGVACLTVVTH+ E GG
Sbjct: 361 TGLRPCFDISKDKSVEFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEAGG 420

Query: 421 GGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 456
           GGGPSVI GAFQQQNFYVEYDLVN++LGFRQQTC+
Sbjct: 421 GGGPSVIFGAFQQQNFYVEYDLVNEKLGFRQQTCT 455

BLAST of CSPI03G35440 vs. NCBI nr
Match: XP_023528159.1 (probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 821.6 bits (2121), Expect = 3.3e-234
Identity = 401/457 (87.75%), Positives = 424/457 (92.78%), Query Frame = 0

Query: 1   MASPSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQI 60
           MA P  L FFY+LL SS+SAIA +NPITLPL+SFPH SS DPLQ L FLAS+SQ RAHQI
Sbjct: 1   MAPPPLLCFFYILLLSSVSAIADTNPITLPLSSFPHHSSSDPLQTLNFLASASQNRAHQI 60

Query: 61  KTP--KSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSEC 120
           K P  KSNSV KSPLSPHSYGAYSTPLSFGTP QTLHLIFDTGSSLVW PCTS+YLCSEC
Sbjct: 61  KAPKSKSNSVSKSPLSPHSYGAYSTPLSFGTPSQTLHLIFDTGSSLVWLPCTSKYLCSEC 120

Query: 121 SFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY 180
           SFPKIDP GIPRF+PKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY
Sbjct: 121 SFPKIDPAGIPRFIPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY 180

Query: 181 VVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK 240
           VVQYGSGSTAGLLLSETLDF +KKI NFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK
Sbjct: 181 VVQYGSGSTAGLLLSETLDFANKKITNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK 240

Query: 241 KFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKII 300
           KFAYCLASRKFDDSPH+G+LILDS+G K+SGLTYTPFRQNPSVSN+AYKEYYYL IRKI 
Sbjct: 241 KFAYCLASRKFDDSPHAGELILDSSGAKTSGLTYTPFRQNPSVSNHAYKEYYYLTIRKIF 300

Query: 301 VGNQAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVE 360
           VGN+AVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPV E VA+E EKQLAN TRATDVE
Sbjct: 301 VGNKAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEIEKQLANRTRATDVE 360

Query: 361 TLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMED 420
           +LTGLRPCFDISK+KSV+FPEL FQ KGGAKWALPL+NYFALVSSSGVACLTVVTH+  D
Sbjct: 361 SLTGLRPCFDISKDKSVEFPELTFQLKGGAKWALPLSNYFALVSSSGVACLTVVTHKAAD 420

Query: 421 GGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 456
             GGGPS+ILGAFQQQNFYVEYDLVNQ++GFRQQTCS
Sbjct: 421 -SGGGPSIILGAFQQQNFYVEYDLVNQKIGFRQQTCS 456

BLAST of CSPI03G35440 vs. NCBI nr
Match: XP_022982947.1 (probable aspartyl protease At4g16563 [Cucurbita maxima])

HSP 1 Score: 820.8 bits (2119), Expect = 5.6e-234
Identity = 398/457 (87.09%), Positives = 423/457 (92.56%), Query Frame = 0

Query: 1   MASPSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQI 60
           MA P PL FFY+LL SS+SAIA +NPIT+PL+SFPH SS DPLQ L FLAS+SQ RAHQI
Sbjct: 1   MAPPPPLCFFYILLVSSVSAIADTNPITIPLSSFPHHSSSDPLQTLNFLASASQNRAHQI 60

Query: 61  KTPK--SNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSEC 120
           K PK  SNSV KSPLSPHSYGAYSTPLSFGTP QTLHLIFDTGSSLVW PCTS+YLCSEC
Sbjct: 61  KAPKSESNSVSKSPLSPHSYGAYSTPLSFGTPPQTLHLIFDTGSSLVWLPCTSKYLCSEC 120

Query: 121 SFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY 180
           SFPKIDP GIPRF+PKLSS+SKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY
Sbjct: 121 SFPKIDPAGIPRFIPKLSSTSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY 180

Query: 181 VVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK 240
           VVQYGSGSTAGLLLSETLDFPDKK  NFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK
Sbjct: 181 VVQYGSGSTAGLLLSETLDFPDKKFTNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK 240

Query: 241 KFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKII 300
           KFAYCLASRKFDDSPH+G+LILDS+G K+SGL+YTPFRQNPSVSN+AYKEYYYL IRKI 
Sbjct: 241 KFAYCLASRKFDDSPHAGELILDSSGAKTSGLSYTPFRQNPSVSNHAYKEYYYLTIRKIF 300

Query: 301 VGNQAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVE 360
           VG +AVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPV E VA+E EKQLAN TRATDVE
Sbjct: 301 VGKKAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEIEKQLANRTRATDVE 360

Query: 361 TLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMED 420
           +LTGLRPCFDISK+KSV+FPEL FQ KGGAKW LPL+NYFALVSSSGVACLTVVTH+  D
Sbjct: 361 SLTGLRPCFDISKDKSVEFPELTFQLKGGAKWGLPLSNYFALVSSSGVACLTVVTHKTAD 420

Query: 421 GGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 456
             GGGPS+ILGAFQQQNFYVEYDLVNQ++GFRQQTCS
Sbjct: 421 -SGGGPSIILGAFQQQNFYVEYDLVNQKIGFRQQTCS 456

BLAST of CSPI03G35440 vs. TAIR 10
Match: AT3G52500.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 538.1 bits (1385), Expect = 6.8e-153
Identity = 269/474 (56.75%), Positives = 341/474 (71.94%), Query Frame = 0

Query: 5   SPLSFFYLLLFSSLSAIAHSNPITLPLNSFPH--LSSPDPLQALTFLASSSQTRAHQIK- 64
           S + FF+L+  S +SA      + LPL+ F H   S  DP  +L  LA SS  RAH++K 
Sbjct: 3   SSIFFFFLIFLSVVSA------VKLPLSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKH 62

Query: 65  --------------TPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWF 124
                         T  S +V KSPLS  SYG YS  LSFGTP QT+  +FDTGSSLVW 
Sbjct: 63  GTSIKPDEDALSSTTTASATVVKSPLSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWL 122

Query: 125 PCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPK 184
           PCTSRYLCS C F  +DPT IPRF+PK SSSSK++GCQ+PKC +++GP+V  QCR C+P 
Sbjct: 123 PCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSKIIGCQSPKCQFLYGPNV--QCRGCDPN 182

Query: 185 TENCTQTCPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRG 244
           T NCT  CP Y++QYG GSTAG+L++E LDFPD  +P+FVVGCS +S  QP+GIAGFGRG
Sbjct: 183 TRNCTVGCPPYILQYGLGSTAGVLITEKLDFPDLTVPDFVVGCSIISTRQPAGIAGFGRG 242

Query: 245 SESLPSQMGLKKFAYCLASRKFDDSPHSGQLILD-----STGVKSSGLTYTPFRQNPSVS 304
             SLPSQM LK+F++CL SR+FDD+  +  L LD     ++G K+ GLTYTPFR+NP+VS
Sbjct: 243 PVSLPSQMNLKRFSHCLVSRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVS 302

Query: 305 NNAYKEYYYLNIRKIIVGNQAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAR 364
           N A+ EYYYLN+R+I VG + VK+PYKYL PG +G+GGSI+DSGSTFTFM++PV E+VA 
Sbjct: 303 NKAFLEYYYLNLRRIYVGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAE 362

Query: 365 EFEKQLANWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVS 424
           EF  Q++N+TR  D+E  TGL PCF+IS +  V  PELIF+FKGGAK  LPL+NYF  V 
Sbjct: 363 EFASQMSNYTREKDLEKETGLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVG 422

Query: 425 SSGVACLTVVTHQ-MEDGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 456
           ++   CLTVV+ + +   GG GP++ILG+FQQQN+ VEYDL N R GF ++ CS
Sbjct: 423 NTDTVCLTVVSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468

BLAST of CSPI03G35440 vs. TAIR 10
Match: AT4G16563.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 194.1 bits (492), Expect = 2.4e-49
Identity = 149/501 (29.74%), Positives = 224/501 (44.71%), Query Frame = 0

Query: 7   LSFFYLLLFSSLSA-----IAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQIK 66
           L +++    SSLS      ++HS      L++  H SSP  L   +   SS++ R H  K
Sbjct: 14  LQYYFHFSVSSLSTPLLLHLSHS------LSTSKHSSSPLHLLKSSSSRSSARFRRHHHK 73

Query: 67  TPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFP 126
             +       P+S  S   Y   LS G+    + L  DTGS LVWFPC   + C  C   
Sbjct: 74  --QQQQQLSLPIS--SGSDYLISLSVGSSSSAVSLYLDTGSDLVWFPCRP-FTCILCESK 133

Query: 127 KIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQ-CRSCN-----PKTENCTQT- 186
            + P+        LSSS+  V C +P C+        S  C   N      +T +C  + 
Sbjct: 134 PLPPSP----PSSLSSSATTVSCSSPSCSAAHSSLPSSDLCAISNCPLDFIETGDCNTSS 193

Query: 187 --CPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLP 246
             CP +   YG GS    L S++L  P   + NF  GC+  ++ +P G+AGFGRG  SLP
Sbjct: 194 YPCPPFYYAYGDGSLVAKLYSDSLSLPSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLP 253

Query: 247 SQMGL------KKFAYCLASRKFDDS--PHSGQLIL--------------------DSTG 306
           +Q+ +        F+YCL S  FD         LIL                    D   
Sbjct: 254 AQLAVHSPHLGNSFSYCLVSHSFDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEK 313

Query: 307 VKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKYLVPGPDGNGGSIID 366
            K +   +T   +NP      +  +Y ++++ I +G + +  P        +G GG ++D
Sbjct: 314 KKKNEFVFTEMLENPK-----HPYFYSVSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVD 373

Query: 367 SGSTFTFMDKPVLEVVAREFEKQLAN-WTRATDVETLTGLRPCFDISKEKSVKFPELIFQ 426
           SG+TFT +       V  EF+ ++     RA  VE  +G+ PC+ ++  ++VK P L+  
Sbjct: 374 SGTTFTMLPAKFYNSVVEEFDSRVGRVHERADRVEPSSGMSPCYYLN--QTVKVPALVLH 433

Query: 427 FKGG-AKWALPLNNYFALVSSSG--------VACLTVVTHQMEDGGGGGPSVILGAFQQQ 456
           F G  +   LP  NYF      G        + CL ++    E    GG   ILG +QQQ
Sbjct: 434 FAGNRSSVTLPRRNYFYEFMDGGDGKEEKRKIGCLMLMNGGDESELRGGTGAILGNYQQQ 492

BLAST of CSPI03G35440 vs. TAIR 10
Match: AT5G45120.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 189.5 bits (480), Expect = 5.9e-48
Identity = 155/490 (31.63%), Positives = 226/490 (46.12%), Query Frame = 0

Query: 1   MASPSPLSFFYLLLFSSLS------AIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQ 60
           M + + + F +LL+   L+      A  H NP +   +SF  L+      +L    S +Q
Sbjct: 1   METQTHVLFLFLLITLLLNTTNKTQARQHKNP-SSSSSSFLVLTLTKSSVSLPTPKSQTQ 60

Query: 61  TRAHQIKTPKSN-SVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTS-R 120
            R   IK P S+  V   PL     G Y   L+ GTP Q + +  DTGS L W PC +  
Sbjct: 61  ER---IKKPLSSVDVVMEPLREVRDG-YLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLS 120

Query: 121 YLCSECSFPKIDPTGIPR-FVPKLSSSSKLVGCQNPKCAWI------FGPDVKSQCRSCN 180
           + C EC   K +    P  F P  SS+S    C +  C  I      F P   + C    
Sbjct: 121 FDCIECYDLKNNDLKSPSVFSPLHSSTSFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSM 180

Query: 181 PKTENCTQTCPAYVVQYGSGS-TAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGF 240
                C + CP++   YG G   +G+L  + L    + +P F  GC   +  +P GIAGF
Sbjct: 181 LLKSTCVRPCPSFAYTYGEGGLISGILTRDILKARTRDVPRFSFGCVTSTYREPIGIAGF 240

Query: 241 GRGSESLPSQMGL--KKFAYCLASRKFDDSPH-SGQLILDSTGVK---SSGLTYTPFRQN 300
           GRG  SLPSQ+G   K F++C    KF ++P+ S  LIL ++ +    +  L +TP    
Sbjct: 241 GRGLLSLPSQLGFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNT 300

Query: 301 PSVSNNAYKEYYYLNIRKIIVGNQ--AVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPV 360
           P   N+     YY+ +  I +G      +VP         GNGG ++DSG+T+T + +P 
Sbjct: 301 PMYPNS-----YYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPF 360

Query: 361 LEVVAREFEKQLANWTRATDVETLTGLRPCFDI--------SKEKSVK--FPELIFQFKG 420
              +    +  +  + RAT+ E+ TG   C+ +        S E  V   FP + F F  
Sbjct: 361 YSQLLTTLQSTI-TYPRATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLN 420

Query: 421 GAKWALPLNNYFALVS--SSGVACLTVVTHQMEDGGGGGPSVILGAFQQQNFYVEYDLVN 455
            A   LP  N F  +S  S G     ++   MED G  GP+ + G+FQQQN  V YDL  
Sbjct: 421 NATLLLPQGNSFYAMSAPSDGSVVQCLLFQNMED-GDYGPAGVFGSFQQQNVKVVYDLEK 478

BLAST of CSPI03G35440 vs. TAIR 10
Match: AT2G42980.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 174.1 bits (440), Expect = 2.6e-43
Identity = 127/398 (31.91%), Positives = 183/398 (45.98%), Query Frame = 0

Query: 79  GAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSS 138
           G Y   +  GTP +   LI DTGS L W  C   Y C   +    D        PK S+S
Sbjct: 158 GEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYD--------PKTSAS 217

Query: 139 SKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TAGLLLSETLD 198
            K + C +P+C+ I  PD   QC S N       Q+CP Y   YG  S T G    ET  
Sbjct: 218 FKNITCNDPRCSLISSPDPPVQCESDN-------QSCP-YFYWYGDRSNTTGDFAVETFT 277

Query: 199 F---------PDKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGL---KKFAY 258
                      + K+ N + GC   +       SG+ G GRG  S  SQ+       F+Y
Sbjct: 278 VNLTTTEGGSSEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSY 337

Query: 259 CLASRKFDDSPHSGQLIL--DSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVG 318
           CL  R   ++  S +LI   D   +  + L +T F        N+ + +YY+ I+ I+VG
Sbjct: 338 CLVDRN-SNTNVSSKLIFGEDKDLLNHTNLNFTSFVNG---KENSVETFYYIQIKSILVG 397

Query: 319 NQAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREF-EKQLANWTRATDVET 378
            +A+ +P +      DG+GG+IIDSG+T ++  +P  E++  +F EK   N+    D   
Sbjct: 398 GKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPV 457

Query: 379 LTGLRPCFDIS--KEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQME 438
           L    PCF++S  +E ++  PEL   F  G  W  P  N F  +S   + CL ++     
Sbjct: 458 LD---PCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWLSED-LVCLAIL----- 517

Query: 439 DGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 456
            G       I+G +QQQNF++ YD    RLGF    C+
Sbjct: 518 -GTPKSTFSIIGNYQQQNFHILYDTKRSRLGFTPTKCA 525

BLAST of CSPI03G35440 vs. TAIR 10
Match: AT2G03200.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 162.9 bits (411), Expect = 5.9e-40
Identity = 158/494 (31.98%), Positives = 216/494 (43.72%), Query Frame = 0

Query: 2   ASPSPLSFFYLLLFSSLSAIAHSN----PITLPLN--------SFPHLSSPDPLQALTFL 61
           +S S L  F+L+LFS L +++ S       TLP N        S  H+ S   L  +  +
Sbjct: 5   SSSSLLFPFFLILFSCLISVSSSRRSLIDRTLPKNLPRSGFRLSLRHVDSGKNLTKIQKI 64

Query: 62  ASSSQTRAHQI------------KTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLI 121
                   H++              P   +  K+P    S G +   LS G P      I
Sbjct: 65  QRGINRGFHRLNRLGAVAVLAVASKPDDTNNIKAPTHGGS-GEFLMELSIGNPAVKYSAI 124

Query: 122 FDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDV 181
            DTGS L+W  C     C+EC      PT  P F P+ SSS   VGC +  C  +     
Sbjct: 125 VDTGSDLIWTQCKP---CTECF---DQPT--PIFDPEKSSSYSKVGCSSGLCNAL----- 184

Query: 182 KSQCRSCNPKTENCTQTCPAYVVQYGS-GSTAGLLLSETLDFPDK-KIPNFVVGCSFLS- 241
                +CN   + C      Y+  YG   ST GLL +ET  F D+  I     GC   + 
Sbjct: 185 --PRSNCNEDKDAC-----EYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENE 244

Query: 242 ---IHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGQL--------ILDST 301
                Q SG+ G GRG  SL SQ+   KF+YCL S   +DS  S  L        I++ T
Sbjct: 245 GDGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTS--IEDSEASSSLFIGSLASGIVNKT 304

Query: 302 GVKSSG-LTYT-PFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKYLVPGPDGNGGS 361
           G    G +T T    +NP         +YYL ++ I VG + + V         DG GG 
Sbjct: 305 GASLDGEVTKTMSLLRNPD-----QPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGM 364

Query: 362 IIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPCFDI-SKEKSVKFPEL 421
           IIDSG+T T++++   +V+  EF  ++   +   D    TGL  CF +    K++  P++
Sbjct: 365 IIDSGTTITYLEETAFKVLKEEFTSRM---SLPVDDSGSTGLDLCFKLPDAAKNIAVPKM 424

Query: 422 IFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGGGGPSVILGAFQQQNFYVEY 455
           IF FK GA   LP  NY    SS+GV CL +       G   G S I G  QQQNF V +
Sbjct: 425 IFHFK-GADLELPGENYMVADSSTGVLCLAM-------GSSNGMS-IFGNVQQQNFNVLH 458

The following BLAST results are available for this feature:

Match Name      E-value    Identity  Description
Q940R4          3.4e-48    29.74     Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g1656...
Q9LNJ3          1.2e-37    30.75     Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ...
Q766C2          9.6e-35    31.85     Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S...
Q766C3          1.4e-33    29.20     Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S...
Q6F4N5          7.6e-32    27.14     Aspartyl protease 25 OS=Oryza sativa subsp. japonica OX=39947 GN=AP25 PE=2 SV=1

Match Name      E-value    Identity  Description
A0A0A0LBI9      4.6e-266   99.12     Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G77844...
A0A5A7TRK2      1.0e-252   94.07     Aspartic proteinase nepenthesin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A1S3B6B5      1.0e-252   94.07     aspartic proteinase nepenthesin-2 OS=Cucumis melo OX=3656 GN=LOC103486666 PE=3 S...
A0A6J1IXY3      2.7e-234   87.09     probable aspartyl protease At4g16563 OS=Cucurbita maxima OX=3661 GN=LOC111481640...
A0A6J1F3G5      7.9e-234   87.31     probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC1114418...

Match Name      E-value    Identity  Description
XP_004136706.1  9.5e-266   99.12     probable aspartyl protease At4g16563 [Cucumis sativus] >KGN59188.1 hypothetical ...
XP_008442902.1  2.1e-252   94.07     PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo] >KAA0043829.1 aspart...
XP_038905730.1  6.2e-249   92.31     probable aspartyl protease At4g16563 [Benincasa hispida]
XP_023528159.1  3.3e-234   87.75     probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo]
XP_022982947.1  5.6e-234   87.09     probable aspartyl protease At4g16563 [Cucurbita maxima]

Match Name      E-value    Identity  Description
AT3G52500.1     6.8e-153   56.75     Eukaryotic aspartyl protease family protein
AT4G16563.1     2.4e-49    29.74     Eukaryotic aspartyl protease family protein
AT5G45120.1     5.9e-48    31.63     Eukaryotic aspartyl protease family protein
AT2G42980.1     2.6e-43    31.91     Eukaryotic aspartyl protease family protein
AT2G03200.1     5.9e-40    31.98     Eukaryotic aspartyl protease family protein

InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25

IPR Term   IPR Description                        Source       Source Term     Source Description            Alignment
IPR001461  Aspartic peptidase A1 family           PRINTS       PR00792         PEPSIN                        coord: 87..107, score: 50.56; coord: 321..332, score: 29.09; coord: 426..441, score: 30.89
IPR032799  Xylanase inhibitor, C-terminal         PFAM         PF14541         TAXi_C                        coord: 290..450, e-value: 2.3E-32, score: 112.0
IPR021109  Aspartic peptidase domain superfamily  GENE3D       2.40.70.10      Acid Proteases                coord: 64..259, e-value: 4.1E-34, score: 120.2
IPR021109  Aspartic peptidase domain superfamily  GENE3D       2.40.70.10      Acid Proteases                coord: 260..455, e-value: 4.2E-50, score: 172.0
IPR021109  Aspartic peptidase domain superfamily  SUPERFAMILY  50630           Acid proteases                coord: 75..454
IPR032861  Xylanase inhibitor, N-terminal         PFAM         PF14543         TAXi_N                        coord: 81..258, e-value: 1.9E-29, score: 103.1
None       No IPR available                       PANTHER      PTHR47967:SF36  BNACNNG47670D PROTEIN         coord: 8..454
None       No IPR available                       PANTHER      PTHR47967       OS07G0603500 PROTEIN-RELATED  coord: 8..454
IPR001969  Aspartic peptidase, active site        PROSITE      PS00141         ASP_PROTEASE                  coord: 96..107
IPR033121  Peptidase family A1 domain             PROSITE      PS51767         PEPTIDASE_A1                  coord: 81..450, score: 33.401329
IPR034161  Pepsin-like domain, plant              CDD          cd05476         pepsin_A_like_plant           coord: 80..454, e-value: 3.67328E-80, score: 247.561
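
The `coord: start..end` entries above are residue ranges on the predicted protein, so they can be compared directly. The following is a minimal sketch, not part of the database output, that checks whether the smaller signatures fall inside the PROSITE Peptidase A1 domain (81..450); the helper names `parse_coord` and `contains` are hypothetical.

```python
from typing import Tuple

def parse_coord(text: str) -> Tuple[int, int]:
    """Turn a 'coord: start..end' string into an integer (start, end) pair."""
    start, end = text.replace("coord:", "").strip().split("..")
    return int(start), int(end)

def contains(outer: Tuple[int, int], inner: Tuple[int, int]) -> bool:
    """Return True if the inner range lies entirely within the outer range."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]

# Ranges copied from the InterPro annotations listed above
peptidase_a1 = parse_coord("coord: 81..450")   # PROSITE PS51767
taxi_n = parse_coord("coord: 81..258")         # PFAM PF14543
taxi_c = parse_coord("coord: 290..450")        # PFAM PF14541
active_site = parse_coord("coord: 96..107")    # PROSITE PS00141

for name, rng in [("TAXi_N", taxi_n), ("TAXi_C", taxi_c), ("ASP_PROTEASE", active_site)]:
    print(name, rng, contains(peptidase_a1, rng))
```

With the ranges listed here, all three signatures lie within the Peptidase A1 domain.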

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI03G35440.1  CSPI03G35440.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis
molecular_function  GO:0004190      aspartic-type endopeptidase activity