Cucsa.241660 (gene) Cucumber (Gy14) v1

NameCucsa.241660
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v1)
DescriptionAspartyl protease family protein
Locationscaffold02047 : 1597082 .. 1598413 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTTCTCCTTCCCCTCTCTCTTTCTTCTACCTCCTCCTCTTCTCCTCTCTTTCCGCCATTGCCCACTCCAATCCCATCACTCTCCCTCTCAACTCTTTTCCCCACCTTTCTTCTCCAGATCCACTCCAGGCTCTCACTTTCCTCGCCTCTTCTTCCCAGACCAGAGCCCATCAAATCAAAACCCCCAAATCCAACTCTGTTTTCAAGTCCCCTCTCTCCCCCCATAGCTATGGAGCTTACTCCACTCCACTCAGCTTTGGTACTCCACAACAGACTCTGCATTTGATCTTCGATACAGGTAGTAGCCTCGTTTGGTTCCCTTGCACTTCCCGATATCTCTGTTCTGAATGTTCCTTCCCCAAAATAGATCCAACTGGAATCCCCAGATTTGTCCCCAAATTGTCCTCCTCTTCTAAGCTTGTCGGTTGCCAGAATCCCAAATGTTCCTGGATTTTTGGTCCCGATGTCAAATCTCAGTGCCGGAGTTGTAACCCCAAAACAGAGAACTGTACCCAAACTTGCCCTGCTTACGTTGTTCAATATGGTTCCGGTTCCACGGCTGGGCTTTTGCTATCAGAAACCCTTGATTTTCCTGATAAAAAAaTCCCTAATTTTGTTGTTGGGTGTTCGTTTTTGTCGATCCATCAGCCCTCTGGAATCGCCGGATTTGGCCGAGGATCTGAATCGCTCCCCTCGCAAATGGGTCTCAAGAAATTCGCTTACTGCCTAGCGTCTCGGAAATTCGACGACTCGCCGCATTCCGGCCAGCTGATTTTAGATTCAACCGGCGTGAAGTCCAGCGGCCTTACCTACACCCCATTCCGGCAGAACCCCTCTGTTTCTAACAACGCTTATAAAGAATACTATTACTTAAACATACGGAAAATCATCGTCGGAAACCAGGCTGTGAAGGTTCCATACAAGTTTCTAGTGCCGGGCCCCGATGGGAACGGCGGATCTATCATCGATTCCGGCTCCACCTTCACGTTTATGGACAAACCAGTCTTGGAGGTGGTGGCGCGAGAGTTCGAGAAGCAATTGGCAAATTGGACGAGAGCCACCGATGTGGAAACTCTCACTGGATTACGGCCGTGTTTCGACATTTCGAAGGAGAAATCGGTGAAGTTTCCGGAGTTGATTTTCCAATTTAAAGGTGGAGCGAAATGGGCTCTGCCGTTGAATAACTATTTCGCTTTAGTCAGTAGCTCCGGCGTGGCGTGTTTGACAGTAGTAACGCATCAGATGGAGGACGGCGGTGGCGGTGGCGGTGGGCCGTCTGTGATTTTAGGGGCTTTCCAGCAGCAGAATTTCTATGTGGAATATGACTTG

mRNA sequence

ATGGcttctccttcccctctctctttcttctacctcctcctcttctcctctctttccgccattgcccactccaatcccatcactctccctctcaactcttttccccacctttcttctccagatccactccaggctctcactttcctcgcctcttcttcccAGACCAGAGCCCATCAAATCAAAACCCCCAAATCCAACTCTGTTTTCAAGTCCCCTCTCTCCCCCCATAGCTATGGAGCTTACTCCACTCCACTCAGCTTTGGTACTCCACAACAGACTCTGCATTTGATCTTCGATACAGGTAGTAGCCTCGTTTGGTTCCCTTGCACTTCCCGATATCTCTGTTCTGAATGTTCCTTCCCCAAAATAGATCCAACTGGAATCCCCAGATTTGTCCCCAAATTGTCCTCCTCTTCTAAGCTTGTCGGTTGCCAGAATCCCAAATGTTCCTGGATTTTTGGTCCCGATGTCAAATCTCAGTGCCGGAGTTGTAACCCCAAAACAGAGAACTGTACCCAAACTTGCCCTGCTTACGTTGTTCAATATGGTTCCGGTTCCACGGCTGGGCTTTTGCTATCAGAAACCCTTGATTTTCCTGATAAAAAAATCCCTAATTTTGTTGTTGGGTGTTCGTTTTTGTCGATCCATCAGCCCTCTGGAATCGCCGGATTTGGCCGAGGATCTGAATCGCTCCCCTCGCAAATGGGTCTCAAGAAATTCGCTTACTGCCTAGCGTCTCGGAAATTCGACGACTCGCCGCATTCCGGCCAGCTGATTTTAGATTCAACCGGCGTGAAGTCCAGCGGCCTTACCTACACCCCATTCCGGCAGAACCCCTCTGTTTCTAACAACGCTTATAAAGAATACTATTACTTAAACATACGGAAAATCATCGTCGGAAACCAGGCTGTGAAGGTTCCATACAAGTTTCTAGTGCCGGGCCCCGATGGGAACGGCGGATCTATCATCGATTCCGGCTCCACCTTCACGTTTATGGACAAACCAGTCTTGGAGGTGGTGGCGCGAGAGTTCGAGAAGCAATTGGCAAATTGGACGAGAGCCACCGATGTGGAAACTCTCACTGGATTACGGCCGTGTTTCGACATTTCGAAGGAGAAATCGGTGAAGTTTCCGGAGTTGATTTTCCAATTTAAAGGTGGAGCGAAATGGGCTCTGCCGTTGAATAACTATTTCGCTTTAGTCAGTAGCTCCGGCGTGGCGTGTTTGACAGTAGTAACGCATCAGATGGAGGACGGCGGTGGCGGTGGCGGTGGGCCGTCTGTGATTTTAGGGGCTTTCCAGCAGCAGAATTTCTATGTGGAATATGACTTG

Coding sequence (CDS)

ATGGCTTCTCCTTCCCCTCTCTCTTTCTTCTACCTCCTCCTCTTCTCCTCTCTTTCCGCCATTGCCCACTCCAATCCCATCACTCTCCCTCTCAACTCTTTTCCCCACCTTTCTTCTCCAGATCCACTCCAGGCTCTCACTTTCCTCGCCTCTTCTTCCCAGACCAGAGCCCATCAAATCAAAACCCCCAAATCCAACTCTGTTTTCAAGTCCCCTCTCTCCCCCCATAGCTATGGAGCTTACTCCACTCCACTCAGCTTTGGTACTCCACAACAGACTCTGCATTTGATCTTCGATACAGGTAGTAGCCTCGTTTGGTTCCCTTGCACTTCCCGATATCTCTGTTCTGAATGTTCCTTCCCCAAAATAGATCCAACTGGAATCCCCAGATTTGTCCCCAAATTGTCCTCCTCTTCTAAGCTTGTCGGTTGCCAGAATCCCAAATGTTCCTGGATTTTTGGTCCCGATGTCAAATCTCAGTGCCGGAGTTGTAACCCCAAAACAGAGAACTGTACCCAAACTTGCCCTGCTTACGTTGTTCAATATGGTTCCGGTTCCACGGCTGGGCTTTTGCTATCAGAAACCCTTGATTTTCCTGATAAAAAAaTCCCTAATTTTGTTGTTGGGTGTTCGTTTTTGTCGATCCATCAGCCCTCTGGAATCGCCGGATTTGGCCGAGGATCTGAATCGCTCCCCTCGCAAATGGGTCTCAAGAAATTCGCTTACTGCCTAGCGTCTCGGAAATTCGACGACTCGCCGCATTCCGGCCAGCTGATTTTAGATTCAACCGGCGTGAAGTCCAGCGGCCTTACCTACACCCCATTCCGGCAGAACCCCTCTGTTTCTAACAACGCTTATAAAGAATACTATTACTTAAACATACGGAAAATCATCGTCGGAAACCAGGCTGTGAAGGTTCCATACAAGTTTCTAGTGCCGGGCCCCGATGGGAACGGCGGATCTATCATCGATTCCGGCTCCACCTTCACGTTTATGGACAAACCAGTCTTGGAGGTGGTGGCGCGAGAGTTCGAGAAGCAATTGGCAAATTGGACGAGAGCCACCGATGTGGAAACTCTCACTGGATTACGGCCGTGTTTCGACATTTCGAAGGAGAAATCGGTGAAGTTTCCGGAGTTGATTTTCCAATTTAAAGGTGGAGCGAAATGGGCTCTGCCGTTGAATAACTATTTCGCTTTAGTCAGTAGCTCCGGCGTGGCGTGTTTGACAGTAGTAACGCATCAGATGGAGGACGGCGGTGGCGGTGGCGGTGGGCCGTCTGTGATTTTAGGGGCTTTCCAGCAGCAGAATTTCTATGTGGAATATGACTTG

Protein sequence

MASPSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQIKTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDL
BLAST of Cucsa.241660 vs. Swiss-Prot
Match: ASP63_ARATH (Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana GN=At4g16563 PE=2 SV=1)

HSP 1 Score: 177.6 bits (449), Expect = 3.1e-43
Identity = 146/490 (29.80%), Postives = 214/490 (43.67%), Query Frame = 1

Query: 7   LSFFYLLLFSSLSA-----IAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQIK 66
           L +++    SSLS      ++HS      L++  H SSP  L   +   SS++ R H  K
Sbjct: 14  LQYYFHFSVSSLSTPLLLHLSHS------LSTSKHSSSPLHLLKSSSSRSSARFRRHHHK 73

Query: 67  TPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFP 126
             +       P+S  S   Y   LS G+    + L  DTGS LVWFPC   + C  C   
Sbjct: 74  QQQQQ--LSLPISSGS--DYLISLSVGSSSSAVSLYLDTGSDLVWFPCRP-FTCILCESK 133

Query: 127 KIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQ-CRSCNP-----KTENCTQT- 186
            + P+        LSSS+  V C +P CS        S  C   N      +T +C  + 
Sbjct: 134 PLPPSP----PSSLSSSATTVSCSSPSCSAAHSSLPSSDLCAISNCPLDFIETGDCNTSS 193

Query: 187 --CPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLP 246
             CP +   YG GS    L S++L  P   + NF  GC+  ++ +P G+AGFGRG  SLP
Sbjct: 194 YPCPPFYYAYGDGSLVAKLYSDSLSLPSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLP 253

Query: 247 SQMGL------KKFAYCLASRKFDDS--PHSGQLIL--------------------DSTG 306
           +Q+ +        F+YCL S  FD         LIL                    D   
Sbjct: 254 AQLAVHSPHLGNSFSYCLVSHSFDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEK 313

Query: 307 VKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIID 366
            K +   +T   +NP      +  +Y ++++ I +G + +  P        +G GG ++D
Sbjct: 314 KKKNEFVFTEMLENPK-----HPYFYSVSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVD 373

Query: 367 SGSTFTFMDKPVLEVVAREFEKQLAN-WTRATDVETLTGLRPCFDISKEKSVKFPELIFQ 426
           SG+TFT +       V  EF+ ++     RA  VE  +G+ PC+ ++  ++VK P L+  
Sbjct: 374 SGTTFTMLPAKFYNSVVEEFDSRVGRVHERADRVEPSSGMSPCYYLN--QTVKVPALVLH 433

Query: 427 FKGG-AKWALPLNNYFALVSSSG--------VACLTVVTHQMEDGGGGGGGPSVILGAFQ 445
           F G  +   LP  NYF      G        + CL ++    E    GG G   ILG +Q
Sbjct: 434 FAGNRSSVTLPRRNYFYEFMDGGDGKEEKRKIGCLMLMNGGDESELRGGTG--AILGNYQ 479

BLAST of Cucsa.241660 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 149.8 bits (377), Expect = 6.9e-35
Identity = 139/454 (30.62%), Postives = 190/454 (41.85%), Query Frame = 1

Query: 15  FSSLSAIAHSNPITLPLNSFPHLSS---PDPL-----QALTFLASSSQTRAHQIK----- 74
           F S S    S+ ITL L+    LSS   PD L     Q  +    S  T A QI      
Sbjct: 60  FESGSDSESSSSITLNLDHIDALSSNKTPDELFSSRLQRDSRRVKSIATLAAQIPGRNVT 119

Query: 75  -TPKSNSVFKSPLSPHSYGA--YSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSEC 134
             P+      S +S  S G+  Y T L  GTP + ++++ DTGS +VW  C     C  C
Sbjct: 120 HAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAP---CRRC 179

Query: 135 SFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAY 194
            + + DP     F P+ S +   + C +P C        +     CN + + C      Y
Sbjct: 180 -YSQSDPI----FDPRKSKTYATIPCSSPHCR-------RLDSAGCNTRRKTC-----LY 239

Query: 195 VVQYGSGS-TAGLLLSETLDFPDKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQ 254
            V YG GS T G   +ETL F   ++    +GC   +       +G+ G G+G  S P Q
Sbjct: 240 QVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQ 299

Query: 255 MGLK---KFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYY 314
            G +   KF+YCL  R     P S   ++      S    +TP   NP +       +YY
Sbjct: 300 TGHRFNQKFSYCLVDRSASSKPSS---VVFGNAAVSRIARFTPLLSNPKLDT-----FYY 359

Query: 315 LNIRKIIVGNQAVK-VPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLAN 374
           + +  I VG   V  V          GNGG IIDSG++ T + +P    +   F      
Sbjct: 360 VGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKT 419

Query: 375 WTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLT 434
             RA D         CFD+S    VK P ++  F+ GA  +LP  NY   V ++G  C  
Sbjct: 420 LKRAPDFSLFD---TCFDLSNMNEVKVPTVVLHFR-GADVSLPATNYLIPVDTNGKFCFA 472

Query: 435 VVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDL 445
                     G  GG S+I G  QQQ F V YDL
Sbjct: 480 F--------AGTMGGLSII-GNIQQQGFRVVYDL 472

BLAST of Cucsa.241660 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 141.0 bits (354), Expect = 3.2e-32
Identity = 119/373 (31.90%), Postives = 166/373 (44.50%), Query Frame = 1

Query: 79  GAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSS 138
           G Y   ++ GTP  +   I DTGS L+W  C     C++C      PT  P F P+ SSS
Sbjct: 94  GEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEP---CTQCFS---QPT--PIFNPQDSSS 153

Query: 139 SKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTA-GLLLSETLD 198
              + C++  C      D+ S         E C      Y   YG GST  G + +ET  
Sbjct: 154 FSTLPCESQYCQ-----DLPS---------ETCNNNECQYTYGYGDGSTTQGYMATETFT 213

Query: 199 FPDKKIPNFVVGCSF----LSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSP 258
           F    +PN   GC            +G+ G G G  SLPSQ+G+ +F+YC+ S     SP
Sbjct: 214 FETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYG-SSSP 273

Query: 259 HSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVP 318
            +  L   ++GV     + T       + ++    YYY+ ++ I VG   + +P      
Sbjct: 274 STLALGSAASGVPEGSPSTT------LIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQL 333

Query: 319 GPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPCFDISKEK 378
             DG GG IIDSG+T T++ +     VA+ F  Q+      T  E+ +GL  CF    + 
Sbjct: 334 QDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQI---NLPTVDESSSGLSTCFQQPSDG 393

Query: 379 S-VKFPELIFQFKGGAKWALPLNNYFALVS-SSGVACLTVVTHQMEDGGGGGGGPSVILG 438
           S V+ PE+  QF GG    L L     L+S + GV CL +       G     G S I G
Sbjct: 394 STVQVPEISMQFDGG---VLNLGEQNILISPAEGVICLAM-------GSSSQLGIS-IFG 423

Query: 439 AFQQQNFYVEYDL 445
             QQQ   V YDL
Sbjct: 454 NIQQQETQVLYDL 423

BLAST of Cucsa.241660 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 136.7 bits (343), Expect = 6.1e-31
Identity = 113/376 (30.05%), Postives = 158/376 (42.02%), Query Frame = 1

Query: 79  GAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSS 138
           G Y   LS GTP Q    I DTGS L+W  C     C   S P  +P G        SSS
Sbjct: 93  GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQG--------SSS 152

Query: 139 SKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TAGLLLSETLD 198
              + C +  C  +  P               C+     Y   YG GS T G + +ETL 
Sbjct: 153 FSTLPCSSQLCQALSSP--------------TCSNNFCQYTYGYGDGSETQGSMGTETLT 212

Query: 199 FPDKKIPNFVVGCSF----LSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSP 258
           F    IPN   GC            +G+ G GRG  SLPSQ+ + KF+YC+       S 
Sbjct: 213 FGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMT--PIGSST 272

Query: 259 HSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKV-PYKFLV 318
            S  L+       ++G   T   Q+  +       +YY+ +  + VG+  + + P  F +
Sbjct: 273 PSNLLLGSLANSVTAGSPNTTLIQSSQIPT-----FYYITLNGLSVGSTRLPIDPSAFAL 332

Query: 319 PGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPCFDI--- 378
              +G GG IIDSG+T T+      + V +EF  Q+       ++  + G    FD+   
Sbjct: 333 NSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQI-------NLPVVNGSSSGFDLCFQ 392

Query: 379 --SKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGGGGGGPS 438
             S   +++ P  +  F GG    LP  NYF +  S+G+ CL +        G    G S
Sbjct: 393 TPSDPSNLQIPTFVMHFDGG-DLELPSENYF-ISPSNGLICLAM--------GSSSQGMS 421

Query: 439 VILGAFQQQNFYVEYD 444
            I G  QQQN  V YD
Sbjct: 453 -IFGNIQQQNMLVVYD 421

BLAST of Cucsa.241660 vs. Swiss-Prot
Match: AED3_ARATH (Aspartyl protease AED3 OS=Arabidopsis thaliana GN=AED3 PE=1 SV=1)

HSP 1 Score: 119.4 bits (298), Expect = 1.0e-25
Identity = 116/416 (27.88%), Postives = 171/416 (41.11%), Query Frame = 1

Query: 36  HLSSPDPLQALTFLASSSQTRAHQIKTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLH 95
           H++S D    LT+L+S    +      PK  SV  +  +    G Y      GTP Q + 
Sbjct: 66  HMASSDS-HRLTYLSSLVAGK------PKPTSVPVASGNQLHIGNYVVRAKLGTPPQLMF 125

Query: 96  LIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGP 155
           ++ DT +  VW PC+    CS CS           F    SS+   V C   +C+   G 
Sbjct: 126 MVLDTSNDAVWLPCSG---CSGCS------NASTSFNTNSSSTYSTVSCSTAQCTQARG- 185

Query: 156 DVKSQCRSCNPKTENCTQTCPAYVVQYGSGST-AGLLLSETLDFPDKKIPNFVVGCSFLS 215
                C S +P+   C+     +   YG  S+ +  L+ +TL      IPNF  GC   +
Sbjct: 186 ---LTCPSSSPQPSVCS-----FNQSYGGDSSFSASLVQDTLTLAPDVIPNFSFGCINSA 245

Query: 216 IHQ---PSGIAGFGRGSESLPSQ---MGLKKFAYCLASRKFDDSPHSGQLILDSTGVKSS 275
                 P G+ G GRG  SL SQ   +    F+YCL S  F     SG L L   G   S
Sbjct: 246 SGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPS--FRSFYFSGSLKLGLLGQPKS 305

Query: 276 GLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGST 335
            + YTP  +NP   +      YY+N+  + VG+  V V   +L    +   G+IIDSG+ 
Sbjct: 306 -IRYTPLLRNPRRPS-----LYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGTV 365

Query: 336 FTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGA 395
            T   +PV E +  EF KQ+      +   TL     CF    E     P++        
Sbjct: 366 ITRFAQPVYEAIRDEFRKQV----NVSSFSTLGAFDTCFSADNENVA--PKITLHMT-SL 425

Query: 396 KWALPLNNYFALVSSSGVACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDL 445
              LP+ N     S+  + CL++   +             ++   QQQN  + +D+
Sbjct: 426 DLKLPMENTLIHSSAGTLTCLSMAGIRQ-----NANAVLNVIANLQQQNLRILFDV 436

BLAST of Cucsa.241660 vs. TrEMBL
Match: A0A0A0LBI9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G778440 PE=3 SV=1)

HSP 1 Score: 911.8 bits (2355), Expect = 3.3e-262
Identity = 444/444 (100.00%), Postives = 444/444 (100.00%), Query Frame = 1

Query: 1   MASPSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQI 60
           MASPSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQI
Sbjct: 1   MASPSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQI 60

Query: 61  KTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF 120
           KTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF
Sbjct: 61  KTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF 120

Query: 121 PKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 180
           PKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVV
Sbjct: 121 PKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 180

Query: 181 QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 240
           QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF
Sbjct: 181 QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 240

Query: 241 AYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVG 300
           AYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVG
Sbjct: 241 AYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVG 300

Query: 301 NQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETL 360
           NQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETL
Sbjct: 301 NQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETL 360

Query: 361 TGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGG 420
           TGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGG
Sbjct: 361 TGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGG 420

Query: 421 GGGGGPSVILGAFQQQNFYVEYDL 445
           GGGGGPSVILGAFQQQNFYVEYDL
Sbjct: 421 GGGGGPSVILGAFQQQNFYVEYDL 444

BLAST of Cucsa.241660 vs. TrEMBL
Match: M5VQG8_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005104mg PE=3 SV=1)

HSP 1 Score: 579.3 bits (1492), Expect = 3.9e-162
Identity = 283/463 (61.12%), Postives = 350/463 (75.59%), Query Frame = 1

Query: 5   SPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQIKTPK 64
           +P S   L    SL  +  S+ ITLPL+ FP+  S DPLQAL+F AS+S +RAH IK  +
Sbjct: 3   NPKSLLTLTSLFSLFLLTLSSKITLPLSPFPNHPSSDPLQALSFHASASISRAHHIKNSR 62

Query: 65  --SNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPK 124
             ++S+ + PL PHSYG YS  L+FGTP QT   I DTGSSLVWFPCT RY+CS C FP 
Sbjct: 63  KPNSSLTQVPLFPHSYGDYSVSLNFGTPPQTSSFIMDTGSSLVWFPCTKRYICSRCQFPN 122

Query: 125 IDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCN-PKTENCTQTCPAYVVQ 184
           I+P  IP F PKLSSSSK+VGCQNPKC WIFGP+VKS+C +CN P  +NC+Q CP Y++Q
Sbjct: 123 INPAKIPTFKPKLSSSSKIVGCQNPKCGWIFGPEVKSKCPNCNNPSHQNCSQACPTYIIQ 182

Query: 185 YGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFA 244
           YGSG+TAG+LLSETLDFP K +P+F+VGCSF+SI QP+GIAGFGRG +SLP+QMGL KF+
Sbjct: 183 YGSGTTAGILLSETLDFPKKIVPDFLVGCSFVSIRQPAGIAGFGRGPQSLPAQMGLTKFS 242

Query: 245 YCLASRKFDDSPHSGQLILDSTGVKSSG--------------------LTYTPFRQNPSV 304
           YCL S +FDD+P S  L+L S+   SS                     L+ TPF++NP  
Sbjct: 243 YCLVSHRFDDTPQSSDLVLYSSSSGSSSSSEEEPTIAESQRNKTKLQSLSSTPFQKNPGP 302

Query: 305 SNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVA 364
            N+A++EYYY+ +RK+IVGN+ VK+PYKFLVPG D +GG+I+DSGSTFTFM+KPV E VA
Sbjct: 303 PNSAFREYYYIMLRKVIVGNKNVKIPYKFLVPGADSSGGTIVDSGSTFTFMEKPVFEPVA 362

Query: 365 REFEKQLANWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALV 424
           +EFE Q+AN+TRA D+E  TGLRPCFDISKEK V FPEL+FQFKGGAK  LP  NYF++V
Sbjct: 363 KEFEAQMANYTRAKDLENKTGLRPCFDISKEKKVDFPELVFQFKGGAKMELPSKNYFSMV 422

Query: 425 SSSGVACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDL 445
           SSSGV CLT+VT  +  G GG GGP++ILG +QQQ+F+VEYDL
Sbjct: 423 SSSGVVCLTIVTDGVV-GPGGNGGPAIILGNYQQQDFHVEYDL 464

BLAST of Cucsa.241660 vs. TrEMBL
Match: B9T7L5_RICCO (Pepsin A, putative OS=Ricinus communis GN=RCOM_0308790 PE=3 SV=1)

HSP 1 Score: 550.8 bits (1418), Expect = 1.5e-153
Identity = 276/456 (60.53%), Postives = 346/456 (75.88%), Query Frame = 1

Query: 1   MASPSPLSFFYLLLFSS-LSAIAHSNPITLPLN-SFPHLSSPDPLQALTFLASSSQTRAH 60
           MA+ S L     LLFSS +      + IT+PL+ +     S DP + L  LA++S +RAH
Sbjct: 1   MATASSLLLLLFLLFSSFVFPFISPSTITIPLSPTITKRPSSDPWEYLNHLATTSISRAH 60

Query: 61  QIKTPKSN-SVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSE 120
            +K+PK+N S+ K+PL   SYG YS  LS GTP QT+ LI DTGSSLVWFPCTSRY+C+ 
Sbjct: 61  HLKSPKTNFSLIKTPLFSRSYGGYSMSLSLGTPSQTVKLIMDTGSSLVWFPCTSRYVCAS 120

Query: 121 CSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPA 180
           C+FP  D T IP+F+P+LSSSSKL+GC+NPKC+W+FG  V+S+C +CNP+ +NCTQ CP 
Sbjct: 121 CNFPNTDITKIPKFMPRLSSSSKLIGCKNPKCAWVFGSSVQSKCHNCNPQAQNCTQACPP 180

Query: 181 YVVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL 240
           Y++QYG GSTAGLLLSET++FP+K I +F+ GCS LS  QP GIAGFGR  ESLP Q+GL
Sbjct: 181 YIIQYGLGSTAGLLLSETINFPNKTISDFLAGCSLLSTRQPEGIAGFGRSQESLPLQLGL 240

Query: 241 KKFAYCLASRKFDDSPHSGQLILD----STGVKSSGLTYTPFRQN-PSVSNNAYKEYYYL 300
           KKF+YCL SR+FDDSP S  LILD    ++  K++GL+YTPF++N  S SN A++EYYY+
Sbjct: 241 KKFSYCLVSRRFDDSPVSSDLILDMGPSTSDSKTTGLSYTPFQKNLASQSNPAFQEYYYV 300

Query: 301 NIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWT 360
            +RKIIVG   VKVPY FLVPG DGNGG+I+DSGSTFTF++  V E++A+EFEKQ+AN+T
Sbjct: 301 MLRKIIVGKTHVKVPYSFLVPGSDGNGGTIVDSGSTFTFVEGHVFELLAKEFEKQMANYT 360

Query: 361 RATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVV 420
            AT+V+ LTGLRPCFDIS EKSV  P+L FQFKGGAK  LPL+NYFA V   GV CLT+V
Sbjct: 361 VATNVQKLTGLRPCFDISGEKSVVIPDLTFQFKGGAKMQLPLSNYFAFV-DMGVVCLTIV 420

Query: 421 THQMEDGGGGGG----GPSVILGAFQQQNFYVEYDL 445
           +      GG GG    GP++ILG FQQQNFY+EYDL
Sbjct: 421 SDNAAALGGDGGVRSSGPAIILGNFQQQNFYIEYDL 455

BLAST of Cucsa.241660 vs. TrEMBL
Match: B9HBQ5_POPTR (Aspartyl protease family protein OS=Populus trichocarpa GN=POPTR_0006s22110g PE=3 SV=1)

HSP 1 Score: 550.4 bits (1417), Expect = 2.0e-153
Identity = 277/451 (61.42%), Postives = 342/451 (75.83%), Query Frame = 1

Query: 7   LSFFYLLLFSSLSAIAHSNP---ITLPLNSFPH----LSSPDPLQALTFLASSSQTRAHQ 66
           LSF  L+  SS +      P   IT+PL++       +SS +P  AL  LAS S +RAH 
Sbjct: 10  LSFLILISSSSSTPTPTRTPTSTITIPLSAPSSTKLIVSSKNPWGALNHLASLSLSRAHH 69

Query: 67  IKTPKSN-SVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSEC 126
           IK+PK+  S+ K+PL P SYG YS  L+FGTP QT   + DTGSSLVWFPCTSRYLCS C
Sbjct: 70  IKSPKTKFSLLKTPLFPRSYGGYSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSRC 129

Query: 127 SFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAY 186
            FP I+ TGIP F+PK SSSS L+GC+N KCSW+FGP V+S+C+ C+P T+NCTQ+CP Y
Sbjct: 130 DFPNIEVTGIPTFIPKQSSSSNLIGCKNHKCSWLFGPKVQSKCQECDPTTQNCTQSCPPY 189

Query: 187 VVQYGSGSTAGLLLSETLDFPDKK-IPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL 246
           V+QYG GSTAGLLLSETLDFP KK IP F+VGCS  SI QP GIAGFGR  ESLPSQ+GL
Sbjct: 190 VIQYGLGSTAGLLLSETLDFPHKKTIPGFLVGCSLFSIRQPEGIAGFGRSPESLPSQLGL 249

Query: 247 KKFAYCLASRKFDDSPHSGQLILD----STGVKSSGLTYTPFRQNPSVSNNAYKEYYYLN 306
           KKF+YCL S  FDD+P S  L+LD    S   K+ GL+YTPF++NP+    A+++YYY+ 
Sbjct: 250 KKFSYCLVSHAFDDTPASSDLVLDTGSGSDDTKTPGLSYTPFQKNPTA---AFRDYYYVL 309

Query: 307 IRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTR 366
           +R I++G+  VKVPYKFLVPG DGNGG+I+DSG+TFTFM+KPV E+VA+EFEKQ+A++T 
Sbjct: 310 LRNIVIGDTHVKVPYKFLVPGSDGNGGTIVDSGTTFTFMEKPVYELVAKEFEKQVAHYTV 369

Query: 367 ATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVT 426
           AT+V+  TGLRPCF+IS EKSV  PE IF FKGGAK ALPL NYF+ V  SGV CLT+V+
Sbjct: 370 ATEVQNQTGLRPCFNISGEKSVSVPEFIFHFKGGAKMALPLANYFSFV-DSGVICLTIVS 429

Query: 427 HQMEDGGGGGGGPSVILGAFQQQNFYVEYDL 445
             M  G G GGGP++ILG +QQ+NF+VE+DL
Sbjct: 430 DNM-SGSGIGGGPAIILGNYQQRNFHVEFDL 455

BLAST of Cucsa.241660 vs. TrEMBL
Match: A0A0D2S3P3_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_006G150600 PE=3 SV=1)

HSP 1 Score: 545.4 bits (1404), Expect = 6.3e-152
Identity = 262/445 (58.88%), Postives = 332/445 (74.61%), Query Frame = 1

Query: 10  FYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQIKTPKS---- 69
           F+++  +  SA A +  I L L+ FPH SS  P Q L  L +SS  RAH +K PK+    
Sbjct: 12  FFIISTTLTSAGAAAATIKLSLSPFPHPSSSHPYQILNNLVTSSVARAHHLKHPKAKADN 71

Query: 70  --NSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKI 129
             +S+ ++ L PHSYG Y+  L FGTP QTL  I DTGSSL WFPCTSRYLCS+C+FP +
Sbjct: 72  TTSSLLRASLFPHSYGGYTISLKFGTPPQTLPFIMDTGSSLSWFPCTSRYLCSQCAFPNV 131

Query: 130 DPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYG 189
           DP  IP F PK SSS KLVGC+NPKCSW+FGPDV S+C+ C P +ENCTQTCP Y++QYG
Sbjct: 132 DPAKIPTFAPKRSSSKKLVGCRNPKCSWLFGPDVASRCQDCEPTSENCTQTCPPYLIQYG 191

Query: 190 SGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYC 249
            GSTAGLLL E L FP K   +F+VGCS LS  QP+GIAGFGR +ESLPSQ+GLKKF+YC
Sbjct: 192 LGSTAGLLLVENLVFPQKTFQDFLVGCSILSNRQPAGIAGFGRSAESLPSQLGLKKFSYC 251

Query: 250 LASRKFDDSPHSGQLILD----STGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIV 309
           L SR+FDD+  S  ++L+    S   K+ GL+YTPF +N   SN  +KE+YY+ +RKI+V
Sbjct: 252 LVSRRFDDTGVSSNMLLETGSGSGDAKTPGLSYTPFYRNQVASNPNFKEFYYVTLRKILV 311

Query: 310 GNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVET 369
           G++ VKVPY +LVPG DGNGG+I+DSGSTFTFMD+PV EVV++EFEKQ+ N+ RA ++E 
Sbjct: 312 GDKHVKVPYSYLVPGSDGNGGTIVDSGSTFTFMDRPVFEVVSKEFEKQMGNYRRAREIEN 371

Query: 370 LTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDG 429
           ++GL PCF+IS   S+KFPEL FQFKGGAK ALPL NYF+ V    VACL +V+  +  G
Sbjct: 372 ISGLAPCFNISGYTSIKFPELSFQFKGGAKMALPLVNYFSFVGDDKVACLMIVSDNVV-G 431

Query: 430 GGGGGGPSVILGAFQQQNFYVEYDL 445
            G  GGP++ILG+FQQQN+Y+E+D+
Sbjct: 432 QGSHGGPAIILGSFQQQNYYIEFDM 455

BLAST of Cucsa.241660 vs. TAIR10
Match: AT3G52500.1 (AT3G52500.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 521.9 bits (1343), Expect = 3.8e-148
Identity = 262/462 (56.71%), Postives = 333/462 (72.08%), Query Frame = 1

Query: 5   SPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHL--SSPDPLQALTFLASSSQTRAHQIK- 64
           S + FF+L+  S +SA+       LPL+ F H   S  DP  +L  LA SS  RAH++K 
Sbjct: 3   SSIFFFFLIFLSVVSAVK------LPLSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKH 62

Query: 65  --------------TPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWF 124
                         T  S +V KSPLS  SYG YS  LSFGTP QT+  +FDTGSSLVW 
Sbjct: 63  GTSIKPDEDALSSTTTASATVVKSPLSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWL 122

Query: 125 PCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPK 184
           PCTSRYLCS C F  +DPT IPRF+PK SSSSK++GCQ+PKC +++GP+V  QCR C+P 
Sbjct: 123 PCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSKIIGCQSPKCQFLYGPNV--QCRGCDPN 182

Query: 185 TENCTQTCPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRG 244
           T NCT  CP Y++QYG GSTAG+L++E LDFPD  +P+FVVGCS +S  QP+GIAGFGRG
Sbjct: 183 TRNCTVGCPPYILQYGLGSTAGVLITEKLDFPDLTVPDFVVGCSIISTRQPAGIAGFGRG 242

Query: 245 SESLPSQMGLKKFAYCLASRKFDDSPHSGQLILDS-----TGVKSSGLTYTPFRQNPSVS 304
             SLPSQM LK+F++CL SR+FDD+  +  L LD+     +G K+ GLTYTPFR+NP+VS
Sbjct: 243 PVSLPSQMNLKRFSHCLVSRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVS 302

Query: 305 NNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAR 364
           N A+ EYYYLN+R+I VG + VK+PYK+L PG +G+GGSI+DSGSTFTFM++PV E+VA 
Sbjct: 303 NKAFLEYYYLNLRRIYVGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAE 362

Query: 365 EFEKQLANWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVS 424
           EF  Q++N+TR  D+E  TGL PCF+IS +  V  PELIF+FKGGAK  LPL+NYF  V 
Sbjct: 363 EFASQMSNYTREKDLEKETGLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVG 422

Query: 425 SSGVACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDL 445
           ++   CLTVV+ +  +   GG GP++ILG+FQQQN+ VEYDL
Sbjct: 423 NTDTVCLTVVSDKTVN-PSGGTGPAIILGSFQQQNYLVEYDL 455

BLAST of Cucsa.241660 vs. TAIR10
Match: AT4G16563.1 (AT4G16563.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 177.6 bits (449), Expect = 1.7e-44
Identity = 146/490 (29.80%), Postives = 214/490 (43.67%), Query Frame = 1

Query: 7   LSFFYLLLFSSLSA-----IAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQIK 66
           L +++    SSLS      ++HS      L++  H SSP  L   +   SS++ R H  K
Sbjct: 14  LQYYFHFSVSSLSTPLLLHLSHS------LSTSKHSSSPLHLLKSSSSRSSARFRRHHHK 73

Query: 67  TPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFP 126
             +       P+S  S   Y   LS G+    + L  DTGS LVWFPC   + C  C   
Sbjct: 74  QQQQQ--LSLPISSGS--DYLISLSVGSSSSAVSLYLDTGSDLVWFPCRP-FTCILCESK 133

Query: 127 KIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQ-CRSCNP-----KTENCTQT- 186
            + P+        LSSS+  V C +P CS        S  C   N      +T +C  + 
Sbjct: 134 PLPPSP----PSSLSSSATTVSCSSPSCSAAHSSLPSSDLCAISNCPLDFIETGDCNTSS 193

Query: 187 --CPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLP 246
             CP +   YG GS    L S++L  P   + NF  GC+  ++ +P G+AGFGRG  SLP
Sbjct: 194 YPCPPFYYAYGDGSLVAKLYSDSLSLPSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLP 253

Query: 247 SQMGL------KKFAYCLASRKFDDS--PHSGQLIL--------------------DSTG 306
           +Q+ +        F+YCL S  FD         LIL                    D   
Sbjct: 254 AQLAVHSPHLGNSFSYCLVSHSFDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEK 313

Query: 307 VKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIID 366
            K +   +T   +NP      +  +Y ++++ I +G + +  P        +G GG ++D
Sbjct: 314 KKKNEFVFTEMLENPK-----HPYFYSVSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVD 373

Query: 367 SGSTFTFMDKPVLEVVAREFEKQLAN-WTRATDVETLTGLRPCFDISKEKSVKFPELIFQ 426
           SG+TFT +       V  EF+ ++     RA  VE  +G+ PC+ ++  ++VK P L+  
Sbjct: 374 SGTTFTMLPAKFYNSVVEEFDSRVGRVHERADRVEPSSGMSPCYYLN--QTVKVPALVLH 433

Query: 427 FKGG-AKWALPLNNYFALVSSSG--------VACLTVVTHQMEDGGGGGGGPSVILGAFQ 445
           F G  +   LP  NYF      G        + CL ++    E    GG G   ILG +Q
Sbjct: 434 FAGNRSSVTLPRRNYFYEFMDGGDGKEEKRKIGCLMLMNGGDESELRGGTG--AILGNYQ 479

BLAST of Cucsa.241660 vs. TAIR10
Match: AT5G45120.1 (AT5G45120.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 177.6 bits (449), Expect = 1.7e-44
Identity = 151/480 (31.46%), Postives = 219/480 (45.62%), Query Frame = 1

Query: 1   MASPSPLSFFYLLLFSSLS------AIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQ 60
           M + + + F +LL+   L+      A  H NP +   +SF  L+      +L    S +Q
Sbjct: 1   METQTHVLFLFLLITLLLNTTNKTQARQHKNPSSSS-SSFLVLTLTKSSVSLPTPKSQTQ 60

Query: 61  TRAHQIKTPKSN-SVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTS-R 120
            R   IK P S+  V   PL     G Y   L+ GTP Q + +  DTGS L W PC +  
Sbjct: 61  ER---IKKPLSSVDVVMEPLREVRDG-YLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLS 120

Query: 121 YLCSECSFPKIDPTGIPR-FVPKLSSSSKLVGCQNPKCSWI------FGPDVKSQCRSCN 180
           + C EC   K +    P  F P  SS+S    C +  C  I      F P   + C    
Sbjct: 121 FDCIECYDLKNNDLKSPSVFSPLHSSTSFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSM 180

Query: 181 PKTENCTQTCPAYVVQYGSGST-AGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGF 240
                C + CP++   YG G   +G+L  + L    + +P F  GC   +  +P GIAGF
Sbjct: 181 LLKSTCVRPCPSFAYTYGEGGLISGILTRDILKARTRDVPRFSFGCVTSTYREPIGIAGF 240

Query: 241 GRGSESLPSQMGL--KKFAYCLASRKFDDSPH-SGQLILDSTGVK---SSGLTYTPFRQN 300
           GRG  SLPSQ+G   K F++C    KF ++P+ S  LIL ++ +    +  L +TP    
Sbjct: 241 GRGLLSLPSQLGFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNT 300

Query: 301 PSVSNNAYKEYYYLNIRKIIVGNQ--AVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPV 360
           P   N+     YY+ +  I +G      +VP         GNGG ++DSG+T+T + +P 
Sbjct: 301 PMYPNS-----YYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPF 360

Query: 361 LEVVAREFEKQLANWTRATDVETLTGLRPCFDI--------SKEKSVK--FPELIFQFKG 420
              +    +  +  + RAT+ E+ TG   C+ +        S E  V   FP + F F  
Sbjct: 361 YSQLLTTLQSTI-TYPRATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLN 420

Query: 421 GAKWALPLNNYFALVS--SSGVACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDL 445
            A   LP  N F  +S  S G     ++   MED   G  GP+ + G+FQQQN  V YDL
Sbjct: 421 NATLLLPQGNSFYAMSAPSDGSVVQCLLFQNMED---GDYGPAGVFGSFQQQNVKVVYDL 466

BLAST of Cucsa.241660 vs. TAIR10
Match: AT2G03200.1 (AT2G03200.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 161.4 bits (407), Expect = 1.3e-39
Identity = 156/484 (32.23%), Postives = 214/484 (44.21%), Query Frame = 1

Query: 2   ASPSPLSFFYLLLFSSLSAIAHSNPI----TLPLN--------SFPHLSSPDPLQALTFL 61
           +S S L  F+L+LFS L +++ S       TLP N        S  H+ S   L  +  +
Sbjct: 5   SSSSLLFPFFLILFSCLISVSSSRRSLIDRTLPKNLPRSGFRLSLRHVDSGKNLTKIQKI 64

Query: 62  ASSSQTRAHQIKT------------PKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLI 121
                   H++              P   +  K+P    S G +   LS G P      I
Sbjct: 65  QRGINRGFHRLNRLGAVAVLAVASKPDDTNNIKAPTHGGS-GEFLMELSIGNPAVKYSAI 124

Query: 122 FDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDV 181
            DTGS L+W  C     C+EC      PT  P F P+ SSS   VGC +  C+ +     
Sbjct: 125 VDTGSDLIWTQCKP---CTECFD---QPT--PIFDPEKSSSYSKVGCSSGLCNAL----- 184

Query: 182 KSQCRSCNPKTENCTQTCPAYVVQYGS-GSTAGLLLSETLDFPDKK-IPNFVVGCSFLS- 241
                +CN   + C      Y+  YG   ST GLL +ET  F D+  I     GC   + 
Sbjct: 185 --PRSNCNEDKDACE-----YLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENE 244

Query: 242 ---IHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGQL--------ILDST 301
                Q SG+ G GRG  SL SQ+   KF+YCL S   +DS  S  L        I++ T
Sbjct: 245 GDGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTS--IEDSEASSSLFIGSLASGIVNKT 304

Query: 302 GVKSSG-LTYT-PFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGS 361
           G    G +T T    +NP   +     +YYL ++ I VG + + V         DG GG 
Sbjct: 305 GASLDGEVTKTMSLLRNPDQPS-----FYYLELQGITVGAKRLSVEKSTFELAEDGTGGM 364

Query: 362 IIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPCFDI-SKEKSVKFPEL 421
           IIDSG+T T++++   +V+  EF  ++   +   D    TGL  CF +    K++  P++
Sbjct: 365 IIDSGTTITYLEETAFKVLKEEFTSRM---SLPVDDSGSTGLDLCFKLPDAAKNIAVPKM 424

Query: 422 IFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYV 445
           IF FK GA   LP  NY    SS+GV CL +         G   G S I G  QQQNF V
Sbjct: 425 IFHFK-GADLELPGENYMVADSSTGVLCLAM---------GSSNGMS-IFGNVQQQNFNV 446

BLAST of Cucsa.241660 vs. TAIR10
Match: AT2G42980.1 (AT2G42980.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 159.5 bits (402), Expect = 4.9e-39
Identity = 123/386 (31.87%), Postives = 177/386 (45.85%), Query Frame = 1

Query: 79  GAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSS 138
           G Y   +  GTP +   LI DTGS L W  C   Y C   +    DP        K S+S
Sbjct: 158 GEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYDP--------KTSAS 217

Query: 139 SKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TAGLLLSETLD 198
            K + C +P+CS I  PD   QC S N       Q+CP Y   YG  S T G    ET  
Sbjct: 218 FKNITCNDPRCSLISSPDPPVQCESDN-------QSCP-YFYWYGDRSNTTGDFAVETFT 277

Query: 199 F---------PDKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGL---KKFAY 258
                      + K+ N + GC   +       SG+ G GRG  S  SQ+       F+Y
Sbjct: 278 VNLTTTEGGSSEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSY 337

Query: 259 CLASRKFDDSPHSGQLIL--DSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVG 318
           CL  R  + +  S +LI   D   +  + L +T F        N+ + +YY+ I+ I+VG
Sbjct: 338 CLVDRNSNTNV-SSKLIFGEDKDLLNHTNLNFTSFVNG---KENSVETFYYIQIKSILVG 397

Query: 319 NQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREF-EKQLANWTRATDVET 378
            +A+ +P +      DG+GG+IIDSG+T ++  +P  E++  +F EK   N+    D   
Sbjct: 398 GKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPV 457

Query: 379 LTGLRPCFDIS--KEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQME 438
           L    PCF++S  +E ++  PEL   F  G  W  P  N F  +S   + CL ++     
Sbjct: 458 LD---PCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWLSED-LVCLAIL----- 511

Query: 439 DGGGGGGGPSVILGAFQQQNFYVEYD 444
              G       I+G +QQQNF++ YD
Sbjct: 518 ---GTPKSTFSIIGNYQQQNFHILYD 511

BLAST of Cucsa.241660 vs. NCBI nr
Match: gi|449437856|ref|XP_004136706.1| (PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis sativus])

HSP 1 Score: 911.8 bits (2355), Expect = 4.8e-262
Identity = 444/444 (100.00%), Postives = 444/444 (100.00%), Query Frame = 1

Query: 1   MASPSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQI 60
           MASPSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQI
Sbjct: 1   MASPSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQI 60

Query: 61  KTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF 120
           KTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF
Sbjct: 61  KTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF 120

Query: 121 PKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 180
           PKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVV
Sbjct: 121 PKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 180

Query: 181 QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 240
           QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF
Sbjct: 181 QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 240

Query: 241 AYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVG 300
           AYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVG
Sbjct: 241 AYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVG 300

Query: 301 NQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETL 360
           NQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETL
Sbjct: 301 NQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETL 360

Query: 361 TGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGG 420
           TGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGG
Sbjct: 361 TGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGG 420

Query: 421 GGGGGPSVILGAFQQQNFYVEYDL 445
           GGGGGPSVILGAFQQQNFYVEYDL
Sbjct: 421 GGGGGPSVILGAFQQQNFYVEYDL 444

BLAST of Cucsa.241660 vs. NCBI nr
Match: gi|659084466|ref|XP_008442902.1| (PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo])

HSP 1 Score: 851.7 bits (2199), Expect = 5.9e-244
Identity = 416/444 (93.69%), Postives = 433/444 (97.52%), Query Frame = 1

Query: 1   MASPSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQI 60
           MASPSPLSFFY+LLFSSLSAI++SNPITLPLNS PHLSS DPLQALTFLAS+S+ RAH+I
Sbjct: 1   MASPSPLSFFYILLFSSLSAISNSNPITLPLNSSPHLSSSDPLQALTFLASASKNRAHRI 60

Query: 61  KTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF 120
           KTPKSNSV KSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLC+ECSF
Sbjct: 61  KTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCTECSF 120

Query: 121 PKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 180
           PKIDPTGIPRFVPKLSSSSKLVGCQNPKC+WIFGPDVKSQCRSCNPKTENCTQTCPAYVV
Sbjct: 121 PKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 180

Query: 181 QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 240
           QYGSGSTAGLLLSETLDFP+KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF
Sbjct: 181 QYGSGSTAGLLLSETLDFPNKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 240

Query: 241 AYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVG 300
           AYCLASRKFDDS HSGQLILDS+GVK+SGLTYT FRQNPSVSN+AYKEYYYLNIRKIIVG
Sbjct: 241 AYCLASRKFDDSAHSGQLILDSSGVKTSGLTYTSFRQNPSVSNHAYKEYYYLNIRKIIVG 300

Query: 301 NQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETL 360
           NQAVKVPYK+LVPGPDGNGGSIIDSGSTFTFMDKPVL+VVA+EFEKQLAN TRATDVETL
Sbjct: 301 NQAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVLDVVAQEFEKQLANRTRATDVETL 360

Query: 361 TGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGG 420
           TGLRPCFD+SKEKSV+FPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTH  ED  
Sbjct: 361 TGLRPCFDVSKEKSVEFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHNTED-- 420

Query: 421 GGGGGPSVILGAFQQQNFYVEYDL 445
           GGGGGPSVILGAFQQQNFYVEYDL
Sbjct: 421 GGGGGPSVILGAFQQQNFYVEYDL 442

BLAST of Cucsa.241660 vs. NCBI nr
Match: gi|645276803|ref|XP_008243463.1| (PREDICTED: aspartic proteinase nepenthesin-2-like [Prunus mume])

HSP 1 Score: 584.7 bits (1506), Expect = 1.3e-163
Identity = 287/463 (61.99%), Postives = 350/463 (75.59%), Query Frame = 1

Query: 5   SPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQIKTPK 64
           +P S   L    SL  +  S+ +TLPL+ FP+  S DPLQAL+F AS+S +RAH IK  +
Sbjct: 3   NPKSLLTLTSLFSLFLLTLSSKLTLPLSPFPNYPSSDPLQALSFHASASISRAHHIKNSR 62

Query: 65  --SNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPK 124
             ++S+ + PL PHSYG YS  L+FGTP QT   I DTGSSLVWFPCT RY CS C FP 
Sbjct: 63  KPNSSLTQVPLFPHSYGDYSVSLNFGTPPQTSSFIMDTGSSLVWFPCTKRYSCSRCQFPN 122

Query: 125 IDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCN-PKTENCTQTCPAYVVQ 184
           I+P  IP F PKLSSSSK+VGCQNPKC WIFGP+VKS+C +CN P  +NC+QTCP Y++Q
Sbjct: 123 INPAKIPTFKPKLSSSSKIVGCQNPKCGWIFGPEVKSKCPNCNNPSPQNCSQTCPTYIIQ 182

Query: 185 YGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFA 244
           YGSG+TAG+LLSETL+FP K +P+F+VGCSFLSI QPSGIAGFGRG +SLP+QMGL KF+
Sbjct: 183 YGSGTTAGILLSETLNFPKKIVPDFLVGCSFLSIRQPSGIAGFGRGPQSLPAQMGLSKFS 242

Query: 245 YCLASRKFDDSPHSGQLILDSTGVKSSG--------------------LTYTPFRQNPSV 304
           YCL S KFDD+P S  L+L S+   SS                     L+ TPF++NP  
Sbjct: 243 YCLVSHKFDDTPQSSDLVLYSSSSGSSSSSEEEPTIAESQRNKTRLQSLSSTPFQKNPGP 302

Query: 305 SNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVA 364
            N+A++EYYY+ +RK+IVGN+ VK+PYKFLVPG D +GG+I+DSGSTFTFM+KPV E+VA
Sbjct: 303 PNSAFREYYYIMLRKVIVGNKNVKIPYKFLVPGADSSGGTIVDSGSTFTFMEKPVFELVA 362

Query: 365 REFEKQLANWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALV 424
            EFE Q+AN+TRA + E  TGLRPCFDISKEK V FPEL+FQFKGGAK  LPL NYF+LV
Sbjct: 363 EEFEAQMANYTRAKEWENKTGLRPCFDISKEKKVDFPELVFQFKGGAKMELPLTNYFSLV 422

Query: 425 SSSGVACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDL 445
           SSSGV CLT+VT  +  G GG GGP++ILG +QQQNF+VEYDL
Sbjct: 423 SSSGVVCLTIVTDGVA-GPGGNGGPAIILGNYQQQNFHVEYDL 464

BLAST of Cucsa.241660 vs. NCBI nr
Match: gi|595803542|ref|XP_007202027.1| (hypothetical protein PRUPE_ppa005104mg [Prunus persica])

HSP 1 Score: 579.3 bits (1492), Expect = 5.6e-162
Identity = 283/463 (61.12%), Postives = 350/463 (75.59%), Query Frame = 1

Query: 5   SPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQIKTPK 64
           +P S   L    SL  +  S+ ITLPL+ FP+  S DPLQAL+F AS+S +RAH IK  +
Sbjct: 3   NPKSLLTLTSLFSLFLLTLSSKITLPLSPFPNHPSSDPLQALSFHASASISRAHHIKNSR 62

Query: 65  --SNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPK 124
             ++S+ + PL PHSYG YS  L+FGTP QT   I DTGSSLVWFPCT RY+CS C FP 
Sbjct: 63  KPNSSLTQVPLFPHSYGDYSVSLNFGTPPQTSSFIMDTGSSLVWFPCTKRYICSRCQFPN 122

Query: 125 IDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCN-PKTENCTQTCPAYVVQ 184
           I+P  IP F PKLSSSSK+VGCQNPKC WIFGP+VKS+C +CN P  +NC+Q CP Y++Q
Sbjct: 123 INPAKIPTFKPKLSSSSKIVGCQNPKCGWIFGPEVKSKCPNCNNPSHQNCSQACPTYIIQ 182

Query: 185 YGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFA 244
           YGSG+TAG+LLSETLDFP K +P+F+VGCSF+SI QP+GIAGFGRG +SLP+QMGL KF+
Sbjct: 183 YGSGTTAGILLSETLDFPKKIVPDFLVGCSFVSIRQPAGIAGFGRGPQSLPAQMGLTKFS 242

Query: 245 YCLASRKFDDSPHSGQLILDSTGVKSSG--------------------LTYTPFRQNPSV 304
           YCL S +FDD+P S  L+L S+   SS                     L+ TPF++NP  
Sbjct: 243 YCLVSHRFDDTPQSSDLVLYSSSSGSSSSSEEEPTIAESQRNKTKLQSLSSTPFQKNPGP 302

Query: 305 SNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVA 364
            N+A++EYYY+ +RK+IVGN+ VK+PYKFLVPG D +GG+I+DSGSTFTFM+KPV E VA
Sbjct: 303 PNSAFREYYYIMLRKVIVGNKNVKIPYKFLVPGADSSGGTIVDSGSTFTFMEKPVFEPVA 362

Query: 365 REFEKQLANWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALV 424
           +EFE Q+AN+TRA D+E  TGLRPCFDISKEK V FPEL+FQFKGGAK  LP  NYF++V
Sbjct: 363 KEFEAQMANYTRAKDLENKTGLRPCFDISKEKKVDFPELVFQFKGGAKMELPSKNYFSMV 422

Query: 425 SSSGVACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDL 445
           SSSGV CLT+VT  +  G GG GGP++ILG +QQQ+F+VEYDL
Sbjct: 423 SSSGVVCLTIVTDGVV-GPGGNGGPAIILGNYQQQDFHVEYDL 464

BLAST of Cucsa.241660 vs. NCBI nr
Match: gi|1009161294|ref|XP_015898820.1| (PREDICTED: aspartic proteinase nepenthesin-1 [Ziziphus jujuba])

HSP 1 Score: 573.2 bits (1476), Expect = 4.0e-160
Identity = 278/454 (61.23%), Postives = 352/454 (77.53%), Query Frame = 1

Query: 2   ASPSPLSFFYLL-LFSSLSAIAHSNP-----ITLPLNSFPHLSSPDPLQALTFLASSSQT 61
           AS S L F  L  LF S+++ + ++P     I++ ++ F    S DP Q+L FLAS S +
Sbjct: 3   ASISCLCFISLFSLFLSVASSSSTSPPKPTSISIQISPFSKHPSSDPFQSLNFLASLSIS 62

Query: 62  RAHQIKTPKSNS-VFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYL 121
           RAH +K PKSNS + K PL P  YG YS  L+FGTP Q +  + DTGSSLVW PCTSRYL
Sbjct: 63  RAHHLKHPKSNSSLTKVPLYPRGYGGYSIFLNFGTPPQKISFVMDTGSSLVWLPCTSRYL 122

Query: 122 CSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQT 181
           CS+CSFP I P  IP F+PKLSSSSK+VGC+NPKC W+ GPDVK  C+ C+P ++NC+Q 
Sbjct: 123 CSKCSFPNIVPAKIPTFIPKLSSSSKIVGCKNPKCGWVLGPDVK--CQDCDPSSKNCSQP 182

Query: 182 CPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQ 241
           CPAY++QYGSG+TAGLL+SE+LDFP+K +P+F+VGCSFLS  QPSGIAGFGRG +SLPSQ
Sbjct: 183 CPAYIIQYGSGTTAGLLISESLDFPEKTVPDFLVGCSFLSFRQPSGIAGFGRGPQSLPSQ 242

Query: 242 MGLKKFAYCLASRKFDDSPHSGQLIL----DSTGVKSSGLTYTPFRQNPSVSNNAYKEYY 301
           MGL KF+YCL S KFDD+  S  L+L    DS   K++ L+YTPF++NP VSN A+ EYY
Sbjct: 243 MGLSKFSYCLISHKFDDTQESSNLVLYSGSDSGNSKATDLSYTPFQKNPEVSNPAFHEYY 302

Query: 302 YLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLAN 361
           Y+ +RK+IVG   VK+PYKFLVPG +GNGG+I+DSGSTFTFM+KPV E V++EF KQ+ N
Sbjct: 303 YVLLRKVIVGGTRVKIPYKFLVPGSEGNGGTIVDSGSTFTFMEKPVFEAVSQEFAKQMVN 362

Query: 362 WTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLT 421
           +TRATD+E  TGL+PCFDIS+EKSV FPEL+FQFKGGAK ALP+ NYF+LV+ SG+ CLT
Sbjct: 363 YTRATDIENRTGLQPCFDISREKSVNFPELVFQFKGGAKMALPVANYFSLVTDSGIICLT 422

Query: 422 VVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDL 445
           +VT ++  G     GP++ILG +QQQNF++EYDL
Sbjct: 423 IVTDEVA-GPSFTSGPAIILGNYQQQNFHIEYDL 453

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
ASP63_ARATH3.1e-4329.80Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana GN=At4g16563 PE=2 S... [more]
APF2_ARATH6.9e-3530.62Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1[more]
NEP2_NEPGR3.2e-3231.90Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1[more]
NEP1_NEPGR6.1e-3130.05Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1[more]
AED3_ARATH1.0e-2527.88Aspartyl protease AED3 OS=Arabidopsis thaliana GN=AED3 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0LBI9_CUCSA3.3e-262100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_3G778440 PE=3 SV=1[more]
M5VQG8_PRUPE3.9e-16261.12Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005104mg PE=3 SV=1[more]
B9T7L5_RICCO1.5e-15360.53Pepsin A, putative OS=Ricinus communis GN=RCOM_0308790 PE=3 SV=1[more]
B9HBQ5_POPTR2.0e-15361.42Aspartyl protease family protein OS=Populus trichocarpa GN=POPTR_0006s22110g PE=... [more]
A0A0D2S3P3_GOSRA6.3e-15258.88Uncharacterized protein OS=Gossypium raimondii GN=B456_006G150600 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT3G52500.13.8e-14856.71 Eukaryotic aspartyl protease family protein[more]
AT4G16563.11.7e-4429.80 Eukaryotic aspartyl protease family protein[more]
AT5G45120.11.7e-4431.46 Eukaryotic aspartyl protease family protein[more]
AT2G03200.11.3e-3932.23 Eukaryotic aspartyl protease family protein[more]
AT2G42980.14.9e-3931.87 Eukaryotic aspartyl protease family protein[more]
Match NameE-valueIdentityDescription
gi|449437856|ref|XP_004136706.1|4.8e-262100.00PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis sativus][more]
gi|659084466|ref|XP_008442902.1|5.9e-24493.69PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo][more]
gi|645276803|ref|XP_008243463.1|1.3e-16361.99PREDICTED: aspartic proteinase nepenthesin-2-like [Prunus mume][more]
gi|595803542|ref|XP_007202027.1|5.6e-16261.12hypothetical protein PRUPE_ppa005104mg [Prunus persica][more]
gi|1009161294|ref|XP_015898820.1|4.0e-16061.23PREDICTED: aspartic proteinase nepenthesin-1 [Ziziphus jujuba][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001461Aspartic_peptidase_A1
IPR001969Aspartic_peptidase_AS
IPR021109Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005618 cell wall
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cucsa.241660.1Cucsa.241660.1mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 321..332
score: 8.4E-7coord: 87..107
score: 8.4E-7coord: 428..443
score: 8.
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 2..444
score: 1.1E
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 96..107
scor
IPR021109Aspartic peptidase domainGENE3DG3DSA:2.40.70.10coord: 267..444
score: 5.8E-44coord: 76..253
score: 1.2
IPR021109Aspartic peptidase domainunknownSSF50630Acid proteasescoord: 75..444
score: 4.33
NoneNo IPR availablePANTHERPTHR13683:SF340ASPARTYL PROTEASE FAMILY PROTEINcoord: 2..444
score: 1.1E

The following gene(s) are paralogous to this gene:

None