Lsi07G013020 (gene) Bottle gourd (USVL1VR-Ls)

NameLsi07G013020
Typegene
OrganismLagenaria siceraria (Bottle gourd (USVL1VR-Ls))
DescriptionEukaryotic aspartyl protease family protein
Locationchr07 : 18987195 .. 18988640 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTTCCCCTGTTTTTGTTTTCCTCCTCTGTTTTCTCCTCTCTTCCCCTGTTTTCTCCTCACAAATTCTCCTTCTACCTCTCTCCCATTCCTTATTATCCTCAATATCAGATTTCAACAACACCCACAACCTCCTCAAATCCACCGCTGCCCGCTCCTCCGCCAGATTCCACCACCACCGCCGTGCCCACCACCGTAACCACCTCTCTCTTCCCCTCTCCCCTGGCGGCGATTACACTCTCTCCTTCAACCTCGGCTCTGAGTCTCACAAAATTTCCCTCTATATGGACACTGGCAGCGACCTTGTTTGGTTCCCCTGTTCCCCGTTTGAATGTATTCTTTGTGAAGGCAAACCGAAAATTCAATCCCCTTTGCCCAAAATCTCAAATAATAAATCAGTTTCCTGCAGCGCCGCCGCCTGCTCTGCCGCCCATGGTGGCTCCCTCTCCGCCTCCCACCTCTGTGCAATTTCTCAGTGTCCACTTGAATCCATTGAAATTTCCGAGTGCTCCTCTTTTTCCTGTCCCCCGTTTTATTATGCTTACGGCGATGGGAGTTTAATTGCTCGGCTTTATAGAGATAGCCTCAGTTTGCCGGCGCCGGCGCCGTCACCGGCGATTAATGTTCGGAATTTTACTTTTGGGTGTGCCCACACGACGCTCGGCGAGCCGGTTGGGGTCGCCGGATTCGGCCGGGGGACGTTGTCGATGCCGAGTCAACTCGCCACTTTCTCCCCCCAACTTGGGAACCGGTTTTCTTATTGTTTGGTTTCTCACTCGTTTGCCGCGGACCGAGTTCGCCGTCCGAGTCCGCTGATTCTCGGCCGGTACTTCGGCGGCGAGACAGAGTTCGTTTACACTTCCTTGCTTGAGAATCCGAAACATCCTTATTTTTACTCGGTTGGGTTGGCCGGAATTTCAGTCGGGAATGTGAGGATTCCGGCGCCGGAGTTTTTGAAAAAAGTGGATGAGGGTGGTAGCGGCGGCGTTGTGGTGGATTCCGGCACTACTTTCACTATGCTGCCGGCAGGTTTGTATGACTCGGTGGTGGCTGAATTTGAGAATCGGACCGGAAAAGTTGCGAACCGGGCGAGACGGATTGAAGAAAATATCGGGTTGAGCCCTTGCTATTACTACGAGAACTCAGTTGAAGTGCCACGTGTCGTGTTGCATTTCGTTGGGGAGAAATCGAGTGTGGTGCTTCCTAGGAAGAATTATTTCTATGAGTTTGTGGACGGTGGAGATGGGGTTGGGAGGAAGAGAAAAGTTGGGTGTTTGATGCTGATGAACGGTGGGGATGAGGCTGAGTTGGCAGGTGGGCCCGGGGCCACGCTTGGGAACTACCAACAACAGGGTTTTGAAGTAGTTTATGATTTGGAAAATAACCGGGTCGGGTTCGCCCGGCGGCAGTGCTCAACTCTTTGGGACAGCTTGAACCGGAGTTAG

mRNA sequence

ATGGCTTCCCCTGTTTTTGTTTTCCTCCTCTGTTTTCTCCTCTCTTCCCCTGTTTTCTCCTCACAAATTCTCCTTCTACCTCTCTCCCATTCCTTATTATCCTCAATATCAGATTTCAACAACACCCACAACCTCCTCAAATCCACCGCTGCCCGCTCCTCCGCCAGATTCCACCACCACCGCCGTGCCCACCACCGTAACCACCTCTCTCTTCCCCTCTCCCCTGGCGGCGATTACACTCTCTCCTTCAACCTCGGCTCTGAGTCTCACAAAATTTCCCTCTATATGGACACTGGCAGCGACCTTGTTTGGTTCCCCTGTTCCCCGTTTGAATGTATTCTTTGTGAAGGCAAACCGAAAATTCAATCCCCTTTGCCCAAAATCTCAAATAATAAATCAGTTTCCTGCAGCGCCGCCGCCTGCTCTGCCGCCCATGGTGGCTCCCTCTCCGCCTCCCACCTCTGTGCAATTTCTCAGTGTCCACTTGAATCCATTGAAATTTCCGAGTGCTCCTCTTTTTCCTGTCCCCCGTTTTATTATGCTTACGGCGATGGGAGTTTAATTGCTCGGCTTTATAGAGATAGCCTCAGTTTGCCGGCGCCGGCGCCGTCACCGGCGATTAATGTTCGGAATTTTACTTTTGGGTGTGCCCACACGACGCTCGGCGAGCCGGTTGGGGTCGCCGGATTCGGCCGGGGGACGTTGTCGATGCCGAGTCAACTCGCCACTTTCTCCCCCCAACTTGGGAACCGGTTTTCTTATTGTTTGGTTTCTCACTCGTTTGCCGCGGACCGAGTTCGCCGTCCGAGTCCGCTGATTCTCGGCCGGTACTTCGGCGGCGAGACAGAGTTCGTTTACACTTCCTTGCTTGAGAATCCGAAACATCCTTATTTTTACTCGGTTGGGTTGGCCGGAATTTCAGTCGGGAATGTGAGGATTCCGGCGCCGGAGTTTTTGAAAAAAGTGGATGAGGGTGGTAGCGGCGGCGTTGTGGTGGATTCCGGCACTACTTTCACTATGCTGCCGGCAGGTTTGTATGACTCGGTGGTGGCTGAATTTGAGAATCGGACCGGAAAAGTTGCGAACCGGGCGAGACGGATTGAAGAAAATATCGGGTTGAGCCCTTGCTATTACTACGAGAACTCAGTTGAAGTGCCACGTGTCGTGTTGCATTTCGTTGGGGAGAAATCGAGTGTGGTGCTTCCTAGGAAGAATTATTTCTATGAGTTTGTGGACGGTGGAGATGGGGTTGGGAGGAAGAGAAAAGTTGGGTGTTTGATGCTGATGAACGGTGGGGATGAGGCTGAGTTGGCAGGTGGGCCCGGGGCCACGCTTGGGAACTACCAACAACAGGGTTTTGAAGTAGTTTATGATTTGGAAAATAACCGGGTCGGGTTCGCCCGGCGGCAGTGCTCAACTCTTTGGGACAGCTTGAACCGGAGTTAG

Coding sequence (CDS)

ATGGCTTCCCCTGTTTTTGTTTTCCTCCTCTGTTTTCTCCTCTCTTCCCCTGTTTTCTCCTCACAAATTCTCCTTCTACCTCTCTCCCATTCCTTATTATCCTCAATATCAGATTTCAACAACACCCACAACCTCCTCAAATCCACCGCTGCCCGCTCCTCCGCCAGATTCCACCACCACCGCCGTGCCCACCACCGTAACCACCTCTCTCTTCCCCTCTCCCCTGGCGGCGATTACACTCTCTCCTTCAACCTCGGCTCTGAGTCTCACAAAATTTCCCTCTATATGGACACTGGCAGCGACCTTGTTTGGTTCCCCTGTTCCCCGTTTGAATGTATTCTTTGTGAAGGCAAACCGAAAATTCAATCCCCTTTGCCCAAAATCTCAAATAATAAATCAGTTTCCTGCAGCGCCGCCGCCTGCTCTGCCGCCCATGGTGGCTCCCTCTCCGCCTCCCACCTCTGTGCAATTTCTCAGTGTCCACTTGAATCCATTGAAATTTCCGAGTGCTCCTCTTTTTCCTGTCCCCCGTTTTATTATGCTTACGGCGATGGGAGTTTAATTGCTCGGCTTTATAGAGATAGCCTCAGTTTGCCGGCGCCGGCGCCGTCACCGGCGATTAATGTTCGGAATTTTACTTTTGGGTGTGCCCACACGACGCTCGGCGAGCCGGTTGGGGTCGCCGGATTCGGCCGGGGGACGTTGTCGATGCCGAGTCAACTCGCCACTTTCTCCCCCCAACTTGGGAACCGGTTTTCTTATTGTTTGGTTTCTCACTCGTTTGCCGCGGACCGAGTTCGCCGTCCGAGTCCGCTGATTCTCGGCCGGTACTTCGGCGGCGAGACAGAGTTCGTTTACACTTCCTTGCTTGAGAATCCGAAACATCCTTATTTTTACTCGGTTGGGTTGGCCGGAATTTCAGTCGGGAATGTGAGGATTCCGGCGCCGGAGTTTTTGAAAAAAGTGGATGAGGGTGGTAGCGGCGGCGTTGTGGTGGATTCCGGCACTACTTTCACTATGCTGCCGGCAGGTTTGTATGACTCGGTGGTGGCTGAATTTGAGAATCGGACCGGAAAAGTTGCGAACCGGGCGAGACGGATTGAAGAAAATATCGGGTTGAGCCCTTGCTATTACTACGAGAACTCAGTTGAAGTGCCACGTGTCGTGTTGCATTTCGTTGGGGAGAAATCGAGTGTGGTGCTTCCTAGGAAGAATTATTTCTATGAGTTTGTGGACGGTGGAGATGGGGTTGGGAGGAAGAGAAAAGTTGGGTGTTTGATGCTGATGAACGGTGGGGATGAGGCTGAGTTGGCAGGTGGGCCCGGGGCCACGCTTGGGAACTACCAACAACAGGGTTTTGAAGTAGTTTATGATTTGGAAAATAACCGGGTCGGGTTCGCCCGGCGGCAGTGCTCAACTCTTTGGGACAGCTTGAACCGGAGTTAG

Protein sequence

MASPVFVFLLCFLLSSPVFSSQILLLPLSHSLLSSISDFNNTHNLLKSTAARSSARFHHHRRAHHRNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYYAYGDGSLIARLYRDSLSLPAPAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYFGGETEFVYTSLLENPKHPYFYSVGLAGISVGNVRIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTGKVANRARRIEENIGLSPCYYYENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFVDGGDGVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLENNRVGFARRQCSTLWDSLNRS
BLAST of Lsi07G013020 vs. Swiss-Prot
Match: ASP63_ARATH (Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana GN=At4g16563 PE=2 SV=1)

HSP 1 Score: 591.7 bits (1524), Expect = 7.4e-168
Identity = 298/478 (62.34%), Postives = 357/478 (74.69%), Query Frame = 1

Query: 24  LLLPLSHSLLSSISDFNNTHNLLKSTAARSSARFHHHRRAHHRNHLSLPLSPGGDYTLSF 83
           LLL LSHSL +S    +  H LLKS+++RSSARF  H     +  LSLP+S G DY +S 
Sbjct: 29  LLLHLSHSLSTSKHSSSPLH-LLKSSSSRSSARFRRHHHKQQQQQLSLPISSGSDYLISL 88

Query: 84  NLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNNKS-VSCSAAACS 143
           ++GS S  +SLY+DTGSDLVWFPC PF CILCE KP   SP   +S++ + VSCS+ +CS
Sbjct: 89  SVGSSSSAVSLYLDTGSDLVWFPCRPFTCILCESKPLPPSPPSSLSSSATTVSCSSPSCS 148

Query: 144 AAHGGSLSASHLCAISQCPLESIEISEC--SSFSCPPFYYAYGDGSLIARLYRDSLSLPA 203
           AAH  SL +S LCAIS CPL+ IE  +C  SS+ CPPFYYAYGDGSL+A+LY DSLSLP+
Sbjct: 149 AAHS-SLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSLVAKLYSDSLSLPS 208

Query: 204 PAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHS 263
                 ++V NFTFGCAHTTL EP+GVAGFGRG LS+P+QLA  SP LGN FSYCLVSHS
Sbjct: 209 ------VSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHS 268

Query: 264 FAADRVRRPSPLILGRYFGGE--------------------TEFVYTSLLENPKHPYFYS 323
           F +DRVRRPSPLILGR+   +                     EFV+T +LENPKHPYFYS
Sbjct: 269 FDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPKHPYFYS 328

Query: 324 VGLAGISVGNVRIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTGKV 383
           V L GIS+G   IPAP  L+++D+ G GGVVVDSGTTFTMLPA  Y+SVV EF++R G+V
Sbjct: 329 VSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRV 388

Query: 384 ANRARRIEENIGLSPCYYYENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFVDGGDGVGRK 443
             RA R+E + G+SPCYY   +V+VP +VLHF G +SSV LPR+NYFYEF+DGGDG   K
Sbjct: 389 HERADRVEPSSGMSPCYYLNQTVKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEK 448

Query: 444 RKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLENNRVGFARRQCSTLWDSL 479
           RK+GCLMLMNGGDE+EL GG GA LGNYQQQGFEVVYDL N RVGFA+R+C++LWDSL
Sbjct: 449 RKIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCASLWDSL 498

BLAST of Lsi07G013020 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 150.2 bits (378), Expect = 5.7e-35
Identity = 134/410 (32.68%), Postives = 185/410 (45.12%), Query Frame = 1

Query: 73  LSPG-GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNN 132
           LS G G+Y     +G+ +  + + +DTGSD+VW  C+P  C  C  +       P     
Sbjct: 135 LSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAP--CRRCYSQSD-----PIFDPR 194

Query: 133 KSVSCSAAACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYYAYGDGSL-IAR 192
           KS + +   CS+ H   L ++      +  L  +               +YGDGS  +  
Sbjct: 195 KSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQV---------------SYGDGSFTVGD 254

Query: 193 LYRDSLSLPAPAPSPAINVRNFTFGCAHTTLGEPVGVAGF---GRGTLSMPSQLATFSPQ 252
              ++L+           V+    GC H   G  VG AG    G+G LS P Q      +
Sbjct: 255 FSTETLTFRRN------RVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGH---R 314

Query: 253 LGNRFSYCLVSHSFAADRVRRPSPLILGRYFGGETEFVYTSLLENPKHPYFYSVGLAGIS 312
              +FSYCLV  S ++    +PS ++ G          +T LL NPK   FY VGL GIS
Sbjct: 315 FNQKFSYCLVDRSASS----KPSSVVFGNAAVSRIAR-FTPLLSNPKLDTFYYVGLLGIS 374

Query: 313 VGNVRIPA-PEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTGKVANRARR 372
           VG  R+P     L K+D+ G+GGV++DSGT+ T L    Y ++   F  R G  A   +R
Sbjct: 375 VGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAF--RVG--AKTLKR 434

Query: 373 IEENIGLSPCYYYEN--SVEVPRVVLHFVGEKSSVVLPRKNYFYEFVDGGDGVGRKRKVG 432
             +      C+   N   V+VP VVLHF G  + V LP  NY    VD            
Sbjct: 435 APDFSLFDTCFDLSNMNEVKVPTVVLHFRG--ADVSLPATNYLIP-VD------------ 485

Query: 433 CLMLMNGGDEAELAGGPG--ATLGNYQQQGFEVVYDLENNRVGFARRQCS 473
                NG      AG  G  + +GN QQQGF VVYDL ++RVGFA   C+
Sbjct: 495 ----TNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485

BLAST of Lsi07G013020 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 146.7 bits (369), Expect = 6.3e-34
Identity = 122/406 (30.05%), Postives = 179/406 (44.09%), Query Frame = 1

Query: 77  GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNNKSVSC 136
           G+Y ++ ++G+ +   S  MDTGSDL+W  C P  C  C          P  +   S S 
Sbjct: 93  GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQP--CTQC-----FNQSTPIFNPQGSSSF 152

Query: 137 SAAACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYYAYGDGSLI-ARLYRDS 196
           S   CS         S LC     P        CS+  C  + Y YGDGS     +  ++
Sbjct: 153 STLPCS---------SQLCQALSSPT-------CSNNFCQ-YTYGYGDGSETQGSMGTET 212

Query: 197 LSLPAPAPSPAINVRNFTFGCAHTTLG----EPVGVAGFGRGTLSMPSQLATFSPQLGNR 256
           L+  +      +++ N TFGC     G       G+ G GRG LS+PSQL         +
Sbjct: 213 LTFGS------VSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDV------TK 272

Query: 257 FSYCLVSHSFAADRVRRPSPLILGRYFGGETE-FVYTSLLENPKHPYFYSVGLAGISVGN 316
           FSYC+     +      PS L+LG      T     T+L+++ + P FY + L G+SVG+
Sbjct: 273 FSYCMTPIGSST-----PSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGS 332

Query: 317 VRIPA-PEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTG-KVANRARRIE 376
            R+P  P         G+GG+++DSGTT T      Y SV  EF ++    V N +    
Sbjct: 333 TRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGS---- 392

Query: 377 ENIGLSPCYYY---ENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFVDGGDGVGRKRKVGC 436
            + G   C+      +++++P  V+HF G    + LP +NYF    +G         + C
Sbjct: 393 -SSGFDLCFQTPSDPSNLQIPTFVMHFDG--GDLELPSENYFISPSNG---------LIC 434

Query: 437 LMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLENNRVGFARRQC 472
           L + +      +        GN QQQ   VVYD  N+ V FA  QC
Sbjct: 453 LAMGSSSQGMSI-------FGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of Lsi07G013020 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 146.4 bits (368), Expect = 8.3e-34
Identity = 120/401 (29.93%), Postives = 167/401 (41.65%), Query Frame = 1

Query: 77  GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNNKSVSC 136
           G+Y     +G+ + ++ L +DTGSD+ W  C P  C  C  +          S  KS++C
Sbjct: 160 GEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEP--CADCYQQSDPVFNPTSSSTYKSLTC 219

Query: 137 SAAACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYYAYGDGSL-IARLYRDS 196
           SA  CS                      +E S C S  C  +  +YGDGS  +  L  D+
Sbjct: 220 SAPQCSL---------------------LETSACRSNKCL-YQVSYGDGSFTVGELATDT 279

Query: 197 LSLPAPAPSPAINVRNFTFGCAHTTLGEPVGVAGF---GRGTLSMPSQLATFSPQLGNRF 256
           ++           + N   GC H   G   G AG    G G LS+ +Q+   S      F
Sbjct: 280 VTFGNSG-----KINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATS------F 339

Query: 257 SYCLVSHSFAADRVRRPSPLILGRYFGGETEFVYTSLLENPKHPYFYSVGLAGISVGNVR 316
           SYCLV            + + LG   GG+       LL N K   FY VGL+G SVG  +
Sbjct: 340 SYCLVDRDSGKSSSLDFNSVQLG---GGDAT---APLLRNKKIDTFYYVGLSGFSVGGEK 399

Query: 317 IPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTGKVANRARRIEENIG 376
           +  P+ +  VD  GSGGV++D GT  T L    Y+S+   F   T  +   +  I     
Sbjct: 400 VVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISL--- 459

Query: 377 LSPCYYYE--NSVEVPRVVLHFVGEKSSVVLPRKNYFYEFVDGGDGVGRKRKVGCLMLMN 436
              CY +   ++V+VP V  HF G K S+ LP KNY     D G          C     
Sbjct: 460 FDTCYDFSSLSTVKVPTVAFHFTGGK-SLDLPAKNYLIPVDDSG--------TFCFAFAP 500

Query: 437 GGDEAELAGGPGATLGNYQQQGFEVVYDLENNRVGFARRQC 472
                 +       +GN QQQG  + YDL  N +G +  +C
Sbjct: 520 TSSSLSI-------IGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of Lsi07G013020 vs. Swiss-Prot
Match: AED3_ARATH (Aspartyl protease AED3 OS=Arabidopsis thaliana GN=AED3 PE=1 SV=1)

HSP 1 Score: 145.6 bits (366), Expect = 1.4e-33
Identity = 119/415 (28.67%), Postives = 177/415 (42.65%), Query Frame = 1

Query: 70  SLPLSPG-----GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSP 129
           S+P++ G     G+Y +   LG+    + + +DT +D VW PCS      C G     + 
Sbjct: 90  SVPVASGNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSG-----CSGCSNASTS 149

Query: 130 LPKISNN--KSVSCSAAACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYYAY 189
               S++   +VSCS A C+ A G +           CP  S + S CS      F  +Y
Sbjct: 150 FNTNSSSTYSTVSCSTAQCTQARGLT-----------CPSSSPQPSVCS------FNQSY 209

Query: 190 G-DGSLIARLYRDSLSLPAPAPSPAINVRNFTFGCAHTTLGE---PVGVAGFGRGTLSMP 249
           G D S  A L +D+L+L AP   P     NF+FGC ++  G    P G+ G GRG +S+ 
Sbjct: 210 GGDSSFSASLVQDTLTL-APDVIP-----NFSFGCINSASGNSLPPQGLMGLGRGPMSLV 269

Query: 250 SQLATFSPQLGNRFSYCLVS-HSFAADRVRRPSPLILGRYFGGETEFVYTSLLENPKHPY 309
           SQ  +    +   FSYCL S  SF          L LG   G      YT LL NP+ P 
Sbjct: 270 SQTTSLYSGV---FSYCLPSFRSFYFS-----GSLKLG-LLGQPKSIRYTPLLRNPRRPS 329

Query: 310 FYSVGLAGISVGNVRIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRT 369
            Y V L G+SVG+V++P        D     G ++DSGT  T     +Y+++  EF  + 
Sbjct: 330 LYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQV 389

Query: 370 GKVANRARRIEENIGLSPCYYYENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFVDGGDGV 429
                             C+  +N    P++ LH       + LP +N            
Sbjct: 390 -----NVSSFSTLGAFDTCFSADNENVAPKITLHMT--SLDLKLPMENTLIH-------- 449

Query: 430 GRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLENNRVGFARRQCS 473
                 G L  ++     + A      + N QQQ   +++D+ N+R+G A   C+
Sbjct: 450 ---SSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPCN 449

BLAST of Lsi07G013020 vs. TrEMBL
Match: A0A0A0L5I7_CUCSA (Pepsin A OS=Cucumis sativus GN=Csa_3G020060 PE=3 SV=1)

HSP 1 Score: 912.1 bits (2356), Expect = 2.8e-262
Identity = 451/480 (93.96%), Postives = 462/480 (96.25%), Query Frame = 1

Query: 3   SPVFVFLLCFLLSSPVFSSQILLLPLSHSLLSSISDFNNTHNLLKSTAARSSARFHHHRR 62
           SPVF+FLLCFLLSSPVFSSQI LLPLSHSL SSISDFNNTHNLLKSTA RSSARFH HR 
Sbjct: 4   SPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHRHRH 63

Query: 63  AHHRNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ 122
               NHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ
Sbjct: 64  ----NHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ 123

Query: 123 SPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYYAY 182
           SPLPKI+NNKSVSCSAAACSAAHGGSLSASHLCAIS+CPLESIEISECSSFSCPPFYYAY
Sbjct: 124 SPLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAY 183

Query: 183 GDGSLIARLYRDSLSLPAPAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRGTLSMPSQLA 242
           GDGSL+ARLYRDSLSLP PAPSP INVRNFTFGCAHTTLGEPVGVAGFGRG LSMPSQLA
Sbjct: 184 GDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLA 243

Query: 243 TFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYFGGETEFVYTSLLENPKHPYFYSVG 302
           TFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY+ GETEF+YTSLLENPKHPYFYSVG
Sbjct: 244 TFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVG 303

Query: 303 LAGISVGNVRIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTGKVAN 362
           LAGISVGN+RIPAPEFL KVDEGGSGGVVVDSGTTFTMLPAGLY+SVVAEFENRTGKVAN
Sbjct: 304 LAGISVGNIRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVAN 363

Query: 363 RARRIEENIGLSPCYYYENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFVDGGDG-VGRKR 422
           RARRIEEN GLSPCYYYENSV VPRVVLHFVGEKS+VVLPRKNYFYEF+DGGDG VGRKR
Sbjct: 364 RARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKR 423

Query: 423 KVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLENNRVGFARRQCSTLWDSLNRS 482
           KVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLE NRVGFARRQCSTLWD+LNRS
Sbjct: 424 KVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS 479

BLAST of Lsi07G013020 vs. TrEMBL
Match: B9GYA7_POPTR (Aspartyl protease family protein OS=Populus trichocarpa GN=POPTR_0003s07390g PE=3 SV=1)

HSP 1 Score: 659.4 bits (1700), Expect = 3.2e-186
Identity = 341/495 (68.89%), Postives = 392/495 (79.19%), Query Frame = 1

Query: 9   LLCFLLSSP---VFSSQILLLPLSHSLLSSISDFNNTHNLLKSTAARSSARFHHH---RR 68
           LLCF+L      + +SQ L LPL HSL  S + F +TH+LLKST+ RS+ RFHHH   + 
Sbjct: 8   LLCFILCFTHIFISTSQTLFLPLIHSL--SKTQFTSTHHLLKSTSTRSTTRFHHHHHNKN 67

Query: 69  AHHRNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ 128
           +H+   +SLPLSPG DYTLSF + S+   ISLY+DTGSDLVWFPC PFECILCEGK +  
Sbjct: 68  SHNHRQVSLPLSPGSDYTLSFTINSQP--ISLYLDTGSDLVWFPCQPFECILCEGKAENA 127

Query: 129 S----PLPKISNNKS-VSCSAAACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPP 188
           S    P PK+S   + VSC ++ACSA H  +L +S LCAIS CPLESIEIS+C   SCP 
Sbjct: 128 SLASTPPPKLSKTATPVSCKSSACSAVHS-NLPSSDLCAISNCPLESIEISDCRKHSCPQ 187

Query: 189 FYYAYGDGSLIARLYRDSLSLPAPAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRGTLSM 248
           FYYAYGDGSLIARLYRDS+ LP    +  I   NFTFGCAHTTL EP+GVAGFGRG LS+
Sbjct: 188 FYYAYGDGSLIARLYRDSIRLPLSNQTNLI-FNNFTFGCAHTTLAEPIGVAGFGRGVLSL 247

Query: 249 PSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYFGGETE----------FVYT 308
           P+QLAT SPQLGN+FSYCLVSHSF +DRVRRPSPLILGRY   E E          FVYT
Sbjct: 248 PAQLATLSPQLGNQFSYCLVSHSFDSDRVRRPSPLILGRYDHDEKERRVNGVKKPSFVYT 307

Query: 309 SLLENPKHPYFYSVGLAGISVGNVRIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYD 368
           S+L+NP+HPYFY VGL GIS+G  +IPAP+FL+KVD  GSGGVVVDSGTTFTMLPA LYD
Sbjct: 308 SMLDNPRHPYFYCVGLEGISIGRKKIPAPDFLRKVDRKGSGGVVVDSGTTFTMLPASLYD 367

Query: 369 SVVAEFENRTGKVANRARRIEENIGLSPCYYYENSV-EVPRVVLHFVGEKSSVVLPRKNY 428
            VVAEFENR G+V  RA  IEEN GLSPCYY++N+V  VPRVVLHFVG  SSVVLPR+NY
Sbjct: 368 FVVAEFENRVGRVNERASVIEENTGLSPCYYFDNNVVNVPRVVLHFVGNGSSVVLPRRNY 427

Query: 429 FYEFVDGGDGVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLENNRVGF 482
           FYEF+DGG G G+KRKVGCLMLMNGGDEAEL+GGPGATLGNYQQQGFEVVYDLEN RVGF
Sbjct: 428 FYEFLDGGHGKGKKRKVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLENRRVGF 487

BLAST of Lsi07G013020 vs. TrEMBL
Match: M5WCX9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004852mg PE=3 SV=1)

HSP 1 Score: 652.1 bits (1681), Expect = 5.2e-184
Identity = 328/494 (66.40%), Postives = 388/494 (78.54%), Query Frame = 1

Query: 1   MASPVFVFLLCFLLSSPVFSSQILLLPLSHSLLSSISDFNNTHNLLKSTAARSSARFHHH 60
           MAS +F+ +LCF     +  SQ L LPL+H+L  S + FN+T +LLKST  RS+ RFHHH
Sbjct: 1   MASALFL-ILCFTHFF-LSCSQPLYLPLTHTL--SQTQFNSTQHLLKSTTTRSARRFHHH 60

Query: 61  RRAHHR--NHLSLPLSPGGDYTLSFNLGSESHK-ISLYMDTGSDLVWFPCSPFECILCEG 120
            R H+R  N +SLPL+PG DYTLSF L S   + ++LYMDTGSDLVWFPCSPFECILCEG
Sbjct: 61  HRGHNRQTNQVSLPLAPGSDYTLSFTLNSSPPQPVALYMDTGSDLVWFPCSPFECILCEG 120

Query: 121 KPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPP 180
           KP   +P PKI  N +VSC + +CSAAH   LS+++LCAIS CPL+SIEISECSSFSCPP
Sbjct: 121 KPNSTNPPPKIPKNAAVSCDSRSCSAAH-SFLSSANLCAISHCPLDSIEISECSSFSCPP 180

Query: 181 FYYAYGDGSLIARLYRDSLSLPAPAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRGTLSM 240
           FYYAY DGS IA+LY+ SLS+  P  +PA+ +RNFTFGC+H++LGEP+GVAGFGRG LS+
Sbjct: 181 FYYAYADGSFIAKLYKHSLSI--PMSTPALVLRNFTFGCSHSSLGEPIGVAGFGRGLLSL 240

Query: 241 PSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY----------FGGETEFVYT 300
           P+QL+TFSP L  +FSYCLVSHSF  DRVRRPSPLILG Y           GG  E+ YT
Sbjct: 241 PAQLSTFSPHLATQFSYCLVSHSFDQDRVRRPSPLILGPYDQKQKRFGDGAGGSVEYAYT 300

Query: 301 SLLENPKHPYFYSVGLAGISVGNVRIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYD 360
           S+L+NPKHPYFYS+GLAG+SVG    PAPE L+ VD+ G+GG+VVDSGTTFTM P G Y+
Sbjct: 301 SMLDNPKHPYFYSIGLAGVSVGKRVFPAPEILQGVDKNGNGGIVVDSGTTFTMFPQGFYN 360

Query: 361 SVVAEFENRTGKVANRARRIEENIGLSPCYYYENSVEVPRVVLHFVGEKSSVVLPRKNYF 420
           S+VAEF+ R G+V  RA R+E+  GL PCYYYE  V+VP V LHF G  SSV+LPR+NYF
Sbjct: 361 SLVAEFDRRVGRVHERATRVEDETGLGPCYYYEKVVDVPAVTLHFAGNNSSVLLPRRNYF 420

Query: 421 YEFVDGGDGVGRKR-KVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLENNRVGF 480
           YEFVDGGDG GRKR KVGC MLMNGGDEAE++GGPG  LGNYQQQGFEVVYDLE  RVGF
Sbjct: 421 YEFVDGGDGAGRKRKKVGCWMLMNGGDEAEMSGGPGGILGNYQQQGFEVVYDLEKQRVGF 480

BLAST of Lsi07G013020 vs. TrEMBL
Match: B9NGC6_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s15870g PE=3 SV=1)

HSP 1 Score: 647.9 bits (1670), Expect = 9.7e-183
Identity = 339/497 (68.21%), Postives = 388/497 (78.07%), Query Frame = 1

Query: 6   FVFLLCFLLSSPVF---SSQILLLPLSHSLLSSISDFNNTHNLLKSTAARSSARF---HH 65
           +  LLCF L    F   +SQ L LPL+HSL  S + F +TH+L+KST+  S  RF   HH
Sbjct: 5   YSLLLCFSLCFSHFFISTSQTLFLPLTHSL--SKTQFTSTHHLIKSTSTSSITRFRRHHH 64

Query: 66  HRRAHHRNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKP 125
            +  H+   +SLPLSPG DYTLSF L  +S  I LY+DTGSDLVWFPC PFECILCEGK 
Sbjct: 65  QKNTHNHRQVSLPLSPGSDYTLSFTL--DSQPIFLYLDTGSDLVWFPCQPFECILCEGKA 124

Query: 126 KIQS----PLPKISNNKS-VSCSAAACSAAHGGSLSASHLCAISQCPLESIEISECSSFS 185
           +  S    P PK+S   + VSC ++ACSAAH  +L +S LCAIS CPLESIE S+C   S
Sbjct: 125 ENTSLASTPPPKLSKTATPVSCKSSACSAAHS-NLPSSDLCAISNCPLESIETSDCQKHS 184

Query: 186 CPPFYYAYGDGSLIARLYRDSLSLPAPAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRGT 245
           CP FYYAYGDGSLIARLYRDS+SLP   P+  I V NFTFGCAHT L EP+GVAGFGRG 
Sbjct: 185 CPQFYYAYGDGSLIARLYRDSISLPLSNPTNLI-VNNFTFGCAHTALAEPIGVAGFGRGV 244

Query: 246 LSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYFGGETE----------F 305
           LS+P+QLAT SPQLGN+FSYCLVSHSF +DR+RRPSPLILGRY   E E          F
Sbjct: 245 LSLPAQLATLSPQLGNQFSYCLVSHSFDSDRLRRPSPLILGRYDHDEKERRVNGVNKPRF 304

Query: 306 VYTSLLENPKHPYFYSVGLAGISVGNVRIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAG 365
           VYTS+L+N +HPYFY VGL GIS+G  +IPAP FL+KVD  GSGG+VVDSGTTFTMLPA 
Sbjct: 305 VYTSMLDNLEHPYFYCVGLEGISIGRKKIPAPGFLRKVDGEGSGGLVVDSGTTFTMLPAS 364

Query: 366 LYDSVVAEFENRTGKVANRARRIEENIGLSPCYYYENSV-EVPRVVLHFVGEKSSVVLPR 425
           LY SVVAEFENR G+V  RAR IEE+ GLSPCYY++N+V  VP VVLHFVG  SSVVLPR
Sbjct: 365 LYGSVVAEFENRVGRVNERARVIEEDTGLSPCYYFDNNVVNVPSVVLHFVGNGSSVVLPR 424

Query: 426 KNYFYEFVDGGDGVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLENNR 481
           +NYFYEF+DGGDG G+KRKVGCLMLMNGGDEAEL+GGPGATLGNYQQQGFEVVYDLEN R
Sbjct: 425 RNYFYEFLDGGDGKGKKRKVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLENKR 484

BLAST of Lsi07G013020 vs. TrEMBL
Match: B9SSF8_RICCO (Pepsin A, putative OS=Ricinus communis GN=RCOM_1061010 PE=3 SV=1)

HSP 1 Score: 647.5 bits (1669), Expect = 1.3e-182
Identity = 329/498 (66.06%), Postives = 396/498 (79.52%), Query Frame = 1

Query: 1   MASPVFVFLLCFLLSSPVFS---SQILLLPLSHSLLSSISDFNNTHNLLKSTAARSSARF 60
           MA+  + FL CF+L     S   S+IL LPL+HSL  S + F +TH+LLKST++RS++RF
Sbjct: 1   MATSCYAFL-CFILCFSCISVSISEILYLPLTHSL--SNTQFTSTHHLLKSTSSRSASRF 60

Query: 61  HH-HRRAHHRNH--LSLPLSPGGDYTLSFNLGSESHK-ISLYMDTGSDLVWFPCSPFECI 120
            H H++ H RN   +SLPLSPG DYTLSF L S   + +SLY+DTGSDLVWFPC PFECI
Sbjct: 61  QHQHQKRHLRNRHQVSLPLSPGSDYTLSFTLNSNPPQHVSLYLDTGSDLVWFPCKPFECI 120

Query: 121 LCEGKPK---IQSPLPKISNN-KSVSCSAAACSAAHGGSLSASHLCAISQCPLESIEISE 180
           LCEGK +     +P P++S+  +SV C ++ACSAAH  +L  S LCAI+ CPLESIE S+
Sbjct: 121 LCEGKAENTTASTPPPRLSSTARSVHCKSSACSAAHS-NLPTSDLCAIADCPLESIETSD 180

Query: 181 CSSFSCPPFYYAYGDGSLIARLYRDSLSLPAPAPSPAINVRNFTFGCAHTTLGEPVGVAG 240
           C SFSCP FYYAYGDGSL+ARLY DS+ LP   PS  +++ NFTFGCAHT L EPVGVAG
Sbjct: 181 CHSFSCPSFYYAYGDGSLVARLYHDSIKLPLATPS--LSLHNFTFGCAHTALAEPVGVAG 240

Query: 241 FGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILG-------RYFGGET 300
           FGRG LS+P+QLA+F+PQLGNRFSYCLVSHSF +DR+R PSPLILG       R    + 
Sbjct: 241 FGRGVLSLPAQLASFAPQLGNRFSYCLVSHSFNSDRLRLPSPLILGHSDDKEKRVNKDDV 300

Query: 301 EFVYTSLLENPKHPYFYSVGLAGISVGNVRIPAPEFLKKVDEGGSGGVVVDSGTTFTMLP 360
           +FVYTS+L+NPKHPYFY VGL GIS+G  +IPAPEFLK+VD  GSGGVVVDSGTTFTMLP
Sbjct: 301 QFVYTSMLDNPKHPYFYCVGLEGISIGKKKIPAPEFLKRVDREGSGGVVVDSGTTFTMLP 360

Query: 361 AGLYDSVVAEFENRTGKVANRARRIEENIGLSPCYYYENSVEVPRVVLHFVGEKSSVVLP 420
           A LY+SVVAEF+NR G+V  RA+ +E+  GL PCYYY+  V +P +VLHFVG +SSVVLP
Sbjct: 361 ASLYNSVVAEFDNRVGRVYERAKEVEDKTGLGPCYYYDTVVNIPSLVLHFVGNESSVVLP 420

Query: 421 RKNYFYEFVDGGDGVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLENN 480
           +KNYFY+F+DGGDGV RKR+VGCLMLMNGG+EAEL GGPGATLGNYQQ GFEVVYDLE  
Sbjct: 421 KKNYFYDFLDGGDGVRRKRRVGCLMLMNGGEEAELTGGPGATLGNYQQHGFEVVYDLEQR 480

BLAST of Lsi07G013020 vs. TAIR10
Match: AT4G16563.1 (AT4G16563.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 591.7 bits (1524), Expect = 4.2e-169
Identity = 298/478 (62.34%), Postives = 357/478 (74.69%), Query Frame = 1

Query: 24  LLLPLSHSLLSSISDFNNTHNLLKSTAARSSARFHHHRRAHHRNHLSLPLSPGGDYTLSF 83
           LLL LSHSL +S    +  H LLKS+++RSSARF  H     +  LSLP+S G DY +S 
Sbjct: 29  LLLHLSHSLSTSKHSSSPLH-LLKSSSSRSSARFRRHHHKQQQQQLSLPISSGSDYLISL 88

Query: 84  NLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNNKS-VSCSAAACS 143
           ++GS S  +SLY+DTGSDLVWFPC PF CILCE KP   SP   +S++ + VSCS+ +CS
Sbjct: 89  SVGSSSSAVSLYLDTGSDLVWFPCRPFTCILCESKPLPPSPPSSLSSSATTVSCSSPSCS 148

Query: 144 AAHGGSLSASHLCAISQCPLESIEISEC--SSFSCPPFYYAYGDGSLIARLYRDSLSLPA 203
           AAH  SL +S LCAIS CPL+ IE  +C  SS+ CPPFYYAYGDGSL+A+LY DSLSLP+
Sbjct: 149 AAHS-SLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSLVAKLYSDSLSLPS 208

Query: 204 PAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHS 263
                 ++V NFTFGCAHTTL EP+GVAGFGRG LS+P+QLA  SP LGN FSYCLVSHS
Sbjct: 209 ------VSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHS 268

Query: 264 FAADRVRRPSPLILGRYFGGE--------------------TEFVYTSLLENPKHPYFYS 323
           F +DRVRRPSPLILGR+   +                     EFV+T +LENPKHPYFYS
Sbjct: 269 FDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPKHPYFYS 328

Query: 324 VGLAGISVGNVRIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTGKV 383
           V L GIS+G   IPAP  L+++D+ G GGVVVDSGTTFTMLPA  Y+SVV EF++R G+V
Sbjct: 329 VSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRV 388

Query: 384 ANRARRIEENIGLSPCYYYENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFVDGGDGVGRK 443
             RA R+E + G+SPCYY   +V+VP +VLHF G +SSV LPR+NYFYEF+DGGDG   K
Sbjct: 389 HERADRVEPSSGMSPCYYLNQTVKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEK 448

Query: 444 RKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLENNRVGFARRQCSTLWDSL 479
           RK+GCLMLMNGGDE+EL GG GA LGNYQQQGFEVVYDL N RVGFA+R+C++LWDSL
Sbjct: 449 RKIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCASLWDSL 498

BLAST of Lsi07G013020 vs. TAIR10
Match: AT5G45120.1 (AT5G45120.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 191.4 bits (485), Expect = 1.3e-48
Identity = 156/495 (31.52%), Postives = 228/495 (46.06%), Query Frame = 1

Query: 5   VFVFLLCFLLSSPVFSSQILLLPLSHSLLSSISDFNNTHNLLKSTAARSSARFHHHRRAH 64
           +F+FLL  LL +    +Q        S  SS      T + +     +S  +    +   
Sbjct: 8   LFLFLLITLLLNTTNKTQARQHKNPSSSSSSFLVLTLTKSSVSLPTPKSQTQERIKKPLS 67

Query: 65  HRNHLSLPLSPGGD-YTLSFNLGSESHKISLYMDTGSDLVWFPCS--PFECILCEG---- 124
             + +  PL    D Y ++ N+G+    + +Y+DTGSDL W PC    F+CI C      
Sbjct: 68  SVDVVMEPLREVRDGYLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNN 127

Query: 125 ---KPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISQCPLESIEISECSSFS 184
               P + SPL   ++ +  SC+++ C   H    +    CA++ C +  +  S C    
Sbjct: 128 DLKSPSVFSPLHSSTSFRD-SCASSFCVEIHSSD-NPFDPCAVAGCSVSMLLKSTCVR-P 187

Query: 185 CPPFYYAYGDGSLIAR-LYRDSLSLPAPAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRG 244
           CP F Y YG+G LI+  L RD L       +   +V  F+FGC  +T  EP+G+AGFGRG
Sbjct: 188 CPSFAYTYGEGGLISGILTRDILK------ARTRDVPRFSFGCVTSTYREPIGIAGFGRG 247

Query: 245 TLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGR---YFGGETEFVYTSLL 304
            LS+PSQL      L   FS+C +   F  +     SPLILG             +T +L
Sbjct: 248 LLSLPSQLGF----LEKGFSHCFLPFKF-VNNPNISSPLILGASALSINLTDSLQFTPML 307

Query: 305 ENPKHPYFYSVGLAGISVGNVRIP--APEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDS 364
             P +P  Y +GL  I++G    P   P  L++ D  G+GG++VDSGTT+T LP   Y  
Sbjct: 308 NTPMYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQ 367

Query: 365 VVAEFENRTGKVANRARRIEENIGLSPCY----------YYENSVEV--PRVVLHFVGEK 424
           ++   ++       RA   E   G   CY            EN V +  P +  HF+   
Sbjct: 368 LLTTLQSTI--TYPRATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFL-NN 427

Query: 425 SSVVLPRKNYFYEFVDGGDGVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVV 472
           ++++LP+ N FY      DG      V CL+  N  D      GP    G++QQQ  +VV
Sbjct: 428 ATLLLPQGNSFYAMSAPSDG----SVVQCLLFQNMEDGDY---GPAGVFGSFQQQNVKVV 478

BLAST of Lsi07G013020 vs. TAIR10
Match: AT3G52500.1 (AT3G52500.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 184.1 bits (466), Expect = 2.0e-46
Identity = 157/502 (31.27%), Postives = 228/502 (45.42%), Query Frame = 1

Query: 1   MASPVFVFLLCFLLSSPVFSSQILLLPLSHSLLSSISDFNNTHNLLKSTAARSSARFHHH 60
           MAS +F F L FL  S V + ++ L P SHS  S    + +   L +S+ AR+    H  
Sbjct: 1   MASSIFFFFLIFL--SVVSAVKLPLSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGT 60

Query: 61  RRAHHRNHLSL-----------PLSPG--GDYTLSFNLGSESHKISLYMDTGSDLVWFPC 120
                 + LS            PLS    G Y++S + G+ S  I    DTGS LVW PC
Sbjct: 61  SIKPDEDALSSTTTASATVVKSPLSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWLPC 120

Query: 121 -SPFECILCEGKPKIQSPLPKI-----SNNKSVSCSAAACSAAHGGSLSASHLCAISQCP 180
            S + C  C+      + +P+      S++K + C +  C   +G ++         QC 
Sbjct: 121 TSRYLCSGCDFSGLDPTLIPRFIPKNSSSSKIIGCQSPKCQFLYGPNV---------QCR 180

Query: 181 LESIEISECSSFSCPPFYYAYGDGSLIARLYRDSLSLPAPAPSPAINVRNFTFGCAHTTL 240
                   C+   CPP+   YG GS    L  + L  P       + V +F  GC+  + 
Sbjct: 181 GCDPNTRNCT-VGCPPYILQYGLGSTAGVLITEKLDFPD------LTVPDFVVGCSIIST 240

Query: 241 GEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILG--RYFG 300
            +P G+AGFGRG +S+PSQ+         RFS+CLVS  F    V     L  G     G
Sbjct: 241 RQPAGIAGFGRGPVSLPSQMNL------KRFSHCLVSRRFDDTNVTTDLDLDTGSGHNSG 300

Query: 301 GETE-FVYTSLLENPKHPY-----FYSVGLAGISVGNVRIPAPEFLKKVDEGGSGGVVVD 360
            +T    YT   +NP         +Y + L  I VG   +  P         G GG +VD
Sbjct: 301 SKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVKIPYKYLAPGTNGDGGSIVD 360

Query: 361 SGTTFTMLPAGLYDSVVAEFENRTGKVANRARRIEENIGLSPCYYY--ENSVEVPRVVLH 420
           SG+TFT +   +++ V  EF ++      R + +E+  GL PC+    +  V VP ++  
Sbjct: 361 SGSTFTFMERPVFELVAEEFASQMSNYT-REKDLEKETGLGPCFNISGKGDVTVPELIFE 420

Query: 421 FVGEKSSVVLPRKNYFYEFVDGGDGVGRKRKVGCLMLMNGGDEAELAG-GPGATLGNYQQ 473
           F G  + + LP  NYF  FV   D V       CL +++        G GP   LG++QQ
Sbjct: 421 FKGG-AKLELPLSNYF-TFVGNTDTV-------CLTVVSDKTVNPSGGTGPAIILGSFQQ 468

BLAST of Lsi07G013020 vs. TAIR10
Match: AT1G25510.1 (AT1G25510.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 153.3 bits (386), Expect = 3.8e-37
Identity = 126/405 (31.11%), Postives = 186/405 (45.93%), Query Frame = 1

Query: 77  GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNNKSVSC 136
           G+Y     +G  + ++ + +DTGSD+ W  C+P  C  C  + +        S+ + +SC
Sbjct: 146 GEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTP--CADCYHQTEPIFEPSSSSSYEPLSC 205

Query: 137 SAAACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYYAYGDGSL-IARLYRDS 196
               C+A                     +E+SEC + +C  +  +YGDGS  +     ++
Sbjct: 206 DTPQCNA---------------------LEVSECRNATCL-YEVSYGDGSYTVGDFATET 265

Query: 197 LSLPAPAPSPAINVRNFTFGCAHTTLGEPVGVAGF---GRGTLSMPSQLATFSPQLGNRF 256
           L++ +        V+N   GC H+  G  VG AG    G G L++PSQL T S      F
Sbjct: 266 LTIGSTL------VQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTS------F 325

Query: 257 SYCLVSH-SFAADRVRRPSPLILGRYFGGETEFVYTSLLENPKHPYFYSVGLAGISVGNV 316
           SYCLV   S +A  V   + L          + V   LL N +   FY +GL GISVG  
Sbjct: 326 SYCLVDRDSDSASTVDFGTSL--------SPDAVVAPLLRNHQLDTFYYLGLTGISVGGE 385

Query: 317 RIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTGKVANRARRIEENI 376
            +  P+   ++DE GSGG+++DSGT  T L   +Y+S+   F   T         +E+  
Sbjct: 386 LLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGT-------LDLEKAA 445

Query: 377 GLS---PCYYY--ENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFVDGGDGVGRKRKVGCL 436
           G++    CY    + +VEVP V  HF G K  + LP KNY    VD          VG  
Sbjct: 446 GVAMFDTCYNLSAKTTVEVPTVAFHFPGGK-MLALPAKNYMIP-VD---------SVGTF 483

Query: 437 MLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLENNRVGFARRQC 472
            L      + L     A +GN QQQG  V +DL N+ +GF+  +C
Sbjct: 506 CLAFAPTASSL-----AIIGNVQQQGTRVTFDLANSLIGFSSNKC 483

BLAST of Lsi07G013020 vs. TAIR10
Match: AT3G61820.1 (AT3G61820.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 151.8 bits (382), Expect = 1.1e-36
Identity = 129/407 (31.70%), Postives = 176/407 (43.24%), Query Frame = 1

Query: 73  LSPG-GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNN 132
           LS G G+Y +   +G+ +  + + +DTGSD+VW  CSP  C  C  +        K    
Sbjct: 128 LSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSP--CKACYNQTDAIFDPKKSKTF 187

Query: 133 KSVSCSAAACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYYAYGDGSLIARL 192
            +V C +  C                     +S E     S +C  +  +YGDGS     
Sbjct: 188 ATVPCGSRLCRRLD-----------------DSSECVTRRSKTCL-YQVSYGDGSFTEGD 247

Query: 193 YR-DSLSLPAPAPSPAINVRNFTFGCAHTTLGEPVGVAGF---GRGTLSMPSQLATFSPQ 252
           +  ++L+           V +   GC H   G  VG AG    GRG LS PSQ      +
Sbjct: 248 FSTETLTFHGA------RVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKN---R 307

Query: 253 LGNRFSYCLVSHSFAADRVRRPSPLILGRYFGGETEFVYTSLLENPKHPYFYSVGLAGIS 312
              +FSYCLV  + +    + PS ++ G     +T  V+T LL NPK   FY + L GIS
Sbjct: 308 YNGKFSYCLVDRTSSGSSSKPPSTIVFGNAAVPKTS-VFTPLLTNPKLDTFYYLQLLGIS 367

Query: 313 VGNVRIPA-PEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTGKVANRARR 372
           VG  R+P   E   K+D  G+GGV++DSGT+ T L    Y ++   F  R G  A + +R
Sbjct: 368 VGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAF--RLG--ATKLKR 427

Query: 373 IEENIGLSPCYYYEN--SVEVPRVVLHFVGEKSSVVLPRKNYFYEFVDGGDGVGRKRKVG 432
                    C+      +V+VP VV HF G    V LP  NY                 G
Sbjct: 428 APSYSLFDTCFDLSGMTTVKVPTVVFHFGG--GEVSLPASNYLIPV----------NTEG 483

Query: 433 CLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLENNRVGFARRQC 472
                  G    L+      +GN QQQGF V YDL  +RVGF  R C
Sbjct: 488 RFCFAFAGTMGSLS-----IIGNIQQQGFRVAYDLVGSRVGFLSRAC 483

BLAST of Lsi07G013020 vs. NCBI nr
Match: gi|449458942|ref|XP_004147205.1| (PREDICTED: aspartic proteinase nepenthesin-1 [Cucumis sativus])

HSP 1 Score: 912.1 bits (2356), Expect = 4.0e-262
Identity = 451/480 (93.96%), Postives = 462/480 (96.25%), Query Frame = 1

Query: 3   SPVFVFLLCFLLSSPVFSSQILLLPLSHSLLSSISDFNNTHNLLKSTAARSSARFHHHRR 62
           SPVF+FLLCFLLSSPVFSSQI LLPLSHSL SSISDFNNTHNLLKSTA RSSARFH HR 
Sbjct: 4   SPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHRHRH 63

Query: 63  AHHRNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ 122
               NHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ
Sbjct: 64  ----NHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ 123

Query: 123 SPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYYAY 182
           SPLPKI+NNKSVSCSAAACSAAHGGSLSASHLCAIS+CPLESIEISECSSFSCPPFYYAY
Sbjct: 124 SPLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAY 183

Query: 183 GDGSLIARLYRDSLSLPAPAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRGTLSMPSQLA 242
           GDGSL+ARLYRDSLSLP PAPSP INVRNFTFGCAHTTLGEPVGVAGFGRG LSMPSQLA
Sbjct: 184 GDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLA 243

Query: 243 TFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYFGGETEFVYTSLLENPKHPYFYSVG 302
           TFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY+ GETEF+YTSLLENPKHPYFYSVG
Sbjct: 244 TFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVG 303

Query: 303 LAGISVGNVRIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTGKVAN 362
           LAGISVGN+RIPAPEFL KVDEGGSGGVVVDSGTTFTMLPAGLY+SVVAEFENRTGKVAN
Sbjct: 304 LAGISVGNIRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVAN 363

Query: 363 RARRIEENIGLSPCYYYENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFVDGGDG-VGRKR 422
           RARRIEEN GLSPCYYYENSV VPRVVLHFVGEKS+VVLPRKNYFYEF+DGGDG VGRKR
Sbjct: 364 RARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKR 423

Query: 423 KVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLENNRVGFARRQCSTLWDSLNRS 482
           KVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLE NRVGFARRQCSTLWD+LNRS
Sbjct: 424 KVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS 479

BLAST of Lsi07G013020 vs. NCBI nr
Match: gi|659095959|ref|XP_008448851.1| (PREDICTED: aspartic proteinase nepenthesin-1 [Cucumis melo])

HSP 1 Score: 908.3 bits (2346), Expect = 5.7e-261
Identity = 451/482 (93.57%), Postives = 461/482 (95.64%), Query Frame = 1

Query: 3   SPVFVFLLCFLLSSPVFSSQILLLPLSHSLLSSISDFNNTHNLLKSTAARSSARFHHHRR 62
           SPVF+FLLCFLLSSPVFSSQI LLPLSHSL SSISDFN+THNLLKSTA RSSARFH HR 
Sbjct: 4   SPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHRHRH 63

Query: 63  AHHRNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ 122
               NHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ
Sbjct: 64  ----NHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ 123

Query: 123 SPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYYAY 182
           SPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAIS+CPLESIEISECSSFSCPPFYYAY
Sbjct: 124 SPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAY 183

Query: 183 GDGSLIARLYRDSLSLPAPAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRGTLSMPSQLA 242
           GDGSL+ARLYRDSLSLP PAPSP INVRNFTFGCAHTTLGEPVGVAGFGRG LSMPSQLA
Sbjct: 184 GDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLA 243

Query: 243 TFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYFGGETEFVYTSLLENPKHPYFYSVG 302
           TFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY  GETEF+YTSLLENPKHPYFYSVG
Sbjct: 244 TFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSVG 303

Query: 303 LAGISVGNVRIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTGKVAN 362
           LAGISVGNVRIPAPEFL+KVDE GSGGVVVDSGTTFTMLP+GLY+SVVAEFENRTGKVAN
Sbjct: 304 LAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVAN 363

Query: 363 RARRIEENIGLSPCYYYENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFVDGGDG---VGR 422
           RARRIEEN GLSPCYYYENSV VPRVVLHFVGEKSSVVLPRKNYFYEF+DGGDG   VGR
Sbjct: 364 RARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEVGR 423

Query: 423 KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLENNRVGFARRQCSTLWDSLN 482
           KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLE NRVGFARRQCSTLWD+LN
Sbjct: 424 KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLN 481

BLAST of Lsi07G013020 vs. NCBI nr
Match: gi|645266261|ref|XP_008238534.1| (PREDICTED: aspartic proteinase nepenthesin-1 [Prunus mume])

HSP 1 Score: 662.5 bits (1708), Expect = 5.5e-187
Identity = 334/494 (67.61%), Postives = 392/494 (79.35%), Query Frame = 1

Query: 1   MASPVFVFLLCFLLSSPVFSSQILLLPLSHSLLSSISDFNNTHNLLKSTAARSSARFHHH 60
           MAS +F+ +LCF     + SSQ L LPL+H+L  S + FN+T +LLKST  RS+ RFHHH
Sbjct: 1   MASALFL-ILCFTHFF-LSSSQPLYLPLTHTL--SQTQFNSTQHLLKSTTTRSARRFHHH 60

Query: 61  RRAHHR--NHLSLPLSPGGDYTLSFNLGSESHK-ISLYMDTGSDLVWFPCSPFECILCEG 120
            R H+R  N +SLPL+PG DYTLSF L S   + +SLYMDTGSDLVWFPCSPFECILCEG
Sbjct: 61  HRRHNRQTNQVSLPLAPGSDYTLSFTLNSSPPQPVSLYMDTGSDLVWFPCSPFECILCEG 120

Query: 121 KPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPP 180
           KP   +P PKI  N +VSC + +CSAAH  SLS+++LCAIS CPL+SIEISECSSFSCPP
Sbjct: 121 KPNSTNPPPKIPKNAAVSCDSRSCSAAH-SSLSSANLCAISHCPLDSIEISECSSFSCPP 180

Query: 181 FYYAYGDGSLIARLYRDSLSLPAPAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRGTLSM 240
           FYYAY DGS IA+LY+ SLS+  P  +PA+ +RNFTFGC+H++LGEP+GVAGFGRG LS+
Sbjct: 181 FYYAYADGSFIAKLYKHSLSI--PMSTPALVLRNFTFGCSHSSLGEPIGVAGFGRGLLSL 240

Query: 241 PSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY----------FGGETEFVYT 300
           P+QL+TFSP L  +FSYCLVSHSF  DRVRRPSPLILG Y           GG  E+ YT
Sbjct: 241 PAQLSTFSPHLATQFSYCLVSHSFDQDRVRRPSPLILGPYDQKQKRFGDGAGGSVEYAYT 300

Query: 301 SLLENPKHPYFYSVGLAGISVGNVRIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYD 360
           S+L+NPKHPYFYS+GLAG+SVG    PAPE L+ VDE G+GG+VVDSGTTFTM P G Y+
Sbjct: 301 SMLDNPKHPYFYSIGLAGVSVGKKVFPAPEILQGVDENGNGGIVVDSGTTFTMFPQGFYN 360

Query: 361 SVVAEFENRTGKVANRARRIEENIGLSPCYYYENSVEVPRVVLHFVGEKSSVVLPRKNYF 420
           S+VAEF+ R G+V  RA R+E+  GL+PCYYYE  VEVP V LHF G KSSV+LPR+NYF
Sbjct: 361 SLVAEFDRRVGRVHERATRVEDETGLAPCYYYEKVVEVPAVTLHFAGNKSSVLLPRRNYF 420

Query: 421 YEFVDGGDGVGRKR-KVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLENNRVGF 480
           YEFVDGGDG GRKR KVGC MLMNGGDEAE++GGPG  LGNYQQQGFEVVYDLE  RVGF
Sbjct: 421 YEFVDGGDGAGRKRKKVGCWMLMNGGDEAEMSGGPGGILGNYQQQGFEVVYDLEKRRVGF 480

BLAST of Lsi07G013020 vs. NCBI nr
Match: gi|743864222|ref|XP_011031864.1| (PREDICTED: aspartic proteinase nepenthesin-1 [Populus euphratica])

HSP 1 Score: 661.0 bits (1704), Expect = 1.6e-186
Identity = 343/498 (68.88%), Postives = 391/498 (78.51%), Query Frame = 1

Query: 9   LLCFLLSSP---VFSSQILLLPLSHSLLSSISDFNNTHNLLKSTAARSSARFHHHRRAHH 68
           LLCF+L      + +SQ L LPL HSL  S + F +TH+LLKST+ RS+ARFHHH   HH
Sbjct: 8   LLCFILCFTHVFISTSQTLFLPLIHSL--SKTQFTSTHHLLKSTSTRSTARFHHHHHHHH 67

Query: 69  RN-------HLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGK 128
            N        +SLPLSPG DYTLSF + S+   ISLY+DTGSDLVWFPC PFECILCEGK
Sbjct: 68  NNKNSHKHRQVSLPLSPGSDYTLSFTINSQP--ISLYLDTGSDLVWFPCQPFECILCEGK 127

Query: 129 PKIQS----PLPKISNNKS-VSCSAAACSAAHGGSLSASHLCAISQCPLESIEISECSSF 188
            +  S    P PK+S   + VSC ++ACSA H  +L +S LCAIS CPLESIEIS+C   
Sbjct: 128 AENASLASTPPPKLSKTATPVSCKSSACSAVHS-NLPSSDLCAISNCPLESIEISDCRKH 187

Query: 189 SCPPFYYAYGDGSLIARLYRDSLSLPAPAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRG 248
           SCP FYYAYGDGSLIARLYRDS+ LP    +  I   NFTFGCAHTTL EP+GVAGFGRG
Sbjct: 188 SCPQFYYAYGDGSLIARLYRDSIRLPLSNQTNLI-FNNFTFGCAHTTLAEPIGVAGFGRG 247

Query: 249 TLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYFGGETE---------- 308
            LS+P+QLAT SPQLGN+FSYCLVSHSF +D VRRPSPLILGRY   E E          
Sbjct: 248 VLSLPAQLATLSPQLGNQFSYCLVSHSFDSDGVRRPSPLILGRYDHDEKERRVNGVKKPS 307

Query: 309 FVYTSLLENPKHPYFYSVGLAGISVGNVRIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPA 368
           FVYTS+L+NP+HPYFY VGL GIS+G  +IPAP+FL+KVD  GSGGVVVDSGTTFTMLPA
Sbjct: 308 FVYTSMLDNPRHPYFYCVGLEGISIGRKKIPAPDFLRKVDGEGSGGVVVDSGTTFTMLPA 367

Query: 369 GLYDSVVAEFENRTGKVANRARRIEENIGLSPCYYYENSV-EVPRVVLHFVGEKSSVVLP 428
            LYD +VAEFENR G+V  RA  IEEN GLSPCYY++N+V  VPRVVLHFVG  SSVVLP
Sbjct: 368 SLYDFIVAEFENRVGRVNERASVIEENTGLSPCYYFDNNVVNVPRVVLHFVGNGSSVVLP 427

Query: 429 RKNYFYEFVDGGDGVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLENN 481
           R+NYFYEF+DGGDG G+KRKVGCLMLMNGGDEAEL+GGPGATLGNYQQQGFEVVYDLEN 
Sbjct: 428 RRNYFYEFLDGGDGKGKKRKVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLENR 487

BLAST of Lsi07G013020 vs. NCBI nr
Match: gi|224074147|ref|XP_002304273.1| (aspartyl protease family protein [Populus trichocarpa])

HSP 1 Score: 659.4 bits (1700), Expect = 4.6e-186
Identity = 341/495 (68.89%), Postives = 392/495 (79.19%), Query Frame = 1

Query: 9   LLCFLLSSP---VFSSQILLLPLSHSLLSSISDFNNTHNLLKSTAARSSARFHHH---RR 68
           LLCF+L      + +SQ L LPL HSL  S + F +TH+LLKST+ RS+ RFHHH   + 
Sbjct: 8   LLCFILCFTHIFISTSQTLFLPLIHSL--SKTQFTSTHHLLKSTSTRSTTRFHHHHHNKN 67

Query: 69  AHHRNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ 128
           +H+   +SLPLSPG DYTLSF + S+   ISLY+DTGSDLVWFPC PFECILCEGK +  
Sbjct: 68  SHNHRQVSLPLSPGSDYTLSFTINSQP--ISLYLDTGSDLVWFPCQPFECILCEGKAENA 127

Query: 129 S----PLPKISNNKS-VSCSAAACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPP 188
           S    P PK+S   + VSC ++ACSA H  +L +S LCAIS CPLESIEIS+C   SCP 
Sbjct: 128 SLASTPPPKLSKTATPVSCKSSACSAVHS-NLPSSDLCAISNCPLESIEISDCRKHSCPQ 187

Query: 189 FYYAYGDGSLIARLYRDSLSLPAPAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRGTLSM 248
           FYYAYGDGSLIARLYRDS+ LP    +  I   NFTFGCAHTTL EP+GVAGFGRG LS+
Sbjct: 188 FYYAYGDGSLIARLYRDSIRLPLSNQTNLI-FNNFTFGCAHTTLAEPIGVAGFGRGVLSL 247

Query: 249 PSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYFGGETE----------FVYT 308
           P+QLAT SPQLGN+FSYCLVSHSF +DRVRRPSPLILGRY   E E          FVYT
Sbjct: 248 PAQLATLSPQLGNQFSYCLVSHSFDSDRVRRPSPLILGRYDHDEKERRVNGVKKPSFVYT 307

Query: 309 SLLENPKHPYFYSVGLAGISVGNVRIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYD 368
           S+L+NP+HPYFY VGL GIS+G  +IPAP+FL+KVD  GSGGVVVDSGTTFTMLPA LYD
Sbjct: 308 SMLDNPRHPYFYCVGLEGISIGRKKIPAPDFLRKVDRKGSGGVVVDSGTTFTMLPASLYD 367

Query: 369 SVVAEFENRTGKVANRARRIEENIGLSPCYYYENSV-EVPRVVLHFVGEKSSVVLPRKNY 428
            VVAEFENR G+V  RA  IEEN GLSPCYY++N+V  VPRVVLHFVG  SSVVLPR+NY
Sbjct: 368 FVVAEFENRVGRVNERASVIEENTGLSPCYYFDNNVVNVPRVVLHFVGNGSSVVLPRRNY 427

Query: 429 FYEFVDGGDGVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLENNRVGF 482
           FYEF+DGG G G+KRKVGCLMLMNGGDEAEL+GGPGATLGNYQQQGFEVVYDLEN RVGF
Sbjct: 428 FYEFLDGGHGKGKKRKVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLENRRVGF 487

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
ASP63_ARATH7.4e-16862.34Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana GN=At4g16563 PE=2 S... [more]
APF2_ARATH5.7e-3532.68Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1[more]
NEP1_NEPGR6.3e-3430.05Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1[more]
ASPG1_ARATH8.3e-3429.93Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ... [more]
AED3_ARATH1.4e-3328.67Aspartyl protease AED3 OS=Arabidopsis thaliana GN=AED3 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0L5I7_CUCSA2.8e-26293.96Pepsin A OS=Cucumis sativus GN=Csa_3G020060 PE=3 SV=1[more]
B9GYA7_POPTR3.2e-18668.89Aspartyl protease family protein OS=Populus trichocarpa GN=POPTR_0003s07390g PE=... [more]
M5WCX9_PRUPE5.2e-18466.40Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004852mg PE=3 SV=1[more]
B9NGC6_POPTR9.7e-18368.21Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s15870g PE=3 SV=1[more]
B9SSF8_RICCO1.3e-18266.06Pepsin A, putative OS=Ricinus communis GN=RCOM_1061010 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT4G16563.14.2e-16962.34 Eukaryotic aspartyl protease family protein[more]
AT5G45120.11.3e-4831.52 Eukaryotic aspartyl protease family protein[more]
AT3G52500.12.0e-4631.27 Eukaryotic aspartyl protease family protein[more]
AT1G25510.13.8e-3731.11 Eukaryotic aspartyl protease family protein[more]
AT3G61820.11.1e-3631.70 Eukaryotic aspartyl protease family protein[more]
Match NameE-valueIdentityDescription
gi|449458942|ref|XP_004147205.1|4.0e-26293.96PREDICTED: aspartic proteinase nepenthesin-1 [Cucumis sativus][more]
gi|659095959|ref|XP_008448851.1|5.7e-26193.57PREDICTED: aspartic proteinase nepenthesin-1 [Cucumis melo][more]
gi|645266261|ref|XP_008238534.1|5.5e-18767.61PREDICTED: aspartic proteinase nepenthesin-1 [Prunus mume][more]
gi|743864222|ref|XP_011031864.1|1.6e-18668.88PREDICTED: aspartic proteinase nepenthesin-1 [Populus euphratica][more]
gi|224074147|ref|XP_002304273.1|4.6e-18668.89aspartyl protease family protein [Populus trichocarpa][more]
The following terms have been associated with this gene:
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: INTERPRO
TermDefinition
IPR021109Peptidase_aspartic_dom_sf
IPR001969Aspartic_peptidase_AS
IPR001461Aspartic_peptidase_A1
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0030163 protein catabolic process
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
cellular_component GO:0009505 plant-type cell wall
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lsi07G013020.1Lsi07G013020.1mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 166..479
score: 5.4E-246coord: 1..141
score: 5.4E
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 330..341
scor
IPR021109Aspartic peptidase domainGENE3DG3DSA:2.40.70.10coord: 76..275
score: 7.1E-36coord: 282..476
score: 2.4
IPR021109Aspartic peptidase domainunknownSSF50630Acid proteasescoord: 74..476
score: 7.37
NoneNo IPR availablePANTHERPTHR13683:SF276SUBFAMILY NOT NAMEDcoord: 166..479
score: 5.4E-246coord: 1..141
score: 5.4E

The following gene(s) are paralogous to this gene:

None