Lsi01G007370 (gene) Bottle gourd (USVL1VR-Ls)

NameLsi01G007370
Typegene
OrganismLagenaria siceraria (Bottle gourd (USVL1VR-Ls))
DescriptionEukaryotic aspartyl protease family protein
Locationchr01 : 5817010 .. 5820173 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
TTGCAATGTCGCTTCGGAATCTGATTTTGTTGATGCTGATGGGGATTGGCGTTCACCAGGGAGTGTCGATTACGTTCACATCGAGGATACTTCACAGGTTCTCTGAGGAAATGAAAGCGCTTAGGGTTTCAGGGAGTACGAATACGAGTGTTCGAGCATCATGGCCTGAGAAGGGGAGCATGGAGTATTATCAGGAGCTTGTGAGTGGTGACTTCCAGAGGCAGAAGATGAAGCTTGGCTCTCGGTTTCAGTTGCTTTTCCCGTCTGAAGGCAGTAAAACCATTGCGCTGGGAAATGACTTTGGCTGGTCACCATTACTCTACCTTCTCCTTTTTTTTTTTTTGTTTCCCTTCCCTGTTCTTGCTATTTTCGTTCTTCCGTTTTCCATTTACTTTTCTGTTTGTTTTTCTCTTGACGCCATAATCCTCTGCAAACCTTTCTTTTCAGTAATCCCCCCCCCCCCCCCCCCCCCCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAAAAAAAAAAAAAAAAAAAACCCACCTCCTAATAGCCTTTTGCACTGTGGCACCTTTATCAGAAAATTTGTGTTATAAAATGTTATCCTATGCATTTGAAAATCAACTTATTATTTACGTATTATAGGTTGCATTACACCTGGATCGATATCGGGACACCGAGTGTTTCATTTCTGGTTGCATTGGATGCTGGAAGTGATCTACTTTGGGTTCCTTGTGATTGCATACAATGTGCTCCCTTGTCTGCAAGTTACTATGGCAGTCTGGTAGGCCTGCTTTTCAGAATCTGTTCTTTCTACATAGGCATTAATTACTTCTAATCCACGACTTATGCTTTATCGTCAAGCATATTGCCTTTTCGTTTCCGTTAGAGAAAGGGTCATGCTTGGACAAATTGTGTTCTGGAATTCAAAGTTTTGTAGTGCTTTTTTGCGGAAAAGCAAGGCATGGTCGGTTCCAACGGGAAAAAAAAAATGCCCAAGAGGAGACGGACATGAAAGGCTCTATCATCTGCATCTAAAGAAAAATGATTGGCTAACTCATTTTTGGCGCAAATGTATTAGTGGATTATGAAGTCTTTTTAATAAGACAGCAAGTTTAGCACTTATTTCCGATGATTGATTGGCAGGTGCTCCTGCTCCCTCATGTTTGGGGCATTATTTATAATGCAGGATAAAGATCTCAATGAATATCGTCCATCCAGTTCAAGCACGAGCAAGCATATATCTTGCAGCCATAATTTGTGCGAGTCAGGCCAAAGCTGCCAAAGTCCGAAGCAGTCATGTCCTTATGTTATTGACTACATTACTGAAAATACTTCAAGTTCAGGACTACTAATTCAGGATATGTTGCATCTTTCATCCGGTTGTGAGAATTCATCTAATTGTATGATTCAGGCTCCGGTCATTTTAGGGTATGACCCATTGATCCTATATTTTGTTGCTCTAATTGTTTGCTCTGTATGCCAACTGCAGCTACGAGGAAGGATGCTGTATTTATATTACCCTTTTTTGCTCTCTAATACAGGTGTGGTATGAAGCAAAGTGGTGGTTATCTAAGTGGAGTCGCTCCAGATGGTCTTTTTGGATTGGGGCTAGGAGAAATTTCTGTTCTTAGTTCCCTTGCGAAAGAAGAATTGGTGCAGAACTCTTTCTCGCTGTGTTTTAATGAGGATGGATCTGGCAGAATTTTTTTTGGGGACGAGGGACCAGCAAGTCAACAAATGACTTCATTTGTGCCGTTAGATGGGAAATAGTAATTACCTCACTGAATTATCTTCATGTTTATGATCATGGTTTGTTATATATGCTGACTGAAGTCCTTGTCATACCTGAATCTCTCAGTGAAACCTACATTGTCGGGGTGGAAGCATGTTGTATTGAGAATTCGTGCCTCAAGCAGACAAGTTTTAAAGCATTGATAGATAGTGGAACGTCATTTACGTATCTTCCAGAGGAAGCATATGAAAATATTGTGATGGAGGTACATTCTTTTTGTAAGCTTTGTACTTCATTTAGTTTGATTGTCCTTTAACCAGTTCCCCTTTTGATAGTTTGATAAAAGGTTAAACACTTCAAGCTCCGTCTCCTTTAAAGGATATCCGTGGAAGTATTGCTATAAGATCAGGTGACAACGATGAGATGGAAAACCTTTAAAATCCTTTTCCCTTTATAACTCTCTACCTTGTGCATGTCTAATGTTAATGTTATGGGTTTTTCCAACAGTGCAGACGCAATGCCAAAGGTTCCATCTGTGACATTGTTGTTCCCACAAAACAATAGCTTTGTGGTTCATGATCCCGTGTTCCCTATCTATGGCGATCAGGTAGATGAATTTTCCTTCTGAGAATGTTCAAGTTTACATGATTTGATTTGATGAATGTGGATGAAGGTGTGGACTTGATCTTTGAACTTTGTTATAATTAGCCATTAATAGAATAGCTTCCACTTTTAGTGGGGATAAAAGTACCCTTAAATATCAAACTTGGTATCTTAGTAGTGAAGTTGATAGCTTCTGAATACTCCAGAAAATAGAGCCCCCAAGGAAATGAAGAATAAGAACATC

mRNA sequence

TTGCAATGTCGCTTCGGAATCTGATTTTGTTGATGCTGATGGGGATTGGCGTTCACCAGGGAGTGTCGATTACGTTCACATCGAGGATACTTCACAGGTTCTCTGAGGAAATGAAAGCGCTTAGGGTTTCAGGGAGTACGAATACGAGTGTTCGAGCATCATGGCCTGAGAAGGGGAGCATGGAGTATTATCAGGAGCTTGTGAGTGGTGACTTCCAGAGGCAGAAGATGAAGCTTGGCTCTCGGTTTCAGTTGCTTTTCCCGTCTGAAGGCAGTAAAACCATTGCGCTGGGAAATGACTTTGGCTGGTCACCATTACTCTACCTTCTCCTTTTTTTTTTTTTGTTTCCCTTCCCTGTTCTTGCTATTTTCGTTCTTCCGTTTTCCATTTACTTTTCTGTTTGTTTTTCTCTTGACGCCATAATCCTCTGCAAACCTTTCTTTTCAAAAATTTGTGTTATAAAATGTTATCCTATGCATTTGAAAATCAACTTATTATTTACGTATTATAGGTTGCATTACACCTGGATCGATATCGGGACACCGAGTGTTTCATTTCTGGTTGCATTGGATGCTGGAAGTGATCTACTTTGGGTTCCTTGTGATTGCATACAATGTGCTCCCTTGTCTGCAAGTTACTATGGCAGTCTGGATAAAGATCTCAATGAATATCGTCCATCCAGTTCAAGCACGAGCAAGCATATATCTTGCAGCCATAATTTGTGCGAGTCAGGCCAAAGCTGCCAAAGTCCGAAGCAGTCATGTCCTTATGTTATTGACTACATTACTGAAAATACTTCAAGTTCAGGACTACTAATTCAGGATATGTTGCATCTTTCATCCGGTTGTGAGAATTCATCTAATTGTATGATTCAGGCTCCGGTCATTTTAGGGTGTGGTATGAAGCAAAGTGGTGGTTATCTAAGTGGAGTCGCTCCAGATGGTCTTTTTGGATTGGGGCTAGGAGAAATTTCTGTTCTTAGTTCCCTTGCGAAAGAAGAATTGGTGCAGAACTCTTTCTCGCTGTGTTTTAATGAGGATGGATCTGGCAGAATTTTTTTTGGGGACGAGGGACCAGCAAGTCAACAAATGACTTCATTTGTGCCGTTAGATGGGAAATATGAAACCTACATTGTCGGGGTGGAAGCATGTTGTATTGAGAATTCGTGCCTCAAGCAGACAAGTTTTAAAGCATTGATAGATAGTGGAACGTCATTTACGTATCTTCCAGAGGAAGCATATGAAAATATTGTGATGGAGTTTGATAAAAGGTTAAACACTTCAAGCTCCGTCTCCTTTAAAGGATATCCGTGGAAGTATTGCTATAAGATCAGTGCAGACGCAATGCCAAAGGTTCCATCTGTGACATTGTTGTTCCCACAAAACAATAGCTTTGTGGTTCATGATCCCGTGTTCCCTATCTATGGCGATCAGCTTCTGAATACTCCAGAAAATAGAGCCCCCAAGGAAATGAAGAATAAGAACATC

Coding sequence (CDS)

ATGTCGCTTCGGAATCTGATTTTGTTGATGCTGATGGGGATTGGCGTTCACCAGGGAGTGTCGATTACGTTCACATCGAGGATACTTCACAGGTTCTCTGAGGAAATGAAAGCGCTTAGGGTTTCAGGGAGTACGAATACGAGTGTTCGAGCATCATGGCCTGAGAAGGGGAGCATGGAGTATTATCAGGAGCTTGTGAGTGGTGACTTCCAGAGGCAGAAGATGAAGCTTGGCTCTCGGTTTCAGTTGCTTTTCCCGTCTGAAGGCAGTAAAACCATTGCGCTGGGAAATGACTTTGGCTGGTCACCATTACTCTACCTTCTCCTTTTTTTTTTTTTGTTTCCCTTCCCTGTTCTTGCTATTTTCGTTCTTCCGTTTTCCATTTACTTTTCTGTTTGTTTTTCTCTTGACGCCATAATCCTCTGCAAACCTTTCTTTTCAAAAATTTGTGTTATAAAATGTTATCCTATGCATTTGAAAATCAACTTATTATTTACGTATTATAGGTTGCATTACACCTGGATCGATATCGGGACACCGAGTGTTTCATTTCTGGTTGCATTGGATGCTGGAAGTGATCTACTTTGGGTTCCTTGTGATTGCATACAATGTGCTCCCTTGTCTGCAAGTTACTATGGCAGTCTGGATAAAGATCTCAATGAATATCGTCCATCCAGTTCAAGCACGAGCAAGCATATATCTTGCAGCCATAATTTGTGCGAGTCAGGCCAAAGCTGCCAAAGTCCGAAGCAGTCATGTCCTTATGTTATTGACTACATTACTGAAAATACTTCAAGTTCAGGACTACTAATTCAGGATATGTTGCATCTTTCATCCGGTTGTGAGAATTCATCTAATTGTATGATTCAGGCTCCGGTCATTTTAGGGTGTGGTATGAAGCAAAGTGGTGGTTATCTAAGTGGAGTCGCTCCAGATGGTCTTTTTGGATTGGGGCTAGGAGAAATTTCTGTTCTTAGTTCCCTTGCGAAAGAAGAATTGGTGCAGAACTCTTTCTCGCTGTGTTTTAATGAGGATGGATCTGGCAGAATTTTTTTTGGGGACGAGGGACCAGCAAGTCAACAAATGACTTCATTTGTGCCGTTAGATGGGAAATATGAAACCTACATTGTCGGGGTGGAAGCATGTTGTATTGAGAATTCGTGCCTCAAGCAGACAAGTTTTAAAGCATTGATAGATAGTGGAACGTCATTTACGTATCTTCCAGAGGAAGCATATGAAAATATTGTGATGGAGTTTGATAAAAGGTTAAACACTTCAAGCTCCGTCTCCTTTAAAGGATATCCGTGGAAGTATTGCTATAAGATCAGTGCAGACGCAATGCCAAAGGTTCCATCTGTGACATTGTTGTTCCCACAAAACAATAGCTTTGTGGTTCATGATCCCGTGTTCCCTATCTATGGCGATCAGCTTCTGAATACTCCAGAAAATAGAGCCCCCAAGGAAATGAAGAATAAGAACATC

Protein sequence

MSLRNLILLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWSPLLYLLLFFFLFPFPVLAIFVLPFSIYFSVCFSLDAIILCKPFFSKICVIKCYPMHLKINLLFTYYRLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDPVFPIYGDQLLNTPENRAPKEMKNKNI
BLAST of Lsi01G007370 vs. Swiss-Prot
Match: ASPL1_ARATH (Aspartic proteinase-like protein 1 OS=Arabidopsis thaliana GN=At5g10080 PE=1 SV=1)

HSP 1 Score: 391.3 bits (1004), Expect = 1.5e-107
Identity = 189/306 (61.76%), Postives = 236/306 (77.12%), Query Frame = 1

Query: 170 LHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL-DKDLNEYRPSSSS 229
           LHYTWIDIGTPSVSFLVALD GS+LLW+PC+C+QCAPL+++YY SL  KDLNEY PSSSS
Sbjct: 99  LHYTWIDIGTPSVSFLVALDTGSNLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSS 158

Query: 230 TSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCEN---SS 289
           TSK   CSH LC+S   C+SPK+ CPY ++Y++ NTSSSGLL++D+LHL+    N   + 
Sbjct: 159 TSKVFLCSHKLCDSASDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNG 218

Query: 290 NCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNED 349
           +  ++A V++GCG KQSG YL GVAPDGL GLG  EISV S L+K  L++NSFSLCF+E+
Sbjct: 219 SSSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEE 278

Query: 350 GSGRIFFGDEGPASQQMTSFVPLD-GKYETYIVGVEACCIENSCLKQTSFKALIDSGTSF 409
            SGRI+FGD GP+ QQ T F+ LD  KY  YIVGVEACCI NSCLKQTSF   IDSG SF
Sbjct: 279 DSGRIYFGDMGPSIQQSTPFLQLDNNKYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSF 338

Query: 410 TYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFV 469
           TYLPEE Y  + +E D+ +N +S  +F+G  W+YCY+ SA+  PKVP++ L F  NN+FV
Sbjct: 339 TYLPEEIYRKVALEIDRHINATSK-NFEGVSWEYCYESSAE--PKVPAIKLKFSHNNTFV 398

Query: 470 VHDPVF 471
           +H P+F
Sbjct: 399 IHKPLF 401

BLAST of Lsi01G007370 vs. Swiss-Prot
Match: APF1_ARATH (Aspartyl protease family protein 1 OS=Arabidopsis thaliana GN=APF1 PE=1 SV=1)

HSP 1 Score: 229.2 bits (583), Expect = 1.0e-58
Identity = 134/309 (43.37%), Postives = 181/309 (58.58%), Query Frame = 1

Query: 170 LHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSST 229
           LHY  + +GTPS  F+VALD GSDL W+PCDC  C     +  GS   DLN Y P++SST
Sbjct: 103 LHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGS-SLDLNIYSPNASST 162

Query: 230 SKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCENSSNCMI 289
           S  + C+  LC  G  C SP+  CPY I Y++  TSS+G+L++D+LHL S  ++S    I
Sbjct: 163 STKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSK--AI 222

Query: 290 QAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGR 349
            A V  GCG  Q+G +  G AP+GLFGLGL +ISV S LAKE +  NSFS+CF  DG+GR
Sbjct: 223 PARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGR 282

Query: 350 IFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPE 409
           I FGD+G   Q+ T  + +   + TY + V    +  +      F A+ DSGTSFTYL +
Sbjct: 283 ISFGDKGSVDQRETP-LNIRQPHPTYNITVTKISVGGN-TGDLEFDAVFDSGTSFTYLTD 342

Query: 410 EAYENIVMEF-----DKRLNTSSSVSFKGYPWKYCYKISADAMP-KVPSVTLLFPQNNSF 469
            AY  I   F     DKR  T+ S      P++YCY +S +    + P+V L     +S+
Sbjct: 343 AAYTLISESFNSLALDKRYQTTDS----ELPFEYCYALSPNKDSFQYPAVNLTMKGGSSY 402

Query: 470 VVHDPVFPI 473
            V+ P+  I
Sbjct: 403 PVYHPLVVI 402

BLAST of Lsi01G007370 vs. Swiss-Prot
Match: ASPL2_ARATH (Aspartic proteinase-like protein 2 OS=Arabidopsis thaliana GN=At1g65240 PE=1 SV=2)

HSP 1 Score: 107.8 bits (268), Expect = 3.3e-22
Identity = 90/315 (28.57%), Postives = 142/315 (45.08%), Query Frame = 1

Query: 170 LHYTWIDIGTPSVSFLVALDAGSDLLWVPC-DCIQCAPLSASYYGSLDKDLNEYRPSSSS 229
           L++T I +G+P   + V +D GSD+LW+ C  C +C   +     +L+  L+ +  ++SS
Sbjct: 73  LYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKT-----NLNFRLSLFDMNASS 132

Query: 230 TSKHISCSHNLC---ESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCENSS 289
           TSK + C  + C       SCQ P   C Y I Y  E+T S G  I+DML L     +  
Sbjct: 133 TSKKVGCDDDFCSFISQSDSCQ-PALGCSYHIVYADEST-SDGKFIRDMLTLEQVTGDLK 192

Query: 290 NCMIQAPVILGCGMKQSGGYLSG-VAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCF-N 349
              +   V+ GCG  QSG   +G  A DG+ G G    SVLS LA     +  FS C  N
Sbjct: 193 TGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDN 252

Query: 350 EDGSGRIFFGDEGPASQQMTSFVPLDGKYETYIVGVE----ACCIENSCLKQTSFKALID 409
             G G    G       + T  VP    Y   ++G++    +  +  S ++      ++D
Sbjct: 253 VKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIVRNGG--TIVD 312

Query: 410 SGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQ 469
           SGT+  Y P+  Y++++     R      +  + +    C+  S +     P V+  F  
Sbjct: 313 SGTTLAYFPKVLYDSLIETILARQPVKLHIVEETF---QCFSFSTNVDEAFPPVSFEFED 372

Query: 470 NNSFVV--HDPVFPI 473
           +    V  HD +F +
Sbjct: 373 SVKLTVYPHDYLFTL 375

BLAST of Lsi01G007370 vs. Swiss-Prot
Match: AED1_ARATH (Aspartyl protease AED1 OS=Arabidopsis thaliana GN=AED1 PE=2 SV=1)

HSP 1 Score: 102.1 bits (253), Expect = 1.8e-20
Identity = 86/293 (29.35%), Postives = 132/293 (45.05%), Query Frame = 1

Query: 175 IDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHIS 234
           I IGTP     +  D GSDL W      QC P   S Y   +    ++ PSSSST +++S
Sbjct: 136 IGIGTPKHDLSLVFDTGSDLTWT-----QCEPCLGSCYSQKEP---KFNPSSSSTYQNVS 195

Query: 235 CSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCENSSNCMIQAPVI 294
           CS  +CE  +SC +   +C Y I Y  + + + G L ++   L       +N  +   V 
Sbjct: 196 CSSPMCEDAESCSA--SNCVYSIVY-GDKSFTQGFLAKEKFTL-------TNSDVLEDVY 255

Query: 295 LGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLC---FNEDGSGRIF 354
            GCG + + G   GVA  GL GLG G++S+ +         N FS C   F  + +G + 
Sbjct: 256 FGCG-ENNQGLFDGVA--GLLGLGPGKLSLPAQTT--TTYNNIFSYCLPSFTSNSTGHLT 315

Query: 355 FGDEGPASQQMTSFVPLDGKYETYIVGVEACCI----ENSCLKQTSFK---ALIDSGTSF 414
           FG  G    +   F P+      +  G++   I    +   +   SF    A+IDSGT F
Sbjct: 316 FGSAG--ISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVF 375

Query: 415 TYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLF 458
           T LP + Y  +   F +++++  S S  G  +  CY  +       P++   F
Sbjct: 376 TRLPTKVYAELRSVFKEKMSSYKSTSGYGL-FDTCYDFTGLDTVTYPTIAFSF 402

BLAST of Lsi01G007370 vs. Swiss-Prot
Match: APCB1_ARATH (Aspartyl protease APCB1 OS=Arabidopsis thaliana GN=APCB1 PE=1 SV=1)

HSP 1 Score: 89.0 bits (219), Expect = 1.6e-16
Identity = 79/262 (30.15%), Postives = 121/262 (46.18%), Query Frame = 1

Query: 170 LHYTWIDIGTPSVS--FLVALDAGSDLLWVPCD--CIQCAPLSASYYGSLDKDLNEYRPS 229
           L+YT I +G P     + + +D GS+L W+ CD  C  CA  +   Y     +L      
Sbjct: 202 LYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKDNLVR---- 261

Query: 230 SSSTSKHISCSHN-LCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCENS 289
            SS +  +    N L E  ++C      C Y I+Y  +++ S G+L +D  HL     N 
Sbjct: 262 -SSEAFCVEVQRNQLTEHCENCHQ----CDYEIEY-ADHSYSMGVLTKDKFHLK--LHNG 321

Query: 290 SNCMIQAPVILGCGMKQSGGYLSGVAP-DGLFGLGLGEISVLSSLAKEELVQNSFSLCFN 349
           S  + ++ ++ GCG  Q G  L+ +   DG+ GL   +IS+ S LA   ++ N    C  
Sbjct: 322 S--LAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLA 381

Query: 350 ED--GSGRIFFGDEGPASQQMTSFVPL--DGKYETYIVGVEACCIENSCLKQTSF----- 409
            D  G G IF G +   S  MT +VP+  D + + Y + V         L          
Sbjct: 382 SDLNGEGYIFMGSDLVPSHGMT-WVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGENGRVG 441

Query: 410 KALIDSGTSFTYLPEEAYENIV 417
           K L D+G+S+TY P +AY  +V
Sbjct: 442 KVLFDTGSSYTYFPNQAYSQLV 448

BLAST of Lsi01G007370 vs. TrEMBL
Match: A0A0A0KGB2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G500670 PE=3 SV=1)

HSP 1 Score: 520.8 bits (1340), Expect = 1.8e-144
Identity = 253/261 (96.93%), Postives = 258/261 (98.85%), Query Frame = 1

Query: 216 DKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDML 275
           DKDLNEYRPSSSSTSKHISCSHNLC+SGQSCQSPKQSCPYVIDYITENTSSSGLLIQD+L
Sbjct: 10  DKDLNEYRPSSSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYITENTSSSGLLIQDVL 69

Query: 276 HLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQ 335
           HLSSGCENSSNC IQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQ
Sbjct: 70  HLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQ 129

Query: 336 NSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIENSCLKQTSFK 395
           NSFSLCFNEDGSGRIFFGDEGPASQQ TSFVPLDGKYETYIVGVEACCIENSCLKQTSFK
Sbjct: 130 NSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIENSCLKQTSFK 189

Query: 396 ALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTL 455
           ALIDSGTSFTYLPEEAYENIV+EFDKRLNT+S+VSFKGYPWKYCYKISADAMPKVPSVTL
Sbjct: 190 ALIDSGTSFTYLPEEAYENIVIEFDKRLNTTSAVSFKGYPWKYCYKISADAMPKVPSVTL 249

Query: 456 LFPQNNSFVVHDPVFPIYGDQ 477
           LFP NNSFVVHDPVFPIYGDQ
Sbjct: 250 LFPLNNSFVVHDPVFPIYGDQ 270

BLAST of Lsi01G007370 vs. TrEMBL
Match: M5WHG8_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003982mg PE=3 SV=1)

HSP 1 Score: 482.3 bits (1240), Expect = 7.3e-133
Identity = 223/307 (72.64%), Postives = 264/307 (85.99%), Query Frame = 1

Query: 170 LHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSST 229
           LHYTWIDIGTP+VSFLVALD GSDL WVPCDCIQCAPLSASYY +LD+DLNEY PS S+T
Sbjct: 109 LHYTWIDIGTPNVSFLVALDTGSDLFWVPCDCIQCAPLSASYYSTLDRDLNEYSPSGSNT 168

Query: 230 SKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCENSSNCMI 289
           SKH+SCSH LCESG +C+SPKQSCPY +DY TENTSSSGLL++D+LH ++G ++  N  I
Sbjct: 169 SKHVSCSHELCESGTNCKSPKQSCPYTVDYYTENTSSSGLLVEDILHFAAGGDDGPNTSI 228

Query: 290 QAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGR 349
           +APVI+GCGMKQSGGYL G+APDGL GLGLGEISV + LAK  L +NSFS+CF+ED SGR
Sbjct: 229 EAPVIIGCGMKQSGGYLDGIAPDGLLGLGLGEISVPTFLAKAGLTKNSFSMCFDEDDSGR 288

Query: 350 IFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPE 409
           +FFGD+GPA+QQ TSF+P +G YETYIVGVEACCIENSCLKQTSFKAL+DSGTSFT+LPE
Sbjct: 289 LFFGDQGPAAQQSTSFLPSNGNYETYIVGVEACCIENSCLKQTSFKALVDSGTSFTFLPE 348

Query: 410 EAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDPV 469
             Y+ I  EFD+++N ++  ++ G PWKYCY  S+  +PKVPSVTL+F  NNSFVVHDPV
Sbjct: 349 ALYDKISEEFDRQVN-ATITNYAGSPWKYCYNTSSQDLPKVPSVTLMFVANNSFVVHDPV 408

Query: 470 FPIYGDQ 477
           FPI G+Q
Sbjct: 409 FPINGNQ 414

BLAST of Lsi01G007370 vs. TrEMBL
Match: D7SW20_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0031g01170 PE=3 SV=1)

HSP 1 Score: 472.6 bits (1215), Expect = 5.8e-130
Identity = 222/310 (71.61%), Postives = 267/310 (86.13%), Query Frame = 1

Query: 167 YYRLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSS 226
           Y  LHYTWIDIGTP++SFLVALDAGSDLLW+PCDCIQCAPLSASYYGSLD+DLN+Y PS 
Sbjct: 96  YGWLHYTWIDIGTPNISFLVALDAGSDLLWIPCDCIQCAPLSASYYGSLDRDLNQYSPSG 155

Query: 227 SSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCENSSN 286
           SSTSKH+SCSH LCES  +C SPKQ CPY I+Y +ENTSSSGLLI+D+LHL+SG +++SN
Sbjct: 156 SSTSKHLSCSHQLCESSPNCDSPKQLCPYTINYYSENTSSSGLLIEDILHLTSGIDDASN 215

Query: 287 CMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDG 346
             ++APVI+GCGM+Q+GGYL GVAPDGL GLGLGEISV S L+K  LV+NSFSLCFN+D 
Sbjct: 216 SSVRAPVIIGCGMRQTGGYLDGVAPDGLMGLGLGEISVPSFLSKAGLVKNSFSLCFNDDD 275

Query: 347 SGRIFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTY 406
           SGRIFFGD+G A+QQ T F+P DGKYETYIVGVEACCI +SC+KQTSF+AL+DSG SFT+
Sbjct: 276 SGRIFFGDQGLATQQTTLFLPSDGKYETYIVGVEACCIGSSCIKQTSFRALVDSGASFTF 335

Query: 407 LPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVH 466
           LP+E+Y N+V EFDK++N ++  SF+GYPW+YCYK S+  + K PSV L F  NNSFVVH
Sbjct: 336 LPDESYRNVVDEFDKQVN-ATRFSFEGYPWEYCYKSSSKELLKNPSVILKFALNNSFVVH 395

Query: 467 DPVFPIYGDQ 477
           +PVF ++G Q
Sbjct: 396 NPVFVVHGYQ 404

BLAST of Lsi01G007370 vs. TrEMBL
Match: V4SW25_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10025730mg PE=3 SV=1)

HSP 1 Score: 469.9 bits (1208), Expect = 3.7e-129
Identity = 218/305 (71.48%), Postives = 262/305 (85.90%), Query Frame = 1

Query: 170 LHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSST 229
           LHYTWIDIGTP+VSFLVALDAGSDLLW+PCDC++CAPLSASYY SLD++LNEY PS+SST
Sbjct: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRNLNEYSPSASST 161

Query: 230 SKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCENSSNCMI 289
           SKH+SCSH LC+ G SCQ+PKQ CPY IDY TENTSSSGLL++D+LHL SG +N+    +
Sbjct: 162 SKHLSCSHRLCDLGTSCQNPKQPCPYTIDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221

Query: 290 QAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGR 349
           QA VI+GCGMKQSGGYL GVAPDGL GLGLGEISV S LAK  L++NSFS+CF+ED SGR
Sbjct: 222 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDEDDSGR 281

Query: 350 IFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPE 409
           IFFGD+GPA+QQ TSF+  +GKY TYI+GVE CCI +SCLKQT FKA++DSG+SFT+LP+
Sbjct: 282 IFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTIFKAIVDSGSSFTFLPK 341

Query: 410 EAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDPV 469
           + YE I  EFD+++N  +  SF+GYPWKYCYK S+  +PK+PSV L+FPQNNSFV ++PV
Sbjct: 342 DIYETIAAEFDRQVN-DTITSFEGYPWKYCYKSSSQRLPKLPSVKLMFPQNNSFVANNPV 401

Query: 470 FPIYG 475
           F IYG
Sbjct: 402 FVIYG 405

BLAST of Lsi01G007370 vs. TrEMBL
Match: W9RJK6_9ROSA (Aspartic proteinase-like protein 1 OS=Morus notabilis GN=L484_020133 PE=3 SV=1)

HSP 1 Score: 466.1 bits (1198), Expect = 5.4e-128
Identity = 216/309 (69.90%), Postives = 262/309 (84.79%), Query Frame = 1

Query: 170 LHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSST 229
           LHYTWI+IGTP+VSFLVALD GSDLLWVPCDC+QCAPLSASYY SLD+DLNEY PS SS+
Sbjct: 120 LHYTWINIGTPNVSFLVALDVGSDLLWVPCDCVQCAPLSASYYSSLDRDLNEYSPSGSSS 179

Query: 230 SKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCENSSNCMI 289
           S H+SCSH LC+ G SC+SPKQ CPYV+ Y TENTS+SGLL++D+LHL++   ++SN  +
Sbjct: 180 SNHLSCSHQLCQLGPSCKSPKQPCPYVVSYYTENTSTSGLLVEDILHLAAWGNDTSNSSV 239

Query: 290 QAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGR 349
           QAPVI+GCGMKQSGGYL GVAPDGL GLGLGEISV S L+K   V+NSFSLCF+ED SGR
Sbjct: 240 QAPVIIGCGMKQSGGYLDGVAPDGLMGLGLGEISVPSFLSKAGFVRNSFSLCFDEDNSGR 299

Query: 350 IFFGDEGPASQQMTSFVPL--DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYL 409
           IFFGD+GP +QQ TSF+P   +G Y+TYIVGV+A CI NSC+KQT FKAL+DSGTSFTY+
Sbjct: 300 IFFGDQGPVNQQSTSFLPSTPNGNYDTYIVGVQAFCIGNSCMKQTGFKALVDSGTSFTYV 359

Query: 410 PEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHD 469
           P+E YE +  EFD+R+N ++  S++GYPWKYCYK S+ A+PK+PSV L+FP NNSFVV  
Sbjct: 360 PQEVYEKVSKEFDRRVN-ATRTSYEGYPWKYCYKTSSQALPKIPSVVLVFPANNSFVVSY 419

Query: 470 PVFPIYGDQ 477
           P+FPI G++
Sbjct: 420 PIFPINGEE 427

BLAST of Lsi01G007370 vs. TAIR10
Match: AT5G10080.1 (AT5G10080.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 391.3 bits (1004), Expect = 8.5e-109
Identity = 189/306 (61.76%), Postives = 236/306 (77.12%), Query Frame = 1

Query: 170 LHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL-DKDLNEYRPSSSS 229
           LHYTWIDIGTPSVSFLVALD GS+LLW+PC+C+QCAPL+++YY SL  KDLNEY PSSSS
Sbjct: 99  LHYTWIDIGTPSVSFLVALDTGSNLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSS 158

Query: 230 TSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCEN---SS 289
           TSK   CSH LC+S   C+SPK+ CPY ++Y++ NTSSSGLL++D+LHL+    N   + 
Sbjct: 159 TSKVFLCSHKLCDSASDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNG 218

Query: 290 NCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNED 349
           +  ++A V++GCG KQSG YL GVAPDGL GLG  EISV S L+K  L++NSFSLCF+E+
Sbjct: 219 SSSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEE 278

Query: 350 GSGRIFFGDEGPASQQMTSFVPLD-GKYETYIVGVEACCIENSCLKQTSFKALIDSGTSF 409
            SGRI+FGD GP+ QQ T F+ LD  KY  YIVGVEACCI NSCLKQTSF   IDSG SF
Sbjct: 279 DSGRIYFGDMGPSIQQSTPFLQLDNNKYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSF 338

Query: 410 TYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFV 469
           TYLPEE Y  + +E D+ +N +S  +F+G  W+YCY+ SA+  PKVP++ L F  NN+FV
Sbjct: 339 TYLPEEIYRKVALEIDRHINATSK-NFEGVSWEYCYESSAE--PKVPAIKLKFSHNNTFV 398

Query: 470 VHDPVF 471
           +H P+F
Sbjct: 399 IHKPLF 401

BLAST of Lsi01G007370 vs. TAIR10
Match: AT4G35880.1 (AT4G35880.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 240.4 bits (612), Expect = 2.4e-63
Identity = 125/304 (41.12%), Postives = 186/304 (61.18%), Query Frame = 1

Query: 170 LHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSST 229
           LHYT + +GTP + F+VALD GSDL WVPCDC +CAP   + Y S + +L+ Y P  S+T
Sbjct: 106 LHYTTVKLGTPGMRFMVALDTGSDLFWVPCDCGKCAPTEGATYAS-EFELSIYNPKVSTT 165

Query: 230 SKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCENSSNCMI 289
           +K ++C+++LC     C     +CPY++ Y++  TS+SG+L++D++HL++  +N     +
Sbjct: 166 NKKVTCNNSLCAQRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPER--V 225

Query: 290 QAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGR 349
           +A V  GCG  QSG +L   AP+GLFGLG+ +ISV S LA+E LV +SFS+CF  DG GR
Sbjct: 226 EAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGR 285

Query: 350 IFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPE 409
           I FGD+G + Q+ T F  L+  +  Y + V    +  + L    F AL D+GTSFTYL +
Sbjct: 286 ISFGDKGSSDQEETPF-NLNPSHPNYNITVTRVRV-GTTLIDDEFTALFDTGTSFTYLVD 345

Query: 410 EAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPK-VPSVTLLFPQNNSFVVHDP 469
             Y  +   F  +            P++YCY +S DA    +PS++L    N+ F ++DP
Sbjct: 346 PMYTTVSESFHSQAQDKRHSPDSRIPFEYCYDMSNDANASLIPSLSLTMKGNSHFTINDP 404

Query: 470 VFPI 473
           +  I
Sbjct: 406 IIVI 404

BLAST of Lsi01G007370 vs. TAIR10
Match: AT2G17760.1 (AT2G17760.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 229.2 bits (583), Expect = 5.6e-60
Identity = 134/309 (43.37%), Postives = 181/309 (58.58%), Query Frame = 1

Query: 170 LHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSST 229
           LHY  + +GTPS  F+VALD GSDL W+PCDC  C     +  GS   DLN Y P++SST
Sbjct: 103 LHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGS-SLDLNIYSPNASST 162

Query: 230 SKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCENSSNCMI 289
           S  + C+  LC  G  C SP+  CPY I Y++  TSS+G+L++D+LHL S  ++S    I
Sbjct: 163 STKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSK--AI 222

Query: 290 QAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGR 349
            A V  GCG  Q+G +  G AP+GLFGLGL +ISV S LAKE +  NSFS+CF  DG+GR
Sbjct: 223 PARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGR 282

Query: 350 IFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPE 409
           I FGD+G   Q+ T  + +   + TY + V    +  +      F A+ DSGTSFTYL +
Sbjct: 283 ISFGDKGSVDQRETP-LNIRQPHPTYNITVTKISVGGN-TGDLEFDAVFDSGTSFTYLTD 342

Query: 410 EAYENIVMEF-----DKRLNTSSSVSFKGYPWKYCYKISADAMP-KVPSVTLLFPQNNSF 469
            AY  I   F     DKR  T+ S      P++YCY +S +    + P+V L     +S+
Sbjct: 343 AAYTLISESFNSLALDKRYQTTDS----ELPFEYCYALSPNKDSFQYPAVNLTMKGGSSY 402

Query: 470 VVHDPVFPI 473
            V+ P+  I
Sbjct: 403 PVYHPLVVI 402

BLAST of Lsi01G007370 vs. TAIR10
Match: AT3G51330.1 (AT3G51330.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 194.9 bits (494), Expect = 1.2e-49
Identity = 116/310 (37.42%), Postives = 166/310 (53.55%), Query Frame = 1

Query: 170 LHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYG-SLDKDLNEYRPSSSS 229
           LHY  + +GTP+  FLVALD GSDL W+PC+C           G S  + LN Y P++SS
Sbjct: 101 LHYANVSVGTPATWFLVALDTGSDLFWLPCNCGSTCIRDLKEVGLSQSRPLNLYSPNTSS 160

Query: 230 TSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCENSSNCM 289
           TS  I CS + C     C SP  SCPY I Y++++T ++G L +D+LHL +  E+     
Sbjct: 161 TSSSIRCSDDRCFGSSRCSSPASSCPYQIQYLSKDTFTTGTLFEDVLHLVT--EDEGLEP 220

Query: 290 IQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNE--DG 349
           ++A + LGCG  Q+G   S  A +GL GLGL + SV S LAK ++  NSFS+CF    D 
Sbjct: 221 VKANITLGCGKNQTGFLQSSAAVNGLLGLGLKDYSVPSILAKAKITANSFSMCFGNIIDV 280

Query: 350 SGRIFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTY 409
            GRI FGD+G   Q  T  +P +    TY V V    +    +      AL D+GTSFT+
Sbjct: 281 VGRISFGDKGYTDQMETPLLPTEPS-PTYAVSVTEVSVGGDAV-GVQLLALFDTGTSFTH 340

Query: 410 LPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKV-PSVTLLFPQNNSFVV 469
           L E  Y  I   FD  +           P+++CY +S +    + P V + F   +   +
Sbjct: 341 LLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCYDLSPNKTTILFPRVAMTFEGGSQMFL 400

Query: 470 HDPVFPIYGD 476
            +P+F ++ +
Sbjct: 401 RNPLFIVWNE 406

BLAST of Lsi01G007370 vs. TAIR10
Match: AT3G51350.1 (AT3G51350.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 176.8 bits (447), Expect = 3.3e-44
Identity = 110/305 (36.07%), Postives = 161/305 (52.79%), Query Frame = 1

Query: 170 LHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDK-DLNEYRPSSSS 229
           L+Y  + +GTP  SFLVALD GSDL W+PC+C           G      LN Y P++S+
Sbjct: 101 LYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRDLEDIGVPQSVPLNLYTPNAST 160

Query: 230 TSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCENSSNCM 289
           TS  I CS   C   + C SP   CPY I Y + +T + G L+QD+LHL++  EN +   
Sbjct: 161 TSSSIRCSDKRCFGSKKCSSPSSICPYQISY-SNSTGTKGTLLQDVLHLATEDENLTP-- 220

Query: 290 IQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNE--DG 349
           ++A V LGCG KQ+G +    + +G+ GLG+   SV S LAK  +  NSFS+CF      
Sbjct: 221 VKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITANSFSMCFGRVIGN 280

Query: 350 SGRIFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTY 409
            GRI FGD G   Q+ T F+ +      Y V +    +    +    F A  D+G+SFT+
Sbjct: 281 VGRISFGDRGYTDQEETPFISV-APSTAYGVNISGVSVAGDPVDIRLF-AKFDTGSSFTH 340

Query: 410 LPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMP-KVPSVTLLFPQNNSFVV 469
           L E AY  +   FD+ +           P+++CY +S +A   + P V + F   +  ++
Sbjct: 341 LREPAYGVLTKSFDELVEDRRRPVDPELPFEFCYDLSPNATTIQFPLVEMTFIGGSKIIL 400

Query: 470 HDPVF 471
           ++P F
Sbjct: 401 NNPFF 400

BLAST of Lsi01G007370 vs. NCBI nr
Match: gi|659080156|ref|XP_008440641.1| (PREDICTED: aspartic proteinase-like protein 1 isoform X1 [Cucumis melo])

HSP 1 Score: 620.2 bits (1598), Expect = 3.2e-174
Identity = 299/307 (97.39%), Postives = 305/307 (99.35%), Query Frame = 1

Query: 170 LHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSST 229
           LHYTWIDIGTPSVSFLVALDAGSDLLW+PC+CIQCAPLSASYYGSLDKDLNEYRPSSSST
Sbjct: 102 LHYTWIDIGTPSVSFLVALDAGSDLLWIPCNCIQCAPLSASYYGSLDKDLNEYRPSSSST 161

Query: 230 SKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCENSSNCMI 289
           SKHISCSHNLC+SGQSCQSPKQSCPYVIDYITENTSSSGLLIQD+LHLSSGCENSSNC I
Sbjct: 162 SKHISCSHNLCDSGQSCQSPKQSCPYVIDYITENTSSSGLLIQDVLHLSSGCENSSNCTI 221

Query: 290 QAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGR 349
           QAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGR
Sbjct: 222 QAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGR 281

Query: 350 IFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPE 409
           IFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPE
Sbjct: 282 IFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPE 341

Query: 410 EAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDPV 469
           EAYENIVMEFDKRLNT+S+VSFKGYPWKYCYKISADAMPKVPSVTLLFP NNSFVVHDPV
Sbjct: 342 EAYENIVMEFDKRLNTTSAVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDPV 401

Query: 470 FPIYGDQ 477
           FPIYGDQ
Sbjct: 402 FPIYGDQ 408

BLAST of Lsi01G007370 vs. NCBI nr
Match: gi|778719260|ref|XP_004143563.2| (PREDICTED: aspartic proteinase-like protein 1 isoform X1 [Cucumis sativus])

HSP 1 Score: 616.7 bits (1589), Expect = 3.5e-173
Identity = 298/307 (97.07%), Postives = 304/307 (99.02%), Query Frame = 1

Query: 170 LHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSST 229
           LHYTWIDIGTPSVSFLVALDAGSDLLWVPC+CIQCAPLSASYYGSLDKDLNEYRPSSSST
Sbjct: 102 LHYTWIDIGTPSVSFLVALDAGSDLLWVPCNCIQCAPLSASYYGSLDKDLNEYRPSSSST 161

Query: 230 SKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCENSSNCMI 289
           SKHISCSHNLC+SGQSCQSPKQSCPYVIDYITENTSSSGLLIQD+LHLSSGCENSSNC I
Sbjct: 162 SKHISCSHNLCDSGQSCQSPKQSCPYVIDYITENTSSSGLLIQDVLHLSSGCENSSNCTI 221

Query: 290 QAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGR 349
           QAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGR
Sbjct: 222 QAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGR 281

Query: 350 IFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPE 409
           IFFGDEGPASQQ TSFVPLDGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPE
Sbjct: 282 IFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPE 341

Query: 410 EAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDPV 469
           EAYENIV+EFDKRLNT+S+VSFKGYPWKYCYKISADAMPKVPSVTLLFP NNSFVVHDPV
Sbjct: 342 EAYENIVIEFDKRLNTTSAVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDPV 401

Query: 470 FPIYGDQ 477
           FPIYGDQ
Sbjct: 402 FPIYGDQ 408

BLAST of Lsi01G007370 vs. NCBI nr
Match: gi|659080158|ref|XP_008440642.1| (PREDICTED: aspartic proteinase-like protein 1 isoform X2 [Cucumis melo])

HSP 1 Score: 525.8 bits (1353), Expect = 8.2e-146
Identity = 256/273 (93.77%), Postives = 264/273 (96.70%), Query Frame = 1

Query: 204 CAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITEN 263
           C+ +  + +   DKDLNEYRPSSSSTSKHISCSHNLC+SGQSCQSPKQSCPYVIDYITEN
Sbjct: 54  CSLMFGALFIMQDKDLNEYRPSSSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYITEN 113

Query: 264 TSSSGLLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEIS 323
           TSSSGLLIQD+LHLSSGCENSSNC IQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEIS
Sbjct: 114 TSSSGLLIQDVLHLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEIS 173

Query: 324 VLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPLDGKYETYIVGVEACC 383
           VLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPLDGKYETYIVGVEACC
Sbjct: 174 VLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPLDGKYETYIVGVEACC 233

Query: 384 IENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKIS 443
           IENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNT+S+VSFKGYPWKYCYKIS
Sbjct: 234 IENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTTSAVSFKGYPWKYCYKIS 293

Query: 444 ADAMPKVPSVTLLFPQNNSFVVHDPVFPIYGDQ 477
           ADAMPKVPSVTLLFP NNSFVVHDPVFPIYGDQ
Sbjct: 294 ADAMPKVPSVTLLFPLNNSFVVHDPVFPIYGDQ 326

BLAST of Lsi01G007370 vs. NCBI nr
Match: gi|778719263|ref|XP_011657988.1| (PREDICTED: aspartic proteinase-like protein 1 isoform X2 [Cucumis sativus])

HSP 1 Score: 521.9 bits (1343), Expect = 1.2e-144
Identity = 254/273 (93.04%), Postives = 263/273 (96.34%), Query Frame = 1

Query: 204 CAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITEN 263
           C+ +  + +   DKDLNEYRPSSSSTSKHISCSHNLC+SGQSCQSPKQSCPYVIDYITEN
Sbjct: 54  CSLMFGALFIMQDKDLNEYRPSSSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYITEN 113

Query: 264 TSSSGLLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEIS 323
           TSSSGLLIQD+LHLSSGCENSSNC IQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEIS
Sbjct: 114 TSSSGLLIQDVLHLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEIS 173

Query: 324 VLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPLDGKYETYIVGVEACC 383
           VLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQ TSFVPLDGKYETYIVGVEACC
Sbjct: 174 VLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACC 233

Query: 384 IENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKIS 443
           IENSCLKQTSFKALIDSGTSFTYLPEEAYENIV+EFDKRLNT+S+VSFKGYPWKYCYKIS
Sbjct: 234 IENSCLKQTSFKALIDSGTSFTYLPEEAYENIVIEFDKRLNTTSAVSFKGYPWKYCYKIS 293

Query: 444 ADAMPKVPSVTLLFPQNNSFVVHDPVFPIYGDQ 477
           ADAMPKVPSVTLLFP NNSFVVHDPVFPIYGDQ
Sbjct: 294 ADAMPKVPSVTLLFPLNNSFVVHDPVFPIYGDQ 326

BLAST of Lsi01G007370 vs. NCBI nr
Match: gi|700193571|gb|KGN48775.1| (hypothetical protein Csa_6G500670 [Cucumis sativus])

HSP 1 Score: 520.8 bits (1340), Expect = 2.6e-144
Identity = 253/261 (96.93%), Postives = 258/261 (98.85%), Query Frame = 1

Query: 216 DKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDML 275
           DKDLNEYRPSSSSTSKHISCSHNLC+SGQSCQSPKQSCPYVIDYITENTSSSGLLIQD+L
Sbjct: 10  DKDLNEYRPSSSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYITENTSSSGLLIQDVL 69

Query: 276 HLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQ 335
           HLSSGCENSSNC IQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQ
Sbjct: 70  HLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQ 129

Query: 336 NSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIENSCLKQTSFK 395
           NSFSLCFNEDGSGRIFFGDEGPASQQ TSFVPLDGKYETYIVGVEACCIENSCLKQTSFK
Sbjct: 130 NSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIENSCLKQTSFK 189

Query: 396 ALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTL 455
           ALIDSGTSFTYLPEEAYENIV+EFDKRLNT+S+VSFKGYPWKYCYKISADAMPKVPSVTL
Sbjct: 190 ALIDSGTSFTYLPEEAYENIVIEFDKRLNTTSAVSFKGYPWKYCYKISADAMPKVPSVTL 249

Query: 456 LFPQNNSFVVHDPVFPIYGDQ 477
           LFP NNSFVVHDPVFPIYGDQ
Sbjct: 250 LFPLNNSFVVHDPVFPIYGDQ 270

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
ASPL1_ARATH1.5e-10761.76Aspartic proteinase-like protein 1 OS=Arabidopsis thaliana GN=At5g10080 PE=1 SV=... [more]
APF1_ARATH1.0e-5843.37Aspartyl protease family protein 1 OS=Arabidopsis thaliana GN=APF1 PE=1 SV=1[more]
ASPL2_ARATH3.3e-2228.57Aspartic proteinase-like protein 2 OS=Arabidopsis thaliana GN=At1g65240 PE=1 SV=... [more]
AED1_ARATH1.8e-2029.35Aspartyl protease AED1 OS=Arabidopsis thaliana GN=AED1 PE=2 SV=1[more]
APCB1_ARATH1.6e-1630.15Aspartyl protease APCB1 OS=Arabidopsis thaliana GN=APCB1 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KGB2_CUCSA1.8e-14496.93Uncharacterized protein OS=Cucumis sativus GN=Csa_6G500670 PE=3 SV=1[more]
M5WHG8_PRUPE7.3e-13372.64Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003982mg PE=3 SV=1[more]
D7SW20_VITVI5.8e-13071.61Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0031g01170 PE=3 SV=... [more]
V4SW25_9ROSI3.7e-12971.48Uncharacterized protein OS=Citrus clementina GN=CICLE_v10025730mg PE=3 SV=1[more]
W9RJK6_9ROSA5.4e-12869.90Aspartic proteinase-like protein 1 OS=Morus notabilis GN=L484_020133 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT5G10080.18.5e-10961.76 Eukaryotic aspartyl protease family protein[more]
AT4G35880.12.4e-6341.12 Eukaryotic aspartyl protease family protein[more]
AT2G17760.15.6e-6043.37 Eukaryotic aspartyl protease family protein[more]
AT3G51330.11.2e-4937.42 Eukaryotic aspartyl protease family protein[more]
AT3G51350.13.3e-4436.07 Eukaryotic aspartyl protease family protein[more]
Match NameE-valueIdentityDescription
gi|659080156|ref|XP_008440641.1|3.2e-17497.39PREDICTED: aspartic proteinase-like protein 1 isoform X1 [Cucumis melo][more]
gi|778719260|ref|XP_004143563.2|3.5e-17397.07PREDICTED: aspartic proteinase-like protein 1 isoform X1 [Cucumis sativus][more]
gi|659080158|ref|XP_008440642.1|8.2e-14693.77PREDICTED: aspartic proteinase-like protein 1 isoform X2 [Cucumis melo][more]
gi|778719263|ref|XP_011657988.1|1.2e-14493.04PREDICTED: aspartic proteinase-like protein 1 isoform X2 [Cucumis sativus][more]
gi|700193571|gb|KGN48775.1|2.6e-14496.93hypothetical protein Csa_6G500670 [Cucumis sativus][more]
The following terms have been associated with this gene:
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: INTERPRO
TermDefinition
IPR021109Peptidase_aspartic_dom_sf
IPR001969Aspartic_peptidase_AS
IPR001461Aspartic_peptidase_A1
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0030163 protein catabolic process
biological_process GO:0006508 proteolysis
cellular_component GO:0046658 anchored component of plasma membrane
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lsi01G007370.1Lsi01G007370.1mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 396..407
score: 1.7E-5coord: 177..197
score: 1.
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 154..474
score: 1.4E-189coord: 1..89
score: 1.4E
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 396..407
scor
IPR021109Aspartic peptidase domainGENE3DG3DSA:2.40.70.10coord: 167..354
score: 2.8E-36coord: 364..465
score: 3.7
IPR021109Aspartic peptidase domainunknownSSF50630Acid proteasescoord: 165..467
score: 1.95
NoneNo IPR availablePANTHERPTHR13683:SF303ASPARTIC PROTEINASE-LIKE PROTEIN 1coord: 154..474
score: 1.4E-189coord: 1..89
score: 1.4E