Lsi10G007850 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi10G007850
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Eukaryotic aspartyl protease family protein
Location: chr10 : 10689408 .. 10692479 (-)
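
The location field packs chromosome, start, end and strand into one string. The following is a minimal Python sketch of how it could be unpacked and how the genomic span follows from it. The function names `parse_location` and `span_length` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re


def parse_location(text):
    """Split a 'chrom : start .. end (strand)' string into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("chr10 : 10689408 .. 10692479 (-)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the gene spans 3072 bp on the minus strand of chr10. A minus-strand feature is normally reported 5' to 3' along the gene, that is, as the reverse complement of the reference chromosome sequence.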
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGGAAAGGGGTATTGATGATATTGGTTCTGATGGTGGCCTCAATGAGCTGTTTGGCTCTGTGTTCAGCTTCTTCGTTCTTTAAGGATAAGCCATGGGAGAGAAGAAGGCCAATTCTGTCGGTTCCGACTGCATCTTCTTCGTTTGCTTCATCCTCCATTGTGTTGCCTCTTCAAGGGAACGTCTTTCCAAATGGGTAAGCCATAACTCTGCTCAAAGATAACGACGAAGGTTCAGCTTTTGTTTTTTCCCATTTCGATTTTACTCCACTGTTACAGAGTACGAAGTGGCTGAAGAGGAAGTTCAATGTGAATGTGTTGACTCCAAATATACTTTTCACATTGATAGTATTGTAATAATATCAGTAGGCCTTTCCCATTCTTAATCTTATTGCATCTTCTCTCACGTGTTAAATTGGAGAAAAAATGATTACGATGATTACGAGACTCTGTACTCTTATGTTTGCTTCTCTCGCGTATTATGCTATTCCTGTGCAATAGCTTTGGTGGAGTGTATTTTTGAGTGCTTTTCAGAATAGGGTTTGTAACTTTGTAGCAAAAGAACACATCCCCAAATTAAGTACTTCTAAAATCTTCAAACCCATTTTCCTGAATATTTTTAATTAGTTACAAACAGTTTGAAAATAAAATATATATTTTTTTGTTCTTGTTTGAGGCTAACTTCTTTACTATCAGATCCTAAATAATGATTTTCATCAATATGTTTTGAGTCATTTAATCCGTGTCTTTTTTCAGGTTCTATAACGTTACTCTTTATGTAGGGCAGCCTCCAAAGCCTTATTTTCTAGATCCAGACACCGGTAGTGATCTCACTTGGCTTCAATGTGACGCTCCATGTCAGCAGTGCACCGAGGTAAATTTTTATTTGATGGTAATTTGGAGATTCTTGTAAATTTGATGACCCAAATGATTGAAGATAGTTTACCAAAGTACAGCTTTGGCTTCATAAGTTAATTTTGATCATAAGTAATTCAAATTTTCATTTTTGAATATTCTATGTTGTTCTATGTTCCCTTATATTTCTTCTTGATGTGCTTAGTGCTTCTTCTTTCGCTCGGTCTCTCACTACCTTAAAAAATTATTGCATTTCTTTTGGAGTTGAGAGTTAAGGTTGATTGCTTCAAATTGATGGCCTTGTTTCTTTCCAGACACTTCATCCGCTCTATCAACCAAGCAACGATCTTGTGCCATGTAAGGACCCTTTGTGTATGTCCTTGCACTCATCTATGGACCACAAATGTGAGAACCCAGGTCAATGTGACTACGAGGTTGAGTATGCAGATGGCGGTTCTTCTCTTGGAGTCCTTGTCAGGGATGTATTTCCTCTCAACTTAACCAATGGAGATCCAATTAGACCGCGTTTGGCCCTCGGGTAAACTGCTCAACTGCCTTCAGTAATTCATCTTAATTCAGTGTACTGCTGAGTTCCTAGCTCTCTAAATCAACCTTGCTACTTCCAATTACAACCGCACATGAAGGTTTTGAGAAAGAAACATCATATCATGCTGTCGTATCTTATGTTTTGCACTTTCCTGACATTGCAAGATAATGAATAGCATGAATGTAAAGGTAGAGTTTATGTTGAATAACAAACGTGTGCATTCATGCTGATGGGTTACTTCAAGAAGACAATTAAGATAGCACCAGGCACTAGATTTTGTGACGGACATAATGATTTGGGTTTGTATTTTTCATAGTTATCAATAAATATTTGTACAAGAATTTCATGCGTATCTAGTCCTTACGAACCTTAGTTGTCAGCTCATGTGGTTGAATAGCATGCATGTAAAGTCGGTATGTAAGTTAGCTATTTGAATCTATAGTACTGAGTTTGGTTGGTTCCGTTTGATTACTCAGATGTGGTTATGATCAAGATCCTGGATCATCATCTTATCACCCCATGGATGGAATACTTGGCCTTGGACGGGGAGCAGTAAGCATGGTCTCACAACTGCATAATCAAGGCATTGTCCGTAATG
TCGTTGGTCACTGTTTCAGCAGCAAAGGAGGAGGATATCTTTTCTTTGGGGATGGCATTTATGATCCTTATCGCTTAGTTTGGACGCCCATGTCACGGGACTACCCGTAAACGACCTGCCTTGATCATATATATATATATATATACCATGTTATAATTCATGTTAGCTTCTGTTTTTATAAAACCTTCCTCATGTGACTGTATCAGGAAGCACTACTCCCCTGGGTTTGGAGAACTAATCTTCAATGGAAGAAGTACTGGACTCAGAAACCTGTTTGTAGTTTTTGACAGTGGGAGCTCTTACACATACTTCAATGCTCAGGCTTATCAAGTTTTAACATCTTTGGTAAATCATCTTTAAAAATAACTATTGACTGTGTCTGGGAAAAATAGATCTTTCTGGCTCTTGAAAACTGATGGTTAATACACAATGATTCTTCAATTCATGTTGGCACTGTTCTTGTAGTTAAATAGAGAACTAACTGGAAAACCGCTAAGAGAAGCCATGGACGACGATACGCTTCCGCTCTGTTGGAGAGGGCGGAAGCCATTCAAAAGCTTACGTGATGTGAGAAAATATTTCAAGCCATTGGCCTTGAGCTTTTCCAGTGGTGGAAGAACCAAAGCAGTGTTTGAAATACCAATGGAAGGTTATCTGATAATATCGGTAAAAGCTCCAACTGCTATTCACTACACATTGTCTCTTTGGATTTTCAAGTTGAATAATTACACTTATAAGTTAATTTTGGAATGGCAGTCCATGGGAAATGTTTGCTTAGGAATTCTGAACGGCACCGACGTTGGGCTTGAAAACTCGAATATCATTGGTGGTACGTTATTGCATGCATTGCATGTTATTATTTTCTCGGAATTGCAAATACAAATGCCTTCGAAACCATGCAAAATTATTCTTTTATTAGAGAGAAAAAAAGAGCCCCATTTTCTGTATGCTTTTGTTGACAGATATATCAATGCAAGATAAAATGGTAGTATACAACAACGAGAAGCAAGCAATTGGATGGGCTACTGCTAACTGTGATCGGGTTCCCAAGTCTCGAGTTGGTAGCATGTAA

mRNA sequence

ATGGGGAAAGGGGTATTGATGATATTGGTTCTGATGGTGGCCTCAATGAGCTGTTTGGCTCTGTGTTCAGCTTCTTCGTTCTTTAAGGATAAGCCATGGGAGAGAAGAAGGCCAATTCTGTCGGTTCCGACTGCATCTTCTTCGTTTGCTTCATCCTCCATTGTGTTGCCTCTTCAAGGGAACGTCTTTCCAAATGGGTTCTATAACGTTACTCTTTATGTAGGGCAGCCTCCAAAGCCTTATTTTCTAGATCCAGACACCGGTAGTGATCTCACTTGGCTTCAATGTGACGCTCCATGTCAGCAGTGCACCGAGACACTTCATCCGCTCTATCAACCAAGCAACGATCTTGTGCCATGTAAGGACCCTTTGTGTATGTCCTTGCACTCATCTATGGACCACAAATGTGAGAACCCAGGTCAATGTGACTACGAGGTTGAGTATGCAGATGGCGGTTCTTCTCTTGGAGTCCTTGTCAGGGATGTATTTCCTCTCAACTTAACCAATGGAGATCCAATTAGACCGCGTTTGGCCCTCGGATGTGGTTATGATCAAGATCCTGGATCATCATCTTATCACCCCATGGATGGAATACTTGGCCTTGGACGGGGAGCAGTAAGCATGGTCTCACAACTGCATAATCAAGGCATTGTCCGTAATGTCGTTGGTCACTGTTTCAGCAGCAAAGGAGGAGGATATCTTTTCTTTGGGGATGGCATTTATGATCCTTATCGCTTAGTTTGGACGCCCATGTCACGGGACTACCCGAAGCACTACTCCCCTGGGTTTGGAGAACTAATCTTCAATGGAAGAAGTACTGGACTCAGAAACCTGTTTGTAGTTTTTGACAGTGGGAGCTCTTACACATACTTCAATGCTCAGGCTTATCAAGTTTTAACATCTTTGTTAAATAGAGAACTAACTGGAAAACCGCTAAGAGAAGCCATGGACGACGATACGCTTCCGCTCTGTTGGAGAGGGCGGAAGCCATTCAAAAGCTTACGTGATGTGAGAAAATATTTCAAGCCATTGGCCTTGAGCTTTTCCAGTGGTGGAAGAACCAAAGCAGTGTTTGAAATACCAATGGAAGGTTATCTGATAATATCGTCCATGGGAAATGTTTGCTTAGGAATTCTGAACGGCACCGACGTTGGGCTTGAAAACTCGAATATCATTGGTGATATATCAATGCAAGATAAAATGGTAGTATACAACAACGAGAAGCAAGCAATTGGATGGGCTACTGCTAACTGTGATCGGGTTCCCAAGTCTCGAGTTGGTAGCATGTAA

Coding sequence (CDS)

ATGGGGAAAGGGGTATTGATGATATTGGTTCTGATGGTGGCCTCAATGAGCTGTTTGGCTCTGTGTTCAGCTTCTTCGTTCTTTAAGGATAAGCCATGGGAGAGAAGAAGGCCAATTCTGTCGGTTCCGACTGCATCTTCTTCGTTTGCTTCATCCTCCATTGTGTTGCCTCTTCAAGGGAACGTCTTTCCAAATGGGTTCTATAACGTTACTCTTTATGTAGGGCAGCCTCCAAAGCCTTATTTTCTAGATCCAGACACCGGTAGTGATCTCACTTGGCTTCAATGTGACGCTCCATGTCAGCAGTGCACCGAGACACTTCATCCGCTCTATCAACCAAGCAACGATCTTGTGCCATGTAAGGACCCTTTGTGTATGTCCTTGCACTCATCTATGGACCACAAATGTGAGAACCCAGGTCAATGTGACTACGAGGTTGAGTATGCAGATGGCGGTTCTTCTCTTGGAGTCCTTGTCAGGGATGTATTTCCTCTCAACTTAACCAATGGAGATCCAATTAGACCGCGTTTGGCCCTCGGATGTGGTTATGATCAAGATCCTGGATCATCATCTTATCACCCCATGGATGGAATACTTGGCCTTGGACGGGGAGCAGTAAGCATGGTCTCACAACTGCATAATCAAGGCATTGTCCGTAATGTCGTTGGTCACTGTTTCAGCAGCAAAGGAGGAGGATATCTTTTCTTTGGGGATGGCATTTATGATCCTTATCGCTTAGTTTGGACGCCCATGTCACGGGACTACCCGAAGCACTACTCCCCTGGGTTTGGAGAACTAATCTTCAATGGAAGAAGTACTGGACTCAGAAACCTGTTTGTAGTTTTTGACAGTGGGAGCTCTTACACATACTTCAATGCTCAGGCTTATCAAGTTTTAACATCTTTGTTAAATAGAGAACTAACTGGAAAACCGCTAAGAGAAGCCATGGACGACGATACGCTTCCGCTCTGTTGGAGAGGGCGGAAGCCATTCAAAAGCTTACGTGATGTGAGAAAATATTTCAAGCCATTGGCCTTGAGCTTTTCCAGTGGTGGAAGAACCAAAGCAGTGTTTGAAATACCAATGGAAGGTTATCTGATAATATCGTCCATGGGAAATGTTTGCTTAGGAATTCTGAACGGCACCGACGTTGGGCTTGAAAACTCGAATATCATTGGTGATATATCAATGCAAGATAAAATGGTAGTATACAACAACGAGAAGCAAGCAATTGGATGGGCTACTGCTAACTGTGATCGGGTTCCCAAGTCTCGAGTTGGTAGCATGTAA

Protein sequence

MGKGVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPTASSSFASSSIVLPLQGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRTKAVFEIPMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSRVGSM
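
The protein sequence is the conceptual translation of the coding sequence listed above it. The following is a small Python sketch of that translation, assuming the standard nuclear genetic code. The `translate` helper and the codon table construction are illustrative and not part of the database itself.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# with the third base varying fastest.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the Lsi10G007850 CDS.
print(translate("ATGGGGAAAGGGGTATTGATGATATTG"))  # MGKGVLMIL
```

The output matches the first nine residues of the protein sequence, and the final codons of the CDS (`AGC ATG TAA`) give the terminal `SM` followed by a stop.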
BLAST of Lsi10G007850 vs. Swiss-Prot
Match: ASP1_ORYSJ (Aspartic proteinase Asp1 OS=Oryza sativa subsp. japonica GN=ASP1 PE=2 SV=1)

HSP 1 Score: 339.7 bits (870), Expect = 4.5e-92
Identity = 179/386 (46.37%), Positives = 252/386 (65.28%), Query Frame = 1

Query: 51  SSSIVLPLQGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPL 110
           SS++VL L GNV+P G + +T+ +G P K YFLD DTGS LTWLQCDAPC  C    H L
Sbjct: 21  SSAVVLELHGNVYPIGHFFITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHVL 80

Query: 111 YQPS-NDLVPCKDPLCMSLHSSM--DHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNL 170
           Y+P+   LV C D LC  L++ +    +C +  QCDY ++Y D  SS+GVLV D F L+ 
Sbjct: 81  YKPTPKKLVTCADSLCTDLYTDLGKPKRCGSQKQCDYVIQYVD-SSSMGVLVIDRFSLSA 140

Query: 171 TNGDPIRPRLALGCGYDQDPGSSSYH-PMDGILGLGRGAVSMVSQLHNQGIV-RNVVGHC 230
           +NG      +A GCGYDQ   + +   P+D ILGL RG V+++SQL +QG++ ++V+GHC
Sbjct: 141 SNGTN-PTTIAFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHC 200

Query: 231 FSSKGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIF--NGRSTGLRNLFVVFD 290
            SSKGGG+LFFGD       + WTPM+R++ K+YSPG G L F  N ++     + V+FD
Sbjct: 201 ISSKGGGFLFFGDAQVPTSGVTWTPMNREH-KYYSPGHGTLHFDSNSKAISAAPMAVIFD 260

Query: 291 SGSSYTYFNAQAYQVLTSLLNRELTG--KPLREAMDDD-TLPLCWRGRKPFKSLRDVRKY 350
           SG++YTYF AQ YQ   S++   L    K L E  + D  L +CW+G+    ++ +V+K 
Sbjct: 261 SGATYTYFAAQPYQATLSVVKSTLNSECKFLTEVTEKDRALTVCWKGKDKIVTIDEVKKC 320

Query: 351 FKPLALSFSSGGRTKAVFEIPMEGYLIISSMGNVCLGILNGT--DVGLENSNIIGDISMQ 410
           F+ L+L F+ G + KA  EIP E YLIIS  G+VCLGIL+G+   + L  +N+IG I+M 
Sbjct: 321 FRSLSLEFADGDK-KATLEIPPEHYLIISQEGHVCLGILDGSKEHLSLAGTNLIGGITML 380

Query: 411 DKMVVYNNEKQAIGWATANCDRVPKS 425
           D+MV+Y++E+  +GW    CDR+P+S
Sbjct: 381 DQMVIYDSERSLLGWVNYQCDRIPRS 402
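
The percentages on the statistics line of each hit are the number of identical (or positively scoring) aligned positions divided by the alignment length. The following Python sketch recomputes them for the hit above. The `parse_hsp_stats` helper is hypothetical, and its regular expression accepts either spelling of the positives label that may appear in exported reports.

```python
import re


def parse_hsp_stats(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    pattern = r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)"
    match = re.search(pattern, line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


stats = "Identity = 179/386 (46.37%), Positives = 252/386 (65.28%), Query Frame = 1"
print(parse_hsp_stats(stats))  # (46.37, 65.28)
```

Because the denominator is the alignment length, gaps included, rather than the query length, hits with different coverage are not directly comparable on percent identity alone.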

BLAST of Lsi10G007850 vs. Swiss-Prot
Match: ASP1_ORYSI (Aspartic proteinase Asp1 OS=Oryza sativa subsp. indica GN=ASP1 PE=2 SV=2)

HSP 1 Score: 325.5 bits (833), Expect = 8.9e-88
Identity = 174/386 (45.08%), Positives = 247/386 (63.99%), Query Frame = 1

Query: 51  SSSIVLPLQGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPL 110
           SS++VL L GNV+P G + VT+ +G P KPYFLD DTGS LTWLQCD PC  C +  H L
Sbjct: 21  SSAVVLELHGNVYPIGHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGL 80

Query: 111 YQPS-NDLVPCKDPLCMSLHSSM--DHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNL 170
           Y+P     V C +  C  L++ +    KC    QC Y ++Y  GGSS+GVL+ D F L  
Sbjct: 81  YKPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV-GGSSIGVLIVDSFSLPA 140

Query: 171 TNGDPIRPRLALGCGYDQDPGSSSY-HPMDGILGLGRGAVSMVSQLHNQGIV-RNVVGHC 230
           +NG      +A GCGY+Q   + +   P++GILGLGRG V+++SQL +QG++ ++V+GHC
Sbjct: 141 SNGTN-PTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHC 200

Query: 231 FSSKGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGL--RNLFVVFD 290
            SSKG G+LFFGD       + W+PM+R++ KHYSP  G L FN  S  +    + V+FD
Sbjct: 201 ISSKGKGFLFFGDAKVPTSGVTWSPMNREH-KHYSPRQGTLQFNSNSKPISAAPMEVIFD 260

Query: 291 SGSSYTYFNAQAYQVLTSLLNRELTG--KPLREAMDDD-TLPLCWRGRKPFKSLRDVRKY 350
           SG++YTYF  Q Y    S++   L+   K L E  + D  L +CW+G+   +++ +V+K 
Sbjct: 261 SGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTIDEVKKC 320

Query: 351 FKPLALSFSSGGRTKAVFEIPMEGYLIISSMGNVCLGILNGT--DVGLENSNIIGDISMQ 410
           F+ L+L F+ G + KA  EIP E YLIIS  G+VCLGIL+G+     L  +N+IG I+M 
Sbjct: 321 FRSLSLKFADGDK-KATLEIPPEHYLIISQEGHVCLGILDGSKEHPSLAGTNLIGGITML 380

Query: 411 DKMVVYNNEKQAIGWATANCDRVPKS 425
           D+MV+Y++E+  +GW    CDR+P+S
Sbjct: 381 DQMVIYDSERSLLGWVNYQCDRIPRS 402

BLAST of Lsi10G007850 vs. Swiss-Prot
Match: APCB1_ARATH (Aspartyl protease APCB1 OS=Arabidopsis thaliana GN=APCB1 PE=1 SV=1)

HSP 1 Score: 301.6 bits (771), Expect = 1.4e-80
Identity = 170/396 (42.93%), Positives = 236/396 (59.60%), Query Frame = 1

Query: 44  TASSSFASSSIVLPLQGNVFPNGFYNVTLYVGQPP--KPYFLDPDTGSDLTWLQCDAPCQ 103
           T++ S  SS+ + P+ GNV+P+G Y   + VG+P   + Y LD DTGS+LTW+QCDAPC 
Sbjct: 179 TSAGSIDSSTTIFPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCT 238

Query: 104 QCTETLHPLYQPSND-LVPCKDPLCMSLH-SSMDHKCENPGQCDYEVEYADGGSSLGVLV 163
            C +  + LY+P  D LV   +  C+ +  + +   CEN  QCDYE+EYAD   S+GVL 
Sbjct: 239 SCAKGANQLYKPRKDNLVRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLT 298

Query: 164 RDVFPLNLTNGDPIRPRLALGCGYDQDPGS-SSYHPMDGILGLGRGAVSMVSQLHNQGIV 223
           +D F L L NG      +  GCGYDQ     ++    DGILGL R  +S+ SQL ++GI+
Sbjct: 299 KDKFHLKLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGII 358

Query: 224 RNVVGHCFSS--KGGGYLFFGDGIYDPYRLVWTPMSRD--------YPKHYSPGFGELIF 283
            NVVGHC +S   G GY+F G  +   + + W PM  D             S G G L  
Sbjct: 359 SNVVGHCLASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSL 418

Query: 284 NGRSTGLRNLFVVFDSGSSYTYFNAQAY-QVLTSLLNRELTGKPLREAMDDDTLPLCWRG 343
           +G +  +    V+FD+GSSYTYF  QAY Q++TSL  +E++G  L     D+TLP+CWR 
Sbjct: 419 DGENGRVGK--VLFDTGSSYTYFPNQAYSQLVTSL--QEVSGLELTRDDSDETLPICWRA 478

Query: 344 RK--PFKSLRDVRKYFKPLALSFSSGGRT-KAVFEIPMEGYLIISSMGNVCLGILNGTDV 403
           +   PF SL DV+K+F+P+ L   S          I  E YLIIS+ GNVCLGIL+G+ V
Sbjct: 479 KTNFPFSSLSDVKKFFRPITLQIGSKWLIISRKLLIQPEDYLIISNKGNVCLGILDGSSV 538

Query: 404 GLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 421
              ++ I+GDISM+  ++VY+N K+ IGW  ++C R
Sbjct: 539 HDGSTIILGDISMRGHLIVYDNVKRRIGWMKSDCVR 570

BLAST of Lsi10G007850 vs. Swiss-Prot
Match: ASPL2_ARATH (Aspartic proteinase-like protein 2 OS=Arabidopsis thaliana GN=At1g65240 PE=1 SV=2)

HSP 1 Score: 145.6 bits (366), Expect = 1.3e-33
Identity = 126/448 (28.12%), Positives = 195/448 (43.53%), Query Frame = 1

Query: 5   VLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPTASSSFASSSIVLPLQGN--V 64
           V+ + V+++   S   +  A   F  K  +      S  T   S   +SI LPL G+  V
Sbjct: 10  VVAVFVIVIEFASANFVFKAQHKFAGKK-KNLEHFKSHDTRRHSRMLASIDLPLGGDSRV 69

Query: 65  FPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPS-------- 124
              G Y   + +G PPK Y +  DTGSD+ W+ C  PC +C    +  ++ S        
Sbjct: 70  DSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINC-KPCPKCPTKTNLNFRLSLFDMNASS 129

Query: 125 -NDLVPCKDPLCMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNLTNGD-- 184
            +  V C D  C  +  S    C+    C Y + YAD  +S G  +RD+  L    GD  
Sbjct: 130 TSKKVGCDDDFCSFI--SQSDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLK 189

Query: 185 --PIRPRLALGCGYDQDPG-SSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSS 244
             P+   +  GCG DQ     +    +DG++G G+   S++SQL   G  + V  HC  +
Sbjct: 190 TGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDN 249

Query: 245 -KGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGL-----RNLFVVF 304
            KGGG   F  G+ D  ++  TPM  +   HY+     +  +G S  L     RN   + 
Sbjct: 250 VKGGG--IFAVGVVDSPKVKTTPMVPN-QMHYNVMLMGMDVDGTSLDLPRSIVRNGGTIV 309

Query: 305 DSGSSYTYFNAQAYQVLTSLLNRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFK 364
           DSG++  YF    Y    SL+   L  +P++  + ++T        + F    +V + F 
Sbjct: 310 DSGTTLAYFPKVLYD---SLIETILARQPVKLHIVEETF-------QCFSFSTNVDEAFP 369

Query: 365 PLALSFSSGGRTKAVFEIPMEGYLIISSMGNVCLGILNGTDVGLENSNII--GDISMQDK 424
           P++  F    +      +    YL        C G   G     E S +I  GD+ + +K
Sbjct: 370 PVSFEFEDSVK----LTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNK 429

Query: 425 MVVYNNEKQAIGWATANCDRVPKSRVGS 429
           +VVY+ + + IGWA  NC    K + GS
Sbjct: 430 LVVYDLDNEVIGWADHNCSSSIKIKDGS 436

BLAST of Lsi10G007850 vs. Swiss-Prot
Match: APF1_ARATH (Aspartyl protease family protein 1 OS=Arabidopsis thaliana GN=APF1 PE=1 SV=1)

HSP 1 Score: 124.0 bits (310), Expect = 3.9e-27
Identity = 115/377 (30.50%), Positives = 165/377 (43.77%), Query Frame = 1

Query: 67  FYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHP---------LYQPS--- 126
           + NVT  VG P   + +  DTGSDL WL CD  C  C   L           +Y P+   
Sbjct: 105 YANVT--VGTPSDWFMVALDTGSDLFWLPCD--CTNCVRELKAPGGSSLDLNIYSPNASS 164

Query: 127 -NDLVPCKDPLCMSLHSSMDHKCENP-GQCDYEVEY-ADGGSSLGVLVRDVFPL--NLTN 186
            +  VPC   LC     +   +C +P   C Y++ Y ++G SS GVLV DV  L  N  +
Sbjct: 165 TSTKVPCNSTLC-----TRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKS 224

Query: 187 GDPIRPRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSK 246
              I  R+  GCG  Q          +G+ GLG   +S+ S L  +GI  N    CF + 
Sbjct: 225 SKAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGND 284

Query: 247 GGGYLFFGD-GIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSY 306
           G G + FGD G  D      TP++   P          I  G +TG      VFDSG+S+
Sbjct: 285 GAGRISFGDKGSVDQRE---TPLNIRQPHPTYNITVTKISVGGNTGDLEFDAVFDSGTSF 344

Query: 307 TYFNAQAYQVLTSLLNRELTGKPLREAMDDDTLPL--CWRGRKPFKSLRDVRKYFKPLAL 366
           TY    AY +++   N     K  R    D  LP   C+       +L   +  F+  A+
Sbjct: 345 TYLTDAAYTLISESFNSLALDK--RYQTTDSELPFEYCY-------ALSPNKDSFQYPAV 404

Query: 367 SFS-SGGRTKAVFE----IPMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKM 419
           + +  GG +  V+     IPM+   +       CL I+      +E+ +IIG   M    
Sbjct: 405 NLTMKGGSSYPVYHPLVVIPMKDTDV------YCLAIMK-----IEDISIIGQNFMTGYR 449

BLAST of Lsi10G007850 vs. TrEMBL
Match: A0A0A0LKB0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G338820 PE=3 SV=1)

HSP 1 Score: 729.2 bits (1881), Expect = 3.0e-207
Identity = 341/352 (96.88%), Positives = 347/352 (98.58%), Query Frame = 1

Query: 77  PPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHKC 136
           PPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDH+C
Sbjct: 35  PPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHRC 94

Query: 137 ENPGQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQDPGSSSYHPMD 196
           ENP QCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQDPGSSSYHPMD
Sbjct: 95  ENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQDPGSSSYHPMD 154

Query: 197 GILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGIYDPYRLVWTPMSRDYP 256
           GILGLGRGAVS+VSQLHNQGIVRNVVGHCF+SKGGGYLFFGDGIYDPYRLVWTPMSRDYP
Sbjct: 155 GILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGIYDPYRLVWTPMSRDYP 214

Query: 257 KHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELTGKPLREAM 316
           KHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNREL GKPLREAM
Sbjct: 215 KHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAM 274

Query: 317 DDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRTKAVFEIPMEGYLIISSMGNVCL 376
           DDDTLPLCWRGRKP KSLRDVRKYFKPLALSFSSGGR+KAVFEIP EGY+IISSMGNVCL
Sbjct: 275 DDDTLPLCWRGRKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEIPTEGYMIISSMGNVCL 334

Query: 377 GILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSRVGS 429
           GILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKS+V S
Sbjct: 335 GILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSQVSS 386

BLAST of Lsi10G007850 vs. TrEMBL
Match: M5WHY2_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005961mg PE=3 SV=1)

HSP 1 Score: 603.6 bits (1555), Expect = 1.9e-169
Identity = 287/427 (67.21%), Positives = 339/427 (79.39%), Query Frame = 1

Query: 2   GKGVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPTASS---SFASSSIVLPL 61
           GK   ++L++ +  M   A  S++SF       RR+ +L     SS   + A+SSIVLP+
Sbjct: 5   GKSGWLLLLMSLLVMGLSATMSSASFGDQYHRGRRKTMLPDEATSSLGLNRAASSIVLPV 64

Query: 62  QGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLV 121
            GNV+P G YNVTL +GQPPKPYFLDPDTGSDLTWLQCDAPC +CTE  HP Y+P+NDLV
Sbjct: 65  HGNVYPIGSYNVTLNIGQPPKPYFLDPDTGSDLTWLQCDAPCVRCTEAPHPFYRPNNDLV 124

Query: 122 PCKDPLCMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLA 181
            CKDPLC +LH+   HKC+NP QCDYEVEYADGGSSLGVLVRD F LN TNG+     LA
Sbjct: 125 VCKDPLCEALHAPGSHKCDNPEQCDYEVEYADGGSSLGVLVRDAFLLNFTNGNQRTTHLA 184

Query: 182 LGCGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGD 241
           LGCGYDQ PGS SYHP+DG+LGLG+G  S+VSQL NQG+VR+V+GHC S +GGG+ F GD
Sbjct: 185 LGCGYDQLPGS-SYHPIDGVLGLGKGKSSIVSQLSNQGLVRHVIGHCLSGRGGGFFFLGD 244

Query: 242 GIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQV 301
           G+YD  R+VWTPMS DY KHYSPG  ELI  G+STG RNL +VFDSGSSYTY N+QAYQ 
Sbjct: 245 GLYDSSRIVWTPMSPDYAKHYSPGLAELIVGGKSTGFRNLVMVFDSGSSYTYLNSQAYQF 304

Query: 302 LTSLLNRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRTKAVF 361
           LTS L RELTGKPL+EA+DD TLPLCW+GRKPF+++RDV+ YFKPLAL F+SG +    F
Sbjct: 305 LTSWLKRELTGKPLKEALDDRTLPLCWKGRKPFRNIRDVKTYFKPLALRFASGRKDTTQF 364

Query: 362 EIPMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANC 421
           E+P E YLIISS GNVCLGILNG++VGL+NSNIIGDISMQDKMV+Y+NEKQ IGW   NC
Sbjct: 365 ELPPEAYLIISSKGNVCLGILNGSEVGLQNSNIIGDISMQDKMVIYDNEKQMIGWGPGNC 424

Query: 422 DRVPKSR 426
           D++PKSR
Sbjct: 425 DKLPKSR 430

BLAST of Lsi10G007850 vs. TrEMBL
Match: A0A061DK09_THECC (Eukaryotic aspartyl protease family protein isoform 1 OS=Theobroma cacao GN=TCM_001596 PE=3 SV=1)

HSP 1 Score: 600.5 bits (1547), Expect = 1.6e-168
Identity = 285/432 (65.97%), Positives = 344/432 (79.63%), Query Frame = 1

Query: 1   MGKGVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPTASSSFAS---SSIVLP 60
           MGKG + +L+L++      + CSAS    D+ W  R+ ++S    SS   +   SSI+ P
Sbjct: 1   MGKGRMSVLLLLLF----FSFCSAS----DQKW--RKAMISTDKGSSMMMNRVGSSILFP 60

Query: 61  LQGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDL 120
           + GNV+P G+YNVT+ +GQPPKPYFLD DTGSDLTWLQCDAPC  C E  HPLY+P+NDL
Sbjct: 61  IHGNVYPTGYYNVTISIGQPPKPYFLDLDTGSDLTWLQCDAPCVHCVEAPHPLYRPTNDL 120

Query: 121 VPCKDPLCMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRL 180
           VPCKDPLC +LH   D+KCENP QCDYEVEYADGGSSLGVLVRDVF LN TNG  + PRL
Sbjct: 121 VPCKDPLCAALHPPGDYKCENPEQCDYEVEYADGGSSLGVLVRDVFSLNYTNGIRLSPRL 180

Query: 181 ALGCGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFG 240
           ALGCGYDQ PGS SYHP+DGILGLGRG  S+VSQL +QG+VRNVVGHC S +GGG+LFFG
Sbjct: 181 ALGCGYDQIPGS-SYHPLDGILGLGRGKASIVSQLQSQGLVRNVVGHCLSGRGGGFLFFG 240

Query: 241 DGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQ 300
           DG+YD  R+ WT MS++  K+YSPG  EL F G++T ++NL VVFDSGSSYTY N+QAYQ
Sbjct: 241 DGLYDSSRVTWTSMSQELTKYYSPGIAELQFGGKATSVKNLIVVFDSGSSYTYLNSQAYQ 300

Query: 301 VLTSLLNRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRTKAV 360
            LT LL +EL+G+ L+EA +D TLPLCW+GRKPFK++RDV+KYFK LAL+F+S  RTK  
Sbjct: 301 TLTVLLKKELSGRSLKEAPEDQTLPLCWKGRKPFKNVRDVKKYFKTLALAFASSSRTKTQ 360

Query: 361 FEIPMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATAN 420
           FE+P E YLIIS+ GNVCLGILNGT VGL+N N+IGDISMQD+MV+Y+NEKQ IGWA AN
Sbjct: 361 FELPPEAYLIISNKGNVCLGILNGTQVGLQNLNVIGDISMQDRMVIYDNEKQVIGWAPAN 420

Query: 421 CDRVPKSRVGSM 430
           CD++P+S  G M
Sbjct: 421 CDQLPRSTTGYM 421

BLAST of Lsi10G007850 vs. TrEMBL
Match: A0A165Z0G7_DAUCA (Uncharacterized protein OS=Daucus carota subsp. sativus GN=DCAR_013175 PE=4 SV=1)

HSP 1 Score: 596.3 bits (1536), Expect = 3.0e-167
Identity = 274/388 (70.62%), Positives = 327/388 (84.28%), Query Frame = 1

Query: 45  ASSSFASS---SIVLPLQGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQ 104
           ASSS  SS   S+VLPL GNV+P+G+Y+V   +GQPPKPYFLDPDTGSDLTWLQCDAPC 
Sbjct: 41  ASSSVVSSVGSSVVLPLYGNVYPSGYYHVQFNIGQPPKPYFLDPDTGSDLTWLQCDAPCI 100

Query: 105 QCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVRD 164
           QCT   HPLYQP+NDLV CKDP+C SLH   +++C++P QCDYEVEYADGGSS+GVLV D
Sbjct: 101 QCTPAPHPLYQPTNDLVVCKDPICASLHPD-NYRCDDPDQCDYEVEYADGGSSIGVLVND 160

Query: 165 VFPLNLTNGDPIRPRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNV 224
           +FP+NLT+G   RPRL +GCGYDQ PG  +YHP+DG+LGLGRG+ S+V+QL +QG+VRNV
Sbjct: 161 LFPVNLTSGMRARPRLTIGCGYDQLPGI-AYHPLDGVLGLGRGSSSIVAQLSSQGLVRNV 220

Query: 225 VGHCFSSKGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVV 284
           VGHCFS +GGGYLFFGD IYD  +++WTPMSRDY KHY+PGF ELI NGRS+GL+NL VV
Sbjct: 221 VGHCFSRRGGGYLFFGDDIYDSSKVIWTPMSRDYLKHYTPGFAELILNGRSSGLKNLLVV 280

Query: 285 FDSGSSYTYFNAQAYQVLTSLLNRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYF 344
           FDSGSSYTYFN Q YQ L S + ++L GKPL+EA++DDTLP+CWRG+KPFKS+RD +KYF
Sbjct: 281 FDSGSSYTYFNTQTYQTLLSFIKKDLHGKPLKEAVEDDTLPVCWRGKKPFKSIRDAKKYF 340

Query: 345 KPLALSFSSGGRTKAVFEIPMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKM 404
           KPLALSF SG +TK+ FEI  E YLIISS G+VCLGILNGT+VGL+N NIIGDISMQ+K+
Sbjct: 341 KPLALSFGSGWKTKSQFEIQQESYLIISSKGSVCLGILNGTEVGLQNYNIIGDISMQEKL 400

Query: 405 VVYNNEKQAIGWATANCDRVPKSRVGSM 430
           V+Y+NEKQ IGW  +NCDR PK    SM
Sbjct: 401 VIYDNEKQVIGWQPSNCDRPPKGDTFSM 426

BLAST of Lsi10G007850 vs. TrEMBL
Match: Q5NT86_DAUCA (Nucellin-like protein OS=Daucus carota GN=DcNLP PE=3 SV=1)

HSP 1 Score: 596.3 bits (1536), Expect = 3.0e-167
Identity = 274/388 (70.62%), Positives = 327/388 (84.28%), Query Frame = 1

Query: 45  ASSSFASS---SIVLPLQGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQ 104
           ASSS  SS   S+VLPL GNV+P+G+Y+V   +GQPPKPYFLDPDTGSDLTWLQCDAPC 
Sbjct: 41  ASSSVVSSVGSSVVLPLYGNVYPSGYYHVQFNIGQPPKPYFLDPDTGSDLTWLQCDAPCI 100

Query: 105 QCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVRD 164
           QCT   HPLYQP+NDLV CKDP+C SLH   +++C++P QCDYEVEYADGGSS+GVLV D
Sbjct: 101 QCTPAPHPLYQPTNDLVVCKDPICASLHPD-NYRCDDPDQCDYEVEYADGGSSIGVLVND 160

Query: 165 VFPLNLTNGDPIRPRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNV 224
           +FP+NLT+G   RPRL +GCGYDQ PG  +YHP+DG+LGLGRG+ S+V+QL +QG+VRNV
Sbjct: 161 LFPVNLTSGMRARPRLTIGCGYDQLPGI-AYHPLDGVLGLGRGSSSIVAQLSSQGLVRNV 220

Query: 225 VGHCFSSKGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVV 284
           VGHCFS +GGGYLFFGD IYD  +++WTPMSRDY KHY+PGF ELI NGRS+GL+NL VV
Sbjct: 221 VGHCFSRRGGGYLFFGDDIYDSSKVIWTPMSRDYLKHYTPGFAELILNGRSSGLKNLLVV 280

Query: 285 FDSGSSYTYFNAQAYQVLTSLLNRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYF 344
           FDSGSSYTYFN Q YQ L S + ++L GKPL+EA++DDTLP+CWRG+KPFKS+RD +KYF
Sbjct: 281 FDSGSSYTYFNTQTYQTLLSFIKKDLHGKPLKEAVEDDTLPVCWRGKKPFKSIRDAKKYF 340

Query: 345 KPLALSFSSGGRTKAVFEIPMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKM 404
           KPLALSF SG +TK+ FEI  E YLIISS G+VCLGILNGT+VGL+N NIIGDISMQ+K+
Sbjct: 341 KPLALSFGSGWKTKSQFEIQQESYLIISSKGSVCLGILNGTEVGLQNYNIIGDISMQEKL 400

Query: 405 VVYNNEKQAIGWATANCDRVPKSRVGSM 430
           V+Y+NEKQ IGW  +NCDR PK    SM
Sbjct: 401 VIYDNEKQVIGWQPSNCDRPPKGDTFSM 426

BLAST of Lsi10G007850 vs. TAIR10
Match: AT4G33490.2 (AT4G33490.2 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 573.9 bits (1478), Expect = 8.1e-164
Identity = 270/418 (64.59%), Positives = 330/418 (78.95%), Query Frame = 1

Query: 5   VLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPTASSSFASSSIVLPLQGNVFP 64
           V  ++VLMV S+  L   SA  F     W +        T     A SS+V P+ GNV+P
Sbjct: 6   VRFMIVLMVMSL-VLGFSSAVDF----RWRKTAGFSDRFTR----AVSSVVFPVHGNVYP 65

Query: 65  NGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPL 124
            G+YNVT+ +GQPP+PY+LD DTGSDLTWLQCDAPC +C E  HPLYQPS+DL+PC DPL
Sbjct: 66  LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCNDPL 125

Query: 125 CMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYD 184
           C +LH + + +CE P QCDYEVEYADGGSSLGVLVRDVF +N T G  + PRLALGCGYD
Sbjct: 126 CKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCGYD 185

Query: 185 QDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGIYDPY 244
           Q PG+SS+HP+DG+LGLGRG VS++SQLH+QG V+NV+GHC SS GGG LFFGD +YD  
Sbjct: 186 QIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYDSS 245

Query: 245 RLVWTPMSRDYPKHYSPGF-GELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLL 304
           R+ WTPMSR+Y KHYSP   GEL+F GR+TGL+NL  VFDSGSSYTYFN++AYQ +T LL
Sbjct: 246 RVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLL 305

Query: 305 NRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRTKAVFEIPME 364
            REL+GKPL+EA DD TLPLCW+GR+PF S+ +V+KYFKPLALSF +G R+K +FEIP E
Sbjct: 306 KRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTLFEIPPE 365

Query: 365 GYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRV 422
            YLIIS  GNVCLGILNGT++GL+N N+IGDISMQD+M++Y+NEKQ+IGW   +CD +
Sbjct: 366 AYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQMIIYDNEKQSIGWMPVDCDEL 414

BLAST of Lsi10G007850 vs. TAIR10
Match: AT1G44130.1 (AT1G44130.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 430.6 bits (1106), Expect = 1.1e-120
Identity = 200/396 (50.51%), Positives = 283/396 (71.46%), Query Frame = 1

Query: 39  ILSVPTASSSF-------ASSSIVLPLQGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDL 98
           ++ VP + SS        + SS+V PL GNVFP G+Y+V + +G PPK +  D DTGSDL
Sbjct: 13  LVIVPLSKSSIFKTFIKSSPSSVVFPLSGNVFPLGYYSVLMQIGSPPKAFQFDIDTGSDL 72

Query: 99  TWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHKCENPG-QCDYEVEYAD 158
           TW+QCDAPC  CT   +  Y+P  +++PC +P+C +LH      C NP  QCDYEV+YAD
Sbjct: 73  TWVQCDAPCSGCTLPPNLQYKPKGNIIPCSNPICTALHWPNKPHCPNPQEQCDYEVKYAD 132

Query: 159 GGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQDPGSSSYHPMD-GILGLGRGAVSMV 218
            GSS+G LV D FPL L NG  ++P +A GCGYDQ   S+   P   G+LGLGRG + ++
Sbjct: 133 QGSSMGALVTDQFPLKLVNGSFMQPPVAFGCGYDQSYPSAHPPPATAGVLGLGRGKIGLL 192

Query: 219 SQLHNQGIVRNVVGHCFSSKGGGYLFFGDGIYDPYRLVWTPM-SRDYPKHYSPGFGELIF 278
           +QL + G+ RNVVGHC SSKGGG+LFFGD +     + WTP+ S+D   HY+ G  +L+F
Sbjct: 193 TQLVSAGLTRNVVGHCLSSKGGGFLFFGDNLVPSIGVAWTPLLSQD--NHYTTGPADLLF 252

Query: 279 NGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELTGKPLREAMDDDTLPLCWRGR 338
           NG+ TGL+ L ++FD+GSSYTYFN++AYQ + +L+  +L   PL+ A +D TLP+CW+G 
Sbjct: 253 NGKPTGLKGLKLIFDTGSSYTYFNSKAYQTIINLIGNDLKVSPLKVAKEDKTLPICWKGA 312

Query: 339 KPFKSLRDVRKYFKPLALSFSSGGRTKAVFEIPMEGYLIISSMGNVCLGILNGTDVGLEN 398
           KPFKS+ +V+ +FK + ++F++G R   ++  P E YLI+S  GNVCLG+LNG++VGL+N
Sbjct: 313 KPFKSVLEVKNFFKTITINFTNGRRNTQLYLAP-ELYLIVSKTGNVCLGLLNGSEVGLQN 372

Query: 399 SNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKS 425
           SN+IGDISMQ  M++Y+NEKQ +GW +++C+++PK+
Sbjct: 373 SNVIGDISMQGLMMIYDNEKQQLGWVSSDCNKLPKT 405

BLAST of Lsi10G007850 vs. TAIR10
Match: AT1G77480.1 (AT1G77480.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 404.4 bits (1038), Expect = 8.5e-113
Identity = 191/375 (50.93%), Positives = 257/375 (68.53%), Query Frame = 1

Query: 51  SSSIVLPLQGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPL 110
           SS++V P+ GNV+P G+Y V L +G PPK + LD DTGSDLTW+QCDAPC  CT+     
Sbjct: 50  SSTVVFPVSGNVYPLGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQ 109

Query: 111 YQPSNDLVPCKDPLCMSLHSSMDHKCENP-GQCDYEVEYADGGSSLGVLVRDVFPLNLTN 170
           Y+P+++ +PC   LC  L    D  C +P  QCDYE+ Y+D  SS+G LV D  PL L N
Sbjct: 110 YKPNHNTLPCSHILCSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLAN 169

Query: 171 GDPIRPRLALGCGYDQ-DPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSS 230
           G  +  RL  GCGYDQ +PG     P  GILGLGRG V + +QL + GI +NV+ HC S 
Sbjct: 170 GSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSH 229

Query: 231 KGGGYLFFGDGIYDPYRLVWTPMSRDYP-KHYSPGFGELIFNGRSTGLRNLFVVFDSGSS 290
            G G+L  GD +     + WT ++ + P K+Y  G  EL+FN ++TG++ + VVFDSGSS
Sbjct: 230 TGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSS 289

Query: 291 YTYFNAQAYQVLTSLLNRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALS 350
           YTYFNA+AYQ +  L+ ++L GKPL +  DD +LP+CW+G+KP KSL +V+KYFK + L 
Sbjct: 290 YTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLR 349

Query: 351 FSSGGRTKAVFEIPMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNE 410
           F +  +   +F++P E YLII+  G VCLGILNGT++GLE  NIIGDIS Q  MV+Y+NE
Sbjct: 350 FGN-QKNGQLFQVPPESYLIITEKGRVCLGILNGTEIGLEGYNIIGDISFQGIMVIYDNE 409

Query: 411 KQAIGWATANCDRVP 423
           KQ IGW +++CD++P
Sbjct: 410 KQRIGWISSDCDKLP 423

BLAST of Lsi10G007850 vs. TAIR10
Match: AT1G49050.1 (AT1G49050.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 301.6 bits (771), Expect = 7.7e-82
Identity = 170/396 (42.93%), Positives = 236/396 (59.60%), Query Frame = 1

Query: 44  TASSSFASSSIVLPLQGNVFPNGFYNVTLYVGQPP--KPYFLDPDTGSDLTWLQCDAPCQ 103
           T++ S  SS+ + P+ GNV+P+G Y   + VG+P   + Y LD DTGS+LTW+QCDAPC 
Sbjct: 179 TSAGSIDSSTTIFPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCT 238

Query: 104 QCTETLHPLYQPSND-LVPCKDPLCMSLH-SSMDHKCENPGQCDYEVEYADGGSSLGVLV 163
            C +  + LY+P  D LV   +  C+ +  + +   CEN  QCDYE+EYAD   S+GVL 
Sbjct: 239 SCAKGANQLYKPRKDNLVRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLT 298

Query: 164 RDVFPLNLTNGDPIRPRLALGCGYDQDPGS-SSYHPMDGILGLGRGAVSMVSQLHNQGIV 223
           +D F L L NG      +  GCGYDQ     ++    DGILGL R  +S+ SQL ++GI+
Sbjct: 299 KDKFHLKLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGII 358

Query: 224 RNVVGHCFSS--KGGGYLFFGDGIYDPYRLVWTPMSRD--------YPKHYSPGFGELIF 283
            NVVGHC +S   G GY+F G  +   + + W PM  D             S G G L  
Sbjct: 359 SNVVGHCLASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSL 418

Query: 284 NGRSTGLRNLFVVFDSGSSYTYFNAQAY-QVLTSLLNRELTGKPLREAMDDDTLPLCWRG 343
           +G +  +    V+FD+GSSYTYF  QAY Q++TSL  +E++G  L     D+TLP+CWR 
Sbjct: 419 DGENGRVGK--VLFDTGSSYTYFPNQAYSQLVTSL--QEVSGLELTRDDSDETLPICWRA 478

Query: 344 RK--PFKSLRDVRKYFKPLALSFSSGGRT-KAVFEIPMEGYLIISSMGNVCLGILNGTDV 403
           +   PF SL DV+K+F+P+ L   S          I  E YLIIS+ GNVCLGIL+G+ V
Sbjct: 479 KTNFPFSSLSDVKKFFRPITLQIGSKWLIISRKLLIQPEDYLIISNKGNVCLGILDGSSV 538

Query: 404 GLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 421
              ++ I+GDISM+  ++VY+N K+ IGW  ++C R
Sbjct: 539 HDGSTIILGDISMRGHLIVYDNVKRRIGWMKSDCVR 570

BLAST of Lsi10G007850 vs. TAIR10
Match: AT1G65240.1 (AT1G65240.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 145.6 bits (366), Expect = 7.1e-35
Identity = 126/448 (28.12%), Positives = 195/448 (43.53%), Query Frame = 1

Query: 5   VLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPTASSSFASSSIVLPLQGN--V 64
           V+ + V+++   S   +  A   F  K  +      S  T   S   +SI LPL G+  V
Sbjct: 10  VVAVFVIVIEFASANFVFKAQHKFAGKK-KNLEHFKSHDTRRHSRMLASIDLPLGGDSRV 69

Query: 65  FPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPS-------- 124
              G Y   + +G PPK Y +  DTGSD+ W+ C  PC +C    +  ++ S        
Sbjct: 70  DSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINC-KPCPKCPTKTNLNFRLSLFDMNASS 129

Query: 125 -NDLVPCKDPLCMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNLTNGD-- 184
            +  V C D  C  +  S    C+    C Y + YAD  +S G  +RD+  L    GD  
Sbjct: 130 TSKKVGCDDDFCSFI--SQSDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLK 189

Query: 185 --PIRPRLALGCGYDQDPG-SSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSS 244
             P+   +  GCG DQ     +    +DG++G G+   S++SQL   G  + V  HC  +
Sbjct: 190 TGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDN 249

Query: 245 -KGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGL-----RNLFVVF 304
            KGGG   F  G+ D  ++  TPM  +   HY+     +  +G S  L     RN   + 
Sbjct: 250 VKGGG--IFAVGVVDSPKVKTTPMVPN-QMHYNVMLMGMDVDGTSLDLPRSIVRNGGTIV 309

Query: 305 DSGSSYTYFNAQAYQVLTSLLNRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFK 364
           DSG++  YF    Y    SL+   L  +P++  + ++T        + F    +V + F 
Sbjct: 310 DSGTTLAYFPKVLYD---SLIETILARQPVKLHIVEETF-------QCFSFSTNVDEAFP 369

Query: 365 PLALSFSSGGRTKAVFEIPMEGYLIISSMGNVCLGILNGTDVGLENSNII--GDISMQDK 424
           P++  F    +      +    YL        C G   G     E S +I  GD+ + +K
Sbjct: 370 PVSFEFEDSVK----LTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNK 429

Query: 425 MVVYNNEKQAIGWATANCDRVPKSRVGS 429
           +VVY+ + + IGWA  NC    K + GS
Sbjct: 430 LVVYDLDNEVIGWADHNCSSSIKIKDGS 436

BLAST of Lsi10G007850 vs. NCBI nr
Match: gi|778670347|ref|XP_004147327.2| (PREDICTED: aspartic proteinase Asp1 isoform X2 [Cucumis sativus])

HSP 1 Score: 864.4 bits (2232), Expect = 8.5e-248
Identity = 411/428 (96.03%), Positives = 421/428 (98.36%), Query Frame = 1

Query: 1   MGKGVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPTASSSFASSSIVLPLQG 60
           MGK VL++LVLMVASMSCLA CSASSFFKDKPWER+RPILSVPTASSSFASSSIVLPLQG
Sbjct: 1   MGKRVLVVLVLMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQG 60

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120

Query: 121 KDPLCMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSMDH+CENP QCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180

Query: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGI 240
           CGYDQDPGSSSYHPMDGILGLGRGAVS+VSQLHNQGIVRNVVGHCF+SKGGGYLFFGDGI
Sbjct: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGI 240

Query: 241 YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300
           YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT
Sbjct: 241 YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300

Query: 301 SLLNRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRTKAVFEI 360
           SLLNREL GKPLREAMDDDTLPLCWRGRKP KSLRDVRKYFKPLALSFSSGGR+KAVFEI
Sbjct: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360

Query: 361 PMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420
           P EGY+IISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR
Sbjct: 361 PTEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420

Query: 421 VPKSRVGS 429
           VPKS+V S
Sbjct: 421 VPKSQVSS 428

BLAST of Lsi10G007850 vs. NCBI nr
Match: gi|659121807|ref|XP_008460823.1| (PREDICTED: aspartic proteinase Asp1 isoform X2 [Cucumis melo])

HSP 1 Score: 859.4 bits (2219), Expect = 2.7e-246
Identity = 408/428 (95.33%), Positives = 420/428 (98.13%), Query Frame = 1

Query: 1   MGKGVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPTASSSFASSSIVLPLQG 60
           MGK VL++L LMVASMSCLA CSASSFFKDKPWER+RPILSVPTASSSFASSSIVLPLQG
Sbjct: 1   MGKWVLVVLALMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQG 60

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120

Query: 121 KDPLCMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSMDH+CENP QCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180

Query: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGI 240
           CGYDQDPGSSSYHPMDGILGLGRGAVS+VSQLHNQGIVRNVVGHCF+SKGGGYLFFGDGI
Sbjct: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGI 240

Query: 241 YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300
           YDPYRLVWTPMSRDYPKHYSPGFGEL+FNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT
Sbjct: 241 YDPYRLVWTPMSRDYPKHYSPGFGELMFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300

Query: 301 SLLNRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRTKAVFEI 360
           SLLNREL GKPLREAMDDDTLPLCWR RKP KSLRDVRKYFKPLALSFSSGGR+KAVFEI
Sbjct: 301 SLLNRELAGKPLREAMDDDTLPLCWRERKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360

Query: 361 PMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420
           P+EGY+IISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR
Sbjct: 361 PIEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420

Query: 421 VPKSRVGS 429
           VPKS+V S
Sbjct: 421 VPKSQVSS 428

BLAST of Lsi10G007850 vs. NCBI nr
Match: gi|778670345|ref|XP_011649449.1| (PREDICTED: aspartic proteinase Asp1 isoform X1 [Cucumis sativus])

HSP 1 Score: 858.6 bits (2217), Expect = 4.7e-246
Identity = 411/432 (95.14%), Positives = 421/432 (97.45%), Query Frame = 1

Query: 1   MGKGVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPTASSSFASSSIVLPLQG 60
           MGK VL++LVLMVASMSCLA CSASSFFKDKPWER+RPILSVPTASSSFASSSIVLPLQG
Sbjct: 1   MGKRVLVVLVLMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQG 60

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120

Query: 121 KDPLCMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSMDH+CENP QCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180

Query: 181 ----CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFF 240
               CGYDQDPGSSSYHPMDGILGLGRGAVS+VSQLHNQGIVRNVVGHCF+SKGGGYLFF
Sbjct: 181 CQLICGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFF 240

Query: 241 GDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAY 300
           GDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAY
Sbjct: 241 GDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAY 300

Query: 301 QVLTSLLNRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRTKA 360
           QVLTSLLNREL GKPLREAMDDDTLPLCWRGRKP KSLRDVRKYFKPLALSFSSGGR+KA
Sbjct: 301 QVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPIKSLRDVRKYFKPLALSFSSGGRSKA 360

Query: 361 VFEIPMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATA 420
           VFEIP EGY+IISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATA
Sbjct: 361 VFEIPTEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATA 420

Query: 421 NCDRVPKSRVGS 429
           NCDRVPKS+V S
Sbjct: 421 NCDRVPKSQVSS 432

BLAST of Lsi10G007850 vs. NCBI nr
Match: gi|659121805|ref|XP_008460822.1| (PREDICTED: aspartic proteinase Asp1 isoform X1 [Cucumis melo])

HSP 1 Score: 853.6 bits (2204), Expect = 1.5e-244
Identity = 408/432 (94.44%), Positives = 420/432 (97.22%), Query Frame = 1

Query: 1   MGKGVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPTASSSFASSSIVLPLQG 60
           MGK VL++L LMVASMSCLA CSASSFFKDKPWER+RPILSVPTASSSFASSSIVLPLQG
Sbjct: 1   MGKWVLVVLALMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQG 60

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120

Query: 121 KDPLCMSLHSSMDHKCENPGQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSMDH+CENP QCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180

Query: 181 ----CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFF 240
               CGYDQDPGSSSYHPMDGILGLGRGAVS+VSQLHNQGIVRNVVGHCF+SKGGGYLFF
Sbjct: 181 CQLICGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFF 240

Query: 241 GDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAY 300
           GDGIYDPYRLVWTPMSRDYPKHYSPGFGEL+FNGRSTGLRNLFVVFDSGSSYTYFNAQAY
Sbjct: 241 GDGIYDPYRLVWTPMSRDYPKHYSPGFGELMFNGRSTGLRNLFVVFDSGSSYTYFNAQAY 300

Query: 301 QVLTSLLNRELTGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRTKA 360
           QVLTSLLNREL GKPLREAMDDDTLPLCWR RKP KSLRDVRKYFKPLALSFSSGGR+KA
Sbjct: 301 QVLTSLLNRELAGKPLREAMDDDTLPLCWRERKPIKSLRDVRKYFKPLALSFSSGGRSKA 360

Query: 361 VFEIPMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATA 420
           VFEIP+EGY+IISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATA
Sbjct: 361 VFEIPIEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATA 420

Query: 421 NCDRVPKSRVGS 429
           NCDRVPKS+V S
Sbjct: 421 NCDRVPKSQVSS 432

BLAST of Lsi10G007850 vs. NCBI nr
Match: gi|700207119|gb|KGN62238.1| (hypothetical protein Csa_2G338820 [Cucumis sativus])

HSP 1 Score: 729.2 bits (1881), Expect = 4.3e-207
Identity = 341/352 (96.88%), Positives = 347/352 (98.58%), Query Frame = 1

Query: 77  PPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHKC 136
           PPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDH+C
Sbjct: 35  PPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHRC 94

Query: 137 ENPGQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQDPGSSSYHPMD 196
           ENP QCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQDPGSSSYHPMD
Sbjct: 95  ENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQDPGSSSYHPMD 154

Query: 197 GILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGIYDPYRLVWTPMSRDYP 256
           GILGLGRGAVS+VSQLHNQGIVRNVVGHCF+SKGGGYLFFGDGIYDPYRLVWTPMSRDYP
Sbjct: 155 GILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGIYDPYRLVWTPMSRDYP 214

Query: 257 KHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELTGKPLREAM 316
           KHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNREL GKPLREAM
Sbjct: 215 KHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAM 274

Query: 317 DDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRTKAVFEIPMEGYLIISSMGNVCL 376
           DDDTLPLCWRGRKP KSLRDVRKYFKPLALSFSSGGR+KAVFEIP EGY+IISSMGNVCL
Sbjct: 275 DDDTLPLCWRGRKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEIPTEGYMIISSMGNVCL 334

Query: 377 GILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSRVGS 429
           GILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKS+V S
Sbjct: 335 GILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSQVSS 386
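
The percentages on each HSP statistics line are the match count divided by the alignment length. The following is a minimal Python sketch, not part of the original report, that recomputes them from one such line. The helper name `hsp_percentages` and the regular expression are illustrative assumptions, and the sample line is copied from the Cucumis sativus isoform X2 hit above.

```python
import re

# Hypothetical helper for illustration only: extracts the
# "Identity = a/b (p%)" and "Positives = a/b (p%)" fields of an HSP line.
STAT_RE = re.compile(r"(Identity|Positives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def hsp_percentages(line):
    """Map each field name to (matches, length, recomputed %, reported %)."""
    stats = {}
    for name, matches, length, reported in STAT_RE.findall(line):
        m, n = int(matches), int(length)
        stats[name] = (m, n, 100.0 * m / n, float(reported))
    return stats


if __name__ == "__main__":
    line = "Identity = 411/428 (96.03%), Positives = 421/428 (98.36%), Query Frame = 1"
    for name, (m, n, pct, reported) in hsp_percentages(line).items():
        # Recomputed and reported values should agree to two decimals.
        print(f"{name}: {m}/{n} = {pct:.2f}% (reported {reported}%)")
```

Values that land exactly on a rounding boundary, such as 126/448 = 28.125%, may print as 28.12 or 28.13 depending on the rounding rule a given tool applies.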

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
ASP1_ORYSJ   4.5e-92  46.37     Aspartic proteinase Asp1 OS=Oryza sativa subsp. japonica GN=ASP1 PE=2 SV=1
ASP1_ORYSI   8.9e-88  45.08     Aspartic proteinase Asp1 OS=Oryza sativa subsp. indica GN=ASP1 PE=2 SV=2
APCB1_ARATH  1.4e-80  42.93     Aspartyl protease APCB1 OS=Arabidopsis thaliana GN=APCB1 PE=1 SV=1
ASPL2_ARATH  1.3e-33  28.13     Aspartic proteinase-like protein 2 OS=Arabidopsis thaliana GN=At1g65240 PE=1 SV=...
APF1_ARATH   3.9e-27  30.50     Aspartyl protease family protein 1 OS=Arabidopsis thaliana GN=APF1 PE=1 SV=1
Match Name        E-value   Identity  Description
A0A0A0LKB0_CUCSA  3.0e-207  96.88     Uncharacterized protein OS=Cucumis sativus GN=Csa_2G338820 PE=3 SV=1
M5WHY2_PRUPE      1.9e-169  67.21     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005961mg PE=3 SV=1
A0A061DK09_THECC  1.6e-168  65.97     Eukaryotic aspartyl protease family protein isoform 1 OS=Theobroma cacao GN=TCM_...
A0A165Z0G7_DAUCA  3.0e-167  70.62     Uncharacterized protein OS=Daucus carota subsp. sativus GN=DCAR_013175 PE=4 SV=1
Q5NT86_DAUCA      3.0e-167  70.62     Nucellin-like protein OS=Daucus carota GN=DcNLP PE=3 SV=1
Match Name   E-value   Identity  Description
AT4G33490.2  8.1e-164  64.59     Eukaryotic aspartyl protease family protein
AT1G44130.1  1.1e-120  50.51     Eukaryotic aspartyl protease family protein
AT1G77480.1  8.5e-113  50.93     Eukaryotic aspartyl protease family protein
AT1G49050.1  7.7e-82   42.93     Eukaryotic aspartyl protease family protein
AT1G65240.1  7.1e-35   28.13     Eukaryotic aspartyl protease family protein
Match Name                        E-value   Identity  Description
gi|778670347|ref|XP_004147327.2|  8.5e-248  96.03     PREDICTED: aspartic proteinase Asp1 isoform X2 [Cucumis sativus]
gi|659121807|ref|XP_008460823.1|  2.7e-246  95.33     PREDICTED: aspartic proteinase Asp1 isoform X2 [Cucumis melo]
gi|778670345|ref|XP_011649449.1|  4.7e-246  95.14     PREDICTED: aspartic proteinase Asp1 isoform X1 [Cucumis sativus]
gi|659121805|ref|XP_008460822.1|  1.5e-244  94.44     PREDICTED: aspartic proteinase Asp1 isoform X1 [Cucumis melo]
gi|700207119|gb|KGN62238.1|       4.3e-207  96.88     hypothetical protein Csa_2G338820 [Cucumis sativus]
The following terms have been associated with this gene:
Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis
Vocabulary: Molecular Function
Term        Definition
GO:0004190  aspartic-type endopeptidase activity
Vocabulary: INTERPRO
Term       Definition
IPR021109  Peptidase_aspartic_dom_sf
IPR001461  Aspartic_peptidase_A1
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005576 extracellular region
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lsi10G007850.1  Lsi10G007850.1  mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term   IPR Description               Source   Source Term       Source Description                Alignment
IPR001461  Aspartic peptidase A1 family  PANTHER  PTHR13683         ASPARTYL PROTEASES                coord: 1..426; score: 1.7E
IPR021109  Aspartic peptidase domain     GENE3D   G3DSA:2.40.70.10                                    coord: 53..238; score: 1.7E-39
                                                                                                      coord: 245..420; score: 5.3
IPR021109  Aspartic peptidase domain     unknown  SSF50630          Acid proteases                    coord: 60..421; score: 2.82
None       No IPR available              PANTHER  PTHR13683:SF227   ASPARTYL PROTEASE FAMILY PROTEIN  coord: 1..426; score: 1.7E
None       No IPR available              PROFILE  PS51257           PROKAR_LIPOPROTEIN                coord: 1..18; score: