Lsi01G018950 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi01G018950
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Hydrolase, hydrolyzing O-glycosyl compounds, putative
Location: chr01:22139026..22142611 (+)
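The length of the feature can be derived from the Location field. The sketch below assumes the coordinates are 1-based and inclusive, as in GFF-style annotation; the record itself does not state this, and `feature_length` is a hypothetical helper name used only for illustration.

```python
import re


def feature_length(location: str) -> int:
    """Return the span length of a 'chrom : start .. end (strand)' string.

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    in the record).
    """
    match = re.search(r"(\d+)\s*\.\.\s*(\d+)", location)
    if match is None:
        raise ValueError(f"no 'start .. end' range found in {location!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1


print(feature_length("chr01 : 22139026 .. 22142611 (+)"))  # 3586
```

Under that assumption the gene spans 3586 bp on the plus strand of chr01.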
The following sequences are available for this feature:

Gene sequence (with intron)

TCCGAATCAGGATCGAAACAAATTACAACATATAAAGAGTATTGGAACTGGTTGGTCTTCAATCCCCTCTTTTTCTTCTTCTGGCTCAGTAAGTTTTCTAACTATTTTCTCGTATAGATTTTTACATTCTCTTCACATTTATTTGTTTCCAAATATTTCTCTCCATCTATCAAAGTTTCAAACTAAAAGAATTTAAATTCTTGAATTGAAGTTTCTAAGTTCAGTGATACTTGTAGTTGTAATATACGAAAACTAACATTATGTTTATTATTTCTTTGATTGGGTTTTTTTTTCTTAATTAAGAAAGATGGCGAGAGTTCTTATCACTTTAGTTGGACTTTTGCTACTTTGTTTCTCTGAAACATTGGCAAAAGCAGAACAATTGAAATACAAAGACCCAAAACAACCCTTGAACATTCGCATTAAGGACCTACTTGGTCGGATGACCCTCGAGGAGAAAATAGGTCAAATGGTGCAAATTGAAAGGGTTAATGCTTCTTCTAAGGTTATGGAAAAGTATTTCATTGGTAAATATCTTCCTTTATTTTTCTTCTTACATTTCCTATGTTTTGAAAAAGAAAATATATTAATTGATTTAACTTGTACAAAGAGATTGTGATTAAATGAAAATGACATCTTATTTCTTAGGGAGTGTATTGAGTGGTGGAGGCAGTGCTCCATCAAAGAGAGCTTCAGCCAAAGCTTGGGTCCATATGGTAAATAAAATTCAAAAGGGGGCTTTGTCGACTAGGCTTGGAATTCCTATGATATATGGAATTGATGCTGTACATGGTCACAACAATGTATATAATGCAACAATCTTCCCTCATAATATTGGTCTTGGAGCTACCAGGTAAGATAAATACTACATATTAAGGATATTGAGAGTTTAACTTTTAGCTTTTAAAACTCAATATGTGAGGACATTTTCTTCGATTAAACAGGGATCCTCAACTTGTGAAAAAGATTGGGATTGCTACTGCCCTTGAAGTTAGAGCAACTGGGATTCCTTATGCTTTTGCACCTTGTGTAGCAGTAACTATTTTTCTCTTCCAGTATTTCAATTGATAAAGATATCATCGCCTCTCCAATGTCATTTTGTGATATATTACATACAATTATTTTTTCTATATGTGATATTTTTTTTGTGTGACTTTTTTCCCCTTAATTAGGTTTGCAGAGATCCTAGATGGGGTCGATGTTATGAAAGCTATAGTGAAGACCCTAAGGTCGTTCAAGCTATGACAGAGATCATACCAGGTTTACAAGGAGAAATCCCACCTAATTCTCGCAAGGGTGTTCCTTATGTTGCTGGAAAGTGAGTACTCCTAAAACTTTAGTCCATATATTCGGTACACAACAATTCAACAAACTTTTAACTTTTTAGCTGGACATCAATATTATTCAAAACCAAATTTATAGTGTTAAACAAGTAATGTTTTACCAATATTGCATCGTCTAAATCGAGCGTTGATATAAAGCATTTTGAAATCAGTTAAATCAATTGAAATATTGAGATAATCGATATTTATCAATCATTTACTATTGCTGTTGTTGTTATTATTACTATATGTGATGATTTCAGTGTGTCAAACCTCGCATTTGGCCCATCAATTTGATGATGTAATTTAAAATTTAAAATAAAGGGTATATTTTTTTTGTCAAAATAATGAAAGTAGTTTTCTAATGAAGTTTCTAACGACTCTTTCTATTATTTATATAGAAAAAATGTAGCAGCCTGTGCAAAGCACTTTGTAGGTGATGGTGGAACAACTAAGGGTATCAATGAGAACAACACAGTGATAGATAGACATAGATTACTTAGCATTCATATGCCAGGTTACTATAACTCAATAATCAAGGGAGTTGCAACCATTATGGTTTCTTATTCAAGTTTGAATGGAGAGAAGATGCACGCAAACAAGAATCTTGTAATCGACTTTCTTAAGAACACTCTTCATTTTAGGGTAAATATGTCTTCGTTGCTATATATTGTAGAG
ATCTCACGTTGTTACTTAGTAGATAACCTTGGGGAAAGAAACTATACAAAGTGACTTAGATATTTTGTTGGTCTTTTTCAGGGCTTTGTAATCTCAGATTGGCAGGGCATTGATAAGATTACAACTCCACCTCATTCTAACTATACATATTCCATTATGGCAAGCGTTAATGCTGGTGTTGACATGGTTGGTACTATGCTGACATTTTGATTATATTTCAACAAATAGCAATGCTCAATTATATAATTTGTGACAGATTATGGTGCCATACAACTACACAGAGTTCATCGACGGTCTTACCTACTTGGTAAAAAATAATGTAATTCCTATTAGTCGAATTGATGATGCAGTGAAGAGAATATTGCGAGTCAAATTTGTTATGGGTTTATTTGAGAATCCATTAGCTGACTTAAGCTTGATAAATGAGCTTGGTAAACAGGTACTTTTACACAATACCTTTAATACTATGAGGGAGAAAATATTACATTTTCTTACGCTTTGTTTCTTCACAACAGGAGCATAGAGAACTAGCTAGAGAAGCTGTAAGAAAATCACTAGTGCTATTAAAGAATGGAAAATTGCCGAACAAACCATTATTGCCCCTCCCAAAGAAAGCACCAAAGATACTTGTTGCTGGCAGCCATGCAGATAACCTTGGAAATCAGTGTGGTGGTTGGACTATGGAATGGCAAGGACTTAGTGGCAACAACCTTACCACCGGTATGCAAAGTAACATAATACTTACAAACTAGTGATTGTGGGTTACGATACTGCATAACATTAATATCAACCTCTTTCTCCTAGGCACAACTGTTCTTGCAGCAATAAAAGACACAATTGATCCTGAAACAGAAATTATATTTAACGAGAATCCAAATGTGGAATTTCTCAAATCACACAATTTTTCTTATGCCATTGTGGTGGTTGGAGAATATCCATATGCAGAAACCAATGGTGATAGCTTGAATCTGACAATTCCTCACCCTGGTCCACACACGATCACAAATGTTTGTGGAGCCGTGAAATGCGTAGTTATAATAATCTCAGGACGACCGGTAGTAATTCAACCTTATATTGCTTCAATGGATGCACTTGTTGCTGCATGGCTTCCAGGAACTGAAGGCAAAGGCATTACGGATGTGTTATTTGGAGACTATGGCTTTACTGGCAAGCTTTCACAAACGTGGTTTAAGACTATTGATCAATTGCCGATGAATTTTGGAGATCCACATTATGATCCCCTTTTCCCATTTGGATATGGTATTACTACAGAGCTTGTCAAAGCTAATTAAATGAGTGTTTCGACTTGAGCTATTTGACATGTATACACAATATTCTTCCACAGTACTCTTTACTTGATACAGGTTTTTATTTTCAATTTGTTTTTGTAAAAAATATGGTTGTCTATTTAATTGTTTTAAAATCATTGTCAATGAGTTTGAATGGTACTTGCTTACCACACCAACTACAATGAAGATGTCGTATTTCTCATACTTCATCAACTCTACTTTCGGTCGCAACATGATTAAACTAATTCAATGAAAAAGTTGCAATATATGTCAAGAACATTGGTATTAGTCTTCACT

mRNA sequence

TCCGAATCAGGATCGAAACAAATTACAACATATAAAGAGTATTGGAACTGGTTGGTCTTCAATCCCCTCTTTTTCTTCTTCTGGCTCAAAAGAGAAAGATGGCGAGAGTTCTTATCACTTTAGTTGGACTTTTGCTACTTTGTTTCTCTGAAACATTGGCAAAAGCAGAACAATTGAAATACAAAGACCCAAAACAACCCTTGAACATTCGCATTAAGGACCTACTTGGTCGGATGACCCTCGAGGAGAAAATAGGTCAAATGGTGCAAATTGAAAGGGTTAATGCTTCTTCTAAGGTTATGGAAAAGTATTTCATTGGGAGTGTATTGAGTGGTGGAGGCAGTGCTCCATCAAAGAGAGCTTCAGCCAAAGCTTGGGTCCATATGGTAAATAAAATTCAAAAGGGGGCTTTGTCGACTAGGCTTGGAATTCCTATGATATATGGAATTGATGCTGTACATGGTCACAACAATGTATATAATGCAACAATCTTCCCTCATAATATTGGTCTTGGAGCTACCAGGGATCCTCAACTTGTGAAAAAGATTGGGATTGCTACTGCCCTTGAAGTTAGAGCAACTGGGATTCCTTATGCTTTTGCACCTTGTGTAGCAGTTTGCAGAGATCCTAGATGGGGTCGATGTTATGAAAGCTATAGTGAAGACCCTAAGGTCGTTCAAGCTATGACAGAGATCATACCAGGTTTACAAGGAGAAATCCCACCTAATTCTCGCAAGGGTGTTCCTTATGTTGCTGGAAAAAAAAATGTAGCAGCCTGTGCAAAGCACTTTGTAGGTGATGGTGGAACAACTAAGGGTATCAATGAGAACAACACAGTGATAGATAGACATAGATTACTTAGCATTCATATGCCAGGTTACTATAACTCAATAATCAAGGGAGTTGCAACCATTATGGTTTCTTATTCAAGTTTGAATGGAGAGAAGATGCACGCAAACAAGAATCTTGTAATCGACTTTCTTAAGAACACTCTTCATTTTAGGGGCTTTGTAATCTCAGATTGGCAGGGCATTGATAAGATTACAACTCCACCTCATTCTAACTATACATATTCCATTATGGCAAGCGTTAATGCTGGTGTTGACATGGTTGAGTTCATCGACGGTCTTACCTACTTGGTAAAAAATAATGTAATTCCTATTAGTCGAATTGATGATGCAGTGAAGAGAATATTGCGAGTCAAATTTGTTATGGGTTTATTTGAGAATCCATTAGCTGACTTAAGCTTGATAAATGAGCTTGGTAAACAGGAGCATAGAGAACTAGCTAGAGAAGCTGTAAGAAAATCACTAGTGCTATTAAAGAATGGAAAATTGCCGAACAAACCATTATTGCCCCTCCCAAAGAAAGCACCAAAGATACTTGTTGCTGGCAGCCATGCAGATAACCTTGGAAATCAGTGTGGTGGTTGGACTATGGAATGGCAAGGACTTAGTGGCAACAACCTTACCACCGGCACAACTGTTCTTGCAGCAATAAAAGACACAATTGATCCTGAAACAGAAATTATATTTAACGAGAATCCAAATGTGGAATTTCTCAAATCACACAATTTTTCTTATGCCATTGTGGTGGTTGGAGAATATCCATATGCAGAAACCAATGGTGATAGCTTGAATCTGACAATTCCTCACCCTGGTCCACACACGATCACAAATGTTTGTGGAGCCGTGAAATGCGTAGTTATAATAATCTCAGGACGACCGGTAGTAATTCAACCTTATATTGCTTCAATGGATGCACTTGTTGCTGCATGGCTTCCAGGAACTGAAGGCAAAGGCATTACGGATGTGTTATTTGGAGACTATGGCTTTACTGGCAAGCTTTCACAAACGTGGTTTAAGACTATTGATCAATTGCCGATGAATTTTGGAGATCCACATTATGATCCCCTTTTCCCATTTGGATATGGTATTACTACAGAGCTTGTCAAAGCTAATTAAATGAGTGTTTCGACTTGAGCTATTTGACATGTATAC
ACAATATTCTTCCACAGTACTCTTTACTTGATACAGGTTTTTATTTTCAATTTGTTTTTGTAAAAAATATGGTTGTCTATTTAATTGTTTTAAAATCATTGTCAATGAGTTTGAATGGTACTTGCTTACCACACCAACTACAATGAAGATGTCGTATTTCTCATACTTCATCAACTCTACTTTCGGTCGCAACATGATTAAACTAATTCAATGAAAAAGTTGCAATATATGTCAAGAACATTGGTATTAGTCTTCACT

Coding sequence (CDS)

ATGGCGAGAGTTCTTATCACTTTAGTTGGACTTTTGCTACTTTGTTTCTCTGAAACATTGGCAAAAGCAGAACAATTGAAATACAAAGACCCAAAACAACCCTTGAACATTCGCATTAAGGACCTACTTGGTCGGATGACCCTCGAGGAGAAAATAGGTCAAATGGTGCAAATTGAAAGGGTTAATGCTTCTTCTAAGGTTATGGAAAAGTATTTCATTGGGAGTGTATTGAGTGGTGGAGGCAGTGCTCCATCAAAGAGAGCTTCAGCCAAAGCTTGGGTCCATATGGTAAATAAAATTCAAAAGGGGGCTTTGTCGACTAGGCTTGGAATTCCTATGATATATGGAATTGATGCTGTACATGGTCACAACAATGTATATAATGCAACAATCTTCCCTCATAATATTGGTCTTGGAGCTACCAGGGATCCTCAACTTGTGAAAAAGATTGGGATTGCTACTGCCCTTGAAGTTAGAGCAACTGGGATTCCTTATGCTTTTGCACCTTGTGTAGCAGTTTGCAGAGATCCTAGATGGGGTCGATGTTATGAAAGCTATAGTGAAGACCCTAAGGTCGTTCAAGCTATGACAGAGATCATACCAGGTTTACAAGGAGAAATCCCACCTAATTCTCGCAAGGGTGTTCCTTATGTTGCTGGAAAAAAAAATGTAGCAGCCTGTGCAAAGCACTTTGTAGGTGATGGTGGAACAACTAAGGGTATCAATGAGAACAACACAGTGATAGATAGACATAGATTACTTAGCATTCATATGCCAGGTTACTATAACTCAATAATCAAGGGAGTTGCAACCATTATGGTTTCTTATTCAAGTTTGAATGGAGAGAAGATGCACGCAAACAAGAATCTTGTAATCGACTTTCTTAAGAACACTCTTCATTTTAGGGGCTTTGTAATCTCAGATTGGCAGGGCATTGATAAGATTACAACTCCACCTCATTCTAACTATACATATTCCATTATGGCAAGCGTTAATGCTGGTGTTGACATGGTTGAGTTCATCGACGGTCTTACCTACTTGGTAAAAAATAATGTAATTCCTATTAGTCGAATTGATGATGCAGTGAAGAGAATATTGCGAGTCAAATTTGTTATGGGTTTATTTGAGAATCCATTAGCTGACTTAAGCTTGATAAATGAGCTTGGTAAACAGGAGCATAGAGAACTAGCTAGAGAAGCTGTAAGAAAATCACTAGTGCTATTAAAGAATGGAAAATTGCCGAACAAACCATTATTGCCCCTCCCAAAGAAAGCACCAAAGATACTTGTTGCTGGCAGCCATGCAGATAACCTTGGAAATCAGTGTGGTGGTTGGACTATGGAATGGCAAGGACTTAGTGGCAACAACCTTACCACCGGCACAACTGTTCTTGCAGCAATAAAAGACACAATTGATCCTGAAACAGAAATTATATTTAACGAGAATCCAAATGTGGAATTTCTCAAATCACACAATTTTTCTTATGCCATTGTGGTGGTTGGAGAATATCCATATGCAGAAACCAATGGTGATAGCTTGAATCTGACAATTCCTCACCCTGGTCCACACACGATCACAAATGTTTGTGGAGCCGTGAAATGCGTAGTTATAATAATCTCAGGACGACCGGTAGTAATTCAACCTTATATTGCTTCAATGGATGCACTTGTTGCTGCATGGCTTCCAGGAACTGAAGGCAAAGGCATTACGGATGTGTTATTTGGAGACTATGGCTTTACTGGCAAGCTTTCACAAACGTGGTTTAAGACTATTGATCAATTGCCGATGAATTTTGGAGATCCACATTATGATCCCCTTTTCCCATTTGGATATGGTATTACTACAGAGCTTGTCAAAGCTAATTAA

Protein sequence

MARVLITLVGLLLLCFSETLAKAEQLKYKDPKQPLNIRIKDLLGRMTLEEKIGQMVQIERVNASSKVMEKYFIGSVLSGGGSAPSKRASAKAWVHMVNKIQKGALSTRLGIPMIYGIDAVHGHNNVYNATIFPHNIGLGATRDPQLVKKIGIATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKVVQAMTEIIPGLQGEIPPNSRKGVPYVAGKKNVAACAKHFVGDGGTTKGINENNTVIDRHRLLSIHMPGYYNSIIKGVATIMVSYSSLNGEKMHANKNLVIDFLKNTLHFRGFVISDWQGIDKITTPPHSNYTYSIMASVNAGVDMVEFIDGLTYLVKNNVIPISRIDDAVKRILRVKFVMGLFENPLADLSLINELGKQEHRELAREAVRKSLVLLKNGKLPNKPLLPLPKKAPKILVAGSHADNLGNQCGGWTMEWQGLSGNNLTTGTTVLAAIKDTIDPETEIIFNENPNVEFLKSHNFSYAIVVVGEYPYAETNGDSLNLTIPHPGPHTITNVCGAVKCVVIIISGRPVVIQPYIASMDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTIDQLPMNFGDPHYDPLFPFGYGITTELVKAN
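The protein sequence corresponds to a translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code, the first codons of the CDS can be translated as follows; `translate` is a hypothetical helper, not part of the database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Trailing bases that do not form a full codon are ignored; codons
    with ambiguous bases (for example N) raise KeyError.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGGCGAGAGTTCTTATCACT"))  # MARVLIT
```

The output matches the first seven residues of the protein sequence. Passing the full CDS string to `translate` would be the way to check the rest of the translation.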
BLAST of Lsi01G018950 vs. Swiss-Prot
Match: BGH3B_BACO1 (Beta-glucosidase BoGH3B OS=Bacteroides ovatus (strain ATCC 8483 / DSM 1896 / JCM 5824 / NCTC 11153) GN=BACOVA_02659 PE=1 SV=1)

HSP 1 Score: 274.6 bits (701), Expect = 2.6e-72
Identity = 194/647 (29.98%), Positives = 321/647 (49.61%), Query Frame = 1

Query: 31  PKQP-LNIRIKDLLGRMTLEEKIGQMVQI-----ERVNASSK------------VMEKYF 90
           P  P +   I++ L +MTLE+KIGQM +I       +  S K            V+ KY 
Sbjct: 30  PTDPAIETHIREWLQKMTLEQKIGQMCEITIDVVSDLETSRKKGFCLSEAMLDTVIGKYK 89

Query: 91  IGSVLSGGGSAPSKRASAKAWVHMVNKIQKGALSTRLGIPMIYGIDAVHGHNNVYNATIF 150
           +GS+L+       K+   + W   + +IQ+ ++   +GIP IYG+D +HG     + T+F
Sbjct: 90  VGSLLNVPLGVAQKK---EKWAEAIKQIQEKSMK-EIGIPCIYGVDQIHGTTYTLDGTMF 149

Query: 151 PHNIGLGATRDPQLVKKIGIATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKV 210
           P  I +GAT + +L ++    +A E +A  IP+ FAP V + RDPRW R +E+Y ED  V
Sbjct: 150 PQGINMGATFNRELTRRGAKISAYETKAGCIPWTFAPVVDLGRDPRWARMWENYGEDCYV 209

Query: 211 VQAM-TEIIPGLQGEIPPNSRKGVPYVAGKKNVAACAKHFVGDGGTTKGINENNTVIDRH 270
              M    + G QGE P           G+ NVAAC KH++G G    G +   + I R 
Sbjct: 210 NAEMGVSAVKGFQGEDPNR--------IGEYNVAACMKHYMGYGVPVSGKDRTPSSISRS 269

Query: 271 RLLSIHMPGYYNSIIKGVATIMVSYSSLNGEKMHANKNLVIDFLKNTLHFRGFVISDWQG 330
            +   H   +  ++ +G  ++MV+    NG   HAN+ L+ ++LK  L++ G +++DW  
Sbjct: 270 DMREKHFAPFLAAVRQGALSVMVNSGVDNGLPFHANRELLTEWLKEDLNWDGLIVTDWAD 329

Query: 331 IDKITTPPHSNYT--YSIMASVNAGVDM------VEFIDGLTYLVKNNVIPISRIDDAVK 390
           I+ + T  H   T   ++   +NAG+DM      V F D L  LV+   + + RIDDAV 
Sbjct: 330 INNLCTRDHIAATKKEAVKIVINAGIDMSMVPYEVSFCDYLKELVEEGEVSMERIDDAVA 389

Query: 391 RILRVKFVMGLFENPLADLSLINELGKQEHRELAREAVRKSLVLLKNGKLPNKPLLPLPK 450
           R+LR+K+ +GLF++P  D+   ++ G +E   +A +A  +S VLLKN    +  +LP+  
Sbjct: 390 RVLRLKYRLGLFDHPYWDIKKYDKFGSKEFAAVALQAAEESEVLLKN----DGNILPI-A 449

Query: 451 KAPKILVAGSHADNLGNQCGGWTMEWQG-LSGNNLTTGTTVLAAIKDTIDPETEIIFNEN 510
           K  KIL+ G +A+++    GGW+  WQG ++        T+  A+ +    E  II+   
Sbjct: 450 KGKKILLTGPNANSMRCLNGGWSYSWQGHVADEYAQAYHTIYEALCEKYGKE-NIIYEPG 509

Query: 511 PNVEFLKSHNF------------------SYAIVVVGEYPYAETNGDSLNLTIPHPGPHT 570
                 K+ N+                     I  +GE  Y ET G+  +LT+     + 
Sbjct: 510 VTYASYKNDNWWEENKPETEKPVAAAAQADIIITCIGENSYCETPGNLTDLTLSENQRNL 569

Query: 571 ITNVCGAVKCVVIIIS-GRPVVIQPYIASMDALVAAWLPGT-EGKGITDVLFGDYGFTGK 615
           +  +    K +V++++ GRP +I   +    A+V   LP    G  + ++L GD  F+GK
Sbjct: 570 VKALAATGKPIVLVLNQGRPRIINDIVPLAKAVVNIMLPSNYGGDALANLLAGDANFSGK 629
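
The percentages in each HSP summary line are the match counts divided by the alignment length (gaps included). A small sketch that recomputes the figures for the BGH3B_BACO1 hit; `percent` is a hypothetical helper name.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)


print(percent(194, 647))  # identical positions: 29.98
print(percent(321, 647))  # identical or conservatively substituted: 49.61
```

The same calculation applies to the summary line of every other hit in this report.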

BLAST of Lsi01G018950 vs. Swiss-Prot
Match: GLUA_DICDI (Lysosomal beta glucosidase OS=Dictyostelium discoideum GN=gluA PE=1 SV=2)

HSP 1 Score: 266.2 bits (679), Expect = 9.2e-70
Identity = 193/632 (30.54%), Positives = 314/632 (49.68%), Query Frame = 1

Query: 39  IKDLLGRMTLEEKIGQMVQIERVNASSK------------VMEKYFIGSVL----SGGGS 98
           + +L+ +M++ EKIGQM Q++    +S               + Y+IGS L    SGG +
Sbjct: 80  VDNLMSKMSITEKIGQMTQLDITTLTSPNTITINETTLAYYAKTYYIGSYLNSPVSGGLA 139

Query: 99  APSKRASAKAWVHMVNKIQKGALSTRLG-IPMIYGIDAVHGHNNVYNATIFPHNIGLGAT 158
                 ++  W+ M+N IQ   +      IPMIYG+D+VHG N V+ AT+FPHN GL AT
Sbjct: 140 GDIHHINSSVWLDMINTIQTIVIEGSPNKIPMIYGLDSVHGANYVHKATLFPHNTGLAAT 199

Query: 159 RDPQLVKKIGIATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKVVQAM-TEII 218
            + +        T+ +  A GIP+ FAP + +   P W R YE++ EDP V   M    +
Sbjct: 200 FNIEHATTAAQITSKDTVAVGIPWVFAPVLGIGVQPLWSRIYETFGEDPYVASMMGAAAV 259

Query: 219 PGLQGEIPPNSRKGVPYVAGKKNVAACAKHFVGDGGTTKGINENNTVIDRHRLLSIHMPG 278
            G QG    NS  G        +    AKH+ G    T G +     I    L    +P 
Sbjct: 260 RGFQG--GNNSFDG---PINAPSAVCTAKHYFGYSDPTSGKDRTAAWIPERMLRRYFLPS 319

Query: 279 YYNSII-KGVATIMVSYSSLNGEKMHANKNLVIDFLKNTLHFRGFVISDWQGIDKITTPP 338
           +  +I   G  TIM++   +NG  MH +   + + L+  L F G  ++DWQ I+K+    
Sbjct: 320 FAEAITGAGAGTIMINSGEVNGVPMHTSYKYLTEVLRGELQFEGVAVTDWQDIEKLVYFH 379

Query: 339 HS--NYTYSIMASVNAGVDM------VEFIDGLTYLVKNNVIPISRIDDAVKRILRVKFV 398
           H+  +   +I+ +++AG+DM      + F   L  +V    +P SR+D +V+RIL +K+ 
Sbjct: 380 HTAGSAEEAILQALDAGIDMSMVPLDLSFPIILAEMVAAGTVPESRLDLSVRRILNLKYA 439

Query: 399 MGLFENPL--ADLSLINELGKQEHRELAREAVRKSLVLLKNGKLPNKPLLPLPKKAPK-I 458
           +GLF NP    + ++++ +G+ + RE A     +S+ LL+N       +LPL     K +
Sbjct: 440 LGLFSNPYPNPNAAIVDTIGQVQDREAAAATAEESITLLQN----KNNILPLNTNTIKNV 499

Query: 459 LVAGSHADNLGNQCGGWTMEWQG-LSGNNLTTGTTVLAAIKD------------TIDPET 518
           L+ G  AD++ N  GGW++ WQG    +    GT++L  +++            TI  E 
Sbjct: 500 LLTGPSADSIRNLNGGWSVHWQGAYEDSEFPFGTSILTGLREITNDTADFNIQYTIGHEI 559

Query: 519 EIIFNENPNVEFLK-SHNFSYAIVVVGEYPYAETNGDSLNLTIPHPGPHTITNVCGAVKC 578
            +  N+    E ++ + +    +VV+GE P AET GD  +L++       +  +    K 
Sbjct: 560 GVPTNQTSIDEAVELAQSSDVVVVVIGELPEAETPGDIYDLSMDPNEVLLLQQLVDTGKP 619

Query: 579 VV-IIISGRPVVIQP-YIASMDALVAAWLPGTE-GKGITDVLFGDYGFTGKLSQTWFKTI 615
           VV I++  RP ++ P  + S  A++ A+LPG+E GK I ++L G+   +G+L  T+  T 
Sbjct: 620 VVLILVEARPRILPPDLVYSCAAVLMAYLPGSEGGKPIANILMGNVNPSGRLPLTYPGTT 679

BLAST of Lsi01G018950 vs. Swiss-Prot
Match: BGLX_SALTY (Periplasmic beta-glucosidase OS=Salmonella typhimurium (strain LT2 / SGSC1412 / ATCC 700720) GN=bglX PE=3 SV=2)

HSP 1 Score: 223.0 bits (567), Expect = 9.0e-57
Identity = 193/664 (29.07%), Positives = 309/664 (46.54%), Query Frame = 1

Query: 21  AKAEQLKYKDPKQP--LNIRIKDLLGRMTLEEKIGQM--VQIERVNASSKVMEKYFIGSV 80
           A AE L    P  P   +  + DLL +MT++EKIGQ+  + +   N    + E    G V
Sbjct: 18  ALAENLFGNHPLTPEARDAFVTDLLKKMTVDEKIGQLRLISVGPDNPKEAIREMIKDGQV 77

Query: 81  LSGGGSAPSKRASAKAWVHMVNKIQKGALSTRLGIPMIYGIDAVHGHNNVYNATIFPHNI 140
                 A     + +    M +++   ALS RL IP+ +  D VHG       T+FP ++
Sbjct: 78  -----GAIFNTVTRQDIRQMQDQVM--ALS-RLKIPLFFAYDVVHGQR-----TVFPISL 137

Query: 141 GLGATRDPQLVKKIGIATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKVVQAM 200
           GL ++ +   V+ +G  +A E    G+   +AP V V RDPRWGR  E + ED  +   M
Sbjct: 138 GLASSFNLDAVRTVGRVSAYEAADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSIM 197

Query: 201 TE-IIPGLQGEIPPNSRKGVPYVAGKKNVAACAKHFVGDGGTTKGINENNTVIDRHRLLS 260
            E ++  +QG+ P          A + +V    KHF   G    G   N   +   RL +
Sbjct: 198 GETMVKAMQGKSP----------ADRYSVMTSVKHFAAYGAVEGGKEYNTVDMSSQRLFN 257

Query: 261 IHMPGYYNSIIKGVATIMVSYSSLNGEKMHANKNLVIDFLKNTLHFRGFVISDWQGIDK- 320
            +MP Y   +  G   +MV+ +SLNG    ++  L+ D L++   F+G  +SD   I + 
Sbjct: 258 DYMPPYKAGLDAGSGAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGITVSDHGAIKEL 317

Query: 321 ITTPPHSNYTYSIMASVNAGVDMVE----FIDGLTYLVKNNVIPISRIDDAVKRILRVKF 380
           I     ++   ++  ++ AGVDM      +   L  L+K+  + ++ +DDA + +L VK+
Sbjct: 318 IKHGTAADPEDAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAELDDATRHVLNVKY 377

Query: 381 VMGLFENPLADLS------LINELGKQEHRELAREAVRKSLVLLKNGKLPNKPLLPLPKK 440
            MGLF +P + L       +      + HR+ ARE  R+S+VLLKN +L   PL    KK
Sbjct: 378 DMGLFNDPYSHLGPKESDPVDTNAESRLHRKEAREVARESVVLLKN-RLETLPL----KK 437

Query: 441 APKILVAGSHADNLGNQCGGWTMEWQGLSGNNLTTGTTVLAAIKDTIDPETEIIFNENPN 500
           +  I V G  AD+  +  G W+    G++  ++    TVLA I++ +    +I++ +  N
Sbjct: 438 SGTIAVVGPLADSQRDVMGSWSA--AGVANQSV----TVLAGIQNAVGDGAKILYAKGAN 497

Query: 501 -------VEFLK-----------------------SHNFSYAIVVVGE-YPYAETNGDSL 560
                  V+FL                        +      + VVGE    A       
Sbjct: 498 ITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQGMAHEASSRT 557

Query: 561 NLTIPHPGPHTITNVCGAVK-CVVIIISGRPVVIQPYIASMDALVAAWLPGTE-GKGITD 615
           N+TIP      IT +    K  V+++++GRP+ +       DA++  W  GTE G  I D
Sbjct: 558 NITIPQSQRDLITALKATGKPLVLVLMNGRPLALVKEDQQADAILETWFAGTEGGNAIAD 617

BLAST of Lsi01G018950 vs. Swiss-Prot
Match: BGLX_ECOLI (Periplasmic beta-glucosidase OS=Escherichia coli (strain K12) GN=bglX PE=3 SV=2)

HSP 1 Score: 217.2 bits (552), Expect = 4.9e-55
Identity = 177/644 (27.48%), Positives = 293/644 (45.50%), Query Frame = 1

Query: 39  IKDLLGRMTLEEKIGQM--VQIERVNASSKVMEKYFIGSVLSGGGSAPSKRASAKAWVHM 98
           + +LL +MT++EKIGQ+  + +   N    + E    G V  G       R   +A    
Sbjct: 38  VTELLKKMTVDEKIGQLRLISVGPDNPKEAIREMIKDGQV--GAIFNTVTRQDIRAMQDQ 97

Query: 99  VNKIQKGALSTRLGIPMIYGIDAVHGHNNVYNATIFPHNIGLGATRDPQLVKKIGIATAL 158
           V ++      +RL IP+ +  D +HG       T+FP ++GL ++ +   VK +G  +A 
Sbjct: 98  VMEL------SRLKIPLFFAYDVLHGQR-----TVFPISLGLASSFNLDAVKTVGRVSAY 157

Query: 159 EVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKVVQAMTE-IIPGLQGEIPPNSRKGV 218
           E    G+   +AP V V RDPRWGR  E + ED  +   M + ++  +QG+ P       
Sbjct: 158 EAADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSP------- 217

Query: 219 PYVAGKKNVAACAKHFVGDGGTTKGINENNTVIDRHRLLSIHMPGYYNSIIKGVATIMVS 278
              A + +V    KHF   G    G   N   +   RL + +MP Y   +  G   +MV+
Sbjct: 218 ---ADRYSVMTSVKHFAAYGAVEGGKEYNTVDMSPQRLFNDYMPPYKAGLDAGSGAVMVA 277

Query: 279 YSSLNGEKMHANKNLVIDFLKNTLHFRGFVISDWQGIDKI-----TTPPHSNYTYSIMAS 338
            +SLNG    ++  L+ D L++   F+G  +SD   I ++        P      ++ + 
Sbjct: 278 LNSLNGTPATSDSWLLKDVLRDQWGFKGITVSDHGAIKELIKHGTAADPEDAVRVALKSG 337

Query: 339 VNAGVDMVEFIDGLTYLVKNNVIPISRIDDAVKRILRVKFVMGLFENPLADLS------L 398
           +N  +    +   L  L+K+  + ++ +DDA + +L VK+ MGLF +P + L       +
Sbjct: 338 INMSMSDEYYSKYLPGLIKSGKVTMAELDDAARHVLNVKYDMGLFNDPYSHLGPKESDPV 397

Query: 399 INELGKQEHRELAREAVRKSLVLLKNGKLPNKPLLPLPKKAPKILVAGSHADNLGNQCGG 458
                 + HR+ ARE  R+SLVLLKN +L   PL    KK+  I V G  AD+  +  G 
Sbjct: 398 DTNAESRLHRKEAREVARESLVLLKN-RLETLPL----KKSATIAVVGPLADSKRDVMGS 457

Query: 459 WTMEWQGLSGNNLTTGTTVLAAIKDTIDPETEIIFNENPNV-------EFLKSH------ 518
           W+    G++  ++    TVL  IK+ +    ++++ +  NV       +FL  +      
Sbjct: 458 WSA--AGVADQSV----TVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVKV 517

Query: 519 -----------------NFSYAIVVVGE-YPYAETNGDSLNLTIPHPGPHTITNVCGAVK 578
                                 + VVGE    A       ++TIP      I  +    K
Sbjct: 518 DPRSPQEMIDEAVQTAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAALKATGK 577

Query: 579 -CVVIIISGRPVVIQPYIASMDALVAAWLPGTE-GKGITDVLFGDYGFTGKLSQTWFKTI 615
             V+++++GRP+ +       DA++  W  GTE G  I DVLFGDY  +GKL  ++ +++
Sbjct: 578 PLVLVLMNGRPLALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPMSFPRSV 637

BLAST of Lsi01G018950 vs. Swiss-Prot
Match: BGLC_ASPOR (Probable beta-glucosidase C OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40) GN=bglC PE=3 SV=2)

HSP 1 Score: 154.1 bits (388), Expect = 5.1e-36
Identity = 134/456 (29.39%), Positives = 197/456 (43.20%), Query Frame = 1

Query: 28  YKDPKQPLNIRIKDLLGRMTLEEKIGQMVQIERVNA---------SSKVMEKYFIGSVLS 87
           YKD    ++ R+ DLL RMT+EEK GQ+     ++          ++       IG    
Sbjct: 46  YKDASYCIDERVDDLLARMTIEEKAGQLFHTRLMDGPLDDEGSGNNAHNSTSNMIGEKHM 105

Query: 88  GGGSAPSKRASAKAWVHMVNKIQKGALSTRLGIPMIYGIDAVHGHN-NV---YNATIF-- 147
              +  S   +A      +N+IQ+ AL TRLGIP+    D  H    NV   + A +F  
Sbjct: 106 THFNLASDITNATETAEFINRIQELALQTRLGIPVTVSTDPRHSFTENVGTGFKAGVFSQ 165

Query: 148 -PHNIGLGATRDPQLVKKIGIATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPK 207
            P +IGL A RDP +V+K       E  A GI  A  P V +  +PRW R   ++ E+  
Sbjct: 166 WPESIGLAALRDPYVVRKFAEVAKEEYIAVGIRAALHPQVDLSTEPRWARISNTWGENST 225

Query: 208 VV-QAMTEIIPGLQGEIPPNSRKGVPYVAGKKNVAACAKHFVGDGGTTKG------INEN 267
           +  + + E I G QG+             G ++V    KHF G G    G        +N
Sbjct: 226 LTSELLVEYIKGFQGD-----------KLGPQSVKTVTKHFPGGGPVENGEDSHFAYGKN 285

Query: 268 NTVIDRHRLLSIHMPGYYNSIIKGVATIMVSYSSLNGEKMHA-----NKNLVIDFLKNTL 327
            T    +  L  H+  +  +I  G   IM  YS   G +        NK +V + L+N L
Sbjct: 286 QTYPGNN--LEEHLKPFKAAIAAGATEIMPYYSRPIGTEYEPVAFSFNKRIVTELLRNEL 345

Query: 328 HFRGFVISDWQ-----------------GIDKITTPPHSNYTYSIMASVNAGVDMVEFID 387
            F G V++DW                  G++ +T    +            G +  E I 
Sbjct: 346 GFDGIVLTDWGLITDGYIAGQYMPARAWGVENLTELQRAARILDAGCDQFGGEERPELI- 405

Query: 388 GLTYLVKNNVIPISRIDDAVKRILRVKFVMGLFENPLADLSLINE-LGKQEHRELAREAV 436
               LV+  +I   RID +V+R+L+ KFV+GLF+NP  D       +G      L REA 
Sbjct: 406 --VQLVQEGIISEDRIDVSVRRLLKEKFVLGLFDNPFVDAEAAGRVVGNDYFVRLGREAQ 465

BLAST of Lsi01G018950 vs. TrEMBL
Match: A0A0A0LI54_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G842090 PE=4 SV=1)

HSP 1 Score: 1127.9 bits (2916), Expect = 0.0e+00
Identity = 545/628 (86.78%), Positives = 591/628 (94.11%), Query Frame = 1

Query: 1   MAR-VLITLVGLLLLCFSETLAKAEQLKYKDPKQPLNIRIKDLLGRMTLEEKIGQMVQIE 60
           MAR VLIT VGLL+LCFSETLAKAE LKYKDPKQPLN+RIKDLLGRMTLEEKIGQMVQIE
Sbjct: 2   MARSVLITFVGLLVLCFSETLAKAEYLKYKDPKQPLNVRIKDLLGRMTLEEKIGQMVQIE 61

Query: 61  RVNASSKVMEKYFIGSVLSGGGSAPSKRASAKAWVHMVNKIQKGALSTRLGIPMIYGIDA 120
           R NAS+ VM++YFIGSVLSGGGSAPSK+ASAK WVHMVNKIQ+ ALSTRLGIPMIYGIDA
Sbjct: 62  RANASADVMKQYFIGSVLSGGGSAPSKQASAKDWVHMVNKIQEAALSTRLGIPMIYGIDA 121

Query: 121 VHGHNNVYNATIFPHNIGLGATRDPQLVKKIGIATALEVRATGIPYAFAPCVAVCRDPRW 180
           VHGHNNVYNATIFPHNIGLGATRDPQL+K+IG ATALEVRATGIPYAFAPC+AVCRDPRW
Sbjct: 122 VHGHNNVYNATIFPHNIGLGATRDPQLLKRIGAATALEVRATGIPYAFAPCIAVCRDPRW 181

Query: 181 GRCYESYSEDPKVVQAMTEIIPGLQGEIPPNSRKGVPYVAGKKNVAACAKHFVGDGGTTK 240
           GRCYESY ED  +VQAMTEIIPGLQG++P N RKGVPYVAGK NVAACAKHFVGDGGTTK
Sbjct: 182 GRCYESYGEDHTIVQAMTEIIPGLQGDVPANIRKGVPYVAGKNNVAACAKHFVGDGGTTK 241

Query: 241 GINENNTVIDRHRLLSIHMPGYYNSIIKGVATIMVSYSSLNGEKMHANKNLVIDFLKNTL 300
           GINENNTV+D H L SIHMP YYNSIIKGVAT+MVSYSS+NGEKMHANK LV DFLKNTL
Sbjct: 242 GINENNTVVDGHGLFSIHMPAYYNSIIKGVATVMVSYSSINGEKMHANKKLVTDFLKNTL 301

Query: 301 HFRGFVISDWQGIDKITTPPHSNYTYSIMASVNAGVDMV-------EFIDGLTYLVKNNV 360
           HF+GFVISDWQGIDKITTPPH+NYTYSI+ASVNAGVDM+       EFIDGLTYLVKNN 
Sbjct: 302 HFKGFVISDWQGIDKITTPPHANYTYSILASVNAGVDMIMVPYNYTEFIDGLTYLVKNNA 361

Query: 361 IPISRIDDAVKRILRVKFVMGLFENPLADLSLINELGKQEHRELAREAVRKSLVLLKNGK 420
           IPISRIDDAVKRILRVKFVMGLFENPLADLSLINELGKQEHRELAREAVRKSLVLLKNGK
Sbjct: 362 IPISRIDDAVKRILRVKFVMGLFENPLADLSLINELGKQEHRELAREAVRKSLVLLKNGK 421

Query: 421 LPNKPLLPLPKKAPKILVAGSHADNLGNQCGGWTMEWQGLSGNNLTTGTTVLAAIKDTID 480
           LPN+PLLPLPKKAPKILVAG+HA++LGNQCGGWTMEWQGL+GNNLT+GTT+L AIKDT+D
Sbjct: 422 LPNQPLLPLPKKAPKILVAGTHANDLGNQCGGWTMEWQGLTGNNLTSGTTILTAIKDTVD 481

Query: 481 PETEIIFNENPNVEFLKSHNFSYAIVVVGEYPYAETNGDSLNLTIPHPGPHTITNVCGAV 540
           PETE++F++NPN EFL++H FSYAIVVVGE+PYAETNGDSLNLTIP PGP TI NVCGAV
Sbjct: 482 PETEVVFHDNPNAEFLQTHQFSYAIVVVGEHPYAETNGDSLNLTIPEPGPETIKNVCGAV 541

Query: 541 KCVVIIISGRPVVIQPYIASMDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTID 600
           KCVV++ISGRPVV+QPYI S+DA+VAAWLPGTEGKGI+DVLFGDYGFTGKLSQTWFK++D
Sbjct: 542 KCVVVVISGRPVVLQPYIDSIDAVVAAWLPGTEGKGISDVLFGDYGFTGKLSQTWFKSVD 601

Query: 601 QLPMNFGDPHYDPLFPFGYGITTELVKA 621
           QLPMNFGD HYDPLFPFG+G+TT+ VKA
Sbjct: 602 QLPMNFGDAHYDPLFPFGFGLTTQPVKA 629

BLAST of Lsi01G018950 vs. TrEMBL
Match: A0A0A0LY55_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G661750 PE=4 SV=1)

HSP 1 Score: 1073.9 bits (2776), Expect = 6.7e-311
Identity = 521/627 (83.09%), Positives = 578/627 (92.19%), Query Frame = 1

Query: 1   MARVLITLVGLLLLCFSETLAKAEQLKYKDPKQPLNIRIKDLLGRMTLEEKIGQMVQIER 60
           MA+ +I L+ LLL+C  ET AKAE  KYKDP Q LN+RIKDLLGRMTLEEKIGQMVQIER
Sbjct: 2   MAKAII-LIALLLICCFETGAKAENFKYKDPTQRLNVRIKDLLGRMTLEEKIGQMVQIER 61

Query: 61  VNASSKVMEKYFIGSVLSGGGSAPSKRASAKAWVHMVNKIQKGALSTRLGIPMIYGIDAV 120
           VNAS++VM+KYFIGSVLSGGGS PSK+ASA+ W++MVN+IQKGALSTRLGIPMIYGIDAV
Sbjct: 62  VNASTEVMKKYFIGSVLSGGGSVPSKQASAQDWINMVNEIQKGALSTRLGIPMIYGIDAV 121

Query: 121 HGHNNVYNATIFPHNIGLGATRDPQLVKKIGIATALEVRATGIPYAFAPCVAVCRDPRWG 180
           HGHNNVYNATIFPHNIGLGATRDPQL+K+IG+A+A E+RATGIPYAFAPCVAVCRDPRWG
Sbjct: 122 HGHNNVYNATIFPHNIGLGATRDPQLLKRIGVASAREIRATGIPYAFAPCVAVCRDPRWG 181

Query: 181 RCYESYSEDPKVVQAMTEIIPGLQGEIPPNSRKGVPYVAGKKNVAACAKHFVGDGGTTKG 240
           RCYESY EDPK+VQ MTEIIPGLQGEIPPNSRKGVPYVAGK+NV ACAKH+VGDGGTTKG
Sbjct: 182 RCYESYGEDPKIVQEMTEIIPGLQGEIPPNSRKGVPYVAGKENVVACAKHYVGDGGTTKG 241

Query: 241 INENNTVIDRHRLLSIHMPGYYNSIIKGVATIMVSYSSLNGEKMHANKNLVIDFLKNTLH 300
           I+ENNTVIDRH LLSIHMPGYY+SIIKGVATIMVSYSS NGEKMHANKNLV DFLKNTLH
Sbjct: 242 IDENNTVIDRHGLLSIHMPGYYHSIIKGVATIMVSYSSWNGEKMHANKNLVTDFLKNTLH 301

Query: 301 FRGFVISDWQGIDKITTPPHSNYTYSIMASVNAGVDMV-------EFIDGLTYLVKNNVI 360
           F+GFVISDW+ ID+IT PPH+NYTYSI+AS+ AG+DM+       EFIDGLT LVK+N I
Sbjct: 302 FQGFVISDWEAIDRITDPPHANYTYSILASITAGLDMIMIPYNYPEFIDGLTNLVKSNYI 361

Query: 361 PISRIDDAVKRILRVKFVMGLFENPLADLSLINELGKQEHRELAREAVRKSLVLLKNGKL 420
           PISRIDDAVKRILRVKFVMGLFENP+ADLSL+NELGKQEHRELAREAVRKSLVLLKNGK 
Sbjct: 362 PISRIDDAVKRILRVKFVMGLFENPIADLSLVNELGKQEHRELAREAVRKSLVLLKNGKS 421

Query: 421 PNKPLLPLPKKAPKILVAGSHADNLGNQCGGWTMEWQGLSGNNLTTGTTVLAAIKDTIDP 480
            +KPLLPL KK  KILVAGSHA+NLG QCGGWT+EWQGLSGNNLT+GTTVL AIKDT+DP
Sbjct: 422 ADKPLLPLEKKTQKILVAGSHANNLGYQCGGWTIEWQGLSGNNLTSGTTVLDAIKDTVDP 481

Query: 481 ETEIIFNENPNVEFLKSHNFSYAIVVVGEYPYAETNGDSLNLTIPHPGPHTITNVCGAVK 540
            TE+IFNENP+ + L+S  FSYAIVVVGE+PYAE NGDSLNLTIP PGP+TITNVCG +K
Sbjct: 482 TTEVIFNENPDKKSLQSDTFSYAIVVVGEHPYAELNGDSLNLTIPDPGPNTITNVCGVIK 541

Query: 541 CVVIIISGRPVVIQPYIASMDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTIDQ 600
           C V+IISGRPVVIQPY+ S+DALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKT+DQ
Sbjct: 542 CAVVIISGRPVVIQPYVDSIDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTVDQ 601

Query: 601 LPMNFGDPHYDPLFPFGYGITTELVKA 621
           LPMNFG+P+YDPLFPFG+G+TT+ +K+
Sbjct: 602 LPMNFGNPNYDPLFPFGHGLTTQPIKS 627

BLAST of Lsi01G018950 vs. TrEMBL
Match: A0A0A0LFL8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G842070 PE=4 SV=1)

HSP 1 Score: 1043.1 bits (2696), Expect = 1.3e-301
Identity = 496/628 (78.98%), Positives = 562/628 (89.49%), Query Frame = 1

Query: 1   MARVLITLVGLLLLCFSETLAKAEQLKYKDPKQPLNIRIKDLLGRMTLEEKIGQMVQIER 60
           MA+ LI  +G  + C +E  AK + ++YKDPKQPLN+RI DLLGRMTLEEKIGQMVQI+R
Sbjct: 1   MAKNLIFFMGFFIFCLTEVWAKHQYMRYKDPKQPLNVRISDLLGRMTLEEKIGQMVQIDR 60

Query: 61  VNASSKVMEKYFIGSVLSGGGSAPSKRASAKAWVHMVNKIQKGALSTRLGIPMIYGIDAV 120
             AS KVM+KY IGSVLSGGGS PSK AS K W+ MVN+ QKG+LSTRLGIPMIYGIDAV
Sbjct: 61  TVASKKVMKKYLIGSVLSGGGSVPSKEASPKVWIDMVNEFQKGSLSTRLGIPMIYGIDAV 120

Query: 121 HGHNNVYNATIFPHNIGLGATRDPQLVKKIGIATALEVRATGIPYAFAPCVAVCRDPRWG 180
           HGHNNVY ATIFPHN+GLGATRDP L K+IG ATALEVRATGI Y FAPC+AVCRDPRWG
Sbjct: 121 HGHNNVYKATIFPHNVGLGATRDPNLAKRIGAATALEVRATGISYVFAPCIAVCRDPRWG 180

Query: 181 RCYESYSEDPKVVQAMTEIIPGLQGEIPPNSRKGVPYVAGKKNVAACAKHFVGDGGTTKG 240
           RC+ESYSEDPKVVQ MTEII GLQGEIP NSRKGVPYVAG++ VAACAKH+VGDGGTTKG
Sbjct: 181 RCFESYSEDPKVVQEMTEIISGLQGEIPSNSRKGVPYVAGREKVAACAKHYVGDGGTTKG 240

Query: 241 INENNTVIDRHRLLSIHMPGYYNSIIKGVATIMVSYSSLNGEKMHANKNLVIDFLKNTLH 300
           +NENNT+  RH LLSIHMPGYYNSIIKGV+T+M+SYSS NG+KMH N++L+  FLKNTL 
Sbjct: 241 MNENNTLASRHGLLSIHMPGYYNSIIKGVSTVMISYSSWNGKKMHENRDLITGFLKNTLR 300

Query: 301 FRGFVISDWQGIDKITTPPHSNYTYSIMASVNAGVDMV-------EFIDGLTYLVKNNVI 360
           FRGFVISDWQGID+IT+PPH+NYTYSI+A + AG+DM+       EFIDGLTYLVK NVI
Sbjct: 301 FRGFVISDWQGIDRITSPPHANYTYSIIAGITAGIDMIMVPFNYTEFIDGLTYLVKTNVI 360

Query: 361 PISRIDDAVKRILRVKFVMGLFENPLADLSLINELGKQEHRELAREAVRKSLVLLKNGKL 420
           PISRIDDAVKRILRVKFVMGLFENPLAD S +NELGK+EHRELAREAVRKSLVLLKNG+ 
Sbjct: 361 PISRIDDAVKRILRVKFVMGLFENPLADSSFVNELGKKEHRELAREAVRKSLVLLKNGES 420

Query: 421 PNKPLLPLPKKAPKILVAGSHADNLGNQCGGWTMEWQGLSGNNLTTGTTVLAAIKDTIDP 480
            +KP+LPLPKK PKILVAGSHA+NLG QCGGWT+EWQGL GNNLT+GTT+L+AIKDT+DP
Sbjct: 421 ADKPILPLPKKVPKILVAGSHANNLGFQCGGWTIEWQGLGGNNLTSGTTILSAIKDTVDP 480

Query: 481 ETEIIFNENPNVEFLKSHNFSYAIVVVGEYPYAETNGDSLNLTIPHPGPHTITNVCGAVK 540
           +T+++F ENP++EF+KS+ FSYAIVVVGEYPYAET GDSLNLTIP PGP TITNVCGAVK
Sbjct: 481 KTKVVFKENPDMEFVKSNKFSYAIVVVGEYPYAETFGDSLNLTIPEPGPSTITNVCGAVK 540

Query: 541 CVVIIISGRPVVIQPYIASMDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTIDQ 600
           CVVI+ISGRPVV+QPYI+S+DALVAAWLPGTEGKGI+DVLFGDYGF+GKLS+TWFKT+DQ
Sbjct: 541 CVVIVISGRPVVLQPYISSIDALVAAWLPGTEGKGISDVLFGDYGFSGKLSRTWFKTVDQ 600

Query: 601 LPMNFGDPHYDPLFPFGYGITTELVKAN 622
           LPMN GD HYDPLFPFG+G+TT  +KAN
Sbjct: 601 LPMNVGDAHYDPLFPFGFGLTTNPIKAN 628

BLAST of Lsi01G018950 vs. TrEMBL
Match: M5VW21_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002894mg PE=4 SV=1)

HSP 1 Score: 1017.3 bits (2629), Expect = 7.9e-294
Identity = 488/622 (78.46%), Positives = 555/622 (89.23%), Query Frame = 1

Query: 1   MARVLITLVGLLLLCFSETLAKAEQLKYKDPKQPLNIRIKDLLGRMTLEEKIGQMVQIER 60
           MAR+ I L+GLL LCF+  +A+A+ + YKDPKQPLN RIKDL+ RMTLEEKIGQMVQI+R
Sbjct: 1   MARIPIFLMGLLFLCFNIAIAEAQYINYKDPKQPLNSRIKDLVSRMTLEEKIGQMVQIDR 60

Query: 61  VNASSKVMEKYFIGSVLSGGGSAPSKRASAKAWVHMVNKIQKGALSTRLGIPMIYGIDAV 120
             AS++VM+KYFIGS+LSGGGS P+++ASA+ W++MVN  QKG+LSTRLGIP+IYGIDAV
Sbjct: 61  SVASAEVMKKYFIGSILSGGGSVPAQKASAETWINMVNDFQKGSLSTRLGIPLIYGIDAV 120

Query: 121 HGHNNVYNATIFPHNIGLGATRDPQLVKKIGIATALEVRATGIPYAFAPCVAVCRDPRWG 180
           HGHNNVY ATIFPHNIGLGATR     ++IG ATALE RATGIPY FAPC+AVCRDPRWG
Sbjct: 121 HGHNNVYKATIFPHNIGLGATR-----QRIGAATALEARATGIPYVFAPCIAVCRDPRWG 180

Query: 181 RCYESYSEDPKVVQAMTEIIPGLQGEIPPNSRKGVPYVAGKKNVAACAKHFVGDGGTTKG 240
           RCYESYSEDPK+VQAMTEIIPGLQGEIP NSRKGVP+VAG K VAACAKHFVGDGGTTKG
Sbjct: 181 RCYESYSEDPKIVQAMTEIIPGLQGEIPANSRKGVPFVAGNKKVAACAKHFVGDGGTTKG 240

Query: 241 INENNTVIDRHRLLSIHMPGYYNSIIKGVATIMVSYSSLNGEKMHANKNLVIDFLKNTLH 300
           INENNTVI+RH LLSIHMPGYYNSIIKGVATIMVSYSS NG KMHAN +LV  FLKNTL 
Sbjct: 241 INENNTVINRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGVKMHANHDLVTAFLKNTLR 300

Query: 301 FRGFVISDWQGIDKITTPPHSNYTYSIMASVNAGVDMV-------EFIDGLTYLVKNNVI 360
           FRGFVISDW+GID+IT+PPH+NY+YSI A +NAG+DMV       EFIDGLT+LVKN +I
Sbjct: 301 FRGFVISDWEGIDRITSPPHANYSYSIQAGINAGIDMVMVPYNYMEFIDGLTFLVKNKII 360

Query: 361 PISRIDDAVKRILRVKFVMGLFENPLADLSLINELGKQEHRELAREAVRKSLVLLKNGKL 420
           P+SRIDDAVKRILRVKFVMGLFE P AD+SL+++LG QEHRELAREAVR+SLVLLKNG+ 
Sbjct: 361 PMSRIDDAVKRILRVKFVMGLFEEPFADMSLVHQLGSQEHRELAREAVRRSLVLLKNGES 420

Query: 421 PNKPLLPLPKKAPKILVAGSHADNLGNQCGGWTMEWQGLSGNNLTTGTTVLAAIKDTIDP 480
             KPLLPLPKK  KILVAGSHADNLG QCGGWT+EWQGLSGNNLT GTT+L AIK+T+DP
Sbjct: 421 AEKPLLPLPKKTSKILVAGSHADNLGYQCGGWTIEWQGLSGNNLTEGTTILTAIKNTVDP 480

Query: 481 ETEIIFNENPNVEFLKSHNFSYAIVVVGEYPYAETNGDSLNLTIPHPGPHTITNVCGAVK 540
           + ++++ ENP+ +F+KS+N SYAIVVVGE+PYAET GDSLNLTIP PGP TITNVCG VK
Sbjct: 481 KAQVVYKENPDADFVKSNNISYAIVVVGEHPYAETFGDSLNLTIPDPGPTTITNVCGTVK 540

Query: 541 CVVIIISGRPVVIQPYIASMDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTIDQ 600
           CVVI+ISGRPVVIQPY+AS+DALV AWLPGTEG+G+ DVLFGDYGFTGKLS+TWFKT+DQ
Sbjct: 541 CVVIVISGRPVVIQPYVASIDALVTAWLPGTEGQGVADVLFGDYGFTGKLSRTWFKTVDQ 600

Query: 601 LPMNFGDPHYDPLFPFGYGITT 616
           LPMN GD HYDPLFPFG+G+TT
Sbjct: 601 LPMNVGDAHYDPLFPFGFGLTT 617

BLAST of Lsi01G018950 vs. TrEMBL
Match: A0A061F0I5_THECC (Glycosyl hydrolase family protein OS=Theobroma cacao GN=TCM_025896 PE=4 SV=1)

HSP 1 Score: 1016.9 bits (2628), Expect = 1.0e-293
Identity = 487/613 (79.45%), Positives = 551/613 (89.89%), Query Frame = 1

Query: 15   CFSETLAKAEQLKYKDPKQPLNIRIKDLLGRMTLEEKIGQMVQIERVNASSKVMEKYFIG 74
            C SE   KAE +KYKDPKQPLN+RIKDL+GRMTLEEKIGQMVQIER  AS++VM+KYFIG
Sbjct: 611  CSSE---KAEHVKYKDPKQPLNVRIKDLIGRMTLEEKIGQMVQIERAVASAEVMKKYFIG 670

Query: 75   SVLSGGGSAPSKRASAKAWVHMVNKIQKGALSTRLGIPMIYGIDAVHGHNNVYNATIFPH 134
            SVLSGGGS P+ +ASAK W++MVN+ QKG+LSTRLGIPMIYGIDAVHGHNNVY ATIFPH
Sbjct: 671  SVLSGGGSVPAPKASAKTWLNMVNEFQKGSLSTRLGIPMIYGIDAVHGHNNVYKATIFPH 730

Query: 135  NIGLGATRDPQLVKKIGIATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKVVQ 194
            NIGLGATRDP LVKKIG ATALEVRATGIPYAFAPC+AVCRDPRWGRCYESYSED K+VQ
Sbjct: 731  NIGLGATRDPALVKKIGAATALEVRATGIPYAFAPCLAVCRDPRWGRCYESYSEDHKIVQ 790

Query: 195  AMTEIIPGLQGEIPPNSRKGVPYVAGKKNVAACAKHFVGDGGTTKGINENNTVIDRHRLL 254
            AMTEIIPGLQG+IP NSRKGVP+VAGKKNVAACAKH+VGDGGTT+GINENNTVIDRH LL
Sbjct: 791  AMTEIIPGLQGDIPSNSRKGVPFVAGKKNVAACAKHYVGDGGTTRGINENNTVIDRHGLL 850

Query: 255  SIHMPGYYNSIIKGVATIMVSYSSLNGEKMHANKNLVIDFLKNTLHFRGFVISDWQGIDK 314
            SIHMP YYNSIIKGV+T+M SYSS NG K HAN  +V +FLK TL FRGFVISDW+GID+
Sbjct: 851  SIHMPAYYNSIIKGVSTVMTSYSSWNGVKNHANHEMVTNFLKKTLRFRGFVISDWEGIDR 910

Query: 315  ITTPPHSNYTYSIMASVNAGVDMV-------EFIDGLTYLVKNNVIPISRIDDAVKRILR 374
            IT+PPH+NYTYSI+AS+NAG+DM+       EFIDGLTYLVKN  IP+SRIDDAVKRILR
Sbjct: 911  ITSPPHANYTYSILASINAGLDMIMVPNNYKEFIDGLTYLVKNKFIPMSRIDDAVKRILR 970

Query: 375  VKFVMGLFENPLADLSLINELGKQEHRELAREAVRKSLVLLKNGKLPNKPLLPLPKKAPK 434
            VKFVMGLFE+PLAD SL+++LG QEHRELAREAVRKSLVLLKNG   + PLLPLPKKAPK
Sbjct: 971  VKFVMGLFEDPLADDSLVDQLGSQEHRELAREAVRKSLVLLKNGDSADAPLLPLPKKAPK 1030

Query: 435  ILVAGSHADNLGNQCGGWTMEWQGLSGNNLTTGTTVLAAIKDTIDPETEIIFNENPNVEF 494
            ILVAGSHA+NLG QCGGWT+EWQG  GNN+T GTT+L AIK T+DP+T++++ E P+ EF
Sbjct: 1031 ILVAGSHANNLGYQCGGWTIEWQGQGGNNITDGTTILTAIKKTVDPKTKVVYKEKPDAEF 1090

Query: 495  LKSHNFSYAIVVVGEYPYAETNGDSLNLTIPHPGPHTITNVCGAVKCVVIIISGRPVVIQ 554
            +KS++FSYAIVVVGE+PYAETNGDSLNLTIP PGP TI NVCGAVKCVV++ISGRPVVIQ
Sbjct: 1091 VKSNDFSYAIVVVGEHPYAETNGDSLNLTIPEPGPSTIGNVCGAVKCVVVVISGRPVVIQ 1150

Query: 555  PYIASMDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTIDQLPMNFGDPHYDPLF 614
            PY+  +DA+VAAWLPG+EG+G+ DVLFGDYGFTGKLS TWFKT+DQLPM+ GD HYDPLF
Sbjct: 1151 PYVRYIDAIVAAWLPGSEGQGVADVLFGDYGFTGKLSFTWFKTVDQLPMHVGDSHYDPLF 1210

Query: 615  PFGYGITTELVKA 621
            PFG+G+TT+  KA
Sbjct: 1211 PFGFGLTTKPTKA 1220
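
The percentages on each HSP statistics line are the match counts divided by the alignment length. The Python sketch below shows one way to recompute them from the line for the Theobroma cacao hit above. The pattern `HSP_STATS` and the function `check_hsp_stats` are hypothetical names for illustration, and the regular expression assumes the line layout seen on this page (it accepts either spelling of "Positives").

```python
import re

# Matches e.g. "Identity = 487/613 (79.45%), Positives = 551/613 (89.89%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_stats(line: str) -> dict:
    """Recompute identity and positives percentages from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positives_pct": round(100 * int(pos) / int(pos_length), 2),
        "reported": (float(ident_pct), float(pos_pct)),
    }


line = "Identity = 487/613 (79.45%), Positives = 551/613 (89.89%), Query Frame = 1"
stats = check_hsp_stats(line)
print(stats)
```

If the recomputed values disagree with the reported ones, the line was most likely damaged during copying.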

BLAST of Lsi01G018950 vs. TAIR10
Match: AT5G20950.1 (AT5G20950.1 Glycosyl hydrolase family protein)

HSP 1 Score: 921.0 bits (2379), Expect = 3.9e-268
Identity = 441/616 (71.59%), Positives = 521/616 (84.58%), Query Frame = 1

Query: 11  LLLLCFSETLAKAEQLKYKDPKQPLNIRIKDLLGRMTLEEKIGQMVQIERVNASSKVMEK 70
           L+LLC     A+   LKYKDPKQPL  RI+DL+ RMTL+EKIGQMVQIER  A+ +VM+K
Sbjct: 10  LMLLCCIVAAAEGT-LKYKDPKQPLGARIRDLMNRMTLQEKIGQMVQIERSVATPEVMKK 69

Query: 71  YFIGSVLSGGGSAPSKRASAKAWVHMVNKIQKGALSTRLGIPMIYGIDAVHGHNNVYNAT 130
           YFIGSVLSGGGS PS++A+ + WV+MVN+IQK +LSTRLGIPMIYGIDAVHGHNNVY AT
Sbjct: 70  YFIGSVLSGGGSVPSEKATPETWVNMVNEIQKASLSTRLGIPMIYGIDAVHGHNNVYGAT 129

Query: 131 IFPHNIGLGATRDPQLVKKIGIATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDP 190
           IFPHN+GLG TRDP LVK+IG ATALEVRATGIPYAFAPC+AVCRDPRWGRCYESYSED 
Sbjct: 130 IFPHNVGLGVTRDPNLVKRIGAATALEVRATGIPYAFAPCIAVCRDPRWGRCYESYSEDY 189

Query: 191 KVVQAMTEIIPGLQGEIPPNSRKGVPYVAGKKNVAACAKHFVGDGGTTKGINENNTVIDR 250
           ++VQ MTEIIPGLQG++P   RKGVP+V GK  VAACAKHFVGDGGT +GI+ENNTVID 
Sbjct: 190 RIVQQMTEIIPGLQGDLP-TKRKGVPFVGGKTKVAACAKHFVGDGGTVRGIDENNTVIDS 249

Query: 251 HRLLSIHMPGYYNSIIKGVATIMVSYSSLNGEKMHANKNLVIDFLKNTLHFRGFVISDWQ 310
             L  IHMPGYYN++ KGVATIMVSYS+ NG +MHANK LV  FLKN L FRGFVISDWQ
Sbjct: 250 KGLFGIHMPGYYNAVNKGVATIMVSYSAWNGLRMHANKELVTGFLKNKLKFRGFVISDWQ 309

Query: 311 GIDKITTPPHSNYTYSIMASVNAGVDMV-------EFIDGLTYLVKNNVIPISRIDDAVK 370
           GID+ITTPPH NY+YS+ A ++AG+DM+       EFID ++  ++  +IPISRIDDA+K
Sbjct: 310 GIDRITTPPHLNYSYSVYAGISAGIDMIMVPYNYTEFIDEISSQIQKKLIPISRIDDALK 369

Query: 371 RILRVKFVMGLFENPLADLSLINELGKQEHRELAREAVRKSLVLLKNGKLPNKPLLPLPK 430
           RILRVKF MGLFE PLADLS  N+LG +EHRELAREAVRKSLVLLKNGK   KPLLPLPK
Sbjct: 370 RILRVKFTMGLFEEPLADLSFANQLGSKEHRELAREAVRKSLVLLKNGKTGAKPLLPLPK 429

Query: 431 KAPKILVAGSHADNLGNQCGGWTMEWQGLSGNNLTTGTTVLAAIKDTIDPETEIIFNENP 490
           K+ KILVAG+HADNLG QCGGWT+ WQGL+GN+ T GTT+LAA+K+T+ P T++++++NP
Sbjct: 430 KSGKILVAGAHADNLGYQCGGWTITWQGLNGNDHTVGTTILAAVKNTVAPTTQVVYSQNP 489

Query: 491 NVEFLKSHNFSYAIVVVGEYPYAETNGDSLNLTIPHPGPHTITNVCGAVKCVVIIISGRP 550
           +  F+KS  F YAIVVVGE PYAE  GD+ NLTI  PGP  I NVCG+VKCVV+++SGRP
Sbjct: 490 DANFVKSGKFDYAIVVVGEPPYAEMFGDTTNLTISDPGPSIIGNVCGSVKCVVVVVSGRP 549

Query: 551 VVIQPYIASMDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTIDQLPMNFGDPHY 610
           VVIQPY++++DALVAAWLPGTEG+G+ D LFGDYGFTGKL++TWFK++ QLPMN GD HY
Sbjct: 550 VVIQPYVSTIDALVAAWLPGTEGQGVADALFGDYGFTGKLARTWFKSVKQLPMNVGDRHY 609

Query: 611 DPLFPFGYGITTELVK 620
           DPL+PFG+G+TT+  K
Sbjct: 610 DPLYPFGFGLTTKPYK 623

BLAST of Lsi01G018950 vs. TAIR10
Match: AT5G20940.1 (AT5G20940.1 Glycosyl hydrolase family protein)

HSP 1 Score: 889.8 bits (2298), Expect = 9.6e-259
Identity = 431/621 (69.40%), Positives = 516/621 (83.09%), Query Frame = 1

Query: 5   LITLVGLLLLCFSETLAKAE--QLKYKDPKQPLNIRIKDLLGRMTLEEKIGQMVQIERVN 64
           L+  +GLLLLC +    K      KYKDPK+PL +RIK+L+  MTLEEKIGQMVQ+ERVN
Sbjct: 7   LLQTLGLLLLCCTVAANKVPLANAKYKDPKEPLGVRIKNLMSHMTLEEKIGQMVQVERVN 66

Query: 65  ASSKVMEKYFIGSVLSGGGSAPSKRASAKAWVHMVNKIQKGALSTRLGIPMIYGIDAVHG 124
           A+++VM+KYF+GSV SGGGS P      +AWV+MVN++QK ALSTRLGIP+IYGIDAVHG
Sbjct: 67  ATTEVMQKYFVGSVFSGGGSVPKPYIGPEAWVNMVNEVQKKALSTRLGIPIIYGIDAVHG 126

Query: 125 HNNVYNATIFPHNIGLGATRDPQLVKKIGIATALEVRATGIPYAFAPCVAVCRDPRWGRC 184
           HN VYNATIFPHN+GLG TRDP LVK+IG ATALEVRATGI Y FAPC+AVCRDPRWGRC
Sbjct: 127 HNTVYNATIFPHNVGLGVTRDPGLVKRIGEATALEVRATGIQYVFAPCIAVCRDPRWGRC 186

Query: 185 YESYSEDPKVVQAMTEIIPGLQGEIPPNSRKGVPYVAGKKNVAACAKHFVGDGGTTKGIN 244
           YESYSED K+VQ MTEIIPGLQG++P   +KGVP+VAGK  VAACAKHFVGDGGT +G+N
Sbjct: 187 YESYSEDHKIVQQMTEIIPGLQGDLP-TGQKGVPFVAGKTKVAACAKHFVGDGGTLRGMN 246

Query: 245 ENNTVIDRHRLLSIHMPGYYNSIIKGVATIMVSYSSLNGEKMHANKNLVIDFLKNTLHFR 304
            NNTVI+ + LL IHMP Y++++ KGVAT+MVSYSS+NG KMHANK L+  FLKN L FR
Sbjct: 247 ANNTVINSNGLLGIHMPAYHDAVNKGVATVMVSYSSINGLKMHANKKLITGFLKNKLKFR 306

Query: 305 GFVISDWQGIDKITTPPHSNYTYSIMASVNAGVDM-------VEFIDGLTYLVKNNVIPI 364
           G VISD+ G+D+I TP  +NY++S+ A+  AG+DM        + ID LT  VK   IP+
Sbjct: 307 GIVISDYLGVDQINTPLGANYSHSVYAATTAGLDMFMGSSNLTKLIDELTSQVKRKFIPM 366

Query: 365 SRIDDAVKRILRVKFVMGLFENPLADLSLINELGKQEHRELAREAVRKSLVLLKNGKLPN 424
           SRIDDAVKRILRVKF MGLFENP+AD SL  +LG +EHRELAREAVRKSLVLLKNG+  +
Sbjct: 367 SRIDDAVKRILRVKFTMGLFENPIADHSLAKKLGSKEHRELAREAVRKSLVLLKNGENAD 426

Query: 425 KPLLPLPKKAPKILVAGSHADNLGNQCGGWTMEWQGLSGNNLTTGTTVLAAIKDTIDPET 484
           KPLLPLPKKA KILVAG+HADNLG QCGGWT+ WQGL+GNNLT GTT+LAA+K T+DP+T
Sbjct: 427 KPLLPLPKKANKILVAGTHADNLGYQCGGWTITWQGLNGNNLTIGTTILAAVKKTVDPKT 486

Query: 485 EIIFNENPNVEFLKSHNFSYAIVVVGEYPYAETNGDSLNLTIPHPGPHTITNVCGAVKCV 544
           ++I+N+NP+  F+K+ +F YAIV VGE PYAE  GDS NLTI  PGP TI NVC +VKCV
Sbjct: 487 QVIYNQNPDTNFVKAGDFDYAIVAVGEKPYAEGFGDSTNLTISEPGPSTIGNVCASVKCV 546

Query: 545 VIIISGRPVVIQPYIASMDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTIDQLP 604
           V+++SGRPVV+Q  I+++DALVAAWLPGTEG+G+ DVLFGDYGFTGKL++TWFKT+DQLP
Sbjct: 547 VVVVSGRPVVMQ--ISNIDALVAAWLPGTEGQGVADVLFGDYGFTGKLARTWFKTVDQLP 606

Query: 605 MNFGDPHYDPLFPFGYGITTE 617
           MN GDPHYDPL+PFG+G+ T+
Sbjct: 607 MNVGDPHYDPLYPFGFGLITK 624

BLAST of Lsi01G018950 vs. TAIR10
Match: AT5G04885.1 (AT5G04885.1 Glycosyl hydrolase family protein)

HSP 1 Score: 877.9 bits (2267), Expect = 3.8e-255
Identity = 424/631 (67.19%), Positives = 511/631 (80.98%), Query Frame = 1

Query: 1   MARVLITLVGLLL------LCFSETLAKAEQLKYKDPKQPLNIRIKDLLGRMTLEEKIGQ 60
           M+R  + +VG+LL       C+ +     E L YKDPKQ ++ R+ DL GRMTLEEKIGQ
Sbjct: 1   MSRDSVRIVGVLLWMCMWVCCYGD----GEYLLYKDPKQTVSDRVADLFGRMTLEEKIGQ 60

Query: 61  MVQIERVNASSKVMEKYFIGSVLSGGGSAPSKRASAKAWVHMVNKIQKGALSTRLGIPMI 120
           MVQI+R  A+  +M  YFIGSVLSGGGSAP   ASA+ WV M+N+ QKGAL +RLGIPMI
Sbjct: 61  MVQIDRSVATVNIMRDYFIGSVLSGGGSAPLPEASAQNWVDMINEYQKGALVSRLGIPMI 120

Query: 121 YGIDAVHGHNNVYNATIFPHNIGLGATRDPQLVKKIGIATALEVRATGIPYAFAPCVAVC 180
           YGIDAVHGHNNVYNATIFPHN+GLGATRDP LVK+IG ATA+EVRATGIPY FAPC+AVC
Sbjct: 121 YGIDAVHGHNNVYNATIFPHNVGLGATRDPDLVKRIGAATAVEVRATGIPYTFAPCIAVC 180

Query: 181 RDPRWGRCYESYSEDPKVVQAMTEIIPGLQGEIPPNSRKGVPYVAGKKNVAACAKHFVGD 240
           RDPRWGRCYESYSED KVV+ MT++I GLQGE P N + GVP+V G+  VAACAKH+VGD
Sbjct: 181 RDPRWGRCYESYSEDHKVVEDMTDVILGLQGEPPSNYKHGVPFVGGRDKVAACAKHYVGD 240

Query: 241 GGTTKGINENNTVIDRHRLLSIHMPGYYNSIIKGVATIMVSYSSLNGEKMHANKNLVIDF 300
           GGTT+G+NENNTV D H LLS+HMP Y +++ KGV+T+MVSYSS NGEKMHAN  L+  +
Sbjct: 241 GGTTRGVNENNTVTDLHGLLSVHMPAYADAVYKGVSTVMVSYSSWNGEKMHANTELITGY 300

Query: 301 LKNTLHFRGFVISDWQGIDKITTPPHSNYTYSIMASVNAGVDMV-------EFIDGLTYL 360
           LK TL F+GFVISDWQG+DKI+TPPH++YT S+ A++ AG+DMV       EF++ LT L
Sbjct: 301 LKGTLKFKGFVISDWQGVDKISTPPHTHYTASVRAAIQAGIDMVMVPFNFTEFVNDLTTL 360

Query: 361 VKNNVIPISRIDDAVKRILRVKFVMGLFENPLADLSLINELGKQEHRELAREAVRKSLVL 420
           VKNN IP++RIDDAV+RIL VKF MGLFENPLAD S  +ELG Q HR+LAREAVRKSLVL
Sbjct: 361 VKNNSIPVTRIDDAVRRILLVKFTMGLFENPLADYSFSSELGSQAHRDLAREAVRKSLVL 420

Query: 421 LKNGKLPNKPLLPLPKKAPKILVAGSHADNLGNQCGGWTMEWQGLSGNNLTTGTTVLAAI 480
           LKNG   N P+LPLP+K  KILVAG+HADNLG QCGGWT+ WQG SGN  T GTT+L+A+
Sbjct: 421 LKNGNKTN-PMLPLPRKTSKILVAGTHADNLGYQCGGWTITWQGFSGNKNTRGTTLLSAV 480

Query: 481 KDTIDPETEIIFNENPNVEFLKSHNFSYAIVVVGEYPYAETNGDSLNLTIPHPGPHTITN 540
           K  +D  TE++F ENP+ EF+KS+NF+YAI+ VGE PYAET GDS  LT+  PGP  I++
Sbjct: 481 KSAVDQSTEVVFRENPDAEFIKSNNFAYAIIAVGEPPYAETAGDSDKLTMLDPGPAIISS 540

Query: 541 VCGAVKCVVIIISGRPVVIQPYIASMDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTW 600
            C AVKCVV++ISGRP+V++PY+AS+DALVAAWLPGTEG+GITD LFGD+GF+GKL  TW
Sbjct: 541 TCQAVKCVVVVISGRPLVMEPYVASIDALVAAWLPGTEGQGITDALFGDHGFSGKLPVTW 600

Query: 601 FKTIDQLPMNFGDPHYDPLFPFGYGITTELV 619
           F+  +QLPM++GD HYDPLF +G G+ TE V
Sbjct: 601 FRNTEQLPMSYGDTHYDPLFAYGSGLETESV 626

BLAST of Lsi01G018950 vs. TAIR10
Match: AT3G62710.1 (AT3G62710.1 Glycosyl hydrolase family protein)

HSP 1 Score: 697.6 bits (1799), Expect = 7.0e-201
Identity = 370/636 (58.18%), Positives = 450/636 (70.75%), Query Frame = 1

Query: 19  TLAKAEQLKYKDPKQPLNIRIKDLLGRMTLEEKIGQMVQIERVNASS----------KVM 78
           T A    +KYKDPK  +  R++DLL RMTL EK+GQM QI+R N S           ++ 
Sbjct: 29  TAADRGYIKYKDPKVAVEERVEDLLIRMTLPEKLGQMCQIDRFNFSQVTGGVATVVPEIF 88

Query: 79  EKYFIGSVLSGG---GSAPSKRASAKAWVHMVNKIQKGALSTRLGIPMIYGIDAVHGHNN 138
            KY IGSVLS     G   +KR      +   N ++K +LSTRLGIP++Y +DAVHGHN 
Sbjct: 89  TKYMIGSVLSNPYDTGKDIAKR------IFQTNAMKKLSLSTRLGIPLLYAVDAVHGHNT 148

Query: 139 VYNATIFPHNIGLGATRDPQLVKKIGIATALEVRATGIPYAFAPCVAVCRDPRWGRCYES 198
             +ATIFPHN+GLGATRDPQLVKKIG  TA EVRATG+  AFAPCVAVCRDPRWGRCYES
Sbjct: 149 FIDATIFPHNVGLGATRDPQLVKKIGAITAQEVRATGVAQAFAPCVAVCRDPRWGRCYES 208

Query: 199 YSEDPKVVQAMTE-IIPGLQGEIPPNSRKGVPYVAGKK-NVAACAKHFVGDGGTTKGINE 258
           YSEDP VV  MTE II GLQG          PY+A  K NVA CAKHFVGDGGT  GINE
Sbjct: 209 YSEDPAVVNMMTESIIDGLQG--------NAPYLADPKINVAGCAKHFVGDGGTINGINE 268

Query: 259 NNTVIDRHRLLSIHMPGYYNSIIKGVATIMVSYSSLNGEKMHANKNLVIDFLKNTLHFRG 318
           NNTV D   L  IHMP +  ++ KG+A+IM SYSSLNG KMHAN+ ++ D+LKNTL F+G
Sbjct: 269 NNTVADNATLFGIHMPPFEIAVKKGIASIMASYSSLNGVKMHANRAMITDYLKNTLKFQG 328

Query: 319 FVISDWQGIDKITTPPHSNYTYSIMASVNAGVDMV-------EFIDGLTYLVKNNVIPIS 378
           FVISDW GIDKIT    SNYTYSI AS+NAG+DMV       E+++ LT LV    IP+S
Sbjct: 329 FVISDWLGIDKITPIEKSNYTYSIEASINAGIDMVMVPWAYPEYLEKLTNLVNGGYIPMS 388

Query: 379 RIDDAVKRILRVKFVMGLFENPLADLSL-INELGKQEHRELAREAVRKSLVLLKNGKLPN 438
           RIDDAV+RILRVKF +GLFEN LAD  L   E G + HRE+ REAVRKS+VLLKNGK   
Sbjct: 389 RIDDAVRRILRVKFSIGLFENSLADEKLPTTEFGSEAHREVGREAVRKSMVLLKNGKTDA 448

Query: 439 KPLLPLPKKAPKILVAGSHADNLGNQCGGWTMEWQGLSG----------NNLTT----GT 498
             ++PLPKK  KI+VAG HA+++G QCGG+++ WQG +G          + L T    GT
Sbjct: 449 DKIVPLPKKVKKIVVAGRHANDMGWQCGGFSLTWQGFNGTGEDMPTNTKHGLPTGKIKGT 508

Query: 499 TVLAAIKDTIDPETEIIFNENPNVEFLKSH-NFSYAIVVVGEYPYAETNGDSLNLTIPHP 558
           T+L AI+  +DP TE+++ E PN +  K H + +Y IVVVGE PYAET GDS  L I  P
Sbjct: 509 TILEAIQKAVDPTTEVVYVEEPNQDTAKLHADAAYTIVVVGETPYAETFGDSPTLGITKP 568

Query: 559 GPHTITNVCGA-VKCVVIIISGRPVVIQPYIASMDALVAAWLPGTEGKGITDVLFGDYGF 616
           GP T+++ CG+ +KC+VI+++GRP+VI+PYI  +DAL  AWLPGTEG+G+ DVLFGD+ F
Sbjct: 569 GPDTLSHTCGSGMKCLVILVTGRPLVIEPYIDMLDALAVAWLPGTEGQGVADVLFGDHPF 628

BLAST of Lsi01G018950 vs. TAIR10
Match: AT3G47000.1 (AT3G47000.1 Glycosyl hydrolase family protein)

HSP 1 Score: 689.5 bits (1778), Expect = 1.9e-198
Identity = 341/600 (56.83%), Positives = 438/600 (73.00%), Query Frame = 1

Query: 28  YKDPKQPLNIRIKDLLGRMTLEEKIGQMVQIERVNASSKVMEKYFIGSVLSGGGSAPSKR 87
           YK+   P+  R+KDLL RMTL EKIGQM QIER  AS      +FIGSVL+ GGS P + 
Sbjct: 10  YKNGDAPVEARVKDLLSRMTLPEKIGQMTQIERRVASPSAFTDFFIGSVLNAGGSVPFED 69

Query: 88  ASAKAWVHMVNKIQKGALSTRLGIPMIYGIDAVHGHNNVYNATIFPHNIGLGATRDPQLV 147
           A +  W  M++  Q+ AL++RLGIP+IYG DAVHG+NNVY AT+FPHNIGLGATRD  LV
Sbjct: 70  AKSSDWADMIDGFQRSALASRLGIPIIYGTDAVHGNNNVYGATVFPHNIGLGATRDADLV 129

Query: 148 KKIGIATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKVVQAMTEIIPGLQGEI 207
           ++IG ATALEVRA+G+ +AF+PCVAV RDPRWGRCYESY EDP++V  MT ++ GLQG  
Sbjct: 130 RRIGAATALEVRASGVHWAFSPCVAVLRDPRWGRCYESYGEDPELVCEMTSLVSGLQGVP 189

Query: 208 PPNSRKGVPYVAGKKNVAACAKHFVGDGGTTKGINENNTVIDRHRLLSIHMPGYYNSIIK 267
           P     G P+VAG+ NV AC KHFVGDGGT KGINE NT+     L  IH+P Y   + +
Sbjct: 190 PEEHPNGYPFVAGRNNVVACVKHFVGDGGTDKGINEGNTIASYEELEKIHIPPYLKCLAQ 249

Query: 268 GVATIMVSYSSLNGEKMHANKNLVIDFLKNTLHFRGFVISDWQGIDKITTPPHSNYTYSI 327
           GV+T+M SYSS NG ++HA++ L+ + LK  L F+GF++SDW+G+D+++ P  SNY Y I
Sbjct: 250 GVSTVMASYSSWNGTRLHADRFLLTEILKEKLGFKGFLVSDWEGLDRLSEPQGSNYRYCI 309

Query: 328 MASVNAGVDMV-------EFIDGLTYLVKNNVIPISRIDDAVKRILRVKFVMGLFENPLA 387
             +VNAG+DMV       +FI  +T LV++  IP++RI+DAV+RILRVKFV GLF +PL 
Sbjct: 310 KTAVNAGIDMVMVPFKYEQFIQDMTDLVESGEIPMARINDAVERILRVKFVAGLFGHPLT 369

Query: 388 DLSLINELGKQEHRELAREAVRKSLVLLKNGKLPNKPLLPLPKKAPKILVAGSHADNLGN 447
           D SL+  +G +EHRELA+EAVRKSLVLLK+GK  +KP LPL + A +ILV G+HAD+LG 
Sbjct: 370 DRSLLPTVGCKEHRELAQEAVRKSLVLLKSGKNADKPFLPLDRNAKRILVTGTHADDLGY 429

Query: 448 QCGGWTMEWQGLSGNNLTTGTTVLAAIKDTIDPETEIIFNENPNVEFL-KSHNFSYAIVV 507
           QCGGWT  W GLSG  +T GTT+L AIK+ +  ETE+I+ + P+ E L  S  FSYAIV 
Sbjct: 430 QCGGWTKTWFGLSG-RITIGTTLLDAIKEAVGDETEVIYEKTPSKETLASSEGFSYAIVA 489

Query: 508 VGEYPYAETNGDSLNLTIPHPGPHTITNVCGAVKCVVIIISGRPVVIQP-YIASMDALVA 567
           VGE PYAET GD+  L IP  G   +T V   +  +VI+ISGRPVV++P  +   +ALVA
Sbjct: 490 VGEPPYAETMGDNSELRIPFNGTDIVTAVAEIIPTLVILISGRPVVLEPTVLEKTEALVA 549

Query: 568 AWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTIDQLPMNFGDPHYDPLFPFGYGITTELV 619
           AWLPGTEG+G+ DV+FGDY F GKL  +WFK ++ LP++     YDPLFPFG+G+ ++ V
Sbjct: 550 AWLPGTEGQGVADVVFGDYDFKGKLPVSWFKHVEHLPLDAHANSYDPLFPFGFGLNSKPV 608

BLAST of Lsi01G018950 vs. NCBI nr
Match: gi|778685993|ref|XP_011652313.1| (PREDICTED: lysosomal beta glucosidase-like [Cucumis sativus])

HSP 1 Score: 1127.9 bits (2916), Expect = 0.0e+00
Identity = 545/628 (86.78%), Positives = 591/628 (94.11%), Query Frame = 1

Query: 1   MAR-VLITLVGLLLLCFSETLAKAEQLKYKDPKQPLNIRIKDLLGRMTLEEKIGQMVQIE 60
           MAR VLIT VGLL+LCFSETLAKAE LKYKDPKQPLN+RIKDLLGRMTLEEKIGQMVQIE
Sbjct: 2   MARSVLITFVGLLVLCFSETLAKAEYLKYKDPKQPLNVRIKDLLGRMTLEEKIGQMVQIE 61

Query: 61  RVNASSKVMEKYFIGSVLSGGGSAPSKRASAKAWVHMVNKIQKGALSTRLGIPMIYGIDA 120
           R NAS+ VM++YFIGSVLSGGGSAPSK+ASAK WVHMVNKIQ+ ALSTRLGIPMIYGIDA
Sbjct: 62  RANASADVMKQYFIGSVLSGGGSAPSKQASAKDWVHMVNKIQEAALSTRLGIPMIYGIDA 121

Query: 121 VHGHNNVYNATIFPHNIGLGATRDPQLVKKIGIATALEVRATGIPYAFAPCVAVCRDPRW 180
           VHGHNNVYNATIFPHNIGLGATRDPQL+K+IG ATALEVRATGIPYAFAPC+AVCRDPRW
Sbjct: 122 VHGHNNVYNATIFPHNIGLGATRDPQLLKRIGAATALEVRATGIPYAFAPCIAVCRDPRW 181

Query: 181 GRCYESYSEDPKVVQAMTEIIPGLQGEIPPNSRKGVPYVAGKKNVAACAKHFVGDGGTTK 240
           GRCYESY ED  +VQAMTEIIPGLQG++P N RKGVPYVAGK NVAACAKHFVGDGGTTK
Sbjct: 182 GRCYESYGEDHTIVQAMTEIIPGLQGDVPANIRKGVPYVAGKNNVAACAKHFVGDGGTTK 241

Query: 241 GINENNTVIDRHRLLSIHMPGYYNSIIKGVATIMVSYSSLNGEKMHANKNLVIDFLKNTL 300
           GINENNTV+D H L SIHMP YYNSIIKGVAT+MVSYSS+NGEKMHANK LV DFLKNTL
Sbjct: 242 GINENNTVVDGHGLFSIHMPAYYNSIIKGVATVMVSYSSINGEKMHANKKLVTDFLKNTL 301

Query: 301 HFRGFVISDWQGIDKITTPPHSNYTYSIMASVNAGVDMV-------EFIDGLTYLVKNNV 360
           HF+GFVISDWQGIDKITTPPH+NYTYSI+ASVNAGVDM+       EFIDGLTYLVKNN 
Sbjct: 302 HFKGFVISDWQGIDKITTPPHANYTYSILASVNAGVDMIMVPYNYTEFIDGLTYLVKNNA 361

Query: 361 IPISRIDDAVKRILRVKFVMGLFENPLADLSLINELGKQEHRELAREAVRKSLVLLKNGK 420
           IPISRIDDAVKRILRVKFVMGLFENPLADLSLINELGKQEHRELAREAVRKSLVLLKNGK
Sbjct: 362 IPISRIDDAVKRILRVKFVMGLFENPLADLSLINELGKQEHRELAREAVRKSLVLLKNGK 421

Query: 421 LPNKPLLPLPKKAPKILVAGSHADNLGNQCGGWTMEWQGLSGNNLTTGTTVLAAIKDTID 480
           LPN+PLLPLPKKAPKILVAG+HA++LGNQCGGWTMEWQGL+GNNLT+GTT+L AIKDT+D
Sbjct: 422 LPNQPLLPLPKKAPKILVAGTHANDLGNQCGGWTMEWQGLTGNNLTSGTTILTAIKDTVD 481

Query: 481 PETEIIFNENPNVEFLKSHNFSYAIVVVGEYPYAETNGDSLNLTIPHPGPHTITNVCGAV 540
           PETE++F++NPN EFL++H FSYAIVVVGE+PYAETNGDSLNLTIP PGP TI NVCGAV
Sbjct: 482 PETEVVFHDNPNAEFLQTHQFSYAIVVVGEHPYAETNGDSLNLTIPEPGPETIKNVCGAV 541

Query: 541 KCVVIIISGRPVVIQPYIASMDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTID 600
           KCVV++ISGRPVV+QPYI S+DA+VAAWLPGTEGKGI+DVLFGDYGFTGKLSQTWFK++D
Sbjct: 542 KCVVVVISGRPVVLQPYIDSIDAVVAAWLPGTEGKGISDVLFGDYGFTGKLSQTWFKSVD 601

Query: 601 QLPMNFGDPHYDPLFPFGYGITTELVKA 621
           QLPMNFGD HYDPLFPFG+G+TT+ VKA
Sbjct: 602 QLPMNFGDAHYDPLFPFGFGLTTQPVKA 629

BLAST of Lsi01G018950 vs. NCBI nr
Match: gi|659130020|ref|XP_008464960.1| (PREDICTED: lysosomal beta glucosidase-like [Cucumis melo])

HSP 1 Score: 1126.3 bits (2912), Expect = 0.0e+00
Identity = 545/628 (86.78%), Positives = 586/628 (93.31%), Query Frame = 1

Query: 1   MAR-VLITLVGLLLLCFSETLAKAEQLKYKDPKQPLNIRIKDLLGRMTLEEKIGQMVQIE 60
           MAR VLIT VGLL+LCFSETLAKAE LKYKDPKQPLN+RIKDL GRMTLEEKIGQMVQIE
Sbjct: 1   MARSVLITFVGLLVLCFSETLAKAEYLKYKDPKQPLNVRIKDLFGRMTLEEKIGQMVQIE 60

Query: 61  RVNASSKVMEKYFIGSVLSGGGSAPSKRASAKAWVHMVNKIQKGALSTRLGIPMIYGIDA 120
           R NAS  VM KYFIGSVLSGGGS PSK ASAK WVHMVNKIQ+GALSTRLGIPMIYGIDA
Sbjct: 61  RANASMDVMRKYFIGSVLSGGGSVPSKNASAKTWVHMVNKIQEGALSTRLGIPMIYGIDA 120

Query: 121 VHGHNNVYNATIFPHNIGLGATRDPQLVKKIGIATALEVRATGIPYAFAPCVAVCRDPRW 180
           +HGHNNVYNATIFPHNIGLGATRDPQL+K+IG+ATALEVRATGIPYAFAPC+AVCRDPRW
Sbjct: 121 IHGHNNVYNATIFPHNIGLGATRDPQLIKRIGVATALEVRATGIPYAFAPCIAVCRDPRW 180

Query: 181 GRCYESYSEDPKVVQAMTEIIPGLQGEIPPNSRKGVPYVAGKKNVAACAKHFVGDGGTTK 240
           GRCYESY ED K+VQAMTEIIPGLQG++P N RKGVPYVAGK NVAACAKHFVGDGGTTK
Sbjct: 181 GRCYESYGEDHKIVQAMTEIIPGLQGDLPSNIRKGVPYVAGKNNVAACAKHFVGDGGTTK 240

Query: 241 GINENNTVIDRHRLLSIHMPGYYNSIIKGVATIMVSYSSLNGEKMHANKNLVIDFLKNTL 300
           GINENNTVID H L SIHMP YYNSIIKGVATIMVSYSS+NGEKMHANK LV DFLKNTL
Sbjct: 241 GINENNTVIDGHGLFSIHMPAYYNSIIKGVATIMVSYSSVNGEKMHANKKLVTDFLKNTL 300

Query: 301 HFRGFVISDWQGIDKITTPPHSNYTYSIMASVNAGVDMV-------EFIDGLTYLVKNNV 360
           HF+GFVISDWQGIDKIT+PPH+NYTYSI+ASVNAGVDM+       EFID LTYLVKNN 
Sbjct: 301 HFKGFVISDWQGIDKITSPPHANYTYSILASVNAGVDMIMVPYNYTEFIDALTYLVKNNA 360

Query: 361 IPISRIDDAVKRILRVKFVMGLFENPLADLSLINELGKQEHRELAREAVRKSLVLLKNGK 420
           IPISRIDDAVKRILRVKFVMGLFENPLADLSL+NE+GKQEHRELAREAVRKSLVLLKNGK
Sbjct: 361 IPISRIDDAVKRILRVKFVMGLFENPLADLSLVNEIGKQEHRELAREAVRKSLVLLKNGK 420

Query: 421 LPNKPLLPLPKKAPKILVAGSHADNLGNQCGGWTMEWQGLSGNNLTTGTTVLAAIKDTID 480
           LPN+PLLPLPKKAPKILVAG+HA++LGNQCGGWT+EWQGL+GNNLT+GTTVL AIKDT+D
Sbjct: 421 LPNQPLLPLPKKAPKILVAGTHANDLGNQCGGWTIEWQGLTGNNLTSGTTVLTAIKDTVD 480

Query: 481 PETEIIFNENPNVEFLKSHNFSYAIVVVGEYPYAETNGDSLNLTIPHPGPHTITNVCGAV 540
           PETE++F+ NPN EFLK+H FSYAIVVVGE+PYAETNGDSLNLTIP PGP TI NVCGAV
Sbjct: 481 PETEVVFDNNPNAEFLKTHQFSYAIVVVGEHPYAETNGDSLNLTIPEPGPETIKNVCGAV 540

Query: 541 KCVVIIISGRPVVIQPYIASMDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTID 600
           KCVV++ISGRPVVIQPYI S+DALVAAWLPGTEGKGI+DVLFGDYGFTGKLSQTWFK++D
Sbjct: 541 KCVVVVISGRPVVIQPYIDSIDALVAAWLPGTEGKGISDVLFGDYGFTGKLSQTWFKSVD 600

Query: 601 QLPMNFGDPHYDPLFPFGYGITTELVKA 621
           QLPMNFGD HYDPLFP G+G+TT+ VKA
Sbjct: 601 QLPMNFGDAHYDPLFPLGFGLTTQPVKA 628

BLAST of Lsi01G018950 vs. NCBI nr
Match: gi|659086037|ref|XP_008443733.1| (PREDICTED: lysosomal beta glucosidase-like [Cucumis melo])

HSP 1 Score: 1080.5 bits (2793), Expect = 0.0e+00
Identity = 524/626 (83.71%), Positives = 571/626 (91.21%), Query Frame = 1

Query: 1   MARVLITLVGLLLLCFSETLAKAEQLKYKDPKQPLNIRIKDLLGRMTLEEKIGQMVQIER 60
           MA+ +  L+GLLLLCF ET AKAE LKYKDPKQPLN+RIKDLLGRMTLEEKIGQM QIER
Sbjct: 1   MAKAINILIGLLLLCFFETWAKAENLKYKDPKQPLNVRIKDLLGRMTLEEKIGQMTQIER 60

Query: 61  VNASSKVMEKYFIGSVLSGGGSAPSKRASAKAWVHMVNKIQKGALSTRLGIPMIYGIDAV 120
           VNAS+ VM+KYFIGSVLSGGGS PSK ASA+ WV MVN+IQ+GALSTRLGIPMIYGIDAV
Sbjct: 61  VNASTDVMKKYFIGSVLSGGGSVPSKEASAQDWVQMVNEIQQGALSTRLGIPMIYGIDAV 120

Query: 121 HGHNNVYNATIFPHNIGLGATRDPQLVKKIGIATALEVRATGIPYAFAPCVAVCRDPRWG 180
           HGHNNVYNATIFPHNIGLGATRDPQL+K+IG A+ALE+RATGIPYAFAPC+AVCRDPRWG
Sbjct: 121 HGHNNVYNATIFPHNIGLGATRDPQLLKRIGEASALEIRATGIPYAFAPCIAVCRDPRWG 180

Query: 181 RCYESYSEDPKVVQAMTEIIPGLQGEIPPNSRKGVPYVAGKKNVAACAKHFVGDGGTTKG 240
           RCYESY EDPK+VQ MTEIIPGLQGEIPPNSRKGVPYVAGK+ V ACAKH+VGDGGTTKG
Sbjct: 181 RCYESYGEDPKLVQEMTEIIPGLQGEIPPNSRKGVPYVAGKEKVVACAKHYVGDGGTTKG 240

Query: 241 INENNTVIDRHRLLSIHMPGYYNSIIKGVATIMVSYSSLNGEKMHANKNLVIDFLKNTLH 300
           I+ENNTVIDRH LLSIHMPGYY+SIIKGVAT+MVSYSS NG KMHANK LV DFLKNTLH
Sbjct: 241 IDENNTVIDRHGLLSIHMPGYYHSIIKGVATVMVSYSSWNGVKMHANKELVTDFLKNTLH 300

Query: 301 FRGFVISDWQGIDKITTPPHSNYTYSIMASVNAGVDMV-------EFIDGLTYLVKNNVI 360
           F+GFVISDWQ ID+IT PPH+NYTYSI+ASV AG+DM+       EFIDGLTYLV NN I
Sbjct: 301 FQGFVISDWQAIDRITDPPHANYTYSILASVTAGLDMIMVPYNYTEFIDGLTYLVNNNFI 360

Query: 361 PISRIDDAVKRILRVKFVMGLFENPLADLSLINELGKQEHRELAREAVRKSLVLLKNGKL 420
           PI+RIDDAVKRILRVKF+MGLFENP+ADLSL+NELGKQEHRELAREAVRKSLVLLKNGK 
Sbjct: 361 PITRIDDAVKRILRVKFIMGLFENPIADLSLVNELGKQEHRELAREAVRKSLVLLKNGKS 420

Query: 421 PNKPLLPLPKKAPKILVAGSHADNLGNQCGGWTMEWQGLSGNNLTTGTTVLAAIKDTIDP 480
            +KPLLPL KK  KILVAGSHADNLG QCGGWT+EWQGLSGNNLT+GTTVL AIKDT+DP
Sbjct: 421 ADKPLLPLEKKTQKILVAGSHADNLGYQCGGWTIEWQGLSGNNLTSGTTVLDAIKDTVDP 480

Query: 481 ETEIIFNENPNVEFLKSHNFSYAIVVVGEYPYAETNGDSLNLTIPHPGPHTITNVCGAVK 540
            TE+IFNENP+  FL+S  FSYAIVVVGE+PYAE  GDSLNLTIP PGP TITNVCG +K
Sbjct: 481 STEVIFNENPDKGFLQSGTFSYAIVVVGEHPYAEMMGDSLNLTIPDPGPSTITNVCGVIK 540

Query: 541 CVVIIISGRPVVIQPYIASMDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTIDQ 600
           CVV+IISGRPVVIQPY+ S+DALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKT+DQ
Sbjct: 541 CVVVIISGRPVVIQPYVDSVDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTVDQ 600

Query: 601 LPMNFGDPHYDPLFPFGYGITTELVK 620
           LPMNFGD HYDPLFP G+G+TT+ +K
Sbjct: 601 LPMNFGDSHYDPLFPLGHGLTTQPIK 626

BLAST of Lsi01G018950 vs. NCBI nr
Match: gi|778665412|ref|XP_011648555.1| (PREDICTED: lysosomal beta glucosidase-like [Cucumis sativus])

HSP 1 Score: 1073.9 bits (2776), Expect = 9.7e-311
Identity = 521/627 (83.09%), Positives = 578/627 (92.19%), Query Frame = 1

Query: 1   MARVLITLVGLLLLCFSETLAKAEQLKYKDPKQPLNIRIKDLLGRMTLEEKIGQMVQIER 60
           MA+ +I L+ LLL+C  ET AKAE  KYKDP Q LN+RIKDLLGRMTLEEKIGQMVQIER
Sbjct: 2   MAKAII-LIALLLICCFETGAKAENFKYKDPTQRLNVRIKDLLGRMTLEEKIGQMVQIER 61

Query: 61  VNASSKVMEKYFIGSVLSGGGSAPSKRASAKAWVHMVNKIQKGALSTRLGIPMIYGIDAV 120
           VNAS++VM+KYFIGSVLSGGGS PSK+ASA+ W++MVN+IQKGALSTRLGIPMIYGIDAV
Sbjct: 62  VNASTEVMKKYFIGSVLSGGGSVPSKQASAQDWINMVNEIQKGALSTRLGIPMIYGIDAV 121

Query: 121 HGHNNVYNATIFPHNIGLGATRDPQLVKKIGIATALEVRATGIPYAFAPCVAVCRDPRWG 180
           HGHNNVYNATIFPHNIGLGATRDPQL+K+IG+A+A E+RATGIPYAFAPCVAVCRDPRWG
Sbjct: 122 HGHNNVYNATIFPHNIGLGATRDPQLLKRIGVASAREIRATGIPYAFAPCVAVCRDPRWG 181

Query: 181 RCYESYSEDPKVVQAMTEIIPGLQGEIPPNSRKGVPYVAGKKNVAACAKHFVGDGGTTKG 240
           RCYESY EDPK+VQ MTEIIPGLQGEIPPNSRKGVPYVAGK+NV ACAKH+VGDGGTTKG
Sbjct: 182 RCYESYGEDPKIVQEMTEIIPGLQGEIPPNSRKGVPYVAGKENVVACAKHYVGDGGTTKG 241

Query: 241 INENNTVIDRHRLLSIHMPGYYNSIIKGVATIMVSYSSLNGEKMHANKNLVIDFLKNTLH 300
           I+ENNTVIDRH LLSIHMPGYY+SIIKGVATIMVSYSS NGEKMHANKNLV DFLKNTLH
Sbjct: 242 IDENNTVIDRHGLLSIHMPGYYHSIIKGVATIMVSYSSWNGEKMHANKNLVTDFLKNTLH 301

Query: 301 FRGFVISDWQGIDKITTPPHSNYTYSIMASVNAGVDMV-------EFIDGLTYLVKNNVI 360
           F+GFVISDW+ ID+IT PPH+NYTYSI+AS+ AG+DM+       EFIDGLT LVK+N I
Sbjct: 302 FQGFVISDWEAIDRITDPPHANYTYSILASITAGLDMIMIPYNYPEFIDGLTNLVKSNYI 361

Query: 361 PISRIDDAVKRILRVKFVMGLFENPLADLSLINELGKQEHRELAREAVRKSLVLLKNGKL 420
           PISRIDDAVKRILRVKFVMGLFENP+ADLSL+NELGKQEHRELAREAVRKSLVLLKNGK 
Sbjct: 362 PISRIDDAVKRILRVKFVMGLFENPIADLSLVNELGKQEHRELAREAVRKSLVLLKNGKS 421

Query: 421 PNKPLLPLPKKAPKILVAGSHADNLGNQCGGWTMEWQGLSGNNLTTGTTVLAAIKDTIDP 480
            +KPLLPL KK  KILVAGSHA+NLG QCGGWT+EWQGLSGNNLT+GTTVL AIKDT+DP
Sbjct: 422 ADKPLLPLEKKTQKILVAGSHANNLGYQCGGWTIEWQGLSGNNLTSGTTVLDAIKDTVDP 481

Query: 481 ETEIIFNENPNVEFLKSHNFSYAIVVVGEYPYAETNGDSLNLTIPHPGPHTITNVCGAVK 540
            TE+IFNENP+ + L+S  FSYAIVVVGE+PYAE NGDSLNLTIP PGP+TITNVCG +K
Sbjct: 482 TTEVIFNENPDKKSLQSDTFSYAIVVVGEHPYAELNGDSLNLTIPDPGPNTITNVCGVIK 541

Query: 541 CVVIIISGRPVVIQPYIASMDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTIDQ 600
           C V+IISGRPVVIQPY+ S+DALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKT+DQ
Sbjct: 542 CAVVIISGRPVVIQPYVDSIDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTVDQ 601

Query: 601 LPMNFGDPHYDPLFPFGYGITTELVKA 621
           LPMNFG+P+YDPLFPFG+G+TT+ +K+
Sbjct: 602 LPMNFGNPNYDPLFPFGHGLTTQPIKS 627

BLAST of Lsi01G018950 vs. NCBI nr
Match: gi|449446738|ref|XP_004141128.1| (PREDICTED: lysosomal beta glucosidase-like [Cucumis sativus])

HSP 1 Score: 1043.1 bits (2696), Expect = 1.9e-301
Identity = 496/628 (78.98%), Positives = 562/628 (89.49%), Query Frame = 1

Query: 1   MARVLITLVGLLLLCFSETLAKAEQLKYKDPKQPLNIRIKDLLGRMTLEEKIGQMVQIER 60
           MA+ LI  +G  + C +E  AK + ++YKDPKQPLN+RI DLLGRMTLEEKIGQMVQI+R
Sbjct: 1   MAKNLIFFMGFFIFCLTEVWAKHQYMRYKDPKQPLNVRISDLLGRMTLEEKIGQMVQIDR 60

Query: 61  VNASSKVMEKYFIGSVLSGGGSAPSKRASAKAWVHMVNKIQKGALSTRLGIPMIYGIDAV 120
             AS KVM+KY IGSVLSGGGS PSK AS K W+ MVN+ QKG+LSTRLGIPMIYGIDAV
Sbjct: 61  TVASKKVMKKYLIGSVLSGGGSVPSKEASPKVWIDMVNEFQKGSLSTRLGIPMIYGIDAV 120

Query: 121 HGHNNVYNATIFPHNIGLGATRDPQLVKKIGIATALEVRATGIPYAFAPCVAVCRDPRWG 180
           HGHNNVY ATIFPHN+GLGATRDP L K+IG ATALEVRATGI Y FAPC+AVCRDPRWG
Sbjct: 121 HGHNNVYKATIFPHNVGLGATRDPNLAKRIGAATALEVRATGISYVFAPCIAVCRDPRWG 180

Query: 181 RCYESYSEDPKVVQAMTEIIPGLQGEIPPNSRKGVPYVAGKKNVAACAKHFVGDGGTTKG 240
           RC+ESYSEDPKVVQ MTEII GLQGEIP NSRKGVPYVAG++ VAACAKH+VGDGGTTKG
Sbjct: 181 RCFESYSEDPKVVQEMTEIISGLQGEIPSNSRKGVPYVAGREKVAACAKHYVGDGGTTKG 240

Query: 241 INENNTVIDRHRLLSIHMPGYYNSIIKGVATIMVSYSSLNGEKMHANKNLVIDFLKNTLH 300
           +NENNT+  RH LLSIHMPGYYNSIIKGV+T+M+SYSS NG+KMH N++L+  FLKNTL 
Sbjct: 241 MNENNTLASRHGLLSIHMPGYYNSIIKGVSTVMISYSSWNGKKMHENRDLITGFLKNTLR 300

Query: 301 FRGFVISDWQGIDKITTPPHSNYTYSIMASVNAGVDMV-------EFIDGLTYLVKNNVI 360
           FRGFVISDWQGID+IT+PPH+NYTYSI+A + AG+DM+       EFIDGLTYLVK NVI
Sbjct: 301 FRGFVISDWQGIDRITSPPHANYTYSIIAGITAGIDMIMVPFNYTEFIDGLTYLVKTNVI 360

Query: 361 PISRIDDAVKRILRVKFVMGLFENPLADLSLINELGKQEHRELAREAVRKSLVLLKNGKL 420
           PISRIDDAVKRILRVKFVMGLFENPLAD S +NELGK+EHRELAREAVRKSLVLLKNG+ 
Sbjct: 361 PISRIDDAVKRILRVKFVMGLFENPLADSSFVNELGKKEHRELAREAVRKSLVLLKNGES 420

Query: 421 PNKPLLPLPKKAPKILVAGSHADNLGNQCGGWTMEWQGLSGNNLTTGTTVLAAIKDTIDP 480
            +KP+LPLPKK PKILVAGSHA+NLG QCGGWT+EWQGL GNNLT+GTT+L+AIKDT+DP
Sbjct: 421 ADKPILPLPKKVPKILVAGSHANNLGFQCGGWTIEWQGLGGNNLTSGTTILSAIKDTVDP 480

Query: 481 ETEIIFNENPNVEFLKSHNFSYAIVVVGEYPYAETNGDSLNLTIPHPGPHTITNVCGAVK 540
           +T+++F ENP++EF+KS+ FSYAIVVVGEYPYAET GDSLNLTIP PGP TITNVCGAVK
Sbjct: 481 KTKVVFKENPDMEFVKSNKFSYAIVVVGEYPYAETFGDSLNLTIPEPGPSTITNVCGAVK 540

Query: 541 CVVIIISGRPVVIQPYIASMDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTIDQ 600
           CVVI+ISGRPVV+QPYI+S+DALVAAWLPGTEGKGI+DVLFGDYGF+GKLS+TWFKT+DQ
Sbjct: 541 CVVIVISGRPVVLQPYISSIDALVAAWLPGTEGKGISDVLFGDYGFSGKLSRTWFKTVDQ 600

Query: 601 LPMNFGDPHYDPLFPFGYGITTELVKAN 622
           LPMN GD HYDPLFPFG+G+TT  +KAN
Sbjct: 601 LPMNVGDAHYDPLFPFGFGLTTNPIKAN 628

The following BLAST results are available for this feature:
Match Name   E-value  Identity (%)  Description
BGH3B_BACO1  2.6e-72  29.98         Beta-glucosidase BoGH3B OS=Bacteroides ovatus (strain ATCC 8483 / DSM 1896 / JCM...
GLUA_DICDI   9.2e-70  30.54         Lysosomal beta glucosidase OS=Dictyostelium discoideum GN=gluA PE=1 SV=2
BGLX_SALTY   9.0e-57  29.07         Periplasmic beta-glucosidase OS=Salmonella typhimurium (strain LT2 / SGSC1412 / ...
BGLX_ECOLI   4.9e-55  27.48         Periplasmic beta-glucosidase OS=Escherichia coli (strain K12) GN=bglX PE=3 SV=2
BGLC_ASPOR   5.1e-36  29.39         Probable beta-glucosidase C OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40) G...

Match Name        E-value   Identity (%)  Description
A0A0A0LI54_CUCSA  0.0e+00   86.78         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G842090 PE=4 SV=1
A0A0A0LY55_CUCSA  6.7e-311  83.09         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G661750 PE=4 SV=1
A0A0A0LFL8_CUCSA  1.3e-301  78.98         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G842070 PE=4 SV=1
M5VW21_PRUPE      7.9e-294  78.46         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002894mg PE=4 SV=1
A0A061F0I5_THECC  1.0e-293  79.45         Glycosyl hydrolase family protein OS=Theobroma cacao GN=TCM_025896 PE=4 SV=1

Match Name   E-value   Identity (%)  Description
AT5G20950.1  3.9e-268  71.59         Glycosyl hydrolase family protein
AT5G20940.1  9.6e-259  69.40         Glycosyl hydrolase family protein
AT5G04885.1  3.8e-255  67.19         Glycosyl hydrolase family protein
AT3G62710.1  7.0e-201  58.18         Glycosyl hydrolase family protein
AT3G47000.1  1.9e-198  56.83         Glycosyl hydrolase family protein

Match Name                        E-value   Identity (%)  Description
gi|778685993|ref|XP_011652313.1|  0.0e+00   86.78         PREDICTED: lysosomal beta glucosidase-like [Cucumis sativus]
gi|659130020|ref|XP_008464960.1|  0.0e+00   86.78         PREDICTED: lysosomal beta glucosidase-like [Cucumis melo]
gi|659086037|ref|XP_008443733.1|  0.0e+00   83.71         PREDICTED: lysosomal beta glucosidase-like [Cucumis melo]
gi|778665412|ref|XP_011648555.1|  9.7e-311  83.09         PREDICTED: lysosomal beta glucosidase-like [Cucumis sativus]
gi|449446738|ref|XP_004141128.1|  1.9e-301  78.98         PREDICTED: lysosomal beta glucosidase-like [Cucumis sativus]

The following terms have been associated with this gene:
Vocabulary: Biological Process
Term        Definition
GO:0005975  carbohydrate metabolic process
Vocabulary: Molecular Function
Term        Definition
GO:0004553  hydrolase activity, hydrolyzing O-glycosyl compounds
Vocabulary: INTERPRO
Term       Definition
IPR026892  Glycoside hydrolase family 3
IPR017853  Glycoside_hydrolase_SF
IPR002772  Glyco_hydro_3_C
IPR001764  Glyco_hydro_3_N
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lsi01G018950.1  Lsi01G018950.1  mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term   IPR Description                                 Source   Source Term         Source Description                             Alignment
IPR001764  Glycoside hydrolase, family 3, N-terminal       PRINTS   PR00133             GLHYDRLASE3                                    coord: 178..194, score: 3.8E-26; coord: 132..151, score: 3.8E-26; coord: 108..124, score: 3.8E-26; coord: 224..240, score: 3.8E-26; coord: 294..312, score: 3.8
IPR001764  Glycoside hydrolase, family 3, N-terminal       GENE3D   G3DSA:3.20.20.300                                                  coord: 26..384, score: 9.0E
IPR001764  Glycoside hydrolase, family 3, N-terminal       PFAM     PF00933             Glyco_hydro_3                                  coord: 47..368, score: 1.1
IPR002772  Glycoside hydrolase family 3 C-terminal domain  GENE3D   G3DSA:3.40.50.1700                                                 coord: 391..613, score: 4.9
IPR002772  Glycoside hydrolase family 3 C-terminal domain  PFAM     PF01915             Glyco_hydro_3_C                                coord: 406..613, score: 9.3
IPR002772  Glycoside hydrolase family 3 C-terminal domain  unknown  SSF52279            Beta-D-glucan exohydrolase, C-terminal domain  coord: 405..614, score: 6.54
IPR017853  Glycoside hydrolase superfamily                 unknown  SSF51445            (Trans)glycosidases                            coord: 26..404, score: 2.63E
IPR026892  Glycoside hydrolase family 3                    PANTHER  PTHR30620           PERIPLASMIC BETA-GLUCOSIDASE-RELATED           coord: 81..615, score: 0.0; coord: 1..62, score:
None       No IPR available                                PANTHER  PTHR30620:SF39      SUBFAMILY NOT NAMED                            coord: 1..62, score: 0.0; coord: 81..615, score:

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None