Lsi10G004150 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi10G004150
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Beta-D-glucosidase
Location: chr10:6290303..6294263 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTCGTGCTAAAAATGATTAATGACAACACGCCAGACTACTCATACAAACGCTTGACAAAATTGACAACATTCGAATGATTTGCTTAATTTTTATTAGTTAATTTGTTTGCTAACAACTTTTAACAAGTAGTTTTAAACATACTTTGGGAGTCTTAATTAAAATTGAAGAAAAAAAATAATAACATTGCATAATGACAATTTTATACCAAAGTATTATAAATGTAATATAAATATAGATATCCATAACTTATATAGGTTGCAAAATATCATAAAGCTCCCATTGAATTTCATGTTGTAACATACCCCTCATTGAATTCAATCTTGTATGTTTGAATTCAATCTTATAACATACACCATCCCATATCCTATAAATTGTGGTATATGGTGCCTTTGTAATACACACCACAATTGAGTTCAATTCTTATCTCTACTTCCTAATTCTCTCTACTTTGTTGTTTTGTTCTTTCTTTGTTCTTCTTTTGTTCATATTTTATAACACAAAGTAATATTTGACACTATTATTCTTTGTTGATTTTTTGTGTTACTTTTGAAATCCTATATTAATTCTTTAGGTTAACTAACAATTTGAATTAAGTTTGTGTCGAGAAGGTTCTTGAATTTTAAAAAATATCTAATAAATTTCCAAATTTTAATTTTATCTAATAAATCCCTAAACTTTAGAGTCAAATTTTAAGTGTAATAGAATATTAAACTTTCAATTTTGTGTAAGTAGGTCCATGAATCTAAAGAAAATGTTGAATAGGTCAAGAATCCATTAAACACAAAATTAAAATGCAAGAAGCTATTGGACAAAAAATTACAAAATTAAGGACCATTAATATATTAGACACAACATTGAAAGAATAGAGACTTATTAGACAATCAAATTTGAGAACACACAAAAATTTAGAAACTAAACTTATAATTAAAATAATAAACCACCATTGATTTTAGTTTTAACTAGAATTAACATGATTTTAAAGACTAAAAATTAGGAATAGTTGCAAATATAGCAATCTGATTAAAAGTATTAGCATATATAGCAACATTTTAAAAAAATTGCAAATATAGCAAAGTCATGGTCTAGGTGGTCTATCACTGATAGACCATAAGAGTACTATAGATTTTGCTATATTTGCAAATTTTTTTTAAAATGTTGCTATACACTTGGTTATTATCTCTAAAATTGCTACTCATTGCACTTAGCCTAAAAATTATAAGCAAAACCAATACTCAACGAGCGAATCAGCATATACTGGTTTCAATTCGACTAGAAAAAACTATCACCACCTATTTGTTTTTCAAAACAGAAATATTTCTGAGCATACAACCCAGAGGTTCTCTCATACCAACAGCAACAATTAAGTTAAAAAATATTTCTCAAAGAGGATGATCATATATAGAAATGAGGTTGAGAATCAGTATTTCAAAACCCGTTGGAGGTTACCTAAAATGGAGGGGAAACAGAAATTCAAGGACAACCAAACTGACTATAAAAAAAAAACATAACTAACCTAAAGATAGATCTGAAATGAATTCCTGTTAGGTTTCATTGGCCAAGAAGATGATGAAGGTGACAGTGGTTTTACTCTGTTGCTGGGCGGCTTTGGTGGCTGCTGATGAAGACTATGTCAAGTACAAGGACCCGATACAACCGCTTAACATCCGGATCAAAGACCTAATGGATAGAATGACTCTAGCAGACAAGCTTGGGCAGATGGCACAGTTGGATCGTTCGGTTGTAACACCAGAGATCATGAGAGATTACTCCATTGGCAGTGTGCTTAGCGCCGGAGGCAGTGTCCCATCACCACAGGCTACTGCCCAGAAGTGGATTGACATGGTGAATTCAATCCAAAAGGGTTCCTTATCAAGTAGGCTTGGAATTCCAATGTTATATGGTATCGATGCTGTCCATGGAAATAACAATGTCTACAATGCCACAGTCTTTCCCCACAATGTCGGTCTTGGCGCAACCAGGTGATAATACATGAGAG
GAGGATCCAAACTACATTTTTGAACTACCAACGGTATGGCCCATGTCGAGCATCGACACAGAGTTACTGGTGGCTAAATAGTTTTCTCATGTGACAGGGAACCTGAACTTTTAAGGAGGATTGGTGCTGCTACTGCTAAAGAAGTTAAAGCAACTGGGATTGATTATGTCTTTGCTCCATGCATAGCAGTATGTTTTCTAGAATTCATTTCCCCCTCTTTATATAAAACTTGTCTGATTCAAACAGAACTACGAGCATTTTTGTTTCAGGTCTGTAGAGATCCTAGATGGGGAAGGTGCTATGAAAGCTACAGTGAAGATCCTAACATTGTCAAAGAAATGACAGATATCATAGTTGGGCTGCAAGGACAAATCCCATCTGGTTTTTCAAAAGGTATTCCATATGTTGGTGGAAGGTAAGTTTTCAACCACTAAAGATTTGGACATTTTTCCATTTCACAAAACTTACTCAAAAAACCTCTTGGCCTTGTAACAGAGACAAGGTTGCAGCTTGTGCAAAGCATTATGTGGGCGATGGCAGCACAACAAGGGGTATCAACGAGAACAACACCGTAATTAGCAGGCATGAATTGTTGAGCATTCACATGCCAGGATACTATCACTCCATAATCAAAGGTGTCTCTACAATAATGGTTTCCTACTCCAGTTGGAACGGTGAGAAGATGCATTCAAACCATGAACTTATCACTGATTTCCTTAAGAACACTCTAAACTTCAGGGTATGTATTGCACTAAATCCTCATGGATGATAAACCTATATACAAGCCCCTAAATTTGAACGGACATAACTATTGCAGGGTTTTGTAATCTCCGATTGGCAAGGTATTGATAAAATCACAGACCCAGCTCATTCAAATTACACATTTTCAATTCTCTCTGGAGTTCAAGCTGGAATAGATATGGTAAGGTAATCAACATAGGCAATCCAATCAACAGAGCATCTATAGATCTTATAAAACTAGTTTTACTGTTGTGTTTCAGGTTATGGTTCCTACAAATTACACAGAGTTCATCGATAACCTTACCTACCTTGTCAACAGCAACGCCATTCCGATGTCTCGAATCAACGACGCCGTGAGAAGGATTTTGAGAGTCAAGTTTGTAATGGGCCTGTTTGAGAATCCATTGGCCGATGACAGATTGGTAAATGAGCTTGGAAGCCAGGTTAGTTGACATGAAAACACAAAGAAATGTAGTTGAAAATTGAAATATCATTTTGAAAAAGATACCAATTCAGCTGAACAGGAACACAAAGATTTGGCAAGAGAAGCAGTGAGGAAATCACTTGTTTTACTGAAGAATGGCGAAAATGCTGATGAACCAGTCCTTCCTCTGTCGAAGAAGGCAGCGAAGATCTTAGTAGCTGGAACTCACGCCGACAATCTTGGTTACCAGTGCGGCGGCTGGACAATCACCTGGCAAGGACTCAGCGGCAACAATCTCACAACCGGAACCACCATTCTCGAGGCAGTGAAGAAAACCGTCGATCCAAACACGGAGGTCATCTACAATGTAAATCCGACGACTGATTACCTCAAGGCAAACAACTTCTCGTACGCCATTGTCGTGGTAGGAGAGACGCCGTACGCCGAGACCGATGGCGACAACCTGAACCTGACTATCGCCGAAGGAGGTTCGGACACGATCCAGAACGTGTGCAACGTTGTGAAGTGTGTCGTCGTCATCGTCTCCGGCCGACCTCTGACGATTGAGCCGCACATGTCGCAGTTGGACGCGCTGGTGGCGGCGTGGCTGCCGGGAACAGAGGGGGAGGGCGTGACCGACGTGCTGTTCGGTGATTATGGATTCACCGGTAAGCTGGCAAGGACGTGGTTCAAGACGGTGGATCAACTTCCGATGAACTATGGCGATGAGAATTACAATCCGCTTTTCCCTCTAGGATTTGGGCTTACAACTGAGCCTGTTAATAAAGCAAGCTAG

mRNA sequence

ATGCTCGTGACAGTGGTTTTACTCTGTTGCTGGGCGGCTTTGGTGGCTGCTGATGAAGACTATGTCAAGTACAAGGACCCGATACAACCGCTTAACATCCGGATCAAAGACCTAATGGATAGAATGACTCTAGCAGACAAGCTTGGGCAGATGGCACAGTTGGATCGTTCGGTTGTAACACCAGAGATCATGAGAGATTACTCCATTGGCAGTGTGCTTAGCGCCGGAGGCAGTGTCCCATCACCACAGGCTACTGCCCAGAAGTGGATTGACATGGTTATGGTTCCTACAAATTACACAGAGTTCATCGATAACCTTACCTACCTTGTCAACAGCAACGCCATTCCGATGTCTCGAATCAACGACGCCGTGAGAAGGATTTTGAGAGTCAAGTTTGTAATGGGCCTGTTTGAGAATCCATTGGCCGATGACAGATTGGTAAATGAGCTTGGAAGCCAGAATGGCGAAAATGCTGATGAACCAGTCCTTCCTCTGTCGAAGAAGGCAGCGAAGATCTTAGTAGCTGGAACTCACGCCGACAATCTTGGTTACCAGTGCGGCGGCTGGACAATCACCTGGCAAGGACTCAGCGGCAACAATCTCACAACCGGAACCACCATTCTCGAGGCAGTGAAGAAAACCGTCGATCCAAACACGGAGGTCATCTACAATGTAAATCCGACGACTGATTACCTCAAGGCAAACAACTTCTCGTACGCCATTGTCGTGGTAGGAGAGACGCCGTACGCCGAGACCGATGGCGACAACCTGAACCTGACTATCGCCGAAGGAGGTTCGGACACGATCCAGAACGTGTGCAACGTTGTGAAGTGTGTCGTCGTCATCGTCTCCGGCCGACCTCTGACGATTGAGCCGCACATGTCGCAGTTGGACGCGCTGGTGGCGGCGTGGCTGCCGGGAACAGAGGGGGAGGGCGTGACCGACGTGCTGTTCGGTGATTATGGATTCACCGGTAAGCTGGCAAGGACGTGGTTCAAGACGGTGGATCAACTTCCGATGAACTATGGCGATGAGAATTACAATCCGCTTTTCCCTCTAGGATTTGGGCTTACAACTGAGCCTGTTAATAAAGCAAGCTAG

Coding sequence (CDS)

ATGCTCGTGACAGTGGTTTTACTCTGTTGCTGGGCGGCTTTGGTGGCTGCTGATGAAGACTATGTCAAGTACAAGGACCCGATACAACCGCTTAACATCCGGATCAAAGACCTAATGGATAGAATGACTCTAGCAGACAAGCTTGGGCAGATGGCACAGTTGGATCGTTCGGTTGTAACACCAGAGATCATGAGAGATTACTCCATTGGCAGTGTGCTTAGCGCCGGAGGCAGTGTCCCATCACCACAGGCTACTGCCCAGAAGTGGATTGACATGGTTATGGTTCCTACAAATTACACAGAGTTCATCGATAACCTTACCTACCTTGTCAACAGCAACGCCATTCCGATGTCTCGAATCAACGACGCCGTGAGAAGGATTTTGAGAGTCAAGTTTGTAATGGGCCTGTTTGAGAATCCATTGGCCGATGACAGATTGGTAAATGAGCTTGGAAGCCAGAATGGCGAAAATGCTGATGAACCAGTCCTTCCTCTGTCGAAGAAGGCAGCGAAGATCTTAGTAGCTGGAACTCACGCCGACAATCTTGGTTACCAGTGCGGCGGCTGGACAATCACCTGGCAAGGACTCAGCGGCAACAATCTCACAACCGGAACCACCATTCTCGAGGCAGTGAAGAAAACCGTCGATCCAAACACGGAGGTCATCTACAATGTAAATCCGACGACTGATTACCTCAAGGCAAACAACTTCTCGTACGCCATTGTCGTGGTAGGAGAGACGCCGTACGCCGAGACCGATGGCGACAACCTGAACCTGACTATCGCCGAAGGAGGTTCGGACACGATCCAGAACGTGTGCAACGTTGTGAAGTGTGTCGTCGTCATCGTCTCCGGCCGACCTCTGACGATTGAGCCGCACATGTCGCAGTTGGACGCGCTGGTGGCGGCGTGGCTGCCGGGAACAGAGGGGGAGGGCGTGACCGACGTGCTGTTCGGTGATTATGGATTCACCGGTAAGCTGGCAAGGACGTGGTTCAAGACGGTGGATCAACTTCCGATGAACTATGGCGATGAGAATTACAATCCGCTTTTCCCTCTAGGATTTGGGCTTACAACTGAGCCTGTTAATAAAGCAAGCTAG

Protein sequence

MLVTVVLLCCWAALVAADEDYVKYKDPIQPLNIRIKDLMDRMTLADKLGQMAQLDRSVVTPEIMRDYSIGSVLSAGGSVPSPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSNAIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNELGSQNGENADEPVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQGLSGNNLTTGTTILEAVKKTVDPNTEVIYNVNPTTDYLKANNFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNVVKCVVVIVSGRPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTVDQLPMNYGDENYNPLFPLGFGLTTEPVNKAS
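
The protein sequence above should correspond to a translation of the coding sequence. The following is a minimal Python sketch of that step, assuming the standard genetic code (the record does not state which translation table was used). The names `BASES`, `AMINO`, `CODON_TABLE`, and `translate` are illustrative and not part of the database.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
# Assumption: the record uses the standard table; it does not say so explicitly.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each of the 64 codons to its one-letter amino acid ('*' marks a stop).
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First six codons and last three codons of the CDS listed in this record.
print(translate("ATGCTCGTGACAGTGGTT"))  # MLVTVV
print(translate("GCAAGCTAG"))           # AS
```

Applied to the full CDS string, the result can be compared with the protein sequence to check that the two entries agree.
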
BLAST of Lsi10G004150 vs. Swiss-Prot
Match: BGH3B_BACO1 (Beta-glucosidase BoGH3B OS=Bacteroides ovatus (strain ATCC 8483 / DSM 1896 / JCM 5824 / NCTC 11153) GN=BACOVA_02659 PE=1 SV=1)

HSP 1 Score: 106.3 bits (264), Expect = 7.2e-22
Identity = 91/319 (28.53%), Positives = 152/319 (47.65%), Query Frame = 1

Query: 90  IDMVMVPTNYTEFIDNLTYLVNSNAIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNE 149
           IDM MVP   + F D L  LV    + M RI+DAV R+LR+K+ +GLF++P  D +  ++
Sbjct: 343 IDMSMVPYEVS-FCDYLKELVEEGEVSMERIDDAVARVLRLKYRLGLFDHPYWDIKKYDK 402

Query: 150 LGS--------QNGENA------DEPVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQG 209
            GS        Q  E +      D  +LP++ K  KIL+ G +A+++    GGW+ +WQG
Sbjct: 403 FGSKEFAAVALQAAEESEVLLKNDGNILPIA-KGKKILLTGPNANSMRCLNGGWSYSWQG 462

Query: 210 -LSGNNLTTGTTILEAVKKTVDPNTEVIYNVNPTTDYLKANNF----------------- 269
            ++        TI EA+ +       +IY    T    K +N+                 
Sbjct: 463 HVADEYAQAYHTIYEALCEKYG-KENIIYEPGVTYASYKNDNWWEENKPETEKPVAAAAQ 522

Query: 270 -SYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNVVKCVVVIVS-GRPLTIEPHMS 329
               I  +GE  Y ET G+  +LT++E   + ++ +    K +V++++ GRP  I   + 
Sbjct: 523 ADIIITCIGENSYCETPGNLTDLTLSENQRNLVKALAATGKPIVLVLNQGRPRIINDIVP 582

Query: 330 QLDALVAAWLPGT-EGEGVTDVLFGDYGFTGKLARTW-----------FKTVDQLPMNYG 359
              A+V   LP    G+ + ++L GD  F+GK+  T+           +K  + +    G
Sbjct: 583 LAKAVVNIMLPSNYGGDALANLLAGDANFSGKMPFTYPRLINALATYDYKPCENMGQMGG 642
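
Each HSP summary reports identities and positives as a count over the alignment length, followed by the same ratio as a percentage. The sketch below, in Python, parses one such summary line and recomputes the percentages as a consistency check. The regular expression and the names `HSP_RE` and `parse_hsp_stats` are illustrative assumptions, not part of any BLAST tooling; the pattern accepts any spelling of the positives label that starts with "Pos".

```python
import re

# Pattern for lines such as:
#   Identity = 91/319 (28.53%), Positives = 152/319 (47.65%), Query Frame = 1
# "Pos\w+" tolerates spelling variants of the positives label.
HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Pos\w+\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract identity/positive counts and recompute their percentages."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        "identity_pct_reported": float(ident_pct),
        "positive_pct_reported": float(pos_pct),
        # Recomputed values, rounded to two decimals like the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
    }


stats = parse_hsp_stats(
    "Identity = 91/319 (28.53%), Positives = 152/319 (47.65%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positive_pct"])  # 28.53 47.65
```

For the first Swiss-Prot hit, 91 identical and 152 positive positions over 319 aligned columns give 28.53% and 47.65%, matching the reported values.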

BLAST of Lsi10G004150 vs. Swiss-Prot
Match: GLUA_DICDI (Lysosomal beta glucosidase OS=Dictyostelium discoideum GN=gluA PE=1 SV=2)

HSP 1 Score: 102.8 bits (255), Expect = 8.0e-21
Identity = 91/313 (29.07%), Positives = 152/313 (48.56%), Query Frame = 1

Query: 90  IDMVMVPTNYTEFIDNLTYLVNSNAIPMSRINDAVRRILRVKFVMGLFENPLADDR--LV 149
           IDM MVP + + F   L  +V +  +P SR++ +VRRIL +K+ +GLF NP  +    +V
Sbjct: 392 IDMSMVPLDLS-FPIILAEMVAAGTVPESRLDLSVRRILNLKYALGLFSNPYPNPNAAIV 451

Query: 150 NELGSQNGENA--------------DEPVLPLSKKAAK-ILVAGTHADNLGYQCGGWTIT 209
           + +G      A                 +LPL+    K +L+ G  AD++    GGW++ 
Sbjct: 452 DTIGQVQDREAAAATAEESITLLQNKNNILPLNTNTIKNVLLTGPSADSIRNLNGGWSVH 511

Query: 210 WQGL-SGNNLTTGTTILEAVKK----TVDPNTE--VIYNVNPTTDYLK-------ANNFS 269
           WQG    +    GT+IL  +++    T D N +  + + +   T+          A +  
Sbjct: 512 WQGAYEDSEFPFGTSILTGLREITNDTADFNIQYTIGHEIGVPTNQTSIDEAVELAQSSD 571

Query: 270 YAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNVVKCVV-VIVSGRPLTIEPHM-SQ 329
             +VV+GE P AET GD  +L++       +Q + +  K VV ++V  RP  + P +   
Sbjct: 572 VVVVVIGELPEAETPGDIYDLSMDPNEVLLLQQLVDTGKPVVLILVEARPRILPPDLVYS 631

Query: 330 LDALVAAWLPGTE-GEGVTDVLFGDYGFTGKLARTWFKTVDQLPMNY---GDEN--YNPL 364
             A++ A+LPG+E G+ + ++L G+   +G+L  T+  T   + + Y     EN    PL
Sbjct: 632 CAAVLMAYLPGSEGGKPIANILMGNVNPSGRLPLTYPGTTGDIGVPYYHKYSENGVTTPL 691

BLAST of Lsi10G004150 vs. Swiss-Prot
Match: BGLX_SALTY (Periplasmic beta-glucosidase OS=Salmonella typhimurium (strain LT2 / SGSC1412 / ATCC 700720) GN=bglX PE=3 SV=2)

HSP 1 Score: 98.2 bits (243), Expect = 2.0e-19
Identity = 90/342 (26.32%), Positives = 144/342 (42.11%), Query Frame = 1

Query: 90  IDMVMVPTNYTEFIDNLTYLVNSNAIPMSRINDAVRRILRVKFVMGLFENPLA------- 149
           +DM M    Y++++  L   + S  + M+ ++DA R +L VK+ MGLF +P +       
Sbjct: 315 VDMSMADEYYSKYLPGL---IKSGKVTMAELDDATRHVLNVKYDMGLFNDPYSHLGPKES 374

Query: 150 ---DDRLVNELGSQNG-ENADEPVLPLS--------KKAAKILVAGTHADNLGYQCGGWT 209
              D    + L  +   E A E V+ L         KK+  I V G  AD+     G W+
Sbjct: 375 DPVDTNAESRLHRKEAREVARESVVLLKNRLETLPLKKSGTIAVVGPLADSQRDVMGSWS 434

Query: 210 ITWQGLSGNNLTTGTTILEAVKKTVDPNTEVIY-------NVNPTTDYLK---------- 269
               G++  ++T    +L  ++  V    +++Y       N     D+L           
Sbjct: 435 AA--GVANQSVT----VLAGIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDP 494

Query: 270 -------------ANNFSYAIVVVGETP-YAETDGDNLNLTIAEGGSDTIQNVCNVVK-C 329
                        A      + VVGE+   A       N+TI +   D I  +    K  
Sbjct: 495 RSPQAMIDEAVQAAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGKPL 554

Query: 330 VVVIVSGRPLTIEPHMSQLDALVAAWLPGTEG-EGVTDVLFGDYGFTGKLARTWFKTVDQ 359
           V+V+++GRPL +     Q DA++  W  GTEG   + DVLFGDY  +GKL  ++ ++V Q
Sbjct: 555 VLVLMNGRPLALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPISFPRSVGQ 614

BLAST of Lsi10G004150 vs. Swiss-Prot
Match: BGLX_ECOLI (Periplasmic beta-glucosidase OS=Escherichia coli (strain K12) GN=bglX PE=3 SV=2)

HSP 1 Score: 76.6 bits (187), Expect = 6.1e-13
Identity = 88/344 (25.58%), Positives = 146/344 (42.44%), Query Frame = 1

Query: 90  IDMVMVPTNYTEFIDNLTYLVNSNAIPMSRINDAVRRILRVKFVMGLFENPLA------- 149
           I+M M    Y++++  L   + S  + M+ ++DA R +L VK+ MGLF +P +       
Sbjct: 315 INMSMSDEYYSKYLPGL---IKSGKVTMAELDDAARHVLNVKYDMGLFNDPYSHLGPKES 374

Query: 150 ---DDRLVNELGSQNG-ENADEPVLPLS--------KKAAKILVAGTHADNLGYQCGGWT 209
              D    + L  +   E A E ++ L         KK+A I V G  AD+     G W+
Sbjct: 375 DPVDTNAESRLHRKEAREVARESLVLLKNRLETLPLKKSATIAVVGPLADSKRDVMGSWS 434

Query: 210 ITWQGLSGNNLTTGTTILEAVKKTVDPNTEVIY--NVNPTT-----DYLKANNFSYAIVV 269
               G++  ++    T+L  +K  V  N +V+Y    N T+     D+L  N +  A+ V
Sbjct: 435 AA--GVADQSV----TVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFL--NQYEEAVKV 494

Query: 270 VGETPYAETD-----GDNLNLTIA----------EGGSDTIQNVCNVVKCVVVIV--SGR 329
              +P    D         ++ +A          E  S T   +    + ++  +  +G+
Sbjct: 495 DPRSPQEMIDEAVQTAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAALKATGK 554

Query: 330 PLTI-----EPHM-----SQLDALVAAWLPGTE-GEGVTDVLFGDYGFTGKLARTWFKTV 359
           PL +      P        Q DA++  W  GTE G  + DVLFGDY  +GKL  ++ ++V
Sbjct: 555 PLVLVLMNGRPLALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPMSFPRSV 614

BLAST of Lsi10G004150 vs. Swiss-Prot
Match: XYL3A_PRER2 (Xylan 1,4-beta-xylosidase OS=Prevotella ruminicola (strain ATCC 19189 / JCM 8958 / 23) GN=xyl3A PE=1 SV=1)

HSP 1 Score: 66.6 bits (161), Expect = 6.3e-10
Identity = 41/99 (41.41%), Positives = 54/99 (54.55%), Query Frame = 1

Query: 279 VVVIVSGRPLTIEPHMSQLDALVAAWLPGTEG-EGVTDVLFGDYGFTGKLARTWFKTVDQ 338
           + V  SG  + ++P     DA+V AW PG EG   V DVLFGDY   GKL+ T++K   Q
Sbjct: 656 IYVNCSGSAIALQPETESCDAIVQAWYPGQEGGTAVADVLFGDYNPGGKLSVTFYKNDQQ 715

Query: 339 LPMNYGDENY---------NPLFPLGFGL--TTEPVNKA 366
           LP +Y D +          + LFP G+GL  TT  V +A
Sbjct: 716 LP-DYEDYSMKGRTYRYFDDALFPFGYGLSYTTFEVGEA 753

BLAST of Lsi10G004150 vs. TrEMBL
Match: B9SIA5_RICCO (Hydrolase, hydrolyzing O-glycosyl compounds, putative OS=Ricinus communis GN=RCOM_1322270 PE=4 SV=1)

HSP 1 Score: 432.6 bits (1111), Expect = 4.9e-118
Identity = 217/327 (66.36%), Positives = 251/327 (76.76%), Query Frame = 1

Query: 54  LDRSVVTPEIMRDYSIGSVLSAGGSVPSPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSN 113
           +DR    P     YS+ + +SAG             IDM+MVP NYTEFID LTYLV S 
Sbjct: 310 IDRITFPPHANYTYSVLAGISAG-------------IDMIMVPYNYTEFIDGLTYLVKSG 369

Query: 114 AIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNELGS------------------QNG 173
            IPMSRI+DAV+RILRVKFVMGLFENP AD+ LVN+LGS                  +NG
Sbjct: 370 IIPMSRIDDAVKRILRVKFVMGLFENPNADESLVNQLGSHEHRQLAREAVRKSLVLLRNG 429

Query: 174 ENADEPVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQGLSGNNLTTGTTILEAVKKTV 233
           + AD+P LPL KKA+KILVAG+HADNLGYQCGGWTI WQGL GN+LT+GTTIL A+K TV
Sbjct: 430 KYADKPSLPLPKKASKILVAGSHADNLGYQCGGWTIEWQGLGGNDLTSGTTILTAIKNTV 489

Query: 234 DPNTEVIYNVNPTTDYLKANNFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNV 293
           D +T+V+Y  NP  D++KANNFSYAIVVVGE PYAET GD++NLTIAE G  TIQNVC  
Sbjct: 490 DSSTKVVYEENPDADFVKANNFSYAIVVVGEHPYAETQGDSMNLTIAEPGPSTIQNVCGA 549

Query: 294 VKCVVVIVSGRPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTV 353
           VKCVVV+VSGRP+ I+P+++ +DALVAAWLPGTEG+GV DVLFGDYGFTGKL+ TWFKTV
Sbjct: 550 VKCVVVVVSGRPVVIQPYVNIIDALVAAWLPGTEGQGVADVLFGDYGFTGKLSHTWFKTV 609

Query: 354 DQLPMNYGDENYNPLFPLGFGLTTEPV 363
           DQLPMN GD  Y+PLFP GFGLTTEPV
Sbjct: 610 DQLPMNVGDRYYDPLFPFGFGLTTEPV 623

BLAST of Lsi10G004150 vs. TrEMBL
Match: Q7XAS3_GOSHI (Beta-D-glucosidase OS=Gossypium hirsutum PE=2 SV=1)

HSP 1 Score: 432.6 bits (1111), Expect = 4.9e-118
Identity = 216/329 (65.65%), Positives = 255/329 (77.51%), Query Frame = 1

Query: 54  LDRSVVTPEIMRDYSIGSVLSAGGSVPSPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSN 113
           LDR    P     YS+ + + AG             IDMVMVP N+TEFID+LTY V +N
Sbjct: 312 LDRITSPPHANYSYSVEAGVGAG-------------IDMVMVPYNFTEFIDDLTYQVKNN 371

Query: 114 AIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNELGSQ------------------NG 173
            IPMSRI+DAV+RILRVKFVMGLFENP+AD+ LVN+LGSQ                  NG
Sbjct: 372 IIPMSRIDDAVKRILRVKFVMGLFENPMADNSLVNQLGSQEHRELAREAVRKSLVLLKNG 431

Query: 174 ENADEPVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQGLSGNNLTTGTTILEAVKKTV 233
           E+AD+P+LPL KKA KILVAGTHADNLGYQCGGWTITWQGL GN+LTTGTTIL+AVK TV
Sbjct: 432 ESADKPLLPLPKKATKILVAGTHADNLGYQCGGWTITWQGLGGNDLTTGTTILQAVKNTV 491

Query: 234 DPNTEVIYNVNPTTDYLKANNFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNV 293
           D +T+V+Y+ NP   ++K+  FSYAIVVVGE PYAET GD+LNLTI+E G  TI NVC  
Sbjct: 492 DSSTQVVYSENPDAGFVKSGEFSYAIVVVGEPPYAETYGDSLNLTISEPGPMTIYNVCGS 551

Query: 294 VKCVVVIVSGRPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTV 353
           VKCVVV++SGRP+ ++P +S +DALVAAWLPGTEG+GV+DVLFGDYGFTGKLARTWFKTV
Sbjct: 552 VKCVVVVISGRPVVVQPFVSSVDALVAAWLPGTEGQGVSDVLFGDYGFTGKLARTWFKTV 611

Query: 354 DQLPMNYGDENYNPLFPLGFGLTTEPVNK 365
           DQLPMN GD +Y+PLFP GFGLTT+P ++
Sbjct: 612 DQLPMNVGDPHYDPLFPFGFGLTTKPTHQ 627

BLAST of Lsi10G004150 vs. TrEMBL
Match: A0A0D2S1K4_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_006G118100 PE=4 SV=1)

HSP 1 Score: 432.6 bits (1111), Expect = 4.9e-118
Identity = 215/324 (66.36%), Positives = 253/324 (78.09%), Query Frame = 1

Query: 54  LDRSVVTPEIMRDYSIGSVLSAGGSVPSPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSN 113
           LDR    P     YS+ + + AG             IDMVMVP N+TEFID+LTY V +N
Sbjct: 312 LDRITSPPHANYSYSVEAGVGAG-------------IDMVMVPYNFTEFIDDLTYQVKNN 371

Query: 114 AIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNELGSQ-------------NGENADE 173
            IPMSRI+DAV+RILRVKFVMGLFENP+AD+ LVN+LGSQ             NGE+AD+
Sbjct: 372 IIPMSRIDDAVKRILRVKFVMGLFENPMADNSLVNQLGSQAREAVRKSLVLLKNGESADK 431

Query: 174 PVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQGLSGNNLTTGTTILEAVKKTVDPNTE 233
           P+LPL KKA KILVAGTHADNLGYQCGGWTITWQGL GN+LTTGTTIL+AVK TVD +T+
Sbjct: 432 PLLPLPKKATKILVAGTHADNLGYQCGGWTITWQGLGGNDLTTGTTILQAVKNTVDSSTQ 491

Query: 234 VIYNVNPTTDYLKANNFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNVVKCVV 293
           V+Y+ NP   ++K+  FSYAIVVVGE PYAET GD+LNLTI+E G  TI NVC  VKCVV
Sbjct: 492 VVYSENPDAGFVKSGEFSYAIVVVGEPPYAETYGDSLNLTISEPGPKTIYNVCGSVKCVV 551

Query: 294 VIVSGRPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTVDQLPM 353
           V++SGRP+ ++P +S + ALVAAWLPGTEG+GV DVLFGDYGFTGKLARTWFKTVDQLPM
Sbjct: 552 VVISGRPVVVQPFVSSVHALVAAWLPGTEGQGVADVLFGDYGFTGKLARTWFKTVDQLPM 611

Query: 354 NYGDENYNPLFPLGFGLTTEPVNK 365
           N GD +Y+PLFP GFGLTT+P ++
Sbjct: 612 NVGDSHYDPLFPFGFGLTTKPTHQ 622

BLAST of Lsi10G004150 vs. TrEMBL
Match: D7U8L2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_08s0032g00470 PE=4 SV=1)

HSP 1 Score: 431.0 bits (1107), Expect = 1.4e-117
Identity = 217/326 (66.56%), Positives = 246/326 (75.46%), Query Frame = 1

Query: 54  LDRSVVTPEIMRDYSIGSVLSAGGSVPSPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSN 113
           +DR    P     YSI + + AG             IDM+MVP NYTEFID LTY V S 
Sbjct: 320 IDRITSPPHANYSYSIEAGIKAG-------------IDMIMVPYNYTEFIDGLTYQVKSK 379

Query: 114 AIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNELGSQ------------------NG 173
            IPMSRI+DAVRRILRVKFVMGLFE+PLAD  LV+ELGSQ                  NG
Sbjct: 380 IIPMSRIDDAVRRILRVKFVMGLFESPLADHSLVHELGSQVHRELAREAVRKSLVLLKNG 439

Query: 174 ENADEPVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQGLSGNNLTTGTTILEAVKKTV 233
           E AD+P+LPL KKA KILVAGTHADNLG QCGGWTI WQGLSGNNLT+GTTIL A+KKTV
Sbjct: 440 EPADKPLLPLPKKAPKILVAGTHADNLGNQCGGWTIEWQGLSGNNLTSGTTILSAIKKTV 499

Query: 234 DPNTEVIYNVNPTTDYLKANNFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNV 293
           DP TEV+Y  NP   Y+K++ FSYAIVVVGE PYAET GDNLNLTI + G   I NVC  
Sbjct: 500 DPKTEVVYKENPDLSYVKSSKFSYAIVVVGEPPYAETFGDNLNLTIPDPGPSIITNVCGA 559

Query: 294 VKCVVVIVSGRPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTV 353
           VKCVV+++SGRPL I+P++ Q+DALVAAWLPGTEG+GV DVLFGDYGFTGKL+RTWF+TV
Sbjct: 560 VKCVVIVISGRPLVIQPYVDQIDALVAAWLPGTEGQGVADVLFGDYGFTGKLSRTWFRTV 619

Query: 354 DQLPMNYGDENYNPLFPLGFGLTTEP 362
           +QLPMN GD +Y+PLFP GFGLTTEP
Sbjct: 620 EQLPMNVGDRHYDPLFPFGFGLTTEP 632

BLAST of Lsi10G004150 vs. TrEMBL
Match: A0A067KU67_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_02702 PE=4 SV=1)

HSP 1 Score: 431.0 bits (1107), Expect = 1.4e-117
Identity = 211/328 (64.33%), Positives = 254/328 (77.44%), Query Frame = 1

Query: 54  LDRSVVTPEIMRDYSIGSVLSAGGSVPSPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSN 113
           +DR    P     YSI + +SAG  + SP       I  +MVP NYTEFID LT  V  N
Sbjct: 323 IDRITSPPHANYTYSIQAGISAGIDMASP-------ILQIMVPFNYTEFIDGLTDQVKKN 382

Query: 114 AIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNELGS------------------QNG 173
            IPMSRI+DAV+RILRVKF MGLFENP AD+ LVN+LGS                  +NG
Sbjct: 383 IIPMSRIDDAVKRILRVKFTMGLFENPYADESLVNQLGSQEHRELAREAVRKSLVLLKNG 442

Query: 174 ENADEPVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQGLSGNNLTTGTTILEAVKKTV 233
           +NA+EP+LPL KK++KILVAG+HADNLGYQCGGWTI WQGLSGNN T+GTTIL A+K TV
Sbjct: 443 KNANEPLLPLPKKSSKILVAGSHADNLGYQCGGWTIEWQGLSGNNHTSGTTILTAIKNTV 502

Query: 234 DPNTEVIYNVNPTTDYLKANNFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNV 293
           DP+T+++YN NP  D++K+N FSYAIVVVGE PYAET GD++NLT++  G  TIQNVC  
Sbjct: 503 DPSTKIVYNENPDADFVKSNKFSYAIVVVGEHPYAETQGDSMNLTLSNPGPSTIQNVCGA 562

Query: 294 VKCVVVIVSGRPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTV 353
           VKCVV++VSGRP+ ++P+++ ++ALVAAWLPGTEG+GV DVLFGDYGFTGKL+RTWFK+V
Sbjct: 563 VKCVVIVVSGRPVVMQPYVNSIEALVAAWLPGTEGQGVADVLFGDYGFTGKLSRTWFKSV 622

Query: 354 DQLPMNYGDENYNPLFPLGFGLTTEPVN 364
           DQLPMN GD NY+PLFP GFGLTTEPVN
Sbjct: 623 DQLPMNVGDRNYDPLFPFGFGLTTEPVN 643

BLAST of Lsi10G004150 vs. TAIR10
Match: AT5G20940.1 (AT5G20940.1 Glycosyl hydrolase family protein)

HSP 1 Score: 394.0 bits (1011), Expect = 9.8e-110
Identity = 205/324 (63.27%), Positives = 238/324 (73.46%), Query Frame = 1

Query: 56  RSVVTPEIMRDYSIGSVLSAGGSVPSPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSNAI 115
           R +V  + +    I + L A  S  S  A     +DM M  +N T+ ID LT  V    I
Sbjct: 305 RGIVISDYLGVDQINTPLGANYS-HSVYAATTAGLDMFMGSSNLTKLIDELTSQVKRKFI 364

Query: 116 PMSRINDAVRRILRVKFVMGLFENPLADDRLVNELGS------------------QNGEN 175
           PMSRI+DAV+RILRVKF MGLFENP+AD  L  +LGS                  +NGEN
Sbjct: 365 PMSRIDDAVKRILRVKFTMGLFENPIADHSLAKKLGSKEHRELAREAVRKSLVLLKNGEN 424

Query: 176 ADEPVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQGLSGNNLTTGTTILEAVKKTVDP 235
           AD+P+LPL KKA KILVAGTHADNLGYQCGGWTITWQGL+GNNLT GTTIL AVKKTVDP
Sbjct: 425 ADKPLLPLPKKANKILVAGTHADNLGYQCGGWTITWQGLNGNNLTIGTTILAAVKKTVDP 484

Query: 236 NTEVIYNVNPTTDYLKANNFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNVVK 295
            T+VIYN NP T+++KA +F YAIV VGE PYAE  GD+ NLTI+E G  TI NVC  VK
Sbjct: 485 KTQVIYNQNPDTNFVKAGDFDYAIVAVGEKPYAEGFGDSTNLTISEPGPSTIGNVCASVK 544

Query: 296 CVVVIVSGRPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTVDQ 355
           CVVV+VSGRP+ ++  +S +DALVAAWLPGTEG+GV DVLFGDYGFTGKLARTWFKTVDQ
Sbjct: 545 CVVVVVSGRPVVMQ--ISNIDALVAAWLPGTEGQGVADVLFGDYGFTGKLARTWFKTVDQ 604

Query: 356 LPMNYGDENYNPLFPLGFGLTTEP 362
           LPMN GD +Y+PL+P GFGL T+P
Sbjct: 605 LPMNVGDPHYDPLYPFGFGLITKP 625

BLAST of Lsi10G004150 vs. TAIR10
Match: AT5G20950.1 (AT5G20950.1 Glycosyl hydrolase family protein)

HSP 1 Score: 385.6 bits (989), Expect = 3.5e-107
Identity = 191/326 (58.59%), Positives = 233/326 (71.47%), Query Frame = 1

Query: 54  LDRSVVTPEIMRDYSIGSVLSAGGSVPSPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSN 113
           +DR    P +   YS+ + +SAG             IDM+MVP NYTEFID ++  +   
Sbjct: 309 IDRITTPPHLNYSYSVYAGISAG-------------IDMIMVPYNYTEFIDEISSQIQKK 368

Query: 114 AIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNELGSQ------------------NG 173
            IP+SRI+DA++RILRVKF MGLFE PLAD    N+LGS+                  NG
Sbjct: 369 LIPISRIDDALKRILRVKFTMGLFEEPLADLSFANQLGSKEHRELAREAVRKSLVLLKNG 428

Query: 174 ENADEPVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQGLSGNNLTTGTTILEAVKKTV 233
           +   +P+LPL KK+ KILVAG HADNLGYQCGGWTITWQGL+GN+ T GTTIL AVK TV
Sbjct: 429 KTGAKPLLPLPKKSGKILVAGAHADNLGYQCGGWTITWQGLNGNDHTVGTTILAAVKNTV 488

Query: 234 DPNTEVIYNVNPTTDYLKANNFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNV 293
            P T+V+Y+ NP  +++K+  F YAIVVVGE PYAE  GD  NLTI++ G   I NVC  
Sbjct: 489 APTTQVVYSQNPDANFVKSGKFDYAIVVVGEPPYAEMFGDTTNLTISDPGPSIIGNVCGS 548

Query: 294 VKCVVVIVSGRPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTV 353
           VKCVVV+VSGRP+ I+P++S +DALVAAWLPGTEG+GV D LFGDYGFTGKLARTWFK+V
Sbjct: 549 VKCVVVVVSGRPVVIQPYVSTIDALVAAWLPGTEGQGVADALFGDYGFTGKLARTWFKSV 608

Query: 354 DQLPMNYGDENYNPLFPLGFGLTTEP 362
            QLPMN GD +Y+PL+P GFGLTT+P
Sbjct: 609 KQLPMNVGDRHYDPLYPFGFGLTTKP 621

BLAST of Lsi10G004150 vs. TAIR10
Match: AT5G04885.1 (AT5G04885.1 Glycosyl hydrolase family protein)

HSP 1 Score: 365.9 bits (938), Expect = 2.8e-101
Identity = 179/300 (59.67%), Positives = 223/300 (74.33%), Query Frame = 1

Query: 81  SPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSNAIPMSRINDAVRRILRVKFVMGLFENP 140
           S +A  Q  IDMVMVP N+TEF+++LT LV +N+IP++RI+DAVRRIL VKF MGLFENP
Sbjct: 328 SVRAAIQAGIDMVMVPFNFTEFVNDLTTLVKNNSIPVTRIDDAVRRILLVKFTMGLFENP 387

Query: 141 LADDRLVNELGSQ------------------NGENADEPVLPLSKKAAKILVAGTHADNL 200
           LAD    +ELGSQ                  NG N   P+LPL +K +KILVAGTHADNL
Sbjct: 388 LADYSFSSELGSQAHRDLAREAVRKSLVLLKNG-NKTNPMLPLPRKTSKILVAGTHADNL 447

Query: 201 GYQCGGWTITWQGLSGNNLTTGTTILEAVKKTVDPNTEVIYNVNPTTDYLKANNFSYAIV 260
           GYQCGGWTITWQG SGN  T GTT+L AVK  VD +TEV++  NP  +++K+NNF+YAI+
Sbjct: 448 GYQCGGWTITWQGFSGNKNTRGTTLLSAVKSAVDQSTEVVFRENPDAEFIKSNNFAYAII 507

Query: 261 VVGETPYAETDGDNLNLTIAEGGSDTIQNVCNVVKCVVVIVSGRPLTIEPHMSQLDALVA 320
            VGE PYAET GD+  LT+ + G   I + C  VKCVVV++SGRPL +EP+++ +DALVA
Sbjct: 508 AVGEPPYAETAGDSDKLTMLDPGPAIISSTCQAVKCVVVVISGRPLVMEPYVASIDALVA 567

Query: 321 AWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTVDQLPMNYGDENYNPLFPLGFGLTTEPV 363
           AWLPGTEG+G+TD LFGD+GF+GKL  TWF+  +QLPM+YGD +Y+PLF  G GL TE V
Sbjct: 568 AWLPGTEGQGITDALFGDHGFSGKLPVTWFRNTEQLPMSYGDTHYDPLFAYGSGLETESV 626

BLAST of Lsi10G004150 vs. TAIR10
Match: AT3G62710.1 (AT3G62710.1 Glycosyl hydrolase family protein)

HSP 1 Score: 308.5 bits (789), Expect = 5.4e-84
Identity = 164/314 (52.23%), Positives = 212/314 (67.52%), Query Frame = 1

Query: 81  SPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSNAIPMSRINDAVRRILRVKFVMGLFENP 140
           S +A+    IDMVMVP  Y E+++ LT LVN   IPMSRI+DAVRRILRVKF +GLFEN 
Sbjct: 337 SIEASINAGIDMVMVPWAYPEYLEKLTNLVNGGYIPMSRIDDAVRRILRVKFSIGLFENS 396

Query: 141 LADDRL-VNELGSQ------------------NGENADEPVLPLSKKAAKILVAGTHADN 200
           LAD++L   E GS+                  NG+   + ++PL KK  KI+VAG HA++
Sbjct: 397 LADEKLPTTEFGSEAHREVGREAVRKSMVLLKNGKTDADKIVPLPKKVKKIVVAGRHAND 456

Query: 201 LGYQCGGWTITWQGLSG----------NNLTTG----TTILEAVKKTVDPNTEVIYNVNP 260
           +G+QCGG+++TWQG +G          + L TG    TTILEA++K VDP TEV+Y   P
Sbjct: 457 MGWQCGGFSLTWQGFNGTGEDMPTNTKHGLPTGKIKGTTILEAIQKAVDPTTEVVYVEEP 516

Query: 261 TTDYLKAN-NFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNV-VKCVVVIVSG 320
             D  K + + +Y IVVVGETPYAET GD+  L I + G DT+ + C   +KC+V++V+G
Sbjct: 517 NQDTAKLHADAAYTIVVVGETPYAETFGDSPTLGITKPGPDTLSHTCGSGMKCLVILVTG 576

Query: 321 RPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTVDQLPMNYGDE 360
           RPL IEP++  LDAL  AWLPGTEG+GV DVLFGD+ FTG L RTW K V QLPMN GD+
Sbjct: 577 RPLVIEPYIDMLDALAVAWLPGTEGQGVADVLFGDHPFTGTLPRTWMKHVTQLPMNVGDK 636

BLAST of Lsi10G004150 vs. TAIR10
Match: AT3G47000.1 (AT3G47000.1 Glycosyl hydrolase family protein)

HSP 1 Score: 303.1 bits (775), Expect = 2.3e-82
Identity = 156/293 (53.24%), Positives = 202/293 (68.94%), Query Frame = 1

Query: 90  IDMVMVPTNYTEFIDNLTYLVNSNAIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNE 149
           IDMVMVP  Y +FI ++T LV S  IPM+RINDAV RILRVKFV GLF +PL D  L+  
Sbjct: 317 IDMVMVPFKYEQFIQDMTDLVESGEIPMARINDAVERILRVKFVAGLFGHPLTDRSLLPT 376

Query: 150 LGSQ------------------NGENADEPVLPLSKKAAKILVAGTHADNLGYQCGGWTI 209
           +G +                  +G+NAD+P LPL + A +ILV GTHAD+LGYQCGGWT 
Sbjct: 377 VGCKEHRELAQEAVRKSLVLLKSGKNADKPFLPLDRNAKRILVTGTHADDLGYQCGGWTK 436

Query: 210 TWQGLSGNNLTTGTTILEAVKKTVDPNTEVIYNVNPTTDYLKANN-FSYAIVVVGETPYA 269
           TW GLSG  +T GTT+L+A+K+ V   TEVIY   P+ + L ++  FSYAIV VGE PYA
Sbjct: 437 TWFGLSGR-ITIGTTLLDAIKEAVGDETEVIYEKTPSKETLASSEGFSYAIVAVGEPPYA 496

Query: 270 ETDGDNLNLTIAEGGSDTIQNVCNVVKCVVVIVSGRPLTIEPH-MSQLDALVAAWLPGTE 329
           ET GDN  L I   G+D +  V  ++  +V+++SGRP+ +EP  + + +ALVAAWLPGTE
Sbjct: 497 ETMGDNSELRIPFNGTDIVTAVAEIIPTLVILISGRPVVLEPTVLEKTEALVAAWLPGTE 556

Query: 330 GEGVTDVLFGDYGFTGKLARTWFKTVDQLPMNYGDENYNPLFPLGFGLTTEPV 363
           G+GV DV+FGDY F GKL  +WFK V+ LP++    +Y+PLFP GFGL ++PV
Sbjct: 557 GQGVADVVFGDYDFKGKLPVSWFKHVEHLPLDAHANSYDPLFPFGFGLNSKPV 608

BLAST of Lsi10G004150 vs. NCBI nr
Match: gi|802585432|ref|XP_012070424.1| (PREDICTED: lysosomal beta glucosidase-like [Jatropha curcas])

HSP 1 Score: 433.7 bits (1114), Expect = 3.2e-118
Identity = 212/328 (64.63%), Positives = 253/328 (77.13%), Query Frame = 1

Query: 54  LDRSVVTPEIMRDYSIGSVLSAGGSVPSPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSN 113
           +DR    P     YSI + +SAG             IDM+MVP NYTEFID LT  V  N
Sbjct: 312 IDRITSPPHANYTYSIQAGISAG-------------IDMIMVPFNYTEFIDGLTDQVKKN 371

Query: 114 AIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNELGSQ------------------NG 173
            IPMSRI+DAV+RILRVKF MGLFENP AD+ LVN+LGSQ                  NG
Sbjct: 372 IIPMSRIDDAVKRILRVKFTMGLFENPYADESLVNQLGSQEHRELAREAVRKSLVLLKNG 431

Query: 174 ENADEPVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQGLSGNNLTTGTTILEAVKKTV 233
           +NA+EP+LPL KK++KILVAG+HADNLGYQCGGWTI WQGLSGNN T+GTTIL A+K TV
Sbjct: 432 KNANEPLLPLPKKSSKILVAGSHADNLGYQCGGWTIEWQGLSGNNHTSGTTILTAIKNTV 491

Query: 234 DPNTEVIYNVNPTTDYLKANNFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNV 293
           DP+T+++YN NP  D++K+N FSYAIVVVGE PYAET GD++NLT++  G  TIQNVC  
Sbjct: 492 DPSTKIVYNENPDADFVKSNKFSYAIVVVGEHPYAETQGDSMNLTLSNPGPSTIQNVCGA 551

Query: 294 VKCVVVIVSGRPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTV 353
           VKCVV++VSGRP+ ++P+++ ++ALVAAWLPGTEG+GV DVLFGDYGFTGKL+RTWFK+V
Sbjct: 552 VKCVVIVVSGRPVVMQPYVNSIEALVAAWLPGTEGQGVADVLFGDYGFTGKLSRTWFKSV 611

Query: 354 DQLPMNYGDENYNPLFPLGFGLTTEPVN 364
           DQLPMN GD NY+PLFP GFGLTTEPVN
Sbjct: 612 DQLPMNVGDRNYDPLFPFGFGLTTEPVN 626

BLAST of Lsi10G004150 vs. NCBI nr
Match: gi|255569514|ref|XP_002525724.1| (PREDICTED: beta-glucosidase BoGH3B [Ricinus communis])

HSP 1 Score: 432.6 bits (1111), Expect = 7.0e-118
Identity = 217/327 (66.36%), Positives = 251/327 (76.76%), Query Frame = 1

Query: 54  LDRSVVTPEIMRDYSIGSVLSAGGSVPSPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSN 113
           +DR    P     YS+ + +SAG             IDM+MVP NYTEFID LTYLV S 
Sbjct: 310 IDRITFPPHANYTYSVLAGISAG-------------IDMIMVPYNYTEFIDGLTYLVKSG 369

Query: 114 AIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNELGS------------------QNG 173
            IPMSRI+DAV+RILRVKFVMGLFENP AD+ LVN+LGS                  +NG
Sbjct: 370 IIPMSRIDDAVKRILRVKFVMGLFENPNADESLVNQLGSHEHRQLAREAVRKSLVLLRNG 429

Query: 174 ENADEPVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQGLSGNNLTTGTTILEAVKKTV 233
           + AD+P LPL KKA+KILVAG+HADNLGYQCGGWTI WQGL GN+LT+GTTIL A+K TV
Sbjct: 430 KYADKPSLPLPKKASKILVAGSHADNLGYQCGGWTIEWQGLGGNDLTSGTTILTAIKNTV 489

Query: 234 DPNTEVIYNVNPTTDYLKANNFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNV 293
           D +T+V+Y  NP  D++KANNFSYAIVVVGE PYAET GD++NLTIAE G  TIQNVC  
Sbjct: 490 DSSTKVVYEENPDADFVKANNFSYAIVVVGEHPYAETQGDSMNLTIAEPGPSTIQNVCGA 549

Query: 294 VKCVVVIVSGRPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTV 353
           VKCVVV+VSGRP+ I+P+++ +DALVAAWLPGTEG+GV DVLFGDYGFTGKL+ TWFKTV
Sbjct: 550 VKCVVVVVSGRPVVIQPYVNIIDALVAAWLPGTEGQGVADVLFGDYGFTGKLSHTWFKTV 609

Query: 354 DQLPMNYGDENYNPLFPLGFGLTTEPV 363
           DQLPMN GD  Y+PLFP GFGLTTEPV
Sbjct: 610 DQLPMNVGDRYYDPLFPFGFGLTTEPV 623

BLAST of Lsi10G004150 vs. NCBI nr
Match: gi|763768301|gb|KJB35516.1| (hypothetical protein B456_006G118100 [Gossypium raimondii])

HSP 1 Score: 432.6 bits (1111), Expect = 7.0e-118
Identity = 215/324 (66.36%), Positives = 253/324 (78.09%), Query Frame = 1

Query: 54  LDRSVVTPEIMRDYSIGSVLSAGGSVPSPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSN 113
           LDR    P     YS+ + + AG             IDMVMVP N+TEFID+LTY V +N
Sbjct: 312 LDRITSPPHANYSYSVEAGVGAG-------------IDMVMVPYNFTEFIDDLTYQVKNN 371

Query: 114 AIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNELGSQ-------------NGENADE 173
            IPMSRI+DAV+RILRVKFVMGLFENP+AD+ LVN+LGSQ             NGE+AD+
Sbjct: 372 IIPMSRIDDAVKRILRVKFVMGLFENPMADNSLVNQLGSQAREAVRKSLVLLKNGESADK 431

Query: 174 PVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQGLSGNNLTTGTTILEAVKKTVDPNTE 233
           P+LPL KKA KILVAGTHADNLGYQCGGWTITWQGL GN+LTTGTTIL+AVK TVD +T+
Sbjct: 432 PLLPLPKKATKILVAGTHADNLGYQCGGWTITWQGLGGNDLTTGTTILQAVKNTVDSSTQ 491

Query: 234 VIYNVNPTTDYLKANNFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNVVKCVV 293
           V+Y+ NP   ++K+  FSYAIVVVGE PYAET GD+LNLTI+E G  TI NVC  VKCVV
Sbjct: 492 VVYSENPDAGFVKSGEFSYAIVVVGEPPYAETYGDSLNLTISEPGPKTIYNVCGSVKCVV 551

Query: 294 VIVSGRPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTVDQLPM 353
           V++SGRP+ ++P +S + ALVAAWLPGTEG+GV DVLFGDYGFTGKLARTWFKTVDQLPM
Sbjct: 552 VVISGRPVVVQPFVSSVHALVAAWLPGTEGQGVADVLFGDYGFTGKLARTWFKTVDQLPM 611

Query: 354 NYGDENYNPLFPLGFGLTTEPVNK 365
           N GD +Y+PLFP GFGLTT+P ++
Sbjct: 612 NVGDSHYDPLFPFGFGLTTKPTHQ 622

BLAST of Lsi10G004150 vs. NCBI nr
Match: gi|33391721|gb|AAQ17461.1| (beta-D-glucosidase [Gossypium hirsutum])

HSP 1 Score: 432.6 bits (1111), Expect = 7.0e-118
Identity = 216/329 (65.65%), Positives = 255/329 (77.51%), Query Frame = 1

Query: 54  LDRSVVTPEIMRDYSIGSVLSAGGSVPSPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSN 113
           LDR    P     YS+ + + AG             IDMVMVP N+TEFID+LTY V +N
Sbjct: 312 LDRITSPPHANYSYSVEAGVGAG-------------IDMVMVPYNFTEFIDDLTYQVKNN 371

Query: 114 AIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNELGSQ------------------NG 173
            IPMSRI+DAV+RILRVKFVMGLFENP+AD+ LVN+LGSQ                  NG
Sbjct: 372 IIPMSRIDDAVKRILRVKFVMGLFENPMADNSLVNQLGSQEHRELAREAVRKSLVLLKNG 431

Query: 174 ENADEPVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQGLSGNNLTTGTTILEAVKKTV 233
           E+AD+P+LPL KKA KILVAGTHADNLGYQCGGWTITWQGL GN+LTTGTTIL+AVK TV
Sbjct: 432 ESADKPLLPLPKKATKILVAGTHADNLGYQCGGWTITWQGLGGNDLTTGTTILQAVKNTV 491

Query: 234 DPNTEVIYNVNPTTDYLKANNFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNV 293
           D +T+V+Y+ NP   ++K+  FSYAIVVVGE PYAET GD+LNLTI+E G  TI NVC  
Sbjct: 492 DSSTQVVYSENPDAGFVKSGEFSYAIVVVGEPPYAETYGDSLNLTISEPGPMTIYNVCGS 551

Query: 294 VKCVVVIVSGRPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTV 353
           VKCVVV++SGRP+ ++P +S +DALVAAWLPGTEG+GV+DVLFGDYGFTGKLARTWFKTV
Sbjct: 552 VKCVVVVISGRPVVVQPFVSSVDALVAAWLPGTEGQGVSDVLFGDYGFTGKLARTWFKTV 611

Query: 354 DQLPMNYGDENYNPLFPLGFGLTTEPVNK 365
           DQLPMN GD +Y+PLFP GFGLTT+P ++
Sbjct: 612 DQLPMNVGDPHYDPLFPFGFGLTTKPTHQ 627

BLAST of Lsi10G004150 vs. NCBI nr
Match: gi|643732586|gb|KDP39682.1| (hypothetical protein JCGZ_02702 [Jatropha curcas])

HSP 1 Score: 431.0 bits (1107), Expect = 2.0e-117
Identity = 211/328 (64.33%), Positives = 254/328 (77.44%), Query Frame = 1

Query: 54  LDRSVVTPEIMRDYSIGSVLSAGGSVPSPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSN 113
           +DR    P     YSI + +SAG  + SP       I  +MVP NYTEFID LT  V  N
Sbjct: 323 IDRITSPPHANYTYSIQAGISAGIDMASP-------ILQIMVPFNYTEFIDGLTDQVKKN 382

Query: 114 AIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNELGS------------------QNG 173
            IPMSRI+DAV+RILRVKF MGLFENP AD+ LVN+LGS                  +NG
Sbjct: 383 IIPMSRIDDAVKRILRVKFTMGLFENPYADESLVNQLGSQEHRELAREAVRKSLVLLKNG 442

Query: 174 ENADEPVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQGLSGNNLTTGTTILEAVKKTV 233
           +NA+EP+LPL KK++KILVAG+HADNLGYQCGGWTI WQGLSGNN T+GTTIL A+K TV
Sbjct: 443 KNANEPLLPLPKKSSKILVAGSHADNLGYQCGGWTIEWQGLSGNNHTSGTTILTAIKNTV 502

Query: 234 DPNTEVIYNVNPTTDYLKANNFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNV 293
           DP+T+++YN NP  D++K+N FSYAIVVVGE PYAET GD++NLT++  G  TIQNVC  
Sbjct: 503 DPSTKIVYNENPDADFVKSNKFSYAIVVVGEHPYAETQGDSMNLTLSNPGPSTIQNVCGA 562

Query: 294 VKCVVVIVSGRPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTV 353
           VKCVV++VSGRP+ ++P+++ ++ALVAAWLPGTEG+GV DVLFGDYGFTGKL+RTWFK+V
Sbjct: 563 VKCVVIVVSGRPVVMQPYVNSIEALVAAWLPGTEGQGVADVLFGDYGFTGKLSRTWFKSV 622

Query: 354 DQLPMNYGDENYNPLFPLGFGLTTEPVN 364
           DQLPMN GD NY+PLFP GFGLTTEPVN
Sbjct: 623 DQLPMNVGDRNYDPLFPFGFGLTTEPVN 643
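
The Identity and Positives percentages reported for each HSP above are plain ratios of matching columns to alignment length. As a minimal sketch (the function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool), the percentages can be recomputed from the raw counts to check a report line:

```python
import re

# Matches lines such as "Identity = 215/324 (66.36%), Positives = 253/324 (78.09%)".
# The optional "i" also tolerates the "Postives" spelling that some report pages emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("no HSP statistics found in line")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return (
        round(100 * int(ident) / int(ident_len), 2),
        round(100 * int(pos) / int(pos_len), 2),
    )

# Statistics line of the Gossypium raimondii HSP shown above.
line = "Identity = 215/324 (66.36%), Positives = 253/324 (78.09%), Query Frame = 1"
print(hsp_percentages(line))  # (66.36, 78.09)
```

The recomputed values agree with the percentages printed in the report, which suggests the counts and percentages in these lines are consistent with each other.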

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
BGH3B_BACO1 | 7.2e-22 | 28.53 | Beta-glucosidase BoGH3B OS=Bacteroides ovatus (strain ATCC 8483 / DSM 1896 / JCM...
GLUA_DICDI | 8.0e-21 | 29.07 | Lysosomal beta glucosidase OS=Dictyostelium discoideum GN=gluA PE=1 SV=2
BGLX_SALTY | 2.0e-19 | 26.32 | Periplasmic beta-glucosidase OS=Salmonella typhimurium (strain LT2 / SGSC1412 / ...
BGLX_ECOLI | 6.1e-13 | 25.58 | Periplasmic beta-glucosidase OS=Escherichia coli (strain K12) GN=bglX PE=3 SV=2
XYL3A_PRER2 | 6.3e-10 | 41.41 | Xylan 1,4-beta-xylosidase OS=Prevotella ruminicola (strain ATCC 19189 / JCM 8958...

Match Name | E-value | Identity | Description
B9SIA5_RICCO | 4.9e-118 | 66.36 | Hydrolase, hydrolyzing O-glycosyl compounds, putative OS=Ricinus communis GN=RCO...
Q7XAS3_GOSHI | 4.9e-118 | 65.65 | Beta-D-glucosidase OS=Gossypium hirsutum PE=2 SV=1
A0A0D2S1K4_GOSRA | 4.9e-118 | 66.36 | Uncharacterized protein OS=Gossypium raimondii GN=B456_006G118100 PE=4 SV=1
D7U8L2_VITVI | 1.4e-117 | 66.56 | Putative uncharacterized protein OS=Vitis vinifera GN=VIT_08s0032g00470 PE=4 SV=...
A0A067KU67_JATCU | 1.4e-117 | 64.33 | Uncharacterized protein OS=Jatropha curcas GN=JCGZ_02702 PE=4 SV=1

Match Name | E-value | Identity | Description
AT5G20940.1 | 9.8e-110 | 63.27 | Glycosyl hydrolase family protein
AT5G20950.1 | 3.5e-107 | 58.59 | Glycosyl hydrolase family protein
AT5G04885.1 | 2.8e-101 | 59.67 | Glycosyl hydrolase family protein
AT3G62710.1 | 5.4e-84 | 52.23 | Glycosyl hydrolase family protein
AT3G47000.1 | 2.3e-82 | 53.24 | Glycosyl hydrolase family protein

Match Name | E-value | Identity | Description
gi|802585432|ref|XP_012070424.1| | 3.2e-118 | 64.63 | PREDICTED: lysosomal beta glucosidase-like [Jatropha curcas]
gi|255569514|ref|XP_002525724.1| | 7.0e-118 | 66.36 | PREDICTED: beta-glucosidase BoGH3B [Ricinus communis]
gi|763768301|gb|KJB35516.1| | 7.0e-118 | 66.36 | hypothetical protein B456_006G118100 [Gossypium raimondii]
gi|33391721|gb|AAQ17461.1| | 7.0e-118 | 65.65 | beta-D-glucosidase [Gossypium hirsutum]
gi|643732586|gb|KDP39682.1| | 2.0e-117 | 64.33 | hypothetical protein JCGZ_02702 [Jatropha curcas]
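
The E-values in the hit tables above differ by many orders of magnitude, so they are easier to compare on a logarithmic scale. The following is a minimal sketch, assuming the NCBI nr hits (the `gi|...` entries) have been copied by hand into a list of tuples; the name `rank_hits` and the tie-breaking rule (higher identity first) are illustrative choices, not something the database prescribes:

```python
import math

# (match name, E-value, percent identity) copied from the NCBI nr hit table above.
hits = [
    ("gi|802585432|ref|XP_012070424.1|", 3.2e-118, 64.63),
    ("gi|255569514|ref|XP_002525724.1|", 7.0e-118, 66.36),
    ("gi|763768301|gb|KJB35516.1|", 7.0e-118, 66.36),
    ("gi|33391721|gb|AAQ17461.1|", 7.0e-118, 65.65),
    ("gi|643732586|gb|KDP39682.1|", 2.0e-117, 64.33),
]

def rank_hits(rows):
    """Sort by ascending E-value, breaking ties by descending identity."""
    return sorted(rows, key=lambda r: (r[1], -r[2]))

for name, evalue, identity in rank_hits(hits):
    # -log10(E) puts very small E-values on a readable scale.
    print(f"{name}\t{-math.log10(evalue):.1f}\t{identity:.2f}")
```

Sorting on the numeric E-value rather than on its text form avoids misordering values such as 2.0e-117 and 7.0e-118.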
The following terms have been associated with this gene:
Vocabulary: Biological Process
Term | Definition
GO:0005975 | carbohydrate metabolic process
Vocabulary: Molecular Function
Term | Definition
GO:0004553 | hydrolase activity, hydrolyzing O-glycosyl compounds
Vocabulary: INTERPRO
Term | Definition
IPR026892 | Glycoside hydrolase family 3
IPR017853 | Glycoside_hydrolase_SF
IPR002772 | Glyco_hydro_3_C
IPR001764 | Glyco_hydro_3_N
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009251 glucan catabolic process
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0008422 beta-glucosidase activity
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Lsi10G004150.1 | Lsi10G004150.1 | mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001764 | Glycoside hydrolase, family 3, N-terminal | GENE3D | G3DSA:3.20.20.300 | | coord: 23..89, score: 4.3E-10; coord: 90..145, score: 6.4
IPR001764 | Glycoside hydrolase, family 3, N-terminal | PFAM | PF00933 | Glyco_hydro_3 | coord: 85..130, score: 1.
IPR002772 | Glycoside hydrolase family 3 C-terminal domain | GENE3D | G3DSA:3.40.50.1700 | | coord: 157..358, score: 5.5
IPR002772 | Glycoside hydrolase family 3 C-terminal domain | PFAM | PF01915 | Glyco_hydro_3_C | coord: 160..358, score: 3.1
IPR002772 | Glycoside hydrolase family 3 C-terminal domain | unknown | SSF52279 | Beta-D-glucan exohydrolase, C-terminal domain | coord: 157..358, score: 7.06
IPR017853 | Glycoside hydrolase superfamily | unknown | SSF51445 | (Trans)glycosidases | coord: 90..154, score: 3.79E-13; coord: 22..89, score: 2.5
IPR026892 | Glycoside hydrolase family 3 | PANTHER | PTHR30620 | PERIPLASMIC BETA-GLUCOSIDASE-RELATED | coord: 90..363, score: 3.6E
None | No IPR available | PANTHER | PTHR30620:SF39 | SUBFAMILY NOT NAMED | coord: 90..363, score: 3.6E
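
Each InterPro alignment above is given as an inclusive `coord: start..end` range on the protein. As a rough sketch (the helper name `span_lengths` is an assumption made for illustration), the length of each matched region in residues can be derived from those ranges:

```python
import re

# Captures the start and end of every "coord: start..end" range in a string.
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")

def span_lengths(alignment):
    """Return the residue length of every inclusive coordinate range found."""
    return [int(end) - int(start) + 1 for start, end in COORD.findall(alignment)]

# Coordinate ranges taken from the InterPro annotations above.
print(span_lengths("coord: 160..358"))               # PF01915 match
print(span_lengths("coord: 23..89 coord: 90..145"))  # G3DSA:3.20.20.300 matches
```

Because the ranges are inclusive at both ends, the length is `end - start + 1` rather than `end - start`.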

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene | Organism | Block
Lsi10G004150 | Cucurbita pepo (Zucchini) | cpelsiB436
Lsi10G004150 | Cucurbita pepo (Zucchini) | cpelsiB589
Lsi10G004150 | Wild cucumber (PI 183967) | cpilsiB412
Lsi10G004150 | Wild cucumber (PI 183967) | cpilsiB497
Lsi10G004150 | Cucumber (Chinese Long) v2 | culsiB093
Lsi10G004150 | Cucumber (Chinese Long) v2 | culsiB489
Lsi10G004150 | Melon (DHL92) v3.5.1 | lsimeB064
Lsi10G004150 | Watermelon (Charleston Gray) | lsiwcgB039
Lsi10G004150 | Watermelon (Charleston Gray) | lsiwcgB042
Lsi10G004150 | Watermelon (Charleston Gray) | lsiwcgB057
Lsi10G004150 | Watermelon (97103) v1 | lsiwmB059
Lsi10G004150 | Watermelon (97103) v1 | lsiwmB079
Lsi10G004150 | Cucumber (Gy14) v2 | cgyblsiB086
Lsi10G004150 | Cucumber (Gy14) v2 | cgyblsiB456
Lsi10G004150 | Melon (DHL92) v3.6.1 | lsimedB047
Lsi10G004150 | Melon (DHL92) v3.6.1 | lsimedB068
Lsi10G004150 | Melon (DHL92) v3.6.1 | lsimedB079
Lsi10G004150 | Silver-seed gourd | carlsiB288
Lsi10G004150 | Cucumber (Chinese Long) v3 | cuclsiB104
Lsi10G004150 | Cucumber (Chinese Long) v3 | cuclsiB176
Lsi10G004150 | Cucumber (Chinese Long) v3 | cuclsiB435
Lsi10G004150 | Cucumber (Chinese Long) v3 | cuclsiB524
Lsi10G004150 | Watermelon (97103) v2 | lsiwmbB035
Lsi10G004150 | Watermelon (97103) v2 | lsiwmbB056
Lsi10G004150 | Wax gourd | lsiwgoB066
Lsi10G004150 | Wax gourd | lsiwgoB092
Lsi10G004150 | Bottle gourd (USVL1VR-Ls) | lsilsiB021
Lsi10G004150 | Bottle gourd (USVL1VR-Ls) | lsilsiB032
Lsi10G004150 | Bottle gourd (USVL1VR-Ls) | lsilsiB044
Lsi10G004150 | Cucumber (Gy14) v1 | cgylsiB145
Lsi10G004150 | Cucumber (Gy14) v1 | cgylsiB609
Lsi10G004150 | Cucurbita maxima (Rimu) | cmalsiB466
Lsi10G004150 | Cucurbita maxima (Rimu) | cmalsiB507
Lsi10G004150 | Cucurbita maxima (Rimu) | cmalsiB548
Lsi10G004150 | Cucurbita moschata (Rifu) | cmolsiB451
Lsi10G004150 | Cucurbita moschata (Rifu) | cmolsiB491
Lsi10G004150 | Cucurbita moschata (Rifu) | cmolsiB533
Lsi10G004150 | Cucurbita pepo (Zucchini) | cpelsiB195