Lsi10G004150.1 (mRNA) Bottle gourd (USVL1VR-Ls)

Name: Lsi10G004150.1
Type: mRNA
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Beta-D-glucosidase
Location: chr10 : 6290303 .. 6294263 (+)
Sequence length: 1101
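
As a rough illustration of how the Location field relates to the listed sequence length, the sketch below parses the location string and computes the genomic span. The helper names (`parse_location`, `span_length`) are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which this page does not state explicitly.

```python
import re


def parse_location(text):
    """Parse a string like 'chr10 : 6290303 .. 6294263 (+)'.

    Returns (seqid, start, end, strand). Hypothetical helper for illustration.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1


seqid, start, end, strand = parse_location("chr10 : 6290303 .. 6294263 (+)")
genomic_span = span_length(start, end)
print(seqid, strand, genomic_span)
```

Under that assumption the genomic span is longer than the listed sequence length of 1101, which is consistent with the gene sequence below being given with introns.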
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTCGTGCTAAAAATGATTAATGACAACACGCCAGACTACTCATACAAACGCTTGACAAAATTGACAACATTCGAATGATTTGCTTAATTTTTATTAGTTAATTTGTTTGCTAACAACTTTTAACAAGTAGTTTTAAACATACTTTGGGAGTCTTAATTAAAATTGAAGAAAAAAAATAATAACATTGCATAATGACAATTTTATACCAAAGTATTATAAATGTAATATAAATATAGATATCCATAACTTATATAGGTTGCAAAATATCATAAAGCTCCCATTGAATTTCATGTTGTAACATACCCCTCATTGAATTCAATCTTGTATGTTTGAATTCAATCTTATAACATACACCATCCCATATCCTATAAATTGTGGTATATGGTGCCTTTGTAATACACACCACAATTGAGTTCAATTCTTATCTCTACTTCCTAATTCTCTCTACTTTGTTGTTTTGTTCTTTCTTTGTTCTTCTTTTGTTCATATTTTATAACACAAAGTAATATTTGACACTATTATTCTTTGTTGATTTTTTGTGTTACTTTTGAAATCCTATATTAATTCTTTAGGTTAACTAACAATTTGAATTAAGTTTGTGTCGAGAAGGTTCTTGAATTTTAAAAAATATCTAATAAATTTCCAAATTTTAATTTTATCTAATAAATCCCTAAACTTTAGAGTCAAATTTTAAGTGTAATAGAATATTAAACTTTCAATTTTGTGTAAGTAGGTCCATGAATCTAAAGAAAATGTTGAATAGGTCAAGAATCCATTAAACACAAAATTAAAATGCAAGAAGCTATTGGACAAAAAATTACAAAATTAAGGACCATTAATATATTAGACACAACATTGAAAGAATAGAGACTTATTAGACAATCAAATTTGAGAACACACAAAAATTTAGAAACTAAACTTATAATTAAAATAATAAACCACCATTGATTTTAGTTTTAACTAGAATTAACATGATTTTAAAGACTAAAAATTAGGAATAGTTGCAAATATAGCAATCTGATTAAAAGTATTAGCATATATAGCAACATTTTAAAAAAATTGCAAATATAGCAAAGTCATGGTCTAGGTGGTCTATCACTGATAGACCATAAGAGTACTATAGATTTTGCTATATTTGCAAATTTTTTTTAAAATGTTGCTATACACTTGGTTATTATCTCTAAAATTGCTACTCATTGCACTTAGCCTAAAAATTATAAGCAAAACCAATACTCAACGAGCGAATCAGCATATACTGGTTTCAATTCGACTAGAAAAAACTATCACCACCTATTTGTTTTTCAAAACAGAAATATTTCTGAGCATACAACCCAGAGGTTCTCTCATACCAACAGCAACAATTAAGTTAAAAAATATTTCTCAAAGAGGATGATCATATATAGAAATGAGGTTGAGAATCAGTATTTCAAAACCCGTTGGAGGTTACCTAAAATGGAGGGGAAACAGAAATTCAAGGACAACCAAACTGACTATAAAAAAAAAACATAACTAACCTAAAGATAGATCTGAAATGAATTCCTGTTAGGTTTCATTGGCCAAGAAGATGATGAAGGTGACAGTGGTTTTACTCTGTTGCTGGGCGGCTTTGGTGGCTGCTGATGAAGACTATGTCAAGTACAAGGACCCGATACAACCGCTTAACATCCGGATCAAAGACCTAATGGATAGAATGACTCTAGCAGACAAGCTTGGGCAGATGGCACAGTTGGATCGTTCGGTTGTAACACCAGAGATCATGAGAGATTACTCCATTGGCAGTGTGCTTAGCGCCGGAGGCAGTGTCCCATCACCACAGGCTACTGCCCAGAAGTGGATTGACATGGTGAATTCAATCCAAAAGGGTTCCTTATCAAGTAGGCTTGGAATTCCAATGTTATATGGTATCGATGCTGTCCATGGAAATAACAATGTCTACAATGCCACAGTCTTTCCCCACAATGTCGGTCTTGGCGCAACCAGGTGATAATACATGAGAG
GAGGATCCAAACTACATTTTTGAACTACCAACGGTATGGCCCATGTCGAGCATCGACACAGAGTTACTGGTGGCTAAATAGTTTTCTCATGTGACAGGGAACCTGAACTTTTAAGGAGGATTGGTGCTGCTACTGCTAAAGAAGTTAAAGCAACTGGGATTGATTATGTCTTTGCTCCATGCATAGCAGTATGTTTTCTAGAATTCATTTCCCCCTCTTTATATAAAACTTGTCTGATTCAAACAGAACTACGAGCATTTTTGTTTCAGGTCTGTAGAGATCCTAGATGGGGAAGGTGCTATGAAAGCTACAGTGAAGATCCTAACATTGTCAAAGAAATGACAGATATCATAGTTGGGCTGCAAGGACAAATCCCATCTGGTTTTTCAAAAGGTATTCCATATGTTGGTGGAAGGTAAGTTTTCAACCACTAAAGATTTGGACATTTTTCCATTTCACAAAACTTACTCAAAAAACCTCTTGGCCTTGTAACAGAGACAAGGTTGCAGCTTGTGCAAAGCATTATGTGGGCGATGGCAGCACAACAAGGGGTATCAACGAGAACAACACCGTAATTAGCAGGCATGAATTGTTGAGCATTCACATGCCAGGATACTATCACTCCATAATCAAAGGTGTCTCTACAATAATGGTTTCCTACTCCAGTTGGAACGGTGAGAAGATGCATTCAAACCATGAACTTATCACTGATTTCCTTAAGAACACTCTAAACTTCAGGGTATGTATTGCACTAAATCCTCATGGATGATAAACCTATATACAAGCCCCTAAATTTGAACGGACATAACTATTGCAGGGTTTTGTAATCTCCGATTGGCAAGGTATTGATAAAATCACAGACCCAGCTCATTCAAATTACACATTTTCAATTCTCTCTGGAGTTCAAGCTGGAATAGATATGGTAAGGTAATCAACATAGGCAATCCAATCAACAGAGCATCTATAGATCTTATAAAACTAGTTTTACTGTTGTGTTTCAGGTTATGGTTCCTACAAATTACACAGAGTTCATCGATAACCTTACCTACCTTGTCAACAGCAACGCCATTCCGATGTCTCGAATCAACGACGCCGTGAGAAGGATTTTGAGAGTCAAGTTTGTAATGGGCCTGTTTGAGAATCCATTGGCCGATGACAGATTGGTAAATGAGCTTGGAAGCCAGGTTAGTTGACATGAAAACACAAAGAAATGTAGTTGAAAATTGAAATATCATTTTGAAAAAGATACCAATTCAGCTGAACAGGAACACAAAGATTTGGCAAGAGAAGCAGTGAGGAAATCACTTGTTTTACTGAAGAATGGCGAAAATGCTGATGAACCAGTCCTTCCTCTGTCGAAGAAGGCAGCGAAGATCTTAGTAGCTGGAACTCACGCCGACAATCTTGGTTACCAGTGCGGCGGCTGGACAATCACCTGGCAAGGACTCAGCGGCAACAATCTCACAACCGGAACCACCATTCTCGAGGCAGTGAAGAAAACCGTCGATCCAAACACGGAGGTCATCTACAATGTAAATCCGACGACTGATTACCTCAAGGCAAACAACTTCTCGTACGCCATTGTCGTGGTAGGAGAGACGCCGTACGCCGAGACCGATGGCGACAACCTGAACCTGACTATCGCCGAAGGAGGTTCGGACACGATCCAGAACGTGTGCAACGTTGTGAAGTGTGTCGTCGTCATCGTCTCCGGCCGACCTCTGACGATTGAGCCGCACATGTCGCAGTTGGACGCGCTGGTGGCGGCGTGGCTGCCGGGAACAGAGGGGGAGGGCGTGACCGACGTGCTGTTCGGTGATTATGGATTCACCGGTAAGCTGGCAAGGACGTGGTTCAAGACGGTGGATCAACTTCCGATGAACTATGGCGATGAGAATTACAATCCGCTTTTCCCTCTAGGATTTGGGCTTACAACTGAGCCTGTTAATAAAGCAAGCTAG

mRNA sequence

ATGCTCGTGACAGTGGTTTTACTCTGTTGCTGGGCGGCTTTGGTGGCTGCTGATGAAGACTATGTCAAGTACAAGGACCCGATACAACCGCTTAACATCCGGATCAAAGACCTAATGGATAGAATGACTCTAGCAGACAAGCTTGGGCAGATGGCACAGTTGGATCGTTCGGTTGTAACACCAGAGATCATGAGAGATTACTCCATTGGCAGTGTGCTTAGCGCCGGAGGCAGTGTCCCATCACCACAGGCTACTGCCCAGAAGTGGATTGACATGGTTATGGTTCCTACAAATTACACAGAGTTCATCGATAACCTTACCTACCTTGTCAACAGCAACGCCATTCCGATGTCTCGAATCAACGACGCCGTGAGAAGGATTTTGAGAGTCAAGTTTGTAATGGGCCTGTTTGAGAATCCATTGGCCGATGACAGATTGGTAAATGAGCTTGGAAGCCAGAATGGCGAAAATGCTGATGAACCAGTCCTTCCTCTGTCGAAGAAGGCAGCGAAGATCTTAGTAGCTGGAACTCACGCCGACAATCTTGGTTACCAGTGCGGCGGCTGGACAATCACCTGGCAAGGACTCAGCGGCAACAATCTCACAACCGGAACCACCATTCTCGAGGCAGTGAAGAAAACCGTCGATCCAAACACGGAGGTCATCTACAATGTAAATCCGACGACTGATTACCTCAAGGCAAACAACTTCTCGTACGCCATTGTCGTGGTAGGAGAGACGCCGTACGCCGAGACCGATGGCGACAACCTGAACCTGACTATCGCCGAAGGAGGTTCGGACACGATCCAGAACGTGTGCAACGTTGTGAAGTGTGTCGTCGTCATCGTCTCCGGCCGACCTCTGACGATTGAGCCGCACATGTCGCAGTTGGACGCGCTGGTGGCGGCGTGGCTGCCGGGAACAGAGGGGGAGGGCGTGACCGACGTGCTGTTCGGTGATTATGGATTCACCGGTAAGCTGGCAAGGACGTGGTTCAAGACGGTGGATCAACTTCCGATGAACTATGGCGATGAGAATTACAATCCGCTTTTCCCTCTAGGATTTGGGCTTACAACTGAGCCTGTTAATAAAGCAAGCTAG

Coding sequence (CDS)

ATGCTCGTGACAGTGGTTTTACTCTGTTGCTGGGCGGCTTTGGTGGCTGCTGATGAAGACTATGTCAAGTACAAGGACCCGATACAACCGCTTAACATCCGGATCAAAGACCTAATGGATAGAATGACTCTAGCAGACAAGCTTGGGCAGATGGCACAGTTGGATCGTTCGGTTGTAACACCAGAGATCATGAGAGATTACTCCATTGGCAGTGTGCTTAGCGCCGGAGGCAGTGTCCCATCACCACAGGCTACTGCCCAGAAGTGGATTGACATGGTTATGGTTCCTACAAATTACACAGAGTTCATCGATAACCTTACCTACCTTGTCAACAGCAACGCCATTCCGATGTCTCGAATCAACGACGCCGTGAGAAGGATTTTGAGAGTCAAGTTTGTAATGGGCCTGTTTGAGAATCCATTGGCCGATGACAGATTGGTAAATGAGCTTGGAAGCCAGAATGGCGAAAATGCTGATGAACCAGTCCTTCCTCTGTCGAAGAAGGCAGCGAAGATCTTAGTAGCTGGAACTCACGCCGACAATCTTGGTTACCAGTGCGGCGGCTGGACAATCACCTGGCAAGGACTCAGCGGCAACAATCTCACAACCGGAACCACCATTCTCGAGGCAGTGAAGAAAACCGTCGATCCAAACACGGAGGTCATCTACAATGTAAATCCGACGACTGATTACCTCAAGGCAAACAACTTCTCGTACGCCATTGTCGTGGTAGGAGAGACGCCGTACGCCGAGACCGATGGCGACAACCTGAACCTGACTATCGCCGAAGGAGGTTCGGACACGATCCAGAACGTGTGCAACGTTGTGAAGTGTGTCGTCGTCATCGTCTCCGGCCGACCTCTGACGATTGAGCCGCACATGTCGCAGTTGGACGCGCTGGTGGCGGCGTGGCTGCCGGGAACAGAGGGGGAGGGCGTGACCGACGTGCTGTTCGGTGATTATGGATTCACCGGTAAGCTGGCAAGGACGTGGTTCAAGACGGTGGATCAACTTCCGATGAACTATGGCGATGAGAATTACAATCCGCTTTTCCCTCTAGGATTTGGGCTTACAACTGAGCCTGTTAATAAAGCAAGCTAG

Protein sequence

MLVTVVLLCCWAALVAADEDYVKYKDPIQPLNIRIKDLMDRMTLADKLGQMAQLDRSVVTPEIMRDYSIGSVLSAGGSVPSPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSNAIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNELGSQNGENADEPVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQGLSGNNLTTGTTILEAVKKTVDPNTEVIYNVNPTTDYLKANNFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNVVKCVVVIVSGRPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTVDQLPMNYGDENYNPLFPLGFGLTTEPVNKAS
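
The relationship between the coding sequence and the protein sequence above can be checked with a small translation sketch. It assumes the standard genetic code (NCBI translation table 1), which this page does not name, and the function `translate` is a hypothetical helper rather than anything used by the database. Only the first 60 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nucleotides of the coding sequence listed above.
cds_prefix = "ATGCTCGTGACAGTGGTTTTACTCTGTTGCTGGGCGGCTTTGGTGGCTGCTGATGAAGAC"
protein_prefix = translate(cds_prefix)
print(protein_prefix)
```

The same function can be applied to the full CDS. A 1101-nucleotide CDS holds 367 codons, so with a terminal stop codon it would encode 366 residues.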
BLAST of Lsi10G004150.1 vs. Swiss-Prot
Match: BGH3B_BACO1 (Beta-glucosidase BoGH3B OS=Bacteroides ovatus (strain ATCC 8483 / DSM 1896 / JCM 5824 / NCTC 11153) GN=BACOVA_02659 PE=1 SV=1)

HSP 1 Score: 106.3 bits (264), Expect = 7.2e-22
Identity = 91/319 (28.53%), Positives = 152/319 (47.65%), Query Frame = 1

Query: 90  IDMVMVPTNYTEFIDNLTYLVNSNAIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNE 149
           IDM MVP   + F D L  LV    + M RI+DAV R+LR+K+ +GLF++P  D +  ++
Sbjct: 343 IDMSMVPYEVS-FCDYLKELVEEGEVSMERIDDAVARVLRLKYRLGLFDHPYWDIKKYDK 402

Query: 150 LGS--------QNGENA------DEPVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQG 209
            GS        Q  E +      D  +LP++ K  KIL+ G +A+++    GGW+ +WQG
Sbjct: 403 FGSKEFAAVALQAAEESEVLLKNDGNILPIA-KGKKILLTGPNANSMRCLNGGWSYSWQG 462

Query: 210 -LSGNNLTTGTTILEAVKKTVDPNTEVIYNVNPTTDYLKANNF----------------- 269
            ++        TI EA+ +       +IY    T    K +N+                 
Sbjct: 463 HVADEYAQAYHTIYEALCEKYG-KENIIYEPGVTYASYKNDNWWEENKPETEKPVAAAAQ 522

Query: 270 -SYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNVVKCVVVIVS-GRPLTIEPHMS 329
               I  +GE  Y ET G+  +LT++E   + ++ +    K +V++++ GRP  I   + 
Sbjct: 523 ADIIITCIGENSYCETPGNLTDLTLSENQRNLVKALAATGKPIVLVLNQGRPRIINDIVP 582

Query: 330 QLDALVAAWLPGT-EGEGVTDVLFGDYGFTGKLARTW-----------FKTVDQLPMNYG 359
              A+V   LP    G+ + ++L GD  F+GK+  T+           +K  + +    G
Sbjct: 583 LAKAVVNIMLPSNYGGDALANLLAGDANFSGKMPFTYPRLINALATYDYKPCENMGQMGG 642
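
The percentages reported on the Identity line of each HSP are the match counts divided by the alignment length. A minimal sketch of that arithmetic for the HSP above follows. The helper name `percent` is hypothetical, and the denominator is assumed to be the aligned length including gap columns, as is usual for BLAST reports.

```python
def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage, rounded to 2 places."""
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the BGH3B_BACO1 HSP above.
identity = percent(91, 319)
positives = percent(152, 319)
print(identity, positives)
```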

BLAST of Lsi10G004150.1 vs. Swiss-Prot
Match: GLUA_DICDI (Lysosomal beta glucosidase OS=Dictyostelium discoideum GN=gluA PE=1 SV=2)

HSP 1 Score: 102.8 bits (255), Expect = 8.0e-21
Identity = 91/313 (29.07%), Positives = 152/313 (48.56%), Query Frame = 1

Query: 90  IDMVMVPTNYTEFIDNLTYLVNSNAIPMSRINDAVRRILRVKFVMGLFENPLADDR--LV 149
           IDM MVP + + F   L  +V +  +P SR++ +VRRIL +K+ +GLF NP  +    +V
Sbjct: 392 IDMSMVPLDLS-FPIILAEMVAAGTVPESRLDLSVRRILNLKYALGLFSNPYPNPNAAIV 451

Query: 150 NELGSQNGENA--------------DEPVLPLSKKAAK-ILVAGTHADNLGYQCGGWTIT 209
           + +G      A                 +LPL+    K +L+ G  AD++    GGW++ 
Sbjct: 452 DTIGQVQDREAAAATAEESITLLQNKNNILPLNTNTIKNVLLTGPSADSIRNLNGGWSVH 511

Query: 210 WQGL-SGNNLTTGTTILEAVKK----TVDPNTE--VIYNVNPTTDYLK-------ANNFS 269
           WQG    +    GT+IL  +++    T D N +  + + +   T+          A +  
Sbjct: 512 WQGAYEDSEFPFGTSILTGLREITNDTADFNIQYTIGHEIGVPTNQTSIDEAVELAQSSD 571

Query: 270 YAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNVVKCVV-VIVSGRPLTIEPHM-SQ 329
             +VV+GE P AET GD  +L++       +Q + +  K VV ++V  RP  + P +   
Sbjct: 572 VVVVVIGELPEAETPGDIYDLSMDPNEVLLLQQLVDTGKPVVLILVEARPRILPPDLVYS 631

Query: 330 LDALVAAWLPGTE-GEGVTDVLFGDYGFTGKLARTWFKTVDQLPMNY---GDEN--YNPL 364
             A++ A+LPG+E G+ + ++L G+   +G+L  T+  T   + + Y     EN    PL
Sbjct: 632 CAAVLMAYLPGSEGGKPIANILMGNVNPSGRLPLTYPGTTGDIGVPYYHKYSENGVTTPL 691

BLAST of Lsi10G004150.1 vs. Swiss-Prot
Match: BGLX_SALTY (Periplasmic beta-glucosidase OS=Salmonella typhimurium (strain LT2 / SGSC1412 / ATCC 700720) GN=bglX PE=3 SV=2)

HSP 1 Score: 98.2 bits (243), Expect = 2.0e-19
Identity = 90/342 (26.32%), Positives = 144/342 (42.11%), Query Frame = 1

Query: 90  IDMVMVPTNYTEFIDNLTYLVNSNAIPMSRINDAVRRILRVKFVMGLFENPLA------- 149
           +DM M    Y++++  L   + S  + M+ ++DA R +L VK+ MGLF +P +       
Sbjct: 315 VDMSMADEYYSKYLPGL---IKSGKVTMAELDDATRHVLNVKYDMGLFNDPYSHLGPKES 374

Query: 150 ---DDRLVNELGSQNG-ENADEPVLPLS--------KKAAKILVAGTHADNLGYQCGGWT 209
              D    + L  +   E A E V+ L         KK+  I V G  AD+     G W+
Sbjct: 375 DPVDTNAESRLHRKEAREVARESVVLLKNRLETLPLKKSGTIAVVGPLADSQRDVMGSWS 434

Query: 210 ITWQGLSGNNLTTGTTILEAVKKTVDPNTEVIY-------NVNPTTDYLK---------- 269
               G++  ++T    +L  ++  V    +++Y       N     D+L           
Sbjct: 435 AA--GVANQSVT----VLAGIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDP 494

Query: 270 -------------ANNFSYAIVVVGETP-YAETDGDNLNLTIAEGGSDTIQNVCNVVK-C 329
                        A      + VVGE+   A       N+TI +   D I  +    K  
Sbjct: 495 RSPQAMIDEAVQAAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGKPL 554

Query: 330 VVVIVSGRPLTIEPHMSQLDALVAAWLPGTEG-EGVTDVLFGDYGFTGKLARTWFKTVDQ 359
           V+V+++GRPL +     Q DA++  W  GTEG   + DVLFGDY  +GKL  ++ ++V Q
Sbjct: 555 VLVLMNGRPLALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPISFPRSVGQ 614

BLAST of Lsi10G004150.1 vs. Swiss-Prot
Match: BGLX_ECOLI (Periplasmic beta-glucosidase OS=Escherichia coli (strain K12) GN=bglX PE=3 SV=2)

HSP 1 Score: 76.6 bits (187), Expect = 6.1e-13
Identity = 88/344 (25.58%), Positives = 146/344 (42.44%), Query Frame = 1

Query: 90  IDMVMVPTNYTEFIDNLTYLVNSNAIPMSRINDAVRRILRVKFVMGLFENPLA------- 149
           I+M M    Y++++  L   + S  + M+ ++DA R +L VK+ MGLF +P +       
Sbjct: 315 INMSMSDEYYSKYLPGL---IKSGKVTMAELDDAARHVLNVKYDMGLFNDPYSHLGPKES 374

Query: 150 ---DDRLVNELGSQNG-ENADEPVLPLS--------KKAAKILVAGTHADNLGYQCGGWT 209
              D    + L  +   E A E ++ L         KK+A I V G  AD+     G W+
Sbjct: 375 DPVDTNAESRLHRKEAREVARESLVLLKNRLETLPLKKSATIAVVGPLADSKRDVMGSWS 434

Query: 210 ITWQGLSGNNLTTGTTILEAVKKTVDPNTEVIY--NVNPTT-----DYLKANNFSYAIVV 269
               G++  ++    T+L  +K  V  N +V+Y    N T+     D+L  N +  A+ V
Sbjct: 435 AA--GVADQSV----TVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFL--NQYEEAVKV 494

Query: 270 VGETPYAETD-----GDNLNLTIA----------EGGSDTIQNVCNVVKCVVVIV--SGR 329
              +P    D         ++ +A          E  S T   +    + ++  +  +G+
Sbjct: 495 DPRSPQEMIDEAVQTAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIAALKATGK 554

Query: 330 PLTI-----EPHM-----SQLDALVAAWLPGTE-GEGVTDVLFGDYGFTGKLARTWFKTV 359
           PL +      P        Q DA++  W  GTE G  + DVLFGDY  +GKL  ++ ++V
Sbjct: 555 PLVLVLMNGRPLALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPMSFPRSV 614

BLAST of Lsi10G004150.1 vs. Swiss-Prot
Match: XYL3A_PRER2 (Xylan 1,4-beta-xylosidase OS=Prevotella ruminicola (strain ATCC 19189 / JCM 8958 / 23) GN=xyl3A PE=1 SV=1)

HSP 1 Score: 66.6 bits (161), Expect = 6.3e-10
Identity = 41/99 (41.41%), Positives = 54/99 (54.55%), Query Frame = 1

Query: 279 VVVIVSGRPLTIEPHMSQLDALVAAWLPGTEG-EGVTDVLFGDYGFTGKLARTWFKTVDQ 338
           + V  SG  + ++P     DA+V AW PG EG   V DVLFGDY   GKL+ T++K   Q
Sbjct: 656 IYVNCSGSAIALQPETESCDAIVQAWYPGQEGGTAVADVLFGDYNPGGKLSVTFYKNDQQ 715

Query: 339 LPMNYGDENY---------NPLFPLGFGL--TTEPVNKA 366
           LP +Y D +          + LFP G+GL  TT  V +A
Sbjct: 716 LP-DYEDYSMKGRTYRYFDDALFPFGYGLSYTTFEVGEA 753

BLAST of Lsi10G004150.1 vs. TrEMBL
Match: B9SIA5_RICCO (Hydrolase, hydrolyzing O-glycosyl compounds, putative OS=Ricinus communis GN=RCOM_1322270 PE=4 SV=1)

HSP 1 Score: 432.6 bits (1111), Expect = 4.9e-118
Identity = 217/327 (66.36%), Positives = 251/327 (76.76%), Query Frame = 1

Query: 54  LDRSVVTPEIMRDYSIGSVLSAGGSVPSPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSN 113
           +DR    P     YS+ + +SAG             IDM+MVP NYTEFID LTYLV S 
Sbjct: 310 IDRITFPPHANYTYSVLAGISAG-------------IDMIMVPYNYTEFIDGLTYLVKSG 369

Query: 114 AIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNELGS------------------QNG 173
            IPMSRI+DAV+RILRVKFVMGLFENP AD+ LVN+LGS                  +NG
Sbjct: 370 IIPMSRIDDAVKRILRVKFVMGLFENPNADESLVNQLGSHEHRQLAREAVRKSLVLLRNG 429

Query: 174 ENADEPVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQGLSGNNLTTGTTILEAVKKTV 233
           + AD+P LPL KKA+KILVAG+HADNLGYQCGGWTI WQGL GN+LT+GTTIL A+K TV
Sbjct: 430 KYADKPSLPLPKKASKILVAGSHADNLGYQCGGWTIEWQGLGGNDLTSGTTILTAIKNTV 489

Query: 234 DPNTEVIYNVNPTTDYLKANNFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNV 293
           D +T+V+Y  NP  D++KANNFSYAIVVVGE PYAET GD++NLTIAE G  TIQNVC  
Sbjct: 490 DSSTKVVYEENPDADFVKANNFSYAIVVVGEHPYAETQGDSMNLTIAEPGPSTIQNVCGA 549

Query: 294 VKCVVVIVSGRPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTV 353
           VKCVVV+VSGRP+ I+P+++ +DALVAAWLPGTEG+GV DVLFGDYGFTGKL+ TWFKTV
Sbjct: 550 VKCVVVVVSGRPVVIQPYVNIIDALVAAWLPGTEGQGVADVLFGDYGFTGKLSHTWFKTV 609

Query: 354 DQLPMNYGDENYNPLFPLGFGLTTEPV 363
           DQLPMN GD  Y+PLFP GFGLTTEPV
Sbjct: 610 DQLPMNVGDRYYDPLFPFGFGLTTEPV 623

BLAST of Lsi10G004150.1 vs. TrEMBL
Match: Q7XAS3_GOSHI (Beta-D-glucosidase OS=Gossypium hirsutum PE=2 SV=1)

HSP 1 Score: 432.6 bits (1111), Expect = 4.9e-118
Identity = 216/329 (65.65%), Positives = 255/329 (77.51%), Query Frame = 1

Query: 54  LDRSVVTPEIMRDYSIGSVLSAGGSVPSPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSN 113
           LDR    P     YS+ + + AG             IDMVMVP N+TEFID+LTY V +N
Sbjct: 312 LDRITSPPHANYSYSVEAGVGAG-------------IDMVMVPYNFTEFIDDLTYQVKNN 371

Query: 114 AIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNELGSQ------------------NG 173
            IPMSRI+DAV+RILRVKFVMGLFENP+AD+ LVN+LGSQ                  NG
Sbjct: 372 IIPMSRIDDAVKRILRVKFVMGLFENPMADNSLVNQLGSQEHRELAREAVRKSLVLLKNG 431

Query: 174 ENADEPVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQGLSGNNLTTGTTILEAVKKTV 233
           E+AD+P+LPL KKA KILVAGTHADNLGYQCGGWTITWQGL GN+LTTGTTIL+AVK TV
Sbjct: 432 ESADKPLLPLPKKATKILVAGTHADNLGYQCGGWTITWQGLGGNDLTTGTTILQAVKNTV 491

Query: 234 DPNTEVIYNVNPTTDYLKANNFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNV 293
           D +T+V+Y+ NP   ++K+  FSYAIVVVGE PYAET GD+LNLTI+E G  TI NVC  
Sbjct: 492 DSSTQVVYSENPDAGFVKSGEFSYAIVVVGEPPYAETYGDSLNLTISEPGPMTIYNVCGS 551

Query: 294 VKCVVVIVSGRPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTV 353
           VKCVVV++SGRP+ ++P +S +DALVAAWLPGTEG+GV+DVLFGDYGFTGKLARTWFKTV
Sbjct: 552 VKCVVVVISGRPVVVQPFVSSVDALVAAWLPGTEGQGVSDVLFGDYGFTGKLARTWFKTV 611

Query: 354 DQLPMNYGDENYNPLFPLGFGLTTEPVNK 365
           DQLPMN GD +Y+PLFP GFGLTT+P ++
Sbjct: 612 DQLPMNVGDPHYDPLFPFGFGLTTKPTHQ 627

BLAST of Lsi10G004150.1 vs. TrEMBL
Match: A0A0D2S1K4_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_006G118100 PE=4 SV=1)

HSP 1 Score: 432.6 bits (1111), Expect = 4.9e-118
Identity = 215/324 (66.36%), Positives = 253/324 (78.09%), Query Frame = 1

Query: 54  LDRSVVTPEIMRDYSIGSVLSAGGSVPSPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSN 113
           LDR    P     YS+ + + AG             IDMVMVP N+TEFID+LTY V +N
Sbjct: 312 LDRITSPPHANYSYSVEAGVGAG-------------IDMVMVPYNFTEFIDDLTYQVKNN 371

Query: 114 AIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNELGSQ-------------NGENADE 173
            IPMSRI+DAV+RILRVKFVMGLFENP+AD+ LVN+LGSQ             NGE+AD+
Sbjct: 372 IIPMSRIDDAVKRILRVKFVMGLFENPMADNSLVNQLGSQAREAVRKSLVLLKNGESADK 431

Query: 174 PVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQGLSGNNLTTGTTILEAVKKTVDPNTE 233
           P+LPL KKA KILVAGTHADNLGYQCGGWTITWQGL GN+LTTGTTIL+AVK TVD +T+
Sbjct: 432 PLLPLPKKATKILVAGTHADNLGYQCGGWTITWQGLGGNDLTTGTTILQAVKNTVDSSTQ 491

Query: 234 VIYNVNPTTDYLKANNFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNVVKCVV 293
           V+Y+ NP   ++K+  FSYAIVVVGE PYAET GD+LNLTI+E G  TI NVC  VKCVV
Sbjct: 492 VVYSENPDAGFVKSGEFSYAIVVVGEPPYAETYGDSLNLTISEPGPKTIYNVCGSVKCVV 551

Query: 294 VIVSGRPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTVDQLPM 353
           V++SGRP+ ++P +S + ALVAAWLPGTEG+GV DVLFGDYGFTGKLARTWFKTVDQLPM
Sbjct: 552 VVISGRPVVVQPFVSSVHALVAAWLPGTEGQGVADVLFGDYGFTGKLARTWFKTVDQLPM 611

Query: 354 NYGDENYNPLFPLGFGLTTEPVNK 365
           N GD +Y+PLFP GFGLTT+P ++
Sbjct: 612 NVGDSHYDPLFPFGFGLTTKPTHQ 622

BLAST of Lsi10G004150.1 vs. TrEMBL
Match: D7U8L2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_08s0032g00470 PE=4 SV=1)

HSP 1 Score: 431.0 bits (1107), Expect = 1.4e-117
Identity = 217/326 (66.56%), Positives = 246/326 (75.46%), Query Frame = 1

Query: 54  LDRSVVTPEIMRDYSIGSVLSAGGSVPSPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSN 113
           +DR    P     YSI + + AG             IDM+MVP NYTEFID LTY V S 
Sbjct: 320 IDRITSPPHANYSYSIEAGIKAG-------------IDMIMVPYNYTEFIDGLTYQVKSK 379

Query: 114 AIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNELGSQ------------------NG 173
            IPMSRI+DAVRRILRVKFVMGLFE+PLAD  LV+ELGSQ                  NG
Sbjct: 380 IIPMSRIDDAVRRILRVKFVMGLFESPLADHSLVHELGSQVHRELAREAVRKSLVLLKNG 439

Query: 174 ENADEPVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQGLSGNNLTTGTTILEAVKKTV 233
           E AD+P+LPL KKA KILVAGTHADNLG QCGGWTI WQGLSGNNLT+GTTIL A+KKTV
Sbjct: 440 EPADKPLLPLPKKAPKILVAGTHADNLGNQCGGWTIEWQGLSGNNLTSGTTILSAIKKTV 499

Query: 234 DPNTEVIYNVNPTTDYLKANNFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNV 293
           DP TEV+Y  NP   Y+K++ FSYAIVVVGE PYAET GDNLNLTI + G   I NVC  
Sbjct: 500 DPKTEVVYKENPDLSYVKSSKFSYAIVVVGEPPYAETFGDNLNLTIPDPGPSIITNVCGA 559

Query: 294 VKCVVVIVSGRPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTV 353
           VKCVV+++SGRPL I+P++ Q+DALVAAWLPGTEG+GV DVLFGDYGFTGKL+RTWF+TV
Sbjct: 560 VKCVVIVISGRPLVIQPYVDQIDALVAAWLPGTEGQGVADVLFGDYGFTGKLSRTWFRTV 619

Query: 354 DQLPMNYGDENYNPLFPLGFGLTTEP 362
           +QLPMN GD +Y+PLFP GFGLTTEP
Sbjct: 620 EQLPMNVGDRHYDPLFPFGFGLTTEP 632

BLAST of Lsi10G004150.1 vs. TrEMBL
Match: A0A067KU67_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_02702 PE=4 SV=1)

HSP 1 Score: 431.0 bits (1107), Expect = 1.4e-117
Identity = 211/328 (64.33%), Positives = 254/328 (77.44%), Query Frame = 1

Query: 54  LDRSVVTPEIMRDYSIGSVLSAGGSVPSPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSN 113
           +DR    P     YSI + +SAG  + SP       I  +MVP NYTEFID LT  V  N
Sbjct: 323 IDRITSPPHANYTYSIQAGISAGIDMASP-------ILQIMVPFNYTEFIDGLTDQVKKN 382

Query: 114 AIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNELGS------------------QNG 173
            IPMSRI+DAV+RILRVKF MGLFENP AD+ LVN+LGS                  +NG
Sbjct: 383 IIPMSRIDDAVKRILRVKFTMGLFENPYADESLVNQLGSQEHRELAREAVRKSLVLLKNG 442

Query: 174 ENADEPVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQGLSGNNLTTGTTILEAVKKTV 233
           +NA+EP+LPL KK++KILVAG+HADNLGYQCGGWTI WQGLSGNN T+GTTIL A+K TV
Sbjct: 443 KNANEPLLPLPKKSSKILVAGSHADNLGYQCGGWTIEWQGLSGNNHTSGTTILTAIKNTV 502

Query: 234 DPNTEVIYNVNPTTDYLKANNFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNV 293
           DP+T+++YN NP  D++K+N FSYAIVVVGE PYAET GD++NLT++  G  TIQNVC  
Sbjct: 503 DPSTKIVYNENPDADFVKSNKFSYAIVVVGEHPYAETQGDSMNLTLSNPGPSTIQNVCGA 562

Query: 294 VKCVVVIVSGRPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTV 353
           VKCVV++VSGRP+ ++P+++ ++ALVAAWLPGTEG+GV DVLFGDYGFTGKL+RTWFK+V
Sbjct: 563 VKCVVIVVSGRPVVMQPYVNSIEALVAAWLPGTEGQGVADVLFGDYGFTGKLSRTWFKSV 622

Query: 354 DQLPMNYGDENYNPLFPLGFGLTTEPVN 364
           DQLPMN GD NY+PLFP GFGLTTEPVN
Sbjct: 623 DQLPMNVGDRNYDPLFPFGFGLTTEPVN 643

BLAST of Lsi10G004150.1 vs. TAIR10
Match: AT5G20940.1 (AT5G20940.1 Glycosyl hydrolase family protein)

HSP 1 Score: 394.0 bits (1011), Expect = 9.8e-110
Identity = 205/324 (63.27%), Positives = 238/324 (73.46%), Query Frame = 1

Query: 56  RSVVTPEIMRDYSIGSVLSAGGSVPSPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSNAI 115
           R +V  + +    I + L A  S  S  A     +DM M  +N T+ ID LT  V    I
Sbjct: 305 RGIVISDYLGVDQINTPLGANYS-HSVYAATTAGLDMFMGSSNLTKLIDELTSQVKRKFI 364

Query: 116 PMSRINDAVRRILRVKFVMGLFENPLADDRLVNELGS------------------QNGEN 175
           PMSRI+DAV+RILRVKF MGLFENP+AD  L  +LGS                  +NGEN
Sbjct: 365 PMSRIDDAVKRILRVKFTMGLFENPIADHSLAKKLGSKEHRELAREAVRKSLVLLKNGEN 424

Query: 176 ADEPVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQGLSGNNLTTGTTILEAVKKTVDP 235
           AD+P+LPL KKA KILVAGTHADNLGYQCGGWTITWQGL+GNNLT GTTIL AVKKTVDP
Sbjct: 425 ADKPLLPLPKKANKILVAGTHADNLGYQCGGWTITWQGLNGNNLTIGTTILAAVKKTVDP 484

Query: 236 NTEVIYNVNPTTDYLKANNFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNVVK 295
            T+VIYN NP T+++KA +F YAIV VGE PYAE  GD+ NLTI+E G  TI NVC  VK
Sbjct: 485 KTQVIYNQNPDTNFVKAGDFDYAIVAVGEKPYAEGFGDSTNLTISEPGPSTIGNVCASVK 544

Query: 296 CVVVIVSGRPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTVDQ 355
           CVVV+VSGRP+ ++  +S +DALVAAWLPGTEG+GV DVLFGDYGFTGKLARTWFKTVDQ
Sbjct: 545 CVVVVVSGRPVVMQ--ISNIDALVAAWLPGTEGQGVADVLFGDYGFTGKLARTWFKTVDQ 604

Query: 356 LPMNYGDENYNPLFPLGFGLTTEP 362
           LPMN GD +Y+PL+P GFGL T+P
Sbjct: 605 LPMNVGDPHYDPLYPFGFGLITKP 625

BLAST of Lsi10G004150.1 vs. TAIR10
Match: AT5G20950.1 (AT5G20950.1 Glycosyl hydrolase family protein)

HSP 1 Score: 385.6 bits (989), Expect = 3.5e-107
Identity = 191/326 (58.59%), Positives = 233/326 (71.47%), Query Frame = 1

Query: 54  LDRSVVTPEIMRDYSIGSVLSAGGSVPSPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSN 113
           +DR    P +   YS+ + +SAG             IDM+MVP NYTEFID ++  +   
Sbjct: 309 IDRITTPPHLNYSYSVYAGISAG-------------IDMIMVPYNYTEFIDEISSQIQKK 368

Query: 114 AIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNELGSQ------------------NG 173
            IP+SRI+DA++RILRVKF MGLFE PLAD    N+LGS+                  NG
Sbjct: 369 LIPISRIDDALKRILRVKFTMGLFEEPLADLSFANQLGSKEHRELAREAVRKSLVLLKNG 428

Query: 174 ENADEPVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQGLSGNNLTTGTTILEAVKKTV 233
           +   +P+LPL KK+ KILVAG HADNLGYQCGGWTITWQGL+GN+ T GTTIL AVK TV
Sbjct: 429 KTGAKPLLPLPKKSGKILVAGAHADNLGYQCGGWTITWQGLNGNDHTVGTTILAAVKNTV 488

Query: 234 DPNTEVIYNVNPTTDYLKANNFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNV 293
            P T+V+Y+ NP  +++K+  F YAIVVVGE PYAE  GD  NLTI++ G   I NVC  
Sbjct: 489 APTTQVVYSQNPDANFVKSGKFDYAIVVVGEPPYAEMFGDTTNLTISDPGPSIIGNVCGS 548

Query: 294 VKCVVVIVSGRPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTV 353
           VKCVVV+VSGRP+ I+P++S +DALVAAWLPGTEG+GV D LFGDYGFTGKLARTWFK+V
Sbjct: 549 VKCVVVVVSGRPVVIQPYVSTIDALVAAWLPGTEGQGVADALFGDYGFTGKLARTWFKSV 608

Query: 354 DQLPMNYGDENYNPLFPLGFGLTTEP 362
            QLPMN GD +Y+PL+P GFGLTT+P
Sbjct: 609 KQLPMNVGDRHYDPLYPFGFGLTTKP 621

BLAST of Lsi10G004150.1 vs. TAIR10
Match: AT5G04885.1 (AT5G04885.1 Glycosyl hydrolase family protein)

HSP 1 Score: 365.9 bits (938), Expect = 2.8e-101
Identity = 179/300 (59.67%), Positives = 223/300 (74.33%), Query Frame = 1

Query: 81  SPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSNAIPMSRINDAVRRILRVKFVMGLFENP 140
           S +A  Q  IDMVMVP N+TEF+++LT LV +N+IP++RI+DAVRRIL VKF MGLFENP
Sbjct: 328 SVRAAIQAGIDMVMVPFNFTEFVNDLTTLVKNNSIPVTRIDDAVRRILLVKFTMGLFENP 387

Query: 141 LADDRLVNELGSQ------------------NGENADEPVLPLSKKAAKILVAGTHADNL 200
           LAD    +ELGSQ                  NG N   P+LPL +K +KILVAGTHADNL
Sbjct: 388 LADYSFSSELGSQAHRDLAREAVRKSLVLLKNG-NKTNPMLPLPRKTSKILVAGTHADNL 447

Query: 201 GYQCGGWTITWQGLSGNNLTTGTTILEAVKKTVDPNTEVIYNVNPTTDYLKANNFSYAIV 260
           GYQCGGWTITWQG SGN  T GTT+L AVK  VD +TEV++  NP  +++K+NNF+YAI+
Sbjct: 448 GYQCGGWTITWQGFSGNKNTRGTTLLSAVKSAVDQSTEVVFRENPDAEFIKSNNFAYAII 507

Query: 261 VVGETPYAETDGDNLNLTIAEGGSDTIQNVCNVVKCVVVIVSGRPLTIEPHMSQLDALVA 320
            VGE PYAET GD+  LT+ + G   I + C  VKCVVV++SGRPL +EP+++ +DALVA
Sbjct: 508 AVGEPPYAETAGDSDKLTMLDPGPAIISSTCQAVKCVVVVISGRPLVMEPYVASIDALVA 567

Query: 321 AWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTVDQLPMNYGDENYNPLFPLGFGLTTEPV 363
           AWLPGTEG+G+TD LFGD+GF+GKL  TWF+  +QLPM+YGD +Y+PLF  G GL TE V
Sbjct: 568 AWLPGTEGQGITDALFGDHGFSGKLPVTWFRNTEQLPMSYGDTHYDPLFAYGSGLETESV 626

BLAST of Lsi10G004150.1 vs. TAIR10
Match: AT3G62710.1 (AT3G62710.1 Glycosyl hydrolase family protein)

HSP 1 Score: 308.5 bits (789), Expect = 5.4e-84
Identity = 164/314 (52.23%), Positives = 212/314 (67.52%), Query Frame = 1

Query: 81  SPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSNAIPMSRINDAVRRILRVKFVMGLFENP 140
           S +A+    IDMVMVP  Y E+++ LT LVN   IPMSRI+DAVRRILRVKF +GLFEN 
Sbjct: 337 SIEASINAGIDMVMVPWAYPEYLEKLTNLVNGGYIPMSRIDDAVRRILRVKFSIGLFENS 396

Query: 141 LADDRL-VNELGSQ------------------NGENADEPVLPLSKKAAKILVAGTHADN 200
           LAD++L   E GS+                  NG+   + ++PL KK  KI+VAG HA++
Sbjct: 397 LADEKLPTTEFGSEAHREVGREAVRKSMVLLKNGKTDADKIVPLPKKVKKIVVAGRHAND 456

Query: 201 LGYQCGGWTITWQGLSG----------NNLTTG----TTILEAVKKTVDPNTEVIYNVNP 260
           +G+QCGG+++TWQG +G          + L TG    TTILEA++K VDP TEV+Y   P
Sbjct: 457 MGWQCGGFSLTWQGFNGTGEDMPTNTKHGLPTGKIKGTTILEAIQKAVDPTTEVVYVEEP 516

Query: 261 TTDYLKAN-NFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNV-VKCVVVIVSG 320
             D  K + + +Y IVVVGETPYAET GD+  L I + G DT+ + C   +KC+V++V+G
Sbjct: 517 NQDTAKLHADAAYTIVVVGETPYAETFGDSPTLGITKPGPDTLSHTCGSGMKCLVILVTG 576

Query: 321 RPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTVDQLPMNYGDE 360
           RPL IEP++  LDAL  AWLPGTEG+GV DVLFGD+ FTG L RTW K V QLPMN GD+
Sbjct: 577 RPLVIEPYIDMLDALAVAWLPGTEGQGVADVLFGDHPFTGTLPRTWMKHVTQLPMNVGDK 636

BLAST of Lsi10G004150.1 vs. TAIR10
Match: AT3G47000.1 (AT3G47000.1 Glycosyl hydrolase family protein)

HSP 1 Score: 303.1 bits (775), Expect = 2.3e-82
Identity = 156/293 (53.24%), Positives = 202/293 (68.94%), Query Frame = 1

Query: 90  IDMVMVPTNYTEFIDNLTYLVNSNAIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNE 149
           IDMVMVP  Y +FI ++T LV S  IPM+RINDAV RILRVKFV GLF +PL D  L+  
Sbjct: 317 IDMVMVPFKYEQFIQDMTDLVESGEIPMARINDAVERILRVKFVAGLFGHPLTDRSLLPT 376

Query: 150 LGSQ------------------NGENADEPVLPLSKKAAKILVAGTHADNLGYQCGGWTI 209
           +G +                  +G+NAD+P LPL + A +ILV GTHAD+LGYQCGGWT 
Sbjct: 377 VGCKEHRELAQEAVRKSLVLLKSGKNADKPFLPLDRNAKRILVTGTHADDLGYQCGGWTK 436

Query: 210 TWQGLSGNNLTTGTTILEAVKKTVDPNTEVIYNVNPTTDYLKANN-FSYAIVVVGETPYA 269
           TW GLSG  +T GTT+L+A+K+ V   TEVIY   P+ + L ++  FSYAIV VGE PYA
Sbjct: 437 TWFGLSGR-ITIGTTLLDAIKEAVGDETEVIYEKTPSKETLASSEGFSYAIVAVGEPPYA 496

Query: 270 ETDGDNLNLTIAEGGSDTIQNVCNVVKCVVVIVSGRPLTIEPH-MSQLDALVAAWLPGTE 329
           ET GDN  L I   G+D +  V  ++  +V+++SGRP+ +EP  + + +ALVAAWLPGTE
Sbjct: 497 ETMGDNSELRIPFNGTDIVTAVAEIIPTLVILISGRPVVLEPTVLEKTEALVAAWLPGTE 556

Query: 330 GEGVTDVLFGDYGFTGKLARTWFKTVDQLPMNYGDENYNPLFPLGFGLTTEPV 363
           G+GV DV+FGDY F GKL  +WFK V+ LP++    +Y+PLFP GFGL ++PV
Sbjct: 557 GQGVADVVFGDYDFKGKLPVSWFKHVEHLPLDAHANSYDPLFPFGFGLNSKPV 608

BLAST of Lsi10G004150.1 vs. NCBI nr
Match: gi|802585432|ref|XP_012070424.1| (PREDICTED: lysosomal beta glucosidase-like [Jatropha curcas])

HSP 1 Score: 433.7 bits (1114), Expect = 3.2e-118
Identity = 212/328 (64.63%), Positives = 253/328 (77.13%), Query Frame = 1

Query: 54  LDRSVVTPEIMRDYSIGSVLSAGGSVPSPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSN 113
           +DR    P     YSI + +SAG             IDM+MVP NYTEFID LT  V  N
Sbjct: 312 IDRITSPPHANYTYSIQAGISAG-------------IDMIMVPFNYTEFIDGLTDQVKKN 371

Query: 114 AIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNELGSQ------------------NG 173
            IPMSRI+DAV+RILRVKF MGLFENP AD+ LVN+LGSQ                  NG
Sbjct: 372 IIPMSRIDDAVKRILRVKFTMGLFENPYADESLVNQLGSQEHRELAREAVRKSLVLLKNG 431

Query: 174 ENADEPVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQGLSGNNLTTGTTILEAVKKTV 233
           +NA+EP+LPL KK++KILVAG+HADNLGYQCGGWTI WQGLSGNN T+GTTIL A+K TV
Sbjct: 432 KNANEPLLPLPKKSSKILVAGSHADNLGYQCGGWTIEWQGLSGNNHTSGTTILTAIKNTV 491

Query: 234 DPNTEVIYNVNPTTDYLKANNFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNV 293
           DP+T+++YN NP  D++K+N FSYAIVVVGE PYAET GD++NLT++  G  TIQNVC  
Sbjct: 492 DPSTKIVYNENPDADFVKSNKFSYAIVVVGEHPYAETQGDSMNLTLSNPGPSTIQNVCGA 551

Query: 294 VKCVVVIVSGRPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTV 353
           VKCVV++VSGRP+ ++P+++ ++ALVAAWLPGTEG+GV DVLFGDYGFTGKL+RTWFK+V
Sbjct: 552 VKCVVIVVSGRPVVMQPYVNSIEALVAAWLPGTEGQGVADVLFGDYGFTGKLSRTWFKSV 611

Query: 354 DQLPMNYGDENYNPLFPLGFGLTTEPVN 364
           DQLPMN GD NY+PLFP GFGLTTEPVN
Sbjct: 612 DQLPMNVGDRNYDPLFPFGFGLTTEPVN 626

BLAST of Lsi10G004150.1 vs. NCBI nr
Match: gi|255569514|ref|XP_002525724.1| (PREDICTED: beta-glucosidase BoGH3B [Ricinus communis])

HSP 1 Score: 432.6 bits (1111), Expect = 7.0e-118
Identity = 217/327 (66.36%), Positives = 251/327 (76.76%), Query Frame = 1

Query: 54  LDRSVVTPEIMRDYSIGSVLSAGGSVPSPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSN 113
           +DR    P     YS+ + +SAG             IDM+MVP NYTEFID LTYLV S 
Sbjct: 310 IDRITFPPHANYTYSVLAGISAG-------------IDMIMVPYNYTEFIDGLTYLVKSG 369

Query: 114 AIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNELGS------------------QNG 173
            IPMSRI+DAV+RILRVKFVMGLFENP AD+ LVN+LGS                  +NG
Sbjct: 370 IIPMSRIDDAVKRILRVKFVMGLFENPNADESLVNQLGSHEHRQLAREAVRKSLVLLRNG 429

Query: 174 ENADEPVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQGLSGNNLTTGTTILEAVKKTV 233
           + AD+P LPL KKA+KILVAG+HADNLGYQCGGWTI WQGL GN+LT+GTTIL A+K TV
Sbjct: 430 KYADKPSLPLPKKASKILVAGSHADNLGYQCGGWTIEWQGLGGNDLTSGTTILTAIKNTV 489

Query: 234 DPNTEVIYNVNPTTDYLKANNFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNV 293
           D +T+V+Y  NP  D++KANNFSYAIVVVGE PYAET GD++NLTIAE G  TIQNVC  
Sbjct: 490 DSSTKVVYEENPDADFVKANNFSYAIVVVGEHPYAETQGDSMNLTIAEPGPSTIQNVCGA 549

Query: 294 VKCVVVIVSGRPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTV 353
           VKCVVV+VSGRP+ I+P+++ +DALVAAWLPGTEG+GV DVLFGDYGFTGKL+ TWFKTV
Sbjct: 550 VKCVVVVVSGRPVVIQPYVNIIDALVAAWLPGTEGQGVADVLFGDYGFTGKLSHTWFKTV 609

Query: 354 DQLPMNYGDENYNPLFPLGFGLTTEPV 363
           DQLPMN GD  Y+PLFP GFGLTTEPV
Sbjct: 610 DQLPMNVGDRYYDPLFPFGFGLTTEPV 623

BLAST of Lsi10G004150.1 vs. NCBI nr
Match: gi|763768301|gb|KJB35516.1| (hypothetical protein B456_006G118100 [Gossypium raimondii])

HSP 1 Score: 432.6 bits (1111), Expect = 7.0e-118
Identity = 215/324 (66.36%), Positives = 253/324 (78.09%), Query Frame = 1

Query: 54  LDRSVVTPEIMRDYSIGSVLSAGGSVPSPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSN 113
           LDR    P     YS+ + + AG             IDMVMVP N+TEFID+LTY V +N
Sbjct: 312 LDRITSPPHANYSYSVEAGVGAG-------------IDMVMVPYNFTEFIDDLTYQVKNN 371

Query: 114 AIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNELGSQ-------------NGENADE 173
            IPMSRI+DAV+RILRVKFVMGLFENP+AD+ LVN+LGSQ             NGE+AD+
Sbjct: 372 IIPMSRIDDAVKRILRVKFVMGLFENPMADNSLVNQLGSQAREAVRKSLVLLKNGESADK 431

Query: 174 PVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQGLSGNNLTTGTTILEAVKKTVDPNTE 233
           P+LPL KKA KILVAGTHADNLGYQCGGWTITWQGL GN+LTTGTTIL+AVK TVD +T+
Sbjct: 432 PLLPLPKKATKILVAGTHADNLGYQCGGWTITWQGLGGNDLTTGTTILQAVKNTVDSSTQ 491

Query: 234 VIYNVNPTTDYLKANNFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNVVKCVV 293
           V+Y+ NP   ++K+  FSYAIVVVGE PYAET GD+LNLTI+E G  TI NVC  VKCVV
Sbjct: 492 VVYSENPDAGFVKSGEFSYAIVVVGEPPYAETYGDSLNLTISEPGPKTIYNVCGSVKCVV 551

Query: 294 VIVSGRPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTVDQLPM 353
           V++SGRP+ ++P +S + ALVAAWLPGTEG+GV DVLFGDYGFTGKLARTWFKTVDQLPM
Sbjct: 552 VVISGRPVVVQPFVSSVHALVAAWLPGTEGQGVADVLFGDYGFTGKLARTWFKTVDQLPM 611

Query: 354 NYGDENYNPLFPLGFGLTTEPVNK 365
           N GD +Y+PLFP GFGLTT+P ++
Sbjct: 612 NVGDSHYDPLFPFGFGLTTKPTHQ 622

BLAST of Lsi10G004150.1 vs. NCBI nr
Match: gi|33391721|gb|AAQ17461.1| (beta-D-glucosidase [Gossypium hirsutum])

HSP 1 Score: 432.6 bits (1111), Expect = 7.0e-118
Identity = 216/329 (65.65%), Positives = 255/329 (77.51%), Query Frame = 1

Query: 54  LDRSVVTPEIMRDYSIGSVLSAGGSVPSPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSN 113
           LDR    P     YS+ + + AG             IDMVMVP N+TEFID+LTY V +N
Sbjct: 312 LDRITSPPHANYSYSVEAGVGAG-------------IDMVMVPYNFTEFIDDLTYQVKNN 371

Query: 114 AIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNELGSQ------------------NG 173
            IPMSRI+DAV+RILRVKFVMGLFENP+AD+ LVN+LGSQ                  NG
Sbjct: 372 IIPMSRIDDAVKRILRVKFVMGLFENPMADNSLVNQLGSQEHRELAREAVRKSLVLLKNG 431

Query: 174 ENADEPVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQGLSGNNLTTGTTILEAVKKTV 233
           E+AD+P+LPL KKA KILVAGTHADNLGYQCGGWTITWQGL GN+LTTGTTIL+AVK TV
Sbjct: 432 ESADKPLLPLPKKATKILVAGTHADNLGYQCGGWTITWQGLGGNDLTTGTTILQAVKNTV 491

Query: 234 DPNTEVIYNVNPTTDYLKANNFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNV 293
           D +T+V+Y+ NP   ++K+  FSYAIVVVGE PYAET GD+LNLTI+E G  TI NVC  
Sbjct: 492 DSSTQVVYSENPDAGFVKSGEFSYAIVVVGEPPYAETYGDSLNLTISEPGPMTIYNVCGS 551

Query: 294 VKCVVVIVSGRPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTV 353
           VKCVVV++SGRP+ ++P +S +DALVAAWLPGTEG+GV+DVLFGDYGFTGKLARTWFKTV
Sbjct: 552 VKCVVVVISGRPVVVQPFVSSVDALVAAWLPGTEGQGVSDVLFGDYGFTGKLARTWFKTV 611

Query: 354 DQLPMNYGDENYNPLFPLGFGLTTEPVNK 365
           DQLPMN GD +Y+PLFP GFGLTT+P ++
Sbjct: 612 DQLPMNVGDPHYDPLFPFGFGLTTKPTHQ 627

BLAST of Lsi10G004150.1 vs. NCBI nr
Match: gi|643732586|gb|KDP39682.1| (hypothetical protein JCGZ_02702 [Jatropha curcas])

HSP 1 Score: 431.0 bits (1107), Expect = 2.0e-117
Identity = 211/328 (64.33%), Positives = 254/328 (77.44%), Query Frame = 1

Query: 54  LDRSVVTPEIMRDYSIGSVLSAGGSVPSPQATAQKWIDMVMVPTNYTEFIDNLTYLVNSN 113
           +DR    P     YSI + +SAG  + SP       I  +MVP NYTEFID LT  V  N
Sbjct: 323 IDRITSPPHANYTYSIQAGISAGIDMASP-------ILQIMVPFNYTEFIDGLTDQVKKN 382

Query: 114 AIPMSRINDAVRRILRVKFVMGLFENPLADDRLVNELGS------------------QNG 173
            IPMSRI+DAV+RILRVKF MGLFENP AD+ LVN+LGS                  +NG
Sbjct: 383 IIPMSRIDDAVKRILRVKFTMGLFENPYADESLVNQLGSQEHRELAREAVRKSLVLLKNG 442

Query: 174 ENADEPVLPLSKKAAKILVAGTHADNLGYQCGGWTITWQGLSGNNLTTGTTILEAVKKTV 233
           +NA+EP+LPL KK++KILVAG+HADNLGYQCGGWTI WQGLSGNN T+GTTIL A+K TV
Sbjct: 443 KNANEPLLPLPKKSSKILVAGSHADNLGYQCGGWTIEWQGLSGNNHTSGTTILTAIKNTV 502

Query: 234 DPNTEVIYNVNPTTDYLKANNFSYAIVVVGETPYAETDGDNLNLTIAEGGSDTIQNVCNV 293
           DP+T+++YN NP  D++K+N FSYAIVVVGE PYAET GD++NLT++  G  TIQNVC  
Sbjct: 503 DPSTKIVYNENPDADFVKSNKFSYAIVVVGEHPYAETQGDSMNLTLSNPGPSTIQNVCGA 562

Query: 294 VKCVVVIVSGRPLTIEPHMSQLDALVAAWLPGTEGEGVTDVLFGDYGFTGKLARTWFKTV 353
           VKCVV++VSGRP+ ++P+++ ++ALVAAWLPGTEG+GV DVLFGDYGFTGKL+RTWFK+V
Sbjct: 563 VKCVVIVVSGRPVVMQPYVNSIEALVAAWLPGTEGQGVADVLFGDYGFTGKLSRTWFKSV 622

Query: 354 DQLPMNYGDENYNPLFPLGFGLTTEPVN 364
           DQLPMN GD NY+PLFP GFGLTTEPVN
Sbjct: 623 DQLPMNVGDRNYDPLFPFGFGLTTEPVN 643

The following BLAST results are available for this feature:
Match Name    E-value    Identity    Description
BGH3B_BACO1   7.2e-22    28.53       Beta-glucosidase BoGH3B OS=Bacteroides ovatus (strain ATCC 8483 / DSM 1896 / JCM...
GLUA_DICDI    8.0e-21    29.07       Lysosomal beta glucosidase OS=Dictyostelium discoideum GN=gluA PE=1 SV=2
BGLX_SALTY    2.0e-19    26.32       Periplasmic beta-glucosidase OS=Salmonella typhimurium (strain LT2 / SGSC1412 / ...
BGLX_ECOLI    6.1e-13    25.58       Periplasmic beta-glucosidase OS=Escherichia coli (strain K12) GN=bglX PE=3 SV=2
XYL3A_PRER2   6.3e-10    41.41       Xylan 1,4-beta-xylosidase OS=Prevotella ruminicola (strain ATCC 19189 / JCM 8958...

Match Name          E-value     Identity    Description
B9SIA5_RICCO        4.9e-118    66.36       Hydrolase, hydrolyzing O-glycosyl compounds, putative OS=Ricinus communis GN=RCO...
Q7XAS3_GOSHI        4.9e-118    65.65       Beta-D-glucosidase OS=Gossypium hirsutum PE=2 SV=1
A0A0D2S1K4_GOSRA    4.9e-118    66.36       Uncharacterized protein OS=Gossypium raimondii GN=B456_006G118100 PE=4 SV=1
D7U8L2_VITVI        1.4e-117    66.56       Putative uncharacterized protein OS=Vitis vinifera GN=VIT_08s0032g00470 PE=4 SV=...
A0A067KU67_JATCU    1.4e-117    64.33       Uncharacterized protein OS=Jatropha curcas GN=JCGZ_02702 PE=4 SV=1

Match Name     E-value     Identity    Description
AT5G20940.1    9.8e-110    63.27       Glycosyl hydrolase family protein
AT5G20950.1    3.5e-107    58.59       Glycosyl hydrolase family protein
AT5G04885.1    2.8e-101    59.67       Glycosyl hydrolase family protein
AT3G62710.1    5.4e-84     52.23       Glycosyl hydrolase family protein
AT3G47000.1    2.3e-82     53.24       Glycosyl hydrolase family protein

Match Name                         E-value     Identity    Description
gi|802585432|ref|XP_012070424.1|   3.2e-118    64.63       PREDICTED: lysosomal beta glucosidase-like [Jatropha curcas]
gi|255569514|ref|XP_002525724.1|   7.0e-118    66.36       PREDICTED: beta-glucosidase BoGH3B [Ricinus communis]
gi|763768301|gb|KJB35516.1|        7.0e-118    66.36       hypothetical protein B456_006G118100 [Gossypium raimondii]
gi|33391721|gb|AAQ17461.1|         7.0e-118    65.65       beta-D-glucosidase [Gossypium hirsutum]
gi|643732586|gb|KDP39682.1|        2.0e-117    64.33       hypothetical protein JCGZ_02702 [Jatropha curcas]

The following terms have been associated with this mRNA:
Vocabulary: Biological Process
Term          Definition
GO:0005975    carbohydrate metabolic process
Vocabulary: Molecular Function
Term          Definition
GO:0004553    hydrolase activity, hydrolyzing O-glycosyl compounds
Vocabulary: INTERPRO
Term          Definition
IPR026892     Glycoside hydrolase family 3
IPR017853     Glycoside_hydrolase_SF
IPR002772     Glyco_hydro_3_C
IPR001764     Glyco_hydro_3_N
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
biological_process GO:0009251 glucan catabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
molecular_function GO:0008422 beta-glucosidase activity

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name     Type
Lsi10G004150    Lsi10G004150    gene


The following CDS feature(s) are a part of this mRNA:

Feature Name            Unique Name             Type
Lsi10G004150.1.CDS.1    Lsi10G004150.1.CDS.1    CDS
Lsi10G004150.1.CDS.2    Lsi10G004150.1.CDS.2    CDS
Lsi10G004150.1.CDS.3    Lsi10G004150.1.CDS.3    CDS
Lsi10G004150.1.CDS.4    Lsi10G004150.1.CDS.4    CDS


The following polypeptide feature(s) derive from this mRNA:

Feature Name      Unique Name               Type
Lsi10G004150.1    Lsi10G004150.1-protein    polypeptide


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term   IPR Description   Source   Source Term   Source Description   Alignment
IPR001764  Glycoside hydrolase, family 3, N-terminal  GENE3D  G3DSA:3.20.20.300  -  coord: 23..89, score: 4.3E-10; coord: 90..145, score: 6.4
IPR001764  Glycoside hydrolase, family 3, N-terminal  PFAM  PF00933  Glyco_hydro_3  coord: 85..130, score: 1.
IPR002772  Glycoside hydrolase family 3 C-terminal domain  GENE3D  G3DSA:3.40.50.1700  -  coord: 157..358, score: 5.5
IPR002772  Glycoside hydrolase family 3 C-terminal domain  PFAM  PF01915  Glyco_hydro_3_C  coord: 160..358, score: 3.1
IPR002772  Glycoside hydrolase family 3 C-terminal domain  unknown  SSF52279  Beta-D-glucan exohydrolase, C-terminal domain  coord: 157..358, score: 7.06
IPR017853  Glycoside hydrolase superfamily  unknown  SSF51445  (Trans)glycosidases  coord: 90..154, score: 3.79E-13; coord: 22..89, score: 2.5
IPR026892  Glycoside hydrolase family 3  PANTHER  PTHR30620  PERIPLASMIC BETA-GLUCOSIDASE-RELATED  coord: 90..363, score: 3.6E
None  No IPR available  PANTHER  PTHR30620:SF39  SUBFAMILY NOT NAMED  coord: 90..363, score: 3.6E