CmoCh06G009350 (gene) Cucurbita moschata (Rifu)

Name: CmoCh06G009350
Type: gene
Organism: Cucurbita moschata (Rifu)
Description: Hydrolase, hydrolyzing O-glycosyl compounds, putative (EC 3.2.1.21)
Location: Cmo_Chr06: 6962161..6964760 (-)
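
The location field gives a chromosome, a start and end coordinate, and a strand. A minimal sketch of parsing it is shown below; the function name `parse_location` and the regular expression are illustrative and not part of the database, and the length calculation assumes the coordinates are 1-based and inclusive (the usual GFF-style convention), which the page itself does not state.

```python
import re

def parse_location(text):
    """Split a location string such as 'Cmo_Chr06 : 6962161 .. 6964760 (-)'.

    Returns (chromosome, start, end, strand, length).
    Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
    """
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    return chrom, start, end, strand, end - start + 1

print(parse_location("Cmo_Chr06 : 6962161 .. 6964760 (-)"))
```

Under that assumption the feature spans 2600 bp on the minus strand of Cmo_Chr06.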
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACCAAGGTTCTCATTGTTTTAATGGGATTTTTACTCATGTTTTTGTCGGAGACATTGGGAACAGGAGAACAGCTCACATATAAAGATCCAACAAAACCATTAAATGTTCGAATCAAGGATCTACTTGGTCGGATGACTGTGGAGGAAAAAATAGGTCAAATGGTGCAAATTGAAAGGGTTAATGCTTCGGCTGATGTCATGAAAAACTATTTCATTGGTAAATTATCTTACTCCTAATCTAATTCTCTCCTCACGCTCTCAAAGTACTAATACAATAATATTAAATGTAGGGAGTGTATTGAGTGGTGGAGGTAGTGCCCCATCAAAGAATGCCTCAGCTAAGGATTGGGTGGACATGGTGAATGAAATCCAAAAAGGAGCTTTGTCGAGTAGGCTAGGAATTCCAATGATATATGGAATTGATGCTGTACATGGCCACAACAATGTCTATAATGCAACCATCTTCCCTCACAACGTCGGACTTGGTGCTACTAGGTTACATAACCATACTCCCTCTCTATTTTTTATTTTTTATTTTTATTTTTATTCAGCACTAATCATATAGCTTATTCGTGTAAAAATGATTTTCCTTTGATTAAACAAACAGAGACCCTCAACTTGTGAAGAATATTGGGAGTGCTACTGCCCTTGAAATTAGAGCAACTGGCATTCCGTATGCTTTTGCGCCTTGTATAGCGGTAATTATCTTTGTGTTTCAATGGATAAAGATCATCTTAACATTTCTGTTACTATTTGTTGTTTGTTATGTCATGTGATTTATTTTGGAGGTTTTTGTATTTCAATTAGGTTTGTAAAGATCCACGATGGGGTCGATGCTATGAAAGCTACAGTGAGGACCCTAAGATTGTTCAAGAAATGACTGAGATCATACTAGGTTTACAAGGAGAGATTCCACCTAATTCTCGCAAGGGTGTTCCTTATGTCGGTGGGAAGTAAGTCTTCAAATATTAATCAGTGTCTTTATTTGTATAGGTGTTGAAAATAACCACTATTTTCATTTATTTATGCATAGAGACAAAGTGGCAGGGTGTGCAAAACATTACGTTGGTGATGGTGGAACAACTAAGGGTATCAACGAGAACGACACAGTAATAGACAGACATAGCTTACTTAGCATTCACATGCCAGGTTACTCTCACTCGATCATCAAGGGAATTGCAACGGTTATGGCTTCTTATTCAAGTTGGAATGGAGAGAAAATGCATGCCCACAAAGAACTCCTTACTGGCTTTCTCAAGAACACTCTTAATTTTAAGGTAATATGGGTTCTTGATTCTACTTTGTGATTGTCACTTTGTTGCAGACATTTGATGATTTGATAAAAAGTGAGCTTAATATTGCTTCGGTCTTTTTAGGGCTTTGTGATCTCAGATTGGCAGGGTATTGATAGGATTACAACTCCACCTCATGCTAATTATACATATTCCATCATAGCCAGCGTTACTGCTGGTGTTGACATGGTTAGTATTATGCTCGACACTTTAAAACGTATTTCAAGGAATAATTATATGCTCATGTACACTTTCAATACAGATAATGATACCGTACGACTACAAGGAGTTCATCGATAAAATTACCTACTTGGTAAAAAATAACATAATTCCTATGAGTCGAATTGACGATGCAGTTTGGAGAATTTTGAGAGTCAAATTTGTTATGGGTTTATTTGAGAACCCATTAGCTGACTACAGCTTGGTTAATGAGATTGGTAAAAAGGTAACTATATGGAATATGTCTTTAATATTATGAAATAAAATATAGAAATTATGTTTTAACTTATGTTTTTCTCACAATAGGAACATAGAGAACTAGCTAGAGAAGCCGTAAGAAAATCTCTAGTGTTATTAAAGAATGGAAAATCGACTTCAACACCATTGCTTCCTCTTCCAAAGAAGACGCAAAAAATACTTGTTGCTGGCACCCATGCAAACAACCTTGGGTATCAATGTGGTGGTTGGACTATCGAATGGCAAGGA
GCTAGTGGCAACAACCTTACAAGTGGTATGAAAATATTACAATATAATTGTGAAGTTTTTCCTTATAAATTATTGGGACATTGTGTATGATATTTATATCATCTCCTTTTGTTAGGTACAACTGTGCTTGATGCTATAAAAGAAACGGTTGATCCTGAAACAGAAGTTACCTTTGAGGAACAACCAAATAAGGAGAGTCTCCAATCACATGAGTTTTCTTATGGCATTGTTGTAGTGGGAGAATATCCATATGCAGAAACTAATGGCGATAGCTTGAATTTGACAATTCCCGACCCGGGTCCAAGCACCATCACAGATGTTTGTGGCGCTATGAAATGTGTAGTTATAATAATCTCAGGACGGCCTGTAGTAATCGAACCTTATATTTCTTCAATGGATGCACTTGTTGCTGCTTGGCTTCCCGGAACTGAAGGAAAAGGCATTACTGATGTATTGTTTGGAGATTATGGTTTTACTGGAAAACTTCCCCGAACGTGGTTCAAAACTGTTGATCAATTGCCAATGAACTTTGGAGATCCTCATTATGATCCTCTTTTCTCCTTTGGATATGGTCTTACTACAGAACCCATCAAAGCTTAG

mRNA sequence

ATGACCAAGGTTCTCATTGTTTTAATGGGATTTTTACTCATGTTTTTGTCGGAGACATTGGGAACAGGAGAACAGCTCACATATAAAGATCCAACAAAACCATTAAATGTTCGAATCAAGGATCTACTTGGTCGGATGACTGTGGAGGAAAAAATAGGTCAAATGGTGCAAATTGAAAGGGTTAATGCTTCGGCTGATGTCATGAAAAACTATTTCATTGGGAGTGTATTGAGTGGTGGAGGTAGTGCCCCATCAAAGAATGCCTCAGCTAAGGATTGGGTGGACATGGTGAATGAAATCCAAAAAGGAGCTTTGTCGAGTAGGCTAGGAATTCCAATGATATATGGAATTGATGCTGTACATGGCCACAACAATGTCTATAATGCAACCATCTTCCCTCACAACGTCGGACTTGGTGCTACTAGAGACCCTCAACTTGTGAAGAATATTGGGAGTGCTACTGCCCTTGAAATTAGAGCAACTGGCATTCCGTATGCTTTTGCGCCTTGTATAGCGGTTTGTAAAGATCCACGATGGGGTCGATGCTATGAAAGCTACAGTGAGGACCCTAAGATTGTTCAAGAAATGACTGAGATCATACTAGGTTTACAAGGAGAGATTCCACCTAATTCTCGCAAGGGTGTTCCTTATGTCGGTGGGAAAGACAAAGTGGCAGGGTGTGCAAAACATTACGTTGGTGATGGTGGAACAACTAAGGGTATCAACGAGAACGACACAGTAATAGACAGACATAGCTTACTTAGCATTCACATGCCAGGTTACTCTCACTCGATCATCAAGGGAATTGCAACGGTTATGGCTTCTTATTCAAGTTGGAATGGAGAGAAAATGCATGCCCACAAAGAACTCCTTACTGGCTTTCTCAAGAACACTCTTAATTTTAAGGGCTTTGTGATCTCAGATTGGCAGGGTATTGATAGGATTACAACTCCACCTCATGCTAATTATACATATTCCATCATAGCCAGCGTTACTGCTGGTGTTGACATGATAATGATACCGTACGACTACAAGGAGTTCATCGATAAAATTACCTACTTGGTAAAAAATAACATAATTCCTATGAGTCGAATTGACGATGCAGTTTGGAGAATTTTGAGAGTCAAATTTGTTATGGGTTTATTTGAGAACCCATTAGCTGACTACAGCTTGGTTAATGAGATTGGTAAAAAGGAACATAGAGAACTAGCTAGAGAAGCCGTAAGAAAATCTCTAGTGTTATTAAAGAATGGAAAATCGACTTCAACACCATTGCTTCCTCTTCCAAAGAAGACGCAAAAAATACTTGTTGCTGGCACCCATGCAAACAACCTTGGGTATCAATGTGGTGGTTGGACTATCGAATGGCAAGGAGCTAGTGGCAACAACCTTACAAGTGGTACAACTGTGCTTGATGCTATAAAAGAAACGGTTGATCCTGAAACAGAAGTTACCTTTGAGGAACAACCAAATAAGGAGAGTCTCCAATCACATGAGTTTTCTTATGGCATTGTTGTAGTGGGAGAATATCCATATGCAGAAACTAATGGCGATAGCTTGAATTTGACAATTCCCGACCCGGGTCCAAGCACCATCACAGATGTTTGTGGCGCTATGAAATGTGTAGTTATAATAATCTCAGGACGGCCTGTAGTAATCGAACCTTATATTTCTTCAATGGATGCACTTGTTGCTGCTTGGCTTCCCGGAACTGAAGGAAAAGGCATTACTGATGTATTGTTTGGAGATTATGGTTTTACTGGAAAACTTCCCCGAACGTGGTTCAAAACTGTTGATCAATTGCCAATGAACTTTGGAGATCCTCATTATGATCCTCTTTTCTCCTTTGGATATGGTCTTACTACAGAACCCATCAAAGCTTAG

Coding sequence (CDS)

ATGACCAAGGTTCTCATTGTTTTAATGGGATTTTTACTCATGTTTTTGTCGGAGACATTGGGAACAGGAGAACAGCTCACATATAAAGATCCAACAAAACCATTAAATGTTCGAATCAAGGATCTACTTGGTCGGATGACTGTGGAGGAAAAAATAGGTCAAATGGTGCAAATTGAAAGGGTTAATGCTTCGGCTGATGTCATGAAAAACTATTTCATTGGGAGTGTATTGAGTGGTGGAGGTAGTGCCCCATCAAAGAATGCCTCAGCTAAGGATTGGGTGGACATGGTGAATGAAATCCAAAAAGGAGCTTTGTCGAGTAGGCTAGGAATTCCAATGATATATGGAATTGATGCTGTACATGGCCACAACAATGTCTATAATGCAACCATCTTCCCTCACAACGTCGGACTTGGTGCTACTAGAGACCCTCAACTTGTGAAGAATATTGGGAGTGCTACTGCCCTTGAAATTAGAGCAACTGGCATTCCGTATGCTTTTGCGCCTTGTATAGCGGTTTGTAAAGATCCACGATGGGGTCGATGCTATGAAAGCTACAGTGAGGACCCTAAGATTGTTCAAGAAATGACTGAGATCATACTAGGTTTACAAGGAGAGATTCCACCTAATTCTCGCAAGGGTGTTCCTTATGTCGGTGGGAAAGACAAAGTGGCAGGGTGTGCAAAACATTACGTTGGTGATGGTGGAACAACTAAGGGTATCAACGAGAACGACACAGTAATAGACAGACATAGCTTACTTAGCATTCACATGCCAGGTTACTCTCACTCGATCATCAAGGGAATTGCAACGGTTATGGCTTCTTATTCAAGTTGGAATGGAGAGAAAATGCATGCCCACAAAGAACTCCTTACTGGCTTTCTCAAGAACACTCTTAATTTTAAGGGCTTTGTGATCTCAGATTGGCAGGGTATTGATAGGATTACAACTCCACCTCATGCTAATTATACATATTCCATCATAGCCAGCGTTACTGCTGGTGTTGACATGATAATGATACCGTACGACTACAAGGAGTTCATCGATAAAATTACCTACTTGGTAAAAAATAACATAATTCCTATGAGTCGAATTGACGATGCAGTTTGGAGAATTTTGAGAGTCAAATTTGTTATGGGTTTATTTGAGAACCCATTAGCTGACTACAGCTTGGTTAATGAGATTGGTAAAAAGGAACATAGAGAACTAGCTAGAGAAGCCGTAAGAAAATCTCTAGTGTTATTAAAGAATGGAAAATCGACTTCAACACCATTGCTTCCTCTTCCAAAGAAGACGCAAAAAATACTTGTTGCTGGCACCCATGCAAACAACCTTGGGTATCAATGTGGTGGTTGGACTATCGAATGGCAAGGAGCTAGTGGCAACAACCTTACAAGTGGTACAACTGTGCTTGATGCTATAAAAGAAACGGTTGATCCTGAAACAGAAGTTACCTTTGAGGAACAACCAAATAAGGAGAGTCTCCAATCACATGAGTTTTCTTATGGCATTGTTGTAGTGGGAGAATATCCATATGCAGAAACTAATGGCGATAGCTTGAATTTGACAATTCCCGACCCGGGTCCAAGCACCATCACAGATGTTTGTGGCGCTATGAAATGTGTAGTTATAATAATCTCAGGACGGCCTGTAGTAATCGAACCTTATATTTCTTCAATGGATGCACTTGTTGCTGCTTGGCTTCCCGGAACTGAAGGAAAAGGCATTACTGATGTATTGTTTGGAGATTATGGTTTTACTGGAAAACTTCCCCGAACGTGGTTCAAAACTGTTGATCAATTGCCAATGAACTTTGGAGATCCTCATTATGATCCTCTTTTCTCCTTTGGATATGGTCTTACTACAGAACCCATCAAAGCTTAG
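
The coding sequence can be checked against the protein shown on the Query lines of the BLAST alignments by translating it codon by codon. The sketch below uses the standard genetic code, which is an assumption about how the annotation was produced; the names `CODON_TABLE` and `translate` are illustrative. Only the first 30 bases and the last 12 bases of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGACCAAGGTTCTCATTGTTTTAATGGGA"))  # first 10 codons of the CDS
print(translate("ATCAAAGCTTAG"))                    # last 4 codons, ending in the stop
```

The first call yields `MTKVLIVLMG`, which matches the start of the query protein in the alignments, and the second yields `IKA`, which matches its end.
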
BLAST of CmoCh06G009350 vs. Swiss-Prot
Match: BGH3B_BACO1 (Beta-glucosidase BoGH3B OS=Bacteroides ovatus (strain ATCC 8483 / DSM 1896 / JCM 5824 / NCTC 11153) GN=BACOVA_02659 PE=1 SV=1)

HSP 1 Score: 293.5 bits (750), Expect = 5.5e-78
Identity = 200/649 (30.82%), Positives = 331/649 (51.00%), Query Frame = 1

Query: 31  PTKP-LNVRIKDLLGRMTVEEKIGQMVQIERVNASAD------------------VMKNY 90
           PT P +   I++ L +MT+E+KIGQM +I  ++  +D                  V+  Y
Sbjct: 30  PTDPAIETHIREWLQKMTLEQKIGQMCEIT-IDVVSDLETSRKKGFCLSEAMLDTVIGKY 89

Query: 91  FIGSVLSGGGSAPSKNASAKD-WVDMVNEIQKGALSSRLGIPMIYGIDAVHGHNNVYNAT 150
            +GS+L+     P   A  K+ W + + +IQ+ ++   +GIP IYG+D +HG     + T
Sbjct: 90  KVGSLLN----VPLGVAQKKEKWAEAIKQIQEKSMKE-IGIPCIYGVDQIHGTTYTLDGT 149

Query: 151 IFPHNVGLGATRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCYESYSEDP 210
           +FP  + +GAT + +L +     +A E +A  IP+ FAP + + +DPRW R +E+Y ED 
Sbjct: 150 MFPQGINMGATFNRELTRRGAKISAYETKAGCIPWTFAPVVDLGRDPRWARMWENYGEDC 209

Query: 211 KIVQEM-TEIILGLQGEIPPNSRKGVPYVGGKDKVAGCAKHYVGDGGTTKGINENDTVID 270
            +  EM    + G QGE P           G+  VA C KHY+G G    G +   + I 
Sbjct: 210 YVNAEMGVSAVKGFQGEDPNRI--------GEYNVAACMKHYMGYGVPVSGKDRTPSSIS 269

Query: 271 RHSLLSIHMPGYSHSIIKGIATVMASYSSWNGEKMHAHKELLTGFLKNTLNFKGFVISDW 330
           R  +   H   +  ++ +G  +VM +    NG   HA++ELLT +LK  LN+ G +++DW
Sbjct: 270 RSDMREKHFAPFLAAVRQGALSVMVNSGVDNGLPFHANRELLTEWLKEDLNWDGLIVTDW 329

Query: 331 QGIDRITTPPH--ANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNIIPMSRIDD 390
             I+ + T  H  A    ++   + AG+DM M+PY+   F D +  LV+   + M RIDD
Sbjct: 330 ADINNLCTRDHIAATKKEAVKIVINAGIDMSMVPYEV-SFCDYLKELVEEGEVSMERIDD 389

Query: 391 AVWRILRVKFVMGLFENPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKSTSTPLLP 450
           AV R+LR+K+ +GLF++P  D    ++ G KE   +A +A  +S VLLKN  +    +LP
Sbjct: 390 AVARVLRLKYRLGLFDHPYWDIKKYDKFGSKEFAAVALQAAEESEVLLKNDGN----ILP 449

Query: 451 LPKKTQKILVAGTHANNLGYQCGGWTIEWQGASGNNLTSG-TTVLDAI-----KETVDPE 510
           +  K +KIL+ G +AN++    GGW+  WQG   +       T+ +A+     KE +  E
Sbjct: 450 I-AKGKKILLTGPNANSMRCLNGGWSYSWQGHVADEYAQAYHTIYEALCEKYGKENIIYE 509

Query: 511 TEVTF----------EEQPNKESLQSHEFSYGIVV--VGEYPYAETNGDSLNLTIPDPGP 570
             VT+          E +P  E   +      I++  +GE  Y ET G+  +LT+ +   
Sbjct: 510 PGVTYASYKNDNWWEENKPETEKPVAAAAQADIIITCIGENSYCETPGNLTDLTLSENQR 569

Query: 571 STITDVCGAMKCVVIIIS-GRPVVIEPYISSMDALVAAWLPGT-EGKGITDVLFGDYGFT 622
           + +  +    K +V++++ GRP +I   +    A+V   LP    G  + ++L GD  F+
Sbjct: 570 NLVKALAATGKPIVLVLNQGRPRIINDIVPLAKAVVNIMLPSNYGGDALANLLAGDANFS 629
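
The percentages on each HSP summary line are the matching-column count divided by the alignment length. A minimal sketch of recomputing them from the summary line follows; the function name `parse_hsp_stats` is hypothetical, and the regular expression accepts whatever label precedes each `count/length` pair rather than assuming a fixed spelling.

```python
import re

def parse_hsp_stats(line):
    """Return {label: (count, alignment_length, percent)} for each 'label = a/b' pair."""
    stats = {}
    for label, num, den in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        count, length = int(num), int(den)
        stats[label] = (count, length, round(100 * count / length, 2))
    return stats

line = "Identity = 200/649 (30.82%), Positives = 331/649 (51.00%), Query Frame = 1"
for label, (count, length, pct) in parse_hsp_stats(line).items():
    print(f"{label}: {count}/{length} = {pct:.2f}%")
```

For the first Swiss-Prot hit this reproduces the reported 30.82% identity and 51.00% positives over 649 alignment columns.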

BLAST of CmoCh06G009350 vs. Swiss-Prot
Match: GLUA_DICDI (Lysosomal beta glucosidase OS=Dictyostelium discoideum GN=gluA PE=1 SV=2)

HSP 1 Score: 283.9 bits (725), Expect = 4.3e-75
Identity = 205/634 (32.33%), Positives = 322/634 (50.79%), Query Frame = 1

Query: 39  IKDLLGRMTVEEKIGQMVQIERVNAS------------ADVMKNYFIGSVL----SGGGS 98
           + +L+ +M++ EKIGQM Q++    +            A   K Y+IGS L    SGG +
Sbjct: 80  VDNLMSKMSITEKIGQMTQLDITTLTSPNTITINETTLAYYAKTYYIGSYLNSPVSGGLA 139

Query: 99  APSKNASAKDWVDMVNEIQKGALS-SRLGIPMIYGIDAVHGHNNVYNATIFPHNVGLGAT 158
               + ++  W+DM+N IQ   +  S   IPMIYG+D+VHG N V+ AT+FPHN GL AT
Sbjct: 140 GDIHHINSSVWLDMINTIQTIVIEGSPNKIPMIYGLDSVHGANYVHKATLFPHNTGLAAT 199

Query: 159 RDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCYESYSEDPKIVQEM-TEII 218
            + +        T+ +  A GIP+ FAP + +   P W R YE++ EDP +   M    +
Sbjct: 200 FNIEHATTAAQITSKDTVAVGIPWVFAPVLGIGVQPLWSRIYETFGEDPYVASMMGAAAV 259

Query: 219 LGLQGEIPPNSRKGVPYVGGKDKVAGCAKHYVGDGGTTKGINENDTVIDRHSLLSIHMPG 278
            G QG    NS  G   +     V   AKHY G    T G +     I    L    +P 
Sbjct: 260 RGFQG--GNNSFDGP--INAPSAVC-TAKHYFGYSDPTSGKDRTAAWIPERMLRRYFLPS 319

Query: 279 YSHSII-KGIATVMASYSSWNGEKMHAHKELLTGFLKNTLNFKGFVISDWQGIDRITTPP 338
           ++ +I   G  T+M +    NG  MH   + LT  L+  L F+G  ++DWQ I+++    
Sbjct: 320 FAEAITGAGAGTIMINSGEVNGVPMHTSYKYLTEVLRGELQFEGVAVTDWQDIEKLVYFH 379

Query: 339 H--ANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNIIPMSRIDDAVWRILRVKF 398
           H   +   +I+ ++ AG+DM M+P D   F   +  +V    +P SR+D +V RIL +K+
Sbjct: 380 HTAGSAEEAILQALDAGIDMSMVPLDL-SFPIILAEMVAAGTVPESRLDLSVRRILNLKY 439

Query: 399 VMGLFENPL--ADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKSTSTPLLPLPKKTQK- 458
            +GLF NP    + ++V+ IG+ + RE A     +S+ LL+N  +    +LPL   T K 
Sbjct: 440 ALGLFSNPYPNPNAAIVDTIGQVQDREAAAATAEESITLLQNKNN----ILPLNTNTIKN 499

Query: 459 ILVAGTHANNLGYQCGGWTIEWQGA-SGNNLTSGTTVLDAIKE------------TVDPE 518
           +L+ G  A+++    GGW++ WQGA   +    GT++L  ++E            T+  E
Sbjct: 500 VLLTGPSADSIRNLNGGWSVHWQGAYEDSEFPFGTSILTGLREITNDTADFNIQYTIGHE 559

Query: 519 TEVTFEEQPNKESLQSHEFS-YGIVVVGEYPYAETNGDSLNLTIPDPGPSTITD--VCGA 578
             V   +    E+++  + S   +VV+GE P AET GD  +L++ DP    +    V   
Sbjct: 560 IGVPTNQTSIDEAVELAQSSDVVVVVIGELPEAETPGDIYDLSM-DPNEVLLLQQLVDTG 619

Query: 579 MKCVVIIISGRPVVIEP-YISSMDALVAAWLPGTE-GKGITDVLFGDYGFTGKLPRTWFK 622
              V+I++  RP ++ P  + S  A++ A+LPG+E GK I ++L G+   +G+LP T+  
Sbjct: 620 KPVVLILVEARPRILPPDLVYSCAAVLMAYLPGSEGGKPIANILMGNVNPSGRLPLTYPG 679

BLAST of CmoCh06G009350 vs. Swiss-Prot
Match: BGLX_SALTY (Periplasmic beta-glucosidase OS=Salmonella typhimurium (strain LT2 / SGSC1412 / ATCC 700720) GN=bglX PE=3 SV=2)

HSP 1 Score: 219.5 bits (558), Expect = 1.0e-55
Identity = 185/648 (28.55%), Positives = 292/648 (45.06%), Query Frame = 1

Query: 39  IKDLLGRMTVEEKIGQMVQIERVNASADVMKNYFIGSVLSGGGSAPSKNASAKDWVDMVN 98
           + DLL +MTV+EKIGQ+     ++   D  K      +  G   A     + +D   M +
Sbjct: 38  VTDLLKKMTVDEKIGQL---RLISVGPDNPKEAIREMIKDGQVGAIFNTVTRQDIRQMQD 97

Query: 99  EIQKGALSSRLGIPMIYGIDAVHGHNNVYNATIFPHNVGLGATRDPQLVKNIGSATALEI 158
           ++      SRL IP+ +  D VHG       T+FP ++GL ++ +   V+ +G  +A E 
Sbjct: 98  QVMA---LSRLKIPLFFAYDVVHGQR-----TVFPISLGLASSFNLDAVRTVGRVSAYEA 157

Query: 159 RATGIPYAFAPCIAVCKDPRWGRCYESYSEDPKIVQEMTE-IILGLQGEIPPNSRKGVPY 218
              G+   +AP + V +DPRWGR  E + ED  +   M E ++  +QG+ P +       
Sbjct: 158 ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPAD------- 217

Query: 219 VGGKDKVAGCAKHYVGDGGTTKGINENDTVIDRHSLLSIHMPGYSHSIIKGIATVMASYS 278
              +  V    KH+   G    G   N   +    L + +MP Y   +  G   VM + +
Sbjct: 218 ---RYSVMTSVKHFAAYGAVEGGKEYNTVDMSSQRLFNDYMPPYKAGLDAGSGAVMVALN 277

Query: 279 SWNGEKMHAHKELLTGFLKNTLNFKGFVISDWQGI-DRITTPPHANYTYSIIASVTAGVD 338
           S NG    +   LL   L++   FKG  +SD   I + I     A+   ++  ++ AGVD
Sbjct: 278 SLNGTPATSDSWLLKDVLRDEWGFKGITVSDHGAIKELIKHGTAADPEDAVRVALKAGVD 337

Query: 339 MIMIPYDYKEFIDKITYLVKNNIIPMSRIDDAVWRILRVKFVMGLFENPLADYSLVNEIG 398
           M M    Y +++     L+K+  + M+ +DDA   +L VK+ MGLF +P +       +G
Sbjct: 338 MSMADEYYSKYLPG---LIKSGKVTMAELDDATRHVLNVKYDMGLFNDPYS------HLG 397

Query: 399 KKE------------HRELAREAVRKSLVLLKNGKSTSTPLLPLPKKTQKILVAGTHANN 458
            KE            HR+ ARE  R+S+VLLKN   T    LPL KK+  I V G  A++
Sbjct: 398 PKESDPVDTNAESRLHRKEAREVARESVVLLKNRLET----LPL-KKSGTIAVVGPLADS 457

Query: 459 LGYQCGGWTIEWQG----------------------ASGNNLTSGTTVLDAIK-----ET 518
                G W+                           A G N+T+   ++D +        
Sbjct: 458 QRDVMGSWSAAGVANQSVTVLAGIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVK 517

Query: 519 VDPETEVTFEEQPNKESLQSHEFSYGIVVVGEYP-YAETNGDSLNLTIPDPGPSTITDVC 578
           +DP +     ++  + + Q+      + VVGE    A       N+TIP      IT + 
Sbjct: 518 IDPRSPQAMIDEAVQAAKQADVV---VAVVGESQGMAHEASSRTNITIPQSQRDLITALK 577

Query: 579 GAMK-CVVIIISGRPVVIEPYISSMDALVAAWLPGTEG-KGITDVLFGDYGFTGKLPRTW 622
              K  V+++++GRP+ +       DA++  W  GTEG   I DVLFGDY  +GKLP ++
Sbjct: 578 ATGKPLVLVLMNGRPLALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPISF 637

BLAST of CmoCh06G009350 vs. Swiss-Prot
Match: BGLX_ECOLI (Periplasmic beta-glucosidase OS=Escherichia coli (strain K12) GN=bglX PE=3 SV=2)

HSP 1 Score: 211.8 bits (538), Expect = 2.1e-53
Identity = 181/648 (27.93%), Positives = 292/648 (45.06%), Query Frame = 1

Query: 39  IKDLLGRMTVEEKIGQMVQIERVNASADVMKNYFIGSVLSGGGSAPSKNASAKDWVDMVN 98
           + +LL +MTV+EKIGQ+     ++   D  K      +  G   A     + +D   M +
Sbjct: 38  VTELLKKMTVDEKIGQL---RLISVGPDNPKEAIREMIKDGQVGAIFNTVTRQDIRAMQD 97

Query: 99  EIQKGALSSRLGIPMIYGIDAVHGHNNVYNATIFPHNVGLGATRDPQLVKNIGSATALEI 158
           ++ +    SRL IP+ +  D +HG       T+FP ++GL ++ +   VK +G  +A E 
Sbjct: 98  QVME---LSRLKIPLFFAYDVLHGQR-----TVFPISLGLASSFNLDAVKTVGRVSAYEA 157

Query: 159 RATGIPYAFAPCIAVCKDPRWGRCYESYSEDPKIVQEMTEIIL-GLQGEIPPNSRKGVPY 218
              G+   +AP + V +DPRWGR  E + ED  +   M + ++  +QG+ P +       
Sbjct: 158 ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPAD------- 217

Query: 219 VGGKDKVAGCAKHYVGDGGTTKGINENDTVIDRHSLLSIHMPGYSHSIIKGIATVMASYS 278
              +  V    KH+   G    G   N   +    L + +MP Y   +  G   VM + +
Sbjct: 218 ---RYSVMTSVKHFAAYGAVEGGKEYNTVDMSPQRLFNDYMPPYKAGLDAGSGAVMVALN 277

Query: 279 SWNGEKMHAHKELLTGFLKNTLNFKGFVISDWQGI-DRITTPPHANYTYSIIASVTAGVD 338
           S NG    +   LL   L++   FKG  +SD   I + I     A+   ++  ++ +G++
Sbjct: 278 SLNGTPATSDSWLLKDVLRDQWGFKGITVSDHGAIKELIKHGTAADPEDAVRVALKSGIN 337

Query: 339 MIMIPYDYKEFIDKITYLVKNNIIPMSRIDDAVWRILRVKFVMGLFENPLADYSLVNEIG 398
           M M    Y +++     L+K+  + M+ +DDA   +L VK+ MGLF +P +       +G
Sbjct: 338 MSMSDEYYSKYLPG---LIKSGKVTMAELDDAARHVLNVKYDMGLFNDPYS------HLG 397

Query: 399 KKE------------HRELAREAVRKSLVLLKNGKSTSTPLLPLPKKTQKILVAGTHANN 458
            KE            HR+ ARE  R+SLVLLKN   T    LPL KK+  I V G  A++
Sbjct: 398 PKESDPVDTNAESRLHRKEAREVARESLVLLKNRLET----LPL-KKSATIAVVGPLADS 457

Query: 459 LGYQCGGWTIEWQG----------------------ASGNNLTSGTTVLDAIKE-----T 518
                G W+                           A G N+TS   ++D + +      
Sbjct: 458 KRDVMGSWSAAGVADQSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQYEEAVK 517

Query: 519 VDPETEVTFEEQPNKESLQSHEFSYGIVVVGEYP-YAETNGDSLNLTIPDPGPSTITDVC 578
           VDP +     ++  + + QS      + VVGE    A       ++TIP      I  + 
Sbjct: 518 VDPRSPQEMIDEAVQTAKQSDVV---VAVVGEAQGMAHEASSRTDITIPQSQRDLIAALK 577

Query: 579 GAMK-CVVIIISGRPVVIEPYISSMDALVAAWLPGTEG-KGITDVLFGDYGFTGKLPRTW 622
              K  V+++++GRP+ +       DA++  W  GTEG   I DVLFGDY  +GKLP ++
Sbjct: 578 ATGKPLVLVLMNGRPLALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPMSF 637

BLAST of CmoCh06G009350 vs. Swiss-Prot
Match: BGLC_ASPOR (Probable beta-glucosidase C OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40) GN=bglC PE=3 SV=2)

HSP 1 Score: 173.7 bits (439), Expect = 6.3e-42
Identity = 171/646 (26.47%), Positives = 266/646 (41.18%), Query Frame = 1

Query: 28  YKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIERVNASADV---------MKNYFIGSVLS 87
           YKD +  ++ R+ DLL RMT+EEK GQ+     ++   D            +  IG    
Sbjct: 46  YKDASYCIDERVDDLLARMTIEEKAGQLFHTRLMDGPLDDEGSGNNAHNSTSNMIGEKHM 105

Query: 88  GGGSAPSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAVHGHN-NV---YNATIF-- 147
              +  S   +A +  + +N IQ+ AL +RLGIP+    D  H    NV   + A +F  
Sbjct: 106 THFNLASDITNATETAEFINRIQELALQTRLGIPVTVSTDPRHSFTENVGTGFKAGVFSQ 165

Query: 148 -PHNVGLGATRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCYESYSEDPK 207
            P ++GL A RDP +V+        E  A GI  A  P + +  +PRW R   ++ E+  
Sbjct: 166 WPESIGLAALRDPYVVRKFAEVAKEEYIAVGIRAALHPQVDLSTEPRWARISNTWGENST 225

Query: 208 IVQEM-TEIILGLQGE-IPPNSRKGVPYVGGKDKVAGCAKHYVGDGGTTKG------INE 267
           +  E+  E I G QG+ + P S K V             KH+ G G    G        +
Sbjct: 226 LTSELLVEYIKGFQGDKLGPQSVKTV------------TKHFPGGGPVENGEDSHFAYGK 285

Query: 268 NDTVIDRHSLLSIHMPGYSHSIIKGIATVMASYSSWNGEKMHA-----HKELLTGFLKNT 327
           N T    +  L  H+  +  +I  G   +M  YS   G +        +K ++T  L+N 
Sbjct: 286 NQTYPGNN--LEEHLKPFKAAIAAGATEIMPYYSRPIGTEYEPVAFSFNKRIVTELLRNE 345

Query: 328 LNFKGFVISDWQ-----------------GIDRITTPPHANYTYSIIASVTAGVDMIMIP 387
           L F G V++DW                  G++ +T    A         + AG D     
Sbjct: 346 LGFDGIVLTDWGLITDGYIAGQYMPARAWGVENLTELQRA------ARILDAGCDQ---- 405

Query: 388 YDYKEFIDKITYLVKNNIIPMSRIDDAVWRILRVKFVMGLFENPLADYSLVNE-IGKKEH 447
           +  +E  + I  LV+  II   RID +V R+L+ KFV+GLF+NP  D       +G    
Sbjct: 406 FGGEERPELIVQLVQEGIISEDRIDVSVRRLLKEKFVLGLFDNPFVDAEAAGRVVGNDYF 465

Query: 448 RELAREAVRKSLVLLKNGKSTSTPLLPLPK--KTQKILVAGTHANNLGYQCGGWTIEWQG 507
             L REA R+S  LL N +     ++PL K  K+ K  + G +A+ +      W      
Sbjct: 466 VRLGREAQRRSYTLLSNNED----IVPLKKIEKSTKFYIEGFNASFI----ESWNY---- 525

Query: 508 ASGNNLTSGTTVLDAIKETVDPETEVTFEEQPNKESLQSHEFSYGIVVVGEYPYAETNGD 567
                     TV+D+ +E            +P                       E N  
Sbjct: 526 ----------TVVDSPEEAEYALLRYNAPYEPRPGGF------------------EANMH 585

Query: 568 SLNLTIPDPGPSTITDVCGAMKCVVIIISGRPVVIEPYISSMDALVAAWLPGTEGKGITD 622
           + +L   D   +    +  A+  +V I+  RP VI   I    A+ A++  G++     D
Sbjct: 586 AGSLAFNDTEKARQAKIYSAVPTIVDIVMDRPAVIPEIIEQAKAVFASY--GSDSNAFLD 625

BLAST of CmoCh06G009350 vs. TrEMBL
Match: A0A0A0LY55_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G661750 PE=4 SV=1)

HSP 1 Score: 1067.4 bits (2759), Expect = 6.7e-309
Identity = 512/622 (82.32%), Positives = 565/622 (90.84%), Query Frame = 1

Query: 6   IVLMGFLLMFLSETLGTGEQLTYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIERVNASA 65
           I+L+  LL+   ET    E   YKDPT+ LNVRIKDLLGRMT+EEKIGQMVQIERVNAS 
Sbjct: 6   IILIALLLICCFETGAKAENFKYKDPTQRLNVRIKDLLGRMTLEEKIGQMVQIERVNAST 65

Query: 66  DVMKNYFIGSVLSGGGSAPSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAVHGHNN 125
           +VMK YFIGSVLSGGGS PSK ASA+DW++MVNEIQKGALS+RLGIPMIYGIDAVHGHNN
Sbjct: 66  EVMKKYFIGSVLSGGGSVPSKQASAQDWINMVNEIQKGALSTRLGIPMIYGIDAVHGHNN 125

Query: 126 VYNATIFPHNVGLGATRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCYES 185
           VYNATIFPHN+GLGATRDPQL+K IG A+A EIRATGIPYAFAPC+AVC+DPRWGRCYES
Sbjct: 126 VYNATIFPHNIGLGATRDPQLLKRIGVASAREIRATGIPYAFAPCVAVCRDPRWGRCYES 185

Query: 186 YSEDPKIVQEMTEIILGLQGEIPPNSRKGVPYVGGKDKVAGCAKHYVGDGGTTKGINEND 245
           Y EDPKIVQEMTEII GLQGEIPPNSRKGVPYV GK+ V  CAKHYVGDGGTTKGI+EN+
Sbjct: 186 YGEDPKIVQEMTEIIPGLQGEIPPNSRKGVPYVAGKENVVACAKHYVGDGGTTKGIDENN 245

Query: 246 TVIDRHSLLSIHMPGYSHSIIKGIATVMASYSSWNGEKMHAHKELLTGFLKNTLNFKGFV 305
           TVIDRH LLSIHMPGY HSIIKG+AT+M SYSSWNGEKMHA+K L+T FLKNTL+F+GFV
Sbjct: 246 TVIDRHGLLSIHMPGYYHSIIKGVATIMVSYSSWNGEKMHANKNLVTDFLKNTLHFQGFV 305

Query: 306 ISDWQGIDRITTPPHANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNIIPMSRI 365
           ISDW+ IDRIT PPHANYTYSI+AS+TAG+DMIMIPY+Y EFID +T LVK+N IP+SRI
Sbjct: 306 ISDWEAIDRITDPPHANYTYSILASITAGLDMIMIPYNYPEFIDGLTNLVKSNYIPISRI 365

Query: 366 DDAVWRILRVKFVMGLFENPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKSTSTPL 425
           DDAV RILRVKFVMGLFENP+AD SLVNE+GK+EHRELAREAVRKSLVLLKNGKS   PL
Sbjct: 366 DDAVKRILRVKFVMGLFENPIADLSLVNELGKQEHRELAREAVRKSLVLLKNGKSADKPL 425

Query: 426 LPLPKKTQKILVAGTHANNLGYQCGGWTIEWQGASGNNLTSGTTVLDAIKETVDPETEVT 485
           LPL KKTQKILVAG+HANNLGYQCGGWTIEWQG SGNNLTSGTTVLDAIK+TVDP TEV 
Sbjct: 426 LPLEKKTQKILVAGSHANNLGYQCGGWTIEWQGLSGNNLTSGTTVLDAIKDTVDPTTEVI 485

Query: 486 FEEQPNKESLQSHEFSYGIVVVGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMKCVVII 545
           F E P+K+SLQS  FSY IVVVGE+PYAE NGDSLNLTIPDPGP+TIT+VCG +KC V+I
Sbjct: 486 FNENPDKKSLQSDTFSYAIVVVGEHPYAELNGDSLNLTIPDPGPNTITNVCGVIKCAVVI 545

Query: 546 ISGRPVVIEPYISSMDALVAAWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQLPMNF 605
           ISGRPVVI+PY+ S+DALVAAWLPGTEGKGITDVLFGDYGFTGKL +TWFKTVDQLPMNF
Sbjct: 546 ISGRPVVIQPYVDSIDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTVDQLPMNF 605

Query: 606 GDPHYDPLFSFGYGLTTEPIKA 628
           G+P+YDPLF FG+GLTT+PIK+
Sbjct: 606 GNPNYDPLFPFGHGLTTQPIKS 627

BLAST of CmoCh06G009350 vs. TrEMBL
Match: A0A0A0LI54_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G842090 PE=4 SV=1)

HSP 1 Score: 1053.9 bits (2724), Expect = 7.7e-305
Identity = 504/624 (80.77%), Positives = 563/624 (90.22%), Query Frame = 1

Query: 4   VLIVLMGFLLMFLSETLGTGEQLTYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIERVNA 63
           VLI  +G L++  SETL   E L YKDP +PLNVRIKDLLGRMT+EEKIGQMVQIER NA
Sbjct: 6   VLITFVGLLVLCFSETLAKAEYLKYKDPKQPLNVRIKDLLGRMTLEEKIGQMVQIERANA 65

Query: 64  SADVMKNYFIGSVLSGGGSAPSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAVHGH 123
           SADVMK YFIGSVLSGGGSAPSK ASAKDWV MVN+IQ+ ALS+RLGIPMIYGIDAVHGH
Sbjct: 66  SADVMKQYFIGSVLSGGGSAPSKQASAKDWVHMVNKIQEAALSTRLGIPMIYGIDAVHGH 125

Query: 124 NNVYNATIFPHNVGLGATRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCY 183
           NNVYNATIFPHN+GLGATRDPQL+K IG+ATALE+RATGIPYAFAPCIAVC+DPRWGRCY
Sbjct: 126 NNVYNATIFPHNIGLGATRDPQLLKRIGAATALEVRATGIPYAFAPCIAVCRDPRWGRCY 185

Query: 184 ESYSEDPKIVQEMTEIILGLQGEIPPNSRKGVPYVGGKDKVAGCAKHYVGDGGTTKGINE 243
           ESY ED  IVQ MTEII GLQG++P N RKGVPYV GK+ VA CAKH+VGDGGTTKGINE
Sbjct: 186 ESYGEDHTIVQAMTEIIPGLQGDVPANIRKGVPYVAGKNNVAACAKHFVGDGGTTKGINE 245

Query: 244 NDTVIDRHSLLSIHMPGYSHSIIKGIATVMASYSSWNGEKMHAHKELLTGFLKNTLNFKG 303
           N+TV+D H L SIHMP Y +SIIKG+ATVM SYSS NGEKMHA+K+L+T FLKNTL+FKG
Sbjct: 246 NNTVVDGHGLFSIHMPAYYNSIIKGVATVMVSYSSINGEKMHANKKLVTDFLKNTLHFKG 305

Query: 304 FVISDWQGIDRITTPPHANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNIIPMS 363
           FVISDWQGID+ITTPPHANYTYSI+ASV AGVDMIM+PY+Y EFID +TYLVKNN IP+S
Sbjct: 306 FVISDWQGIDKITTPPHANYTYSILASVNAGVDMIMVPYNYTEFIDGLTYLVKNNAIPIS 365

Query: 364 RIDDAVWRILRVKFVMGLFENPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKSTST 423
           RIDDAV RILRVKFVMGLFENPLAD SL+NE+GK+EHRELAREAVRKSLVLLKNGK  + 
Sbjct: 366 RIDDAVKRILRVKFVMGLFENPLADLSLINELGKQEHRELAREAVRKSLVLLKNGKLPNQ 425

Query: 424 PLLPLPKKTQKILVAGTHANNLGYQCGGWTIEWQGASGNNLTSGTTVLDAIKETVDPETE 483
           PLLPLPKK  KILVAGTHAN+LG QCGGWT+EWQG +GNNLTSGTT+L AIK+TVDPETE
Sbjct: 426 PLLPLPKKAPKILVAGTHANDLGNQCGGWTMEWQGLTGNNLTSGTTILTAIKDTVDPETE 485

Query: 484 VTFEEQPNKESLQSHEFSYGIVVVGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMKCVV 543
           V F + PN E LQ+H+FSY IVVVGE+PYAETNGDSLNLTIP+PGP TI +VCGA+KCVV
Sbjct: 486 VVFHDNPNAEFLQTHQFSYAIVVVGEHPYAETNGDSLNLTIPEPGPETIKNVCGAVKCVV 545

Query: 544 IIISGRPVVIEPYISSMDALVAAWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQLPM 603
           ++ISGRPVV++PYI S+DA+VAAWLPGTEGKGI+DVLFGDYGFTGKL +TWFK+VDQLPM
Sbjct: 546 VVISGRPVVLQPYIDSIDAVVAAWLPGTEGKGISDVLFGDYGFTGKLSQTWFKSVDQLPM 605

Query: 604 NFGDPHYDPLFSFGYGLTTEPIKA 628
           NFGD HYDPLF FG+GLTT+P+KA
Sbjct: 606 NFGDAHYDPLFPFGFGLTTQPVKA 629

BLAST of CmoCh06G009350 vs. TrEMBL
Match: A0A0A0LFL8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G842070 PE=4 SV=1)

HSP 1 Score: 1038.5 bits (2684), Expect = 3.3e-300
Identity = 494/627 (78.79%), Positives = 557/627 (88.84%), Query Frame = 1

Query: 1   MTKVLIVLMGFLLMFLSETLGTGEQLTYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIER 60
           M K LI  MGF +  L+E     + + YKDP +PLNVRI DLLGRMT+EEKIGQMVQI+R
Sbjct: 1   MAKNLIFFMGFFIFCLTEVWAKHQYMRYKDPKQPLNVRISDLLGRMTLEEKIGQMVQIDR 60

Query: 61  VNASADVMKNYFIGSVLSGGGSAPSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAV 120
             AS  VMK Y IGSVLSGGGS PSK AS K W+DMVNE QKG+LS+RLGIPMIYGIDAV
Sbjct: 61  TVASKKVMKKYLIGSVLSGGGSVPSKEASPKVWIDMVNEFQKGSLSTRLGIPMIYGIDAV 120

Query: 121 HGHNNVYNATIFPHNVGLGATRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWG 180
           HGHNNVY ATIFPHNVGLGATRDP L K IG+ATALE+RATGI Y FAPCIAVC+DPRWG
Sbjct: 121 HGHNNVYKATIFPHNVGLGATRDPNLAKRIGAATALEVRATGISYVFAPCIAVCRDPRWG 180

Query: 181 RCYESYSEDPKIVQEMTEIILGLQGEIPPNSRKGVPYVGGKDKVAGCAKHYVGDGGTTKG 240
           RC+ESYSEDPK+VQEMTEII GLQGEIP NSRKGVPYV G++KVA CAKHYVGDGGTTKG
Sbjct: 181 RCFESYSEDPKVVQEMTEIISGLQGEIPSNSRKGVPYVAGREKVAACAKHYVGDGGTTKG 240

Query: 241 INENDTVIDRHSLLSIHMPGYSHSIIKGIATVMASYSSWNGEKMHAHKELLTGFLKNTLN 300
           +NEN+T+  RH LLSIHMPGY +SIIKG++TVM SYSSWNG+KMH +++L+TGFLKNTL 
Sbjct: 241 MNENNTLASRHGLLSIHMPGYYNSIIKGVSTVMISYSSWNGKKMHENRDLITGFLKNTLR 300

Query: 301 FKGFVISDWQGIDRITTPPHANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNII 360
           F+GFVISDWQGIDRIT+PPHANYTYSIIA +TAG+DMIM+P++Y EFID +TYLVK N+I
Sbjct: 301 FRGFVISDWQGIDRITSPPHANYTYSIIAGITAGIDMIMVPFNYTEFIDGLTYLVKTNVI 360

Query: 361 PMSRIDDAVWRILRVKFVMGLFENPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKS 420
           P+SRIDDAV RILRVKFVMGLFENPLAD S VNE+GKKEHRELAREAVRKSLVLLKNG+S
Sbjct: 361 PISRIDDAVKRILRVKFVMGLFENPLADSSFVNELGKKEHRELAREAVRKSLVLLKNGES 420

Query: 421 TSTPLLPLPKKTQKILVAGTHANNLGYQCGGWTIEWQGASGNNLTSGTTVLDAIKETVDP 480
              P+LPLPKK  KILVAG+HANNLG+QCGGWTIEWQG  GNNLTSGTT+L AIK+TVDP
Sbjct: 421 ADKPILPLPKKVPKILVAGSHANNLGFQCGGWTIEWQGLGGNNLTSGTTILSAIKDTVDP 480

Query: 481 ETEVTFEEQPNKESLQSHEFSYGIVVVGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMK 540
           +T+V F+E P+ E ++S++FSY IVVVGEYPYAET GDSLNLTIP+PGPSTIT+VCGA+K
Sbjct: 481 KTKVVFKENPDMEFVKSNKFSYAIVVVGEYPYAETFGDSLNLTIPEPGPSTITNVCGAVK 540

Query: 541 CVVIIISGRPVVIEPYISSMDALVAAWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQ 600
           CVVI+ISGRPVV++PYISS+DALVAAWLPGTEGKGI+DVLFGDYGF+GKL RTWFKTVDQ
Sbjct: 541 CVVIVISGRPVVLQPYISSIDALVAAWLPGTEGKGISDVLFGDYGFSGKLSRTWFKTVDQ 600

Query: 601 LPMNFGDPHYDPLFSFGYGLTTEPIKA 628
           LPMN GD HYDPLF FG+GLTT PIKA
Sbjct: 601 LPMNVGDAHYDPLFPFGFGLTTNPIKA 627

BLAST of CmoCh06G009350 vs. TrEMBL
Match: A0A061F0I5_THECC (Glycosyl hydrolase family protein OS=Theobroma cacao GN=TCM_025896 PE=4 SV=1)

HSP 1 Score: 998.4 bits (2580), Expect = 3.8e-288
Identity = 471/604 (77.98%), Positives = 540/604 (89.40%), Query Frame = 1

Query: 24   EQLTYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIERVNASADVMKNYFIGSVLSGGGSA 83
            E + YKDP +PLNVRIKDL+GRMT+EEKIGQMVQIER  ASA+VMK YFIGSVLSGGGS 
Sbjct: 617  EHVKYKDPKQPLNVRIKDLIGRMTLEEKIGQMVQIERAVASAEVMKKYFIGSVLSGGGSV 676

Query: 84   PSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAVHGHNNVYNATIFPHNVGLGATRD 143
            P+  ASAK W++MVNE QKG+LS+RLGIPMIYGIDAVHGHNNVY ATIFPHN+GLGATRD
Sbjct: 677  PAPKASAKTWLNMVNEFQKGSLSTRLGIPMIYGIDAVHGHNNVYKATIFPHNIGLGATRD 736

Query: 144  PQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCYESYSEDPKIVQEMTEIILGL 203
            P LVK IG+ATALE+RATGIPYAFAPC+AVC+DPRWGRCYESYSED KIVQ MTEII GL
Sbjct: 737  PALVKKIGAATALEVRATGIPYAFAPCLAVCRDPRWGRCYESYSEDHKIVQAMTEIIPGL 796

Query: 204  QGEIPPNSRKGVPYVGGKDKVAGCAKHYVGDGGTTKGINENDTVIDRHSLLSIHMPGYSH 263
            QG+IP NSRKGVP+V GK  VA CAKHYVGDGGTT+GINEN+TVIDRH LLSIHMP Y +
Sbjct: 797  QGDIPSNSRKGVPFVAGKKNVAACAKHYVGDGGTTRGINENNTVIDRHGLLSIHMPAYYN 856

Query: 264  SIIKGIATVMASYSSWNGEKMHAHKELLTGFLKNTLNFKGFVISDWQGIDRITTPPHANY 323
            SIIKG++TVM SYSSWNG K HA+ E++T FLK TL F+GFVISDW+GIDRIT+PPHANY
Sbjct: 857  SIIKGVSTVMTSYSSWNGVKNHANHEMVTNFLKKTLRFRGFVISDWEGIDRITSPPHANY 916

Query: 324  TYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNIIPMSRIDDAVWRILRVKFVMGLFE 383
            TYSI+AS+ AG+DMIM+P +YKEFID +TYLVKN  IPMSRIDDAV RILRVKFVMGLFE
Sbjct: 917  TYSILASINAGLDMIMVPNNYKEFIDGLTYLVKNKFIPMSRIDDAVKRILRVKFVMGLFE 976

Query: 384  NPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKSTSTPLLPLPKKTQKILVAGTHAN 443
            +PLAD SLV+++G +EHRELAREAVRKSLVLLKNG S   PLLPLPKK  KILVAG+HAN
Sbjct: 977  DPLADDSLVDQLGSQEHRELAREAVRKSLVLLKNGDSADAPLLPLPKKAPKILVAGSHAN 1036

Query: 444  NLGYQCGGWTIEWQGASGNNLTSGTTVLDAIKETVDPETEVTFEEQPNKESLQSHEFSYG 503
            NLGYQCGGWTIEWQG  GNN+T GTT+L AIK+TVDP+T+V ++E+P+ E ++S++FSY 
Sbjct: 1037 NLGYQCGGWTIEWQGQGGNNITDGTTILTAIKKTVDPKTKVVYKEKPDAEFVKSNDFSYA 1096

Query: 504  IVVVGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMKCVVIIISGRPVVIEPYISSMDAL 563
            IVVVGE+PYAETNGDSLNLTIP+PGPSTI +VCGA+KCVV++ISGRPVVI+PY+  +DA+
Sbjct: 1097 IVVVGEHPYAETNGDSLNLTIPEPGPSTIGNVCGAVKCVVVVISGRPVVIQPYVRYIDAI 1156

Query: 564  VAAWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQLPMNFGDPHYDPLFSFGYGLTTE 623
            VAAWLPG+EG+G+ DVLFGDYGFTGKL  TWFKTVDQLPM+ GD HYDPLF FG+GLTT+
Sbjct: 1157 VAAWLPGSEGQGVADVLFGDYGFTGKLSFTWFKTVDQLPMHVGDSHYDPLFPFGFGLTTK 1216

Query: 624  PIKA 628
            P KA
Sbjct: 1217 PTKA 1220

BLAST of CmoCh06G009350 vs. TrEMBL
Match: U5FEM9_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0019s05340g PE=4 SV=1)

HSP 1 Score: 995.0 bits (2571), Expect = 4.2e-287
Identity = 476/626 (76.04%), Positives = 545/626 (87.06%), Query Frame = 1

Query: 1   MTKVLIVLMGFLLMFLSETLGTGEQLTYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIER 60
           M ++ I LMG ++++ +  L   E + YKD TKPLN RIKDL+ RMT+EEKIGQM QIER
Sbjct: 1   MARIPIFLMGLVVIWAA--LAEAEYMIYKDATKPLNSRIKDLMSRMTLEEKIGQMTQIER 60

Query: 61  VNASADVMKNYFIGSVLSGGGSAPSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAV 120
             ASA+VMK+YFIGSVLSGGGS PSK ASA+ W++MVNE+QKGALS+RLGIPMIYGIDAV
Sbjct: 61  GVASAEVMKDYFIGSVLSGGGSVPSKQASAETWINMVNELQKGALSTRLGIPMIYGIDAV 120

Query: 121 HGHNNVYNATIFPHNVGLGATRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWG 180
           HGHNNVY ATIFPHNVGLGATRDP LVK IG+ATALE+RATGIPY FAPCIAVC+DPRWG
Sbjct: 121 HGHNNVYKATIFPHNVGLGATRDPNLVKRIGAATALEVRATGIPYVFAPCIAVCRDPRWG 180

Query: 181 RCYESYSEDPKIVQEMTEIILGLQGEIPPNSRKGVPYVGGKDKVAGCAKHYVGDGGTTKG 240
           RCYESYSEDPK+VQ MTEI+ GLQG+IP NS KGVP+V GK KVA CAKHYVGDGGTTKG
Sbjct: 181 RCYESYSEDPKLVQAMTEIVSGLQGDIPANSSKGVPFVAGKTKVAACAKHYVGDGGTTKG 240

Query: 241 INENDTVIDRHSLLSIHMPGYSHSIIKGIATVMASYSSWNGEKMHAHKELLTGFLKNTLN 300
           INEN+T I RH LLSIHMPGY +SIIKG++TVM SYSSWNG KMHA+++++TGFLKN L 
Sbjct: 241 INENNTQISRHGLLSIHMPGYYNSIIKGVSTVMVSYSSWNGVKMHANRDMVTGFLKNILR 300

Query: 301 FKGFVISDWQGIDRITTPPHANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNII 360
           FKGFVISDW+GIDRIT+PPHANY+YSI A ++AG+DMIM+P +YKEFID +T  VKN +I
Sbjct: 301 FKGFVISDWEGIDRITSPPHANYSYSIQAGISAGIDMIMVPNNYKEFIDGLTSHVKNKVI 360

Query: 361 PMSRIDDAVWRILRVKFVMGLFENPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKS 420
           PMSRIDDAV RILRVKF MGLFENPLAD SLVNE+G +EHRELAREAVRKSLVLLKNG+S
Sbjct: 361 PMSRIDDAVTRILRVKFTMGLFENPLADNSLVNELGSQEHRELAREAVRKSLVLLKNGES 420

Query: 421 TSTPLLPLPKKTQKILVAGTHANNLGYQCGGWTIEWQGASGNNLTSGTTVLDAIKETVDP 480
            + PLLPLPKK  KILVAG+HA+NLGYQCGGWTIEWQG  GNNLTSGTT+L AIK TVDP
Sbjct: 421 AAEPLLPLPKKATKILVAGSHADNLGYQCGGWTIEWQGLGGNNLTSGTTILTAIKNTVDP 480

Query: 481 ETEVTFEEQPNKESLQSHEFSYGIVVVGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMK 540
            TEV ++E P+ + ++S+ FSY IVVVGE PYAET GDSLNLTI +PGPSTI +VCG +K
Sbjct: 481 STEVVYKENPDADFVKSNNFSYAIVVVGEPPYAETFGDSLNLTISEPGPSTIQNVCGTVK 540

Query: 541 CVVIIISGRPVVIEPYISSMDALVAAWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQ 600
           CV +IISGRPVVI+PY+S MDALVAAWLPG+EG+G+ D LFGDYGFTG L RTWFKTVDQ
Sbjct: 541 CVTVIISGRPVVIQPYVSLMDALVAAWLPGSEGQGVADALFGDYGFTGTLSRTWFKTVDQ 600

Query: 601 LPMNFGDPHYDPLFSFGYGLTTEPIK 627
           LPMN GD HYDPLF FG+GL+T+P K
Sbjct: 601 LPMNIGDQHYDPLFPFGFGLSTKPTK 624
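
The Identity and Positives percentages in each HSP header are the match counts divided by the alignment length. As a minimal sketch, assuming the header format shown in these reports, the following Python parses such a line and recomputes both percentages. The helper name `parse_hsp_stats` is hypothetical and not part of any BLAST tool; the pattern accepts both the "Positives" and "Postives" spellings.

```python
import re

# Hypothetical helper for the HSP statistics line used in these reports.
# "Posi?tives" tolerates the misspelling "Postives" seen in some outputs.
HSP_RE = re.compile(
    r"Identity = (?P<ident>\d+)/(?P<length>\d+) \((?P<ident_pct>[\d.]+)%\), "
    r"Posi?tives = (?P<pos>\d+)/\d+ \((?P<pos_pct>[\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return alignment length and recomputed identity/positives percentages."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident = int(m.group("ident"))
    pos = int(m.group("pos"))
    length = int(m.group("length"))
    return {
        "length": length,
        "identity_pct": round(100 * ident / length, 2),
        "positives_pct": round(100 * pos / length, 2),
    }

stats = parse_hsp_stats(
    "Identity = 476/626 (76.04%), Positives = 545/626 (87.06%), Query Frame = 1"
)
print(stats)
```

Recomputing the percentages from the counts is a cheap consistency check when the reports are copied or re-parsed.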

BLAST of CmoCh06G009350 vs. TAIR10
Match: AT5G04885.1 (AT5G04885.1 Glycosyl hydrolase family protein)

HSP 1 Score: 917.1 bits (2369), Expect = 5.7e-267
Identity = 423/605 (69.92%), Positives = 516/605 (85.29%), Query Frame = 1

Query: 21  GTGEQLTYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIERVNASADVMKNYFIGSVLSGG 80
           G GE L YKDP + ++ R+ DL GRMT+EEKIGQMVQI+R  A+ ++M++YFIGSVLSGG
Sbjct: 23  GDGEYLLYKDPKQTVSDRVADLFGRMTLEEKIGQMVQIDRSVATVNIMRDYFIGSVLSGG 82

Query: 81  GSAPSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAVHGHNNVYNATIFPHNVGLGA 140
           GSAP   ASA++WVDM+NE QKGAL SRLGIPMIYGIDAVHGHNNVYNATIFPHNVGLGA
Sbjct: 83  GSAPLPEASAQNWVDMINEYQKGALVSRLGIPMIYGIDAVHGHNNVYNATIFPHNVGLGA 142

Query: 141 TRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCYESYSEDPKIVQEMTEII 200
           TRDP LVK IG+ATA+E+RATGIPY FAPCIAVC+DPRWGRCYESYSED K+V++MT++I
Sbjct: 143 TRDPDLVKRIGAATAVEVRATGIPYTFAPCIAVCRDPRWGRCYESYSEDHKVVEDMTDVI 202

Query: 201 LGLQGEIPPNSRKGVPYVGGKDKVAGCAKHYVGDGGTTKGINENDTVIDRHSLLSIHMPG 260
           LGLQGE P N + GVP+VGG+DKVA CAKHYVGDGGTT+G+NEN+TV D H LLS+HMP 
Sbjct: 203 LGLQGEPPSNYKHGVPFVGGRDKVAACAKHYVGDGGTTRGVNENNTVTDLHGLLSVHMPA 262

Query: 261 YSHSIIKGIATVMASYSSWNGEKMHAHKELLTGFLKNTLNFKGFVISDWQGIDRITTPPH 320
           Y+ ++ KG++TVM SYSSWNGEKMHA+ EL+TG+LK TL FKGFVISDWQG+D+I+TPPH
Sbjct: 263 YADAVYKGVSTVMVSYSSWNGEKMHANTELITGYLKGTLKFKGFVISDWQGVDKISTPPH 322

Query: 321 ANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNIIPMSRIDDAVWRILRVKFVMG 380
            +YT S+ A++ AG+DM+M+P+++ EF++ +T LVKNN IP++RIDDAV RIL VKF MG
Sbjct: 323 THYTASVRAAIQAGIDMVMVPFNFTEFVNDLTTLVKNNSIPVTRIDDAVRRILLVKFTMG 382

Query: 381 LFENPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKSTSTPLLPLPKKTQKILVAGT 440
           LFENPLADYS  +E+G + HR+LAREAVRKSLVLLKNG  T+ P+LPLP+KT KILVAGT
Sbjct: 383 LFENPLADYSFSSELGSQAHRDLAREAVRKSLVLLKNGNKTN-PMLPLPRKTSKILVAGT 442

Query: 441 HANNLGYQCGGWTIEWQGASGNNLTSGTTVLDAIKETVDPETEVTFEEQPNKESLQSHEF 500
           HA+NLGYQCGGWTI WQG SGN  T GTT+L A+K  VD  TEV F E P+ E ++S+ F
Sbjct: 443 HADNLGYQCGGWTITWQGFSGNKNTRGTTLLSAVKSAVDQSTEVVFRENPDAEFIKSNNF 502

Query: 501 SYGIVVVGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMKCVVIIISGRPVVIEPYISSM 560
           +Y I+ VGE PYAET GDS  LT+ DPGP+ I+  C A+KCVV++ISGRP+V+EPY++S+
Sbjct: 503 AYAIIAVGEPPYAETAGDSDKLTMLDPGPAIISSTCQAVKCVVVVISGRPLVMEPYVASI 562

Query: 561 DALVAAWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQLPMNFGDPHYDPLFSFGYGL 620
           DALVAAWLPGTEG+GITD LFGD+GF+GKLP TWF+  +QLPM++GD HYDPLF++G GL
Sbjct: 563 DALVAAWLPGTEGQGITDALFGDHGFSGKLPVTWFRNTEQLPMSYGDTHYDPLFAYGSGL 622

Query: 621 TTEPI 626
            TE +
Sbjct: 623 ETESV 626

BLAST of CmoCh06G009350 vs. TAIR10
Match: AT5G20950.1 (AT5G20950.1 Glycosyl hydrolase family protein)

HSP 1 Score: 917.1 bits (2369), Expect = 5.7e-267
Identity = 436/626 (69.65%), Positives = 526/626 (84.03%), Query Frame = 1

Query: 1   MTKVLIVLMGFLLMFLSETLGTGEQLTYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIER 60
           ++KVL +++  L   ++   GT   L YKDP +PL  RI+DL+ RMT++EKIGQMVQIER
Sbjct: 4   LSKVLCLML--LCCIVAAAEGT---LKYKDPKQPLGARIRDLMNRMTLQEKIGQMVQIER 63

Query: 61  VNASADVMKNYFIGSVLSGGGSAPSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAV 120
             A+ +VMK YFIGSVLSGGGS PS+ A+ + WV+MVNEIQK +LS+RLGIPMIYGIDAV
Sbjct: 64  SVATPEVMKKYFIGSVLSGGGSVPSEKATPETWVNMVNEIQKASLSTRLGIPMIYGIDAV 123

Query: 121 HGHNNVYNATIFPHNVGLGATRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWG 180
           HGHNNVY ATIFPHNVGLG TRDP LVK IG+ATALE+RATGIPYAFAPCIAVC+DPRWG
Sbjct: 124 HGHNNVYGATIFPHNVGLGVTRDPNLVKRIGAATALEVRATGIPYAFAPCIAVCRDPRWG 183

Query: 181 RCYESYSEDPKIVQEMTEIILGLQGEIPPNSRKGVPYVGGKDKVAGCAKHYVGDGGTTKG 240
           RCYESYSED +IVQ+MTEII GLQG++P   RKGVP+VGGK KVA CAKH+VGDGGT +G
Sbjct: 184 RCYESYSEDYRIVQQMTEIIPGLQGDLP-TKRKGVPFVGGKTKVAACAKHFVGDGGTVRG 243

Query: 241 INENDTVIDRHSLLSIHMPGYSHSIIKGIATVMASYSSWNGEKMHAHKELLTGFLKNTLN 300
           I+EN+TVID   L  IHMPGY +++ KG+AT+M SYS+WNG +MHA+KEL+TGFLKN L 
Sbjct: 244 IDENNTVIDSKGLFGIHMPGYYNAVNKGVATIMVSYSAWNGLRMHANKELVTGFLKNKLK 303

Query: 301 FKGFVISDWQGIDRITTPPHANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNII 360
           F+GFVISDWQGIDRITTPPH NY+YS+ A ++AG+DMIM+PY+Y EFID+I+  ++  +I
Sbjct: 304 FRGFVISDWQGIDRITTPPHLNYSYSVYAGISAGIDMIMVPYNYTEFIDEISSQIQKKLI 363

Query: 361 PMSRIDDAVWRILRVKFVMGLFENPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKS 420
           P+SRIDDA+ RILRVKF MGLFE PLAD S  N++G KEHRELAREAVRKSLVLLKNGK+
Sbjct: 364 PISRIDDALKRILRVKFTMGLFEEPLADLSFANQLGSKEHRELAREAVRKSLVLLKNGKT 423

Query: 421 TSTPLLPLPKKTQKILVAGTHANNLGYQCGGWTIEWQGASGNNLTSGTTVLDAIKETVDP 480
            + PLLPLPKK+ KILVAG HA+NLGYQCGGWTI WQG +GN+ T GTT+L A+K TV P
Sbjct: 424 GAKPLLPLPKKSGKILVAGAHADNLGYQCGGWTITWQGLNGNDHTVGTTILAAVKNTVAP 483

Query: 481 ETEVTFEEQPNKESLQSHEFSYGIVVVGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMK 540
            T+V + + P+   ++S +F Y IVVVGE PYAE  GD+ NLTI DPGPS I +VCG++K
Sbjct: 484 TTQVVYSQNPDANFVKSGKFDYAIVVVGEPPYAEMFGDTTNLTISDPGPSIIGNVCGSVK 543

Query: 541 CVVIIISGRPVVIEPYISSMDALVAAWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQ 600
           CVV+++SGRPVVI+PY+S++DALVAAWLPGTEG+G+ D LFGDYGFTGKL RTWFK+V Q
Sbjct: 544 CVVVVVSGRPVVIQPYVSTIDALVAAWLPGTEGQGVADALFGDYGFTGKLARTWFKSVKQ 603

Query: 601 LPMNFGDPHYDPLFSFGYGLTTEPIK 627
           LPMN GD HYDPL+ FG+GLTT+P K
Sbjct: 604 LPMNVGDRHYDPLYPFGFGLTTKPYK 623

BLAST of CmoCh06G009350 vs. TAIR10
Match: AT5G20940.1 (AT5G20940.1 Glycosyl hydrolase family protein)

HSP 1 Score: 865.9 bits (2236), Expect = 1.5e-251
Identity = 413/597 (69.18%), Positives = 495/597 (82.91%), Query Frame = 1

Query: 28  YKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIERVNASADVMKNYFIGSVLSGGGSAPSKN 87
           YKDP +PL VRIK+L+  MT+EEKIGQMVQ+ERVNA+ +VM+ YF+GSV SGGGS P   
Sbjct: 32  YKDPKEPLGVRIKNLMSHMTLEEKIGQMVQVERVNATTEVMQKYFVGSVFSGGGSVPKPY 91

Query: 88  ASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAVHGHNNVYNATIFPHNVGLGATRDPQLV 147
              + WV+MVNE+QK ALS+RLGIP+IYGIDAVHGHN VYNATIFPHNVGLG TRDP LV
Sbjct: 92  IGPEAWVNMVNEVQKKALSTRLGIPIIYGIDAVHGHNTVYNATIFPHNVGLGVTRDPGLV 151

Query: 148 KNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCYESYSEDPKIVQEMTEIILGLQGEI 207
           K IG ATALE+RATGI Y FAPCIAVC+DPRWGRCYESYSED KIVQ+MTEII GLQG++
Sbjct: 152 KRIGEATALEVRATGIQYVFAPCIAVCRDPRWGRCYESYSEDHKIVQQMTEIIPGLQGDL 211

Query: 208 PPNSRKGVPYVGGKDKVAGCAKHYVGDGGTTKGINENDTVIDRHSLLSIHMPGYSHSIIK 267
           P   +KGVP+V GK KVA CAKH+VGDGGT +G+N N+TVI+ + LL IHMP Y  ++ K
Sbjct: 212 P-TGQKGVPFVAGKTKVAACAKHFVGDGGTLRGMNANNTVINSNGLLGIHMPAYHDAVNK 271

Query: 268 GIATVMASYSSWNGEKMHAHKELLTGFLKNTLNFKGFVISDWQGIDRITTPPHANYTYSI 327
           G+ATVM SYSS NG KMHA+K+L+TGFLKN L F+G VISD+ G+D+I TP  ANY++S+
Sbjct: 272 GVATVMVSYSSINGLKMHANKKLITGFLKNKLKFRGIVISDYLGVDQINTPLGANYSHSV 331

Query: 328 IASVTAGVDMIMIPYDYKEFIDKITYLVKNNIIPMSRIDDAVWRILRVKFVMGLFENPLA 387
            A+ TAG+DM M   +  + ID++T  VK   IPMSRIDDAV RILRVKF MGLFENP+A
Sbjct: 332 YAATTAGLDMFMGSSNLTKLIDELTSQVKRKFIPMSRIDDAVKRILRVKFTMGLFENPIA 391

Query: 388 DYSLVNEIGKKEHRELAREAVRKSLVLLKNGKSTSTPLLPLPKKTQKILVAGTHANNLGY 447
           D+SL  ++G KEHRELAREAVRKSLVLLKNG++   PLLPLPKK  KILVAGTHA+NLGY
Sbjct: 392 DHSLAKKLGSKEHRELAREAVRKSLVLLKNGENADKPLLPLPKKANKILVAGTHADNLGY 451

Query: 448 QCGGWTIEWQGASGNNLTSGTTVLDAIKETVDPETEVTFEEQPNKESLQSHEFSYGIVVV 507
           QCGGWTI WQG +GNNLT GTT+L A+K+TVDP+T+V + + P+   +++ +F Y IV V
Sbjct: 452 QCGGWTITWQGLNGNNLTIGTTILAAVKKTVDPKTQVIYNQNPDTNFVKAGDFDYAIVAV 511

Query: 508 GEYPYAETNGDSLNLTIPDPGPSTITDVCGAMKCVVIIISGRPVVIEPYISSMDALVAAW 567
           GE PYAE  GDS NLTI +PGPSTI +VC ++KCVV+++SGRPVV++  IS++DALVAAW
Sbjct: 512 GEKPYAEGFGDSTNLTISEPGPSTIGNVCASVKCVVVVVSGRPVVMQ--ISNIDALVAAW 571

Query: 568 LPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQLPMNFGDPHYDPLFSFGYGLTTEP 625
           LPGTEG+G+ DVLFGDYGFTGKL RTWFKTVDQLPMN GDPHYDPL+ FG+GL T+P
Sbjct: 572 LPGTEGQGVADVLFGDYGFTGKLARTWFKTVDQLPMNVGDPHYDPLYPFGFGLITKP 625

BLAST of CmoCh06G009350 vs. TAIR10
Match: AT3G47000.1 (AT3G47000.1 Glycosyl hydrolase family protein)

HSP 1 Score: 725.3 bits (1871), Expect = 3.2e-209
Identity = 349/600 (58.17%), Positives = 449/600 (74.83%), Query Frame = 1

Query: 28  YKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIERVNASADVMKNYFIGSVLSGGGSAPSKN 87
           YK+   P+  R+KDLL RMT+ EKIGQM QIER  AS     ++FIGSVL+ GGS P ++
Sbjct: 10  YKNGDAPVEARVKDLLSRMTLPEKIGQMTQIERRVASPSAFTDFFIGSVLNAGGSVPFED 69

Query: 88  ASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAVHGHNNVYNATIFPHNVGLGATRDPQLV 147
           A + DW DM++  Q+ AL+SRLGIP+IYG DAVHG+NNVY AT+FPHN+GLGATRD  LV
Sbjct: 70  AKSSDWADMIDGFQRSALASRLGIPIIYGTDAVHGNNNVYGATVFPHNIGLGATRDADLV 129

Query: 148 KNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCYESYSEDPKIVQEMTEIILGLQGEI 207
           + IG+ATALE+RA+G+ +AF+PC+AV +DPRWGRCYESY EDP++V EMT ++ GLQG  
Sbjct: 130 RRIGAATALEVRASGVHWAFSPCVAVLRDPRWGRCYESYGEDPELVCEMTSLVSGLQGVP 189

Query: 208 PPNSRKGVPYVGGKDKVAGCAKHYVGDGGTTKGINENDTVIDRHSLLSIHMPGYSHSIIK 267
           P     G P+V G++ V  C KH+VGDGGT KGINE +T+     L  IH+P Y   + +
Sbjct: 190 PEEHPNGYPFVAGRNNVVACVKHFVGDGGTDKGINEGNTIASYEELEKIHIPPYLKCLAQ 249

Query: 268 GIATVMASYSSWNGEKMHAHKELLTGFLKNTLNFKGFVISDWQGIDRITTPPHANYTYSI 327
           G++TVMASYSSWNG ++HA + LLT  LK  L FKGF++SDW+G+DR++ P  +NY Y I
Sbjct: 250 GVSTVMASYSSWNGTRLHADRFLLTEILKEKLGFKGFLVSDWEGLDRLSEPQGSNYRYCI 309

Query: 328 IASVTAGVDMIMIPYDYKEFIDKITYLVKNNIIPMSRIDDAVWRILRVKFVMGLFENPLA 387
             +V AG+DM+M+P+ Y++FI  +T LV++  IPM+RI+DAV RILRVKFV GLF +PL 
Sbjct: 310 KTAVNAGIDMVMVPFKYEQFIQDMTDLVESGEIPMARINDAVERILRVKFVAGLFGHPLT 369

Query: 388 DYSLVNEIGKKEHRELAREAVRKSLVLLKNGKSTSTPLLPLPKKTQKILVAGTHANNLGY 447
           D SL+  +G KEHRELA+EAVRKSLVLLK+GK+   P LPL +  ++ILV GTHA++LGY
Sbjct: 370 DRSLLPTVGCKEHRELAQEAVRKSLVLLKSGKNADKPFLPLDRNAKRILVTGTHADDLGY 429

Query: 448 QCGGWTIEWQGASGNNLTSGTTVLDAIKETVDPETEVTFEEQPNKESLQSHE-FSYGIVV 507
           QCGGWT  W G SG  +T GTT+LDAIKE V  ETEV +E+ P+KE+L S E FSY IV 
Sbjct: 430 QCGGWTKTWFGLSG-RITIGTTLLDAIKEAVGDETEVIYEKTPSKETLASSEGFSYAIVA 489

Query: 508 VGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMKCVVIIISGRPVVIEP-YISSMDALVA 567
           VGE PYAET GD+  L IP  G   +T V   +  +VI+ISGRPVV+EP  +   +ALVA
Sbjct: 490 VGEPPYAETMGDNSELRIPFNGTDIVTAVAEIIPTLVILISGRPVVLEPTVLEKTEALVA 549

Query: 568 AWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQLPMNFGDPHYDPLFSFGYGLTTEPI 626
           AWLPGTEG+G+ DV+FGDY F GKLP +WFK V+ LP++     YDPLF FG+GL ++P+
Sbjct: 550 AWLPGTEGQGVADVVFGDYDFKGKLPVSWFKHVEHLPLDAHANSYDPLFPFGFGLNSKPV 608

BLAST of CmoCh06G009350 vs. TAIR10
Match: AT3G62710.1 (AT3G62710.1 Glycosyl hydrolase family protein)

HSP 1 Score: 700.7 bits (1807), Expect = 8.4e-202
Identity = 355/626 (56.71%), Positives = 450/626 (71.88%), Query Frame = 1

Query: 26  LTYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIERVNASA----------DVMKNYFIGS 85
           + YKDP   +  R++DLL RMT+ EK+GQM QI+R N S           ++   Y IGS
Sbjct: 36  IKYKDPKVAVEERVEDLLIRMTLPEKLGQMCQIDRFNFSQVTGGVATVVPEIFTKYMIGS 95

Query: 86  VLSGGGSAPSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAVHGHNNVYNATIFPHN 145
           VLS         A     +   N ++K +LS+RLGIP++Y +DAVHGHN   +ATIFPHN
Sbjct: 96  VLSNPYDTGKDIAKR---IFQTNAMKKLSLSTRLGIPLLYAVDAVHGHNTFIDATIFPHN 155

Query: 146 VGLGATRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCYESYSEDPKIVQE 205
           VGLGATRDPQLVK IG+ TA E+RATG+  AFAPC+AVC+DPRWGRCYESYSEDP +V  
Sbjct: 156 VGLGATRDPQLVKKIGAITAQEVRATGVAQAFAPCVAVCRDPRWGRCYESYSEDPAVVNM 215

Query: 206 MTEIIL-GLQGEIPPNSRKGVPYVGG-KDKVAGCAKHYVGDGGTTKGINENDTVIDRHSL 265
           MTE I+ GLQG          PY+   K  VAGCAKH+VGDGGT  GINEN+TV D  +L
Sbjct: 216 MTESIIDGLQGN--------APYLADPKINVAGCAKHFVGDGGTINGINENNTVADNATL 275

Query: 266 LSIHMPGYSHSIIKGIATVMASYSSWNGEKMHAHKELLTGFLKNTLNFKGFVISDWQGID 325
             IHMP +  ++ KGIA++MASYSS NG KMHA++ ++T +LKNTL F+GFVISDW GID
Sbjct: 276 FGIHMPPFEIAVKKGIASIMASYSSLNGVKMHANRAMITDYLKNTLKFQGFVISDWLGID 335

Query: 326 RITTPPHANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNIIPMSRIDDAVWRIL 385
           +IT    +NYTYSI AS+ AG+DM+M+P+ Y E+++K+T LV    IPMSRIDDAV RIL
Sbjct: 336 KITPIEKSNYTYSIEASINAGIDMVMVPWAYPEYLEKLTNLVNGGYIPMSRIDDAVRRIL 395

Query: 386 RVKFVMGLFENPLADYSL-VNEIGKKEHRELAREAVRKSLVLLKNGKSTSTPLLPLPKKT 445
           RVKF +GLFEN LAD  L   E G + HRE+ REAVRKS+VLLKNGK+ +  ++PLPKK 
Sbjct: 396 RVKFSIGLFENSLADEKLPTTEFGSEAHREVGREAVRKSMVLLKNGKTDADKIVPLPKKV 455

Query: 446 QKILVAGTHANNLGYQCGGWTIEWQGASG--------------NNLTSGTTVLDAIKETV 505
           +KI+VAG HAN++G+QCGG+++ WQG +G                   GTT+L+AI++ V
Sbjct: 456 KKIVVAGRHANDMGWQCGGFSLTWQGFNGTGEDMPTNTKHGLPTGKIKGTTILEAIQKAV 515

Query: 506 DPETEVTFEEQPNKESLQSH-EFSYGIVVVGEYPYAETNGDSLNLTIPDPGPSTITDVCG 565
           DP TEV + E+PN+++ + H + +Y IVVVGE PYAET GDS  L I  PGP T++  CG
Sbjct: 516 DPTTEVVYVEEPNQDTAKLHADAAYTIVVVGETPYAETFGDSPTLGITKPGPDTLSHTCG 575

Query: 566 A-MKCVVIIISGRPVVIEPYISSMDALVAAWLPGTEGKGITDVLFGDYGFTGKLPRTWFK 623
           + MKC+VI+++GRP+VIEPYI  +DAL  AWLPGTEG+G+ DVLFGD+ FTG LPRTW K
Sbjct: 576 SGMKCLVILVTGRPLVIEPYIDMLDALAVAWLPGTEGQGVADVLFGDHPFTGTLPRTWMK 635

BLAST of CmoCh06G009350 vs. NCBI nr
Match: gi|659086037|ref|XP_008443733.1| (PREDICTED: lysosomal beta glucosidase-like [Cucumis melo])

HSP 1 Score: 1073.9 bits (2776), Expect = 9.8e-311
Identity = 517/626 (82.59%), Positives = 564/626 (90.10%), Query Frame = 1

Query: 1   MTKVLIVLMGFLLMFLSETLGTGEQLTYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIER 60
           M K + +L+G LL+   ET    E L YKDP +PLNVRIKDLLGRMT+EEKIGQM QIER
Sbjct: 1   MAKAINILIGLLLLCFFETWAKAENLKYKDPKQPLNVRIKDLLGRMTLEEKIGQMTQIER 60

Query: 61  VNASADVMKNYFIGSVLSGGGSAPSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAV 120
           VNAS DVMK YFIGSVLSGGGS PSK ASA+DWV MVNEIQ+GALS+RLGIPMIYGIDAV
Sbjct: 61  VNASTDVMKKYFIGSVLSGGGSVPSKEASAQDWVQMVNEIQQGALSTRLGIPMIYGIDAV 120

Query: 121 HGHNNVYNATIFPHNVGLGATRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWG 180
           HGHNNVYNATIFPHN+GLGATRDPQL+K IG A+ALEIRATGIPYAFAPCIAVC+DPRWG
Sbjct: 121 HGHNNVYNATIFPHNIGLGATRDPQLLKRIGEASALEIRATGIPYAFAPCIAVCRDPRWG 180

Query: 181 RCYESYSEDPKIVQEMTEIILGLQGEIPPNSRKGVPYVGGKDKVAGCAKHYVGDGGTTKG 240
           RCYESY EDPK+VQEMTEII GLQGEIPPNSRKGVPYV GK+KV  CAKHYVGDGGTTKG
Sbjct: 181 RCYESYGEDPKLVQEMTEIIPGLQGEIPPNSRKGVPYVAGKEKVVACAKHYVGDGGTTKG 240

Query: 241 INENDTVIDRHSLLSIHMPGYSHSIIKGIATVMASYSSWNGEKMHAHKELLTGFLKNTLN 300
           I+EN+TVIDRH LLSIHMPGY HSIIKG+ATVM SYSSWNG KMHA+KEL+T FLKNTL+
Sbjct: 241 IDENNTVIDRHGLLSIHMPGYYHSIIKGVATVMVSYSSWNGVKMHANKELVTDFLKNTLH 300

Query: 301 FKGFVISDWQGIDRITTPPHANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNII 360
           F+GFVISDWQ IDRIT PPHANYTYSI+ASVTAG+DMIM+PY+Y EFID +TYLV NN I
Sbjct: 301 FQGFVISDWQAIDRITDPPHANYTYSILASVTAGLDMIMVPYNYTEFIDGLTYLVNNNFI 360

Query: 361 PMSRIDDAVWRILRVKFVMGLFENPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKS 420
           P++RIDDAV RILRVKF+MGLFENP+AD SLVNE+GK+EHRELAREAVRKSLVLLKNGKS
Sbjct: 361 PITRIDDAVKRILRVKFIMGLFENPIADLSLVNELGKQEHRELAREAVRKSLVLLKNGKS 420

Query: 421 TSTPLLPLPKKTQKILVAGTHANNLGYQCGGWTIEWQGASGNNLTSGTTVLDAIKETVDP 480
              PLLPL KKTQKILVAG+HA+NLGYQCGGWTIEWQG SGNNLTSGTTVLDAIK+TVDP
Sbjct: 421 ADKPLLPLEKKTQKILVAGSHADNLGYQCGGWTIEWQGLSGNNLTSGTTVLDAIKDTVDP 480

Query: 481 ETEVTFEEQPNKESLQSHEFSYGIVVVGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMK 540
            TEV F E P+K  LQS  FSY IVVVGE+PYAE  GDSLNLTIPDPGPSTIT+VCG +K
Sbjct: 481 STEVIFNENPDKGFLQSGTFSYAIVVVGEHPYAEMMGDSLNLTIPDPGPSTITNVCGVIK 540

Query: 541 CVVIIISGRPVVIEPYISSMDALVAAWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQ 600
           CVV+IISGRPVVI+PY+ S+DALVAAWLPGTEGKGITDVLFGDYGFTGKL +TWFKTVDQ
Sbjct: 541 CVVVIISGRPVVIQPYVDSVDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTVDQ 600

Query: 601 LPMNFGDPHYDPLFSFGYGLTTEPIK 627
           LPMNFGD HYDPLF  G+GLTT+PIK
Sbjct: 601 LPMNFGDSHYDPLFPLGHGLTTQPIK 626

BLAST of CmoCh06G009350 vs. NCBI nr
Match: gi|778665412|ref|XP_011648555.1| (PREDICTED: lysosomal beta glucosidase-like [Cucumis sativus])

HSP 1 Score: 1067.4 bits (2759), Expect = 9.7e-309
Identity = 512/622 (82.32%), Positives = 565/622 (90.84%), Query Frame = 1

Query: 6   IVLMGFLLMFLSETLGTGEQLTYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIERVNASA 65
           I+L+  LL+   ET    E   YKDPT+ LNVRIKDLLGRMT+EEKIGQMVQIERVNAS 
Sbjct: 6   IILIALLLICCFETGAKAENFKYKDPTQRLNVRIKDLLGRMTLEEKIGQMVQIERVNAST 65

Query: 66  DVMKNYFIGSVLSGGGSAPSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAVHGHNN 125
           +VMK YFIGSVLSGGGS PSK ASA+DW++MVNEIQKGALS+RLGIPMIYGIDAVHGHNN
Sbjct: 66  EVMKKYFIGSVLSGGGSVPSKQASAQDWINMVNEIQKGALSTRLGIPMIYGIDAVHGHNN 125

Query: 126 VYNATIFPHNVGLGATRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCYES 185
           VYNATIFPHN+GLGATRDPQL+K IG A+A EIRATGIPYAFAPC+AVC+DPRWGRCYES
Sbjct: 126 VYNATIFPHNIGLGATRDPQLLKRIGVASAREIRATGIPYAFAPCVAVCRDPRWGRCYES 185

Query: 186 YSEDPKIVQEMTEIILGLQGEIPPNSRKGVPYVGGKDKVAGCAKHYVGDGGTTKGINEND 245
           Y EDPKIVQEMTEII GLQGEIPPNSRKGVPYV GK+ V  CAKHYVGDGGTTKGI+EN+
Sbjct: 186 YGEDPKIVQEMTEIIPGLQGEIPPNSRKGVPYVAGKENVVACAKHYVGDGGTTKGIDENN 245

Query: 246 TVIDRHSLLSIHMPGYSHSIIKGIATVMASYSSWNGEKMHAHKELLTGFLKNTLNFKGFV 305
           TVIDRH LLSIHMPGY HSIIKG+AT+M SYSSWNGEKMHA+K L+T FLKNTL+F+GFV
Sbjct: 246 TVIDRHGLLSIHMPGYYHSIIKGVATIMVSYSSWNGEKMHANKNLVTDFLKNTLHFQGFV 305

Query: 306 ISDWQGIDRITTPPHANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNIIPMSRI 365
           ISDW+ IDRIT PPHANYTYSI+AS+TAG+DMIMIPY+Y EFID +T LVK+N IP+SRI
Sbjct: 306 ISDWEAIDRITDPPHANYTYSILASITAGLDMIMIPYNYPEFIDGLTNLVKSNYIPISRI 365

Query: 366 DDAVWRILRVKFVMGLFENPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKSTSTPL 425
           DDAV RILRVKFVMGLFENP+AD SLVNE+GK+EHRELAREAVRKSLVLLKNGKS   PL
Sbjct: 366 DDAVKRILRVKFVMGLFENPIADLSLVNELGKQEHRELAREAVRKSLVLLKNGKSADKPL 425

Query: 426 LPLPKKTQKILVAGTHANNLGYQCGGWTIEWQGASGNNLTSGTTVLDAIKETVDPETEVT 485
           LPL KKTQKILVAG+HANNLGYQCGGWTIEWQG SGNNLTSGTTVLDAIK+TVDP TEV 
Sbjct: 426 LPLEKKTQKILVAGSHANNLGYQCGGWTIEWQGLSGNNLTSGTTVLDAIKDTVDPTTEVI 485

Query: 486 FEEQPNKESLQSHEFSYGIVVVGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMKCVVII 545
           F E P+K+SLQS  FSY IVVVGE+PYAE NGDSLNLTIPDPGP+TIT+VCG +KC V+I
Sbjct: 486 FNENPDKKSLQSDTFSYAIVVVGEHPYAELNGDSLNLTIPDPGPNTITNVCGVIKCAVVI 545

Query: 546 ISGRPVVIEPYISSMDALVAAWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQLPMNF 605
           ISGRPVVI+PY+ S+DALVAAWLPGTEGKGITDVLFGDYGFTGKL +TWFKTVDQLPMNF
Sbjct: 546 ISGRPVVIQPYVDSIDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTVDQLPMNF 605

Query: 606 GDPHYDPLFSFGYGLTTEPIKA 628
           G+P+YDPLF FG+GLTT+PIK+
Sbjct: 606 GNPNYDPLFPFGHGLTTQPIKS 627

BLAST of CmoCh06G009350 vs. NCBI nr
Match: gi|778685993|ref|XP_011652313.1| (PREDICTED: lysosomal beta glucosidase-like [Cucumis sativus])

HSP 1 Score: 1053.9 bits (2724), Expect = 1.1e-304
Identity = 504/624 (80.77%), Positives = 563/624 (90.22%), Query Frame = 1

Query: 4   VLIVLMGFLLMFLSETLGTGEQLTYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIERVNA 63
           VLI  +G L++  SETL   E L YKDP +PLNVRIKDLLGRMT+EEKIGQMVQIER NA
Sbjct: 6   VLITFVGLLVLCFSETLAKAEYLKYKDPKQPLNVRIKDLLGRMTLEEKIGQMVQIERANA 65

Query: 64  SADVMKNYFIGSVLSGGGSAPSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAVHGH 123
           SADVMK YFIGSVLSGGGSAPSK ASAKDWV MVN+IQ+ ALS+RLGIPMIYGIDAVHGH
Sbjct: 66  SADVMKQYFIGSVLSGGGSAPSKQASAKDWVHMVNKIQEAALSTRLGIPMIYGIDAVHGH 125

Query: 124 NNVYNATIFPHNVGLGATRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCY 183
           NNVYNATIFPHN+GLGATRDPQL+K IG+ATALE+RATGIPYAFAPCIAVC+DPRWGRCY
Sbjct: 126 NNVYNATIFPHNIGLGATRDPQLLKRIGAATALEVRATGIPYAFAPCIAVCRDPRWGRCY 185

Query: 184 ESYSEDPKIVQEMTEIILGLQGEIPPNSRKGVPYVGGKDKVAGCAKHYVGDGGTTKGINE 243
           ESY ED  IVQ MTEII GLQG++P N RKGVPYV GK+ VA CAKH+VGDGGTTKGINE
Sbjct: 186 ESYGEDHTIVQAMTEIIPGLQGDVPANIRKGVPYVAGKNNVAACAKHFVGDGGTTKGINE 245

Query: 244 NDTVIDRHSLLSIHMPGYSHSIIKGIATVMASYSSWNGEKMHAHKELLTGFLKNTLNFKG 303
           N+TV+D H L SIHMP Y +SIIKG+ATVM SYSS NGEKMHA+K+L+T FLKNTL+FKG
Sbjct: 246 NNTVVDGHGLFSIHMPAYYNSIIKGVATVMVSYSSINGEKMHANKKLVTDFLKNTLHFKG 305

Query: 304 FVISDWQGIDRITTPPHANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNIIPMS 363
           FVISDWQGID+ITTPPHANYTYSI+ASV AGVDMIM+PY+Y EFID +TYLVKNN IP+S
Sbjct: 306 FVISDWQGIDKITTPPHANYTYSILASVNAGVDMIMVPYNYTEFIDGLTYLVKNNAIPIS 365

Query: 364 RIDDAVWRILRVKFVMGLFENPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKSTST 423
           RIDDAV RILRVKFVMGLFENPLAD SL+NE+GK+EHRELAREAVRKSLVLLKNGK  + 
Sbjct: 366 RIDDAVKRILRVKFVMGLFENPLADLSLINELGKQEHRELAREAVRKSLVLLKNGKLPNQ 425

Query: 424 PLLPLPKKTQKILVAGTHANNLGYQCGGWTIEWQGASGNNLTSGTTVLDAIKETVDPETE 483
           PLLPLPKK  KILVAGTHAN+LG QCGGWT+EWQG +GNNLTSGTT+L AIK+TVDPETE
Sbjct: 426 PLLPLPKKAPKILVAGTHANDLGNQCGGWTMEWQGLTGNNLTSGTTILTAIKDTVDPETE 485

Query: 484 VTFEEQPNKESLQSHEFSYGIVVVGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMKCVV 543
           V F + PN E LQ+H+FSY IVVVGE+PYAETNGDSLNLTIP+PGP TI +VCGA+KCVV
Sbjct: 486 VVFHDNPNAEFLQTHQFSYAIVVVGEHPYAETNGDSLNLTIPEPGPETIKNVCGAVKCVV 545

Query: 544 IIISGRPVVIEPYISSMDALVAAWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQLPM 603
           ++ISGRPVV++PYI S+DA+VAAWLPGTEGKGI+DVLFGDYGFTGKL +TWFK+VDQLPM
Sbjct: 546 VVISGRPVVLQPYIDSIDAVVAAWLPGTEGKGISDVLFGDYGFTGKLSQTWFKSVDQLPM 605

Query: 604 NFGDPHYDPLFSFGYGLTTEPIKA 628
           NFGD HYDPLF FG+GLTT+P+KA
Sbjct: 606 NFGDAHYDPLFPFGFGLTTQPVKA 629

BLAST of CmoCh06G009350 vs. NCBI nr
Match: gi|659130020|ref|XP_008464960.1| (PREDICTED: lysosomal beta glucosidase-like [Cucumis melo])

HSP 1 Score: 1050.4 bits (2715), Expect = 1.2e-303
Identity = 504/624 (80.77%), Positives = 560/624 (89.74%), Query Frame = 1

Query: 4   VLIVLMGFLLMFLSETLGTGEQLTYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIERVNA 63
           VLI  +G L++  SETL   E L YKDP +PLNVRIKDL GRMT+EEKIGQMVQIER NA
Sbjct: 5   VLITFVGLLVLCFSETLAKAEYLKYKDPKQPLNVRIKDLFGRMTLEEKIGQMVQIERANA 64

Query: 64  SADVMKNYFIGSVLSGGGSAPSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAVHGH 123
           S DVM+ YFIGSVLSGGGS PSKNASAK WV MVN+IQ+GALS+RLGIPMIYGIDA+HGH
Sbjct: 65  SMDVMRKYFIGSVLSGGGSVPSKNASAKTWVHMVNKIQEGALSTRLGIPMIYGIDAIHGH 124

Query: 124 NNVYNATIFPHNVGLGATRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCY 183
           NNVYNATIFPHN+GLGATRDPQL+K IG ATALE+RATGIPYAFAPCIAVC+DPRWGRCY
Sbjct: 125 NNVYNATIFPHNIGLGATRDPQLIKRIGVATALEVRATGIPYAFAPCIAVCRDPRWGRCY 184

Query: 184 ESYSEDPKIVQEMTEIILGLQGEIPPNSRKGVPYVGGKDKVAGCAKHYVGDGGTTKGINE 243
           ESY ED KIVQ MTEII GLQG++P N RKGVPYV GK+ VA CAKH+VGDGGTTKGINE
Sbjct: 185 ESYGEDHKIVQAMTEIIPGLQGDLPSNIRKGVPYVAGKNNVAACAKHFVGDGGTTKGINE 244

Query: 244 NDTVIDRHSLLSIHMPGYSHSIIKGIATVMASYSSWNGEKMHAHKELLTGFLKNTLNFKG 303
           N+TVID H L SIHMP Y +SIIKG+AT+M SYSS NGEKMHA+K+L+T FLKNTL+FKG
Sbjct: 245 NNTVIDGHGLFSIHMPAYYNSIIKGVATIMVSYSSVNGEKMHANKKLVTDFLKNTLHFKG 304

Query: 304 FVISDWQGIDRITTPPHANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNIIPMS 363
           FVISDWQGID+IT+PPHANYTYSI+ASV AGVDMIM+PY+Y EFID +TYLVKNN IP+S
Sbjct: 305 FVISDWQGIDKITSPPHANYTYSILASVNAGVDMIMVPYNYTEFIDALTYLVKNNAIPIS 364

Query: 364 RIDDAVWRILRVKFVMGLFENPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKSTST 423
           RIDDAV RILRVKFVMGLFENPLAD SLVNEIGK+EHRELAREAVRKSLVLLKNGK  + 
Sbjct: 365 RIDDAVKRILRVKFVMGLFENPLADLSLVNEIGKQEHRELAREAVRKSLVLLKNGKLPNQ 424

Query: 424 PLLPLPKKTQKILVAGTHANNLGYQCGGWTIEWQGASGNNLTSGTTVLDAIKETVDPETE 483
           PLLPLPKK  KILVAGTHAN+LG QCGGWTIEWQG +GNNLTSGTTVL AIK+TVDPETE
Sbjct: 425 PLLPLPKKAPKILVAGTHANDLGNQCGGWTIEWQGLTGNNLTSGTTVLTAIKDTVDPETE 484

Query: 484 VTFEEQPNKESLQSHEFSYGIVVVGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMKCVV 543
           V F+  PN E L++H+FSY IVVVGE+PYAETNGDSLNLTIP+PGP TI +VCGA+KCVV
Sbjct: 485 VVFDNNPNAEFLKTHQFSYAIVVVGEHPYAETNGDSLNLTIPEPGPETIKNVCGAVKCVV 544

Query: 544 IIISGRPVVIEPYISSMDALVAAWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQLPM 603
           ++ISGRPVVI+PYI S+DALVAAWLPGTEGKGI+DVLFGDYGFTGKL +TWFK+VDQLPM
Sbjct: 545 VVISGRPVVIQPYIDSIDALVAAWLPGTEGKGISDVLFGDYGFTGKLSQTWFKSVDQLPM 604

Query: 604 NFGDPHYDPLFSFGYGLTTEPIKA 628
           NFGD HYDPLF  G+GLTT+P+KA
Sbjct: 605 NFGDAHYDPLFPLGFGLTTQPVKA 628

BLAST of CmoCh06G009350 vs. NCBI nr
Match: gi|659130018|ref|XP_008464959.1| (PREDICTED: lysosomal beta glucosidase-like [Cucumis melo])

HSP 1 Score: 1039.3 bits (2686), Expect = 2.8e-300
Identity = 494/627 (78.79%), Positives = 557/627 (88.84%), Query Frame = 1

Query: 1   MTKVLIVLMGFLLMFLSETLGTGEQLTYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIER 60
           M K+LI  MGF +  L+E       + YKDP +PLNVRI DLLGRMT+EEKIGQMVQI+R
Sbjct: 1   MAKILIFFMGFFIFCLTEVWAKPRYMRYKDPKQPLNVRINDLLGRMTLEEKIGQMVQIDR 60

Query: 61  VNASADVMKNYFIGSVLSGGGSAPSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAV 120
             AS +VMK Y IGSVLSGGGS PSK AS K W+DMVN+ QKG+LS+RLGIPMIYGIDAV
Sbjct: 61  TVASKEVMKKYLIGSVLSGGGSVPSKEASPKVWIDMVNDFQKGSLSTRLGIPMIYGIDAV 120

Query: 121 HGHNNVYNATIFPHNVGLGATRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWG 180
           HGHNNVY ATIFPHNVGLGATRDP L K IG+ATALE+RATGI Y FAPCIAVC+DPRWG
Sbjct: 121 HGHNNVYKATIFPHNVGLGATRDPNLAKRIGAATALEVRATGISYVFAPCIAVCRDPRWG 180

Query: 181 RCYESYSEDPKIVQEMTEIILGLQGEIPPNSRKGVPYVGGKDKVAGCAKHYVGDGGTTKG 240
           RCYESYSEDPKIVQEMTEII GLQGEIP NSRKGVPYV G++KVA CAKHYVGDGGTTKG
Sbjct: 181 RCYESYSEDPKIVQEMTEIISGLQGEIPSNSRKGVPYVAGREKVAACAKHYVGDGGTTKG 240

Query: 241 INENDTVIDRHSLLSIHMPGYSHSIIKGIATVMASYSSWNGEKMHAHKELLTGFLKNTLN 300
           INEN+T+  RH LLSIHMPGY +SIIKG++TVM SYSSWNG+KMH +++L+TGFLKNTL 
Sbjct: 241 INENNTLASRHGLLSIHMPGYYNSIIKGVSTVMISYSSWNGKKMHENRDLITGFLKNTLR 300

Query: 301 FKGFVISDWQGIDRITTPPHANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNII 360
           F+GFVISDWQGIDRIT+PPHANYTYSIIA +TAG+DMIM+PY+Y EFID +TYLVK N+I
Sbjct: 301 FRGFVISDWQGIDRITSPPHANYTYSIIAGITAGIDMIMVPYNYTEFIDGLTYLVKTNVI 360

Query: 361 PMSRIDDAVWRILRVKFVMGLFENPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKS 420
           P+SRIDDAV RILRVKF+MGLFENPLAD S VNE+GKKEHRELAREAVRKSLVLLKNG+S
Sbjct: 361 PISRIDDAVKRILRVKFIMGLFENPLADSSFVNELGKKEHRELAREAVRKSLVLLKNGES 420

Query: 421 TSTPLLPLPKKTQKILVAGTHANNLGYQCGGWTIEWQGASGNNLTSGTTVLDAIKETVDP 480
              P+LPLPKK  KILVAG+HANNLG+QCGGWTIEWQG  GNNLTSGTT+L AIK+TVDP
Sbjct: 421 ADKPILPLPKKVPKILVAGSHANNLGFQCGGWTIEWQGLGGNNLTSGTTILSAIKDTVDP 480

Query: 481 ETEVTFEEQPNKESLQSHEFSYGIVVVGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMK 540
           +T+V F+E P+ E ++S++FSY IVVVGE+PYAET GDSLNLTIPDPG STIT+VCG +K
Sbjct: 481 KTKVVFKENPDIEFVKSNKFSYAIVVVGEHPYAETFGDSLNLTIPDPGSSTITNVCGVVK 540

Query: 541 CVVIIISGRPVVIEPYISSMDALVAAWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQ 600
           CVVI+ISGRPVV++PYISS+DALVAAWLPGTEGKGI+DVLFGDYGF+GKL RTWFKTVDQ
Sbjct: 541 CVVIVISGRPVVLQPYISSIDALVAAWLPGTEGKGISDVLFGDYGFSGKLSRTWFKTVDQ 600

Query: 601 LPMNFGDPHYDPLFSFGYGLTTEPIKA 628
           LPMN GD HYDPLF FG+GLTT+PIKA
Sbjct: 601 LPMNVGDAHYDPLFPFGFGLTTDPIKA 627

The following BLAST results are available for this feature:
Match Name   E-value  Identity (%)  Description
BGH3B_BACO1  5.5e-78  30.82         Beta-glucosidase BoGH3B OS=Bacteroides ovatus (strain ATCC 8483 / DSM 1896 / JCM...
GLUA_DICDI   4.3e-75  32.33         Lysosomal beta glucosidase OS=Dictyostelium discoideum GN=gluA PE=1 SV=2
BGLX_SALTY   1.0e-55  28.55         Periplasmic beta-glucosidase OS=Salmonella typhimurium (strain LT2 / SGSC1412 / ...
BGLX_ECOLI   2.1e-53  27.93         Periplasmic beta-glucosidase OS=Escherichia coli (strain K12) GN=bglX PE=3 SV=2
BGLC_ASPOR   6.3e-42  26.47         Probable beta-glucosidase C OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40) G...
Match Name        E-value   Identity (%)  Description
A0A0A0LY55_CUCSA  6.7e-309  82.32         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G661750 PE=4 SV=1
A0A0A0LI54_CUCSA  7.7e-305  80.77         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G842090 PE=4 SV=1
A0A0A0LFL8_CUCSA  3.3e-300  78.79         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G842070 PE=4 SV=1
A0A061F0I5_THECC  3.8e-288  77.98         Glycosyl hydrolase family protein OS=Theobroma cacao GN=TCM_025896 PE=4 SV=1
U5FEM9_POPTR      4.2e-287  76.04         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0019s05340g PE=4 SV=1
Match Name   E-value   Identity (%)  Description
AT5G04885.1  5.7e-267  69.92         Glycosyl hydrolase family protein
AT5G20950.1  5.7e-267  69.65         Glycosyl hydrolase family protein
AT5G20940.1  1.5e-251  69.18         Glycosyl hydrolase family protein
AT3G47000.1  3.2e-209  58.17         Glycosyl hydrolase family protein
AT3G62710.1  8.4e-202  56.71         Glycosyl hydrolase family protein
Match Name                        E-value   Identity (%)  Description
gi|659086037|ref|XP_008443733.1|  9.8e-311  82.59         PREDICTED: lysosomal beta glucosidase-like [Cucumis melo]
gi|778665412|ref|XP_011648555.1|  9.7e-309  82.32         PREDICTED: lysosomal beta glucosidase-like [Cucumis sativus]
gi|778685993|ref|XP_011652313.1|  1.1e-304  80.77         PREDICTED: lysosomal beta glucosidase-like [Cucumis sativus]
gi|659130020|ref|XP_008464960.1|  1.2e-303  80.77         PREDICTED: lysosomal beta glucosidase-like [Cucumis melo]
gi|659130018|ref|XP_008464959.1|  2.8e-300  78.79         PREDICTED: lysosomal beta glucosidase-like [Cucumis melo]
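
E-values this small are easier to compare on a negative log10 scale. The sketch below, an illustration rather than part of the database's own tooling, ranks the five NCBI nr hits listed above. The accession-to-E-value mapping is copied by hand from that list, and the helper `neg_log10` is a hypothetical name. Values such as 9.8e-311 fall below the smallest normal double but still parse to a nonzero float in Python; an E-value reported as exactly 0 is mapped to infinity.

```python
import math

# E-values copied from the NCBI nr summary above (accession -> E-value string).
hits = {
    "XP_008443733.1": "9.8e-311",
    "XP_011648555.1": "9.7e-309",
    "XP_011652313.1": "1.1e-304",
    "XP_008464960.1": "1.2e-303",
    "XP_008464959.1": "2.8e-300",
}

def neg_log10(evalue):
    """Convert an E-value string to -log10(E); 0 maps to +infinity."""
    e = float(evalue)
    return math.inf if e == 0.0 else -math.log10(e)

# Most significant hit first (largest -log10 E).
ranked = sorted(hits, key=lambda acc: neg_log10(hits[acc]), reverse=True)
for acc in ranked:
    print(acc, round(neg_log10(hits[acc]), 1))
```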
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR001764  Glyco_hydro_3_N
IPR002772  Glyco_hydro_3_C
IPR017853  Glycoside_hydrolase_SF
IPR026892  Glycoside hydrolase family 3
Vocabulary: Molecular Function
Term        Definition
GO:0004553  hydrolase activity, hydrolyzing O-glycosyl compounds
Vocabulary: Biological Process
Term        Definition
GO:0005975  carbohydrate metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh06G009350.1  CmoCh06G009350.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                                 Source   Source Term         Source Description                             Alignment
IPR001764  Glycoside hydrolase, family 3, N-terminal       PRINTS   PR00133             GLHYDRLASE3                                    coord: 108..124, score: 2.4E-25; coord: 224..240, score: 2.4E-25; coord: 132..151, score: 2.4E-25; coord: 294..312, score: 2.4E-25; coord: 178..194, score: 2.4
IPR001764  Glycoside hydrolase, family 3, N-terminal       GENE3D   G3DSA:3.20.20.300                                                  coord: 27..391, score: 7.7E
IPR001764  Glycoside hydrolase, family 3, N-terminal       PFAM     PF00933             Glyco_hydro_3                                  coord: 47..375, score: 2.3
IPR002772  Glycoside hydrolase family 3 C-terminal domain  GENE3D   G3DSA:3.40.50.1700                                                 coord: 398..621, score: 7.9
IPR002772  Glycoside hydrolase family 3 C-terminal domain  PFAM     PF01915             Glyco_hydro_3_C                                coord: 412..621, score: 3.9
IPR002772  Glycoside hydrolase family 3 C-terminal domain  unknown  SSF52279            Beta-D-glucan exohydrolase, C-terminal domain  coord: 412..621, score: 3.14
IPR017853  Glycoside hydrolase superfamily                 unknown  SSF51445            (Trans)glycosidases                            coord: 26..411, score: 1.68E
IPR026892  Glycoside hydrolase family 3                    PANTHER  PTHR30620           PERIPLASMIC BETA-GLUCOSIDASE-RELATED           coord: 81..625, score: 0.0; coord: 1..62, score:
None       No IPR available                                PANTHER  PTHR30620:SF39      SUBFAMILY NOT NAMED                            coord: 81..625, score: 0.0; coord: 1..62, score:
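
The "coord: start..end" entries in the Alignment column give the residue range each signature covers on the protein. As a rough sketch, assuming the coordinates are 1-based and inclusive (the usual InterProScan convention, not stated on this page), the following Python extracts the ranges and computes the span of each match. The function names `parse_coords` and `span_length` are hypothetical.

```python
import re

# Matches "coord: 47..375" even when it is fused to a preceding score field.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def parse_coords(text):
    """Return every (start, end) residue range found in an Alignment cell."""
    return [(int(start), int(end)) for start, end in COORD_RE.findall(text)]

def span_length(start, end):
    """Length of a range, assuming 1-based inclusive coordinates."""
    return end - start + 1

# Pfam matches from the table above: N-terminal and C-terminal GH3 domains.
pfam_n = parse_coords("coord: 47..375")[0]
pfam_c = parse_coords("coord: 412..621")[0]
print(span_length(*pfam_n), span_length(*pfam_c))
```

Under that assumption the two Pfam domains cover 329 and 210 residues and do not overlap, which fits the N-terminal plus C-terminal domain layout suggested by the InterPro terms.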

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None