Cp4.1LG08g06820 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG08g06820
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionBeta-glucosidase
LocationCp4.1LG08 : 5021413 .. 5024096 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GGTTCAGCCAAGGATCCATCCAAGATGACCAAGGTTCTCATTGTTTTAATGGGATTTTTTCTCATGTTTTTGTCGGAGACATTGGGAACAGGAGAACAGCTCAAATACAAAGATCCAACAAAACCATTAAATGTTCGAATCAAGGATCTACTTGGTCGGATGACTGTGGAGGAAAAAATAGGTCAAATGGTGCAAATTGAAAGGGTTAATGCTTCAGCTGATGTCATGAAAAACTATTTCATTGGTAAATATCTTACTCCTAATCTAATTCTCTCCTCACTCTCTTAAAGTACTAATACAATAATATTAAATGTAGGGAGTGTATTGAGTGGTGGAGGTAGTGCCCCATCGAAGAATGCCTCAGCTAAGGATTGGGTGGACATGGTGAATGAAATCCAAAAAGGAGCTTTGTCGAGTAGGCTAGGAATTCCAATGATATATGGAATTGATGCTGTTCATGGCCACAACAATGTCTATAATGCAACCATCTTCCCTCACAACGTTGGACTTGGAGCTACTAGGTTACATAAATACTCCCTCTCTATTTTTTATTTTTTATTTTTATTCAACACTAATCATATACCTTATTCATTCATGTAAAAATGATTTTCCTTTGATTAAACAAACAGAGACCCTCAACTTGTGAAGAATATTGGGAGTGCTACTGCCCTTGAAATTAGAGCAACTGGCATTCCGTATGCTTTTGCGCCTTGTATAGCGGTAATTATCTCTGTGTTTCAATGGAGAAAGATCATCTTAACATTTCTGTTACTATTTGTTGTTTATTATGTCATGTGATTTATTTTGGAGGTTTTTGTATTTCAATTAGGTTTGTAAAGATCCACGATGGGGTCGGTGTTATGAAAGCTATAGTGAAGACCCTAAGATTGTTCAAGAAATGACTGAGATCATACTAGGTTTACAAGGAGAGATTCCACCTAATTCTCGCAAGGGTGTTCCTTATGTCGGTGGGAAGTGAGTCTTCAAATATTAATCAGTTTCTTTAATCGTGTACGTGTTGAAAATAACCACTATTTTCATTTATTTATGAATAGAGACAAAGTGGTTGGATGTGCAAAACATTACGTTGGTGATGGTGGAACAACTAAGGGTATCAACGAGAACGACACAGTAATAGACAGACATAGCTTACTTAGCATTCACATGCCAGGTTACTATCACTCGATCATTAAGGGAATTGCAACCGTTATGGCTTCTTATTCAAGTTGGAATGGACAGAAAATGCATGCCCACAAAGAACTCCTTACTGACTTTCTCAAAAACACTCTCAATTTTAAGGTAATATGACTTCTTGATTCTACTTTGTGATTGTCACTTTGTTGCAGACATTTGATGATTTGATTTAAAGTGAGCTTAATATTACTTCGGTCTTTTTAGGGCTTTGTGATCTCAGATTGGCAGGGTATTGATAGGATTACATCTCCACCACATGCTAATTATACATATTCCATCATAGCCAGCGTTACTGCAGGTGTTGACATGGTTAGTATTATGCTCGACACTTTAAAACGTATTTCAACAAATAATTATATGCTCATGTACACTTTTCAATACAGATAATGATACCGTACGACTACAAGGAGTTCATCGATAAAATTACCTACTTGGTAAAAAATAACATAATTCCTATGAGTCGAATTGACGATGCAGTTTGGAGAATTTTGAGAGTTAAATTTGTTATGGGTTTATTTGAGAACCCATTAGCTGACTACAGTTTGGTTAATGAGATTGGTAAAAAGGTAACTATATGGAATATGTCTTCAATATTATGAAATAAAATATAGAAATTATGTTTTAACTTATGTTTTTCTCACAATAGGAGCATAGAGAACTAGCTAGAGAAGCCGTAAGAAAATCACTAGTGTTATTAAAGAATGGGAAATCGACTTCAACACCATTGCTTCCTCTTCCAAAGAAGGCACAAAAAATACTTGTTGCTGGCACCCATGCAAACAACCTTGGATATCAATGTGGTGGTTGGACTATCGAATGGCAAGGAGCTAGTGGCAACAACCTTACAAGTGGTATGAAAATATTACAATATAAATTGTGAAGTTTTTCCTTATAAATTATTGGGATATTTATATCATCTCCTTTTGTTAGGCACAACTGTGCTTGATGCTATAAAAGAAACGGTTGGTCCTGAAACAGAAGTTGCCTTTGAGGAAAAACCAAATAAGGAGAGTCTTCAATCACATGAGTTTTCTTATGGCATTGTTGTAGTGGGAGAGTATCCATATGCAGAAACTAATGGTGATAGCTTGAATTTGACAATTCCCGACCCTGGTCCAAGCACCATCACAGATGTTTGTGGCGCTATGAAATGTGTAGTTATATTAATCTCAGGACGGCCTGTAGTAATCGAACCTTATATTTCTTCAATGGATGCACTTGTTGCTGCTTGGCTTCCAGGAACTGAAGGAAAAGGCATTACTGATGTATTGTTTGGAGATTATGGTTTTACTGGAAAACTTCCCCGAACGTGGTTCAAAACTGTTGATCAATTGCCAATGAACTTTGGAGATCCTCATTATGATCCTCTTTTCTCCTTTGGATATGGTCTTACTACAGAACCCATCAAAGCTTAATGAATGACTGCGTCAACTTAACTATTTTACATATATAAGTGCATGGCCTCCAATATTGTATACTCATATATATAA

mRNA sequence

GGTTCAGCCAAGGATCCATCCAAGATGACCAAGGTTCTCATTGTTTTAATGGGATTTTTTCTCATGTTTTTGTCGGAGACATTGGGAACAGGAGAACAGCTCAAATACAAAGATCCAACAAAACCATTAAATGTTCGAATCAAGGATCTACTTGGTCGGATGACTGTGGAGGAAAAAATAGGTCAAATGGTGCAAATTGAAAGGGTTAATGCTTCAGCTGATGTCATGAAAAACTATTTCATTGGGAGTGTATTGAGTGGTGGAGGTAGTGCCCCATCGAAGAATGCCTCAGCTAAGGATTGGGTGGACATGGTGAATGAAATCCAAAAAGGAGCTTTGTCGAGTAGGCTAGGAATTCCAATGATATATGGAATTGATGCTGTTCATGGCCACAACAATGTCTATAATGCAACCATCTTCCCTCACAACGTTGGACTTGGAGCTACTAGAGACCCTCAACTTGTGAAGAATATTGGGAGTGCTACTGCCCTTGAAATTAGAGCAACTGGCATTCCGTATGCTTTTGCGCCTTGTATAGCGGTTTGTAAAGATCCACGATGGGGTCGGTGTTATGAAAGCTATAGTGAAGACCCTAAGATTGTTCAAGAAATGACTGAGATCATACTAGGTTTACAAGGAGAGATTCCACCTAATTCTCGCAAGGGTGTTCCTTATGTCGGTGGGAAAGACAAAGTGGTTGGATGTGCAAAACATTACGTTGGTGATGGTGGAACAACTAAGGGTATCAACGAGAACGACACAGTAATAGACAGACATAGCTTACTTAGCATTCACATGCCAGGTTACTATCACTCGATCATTAAGGGAATTGCAACCGTTATGGCTTCTTATTCAAGTTGGAATGGACAGAAAATGCATGCCCACAAAGAACTCCTTACTGACTTTCTCAAAAACACTCTCAATTTTAAGGGCTTTGTGATCTCAGATTGGCAGGGTATTGATAGGATTACATCTCCACCACATGCTAATTATACATATTCCATCATAGCCAGCGTTACTGCAGGTGTTGACATGATAATGATACCGTACGACTACAAGGAGTTCATCGATAAAATTACCTACTTGGTAAAAAATAACATAATTCCTATGAGTCGAATTGACGATGCAGTTTGGAGAATTTTGAGAGTTAAATTTGTTATGGGTTTATTTGAGAACCCATTAGCTGACTACAGTTTGGTTAATGAGATTGGTAAAAAGGAGCATAGAGAACTAGCTAGAGAAGCCGTAAGAAAATCACTAGTGTTATTAAAGAATGGGAAATCGACTTCAACACCATTGCTTCCTCTTCCAAAGAAGGCACAAAAAATACTTGTTGCTGGCACCCATGCAAACAACCTTGGATATCAATGTGGTGGTTGGACTATCGAATGGCAAGGAGCTAGTGGCAACAACCTTACAAGTGGCACAACTGTGCTTGATGCTATAAAAGAAACGGTTGGTCCTGAAACAGAAGTTGCCTTTGAGGAAAAACCAAATAAGGAGAGTCTTCAATCACATGAGTTTTCTTATGGCATTGTTGTAGTGGGAGAGTATCCATATGCAGAAACTAATGGTGATAGCTTGAATTTGACAATTCCCGACCCTGGTCCAAGCACCATCACAGATGTTTGTGGCGCTATGAAATGTGTAGTTATATTAATCTCAGGACGGCCTGTAGTAATCGAACCTTATATTTCTTCAATGGATGCACTTGTTGCTGCTTGGCTTCCAGGAACTGAAGGAAAAGGCATTACTGATGTATTGTTTGGAGATTATGGTTTTACTGGAAAACTTCCCCGAACGTGGTTCAAAACTGTTGATCAATTGCCAATGAACTTTGGAGATCCTCATTATGATCCTCTTTTCTCCTTTGGATATGGTCTTACTACAGAACCCATCAAAGCTTAATGAATGACTGCGTCAACTTAACTATTTTACATATATAAGTGCATGGCCTCCAATATTGTATACTCATATATATAA

Coding sequence (CDS)

ATGACCAAGGTTCTCATTGTTTTAATGGGATTTTTTCTCATGTTTTTGTCGGAGACATTGGGAACAGGAGAACAGCTCAAATACAAAGATCCAACAAAACCATTAAATGTTCGAATCAAGGATCTACTTGGTCGGATGACTGTGGAGGAAAAAATAGGTCAAATGGTGCAAATTGAAAGGGTTAATGCTTCAGCTGATGTCATGAAAAACTATTTCATTGGGAGTGTATTGAGTGGTGGAGGTAGTGCCCCATCGAAGAATGCCTCAGCTAAGGATTGGGTGGACATGGTGAATGAAATCCAAAAAGGAGCTTTGTCGAGTAGGCTAGGAATTCCAATGATATATGGAATTGATGCTGTTCATGGCCACAACAATGTCTATAATGCAACCATCTTCCCTCACAACGTTGGACTTGGAGCTACTAGAGACCCTCAACTTGTGAAGAATATTGGGAGTGCTACTGCCCTTGAAATTAGAGCAACTGGCATTCCGTATGCTTTTGCGCCTTGTATAGCGGTTTGTAAAGATCCACGATGGGGTCGGTGTTATGAAAGCTATAGTGAAGACCCTAAGATTGTTCAAGAAATGACTGAGATCATACTAGGTTTACAAGGAGAGATTCCACCTAATTCTCGCAAGGGTGTTCCTTATGTCGGTGGGAAAGACAAAGTGGTTGGATGTGCAAAACATTACGTTGGTGATGGTGGAACAACTAAGGGTATCAACGAGAACGACACAGTAATAGACAGACATAGCTTACTTAGCATTCACATGCCAGGTTACTATCACTCGATCATTAAGGGAATTGCAACCGTTATGGCTTCTTATTCAAGTTGGAATGGACAGAAAATGCATGCCCACAAAGAACTCCTTACTGACTTTCTCAAAAACACTCTCAATTTTAAGGGCTTTGTGATCTCAGATTGGCAGGGTATTGATAGGATTACATCTCCACCACATGCTAATTATACATATTCCATCATAGCCAGCGTTACTGCAGGTGTTGACATGATAATGATACCGTACGACTACAAGGAGTTCATCGATAAAATTACCTACTTGGTAAAAAATAACATAATTCCTATGAGTCGAATTGACGATGCAGTTTGGAGAATTTTGAGAGTTAAATTTGTTATGGGTTTATTTGAGAACCCATTAGCTGACTACAGTTTGGTTAATGAGATTGGTAAAAAGGAGCATAGAGAACTAGCTAGAGAAGCCGTAAGAAAATCACTAGTGTTATTAAAGAATGGGAAATCGACTTCAACACCATTGCTTCCTCTTCCAAAGAAGGCACAAAAAATACTTGTTGCTGGCACCCATGCAAACAACCTTGGATATCAATGTGGTGGTTGGACTATCGAATGGCAAGGAGCTAGTGGCAACAACCTTACAAGTGGCACAACTGTGCTTGATGCTATAAAAGAAACGGTTGGTCCTGAAACAGAAGTTGCCTTTGAGGAAAAACCAAATAAGGAGAGTCTTCAATCACATGAGTTTTCTTATGGCATTGTTGTAGTGGGAGAGTATCCATATGCAGAAACTAATGGTGATAGCTTGAATTTGACAATTCCCGACCCTGGTCCAAGCACCATCACAGATGTTTGTGGCGCTATGAAATGTGTAGTTATATTAATCTCAGGACGGCCTGTAGTAATCGAACCTTATATTTCTTCAATGGATGCACTTGTTGCTGCTTGGCTTCCAGGAACTGAAGGAAAAGGCATTACTGATGTATTGTTTGGAGATTATGGTTTTACTGGAAAACTTCCCCGAACGTGGTTCAAAACTGTTGATCAATTGCCAATGAACTTTGGAGATCCTCATTATGATCCTCTTTTCTCCTTTGGATATGGTCTTACTACAGAACCCATCAAAGCTTAA

Protein sequence

MTKVLIVLMGFFLMFLSETLGTGEQLKYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIERVNASADVMKNYFIGSVLSGGGSAPSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAVHGHNNVYNATIFPHNVGLGATRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCYESYSEDPKIVQEMTEIILGLQGEIPPNSRKGVPYVGGKDKVVGCAKHYVGDGGTTKGINENDTVIDRHSLLSIHMPGYYHSIIKGIATVMASYSSWNGQKMHAHKELLTDFLKNTLNFKGFVISDWQGIDRITSPPHANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNIIPMSRIDDAVWRILRVKFVMGLFENPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKSTSTPLLPLPKKAQKILVAGTHANNLGYQCGGWTIEWQGASGNNLTSGTTVLDAIKETVGPETEVAFEEKPNKESLQSHEFSYGIVVVGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMKCVVILISGRPVVIEPYISSMDALVAAWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQLPMNFGDPHYDPLFSFGYGLTTEPIKA
BLAST of Cp4.1LG08g06820 vs. Swiss-Prot
Match: BGH3B_BACO1 (Beta-glucosidase BoGH3B OS=Bacteroides ovatus (strain ATCC 8483 / DSM 1896 / JCM 5824 / NCTC 11153) GN=BACOVA_02659 PE=1 SV=1)

HSP 1 Score: 294.7 bits (753), Expect = 2.5e-78
Identity = 198/649 (30.51%), Postives = 330/649 (50.85%), Query Frame = 1

Query: 31  PTKP-LNVRIKDLLGRMTVEEKIGQMVQIERVNASAD------------------VMKNY 90
           PT P +   I++ L +MT+E+KIGQM +I  ++  +D                  V+  Y
Sbjct: 30  PTDPAIETHIREWLQKMTLEQKIGQMCEIT-IDVVSDLETSRKKGFCLSEAMLDTVIGKY 89

Query: 91  FIGSVLSGGGSAPSKNASAKD-WVDMVNEIQKGALSSRLGIPMIYGIDAVHGHNNVYNAT 150
            +GS+L+     P   A  K+ W + + +IQ+ ++   +GIP IYG+D +HG     + T
Sbjct: 90  KVGSLLN----VPLGVAQKKEKWAEAIKQIQEKSMKE-IGIPCIYGVDQIHGTTYTLDGT 149

Query: 151 IFPHNVGLGATRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCYESYSEDP 210
           +FP  + +GAT + +L +     +A E +A  IP+ FAP + + +DPRW R +E+Y ED 
Sbjct: 150 MFPQGINMGATFNRELTRRGAKISAYETKAGCIPWTFAPVVDLGRDPRWARMWENYGEDC 209

Query: 211 KIVQEM-TEIILGLQGEIPPNSRKGVPYVGGKDKVVGCAKHYVGDGGTTKGINENDTVID 270
            +  EM    + G QGE P           G+  V  C KHY+G G    G +   + I 
Sbjct: 210 YVNAEMGVSAVKGFQGEDPNRI--------GEYNVAACMKHYMGYGVPVSGKDRTPSSIS 269

Query: 271 RHSLLSIHMPGYYHSIIKGIATVMASYSSWNGQKMHAHKELLTDFLKNTLNFKGFVISDW 330
           R  +   H   +  ++ +G  +VM +    NG   HA++ELLT++LK  LN+ G +++DW
Sbjct: 270 RSDMREKHFAPFLAAVRQGALSVMVNSGVDNGLPFHANRELLTEWLKEDLNWDGLIVTDW 329

Query: 331 QGIDRITSPPH--ANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNIIPMSRIDD 390
             I+ + +  H  A    ++   + AG+DM M+PY+   F D +  LV+   + M RIDD
Sbjct: 330 ADINNLCTRDHIAATKKEAVKIVINAGIDMSMVPYEV-SFCDYLKELVEEGEVSMERIDD 389

Query: 391 AVWRILRVKFVMGLFENPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKSTSTPLLP 450
           AV R+LR+K+ +GLF++P  D    ++ G KE   +A +A  +S VLLKN  +    +LP
Sbjct: 390 AVARVLRLKYRLGLFDHPYWDIKKYDKFGSKEFAAVALQAAEESEVLLKNDGN----ILP 449

Query: 451 LPKKAQKILVAGTHANNLGYQCGGWTIEWQGASGNNLTSG-TTVLDAIKETVGPE----- 510
           +  K +KIL+ G +AN++    GGW+  WQG   +       T+ +A+ E  G E     
Sbjct: 450 I-AKGKKILLTGPNANSMRCLNGGWSYSWQGHVADEYAQAYHTIYEALCEKYGKENIIYE 509

Query: 511 ---TEVAF-------EEKPNKESLQSHEFSYGIVV--VGEYPYAETNGDSLNLTIPDPGP 570
              T  ++       E KP  E   +      I++  +GE  Y ET G+  +LT+ +   
Sbjct: 510 PGVTYASYKNDNWWEENKPETEKPVAAAAQADIIITCIGENSYCETPGNLTDLTLSENQR 569

Query: 571 STITDVCGAMKCVVILIS-GRPVVIEPYISSMDALVAAWLPGT-EGKGITDVLFGDYGFT 622
           + +  +    K +V++++ GRP +I   +    A+V   LP    G  + ++L GD  F+
Sbjct: 570 NLVKALAATGKPIVLVLNQGRPRIINDIVPLAKAVVNIMLPSNYGGDALANLLAGDANFS 629

BLAST of Cp4.1LG08g06820 vs. Swiss-Prot
Match: GLUA_DICDI (Lysosomal beta glucosidase OS=Dictyostelium discoideum GN=gluA PE=1 SV=2)

HSP 1 Score: 288.9 bits (738), Expect = 1.3e-76
Identity = 205/634 (32.33%), Postives = 321/634 (50.63%), Query Frame = 1

Query: 39  IKDLLGRMTVEEKIGQMVQIERVNAS------------ADVMKNYFIGSVL----SGGGS 98
           + +L+ +M++ EKIGQM Q++    +            A   K Y+IGS L    SGG +
Sbjct: 80  VDNLMSKMSITEKIGQMTQLDITTLTSPNTITINETTLAYYAKTYYIGSYLNSPVSGGLA 139

Query: 99  APSKNASAKDWVDMVNEIQKGALS-SRLGIPMIYGIDAVHGHNNVYNATIFPHNVGLGAT 158
               + ++  W+DM+N IQ   +  S   IPMIYG+D+VHG N V+ AT+FPHN GL AT
Sbjct: 140 GDIHHINSSVWLDMINTIQTIVIEGSPNKIPMIYGLDSVHGANYVHKATLFPHNTGLAAT 199

Query: 159 RDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCYESYSEDPKIVQEM-TEII 218
            + +        T+ +  A GIP+ FAP + +   P W R YE++ EDP +   M    +
Sbjct: 200 FNIEHATTAAQITSKDTVAVGIPWVFAPVLGIGVQPLWSRIYETFGEDPYVASMMGAAAV 259

Query: 219 LGLQGEIPPNSRKGVPYVGGKDKVVGCAKHYVGDGGTTKGINENDTVIDRHSLLSIHMPG 278
            G QG    N+    P        V  AKHY G    T G +     I    L    +P 
Sbjct: 260 RGFQG---GNNSFDGPI--NAPSAVCTAKHYFGYSDPTSGKDRTAAWIPERMLRRYFLPS 319

Query: 279 YYHSII-KGIATVMASYSSWNGQKMHAHKELLTDFLKNTLNFKGFVISDWQGIDRITSPP 338
           +  +I   G  T+M +    NG  MH   + LT+ L+  L F+G  ++DWQ I+++    
Sbjct: 320 FAEAITGAGAGTIMINSGEVNGVPMHTSYKYLTEVLRGELQFEGVAVTDWQDIEKLVYFH 379

Query: 339 H--ANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNIIPMSRIDDAVWRILRVKF 398
           H   +   +I+ ++ AG+DM M+P D   F   +  +V    +P SR+D +V RIL +K+
Sbjct: 380 HTAGSAEEAILQALDAGIDMSMVPLDL-SFPIILAEMVAAGTVPESRLDLSVRRILNLKY 439

Query: 399 VMGLFENPL--ADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKSTSTPLLPLPKKAQK- 458
            +GLF NP    + ++V+ IG+ + RE A     +S+ LL+N  +    +LPL     K 
Sbjct: 440 ALGLFSNPYPNPNAAIVDTIGQVQDREAAAATAEESITLLQNKNN----ILPLNTNTIKN 499

Query: 459 ILVAGTHANNLGYQCGGWTIEWQGA-SGNNLTSGTTVLDAIKE------------TVGPE 518
           +L+ G  A+++    GGW++ WQGA   +    GT++L  ++E            T+G E
Sbjct: 500 VLLTGPSADSIRNLNGGWSVHWQGAYEDSEFPFGTSILTGLREITNDTADFNIQYTIGHE 559

Query: 519 TEVAFEEKPNKESLQSHEFS-YGIVVVGEYPYAETNGDSLNLTIPDPGPSTITD--VCGA 578
             V   +    E+++  + S   +VV+GE P AET GD  +L++ DP    +    V   
Sbjct: 560 IGVPTNQTSIDEAVELAQSSDVVVVVIGELPEAETPGDIYDLSM-DPNEVLLLQQLVDTG 619

Query: 579 MKCVVILISGRPVVIEP-YISSMDALVAAWLPGTE-GKGITDVLFGDYGFTGKLPRTWFK 622
              V+IL+  RP ++ P  + S  A++ A+LPG+E GK I ++L G+   +G+LP T+  
Sbjct: 620 KPVVLILVEARPRILPPDLVYSCAAVLMAYLPGSEGGKPIANILMGNVNPSGRLPLTYPG 679

BLAST of Cp4.1LG08g06820 vs. Swiss-Prot
Match: BGLX_SALTY (Periplasmic beta-glucosidase OS=Salmonella typhimurium (strain LT2 / SGSC1412 / ATCC 700720) GN=bglX PE=3 SV=2)

HSP 1 Score: 224.2 bits (570), Expect = 4.1e-57
Identity = 187/648 (28.86%), Postives = 294/648 (45.37%), Query Frame = 1

Query: 39  IKDLLGRMTVEEKIGQMVQIERVNASADVMKNYFIGSVLSGGGSAPSKNASAKDWVDMVN 98
           + DLL +MTV+EKIGQ+     ++   D  K      +  G   A     + +D   M +
Sbjct: 38  VTDLLKKMTVDEKIGQL---RLISVGPDNPKEAIREMIKDGQVGAIFNTVTRQDIRQMQD 97

Query: 99  EIQKGALSSRLGIPMIYGIDAVHGHNNVYNATIFPHNVGLGATRDPQLVKNIGSATALEI 158
           ++      SRL IP+ +  D VHG       T+FP ++GL ++ +   V+ +G  +A E 
Sbjct: 98  QVMA---LSRLKIPLFFAYDVVHGQR-----TVFPISLGLASSFNLDAVRTVGRVSAYEA 157

Query: 159 RATGIPYAFAPCIAVCKDPRWGRCYESYSEDPKIVQEMTE-IILGLQGEIPPNSRKGVPY 218
              G+   +AP + V +DPRWGR  E + ED  +   M E ++  +QG+ P +       
Sbjct: 158 ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPAD------- 217

Query: 219 VGGKDKVVGCAKHYVGDGGTTKGINENDTVIDRHSLLSIHMPGYYHSIIKGIATVMASYS 278
              +  V+   KH+   G    G   N   +    L + +MP Y   +  G   VM + +
Sbjct: 218 ---RYSVMTSVKHFAAYGAVEGGKEYNTVDMSSQRLFNDYMPPYKAGLDAGSGAVMVALN 277

Query: 279 SWNGQKMHAHKELLTDFLKNTLNFKGFVISDWQGI-DRITSPPHANYTYSIIASVTAGVD 338
           S NG    +   LL D L++   FKG  +SD   I + I     A+   ++  ++ AGVD
Sbjct: 278 SLNGTPATSDSWLLKDVLRDEWGFKGITVSDHGAIKELIKHGTAADPEDAVRVALKAGVD 337

Query: 339 MIMIPYDYKEFIDKITYLVKNNIIPMSRIDDAVWRILRVKFVMGLFENPLADYSLVNEIG 398
           M M    Y +++     L+K+  + M+ +DDA   +L VK+ MGLF +P +       +G
Sbjct: 338 MSMADEYYSKYLPG---LIKSGKVTMAELDDATRHVLNVKYDMGLFNDPYS------HLG 397

Query: 399 KKE------------HRELAREAVRKSLVLLKNGKSTSTPLLPLPKKAQKILVAGTHANN 458
            KE            HR+ ARE  R+S+VLLKN   T    LPL KK+  I V G  A++
Sbjct: 398 PKESDPVDTNAESRLHRKEAREVARESVVLLKNRLET----LPL-KKSGTIAVVGPLADS 457

Query: 459 LGYQCGGWTIEWQG----------------------ASGNNLTSGTTVLDAIK-----ET 518
                G W+                           A G N+T+   ++D +        
Sbjct: 458 QRDVMGSWSAAGVANQSVTVLAGIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVK 517

Query: 519 VGPETEVAFEEKPNKESLQSHEFSYGIVVVGEYP-YAETNGDSLNLTIPDPGPSTITDVC 578
           + P +  A  ++  + + Q+      + VVGE    A       N+TIP      IT + 
Sbjct: 518 IDPRSPQAMIDEAVQAAKQADVV---VAVVGESQGMAHEASSRTNITIPQSQRDLITALK 577

Query: 579 GAMK-CVVILISGRPVVIEPYISSMDALVAAWLPGTEG-KGITDVLFGDYGFTGKLPRTW 622
              K  V++L++GRP+ +       DA++  W  GTEG   I DVLFGDY  +GKLP ++
Sbjct: 578 ATGKPLVLVLMNGRPLALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPISF 637

BLAST of Cp4.1LG08g06820 vs. Swiss-Prot
Match: BGLX_ECOLI (Periplasmic beta-glucosidase OS=Escherichia coli (strain K12) GN=bglX PE=3 SV=2)

HSP 1 Score: 216.5 bits (550), Expect = 8.5e-55
Identity = 185/651 (28.42%), Postives = 294/651 (45.16%), Query Frame = 1

Query: 39  IKDLLGRMTVEEKIGQMVQIERVNASADVMKNYFIGSVLSGGGSAPSKNASAKDWVDMVN 98
           + +LL +MTV+EKIGQ+     ++   D  K      +  G   A     + +D   M +
Sbjct: 38  VTELLKKMTVDEKIGQL---RLISVGPDNPKEAIREMIKDGQVGAIFNTVTRQDIRAMQD 97

Query: 99  EIQKGALSSRLGIPMIYGIDAVHGHNNVYNATIFPHNVGLGATRDPQLVKNIGSATALEI 158
           ++ +    SRL IP+ +  D +HG       T+FP ++GL ++ +   VK +G  +A E 
Sbjct: 98  QVME---LSRLKIPLFFAYDVLHGQR-----TVFPISLGLASSFNLDAVKTVGRVSAYEA 157

Query: 159 RATGIPYAFAPCIAVCKDPRWGRCYESYSEDPKIVQEMTEIIL-GLQGEIPPNSRKGVPY 218
              G+   +AP + V +DPRWGR  E + ED  +   M + ++  +QG+ P +       
Sbjct: 158 ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPAD------- 217

Query: 219 VGGKDKVVGCAKHYVGDGGTTKGINENDTVIDRHSLLSIHMPGYYHSIIKGIATVMASYS 278
              +  V+   KH+   G    G   N   +    L + +MP Y   +  G   VM + +
Sbjct: 218 ---RYSVMTSVKHFAAYGAVEGGKEYNTVDMSPQRLFNDYMPPYKAGLDAGSGAVMVALN 277

Query: 279 SWNGQKMHAHKELLTDFLKNTLNFKGFVISDWQGI-DRITSPPHANYTYSIIASVTAGVD 338
           S NG    +   LL D L++   FKG  +SD   I + I     A+   ++  ++ +G++
Sbjct: 278 SLNGTPATSDSWLLKDVLRDQWGFKGITVSDHGAIKELIKHGTAADPEDAVRVALKSGIN 337

Query: 339 MIMIPYDYKEFIDKITYLVKNNIIPMSRIDDAVWRILRVKFVMGLFENPLADYSLVNEIG 398
           M M    Y +++     L+K+  + M+ +DDA   +L VK+ MGLF +P +       +G
Sbjct: 338 MSMSDEYYSKYLPG---LIKSGKVTMAELDDAARHVLNVKYDMGLFNDPYS------HLG 397

Query: 399 KKE------------HRELAREAVRKSLVLLKNGKSTSTPLLPLPKKAQKILVAGTHANN 458
            KE            HR+ ARE  R+SLVLLKN   T    LPL KK+  I V G  A++
Sbjct: 398 PKESDPVDTNAESRLHRKEAREVARESLVLLKNRLET----LPL-KKSATIAVVGPLADS 457

Query: 459 LGYQCGGWTIEWQGASGNNLTSGTTVLDAIKETVGPETEVAFEEKPN------------- 518
                G W+      +        TVL  IK  VG   +V + +  N             
Sbjct: 458 KRDVMGSWS------AAGVADQSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQ 517

Query: 519 ----------------KESLQSHEFSYGIV-VVGEYP-YAETNGDSLNLTIPDPGPSTIT 578
                            E++Q+ + S  +V VVGE    A       ++TIP      I 
Sbjct: 518 YEEAVKVDPRSPQEMIDEAVQTAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIA 577

Query: 579 DVCGAMK-CVVILISGRPVVIEPYISSMDALVAAWLPGTEG-KGITDVLFGDYGFTGKLP 622
            +    K  V++L++GRP+ +       DA++  W  GTEG   I DVLFGDY  +GKLP
Sbjct: 578 ALKATGKPLVLVLMNGRPLALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNPSGKLP 637

BLAST of Cp4.1LG08g06820 vs. Swiss-Prot
Match: BGLC_ASPOR (Probable beta-glucosidase C OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40) GN=bglC PE=3 SV=2)

HSP 1 Score: 174.9 bits (442), Expect = 2.8e-42
Identity = 171/648 (26.39%), Postives = 269/648 (41.51%), Query Frame = 1

Query: 28  YKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIERVNASADV---------MKNYFIGSVLS 87
           YKD +  ++ R+ DLL RMT+EEK GQ+     ++   D            +  IG    
Sbjct: 46  YKDASYCIDERVDDLLARMTIEEKAGQLFHTRLMDGPLDDEGSGNNAHNSTSNMIGEKHM 105

Query: 88  GGGSAPSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAVHGHN-NV---YNATIF-- 147
              +  S   +A +  + +N IQ+ AL +RLGIP+    D  H    NV   + A +F  
Sbjct: 106 THFNLASDITNATETAEFINRIQELALQTRLGIPVTVSTDPRHSFTENVGTGFKAGVFSQ 165

Query: 148 -PHNVGLGATRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCYESYSEDPK 207
            P ++GL A RDP +V+        E  A GI  A  P + +  +PRW R   ++ E+  
Sbjct: 166 WPESIGLAALRDPYVVRKFAEVAKEEYIAVGIRAALHPQVDLSTEPRWARISNTWGENST 225

Query: 208 IVQEM-TEIILGLQGE-IPPNSRKGVPYVGGKDKVVGCAKHYVGDGGTTKG------INE 267
           +  E+  E I G QG+ + P S K V             KH+ G G    G        +
Sbjct: 226 LTSELLVEYIKGFQGDKLGPQSVKTV------------TKHFPGGGPVENGEDSHFAYGK 285

Query: 268 NDTVIDRHSLLSIHMPGYYHSIIKGIATVMASYSSWNGQKMHA-----HKELLTDFLKNT 327
           N T    +  L  H+  +  +I  G   +M  YS   G +        +K ++T+ L+N 
Sbjct: 286 NQTYPGNN--LEEHLKPFKAAIAAGATEIMPYYSRPIGTEYEPVAFSFNKRIVTELLRNE 345

Query: 328 LNFKGFVISDWQ-----------------GIDRITSPPHANYTYSIIASVTAGVDMIMIP 387
           L F G V++DW                  G++ +T    A         + AG D     
Sbjct: 346 LGFDGIVLTDWGLITDGYIAGQYMPARAWGVENLTELQRA------ARILDAGCDQ---- 405

Query: 388 YDYKEFIDKITYLVKNNIIPMSRIDDAVWRILRVKFVMGLFENPLADYSLVNE-IGKKEH 447
           +  +E  + I  LV+  II   RID +V R+L+ KFV+GLF+NP  D       +G    
Sbjct: 406 FGGEERPELIVQLVQEGIISEDRIDVSVRRLLKEKFVLGLFDNPFVDAEAAGRVVGNDYF 465

Query: 448 RELAREAVRKSLVLLKNGKSTSTPLLPLPK--KAQKILVAGTHANNLGYQCGGWTIEWQG 507
             L REA R+S  LL N +     ++PL K  K+ K  + G +A+ +      W      
Sbjct: 466 VRLGREAQRRSYTLLSNNED----IVPLKKIEKSTKFYIEGFNASFI----ESWNY---- 525

Query: 508 ASGNNLTSGTTVLDAIKETVGP--ETEVAFEEKPNKESLQSHEFSYGIVVVGEYPYAETN 567
                     TV+D+ +E           +E +P                       E N
Sbjct: 526 ----------TVVDSPEEAEYALLRYNAPYEPRPGG--------------------FEAN 585

Query: 568 GDSLNLTIPDPGPSTITDVCGAMKCVVILISGRPVVIEPYISSMDALVAAWLPGTEGKGI 622
             + +L   D   +    +  A+  +V ++  RP VI   I    A+ A++  G++    
Sbjct: 586 MHAGSLAFNDTEKARQAKIYSAVPTIVDIVMDRPAVIPEIIEQAKAVFASY--GSDSNAF 625

BLAST of Cp4.1LG08g06820 vs. TrEMBL
Match: A0A0A0LY55_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G661750 PE=4 SV=1)

HSP 1 Score: 1072.4 bits (2772), Expect = 2.0e-310
Identity = 511/622 (82.15%), Postives = 566/622 (91.00%), Query Frame = 1

Query: 6   IVLMGFFLMFLSETLGTGEQLKYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIERVNASA 65
           I+L+   L+   ET    E  KYKDPT+ LNVRIKDLLGRMT+EEKIGQMVQIERVNAS 
Sbjct: 6   IILIALLLICCFETGAKAENFKYKDPTQRLNVRIKDLLGRMTLEEKIGQMVQIERVNAST 65

Query: 66  DVMKNYFIGSVLSGGGSAPSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAVHGHNN 125
           +VMK YFIGSVLSGGGS PSK ASA+DW++MVNEIQKGALS+RLGIPMIYGIDAVHGHNN
Sbjct: 66  EVMKKYFIGSVLSGGGSVPSKQASAQDWINMVNEIQKGALSTRLGIPMIYGIDAVHGHNN 125

Query: 126 VYNATIFPHNVGLGATRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCYES 185
           VYNATIFPHN+GLGATRDPQL+K IG A+A EIRATGIPYAFAPC+AVC+DPRWGRCYES
Sbjct: 126 VYNATIFPHNIGLGATRDPQLLKRIGVASAREIRATGIPYAFAPCVAVCRDPRWGRCYES 185

Query: 186 YSEDPKIVQEMTEIILGLQGEIPPNSRKGVPYVGGKDKVVGCAKHYVGDGGTTKGINEND 245
           Y EDPKIVQEMTEII GLQGEIPPNSRKGVPYV GK+ VV CAKHYVGDGGTTKGI+EN+
Sbjct: 186 YGEDPKIVQEMTEIIPGLQGEIPPNSRKGVPYVAGKENVVACAKHYVGDGGTTKGIDENN 245

Query: 246 TVIDRHSLLSIHMPGYYHSIIKGIATVMASYSSWNGQKMHAHKELLTDFLKNTLNFKGFV 305
           TVIDRH LLSIHMPGYYHSIIKG+AT+M SYSSWNG+KMHA+K L+TDFLKNTL+F+GFV
Sbjct: 246 TVIDRHGLLSIHMPGYYHSIIKGVATIMVSYSSWNGEKMHANKNLVTDFLKNTLHFQGFV 305

Query: 306 ISDWQGIDRITSPPHANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNIIPMSRI 365
           ISDW+ IDRIT PPHANYTYSI+AS+TAG+DMIMIPY+Y EFID +T LVK+N IP+SRI
Sbjct: 306 ISDWEAIDRITDPPHANYTYSILASITAGLDMIMIPYNYPEFIDGLTNLVKSNYIPISRI 365

Query: 366 DDAVWRILRVKFVMGLFENPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKSTSTPL 425
           DDAV RILRVKFVMGLFENP+AD SLVNE+GK+EHRELAREAVRKSLVLLKNGKS   PL
Sbjct: 366 DDAVKRILRVKFVMGLFENPIADLSLVNELGKQEHRELAREAVRKSLVLLKNGKSADKPL 425

Query: 426 LPLPKKAQKILVAGTHANNLGYQCGGWTIEWQGASGNNLTSGTTVLDAIKETVGPETEVA 485
           LPL KK QKILVAG+HANNLGYQCGGWTIEWQG SGNNLTSGTTVLDAIK+TV P TEV 
Sbjct: 426 LPLEKKTQKILVAGSHANNLGYQCGGWTIEWQGLSGNNLTSGTTVLDAIKDTVDPTTEVI 485

Query: 486 FEEKPNKESLQSHEFSYGIVVVGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMKCVVIL 545
           F E P+K+SLQS  FSY IVVVGE+PYAE NGDSLNLTIPDPGP+TIT+VCG +KC V++
Sbjct: 486 FNENPDKKSLQSDTFSYAIVVVGEHPYAELNGDSLNLTIPDPGPNTITNVCGVIKCAVVI 545

Query: 546 ISGRPVVIEPYISSMDALVAAWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQLPMNF 605
           ISGRPVVI+PY+ S+DALVAAWLPGTEGKGITDVLFGDYGFTGKL +TWFKTVDQLPMNF
Sbjct: 546 ISGRPVVIQPYVDSIDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTVDQLPMNF 605

Query: 606 GDPHYDPLFSFGYGLTTEPIKA 628
           G+P+YDPLF FG+GLTT+PIK+
Sbjct: 606 GNPNYDPLFPFGHGLTTQPIKS 627

BLAST of Cp4.1LG08g06820 vs. TrEMBL
Match: A0A0A0LI54_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G842090 PE=4 SV=1)

HSP 1 Score: 1057.4 bits (2733), Expect = 6.9e-306
Identity = 503/624 (80.61%), Postives = 564/624 (90.38%), Query Frame = 1

Query: 4   VLIVLMGFFLMFLSETLGTGEQLKYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIERVNA 63
           VLI  +G  ++  SETL   E LKYKDP +PLNVRIKDLLGRMT+EEKIGQMVQIER NA
Sbjct: 6   VLITFVGLLVLCFSETLAKAEYLKYKDPKQPLNVRIKDLLGRMTLEEKIGQMVQIERANA 65

Query: 64  SADVMKNYFIGSVLSGGGSAPSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAVHGH 123
           SADVMK YFIGSVLSGGGSAPSK ASAKDWV MVN+IQ+ ALS+RLGIPMIYGIDAVHGH
Sbjct: 66  SADVMKQYFIGSVLSGGGSAPSKQASAKDWVHMVNKIQEAALSTRLGIPMIYGIDAVHGH 125

Query: 124 NNVYNATIFPHNVGLGATRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCY 183
           NNVYNATIFPHN+GLGATRDPQL+K IG+ATALE+RATGIPYAFAPCIAVC+DPRWGRCY
Sbjct: 126 NNVYNATIFPHNIGLGATRDPQLLKRIGAATALEVRATGIPYAFAPCIAVCRDPRWGRCY 185

Query: 184 ESYSEDPKIVQEMTEIILGLQGEIPPNSRKGVPYVGGKDKVVGCAKHYVGDGGTTKGINE 243
           ESY ED  IVQ MTEII GLQG++P N RKGVPYV GK+ V  CAKH+VGDGGTTKGINE
Sbjct: 186 ESYGEDHTIVQAMTEIIPGLQGDVPANIRKGVPYVAGKNNVAACAKHFVGDGGTTKGINE 245

Query: 244 NDTVIDRHSLLSIHMPGYYHSIIKGIATVMASYSSWNGQKMHAHKELLTDFLKNTLNFKG 303
           N+TV+D H L SIHMP YY+SIIKG+ATVM SYSS NG+KMHA+K+L+TDFLKNTL+FKG
Sbjct: 246 NNTVVDGHGLFSIHMPAYYNSIIKGVATVMVSYSSINGEKMHANKKLVTDFLKNTLHFKG 305

Query: 304 FVISDWQGIDRITSPPHANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNIIPMS 363
           FVISDWQGID+IT+PPHANYTYSI+ASV AGVDMIM+PY+Y EFID +TYLVKNN IP+S
Sbjct: 306 FVISDWQGIDKITTPPHANYTYSILASVNAGVDMIMVPYNYTEFIDGLTYLVKNNAIPIS 365

Query: 364 RIDDAVWRILRVKFVMGLFENPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKSTST 423
           RIDDAV RILRVKFVMGLFENPLAD SL+NE+GK+EHRELAREAVRKSLVLLKNGK  + 
Sbjct: 366 RIDDAVKRILRVKFVMGLFENPLADLSLINELGKQEHRELAREAVRKSLVLLKNGKLPNQ 425

Query: 424 PLLPLPKKAQKILVAGTHANNLGYQCGGWTIEWQGASGNNLTSGTTVLDAIKETVGPETE 483
           PLLPLPKKA KILVAGTHAN+LG QCGGWT+EWQG +GNNLTSGTT+L AIK+TV PETE
Sbjct: 426 PLLPLPKKAPKILVAGTHANDLGNQCGGWTMEWQGLTGNNLTSGTTILTAIKDTVDPETE 485

Query: 484 VAFEEKPNKESLQSHEFSYGIVVVGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMKCVV 543
           V F + PN E LQ+H+FSY IVVVGE+PYAETNGDSLNLTIP+PGP TI +VCGA+KCVV
Sbjct: 486 VVFHDNPNAEFLQTHQFSYAIVVVGEHPYAETNGDSLNLTIPEPGPETIKNVCGAVKCVV 545

Query: 544 ILISGRPVVIEPYISSMDALVAAWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQLPM 603
           ++ISGRPVV++PYI S+DA+VAAWLPGTEGKGI+DVLFGDYGFTGKL +TWFK+VDQLPM
Sbjct: 546 VVISGRPVVLQPYIDSIDAVVAAWLPGTEGKGISDVLFGDYGFTGKLSQTWFKSVDQLPM 605

Query: 604 NFGDPHYDPLFSFGYGLTTEPIKA 628
           NFGD HYDPLF FG+GLTT+P+KA
Sbjct: 606 NFGDAHYDPLFPFGFGLTTQPVKA 629

BLAST of Cp4.1LG08g06820 vs. TrEMBL
Match: A0A0A0LFL8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G842070 PE=4 SV=1)

HSP 1 Score: 1042.0 bits (2693), Expect = 3.0e-301
Identity = 494/627 (78.79%), Postives = 557/627 (88.84%), Query Frame = 1

Query: 1   MTKVLIVLMGFFLMFLSETLGTGEQLKYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIER 60
           M K LI  MGFF+  L+E     + ++YKDP +PLNVRI DLLGRMT+EEKIGQMVQI+R
Sbjct: 1   MAKNLIFFMGFFIFCLTEVWAKHQYMRYKDPKQPLNVRISDLLGRMTLEEKIGQMVQIDR 60

Query: 61  VNASADVMKNYFIGSVLSGGGSAPSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAV 120
             AS  VMK Y IGSVLSGGGS PSK AS K W+DMVNE QKG+LS+RLGIPMIYGIDAV
Sbjct: 61  TVASKKVMKKYLIGSVLSGGGSVPSKEASPKVWIDMVNEFQKGSLSTRLGIPMIYGIDAV 120

Query: 121 HGHNNVYNATIFPHNVGLGATRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWG 180
           HGHNNVY ATIFPHNVGLGATRDP L K IG+ATALE+RATGI Y FAPCIAVC+DPRWG
Sbjct: 121 HGHNNVYKATIFPHNVGLGATRDPNLAKRIGAATALEVRATGISYVFAPCIAVCRDPRWG 180

Query: 181 RCYESYSEDPKIVQEMTEIILGLQGEIPPNSRKGVPYVGGKDKVVGCAKHYVGDGGTTKG 240
           RC+ESYSEDPK+VQEMTEII GLQGEIP NSRKGVPYV G++KV  CAKHYVGDGGTTKG
Sbjct: 181 RCFESYSEDPKVVQEMTEIISGLQGEIPSNSRKGVPYVAGREKVAACAKHYVGDGGTTKG 240

Query: 241 INENDTVIDRHSLLSIHMPGYYHSIIKGIATVMASYSSWNGQKMHAHKELLTDFLKNTLN 300
           +NEN+T+  RH LLSIHMPGYY+SIIKG++TVM SYSSWNG+KMH +++L+T FLKNTL 
Sbjct: 241 MNENNTLASRHGLLSIHMPGYYNSIIKGVSTVMISYSSWNGKKMHENRDLITGFLKNTLR 300

Query: 301 FKGFVISDWQGIDRITSPPHANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNII 360
           F+GFVISDWQGIDRITSPPHANYTYSIIA +TAG+DMIM+P++Y EFID +TYLVK N+I
Sbjct: 301 FRGFVISDWQGIDRITSPPHANYTYSIIAGITAGIDMIMVPFNYTEFIDGLTYLVKTNVI 360

Query: 361 PMSRIDDAVWRILRVKFVMGLFENPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKS 420
           P+SRIDDAV RILRVKFVMGLFENPLAD S VNE+GKKEHRELAREAVRKSLVLLKNG+S
Sbjct: 361 PISRIDDAVKRILRVKFVMGLFENPLADSSFVNELGKKEHRELAREAVRKSLVLLKNGES 420

Query: 421 TSTPLLPLPKKAQKILVAGTHANNLGYQCGGWTIEWQGASGNNLTSGTTVLDAIKETVGP 480
              P+LPLPKK  KILVAG+HANNLG+QCGGWTIEWQG  GNNLTSGTT+L AIK+TV P
Sbjct: 421 ADKPILPLPKKVPKILVAGSHANNLGFQCGGWTIEWQGLGGNNLTSGTTILSAIKDTVDP 480

Query: 481 ETEVAFEEKPNKESLQSHEFSYGIVVVGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMK 540
           +T+V F+E P+ E ++S++FSY IVVVGEYPYAET GDSLNLTIP+PGPSTIT+VCGA+K
Sbjct: 481 KTKVVFKENPDMEFVKSNKFSYAIVVVGEYPYAETFGDSLNLTIPEPGPSTITNVCGAVK 540

Query: 541 CVVILISGRPVVIEPYISSMDALVAAWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQ 600
           CVVI+ISGRPVV++PYISS+DALVAAWLPGTEGKGI+DVLFGDYGF+GKL RTWFKTVDQ
Sbjct: 541 CVVIVISGRPVVLQPYISSIDALVAAWLPGTEGKGISDVLFGDYGFSGKLSRTWFKTVDQ 600

Query: 601 LPMNFGDPHYDPLFSFGYGLTTEPIKA 628
           LPMN GD HYDPLF FG+GLTT PIKA
Sbjct: 601 LPMNVGDAHYDPLFPFGFGLTTNPIKA 627

BLAST of Cp4.1LG08g06820 vs. TrEMBL
Match: A0A061F0I5_THECC (Glycosyl hydrolase family protein OS=Theobroma cacao GN=TCM_025896 PE=4 SV=1)

HSP 1 Score: 1006.1 bits (2600), Expect = 1.8e-290
Identity = 474/604 (78.48%), Postives = 542/604 (89.74%), Query Frame = 1

Query: 24   EQLKYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIERVNASADVMKNYFIGSVLSGGGSA 83
            E +KYKDP +PLNVRIKDL+GRMT+EEKIGQMVQIER  ASA+VMK YFIGSVLSGGGS 
Sbjct: 617  EHVKYKDPKQPLNVRIKDLIGRMTLEEKIGQMVQIERAVASAEVMKKYFIGSVLSGGGSV 676

Query: 84   PSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAVHGHNNVYNATIFPHNVGLGATRD 143
            P+  ASAK W++MVNE QKG+LS+RLGIPMIYGIDAVHGHNNVY ATIFPHN+GLGATRD
Sbjct: 677  PAPKASAKTWLNMVNEFQKGSLSTRLGIPMIYGIDAVHGHNNVYKATIFPHNIGLGATRD 736

Query: 144  PQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCYESYSEDPKIVQEMTEIILGL 203
            P LVK IG+ATALE+RATGIPYAFAPC+AVC+DPRWGRCYESYSED KIVQ MTEII GL
Sbjct: 737  PALVKKIGAATALEVRATGIPYAFAPCLAVCRDPRWGRCYESYSEDHKIVQAMTEIIPGL 796

Query: 204  QGEIPPNSRKGVPYVGGKDKVVGCAKHYVGDGGTTKGINENDTVIDRHSLLSIHMPGYYH 263
            QG+IP NSRKGVP+V GK  V  CAKHYVGDGGTT+GINEN+TVIDRH LLSIHMP YY+
Sbjct: 797  QGDIPSNSRKGVPFVAGKKNVAACAKHYVGDGGTTRGINENNTVIDRHGLLSIHMPAYYN 856

Query: 264  SIIKGIATVMASYSSWNGQKMHAHKELLTDFLKNTLNFKGFVISDWQGIDRITSPPHANY 323
            SIIKG++TVM SYSSWNG K HA+ E++T+FLK TL F+GFVISDW+GIDRITSPPHANY
Sbjct: 857  SIIKGVSTVMTSYSSWNGVKNHANHEMVTNFLKKTLRFRGFVISDWEGIDRITSPPHANY 916

Query: 324  TYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNIIPMSRIDDAVWRILRVKFVMGLFE 383
            TYSI+AS+ AG+DMIM+P +YKEFID +TYLVKN  IPMSRIDDAV RILRVKFVMGLFE
Sbjct: 917  TYSILASINAGLDMIMVPNNYKEFIDGLTYLVKNKFIPMSRIDDAVKRILRVKFVMGLFE 976

Query: 384  NPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKSTSTPLLPLPKKAQKILVAGTHAN 443
            +PLAD SLV+++G +EHRELAREAVRKSLVLLKNG S   PLLPLPKKA KILVAG+HAN
Sbjct: 977  DPLADDSLVDQLGSQEHRELAREAVRKSLVLLKNGDSADAPLLPLPKKAPKILVAGSHAN 1036

Query: 444  NLGYQCGGWTIEWQGASGNNLTSGTTVLDAIKETVGPETEVAFEEKPNKESLQSHEFSYG 503
            NLGYQCGGWTIEWQG  GNN+T GTT+L AIK+TV P+T+V ++EKP+ E ++S++FSY 
Sbjct: 1037 NLGYQCGGWTIEWQGQGGNNITDGTTILTAIKKTVDPKTKVVYKEKPDAEFVKSNDFSYA 1096

Query: 504  IVVVGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMKCVVILISGRPVVIEPYISSMDAL 563
            IVVVGE+PYAETNGDSLNLTIP+PGPSTI +VCGA+KCVV++ISGRPVVI+PY+  +DA+
Sbjct: 1097 IVVVGEHPYAETNGDSLNLTIPEPGPSTIGNVCGAVKCVVVVISGRPVVIQPYVRYIDAI 1156

Query: 564  VAAWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQLPMNFGDPHYDPLFSFGYGLTTE 623
            VAAWLPG+EG+G+ DVLFGDYGFTGKL  TWFKTVDQLPM+ GD HYDPLF FG+GLTT+
Sbjct: 1157 VAAWLPGSEGQGVADVLFGDYGFTGKLSFTWFKTVDQLPMHVGDSHYDPLFPFGFGLTTK 1216

Query: 624  PIKA 628
            P KA
Sbjct: 1217 PTKA 1220

BLAST of Cp4.1LG08g06820 vs. TrEMBL
Match: U5FEM9_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0019s05340g PE=4 SV=1)

HSP 1 Score: 995.0 bits (2571), Expect = 4.2e-287
Identity = 475/626 (75.88%), Postives = 543/626 (86.74%), Query Frame = 1

Query: 1   MTKVLIVLMGFFLMFLSETLGTGEQLKYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIER 60
           M ++ I LMG  +++ +  L   E + YKD TKPLN RIKDL+ RMT+EEKIGQM QIER
Sbjct: 1   MARIPIFLMGLVVIWAA--LAEAEYMIYKDATKPLNSRIKDLMSRMTLEEKIGQMTQIER 60

Query: 61  VNASADVMKNYFIGSVLSGGGSAPSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAV 120
             ASA+VMK+YFIGSVLSGGGS PSK ASA+ W++MVNE+QKGALS+RLGIPMIYGIDAV
Sbjct: 61  GVASAEVMKDYFIGSVLSGGGSVPSKQASAETWINMVNELQKGALSTRLGIPMIYGIDAV 120

Query: 121 HGHNNVYNATIFPHNVGLGATRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWG 180
           HGHNNVY ATIFPHNVGLGATRDP LVK IG+ATALE+RATGIPY FAPCIAVC+DPRWG
Sbjct: 121 HGHNNVYKATIFPHNVGLGATRDPNLVKRIGAATALEVRATGIPYVFAPCIAVCRDPRWG 180

Query: 181 RCYESYSEDPKIVQEMTEIILGLQGEIPPNSRKGVPYVGGKDKVVGCAKHYVGDGGTTKG 240
           RCYESYSEDPK+VQ MTEI+ GLQG+IP NS KGVP+V GK KV  CAKHYVGDGGTTKG
Sbjct: 181 RCYESYSEDPKLVQAMTEIVSGLQGDIPANSSKGVPFVAGKTKVAACAKHYVGDGGTTKG 240

Query: 241 INENDTVIDRHSLLSIHMPGYYHSIIKGIATVMASYSSWNGQKMHAHKELLTDFLKNTLN 300
           INEN+T I RH LLSIHMPGYY+SIIKG++TVM SYSSWNG KMHA+++++T FLKN L 
Sbjct: 241 INENNTQISRHGLLSIHMPGYYNSIIKGVSTVMVSYSSWNGVKMHANRDMVTGFLKNILR 300

Query: 301 FKGFVISDWQGIDRITSPPHANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNII 360
           FKGFVISDW+GIDRITSPPHANY+YSI A ++AG+DMIM+P +YKEFID +T  VKN +I
Sbjct: 301 FKGFVISDWEGIDRITSPPHANYSYSIQAGISAGIDMIMVPNNYKEFIDGLTSHVKNKVI 360

Query: 361 PMSRIDDAVWRILRVKFVMGLFENPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKS 420
           PMSRIDDAV RILRVKF MGLFENPLAD SLVNE+G +EHRELAREAVRKSLVLLKNG+S
Sbjct: 361 PMSRIDDAVTRILRVKFTMGLFENPLADNSLVNELGSQEHRELAREAVRKSLVLLKNGES 420

Query: 421 TSTPLLPLPKKAQKILVAGTHANNLGYQCGGWTIEWQGASGNNLTSGTTVLDAIKETVGP 480
            + PLLPLPKKA KILVAG+HA+NLGYQCGGWTIEWQG  GNNLTSGTT+L AIK TV P
Sbjct: 421 AAEPLLPLPKKATKILVAGSHADNLGYQCGGWTIEWQGLGGNNLTSGTTILTAIKNTVDP 480

Query: 481 ETEVAFEEKPNKESLQSHEFSYGIVVVGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMK 540
            TEV ++E P+ + ++S+ FSY IVVVGE PYAET GDSLNLTI +PGPSTI +VCG +K
Sbjct: 481 STEVVYKENPDADFVKSNNFSYAIVVVGEPPYAETFGDSLNLTISEPGPSTIQNVCGTVK 540

Query: 541 CVVILISGRPVVIEPYISSMDALVAAWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQ 600
           CV ++ISGRPVVI+PY+S MDALVAAWLPG+EG+G+ D LFGDYGFTG L RTWFKTVDQ
Sbjct: 541 CVTVIISGRPVVIQPYVSLMDALVAAWLPGSEGQGVADALFGDYGFTGTLSRTWFKTVDQ 600

Query: 601 LPMNFGDPHYDPLFSFGYGLTTEPIK 627
           LPMN GD HYDPLF FG+GL+T+P K
Sbjct: 601 LPMNIGDQHYDPLFPFGFGLSTKPTK 624

BLAST of Cp4.1LG08g06820 vs. TAIR10
Match: AT5G20950.1 (AT5G20950.1 Glycosyl hydrolase family protein)

HSP 1 Score: 919.8 bits (2376), Expect = 8.8e-268
Identity = 435/626 (69.49%), Postives = 527/626 (84.19%), Query Frame = 1

Query: 1   MTKVLIVLMGFFLMFLSETLGTGEQLKYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIER 60
           ++KVL +++   ++  +E  GT   LKYKDP +PL  RI+DL+ RMT++EKIGQMVQIER
Sbjct: 4   LSKVLCLMLLCCIVAAAE--GT---LKYKDPKQPLGARIRDLMNRMTLQEKIGQMVQIER 63

Query: 61  VNASADVMKNYFIGSVLSGGGSAPSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAV 120
             A+ +VMK YFIGSVLSGGGS PS+ A+ + WV+MVNEIQK +LS+RLGIPMIYGIDAV
Sbjct: 64  SVATPEVMKKYFIGSVLSGGGSVPSEKATPETWVNMVNEIQKASLSTRLGIPMIYGIDAV 123

Query: 121 HGHNNVYNATIFPHNVGLGATRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWG 180
           HGHNNVY ATIFPHNVGLG TRDP LVK IG+ATALE+RATGIPYAFAPCIAVC+DPRWG
Sbjct: 124 HGHNNVYGATIFPHNVGLGVTRDPNLVKRIGAATALEVRATGIPYAFAPCIAVCRDPRWG 183

Query: 181 RCYESYSEDPKIVQEMTEIILGLQGEIPPNSRKGVPYVGGKDKVVGCAKHYVGDGGTTKG 240
           RCYESYSED +IVQ+MTEII GLQG++P   RKGVP+VGGK KV  CAKH+VGDGGT +G
Sbjct: 184 RCYESYSEDYRIVQQMTEIIPGLQGDLP-TKRKGVPFVGGKTKVAACAKHFVGDGGTVRG 243

Query: 241 INENDTVIDRHSLLSIHMPGYYHSIIKGIATVMASYSSWNGQKMHAHKELLTDFLKNTLN 300
           I+EN+TVID   L  IHMPGYY+++ KG+AT+M SYS+WNG +MHA+KEL+T FLKN L 
Sbjct: 244 IDENNTVIDSKGLFGIHMPGYYNAVNKGVATIMVSYSAWNGLRMHANKELVTGFLKNKLK 303

Query: 301 FKGFVISDWQGIDRITSPPHANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNII 360
           F+GFVISDWQGIDRIT+PPH NY+YS+ A ++AG+DMIM+PY+Y EFID+I+  ++  +I
Sbjct: 304 FRGFVISDWQGIDRITTPPHLNYSYSVYAGISAGIDMIMVPYNYTEFIDEISSQIQKKLI 363

Query: 361 PMSRIDDAVWRILRVKFVMGLFENPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKS 420
           P+SRIDDA+ RILRVKF MGLFE PLAD S  N++G KEHRELAREAVRKSLVLLKNGK+
Sbjct: 364 PISRIDDALKRILRVKFTMGLFEEPLADLSFANQLGSKEHRELAREAVRKSLVLLKNGKT 423

Query: 421 TSTPLLPLPKKAQKILVAGTHANNLGYQCGGWTIEWQGASGNNLTSGTTVLDAIKETVGP 480
            + PLLPLPKK+ KILVAG HA+NLGYQCGGWTI WQG +GN+ T GTT+L A+K TV P
Sbjct: 424 GAKPLLPLPKKSGKILVAGAHADNLGYQCGGWTITWQGLNGNDHTVGTTILAAVKNTVAP 483

Query: 481 ETEVAFEEKPNKESLQSHEFSYGIVVVGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMK 540
            T+V + + P+   ++S +F Y IVVVGE PYAE  GD+ NLTI DPGPS I +VCG++K
Sbjct: 484 TTQVVYSQNPDANFVKSGKFDYAIVVVGEPPYAEMFGDTTNLTISDPGPSIIGNVCGSVK 543

Query: 541 CVVILISGRPVVIEPYISSMDALVAAWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQ 600
           CVV+++SGRPVVI+PY+S++DALVAAWLPGTEG+G+ D LFGDYGFTGKL RTWFK+V Q
Sbjct: 544 CVVVVVSGRPVVIQPYVSTIDALVAAWLPGTEGQGVADALFGDYGFTGKLARTWFKSVKQ 603

Query: 601 LPMNFGDPHYDPLFSFGYGLTTEPIK 627
           LPMN GD HYDPL+ FG+GLTT+P K
Sbjct: 604 LPMNVGDRHYDPLYPFGFGLTTKPYK 623

BLAST of Cp4.1LG08g06820 vs. TAIR10
Match: AT5G04885.1 (AT5G04885.1 Glycosyl hydrolase family protein)

HSP 1 Score: 905.6 bits (2339), Expect = 1.7e-263
Identity = 417/605 (68.93%), Postives = 511/605 (84.46%), Query Frame = 1

Query: 21  GTGEQLKYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIERVNASADVMKNYFIGSVLSGG 80
           G GE L YKDP + ++ R+ DL GRMT+EEKIGQMVQI+R  A+ ++M++YFIGSVLSGG
Sbjct: 23  GDGEYLLYKDPKQTVSDRVADLFGRMTLEEKIGQMVQIDRSVATVNIMRDYFIGSVLSGG 82

Query: 81  GSAPSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAVHGHNNVYNATIFPHNVGLGA 140
           GSAP   ASA++WVDM+NE QKGAL SRLGIPMIYGIDAVHGHNNVYNATIFPHNVGLGA
Sbjct: 83  GSAPLPEASAQNWVDMINEYQKGALVSRLGIPMIYGIDAVHGHNNVYNATIFPHNVGLGA 142

Query: 141 TRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCYESYSEDPKIVQEMTEII 200
           TRDP LVK IG+ATA+E+RATGIPY FAPCIAVC+DPRWGRCYESYSED K+V++MT++I
Sbjct: 143 TRDPDLVKRIGAATAVEVRATGIPYTFAPCIAVCRDPRWGRCYESYSEDHKVVEDMTDVI 202

Query: 201 LGLQGEIPPNSRKGVPYVGGKDKVVGCAKHYVGDGGTTKGINENDTVIDRHSLLSIHMPG 260
           LGLQGE P N + GVP+VGG+DKV  CAKHYVGDGGTT+G+NEN+TV D H LLS+HMP 
Sbjct: 203 LGLQGEPPSNYKHGVPFVGGRDKVAACAKHYVGDGGTTRGVNENNTVTDLHGLLSVHMPA 262

Query: 261 YYHSIIKGIATVMASYSSWNGQKMHAHKELLTDFLKNTLNFKGFVISDWQGIDRITSPPH 320
           Y  ++ KG++TVM SYSSWNG+KMHA+ EL+T +LK TL FKGFVISDWQG+D+I++PPH
Sbjct: 263 YADAVYKGVSTVMVSYSSWNGEKMHANTELITGYLKGTLKFKGFVISDWQGVDKISTPPH 322

Query: 321 ANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNIIPMSRIDDAVWRILRVKFVMG 380
            +YT S+ A++ AG+DM+M+P+++ EF++ +T LVKNN IP++RIDDAV RIL VKF MG
Sbjct: 323 THYTASVRAAIQAGIDMVMVPFNFTEFVNDLTTLVKNNSIPVTRIDDAVRRILLVKFTMG 382

Query: 381 LFENPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKSTSTPLLPLPKKAQKILVAGT 440
           LFENPLADYS  +E+G + HR+LAREAVRKSLVLLKNG  T+ P+LPLP+K  KILVAGT
Sbjct: 383 LFENPLADYSFSSELGSQAHRDLAREAVRKSLVLLKNGNKTN-PMLPLPRKTSKILVAGT 442

Query: 441 HANNLGYQCGGWTIEWQGASGNNLTSGTTVLDAIKETVGPETEVAFEEKPNKESLQSHEF 500
           HA+NLGYQCGGWTI WQG SGN  T GTT+L A+K  V   TEV F E P+ E ++S+ F
Sbjct: 443 HADNLGYQCGGWTITWQGFSGNKNTRGTTLLSAVKSAVDQSTEVVFRENPDAEFIKSNNF 502

Query: 501 SYGIVVVGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMKCVVILISGRPVVIEPYISSM 560
           +Y I+ VGE PYAET GDS  LT+ DPGP+ I+  C A+KCVV++ISGRP+V+EPY++S+
Sbjct: 503 AYAIIAVGEPPYAETAGDSDKLTMLDPGPAIISSTCQAVKCVVVVISGRPLVMEPYVASI 562

Query: 561 DALVAAWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQLPMNFGDPHYDPLFSFGYGL 620
           DALVAAWLPGTEG+GITD LFGD+GF+GKLP TWF+  +QLPM++GD HYDPLF++G GL
Sbjct: 563 DALVAAWLPGTEGQGITDALFGDHGFSGKLPVTWFRNTEQLPMSYGDTHYDPLFAYGSGL 622

Query: 621 TTEPI 626
            TE +
Sbjct: 623 ETESV 626

BLAST of Cp4.1LG08g06820 vs. TAIR10
Match: AT5G20940.1 (AT5G20940.1 Glycosyl hydrolase family protein)

HSP 1 Score: 863.6 bits (2230), Expect = 7.5e-251
Identity = 411/598 (68.73%), Postives = 495/598 (82.78%), Query Frame = 1

Query: 27  KYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIERVNASADVMKNYFIGSVLSGGGSAPSK 86
           KYKDP +PL VRIK+L+  MT+EEKIGQMVQ+ERVNA+ +VM+ YF+GSV SGGGS P  
Sbjct: 31  KYKDPKEPLGVRIKNLMSHMTLEEKIGQMVQVERVNATTEVMQKYFVGSVFSGGGSVPKP 90

Query: 87  NASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAVHGHNNVYNATIFPHNVGLGATRDPQL 146
               + WV+MVNE+QK ALS+RLGIP+IYGIDAVHGHN VYNATIFPHNVGLG TRDP L
Sbjct: 91  YIGPEAWVNMVNEVQKKALSTRLGIPIIYGIDAVHGHNTVYNATIFPHNVGLGVTRDPGL 150

Query: 147 VKNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCYESYSEDPKIVQEMTEIILGLQGE 206
           VK IG ATALE+RATGI Y FAPCIAVC+DPRWGRCYESYSED KIVQ+MTEII GLQG+
Sbjct: 151 VKRIGEATALEVRATGIQYVFAPCIAVCRDPRWGRCYESYSEDHKIVQQMTEIIPGLQGD 210

Query: 207 IPPNSRKGVPYVGGKDKVVGCAKHYVGDGGTTKGINENDTVIDRHSLLSIHMPGYYHSII 266
           +P   +KGVP+V GK KV  CAKH+VGDGGT +G+N N+TVI+ + LL IHMP Y+ ++ 
Sbjct: 211 LP-TGQKGVPFVAGKTKVAACAKHFVGDGGTLRGMNANNTVINSNGLLGIHMPAYHDAVN 270

Query: 267 KGIATVMASYSSWNGQKMHAHKELLTDFLKNTLNFKGFVISDWQGIDRITSPPHANYTYS 326
           KG+ATVM SYSS NG KMHA+K+L+T FLKN L F+G VISD+ G+D+I +P  ANY++S
Sbjct: 271 KGVATVMVSYSSINGLKMHANKKLITGFLKNKLKFRGIVISDYLGVDQINTPLGANYSHS 330

Query: 327 IIASVTAGVDMIMIPYDYKEFIDKITYLVKNNIIPMSRIDDAVWRILRVKFVMGLFENPL 386
           + A+ TAG+DM M   +  + ID++T  VK   IPMSRIDDAV RILRVKF MGLFENP+
Sbjct: 331 VYAATTAGLDMFMGSSNLTKLIDELTSQVKRKFIPMSRIDDAVKRILRVKFTMGLFENPI 390

Query: 387 ADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKSTSTPLLPLPKKAQKILVAGTHANNLG 446
           AD+SL  ++G KEHRELAREAVRKSLVLLKNG++   PLLPLPKKA KILVAGTHA+NLG
Sbjct: 391 ADHSLAKKLGSKEHRELAREAVRKSLVLLKNGENADKPLLPLPKKANKILVAGTHADNLG 450

Query: 447 YQCGGWTIEWQGASGNNLTSGTTVLDAIKETVGPETEVAFEEKPNKESLQSHEFSYGIVV 506
           YQCGGWTI WQG +GNNLT GTT+L A+K+TV P+T+V + + P+   +++ +F Y IV 
Sbjct: 451 YQCGGWTITWQGLNGNNLTIGTTILAAVKKTVDPKTQVIYNQNPDTNFVKAGDFDYAIVA 510

Query: 507 VGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMKCVVILISGRPVVIEPYISSMDALVAA 566
           VGE PYAE  GDS NLTI +PGPSTI +VC ++KCVV+++SGRPVV++  IS++DALVAA
Sbjct: 511 VGEKPYAEGFGDSTNLTISEPGPSTIGNVCASVKCVVVVVSGRPVVMQ--ISNIDALVAA 570

Query: 567 WLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQLPMNFGDPHYDPLFSFGYGLTTEP 625
           WLPGTEG+G+ DVLFGDYGFTGKL RTWFKTVDQLPMN GDPHYDPL+ FG+GL T+P
Sbjct: 571 WLPGTEGQGVADVLFGDYGFTGKLARTWFKTVDQLPMNVGDPHYDPLYPFGFGLITKP 625

BLAST of Cp4.1LG08g06820 vs. TAIR10
Match: AT3G47000.1 (AT3G47000.1 Glycosyl hydrolase family protein)

HSP 1 Score: 736.1 bits (1899), Expect = 1.8e-212
Identity = 353/600 (58.83%), Postives = 453/600 (75.50%), Query Frame = 1

Query: 28  YKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIERVNASADVMKNYFIGSVLSGGGSAPSKN 87
           YK+   P+  R+KDLL RMT+ EKIGQM QIER  AS     ++FIGSVL+ GGS P ++
Sbjct: 10  YKNGDAPVEARVKDLLSRMTLPEKIGQMTQIERRVASPSAFTDFFIGSVLNAGGSVPFED 69

Query: 88  ASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAVHGHNNVYNATIFPHNVGLGATRDPQLV 147
           A + DW DM++  Q+ AL+SRLGIP+IYG DAVHG+NNVY AT+FPHN+GLGATRD  LV
Sbjct: 70  AKSSDWADMIDGFQRSALASRLGIPIIYGTDAVHGNNNVYGATVFPHNIGLGATRDADLV 129

Query: 148 KNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCYESYSEDPKIVQEMTEIILGLQGEI 207
           + IG+ATALE+RA+G+ +AF+PC+AV +DPRWGRCYESY EDP++V EMT ++ GLQG  
Sbjct: 130 RRIGAATALEVRASGVHWAFSPCVAVLRDPRWGRCYESYGEDPELVCEMTSLVSGLQGVP 189

Query: 208 PPNSRKGVPYVGGKDKVVGCAKHYVGDGGTTKGINENDTVIDRHSLLSIHMPGYYHSIIK 267
           P     G P+V G++ VV C KH+VGDGGT KGINE +T+     L  IH+P Y   + +
Sbjct: 190 PEEHPNGYPFVAGRNNVVACVKHFVGDGGTDKGINEGNTIASYEELEKIHIPPYLKCLAQ 249

Query: 268 GIATVMASYSSWNGQKMHAHKELLTDFLKNTLNFKGFVISDWQGIDRITSPPHANYTYSI 327
           G++TVMASYSSWNG ++HA + LLT+ LK  L FKGF++SDW+G+DR++ P  +NY Y I
Sbjct: 250 GVSTVMASYSSWNGTRLHADRFLLTEILKEKLGFKGFLVSDWEGLDRLSEPQGSNYRYCI 309

Query: 328 IASVTAGVDMIMIPYDYKEFIDKITYLVKNNIIPMSRIDDAVWRILRVKFVMGLFENPLA 387
             +V AG+DM+M+P+ Y++FI  +T LV++  IPM+RI+DAV RILRVKFV GLF +PL 
Sbjct: 310 KTAVNAGIDMVMVPFKYEQFIQDMTDLVESGEIPMARINDAVERILRVKFVAGLFGHPLT 369

Query: 388 DYSLVNEIGKKEHRELAREAVRKSLVLLKNGKSTSTPLLPLPKKAQKILVAGTHANNLGY 447
           D SL+  +G KEHRELA+EAVRKSLVLLK+GK+   P LPL + A++ILV GTHA++LGY
Sbjct: 370 DRSLLPTVGCKEHRELAQEAVRKSLVLLKSGKNADKPFLPLDRNAKRILVTGTHADDLGY 429

Query: 448 QCGGWTIEWQGASGNNLTSGTTVLDAIKETVGPETEVAFEEKPNKESLQSHE-FSYGIVV 507
           QCGGWT  W G SG  +T GTT+LDAIKE VG ETEV +E+ P+KE+L S E FSY IV 
Sbjct: 430 QCGGWTKTWFGLSG-RITIGTTLLDAIKEAVGDETEVIYEKTPSKETLASSEGFSYAIVA 489

Query: 508 VGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMKCVVILISGRPVVIEP-YISSMDALVA 567
           VGE PYAET GD+  L IP  G   +T V   +  +VILISGRPVV+EP  +   +ALVA
Sbjct: 490 VGEPPYAETMGDNSELRIPFNGTDIVTAVAEIIPTLVILISGRPVVLEPTVLEKTEALVA 549

Query: 568 AWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQLPMNFGDPHYDPLFSFGYGLTTEPI 626
           AWLPGTEG+G+ DV+FGDY F GKLP +WFK V+ LP++     YDPLF FG+GL ++P+
Sbjct: 550 AWLPGTEGQGVADVVFGDYDFKGKLPVSWFKHVEHLPLDAHANSYDPLFPFGFGLNSKPV 608

BLAST of Cp4.1LG08g06820 vs. TAIR10
Match: AT3G47010.1 (AT3G47010.1 Glycosyl hydrolase family protein)

HSP 1 Score: 708.4 bits (1827), Expect = 4.0e-204
Identity = 348/597 (58.29%), Postives = 443/597 (74.20%), Query Frame = 1

Query: 28  YKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIERVNASADVMKNYFIGSVLSGGGSAPSKN 87
           YK+   P+  R+KDLL RMT+ EKIGQM QIER  AS  V+ N FIGSV SG GS P ++
Sbjct: 11  YKNRDAPVEARVKDLLSRMTLPEKIGQMTQIERSVASPQVITNSFIGSVQSGAGSWPLED 70

Query: 88  ASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAVHGHNNVYNATIFPHNVGLGATRDPQLV 147
           A + DW DM++  Q+ AL+SRLGIP+IYG DAVHG+NNVY AT+FPHN+GLGATRD  LV
Sbjct: 71  AKSSDWADMIDGFQRSALASRLGIPIIYGTDAVHGNNNVYGATVFPHNIGLGATRDADLV 130

Query: 148 KNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCYESYSEDPKIVQEMTEIILGLQGEI 207
           K IG+ATALEIRA+G+ + FAPC+AV  DPRWGRCYESYSE  KIV EM+ +I GLQGE 
Sbjct: 131 KRIGAATALEIRASGVHWTFAPCVAVLGDPRWGRCYESYSEAAKIVCEMSLLISGLQGEP 190

Query: 208 PPNSRKGVPYVGGKDKVVGCAKHYVGDGGTTKGINENDTVIDRHSLLSIHMPGYYHSIIK 267
           P     G P++ G++ V+ CAKH+VGDGGT KG++E +T+     L  IH+  Y + I +
Sbjct: 191 PEEHPYGYPFLAGRNNVIACAKHFVGDGGTEKGLSEGNTITSYEDLEKIHVAPYLNCIAQ 250

Query: 268 GIATVMASYSSWNGQKMHAHKELLTDFLKNTLNFKGFVISDWQGIDRITSPPHANYTYSI 327
           G++TVMAS+SSWNG ++H+   LLT+ LK  L FKGF++SDW G++ I+ P  +NY   +
Sbjct: 251 GVSTVMASFSSWNGSRLHSDYFLLTEVLKQKLGFKGFLVSDWDGLETISEPEGSNYRNCV 310

Query: 328 IASVTAGVDMIMIPYDYKEFIDKITYLVKNNIIPMSRIDDAVWRILRVKFVMGLFENPLA 387
              + AG+DM+M+P+ Y++FI  +T LV++  IPM+R++DAV RILRVKFV GLFE+PLA
Sbjct: 311 KLGINAGIDMVMVPFKYEQFIQDMTDLVESGEIPMARVNDAVERILRVKFVAGLFEHPLA 370

Query: 388 DYSLVNEIGKKEHRELAREAVRKSLVLLKNGKSTSTPLLPLPKKAQKILVAGTHANNLGY 447
           D SL+  +G KEHRE+AREAVRKSLVLLKNGK+  TP LPL + A++ILV G HAN+LG 
Sbjct: 371 DRSLLGTVGCKEHREVAREAVRKSLVLLKNGKNADTPFLPLDRNAKRILVVGMHANDLGN 430

Query: 448 QCGGWTIEWQGASGNNLTSGTTVLDAIKETVGPETEVAFEEKPNKESLQSHE-FSYGIVV 507
           QCGGWT    G SG  +T GTT+LD+IK  VG +TEV FE+ P KE+L S + FSY IV 
Sbjct: 431 QCGGWTKIKSGQSG-RITIGTTLLDSIKAAVGDKTEVIFEKTPTKETLASSDGFSYAIVA 490

Query: 508 VGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMKCVVILISGRPVVIEP-YISSMDALVA 567
           VGE PYAE  GD+  LTIP  G + IT V   +  +VIL SGRP+V+EP  +   +ALVA
Sbjct: 491 VGEPPYAEMKGDNSELTIPFNGNNIITAVAEKIPTLVILFSGRPMVLEPTVLEKTEALVA 550

Query: 568 AWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQLPMNFGDPHYDPLFSFGYGLTT 623
           AW PGTEG+G++DV+FGDY F GKLP +WFK VDQLP+N     YDPLF  G+GLT+
Sbjct: 551 AWFPGTEGQGMSDVIFGDYDFKGKLPVSWFKRVDQLPLNAEANSYDPLFPLGFGLTS 606

BLAST of Cp4.1LG08g06820 vs. NCBI nr
Match: gi|659086037|ref|XP_008443733.1| (PREDICTED: lysosomal beta glucosidase-like [Cucumis melo])

HSP 1 Score: 1080.1 bits (2792), Expect = 0.0e+00
Identity = 517/626 (82.59%), Postives = 565/626 (90.26%), Query Frame = 1

Query: 1   MTKVLIVLMGFFLMFLSETLGTGEQLKYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIER 60
           M K + +L+G  L+   ET    E LKYKDP +PLNVRIKDLLGRMT+EEKIGQM QIER
Sbjct: 1   MAKAINILIGLLLLCFFETWAKAENLKYKDPKQPLNVRIKDLLGRMTLEEKIGQMTQIER 60

Query: 61  VNASADVMKNYFIGSVLSGGGSAPSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAV 120
           VNAS DVMK YFIGSVLSGGGS PSK ASA+DWV MVNEIQ+GALS+RLGIPMIYGIDAV
Sbjct: 61  VNASTDVMKKYFIGSVLSGGGSVPSKEASAQDWVQMVNEIQQGALSTRLGIPMIYGIDAV 120

Query: 121 HGHNNVYNATIFPHNVGLGATRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWG 180
           HGHNNVYNATIFPHN+GLGATRDPQL+K IG A+ALEIRATGIPYAFAPCIAVC+DPRWG
Sbjct: 121 HGHNNVYNATIFPHNIGLGATRDPQLLKRIGEASALEIRATGIPYAFAPCIAVCRDPRWG 180

Query: 181 RCYESYSEDPKIVQEMTEIILGLQGEIPPNSRKGVPYVGGKDKVVGCAKHYVGDGGTTKG 240
           RCYESY EDPK+VQEMTEII GLQGEIPPNSRKGVPYV GK+KVV CAKHYVGDGGTTKG
Sbjct: 181 RCYESYGEDPKLVQEMTEIIPGLQGEIPPNSRKGVPYVAGKEKVVACAKHYVGDGGTTKG 240

Query: 241 INENDTVIDRHSLLSIHMPGYYHSIIKGIATVMASYSSWNGQKMHAHKELLTDFLKNTLN 300
           I+EN+TVIDRH LLSIHMPGYYHSIIKG+ATVM SYSSWNG KMHA+KEL+TDFLKNTL+
Sbjct: 241 IDENNTVIDRHGLLSIHMPGYYHSIIKGVATVMVSYSSWNGVKMHANKELVTDFLKNTLH 300

Query: 301 FKGFVISDWQGIDRITSPPHANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNII 360
           F+GFVISDWQ IDRIT PPHANYTYSI+ASVTAG+DMIM+PY+Y EFID +TYLV NN I
Sbjct: 301 FQGFVISDWQAIDRITDPPHANYTYSILASVTAGLDMIMVPYNYTEFIDGLTYLVNNNFI 360

Query: 361 PMSRIDDAVWRILRVKFVMGLFENPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKS 420
           P++RIDDAV RILRVKF+MGLFENP+AD SLVNE+GK+EHRELAREAVRKSLVLLKNGKS
Sbjct: 361 PITRIDDAVKRILRVKFIMGLFENPIADLSLVNELGKQEHRELAREAVRKSLVLLKNGKS 420

Query: 421 TSTPLLPLPKKAQKILVAGTHANNLGYQCGGWTIEWQGASGNNLTSGTTVLDAIKETVGP 480
              PLLPL KK QKILVAG+HA+NLGYQCGGWTIEWQG SGNNLTSGTTVLDAIK+TV P
Sbjct: 421 ADKPLLPLEKKTQKILVAGSHADNLGYQCGGWTIEWQGLSGNNLTSGTTVLDAIKDTVDP 480

Query: 481 ETEVAFEEKPNKESLQSHEFSYGIVVVGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMK 540
            TEV F E P+K  LQS  FSY IVVVGE+PYAE  GDSLNLTIPDPGPSTIT+VCG +K
Sbjct: 481 STEVIFNENPDKGFLQSGTFSYAIVVVGEHPYAEMMGDSLNLTIPDPGPSTITNVCGVIK 540

Query: 541 CVVILISGRPVVIEPYISSMDALVAAWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQ 600
           CVV++ISGRPVVI+PY+ S+DALVAAWLPGTEGKGITDVLFGDYGFTGKL +TWFKTVDQ
Sbjct: 541 CVVVIISGRPVVIQPYVDSVDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTVDQ 600

Query: 601 LPMNFGDPHYDPLFSFGYGLTTEPIK 627
           LPMNFGD HYDPLF  G+GLTT+PIK
Sbjct: 601 LPMNFGDSHYDPLFPLGHGLTTQPIK 626

BLAST of Cp4.1LG08g06820 vs. NCBI nr
Match: gi|778665412|ref|XP_011648555.1| (PREDICTED: lysosomal beta glucosidase-like [Cucumis sativus])

HSP 1 Score: 1072.4 bits (2772), Expect = 2.9e-310
Identity = 511/622 (82.15%), Postives = 566/622 (91.00%), Query Frame = 1

Query: 6   IVLMGFFLMFLSETLGTGEQLKYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIERVNASA 65
           I+L+   L+   ET    E  KYKDPT+ LNVRIKDLLGRMT+EEKIGQMVQIERVNAS 
Sbjct: 6   IILIALLLICCFETGAKAENFKYKDPTQRLNVRIKDLLGRMTLEEKIGQMVQIERVNAST 65

Query: 66  DVMKNYFIGSVLSGGGSAPSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAVHGHNN 125
           +VMK YFIGSVLSGGGS PSK ASA+DW++MVNEIQKGALS+RLGIPMIYGIDAVHGHNN
Sbjct: 66  EVMKKYFIGSVLSGGGSVPSKQASAQDWINMVNEIQKGALSTRLGIPMIYGIDAVHGHNN 125

Query: 126 VYNATIFPHNVGLGATRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCYES 185
           VYNATIFPHN+GLGATRDPQL+K IG A+A EIRATGIPYAFAPC+AVC+DPRWGRCYES
Sbjct: 126 VYNATIFPHNIGLGATRDPQLLKRIGVASAREIRATGIPYAFAPCVAVCRDPRWGRCYES 185

Query: 186 YSEDPKIVQEMTEIILGLQGEIPPNSRKGVPYVGGKDKVVGCAKHYVGDGGTTKGINEND 245
           Y EDPKIVQEMTEII GLQGEIPPNSRKGVPYV GK+ VV CAKHYVGDGGTTKGI+EN+
Sbjct: 186 YGEDPKIVQEMTEIIPGLQGEIPPNSRKGVPYVAGKENVVACAKHYVGDGGTTKGIDENN 245

Query: 246 TVIDRHSLLSIHMPGYYHSIIKGIATVMASYSSWNGQKMHAHKELLTDFLKNTLNFKGFV 305
           TVIDRH LLSIHMPGYYHSIIKG+AT+M SYSSWNG+KMHA+K L+TDFLKNTL+F+GFV
Sbjct: 246 TVIDRHGLLSIHMPGYYHSIIKGVATIMVSYSSWNGEKMHANKNLVTDFLKNTLHFQGFV 305

Query: 306 ISDWQGIDRITSPPHANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNIIPMSRI 365
           ISDW+ IDRIT PPHANYTYSI+AS+TAG+DMIMIPY+Y EFID +T LVK+N IP+SRI
Sbjct: 306 ISDWEAIDRITDPPHANYTYSILASITAGLDMIMIPYNYPEFIDGLTNLVKSNYIPISRI 365

Query: 366 DDAVWRILRVKFVMGLFENPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKSTSTPL 425
           DDAV RILRVKFVMGLFENP+AD SLVNE+GK+EHRELAREAVRKSLVLLKNGKS   PL
Sbjct: 366 DDAVKRILRVKFVMGLFENPIADLSLVNELGKQEHRELAREAVRKSLVLLKNGKSADKPL 425

Query: 426 LPLPKKAQKILVAGTHANNLGYQCGGWTIEWQGASGNNLTSGTTVLDAIKETVGPETEVA 485
           LPL KK QKILVAG+HANNLGYQCGGWTIEWQG SGNNLTSGTTVLDAIK+TV P TEV 
Sbjct: 426 LPLEKKTQKILVAGSHANNLGYQCGGWTIEWQGLSGNNLTSGTTVLDAIKDTVDPTTEVI 485

Query: 486 FEEKPNKESLQSHEFSYGIVVVGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMKCVVIL 545
           F E P+K+SLQS  FSY IVVVGE+PYAE NGDSLNLTIPDPGP+TIT+VCG +KC V++
Sbjct: 486 FNENPDKKSLQSDTFSYAIVVVGEHPYAELNGDSLNLTIPDPGPNTITNVCGVIKCAVVI 545

Query: 546 ISGRPVVIEPYISSMDALVAAWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQLPMNF 605
           ISGRPVVI+PY+ S+DALVAAWLPGTEGKGITDVLFGDYGFTGKL +TWFKTVDQLPMNF
Sbjct: 546 ISGRPVVIQPYVDSIDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTVDQLPMNF 605

Query: 606 GDPHYDPLFSFGYGLTTEPIKA 628
           G+P+YDPLF FG+GLTT+PIK+
Sbjct: 606 GNPNYDPLFPFGHGLTTQPIKS 627

BLAST of Cp4.1LG08g06820 vs. NCBI nr
Match: gi|778685993|ref|XP_011652313.1| (PREDICTED: lysosomal beta glucosidase-like [Cucumis sativus])

HSP 1 Score: 1057.4 bits (2733), Expect = 1.0e-305
Identity = 503/624 (80.61%), Postives = 564/624 (90.38%), Query Frame = 1

Query: 4   VLIVLMGFFLMFLSETLGTGEQLKYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIERVNA 63
           VLI  +G  ++  SETL   E LKYKDP +PLNVRIKDLLGRMT+EEKIGQMVQIER NA
Sbjct: 6   VLITFVGLLVLCFSETLAKAEYLKYKDPKQPLNVRIKDLLGRMTLEEKIGQMVQIERANA 65

Query: 64  SADVMKNYFIGSVLSGGGSAPSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAVHGH 123
           SADVMK YFIGSVLSGGGSAPSK ASAKDWV MVN+IQ+ ALS+RLGIPMIYGIDAVHGH
Sbjct: 66  SADVMKQYFIGSVLSGGGSAPSKQASAKDWVHMVNKIQEAALSTRLGIPMIYGIDAVHGH 125

Query: 124 NNVYNATIFPHNVGLGATRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCY 183
           NNVYNATIFPHN+GLGATRDPQL+K IG+ATALE+RATGIPYAFAPCIAVC+DPRWGRCY
Sbjct: 126 NNVYNATIFPHNIGLGATRDPQLLKRIGAATALEVRATGIPYAFAPCIAVCRDPRWGRCY 185

Query: 184 ESYSEDPKIVQEMTEIILGLQGEIPPNSRKGVPYVGGKDKVVGCAKHYVGDGGTTKGINE 243
           ESY ED  IVQ MTEII GLQG++P N RKGVPYV GK+ V  CAKH+VGDGGTTKGINE
Sbjct: 186 ESYGEDHTIVQAMTEIIPGLQGDVPANIRKGVPYVAGKNNVAACAKHFVGDGGTTKGINE 245

Query: 244 NDTVIDRHSLLSIHMPGYYHSIIKGIATVMASYSSWNGQKMHAHKELLTDFLKNTLNFKG 303
           N+TV+D H L SIHMP YY+SIIKG+ATVM SYSS NG+KMHA+K+L+TDFLKNTL+FKG
Sbjct: 246 NNTVVDGHGLFSIHMPAYYNSIIKGVATVMVSYSSINGEKMHANKKLVTDFLKNTLHFKG 305

Query: 304 FVISDWQGIDRITSPPHANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNIIPMS 363
           FVISDWQGID+IT+PPHANYTYSI+ASV AGVDMIM+PY+Y EFID +TYLVKNN IP+S
Sbjct: 306 FVISDWQGIDKITTPPHANYTYSILASVNAGVDMIMVPYNYTEFIDGLTYLVKNNAIPIS 365

Query: 364 RIDDAVWRILRVKFVMGLFENPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKSTST 423
           RIDDAV RILRVKFVMGLFENPLAD SL+NE+GK+EHRELAREAVRKSLVLLKNGK  + 
Sbjct: 366 RIDDAVKRILRVKFVMGLFENPLADLSLINELGKQEHRELAREAVRKSLVLLKNGKLPNQ 425

Query: 424 PLLPLPKKAQKILVAGTHANNLGYQCGGWTIEWQGASGNNLTSGTTVLDAIKETVGPETE 483
           PLLPLPKKA KILVAGTHAN+LG QCGGWT+EWQG +GNNLTSGTT+L AIK+TV PETE
Sbjct: 426 PLLPLPKKAPKILVAGTHANDLGNQCGGWTMEWQGLTGNNLTSGTTILTAIKDTVDPETE 485

Query: 484 VAFEEKPNKESLQSHEFSYGIVVVGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMKCVV 543
           V F + PN E LQ+H+FSY IVVVGE+PYAETNGDSLNLTIP+PGP TI +VCGA+KCVV
Sbjct: 486 VVFHDNPNAEFLQTHQFSYAIVVVGEHPYAETNGDSLNLTIPEPGPETIKNVCGAVKCVV 545

Query: 544 ILISGRPVVIEPYISSMDALVAAWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQLPM 603
           ++ISGRPVV++PYI S+DA+VAAWLPGTEGKGI+DVLFGDYGFTGKL +TWFK+VDQLPM
Sbjct: 546 VVISGRPVVLQPYIDSIDAVVAAWLPGTEGKGISDVLFGDYGFTGKLSQTWFKSVDQLPM 605

Query: 604 NFGDPHYDPLFSFGYGLTTEPIKA 628
           NFGD HYDPLF FG+GLTT+P+KA
Sbjct: 606 NFGDAHYDPLFPFGFGLTTQPVKA 629

BLAST of Cp4.1LG08g06820 vs. NCBI nr
Match: gi|659130020|ref|XP_008464960.1| (PREDICTED: lysosomal beta glucosidase-like [Cucumis melo])

HSP 1 Score: 1056.6 bits (2731), Expect = 1.7e-305
Identity = 505/624 (80.93%), Postives = 561/624 (89.90%), Query Frame = 1

Query: 4   VLIVLMGFFLMFLSETLGTGEQLKYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIERVNA 63
           VLI  +G  ++  SETL   E LKYKDP +PLNVRIKDL GRMT+EEKIGQMVQIER NA
Sbjct: 5   VLITFVGLLVLCFSETLAKAEYLKYKDPKQPLNVRIKDLFGRMTLEEKIGQMVQIERANA 64

Query: 64  SADVMKNYFIGSVLSGGGSAPSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAVHGH 123
           S DVM+ YFIGSVLSGGGS PSKNASAK WV MVN+IQ+GALS+RLGIPMIYGIDA+HGH
Sbjct: 65  SMDVMRKYFIGSVLSGGGSVPSKNASAKTWVHMVNKIQEGALSTRLGIPMIYGIDAIHGH 124

Query: 124 NNVYNATIFPHNVGLGATRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWGRCY 183
           NNVYNATIFPHN+GLGATRDPQL+K IG ATALE+RATGIPYAFAPCIAVC+DPRWGRCY
Sbjct: 125 NNVYNATIFPHNIGLGATRDPQLIKRIGVATALEVRATGIPYAFAPCIAVCRDPRWGRCY 184

Query: 184 ESYSEDPKIVQEMTEIILGLQGEIPPNSRKGVPYVGGKDKVVGCAKHYVGDGGTTKGINE 243
           ESY ED KIVQ MTEII GLQG++P N RKGVPYV GK+ V  CAKH+VGDGGTTKGINE
Sbjct: 185 ESYGEDHKIVQAMTEIIPGLQGDLPSNIRKGVPYVAGKNNVAACAKHFVGDGGTTKGINE 244

Query: 244 NDTVIDRHSLLSIHMPGYYHSIIKGIATVMASYSSWNGQKMHAHKELLTDFLKNTLNFKG 303
           N+TVID H L SIHMP YY+SIIKG+AT+M SYSS NG+KMHA+K+L+TDFLKNTL+FKG
Sbjct: 245 NNTVIDGHGLFSIHMPAYYNSIIKGVATIMVSYSSVNGEKMHANKKLVTDFLKNTLHFKG 304

Query: 304 FVISDWQGIDRITSPPHANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNIIPMS 363
           FVISDWQGID+ITSPPHANYTYSI+ASV AGVDMIM+PY+Y EFID +TYLVKNN IP+S
Sbjct: 305 FVISDWQGIDKITSPPHANYTYSILASVNAGVDMIMVPYNYTEFIDALTYLVKNNAIPIS 364

Query: 364 RIDDAVWRILRVKFVMGLFENPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKSTST 423
           RIDDAV RILRVKFVMGLFENPLAD SLVNEIGK+EHRELAREAVRKSLVLLKNGK  + 
Sbjct: 365 RIDDAVKRILRVKFVMGLFENPLADLSLVNEIGKQEHRELAREAVRKSLVLLKNGKLPNQ 424

Query: 424 PLLPLPKKAQKILVAGTHANNLGYQCGGWTIEWQGASGNNLTSGTTVLDAIKETVGPETE 483
           PLLPLPKKA KILVAGTHAN+LG QCGGWTIEWQG +GNNLTSGTTVL AIK+TV PETE
Sbjct: 425 PLLPLPKKAPKILVAGTHANDLGNQCGGWTIEWQGLTGNNLTSGTTVLTAIKDTVDPETE 484

Query: 484 VAFEEKPNKESLQSHEFSYGIVVVGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMKCVV 543
           V F+  PN E L++H+FSY IVVVGE+PYAETNGDSLNLTIP+PGP TI +VCGA+KCVV
Sbjct: 485 VVFDNNPNAEFLKTHQFSYAIVVVGEHPYAETNGDSLNLTIPEPGPETIKNVCGAVKCVV 544

Query: 544 ILISGRPVVIEPYISSMDALVAAWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQLPM 603
           ++ISGRPVVI+PYI S+DALVAAWLPGTEGKGI+DVLFGDYGFTGKL +TWFK+VDQLPM
Sbjct: 545 VVISGRPVVIQPYIDSIDALVAAWLPGTEGKGISDVLFGDYGFTGKLSQTWFKSVDQLPM 604

Query: 604 NFGDPHYDPLFSFGYGLTTEPIKA 628
           NFGD HYDPLF  G+GLTT+P+KA
Sbjct: 605 NFGDAHYDPLFPLGFGLTTQPVKA 628

BLAST of Cp4.1LG08g06820 vs. NCBI nr
Match: gi|659130018|ref|XP_008464959.1| (PREDICTED: lysosomal beta glucosidase-like [Cucumis melo])

HSP 1 Score: 1042.7 bits (2695), Expect = 2.5e-301
Identity = 494/627 (78.79%), Postives = 557/627 (88.84%), Query Frame = 1

Query: 1   MTKVLIVLMGFFLMFLSETLGTGEQLKYKDPTKPLNVRIKDLLGRMTVEEKIGQMVQIER 60
           M K+LI  MGFF+  L+E       ++YKDP +PLNVRI DLLGRMT+EEKIGQMVQI+R
Sbjct: 1   MAKILIFFMGFFIFCLTEVWAKPRYMRYKDPKQPLNVRINDLLGRMTLEEKIGQMVQIDR 60

Query: 61  VNASADVMKNYFIGSVLSGGGSAPSKNASAKDWVDMVNEIQKGALSSRLGIPMIYGIDAV 120
             AS +VMK Y IGSVLSGGGS PSK AS K W+DMVN+ QKG+LS+RLGIPMIYGIDAV
Sbjct: 61  TVASKEVMKKYLIGSVLSGGGSVPSKEASPKVWIDMVNDFQKGSLSTRLGIPMIYGIDAV 120

Query: 121 HGHNNVYNATIFPHNVGLGATRDPQLVKNIGSATALEIRATGIPYAFAPCIAVCKDPRWG 180
           HGHNNVY ATIFPHNVGLGATRDP L K IG+ATALE+RATGI Y FAPCIAVC+DPRWG
Sbjct: 121 HGHNNVYKATIFPHNVGLGATRDPNLAKRIGAATALEVRATGISYVFAPCIAVCRDPRWG 180

Query: 181 RCYESYSEDPKIVQEMTEIILGLQGEIPPNSRKGVPYVGGKDKVVGCAKHYVGDGGTTKG 240
           RCYESYSEDPKIVQEMTEII GLQGEIP NSRKGVPYV G++KV  CAKHYVGDGGTTKG
Sbjct: 181 RCYESYSEDPKIVQEMTEIISGLQGEIPSNSRKGVPYVAGREKVAACAKHYVGDGGTTKG 240

Query: 241 INENDTVIDRHSLLSIHMPGYYHSIIKGIATVMASYSSWNGQKMHAHKELLTDFLKNTLN 300
           INEN+T+  RH LLSIHMPGYY+SIIKG++TVM SYSSWNG+KMH +++L+T FLKNTL 
Sbjct: 241 INENNTLASRHGLLSIHMPGYYNSIIKGVSTVMISYSSWNGKKMHENRDLITGFLKNTLR 300

Query: 301 FKGFVISDWQGIDRITSPPHANYTYSIIASVTAGVDMIMIPYDYKEFIDKITYLVKNNII 360
           F+GFVISDWQGIDRITSPPHANYTYSIIA +TAG+DMIM+PY+Y EFID +TYLVK N+I
Sbjct: 301 FRGFVISDWQGIDRITSPPHANYTYSIIAGITAGIDMIMVPYNYTEFIDGLTYLVKTNVI 360

Query: 361 PMSRIDDAVWRILRVKFVMGLFENPLADYSLVNEIGKKEHRELAREAVRKSLVLLKNGKS 420
           P+SRIDDAV RILRVKF+MGLFENPLAD S VNE+GKKEHRELAREAVRKSLVLLKNG+S
Sbjct: 361 PISRIDDAVKRILRVKFIMGLFENPLADSSFVNELGKKEHRELAREAVRKSLVLLKNGES 420

Query: 421 TSTPLLPLPKKAQKILVAGTHANNLGYQCGGWTIEWQGASGNNLTSGTTVLDAIKETVGP 480
              P+LPLPKK  KILVAG+HANNLG+QCGGWTIEWQG  GNNLTSGTT+L AIK+TV P
Sbjct: 421 ADKPILPLPKKVPKILVAGSHANNLGFQCGGWTIEWQGLGGNNLTSGTTILSAIKDTVDP 480

Query: 481 ETEVAFEEKPNKESLQSHEFSYGIVVVGEYPYAETNGDSLNLTIPDPGPSTITDVCGAMK 540
           +T+V F+E P+ E ++S++FSY IVVVGE+PYAET GDSLNLTIPDPG STIT+VCG +K
Sbjct: 481 KTKVVFKENPDIEFVKSNKFSYAIVVVGEHPYAETFGDSLNLTIPDPGSSTITNVCGVVK 540

Query: 541 CVVILISGRPVVIEPYISSMDALVAAWLPGTEGKGITDVLFGDYGFTGKLPRTWFKTVDQ 600
           CVVI+ISGRPVV++PYISS+DALVAAWLPGTEGKGI+DVLFGDYGF+GKL RTWFKTVDQ
Sbjct: 541 CVVIVISGRPVVLQPYISSIDALVAAWLPGTEGKGISDVLFGDYGFSGKLSRTWFKTVDQ 600

Query: 601 LPMNFGDPHYDPLFSFGYGLTTEPIKA 628
           LPMN GD HYDPLF FG+GLTT+PIKA
Sbjct: 601 LPMNVGDAHYDPLFPFGFGLTTDPIKA 627

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
BGH3B_BACO12.5e-7830.51Beta-glucosidase BoGH3B OS=Bacteroides ovatus (strain ATCC 8483 / DSM 1896 / JCM... [more]
GLUA_DICDI1.3e-7632.33Lysosomal beta glucosidase OS=Dictyostelium discoideum GN=gluA PE=1 SV=2[more]
BGLX_SALTY4.1e-5728.86Periplasmic beta-glucosidase OS=Salmonella typhimurium (strain LT2 / SGSC1412 / ... [more]
BGLX_ECOLI8.5e-5528.42Periplasmic beta-glucosidase OS=Escherichia coli (strain K12) GN=bglX PE=3 SV=2[more]
BGLC_ASPOR2.8e-4226.39Probable beta-glucosidase C OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40) G... [more]
Match NameE-valueIdentityDescription
A0A0A0LY55_CUCSA2.0e-31082.15Uncharacterized protein OS=Cucumis sativus GN=Csa_1G661750 PE=4 SV=1[more]
A0A0A0LI54_CUCSA6.9e-30680.61Uncharacterized protein OS=Cucumis sativus GN=Csa_3G842090 PE=4 SV=1[more]
A0A0A0LFL8_CUCSA3.0e-30178.79Uncharacterized protein OS=Cucumis sativus GN=Csa_3G842070 PE=4 SV=1[more]
A0A061F0I5_THECC1.8e-29078.48Glycosyl hydrolase family protein OS=Theobroma cacao GN=TCM_025896 PE=4 SV=1[more]
U5FEM9_POPTR4.2e-28775.88Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0019s05340g PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G20950.18.8e-26869.49 Glycosyl hydrolase family protein[more]
AT5G04885.11.7e-26368.93 Glycosyl hydrolase family protein[more]
AT5G20940.17.5e-25168.73 Glycosyl hydrolase family protein[more]
AT3G47000.11.8e-21258.83 Glycosyl hydrolase family protein[more]
AT3G47010.14.0e-20458.29 Glycosyl hydrolase family protein[more]
Match NameE-valueIdentityDescription
gi|659086037|ref|XP_008443733.1|0.0e+0082.59PREDICTED: lysosomal beta glucosidase-like [Cucumis melo][more]
gi|778665412|ref|XP_011648555.1|2.9e-31082.15PREDICTED: lysosomal beta glucosidase-like [Cucumis sativus][more]
gi|778685993|ref|XP_011652313.1|1.0e-30580.61PREDICTED: lysosomal beta glucosidase-like [Cucumis sativus][more]
gi|659130020|ref|XP_008464960.1|1.7e-30580.93PREDICTED: lysosomal beta glucosidase-like [Cucumis melo][more]
gi|659130018|ref|XP_008464959.1|2.5e-30178.79PREDICTED: lysosomal beta glucosidase-like [Cucumis melo][more]
The following terms have been associated with this gene:
Vocabulary: Biological Process
TermDefinition
GO:0005975carbohydrate metabolic process
Vocabulary: Molecular Function
TermDefinition
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
Vocabulary: INTERPRO
TermDefinition
IPR026892Glycoside hydrolase family 3
IPR017853Glycoside_hydrolase_SF
IPR002772Glyco_hydro_3_C
IPR001764Glyco_hydro_3_N
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG08g06820.1Cp4.1LG08g06820.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001764Glycoside hydrolase, family 3, N-terminalPRINTSPR00133GLHYDRLASE3coord: 108..124
score: 1.5E-25coord: 132..151
score: 1.5E-25coord: 294..312
score: 1.5E-25coord: 178..194
score: 1.5E-25coord: 224..240
score: 1.5
IPR001764Glycoside hydrolase, family 3, N-terminalGENE3DG3DSA:3.20.20.300coord: 26..391
score: 5.1E
IPR001764Glycoside hydrolase, family 3, N-terminalPFAMPF00933Glyco_hydro_3coord: 47..375
score: 3.7
IPR002772Glycoside hydrolase family 3 C-terminal domainGENE3DG3DSA:3.40.50.1700coord: 398..621
score: 1.4
IPR002772Glycoside hydrolase family 3 C-terminal domainPFAMPF01915Glyco_hydro_3_Ccoord: 412..621
score: 3.2
IPR002772Glycoside hydrolase family 3 C-terminal domainunknownSSF52279Beta-D-glucan exohydrolase, C-terminal domaincoord: 412..621
score: 3.92
IPR017853Glycoside hydrolase superfamilyunknownSSF51445(Trans)glycosidasescoord: 26..411
score: 3.58E
IPR026892Glycoside hydrolase family 3PANTHERPTHR30620PERIPLASMIC BETA-GLUCOSIDASE-RELATEDcoord: 81..625
score: 0.0coord: 1..62
score:
NoneNo IPR availablePANTHERPTHR30620:SF39SUBFAMILY NOT NAMEDcoord: 81..625
score: 0.0coord: 1..62
score:

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Cp4.1LG08g06820CmoCh06G009340Cucurbita moschata (Rifu)cmocpeB796
The following gene(s) are paralogous to this gene:

None