Cla022081 (gene) Watermelon (97103) v1

Name: Cla022081
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Beta-D-xylosidase (AHRD V1 ***- Q8W011_HORVU); contains InterPro domain(s) IPR001764 Glycoside hydrolase, family 3, N-terminal
Location: Chr8 : 20184494 .. 20190141 (-)
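
The Location field above can be read programmatically. The following is a minimal sketch, not part of the database record: `parse_location` is a hypothetical helper, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

# Location string as it appears in the record above.
location = "Chr8 : 20184494 .. 20190141 (-)"

def parse_location(text):
    """Split 'Chr : start .. end (strand)' into its parts (hypothetical helper)."""
    match = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location(location)

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, strand, span)
```

Under that assumption the feature spans 5648 bases on the minus strand of Chr8.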

The following sequences are available for this feature:

Gene sequence (with introns)

ATGGCTTCCTCCTCCTTCTTCCCTCGCAAAATGAAACTCCAAAAACTCCTTCTCTCCGCCGCCGTCTTCTCCGCCCTCCTCTCCCTCATCGTCGCCGACTCGTCGTCTCAGCTGCCGTACGCCTGCGACTCTTCCAACTCACTCACCAAAACTCTCCCATTCTGCAGAACCTCTCTGCCAATCAAACAAAGAGCCCGCGATCTCGTCTCTCGACTCACATTGGACGAGAAGGTCCTCCAGCTCGTCAACGCCGCTCCGGCGATCCCCCGTCTCGGCATCCCTGCCTACGAGTGGTGGTCGGAGGCCCTCCACGGCGTCGCCCACGTCGGCTACGGCATCCGCCTCAACGGCACTATCTCTGCTGCTACTAGCTTCCCTCAGGTCATCCTCACCGCCGCCTCCTTCGACGAAAACCTCTGGTACCAAATCGGACAGGTCGAATCAAATCCCTAACCAAACACTCTCCCCTGATTTTCTTCATTTCTTCATTATCCAATTTCTGTTCACAATACGAATGAAACATCTGATTCGCATTTAAGTTTTAGGGTTCCAATAGTTTACTGCTTGTTCATTAGTGTATCTATCGGATCCCATTTCCTTTCAAATTTGTGCTGATTTCTTAGACCACAAATCTAGGCTTAGTGACATCTTTAGTAGTAGTTTAACATGATTTAGAGAGTAATAGGCCCTCTATTTGAACTCTGGAATGTGATTTTCCTTGGGCATTGTGCTAACATACAACAACCCATTGAATATTGAATGAATCTGTATATGAATATGTGAAACAGGCGATAGGAACAGAGGCGAGGGCGGTGTACAATGCAGGGCAGGCGAAGGGAATGACATTTTGGGCGCCAAACATAAACATATTCAGAGATCCAAGATGGGGAAGAGGGCAAGAAACACCAGGAGAAGATCCATTGATGACGGCAAAATACTCAGTAGCATATGTGAGAGGGATTCAAGGGGACGCCATTGAAGGAGGGAAGCTCGGGAATCAACTCAAAGCTTCGGCATGCTGCAAACACTTCACTGCATACGATCTGGACCGGTGGAATGGGATGACTCGATATGTATTCGATGCCAAGGTAATATTGAATTGCGACGGAAAATTGGAATTGGAATTTGATTGATGGGTTTTTAGGGTTTGGAAAATGGATGTAGGTGACGATGCAGGACATGGCGGACACGTATCAGCCGCCATTTGAGAGCTGTGTGGAGAAGGGGAAAGCAAGTGGAATAATGTGCGCTTACAATAGGGTGAATGGAGTTCCCAGCTGTGCCGATCATCATCTTTTAACTGCTACTGCAAGAAAACAATGGAAGTTCAATGGGTTAGGGTTTCTTTTTCTTTTCTTTTCTCTTTATCCAATGCTTCACTCTCTGGCATTTATTGTTTCTGCATCTTTGTCCCTCCATTATTATAGTTGCTTTTGTTTTTTAGTGCTTCACTACTAGTATAGTATTCTTCTTTGGTTGTAATGCTATCTAACAGATTTGATGTTGGTGTGGTTATTGCATTCTTAGAATGAGTGTCGGAGGTAGCTTGCGAACTCAACTAGTCTTAAATTGAGTATTGGGACTTGGGAGTAGCTTACGAACTCGACTAATCTTAGAATGAGTATCGAGTCTAGCTTACTAACTCGCCTAAAAAGGAGTTTTGGGTCTAGTTTGCAAGTACAAACTCTACTAATTTTATAAGACAATTTACTTTACTTTTCAATATTTTAATATAAAATAAATTTATAAGATATTAAATTTAGGTAGAGACAATCCTAGATTGAACCTATTTTCTCTTTGACTTCTTTATTTAATTGGTGACTTTTATTGATCATTAGGTCCAATGATTTTGAAAATCTAATTTCGGTTCTAATCATTAATTTATTATATTTTTTTTGAAAACAAATCAATGGACCAATCAACCAAAAAAAAAATCAACTTTAGATTTTTAAGATTTTTTTTTAAAAACAAAAACTTAATTATTTTCAACTGTTTTTCTA
TTTTTAAAAATATTCTATACTTTTAGAAAATGACAGGTTGTTGGAAAGGTCAAAACCTCTATATATATATATAGACTAAAAGTTTAGAAACTTATTAAATATAAAACAAAAAAGGAGAGAGATTAATACTATTTGTCGTTATTTTAGTGTATTCTCAAAAAAGGTTATGTATATCAATATCTAAGTTATTAATTACAATTTTTTAAAAAATATTTTTTTTTTCTCTTGAGAAAAATGAAGATATAATTTCCTTTTTTTTAAAAAAATGACAATAATTTTGACCTTTCGAAATGTGTGCCTTTCTCTTGTCTCTTGTTCTTTACCAGAGATCCTATTAATTTTCTTTAAAAAATTAAATAAAGGATTCTAATTTCTAGTGTTTTCCTTTCTTTTTCTAATTATCCCATTGTTATTAAATTTAAACAAAGAAAATAGTTTTGAAAACTGGAAAAATAATTTTGTATAGTCCTTATATATGTTTTTAATCTTTTGAGTAATTTTAGCACTGCATTTTCTCAAGAATTATTATTACCACATTATTTTCACCCGTATGCTACATGTTGCACCCACTGTAGTTCTTAAGTTTGTATTGGTGAAAACTTCATAACAATCATTGCCACTGATTTTTCTCAAAAAAAGAAAAGAAAAAAAAAACAATTATGGCCACTGATTAAGGTTATTGGGAAATAATCAGGTACATCACGTCGGACTGCGATGCGGTGTCCATCATTCATGATGCACAAGGTTACGCCAAAATTCCAGAAGATGCGGTGGCTGATGTTCTTAGAGCTGGTATGTTGACTTCAATGCCAAACTTGTAATTTCTGGTTTTATTTTTTTCTTTTTTAGAAAAAAATCTGTGCTTAGTAATATATATTTTTTTGACAATGGGGTAAAATTAGGAATTTTGAAATAGTTAGATCTGGTGAATTCTTATGAGGGTCTCATAATAATAGAAACAAGTTAGTAATAAGTTTTAAATGATTGATCAAAATTTGACTTAGAAATAATTGTTCTCTCTTTAATTGGATACACTTTACCCAATCGAGATATTACATATCTCTTGATCGGGCCCCTTCTTTTAATGCATGGCCTTCTATTAGTGATTTGTTCATTCTTCTTAATAAAAGTTGTTCCTCAATCTTATGTAATTACTTATTTGGTTAAATTACAAGTTTAGTCAATGACTTCAGGAAGGTCCATGAAGTTTCCAATTTTCGATAAGTTCTGTAACTTTTATTTTTGTGTCTAAAAGATCCTTGACCTATTCATCACTTTTTAAAATTTACGAATTTATTGGACAAAAACTTGAAGGTGTAATGTTCTATTAAACACCAAATTCAATTTTATGTCTAATAGACGTGTTAATTTTTACAAAGGTTGAATATTTCATGAACATAATTGACACAAAATTGATGGGCCAAAAGACCTATTTGACATTTCTAGAGCTCAGGGACCTATAACACATGATTATATGAATCACAAAATATCCTTTTATGCCTATGATAGTGAATATCTTCACAATTTTGAAAGAAAAAGATTTTTCTTATAAAAAGAATTATATTATATATGTATAAAGCCTTTCAAATCCAATATTTTGAGCTGGCTTTGTTGACTAAATGTGGACTCTTTTGTAGTACTGTGGAGTGGACCCATTAGGGCCCAAATAAGTGGACCAATCTCAACTTCCTTTTCTATTTGTCTCTCCTTCCCCCCACAGAAAACTACAAACTATTCTGCACGCCTGCACATGTGGTAACCCATTACATCAAAGCCTACTTTTAACCAAAATTATATTTAAAATAGAAAAAAAAAACCATATTATTTTTCATTAAAGTTGTATCATATTATAAATGTTTCAATATTAGCCTAAATCCTTGAATTTGTTTGAGGGAAACCACTTCATTAGTAATTTCCCTTGAAATGCTTGATTCAATGAATAGAGTGAGGGTAGAATTATAACTTCTAAGATATTGGGGACAATATTAGCAGCTCTATGA
AGTTAAGATTGCCGTATTACCTAAAGTTCCATACCCCTCATTTGGCTTGAGTTTTGGATTGTTTTTGTAAGCATTTTGGTTCCGAGATTTAAACCTGATATACTTCTTATCATTCAGAGAAATGAAAAGTTTTGATTTATGACCAAAACATTCAAATACTTACAGGAATGGACATCAACTGTGGCACGTACCTGAAGGAGCACGCGAAATCCGCTGTGGAGATGAAGAAAGTACCTATTCCTCATATAGACCGAGCACTCCACAACCTCTTTGCCGTTAGAATGAGATTGGGTTTGTTTGATGGCAACCCAACCAAACTGCCTTTTGGCCAAATTGGTCCAGACCAAGTATGCTCAAAGCAGCATCAAGATCTGGCTCTTCAAGCTGCAAGAGAAGGCATTGTTCTCCTAAAGAACTCTGCCAAACTTCTTCCACTTTCCAAATCAAGTATACGTTCGCTTGCTGTTATAGGCCACAATGGCGATGAACCAAAAACACTTCGAGGAAATTATGCAGGAATTCCTTGCAAATCTGTTACCCCATTTCAAGGTTTGAATAGCTATGTCAAGAACACTGTTTACCACAGAGGCTGCAACTGGGCTAACTGTACGGAAGCTACAATTGATCAGGCAGTGCAAATTGCGAAAAGTGTGGATTACGTGGTGTTGGTTATGGGGCTGGATCAAACTCAAGAAAGAGAAGACTTTGATCGCACGGAGTTGGGGCTCCCAGGAAATCAAGAAGCACTCATTGCTGAAGTCGCTAAAGCTGCAAAACGTCCAGTCATTTTGGTGATTCTCTCTGGAGGTCCCGTCGATATATCTTCAGCCAAGTATAATGAGAAGATAGGAAGCATCTTGTGGGCTGGTTATCCAGGGCAAGCTGGAGGAACTGCCATTGCAGAGATCATATTTGGTGATCACAACCCAGGTTAACATTTTAAATATCCATCTTATTTTGTTTAACCAGCACTTCATTATGCTTGAAATCTCAAACCATATAGTGAACGAACTGGATCGACATGATATTTACCCTTAACTTTCCATGTTCGTGTGCAGGAGGAAGACTGCCATTAACTTGGTATCCACATGATTTCATCAAATTTCCAATGACAGACATGAGAATGAGAGCAGACCCTTCAACAGGCTACCCTGGTCGCACCTACCGCTTCTATAACGGACCGAAAGTCTACGAATTTGGCTACGGTCTCAGCTACTCCAACTATCTCTATGAATTCACATCAGTAACTGAAAGCAAACTACACCTTAGCAATCCAACAGCCAGCCAGCCAGCCAAAAGCTCTGACTCAATCCGCTACAGGCTCGTCTCAGAGCTGGACAAGAAGTTCTGTGAGAGCAGGGCTGTGAATGTGACCGTTGGAGTTAGAAATGACGGGGAAATGGCAGGTAAGCATTCAGTCTTGTTATTCGTTAAGCCTTCGAAACCCGTAAATGGGAGTCCTGTGAAGCAATTGGTGGGATTCAAAAGGGTGGAGATAAATGCAGGTGGGAGAAGTGAGATTGAGTTTTTGTTGAGCCCTTGTGAACATGTAAGTAAGGCTAATGAAGAGGGACTGATGATTATAGAAGAAGGGTCTTATTCATTGGTCGTAGGAGATGTGGAACATCCTCTTGATATCTTTGTTTGA

mRNA sequence

ATGGCTTCCTCCTCCTTCTTCCCTCGCAAAATGAAACTCCAAAAACTCCTTCTCTCCGCCGCCGTCTTCTCCGCCCTCCTCTCCCTCATCGTCGCCGACTCGTCGTCTCAGCTGCCGTACGCCTGCGACTCTTCCAACTCACTCACCAAAACTCTCCCATTCTGCAGAACCTCTCTGCCAATCAAACAAAGAGCCCGCGATCTCGTCTCTCGACTCACATTGGACGAGAAGGTCCTCCAGCTCGTCAACGCCGCTCCGGCGATCCCCCGTCTCGGCATCCCTGCCTACGAGTGGTGGTCGGAGGCCCTCCACGGCGTCGCCCACGTCGGCTACGGCATCCGCCTCAACGGCACTATCTCTGCTGCTACTAGCTTCCCTCAGGTCATCCTCACCGCCGCCTCCTTCGACGAAAACCTCTGGTACCAAATCGGACAGGCGATAGGAACAGAGGCGAGGGCGGTGTACAATGCAGGGCAGGCGAAGGGAATGACATTTTGGGCGCCAAACATAAACATATTCAGAGATCCAAGATGGGGAAGAGGGCAAGAAACACCAGGAGAAGATCCATTGATGACGGCAAAATACTCAGTAGCATATGTGAGAGGGATTCAAGGGGACGCCATTGAAGGAGGGAAGCTCGGGAATCAACTCAAAGCTTCGGCATGCTGCAAACACTTCACTGCATACGATCTGGACCGGTGGAATGGGATGACTCGATATGTATTCGATGCCAAGGTGACGATGCAGGACATGGCGGACACGTATCAGCCGCCATTTGAGAGCTGTGTGGAGAAGGGGAAAGCAAGTGGAATAATGTGCGCTTACAATAGGGTGAATGGAGTTCCCAGCTGTGCCGATCATCATCTTTTAACTGCTACTGCAAGAAAACAATGGAAGTTCAATGGGTACATCACGTCGGACTGCGATGCGGTGTCCATCATTCATGATGCACAAGGTTACGCCAAAATTCCAGAAGATGCGGTGGCTGATGTTCTTAGAGCTGGAATGGACATCAACTGTGGCACGTACCTGAAGGAGCACGCGAAATCCGCTGTGGAGATGAAGAAAGTACCTATTCCTCATATAGACCGAGCACTCCACAACCTCTTTGCCGTTAGAATGAGATTGGGTTTGTTTGATGGCAACCCAACCAAACTGCCTTTTGGCCAAATTGGTCCAGACCAAGTATGCTCAAAGCAGCATCAAGATCTGGCTCTTCAAGCTGCAAGAGAAGGCATTGTTCTCCTAAAGAACTCTGCCAAACTTCTTCCACTTTCCAAATCAAGTATACGTTCGCTTGCTGTTATAGGCCACAATGGCGATGAACCAAAAACACTTCGAGGAAATTATGCAGGAATTCCTTGCAAATCTGTTACCCCATTTCAAGGTTTGAATAGCTATGTCAAGAACACTGTTTACCACAGAGGCTGCAACTGGGCTAACTGTACGGAAGCTACAATTGATCAGGCAGTGCAAATTGCGAAAAGTGTGGATTACGTGGTGTTGGTTATGGGGCTGGATCAAACTCAAGAAAGAGAAGACTTTGATCGCACGGAGTTGGGGCTCCCAGGAAATCAAGAAGCACTCATTGCTGAAGTCGCTAAAGCTGCAAAACGTCCAGTCATTTTGGTGATTCTCTCTGGAGGTCCCGTCGATATATCTTCAGCCAAGTATAATGAGAAGATAGGAAGCATCTTGTGGGCTGGTTATCCAGGGCAAGCTGGAGGAACTGCCATTGCAGAGATCATATTTGGTGATCACAACCCAGGAGGAAGACTGCCATTAACTTGGTATCCACATGATTTCATCAAATTTCCAATGACAGACATGAGAATGAGAGCAGACCCTTCAACAGGCTACCCTGGTCGCACCTACCGCTTCTATAACGGACCGAAAGTCTACGAATTTGGCTACGGTCTCAGCTACTCCAACTATCTCTATGAATTCACATCAGTAACTGAAAGCAAACTACACCTTAGCAATCCAACAGCCAGCCAGCC
AGCCAAAAGCTCTGACTCAATCCGCTACAGGCTCGTCTCAGAGCTGGACAAGAAGTTCTGTGAGAGCAGGGCTGTGAATGTGACCGTTGGAGTTAGAAATGACGGGGAAATGGCAGGTAAGCATTCAGTCTTGTTATTCGTTAAGCCTTCGAAACCCGTAAATGGGAGTCCTGTGAAGCAATTGGTGGGATTCAAAAGGGTGGAGATAAATGCAGGTGGGAGAAGTGAGATTGAGTTTTTGTTGAGCCCTTGTGAACATGTAAGTAAGGCTAATGAAGAGGGACTGATGATTATAGAAGAAGGGTCTTATTCATTGGTCGTAGGAGATGTGGAACATCCTCTTGATATCTTTGTTTGA

Coding sequence (CDS)

ATGGCTTCCTCCTCCTTCTTCCCTCGCAAAATGAAACTCCAAAAACTCCTTCTCTCCGCCGCCGTCTTCTCCGCCCTCCTCTCCCTCATCGTCGCCGACTCGTCGTCTCAGCTGCCGTACGCCTGCGACTCTTCCAACTCACTCACCAAAACTCTCCCATTCTGCAGAACCTCTCTGCCAATCAAACAAAGAGCCCGCGATCTCGTCTCTCGACTCACATTGGACGAGAAGGTCCTCCAGCTCGTCAACGCCGCTCCGGCGATCCCCCGTCTCGGCATCCCTGCCTACGAGTGGTGGTCGGAGGCCCTCCACGGCGTCGCCCACGTCGGCTACGGCATCCGCCTCAACGGCACTATCTCTGCTGCTACTAGCTTCCCTCAGGTCATCCTCACCGCCGCCTCCTTCGACGAAAACCTCTGGTACCAAATCGGACAGGCGATAGGAACAGAGGCGAGGGCGGTGTACAATGCAGGGCAGGCGAAGGGAATGACATTTTGGGCGCCAAACATAAACATATTCAGAGATCCAAGATGGGGAAGAGGGCAAGAAACACCAGGAGAAGATCCATTGATGACGGCAAAATACTCAGTAGCATATGTGAGAGGGATTCAAGGGGACGCCATTGAAGGAGGGAAGCTCGGGAATCAACTCAAAGCTTCGGCATGCTGCAAACACTTCACTGCATACGATCTGGACCGGTGGAATGGGATGACTCGATATGTATTCGATGCCAAGGTGACGATGCAGGACATGGCGGACACGTATCAGCCGCCATTTGAGAGCTGTGTGGAGAAGGGGAAAGCAAGTGGAATAATGTGCGCTTACAATAGGGTGAATGGAGTTCCCAGCTGTGCCGATCATCATCTTTTAACTGCTACTGCAAGAAAACAATGGAAGTTCAATGGGTACATCACGTCGGACTGCGATGCGGTGTCCATCATTCATGATGCACAAGGTTACGCCAAAATTCCAGAAGATGCGGTGGCTGATGTTCTTAGAGCTGGAATGGACATCAACTGTGGCACGTACCTGAAGGAGCACGCGAAATCCGCTGTGGAGATGAAGAAAGTACCTATTCCTCATATAGACCGAGCACTCCACAACCTCTTTGCCGTTAGAATGAGATTGGGTTTGTTTGATGGCAACCCAACCAAACTGCCTTTTGGCCAAATTGGTCCAGACCAAGTATGCTCAAAGCAGCATCAAGATCTGGCTCTTCAAGCTGCAAGAGAAGGCATTGTTCTCCTAAAGAACTCTGCCAAACTTCTTCCACTTTCCAAATCAAGTATACGTTCGCTTGCTGTTATAGGCCACAATGGCGATGAACCAAAAACACTTCGAGGAAATTATGCAGGAATTCCTTGCAAATCTGTTACCCCATTTCAAGGTTTGAATAGCTATGTCAAGAACACTGTTTACCACAGAGGCTGCAACTGGGCTAACTGTACGGAAGCTACAATTGATCAGGCAGTGCAAATTGCGAAAAGTGTGGATTACGTGGTGTTGGTTATGGGGCTGGATCAAACTCAAGAAAGAGAAGACTTTGATCGCACGGAGTTGGGGCTCCCAGGAAATCAAGAAGCACTCATTGCTGAAGTCGCTAAAGCTGCAAAACGTCCAGTCATTTTGGTGATTCTCTCTGGAGGTCCCGTCGATATATCTTCAGCCAAGTATAATGAGAAGATAGGAAGCATCTTGTGGGCTGGTTATCCAGGGCAAGCTGGAGGAACTGCCATTGCAGAGATCATATTTGGTGATCACAACCCAGGAGGAAGACTGCCATTAACTTGGTATCCACATGATTTCATCAAATTTCCAATGACAGACATGAGAATGAGAGCAGACCCTTCAACAGGCTACCCTGGTCGCACCTACCGCTTCTATAACGGACCGAAAGTCTACGAATTTGGCTACGGTCTCAGCTACTCCAACTATCTCTATGAATTCACATCAGTAACTGAAAGCAAACTACACCTTAGCAATCCAACAGCCAGCCAGCC
AGCCAAAAGCTCTGACTCAATCCGCTACAGGCTCGTCTCAGAGCTGGACAAGAAGTTCTGTGAGAGCAGGGCTGTGAATGTGACCGTTGGAGTTAGAAATGACGGGGAAATGGCAGGTAAGCATTCAGTCTTGTTATTCGTTAAGCCTTCGAAACCCGTAAATGGGAGTCCTGTGAAGCAATTGGTGGGATTCAAAAGGGTGGAGATAAATGCAGGTGGGAGAAGTGAGATTGAGTTTTTGTTGAGCCCTTGTGAACATGTAAGTAAGGCTAATGAAGAGGGACTGATGATTATAGAAGAAGGGTCTTATTCATTGGTCGTAGGAGATGTGGAACATCCTCTTGATATCTTTGTTTGA

Protein sequence

MASSSFFPRKMKLQKLLLSAAVFSALLSLIVADSSSQLPYACDSSNSLTKTLPFCRTSLPIKQRARDLVSRLTLDEKVLQLVNAAPAIPRLGIPAYEWWSEALHGVAHVGYGIRLNGTISAATSFPQVILTAASFDENLWYQIGQAIGTEARAVYNAGQAKGMTFWAPNINIFRDPRWGRGQETPGEDPLMTAKYSVAYVRGIQGDAIEGGKLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMADTYQPPFESCVEKGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVSIIHDAQGYAKIPEDAVADVLRAGMDINCGTYLKEHAKSAVEMKKVPIPHIDRALHNLFAVRMRLGLFDGNPTKLPFGQIGPDQVCSKQHQDLALQAAREGIVLLKNSAKLLPLSKSSIRSLAVIGHNGDEPKTLRGNYAGIPCKSVTPFQGLNSYVKNTVYHRGCNWANCTEATIDQAVQIAKSVDYVVLVMGLDQTQEREDFDRTELGLPGNQEALIAEVAKAAKRPVILVILSGGPVDISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIKFPMTDMRMRADPSTGYPGRTYRFYNGPKVYEFGYGLSYSNYLYEFTSVTESKLHLSNPTASQPAKSSDSIRYRLVSELDKKFCESRAVNVTVGVRNDGEMAGKHSVLLFVKPSKPVNGSPVKQLVGFKRVEINAGGRSEIEFLLSPCEHVSKANEEGLMIIEEGSYSLVVGDVEHPLDIFV
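
The protein sequence above corresponds to the translation of the coding sequence (CDS). The sketch below illustrates this for the first 36 bases of the CDS only. It assumes the standard genetic code (NCBI translation table 1); `translate` and `cds_prefix` are illustrative names, not part of the record.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons ordered T, C, A, G
# at each position; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 36 bases of the CDS listed above.
cds_prefix = "ATGGCTTCCTCCTCCTTCTTCCCTCGCAAAATGAAA"
print(translate(cds_prefix))  # MASSSFFPRKMK
```

The output matches the first twelve residues of the protein sequence listed in the record.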
BLAST of Cla022081 vs. Swiss-Prot
Match: BXL7_ARATH (Probable beta-D-xylosidase 7 OS=Arabidopsis thaliana GN=BXL7 PE=2 SV=2)

HSP 1 Score: 1044.6 bits (2700), Expect = 5.2e-304
Identity = 508/763 (66.58%), Positives = 605/763 (79.29%), Query Frame = 1

Query: 26  LLSLIVADSSSQLPYACDSSNSLTKTLPFCRTSLPIKQRARDLVSRLTLDEKVLQLVNAA 85
           LL ++    S+  P++CD SN  TK   FCRT LPI +RARDLVSRLT+DEK+ QLVN A
Sbjct: 10  LLFIVHGVESAPPPHSCDPSNPTTKLYQFCRTDLPIGKRARDLVSRLTIDEKISQLVNTA 69

Query: 86  PAIPRLGIPAYEWWSEALHGVAHVGYGIRLNGTISAATSFPQVILTAASFDENLWYQIGQ 145
           P IPRLG+PAYEWWSEALHGVA+ G GIR NGT+ AATSFPQVILTAASFD   W++I Q
Sbjct: 70  PGIPRLGVPAYEWWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQ 129

Query: 146 AIGTEARAVYNAGQAKGMTFWAPNINIFRDPRWGRGQETPGEDPLMTAKYSVAYVRGIQG 205
            IG EAR VYNAGQA GMTFWAPNINIFRDPRWGRGQETPGEDP+MT  Y+VAYVRG+QG
Sbjct: 130 VIGKEARGVYNAGQANGMTFWAPNINIFRDPRWGRGQETPGEDPMMTGTYAVAYVRGLQG 189

Query: 206 DAIEGGK-LGNQLKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMADTYQPPFESCVE 265
           D+ +G K L N L+ASACCKHFTAYDLDRW G+TRYVF+A+V++ D+A+TYQPPF+ C+E
Sbjct: 190 DSFDGRKTLSNHLQASACCKHFTAYDLDRWKGITRYVFNAQVSLADLAETYQPPFKKCIE 249

Query: 266 KGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVSIIHDAQGYAKIP 325
           +G+ASGIMCAYNRVNG+PSCAD +LLT TAR QW F GYITSDCDAVSII+DAQGYAK P
Sbjct: 250 EGRASGIMCAYNRVNGIPSCADPNLLTRTARGQWAFRGYITSDCDAVSIIYDAQGYAKSP 309

Query: 326 EDAVADVLRAGMDINCGTYLKEHAKSAVEMKKVPIPHIDRALHNLFAVRMRLGLFDGNPT 385
           EDAVADVL+AGMD+NCG+YL++H KSA++ KKV    IDRAL NLF+VR+RLGLF+G+PT
Sbjct: 310 EDAVADVLKAGMDVNCGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRLGLFNGDPT 369

Query: 386 KLPFGQIGPDQVCSKQHQDLALQAAREGIVLLKNSAKLLPLSKSSIRSLAVIGHNGDEPK 445
           KLP+G I P++VCS  HQ LAL AAR GIVLLKN+ KLLP SK S+ SLAVIG N    K
Sbjct: 370 KLPYGNISPNEVCSPAHQALALDAARNGIVLLKNNLKLLPFSKRSVSSLAVIGPNAHVVK 429

Query: 446 TLRGNYAGIPCKSVTPFQGLNSYVKNTVYHRGCNWANCTEATIDQAVQIAKSVDYVVLVM 505
           TL GNYAG PCK+VTP   L SYVKN VYH+GC+   C+ A IDQAV IAK+ D+VVL+M
Sbjct: 430 TLLGNYAGPPCKTVTPLDALRSYVKNAVYHQGCDSVACSNAAIDQAVAIAKNADHVVLIM 489

Query: 506 GLDQTQEREDFDRTELGLPGNQEALIAEVAKAAKRPVILVILSGGPVDISSAKYNEKIGS 565
           GLDQTQE+EDFDR +L LPG Q+ LI  VA AAK+PV+LV++ GGPVDIS A  N KIGS
Sbjct: 490 GLDQTQEKEDFDRVDLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISFAANNNKIGS 549

Query: 566 ILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIKFPMTDMRMRADPSTGYPGRT 625
           I+WAGYPG+AGG AI+EIIFGDHNPGGRLP+TWYP  F+   MTDMRMR+  +TGYPGRT
Sbjct: 550 IIWAGYPGEAGGIAISEIIFGDHNPGGRLPVTWYPQSFVNIQMTDMRMRS--ATGYPGRT 609

Query: 626 YRFYNGPKVYEFGYGLSYSNYLYEFTSVTESKLHLSNPTASQPAKSSDSIRYRLVSELDK 685
           Y+FY GPKVYEFG+GLSYS Y Y F ++ E+ L+L+    S+   +SDS+RY LVSE+ K
Sbjct: 610 YKFYKGPKVYEFGHGLSYSAYSYRFKTLAETNLYLNQ---SKAQTNSDSVRYTLVSEMGK 669

Query: 686 KFCESRAVNVTVGVRNDGEMAGKHSVLLFVKPSKPVNGS--PVKQLVGFKRVEINAGGRS 745
           + C+     VTV V N GEMAGKH VL+F +  +         KQLVGFK + ++ G ++
Sbjct: 670 EGCDVAKTKVTVEVENQGEMAGKHPVLMFARHERGGEDGKRAEKQLVGFKSIVLSNGEKA 729

Query: 746 EIEFLLSPCEHVSKANEEGLMIIEEGSYSLVVGDVEHPLDIFV 786
           E+EF +  CEH+S+ANE G+M++EEG Y L VGD E PL + V
Sbjct: 730 EMEFEIGLCEHLSRANEFGVMVLEEGKYFLTVGDSELPLIVNV 767

BLAST of Cla022081 vs. Swiss-Prot
Match: BXL6_ARATH (Probable beta-D-xylosidase 6 OS=Arabidopsis thaliana GN=BXL6 PE=2 SV=1)

HSP 1 Score: 799.7 bits (2064), Expect = 2.9e-230
Identity = 404/794 (50.88%), Positives = 528/794 (66.50%), Query Frame = 1

Query: 11  MKLQKLLLSAAVFSALLSLIVADSSSQLPYACDSSNSLTKTLPFCRTSLPIKQRARDLVS 70
           M LQ  L+S   F++ ++    +  S   + C   +    + PFC  SL IKQRA  LVS
Sbjct: 1   MNLQLTLISLLFFTSAIAETFKNLDSHPQFPCKPPHF--SSYPFCNVSLSIKQRAISLVS 60

Query: 71  RLTLDEKVLQLVNAAPAIPRLGIPAYEWWSEALHGVAHVGYGIRLNGTISAATSFPQVIL 130
            L L EK+ QL N A ++PRLGIP YEWWSE+LHG+A  G G+  NG+ISAATSFPQVI+
Sbjct: 61  LLMLPEKIGQLSNTAASVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSFPQVIV 120

Query: 131 TAASFDENLWYQIGQAIGTEARAVYNAGQAKGMTFWAPNINIFRDPRWGRGQETPGEDPL 190
           +AASF+  LWY+IG A+  E RA+YN GQA G+TFWAPNIN+FRDPRWGRGQETPGEDP 
Sbjct: 121 SAASFNRTLWYEIGSAVAVEGRAMYNGGQA-GLTFWAPNINVFRDPRWGRGQETPGEDPK 180

Query: 191 MTAKYSVAYVRGIQ------------GDAIEGGK----LGNQLKASACCKHFTAYDLDRW 250
           + ++Y V +VRG Q             D ++  +       +L  SACCKHFTAYDL++W
Sbjct: 181 VVSEYGVEFVRGFQEKKKRKVLKRRFSDDVDDDRHDDDADGKLMLSACCKHFTAYDLEKW 240

Query: 251 NGMTRYVFDAKVTMQDMADTYQPPFESCVEKGKASGIMCAYNRVNGVPSCADHHLLTATA 310
              TRY F+A VT QDM DTYQPPFE+C+  GKAS +MC+YN VNGVP+CA   LL   A
Sbjct: 241 GNFTRYDFNAVVTEQDMEDTYQPPFETCIRDGKASCLMCSYNAVNGVPACAQGDLLQK-A 300

Query: 311 RKQWKFNGYITSDCDAVSIIHDAQGYAKIPEDAVADVLRAGMDINCGTYLKEHAKSAVEM 370
           R +W F GYITSDCDAV+ I   QGY K PE+AVAD ++AG+DINCGTY+  H +SA+E 
Sbjct: 301 RVEWGFEGYITSDCDAVATIFAYQGYTKSPEEAVADAIKAGVDINCGTYMLRHTQSAIEQ 360

Query: 371 KKVPIPHIDRALHNLFAVRMRLGLFDGNPTKLPFGQIGPDQVCSKQHQDLALQAAREGIV 430
            KV    +DRAL NLFAV++RLGLFDG+P +  +G++G + +CS  H+ LAL+A R+GIV
Sbjct: 361 GKVSEELVDRALLNLFAVQLRLGLFDGDPRRGQYGKLGSNDICSSDHRKLALEATRQGIV 420

Query: 431 LLKNSAKLLPLSKSSIRSLAVIGHNGDEPKTLRGNYAGIPCKSVTPFQGLNSYVKNTVYH 490
           LLKN  KLLPL+K+ + SLA++G   +    + G Y G PC+  T F  L  YVK T Y 
Sbjct: 421 LLKNDHKLLPLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYVKKTSYA 480

Query: 491 RGCNWANCTEAT-IDQAVQIAKSVDYVVLVMGLDQTQEREDFDRTELGLPGNQEALIAEV 550
            GC+  +C   T   +AV IAK  D+V++V GLD +QE ED DR  L LPG Q+ L++ V
Sbjct: 481 SGCSDVSCDSDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRVSLSLPGKQKDLVSHV 540

Query: 551 AKAAKRPVILVILSGGPVDISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRL 610
           A  +K+PVILV+  GGPVD++ AK + +IGSI+W GYPG+ GG A+AEIIFGD NPGGRL
Sbjct: 541 AAVSKKPVILVLTGGGPVDVTFAKNDPRIGSIIWIGYPGETGGQALAEIIFGDFNPGGRL 600

Query: 611 PLTWYPHDFIKFPMTDMRMRADPSTGYPGRTYRFYNGPKVYEFGYGLSYSNYLYEFTS-- 670
           P TWYP  F    M+DM MRA+ S GYPGRTYRFY GP+VY FG GLSY+ + Y+  S  
Sbjct: 601 PTTWYPESFTDVAMSDMHMRANSSRGYPGRTYRFYTGPQVYSFGTGLSYTKFEYKILSAP 660

Query: 671 --VTESKLHLSNPTASQPAKSSDSIRYRLVSELDKKFCESRAVNVTVGVRNDGEMAGKHS 730
             ++ S+L     +  +  +  + +RY  + ++    CES   NV V V N GE+ G H 
Sbjct: 661 IRLSLSELLPQQSSHKKQLQHGEELRYLQLDDVIVNSCESLRFNVRVHVSNTGEIDGSHV 720

Query: 731 VLLFVKPSKPVNGSPVKQLVGFKRVEINAGGRSEIEFLLSPCEHVSKANEEGLMIIEEGS 784
           V+LF K    ++G P KQL+G+ RV + +    E  F++ PC+ +S AN+ G  +I  GS
Sbjct: 721 VMLFSKMPPVLSGVPEKQLIGYDRVHVRSNEMMETVFVIDPCKQLSVANDVGKRVIPLGS 780

BLAST of Cla022081 vs. Swiss-Prot
Match: BXL5_ARATH (Probable beta-D-xylosidase 5 OS=Arabidopsis thaliana GN=BXL5 PE=2 SV=2)

HSP 1 Score: 752.3 bits (1941), Expect = 5.4e-216
Identity = 380/759 (50.07%), Positives = 507/759 (66.80%), Query Frame = 1

Query: 26  LLSLIVADSSSQLPYACDSSNSLTKTLPFCRTSLPIKQRARDLVSRLTLDEKVLQLVNAA 85
           +++L+ +   SQ  +ACD S   T    FC  SL  + RA+DLVSRL+L EKV QLVN A
Sbjct: 13  IIALVSSLCESQKNFACDISAPATAKYGFCNVSLSYEARAKDLVSRLSLKEKVQQLVNKA 72

Query: 86  PAIPRLGIPAYEWWSEALHGVAHVGYGIRLNGTISAATSFPQVILTAASFDENLWYQIGQ 145
             +PRLG+P YEWWSEALHGV+ VG G+  NGT+  ATSFP  ILTAASF+ +LW ++G+
Sbjct: 73  TGVPRLGVPPYEWWSEALHGVSDVGPGVHFNGTVPGATSFPATILTAASFNTSLWLKMGE 132

Query: 146 AIGTEARAVYNAGQAKGMTFWAPNINIFRDPRWGRGQETPGEDPLMTAKYSVAYVRGIQG 205
            + TEARA++N G A G+T+W+PN+N+FRDPRWGRGQETPGEDPL+ +KY+V YV+G+Q 
Sbjct: 133 VVSTEARAMHNVGLA-GLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSKYAVNYVKGLQ- 192

Query: 206 DAIEGGKLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMADTYQPPFESCVEK 265
           D  + GK   +LK S+CCKH+TAYDLD W G+ R+ FDAKVT QD+ DTYQ PF+SCVE+
Sbjct: 193 DVHDAGK-SRRLKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTYQTPFKSCVEE 252

Query: 266 GKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVSIIHDAQGYAKIPE 325
           G  S +MC+YNRVNG+P+CAD +LL    R QW+ +GYI SDCD++ +  +   Y K  E
Sbjct: 253 GDVSSVMCSYNRVNGIPTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYFNDIHYTKTRE 312

Query: 326 DAVADVLRAGMDINCGTYLKEHAKSAVEMKKVPIPHIDRALHNLFAVRMRLGLFDGNPTK 385
           DAVA  L+AG+++NCG +L ++ ++AV++KK+    +D AL   + V MRLG FDG+P  
Sbjct: 313 DAVALALKAGLNMNCGDFLGKYTENAVKLKKLNGSDVDEALIYNYIVLMRLGFFDGDPKS 372

Query: 386 LPFGQIGPDQVCSKQHQDLALQAAREGIVLLKNSAKLLPLSKSSIRSLAVIGHNGDEPKT 445
           LPFG +GP  VCSK HQ LAL+AA++GIVLL+N    LPL K++++ LAVIG N +  K 
Sbjct: 373 LPFGNLGPSDVCSKDHQMLALEAAKQGIVLLENRGD-LPLPKTTVKKLAVIGPNANATKV 432

Query: 446 LRGNYAGIPCKSVTPFQGLNSYV-KNTVYHRGCNWANCTEAT-IDQAVQIAKSVDYVVLV 505
           +  NYAG+PCK  +P QGL  YV +  VY  GC    C + T I  AV+     D  VLV
Sbjct: 433 MISNYAGVPCKYTSPIQGLQKYVPEKIVYEPGCKDVKCGDQTLISAAVKAVSEADVTVLV 492

Query: 506 MGLDQTQEREDFDRTELGLPGNQEALIAEVAKAAKRPVILVILSGGPVDISSAKYNEKIG 565
           +GLDQT E E  DR  L LPG QE L+ +VA AAK+ V+LVI+S GP+DIS AK    I 
Sbjct: 493 VGLDQTVEAEGLDRVNLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDISFAKNLSTIR 552

Query: 566 SILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFI-KFPMTDMRMRADPSTGYPG 625
           ++LW GYPG+AGG AIA++IFGD+NP GRLP TWYP +F  K  MTDM MR + ++G+PG
Sbjct: 553 AVLWVGYPGEAGGDAIAQVIFGDYNPSGRLPETWYPQEFADKVAMTDMNMRPNSTSGFPG 612

Query: 626 RTYRFYNGPKVYEFGYGLSYSNYLYEFTSVTESKLHL-SNPTASQPAKSSDSIRYRLVSE 685
           R+YRFY G  +Y+FGYGLSYS++   F     S +H+ +NP  +    +S         +
Sbjct: 613 RSYRFYTGKPIYKFGYGLSYSSF-STFVLSAPSIIHIKTNPIMNLNKTTS--------VD 672

Query: 686 LDKKFCESRAVNVTVGVRNDGEMAGKHSVLLFVKPSK-----PVNGSPVKQLVGFKRVEI 745
           +    C    + + +GV+N G  +G H VL+F KP K        G P+ QLVGF+RVE+
Sbjct: 673 ISTVNCHDLKIRIVIGVKNHGLRSGSHVVLVFWKPPKCSKSLVGGGVPLTQLVGFERVEV 732

Query: 746 NAGGRSEIEFLLSPCEHVSKANEEGLMIIEEGSYSLVVG 776
                 +       C+ +S  +  G   +  G + LV+G
Sbjct: 733 GRSMTEKFTVDFDVCKALSLVDTHGKRKLVTGHHKLVIG 758

BLAST of Cla022081 vs. Swiss-Prot
Match: BXL3_ARATH (Beta-D-xylosidase 3 OS=Arabidopsis thaliana GN=BXL3 PE=1 SV=1)

HSP 1 Score: 734.9 bits (1896), Expect = 8.9e-211
Identity = 380/759 (50.07%), Positives = 507/759 (66.80%), Query Frame = 1

Query: 32  ADSSSQLPYACD-SSNSLTKTLPFCRTSLPIKQRARDLVSRLTLDEKVLQLVNAAPAIPR 91
           +++ S   +ACD + N     L FC   L IK R  DLV RLTL+EK+  L + A  + R
Sbjct: 26  SNNQSSPVFACDVTGNPSLAGLRFCNAGLSIKARVTDLVGRLTLEEKIGFLTSKAIGVSR 85

Query: 92  LGIPAYEWWSEALHGVAHVGYGIRLNGTISAATSFPQVILTAASFDENLWYQIGQAIGTE 151
           LGIP+Y+WWSEALHGV++VG G R  G +  ATSFPQVILTAASF+ +L+  IG+ + TE
Sbjct: 86  LGIPSYKWWSEALHGVSNVGGGSRFTGQVPGATSFPQVILTAASFNVSLFQAIGKVVSTE 145

Query: 152 ARAVYNAGQAKGMTFWAPNINIFRDPRWGRGQETPGEDPLMTAKYSVAYVRGIQGDAIEG 211
           ARA+YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDP +++KY+VAYV+G+Q    +G
Sbjct: 146 ARAMYNVGSA-GLTFWSPNVNIFRDPRWGRGQETPGEDPTLSSKYAVAYVKGLQ--ETDG 205

Query: 212 GKLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMADTYQPPFESCVEKGKASG 271
           G   N+LK +ACCKH+TAYD+D W  + R  F+A V  QD+ADT+QPPF+SCV  G  + 
Sbjct: 206 GD-PNRLKVAACCKHYTAYDIDNWRNVNRLTFNAVVNQQDLADTFQPPFKSCVVDGHVAS 265

Query: 272 IMCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVSIIHDAQGYAKIPEDAVAD 331
           +MC+YN+VNG P+CAD  LL+   R QW+ NGYI SDCD+V ++   Q YAK PE+AVA 
Sbjct: 266 VMCSYNQVNGKPTCADPDLLSGVIRGQWQLNGYIVSDCDSVDVLFRKQHYAKTPEEAVAK 325

Query: 332 VLRAGMDINCGTYLKEHAKSAVEMKKVPIPHIDRALHNLFAVRMRLGLFDGNPTKLPFGQ 391
            L AG+D+NC  +  +HA  AV+   V    ID+A+ N FA  MRLG FDG+P K  +G 
Sbjct: 326 SLLAGLDLNCDHFNGQHAMGAVKAGLVNETAIDKAISNNFATLMRLGFFDGDPKKQLYGG 385

Query: 392 IGPDQVCSKQHQDLALQAAREGIVLLKNSAKLLPLSKSSIRSLAVIGHNGDEPKTLRGNY 451
           +GP  VC+  +Q+LA   AR+GIVLLKNSA  LPLS S+I++LAVIG N +  +T+ GNY
Sbjct: 386 LGPKDVCTADNQELARDGARQGIVLLKNSAGSLPLSPSAIKTLAVIGPNANATETMIGNY 445

Query: 452 AGIPCKSVTPFQGLNSYVKNTVYHRGCNWANCTEATIDQAVQIAKSVDYVVLVMGLDQTQ 511
            G+PCK  TP QGL   V +T Y  GCN A C +A I  AV +A S D VVLV+G DQ+ 
Sbjct: 446 HGVPCKYTTPLQGLAETVSST-YQLGCNVA-CVDADIGSAVDLAASADAVVLVVGADQSI 505

Query: 512 EREDFDRTELGLPGNQEALIAEVAKAAKRPVILVILSGGPVDISSAKYNEKIGSILWAGY 571
           ERE  DR +L LPG Q+ L+  VA AA+ PV+LVI+SGG  DI+ AK ++KI SI+W GY
Sbjct: 506 EREGHDRVDLYLPGKQQELVTRVAMAARGPVVLVIMSGGGFDITFAKNDKKITSIMWVGY 565

Query: 572 PGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFI-KFPMTDMRMRADPSTGYPGRTYRFYN 631
           PG+AGG AIA++IFG HNP G LP+TWYP  ++ K PM++M MR D S GYPGR+YRFY 
Sbjct: 566 PGEAGGLAIADVIFGRHNPSGNLPMTWYPQSYVEKVPMSNMNMRPDKSKGYPGRSYRFYT 625

Query: 632 GPKVYEFGYGLSYSNY---LYEFTSVTESKLHLSNPTASQPAKSSDSIRYRLVSELDKKF 691
           G  VY F   L+Y+ +   L +   +    L  ++P  S   +S D+I     + ++   
Sbjct: 626 GETVYAFADALTYTKFDHQLIKAPRLVSLSLDENHPCRSSECQSLDAIGPHCENAVE--- 685

Query: 692 CESRAVNVTVGVRNDGEMAGKHSVLLFVKPSKPVNGSPVKQLVGFKRVEINAGGRSEIEF 751
                  V + V+N G+ AG H+V LF   S  V+GSP+KQL+GF+++ +     + + F
Sbjct: 686 -GGSDFEVHLNVKNTGDRAGSHTVFLFT-TSPQVHGSPIKQLLGFEKIRLGKSEEAVVRF 745

Query: 752 LLSPCEHVSKANEEGLMIIEEGSYSLVVGDVEHPLDIFV 786
            ++ C+ +S  +E G   I  G + L VG ++H L+I V
Sbjct: 746 NVNVCKDLSVVDETGKRKIALGHHLLHVGSLKHSLNISV 773

BLAST of Cla022081 vs. Swiss-Prot
Match: XYL1_MEDSV (Beta-xylosidase/alpha-L-arabinofuranosidase 1 (Fragment) OS=Medicago sativa subsp. varia GN=Xyl1 PE=1 SV=1)

HSP 1 Score: 733.0 bits (1891), Expect = 3.4e-210
Identity = 370/785 (47.13%), Positives = 508/785 (64.71%), Query Frame = 1

Query: 9   RKMKLQKLLLSAAVF--SALLSLIVADSSSQLPYACD-SSNSLTKTLPFCRTSLPIKQRA 68
           R+ K+  + L  ++F  + LL+       +   +ACD + N+   +  FC  SL ++ R 
Sbjct: 6   REPKVSSVFLCFSIFYVTVLLNCNHVYGQTSTVFACDVAKNTNVSSYGFCDNSLSVEDRV 65

Query: 69  RDLVSRLTLDEKVLQLVNAAPAIPRLGIPAYEWWSEALHGVAHVGYGIRLNGTISAATSF 128
            DLV RLTL EK+  L N+A  + RLGIP YEWWSEALHGV+++G G   +  +  AT+F
Sbjct: 66  SDLVKRLTLQEKIGNLGNSAVEVSRLGIPKYEWWSEALHGVSNIGPGTHFSSLVPGATNF 125

Query: 129 PQVILTAASFDENLWYQIGQAIGTEARAVYNAGQAKGMTFWAPNINIFRDPRWGRGQETP 188
           P  ILTAASF+ +L+  IG  +  EARA+YN G A G+T+W+PNINIFRDPRWGRGQETP
Sbjct: 126 PMPILTAASFNTSLFQAIGSVVSNEARAMYNVGLA-GLTYWSPNINIFRDPRWGRGQETP 185

Query: 189 GEDPLMTAKYSVAYVRGIQ----GDAIEGGKLGNQLKASACCKHFTAYDLDRWNGMTRYV 248
           GEDPL+++KY+  YV+G+Q    GD+       ++LK +ACCKH+TAYD+D W G+ RY 
Sbjct: 186 GEDPLLSSKYAAGYVKGLQQTDDGDS-------DKLKVAACCKHYTAYDVDNWKGVQRYT 245

Query: 249 FDAKVTMQDMADTYQPPFESCVEKGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFN 308
           FDA V+ QD+ DT+QPPF+SCV  G  + +MC+YN+VNG P+CAD  LL    R +WK N
Sbjct: 246 FDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNGKPTCADPDLLKGVIRGKWKLN 305

Query: 309 GYITSDCDAVSIIHDAQGYAKIPEDAVADVLRAGMDINCGTYLKEHAKSAVEMKKVPIPH 368
           GYI SDCD+V +++  Q Y K PE+A A  + +G+D++CG+YL ++   AV+   V    
Sbjct: 306 GYIVSDCDSVEVLYKDQHYTKTPEEAAAKTILSGLDLDCGSYLGQYTGGAVKQGLVDEAS 365

Query: 369 IDRALHNLFAVRMRLGLFDGNPTKLPFGQIGPDQVCSKQHQDLALQAAREGIVLLKNSAK 428
           I  A+ N FA  MRLG FDG+P+K P+G +GP  VC+ ++Q+LA +AAR+GIVLLKNS +
Sbjct: 366 ITNAVSNNFATLMRLGFFDGDPSKQPYGNLGPKDVCTPENQELAREAARQGIVLLKNSPR 425

Query: 429 LLPLSKSSIRSLAVIGHNGDEPKTLRGNYAGIPCKSVTPFQGLNSYVKNTVYHRGCNWAN 488
            LPLS  +I+SLAVIG N +  + + GNY GIPCK  +P QGL ++V  T Y  GC    
Sbjct: 426 SLPLSSKAIKSLAVIGPNANATRVMIGNYEGIPCKYTSPLQGLTAFVP-TSYAPGCPDVQ 485

Query: 489 CTEATIDQAVQIAKSVDYVVLVMGLDQTQEREDFDRTELGLPGNQEALIAEVAKAAKRPV 548
           C  A ID A +IA S D  ++V+G +   E E  DR  + LPG Q+ L+ EVA  +K PV
Sbjct: 486 CANAQIDDAAKIAASADATIIVVGANLAIEAESLDRVNILLPGQQQQLVNEVANVSKGPV 545

Query: 549 ILVILSGGPVDISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHD 608
           ILVI+SGG +D+S AK N+KI SILW GYPG+AGG AIA++IFG +NP GRLP+TWYP  
Sbjct: 546 ILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGAAIADVIFGSYNPSGRLPMTWYPQS 605

Query: 609 FI-KFPMTDMRMRADPSTGYPGRTYRFYNGPKVYEFGYGLSYSNYLYEFTSVTESKLHLS 668
           ++ K PMT+M MRADP+TGYPGRTYRFY G  V+ FG G+S+           E K+  +
Sbjct: 606 YVEKVPMTNMNMRADPATGYPGRTYRFYKGETVFSFGDGMSF--------GTVEHKIVKA 665

Query: 669 NPTASQPAKSSDSIRYRLVSELD--KKFCESRAVNVTVGVRNDGEMAGKHSVLLFVKPSK 728
               S P       R      LD   K C++ A ++ + V+N G+M+  HSVLLF  P  
Sbjct: 666 PQLVSVPLAEDHECRSLECKSLDVADKHCQNLAFDIHLSVKNMGKMSSSHSVLLFFTPPN 725

Query: 729 PVNGSPVKQLVGFKRVEINAGGRSEIEFLLSPCEHVSKANEEGLMIIEEGSYSLVVGDVE 784
            V+ +P K L+GF++V++       + F +  C  +S  +E G   +  G + L VG+++
Sbjct: 726 -VHNAPQKHLLGFEKVQLAGKSEGMVRFKVDVCNDLSVVDELGNRKVPLGDHMLHVGNLK 772

BLAST of Cla022081 vs. TrEMBL
Match: A0A0A0LMA9_CUCSA (Periplasmic beta-glucosidase OS=Cucumis sativus GN=Csa_2G308360 PE=4 SV=1)

HSP 1 Score: 1449.1 bits (3750), Expect = 0.0e+00
Identity = 711/780 (91.15%), Positives = 743/780 (95.26%), Query Frame = 1

Query: 6   FFPRKMKLQKLLLSAAVFSALLSLIVADSSSQLPYACDSSNSLTKTLPFCRTSLPIKQRA 65
           FFP K+KL  LLLSAA     LSLIVA SSSQ PYACDSSN LTKTLPFC+T LPIK RA
Sbjct: 8   FFPHKIKLLTLLLSAA----FLSLIVAGSSSQPPYACDSSNPLTKTLPFCKTYLPIKLRA 67

Query: 66  RDLVSRLTLDEKVLQLVNAAPAIPRLGIPAYEWWSEALHGVAHVGYGIRLNGTISAATSF 125
           RDLVSRLTLDEKVLQLVN  P IPRLGIPAYEWWSEALHGVA+VGYGIRLNGTI+AATSF
Sbjct: 68  RDLVSRLTLDEKVLQLVNTVPPIPRLGIPAYEWWSEALHGVANVGYGIRLNGTITAATSF 127

Query: 126 PQVILTAASFDENLWYQIGQAIGTEARAVYNAGQAKGMTFWAPNINIFRDPRWGRGQETP 185
           PQVILTAASFDENLWYQIGQAIGTEARAVYNAGQAKGMTFW PNINIFRDPRWGRGQETP
Sbjct: 128 PQVILTAASFDENLWYQIGQAIGTEARAVYNAGQAKGMTFWTPNINIFRDPRWGRGQETP 187

Query: 186 GEDPLMTAKYSVAYVRGIQGDAIEGGKLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAK 245
           GEDPLMT KYSVAYVRGIQGDAIEGGKLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAK
Sbjct: 188 GEDPLMTGKYSVAYVRGIQGDAIEGGKLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAK 247

Query: 246 VTMQDMADTYQPPFESCVEKGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYIT 305
           VTMQDMADTYQPPFESCVE+GKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYIT
Sbjct: 248 VTMQDMADTYQPPFESCVEEGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYIT 307

Query: 306 SDCDAVSIIHDAQGYAKIPEDAVADVLRAGMDINCGTYLKEHAKSAVEMKKVPIPHIDRA 365
           SDCDAVSIIHDAQGYAKIPEDAVADVLRAGMD+NCGTYLKEH KSAVEMKKVP+ HIDRA
Sbjct: 308 SDCDAVSIIHDAQGYAKIPEDAVADVLRAGMDVNCGTYLKEHTKSAVEMKKVPMLHIDRA 367

Query: 366 LHNLFAVRMRLGLFDGNPTKLPFGQIGPDQVCSKQHQDLALQAAREGIVLLKNSAKLLPL 425
           L NLF+VRMRLGLFDGNPTKLPFGQIG DQVCS+QHQ+LALQAAREGIVLLKNSAKLLPL
Sbjct: 368 LRNLFSVRMRLGLFDGNPTKLPFGQIGRDQVCSQQHQNLALQAAREGIVLLKNSAKLLPL 427

Query: 426 SKSSIRSLAVIGHNGDEPKTLRGNYAGIPCKSVTPFQGLNSYVKNTVYHRGCNWANCTEA 485
           SKS+  SLAVIGHNG++PKTLRGNYAGIPCKS TPFQGLN+YVKNTVYHRGCN+ANCTEA
Sbjct: 428 SKSNTHSLAVIGHNGNDPKTLRGNYAGIPCKSATPFQGLNNYVKNTVYHRGCNYANCTEA 487

Query: 486 TIDQAVQIAKSVDYVVLVMGLDQTQEREDFDRTELGLPGNQEALIAEVAKAAKRPVILVI 545
           TI QAV+IAKSVDYVVLVMGLDQTQEREDFDRTELGLPG Q+ LIAEVAKAAKRPVILVI
Sbjct: 488 TIYQAVKIAKSVDYVVLVMGLDQTQEREDFDRTELGLPGKQDKLIAEVAKAAKRPVILVI 547

Query: 546 LSGGPVDISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIKF 605
           LSGGPVDISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIKF
Sbjct: 548 LSGGPVDISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIKF 607

Query: 606 PMTDMRMRADPSTGYPGRTYRFYNGPKVYEFGYGLSYSNYLYEFTSVTESKLHLSNPTAS 665
           PMTDMRMRAD STGYPGRTYRFYNGPKVYEFGYGLSYSN++YEFTSV+ESKL LS+P AS
Sbjct: 608 PMTDMRMRADSSTGYPGRTYRFYNGPKVYEFGYGLSYSNHIYEFTSVSESKLLLSHPKAS 667

Query: 666 QPAKSSDSIRYRLVSELDKKFCESRAVNVTVGVRNDGEMAGKHSVLLFVKPSKPVNGSPV 725
           QPAK+SD + YRLVSELDKKFCES+ VNVTVGVRN+GEM GKHSVLLF+KPSKP+NGSPV
Sbjct: 668 QPAKNSDLVSYRLVSELDKKFCESKTVNVTVGVRNEGEMGGKHSVLLFIKPSKPINGSPV 727

Query: 726 KQLVGFKRVEINAGGRSEIEFLLSPCEHVSKANEEGLMIIEEGSYSLVVGDVEHPLDIFV 785
           KQLVGFK+VEINAG R EIEFL+SPC+H+SKA+EEGLMIIEEGSYSLVVGDVEHPLDIFV
Sbjct: 728 KQLVGFKKVEINAGERREIEFLVSPCDHISKASEEGLMIIEEGSYSLVVGDVEHPLDIFV 783

BLAST of Cla022081 vs. TrEMBL
Match: A0A061FFD4_THECC (Glycosyl hydrolase family protein isoform 3 OS=Theobroma cacao GN=TCM_034945 PE=4 SV=1)

HSP 1 Score: 1124.0 bits (2906), Expect = 0.0e+00
Identity = 538/780 (68.97%), Positives = 651/780 (83.46%), Query Frame = 1

Query: 10   KMKLQKL-LLSAAVFSALLSLIVADSSSQLPYACDSSNSLTKTLPFCRTSLPIKQRARDL 69
            KMKLQKL LL+    S+LL L++ADS+ Q P++CD+S+  TK+ PFC+T+LPI QR +DL
Sbjct: 815  KMKLQKLSLLTLIHISSLLLLVLADST-QPPFSCDTSDPRTKSYPFCKTTLPINQRVQDL 874

Query: 70   VSRLTLDEKVLQLVNAAPAIPRLGIPAYEWWSEALHGVAH---VGYGIRLNGTISAATSF 129
            +SRLTLDEK+ QLVN+AP IPRLGIP  EWWSEALHGVA    V  GIR NGTI +ATSF
Sbjct: 875  ISRLTLDEKISQLVNSAPPIPRLGIPGDEWWSEALHGVAFLASVSQGIRFNGTIQSATSF 934

Query: 130  PQVILTAASFDENLWYQIGQAIGTEARAVYNAGQAKGMTFWAPNINIFRDPRWGRGQETP 189
            PQVILTAASFD +LW++IGQAIG EAR +YNAGQA+GMTFWAPNINI+RDPRWGRGQETP
Sbjct: 935  PQVILTAASFDAHLWFRIGQAIGIEARGIYNAGQARGMTFWAPNINIYRDPRWGRGQETP 994

Query: 190  GEDPLMTAKYSVAYVRGIQGDAIEGGKLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAK 249
            GEDPL+T KY+V++VRGIQGD+ EGG LG  L+ SACCKHFTAYDLD W G+ R+VF+AK
Sbjct: 995  GEDPLVTGKYAVSFVRGIQGDSFEGGMLGEHLQVSACCKHFTAYDLDNWKGVNRFVFNAK 1054

Query: 250  VTMQDMADTYQPPFESCVEKGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYIT 309
            V++QD+ADTYQPPF+SC+++GKASGIMCAYNRVNGVP+CAD++LL+ TAR QW FNGYIT
Sbjct: 1055 VSLQDLADTYQPPFQSCIQQGKASGIMCAYNRVNGVPNCADYNLLSKTARGQWGFNGYIT 1114

Query: 310  SDCDAVSIIHDAQGYAKIPEDAVADVLRAGMDINCGTYLKEHAKSAVEMKKVPIPHIDRA 369
            SDCDAVSI+H+ QGYAK+PEDAVADVL+AGMD+NCG YLK + KSAV+ +K+P+  IDRA
Sbjct: 1115 SDCDAVSIMHEKQGYAKVPEDAVADVLKAGMDVNCGNYLKNYTKSAVKKRKLPMSEIDRA 1174

Query: 370  LHNLFAVRMRLGLFDGNPTKLPFGQIGPDQVCSKQHQDLALQAAREGIVLLKNSAKLLPL 429
            LHNLF+VRMRLGLF+GNPTK PFG IG DQVCS++HQ+LAL+AAR GIVLLKN+  LLPL
Sbjct: 1175 LHNLFSVRMRLGLFNGNPTKQPFGNIGSDQVCSQEHQNLALEAARNGIVLLKNTDSLLPL 1234

Query: 430  SKSSIRSLAVIGHNGDEPKTLRGNYAGIPCKSVTPFQGLNSYVKNTVYHRGCNWANCTEA 489
            SK+   SLAVIG N +  KTL GNYAG PCKS+TP Q L SY K+T YH GC+  NC+ A
Sbjct: 1235 SKTKTTSLAVIGPNANSAKTLVGNYAGPPCKSITPLQALQSYAKDTRYHPGCSAVNCSSA 1294

Query: 490  TIDQAVQIAKSVDYVVLVMGLDQTQEREDFDRTELGLPGNQEALIAEVAKAAKRPVILVI 549
              DQAV+IAK  D+VVLVMGLDQTQERED DR +L LP  Q+ LI+ +A+AAK PVILV+
Sbjct: 1295 LTDQAVKIAKGADHVVLVMGLDQTQEREDHDRVDLVLPAKQQNLISSIARAAKNPVILVL 1354

Query: 550  LSGGPVDISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIKF 609
            LSGGPVDI+ AKY++ IGSILWAGYPG+AGG A+AEIIFGDHNPGGRLP+TWYP  FIK 
Sbjct: 1355 LSGGPVDITFAKYDQHIGSILWAGYPGEAGGLALAEIIFGDHNPGGRLPVTWYPQSFIKV 1414

Query: 610  PMTDMRMRADPSTGYPGRTYRFYNGPKVYEFGYGLSYSNYLYEFTSVTESKLHLSNPTAS 669
            PMTDMRMR +PS+GYPGRTYRFY GPKV+EFGYGLSYS Y YEF  VT++K++L++ + +
Sbjct: 1415 PMTDMRMRPEPSSGYPGRTYRFYQGPKVFEFGYGLSYSKYSYEFLPVTQNKVYLNHQSCN 1474

Query: 670  QPAKSSDSIRYRLVSELDKKFCESRAVNVTVGVRNDGEMAGKHSVLLFVKPSKPVNGSPV 729
            +  ++S+ +RY  VSE+ K+ C+ R   V VGV+N GEMAG H VLLFV+ +K  NG P+
Sbjct: 1475 KMVENSNPVRYMPVSEIAKELCDKRKFPVKVGVQNHGEMAGTHPVLLFVRQAKVGNGRPM 1534

Query: 730  KQLVGFKRVEINAGGRSEIEFLLSPCEHVSKANEEGLMIIEEGSYSLVVGDVEHPLDIFV 786
            KQLVGF  V +NAG R EIEF LSPCEH+S+ANE+GLM+IEEG + L +GD E  + +F+
Sbjct: 1535 KQLVGFHSVNLNAGERVEIEFELSPCEHLSRANEDGLMVIEEGPHFLSIGDKESEITVFI 1593

BLAST of Cla022081 vs. TrEMBL
Match: A0A061FFD4_THECC (Glycosyl hydrolase family protein isoform 3 OS=Theobroma cacao GN=TCM_034945 PE=4 SV=1)

HSP 1 Score: 1068.9 bits (2763), Expect = 2.9e-309
Identity = 511/752 (67.95%), Positives = 616/752 (81.91%), Query Frame = 1

Query: 16  LLLSAAVFSALLS---LIVADSSSQLPYACDSSNSLTKTLPFCRTSLPIKQRARDLVSRL 75
           ++L    F +L+S   L +   S+Q P++CD S+  TK  PFC+T+LPI QRARDLVSRL
Sbjct: 1   MMLQGLSFVSLISFTLLFIHAGSTQPPFSCDPSDPSTKNYPFCQTTLPISQRARDLVSRL 60

Query: 76  TLDEKVLQLVNAAPAIPRLGIPAYEWWSEALHGVAHVGYGIRLNGTISAATSFPQVILTA 135
           TLDEK+ QLVN+APAIPRLGIPAYEWWSEALHGVA+VG GI+ +G+I AATSFPQVILTA
Sbjct: 61  TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVANVGPGIKFDGSIKAATSFPQVILTA 120

Query: 136 ASFDENLWYQIGQAIGTEARAVYNAGQAKGMTFWAPNINIFRDPRWGRGQETPGEDPLMT 195
           ASFD   WY+IGQ IG EARA+YNAGQA+GMTFWAPNINIFRDPRWGRGQETPGEDPL+T
Sbjct: 121 ASFDAYQWYRIGQVIGREARAIYNAGQARGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 180

Query: 196 AKYSVAYVRGIQGDAIEGGKLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMA 255
            KY+V+YVRG+QGD  +GGKL   L+ASACCKHFTAYDLD W G+ R+VFDA+VT+QD+A
Sbjct: 181 GKYAVSYVRGVQGDIFQGGKLNGHLQASACCKHFTAYDLDNWKGVNRFVFDARVTVQDLA 240

Query: 256 DTYQPPFESCVEKGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVS 315
           DTYQPPF+SCV+ G+ASGIMCAYNRVNGVPSCAD +LL+ T R +W F GYITSDCDAV+
Sbjct: 241 DTYQPPFKSCVQDGRASGIMCAYNRVNGVPSCADSNLLSKTLRGEWDFKGYITSDCDAVA 300

Query: 316 IIHDAQGYAKIPEDAVADVLRAGMDINCGTYLKEHAKSAVEMKKVPIPHIDRALHNLFAV 375
           IIH+ QGYAK PEDAV DVL+AGMD+NCG+YL++++KSAV  KK+P   IDRALHNLFAV
Sbjct: 301 IIHNDQGYAKSPEDAVVDVLKAGMDLNCGSYLQKYSKSAVLQKKLPESEIDRALHNLFAV 360

Query: 376 RMRLGLFDGNPTKLPFGQIGPDQVCSKQHQDLALQAAREGIVLLKNSAKLLPLSKSSIRS 435
           RMRLGLF+GNP + PFG IG DQVCS +HQ LAL+AAR GIVLLKN  KLLPL K+++ S
Sbjct: 361 RMRLGLFNGNPAQHPFGNIGTDQVCSPEHQILALEAARNGIVLLKNEEKLLPLPKATV-S 420

Query: 436 LAVIGHNGDEPKTLRGNYAGIPCKSVTPFQGLNSYVKNTVYHRGCNWANCTEATIDQAVQ 495
           LAVIG N + P+TL GNYAG PCKSVTP Q L SYVKNTVYH GC+  +C+   ID+AV 
Sbjct: 421 LAVIGPNANSPQTLLGNYAGPPCKSVTPLQALQSYVKNTVYHPGCDTVSCSTGVIDKAVD 480

Query: 496 IAKSVDYVVLVMGLDQTQEREDFDRTELGLPGNQEALIAEVAKAAKRPVILVILSGGPVD 555
           IAK  DYVVL+MGLDQTQE+E+ DR +L LPG Q+ LI  VAKAAKRPV+LV+LSGGP+D
Sbjct: 481 IAKQADYVVLIMGLDQTQEKEELDRVDLLLPGRQQELITSVAKAAKRPVVLVLLSGGPID 540

Query: 556 ISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIKFPMTDMRM 615
           +S AK + +IG I WAGYPG+ GG A+AEI+FGDHNPGGRLP+TWYP +F K PMTDMRM
Sbjct: 541 VSFAKDDPRIGGIFWAGYPGEGGGIALAEIVFGDHNPGGRLPVTWYPQEFTKVPMTDMRM 600

Query: 616 RADPSTGYPGRTYRFYNGPKVYEFGYGLSYSNYLYEFTSVTESKLHLSNPTASQPAKSSD 675
           R + S+ YPGRTYRFY G KV+EFGYGLSYS Y YEFT V+++ ++L++ ++     +SD
Sbjct: 601 RPESSSEYPGRTYRFYKGDKVFEFGYGLSYSKYSYEFTRVSQNNVYLNHSSSFHTTVTSD 660

Query: 676 SIRYRLVSELDKKFCESRAVNVTVGVRNDGEMAGKHSVLLFVKPSKPVNGSPVKQLVGFK 735
           S+RY+LVSEL  + C+ R   V VGV+N GEMAGKH VLLF +     +G P KQLVGF+
Sbjct: 661 SVRYKLVSELGAEVCDQRKFTVCVGVKNHGEMAGKHPVLLFARHGNHGDGRPKKQLVGFQ 720

Query: 736 RVEINAGGRSEIEFLLSPCEHVSKANEEGLMI 765
            V ++AG  +EI+F +SPCEH+S+ANE GLM+
Sbjct: 721 SVILSAGEMAEIQFEVSPCEHLSRANEYGLML 751


HSP 2 Score: 1123.6 bits (2905), Expect = 0.0e+00
Identity = 537/780 (68.85%), Positives = 651/780 (83.46%), Query Frame = 1

Query: 10   KMKLQKL-LLSAAVFSALLSLIVADSSSQLPYACDSSNSLTKTLPFCRTSLPIKQRARDL 69
            KMKLQKL LL+    S+LL L++ADS+ Q P++CD+S+  TK+ PFC+T+LPI QR +DL
Sbjct: 815  KMKLQKLSLLTLIHISSLLLLVLADST-QPPFSCDTSDPRTKSYPFCKTTLPINQRVQDL 874

Query: 70   VSRLTLDEKVLQLVNAAPAIPRLGIPAYEWWSEALHGVAH---VGYGIRLNGTISAATSF 129
            +SRLTLDEK+ QLVN+AP IPRLGIP  EWWSEALHGVA    V  GIR NGTI +ATSF
Sbjct: 875  ISRLTLDEKISQLVNSAPPIPRLGIPGDEWWSEALHGVAFLASVSQGIRFNGTIQSATSF 934

Query: 130  PQVILTAASFDENLWYQIGQAIGTEARAVYNAGQAKGMTFWAPNINIFRDPRWGRGQETP 189
            PQVILTAASFD +LW++IGQA+G EAR +YNAGQA+GMTFWAPNINI+RDPRWGRGQETP
Sbjct: 935  PQVILTAASFDAHLWFRIGQAVGIEARGIYNAGQARGMTFWAPNINIYRDPRWGRGQETP 994

Query: 190  GEDPLMTAKYSVAYVRGIQGDAIEGGKLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAK 249
            GEDPL+T KY+V++VRGIQGD+ EGG LG  L+ SACCKHFTAYDLD W G+ R+VF+AK
Sbjct: 995  GEDPLVTGKYAVSFVRGIQGDSFEGGMLGEHLQVSACCKHFTAYDLDNWKGVNRFVFNAK 1054

Query: 250  VTMQDMADTYQPPFESCVEKGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYIT 309
            V++QD+ADTYQPPF+SC+++GKASGIMCAYNRVNGVP+CAD++LL+ TAR QW FNGYIT
Sbjct: 1055 VSLQDLADTYQPPFQSCIQQGKASGIMCAYNRVNGVPNCADYNLLSKTARGQWGFNGYIT 1114

Query: 310  SDCDAVSIIHDAQGYAKIPEDAVADVLRAGMDINCGTYLKEHAKSAVEMKKVPIPHIDRA 369
            SDCDAVSI+H+ QGYAK+PEDAVADVL+AGMD+NCG YLK + KSAV+ +K+P+  IDRA
Sbjct: 1115 SDCDAVSIMHEKQGYAKVPEDAVADVLKAGMDVNCGNYLKNYTKSAVKKRKLPMSEIDRA 1174

Query: 370  LHNLFAVRMRLGLFDGNPTKLPFGQIGPDQVCSKQHQDLALQAAREGIVLLKNSAKLLPL 429
            LHNLF+VRMRLGLF+GNPTK PFG IG DQVCS++HQ+LAL+AAR GIVLLKN+  LLPL
Sbjct: 1175 LHNLFSVRMRLGLFNGNPTKQPFGNIGSDQVCSQEHQNLALEAARNGIVLLKNTDSLLPL 1234

Query: 430  SKSSIRSLAVIGHNGDEPKTLRGNYAGIPCKSVTPFQGLNSYVKNTVYHRGCNWANCTEA 489
            SK+   SLAVIG N +  KTL GNYAG PCKS+TP Q L SY K+T YH GC+  NC+ A
Sbjct: 1235 SKTKTTSLAVIGPNANSAKTLVGNYAGPPCKSITPLQALQSYAKDTRYHPGCSAVNCSSA 1294

Query: 490  TIDQAVQIAKSVDYVVLVMGLDQTQEREDFDRTELGLPGNQEALIAEVAKAAKRPVILVI 549
              DQAV+IAK  D+VVLVMGLDQTQERED DR +L LP  Q+ LI+ +A+AAK PVILV+
Sbjct: 1295 LTDQAVKIAKGADHVVLVMGLDQTQEREDHDRVDLVLPAKQQNLISSIARAAKNPVILVL 1354

Query: 550  LSGGPVDISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIKF 609
            LSGGPVDI+ AKY++ IGSILWAGYPG+AGG A+AEIIFGDHNPGGRLP+TWYP  FIK 
Sbjct: 1355 LSGGPVDITFAKYDQHIGSILWAGYPGEAGGLALAEIIFGDHNPGGRLPVTWYPQSFIKV 1414

Query: 610  PMTDMRMRADPSTGYPGRTYRFYNGPKVYEFGYGLSYSNYLYEFTSVTESKLHLSNPTAS 669
            PMTDMRMR +PS+GYPGRTYRFY GPKV+EFGYGLSYS Y YEF  VT++K++L++ + +
Sbjct: 1415 PMTDMRMRPEPSSGYPGRTYRFYQGPKVFEFGYGLSYSKYSYEFLPVTQNKVYLNHQSCN 1474

Query: 670  QPAKSSDSIRYRLVSELDKKFCESRAVNVTVGVRNDGEMAGKHSVLLFVKPSKPVNGSPV 729
            +  ++S+ +RY  VSE+ K+ C+ R   V VGV+N GEMAG H VLLFV+ +K  NG P+
Sbjct: 1475 KMVENSNPVRYMPVSEIAKELCDKRKFPVKVGVQNHGEMAGTHPVLLFVRQAKVGNGRPM 1534

Query: 730  KQLVGFKRVEINAGGRSEIEFLLSPCEHVSKANEEGLMIIEEGSYSLVVGDVEHPLDIFV 786
            KQLVGF  V +NAG R EIEF LSPCEH+S+ANE+GLM+IEEG + L +GD E  + +F+
Sbjct: 1535 KQLVGFHSVNLNAGERVEIEFELSPCEHLSRANEDGLMVIEEGPHFLSIGDKESEITVFI 1593

BLAST of Cla022081 vs. TrEMBL
Match: A0A061FGK5_THECC (Glycosyl hydrolase family protein isoform 1 OS=Theobroma cacao GN=TCM_034945 PE=4 SV=1)

HSP 1 Score: 1068.9 bits (2763), Expect = 2.9e-309
Identity = 511/752 (67.95%), Positives = 616/752 (81.91%), Query Frame = 1

Query: 16  LLLSAAVFSALLS---LIVADSSSQLPYACDSSNSLTKTLPFCRTSLPIKQRARDLVSRL 75
           ++L    F +L+S   L +   S+Q P++CD S+  TK  PFC+T+LPI QRARDLVSRL
Sbjct: 1   MMLQGLSFVSLISFTLLFIHAGSTQPPFSCDPSDPSTKNYPFCQTTLPISQRARDLVSRL 60

Query: 76  TLDEKVLQLVNAAPAIPRLGIPAYEWWSEALHGVAHVGYGIRLNGTISAATSFPQVILTA 135
           TLDEK+ QLVN+APAIPRLGIPAYEWWSEALHGVA+VG GI+ +G+I AATSFPQVILTA
Sbjct: 61  TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVANVGPGIKFDGSIKAATSFPQVILTA 120

Query: 136 ASFDENLWYQIGQAIGTEARAVYNAGQAKGMTFWAPNINIFRDPRWGRGQETPGEDPLMT 195
           ASFD   WY+IGQ IG EARA+YNAGQA+GMTFWAPNINIFRDPRWGRGQETPGEDPL+T
Sbjct: 121 ASFDAYQWYRIGQVIGREARAIYNAGQARGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 180

Query: 196 AKYSVAYVRGIQGDAIEGGKLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMA 255
            KY+V+YVRG+QGD  +GGKL   L+ASACCKHFTAYDLD W G+ R+VFDA+VT+QD+A
Sbjct: 181 GKYAVSYVRGVQGDIFQGGKLNGHLQASACCKHFTAYDLDNWKGVNRFVFDARVTVQDLA 240

Query: 256 DTYQPPFESCVEKGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVS 315
           DTYQPPF+SCV+ G+ASGIMCAYNRVNGVPSCAD +LL+ T R +W F GYITSDCDAV+
Sbjct: 241 DTYQPPFKSCVQDGRASGIMCAYNRVNGVPSCADSNLLSKTLRGEWDFKGYITSDCDAVA 300

Query: 316 IIHDAQGYAKIPEDAVADVLRAGMDINCGTYLKEHAKSAVEMKKVPIPHIDRALHNLFAV 375
           IIH+ QGYAK PEDAV DVL+AGMD+NCG+YL++++KSAV  KK+P   IDRALHNLFAV
Sbjct: 301 IIHNDQGYAKSPEDAVVDVLKAGMDLNCGSYLQKYSKSAVLQKKLPESEIDRALHNLFAV 360

Query: 376 RMRLGLFDGNPTKLPFGQIGPDQVCSKQHQDLALQAAREGIVLLKNSAKLLPLSKSSIRS 435
           RMRLGLF+GNP + PFG IG DQVCS +HQ LAL+AAR GIVLLKN  KLLPL K+++ S
Sbjct: 361 RMRLGLFNGNPAQHPFGNIGTDQVCSPEHQILALEAARNGIVLLKNEEKLLPLPKATV-S 420

Query: 436 LAVIGHNGDEPKTLRGNYAGIPCKSVTPFQGLNSYVKNTVYHRGCNWANCTEATIDQAVQ 495
           LAVIG N + P+TL GNYAG PCKSVTP Q L SYVKNTVYH GC+  +C+   ID+AV 
Sbjct: 421 LAVIGPNANSPQTLLGNYAGPPCKSVTPLQALQSYVKNTVYHPGCDTVSCSTGVIDKAVD 480

Query: 496 IAKSVDYVVLVMGLDQTQEREDFDRTELGLPGNQEALIAEVAKAAKRPVILVILSGGPVD 555
           IAK  DYVVL+MGLDQTQE+E+ DR +L LPG Q+ LI  VAKAAKRPV+LV+LSGGP+D
Sbjct: 481 IAKQADYVVLIMGLDQTQEKEELDRVDLLLPGRQQELITSVAKAAKRPVVLVLLSGGPID 540

Query: 556 ISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIKFPMTDMRM 615
           +S AK + +IG I WAGYPG+ GG A+AEI+FGDHNPGGRLP+TWYP +F K PMTDMRM
Sbjct: 541 VSFAKDDPRIGGIFWAGYPGEGGGIALAEIVFGDHNPGGRLPVTWYPQEFTKVPMTDMRM 600

Query: 616 RADPSTGYPGRTYRFYNGPKVYEFGYGLSYSNYLYEFTSVTESKLHLSNPTASQPAKSSD 675
           R + S+ YPGRTYRFY G KV+EFGYGLSYS Y YEFT V+++ ++L++ ++     +SD
Sbjct: 601 RPESSSEYPGRTYRFYKGDKVFEFGYGLSYSKYSYEFTRVSQNNVYLNHSSSFHTTVTSD 660

Query: 676 SIRYRLVSELDKKFCESRAVNVTVGVRNDGEMAGKHSVLLFVKPSKPVNGSPVKQLVGFK 735
           S+RY+LVSEL  + C+ R   V VGV+N GEMAGKH VLLF +     +G P KQLVGF+
Sbjct: 661 SVRYKLVSELGAEVCDQRKFTVCVGVKNHGEMAGKHPVLLFARHGNHGDGRPKKQLVGFQ 720

Query: 736 RVEINAGGRSEIEFLLSPCEHVSKANEEGLMI 765
            V ++AG  +EI+F +SPCEH+S+ANE GLM+
Sbjct: 721 SVILSAGEMAEIQFEVSPCEHLSRANEYGLML 751


HSP 2 Score: 1114.4 bits (2881), Expect = 0.0e+00
Identity = 536/784 (68.37%), Positives = 650/784 (82.91%), Query Frame = 1

Query: 10   KMKLQKL-LLSAAVFSALLSLIVADSSSQLPYACDSSNSLTKTLPFCRTSLPIKQRARDL 69
            KMKLQKL LL+    S+LL L++ADS+ Q P++CD+S+  TK+ PFC+T+LPI QR +DL
Sbjct: 815  KMKLQKLSLLTLIHISSLLLLVLADST-QPPFSCDTSDPRTKSYPFCKTTLPINQRVQDL 874

Query: 70   VSRLTLDEKVLQLVNAAPAIPRLGIPAYEWWSEALHGVAH---VGYGIRLNGTISAATSF 129
            +SRLTLDEK+ QLVN+AP IPRLGIP  EWWSEALHGVA    V  GIR NGTI +ATSF
Sbjct: 875  ISRLTLDEKISQLVNSAPPIPRLGIPGDEWWSEALHGVAFLASVSQGIRFNGTIQSATSF 934

Query: 130  PQVILTAASFDENLWYQIG----QAIGTEARAVYNAGQAKGMTFWAPNINIFRDPRWGRG 189
            PQVILTAASFD +LW++I     QA+G EAR +YNAGQA+GMTFWAPNINI+RDPRWGRG
Sbjct: 935  PQVILTAASFDAHLWFRIVYDYIQAVGIEARGIYNAGQARGMTFWAPNINIYRDPRWGRG 994

Query: 190  QETPGEDPLMTAKYSVAYVRGIQGDAIEGGKLGNQLKASACCKHFTAYDLDRWNGMTRYV 249
            QETPGEDPL+T KY+V++VRGIQGD+ EGG LG  L+ SACCKHFTAYDLD W G+ R+V
Sbjct: 995  QETPGEDPLVTGKYAVSFVRGIQGDSFEGGMLGEHLQVSACCKHFTAYDLDNWKGVNRFV 1054

Query: 250  FDAKVTMQDMADTYQPPFESCVEKGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFN 309
            F+AKV++QD+ADTYQPPF+SC+++GKASGIMCAYNRVNGVP+CAD++LL+ TAR QW FN
Sbjct: 1055 FNAKVSLQDLADTYQPPFQSCIQQGKASGIMCAYNRVNGVPNCADYNLLSKTARGQWGFN 1114

Query: 310  GYITSDCDAVSIIHDAQGYAKIPEDAVADVLRAGMDINCGTYLKEHAKSAVEMKKVPIPH 369
            GYITSDCDAVSI+H+ QGYAK+PEDAVADVL+AGMD+NCG YLK + KSAV+ +K+P+  
Sbjct: 1115 GYITSDCDAVSIMHEKQGYAKVPEDAVADVLKAGMDVNCGNYLKNYTKSAVKKRKLPMSE 1174

Query: 370  IDRALHNLFAVRMRLGLFDGNPTKLPFGQIGPDQVCSKQHQDLALQAAREGIVLLKNSAK 429
            IDRALHNLF+VRMRLGLF+GNPTK PFG IG DQVCS++HQ+LAL+AAR GIVLLKN+  
Sbjct: 1175 IDRALHNLFSVRMRLGLFNGNPTKQPFGNIGSDQVCSQEHQNLALEAARNGIVLLKNTDS 1234

Query: 430  LLPLSKSSIRSLAVIGHNGDEPKTLRGNYAGIPCKSVTPFQGLNSYVKNTVYHRGCNWAN 489
            LLPLSK+   SLAVIG N +  KTL GNYAG PCKS+TP Q L SY K+T YH GC+  N
Sbjct: 1235 LLPLSKTKTTSLAVIGPNANSAKTLVGNYAGPPCKSITPLQALQSYAKDTRYHPGCSAVN 1294

Query: 490  CTEATIDQAVQIAKSVDYVVLVMGLDQTQEREDFDRTELGLPGNQEALIAEVAKAAKRPV 549
            C+ A  DQAV+IAK  D+VVLVMGLDQTQERED DR +L LP  Q+ LI+ +A+AAK PV
Sbjct: 1295 CSSALTDQAVKIAKGADHVVLVMGLDQTQEREDHDRVDLVLPAKQQNLISSIARAAKNPV 1354

Query: 550  ILVILSGGPVDISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHD 609
            ILV+LSGGPVDI+ AKY++ IGSILWAGYPG+AGG A+AEIIFGDHNPGGRLP+TWYP  
Sbjct: 1355 ILVLLSGGPVDITFAKYDQHIGSILWAGYPGEAGGLALAEIIFGDHNPGGRLPVTWYPQS 1414

Query: 610  FIKFPMTDMRMRADPSTGYPGRTYRFYNGPKVYEFGYGLSYSNYLYEFTSVTESKLHLSN 669
            FIK PMTDMRMR +PS+GYPGRTYRFY GPKV+EFGYGLSYS Y YEF  VT++K++L++
Sbjct: 1415 FIKVPMTDMRMRPEPSSGYPGRTYRFYQGPKVFEFGYGLSYSKYSYEFLPVTQNKVYLNH 1474

Query: 670  PTASQPAKSSDSIRYRLVSELDKKFCESRAVNVTVGVRNDGEMAGKHSVLLFVKPSKPVN 729
             + ++  ++S+ +RY  VSE+ K+ C+ R   V VGV+N GEMAG H VLLFV+ +K  N
Sbjct: 1475 QSCNKMVENSNPVRYMPVSEIAKELCDKRKFPVKVGVQNHGEMAGTHPVLLFVRQAKVGN 1534

Query: 730  GSPVKQLVGFKRVEINAGGRSEIEFLLSPCEHVSKANEEGLMIIEEGSYSLVVGDVEHPL 786
            G P+KQLVGF  V +NAG R EIEF LSPCEH+S+ANE+GLM+IEEG + L +GD E  +
Sbjct: 1535 GRPMKQLVGFHSVNLNAGERVEIEFELSPCEHLSRANEDGLMVIEEGPHFLSIGDKESEI 1594

BLAST of Cla022081 vs. TrEMBL
Match: A0A061FNE1_THECC (Glycosyl hydrolase family protein isoform 2 OS=Theobroma cacao GN=TCM_034945 PE=4 SV=1)

HSP 1 Score: 1068.9 bits (2763), Expect = 2.9e-309
Identity = 511/752 (67.95%), Positives = 616/752 (81.91%), Query Frame = 1

Query: 16  LLLSAAVFSALLS---LIVADSSSQLPYACDSSNSLTKTLPFCRTSLPIKQRARDLVSRL 75
           ++L    F +L+S   L +   S+Q P++CD S+  TK  PFC+T+LPI QRARDLVSRL
Sbjct: 1   MMLQGLSFVSLISFTLLFIHAGSTQPPFSCDPSDPSTKNYPFCQTTLPISQRARDLVSRL 60

Query: 76  TLDEKVLQLVNAAPAIPRLGIPAYEWWSEALHGVAHVGYGIRLNGTISAATSFPQVILTA 135
           TLDEK+ QLVN+APAIPRLGIPAYEWWSEALHGVA+VG GI+ +G+I AATSFPQVILTA
Sbjct: 61  TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVANVGPGIKFDGSIKAATSFPQVILTA 120

Query: 136 ASFDENLWYQIGQAIGTEARAVYNAGQAKGMTFWAPNINIFRDPRWGRGQETPGEDPLMT 195
           ASFD   WY+IGQ IG EARA+YNAGQA+GMTFWAPNINIFRDPRWGRGQETPGEDPL+T
Sbjct: 121 ASFDAYQWYRIGQVIGREARAIYNAGQARGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 180

Query: 196 AKYSVAYVRGIQGDAIEGGKLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMA 255
            KY+V+YVRG+QGD  +GGKL   L+ASACCKHFTAYDLD W G+ R+VFDA+VT+QD+A
Sbjct: 181 GKYAVSYVRGVQGDIFQGGKLNGHLQASACCKHFTAYDLDNWKGVNRFVFDARVTVQDLA 240

Query: 256 DTYQPPFESCVEKGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVS 315
           DTYQPPF+SCV+ G+ASGIMCAYNRVNGVPSCAD +LL+ T R +W F GYITSDCDAV+
Sbjct: 241 DTYQPPFKSCVQDGRASGIMCAYNRVNGVPSCADSNLLSKTLRGEWDFKGYITSDCDAVA 300

Query: 316 IIHDAQGYAKIPEDAVADVLRAGMDINCGTYLKEHAKSAVEMKKVPIPHIDRALHNLFAV 375
           IIH+ QGYAK PEDAV DVL+AGMD+NCG+YL++++KSAV  KK+P   IDRALHNLFAV
Sbjct: 301 IIHNDQGYAKSPEDAVVDVLKAGMDLNCGSYLQKYSKSAVLQKKLPESEIDRALHNLFAV 360

Query: 376 RMRLGLFDGNPTKLPFGQIGPDQVCSKQHQDLALQAAREGIVLLKNSAKLLPLSKSSIRS 435
           RMRLGLF+GNP + PFG IG DQVCS +HQ LAL+AAR GIVLLKN  KLLPL K+++ S
Sbjct: 361 RMRLGLFNGNPAQHPFGNIGTDQVCSPEHQILALEAARNGIVLLKNEEKLLPLPKATV-S 420

Query: 436 LAVIGHNGDEPKTLRGNYAGIPCKSVTPFQGLNSYVKNTVYHRGCNWANCTEATIDQAVQ 495
           LAVIG N + P+TL GNYAG PCKSVTP Q L SYVKNTVYH GC+  +C+   ID+AV 
Sbjct: 421 LAVIGPNANSPQTLLGNYAGPPCKSVTPLQALQSYVKNTVYHPGCDTVSCSTGVIDKAVD 480

Query: 496 IAKSVDYVVLVMGLDQTQEREDFDRTELGLPGNQEALIAEVAKAAKRPVILVILSGGPVD 555
           IAK  DYVVL+MGLDQTQE+E+ DR +L LPG Q+ LI  VAKAAKRPV+LV+LSGGP+D
Sbjct: 481 IAKQADYVVLIMGLDQTQEKEELDRVDLLLPGRQQELITSVAKAAKRPVVLVLLSGGPID 540

Query: 556 ISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIKFPMTDMRM 615
           +S AK + +IG I WAGYPG+ GG A+AEI+FGDHNPGGRLP+TWYP +F K PMTDMRM
Sbjct: 541 VSFAKDDPRIGGIFWAGYPGEGGGIALAEIVFGDHNPGGRLPVTWYPQEFTKVPMTDMRM 600

Query: 616 RADPSTGYPGRTYRFYNGPKVYEFGYGLSYSNYLYEFTSVTESKLHLSNPTASQPAKSSD 675
           R + S+ YPGRTYRFY G KV+EFGYGLSYS Y YEFT V+++ ++L++ ++     +SD
Sbjct: 601 RPESSSEYPGRTYRFYKGDKVFEFGYGLSYSKYSYEFTRVSQNNVYLNHSSSFHTTVTSD 660

Query: 676 SIRYRLVSELDKKFCESRAVNVTVGVRNDGEMAGKHSVLLFVKPSKPVNGSPVKQLVGFK 735
           S+RY+LVSEL  + C+ R   V VGV+N GEMAGKH VLLF +     +G P KQLVGF+
Sbjct: 661 SVRYKLVSELGAEVCDQRKFTVCVGVKNHGEMAGKHPVLLFARHGNHGDGRPKKQLVGFQ 720

Query: 736 RVEINAGGRSEIEFLLSPCEHVSKANEEGLMI 765
            V ++AG  +EI+F +SPCEH+S+ANE GLM+
Sbjct: 721 SVILSAGEMAEIQFEVSPCEHLSRANEYGLML 751


HSP 2 Score: 1107.4 bits (2863), Expect = 0.0e+00
Identity = 534/775 (68.90%), Positives = 637/775 (82.19%), Query Frame = 1

Query: 11  MKLQKLLLSAAVFSALLSLIVADSSSQLPYACDSSNSLTKTLPFCRTSLPIKQRARDLVS 70
           M++Q+L        ALL L VA  S+Q P++CD SN  T +  FC+T+LPI QR RDLVS
Sbjct: 1   MRVQQLSYFTFTIFALLILRVA--STQPPFSCDPSNPSTGSYLFCKTTLPISQRVRDLVS 60

Query: 71  RLTLDEKVLQLVNAAPAIPRLGIPAYEWWSEALHGVAHVGYGIRLNGTISAATSFPQVIL 130
           RLTLDEK+ QLV++APAIPRLGIPAYEWWSEALHGVA+VG GI   G+I +ATSFPQVIL
Sbjct: 61  RLTLDEKISQLVSSAPAIPRLGIPAYEWWSEALHGVANVGRGIHFQGSIQSATSFPQVIL 120

Query: 131 TAASFDENLWYQIGQAIGTEARAVYNAGQAKGMTFWAPNINIFRDPRWGRGQETPGEDPL 190
           TAASFD   WY+IGQ IG EARAVYNAGQA GMTFWAPNINIFRDPRWGRGQETPGEDPL
Sbjct: 121 TAASFDAYQWYRIGQVIGREARAVYNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPL 180

Query: 191 MTAKYSVAYVRGIQGDAIEGGKLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQD 250
           +T KY+V+YVRGIQGD+ +GGKL   L+ASACCKHFTAYDLD W G+ R+VFDA+VTMQD
Sbjct: 181 VTGKYAVSYVRGIQGDSFQGGKLEGHLQASACCKHFTAYDLDNWKGVNRFVFDARVTMQD 240

Query: 251 MADTYQPPFESCVEKGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDA 310
           +ADTYQPPF+SCV++GKASGIMCAYNRVNGVPSCAD++LL+ TAR QW F+GYITSDCDA
Sbjct: 241 LADTYQPPFQSCVQQGKASGIMCAYNRVNGVPSCADYNLLSKTARGQWGFHGYITSDCDA 300

Query: 311 VSIIHDAQGYAKIPEDAVADVLRAGMDINCGTYLKEHAKSAVEMKKVPIPHIDRALHNLF 370
           VSII++ QGYAK PEDAV DVL+AGMD+NCG+YL++H K+AV+ KK+P   IDRALHNLF
Sbjct: 301 VSIIYNNQGYAKSPEDAVVDVLKAGMDVNCGSYLQKHTKAAVQQKKLPESAIDRALHNLF 360

Query: 371 AVRMRLGLFDGNPTKLPFGQIGPDQVCSKQHQDLALQAAREGIVLLKNSAKLLPLSKSSI 430
           +VRMRLGLF+GNP + PF  IGPDQVCS++HQ LAL+AAR GIVLLKNSA+LLPLSKS  
Sbjct: 361 SVRMRLGLFNGNPMEQPFSNIGPDQVCSQEHQMLALEAARNGIVLLKNSARLLPLSKSKT 420

Query: 431 RSLAVIGHNGDEPKTLRGNYAGIPCKSVTPFQGLNSYVKNTVYHRGCNWANCTEATIDQA 490
            SLAVIG N D  +TL GNYAG PCKSVTP Q L  Y+KNT+Y  GC+   CT A+ID+A
Sbjct: 421 ISLAVIGPNADSAQTLLGNYAGPPCKSVTPLQALQYYIKNTIYDPGCDTVQCTSASIDKA 480

Query: 491 VQIAKSVDYVVLVMGLDQTQEREDFDRTELGLPGNQEALIAEVAKAAKRPVILVILSGGP 550
           V ++K VD+VVL+MGLDQTQERE+ DRT+L LPG Q+ LI  VAK+AK P+ILV+LSGGP
Sbjct: 481 VNVSKGVDHVVLIMGLDQTQEREELDRTDLVLPGKQQELITNVAKSAKNPIILVLLSGGP 540

Query: 551 VDISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIKFPMTDM 610
           +D+S AKY++ IGSILWAGYPG+AGGTA+AEIIFGDHNPGGRLP+TWYP +F+K PMTDM
Sbjct: 541 IDVSFAKYDKNIGSILWAGYPGEAGGTALAEIIFGDHNPGGRLPMTWYPQEFVKVPMTDM 600

Query: 611 RMRADPSTGYPGRTYRFYNGPKVYEFGYGLSYSNYLYEFTSVTESKLHLSNPTASQPAKS 670
           RMR D S+GYPGRTYRFY G  V+ FGYGLSYS Y Y   SV+++KL+L+  +  +    
Sbjct: 601 RMRPDSSSGYPGRTYRFYKGRNVFNFGYGLSYSKYSYVLKSVSQNKLYLNQSSTMRIIGD 660

Query: 671 SDSIRYRLVSELDKKFCESRAVNVTVGVRNDGEMAGKHSVLLFVKPSKPVNGSPVKQLVG 730
           SDS+R  +VS++  +FCE     V VGV N GEMAGKH +LLFV+ +K  NG P KQL+G
Sbjct: 661 SDSVRTAVVSDMRTEFCEQSKFLVRVGVENQGEMAGKHPILLFVRHAKHGNGRPRKQLIG 720

Query: 731 FKRVEINAGGRSEIEFLLSPCEHVSKANEEGLMIIEEGSYSLVVGDVEHPLDIFV 786
           FK V ++AG ++EIEF LSPCEH S+ANE+GLM+IEEG + LVVG  +HP+ I V
Sbjct: 721 FKSVILSAGEKAEIEFELSPCEHFSRANEDGLMVIEEGRHFLVVGGDKHPISIIV 773

BLAST of Cla022081 vs. NCBI nr
Match: gi|449465962|ref|XP_004150696.1| (PREDICTED: probable beta-D-xylosidase 7 [Cucumis sativus])

HSP 1 Score: 1449.1 bits (3750), Expect = 0.0e+00
Identity = 711/780 (91.15%), Positives = 743/780 (95.26%), Query Frame = 1

Query: 6   FFPRKMKLQKLLLSAAVFSALLSLIVADSSSQLPYACDSSNSLTKTLPFCRTSLPIKQRA 65
           FFP K+KL  LLLSAA     LSLIVA SSSQ PYACDSSN LTKTLPFC+T LPIK RA
Sbjct: 8   FFPHKIKLLTLLLSAA----FLSLIVAGSSSQPPYACDSSNPLTKTLPFCKTYLPIKLRA 67

Query: 66  RDLVSRLTLDEKVLQLVNAAPAIPRLGIPAYEWWSEALHGVAHVGYGIRLNGTISAATSF 125
           RDLVSRLTLDEKVLQLVN  P IPRLGIPAYEWWSEALHGVA+VGYGIRLNGTI+AATSF
Sbjct: 68  RDLVSRLTLDEKVLQLVNTVPPIPRLGIPAYEWWSEALHGVANVGYGIRLNGTITAATSF 127

Query: 126 PQVILTAASFDENLWYQIGQAIGTEARAVYNAGQAKGMTFWAPNINIFRDPRWGRGQETP 185
           PQVILTAASFDENLWYQIGQAIGTEARAVYNAGQAKGMTFW PNINIFRDPRWGRGQETP
Sbjct: 128 PQVILTAASFDENLWYQIGQAIGTEARAVYNAGQAKGMTFWTPNINIFRDPRWGRGQETP 187

Query: 186 GEDPLMTAKYSVAYVRGIQGDAIEGGKLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAK 245
           GEDPLMT KYSVAYVRGIQGDAIEGGKLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAK
Sbjct: 188 GEDPLMTGKYSVAYVRGIQGDAIEGGKLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAK 247

Query: 246 VTMQDMADTYQPPFESCVEKGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYIT 305
           VTMQDMADTYQPPFESCVE+GKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYIT
Sbjct: 248 VTMQDMADTYQPPFESCVEEGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYIT 307

Query: 306 SDCDAVSIIHDAQGYAKIPEDAVADVLRAGMDINCGTYLKEHAKSAVEMKKVPIPHIDRA 365
           SDCDAVSIIHDAQGYAKIPEDAVADVLRAGMD+NCGTYLKEH KSAVEMKKVP+ HIDRA
Sbjct: 308 SDCDAVSIIHDAQGYAKIPEDAVADVLRAGMDVNCGTYLKEHTKSAVEMKKVPMLHIDRA 367

Query: 366 LHNLFAVRMRLGLFDGNPTKLPFGQIGPDQVCSKQHQDLALQAAREGIVLLKNSAKLLPL 425
           L NLF+VRMRLGLFDGNPTKLPFGQIG DQVCS+QHQ+LALQAAREGIVLLKNSAKLLPL
Sbjct: 368 LRNLFSVRMRLGLFDGNPTKLPFGQIGRDQVCSQQHQNLALQAAREGIVLLKNSAKLLPL 427

Query: 426 SKSSIRSLAVIGHNGDEPKTLRGNYAGIPCKSVTPFQGLNSYVKNTVYHRGCNWANCTEA 485
           SKS+  SLAVIGHNG++PKTLRGNYAGIPCKS TPFQGLN+YVKNTVYHRGCN+ANCTEA
Sbjct: 428 SKSNTHSLAVIGHNGNDPKTLRGNYAGIPCKSATPFQGLNNYVKNTVYHRGCNYANCTEA 487

Query: 486 TIDQAVQIAKSVDYVVLVMGLDQTQEREDFDRTELGLPGNQEALIAEVAKAAKRPVILVI 545
           TI QAV+IAKSVDYVVLVMGLDQTQEREDFDRTELGLPG Q+ LIAEVAKAAKRPVILVI
Sbjct: 488 TIYQAVKIAKSVDYVVLVMGLDQTQEREDFDRTELGLPGKQDKLIAEVAKAAKRPVILVI 547

Query: 546 LSGGPVDISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIKF 605
           LSGGPVDISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIKF
Sbjct: 548 LSGGPVDISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIKF 607

Query: 606 PMTDMRMRADPSTGYPGRTYRFYNGPKVYEFGYGLSYSNYLYEFTSVTESKLHLSNPTAS 665
           PMTDMRMRAD STGYPGRTYRFYNGPKVYEFGYGLSYSN++YEFTSV+ESKL LS+P AS
Sbjct: 608 PMTDMRMRADSSTGYPGRTYRFYNGPKVYEFGYGLSYSNHIYEFTSVSESKLLLSHPKAS 667

Query: 666 QPAKSSDSIRYRLVSELDKKFCESRAVNVTVGVRNDGEMAGKHSVLLFVKPSKPVNGSPV 725
           QPAK+SD + YRLVSELDKKFCES+ VNVTVGVRN+GEM GKHSVLLF+KPSKP+NGSPV
Sbjct: 668 QPAKNSDLVSYRLVSELDKKFCESKTVNVTVGVRNEGEMGGKHSVLLFIKPSKPINGSPV 727

Query: 726 KQLVGFKRVEINAGGRSEIEFLLSPCEHVSKANEEGLMIIEEGSYSLVVGDVEHPLDIFV 785
           KQLVGFK+VEINAG R EIEFL+SPC+H+SKA+EEGLMIIEEGSYSLVVGDVEHPLDIFV
Sbjct: 728 KQLVGFKKVEINAGERREIEFLVSPCDHISKASEEGLMIIEEGSYSLVVGDVEHPLDIFV 783

BLAST of Cla022081 vs. NCBI nr
Match: gi|659089146|ref|XP_008445351.1| (PREDICTED: probable beta-D-xylosidase 7 [Cucumis melo])

HSP 1 Score: 1445.3 bits (3740), Expect = 0.0e+00
Identity = 712/780 (91.28%), Positives = 741/780 (95.00%), Query Frame = 1

Query: 6   FFPRKMKLQKLLLSAAVFSALLSLIVADSSSQLPYACDSSNSLTKTLPFCRTSLPIKQRA 65
           FFP K+KLQ LLLSAA     LSLIVA SSSQ PYACDSSN LTKTLPFCRTSLPIK RA
Sbjct: 8   FFPHKIKLQTLLLSAA----FLSLIVAGSSSQPPYACDSSNPLTKTLPFCRTSLPIKLRA 67

Query: 66  RDLVSRLTLDEKVLQLVNAAPAIPRLGIPAYEWWSEALHGVAHVGYGIRLNGTISAATSF 125
           RDLVSRLTLDEKVLQLVN APAIPRLGIPAYEWWSEALHGVA VGYGIRLNGTI AATSF
Sbjct: 68  RDLVSRLTLDEKVLQLVNTAPAIPRLGIPAYEWWSEALHGVADVGYGIRLNGTIPAATSF 127

Query: 126 PQVILTAASFDENLWYQIGQAIGTEARAVYNAGQAKGMTFWAPNINIFRDPRWGRGQETP 185
           PQVILTAASFDENLWYQIGQAIGTEARAVYNAGQAKGMTFW PNINIFRDPRWGRGQETP
Sbjct: 128 PQVILTAASFDENLWYQIGQAIGTEARAVYNAGQAKGMTFWTPNINIFRDPRWGRGQETP 187

Query: 186 GEDPLMTAKYSVAYVRGIQGDAIEGGKLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAK 245
           GEDPLMT KYSVAYVRGIQGDAIEGGKLGN+LKASACCKHFTAYDLDRWNGMTRYVFDAK
Sbjct: 188 GEDPLMTGKYSVAYVRGIQGDAIEGGKLGNELKASACCKHFTAYDLDRWNGMTRYVFDAK 247

Query: 246 VTMQDMADTYQPPFESCVEKGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYIT 305
           VTMQDMADTYQPPFESCVEKGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYIT
Sbjct: 248 VTMQDMADTYQPPFESCVEKGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYIT 307

Query: 306 SDCDAVSIIHDAQGYAKIPEDAVADVLRAGMDINCGTYLKEHAKSAVEMKKVPIPHIDRA 365
           SDCDAVSIIHDAQ YAK PEDAVADVLRAGMD+NCGTYLKEH KSAVEM KV I +IDRA
Sbjct: 308 SDCDAVSIIHDAQDYAKSPEDAVADVLRAGMDVNCGTYLKEHTKSAVEMNKVSISYIDRA 367

Query: 366 LHNLFAVRMRLGLFDGNPTKLPFGQIGPDQVCSKQHQDLALQAAREGIVLLKNSAKLLPL 425
           L NLF VRMRLGLFDGNPTKLPFGQIGPDQVCS+QHQ+LALQAAREGIVLLKNSAKLLPL
Sbjct: 368 LRNLFTVRMRLGLFDGNPTKLPFGQIGPDQVCSRQHQNLALQAAREGIVLLKNSAKLLPL 427

Query: 426 SKSSIRSLAVIGHNGDEPKTLRGNYAGIPCKSVTPFQGLNSYVKNTVYHRGCNWANCTEA 485
           SKS+  SLAVIGHNG++PKTLRGNYAGIPCKSVTPFQGLNSY+KNT+YHRGCN+ANCTEA
Sbjct: 428 SKSNTYSLAVIGHNGNDPKTLRGNYAGIPCKSVTPFQGLNSYIKNTLYHRGCNYANCTEA 487

Query: 486 TIDQAVQIAKSVDYVVLVMGLDQTQEREDFDRTELGLPGNQEALIAEVAKAAKRPVILVI 545
           TI QAV+IAKSVDYVVLVMGLDQTQEREDFDR ELGLPG Q+ LIA+VA+AAKRPVILVI
Sbjct: 488 TIYQAVKIAKSVDYVVLVMGLDQTQEREDFDRMELGLPGKQDELIAKVAEAAKRPVILVI 547

Query: 546 LSGGPVDISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIKF 605
           LSGGPVDISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYP DFIKF
Sbjct: 548 LSGGPVDISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPRDFIKF 607

Query: 606 PMTDMRMRADPSTGYPGRTYRFYNGPKVYEFGYGLSYSNYLYEFTSVTESKLHLSNPTAS 665
           PMTDMRMRAD STGYPGRTYRFYNGPKVYEFGYGLSYSN++YEFTSV+ESKL LS+PTAS
Sbjct: 608 PMTDMRMRADSSTGYPGRTYRFYNGPKVYEFGYGLSYSNHIYEFTSVSESKLLLSHPTAS 667

Query: 666 QPAKSSDSIRYRLVSELDKKFCESRAVNVTVGVRNDGEMAGKHSVLLFVKPSKPVNGSPV 725
           QPAK+SD + YRLVSELDKKFCES+ VNVTVGVRN+GEM  KHSVLLFVKPSKP+NGSPV
Sbjct: 668 QPAKNSDLVSYRLVSELDKKFCESKTVNVTVGVRNEGEMGSKHSVLLFVKPSKPINGSPV 727

Query: 726 KQLVGFKRVEINAGGRSEIEFLLSPCEHVSKANEEGLMIIEEGSYSLVVGDVEHPLDIFV 785
           KQLVGFKRVEINAG RSEIEFL+SPC+HVSKA+EEG+MIIEEGSYSLVVGDVEHPLDIFV
Sbjct: 728 KQLVGFKRVEINAGERSEIEFLVSPCDHVSKASEEGVMIIEEGSYSLVVGDVEHPLDIFV 783

BLAST of Cla022081 vs. NCBI nr
Match: gi|470130855|ref|XP_004301317.1| (PREDICTED: probable beta-D-xylosidase 7 [Fragaria vesca subsp. vesca])

HSP 1 Score: 1137.9 bits (2942), Expect = 0.0e+00
Identity = 549/778 (70.57%), Positives = 650/778 (83.55%), Query Frame = 1

Query: 11  MKLQKLLLSAAVFSALLSLIVADSSSQLPYACDSSNSLTKTLPFCRTSLPIKQRARDLVS 70
           MKL  L+L   +F +  +LI    S+Q PY+CDSSN  T++  FC+T+LPI QR  DLVS
Sbjct: 1   MKLPALILIPLIFFS--TLIFLTESTQPPYSCDSSNPSTESFLFCKTTLPINQRVHDLVS 60

Query: 71  RLTLDEKVLQLVNAAPAIPRLGIPAYEWWSEALHGVAHVGYGIRLNGTISAATSFPQVIL 130
           RLTLDEK+ QLVN+AP IPRLGIP+YEWWSEALHGVA VG GIRL  TI++ATSFPQVIL
Sbjct: 61  RLTLDEKISQLVNSAPPIPRLGIPSYEWWSEALHGVADVGKGIRLYSTINSATSFPQVIL 120

Query: 131 TAASFDENLWYQIGQAIGTEARAVYNAGQAKGMTFWAPNINIFRDPRWGRGQETPGEDPL 190
           TAASF+E+LWY+IGQ IG EARAVYNAGQA GMTFWAPNINIFRDPRWGRGQETPGEDPL
Sbjct: 121 TAASFNEHLWYRIGQVIGIEARAVYNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPL 180

Query: 191 MTAKYSVAYVRGIQGDAIEGGKL--GNQLKASACCKHFTAYDLDRWNGMTRYVFDAKVTM 250
           MTAKYSVAYVRG+QGD+ EGGKL  G  L+ASACCKHFTAYDLD WN +TR+ F+AKVT 
Sbjct: 181 MTAKYSVAYVRGVQGDSYEGGKLKVGGHLQASACCKHFTAYDLDNWNNVTRFGFNAKVTQ 240

Query: 251 QDMADTYQPPFESCVEKGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDC 310
           QD+ADTYQPPF+SCVE+GKASGIMCAYN+VNGVPSCADH+LLT TAR +W F+GYITSDC
Sbjct: 241 QDLADTYQPPFKSCVEQGKASGIMCAYNQVNGVPSCADHNLLTKTARGEWGFHGYITSDC 300

Query: 311 DAVSIIHDAQGYAKIPEDAVADVLRAGMDINCGTYLKEHAKSAVEMKKVPIPHIDRALHN 370
           DAVSII+D QGYAK PEDAV DVL+AGMD+NCGTYL+ H K+AV+ KK+P+ +ID+ALHN
Sbjct: 301 DAVSIIYDVQGYAKHPEDAVVDVLKAGMDVNCGTYLQNHTKNAVQQKKLPVSYIDKALHN 360

Query: 371 LFAVRMRLGLFDGNPTKLPFGQIGPDQVCSKQHQDLALQAAREGIVLLKNSAKLLPLSKS 430
           LF++RMRLGLFDGNPTKLPFG IGP++VCSKQHQ LAL+AA +GIVLLKN+ KLLPL KS
Sbjct: 361 LFSIRMRLGLFDGNPTKLPFGNIGPEKVCSKQHQALALEAAEDGIVLLKNAGKLLPLPKS 420

Query: 431 SIRSLAVIGHNGDEPKTLRGNYAGIPCKSVTPFQGLNSYVKNTVYHRGCNWANCTEATID 490
              SLAVIG N +  +TL GNY G PCK +TP QGL  Y K TVYH GC+   C   TID
Sbjct: 421 KGISLAVIGPNANASETLLGNYHGPPCKLITPLQGLLGYAKKTVYHPGCDTVKCPNPTID 480

Query: 491 QAVQIAKSVDYVVLVMGLDQTQEREDFDRTELGLPGNQEALIAEVAKAAKRPVILVILSG 550
           QAV++A+  DYVVL++GLDQ +ERE  DR  L LPG Q+ LI+ VAKAAK+PVILVILSG
Sbjct: 481 QAVRVAQQADYVVLIVGLDQGEEREAHDRDHLNLPGKQQQLISSVAKAAKKPVILVILSG 540

Query: 551 GPVDISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIKFPMT 610
           GPVDIS+AKYN KIGSILWAGYPG+AGG+A+AE+IFGDHNPGGRLP+TWY  D+IK  MT
Sbjct: 541 GPVDISAAKYNPKIGSILWAGYPGEAGGSALAEVIFGDHNPGGRLPVTWYTQDYIKTLMT 600

Query: 611 DMRMRADPSTGYPGRTYRFYNGPKVYEFGYGLSYSNYLYEF-TSVTESKLHLSNPTASQP 670
           DMRMR D  +GYPGRTYRFY G +V++FGYGLSYSNY Y F +SVT++K++L+  +    
Sbjct: 601 DMRMRPDKRSGYPGRTYRFYTGKRVFDFGYGLSYSNYAYNFVSSVTQNKVYLNESSVGLA 660

Query: 671 AKSSDSIRYRLVSELDKKFCESRAVNVTVGVRNDGEMAGKHSVLLFVKPSKPVNGSPVKQ 730
           AK+SDS RY+LVS+L ++ CE +   VTVG +N+GEMAGKH VLLFV    P NGSP+KQ
Sbjct: 661 AKNSDSGRYQLVSDLGEELCEKKLFKVTVGAKNEGEMAGKHPVLLFVSRKNPTNGSPMKQ 720

Query: 731 LVGFKRVEINAGGRSEIEFLLSPCEHVSKANEEGLMIIEEGSYSLVVGDVEHPLDIFV 786
           LVGFK V ++AG ++E+EF+L+PCEH+S ANE+G M++EEGS  LVVGDVE+P+DI V
Sbjct: 721 LVGFKSVILSAGEKAELEFMLNPCEHLSHANEDGWMVVEEGSRFLVVGDVEYPIDIIV 776

BLAST of Cla022081 vs. NCBI nr
Match: gi|590598192|ref|XP_007018825.1| (Glycosyl hydrolase family protein isoform 3 [Theobroma cacao])

HSP 1 Score: 1124.0 bits (2906), Expect = 0.0e+00
Identity = 538/780 (68.97%), Positives = 651/780 (83.46%), Query Frame = 1

Query: 10   KMKLQKL-LLSAAVFSALLSLIVADSSSQLPYACDSSNSLTKTLPFCRTSLPIKQRARDL 69
            KMKLQKL LL+    S+LL L++ADS+ Q P++CD+S+  TK+ PFC+T+LPI QR +DL
Sbjct: 815  KMKLQKLSLLTLIHISSLLLLVLADST-QPPFSCDTSDPRTKSYPFCKTTLPINQRVQDL 874

Query: 70   VSRLTLDEKVLQLVNAAPAIPRLGIPAYEWWSEALHGVAH---VGYGIRLNGTISAATSF 129
            +SRLTLDEK+ QLVN+AP IPRLGIP  EWWSEALHGVA    V  GIR NGTI +ATSF
Sbjct: 875  ISRLTLDEKISQLVNSAPPIPRLGIPGDEWWSEALHGVAFLASVSQGIRFNGTIQSATSF 934

Query: 130  PQVILTAASFDENLWYQIGQAIGTEARAVYNAGQAKGMTFWAPNINIFRDPRWGRGQETP 189
            PQVILTAASFD +LW++IGQAIG EAR +YNAGQA+GMTFWAPNINI+RDPRWGRGQETP
Sbjct: 935  PQVILTAASFDAHLWFRIGQAIGIEARGIYNAGQARGMTFWAPNINIYRDPRWGRGQETP 994

Query: 190  GEDPLMTAKYSVAYVRGIQGDAIEGGKLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAK 249
            GEDPL+T KY+V++VRGIQGD+ EGG LG  L+ SACCKHFTAYDLD W G+ R+VF+AK
Sbjct: 995  GEDPLVTGKYAVSFVRGIQGDSFEGGMLGEHLQVSACCKHFTAYDLDNWKGVNRFVFNAK 1054

Query: 250  VTMQDMADTYQPPFESCVEKGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYIT 309
            V++QD+ADTYQPPF+SC+++GKASGIMCAYNRVNGVP+CAD++LL+ TAR QW FNGYIT
Sbjct: 1055 VSLQDLADTYQPPFQSCIQQGKASGIMCAYNRVNGVPNCADYNLLSKTARGQWGFNGYIT 1114

Query: 310  SDCDAVSIIHDAQGYAKIPEDAVADVLRAGMDINCGTYLKEHAKSAVEMKKVPIPHIDRA 369
            SDCDAVSI+H+ QGYAK+PEDAVADVL+AGMD+NCG YLK + KSAV+ +K+P+  IDRA
Sbjct: 1115 SDCDAVSIMHEKQGYAKVPEDAVADVLKAGMDVNCGNYLKNYTKSAVKKRKLPMSEIDRA 1174

Query: 370  LHNLFAVRMRLGLFDGNPTKLPFGQIGPDQVCSKQHQDLALQAAREGIVLLKNSAKLLPL 429
            LHNLF+VRMRLGLF+GNPTK PFG IG DQVCS++HQ+LAL+AAR GIVLLKN+  LLPL
Sbjct: 1175 LHNLFSVRMRLGLFNGNPTKQPFGNIGSDQVCSQEHQNLALEAARNGIVLLKNTDSLLPL 1234

Query: 430  SKSSIRSLAVIGHNGDEPKTLRGNYAGIPCKSVTPFQGLNSYVKNTVYHRGCNWANCTEA 489
            SK+   SLAVIG N +  KTL GNYAG PCKS+TP Q L SY K+T YH GC+  NC+ A
Sbjct: 1235 SKTKTTSLAVIGPNANSAKTLVGNYAGPPCKSITPLQALQSYAKDTRYHPGCSAVNCSSA 1294

Query: 490  TIDQAVQIAKSVDYVVLVMGLDQTQEREDFDRTELGLPGNQEALIAEVAKAAKRPVILVI 549
              DQAV+IAK  D+VVLVMGLDQTQERED DR +L LP  Q+ LI+ +A+AAK PVILV+
Sbjct: 1295 LTDQAVKIAKGADHVVLVMGLDQTQEREDHDRVDLVLPAKQQNLISSIARAAKNPVILVL 1354

Query: 550  LSGGPVDISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIKF 609
            LSGGPVDI+ AKY++ IGSILWAGYPG+AGG A+AEIIFGDHNPGGRLP+TWYP  FIK 
Sbjct: 1355 LSGGPVDITFAKYDQHIGSILWAGYPGEAGGLALAEIIFGDHNPGGRLPVTWYPQSFIKV 1414

Query: 610  PMTDMRMRADPSTGYPGRTYRFYNGPKVYEFGYGLSYSNYLYEFTSVTESKLHLSNPTAS 669
            PMTDMRMR +PS+GYPGRTYRFY GPKV+EFGYGLSYS Y YEF  VT++K++L++ + +
Sbjct: 1415 PMTDMRMRPEPSSGYPGRTYRFYQGPKVFEFGYGLSYSKYSYEFLPVTQNKVYLNHQSCN 1474

Query: 670  QPAKSSDSIRYRLVSELDKKFCESRAVNVTVGVRNDGEMAGKHSVLLFVKPSKPVNGSPV 729
            +  ++S+ +RY  VSE+ K+ C+ R   V VGV+N GEMAG H VLLFV+ +K  NG P+
Sbjct: 1475 KMVENSNPVRYMPVSEIAKELCDKRKFPVKVGVQNHGEMAGTHPVLLFVRQAKVGNGRPM 1534

Query: 730  KQLVGFKRVEINAGGRSEIEFLLSPCEHVSKANEEGLMIIEEGSYSLVVGDVEHPLDIFV 786
            KQLVGF  V +NAG R EIEF LSPCEH+S+ANE+GLM+IEEG + L +GD E  + +F+
Sbjct: 1535 KQLVGFHSVNLNAGERVEIEFELSPCEHLSRANEDGLMVIEEGPHFLSIGDKESEITVFI 1593

BLAST of Cla022081 vs. NCBI nr
Match: gi|590598192|ref|XP_007018825.1| (Glycosyl hydrolase family protein isoform 3 [Theobroma cacao])

HSP 1 Score: 1068.9 bits (2763), Expect = 4.2e-309
Identity = 511/752 (67.95%), Positives = 616/752 (81.91%), Query Frame = 1

Query: 16  LLLSAAVFSALLS---LIVADSSSQLPYACDSSNSLTKTLPFCRTSLPIKQRARDLVSRL 75
           ++L    F +L+S   L +   S+Q P++CD S+  TK  PFC+T+LPI QRARDLVSRL
Sbjct: 1   MMLQGLSFVSLISFTLLFIHAGSTQPPFSCDPSDPSTKNYPFCQTTLPISQRARDLVSRL 60

Query: 76  TLDEKVLQLVNAAPAIPRLGIPAYEWWSEALHGVAHVGYGIRLNGTISAATSFPQVILTA 135
           TLDEK+ QLVN+APAIPRLGIPAYEWWSEALHGVA+VG GI+ +G+I AATSFPQVILTA
Sbjct: 61  TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVANVGPGIKFDGSIKAATSFPQVILTA 120

Query: 136 ASFDENLWYQIGQAIGTEARAVYNAGQAKGMTFWAPNINIFRDPRWGRGQETPGEDPLMT 195
           ASFD   WY+IGQ IG EARA+YNAGQA+GMTFWAPNINIFRDPRWGRGQETPGEDPL+T
Sbjct: 121 ASFDAYQWYRIGQVIGREARAIYNAGQARGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 180

Query: 196 AKYSVAYVRGIQGDAIEGGKLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMA 255
            KY+V+YVRG+QGD  +GGKL   L+ASACCKHFTAYDLD W G+ R+VFDA+VT+QD+A
Sbjct: 181 GKYAVSYVRGVQGDIFQGGKLNGHLQASACCKHFTAYDLDNWKGVNRFVFDARVTVQDLA 240

Query: 256 DTYQPPFESCVEKGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVS 315
           DTYQPPF+SCV+ G+ASGIMCAYNRVNGVPSCAD +LL+ T R +W F GYITSDCDAV+
Sbjct: 241 DTYQPPFKSCVQDGRASGIMCAYNRVNGVPSCADSNLLSKTLRGEWDFKGYITSDCDAVA 300

Query: 316 IIHDAQGYAKIPEDAVADVLRAGMDINCGTYLKEHAKSAVEMKKVPIPHIDRALHNLFAV 375
           IIH+ QGYAK PEDAV DVL+AGMD+NCG+YL++++KSAV  KK+P   IDRALHNLFAV
Sbjct: 301 IIHNDQGYAKSPEDAVVDVLKAGMDLNCGSYLQKYSKSAVLQKKLPESEIDRALHNLFAV 360

Query: 376 RMRLGLFDGNPTKLPFGQIGPDQVCSKQHQDLALQAAREGIVLLKNSAKLLPLSKSSIRS 435
           RMRLGLF+GNP + PFG IG DQVCS +HQ LAL+AAR GIVLLKN  KLLPL K+++ S
Sbjct: 361 RMRLGLFNGNPAQHPFGNIGTDQVCSPEHQILALEAARNGIVLLKNEEKLLPLPKATV-S 420

Query: 436 LAVIGHNGDEPKTLRGNYAGIPCKSVTPFQGLNSYVKNTVYHRGCNWANCTEATIDQAVQ 495
           LAVIG N + P+TL GNYAG PCKSVTP Q L SYVKNTVYH GC+  +C+   ID+AV 
Sbjct: 421 LAVIGPNANSPQTLLGNYAGPPCKSVTPLQALQSYVKNTVYHPGCDTVSCSTGVIDKAVD 480

Query: 496 IAKSVDYVVLVMGLDQTQEREDFDRTELGLPGNQEALIAEVAKAAKRPVILVILSGGPVD 555
           IAK  DYVVL+MGLDQTQE+E+ DR +L LPG Q+ LI  VAKAAKRPV+LV+LSGGP+D
Sbjct: 481 IAKQADYVVLIMGLDQTQEKEELDRVDLLLPGRQQELITSVAKAAKRPVVLVLLSGGPID 540

Query: 556 ISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIKFPMTDMRM 615
           +S AK + +IG I WAGYPG+ GG A+AEI+FGDHNPGGRLP+TWYP +F K PMTDMRM
Sbjct: 541 VSFAKDDPRIGGIFWAGYPGEGGGIALAEIVFGDHNPGGRLPVTWYPQEFTKVPMTDMRM 600

Query: 616 RADPSTGYPGRTYRFYNGPKVYEFGYGLSYSNYLYEFTSVTESKLHLSNPTASQPAKSSD 675
           R + S+ YPGRTYRFY G KV+EFGYGLSYS Y YEFT V+++ ++L++ ++     +SD
Sbjct: 601 RPESSSEYPGRTYRFYKGDKVFEFGYGLSYSKYSYEFTRVSQNNVYLNHSSSFHTTVTSD 660

Query: 676 SIRYRLVSELDKKFCESRAVNVTVGVRNDGEMAGKHSVLLFVKPSKPVNGSPVKQLVGFK 735
           S+RY+LVSEL  + C+ R   V VGV+N GEMAGKH VLLF +     +G P KQLVGF+
Sbjct: 661 SVRYKLVSELGAEVCDQRKFTVCVGVKNHGEMAGKHPVLLFARHGNHGDGRPKKQLVGFQ 720

Query: 736 RVEINAGGRSEIEFLLSPCEHVSKANEEGLMI 765
            V ++AG  +EI+F +SPCEH+S+ANE GLM+
Sbjct: 721 SVILSAGEMAEIQFEVSPCEHLSRANEYGLML 751


HSP 2 Score: 1123.6 bits (2905), Expect = 0.0e+00
Identity = 537/780 (68.85%), Positives = 651/780 (83.46%), Query Frame = 1

Query: 10   KMKLQKL-LLSAAVFSALLSLIVADSSSQLPYACDSSNSLTKTLPFCRTSLPIKQRARDL 69
            KMKLQKL LL+    S+LL L++ADS+ Q P++CD+S+  TK+ PFC+T+LPI QR +DL
Sbjct: 815  KMKLQKLSLLTLIHISSLLLLVLADST-QPPFSCDTSDPRTKSYPFCKTTLPINQRVQDL 874

Query: 70   VSRLTLDEKVLQLVNAAPAIPRLGIPAYEWWSEALHGVAH---VGYGIRLNGTISAATSF 129
            +SRLTLDEK+ QLVN+AP IPRLGIP  EWWSEALHGVA    V  GIR NGTI +ATSF
Sbjct: 875  ISRLTLDEKISQLVNSAPPIPRLGIPGDEWWSEALHGVAFLASVSQGIRFNGTIQSATSF 934

Query: 130  PQVILTAASFDENLWYQIGQAIGTEARAVYNAGQAKGMTFWAPNINIFRDPRWGRGQETP 189
            PQVILTAASFD +LW++IGQA+G EAR +YNAGQA+GMTFWAPNINI+RDPRWGRGQETP
Sbjct: 935  PQVILTAASFDAHLWFRIGQAVGIEARGIYNAGQARGMTFWAPNINIYRDPRWGRGQETP 994

Query: 190  GEDPLMTAKYSVAYVRGIQGDAIEGGKLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAK 249
            GEDPL+T KY+V++VRGIQGD+ EGG LG  L+ SACCKHFTAYDLD W G+ R+VF+AK
Sbjct: 995  GEDPLVTGKYAVSFVRGIQGDSFEGGMLGEHLQVSACCKHFTAYDLDNWKGVNRFVFNAK 1054

Query: 250  VTMQDMADTYQPPFESCVEKGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYIT 309
            V++QD+ADTYQPPF+SC+++GKASGIMCAYNRVNGVP+CAD++LL+ TAR QW FNGYIT
Sbjct: 1055 VSLQDLADTYQPPFQSCIQQGKASGIMCAYNRVNGVPNCADYNLLSKTARGQWGFNGYIT 1114

Query: 310  SDCDAVSIIHDAQGYAKIPEDAVADVLRAGMDINCGTYLKEHAKSAVEMKKVPIPHIDRA 369
            SDCDAVSI+H+ QGYAK+PEDAVADVL+AGMD+NCG YLK + KSAV+ +K+P+  IDRA
Sbjct: 1115 SDCDAVSIMHEKQGYAKVPEDAVADVLKAGMDVNCGNYLKNYTKSAVKKRKLPMSEIDRA 1174

Query: 370  LHNLFAVRMRLGLFDGNPTKLPFGQIGPDQVCSKQHQDLALQAAREGIVLLKNSAKLLPL 429
            LHNLF+VRMRLGLF+GNPTK PFG IG DQVCS++HQ+LAL+AAR GIVLLKN+  LLPL
Sbjct: 1175 LHNLFSVRMRLGLFNGNPTKQPFGNIGSDQVCSQEHQNLALEAARNGIVLLKNTDSLLPL 1234

Query: 430  SKSSIRSLAVIGHNGDEPKTLRGNYAGIPCKSVTPFQGLNSYVKNTVYHRGCNWANCTEA 489
            SK+   SLAVIG N +  KTL GNYAG PCKS+TP Q L SY K+T YH GC+  NC+ A
Sbjct: 1235 SKTKTTSLAVIGPNANSAKTLVGNYAGPPCKSITPLQALQSYAKDTRYHPGCSAVNCSSA 1294

Query: 490  TIDQAVQIAKSVDYVVLVMGLDQTQEREDFDRTELGLPGNQEALIAEVAKAAKRPVILVI 549
              DQAV+IAK  D+VVLVMGLDQTQERED DR +L LP  Q+ LI+ +A+AAK PVILV+
Sbjct: 1295 LTDQAVKIAKGADHVVLVMGLDQTQEREDHDRVDLVLPAKQQNLISSIARAAKNPVILVL 1354

Query: 550  LSGGPVDISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIKF 609
            LSGGPVDI+ AKY++ IGSILWAGYPG+AGG A+AEIIFGDHNPGGRLP+TWYP  FIK 
Sbjct: 1355 LSGGPVDITFAKYDQHIGSILWAGYPGEAGGLALAEIIFGDHNPGGRLPVTWYPQSFIKV 1414

Query: 610  PMTDMRMRADPSTGYPGRTYRFYNGPKVYEFGYGLSYSNYLYEFTSVTESKLHLSNPTAS 669
            PMTDMRMR +PS+GYPGRTYRFY GPKV+EFGYGLSYS Y YEF  VT++K++L++ + +
Sbjct: 1415 PMTDMRMRPEPSSGYPGRTYRFYQGPKVFEFGYGLSYSKYSYEFLPVTQNKVYLNHQSCN 1474

Query: 670  QPAKSSDSIRYRLVSELDKKFCESRAVNVTVGVRNDGEMAGKHSVLLFVKPSKPVNGSPV 729
            +  ++S+ +RY  VSE+ K+ C+ R   V VGV+N GEMAG H VLLFV+ +K  NG P+
Sbjct: 1475 KMVENSNPVRYMPVSEIAKELCDKRKFPVKVGVQNHGEMAGTHPVLLFVRQAKVGNGRPM 1534

Query: 730  KQLVGFKRVEINAGGRSEIEFLLSPCEHVSKANEEGLMIIEEGSYSLVVGDVEHPLDIFV 786
            KQLVGF  V +NAG R EIEF LSPCEH+S+ANE+GLM+IEEG + L +GD E  + +F+
Sbjct: 1535 KQLVGFHSVNLNAGERVEIEFELSPCEHLSRANEDGLMVIEEGPHFLSIGDKESEITVFI 1593
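
The percentages in each HSP summary follow directly from the counts on the same line (for example, 511 identical positions over an alignment length of 752). As a minimal sketch, assuming the two-line summary layout shown in the reports above, the following Python snippet parses one summary and recomputes both percentages. The names `HSP_RE` and `parse_hsp` are hypothetical helpers introduced for illustration only; they are not part of BLAST or of this database.

```python
import re

# Hypothetical pattern for the two summary lines that precede each alignment.
# "Posi?tives" also accepts the misspelling "Postives" seen in some exports.
HSP_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+) bits \((?P<raw>\d+)\),\s*Expect = (?P<expect>\S+)\s+"
    r"Identity = (?P<ident>\d+)/(?P<length>\d+) \((?P<ident_pct>[\d.]+)%\),\s*"
    r"Posi?tives = (?P<pos>\d+)/\d+ \((?P<pos_pct>[\d.]+)%\)"
)


def parse_hsp(text):
    """Parse one HSP summary and recompute its percentages from the raw counts."""
    m = HSP_RE.search(text)
    if m is None:
        raise ValueError("no HSP summary found")
    ident, pos, length = int(m["ident"]), int(m["pos"]), int(m["length"])
    return {
        "bits": float(m["bits"]),
        "expect": float(m["expect"]),
        "alignment_length": length,
        # Percentages are counts divided by the alignment length (gaps included).
        "identity_pct": round(100 * ident / length, 2),
        "positives_pct": round(100 * pos / length, 2),
        "reported_identity_pct": float(m["ident_pct"]),
        "reported_positives_pct": float(m["pos_pct"]),
    }


# Summary lines copied from the NCBI nr report above (HSP 1).
sample = """HSP 1 Score: 1068.9 bits (2763), Expect = 4.2e-309
Identity = 511/752 (67.95%), Positives = 616/752 (81.91%), Query Frame = 1"""

hsp = parse_hsp(sample)
print(hsp["identity_pct"], hsp["positives_pct"])
```

Comparing the recomputed values with the reported ones is a quick consistency check when these summaries are copied between tools.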

The following BLAST results are available for this feature:
Match Name  E-value   Identity  Description
BXL7_ARATH  5.2e-304  66.58     Probable beta-D-xylosidase 7 OS=Arabidopsis thaliana GN=BXL7 PE=2 SV=2
BXL6_ARATH  2.9e-230  50.88     Probable beta-D-xylosidase 6 OS=Arabidopsis thaliana GN=BXL6 PE=2 SV=1
BXL5_ARATH  5.4e-216  50.07     Probable beta-D-xylosidase 5 OS=Arabidopsis thaliana GN=BXL5 PE=2 SV=2
BXL3_ARATH  8.9e-211  50.07     Beta-D-xylosidase 3 OS=Arabidopsis thaliana GN=BXL3 PE=1 SV=1
XYL1_MEDSV  3.4e-210  47.13     Beta-xylosidase/alpha-L-arabinofuranosidase 1 (Fragment) OS=Medicago sativa subs...

Match Name        E-value   Identity  Description
A0A0A0LMA9_CUCSA  0.0e+00   91.15     Periplasmic beta-glucosidase OS=Cucumis sativus GN=Csa_2G308360 PE=4 SV=1
A0A061FFD4_THECC  0.0e+00   68.97     Glycosyl hydrolase family protein isoform 3 OS=Theobroma cacao GN=TCM_034945 PE=...
A0A061FFD4_THECC  2.9e-309  67.95     Glycosyl hydrolase family protein isoform 3 OS=Theobroma cacao GN=TCM_034945 PE=...
A0A061FGK5_THECC  2.9e-309  67.95     Glycosyl hydrolase family protein isoform 1 OS=Theobroma cacao GN=TCM_034945 PE=...
A0A061FNE1_THECC  2.9e-309  67.95     Glycosyl hydrolase family protein isoform 2 OS=Theobroma cacao GN=TCM_034945 PE=...

Match Name                        E-value   Identity  Description
gi|449465962|ref|XP_004150696.1|  0.0e+00   91.15     PREDICTED: probable beta-D-xylosidase 7 [Cucumis sativus]
gi|659089146|ref|XP_008445351.1|  0.0e+00   91.28     PREDICTED: probable beta-D-xylosidase 7 [Cucumis melo]
gi|470130855|ref|XP_004301317.1|  0.0e+00   70.57     PREDICTED: probable beta-D-xylosidase 7 [Fragaria vesca subsp. vesca]
gi|590598192|ref|XP_007018825.1|  0.0e+00   68.97     Glycosyl hydrolase family protein isoform 3 [Theobroma cacao]
gi|590598192|ref|XP_007018825.1|  4.2e-309  67.95     Glycosyl hydrolase family protein isoform 3 [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR001764  Glyco_hydro_3_N
IPR002772  Glyco_hydro_3_C
IPR017853  Glycoside_hydrolase_SF
IPR026891  Fn3-like
IPR026892  Glycoside hydrolase family 3
Vocabulary: Molecular Function
Term        Definition
GO:0004553  hydrolase activity, hydrolyzing O-glycosyl compounds
Vocabulary: Biological Process
Term        Definition
GO:0005975  carbohydrate metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0031222      arabinan catabolic process
biological_process  GO:0005975      carbohydrate metabolic process
biological_process  GO:0045493      xylan catabolic process
cellular_component  GO:0048046      apoplast
cellular_component  GO:0009507      chloroplast
cellular_component  GO:0009505      plant-type cell wall
cellular_component  GO:0009506      plasmodesma
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0008422      beta-glucosidase activity
molecular_function  GO:0004553      hydrolase activity, hydrolyzing O-glycosyl compounds
molecular_function  GO:0102483      scopolin beta-glucosidase activity
molecular_function  GO:0016787      hydrolase activity
molecular_function  GO:0046556      alpha-L-arabinofuranosidase activity
molecular_function  GO:0009044      xylan 1,4-beta-xylosidase activity
This gene is associated with the following unigenes:
Unigene Name  Analysis Name                          Sequence type in Unigene
WMU54089      watermelon EST collection version 2.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla022081     Cla022081.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
WMU54089      WMU54089     transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term   IPR Description                                 Source   Source Term         Source Description                             Alignment
IPR001764  Glycoside hydrolase, family 3, N-terminal       PRINTS   PR00133             GLHYDRLASE3                                    coord: 219..235, score: 7.6E-6; coord: 125..144, score: 7.6E-6; coord: 293..311, score: 7.
IPR001764  Glycoside hydrolase, family 3, N-terminal       GENE3D   G3DSA:3.20.20.300                                                  coord: 52..385, score: 1.7E
IPR001764  Glycoside hydrolase, family 3, N-terminal       PFAM     PF00933             Glyco_hydro_3                                  coord: 118..369, score: 1.4
IPR002772  Glycoside hydrolase family 3 C-terminal domain  GENE3D   G3DSA:3.40.50.1700                                                 coord: 399..644, score: 3.5
IPR002772  Glycoside hydrolase family 3 C-terminal domain  PFAM     PF01915             Glyco_hydro_3_C                                coord: 413..642, score: 4.3
IPR002772  Glycoside hydrolase family 3 C-terminal domain  unknown  SSF52279            Beta-D-glucan exohydrolase, C-terminal domain  coord: 413..645, score: 3.01
IPR017853  Glycoside hydrolase superfamily                 unknown  SSF51445            (Trans)glycosidases                            coord: 52..412, score: 5.58
IPR026891  Fibronectin type III-like domain                PFAM     PF14310             Fn3-like                                       coord: 709..776, score: 7.7
IPR026891  Fibronectin type III-like domain                SMART    SM01217             Fn3_like_2                                     coord: 708..778, score: 8.3
IPR026892  Glycoside hydrolase family 3                    PANTHER  PTHR30620           PERIPLASMIC BETA-GLUCOSIDASE-RELATED           coord: 22..783, score:
None       No IPR available                                PANTHER  PTHR30620:SF8       BETA-D-XYLOSIDASE 7-RELATED                    coord: 22..783, score:
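
Each Alignment entry above gives a `coord: start..end` range on the protein. As a minimal sketch, the Python snippet below extracts those ranges and computes the span length of each hit. It assumes the coordinates are 1-based and inclusive, which the page does not state explicitly; `COORD_RE` and `domain_spans` are hypothetical names used only for this illustration.

```python
import re

# Assumption: "coord: a..b" gives 1-based, inclusive residue positions.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def domain_spans(alignment):
    """Return (start, end, length) for every coord entry in an Alignment cell."""
    spans = []
    for start, end in COORD_RE.findall(alignment):
        start, end = int(start), int(end)
        spans.append((start, end, end - start + 1))
    return spans


# Coordinates copied from the PFAM and PRINTS rows above.
pfam_n = domain_spans("coord: 118..369")  # PF00933, Glyco_hydro_3
pfam_c = domain_spans("coord: 413..642")  # PF01915, Glyco_hydro_3_C
prints = domain_spans("coord: 219..235 coord: 125..144 coord: 293..311")

for start, end, length in pfam_n + pfam_c + prints:
    print(f"{start}..{end} spans {length} residues")
```

Under the inclusive-coordinate assumption, the N-terminal Pfam hit covers 252 residues and the C-terminal hit 230.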

The following gene(s) are paralogous to this gene:

None