ClCG01G004790 (gene) Watermelon (Charleston Gray)

NameClCG01G004790
Typegene
OrganismCitrullus lanatus (Watermelon (Charleston Gray))
DescriptionEndoglucanase 17
LocationCG_Chr01 : 5054357 .. 5063927 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCATTAGTTTTGATGAAGCTTTTGGTGATGATTTTGATGTTGAAGCTTCTGGCCATGGCTGTGGCTTCTCATGACTATGAAGATGCTTTGACAAAAAGTATATTGTTTTTTGAAGGTCAGAGGTCTGGAAAACTGCCTCCTAATCAAAGGGTCACTTGGAGGAAGGATTCTGCTCTTCACGATGGCCTTGAGTTCGGTGTAAGATCAATCAATATTTTCTTTCTTTATTTCCATCTACACATGCAAAGCTTCTATTTTAACCATTAAATGTTCTAGTATTCTATTTTTAAACTTTGAATTTTATGTATAAATATAGTTTATTTATTTTGAGTTGTAAAAAGATATTGTGAATAGGACAAATGCATAATCTAACTCTAAAACTTATTAAGAAGCTCTTAAAATATTTCGTTTGTTTCTCAATACCATTTTTAATTAAATGGGGTCTCATCGTATTTTGAAATTAATTCTATAATGCTGATGTGAGCATATAAATACTATCTTATGATGTTTTGTTGGACGATGGGATGTGACAATAATATCTCTCTCACTCAGTATATATGCCAACCTTATGGGTGGAAAAATGTGAGAGTATATATATATATATATATATATATATATATATATATATATATATATAGGTTCTTTTTAGTACAACTGTGGAGATGGATGATCAAACCTCTTACTCTAGATGAAAGGTTATGTCAAAATTACTATTGAGCTAAGATCGCTTAATTAGATACAGAGATTTTAAAATATAACAAAATGAATCAAAATATTTTAGTATATTTTTAATTAAAAAAAAGACATAAATATATTCCTAAGCCCCTAATATTTTTGTAATGATCTTCAATTTAGTCGGAATGTTTAAAATGTAATTTTAATTCCATCTTTCATTATTGGTGAAAAAAATAAAATTTGCTGAAAGTAAACTTATATGTGAAAATAAAATTTTGTATCTTGATAAAAATTAAGCGAACAAAAAAAGTAAAATCCACTAAGTTCAATAAGCATGTATTTAACAGACAACCTTAATTCTATTTCTGAAAACTGTTTTCGGATTTTGTGATCCAAAACGTGCAAATTTTGAAAACAACATTTTTATGTTTTTATTGTATCCAAATTTTAAAATTTAAAATTTAAAATACAAAATTATATTAAATAAAGATAAAAACCTTCAAAAATACAATATGTCATATTATAAACTCAATTCATTCTATAAAAAAATGTAATGTATTGTGTAATGTAATATAAATTATATAATATTACAAAATAATAATTATATAAAATTTTTAACATAAATTTAGTTCATAAATAAATTAATAACAGTTTATTGTCAACTAAATATTAGCGAATAACTATAACTAGTTTTATGATTTATAAATATATGATATCAAACACGTTTTAAACAATTATTTAAATTAGAATGTAAGTTTTGAATCTGCTGCGAAACATAAATCTGAAAATCTGAAATACAATTTTGTTTTTATAGAAGGTTTGTTTTCATATTTTTGTTTTCAAAATTTCTCCAAGCGTTTAGAAACTCAAAATTTCTTCTAACCGTCATGAAAGAGAAAAAATAATGAAGATTTTGAGATGGGTGTTATAGGTTGACTTGGTGGGTGGCTACTATGACGCCGGCGACAACGTGAAGTTCAGTTTTCCAATGGCGTTCACCACTACCATGCTTTCATGGAGTGTGTTGGAGTTTGGGAAAGACATGGGTTCGGATCTTCCCTACGCCATGGACTCGATCCGATGGGCCACTGATTACTTACTCAAAGCCACCTCTGTCCCTGGCTTTGTTTTTGCTCAGGTTGGCGACCCCTATGCCGACCATTTCTGCTGGGAGCGGCCGGAAGATATGGATACGCCGAGAACTCCTTACGCCGTCAGTAAGCAGTTCCCCGGCTCCGAAGTCTCCGCCGAGATTGCCGCGGCTCTTGCTGCCTCCTCCATCGTGTTTAAGCCCCTTGATAGAGGCTACTCTGCCAGGCTTCTTAAAAGGGCTAGAATGGTGAGACTTTGCTTTGATGTCATCTCATTTGCACACATGTTGAAGATAGACTAAAATTTTGAATTGAGTTCCTTGTGATAAATAGAATACTATTTTGATGGATTAATATTTTTATTATCATCTTTTTAGTACAATAAGAGTACAAAGATTCGAATTACATATCTCTTGGTCGCTAACATATCTATATGTCAGCTGAGTTATATGCTTGCTTTAACACTCTTTTAATTGTCAATTAGTTTTGAGATGGAAACTTATATTATTTAATATGTTATGAGAGACCATAAAGTTTACATGGACTTTTGGTCCCACCATTTTGAGGGAAAATGTTAAGATCTCACTCTCCATATTGACAAAATTGATAAGACTAAATTAAATTGCATAAAAAGTTTCTTAAAATAAAATTTTATTAGACACATTTAAAATTATAGCGAATTTATTGAGATAAAGGATGCAATTGATGAAATTTGAAAATTGGCTAATCAGTTAGGTTTTCTCTTTCTCGTATATTGTCATGTGTGTAACAAGGCTGCTCATTGTGTAGCTCAGTATAATGCTAATTTCCATAATTCATTCGGTTTTTTTTATTCTTCCATTTTGGAAGATGGTGGGCGTATTTAGATTTCAAGCTTTCCAAGTTGGGTTTTGGATTTTGGATGTAAGTCTTGGTGACTCTTTCTTTCCTTTACCCCTTTGAGGGAGCTTTTATGCATTTCCCTTTTTTTTAAAAAAAATAAAATTGTAGGAACCTATTATATAAGTATCTTTAGGTTAAGAAATTAAATATTAATGTTTATGATTTGATTTATTTTCAACAATCTTATAACAAATAGTTATTATACATGTGGACTTTAACCAAGATTTATATCCTTAATATTTGTCTATTCCTTTATTTCATTCATTAATCACGACTTCCACCAATAAGGTGTTTTATTTTATTAATAGTTTATGACAATATTTTCGAAATCCAAACTAAGACTAAAAAGAATAATTAATTCTAAAAAAATGGCCTTGTTCTTAATTAGAATTATTGACTAAGTACTAGAATTAGCCAGAAAAGAAACCTATAGTTATGGAATTAGATATTAAAAATGTATTGTTCATGTTTATGGTCCATGAAAGGATTGTTTTTATTGATTTAGGTTTTATTTTTGTTGGTTTAGGTTTTTGAGTTTGCAGATACTTATCGAGGGTCTTACAACCAAAGTCTTGGACCATGGGTTTGTCCATTTTATTGCAGTTATAGTGGCTATGAGGTATATTCATATATATATATATATATATATATATATATATATACATTTCTTAATAAATTAAAATAATACAATAAAAATGTTTTGAATAGATCATAATTTAATAATAAAAAAAGGCTATTTAATTATCCATGTAGGACGAATTAATATGGGGAGCTGCGTGGTTGTTCAAGGCAACAAAGGCTACATTTTACTGGAACTATATCACTAACAACATCAACAAAGTAGAGAACAACCCTGCTGTTGATTATGTGATCAATGCCTTTCAACATTATGCTGCTGGTAGCTTTGCTGAGTTTGGATGGGATACCAAACATGCAGGGATTAATGTCCTCATTTCCAAGGTATATTTTTATTCTTTTGTCAATTTCTTTATGTTTGGATTTGTCTAGCTAGTTGAATTAATTGAACCTTTAAGTTCGAAGAATGATTAAGGAAGGAGTGGATATCTTAACTATTAAGTTATATATATATATATGCTCGTGTTGGTTTAAAAAGAAATAAAGATTGAAAAGAGAAAAAAAAGAGATCAATTTTGACCCAAAATACTAGATGAGTTGTATCGATTTTAACCCTATTCTTTCAATTTCATTCATTTAAACCTCAAACTTGAATAGGTCTATATCAACTTAAACGGTTAAGTTTTCATTCCAATTATTGTTATAAATGCTTTTCACACACATGTCCAAATTAGTCACATGTTTGCCTTATTAAAAATATCTAGGCTTCGGTTTGGTAATTATTTGGTTTTTTGTTTTTCTTTTTGAAAATTAAGTTTACAGAAACTACTTCCACCTCCAAATTTCATTCTTTGTTATCTACTTTTTACTAATGGTTTTAAAAAATCAAGCTGAATTTTGAAAACTAAAAAAATAGATTTTAAAAACTTGTTTTTGTTTTTGGAATTTTGACTAAAAATTCAACCATTATACTATGCGACCATTGTACTTAAGAAAGATGCAACCATTATAATTAAGAAATATTCAGATCATTATAAGAAATGAGAGGAGGATATAGACTTGATTTTCAAAAACCAAATGATTACCAGACGGAGTCTTAATTGTTTGTTATTATTTTTATAAAGAAAAAAAAAATCAAATCAACTTTTTGGGTAACTTTTTTCTTTTAAACAAAACATGTATTTAAACAAATACTTAGTGTAGAGCAACATGTGTGGAAACCTTGTAACGGTGATTGAAACAGACATAAACTTGATTGATACACTTATATATATTTAACGGCTAATTAACTTCAGAATCAAATTTGATTATTCTTTTTCAAAAATGAAAAGCATTTTATAGATTGATAGACCCAATTGTTTATGATTGCAGTTTGGGATGAGTGGAAACGATGCTTCCAATATGTTTATTAATAATGCTGATAAGTTCATATGCTCAGTTCTTCCTGAATCACCCTCTGTATCAGTCTCTTACTCACCAGGTAATTCAAAATTCTCTTTTCTATCATAATCTTTAATTATGATTTATTTTCAAATTGTTTCAAATTTGAAAGAGAATCGAATCTCTAACTTTGAAATTGATAATACATGTTCACCATAGATTGAACTCATGACCTCTTAACCGTGTATTGAGACTTTGTCTCTTTCCATGATGGTTAATTAGTTTTGTTTTTTTTAACAACCCTTTATTTTAGTCCTATAAAGAAAGAGACAAAATAAATATATTTTTTAACCATTCATCGTCCAAAAATTAACAACCTAACTAAAATTTGTTAAACTTATCCCACGAAACCATGAAAATGAAAAGACTAATTTGTAATTCAACCGCACTATATACTATATCCTACTTCAATTGAAAAATAAAAAGTTCAAGCTTCCTTTATTAAGGATTTTGTTCCGGCTTCTATATTATGAGTTTTACATTTCTTGAAAAAACACTTAAATTCTCAGTTGAGCTTAAAAACTATTTTTTGAAAACTATTTTCTTTCAAGCTTTCAAATTTTGGTTTTGGTTTGTTTTTTTTTTTTAAAAAAAAAACTTTAAATGGATAACAAAACAAAATAATCAAGTGAAAGTAGTGTTTTATAAGCTTAATTTATAAATAAAGAACCGAATTATTCTTGGGTGGATTATTTAGGGGGAAAAACAATCAAAAGATAATGCTAATGTTTAAATGCCTTGAAATTTGATATGTGGCGCTTCAAACGACCAAGTACGAAAGTCTTCTCTCTAATAATAAATTGTTTTCTCTTTGTCGGTAGATGTAGCTAACACACTATTAATAAATCACGTAAATCTTTGTATCGATTTTCTATTATCTTACGTATTCTGTTTTTTTCGTTTGTCGATTCTATAACAGATAACCTTAATCCTAATATGGTATTAATATAGGTGGGCTTCTATTCAAACCGGGAGGAAGTAACATGCAACATTCAACAACCTTATCTTTTCTTCTTCTTGTTTACTCTAATTACTTGAATCAATCCAATTCCAAACGCCTCCTTCATTGTGGAAATGTTGTAGCCTCACCATCTCGTCTTAGACAAGTTGCCAAGGGCCAGGTATATACCTTAAACAACGAACTTAGATTAAAGACACTCAATTTATGTTTAAATTGCAATCTTAGTCCTTTAACTTTCACGTTTTTATTCTATTTGGTCATGATCTTACTTTTATTTTTGTGTCAATAAGTCTTTAAACTCTAAGAAATTTTTTTAGACGTATTTAAACATTCAATTTTATGCCCAAATAGAATTTTGAAATTTTAATTTTGTCTCTAATAAATTTTTTAATTTATTTAATTTTAACTTTTTATATATTCATAAACTACTAACTATAAAATTGAAAATTAAGTAACATACTTAGAAAAGAATTTCAATTATATATGTAAGAGGCAAATCATTTTTAAAAACTTTAATTTGAATATATTATGGATAAATTAGACATAAAATTGAAAGCTTTGAAGCTTGTTAGACATTTTTAAAGTCTCTAAAGACTTTTTTAACCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGCTAAAAGGCTTATTGAATATTCCTAAAGTTCAAGGATCCTATTAGACACAAAATTGAAAGTTTAAGTCCTATTAATTAAAATATTCTCAATGTAACAAAATTAAATTCAACTTTCTCAAATTGAATTTATATTGAACTAAATCCAAAATTTCAAAACTCTAACCTTCAATCTAAATTGAATTATCTTAACCCATTAATTAAATCTTCAACCTCAAGGTTCAAACTAACATCCAAATAATTCAAGATAAATAAATTAAAATTATCTTAAAATTTGGAATGTCACACACTCAAAGTTCAAAGTCCTATTAGACACATAACTAAAATAAGTTCATGGACCTATTCGGTATTTTTATAGTTTTCGATTCTATTAGACACAAAATTGAAAGTACAAGAATTTATTAAACACATTTTTAAAGATTAGAAATTTATTTTAAAAGCTCAAGGCTTATTAATTAAAATACTTTTTAAAGTTAAAAAACCATCCTATATTAGACCAAATATATAAAGTTTAGAGATGTATTAGATATTTTTAAAGTTCAAGATCCTACTAGCCACAAATTTAAACGGATGTTTGGCTGAGAGTTGAAATTGAGGGAGTGGGGTTGTTTGGTCGAAAGAGTTCATGGGCCCTACAACTAAATAGGCCCCACGACTTAGCTTCCCTACTATCCATACTCCATAATTTTACTCCTTACCTTAAACACGCTTAAAAGTTCAATAACTAAACTTGTAATTCAACCTAACAACGGTGATATTGATACATTTTGGATGATATGTAGGTGGATTACATACTAGGAAGCAACCCATTGGGGATGTCCTATATGGTGGGTTATGGCAACAAATTCCCACAACGGATCCACCATCGTGGCTCGTCATTGCCATCCATGGCTAAACATCCAGAGCCCATTGCATGTGCAAAAGGGAAACCATATTTTCAAAGCAACAATCCAAACCCTAATTTGCTAATTGGAGCTGTTGTTGGAGGACCTGATTTCAAAGATTCTTATGCAGATTCTCGACTTGATTTTGTTTATTCTGAACCAACTACTTACATTAATGCTCCTCTTGTCGGCCTCTTGGCTTACTTCAAATCTCATCCTAATTCATAG

mRNA sequence

ATGGCATTAGTTTTGATGAAGCTTTTGGTGATGATTTTGATGTTGAAGCTTCTGGCCATGGCTGTGGCTTCTCATGACTATGAAGATGCTTTGACAAAAAGTATATTGTTTTTTGAAGGTCAGAGGTCTGGAAAACTGCCTCCTAATCAAAGGGTCACTTGGAGGAAGGATTCTGCTCTTCACGATGGCCTTGAGTTCGGTGTTGACTTGGTGGGTGGCTACTATGACGCCGGCGACAACGTGAAGTTCAGTTTTCCAATGGCGTTCACCACTACCATGCTTTCATGGAGTGTGTTGGAGTTTGGGAAAGACATGGGTTCGGATCTTCCCTACGCCATGGACTCGATCCGATGGGCCACTGATTACTTACTCAAAGCCACCTCTGTCCCTGGCTTTGTTTTTGCTCAGGTTGGCGACCCCTATGCCGACCATTTCTGCTGGGAGCGGCCGGAAGATATGGATACGCCGAGAACTCCTTACGCCGTCAGTAAGCAGTTCCCCGGCTCCGAAGTCTCCGCCGAGATTGCCGCGGCTCTTGCTGCCTCCTCCATCGTGTTTAAGCCCCTTGATAGAGGCTACTCTGCCAGGCTTCTTAAAAGGGCTAGAATGGTTTTTGAGTTTGCAGATACTTATCGAGGGTCTTACAACCAAAGTCTTGGACCATGGGTTTGTCCATTTTATTGCAGTTATAGTGGCTATGAGGACGAATTAATATGGGGAGCTGCGTGGTTGTTCAAGGCAACAAAGGCTACATTTTACTGGAACTATATCACTAACAACATCAACAAAGTAGAGAACAACCCTGCTGTTGATTATGTGATCAATGCCTTTCAACATTATGCTGCTGGTAGCTTTGCTGAGTTTGGATGGGATACCAAACATGCAGGGATTAATGTCCTCATTTCCAAGTTTGGGATGAGTGGAAACGATGCTTCCAATATGTTTATTAATAATGCTGATAAGTTCATATGCTCAGTTCTTCCTGAATCACCCTCTGTATCAGTCTCTTACTCACCAGGAAGTAACATGCAACATTCAACAACCTTATCTTTTCTTCTTCTTGTTTACTCTAATTACTTGAATCAATCCAATTCCAAACGCCTCCTTCATTGTGGAAATGTTGTAGCCTCACCATCTCGTCTTAGACAAGTTGCCAAGGGCCAGGTGGATTACATACTAGGAAGCAACCCATTGGGGATGTCCTATATGGTGGGTTATGGCAACAAATTCCCACAACGGATCCACCATCGTGGCTCGTCATTGCCATCCATGGCTAAACATCCAGAGCCCATTGCATGTGCAAAAGGGAAACCATATTTTCAAAGCAACAATCCAAACCCTAATTTGCTAATTGGAGCTGTTGTTGGAGGACCTGATTTCAAAGATTCTTATGCAGATTCTCGACTTGATTTTGTTTATTCTGAACCAACTACTTACATTAATGCTCCTCTTGTCGGCCTCTTGGCTTACTTCAAATCTCATCCTAATTCATAG

Coding sequence (CDS)

ATGGCATTAGTTTTGATGAAGCTTTTGGTGATGATTTTGATGTTGAAGCTTCTGGCCATGGCTGTGGCTTCTCATGACTATGAAGATGCTTTGACAAAAAGTATATTGTTTTTTGAAGGTCAGAGGTCTGGAAAACTGCCTCCTAATCAAAGGGTCACTTGGAGGAAGGATTCTGCTCTTCACGATGGCCTTGAGTTCGGTGTTGACTTGGTGGGTGGCTACTATGACGCCGGCGACAACGTGAAGTTCAGTTTTCCAATGGCGTTCACCACTACCATGCTTTCATGGAGTGTGTTGGAGTTTGGGAAAGACATGGGTTCGGATCTTCCCTACGCCATGGACTCGATCCGATGGGCCACTGATTACTTACTCAAAGCCACCTCTGTCCCTGGCTTTGTTTTTGCTCAGGTTGGCGACCCCTATGCCGACCATTTCTGCTGGGAGCGGCCGGAAGATATGGATACGCCGAGAACTCCTTACGCCGTCAGTAAGCAGTTCCCCGGCTCCGAAGTCTCCGCCGAGATTGCCGCGGCTCTTGCTGCCTCCTCCATCGTGTTTAAGCCCCTTGATAGAGGCTACTCTGCCAGGCTTCTTAAAAGGGCTAGAATGGTTTTTGAGTTTGCAGATACTTATCGAGGGTCTTACAACCAAAGTCTTGGACCATGGGTTTGTCCATTTTATTGCAGTTATAGTGGCTATGAGGACGAATTAATATGGGGAGCTGCGTGGTTGTTCAAGGCAACAAAGGCTACATTTTACTGGAACTATATCACTAACAACATCAACAAAGTAGAGAACAACCCTGCTGTTGATTATGTGATCAATGCCTTTCAACATTATGCTGCTGGTAGCTTTGCTGAGTTTGGATGGGATACCAAACATGCAGGGATTAATGTCCTCATTTCCAAGTTTGGGATGAGTGGAAACGATGCTTCCAATATGTTTATTAATAATGCTGATAAGTTCATATGCTCAGTTCTTCCTGAATCACCCTCTGTATCAGTCTCTTACTCACCAGGAAGTAACATGCAACATTCAACAACCTTATCTTTTCTTCTTCTTGTTTACTCTAATTACTTGAATCAATCCAATTCCAAACGCCTCCTTCATTGTGGAAATGTTGTAGCCTCACCATCTCGTCTTAGACAAGTTGCCAAGGGCCAGGTGGATTACATACTAGGAAGCAACCCATTGGGGATGTCCTATATGGTGGGTTATGGCAACAAATTCCCACAACGGATCCACCATCGTGGCTCGTCATTGCCATCCATGGCTAAACATCCAGAGCCCATTGCATGTGCAAAAGGGAAACCATATTTTCAAAGCAACAATCCAAACCCTAATTTGCTAATTGGAGCTGTTGTTGGAGGACCTGATTTCAAAGATTCTTATGCAGATTCTCGACTTGATTTTGTTTATTCTGAACCAACTACTTACATTAATGCTCCTCTTGTCGGCCTCTTGGCTTACTTCAAATCTCATCCTAATTCATAG

Protein sequence

MALVLMKLLVMILMLKLLAMAVASHDYEDALTKSILFFEGQRSGKLPPNQRVTWRKDSALHDGLEFGVDLVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKATSVPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSIVFKPLDRGYSARLLKRARMVFEFADTYRGSYNQSLGPWVCPFYCSYSGYEDELIWGAAWLFKATKATFYWNYITNNINKVENNPAVDYVINAFQHYAAGSFAEFGWDTKHAGINVLISKFGMSGNDASNMFINNADKFICSVLPESPSVSVSYSPGSNMQHSTTLSFLLLVYSNYLNQSNSKRLLHCGNVVASPSRLRQVAKGQVDYILGSNPLGMSYMVGYGNKFPQRIHHRGSSLPSMAKHPEPIACAKGKPYFQSNNPNPNLLIGAVVGGPDFKDSYADSRLDFVYSEPTTYINAPLVGLLAYFKSHPNS
BLAST of ClCG01G004790 vs. Swiss-Prot
Match: GUN8_ARATH (Endoglucanase 8 OS=Arabidopsis thaliana GN=CEL1 PE=2 SV=1)

HSP 1 Score: 585.1 bits (1507), Expect = 7.2e-166
Identity = 293/500 (58.60%), Postives = 359/500 (71.80%), Query Frame = 1

Query: 4   VLMKLLVMILMLKLLAMAVASHDYEDALTKSILFFEGQRSGKLPPNQRVTWRKDSALHDG 63
           ++  ++++ ++L    +  A HDY DAL KSILFFEGQRSGKLPP+QR+ WR+DSAL DG
Sbjct: 6   LIFPVILLAVLLFSPPIYSAGHDYRDALRKSILFFEGQRSGKLPPDQRLKWRRDSALRDG 65

Query: 64  LEFGVDLVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYL 123
              GVDL GGYYDAGDN+KF FPMAFTTTMLSWS+++FGK MG +L  A+ +++W TDYL
Sbjct: 66  SSAGVDLSGGYYDAGDNIKFGFPMAFTTTMLSWSIIDFGKTMGPELRNAVKAVKWGTDYL 125

Query: 124 LKATSVPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASS 183
           LKAT++PG VF QVGD Y+DH CWERPEDMDT RT Y + +  PGS+V+ E AAALAA+S
Sbjct: 126 LKATAIPGVVFVQVGDAYSDHNCWERPEDMDTLRTVYKIDRAHPGSDVAGETAAALAAAS 185

Query: 184 IVFKPLDRGYSARLLKRARMVFEFADTYRGSYNQSLGPWVCPFYCSYSGYEDELIWGAAW 243
           IVF+  D  YS  LL RA  VF FA+ YRG+Y+ SL   VCPFYC ++GY+DEL+WGAAW
Sbjct: 186 IVFRKRDPAYSRLLLDRATRVFAFANRYRGAYSNSLYHAVCPFYCDFNGYQDELLWGAAW 245

Query: 244 LFKATKATFYWNYITNNINKVENNPAVDYVINAFQHYAAGSFAEFGWDTKHAGINVLISK 303
           L KA++   Y  +I  N          + ++      A  +  EFGWD KHAGINVLISK
Sbjct: 246 LHKASRKRAYREFIVKN----------EVILK-----AGDTINEFGWDNKHAGINVLISK 305

Query: 304 FGMSGN-DASNMFINNADKFICSVLPESPSVSVSYS--------PGSNMQHSTTLSFLLL 363
             + G  +    F  NAD FICS+LP      V YS         GSNMQH T+LSFLLL
Sbjct: 306 EVLMGKAEYFESFKQNADGFICSILPGISHPQVQYSRGGLLVKTGGSNMQHVTSLSFLLL 365

Query: 364 VYSNYLNQSNSKRLLHCGNVVASPSRLRQVAKGQVDYILGSNPLGMSYMVGYGNKFPQRI 423
            YSNYL  S++K+++ CG + ASPS LRQ+AK QVDYILG NP+G+SYMVGYG KFP+RI
Sbjct: 366 AYSNYL--SHAKKVVPCGELTASPSLLRQIAKRQVDYILGDNPMGLSYMVGYGQKFPRRI 425

Query: 424 HHRGSSLPSMAKHPEPIACAKGKPYFQSNNPNPNLLIGAVVGGPDFKDSYADSRLDFVYS 483
           HHRGSS+PS++ HP  I C +G  YF S NPNPNLL+GAVVGGP+  D++ DSR  F  S
Sbjct: 426 HHRGSSVPSVSAHPSHIGCKEGSRYFLSPNPNPNLLVGAVVGGPNVTDAFPDSRPYFQQS 485

Query: 484 EPTTYINAPLVGLLAYFKSH 495
           EPTTYINAPLVGLL YF +H
Sbjct: 486 EPTTYINAPLVGLLGYFSAH 488

BLAST of ClCG01G004790 vs. Swiss-Prot
Match: GUN4_ARATH (Endoglucanase 4 OS=Arabidopsis thaliana GN=At1g23210 PE=2 SV=1)

HSP 1 Score: 582.4 bits (1500), Expect = 4.7e-165
Identity = 296/495 (59.80%), Postives = 350/495 (70.71%), Query Frame = 1

Query: 10  VMILMLKLLAMAV-ASHDYEDALTKSILFFEGQRSGKLPPNQRVTWRKDSALHDGLEFGV 69
           +M+ ML L++    A HDY DAL KSILFFEGQRSGKLPP+QR+ WR+DSAL DG   GV
Sbjct: 11  IMLAMLLLISPETYAGHDYRDALRKSILFFEGQRSGKLPPDQRLKWRRDSALRDGSSAGV 70

Query: 70  DLVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKATS 129
           DL GGYYDAGDNVKF FPMAFTTTM+SWSV++FGK MG +L  A+ +I+W TDYL+KAT 
Sbjct: 71  DLTGGYYDAGDNVKFGFPMAFTTTMMSWSVIDFGKTMGPELENAVKAIKWGTDYLMKATQ 130

Query: 130 VPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSIVFKP 189
           +P  VF QVGD Y+DH CWERPEDMDT RT Y + K   GSEV+ E AAALAA+SIVF+ 
Sbjct: 131 IPDVVFVQVGDAYSDHNCWERPEDMDTLRTVYKIDKDHSGSEVAGETAAALAAASIVFEK 190

Query: 190 LDRGYSARLLKRARMVFEFADTYRGSYNQSLGPWVCPFYCSYSGYEDELIWGAAWLFKAT 249
            D  YS  LL RA  VF FA  YRG+Y+ SL   VCPFYC ++GYEDEL+WGAAWL KA+
Sbjct: 191 RDPVYSKMLLDRATRVFAFAQKYRGAYSDSLYQAVCPFYCDFNGYEDELLWGAAWLHKAS 250

Query: 250 KATFYWNYITNNINKVENNPAVDYVINAFQHYAAGSFAEFGWDTKHAGINVLISKFGMSG 309
           K   Y  +I  N            ++      A  +  EFGWD KHAGINVL+SK  + G
Sbjct: 251 KKRVYREFIVKN----------QVILR-----AGDTIHEFGWDNKHAGINVLVSKMVLMG 310

Query: 310 N-DASNMFINNADKFICSVLPESPSVSVSYSP--------GSNMQHSTTLSFLLLVYSNY 369
             +    F  NAD+FICS+LP      V YS         GSNMQH T+LSFLLL YSNY
Sbjct: 311 KAEYFQSFKQNADEFICSLLPGISHPQVQYSQGGLLVKSGGSNMQHVTSLSFLLLTYSNY 370

Query: 370 LNQSNSKRLLHCGNVVASPSRLRQVAKGQVDYILGSNPLGMSYMVGYGNKFPQRIHHRGS 429
           L+ +N  +++ CG   ASP+ LRQVAK QVDYILG NP+ MSYMVGYG++FPQ+IHHRGS
Sbjct: 371 LSHAN--KVVPCGEFTASPALLRQVAKRQVDYILGDNPMKMSYMVGYGSRFPQKIHHRGS 430

Query: 430 SLPSMAKHPEPIACAKGKPYFQSNNPNPNLLIGAVVGGPDFKDSYADSRLDFVYSEPTTY 489
           S+PS+  HP+ I C  G  YF SNNPNPNLLIGAVVGGP+  D + DSR  F  +EPTTY
Sbjct: 431 SVPSVVDHPDRIGCKDGSRYFFSNNPNPNLLIGAVVGGPNITDDFPDSRPYFQLTEPTTY 488

Query: 490 INAPLVGLLAYFKSH 495
           INAPL+GLL YF +H
Sbjct: 491 INAPLLGLLGYFSAH 488

BLAST of ClCG01G004790 vs. Swiss-Prot
Match: GUN17_ORYSJ (Endoglucanase 17 OS=Oryza sativa subsp. japonica GN=GLU13 PE=2 SV=1)

HSP 1 Score: 572.0 bits (1473), Expect = 6.3e-162
Identity = 293/498 (58.84%), Postives = 350/498 (70.28%), Query Frame = 1

Query: 10  VMILMLKLLAMAVASHDYEDALTKSILFFEGQRSGKLPPNQRVTWRKDSALHDGLEFGVD 69
           V++L+L         HDY DAL KSILFFEGQRSG+LPP+QR+ WR+DSAL+DG   GVD
Sbjct: 8   VLLLVLATATSVTGQHDYSDALHKSILFFEGQRSGRLPPDQRLRWRRDSALNDGATAGVD 67

Query: 70  LVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKATSV 129
           L GGYYDAGDNVKF FPMAFT T++SW +++FG+  G+    A +++RWATDYL+KAT+ 
Sbjct: 68  LTGGYYDAGDNVKFGFPMAFTATLMSWGLIDFGRSFGAHAAEAREAVRWATDYLMKATAT 127

Query: 130 PGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSIVFKPL 189
           P  V+ QVGD + DH CWERPEDMDTPRT Y V    PGS+V+AE AAALAA+SIVF+  
Sbjct: 128 PNTVYVQVGDAFRDHSCWERPEDMDTPRTVYKVDPSHPGSDVAAETAAALAAASIVFRDA 187

Query: 190 DRGYSARLLKRARMVFEFADTYRGSYNQSLGPWVCPFYCSYSGYEDELIWGAAWLFKATK 249
           D  YS RLL RA  VFEFAD YRG Y+ SL   VCP YC YSGY+DEL+WGAAWL KA++
Sbjct: 188 DPDYSNRLLDRAIQVFEFADKYRGPYSSSLHAAVCPCYCDYSGYKDELLWGAAWLHKASR 247

Query: 250 ATFYWNYITNNINKVENNPAVDYVINAFQHYAAGSFAEFGWDTKHAGINVLISKFGMSGN 309
              Y +YI  N          + V+ A +     +  EFGWD KHAGINVLISK  + G 
Sbjct: 248 RREYRDYIKRN----------EVVLGASE-----AINEFGWDNKHAGINVLISKEVLMGK 307

Query: 310 DA-SNMFINNADKFICSVLPE-SPSVSVSYSPG--------SNMQHSTTLSFLLLVYSNY 369
           D     F  NAD FIC++LP  S    + YSPG        SNMQH T+LSFLLL YSNY
Sbjct: 308 DEYFQSFRVNADNFICTLLPGISNHPQIQYSPGGLLFKVGNSNMQHVTSLSFLLLAYSNY 367

Query: 370 LNQSNSKRLLHCGNVVASPSRLRQVAKGQVDYILGSNPLGMSYMVGYGNKFPQRIHHRGS 429
           L+ +N +  + CG   ASP +LR+VAK QVDYILG NPL MSYMVGYG+++P RIHHRGS
Sbjct: 368 LSHANVR--VPCGTSSASPVQLRRVAKRQVDYILGDNPLRMSYMVGYGSRYPLRIHHRGS 427

Query: 430 SLPSMAKHPEPIACAKGKPYFQSNNPNPNLLIGAVVGGP-DFKDSYADSRLDFVYSEPTT 489
           SLPS+A HP  I C  G  Y+ S  PNPNLL+GAVVGGP +  D++ D+R  F  SEPTT
Sbjct: 428 SLPSVAAHPAQIGCKAGATYYASAAPNPNLLVGAVVGGPSNTSDAFPDARAVFQQSEPTT 487

Query: 490 YINAPLVGLLAYFKSHPN 497
           YINAPL+GLLAYF +HPN
Sbjct: 488 YINAPLLGLLAYFSAHPN 488

BLAST of ClCG01G004790 vs. Swiss-Prot
Match: GUN6_ORYSJ (Endoglucanase 6 OS=Oryza sativa subsp. japonica GN=Os02g0733300 PE=2 SV=1)

HSP 1 Score: 566.2 bits (1458), Expect = 3.5e-160
Identity = 295/508 (58.07%), Postives = 351/508 (69.09%), Query Frame = 1

Query: 1   MALVLMKLLVMILMLKLLAMAVASHDYEDALTKSILFFEGQRSGKLPPNQRVTWRKDSAL 60
           +A+V   +LV++L    + +    HDY DAL KSILFFEGQRSG+LPP+QR+ WR+DS L
Sbjct: 11  VAVVAAAVLVLLLSPAAVVVVAGQHDYGDALHKSILFFEGQRSGRLPPDQRLRWRRDSGL 70

Query: 61  HDGLEFGVDLVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWAT 120
           HDG    VDL GGYYDAGDNVKF FPMAFT T++SW +++FG+  G     A  ++RWAT
Sbjct: 71  HDGAAASVDLTGGYYDAGDNVKFGFPMAFTATLMSWGLIDFGRSFGPHKEEARKAVRWAT 130

Query: 121 DYLLKATSVPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALA 180
           DYL+KAT+ P  V+ QVGD + DH CWERPEDMDTPRT Y V    PGS+V+AE AAALA
Sbjct: 131 DYLMKATAKPNTVYVQVGDAFRDHSCWERPEDMDTPRTVYKVDPSHPGSDVAAETAAALA 190

Query: 181 ASSIVFKPLDRGYSARLLKRARMVFEFADTYRGSYNQSLGPWVCPFYCSYSGYEDELIWG 240
           A SIVF+  D  YS RLL RA  VFEFAD YRG Y+ SL   VCP YC +SGY+DEL+WG
Sbjct: 191 AGSIVFRDADPAYSKRLLDRAIAVFEFADKYRGPYSSSLHDAVCPCYCDFSGYKDELLWG 250

Query: 241 AAWLFKATKATFYWNYITNNINKVENNPAVDYVINAFQHYAAGSFAEFGWDTKHAGINVL 300
           AAWL KA++   Y  YI  N          + V+ A +     S  EFGWD KHAGINVL
Sbjct: 251 AAWLHKASRRREYREYIKKN----------EVVLGASE-----SINEFGWDNKHAGINVL 310

Query: 301 ISKFGMSGNDA-SNMFINNADKFICSVLPE-SPSVSVSYSP--------GSNMQHSTTLS 360
           ISK  + G D     F  NAD F+CS+LP  S    + YSP        GSNMQH T+LS
Sbjct: 311 ISKEVLMGKDEYFQSFRVNADNFMCSLLPGISNHPQIQYSPGGLLFKVGGSNMQHVTSLS 370

Query: 361 FLLLVYSNYLNQSNSKRLLHCG-NVVASPSRLRQVAKGQVDYILGSNPLGMSYMVGYGNK 420
           FLLL YSNYL+ + ++  + CG    ASP++LR+VAK QVDYILG NPL MSYMVGYG +
Sbjct: 371 FLLLAYSNYLSHAGAR--VSCGAGGSASPTQLRRVAKRQVDYILGDNPLRMSYMVGYGAR 430

Query: 421 FPQRIHHRGSSLPSMAKHPEPIACAKGKPYFQSNNPNPNLLIGAVVGGP-DFKDSYADSR 480
           FP+RIHHRGSSLPS+A HP  I C  G  Y+ S  PNPNLL+GAVVGGP D  D++ D+R
Sbjct: 431 FPRRIHHRGSSLPSVAAHPARIGCKGGAAYYASAAPNPNLLVGAVVGGPSDATDAFPDAR 490

Query: 481 LDFVYSEPTTYINAPLVGLLAYFKSHPN 497
             F  SEPTTYINAPL+GLLAYF +HPN
Sbjct: 491 AVFQQSEPTTYINAPLMGLLAYFSAHPN 501

BLAST of ClCG01G004790 vs. Swiss-Prot
Match: GUN17_ARATH (Endoglucanase 17 OS=Arabidopsis thaliana GN=At4g02290 PE=2 SV=1)

HSP 1 Score: 550.1 bits (1416), Expect = 2.6e-155
Identity = 283/479 (59.08%), Postives = 334/479 (69.73%), Query Frame = 1

Query: 22  VASHDYEDALTKSILFFEGQRSGKLPPNQRVTWRKDSALHDGLEFGVDLVGGYYDAGDNV 81
           +A H+Y+DALTKSILFFEGQRSGKLP NQR++WR+DS L DG    VDLVGGYYDAGDN+
Sbjct: 48  LAKHNYKDALTKSILFFEGQRSGKLPSNQRMSWRRDSGLSDGSALHVDLVGGYYDAGDNI 107

Query: 82  KFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKATSVPGFVFAQVGDPY 141
           KF FPMAFTTTMLSWSV+EFG  M S+L  A  +IRWATDYLLKATS P  ++ QVGD  
Sbjct: 108 KFGFPMAFTTTMLSWSVIEFGGLMKSELQNAKIAIRWATDYLLKATSQPDTIYVQVGDAN 167

Query: 142 ADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSIVFKPLDRGYSARLLKRA 201
            DH CWERPEDMDT R+ + V K  PGS+V+AE AAALAA++IVF+  D  YS  LLKRA
Sbjct: 168 KDHSCWERPEDMDTVRSVFKVDKNIPGSDVAAETAAALAAAAIVFRKSDPSYSKVLLKRA 227

Query: 202 RMVFEFADTYRGSYNQSLGPWVCPFYCSYSGYEDELIWGAAWLFKATKATFYWNYITNNI 261
             VF FAD YRG+Y+  L P VCPFYCSYSGY+DEL+WGAAWL KATK   Y NYI    
Sbjct: 228 ISVFAFADKYRGTYSAGLKPDVCPFYCSYSGYQDELLWGAAWLQKATKNIKYLNYIK--- 287

Query: 262 NKVENNPAVDYVINAFQHYAAGSFAEFGWDTKHAGINVLISK-FGMSGNDASNMFINNAD 321
                       IN     AA     FGWD KHAG  +L++K F +      + +  +AD
Sbjct: 288 ------------INGQILGAAEYDNTFGWDNKHAGARILLTKAFLVQNVKTLHEYKGHAD 347

Query: 322 KFICSVLPESPSVSVSYSPG--------SNMQHSTTLSFLLLVYSNYLNQSNSKRLLHCG 381
            FICSV+P +P  S  Y+PG        +NMQ+ T+ SFLLL Y+ YL  +++K ++HCG
Sbjct: 348 NFICSVIPGAPFSSTQYTPGGLLFKMADANMQYVTSTSFLLLTYAKYL--TSAKTVVHCG 407

Query: 382 NVVASPSRLRQVAKGQVDYILGSNPLGMSYMVGYGNKFPQRIHHRGSSLPSMAKHPEPIA 441
             V +P RLR +AK QVDY+LG NPL MSYMVGYG KFP+RIHHRGSSLP +A HP  I 
Sbjct: 408 GSVYTPGRLRSIAKRQVDYLLGDNPLRMSYMVGYGPKFPRRIHHRGSSLPCVASHPAKIQ 467

Query: 442 CAKGKPYFQSNNPNPNLLIGAVVGGPDFKDSYADSRLDFVYSEPTTYINAPLVGLLAYF 492
           C +G     S +PNPN L+GAVVGGPD  D + D R D+  SEP TYIN+PLVG LAYF
Sbjct: 468 CHQGFAIMNSQSPNPNFLVGAVVGGPDQHDRFPDERSDYEQSEPATYINSPLVGALAYF 509

BLAST of ClCG01G004790 vs. TrEMBL
Match: M5XMI2_PRUPE (Endoglucanase OS=Prunus persica GN=PRUPE_ppa014785mg PE=3 SV=1)

HSP 1 Score: 748.8 bits (1932), Expect = 4.2e-213
Identity = 366/488 (75.00%), Postives = 410/488 (84.02%), Query Frame = 1

Query: 18  LAMAVA---SHDYEDALTKSILFFEGQRSGKLPPNQRVTWRKDSALHDGLEFGVDLVGGY 77
           + M VA   SHDY DAL KSILFFEGQRSGKLP +QR+TWRKDSAL DG E GVDLVGGY
Sbjct: 1   MVMGVACSQSHDYGDALNKSILFFEGQRSGKLPSSQRMTWRKDSALRDGYEIGVDLVGGY 60

Query: 78  YDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKATSVPGFVF 137
           YDAGDNVKF+FPMAF+TTML+WSVLEFGK M SDLP+A+D+IRWATDY LKATS+PGFVF
Sbjct: 61  YDAGDNVKFNFPMAFSTTMLAWSVLEFGKGMSSDLPHALDAIRWATDYFLKATSIPGFVF 120

Query: 138 AQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSIVFKPLDRGYS 197
            QVGDPY DH CWERPEDMDTPRTP+AVSKQFPGSEVSAEIAAALAAS++VF+P+D  YS
Sbjct: 121 VQVGDPYGDHNCWERPEDMDTPRTPFAVSKQFPGSEVSAEIAAALAASAMVFRPIDLKYS 180

Query: 198 ARLLKRARMVFEFADTYRGSYNQSLGPWVCPFYCSYSGYEDELIWGAAWLFKATKATFYW 257
           ARLLKRARMVF+FAD Y+GSYN SLGPWVCPFYC +SGYEDEL+WGAAWLFKATK   YW
Sbjct: 181 ARLLKRARMVFDFADKYQGSYNDSLGPWVCPFYCDFSGYEDELVWGAAWLFKATKQPIYW 240

Query: 258 NYITNNINKVENNPAVDYVINAFQHYAAGSFAEFGWDTKHAGINVLISKFGMS-GNDASN 317
           NY+  NINK+E++  V Y+      Y  GSFAEFGWD+KHAGINVL+SK  MS     S 
Sbjct: 241 NYVLQNINKLESSATVKYINGV--SYLGGSFAEFGWDSKHAGINVLVSKLIMSMAGTGST 300

Query: 318 MFINNADKFICSVLPESPSVSVSYSP--------GSNMQHSTTLSFLLLVYSNYLNQSNS 377
            FI+NADKFIC++LPESP+VSVSYSP        GSNMQH+TTLSFLL+VY+ YL  SN 
Sbjct: 301 PFISNADKFICTLLPESPTVSVSYSPGGLLFKPGGSNMQHATTLSFLLVVYARYLKLSN- 360

Query: 378 KRLLHCGNVVASPSRLRQVAKGQVDYILGSNPLGMSYMVGYGNKFPQRIHHRGSSLPSMA 437
            R++HCGNVVASP+RL ++AKGQVDYILGSNP GMSYMVGYG KFPQRIHHRGSSLPS+ 
Sbjct: 361 -RVVHCGNVVASPARLVKLAKGQVDYILGSNPFGMSYMVGYGKKFPQRIHHRGSSLPSVG 420

Query: 438 KHPEPIACAKGKPYFQSNNPNPNLLIGAVVGGPDFKDSYADSRLDFVYSEPTTYINAPLV 494
           +HP+ I C  G  Y+ S NPN NLLIGAVVGGPD +DSYAD R DFV SEPTTYINAPLV
Sbjct: 421 QHPKQIDCKGGTDYYNSKNPNLNLLIGAVVGGPDIEDSYADFREDFVQSEPTTYINAPLV 480

BLAST of ClCG01G004790 vs. TrEMBL
Match: B9R9R3_RICCO (Endoglucanase OS=Ricinus communis GN=RCOM_1500170 PE=3 SV=1)

HSP 1 Score: 742.3 bits (1915), Expect = 3.9e-211
Identity = 368/501 (73.45%), Postives = 419/501 (83.63%), Query Frame = 1

Query: 5   LMKLLVMILMLKLLAMAVASHDYEDALTKSILFFEGQRSGKLPPNQRVTWRKDSALHDGL 64
           L+ LL  I+M  ++A+ V SHDY DALTKSILFFEGQRSGKLPPNQR+TWRKDSAL DG 
Sbjct: 8   LLGLLGGIVMATVVAV-VDSHDYGDALTKSILFFEGQRSGKLPPNQRMTWRKDSALRDGY 67

Query: 65  EFGVDLVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLL 124
           + GVDLVGGYYDAGDNVKF+FPMAF+TTML+WSV+EFGK MG D  +A+++I+WATDY L
Sbjct: 68  QIGVDLVGGYYDAGDNVKFNFPMAFSTTMLAWSVVEFGKFMGPDQKHALEAIQWATDYFL 127

Query: 125 KATSVPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSI 184
           KATS+PG VFAQVGDPY DH CWERPEDMDTPRTPYAVSKQFPGSEVS EIAAALAA+SI
Sbjct: 128 KATSIPGVVFAQVGDPYGDHNCWERPEDMDTPRTPYAVSKQFPGSEVSGEIAAALAAASI 187

Query: 185 VFKPLDRGYSARLLKRARMVFEFADTYRGSYNQSLGPWVCPFYCSYSGYEDELIWGAAWL 244
            F+P +R YSA+LLKRARM+FEFADTYRGSYN+SLG WVCPFYC YSGYEDELIWGAAWL
Sbjct: 188 AFRPSNRAYSAKLLKRARMIFEFADTYRGSYNESLGQWVCPFYCDYSGYEDELIWGAAWL 247

Query: 245 FKATKATFYWNYITNNINKVENNPAVDYVINAFQHYAAGSFAEFGWDTKHAGINVLISKF 304
           +KATK   YWNY+ NNI  +EN  AV   I+    Y+ GSFAEFGWD KHAGINVL+S+ 
Sbjct: 248 YKATKDPNYWNYVVNNIKDLEN--AVVKNIDGVS-YSGGSFAEFGWDAKHAGINVLVSRL 307

Query: 305 GMSGNDAS-NMFINNADKFICSVLPESPSVSVSYSP--------GSNMQHSTTLSFLLLV 364
             S   +S + FI NADKF+CSVLPESP+VSVSYSP        GSN QH+T LSFLLL 
Sbjct: 308 LKSAKASSLDPFIPNADKFVCSVLPESPTVSVSYSPGGFLFKPGGSNSQHATALSFLLLA 367

Query: 365 YSNYLNQSNSKRLLHCGNVVASPSRLRQVAKGQVDYILGSNPLGMSYMVGYGNKFPQRIH 424
           YS YLNQ+N  R++HCGNVVA+ +RL Q A+ QVDYILGSNP+ MSYMVGYG KFP RIH
Sbjct: 368 YSRYLNQAN--RVIHCGNVVATSARLVQFARIQVDYILGSNPMKMSYMVGYGQKFPLRIH 427

Query: 425 HRGSSLPSMAKHPEPIACAKGKPYFQSNNPNPNLLIGAVVGGPDFKDSYADSRLDFVYSE 484
           HRGSSLPS+ +HP  I C  G PY+ SNNP+PNLL+GAVVGGPD KDSYADSR DFV+SE
Sbjct: 428 HRGSSLPSVNQHPGRIDCQGGTPYYNSNNPDPNLLVGAVVGGPDVKDSYADSRPDFVHSE 487

Query: 485 PTTYINAPLVGLLAYFKSHPN 497
           PTTYINAPLVG+LAYF+SHP+
Sbjct: 488 PTTYINAPLVGVLAYFRSHPS 502

BLAST of ClCG01G004790 vs. TrEMBL
Match: B9ICT3_POPTR (Endoglucanase OS=Populus trichocarpa GN=GH9B8 PE=2 SV=1)

HSP 1 Score: 735.3 bits (1897), Expect = 4.8e-209
Identity = 357/490 (72.86%), Postives = 408/490 (83.27%), Query Frame = 1

Query: 17  LLAMAVASHDYEDALTKSILFFEGQRSGKLPPNQRVTWRKDSALHDGLEFGVDLVGGYYD 76
           ++ M VASHDY DALTKSILFFEGQRSGKLPP QR+TWRKDS L DG + GVDLVGGYYD
Sbjct: 2   VMVMRVASHDYGDALTKSILFFEGQRSGKLPPTQRMTWRKDSGLQDGFQIGVDLVGGYYD 61

Query: 77  AGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKATSVPGFVFAQ 136
           AGDNVKF+FPMAF+TTML+WSVL+FG  MG DLP+A+++I+WATDY LKATS+PGFVF Q
Sbjct: 62  AGDNVKFNFPMAFSTTMLAWSVLDFGNFMGPDLPHALEAIKWATDYFLKATSIPGFVFVQ 121

Query: 137 VGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSIVFKPLDRGYSAR 196
           VGDPY DH CWERPEDMDTPR PYA SKQFPGSEVSAEIAAALAASS+VF+P +  YSAR
Sbjct: 122 VGDPYGDHNCWERPEDMDTPRIPYAASKQFPGSEVSAEIAAALAASSMVFRPSNPAYSAR 181

Query: 197 LLKRARMVFEFADTYRGSYNQSLGPWVCPFYCSYSGYEDELIWGAAWLFKATKATFYWNY 256
           LLKRA MVFEFAD  RGSYN +LGPWVCPFYC +SGYEDELIWGAAWL++ATKA  YW+Y
Sbjct: 182 LLKRAAMVFEFADANRGSYNDTLGPWVCPFYCDFSGYEDELIWGAAWLYRATKAPNYWSY 241

Query: 257 ITNNINKVENNPA--VDYVINAFQHYAAGSFAEFGWDTKHAGINVLISKFGMSGNDAS-N 316
           +  NI+ +E N A   D V      Y  GSFAEFGWDTK+AGIN+L+SK  +S   +   
Sbjct: 242 VVQNISNLEKNVAKHTDRV-----GYGGGSFAEFGWDTKNAGINILVSKLLLSSKTSDVG 301

Query: 317 MFINNADKFICSVLPESPSVSVSYSP--------GSNMQHSTTLSFLLLVYSNYLNQSNS 376
            FI NADKF+C+VLPESP+V VSYSP        GSN+QH+T LSFLLL Y+ YLNQSN 
Sbjct: 302 PFIPNADKFVCTVLPESPTVYVSYSPGGLLFKPGGSNLQHATALSFLLLAYARYLNQSN- 361

Query: 377 KRLLHCGNVVASPSRLRQVAKGQVDYILGSNPLGMSYMVGYGNKFPQRIHHRGSSLPSMA 436
            R +HCGNVVA+P+RL Q A+GQVDYILG+NPL MSYMVGYG+KFP++IHHRGSSLPS+ 
Sbjct: 362 -REIHCGNVVATPARLIQFARGQVDYILGTNPLKMSYMVGYGSKFPRKIHHRGSSLPSVD 421

Query: 437 KHPEPIACAKGKPYFQSNNPNPNLLIGAVVGGPDFKDSYADSRLDFVYSEPTTYINAPLV 496
           +HP  I C  G PYFQSN+PNPNLLIGAVVGGPD  DSY+DSR DFV++EPTTYINAPLV
Sbjct: 422 QHPASINCQGGTPYFQSNDPNPNLLIGAVVGGPDKGDSYSDSRADFVHTEPTTYINAPLV 481

BLAST of ClCG01G004790 vs. TrEMBL
Match: L0ASN3_POPTO (Endoglucanase OS=Populus tomentosa PE=3 SV=1)

HSP 1 Score: 735.3 bits (1897), Expect = 4.8e-209
Identity = 357/490 (72.86%), Postives = 408/490 (83.27%), Query Frame = 1

Query: 17  LLAMAVASHDYEDALTKSILFFEGQRSGKLPPNQRVTWRKDSALHDGLEFGVDLVGGYYD 76
           ++ M VASHDY DALTKSILFFEGQRSGKLPP QR+TWRKDS L DG + GVDLVGGYYD
Sbjct: 2   VMVMRVASHDYGDALTKSILFFEGQRSGKLPPTQRMTWRKDSGLQDGFQIGVDLVGGYYD 61

Query: 77  AGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKATSVPGFVFAQ 136
           AGDNVKF+FPMAF+TTML+WSVL+FG  MG DLP+A+++I+WATDY LKATS+PGFVF Q
Sbjct: 62  AGDNVKFNFPMAFSTTMLAWSVLDFGNFMGPDLPHALEAIKWATDYFLKATSIPGFVFVQ 121

Query: 137 VGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSIVFKPLDRGYSAR 196
           VGDPY DH CWERPEDMDTPR PYA SKQFPGSEVSAEIAAALAASS+VF+P +  YSAR
Sbjct: 122 VGDPYGDHNCWERPEDMDTPRIPYAASKQFPGSEVSAEIAAALAASSMVFRPSNPAYSAR 181

Query: 197 LLKRARMVFEFADTYRGSYNQSLGPWVCPFYCSYSGYEDELIWGAAWLFKATKATFYWNY 256
           LLKRA MVFEFAD  RGSYN +LGPWVCPFYC +SGYEDELIWGAAWL++ATKA  YW+Y
Sbjct: 182 LLKRAAMVFEFADANRGSYNDTLGPWVCPFYCDFSGYEDELIWGAAWLYRATKAPNYWSY 241

Query: 257 ITNNINKVENNPA--VDYVINAFQHYAAGSFAEFGWDTKHAGINVLISKFGMSGNDAS-N 316
           +  NI+ +E N A   D V      Y  GSFAEFGWDTK+AGIN+L+SK  +S   +   
Sbjct: 242 VVQNISNLEKNVAKHTDRV-----GYGGGSFAEFGWDTKNAGINILVSKLLLSSKTSDVG 301

Query: 317 MFINNADKFICSVLPESPSVSVSYSP--------GSNMQHSTTLSFLLLVYSNYLNQSNS 376
            FI NADKF+C+VLPESP+V VSYSP        GSN+QH+T LSFLLL Y+ YLNQSN 
Sbjct: 302 PFIPNADKFVCTVLPESPTVYVSYSPGGLLFKPGGSNLQHATALSFLLLAYARYLNQSN- 361

Query: 377 KRLLHCGNVVASPSRLRQVAKGQVDYILGSNPLGMSYMVGYGNKFPQRIHHRGSSLPSMA 436
            R +HCGNVVA+P+RL Q A+GQVDYILG+NPL MSYMVGYG+KFP++IHHRGSSLPS+ 
Sbjct: 362 -REIHCGNVVATPARLIQFARGQVDYILGTNPLKMSYMVGYGSKFPRKIHHRGSSLPSVD 421

Query: 437 KHPEPIACAKGKPYFQSNNPNPNLLIGAVVGGPDFKDSYADSRLDFVYSEPTTYINAPLV 496
           +HP  I C  G PYFQSN+PNPNLLIGAVVGGPD  DSY+DSR DFV++EPTTYINAPLV
Sbjct: 422 QHPASINCQGGTPYFQSNDPNPNLLIGAVVGGPDKGDSYSDSRADFVHTEPTTYINAPLV 481

BLAST of ClCG01G004790 vs. TrEMBL
Match: A0A0D2PJD4_GOSRA (Endoglucanase OS=Gossypium raimondii GN=B456_001G048100 PE=3 SV=1)

HSP 1 Score: 728.4 bits (1879), Expect = 5.9e-207
Identity = 355/500 (71.00%), Postives = 407/500 (81.40%), Query Frame = 1

Query: 6   MKLLVMILMLKLLAMA---VASHDYEDALTKSILFFEGQRSGKLPPNQRVTWRKDSALHD 65
           M L+  + +L + AMA   VASHDY  ALTKSILF+EGQRSGKLPP QR+TWRKDSAL D
Sbjct: 1   MSLIQRLSVLGMAAMAMGLVASHDYGAALTKSILFYEGQRSGKLPPTQRITWRKDSALRD 60

Query: 66  GLEFGVDLVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDY 125
           G E GVDLVGGYYDAGDNVKF+FPMAF+ TML+WS+LEFG+ +G+DL +++ +I+W TDY
Sbjct: 61  GFEIGVDLVGGYYDAGDNVKFTFPMAFSITMLAWSLLEFGQSLGTDLQHSLKAIQWGTDY 120

Query: 126 LLKATSVPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAAS 185
           LLKATSVPGFVFAQVGDPY DH CWERPEDMDTPRTPYAVSK+FPGSEVSAEIAAALAAS
Sbjct: 121 LLKATSVPGFVFAQVGDPYGDHNCWERPEDMDTPRTPYAVSKEFPGSEVSAEIAAALAAS 180

Query: 186 SIVFKPLDRGYSARLLKRARMVFEFADTYRGSYNQSLGPWVCPFYCSYSGYEDELIWGAA 245
           S+VF+P++RGYSARLLKRARM+FEFAD YRGSYN SLGPW CPFYC YSGY+DEL+WGAA
Sbjct: 181 SMVFRPINRGYSARLLKRARMIFEFADKYRGSYNDSLGPWACPFYCDYSGYQDELVWGAA 240

Query: 246 WLFKATKATFYWNYITNNINKVENNPAVDYVINAFQHYAAGSFAEFGWDTKHAGINVLIS 305
           WL +ATKA +Y NY+  NI  ++                + SFAEFGWDTKHAGINVL+S
Sbjct: 241 WLLRATKAPYYRNYVLANIQNLDK---------------SSSFAEFGWDTKHAGINVLVS 300

Query: 306 KFGMSGNDASNMFINNADKFICSVLPESPSVSVSYSP--------GSNMQHSTTLSFLLL 365
           +  +  +     FI NADKF+CSVLPESP++SVSYSP        GSN+QH+T LSFLLL
Sbjct: 301 R--LIKSQTPEPFITNADKFVCSVLPESPTISVSYSPGGLLIKPGGSNLQHATALSFLLL 360

Query: 366 VYSNYLNQSNSKRLLHCGNVVASPSRLRQVAKGQVDYILGSNPLGMSYMVGYGNKFPQRI 425
           VYS  L  S   R++HCGNVVA+P+RL QVA+ QVDYILGSNPL MSYMVGYG KFP+RI
Sbjct: 361 VYSRPL--SKDSRVIHCGNVVATPARLIQVARSQVDYILGSNPLNMSYMVGYGEKFPERI 420

Query: 426 HHRGSSLPSMAKHPEPIACAKGKPYFQSNNPNPNLLIGAVVGGPDFKDSYADSRLDFVYS 485
           HHRGSSLPS+ +HP+ I C  G  YF +NNPNPNLL GAVVGGPD KDSY DSR DF +S
Sbjct: 421 HHRGSSLPSITQHPQHIDCTGGATYFYTNNPNPNLLTGAVVGGPDIKDSYGDSRADFAHS 480

Query: 486 EPTTYINAPLVGLLAYFKSH 495
           EPTTYINAPLVGLLAYFKSH
Sbjct: 481 EPTTYINAPLVGLLAYFKSH 481

BLAST of ClCG01G004790 vs. TAIR10
Match: AT1G70710.1 (AT1G70710.1 glycosyl hydrolase 9B1)

HSP 1 Score: 585.1 bits (1507), Expect = 4.1e-167
Identity = 293/500 (58.60%), Postives = 359/500 (71.80%), Query Frame = 1

Query: 4   VLMKLLVMILMLKLLAMAVASHDYEDALTKSILFFEGQRSGKLPPNQRVTWRKDSALHDG 63
           ++  ++++ ++L    +  A HDY DAL KSILFFEGQRSGKLPP+QR+ WR+DSAL DG
Sbjct: 6   LIFPVILLAVLLFSPPIYSAGHDYRDALRKSILFFEGQRSGKLPPDQRLKWRRDSALRDG 65

Query: 64  LEFGVDLVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYL 123
              GVDL GGYYDAGDN+KF FPMAFTTTMLSWS+++FGK MG +L  A+ +++W TDYL
Sbjct: 66  SSAGVDLSGGYYDAGDNIKFGFPMAFTTTMLSWSIIDFGKTMGPELRNAVKAVKWGTDYL 125

Query: 124 LKATSVPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASS 183
           LKAT++PG VF QVGD Y+DH CWERPEDMDT RT Y + +  PGS+V+ E AAALAA+S
Sbjct: 126 LKATAIPGVVFVQVGDAYSDHNCWERPEDMDTLRTVYKIDRAHPGSDVAGETAAALAAAS 185

Query: 184 IVFKPLDRGYSARLLKRARMVFEFADTYRGSYNQSLGPWVCPFYCSYSGYEDELIWGAAW 243
           IVF+  D  YS  LL RA  VF FA+ YRG+Y+ SL   VCPFYC ++GY+DEL+WGAAW
Sbjct: 186 IVFRKRDPAYSRLLLDRATRVFAFANRYRGAYSNSLYHAVCPFYCDFNGYQDELLWGAAW 245

Query: 244 LFKATKATFYWNYITNNINKVENNPAVDYVINAFQHYAAGSFAEFGWDTKHAGINVLISK 303
           L KA++   Y  +I  N          + ++      A  +  EFGWD KHAGINVLISK
Sbjct: 246 LHKASRKRAYREFIVKN----------EVILK-----AGDTINEFGWDNKHAGINVLISK 305

Query: 304 FGMSGN-DASNMFINNADKFICSVLPESPSVSVSYS--------PGSNMQHSTTLSFLLL 363
             + G  +    F  NAD FICS+LP      V YS         GSNMQH T+LSFLLL
Sbjct: 306 EVLMGKAEYFESFKQNADGFICSILPGISHPQVQYSRGGLLVKTGGSNMQHVTSLSFLLL 365

Query: 364 VYSNYLNQSNSKRLLHCGNVVASPSRLRQVAKGQVDYILGSNPLGMSYMVGYGNKFPQRI 423
            YSNYL  S++K+++ CG + ASPS LRQ+AK QVDYILG NP+G+SYMVGYG KFP+RI
Sbjct: 366 AYSNYL--SHAKKVVPCGELTASPSLLRQIAKRQVDYILGDNPMGLSYMVGYGQKFPRRI 425

Query: 424 HHRGSSLPSMAKHPEPIACAKGKPYFQSNNPNPNLLIGAVVGGPDFKDSYADSRLDFVYS 483
           HHRGSS+PS++ HP  I C +G  YF S NPNPNLL+GAVVGGP+  D++ DSR  F  S
Sbjct: 426 HHRGSSVPSVSAHPSHIGCKEGSRYFLSPNPNPNLLVGAVVGGPNVTDAFPDSRPYFQQS 485

Query: 484 EPTTYINAPLVGLLAYFKSH 495
           EPTTYINAPLVGLL YF +H
Sbjct: 486 EPTTYINAPLVGLLGYFSAH 488

BLAST of ClCG01G004790 vs. TAIR10
Match: AT1G23210.1 (AT1G23210.1 glycosyl hydrolase 9B6)

HSP 1 Score: 582.4 bits (1500), Expect = 2.6e-166
Identity = 296/495 (59.80%), Postives = 350/495 (70.71%), Query Frame = 1

Query: 10  VMILMLKLLAMAV-ASHDYEDALTKSILFFEGQRSGKLPPNQRVTWRKDSALHDGLEFGV 69
           +M+ ML L++    A HDY DAL KSILFFEGQRSGKLPP+QR+ WR+DSAL DG   GV
Sbjct: 11  IMLAMLLLISPETYAGHDYRDALRKSILFFEGQRSGKLPPDQRLKWRRDSALRDGSSAGV 70

Query: 70  DLVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKATS 129
           DL GGYYDAGDNVKF FPMAFTTTM+SWSV++FGK MG +L  A+ +I+W TDYL+KAT 
Sbjct: 71  DLTGGYYDAGDNVKFGFPMAFTTTMMSWSVIDFGKTMGPELENAVKAIKWGTDYLMKATQ 130

Query: 130 VPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSIVFKP 189
           +P  VF QVGD Y+DH CWERPEDMDT RT Y + K   GSEV+ E AAALAA+SIVF+ 
Sbjct: 131 IPDVVFVQVGDAYSDHNCWERPEDMDTLRTVYKIDKDHSGSEVAGETAAALAAASIVFEK 190

Query: 190 LDRGYSARLLKRARMVFEFADTYRGSYNQSLGPWVCPFYCSYSGYEDELIWGAAWLFKAT 249
            D  YS  LL RA  VF FA  YRG+Y+ SL   VCPFYC ++GYEDEL+WGAAWL KA+
Sbjct: 191 RDPVYSKMLLDRATRVFAFAQKYRGAYSDSLYQAVCPFYCDFNGYEDELLWGAAWLHKAS 250

Query: 250 KATFYWNYITNNINKVENNPAVDYVINAFQHYAAGSFAEFGWDTKHAGINVLISKFGMSG 309
           K   Y  +I  N            ++      A  +  EFGWD KHAGINVL+SK  + G
Sbjct: 251 KKRVYREFIVKN----------QVILR-----AGDTIHEFGWDNKHAGINVLVSKMVLMG 310

Query: 310 N-DASNMFINNADKFICSVLPESPSVSVSYSP--------GSNMQHSTTLSFLLLVYSNY 369
             +    F  NAD+FICS+LP      V YS         GSNMQH T+LSFLLL YSNY
Sbjct: 311 KAEYFQSFKQNADEFICSLLPGISHPQVQYSQGGLLVKSGGSNMQHVTSLSFLLLTYSNY 370

Query: 370 LNQSNSKRLLHCGNVVASPSRLRQVAKGQVDYILGSNPLGMSYMVGYGNKFPQRIHHRGS 429
           L+ +N  +++ CG   ASP+ LRQVAK QVDYILG NP+ MSYMVGYG++FPQ+IHHRGS
Sbjct: 371 LSHAN--KVVPCGEFTASPALLRQVAKRQVDYILGDNPMKMSYMVGYGSRFPQKIHHRGS 430

Query: 430 SLPSMAKHPEPIACAKGKPYFQSNNPNPNLLIGAVVGGPDFKDSYADSRLDFVYSEPTTY 489
           S+PS+  HP+ I C  G  YF SNNPNPNLLIGAVVGGP+  D + DSR  F  +EPTTY
Sbjct: 431 SVPSVVDHPDRIGCKDGSRYFFSNNPNPNLLIGAVVGGPNITDDFPDSRPYFQLTEPTTY 488

Query: 490 INAPLVGLLAYFKSH 495
           INAPL+GLL YF +H
Sbjct: 491 INAPLLGLLGYFSAH 488

BLAST of ClCG01G004790 vs. TAIR10
Match: AT4G02290.1 (AT4G02290.1 glycosyl hydrolase 9B13)

HSP 1 Score: 550.1 bits (1416), Expect = 1.4e-156
Identity = 283/479 (59.08%), Postives = 334/479 (69.73%), Query Frame = 1

Query: 22  VASHDYEDALTKSILFFEGQRSGKLPPNQRVTWRKDSALHDGLEFGVDLVGGYYDAGDNV 81
           +A H+Y+DALTKSILFFEGQRSGKLP NQR++WR+DS L DG    VDLVGGYYDAGDN+
Sbjct: 48  LAKHNYKDALTKSILFFEGQRSGKLPSNQRMSWRRDSGLSDGSALHVDLVGGYYDAGDNI 107

Query: 82  KFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKATSVPGFVFAQVGDPY 141
           KF FPMAFTTTMLSWSV+EFG  M S+L  A  +IRWATDYLLKATS P  ++ QVGD  
Sbjct: 108 KFGFPMAFTTTMLSWSVIEFGGLMKSELQNAKIAIRWATDYLLKATSQPDTIYVQVGDAN 167

Query: 142 ADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSIVFKPLDRGYSARLLKRA 201
            DH CWERPEDMDT R+ + V K  PGS+V+AE AAALAA++IVF+  D  YS  LLKRA
Sbjct: 168 KDHSCWERPEDMDTVRSVFKVDKNIPGSDVAAETAAALAAAAIVFRKSDPSYSKVLLKRA 227

Query: 202 RMVFEFADTYRGSYNQSLGPWVCPFYCSYSGYEDELIWGAAWLFKATKATFYWNYITNNI 261
             VF FAD YRG+Y+  L P VCPFYCSYSGY+DEL+WGAAWL KATK   Y NYI    
Sbjct: 228 ISVFAFADKYRGTYSAGLKPDVCPFYCSYSGYQDELLWGAAWLQKATKNIKYLNYIK--- 287

Query: 262 NKVENNPAVDYVINAFQHYAAGSFAEFGWDTKHAGINVLISK-FGMSGNDASNMFINNAD 321
                       IN     AA     FGWD KHAG  +L++K F +      + +  +AD
Sbjct: 288 ------------INGQILGAAEYDNTFGWDNKHAGARILLTKAFLVQNVKTLHEYKGHAD 347

Query: 322 KFICSVLPESPSVSVSYSPG--------SNMQHSTTLSFLLLVYSNYLNQSNSKRLLHCG 381
            FICSV+P +P  S  Y+PG        +NMQ+ T+ SFLLL Y+ YL  +++K ++HCG
Sbjct: 348 NFICSVIPGAPFSSTQYTPGGLLFKMADANMQYVTSTSFLLLTYAKYL--TSAKTVVHCG 407

Query: 382 NVVASPSRLRQVAKGQVDYILGSNPLGMSYMVGYGNKFPQRIHHRGSSLPSMAKHPEPIA 441
             V +P RLR +AK QVDY+LG NPL MSYMVGYG KFP+RIHHRGSSLP +A HP  I 
Sbjct: 408 GSVYTPGRLRSIAKRQVDYLLGDNPLRMSYMVGYGPKFPRRIHHRGSSLPCVASHPAKIQ 467

Query: 442 CAKGKPYFQSNNPNPNLLIGAVVGGPDFKDSYADSRLDFVYSEPTTYINAPLVGLLAYF 492
           C +G     S +PNPN L+GAVVGGPD  D + D R D+  SEP TYIN+PLVG LAYF
Sbjct: 468 CHQGFAIMNSQSPNPNFLVGAVVGGPDQHDRFPDERSDYEQSEPATYINSPLVGALAYF 509

BLAST of ClCG01G004790 vs. TAIR10
Match: AT1G02800.1 (AT1G02800.1 cellulase 2)

HSP 1 Score: 543.1 bits (1398), Expect = 1.8e-154
Identity = 281/477 (58.91%), Postives = 332/477 (69.60%), Query Frame = 1

Query: 24  SHDYEDALTKSILFFEGQRSGKLPPNQRVTWRKDSALHDGLEFGVDLVGGYYDAGDNVKF 83
           +H+Y+DAL+KSILFFEGQRSGKLPPNQR+TWR +S L DG    VDLVGGYYDAGDN+KF
Sbjct: 41  NHNYKDALSKSILFFEGQRSGKLPPNQRMTWRSNSGLSDGSALNVDLVGGYYDAGDNMKF 100

Query: 84  SFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKATSVPGFVFAQVGDPYAD 143
            FPMAFTTTMLSWS++EFG  M S+LP A D+IRWATD+LLKATS P  ++ QVGDP  D
Sbjct: 101 GFPMAFTTTMLSWSLIEFGGLMKSELPNAKDAIRWATDFLLKATSHPDTIYVQVGDPNMD 160

Query: 144 HFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSIVFKPLDRGYSARLLKRARM 203
           H CWERPEDMDTPR+ + V K  PGS+++ EIAAALAA+SIVF+  D  YS  LL+RA  
Sbjct: 161 HACWERPEDMDTPRSVFKVDKNNPGSDIAGEIAAALAAASIVFRKCDPSYSNHLLQRAIT 220

Query: 204 VFEFADTYRGSYNQSLGPWVCPFYCSYSGYEDELIWGAAWLFKATKATFYWNYITNNINK 263
           VF FAD YRG Y+  L P VCPFYCSYSGY+DEL+WGAAWL KAT    Y NYI  N   
Sbjct: 221 VFTFADKYRGPYSAGLAPEVCPFYCSYSGYQDELLWGAAWLQKATNNPTYLNYIKAN--- 280

Query: 264 VENNPAVDYVINAFQHYAAGSFAE-FGWDTKHAGINVLISK-FGMSGNDASNMFINNADK 323
                         Q   A  F   F WD KH G  +L+SK F +    +   +  +AD 
Sbjct: 281 -------------GQILGADEFDNMFSWDNKHVGARILLSKEFLIQKVKSLEEYKEHADS 340

Query: 324 FICSVLPESPSVSVSYSPG--------SNMQHSTTLSFLLLVYSNYLNQSNSKRLLHCGN 383
           FICSVLP +   S  Y+PG        SNMQ+ T+ SFLLL Y+ YL  ++++ + +CG 
Sbjct: 341 FICSVLPGAS--SSQYTPGGLLFKMGESNMQYVTSTSFLLLTYAKYL--TSARTVAYCGG 400

Query: 384 VVASPSRLRQVAKGQVDYILGSNPLGMSYMVGYGNKFPQRIHHRGSSLPSMAKHPEPIAC 443
            V +P+RLR +AK QVDY+LG NPL MSYMVGYG K+P+RIHHRGSSLPS+A HP  I C
Sbjct: 401 SVVTPARLRSIAKKQVDYLLGGNPLKMSYMVGYGLKYPRRIHHRGSSLPSVAVHPTRIQC 460

Query: 444 AKGKPYFQSNNPNPNLLIGAVVGGPDFKDSYADSRLDFVYSEPTTYINAPLVGLLAY 491
             G   F S +PNPN L+GAVVGGPD  D + D R D+  SEP TYINAPLVG LAY
Sbjct: 461 HDGFSLFTSQSPNPNDLVGAVVGGPDQNDQFPDERSDYGRSEPATYINAPLVGALAY 497

BLAST of ClCG01G004790 vs. TAIR10
Match: AT4G39010.1 (AT4G39010.1 glycosyl hydrolase 9B18)

HSP 1 Score: 533.9 bits (1374), Expect = 1.1e-151
Identity = 273/491 (55.60%), Postives = 341/491 (69.45%), Query Frame = 1

Query: 20  MAVASHDYEDALTKSILFFEGQRSGKLPPNQRVTWRKDSALHDGLEFGVDLVGGYYDAGD 79
           M+   HDY DAL+KSILFFEGQRSG LP +QR+TWR++S L DG    +DL GGYYDAGD
Sbjct: 23  MSSNQHDYSDALSKSILFFEGQRSGYLPNDQRMTWRRNSGLSDGWTHNIDLTGGYYDAGD 82

Query: 80  NVKFSFPMAFTTTMLSWSVLEFGKDM-GSDLPYAMDSIRWATDYLLKATS-VPGFVFAQV 139
           NVKF+FPMAFTTTML+WSV+EFG+ M  S+L  ++ ++RW+++YLLK+ S +P  +F QV
Sbjct: 83  NVKFNFPMAFTTTMLAWSVIEFGEFMPSSELRNSLVALRWSSNYLLKSVSQLPNRIFVQV 142

Query: 140 GDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSIVFKPLDRGYSARL 199
           GDP ADH CWERPEDMDTPRT YAV+   P SEV+ E  AAL+A+SI F+  D GYS  L
Sbjct: 143 GDPIADHNCWERPEDMDTPRTAYAVNAPNPASEVAGETTAALSAASIAFRSSDPGYSQTL 202

Query: 200 LKRARMVFEFADTYRGSY--NQSLGPWVCPFYCSYSGYEDELIWGAAWLFKATKATFYWN 259
           L+ A   F+FAD YRG+Y  N  +   VCPFYC ++G++DEL+WGAAWL KAT    Y N
Sbjct: 203 LQNAVKTFQFADMYRGAYSSNDDIKNDVCPFYCDFNGFQDELLWGAAWLRKATGDESYLN 262

Query: 260 YITNNINKVENNPAVDYVINAFQHYAAGSFAEFGWDTKHAGINVLISKFGMSGNDAS-NM 319
           YI +N      N  VD               EFGWD K  G+NVL+SK  + GN  +   
Sbjct: 263 YIESNREPFGANDNVD---------------EFGWDNKVGGLNVLVSKEVIEGNMYNLEA 322

Query: 320 FINNADKFICSVLPESPSVSVSYSP--------GSNMQHSTTLSFLLLVYSNYLNQSNSK 379
           +  +A+ F+CS++PES    V Y+         GS +QH+TT+SFLLLVY+ YL++S+  
Sbjct: 323 YKASAESFMCSLVPESSGPHVEYTSAGLLYKPGGSQLQHATTISFLLLVYAQYLSRSSLS 382

Query: 380 RLLHCGNVVASPSRLRQVAKGQVDYILGSNPLGMSYMVGYGNKFPQRIHHRGSSLPSMAK 439
             L+CG +   P  LR++AK QVDYILG+NP+G+SYMVGYG ++P+RIHHRGSSLPS+  
Sbjct: 383 --LNCGTLTVPPDYLRRLAKKQVDYILGNNPMGLSYMVGYGERYPKRIHHRGSSLPSIVD 442

Query: 440 HPEPIACAKGKPYFQSNNPNPNLLIGAVVGGPDFKDSYADSRLDFVYSEPTTYINAPLVG 498
           HPE I C  G  YF S  PNPN+LIGAVVGGP   D Y D R DF  SEPTTYINAP VG
Sbjct: 443 HPEAIRCKDGSVYFNSTEPNPNVLIGAVVGGPGEDDMYDDDRSDFRKSEPTTYINAPFVG 496

BLAST of ClCG01G004790 vs. NCBI nr
Match: gi|659073248|ref|XP_008436961.1| (PREDICTED: endoglucanase 8-like [Cucumis melo])

HSP 1 Score: 879.0 bits (2270), Expect = 3.9e-252
Identity = 445/519 (85.74%), Postives = 466/519 (89.79%), Query Frame = 1

Query: 1   MALVLMK-LLVMILMLKLLAMAVASHDYEDALTKSILFFEGQRSGKLPPNQRVTWRKDSA 60
           MA+  MK  LVM+LMLKL+   V+SHDY DALTKSILFFEGQRSGKLPPNQRVTWRKDSA
Sbjct: 1   MAVFFMKYFLVMVLMLKLVV--VSSHDYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSA 60

Query: 61  LHDGLEFGVDLVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWA 120
           L DG+EFGVDLVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWA
Sbjct: 61  LRDGIEFGVDLVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWA 120

Query: 121 TDYLLKATSVPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAAL 180
           TDYLLKATSVPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAAL
Sbjct: 121 TDYLLKATSVPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAAL 180

Query: 181 AASSIVFKPLDRGYSARLLKRARMVFEFADTYRGSYNQSLGPWVCPFYCSYSGYEDELIW 240
           AASSIVFKPLD GYSARLLKRARMVFEFADTYRGSYN SLG WVCPFYCSYSGYEDELIW
Sbjct: 181 AASSIVFKPLDGGYSARLLKRARMVFEFADTYRGSYNDSLGRWVCPFYCSYSGYEDELIW 240

Query: 241 GAAWLFKATKATFYWNYITNNINKVENN----PAVDY--VI------NAFQHYAAGSFAE 300
           GAAWL+KATK+ FYWNYIT NINK+ENN     AVD+  VI      NAFQ Y++G+FAE
Sbjct: 241 GAAWLYKATKSAFYWNYITKNINKIENNNNNYAAVDHNNVINSKNNNNAFQRYSSGTFAE 300

Query: 301 FGWDTKHAGINVLISKFGMSGN-DASNMFINNADKFICSVLPESPSVSVSYSP------- 360
           FGWDTK+AGINVLISKF MSGN  +S+MFIN ADKFICSVLPESPS SVSYSP       
Sbjct: 301 FGWDTKYAGINVLISKFVMSGNGSSSSMFINYADKFICSVLPESPSPSVSYSPGGLLFKP 360

Query: 361 -GSNMQHSTTLSFLLLVYSNYLNQSNSKRLLHCGNVVASPSRLRQVAKGQVDYILGSNPL 420
            GSNMQHST LSFL++VYSNYLNQ   KR LHCGNVVASPSRL Q+AK QVDYILGSNPL
Sbjct: 361 GGSNMQHSTALSFLVVVYSNYLNQ--YKRTLHCGNVVASPSRLLQLAKTQVDYILGSNPL 420

Query: 421 GMSYMVGYGNKFPQRIHHRGSSLPSMAKHPEPIACAKGKPYFQSNNPNPNLLIGAVVGGP 480
           GMSYMVGYG KFPQRIHHRGSSLPSMA +P+ I CAKGK YFQSNNPNPNLLIGAVVGGP
Sbjct: 421 GMSYMVGYGKKFPQRIHHRGSSLPSMANYPQAIGCAKGKQYFQSNNPNPNLLIGAVVGGP 480

Query: 481 DFKDSYADSRLDFVYSEPTTYINAPLVGLLAYFKSHPNS 498
           DFKDSYADSR+DFVYSEPTTYINAPLVGLLAYFKSHPNS
Sbjct: 481 DFKDSYADSRVDFVYSEPTTYINAPLVGLLAYFKSHPNS 515

BLAST of ClCG01G004790 vs. NCBI nr
Match: gi|778700342|ref|XP_011654857.1| (PREDICTED: endoglucanase 4-like [Cucumis sativus])

HSP 1 Score: 868.2 bits (2242), Expect = 6.8e-249
Identity = 438/518 (84.56%), Postives = 461/518 (89.00%), Query Frame = 1

Query: 1   MALVLMK-LLVMILMLKLLAMAVASHDYEDALTKSILFFEGQRSGKLPPNQRVTWRKDSA 60
           MA+  MK  LVMILMLKL+   V+SHDY DALTKSILFFEGQRSGKLPPNQRVTWRKDSA
Sbjct: 1   MAIFFMKYFLVMILMLKLVV--VSSHDYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSA 60

Query: 61  LHDGLEFGVDLVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWA 120
           L DGLEFGVDLVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWA
Sbjct: 61  LRDGLEFGVDLVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWA 120

Query: 121 TDYLLKATSVPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAAL 180
           TDYLLKATSVPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAAL
Sbjct: 121 TDYLLKATSVPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAAL 180

Query: 181 AASSIVFKPLDRGYSARLLKRARMVFEFADTYRGSYNQSLGPWVCPFYCSYSGYEDELIW 240
           AASS+VFKPLD GYSARLLKRARMVFEFADTYRGSYN SLG WVCPFYCSYSGYEDELIW
Sbjct: 181 AASSMVFKPLDGGYSARLLKRARMVFEFADTYRGSYNDSLGRWVCPFYCSYSGYEDELIW 240

Query: 241 GAAWLFKATKATFYWNYITNNINKVE---NNPAVDY--VINA-----FQHYAAGSFAEFG 300
           GAAWL+KATK  FYWNYIT NIN ++   NNPAVDY  VIN+     FQHY++G+FAEFG
Sbjct: 241 GAAWLYKATKTAFYWNYITKNINTIKNNNNNPAVDYNNVINSKTDNVFQHYSSGNFAEFG 300

Query: 301 WDTKHAGINVLISKFGMS---GNDASNMFINNADKFICSVLPESPSVSVSY--------S 360
           WDTK+AGINVLISKF +S   G+ +SNMFIN ADKF+CSVLPESPS+ VSY        S
Sbjct: 301 WDTKYAGINVLISKFVLSTGNGSSSSNMFINYADKFVCSVLPESPSLLVSYSRGGLLFKS 360

Query: 361 PGSNMQHSTTLSFLLLVYSNYLNQSNSKRLLHCGNVVASPSRLRQVAKGQVDYILGSNPL 420
            GSN+QHST LSFLL+VYSNYLNQ   K +LHCGNVVASPSRL Q+AK QVDYILGSNPL
Sbjct: 361 GGSNIQHSTALSFLLIVYSNYLNQ--YKHILHCGNVVASPSRLLQLAKTQVDYILGSNPL 420

Query: 421 GMSYMVGYGNKFPQRIHHRGSSLPSMAKHPEPIACAKGKPYFQSNNPNPNLLIGAVVGGP 480
           GMSYMVGYG  FPQRIHHRGSSLPSMA +P+ I CAKGK YFQSNNPNPNLLIGAVVGGP
Sbjct: 421 GMSYMVGYGKNFPQRIHHRGSSLPSMANYPQAIGCAKGKQYFQSNNPNPNLLIGAVVGGP 480

Query: 481 DFKDSYADSRLDFVYSEPTTYINAPLVGLLAYFKSHPN 497
           DF DSYADSR DFVYSEPTTYINAPLVGLLAYFKSHPN
Sbjct: 481 DFNDSYADSRPDFVYSEPTTYINAPLVGLLAYFKSHPN 514

BLAST of ClCG01G004790 vs. NCBI nr
Match: gi|645257380|ref|XP_008234387.1| (PREDICTED: endoglucanase 8-like [Prunus mume])

HSP 1 Score: 754.6 bits (1947), Expect = 1.1e-214
Identity = 369/496 (74.40%), Postives = 417/496 (84.07%), Query Frame = 1

Query: 9   LVMILMLKLLAMAVAS--HDYEDALTKSILFFEGQRSGKLPPNQRVTWRKDSALHDGLEF 68
           LV+ +M  ++   V S  HDY DAL+KSILFFEGQRSGKLP +QR+TWRKDSAL DG E 
Sbjct: 10  LVVAVMAAMVMRVVCSQSHDYGDALSKSILFFEGQRSGKLPSSQRMTWRKDSALRDGYEI 69

Query: 69  GVDLVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKA 128
           GVDLVGGYYDAGDNVKF+FPMAF+TTML+WSVLEFGK M SDL +A+D+IRWATDY LKA
Sbjct: 70  GVDLVGGYYDAGDNVKFNFPMAFSTTMLAWSVLEFGKGMSSDLQHALDAIRWATDYFLKA 129

Query: 129 TSVPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSIVF 188
           TS+PGFVF QVGDPY DH CWERPEDMDTPRTP+AVSKQFPGSEVSAEIAAALAAS++VF
Sbjct: 130 TSIPGFVFVQVGDPYGDHNCWERPEDMDTPRTPFAVSKQFPGSEVSAEIAAALAASAMVF 189

Query: 189 KPLDRGYSARLLKRARMVFEFADTYRGSYNQSLGPWVCPFYCSYSGYEDELIWGAAWLFK 248
           +P+D  YSARLLKRARMVF+FAD Y+GSYN SLGPWVCPFYC +SGYEDEL+WGAAWLFK
Sbjct: 190 RPIDLKYSARLLKRARMVFDFADKYQGSYNDSLGPWVCPFYCDFSGYEDELVWGAAWLFK 249

Query: 249 ATKATFYWNYITNNINKVENNPAVDYVINAFQHYAAGSFAEFGWDTKHAGINVLISKFGM 308
           ATK   YWNY+  NINK+E++  V Y+      Y  GSFAEFGWD+KHAGINVL+SK  M
Sbjct: 250 ATKQPIYWNYVLQNINKLESSATVKYINGV--SYLGGSFAEFGWDSKHAGINVLVSKLIM 309

Query: 309 S-GNDASNMFINNADKFICSVLPESPSVSVSYSP--------GSNMQHSTTLSFLLLVYS 368
           S     S  FI+NADKFIC++LPESP+VSVSYSP        GSNMQH+TTLSFLL+VY+
Sbjct: 310 SMAGTGSTPFISNADKFICTLLPESPTVSVSYSPGGLLFKPGGSNMQHATTLSFLLVVYA 369

Query: 369 NYLNQSNSKRLLHCGNVVASPSRLRQVAKGQVDYILGSNPLGMSYMVGYGNKFPQRIHHR 428
            YL  S+  R++HCGNVVASP+RL ++AKGQVDYILGSNPLGMSYMVGYG KFPQRIHHR
Sbjct: 370 RYLKLSS--RVVHCGNVVASPARLVKLAKGQVDYILGSNPLGMSYMVGYGKKFPQRIHHR 429

Query: 429 GSSLPSMAKHPEPIACAKGKPYFQSNNPNPNLLIGAVVGGPDFKDSYADSRLDFVYSEPT 488
           GSSLPS+ +HP+ I C  G  Y+ S NPNPNLLIGAVVGGPD KDSYADSR DFV SEPT
Sbjct: 430 GSSLPSVGQHPKQIDCKGGTDYYNSKNPNPNLLIGAVVGGPDIKDSYADSREDFVQSEPT 489

Query: 489 TYINAPLVGLLAYFKS 494
           TYINAPLVG+LAYF S
Sbjct: 490 TYINAPLVGVLAYFNS 501

BLAST of ClCG01G004790 vs. NCBI nr
Match: gi|596022028|ref|XP_007219066.1| (hypothetical protein PRUPE_ppa014785mg [Prunus persica])

HSP 1 Score: 748.8 bits (1932), Expect = 6.0e-213
Identity = 366/488 (75.00%), Postives = 410/488 (84.02%), Query Frame = 1

Query: 18  LAMAVA---SHDYEDALTKSILFFEGQRSGKLPPNQRVTWRKDSALHDGLEFGVDLVGGY 77
           + M VA   SHDY DAL KSILFFEGQRSGKLP +QR+TWRKDSAL DG E GVDLVGGY
Sbjct: 1   MVMGVACSQSHDYGDALNKSILFFEGQRSGKLPSSQRMTWRKDSALRDGYEIGVDLVGGY 60

Query: 78  YDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKATSVPGFVF 137
           YDAGDNVKF+FPMAF+TTML+WSVLEFGK M SDLP+A+D+IRWATDY LKATS+PGFVF
Sbjct: 61  YDAGDNVKFNFPMAFSTTMLAWSVLEFGKGMSSDLPHALDAIRWATDYFLKATSIPGFVF 120

Query: 138 AQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSIVFKPLDRGYS 197
            QVGDPY DH CWERPEDMDTPRTP+AVSKQFPGSEVSAEIAAALAAS++VF+P+D  YS
Sbjct: 121 VQVGDPYGDHNCWERPEDMDTPRTPFAVSKQFPGSEVSAEIAAALAASAMVFRPIDLKYS 180

Query: 198 ARLLKRARMVFEFADTYRGSYNQSLGPWVCPFYCSYSGYEDELIWGAAWLFKATKATFYW 257
           ARLLKRARMVF+FAD Y+GSYN SLGPWVCPFYC +SGYEDEL+WGAAWLFKATK   YW
Sbjct: 181 ARLLKRARMVFDFADKYQGSYNDSLGPWVCPFYCDFSGYEDELVWGAAWLFKATKQPIYW 240

Query: 258 NYITNNINKVENNPAVDYVINAFQHYAAGSFAEFGWDTKHAGINVLISKFGMS-GNDASN 317
           NY+  NINK+E++  V Y+      Y  GSFAEFGWD+KHAGINVL+SK  MS     S 
Sbjct: 241 NYVLQNINKLESSATVKYINGV--SYLGGSFAEFGWDSKHAGINVLVSKLIMSMAGTGST 300

Query: 318 MFINNADKFICSVLPESPSVSVSYSP--------GSNMQHSTTLSFLLLVYSNYLNQSNS 377
            FI+NADKFIC++LPESP+VSVSYSP        GSNMQH+TTLSFLL+VY+ YL  SN 
Sbjct: 301 PFISNADKFICTLLPESPTVSVSYSPGGLLFKPGGSNMQHATTLSFLLVVYARYLKLSN- 360

Query: 378 KRLLHCGNVVASPSRLRQVAKGQVDYILGSNPLGMSYMVGYGNKFPQRIHHRGSSLPSMA 437
            R++HCGNVVASP+RL ++AKGQVDYILGSNP GMSYMVGYG KFPQRIHHRGSSLPS+ 
Sbjct: 361 -RVVHCGNVVASPARLVKLAKGQVDYILGSNPFGMSYMVGYGKKFPQRIHHRGSSLPSVG 420

Query: 438 KHPEPIACAKGKPYFQSNNPNPNLLIGAVVGGPDFKDSYADSRLDFVYSEPTTYINAPLV 494
           +HP+ I C  G  Y+ S NPN NLLIGAVVGGPD +DSYAD R DFV SEPTTYINAPLV
Sbjct: 421 QHPKQIDCKGGTDYYNSKNPNLNLLIGAVVGGPDIEDSYADFREDFVQSEPTTYINAPLV 480

BLAST of ClCG01G004790 vs. NCBI nr
Match: gi|764635349|ref|XP_011470097.1| (PREDICTED: endoglucanase 4-like isoform X2 [Fragaria vesca subsp. vesca])

HSP 1 Score: 744.6 bits (1921), Expect = 1.1e-211
Identity = 362/500 (72.40%), Postives = 421/500 (84.20%), Query Frame = 1

Query: 6   MKLLVMILMLKLLAMAVASHDYEDALTKSILFFEGQRSGKLPPNQRVTWRKDSALHDGLE 65
           ++L+V++  + +  +  AS DY DALTKSILFFEGQRSGK+P  QR+TWRKDSALHDGL+
Sbjct: 7   LRLVVLVAAMMVAMVVKASQDYGDALTKSILFFEGQRSGKIPSTQRMTWRKDSALHDGLQ 66

Query: 66  FGVDLVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLK 125
            GVDLVGGYYDAGDNVKFSFPMAF+TTML+WSVLEFGKDMG+DLP A+D+IRW TDY LK
Sbjct: 67  IGVDLVGGYYDAGDNVKFSFPMAFSTTMLAWSVLEFGKDMGADLPRALDAIRWGTDYFLK 126

Query: 126 ATSVPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSIV 185
           ATS+PGFVFAQVG+PY DH CWERPEDMDTPR PYAVSKQFPGSE+SAEIAAALAASS+V
Sbjct: 127 ATSIPGFVFAQVGEPYDDHNCWERPEDMDTPRNPYAVSKQFPGSELSAEIAAALAASSMV 186

Query: 186 FKPLDRGYSARLLKRARMVFEFADTYRGSYNQSLGPWVCPFYCSYSGYEDELIWGAAWLF 245
           FK +D  YSARLLKRARMVFEFA TY+GSYN +LGPWVCPFYC +SGYEDELIWGAAWLF
Sbjct: 187 FKHVDLQYSARLLKRARMVFEFAVTYQGSYNDALGPWVCPFYCDFSGYEDELIWGAAWLF 246

Query: 246 KATKATFYWNYITNNINKVENNPAVDYVINAFQHYA-AGSFAEFGWDTKHAGINVLISKF 305
           KATK   Y  Y+ +N++K+E   +V   +N   + +  GSF EFGWDTKHAGINVL SK 
Sbjct: 247 KATKIPTYSKYVLDNVHKLEK--SVMRNVNGLSYSSVGGSFTEFGWDTKHAGINVLASKL 306

Query: 306 GMSGND-ASNMFINNADKFICSVLPESPSVSVSYSP--------GSNMQHSTTLSFLLLV 365
            MS  D  S+ F+ N+DKFICS+LP+SP++SVSYSP        GSN+QH+TTLSFLL+V
Sbjct: 307 FMSSIDIGSSPFVINSDKFICSLLPDSPTLSVSYSPGGLLFKPGGSNLQHATTLSFLLVV 366

Query: 366 YSNYLNQSNSKRLLHCGNVVASPSRLRQVAKGQVDYILGSNPLGMSYMVGYGNKFPQRIH 425
           YS YL++SN  R++HC NVVA+P+RL Q+AKGQVDYILGSNPL MSYMVGYG KFPQRIH
Sbjct: 367 YSRYLSKSN--RVVHCNNVVATPARLVQIAKGQVDYILGSNPLNMSYMVGYGKKFPQRIH 426

Query: 426 HRGSSLPSMAKHPEPIACAKGKPYFQSNNPNPNLLIGAVVGGPDFKDSYADSRLDFVYSE 485
           HRGSSLPS+ +HP+ I C +G  YF S+NPNPN L GAVVGGPD KD+Y DSR+DFV+SE
Sbjct: 427 HRGSSLPSVGQHPQQIGCKEGSVYFGSSNPNPNELTGAVVGGPDIKDAYTDSRVDFVHSE 486

Query: 486 PTTYINAPLVGLLAYFKSHP 496
           PTTYINAP VG+LAYFKSHP
Sbjct: 487 PTTYINAPFVGVLAYFKSHP 502

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
GUN8_ARATH7.2e-16658.60Endoglucanase 8 OS=Arabidopsis thaliana GN=CEL1 PE=2 SV=1[more]
GUN4_ARATH4.7e-16559.80Endoglucanase 4 OS=Arabidopsis thaliana GN=At1g23210 PE=2 SV=1[more]
GUN17_ORYSJ6.3e-16258.84Endoglucanase 17 OS=Oryza sativa subsp. japonica GN=GLU13 PE=2 SV=1[more]
GUN6_ORYSJ3.5e-16058.07Endoglucanase 6 OS=Oryza sativa subsp. japonica GN=Os02g0733300 PE=2 SV=1[more]
GUN17_ARATH2.6e-15559.08Endoglucanase 17 OS=Arabidopsis thaliana GN=At4g02290 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
M5XMI2_PRUPE4.2e-21375.00Endoglucanase OS=Prunus persica GN=PRUPE_ppa014785mg PE=3 SV=1[more]
B9R9R3_RICCO3.9e-21173.45Endoglucanase OS=Ricinus communis GN=RCOM_1500170 PE=3 SV=1[more]
B9ICT3_POPTR4.8e-20972.86Endoglucanase OS=Populus trichocarpa GN=GH9B8 PE=2 SV=1[more]
L0ASN3_POPTO4.8e-20972.86Endoglucanase OS=Populus tomentosa PE=3 SV=1[more]
A0A0D2PJD4_GOSRA5.9e-20771.00Endoglucanase OS=Gossypium raimondii GN=B456_001G048100 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT1G70710.14.1e-16758.60 glycosyl hydrolase 9B1[more]
AT1G23210.12.6e-16659.80 glycosyl hydrolase 9B6[more]
AT4G02290.11.4e-15659.08 glycosyl hydrolase 9B13[more]
AT1G02800.11.8e-15458.91 cellulase 2[more]
AT4G39010.11.1e-15155.60 glycosyl hydrolase 9B18[more]
Match NameE-valueIdentityDescription
gi|659073248|ref|XP_008436961.1|3.9e-25285.74PREDICTED: endoglucanase 8-like [Cucumis melo][more]
gi|778700342|ref|XP_011654857.1|6.8e-24984.56PREDICTED: endoglucanase 4-like [Cucumis sativus][more]
gi|645257380|ref|XP_008234387.1|1.1e-21474.40PREDICTED: endoglucanase 8-like [Prunus mume][more]
gi|596022028|ref|XP_007219066.1|6.0e-21375.00hypothetical protein PRUPE_ppa014785mg [Prunus persica][more]
gi|764635349|ref|XP_011470097.1|1.1e-21172.40PREDICTED: endoglucanase 4-like isoform X2 [Fragaria vesca subsp. vesca][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001701Glyco_hydro_9
IPR0089286-hairpin_glycosidase_sf
IPR0123416hp_glycosidase-like_sf
IPR018221Glyco_hydro_9_His_AS
Vocabulary: Molecular Function
TermDefinition
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0003824catalytic activity
Vocabulary: Biological Process
TermDefinition
GO:0005975carbohydrate metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0030245 cellulose catabolic process
biological_process GO:0005982 starch metabolic process
biological_process GO:0005985 sucrose metabolic process
biological_process GO:0005975 carbohydrate metabolic process
biological_process GO:0000272 polysaccharide catabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0008810 cellulase activity
molecular_function GO:0003824 catalytic activity
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
ClCG01G004790.1ClCG01G004790.1mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001701Glycoside hydrolase family 9PFAMPF00759Glyco_hydro_9coord: 27..488
score: 7.8E
IPR008928Six-hairpin glycosidase-likeunknownSSF48208Six-hairpin glycosidasescoord: 23..494
score: 2.62E
IPR012341Six-hairpin glycosidaseGENE3DG3DSA:1.50.10.10coord: 24..492
score: 1.3E
IPR018221Glycoside hydrolase family 9, His active sitePROSITEPS00592GLYCOSYL_HYDROL_F9_1coord: 401..417
scor
NoneNo IPR availablePANTHERPTHR22298ENDO-1,4-BETA-GLUCANASEcoord: 1..495
score:
NoneNo IPR availablePANTHERPTHR22298:SF45SUBFAMILY NOT NAMEDcoord: 1..495
score:

The following gene(s) are paralogous to this gene:

None