CmoCh01G009100 (gene) Cucurbita moschata (Rifu)

NameCmoCh01G009100
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionEndoglucanase (3.2.1.4)
LocationCmo_Chr01 : 5127331 .. 5130102 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TCATTACCCGCTTCTCCTATGGCTGCCACCACAAACGCTCCCACGTTCTTCTTCTTCCTCCTCCTCTCGTCTTCCTTCTCGTTCTATACCGCTCGAGCCAACCCCAATTATCGAGACGCCTTGGCTAAGTCTTTATTGTTCTTCCAAGGACAACGCTCTGGTAGAATCCCAAATGGTCAACAAATCGCTTGGAGGTCCAACTCTGGTCTCTATGACGGTGAACTTGCTCATGTATGTCTTAAAAGTTTTGACATGTTCTTCCGGTACTCGTAATATTATGATGTTTTTTTACGGTTTAAATTATTACTCATGTTTTATTTTGTTATCATTTTCAAAAATTTTATTAATATATTTTAGAAGAAAAAGAGAAAAAAATGGAATTAAATTAAATTATTACTGTTTTGTTTTTTTTTTTCCTTAAAAAATTCGAACTTTACCAAAATCAAATCGTTGGGCAATTCTCAGGTGGATTTAACCGGCGGGTACTACGACGCCGGCGACAATGTCAAATTCAACTTACCGATGGCATTCACAACCACAATGCTTTCATGGGGAGCACTCGAGTACGGGGCGCGTATGGGCTCGGAGTTACCCAACACACGCGCCGCCATCCGTTGGGCCACGGACTACCTTCTCAAGTGCGCCACCGCCACACCCGGCAAACTCTACGTCGGCGTTGGCGACCCCAACGTCGACCACAAATGCTGGGAACGACCGGAGGACATGGATACAGTTAGGACAGTCTACTCTGTCTCTGCCGCAAACCCGGGGTCTGACGTGGCCGGCGAGACAGCCGCCGCTCTCGCCGCCGCGTCGTTGGTGTTCCGGCGGGTTGACAGAAAGTACTCCGGGCTGCTATTGGCGACGGCGAAGAAGGTGTTTCAGTTTGCCGTTGAGCATCGAGGGTCGTACAGTGATTCGCTTGGATCGGCTGTGTGTCCGTTTTATTGCTCTTATTCTGGGTATAAGGTAAAGATTCGGAATCTTTTTTTTTTTTTCTCAATTAAATTGAATAATTAAAAAATTAAGATATTAAATATTTTTTTATATGAAAAATAGGATGAGCTTGTGTGGGGAGCGACATGGCTACTAAGAGCAACAAATGATGTTCGATACTTCAATTTGTTGAAGTCATTGGGCGGCGACGATGTTCCCGACATCTTTAGTTGGGACAACAAATATGCTGGTGCTCATGTTCTTCTCTCCCGGGTAAATATTCCTTTCCCCTCTCTAGCGCCTCGAGAACTCGGGTAATTTGGATTATCCAACCCAATTCATATAAATCGGTTGAGTTGAGTTGGGTTGAGTTCAATTTTTGAAAAATTGAAAAATTTGAGTTGGTTTGTCGGTTGACATTTTATTGGTCGGACTGATTTGATCAACTCGACCCATTGAAAAATAGAGTTGCAACCCAATTCTAATATACCATATTTTTAATATTATGATGAAAATTAAAAATGTTTTACGTTCGTAATAGATGTTAATGTTGTGTTGTGACCGGTGAATATTTGATGTTTGAGTTTTATGATATTTTAGTTATTAATGTAACCTGTGTGGTAGAAAAGTGAAATTTTTTTTAAGTAATTAAAAAAAAGAACCATAGGATGACCCAACCCTTGGACTTACAACTCTTGAACCGGGTAAAATAGATGGAACCCTACTTGGTTGTTCGAGTCACTACTTGAAATTTCGGGATGGTTCAAAAGATTTCTCCAACCCAACTTGTGTACATACATACCGCTATTAAATGGTTGATTTCTATTGGTGTAGCGAGCATTACTGAACAATGACAAGAATTTCGATTCCTACAAACAAAAAGCTGAGTCATTCATGTGTCGGATTCTACCTAATTCTCCATATTCCAGCACTCAATACACACAAGGTATGCTATTTTCCGTCCGGTTTAAGCATAATTCAATCAGTGGTTCATAATTTAAGCTCGAGAATTAAAAAGCTTAGGATTTGTGTAGGAGGATTGATGTTCAAGTTGCCACAAAGTAACCTCCAATATGTGACATCCATAACATTTTTGCTGACAACATATTCCAAGTACATGTCCGCCGCCAAGCACACCTTCAACTGCGGCGGCGTTCTTGTCACTCCGACATCCCTCAAGAATCTAGCAAAGCAACAGGTAAATAACTACCAAACGCCGTCGTTTTTGGGCTCAACTTAATAACACTTTTTAATTAATTTAACAATTAAAATAAAATAATTCAGGTGGATTATATATTGGGAGTGAACCCATTGAAGATGTCATACATGGTGGGGTTTGGCAAAAACTTCCCCAAAAGAATCCACCACAGAGGATCTTCGCTGCCGTCCAAGGCCAGCCACCCTCAGGCCATCGGCTGCGACGGCGGTTTTCAGCCCTTTTTCTACTCCTACAACCCCAACCCCAACATCTTAACCGGCGCTGTCGTCGGCGGCCCCAATCAGAACGATGGCTTTCCCGACGACCGCTCTGATTACAGCCACTCCGAACCCGCCACGTACATCAACGCCGCCCTTGTTGGCCCCCTTGCCTTCTTTTCTGGCAAGACTAAATGATTTGATAAGATCCTATGTTATAACTACGTGAATATAGCTAGATTGATCCCGAATGTGCGGATAAAAATAGCTCTTTTCTTTTTCTAAATGTGTTCGTAGCTTATCATACCACCAAAAAAATAGGGTGATAAAGTGACGAATTGAATTAAAACTTCGAAGCTTTTCTTAAGAGATTCAAATCAAATCAAAATGTCTGTATAATTATATATTATGACTGTTCCA

mRNA sequence

TCATTACCCGCTTCTCCTATGGCTGCCACCACAAACGCTCCCACGTTCTTCTTCTTCCTCCTCCTCTCGTCTTCCTTCTCGTTCTATACCGCTCGAGCCAACCCCAATTATCGAGACGCCTTGGCTAAGTCTTTATTGTTCTTCCAAGGACAACGCTCTGGTAGAATCCCAAATGGTCAACAAATCGCTTGGAGGTCCAACTCTGGTCTCTATGACGGTGAACTTGCTCATGTGGATTTAACCGGCGGGTACTACGACGCCGGCGACAATGTCAAATTCAACTTACCGATGGCATTCACAACCACAATGCTTTCATGGGGAGCACTCGAGTACGGGGCGCGTATGGGCTCGGAGTTACCCAACACACGCGCCGCCATCCGTTGGGCCACGGACTACCTTCTCAAGTGCGCCACCGCCACACCCGGCAAACTCTACGTCGGCGTTGGCGACCCCAACGTCGACCACAAATGCTGGGAACGACCGGAGGACATGGATACAGTTAGGACAGTCTACTCTGTCTCTGCCGCAAACCCGGGGTCTGACGTGGCCGGCGAGACAGCCGCCGCTCTCGCCGCCGCGTCGTTGGTGTTCCGGCGGGTTGACAGAAAGTACTCCGGGCTGCTATTGGCGACGGCGAAGAAGGTGTTTCAGTTTGCCGTTGAGCATCGAGGGTCGTACAGTGATTCGCTTGGATCGGCTGTGTGTCCGTTTTATTGCTCTTATTCTGGGTATAAGGATGAGCTTGTGTGGGGAGCGACATGGCTACTAAGAGCAACAAATGATGTTCGATACTTCAATTTGTTGAAGTCATTGGGCGGCGACGATGTTCCCGACATCTTTAGTTGGGACAACAAATATGCTGGTGCTCATGTTCTTCTCTCCCGGCGAGCATTACTGAACAATGACAAGAATTTCGATTCCTACAAACAAAAAGCTGAGTCATTCATGTGTCGGATTCTACCTAATTCTCCATATTCCAGCACTCAATACACACAAGGAGGATTGATGTTCAAGTTGCCACAAAGTAACCTCCAATATGTGACATCCATAACATTTTTGCTGACAACATATTCCAAGTACATGTCCGCCGCCAAGCACACCTTCAACTGCGGCGGCGTTCTTGTCACTCCGACATCCCTCAAGAATCTAGCAAAGCAACAGGTGGATTATATATTGGGAGTGAACCCATTGAAGATGTCATACATGGTGGGGTTTGGCAAAAACTTCCCCAAAAGAATCCACCACAGAGGATCTTCGCTGCCGTCCAAGGCCAGCCACCCTCAGGCCATCGGCTGCGACGGCGGTTTTCAGCCCTTTTTCTACTCCTACAACCCCAACCCCAACATCTTAACCGGCGCTGTCGTCGGCGGCCCCAATCAGAACGATGGCTTTCCCGACGACCGCTCTGATTACAGCCACTCCGAACCCGCCACGTACATCAACGCCGCCCTTGTTGGCCCCCTTGCCTTCTTTTCTGGCAAGACTAAATGATTTGATAAGATCCTATGTTATAACTACGTGAATATAGCTAGATTGATCCCGAATGTGCGGATAAAAATAGCTCTTTTCTTTTTCTAAATGTGTTCGTAGCTTATCATACCACCAAAAAAATAGGGTGATAAAGTGACGAATTGAATTAAAACTTCGAAGCTTTTCTTAAGAGATTCAAATCAAATCAAAATGTCTGTATAATTATATATTATGACTGTTCCA

Coding sequence (CDS)

ATGGCTGCCACCACAAACGCTCCCACGTTCTTCTTCTTCCTCCTCCTCTCGTCTTCCTTCTCGTTCTATACCGCTCGAGCCAACCCCAATTATCGAGACGCCTTGGCTAAGTCTTTATTGTTCTTCCAAGGACAACGCTCTGGTAGAATCCCAAATGGTCAACAAATCGCTTGGAGGTCCAACTCTGGTCTCTATGACGGTGAACTTGCTCATGTGGATTTAACCGGCGGGTACTACGACGCCGGCGACAATGTCAAATTCAACTTACCGATGGCATTCACAACCACAATGCTTTCATGGGGAGCACTCGAGTACGGGGCGCGTATGGGCTCGGAGTTACCCAACACACGCGCCGCCATCCGTTGGGCCACGGACTACCTTCTCAAGTGCGCCACCGCCACACCCGGCAAACTCTACGTCGGCGTTGGCGACCCCAACGTCGACCACAAATGCTGGGAACGACCGGAGGACATGGATACAGTTAGGACAGTCTACTCTGTCTCTGCCGCAAACCCGGGGTCTGACGTGGCCGGCGAGACAGCCGCCGCTCTCGCCGCCGCGTCGTTGGTGTTCCGGCGGGTTGACAGAAAGTACTCCGGGCTGCTATTGGCGACGGCGAAGAAGGTGTTTCAGTTTGCCGTTGAGCATCGAGGGTCGTACAGTGATTCGCTTGGATCGGCTGTGTGTCCGTTTTATTGCTCTTATTCTGGGTATAAGGATGAGCTTGTGTGGGGAGCGACATGGCTACTAAGAGCAACAAATGATGTTCGATACTTCAATTTGTTGAAGTCATTGGGCGGCGACGATGTTCCCGACATCTTTAGTTGGGACAACAAATATGCTGGTGCTCATGTTCTTCTCTCCCGGCGAGCATTACTGAACAATGACAAGAATTTCGATTCCTACAAACAAAAAGCTGAGTCATTCATGTGTCGGATTCTACCTAATTCTCCATATTCCAGCACTCAATACACACAAGGAGGATTGATGTTCAAGTTGCCACAAAGTAACCTCCAATATGTGACATCCATAACATTTTTGCTGACAACATATTCCAAGTACATGTCCGCCGCCAAGCACACCTTCAACTGCGGCGGCGTTCTTGTCACTCCGACATCCCTCAAGAATCTAGCAAAGCAACAGGTGGATTATATATTGGGAGTGAACCCATTGAAGATGTCATACATGGTGGGGTTTGGCAAAAACTTCCCCAAAAGAATCCACCACAGAGGATCTTCGCTGCCGTCCAAGGCCAGCCACCCTCAGGCCATCGGCTGCGACGGCGGTTTTCAGCCCTTTTTCTACTCCTACAACCCCAACCCCAACATCTTAACCGGCGCTGTCGTCGGCGGCCCCAATCAGAACGATGGCTTTCCCGACGACCGCTCTGATTACAGCCACTCCGAACCCGCCACGTACATCAACGCCGCCCTTGTTGGCCCCCTTGCCTTCTTTTCTGGCAAGACTAAATGA
BLAST of CmoCh01G009100 vs. Swiss-Prot
Match: GUN9_ARATH (Endoglucanase 9 OS=Arabidopsis thaliana GN=CEL3 PE=1 SV=1)

HSP 1 Score: 780.0 bits (2013), Expect = 1.5e-224
Identity = 371/478 (77.62%), Postives = 417/478 (87.24%), Query Frame = 1

Query: 9   TFFFFLLLSSSFSFYTARANPNYRDALAKSLLFFQGQRSGRIPNGQQIAWRSNSGLYDGE 68
           + FFF+LL SS       ANPNY++AL+KSLLFFQGQRSG +P GQQI+WR++SGL DG 
Sbjct: 3   SLFFFVLLFSSLLISNGDANPNYKEALSKSLLFFQGQRSGPLPRGQQISWRASSGLSDGS 62

Query: 69  LAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRAAIRWATDYLL 128
            AHVDLTGGYYDAGDNVKFNLPMAFTTTMLSW ALEYG RMG EL N R  IRWATDYLL
Sbjct: 63  AAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWSALEYGKRMGPELENARVNIRWATDYLL 122

Query: 129 KCATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVYSVSAANPGSDVAGETAAALAAAS 188
           KCA ATPGKLYVGVGDPNVDHKCWERPEDMDT RTVYSVSA+NPGSDVA ETAAALAAAS
Sbjct: 123 KCARATPGKLYVGVGDPNVDHKCWERPEDMDTPRTVYSVSASNPGSDVAAETAAALAAAS 182

Query: 189 LVFRRVDRKYSGLLLATAKKVFQFAVEHRGSYSDSLGSAVCPFYCSYSGYKDELVWGATW 248
           +VFR+VD KYS LLLATAK V QFA++++G+YSDSL S+VCPFYCSYSGYKDEL+WGA+W
Sbjct: 183 MVFRKVDSKYSRLLLATAKDVMQFAIQYQGAYSDSLSSSVCPFYCSYSGYKDELMWGASW 242

Query: 249 LLRATNDVRYFNLLKSLGGDDVPDIFSWDNKYAGAHVLLSRRALLNNDKNFDSYKQKAES 308
           LLRATN+  Y N +KSLGG D PDIFSWDNKYAGA+VLLSRRALLN D NF+ YKQ AE+
Sbjct: 243 LLRATNNPYYANFIKSLGGGDQPDIFSWDNKYAGAYVLLSRRALLNKDSNFEQYKQAAEN 302

Query: 309 FMCRILPNSPYSSTQYTQGGLMFKLPQSNLQYVTSITFLLTTYSKYMSAAKHTFNCGGVL 368
           F+C+ILP+SP SSTQYTQGGLM+KLPQSNLQYVTSITFLLTTY+KYM A KHTFNCG  +
Sbjct: 303 FICKILPDSPSSSTQYTQGGLMYKLPQSNLQYVTSITFLLTTYAKYMKATKHTFNCGSSV 362

Query: 369 VTPTSLKNLAKQQVDYILGVNPLKMSYMVGFGKNFPKRIHHRGSSLPSKASHPQAIGCDG 428
           + P +L +L+K+QVDYILG NP+KMSYMVGF  NFPKRIHHR SSLPS A   Q++GC+G
Sbjct: 363 IVPNALISLSKRQVDYILGDNPIKMSYMVGFSSNFPKRIHHRASSLPSHALRSQSLGCNG 422

Query: 429 GFQPFFYSYNPNPNILTGAVVGGPNQNDGFPDDRSDYSHSEPATYINAALVGPLAFFS 487
           GFQ  FY+ NPNPNILTGA+VGGPNQNDG+PD R DYSH+EPATYINAA VGPLA+F+
Sbjct: 423 GFQS-FYTQNPNPNILTGAIVGGPNQNDGYPDQRDDYSHAEPATYINAAFVGPLAYFA 479

BLAST of CmoCh01G009100 vs. Swiss-Prot
Match: GUN3_ARATH (Endoglucanase 3 OS=Arabidopsis thaliana GN=CEL5 PE=2 SV=2)

HSP 1 Score: 764.6 bits (1973), Expect = 6.5e-220
Identity = 362/476 (76.05%), Postives = 413/476 (86.76%), Query Frame = 1

Query: 11  FFFLLLSSSFSFYTARANPNYRDALAKSLLFFQGQRSGRIPNGQQIAWRSNSGLYDGELA 70
           FFF+ L S+ S     A+PNYR+AL+KSLLFFQGQRSGR+P+ QQ++WRS+SGL DG  A
Sbjct: 5   FFFVFLLSALSLENTYASPNYREALSKSLLFFQGQRSGRLPSDQQLSWRSSSGLSDGSSA 64

Query: 71  HVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRAAIRWATDYLLKC 130
           HVDLTGGYYDAGDNVKFN PMAFTTTMLSW +LEYG +MG EL N+R AIRWATDYLLKC
Sbjct: 65  HVDLTGGYYDAGDNVKFNFPMAFTTTMLSWSSLEYGKKMGPELQNSRVAIRWATDYLLKC 124

Query: 131 ATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVYSVSAANPGSDVAGETAAALAAASLV 190
           A ATPGKLYVGVGDPN DHKCWERPEDMDT RTVYSVS +NPGSDVA ETAAALAA+S+V
Sbjct: 125 ARATPGKLYVGVGDPNGDHKCWERPEDMDTPRTVYSVSPSNPGSDVAAETAAALAASSMV 184

Query: 191 FRRVDRKYSGLLLATAKKVFQFAVEHRGSYSDSLGSAVCPFYCSYSGYKDELVWGATWLL 250
           FR+VD KYS LLLATAKKV QFA+++RG+YS+SL S+VCPFYCSYSGYKDEL+WGA WL 
Sbjct: 185 FRKVDPKYSRLLLATAKKVMQFAIQYRGAYSNSLSSSVCPFYCSYSGYKDELLWGAAWLH 244

Query: 251 RATNDVRYFNLLKSLGGDDVPDIFSWDNKYAGAHVLLSRRALLNNDKNFDSYKQKAESFM 310
           RATND  Y N +KSLGG D PDIFSWDNKYAGA+VLLSRRA+LN D NF+ YKQ AE+FM
Sbjct: 245 RATNDPYYTNFIKSLGGGDQPDIFSWDNKYAGAYVLLSRRAVLNKDNNFELYKQAAENFM 304

Query: 311 CRILPNSPYSSTQYTQGGLMFKLPQSNLQYVTSITFLLTTYSKYMSAAKHTFNCGGVLVT 370
           C+ILPNSP SST+YT+GGLM+KLPQSNLQYVTSITFLLTTY+KYM + K TFNCG  L+ 
Sbjct: 305 CKILPNSPSSSTKYTKGGLMYKLPQSNLQYVTSITFLLTTYAKYMKSTKQTFNCGNSLIV 364

Query: 371 PTSLKNLAKQQVDYILGVNPLKMSYMVGFGKNFPKRIHHRGSSLPSKASHPQAIGCDGGF 430
           P +L NL+K+QVDY+LGVNP+KMSYMVGF  NFPKRIHHRGSSLPS+A    ++GC+GGF
Sbjct: 365 PNALINLSKRQVDYVLGVNPMKMSYMVGFSSNFPKRIHHRGSSLPSRAVRSNSLGCNGGF 424

Query: 431 QPFFYSYNPNPNILTGAVVGGPNQNDGFPDDRSDYSHSEPATYINAALVGPLAFFS 487
           Q  F + NPNPNILTGA+VGGPNQND +PD R DY+ SEPATYINAA VGPLA+F+
Sbjct: 425 QS-FRTQNPNPNILTGAIVGGPNQNDEYPDQRDDYTRSEPATYINAAFVGPLAYFA 479

BLAST of CmoCh01G009100 vs. Swiss-Prot
Match: GUN11_ORYSJ (Endoglucanase 11 OS=Oryza sativa subsp. japonica GN=GLU4 PE=2 SV=3)

HSP 1 Score: 623.6 bits (1607), Expect = 1.8e-177
Identity = 298/464 (64.22%), Postives = 359/464 (77.37%), Query Frame = 1

Query: 25  ARANPNYRDALAKSLLFFQGQRSGRIPNGQQIAWRSNSGLYDGELAHVDLTGGYYDAGDN 84
           A  +P+Y DALAKS+LFFQGQRSGR+P  Q + WRSNSGL DG  A+VDLTGGYYD GDN
Sbjct: 33  AAGHPDYADALAKSILFFQGQRSGRLPPDQAVKWRSNSGLSDGSAANVDLTGGYYDGGDN 92

Query: 85  VKFNLPMAFTTTMLSWGALEYGARM-GSELPNTRAAIRWATDYLLKCATATPGKLYVGVG 144
           VKF  PMAFTTTMLSWG +EYG RM G  L + R A+RWA DYLL+ ATATPG LYVGVG
Sbjct: 93  VKFGFPMAFTTTMLSWGVVEYGGRMRGRVLRDARDAVRWAADYLLRAATATPGVLYVGVG 152

Query: 145 DPNVDHKCWERPEDMDTVRTVYSVSAANPGSDVAGETAAALAAASLVFRRVDRKYSGLLL 204
           DP+ DH+CWERPEDMDT R VYSVSA++PGSDVA ETAAALAAASL  R  D  YS  LL
Sbjct: 153 DPDADHRCWERPEDMDTPRAVYSVSASSPGSDVAAETAAALAAASLALRAADPGYSRRLL 212

Query: 205 ATAKKVFQFAVEHRGSYSDSLGSAVCPFYCSYSGYKDELVWGATWLLRATNDVRYFNLLK 264
           A A+ V  FAV H+G YSD +G  V  +Y SYSGY+DEL+WG+ WLL AT +  Y + L 
Sbjct: 213 AAARDVMAFAVRHQGKYSDHVGGDVGAYYASYSGYQDELLWGSAWLLWATRNASYLDYLA 272

Query: 265 SLGGDDVPDIFSWDNKYAGAHVLLSRRALLNNDKNFDSYKQKAESFMCRILPNSPYSSTQ 324
           SLG +D  D+FSWDNK AGA VLLSRRAL+N D+  D++++ AE F+CRILP SP S+TQ
Sbjct: 273 SLGANDGVDMFSWDNKLAGARVLLSRRALVNGDRRLDAFRRLAEDFICRILPGSPSSTTQ 332

Query: 325 YTQGGLMFKLPQSNLQYVTSITFLLTTYSKYMSAAKHTFNCGGVLVTPTSLKNLAKQQVD 384
           YT GG+M+K   +NLQYVTS +FLLTT++KYM+ + HTF+C  + VT  +L+ LA++QVD
Sbjct: 333 YTPGGMMYKSGHANLQYVTSASFLLTTFAKYMAVSNHTFSCQSLPVTAKTLRALARKQVD 392

Query: 385 YILGVNPLKMSYMVGFGKNFPKRIHHRGSSLPSKASHPQAIGCDGGFQPFFYSYNPNPNI 444
           YILG NP  MSYMVG+G  FP+RIHHRG+S+PS A++P  IGC  GF  +F +   NPN+
Sbjct: 393 YILGANPQGMSYMVGYGARFPQRIHHRGASMPSVAAYPAHIGCQEGFSGYFNAGGANPNV 452

Query: 445 LTGAVVGGPNQNDGFPDDRSDYSHSEPATYINAALVGPLAFFSG 488
            TGAVVGGP+Q+D FPD+R DY  SEP TY NAALVG LA+F+G
Sbjct: 453 HTGAVVGGPDQHDAFPDERGDYDRSEPTTYTNAALVGCLAYFAG 496

BLAST of CmoCh01G009100 vs. Swiss-Prot
Match: GUN1_ARATH (Endoglucanase 1 OS=Arabidopsis thaliana GN=CEL2 PE=2 SV=1)

HSP 1 Score: 600.5 bits (1547), Expect = 1.6e-170
Identity = 294/488 (60.25%), Postives = 366/488 (75.00%), Query Frame = 1

Query: 12  FFLLLSSSFSFYTARA---------NPNYRDALAKSLLFFQGQRSGRIPNGQQIAWRSNS 71
           F LLLS+ FS  ++R          N NY+DAL+KS+LFF+GQRSG++P  Q++ WRSNS
Sbjct: 16  FILLLSNGFSSSSSRPSIHHRHHLDNHNYKDALSKSILFFEGQRSGKLPPNQRMTWRSNS 75

Query: 72  GLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRAAIRW 131
           GL DG   +VDL GGYYDAGDN+KF  PMAFTTTMLSW  +E+G  M SELPN + AIRW
Sbjct: 76  GLSDGSALNVDLVGGYYDAGDNMKFGFPMAFTTTMLSWSLIEFGGLMKSELPNAKDAIRW 135

Query: 132 ATDYLLKCATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVYSVSAANPGSDVAGETAA 191
           ATD+LLK AT+ P  +YV VGDPN+DH CWERPEDMDT R+V+ V   NPGSD+AGE AA
Sbjct: 136 ATDFLLK-ATSHPDTIYVQVGDPNMDHACWERPEDMDTPRSVFKVDKNNPGSDIAGEIAA 195

Query: 192 ALAAASLVFRRVDRKYSGLLLATAKKVFQFAVEHRGSYSDSLGSAVCPFYCSYSGYKDEL 251
           ALAAAS+VFR+ D  YS  LL  A  VF FA ++RG YS  L   VCPFYCSYSGY+DEL
Sbjct: 196 ALAAASIVFRKCDPSYSNHLLQRAITVFTFADKYRGPYSAGLAPEVCPFYCSYSGYQDEL 255

Query: 252 VWGATWLLRATNDVRYFNLLKS----LGGDDVPDIFSWDNKYAGAHVLLSRRALLNNDKN 311
           +WGA WL +ATN+  Y N +K+    LG D+  ++FSWDNK+ GA +LLS+  L+   K+
Sbjct: 256 LWGAAWLQKATNNPTYLNYIKANGQILGADEFDNMFSWDNKHVGARILLSKEFLIQKVKS 315

Query: 312 FDSYKQKAESFMCRILPNSPYSSTQYTQGGLMFKLPQSNLQYVTSITFLLTTYSKYMSAA 371
            + YK+ A+SF+C +LP +  SS+QYT GGL+FK+ +SN+QYVTS +FLL TY+KY+++A
Sbjct: 316 LEEYKEHADSFICSVLPGA--SSSQYTPGGLLFKMGESNMQYVTSTSFLLLTYAKYLTSA 375

Query: 372 KHTFNCGGVLVTPTSLKNLAKQQVDYILGVNPLKMSYMVGFGKNFPKRIHHRGSSLPSKA 431
           +    CGG +VTP  L+++AK+QVDY+LG NPLKMSYMVG+G  +P+RIHHRGSSLPS A
Sbjct: 376 RTVAYCGGSVVTPARLRSIAKKQVDYLLGGNPLKMSYMVGYGLKYPRRIHHRGSSLPSVA 435

Query: 432 SHPQAIGCDGGFQPFFYSYNPNPNILTGAVVGGPNQNDGFPDDRSDYSHSEPATYINAAL 487
            HP  I C  GF   F S +PNPN L GAVVGGP+QND FPD+RSDY  SEPATYINA L
Sbjct: 436 VHPTRIQCHDGFS-LFTSQSPNPNDLVGAVVGGPDQNDQFPDERSDYGRSEPATYINAPL 495

BLAST of CmoCh01G009100 vs. Swiss-Prot
Match: GUN17_ARATH (Endoglucanase 17 OS=Arabidopsis thaliana GN=At4g02290 PE=2 SV=1)

HSP 1 Score: 589.0 bits (1517), Expect = 4.9e-167
Identity = 288/492 (58.54%), Postives = 363/492 (73.78%), Query Frame = 1

Query: 9   TFFFFLL-------LSSSFSFYTAR---ANPNYRDALAKSLLFFQGQRSGRIPNGQQIAW 68
           +FFFFL         SS F+ +  R   A  NY+DAL KS+LFF+GQRSG++P+ Q+++W
Sbjct: 21  SFFFFLCNGFSYPTTSSLFNTHHHRHHLAKHNYKDALTKSILFFEGQRSGKLPSNQRMSW 80

Query: 69  RSNSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRA 128
           R +SGL DG   HVDL GGYYDAGDN+KF  PMAFTTTMLSW  +E+G  M SEL N + 
Sbjct: 81  RRDSGLSDGSALHVDLVGGYYDAGDNIKFGFPMAFTTTMLSWSVIEFGGLMKSELQNAKI 140

Query: 129 AIRWATDYLLKCATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVYSVSAANPGSDVAG 188
           AIRWATDYLLK AT+ P  +YV VGD N DH CWERPEDMDTVR+V+ V    PGSDVA 
Sbjct: 141 AIRWATDYLLK-ATSQPDTIYVQVGDANKDHSCWERPEDMDTVRSVFKVDKNIPGSDVAA 200

Query: 189 ETAAALAAASLVFRRVDRKYSGLLLATAKKVFQFAVEHRGSYSDSLGSAVCPFYCSYSGY 248
           ETAAALAAA++VFR+ D  YS +LL  A  VF FA ++RG+YS  L   VCPFYCSYSGY
Sbjct: 201 ETAAALAAAAIVFRKSDPSYSKVLLKRAISVFAFADKYRGTYSAGLKPDVCPFYCSYSGY 260

Query: 249 KDELVWGATWLLRATNDVRYFNLLKS----LGGDDVPDIFSWDNKYAGAHVLLSRRALLN 308
           +DEL+WGA WL +AT +++Y N +K     LG  +  + F WDNK+AGA +LL++  L+ 
Sbjct: 261 QDELLWGAAWLQKATKNIKYLNYIKINGQILGAAEYDNTFGWDNKHAGARILLTKAFLVQ 320

Query: 309 NDKNFDSYKQKAESFMCRILPNSPYSSTQYTQGGLMFKLPQSNLQYVTSITFLLTTYSKY 368
           N K    YK  A++F+C ++P +P+SSTQYT GGL+FK+  +N+QYVTS +FLL TY+KY
Sbjct: 321 NVKTLHEYKGHADNFICSVIPGAPFSSTQYTPGGLLFKMADANMQYVTSTSFLLLTYAKY 380

Query: 369 MSAAKHTFNCGGVLVTPTSLKNLAKQQVDYILGVNPLKMSYMVGFGKNFPKRIHHRGSSL 428
           +++AK   +CGG + TP  L+++AK+QVDY+LG NPL+MSYMVG+G  FP+RIHHRGSSL
Sbjct: 381 LTSAKTVVHCGGSVYTPGRLRSIAKRQVDYLLGDNPLRMSYMVGYGPKFPRRIHHRGSSL 440

Query: 429 PSKASHPQAIGCDGGFQPFFYSYNPNPNILTGAVVGGPNQNDGFPDDRSDYSHSEPATYI 487
           P  ASHP  I C  GF     S +PNPN L GAVVGGP+Q+D FPD+RSDY  SEPATYI
Sbjct: 441 PCVASHPAKIQCHQGF-AIMNSQSPNPNFLVGAVVGGPDQHDRFPDERSDYEQSEPATYI 500

BLAST of CmoCh01G009100 vs. TrEMBL
Match: A0A0A0KW33_CUCSA (Endoglucanase OS=Cucumis sativus GN=Csa_5G648680 PE=3 SV=1)

HSP 1 Score: 896.0 bits (2314), Expect = 2.1e-257
Identity = 436/489 (89.16%), Postives = 461/489 (94.27%), Query Frame = 1

Query: 2   AATTNAPTFF--FFLLLSSSFSFYTARANPNYRDALAKSLLFFQGQRSGRIPNGQQIAWR 61
           +A +N+ T F  FFLLLS SF+   A A PNYRDALAKS+LFF+GQRSGRIP  Q+I WR
Sbjct: 3   SAISNSSTLFLLFFLLLSFSFAG-RALAGPNYRDALAKSILFFEGQRSGRIPANQRITWR 62

Query: 62  SNSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRAA 121
           SNSGLYDGEL HVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRAA
Sbjct: 63  SNSGLYDGELDHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRAA 122

Query: 122 IRWATDYLLKCATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVYSVSAANPGSDVAGE 181
           IRWATDYLLKCATATPGKLYVGVG+P+ DHKCWERPEDMDTVRTVYSVSA NPGSDVAGE
Sbjct: 123 IRWATDYLLKCATATPGKLYVGVGEPHADHKCWERPEDMDTVRTVYSVSAGNPGSDVAGE 182

Query: 182 TAAALAAASLVFRRVDRKYSGLLLATAKKVFQFAVEHRGSYSDSLGSAVCPFYCSYSGYK 241
           TAAALAAASLVFRRVDRKYS +LLATAKKV +FA+EHRGSYSDSL SAVCPFYCSYSGYK
Sbjct: 183 TAAALAAASLVFRRVDRKYSKVLLATAKKVMEFALEHRGSYSDSLSSAVCPFYCSYSGYK 242

Query: 242 DELVWGATWLLRATNDVRYFNLLKSLGGDDVPDIFSWDNKYAGAHVLLSRRALLNNDKNF 301
           DELVWGA WLLRATN+V+YFNLLKSLGGDDV DIFSWDNK+AGAHVLLSRR+LLNNDKNF
Sbjct: 243 DELVWGAAWLLRATNNVKYFNLLKSLGGDDVTDIFSWDNKFAGAHVLLSRRSLLNNDKNF 302

Query: 302 DSYKQKAESFMCRILPNSPYSSTQYTQGGLMFKLPQSNLQYVTSITFLLTTYSKYMSAAK 361
           DSYKQ+AE+FMCRILPNSP SSTQYTQG LMFKLP+SNLQYVTSITFLLTTYSKYMSAAK
Sbjct: 303 DSYKQEAEAFMCRILPNSPSSSTQYTQGRLMFKLPESNLQYVTSITFLLTTYSKYMSAAK 362

Query: 362 HTFNCGGVLVTPTSLKNLAKQQVDYILGVNPLKMSYMVGFGKNFPKRIHHRGSSLPSKAS 421
           HTFNCG ++VTP SLKNLAK QVDYILGVNPLKMSYMVGFGKN+PKRIHHRGSSLPSKA+
Sbjct: 363 HTFNCGNLVVTPASLKNLAKIQVDYILGVNPLKMSYMVGFGKNYPKRIHHRGSSLPSKAT 422

Query: 422 HPQAIGCDGGFQPFFYSYNPNPNILTGAVVGGPNQNDGFPDDRSDYSHSEPATYINAALV 481
           HPQAI CDGGFQPFFYSYNPNPNILTGAVVGGPNQ+DGFPDDR+DYSHSEPATYINAALV
Sbjct: 423 HPQAIACDGGFQPFFYSYNPNPNILTGAVVGGPNQSDGFPDDRTDYSHSEPATYINAALV 482

Query: 482 GPLAFFSGK 489
           GPLAFFSGK
Sbjct: 483 GPLAFFSGK 490

BLAST of CmoCh01G009100 vs. TrEMBL
Match: U5FH10_POPTR (Endoglucanase OS=Populus trichocarpa GN=POPTR_0019s09740g PE=3 SV=1)

HSP 1 Score: 823.2 bits (2125), Expect = 1.7e-235
Identity = 386/476 (81.09%), Postives = 430/476 (90.34%), Query Frame = 1

Query: 12  FFLLLSSSFSFYT-ARANPNYRDALAKSLLFFQGQRSGRIPNGQQIAWRSNSGLYDGELA 71
           F LL S S +      ANPNY+DALAKS+LFFQGQRSGR+P  QQ+AWRS+SGL DG  A
Sbjct: 7   FCLLFSLSLALLGFVHANPNYKDALAKSILFFQGQRSGRLPRSQQLAWRSDSGLSDGLFA 66

Query: 72  HVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRAAIRWATDYLLKC 131
           HVDLTGGYYDAGDNVKFN PMAFTTTMLSW  LEYG RMG ELPN RAAIRWATDYLLKC
Sbjct: 67  HVDLTGGYYDAGDNVKFNFPMAFTTTMLSWSTLEYGKRMGPELPNARAAIRWATDYLLKC 126

Query: 132 ATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVYSVSAANPGSDVAGETAAALAAASLV 191
           ATATPGKLYVGVGDPNVDHKCWERPEDMDT RTV+SVSA +PGSDVAGETAAALAAAS+V
Sbjct: 127 ATATPGKLYVGVGDPNVDHKCWERPEDMDTARTVFSVSARSPGSDVAGETAAALAAASMV 186

Query: 192 FRRVDRKYSGLLLATAKKVFQFAVEHRGSYSDSLGSAVCPFYCSYSGYKDELVWGATWLL 251
           FR+VDRKYS LLL TA+KVFQFA++++G+YSDSLGSAVCPFYCSYSGYKDEL+WGA WL 
Sbjct: 187 FRKVDRKYSALLLRTARKVFQFAMQYQGAYSDSLGSAVCPFYCSYSGYKDELLWGAAWLF 246

Query: 252 RATNDVRYFNLLKSLGGDDVPDIFSWDNKYAGAHVLLSRRALLNNDKNFDSYKQKAESFM 311
           RATN++ Y+N+ KSLG DD PD+FSWDNKYAG HVLLSRRALLNNDKNF+ ++ +AESFM
Sbjct: 247 RATNEMSYYNIFKSLGADDQPDLFSWDNKYAGVHVLLSRRALLNNDKNFEQFEGEAESFM 306

Query: 312 CRILPNSPYSSTQYTQGGLMFKLPQSNLQYVTSITFLLTTYSKYMSAAKHTFNCGGVLVT 371
           CRILPNSPY +TQYTQGGLM+KLP+SNLQYVTSITFLLTTY+KYM A +HTFNCG +LVT
Sbjct: 307 CRILPNSPYKTTQYTQGGLMYKLPESNLQYVTSITFLLTTYAKYMKATRHTFNCGNLLVT 366

Query: 372 PTSLKNLAKQQVDYILGVNPLKMSYMVGFGKNFPKRIHHRGSSLPSKASHPQAIGCDGGF 431
           P SL  +AK+QVDYILG NP++MSYMVGFG NFPKRIHHRGSSLPS ASHPQAIGCD GF
Sbjct: 367 PNSLLYVAKRQVDYILGENPIRMSYMVGFGPNFPKRIHHRGSSLPSLASHPQAIGCDSGF 426

Query: 432 QPFFYSYNPNPNILTGAVVGGPNQNDGFPDDRSDYSHSEPATYINAALVGPLAFFS 487
           +PFF+S NPNPNILTGA+VGGPNQNDG+PD+RSDYSHSEPATYINAA+VGPLA+F+
Sbjct: 427 EPFFHSANPNPNILTGAIVGGPNQNDGYPDERSDYSHSEPATYINAAMVGPLAYFA 482

BLAST of CmoCh01G009100 vs. TrEMBL
Match: Q6DMM3_9ROSI (Endoglucanase OS=Populus tremula x Populus tremuloides PE=2 SV=1)

HSP 1 Score: 819.3 bits (2115), Expect = 2.5e-234
Identity = 385/476 (80.88%), Postives = 429/476 (90.13%), Query Frame = 1

Query: 12  FFLLLSSSFSFYT-ARANPNYRDALAKSLLFFQGQRSGRIPNGQQIAWRSNSGLYDGELA 71
           F LL S S       +A PNY +ALAKS+LFFQGQRSGR+P  QQ+AWRS+SGL DG  A
Sbjct: 7   FCLLFSLSLVLLGFVQAKPNYNEALAKSILFFQGQRSGRLPGSQQLAWRSDSGLSDGLFA 66

Query: 72  HVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRAAIRWATDYLLKC 131
           HVDLTGGYYDAGDNVKFN PMAFTTTMLSW  LEYG RMG ELPN RAAIRWATDYLLKC
Sbjct: 67  HVDLTGGYYDAGDNVKFNFPMAFTTTMLSWSTLEYGKRMGPELPNARAAIRWATDYLLKC 126

Query: 132 ATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVYSVSAANPGSDVAGETAAALAAASLV 191
           ATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTV+SVSA +PGSDVAGETAAALAAAS+V
Sbjct: 127 ATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVFSVSARSPGSDVAGETAAALAAASMV 186

Query: 192 FRRVDRKYSGLLLATAKKVFQFAVEHRGSYSDSLGSAVCPFYCSYSGYKDELVWGATWLL 251
           FR+VDRKYS LLL TA+KVFQFA++++G+YSDSLGSAVCPFYCSYSGYKDEL+WGA WL 
Sbjct: 187 FRKVDRKYSALLLRTARKVFQFAMQYQGAYSDSLGSAVCPFYCSYSGYKDELLWGAAWLF 246

Query: 252 RATNDVRYFNLLKSLGGDDVPDIFSWDNKYAGAHVLLSRRALLNNDKNFDSYKQKAESFM 311
           RATN++ Y+N+ KSLG DD PD+FSWDNKYAG HVLLSRRALLNNDKNF+ ++ +AESFM
Sbjct: 247 RATNEMSYYNIFKSLGADDQPDLFSWDNKYAGVHVLLSRRALLNNDKNFEQFEGEAESFM 306

Query: 312 CRILPNSPYSSTQYTQGGLMFKLPQSNLQYVTSITFLLTTYSKYMSAAKHTFNCGGVLVT 371
           CRILPNSPY +TQYTQGGLM+KLP+SNLQYVTSITFLLTTY+KYM A +HTFNCG +LVT
Sbjct: 307 CRILPNSPYKTTQYTQGGLMYKLPESNLQYVTSITFLLTTYAKYMKATRHTFNCGNLLVT 366

Query: 372 PTSLKNLAKQQVDYILGVNPLKMSYMVGFGKNFPKRIHHRGSSLPSKASHPQAIGCDGGF 431
           P SL  +AK+QVDYILG NP++MSYMVGFG NFPKRIHHRGSSLPS ASHPQAIGCD GF
Sbjct: 367 PNSLLYVAKRQVDYILGENPIRMSYMVGFGPNFPKRIHHRGSSLPSLASHPQAIGCDSGF 426

Query: 432 QPFFYSYNPNPNILTGAVVGGPNQNDGFPDDRSDYSHSEPATYINAALVGPLAFFS 487
           +PFF+S NPNPNILTGA+VGGPNQNDG+PD+RSDYSHSEPATYINAA+VGPLA+F+
Sbjct: 427 EPFFHSANPNPNILTGAIVGGPNQNDGYPDERSDYSHSEPATYINAAMVGPLAYFA 482

BLAST of CmoCh01G009100 vs. TrEMBL
Match: A0A067LME8_JATCU (Endoglucanase OS=Jatropha curcas GN=JCGZ_17191 PE=3 SV=1)

HSP 1 Score: 816.2 bits (2107), Expect = 2.1e-233
Identity = 384/482 (79.67%), Postives = 429/482 (89.00%), Query Frame = 1

Query: 12  FFLLLSSSFSFYT---ARANPNYRDALAKSLLFFQGQRSGRIPNGQQIAWRSNSGLYDGE 71
           F   L  SFSF+     ++NPNY+DAL KS+LFF+GQRSG++P  Q I WRS SGL DG 
Sbjct: 4   FSFCLLFSFSFFLLGFVQSNPNYKDALTKSILFFEGQRSGKVPPSQGINWRSTSGLSDGL 63

Query: 72  LAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRAAIRWATDYLL 131
           LAHVDLTGGYYDAGDNVKFN PMAFTTTMLSW  +EYG RMG EL N RAAIRWATDYLL
Sbjct: 64  LAHVDLTGGYYDAGDNVKFNFPMAFTTTMLSWSTIEYGRRMGPELQNARAAIRWATDYLL 123

Query: 132 KCATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVYSVSAANPGSDVAGETAAALAAAS 191
           KCA ATPGKLYVGVGDPNVDHKCWERPEDMDTVR+VYSVSA NPGSDVAGETAAALAAAS
Sbjct: 124 KCARATPGKLYVGVGDPNVDHKCWERPEDMDTVRSVYSVSARNPGSDVAGETAAALAAAS 183

Query: 192 LVFRRVDRKYSGLLLATAKKVFQFAVEHRGSYSDSLGSAVCPFYCSYSGYKDELVWGATW 251
           +VFR+ D  YS LLL+TAK VFQFA++++G+YSDSLGSAVCPFYCSYSGYKDEL+WGA W
Sbjct: 184 IVFRKADPNYSELLLSTAKNVFQFALKYQGAYSDSLGSAVCPFYCSYSGYKDELLWGAAW 243

Query: 252 LLRATNDVRYFNLLKSLGGDDVPDIFSWDNKYAGAHVLLSRRALLNNDKNFDSYKQKAES 311
           L RATN + Y+NL+KSLG DD PDIFSWDNKYAGAHVLLSRRALLNNDKNF+ YK +AE+
Sbjct: 244 LFRATNQMYYYNLIKSLGADDQPDIFSWDNKYAGAHVLLSRRALLNNDKNFEQYKVEAEN 303

Query: 312 FMCRILPNSPYSSTQYTQGGLMFKLPQSNLQYVTSITFLLTTYSKYMSAAKHTFNCGGVL 371
           FMC+ILPNSP+++TQYTQGGLM+KLP+SNLQYVTSITFLLTTY+KYM + KHTFNCG ++
Sbjct: 304 FMCKILPNSPFTTTQYTQGGLMYKLPESNLQYVTSITFLLTTYAKYMKSTKHTFNCGNLM 363

Query: 372 VTPTSLKNLAKQQVDYILGVNPLKMSYMVGFGKNFPKRIHHRGSSLPSKASHPQAIGCDG 431
           VTP SL  +AK+QVDYILGVNP++MSYMVGFG +FPKRIHHRGSSLPS ASHPQ IGCDG
Sbjct: 364 VTPNSLLYVAKRQVDYILGVNPIQMSYMVGFGPHFPKRIHHRGSSLPSLASHPQTIGCDG 423

Query: 432 GFQPFFYSYNPNPNILTGAVVGGPNQNDGFPDDRSDYSHSEPATYINAALVGPLAFFSGK 491
           GFQPFFYS NPNPNIL GA+VGGPNQ+DGFPDDRSDYSHSEPATYINAA+VGPLA+F G 
Sbjct: 424 GFQPFFYSANPNPNILIGAIVGGPNQSDGFPDDRSDYSHSEPATYINAAIVGPLAYFGGS 483

BLAST of CmoCh01G009100 vs. TrEMBL
Match: A0A061FIN7_THECC (Endoglucanase OS=Theobroma cacao GN=TCM_036320 PE=3 SV=1)

HSP 1 Score: 814.7 bits (2103), Expect = 6.1e-233
Identity = 383/485 (78.97%), Postives = 424/485 (87.42%), Query Frame = 1

Query: 3   ATTNAPTFFFFLLLSSSFSFYTARANPNYRDALAKSLLFFQGQRSGRIPNGQQIAWRSNS 62
           AT+++ +F   LL  S     T   NPNY++AL KS+LFFQGQRSGR+P  QQI WRSNS
Sbjct: 27  ATSSSVSFLCLLLFLSPLLLNTVHGNPNYKEALLKSILFFQGQRSGRLPANQQITWRSNS 86

Query: 63  GLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRAAIRW 122
           GL DG L HVDLTGGYYDAGDNVKFN PMAFTTTMLSW  LEYG RMG +L   RAAIRW
Sbjct: 87  GLSDGLLEHVDLTGGYYDAGDNVKFNFPMAFTTTMLSWSTLEYGKRMGPQLQEARAAIRW 146

Query: 123 ATDYLLKCATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVYSVSAANPGSDVAGETAA 182
           ATDYLLKCA A PGKLYVGVGDPN DHKCWERPEDMDTVRT YSVS +NPGSDVA ETAA
Sbjct: 147 ATDYLLKCANAKPGKLYVGVGDPNADHKCWERPEDMDTVRTSYSVSPSNPGSDVAAETAA 206

Query: 183 ALAAASLVFRRVDRKYSGLLLATAKKVFQFAVEHRGSYSDSLGSAVCPFYCSYSGYKDEL 242
           ALAAAS+VFR++D KYS LL  TA+KV  FA+++RG+YSDSLGSAVCPFYCSYSGYKDEL
Sbjct: 207 ALAAASMVFRKIDPKYSSLLRETARKVMAFAIQYRGAYSDSLGSAVCPFYCSYSGYKDEL 266

Query: 243 VWGATWLLRATNDVRYFNLLKSLGGDDVPDIFSWDNKYAGAHVLLSRRALLNNDKNFDSY 302
           +WGA+WLLRATND  Y+N LK+LG DD PD+FSWDNKYAGAHVLL+RRAL+ NDKNF+ Y
Sbjct: 267 LWGASWLLRATNDAYYYNFLKTLGADDQPDLFSWDNKYAGAHVLLARRALVENDKNFEQY 326

Query: 303 KQKAESFMCRILPNSPYSSTQYTQGGLMFKLPQSNLQYVTSITFLLTTYSKYMSAAKHTF 362
           KQ+AESFMCRILPNSPYS+TQYTQGGLM+KLPQSNLQYVTSITFLLTTY KYM A + TF
Sbjct: 327 KQEAESFMCRILPNSPYSTTQYTQGGLMYKLPQSNLQYVTSITFLLTTYGKYMKARRQTF 386

Query: 363 NCGGVLVTPTSLKNLAKQQVDYILGVNPLKMSYMVGFGKNFPKRIHHRGSSLPSKASHPQ 422
           NCG ++V+P SL  LAK+QVDYILG NP+KMSYMVGFG NFPKRIHHRGSSLPS ASHPQ
Sbjct: 387 NCGNLMVSPNSLIGLAKRQVDYILGENPIKMSYMVGFGPNFPKRIHHRGSSLPSLASHPQ 446

Query: 423 AIGCDGGFQPFFYSYNPNPNILTGAVVGGPNQNDGFPDDRSDYSHSEPATYINAALVGPL 482
           +IGCDGGFQPFFYS NPNPNIL GA+VGGPNQNDG+PDDRSDYSHSEPATYINAA+VGPL
Sbjct: 447 SIGCDGGFQPFFYSSNPNPNILVGAIVGGPNQNDGYPDDRSDYSHSEPATYINAAMVGPL 506

Query: 483 AFFSG 488
           A+F+G
Sbjct: 507 AYFAG 511

BLAST of CmoCh01G009100 vs. TAIR10
Match: AT1G71380.1 (AT1G71380.1 cellulase 3)

HSP 1 Score: 780.0 bits (2013), Expect = 8.5e-226
Identity = 371/478 (77.62%), Postives = 417/478 (87.24%), Query Frame = 1

Query: 9   TFFFFLLLSSSFSFYTARANPNYRDALAKSLLFFQGQRSGRIPNGQQIAWRSNSGLYDGE 68
           + FFF+LL SS       ANPNY++AL+KSLLFFQGQRSG +P GQQI+WR++SGL DG 
Sbjct: 3   SLFFFVLLFSSLLISNGDANPNYKEALSKSLLFFQGQRSGPLPRGQQISWRASSGLSDGS 62

Query: 69  LAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRAAIRWATDYLL 128
            AHVDLTGGYYDAGDNVKFNLPMAFTTTMLSW ALEYG RMG EL N R  IRWATDYLL
Sbjct: 63  AAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWSALEYGKRMGPELENARVNIRWATDYLL 122

Query: 129 KCATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVYSVSAANPGSDVAGETAAALAAAS 188
           KCA ATPGKLYVGVGDPNVDHKCWERPEDMDT RTVYSVSA+NPGSDVA ETAAALAAAS
Sbjct: 123 KCARATPGKLYVGVGDPNVDHKCWERPEDMDTPRTVYSVSASNPGSDVAAETAAALAAAS 182

Query: 189 LVFRRVDRKYSGLLLATAKKVFQFAVEHRGSYSDSLGSAVCPFYCSYSGYKDELVWGATW 248
           +VFR+VD KYS LLLATAK V QFA++++G+YSDSL S+VCPFYCSYSGYKDEL+WGA+W
Sbjct: 183 MVFRKVDSKYSRLLLATAKDVMQFAIQYQGAYSDSLSSSVCPFYCSYSGYKDELMWGASW 242

Query: 249 LLRATNDVRYFNLLKSLGGDDVPDIFSWDNKYAGAHVLLSRRALLNNDKNFDSYKQKAES 308
           LLRATN+  Y N +KSLGG D PDIFSWDNKYAGA+VLLSRRALLN D NF+ YKQ AE+
Sbjct: 243 LLRATNNPYYANFIKSLGGGDQPDIFSWDNKYAGAYVLLSRRALLNKDSNFEQYKQAAEN 302

Query: 309 FMCRILPNSPYSSTQYTQGGLMFKLPQSNLQYVTSITFLLTTYSKYMSAAKHTFNCGGVL 368
           F+C+ILP+SP SSTQYTQGGLM+KLPQSNLQYVTSITFLLTTY+KYM A KHTFNCG  +
Sbjct: 303 FICKILPDSPSSSTQYTQGGLMYKLPQSNLQYVTSITFLLTTYAKYMKATKHTFNCGSSV 362

Query: 369 VTPTSLKNLAKQQVDYILGVNPLKMSYMVGFGKNFPKRIHHRGSSLPSKASHPQAIGCDG 428
           + P +L +L+K+QVDYILG NP+KMSYMVGF  NFPKRIHHR SSLPS A   Q++GC+G
Sbjct: 363 IVPNALISLSKRQVDYILGDNPIKMSYMVGFSSNFPKRIHHRASSLPSHALRSQSLGCNG 422

Query: 429 GFQPFFYSYNPNPNILTGAVVGGPNQNDGFPDDRSDYSHSEPATYINAALVGPLAFFS 487
           GFQ  FY+ NPNPNILTGA+VGGPNQNDG+PD R DYSH+EPATYINAA VGPLA+F+
Sbjct: 423 GFQS-FYTQNPNPNILTGAIVGGPNQNDGYPDQRDDYSHAEPATYINAAFVGPLAYFA 479

BLAST of CmoCh01G009100 vs. TAIR10
Match: AT1G22880.1 (AT1G22880.1 cellulase 5)

HSP 1 Score: 764.6 bits (1973), Expect = 3.7e-221
Identity = 362/476 (76.05%), Postives = 413/476 (86.76%), Query Frame = 1

Query: 11  FFFLLLSSSFSFYTARANPNYRDALAKSLLFFQGQRSGRIPNGQQIAWRSNSGLYDGELA 70
           FFF+ L S+ S     A+PNYR+AL+KSLLFFQGQRSGR+P+ QQ++WRS+SGL DG  A
Sbjct: 5   FFFVFLLSALSLENTYASPNYREALSKSLLFFQGQRSGRLPSDQQLSWRSSSGLSDGSSA 64

Query: 71  HVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRAAIRWATDYLLKC 130
           HVDLTGGYYDAGDNVKFN PMAFTTTMLSW +LEYG +MG EL N+R AIRWATDYLLKC
Sbjct: 65  HVDLTGGYYDAGDNVKFNFPMAFTTTMLSWSSLEYGKKMGPELQNSRVAIRWATDYLLKC 124

Query: 131 ATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVYSVSAANPGSDVAGETAAALAAASLV 190
           A ATPGKLYVGVGDPN DHKCWERPEDMDT RTVYSVS +NPGSDVA ETAAALAA+S+V
Sbjct: 125 ARATPGKLYVGVGDPNGDHKCWERPEDMDTPRTVYSVSPSNPGSDVAAETAAALAASSMV 184

Query: 191 FRRVDRKYSGLLLATAKKVFQFAVEHRGSYSDSLGSAVCPFYCSYSGYKDELVWGATWLL 250
           FR+VD KYS LLLATAKKV QFA+++RG+YS+SL S+VCPFYCSYSGYKDEL+WGA WL 
Sbjct: 185 FRKVDPKYSRLLLATAKKVMQFAIQYRGAYSNSLSSSVCPFYCSYSGYKDELLWGAAWLH 244

Query: 251 RATNDVRYFNLLKSLGGDDVPDIFSWDNKYAGAHVLLSRRALLNNDKNFDSYKQKAESFM 310
           RATND  Y N +KSLGG D PDIFSWDNKYAGA+VLLSRRA+LN D NF+ YKQ AE+FM
Sbjct: 245 RATNDPYYTNFIKSLGGGDQPDIFSWDNKYAGAYVLLSRRAVLNKDNNFELYKQAAENFM 304

Query: 311 CRILPNSPYSSTQYTQGGLMFKLPQSNLQYVTSITFLLTTYSKYMSAAKHTFNCGGVLVT 370
           C+ILPNSP SST+YT+GGLM+KLPQSNLQYVTSITFLLTTY+KYM + K TFNCG  L+ 
Sbjct: 305 CKILPNSPSSSTKYTKGGLMYKLPQSNLQYVTSITFLLTTYAKYMKSTKQTFNCGNSLIV 364

Query: 371 PTSLKNLAKQQVDYILGVNPLKMSYMVGFGKNFPKRIHHRGSSLPSKASHPQAIGCDGGF 430
           P +L NL+K+QVDY+LGVNP+KMSYMVGF  NFPKRIHHRGSSLPS+A    ++GC+GGF
Sbjct: 365 PNALINLSKRQVDYVLGVNPMKMSYMVGFSSNFPKRIHHRGSSLPSRAVRSNSLGCNGGF 424

Query: 431 QPFFYSYNPNPNILTGAVVGGPNQNDGFPDDRSDYSHSEPATYINAALVGPLAFFS 487
           Q  F + NPNPNILTGA+VGGPNQND +PD R DY+ SEPATYINAA VGPLA+F+
Sbjct: 425 QS-FRTQNPNPNILTGAIVGGPNQNDEYPDQRDDYTRSEPATYINAAFVGPLAYFA 479

BLAST of CmoCh01G009100 vs. TAIR10
Match: AT1G02800.1 (AT1G02800.1 cellulase 2)

HSP 1 Score: 600.5 bits (1547), Expect = 9.2e-172
Identity = 294/488 (60.25%), Postives = 366/488 (75.00%), Query Frame = 1

Query: 12  FFLLLSSSFSFYTARA---------NPNYRDALAKSLLFFQGQRSGRIPNGQQIAWRSNS 71
           F LLLS+ FS  ++R          N NY+DAL+KS+LFF+GQRSG++P  Q++ WRSNS
Sbjct: 16  FILLLSNGFSSSSSRPSIHHRHHLDNHNYKDALSKSILFFEGQRSGKLPPNQRMTWRSNS 75

Query: 72  GLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRAAIRW 131
           GL DG   +VDL GGYYDAGDN+KF  PMAFTTTMLSW  +E+G  M SELPN + AIRW
Sbjct: 76  GLSDGSALNVDLVGGYYDAGDNMKFGFPMAFTTTMLSWSLIEFGGLMKSELPNAKDAIRW 135

Query: 132 ATDYLLKCATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVYSVSAANPGSDVAGETAA 191
           ATD+LLK AT+ P  +YV VGDPN+DH CWERPEDMDT R+V+ V   NPGSD+AGE AA
Sbjct: 136 ATDFLLK-ATSHPDTIYVQVGDPNMDHACWERPEDMDTPRSVFKVDKNNPGSDIAGEIAA 195

Query: 192 ALAAASLVFRRVDRKYSGLLLATAKKVFQFAVEHRGSYSDSLGSAVCPFYCSYSGYKDEL 251
           ALAAAS+VFR+ D  YS  LL  A  VF FA ++RG YS  L   VCPFYCSYSGY+DEL
Sbjct: 196 ALAAASIVFRKCDPSYSNHLLQRAITVFTFADKYRGPYSAGLAPEVCPFYCSYSGYQDEL 255

Query: 252 VWGATWLLRATNDVRYFNLLKS----LGGDDVPDIFSWDNKYAGAHVLLSRRALLNNDKN 311
           +WGA WL +ATN+  Y N +K+    LG D+  ++FSWDNK+ GA +LLS+  L+   K+
Sbjct: 256 LWGAAWLQKATNNPTYLNYIKANGQILGADEFDNMFSWDNKHVGARILLSKEFLIQKVKS 315

Query: 312 FDSYKQKAESFMCRILPNSPYSSTQYTQGGLMFKLPQSNLQYVTSITFLLTTYSKYMSAA 371
            + YK+ A+SF+C +LP +  SS+QYT GGL+FK+ +SN+QYVTS +FLL TY+KY+++A
Sbjct: 316 LEEYKEHADSFICSVLPGA--SSSQYTPGGLLFKMGESNMQYVTSTSFLLLTYAKYLTSA 375

Query: 372 KHTFNCGGVLVTPTSLKNLAKQQVDYILGVNPLKMSYMVGFGKNFPKRIHHRGSSLPSKA 431
           +    CGG +VTP  L+++AK+QVDY+LG NPLKMSYMVG+G  +P+RIHHRGSSLPS A
Sbjct: 376 RTVAYCGGSVVTPARLRSIAKKQVDYLLGGNPLKMSYMVGYGLKYPRRIHHRGSSLPSVA 435

Query: 432 SHPQAIGCDGGFQPFFYSYNPNPNILTGAVVGGPNQNDGFPDDRSDYSHSEPATYINAAL 487
            HP  I C  GF   F S +PNPN L GAVVGGP+QND FPD+RSDY  SEPATYINA L
Sbjct: 436 VHPTRIQCHDGFS-LFTSQSPNPNDLVGAVVGGPDQNDQFPDERSDYGRSEPATYINAPL 495

BLAST of CmoCh01G009100 vs. TAIR10
Match: AT4G02290.1 (AT4G02290.1 glycosyl hydrolase 9B13)

HSP 1 Score: 589.0 bits (1517), Expect = 2.8e-168
Identity = 288/492 (58.54%), Postives = 363/492 (73.78%), Query Frame = 1

Query: 9   TFFFFLL-------LSSSFSFYTAR---ANPNYRDALAKSLLFFQGQRSGRIPNGQQIAW 68
           +FFFFL         SS F+ +  R   A  NY+DAL KS+LFF+GQRSG++P+ Q+++W
Sbjct: 21  SFFFFLCNGFSYPTTSSLFNTHHHRHHLAKHNYKDALTKSILFFEGQRSGKLPSNQRMSW 80

Query: 69  RSNSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRA 128
           R +SGL DG   HVDL GGYYDAGDN+KF  PMAFTTTMLSW  +E+G  M SEL N + 
Sbjct: 81  RRDSGLSDGSALHVDLVGGYYDAGDNIKFGFPMAFTTTMLSWSVIEFGGLMKSELQNAKI 140

Query: 129 AIRWATDYLLKCATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVYSVSAANPGSDVAG 188
           AIRWATDYLLK AT+ P  +YV VGD N DH CWERPEDMDTVR+V+ V    PGSDVA 
Sbjct: 141 AIRWATDYLLK-ATSQPDTIYVQVGDANKDHSCWERPEDMDTVRSVFKVDKNIPGSDVAA 200

Query: 189 ETAAALAAASLVFRRVDRKYSGLLLATAKKVFQFAVEHRGSYSDSLGSAVCPFYCSYSGY 248
           ETAAALAAA++VFR+ D  YS +LL  A  VF FA ++RG+YS  L   VCPFYCSYSGY
Sbjct: 201 ETAAALAAAAIVFRKSDPSYSKVLLKRAISVFAFADKYRGTYSAGLKPDVCPFYCSYSGY 260

Query: 249 KDELVWGATWLLRATNDVRYFNLLKS----LGGDDVPDIFSWDNKYAGAHVLLSRRALLN 308
           +DEL+WGA WL +AT +++Y N +K     LG  +  + F WDNK+AGA +LL++  L+ 
Sbjct: 261 QDELLWGAAWLQKATKNIKYLNYIKINGQILGAAEYDNTFGWDNKHAGARILLTKAFLVQ 320

Query: 309 NDKNFDSYKQKAESFMCRILPNSPYSSTQYTQGGLMFKLPQSNLQYVTSITFLLTTYSKY 368
           N K    YK  A++F+C ++P +P+SSTQYT GGL+FK+  +N+QYVTS +FLL TY+KY
Sbjct: 321 NVKTLHEYKGHADNFICSVIPGAPFSSTQYTPGGLLFKMADANMQYVTSTSFLLLTYAKY 380

Query: 369 MSAAKHTFNCGGVLVTPTSLKNLAKQQVDYILGVNPLKMSYMVGFGKNFPKRIHHRGSSL 428
           +++AK   +CGG + TP  L+++AK+QVDY+LG NPL+MSYMVG+G  FP+RIHHRGSSL
Sbjct: 381 LTSAKTVVHCGGSVYTPGRLRSIAKRQVDYLLGDNPLRMSYMVGYGPKFPRRIHHRGSSL 440

Query: 429 PSKASHPQAIGCDGGFQPFFYSYNPNPNILTGAVVGGPNQNDGFPDDRSDYSHSEPATYI 487
           P  ASHP  I C  GF     S +PNPN L GAVVGGP+Q+D FPD+RSDY  SEPATYI
Sbjct: 441 PCVASHPAKIQCHQGF-AIMNSQSPNPNFLVGAVVGGPDQHDRFPDERSDYEQSEPATYI 500

BLAST of CmoCh01G009100 vs. TAIR10
Match: AT1G70710.1 (AT1G70710.1 glycosyl hydrolase 9B1)

HSP 1 Score: 543.5 bits (1399), Expect = 1.3e-154
Identity = 269/481 (55.93%), Postives = 341/481 (70.89%), Query Frame = 1

Query: 10  FFFFLLLSSSFSFYTARANPNYRDALAKSLLFFQGQRSGRIPNGQQIAWRSNSGLYDGEL 69
           F   LL    FS     A  +YRDAL KS+LFF+GQRSG++P  Q++ WR +S L DG  
Sbjct: 8   FPVILLAVLLFSPPIYSAGHDYRDALRKSILFFEGQRSGKLPPDQRLKWRRDSALRDGSS 67

Query: 70  AHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRAAIRWATDYLLK 129
           A VDL+GGYYDAGDN+KF  PMAFTTTMLSW  +++G  MG EL N   A++W TDYLLK
Sbjct: 68  AGVDLSGGYYDAGDNIKFGFPMAFTTTMLSWSIIDFGKTMGPELRNAVKAVKWGTDYLLK 127

Query: 130 CATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVYSVSAANPGSDVAGETAAALAAASL 189
            ATA PG ++V VGD   DH CWERPEDMDT+RTVY +  A+PGSDVAGETAAALAAAS+
Sbjct: 128 -ATAIPGVVFVQVGDAYSDHNCWERPEDMDTLRTVYKIDRAHPGSDVAGETAAALAAASI 187

Query: 190 VFRRVDRKYSGLLLATAKKVFQFAVEHRGSYSDSLGSAVCPFYCSYSGYKDELVWGATWL 249
           VFR+ D  YS LLL  A +VF FA  +RG+YS+SL  AVCPFYC ++GY+DEL+WGA WL
Sbjct: 188 VFRKRDPAYSRLLLDRATRVFAFANRYRGAYSNSLYHAVCPFYCDFNGYQDELLWGAAWL 247

Query: 250 LRATNDVRYFNLLKS----LGGDDVPDIFSWDNKYAGAHVLLSRRALLNNDKNFDSYKQK 309
            +A+    Y   +      L   D  + F WDNK+AG +VL+S+  L+   + F+S+KQ 
Sbjct: 248 HKASRKRAYREFIVKNEVILKAGDTINEFGWDNKHAGINVLISKEVLMGKAEYFESFKQN 307

Query: 310 AESFMCRILPNSPYSSTQYTQGGLMFKLPQSNLQYVTSITFLLTTYSKYMSAAKHTFNCG 369
           A+ F+C ILP   +   QY++GGL+ K   SN+Q+VTS++FLL  YS Y+S AK    CG
Sbjct: 308 ADGFICSILPGISHPQVQYSRGGLLVKTGGSNMQHVTSLSFLLLAYSNYLSHAKKVVPCG 367

Query: 370 GVLVTPTSLKNLAKQQVDYILGVNPLKMSYMVGFGKNFPKRIHHRGSSLPSKASHPQAIG 429
            +  +P+ L+ +AK+QVDYILG NP+ +SYMVG+G+ FP+RIHHRGSS+PS ++HP  IG
Sbjct: 368 ELTASPSLLRQIAKRQVDYILGDNPMGLSYMVGYGQKFPRRIHHRGSSVPSVSAHPSHIG 427

Query: 430 CDGGFQPFFYSYNPNPNILTGAVVGGPNQNDGFPDDRSDYSHSEPATYINAALVGPLAFF 487
           C  G + +F S NPNPN+L GAVVGGPN  D FPD R  +  SEP TYINA LVG L +F
Sbjct: 428 CKEGSR-YFLSPNPNPNLLVGAVVGGPNVTDAFPDSRPYFQQSEPTTYINAPLVGLLGYF 486

BLAST of CmoCh01G009100 vs. NCBI nr
Match: gi|449447557|ref|XP_004141534.1| (PREDICTED: endoglucanase 9 [Cucumis sativus])

HSP 1 Score: 896.0 bits (2314), Expect = 3.0e-257
Identity = 436/489 (89.16%), Postives = 461/489 (94.27%), Query Frame = 1

Query: 2   AATTNAPTFF--FFLLLSSSFSFYTARANPNYRDALAKSLLFFQGQRSGRIPNGQQIAWR 61
           +A +N+ T F  FFLLLS SF+   A A PNYRDALAKS+LFF+GQRSGRIP  Q+I WR
Sbjct: 3   SAISNSSTLFLLFFLLLSFSFAG-RALAGPNYRDALAKSILFFEGQRSGRIPANQRITWR 62

Query: 62  SNSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRAA 121
           SNSGLYDGEL HVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRAA
Sbjct: 63  SNSGLYDGELDHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRAA 122

Query: 122 IRWATDYLLKCATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVYSVSAANPGSDVAGE 181
           IRWATDYLLKCATATPGKLYVGVG+P+ DHKCWERPEDMDTVRTVYSVSA NPGSDVAGE
Sbjct: 123 IRWATDYLLKCATATPGKLYVGVGEPHADHKCWERPEDMDTVRTVYSVSAGNPGSDVAGE 182

Query: 182 TAAALAAASLVFRRVDRKYSGLLLATAKKVFQFAVEHRGSYSDSLGSAVCPFYCSYSGYK 241
           TAAALAAASLVFRRVDRKYS +LLATAKKV +FA+EHRGSYSDSL SAVCPFYCSYSGYK
Sbjct: 183 TAAALAAASLVFRRVDRKYSKVLLATAKKVMEFALEHRGSYSDSLSSAVCPFYCSYSGYK 242

Query: 242 DELVWGATWLLRATNDVRYFNLLKSLGGDDVPDIFSWDNKYAGAHVLLSRRALLNNDKNF 301
           DELVWGA WLLRATN+V+YFNLLKSLGGDDV DIFSWDNK+AGAHVLLSRR+LLNNDKNF
Sbjct: 243 DELVWGAAWLLRATNNVKYFNLLKSLGGDDVTDIFSWDNKFAGAHVLLSRRSLLNNDKNF 302

Query: 302 DSYKQKAESFMCRILPNSPYSSTQYTQGGLMFKLPQSNLQYVTSITFLLTTYSKYMSAAK 361
           DSYKQ+AE+FMCRILPNSP SSTQYTQG LMFKLP+SNLQYVTSITFLLTTYSKYMSAAK
Sbjct: 303 DSYKQEAEAFMCRILPNSPSSSTQYTQGRLMFKLPESNLQYVTSITFLLTTYSKYMSAAK 362

Query: 362 HTFNCGGVLVTPTSLKNLAKQQVDYILGVNPLKMSYMVGFGKNFPKRIHHRGSSLPSKAS 421
           HTFNCG ++VTP SLKNLAK QVDYILGVNPLKMSYMVGFGKN+PKRIHHRGSSLPSKA+
Sbjct: 363 HTFNCGNLVVTPASLKNLAKIQVDYILGVNPLKMSYMVGFGKNYPKRIHHRGSSLPSKAT 422

Query: 422 HPQAIGCDGGFQPFFYSYNPNPNILTGAVVGGPNQNDGFPDDRSDYSHSEPATYINAALV 481
           HPQAI CDGGFQPFFYSYNPNPNILTGAVVGGPNQ+DGFPDDR+DYSHSEPATYINAALV
Sbjct: 423 HPQAIACDGGFQPFFYSYNPNPNILTGAVVGGPNQSDGFPDDRTDYSHSEPATYINAALV 482

Query: 482 GPLAFFSGK 489
           GPLAFFSGK
Sbjct: 483 GPLAFFSGK 490

BLAST of CmoCh01G009100 vs. NCBI nr
Match: gi|566238853|ref|XP_006371389.1| (hypothetical protein POPTR_0019s09740g [Populus trichocarpa])

HSP 1 Score: 823.2 bits (2125), Expect = 2.5e-235
Identity = 386/476 (81.09%), Postives = 430/476 (90.34%), Query Frame = 1

Query: 12  FFLLLSSSFSFYT-ARANPNYRDALAKSLLFFQGQRSGRIPNGQQIAWRSNSGLYDGELA 71
           F LL S S +      ANPNY+DALAKS+LFFQGQRSGR+P  QQ+AWRS+SGL DG  A
Sbjct: 7   FCLLFSLSLALLGFVHANPNYKDALAKSILFFQGQRSGRLPRSQQLAWRSDSGLSDGLFA 66

Query: 72  HVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRAAIRWATDYLLKC 131
           HVDLTGGYYDAGDNVKFN PMAFTTTMLSW  LEYG RMG ELPN RAAIRWATDYLLKC
Sbjct: 67  HVDLTGGYYDAGDNVKFNFPMAFTTTMLSWSTLEYGKRMGPELPNARAAIRWATDYLLKC 126

Query: 132 ATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVYSVSAANPGSDVAGETAAALAAASLV 191
           ATATPGKLYVGVGDPNVDHKCWERPEDMDT RTV+SVSA +PGSDVAGETAAALAAAS+V
Sbjct: 127 ATATPGKLYVGVGDPNVDHKCWERPEDMDTARTVFSVSARSPGSDVAGETAAALAAASMV 186

Query: 192 FRRVDRKYSGLLLATAKKVFQFAVEHRGSYSDSLGSAVCPFYCSYSGYKDELVWGATWLL 251
           FR+VDRKYS LLL TA+KVFQFA++++G+YSDSLGSAVCPFYCSYSGYKDEL+WGA WL 
Sbjct: 187 FRKVDRKYSALLLRTARKVFQFAMQYQGAYSDSLGSAVCPFYCSYSGYKDELLWGAAWLF 246

Query: 252 RATNDVRYFNLLKSLGGDDVPDIFSWDNKYAGAHVLLSRRALLNNDKNFDSYKQKAESFM 311
           RATN++ Y+N+ KSLG DD PD+FSWDNKYAG HVLLSRRALLNNDKNF+ ++ +AESFM
Sbjct: 247 RATNEMSYYNIFKSLGADDQPDLFSWDNKYAGVHVLLSRRALLNNDKNFEQFEGEAESFM 306

Query: 312 CRILPNSPYSSTQYTQGGLMFKLPQSNLQYVTSITFLLTTYSKYMSAAKHTFNCGGVLVT 371
           CRILPNSPY +TQYTQGGLM+KLP+SNLQYVTSITFLLTTY+KYM A +HTFNCG +LVT
Sbjct: 307 CRILPNSPYKTTQYTQGGLMYKLPESNLQYVTSITFLLTTYAKYMKATRHTFNCGNLLVT 366

Query: 372 PTSLKNLAKQQVDYILGVNPLKMSYMVGFGKNFPKRIHHRGSSLPSKASHPQAIGCDGGF 431
           P SL  +AK+QVDYILG NP++MSYMVGFG NFPKRIHHRGSSLPS ASHPQAIGCD GF
Sbjct: 367 PNSLLYVAKRQVDYILGENPIRMSYMVGFGPNFPKRIHHRGSSLPSLASHPQAIGCDSGF 426

Query: 432 QPFFYSYNPNPNILTGAVVGGPNQNDGFPDDRSDYSHSEPATYINAALVGPLAFFS 487
           +PFF+S NPNPNILTGA+VGGPNQNDG+PD+RSDYSHSEPATYINAA+VGPLA+F+
Sbjct: 427 EPFFHSANPNPNILTGAIVGGPNQNDGYPDERSDYSHSEPATYINAAMVGPLAYFA 482

BLAST of CmoCh01G009100 vs. NCBI nr
Match: gi|743921651|ref|XP_011004890.1| (PREDICTED: endoglucanase 9 [Populus euphratica])

HSP 1 Score: 822.0 bits (2122), Expect = 5.5e-235
Identity = 385/476 (80.88%), Postives = 433/476 (90.97%), Query Frame = 1

Query: 12  FFLLLSSSFSFYT-ARANPNYRDALAKSLLFFQGQRSGRIPNGQQIAWRSNSGLYDGELA 71
           F LL S SF+     +A PNY++ALAKS+LFFQGQRSGR+P  QQ+AWRS+SGL DG  A
Sbjct: 7   FCLLFSLSFALLGFVQAKPNYKEALAKSILFFQGQRSGRLPRSQQLAWRSDSGLSDGLFA 66

Query: 72  HVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRAAIRWATDYLLKC 131
           HVDLTGGYYDAGDNVKFN PMAFTTTMLSW  LEYG R+G ELPN RAAIRWATDYLLKC
Sbjct: 67  HVDLTGGYYDAGDNVKFNFPMAFTTTMLSWSTLEYGKRLGPELPNARAAIRWATDYLLKC 126

Query: 132 ATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVYSVSAANPGSDVAGETAAALAAASLV 191
           ATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTV+SVSA NPGSDVAGETAAALAAAS+V
Sbjct: 127 ATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVFSVSARNPGSDVAGETAAALAAASMV 186

Query: 192 FRRVDRKYSGLLLATAKKVFQFAVEHRGSYSDSLGSAVCPFYCSYSGYKDELVWGATWLL 251
           FR+VDRKYS LLL TA+KVFQFA++++G+YSDSLGSAVCPFYCSYSGYKDEL+WGA WL 
Sbjct: 187 FRKVDRKYSALLLRTARKVFQFAMQYQGAYSDSLGSAVCPFYCSYSGYKDELLWGAAWLF 246

Query: 252 RATNDVRYFNLLKSLGGDDVPDIFSWDNKYAGAHVLLSRRALLNNDKNFDSYKQKAESFM 311
           RATN++ Y+ +LKSLG DD PD+FSWDNKYAGAHVLLSRRALL+NDKNF+ ++ +AE+FM
Sbjct: 247 RATNEMSYYKILKSLGADDQPDLFSWDNKYAGAHVLLSRRALLHNDKNFEQFEGEAENFM 306

Query: 312 CRILPNSPYSSTQYTQGGLMFKLPQSNLQYVTSITFLLTTYSKYMSAAKHTFNCGGVLVT 371
           CRILPNSPY +TQYTQGGLM+KLP+SNLQYVTSITFLLTTY+KYM A +HTFNCG +LVT
Sbjct: 307 CRILPNSPYKTTQYTQGGLMYKLPESNLQYVTSITFLLTTYAKYMKATRHTFNCGNLLVT 366

Query: 372 PTSLKNLAKQQVDYILGVNPLKMSYMVGFGKNFPKRIHHRGSSLPSKASHPQAIGCDGGF 431
           P SL  +AK+QVDYILG NP++MSYMVGFG NFPKRIHHRGSSLPS ASHPQAIGCD GF
Sbjct: 367 PNSLLYVAKRQVDYILGENPIRMSYMVGFGPNFPKRIHHRGSSLPSLASHPQAIGCDSGF 426

Query: 432 QPFFYSYNPNPNILTGAVVGGPNQNDGFPDDRSDYSHSEPATYINAALVGPLAFFS 487
           +PFF+S NPNPNILTGA+VGGPNQNDG+PD+RSDYSHSEPATYINAA+VGPLA+F+
Sbjct: 427 EPFFHSANPNPNILTGAIVGGPNQNDGYPDERSDYSHSEPATYINAAMVGPLAYFA 482

BLAST of CmoCh01G009100 vs. NCBI nr
Match: gi|50346664|gb|AAT75042.1| (Cel9B [Populus tremula x Populus tremuloides])

HSP 1 Score: 819.3 bits (2115), Expect = 3.6e-234
Identity = 385/476 (80.88%), Postives = 429/476 (90.13%), Query Frame = 1

Query: 12  FFLLLSSSFSFYT-ARANPNYRDALAKSLLFFQGQRSGRIPNGQQIAWRSNSGLYDGELA 71
           F LL S S       +A PNY +ALAKS+LFFQGQRSGR+P  QQ+AWRS+SGL DG  A
Sbjct: 7   FCLLFSLSLVLLGFVQAKPNYNEALAKSILFFQGQRSGRLPGSQQLAWRSDSGLSDGLFA 66

Query: 72  HVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRAAIRWATDYLLKC 131
           HVDLTGGYYDAGDNVKFN PMAFTTTMLSW  LEYG RMG ELPN RAAIRWATDYLLKC
Sbjct: 67  HVDLTGGYYDAGDNVKFNFPMAFTTTMLSWSTLEYGKRMGPELPNARAAIRWATDYLLKC 126

Query: 132 ATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVYSVSAANPGSDVAGETAAALAAASLV 191
           ATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTV+SVSA +PGSDVAGETAAALAAAS+V
Sbjct: 127 ATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVFSVSARSPGSDVAGETAAALAAASMV 186

Query: 192 FRRVDRKYSGLLLATAKKVFQFAVEHRGSYSDSLGSAVCPFYCSYSGYKDELVWGATWLL 251
           FR+VDRKYS LLL TA+KVFQFA++++G+YSDSLGSAVCPFYCSYSGYKDEL+WGA WL 
Sbjct: 187 FRKVDRKYSALLLRTARKVFQFAMQYQGAYSDSLGSAVCPFYCSYSGYKDELLWGAAWLF 246

Query: 252 RATNDVRYFNLLKSLGGDDVPDIFSWDNKYAGAHVLLSRRALLNNDKNFDSYKQKAESFM 311
           RATN++ Y+N+ KSLG DD PD+FSWDNKYAG HVLLSRRALLNNDKNF+ ++ +AESFM
Sbjct: 247 RATNEMSYYNIFKSLGADDQPDLFSWDNKYAGVHVLLSRRALLNNDKNFEQFEGEAESFM 306

Query: 312 CRILPNSPYSSTQYTQGGLMFKLPQSNLQYVTSITFLLTTYSKYMSAAKHTFNCGGVLVT 371
           CRILPNSPY +TQYTQGGLM+KLP+SNLQYVTSITFLLTTY+KYM A +HTFNCG +LVT
Sbjct: 307 CRILPNSPYKTTQYTQGGLMYKLPESNLQYVTSITFLLTTYAKYMKATRHTFNCGNLLVT 366

Query: 372 PTSLKNLAKQQVDYILGVNPLKMSYMVGFGKNFPKRIHHRGSSLPSKASHPQAIGCDGGF 431
           P SL  +AK+QVDYILG NP++MSYMVGFG NFPKRIHHRGSSLPS ASHPQAIGCD GF
Sbjct: 367 PNSLLYVAKRQVDYILGENPIRMSYMVGFGPNFPKRIHHRGSSLPSLASHPQAIGCDSGF 426

Query: 432 QPFFYSYNPNPNILTGAVVGGPNQNDGFPDDRSDYSHSEPATYINAALVGPLAFFS 487
           +PFF+S NPNPNILTGA+VGGPNQNDG+PD+RSDYSHSEPATYINAA+VGPLA+F+
Sbjct: 427 EPFFHSANPNPNILTGAIVGGPNQNDGYPDERSDYSHSEPATYINAAMVGPLAYFA 482

BLAST of CmoCh01G009100 vs. NCBI nr
Match: gi|802541443|ref|XP_012079988.1| (PREDICTED: endoglucanase 9-like [Jatropha curcas])

HSP 1 Score: 816.2 bits (2107), Expect = 3.0e-233
Identity = 384/482 (79.67%), Postives = 429/482 (89.00%), Query Frame = 1

Query: 12  FFLLLSSSFSFYT---ARANPNYRDALAKSLLFFQGQRSGRIPNGQQIAWRSNSGLYDGE 71
           F   L  SFSF+     ++NPNY+DAL KS+LFF+GQRSG++P  Q I WRS SGL DG 
Sbjct: 4   FSFCLLFSFSFFLLGFVQSNPNYKDALTKSILFFEGQRSGKVPPSQGINWRSTSGLSDGL 63

Query: 72  LAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRAAIRWATDYLL 131
           LAHVDLTGGYYDAGDNVKFN PMAFTTTMLSW  +EYG RMG EL N RAAIRWATDYLL
Sbjct: 64  LAHVDLTGGYYDAGDNVKFNFPMAFTTTMLSWSTIEYGRRMGPELQNARAAIRWATDYLL 123

Query: 132 KCATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVYSVSAANPGSDVAGETAAALAAAS 191
           KCA ATPGKLYVGVGDPNVDHKCWERPEDMDTVR+VYSVSA NPGSDVAGETAAALAAAS
Sbjct: 124 KCARATPGKLYVGVGDPNVDHKCWERPEDMDTVRSVYSVSARNPGSDVAGETAAALAAAS 183

Query: 192 LVFRRVDRKYSGLLLATAKKVFQFAVEHRGSYSDSLGSAVCPFYCSYSGYKDELVWGATW 251
           +VFR+ D  YS LLL+TAK VFQFA++++G+YSDSLGSAVCPFYCSYSGYKDEL+WGA W
Sbjct: 184 IVFRKADPNYSELLLSTAKNVFQFALKYQGAYSDSLGSAVCPFYCSYSGYKDELLWGAAW 243

Query: 252 LLRATNDVRYFNLLKSLGGDDVPDIFSWDNKYAGAHVLLSRRALLNNDKNFDSYKQKAES 311
           L RATN + Y+NL+KSLG DD PDIFSWDNKYAGAHVLLSRRALLNNDKNF+ YK +AE+
Sbjct: 244 LFRATNQMYYYNLIKSLGADDQPDIFSWDNKYAGAHVLLSRRALLNNDKNFEQYKVEAEN 303

Query: 312 FMCRILPNSPYSSTQYTQGGLMFKLPQSNLQYVTSITFLLTTYSKYMSAAKHTFNCGGVL 371
           FMC+ILPNSP+++TQYTQGGLM+KLP+SNLQYVTSITFLLTTY+KYM + KHTFNCG ++
Sbjct: 304 FMCKILPNSPFTTTQYTQGGLMYKLPESNLQYVTSITFLLTTYAKYMKSTKHTFNCGNLM 363

Query: 372 VTPTSLKNLAKQQVDYILGVNPLKMSYMVGFGKNFPKRIHHRGSSLPSKASHPQAIGCDG 431
           VTP SL  +AK+QVDYILGVNP++MSYMVGFG +FPKRIHHRGSSLPS ASHPQ IGCDG
Sbjct: 364 VTPNSLLYVAKRQVDYILGVNPIQMSYMVGFGPHFPKRIHHRGSSLPSLASHPQTIGCDG 423

Query: 432 GFQPFFYSYNPNPNILTGAVVGGPNQNDGFPDDRSDYSHSEPATYINAALVGPLAFFSGK 491
           GFQPFFYS NPNPNIL GA+VGGPNQ+DGFPDDRSDYSHSEPATYINAA+VGPLA+F G 
Sbjct: 424 GFQPFFYSANPNPNILIGAIVGGPNQSDGFPDDRSDYSHSEPATYINAAIVGPLAYFGGS 483

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
GUN9_ARATH1.5e-22477.62Endoglucanase 9 OS=Arabidopsis thaliana GN=CEL3 PE=1 SV=1[more]
GUN3_ARATH6.5e-22076.05Endoglucanase 3 OS=Arabidopsis thaliana GN=CEL5 PE=2 SV=2[more]
GUN11_ORYSJ1.8e-17764.22Endoglucanase 11 OS=Oryza sativa subsp. japonica GN=GLU4 PE=2 SV=3[more]
GUN1_ARATH1.6e-17060.25Endoglucanase 1 OS=Arabidopsis thaliana GN=CEL2 PE=2 SV=1[more]
GUN17_ARATH4.9e-16758.54Endoglucanase 17 OS=Arabidopsis thaliana GN=At4g02290 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KW33_CUCSA2.1e-25789.16Endoglucanase OS=Cucumis sativus GN=Csa_5G648680 PE=3 SV=1[more]
U5FH10_POPTR1.7e-23581.09Endoglucanase OS=Populus trichocarpa GN=POPTR_0019s09740g PE=3 SV=1[more]
Q6DMM3_9ROSI2.5e-23480.88Endoglucanase OS=Populus tremula x Populus tremuloides PE=2 SV=1[more]
A0A067LME8_JATCU2.1e-23379.67Endoglucanase OS=Jatropha curcas GN=JCGZ_17191 PE=3 SV=1[more]
A0A061FIN7_THECC6.1e-23378.97Endoglucanase OS=Theobroma cacao GN=TCM_036320 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT1G71380.18.5e-22677.62 cellulase 3[more]
AT1G22880.13.7e-22176.05 cellulase 5[more]
AT1G02800.19.2e-17260.25 cellulase 2[more]
AT4G02290.12.8e-16858.54 glycosyl hydrolase 9B13[more]
AT1G70710.11.3e-15455.93 glycosyl hydrolase 9B1[more]
Match NameE-valueIdentityDescription
gi|449447557|ref|XP_004141534.1|3.0e-25789.16PREDICTED: endoglucanase 9 [Cucumis sativus][more]
gi|566238853|ref|XP_006371389.1|2.5e-23581.09hypothetical protein POPTR_0019s09740g [Populus trichocarpa][more]
gi|743921651|ref|XP_011004890.1|5.5e-23580.88PREDICTED: endoglucanase 9 [Populus euphratica][more]
gi|50346664|gb|AAT75042.1|3.6e-23480.88Cel9B [Populus tremula x Populus tremuloides][more]
gi|802541443|ref|XP_012079988.1|3.0e-23379.67PREDICTED: endoglucanase 9-like [Jatropha curcas][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001701Glyco_hydro_9
IPR0089286-hairpin_glycosidase_sf
IPR0123416hp_glycosidase-like_sf
IPR018221Glyco_hydro_9_His_AS
Vocabulary: Molecular Function
TermDefinition
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0003824catalytic activity
Vocabulary: Biological Process
TermDefinition
GO:0005975carbohydrate metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0030245 cellulose catabolic process
biological_process GO:0042254 ribosome biogenesis
biological_process GO:0005982 starch metabolic process
biological_process GO:0005985 sucrose metabolic process
biological_process GO:0006412 translation
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0005840 ribosome
cellular_component GO:0005575 cellular_component
molecular_function GO:0008810 cellulase activity
molecular_function GO:0003735 structural constituent of ribosome
molecular_function GO:0003824 catalytic activity
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh01G009100.1CmoCh01G009100.1mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001701Glycoside hydrolase family 9PFAMPF00759Glyco_hydro_9coord: 31..480
score: 8.1E
IPR008928Six-hairpin glycosidase-likeunknownSSF48208Six-hairpin glycosidasescoord: 11..487
score: 2.46E
IPR012341Six-hairpin glycosidaseGENE3DG3DSA:1.50.10.10coord: 29..485
score: 6.4E
IPR018221Glycoside hydrolase family 9, His active sitePROSITEPS00592GLYCOSYL_HYDROL_F9_1coord: 394..410
scor
NoneNo IPR availablePANTHERPTHR22298ENDO-1,4-BETA-GLUCANASEcoord: 10..487
score:
NoneNo IPR availablePANTHERPTHR22298:SF54ENDOGLUCANASE 3-RELATEDcoord: 10..487
score: