CmaCh08G004410 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh08G004410
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Endoglucanase
Location: Cma_Chr08: 2483745 .. 2486355 (-)
RNA-Seq Expression: CmaCh08G004410
Synteny: CmaCh08G004410
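
The location field above can be read programmatically. The following Python sketch splits it into chromosome, coordinates and strand. It assumes the coordinates are 1-based and inclusive (a common convention for GFF-style annotation, but not stated on this page); the function name `parse_location` and the regular expression are illustrative and not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'chrom: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
    return {"chrom": chrom, "start": start, "end": end,
            "strand": strand, "length": end - start + 1}

loc = parse_location("Cma_Chr08: 2483745 .. 2486355 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 2611 bp on the minus strand of Cma_Chr08.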
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTGCTCCAAAGCCCTTTCCTTCCAGAGCTTCTTCATCAATGGCTGCCCTCAGACCTCTGTTTTTTATCCTCCTCCTCTGTTTCTTCCCCTGCAACAATGGCTTCCCTACTCATCGCCATCCCCATTCTGCCCACCATAACTACCGAGACGCTCTCACCAAATCCATCCTCTTCTTTGAAGGCCAGAGATCCGGCAGGCTCCCGCCCAATCAGAGGATCACTTGGCGGCGCAACTCCGGCCTCTCTGACGGCGCTGCCATGAAGGTATTTCCCGATTTGGTTTGTTCTTTGAGTTTTGATTCTTTATTTCAACTCAAAATGTGAATTTATGAAGTGGGTTTTGGTTGGATTTAACTGGTTTTGTGGAAATGTAGGTCGATTTAGTCGGCGGTTACTACGATGCCGGCGACAATGTGAAATTTGGGTTTCCGATGGCGTTCACCACCACCATGCTTTCATGGAGTGTTCTTGAGTTCGGTGGGTTGATGAAAGGGGAGCTTCAAAATGCTAAACAAGCCATTCGTTGGGCCACTGATTATCTCCTCAAAGCCACCGCTCATCCCGACACCATTTACGTTCAGGTCGGCGATGCGAACAAGGACCATGCCTGTTGGGAGAGGCCAGAGGACATGGACACTCCCCGAAGTGTTTTCAAGGTGGACAAGAACAACCCCGGCACAGAGGTCGCCGCCGAGACCGCCGCCGCACTTGCCGCCGCATCGCTGGTATTCCGACGGAGTGATCCTAATTATTCAAACCTTCTGGTCAGAAGGGCCATGAGAGTAAGCTAAAACACTAACCTTTATAGCTTTAGAAACAGAGTAAACAAGATTGAAACAGAGCATTTTGATTCAATTTCAAAATTCTCCCTTTTTCTTTGCAGGTTTTTGAGTTTGCTGATAAGTATAGAGGCTCTTACAGCACTGGTTTGAAAAAATATGTGTGCCCATTTTACTGTTCCTTCTCTGGATATCAGGTAAAGCTACAAAAAGCTCTGTTTTTTTTCTCAAAAACAGAGCTTTTCCCCCCTTCTTTCTAAAGAACAAAATGTGGGTTTGCAGGATGAGCTTCTATGGGGAGCTGCTTGGCTTCAGAGAGCCACCAAAAACCCCAAATATCTCAATTACATTCAAGTCAATGGACAGACTCTCGGCGCCGGGGAGTTCGATAACACGTTCGGTTGGGATAACAAGCACGCCGGAGCCCGAATCCTTCTCTCAAAGGTTCGAAAAACACCCTCTTTATATCTCCACAACGGTATGATATTATAACAATAAACGTTGTTGTTCATCGTTTTTTATAGGCATTTTTGGTGCAAAAGATGCAATCTCTTCATGATTATAAAGGGCATGCTGATAATTTCATATGCTCTATAATTCCCGGAGCTTCATTCTCCTCAACCAAATACACTCCTGGTTGGTTCTGTTCAATTTCTTAACGGCTACTTTTTTTTGTGAAAATTTGAACAAAGTTTTGAACTTTAGTGTGATTGGGCAGGTGGGTTGCTGTTTAAAATGAACGATAGCAACATGCAATATGTTACATCAACTTCATTCCTACTCTTAACTTACGCCAAGTACTTGACCTCCGCCCGCTCCGTGGTGAACTGCGGCACGACCATCACCCCGAAAACCCTCCGCTCCATTGCCAAAAAACAGGTACCTCAAAAACTCGCTCTTGTTTGTTAAAAATCACGAACCTTCGCAATGGTATGATATTGTGAGATCCCACGTTAGTTGGGAAGGCGAATGAAACATTCTGTATACGGGTGTGGAAATATCTCCTTAGTAGACGCGTTTTAAGATCTTGAGGAGAAGCTGAGGACAATATCTACTAGTTGTTGGTTTGGGCCGCTATAGATATTGTCCACTCCGAGCATTAACTCACCAAAAGATCTCGTACCAAAATAGACGTTATTCTTTACTTATAAATTTATGAAGAACCCCTTAATTGGGAGCGAGTGGAAATTATCAATGTTGTATATTTGGGTGGTGTAG
GTGGACTATCTTCTAGGGGATAACCCACAAAAGATGTCCTACATGGTGGGGTATGGGGCAAGGTACCCTAAAAGGATCCATCACAGGGGATCATCACTGCCGTCGATTGCAACTCATCCAGGGAAGATCCAATGCACGGCAGGGTTCAGTGTGATGAACTCGGCCGCACCAAATCCAAACTTGCTGATTGGTGCAGTGGTGGGTGGGCCCGATAAGAATGACAGGTTCCCAGATCAACGGTCGGATTACGAACAGTCTGAGCCCGCCACGTATATGAATGCACCTCTTGTAGGGTCACTGGCTTATCTGGCTCACTCTTTTGGCCAGCTCTAAGGGCACTCTCCCTCTAAACTAAGGGAGGGGTACCCCAGATGATTACTATTATTAATGAAGAAAAAACGTGAGGGCACATGGCTAAAAAGGGGAAGGCTTTGCCCTCTACATTATTGACTTGGATGGAGAGTGTGGATTTCGATGTTCTTTTTTTCGGGTTCTCGAGTCAAAAGTAAAGTCTCGTAGGTATGATATGTTTGGTTTTCACTTAGATCATAAAATGGAATGAAAGCGTATCGTTATTATTATCATTATCGTCATAGTAACTTGTTCAAACC

mRNA sequence

ATGGCTGCTCCAAAGCCCTTTCCTTCCAGAGCTTCTTCATCAATGGCTGCCCTCAGACCTCTGTTTTTTATCCTCCTCCTCTGTTTCTTCCCCTGCAACAATGGCTTCCCTACTCATCGCCATCCCCATTCTGCCCACCATAACTACCGAGACGCTCTCACCAAATCCATCCTCTTCTTTGAAGGCCAGAGATCCGGCAGGCTCCCGCCCAATCAGAGGATCACTTGGCGGCGCAACTCCGGCCTCTCTGACGGCGCTGCCATGAAGGTCGATTTAGTCGGCGGTTACTACGATGCCGGCGACAATGTGAAATTTGGGTTTCCGATGGCGTTCACCACCACCATGCTTTCATGGAGTGTTCTTGAGTTCGGTGGGTTGATGAAAGGGGAGCTTCAAAATGCTAAACAAGCCATTCGTTGGGCCACTGATTATCTCCTCAAAGCCACCGCTCATCCCGACACCATTTACGTTCAGGTCGGCGATGCGAACAAGGACCATGCCTGTTGGGAGAGGCCAGAGGACATGGACACTCCCCGAAGTGTTTTCAAGGTGGACAAGAACAACCCCGGCACAGAGGTCGCCGCCGAGACCGCCGCCGCACTTGCCGCCGCATCGCTGGTATTCCGACGGAGTGATCCTAATTATTCAAACCTTCTGGTCAGAAGGGCCATGAGAGTTTTTGAGTTTGCTGATAAGTATAGAGGCTCTTACAGCACTGGTTTGAAAAAATATGTGTGCCCATTTTACTGTTCCTTCTCTGGATATCAGGATGAGCTTCTATGGGGAGCTGCTTGGCTTCAGAGAGCCACCAAAAACCCCAAATATCTCAATTACATTCAAGTCAATGGACAGACTCTCGGCGCCGGGGAGTTCGATAACACGTTCGGTTGGGATAACAAGCACGCCGGAGCCCGAATCCTTCTCTCAAAGGCATTTTTGGTGCAAAAGATGCAATCTCTTCATGATTATAAAGGGCATGCTGATAATTTCATATGCTCTATAATTCCCGGAGCTTCATTCTCCTCAACCAAATACACTCCTGGTGGGTTGCTGTTTAAAATGAACGATAGCAACATGCAATATGTTACATCAACTTCATTCCTACTCTTAACTTACGCCAAGTACTTGACCTCCGCCCGCTCCGTGGTGAACTGCGGCACGACCATCACCCCGAAAACCCTCCGCTCCATTGCCAAAAAACAGGTGGACTATCTTCTAGGGGATAACCCACAAAAGATGTCCTACATGGTGGGGTATGGGGCAAGGTACCCTAAAAGGATCCATCACAGGGGATCATCACTGCCGTCGATTGCAACTCATCCAGGGAAGATCCAATGCACGGCAGGGTTCAGTGTGATGAACTCGGCCGCACCAAATCCAAACTTGCTGATTGGTGCAGTGGTGGGTGGGCCCGATAAGAATGACAGGTTCCCAGATCAACGGTCGGATTACGAACAGTCTGAGCCCGCCACGTATATGAATGCACCTCTTGTAGGGTCACTGGCTTATCTGGCTCACTCTTTTGGCCAGCTCTAAGGGCACTCTCCCTCTAAACTAAGGGAGGGGTACCCCAGATGATTACTATTATTAATGAAGAAAAAACGTGAGGGCACATGGCTAAAAAGGGGAAGGCTTTGCCCTCTACATTATTGACTTGGATGGAGAGTGTGGATTTCGATGTTCTTTTTTTCGGGTTCTCGAGTCAAAAGTAAAGTCTCGTAGGTATGATATGTTTGGTTTTCACTTAGATCATAAAATGGAATGAAAGCGTATCGTTATTATTATCATTATCGTCATAGTAACTTGTTCAAACC

Coding sequence (CDS)

ATGGCTGCTCCAAAGCCCTTTCCTTCCAGAGCTTCTTCATCAATGGCTGCCCTCAGACCTCTGTTTTTTATCCTCCTCCTCTGTTTCTTCCCCTGCAACAATGGCTTCCCTACTCATCGCCATCCCCATTCTGCCCACCATAACTACCGAGACGCTCTCACCAAATCCATCCTCTTCTTTGAAGGCCAGAGATCCGGCAGGCTCCCGCCCAATCAGAGGATCACTTGGCGGCGCAACTCCGGCCTCTCTGACGGCGCTGCCATGAAGGTCGATTTAGTCGGCGGTTACTACGATGCCGGCGACAATGTGAAATTTGGGTTTCCGATGGCGTTCACCACCACCATGCTTTCATGGAGTGTTCTTGAGTTCGGTGGGTTGATGAAAGGGGAGCTTCAAAATGCTAAACAAGCCATTCGTTGGGCCACTGATTATCTCCTCAAAGCCACCGCTCATCCCGACACCATTTACGTTCAGGTCGGCGATGCGAACAAGGACCATGCCTGTTGGGAGAGGCCAGAGGACATGGACACTCCCCGAAGTGTTTTCAAGGTGGACAAGAACAACCCCGGCACAGAGGTCGCCGCCGAGACCGCCGCCGCACTTGCCGCCGCATCGCTGGTATTCCGACGGAGTGATCCTAATTATTCAAACCTTCTGGTCAGAAGGGCCATGAGAGTTTTTGAGTTTGCTGATAAGTATAGAGGCTCTTACAGCACTGGTTTGAAAAAATATGTGTGCCCATTTTACTGTTCCTTCTCTGGATATCAGGATGAGCTTCTATGGGGAGCTGCTTGGCTTCAGAGAGCCACCAAAAACCCCAAATATCTCAATTACATTCAAGTCAATGGACAGACTCTCGGCGCCGGGGAGTTCGATAACACGTTCGGTTGGGATAACAAGCACGCCGGAGCCCGAATCCTTCTCTCAAAGGCATTTTTGGTGCAAAAGATGCAATCTCTTCATGATTATAAAGGGCATGCTGATAATTTCATATGCTCTATAATTCCCGGAGCTTCATTCTCCTCAACCAAATACACTCCTGGTGGGTTGCTGTTTAAAATGAACGATAGCAACATGCAATATGTTACATCAACTTCATTCCTACTCTTAACTTACGCCAAGTACTTGACCTCCGCCCGCTCCGTGGTGAACTGCGGCACGACCATCACCCCGAAAACCCTCCGCTCCATTGCCAAAAAACAGGTGGACTATCTTCTAGGGGATAACCCACAAAAGATGTCCTACATGGTGGGGTATGGGGCAAGGTACCCTAAAAGGATCCATCACAGGGGATCATCACTGCCGTCGATTGCAACTCATCCAGGGAAGATCCAATGCACGGCAGGGTTCAGTGTGATGAACTCGGCCGCACCAAATCCAAACTTGCTGATTGGTGCAGTGGTGGGTGGGCCCGATAAGAATGACAGGTTCCCAGATCAACGGTCGGATTACGAACAGTCTGAGCCCGCCACGTATATGAATGCACCTCTTGTAGGGTCACTGGCTTATCTGGCTCACTCTTTTGGCCAGCTCTAA

Protein sequence

MAAPKPFPSRASSSMAALRPLFFILLLCFFPCNNGFPTHRHPHSAHHNYRDALTKSILFFEGQRSGRLPPNQRITWRRNSGLSDGAAMKVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVLEFGGLMKGELQNAKQAIRWATDYLLKATAHPDTIYVQVGDANKDHACWERPEDMDTPRSVFKVDKNNPGTEVAAETAAALAAASLVFRRSDPNYSNLLVRRAMRVFEFADKYRGSYSTGLKKYVCPFYCSFSGYQDELLWGAAWLQRATKNPKYLNYIQVNGQTLGAGEFDNTFGWDNKHAGARILLSKAFLVQKMQSLHDYKGHADNFICSIIPGASFSSTKYTPGGLLFKMNDSNMQYVTSTSFLLLTYAKYLTSARSVVNCGTTITPKTLRSIAKKQVDYLLGDNPQKMSYMVGYGARYPKRIHHRGSSLPSIATHPGKIQCTAGFSVMNSAAPNPNLLIGAVVGGPDKNDRFPDQRSDYEQSEPATYMNAPLVGSLAYLAHSFGQL
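
The protein sequence follows from the CDS by codon-wise translation. The sketch below checks this on two short fragments copied from the CDS above (its first 18 and last 12 nucleotides). It assumes the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative helpers, not taken from the database.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCTGCTCCAAAGCCC"))  # start of the CDS -> MAAPKP
print(translate("GGCCAGCTCTAA"))        # end of the CDS   -> GQL
```

The two results match the first six and last three residues of the protein sequence listed above.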
Homology
BLAST of CmaCh08G004410 vs. ExPASy Swiss-Prot
Match: O81416 (Endoglucanase 17 OS=Arabidopsis thaliana OX=3702 GN=At4g02290 PE=2 SV=1)

HSP 1 Score: 820.5 bits (2118), Expect = 1.1e-236
Identity = 394/512 (76.95%), Positives = 455/512 (88.87%), Query Frame = 0

Query: 11  ASSSMAALRPLFFILLLCFFPCNN-GFPT---------HRHPHSAHHNYRDALTKSILFF 70
           +SSS  ALR   F L   FF CN   +PT         HRH H A HNY+DALTKSILFF
Sbjct: 7   SSSSSYALRVTIF-LSFFFFLCNGFSYPTTSSLFNTHHHRH-HLAKHNYKDALTKSILFF 66

Query: 71  EGQRSGRLPPNQRITWRRNSGLSDGAAMKVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSV 130
           EGQRSG+LP NQR++WRR+SGLSDG+A+ VDLVGGYYDAGDN+KFGFPMAFTTTMLSWSV
Sbjct: 67  EGQRSGKLPSNQRMSWRRDSGLSDGSALHVDLVGGYYDAGDNIKFGFPMAFTTTMLSWSV 126

Query: 131 LEFGGLMKGELQNAKQAIRWATDYLLKATAHPDTIYVQVGDANKDHACWERPEDMDTPRS 190
           +EFGGLMK ELQNAK AIRWATDYLLKAT+ PDTIYVQVGDANKDH+CWERPEDMDT RS
Sbjct: 127 IEFGGLMKSELQNAKIAIRWATDYLLKATSQPDTIYVQVGDANKDHSCWERPEDMDTVRS 186

Query: 191 VFKVDKNNPGTEVAAETAAALAAASLVFRRSDPNYSNLLVRRAMRVFEFADKYRGSYSTG 250
           VFKVDKN PG++VAAETAAALAAA++VFR+SDP+YS +L++RA+ VF FADKYRG+YS G
Sbjct: 187 VFKVDKNIPGSDVAAETAAALAAAAIVFRKSDPSYSKVLLKRAISVFAFADKYRGTYSAG 246

Query: 251 LKKYVCPFYCSFSGYQDELLWGAAWLQRATKNPKYLNYIQVNGQTLGAGEFDNTFGWDNK 310
           LK  VCPFYCS+SGYQDELLWGAAWLQ+ATKN KYLNYI++NGQ LGA E+DNTFGWDNK
Sbjct: 247 LKPDVCPFYCSYSGYQDELLWGAAWLQKATKNIKYLNYIKINGQILGAAEYDNTFGWDNK 306

Query: 311 HAGARILLSKAFLVQKMQSLHDYKGHADNFICSIIPGASFSSTKYTPGGLLFKMNDSNMQ 370
           HAGARILL+KAFLVQ +++LH+YKGHADNFICS+IPGA FSST+YTPGGLLFKM D+NMQ
Sbjct: 307 HAGARILLTKAFLVQNVKTLHEYKGHADNFICSVIPGAPFSSTQYTPGGLLFKMADANMQ 366

Query: 371 YVTSTSFLLLTYAKYLTSARSVVNCGTTI-TPKTLRSIAKKQVDYLLGDNPQKMSYMVGY 430
           YVTSTSFLLLTYAKYLTSA++VV+CG ++ TP  LRSIAK+QVDYLLGDNP +MSYMVGY
Sbjct: 367 YVTSTSFLLLTYAKYLTSAKTVVHCGGSVYTPGRLRSIAKRQVDYLLGDNPLRMSYMVGY 426

Query: 431 GARYPKRIHHRGSSLPSIATHPGKIQCTAGFSVMNSAAPNPNLLIGAVVGGPDKNDRFPD 490
           G ++P+RIHHRGSSLP +A+HP KIQC  GF++MNS +PNPN L+GAVVGGPD++DRFPD
Sbjct: 427 GPKFPRRIHHRGSSLPCVASHPAKIQCHQGFAIMNSQSPNPNFLVGAVVGGPDQHDRFPD 486

Query: 491 QRSDYEQSEPATYMNAPLVGSLAYLAHSFGQL 512
           +RSDYEQSEPATY+N+PLVG+LAY AH++GQL
Sbjct: 487 ERSDYEQSEPATYINSPLVGALAYFAHAYGQL 516
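
The percentages in each HSP summary line are ratios of matching columns to the alignment length, with gap columns counted in the denominator. As a minimal sketch, the Python below recomputes them from the counts of the first hit; `hsp_percentages` is a hypothetical helper name, and the parsing assumes the identity count appears before the positives count, as it does on this page.

```python
import re

def hsp_percentages(summary):
    """Recompute identity and positives percentages from 'matches/length' counts."""
    counts = re.findall(r"(\d+)/(\d+)", summary)
    (ident, length), (pos, _) = counts[0], counts[1]
    length = int(length)
    return (round(100 * int(ident) / length, 2),
            round(100 * int(pos) / length, 2))

print(hsp_percentages("Identity = 394/512 (76.95%), Positives = 455/512 (88.87%)"))
# -> (76.95, 88.87)
```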

BLAST of CmaCh08G004410 vs. ExPASy Swiss-Prot
Match: Q9SRX3 (Endoglucanase 1 OS=Arabidopsis thaliana OX=3702 GN=CEL2 PE=2 SV=1)

HSP 1 Score: 776.2 bits (2003), Expect = 2.3e-223
Identity = 377/504 (74.80%), Positives = 430/504 (85.32%), Query Frame = 0

Query: 13  SSMAALRPLFFILLLCFFPCNNGFPT--------HRHPHSAHHNYRDALTKSILFFEGQR 72
           SS   +  L FILLL     +NGF +        HRH H  +HNY+DAL+KSILFFEGQR
Sbjct: 6   SSSRLITFLSFILLL-----SNGFSSSSSRPSIHHRH-HLDNHNYKDALSKSILFFEGQR 65

Query: 73  SGRLPPNQRITWRRNSGLSDGAAMKVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVLEFG 132
           SG+LPPNQR+TWR NSGLSDG+A+ VDLVGGYYDAGDN+KFGFPMAFTTTMLSWS++EFG
Sbjct: 66  SGKLPPNQRMTWRSNSGLSDGSALNVDLVGGYYDAGDNMKFGFPMAFTTTMLSWSLIEFG 125

Query: 133 GLMKGELQNAKQAIRWATDYLLKATAHPDTIYVQVGDANKDHACWERPEDMDTPRSVFKV 192
           GLMK EL NAK AIRWATD+LLKAT+HPDTIYVQVGD N DHACWERPEDMDTPRSVFKV
Sbjct: 126 GLMKSELPNAKDAIRWATDFLLKATSHPDTIYVQVGDPNMDHACWERPEDMDTPRSVFKV 185

Query: 193 DKNNPGTEVAAETAAALAAASLVFRRSDPNYSNLLVRRAMRVFEFADKYRGSYSTGLKKY 252
           DKNNPG+++A E AAALAAAS+VFR+ DP+YSN L++RA+ VF FADKYRG YS GL   
Sbjct: 186 DKNNPGSDIAGEIAAALAAASIVFRKCDPSYSNHLLQRAITVFTFADKYRGPYSAGLAPE 245

Query: 253 VCPFYCSFSGYQDELLWGAAWLQRATKNPKYLNYIQVNGQTLGAGEFDNTFGWDNKHAGA 312
           VCPFYCS+SGYQDELLWGAAWLQ+AT NP YLNYI+ NGQ LGA EFDN F WDNKH GA
Sbjct: 246 VCPFYCSYSGYQDELLWGAAWLQKATNNPTYLNYIKANGQILGADEFDNMFSWDNKHVGA 305

Query: 313 RILLSKAFLVQKMQSLHDYKGHADNFICSIIPGASFSSTKYTPGGLLFKMNDSNMQYVTS 372
           RILLSK FL+QK++SL +YK HAD+FICS++PGA  SS++YTPGGLLFKM +SNMQYVTS
Sbjct: 306 RILLSKEFLIQKVKSLEEYKEHADSFICSVLPGA--SSSQYTPGGLLFKMGESNMQYVTS 365

Query: 373 TSFLLLTYAKYLTSARSVVNC-GTTITPKTLRSIAKKQVDYLLGDNPQKMSYMVGYGARY 432
           TSFLLLTYAKYLTSAR+V  C G+ +TP  LRSIAKKQVDYLLG NP KMSYMVGYG +Y
Sbjct: 366 TSFLLLTYAKYLTSARTVAYCGGSVVTPARLRSIAKKQVDYLLGGNPLKMSYMVGYGLKY 425

Query: 433 PKRIHHRGSSLPSIATHPGKIQCTAGFSVMNSAAPNPNLLIGAVVGGPDKNDRFPDQRSD 492
           P+RIHHRGSSLPS+A HP +IQC  GFS+  S +PNPN L+GAVVGGPD+ND+FPD+RSD
Sbjct: 426 PRRIHHRGSSLPSVAVHPTRIQCHDGFSLFTSQSPNPNDLVGAVVGGPDQNDQFPDERSD 485

Query: 493 YEQSEPATYMNAPLVGSLAYLAHS 508
           Y +SEPATY+NAPLVG+LAYLA S
Sbjct: 486 YGRSEPATYINAPLVGALAYLARS 501

BLAST of CmaCh08G004410 vs. ExPASy Swiss-Prot
Match: Q8LQ92 (Endoglucanase 3 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU8 PE=2 SV=1)

HSP 1 Score: 762.3 bits (1967), Expect = 3.5e-219
Identity = 361/501 (72.06%), Positives = 429/501 (85.63%), Query Frame = 0

Query: 15  MAALRPLFFILLLCFFPCNNG---FPTHRHPHSAHHNYRDALTKSILFFEGQRSGRLPPN 74
           MA LR LF  LL    P  N         H   A H+YRDALTKSILFFEGQRSG+LPP+
Sbjct: 1   MALLRCLF--LLAVLLPHRNAAVVAAASPHHGPAPHDYRDALTKSILFFEGQRSGKLPPS 60

Query: 75  QRITWRRNSGLSDGAAMKVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVLEFGGLMKGEL 134
           QR++WR +SGLSDG+++KVDLVGGYYDAGDN+KFGFP+AF+ TML+WSV+EFGGLMKGEL
Sbjct: 61  QRVSWRGDSGLSDGSSIKVDLVGGYYDAGDNMKFGFPLAFSMTMLAWSVVEFGGLMKGEL 120

Query: 135 QNAKQAIRWATDYLLKATAHPDTIYVQVGDANKDHACWERPEDMDTPRSVFKVDKNNPGT 194
           Q+A+ A+RW +DYLLKATAHPDT+YVQVGDAN+DHACWERPEDMDTPR+V+KVD + PGT
Sbjct: 121 QHARDAVRWGSDYLLKATAHPDTVYVQVGDANRDHACWERPEDMDTPRTVYKVDPSTPGT 180

Query: 195 EVAAETAAALAAASLVFRRSDPNYSNLLVRRAMRVFEFADKYRGSYSTGLKKYVCPFYCS 254
           +VAAETAAALAAASLVFR+SDP Y++ LV RA RVFEFADK+RG+YST L  YVCP+YCS
Sbjct: 181 DVAAETAAALAAASLVFRKSDPAYASRLVARAKRVFEFADKHRGTYSTRLSPYVCPYYCS 240

Query: 255 FSGYQDELLWGAAWLQRATKNPKYLNYIQVNGQTLGAGEFDNTFGWDNKHAGARILLSKA 314
           +SGYQDELLWGAAWL RATKNP YL+YIQ+NGQ LGA E DNTFGWDNKHAGARIL++KA
Sbjct: 241 YSGYQDELLWGAAWLHRATKNPTYLSYIQMNGQVLGADEQDNTFGWDNKHAGARILIAKA 300

Query: 315 FLVQKMQSLHDYKGHADNFICSIIPGASFSSTKYTPGGLLFKMNDSNMQYVTSTSFLLLT 374
           FLVQK+ +LH+YKGHAD+FICS++PG     T+YT GGLLFK++DSNMQYVTS+SFLLLT
Sbjct: 301 FLVQKVAALHEYKGHADSFICSMVPGTPTDQTQYTRGGLLFKLSDSNMQYVTSSSFLLLT 360

Query: 375 YAKYLTSARSVVNC-GTTITPKTLRSIAKKQVDYLLGDNPQKMSYMVGYGARYPKRIHHR 434
           YAKYL  +++ V+C G  +TP  LR+IA++QVDYLLG NP  MSYMVGYGA+YP+RIHHR
Sbjct: 361 YAKYLAFSKTTVSCGGAAVTPARLRAIARQQVDYLLGSNPMGMSYMVGYGAKYPRRIHHR 420

Query: 435 GSSLPSIATHPGKIQCTAGFSVMNSAAPNPNLLIGAVVGGPDKNDRFPDQRSDYEQSEPA 494
            SSLPS+A HP +I C+ GF+ + S   NPN+L+GAVVGGP+  D+FPDQRSD+E SEPA
Sbjct: 421 ASSLPSVAAHPARIGCSQGFTALYSGVANPNVLVGAVVGGPNLQDQFPDQRSDHEHSEPA 480

Query: 495 TYMNAPLVGSLAYLAHSFGQL 512
           TY+NAPLVG+LAYLAHS+GQL
Sbjct: 481 TYINAPLVGALAYLAHSYGQL 499

BLAST of CmaCh08G004410 vs. ExPASy Swiss-Prot
Match: P05522 (Endoglucanase 1 OS=Persea americana OX=3435 GN=CEL1 PE=2 SV=1)

HSP 1 Score: 639.0 bits (1647), Expect = 4.5e-182
Identity = 304/489 (62.17%), Positives = 377/489 (77.10%), Query Frame = 0

Query: 21  LFFILLLC--FFPCNNGFPTHRHPHSAHHNYRDALTKSILFFEGQRSGRLPPNQRITWRR 80
           LF +LL+C     C +    H         Y DAL KSILFFEGQRSG+LP NQR+TWR 
Sbjct: 9   LFHLLLVCTVMVKCCSASDLH---------YSDALEKSILFFEGQRSGKLPTNQRLTWRG 68

Query: 81  NSGLSDGAAMKVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVLEFGGLMKGELQNAKQAI 140
           +SGLSDG++  VDLVGGYYDAGDN+KFG PMAFTTTML+W ++EFG LM  +++NA+ A+
Sbjct: 69  DSGLSDGSSYHVDLVGGYYDAGDNLKFGLPMAFTTTMLAWGIIEFGCLMPEQVENARAAL 128

Query: 141 RWATDYLLKA-TAHPDTIYVQVGDANKDHACWERPEDMDTPRSVFKVDKNNPGTEVAAET 200
           RW+TDYLLKA TA  +++YVQVG+ N DH CWERPEDMDTPR+V+KV   NPG++VAAET
Sbjct: 129 RWSTDYLLKASTATSNSLYVQVGEPNADHRCWERPEDMDTPRNVYKVSTQNPGSDVAAET 188

Query: 201 AAALAAASLVFRRSDPNYSNLLVRRAMRVFEFADKYRGSYSTGLKKYVCPFYCSFSGYQD 260
           AAALAAAS+VF  SD +YS  L+  A++VFEFAD+YRGSYS  L   VCPFYCS+SGY D
Sbjct: 189 AAALAAASIVFGDSDSSYSTKLLHTAVKVFEFADQYRGSYSDSLGSVVCPFYCSYSGYND 248

Query: 261 ELLWGAAWLQRATKNPKYLNYIQVNGQTLGAGEFDNTFGWDNKHAGARILLSKAFLVQKM 320
           ELLWGA+WL RA++N  Y+ YIQ NG TLGA + D +F WD+K  G ++LLSK FL  ++
Sbjct: 249 ELLWGASWLHRASQNASYMTYIQSNGHTLGADDDDYSFSWDDKRVGTKVLLSKGFLQDRI 308

Query: 321 QSLHDYKGHADNFICSIIPGASFSSTKYTPGGLLFKMNDSNMQYVTSTSFLLLTYAKYLT 380
           + L  YK H DN+ICS+IPG S    +YTPGGLL+K + SN+QYVTST+FLLLTYA YL 
Sbjct: 309 EELQLYKVHTDNYICSLIPGTSSFQAQYTPGGLLYKGSASNLQYVTSTAFLLLTYANYLN 368

Query: 381 SARSVVNCG-TTITPKTLRSIAKKQVDYLLGDNPQKMSYMVGYGARYPKRIHHRGSSLPS 440
           S+    +CG TT+T K L S+AKKQVDY+LG NP KMSYMVG+G RYP+ +HHRGSSLPS
Sbjct: 369 SSGGHASCGTTTVTAKNLISLAKKQVDYILGQNPAKMSYMVGFGERYPQHVHHRGSSLPS 428

Query: 441 IATHPGKIQCTAGFSVMNSAAPNPNLLIGAVVGGPDKNDRFPDQRSDYEQSEPATYMNAP 500
           +  HP  I C AGF  + S+ PNPN+L+GA++GGPD  D F D R++Y+QSEPATY+NAP
Sbjct: 429 VQVHPNSIPCNAGFQYLYSSPPNPNILVGAILGGPDNRDSFSDDRNNYQQSEPATYINAP 488

Query: 501 LVGSLAYLA 506
           LVG+LA+ A
Sbjct: 489 LVGALAFFA 488

BLAST of CmaCh08G004410 vs. ExPASy Swiss-Prot
Match: Q652F9 (Endoglucanase 17 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU13 PE=2 SV=1)

HSP 1 Score: 620.9 bits (1600), Expect = 1.3e-176
Identity = 294/465 (63.23%), Positives = 370/465 (79.57%), Query Frame = 0

Query: 44  SAHHNYRDALTKSILFFEGQRSGRLPPNQRITWRRNSGLSDGAAMKVDLVGGYYDAGDNV 103
           +  H+Y DAL KSILFFEGQRSGRLPP+QR+ WRR+S L+DGA   VDL GGYYDAGDNV
Sbjct: 20  TGQHDYSDALHKSILFFEGQRSGRLPPDQRLRWRRDSALNDGATAGVDLTGGYYDAGDNV 79

Query: 104 KFGFPMAFTTTMLSWSVLEFGGLMKGELQNAKQAIRWATDYLLKATAHPDTIYVQVGDAN 163
           KFGFPMAFT T++SW +++FG         A++A+RWATDYL+KATA P+T+YVQVGDA 
Sbjct: 80  KFGFPMAFTATLMSWGLIDFGRSFGAHAAEAREAVRWATDYLMKATATPNTVYVQVGDAF 139

Query: 164 KDHACWERPEDMDTPRSVFKVDKNNPGTEVAAETAAALAAASLVFRRSDPNYSNLLVRRA 223
           +DH+CWERPEDMDTPR+V+KVD ++PG++VAAETAAALAAAS+VFR +DP+YSN L+ RA
Sbjct: 140 RDHSCWERPEDMDTPRTVYKVDPSHPGSDVAAETAAALAAASIVFRDADPDYSNRLLDRA 199

Query: 224 MRVFEFADKYRGSYSTGLKKYVCPFYCSFSGYQDELLWGAAWLQRATKNPKYLNYIQVNG 283
           ++VFEFADKYRG YS+ L   VCP YC +SGY+DELLWGAAWL +A++  +Y +YI+ N 
Sbjct: 200 IQVFEFADKYRGPYSSSLHAAVCPCYCDYSGYKDELLWGAAWLHKASRRREYRDYIKRNE 259

Query: 284 QTLGAGEFDNTFGWDNKHAGARILLSKAFLVQKMQSLHDYKGHADNFICSIIPGAS-FSS 343
             LGA E  N FGWDNKHAG  +L+SK  L+ K +    ++ +ADNFIC+++PG S    
Sbjct: 260 VVLGASEAINEFGWDNKHAGINVLISKEVLMGKDEYFQSFRVNADNFICTLLPGISNHPQ 319

Query: 344 TKYTPGGLLFKMNDSNMQYVTSTSFLLLTYAKYLTSARSVVNCGT-TITPKTLRSIAKKQ 403
            +Y+PGGLLFK+ +SNMQ+VTS SFLLL Y+ YL+ A   V CGT + +P  LR +AK+Q
Sbjct: 320 IQYSPGGLLFKVGNSNMQHVTSLSFLLLAYSNYLSHANVRVPCGTSSASPVQLRRVAKRQ 379

Query: 404 VDYLLGDNPQKMSYMVGYGARYPKRIHHRGSSLPSIATHPGKIQCTAGFSVMNSAAPNPN 463
           VDY+LGDNP +MSYMVGYG+RYP RIHHRGSSLPS+A HP +I C AG +   SAAPNPN
Sbjct: 380 VDYILGDNPLRMSYMVGYGSRYPLRIHHRGSSLPSVAAHPAQIGCKAGATYYASAAPNPN 439

Query: 464 LLIGAVVGGP-DKNDRFPDQRSDYEQSEPATYMNAPLVGSLAYLA 506
           LL+GAVVGGP + +D FPD R+ ++QSEP TY+NAPL+G LAY +
Sbjct: 440 LLVGAVVGGPSNTSDAFPDARAVFQQSEPTTYINAPLLGLLAYFS 484

BLAST of CmaCh08G004410 vs. TAIR 10
Match: AT4G02290.1 (glycosyl hydrolase 9B13 )

HSP 1 Score: 820.5 bits (2118), Expect = 7.7e-238
Identity = 394/512 (76.95%), Positives = 455/512 (88.87%), Query Frame = 0

Query: 11  ASSSMAALRPLFFILLLCFFPCNN-GFPT---------HRHPHSAHHNYRDALTKSILFF 70
           +SSS  ALR   F L   FF CN   +PT         HRH H A HNY+DALTKSILFF
Sbjct: 7   SSSSSYALRVTIF-LSFFFFLCNGFSYPTTSSLFNTHHHRH-HLAKHNYKDALTKSILFF 66

Query: 71  EGQRSGRLPPNQRITWRRNSGLSDGAAMKVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSV 130
           EGQRSG+LP NQR++WRR+SGLSDG+A+ VDLVGGYYDAGDN+KFGFPMAFTTTMLSWSV
Sbjct: 67  EGQRSGKLPSNQRMSWRRDSGLSDGSALHVDLVGGYYDAGDNIKFGFPMAFTTTMLSWSV 126

Query: 131 LEFGGLMKGELQNAKQAIRWATDYLLKATAHPDTIYVQVGDANKDHACWERPEDMDTPRS 190
           +EFGGLMK ELQNAK AIRWATDYLLKAT+ PDTIYVQVGDANKDH+CWERPEDMDT RS
Sbjct: 127 IEFGGLMKSELQNAKIAIRWATDYLLKATSQPDTIYVQVGDANKDHSCWERPEDMDTVRS 186

Query: 191 VFKVDKNNPGTEVAAETAAALAAASLVFRRSDPNYSNLLVRRAMRVFEFADKYRGSYSTG 250
           VFKVDKN PG++VAAETAAALAAA++VFR+SDP+YS +L++RA+ VF FADKYRG+YS G
Sbjct: 187 VFKVDKNIPGSDVAAETAAALAAAAIVFRKSDPSYSKVLLKRAISVFAFADKYRGTYSAG 246

Query: 251 LKKYVCPFYCSFSGYQDELLWGAAWLQRATKNPKYLNYIQVNGQTLGAGEFDNTFGWDNK 310
           LK  VCPFYCS+SGYQDELLWGAAWLQ+ATKN KYLNYI++NGQ LGA E+DNTFGWDNK
Sbjct: 247 LKPDVCPFYCSYSGYQDELLWGAAWLQKATKNIKYLNYIKINGQILGAAEYDNTFGWDNK 306

Query: 311 HAGARILLSKAFLVQKMQSLHDYKGHADNFICSIIPGASFSSTKYTPGGLLFKMNDSNMQ 370
           HAGARILL+KAFLVQ +++LH+YKGHADNFICS+IPGA FSST+YTPGGLLFKM D+NMQ
Sbjct: 307 HAGARILLTKAFLVQNVKTLHEYKGHADNFICSVIPGAPFSSTQYTPGGLLFKMADANMQ 366

Query: 371 YVTSTSFLLLTYAKYLTSARSVVNCGTTI-TPKTLRSIAKKQVDYLLGDNPQKMSYMVGY 430
           YVTSTSFLLLTYAKYLTSA++VV+CG ++ TP  LRSIAK+QVDYLLGDNP +MSYMVGY
Sbjct: 367 YVTSTSFLLLTYAKYLTSAKTVVHCGGSVYTPGRLRSIAKRQVDYLLGDNPLRMSYMVGY 426

Query: 431 GARYPKRIHHRGSSLPSIATHPGKIQCTAGFSVMNSAAPNPNLLIGAVVGGPDKNDRFPD 490
           G ++P+RIHHRGSSLP +A+HP KIQC  GF++MNS +PNPN L+GAVVGGPD++DRFPD
Sbjct: 427 GPKFPRRIHHRGSSLPCVASHPAKIQCHQGFAIMNSQSPNPNFLVGAVVGGPDQHDRFPD 486

Query: 491 QRSDYEQSEPATYMNAPLVGSLAYLAHSFGQL 512
           +RSDYEQSEPATY+N+PLVG+LAY AH++GQL
Sbjct: 487 ERSDYEQSEPATYINSPLVGALAYFAHAYGQL 516

BLAST of CmaCh08G004410 vs. TAIR 10
Match: AT1G02800.1 (cellulase 2 )

HSP 1 Score: 776.2 bits (2003), Expect = 1.7e-224
Identity = 377/504 (74.80%), Positives = 430/504 (85.32%), Query Frame = 0

Query: 13  SSMAALRPLFFILLLCFFPCNNGFPT--------HRHPHSAHHNYRDALTKSILFFEGQR 72
           SS   +  L FILLL     +NGF +        HRH H  +HNY+DAL+KSILFFEGQR
Sbjct: 6   SSSRLITFLSFILLL-----SNGFSSSSSRPSIHHRH-HLDNHNYKDALSKSILFFEGQR 65

Query: 73  SGRLPPNQRITWRRNSGLSDGAAMKVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVLEFG 132
           SG+LPPNQR+TWR NSGLSDG+A+ VDLVGGYYDAGDN+KFGFPMAFTTTMLSWS++EFG
Sbjct: 66  SGKLPPNQRMTWRSNSGLSDGSALNVDLVGGYYDAGDNMKFGFPMAFTTTMLSWSLIEFG 125

Query: 133 GLMKGELQNAKQAIRWATDYLLKATAHPDTIYVQVGDANKDHACWERPEDMDTPRSVFKV 192
           GLMK EL NAK AIRWATD+LLKAT+HPDTIYVQVGD N DHACWERPEDMDTPRSVFKV
Sbjct: 126 GLMKSELPNAKDAIRWATDFLLKATSHPDTIYVQVGDPNMDHACWERPEDMDTPRSVFKV 185

Query: 193 DKNNPGTEVAAETAAALAAASLVFRRSDPNYSNLLVRRAMRVFEFADKYRGSYSTGLKKY 252
           DKNNPG+++A E AAALAAAS+VFR+ DP+YSN L++RA+ VF FADKYRG YS GL   
Sbjct: 186 DKNNPGSDIAGEIAAALAAASIVFRKCDPSYSNHLLQRAITVFTFADKYRGPYSAGLAPE 245

Query: 253 VCPFYCSFSGYQDELLWGAAWLQRATKNPKYLNYIQVNGQTLGAGEFDNTFGWDNKHAGA 312
           VCPFYCS+SGYQDELLWGAAWLQ+AT NP YLNYI+ NGQ LGA EFDN F WDNKH GA
Sbjct: 246 VCPFYCSYSGYQDELLWGAAWLQKATNNPTYLNYIKANGQILGADEFDNMFSWDNKHVGA 305

Query: 313 RILLSKAFLVQKMQSLHDYKGHADNFICSIIPGASFSSTKYTPGGLLFKMNDSNMQYVTS 372
           RILLSK FL+QK++SL +YK HAD+FICS++PGA  SS++YTPGGLLFKM +SNMQYVTS
Sbjct: 306 RILLSKEFLIQKVKSLEEYKEHADSFICSVLPGA--SSSQYTPGGLLFKMGESNMQYVTS 365

Query: 373 TSFLLLTYAKYLTSARSVVNC-GTTITPKTLRSIAKKQVDYLLGDNPQKMSYMVGYGARY 432
           TSFLLLTYAKYLTSAR+V  C G+ +TP  LRSIAKKQVDYLLG NP KMSYMVGYG +Y
Sbjct: 366 TSFLLLTYAKYLTSARTVAYCGGSVVTPARLRSIAKKQVDYLLGGNPLKMSYMVGYGLKY 425

Query: 433 PKRIHHRGSSLPSIATHPGKIQCTAGFSVMNSAAPNPNLLIGAVVGGPDKNDRFPDQRSD 492
           P+RIHHRGSSLPS+A HP +IQC  GFS+  S +PNPN L+GAVVGGPD+ND+FPD+RSD
Sbjct: 426 PRRIHHRGSSLPSVAVHPTRIQCHDGFSLFTSQSPNPNDLVGAVVGGPDQNDQFPDERSD 485

Query: 493 YEQSEPATYMNAPLVGSLAYLAHS 508
           Y +SEPATY+NAPLVG+LAYLA S
Sbjct: 486 YGRSEPATYINAPLVGALAYLARS 501

BLAST of CmaCh08G004410 vs. TAIR 10
Match: AT1G70710.1 (glycosyl hydrolase 9B1 )

HSP 1 Score: 609.4 bits (1570), Expect = 2.7e-174
Identity = 291/467 (62.31%), Positives = 356/467 (76.23%), Query Frame = 0

Query: 43  HSAHHNYRDALTKSILFFEGQRSGRLPPNQRITWRRNSGLSDGAAMKVDLVGGYYDAGDN 102
           +SA H+YRDAL KSILFFEGQRSG+LPP+QR+ WRR+S L DG++  VDL GGYYDAGDN
Sbjct: 23  YSAGHDYRDALRKSILFFEGQRSGKLPPDQRLKWRRDSALRDGSSAGVDLSGGYYDAGDN 82

Query: 103 VKFGFPMAFTTTMLSWSVLEFGGLMKGELQNAKQAIRWATDYLLKATAHPDTIYVQVGDA 162
           +KFGFPMAFTTTMLSWS+++FG  M  EL+NA +A++W TDYLLKATA P  ++VQVGDA
Sbjct: 83  IKFGFPMAFTTTMLSWSIIDFGKTMGPELRNAVKAVKWGTDYLLKATAIPGVVFVQVGDA 142

Query: 163 NKDHACWERPEDMDTPRSVFKVDKNNPGTEVAAETAAALAAASLVFRRSDPNYSNLLVRR 222
             DH CWERPEDMDT R+V+K+D+ +PG++VA ETAAALAAAS+VFR+ DP YS LL+ R
Sbjct: 143 YSDHNCWERPEDMDTLRTVYKIDRAHPGSDVAGETAAALAAASIVFRKRDPAYSRLLLDR 202

Query: 223 AMRVFEFADKYRGSYSTGLKKYVCPFYCSFSGYQDELLWGAAWLQRATKNPKYLNYIQVN 282
           A RVF FA++YRG+YS  L   VCPFYC F+GYQDELLWGAAWL +A++   Y  +I  N
Sbjct: 203 ATRVFAFANRYRGAYSNSLYHAVCPFYCDFNGYQDELLWGAAWLHKASRKRAYREFIVKN 262

Query: 283 GQTLGAGEFDNTFGWDNKHAGARILLSKAFLVQKMQSLHDYKGHADNFICSIIPGASFSS 342
              L AG+  N FGWDNKHAG  +L+SK  L+ K +    +K +AD FICSI+PG S   
Sbjct: 263 EVILKAGDTINEFGWDNKHAGINVLISKEVLMGKAEYFESFKQNADGFICSILPGISHPQ 322

Query: 343 TKYTPGGLLFKMNDSNMQYVTSTSFLLLTYAKYLTSARSVVNCG-TTITPKTLRSIAKKQ 402
            +Y+ GGLL K   SNMQ+VTS SFLLL Y+ YL+ A+ VV CG  T +P  LR IAK+Q
Sbjct: 323 VQYSRGGLLVKTGGSNMQHVTSLSFLLLAYSNYLSHAKKVVPCGELTASPSLLRQIAKRQ 382

Query: 403 VDYLLGDNPQKMSYMVGYGARYPKRIHHRGSSLPSIATHPGKIQCTAGFSVMNSAAPNPN 462
           VDY+LGDNP  +SYMVGYG ++P+RIHHRGSS+PS++ HP  I C  G     S  PNPN
Sbjct: 383 VDYILGDNPMGLSYMVGYGQKFPRRIHHRGSSVPSVSAHPSHIGCKEGSRYFLSPNPNPN 442

Query: 463 LLIGAVVGGPDKNDRFPDQRSDYEQSEPATYMNAPLVGSLAYL-AHS 508
           LL+GAVVGGP+  D FPD R  ++QSEP TY+NAPLVG L Y  AHS
Sbjct: 443 LLVGAVVGGPNVTDAFPDSRPYFQQSEPTTYINAPLVGLLGYFSAHS 489

BLAST of CmaCh08G004410 vs. TAIR 10
Match: AT1G23210.1 (glycosyl hydrolase 9B6 )

HSP 1 Score: 601.3 bits (1549), Expect = 7.3e-172
Identity = 286/462 (61.90%), Positives = 352/462 (76.19%), Query Frame = 0

Query: 45  AHHNYRDALTKSILFFEGQRSGRLPPNQRITWRRNSGLSDGAAMKVDLVGGYYDAGDNVK 104
           A H+YRDAL KSILFFEGQRSG+LPP+QR+ WRR+S L DG++  VDL GGYYDAGDNVK
Sbjct: 25  AGHDYRDALRKSILFFEGQRSGKLPPDQRLKWRRDSALRDGSSAGVDLTGGYYDAGDNVK 84

Query: 105 FGFPMAFTTTMLSWSVLEFGGLMKGELQNAKQAIRWATDYLLKATAHPDTIYVQVGDANK 164
           FGFPMAFTTTM+SWSV++FG  M  EL+NA +AI+W TDYL+KAT  PD ++VQVGDA  
Sbjct: 85  FGFPMAFTTTMMSWSVIDFGKTMGPELENAVKAIKWGTDYLMKATQIPDVVFVQVGDAYS 144

Query: 165 DHACWERPEDMDTPRSVFKVDKNNPGTEVAAETAAALAAASLVFRRSDPNYSNLLVRRAM 224
           DH CWERPEDMDT R+V+K+DK++ G+EVA ETAAALAAAS+VF + DP YS +L+ RA 
Sbjct: 145 DHNCWERPEDMDTLRTVYKIDKDHSGSEVAGETAAALAAASIVFEKRDPVYSKMLLDRAT 204

Query: 225 RVFEFADKYRGSYSTGLKKYVCPFYCSFSGYQDELLWGAAWLQRATKNPKYLNYIQVNGQ 284
           RVF FA KYRG+YS  L + VCPFYC F+GY+DELLWGAAWL +A+K   Y  +I  N  
Sbjct: 205 RVFAFAQKYRGAYSDSLYQAVCPFYCDFNGYEDELLWGAAWLHKASKKRVYREFIVKNQV 264

Query: 285 TLGAGEFDNTFGWDNKHAGARILLSKAFLVQKMQSLHDYKGHADNFICSIIPGASFSSTK 344
            L AG+  + FGWDNKHAG  +L+SK  L+ K +    +K +AD FICS++PG S    +
Sbjct: 265 ILRAGDTIHEFGWDNKHAGINVLVSKMVLMGKAEYFQSFKQNADEFICSLLPGISHPQVQ 324

Query: 345 YTPGGLLFKMNDSNMQYVTSTSFLLLTYAKYLTSARSVVNCGT-TITPKTLRSIAKKQVD 404
           Y+ GGLL K   SNMQ+VTS SFLLLTY+ YL+ A  VV CG  T +P  LR +AK+QVD
Sbjct: 325 YSQGGLLVKSGGSNMQHVTSLSFLLLTYSNYLSHANKVVPCGEFTASPALLRQVAKRQVD 384

Query: 405 YLLGDNPQKMSYMVGYGARYPKRIHHRGSSLPSIATHPGKIQCTAGFSVMNSAAPNPNLL 464
           Y+LGDNP KMSYMVGYG+R+P++IHHRGSS+PS+  HP +I C  G     S  PNPNLL
Sbjct: 385 YILGDNPMKMSYMVGYGSRFPQKIHHRGSSVPSVVDHPDRIGCKDGSRYFFSNNPNPNLL 444

Query: 465 IGAVVGGPDKNDRFPDQRSDYEQSEPATYMNAPLVGSLAYLA 506
           IGAVVGGP+  D FPD R  ++ +EP TY+NAPL+G L Y +
Sbjct: 445 IGAVVGGPNITDDFPDSRPYFQLTEPTTYINAPLLGLLGYFS 486

BLAST of CmaCh08G004410 vs. TAIR 10
Match: AT1G22880.1 (cellulase 5 )

HSP 1 Score: 596.7 bits (1537), Expect = 1.8e-170
Identity = 289/490 (58.98%), Positives = 362/490 (73.88%), Query Frame = 0

Query: 20  PLFFILLLCFFPCNNGFPTHRHPHSAHHNYRDALTKSILFFEGQRSGRLPPNQRITWRRN 79
           P FF+ LL      N +        A  NYR+AL+KS+LFF+GQRSGRLP +Q+++WR +
Sbjct: 4   PFFFVFLLSALSLENTY--------ASPNYREALSKSLLFFQGQRSGRLPSDQQLSWRSS 63

Query: 80  SGLSDGAAMKVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVLEFGGLMKGELQNAKQAIR 139
           SGLSDG++  VDL GGYYDAGDNVKF FPMAFTTTMLSWS LE+G  M  ELQN++ AIR
Sbjct: 64  SGLSDGSSAHVDLTGGYYDAGDNVKFNFPMAFTTTMLSWSSLEYGKKMGPELQNSRVAIR 123

Query: 140 WATDYLLK-ATAHPDTIYVQVGDANKDHACWERPEDMDTPRSVFKVDKNNPGTEVAAETA 199
           WATDYLLK A A P  +YV VGD N DH CWERPEDMDTPR+V+ V  +NPG++VAAETA
Sbjct: 124 WATDYLLKCARATPGKLYVGVGDPNGDHKCWERPEDMDTPRTVYSVSPSNPGSDVAAETA 183

Query: 200 AALAAASLVFRRSDPNYSNLLVRRAMRVFEFADKYRGSYSTGLKKYVCPFYCSFSGYQDE 259
           AALAA+S+VFR+ DP YS LL+  A +V +FA +YRG+YS  L   VCPFYCS+SGY+DE
Sbjct: 184 AALAASSMVFRKVDPKYSRLLLATAKKVMQFAIQYRGAYSNSLSSSVCPFYCSYSGYKDE 243

Query: 260 LLWGAAWLQRATKNPKYLNYIQVNGQTLGAGEFDNTFGWDNKHAGARILLSKAFLVQKMQ 319
           LLWGAAWL RAT +P Y N+I    ++LG G+  + F WDNK+AGA +LLS+  ++ K  
Sbjct: 244 LLWGAAWLHRATNDPYYTNFI----KSLGGGDQPDIFSWDNKYAGAYVLLSRRAVLNKDN 303

Query: 320 SLHDYKGHADNFICSIIPGASFSSTKYTPGGLLFKMNDSNMQYVTSTSFLLLTYAKYLTS 379
           +   YK  A+NF+C I+P +  SSTKYT GGL++K+  SN+QYVTS +FLL TYAKY+ S
Sbjct: 304 NFELYKQAAENFMCKILPNSPSSSTKYTKGGLMYKLPQSNLQYVTSITFLLTTYAKYMKS 363

Query: 380 ARSVVNCGTT-ITPKTLRSIAKKQVDYLLGDNPQKMSYMVGYGARYPKRIHHRGSSLPSI 439
            +   NCG + I P  L +++K+QVDY+LG NP KMSYMVG+ + +PKRIHHRGSSLPS 
Sbjct: 364 TKQTFNCGNSLIVPNALINLSKRQVDYVLGVNPMKMSYMVGFSSNFPKRIHHRGSSLPSR 423

Query: 440 ATHPGKIQCTAGFSVMNSAAPNPNLLIGAVVGGPDKNDRFPDQRSDYEQSEPATYMNAPL 499
           A     + C  GF    +  PNPN+L GA+VGGP++ND +PDQR DY +SEPATY+NA  
Sbjct: 424 AVRSNSLGCNGGFQSFRTQNPNPNILTGAIVGGPNQNDEYPDQRDDYTRSEPATYINAAF 481

Query: 500 VGSLAYLAHS 508
           VG LAY A S
Sbjct: 484 VGPLAYFAAS 481

The following BLAST results are available for this feature:
ExPASy Swiss-Prot:
Match Name   E-value   Identity (%)  Description
O81416       1.1e-236  76.95         Endoglucanase 17 OS=Arabidopsis thaliana OX=3702 GN=At4g02290 PE=2 SV=1
Q9SRX3       2.3e-223  74.80         Endoglucanase 1 OS=Arabidopsis thaliana OX=3702 GN=CEL2 PE=2 SV=1
Q8LQ92       3.5e-219  72.06         Endoglucanase 3 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU8 PE=2 SV=1
P05522       4.5e-182  62.17         Endoglucanase 1 OS=Persea americana OX=3435 GN=CEL1 PE=2 SV=1
Q652F9       1.3e-176  63.23         Endoglucanase 17 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU13 PE=2 SV=1

TAIR 10:
Match Name   E-value   Identity (%)  Description
AT4G02290.1  7.7e-238  76.95         glycosyl hydrolase 9B13
AT1G02800.1  1.7e-224  74.80         cellulase 2
AT1G70710.1  2.7e-174  62.31         glycosyl hydrolase 9B1
AT1G23210.1  7.3e-172  61.90         glycosyl hydrolase 9B6
AT1G22880.1  1.8e-170  58.98         cellulase 5
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term   IPR Description                                     Source       Source Term      Source Description        Alignment
IPR012341  Six-hairpin glycosidase-like superfamily            GENE3D       1.50.10.10       -                         coord: 45..507; e-value: 5.8E-172; score: 574.6
IPR001701  Glycoside hydrolase family 9                        PFAM         PF00759          Glyco_hydro_9             coord: 49..500; e-value: 1.8E-148; score: 495.6
None       No IPR available                                    PANTHER      PTHR22298        ENDO-1,4-BETA-GLUCANASE   coord: 13..511
None       No IPR available                                    PANTHER      PTHR22298:SF159  ENDOGLUCANASE             coord: 13..511
IPR033126  Glycosyl hydrolases family 9, Asp/Glu active sites  PROSITE      PS00698          GH9_3                     coord: 477..495
IPR018221  Glycoside hydrolase family 9, His active site       PROSITE      PS00592          GH9_2                     coord: 404..430
IPR008928  Six-hairpin glycosidase superfamily                 SUPERFAMILY  48208            Six-hairpin glycosidases  coord: 46..509

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh08G004410.1  CmaCh08G004410.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0030245      cellulose catabolic process
biological_process  GO:0005975      carbohydrate metabolic process
molecular_function  GO:0008810      cellulase activity
molecular_function  GO:0004553      hydrolase activity, hydrolyzing O-glycosyl compounds