CmaCh16G012330 (gene) Cucurbita maxima (Rimu) v1.1

Overview
NameCmaCh16G012330
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu) v1.1)
DescriptionEndoglucanase
LocationCma_Chr16: 9348957 .. 9354190 (+)
RNA-Seq ExpressionCmaCh16G012330
SyntenyCmaCh16G012330
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCTTATTCCATCACTTTGGCCCTCTACTTCATTTTGCCTTTCTTTACCTTATCTTCCTCTGCTTTCACTTCTCAACATTACTCTACTGCTCTTCAGTATTCCATTCTCTTCTTTGAGGGACAGCGATCCGGGAAGCTTCCCTCGAACCAACGTCTCACATGGAGAGGAAATTCTGCCTTATCAGATGGCTCCTCTTATCATGTGCGTTGTTCCCCTTTTGCTTATTTTGATGTTCATTACTTCTATATAAGTCTAACCTAGTTGATATTGATAAATAACTCAGGCTCTAACTTGATAAGACGTGAATATGAACTTTCCAGGTTGACCTTGTTGGTGGCTACTATGATGCTGGGGATAATGTCAAGTTTGGCTTGCCAATGGCCTTTACTACTACATTGTTGGCTTGGAGTGTCATTGAGTTTGGTGACTCGATGGGGAACGAGATTGAGAATGCAAGAGAAGCAGTCCGTTGGGGGTCGGATTATCTATTGAAGGCTGCCACCGACGCGCCTAATGGCTTATATGTTCAAGTGAGTAGAGTGACTATAAAGAAGAATAAGATGATTAAGATAATAATAATCACTATAATAAGGATATTTGTAACGTATGTAAACTTGTGGTGTTAAGGTGGGAGATCCAAACCTTGATCATAAATGTTGGGAAAGGCCAGAAGATATGGACACACCACGCACTGTGTATAAGATAACTGCTCAAAACCCAGGCTCCGATGTGGCATCAGAGACTGCAGCTGCATTGGCTGCAGCTTCGATTGTGTTCAATACATCCGATCCTTCATATTCCAACAAATTGCTTGACGCGGCCTTAAAAGTAAATCATACTCCCGATCCCTCCTATTCCATTTTTCTTTCACATTAGACATAGCCATCGGACCTCATTATGCTCCAAACAAGGTAACAAGTTCCCATCAATGCAATGCAGGTATTCGATTTTGCAGACAAGCATAGAGGTGCTTACAGTGATTCCCTCCATTCAGTGGTCTGTCCATTTTACTGCTCTTACTCAGGATACAATGTAAGTAAGAAGATACACATAAACCTACTTGTTTTCTTTCTAGGTGTTGTTTTTTCTGAGTTTATATGACTTCTATCAGGATGAGCTTCTATGGGGTGCCTCATGGATATACAAAGCCTCAAAAAACAGCATACATTTGAACTATATCCAGTCCAATGGCCATATACTGGGAGCTGATGACGACGACTACACGTTCAGCTGGGACGACAAGCGCCCGGGAACCAAAATCCTTCTCTCCCAGGTTTGTGTTTACTCCCTCTAAATAAAAATCAGTCCAAAATCTAATTCCTAAACTCAAGCTTGATGATCTTAACATCGCGTGGACCTTTGCCACAGGATTTCCTAGTGCAAAATTCAGAGGAGTTCCAAATCTATAAAGCACACTCAGATAACTACATATGCTCCCTCATTCCAGGAACTTCCAGTTCTAGTGGTCAATATACTCCTGGTTAGCAAAAACTAACTCTTCGCTTGTCATTACTTTATGTGTGTGCATATATGAGGTCTTAATCCTATTTTTTGTTTATGTCCTAAAACATTAAGCATTTTTTCATATTGATACTTGAATTTTTAAAATATTCATTTTGGTCCTTAAAACTTTTTTCTTTAACAAACGATCGTTCTAGTTCAAAACTTTCGAAATCTCTATTTTAGTACTTCAACTTCTAAAAAAGACAATTTTTGTCCAATACCATTAATTTGTGCTTGCTACCTAATATTTACCCACCCAATACTCACTTTCAACTTACCTATGTGGATTCATTTAAGTGCCAAAGTTCATACATTAATAAACGTATTAAAGATATACTACCCATTCATTTAAATCAAAATGGTAATTTTTAAAGTTTAGGAACAAATTAAAGACAAATATGAATGAAAGTTTAAAGTAAATAGGCCAAAATAGATATTTTGAAAGTAAAGTGACCAAAATGAACTTAAGTTGAAAAGAACATGAAACAAAATTAACTTAAATTGGAAGTATAGACGCTAAAATCACAGGGTAAAGGACTAAAATGTAGCTTAAACCTATACACATGAAACCAAAAGCAGAGCTAGTACACAAACAAGACAGAACAGATCCCCATTTTCAATGAAAACAAATCGTGATCATGAAATGATAAGTGCAGGAGGACTATTTTTCAAAGGAAGCGAGAGCAACCTGCAATATGTAACTTCAGCAGCCTTCCTCCTTGTAACGTACGCGAAATACCTAAGCTCCAACGGAGGAGCAATTCGATGTGGAACTTCAAGGATTTCACCAGAAGAACTAATCGCAGAGGCAAAGAAGCAAGTTGATTACATATTAGGAGAAAATCCAGAGAAAATGTCATACATGGTGGGATTTGGAGAACGATACCCTCAGCATATTCACCATAGAGGCTCCTCTGTGCCGTCTCTTCATTCGCGCCCTAATCAAGTTTCTTGCAATGAAGGCTTCCAGTTTCTGTACTCTTCTTCCCCCAATCCGAACGTGCTCGCTGGCGCCATTGTTGGTGGACCTGATAATGGCGATAAGTTCTCCGACGATCGCAATAACTATCAGCAGTCGGAGCCCGCCACTTACATAAATGCCCCATTCGTTGGCGCCTTGGCCTATTTTTCAAAAACGCCCTAATGGATAATCCACCCTTTGGGCGGAGGCTAAGAAGGAAGGAGAGAGAGGTCATTTTGGACATTTGTCTTACGAGTCTCGGTTTAGGGTTTAGGTCGGGGTCGGTCGATCCAAAGTTCAAATCTCCACTTCTTTTATTGAGCTAAAAATATATATATTAACGTTTTAAGGATCCGATCTTACTGAACCCAATAAAAATTTTCTAGTTTATTTAGTTTTTTCTCTTACCCTCTTGAATTTGAGTAGCTCGGGTATTGGGTTTTAACCCTATGATATATAAACGGAACAGTAAACCTAATCTCCAAATCTATTCTAACCATTAATAGAATTAGGGTCGACCGGTCCAAAGTTCAAATCTCCACTTCTTTTATTGAGCTAAAAATATATATATTAACGTTTTAAGGATCCGATCTTACTGAACCCAATAAAAATTTTCTAGTTTATTTAGTTTTTTCTCTTACCCTCTTGAATTTGAGTAGCTCGGGTATTGGGTTTTAACCCTATGATATATAAACGGAACAGTAAACCTAATCTCCAAATCTATTCTAACCATTAATAGAATTAGGGTCGACCGGTCCAAAGTTCAAATCTCCACTTCTTTTATTGAGCTAAATAACCATTAATAGAATTAGGGTCGACCGGTCCAAAGTTCAAATCTCCACTTCTTTTATTGAGCTAAAAATATATATATTAACGTTTTAAGGATCCGATCTTGCTGAACCCAATAAAAATTTTCTAGTTTATTTAGTTTTTTCTCTTACCCTCTTGAATTTGAGTAGCTCGGGTATTGGGTTTTAACCCTATGATATATAAACGGAACAGTAAACCTAATCTCCAAATCTATTCTAACCATTAATAGAATTAGGGTCGACCGGTCCAAAGTTCAAATCTCCACTTCTTTTATTGAGCTAAATATATATATATTAACGTTTTAAGGATCTGATCTTACCGAACCTAATAGAAATTTTCTAGTTTATTTAGTTTTTTTGTCGTTTCTCTTGCACTCTTGAATTTGAGTAGCTCGGGTATTGGGTTTTAACCCTAAGATATATAAAATTCAAATCTCCACTTGTTATGTACATATATTAAAAAGTTAATTTAAATATAGTTCCGTGGATAGTTCGTCTTCAACACCAAGATTGGTGGAAGTATAAATTTCCAAATAAAATTAATTTTTTGATTCCGATATAACCAGGGAGATTTGCAGAAGCAGATCTGAGAGATAAGAAGAAAACTCCGATCGTCTTCTTCGTCGGCTTCAACTCTCGTCGCCGTGGCCGCCATGGATTTCTCCTCCATCTTTCTCCTCCTTGTTTTCCTCTTACCTATATCTTCTTTTGCAGAGATTCGCTTCACCGACATCAGAAATGACAATCGACCAATCATTCCCTTCGACGTCTTCGGCTTCAGCCATGGCGGTCGTCTCGAGCTCAACGTCTCTCATGTCTCACTCTCCGATACCAATCCAGATCTGGACCTCTCCAAGGCTGGATTCTTCCTCTGTACTCGAGAATCGTGGCTTCATGTGATCCAGCAATTAGAGGAAGCGGAAATCTCTTGCGCTCTTCAATCCGACCTCGTCAAACCGGTCTACACCTTCAATTCACTCAAAGGTCAGGACCGATTCGATGTTCTTTATTCCGAGAGTGATGCCGATCAATACACTCTCGTTTTTGCTAACTGTCTTCAGCAACTTAAGGTTTCTATGGACGTTCGATCTGCTATGTACAATCTCGAAGGCAAAAGCGGTCGCCGGGATTATCTTTCTGCTGGGAAAACCATCCTCCCGAGGATTTACTTCATTTTCTCTATGATTTATTTCTTGCTCGCTATCGTATGGATTCATGTTCTTTACAAAAAGCGATTGACGGTTTATGGTATTCATTTCTTTATGCTTGCTGTTGTGATCTTGAAGGCTTTGAATCTTATCTGTGAGGCTGAGGATAAATCGTACATTAAGCGCACTGGGAGCGCCCATGGTTGGGATATTCTGTTCTACATATTTAGTTTCTTAAAGGGCATCACCTTGTTCACTTTGATCGTCTTGATTGGCACTGGTTGGTCTTTCTTGAAACCATATTTGCAGGACAAGGAGAAGAAGGTCTTGATGATTGTGATTCCATTGCAAGTGGTGGCTAATATTGCCCAGGTTGTGACTGATGAAACTGGGCCATTTGAGCAAGAATGGGTCACTTGGAAACAGGTGTTTTTGCTTGTTGATGTGATCTGTTGCTGTGCTGTTTTGTTCCCCATTGTTTGGTCAATCAAGAACTTGCGTGAGGCTGCACGAACAGATGGAAAAGCAGCTGTGAATTTGATGAAATTGACCCTTTTTAGACAGTATTACATTGTGGTTATATGCTATATCTACTTCACTCGGGTTGTCGTTTATGCACTTGAGACAATTACATCGTATCGGTATCTTTGGACGAGTGTGGTGGCCGGGGAACTGGCAACGTTTGCATTTTATGCATTCACTGGTTACAAGTTCAAGCCTGAGGCTCATAATCCTTATTTTGTAGTTGATGATGAGGAGGAGGAAGCAGCTGCTGAGGCACTGAAGCTTGAAGATGAGTTTGAATTGTGA

mRNA sequence

ATGCCTTATTCCATCACTTTGGCCCTCTACTTCATTTTGCCTTTCTTTACCTTATCTTCCTCTGCTTTCACTTCTCAACATTACTCTACTGCTCTTCAGTATTCCATTCTCTTCTTTGAGGGACAGCGATCCGGGAAGCTTCCCTCGAACCAACGTCTCACATGGAGAGGAAATTCTGCCTTATCAGATGGCTCCTCTTATCATGTTGACCTTGTTGGTGGCTACTATGATGCTGGGGATAATGTCAAGTTTGGCTTGCCAATGGCCTTTACTACTACATTGTTGGCTTGGAGTGTCATTGAGTTTGGTGACTCGATGGGGAACGAGATTGAGAATGCAAGAGAAGCAGTCCGTTGGGGGTCGGATTATCTATTGAAGGCTGCCACCGACGCGCCTAATGGCTTATATGTTCAAGTGGGAGATCCAAACCTTGATCATAAATGTTGGGAAAGGCCAGAAGATATGGACACACCACGCACTGTGTATAAGATAACTGCTCAAAACCCAGGCTCCGATGTGGCATCAGAGACTGCAGCTGCATTGGCTGCAGCTTCGATTGTGTTCAATACATCCGATCCTTCATATTCCAACAAATTGCTTGACGCGGCCTTAAAAGTATTCGATTTTGCAGACAAGCATAGAGGTGCTTACAGTGATTCCCTCCATTCAGTGGTCTGTCCATTTTACTGCTCTTACTCAGGATACAATGATGAGCTTCTATGGGGTGCCTCATGGATATACAAAGCCTCAAAAAACAGCATACATTTGAACTATATCCAGTCCAATGGCCATATACTGGGAGCTGATGACGACGACTACACGTTCAGCTGGGACGACAAGCGCCCGGGAACCAAAATCCTTCTCTCCCAGGATTTCCTAGTGCAAAATTCAGAGGAGTTCCAAATCTATAAAGCACACTCAGATAACTACATATGCTCCCTCATTCCAGGAACTTCCAGTTCTAGTGGTCAATATACTCCTGGAGGACTATTTTTCAAAGGAAGCGAGAGCAACCTGCAATATGTAACTTCAGCAGCCTTCCTCCTTGTAACGTACGCGAAATACCTAAGCTCCAACGGAGGAGCAATTCGATGTGGAACTTCAAGGATTTCACCAGAAGAACTAATCGCAGAGGCAAAGAAGCAAGTTGATTACATATTAGGAGAAAATCCAGAGAAAATGTCATACATGGTGGGATTTGGAGAACGATACCCTCAGCATATTCACCATAGAGGCTCCTCTGTGCCGTCTCTTCATTCGCGCCCTAATCAAGTTTCTTGCAATGAAGGCTTCCAGTTTCTGTACTCTTCTTCCCCCAATCCGAACGTGCTCGCTGGCGCCATTGTTGGTGGACCTGATAATGGCGATAAGTTCTCCGACGATCGCAATAACTATCAGCAGTCGGAGCCCGCCACTTACATAAATGCCCCATTCATAAGAAGAAAACTCCGATCGTCTTCTTCGTCGGCTTCAACTCTCGTCGCCGTGGCCGCCATGGATTTCTCCTCCATCTTTCTCCTCCTTGTTTTCCTCTTACCTATATCTTCTTTTGCAGAGATTCGCTTCACCGACATCAGAAATGACAATCGACCAATCATTCCCTTCGACGTCTTCGGCTTCAGCCATGGCGGTCGTCTCGAGCTCAACGTCTCTCATGTCTCACTCTCCGATACCAATCCAGATCTGGACCTCTCCAAGGCTGGATTCTTCCTCTGTACTCGAGAATCGTGGCTTCATGTGATCCAGCAATTAGAGGAAGCGGAAATCTCTTGCGCTCTTCAATCCGACCTCGTCAAACCGGTCTACACCTTCAATTCACTCAAAGGTCAGGACCGATTCGATGTTCTTTATTCCGAGAGTGATGCCGATCAATACACTCTCGTTTTTGCTAACTGTCTTCAGCAACTTAAGGTTTCTATGGACGTTCGATCTGCTATGTACAATCTCGAAGGCAAAAGCGGTCGCCGGGATTATCTTTCTGCTGGGAAAACCATCCTCCCGAGGATTTACTTCATTTTCTCTATGATTTATTTCTTGCTCGCTATCGTATGGATTCATGTTCTTTACAAAAAGCGATTGACGGTTTATGGTATTCATTTCTTTATGCTTGCTGTTGTGATCTTGAAGGCTTTGAATCTTATCTGTGAGGCTGAGGATAAATCGTACATTAAGCGCACTGGGAGCGCCCATGGTTGGGATATTCTGTTCTACATATTTAGTTTCTTAAAGGGCATCACCTTGTTCACTTTGATCGTCTTGATTGGCACTGGTTGGTCTTTCTTGAAACCATATTTGCAGGACAAGGAGAAGAAGGTCTTGATGATTGTGATTCCATTGCAAGTGGTGGCTAATATTGCCCAGGTTGTGACTGATGAAACTGGGCCATTTGAGCAAGAATGGGTCACTTGGAAACAGGTGTTTTTGCTTGTTGATGTGATCTGTTGCTGTGCTGTTTTGTTCCCCATTGTTTGGTCAATCAAGAACTTGCGTGAGGCTGCACGAACAGATGGAAAAGCAGCTGTGAATTTGATGAAATTGACCCTTTTTAGACAGTATTACATTGTGGTTATATGCTATATCTACTTCACTCGGGTTGTCGTTTATGCACTTGAGACAATTACATCGTATCGGTATCTTTGGACGAGTGTGGTGGCCGGGGAACTGGCAACGTTTGCATTTTATGCATTCACTGGTTACAAGTTCAAGCCTGAGGCTCATAATCCTTATTTTGTAGTTGATGATGAGGAGGAGGAAGCAGCTGCTGAGGCACTGAAGCTTGAAGATGAGTTTGAATTGTGA

Coding sequence (CDS)

ATGCCTTATTCCATCACTTTGGCCCTCTACTTCATTTTGCCTTTCTTTACCTTATCTTCCTCTGCTTTCACTTCTCAACATTACTCTACTGCTCTTCAGTATTCCATTCTCTTCTTTGAGGGACAGCGATCCGGGAAGCTTCCCTCGAACCAACGTCTCACATGGAGAGGAAATTCTGCCTTATCAGATGGCTCCTCTTATCATGTTGACCTTGTTGGTGGCTACTATGATGCTGGGGATAATGTCAAGTTTGGCTTGCCAATGGCCTTTACTACTACATTGTTGGCTTGGAGTGTCATTGAGTTTGGTGACTCGATGGGGAACGAGATTGAGAATGCAAGAGAAGCAGTCCGTTGGGGGTCGGATTATCTATTGAAGGCTGCCACCGACGCGCCTAATGGCTTATATGTTCAAGTGGGAGATCCAAACCTTGATCATAAATGTTGGGAAAGGCCAGAAGATATGGACACACCACGCACTGTGTATAAGATAACTGCTCAAAACCCAGGCTCCGATGTGGCATCAGAGACTGCAGCTGCATTGGCTGCAGCTTCGATTGTGTTCAATACATCCGATCCTTCATATTCCAACAAATTGCTTGACGCGGCCTTAAAAGTATTCGATTTTGCAGACAAGCATAGAGGTGCTTACAGTGATTCCCTCCATTCAGTGGTCTGTCCATTTTACTGCTCTTACTCAGGATACAATGATGAGCTTCTATGGGGTGCCTCATGGATATACAAAGCCTCAAAAAACAGCATACATTTGAACTATATCCAGTCCAATGGCCATATACTGGGAGCTGATGACGACGACTACACGTTCAGCTGGGACGACAAGCGCCCGGGAACCAAAATCCTTCTCTCCCAGGATTTCCTAGTGCAAAATTCAGAGGAGTTCCAAATCTATAAAGCACACTCAGATAACTACATATGCTCCCTCATTCCAGGAACTTCCAGTTCTAGTGGTCAATATACTCCTGGAGGACTATTTTTCAAAGGAAGCGAGAGCAACCTGCAATATGTAACTTCAGCAGCCTTCCTCCTTGTAACGTACGCGAAATACCTAAGCTCCAACGGAGGAGCAATTCGATGTGGAACTTCAAGGATTTCACCAGAAGAACTAATCGCAGAGGCAAAGAAGCAAGTTGATTACATATTAGGAGAAAATCCAGAGAAAATGTCATACATGGTGGGATTTGGAGAACGATACCCTCAGCATATTCACCATAGAGGCTCCTCTGTGCCGTCTCTTCATTCGCGCCCTAATCAAGTTTCTTGCAATGAAGGCTTCCAGTTTCTGTACTCTTCTTCCCCCAATCCGAACGTGCTCGCTGGCGCCATTGTTGGTGGACCTGATAATGGCGATAAGTTCTCCGACGATCGCAATAACTATCAGCAGTCGGAGCCCGCCACTTACATAAATGCCCCATTCATAAGAAGAAAACTCCGATCGTCTTCTTCGTCGGCTTCAACTCTCGTCGCCGTGGCCGCCATGGATTTCTCCTCCATCTTTCTCCTCCTTGTTTTCCTCTTACCTATATCTTCTTTTGCAGAGATTCGCTTCACCGACATCAGAAATGACAATCGACCAATCATTCCCTTCGACGTCTTCGGCTTCAGCCATGGCGGTCGTCTCGAGCTCAACGTCTCTCATGTCTCACTCTCCGATACCAATCCAGATCTGGACCTCTCCAAGGCTGGATTCTTCCTCTGTACTCGAGAATCGTGGCTTCATGTGATCCAGCAATTAGAGGAAGCGGAAATCTCTTGCGCTCTTCAATCCGACCTCGTCAAACCGGTCTACACCTTCAATTCACTCAAAGGTCAGGACCGATTCGATGTTCTTTATTCCGAGAGTGATGCCGATCAATACACTCTCGTTTTTGCTAACTGTCTTCAGCAACTTAAGGTTTCTATGGACGTTCGATCTGCTATGTACAATCTCGAAGGCAAAAGCGGTCGCCGGGATTATCTTTCTGCTGGGAAAACCATCCTCCCGAGGATTTACTTCATTTTCTCTATGATTTATTTCTTGCTCGCTATCGTATGGATTCATGTTCTTTACAAAAAGCGATTGACGGTTTATGGTATTCATTTCTTTATGCTTGCTGTTGTGATCTTGAAGGCTTTGAATCTTATCTGTGAGGCTGAGGATAAATCGTACATTAAGCGCACTGGGAGCGCCCATGGTTGGGATATTCTGTTCTACATATTTAGTTTCTTAAAGGGCATCACCTTGTTCACTTTGATCGTCTTGATTGGCACTGGTTGGTCTTTCTTGAAACCATATTTGCAGGACAAGGAGAAGAAGGTCTTGATGATTGTGATTCCATTGCAAGTGGTGGCTAATATTGCCCAGGTTGTGACTGATGAAACTGGGCCATTTGAGCAAGAATGGGTCACTTGGAAACAGGTGTTTTTGCTTGTTGATGTGATCTGTTGCTGTGCTGTTTTGTTCCCCATTGTTTGGTCAATCAAGAACTTGCGTGAGGCTGCACGAACAGATGGAAAAGCAGCTGTGAATTTGATGAAATTGACCCTTTTTAGACAGTATTACATTGTGGTTATATGCTATATCTACTTCACTCGGGTTGTCGTTTATGCACTTGAGACAATTACATCGTATCGGTATCTTTGGACGAGTGTGGTGGCCGGGGAACTGGCAACGTTTGCATTTTATGCATTCACTGGTTACAAGTTCAAGCCTGAGGCTCATAATCCTTATTTTGTAGTTGATGATGAGGAGGAGGAAGCAGCTGCTGAGGCACTGAAGCTTGAAGATGAGTTTGAATTGTGA

Protein sequence

MPYSITLALYFILPFFTLSSSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRGNSALSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENAREAVRWGSDYLLKAATDAPNGLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVASETAAALAAASIVFNTSDPSYSNKLLDAALKVFDFADKHRGAYSDSLHSVVCPFYCSYSGYNDELLWGASWIYKASKNSIHLNYIQSNGHILGADDDDYTFSWDDKRPGTKILLSQDFLVQNSEEFQIYKAHSDNYICSLIPGTSSSSGQYTPGGLFFKGSESNLQYVTSAAFLLVTYAKYLSSNGGAIRCGTSRISPEELIAEAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSLHSRPNQVSCNEGFQFLYSSSPNPNVLAGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPFIRRKLRSSSSSASTLVAVAAMDFSSIFLLLVFLLPISSFAEIRFTDIRNDNRPIIPFDVFGFSHGGRLELNVSHVSLSDTNPDLDLSKAGFFLCTRESWLHVIQQLEEAEISCALQSDLVKPVYTFNSLKGQDRFDVLYSESDADQYTLVFANCLQQLKVSMDVRSAMYNLEGKSGRRDYLSAGKTILPRIYFIFSMIYFLLAIVWIHVLYKKRLTVYGIHFFMLAVVILKALNLICEAEDKSYIKRTGSAHGWDILFYIFSFLKGITLFTLIVLIGTGWSFLKPYLQDKEKKVLMIVIPLQVVANIAQVVTDETGPFEQEWVTWKQVFLLVDVICCCAVLFPIVWSIKNLREAARTDGKAAVNLMKLTLFRQYYIVVICYIYFTRVVVYALETITSYRYLWTSVVAGELATFAFYAFTGYKFKPEAHNPYFVVDDEEEEAAAEALKLEDEFEL
Homology
BLAST of CmaCh16G012330 vs. ExPASy Swiss-Prot
Match: P05522 (Endoglucanase 1 OS=Persea americana OX=3435 GN=CEL1 PE=2 SV=1)

HSP 1 Score: 731.5 bits (1887), Expect = 1.2e-209
Identity = 347/481 (72.14%), Postives = 404/481 (83.99%), Query Frame = 0

Query: 1   MPYSITLALYFILPFFTLSSSAFTSQ--HYSTALQYSILFFEGQRSGKLPSNQRLTWRGN 60
           M  S  L+L+ +L   T+     ++   HYS AL+ SILFFEGQRSGKLP+NQRLTWRG+
Sbjct: 1   MDCSSPLSLFHLLLVCTVMVKCCSASDLHYSDALEKSILFFEGQRSGKLPTNQRLTWRGD 60

Query: 61  SALSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENAREAVR 120
           S LSDGSSYHVDLVGGYYDAGDN+KFGLPMAFTTT+LAW +IEFG  M  ++ENAR A+R
Sbjct: 61  SGLSDGSSYHVDLVGGYYDAGDNLKFGLPMAFTTTMLAWGIIEFGCLMPEQVENARAALR 120

Query: 121 WGSDYLLKAATDAPNGLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVASETA 180
           W +DYLLKA+T   N LYVQVG+PN DH+CWERPEDMDTPR VYK++ QNPGSDVA+ETA
Sbjct: 121 WSTDYLLKASTATSNSLYVQVGEPNADHRCWERPEDMDTPRNVYKVSTQNPGSDVAAETA 180

Query: 181 AALAAASIVFNTSDPSYSNKLLDAALKVFDFADKHRGAYSDSLHSVVCPFYCSYSGYNDE 240
           AALAAASIVF  SD SYS KLL  A+KVF+FAD++RG+YSDSL SVVCPFYCSYSGYNDE
Sbjct: 181 AALAAASIVFGDSDSSYSTKLLHTAVKVFEFADQYRGSYSDSLGSVVCPFYCSYSGYNDE 240

Query: 241 LLWGASWIYKASKNSIHLNYIQSNGHILGADDDDYTFSWDDKRPGTKILLSQDFLVQNSE 300
           LLWGASW+++AS+N+ ++ YIQSNGH LGADDDDY+FSWDDKR GTK+LLS+ FL    E
Sbjct: 241 LLWGASWLHRASQNASYMTYIQSNGHTLGADDDDYSFSWDDKRVGTKVLLSKGFLQDRIE 300

Query: 301 EFQIYKAHSDNYICSLIPGTSSSSGQYTPGGLFFKGSESNLQYVTSAAFLLVTYAKYLSS 360
           E Q+YK H+DNYICSLIPGTSS   QYTPGGL +KGS SNLQYVTS AFLL+TYA YL+S
Sbjct: 301 ELQLYKVHTDNYICSLIPGTSSFQAQYTPGGLLYKGSASNLQYVTSTAFLLLTYANYLNS 360

Query: 361 NGGAIRCGTSRISPEELIAEAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSL 420
           +GG   CGT+ ++ + LI+ AKKQVDYILG+NP KMSYMVGFGERYPQH+HHRGSS+PS+
Sbjct: 361 SGGHASCGTTTVTAKNLISLAKKQVDYILGQNPAKMSYMVGFGERYPQHVHHRGSSLPSV 420

Query: 421 HSRPNQVSCNEGFQFLYSSSPNPNVLAGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPF 480
              PN + CN GFQ+LYSS PNPN+L GAI+GGPDN D FSDDRNNYQQSEPATYINAP 
Sbjct: 421 QVHPNSIPCNAGFQYLYSSPPNPNILVGAILGGPDNRDSFSDDRNNYQQSEPATYINAPL 480

BLAST of CmaCh16G012330 vs. ExPASy Swiss-Prot
Match: Q6YXT7 (Endoglucanase 19 OS=Oryza sativa subsp. japonica OX=39947 GN=Os08g0114200 PE=2 SV=1)

HSP 1 Score: 645.6 bits (1664), Expect = 8.7e-184
Identity = 303/481 (62.99%), Postives = 379/481 (78.79%), Query Frame = 0

Query: 5   ITLALYFILPFFTLSSSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRGNSALSDG 64
           I L L  +L    L  S+  + +Y+ AL  SI+FFEGQRSGKLP   R+ WR +S L+DG
Sbjct: 32  IRLRLLVVLHLLLLVPSSAMAFNYADALAKSIIFFEGQRSGKLPPGNRMPWRADSGLTDG 91

Query: 65  SSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENAREAVRWGSDYL 124
           + Y+VDLVGGYYDAGDNVKFGLPMAF+TT+LAWSV++FG  MG E+ NAR AVRWG+DYL
Sbjct: 92  AQYNVDLVGGYYDAGDNVKFGLPMAFSTTMLAWSVLDFGKFMGAELPNARAAVRWGADYL 151

Query: 125 LKAATDAPNGLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVASETAAALAAA 184
           LKAAT  P  LYVQV DPN DH+CWERPEDMDTPR+VY++TA  PGSDVA ETAAALAA+
Sbjct: 152 LKAATATPGALYVQVADPNQDHRCWERPEDMDTPRSVYRVTADKPGSDVAGETAAALAAS 211

Query: 185 SIVFNTSDPSYSNKLLDAALKVFDFADKHRGAYSDSLHSVVCPFYCSYSGYNDELLWGAS 244
           S+VF  +DP+YS +LL AA +VFDFAD+HRG+YSDSL S VCPFYCSYSGY+DELLWGAS
Sbjct: 212 SMVFRRADPAYSARLLHAATQVFDFADRHRGSYSDSLASSVCPFYCSYSGYHDELLWGAS 271

Query: 245 WIYKASKNSIHLNYIQSNGHILGADDDDYTFSWDDKRPGTKILLSQDFLVQNSEEFQIYK 304
           W+++AS+N+  ++Y+++NG  LGA DDDY+FSWDDKR GTK+LL++ FL       ++YK
Sbjct: 272 WLHRASRNASFMSYVEANGMQLGAGDDDYSFSWDDKRVGTKVLLAKGFLRNRLHGLELYK 331

Query: 305 AHSDNYICSLIPGTSSSSGQYTPGGLFFKGSESNLQYVTSAAFLLVTYAKYLSSNGGAIR 364
           AHSD+YICSL+PGT+S   +YTPGGL ++   SN+QYVT+A FL++ YAKYL S+G    
Sbjct: 332 AHSDSYICSLVPGTASFQSRYTPGGLLYREGSSNMQYVTTATFLMLAYAKYLRSSGATAS 391

Query: 365 CG------TSRISPEELIAEAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSL 424
           CG         +S  EL+A AK+QVDYILG+NP  MSYMVGFG RYP+  HHRG+S+PS+
Sbjct: 392 CGDGGGGARGEVSAAELVAVAKRQVDYILGKNPAGMSYMVGFGCRYPRRAHHRGASMPSV 451

Query: 425 HSRPNQVSCNEGFQFLYSSSPNPNVLAGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPF 480
            + P ++SC+ GF +L+S  PNPNVL GA+VGGPD+ D F+DDR N+ QSEPATYINAP 
Sbjct: 452 RAHPGRISCDAGFGYLHSGEPNPNVLVGAVVGGPDSRDAFADDRGNFAQSEPATYINAPL 511

BLAST of CmaCh16G012330 vs. ExPASy Swiss-Prot
Match: Q6Z715 (Endoglucanase 4 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU14 PE=2 SV=1)

HSP 1 Score: 624.4 bits (1609), Expect = 2.1e-177
Identity = 306/489 (62.58%), Postives = 372/489 (76.07%), Query Frame = 0

Query: 9   LYFILPFFTLSSSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRGNSALSDGSSYH 68
           L  ++    L+     + +Y+ AL  +ILFFE QRSGKLP  QR+ WR +S LSDGS+  
Sbjct: 6   LLLVVAAVCLAGREAAAFNYADALDKAILFFEAQRSGKLPPGQRVAWRADSGLSDGSADG 65

Query: 69  VDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMG----------------NEIEN 128
           VDL GGYYDAGDNVKFGLPMAFT T+L+WSVIEFGD M                  +++N
Sbjct: 66  VDLAGGYYDAGDNVKFGLPMAFTVTMLSWSVIEFGDMMPARRSSFLGGIFGGGGVAQLDN 125

Query: 129 AREAVRWGSDYLLKAATDAPNGLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSD 188
           AR AVRWG+DYLLKAAT  P+ LYVQV DP  DH+CWERPEDMDTPR+VYK+T Q+PGSD
Sbjct: 126 ARAAVRWGADYLLKAATATPDTLYVQVADPYQDHRCWERPEDMDTPRSVYKVTPQSPGSD 185

Query: 189 VASETAAALAAASIVFNTSDPSYSNKLLDAALKVFDFADKHRGAYSDSLHSVVCPFYCSY 248
           VA ETAAALAAASIVF  SDPSYS KLLDAA  VFDFADK+RG+YSDSL SVVCPFYCS+
Sbjct: 186 VAGETAAALAAASIVFRVSDPSYSAKLLDAAQLVFDFADKYRGSYSDSLSSVVCPFYCSH 245

Query: 249 SGYNDELLWGASWIYKAS--KNSIHLNYIQSNGHILGADDDDYTFSWDDKRPGTKILLSQ 308
           S Y+DELLW ASW++ AS  K  ++L+YI SNGH LGA+ DD+TFSWDDKR  TK     
Sbjct: 246 S-YHDELLWAASWLHLASPEKKDVYLSYIGSNGHALGAEQDDFTFSWDDKRVATK----- 305

Query: 309 DFLVQNSEEFQIYKAHSDNYICSLIPGTSSSSGQYTPGGLFFKGSESNLQYVTSAAFLLV 368
            FL   ++  Q+YKAH+DNYICSL+PG +    QYTPGGL FK  +SN+QYVTS AFLL+
Sbjct: 306 GFLQSRADGLQLYKAHTDNYICSLVPGANGFQSQYTPGGLLFKEGDSNMQYVTSTAFLLL 365

Query: 369 TYAKYLSSNGGAIRCGTSRISPEELIAEAKKQVDYILGENPEKMSYMVGFGERYPQHIHH 428
           TYAKYLSS+   + CG++ +SP  LI+ AKKQVDYILG NP  MSYMVGFG RYP+H+HH
Sbjct: 366 TYAKYLSSSAATVSCGSTAVSPSTLISLAKKQVDYILGANPAGMSYMVGFGARYPRHVHH 425

Query: 429 RGSSVPSLHSRPNQVSCNEGFQFLYSSSPNPNVLAGAIVGGPDNGDKFSDDRNNYQQSEP 480
           RG+S+PS+   P ++ C+EGF++L+S  P+ N+LAGA+VGGPD GD F+D R+NY Q+EP
Sbjct: 426 RGASMPSVRDHPARIGCDEGFRYLHSPEPDRNLLAGAVVGGPDAGDAFADGRDNYAQAEP 485

BLAST of CmaCh16G012330 vs. ExPASy Swiss-Prot
Match: Q9SRX3 (Endoglucanase 1 OS=Arabidopsis thaliana OX=3702 GN=CEL2 PE=2 SV=1)

HSP 1 Score: 610.1 bits (1572), Expect = 4.1e-173
Identity = 298/492 (60.57%), Postives = 368/492 (74.80%), Query Frame = 0

Query: 3   YSITLALYFILPFFTLSSSAFTS---------------QHYSTALQYSILFFEGQRSGKL 62
           Y  +  L   L F  L S+ F+S                +Y  AL  SILFFEGQRSGKL
Sbjct: 4   YLSSSRLITFLSFILLLSNGFSSSSSRPSIHHRHHLDNHNYKDALSKSILFFEGQRSGKL 63

Query: 63  PSNQRLTWRGNSALSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMG 122
           P NQR+TWR NS LSDGS+ +VDLVGGYYDAGDN+KFG PMAFTTT+L+WS+IEFG  M 
Sbjct: 64  PPNQRMTWRSNSGLSDGSALNVDLVGGYYDAGDNMKFGFPMAFTTTMLSWSLIEFGGLMK 123

Query: 123 NEIENAREAVRWGSDYLLKAATDAPNGLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQ 182
           +E+ NA++A+RW +D+LLK AT  P+ +YVQVGDPN+DH CWERPEDMDTPR+V+K+   
Sbjct: 124 SELPNAKDAIRWATDFLLK-ATSHPDTIYVQVGDPNMDHACWERPEDMDTPRSVFKVDKN 183

Query: 183 NPGSDVASETAAALAAASIVFNTSDPSYSNKLLDAALKVFDFADKHRGAYSDSLHSVVCP 242
           NPGSD+A E AAALAAASIVF   DPSYSN LL  A+ VF FADK+RG YS  L   VCP
Sbjct: 184 NPGSDIAGEIAAALAAASIVFRKCDPSYSNHLLQRAITVFTFADKYRGPYSAGLAPEVCP 243

Query: 243 FYCSYSGYNDELLWGASWIYKASKNSIHLNYIQSNGHILGADDDDYTFSWDDKRPGTKIL 302
           FYCSYSGY DELLWGA+W+ KA+ N  +LNYI++NG ILGAD+ D  FSWD+K  G +IL
Sbjct: 244 FYCSYSGYQDELLWGAAWLQKATNNPTYLNYIKANGQILGADEFDNMFSWDNKHVGARIL 303

Query: 303 LSQDFLVQNSEEFQIYKAHSDNYICSLIPGTSSSSGQYTPGGLFFKGSESNLQYVTSAAF 362
           LS++FL+Q  +  + YK H+D++ICS++PG SSS  QYTPGGL FK  ESN+QYVTS +F
Sbjct: 304 LSKEFLIQKVKSLEEYKEHADSFICSVLPGASSS--QYTPGGLLFKMGESNMQYVTSTSF 363

Query: 363 LLVTYAKYLSSNGGAIRCGTSRISPEELIAEAKKQVDYILGENPEKMSYMVGFGERYPQH 422
           LL+TYAKYL+S      CG S ++P  L + AKKQVDY+LG NP KMSYMVG+G +YP+ 
Sbjct: 364 LLLTYAKYLTSARTVAYCGGSVVTPARLRSIAKKQVDYLLGGNPLKMSYMVGYGLKYPRR 423

Query: 423 IHHRGSSVPSLHSRPNQVSCNEGFQFLYSSSPNPNVLAGAIVGGPDNGDKFSDDRNNYQQ 480
           IHHRGSS+PS+   P ++ C++GF    S SPNPN L GA+VGGPD  D+F D+R++Y +
Sbjct: 424 IHHRGSSLPSVAVHPTRIQCHDGFSLFTSQSPNPNDLVGAVVGGPDQNDQFPDERSDYGR 483

BLAST of CmaCh16G012330 vs. ExPASy Swiss-Prot
Match: O81416 (Endoglucanase 17 OS=Arabidopsis thaliana OX=3702 GN=At4g02290 PE=2 SV=1)

HSP 1 Score: 604.7 bits (1558), Expect = 1.7e-171
Identity = 295/488 (60.45%), Postives = 370/488 (75.82%), Query Frame = 0

Query: 4   SITLALYFILP---FFTLSSSAFTSQH---------YSTALQYSILFFEGQRSGKLPSNQ 63
           +I L+ +F L     +  +SS F + H         Y  AL  SILFFEGQRSGKLPSNQ
Sbjct: 17  TIFLSFFFFLCNGFSYPTTSSLFNTHHHRHHLAKHNYKDALTKSILFFEGQRSGKLPSNQ 76

Query: 64  RLTWRGNSALSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIE 123
           R++WR +S LSDGS+ HVDLVGGYYDAGDN+KFG PMAFTTT+L+WSVIEFG  M +E++
Sbjct: 77  RMSWRRDSGLSDGSALHVDLVGGYYDAGDNIKFGFPMAFTTTMLSWSVIEFGGLMKSELQ 136

Query: 124 NAREAVRWGSDYLLKAATDAPNGLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGS 183
           NA+ A+RW +DYLLK AT  P+ +YVQVGD N DH CWERPEDMDT R+V+K+    PGS
Sbjct: 137 NAKIAIRWATDYLLK-ATSQPDTIYVQVGDANKDHSCWERPEDMDTVRSVFKVDKNIPGS 196

Query: 184 DVASETAAALAAASIVFNTSDPSYSNKLLDAALKVFDFADKHRGAYSDSLHSVVCPFYCS 243
           DVA+ETAAALAAA+IVF  SDPSYS  LL  A+ VF FADK+RG YS  L   VCPFYCS
Sbjct: 197 DVAAETAAALAAAAIVFRKSDPSYSKVLLKRAISVFAFADKYRGTYSAGLKPDVCPFYCS 256

Query: 244 YSGYNDELLWGASWIYKASKNSIHLNYIQSNGHILGADDDDYTFSWDDKRPGTKILLSQD 303
           YSGY DELLWGA+W+ KA+KN  +LNYI+ NG ILGA + D TF WD+K  G +ILL++ 
Sbjct: 257 YSGYQDELLWGAAWLQKATKNIKYLNYIKINGQILGAAEYDNTFGWDNKHAGARILLTKA 316

Query: 304 FLVQNSEEFQIYKAHSDNYICSLIPGTSSSSGQYTPGGLFFKGSESNLQYVTSAAFLLVT 363
           FLVQN +    YK H+DN+ICS+IPG   SS QYTPGGL FK +++N+QYVTS +FLL+T
Sbjct: 317 FLVQNVKTLHEYKGHADNFICSVIPGAPFSSTQYTPGGLLFKMADANMQYVTSTSFLLLT 376

Query: 364 YAKYLSSNGGAIRCGTSRISPEELIAEAKKQVDYILGENPEKMSYMVGFGERYPQHIHHR 423
           YAKYL+S    + CG S  +P  L + AK+QVDY+LG+NP +MSYMVG+G ++P+ IHHR
Sbjct: 377 YAKYLTSAKTVVHCGGSVYTPGRLRSIAKRQVDYLLGDNPLRMSYMVGYGPKFPRRIHHR 436

Query: 424 GSSVPSLHSRPNQVSCNEGFQFLYSSSPNPNVLAGAIVGGPDNGDKFSDDRNNYQQSEPA 480
           GSS+P + S P ++ C++GF  + S SPNPN L GA+VGGPD  D+F D+R++Y+QSEPA
Sbjct: 437 GSSLPCVASHPAKIQCHQGFAIMNSQSPNPNFLVGAVVGGPDQHDRFPDERSDYEQSEPA 496

BLAST of CmaCh16G012330 vs. TAIR 10
Match: AT5G42090.1 (Lung seven transmembrane receptor family protein )

HSP 1 Score: 708.4 bits (1827), Expect = 7.8e-204
Identity = 351/434 (80.88%), Postives = 400/434 (92.17%), Query Frame = 0

Query: 501 FSSIFLLLVFLLPISSFAEIRFTDIRNDNRPIIPFDVFGFSHGGRLELNVSHVSLSDTNP 560
           FSSI +LL+  + I+S AEIR ++IR+D+RPIIP D FGF+H GRLEL+ S + LS++NP
Sbjct: 7   FSSILILLLISISIAS-AEIRKSEIRSDDRPIIPLDEFGFTHSGRLELDASKIWLSNSNP 66

Query: 561 DLDLSKAGFFLCTRESWLHVIQQLEEAEISCALQSDLVKPVYTFNSLKGQD--RFDVLYS 620
           DLDLSK GFFLCTR++W+HVIQQLEE EI+CALQSDLVK V+TFN+LKG D  RF  +++
Sbjct: 67  DLDLSKVGFFLCTRDAWVHVIQQLEEEEITCALQSDLVKHVFTFNNLKGGDKSRFSTVFT 126

Query: 621 ESDADQYTLVFANCLQQLKVSMDVRSAMYNLEGKSGRRDYLSAGKTILPRIYFIFSMIYF 680
           E+DADQY+LVFANCLQQ+K+SMDVRSAMYNLEGK G RDYLSAG+T+LP++YF+FS+IYF
Sbjct: 127 ENDADQYSLVFANCLQQVKISMDVRSAMYNLEGKKGGRDYLSAGRTVLPKVYFLFSVIYF 186

Query: 681 LLAIVWIHVLYKKRLTVYGIHFFMLAVVILKALNLICEAEDKSYIKRTGSAHGWDILFYI 740
            LA  WI+VLYKKRLTV+ IHFFML VV+LKALNL+CEAEDKSYIK+TG+AHGWD+LFYI
Sbjct: 187 SLAATWIYVLYKKRLTVFAIHFFMLGVVVLKALNLLCEAEDKSYIKKTGTAHGWDVLFYI 246

Query: 741 FSFLKGITLFTLIVLIGTGWSFLKPYLQDKEKKVLMIVIPLQVVANIAQVVTDETGPFEQ 800
           F+FLKGITLFTLIVLIGTGWSFLKPYLQDKEKKVLMIVIPLQVVAN AQVV DETGP+ Q
Sbjct: 247 FNFLKGITLFTLIVLIGTGWSFLKPYLQDKEKKVLMIVIPLQVVANFAQVVIDETGPYGQ 306

Query: 801 EWVTWKQVFLLVDVICCCAVLFPIVWSIKNLREAARTDGKAAVNLMKLTLFRQYYIVVIC 860
           +WVTWKQ+FLLVDV+CCCAVLFPIVWSIKNLREAA+TDGKAAVNL+KLTLFRQYYIVVIC
Sbjct: 307 DWVTWKQIFLLVDVVCCCAVLFPIVWSIKNLREAAKTDGKAAVNLVKLTLFRQYYIVVIC 366

Query: 861 YIYFTRVVVYALETITSYRYLWTSVVAGELATFAFYAFTGYKFKPEAHNPYFVVDDEEEE 920
           YIYFTRVVVYALETITSY+Y+WTSVVA ELAT AFY FTGYKF+PE HNPYFVVDDEEEE
Sbjct: 367 YIYFTRVVVYALETITSYKYMWTSVVASELATLAFYLFTGYKFRPEVHNPYFVVDDEEEE 426

Query: 921 AAAEALKLEDEFEL 933
           AAAEALKLEDEFEL
Sbjct: 427 AAAEALKLEDEFEL 439

BLAST of CmaCh16G012330 vs. TAIR 10
Match: AT1G02800.1 (cellulase 2 )

HSP 1 Score: 610.1 bits (1572), Expect = 2.9e-174
Identity = 298/492 (60.57%), Postives = 368/492 (74.80%), Query Frame = 0

Query: 3   YSITLALYFILPFFTLSSSAFTS---------------QHYSTALQYSILFFEGQRSGKL 62
           Y  +  L   L F  L S+ F+S                +Y  AL  SILFFEGQRSGKL
Sbjct: 4   YLSSSRLITFLSFILLLSNGFSSSSSRPSIHHRHHLDNHNYKDALSKSILFFEGQRSGKL 63

Query: 63  PSNQRLTWRGNSALSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMG 122
           P NQR+TWR NS LSDGS+ +VDLVGGYYDAGDN+KFG PMAFTTT+L+WS+IEFG  M 
Sbjct: 64  PPNQRMTWRSNSGLSDGSALNVDLVGGYYDAGDNMKFGFPMAFTTTMLSWSLIEFGGLMK 123

Query: 123 NEIENAREAVRWGSDYLLKAATDAPNGLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQ 182
           +E+ NA++A+RW +D+LLK AT  P+ +YVQVGDPN+DH CWERPEDMDTPR+V+K+   
Sbjct: 124 SELPNAKDAIRWATDFLLK-ATSHPDTIYVQVGDPNMDHACWERPEDMDTPRSVFKVDKN 183

Query: 183 NPGSDVASETAAALAAASIVFNTSDPSYSNKLLDAALKVFDFADKHRGAYSDSLHSVVCP 242
           NPGSD+A E AAALAAASIVF   DPSYSN LL  A+ VF FADK+RG YS  L   VCP
Sbjct: 184 NPGSDIAGEIAAALAAASIVFRKCDPSYSNHLLQRAITVFTFADKYRGPYSAGLAPEVCP 243

Query: 243 FYCSYSGYNDELLWGASWIYKASKNSIHLNYIQSNGHILGADDDDYTFSWDDKRPGTKIL 302
           FYCSYSGY DELLWGA+W+ KA+ N  +LNYI++NG ILGAD+ D  FSWD+K  G +IL
Sbjct: 244 FYCSYSGYQDELLWGAAWLQKATNNPTYLNYIKANGQILGADEFDNMFSWDNKHVGARIL 303

Query: 303 LSQDFLVQNSEEFQIYKAHSDNYICSLIPGTSSSSGQYTPGGLFFKGSESNLQYVTSAAF 362
           LS++FL+Q  +  + YK H+D++ICS++PG SSS  QYTPGGL FK  ESN+QYVTS +F
Sbjct: 304 LSKEFLIQKVKSLEEYKEHADSFICSVLPGASSS--QYTPGGLLFKMGESNMQYVTSTSF 363

Query: 363 LLVTYAKYLSSNGGAIRCGTSRISPEELIAEAKKQVDYILGENPEKMSYMVGFGERYPQH 422
           LL+TYAKYL+S      CG S ++P  L + AKKQVDY+LG NP KMSYMVG+G +YP+ 
Sbjct: 364 LLLTYAKYLTSARTVAYCGGSVVTPARLRSIAKKQVDYLLGGNPLKMSYMVGYGLKYPRR 423

Query: 423 IHHRGSSVPSLHSRPNQVSCNEGFQFLYSSSPNPNVLAGAIVGGPDNGDKFSDDRNNYQQ 480
           IHHRGSS+PS+   P ++ C++GF    S SPNPN L GA+VGGPD  D+F D+R++Y +
Sbjct: 424 IHHRGSSLPSVAVHPTRIQCHDGFSLFTSQSPNPNDLVGAVVGGPDQNDQFPDERSDYGR 483

BLAST of CmaCh16G012330 vs. TAIR 10
Match: AT4G02290.1 (glycosyl hydrolase 9B13 )

HSP 1 Score: 604.7 bits (1558), Expect = 1.2e-172
Identity = 295/488 (60.45%), Postives = 370/488 (75.82%), Query Frame = 0

Query: 4   SITLALYFILP---FFTLSSSAFTSQH---------YSTALQYSILFFEGQRSGKLPSNQ 63
           +I L+ +F L     +  +SS F + H         Y  AL  SILFFEGQRSGKLPSNQ
Sbjct: 17  TIFLSFFFFLCNGFSYPTTSSLFNTHHHRHHLAKHNYKDALTKSILFFEGQRSGKLPSNQ 76

Query: 64  RLTWRGNSALSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIE 123
           R++WR +S LSDGS+ HVDLVGGYYDAGDN+KFG PMAFTTT+L+WSVIEFG  M +E++
Sbjct: 77  RMSWRRDSGLSDGSALHVDLVGGYYDAGDNIKFGFPMAFTTTMLSWSVIEFGGLMKSELQ 136

Query: 124 NAREAVRWGSDYLLKAATDAPNGLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGS 183
           NA+ A+RW +DYLLK AT  P+ +YVQVGD N DH CWERPEDMDT R+V+K+    PGS
Sbjct: 137 NAKIAIRWATDYLLK-ATSQPDTIYVQVGDANKDHSCWERPEDMDTVRSVFKVDKNIPGS 196

Query: 184 DVASETAAALAAASIVFNTSDPSYSNKLLDAALKVFDFADKHRGAYSDSLHSVVCPFYCS 243
           DVA+ETAAALAAA+IVF  SDPSYS  LL  A+ VF FADK+RG YS  L   VCPFYCS
Sbjct: 197 DVAAETAAALAAAAIVFRKSDPSYSKVLLKRAISVFAFADKYRGTYSAGLKPDVCPFYCS 256

Query: 244 YSGYNDELLWGASWIYKASKNSIHLNYIQSNGHILGADDDDYTFSWDDKRPGTKILLSQD 303
           YSGY DELLWGA+W+ KA+KN  +LNYI+ NG ILGA + D TF WD+K  G +ILL++ 
Sbjct: 257 YSGYQDELLWGAAWLQKATKNIKYLNYIKINGQILGAAEYDNTFGWDNKHAGARILLTKA 316

Query: 304 FLVQNSEEFQIYKAHSDNYICSLIPGTSSSSGQYTPGGLFFKGSESNLQYVTSAAFLLVT 363
           FLVQN +    YK H+DN+ICS+IPG   SS QYTPGGL FK +++N+QYVTS +FLL+T
Sbjct: 317 FLVQNVKTLHEYKGHADNFICSVIPGAPFSSTQYTPGGLLFKMADANMQYVTSTSFLLLT 376

Query: 364 YAKYLSSNGGAIRCGTSRISPEELIAEAKKQVDYILGENPEKMSYMVGFGERYPQHIHHR 423
           YAKYL+S    + CG S  +P  L + AK+QVDY+LG+NP +MSYMVG+G ++P+ IHHR
Sbjct: 377 YAKYLTSAKTVVHCGGSVYTPGRLRSIAKRQVDYLLGDNPLRMSYMVGYGPKFPRRIHHR 436

Query: 424 GSSVPSLHSRPNQVSCNEGFQFLYSSSPNPNVLAGAIVGGPDNGDKFSDDRNNYQQSEPA 480
           GSS+P + S P ++ C++GF  + S SPNPN L GA+VGGPD  D+F D+R++Y+QSEPA
Sbjct: 437 GSSLPCVASHPAKIQCHQGFAIMNSQSPNPNFLVGAVVGGPDQHDRFPDERSDYEQSEPA 496

BLAST of CmaCh16G012330 vs. TAIR 10
Match: AT1G22880.1 (cellulase 5 )

HSP 1 Score: 562.8 bits (1449), Expect = 5.3e-160
Identity = 270/471 (57.32%), Postives = 343/471 (72.82%), Query Frame = 0

Query: 10  YFILPFFTLS-SSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRGNSALSDGSSYH 69
           +F+     LS  + + S +Y  AL  S+LFF+GQRSG+LPS+Q+L+WR +S LSDGSS H
Sbjct: 6   FFVFLLSALSLENTYASPNYREALSKSLLFFQGQRSGRLPSDQQLSWRSSSGLSDGSSAH 65

Query: 70  VDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENAREAVRWGSDYLLKAA 129
           VDL GGYYDAGDNVKF  PMAFTTT+L+WS +E+G  MG E++N+R A+RW +DYLLK A
Sbjct: 66  VDLTGGYYDAGDNVKFNFPMAFTTTMLSWSSLEYGKKMGPELQNSRVAIRWATDYLLKCA 125

Query: 130 TDAPNGLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVASETAAALAAASIVF 189
              P  LYV VGDPN DHKCWERPEDMDTPRTVY ++  NPGSDVA+ETAAALAA+S+VF
Sbjct: 126 RATPGKLYVGVGDPNGDHKCWERPEDMDTPRTVYSVSPSNPGSDVAAETAAALAASSMVF 185

Query: 190 NTSDPSYSNKLLDAALKVFDFADKHRGAYSDSLHSVVCPFYCSYSGYNDELLWGASWIYK 249
              DP YS  LL  A KV  FA ++RGAYS+SL S VCPFYCSYSGY DELLWGA+W+++
Sbjct: 186 RKVDPKYSRLLLATAKKVMQFAIQYRGAYSNSLSSSVCPFYCSYSGYKDELLWGAAWLHR 245

Query: 250 ASKNSIHLNYIQSNGHILGADDDDYTFSWDDKRPGTKILLSQDFLVQNSEEFQIYKAHSD 309
           A+ +  + N+I+S    LG  D    FSWD+K  G  +LLS+  ++     F++YK  ++
Sbjct: 246 ATNDPYYTNFIKS----LGGGDQPDIFSWDNKYAGAYVLLSRRAVLNKDNNFELYKQAAE 305

Query: 310 NYICSLIPGTSSSSGQYTPGGLFFKGSESNLQYVTSAAFLLVTYAKYLSSNGGAIRCGTS 369
           N++C ++P + SSS +YT GGL +K  +SNLQYVTS  FLL TYAKY+ S      CG S
Sbjct: 306 NFMCKILPNSPSSSTKYTKGGLMYKLPQSNLQYVTSITFLLTTYAKYMKSTKQTFNCGNS 365

Query: 370 RISPEELIAEAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSLHSRPNQVSCN 429
            I P  LI  +K+QVDY+LG NP KMSYMVGF   +P+ IHHRGSS+PS   R N + CN
Sbjct: 366 LIVPNALINLSKRQVDYVLGVNPMKMSYMVGFSSNFPKRIHHRGSSLPSRAVRSNSLGCN 425

Query: 430 EGFQFLYSSSPNPNVLAGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPFI 480
            GFQ   + +PNPN+L GAIVGGP+  D++ D R++Y +SEPATYINA F+
Sbjct: 426 GGFQSFRTQNPNPNILTGAIVGGPNQNDEYPDQRDDYTRSEPATYINAAFV 472

BLAST of CmaCh16G012330 vs. TAIR 10
Match: AT1G71380.1 (cellulase 3 )

HSP 1 Score: 560.5 bits (1443), Expect = 2.6e-159
Identity = 273/476 (57.35%), Postives = 342/476 (71.85%), Query Frame = 0

Query: 5   ITLALYFILPFFT-LSSSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRGNSALSD 64
           +T   +F+L F + L S+   + +Y  AL  S+LFF+GQRSG LP  Q+++WR +S LSD
Sbjct: 1   MTSLFFFVLLFSSLLISNGDANPNYKEALSKSLLFFQGQRSGPLPRGQQISWRASSGLSD 60

Query: 65  GSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENAREAVRWGSDY 124
           GS+ HVDL GGYYDAGDNVKF LPMAFTTT+L+WS +E+G  MG E+ENAR  +RW +DY
Sbjct: 61  GSAAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWSALEYGKRMGPELENARVNIRWATDY 120

Query: 125 LLKAATDAPNGLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVASETAAALAA 184
           LLK A   P  LYV VGDPN+DHKCWERPEDMDTPRTVY ++A NPGSDVA+ETAAALAA
Sbjct: 121 LLKCARATPGKLYVGVGDPNVDHKCWERPEDMDTPRTVYSVSASNPGSDVAAETAAALAA 180

Query: 185 ASIVFNTSDPSYSNKLLDAALKVFDFADKHRGAYSDSLHSVVCPFYCSYSGYNDELLWGA 244
           AS+VF   D  YS  LL  A  V  FA +++GAYSDSL S VCPFYCSYSGY DEL+WGA
Sbjct: 181 ASMVFRKVDSKYSRLLLATAKDVMQFAIQYQGAYSDSLSSSVCPFYCSYSGYKDELMWGA 240

Query: 245 SWIYKASKNSIHLNYIQSNGHILGADDDDYTFSWDDKRPGTKILLSQDFLVQNSEEFQIY 304
           SW+ +A+ N  + N+I+S    LG  D    FSWD+K  G  +LLS+  L+     F+ Y
Sbjct: 241 SWLLRATNNPYYANFIKS----LGGGDQPDIFSWDNKYAGAYVLLSRRALLNKDSNFEQY 300

Query: 305 KAHSDNYICSLIPGTSSSSGQYTPGGLFFKGSESNLQYVTSAAFLLVTYAKYLSSNGGAI 364
           K  ++N+IC ++P + SSS QYT GGL +K  +SNLQYVTS  FLL TYAKY+ +     
Sbjct: 301 KQAAENFICKILPDSPSSSTQYTQGGLMYKLPQSNLQYVTSITFLLTTYAKYMKATKHTF 360

Query: 365 RCGTSRISPEELIAEAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSLHSRPN 424
            CG+S I P  LI+ +K+QVDYILG+NP KMSYMVGF   +P+ IHHR SS+PS   R  
Sbjct: 361 NCGSSVIVPNALISLSKRQVDYILGDNPIKMSYMVGFSSNFPKRIHHRASSLPSHALRSQ 420

Query: 425 QVSCNEGFQFLYSSSPNPNVLAGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPFI 480
            + CN GFQ  Y+ +PNPN+L GAIVGGP+  D + D R++Y  +EPATYINA F+
Sbjct: 421 SLGCNGGFQSFYTQNPNPNILTGAIVGGPNQNDGYPDQRDDYSHAEPATYINAAFV 472

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P055221.2e-20972.14Endoglucanase 1 OS=Persea americana OX=3435 GN=CEL1 PE=2 SV=1[more]
Q6YXT78.7e-18462.99Endoglucanase 19 OS=Oryza sativa subsp. japonica OX=39947 GN=Os08g0114200 PE=2 S... [more]
Q6Z7152.1e-17762.58Endoglucanase 4 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU14 PE=2 SV=1[more]
Q9SRX34.1e-17360.57Endoglucanase 1 OS=Arabidopsis thaliana OX=3702 GN=CEL2 PE=2 SV=1[more]
O814161.7e-17160.45Endoglucanase 17 OS=Arabidopsis thaliana OX=3702 GN=At4g02290 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
AT5G42090.17.8e-20480.88Lung seven transmembrane receptor family protein [more]
AT1G02800.12.9e-17460.57cellulase 2 [more]
AT4G02290.11.2e-17260.45glycosyl hydrolase 9B13 [more]
AT1G22880.15.3e-16057.32cellulase 5 [more]
AT1G71380.12.6e-15957.35cellulase 3 [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001701Glycoside hydrolase family 9PFAMPF00759Glyco_hydro_9coord: 28..479
e-value: 1.2E-143
score: 479.7
IPR012341Six-hairpin glycosidase-like superfamilyGENE3D1.50.10.10coord: 23..492
e-value: 4.7E-166
score: 555.3
IPR009637Transmembrane protein GPR107/GPR108-likePFAMPF06814Lung_7-TM_Rcoord: 624..908
e-value: 7.4E-49
score: 166.6
NoneNo IPR availablePANTHERPTHR22298ENDO-1,4-BETA-GLUCANASEcoord: 15..479
NoneNo IPR availablePANTHERPTHR22298:SF29ENDOGLUCANASE 4coord: 15..479
IPR033126Glycosyl hydrolases family 9, Asp/Glu active sitesPROSITEPS00698GH9_3coord: 458..476
IPR018221Glycoside hydrolase family 9, His active sitePROSITEPS00592GH9_2coord: 385..411
IPR008928Six-hairpin glycosidase superfamilySUPERFAMILY48208Six-hairpin glycosidasescoord: 25..480

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh16G012330.1CmaCh16G012330.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0030245 cellulose catabolic process
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0008810 cellulase activity
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds