CSPI03G00790 (gene) Wild cucumber (PI 183967)

Name: CSPI03G00790
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Endoglucanase
Location: Chr3 : 530287 .. 533299 (-)
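
To illustrate how the location field reads, the sketch below parses it and derives the span length. It assumes the coordinates are 1-based and inclusive (the common GFF-style convention; the page does not state this), and the helper name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Parse a string like 'Chr3 : 530287 .. 533299 (-)' into its parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("Chr3 : 530287 .. 533299 (-)")
# Assumption: 1-based inclusive coordinates, so both endpoints count.
span = end - start + 1
print(chrom, strand, span)
```

A `-` strand means the feature is read as the reverse complement of the chromosome sequence over that interval.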
The following sequences are available for this feature:

Gene sequence (with intron)

TTCCAAGTGAGTGTGTTATATTACAACTTATAACGATATTACAATGTCTTTCTCAATCACTTTGGCTCTCTACTTCATTTTGTCTCTCTTTACCTTATCTTCCTCTGCTTTCACTTCTCAACATTATTCTACTGCTCTTCAGTATTCTATTCTTTTCTTTGAGGGACAGCGATCCGGAAAGCTGCCCTCTAACCAACGTCTCACATGGAGAGCTGATTCAGCCTTATCAGATGGCTCCTCCTATCATGTGTGCTTACTCCCATTTTTCTTATTGTGATGTTTGTAACTTCTACAAGTCTAATCTAGTTGATAAATAACTTAGGCTCTAACTTAATGGGATGTGAATATGAACTTTTCAGGTTGACCTTGTTGGTGGCTACTATGATGCTGGGGATAATGTCAAGTTTGGCTTGCCAATGGCCTTCACTACTACATTGCTGGCTTGGAGTGTCATTGAGTTTGGCGACTCGATGGGGAATGAGATTGAGAATGCAAGAGCAGCAGTTCGTTGGGGGTCGGATTATCTATTGAAAGCTGCTACTGCTGCACCTGATGTCTTATATGTTCAAGTGAGTAGAGTGACCGTAAACAAGTTTGTGAGAGAAAGAGAGGAAAGGAAAAAGAATGTGATGGTTAAGATAATAATAATCACTACGACGAGAATGATTTTTAACGTAAGTAGTAAACTTGTGTGAAGGTGGGAGATCCAAACCTAGATCATAAATGTTGGGAAAGGCCAGAAGACATGGACACGCCACGTACTGTGTATAAGATAACTGCTCAAAACCCAGGCTCTGATGTAGCAGCAGAGACCGCAGCTGCGTTGGCTGCAGCTTCAATCGTGTTCAAAGCATCCGACCCTTCTTATTCTAACAAATTACTGGACGCAGCCTTAAAAGTAAATCATACTCTCATTCCTCATACACATTGCATGTCATTATGCTCTAAACAAAGTAACAAGTTCCCATTTAATGCAATGCAGGTATTCGATTTAGCAGACAAGCATAGAGGTTCTTACAGTGATTCACTCCATTCAGTGGTCTGTCCATTTTACTGTTCTTACTCGGGATACAATGTAAGTAAGAAGATAAGATGACCATACATCTACTTGTTTTCAATCTTTTTTTTTCCTTCAAGGTGTTCTTTTTTCTGAGTTCGTATTGACTTGTATCAGGATGAGCTTCTATGGGCTGCCTCGTGGGTTTACAAAGCCTCAAAAAACAGCATTCATTTAAGCTATATACAGTCCAATGGCCATATACTAGGAGCCGAAGAAGACGACTACACTTTTAGCTGGGACGACAAACGCCCTGGAACCAAGATCCTTCTCTCCCAGGTCTGTCTTTGCTCCCTCTTGATAAAAATATATATATATATGTCCAGAATCAAATTCCTAAACTAAAGCTTGATGATTTTATCATGATGTGGACCTATGCCACAGGATTTCTTAGTGCAAAGTTCGGAGGAGTTCCAAATCTATAAAGCACACTCAGATAATTACATATGCTCCCTCATTCCAGGAACTTCCACTTCTAGTGGTCAATATACTCCTGGTTAGCAATAACATAAACTGTTTCCTTGACATTTGTTTACATACAGTCTTAATCCTATTTTCGTCCCTAAATATCGAACATTTTTCCATATTGGTCCCTGAATTAAAAAAATGTTCATTTTAGTCCTTAAAACTTTACAAAGACCGAAAAAGACATTATTGCCCATGACTTTCAGAAGGTCTATTTCATTACATCCACCTATGAAAAAGACAACTTTTGTCCTTTACTATTAATTTATGATTGCTACCTAACAGTTATCCACCCAATACTTTCTCAAGAGTGCCAAAGTTGATAAATTAATAAATATATGGTTGATTGGTTTTTAAATACTCTTTTGGTTCCTACACTTTCCGTCTTAATTTATTTTGGTTCTTATACTTTCAAAATGATCGTTTCAATCTTTATGGTTTCGTTTTTTGGTTCTTTAAAATATGAAATAAAAATGAA
AATAAAAGGACCATGTTCGATTATAAAAACATTTTAAAAGTATAGAGATCAAAATTTAAACCAAATTAACCTTAGCCAAAAGTACACGATTTATGTGATTCAACCCTATACACAAAGAATCAAAAGAAGAGATAGTACACAGACAAGCTAATTTGAACGAAAATACGCCATGCTCAAAGAAATTGAAAATTGCAGGAGGACTATTTTTCAAAGGAAGCGAGAGCAACCTGCAATATGTAACTTCAGCAGCGTTTCTTCTTCTGACATACGCAAAATACCTAAGCTCCAGTGGGGGATCCATTCGATGTGGGACTTCAAGGATTTCACCAGAAGACCTAATAGCACAAGCAAAGAAACAAGTTGATTACATATTGGGAGAAAATCCAGAGAAAATGTCATACATGGTGGGATTTGGAGAACGATACCCTCAGCATATTCATCACAGAGGTTCCTCTGTACCGTCCCTTCATTCACACCCTAATCGAGTTTCTTGCAATGATGGTTTCCAGTTCCTGTACTCTTCTTCGCCAAACCCAAATCTGCTCCTTGGTGCCATTGTTGGTGGACCTGATAATGGCGATAAATTTTCCGACGATCGGAATAACTATCAGCAGTCGGAGCCAGCTACTTATATAAACGCTCCACTTGTTGGTGCCTTAGCCTTTTTTGCAAAAACAACTTAGTAGATAGTTACATTTTAGTTAAGAAGGAAGGAGAGATGATTGGAATTTCGTCCTATAAGATTGTGTTTAGGGTTTATGGGTTTAGCTCGGGTCAAAAGTTCAAATCTCCACGAGCTAAAACAAAAGTATATTAAAGTAAGTCGGAAACTTCGAAGGTAGATATCTTGCTATGATTTGACTTATACATATGAGTCTTAAGTAGGTTCGAGGGTCTTTTCTTTTTTTTTTTTTCATTTAATCCTTTTCATGTATGGTGCTTAATTTGACGACAAGTCGATAAGTTGTCAATTTGGATCTCATCAATTAGCACTGTGCCTAACATCTCACCGA

mRNA sequence

ATGTCTTTCTCAATCACTTTGGCTCTCTACTTCATTTTGTCTCTCTTTACCTTATCTTCCTCTGCTTTCACTTCTCAACATTATTCTACTGCTCTTCAGTATTCTATTCTTTTCTTTGAGGGACAGCGATCCGGAAAGCTGCCCTCTAACCAACGTCTCACATGGAGAGCTGATTCAGCCTTATCAGATGGCTCCTCCTATCATGTTGACCTTGTTGGTGGCTACTATGATGCTGGGGATAATGTCAAGTTTGGCTTGCCAATGGCCTTCACTACTACATTGCTGGCTTGGAGTGTCATTGAGTTTGGCGACTCGATGGGGAATGAGATTGAGAATGCAAGAGCAGCAGTTCGTTGGGGGTCGGATTATCTATTGAAAGCTGCTACTGCTGCACCTGATGTCTTATATGTTCAAGTGGGAGATCCAAACCTAGATCATAAATGTTGGGAAAGGCCAGAAGACATGGACACGCCACGTACTGTGTATAAGATAACTGCTCAAAACCCAGGCTCTGATGTAGCAGCAGAGACCGCAGCTGCGTTGGCTGCAGCTTCAATCGTGTTCAAAGCATCCGACCCTTCTTATTCTAACAAATTACTGGACGCAGCCTTAAAAGTATTCGATTTAGCAGACAAGCATAGAGGTTCTTACAGTGATTCACTCCATTCAGTGGTCTGTCCATTTTACTGTTCTTACTCGGGATACAATGATGAGCTTCTATGGGCTGCCTCGTGGGTTTACAAAGCCTCAAAAAACAGCATTCATTTAAGCTATATACAGTCCAATGGCCATATACTAGGAGCCGAAGAAGACGACTACACTTTTAGCTGGGACGACAAACGCCCTGGAACCAAGATCCTTCTCTCCCAGGATTTCTTAGTGCAAAGTTCGGAGGAGTTCCAAATCTATAAAGCACACTCAGATAATTACATATGCTCCCTCATTCCAGGAACTTCCACTTCTAGTGGTCAATATACTCCTGGAGGACTATTTTTCAAAGGAAGCGAGAGCAACCTGCAATATGTAACTTCAGCAGCGTTTCTTCTTCTGACATACGCAAAATACCTAAGCTCCAGTGGGGGATCCATTCGATGTGGGACTTCAAGGATTTCACCAGAAGACCTAATAGCACAAGCAAAGAAACAAGTTGATTACATATTGGGAGAAAATCCAGAGAAAATGTCATACATGGTGGGATTTGGAGAACGATACCCTCAGCATATTCATCACAGAGGTTCCTCTGTACCGTCCCTTCATTCACACCCTAATCGAGTTTCTTGCAATGATGGTTTCCAGTTCCTGTACTCTTCTTCGCCAAACCCAAATCTGCTCCTTGGTGCCATTGTTGGTGGACCTGATAATGGCGATAAATTTTCCGACGATCGGAATAACTATCAGCAGTCGGAGCCAGCTACTTATATAAACGCTCCACTTGTTGGTGCCTTAGCCTTTTTTGCAAAAACAACTTAG

Coding sequence (CDS)

ATGTCTTTCTCAATCACTTTGGCTCTCTACTTCATTTTGTCTCTCTTTACCTTATCTTCCTCTGCTTTCACTTCTCAACATTATTCTACTGCTCTTCAGTATTCTATTCTTTTCTTTGAGGGACAGCGATCCGGAAAGCTGCCCTCTAACCAACGTCTCACATGGAGAGCTGATTCAGCCTTATCAGATGGCTCCTCCTATCATGTTGACCTTGTTGGTGGCTACTATGATGCTGGGGATAATGTCAAGTTTGGCTTGCCAATGGCCTTCACTACTACATTGCTGGCTTGGAGTGTCATTGAGTTTGGCGACTCGATGGGGAATGAGATTGAGAATGCAAGAGCAGCAGTTCGTTGGGGGTCGGATTATCTATTGAAAGCTGCTACTGCTGCACCTGATGTCTTATATGTTCAAGTGGGAGATCCAAACCTAGATCATAAATGTTGGGAAAGGCCAGAAGACATGGACACGCCACGTACTGTGTATAAGATAACTGCTCAAAACCCAGGCTCTGATGTAGCAGCAGAGACCGCAGCTGCGTTGGCTGCAGCTTCAATCGTGTTCAAAGCATCCGACCCTTCTTATTCTAACAAATTACTGGACGCAGCCTTAAAAGTATTCGATTTAGCAGACAAGCATAGAGGTTCTTACAGTGATTCACTCCATTCAGTGGTCTGTCCATTTTACTGTTCTTACTCGGGATACAATGATGAGCTTCTATGGGCTGCCTCGTGGGTTTACAAAGCCTCAAAAAACAGCATTCATTTAAGCTATATACAGTCCAATGGCCATATACTAGGAGCCGAAGAAGACGACTACACTTTTAGCTGGGACGACAAACGCCCTGGAACCAAGATCCTTCTCTCCCAGGATTTCTTAGTGCAAAGTTCGGAGGAGTTCCAAATCTATAAAGCACACTCAGATAATTACATATGCTCCCTCATTCCAGGAACTTCCACTTCTAGTGGTCAATATACTCCTGGAGGACTATTTTTCAAAGGAAGCGAGAGCAACCTGCAATATGTAACTTCAGCAGCGTTTCTTCTTCTGACATACGCAAAATACCTAAGCTCCAGTGGGGGATCCATTCGATGTGGGACTTCAAGGATTTCACCAGAAGACCTAATAGCACAAGCAAAGAAACAAGTTGATTACATATTGGGAGAAAATCCAGAGAAAATGTCATACATGGTGGGATTTGGAGAACGATACCCTCAGCATATTCATCACAGAGGTTCCTCTGTACCGTCCCTTCATTCACACCCTAATCGAGTTTCTTGCAATGATGGTTTCCAGTTCCTGTACTCTTCTTCGCCAAACCCAAATCTGCTCCTTGGTGCCATTGTTGGTGGACCTGATAATGGCGATAAATTTTCCGACGATCGGAATAACTATCAGCAGTCGGAGCCAGCTACTTATATAAACGCTCCACTTGTTGGTGCCTTAGCCTTTTTTGCAAAAACAACTTAG
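
The CDS above can be checked against the protein query used in the BLAST alignments by translating it. The following is a minimal sketch using the standard genetic code; the function name `translate` is hypothetical, and the code assumes an unambiguous A/C/G/T sequence whose length is a multiple of three.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons of the CDS listed above.
print(translate("ATGTCTTTCTCAATCACTTTGGCT"))
```

The first eight codons translate to `MSFSITLA`, which matches the start of the query sequence in the alignments.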
BLAST of CSPI03G00790 vs. Swiss-Prot
Match: GUN1_PERAE (Endoglucanase 1 OS=Persea americana GN=CEL1 PE=2 SV=1)

HSP 1 Score: 745.0 bits (1922), Expect = 5.4e-214
Identity = 357/488 (73.16%), Positives = 416/488 (85.25%), Query Frame = 1

Query: 1   MSFSITLALYFILSLFTLSSSAFTSQ--HYSTALQYSILFFEGQRSGKLPSNQRLTWRAD 60
           M  S  L+L+ +L + T+     ++   HYS AL+ SILFFEGQRSGKLP+NQRLTWR D
Sbjct: 1   MDCSSPLSLFHLLLVCTVMVKCCSASDLHYSDALEKSILFFEGQRSGKLPTNQRLTWRGD 60

Query: 61  SALSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENARAAVR 120
           S LSDGSSYHVDLVGGYYDAGDN+KFGLPMAFTTT+LAW +IEFG  M  ++ENARAA+R
Sbjct: 61  SGLSDGSSYHVDLVGGYYDAGDNLKFGLPMAFTTTMLAWGIIEFGCLMPEQVENARAALR 120

Query: 121 WGSDYLLKAATAAPDVLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVAAETA 180
           W +DYLLKA+TA  + LYVQVG+PN DH+CWERPEDMDTPR VYK++ QNPGSDVAAETA
Sbjct: 121 WSTDYLLKASTATSNSLYVQVGEPNADHRCWERPEDMDTPRNVYKVSTQNPGSDVAAETA 180

Query: 181 AALAAASIVFKASDPSYSNKLLDAALKVFDLADKHRGSYSDSLHSVVCPFYCSYSGYNDE 240
           AALAAASIVF  SD SYS KLL  A+KVF+ AD++RGSYSDSL SVVCPFYCSYSGYNDE
Sbjct: 181 AALAAASIVFGDSDSSYSTKLLHTAVKVFEFADQYRGSYSDSLGSVVCPFYCSYSGYNDE 240

Query: 241 LLWAASWVYKASKNSIHLSYIQSNGHILGAEEDDYTFSWDDKRPGTKILLSQDFLVQSSE 300
           LLW ASW+++AS+N+ +++YIQSNGH LGA++DDY+FSWDDKR GTK+LLS+ FL    E
Sbjct: 241 LLWGASWLHRASQNASYMTYIQSNGHTLGADDDDYSFSWDDKRVGTKVLLSKGFLQDRIE 300

Query: 301 EFQIYKAHSDNYICSLIPGTSTSSGQYTPGGLFFKGSESNLQYVTSAAFLLLTYAKYLSS 360
           E Q+YK H+DNYICSLIPGTS+   QYTPGGL +KGS SNLQYVTS AFLLLTYA YL+S
Sbjct: 301 ELQLYKVHTDNYICSLIPGTSSFQAQYTPGGLLYKGSASNLQYVTSTAFLLLTYANYLNS 360

Query: 361 SGGSIRCGTSRISPEDLIAQAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSL 420
           SGG   CGT+ ++ ++LI+ AKKQVDYILG+NP KMSYMVGFGERYPQH+HHRGSS+PS+
Sbjct: 361 SGGHASCGTTTVTAKNLISLAKKQVDYILGQNPAKMSYMVGFGERYPQHVHHRGSSLPSV 420

Query: 421 HSHPNRVSCNDGFQFLYSSSPNPNLLLGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPL 480
             HPN + CN GFQ+LYSS PNPN+L+GAI+GGPDN D FSDDRNNYQQSEPATYINAPL
Sbjct: 421 QVHPNSIPCNAGFQYLYSSPPNPNILVGAILGGPDNRDSFSDDRNNYQQSEPATYINAPL 480

Query: 481 VGALAFFA 487
           VGALAFFA
Sbjct: 481 VGALAFFA 488
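
The percentages in an HSP summary line are the match counts divided by the alignment length (columns, including gaps). As a hedged illustration, the sketch below recomputes them from the fractions in the first hit; the helper name `hsp_fractions` is hypothetical.

```python
import re

def hsp_fractions(line):
    """Return {label: (matches, length, percent)} for each 'label = a/b' pair."""
    out = {}
    for label, a, b in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        a, b = int(a), int(b)
        out[label] = (a, b, round(100 * a / b, 2))
    return out

# Fractions taken from the GUN1_PERAE hit above.
stats = hsp_fractions("Identity = 357/488 (73.16%), Positives = 416/488 (85.25%)")
print(stats)
```

The recomputed values agree with the percentages printed in the summary line.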

BLAST of CSPI03G00790 vs. Swiss-Prot
Match: GUN19_ORYSJ (Endoglucanase 19 OS=Oryza sativa subsp. japonica GN=Os08g0114200 PE=2 SV=1)

HSP 1 Score: 670.2 bits (1728), Expect = 1.7e-191
Identity = 318/491 (64.77%), Positives = 395/491 (80.45%), Query Frame = 1

Query: 5   ITLALYFILSLFTLSSSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRADSALSDG 64
           I L L  +L L  L  S+  + +Y+ AL  SI+FFEGQRSGKLP   R+ WRADS L+DG
Sbjct: 32  IRLRLLVVLHLLLLVPSSAMAFNYADALAKSIIFFEGQRSGKLPPGNRMPWRADSGLTDG 91

Query: 65  SSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENARAAVRWGSDYL 124
           + Y+VDLVGGYYDAGDNVKFGLPMAF+TT+LAWSV++FG  MG E+ NARAAVRWG+DYL
Sbjct: 92  AQYNVDLVGGYYDAGDNVKFGLPMAFSTTMLAWSVLDFGKFMGAELPNARAAVRWGADYL 151

Query: 125 LKAATAAPDVLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVAAETAAALAAA 184
           LKAATA P  LYVQV DPN DH+CWERPEDMDTPR+VY++TA  PGSDVA ETAAALAA+
Sbjct: 152 LKAATATPGALYVQVADPNQDHRCWERPEDMDTPRSVYRVTADKPGSDVAGETAAALAAS 211

Query: 185 SIVFKASDPSYSNKLLDAALKVFDLADKHRGSYSDSLHSVVCPFYCSYSGYNDELLWAAS 244
           S+VF+ +DP+YS +LL AA +VFD AD+HRGSYSDSL S VCPFYCSYSGY+DELLW AS
Sbjct: 212 SMVFRRADPAYSARLLHAATQVFDFADRHRGSYSDSLASSVCPFYCSYSGYHDELLWGAS 271

Query: 245 WVYKASKNSIHLSYIQSNGHILGAEEDDYTFSWDDKRPGTKILLSQDFLVQSSEEFQIYK 304
           W+++AS+N+  +SY+++NG  LGA +DDY+FSWDDKR GTK+LL++ FL       ++YK
Sbjct: 272 WLHRASRNASFMSYVEANGMQLGAGDDDYSFSWDDKRVGTKVLLAKGFLRNRLHGLELYK 331

Query: 305 AHSDNYICSLIPGTSTSSGQYTPGGLFFKGSESNLQYVTSAAFLLLTYAKYLSSSGGSIR 364
           AHSD+YICSL+PGT++   +YTPGGL ++   SN+QYVT+A FL+L YAKYL SSG +  
Sbjct: 332 AHSDSYICSLVPGTASFQSRYTPGGLLYREGSSNMQYVTTATFLMLAYAKYLRSSGATAS 391

Query: 365 CG------TSRISPEDLIAQAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSL 424
           CG         +S  +L+A AK+QVDYILG+NP  MSYMVGFG RYP+  HHRG+S+PS+
Sbjct: 392 CGDGGGGARGEVSAAELVAVAKRQVDYILGKNPAGMSYMVGFGCRYPRRAHHRGASMPSV 451

Query: 425 HSHPNRVSCNDGFQFLYSSSPNPNLLLGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPL 484
            +HP R+SC+ GF +L+S  PNPN+L+GA+VGGPD+ D F+DDR N+ QSEPATYINAPL
Sbjct: 452 RAHPGRISCDAGFGYLHSGEPNPNVLVGAVVGGPDSRDAFADDRGNFAQSEPATYINAPL 511

Query: 485 VGALAFFAKTT 490
           VGALA+FA TT
Sbjct: 512 VGALAYFAGTT 522

BLAST of CSPI03G00790 vs. Swiss-Prot
Match: GUN4_ORYSJ (Endoglucanase 4 OS=Oryza sativa subsp. japonica GN=GLU14 PE=2 SV=1)

HSP 1 Score: 658.7 bits (1698), Expect = 5.1e-188
Identity = 326/496 (65.73%), Positives = 387/496 (78.02%), Query Frame = 1

Query: 9   LYFILSLFTLSSSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRADSALSDGSSYH 68
           L  +++   L+     + +Y+ AL  +ILFFE QRSGKLP  QR+ WRADS LSDGS+  
Sbjct: 6   LLLVVAAVCLAGREAAAFNYADALDKAILFFEAQRSGKLPPGQRVAWRADSGLSDGSADG 65

Query: 69  VDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGN----------------EIEN 128
           VDL GGYYDAGDNVKFGLPMAFT T+L+WSVIEFGD M                  +++N
Sbjct: 66  VDLAGGYYDAGDNVKFGLPMAFTVTMLSWSVIEFGDMMPARRSSFLGGIFGGGGVAQLDN 125

Query: 129 ARAAVRWGSDYLLKAATAAPDVLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSD 188
           ARAAVRWG+DYLLKAATA PD LYVQV DP  DH+CWERPEDMDTPR+VYK+T Q+PGSD
Sbjct: 126 ARAAVRWGADYLLKAATATPDTLYVQVADPYQDHRCWERPEDMDTPRSVYKVTPQSPGSD 185

Query: 189 VAAETAAALAAASIVFKASDPSYSNKLLDAALKVFDLADKHRGSYSDSLHSVVCPFYCSY 248
           VA ETAAALAAASIVF+ SDPSYS KLLDAA  VFD ADK+RGSYSDSL SVVCPFYCS+
Sbjct: 186 VAGETAAALAAASIVFRVSDPSYSAKLLDAAQLVFDFADKYRGSYSDSLSSVVCPFYCSH 245

Query: 249 SGYNDELLWAASWVYKAS--KNSIHLSYIQSNGHILGAEEDDYTFSWDDKRPGTKILLSQ 308
           S Y+DELLWAASW++ AS  K  ++LSYI SNGH LGAE+DD+TFSWDDKR  TK     
Sbjct: 246 S-YHDELLWAASWLHLASPEKKDVYLSYIGSNGHALGAEQDDFTFSWDDKRVATK----- 305

Query: 309 DFLVQSSEEFQIYKAHSDNYICSLIPGTSTSSGQYTPGGLFFKGSESNLQYVTSAAFLLL 368
            FL   ++  Q+YKAH+DNYICSL+PG +    QYTPGGL FK  +SN+QYVTS AFLLL
Sbjct: 306 GFLQSRADGLQLYKAHTDNYICSLVPGANGFQSQYTPGGLLFKEGDSNMQYVTSTAFLLL 365

Query: 369 TYAKYLSSSGGSIRCGTSRISPEDLIAQAKKQVDYILGENPEKMSYMVGFGERYPQHIHH 428
           TYAKYLSSS  ++ CG++ +SP  LI+ AKKQVDYILG NP  MSYMVGFG RYP+H+HH
Sbjct: 366 TYAKYLSSSAATVSCGSTAVSPSTLISLAKKQVDYILGANPAGMSYMVGFGARYPRHVHH 425

Query: 429 RGSSVPSLHSHPNRVSCNDGFQFLYSSSPNPNLLLGAIVGGPDNGDKFSDDRNNYQQSEP 487
           RG+S+PS+  HP R+ C++GF++L+S  P+ NLL GA+VGGPD GD F+D R+NY Q+EP
Sbjct: 426 RGASMPSVRDHPARIGCDEGFRYLHSPEPDRNLLAGAVVGGPDAGDAFADGRDNYAQAEP 485

BLAST of CSPI03G00790 vs. Swiss-Prot
Match: GUN17_ARATH (Endoglucanase 17 OS=Arabidopsis thaliana GN=At4g02290 PE=2 SV=1)

HSP 1 Score: 623.6 bits (1607), Expect = 1.8e-177
Identity = 305/495 (61.62%), Positives = 381/495 (76.97%), Query Frame = 1

Query: 4   SITLALYFILS---LFTLSSSAFTSQH---------YSTALQYSILFFEGQRSGKLPSNQ 63
           +I L+ +F L     +  +SS F + H         Y  AL  SILFFEGQRSGKLPSNQ
Sbjct: 17  TIFLSFFFFLCNGFSYPTTSSLFNTHHHRHHLAKHNYKDALTKSILFFEGQRSGKLPSNQ 76

Query: 64  RLTWRADSALSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIE 123
           R++WR DS LSDGS+ HVDLVGGYYDAGDN+KFG PMAFTTT+L+WSVIEFG  M +E++
Sbjct: 77  RMSWRRDSGLSDGSALHVDLVGGYYDAGDNIKFGFPMAFTTTMLSWSVIEFGGLMKSELQ 136

Query: 124 NARAAVRWGSDYLLKAATAAPDVLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGS 183
           NA+ A+RW +DYLLKA T+ PD +YVQVGD N DH CWERPEDMDT R+V+K+    PGS
Sbjct: 137 NAKIAIRWATDYLLKA-TSQPDTIYVQVGDANKDHSCWERPEDMDTVRSVFKVDKNIPGS 196

Query: 184 DVAAETAAALAAASIVFKASDPSYSNKLLDAALKVFDLADKHRGSYSDSLHSVVCPFYCS 243
           DVAAETAAALAAA+IVF+ SDPSYS  LL  A+ VF  ADK+RG+YS  L   VCPFYCS
Sbjct: 197 DVAAETAAALAAAAIVFRKSDPSYSKVLLKRAISVFAFADKYRGTYSAGLKPDVCPFYCS 256

Query: 244 YSGYNDELLWAASWVYKASKNSIHLSYIQSNGHILGAEEDDYTFSWDDKRPGTKILLSQD 303
           YSGY DELLW A+W+ KA+KN  +L+YI+ NG ILGA E D TF WD+K  G +ILL++ 
Sbjct: 257 YSGYQDELLWGAAWLQKATKNIKYLNYIKINGQILGAAEYDNTFGWDNKHAGARILLTKA 316

Query: 304 FLVQSSEEFQIYKAHSDNYICSLIPGTSTSSGQYTPGGLFFKGSESNLQYVTSAAFLLLT 363
           FLVQ+ +    YK H+DN+ICS+IPG   SS QYTPGGL FK +++N+QYVTS +FLLLT
Sbjct: 317 FLVQNVKTLHEYKGHADNFICSVIPGAPFSSTQYTPGGLLFKMADANMQYVTSTSFLLLT 376

Query: 364 YAKYLSSSGGSIRCGTSRISPEDLIAQAKKQVDYILGENPEKMSYMVGFGERYPQHIHHR 423
           YAKYL+S+   + CG S  +P  L + AK+QVDY+LG+NP +MSYMVG+G ++P+ IHHR
Sbjct: 377 YAKYLTSAKTVVHCGGSVYTPGRLRSIAKRQVDYLLGDNPLRMSYMVGYGPKFPRRIHHR 436

Query: 424 GSSVPSLHSHPNRVSCNDGFQFLYSSSPNPNLLLGAIVGGPDNGDKFSDDRNNYQQSEPA 483
           GSS+P + SHP ++ C+ GF  + S SPNPN L+GA+VGGPD  D+F D+R++Y+QSEPA
Sbjct: 437 GSSLPCVASHPAKIQCHQGFAIMNSQSPNPNFLVGAVVGGPDQHDRFPDERSDYEQSEPA 496

Query: 484 TYINAPLVGALAFFA 487
           TYIN+PLVGALA+FA
Sbjct: 497 TYINSPLVGALAYFA 510

BLAST of CSPI03G00790 vs. Swiss-Prot
Match: GUN1_ARATH (Endoglucanase 1 OS=Arabidopsis thaliana GN=CEL2 PE=2 SV=1)

HSP 1 Score: 622.5 bits (1604), Expect = 4.0e-177
Identity = 307/491 (62.53%), Positives = 380/491 (77.39%), Query Frame = 1

Query: 9   LYFILSL---FTLSSSAFTSQH--------YSTALQYSILFFEGQRSGKLPSNQRLTWRA 68
           L FIL L   F+ SSS  +  H        Y  AL  SILFFEGQRSGKLP NQR+TWR+
Sbjct: 14  LSFILLLSNGFSSSSSRPSIHHRHHLDNHNYKDALSKSILFFEGQRSGKLPPNQRMTWRS 73

Query: 69  DSALSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENARAAV 128
           +S LSDGS+ +VDLVGGYYDAGDN+KFG PMAFTTT+L+WS+IEFG  M +E+ NA+ A+
Sbjct: 74  NSGLSDGSALNVDLVGGYYDAGDNMKFGFPMAFTTTMLSWSLIEFGGLMKSELPNAKDAI 133

Query: 129 RWGSDYLLKAATAAPDVLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVAAET 188
           RW +D+LLKA T+ PD +YVQVGDPN+DH CWERPEDMDTPR+V+K+   NPGSD+A E 
Sbjct: 134 RWATDFLLKA-TSHPDTIYVQVGDPNMDHACWERPEDMDTPRSVFKVDKNNPGSDIAGEI 193

Query: 189 AAALAAASIVFKASDPSYSNKLLDAALKVFDLADKHRGSYSDSLHSVVCPFYCSYSGYND 248
           AAALAAASIVF+  DPSYSN LL  A+ VF  ADK+RG YS  L   VCPFYCSYSGY D
Sbjct: 194 AAALAAASIVFRKCDPSYSNHLLQRAITVFTFADKYRGPYSAGLAPEVCPFYCSYSGYQD 253

Query: 249 ELLWAASWVYKASKNSIHLSYIQSNGHILGAEEDDYTFSWDDKRPGTKILLSQDFLVQSS 308
           ELLW A+W+ KA+ N  +L+YI++NG ILGA+E D  FSWD+K  G +ILLS++FL+Q  
Sbjct: 254 ELLWGAAWLQKATNNPTYLNYIKANGQILGADEFDNMFSWDNKHVGARILLSKEFLIQKV 313

Query: 309 EEFQIYKAHSDNYICSLIPGTSTSSGQYTPGGLFFKGSESNLQYVTSAAFLLLTYAKYLS 368
           +  + YK H+D++ICS++PG S+S  QYTPGGL FK  ESN+QYVTS +FLLLTYAKYL+
Sbjct: 314 KSLEEYKEHADSFICSVLPGASSS--QYTPGGLLFKMGESNMQYVTSTSFLLLTYAKYLT 373

Query: 369 SSGGSIRCGTSRISPEDLIAQAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPS 428
           S+     CG S ++P  L + AKKQVDY+LG NP KMSYMVG+G +YP+ IHHRGSS+PS
Sbjct: 374 SARTVAYCGGSVVTPARLRSIAKKQVDYLLGGNPLKMSYMVGYGLKYPRRIHHRGSSLPS 433

Query: 429 LHSHPNRVSCNDGFQFLYSSSPNPNLLLGAIVGGPDNGDKFSDDRNNYQQSEPATYINAP 488
           +  HP R+ C+DGF    S SPNPN L+GA+VGGPD  D+F D+R++Y +SEPATYINAP
Sbjct: 434 VAVHPTRIQCHDGFSLFTSQSPNPNDLVGAVVGGPDQNDQFPDERSDYGRSEPATYINAP 493

BLAST of CSPI03G00790 vs. TrEMBL
Match: E5RDC9_CUCME (Endoglucanase OS=Cucumis melo subsp. melo PE=3 SV=1)

HSP 1 Score: 972.2 bits (2512), Expect = 2.3e-280
Identity = 481/490 (98.16%), Positives = 487/490 (99.39%), Query Frame = 1

Query: 1   MSFSITLALYFILSLFTL-SSSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRADS 60
           MSFSITLALYFILSLFTL SSSAFTS+HYSTALQYSILFFEGQRSGKLPSNQRLTWRADS
Sbjct: 1   MSFSITLALYFILSLFTLSSSSAFTSEHYSTALQYSILFFEGQRSGKLPSNQRLTWRADS 60

Query: 61  ALSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENARAAVRW 120
            LSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENARAAVRW
Sbjct: 61  GLSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENARAAVRW 120

Query: 121 GSDYLLKAATAAPDVLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVAAETAA 180
           GSDYLLKAATAAPDVLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVAAETAA
Sbjct: 121 GSDYLLKAATAAPDVLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVAAETAA 180

Query: 181 ALAAASIVFKASDPSYSNKLLDAALKVFDLADKHRGSYSDSLHSVVCPFYCSYSGYNDEL 240
           ALAAASIVFKASDPSYSNKLLDAALKVFDLADKHRGSYSDSLHSVVCPFYCSYSGYNDEL
Sbjct: 181 ALAAASIVFKASDPSYSNKLLDAALKVFDLADKHRGSYSDSLHSVVCPFYCSYSGYNDEL 240

Query: 241 LWAASWVYKASKNSIHLSYIQSNGHILGAEEDDYTFSWDDKRPGTKILLSQDFLVQSSEE 300
           LWAASW+YKASKNSIHLSYIQ+NGHILGAEEDDYTFSWDDKRPGTKILLSQDFLVQSSEE
Sbjct: 241 LWAASWIYKASKNSIHLSYIQANGHILGAEEDDYTFSWDDKRPGTKILLSQDFLVQSSEE 300

Query: 301 FQIYKAHSDNYICSLIPGTSTSSGQYTPGGLFFKGSESNLQYVTSAAFLLLTYAKYLSSS 360
           FQIYKAHSDNYICSLIPGTSTSSGQYTPGGLFFKGSESNLQYVTSAAFLLLTYAKYLSS+
Sbjct: 301 FQIYKAHSDNYICSLIPGTSTSSGQYTPGGLFFKGSESNLQYVTSAAFLLLTYAKYLSSN 360

Query: 361 GGSIRCGTSRISPEDLIAQAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSLH 420
           GGSIRCGTSRISP+DLIAQAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSLH
Sbjct: 361 GGSIRCGTSRISPQDLIAQAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSLH 420

Query: 421 SHPNRVSCNDGFQFLYSSSPNPNLLLGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPLV 480
           +HPNRVSCNDGFQFLYSSSPNPNLLLGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPLV
Sbjct: 421 AHPNRVSCNDGFQFLYSSSPNPNLLLGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPLV 480

Query: 481 GALAFFAKTT 490
           GALAFF KTT
Sbjct: 481 GALAFFTKTT 490

BLAST of CSPI03G00790 vs. TrEMBL
Match: B9GK69_POPTR (Endoglucanase OS=Populus trichocarpa GN=GH9B14 PE=2 SV=1)

HSP 1 Score: 803.1 bits (2073), Expect = 1.8e-229
Identity = 380/486 (78.19%), Positives = 434/486 (89.30%), Query Frame = 1

Query: 2   SFSITLALYFI-LSLFTLSSSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRADSA 61
           +FS+ L  +FI     +  S AFTSQ Y+ AL+ SILFFEGQRSGKLPSNQRLTWR DS 
Sbjct: 6   TFSLMLQFFFITFCCLSYFSFAFTSQDYANALEKSILFFEGQRSGKLPSNQRLTWRGDSG 65

Query: 62  LSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENARAAVRWG 121
           LSDGS+YHV+LVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFG SM N+IENA+AA+RW 
Sbjct: 66  LSDGSTYHVNLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGSSMQNQIENAKAAIRWS 125

Query: 122 SDYLLKAATAAPDVLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVAAETAAA 181
           +DYLLKAATA PD LYVQVGDPN+DH+CWERPEDMDTPR VYK+T QNPGSDVAAETAAA
Sbjct: 126 TDYLLKAATATPDTLYVQVGDPNMDHRCWERPEDMDTPRNVYKVTIQNPGSDVAAETAAA 185

Query: 182 LAAASIVFKASDPSYSNKLLDAALKVFDLADKHRGSYSDSLHSVVCPFYCSYSGYNDELL 241
           LAAASIVFK SDPSYS KLL  A+KVFD AD++RGSYS+SL+SVVCPFYCSYSGY DELL
Sbjct: 186 LAAASIVFKESDPSYSTKLLHTAMKVFDFADRYRGSYSNSLNSVVCPFYCSYSGYQDELL 245

Query: 242 WAASWVYKASKNSIHLSYIQSNGHILGAEEDDYTFSWDDKRPGTKILLSQDFLVQSSEEF 301
           W ASW+++AS+N  +L+YIQSNGH +G+++DDY+FSWDDKRPGTKILLS++FL +++EEF
Sbjct: 246 WGASWIHRASQNGSYLTYIQSNGHTMGSDDDDYSFSWDDKRPGTKILLSKEFLEKTTEEF 305

Query: 302 QIYKAHSDNYICSLIPGTSTSSGQYTPGGLFFKGSESNLQYVTSAAFLLLTYAKYLSSSG 361
           Q+YK+HSDNYICSLIPGTS+   QYTPGGLF+K SESNLQYVTS  FLLLTYAKYL S+G
Sbjct: 306 QLYKSHSDNYICSLIPGTSSFQAQYTPGGLFYKASESNLQYVTSTTFLLLTYAKYLGSNG 365

Query: 362 GSIRCGTSRISPEDLIAQAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSLHS 421
           G  RCG S ++ E LIAQAKKQVDYILG+NP +MSYMVGFG RYPQH+HHRGSSVPS+H+
Sbjct: 366 GVARCGGSTVTAESLIAQAKKQVDYILGDNPARMSYMVGFGNRYPQHVHHRGSSVPSIHA 425

Query: 422 HPNRVSCNDGFQFLYSSSPNPNLLLGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPLVG 481
           HPNR+SCNDGFQFLYSSSPNPN+L+GAI+GGPDN D F+DDRNNYQQSEPATYINAP VG
Sbjct: 426 HPNRISCNDGFQFLYSSSPNPNVLVGAIIGGPDNRDNFADDRNNYQQSEPATYINAPFVG 485

Query: 482 ALAFFA 487
           ALAFF+
Sbjct: 486 ALAFFS 491

BLAST of CSPI03G00790 vs. TrEMBL
Match: Q9AVI5_POPAL (Endoglucanase OS=Populus alba GN=PopCel1 PE=3 SV=1)

HSP 1 Score: 795.0 bits (2052), Expect = 5.0e-227
Identity = 375/486 (77.16%), Positives = 431/486 (88.68%), Query Frame = 1

Query: 2   SFSITLALYFI-LSLFTLSSSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRADSA 61
           +FS+ L  +F+     +  S AFTSQ Y+ AL+  ILFFEGQRSGKLPSNQRL WR DS 
Sbjct: 6   TFSLMLQFFFVTFCCLSYFSFAFTSQDYANALEKPILFFEGQRSGKLPSNQRLAWRGDSG 65

Query: 62  LSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENARAAVRWG 121
           LSDGS+YHV+LVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFG SM N+IENA+AA+RW 
Sbjct: 66  LSDGSTYHVNLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGSSMQNQIENAKAAIRWS 125

Query: 122 SDYLLKAATAAPDVLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVAAETAAA 181
           +DYLLKAATA PD LYVQVG+PN+DH+CWERPEDMDTPR VYK+T  NPGSDVAAETAAA
Sbjct: 126 TDYLLKAATATPDTLYVQVGNPNMDHRCWERPEDMDTPRNVYKVTIHNPGSDVAAETAAA 185

Query: 182 LAAASIVFKASDPSYSNKLLDAALKVFDLADKHRGSYSDSLHSVVCPFYCSYSGYNDELL 241
           LAAASIVFK SDPSYS KLL  A+KVFD AD++RGSYS+SL+SVVCPFYCSYSGY DELL
Sbjct: 186 LAAASIVFKESDPSYSTKLLHTAMKVFDFADRYRGSYSNSLNSVVCPFYCSYSGYQDELL 245

Query: 242 WAASWVYKASKNSIHLSYIQSNGHILGAEEDDYTFSWDDKRPGTKILLSQDFLVQSSEEF 301
           W ASW+++AS+N  +L+YIQSNGH +G+++DDY+FSWDDKRPGTKILLS++FL +++EEF
Sbjct: 246 WGASWIHRASQNGSYLTYIQSNGHTMGSDDDDYSFSWDDKRPGTKILLSKEFLEKTTEEF 305

Query: 302 QIYKAHSDNYICSLIPGTSTSSGQYTPGGLFFKGSESNLQYVTSAAFLLLTYAKYLSSSG 361
           Q+YK+HSDNYICSLIPGTS+   QYTPGGLF+K SESNLQYVTS  FLLLTYAKYL S+G
Sbjct: 306 QLYKSHSDNYICSLIPGTSSFQAQYTPGGLFYKASESNLQYVTSTTFLLLTYAKYLGSNG 365

Query: 362 GSIRCGTSRISPEDLIAQAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSLHS 421
           G  RCG S ++ E LIAQAKKQVDYILG+NP +MSYMVGFG RYPQH+HHRGSSVPS+H+
Sbjct: 366 GVARCGGSTVTTESLIAQAKKQVDYILGDNPARMSYMVGFGNRYPQHVHHRGSSVPSIHA 425

Query: 422 HPNRVSCNDGFQFLYSSSPNPNLLLGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPLVG 481
           HPNR+SCNDGFQFLYSSSPNPN+L+GAI+GGPDN D F+DDRNNYQQSEPATYINAP VG
Sbjct: 426 HPNRISCNDGFQFLYSSSPNPNVLVGAIIGGPDNRDNFADDRNNYQQSEPATYINAPFVG 485

Query: 482 ALAFFA 487
           ALAFF+
Sbjct: 486 ALAFFS 491

BLAST of CSPI03G00790 vs. TrEMBL
Match: L0AUV0_POPTO (Endoglucanase OS=Populus tomentosa PE=3 SV=1)

HSP 1 Score: 794.7 bits (2051), Expect = 6.6e-227
Identity = 375/486 (77.16%), Positives = 432/486 (88.89%), Query Frame = 1

Query: 2   SFSITLALYFI-LSLFTLSSSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRADSA 61
           +FS+ L  +FI     +  S AFTSQ Y+ AL+ SILFFEGQRSGKLPSNQRLTWR DS 
Sbjct: 6   TFSLMLQFFFITFCCLSYFSFAFTSQDYANALEKSILFFEGQRSGKLPSNQRLTWRGDSG 65

Query: 62  LSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENARAAVRWG 121
           LSDGS+YHV+LVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFG SM N+I NA+AA+RW 
Sbjct: 66  LSDGSTYHVNLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGSSMQNQITNAKAAIRWS 125

Query: 122 SDYLLKAATAAPDVLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVAAETAAA 181
           +DYLLKAATA PD LY+QVGDPN+DH+CWERPEDMDTPR VYK+T QNPGSDVAAETAAA
Sbjct: 126 TDYLLKAATATPDTLYIQVGDPNMDHRCWERPEDMDTPRNVYKVTIQNPGSDVAAETAAA 185

Query: 182 LAAASIVFKASDPSYSNKLLDAALKVFDLADKHRGSYSDSLHSVVCPFYCSYSGYNDELL 241
           LAAASIVFK SDPSYS KLL  A+KVFD AD++RGSYS+SL+SVVCPFYCSYSGY DELL
Sbjct: 186 LAAASIVFKESDPSYSTKLLHTAMKVFDFADRYRGSYSNSLNSVVCPFYCSYSGYQDELL 245

Query: 242 WAASWVYKASKNSIHLSYIQSNGHILGAEEDDYTFSWDDKRPGTKILLSQDFLVQSSEEF 301
           W ASW++KAS N  +L+YIQSNGH +G+++DDY+FSWDDKRPGTKILLS++FL +++EEF
Sbjct: 246 WGASWLHKASLNGTYLAYIQSNGHTMGSDDDDYSFSWDDKRPGTKILLSKEFLEKTTEEF 305

Query: 302 QIYKAHSDNYICSLIPGTSTSSGQYTPGGLFFKGSESNLQYVTSAAFLLLTYAKYLSSSG 361
           QIYK+HSDNYICSL+PG+S+   QYTPGGLF+K +ESNLQYVTS  FLLLTYAKYL S+G
Sbjct: 306 QIYKSHSDNYICSLMPGSSSFQAQYTPGGLFYKATESNLQYVTSTTFLLLTYAKYLGSNG 365

Query: 362 GSIRCGTSRISPEDLIAQAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSLHS 421
           G  +CG S ++ E LIAQAKKQVDYILG+NP KMSYMVGFG +YPQH+HHRGSSVPS+H+
Sbjct: 366 GVAKCGGSTVTAESLIAQAKKQVDYILGDNPAKMSYMVGFGNKYPQHVHHRGSSVPSIHA 425

Query: 422 HPNRVSCNDGFQFLYSSSPNPNLLLGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPLVG 481
           HPNR+SCNDGFQ+LYSSSPNPN+L+GAI+GGPDN D F+DDRNNYQQSEPATYINAP VG
Sbjct: 426 HPNRISCNDGFQYLYSSSPNPNVLVGAIIGGPDNRDNFADDRNNYQQSEPATYINAPFVG 485

Query: 482 ALAFFA 487
           ALAFF+
Sbjct: 486 ALAFFS 491

BLAST of CSPI03G00790 vs. TrEMBL
Match: Q9XIY8_POPAL (Endoglucanase OS=Populus alba GN=POPCEL2 PE=2 SV=1)

HSP 1 Score: 792.0 bits (2044), Expect = 4.3e-226
Identity = 377/485 (77.73%), Positives = 429/485 (88.45%), Query Frame = 1

Query: 3   FSITLALYFILSLF-TLSSSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRADSAL 62
           FS+ L + FI+    +  S+AFTSQ Y+ AL+ SILFFEGQRSGKLP NQRLTWR DS L
Sbjct: 7   FSLKLQILFIIFCCRSYFSTAFTSQDYADALEKSILFFEGQRSGKLPVNQRLTWRGDSGL 66

Query: 63  SDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENARAAVRWGS 122
           SDGS+YHV+LVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFG SM N+I NA AA+RW +
Sbjct: 67  SDGSAYHVNLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGSSMQNQITNAEAAIRWST 126

Query: 123 DYLLKAATAAPDVLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVAAETAAAL 182
           DYLLKAATA PD LYVQVGDPN+DH+CWERPEDMDTPR VYK+T QNPGSDVAAETAAAL
Sbjct: 127 DYLLKAATATPDTLYVQVGDPNMDHRCWERPEDMDTPRNVYKVTTQNPGSDVAAETAAAL 186

Query: 183 AAASIVFKASDPSYSNKLLDAALKVFDLADKHRGSYSDSLHSVVCPFYCSYSGYNDELLW 242
           AAASIVFK SDPSYS +LL AA KVFD AD+HRGSYSDSL S VCPFYCSYSGY DELLW
Sbjct: 187 AAASIVFKESDPSYSTELLHAATKVFDFADRHRGSYSDSLSSAVCPFYCSYSGYQDELLW 246

Query: 243 AASWVYKASKNSIHLSYIQSNGHILGAEEDDYTFSWDDKRPGTKILLSQDFLVQSSEEFQ 302
            ASW++KAS N  +L+YIQSNGH +G+++DDY+FSWDDKRPGTKILLS++FL +++EEFQ
Sbjct: 247 GASWLHKASLNGTYLAYIQSNGHTMGSDDDDYSFSWDDKRPGTKILLSKEFLDKTTEEFQ 306

Query: 303 IYKAHSDNYICSLIPGTSTSSGQYTPGGLFFKGSESNLQYVTSAAFLLLTYAKYLSSSGG 362
           IYK+HSDNYICSL+PG+S+   QYTPGGLF+K +ESNLQYVTS  FLLLTYAKYL S+GG
Sbjct: 307 IYKSHSDNYICSLMPGSSSFQAQYTPGGLFYKATESNLQYVTSTTFLLLTYAKYLGSNGG 366

Query: 363 SIRCGTSRISPEDLIAQAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSLHSH 422
             +CG S ++ E LIAQAKKQVDYILG+NP KMSYMVGFG +YPQH+HHRGSSVPS+H+H
Sbjct: 367 VAKCGGSTVTAESLIAQAKKQVDYILGDNPAKMSYMVGFGNKYPQHVHHRGSSVPSIHAH 426

Query: 423 PNRVSCNDGFQFLYSSSPNPNLLLGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPLVGA 482
           PNR+SCNDGFQ+LYSSSPNPN+L+GAIVGGPDN D F+DDRNNYQQSEPATYINAP VGA
Sbjct: 427 PNRISCNDGFQYLYSSSPNPNVLVGAIVGGPDNRDHFADDRNNYQQSEPATYINAPFVGA 486

Query: 483 LAFFA 487
           LAFF+
Sbjct: 487 LAFFS 491

BLAST of CSPI03G00790 vs. TAIR10
Match: AT4G02290.1 (AT4G02290.1 glycosyl hydrolase 9B13)

HSP 1 Score: 623.6 bits (1607), Expect = 1.0e-178
Identity = 305/495 (61.62%), Positives = 381/495 (76.97%), Query Frame = 1

Query: 4   SITLALYFILS---LFTLSSSAFTSQH---------YSTALQYSILFFEGQRSGKLPSNQ 63
           +I L+ +F L     +  +SS F + H         Y  AL  SILFFEGQRSGKLPSNQ
Sbjct: 17  TIFLSFFFFLCNGFSYPTTSSLFNTHHHRHHLAKHNYKDALTKSILFFEGQRSGKLPSNQ 76

Query: 64  RLTWRADSALSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIE 123
           R++WR DS LSDGS+ HVDLVGGYYDAGDN+KFG PMAFTTT+L+WSVIEFG  M +E++
Sbjct: 77  RMSWRRDSGLSDGSALHVDLVGGYYDAGDNIKFGFPMAFTTTMLSWSVIEFGGLMKSELQ 136

Query: 124 NARAAVRWGSDYLLKAATAAPDVLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGS 183
           NA+ A+RW +DYLLKA T+ PD +YVQVGD N DH CWERPEDMDT R+V+K+    PGS
Sbjct: 137 NAKIAIRWATDYLLKA-TSQPDTIYVQVGDANKDHSCWERPEDMDTVRSVFKVDKNIPGS 196

Query: 184 DVAAETAAALAAASIVFKASDPSYSNKLLDAALKVFDLADKHRGSYSDSLHSVVCPFYCS 243
           DVAAETAAALAAA+IVF+ SDPSYS  LL  A+ VF  ADK+RG+YS  L   VCPFYCS
Sbjct: 197 DVAAETAAALAAAAIVFRKSDPSYSKVLLKRAISVFAFADKYRGTYSAGLKPDVCPFYCS 256

Query: 244 YSGYNDELLWAASWVYKASKNSIHLSYIQSNGHILGAEEDDYTFSWDDKRPGTKILLSQD 303
           YSGY DELLW A+W+ KA+KN  +L+YI+ NG ILGA E D TF WD+K  G +ILL++ 
Sbjct: 257 YSGYQDELLWGAAWLQKATKNIKYLNYIKINGQILGAAEYDNTFGWDNKHAGARILLTKA 316

Query: 304 FLVQSSEEFQIYKAHSDNYICSLIPGTSTSSGQYTPGGLFFKGSESNLQYVTSAAFLLLT 363
           FLVQ+ +    YK H+DN+ICS+IPG   SS QYTPGGL FK +++N+QYVTS +FLLLT
Sbjct: 317 FLVQNVKTLHEYKGHADNFICSVIPGAPFSSTQYTPGGLLFKMADANMQYVTSTSFLLLT 376

Query: 364 YAKYLSSSGGSIRCGTSRISPEDLIAQAKKQVDYILGENPEKMSYMVGFGERYPQHIHHR 423
           YAKYL+S+   + CG S  +P  L + AK+QVDY+LG+NP +MSYMVG+G ++P+ IHHR
Sbjct: 377 YAKYLTSAKTVVHCGGSVYTPGRLRSIAKRQVDYLLGDNPLRMSYMVGYGPKFPRRIHHR 436

Query: 424 GSSVPSLHSHPNRVSCNDGFQFLYSSSPNPNLLLGAIVGGPDNGDKFSDDRNNYQQSEPA 483
           GSS+P + SHP ++ C+ GF  + S SPNPN L+GA+VGGPD  D+F D+R++Y+QSEPA
Sbjct: 437 GSSLPCVASHPAKIQCHQGFAIMNSQSPNPNFLVGAVVGGPDQHDRFPDERSDYEQSEPA 496

Query: 484 TYINAPLVGALAFFA 487
           TYIN+PLVGALA+FA
Sbjct: 497 TYINSPLVGALAYFA 510

BLAST of CSPI03G00790 vs. TAIR10
Match: AT1G02800.1 (AT1G02800.1 cellulase 2)

HSP 1 Score: 622.5 bits (1604), Expect = 2.3e-178
Identity = 307/491 (62.53%), Positives = 380/491 (77.39%), Query Frame = 1

Query: 9   LYFILSL---FTLSSSAFTSQH--------YSTALQYSILFFEGQRSGKLPSNQRLTWRA 68
           L FIL L   F+ SSS  +  H        Y  AL  SILFFEGQRSGKLP NQR+TWR+
Sbjct: 14  LSFILLLSNGFSSSSSRPSIHHRHHLDNHNYKDALSKSILFFEGQRSGKLPPNQRMTWRS 73

Query: 69  DSALSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENARAAV 128
           +S LSDGS+ +VDLVGGYYDAGDN+KFG PMAFTTT+L+WS+IEFG  M +E+ NA+ A+
Sbjct: 74  NSGLSDGSALNVDLVGGYYDAGDNMKFGFPMAFTTTMLSWSLIEFGGLMKSELPNAKDAI 133

Query: 129 RWGSDYLLKAATAAPDVLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVAAET 188
           RW +D+LLKA T+ PD +YVQVGDPN+DH CWERPEDMDTPR+V+K+   NPGSD+A E 
Sbjct: 134 RWATDFLLKA-TSHPDTIYVQVGDPNMDHACWERPEDMDTPRSVFKVDKNNPGSDIAGEI 193

Query: 189 AAALAAASIVFKASDPSYSNKLLDAALKVFDLADKHRGSYSDSLHSVVCPFYCSYSGYND 248
           AAALAAASIVF+  DPSYSN LL  A+ VF  ADK+RG YS  L   VCPFYCSYSGY D
Sbjct: 194 AAALAAASIVFRKCDPSYSNHLLQRAITVFTFADKYRGPYSAGLAPEVCPFYCSYSGYQD 253

Query: 249 ELLWAASWVYKASKNSIHLSYIQSNGHILGAEEDDYTFSWDDKRPGTKILLSQDFLVQSS 308
           ELLW A+W+ KA+ N  +L+YI++NG ILGA+E D  FSWD+K  G +ILLS++FL+Q  
Sbjct: 254 ELLWGAAWLQKATNNPTYLNYIKANGQILGADEFDNMFSWDNKHVGARILLSKEFLIQKV 313

Query: 309 EEFQIYKAHSDNYICSLIPGTSTSSGQYTPGGLFFKGSESNLQYVTSAAFLLLTYAKYLS 368
           +  + YK H+D++ICS++PG S+S  QYTPGGL FK  ESN+QYVTS +FLLLTYAKYL+
Sbjct: 314 KSLEEYKEHADSFICSVLPGASSS--QYTPGGLLFKMGESNMQYVTSTSFLLLTYAKYLT 373

Query: 369 SSGGSIRCGTSRISPEDLIAQAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPS 428
           S+     CG S ++P  L + AKKQVDY+LG NP KMSYMVG+G +YP+ IHHRGSS+PS
Sbjct: 374 SARTVAYCGGSVVTPARLRSIAKKQVDYLLGGNPLKMSYMVGYGLKYPRRIHHRGSSLPS 433

Query: 429 LHSHPNRVSCNDGFQFLYSSSPNPNLLLGAIVGGPDNGDKFSDDRNNYQQSEPATYINAP 488
           +  HP R+ C+DGF    S SPNPN L+GA+VGGPD  D+F D+R++Y +SEPATYINAP
Sbjct: 434 VAVHPTRIQCHDGFSLFTSQSPNPNDLVGAVVGGPDQNDQFPDERSDYGRSEPATYINAP 493

BLAST of CSPI03G00790 vs. TAIR10
Match: AT1G23210.1 (AT1G23210.1 glycosyl hydrolase 9B6)

HSP 1 Score: 568.9 bits (1465), Expect = 3.0e-162
Identity = 276/472 (58.47%), Positives = 347/472 (73.52%), Query Frame = 1

Query: 15  LFTLSSSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRADSALSDGSSYHVDLVGG 74
           L  +S   +    Y  AL+ SILFFEGQRSGKLP +QRL WR DSAL DGSS  VDL GG
Sbjct: 16  LLLISPETYAGHDYRDALRKSILFFEGQRSGKLPPDQRLKWRRDSALRDGSSAGVDLTGG 75

Query: 75  YYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENARAAVRWGSDYLLKAATAAPDV 134
           YYDAGDNVKFG PMAFTTT+++WSVI+FG +MG E+ENA  A++WG+DYL+KA T  PDV
Sbjct: 76  YYDAGDNVKFGFPMAFTTTMMSWSVIDFGKTMGPELENAVKAIKWGTDYLMKA-TQIPDV 135

Query: 135 LYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVAAETAAALAAASIVFKASDPS 194
           ++VQVGD   DH CWERPEDMDT RTVYKI   + GS+VA ETAAALAAASIVF+  DP 
Sbjct: 136 VFVQVGDAYSDHNCWERPEDMDTLRTVYKIDKDHSGSEVAGETAAALAAASIVFEKRDPV 195

Query: 195 YSNKLLDAALKVFDLADKHRGSYSDSLHSVVCPFYCSYSGYNDELLWAASWVYKASKNSI 254
           YS  LLD A +VF  A K+RG+YSDSL+  VCPFYC ++GY DELLW A+W++KASK  +
Sbjct: 196 YSKMLLDRATRVFAFAQKYRGAYSDSLYQAVCPFYCDFNGYEDELLWGAAWLHKASKKRV 255

Query: 255 HLSYIQSNGHILGAEEDDYTFSWDDKRPGTKILLSQDFLVQSSEEFQIYKAHSDNYICSL 314
           +  +I  N  IL A +  + F WD+K  G  +L+S+  L+  +E FQ +K ++D +ICSL
Sbjct: 256 YREFIVKNQVILRAGDTIHEFGWDNKHAGINVLVSKMVLMGKAEYFQSFKQNADEFICSL 315

Query: 315 IPGTSTSSGQYTPGGLFFKGSESNLQYVTSAAFLLLTYAKYLSSSGGSIRCGTSRISPED 374
           +PG S    QY+ GGL  K   SN+Q+VTS +FLLLTY+ YLS +   + CG    SP  
Sbjct: 316 LPGISHPQVQYSQGGLLVKSGGSNMQHVTSLSFLLLTYSNYLSHANKVVPCGEFTASPAL 375

Query: 375 LIAQAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSLHSHPNRVSCNDGFQFL 434
           L   AK+QVDYILG+NP KMSYMVG+G R+PQ IHHRGSSVPS+  HP+R+ C DG ++ 
Sbjct: 376 LRQVAKRQVDYILGDNPMKMSYMVGYGSRFPQKIHHRGSSVPSVVDHPDRIGCKDGSRYF 435

Query: 435 YSSSPNPNLLLGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPLVGALAFFA 487
           +S++PNPNLL+GA+VGGP+  D F D R  +Q +EP TYINAPL+G L +F+
Sbjct: 436 FSNNPNPNLLIGAVVGGPNITDDFPDSRPYFQLTEPTTYINAPLLGLLGYFS 486
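
The percentages in each HSP header follow directly from the residue counts over the aligned length (for the hit above, 276 identical and 347 similar positions out of 472). A minimal Python sketch of that arithmetic is given here; the function name `parse_hsp_stats` and the returned keys are illustrative assumptions and are not part of the database export. The pattern accepts both "Positives" and the "Postives" spelling seen in some exports.

```python
import re

# Pattern for an HSP summary line. "Posi?tives" tolerates the
# "Postives" misspelling as well as the correct "Positives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP summary line and recompute its percentages.

    Hypothetical helper for illustration only.
    """
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, ident_len, ident_pct, positives, pos_len, pos_pct = match.groups()
    return {
        "identical": int(identical),
        "positives": int(positives),
        "aligned_length": int(ident_len),
        # Recomputed from the counts, rounded to two decimals like the report.
        "identity_pct": round(100 * int(identical) / int(ident_len), 2),
        "positives_pct": round(100 * int(positives) / int(pos_len), 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 276/472 (58.47%), Positives = 347/472 (73.52%), Query Frame = 1"
)
```

Comparing `identity_pct` with `reported_identity_pct` is a quick consistency check when alignment headers are copied by hand.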

BLAST of CSPI03G00790 vs. TAIR10
Match: AT1G22880.1 (AT1G22880.1 cellulase 5)

HSP 1 Score: 563.9 bits (1452), Expect = 9.5e-161
Identity = 272/476 (57.14%), Positives = 353/476 (74.16%), Query Frame = 1

Query: 11  FILSLFTLSSSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRADSALSDGSSYHVD 70
           F+LS  +L ++ + S +Y  AL  S+LFF+GQRSG+LPS+Q+L+WR+ S LSDGSS HVD
Sbjct: 9   FLLSALSLENT-YASPNYREALSKSLLFFQGQRSGRLPSDQQLSWRSSSGLSDGSSAHVD 68

Query: 71  LVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENARAAVRWGSDYLLKAATA 130
           L GGYYDAGDNVKF  PMAFTTT+L+WS +E+G  MG E++N+R A+RW +DYLLK A A
Sbjct: 69  LTGGYYDAGDNVKFNFPMAFTTTMLSWSSLEYGKKMGPELQNSRVAIRWATDYLLKCARA 128

Query: 131 APDVLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVAAETAAALAAASIVFKA 190
            P  LYV VGDPN DHKCWERPEDMDTPRTVY ++  NPGSDVAAETAAALAA+S+VF+ 
Sbjct: 129 TPGKLYVGVGDPNGDHKCWERPEDMDTPRTVYSVSPSNPGSDVAAETAAALAASSMVFRK 188

Query: 191 SDPSYSNKLLDAALKVFDLADKHRGSYSDSLHSVVCPFYCSYSGYNDELLWAASWVYKAS 250
            DP YS  LL  A KV   A ++RG+YS+SL S VCPFYCSYSGY DELLW A+W+++A+
Sbjct: 189 VDPKYSRLLLATAKKVMQFAIQYRGAYSNSLSSSVCPFYCSYSGYKDELLWGAAWLHRAT 248

Query: 251 KNSIHLSYIQSNGHILGAEEDDYTFSWDDKRPGTKILLSQDFLVQSSEEFQIYKAHSDNY 310
            +  + ++I+S G   G ++ D  FSWD+K  G  +LLS+  ++     F++YK  ++N+
Sbjct: 249 NDPYYTNFIKSLG---GGDQPD-IFSWDNKYAGAYVLLSRRAVLNKDNNFELYKQAAENF 308

Query: 311 ICSLIPGTSTSSGQYTPGGLFFKGSESNLQYVTSAAFLLLTYAKYLSSSGGSIRCGTSRI 370
           +C ++P + +SS +YT GGL +K  +SNLQYVTS  FLL TYAKY+ S+  +  CG S I
Sbjct: 309 MCKILPNSPSSSTKYTKGGLMYKLPQSNLQYVTSITFLLTTYAKYMKSTKQTFNCGNSLI 368

Query: 371 SPEDLIAQAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSLHSHPNRVSCNDG 430
            P  LI  +K+QVDY+LG NP KMSYMVGF   +P+ IHHRGSS+PS     N + CN G
Sbjct: 369 VPNALINLSKRQVDYVLGVNPMKMSYMVGFSSNFPKRIHHRGSSLPSRAVRSNSLGCNGG 428

Query: 431 FQFLYSSSPNPNLLLGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPLVGALAFFA 487
           FQ   + +PNPN+L GAIVGGP+  D++ D R++Y +SEPATYINA  VG LA+FA
Sbjct: 429 FQSFRTQNPNPNILTGAIVGGPNQNDEYPDQRDDYTRSEPATYINAAFVGPLAYFA 479

BLAST of CSPI03G00790 vs. TAIR10
Match: AT1G71380.1 (AT1G71380.1 cellulase 3)

HSP 1 Score: 561.2 bits (1445), Expect = 6.2e-160
Identity = 275/481 (57.17%), Positives = 350/481 (72.77%), Query Frame = 1

Query: 8   ALYFILSLFT--LSSSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRADSALSDGS 67
           +L+F + LF+  L S+   + +Y  AL  S+LFF+GQRSG LP  Q+++WRA S LSDGS
Sbjct: 3   SLFFFVLLFSSLLISNGDANPNYKEALSKSLLFFQGQRSGPLPRGQQISWRASSGLSDGS 62

Query: 68  SYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENARAAVRWGSDYLL 127
           + HVDL GGYYDAGDNVKF LPMAFTTT+L+WS +E+G  MG E+ENAR  +RW +DYLL
Sbjct: 63  AAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWSALEYGKRMGPELENARVNIRWATDYLL 122

Query: 128 KAATAAPDVLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVAAETAAALAAAS 187
           K A A P  LYV VGDPN+DHKCWERPEDMDTPRTVY ++A NPGSDVAAETAAALAAAS
Sbjct: 123 KCARATPGKLYVGVGDPNVDHKCWERPEDMDTPRTVYSVSASNPGSDVAAETAAALAAAS 182

Query: 188 IVFKASDPSYSNKLLDAALKVFDLADKHRGSYSDSLHSVVCPFYCSYSGYNDELLWAASW 247
           +VF+  D  YS  LL  A  V   A +++G+YSDSL S VCPFYCSYSGY DEL+W ASW
Sbjct: 183 MVFRKVDSKYSRLLLATAKDVMQFAIQYQGAYSDSLSSSVCPFYCSYSGYKDELMWGASW 242

Query: 248 VYKASKNSIHLSYIQSNGHILGAEEDDYTFSWDDKRPGTKILLSQDFLVQSSEEFQIYKA 307
           + +A+ N  + ++I+S G   G ++ D  FSWD+K  G  +LLS+  L+     F+ YK 
Sbjct: 243 LLRATNNPYYANFIKSLG---GGDQPD-IFSWDNKYAGAYVLLSRRALLNKDSNFEQYKQ 302

Query: 308 HSDNYICSLIPGTSTSSGQYTPGGLFFKGSESNLQYVTSAAFLLLTYAKYLSSSGGSIRC 367
            ++N+IC ++P + +SS QYT GGL +K  +SNLQYVTS  FLL TYAKY+ ++  +  C
Sbjct: 303 AAENFICKILPDSPSSSTQYTQGGLMYKLPQSNLQYVTSITFLLTTYAKYMKATKHTFNC 362

Query: 368 GTSRISPEDLIAQAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSLHSHPNRV 427
           G+S I P  LI+ +K+QVDYILG+NP KMSYMVGF   +P+ IHHR SS+PS       +
Sbjct: 363 GSSVIVPNALISLSKRQVDYILGDNPIKMSYMVGFSSNFPKRIHHRASSLPSHALRSQSL 422

Query: 428 SCNDGFQFLYSSSPNPNLLLGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPLVGALAFF 487
            CN GFQ  Y+ +PNPN+L GAIVGGP+  D + D R++Y  +EPATYINA  VG LA+F
Sbjct: 423 GCNGGFQSFYTQNPNPNILTGAIVGGPNQNDGYPDQRDDYSHAEPATYINAAFVGPLAYF 479

BLAST of CSPI03G00790 vs. NCBI nr
Match: gi|449456799|ref|XP_004146136.1| (PREDICTED: endoglucanase 1 [Cucumis sativus])

HSP 1 Score: 984.9 bits (2545), Expect = 4.9e-284
Identity = 488/489 (99.80%), Positives = 489/489 (100.00%), Query Frame = 1

Query: 1   MSFSITLALYFILSLFTLSSSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRADSA 60
           MSFSITLALYFILSLFTLSSSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRADSA
Sbjct: 1   MSFSITLALYFILSLFTLSSSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRADSA 60

Query: 61  LSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENARAAVRWG 120
           LSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENARAAVRWG
Sbjct: 61  LSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENARAAVRWG 120

Query: 121 SDYLLKAATAAPDVLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVAAETAAA 180
           SDYLLKAATAAPDVLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVAAETAAA
Sbjct: 121 SDYLLKAATAAPDVLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVAAETAAA 180

Query: 181 LAAASIVFKASDPSYSNKLLDAALKVFDLADKHRGSYSDSLHSVVCPFYCSYSGYNDELL 240
           LAAASIVFKASDPSYSNKLLDAALKVFDLADKHRGSYSDSLHSVVCPFYCSYSGYNDELL
Sbjct: 181 LAAASIVFKASDPSYSNKLLDAALKVFDLADKHRGSYSDSLHSVVCPFYCSYSGYNDELL 240

Query: 241 WAASWVYKASKNSIHLSYIQSNGHILGAEEDDYTFSWDDKRPGTKILLSQDFLVQSSEEF 300
           WAASWVYKASKNSIHLSYIQSNGHILGAEEDDYTFSWDDKRPGTKILLSQDFLVQSSEEF
Sbjct: 241 WAASWVYKASKNSIHLSYIQSNGHILGAEEDDYTFSWDDKRPGTKILLSQDFLVQSSEEF 300

Query: 301 QIYKAHSDNYICSLIPGTSTSSGQYTPGGLFFKGSESNLQYVTSAAFLLLTYAKYLSSSG 360
           QIYKAHSDNYICSLIPGTSTSSGQYTPGGLFFKGSESNLQYVTSAAFLLLTYAKYLSS+G
Sbjct: 301 QIYKAHSDNYICSLIPGTSTSSGQYTPGGLFFKGSESNLQYVTSAAFLLLTYAKYLSSNG 360

Query: 361 GSIRCGTSRISPEDLIAQAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSLHS 420
           GSIRCGTSRISPEDLIAQAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSLHS
Sbjct: 361 GSIRCGTSRISPEDLIAQAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSLHS 420

Query: 421 HPNRVSCNDGFQFLYSSSPNPNLLLGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPLVG 480
           HPNRVSCNDGFQFLYSSSPNPNLLLGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPLVG
Sbjct: 421 HPNRVSCNDGFQFLYSSSPNPNLLLGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPLVG 480

Query: 481 ALAFFAKTT 490
           ALAFFAKTT
Sbjct: 481 ALAFFAKTT 489

BLAST of CSPI03G00790 vs. NCBI nr
Match: gi|659095397|ref|XP_008448558.1| (PREDICTED: endoglucanase CX-like [Cucumis melo])

HSP 1 Score: 972.2 bits (2512), Expect = 3.3e-280
Identity = 481/490 (98.16%), Positives = 487/490 (99.39%), Query Frame = 1

Query: 1   MSFSITLALYFILSLFTL-SSSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRADS 60
           MSFSITLALYFILSLFTL SSSAFTS+HYSTALQYSILFFEGQRSGKLPSNQRLTWRADS
Sbjct: 1   MSFSITLALYFILSLFTLSSSSAFTSEHYSTALQYSILFFEGQRSGKLPSNQRLTWRADS 60

Query: 61  ALSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENARAAVRW 120
            LSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENARAAVRW
Sbjct: 61  GLSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENARAAVRW 120

Query: 121 GSDYLLKAATAAPDVLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVAAETAA 180
           GSDYLLKAATAAPDVLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVAAETAA
Sbjct: 121 GSDYLLKAATAAPDVLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVAAETAA 180

Query: 181 ALAAASIVFKASDPSYSNKLLDAALKVFDLADKHRGSYSDSLHSVVCPFYCSYSGYNDEL 240
           ALAAASIVFKASDPSYSNKLLDAALKVFDLADKHRGSYSDSLHSVVCPFYCSYSGYNDEL
Sbjct: 181 ALAAASIVFKASDPSYSNKLLDAALKVFDLADKHRGSYSDSLHSVVCPFYCSYSGYNDEL 240

Query: 241 LWAASWVYKASKNSIHLSYIQSNGHILGAEEDDYTFSWDDKRPGTKILLSQDFLVQSSEE 300
           LWAASW+YKASKNSIHLSYIQ+NGHILGAEEDDYTFSWDDKRPGTKILLSQDFLVQSSEE
Sbjct: 241 LWAASWIYKASKNSIHLSYIQANGHILGAEEDDYTFSWDDKRPGTKILLSQDFLVQSSEE 300

Query: 301 FQIYKAHSDNYICSLIPGTSTSSGQYTPGGLFFKGSESNLQYVTSAAFLLLTYAKYLSSS 360
           FQIYKAHSDNYICSLIPGTSTSSGQYTPGGLFFKGSESNLQYVTSAAFLLLTYAKYLSS+
Sbjct: 301 FQIYKAHSDNYICSLIPGTSTSSGQYTPGGLFFKGSESNLQYVTSAAFLLLTYAKYLSSN 360

Query: 361 GGSIRCGTSRISPEDLIAQAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSLH 420
           GGSIRCGTSRISP+DLIAQAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSLH
Sbjct: 361 GGSIRCGTSRISPQDLIAQAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSLH 420

Query: 421 SHPNRVSCNDGFQFLYSSSPNPNLLLGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPLV 480
           +HPNRVSCNDGFQFLYSSSPNPNLLLGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPLV
Sbjct: 421 AHPNRVSCNDGFQFLYSSSPNPNLLLGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPLV 480

Query: 481 GALAFFAKTT 490
           GALAFF KTT
Sbjct: 481 GALAFFTKTT 490

BLAST of CSPI03G00790 vs. NCBI nr
Match: gi|224057986|ref|XP_002299423.1| (hypothetical protein POPTR_0001s11430g [Populus trichocarpa])

HSP 1 Score: 803.1 bits (2073), Expect = 2.6e-229
Identity = 380/486 (78.19%), Positives = 434/486 (89.30%), Query Frame = 1

Query: 2   SFSITLALYFI-LSLFTLSSSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRADSA 61
           +FS+ L  +FI     +  S AFTSQ Y+ AL+ SILFFEGQRSGKLPSNQRLTWR DS 
Sbjct: 6   TFSLMLQFFFITFCCLSYFSFAFTSQDYANALEKSILFFEGQRSGKLPSNQRLTWRGDSG 65

Query: 62  LSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENARAAVRWG 121
           LSDGS+YHV+LVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFG SM N+IENA+AA+RW 
Sbjct: 66  LSDGSTYHVNLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGSSMQNQIENAKAAIRWS 125

Query: 122 SDYLLKAATAAPDVLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVAAETAAA 181
           +DYLLKAATA PD LYVQVGDPN+DH+CWERPEDMDTPR VYK+T QNPGSDVAAETAAA
Sbjct: 126 TDYLLKAATATPDTLYVQVGDPNMDHRCWERPEDMDTPRNVYKVTIQNPGSDVAAETAAA 185

Query: 182 LAAASIVFKASDPSYSNKLLDAALKVFDLADKHRGSYSDSLHSVVCPFYCSYSGYNDELL 241
           LAAASIVFK SDPSYS KLL  A+KVFD AD++RGSYS+SL+SVVCPFYCSYSGY DELL
Sbjct: 186 LAAASIVFKESDPSYSTKLLHTAMKVFDFADRYRGSYSNSLNSVVCPFYCSYSGYQDELL 245

Query: 242 WAASWVYKASKNSIHLSYIQSNGHILGAEEDDYTFSWDDKRPGTKILLSQDFLVQSSEEF 301
           W ASW+++AS+N  +L+YIQSNGH +G+++DDY+FSWDDKRPGTKILLS++FL +++EEF
Sbjct: 246 WGASWIHRASQNGSYLTYIQSNGHTMGSDDDDYSFSWDDKRPGTKILLSKEFLEKTTEEF 305

Query: 302 QIYKAHSDNYICSLIPGTSTSSGQYTPGGLFFKGSESNLQYVTSAAFLLLTYAKYLSSSG 361
           Q+YK+HSDNYICSLIPGTS+   QYTPGGLF+K SESNLQYVTS  FLLLTYAKYL S+G
Sbjct: 306 QLYKSHSDNYICSLIPGTSSFQAQYTPGGLFYKASESNLQYVTSTTFLLLTYAKYLGSNG 365

Query: 362 GSIRCGTSRISPEDLIAQAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSLHS 421
           G  RCG S ++ E LIAQAKKQVDYILG+NP +MSYMVGFG RYPQH+HHRGSSVPS+H+
Sbjct: 366 GVARCGGSTVTAESLIAQAKKQVDYILGDNPARMSYMVGFGNRYPQHVHHRGSSVPSIHA 425

Query: 422 HPNRVSCNDGFQFLYSSSPNPNLLLGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPLVG 481
           HPNR+SCNDGFQFLYSSSPNPN+L+GAI+GGPDN D F+DDRNNYQQSEPATYINAP VG
Sbjct: 426 HPNRISCNDGFQFLYSSSPNPNVLVGAIIGGPDNRDNFADDRNNYQQSEPATYINAPFVG 485

Query: 482 ALAFFA 487
           ALAFF+
Sbjct: 486 ALAFFS 491

BLAST of CSPI03G00790 vs. NCBI nr
Match: gi|743842748|ref|XP_011026779.1| (PREDICTED: endoglucanase CX-like [Populus euphratica])

HSP 1 Score: 798.5 bits (2061), Expect = 6.5e-228
Identity = 378/486 (77.78%), Positives = 434/486 (89.30%), Query Frame = 1

Query: 2   SFSITLALYFI-LSLFTLSSSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRADSA 61
           +FS+ L  +FI     +  S AFTSQ Y+ AL+ SILFFEGQRSGKLPSNQRLTWR DS 
Sbjct: 6   TFSLMLQFFFITFCCLSYFSFAFTSQDYANALEKSILFFEGQRSGKLPSNQRLTWRGDSG 65

Query: 62  LSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENARAAVRWG 121
           LSDGS+YHV+LVGGYYDAGDNVKFGLPMAFTTTLL+WSVIEFG SM N+IENA+AA+RW 
Sbjct: 66  LSDGSTYHVNLVGGYYDAGDNVKFGLPMAFTTTLLSWSVIEFGSSMQNQIENAKAAIRWS 125

Query: 122 SDYLLKAATAAPDVLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVAAETAAA 181
           +DYLLKAATA PD LYVQVGDPN+DH+CWERPEDMDTPR VYK+T QNPGSDVAAETAAA
Sbjct: 126 TDYLLKAATATPDTLYVQVGDPNMDHRCWERPEDMDTPRNVYKVTIQNPGSDVAAETAAA 185

Query: 182 LAAASIVFKASDPSYSNKLLDAALKVFDLADKHRGSYSDSLHSVVCPFYCSYSGYNDELL 241
           LAAASIVFK SDPSYS+KLL  A+KVFD ADK+RGSYS+SL+SVVCPFYCSYSGY DELL
Sbjct: 186 LAAASIVFKESDPSYSSKLLHTAMKVFDFADKYRGSYSNSLNSVVCPFYCSYSGYQDELL 245

Query: 242 WAASWVYKASKNSIHLSYIQSNGHILGAEEDDYTFSWDDKRPGTKILLSQDFLVQSSEEF 301
           W ASW+++AS+N  +L+YIQSNGH +G+++DDY+FSWDDKRPGTKILLS++FL +++EEF
Sbjct: 246 WGASWIHRASQNRSYLTYIQSNGHTMGSDDDDYSFSWDDKRPGTKILLSKEFLEKTTEEF 305

Query: 302 QIYKAHSDNYICSLIPGTSTSSGQYTPGGLFFKGSESNLQYVTSAAFLLLTYAKYLSSSG 361
           Q+YK+HSDNYICSLIPGTS+   QYTPGGL +K SESNLQYVTS  FLLLTYAKYL S+G
Sbjct: 306 QLYKSHSDNYICSLIPGTSSFQAQYTPGGLSYKASESNLQYVTSTTFLLLTYAKYLGSNG 365

Query: 362 GSIRCGTSRISPEDLIAQAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSLHS 421
           G  RCG S ++ E LIAQAKKQVDYILG+NP +MSYMVGFG RYPQH+HHRGSSVPS+H+
Sbjct: 366 GVARCGGSTVTAESLIAQAKKQVDYILGDNPARMSYMVGFGNRYPQHVHHRGSSVPSIHA 425

Query: 422 HPNRVSCNDGFQFLYSSSPNPNLLLGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPLVG 481
           HPNRVSCNDGF+FLYSSSPNPN+L+GA++GGPDN D F+DDRNNYQQSEPATYINAP VG
Sbjct: 426 HPNRVSCNDGFKFLYSSSPNPNVLVGAVIGGPDNRDNFADDRNNYQQSEPATYINAPFVG 485

Query: 482 ALAFFA 487
           ALAFF+
Sbjct: 486 ALAFFS 491

BLAST of CSPI03G00790 vs. NCBI nr
Match: gi|13383303|dbj|BAB39482.1| (endo-1,4-beta glucanase [Populus alba])

HSP 1 Score: 795.0 bits (2052), Expect = 7.2e-227
Identity = 375/486 (77.16%), Positives = 431/486 (88.68%), Query Frame = 1

Query: 2   SFSITLALYFI-LSLFTLSSSAFTSQHYSTALQYSILFFEGQRSGKLPSNQRLTWRADSA 61
           +FS+ L  +F+     +  S AFTSQ Y+ AL+  ILFFEGQRSGKLPSNQRL WR DS 
Sbjct: 6   TFSLMLQFFFVTFCCLSYFSFAFTSQDYANALEKPILFFEGQRSGKLPSNQRLAWRGDSG 65

Query: 62  LSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGNEIENARAAVRWG 121
           LSDGS+YHV+LVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFG SM N+IENA+AA+RW 
Sbjct: 66  LSDGSTYHVNLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGSSMQNQIENAKAAIRWS 125

Query: 122 SDYLLKAATAAPDVLYVQVGDPNLDHKCWERPEDMDTPRTVYKITAQNPGSDVAAETAAA 181
           +DYLLKAATA PD LYVQVG+PN+DH+CWERPEDMDTPR VYK+T  NPGSDVAAETAAA
Sbjct: 126 TDYLLKAATATPDTLYVQVGNPNMDHRCWERPEDMDTPRNVYKVTIHNPGSDVAAETAAA 185

Query: 182 LAAASIVFKASDPSYSNKLLDAALKVFDLADKHRGSYSDSLHSVVCPFYCSYSGYNDELL 241
           LAAASIVFK SDPSYS KLL  A+KVFD AD++RGSYS+SL+SVVCPFYCSYSGY DELL
Sbjct: 186 LAAASIVFKESDPSYSTKLLHTAMKVFDFADRYRGSYSNSLNSVVCPFYCSYSGYQDELL 245

Query: 242 WAASWVYKASKNSIHLSYIQSNGHILGAEEDDYTFSWDDKRPGTKILLSQDFLVQSSEEF 301
           W ASW+++AS+N  +L+YIQSNGH +G+++DDY+FSWDDKRPGTKILLS++FL +++EEF
Sbjct: 246 WGASWIHRASQNGSYLTYIQSNGHTMGSDDDDYSFSWDDKRPGTKILLSKEFLEKTTEEF 305

Query: 302 QIYKAHSDNYICSLIPGTSTSSGQYTPGGLFFKGSESNLQYVTSAAFLLLTYAKYLSSSG 361
           Q+YK+HSDNYICSLIPGTS+   QYTPGGLF+K SESNLQYVTS  FLLLTYAKYL S+G
Sbjct: 306 QLYKSHSDNYICSLIPGTSSFQAQYTPGGLFYKASESNLQYVTSTTFLLLTYAKYLGSNG 365

Query: 362 GSIRCGTSRISPEDLIAQAKKQVDYILGENPEKMSYMVGFGERYPQHIHHRGSSVPSLHS 421
           G  RCG S ++ E LIAQAKKQVDYILG+NP +MSYMVGFG RYPQH+HHRGSSVPS+H+
Sbjct: 366 GVARCGGSTVTTESLIAQAKKQVDYILGDNPARMSYMVGFGNRYPQHVHHRGSSVPSIHA 425

Query: 422 HPNRVSCNDGFQFLYSSSPNPNLLLGAIVGGPDNGDKFSDDRNNYQQSEPATYINAPLVG 481
           HPNR+SCNDGFQFLYSSSPNPN+L+GAI+GGPDN D F+DDRNNYQQSEPATYINAP VG
Sbjct: 426 HPNRISCNDGFQFLYSSSPNPNVLVGAIIGGPDNRDNFADDRNNYQQSEPATYINAPFVG 485

Query: 482 ALAFFA 487
           ALAFF+
Sbjct: 486 ALAFFS 491

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
GUN1_PERAE   5.4e-214  73.16         Endoglucanase 1 OS=Persea americana GN=CEL1 PE=2 SV=1
GUN19_ORYSJ  1.7e-191  64.77         Endoglucanase 19 OS=Oryza sativa subsp. japonica GN=Os08g0114200 PE=2 SV=1
GUN4_ORYSJ   5.1e-188  65.73         Endoglucanase 4 OS=Oryza sativa subsp. japonica GN=GLU14 PE=2 SV=1
GUN17_ARATH  1.8e-177  61.62         Endoglucanase 17 OS=Arabidopsis thaliana GN=At4g02290 PE=2 SV=1
GUN1_ARATH   4.0e-177  62.53         Endoglucanase 1 OS=Arabidopsis thaliana GN=CEL2 PE=2 SV=1

Match Name    E-value   Identity (%)  Description
E5RDC9_CUCME  2.3e-280  98.16         Endoglucanase OS=Cucumis melo subsp. melo PE=3 SV=1
B9GK69_POPTR  1.8e-229  78.19         Endoglucanase OS=Populus trichocarpa GN=GH9B14 PE=2 SV=1
Q9AVI5_POPAL  5.0e-227  77.16         Endoglucanase OS=Populus alba GN=PopCel1 PE=3 SV=1
L0AUV0_POPTO  6.6e-227  77.16         Endoglucanase OS=Populus tomentosa PE=3 SV=1
Q9XIY8_POPAL  4.3e-226  77.73         Endoglucanase OS=Populus alba GN=POPCEL2 PE=2 SV=1

Match Name   E-value   Identity (%)  Description
AT4G02290.1  1.0e-178  61.62         glycosyl hydrolase 9B13
AT1G02800.1  2.3e-178  62.53         cellulase 2
AT1G23210.1  3.0e-162  58.47         glycosyl hydrolase 9B6
AT1G22880.1  9.5e-161  57.14         cellulase 5
AT1G71380.1  6.2e-160  57.17         cellulase 3

Match Name                        E-value   Identity (%)  Description
gi|449456799|ref|XP_004146136.1|  4.9e-284  99.80         PREDICTED: endoglucanase 1 [Cucumis sativus]
gi|659095397|ref|XP_008448558.1|  3.3e-280  98.16         PREDICTED: endoglucanase CX-like [Cucumis melo]
gi|224057986|ref|XP_002299423.1|  2.6e-229  78.19         hypothetical protein POPTR_0001s11430g [Populus trichocarpa]
gi|743842748|ref|XP_011026779.1|  6.5e-228  77.78         PREDICTED: endoglucanase CX-like [Populus euphratica]
gi|13383303|dbj|BAB39482.1|       7.2e-227  77.16         endo-1,4-beta glucanase [Populus alba]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR001701  Glyco_hydro_9
IPR008928  6-hairpin_glycosidase_sf
IPR012341  6hp_glycosidase-like_sf
IPR018221  Glyco_hydro_9_His_AS
Vocabulary: Molecular Function
Term        Definition
GO:0004553  hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0003824  catalytic activity
Vocabulary: Biological Process
Term        Definition
GO:0005975  carbohydrate metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0030245 cellulose catabolic process
biological_process GO:0005982 starch metabolic process
biological_process GO:0005985 sucrose metabolic process
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0008810 cellulase activity
molecular_function GO:0003824 catalytic activity
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
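
For downstream use, the GO assignment rows above can be grouped by ontology category. The sketch below is one possible way to do that in Python, assuming each row has the form "category accession term name" separated by whitespace as shown in the table; the function name `group_go_terms` is a hypothetical helper, not something provided by the database.

```python
from collections import defaultdict

# The GO assignment rows listed for CSPI03G00790, copied from the table above.
GO_ROWS = """\
biological_process GO:0030245 cellulose catabolic process
biological_process GO:0005982 starch metabolic process
biological_process GO:0005985 sucrose metabolic process
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0008810 cellulase activity
molecular_function GO:0003824 catalytic activity
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
"""


def group_go_terms(text):
    """Group 'category accession name' rows into {category: [(accession, name)]}."""
    grouped = defaultdict(list)
    for row in text.splitlines():
        if not row.strip():
            continue  # skip blank lines
        # Split on the first two whitespace runs only; the term name may contain spaces.
        category, accession, name = row.split(maxsplit=2)
        grouped[category].append((accession, name))
    return dict(grouped)


go_by_category = group_go_terms(GO_ROWS)
```

Splitting with `maxsplit=2` keeps multi-word term names such as "hydrolase activity, hydrolyzing O-glycosyl compounds" intact.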

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI03G00790.1  CSPI03G00790.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                                Source   Source Term       Source Description        Alignment
IPR001701  Glycoside hydrolase family 9                   PFAM     PF00759           Glyco_hydro_9             coord: 28..482; score: 1.3E
IPR008928  Six-hairpin glycosidase-like                   unknown  SSF48208          Six-hairpin glycosidases  coord: 15..487; score: 1.62E
IPR012341  Six-hairpin glycosidase                        GENE3D   G3DSA:1.50.10.10  -                         coord: 26..487; score: 1.6E
IPR018221  Glycoside hydrolase family 9, His active site  PROSITE  PS00592           GLYCOSYL_HYDROL_F9_1      coord: 395..411; scor
None       No IPR available                               PANTHER  PTHR22298         ENDO-1,4-BETA-GLUCANASE   coord: 9..486; score:
None       No IPR available                               PANTHER  PTHR22298:SF32    SUBFAMILY NOT NAMED       coord: 9..486; score: