Csa5G167210 (gene) Cucumber (Chinese Long) v2

Name: Csa5G167210
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Endo-1,4-beta-glucanase; contains IPR001701 (Glycoside hydrolase, family 9), IPR008928 (Six-hairpin glycosidase-like)
Location: Chr5: 6533470..6536882 (+)
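
The span covered by the location can be worked out from its two coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is the usual convention for feature coordinates of this kind but is not stated on this page; `feature_length` is a hypothetical helper name.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based, inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of this record.
gene_span = feature_length(6533470, 6536882)
print(gene_span)  # 3413
```

Under that assumption, the gene sequence with introns would be expected to be 3413 bases long.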
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGTACTTTTTGGTGATGATTTTGATGTTGAAGCTTGTGGTTGTGTCTTCTCATGACTATGGTGATGCTTTGACAAAAAGTATATTGTTTTTTGAAGGTCAGAGGTCCGGAAAATTGCCTCCTAACCAACGAGTCACTTGGAGGAAGGATTCGGCTCTTCGTGATGGCCTTGAGTTTGGTGTAAGATCATATTCTTTTCTTTCTTTCTGTTGAGGTTATGATACAATTCTTCATACTCTTATTCTTTAGATACTTCAATTTCACATATGACTGTGAGGAAAAAAATGTTATTTTTGACCAAAAACAATGACGGTTTTAAGATGGGTACTGTTATATAGGTTGATTTGGTGGGTGGCTACTATGACGCCGGTGACAACGTGAAGTTCAGTTTTCCAATGGCGTTCACCACCACCATGCTTTCATGGAGTGTGCTGGAGTTCGGCAAAGACATGGGTTCCGACCTTCCCTACGCCATGGACTCCATCCGATGGGCCACTGATTACTTACTCAAAGCCACCTCTGTCCCTGGCTTTGTTTTTGCTCAGGTTGGCGACCCCTACGCCGATCATTTCTGCTGGGAACGACCCGAAGACATGGATACGCCGAGGACTCCTTATGCTGTCAGCAAGCAGTTTCCCGGCTCTGAAGTCTCCGCCGAGATCGCTGCTGCGCTAGCTGCTTCCTCCATGGTGTTTAAGCCTCTTGATGGAGGTTACTCTGCTAGGCTTCTTAAAAGGGCTAGAATGGTGAGAGACTTTGTTTTAATTTATGATAGAAGTAAAGTTTTAGTGATAGATTGAGTCAGTTTATAAACAACCTATTCAAATGGAGTGAGATATATACATATATTTGAAATATATTTCAATGCATCTCAAGTTGTATATATATATATATATATTTGTGCTATCTTATAACCTAAGCTCTCGTTATCAAGTTGGGTTATTATTTGAACTTTAACATTGCACTATACAATTTTTTGGACACTTCAATTAGGGATCTTAGTTAATGAACAAAGTTTATGTATGTTTATTGATTTAGGTTTATTAATTTTTTTTTTTGGGTTAGGTTTTTGAGTTTGCAGATACTTATCGAGGGTCTTACAATGACAGTCTTGGACGATGGGTTTGTCCATTTTATTGTAGCTATAGTGGCTATGAGGTATGTATTATCTAATATTAAAAAAAATAACGACTCGTTTGGCAAAAAAAATTCTATTTTATGTTTTCAAATGCATTCAGTCGTTCAACAAATATATGTTTAGCATATACTTCGAATTATGTCTTTTGAAAATCGTTTTCAGATTTTGTGATCTATCATGTTCAAATCTTAAAAATTAAGATTTTTTTCATGTTTTTGTTGTGCATACATTTTCATCGAACCACTTGTTTTCAAATTCTCACTAACAAATGAATCCTAAATAATCTGGTTGGATATAGGACGAATTAATATGGGGAGCTGCTTGGTTATACAAGGCAACAAAGACGGCATTTTATTGGAACTATATCACTAAAAACATCAACACAATAAAGAACAACAACAACAATCCTGCAGTTGATTACAATAATGTAATTAACTCTAAAACTGATAATGTCTTTCAACATTATTCTAGTGGTAACTTTGCTGAGTTTGGATGGGATACCAAATATGCGGGAATTAATGTACTCATTTCCAAGGTATGTATATATGTATATATATAATTATAGCAGAATAAATATAATGTATTATTATTAACCAAATTATTTATTATTACAGTTTGTGTTAAGTACTGGAAATGGTTCTTCTTCATCCAATATGTTTATTAATTATGCTGATAAGTTCGTATGTTCAGTTCTTCCTGAATCCCCTTCTCTGTTAGTTTCTTACTCACGAGGTAACTAAATTCTCTTTCTAATCATATATACATGTATGTGTGGTTATAAGTTATAAATCAAAGGGTTATACAAAACAAATCTGGAGGTGAATATCTAGTTTGAACTAAAATGGGTTATTAATATGAA
TGAATATAGGTGGGCTTCTGTTCAAGTCTGGAGGAAGTAACATACAGCATTCAACAGCCTTATCATTTCTTCTTATTGTATATTCAAATTACTTGAATCAATATAAACACATTCTTCATTGTGGAAATGTTGTAGCCTCTCCATCTCGTCTTCTACAACTTGCCAAGACCCAGGTAATTAATTAAACAAACTCCAATTTTAGTCCTTAAATAACTTTTTTACGTTTTAAGTAATGATCTCTTTCCCATGCACCCAAGTAATTACTTATCTCTAAATTAAAACAAACAAACAAACAAACCTAATTCTTAATTTCGCATTTAAAAAATCTTTGATTTGTTTTTCCTTTAACAAAATAATCATAACACCAATTAGATCTTTATAGTCATATTTAGATATAAATTTTAATTTTATAGAAGTGTTTTTAAATATAATAAAATATATAGTAAAATTTTGACTATCTTTATCGATCAATATTAATGATAAATATTGATATACATACACTCATATAATTTGTACGATAGAATTTGAAATCCGGTTAAATATTATTTGGAGTAGTTTTACATTTTATAATAATCTTCAAATTTTATGTCTTGGGTCAGATAATAATTTGAATGTATTATGAATAAGATACAAAAGAAAACTAAAAGCTTATTGAACATATATATTATATTAGACCTTATCTTAAAATCTACCCCTATTGAATCTAAAGTTCAAAAGACTTACTAGATATTTCTAATTTGATTGTTTTTTCTTAGACATAAAGTTGAAAATAGAGTTTAACCAAAGTTTGTAATGCTTATTAATTAAGAAATTCTTATATTTTACTAGTTGAGATATTAGAATGCTAAAATGTGTCAATATTTTCATACGTACACTTAGTTGAGAGATATTAATATGCTACTCTATTATCGATCGACCACTAATGTTTCTTTGTTAAAAAAATTATTATAATAATCTTCTAAATTATATTAGATATTGTGAAACGTATGATAGTATGAAATACAATTTTTTTTTCCAAACACATGAAAAACTTGTATTTAACCTAACAACGAAGGTGACGTGGATGCATTTCTGGATGATACGTAGGTGGATTACATATTAGGAAGCAACCCATTGGGGATGTCCTATATGGTGGGTTATGGCAAAAACTTCCCACAAAGGATTCACCATCGTGGCTCATCATTGCCATCCATGGCCAACTATCCACAGGCCATTGGATGTGCAAAAGGGAAACAGTATTTTCAAAGCAACAATCCAAACCCTAATTTGCTAATTGGAGCTGTTGTTGGAGGACCTGATTTCAATGATTCTTATGCAGATTCCAGACCTGATTTTGTTTATTCTGAACCAACTACTTATATTAATGCTCCTCTTGTTGGTCTCTTGGCTTACTTCAAATCCCATCCTAATTAA

mRNA sequence

ATGAAGTACTTTTTGGTGATGATTTTGATGTTGAAGCTTGTGGTTGTGTCTTCTCATGACTATGGTGATGCTTTGACAAAAAGTATATTGTTTTTTGAAGGTCAGAGGTCCGGAAAATTGCCTCCTAACCAACGAGTCACTTGGAGGAAGGATTCGGCTCTTCGTGATGGCCTTGAGTTTGGTGTTGATTTGGTGGGTGGCTACTATGACGCCGGTGACAACGTGAAGTTCAGTTTTCCAATGGCGTTCACCACCACCATGCTTTCATGGAGTGTGCTGGAGTTCGGCAAAGACATGGGTTCCGACCTTCCCTACGCCATGGACTCCATCCGATGGGCCACTGATTACTTACTCAAAGCCACCTCTGTCCCTGGCTTTGTTTTTGCTCAGGTTGGCGACCCCTACGCCGATCATTTCTGCTGGGAACGACCCGAAGACATGGATACGCCGAGGACTCCTTATGCTGTCAGCAAGCAGTTTCCCGGCTCTGAAGTCTCCGCCGAGATCGCTGCTGCGCTAGCTGCTTCCTCCATGGTGTTTAAGCCTCTTGATGGAGGTTACTCTGCTAGGCTTCTTAAAAGGGCTAGAATGGTTTTTGAGTTTGCAGATACTTATCGAGGGTCTTACAATGACAGTCTTGGACGATGGGTTTGTCCATTTTATTGTAGCTATAGTGGCTATGAGTTTGTGTTAAGTACTGGAAATGGTTCTTCTTCATCCAATATGTTTATTAATTATGCTGATAAGTTCGTATGTTCAGTTCTTCCTGAATCCCCTTCTCTGTTAGTTTCTTACTCACGAGGTGGGCTTCTGTTCAAGTCTGGAGGAAGTAACATACAGCATTCAACAGCCTTATCATTTCTTCTTATTGTATATTCAAATTACTTGAATCAATATAAACACATTCTTCATTGTGGAAATGTTGTAGCCTCTCCATCTCGTCTTCTACAACTTGCCAAGACCCAGGTGGATTACATATTAGGAAGCAACCCATTGGGGATGTCCTATATGGTGGGTTATGGCAAAAACTTCCCACAAAGGATTCACCATCGTGGCTCATCATTGCCATCCATGGCCAACTATCCACAGGCCATTGGATGTGCAAAAGGGAAACAGTATTTTCAAAGCAACAATCCAAACCCTAATTTGCTAATTGGAGCTGTTGTTGGAGGACCTGATTTCAATGATTCTTATGCAGATTCCAGACCTGATTTTGTTTATTCTGAACCAACTACTTATATTAATGCTCCTCTTGTTGGTCTCTTGGCTTACTTCAAATCCCATCCTAATTAA

Coding sequence (CDS)

ATGAAGTACTTTTTGGTGATGATTTTGATGTTGAAGCTTGTGGTTGTGTCTTCTCATGACTATGGTGATGCTTTGACAAAAAGTATATTGTTTTTTGAAGGTCAGAGGTCCGGAAAATTGCCTCCTAACCAACGAGTCACTTGGAGGAAGGATTCGGCTCTTCGTGATGGCCTTGAGTTTGGTGTTGATTTGGTGGGTGGCTACTATGACGCCGGTGACAACGTGAAGTTCAGTTTTCCAATGGCGTTCACCACCACCATGCTTTCATGGAGTGTGCTGGAGTTCGGCAAAGACATGGGTTCCGACCTTCCCTACGCCATGGACTCCATCCGATGGGCCACTGATTACTTACTCAAAGCCACCTCTGTCCCTGGCTTTGTTTTTGCTCAGGTTGGCGACCCCTACGCCGATCATTTCTGCTGGGAACGACCCGAAGACATGGATACGCCGAGGACTCCTTATGCTGTCAGCAAGCAGTTTCCCGGCTCTGAAGTCTCCGCCGAGATCGCTGCTGCGCTAGCTGCTTCCTCCATGGTGTTTAAGCCTCTTGATGGAGGTTACTCTGCTAGGCTTCTTAAAAGGGCTAGAATGGTTTTTGAGTTTGCAGATACTTATCGAGGGTCTTACAATGACAGTCTTGGACGATGGGTTTGTCCATTTTATTGTAGCTATAGTGGCTATGAGTTTGTGTTAAGTACTGGAAATGGTTCTTCTTCATCCAATATGTTTATTAATTATGCTGATAAGTTCGTATGTTCAGTTCTTCCTGAATCCCCTTCTCTGTTAGTTTCTTACTCACGAGGTGGGCTTCTGTTCAAGTCTGGAGGAAGTAACATACAGCATTCAACAGCCTTATCATTTCTTCTTATTGTATATTCAAATTACTTGAATCAATATAAACACATTCTTCATTGTGGAAATGTTGTAGCCTCTCCATCTCGTCTTCTACAACTTGCCAAGACCCAGGTGGATTACATATTAGGAAGCAACCCATTGGGGATGTCCTATATGGTGGGTTATGGCAAAAACTTCCCACAAAGGATTCACCATCGTGGCTCATCATTGCCATCCATGGCCAACTATCCACAGGCCATTGGATGTGCAAAAGGGAAACAGTATTTTCAAAGCAACAATCCAAACCCTAATTTGCTAATTGGAGCTGTTGTTGGAGGACCTGATTTCAATGATTCTTATGCAGATTCCAGACCTGATTTTGTTTATTCTGAACCAACTACTTATATTAATGCTCCTCTTGTTGGTCTCTTGGCTTACTTCAAATCCCATCCTAATTAA

Protein sequence

MKYFLVMILMLKLVVVSSHDYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSALRDGLEFGVDLVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKATSVPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSMVFKPLDGGYSARLLKRARMVFEFADTYRGSYNDSLGRWVCPFYCSYSGYEFVLSTGNGSSSSNMFINYADKFVCSVLPESPSLLVSYSRGGLLFKSGGSNIQHSTALSFLLIVYSNYLNQYKHILHCGNVVASPSRLLQLAKTQVDYILGSNPLGMSYMVGYGKNFPQRIHHRGSSLPSMANYPQAIGCAKGKQYFQSNNPNPNLLIGAVVGGPDFNDSYADSRPDFVYSEPTTYINAPLVGLLAYFKSHPN*
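
The protein sequence corresponds to a codon-by-codon translation of the coding sequence, with the terminal `*` standing for the stop codon. The sketch below illustrates this with the standard genetic code on the first and last few codons of the CDS above; `translate` is a hypothetical helper, and this is only an illustration, not the annotation pipeline that produced this record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First eight codons and last five codons of the CDS listed above.
print(translate("ATGAAGTACTTTTTGGTGATGATT"))  # MKYFLVMI
print(translate("TCCCATCCTAATTAA"))           # SHPN*
```

Both fragments agree with the start (`MKYFLVMI...`) and end (`...SHPN*`) of the protein sequence shown above.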
BLAST of Csa5G167210 vs. Swiss-Prot
Match: GUN3_ORYSJ (Endoglucanase 3 OS=Oryza sativa subsp. japonica GN=GLU8 PE=2 SV=1)

HSP 1 Score: 451.1 bits (1159), Expect = 1.4e-125
Identity = 233/458 (50.87%), Positives = 293/458 (63.97%), Query Frame = 1

Query: 19  HDYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSALRDGLEFGVDLVGGYYDAGDNVKFS 78
           HDY DALTKSILFFEGQRSGKLPP+QRV+WR DS L DG    VDLVGGYYDAGDN+KF 
Sbjct: 34  HDYRDALTKSILFFEGQRSGKLPPSQRVSWRGDSGLSDGSSIKVDLVGGYYDAGDNMKFG 93

Query: 79  FPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKATSVPGFVFAQVGDPYADH 138
           FP+AF+ TML+WSV+EFG  M  +L +A D++RW +DYLLKAT+ P  V+ QVGD   DH
Sbjct: 94  FPLAFSMTMLAWSVVEFGGLMKGELQHARDAVRWGSDYLLKATAHPDTVYVQVGDANRDH 153

Query: 139 FCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSMVFKPLDGGYSARLLKRARMV 198
            CWERPEDMDTPRT Y V    PG++V+AE AAALAA+S+VF+  D  Y++RL+ RA+ V
Sbjct: 154 ACWERPEDMDTPRTVYKVDPSTPGTDVAAETAAALAAASLVFRKSDPAYASRLVARAKRV 213

Query: 199 FEFADTYRGSYNDSLGRWV--------------------------CPFYCSY-------- 258
           FEFAD +RG+Y+  L  +V                           P Y SY        
Sbjct: 214 FEFADKHRGTYSTRLSPYVCPYYCSYSGYQDELLWGAAWLHRATKNPTYLSYIQMNGQVL 273

Query: 259 --SGYEFVLSTGNGSSSSNMFI----------------NYADKFVCSVLPESPSLLVSYS 318
                +      N  + + + I                 +AD F+CS++P +P+    Y+
Sbjct: 274 GADEQDNTFGWDNKHAGARILIAKAFLVQKVAALHEYKGHADSFICSMVPGTPTDQTQYT 333

Query: 319 RGGLLFKSGGSNIQHSTALSFLLIVYSNYLNQYKHILHCGNVVASPSRLLQLAKTQVDYI 378
           RGGLLFK   SN+Q+ T+ SFLL+ Y+ YL   K  + CG    +P+RL  +A+ QVDY+
Sbjct: 334 RGGLLFKLSDSNMQYVTSSSFLLLTYAKYLAFSKTTVSCGGAAVTPARLRAIARQQVDYL 393

Query: 379 LGSNPLGMSYMVGYGKNFPQRIHHRGSSLPSMANYPQAIGCAKGKQYFQSNNPNPNLLIG 425
           LGSNP+GMSYMVGYG  +P+RIHHR SSLPS+A +P  IGC++G     S   NPN+L+G
Sbjct: 394 LGSNPMGMSYMVGYGAKYPRRIHHRASSLPSVAAHPARIGCSQGFTALYSGVANPNVLVG 453

BLAST of Csa5G167210 vs. Swiss-Prot
Match: GUN4_ORYSJ (Endoglucanase 4 OS=Oryza sativa subsp. japonica GN=GLU14 PE=2 SV=1)

HSP 1 Score: 402.5 bits (1033), Expect = 5.7e-111
Identity = 219/475 (46.11%), Positives = 284/475 (59.79%), Query Frame = 1

Query: 17  SSHDYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSALRDGLEFGVDLVGGYYDAGDNVK 76
           ++ +Y DAL K+ILFFE QRSGKLPP QRV WR DS L DG   GVDL GGYYDAGDNVK
Sbjct: 21  AAFNYADALDKAILFFEAQRSGKLPPGQRVAWRADSGLSDGSADGVDLAGGYYDAGDNVK 80

Query: 77  FSFPMAFTTTMLSWSVLEFGKDMGS----------------DLPYAMDSIRWATDYLLKA 136
           F  PMAFT TMLSWSV+EFG  M +                 L  A  ++RW  DYLLKA
Sbjct: 81  FGLPMAFTVTMLSWSVIEFGDMMPARRSSFLGGIFGGGGVAQLDNARAAVRWGADYLLKA 140

Query: 137 -TSVPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSMV 196
            T+ P  ++ QV DPY DH CWERPEDMDTPR+ Y V+ Q PGS+V+ E AAALAA+S+V
Sbjct: 141 ATATPDTLYVQVADPYQDHRCWERPEDMDTPRSVYKVTPQSPGSDVAGETAAALAAASIV 200

Query: 197 FKPLDGGYSARLLKR--------ARMVFEFADTYRG---------SYNDSLGRWVCPF-- 256
           F+  D  YSA+LL           +    ++D+            SY+D L  W   +  
Sbjct: 201 FRVSDPSYSAKLLDAAQLVFDFADKYRGSYSDSLSSVVCPFYCSHSYHDEL-LWAASWLH 260

Query: 257 ---------YCSYSGY----------EFVLSTGNGSSSSNMFIN-----------YADKF 316
                    Y SY G           +F  S  +   ++  F+            + D +
Sbjct: 261 LASPEKKDVYLSYIGSNGHALGAEQDDFTFSWDDKRVATKGFLQSRADGLQLYKAHTDNY 320

Query: 317 VCSVLPESPSLLVSYSRGGLLFKSGGSNIQHSTALSFLLIVYSNYLNQYKHILHCGNVVA 376
           +CS++P +      Y+ GGLLFK G SN+Q+ T+ +FLL+ Y+ YL+     + CG+   
Sbjct: 321 ICSLVPGANGFQSQYTPGGLLFKEGDSNMQYVTSTAFLLLTYAKYLSSSAATVSCGSTAV 380

Query: 377 SPSRLLQLAKTQVDYILGSNPLGMSYMVGYGKNFPQRIHHRGSSLPSMANYPQAIGCAKG 426
           SPS L+ LAK QVDYILG+NP GMSYMVG+G  +P+ +HHRG+S+PS+ ++P  IGC +G
Sbjct: 381 SPSTLISLAKKQVDYILGANPAGMSYMVGFGARYPRHVHHRGASMPSVRDHPARIGCDEG 440

BLAST of Csa5G167210 vs. Swiss-Prot
Match: GUN20_ARATH (Endoglucanase 20 OS=Arabidopsis thaliana GN=At4g23560 PE=2 SV=1)

HSP 1 Score: 386.0 bits (990), Expect = 5.6e-106
Identity = 218/483 (45.13%), Positives = 290/483 (60.04%), Query Frame = 1

Query: 1   MKYFLVMILM---LKLVVVSSHDYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSALRDG 60
           M   LV++L+   L    + + +YGDAL KSILFFEGQRSGKLP NQRV WR DSAL DG
Sbjct: 1   MGKLLVLMLVGMFLAFESLEALEYGDALNKSILFFEGQRSGKLPTNQRVKWRADSALSDG 60

Query: 61  LEFGVDLVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGS--DLPYAMDSIRWATD 120
               V+L+GGYYDAGDNVKF +PM+FTTT+LSW+ +E+  ++ S   L Y   +I+W TD
Sbjct: 61  SLANVNLIGGYYDAGDNVKFVWPMSFTTTLLSWAAIEYQNEISSVNQLGYLRSTIKWGTD 120

Query: 121 YLLKATSVPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAA 180
           ++L+A + P  ++ QVGD  +DH CWERPEDMDT RT Y++S   PGSE + E AAALAA
Sbjct: 121 FILRAHTSPNMLYTQVGDGNSDHSCWERPEDMDTSRTLYSISSSSPGSEAAGEAAAALAA 180

Query: 181 SSMVFKPLDGGYSARLLKRARMVFEFADTYRGSYNDSLGRWVCPFYCSYSGYE------- 240
           +S+VFK +D  YS+ LL  A+ +FEFAD YRGSY  S     CPFYCSYSGY+       
Sbjct: 181 ASLVFKSVDSTYSSTLLNHAKTLFEFADKYRGSYQAS-----CPFYCSYSGYQDELLWAA 240

Query: 241 --------------FVLSTGNGSSSSNMFINYADKFVCSVLPESPSLLVSYSRGG----L 300
                         +V+S  + S + N F ++ +KFV      + +LLVS    G     
Sbjct: 241 AWLYKATGDKIYINYVISNKDWSQAVNEF-SWDNKFV-----GAQALLVSEFYNGANDLA 300

Query: 301 LFKS-----------GGSNIQHSTALSFLLIVYSNYLNQYKHI----------------- 360
            FKS           G S+ Q       LL +  +   QY                    
Sbjct: 301 KFKSDVESFVCAMMPGSSSQQIKPTPGGLLFIRDSSNLQYVTTATTVLFHYSKTLTKAGV 360

Query: 361 --LHCGNVVASPSRLLQLAKTQVDYILGSNPLGMSYMVGYGKNFPQRIHHRGSSLPSMAN 420
             + CG+   + S++   AK+QVDYILG+NP+ MSYMVG+G  +P + HHRGSSLPS+ +
Sbjct: 361 GSIQCGSTKFTVSQIRNFAKSQVDYILGNNPMKMSYMVGFGTKYPTQPHHRGSSLPSIQS 420

Query: 421 YPQAIGCAKGKQYFQSNNPNPNLLIGAVVGGPDFNDSYADSRPDFVYSEPTTYINAPLVG 424
            P+ I C  G  Y+ S+ PNPN+ IGA+VGGP+ +D Y+D + D+ ++EPTTYINA  +G
Sbjct: 421 KPEKIDCNGGYSYYNSDTPNPNVHIGAIVGGPNSSDQYSDKKSDYSHAEPTTYINAAFIG 472

BLAST of Csa5G167210 vs. Swiss-Prot
Match: GUN_PHAVU (Endoglucanase OS=Phaseolus vulgaris PE=2 SV=2)

HSP 1 Score: 372.9 bits (956), Expect = 4.9e-102
Identity = 203/455 (44.62%), Positives = 279/455 (61.32%), Query Frame = 1

Query: 18  SHDYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSALRDGLEFGVDLVGGYYDAGDNVKF 77
           ++DY DAL K+ILFFEGQRSGKLP +QRV WR+DSAL DG    V+L+GGYYDAGDNVKF
Sbjct: 38  NYDYADALAKAILFFEGQRSGKLPSSQRVKWREDSALSDGKLQNVNLMGGYYDAGDNVKF 97

Query: 78  SFPMAFTTTMLSWSVLEFGKDMGS--DLPYAMDSIRWATDYLLKATSVPGFVFAQVGDPY 137
            +PMAF+T++LSW+ +E+  ++ S   L Y   +IRW  D++L+A + P  ++ QVGD  
Sbjct: 98  GWPMAFSTSLLSWAAVEYESEISSVNQLGYLQSAIRWGADFMLRAHTSPTTLYTQVGDGN 157

Query: 138 ADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSMVFKPLDGGYSARLLKR- 197
           ADH CWERPEDMDTPRT Y +    PG+EV+AE AAAL+A+S+VFK +D  YS+ LL   
Sbjct: 158 ADHNCWERPEDMDTPRTVYKIDANSPGTEVAAEYAAALSAASIVFKKIDAKYSSTLLSHS 217

Query: 198 ---ARMVFEFADTYRGS---------YNDSLGRWVCPFYCSYSGYEFVLS---------- 257
                   +   +Y GS         Y D L  W   +    SG    LS          
Sbjct: 218 KSLFDFADKNRGSYSGSCPFYCSYSGYQDEL-LWAAAWLYKASGESKYLSYIISNQGWSQ 277

Query: 258 TGNGSSSSNMFINY---------------------ADKFVCSVLPESPSLLVSYSRGGLL 317
           T +  S  N F+                       A+ F+C+V+P S S  +  + GGLL
Sbjct: 278 TVSEFSWDNKFVGAQTLLTEEFYGGKKDLAKIKTDAESFICAVMPGSNSRQIKTTPGGLL 337

Query: 318 FKSGGSNIQHSTALSFLLIVYSNYLNQYKHI--LHCGNVVASPSRLLQLAKTQVDYILGS 377
           F    SN+Q++T+ + +L ++S  LN+  HI  ++CG+   + S++   AKTQV+YILG 
Sbjct: 338 FTRDSSNLQYTTSSTMVLFIFSRILNR-NHINGINCGSSHFTASQIRGFAKTQVEYILGK 397

Query: 378 NPLGMSYMVGYGKNFPQRIHHRGSSLPSMANYPQAIGCAKG-KQYFQSNNPNPNLLIGAV 424
           NP+ MSYMVG+G  +P+++HHRGSS+PS+  +P  +GC  G   Y+ S NPNPN  +GA+
Sbjct: 398 NPMKMSYMVGFGSKYPKQLHHRGSSIPSIKVHPAKVGCNAGLSDYYNSANPNPNTHVGAI 457

BLAST of Csa5G167210 vs. Swiss-Prot
Match: GUN20_ORYSJ (Endoglucanase 20 OS=Oryza sativa subsp. japonica GN=GLU15 PE=2 SV=1)

HSP 1 Score: 340.1 bits (871), Expect = 3.5e-92
Identity = 194/461 (42.08%), Positives = 263/461 (57.05%), Query Frame = 1

Query: 16  VSSHDYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSALRDGLEFGVDLVGGYYDAGDNV 75
           + S DYGDAL K+ILFFEGQRSG+LP NQR TWR DSAL DG E  V+L GGYYDAGDNV
Sbjct: 36  LGSPDYGDALAKAILFFEGQRSGRLPANQRATWRGDSALTDGREENVNLTGGYYDAGDNV 95

Query: 76  KFSFPMAFTTTMLSWSVLEFGKDMGS--DLPYAMDSIRWATDYLLKATSVPGFVFAQVGD 135
           KF +PMAFT T+L WS +E+G  + +  +L     +IRW  D+LL+A + P  ++ QVGD
Sbjct: 96  KFGYPMAFTVTLLGWSAVEYGAAVAAAGELGNLRAAIRWGADFLLRAHASPTTLYTQVGD 155

Query: 136 PYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAAL-------------------- 195
             ADH CWERPEDMDTPRT Y ++   PGSE +AE +AAL                    
Sbjct: 156 GNADHQCWERPEDMDTPRTLYKITADSPGSEAAAEASAALAAAYVALKDDGDTAFSSRLL 215

Query: 196 AASSMVFKPLDG----------------GYSARLLKRARMVF---------EFADTYRGS 255
           AAS  +F   +                 G+   LL  +  +F         +F    +GS
Sbjct: 216 AASRSLFDFANNYRGSFQSSCPFYCSYSGFQDELLWASAWLFKATRDAKYLDFLTNNQGS 275

Query: 256 YNDSLGRWVCPFYCS--YSGYEFVLSTG--NGSSSSNMFINYADKFVCSVLPESPSLLVS 315
            N      V  F     Y+G + + +     G +    + +  D FVC+++P S ++ + 
Sbjct: 276 SNP-----VNEFSWDNKYAGAQMLAAQEYLGGRTQLARYKDNLDSFVCALMPNSGNVQIR 335

Query: 316 YSRGGLLFKSGGSNIQHSTALSFLLIVYSNYLNQY-KHILHCGNVVASPSRLLQLAKTQV 375
            + GGLLF     N+Q++T  + +L +YS  L       + C     SP+++   A +QV
Sbjct: 336 TTPGGLLFTRDSVNLQYTTTATLVLSIYSKVLKSSGSRGVRCSAATFSPNQISSFATSQV 395

Query: 376 DYILGSNPLGMSYMVGYGKNFPQRIHHRGSSLPSMANYPQAIGCAKG-KQYFQSNNPNPN 424
           DYILG NPLGMSYMVG+   FP+RIHHRGSS+PS+    + + C +G   +  +++PNPN
Sbjct: 396 DYILGKNPLGMSYMVGFSTKFPRRIHHRGSSIPSIKVLSRKVTCKEGFSSWLPTSDPNPN 455

BLAST of Csa5G167210 vs. TrEMBL
Match: A0A0A0KN43_CUCSA (Endoglucanase OS=Cucumis sativus GN=Csa_5G167210 PE=3 SV=1)

HSP 1 Score: 882.5 bits (2279), Expect = 2.1e-253
Identity = 430/430 (100.00%), Positives = 430/430 (100.00%), Query Frame = 1

Query: 1   MKYFLVMILMLKLVVVSSHDYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSALRDGLEF 60
           MKYFLVMILMLKLVVVSSHDYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSALRDGLEF
Sbjct: 1   MKYFLVMILMLKLVVVSSHDYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSALRDGLEF 60

Query: 61  GVDLVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKA 120
           GVDLVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKA
Sbjct: 61  GVDLVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKA 120

Query: 121 TSVPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSMVF 180
           TSVPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSMVF
Sbjct: 121 TSVPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSMVF 180

Query: 181 KPLDGGYSARLLKRARMVFEFADTYRGSYNDSLGRWVCPFYCSYSGYEFVLSTGNGSSSS 240
           KPLDGGYSARLLKRARMVFEFADTYRGSYNDSLGRWVCPFYCSYSGYEFVLSTGNGSSSS
Sbjct: 181 KPLDGGYSARLLKRARMVFEFADTYRGSYNDSLGRWVCPFYCSYSGYEFVLSTGNGSSSS 240

Query: 241 NMFINYADKFVCSVLPESPSLLVSYSRGGLLFKSGGSNIQHSTALSFLLIVYSNYLNQYK 300
           NMFINYADKFVCSVLPESPSLLVSYSRGGLLFKSGGSNIQHSTALSFLLIVYSNYLNQYK
Sbjct: 241 NMFINYADKFVCSVLPESPSLLVSYSRGGLLFKSGGSNIQHSTALSFLLIVYSNYLNQYK 300

Query: 301 HILHCGNVVASPSRLLQLAKTQVDYILGSNPLGMSYMVGYGKNFPQRIHHRGSSLPSMAN 360
           HILHCGNVVASPSRLLQLAKTQVDYILGSNPLGMSYMVGYGKNFPQRIHHRGSSLPSMAN
Sbjct: 301 HILHCGNVVASPSRLLQLAKTQVDYILGSNPLGMSYMVGYGKNFPQRIHHRGSSLPSMAN 360

Query: 361 YPQAIGCAKGKQYFQSNNPNPNLLIGAVVGGPDFNDSYADSRPDFVYSEPTTYINAPLVG 420
           YPQAIGCAKGKQYFQSNNPNPNLLIGAVVGGPDFNDSYADSRPDFVYSEPTTYINAPLVG
Sbjct: 361 YPQAIGCAKGKQYFQSNNPNPNLLIGAVVGGPDFNDSYADSRPDFVYSEPTTYINAPLVG 420

Query: 421 LLAYFKSHPN 431
           LLAYFKSHPN
Sbjct: 421 LLAYFKSHPN 430

BLAST of Csa5G167210 vs. TrEMBL
Match: A0A0D2PJD4_GOSRA (Endoglucanase OS=Gossypium raimondii GN=B456_001G048100 PE=3 SV=1)

HSP 1 Score: 623.6 bits (1607), Expect = 1.8e-175
Identity = 310/463 (66.95%), Positives = 359/463 (77.54%), Query Frame = 1

Query: 15  VVSSHDYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSALRDGLEFGVDLVGGYYDAGDN 74
           +V+SHDYG ALTKSILF+EGQRSGKLPP QR+TWRKDSALRDG E GVDLVGGYYDAGDN
Sbjct: 19  LVASHDYGAALTKSILFYEGQRSGKLPPTQRITWRKDSALRDGFEIGVDLVGGYYDAGDN 78

Query: 75  VKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKATSVPGFVFAQVGDP 134
           VKF+FPMAF+ TML+WS+LEFG+ +G+DL +++ +I+W TDYLLKATSVPGFVFAQVGDP
Sbjct: 79  VKFTFPMAFSITMLAWSLLEFGQSLGTDLQHSLKAIQWGTDYLLKATSVPGFVFAQVGDP 138

Query: 135 YADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSMVFKPLDGGYSARLLKR 194
           Y DH CWERPEDMDTPRTPYAVSK+FPGSEVSAEIAAALAASSMVF+P++ GYSARLLKR
Sbjct: 139 YGDHNCWERPEDMDTPRTPYAVSKEFPGSEVSAEIAAALAASSMVFRPINRGYSARLLKR 198

Query: 195 ARMVFEFADTYRGSYNDSLG---------------------RWV-----CPFYCSY---- 254
           ARM+FEFAD YRGSYNDSLG                      W+      P+Y +Y    
Sbjct: 199 ARMIFEFADKYRGSYNDSLGPWACPFYCDYSGYQDELVWGAAWLLRATKAPYYRNYVLAN 258

Query: 255 -------------------SGYEFVLSTGNGSSSSNMFINYADKFVCSVLPESPSLLVSY 314
                              +G   ++S    S +   FI  ADKFVCSVLPESP++ VSY
Sbjct: 259 IQNLDKSSSFAEFGWDTKHAGINVLVSRLIKSQTPEPFITNADKFVCSVLPESPTISVSY 318

Query: 315 SRGGLLFKSGGSNIQHSTALSFLLIVYSNYLNQYKHILHCGNVVASPSRLLQLAKTQVDY 374
           S GGLL K GGSN+QH+TALSFLL+VYS  L++   ++HCGNVVA+P+RL+Q+A++QVDY
Sbjct: 319 SPGGLLIKPGGSNLQHATALSFLLLVYSRPLSKDSRVIHCGNVVATPARLIQVARSQVDY 378

Query: 375 ILGSNPLGMSYMVGYGKNFPQRIHHRGSSLPSMANYPQAIGCAKGKQYFQSNNPNPNLLI 429
           ILGSNPL MSYMVGYG+ FP+RIHHRGSSLPS+  +PQ I C  G  YF +NNPNPNLL 
Sbjct: 379 ILGSNPLNMSYMVGYGEKFPERIHHRGSSLPSITQHPQHIDCTGGATYFYTNNPNPNLLT 438

BLAST of Csa5G167210 vs. TrEMBL
Match: A0A059DJ18_EUCGR (Endoglucanase OS=Eucalyptus grandis GN=EUGRSUZ_A02886 PE=3 SV=1)

HSP 1 Score: 583.9 bits (1504), Expect = 1.5e-163
Identity = 294/454 (64.76%), Positives = 343/454 (75.55%), Query Frame = 1

Query: 13  LVVVSSHDYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSALRDGLEFGVDLVGGYYDAG 72
           +V VS HDYGDAL+KSILFFEGQRSGKLPP+QR+TWRKDS LRDG +  +DLVGGYYDAG
Sbjct: 1   MVSVSGHDYGDALSKSILFFEGQRSGKLPPSQRLTWRKDSGLRDGFDKHIDLVGGYYDAG 60

Query: 73  DNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKATSVPGFVFAQVG 132
           DNVKF+FPMAF+TTML+WSV+EFG+ MG DL  A+D++RWATDY LKATSVPG V+AQVG
Sbjct: 61  DNVKFNFPMAFSTTMLAWSVIEFGRGMGPDLRKAVDAVRWATDYFLKATSVPGLVYAQVG 120

Query: 133 DPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSMVFKPLDGGYSARLL 192
           +PY DH CWERPEDMDT RT YAVS Q PGSEVSAEIAAALAA+S+VFK ++  YS  LL
Sbjct: 121 EPYGDHECWERPEDMDTARTAYAVSAQSPGSEVSAEIAAALAAASLVFKRVNHNYSDLLL 180

Query: 193 KRARMVFEFADTYRGSYNDSLGRWVCPFYCSYSG---------------------YEFVL 252
            RA+ VFEFAD YRGSYN SLG  VCPFYC++ G                     + +V 
Sbjct: 181 SRAKTVFEFADKYRGSYNYSLGSVVCPFYCNFGGYEDDLIWAAAWLFRATAAPNYWNYVT 240

Query: 253 ST-----GN----GSSSSNMFINY------ADKFVCSVLPESPSLLVSYSRGGLLFKSGG 312
                  GN    G  + +  IN       ADKFVCS+LPESP+  VSYS GGLLFK GG
Sbjct: 241 ENIPTEGGNFAEFGWDTKDAGINVLASKLNADKFVCSILPESPTKYVSYSPGGLLFKPGG 300

Query: 313 SNIQHSTALSFLLIVYSNYLNQYKHILHCGNVVASPSRLLQLAKTQVDYILGSNPLGMSY 372
           SN+QH+TALSFLLIVY+N LN+   ++ CG+V A+  RL+Q+A+ Q DYILGSNP+ MSY
Sbjct: 301 SNLQHATALSFLLIVYANSLNRANRVVQCGSVQATSDRLIQVARAQADYILGSNPMKMSY 360

Query: 373 MVGYGKNFPQRIHHRGSSLPSMANYPQAIGCAKGKQYFQSNNPNPNLLIGAVVGGPDFND 431
           MVGYG  FPQRIHHRGSSLPS+  +PQ IGC  G  YF+S+ PNPN L GAVVGGPD  D
Sbjct: 361 MVGYGGKFPQRIHHRGSSLPSLDQHPQHIGCKDGTPYFKSSGPNPNQLTGAVVGGPDIQD 420

BLAST of Csa5G167210 vs. TrEMBL
Match: V7ACZ4_PHAVU (Endoglucanase OS=Phaseolus vulgaris GN=PHAVU_011G005600g PE=3 SV=1)

HSP 1 Score: 552.0 bits (1421), Expect = 6.5e-154
Identity = 288/473 (60.89%), Positives = 337/473 (71.25%), Query Frame = 1

Query: 19  HDYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSALRDGLEFG-------------VDLV 78
           H+YG+AL+KSILFFEGQRSGKLPP QR+TWRKDSAL+D                  VDLV
Sbjct: 34  HNYGEALSKSILFFEGQRSGKLPPTQRMTWRKDSALQDLYLLPPHTTTTVFPFLPKVDLV 93

Query: 79  GGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKATSVPG 138
           GGYYDAGDNVKF+FPMA++TTML+WSV+EFGK MG+DL +A+D+IRW +DY LKATS+PG
Sbjct: 94  GGYYDAGDNVKFNFPMAYSTTMLAWSVIEFGKFMGADLKHALDAIRWGSDYFLKATSIPG 153

Query: 139 FVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSMVFKPLDG 198
            VFAQVGDPYADH CW+RPEDMDTPRT YAVS+  P SE+SAEIAAALAASS+VF+    
Sbjct: 154 IVFAQVGDPYADHSCWQRPEDMDTPRTAYAVSRDHPASELSAEIAAALAASSIVFRKYHL 213

Query: 199 GYSARLLKRARMVFEFADTYRGSYNDS---------------------LGRWVC-----P 258
            Y++RLL+RA MVF+FAD YRGSYNDS                        W+      P
Sbjct: 214 AYASRLLRRAIMVFDFADKYRGSYNDSLGPWVCPFYCDFSGYQDELVWGAAWLLKATKRP 273

Query: 259 FYCSY--------------------SGYEFVLST---GNGSSSSNMFINYADKFVCSVLP 318
           +Y  Y                    +G   +LS     N SS S  FI  A+KFVC+VLP
Sbjct: 274 YYLDYIDQNIHNLTNFAEFGWDSKDAGINVLLSKLLINNTSSDSKPFIFNAEKFVCAVLP 333

Query: 319 ESPSLLVSYSRGGLLFKSGGSNIQHSTALSFLLIVYSNYLNQYKHILHCG-NVVASPSRL 378
           ESPS+ V YS GGLLFK GGSN+QH+TA+SFL +VY+ YL +    + CG NV ASP+RL
Sbjct: 334 ESPSVSVKYSPGGLLFKPGGSNLQHTTAISFLFLVYAGYLKKTNTEIDCGGNVFASPTRL 393

Query: 379 LQLAKTQVDYILGSNPLGMSYMVGYGKNFPQRIHHRGSSLPSMANYPQAIGCAKGKQYFQ 429
            Q+A+ QVDYILGSNPL MSYMVGYG  +P+RIHHR SSLPSM  YP  +GC +G  YF+
Sbjct: 394 RQIARGQVDYILGSNPLNMSYMVGYGAKYPERIHHRASSLPSMDEYPPHVGCKEGSFYFE 453

BLAST of Csa5G167210 vs. TrEMBL
Match: A0A0A0KWL5_CUCSA (Endoglucanase OS=Cucumis sativus GN=Csa_4G001940 PE=3 SV=1)

HSP 1 Score: 493.4 bits (1269), Expect = 2.8e-136
Identity = 249/432 (57.64%), Positives = 302/432 (69.91%), Query Frame = 1

Query: 19  HDYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSALRDGLEFGVDLVGGYYDAGDNVKFS 78
           H+Y DAL KSILFF+GQRSGKLPPNQ++ WRKDS L DG    VDLVGGYYDAGDNVKF 
Sbjct: 42  HNYRDALAKSILFFQGQRSGKLPPNQKMAWRKDSGLSDGSSMNVDLVGGYYDAGDNVKFG 101

Query: 79  FPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKATSVPGFVFAQVGDPYADH 138
           FPMAFTTTMLSWSV+EFG  M ++L  A  +IRWATDYLLKAT++P  +F QVGD   DH
Sbjct: 102 FPMAFTTTMLSWSVVEFGGVMKNELNNAKQAIRWATDYLLKATALPDTIFVQVGDANKDH 161

Query: 139 FCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSMVFKPLDGGYSARLLKRARMV 198
            CWERPEDMDTPRT   + K  PGSEV+AE AAALA++S+VFK  D  YS  L+K A  V
Sbjct: 162 ACWERPEDMDTPRTVLKIDKNNPGSEVAAETAAALASASLVFKKSDPTYSKLLIKTAIRV 221

Query: 199 FEFADTYRGSYNDSLGRWVCPFYCSYSGYE---------------------FVLSTG--- 258
           FEF D YRGSY++ L  +VCPFYCS+SGY+                      +LS     
Sbjct: 222 FEFGDKYRGSYSNGLNNFVCPFYCSFSGYQNLGGVEFDNTFGWDNKHVGARILLSKAFLI 281

Query: 259 NGSSSSNMFINYADKFVCSVLPESP-SLLVSYSRGGLLFKSGGSNIQHSTALSFLLIVYS 318
               S   + ++AD F+CS++P++P S  V Y+ GGLLFK G SN+Q+ T+ +FLL+ Y+
Sbjct: 282 QNVKSLYEYKDHADNFICSLIPDAPSSSSVHYTPGGLLFKMGDSNMQYVTSTTFLLLTYA 341

Query: 319 NYLNQYKHILHCGNVVASPSRLLQLAKTQVDYILGSNPLGMSYMVGYGKNFPQRIHHRGS 378
            YL       +C     +P+ L  +AK Q+DY+LG NPL MSYMVGYG ++PQRIHHR S
Sbjct: 342 KYLTSAHTTANCNGRSITPNILRTIAKKQIDYLLGENPLKMSYMVGYGSHYPQRIHHRAS 401

Query: 379 SLPSMANYPQAIGCAKGKQYFQSNNPNPNLLIGAVVGGPDFNDSYADSRPDFVYSEPTTY 426
           SLPS+A +P  I C+ G     SN+PNPN+LIGAVVGGPD ND + D R DF  SEP+TY
Sbjct: 402 SLPSIAEHPAKIDCSSGFFVMHSNSPNPNVLIGAVVGGPDQNDEFPDERSDFEQSEPSTY 461

BLAST of Csa5G167210 vs. TAIR10
Match: AT4G23560.1 (AT4G23560.1 glycosyl hydrolase 9B15)

HSP 1 Score: 386.0 bits (990), Expect = 3.1e-107
Identity = 218/483 (45.13%), Positives = 290/483 (60.04%), Query Frame = 1

Query: 1   MKYFLVMILM---LKLVVVSSHDYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSALRDG 60
           M   LV++L+   L    + + +YGDAL KSILFFEGQRSGKLP NQRV WR DSAL DG
Sbjct: 1   MGKLLVLMLVGMFLAFESLEALEYGDALNKSILFFEGQRSGKLPTNQRVKWRADSALSDG 60

Query: 61  LEFGVDLVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGS--DLPYAMDSIRWATD 120
               V+L+GGYYDAGDNVKF +PM+FTTT+LSW+ +E+  ++ S   L Y   +I+W TD
Sbjct: 61  SLANVNLIGGYYDAGDNVKFVWPMSFTTTLLSWAAIEYQNEISSVNQLGYLRSTIKWGTD 120

Query: 121 YLLKATSVPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAA 180
           ++L+A + P  ++ QVGD  +DH CWERPEDMDT RT Y++S   PGSE + E AAALAA
Sbjct: 121 FILRAHTSPNMLYTQVGDGNSDHSCWERPEDMDTSRTLYSISSSSPGSEAAGEAAAALAA 180

Query: 181 SSMVFKPLDGGYSARLLKRARMVFEFADTYRGSYNDSLGRWVCPFYCSYSGYE------- 240
           +S+VFK +D  YS+ LL  A+ +FEFAD YRGSY  S     CPFYCSYSGY+       
Sbjct: 181 ASLVFKSVDSTYSSTLLNHAKTLFEFADKYRGSYQAS-----CPFYCSYSGYQDELLWAA 240

Query: 241 --------------FVLSTGNGSSSSNMFINYADKFVCSVLPESPSLLVSYSRGG----L 300
                         +V+S  + S + N F ++ +KFV      + +LLVS    G     
Sbjct: 241 AWLYKATGDKIYINYVISNKDWSQAVNEF-SWDNKFV-----GAQALLVSEFYNGANDLA 300

Query: 301 LFKS-----------GGSNIQHSTALSFLLIVYSNYLNQYKHI----------------- 360
            FKS           G S+ Q       LL +  +   QY                    
Sbjct: 301 KFKSDVESFVCAMMPGSSSQQIKPTPGGLLFIRDSSNLQYVTTATTVLFHYSKTLTKAGV 360

Query: 361 --LHCGNVVASPSRLLQLAKTQVDYILGSNPLGMSYMVGYGKNFPQRIHHRGSSLPSMAN 420
             + CG+   + S++   AK+QVDYILG+NP+ MSYMVG+G  +P + HHRGSSLPS+ +
Sbjct: 361 GSIQCGSTKFTVSQIRNFAKSQVDYILGNNPMKMSYMVGFGTKYPTQPHHRGSSLPSIQS 420

Query: 421 YPQAIGCAKGKQYFQSNNPNPNLLIGAVVGGPDFNDSYADSRPDFVYSEPTTYINAPLVG 424
            P+ I C  G  Y+ S+ PNPN+ IGA+VGGP+ +D Y+D + D+ ++EPTTYINA  +G
Sbjct: 421 KPEKIDCNGGYSYYNSDTPNPNVHIGAIVGGPNSSDQYSDKKSDYSHAEPTTYINAAFIG 472

BLAST of Csa5G167210 vs. TAIR10
Match: AT2G44550.1 (AT2G44550.1 glycosyl hydrolase 9B10)

HSP 1 Score: 339.0 bits (868), Expect = 4.4e-93
Identity = 200/472 (42.37%), Positives = 274/472 (58.05%), Query Frame = 1

Query: 5   LVMILMLKLVVVSSHDYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSALRDGLEFGVDL 64
           +V+I+M       S +Y +AL  S+L+FE QRSGKLPPNQRVTWR DSALRDG +  +DL
Sbjct: 18  IVLIVMSMAREAVSTNYAEALKNSLLYFEAQRSGKLPPNQRVTWRGDSALRDGSDAHIDL 77

Query: 65  VGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGS--DLPYAMDSIRWATDYLLKATS 124
            GGYYDAGDN+KF FP+AFTTTML+WS +E    + +  +   A+ +++WATDYL+KA  
Sbjct: 78  TGGYYDAGDNMKFGFPLAFTTTMLAWSNIEMASQLRAHHEKGNALRALKWATDYLIKAHP 137

Query: 125 VPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSMVFKP 184
            P  ++ QVG+  +DH CW RPEDM TPRT Y +  Q PGS+++ E AAA+AA+S+ F P
Sbjct: 138 QPNVLYGQVGEGNSDHKCWMRPEDMTTPRTSYRIDAQHPGSDLAGETAAAMAAASIAFAP 197

Query: 185 LDGGYSARLLKRARMVFEFADTYRGSYNDSLGRWVCPFYCSYSGYE----FVLSTGNGSS 244
            D  Y+  L+  A+ +F FA  +RG Y +S+      FY S SGYE    +  +  + ++
Sbjct: 198 SDKAYANILIGHAKDLFAFAKAHRGLYQNSIPN-AGGFYAS-SGYEDELLWAAAWLHRAT 257

Query: 245 SSNMFINYA---------------DKFV-CSVL---------PESPSLLVSYSRGGLLF- 304
           +  ++++Y                DKFV   VL          ES   +V Y      F 
Sbjct: 258 NDQIYLDYLTEAETGGPRTVFAWDDKFVGAQVLVAKLALEGKVESSEQIVEYKSMAEQFI 317

Query: 305 ----KSGGSNIQHSTA--LSFL--------------LIVYSNYLNQYKHILHCGNVVASP 364
               + G +N++ +    L FL              L  YS YL   K  + C +     
Sbjct: 318 CNCAQKGDNNVKKTPGGLLYFLPWNNLQYTTAATFVLSAYSKYLEAAKASIDCPDGALQA 377

Query: 365 SRLLQLAKTQVDYILGSNPLGMSYMVGYGKNFPQRIHHRGSSLPSMANYPQAIGCAKG-K 424
           S LLQ+A++QVDYILGSNP  MSYMVG G N+P++ HHR +S+ S+      + C+ G  
Sbjct: 378 SDLLQVARSQVDYILGSNPQKMSYMVGVGTNYPKKPHHRAASIVSIRQDKTPVTCSGGYD 437

BLAST of Csa5G167210 vs. TAIR10
Match: AT3G43860.1 (AT3G43860.1 glycosyl hydrolase 9A4)

HSP 1 Score: 334.0 bits (855), Expect = 1.4e-91
Identity = 189/449 (42.09%), Positives = 260/449 (57.91%), Query Frame = 1

Query: 20  DYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSALRDGLEFGVDLVGGYYDAGDNVKFSF 79
           +Y DALTKS++F E QRSGKLPPN RV WR DSAL DG    VDL GGYYDAGDNVK+  
Sbjct: 34  NYKDALTKSLIFLEAQRSGKLPPNNRVPWRGDSALDDGKLVNVDLSGGYYDAGDNVKYGL 93

Query: 80  PMAFTTTMLSWSVLEFGKDMGS--DLPYAMDSIRWATDYLLKATSVPGFVFAQVGDPYAD 139
           PMAFT T L+WS + + K++ +  +L  A  +IRW TDY LK  S    ++ QVGDP AD
Sbjct: 94  PMAFTITTLAWSTITYEKELRATGELENARAAIRWGTDYFLKCASRKNRLYVQVGDPNAD 153

Query: 140 HFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSMVFKPLDGGYSARLLKRAR- 199
           H CW RPE+M TPRT   +S + PG+E++AE AAA AASS+VF+ +D  Y+ RLL +A+ 
Sbjct: 154 HQCWARPENMKTPRTVLEISDKVPGTEIAAEAAAAFAASSIVFRHVDHKYARRLLNKAKL 213

Query: 200 ----------------------------MVFEFADTYRGSYN---------DSLGRWVCP 259
                                       +++     Y+ + N         +++  +V  
Sbjct: 214 LFKLAKSHKGTYDGECPFYCSNSGYNDELIWAATWLYKATRNHLYLSYLKFEAISAYVAE 273

Query: 260 FY--CSYSGYEFVLST--GNGSSSSNMFINYADKFVCSVLPESPSLLVSYSRGGLLFKSG 319
           F     Y+G + +++     G    +++   AD FVCS LP SP   V  + GG++    
Sbjct: 274 FSWDLKYAGAQILITKLIFEGHKGLDLYKQQADSFVCSNLPGSPYHQVFTTPGGMIHLRD 333

Query: 320 GSNIQHSTALSFLLIVYSNYLNQYKHILHCGNVVASPSRLLQLAKTQVDYILGSNPLGMS 379
           G+N Q+ TA +FL   Y++ L ++   + CG+     + L+  AK Q+DYILG NP G S
Sbjct: 334 GANSQYVTATAFLFSAYADILQKHNQKISCGSHQFDSTHLMAFAKKQIDYILGHNPQGRS 393

Query: 380 YMVGYGKNFPQRIHHRGSSLP-SMANYPQAIGCAKGKQYFQSNNPNPNLLIGAVVGGPDF 424
           YMVG+G N P++ HHRG+S+P   AN P +   +  K ++  N PN N L GA++GGPD 
Sbjct: 394 YMVGFGPNPPKQAHHRGASVPMHEANAPLSCPLSFVK-WYNKNVPNANELTGAILGGPDR 453

BLAST of Csa5G167210 vs. TAIR10
Match: AT2G44560.1 (AT2G44560.1 glycosyl hydrolase 9B11)

HSP 1 Score: 319.3 bits (817), Expect = 3.6e-87
Identity = 192/465 (41.29%), Positives = 258/465 (55.48%), Query Frame = 1

Query: 18  SHDYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSALRDGLEFGVDLVGGYYDAGDNVKF 77
           S +YGDALTKS+L+FE QRSGKLP NQRVTWR DSALRDG +  VDL GGYYDAGDN+KF
Sbjct: 31  SRNYGDALTKSLLYFEAQRSGKLPSNQRVTWRGDSALRDGSDAHVDLTGGYYDAGDNMKF 90

Query: 78  SFPMAFTTTMLSWSVLEFGKDMGS--DLPYAMDSIRWATDYLLKATSVPGFVFAQVGDPY 137
            FP+AF TTML+WS +E    + +  +   A+ +++WATD+L+KA   P  ++ QVGD  
Sbjct: 91  GFPLAFFTTMLAWSNIEMATQLKAHQEQENALAALKWATDFLIKAHPEPNVLYGQVGDGN 150

Query: 138 ADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSMVFKPLDGGYSARLLKRA 197
           +DH CW RPEDM TPR  + +  Q PGS+++ E AAA+AA+S+ F P D  Y+  L+  A
Sbjct: 151 SDHECWMRPEDMTTPRPSFRIDAQHPGSDLAGETAAAMAAASIAFAPSDEAYAQILIGHA 210

Query: 198 RMVFEFADTYRGSYNDSLGRWVCPFYCSYSGYE--------------------FVLSTGN 257
           + +FEFA  Y G Y +S+      FY S SGYE                      L+  +
Sbjct: 211 KELFEFAKAYPGIYQNSITN-AGGFYAS-SGYEDELLWAAAWLHRATNDQIYLDYLTQAS 270

Query: 258 GSSSSNMFINYADKFV-CSVL---------PESPSLLVSYSRGGLLF-----KSGGSNIQ 317
           G+       ++ DKFV   VL          ES   +  Y      F     + G +N++
Sbjct: 271 GTGGPRTAFSWDDKFVGAQVLVAKLALEGKVESNGKIAEYKSMAEQFICNCAQKGSNNVK 330

Query: 318 HSTALSFLLIVYSNYLNQYKHILHCGNVVAS---------------------PSRLLQLA 377
            +       + ++N   QY         V S                      S LL LA
Sbjct: 331 KTPGGLLYFLPWNNL--QY---TTAATFVLSAYSKYLTDAKASIQCPNGALQASDLLDLA 390

Query: 378 KTQVDYILGSNPLGMSYMVGYGKNFPQRIHHRGSSLPSMANYPQAIGCAKG-KQYFQSNN 424
           ++QVDYILGSNP  MSYMVG G N+P++ HHR +S+ S+      + C++G   +F +  
Sbjct: 391 RSQVDYILGSNPQNMSYMVGVGTNYPKKPHHRAASIVSITKDKTPVTCSEGFDAWFNNPA 450

BLAST of Csa5G167210 vs. TAIR10
Match: AT1G23210.1 (AT1G23210.1 glycosyl hydrolase 9B6)

HSP 1 Score: 307.8 bits (787), Expect = 1.1e-83
Identity = 148/230 (64.35%), Positives = 173/230 (75.22%), Query Frame = 1

Query: 5   LVMILMLKLVVVSSHDYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSALRDGLEFGVDL 64
           L M+L++     + HDY DAL KSILFFEGQRSGKLPP+QR+ WR+DSALRDG   GVDL
Sbjct: 13  LAMLLLISPETYAGHDYRDALRKSILFFEGQRSGKLPPDQRLKWRRDSALRDGSSAGVDL 72

Query: 65  VGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKATSVP 124
            GGYYDAGDNVKF FPMAFTTTM+SWSV++FGK MG +L  A+ +I+W TDYL+KAT +P
Sbjct: 73  TGGYYDAGDNVKFGFPMAFTTTMMSWSVIDFGKTMGPELENAVKAIKWGTDYLMKATQIP 132

Query: 125 GFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSMVFKPLD 184
             VF QVGD Y+DH CWERPEDMDT RT Y + K   GSEV+ E AAALAA+S+VF+  D
Sbjct: 133 DVVFVQVGDAYSDHNCWERPEDMDTLRTVYKIDKDHSGSEVAGETAAALAAASIVFEKRD 192

Query: 185 GGYSARLLKRARMVFEFADTYRGSYNDSLGRWVCPFYCSYSGYEFVLSTG 235
             YS  LL RA  VF FA  YRG+Y+DSL + VCPFYC ++GYE  L  G
Sbjct: 193 PVYSKMLLDRATRVFAFAQKYRGAYSDSLYQAVCPFYCDFNGYEDELLWG 242


HSP 2 Score: 250.0 bits (637), Expect = 2.7e-66
Identity = 118/186 (63.44%), Positives = 140/186 (75.27%), Query Frame = 1

Query: 243 FINYADKFVCSVLPESPSLLVSYSRGGLLFKSGGSNIQHSTALSFLLIVYSNYLNQYKHI 302
           F   AD+F+CS+LP      V YS+GGLL KSGGSN+QH T+LSFLL+ YSNYL+    +
Sbjct: 303 FKQNADEFICSLLPGISHPQVQYSQGGLLVKSGGSNMQHVTSLSFLLLTYSNYLSHANKV 362

Query: 303 LHCGNVVASPSRLLQLAKTQVDYILGSNPLGMSYMVGYGKNFPQRIHHRGSSLPSMANYP 362
           + CG   ASP+ L Q+AK QVDYILG NP+ MSYMVGYG  FPQ+IHHRGSS+PS+ ++P
Sbjct: 363 VPCGEFTASPALLRQVAKRQVDYILGDNPMKMSYMVGYGSRFPQKIHHRGSSVPSVVDHP 422

Query: 363 QAIGCAKGKQYFQSNNPNPNLLIGAVVGGPDFNDSYADSRPDFVYSEPTTYINAPLVGLL 422
             IGC  G +YF SNNPNPNLLIGAVVGGP+  D + DSRP F  +EPTTYINAPL+GLL
Sbjct: 423 DRIGCKDGSRYFFSNNPNPNLLIGAVVGGPNITDDFPDSRPYFQLTEPTTYINAPLLGLL 482

Query: 423 AYFKSH 429
            YF +H
Sbjct: 483 GYFSAH 488

BLAST of Csa5G167210 vs. NCBI nr
Match: gi|700195152|gb|KGN50329.1| (hypothetical protein Csa_5G167210 [Cucumis sativus])

HSP 1 Score: 882.5 bits (2279), Expect = 3.0e-253
Identity = 430/430 (100.00%), Positives = 430/430 (100.00%), Query Frame = 1

Query: 1   MKYFLVMILMLKLVVVSSHDYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSALRDGLEF 60
           MKYFLVMILMLKLVVVSSHDYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSALRDGLEF
Sbjct: 1   MKYFLVMILMLKLVVVSSHDYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSALRDGLEF 60

Query: 61  GVDLVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKA 120
           GVDLVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKA
Sbjct: 61  GVDLVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKA 120

Query: 121 TSVPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSMVF 180
           TSVPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSMVF
Sbjct: 121 TSVPGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSMVF 180

Query: 181 KPLDGGYSARLLKRARMVFEFADTYRGSYNDSLGRWVCPFYCSYSGYEFVLSTGNGSSSS 240
           KPLDGGYSARLLKRARMVFEFADTYRGSYNDSLGRWVCPFYCSYSGYEFVLSTGNGSSSS
Sbjct: 181 KPLDGGYSARLLKRARMVFEFADTYRGSYNDSLGRWVCPFYCSYSGYEFVLSTGNGSSSS 240

Query: 241 NMFINYADKFVCSVLPESPSLLVSYSRGGLLFKSGGSNIQHSTALSFLLIVYSNYLNQYK 300
           NMFINYADKFVCSVLPESPSLLVSYSRGGLLFKSGGSNIQHSTALSFLLIVYSNYLNQYK
Sbjct: 241 NMFINYADKFVCSVLPESPSLLVSYSRGGLLFKSGGSNIQHSTALSFLLIVYSNYLNQYK 300

Query: 301 HILHCGNVVASPSRLLQLAKTQVDYILGSNPLGMSYMVGYGKNFPQRIHHRGSSLPSMAN 360
           HILHCGNVVASPSRLLQLAKTQVDYILGSNPLGMSYMVGYGKNFPQRIHHRGSSLPSMAN
Sbjct: 301 HILHCGNVVASPSRLLQLAKTQVDYILGSNPLGMSYMVGYGKNFPQRIHHRGSSLPSMAN 360

Query: 361 YPQAIGCAKGKQYFQSNNPNPNLLIGAVVGGPDFNDSYADSRPDFVYSEPTTYINAPLVG 420
           YPQAIGCAKGKQYFQSNNPNPNLLIGAVVGGPDFNDSYADSRPDFVYSEPTTYINAPLVG
Sbjct: 361 YPQAIGCAKGKQYFQSNNPNPNLLIGAVVGGPDFNDSYADSRPDFVYSEPTTYINAPLVG 420

Query: 421 LLAYFKSHPN 431
           LLAYFKSHPN
Sbjct: 421 LLAYFKSHPN 430

BLAST of Csa5G167210 vs. NCBI nr
Match: gi|823121401|ref|XP_012466569.1| (PREDICTED: endoglucanase 8-like isoform X1 [Gossypium raimondii])

HSP 1 Score: 623.6 bits (1607), Expect = 2.5e-175
Identity = 310/463 (66.95%), Positives = 359/463 (77.54%), Query Frame = 1

Query: 15  VVSSHDYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSALRDGLEFGVDLVGGYYDAGDN 74
           +V+SHDYG ALTKSILF+EGQRSGKLPP QR+TWRKDSALRDG E GVDLVGGYYDAGDN
Sbjct: 19  LVASHDYGAALTKSILFYEGQRSGKLPPTQRITWRKDSALRDGFEIGVDLVGGYYDAGDN 78

Query: 75  VKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKATSVPGFVFAQVGDP 134
           VKF+FPMAF+ TML+WS+LEFG+ +G+DL +++ +I+W TDYLLKATSVPGFVFAQVGDP
Sbjct: 79  VKFTFPMAFSITMLAWSLLEFGQSLGTDLQHSLKAIQWGTDYLLKATSVPGFVFAQVGDP 138

Query: 135 YADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSMVFKPLDGGYSARLLKR 194
           Y DH CWERPEDMDTPRTPYAVSK+FPGSEVSAEIAAALAASSMVF+P++ GYSARLLKR
Sbjct: 139 YGDHNCWERPEDMDTPRTPYAVSKEFPGSEVSAEIAAALAASSMVFRPINRGYSARLLKR 198

Query: 195 ARMVFEFADTYRGSYNDSLG---------------------RWV-----CPFYCSY---- 254
           ARM+FEFAD YRGSYNDSLG                      W+      P+Y +Y    
Sbjct: 199 ARMIFEFADKYRGSYNDSLGPWACPFYCDYSGYQDELVWGAAWLLRATKAPYYRNYVLAN 258

Query: 255 -------------------SGYEFVLSTGNGSSSSNMFINYADKFVCSVLPESPSLLVSY 314
                              +G   ++S    S +   FI  ADKFVCSVLPESP++ VSY
Sbjct: 259 IQNLDKSSSFAEFGWDTKHAGINVLVSRLIKSQTPEPFITNADKFVCSVLPESPTISVSY 318

Query: 315 SRGGLLFKSGGSNIQHSTALSFLLIVYSNYLNQYKHILHCGNVVASPSRLLQLAKTQVDY 374
           S GGLL K GGSN+QH+TALSFLL+VYS  L++   ++HCGNVVA+P+RL+Q+A++QVDY
Sbjct: 319 SPGGLLIKPGGSNLQHATALSFLLLVYSRPLSKDSRVIHCGNVVATPARLIQVARSQVDY 378

Query: 375 ILGSNPLGMSYMVGYGKNFPQRIHHRGSSLPSMANYPQAIGCAKGKQYFQSNNPNPNLLI 429
           ILGSNPL MSYMVGYG+ FP+RIHHRGSSLPS+  +PQ I C  G  YF +NNPNPNLL 
Sbjct: 379 ILGSNPLNMSYMVGYGEKFPERIHHRGSSLPSITQHPQHIDCTGGATYFYTNNPNPNLLT 438

BLAST of Csa5G167210 vs. NCBI nr
Match: gi|629126403|gb|KCW90828.1| (hypothetical protein EUGRSUZ_A02886 [Eucalyptus grandis])

HSP 1 Score: 583.9 bits (1504), Expect = 2.2e-163
Identity = 294/454 (64.76%), Positives = 343/454 (75.55%), Query Frame = 1

Query: 13  LVVVSSHDYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSALRDGLEFGVDLVGGYYDAG 72
           +V VS HDYGDAL+KSILFFEGQRSGKLPP+QR+TWRKDS LRDG +  +DLVGGYYDAG
Sbjct: 1   MVSVSGHDYGDALSKSILFFEGQRSGKLPPSQRLTWRKDSGLRDGFDKHIDLVGGYYDAG 60

Query: 73  DNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKATSVPGFVFAQVG 132
           DNVKF+FPMAF+TTML+WSV+EFG+ MG DL  A+D++RWATDY LKATSVPG V+AQVG
Sbjct: 61  DNVKFNFPMAFSTTMLAWSVIEFGRGMGPDLRKAVDAVRWATDYFLKATSVPGLVYAQVG 120

Query: 133 DPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSMVFKPLDGGYSARLL 192
           +PY DH CWERPEDMDT RT YAVS Q PGSEVSAEIAAALAA+S+VFK ++  YS  LL
Sbjct: 121 EPYGDHECWERPEDMDTARTAYAVSAQSPGSEVSAEIAAALAAASLVFKRVNHNYSDLLL 180

Query: 193 KRARMVFEFADTYRGSYNDSLGRWVCPFYCSYSG---------------------YEFVL 252
            RA+ VFEFAD YRGSYN SLG  VCPFYC++ G                     + +V 
Sbjct: 181 SRAKTVFEFADKYRGSYNYSLGSVVCPFYCNFGGYEDDLIWAAAWLFRATAAPNYWNYVT 240

Query: 253 ST-----GN----GSSSSNMFINY------ADKFVCSVLPESPSLLVSYSRGGLLFKSGG 312
                  GN    G  + +  IN       ADKFVCS+LPESP+  VSYS GGLLFK GG
Sbjct: 241 ENIPTEGGNFAEFGWDTKDAGINVLASKLNADKFVCSILPESPTKYVSYSPGGLLFKPGG 300

Query: 313 SNIQHSTALSFLLIVYSNYLNQYKHILHCGNVVASPSRLLQLAKTQVDYILGSNPLGMSY 372
           SN+QH+TALSFLLIVY+N LN+   ++ CG+V A+  RL+Q+A+ Q DYILGSNP+ MSY
Sbjct: 301 SNLQHATALSFLLIVYANSLNRANRVVQCGSVQATSDRLIQVARAQADYILGSNPMKMSY 360

Query: 373 MVGYGKNFPQRIHHRGSSLPSMANYPQAIGCAKGKQYFQSNNPNPNLLIGAVVGGPDFND 431
           MVGYG  FPQRIHHRGSSLPS+  +PQ IGC  G  YF+S+ PNPN L GAVVGGPD  D
Sbjct: 361 MVGYGGKFPQRIHHRGSSLPSLDQHPQHIGCKDGTPYFKSSGPNPNQLTGAVVGGPDIQD 420

BLAST of Csa5G167210 vs. NCBI nr
Match: gi|702250990|ref|XP_010062163.1| (PREDICTED: endoglucanase 8-like [Eucalyptus grandis])

HSP 1 Score: 577.8 bits (1488), Expect = 1.6e-161
Identity = 294/475 (61.89%), Positives = 346/475 (72.84%), Query Frame = 1

Query: 6   VMILMLKLVVVSSHDYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSALRDGLEFGVDLV 65
           ++++   +V VS HDYGDAL+KSILFFEGQRSGKLPP+QR+TWRKDS LRDG +  +DLV
Sbjct: 1   MLMVAAMMVSVSGHDYGDALSKSILFFEGQRSGKLPPSQRLTWRKDSGLRDGFDKHIDLV 60

Query: 66  GGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKATSVPG 125
           GGYYDAGDNVKF+FPMAF+TTML+WSV+EFG+ MG DL  A+D++RWATDY LKATSVPG
Sbjct: 61  GGYYDAGDNVKFNFPMAFSTTMLAWSVIEFGRGMGPDLRKAVDAVRWATDYFLKATSVPG 120

Query: 126 FVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSMVFKPLDG 185
            V+AQVG+PY DH CWERPEDMDT RT YAVS Q PGSEVSAEIAAALAA+S+VFK ++ 
Sbjct: 121 LVYAQVGEPYGDHECWERPEDMDTARTAYAVSAQSPGSEVSAEIAAALAAASLVFKRVNH 180

Query: 186 GYSARLLKRARMVFEFADTYRGSYNDSLGRWVCPFYCS---------------------- 245
            YS  LL RA+ VFEFAD YRGSYN SLG  VCPFYC+                      
Sbjct: 181 NYSDLLLSRAKTVFEFADKYRGSYNYSLGSVVCPFYCNFGGYEDDLIWAAAWLFRATAAP 240

Query: 246 -YSGY-------------EFVLSTGNG--------------SSSSNMFINYADKFVCSVL 305
            Y  Y             EF   T +               S  +++F   ADKFVCS+L
Sbjct: 241 NYWNYVTENIPTEGGNFAEFGWDTKDAGINVLASKQILTKDSKYTDIFKLNADKFVCSIL 300

Query: 306 PESPSLLVSYSRGGLLFKSGGSNIQHSTALSFLLIVYSNYLNQYKHILHCGNVVASPSRL 365
           PESP+  VSYS GGLLFK GGSN+QH+TALSFLLIVY+N LN+   ++ CG+V A+  RL
Sbjct: 301 PESPTKYVSYSPGGLLFKPGGSNLQHATALSFLLIVYANSLNRANRVVQCGSVQATSDRL 360

Query: 366 LQLAKTQVDYILGSNPLGMSYMVGYGKNFPQRIHHRGSSLPSMANYPQAIGCAKGKQYFQ 425
           +Q+A+ Q DYILGSNP+ MSYMVGYG  FPQRIHHRGSSLPS+  +PQ IGC  G  YF+
Sbjct: 361 IQVARAQADYILGSNPMKMSYMVGYGGKFPQRIHHRGSSLPSLDQHPQHIGCKDGTPYFK 420

Query: 426 SNNPNPNLLIGAVVGGPDFNDSYADSRPDFVYSEPTTYINAPLVGLLAYFKSHPN 431
           S+ PNPN L GAVVGGPD  D Y DSR +FV+SEPTTYINAPLVG+LA+ K  PN
Sbjct: 421 SSGPNPNQLTGAVVGGPDIQDQYNDSRVEFVHSEPTTYINAPLVGVLAFLKGRPN 475

BLAST of Csa5G167210 vs. NCBI nr
Match: gi|658037276|ref|XP_008354206.1| (PREDICTED: endoglucanase 8-like isoform X3 [Malus domestica])

HSP 1 Score: 567.8 bits (1462), Expect = 1.6e-158
Identity = 296/475 (62.32%), Positives = 342/475 (72.00%), Query Frame = 1

Query: 6   VMILMLKLVVVSS--HDYGDALTKSILFFEGQRSGKLPPNQRVTWRKDSALRDGLEFGVD 65
           +M+ M+  VV SS  HDYGDALTKSILFFEGQRSGKLP +QR+TWRKDSALRDG E GVD
Sbjct: 8   IMVAMVMTVVSSSSTHDYGDALTKSILFFEGQRSGKLPSSQRMTWRKDSALRDGFEIGVD 67

Query: 66  LVGGYYDAGDNVKFSFPMAFTTTMLSWSVLEFGKDMGSDLPYAMDSIRWATDYLLKATSV 125
           LVGGYYDAGDNVKF+FPMAF+TTML+WSVLEFGKDM SDLP+A+++IRWATDY LKATS+
Sbjct: 68  LVGGYYDAGDNVKFNFPMAFSTTMLAWSVLEFGKDMSSDLPHALBTIRWATDYFLKATSI 127

Query: 126 PGFVFAQVGDPYADHFCWERPEDMDTPRTPYAVSKQFPGSEVSAEIAAALAASSMVFKPL 185
           PGF F QVGDPY DH CWERPE M TPRTP+AVSKQFPGSEVSAEIAAALAASSMVF+ +
Sbjct: 128 PGFXFVQVGDPYGDHNCWERPEXMXTPRTPFAVSKQFPGSEVSAEIAAALAASSMVFRQI 187

Query: 186 D----------------------GGYSARLLKRARMVFEFADTYRGSYNDSL---GRWVC 245
           D                      G Y+  L      V  F   + G Y D L     W+ 
Sbjct: 188 DRGYSARLLKRAKMVFDFADKYQGSYNDSL---GPWVCPFYCDFSG-YEDELIWGAAWLF 247

Query: 246 PFYCSYSGYEFVLS------TGNGS--------------------SSSNMFINYADKFVC 305
                 + + +VL       +GN +                    + S  FI+ AD+F+C
Sbjct: 248 KATKQINYWNYVLQNMPKLDSGNTNEFGWDSKHAGINVLVSKVINTESLPFISNADRFIC 307

Query: 306 SVLPESPSLLVSYSRGGLLFKSGGSNIQHSTALSFLLIVYSNYLNQYKHILHCGNVVASP 365
           ++LPESP+L VSYS GGLLFK GGSN+QH+T LSFLL+VY+ YLNQ K  +HCGNVVASP
Sbjct: 308 TLLPESPTLSVSYSPGGLLFKPGGSNLQHATTLSFLLVVYARYLNQAKRAVHCGNVVASP 367

Query: 366 SRLLQLAKTQVDYILGSNPLGMSYMVGYGKNFPQRIHHRGSSLPSMANYPQAIGCAKGKQ 425
           +RL+QLAK Q DYILGSNPL MSYMVGYG+ FPQ IHHRGSSLPS+  +P+ I C  G  
Sbjct: 368 ARLVQLAKGQADYILGSNPLSMSYMVGYGEKFPQNIHHRGSSLPSLDQHPEPIDCKGGND 427

Query: 426 YFQSNNPNPNLLIGAVVGGPDFNDSYADSRPDFVYSEPTTYINAPLVGLLAYFKS 428
           Y  S NPNPNLLIGA+VGGPD  D+Y DSR DFV+SEPTTYINAP VG+LAYFKS
Sbjct: 428 YLYSENPNPNLLIGAIVGGPDIKDAYVDSREDFVHSEPTTYINAPFVGVLAYFKS 478
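
The identity and positives percentages in each HSP header are the matched-column counts divided by the alignment length. The following is a minimal Python sketch, assuming the header layout used in the HSP lines above, that re-derives both values from one header line. The function name `recompute_hsp_percentages` is hypothetical and not part of any BLAST tool; the pattern accepts either spelling of the positives label.

```python
import re

# Matches "Identity = a/b (p%), Positives = c/d (q%)" as printed in the HSP headers.
# "Posi?tives" tolerates the label with or without the second "i".
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def recompute_hsp_percentages(line):
    """Return recomputed and reported percentages for one HSP header line."""
    match = HSP_LINE.search(line)
    if match is None:
        raise ValueError("not an HSP identity line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        # Recomputed from the raw counts, rounded to two decimals like the page.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the header, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }

header = "Identity = 192/465 (41.29%), Positives = 258/465 (55.48%), Query Frame = 1"
result = recompute_hsp_percentages(header)
```

For the AT2G44560.1 header, 192/465 and 258/465 recompute to 41.29 and 55.48, which agree with the reported values.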

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
GUN3_ORYSJ   1.4e-125  50.87         Endoglucanase 3 OS=Oryza sativa subsp. japonica GN=GLU8 PE=2 SV=1
GUN4_ORYSJ   5.7e-111  46.11         Endoglucanase 4 OS=Oryza sativa subsp. japonica GN=GLU14 PE=2 SV=1
GUN20_ARATH  5.6e-106  45.13         Endoglucanase 20 OS=Arabidopsis thaliana GN=At4g23560 PE=2 SV=1
GUN_PHAVU    4.9e-102  44.62         Endoglucanase OS=Phaseolus vulgaris PE=2 SV=2
GUN20_ORYSJ  3.5e-92   42.08         Endoglucanase 20 OS=Oryza sativa subsp. japonica GN=GLU15 PE=2 SV=1

Match Name        E-value   Identity (%)  Description
A0A0A0KN43_CUCSA  2.1e-253  100.00        Endoglucanase OS=Cucumis sativus GN=Csa_5G167210 PE=3 SV=1
A0A0D2PJD4_GOSRA  1.8e-175  66.95         Endoglucanase OS=Gossypium raimondii GN=B456_001G048100 PE=3 SV=1
A0A059DJ18_EUCGR  1.5e-163  64.76         Endoglucanase OS=Eucalyptus grandis GN=EUGRSUZ_A02886 PE=3 SV=1
V7ACZ4_PHAVU      6.5e-154  60.89         Endoglucanase OS=Phaseolus vulgaris GN=PHAVU_011G005600g PE=3 SV=1
A0A0A0KWL5_CUCSA  2.8e-136  57.64         Endoglucanase OS=Cucumis sativus GN=Csa_4G001940 PE=3 SV=1

Match Name   E-value   Identity (%)  Description
AT4G23560.1  3.1e-107  45.13         glycosyl hydrolase 9B15
AT2G44550.1  4.4e-93   42.37         glycosyl hydrolase 9B10
AT3G43860.1  1.4e-91   42.09         glycosyl hydrolase 9A4
AT2G44560.1  3.6e-87   41.29         glycosyl hydrolase 9B11
AT1G23210.1  1.1e-83   64.35         glycosyl hydrolase 9B6

Match Name                        E-value   Identity (%)  Description
gi|700195152|gb|KGN50329.1|       3.0e-253  100.00        hypothetical protein Csa_5G167210 [Cucumis sativus]
gi|823121401|ref|XP_012466569.1|  2.5e-175  66.95         PREDICTED: endoglucanase 8-like isoform X1 [Gossypium raimondii]
gi|629126403|gb|KCW90828.1|       2.2e-163  64.76         hypothetical protein EUGRSUZ_A02886 [Eucalyptus grandis]
gi|702250990|ref|XP_010062163.1|  1.6e-161  61.89         PREDICTED: endoglucanase 8-like [Eucalyptus grandis]
gi|658037276|ref|XP_008354206.1|  1.6e-158  62.32         PREDICTED: endoglucanase 8-like isoform X3 [Malus domestica]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR001701  Glyco_hydro_9
IPR008928  6-hairpin_glycosidase_sf
IPR012341  6hp_glycosidase-like_sf
IPR018221  Glyco_hydro_9_His_AS
Vocabulary: Molecular Function
Term        Definition
GO:0004553  hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0003824  catalytic activity
Vocabulary: Biological Process
Term        Definition
GO:0005975  carbohydrate metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0030245 cellulose catabolic process
biological_process GO:0005982 starch metabolic process
biological_process GO:0005985 sucrose metabolic process
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0008810 cellulase activity
molecular_function GO:0003824 catalytic activity
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Csa5G167210.1  Csa5G167210.1  mRNA


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term   IPR Description                                Source   Source Term       Source Description        Alignment
IPR001701  Glycoside hydrolase family 9                   PFAM     PF00759           Glyco_hydro_9             coord: 21..226, score: 2.9E-72; coord: 244..422, score: 2.5
IPR008928  Six-hairpin glycosidase-like                   unknown  SSF48208          Six-hairpin glycosidases  coord: 16..427, score: 3.06E
IPR012341  Six-hairpin glycosidase                        GENE3D   G3DSA:1.50.10.10  -                         coord: 240..426, score: 4.3E-52; coord: 18..228, score: 1.5
IPR018221  Glycoside hydrolase family 9, His active site  PROSITE  PS00592           GLYCOSYL_HYDROL_F9_1      coord: 335..351, scor
None       No IPR available                               PANTHER  PTHR22298         ENDO-1,4-BETA-GLUCANASE   coord: 6..429, score: 6.7E
None       No IPR available                               PANTHER  PTHR22298:SF45    SUBFAMILY NOT NAMED       coord: 6..429, score: 6.7E
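
The `coord` ranges above can be turned into domain lengths and a coverage fraction. The following is a small Python sketch under two assumptions: that the coordinates are 1-based and inclusive, and that the protein is 430 residues long (taken from the 430/430 self-hit in the NCBI nr results). The helper names `span_length` and `coverage` are hypothetical.

```python
def span_length(start, end):
    """Number of residues in an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1

def coverage(hits, protein_length):
    """Fraction of the protein covered by at least one hit (overlaps counted once)."""
    covered = set()
    for start, end in hits:
        covered.update(range(start, end + 1))
    return len(covered) / protein_length

# The two PF00759 (Glyco_hydro_9) hits listed in the table.
PFAM_HITS = [(21, 226), (244, 422)]
# Assumed protein length, from the 430/430 self-alignment.
PROTEIN_LENGTH = 430

lengths = [span_length(start, end) for start, end in PFAM_HITS]
fraction = coverage(PFAM_HITS, PROTEIN_LENGTH)
```

Under these assumptions the two Pfam hits span 206 and 179 residues, which together cover 385 of 430 residues (about 89.5% of the protein).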

The following gene(s) are paralogous to this gene:

None