Cucsa.303540 (gene) Cucumber (Gy14) v1

Name: Cucsa.303540
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v1)
Description: Endoglucanase
Location: scaffold02951 : 1425234 .. 1428337 (-)
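The location field packs the scaffold name, start and end coordinates, and strand into one string. The sketch below shows one way to split it apart. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Split 'scaffold : start .. end (strand)' into its parts.

    Assumption: coordinates are 1-based and inclusive, so the
    feature length is end - start + 1.
    """
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    scaffold, start, end, strand = m.groups()
    start, end = int(start), int(end)
    return {
        "scaffold": scaffold,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("scaffold02951 : 1425234 .. 1428337 (-)")
print(loc["scaffold"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the feature spans 3104 bases on the minus strand.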
The following sequences are available for this feature:

Gene sequence (with intron)

ATCTCGATCGAACCTTCAACTATAGTGATGCCCTTGGCAAAGCTGTGTTGTTCTTTGAGGGACAACGGTCAGGGAAGTTGCCTGTGACGCAACGAGTCAAGTGGCGGGGCAATTCTGCACTCTCTGATGGGAGCTATGAAAATGTATGAGAAAAACGAGTAATTTCTTTTGTGGTGGTTGTTTAATTAGTAAGAGTATAAACAAATCATTTAACTTTTAAAAGTCAGTGTTCTTTAATAAGAGCGTCATAGTAGAAAACTTGTGTTGAACTTTGCAAGACAATTGCATGGTGCACATTCTACGTTTAGGAATGTTTAAAATTGGATTGTATAAAGTGATCTTAGGTATAACATTATATGTTTTAGGAGACTATATTAATCTCGGCCTTCACAAACAAAGTTAGCTTCTTAAGATCAAAACACCTAAGTTGCTTAAACTGATCTAACCCCAAATATGGTGTCATGATCATGTCATTTCTTTTCTTAACATTCATAAGTAATTAACAACATATATAAATAAAAATGAGACAAAGCCATAACTCATTAGTTTAAGATGTGAGATGTAAAACATGGCTTCTTTTTAAAATGTTTTGGAGTGTGGTCTTTGGCAGGTGAATTTGGTGGGAGGCTACTACGATGCTGGCGACAATGTGAAGTTCGGGTGGCCCATGGCGTTTACCCTGACATTGTTGAGTTGGACCACCGTGGAATATGAAAAAGAGATAGCATCTGTGATGCAGTTGGAACACCTCCGGAGCTCAGTCCGGTGGGGCACCGACTTCATTTTACGAGCTCATGTTTCACCTACCACACTCTATACTCAGGTTTGCATCATATCATTGTTTCCTAATTTTAGTGATCTTCTATGTATATACACACACTTCAAATTTCTACTTGAAAGAAGAAAGGAAAAAAAAAGAAGATTTCTTTTATTTGGTATATTAAACAAATAAAATAAATAATTTTGGTGAAGACACATTAATAATCTTGTATACAATTTTTTTAGTACATTCAAAACGTAGGTGGAAAGATTTCAAATCTTAGACCTTTTGGTTGGTTATATATACTAAAATTTTAACACGAAATTTTAAAACTTGAAGTGAGAATTAGTCTAGAGTCCTATATGCTTCAACATAATCTTACTTTTTAGTTTGGTAATTGAGTGATACCGAAGTCAACAATAATAATCTCTATAAAATATGAAAACTGAAAACAAAATAATTTCTAAAGTGTAAAATATTGCATGAAAACACACAGGTAGGAGATGCAAATGGTGATCATCAATGTTGGGAGCGGCCGGAGGACATGGACACCCCTCGAACGCTGTACAAGATAACGCCTAATTCACCCGGCACCGAAGCTGCCGCCGAGGCTGCAGCTGCACTCGCTGCTGCTTCCATCTTGTTCAACCGTGTCGATGCTAATTACTCCAGAAGGCTTCTTCAACACTCAAAATCTGTGAGAATTTTAATTCAATCAATTATGGTTACTTCATTAAAGGCCACCCTAGCTAGTCATATTTTAACTACTGTGTGTATTAAAAGGTTGTCGTTCTTTTGGCATTGGATAGCTCTTTCAGTTTGCTGACAAGTTTAGAGGATCTTACTCTGCTTCTTGTCCATTCTATTGTTCTTATTCTGGATATCAGGTAATTAATCAACTTCGATACATTATTACTAACCCCATCATTTGTAGATATGGGATCGAGTTTAAAGTTAATTGTTGATTGTAGGATGAGTTGTTGTGGGCAGCTGCTTGGTTGTATAAAGCAAGTGGAAATATGAAATATTTGAGGTATGTTTTAAGCAATCAATGGTGGAGTCAACCCACATCCGAGTTTAGTTGGGACAACAAATTTGTTGGAGCTCAAATACTATTAACAAAGGTTCTTAAGAAAAACTATTTTTCTTTCTTTTTTAGATTTTTAGAAAACTATAGTTTAATTTCTCTAATTCTAAAAGCTGTGAACGAACTGATCTACAGGAATTCTATAAAGGAAAG
AAAAACTTGAGCAAGTTTAAGAACGATGTTGAAACGTTCATATGCAAACTTATGCCTGATGATGGTGGCTCTTCCGAGATTTCGCGAACACCTGGTATGGATTGCGTCACGATAGATACAACATATGGAGATCAATGAATTTAAATTGTTTAAAATAATTTATTTTTTCATTTTCAAAACTATGAAGTAATATAGTAATGTGCATGCTTTTACGATATGTTTGGGAGTGATTTCAAAATGGTTAAAATAACATCTTGTTGCTGATTATTCAAAATTTGATTTGCTTCGTTCAATATAATATAATAATATAATGCATGTTTCATTCTGGTAGGTGGACTTCTTTTCCTAAGAGATAATAGTAATCTGCAATATACATCAAGCTCGTCCATGGTGCTTTTCATGTATTCTAGACTCTTAAACCAAGCTCATATCTATGGAATTCACTGTGGATCCAAATATTTTTCTTCTTCTCAAATCAAAACCTTTGCCAAATCACAGGTAAGTTTTATGGAGAGATTCTCATTTGATTATTTTTCAACAATTGCCTAAAAAAAATATGACAATATAGTTGTAAAAACCCTAGTTTATAACCATTATATTTAACAGAACTACAAGAAAATAGTGATTTTAACCAATATATATATCACTATATAAAAAAAAGTACAATCAATGATAATAAATCACTATACTTTAATAATCTAAAGATACTAAAATTACTTCTTTTTATAATAATGTCAAATTCATGGACGATAATCTAATTAGTATCTAATTTCTTGCATATAGGTGGATTATATATTGGGGAAAAATACGTTGAAGATGTCATACATGGTAGGATTTGGTAACAAATATCCATTGCAATTGCATCATAGAGCCTCATCCATCCCTTCAACAAAAGTGCTCTCAACAAAGGTTGGTTGTAACGATGGTCGCTCAAGCTATTTTTATTCAAATGGTCCGAATCCCAACACACATATTGGTTCTATAGTAGGAGGTCCTTATTTAAACGATGAATTCAGCGATTTGAGATCGGACTACTCTCATTCCGAGCCTACTACTTATATGAATGCTGCTTTTGTTGGTTCGGTAGCTGCACTGGTTGTGTAA

mRNA sequence

ATCTCGATCGAACCTTCAACTATAGTGATGCCCTTGGCAAAGCTGTGTTGTTCTTTGAGGGACAACGGTCAGGGAAGTTGCCTGTGACGCAACGAGTCAAGTGGCGGGGCAATTCTGCACTCTCTGATGGGAGCTATGAAAATGTGAATTTGGTGGGAGGCTACTACGATGCTGGCGACAATGTGAAGTTCGGGTGGCCCATGGCGTTTACCCTGACATTGTTGAGTTGGACCACCGTGGAATATGAAAAAGAGATAGCATCTGTGATGCAGTTGGAACACCTCCGGAGCTCAGTCCGGTGGGGCACCGACTTCATTTTACGAGCTCATGTTTCACCTACCACACTCTATACTCAGGTAGGAGATGCAAATGGTGATCATCAATGTTGGGAGCGGCCGGAGGACATGGACACCCCTCGAACGCTGTACAAGATAACGCCTAATTCACCCGGCACCGAAGCTGCCGCCGAGGCTGCAGCTGCACTCGCTGCTGCTTCCATCTTGTTCAACCGTGTCGATGCTAATTACTCCAGAAGGCTTCTTCAACACTCAAAATCTCTCTTTCAGTTTGCTGACAAGTTTAGAGGATCTTACTCTGCTTCTTGTCCATTCTATTGTTCTTATTCTGGATATCAGGATGAGTTGTTGTGGGCAGCTGCTTGGTTGTATAAAGCAAGTGGAAATATGAAATATTTGAGGTATGTTTTAAGCAATCAATGGTGGAGTCAACCCACATCCGAGTTTAGTTGGGACAACAAATTTGTTGGAGCTCAAATACTATTAACAAAGGAATTCTATAAAGGAAAGAAAAACTTGAGCAAGTTTAAGAACGATGTTGAAACGTTCATATGCAAACTTATGCCTGATGATGGTGGCTCTTCCGAGATTTCGCGAACACCTGGTGGACTTCTTTTCCTAAGAGATAATAGTAATCTGCAATATACATCAAGCTCGTCCATGGTGCTTTTCATGTATTCTAGACTCTTAAACCAAGCTCATATCTATGGAATTCACTGTGGATCCAAATATTTTTCTTCTTCTCAAATCAAAACCTTTGCCAAATCACAGGTGgattatatattggggaaaaatacgttgaagatgtcatacatggtaggatttggtaacaaatatccattgcaattgcatcatagagcctcatccatcccttcaacaaaagtgctctcaacaaaggttggttgtaacgatggtcgctcaagctatttttattcaaatggtccgaatcccaacacacatattggttctatagtaggaggtccttatttaaacgatgaattcagcgatttgagatcggactactctcattccgagcctactacttatatgaatgctgcttttgttggttcGGTAGCTGCACTGGTTGTGTAA

Coding sequence (CDS)

ATCTCGATCGAACCTTCAACTATAGTGATGCCCTTGGCAAAGCTGTGTTGTTCTTTGAGGGACAACGGTCAGGGAAGTTGCCTGTGACGCAACGAGTCAAGTGGCGGGGCAATTCTGCACTCTCTGATGGGAGCTATGAAAATGTGAATTTGGTGGGAGGCTACTACGATGCTGGCGACAATGTGAAGTTCGGGTGGCCCATGGCGTTTACCCTGACATTGTTGAGTTGGACCACCGTGGAATATGAAAAAGAGATAGCATCTGTGATGCAGTTGGAACACCTCCGGAGCTCAGTCCGGTGGGGCACCGACTTCATTTTACGAGCTCATGTTTCACCTACCACACTCTATACTCAGGTAGGAGATGCAAATGGTGATCATCAATGTTGGGAGCGGCCGGAGGACATGGACACCCCTCGAACGCTGTACAAGATAACGCCTAATTCACCCGGCACCGAAGCTGCCGCCGAGGCTGCAGCTGCACTCGCTGCTGCTTCCATCTTGTTCAACCGTGTCGATGCTAATTACTCCAGAAGGCTTCTTCAACACTCAAAATCTCTCTTTCAGTTTGCTGACAAGTTTAGAGGATCTTACTCTGCTTCTTGTCCATTCTATTGTTCTTATTCTGGATATCAGGATGAGTTGTTGTGGGCAGCTGCTTGGTTGTATAAAGCAAGTGGAAATATGAAATATTTGAGGTATGTTTTAAGCAATCAATGGTGGAGTCAACCCACATCCGAGTTTAGTTGGGACAACAAATTTGTTGGAGCTCAAATACTATTAACAAAGGAATTCTATAAAGGAAAGAAAAACTTGAGCAAGTTTAAGAACGATGTTGAAACGTTCATATGCAAACTTATGCCTGATGATGGTGGCTCTTCCGAGATTTCGCGAACACCTGGTGGACTTCTTTTCCTAAGAGATAATAGTAATCTGCAATATACATCAAGCTCGTCCATGGTGCTTTTCATGTATTCTAGACTCTTAAACCAAGCTCATATCTATGGAATTCACTGTGGATCCAAATATTTTTCTTCTTCTCAAATCAAAACCTTTGCCAAATCACAGGTGGATTATATATTGGGGAAAAATACGTTGAAGATGTCATACATGGTAGGATTTGGTAACAAATATCCATTGCAATTGCATCATAGAGCCTCATCCATCCCTTCAACAAAAGTGCTCTCAACAAAGGTTGGTTGTAACGATGGTCGCTCAAGCTATTTTTATTCAAATGGTCCGAATCCCAACACACATATTGGTTCTATAGTAGGAGGTCCTTATTTAAACGATGAATTCAGCGATTTGAGATCGGACTACTCTCATTCCGAGCCTACTACTTATATGAATGCTGCTTTTGTTGGTTCGGTAGCTGCACTGGTTGTGTAA

Protein sequence

LDRTFNYSDALGKAVLFFEGQRSGKLPVTQRVKWRGNSALSDGSYENVNLVGGYYDAGDNVKFGWPMAFTLTLLSWTTVEYEKEIASVMQLEHLRSSVRWGTDFILRAHVSPTTLYTQVGDANGDHQCWERPEDMDTPRTLYKITPNSPGTEAAAEAAAALAAASILFNRVDANYSRRLLQHSKSLFQFADKFRGSYSASCPFYCSYSGYQDELLWAAAWLYKASGNMKYLRYVLSNQWWSQPTSEFSWDNKFVGAQILLTKEFYKGKKNLSKFKNDVETFICKLMPDDGGSSEISRTPGGLLFLRDNSNLQYTSSSSMVLFMYSRLLNQAHIYGIHCGSKYFSSSQIKTFAKSQVDYILGKNTLKMSYMVGFGNKYPLQLHHRASSIPSTKVLSTKVGCNDGRSSYFYSNGPNPNTHIGSIVGGPYLNDEFSDLRSDYSHSEPTTYMNAAFVGSVAALVV*
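The relationship between the coding sequence and the protein sequence above can be checked with a small translation sketch. It uses the standard genetic code. The helper name `translate` and its `offset` parameter are illustrative, not part of any database tool. For this record, reading the listed CDS from its third nucleotide (offset 2) appears to reproduce the start of the listed protein, which suggests the gene model is partial at its 5' end; that interpretation is an inference from the sequences, not something the page states.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(nt, offset=0):
    """Translate a nucleotide string, starting at a 0-based offset.

    Trailing bases that do not fill a codon are ignored; codons with
    unexpected characters are reported as 'X'.
    """
    nt = nt.upper()
    usable = len(nt) - (len(nt) - offset) % 3
    return "".join(
        CODON_TABLE.get(nt[i:i + 3], "X") for i in range(offset, usable, 3)
    )

# First 32 nucleotides of the CDS listed above.
cds_prefix = "ATCTCGATCGAACCTTCAACTATAGTGATGCC"
peptide = translate(cds_prefix, offset=2)
print(peptide)  # LDRTFNYSDA, the first ten residues of the protein above
```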
BLAST of Cucsa.303540 vs. Swiss-Prot
Match: GUN_PHAVU (Endoglucanase OS=Phaseolus vulgaris PE=2 SV=2)

HSP 1 Score: 701.4 bits (1809), Expect = 6.4e-201
Identity = 332/456 (72.81%), Positives = 386/456 (84.65%), Query Frame = 1

Query: 5   FNYSDALGKAVLFFEGQRSGKLPVTQRVKWRGNSALSDGSYENVNLVGGYYDAGDNVKFG 64
           ++Y+DAL KA+LFFEGQRSGKLP +QRVKWR +SALSDG  +NVNL+GGYYDAGDNVKFG
Sbjct: 39  YDYADALAKAILFFEGQRSGKLPSSQRVKWREDSALSDGKLQNVNLMGGYYDAGDNVKFG 98

Query: 65  WPMAFTLTLLSWTTVEYEKEIASVMQLEHLRSSVRWGTDFILRAHVSPTTLYTQVGDANG 124
           WPMAF+ +LLSW  VEYE EI+SV QL +L+S++RWG DF+LRAH SPTTLYTQVGD N 
Sbjct: 99  WPMAFSTSLLSWAAVEYESEISSVNQLGYLQSAIRWGADFMLRAHTSPTTLYTQVGDGNA 158

Query: 125 DHQCWERPEDMDTPRTLYKITPNSPGTEAAAEAAAALAAASILFNRVDANYSRRLLQHSK 184
           DH CWERPEDMDTPRT+YKI  NSPGTE AAE AAAL+AASI+F ++DA YS  LL HSK
Sbjct: 159 DHNCWERPEDMDTPRTVYKIDANSPGTEVAAEYAAALSAASIVFKKIDAKYSSTLLSHSK 218

Query: 185 SLFQFADKFRGSYSASCPFYCSYSGYQDELLWAAAWLYKASGNMKYLRYVLSNQWWSQPT 244
           SLF FADK RGSYS SCPFYCSYSGYQDELLWAAAWLYKASG  KYL Y++SNQ WSQ  
Sbjct: 219 SLFDFADKNRGSYSGSCPFYCSYSGYQDELLWAAAWLYKASGESKYLSYIISNQGWSQTV 278

Query: 245 SEFSWDNKFVGAQILLTKEFYKGKKNLSKFKNDVETFICKLMPDDGGSSEISRTPGGLLF 304
           SEFSWDNKFVGAQ LLT+EFY GKK+L+K K D E+FIC +MP    S +I  TPGGLLF
Sbjct: 279 SEFSWDNKFVGAQTLLTEEFYGGKKDLAKIKTDAESFICAVMP-GSNSRQIKTTPGGLLF 338

Query: 305 LRDNSNLQYTSSSSMVLFMYSRLLNQAHIYGIHCGSKYFSSSQIKTFAKSQVDYILGKNT 364
            RD+SNLQYT+SS+MVLF++SR+LN+ HI GI+CGS +F++SQI+ FAK+QV+YILGKN 
Sbjct: 339 TRDSSNLQYTTSSTMVLFIFSRILNRNHINGINCGSSHFTASQIRGFAKTQVEYILGKNP 398

Query: 365 LKMSYMVGFGNKYPLQLHHRASSIPSTKVLSTKVGCNDGRSSYFYSNGPNPNTHIGSIVG 424
           +KMSYMVGFG+KYP QLHHR SSIPS KV   KVGCN G S Y+ S  PNPNTH+G+IVG
Sbjct: 399 MKMSYMVGFGSKYPKQLHHRGSSIPSIKVHPAKVGCNAGLSDYYNSANPNPNTHVGAIVG 458

Query: 425 GPYLNDEFSDLRSDYSHSEPTTYMNAAFVGSVAALV 461
           GP  ND F+D RSDYSH+EPTTY+NAAFV S++AL+
Sbjct: 459 GPDSNDRFNDARSDYSHAEPTTYINAAFVASISALL 493
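Each HSP summary line reports identities and positives as a fraction of the alignment length, with a rounded percentage. The following sketch parses such a line and recomputes the percentages from the raw counts. The function name `parse_hsp_stats` is hypothetical, and the regular expression is written to tolerate misspelt variants of the "Positives" label.

```python
import re

def parse_hsp_stats(line):
    """Extract identity/positive counts from a BLAST HSP summary line.

    Returns a dict keyed by 'identity' and 'positives'; each value holds
    the match count, the alignment length, the percentage as reported,
    and the percentage recomputed from the counts.
    """
    pattern = r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
    stats = {}
    for label, matches, length, reported in re.findall(pattern, line):
        key = "identity" if label == "Identity" else "positives"
        matches, length = int(matches), int(length)
        stats[key] = {
            "matches": matches,
            "length": length,
            "reported_pct": float(reported),
            "recomputed_pct": round(100.0 * matches / length, 2),
        }
    return stats

line = "Identity = 332/456 (72.81%), Positives = 386/456 (84.65%), Query Frame = 1"
stats = parse_hsp_stats(line)
for key, s in stats.items():
    print(key, s["matches"], s["length"], s["reported_pct"], s["recomputed_pct"])
```

For the GUN_PHAVU hit above, the recomputed values agree with the reported 72.81% and 84.65%.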

BLAST of Cucsa.303540 vs. Swiss-Prot
Match: GUN20_ARATH (Endoglucanase 20 OS=Arabidopsis thaliana GN=At4g23560 PE=2 SV=1)

HSP 1 Score: 651.0 bits (1678), Expect = 9.9e-186
Identity = 307/454 (67.62%), Positives = 368/454 (81.06%), Query Frame = 1

Query: 7   YSDALGKAVLFFEGQRSGKLPVTQRVKWRGNSALSDGSYENVNLVGGYYDAGDNVKFGWP 66
           Y DAL K++LFFEGQRSGKLP  QRVKWR +SALSDGS  NVNL+GGYYDAGDNVKF WP
Sbjct: 24  YGDALNKSILFFEGQRSGKLPTNQRVKWRADSALSDGSLANVNLIGGYYDAGDNVKFVWP 83

Query: 67  MAFTLTLLSWTTVEYEKEIASVMQLEHLRSSVRWGTDFILRAHVSPTTLYTQVGDANGDH 126
           M+FT TLLSW  +EY+ EI+SV QL +LRS+++WGTDFILRAH SP  LYTQVGD N DH
Sbjct: 84  MSFTTTLLSWAAIEYQNEISSVNQLGYLRSTIKWGTDFILRAHTSPNMLYTQVGDGNSDH 143

Query: 127 QCWERPEDMDTPRTLYKITPNSPGTEAAAEAAAALAAASILFNRVDANYSRRLLQHSKSL 186
            CWERPEDMDT RTLY I+ +SPG+EAA EAAAALAAAS++F  VD+ YS  LL H+K+L
Sbjct: 144 SCWERPEDMDTSRTLYSISSSSPGSEAAGEAAAALAAASLVFKSVDSTYSSTLLNHAKTL 203

Query: 187 FQFADKFRGSYSASCPFYCSYSGYQDELLWAAAWLYKASGNMKYLRYVLSNQWWSQPTSE 246
           F+FADK+RGSY ASCPFYCSYSGYQDELLWAAAWLYKA+G+  Y+ YV+SN+ WSQ  +E
Sbjct: 204 FEFADKYRGSYQASCPFYCSYSGYQDELLWAAAWLYKATGDKIYINYVISNKDWSQAVNE 263

Query: 247 FSWDNKFVGAQILLTKEFYKGKKNLSKFKNDVETFICKLMPDDGGSSEISRTPGGLLFLR 306
           FSWDNKFVGAQ LL  EFY G  +L+KFK+DVE+F+C +MP    S +I  TPGGLLF+R
Sbjct: 264 FSWDNKFVGAQALLVSEFYNGANDLAKFKSDVESFVCAMMP-GSSSQQIKPTPGGLLFIR 323

Query: 307 DNSNLQYTSSSSMVLFMYSRLLNQAHIYGIHCGSKYFSSSQIKTFAKSQVDYILGKNTLK 366
           D+SNLQY ++++ VLF YS+ L +A +  I CGS  F+ SQI+ FAKSQVDYILG N +K
Sbjct: 324 DSSNLQYVTTATTVLFHYSKTLTKAGVGSIQCGSTKFTVSQIRNFAKSQVDYILGNNPMK 383

Query: 367 MSYMVGFGNKYPLQLHHRASSIPSTKVLSTKVGCNDGRSSYFYSNGPNPNTHIGSIVGGP 426
           MSYMVGFG KYP Q HHR SS+PS +    K+ CN G  SY+ S+ PNPN HIG+IVGGP
Sbjct: 384 MSYMVGFGTKYPTQPHHRGSSLPSIQSKPEKIDCNGG-YSYYNSDTPNPNVHIGAIVGGP 443

Query: 427 YLNDEFSDLRSDYSHSEPTTYMNAAFVGSVAALV 461
             +D++SD +SDYSH+EPTTY+NAAF+G VAAL+
Sbjct: 444 NSSDQYSDKKSDYSHAEPTTYINAAFIGPVAALI 475

BLAST of Cucsa.303540 vs. Swiss-Prot
Match: GUN18_ARATH (Endoglucanase 18 OS=Arabidopsis thaliana GN=At4g09740 PE=3 SV=2)

HSP 1 Score: 634.8 bits (1636), Expect = 7.4e-181
Identity = 299/455 (65.71%), Positives = 361/455 (79.34%), Query Frame = 1

Query: 6   NYSDALGKAVLFFEGQRSGKLPVTQRVKWRGNSALSDGSYENVNLVGGYYDAGDNVKFGW 65
           +Y DAL K++LFFEGQRSGKLP  QRVKWR +S LSDG+  NVNL+GGYYDAGDNVKF W
Sbjct: 23  DYGDALNKSILFFEGQRSGKLPTNQRVKWRADSGLSDGASANVNLIGGYYDAGDNVKFVW 82

Query: 66  PMAFTLTLLSWTTVEYEKEIASVMQLEHLRSSVRWGTDFILRAHVSPTTLYTQVGDANGD 125
           PM+FT TLLSW  +EY+ EI  V QL +LRS+++WGT+FILRAH S   LYTQVGD N D
Sbjct: 83  PMSFTTTLLSWAALEYQNEITFVNQLGYLRSTIKWGTNFILRAHTSTNMLYTQVGDGNSD 142

Query: 126 HQCWERPEDMDTPRTLYKITPNSPGTEAAAEAAAALAAASILFNRVDANYSRRLLQHSKS 185
           H CWERPEDMDTPRTLY I+ +SPG+EAA EAAAALAAAS++F  VD+ YS +LL ++KS
Sbjct: 143 HSCWERPEDMDTPRTLYSISSSSPGSEAAGEAAAALAAASLVFKLVDSTYSSKLLNNAKS 202

Query: 186 LFQFADKFRGSYSASCPFYCSYSGYQDELLWAAAWLYKASGNMKYLRYVLSNQWWSQPTS 245
           LF+FADK+RGSY ASCPFYCS+SGYQDELLWAAAWLYKA+G   YL YV+SN+ WS+  +
Sbjct: 203 LFEFADKYRGSYQASCPFYCSHSGYQDELLWAAAWLYKATGEKSYLNYVISNKDWSKAIN 262

Query: 246 EFSWDNKFVGAQILLTKEFYKGKKNLSKFKNDVETFICKLMPDDGGSSEISRTPGGLLFL 305
           EFSWDNKF G Q LL  EFY G  +L KFK DVE+F+C LMP    S +I  TPGG+LF+
Sbjct: 263 EFSWDNKFAGVQALLASEFYNGANDLEKFKTDVESFVCALMP-GSSSQQIKPTPGGILFI 322

Query: 306 RDNSNLQYTSSSSMVLFMYSRLLNQAHIYGIHCGSKYFSSSQIKTFAKSQVDYILGKNTL 365
           RD+SNLQY ++++ +LF YS+ L +A +  I CGS  F+ SQI+ FAKSQVDYILG N L
Sbjct: 323 RDSSNLQYVTTATTILFYYSKTLTKAGVGSIQCGSTQFTVSQIRNFAKSQVDYILGNNPL 382

Query: 366 KMSYMVGFGNKYPLQLHHRASSIPSTKVLSTKVGCNDGRSSYFYSNGPNPNTHIGSIVGG 425
           KMSYMVGFG KYP Q HHR SS+PS +    K+ CN G  SY+  + PNPN H G+IVGG
Sbjct: 383 KMSYMVGFGTKYPTQPHHRGSSLPSIQSKPEKIDCNGG-FSYYNFDTPNPNVHTGAIVGG 442

Query: 426 PYLNDEFSDLRSDYSHSEPTTYMNAAFVGSVAALV 461
           P  +D++SD R+DYSH+EPTTY+NAAF+GSVAAL+
Sbjct: 443 PNSSDQYSDKRTDYSHAEPTTYINAAFIGSVAALI 475

BLAST of Cucsa.303540 vs. Swiss-Prot
Match: GUN20_ORYSJ (Endoglucanase 20 OS=Oryza sativa subsp. japonica GN=GLU15 PE=2 SV=1)

HSP 1 Score: 594.3 bits (1531), Expect = 1.1e-168
Identity = 282/454 (62.11%), Positives = 361/454 (79.52%), Query Frame = 1

Query: 6   NYSDALGKAVLFFEGQRSGKLPVTQRVKWRGNSALSDGSYENVNLVGGYYDAGDNVKFGW 65
           +Y DAL KA+LFFEGQRSG+LP  QR  WRG+SAL+DG  ENVNL GGYYDAGDNVKFG+
Sbjct: 40  DYGDALAKAILFFEGQRSGRLPANQRATWRGDSALTDGREENVNLTGGYYDAGDNVKFGY 99

Query: 66  PMAFTLTLLSWTTVEYEKEIASVMQLEHLRSSVRWGTDFILRAHVSPTTLYTQVGDANGD 125
           PMAFT+TLL W+ VEY   +A+  +L +LR+++RWG DF+LRAH SPTTLYTQVGD N D
Sbjct: 100 PMAFTVTLLGWSAVEYGAAVAAAGELGNLRAAIRWGADFLLRAHASPTTLYTQVGDGNAD 159

Query: 126 HQCWERPEDMDTPRTLYKITPNSPGTEAAAEAAAALAAASI-LFNRVDANYSRRLLQHSK 185
           HQCWERPEDMDTPRTLYKIT +SPG+EAAAEA+AALAAA + L +  D  +S RLL  S+
Sbjct: 160 HQCWERPEDMDTPRTLYKITADSPGSEAAAEASAALAAAYVALKDDGDTAFSSRLLAASR 219

Query: 186 SLFQFADKFRGSYSASCPFYCSYSGYQDELLWAAAWLYKASGNMKYLRYVLSNQWWSQPT 245
           SLF FA+ +RGS+ +SCPFYCSYSG+QDELLWA+AWL+KA+ + KYL ++ +NQ  S P 
Sbjct: 220 SLFDFANNYRGSFQSSCPFYCSYSGFQDELLWASAWLFKATRDAKYLDFLTNNQGSSNPV 279

Query: 246 SEFSWDNKFVGAQILLTKEFYKGKKNLSKFKNDVETFICKLMPDDGGSSEISRTPGGLLF 305
           +EFSWDNK+ GAQ+L  +E+  G+  L+++K+++++F+C LMP + G+ +I  TPGGLLF
Sbjct: 280 NEFSWDNKYAGAQMLAAQEYLGGRTQLARYKDNLDSFVCALMP-NSGNVQIRTTPGGLLF 339

Query: 306 LRDNSNLQYTSSSSMVLFMYSRLLNQAHIYGIHCGSKYFSSSQIKTFAKSQVDYILGKNT 365
            RD+ NLQYT+++++VL +YS++L  +   G+ C +  FS +QI +FA SQVDYILGKN 
Sbjct: 340 TRDSVNLQYTTTATLVLSIYSKVLKSSGSRGVRCSAATFSPNQISSFATSQVDYILGKNP 399

Query: 366 LKMSYMVGFGNKYPLQLHHRASSIPSTKVLSTKVGCNDGRSSYFYSNGPNPNTHIGSIVG 425
           L MSYMVGF  K+P ++HHR SSIPS KVLS KV C +G SS+  ++ PNPN H+G+IVG
Sbjct: 400 LGMSYMVGFSTKFPRRIHHRGSSIPSIKVLSRKVTCKEGFSSWLPTSDPNPNIHVGAIVG 459

Query: 426 GPYLNDEFSDLRSDYSHSEPTTYMNAAFVGSVAA 459
           GP  ND+FSD R D SHSEP TY+NAAFVG+ AA
Sbjct: 460 GPDGNDQFSDNRGDSSHSEPATYINAAFVGACAA 492

BLAST of Cucsa.303540 vs. Swiss-Prot
Match: GUN1_ARATH (Endoglucanase 1 OS=Arabidopsis thaliana GN=CEL2 PE=2 SV=1)

HSP 1 Score: 507.3 bits (1305), Expect = 1.8e-142
Identity = 253/463 (54.64%), Positives = 329/463 (71.06%), Query Frame = 1

Query: 6   NYSDALGKAVLFFEGQRSGKLPVTQRVKWRGNSALSDGSYENVNLVGGYYDAGDNVKFGW 65
           NY DAL K++LFFEGQRSGKLP  QR+ WR NS LSDGS  NV+LVGGYYDAGDN+KFG+
Sbjct: 43  NYKDALSKSILFFEGQRSGKLPPNQRMTWRSNSGLSDGSALNVDLVGGYYDAGDNMKFGF 102

Query: 66  PMAFTLTLLSWTTVEYEKEIASVMQLEHLRSSVRWGTDFILRAHVSPTTLYTQVGDANGD 125
           PMAFT T+LSW+ +E+   + S  +L + + ++RW TDF+L+A   P T+Y QVGD N D
Sbjct: 103 PMAFTTTMLSWSLIEFGGLMKS--ELPNAKDAIRWATDFLLKATSHPDTIYVQVGDPNMD 162

Query: 126 HQCWERPEDMDTPRTLYKITPNSPGTEAAAEAAAALAAASILFNRVDANYSRRLLQHSKS 185
           H CWERPEDMDTPR+++K+  N+PG++ A E AAALAAASI+F + D +YS  LLQ + +
Sbjct: 163 HACWERPEDMDTPRSVFKVDKNNPGSDIAGEIAAALAAASIVFRKCDPSYSNHLLQRAIT 222

Query: 186 LFQFADKFRGSYSAS-----CPFYCSYSGYQDELLWAAAWLYKASGNMKYLRYVLSN--- 245
           +F FADK+RG YSA      CPFYCSYSGYQDELLW AAWL KA+ N  YL Y+ +N   
Sbjct: 223 VFTFADKYRGPYSAGLAPEVCPFYCSYSGYQDELLWGAAWLQKATNNPTYLNYIKANGQI 282

Query: 246 QWWSQPTSEFSWDNKFVGAQILLTKEFYKGK-KNLSKFKNDVETFICKLMPDDGGSSEIS 305
               +  + FSWDNK VGA+ILL+KEF   K K+L ++K   ++FIC ++P   G+S   
Sbjct: 283 LGADEFDNMFSWDNKHVGARILLSKEFLIQKVKSLEEYKEHADSFICSVLP---GASSSQ 342

Query: 306 RTPGGLLFLRDNSNLQYTSSSSMVLFMYSRLLNQAHIYGIHCGSKYFSSSQIKTFAKSQV 365
            TPGGLLF    SN+QY +S+S +L  Y++ L  A     +CG    + +++++ AK QV
Sbjct: 343 YTPGGLLFKMGESNMQYVTSTSFLLLTYAKYLTSARTVA-YCGGSVVTPARLRSIAKKQV 402

Query: 366 DYILGKNTLKMSYMVGFGNKYPLQLHHRASSIPSTKVLSTKVGCNDGRSSYFYSNGPNPN 425
           DY+LG N LKMSYMVG+G KYP ++HHR SS+PS  V  T++ C+DG  S F S  PNPN
Sbjct: 403 DYLLGGNPLKMSYMVGYGLKYPRRIHHRGSSLPSVAVHPTRIQCHDG-FSLFTSQSPNPN 462

Query: 426 THIGSIVGGPYLNDEFSDLRSDYSHSEPTTYMNAAFVGSVAAL 460
             +G++VGGP  ND+F D RSDY  SEP TY+NA  VG++A L
Sbjct: 463 DLVGAVVGGPDQNDQFPDERSDYGRSEPATYINAPLVGALAYL 498

BLAST of Cucsa.303540 vs. TrEMBL
Match: A0A0A0KK57_CUCSA (Endoglucanase OS=Cucumis sativus GN=Csa_5G152200 PE=3 SV=1)

HSP 1 Score: 938.7 bits (2425), Expect = 2.7e-270
Identity = 457/461 (99.13%), Positives = 459/461 (99.57%), Query Frame = 1

Query: 1   LDRTFNYSDALGKAVLFFEGQRSGKLPVTQRVKWRGNSALSDGSYENVNLVGGYYDAGDN 60
           LDRTFNYSDALGKAVLFFEGQRSGKLPVTQRVKWRGNSALSDGSYENVNLVGGYYDAGDN
Sbjct: 28  LDRTFNYSDALGKAVLFFEGQRSGKLPVTQRVKWRGNSALSDGSYENVNLVGGYYDAGDN 87

Query: 61  VKFGWPMAFTLTLLSWTTVEYEKEIASVMQLEHLRSSVRWGTDFILRAHVSPTTLYTQVG 120
           VKFGWPMAFTLTLLSWTTVEYEKEIASVMQLEHLRSSVRWGTDFILRAHVSPTTLYTQVG
Sbjct: 88  VKFGWPMAFTLTLLSWTTVEYEKEIASVMQLEHLRSSVRWGTDFILRAHVSPTTLYTQVG 147

Query: 121 DANGDHQCWERPEDMDTPRTLYKITPNSPGTEAAAEAAAALAAASILFNRVDANYSRRLL 180
           DANGDHQCWERPEDMDTPRTLYKITPNSPGTEAAAEAAAALAAASILFNRVDANYSRRLL
Sbjct: 148 DANGDHQCWERPEDMDTPRTLYKITPNSPGTEAAAEAAAALAAASILFNRVDANYSRRLL 207

Query: 181 QHSKSLFQFADKFRGSYSASCPFYCSYSGYQDELLWAAAWLYKASGNMKYLRYVLSNQWW 240
           QHSKSLFQFADKFRGSYSASCPFYCSYSGYQDELLWAAAWLYKASGNMKYLRYVLSNQWW
Sbjct: 208 QHSKSLFQFADKFRGSYSASCPFYCSYSGYQDELLWAAAWLYKASGNMKYLRYVLSNQWW 267

Query: 241 SQPTSEFSWDNKFVGAQILLTKEFYKGKKNLSKFKNDVETFICKLMPDDGGSSEISRTPG 300
           SQPTSEFSWDNKFVGAQILLTKEFYKGKKNLSKFKNDVETFICKLMPDDGGSS+ISRTPG
Sbjct: 268 SQPTSEFSWDNKFVGAQILLTKEFYKGKKNLSKFKNDVETFICKLMPDDGGSSKISRTPG 327

Query: 301 GLLFLRDNSNLQYTSSSSMVLFMYSRLLNQAHIYGIHCGSKYFSSSQIKTFAKSQVDYIL 360
           GLLFLRDNSNLQYTSSSSMVLFMYSRLLNQAHI+GIHCGSKYFSSSQIKTFAKSQVDYIL
Sbjct: 328 GLLFLRDNSNLQYTSSSSMVLFMYSRLLNQAHIHGIHCGSKYFSSSQIKTFAKSQVDYIL 387

Query: 361 GKNTLKMSYMVGFGNKYPLQLHHRASSIPSTKVLSTKVGCNDGRSSYFYSNGPNPNTHIG 420
           GKN LKMSYMVGFGNKYP QLHHRASSIPSTKVLSTKVGCNDGRSSYFYSNGPNPNTHIG
Sbjct: 388 GKNPLKMSYMVGFGNKYPSQLHHRASSIPSTKVLSTKVGCNDGRSSYFYSNGPNPNTHIG 447

Query: 421 SIVGGPYLNDEFSDLRSDYSHSEPTTYMNAAFVGSVAALVV 462
           SIVGGPYLNDEFSDLRSDYSHSEPTTYMNAAFVGSVAALVV
Sbjct: 448 SIVGGPYLNDEFSDLRSDYSHSEPTTYMNAAFVGSVAALVV 488

BLAST of Cucsa.303540 vs. TrEMBL
Match: G7KC03_MEDTR (Endoglucanase OS=Medicago truncatula GN=MTR_5g010000 PE=3 SV=1)

HSP 1 Score: 713.8 bits (1841), Expect = 1.4e-202
Identity = 334/456 (73.25%), Positives = 391/456 (85.75%), Query Frame = 1

Query: 5   FNYSDALGKAVLFFEGQRSGKLPVTQRVKWRGNSALSDGSYENVNLVGGYYDAGDNVKFG 64
           ++Y+DALGKA+LFFEGQRSGKLP  QRVKWRG+SALSDG  +NV+LVGGYYDAGDNVKFG
Sbjct: 39  YDYADALGKAILFFEGQRSGKLPKDQRVKWRGDSALSDGKTQNVDLVGGYYDAGDNVKFG 98

Query: 65  WPMAFTLTLLSWTTVEYEKEIASVMQLEHLRSSVRWGTDFILRAHVSPTTLYTQVGDANG 124
           WPM+FT++LLSW  VEYE EI+S  QL++LRS++RWG DFIL+AH SPTTL+TQVGD N 
Sbjct: 99  WPMSFTVSLLSWAAVEYESEISSANQLDYLRSAIRWGADFILKAHTSPTTLFTQVGDGNA 158

Query: 125 DHQCWERPEDMDTPRTLYKITPNSPGTEAAAEAAAALAAASILFNRVDANYSRRLLQHSK 184
           DH CWERPEDMDTPRT YKI  NSPGTEAAAEAAAAL+AASI+F + D NYS +LL  SK
Sbjct: 159 DHNCWERPEDMDTPRTTYKIDSNSPGTEAAAEAAAALSAASIVFKKTDINYSSKLLSQSK 218

Query: 185 SLFQFADKFRGSYSASCPFYCSYSGYQDELLWAAAWLYKASGNMKYLRYVLSNQWWSQPT 244
           SLF FADK+RGSYS SCPFYCSYSGYQDELLWAA WLYKASG  KYL Y+ SNQ WSQ  
Sbjct: 219 SLFDFADKYRGSYSGSCPFYCSYSGYQDELLWAATWLYKASGESKYLTYITSNQGWSQAV 278

Query: 245 SEFSWDNKFVGAQILLTKEFYKGKKNLSKFKNDVETFICKLMPDDGGSSEISRTPGGLLF 304
           SEFSWDNKFVGAQ LLT+EFY G++ L+KF+ D E+FIC LMP    S +I  TPGGLL+
Sbjct: 279 SEFSWDNKFVGAQTLLTQEFYGGREELAKFQTDAESFICALMP-GSSSLQIKTTPGGLLY 338

Query: 305 LRDNSNLQYTSSSSMVLFMYSRLLNQAHIYGIHCGSKYFSSSQIKTFAKSQVDYILGKNT 364
           +RD+SNLQYT++S+MVLF++S++LN+ HI GIHCGS +FS S+I+ FAK QVDYILG N 
Sbjct: 339 IRDSSNLQYTTTSTMVLFIFSKILNKNHIDGIHCGSAHFSPSEIRAFAKLQVDYILGNNP 398

Query: 365 LKMSYMVGFGNKYPLQLHHRASSIPSTKVLSTKVGCNDGRSSYFYSNGPNPNTHIGSIVG 424
           +KMSYMVG+G+KYP QLHHR SSIPS KV  TKVGCNDG+S+YF S+ PNPN H+G+IVG
Sbjct: 399 MKMSYMVGYGSKYPKQLHHRGSSIPSIKVHQTKVGCNDGQSNYFSSSNPNPNIHVGAIVG 458

Query: 425 GPYLNDEFSDLRSDYSHSEPTTYMNAAFVGSVAALV 461
           GP  ND+++D RSDYSH+EPTTYMNAAFVGSVAAL+
Sbjct: 459 GPNSNDQYNDARSDYSHAEPTTYMNAAFVGSVAALL 493

BLAST of Cucsa.303540 vs. TrEMBL
Match: V4VJ70_9ROSI (Endoglucanase OS=Citrus clementina GN=CICLE_v10023813mg PE=3 SV=1)

HSP 1 Score: 711.1 bits (1834), Expect = 9.0e-202
Identity = 336/457 (73.52%), Positives = 390/457 (85.34%), Query Frame = 1

Query: 4   TFNYSDALGKAVLFFEGQRSGKLPVTQRVKWRGNSALSDGSYENVNLVGGYYDAGDNVKF 63
           TF Y DALGKA+LFFEGQRSGKLP +QRVKWRGNSALSDG  ENVNL+GGYYDAGDNVKF
Sbjct: 38  TFAYRDALGKAILFFEGQRSGKLPESQRVKWRGNSALSDGKPENVNLIGGYYDAGDNVKF 97

Query: 64  GWPMAFTLTLLSWTTVEYEKEIASVMQLEHLRSSVRWGTDFILRAHVSPTTLYTQVGDAN 123
           GWPMA++++LLSW  VEY++EI+SV QL +LR ++RWGTDFILRAH SPTTLYTQVGD N
Sbjct: 98  GWPMAYSVSLLSWAAVEYQREISSVNQLGYLRGAIRWGTDFILRAHTSPTTLYTQVGDGN 157

Query: 124 GDHQCWERPEDMDTPRTLYKITPNSPGTEAAAEAAAALAAASILFNRVDANYSRRLLQHS 183
            DHQCWERPEDMDTPRTLY+IT +SPG+EAAAE+AAALAAASI+F +VD+ YS RLL HS
Sbjct: 158 ADHQCWERPEDMDTPRTLYRITSDSPGSEAAAESAAALAAASIVFKKVDSIYSSRLLNHS 217

Query: 184 KSLFQFADKFRGSYSASCPFYCSYSGYQDELLWAAAWLYKASGNMKYLRYVLSNQWWSQP 243
           KSLF+FADK RGSY ASCPFYCSYSGYQDELLWAAAWLYKAS + KYL YVLSNQ WSQ 
Sbjct: 218 KSLFEFADKHRGSYQASCPFYCSYSGYQDELLWAAAWLYKASEDNKYLNYVLSNQGWSQV 277

Query: 244 TSEFSWDNKFVGAQILLTKEFYKGKKNLSKFKNDVETFICKLMPDDGGSSEISRTPGGLL 303
            SEFSWDNKF GAQ+LL KEF+ G K LS FK  VE+F+C LMP +  S  I  TPGGLL
Sbjct: 278 ASEFSWDNKFAGAQMLLAKEFFGGNKQLSLFKIHVESFVCALMP-ESSSVRIETTPGGLL 337

Query: 304 FLRDNSNLQYTSSSSMVLFMYSRLLNQAHIYGIHCGSKYFSSSQIKTFAKSQVDYILGKN 363
           ++RD+SNLQY +S++++LF+YS+ LN AHI G+ CGS +FS+SQI  FAKSQVDYILGKN
Sbjct: 338 YIRDSSNLQYVTSATLLLFLYSKTLNTAHINGLQCGSAHFSASQISAFAKSQVDYILGKN 397

Query: 364 TLKMSYMVGFGNKYPLQLHHRASSIPSTKVLSTKVGCNDGRSSYFYSNGPNPNTHIGSIV 423
            +KMSYM GFG+K+PLQ+HHR +SIPS      KV CNDG SSY++S+ PNPN H+G+IV
Sbjct: 398 PMKMSYMAGFGSKFPLQIHHRGASIPSIGAHPAKVSCNDGYSSYYHSSNPNPNVHVGAIV 457

Query: 424 GGPYLNDEFSDLRSDYSHSEPTTYMNAAFVGSVAALV 461
           GGP  ND+F DLR+DYSH+EPTTYMNAAFVGSVA L+
Sbjct: 458 GGPDSNDQFKDLRTDYSHAEPTTYMNAAFVGSVAPLL 493

BLAST of Cucsa.303540 vs. TrEMBL
Match: A0A067G9J1_CITSI (Endoglucanase OS=Citrus sinensis GN=CISIN_1g042201mg PE=3 SV=1)

HSP 1 Score: 710.3 bits (1832), Expect = 1.5e-201
Identity = 336/457 (73.52%), Positives = 390/457 (85.34%), Query Frame = 1

Query: 4   TFNYSDALGKAVLFFEGQRSGKLPVTQRVKWRGNSALSDGSYENVNLVGGYYDAGDNVKF 63
           TF Y DALGKA+LFFEGQRSGKLP +QRVKWRGNSALSDG  ENVNL+GGYYDAGDNVKF
Sbjct: 38  TFAYRDALGKAILFFEGQRSGKLPESQRVKWRGNSALSDGKPENVNLIGGYYDAGDNVKF 97

Query: 64  GWPMAFTLTLLSWTTVEYEKEIASVMQLEHLRSSVRWGTDFILRAHVSPTTLYTQVGDAN 123
           GWPMA++++LLSW  VEY++EI+SV QL +LR ++RWGTDFILRAH SPTTLYTQVGD N
Sbjct: 98  GWPMAYSVSLLSWAAVEYQREISSVNQLGYLRGAIRWGTDFILRAHTSPTTLYTQVGDGN 157

Query: 124 GDHQCWERPEDMDTPRTLYKITPNSPGTEAAAEAAAALAAASILFNRVDANYSRRLLQHS 183
            DHQCWERPEDMDTPRTLY+IT +SPG+EAAAE+AAALAAASI+F +VD+ YS RLL HS
Sbjct: 158 ADHQCWERPEDMDTPRTLYRITSDSPGSEAAAESAAALAAASIVFKKVDSIYSSRLLNHS 217

Query: 184 KSLFQFADKFRGSYSASCPFYCSYSGYQDELLWAAAWLYKASGNMKYLRYVLSNQWWSQP 243
           KSLF+FADK RGSY ASCPFYCSYSGYQDELLWAAAWLYKAS + KYL YVLSNQ WSQ 
Sbjct: 218 KSLFEFADKHRGSYQASCPFYCSYSGYQDELLWAAAWLYKASEDNKYLDYVLSNQGWSQV 277

Query: 244 TSEFSWDNKFVGAQILLTKEFYKGKKNLSKFKNDVETFICKLMPDDGGSSEISRTPGGLL 303
            SEFSWDNKF GAQ+LL KEF+ G K LS FK  VE+F+C LMP +  S  I  TPGGLL
Sbjct: 278 ASEFSWDNKFAGAQMLLAKEFFGGNKQLSLFKIHVESFVCALMP-ESSSVRIETTPGGLL 337

Query: 304 FLRDNSNLQYTSSSSMVLFMYSRLLNQAHIYGIHCGSKYFSSSQIKTFAKSQVDYILGKN 363
           ++RD+SNLQY +S++++LF+YS+ LN AHI G+ CGS +FS+SQI  FAKSQVDYILGKN
Sbjct: 338 YIRDSSNLQYVTSATLLLFLYSKTLNTAHINGLQCGSAHFSASQISAFAKSQVDYILGKN 397

Query: 364 TLKMSYMVGFGNKYPLQLHHRASSIPSTKVLSTKVGCNDGRSSYFYSNGPNPNTHIGSIV 423
            +KMSYM GFG+K+PLQ+HHR +SIPS      KV CNDG SSY++S+ PNPN H+G+IV
Sbjct: 398 PMKMSYMAGFGSKFPLQIHHRGASIPSIGAHPAKVSCNDGYSSYYHSSNPNPNVHVGAIV 457

Query: 424 GGPYLNDEFSDLRSDYSHSEPTTYMNAAFVGSVAALV 461
           GGP  ND+F DLR+DYSH+EPTTYMNAAFVGSVA L+
Sbjct: 458 GGPDSNDQFKDLRTDYSHAEPTTYMNAAFVGSVAPLL 493

BLAST of Cucsa.303540 vs. TrEMBL
Match: G7KBZ9_MEDTR (Endoglucanase OS=Medicago truncatula GN=MTR_5g009970 PE=3 SV=1)

HSP 1 Score: 706.8 bits (1823), Expect = 1.7e-200
Identity = 330/457 (72.21%), Positives = 392/457 (85.78%), Query Frame = 1

Query: 4   TFNYSDALGKAVLFFEGQRSGKLPVTQRVKWRGNSALSDGSYENVNLVGGYYDAGDNVKF 63
           ++NY+DALGKA+LFFEGQRSGKLP  QRVKWRG+SALSDG  +NVNLVGGYYDAGDNVKF
Sbjct: 39  SYNYADALGKAILFFEGQRSGKLPKDQRVKWRGDSALSDGKTQNVNLVGGYYDAGDNVKF 98

Query: 64  GWPMAFTLTLLSWTTVEYEKEIASVMQLEHLRSSVRWGTDFILRAHVSPTTLYTQVGDAN 123
           GWPM+FT++LLSW  VEYE EI+SV Q+ +LR ++RWGT+FIL++H SP TL+TQVG+ N
Sbjct: 99  GWPMSFTVSLLSWAAVEYESEISSVNQIGYLRRAIRWGTNFILQSHTSPITLFTQVGEGN 158

Query: 124 GDHQCWERPEDMDTPRTLYKITPNSPGTEAAAEAAAALAAASILFNRVDANYSRRLLQHS 183
            DH CWERPEDMDTPRTLYKI  NSPGTEAAAEAAAAL+AASI+F + D  YS +LL+HS
Sbjct: 159 ADHNCWERPEDMDTPRTLYKIDANSPGTEAAAEAAAALSAASIVFKKKDTKYSSKLLRHS 218

Query: 184 KSLFQFADKFRGSYSASCPFYCSYSGYQDELLWAAAWLYKASGNMKYLRYVLSNQWWSQP 243
           KSLF FADK+RG+Y+ SCPFYCSYSGYQDELLWAAAWLYKASG  KYL+Y+  NQ W+Q 
Sbjct: 219 KSLFDFADKYRGTYTGSCPFYCSYSGYQDELLWAAAWLYKASGESKYLKYITDNQGWNQA 278

Query: 244 TSEFSWDNKFVGAQILLTKEFYKGKKNLSKFKNDVETFICKLMPDDGGSSEISRTPGGLL 303
            SEFSWDNKFVG Q LLT+EFY GKK+L+K  +D E+FIC LM     S +I +TPGGLL
Sbjct: 279 ASEFSWDNKFVGVQTLLTQEFYGGKKDLAKIHSDGESFICALM-QGSYSLQIKKTPGGLL 338

Query: 304 FLRDNSNLQYTSSSSMVLFMYSRLLNQAHIYGIHCGSKYFSSSQIKTFAKSQVDYILGKN 363
           + RD++NLQYT++S+MVLF++S++LN+ +I GIHCGS  F+SS+IK FAKSQVDYILG N
Sbjct: 339 YTRDSNNLQYTTTSTMVLFIFSKILNKNNIDGIHCGSTNFTSSEIKAFAKSQVDYILGNN 398

Query: 364 TLKMSYMVGFGNKYPLQLHHRASSIPSTKVLSTKVGCNDGRSSYFYSNGPNPNTHIGSIV 423
            +KMSYMVG+G+KYP QLHHR SSIPS KV  TKVGCNDG + YFYS+ PNPN H+G+IV
Sbjct: 399 PMKMSYMVGYGSKYPKQLHHRGSSIPSIKVHQTKVGCNDGYTDYFYSSNPNPNIHVGAIV 458

Query: 424 GGPYLNDEFSDLRSDYSHSEPTTYMNAAFVGSVAALV 461
           GGP  ND+F+D RSDYSHSEPTTYMNAAF+GSVAAL+
Sbjct: 459 GGPDFNDQFNDARSDYSHSEPTTYMNAAFIGSVAALI 494

BLAST of Cucsa.303540 vs. TAIR10
Match: AT4G23560.1 (AT4G23560.1 glycosyl hydrolase 9B15)

HSP 1 Score: 651.0 bits (1678), Expect = 5.6e-187
Identity = 307/454 (67.62%), Positives = 368/454 (81.06%), Query Frame = 1

Query: 7   YSDALGKAVLFFEGQRSGKLPVTQRVKWRGNSALSDGSYENVNLVGGYYDAGDNVKFGWP 66
           Y DAL K++LFFEGQRSGKLP  QRVKWR +SALSDGS  NVNL+GGYYDAGDNVKF WP
Sbjct: 24  YGDALNKSILFFEGQRSGKLPTNQRVKWRADSALSDGSLANVNLIGGYYDAGDNVKFVWP 83

Query: 67  MAFTLTLLSWTTVEYEKEIASVMQLEHLRSSVRWGTDFILRAHVSPTTLYTQVGDANGDH 126
           M+FT TLLSW  +EY+ EI+SV QL +LRS+++WGTDFILRAH SP  LYTQVGD N DH
Sbjct: 84  MSFTTTLLSWAAIEYQNEISSVNQLGYLRSTIKWGTDFILRAHTSPNMLYTQVGDGNSDH 143

Query: 127 QCWERPEDMDTPRTLYKITPNSPGTEAAAEAAAALAAASILFNRVDANYSRRLLQHSKSL 186
            CWERPEDMDT RTLY I+ +SPG+EAA EAAAALAAAS++F  VD+ YS  LL H+K+L
Sbjct: 144 SCWERPEDMDTSRTLYSISSSSPGSEAAGEAAAALAAASLVFKSVDSTYSSTLLNHAKTL 203

Query: 187 FQFADKFRGSYSASCPFYCSYSGYQDELLWAAAWLYKASGNMKYLRYVLSNQWWSQPTSE 246
           F+FADK+RGSY ASCPFYCSYSGYQDELLWAAAWLYKA+G+  Y+ YV+SN+ WSQ  +E
Sbjct: 204 FEFADKYRGSYQASCPFYCSYSGYQDELLWAAAWLYKATGDKIYINYVISNKDWSQAVNE 263

Query: 247 FSWDNKFVGAQILLTKEFYKGKKNLSKFKNDVETFICKLMPDDGGSSEISRTPGGLLFLR 306
           FSWDNKFVGAQ LL  EFY G  +L+KFK+DVE+F+C +MP    S +I  TPGGLLF+R
Sbjct: 264 FSWDNKFVGAQALLVSEFYNGANDLAKFKSDVESFVCAMMP-GSSSQQIKPTPGGLLFIR 323

Query: 307 DNSNLQYTSSSSMVLFMYSRLLNQAHIYGIHCGSKYFSSSQIKTFAKSQVDYILGKNTLK 366
           D+SNLQY ++++ VLF YS+ L +A +  I CGS  F+ SQI+ FAKSQVDYILG N +K
Sbjct: 324 DSSNLQYVTTATTVLFHYSKTLTKAGVGSIQCGSTKFTVSQIRNFAKSQVDYILGNNPMK 383

Query: 367 MSYMVGFGNKYPLQLHHRASSIPSTKVLSTKVGCNDGRSSYFYSNGPNPNTHIGSIVGGP 426
           MSYMVGFG KYP Q HHR SS+PS +    K+ CN G  SY+ S+ PNPN HIG+IVGGP
Sbjct: 384 MSYMVGFGTKYPTQPHHRGSSLPSIQSKPEKIDCNGG-YSYYNSDTPNPNVHIGAIVGGP 443

Query: 427 YLNDEFSDLRSDYSHSEPTTYMNAAFVGSVAALV 461
             +D++SD +SDYSH+EPTTY+NAAF+G VAAL+
Sbjct: 444 NSSDQYSDKKSDYSHAEPTTYINAAFIGPVAALI 475

BLAST of Cucsa.303540 vs. TAIR10
Match: AT4G09740.1 (AT4G09740.1 glycosyl hydrolase 9B14)

HSP 1 Score: 634.8 bits (1636), Expect = 4.1e-182
Identity = 299/455 (65.71%), Positives = 361/455 (79.34%), Query Frame = 1

Query: 6   NYSDALGKAVLFFEGQRSGKLPVTQRVKWRGNSALSDGSYENVNLVGGYYDAGDNVKFGW 65
           +Y DAL K++LFFEGQRSGKLP  QRVKWR +S LSDG+  NVNL+GGYYDAGDNVKF W
Sbjct: 23  DYGDALNKSILFFEGQRSGKLPTNQRVKWRADSGLSDGASANVNLIGGYYDAGDNVKFVW 82

Query: 66  PMAFTLTLLSWTTVEYEKEIASVMQLEHLRSSVRWGTDFILRAHVSPTTLYTQVGDANGD 125
           PM+FT TLLSW  +EY+ EI  V QL +LRS+++WGT+FILRAH S   LYTQVGD N D
Sbjct: 83  PMSFTTTLLSWAALEYQNEITFVNQLGYLRSTIKWGTNFILRAHTSTNMLYTQVGDGNSD 142

Query: 126 HQCWERPEDMDTPRTLYKITPNSPGTEAAAEAAAALAAASILFNRVDANYSRRLLQHSKS 185
           H CWERPEDMDTPRTLY I+ +SPG+EAA EAAAALAAAS++F  VD+ YS +LL ++KS
Sbjct: 143 HSCWERPEDMDTPRTLYSISSSSPGSEAAGEAAAALAAASLVFKLVDSTYSSKLLNNAKS 202

Query: 186 LFQFADKFRGSYSASCPFYCSYSGYQDELLWAAAWLYKASGNMKYLRYVLSNQWWSQPTS 245
           LF+FADK+RGSY ASCPFYCS+SGYQDELLWAAAWLYKA+G   YL YV+SN+ WS+  +
Sbjct: 203 LFEFADKYRGSYQASCPFYCSHSGYQDELLWAAAWLYKATGEKSYLNYVISNKDWSKAIN 262

Query: 246 EFSWDNKFVGAQILLTKEFYKGKKNLSKFKNDVETFICKLMPDDGGSSEISRTPGGLLFL 305
           EFSWDNKF G Q LL  EFY G  +L KFK DVE+F+C LMP    S +I  TPGG+LF+
Sbjct: 263 EFSWDNKFAGVQALLASEFYNGANDLEKFKTDVESFVCALMP-GSSSQQIKPTPGGILFI 322

Query: 306 RDNSNLQYTSSSSMVLFMYSRLLNQAHIYGIHCGSKYFSSSQIKTFAKSQVDYILGKNTL 365
           RD+SNLQY ++++ +LF YS+ L +A +  I CGS  F+ SQI+ FAKSQVDYILG N L
Sbjct: 323 RDSSNLQYVTTATTILFYYSKTLTKAGVGSIQCGSTQFTVSQIRNFAKSQVDYILGNNPL 382

Query: 366 KMSYMVGFGNKYPLQLHHRASSIPSTKVLSTKVGCNDGRSSYFYSNGPNPNTHIGSIVGG 425
           KMSYMVGFG KYP Q HHR SS+PS +    K+ CN G  SY+  + PNPN H G+IVGG
Sbjct: 383 KMSYMVGFGTKYPTQPHHRGSSLPSIQSKPEKIDCNGG-FSYYNFDTPNPNVHTGAIVGG 442

Query: 426 PYLNDEFSDLRSDYSHSEPTTYMNAAFVGSVAALV 461
           P  +D++SD R+DYSH+EPTTY+NAAF+GSVAAL+
Sbjct: 443 PNSSDQYSDKRTDYSHAEPTTYINAAFIGSVAALI 475

BLAST of Cucsa.303540 vs. TAIR10
Match: AT1G02800.1 (AT1G02800.1 cellulase 2)

HSP 1 Score: 507.3 bits (1305), Expect = 1.0e-143
Identity = 253/463 (54.64%), Positives = 329/463 (71.06%), Query Frame = 1

Query: 6   NYSDALGKAVLFFEGQRSGKLPVTQRVKWRGNSALSDGSYENVNLVGGYYDAGDNVKFGW 65
           NY DAL K++LFFEGQRSGKLP  QR+ WR NS LSDGS  NV+LVGGYYDAGDN+KFG+
Sbjct: 43  NYKDALSKSILFFEGQRSGKLPPNQRMTWRSNSGLSDGSALNVDLVGGYYDAGDNMKFGF 102

Query: 66  PMAFTLTLLSWTTVEYEKEIASVMQLEHLRSSVRWGTDFILRAHVSPTTLYTQVGDANGD 125
           PMAFT T+LSW+ +E+   + S  +L + + ++RW TDF+L+A   P T+Y QVGD N D
Sbjct: 103 PMAFTTTMLSWSLIEFGGLMKS--ELPNAKDAIRWATDFLLKATSHPDTIYVQVGDPNMD 162

Query: 126 HQCWERPEDMDTPRTLYKITPNSPGTEAAAEAAAALAAASILFNRVDANYSRRLLQHSKS 185
           H CWERPEDMDTPR+++K+  N+PG++ A E AAALAAASI+F + D +YS  LLQ + +
Sbjct: 163 HACWERPEDMDTPRSVFKVDKNNPGSDIAGEIAAALAAASIVFRKCDPSYSNHLLQRAIT 222

Query: 186 LFQFADKFRGSYSAS-----CPFYCSYSGYQDELLWAAAWLYKASGNMKYLRYVLSN--- 245
           +F FADK+RG YSA      CPFYCSYSGYQDELLW AAWL KA+ N  YL Y+ +N   
Sbjct: 223 VFTFADKYRGPYSAGLAPEVCPFYCSYSGYQDELLWGAAWLQKATNNPTYLNYIKANGQI 282

Query: 246 QWWSQPTSEFSWDNKFVGAQILLTKEFYKGK-KNLSKFKNDVETFICKLMPDDGGSSEIS 305
               +  + FSWDNK VGA+ILL+KEF   K K+L ++K   ++FIC ++P   G+S   
Sbjct: 283 LGADEFDNMFSWDNKHVGARILLSKEFLIQKVKSLEEYKEHADSFICSVLP---GASSSQ 342

Query: 306 RTPGGLLFLRDNSNLQYTSSSSMVLFMYSRLLNQAHIYGIHCGSKYFSSSQIKTFAKSQV 365
            TPGGLLF    SN+QY +S+S +L  Y++ L  A     +CG    + +++++ AK QV
Sbjct: 343 YTPGGLLFKMGESNMQYVTSTSFLLLTYAKYLTSARTVA-YCGGSVVTPARLRSIAKKQV 402

Query: 366 DYILGKNTLKMSYMVGFGNKYPLQLHHRASSIPSTKVLSTKVGCNDGRSSYFYSNGPNPN 425
           DY+LG N LKMSYMVG+G KYP ++HHR SS+PS  V  T++ C+DG  S F S  PNPN
Sbjct: 403 DYLLGGNPLKMSYMVGYGLKYPRRIHHRGSSLPSVAVHPTRIQCHDG-FSLFTSQSPNPN 462

Query: 426 THIGSIVGGPYLNDEFSDLRSDYSHSEPTTYMNAAFVGSVAAL 460
             +G++VGGP  ND+F D RSDY  SEP TY+NA  VG++A L
Sbjct: 463 DLVGAVVGGPDQNDQFPDERSDYGRSEPATYINAPLVGALAYL 498

BLAST of Cucsa.303540 vs. TAIR10
Match: AT4G02290.1 (AT4G02290.1 glycosyl hydrolase 9B13)

HSP 1 Score: 480.7 bits (1236), Expect = 1.0e-135
Identity = 238/461 (51.63%), Positives = 323/461 (70.07%), Query Frame = 1

Query: 6   NYSDALGKAVLFFEGQRSGKLPVTQRVKWRGNSALSDGSYENVNLVGGYYDAGDNVKFGW 65
           NY DAL K++LFFEGQRSGKLP  QR+ WR +S LSDGS  +V+LVGGYYDAGDN+KFG+
Sbjct: 52  NYKDALTKSILFFEGQRSGKLPSNQRMSWRRDSGLSDGSALHVDLVGGYYDAGDNIKFGF 111

Query: 66  PMAFTLTLLSWTTVEYEKEIASVMQLEHLRSSVRWGTDFILRAHVSPTTLYTQVGDANGD 125
           PMAFT T+LSW+ +E+   + S  +L++ + ++RW TD++L+A   P T+Y QVGDAN D
Sbjct: 112 PMAFTTTMLSWSVIEFGGLMKS--ELQNAKIAIRWATDYLLKATSQPDTIYVQVGDANKD 171

Query: 126 HQCWERPEDMDTPRTLYKITPNSPGTEAAAEAAAALAAASILFNRVDANYSRRLLQHSKS 185
           H CWERPEDMDT R+++K+  N PG++ AAE AAALAAA+I+F + D +YS+ LL+ + S
Sbjct: 172 HSCWERPEDMDTVRSVFKVDKNIPGSDVAAETAAALAAAAIVFRKSDPSYSKVLLKRAIS 231

Query: 186 LFQFADKFRGSYSAS-----CPFYCSYSGYQDELLWAAAWLYKASGNMKYLRYVLSN--- 245
           +F FADK+RG+YSA      CPFYCSYSGYQDELLW AAWL KA+ N+KYL Y+  N   
Sbjct: 232 VFAFADKYRGTYSAGLKPDVCPFYCSYSGYQDELLWGAAWLQKATKNIKYLNYIKINGQI 291

Query: 246 QWWSQPTSEFSWDNKFVGAQILLTKEF-YKGKKNLSKFKNDVETFICKLMPDDGGSSEIS 305
              ++  + F WDNK  GA+ILLTK F  +  K L ++K   + FIC ++P    SS   
Sbjct: 292 LGAAEYDNTFGWDNKHAGARILLTKAFLVQNVKTLHEYKGHADNFICSVIPGAPFSS-TQ 351

Query: 306 RTPGGLLFLRDNSNLQYTSSSSMVLFMYSRLLNQAHIYGIHCGSKYFSSSQIKTFAKSQV 365
            TPGGLLF   ++N+QY +S+S +L  Y++ L  A    +HCG   ++  ++++ AK QV
Sbjct: 352 YTPGGLLFKMADANMQYVTSTSFLLLTYAKYLTSAKTV-VHCGGSVYTPGRLRSIAKRQV 411

Query: 366 DYILGKNTLKMSYMVGFGNKYPLQLHHRASSIPSTKVLSTKVGCNDGRSSYFYSNGPNPN 425
           DY+LG N L+MSYMVG+G K+P ++HHR SS+P       K+ C+ G  +   S  PNPN
Sbjct: 412 DYLLGDNPLRMSYMVGYGPKFPRRIHHRGSSLPCVASHPAKIQCHQG-FAIMNSQSPNPN 471

Query: 426 THIGSIVGGPYLNDEFSDLRSDYSHSEPTTYMNAAFVGSVA 458
             +G++VGGP  +D F D RSDY  SEP TY+N+  VG++A
Sbjct: 472 FLVGAVVGGPDQHDRFPDERSDYEQSEPATYINSPLVGALA 507

BLAST of Cucsa.303540 vs. TAIR10
Match: AT1G71380.1 (AT1G71380.1 cellulase 3)

HSP 1 Score: 470.3 bits (1209), Expect = 1.4e-132
Identity = 236/463 (50.97%), Positives = 319/463 (68.90%), Query Frame = 1

Query: 2   DRTFNYSDALGKAVLFFEGQRSGKLPVTQRVKWRGNSALSDGSYENVNLVGGYYDAGDNV 61
           D   NY +AL K++LFF+GQRSG LP  Q++ WR +S LSDGS  +V+L GGYYDAGDNV
Sbjct: 20  DANPNYKEALSKSLLFFQGQRSGPLPRGQQISWRASSGLSDGSAAHVDLTGGYYDAGDNV 79

Query: 62  KFGWPMAFTLTLLSWTTVEYEKEIASVMQLEHLRSSVRWGTDFILR-AHVSPTTLYTQVG 121
           KF  PMAFT T+LSW+ +EY K +    +LE+ R ++RW TD++L+ A  +P  LY  VG
Sbjct: 80  KFNLPMAFTTTMLSWSALEYGKRMGP--ELENARVNIRWATDYLLKCARATPGKLYVGVG 139

Query: 122 DANGDHQCWERPEDMDTPRTLYKITPNSPGTEAAAEAAAALAAASILFNRVDANYSRRLL 181
           D N DH+CWERPEDMDTPRT+Y ++ ++PG++ AAE AAALAAAS++F +VD+ YSR LL
Sbjct: 140 DPNVDHKCWERPEDMDTPRTVYSVSASNPGSDVAAETAAALAAASMVFRKVDSKYSRLLL 199

Query: 182 QHSKSLFQFADKFRGSYSAS-----CPFYCSYSGYQDELLWAAAWLYKASGNMKYLRYVL 241
             +K + QFA +++G+YS S     CPFYCSYSGY+DEL+W A+WL +A+ N  Y  ++ 
Sbjct: 200 ATAKDVMQFAIQYQGAYSDSLSSSVCPFYCSYSGYKDELMWGASWLLRATNNPYYANFIK 259

Query: 242 SNQWWSQPTSEFSWDNKFVGAQILLTKEFYKGK-KNLSKFKNDVETFICKLMPDDGGSSE 301
           S     QP   FSWDNK+ GA +LL++     K  N  ++K   E FICK++P D  SS 
Sbjct: 260 SLGGGDQP-DIFSWDNKYAGAYVLLSRRALLNKDSNFEQYKQAAENFICKILP-DSPSSS 319

Query: 302 ISRTPGGLLFLRDNSNLQYTSSSSMVLFMYSRLLNQAHIYGIHCGSKYFSSSQIKTFAKS 361
              T GGL++    SNLQY +S + +L  Y++ + +A  +  +CGS     + + + +K 
Sbjct: 320 TQYTQGGLMYKLPQSNLQYVTSITFLLTTYAKYM-KATKHTFNCGSSVIVPNALISLSKR 379

Query: 362 QVDYILGKNTLKMSYMVGFGNKYPLQLHHRASSIPSTKVLSTKVGCNDGRSSYFYSNGPN 421
           QVDYILG N +KMSYMVGF + +P ++HHRASS+PS  + S  +GCN G  S FY+  PN
Sbjct: 380 QVDYILGDNPIKMSYMVGFSSNFPKRIHHRASSLPSHALRSQSLGCNGGFQS-FYTQNPN 439

Query: 422 PNTHIGSIVGGPYLNDEFSDLRSDYSHSEPTTYMNAAFVGSVA 458
           PN   G+IVGGP  ND + D R DYSH+EP TY+NAAFVG +A
Sbjct: 440 PNILTGAIVGGPNQNDGYPDQRDDYSHAEPATYINAAFVGPLA 476
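
The percentages in each HSP header are the identity and positive counts divided by the alignment length. A minimal sketch of how such a header line could be parsed and checked follows; the function name `parse_hsp_stats` and the returned dictionary keys are illustrative assumptions, not part of any BLAST tooling. The pattern accepts both "Positives" and the "Postives" spelling that some exports of this page carry.

```python
import re

# Matches the statistics line of an HSP header as printed on this page.
# "Posi?tives" tolerates the misspelled "Postives" variant.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP statistics line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "align_len": int(ident_len),
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
        # Recomputed from the raw counts, for cross-checking the report.
        "identity_pct": 100.0 * int(ident) / int(ident_len),
        "positive_pct": 100.0 * int(pos) / int(pos_len),
    }

# Statistics line of the AT1G71380.1 hit above.
stats = parse_hsp_stats(
    "Identity = 236/463 (50.97%), Positives = 319/463 (68.90%), Query Frame = 1"
)
```

Comparing `identity_pct` against `reported_identity_pct` is a cheap consistency check when these records are collected in bulk.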

BLAST of Cucsa.303540 vs. NCBI nr
Match: gi|449452444|ref|XP_004143969.1| (PREDICTED: endoglucanase-like [Cucumis sativus])

HSP 1 Score: 938.7 bits (2425), Expect = 3.8e-270
Identity = 457/461 (99.13%), Positives = 459/461 (99.57%), Query Frame = 1

Query: 1   LDRTFNYSDALGKAVLFFEGQRSGKLPVTQRVKWRGNSALSDGSYENVNLVGGYYDAGDN 60
           LDRTFNYSDALGKAVLFFEGQRSGKLPVTQRVKWRGNSALSDGSYENVNLVGGYYDAGDN
Sbjct: 28  LDRTFNYSDALGKAVLFFEGQRSGKLPVTQRVKWRGNSALSDGSYENVNLVGGYYDAGDN 87

Query: 61  VKFGWPMAFTLTLLSWTTVEYEKEIASVMQLEHLRSSVRWGTDFILRAHVSPTTLYTQVG 120
           VKFGWPMAFTLTLLSWTTVEYEKEIASVMQLEHLRSSVRWGTDFILRAHVSPTTLYTQVG
Sbjct: 88  VKFGWPMAFTLTLLSWTTVEYEKEIASVMQLEHLRSSVRWGTDFILRAHVSPTTLYTQVG 147

Query: 121 DANGDHQCWERPEDMDTPRTLYKITPNSPGTEAAAEAAAALAAASILFNRVDANYSRRLL 180
           DANGDHQCWERPEDMDTPRTLYKITPNSPGTEAAAEAAAALAAASILFNRVDANYSRRLL
Sbjct: 148 DANGDHQCWERPEDMDTPRTLYKITPNSPGTEAAAEAAAALAAASILFNRVDANYSRRLL 207

Query: 181 QHSKSLFQFADKFRGSYSASCPFYCSYSGYQDELLWAAAWLYKASGNMKYLRYVLSNQWW 240
           QHSKSLFQFADKFRGSYSASCPFYCSYSGYQDELLWAAAWLYKASGNMKYLRYVLSNQWW
Sbjct: 208 QHSKSLFQFADKFRGSYSASCPFYCSYSGYQDELLWAAAWLYKASGNMKYLRYVLSNQWW 267

Query: 241 SQPTSEFSWDNKFVGAQILLTKEFYKGKKNLSKFKNDVETFICKLMPDDGGSSEISRTPG 300
           SQPTSEFSWDNKFVGAQILLTKEFYKGKKNLSKFKNDVETFICKLMPDDGGSS+ISRTPG
Sbjct: 268 SQPTSEFSWDNKFVGAQILLTKEFYKGKKNLSKFKNDVETFICKLMPDDGGSSKISRTPG 327

Query: 301 GLLFLRDNSNLQYTSSSSMVLFMYSRLLNQAHIYGIHCGSKYFSSSQIKTFAKSQVDYIL 360
           GLLFLRDNSNLQYTSSSSMVLFMYSRLLNQAHI+GIHCGSKYFSSSQIKTFAKSQVDYIL
Sbjct: 328 GLLFLRDNSNLQYTSSSSMVLFMYSRLLNQAHIHGIHCGSKYFSSSQIKTFAKSQVDYIL 387

Query: 361 GKNTLKMSYMVGFGNKYPLQLHHRASSIPSTKVLSTKVGCNDGRSSYFYSNGPNPNTHIG 420
           GKN LKMSYMVGFGNKYP QLHHRASSIPSTKVLSTKVGCNDGRSSYFYSNGPNPNTHIG
Sbjct: 388 GKNPLKMSYMVGFGNKYPSQLHHRASSIPSTKVLSTKVGCNDGRSSYFYSNGPNPNTHIG 447

Query: 421 SIVGGPYLNDEFSDLRSDYSHSEPTTYMNAAFVGSVAALVV 462
           SIVGGPYLNDEFSDLRSDYSHSEPTTYMNAAFVGSVAALVV
Sbjct: 448 SIVGGPYLNDEFSDLRSDYSHSEPTTYMNAAFVGSVAALVV 488

BLAST of Cucsa.303540 vs. NCBI nr
Match: gi|659074659|ref|XP_008437724.1| (PREDICTED: endoglucanase-like [Cucumis melo])

HSP 1 Score: 864.8 bits (2233), Expect = 7.0e-248
Identity = 423/460 (91.96%), Positives = 438/460 (95.22%), Query Frame = 1

Query: 1   LDRTFNYSDALGKAVLFFEGQRSGKLPVTQRVKWRGNSALSDGSYENVNLVGGYYDAGDN 60
           LDRTFNYSDALGKAVLFFEGQRSGKLP TQRVKWRG+SALSDGSYENVNLVGGYYDAGDN
Sbjct: 29  LDRTFNYSDALGKAVLFFEGQRSGKLPKTQRVKWRGDSALSDGSYENVNLVGGYYDAGDN 88

Query: 61  VKFGWPMAFTLTLLSWTTVEYEKEIASVMQLEHLRSSVRWGTDFILRAHVSPTTLYTQVG 120
           VKFGWPMAFT+TLLSWTTVEYEKEI SVMQLEHLRSSVRWG DFILRAHVSPTTLYTQVG
Sbjct: 89  VKFGWPMAFTVTLLSWTTVEYEKEIKSVMQLEHLRSSVRWGADFILRAHVSPTTLYTQVG 148

Query: 121 DANGDHQCWERPEDMDTPRTLYKITPNSPGTEAAAEAAAALAAASILFNRVDANYSRRLL 180
           DANGDHQCWERPEDMDTPRTLYKITP+SPGTEAAAEAAAALAAASI+FN V+ NYSR+LL
Sbjct: 149 DANGDHQCWERPEDMDTPRTLYKITPDSPGTEAAAEAAAALAAASIVFNHVNTNYSRKLL 208

Query: 181 QHSKSLFQFADKFRGSYSASCPFYCSYSGYQDELLWAAAWLYKASGNMKYLRYVLSNQWW 240
           QHSKSLF+FADKFRGSYSASCPFYCSYSGYQDELLWAAAWLYKAS NMKYL YVLSNQWW
Sbjct: 209 QHSKSLFEFADKFRGSYSASCPFYCSYSGYQDELLWAAAWLYKASRNMKYLGYVLSNQWW 268

Query: 241 SQPTSEFSWDNKFVGAQILLTKEFYKGKKNLSKFKNDVETFICKLMPDDGGSSEISRTPG 300
           SQPTSEFSWDNKFVGAQ LLTKEFYKGKKNLSKFKNDVE+FICK+MP DGGSSEI RT G
Sbjct: 269 SQPTSEFSWDNKFVGAQTLLTKEFYKGKKNLSKFKNDVESFICKVMP-DGGSSEILRTNG 328

Query: 301 GLLFLRDNSNLQYTSSSSMVLFMYSRLLNQAHIYGIHCGSKYFSSSQIKTFAKSQVDYIL 360
           GLLFLRD+SNLQY SSSSMVLFMYS+LL QA I+GIHCGSKYFSSS+IKTFAKSQVDYIL
Sbjct: 329 GLLFLRDSSNLQYASSSSMVLFMYSKLLKQADIHGIHCGSKYFSSSRIKTFAKSQVDYIL 388

Query: 361 GKNTLKMSYMVGFGNKYPLQLHHRASSIPSTKVLSTKVGCNDGRSSYFYSNGPNPNTHIG 420
           GKN LKMSYMVGFGNKYPLQLHHRASSIPS K  STKVGC+DGRSSYFYS  PNPN HIG
Sbjct: 389 GKNPLKMSYMVGFGNKYPLQLHHRASSIPSIKAHSTKVGCDDGRSSYFYSKDPNPNIHIG 448

Query: 421 SIVGGPYLNDEFSDLRSDYSHSEPTTYMNAAFVGSVAALV 461
           SIVGGP LND+FSDLRSDYSHSEPTTYMNAAFVGSVAALV
Sbjct: 449 SIVGGPDLNDQFSDLRSDYSHSEPTTYMNAAFVGSVAALV 487

BLAST of Cucsa.303540 vs. NCBI nr
Match: gi|502159858|ref|XP_004511554.1| (PREDICTED: endoglucanase [Cicer arietinum])

HSP 1 Score: 719.9 bits (1857), Expect = 2.8e-204
Identity = 339/456 (74.34%), Positives = 393/456 (86.18%), Query Frame = 1

Query: 5   FNYSDALGKAVLFFEGQRSGKLPVTQRVKWRGNSALSDGSYENVNLVGGYYDAGDNVKFG 64
           ++Y+DALGKA+LFFEGQRSGKLP  QRVKWRG+SALSDG  +NV+LVGGYYDAGDNVKFG
Sbjct: 54  YDYADALGKAILFFEGQRSGKLPKDQRVKWRGDSALSDGKPQNVDLVGGYYDAGDNVKFG 113

Query: 65  WPMAFTLTLLSWTTVEYEKEIASVMQLEHLRSSVRWGTDFILRAHVSPTTLYTQVGDANG 124
           WPM+FT++LLSW  VEYE EI+SV QL +L S++RWG DFI ++H SPTTL+TQVGD N 
Sbjct: 114 WPMSFTVSLLSWAAVEYESEISSVNQLGYLHSAIRWGADFIFQSHTSPTTLFTQVGDGNA 173

Query: 125 DHQCWERPEDMDTPRTLYKITPNSPGTEAAAEAAAALAAASILFNRVDANYSRRLLQHSK 184
           DHQCWERPEDMDTPRT+YKI  NSPGTEAAAEAAAAL+AASI+F + D NYS +LL+ SK
Sbjct: 174 DHQCWERPEDMDTPRTVYKIDANSPGTEAAAEAAAALSAASIVFKKTDVNYSSKLLKQSK 233

Query: 185 SLFQFADKFRGSYSASCPFYCSYSGYQDELLWAAAWLYKASGNMKYLRYVLSNQWWSQPT 244
           SLF FADK+RGSYSASCPFYCSYSGYQDELLWAA+WLYKASG  KYL+Y++ NQ WSQ  
Sbjct: 234 SLFDFADKYRGSYSASCPFYCSYSGYQDELLWAASWLYKASGESKYLKYIIDNQGWSQTG 293

Query: 245 SEFSWDNKFVGAQILLTKEFYKGKKNLSKFKNDVETFICKLMPDDGGSSEISRTPGGLLF 304
           SEFSWDNKFVGAQ LLT+EFY GKK LSK ++D E+FIC LMP    S +I  TPGGLL+
Sbjct: 294 SEFSWDNKFVGAQTLLTQEFYDGKKELSKIQSDAESFICGLMP-GSSSVQIKTTPGGLLY 353

Query: 305 LRDNSNLQYTSSSSMVLFMYSRLLNQAHIYGIHCGSKYFSSSQIKTFAKSQVDYILGKNT 364
            RD+SNLQYT++S+MVLF++S++LN+ HI GIHCGS +FS S+I  FAKSQVDYILGKN 
Sbjct: 354 TRDSSNLQYTTTSTMVLFIFSKILNKNHIDGIHCGSSHFSPSEISAFAKSQVDYILGKNP 413

Query: 365 LKMSYMVGFGNKYPLQLHHRASSIPSTKVLSTKVGCNDGRSSYFYSNGPNPNTHIGSIVG 424
           +KMSYMVG+G+KYP QLHHR SSIPS KV  TKVGCNDG S YF S+ PNPN H+G+IVG
Sbjct: 414 MKMSYMVGYGSKYPKQLHHRGSSIPSIKVHPTKVGCNDGHSDYFSSSNPNPNIHVGAIVG 473

Query: 425 GPYLNDEFSDLRSDYSHSEPTTYMNAAFVGSVAALV 461
           GP  ND+FSD RSDYSHSEPTTYMNAAF+GSVAAL+
Sbjct: 474 GPDSNDQFSDARSDYSHSEPTTYMNAAFIGSVAALL 508

BLAST of Cucsa.303540 vs. NCBI nr
Match: gi|357481551|ref|XP_003611061.1| (glycosyl hydrolase family 9 protein [Medicago truncatula])

HSP 1 Score: 713.8 bits (1841), Expect = 2.0e-202
Identity = 334/456 (73.25%), Positives = 391/456 (85.75%), Query Frame = 1

Query: 5   FNYSDALGKAVLFFEGQRSGKLPVTQRVKWRGNSALSDGSYENVNLVGGYYDAGDNVKFG 64
           ++Y+DALGKA+LFFEGQRSGKLP  QRVKWRG+SALSDG  +NV+LVGGYYDAGDNVKFG
Sbjct: 39  YDYADALGKAILFFEGQRSGKLPKDQRVKWRGDSALSDGKTQNVDLVGGYYDAGDNVKFG 98

Query: 65  WPMAFTLTLLSWTTVEYEKEIASVMQLEHLRSSVRWGTDFILRAHVSPTTLYTQVGDANG 124
           WPM+FT++LLSW  VEYE EI+S  QL++LRS++RWG DFIL+AH SPTTL+TQVGD N 
Sbjct: 99  WPMSFTVSLLSWAAVEYESEISSANQLDYLRSAIRWGADFILKAHTSPTTLFTQVGDGNA 158

Query: 125 DHQCWERPEDMDTPRTLYKITPNSPGTEAAAEAAAALAAASILFNRVDANYSRRLLQHSK 184
           DH CWERPEDMDTPRT YKI  NSPGTEAAAEAAAAL+AASI+F + D NYS +LL  SK
Sbjct: 159 DHNCWERPEDMDTPRTTYKIDSNSPGTEAAAEAAAALSAASIVFKKTDINYSSKLLSQSK 218

Query: 185 SLFQFADKFRGSYSASCPFYCSYSGYQDELLWAAAWLYKASGNMKYLRYVLSNQWWSQPT 244
           SLF FADK+RGSYS SCPFYCSYSGYQDELLWAA WLYKASG  KYL Y+ SNQ WSQ  
Sbjct: 219 SLFDFADKYRGSYSGSCPFYCSYSGYQDELLWAATWLYKASGESKYLTYITSNQGWSQAV 278

Query: 245 SEFSWDNKFVGAQILLTKEFYKGKKNLSKFKNDVETFICKLMPDDGGSSEISRTPGGLLF 304
           SEFSWDNKFVGAQ LLT+EFY G++ L+KF+ D E+FIC LMP    S +I  TPGGLL+
Sbjct: 279 SEFSWDNKFVGAQTLLTQEFYGGREELAKFQTDAESFICALMP-GSSSLQIKTTPGGLLY 338

Query: 305 LRDNSNLQYTSSSSMVLFMYSRLLNQAHIYGIHCGSKYFSSSQIKTFAKSQVDYILGKNT 364
           +RD+SNLQYT++S+MVLF++S++LN+ HI GIHCGS +FS S+I+ FAK QVDYILG N 
Sbjct: 339 IRDSSNLQYTTTSTMVLFIFSKILNKNHIDGIHCGSAHFSPSEIRAFAKLQVDYILGNNP 398

Query: 365 LKMSYMVGFGNKYPLQLHHRASSIPSTKVLSTKVGCNDGRSSYFYSNGPNPNTHIGSIVG 424
           +KMSYMVG+G+KYP QLHHR SSIPS KV  TKVGCNDG+S+YF S+ PNPN H+G+IVG
Sbjct: 399 MKMSYMVGYGSKYPKQLHHRGSSIPSIKVHQTKVGCNDGQSNYFSSSNPNPNIHVGAIVG 458

Query: 425 GPYLNDEFSDLRSDYSHSEPTTYMNAAFVGSVAALV 461
           GP  ND+++D RSDYSH+EPTTYMNAAFVGSVAAL+
Sbjct: 459 GPNSNDQYNDARSDYSHAEPTTYMNAAFVGSVAALL 493

BLAST of Cucsa.303540 vs. NCBI nr
Match: gi|567893952|ref|XP_006439464.1| (hypothetical protein CICLE_v10023813mg [Citrus clementina])

HSP 1 Score: 711.1 bits (1834), Expect = 1.3e-201
Identity = 336/457 (73.52%), Positives = 390/457 (85.34%), Query Frame = 1

Query: 4   TFNYSDALGKAVLFFEGQRSGKLPVTQRVKWRGNSALSDGSYENVNLVGGYYDAGDNVKF 63
           TF Y DALGKA+LFFEGQRSGKLP +QRVKWRGNSALSDG  ENVNL+GGYYDAGDNVKF
Sbjct: 38  TFAYRDALGKAILFFEGQRSGKLPESQRVKWRGNSALSDGKPENVNLIGGYYDAGDNVKF 97

Query: 64  GWPMAFTLTLLSWTTVEYEKEIASVMQLEHLRSSVRWGTDFILRAHVSPTTLYTQVGDAN 123
           GWPMA++++LLSW  VEY++EI+SV QL +LR ++RWGTDFILRAH SPTTLYTQVGD N
Sbjct: 98  GWPMAYSVSLLSWAAVEYQREISSVNQLGYLRGAIRWGTDFILRAHTSPTTLYTQVGDGN 157

Query: 124 GDHQCWERPEDMDTPRTLYKITPNSPGTEAAAEAAAALAAASILFNRVDANYSRRLLQHS 183
            DHQCWERPEDMDTPRTLY+IT +SPG+EAAAE+AAALAAASI+F +VD+ YS RLL HS
Sbjct: 158 ADHQCWERPEDMDTPRTLYRITSDSPGSEAAAESAAALAAASIVFKKVDSIYSSRLLNHS 217

Query: 184 KSLFQFADKFRGSYSASCPFYCSYSGYQDELLWAAAWLYKASGNMKYLRYVLSNQWWSQP 243
           KSLF+FADK RGSY ASCPFYCSYSGYQDELLWAAAWLYKAS + KYL YVLSNQ WSQ 
Sbjct: 218 KSLFEFADKHRGSYQASCPFYCSYSGYQDELLWAAAWLYKASEDNKYLNYVLSNQGWSQV 277

Query: 244 TSEFSWDNKFVGAQILLTKEFYKGKKNLSKFKNDVETFICKLMPDDGGSSEISRTPGGLL 303
            SEFSWDNKF GAQ+LL KEF+ G K LS FK  VE+F+C LMP +  S  I  TPGGLL
Sbjct: 278 ASEFSWDNKFAGAQMLLAKEFFGGNKQLSLFKIHVESFVCALMP-ESSSVRIETTPGGLL 337

Query: 304 FLRDNSNLQYTSSSSMVLFMYSRLLNQAHIYGIHCGSKYFSSSQIKTFAKSQVDYILGKN 363
           ++RD+SNLQY +S++++LF+YS+ LN AHI G+ CGS +FS+SQI  FAKSQVDYILGKN
Sbjct: 338 YIRDSSNLQYVTSATLLLFLYSKTLNTAHINGLQCGSAHFSASQISAFAKSQVDYILGKN 397

Query: 364 TLKMSYMVGFGNKYPLQLHHRASSIPSTKVLSTKVGCNDGRSSYFYSNGPNPNTHIGSIV 423
            +KMSYM GFG+K+PLQ+HHR +SIPS      KV CNDG SSY++S+ PNPN H+G+IV
Sbjct: 398 PMKMSYMAGFGSKFPLQIHHRGASIPSIGAHPAKVSCNDGYSSYYHSSNPNPNVHVGAIV 457

Query: 424 GGPYLNDEFSDLRSDYSHSEPTTYMNAAFVGSVAALV 461
           GGP  ND+F DLR+DYSH+EPTTYMNAAFVGSVA L+
Sbjct: 458 GGPDSNDQFKDLRTDYSHAEPTTYMNAAFVGSVAPLL 493
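
In the alignments above, the middle line of each block marks identical residues with the residue letter and conservative substitutions with `+`; positives are the identities plus the `+` positions. A small sketch of how those counts could be recovered from a midline string is shown below. The function name `count_midline` is an illustrative assumption, and the midline passed in should be only the alignment columns, without the left-hand padding used for layout.

```python
def count_midline(midline):
    """Count identities and positives in one BLAST midline string.

    Letters mark identical residues, '+' marks a conservative
    substitution, and spaces mark mismatches or gap columns.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    # Positives include the identical positions as well.
    return identities, identities + conservative

# Midline of the last alignment block of the Citrus clementina hit above.
identities, positives = count_midline("GGP  ND+F DLR+DYSH+EPTTYMNAAFVGSVA L+")
```

To reproduce the totals in an HSP header, the midlines of all blocks of that HSP would need to be concatenated before counting.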

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
GUN_PHAVU    6.4e-201  72.81     Endoglucanase OS=Phaseolus vulgaris PE=2 SV=2
GUN20_ARATH  9.9e-186  67.62     Endoglucanase 20 OS=Arabidopsis thaliana GN=At4g23560 PE=2 SV=1
GUN18_ARATH  7.4e-181  65.71     Endoglucanase 18 OS=Arabidopsis thaliana GN=At4g09740 PE=3 SV=2
GUN20_ORYSJ  1.1e-168  62.11     Endoglucanase 20 OS=Oryza sativa subsp. japonica GN=GLU15 PE=2 SV=1
GUN1_ARATH   1.8e-142  54.64     Endoglucanase 1 OS=Arabidopsis thaliana GN=CEL2 PE=2 SV=1

Match Name        E-value   Identity  Description
A0A0A0KK57_CUCSA  2.7e-270  99.13     Endoglucanase OS=Cucumis sativus GN=Csa_5G152200 PE=3 SV=1
G7KC03_MEDTR      1.4e-202  73.25     Endoglucanase OS=Medicago truncatula GN=MTR_5g010000 PE=3 SV=1
V4VJ70_9ROSI      9.0e-202  73.52     Endoglucanase OS=Citrus clementina GN=CICLE_v10023813mg PE=3 SV=1
A0A067G9J1_CITSI  1.5e-201  73.52     Endoglucanase OS=Citrus sinensis GN=CISIN_1g042201mg PE=3 SV=1
G7KBZ9_MEDTR      1.7e-200  72.21     Endoglucanase OS=Medicago truncatula GN=MTR_5g009970 PE=3 SV=1

Match Name   E-value   Identity  Description
AT4G23560.1  5.6e-187  67.62     glycosyl hydrolase 9B15
AT4G09740.1  4.1e-182  65.71     glycosyl hydrolase 9B14
AT1G02800.1  1.0e-143  54.64     cellulase 2
AT4G02290.1  1.0e-135  51.63     glycosyl hydrolase 9B13
AT1G71380.1  1.4e-132  50.97     cellulase 3

Match Name                       E-value   Identity  Description
gi|449452444|ref|XP_004143969.1|  3.8e-270  99.13     PREDICTED: endoglucanase-like [Cucumis sativus]
gi|659074659|ref|XP_008437724.1|  7.0e-248  91.96     PREDICTED: endoglucanase-like [Cucumis melo]
gi|502159858|ref|XP_004511554.1|  2.8e-204  74.34     PREDICTED: endoglucanase [Cicer arietinum]
gi|357481551|ref|XP_003611061.1|  2.0e-202  73.25     glycosyl hydrolase family 9 protein [Medicago truncatula]
gi|567893952|ref|XP_006439464.1|  1.3e-201  73.52     hypothetical protein CICLE_v10023813mg [Citrus clementina]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR001701  Glyco_hydro_9
IPR008928  6-hairpin_glycosidase_sf
IPR012341  6hp_glycosidase-like_sf
IPR018221  Glyco_hydro_9_His_AS
Vocabulary: Molecular Function
Term        Definition
GO:0004553  hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0003824  catalytic activity
Vocabulary: Biological Process
Term        Definition
GO:0005975  carbohydrate metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0030245 cellulose catabolic process
biological_process GO:0009835 fruit ripening
biological_process GO:0005982 starch metabolic process
biological_process GO:0005985 sucrose metabolic process
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0008810 cellulase activity
molecular_function GO:0003824 catalytic activity
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Cucsa.303540.1  Cucsa.303540.1  mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR Term   IPR Description                                Source   Source Term       Source Description        Alignment
IPR001701  Glycoside hydrolase family 9                   PFAM     PF00759           Glyco_hydro_9             coord: 7..454; score: 1.6E
IPR008928  Six-hairpin glycosidase-like                   unknown  SSF48208          Six-hairpin glycosidases  coord: 4..460; score: 9.53E
IPR012341  Six-hairpin glycosidase                        GENE3D   G3DSA:1.50.10.10  -                         coord: 5..459; score: 8.1E
IPR018221  Glycoside hydrolase family 9, His active site  PROSITE  PS00592           GLYCOSYL_HYDROL_F9_1      coord: 368..384
None       No IPR available                               PANTHER  PTHR22298         ENDO-1,4-BETA-GLUCANASE   coord: 6..460
None       No IPR available                               PANTHER  PTHR22298:SF22    ENDOGLUCANASE 18-RELATED  coord: 6..460

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene          Organism                      Block
Cucsa.303540  Melon (DHL92) v3.6.1          cgymedB495
Cucsa.303540  Melon (DHL92) v3.6.1          cgymedB498
Cucsa.303540  Melon (DHL92) v3.6.1          cgymedB500
Cucsa.303540  Silver-seed gourd             carcgyB1043
Cucsa.303540  Silver-seed gourd             carcgyB1054
Cucsa.303540  Cucumber (Chinese Long) v3    cgycucB468
Cucsa.303540  Cucumber (Chinese Long) v3    cgycucB469
Cucsa.303540  Cucumber (Chinese Long) v3    cgycucB471
Cucsa.303540  Watermelon (97103) v2         cgywmbB520
Cucsa.303540  Watermelon (97103) v2         cgywmbB523
Cucsa.303540  Watermelon (97103) v2         cgywmbB524
Cucsa.303540  Watermelon (97103) v2         cgywmbB526
Cucsa.303540  Wax gourd                     cgywgoB602
Cucsa.303540  Cucumber (Gy14) v1            cgycgyB056
Cucsa.303540  Cucumber (Gy14) v1            cgycgyB087
Cucsa.303540  Cucurbita maxima (Rimu)       cgycmaB0830
Cucsa.303540  Cucurbita maxima (Rimu)       cgycmaB0831
Cucsa.303540  Cucurbita maxima (Rimu)       cgycmaB0833
Cucsa.303540  Cucurbita maxima (Rimu)       cgycmaB0837
Cucsa.303540  Cucurbita maxima (Rimu)       cgycmaB0838
Cucsa.303540  Cucumber (Chinese Long) v2    cgycuB435
Cucsa.303540  Cucurbita moschata (Rifu)     cgycmoB0824
Cucsa.303540  Cucurbita moschata (Rifu)     cgycmoB0825
Cucsa.303540  Cucurbita moschata (Rifu)     cgycmoB0828
Cucsa.303540  Cucurbita moschata (Rifu)     cgycmoB0833
Cucsa.303540  Wild cucumber (PI 183967)     cgycpiB454
Cucsa.303540  Wild cucumber (PI 183967)     cgycpiB456
Cucsa.303540  Wild cucumber (PI 183967)     cgycpiB458
Cucsa.303540  Cucumber (Chinese Long) v2    cgycuB433
Cucsa.303540  Cucumber (Chinese Long) v2    cgycuB431
Cucsa.303540  Melon (DHL92) v3.5.1          cgymeB496
Cucsa.303540  Melon (DHL92) v3.5.1          cgymeB497
Cucsa.303540  Melon (DHL92) v3.5.1          cgymeB499
Cucsa.303540  Melon (DHL92) v3.5.1          cgymeB501
Cucsa.303540  Watermelon (Charleston Gray)  cgywcgB521
Cucsa.303540  Watermelon (Charleston Gray)  cgywcgB522
Cucsa.303540  Watermelon (Charleston Gray)  cgywcgB523
Cucsa.303540  Watermelon (97103) v1         cgywmB549
Cucsa.303540  Watermelon (97103) v1         cgywmB550
Cucsa.303540  Cucurbita pepo (Zucchini)     cgycpeB0789
Cucsa.303540  Cucurbita pepo (Zucchini)     cgycpeB0792
Cucsa.303540  Bottle gourd (USVL1VR-Ls)     cgylsiB486
Cucsa.303540  Bottle gourd (USVL1VR-Ls)     cgylsiB489