CsGy5G016230 (gene) Cucumber (Gy14) v2

NameCsGy5G016230
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v2)
Descriptionmajor extracellular endoglucanase-like
LocationChr5 : 22589248 .. 22590906 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTGGCAGAAGGCCTCAACCATAGGCCATTAAAAGACCTAGCAGACGAGGCGATCAAGTTAAGATTTAATTGTGTACGACTCACATATGCAACTCATATGTTCACTCGCTATGCAAATAGGACGGTTGAGGAAAACTTTGATCTTCTTGATTTAAAACAAGCAAAAGCTGGATTGGCTCAATATAATCCTTTTGTGTTGAATAAGACTATTGCTGAAGCATATGAAGCTGTTGTTGATGTGTTGGGGGCAAGTGGTTTAATGGTTATCGCTGACAATCACATGAGCCAACCAAGATGGTGTTGCTCTCTCGATGATGGCAATGGCTTCTTCGGAAACAACAATTTTGACCCTCAAGAATGGCTACAAGGGCTTAGCTTGGTTGCTCAACGCTTTCGCAACAAATCAACGGTATGTAATTAATTCACATCACGAGATTTTCTTAGATTCTTAAAACTAAACTCAAACTTTTAAGAACATGTATAGTATTTCATTAATTAACAATTTTTTTAATGGCTGTTTAGGTGGTAGGAATGAGTTTACGAAATGAGATACGAGGCTTTATGGAAAATGCAAATGATTGGAACAAATATATAACTCAAGGGGTAACCACGATTCATAACATAAATTCGGAAGTTTTAGTCATTGTTTCAGGGTTAAATTATGACAACGATCTTCGATGCTTAAAGGAAAAGCCTTTGAATGTTGGCACCTTAGACAATAAGTTGGTTTTCGAGGTACACTTATATTCTTTCAGTGGAGATTCCGAGAGCAAGTTTGTAAAACAACCATTGAACAATATATGTGCAAATATTATGAATGGATTTATAGACCATGCTGGGTTTGTGATGCAAGGACCAAACCCGTTTCCATTATTTGTTAGTGAATATGGATATGATCAAAGAGAAGTTAACGATGCTGAAAACCGATTCATGAGTTGCTTCACAGCCCATCTTGCACAAAGAGATTTGGATTGGGCATTGTGGGCTTGGCAAGGTAGCTATTATTTTAGAGAAGGTCAAGCAGAGCCTGGAGAAAGTTTCGGAGTGCTCGACTCTAATTGGACTCAAATTAAGAACCCTAACTTTGTACAAAAGTTTCAACTATTGCAGACCATGTTGCACGGTAACTATATAAATGATTATTACAAGTACATGCAATGATCTATAGTTATATAGTATCAAAGTAACACTTTGTATAACATCTTTTTATGATATATATGCAGATCCAAATTCTAATGCATCGTTCTCATATGTTATATATCATCCACAAAGTAGCCAATGCATCCAAGTCTCCAATGACAACAAAGAAATTTTCCTCACCAATTGCTCCACCCCAACTCGATGGAGTCATAACAATGATGGCACTCCAATTGAGATGTCAAGCACTGGTTTATACTTGAAGGCCAGTGGGGAAGGCCTTGAGGCATCTCTTTCAACTGATACCTTAAGCCAACAAAGTGTTTGGAGTGCCATTTCAAACTCTAAACTTCATTTGGCCACCTTCACTCAAGGTGGAAAGAGCCTTTGTTTGCAAATGGATAGCTCCAACTCTTCAAAAGTTGTGACCAACTCTTGCATTTGCACCAATGGTGATCCAAATTGCCTCCAAGACACCCGAAGCCAATGGTTTGAACTCGTTGAAACCAACACATTATGA

mRNA sequence

ATGTTGGCAGAAGGCCTCAACCATAGGCCATTAAAAGACCTAGCAGACGAGGCGATCAAGTTAAGATTTAATTGTGTACGACTCACATATGCAACTCATATGTTCACTCGCTATGCAAATAGGACGGTTGAGGAAAACTTTGATCTTCTTGATTTAAAACAAGCAAAAGCTGGATTGGCTCAATATAATCCTTTTGTGTTGAATAAGACTATTGCTGAAGCATATGAAGCTGTTGTTGATGTGTTGGGGGCAAGTGGTTTAATGGTTATCGCTGACAATCACATGAGCCAACCAAGATGGTGTTGCTCTCTCGATGATGGCAATGGCTTCTTCGGAAACAACAATTTTGACCCTCAAGAATGGCTACAAGGGCTTAGCTTGGTTGCTCAACGCTTTCGCAACAAATCAACGGTGGTAGGAATGAGTTTACGAAATGAGATACGAGGCTTTATGGAAAATGCAAATGATTGGAACAAATATATAACTCAAGGGGTAACCACGATTCATAACATAAATTCGGAAGTTTTAGTCATTGTTTCAGGGTTAAATTATGACAACGATCTTCGATGCTTAAAGGAAAAGCCTTTGAATGTTGGCACCTTAGACAATAAGTTGGTTTTCGAGGTACACTTATATTCTTTCAGTGGAGATTCCGAGAGCAAGTTTGTAAAACAACCATTGAACAATATATGTGCAAATATTATGAATGGATTTATAGACCATGCTGGGTTTGTGATGCAAGGACCAAACCCGTTTCCATTATTTGTTAGTGAATATGGATATGATCAAAGAGAAGTTAACGATGCTGAAAACCGATTCATGAGTTGCTTCACAGCCCATCTTGCACAAAGAGATTTGGATTGGGCATTGTGGGCTTGGCAAGGTAGCTATTATTTTAGAGAAGGTCAAGCAGAGCCTGGAGAAAGTTTCGGAGTGCTCGACTCTAATTGGACTCAAATTAAGAACCCTAACTTTGTACAAAAGTTTCAACTATTGCAGACCATGTTGCACGATCCAAATTCTAATGCATCGTTCTCATATGTTATATATCATCCACAAAGTAGCCAATGCATCCAAGTCTCCAATGACAACAAAGAAATTTTCCTCACCAATTGCTCCACCCCAACTCGATGGAGTCATAACAATGATGGCACTCCAATTGAGATGTCAAGCACTGGTTTATACTTGAAGGCCAGTGGGGAAGGCCTTGAGGCATCTCTTTCAACTGATACCTTAAGCCAACAAAGTGTTTGGAGTGCCATTTCAAACTCTAAACTTCATTTGGCCACCTTCACTCAAGGTGGAAAGAGCCTTTGTTTGCAAATGGATAGCTCCAACTCTTCAAAAGTTGTGACCAACTCTTGCATTTGCACCAATGGTGATCCAAATTGCCTCCAAGACACCCGAAGCCAATGGTTTGAACTCGTTGAAACCAACACATTATGA

Coding sequence (CDS)

ATGTTGGCAGAAGGCCTCAACCATAGGCCATTAAAAGACCTAGCAGACGAGGCGATCAAGTTAAGATTTAATTGTGTACGACTCACATATGCAACTCATATGTTCACTCGCTATGCAAATAGGACGGTTGAGGAAAACTTTGATCTTCTTGATTTAAAACAAGCAAAAGCTGGATTGGCTCAATATAATCCTTTTGTGTTGAATAAGACTATTGCTGAAGCATATGAAGCTGTTGTTGATGTGTTGGGGGCAAGTGGTTTAATGGTTATCGCTGACAATCACATGAGCCAACCAAGATGGTGTTGCTCTCTCGATGATGGCAATGGCTTCTTCGGAAACAACAATTTTGACCCTCAAGAATGGCTACAAGGGCTTAGCTTGGTTGCTCAACGCTTTCGCAACAAATCAACGGTGGTAGGAATGAGTTTACGAAATGAGATACGAGGCTTTATGGAAAATGCAAATGATTGGAACAAATATATAACTCAAGGGGTAACCACGATTCATAACATAAATTCGGAAGTTTTAGTCATTGTTTCAGGGTTAAATTATGACAACGATCTTCGATGCTTAAAGGAAAAGCCTTTGAATGTTGGCACCTTAGACAATAAGTTGGTTTTCGAGGTACACTTATATTCTTTCAGTGGAGATTCCGAGAGCAAGTTTGTAAAACAACCATTGAACAATATATGTGCAAATATTATGAATGGATTTATAGACCATGCTGGGTTTGTGATGCAAGGACCAAACCCGTTTCCATTATTTGTTAGTGAATATGGATATGATCAAAGAGAAGTTAACGATGCTGAAAACCGATTCATGAGTTGCTTCACAGCCCATCTTGCACAAAGAGATTTGGATTGGGCATTGTGGGCTTGGCAAGGTAGCTATTATTTTAGAGAAGGTCAAGCAGAGCCTGGAGAAAGTTTCGGAGTGCTCGACTCTAATTGGACTCAAATTAAGAACCCTAACTTTGTACAAAAGTTTCAACTATTGCAGACCATGTTGCACGATCCAAATTCTAATGCATCGTTCTCATATGTTATATATCATCCACAAAGTAGCCAATGCATCCAAGTCTCCAATGACAACAAAGAAATTTTCCTCACCAATTGCTCCACCCCAACTCGATGGAGTCATAACAATGATGGCACTCCAATTGAGATGTCAAGCACTGGTTTATACTTGAAGGCCAGTGGGGAAGGCCTTGAGGCATCTCTTTCAACTGATACCTTAAGCCAACAAAGTGTTTGGAGTGCCATTTCAAACTCTAAACTTCATTTGGCCACCTTCACTCAAGGTGGAAAGAGCCTTTGTTTGCAAATGGATAGCTCCAACTCTTCAAAAGTTGTGACCAACTCTTGCATTTGCACCAATGGTGATCCAAATTGCCTCCAAGACACCCGAAGCCAATGGTTTGAACTCGTTGAAACCAACACATTATGA

Protein sequence

MLAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLKQAKAGLAQYNPFVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDPQEWLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLVIVSGLNYDNDLRCLKEKPLNVGTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNGFIDHAGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQRDLDWALWAWQGSYYFREGQAEPGESFGVLDSNWTQIKNPNFVQKFQLLQTMLHDPNSNASFSYVIYHPQSSQCIQVSNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGEGLEASLSTDTLSQQSVWSAISNSKLHLATFTQGGKSLCLQMDSSNSSKVVTNSCICTNGDPNCLQDTRSQWFELVETNTL
BLAST of CsGy5G016230 vs. NCBI nr
Match: XP_011658389.1 (PREDICTED: uncharacterized protein LOC101207450 [Cucumis sativus] >KGN45940.1 hypothetical protein Csa_6G028440 [Cucumis sativus])

HSP 1 Score: 875.5 bits (2261), Expect = 8.0e-251
Identity = 422/482 (87.55%), Postives = 447/482 (92.74%), Query Frame = 0

Query: 1   MLAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLKQAKAGLA 60
           ML EGLNHRPLK+LADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDL+QAKAGLA
Sbjct: 57  MLIEGLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLEQAKAGLA 116

Query: 61  QYNPFVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDPQE 120
           QYNPFVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGN  FDPQE
Sbjct: 117 QYNPFVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNRYFDPQE 176

Query: 121 WLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLVIVS 180
           WLQGLSLVAQRF NKSTVVGMSLRNE+RG MENANDWN Y+TQGVTTIH IN  VLVIVS
Sbjct: 177 WLQGLSLVAQRFNNKSTVVGMSLRNELRGMMENANDWNNYVTQGVTTIHKINPAVLVIVS 236

Query: 181 GLNYDNDLRCLKEKPLNVGTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNGFID 240
           GLNYDNDLRCLK+KPLNV TLDNKL FEVHLYSFSGDSESKFV+QPLNNICA IM+ FID
Sbjct: 237 GLNYDNDLRCLKDKPLNVSTLDNKLAFEVHLYSFSGDSESKFVQQPLNNICAKIMHEFID 296

Query: 241 HAGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQRDLDWALWAWQGSYYFR 300
           HA FV++GPNPFPLFVSEYGYDQREV+DAENRFMSCFTAHLAQ+DLDWALW WQGSYY+R
Sbjct: 297 HAEFVIEGPNPFPLFVSEYGYDQREVDDAENRFMSCFTAHLAQKDLDWALWTWQGSYYYR 356

Query: 301 EGQAEPGESFGVLDSNWTQIKNPNFVQKFQLLQTMLHDPNSNASFSYVIYHPQSSQCIQV 360
           EGQAE  E+FGVLDSNWTQIKNPNFVQKFQLLQTML DP SNASFSYVIYH QS QCI+V
Sbjct: 357 EGQAELAETFGVLDSNWTQIKNPNFVQKFQLLQTMLQDPYSNASFSYVIYHVQSGQCIEV 416

Query: 361 SNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGEGLEASLSTDTLSQQSVWSA 420
           SNDNKEIFLTNCST +RWSH+ND TPI+MSSTGL LKASGEGLEASLSTD + +QS+WSA
Sbjct: 417 SNDNKEIFLTNCSTSSRWSHDNDSTPIKMSSTGLCLKASGEGLEASLSTDCIGKQSLWSA 476

Query: 421 ISNSKLHLATFTQGGKSLCLQ-MDSSNSSKVVTNSCICTNGDPNCLQDTRSQWFELVETN 480
           ISNS LHL T T+ GKSLCLQ ++SSNSSK+VTNSCICT  DP CLQDT+SQWFELV TN
Sbjct: 477 ISNSNLHLGTVTEDGKSLCLQIIESSNSSKIVTNSCICTTNDPTCLQDTQSQWFELVATN 536

Query: 481 TL 482
           TL
Sbjct: 537 TL 538

BLAST of CsGy5G016230 vs. NCBI nr
Match: XP_022932816.1 (uncharacterized protein LOC111439277 [Cucurbita moschata])

HSP 1 Score: 873.2 bits (2255), Expect = 4.0e-250
Identity = 418/481 (86.90%), Postives = 441/481 (91.68%), Query Frame = 0

Query: 1   MLAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLKQAKAGLA 60
           ML EGLNHRPLK+LADEAIKLRFNCVRLTYAT MFTRYANRTVEENFDLLDL+QAKAGLA
Sbjct: 1   MLIEGLNHRPLKELADEAIKLRFNCVRLTYATQMFTRYANRTVEENFDLLDLEQAKAGLA 60

Query: 61  QYNPFVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDPQE 120
           QYNPFVLNKTIAEAYEAVVDVLG SGLMVIADNHMSQPRWCCSLDDGNGFFG+  FDPQE
Sbjct: 61  QYNPFVLNKTIAEAYEAVVDVLGESGLMVIADNHMSQPRWCCSLDDGNGFFGDRYFDPQE 120

Query: 121 WLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLVIVS 180
           WLQGLSLVAQRF  KSTVVGMSLRNEIRG  ENANDWN Y+TQGVTTIHNIN  VLVIVS
Sbjct: 121 WLQGLSLVAQRFSKKSTVVGMSLRNEIRGTNENANDWNNYVTQGVTTIHNINPNVLVIVS 180

Query: 181 GLNYDNDLRCLKEKPLNVGTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNGFID 240
           GLN+DNDLRCLKEKPLNV TLDNKLVFEVHLYSFSGD ESKF+ QPLNNICANI+NGF+D
Sbjct: 181 GLNFDNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDQESKFINQPLNNICANIINGFVD 240

Query: 241 HAGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQRDLDWALWAWQGSYYFR 300
           HA FV +GPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQ DLDWALW WQGSYY+R
Sbjct: 241 HAEFVREGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQEDLDWALWTWQGSYYYR 300

Query: 301 EGQAEPGESFGVLDSNWTQIKNPNFVQKFQLLQTMLHDPNSNASFSYVIYHPQSSQCIQV 360
           EGQA P E+FGVLDSNWTQIKNPNFVQKFQLLQTML DPNSNASFSYVIYHPQS QCIQV
Sbjct: 301 EGQAGPAETFGVLDSNWTQIKNPNFVQKFQLLQTMLQDPNSNASFSYVIYHPQSGQCIQV 360

Query: 361 SNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGEGLEASLSTDTLSQQSVWSA 420
           SNDNK+IF+ NCS  +RW+H+ND TPI MSSTGL LK SGEGL  SLSTD    QS W+A
Sbjct: 361 SNDNKDIFMGNCSISSRWTHDNDSTPIRMSSTGLCLKTSGEGLMPSLSTDCFGPQSSWTA 420

Query: 421 ISNSKLHLATFTQGGKSLCLQMDSSNSSKVVTNSCICTNGDPNCLQDTRSQWFELVETNT 480
           ISN+KLHLAT TQ GKSLCLQ++SSNSSK+VTNSCICT+G PNCLQDT+SQWFELVETNT
Sbjct: 421 ISNTKLHLATVTQDGKSLCLQVESSNSSKIVTNSCICTDGAPNCLQDTQSQWFELVETNT 480

Query: 481 L 482
           L
Sbjct: 481 L 481

BLAST of CsGy5G016230 vs. NCBI nr
Match: XP_022933313.1 (uncharacterized protein LOC111440529 [Cucurbita moschata])

HSP 1 Score: 873.2 bits (2255), Expect = 4.0e-250
Identity = 418/481 (86.90%), Postives = 441/481 (91.68%), Query Frame = 0

Query: 1   MLAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLKQAKAGLA 60
           ML EGLNHRPLK+LADEAIKLRFNCVRLTYAT MFTRYANRTVEENFDLLDL+QAKAGLA
Sbjct: 54  MLIEGLNHRPLKELADEAIKLRFNCVRLTYATQMFTRYANRTVEENFDLLDLEQAKAGLA 113

Query: 61  QYNPFVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDPQE 120
           QYNPFVLNKTIAEAYEAVVDVLG SGLMVIADNHMSQPRWCCSLDDGNGFFG+  FDPQE
Sbjct: 114 QYNPFVLNKTIAEAYEAVVDVLGESGLMVIADNHMSQPRWCCSLDDGNGFFGDRYFDPQE 173

Query: 121 WLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLVIVS 180
           WLQGLSLVAQRF  KSTVVGMSLRNEIRG  ENANDWN Y+TQGVTTIHNIN  VLVIVS
Sbjct: 174 WLQGLSLVAQRFSKKSTVVGMSLRNEIRGTNENANDWNNYVTQGVTTIHNINPNVLVIVS 233

Query: 181 GLNYDNDLRCLKEKPLNVGTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNGFID 240
           GLN+DNDLRCLKEKPLNV TLDNKLVFEVHLYSFSGD ESKF+ QPLNNICANI+NGF+D
Sbjct: 234 GLNFDNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDQESKFINQPLNNICANIINGFVD 293

Query: 241 HAGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQRDLDWALWAWQGSYYFR 300
           HA FV +GPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQ DLDWALW WQGSYY+R
Sbjct: 294 HAEFVREGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQEDLDWALWTWQGSYYYR 353

Query: 301 EGQAEPGESFGVLDSNWTQIKNPNFVQKFQLLQTMLHDPNSNASFSYVIYHPQSSQCIQV 360
           EGQA P E+FGVLDSNWTQIKNPNFVQKFQLLQTML DPNSNASFSYVIYHPQS QCIQV
Sbjct: 354 EGQAGPAETFGVLDSNWTQIKNPNFVQKFQLLQTMLQDPNSNASFSYVIYHPQSGQCIQV 413

Query: 361 SNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGEGLEASLSTDTLSQQSVWSA 420
           SNDNK+IF+ NCS  +RW+H+ND TPI MSSTGL LK SGEGL  SLSTD    QS W+A
Sbjct: 414 SNDNKDIFMGNCSISSRWTHDNDSTPIRMSSTGLCLKTSGEGLMPSLSTDCFGPQSSWTA 473

Query: 421 ISNSKLHLATFTQGGKSLCLQMDSSNSSKVVTNSCICTNGDPNCLQDTRSQWFELVETNT 480
           ISN+KLHLAT TQ GKSLCLQ++SSNSSK+VTNSCICT+G PNCLQDT+SQWFELVETNT
Sbjct: 474 ISNTKLHLATVTQDGKSLCLQVESSNSSKIVTNSCICTDGAPNCLQDTQSQWFELVETNT 533

Query: 481 L 482
           L
Sbjct: 534 L 534

BLAST of CsGy5G016230 vs. NCBI nr
Match: XP_022995752.1 (uncharacterized protein LOC111491191 [Cucurbita maxima])

HSP 1 Score: 871.3 bits (2250), Expect = 1.5e-249
Identity = 418/481 (86.90%), Postives = 441/481 (91.68%), Query Frame = 0

Query: 1   MLAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLKQAKAGLA 60
           ML EGLNHRPLK+LADEAIKLRFNCVRLTYAT MFTRYANRTVEENFDLLDL+QAKAGLA
Sbjct: 56  MLIEGLNHRPLKELADEAIKLRFNCVRLTYATQMFTRYANRTVEENFDLLDLEQAKAGLA 115

Query: 61  QYNPFVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDPQE 120
           Q NPFVLNKTIAEAYEAVVDVLG SGLMVIADNH+SQPRWCCSLDDGNGFFG+  FDPQE
Sbjct: 116 QNNPFVLNKTIAEAYEAVVDVLGESGLMVIADNHISQPRWCCSLDDGNGFFGDRYFDPQE 175

Query: 121 WLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLVIVS 180
           WLQGLSLVAQRF  KSTVVGMSLRNEIRG  ENANDWN Y+TQGVTTIHNIN  VLVIVS
Sbjct: 176 WLQGLSLVAQRFSKKSTVVGMSLRNEIRGTNENANDWNNYVTQGVTTIHNINPNVLVIVS 235

Query: 181 GLNYDNDLRCLKEKPLNVGTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNGFID 240
           GLNYDNDLRCLKEKPLNV TLDNKLVFEVHLYSFSGD ESKF+ QPLNNICANI+NGF+D
Sbjct: 236 GLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDPESKFINQPLNNICANIINGFVD 295

Query: 241 HAGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQRDLDWALWAWQGSYYFR 300
           HA FV +G NPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQ+DLDWALW WQGSYY+R
Sbjct: 296 HAEFVTEGSNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQKDLDWALWTWQGSYYYR 355

Query: 301 EGQAEPGESFGVLDSNWTQIKNPNFVQKFQLLQTMLHDPNSNASFSYVIYHPQSSQCIQV 360
           EGQAEPGE+FGVLDSNWTQIKNPNFVQKFQLLQTML DPNSNASFSYVIYHPQS QCIQV
Sbjct: 356 EGQAEPGETFGVLDSNWTQIKNPNFVQKFQLLQTMLQDPNSNASFSYVIYHPQSGQCIQV 415

Query: 361 SNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGEGLEASLSTDTLSQQSVWSA 420
           SNDNK+IF+ NCS  +RW+H+ND TPI MSS GL LK  GEGL  SLSTD L  QS WSA
Sbjct: 416 SNDNKDIFMGNCSISSRWTHDNDSTPIRMSSMGLCLKTIGEGLTPSLSTDCLGPQSSWSA 475

Query: 421 ISNSKLHLATFTQGGKSLCLQMDSSNSSKVVTNSCICTNGDPNCLQDTRSQWFELVETNT 480
           ISN+KLHLAT +Q GKSLCLQ++SSNSSK+VTNSCICTNG PNCLQDTRSQWFELV+TNT
Sbjct: 476 ISNTKLHLATISQDGKSLCLQVESSNSSKIVTNSCICTNGAPNCLQDTRSQWFELVKTNT 535

Query: 481 L 482
           L
Sbjct: 536 L 536

BLAST of CsGy5G016230 vs. NCBI nr
Match: XP_022930167.1 (uncharacterized protein LOC111436676 [Cucurbita moschata])

HSP 1 Score: 867.1 bits (2239), Expect = 2.9e-248
Identity = 415/481 (86.28%), Postives = 438/481 (91.06%), Query Frame = 0

Query: 1   MLAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLKQAKAGLA 60
           ML EGLNHRPLK+LADEAIKLRFNCVRLTYAT MFTRYANRTVEENFDLLDL+QAKAGLA
Sbjct: 1   MLIEGLNHRPLKELADEAIKLRFNCVRLTYATQMFTRYANRTVEENFDLLDLEQAKAGLA 60

Query: 61  QYNPFVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDPQE 120
           QYNPFVLNKTIAEAYEAVVDVLG SGLMVIADNHMSQPRWCCSLDDGNGFFG+  FDPQE
Sbjct: 61  QYNPFVLNKTIAEAYEAVVDVLGESGLMVIADNHMSQPRWCCSLDDGNGFFGDRYFDPQE 120

Query: 121 WLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLVIVS 180
           WLQGLS VAQRF  KSTVVGMSLRNEIRG  ENANDWN Y+TQGVTTIHNIN  VLVIVS
Sbjct: 121 WLQGLSFVAQRFSKKSTVVGMSLRNEIRGTNENANDWNNYVTQGVTTIHNINPNVLVIVS 180

Query: 181 GLNYDNDLRCLKEKPLNVGTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNGFID 240
            LN+DNDLRCLKEKPLNV TLDNKLVFEVHLYSFSGD ESKF+ QPLNNICANI+NGF+D
Sbjct: 181 SLNFDNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDQESKFINQPLNNICANIINGFVD 240

Query: 241 HAGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQRDLDWALWAWQGSYYFR 300
           HA FV +GPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQ DLDWALW WQGSYY+R
Sbjct: 241 HAEFVREGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQEDLDWALWTWQGSYYYR 300

Query: 301 EGQAEPGESFGVLDSNWTQIKNPNFVQKFQLLQTMLHDPNSNASFSYVIYHPQSSQCIQV 360
           EGQA P E+FGVLDSNWTQIKNPNFVQKFQLLQTML DPNSNASFSYVIYHPQS QCIQV
Sbjct: 301 EGQAGPAETFGVLDSNWTQIKNPNFVQKFQLLQTMLQDPNSNASFSYVIYHPQSGQCIQV 360

Query: 361 SNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGEGLEASLSTDTLSQQSVWSA 420
           SNDNK+IF+ NCS  +RW+H+ND TPI MSSTGL LK SGEGL  SLS D    QS W+A
Sbjct: 361 SNDNKDIFMGNCSISSRWTHDNDSTPIRMSSTGLCLKTSGEGLMPSLSKDCFGPQSSWTA 420

Query: 421 ISNSKLHLATFTQGGKSLCLQMDSSNSSKVVTNSCICTNGDPNCLQDTRSQWFELVETNT 480
           ISN+KLHLAT TQ GKSLCLQ++SSNSSK+VTNSCICT+G PNCLQDT+SQWFELVETNT
Sbjct: 421 ISNTKLHLATVTQDGKSLCLQVESSNSSKIVTNSCICTDGAPNCLQDTQSQWFELVETNT 480

Query: 481 L 482
           L
Sbjct: 481 L 481

BLAST of CsGy5G016230 vs. TAIR10
Match: AT1G13130.1 (Cellulase (glycosyl hydrolase family 5) protein)

HSP 1 Score: 365.2 bits (936), Expect = 6.4e-101
Identity = 190/482 (39.42%), Postives = 292/482 (60.58%), Query Frame = 0

Query: 1   MLAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMFTRYA---NRTVEENFDLLDLKQAKA 60
           ++AEGL+ +P+  +A + +++ FNCVRLT+   + T      N TV ++F  L L     
Sbjct: 64  VVAEGLSKQPVDAVAKKIVEMGFNCVRLTWPLDLMTNETLANNVTVRQSFQSLGLNDDIV 123

Query: 61  GLAQYNPFVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFD 120
           G    NP +++  + EAY+ VV  LG + +MVI DNH+++P WCC+ DDGNGFFG+  FD
Sbjct: 124 GFQTNNPSIIDLPLIEAYKTVVTTLGNNDVMVILDNHLTKPGWCCANDDGNGFFGDQFFD 183

Query: 121 PQEWLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLV 180
           P  W+  L  +A  F   S VVGMSLRNE+RG  +N NDW KY+ QG   +H+ N++VLV
Sbjct: 184 PTVWVAALKKMAATFNGVSNVVGMSLRNELRGPKQNVNDWFKYMQQGAEAVHSANNKVLV 243

Query: 181 IVSGLNYDNDLRCLKEKPLNVGTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNG 240
           I+SGL++D DL  ++ +P+ + +   KLVFE+H YSFS D  S     P N+IC  ++N 
Sbjct: 244 ILSGLSFDADLSFVRSRPVKL-SFTGKLVFELHWYSFS-DGNSWAANNP-NDICGRVLNR 303

Query: 241 FIDHAGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQRDLDWALWAWQGSY 300
             +  G+++     FPLF+SE+G D+R VN  +NR+  C T   A+ D+DW+LWA  GSY
Sbjct: 304 IGNGGGYLLN--QGFPLFLSEFGIDERGVNTNDNRYFGCLTGWAAENDVDWSLWALTGSY 363

Query: 301 YFREGQAEPGESFGVLDSNWTQIKNPNFVQKFQLLQTMLHDPNSNASFSYVIYHPQSSQC 360
           Y R+G+    E +GVLDS+W  ++N +F+QK   LQ+ L  P        +++HP +  C
Sbjct: 364 YLRQGKVGMNEYYGVLDSDWISVRNSSFLQKISFLQSPLQGPGPRTDAYNLVFHPLTGLC 423

Query: 361 IQVS-NDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGEGLEASLS-TDTLSQQ 420
           I  S +D K + L  C++   WS+      + +    L L+++G     +++ T   +  
Sbjct: 424 IVRSLDDPKMLTLGPCNSSEPWSYTKKA--LRIKDQQLCLQSNGPKNPVTMTRTSCSTSG 483

Query: 421 SVWSAISNSKLHLATFTQGGKSLCLQMDSSNSSKVVTNSCICTNGDPNCLQDTRSQWFEL 478
           S W  IS S++HLA+ T    SLCL +D++N+  VV N+C C + D +C  +  SQWF++
Sbjct: 484 SKWQTISASRMHLASTTSNKTSLCLDVDTANN--VVANACKCLSKDKSC--EPMSQWFKI 534

BLAST of CsGy5G016230 vs. TAIR10
Match: AT3G26130.1 (Cellulase (glycosyl hydrolase family 5) protein)

HSP 1 Score: 342.8 bits (878), Expect = 3.4e-94
Identity = 196/485 (40.41%), Postives = 280/485 (57.73%), Query Frame = 0

Query: 2   LAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMFTR---YANRTVEENFDLLDLKQAKAG 61
           +AEGL+ +PL  +A++ + + FNCVRLT+  ++ T     A  TV ++     L +A +G
Sbjct: 55  VAEGLSKQPLDAIAEKIVSMGFNCVRLTWPLYLATDESFSAFMTVRQSLRKFRLFEAVSG 114

Query: 62  LAQYNPFVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDP 121
              +NP +L+  + +A++ VV  L    +MVI DNH+SQP WCCS +DGNGFFG+ + +P
Sbjct: 115 FQTHNPTILDLPLIKAFQEVVYCLEKHRVMVILDNHISQPGWCCSDNDGNGFFGDKHLNP 174

Query: 122 QEWLQGLSLVAQRFRN-KSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLV 181
           Q W++GL  +A  F N  S VVGMSLRNE+RG  +N  DW KY+ +G   +H++N  VLV
Sbjct: 175 QVWIKGLKKMASMFANVSSNVVGMSLRNELRGPKQNIKDWYKYMREGAEAVHSVNPNVLV 234

Query: 182 IVSGLNYDNDLRCLKEKPLNVGTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNG 241
           IVSGLNY  DL  L+E+P  V +   K+VFE+H Y F    E       LN IC      
Sbjct: 235 IVSGLNYATDLSFLRERPFEV-SFRRKVVFEIHWYGFWNTWEG----DNLNKICGKETEK 294

Query: 242 FIDHAGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQRDLDWALWAWQGSY 301
            +  +GF+++     PLFVSE+G DQR  N  +N+F+SCF A  A RDLDW+LW   GSY
Sbjct: 295 MMKMSGFLLE--KGIPLFVSEFGIDQRGNNANDNKFLSCFMALAADRDLDWSLWTLAGSY 354

Query: 302 YFREGQAEPGESFGVLDSNWTQIKNPNFVQKFQLLQT-----MLHDPNSNASFSYVIYHP 361
           Y RE      ES+GVLD NW+ I+N   +Q    +QT     M   P        +++HP
Sbjct: 355 YIREKSIGSDESYGVLDFNWSSIRNSTILQMISAIQTPFIGLMETQPKK------IMFHP 414

Query: 362 QSSQCIQVSNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGEGLEASLST-DT 421
            +  CI V     ++ L +C+    W  ++           L LKA  +G    L    +
Sbjct: 415 STGLCI-VRKSLFQLKLGSCNRSESWRLSSHRVLSLAEEQILCLKAYEKGKSVKLRLFFS 474

Query: 422 LSQQSVWSAISNSKLHLATFTQGGKSLCLQMDSSNSSKVVTNSCICTNGDPNCLQDTRSQ 477
            S  S W   S+SK+ L++ T+ G S+CL +D+ N++ +VTNSC C  G+ +C  D RSQ
Sbjct: 475 ESYCSKWKLFSDSKMQLSSITKNGFSVCLDVDTENNN-IVTNSCKCLRGNSSC--DPRSQ 522

BLAST of CsGy5G016230 vs. TAIR10
Match: AT3G26140.1 (Cellulase (glycosyl hydrolase family 5) protein)

HSP 1 Score: 333.2 bits (853), Expect = 2.7e-91
Identity = 181/486 (37.24%), Postives = 283/486 (58.23%), Query Frame = 0

Query: 1   MLAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMFTRYA---NRTVEENFDLLDLKQAKA 60
           ++AEGL+ + + DLA + + + FNCVR T+   + T      N TV ++F  L L    +
Sbjct: 34  VVAEGLSKQSVDDLAKKIMAMGFNCVRFTWPLDLATNETLANNVTVRQSFQSLGLNDDIS 93

Query: 61  GLAQYNPFVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFD 120
           G    NP +++  + EAY+ VV  LG + +MVI DNH+++P WCC  +DGNGFFG+  FD
Sbjct: 94  GFETKNPSMIDLPLIEAYKKVVAKLGNNNVMVILDNHVTKPGWCCGYNDGNGFFGDTFFD 153

Query: 121 PQEWLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLV 180
           P  W+ GL+ +A  F+  + VVGMSLRNE+RG  +N +DW KY+ QG   +H  N  VLV
Sbjct: 154 PTTWIAGLTKIAMTFKGATNVVGMSLRNELRGPKQNVDDWFKYMQQGAEAVHEANPNVLV 213

Query: 181 IVSGLNYDNDLRCLKEKPLNVGTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNG 240
           I+SGL+YD DL  ++ + +N+ T   KLVFE+H YSF+ ++ +   K P N  C  I+  
Sbjct: 214 ILSGLSYDTDLSFVRSRHVNL-TFTRKLVFELHRYSFT-NTNTWSSKNP-NEACGEILKS 273

Query: 241 FIDHAGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQRDLDWALWAWQGSY 300
             +  GF ++    FP+F+SE+G D R  N  +NR++ C     A+ D+DW++W  QGSY
Sbjct: 274 IENGGGFNLR---DFPVFLSEFGIDLRGKNVNDNRYIGCILGWAAENDVDWSIWTLQGSY 333

Query: 301 YFREGQAEPGESFGVLDSNWTQIKNPNFVQKFQLLQTMLHDPNSNASFSYVIYHPQSSQC 360
           Y REG     E +G+LDS+W ++++ +F+Q+  L+ + L  P S +    +++HP +  C
Sbjct: 334 YLREGVVGMSEFYGILDSDWVRVRSQSFLQRLSLILSPLQGPGSQSKVYNLVFHPLTGLC 393

Query: 361 -IQVSNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGEGLEASLSTDTLSQQ- 420
            +Q   D  ++ L  C+    WS+    T + +    L L+++G      LS  + S   
Sbjct: 394 MLQSILDPTKVTLGLCNESQPWSYTPQNT-LTLKDKSLCLESTGPNAPVKLSETSCSSPN 453

Query: 421 -SVWSAISNSKLHLATFTQGGKSLCLQMDSSNSSKVVTNSCICTNG-DPNCLQDTRSQWF 480
            S W  IS S + LA       SLCL +D +N+  ++ ++C C  G D +C  D  SQWF
Sbjct: 454 LSEWETISASNMLLAA-KSTNNSLCLDVDETNN--LMASNCKCVKGEDSSC--DPISQWF 507

BLAST of CsGy5G016230 vs. TAIR10
Match: AT5G17500.1 (Glycosyl hydrolase superfamily protein)

HSP 1 Score: 318.9 bits (816), Expect = 5.2e-87
Identity = 177/488 (36.27%), Postives = 274/488 (56.15%), Query Frame = 0

Query: 1   MLAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMF---TRYANRTVEENFDLLDLKQAKA 60
           ++AEGL+ +P+  ++ +   + FNCVRLT+   +    T   N TV+++F+   L     
Sbjct: 57  VVAEGLSSQPMDSISKKIKDMGFNCVRLTWPLELMINDTLAFNVTVKQSFERYGLDHELQ 116

Query: 61  GLAQYNPFVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFD 120
           G+  +NP+++N  +   ++AVV  LG   +MVI DNH + P WCCS DD + FFG+  F+
Sbjct: 117 GIYTHNPYIVNTPLINVFQAVVYSLGRHDVMVILDNHKTVPGWCCSNDDPDAFFGDPKFN 176

Query: 121 PQEWLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLV 180
           P  W+ GL  +A  F N   VVGMSLRNE+RG+   + DW KY+ +G   +H  N  VLV
Sbjct: 177 PDLWMLGLKKMATIFMNVKNVVGMSLRNELRGYNHTSKDWYKYMQKGAEAVHTSNPNVLV 236

Query: 181 IVSGLNYDNDLRCLKEKPLNVGTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNG 240
           I+SGLN+D DL  LK++P+N+ +   KLV E+H YSF+ D   ++    +N+ C+ + + 
Sbjct: 237 ILSGLNFDADLSFLKDRPVNL-SFKKKLVLELHWYSFT-DGTGQWKSHNVNDFCSQMFSK 296

Query: 241 FIDHAGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQRDLDWALWAWQGSY 300
                GFV+     FPLF+SE+G DQR  +   NR+M+C  A  A++DLDWA+WA  G Y
Sbjct: 297 ERRTGGFVLD--QGFPLFLSEFGTDQRGGDLEGNRYMNCMLAWAAEKDLDWAVWAVTGVY 356

Query: 301 YFREGQAEPGESFGVLDSNWTQIKNPNFVQKFQLLQTMLHDPNSNASFSYVIYHPQSSQC 360
           YFREG+    E++G+LD+NW  + N  ++++  ++Q     P    +    I+HP +  C
Sbjct: 357 YFREGKRGVVEAYGMLDANWHNVHNYTYLRRLSVIQPPHTGPGVKHNHHKKIFHPLTGLC 416

Query: 361 IQVSN--DNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGEGLEASLSTDTLSQQ 420
           +   +     E+ L  C+    WS+++ G           +     G ++ L  +T   +
Sbjct: 417 LVRKSHCHESELTLGPCTKDEPWSYSHGG-----------ILEIRRGHKSCLEGETAVGK 476

Query: 421 SV--------WSAISNSKLHLATFTQGGKSLCLQMDSSNSSKVVTNSCICTNGDPNCLQD 476
           SV           IS +K+HL+  T  G  +CL +DS N+  VV NSC C  GD  C  +
Sbjct: 477 SVKLGRICTKIEQISATKMHLSFNTSDGSLVCLDVDSDNN--VVANSCNCLTGDTTC--E 525

BLAST of CsGy5G016230 vs. TAIR10
Match: AT5G16700.1 (Glycosyl hydrolase superfamily protein)

HSP 1 Score: 259.2 bits (661), Expect = 4.9e-69
Identity = 164/481 (34.10%), Postives = 256/481 (53.22%), Query Frame = 0

Query: 2   LAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMFTR---YANRTVEENFDLLDLKQAKAG 61
           +AEGL+ +PL  ++ + + + FNCVRLT+   + T        TV+++F+ L L +   G
Sbjct: 56  VAEGLSKQPLDSISKKIVSMGFNCVRLTWPLDLVTNDTLALKVTVKQSFESLKLFEDVLG 115

Query: 62  LAQYNPFVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDP 121
           +  +NP +L+  +  A++ VV  LG +G+MVI DNH++ P WCC  +D + FFG  +FDP
Sbjct: 116 IQTHNPKLLHLPLFNAFQEVVSNLGENGVMVILDNHLTTPGWCCGDNDLDAFFGYPHFDP 175

Query: 122 QEWLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLVI 181
             W +GL  +A  FRN + V+GMSLRNE RG  +  + W +++ QG   +H  N ++LVI
Sbjct: 176 LVWAKGLRKMATLFRNFTHVIGMSLRNEPRGARDYPDLWFRHMPQGAEAVHAANPKLLVI 235

Query: 182 VSGLNYDNDLRCLKEKPLNVGTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNGF 241
           +SG+++D +L  L+++ +NV   D KLVFE+H YSFS D    + K   N+ C  I+   
Sbjct: 236 LSGIDFDTNLSFLRDRSVNVSFTD-KLVFELHWYSFS-DGRDSWRKHNSNDFCVKIIEKV 295

Query: 242 IDHAGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQRDLDWALWAWQGSYY 301
             + GF++     FPL +SE+G DQR  + + NR+M+C  A  A+ DLDWA+WA  G YY
Sbjct: 296 THNGGFLL--GRGFPLILSEFGTDQRGGDMSGNRYMNCLVAWAAENDLDWAVWALTGDYY 355

Query: 302 FREGQAEPGESFGVLDSNWTQIKNPNFVQKFQLLQTMLHDPNSNASFSYVIYHPQSSQCI 361
            R G   PG                               PN N     +++HP +  C+
Sbjct: 356 LRTG---PGLR-----------------------------PNKN-----LLFHPSTGLCV 415

Query: 362 --QVSNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASG-EGLEASLSTDTLSQQ 421
               S++   + L  C     W+ N     + ++   + ++A    G +  L   T  + 
Sbjct: 416 TNNPSDNIPTLRLGPCPKSDPWTFNPSEGILWINK--MCVEAPNVVGQKVKLGVGT--KC 475

Query: 422 SVWSAISNSKLHLATFTQGGKSLCLQMDSSNSSKVVTNSCICTNGDPNCLQDTRSQWFEL 477
           S    IS +K+HL+  T  G  LCL +D  ++S VV N C     D +C  D  SQWF++
Sbjct: 476 SKLGQISATKMHLSFKTSNGLLLCLDVDERDNS-VVANRCKFLTMDASC--DPASQWFKV 488

BLAST of CsGy5G016230 vs. Swiss-Prot
Match: sp|C0HLA0|GH5FP_CHAOB (Glycosyl hydrolase 5 family protein OS=Chamaecyparis obtusa OX=13415 PE=1 SV=1)

HSP 1 Score: 346.3 bits (887), Expect = 5.5e-94
Identity = 190/493 (38.54%), Postives = 286/493 (58.01%), Query Frame = 0

Query: 2   LAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMFTR--YANRTVEENFDLLDLKQAKAGL 61
           L EGLN  P+  +A     L FNCVRLTY+ HM TR  Y N TV + F  L+L +A +G+
Sbjct: 62  LPEGLNRLPVATVAHTISSLGFNCVRLTYSIHMLTRTSYTNATVAQTFARLNLTEAASGI 121

Query: 62  AQYNPFVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDPQ 121
              NP +L+     AY  VV  L  +G+MVI DNH+S+P+WCC++DDGNGFFG+  F+P 
Sbjct: 122 EHNNPELLDLGHVAAYHHVVAALSEAGVMVILDNHVSKPKWCCAVDDGNGFFGDRYFNPN 181

Query: 122 EWLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLVIV 181
            W++GL L+A  F N   VV MSLRNE+RG       W++++  G  T+H  N +VLVI+
Sbjct: 182 TWVEGLGLMATYFNNTPNVVAMSLRNELRGNRSTPISWSRHMQWGAATVHKANPKVLVIL 241

Query: 182 SGLNYDNDLRCLKEKPLNVGTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNGFI 241
           SGL +D DL  L   P+ +     K+V+E H YSF     +       N++C N    F 
Sbjct: 242 SGLQFDTDLSFLPVLPVTL-PFKEKIVYEGHWYSFGVPWRTGLP----NDVCKNETGRFK 301

Query: 242 DHAGFVMQGPN--PFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQRDLDWALWAWQGSY 301
            + GFV    N    PLF+SE+G DQR VND +NR+++C  A+LA+ DLDWALW   GSY
Sbjct: 302 SNVGFVTSSANATAAPLFMSEFGIDQRYVNDNDNRYLNCILAYLAEEDLDWALWTMGGSY 361

Query: 302 YFREGQ---AEPGESFGVLDSNWTQIKNPNFVQKFQLLQTMLHDPN-SNASFSYVIYHPQ 361
           Y+R  +    +  E++G  + +W++I+NP+F+ + + +Q  + DP  +   +  +IYHP 
Sbjct: 362 YYRSDKQPVKDFEETYGFFNHDWSRIRNPDFISRLKEIQQPIQDPYLAPGPYYQIIYHPA 421

Query: 362 SSQCIQVSNDNKEIFLTNC-STPTRWSHN-NDGTPIEMSSTGLYLKASGEGLEASLSTD- 421
           S  C++ S     + L +C S  +RW+++ +   PI +  +   +   G GL A ++ + 
Sbjct: 422 SGLCVE-SGIGNTVHLGSCQSVRSRWNYDASVKGPIGLMGSSSCISTQGNGLPAIMTENC 481

Query: 422 TLSQQSVWSAISNSKLHLATFTQG--GKSLCLQMDSSNSSKVVTNSCICTNGDPNCLQ-- 480
           +    ++WS +S+++L L T   G  GK   + +D S S  + TN CIC   D +C    
Sbjct: 482 SAPNNTLWSTVSSAQLQLGTRVLGKDGKEKWMCLDGSKSPLISTNECICIT-DSHCYPKL 541

BLAST of CsGy5G016230 vs. Swiss-Prot
Match: sp|P19487|GUNA_XANCP (Major extracellular endoglucanase OS=Xanthomonas campestris pv. campestris (strain ATCC 33913 / DSM 3586 / NCPPB 528 / LMG 568 / P 25) OX=190485 GN=engXCA PE=1 SV=2)

HSP 1 Score: 75.9 bits (185), Expect = 1.4e-12
Identity = 83/354 (23.45%), Postives = 138/354 (38.98%), Query Frame = 0

Query: 5   GLNHRPLKDLADEAIKLRFNCVRLTYATHMF---TRYANRTVEENFDLLDLKQAKAGLAQ 64
           GL  R  KD+  +   L FN VRL +        T  A+     N DL  L   +     
Sbjct: 61  GLWARNWKDMIVQMQGLGFNAVRLPFCPATLRSDTMPASIDYSRNADLQGLTSLQ----- 120

Query: 65  YNPFVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDPQEW 124
               +L+K IAE          A G+ V+ D+H      C  + +    +   ++   +W
Sbjct: 121 ----ILDKVIAE--------FNARGMYVLLDHHTPD---CAGISE---LWYTGSYTEAQW 180

Query: 125 LQGLSLVAQRFRNKSTVVGMSLRNEIRGFM-----ENANDWNKYITQGVTTIHNINSEVL 184
           L  L  VA R++N   V+G+ L+NE  G         A DWNK   +G   +  +  + L
Sbjct: 181 LADLRFVANRYKNVPYVLGLDLKNEPHGAATWGTGNAATDWNKAAERGSAAVLAVAPKWL 240

Query: 185 VIVSGLN------------YDNDLRCLKEKPLNVGTLDNKLVFEVHLYSFSGDSESKFVK 244
           + V G+             +  +L+ L   PLN+    N+L+   H+Y         FV+
Sbjct: 241 IAVEGITDNPVCSTNGGIFWGGNLQPLACTPLNIPA--NRLLLAPHVY-----GPDVFVQ 300

Query: 245 QPLN--NICANIMNGFIDHAGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLA 304
              N  N   N+   +  H G   Q      L + E+G    E +  +  +      +L 
Sbjct: 301 SYFNDSNFPNNMPAIWERHFG---QFAGTHALLLGEFGGKYGEGDARDKTWQDALVKYLR 360

Query: 305 QRDLDWAL-WAWQGSYYFREGQAEPGESFGVLDSNWTQIKNPNFVQKFQLLQTM 336
            + ++    W+W             G++ G+L  +WT ++      K  LL+T+
Sbjct: 361 SKGINQGFYWSW---------NPNSGDTGGILRDDWTSVRQ----DKMTLLRTL 368

BLAST of CsGy5G016230 vs. Swiss-Prot
Match: sp|P54583|GUN1_ACIC1 (Endoglucanase E1 OS=Acidothermus cellulolyticus (strain ATCC 43068 / 11B) OX=351607 GN=Acel_0614 PE=1 SV=1)

HSP 1 Score: 75.5 bits (184), Expect = 1.8e-12
Identity = 75/338 (22.19%), Postives = 136/338 (40.24%), Query Frame = 0

Query: 2   LAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLKQAKAGLAQ 61
           +  GL  R  + + D+   L +N +RL Y+  +       T+  + +   + Q   GL  
Sbjct: 78  VVHGLWSRDYRSMLDQIKSLGYNTIRLPYSDDIL---KPGTMPNSINFYQMNQDLQGLTS 137

Query: 62  YNPFVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDPQEW 121
               V++K +A A        G  GL +I D H    R  CS    +  +  ++     W
Sbjct: 138 LQ--VMDKIVAYA--------GQIGLRIILDRH----RPDCS--GQSALWYTSSVSEATW 197

Query: 122 LQGLSLVAQRFRNKSTVVGMSLRNEIR-----GFMENANDWNKYITQGVTTIHNINSEVL 181
           +  L  +AQR++   TVVG  L NE       G  + + DW     +    + ++N  +L
Sbjct: 198 ISDLQALAQRYKGNPTVVGFDLHNEPHDPACWGCGDPSIDWRLAAERAGNAVLSVNPNLL 257

Query: 182 VIVSGL-NYDND-------LRCLKEKPLNVGTLDNKLVFEVHLYSFSGDSESKFVKQPLN 241
           + V G+ +Y+ D       L+   + P+ V  + N+LV+  H Y+ S   ++ F      
Sbjct: 258 IFVEGVQSYNGDSYWWGGNLQGAGQYPV-VLNVPNRLVYSAHDYATSVYPQTWFSDPTFP 317

Query: 242 NICANIMNGFIDHAGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCF------TAHLA 301
           N    I N    + G++    N  P+++ E+G   +   D    ++         TA   
Sbjct: 318 NNMPGIWN---KNWGYLF-NQNIAPVWLGEFGTTLQSTTD--QTWLKTLVQYLRPTAQYG 377

Query: 302 QRDLDWALWAWQGSYYFREGQAEPGESFGVLDSNWTQI 321
                W  W+W           + G++ G+L  +W  +
Sbjct: 378 ADSFQWTFWSW---------NPDSGDTGGILKDDWQTV 380

BLAST of CsGy5G016230 vs. Swiss-Prot
Match: sp|P23548|GUN_PAEPO (Endoglucanase OS=Paenibacillus polymyxa OX=1406 PE=3 SV=2)

HSP 1 Score: 70.9 bits (172), Expect = 4.5e-11
Identity = 68/339 (20.06%), Postives = 138/339 (40.71%), Query Frame = 0

Query: 5   GLNHRPLKDLADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLKQAKAGLAQYNP 64
           GL  R + D+ D+  K  +N +RL Y+  +F   +        D +D  +        NP
Sbjct: 73  GLWSRSMDDMLDQVKKEGYNLIRLPYSNQLFDSSSRP------DSIDYHK--------NP 132

Query: 65  FVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNG----FFGNNNFDPQE 124
            ++     +  + +++  G  G+ +I D H            G+G     +  + +    
Sbjct: 133 DLVGLNPIQIMDKLIEKAGQRGIQIILDRHR----------PGSGGQSELWYTSQYPESR 192

Query: 125 WLQGLSLVAQRFRNKSTVVGMSLRNEIRGFME----NAN-DWNKYITQGVTTIHNINSEV 184
           W+    ++A R++N  TV+G  L NE  G       NA+ DW     +    I ++N   
Sbjct: 193 WISDWKMLADRYKNNPTVIGADLHNEPHGQASWGTGNASTDWRLAAQRAGNAILSVNPNW 252

Query: 185 LVIVSGLNYD-----------NDLRCLKEKPLNVGTLDNKLVFEVHLYSFSGDSESKFVK 244
           L++V G++++            +L  +   P+ V  + N++V+  H Y   G S   +  
Sbjct: 253 LILVEGVDHNVQGNNSQYWWGGNLTGVANYPV-VLDVPNRVVYSPHDYG-PGVSSQPWFN 312

Query: 245 QPLNNICANIMNGFIDHAGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQR 304
            P     +N+   +    G++ +  N  P+ V E+G    +++  E ++ +    ++   
Sbjct: 313 DPA--FPSNLPAIWDQTWGYISK-QNIAPVLVGEFGGRNVDLSCPEGKWQNALVHYIGAN 372

Query: 305 DLDWALWAWQGSYYFREGQAEPGESFGVLDSNWTQIKNP 324
           +L +  W+              G++ G+L  +WT    P
Sbjct: 373 NLYFTYWSL---------NPNSGDTGGLLLDDWTTWNRP 373

BLAST of CsGy5G016230 vs. TrEMBL
Match: tr|A0A0A0K853|A0A0A0K853_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G028440 PE=3 SV=1)

HSP 1 Score: 875.5 bits (2261), Expect = 5.3e-251
Identity = 422/482 (87.55%), Postives = 447/482 (92.74%), Query Frame = 0

Query: 1   MLAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLKQAKAGLA 60
           ML EGLNHRPLK+LADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDL+QAKAGLA
Sbjct: 57  MLIEGLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLEQAKAGLA 116

Query: 61  QYNPFVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDPQE 120
           QYNPFVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGN  FDPQE
Sbjct: 117 QYNPFVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNRYFDPQE 176

Query: 121 WLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLVIVS 180
           WLQGLSLVAQRF NKSTVVGMSLRNE+RG MENANDWN Y+TQGVTTIH IN  VLVIVS
Sbjct: 177 WLQGLSLVAQRFNNKSTVVGMSLRNELRGMMENANDWNNYVTQGVTTIHKINPAVLVIVS 236

Query: 181 GLNYDNDLRCLKEKPLNVGTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNGFID 240
           GLNYDNDLRCLK+KPLNV TLDNKL FEVHLYSFSGDSESKFV+QPLNNICA IM+ FID
Sbjct: 237 GLNYDNDLRCLKDKPLNVSTLDNKLAFEVHLYSFSGDSESKFVQQPLNNICAKIMHEFID 296

Query: 241 HAGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQRDLDWALWAWQGSYYFR 300
           HA FV++GPNPFPLFVSEYGYDQREV+DAENRFMSCFTAHLAQ+DLDWALW WQGSYY+R
Sbjct: 297 HAEFVIEGPNPFPLFVSEYGYDQREVDDAENRFMSCFTAHLAQKDLDWALWTWQGSYYYR 356

Query: 301 EGQAEPGESFGVLDSNWTQIKNPNFVQKFQLLQTMLHDPNSNASFSYVIYHPQSSQCIQV 360
           EGQAE  E+FGVLDSNWTQIKNPNFVQKFQLLQTML DP SNASFSYVIYH QS QCI+V
Sbjct: 357 EGQAELAETFGVLDSNWTQIKNPNFVQKFQLLQTMLQDPYSNASFSYVIYHVQSGQCIEV 416

Query: 361 SNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGEGLEASLSTDTLSQQSVWSA 420
           SNDNKEIFLTNCST +RWSH+ND TPI+MSSTGL LKASGEGLEASLSTD + +QS+WSA
Sbjct: 417 SNDNKEIFLTNCSTSSRWSHDNDSTPIKMSSTGLCLKASGEGLEASLSTDCIGKQSLWSA 476

Query: 421 ISNSKLHLATFTQGGKSLCLQ-MDSSNSSKVVTNSCICTNGDPNCLQDTRSQWFELVETN 480
           ISNS LHL T T+ GKSLCLQ ++SSNSSK+VTNSCICT  DP CLQDT+SQWFELV TN
Sbjct: 477 ISNSNLHLGTVTEDGKSLCLQIIESSNSSKIVTNSCICTTNDPTCLQDTQSQWFELVATN 536

Query: 481 TL 482
           TL
Sbjct: 537 TL 538

BLAST of CsGy5G016230 vs. TrEMBL
Match: tr|A0A1S3BDI2|A0A1S3BDI2_CUCME (major extracellular endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103488703 PE=3 SV=1)

HSP 1 Score: 818.9 bits (2114), Expect = 5.9e-234
Identity = 394/448 (87.95%), Postives = 416/448 (92.86%), Query Frame = 0

Query: 34  MFTRYANRTVEENFDLLDLKQAKAGLAQYNPFVLNKTIAEAYEAVVDVLGASGLMVIADN 93
           MFTRYANRTVEENFDLLDL QAKAGL QYNPFVLNKTIAEAYEAVVDVLGASGLMVIADN
Sbjct: 1   MFTRYANRTVEENFDLLDLGQAKAGLTQYNPFVLNKTIAEAYEAVVDVLGASGLMVIADN 60

Query: 94  HMSQPRWCCSLDDGNGFFGNNNFDPQEWLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMEN 153
           HMSQPRWCCSLDDGNGFFGN  FDPQEWLQGLSLVAQRF NKSTVVGMSLRNEIRG MEN
Sbjct: 61  HMSQPRWCCSLDDGNGFFGNRYFDPQEWLQGLSLVAQRFNNKSTVVGMSLRNEIRGMMEN 120

Query: 154 ANDWNKYITQGVTTIHNINSEVLVIVSGLNYDNDLRCLKEKPLNVGTLDNKLVFEVHLYS 213
           ANDWN Y+TQGVTTIHNIN EVLVIV GLNYDNDLRCLKEKPLNV TLDNKLVFEVHLYS
Sbjct: 121 ANDWNHYVTQGVTTIHNINPEVLVIVGGLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYS 180

Query: 214 FSGDSESKFVKQPLNNICANIMNGFIDHAGFVMQGPNPFPLFVSEYGYDQREVNDAENRF 273
           FSG SESKFV+QPLNNICA I+N FIDHA FV++G NPFPLFVSEYGYDQREV+DAENRF
Sbjct: 181 FSGASESKFVQQPLNNICAKIINEFIDHAEFVIEGSNPFPLFVSEYGYDQREVDDAENRF 240

Query: 274 MSCFTAHLAQRDLDWALWAWQGSYYFREGQAEPGESFGVLDSNWTQIKNPNFVQKFQLLQ 333
           MSCFTAHLAQ+DLDWALW WQGSYY+REGQAE  E+FGVL+SNWTQIKNPNFVQKFQLLQ
Sbjct: 241 MSCFTAHLAQKDLDWALWTWQGSYYYREGQAELPETFGVLESNWTQIKNPNFVQKFQLLQ 300

Query: 334 TMLHDPNSNASFSYVIYHPQSSQCIQVSNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTG 393
           TML DPNSNASFSYVIYHPQS QCI+VSNDNK+IFLTNCST +RWSH+ND TPI+MS+TG
Sbjct: 301 TMLQDPNSNASFSYVIYHPQSGQCIEVSNDNKDIFLTNCSTSSRWSHDNDSTPIKMSNTG 360

Query: 394 LYLKASGEGLEASLSTDTLSQQSVWSAISNSKLHLATFTQGGKSLCLQMDSSNSSKVVTN 453
           L LKASGEGL ASLS D L +QSVWSAISNSKLHLAT T+ GKSLCLQ++SSNSSK+VTN
Sbjct: 361 LCLKASGEGLAASLSNDCLGKQSVWSAISNSKLHLATVTENGKSLCLQIESSNSSKIVTN 420

Query: 454 SCICTNGDPNCLQDTRSQWFELVETNTL 482
           SCICT  DP CLQDT+SQWFELVETNTL
Sbjct: 421 SCICTTDDPTCLQDTQSQWFELVETNTL 448

BLAST of CsGy5G016230 vs. TrEMBL
Match: tr|A0A1S3CTF8|A0A1S3CTF8_CUCME (major extracellular endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103504686 PE=3 SV=1)

HSP 1 Score: 691.4 bits (1783), Expect = 1.4e-195
Identity = 322/483 (66.67%), Postives = 391/483 (80.95%), Query Frame = 0

Query: 1   MLAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLKQAKAGLA 60
           ML EGL+ RPLKDLA+E ++L+FNCVRLTYATHMFTRYANRTVEENFDLLDL+ +K GLA
Sbjct: 57  MLIEGLDRRPLKDLANEVMRLKFNCVRLTYATHMFTRYANRTVEENFDLLDLRASKVGLA 116

Query: 61  QYNPFVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDPQE 120
            +NPFVLN TI EAYEAVVDVLG SGLMVIADNH+SQPRWCCSL+DGNGFFG+  FD +E
Sbjct: 117 LHNPFVLNMTIFEAYEAVVDVLGTSGLMVIADNHISQPRWCCSLEDGNGFFGDRYFDSEE 176

Query: 121 WLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLVIVS 180
           WL+GL LVA+RF NKS VV MSLRNE+RG    + DWNKY+TQG TTIHNIN  +LVI+S
Sbjct: 177 WLEGLRLVARRFYNKSAVVAMSLRNELRGASSKSKDWNKYVTQGATTIHNINPNILVIIS 236

Query: 181 GLNYDNDLRCLKEKPLNVGTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNGFID 240
           GLN+DNDLRC ++ PL +  L NKLVFEVHLYSFSG+S+SKF+  PLN IC+ I+NGF+ 
Sbjct: 237 GLNFDNDLRCQRQYPLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICSKIINGFVQ 296

Query: 241 HAGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQRDLDWALWAWQGSYYFR 300
            A FVM+G    PLFVSE+G DQ  VN+A++RF+SCF+AHL ++DLDWALW WQGSYY+R
Sbjct: 297 RAEFVMEGAEAVPLFVSEFGLDQTGVNEADDRFLSCFSAHLVEKDLDWALWGWQGSYYYR 356

Query: 301 EGQAEPGESFGVLDSNWTQIKNPNFVQKFQLLQTMLHDPNSNASFSYVIYHPQSSQCIQV 360
           +G+ E  E FGVL+ NW+ ++NP F Q FQLLQTML DPNSN+S +Y++YHPQS QC+QV
Sbjct: 357 QGKVELEEVFGVLNYNWSDVRNPRFSQMFQLLQTMLQDPNSNSSNTYLMYHPQSGQCVQV 416

Query: 361 SN-DNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGEGLEASLSTDTLSQQSVWS 420
            +   KEIFL NCS  + WS+  DGTPI ++ST   LKA+G GL  SLS D   +QSVW+
Sbjct: 417 HDMKQKEIFLNNCSNASHWSYEGDGTPIMLASTNFCLKANGNGLPPSLSRDCFGEQSVWT 476

Query: 421 AISNSKLHLATFT-QGGKSLCLQMDSSNSSKVVTNSCICTNGDPNCLQDTRSQWFELVET 480
           AIS+SKLHLAT T QG   +CL+ +SSNSS+++  SC+C   D NCLQDT++QWF+LV T
Sbjct: 477 AISDSKLHLATLTKQGNNGMCLEKESSNSSRILMRSCVCVGSDSNCLQDTQAQWFQLVVT 536

Query: 481 NTL 482
           NTL
Sbjct: 537 NTL 539

BLAST of CsGy5G016230 vs. TrEMBL
Match: tr|A0A0A0L644|A0A0A0L644_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G186670 PE=4 SV=1)

HSP 1 Score: 656.4 bits (1692), Expect = 5.1e-185
Identity = 337/448 (75.22%), Postives = 344/448 (76.79%), Query Frame = 0

Query: 34  MFTRYANRTVEENFDLLDLKQAKAGLAQYNPFVLNKTIAEAYEAVVDVLGASGLMVIADN 93
           M TRYANRT+EENFDLLDLKQAKAGLAQYNPFVLNKT+AEAYEAVVDVLGASGLMVIADN
Sbjct: 1   MLTRYANRTIEENFDLLDLKQAKAGLAQYNPFVLNKTVAEAYEAVVDVLGASGLMVIADN 60

Query: 94  HMSQPRWCCSLDDGNGFFGNNNFDPQEWLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMEN 153
           HMSQPRWCCSLDDGNGFFGNNNFDPQEWLQGLSLVAQRFRNKSTV               
Sbjct: 61  HMSQPRWCCSLDDGNGFFGNNNFDPQEWLQGLSLVAQRFRNKSTVY-------------- 120

Query: 154 ANDWNKYITQGVTTIHNINSEVLVIVSGLNYDNDLRCLKEKPLNVGTLDNKLVFEVHLYS 213
                                                                       
Sbjct: 121 ------------------------------------------------------------ 180

Query: 214 FSGDSESKFVKQPLNNICANIMNGFIDHAGFVMQGPNPFPLFVSEYGYDQREVNDAENRF 273
               SESKFVKQPLNNICANIMNGFIDHAGFVMQGPNPFPLFV+                
Sbjct: 181 ----SESKFVKQPLNNICANIMNGFIDHAGFVMQGPNPFPLFVT---------------- 240

Query: 274 MSCFTAHLAQRDLDWALWAWQGSYYFREGQAEPGESFGVLDSNWTQIKNPNFVQKFQLLQ 333
                 HLAQRDLDWALWAWQGSYYFREGQAEPGESFGVLDSNWTQIKNPNFV+KFQLLQ
Sbjct: 241 ------HLAQRDLDWALWAWQGSYYFREGQAEPGESFGVLDSNWTQIKNPNFVRKFQLLQ 300

Query: 334 TMLHDPNSNASFSYVIYHPQSSQCIQVSNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTG 393
           TML DPNSNASFSYVIYHPQSSQCIQVSNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTG
Sbjct: 301 TMLQDPNSNASFSYVIYHPQSSQCIQVSNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTG 348

Query: 394 LYLKASGEGLEASLSTDTLSQQSVWSAISNSKLHLATFTQGGKSLCLQMDSSNSSKVVTN 453
           LYLKASG+GLEASLS+DTLSQQSVWSAISNSKLHLATFTQGGKSLCLQ+DSSNSSKVVTN
Sbjct: 361 LYLKASGKGLEASLSSDTLSQQSVWSAISNSKLHLATFTQGGKSLCLQIDSSNSSKVVTN 348

Query: 454 SCICTNGDPNCLQDTRSQWFELVETNTL 482
           SCICTNGDPNCLQDTRSQWFELV TNTL
Sbjct: 421 SCICTNGDPNCLQDTRSQWFELVGTNTL 348

BLAST of CsGy5G016230 vs. TrEMBL
Match: tr|A0A1S3CT43|A0A1S3CT43_CUCME (endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103504654 PE=3 SV=1)

HSP 1 Score: 533.5 bits (1373), Expect = 4.9e-148
Identity = 258/480 (53.75%), Postives = 344/480 (71.67%), Query Frame = 0

Query: 1   MLAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLKQAKAGLA 60
           ML EGL+ RPL D+A    KLRFNCVRLTY+ HMFTR+AN TV+++F+  D+K A AG+A
Sbjct: 59  MLVEGLHRRPLDDIAALVAKLRFNCVRLTYSIHMFTRHANLTVKQSFENFDMKDAMAGIA 118

Query: 61  QYNPFVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDPQE 120
           Q NP +LN T+ EAY AVVD L A G+MV++DNH+SQPRWCC  +DGNGFFG+  FDPQE
Sbjct: 119 QNNPSILNLTLVEAYGAVVDSLVAHGIMVVSDNHISQPRWCCDNNDGNGFFGDRYFDPQE 178

Query: 121 WLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLVIVS 180
           WLQG+SL AQ  ++K+ VV MSLRNE+RG  +N   W +Y++QG   IH IN   LV+VS
Sbjct: 179 WLQGISLAAQSLKSKAQVVAMSLRNELRGPNQNVEKWFQYMSQGAKLIHQINPNALVVVS 238

Query: 181 GLNYDNDLRCLKEKPLNVGTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNGFID 240
           GL+YD DL  LK + +    LDNKLVFE HLYSF+ +    ++ +PLN  CA+I  GF D
Sbjct: 239 GLSYDTDLSFLKNRSMGF-NLDNKLVFEAHLYSFTNNMRDFWMSKPLNTFCASINQGFED 298

Query: 241 HAGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQRDLDWALWAWQGSYYFR 300
            AGF+++G NP PLFVSE+G DQ   N+ +NRF+SCF ++L + D DW LWA QGSYY++
Sbjct: 299 RAGFLVRGQNPIPLFVSEFGIDQTGTNEGQNRFLSCFFSYLTENDFDWGLWALQGSYYYK 358

Query: 301 EGQAEPGESFGVLDSNWTQIKNPN-FVQKFQLLQTMLHDPNSNASFSYVIYHPQSSQCIQ 360
            G     E+FGVLDSN+T+ KN   F+Q+FQL+QT L DP+SN + ++++YHP S  C++
Sbjct: 359 VGVKNAEENFGVLDSNFTKAKNSKLFLQRFQLMQTKLQDPSSNFTTTFIMYHPLSGGCVR 418

Query: 361 VSNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGEGLEASLSTDTLSQQSVWS 420
           + N   ++ +++C T  RWSH  DG PI+++ + L LKA G GL   LS D  SQQS+W 
Sbjct: 419 M-NKKYQLGISSCKTSNRWSHEQDGAPIKLAGSILCLKAIGVGLPPILSQDCSSQQSIWR 478

Query: 421 AISNSKLHLATFTQGGKSLCLQMDSSNSSKVVTNSCICTNGDPNCLQDTRSQWFELVETN 480
             SN+KL LAT  + G++LCLQ  +S+S ++VTN C+CT  D  C +D +SQWF LV +N
Sbjct: 479 YASNAKLQLATVDEQGQALCLQR-ASHSHQIVTNKCLCTI-DSQCQEDPQSQWFTLVPSN 534

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_011658389.18.0e-25187.55PREDICTED: uncharacterized protein LOC101207450 [Cucumis sativus] >KGN45940.1 hy... [more]
XP_022932816.14.0e-25086.90uncharacterized protein LOC111439277 [Cucurbita moschata][more]
XP_022933313.14.0e-25086.90uncharacterized protein LOC111440529 [Cucurbita moschata][more]
XP_022995752.11.5e-24986.90uncharacterized protein LOC111491191 [Cucurbita maxima][more]
XP_022930167.12.9e-24886.28uncharacterized protein LOC111436676 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
AT1G13130.16.4e-10139.42Cellulase (glycosyl hydrolase family 5) protein[more]
AT3G26130.13.4e-9440.41Cellulase (glycosyl hydrolase family 5) protein[more]
AT3G26140.12.7e-9137.24Cellulase (glycosyl hydrolase family 5) protein[more]
AT5G17500.15.2e-8736.27Glycosyl hydrolase superfamily protein[more]
AT5G16700.14.9e-6934.10Glycosyl hydrolase superfamily protein[more]
Match NameE-valueIdentityDescription
sp|C0HLA0|GH5FP_CHAOB5.5e-9438.54Glycosyl hydrolase 5 family protein OS=Chamaecyparis obtusa OX=13415 PE=1 SV=1[more]
sp|P19487|GUNA_XANCP1.4e-1223.45Major extracellular endoglucanase OS=Xanthomonas campestris pv. campestris (stra... [more]
sp|P54583|GUN1_ACIC11.8e-1222.19Endoglucanase E1 OS=Acidothermus cellulolyticus (strain ATCC 43068 / 11B) OX=351... [more]
sp|P23548|GUN_PAEPO4.5e-1120.06Endoglucanase OS=Paenibacillus polymyxa OX=1406 PE=3 SV=2[more]
Match NameE-valueIdentityDescription
tr|A0A0A0K853|A0A0A0K853_CUCSA5.3e-25187.55Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G028440 PE=3 SV=1[more]
tr|A0A1S3BDI2|A0A1S3BDI2_CUCME5.9e-23487.95major extracellular endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103488703 P... [more]
tr|A0A1S3CTF8|A0A1S3CTF8_CUCME1.4e-19566.67major extracellular endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103504686 P... [more]
tr|A0A0A0L644|A0A0A0L644_CUCSA5.1e-18575.22Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G186670 PE=4 SV=1[more]
tr|A0A1S3CT43|A0A1S3CT43_CUCME4.9e-14853.75endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103504654 PE=3 SV=1[more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
Vocabulary: Biological Process
TermDefinition
GO:0005975carbohydrate metabolic process
Vocabulary: INTERPRO
TermDefinition
IPR035992Ricin_B-like_lectins
IPR017853Glycoside_hydrolase_SF
IPR001547Glyco_hydro_5
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsGy5G016230.1CsGy5G016230.1mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001547Glycoside hydrolase, family 5PFAMPF00150Cellulasecoord: 10..294
e-value: 2.1E-24
score: 86.3
NoneNo IPR availableGENE3DG3DSA:3.20.20.80coord: 1..337
e-value: 1.1E-61
score: 210.9
NoneNo IPR availablePANTHERPTHR31263FAMILY NOT NAMEDcoord: 2..477
NoneNo IPR availablePANTHERPTHR31263:SF10SUBFAMILY NOT NAMEDcoord: 2..477
IPR017853Glycoside hydrolase superfamilySUPERFAMILYSSF51445(Trans)glycosidasescoord: 4..329
IPR035992Ricin B-like lectinsSUPERFAMILYSSF50370Ricin B-like lectinscoord: 343..456

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
CsGy5G016230CsaV3_5G025540Cucumber (Chinese Long) v3cgybcucB232
The following gene(s) are paralogous to this gene:

None