CsaV3_3G017720 (gene) Cucumber (Chinese Long) v3

NameCsaV3_3G017720
Typegene
OrganismCucumis sativus (Cucumber (Chinese Long) v3)
DescriptionCellulase (glycosyl hydrolase family 5) protein
Locationchr3 : 13299971 .. 13301628 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTGGCAGAAGGCCTCAACCATAGGCCATTAAAAGACCTAGCAGACGAGGCGATCAAGTTAAGATTTAATTGTGTACGACTCACATATGCAACTCATATGTTAACTCGTTATGCAAATAGGACGATTGAAGAAAACTTTGATCTTCTTGATTTAAAACAAGCAAAAGCTGGATTGGCTCAATATAATCCTTTTGTGTTGAATAAGACTGTTGCTGAAGCATATGAAGCTGTTGTTGATGTGTTGGGGGCAAGTGGTCTAATGGTCATCGCTGACAATCACATGAGCCAACCAAGATGGTGTTGCTCTCTTGACGATGGCAATGGCTTCTTTGGAAACAACAATTTTGACCCTCAAGAATGGCTACAAGGGCTTAGCTTGGTTGCTCAACGCTTTCGCAACAAATCAACGGTATGTAATTCACATCATAAGATTTTCTTAGATTCTTAAAACTAAACTAAAACTTTTAAGAACTTGTATAATATTTCATTAATTAACAATTTTTTTATTGGTTGTTTAGGTGGTAGGAATGAGTTTACGAAATGAGATACGGGGCTTTATGGAAAATGCAAACGATTGGAACAAATATATAACTCAAGGGGTAACCACGATTCATAACATAAATTCGGAAGTCTTAGTCATTGTTTCAGGGTTAAATTATGACAACGATCTCCGATGCTTAAAGGAAAAACCTTTGAATGTTAGCACCTTAGACAATAAATTGGTTTTCGAGGTACACTTGTATTCTTTCAGTGGAGATTCCGAGAGCAAGTTTGTAAAACAACCATTGAACAATATATGTGCAAATATTATGAATGGATTTATAGACCATGTTGGGTTTGTAATGCAAGGACCAAACCCGTTTCCATTATTTGTTAGTGAATATGGATATGATCAAAGAGAAGTTAACGATGCTGAAAACCGATTCATGAGTTGCTTCACAGCCCATCTTACACAAAGAGATTTAGATTGGGCATTGTGGGCTTGGCAAGGTAGCTATTATTTTAGAGAAGGTCAAGCAGAGCCTGGAGAAAGTTTCGGAGTGCTCGACTCTAATTGGACTCAAATTAAGAACCCTAACTTTGTACGAAAGTTTCAACTGTTGCAGACCATGTTGCAAGGTAACTATATAAATGATTATTACAAGTACATGCAATGACCTCTATAAATATATAATATAAAGTAACAATTTATGTATAACATCTTTTTATGATATATATGCAGATCCAAATTCCAATGCATCGTTCTCATATGTTATATATCATCCACAAAGTAGCCAATGCATCCAAGTCTCGAATGACAACAAAGAAATTTTTCTCACCAATTGCTCCACCCCAACTCGATGGAGTCATAACAATGATGGCACTCCAATTGAGATGTCAAGCACTGGTTTATACTTGAAGGCTAGTGGGAAAGGCCTTGAGGCATCTCTTTCATCTGATACCTTAAGCCAACAAAGTGTTTGGAGTGCCATTTCAAATTCTAAACTTCATTTGGCCACCTTCACTCAAGGTGGAAAGAGCCTTTGTTTGCAAATTGATAGCTCTAACTCTTCAAAAGTTGTGACCAACTCTTGCATTTGCACCAATGGTGATCCAAATTGCCTCCAAGACACCCGAAGCCAATGGTTTGAACTCGTTGGAACCAACACATTATGA

mRNA sequence

ATGTTGGCAGAAGGCCTCAACCATAGGCCATTAAAAGACCTAGCAGACGAGGCGATCAAGTTAAGATTTAATTGTGTACGACTCACATATGCAACTCATATGTTAACTCGTTATGCAAATAGGACGATTGAAGAAAACTTTGATCTTCTTGATTTAAAACAAGCAAAAGCTGGATTGGCTCAATATAATCCTTTTGTGTTGAATAAGACTGTTGCTGAAGCATATGAAGCTGTTGTTGATGTGTTGGGGGCAAGTGGTCTAATGGTCATCGCTGACAATCACATGAGCCAACCAAGATGGTGTTGCTCTCTTGACGATGGCAATGGCTTCTTTGGAAACAACAATTTTGACCCTCAAGAATGGCTACAAGGGCTTAGCTTGGTTGCTCAACGCTTTCGCAACAAATCAACGGTGGTAGGAATGAGTTTACGAAATGAGATACGGGGCTTTATGGAAAATGCAAACGATTGGAACAAATATATAACTCAAGGGGTAACCACGATTCATAACATAAATTCGGAAGTCTTAGTCATTGTTTCAGGGTTAAATTATGACAACGATCTCCGATGCTTAAAGGAAAAACCTTTGAATGTTAGCACCTTAGACAATAAATTGGTTTTCGAGGTACACTTGTATTCTTTCAGTGGAGATTCCGAGAGCAAGTTTGTAAAACAACCATTGAACAATATATGTGCAAATATTATGAATGGATTTATAGACCATGTTGGGTTTGTAATGCAAGGACCAAACCCGTTTCCATTATTTGTTAGTGAATATGGATATGATCAAAGAGAAGTTAACGATGCTGAAAACCGATTCATGAGTTGCTTCACAGCCCATCTTACACAAAGAGATTTAGATTGGGCATTGTGGGCTTGGCAAGGTAGCTATTATTTTAGAGAAGGTCAAGCAGAGCCTGGAGAAAGTTTCGGAGTGCTCGACTCTAATTGGACTCAAATTAAGAACCCTAACTTTGTACGAAAGTTTCAACTGTTGCAGACCATGTTGCAAGATCCAAATTCCAATGCATCGTTCTCATATGTTATATATCATCCACAAAGTAGCCAATGCATCCAAGTCTCGAATGACAACAAAGAAATTTTTCTCACCAATTGCTCCACCCCAACTCGATGGAGTCATAACAATGATGGCACTCCAATTGAGATGTCAAGCACTGGTTTATACTTGAAGGCTAGTGGGAAAGGCCTTGAGGCATCTCTTTCATCTGATACCTTAAGCCAACAAAGTGTTTGGAGTGCCATTTCAAATTCTAAACTTCATTTGGCCACCTTCACTCAAGGTGGAAAGAGCCTTTGTTTGCAAATTGATAGCTCTAACTCTTCAAAAGTTGTGACCAACTCTTGCATTTGCACCAATGGTGATCCAAATTGCCTCCAAGACACCCGAAGCCAATGGTTTGAACTCGTTGGAACCAACACATTATGA

Coding sequence (CDS)

ATGTTGGCAGAAGGCCTCAACCATAGGCCATTAAAAGACCTAGCAGACGAGGCGATCAAGTTAAGATTTAATTGTGTACGACTCACATATGCAACTCATATGTTAACTCGTTATGCAAATAGGACGATTGAAGAAAACTTTGATCTTCTTGATTTAAAACAAGCAAAAGCTGGATTGGCTCAATATAATCCTTTTGTGTTGAATAAGACTGTTGCTGAAGCATATGAAGCTGTTGTTGATGTGTTGGGGGCAAGTGGTCTAATGGTCATCGCTGACAATCACATGAGCCAACCAAGATGGTGTTGCTCTCTTGACGATGGCAATGGCTTCTTTGGAAACAACAATTTTGACCCTCAAGAATGGCTACAAGGGCTTAGCTTGGTTGCTCAACGCTTTCGCAACAAATCAACGGTGGTAGGAATGAGTTTACGAAATGAGATACGGGGCTTTATGGAAAATGCAAACGATTGGAACAAATATATAACTCAAGGGGTAACCACGATTCATAACATAAATTCGGAAGTCTTAGTCATTGTTTCAGGGTTAAATTATGACAACGATCTCCGATGCTTAAAGGAAAAACCTTTGAATGTTAGCACCTTAGACAATAAATTGGTTTTCGAGGTACACTTGTATTCTTTCAGTGGAGATTCCGAGAGCAAGTTTGTAAAACAACCATTGAACAATATATGTGCAAATATTATGAATGGATTTATAGACCATGTTGGGTTTGTAATGCAAGGACCAAACCCGTTTCCATTATTTGTTAGTGAATATGGATATGATCAAAGAGAAGTTAACGATGCTGAAAACCGATTCATGAGTTGCTTCACAGCCCATCTTACACAAAGAGATTTAGATTGGGCATTGTGGGCTTGGCAAGGTAGCTATTATTTTAGAGAAGGTCAAGCAGAGCCTGGAGAAAGTTTCGGAGTGCTCGACTCTAATTGGACTCAAATTAAGAACCCTAACTTTGTACGAAAGTTTCAACTGTTGCAGACCATGTTGCAAGATCCAAATTCCAATGCATCGTTCTCATATGTTATATATCATCCACAAAGTAGCCAATGCATCCAAGTCTCGAATGACAACAAAGAAATTTTTCTCACCAATTGCTCCACCCCAACTCGATGGAGTCATAACAATGATGGCACTCCAATTGAGATGTCAAGCACTGGTTTATACTTGAAGGCTAGTGGGAAAGGCCTTGAGGCATCTCTTTCATCTGATACCTTAAGCCAACAAAGTGTTTGGAGTGCCATTTCAAATTCTAAACTTCATTTGGCCACCTTCACTCAAGGTGGAAAGAGCCTTTGTTTGCAAATTGATAGCTCTAACTCTTCAAAAGTTGTGACCAACTCTTGCATTTGCACCAATGGTGATCCAAATTGCCTCCAAGACACCCGAAGCCAATGGTTTGAACTCGTTGGAACCAACACATTATGA

Protein sequence

MLAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMLTRYANRTIEENFDLLDLKQAKAGLAQYNPFVLNKTVAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDPQEWLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLVIVSGLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNGFIDHVGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLTQRDLDWALWAWQGSYYFREGQAEPGESFGVLDSNWTQIKNPNFVRKFQLLQTMLQDPNSNASFSYVIYHPQSSQCIQVSNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGKGLEASLSSDTLSQQSVWSAISNSKLHLATFTQGGKSLCLQIDSSNSSKVVTNSCICTNGDPNCLQDTRSQWFELVGTNTL
BLAST of CsaV3_3G017720 vs. NCBI nr
Match: XP_011658389.1 (PREDICTED: uncharacterized protein LOC101207450 [Cucumis sativus] >KGN45940.1 hypothetical protein Csa_6G028440 [Cucumis sativus])

HSP 1 Score: 869.8 bits (2246), Expect = 4.4e-249
Identity = 417/482 (86.51%), Postives = 446/482 (92.53%), Query Frame = 0

Query: 1   MLAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMLTRYANRTIEENFDLLDLKQAKAGLA 60
           ML EGLNHRPLK+LADEAIKLRFNCVRLTYATHM TRYANRT+EENFDLLDL+QAKAGLA
Sbjct: 57  MLIEGLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLEQAKAGLA 116

Query: 61  QYNPFVLNKTVAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDPQE 120
           QYNPFVLNKT+AEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGN  FDPQE
Sbjct: 117 QYNPFVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNRYFDPQE 176

Query: 121 WLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLVIVS 180
           WLQGLSLVAQRF NKSTVVGMSLRNE+RG MENANDWN Y+TQGVTTIH IN  VLVIVS
Sbjct: 177 WLQGLSLVAQRFNNKSTVVGMSLRNELRGMMENANDWNNYVTQGVTTIHKINPAVLVIVS 236

Query: 181 GLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNGFID 240
           GLNYDNDLRCLK+KPLNVSTLDNKL FEVHLYSFSGDSESKFV+QPLNNICA IM+ FID
Sbjct: 237 GLNYDNDLRCLKDKPLNVSTLDNKLAFEVHLYSFSGDSESKFVQQPLNNICAKIMHEFID 296

Query: 241 HVGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLTQRDLDWALWAWQGSYYFR 300
           H  FV++GPNPFPLFVSEYGYDQREV+DAENRFMSCFTAHL Q+DLDWALW WQGSYY+R
Sbjct: 297 HAEFVIEGPNPFPLFVSEYGYDQREVDDAENRFMSCFTAHLAQKDLDWALWTWQGSYYYR 356

Query: 301 EGQAEPGESFGVLDSNWTQIKNPNFVRKFQLLQTMLQDPNSNASFSYVIYHPQSSQCIQV 360
           EGQAE  E+FGVLDSNWTQIKNPNFV+KFQLLQTMLQDP SNASFSYVIYH QS QCI+V
Sbjct: 357 EGQAELAETFGVLDSNWTQIKNPNFVQKFQLLQTMLQDPYSNASFSYVIYHVQSGQCIEV 416

Query: 361 SNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGKGLEASLSSDTLSQQSVWSA 420
           SNDNKEIFLTNCST +RWSH+ND TPI+MSSTGL LKASG+GLEASLS+D + +QS+WSA
Sbjct: 417 SNDNKEIFLTNCSTSSRWSHDNDSTPIKMSSTGLCLKASGEGLEASLSTDCIGKQSLWSA 476

Query: 421 ISNSKLHLATFTQGGKSLCLQ-IDSSNSSKVVTNSCICTNGDPNCLQDTRSQWFELVGTN 480
           ISNS LHL T T+ GKSLCLQ I+SSNSSK+VTNSCICT  DP CLQDT+SQWFELV TN
Sbjct: 477 ISNSNLHLGTVTEDGKSLCLQIIESSNSSKIVTNSCICTTNDPTCLQDTQSQWFELVATN 536

Query: 481 TL 482
           TL
Sbjct: 537 TL 538

BLAST of CsaV3_3G017720 vs. NCBI nr
Match: XP_022932816.1 (uncharacterized protein LOC111439277 [Cucurbita moschata])

HSP 1 Score: 864.0 bits (2231), Expect = 2.4e-247
Identity = 411/481 (85.45%), Postives = 439/481 (91.27%), Query Frame = 0

Query: 1   MLAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMLTRYANRTIEENFDLLDLKQAKAGLA 60
           ML EGLNHRPLK+LADEAIKLRFNCVRLTYAT M TRYANRT+EENFDLLDL+QAKAGLA
Sbjct: 1   MLIEGLNHRPLKELADEAIKLRFNCVRLTYATQMFTRYANRTVEENFDLLDLEQAKAGLA 60

Query: 61  QYNPFVLNKTVAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDPQE 120
           QYNPFVLNKT+AEAYEAVVDVLG SGLMVIADNHMSQPRWCCSLDDGNGFFG+  FDPQE
Sbjct: 61  QYNPFVLNKTIAEAYEAVVDVLGESGLMVIADNHMSQPRWCCSLDDGNGFFGDRYFDPQE 120

Query: 121 WLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLVIVS 180
           WLQGLSLVAQRF  KSTVVGMSLRNEIRG  ENANDWN Y+TQGVTTIHNIN  VLVIVS
Sbjct: 121 WLQGLSLVAQRFSKKSTVVGMSLRNEIRGTNENANDWNNYVTQGVTTIHNINPNVLVIVS 180

Query: 181 GLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNGFID 240
           GLN+DNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGD ESKF+ QPLNNICANI+NGF+D
Sbjct: 181 GLNFDNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDQESKFINQPLNNICANIINGFVD 240

Query: 241 HVGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLTQRDLDWALWAWQGSYYFR 300
           H  FV +GPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHL Q DLDWALW WQGSYY+R
Sbjct: 241 HAEFVREGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQEDLDWALWTWQGSYYYR 300

Query: 301 EGQAEPGESFGVLDSNWTQIKNPNFVRKFQLLQTMLQDPNSNASFSYVIYHPQSSQCIQV 360
           EGQA P E+FGVLDSNWTQIKNPNFV+KFQLLQTMLQDPNSNASFSYVIYHPQS QCIQV
Sbjct: 301 EGQAGPAETFGVLDSNWTQIKNPNFVQKFQLLQTMLQDPNSNASFSYVIYHPQSGQCIQV 360

Query: 361 SNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGKGLEASLSSDTLSQQSVWSA 420
           SNDNK+IF+ NCS  +RW+H+ND TPI MSSTGL LK SG+GL  SLS+D    QS W+A
Sbjct: 361 SNDNKDIFMGNCSISSRWTHDNDSTPIRMSSTGLCLKTSGEGLMPSLSTDCFGPQSSWTA 420

Query: 421 ISNSKLHLATFTQGGKSLCLQIDSSNSSKVVTNSCICTNGDPNCLQDTRSQWFELVGTNT 480
           ISN+KLHLAT TQ GKSLCLQ++SSNSSK+VTNSCICT+G PNCLQDT+SQWFELV TNT
Sbjct: 421 ISNTKLHLATVTQDGKSLCLQVESSNSSKIVTNSCICTDGAPNCLQDTQSQWFELVETNT 480

Query: 481 L 482
           L
Sbjct: 481 L 481

BLAST of CsaV3_3G017720 vs. NCBI nr
Match: XP_022933313.1 (uncharacterized protein LOC111440529 [Cucurbita moschata])

HSP 1 Score: 864.0 bits (2231), Expect = 2.4e-247
Identity = 411/481 (85.45%), Postives = 439/481 (91.27%), Query Frame = 0

Query: 1   MLAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMLTRYANRTIEENFDLLDLKQAKAGLA 60
           ML EGLNHRPLK+LADEAIKLRFNCVRLTYAT M TRYANRT+EENFDLLDL+QAKAGLA
Sbjct: 54  MLIEGLNHRPLKELADEAIKLRFNCVRLTYATQMFTRYANRTVEENFDLLDLEQAKAGLA 113

Query: 61  QYNPFVLNKTVAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDPQE 120
           QYNPFVLNKT+AEAYEAVVDVLG SGLMVIADNHMSQPRWCCSLDDGNGFFG+  FDPQE
Sbjct: 114 QYNPFVLNKTIAEAYEAVVDVLGESGLMVIADNHMSQPRWCCSLDDGNGFFGDRYFDPQE 173

Query: 121 WLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLVIVS 180
           WLQGLSLVAQRF  KSTVVGMSLRNEIRG  ENANDWN Y+TQGVTTIHNIN  VLVIVS
Sbjct: 174 WLQGLSLVAQRFSKKSTVVGMSLRNEIRGTNENANDWNNYVTQGVTTIHNINPNVLVIVS 233

Query: 181 GLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNGFID 240
           GLN+DNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGD ESKF+ QPLNNICANI+NGF+D
Sbjct: 234 GLNFDNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDQESKFINQPLNNICANIINGFVD 293

Query: 241 HVGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLTQRDLDWALWAWQGSYYFR 300
           H  FV +GPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHL Q DLDWALW WQGSYY+R
Sbjct: 294 HAEFVREGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQEDLDWALWTWQGSYYYR 353

Query: 301 EGQAEPGESFGVLDSNWTQIKNPNFVRKFQLLQTMLQDPNSNASFSYVIYHPQSSQCIQV 360
           EGQA P E+FGVLDSNWTQIKNPNFV+KFQLLQTMLQDPNSNASFSYVIYHPQS QCIQV
Sbjct: 354 EGQAGPAETFGVLDSNWTQIKNPNFVQKFQLLQTMLQDPNSNASFSYVIYHPQSGQCIQV 413

Query: 361 SNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGKGLEASLSSDTLSQQSVWSA 420
           SNDNK+IF+ NCS  +RW+H+ND TPI MSSTGL LK SG+GL  SLS+D    QS W+A
Sbjct: 414 SNDNKDIFMGNCSISSRWTHDNDSTPIRMSSTGLCLKTSGEGLMPSLSTDCFGPQSSWTA 473

Query: 421 ISNSKLHLATFTQGGKSLCLQIDSSNSSKVVTNSCICTNGDPNCLQDTRSQWFELVGTNT 480
           ISN+KLHLAT TQ GKSLCLQ++SSNSSK+VTNSCICT+G PNCLQDT+SQWFELV TNT
Sbjct: 474 ISNTKLHLATVTQDGKSLCLQVESSNSSKIVTNSCICTDGAPNCLQDTQSQWFELVETNT 533

Query: 481 L 482
           L
Sbjct: 534 L 534

BLAST of CsaV3_3G017720 vs. NCBI nr
Match: XP_022995752.1 (uncharacterized protein LOC111491191 [Cucurbita maxima])

HSP 1 Score: 863.6 bits (2230), Expect = 3.2e-247
Identity = 412/481 (85.65%), Postives = 439/481 (91.27%), Query Frame = 0

Query: 1   MLAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMLTRYANRTIEENFDLLDLKQAKAGLA 60
           ML EGLNHRPLK+LADEAIKLRFNCVRLTYAT M TRYANRT+EENFDLLDL+QAKAGLA
Sbjct: 56  MLIEGLNHRPLKELADEAIKLRFNCVRLTYATQMFTRYANRTVEENFDLLDLEQAKAGLA 115

Query: 61  QYNPFVLNKTVAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDPQE 120
           Q NPFVLNKT+AEAYEAVVDVLG SGLMVIADNH+SQPRWCCSLDDGNGFFG+  FDPQE
Sbjct: 116 QNNPFVLNKTIAEAYEAVVDVLGESGLMVIADNHISQPRWCCSLDDGNGFFGDRYFDPQE 175

Query: 121 WLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLVIVS 180
           WLQGLSLVAQRF  KSTVVGMSLRNEIRG  ENANDWN Y+TQGVTTIHNIN  VLVIVS
Sbjct: 176 WLQGLSLVAQRFSKKSTVVGMSLRNEIRGTNENANDWNNYVTQGVTTIHNINPNVLVIVS 235

Query: 181 GLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNGFID 240
           GLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGD ESKF+ QPLNNICANI+NGF+D
Sbjct: 236 GLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDPESKFINQPLNNICANIINGFVD 295

Query: 241 HVGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLTQRDLDWALWAWQGSYYFR 300
           H  FV +G NPFPLFVSEYGYDQREVNDAENRFMSCFTAHL Q+DLDWALW WQGSYY+R
Sbjct: 296 HAEFVTEGSNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQKDLDWALWTWQGSYYYR 355

Query: 301 EGQAEPGESFGVLDSNWTQIKNPNFVRKFQLLQTMLQDPNSNASFSYVIYHPQSSQCIQV 360
           EGQAEPGE+FGVLDSNWTQIKNPNFV+KFQLLQTMLQDPNSNASFSYVIYHPQS QCIQV
Sbjct: 356 EGQAEPGETFGVLDSNWTQIKNPNFVQKFQLLQTMLQDPNSNASFSYVIYHPQSGQCIQV 415

Query: 361 SNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGKGLEASLSSDTLSQQSVWSA 420
           SNDNK+IF+ NCS  +RW+H+ND TPI MSS GL LK  G+GL  SLS+D L  QS WSA
Sbjct: 416 SNDNKDIFMGNCSISSRWTHDNDSTPIRMSSMGLCLKTIGEGLTPSLSTDCLGPQSSWSA 475

Query: 421 ISNSKLHLATFTQGGKSLCLQIDSSNSSKVVTNSCICTNGDPNCLQDTRSQWFELVGTNT 480
           ISN+KLHLAT +Q GKSLCLQ++SSNSSK+VTNSCICTNG PNCLQDTRSQWFELV TNT
Sbjct: 476 ISNTKLHLATISQDGKSLCLQVESSNSSKIVTNSCICTNGAPNCLQDTRSQWFELVKTNT 535

Query: 481 L 482
           L
Sbjct: 536 L 536

BLAST of CsaV3_3G017720 vs. NCBI nr
Match: XP_022958333.1 (uncharacterized protein LOC111459581 [Cucurbita moschata])

HSP 1 Score: 859.8 bits (2220), Expect = 4.6e-246
Identity = 411/481 (85.45%), Postives = 436/481 (90.64%), Query Frame = 0

Query: 1   MLAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMLTRYANRTIEENFDLLDLKQAKAGLA 60
           ML EGLNHRPLK+LADEAIKLRFNCVRLTYAT M TRYANRT+EENFDLLDL+ AKAGLA
Sbjct: 1   MLIEGLNHRPLKELADEAIKLRFNCVRLTYATQMFTRYANRTVEENFDLLDLETAKAGLA 60

Query: 61  QYNPFVLNKTVAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDPQE 120
           Q NPFVLNKT+AEAYEAVVDVLG SGLMVIADNH+SQPRWCCSLDDGNGFFG+  FDPQE
Sbjct: 61  QNNPFVLNKTIAEAYEAVVDVLGESGLMVIADNHISQPRWCCSLDDGNGFFGDRYFDPQE 120

Query: 121 WLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLVIVS 180
           WLQGLSLVAQRF  KSTVVGMSLRNEIRG  ENANDWN Y+TQGVTTIHNIN  VLVIVS
Sbjct: 121 WLQGLSLVAQRFSKKSTVVGMSLRNEIRGTNENANDWNNYVTQGVTTIHNINPNVLVIVS 180

Query: 181 GLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNGFID 240
           GLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDSESKF+ QPLNNICANI+NGF+D
Sbjct: 181 GLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDSESKFINQPLNNICANIINGFVD 240

Query: 241 HVGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLTQRDLDWALWAWQGSYYFR 300
           H  FV +G NPFPLFVSEYGYDQREVNDAENRFMSCFTAHL Q+DLDWALW WQGSYY+R
Sbjct: 241 HAEFVTEGSNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQKDLDWALWTWQGSYYYR 300

Query: 301 EGQAEPGESFGVLDSNWTQIKNPNFVRKFQLLQTMLQDPNSNASFSYVIYHPQSSQCIQV 360
           EGQAEPGE+FGVLDSNWTQIKNPNFV+KFQLLQTMLQDPNSNASFSYVIYHPQS QCIQV
Sbjct: 301 EGQAEPGETFGVLDSNWTQIKNPNFVQKFQLLQTMLQDPNSNASFSYVIYHPQSGQCIQV 360

Query: 361 SNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGKGLEASLSSDTLSQQSVWSA 420
           SNDNK+IF+ NCS  +RW+H+ND TPI MSST L LK SG+GL  SLS D L  QS WSA
Sbjct: 361 SNDNKDIFMGNCSISSRWTHDNDSTPIRMSSTSLCLKTSGEGLMPSLSIDCLGPQSSWSA 420

Query: 421 ISNSKLHLATFTQGGKSLCLQIDSSNSSKVVTNSCICTNGDPNCLQDTRSQWFELVGTNT 480
           ISN+KLHLAT    GKSLCLQ++SSNSSK+VTNSCICTNG PNCLQDTRSQWFELV TN 
Sbjct: 421 ISNTKLHLATIAPNGKSLCLQVESSNSSKIVTNSCICTNGAPNCLQDTRSQWFELVKTNA 480

Query: 481 L 482
           L
Sbjct: 481 L 481

BLAST of CsaV3_3G017720 vs. TAIR10
Match: AT1G13130.1 (Cellulase (glycosyl hydrolase family 5) protein)

HSP 1 Score: 364.0 bits (933), Expect = 1.4e-100
Identity = 189/481 (39.29%), Postives = 292/481 (60.71%), Query Frame = 0

Query: 1   MLAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMLTRYA---NRTIEENFDLLDLKQAKA 60
           ++AEGL+ +P+  +A + +++ FNCVRLT+   ++T      N T+ ++F  L L     
Sbjct: 64  VVAEGLSKQPVDAVAKKIVEMGFNCVRLTWPLDLMTNETLANNVTVRQSFQSLGLNDDIV 123

Query: 61  GLAQYNPFVLNKTVAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFD 120
           G    NP +++  + EAY+ VV  LG + +MVI DNH+++P WCC+ DDGNGFFG+  FD
Sbjct: 124 GFQTNNPSIIDLPLIEAYKTVVTTLGNNDVMVILDNHLTKPGWCCANDDGNGFFGDQFFD 183

Query: 121 PQEWLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLV 180
           P  W+  L  +A  F   S VVGMSLRNE+RG  +N NDW KY+ QG   +H+ N++VLV
Sbjct: 184 PTVWVAALKKMAATFNGVSNVVGMSLRNELRGPKQNVNDWFKYMQQGAEAVHSANNKVLV 243

Query: 181 IVSGLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNG 240
           I+SGL++D DL  ++ +P+ +S    KLVFE+H YSFS D  S     P N+IC  ++N 
Sbjct: 244 ILSGLSFDADLSFVRSRPVKLS-FTGKLVFELHWYSFS-DGNSWAANNP-NDICGRVLNR 303

Query: 241 FIDHVGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLTQRDLDWALWAWQGSY 300
             +  G+++     FPLF+SE+G D+R VN  +NR+  C T    + D+DW+LWA  GSY
Sbjct: 304 IGNGGGYLLN--QGFPLFLSEFGIDERGVNTNDNRYFGCLTGWAAENDVDWSLWALTGSY 363

Query: 301 YFREGQAEPGESFGVLDSNWTQIKNPNFVRKFQLLQTMLQDPNSNASFSYVIYHPQSSQC 360
           Y R+G+    E +GVLDS+W  ++N +F++K   LQ+ LQ P        +++HP +  C
Sbjct: 364 YLRQGKVGMNEYYGVLDSDWISVRNSSFLQKISFLQSPLQGPGPRTDAYNLVFHPLTGLC 423

Query: 361 IQVS-NDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGKGLEASLSSDTLSQQ- 420
           I  S +D K + L  C++   WS+      + +    L L+++G     +++  + S   
Sbjct: 424 IVRSLDDPKMLTLGPCNSSEPWSYTKKA--LRIKDQQLCLQSNGPKNPVTMTRTSCSTSG 483

Query: 421 SVWSAISNSKLHLATFTQGGKSLCLQIDSSNSSKVVTNSCICTNGDPNCLQDTRSQWFEL 477
           S W  IS S++HLA+ T    SLCL +D++N+  VV N+C C + D +C  +  SQWF++
Sbjct: 484 SKWQTISASRMHLASTTSNKTSLCLDVDTANN--VVANACKCLSKDKSC--EPMSQWFKI 533

BLAST of CsaV3_3G017720 vs. TAIR10
Match: AT3G26130.1 (Cellulase (glycosyl hydrolase family 5) protein)

HSP 1 Score: 341.3 bits (874), Expect = 9.9e-94
Identity = 195/485 (40.21%), Postives = 278/485 (57.32%), Query Frame = 0

Query: 2   LAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMLTR---YANRTIEENFDLLDLKQAKAG 61
           +AEGL+ +PL  +A++ + + FNCVRLT+  ++ T     A  T+ ++     L +A +G
Sbjct: 55  VAEGLSKQPLDAIAEKIVSMGFNCVRLTWPLYLATDESFSAFMTVRQSLRKFRLFEAVSG 114

Query: 62  LAQYNPFVLNKTVAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDP 121
              +NP +L+  + +A++ VV  L    +MVI DNH+SQP WCCS +DGNGFFG+ + +P
Sbjct: 115 FQTHNPTILDLPLIKAFQEVVYCLEKHRVMVILDNHISQPGWCCSDNDGNGFFGDKHLNP 174

Query: 122 QEWLQGLSLVAQRFRN-KSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLV 181
           Q W++GL  +A  F N  S VVGMSLRNE+RG  +N  DW KY+ +G   +H++N  VLV
Sbjct: 175 QVWIKGLKKMASMFANVSSNVVGMSLRNELRGPKQNIKDWYKYMREGAEAVHSVNPNVLV 234

Query: 182 IVSGLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNG 241
           IVSGLNY  DL  L+E+P  VS    K+VFE+H Y F    E       LN IC      
Sbjct: 235 IVSGLNYATDLSFLRERPFEVS-FRRKVVFEIHWYGFWNTWEG----DNLNKICGKETEK 294

Query: 242 FIDHVGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLTQRDLDWALWAWQGSY 301
            +   GF+++     PLFVSE+G DQR  N  +N+F+SCF A    RDLDW+LW   GSY
Sbjct: 295 MMKMSGFLLE--KGIPLFVSEFGIDQRGNNANDNKFLSCFMALAADRDLDWSLWTLAGSY 354

Query: 302 YFREGQAEPGESFGVLDSNWTQIKNPNFVRKFQLLQT-----MLQDPNSNASFSYVIYHP 361
           Y RE      ES+GVLD NW+ I+N   ++    +QT     M   P        +++HP
Sbjct: 355 YIREKSIGSDESYGVLDFNWSSIRNSTILQMISAIQTPFIGLMETQPKK------IMFHP 414

Query: 362 QSSQCIQVSNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGKGLEASLS-SDT 421
            +  CI V     ++ L +C+    W  ++           L LKA  KG    L    +
Sbjct: 415 STGLCI-VRKSLFQLKLGSCNRSESWRLSSHRVLSLAEEQILCLKAYEKGKSVKLRLFFS 474

Query: 422 LSQQSVWSAISNSKLHLATFTQGGKSLCLQIDSSNSSKVVTNSCICTNGDPNCLQDTRSQ 477
            S  S W   S+SK+ L++ T+ G S+CL +D+ N++ +VTNSC C  G+ +C  D RSQ
Sbjct: 475 ESYCSKWKLFSDSKMQLSSITKNGFSVCLDVDTENNN-IVTNSCKCLRGNSSC--DPRSQ 522

BLAST of CsaV3_3G017720 vs. TAIR10
Match: AT3G26140.1 (Cellulase (glycosyl hydrolase family 5) protein)

HSP 1 Score: 330.5 bits (846), Expect = 1.7e-90
Identity = 179/483 (37.06%), Postives = 281/483 (58.18%), Query Frame = 0

Query: 1   MLAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMLTRYA---NRTIEENFDLLDLKQAKA 60
           ++AEGL+ + + DLA + + + FNCVR T+   + T      N T+ ++F  L L    +
Sbjct: 34  VVAEGLSKQSVDDLAKKIMAMGFNCVRFTWPLDLATNETLANNVTVRQSFQSLGLNDDIS 93

Query: 61  GLAQYNPFVLNKTVAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFD 120
           G    NP +++  + EAY+ VV  LG + +MVI DNH+++P WCC  +DGNGFFG+  FD
Sbjct: 94  GFETKNPSMIDLPLIEAYKKVVAKLGNNNVMVILDNHVTKPGWCCGYNDGNGFFGDTFFD 153

Query: 121 PQEWLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLV 180
           P  W+ GL+ +A  F+  + VVGMSLRNE+RG  +N +DW KY+ QG   +H  N  VLV
Sbjct: 154 PTTWIAGLTKIAMTFKGATNVVGMSLRNELRGPKQNVDDWFKYMQQGAEAVHEANPNVLV 213

Query: 181 IVSGLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNG 240
           I+SGL+YD DL  ++ + +N+ T   KLVFE+H YSF+ ++ +   K P N  C  I+  
Sbjct: 214 ILSGLSYDTDLSFVRSRHVNL-TFTRKLVFELHRYSFT-NTNTWSSKNP-NEACGEILKS 273

Query: 241 FIDHVGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLTQRDLDWALWAWQGSY 300
             +  GF ++    FP+F+SE+G D R  N  +NR++ C      + D+DW++W  QGSY
Sbjct: 274 IENGGGFNLR---DFPVFLSEFGIDLRGKNVNDNRYIGCILGWAAENDVDWSIWTLQGSY 333

Query: 301 YFREGQAEPGESFGVLDSNWTQIKNPNFVRKFQLLQTMLQDPNSNASFSYVIYHPQSSQC 360
           Y REG     E +G+LDS+W ++++ +F+++  L+ + LQ P S +    +++HP +  C
Sbjct: 334 YLREGVVGMSEFYGILDSDWVRVRSQSFLQRLSLILSPLQGPGSQSKVYNLVFHPLTGLC 393

Query: 361 -IQVSNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGKGLEASLSSDTLSQQ- 420
            +Q   D  ++ L  C+    WS+    T + +    L L+++G      LS  + S   
Sbjct: 394 MLQSILDPTKVTLGLCNESQPWSYTPQNT-LTLKDKSLCLESTGPNAPVKLSETSCSSPN 453

Query: 421 -SVWSAISNSKLHLATFTQGGKSLCLQIDSSNSSKVVTNSCICTNG-DPNCLQDTRSQWF 477
            S W  IS S + LA       SLCL +D +N+  ++ ++C C  G D +C  D  SQWF
Sbjct: 454 LSEWETISASNMLLAA-KSTNNSLCLDVDETNN--LMASNCKCVKGEDSSC--DPISQWF 504

BLAST of CsaV3_3G017720 vs. TAIR10
Match: AT5G17500.1 (Glycosyl hydrolase superfamily protein)

HSP 1 Score: 320.9 bits (821), Expect = 1.4e-87
Identity = 177/488 (36.27%), Postives = 275/488 (56.35%), Query Frame = 0

Query: 1   MLAEGLNHRPLKDLADEAIKLRFNCVRLTYATHML---TRYANRTIEENFDLLDLKQAKA 60
           ++AEGL+ +P+  ++ +   + FNCVRLT+   ++   T   N T++++F+   L     
Sbjct: 57  VVAEGLSSQPMDSISKKIKDMGFNCVRLTWPLELMINDTLAFNVTVKQSFERYGLDHELQ 116

Query: 61  GLAQYNPFVLNKTVAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFD 120
           G+  +NP+++N  +   ++AVV  LG   +MVI DNH + P WCCS DD + FFG+  F+
Sbjct: 117 GIYTHNPYIVNTPLINVFQAVVYSLGRHDVMVILDNHKTVPGWCCSNDDPDAFFGDPKFN 176

Query: 121 PQEWLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLV 180
           P  W+ GL  +A  F N   VVGMSLRNE+RG+   + DW KY+ +G   +H  N  VLV
Sbjct: 177 PDLWMLGLKKMATIFMNVKNVVGMSLRNELRGYNHTSKDWYKYMQKGAEAVHTSNPNVLV 236

Query: 181 IVSGLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNG 240
           I+SGLN+D DL  LK++P+N+S    KLV E+H YSF+ D   ++    +N+ C+ + + 
Sbjct: 237 ILSGLNFDADLSFLKDRPVNLS-FKKKLVLELHWYSFT-DGTGQWKSHNVNDFCSQMFSK 296

Query: 241 FIDHVGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLTQRDLDWALWAWQGSY 300
                GFV+     FPLF+SE+G DQR  +   NR+M+C  A   ++DLDWA+WA  G Y
Sbjct: 297 ERRTGGFVLD--QGFPLFLSEFGTDQRGGDLEGNRYMNCMLAWAAEKDLDWAVWAVTGVY 356

Query: 301 YFREGQAEPGESFGVLDSNWTQIKNPNFVRKFQLLQTMLQDPNSNASFSYVIYHPQSSQC 360
           YFREG+    E++G+LD+NW  + N  ++R+  ++Q     P    +    I+HP +  C
Sbjct: 357 YFREGKRGVVEAYGMLDANWHNVHNYTYLRRLSVIQPPHTGPGVKHNHHKKIFHPLTGLC 416

Query: 361 IQVSN--DNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGKGLEASLSSDTLSQQ 420
           +   +     E+ L  C+    WS+++ G           +    +G ++ L  +T   +
Sbjct: 417 LVRKSHCHESELTLGPCTKDEPWSYSHGG-----------ILEIRRGHKSCLEGETAVGK 476

Query: 421 SV--------WSAISNSKLHLATFTQGGKSLCLQIDSSNSSKVVTNSCICTNGDPNCLQD 476
           SV           IS +K+HL+  T  G  +CL +DS N+  VV NSC C  GD  C  +
Sbjct: 477 SVKLGRICTKIEQISATKMHLSFNTSDGSLVCLDVDSDNN--VVANSCNCLTGDTTC--E 525

BLAST of CsaV3_3G017720 vs. TAIR10
Match: AT5G16700.1 (Glycosyl hydrolase superfamily protein)

HSP 1 Score: 257.3 bits (656), Expect = 1.9e-68
Identity = 163/481 (33.89%), Postives = 257/481 (53.43%), Query Frame = 0

Query: 2   LAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMLTR---YANRTIEENFDLLDLKQAKAG 61
           +AEGL+ +PL  ++ + + + FNCVRLT+   ++T        T++++F+ L L +   G
Sbjct: 56  VAEGLSKQPLDSISKKIVSMGFNCVRLTWPLDLVTNDTLALKVTVKQSFESLKLFEDVLG 115

Query: 62  LAQYNPFVLNKTVAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDP 121
           +  +NP +L+  +  A++ VV  LG +G+MVI DNH++ P WCC  +D + FFG  +FDP
Sbjct: 116 IQTHNPKLLHLPLFNAFQEVVSNLGENGVMVILDNHLTTPGWCCGDNDLDAFFGYPHFDP 175

Query: 122 QEWLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLVI 181
             W +GL  +A  FRN + V+GMSLRNE RG  +  + W +++ QG   +H  N ++LVI
Sbjct: 176 LVWAKGLRKMATLFRNFTHVIGMSLRNEPRGARDYPDLWFRHMPQGAEAVHAANPKLLVI 235

Query: 182 VSGLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNGF 241
           +SG+++D +L  L+++ +NVS  D KLVFE+H YSFS D    + K   N+ C  I+   
Sbjct: 236 LSGIDFDTNLSFLRDRSVNVSFTD-KLVFELHWYSFS-DGRDSWRKHNSNDFCVKIIEKV 295

Query: 242 IDHVGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLTQRDLDWALWAWQGSYY 301
             + GF++     FPL +SE+G DQR  + + NR+M+C  A   + DLDWA+WA  G YY
Sbjct: 296 THNGGFLL--GRGFPLILSEFGTDQRGGDMSGNRYMNCLVAWAAENDLDWAVWALTGDYY 355

Query: 302 FREGQAEPGESFGVLDSNWTQIKNPNFVRKFQLLQTMLQDPNSNASFSYVIYHPQSSQCI 361
            R G   PG                               PN N     +++HP +  C+
Sbjct: 356 LRTG---PG-----------------------------LRPNKN-----LLFHPSTGLCV 415

Query: 362 --QVSNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASG-KGLEASLSSDTLSQQ 421
               S++   + L  C     W+ N     + ++   + ++A    G +  L   T  + 
Sbjct: 416 TNNPSDNIPTLRLGPCPKSDPWTFNPSEGILWINK--MCVEAPNVVGQKVKLGVGT--KC 475

Query: 422 SVWSAISNSKLHLATFTQGGKSLCLQIDSSNSSKVVTNSCICTNGDPNCLQDTRSQWFEL 477
           S    IS +K+HL+  T  G  LCL +D  ++S VV N C     D +C  D  SQWF++
Sbjct: 476 SKLGQISATKMHLSFKTSNGLLLCLDVDERDNS-VVANRCKFLTMDASC--DPASQWFKV 488

BLAST of CsaV3_3G017720 vs. Swiss-Prot
Match: sp|C0HLA0|GH5FP_CHAOB (Glycosyl hydrolase 5 family protein OS=Chamaecyparis obtusa OX=13415 PE=1 SV=1)

HSP 1 Score: 349.0 bits (894), Expect = 8.6e-95
Identity = 191/493 (38.74%), Postives = 288/493 (58.42%), Query Frame = 0

Query: 2   LAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMLTR--YANRTIEENFDLLDLKQAKAGL 61
           L EGLN  P+  +A     L FNCVRLTY+ HMLTR  Y N T+ + F  L+L +A +G+
Sbjct: 62  LPEGLNRLPVATVAHTISSLGFNCVRLTYSIHMLTRTSYTNATVAQTFARLNLTEAASGI 121

Query: 62  AQYNPFVLNKTVAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDPQ 121
              NP +L+     AY  VV  L  +G+MVI DNH+S+P+WCC++DDGNGFFG+  F+P 
Sbjct: 122 EHNNPELLDLGHVAAYHHVVAALSEAGVMVILDNHVSKPKWCCAVDDGNGFFGDRYFNPN 181

Query: 122 EWLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLVIV 181
            W++GL L+A  F N   VV MSLRNE+RG       W++++  G  T+H  N +VLVI+
Sbjct: 182 TWVEGLGLMATYFNNTPNVVAMSLRNELRGNRSTPISWSRHMQWGAATVHKANPKVLVIL 241

Query: 182 SGLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNGFI 241
           SGL +D DL  L   P+ +     K+V+E H YSF     +       N++C N    F 
Sbjct: 242 SGLQFDTDLSFLPVLPVTL-PFKEKIVYEGHWYSFGVPWRTGLP----NDVCKNETGRFK 301

Query: 242 DHVGFVMQGPN--PFPLFVSEYGYDQREVNDAENRFMSCFTAHLTQRDLDWALWAWQGSY 301
            +VGFV    N    PLF+SE+G DQR VND +NR+++C  A+L + DLDWALW   GSY
Sbjct: 302 SNVGFVTSSANATAAPLFMSEFGIDQRYVNDNDNRYLNCILAYLAEEDLDWALWTMGGSY 361

Query: 302 YFREGQ---AEPGESFGVLDSNWTQIKNPNFVRKFQLLQTMLQDPN-SNASFSYVIYHPQ 361
           Y+R  +    +  E++G  + +W++I+NP+F+ + + +Q  +QDP  +   +  +IYHP 
Sbjct: 362 YYRSDKQPVKDFEETYGFFNHDWSRIRNPDFISRLKEIQQPIQDPYLAPGPYYQIIYHPA 421

Query: 362 SSQCIQVSNDNKEIFLTNC-STPTRWSHN-NDGTPIEMSSTGLYLKASGKGLEASLSSD- 421
           S  C++ S     + L +C S  +RW+++ +   PI +  +   +   G GL A ++ + 
Sbjct: 422 SGLCVE-SGIGNTVHLGSCQSVRSRWNYDASVKGPIGLMGSSSCISTQGNGLPAIMTENC 481

Query: 422 TLSQQSVWSAISNSKLHLATFTQG--GKSLCLQIDSSNSSKVVTNSCICTNGDPNCLQ-- 480
           +    ++WS +S+++L L T   G  GK   + +D S S  + TN CIC   D +C    
Sbjct: 482 SAPNNTLWSTVSSAQLQLGTRVLGKDGKEKWMCLDGSKSPLISTNECICIT-DSHCYPKL 541

BLAST of CsaV3_3G017720 vs. Swiss-Prot
Match: sp|P54583|GUN1_ACIC1 (Endoglucanase E1 OS=Acidothermus cellulolyticus (strain ATCC 43068 / 11B) OX=351607 GN=Acel_0614 PE=1 SV=1)

HSP 1 Score: 76.3 bits (186), Expect = 1.1e-12
Identity = 77/338 (22.78%), Postives = 137/338 (40.53%), Query Frame = 0

Query: 2   LAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMLTRYANRTIEENFDLLDLKQAKAGLAQ 61
           +  GL  R  + + D+   L +N +RL Y+  +L      T+  + +   + Q   GL  
Sbjct: 78  VVHGLWSRDYRSMLDQIKSLGYNTIRLPYSDDIL---KPGTMPNSINFYQMNQDLQGLTS 137

Query: 62  YNPFVLNKTVAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDPQEW 121
               V++K VA A        G  GL +I D H    R  CS    +  +  ++     W
Sbjct: 138 LQ--VMDKIVAYA--------GQIGLRIILDRH----RPDCS--GQSALWYTSSVSEATW 197

Query: 122 LQGLSLVAQRFRNKSTVVGMSLRNEIR-----GFMENANDWNKYITQGVTTIHNINSEVL 181
           +  L  +AQR++   TVVG  L NE       G  + + DW     +    + ++N  +L
Sbjct: 198 ISDLQALAQRYKGNPTVVGFDLHNEPHDPACWGCGDPSIDWRLAAERAGNAVLSVNPNLL 257

Query: 182 VIVSGL-NYDND-------LRCLKEKPLNVSTLDNKLVFEVHLYSFSGDSESKFVKQPLN 241
           + V G+ +Y+ D       L+   + P+ V  + N+LV+  H Y+ S   ++ F      
Sbjct: 258 IFVEGVQSYNGDSYWWGGNLQGAGQYPV-VLNVPNRLVYSAHDYATSVYPQTWFSDPTFP 317

Query: 242 NICANIMNGFIDHVGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCF------TAHLT 301
           N    I N    + G++    N  P+++ E+G   +   D    ++         TA   
Sbjct: 318 NNMPGIWN---KNWGYLF-NQNIAPVWLGEFGTTLQSTTD--QTWLKTLVQYLRPTAQYG 377

Query: 302 QRDLDWALWAWQGSYYFREGQAEPGESFGVLDSNWTQI 321
                W  W+W           + G++ G+L  +W  +
Sbjct: 378 ADSFQWTFWSW---------NPDSGDTGGILKDDWQTV 380

BLAST of CsaV3_3G017720 vs. Swiss-Prot
Match: sp|P19487|GUNA_XANCP (Major extracellular endoglucanase OS=Xanthomonas campestris pv. campestris (strain ATCC 33913 / DSM 3586 / NCPPB 528 / LMG 568 / P 25) OX=190485 GN=engXCA PE=1 SV=2)

HSP 1 Score: 76.3 bits (186), Expect = 1.1e-12
Identity = 83/354 (23.45%), Postives = 139/354 (39.27%), Query Frame = 0

Query: 5   GLNHRPLKDLADEAIKLRFNCVRLTYATHML---TRYANRTIEENFDLLDLKQAKAGLAQ 64
           GL  R  KD+  +   L FN VRL +    L   T  A+     N DL  L   +     
Sbjct: 61  GLWARNWKDMIVQMQGLGFNAVRLPFCPATLRSDTMPASIDYSRNADLQGLTSLQ----- 120

Query: 65  YNPFVLNKTVAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDPQEW 124
               +L+K +AE          A G+ V+ D+H      C  + +    +   ++   +W
Sbjct: 121 ----ILDKVIAE--------FNARGMYVLLDHHTPD---CAGISE---LWYTGSYTEAQW 180

Query: 125 LQGLSLVAQRFRNKSTVVGMSLRNEIRGFM-----ENANDWNKYITQGVTTIHNINSEVL 184
           L  L  VA R++N   V+G+ L+NE  G         A DWNK   +G   +  +  + L
Sbjct: 181 LADLRFVANRYKNVPYVLGLDLKNEPHGAATWGTGNAATDWNKAAERGSAAVLAVAPKWL 240

Query: 185 VIVSGLN------------YDNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDSESKFVK 244
           + V G+             +  +L+ L   PLN+    N+L+   H+Y         FV+
Sbjct: 241 IAVEGITDNPVCSTNGGIFWGGNLQPLACTPLNIPA--NRLLLAPHVY-----GPDVFVQ 300

Query: 245 QPLN--NICANIMNGFIDHVGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLT 304
              N  N   N+   +  H G   Q      L + E+G    E +  +  +      +L 
Sbjct: 301 SYFNDSNFPNNMPAIWERHFG---QFAGTHALLLGEFGGKYGEGDARDKTWQDALVKYLR 360

Query: 305 QRDLDWAL-WAWQGSYYFREGQAEPGESFGVLDSNWTQIKNPNFVRKFQLLQTM 336
            + ++    W+W             G++ G+L  +WT ++      K  LL+T+
Sbjct: 361 SKGINQGFYWSW---------NPNSGDTGGILRDDWTSVRQD----KMTLLRTL 368

BLAST of CsaV3_3G017720 vs. Swiss-Prot
Match: sp|P23548|GUN_PAEPO (Endoglucanase OS=Paenibacillus polymyxa OX=1406 PE=3 SV=2)

HSP 1 Score: 67.8 bits (164), Expect = 3.8e-10
Identity = 67/339 (19.76%), Postives = 137/339 (40.41%), Query Frame = 0

Query: 5   GLNHRPLKDLADEAIKLRFNCVRLTYATHMLTRYANRTIEENFDLLDLKQAKAGLAQYNP 64
           GL  R + D+ D+  K  +N +RL Y+  +    +        D +D  +        NP
Sbjct: 73  GLWSRSMDDMLDQVKKEGYNLIRLPYSNQLFDSSSRP------DSIDYHK--------NP 132

Query: 65  FVLNKTVAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNG----FFGNNNFDPQE 124
            ++     +  + +++  G  G+ +I D H            G+G     +  + +    
Sbjct: 133 DLVGLNPIQIMDKLIEKAGQRGIQIILDRHR----------PGSGGQSELWYTSQYPESR 192

Query: 125 WLQGLSLVAQRFRNKSTVVGMSLRNEIRGFME----NAN-DWNKYITQGVTTIHNINSEV 184
           W+    ++A R++N  TV+G  L NE  G       NA+ DW     +    I ++N   
Sbjct: 193 WISDWKMLADRYKNNPTVIGADLHNEPHGQASWGTGNASTDWRLAAQRAGNAILSVNPNW 252

Query: 185 LVIVSGLNYD-----------NDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDSESKFVK 244
           L++V G++++            +L  +   P+ V  + N++V+  H Y   G S   +  
Sbjct: 253 LILVEGVDHNVQGNNSQYWWGGNLTGVANYPV-VLDVPNRVVYSPHDYG-PGVSSQPWFN 312

Query: 245 QPLNNICANIMNGFIDHVGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLTQR 304
            P     +N+   +    G++ +  N  P+ V E+G    +++  E ++ +    ++   
Sbjct: 313 DPA--FPSNLPAIWDQTWGYISK-QNIAPVLVGEFGGRNVDLSCPEGKWQNALVHYIGAN 372

Query: 305 DLDWALWAWQGSYYFREGQAEPGESFGVLDSNWTQIKNP 324
           +L +  W+              G++ G+L  +WT    P
Sbjct: 373 NLYFTYWSL---------NPNSGDTGGLLLDDWTTWNRP 373

BLAST of CsaV3_3G017720 vs. TrEMBL
Match: tr|A0A0A0K853|A0A0A0K853_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G028440 PE=3 SV=1)

HSP 1 Score: 869.8 bits (2246), Expect = 2.9e-249
Identity = 417/482 (86.51%), Postives = 446/482 (92.53%), Query Frame = 0

Query: 1   MLAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMLTRYANRTIEENFDLLDLKQAKAGLA 60
           ML EGLNHRPLK+LADEAIKLRFNCVRLTYATHM TRYANRT+EENFDLLDL+QAKAGLA
Sbjct: 57  MLIEGLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLEQAKAGLA 116

Query: 61  QYNPFVLNKTVAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDPQE 120
           QYNPFVLNKT+AEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGN  FDPQE
Sbjct: 117 QYNPFVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNRYFDPQE 176

Query: 121 WLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLVIVS 180
           WLQGLSLVAQRF NKSTVVGMSLRNE+RG MENANDWN Y+TQGVTTIH IN  VLVIVS
Sbjct: 177 WLQGLSLVAQRFNNKSTVVGMSLRNELRGMMENANDWNNYVTQGVTTIHKINPAVLVIVS 236

Query: 181 GLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNGFID 240
           GLNYDNDLRCLK+KPLNVSTLDNKL FEVHLYSFSGDSESKFV+QPLNNICA IM+ FID
Sbjct: 237 GLNYDNDLRCLKDKPLNVSTLDNKLAFEVHLYSFSGDSESKFVQQPLNNICAKIMHEFID 296

Query: 241 HVGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLTQRDLDWALWAWQGSYYFR 300
           H  FV++GPNPFPLFVSEYGYDQREV+DAENRFMSCFTAHL Q+DLDWALW WQGSYY+R
Sbjct: 297 HAEFVIEGPNPFPLFVSEYGYDQREVDDAENRFMSCFTAHLAQKDLDWALWTWQGSYYYR 356

Query: 301 EGQAEPGESFGVLDSNWTQIKNPNFVRKFQLLQTMLQDPNSNASFSYVIYHPQSSQCIQV 360
           EGQAE  E+FGVLDSNWTQIKNPNFV+KFQLLQTMLQDP SNASFSYVIYH QS QCI+V
Sbjct: 357 EGQAELAETFGVLDSNWTQIKNPNFVQKFQLLQTMLQDPYSNASFSYVIYHVQSGQCIEV 416

Query: 361 SNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGKGLEASLSSDTLSQQSVWSA 420
           SNDNKEIFLTNCST +RWSH+ND TPI+MSSTGL LKASG+GLEASLS+D + +QS+WSA
Sbjct: 417 SNDNKEIFLTNCSTSSRWSHDNDSTPIKMSSTGLCLKASGEGLEASLSTDCIGKQSLWSA 476

Query: 421 ISNSKLHLATFTQGGKSLCLQ-IDSSNSSKVVTNSCICTNGDPNCLQDTRSQWFELVGTN 480
           ISNS LHL T T+ GKSLCLQ I+SSNSSK+VTNSCICT  DP CLQDT+SQWFELV TN
Sbjct: 477 ISNSNLHLGTVTEDGKSLCLQIIESSNSSKIVTNSCICTTNDPTCLQDTQSQWFELVATN 536

Query: 481 TL 482
           TL
Sbjct: 537 TL 538

BLAST of CsaV3_3G017720 vs. TrEMBL
Match: tr|A0A1S3BDI2|A0A1S3BDI2_CUCME (major extracellular endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103488703 PE=3 SV=1)

HSP 1 Score: 812.0 bits (2096), Expect = 7.2e-232
Identity = 389/448 (86.83%), Postives = 415/448 (92.63%), Query Frame = 0

Query: 34  MLTRYANRTIEENFDLLDLKQAKAGLAQYNPFVLNKTVAEAYEAVVDVLGASGLMVIADN 93
           M TRYANRT+EENFDLLDL QAKAGL QYNPFVLNKT+AEAYEAVVDVLGASGLMVIADN
Sbjct: 1   MFTRYANRTVEENFDLLDLGQAKAGLTQYNPFVLNKTIAEAYEAVVDVLGASGLMVIADN 60

Query: 94  HMSQPRWCCSLDDGNGFFGNNNFDPQEWLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMEN 153
           HMSQPRWCCSLDDGNGFFGN  FDPQEWLQGLSLVAQRF NKSTVVGMSLRNEIRG MEN
Sbjct: 61  HMSQPRWCCSLDDGNGFFGNRYFDPQEWLQGLSLVAQRFNNKSTVVGMSLRNEIRGMMEN 120

Query: 154 ANDWNKYITQGVTTIHNINSEVLVIVSGLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYS 213
           ANDWN Y+TQGVTTIHNIN EVLVIV GLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYS
Sbjct: 121 ANDWNHYVTQGVTTIHNINPEVLVIVGGLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYS 180

Query: 214 FSGDSESKFVKQPLNNICANIMNGFIDHVGFVMQGPNPFPLFVSEYGYDQREVNDAENRF 273
           FSG SESKFV+QPLNNICA I+N FIDH  FV++G NPFPLFVSEYGYDQREV+DAENRF
Sbjct: 181 FSGASESKFVQQPLNNICAKIINEFIDHAEFVIEGSNPFPLFVSEYGYDQREVDDAENRF 240

Query: 274 MSCFTAHLTQRDLDWALWAWQGSYYFREGQAEPGESFGVLDSNWTQIKNPNFVRKFQLLQ 333
           MSCFTAHL Q+DLDWALW WQGSYY+REGQAE  E+FGVL+SNWTQIKNPNFV+KFQLLQ
Sbjct: 241 MSCFTAHLAQKDLDWALWTWQGSYYYREGQAELPETFGVLESNWTQIKNPNFVQKFQLLQ 300

Query: 334 TMLQDPNSNASFSYVIYHPQSSQCIQVSNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTG 393
           TMLQDPNSNASFSYVIYHPQS QCI+VSNDNK+IFLTNCST +RWSH+ND TPI+MS+TG
Sbjct: 301 TMLQDPNSNASFSYVIYHPQSGQCIEVSNDNKDIFLTNCSTSSRWSHDNDSTPIKMSNTG 360

Query: 394 LYLKASGKGLEASLSSDTLSQQSVWSAISNSKLHLATFTQGGKSLCLQIDSSNSSKVVTN 453
           L LKASG+GL ASLS+D L +QSVWSAISNSKLHLAT T+ GKSLCLQI+SSNSSK+VTN
Sbjct: 361 LCLKASGEGLAASLSNDCLGKQSVWSAISNSKLHLATVTENGKSLCLQIESSNSSKIVTN 420

Query: 454 SCICTNGDPNCLQDTRSQWFELVGTNTL 482
           SCICT  DP CLQDT+SQWFELV TNTL
Sbjct: 421 SCICTTDDPTCLQDTQSQWFELVETNTL 448

BLAST of CsaV3_3G017720 vs. TrEMBL
Match: tr|A0A1S3CTF8|A0A1S3CTF8_CUCME (major extracellular endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103504686 PE=3 SV=1)

HSP 1 Score: 686.4 bits (1770), Expect = 4.6e-194
Identity = 318/483 (65.84%), Postives = 391/483 (80.95%), Query Frame = 0

Query: 1   MLAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMLTRYANRTIEENFDLLDLKQAKAGLA 60
           ML EGL+ RPLKDLA+E ++L+FNCVRLTYATHM TRYANRT+EENFDLLDL+ +K GLA
Sbjct: 57  MLIEGLDRRPLKDLANEVMRLKFNCVRLTYATHMFTRYANRTVEENFDLLDLRASKVGLA 116

Query: 61  QYNPFVLNKTVAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDPQE 120
            +NPFVLN T+ EAYEAVVDVLG SGLMVIADNH+SQPRWCCSL+DGNGFFG+  FD +E
Sbjct: 117 LHNPFVLNMTIFEAYEAVVDVLGTSGLMVIADNHISQPRWCCSLEDGNGFFGDRYFDSEE 176

Query: 121 WLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLVIVS 180
           WL+GL LVA+RF NKS VV MSLRNE+RG    + DWNKY+TQG TTIHNIN  +LVI+S
Sbjct: 177 WLEGLRLVARRFYNKSAVVAMSLRNELRGASSKSKDWNKYVTQGATTIHNINPNILVIIS 236

Query: 181 GLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNGFID 240
           GLN+DNDLRC ++ PL ++ L NKLVFEVHLYSFSG+S+SKF+  PLN IC+ I+NGF+ 
Sbjct: 237 GLNFDNDLRCQRQYPLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICSKIINGFVQ 296

Query: 241 HVGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLTQRDLDWALWAWQGSYYFR 300
              FVM+G    PLFVSE+G DQ  VN+A++RF+SCF+AHL ++DLDWALW WQGSYY+R
Sbjct: 297 RAEFVMEGAEAVPLFVSEFGLDQTGVNEADDRFLSCFSAHLVEKDLDWALWGWQGSYYYR 356

Query: 301 EGQAEPGESFGVLDSNWTQIKNPNFVRKFQLLQTMLQDPNSNASFSYVIYHPQSSQCIQV 360
           +G+ E  E FGVL+ NW+ ++NP F + FQLLQTMLQDPNSN+S +Y++YHPQS QC+QV
Sbjct: 357 QGKVELEEVFGVLNYNWSDVRNPRFSQMFQLLQTMLQDPNSNSSNTYLMYHPQSGQCVQV 416

Query: 361 SN-DNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGKGLEASLSSDTLSQQSVWS 420
            +   KEIFL NCS  + WS+  DGTPI ++ST   LKA+G GL  SLS D   +QSVW+
Sbjct: 417 HDMKQKEIFLNNCSNASHWSYEGDGTPIMLASTNFCLKANGNGLPPSLSRDCFGEQSVWT 476

Query: 421 AISNSKLHLATFT-QGGKSLCLQIDSSNSSKVVTNSCICTNGDPNCLQDTRSQWFELVGT 480
           AIS+SKLHLAT T QG   +CL+ +SSNSS+++  SC+C   D NCLQDT++QWF+LV T
Sbjct: 477 AISDSKLHLATLTKQGNNGMCLEKESSNSSRILMRSCVCVGSDSNCLQDTQAQWFQLVVT 536

Query: 481 NTL 482
           NTL
Sbjct: 537 NTL 539

BLAST of CsaV3_3G017720 vs. TrEMBL
Match: tr|A0A0A0L644|A0A0A0L644_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G186670 PE=4 SV=1)

HSP 1 Score: 666.4 bits (1718), Expect = 4.9e-188
Identity = 344/448 (76.79%), Postives = 345/448 (77.01%), Query Frame = 0

Query: 34  MLTRYANRTIEENFDLLDLKQAKAGLAQYNPFVLNKTVAEAYEAVVDVLGASGLMVIADN 93
           MLTRYANRTIEENFDLLDLKQAKAGLAQYNPFVLNKTVAEAYEAVVDVLGASGLMVIADN
Sbjct: 1   MLTRYANRTIEENFDLLDLKQAKAGLAQYNPFVLNKTVAEAYEAVVDVLGASGLMVIADN 60

Query: 94  HMSQPRWCCSLDDGNGFFGNNNFDPQEWLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMEN 153
           HMSQPRWCCSLDDGNGFFGNNNFDPQEWLQGLSLVAQRFRNKSTV               
Sbjct: 61  HMSQPRWCCSLDDGNGFFGNNNFDPQEWLQGLSLVAQRFRNKSTVY-------------- 120

Query: 154 ANDWNKYITQGVTTIHNINSEVLVIVSGLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYS 213
                                                                       
Sbjct: 121 ------------------------------------------------------------ 180

Query: 214 FSGDSESKFVKQPLNNICANIMNGFIDHVGFVMQGPNPFPLFVSEYGYDQREVNDAENRF 273
               SESKFVKQPLNNICANIMNGFIDH GFVMQGPNPFPLFV+                
Sbjct: 181 ----SESKFVKQPLNNICANIMNGFIDHAGFVMQGPNPFPLFVT---------------- 240

Query: 274 MSCFTAHLTQRDLDWALWAWQGSYYFREGQAEPGESFGVLDSNWTQIKNPNFVRKFQLLQ 333
                 HL QRDLDWALWAWQGSYYFREGQAEPGESFGVLDSNWTQIKNPNFVRKFQLLQ
Sbjct: 241 ------HLAQRDLDWALWAWQGSYYFREGQAEPGESFGVLDSNWTQIKNPNFVRKFQLLQ 300

Query: 334 TMLQDPNSNASFSYVIYHPQSSQCIQVSNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTG 393
           TMLQDPNSNASFSYVIYHPQSSQCIQVSNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTG
Sbjct: 301 TMLQDPNSNASFSYVIYHPQSSQCIQVSNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTG 348

Query: 394 LYLKASGKGLEASLSSDTLSQQSVWSAISNSKLHLATFTQGGKSLCLQIDSSNSSKVVTN 453
           LYLKASGKGLEASLSSDTLSQQSVWSAISNSKLHLATFTQGGKSLCLQIDSSNSSKVVTN
Sbjct: 361 LYLKASGKGLEASLSSDTLSQQSVWSAISNSKLHLATFTQGGKSLCLQIDSSNSSKVVTN 348

Query: 454 SCICTNGDPNCLQDTRSQWFELVGTNTL 482
           SCICTNGDPNCLQDTRSQWFELVGTNTL
Sbjct: 421 SCICTNGDPNCLQDTRSQWFELVGTNTL 348

BLAST of CsaV3_3G017720 vs. TrEMBL
Match: tr|A0A1S3CT43|A0A1S3CT43_CUCME (endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103504654 PE=3 SV=1)

HSP 1 Score: 531.2 bits (1367), Expect = 2.5e-147
Identity = 256/480 (53.33%), Postives = 345/480 (71.88%), Query Frame = 0

Query: 1   MLAEGLNHRPLKDLADEAIKLRFNCVRLTYATHMLTRYANRTIEENFDLLDLKQAKAGLA 60
           ML EGL+ RPL D+A    KLRFNCVRLTY+ HM TR+AN T++++F+  D+K A AG+A
Sbjct: 59  MLVEGLHRRPLDDIAALVAKLRFNCVRLTYSIHMFTRHANLTVKQSFENFDMKDAMAGIA 118

Query: 61  QYNPFVLNKTVAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNNNFDPQE 120
           Q NP +LN T+ EAY AVVD L A G+MV++DNH+SQPRWCC  +DGNGFFG+  FDPQE
Sbjct: 119 QNNPSILNLTLVEAYGAVVDSLVAHGIMVVSDNHISQPRWCCDNNDGNGFFGDRYFDPQE 178

Query: 121 WLQGLSLVAQRFRNKSTVVGMSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLVIVS 180
           WLQG+SL AQ  ++K+ VV MSLRNE+RG  +N   W +Y++QG   IH IN   LV+VS
Sbjct: 179 WLQGISLAAQSLKSKAQVVAMSLRNELRGPNQNVEKWFQYMSQGAKLIHQINPNALVVVS 238

Query: 181 GLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNGFID 240
           GL+YD DL  LK + +  + LDNKLVFE HLYSF+ +    ++ +PLN  CA+I  GF D
Sbjct: 239 GLSYDTDLSFLKNRSMGFN-LDNKLVFEAHLYSFTNNMRDFWMSKPLNTFCASINQGFED 298

Query: 241 HVGFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLTQRDLDWALWAWQGSYYFR 300
             GF+++G NP PLFVSE+G DQ   N+ +NRF+SCF ++LT+ D DW LWA QGSYY++
Sbjct: 299 RAGFLVRGQNPIPLFVSEFGIDQTGTNEGQNRFLSCFFSYLTENDFDWGLWALQGSYYYK 358

Query: 301 EGQAEPGESFGVLDSNWTQIKNPN-FVRKFQLLQTMLQDPNSNASFSYVIYHPQSSQCIQ 360
            G     E+FGVLDSN+T+ KN   F+++FQL+QT LQDP+SN + ++++YHP S  C++
Sbjct: 359 VGVKNAEENFGVLDSNFTKAKNSKLFLQRFQLMQTKLQDPSSNFTTTFIMYHPLSGGCVR 418

Query: 361 VSNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGKGLEASLSSDTLSQQSVWS 420
           + N   ++ +++C T  RWSH  DG PI+++ + L LKA G GL   LS D  SQQS+W 
Sbjct: 419 M-NKKYQLGISSCKTSNRWSHEQDGAPIKLAGSILCLKAIGVGLPPILSQDCSSQQSIWR 478

Query: 421 AISNSKLHLATFTQGGKSLCLQIDSSNSSKVVTNSCICTNGDPNCLQDTRSQWFELVGTN 480
             SN+KL LAT  + G++LCLQ  +S+S ++VTN C+CT  D  C +D +SQWF LV +N
Sbjct: 479 YASNAKLQLATVDEQGQALCLQ-RASHSHQIVTNKCLCTI-DSQCQEDPQSQWFTLVPSN 534

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_011658389.14.4e-24986.51PREDICTED: uncharacterized protein LOC101207450 [Cucumis sativus] >KGN45940.1 hy... [more]
XP_022932816.12.4e-24785.45uncharacterized protein LOC111439277 [Cucurbita moschata][more]
XP_022933313.12.4e-24785.45uncharacterized protein LOC111440529 [Cucurbita moschata][more]
XP_022995752.13.2e-24785.65uncharacterized protein LOC111491191 [Cucurbita maxima][more]
XP_022958333.14.6e-24685.45uncharacterized protein LOC111459581 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
AT1G13130.11.4e-10039.29Cellulase (glycosyl hydrolase family 5) protein[more]
AT3G26130.19.9e-9440.21Cellulase (glycosyl hydrolase family 5) protein[more]
AT3G26140.11.7e-9037.06Cellulase (glycosyl hydrolase family 5) protein[more]
AT5G17500.11.4e-8736.27Glycosyl hydrolase superfamily protein[more]
AT5G16700.11.9e-6833.89Glycosyl hydrolase superfamily protein[more]
Match NameE-valueIdentityDescription
sp|C0HLA0|GH5FP_CHAOB8.6e-9538.74Glycosyl hydrolase 5 family protein OS=Chamaecyparis obtusa OX=13415 PE=1 SV=1[more]
sp|P54583|GUN1_ACIC11.1e-1222.78Endoglucanase E1 OS=Acidothermus cellulolyticus (strain ATCC 43068 / 11B) OX=351... [more]
sp|P19487|GUNA_XANCP1.1e-1223.45Major extracellular endoglucanase OS=Xanthomonas campestris pv. campestris (stra... [more]
sp|P23548|GUN_PAEPO3.8e-1019.76Endoglucanase OS=Paenibacillus polymyxa OX=1406 PE=3 SV=2[more]
Match NameE-valueIdentityDescription
tr|A0A0A0K853|A0A0A0K853_CUCSA2.9e-24986.51Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G028440 PE=3 SV=1[more]
tr|A0A1S3BDI2|A0A1S3BDI2_CUCME7.2e-23286.83major extracellular endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103488703 P... [more]
tr|A0A1S3CTF8|A0A1S3CTF8_CUCME4.6e-19465.84major extracellular endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103504686 P... [more]
tr|A0A0A0L644|A0A0A0L644_CUCSA4.9e-18876.79Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G186670 PE=4 SV=1[more]
tr|A0A1S3CT43|A0A1S3CT43_CUCME2.5e-14753.33endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103504654 PE=3 SV=1[more]
The following terms have been associated with this gene:
Vocabulary: Biological Process
TermDefinition
GO:0005975carbohydrate metabolic process
Vocabulary: Molecular Function
TermDefinition
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
Vocabulary: INTERPRO
TermDefinition
IPR017853Glycoside_hydrolase_SF
IPR035992Ricin_B-like_lectins
IPR001547Glyco_hydro_5
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_3G017720.1CsaV3_3G017720.1mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3DG3DSA:3.20.20.80coord: 1..400
e-value: 9.5E-62
score: 211.1
NoneNo IPR availablePANTHERPTHR31263FAMILY NOT NAMEDcoord: 2..477
NoneNo IPR availablePANTHERPTHR31263:SF10SUBFAMILY NOT NAMEDcoord: 2..477
IPR001547Glycoside hydrolase, family 5PFAMPF00150Cellulasecoord: 10..294
e-value: 4.9E-24
score: 85.1
IPR035992Ricin B-like lectinsSUPERFAMILYSSF50370Ricin B-like lectinscoord: 343..456
IPR017853Glycoside hydrolase superfamilySUPERFAMILYSSF51445(Trans)glycosidasescoord: 4..330

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
CsaV3_3G017720CSPI03G17560Wild cucumber (PI 183967)cpicucB144
CsaV3_3G017720CsGy3G017590Cucumber (Gy14) v2cgybcucB117
The following gene(s) are paralogous to this gene:

None