Cla97C02G027410 (gene) Watermelon (97103) v2

NameCla97C02G027410
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionGlycosyl hydrolase family 5 protein
LocationCla97Chr02 : 1123460 .. 1125323 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGAAGAAATTTACAAACCATTCTCTTTTTGGCAATAGTCTTTCAGTTTTTATCTTTCTCGGCTTATTCCTTGCCTTTGTCAACAAAAGGAAGATGGATCGTTGATTCGAAGACTGGGCATCGTGTCAAACTTGTTTGTGTAAATTGGCCTTCTCACACTCAAAGCATGCTGGCAGAAGGCCTCAACCATCGACCATTGAAAGAACTTGCTGATGAGGCAATCAAGTTGAGGTTCAATTGTGTGCGTCTCACATATGCAACTCACATGTTCACTCGCTATGCTAATAGGACAATTGAAGAGAACTTTGACCTTCTTGATTTAAAGCAAGCCAAAGCTGGATTGGCTCAACATAACCCTTTTGTACTGAATAAGACCATTGTTGAAGCCTATGAAGCTGTTGTTGATGTGCTTGGGGCAAGTGGGTTAATGGTTATTGCTGACAACCACATTAGTCAACCAAGATGGTGTTGCTCTCTTGATGATGGTAATGGCTTCTTTGGAAATCGAAATTTTGATCCTCAAGAATGGTTACAAGGTCTTCGCTTAGTCGCTCAACGTTTCATAAACAAGTCAACGGTATGTACTGTTTAAAAGAGTTTTTTGATATTCTAAATGAAACCAATATCAAACATCTGAAAATTAGAAAATAAATCATTTCATTAACAAGTTTTTTCATTATGGCATATAGGTGGTAGCAATGAGCCTACGAAATGAGATACGAGGAATGATGGAAAATGCAAATGATTGGAACAACTATGTAACTCAAGGAGTAACAACAATCCACAACATAAATCCAGAAGTTGTAGTGATTGTTTCGGGGCTAAATTATGATAATGATCTTCGATGCTTAAAAGAAAAGCCTTTGAACGTTAACACCTTAGACAATAAGCTAGTTTTTGAAGTCCACTTATATTCTTTTAGTGGAGATTCTGAGAGCAAATTCGTAACACAACCATTAAATGATATTTGTGCAACTATCATGAATGGGTTTATAGATCATGCTGAGTTTGTAATTGAAGGATCAAATCCATTTCCTTTGTTTGTTAGCGAATATGGGTATGATCAAAGGGAGGTTGATGAAGCTGAAAATCGGTTCATGAGTTGCTTTACCGCTCATCTTGTTCAGAAAGACTTGGATTGGGCATTATGGACTTGGCAAGGTAGTTATTATTATAGGGAAGGTCAAGCAGAGCCTGCAGAAACATTTGGTGTTCTCAACTCTAATTGGACTCAGATTAAGAACCCTAACTTTCCTAAAAGGTTCCAACTATTGCAGACAATGTTGCAGGGTAAGTAAATTAACAAATCTAAGTAGTTTTAAGCATTTGTAAAGAGGAAAAATTGTTATAAAAAATTATAAGCAGCCCACACAATTAAACCAGTTTATGGGATCAAAATATAATGTGTAACCTTTTATTGATTTGCAGATCCAAATTCCAATGCTTCTAACTCGTATGTTATGTATCATCCACAAAGTGGGCAATGTGCCCAAGTGTCAAATGACAAGACAGGAATATTTTTGAACAATTGCTCTACCTCAAGCCGTTGGAGTCATAATGGTGATGGTACCCCAATTAGGTTGACATCCACTGGTTTGTGTTTGAAGTCCAATGGAGAAGGCCTTGGGGCATCCCTTTCAAATGATTGTTTGAGTCAACAGAGCGCTTGGAGAGCCATTTCTAACTCTAAGCTTCACCTTGCCACCTTCACTCAAGATGGAAACAACCTTTGTTTACAAATTGAAAACTCCAACTCCTCAAAGATTGTGGTCAACTCCTGTATTTGCACCAATGGCGACCCAAAATGCCTTGAAGACACCCAAAGCCAATGGCTCACACTCGTTGCAACCAATACCTTGTAA

mRNA sequence

ATGGGAAGAAATTTACAAACCATTCTCTTTTTGGCAATAGTCTTTCAGTTTTTATCTTTCTCGGCTTATTCCTTGCCTTTGTCAACAAAAGGAAGATGGATCGTTGATTCGAAGACTGGGCATCGTGTCAAACTTGTTTGTGTAAATTGGCCTTCTCACACTCAAAGCATGCTGGCAGAAGGCCTCAACCATCGACCATTGAAAGAACTTGCTGATGAGGCAATCAAGTTGAGGTTCAATTGTGTGCGTCTCACATATGCAACTCACATGTTCACTCGCTATGCTAATAGGACAATTGAAGAGAACTTTGACCTTCTTGATTTAAAGCAAGCCAAAGCTGGATTGGCTCAACATAACCCTTTTGTACTGAATAAGACCATTGTTGAAGCCTATGAAGCTGTTGTTGATGTGCTTGGGGCAAGTGGGTTAATGGTTATTGCTGACAACCACATTAGTCAACCAAGATGGTGTTGCTCTCTTGATGATGGTAATGGCTTCTTTGGAAATCGAAATTTTGATCCTCAAGAATGGTTACAAGGTCTTCGCTTAGTCGCTCAACGTTTCATAAACAAGTCAACGGTGGTAGCAATGAGCCTACGAAATGAGATACGAGGAATGATGGAAAATGCAAATGATTGGAACAACTATGTAACTCAAGGAGTAACAACAATCCACAACATAAATCCAGAAGTTGTAGTGATTGTTTCGGGGCTAAATTATGATAATGATCTTCGATGCTTAAAAGAAAAGCCTTTGAACGTTAACACCTTAGACAATAAGCTAGTTTTTGAAGTCCACTTATATTCTTTTAGTGGAGATTCTGAGAGCAAATTCGTAACACAACCATTAAATGATATTTGTGCAACTATCATGAATGGGTTTATAGATCATGCTGAGTTTGTAATTGAAGGATCAAATCCATTTCCTTTGTTTGTTAGCGAATATGGGTATGATCAAAGGGAGGTTGATGAAGCTGAAAATCGGTTCATGAGTTGCTTTACCGCTCATCTTGTTCAGAAAGACTTGGATTGGGCATTATGGACTTGGCAAGGTAGTTATTATTATAGGGAAGGTCAAGCAGAGCCTGCAGAAACATTTGGTGTTCTCAACTCTAATTGGACTCAGATTAAGAACCCTAACTTTCCTAAAAGGTTCCAACTATTGCAGACAATGTTGCAGGATCCAAATTCCAATGCTTCTAACTCGTATGTTATGTATCATCCACAAAGTGGGCAATGTGCCCAAGTGTCAAATGACAAGACAGGAATATTTTTGAACAATTGCTCTACCTCAAGCCGTTGGAGTCATAATGGTGATGGTACCCCAATTAGGTTGACATCCACTGGTTTGTGTTTGAAGTCCAATGGAGAAGGCCTTGGGGCATCCCTTTCAAATGATTGTTTGAGTCAACAGAGCGCTTGGAGAGCCATTTCTAACTCTAAGCTTCACCTTGCCACCTTCACTCAAGATGGAAACAACCTTTGTTTACAAATTGAAAACTCCAACTCCTCAAAGATTGTGGTCAACTCCTGTATTTGCACCAATGGCGACCCAAAATGCCTTGAAGACACCCAAAGCCAATGGCTCACACTCGTTGCAACCAATACCTTGTAA

Coding sequence (CDS)

ATGGGAAGAAATTTACAAACCATTCTCTTTTTGGCAATAGTCTTTCAGTTTTTATCTTTCTCGGCTTATTCCTTGCCTTTGTCAACAAAAGGAAGATGGATCGTTGATTCGAAGACTGGGCATCGTGTCAAACTTGTTTGTGTAAATTGGCCTTCTCACACTCAAAGCATGCTGGCAGAAGGCCTCAACCATCGACCATTGAAAGAACTTGCTGATGAGGCAATCAAGTTGAGGTTCAATTGTGTGCGTCTCACATATGCAACTCACATGTTCACTCGCTATGCTAATAGGACAATTGAAGAGAACTTTGACCTTCTTGATTTAAAGCAAGCCAAAGCTGGATTGGCTCAACATAACCCTTTTGTACTGAATAAGACCATTGTTGAAGCCTATGAAGCTGTTGTTGATGTGCTTGGGGCAAGTGGGTTAATGGTTATTGCTGACAACCACATTAGTCAACCAAGATGGTGTTGCTCTCTTGATGATGGTAATGGCTTCTTTGGAAATCGAAATTTTGATCCTCAAGAATGGTTACAAGGTCTTCGCTTAGTCGCTCAACGTTTCATAAACAAGTCAACGGTGGTAGCAATGAGCCTACGAAATGAGATACGAGGAATGATGGAAAATGCAAATGATTGGAACAACTATGTAACTCAAGGAGTAACAACAATCCACAACATAAATCCAGAAGTTGTAGTGATTGTTTCGGGGCTAAATTATGATAATGATCTTCGATGCTTAAAAGAAAAGCCTTTGAACGTTAACACCTTAGACAATAAGCTAGTTTTTGAAGTCCACTTATATTCTTTTAGTGGAGATTCTGAGAGCAAATTCGTAACACAACCATTAAATGATATTTGTGCAACTATCATGAATGGGTTTATAGATCATGCTGAGTTTGTAATTGAAGGATCAAATCCATTTCCTTTGTTTGTTAGCGAATATGGGTATGATCAAAGGGAGGTTGATGAAGCTGAAAATCGGTTCATGAGTTGCTTTACCGCTCATCTTGTTCAGAAAGACTTGGATTGGGCATTATGGACTTGGCAAGGTAGTTATTATTATAGGGAAGGTCAAGCAGAGCCTGCAGAAACATTTGGTGTTCTCAACTCTAATTGGACTCAGATTAAGAACCCTAACTTTCCTAAAAGGTTCCAACTATTGCAGACAATGTTGCAGGATCCAAATTCCAATGCTTCTAACTCGTATGTTATGTATCATCCACAAAGTGGGCAATGTGCCCAAGTGTCAAATGACAAGACAGGAATATTTTTGAACAATTGCTCTACCTCAAGCCGTTGGAGTCATAATGGTGATGGTACCCCAATTAGGTTGACATCCACTGGTTTGTGTTTGAAGTCCAATGGAGAAGGCCTTGGGGCATCCCTTTCAAATGATTGTTTGAGTCAACAGAGCGCTTGGAGAGCCATTTCTAACTCTAAGCTTCACCTTGCCACCTTCACTCAAGATGGAAACAACCTTTGTTTACAAATTGAAAACTCCAACTCCTCAAAGATTGTGGTCAACTCCTGTATTTGCACCAATGGCGACCCAAAATGCCTTGAAGACACCCAAAGCCAATGGCTCACACTCGTTGCAACCAATACCTTGTAA

Protein sequence

MGRNLQTILFLAIVFQFLSFSAYSLPLSTKGRWIVDSKTGHRVKLVCVNWPSHTQSMLAEGLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLAQHNPFVLNKTIVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQGLRLVAQRFINKSTVVAMSLRNEIRGMMENANDWNNYVTQGVTTIHNINPEVVVIVSGLNYDNDLRCLKEKPLNVNTLDNKLVFEVHLYSFSGDSESKFVTQPLNDICATIMNGFIDHAEFVIEGSNPFPLFVSEYGYDQREVDEAENRFMSCFTAHLVQKDLDWALWTWQGSYYYREGQAEPAETFGVLNSNWTQIKNPNFPKRFQLLQTMLQDPNSNASNSYVMYHPQSGQCAQVSNDKTGIFLNNCSTSSRWSHNGDGTPIRLTSTGLCLKSNGEGLGASLSNDCLSQQSAWRAISNSKLHLATFTQDGNNLCLQIENSNSSKIVVNSCICTNGDPKCLEDTQSQWLTLVATNTL
BLAST of Cla97C02G027410 vs. NCBI nr
Match: XP_011658389.1 (PREDICTED: uncharacterized protein LOC101207450 [Cucumis sativus] >KGN45940.1 hypothetical protein Csa_6G028440 [Cucumis sativus])

HSP 1 Score: 939.9 bits (2428), Expect = 3.9e-270
Identity = 454/538 (84.39%), Postives = 487/538 (90.52%), Query Frame = 0

Query: 1   MGRNLQTILFLAIVFQFLSFSAYSLPLSTKGRWIVDSKTGHRVKLVCVNWPSHTQSMLAE 60
           M R +Q IL LA+V  F SFSAYSLPLST GRWI+DS++G RVKLVCVNWPSHTQSML E
Sbjct: 1   MERTIQVILLLALVSVFSSFSAYSLPLSTHGRWIIDSQSGKRVKLVCVNWPSHTQSMLIE 60

Query: 61  GLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLAQHNP 120
           GLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRT+EENFDLLDL+QAKAGLAQ+NP
Sbjct: 61  GLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLEQAKAGLAQYNP 120

Query: 121 FVLNKTIVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQG 180
           FVLNKTI EAYEAVVDVLGASGLMVIADNH+SQPRWCCSLDDGNGFFGNR FDPQEWLQG
Sbjct: 121 FVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNRYFDPQEWLQG 180

Query: 181 LRLVAQRFINKSTVVAMSLRNEIRGMMENANDWNNYVTQGVTTIHNINPEVVVIVSGLNY 240
           L LVAQRF NKSTVV MSLRNE+RGMMENANDWNNYVTQGVTTIH INP V+VIVSGLNY
Sbjct: 181 LSLVAQRFNNKSTVVGMSLRNELRGMMENANDWNNYVTQGVTTIHKINPAVLVIVSGLNY 240

Query: 241 DNDLRCLKEKPLNVNTLDNKLVFEVHLYSFSGDSESKFVTQPLNDICATIMNGFIDHAEF 300
           DNDLRCLK+KPLNV+TLDNKL FEVHLYSFSGDSESKFV QPLN+ICA IM+ FIDHAEF
Sbjct: 241 DNDLRCLKDKPLNVSTLDNKLAFEVHLYSFSGDSESKFVQQPLNNICAKIMHEFIDHAEF 300

Query: 301 VIEGSNPFPLFVSEYGYDQREVDEAENRFMSCFTAHLVQKDLDWALWTWQGSYYYREGQA 360
           VIEG NPFPLFVSEYGYDQREVD+AENRFMSCFTAHL QKDLDWALWTWQGSYYYREGQA
Sbjct: 301 VIEGPNPFPLFVSEYGYDQREVDDAENRFMSCFTAHLAQKDLDWALWTWQGSYYYREGQA 360

Query: 361 EPAETFGVLNSNWTQIKNPNFPKRFQLLQTMLQDPNSNASNSYVMYHPQSGQCAQVSNDK 420
           E AETFGVL+SNWTQIKNPNF ++FQLLQTMLQDP SNAS SYV+YH QSGQC +VSND 
Sbjct: 361 ELAETFGVLDSNWTQIKNPNFVQKFQLLQTMLQDPYSNASFSYVIYHVQSGQCIEVSNDN 420

Query: 421 TGIFLNNCSTSSRWSHNGDGTPIRLTSTGLCLKSNGEGLGASLSNDCLSQQSAWRAISNS 480
             IFL NCSTSSRWSH+ D TPI+++STGLCLK++GEGL ASLS DC+ +QS W AISNS
Sbjct: 421 KEIFLTNCSTSSRWSHDNDSTPIKMSSTGLCLKASGEGLEASLSTDCIGKQSLWSAISNS 480

Query: 481 KLHLATFTQDGNNLCLQ-IENSNSSKIVVNSCICTNGDPKCLEDTQSQWLTLVATNTL 538
            LHL T T+DG +LCLQ IE+SNSSKIV NSCICT  DP CL+DTQSQW  LVATNTL
Sbjct: 481 NLHLGTVTEDGKSLCLQIIESSNSSKIVTNSCICTTNDPTCLQDTQSQWFELVATNTL 538

BLAST of Cla97C02G027410 vs. NCBI nr
Match: XP_023533776.1 (uncharacterized protein LOC111795530 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 936.8 bits (2420), Expect = 3.3e-269
Identity = 451/537 (83.99%), Postives = 482/537 (89.76%), Query Frame = 0

Query: 1   MGRNLQTILFLAIVFQFLSFSAYSLPLSTKGRWIVDSKTGHRVKLVCVNWPSHTQSMLAE 60
           MGR  Q +LFLA+ F FLS S YSLPLSTKGRWI+DS TG RVKLVCVNWPSHTQSML E
Sbjct: 1   MGRTFQVVLFLAL-FVFLSPSVYSLPLSTKGRWIIDSTTGRRVKLVCVNWPSHTQSMLIE 60

Query: 61  GLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLAQHNP 120
           GLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRT+EENFDLLDL+ AKAGLAQ+NP
Sbjct: 61  GLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLEPAKAGLAQNNP 120

Query: 121 FVLNKTIVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQG 180
           FVLNKTI EAYEAVVDVLG SGLMVIADNHISQPRWCCSLDDGNGFFG+R FDPQEWLQG
Sbjct: 121 FVLNKTIAEAYEAVVDVLGESGLMVIADNHISQPRWCCSLDDGNGFFGDRYFDPQEWLQG 180

Query: 181 LRLVAQRFINKSTVVAMSLRNEIRGMMENANDWNNYVTQGVTTIHNINPEVVVIVSGLNY 240
           L LVAQRF  KSTVV MSLRNEIRG  ENANDWNNYVTQGVTTIHNINP V+VIVSGLNY
Sbjct: 181 LSLVAQRFSKKSTVVGMSLRNEIRGTNENANDWNNYVTQGVTTIHNINPNVLVIVSGLNY 240

Query: 241 DNDLRCLKEKPLNVNTLDNKLVFEVHLYSFSGDSESKFVTQPLNDICATIMNGFIDHAEF 300
           DNDLRCLKEKPLNV+TLDNKLVFEVHLYSFSGDSESKF+ QPLN+ICA I+NGF+DHAEF
Sbjct: 241 DNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDSESKFINQPLNNICANIINGFVDHAEF 300

Query: 301 VIEGSNPFPLFVSEYGYDQREVDEAENRFMSCFTAHLVQKDLDWALWTWQGSYYYREGQA 360
           V EGSNPFPLFVSEYGYDQREV++AENRFMSCFTAHL QKDLDWALWTWQGSYYYREGQA
Sbjct: 301 VTEGSNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQKDLDWALWTWQGSYYYREGQA 360

Query: 361 EPAETFGVLNSNWTQIKNPNFPKRFQLLQTMLQDPNSNASNSYVMYHPQSGQCAQVSNDK 420
           EP E FGVL+SNWTQIKNPNF ++FQLLQTMLQDPNSNAS SYV+YHPQSGQC QVSND 
Sbjct: 361 EPGEAFGVLDSNWTQIKNPNFVQKFQLLQTMLQDPNSNASFSYVIYHPQSGQCIQVSNDN 420

Query: 421 TGIFLNNCSTSSRWSHNGDGTPIRLTSTGLCLKSNGEGLGASLSNDCLSQQSAWRAISNS 480
             IFL NCS SSRW+H+ D TPIR++STGLCLK++GEGL  SLS DCL  Q++W AISN+
Sbjct: 421 KDIFLGNCSISSRWTHDNDSTPIRMSSTGLCLKTSGEGLMPSLSTDCLGPQNSWSAISNT 480

Query: 481 KLHLATFTQDGNNLCLQIENSNSSKIVVNSCICTNGDPKCLEDTQSQWLTLVATNTL 538
           KLHLAT   D  +LCLQIE+SNSSKIV NS ICTNG P CLEDT+SQW  LV TNTL
Sbjct: 481 KLHLATIAPDEKSLCLQIESSNSSKIVTNSFICTNGAPNCLEDTRSQWFELVKTNTL 536

BLAST of Cla97C02G027410 vs. NCBI nr
Match: XP_022995752.1 (uncharacterized protein LOC111491191 [Cucurbita maxima])

HSP 1 Score: 932.2 bits (2408), Expect = 8.1e-268
Identity = 447/537 (83.24%), Postives = 481/537 (89.57%), Query Frame = 0

Query: 1   MGRNLQTILFLAIVFQFLSFSAYSLPLSTKGRWIVDSKTGHRVKLVCVNWPSHTQSMLAE 60
           MGR  Q + FLA+ F FLS   YSLPLSTK +WI+DS TG RVKLVCVNWPSHTQSML E
Sbjct: 1   MGRIFQVVFFLAL-FVFLSPYVYSLPLSTKEKWIIDSTTGRRVKLVCVNWPSHTQSMLIE 60

Query: 61  GLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLAQHNP 120
           GLNHRPLKELADEAIKLRFNCVRLTYAT MFTRYANRT+EENFDLLDL+QAKAGLAQ+NP
Sbjct: 61  GLNHRPLKELADEAIKLRFNCVRLTYATQMFTRYANRTVEENFDLLDLEQAKAGLAQNNP 120

Query: 121 FVLNKTIVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQG 180
           FVLNKTI EAYEAVVDVLG SGLMVIADNHISQPRWCCSLDDGNGFFG+R FDPQEWLQG
Sbjct: 121 FVLNKTIAEAYEAVVDVLGESGLMVIADNHISQPRWCCSLDDGNGFFGDRYFDPQEWLQG 180

Query: 181 LRLVAQRFINKSTVVAMSLRNEIRGMMENANDWNNYVTQGVTTIHNINPEVVVIVSGLNY 240
           L LVAQRF  KSTVV MSLRNEIRG  ENANDWNNYVTQGVTTIHNINP V+VIVSGLNY
Sbjct: 181 LSLVAQRFSKKSTVVGMSLRNEIRGTNENANDWNNYVTQGVTTIHNINPNVLVIVSGLNY 240

Query: 241 DNDLRCLKEKPLNVNTLDNKLVFEVHLYSFSGDSESKFVTQPLNDICATIMNGFIDHAEF 300
           DNDLRCLKEKPLNV+TLDNKLVFEVHLYSFSGD ESKF+ QPLN+ICA I+NGF+DHAEF
Sbjct: 241 DNDLRCLKEKPLNVSTLDNKLVFEVHLYSFSGDPESKFINQPLNNICANIINGFVDHAEF 300

Query: 301 VIEGSNPFPLFVSEYGYDQREVDEAENRFMSCFTAHLVQKDLDWALWTWQGSYYYREGQA 360
           V EGSNPFPLFVSEYGYDQREV++AENRFMSCFTAHL QKDLDWALWTWQGSYYYREGQA
Sbjct: 301 VTEGSNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQKDLDWALWTWQGSYYYREGQA 360

Query: 361 EPAETFGVLNSNWTQIKNPNFPKRFQLLQTMLQDPNSNASNSYVMYHPQSGQCAQVSNDK 420
           EP ETFGVL+SNWTQIKNPNF ++FQLLQTMLQDPNSNAS SYV+YHPQSGQC QVSND 
Sbjct: 361 EPGETFGVLDSNWTQIKNPNFVQKFQLLQTMLQDPNSNASFSYVIYHPQSGQCIQVSNDN 420

Query: 421 TGIFLNNCSTSSRWSHNGDGTPIRLTSTGLCLKSNGEGLGASLSNDCLSQQSAWRAISNS 480
             IF+ NCS SSRW+H+ D TPIR++S GLCLK+ GEGL  SLS DCL  QS+W AISN+
Sbjct: 421 KDIFMGNCSISSRWTHDNDSTPIRMSSMGLCLKTIGEGLTPSLSTDCLGPQSSWSAISNT 480

Query: 481 KLHLATFTQDGNNLCLQIENSNSSKIVVNSCICTNGDPKCLEDTQSQWLTLVATNTL 538
           KLHLAT +QDG +LCLQ+E+SNSSKIV NSCICTNG P CL+DT+SQW  LV TNTL
Sbjct: 481 KLHLATISQDGKSLCLQVESSNSSKIVTNSCICTNGAPNCLQDTRSQWFELVKTNTL 536

BLAST of Cla97C02G027410 vs. NCBI nr
Match: XP_022975072.1 (uncharacterized protein LOC111474044 [Cucurbita maxima])

HSP 1 Score: 918.3 bits (2372), Expect = 1.2e-263
Identity = 439/536 (81.90%), Postives = 474/536 (88.43%), Query Frame = 0

Query: 2   GRNLQTILFLAIVFQFLSFSAYSLPLSTKGRWIVDSKTGHRVKLVCVNWPSHTQSMLAEG 61
           G +L   L    +F  L  S YSLPLST GRWIVDS TG RVKLVCVNWPSHTQSML EG
Sbjct: 17  GLSLMEDLKSGALFVLLPPSVYSLPLSTNGRWIVDSTTGRRVKLVCVNWPSHTQSMLIEG 76

Query: 62  LNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLAQHNPF 121
           LNHRPLKELADEAIKLRFNCVRLTYAT MFTRYANRT+EENFDLLDL+QAKAGLAQ+NPF
Sbjct: 77  LNHRPLKELADEAIKLRFNCVRLTYATQMFTRYANRTVEENFDLLDLEQAKAGLAQYNPF 136

Query: 122 VLNKTIVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQGL 181
           VLNKTI EAYEAVVDVLG SGLMVIADNH+SQPRWCCSLDDGNGFFG+R FDPQEWLQGL
Sbjct: 137 VLNKTIAEAYEAVVDVLGESGLMVIADNHMSQPRWCCSLDDGNGFFGDRYFDPQEWLQGL 196

Query: 182 RLVAQRFINKSTVVAMSLRNEIRGMMENANDWNNYVTQGVTTIHNINPEVVVIVSGLNYD 241
            LVAQRF  K TVV MSLRNEIRG  ENANDWNNYVTQGVTTIHNINP V+VIVSGLN+D
Sbjct: 197 SLVAQRFSKKPTVVGMSLRNEIRGTNENANDWNNYVTQGVTTIHNINPNVLVIVSGLNFD 256

Query: 242 NDLRCLKEKPLNVNTLDNKLVFEVHLYSFSGDSESKFVTQPLNDICATIMNGFIDHAEFV 301
           NDLRCLKEKPLN + LDNKLVFEVHLYSFSGD+ESKF+ QPLN+ICA I+NGF+DHAEFV
Sbjct: 257 NDLRCLKEKPLNASALDNKLVFEVHLYSFSGDAESKFINQPLNNICADIINGFVDHAEFV 316

Query: 302 IEGSNPFPLFVSEYGYDQREVDEAENRFMSCFTAHLVQKDLDWALWTWQGSYYYREGQAE 361
            EG NPFPLFVSEYGYDQREV++AENRFMSCFTAHL Q+DLDWALWTWQGSYYYREGQA 
Sbjct: 317 REGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQEDLDWALWTWQGSYYYREGQAG 376

Query: 362 PAETFGVLNSNWTQIKNPNFPKRFQLLQTMLQDPNSNASNSYVMYHPQSGQCAQVSNDKT 421
           PAETFGVL+SNWTQIKNPNF ++FQLLQTMLQDPNSNAS SYV+YHPQSGQC QVSND  
Sbjct: 377 PAETFGVLDSNWTQIKNPNFVQKFQLLQTMLQDPNSNASFSYVIYHPQSGQCIQVSNDNK 436

Query: 422 GIFLNNCSTSSRWSHNGDGTPIRLTSTGLCLKSNGEGLGASLSNDCLSQQSAWRAISNSK 481
            +F+ NCS S RW+H+ D TPIR++STGLCLK++GEGL  SLS DC   QS+WRAISN+K
Sbjct: 437 DMFMGNCSNSGRWTHDNDSTPIRMSSTGLCLKTSGEGLMPSLSTDCFGPQSSWRAISNTK 496

Query: 482 LHLATFTQDGNNLCLQIENSNSSKIVVNSCICTNGDPKCLEDTQSQWLTLVATNTL 538
           LHLAT TQDG +LCLQ+ENSNSSKIV NSCICT+G P CLEDTQSQW  LV TNTL
Sbjct: 497 LHLATITQDGKSLCLQVENSNSSKIVTNSCICTDGAPTCLEDTQSQWFELVETNTL 552

BLAST of Cla97C02G027410 vs. NCBI nr
Match: XP_022958844.1 (uncharacterized protein LOC111459997 [Cucurbita moschata])

HSP 1 Score: 916.4 bits (2367), Expect = 4.6e-263
Identity = 431/535 (80.56%), Postives = 476/535 (88.97%), Query Frame = 0

Query: 1   MGRNLQTILFLAIVFQFLSFSAYSLPLSTKGRWIVDSKTGHRVKLVCVNWPSHTQSMLAE 60
           M R LQ I+ LA+VF F S S  SLPLST+GRWI+DSKTG RVKLVCVNWPSHTQSML E
Sbjct: 1   MRRTLQAIVCLALVFIFSSLSTCSLPLSTRGRWIIDSKTGRRVKLVCVNWPSHTQSMLIE 60

Query: 61  GLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLAQHNP 120
           GLNHRPLKELADEAIKL+FNCVRLTYATHMFTRYANRTIEENFDLLDLK AKAGL Q+NP
Sbjct: 61  GLNHRPLKELADEAIKLKFNCVRLTYATHMFTRYANRTIEENFDLLDLKPAKAGLVQYNP 120

Query: 121 FVLNKTIVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQG 180
           FVLNKTI EA+EAVVDVLG SGLMVIADNHISQPRWCCSLDDGNGFFG+R FDPQEWLQG
Sbjct: 121 FVLNKTIAEAFEAVVDVLGESGLMVIADNHISQPRWCCSLDDGNGFFGDRYFDPQEWLQG 180

Query: 181 LRLVAQRFINKSTVVAMSLRNEIRGMMENANDWNNYVTQGVTTIHNINPEVVVIVSGLNY 240
           L+LVA RFINK  ++AMSLRNEIRG  ENANDWNNY+TQGVT +HN NP+++VIVSGLN+
Sbjct: 181 LQLVAWRFINKPAMIAMSLRNEIRGTRENANDWNNYITQGVTIVHNTNPDILVIVSGLNF 240

Query: 241 DNDLRCLKEKPLNVNTLDNKLVFEVHLYSFSGDSESKFVTQPLNDICATIMNGFIDHAEF 300
           DNDLRCLKEKPLNV  LDNKLVFEVHLYS SGD ESKFV QPLN+ICA I+NGF+DHA F
Sbjct: 241 DNDLRCLKEKPLNVTNLDNKLVFEVHLYSLSGDPESKFVQQPLNNICANIINGFVDHAGF 300

Query: 301 VIEGSNPFPLFVSEYGYDQREVDEAENRFMSCFTAHLVQKDLDWALWTWQGSYYYREGQA 360
           V++G NPFPLFVSEYGYDQRE ++AENRFMSCFT+HL QKDLDWALWTWQGSYYYREGQA
Sbjct: 301 VMDGPNPFPLFVSEYGYDQRESNDAENRFMSCFTSHLAQKDLDWALWTWQGSYYYREGQA 360

Query: 361 EPAETFGVLNSNWTQIKNPNFPKRFQLLQTMLQDPNSNASNSYVMYHPQSGQCAQVSNDK 420
           EP E FGVLNSNWT+I+NPNF K+F LLQ+MLQDPNSNAS SY++YHPQSGQCAQ SN+ 
Sbjct: 361 EPTEVFGVLNSNWTKIQNPNFSKKFHLLQSMLQDPNSNASFSYILYHPQSGQCAQTSNND 420

Query: 421 TGIFLNNCSTSSRWSHNGDGTPIRLTSTGLCLKSNGEGLGASLSNDCLSQQSAWRAISNS 480
           T IFL++CSTSSRWSH   G  I++ +TGLCLK+NGEGLG SLS+DCLSQQS WR ISN+
Sbjct: 421 TQIFLSDCSTSSRWSHGDGGNSIKMATTGLCLKANGEGLGVSLSSDCLSQQSVWRTISNT 480

Query: 481 KLHLATFTQDGNNLCLQIENSNSSKIVVNSCICTNGDPKCLEDTQSQWLTLVATN 536
            LHLAT TQDG  LCLQ+E+S+SSKIV N CICTNG+P CL+DTQSQW  LVATN
Sbjct: 481 NLHLATVTQDGKYLCLQVESSSSSKIVTNPCICTNGEPNCLQDTQSQWFQLVATN 535

BLAST of Cla97C02G027410 vs. TrEMBL
Match: tr|A0A0A0K853|A0A0A0K853_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G028440 PE=3 SV=1)

HSP 1 Score: 939.9 bits (2428), Expect = 2.6e-270
Identity = 454/538 (84.39%), Postives = 487/538 (90.52%), Query Frame = 0

Query: 1   MGRNLQTILFLAIVFQFLSFSAYSLPLSTKGRWIVDSKTGHRVKLVCVNWPSHTQSMLAE 60
           M R +Q IL LA+V  F SFSAYSLPLST GRWI+DS++G RVKLVCVNWPSHTQSML E
Sbjct: 1   MERTIQVILLLALVSVFSSFSAYSLPLSTHGRWIIDSQSGKRVKLVCVNWPSHTQSMLIE 60

Query: 61  GLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLAQHNP 120
           GLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRT+EENFDLLDL+QAKAGLAQ+NP
Sbjct: 61  GLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLEQAKAGLAQYNP 120

Query: 121 FVLNKTIVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQG 180
           FVLNKTI EAYEAVVDVLGASGLMVIADNH+SQPRWCCSLDDGNGFFGNR FDPQEWLQG
Sbjct: 121 FVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNRYFDPQEWLQG 180

Query: 181 LRLVAQRFINKSTVVAMSLRNEIRGMMENANDWNNYVTQGVTTIHNINPEVVVIVSGLNY 240
           L LVAQRF NKSTVV MSLRNE+RGMMENANDWNNYVTQGVTTIH INP V+VIVSGLNY
Sbjct: 181 LSLVAQRFNNKSTVVGMSLRNELRGMMENANDWNNYVTQGVTTIHKINPAVLVIVSGLNY 240

Query: 241 DNDLRCLKEKPLNVNTLDNKLVFEVHLYSFSGDSESKFVTQPLNDICATIMNGFIDHAEF 300
           DNDLRCLK+KPLNV+TLDNKL FEVHLYSFSGDSESKFV QPLN+ICA IM+ FIDHAEF
Sbjct: 241 DNDLRCLKDKPLNVSTLDNKLAFEVHLYSFSGDSESKFVQQPLNNICAKIMHEFIDHAEF 300

Query: 301 VIEGSNPFPLFVSEYGYDQREVDEAENRFMSCFTAHLVQKDLDWALWTWQGSYYYREGQA 360
           VIEG NPFPLFVSEYGYDQREVD+AENRFMSCFTAHL QKDLDWALWTWQGSYYYREGQA
Sbjct: 301 VIEGPNPFPLFVSEYGYDQREVDDAENRFMSCFTAHLAQKDLDWALWTWQGSYYYREGQA 360

Query: 361 EPAETFGVLNSNWTQIKNPNFPKRFQLLQTMLQDPNSNASNSYVMYHPQSGQCAQVSNDK 420
           E AETFGVL+SNWTQIKNPNF ++FQLLQTMLQDP SNAS SYV+YH QSGQC +VSND 
Sbjct: 361 ELAETFGVLDSNWTQIKNPNFVQKFQLLQTMLQDPYSNASFSYVIYHVQSGQCIEVSNDN 420

Query: 421 TGIFLNNCSTSSRWSHNGDGTPIRLTSTGLCLKSNGEGLGASLSNDCLSQQSAWRAISNS 480
             IFL NCSTSSRWSH+ D TPI+++STGLCLK++GEGL ASLS DC+ +QS W AISNS
Sbjct: 421 KEIFLTNCSTSSRWSHDNDSTPIKMSSTGLCLKASGEGLEASLSTDCIGKQSLWSAISNS 480

Query: 481 KLHLATFTQDGNNLCLQ-IENSNSSKIVVNSCICTNGDPKCLEDTQSQWLTLVATNTL 538
            LHL T T+DG +LCLQ IE+SNSSKIV NSCICT  DP CL+DTQSQW  LVATNTL
Sbjct: 481 NLHLGTVTEDGKSLCLQIIESSNSSKIVTNSCICTTNDPTCLQDTQSQWFELVATNTL 538

BLAST of Cla97C02G027410 vs. TrEMBL
Match: tr|A0A1S3BDI2|A0A1S3BDI2_CUCME (major extracellular endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103488703 PE=3 SV=1)

HSP 1 Score: 802.7 bits (2072), Expect = 4.9e-229
Identity = 384/448 (85.71%), Postives = 410/448 (91.52%), Query Frame = 0

Query: 90  MFTRYANRTIEENFDLLDLKQAKAGLAQHNPFVLNKTIVEAYEAVVDVLGASGLMVIADN 149
           MFTRYANRT+EENFDLLDL QAKAGL Q+NPFVLNKTI EAYEAVVDVLGASGLMVIADN
Sbjct: 1   MFTRYANRTVEENFDLLDLGQAKAGLTQYNPFVLNKTIAEAYEAVVDVLGASGLMVIADN 60

Query: 150 HISQPRWCCSLDDGNGFFGNRNFDPQEWLQGLRLVAQRFINKSTVVAMSLRNEIRGMMEN 209
           H+SQPRWCCSLDDGNGFFGNR FDPQEWLQGL LVAQRF NKSTVV MSLRNEIRGMMEN
Sbjct: 61  HMSQPRWCCSLDDGNGFFGNRYFDPQEWLQGLSLVAQRFNNKSTVVGMSLRNEIRGMMEN 120

Query: 210 ANDWNNYVTQGVTTIHNINPEVVVIVSGLNYDNDLRCLKEKPLNVNTLDNKLVFEVHLYS 269
           ANDWN+YVTQGVTTIHNINPEV+VIV GLNYDNDLRCLKEKPLNV+TLDNKLVFEVHLYS
Sbjct: 121 ANDWNHYVTQGVTTIHNINPEVLVIVGGLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYS 180

Query: 270 FSGDSESKFVTQPLNDICATIMNGFIDHAEFVIEGSNPFPLFVSEYGYDQREVDEAENRF 329
           FSG SESKFV QPLN+ICA I+N FIDHAEFVIEGSNPFPLFVSEYGYDQREVD+AENRF
Sbjct: 181 FSGASESKFVQQPLNNICAKIINEFIDHAEFVIEGSNPFPLFVSEYGYDQREVDDAENRF 240

Query: 330 MSCFTAHLVQKDLDWALWTWQGSYYYREGQAEPAETFGVLNSNWTQIKNPNFPKRFQLLQ 389
           MSCFTAHL QKDLDWALWTWQGSYYYREGQAE  ETFGVL SNWTQIKNPNF ++FQLLQ
Sbjct: 241 MSCFTAHLAQKDLDWALWTWQGSYYYREGQAELPETFGVLESNWTQIKNPNFVQKFQLLQ 300

Query: 390 TMLQDPNSNASNSYVMYHPQSGQCAQVSNDKTGIFLNNCSTSSRWSHNGDGTPIRLTSTG 449
           TMLQDPNSNAS SYV+YHPQSGQC +VSND   IFL NCSTSSRWSH+ D TPI++++TG
Sbjct: 301 TMLQDPNSNASFSYVIYHPQSGQCIEVSNDNKDIFLTNCSTSSRWSHDNDSTPIKMSNTG 360

Query: 450 LCLKSNGEGLGASLSNDCLSQQSAWRAISNSKLHLATFTQDGNNLCLQIENSNSSKIVVN 509
           LCLK++GEGL ASLSNDCL +QS W AISNSKLHLAT T++G +LCLQIE+SNSSKIV N
Sbjct: 361 LCLKASGEGLAASLSNDCLGKQSVWSAISNSKLHLATVTENGKSLCLQIESSNSSKIVTN 420

Query: 510 SCICTNGDPKCLEDTQSQWLTLVATNTL 538
           SCICT  DP CL+DTQSQW  LV TNTL
Sbjct: 421 SCICTTDDPTCLQDTQSQWFELVETNTL 448

BLAST of Cla97C02G027410 vs. TrEMBL
Match: tr|A0A1S3CTF8|A0A1S3CTF8_CUCME (major extracellular endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103504686 PE=3 SV=1)

HSP 1 Score: 797.7 bits (2059), Expect = 1.6e-227
Identity = 375/539 (69.57%), Postives = 440/539 (81.63%), Query Frame = 0

Query: 1   MGRNLQTILFLAIVFQFLSFSAYSLPLSTKGRWIVDSKTGHRVKLVCVNWPSHTQSMLAE 60
           MG   Q    +     F S  +YSLPLST GRWIVDS TGHRVKLVCVNWPSHTQSML E
Sbjct: 1   MGITTQFSFVVLAFICFFSSLSYSLPLSTNGRWIVDSATGHRVKLVCVNWPSHTQSMLIE 60

Query: 61  GLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLAQHNP 120
           GL+ RPLK+LA+E ++L+FNCVRLTYATHMFTRYANRT+EENFDLLDL+ +K GLA HNP
Sbjct: 61  GLDRRPLKDLANEVMRLKFNCVRLTYATHMFTRYANRTVEENFDLLDLRASKVGLALHNP 120

Query: 121 FVLNKTIVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQG 180
           FVLN TI EAYEAVVDVLG SGLMVIADNHISQPRWCCSL+DGNGFFG+R FD +EWL+G
Sbjct: 121 FVLNMTIFEAYEAVVDVLGTSGLMVIADNHISQPRWCCSLEDGNGFFGDRYFDSEEWLEG 180

Query: 181 LRLVAQRFINKSTVVAMSLRNEIRGMMENANDWNNYVTQGVTTIHNINPEVVVIVSGLNY 240
           LRLVA+RF NKS VVAMSLRNE+RG    + DWN YVTQG TTIHNINP ++VI+SGLN+
Sbjct: 181 LRLVARRFYNKSAVVAMSLRNELRGASSKSKDWNKYVTQGATTIHNINPNILVIISGLNF 240

Query: 241 DNDLRCLKEKPLNVNTLDNKLVFEVHLYSFSGDSESKFVTQPLNDICATIMNGFIDHAEF 300
           DNDLRC ++ PL +N L NKLVFEVHLYSFSG+S+SKF+  PLN IC+ I+NGF+  AEF
Sbjct: 241 DNDLRCQRQYPLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICSKIINGFVQRAEF 300

Query: 301 VIEGSNPFPLFVSEYGYDQREVDEAENRFMSCFTAHLVQKDLDWALWTWQGSYYYREGQA 360
           V+EG+   PLFVSE+G DQ  V+EA++RF+SCF+AHLV+KDLDWALW WQGSYYYR+G+ 
Sbjct: 301 VMEGAEAVPLFVSEFGLDQTGVNEADDRFLSCFSAHLVEKDLDWALWGWQGSYYYRQGKV 360

Query: 361 EPAETFGVLNSNWTQIKNPNFPKRFQLLQTMLQDPNSNASNSYVMYHPQSGQCAQVSNDK 420
           E  E FGVLN NW+ ++NP F + FQLLQTMLQDPNSN+SN+Y+MYHPQSGQC QV + K
Sbjct: 361 ELEEVFGVLNYNWSDVRNPRFSQMFQLLQTMLQDPNSNSSNTYLMYHPQSGQCVQVHDMK 420

Query: 421 -TGIFLNNCSTSSRWSHNGDGTPIRLTSTGLCLKSNGEGLGASLSNDCLSQQSAWRAISN 480
              IFLNNCS +S WS+ GDGTPI L ST  CLK+NG GL  SLS DC  +QS W AIS+
Sbjct: 421 QKEIFLNNCSNASHWSYEGDGTPIMLASTNFCLKANGNGLPPSLSRDCFGEQSVWTAISD 480

Query: 481 SKLHLATFTQDGNN-LCLQIENSNSSKIVVNSCICTNGDPKCLEDTQSQWLTLVATNTL 538
           SKLHLAT T+ GNN +CL+ E+SNSS+I++ SC+C   D  CL+DTQ+QW  LV TNTL
Sbjct: 481 SKLHLATLTKQGNNGMCLEKESSNSSRILMRSCVCVGSDSNCLQDTQAQWFQLVVTNTL 539

BLAST of Cla97C02G027410 vs. TrEMBL
Match: tr|A0A0A0KL32|A0A0A0KL32_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G171770 PE=3 SV=1)

HSP 1 Score: 613.2 bits (1580), Expect = 5.5e-172
Identity = 296/526 (56.27%), Postives = 379/526 (72.05%), Query Frame = 0

Query: 11  LAIVFQFLSFSAYSLPLSTKGRWIVDSKTGHRVKLVCVNWPSHTQSMLAEGLNHRPLKEL 70
           L  VF  L+F AYSLPLST GRWIVD+ TG RVKL+CVNWP H Q MLAEGL+ RPL ++
Sbjct: 12  LVCVFVLLTFKAYSLPLSTNGRWIVDATTGQRVKLMCVNWPGHMQGMLAEGLHRRPLDDI 71

Query: 71  ADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLAQHNPFVLNKTIVEA 130
                KLRFNCVRLTY+ HMFTR+AN T++++F+  D+K A AG+AQ+NP ++N T+VEA
Sbjct: 72  ISLVAKLRFNCVRLTYSIHMFTRHANLTVQQSFENFDMKDAMAGIAQNNPSLVNLTLVEA 131

Query: 131 YEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQGLRLVAQRFIN 190
           Y AVVD L A G+MV++DNHISQPRWCC+ DDGNGFFG+R FDP+EWLQG+ L AQ   +
Sbjct: 132 YGAVVDSLAAHGVMVVSDNHISQPRWCCNNDDGNGFFGDRYFDPEEWLQGISLAAQSLKS 191

Query: 191 KSTVVAMSLRNEIRGMMENANDWNNYVTQGVTTIHNINPEVVVIVSGLNYDNDLRCLKEK 250
           K+ VVAMS+RNE RG  +N   W  Y++QG   IH INP  +V+VSGL+YD DL  LK +
Sbjct: 192 KAEVVAMSMRNEPRGPNQNVEKWFQYMSQGAKLIHQINPNALVVVSGLSYDTDLSFLKNR 251

Query: 251 PLNVNTLDNKLVFEVHLYSFSGDSESKFVTQPLNDICATIMNGFIDHAEFVIEGSNPFPL 310
            +  N LDNKLVFE HLYSF+ +    ++++PLN  CA++  GF D A F++ G NP PL
Sbjct: 252 SMGFN-LDNKLVFEAHLYSFTNNMGDFWMSKPLNTFCASVNQGFEDRAGFLVRGQNPMPL 311

Query: 311 FVSEYGYDQREVDEAENRFMSCFTAHLVQKDLDWALWTWQGSYYYREGQAEPAETFGVLN 370
           FVSE+G DQR V+E +NRF+SCF ++L + D DW LW  QGSYYYREG     E FGVL+
Sbjct: 312 FVSEFGIDQRGVNEGQNRFLSCFFSYLTENDFDWGLWALQGSYYYREGVKNAEENFGVLD 371

Query: 371 SNWTQIKNPN-FPKRFQLLQTMLQDPNSNASNSYVMYHPQSGQCAQVSNDKTGIFLNNCS 430
           S + + KN   F +RFQL+QT LQDP+SN + S +MYHP SG C ++ N K  + +++C 
Sbjct: 372 STFAKAKNSKLFLQRFQLMQTKLQDPSSNFTTSLIMYHPLSGGCVRM-NKKYQLGISSCK 431

Query: 431 TSSRWSHNGDGTPIRLTSTGLCLKSNGEGLGASLSNDCLSQQSAWRAISNSKLHLATFTQ 490
           TS+RW H  D +PI+L  + LCLK+ G GL   LS DC SQQS W+  S++KL LAT  +
Sbjct: 432 TSNRWIHEQDSSPIKLAGSVLCLKAIGVGLPPILSQDCSSQQSIWKYGSSAKLQLATVDE 491

Query: 491 DGNNLCLQIENSNSSKIVVNSCICTNGDPKCLEDTQSQWLTLVATN 536
            G  LCLQ   S+S +IV N C+C+N D +C ED QSQW TLV +N
Sbjct: 492 QGQALCLQRAASHSHQIVTNKCLCSN-DSQCQEDPQSQWFTLVPSN 534

BLAST of Cla97C02G027410 vs. TrEMBL
Match: tr|A0A1S3CT43|A0A1S3CT43_CUCME (endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103504654 PE=3 SV=1)

HSP 1 Score: 608.2 bits (1567), Expect = 1.8e-170
Identity = 295/531 (55.56%), Postives = 379/531 (71.37%), Query Frame = 0

Query: 6   QTILFLAIVFQFLSFSAYSLPLSTKGRWIVDSKTGHRVKLVCVNWPSHTQSMLAEGLNHR 65
           + I  + +    L+F A+SLPLST GRWI+D+ TG RVKL+CVNW  H Q ML EGL+ R
Sbjct: 8   KNIALVCVFVLLLTFKAFSLPLSTNGRWIIDATTGRRVKLMCVNWAGHMQGMLVEGLHRR 67

Query: 66  PLKELADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLAQHNPFVLNK 125
           PL ++A    KLRFNCVRLTY+ HMFTR+AN T++++F+  D+K A AG+AQ+NP +LN 
Sbjct: 68  PLDDIAALVAKLRFNCVRLTYSIHMFTRHANLTVKQSFENFDMKDAMAGIAQNNPSILNL 127

Query: 126 TIVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQGLRLVA 185
           T+VEAY AVVD L A G+MV++DNHISQPRWCC  +DGNGFFG+R FDPQEWLQG+ L A
Sbjct: 128 TLVEAYGAVVDSLVAHGIMVVSDNHISQPRWCCDNNDGNGFFGDRYFDPQEWLQGISLAA 187

Query: 186 QRFINKSTVVAMSLRNEIRGMMENANDWNNYVTQGVTTIHNINPEVVVIVSGLNYDNDLR 245
           Q   +K+ VVAMSLRNE+RG  +N   W  Y++QG   IH INP  +V+VSGL+YD DL 
Sbjct: 188 QSLKSKAQVVAMSLRNELRGPNQNVEKWFQYMSQGAKLIHQINPNALVVVSGLSYDTDLS 247

Query: 246 CLKEKPLNVNTLDNKLVFEVHLYSFSGDSESKFVTQPLNDICATIMNGFIDHAEFVIEGS 305
            LK + +  N LDNKLVFE HLYSF+ +    ++++PLN  CA+I  GF D A F++ G 
Sbjct: 248 FLKNRSMGFN-LDNKLVFEAHLYSFTNNMRDFWMSKPLNTFCASINQGFEDRAGFLVRGQ 307

Query: 306 NPFPLFVSEYGYDQREVDEAENRFMSCFTAHLVQKDLDWALWTWQGSYYYREGQAEPAET 365
           NP PLFVSE+G DQ   +E +NRF+SCF ++L + D DW LW  QGSYYY+ G     E 
Sbjct: 308 NPIPLFVSEFGIDQTGTNEGQNRFLSCFFSYLTENDFDWGLWALQGSYYYKVGVKNAEEN 367

Query: 366 FGVLNSNWTQIKNPN-FPKRFQLLQTMLQDPNSNASNSYVMYHPQSGQCAQVSNDKTGIF 425
           FGVL+SN+T+ KN   F +RFQL+QT LQDP+SN + +++MYHP SG C ++ N K  + 
Sbjct: 368 FGVLDSNFTKAKNSKLFLQRFQLMQTKLQDPSSNFTTTFIMYHPLSGGCVRM-NKKYQLG 427

Query: 426 LNNCSTSSRWSHNGDGTPIRLTSTGLCLKSNGEGLGASLSNDCLSQQSAWRAISNSKLHL 485
           +++C TS+RWSH  DG PI+L  + LCLK+ G GL   LS DC SQQS WR  SN+KL L
Sbjct: 428 ISSCKTSNRWSHEQDGAPIKLAGSILCLKAIGVGLPPILSQDCSSQQSIWRYASNAKLQL 487

Query: 486 ATFTQDGNNLCLQIENSNSSKIVVNSCICTNGDPKCLEDTQSQWLTLVATN 536
           AT  + G  LCLQ   S+S +IV N C+CT  D +C ED QSQW TLV +N
Sbjct: 488 ATVDEQGQALCLQ-RASHSHQIVTNKCLCTI-DSQCQEDPQSQWFTLVPSN 534

BLAST of Cla97C02G027410 vs. Swiss-Prot
Match: sp|C0HLA0|GH5FP_CHAOB (Glycosyl hydrolase 5 family protein OS=Chamaecyparis obtusa OX=13415 PE=1 SV=1)

HSP 1 Score: 406.4 bits (1043), Expect = 5.0e-112
Identity = 220/542 (40.59%), Postives = 319/542 (58.86%), Query Frame = 0

Query: 9   LFLAIVFQFLSFSAYSLPLSTKGRWIVDSKTGHRVKLVCVNWPSHTQSMLAEGLNHRPLK 68
           L  A++   ++  ++SLPL T+GRWIVD  TG RVKL CVNW  H +  L EGLN  P+ 
Sbjct: 13  LLTALLLLLVAAPSHSLPLLTRGRWIVDEATGLRVKLACVNWVGHLEPGLPEGLNRLPVA 72

Query: 69  ELADEAIKLRFNCVRLTYATHMFTR--YANRTIEENFDLLDLKQAKAGLAQHNPFVLNKT 128
            +A     L FNCVRLTY+ HM TR  Y N T+ + F  L+L +A +G+  +NP +L+  
Sbjct: 73  TVAHTISSLGFNCVRLTYSIHMLTRTSYTNATVAQTFARLNLTEAASGIEHNNPELLDLG 132

Query: 129 IVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQGLRLVAQ 188
            V AY  VV  L  +G+MVI DNH+S+P+WCC++DDGNGFFG+R F+P  W++GL L+A 
Sbjct: 133 HVAAYHHVVAALSEAGVMVILDNHVSKPKWCCAVDDGNGFFGDRYFNPNTWVEGLGLMAT 192

Query: 189 RFINKSTVVAMSLRNEIRGMMENANDWNNYVTQGVTTIHNINPEVVVIVSGLNYDNDLRC 248
            F N   VVAMSLRNE+RG       W+ ++  G  T+H  NP+V+VI+SGL +D DL  
Sbjct: 193 YFNNTPNVVAMSLRNELRGNRSTPISWSRHMQWGAATVHKANPKVLVILSGLQFDTDLSF 252

Query: 249 LKEKPLNVNTLDNKLVFEVHLYSFSGDSESKFVTQPLNDICATIMNGFIDHAEFVIEGSN 308
           L   P+ +     K+V+E H YSF       + T   ND+C      F  +  FV   +N
Sbjct: 253 LPVLPVTL-PFKEKIVYEGHWYSFG----VPWRTGLPNDVCKNETGRFKSNVGFVTSSAN 312

Query: 309 --PFPLFVSEYGYDQREVDEAENRFMSCFTAHLVQKDLDWALWTWQGSYYYREGQ---AE 368
               PLF+SE+G DQR V++ +NR+++C  A+L ++DLDWALWT  GSYYYR  +    +
Sbjct: 313 ATAAPLFMSEFGIDQRYVNDNDNRYLNCILAYLAEEDLDWALWTMGGSYYYRSDKQPVKD 372

Query: 369 PAETFGVLNSNWTQIKNPNFPKRFQLLQTMLQDPNSNASNSY-VMYHPQSGQCAQVSNDK 428
             ET+G  N +W++I+NP+F  R + +Q  +QDP       Y ++YHP SG C +     
Sbjct: 373 FEETYGFFNHDWSRIRNPDFISRLKEIQQPIQDPYLAPGPYYQIIYHPASGLCVESGIGN 432

Query: 429 TGIFLNNC-STSSRWSHNGD-GTPIRLTSTGLCLKSNGEGLGASLSNDCLS-QQSAWRAI 488
           T + L +C S  SRW+++     PI L  +  C+ + G GL A ++ +C +   + W  +
Sbjct: 433 T-VHLGSCQSVRSRWNYDASVKGPIGLMGSSSCISTQGNGLPAIMTENCSAPNNTLWSTV 492

Query: 489 SNSKLHLAT--FTQDGNNLCLQIENSNSSKIVVNSCICTNGDPKCLE--DTQSQWLTLVA 536
           S+++L L T    +DG    + ++ S S  I  N CIC   D  C    + + QW  ++ 
Sbjct: 493 SSAQLQLGTRVLGKDGKEKWMCLDGSKSPLISTNECICIT-DSHCYPKLNPEKQWFKVIT 547

BLAST of Cla97C02G027410 vs. Swiss-Prot
Match: sp|P19487|GUNA_XANCP (Major extracellular endoglucanase OS=Xanthomonas campestris pv. campestris (strain ATCC 33913 / DSM 3586 / NCPPB 528 / LMG 568 / P 25) OX=190485 GN=engXCA PE=1 SV=2)

HSP 1 Score: 73.9 bits (180), Expect = 5.9e-12
Identity = 94/410 (22.93%), Postives = 159/410 (38.78%), Query Frame = 0

Query: 9   LFLAIVFQFLSFSAYSLPLSTKGRWIVDSKTGHRVKLVCVN-WPSHTQSMLAEGLNHRPL 68
           L LA      +  A+S  ++   R IVD  +G  V+L  VN +   T + +  GL  R  
Sbjct: 10  LALATALALAAGPAFSYSIN-NSRQIVDD-SGKVVQLKGVNVFGFETGNHVMHGLWARNW 69

Query: 69  KELADEAIKLRFNCVRLTYATHMF---TRYANRTIEENFDLLDLKQAKAGLAQHNPFVLN 128
           K++  +   L FN VRL +        T  A+     N DL  L   +         +L+
Sbjct: 70  KDMIVQMQGLGFNAVRLPFCPATLRSDTMPASIDYSRNADLQGLTSLQ---------ILD 129

Query: 129 KTIVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQGLRLV 188
           K I E          A G+ V+ D+H      C  + +    +   ++   +WL  LR V
Sbjct: 130 KVIAE--------FNARGMYVLLDHHTPD---CAGISE---LWYTGSYTEAQWLADLRFV 189

Query: 189 AQRFINKSTVVAMSLRNEIRGMM-----ENANDWNNYVTQGVTTIHNINPEVVVIVSGLN 248
           A R+ N   V+ + L+NE  G         A DWN    +G   +  + P+ ++ V G+ 
Sbjct: 190 ANRYKNVPYVLGLDLKNEPHGAATWGTGNAATDWNKAAERGSAAVLAVAPKWLIAVEGIT 249

Query: 249 ------------YDNDLRCLKEKPLNVNTLDNKLVFEVHLYSFSGDSESKFVTQPLNDIC 308
                       +  +L+ L   PLN+    N+L+   H+Y         FV    ND  
Sbjct: 250 DNPVCSTNGGIFWGGNLQPLACTPLNIPA--NRLLLAPHVY-----GPDVFVQSYFND-- 309

Query: 309 ATIMNGFIDHAEFVIEG-----SNPFPLFVSEYGYDQREVDEAENRFMSCFTAHLVQKDL 368
               + F ++   + E      +    L + E+G    E D  +  +      +L  K +
Sbjct: 310 ----SNFPNNMPAIWERHFGQFAGTHALLLGEFGGKYGEGDARDKTWQDALVKYLRSKGI 368

Query: 369 DWAL-WTWQGSYYYREGQAEPAETFGVLNSNWTQIKNPNFPKRFQLLQTM 392
           +    W+W              +T G+L  +WT ++      +  LL+T+
Sbjct: 370 NQGFYWSW---------NPNSGDTGGILRDDWTSVRQ----DKMTLLRTL 368

BLAST of Cla97C02G027410 vs. TAIR10
Match: AT1G13130.1 (Cellulase (glycosyl hydrolase family 5) protein)

HSP 1 Score: 401.7 bits (1031), Expect = 6.9e-112
Identity = 207/531 (38.98%), Postives = 320/531 (60.26%), Query Frame = 0

Query: 10  FLAIVFQFLSFSA---YSLPLSTKGRWIVDSKTGHRVKLVCVNWPSHTQSMLAEGLNHRP 69
           F    F F++ +     S PLST  RWIVD + G RVKLVC NWPSH Q ++AEGL+ +P
Sbjct: 15  FFCFFFSFIAQNTVPNMSYPLSTSSRWIVD-ENGLRVKLVCANWPSHLQPVVAEGLSKQP 74

Query: 70  LKELADEAIKLRFNCVRLTYATHMFTRYA---NRTIEENFDLLDLKQAKAGLAQHNPFVL 129
           +  +A + +++ FNCVRLT+   + T      N T+ ++F  L L     G   +NP ++
Sbjct: 75  VDAVAKKIVEMGFNCVRLTWPLDLMTNETLANNVTVRQSFQSLGLNDDIVGFQTNNPSII 134

Query: 130 NKTIVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQGLRL 189
           +  ++EAY+ VV  LG + +MVI DNH+++P WCC+ DDGNGFFG++ FDP  W+  L+ 
Sbjct: 135 DLPLIEAYKTVVTTLGNNDVMVILDNHLTKPGWCCANDDGNGFFGDQFFDPTVWVAALKK 194

Query: 190 VAQRFINKSTVVAMSLRNEIRGMMENANDWNNYVTQGVTTIHNINPEVVVIVSGLNYDND 249
           +A  F   S VV MSLRNE+RG  +N NDW  Y+ QG   +H+ N +V+VI+SGL++D D
Sbjct: 195 MAATFNGVSNVVGMSLRNELRGPKQNVNDWFKYMQQGAEAVHSANNKVLVILSGLSFDAD 254

Query: 250 LRCLKEKPLNVNTLDNKLVFEVHLYSFSGDSESKFVTQPLNDICATIMNGFIDHAEFVIE 309
           L  ++ +P+ + +   KLVFE+H YSFS D  S     P NDIC  ++N   +   +++ 
Sbjct: 255 LSFVRSRPVKL-SFTGKLVFELHWYSFS-DGNSWAANNP-NDICGRVLNRIGNGGGYLL- 314

Query: 310 GSNPFPLFVSEYGYDQREVDEAENRFMSCFTAHLVQKDLDWALWTWQGSYYYREGQAEPA 369
            +  FPLF+SE+G D+R V+  +NR+  C T    + D+DW+LW   GSYY R+G+    
Sbjct: 315 -NQGFPLFLSEFGIDERGVNTNDNRYFGCLTGWAAENDVDWSLWALTGSYYLRQGKVGMN 374

Query: 370 ETFGVLNSNWTQIKNPNFPKRFQLLQTMLQDPNSNASNSYVMYHPQSGQCAQVS-NDKTG 429
           E +GVL+S+W  ++N +F ++   LQ+ LQ P        +++HP +G C   S +D   
Sbjct: 375 EYYGVLDSDWISVRNSSFLQKISFLQSPLQGPGPRTDAYNLVFHPLTGLCIVRSLDDPKM 434

Query: 430 IFLNNCSTSSRWSHNGDGTPIRLTSTGLCLKSNGEGLGASLS-NDCLSQQSAWRAISNSK 489
           + L  C++S  WS+      +R+    LCL+SNG     +++   C +  S W+ IS S+
Sbjct: 435 LTLGPCNSSEPWSYTKKA--LRIKDQQLCLQSNGPKNPVTMTRTSCSTSGSKWQTISASR 494

Query: 490 LHLATFTQDGNNLCLQIENSNSSKIVVNSCICTNGDPKCLEDTQSQWLTLV 533
           +HLA+ T +  +LCL ++ +N+  +V N+C C + D  C  +  SQW  ++
Sbjct: 495 MHLASTTSNKTSLCLDVDTANN--VVANACKCLSKDKSC--EPMSQWFKII 533

BLAST of Cla97C02G027410 vs. TAIR10
Match: AT3G26130.1 (Cellulase (glycosyl hydrolase family 5) protein)

HSP 1 Score: 377.5 bits (968), Expect = 1.4e-104
Identity = 207/536 (38.62%), Postives = 306/536 (57.09%), Query Frame = 0

Query: 5   LQTILFLAIVFQFLSFSAYSLPLSTKGRWIV-DSKTGHRVKLVCVNWPSHTQSMLAEGLN 64
           ++   F+++       + ++ P ST  RWIV D   G RVKL CVNWPSH ++ +AEGL+
Sbjct: 1   MEKFFFISVFLLPYVITTFAFPPSTDSRWIVDDGNKGRRVKLTCVNWPSHLETAVAEGLS 60

Query: 65  HRPLKELADEAIKLRFNCVRLTYATHMFTR---YANRTIEENFDLLDLKQAKAGLAQHNP 124
            +PL  +A++ + + FNCVRLT+  ++ T     A  T+ ++     L +A +G   HNP
Sbjct: 61  KQPLDAIAEKIVSMGFNCVRLTWPLYLATDESFSAFMTVRQSLRKFRLFEAVSGFQTHNP 120

Query: 125 FVLNKTIVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQG 184
            +L+  +++A++ VV  L    +MVI DNHISQP WCCS +DGNGFFG+++ +PQ W++G
Sbjct: 121 TILDLPLIKAFQEVVYCLEKHRVMVILDNHISQPGWCCSDNDGNGFFGDKHLNPQVWIKG 180

Query: 185 LRLVAQRFIN-KSTVVAMSLRNEIRGMMENANDWNNYVTQGVTTIHNINPEVVVIVSGLN 244
           L+ +A  F N  S VV MSLRNE+RG  +N  DW  Y+ +G   +H++NP V+VIVSGLN
Sbjct: 181 LKKMASMFANVSSNVVGMSLRNELRGPKQNIKDWYKYMREGAEAVHSVNPNVLVIVSGLN 240

Query: 245 YDNDLRCLKEKPLNVNTLDNKLVFEVHLYSFSGDSESKFVTQPLNDICATIMNGFIDHAE 304
           Y  DL  L+E+P  V +   K+VFE+H Y F    E       LN IC       +  + 
Sbjct: 241 YATDLSFLRERPFEV-SFRRKVVFEIHWYGFWNTWEG----DNLNKICGKETEKMMKMSG 300

Query: 305 FVIEGSNPFPLFVSEYGYDQREVDEAENRFMSCFTAHLVQKDLDWALWTWQGSYYYREGQ 364
           F++E     PLFVSE+G DQR  +  +N+F+SCF A    +DLDW+LWT  GSYY RE  
Sbjct: 301 FLLE--KGIPLFVSEFGIDQRGNNANDNKFLSCFMALAADRDLDWSLWTLAGSYYIREKS 360

Query: 365 AEPAETFGVLNSNWTQIKNPNFPKRFQLLQTMLQDPNSNASNSYVMYHPQSGQCAQVSND 424
               E++GVL+ NW+ I+N    +    +QT             +M+HP +G C  V   
Sbjct: 361 IGSDESYGVLDFNWSSIRNSTILQMISAIQTPFIGLMETQPKK-IMFHPSTGLCI-VRKS 420

Query: 425 KTGIFLNNCSTSSRWSHNGDGTPIRLTSTGLCLKSNGEGLGASLS-NDCLSQQSAWRAIS 484
              + L +C+ S  W  +            LCLK+  +G    L      S  S W+  S
Sbjct: 421 LFQLKLGSCNRSESWRLSSHRVLSLAEEQILCLKAYEKGKSVKLRLFFSESYCSKWKLFS 480

Query: 485 NSKLHLATFTQDGNNLCLQIENSNSSKIVVNSCICTNGDPKCLEDTQSQWLTLVAT 535
           +SK+ L++ T++G ++CL ++  N++ IV NSC C  G+  C  D +SQW  LV +
Sbjct: 481 DSKMQLSSITKNGFSVCLDVDTENNN-IVTNSCKCLRGNSSC--DPRSQWFKLVTS 524

BLAST of Cla97C02G027410 vs. TAIR10
Match: AT3G26140.1 (Cellulase (glycosyl hydrolase family 5) protein)

HSP 1 Score: 376.3 bits (965), Expect = 3.1e-104
Identity = 199/514 (38.72%), Postives = 300/514 (58.37%), Query Frame = 0

Query: 26  PLSTKGRWIVDSKTGHRVKLVCVNWPSHTQSMLAEGLNHRPLKELADEAIKLRFNCVRLT 85
           PLST  RWI+D K G RVKL CVNWPSH Q ++AEGL+ + + +LA + + + FNCVR T
Sbjct: 4   PLSTNSRWIIDEK-GQRVKLACVNWPSHLQPVVAEGLSKQSVDDLAKKIMAMGFNCVRFT 63

Query: 86  YATHMFTRYA---NRTIEENFDLLDLKQAKAGLAQHNPFVLNKTIVEAYEAVVDVLGASG 145
           +   + T      N T+ ++F  L L    +G    NP +++  ++EAY+ VV  LG + 
Sbjct: 64  WPLDLATNETLANNVTVRQSFQSLGLNDDISGFETKNPSMIDLPLIEAYKKVVAKLGNNN 123

Query: 146 LMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQGLRLVAQRFINKSTVVAMSLRNE 205
           +MVI DNH+++P WCC  +DGNGFFG+  FDP  W+ GL  +A  F   + VV MSLRNE
Sbjct: 124 VMVILDNHVTKPGWCCGYNDGNGFFGDTFFDPTTWIAGLTKIAMTFKGATNVVGMSLRNE 183

Query: 206 IRGMMENANDWNNYVTQGVTTIHNINPEVVVIVSGLNYDNDLRCLKEKPLNVNTLDNKLV 265
           +RG  +N +DW  Y+ QG   +H  NP V+VI+SGL+YD DL  ++ + +N+ T   KLV
Sbjct: 184 LRGPKQNVDDWFKYMQQGAEAVHEANPNVLVILSGLSYDTDLSFVRSRHVNL-TFTRKLV 243

Query: 266 FEVHLYSFSGDSESKFVTQPLNDICATIMNGFIDHAEFVIEGSNPFPLFVSEYGYDQREV 325
           FE+H YSF+  + + + ++  N+ C  I+    +   F +     FP+F+SE+G D R  
Sbjct: 244 FELHRYSFT--NTNTWSSKNPNEACGEILKSIENGGGFNL---RDFPVFLSEFGIDLRGK 303

Query: 326 DEAENRFMSCFTAHLVQKDLDWALWTWQGSYYYREGQAEPAETFGVLNSNWTQIKNPNFP 385
           +  +NR++ C      + D+DW++WT QGSYY REG    +E +G+L+S+W ++++ +F 
Sbjct: 304 NVNDNRYIGCILGWAAENDVDWSIWTLQGSYYLREGVVGMSEFYGILDSDWVRVRSQSFL 363

Query: 386 KRFQLLQTMLQDPNSNASNSYVMYHPQSGQC-AQVSNDKTGIFLNNCSTSSRWSHNGDGT 445
           +R  L+ + LQ P S +    +++HP +G C  Q   D T + L  C+ S  WS+    T
Sbjct: 364 QRLSLILSPLQGPGSQSKVYNLVFHPLTGLCMLQSILDPTKVTLGLCNESQPWSYTPQNT 423

Query: 446 PIRLTSTGLCLKSNGEGLGASLSNDCLSQQ--SAWRAISNSKLHLATFTQDGNNLCLQIE 505
            + L    LCL+S G      LS    S    S W  IS S + LA      N+LCL ++
Sbjct: 424 -LTLKDKSLCLESTGPNAPVKLSETSCSSPNLSEWETISASNMLLAA-KSTNNSLCLDVD 483

Query: 506 NSNSSKIVVNSCICTNG-DPKCLEDTQSQWLTLV 533
            +N+  ++ ++C C  G D  C  D  SQW  +V
Sbjct: 484 ETNN--LMASNCKCVKGEDSSC--DPISQWFKIV 504

BLAST of Cla97C02G027410 vs. TAIR10
Match: AT5G17500.1 (Glycosyl hydrolase superfamily protein)

HSP 1 Score: 363.2 bits (931), Expect = 2.7e-100
Identity = 192/516 (37.21%), Postives = 295/516 (57.17%), Query Frame = 0

Query: 22  AYSLPLSTKGRWIVDSKTGHRVKLVCVNWPSHTQSMLAEGLNHRPLKELADEAIKLRFNC 81
           A   PL TK RWIV++K GHRVKL C NWPSH + ++AEGL+ +P+  ++ +   + FNC
Sbjct: 23  ATDYPLFTKSRWIVNNK-GHRVKLACANWPSHLKPVVAEGLSSQPMDSISKKIKDMGFNC 82

Query: 82  VRLTYATHMF---TRYANRTIEENFDLLDLKQAKAGLAQHNPFVLNKTIVEAYEAVVDVL 141
           VRLT+   +    T   N T++++F+   L     G+  HNP+++N  ++  ++AVV  L
Sbjct: 83  VRLTWPLELMINDTLAFNVTVKQSFERYGLDHELQGIYTHNPYIVNTPLINVFQAVVYSL 142

Query: 142 GASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQGLRLVAQRFINKSTVVAMS 201
           G   +MVI DNH + P WCCS DD + FFG+  F+P  W+ GL+ +A  F+N   VV MS
Sbjct: 143 GRHDVMVILDNHKTVPGWCCSNDDPDAFFGDPKFNPDLWMLGLKKMATIFMNVKNVVGMS 202

Query: 202 LRNEIRGMMENANDWNNYVTQGVTTIHNINPEVVVIVSGLNYDNDLRCLKEKPLNVNTLD 261
           LRNE+RG    + DW  Y+ +G   +H  NP V+VI+SGLN+D DL  LK++P+N+ +  
Sbjct: 203 LRNELRGYNHTSKDWYKYMQKGAEAVHTSNPNVLVILSGLNFDADLSFLKDRPVNL-SFK 262

Query: 262 NKLVFEVHLYSFSGDSESKFVTQPLNDICATIMNGFIDHAEFVIEGSNPFPLFVSEYGYD 321
            KLV E+H YSF+ D   ++ +  +ND C+ + +       FV++    FPLF+SE+G D
Sbjct: 263 KKLVLELHWYSFT-DGTGQWKSHNVNDFCSQMFSKERRTGGFVLD--QGFPLFLSEFGTD 322

Query: 322 QREVDEAENRFMSCFTAHLVQKDLDWALWTWQGSYYYREGQAEPAETFGVLNSNWTQIKN 381
           QR  D   NR+M+C  A   +KDLDWA+W   G YY+REG+    E +G+L++NW  + N
Sbjct: 323 QRGGDLEGNRYMNCMLAWAAEKDLDWAVWAVTGVYYFREGKRGVVEAYGMLDANWHNVHN 382

Query: 382 PNFPKRFQLLQTMLQDPNSNASNSYVMYHPQSGQCAQVSN--DKTGIFLNNCSTSSRWSH 441
             + +R  ++Q     P    ++   ++HP +G C    +   ++ + L  C+    WS+
Sbjct: 383 YTYLRRLSVIQPPHTGPGVKHNHHKKIFHPLTGLCLVRKSHCHESELTLGPCTKDEPWSY 442

Query: 442 NGDGTPIRLTSTGLCLK-SNGEGLGASLSNDCLSQQSAWRAISNSKLHLATFTQDGNNLC 501
           +  G          CL+     G    L   C   +     IS +K+HL+  T DG+ +C
Sbjct: 443 SHGGILEIRRGHKSCLEGETAVGKSVKLGRICTKIEQ----ISATKMHLSFNTSDGSLVC 502

Query: 502 LQIENSNSSKIVVNSCICTNGDPKCLEDTQSQWLTL 532
           L +++ N+  +V NSC C  GD  C  +  SQW  +
Sbjct: 503 LDVDSDNN--VVANSCNCLTGDTTC--EPASQWFKI 525

BLAST of Cla97C02G027410 vs. TAIR10
Match: AT5G16700.1 (Glycosyl hydrolase superfamily protein)

HSP 1 Score: 303.9 bits (777), Expect = 2.0e-82
Identity = 186/535 (34.77%), Postives = 283/535 (52.90%), Query Frame = 0

Query: 10  FLAIVFQFLSFSA---YSLPLSTKGRWIVDSKTGHRVKLVCVNWPSHTQSMLAEGLNHRP 69
           +  + F F+S ++    S PLSTK RWIVD K G RVKL CVNWP+H Q  +AEGL+ +P
Sbjct: 6   YFCLFFLFISSTSKLTTSYPLSTKSRWIVDEK-GQRVKLACVNWPAHLQPTVAEGLSKQP 65

Query: 70  LKELADEAIKLRFNCVRLTYATHMFTR---YANRTIEENFDLLDLKQAKAGLAQHNPFVL 129
           L  ++ + + + FNCVRLT+   + T        T++++F+ L L +   G+  HNP +L
Sbjct: 66  LDSISKKIVSMGFNCVRLTWPLDLVTNDTLALKVTVKQSFESLKLFEDVLGIQTHNPKLL 125

Query: 130 NKTIVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQGLRL 189
           +  +  A++ VV  LG +G+MVI DNH++ P WCC  +D + FFG  +FDP  W +GLR 
Sbjct: 126 HLPLFNAFQEVVSNLGENGVMVILDNHLTTPGWCCGDNDLDAFFGYPHFDPLVWAKGLRK 185

Query: 190 VAQRFINKSTVVAMSLRNEIRGMMENANDWNNYVTQGVTTIHNINPEVVVIVSGLNYDND 249
           +A  F N + V+ MSLRNE RG  +  + W  ++ QG   +H  NP+++VI+SG+++D +
Sbjct: 186 MATLFRNFTHVIGMSLRNEPRGARDYPDLWFRHMPQGAEAVHAANPKLLVILSGIDFDTN 245

Query: 250 LRCLKEKPLNVNTLDNKLVFEVHLYSFSGDSESKFVTQPLNDICATIMNGFIDHAEFVIE 309
           L  L+++ +NV+  D KLVFE+H YSFS D    +     ND C  I+     +  F++ 
Sbjct: 246 LSFLRDRSVNVSFTD-KLVFELHWYSFS-DGRDSWRKHNSNDFCVKIIEKVTHNGGFLL- 305

Query: 310 GSNPFPLFVSEYGYDQREVDEAENRFMSCFTAHLVQKDLDWALWTWQGSYYYREGQAEPA 369
               FPL +SE+G DQR  D + NR+M+C  A   + DLDWA+W   G YY R G     
Sbjct: 306 -GRGFPLILSEFGTDQRGGDMSGNRYMNCLVAWAAENDLDWAVWALTGDYYLRTGPG--- 365

Query: 370 ETFGVLNSNWTQIKNPNFPKRFQLLQTMLQDPNSNASNSYVMYHPQSGQCA--QVSNDKT 429
                                          PN N     +++HP +G C     S++  
Sbjct: 366 -----------------------------LRPNKN-----LLFHPSTGLCVTNNPSDNIP 425

Query: 430 GIFLNNCSTSSRWSHN-GDGTPIRLTSTGLCLKSN---GEGLGASLSNDCLSQQSAWRAI 489
            + L  C  S  W+ N  +G    L    +C+++    G+ +   +   C    S    I
Sbjct: 426 TLRLGPCPKSDPWTFNPSEGI---LWINKMCVEAPNVVGQKVKLGVGTKC----SKLGQI 485

Query: 490 SNSKLHLATFTQDGNNLCLQIENSNSSKIVVNSCICTNGDPKCLEDTQSQWLTLV 533
           S +K+HL+  T +G  LCL ++  ++S +V N C     D  C  D  SQW  ++
Sbjct: 486 SATKMHLSFKTSNGLLLCLDVDERDNS-VVANRCKFLTMDASC--DPASQWFKVL 488

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_011658389.13.9e-27084.39PREDICTED: uncharacterized protein LOC101207450 [Cucumis sativus] >KGN45940.1 hy... [more]
XP_023533776.13.3e-26983.99uncharacterized protein LOC111795530 [Cucurbita pepo subsp. pepo][more]
XP_022995752.18.1e-26883.24uncharacterized protein LOC111491191 [Cucurbita maxima][more]
XP_022975072.11.2e-26381.90uncharacterized protein LOC111474044 [Cucurbita maxima][more]
XP_022958844.14.6e-26380.56uncharacterized protein LOC111459997 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
tr|A0A0A0K853|A0A0A0K853_CUCSA2.6e-27084.39Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G028440 PE=3 SV=1[more]
tr|A0A1S3BDI2|A0A1S3BDI2_CUCME4.9e-22985.71major extracellular endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103488703 P... [more]
tr|A0A1S3CTF8|A0A1S3CTF8_CUCME1.6e-22769.57major extracellular endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103504686 P... [more]
tr|A0A0A0KL32|A0A0A0KL32_CUCSA5.5e-17256.27Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G171770 PE=3 SV=1[more]
tr|A0A1S3CT43|A0A1S3CT43_CUCME1.8e-17055.56endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103504654 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
sp|C0HLA0|GH5FP_CHAOB5.0e-11240.59Glycosyl hydrolase 5 family protein OS=Chamaecyparis obtusa OX=13415 PE=1 SV=1[more]
sp|P19487|GUNA_XANCP5.9e-1222.93Major extracellular endoglucanase OS=Xanthomonas campestris pv. campestris (stra... [more]
Match NameE-valueIdentityDescription
AT1G13130.16.9e-11238.98Cellulase (glycosyl hydrolase family 5) protein[more]
AT3G26130.11.4e-10438.62Cellulase (glycosyl hydrolase family 5) protein[more]
AT3G26140.13.1e-10438.72Cellulase (glycosyl hydrolase family 5) protein[more]
AT5G17500.12.7e-10037.21Glycosyl hydrolase superfamily protein[more]
AT5G16700.12.0e-8234.77Glycosyl hydrolase superfamily protein[more]
The following terms have been associated with this gene:
Vocabulary: Biological Process
TermDefinition
GO:0005975carbohydrate metabolic process
Vocabulary: Molecular Function
TermDefinition
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
Vocabulary: INTERPRO
TermDefinition
IPR035992Ricin_B-like_lectins
IPR017853Glycoside_hydrolase_SF
IPR000772Ricin_B_lectin
IPR001547Glyco_hydro_5
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0016798 hydrolase activity, acting on glycosyl bonds
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C02G027410.1Cla97C02G027410.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001547Glycoside hydrolase, family 5PFAMPF00150Cellulasecoord: 63..350
e-value: 1.8E-26
score: 93.1
NoneNo IPR availableGENE3DG3DSA:3.20.20.80coord: 25..394
e-value: 1.5E-69
score: 236.9
NoneNo IPR availablePANTHERPTHR31263:SF10SUBFAMILY NOT NAMEDcoord: 18..533
NoneNo IPR availablePANTHERPTHR31263FAMILY NOT NAMEDcoord: 18..533
IPR000772Ricin B, lectin domainPROSITEPS50231RICIN_B_LECTINcoord: 400..531
score: 10.061
IPR000772Ricin B, lectin domainCDDcd00161RICINcoord: 410..517
e-value: 3.57852E-5
score: 41.3371
IPR017853Glycoside hydrolase superfamilySUPERFAMILYSSF51445(Trans)glycosidasescoord: 26..380
IPR035992Ricin B-like lectinsSUPERFAMILYSSF50370Ricin B-like lectinscoord: 402..512

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Cla97C02G027410Cla007754Watermelon (97103) v1wmwmbB324
Cla97C02G027410ClCG02G001010Watermelon (Charleston Gray)wcgwmbB138
Cla97C02G027410Lsi10G006680Bottle gourd (USVL1VR-Ls)lsiwmbB053
Cla97C02G027410Bhi10G001877Wax gourdwgowmbB375
The following gene(s) are paralogous to this gene:

None