Cla97C01G005230 (gene) Watermelon (97103) v2

NameCla97C01G005230
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionGlycosyl hydrolase family 5 protein
LocationCla97Chr01 : 4944497 .. 4950450 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTAATCGAAGGCCTTGATCGTAGGCCGTTGAAAGACCTTGCCCATGAGGTAGTGCGGTTGAGGTTTAATTGCGTGAGACTCACATATGCAACTCACATGTTTACTCGATATGCTAATAGGACAGTTGAAGAGAATTTTGACATTCTTGATTTGAAAGCTTCTAAGGCAGGGTTGGCTTTGCATAATCCATTTGTATTGAACATGACTATTTTTGAAGCTTATGAAGCTGCAATTGATGTGCTTGGAACTAGTGGTTTGATGGTCATAGCTGACAATCATATTAGCCAACCAAAATGGTGTTGCTCTCTTGACGATGGAAATGGTTTCTTTGGAGATCGTTATTTTGACCCAGAAGAATGGTTGGAAGGTCTTCGCTTGGTTGCTCGACGATTCTACAACAAATCAACTGTACGTAAATGTTGTACTTTTGAATTATTTTGTGAAATATGAAACATATAACATAGAGATACACAATTTTATATAAGTAGCTGCAAGTATAGTCCATTTATTAAATAAAAGATTAAAACTATATCTATGATATTAAAGGGTGAGAATCTTGGAAAAACTTTTTAGGACACTTTTGTCACTTCCTTATTTTACTTATTTACACAATATGTCAGACTAAATATAAATACTAAATTAATTAATTAATCTATAATTATCAATAATAAATATATAATTAATTAATATGGTAATGTTGGATTGTAAAAAGTTTTTTTCCACTTAAAAGAGTTTAATTTTTACTTTAGTTATAATTAATATTGAATTTTCAAAGTTGTAGAATTCAAAAAGTTTTTGTAACTCTCATCTTCAACTATTATAAAAAATGTGTCTTCTTTAACATTTTTCACCAACATTGCACTTAAGTTTTATTCTTTCTCCATTTTTTTCTCTCAAATCATTGCTCTTAAAACATCTTCAATTTTCAAAGCATCTCAAGATGTTGGTATTATATTCAAGTTGCAAGAGATGCATGTGTTGGATCAAAATCTTAGATTGTAACCTTTTTTTTATTTTTTTTTATTTTTAAATAAATGGATTAGAAGCTTTGCTACTTAACTTTACTATTTCATTTTGATGGAGTTGAATATTAGTGTTTGTTTAGCAAGTAAATTTTATTTTATTTTATTTTATTTTATTTTTGTTCTTTTAGAATGTATTGTAATTTTACTTAAAATGTTTATTGAAAAATAAAAGTTTATCCCTTATGTTACTTTGTTGAGTAAAAAATTTATTGAACTCTATGCCTCACAAAATTTTTATCAATTATAAATAACATATTAATCAATTAAATTAATTATTAAAAATAAAAATTAAAAACTTGGAATAAAAACTAAAAAGTTGAATGCAACAAAGTTTTGTGAAAGAAATTCATTACATTGTGAAACTAATTATATCTTTACTGAACTTGAAAAGTTTATTAGATAACAAAAGTTAAAGTTTCAAGTTTATTAGATAACAAAAGATTATAATGCAACAAAGTTTTTTTTTTTTTTTTTAAACTCAATAATAATTAAGACTCTTTGATGAAGAGTAAATGATAAAATATTGACAAATTGAAAATACATAAACTAATTTAAATATCATTTCCATCATTATAAAAAAAATGCATAACTATTTATGTAAAAATAAAACTACTTAAAACATTTGCACATTTTAGAATTTTAACTTATTTTTAATTTAATATTAATTAGATTTGCTGTTATATTTTTTACAAGTACACTTTTTTATTGGTTTCTAATTAGTTTATAAATTTTAATTATAATCTTTTGTTAAATTTTACAAATTATAATTTTAGATCCTTATTCTTTGATAATCAAACTTTATAGATTTTAATTTATTTAAATTATTTAGCCCCATCTAAATAATGTAAATAATTCGCTAACTTTATGCATGTTCTCCCAATTACATTGTTTAAGTTTTCACCTCTTTCGTTCATTCTTTAAACATAAAACTTATTTGAGTTTCGTTATTAATAAAATGTTGAGTGCTAACTTATAATTACACCGAAATTTTATTGTTGGTTGTAATTTGAGGTAAGAGAATGGGAAAAAAAGGAAGAATATGAAATTCAAGGTTTTTTAAAATTAATTTATAAAATAATTCTAAAATTTATGATATTACATTTGAAATTTTAATTTTTAATAATTTATTTATTTGGAATTTTTAAGAGCTACTTTTAAAAGTAATAGTGGTTGGATTCCTAGTTAGTTAATTATATTATTGAATAATAATGTGAAAGTTAGAATTTGGAGAGTTAGGATATACATAGTATAAATTTATTATAAGTTAACTTTTTATTTTTATATAAATGAAATTTAAGAGTAATGATTATTTTTAATCATAATCAATAATGAAATATCTAAGTTTTGGAATCTCAACGCTTTGTTACTCATTTATCTCGTATAACTAAATGCTAATTTTATACCTATTATCAACAAAACCTCAAATATTTCCGATAAAGTAGATTTGTTTTTCTCATTTAATAATTTTCTTATTTTCCTTATAACTTTTGGTTGGATTAGTTGCATTTTTTTATAAATTTTTTTTTATAAATATTTTTTTAAACTATGTTATAATAATAAATATAAAAATATTATTATTATAAAATAATTGATATATCCTTATGATTTTTTCAAAATTTTAAATTGTAGTATATGATAACTCAATAATTTCGTTTTTCTATGTTGAACCACCTAAACTAATATTGTTGGTTGTTTTAAAATTTTTCTAATAGATATTACATATTAAAAAATCAAATAAACAATTTATTTATATATATATATAAACGTGCTAAACATGTATTAAAATACTAGTATTTATTAACATAATTAATGTAATCAGGCTAATTATATAGTAACTTTTTTTAGTGGTATTATTTTAGTAATATTAGACTATTATTATAATGGTTAAATTACATTTTTAATCCCTTAACACTTTTAAATCTTAATTTCAATTTTGTCTCTTATTGGTTCAAAAGTTTCATTTATCTTGAAAGCCGAGTTAATATGGCAAGTCGGTTACCATCCTCGACACAACTTTTTTTTTTTTTTATAATTTTTTTTTTAAATTTTTAAATTTTTATTTTTTAAGGAAAGAGTTTGATAAGTCATATTGTTATTTAGGAAAAAATATATATATATATTTTACATAGAATTTTGGTGTATTTTAAATTGAAGAAGGAAAAATATCAATTTTGACACTAAACTTGTCGAGTCATATCAATTTTGACCCTACACTTTTAATTTCATCAAATTACACTAAAACTTAGATGATTATTGCAATCTTAACCTTAAACTTTAAAAAGTACTCCAATTTTAACTCTTATGCTCCAAAAATCGTTATAATTTTTTTTTTCCATGAAATTGATAATTTTTTTTTTTTTTAAATGACATTGTAGATAGATTTGTTATTGAAATTTACGAATGTTTGGGGAAGTTATTCTATAGTTAATCTGTGCATTGGTCAAACTTATTTTTGTACAAAATGTTATTTGATTGAAATTGATGAGTTTATGTAGAATATTTTAAACTTTTTCACCAAAAAATTAATAGGTGAATTAACGGAAATTAAACTAAGAGTTAAAATTGCAATATTTAAGTTTCACTACCCATTTTTAAAACGTTCTTAAAAATCAAGCCATTTTTTTTAATTAAAAAACTTGATATTTTTTAATTAAAATTTGGCCTAAAATTGAAATGTGTTTAAAAAAATGTGAAATTTTTGCCAAAGAATTGTAAAGCAAGCATAATTTTAAAAAACATTATGATTATAAAACGGAGTTTTAATGTTTAGAATGAAAATAATCTATGTATAAAAAGTTCGTACGTAAAATTTTGTCCCTCTTTTAAGATTTTTTTTTTCCTCTTACGTCAAACAGGTTGTAGCCATGAGCCTACGAAACGAACTTCGAGGAGCAAAATCAAAATCAAAAGATTGGAACAAATACATAACACAAGGAGCAACAACAATCCACAACATAAACCCAAACATCCTCGTGATCATTTCAGGTCTAAACTTCGACAACGACCTACGATGCCAAAGACAAAACCCCTTGCAACTAAACAACCTACACAACAAGCTAGTTTTTGAAGTACACTTATATTCCTTCAGTGGAGAGTCCCAATCAAAGTTCATCCATAATCCTCTCAACAAAATCTGCTCAAGGATCATCAATGGGTTTGTGGAGAGAGCTGAGTTTGTGATGGAAGGAACTGAGGCAGTTCCTTTATTTGTGAGCGAGTTTGGGTTTGACCAAAGTGGAGTTAATGAGGCTGATGATAGGTTCTTGAGTTGCTTTAGTGCTCATCTTGCAAAGGAAGATTTAGACTGGGCGCTATGGGCTTGGCAAGGCAGTTATTATTATAGACAGGGCAATGTCGAACCTGAAGAAGTGTTTGGAGTTTTGAATTACAATTGGAGTGATGTCAGAAACCATCGTTTTTCTCAGATGTTTGGACTCCTGCAAACCATGTTGCAAGGTATGGCTTTGAGTCCTATTTGATAACGGTTTTGTTTTGATTTTTCATATTTGCTATTTTTTAAATTAAATTATTAGAATTATTAGTGTTGAAAGAATAATGTTGATATATGACTCTAAAAATGGTCAATAGGAAATCTTATCAAACGAGTTAAGTTCCTAACGTCATGAGTGAATTATATATTTATTTCTTCCCATAAAAACATAGAATATTTTTAAGTTGATAAATTAATTATGAACATTTTGACAGCATAACATTATTTATTATAAACTATTTAATTTTAATATTGAAAAATTATCTTAAATGACAAAACTGTTGAAAATATTTACAATTAATAATAAAATACACAATCTATTTGCCATAGATCGCGATAGATCATGATAGACTACTATCTGCATCTATCGTGACACAGATAGTAGTCTATCGCGATCTATCATGGTCTATTGCAGATAGACAATGAAATTTTGCTATATTTATAAATACTTTGGTCTCTTTTGCTATATTTAAAAAGAGCCCTTTAACATTTTGTTACAAATAAAAACTTGATGGTAGAAATTTTACTATTTATGAGGGTCACTTTTCTCTCCTAGCTAATATATTAAAAGCTAAATTTTAAAATTAATACGTCCTTAAACTTTGTACTTTTCTGTAAAAATATCCTTGAACTTTCAAAAATAGTTCAAAAATACTTTTACCATTAGTTTTTTTTAGACCGAAATTGTTAGTGTTTTGTTTCAAATATACCTTTGAACTTCCAAAAATACCCTTAAAAAATGTTTAAAAAATACTTTTACTGTTAACATTTGGGAATCAAATTAGTATTAACAACTTTATAACTCAAGCTTTTTGTGATTTAGAGGTATAGTTGAAAGAGAAATTAGGTTAAAGTACAATTCCAAGTCCATAGGTTACATTTTGTAAAACATTCATAATCTTTTTTAATTAGTATGAAAATTGAAAACTATTTATATGTAGCTAGCCTTCTTTTTTCCTTTTTTTTTTTTCTCTTTAGTAAAACTTTATGCACATGCAGATCCAAATTCCAATTCCTCAAACTCTTACTTAATGTATCATCCACAAAGTGGGCAATGTGTCCAAGTGAAAGACAAGATGGATGAGCAAATTTATCTCAACAACTGTTCCAGTGCAAGCCATTGGAGCCATGAAGGAGATGGGACTCCAATAATGTTGGAAGCCACTGATTTTTGTCTAAAAGCCAATGGAAATGGGCTTCCACCATCACTCTCAAGGGATTGTTTTGGTGAGCAAAGTGTTTGGACAGCCATTTCAGACTCTAAGCTTCATTTGGCCACACTCACAAAACAAGGCAATGGTTTGTGTTTAGAGAAAGAGAGCTCAAATTCAACTAAGATTGTGATGGGGAGATGTGTTTGTGTTGGTAATGATTCAAATTGTTTACAAGATACTCAAGCTCAATGGTTTGAACTTGTTGTTACAAATACTTTGTAG

mRNA sequence

ATGCTAATCGAAGGCCTTGATCGTAGGCCGTTGAAAGACCTTGCCCATGAGGTAGTGCGGTTGAGGTTTAATTGCGTGAGACTCACATATGCAACTCACATGTTTACTCGATATGCTAATAGGACAGTTGAAGAGAATTTTGACATTCTTGATTTGAAAGCTTCTAAGGCAGGGTTGGCTTTGCATAATCCATTTGTATTGAACATGACTATTTTTGAAGCTTATGAAGCTGCAATTGATGTGCTTGGAACTAGTGGTTTGATGGTCATAGCTGACAATCATATTAGCCAACCAAAATGGTGTTGCTCTCTTGACGATGGAAATGGTTTCTTTGGAGATCGTTATTTTGACCCAGAAGAATGGTTGGAAGGTCTTCGCTTGGTTGCTCGACGATTCTACAACAAATCAACTGTTGTAGCCATGAGCCTACGAAACGAACTTCGAGGAGCAAAATCAAAATCAAAAGATTGGAACAAATACATAACACAAGGAGCAACAACAATCCACAACATAAACCCAAACATCCTCGTGATCATTTCAGGTCTAAACTTCGACAACGACCTACGATGCCAAAGACAAAACCCCTTGCAACTAAACAACCTACACAACAAGCTAGTTTTTGAAGTACACTTATATTCCTTCAGTGGAGAGTCCCAATCAAAGTTCATCCATAATCCTCTCAACAAAATCTGCTCAAGGATCATCAATGGGTTTGTGGAGAGAGCTGAGTTTGTGATGGAAGGAACTGAGGCAGTTCCTTTATTTGTGAGCGAGTTTGGGTTTGACCAAAGTGGAGTTAATGAGGCTGATGATAGGTTCTTGAGTTGCTTTAGTGCTCATCTTGCAAAGGAAGATTTAGACTGGGCGCTATGGGCTTGGCAAGGCAGTTATTATTATAGACAGGGCAATGTCGAACCTGAAGAAGTGTTTGGAGTTTTGAATTACAATTGGAGTGATGTCAGAAACCATCGTTTTTCTCAGATGTTTGGACTCCTGCAAACCATGTTGCAAGATCCAAATTCCAATTCCTCAAACTCTTACTTAATGTATCATCCACAAAGTGGGCAATGTGTCCAAGTGAAAGACAAGATGGATGAGCAAATTTATCTCAACAACTGTTCCAGTGCAAGCCATTGGAGCCATGAAGGAGATGGGACTCCAATAATGTTGGAAGCCACTGATTTTTGTCTAAAAGCCAATGGAAATGGGCTTCCACCATCACTCTCAAGGGATTGTTTTGGTGAGCAAAGTGTTTGGACAGCCATTTCAGACTCTAAGCTTCATTTGGCCACACTCACAAAACAAGGCAATGGTTTGTGTTTAGAGAAAGAGAGCTCAAATTCAACTAAGATTGTGATGGGGAGATGTGTTTGTGTTGGTAATGATTCAAATTGTTTACAAGATACTCAAGCTCAATGGTTTGAACTTGTTGTTACAAATACTTTGTAG

Coding sequence (CDS)

ATGCTAATCGAAGGCCTTGATCGTAGGCCGTTGAAAGACCTTGCCCATGAGGTAGTGCGGTTGAGGTTTAATTGCGTGAGACTCACATATGCAACTCACATGTTTACTCGATATGCTAATAGGACAGTTGAAGAGAATTTTGACATTCTTGATTTGAAAGCTTCTAAGGCAGGGTTGGCTTTGCATAATCCATTTGTATTGAACATGACTATTTTTGAAGCTTATGAAGCTGCAATTGATGTGCTTGGAACTAGTGGTTTGATGGTCATAGCTGACAATCATATTAGCCAACCAAAATGGTGTTGCTCTCTTGACGATGGAAATGGTTTCTTTGGAGATCGTTATTTTGACCCAGAAGAATGGTTGGAAGGTCTTCGCTTGGTTGCTCGACGATTCTACAACAAATCAACTGTTGTAGCCATGAGCCTACGAAACGAACTTCGAGGAGCAAAATCAAAATCAAAAGATTGGAACAAATACATAACACAAGGAGCAACAACAATCCACAACATAAACCCAAACATCCTCGTGATCATTTCAGGTCTAAACTTCGACAACGACCTACGATGCCAAAGACAAAACCCCTTGCAACTAAACAACCTACACAACAAGCTAGTTTTTGAAGTACACTTATATTCCTTCAGTGGAGAGTCCCAATCAAAGTTCATCCATAATCCTCTCAACAAAATCTGCTCAAGGATCATCAATGGGTTTGTGGAGAGAGCTGAGTTTGTGATGGAAGGAACTGAGGCAGTTCCTTTATTTGTGAGCGAGTTTGGGTTTGACCAAAGTGGAGTTAATGAGGCTGATGATAGGTTCTTGAGTTGCTTTAGTGCTCATCTTGCAAAGGAAGATTTAGACTGGGCGCTATGGGCTTGGCAAGGCAGTTATTATTATAGACAGGGCAATGTCGAACCTGAAGAAGTGTTTGGAGTTTTGAATTACAATTGGAGTGATGTCAGAAACCATCGTTTTTCTCAGATGTTTGGACTCCTGCAAACCATGTTGCAAGATCCAAATTCCAATTCCTCAAACTCTTACTTAATGTATCATCCACAAAGTGGGCAATGTGTCCAAGTGAAAGACAAGATGGATGAGCAAATTTATCTCAACAACTGTTCCAGTGCAAGCCATTGGAGCCATGAAGGAGATGGGACTCCAATAATGTTGGAAGCCACTGATTTTTGTCTAAAAGCCAATGGAAATGGGCTTCCACCATCACTCTCAAGGGATTGTTTTGGTGAGCAAAGTGTTTGGACAGCCATTTCAGACTCTAAGCTTCATTTGGCCACACTCACAAAACAAGGCAATGGTTTGTGTTTAGAGAAAGAGAGCTCAAATTCAACTAAGATTGTGATGGGGAGATGTGTTTGTGTTGGTAATGATTCAAATTGTTTACAAGATACTCAAGCTCAATGGTTTGAACTTGTTGTTACAAATACTTTGTAG

Protein sequence

MLIEGLDRRPLKDLAHEVVRLRFNCVRLTYATHMFTRYANRTVEENFDILDLKASKAGLALHNPFVLNMTIFEAYEAAIDVLGTSGLMVIADNHISQPKWCCSLDDGNGFFGDRYFDPEEWLEGLRLVARRFYNKSTVVAMSLRNELRGAKSKSKDWNKYITQGATTIHNINPNILVIISGLNFDNDLRCQRQNPLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICSRIINGFVERAEFVMEGTEAVPLFVSEFGFDQSGVNEADDRFLSCFSAHLAKEDLDWALWAWQGSYYYRQGNVEPEEVFGVLNYNWSDVRNHRFSQMFGLLQTMLQDPNSNSSNSYLMYHPQSGQCVQVKDKMDEQIYLNNCSSASHWSHEGDGTPIMLEATDFCLKANGNGLPPSLSRDCFGEQSVWTAISDSKLHLATLTKQGNGLCLEKESSNSTKIVMGRCVCVGNDSNCLQDTQAQWFELVVTNTL
BLAST of Cla97C01G005230 vs. NCBI nr
Match: XP_008467306.1 (PREDICTED: major extracellular endoglucanase-like [Cucumis melo])

HSP 1 Score: 912.5 bits (2357), Expect = 5.9e-262
Identity = 433/483 (89.65%), Postives = 461/483 (95.45%), Query Frame = 0

Query: 1   MLIEGLDRRPLKDLAHEVVRLRFNCVRLTYATHMFTRYANRTVEENFDILDLKASKAGLA 60
           MLIEGLDRRPLKDLA+EV+RL+FNCVRLTYATHMFTRYANRTVEENFD+LDL+ASK GLA
Sbjct: 57  MLIEGLDRRPLKDLANEVMRLKFNCVRLTYATHMFTRYANRTVEENFDLLDLRASKVGLA 116

Query: 61  LHNPFVLNMTIFEAYEAAIDVLGTSGLMVIADNHISQPKWCCSLDDGNGFFGDRYFDPEE 120
           LHNPFVLNMTIFEAYEA +DVLGTSGLMVIADNHISQP+WCCSL+DGNGFFGDRYFD EE
Sbjct: 117 LHNPFVLNMTIFEAYEAVVDVLGTSGLMVIADNHISQPRWCCSLEDGNGFFGDRYFDSEE 176

Query: 121 WLEGLRLVARRFYNKSTVVAMSLRNELRGAKSKSKDWNKYITQGATTIHNINPNILVIIS 180
           WLEGLRLVARRFYNKS VVAMSLRNELRGA SKSKDWNKY+TQGATTIHNINPNILVIIS
Sbjct: 177 WLEGLRLVARRFYNKSAVVAMSLRNELRGASSKSKDWNKYVTQGATTIHNINPNILVIIS 236

Query: 181 GLNFDNDLRCQRQNPLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICSRIINGFVE 240
           GLNFDNDLRCQRQ PLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICS+IINGFV+
Sbjct: 237 GLNFDNDLRCQRQYPLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICSKIINGFVQ 296

Query: 241 RAEFVMEGTEAVPLFVSEFGFDQSGVNEADDRFLSCFSAHLAKEDLDWALWAWQGSYYYR 300
           RAEFVMEG EAVPLFVSEFG DQ+GVNEADDRFLSCFSAHL ++DLDWALW WQGSYYYR
Sbjct: 297 RAEFVMEGAEAVPLFVSEFGLDQTGVNEADDRFLSCFSAHLVEKDLDWALWGWQGSYYYR 356

Query: 301 QGNVEPEEVFGVLNYNWSDVRNHRFSQMFGLLQTMLQDPNSNSSNSYLMYHPQSGQCVQV 360
           QG VE EEVFGVLNYNWSDVRN RFSQMF LLQTMLQDPNSNSSN+YLMYHPQSGQCVQV
Sbjct: 357 QGKVELEEVFGVLNYNWSDVRNPRFSQMFQLLQTMLQDPNSNSSNTYLMYHPQSGQCVQV 416

Query: 361 KDKMDEQIYLNNCSSASHWSHEGDGTPIMLEATDFCLKANGNGLPPSLSRDCFGEQSVWT 420
            D   ++I+LNNCS+ASHWS+EGDGTPIML +T+FCLKANGNGLPPSLSRDCFGEQSVWT
Sbjct: 417 HDMKQKEIFLNNCSNASHWSYEGDGTPIMLASTNFCLKANGNGLPPSLSRDCFGEQSVWT 476

Query: 421 AISDSKLHLATLTKQG-NGLCLEKESSNSTKIVMGRCVCVGNDSNCLQDTQAQWFELVVT 480
           AISDSKLHLATLTKQG NG+CLEKESSNS++I+M  CVCVG+DSNCLQDTQAQWF+LVVT
Sbjct: 477 AISDSKLHLATLTKQGNNGMCLEKESSNSSRILMRSCVCVGSDSNCLQDTQAQWFQLVVT 536

Query: 481 NTL 483
           NTL
Sbjct: 537 NTL 539

BLAST of Cla97C01G005230 vs. NCBI nr
Match: XP_004143723.1 (PREDICTED: uncharacterized protein LOC101213113 [Cucumis sativus])

HSP 1 Score: 903.7 bits (2334), Expect = 2.8e-259
Identity = 430/483 (89.03%), Postives = 456/483 (94.41%), Query Frame = 0

Query: 1   MLIEGLDRRPLKDLAHEVVRLRFNCVRLTYATHMFTRYANRTVEENFDILDLKASKAGLA 60
           MLIEGLDRRPLKDLA+EVVRLRFNCVRLTYATHMFTRYANRTVEENFD+LDL+A+K GLA
Sbjct: 57  MLIEGLDRRPLKDLANEVVRLRFNCVRLTYATHMFTRYANRTVEENFDLLDLRAAKVGLA 116

Query: 61  LHNPFVLNMTIFEAYEAAIDVLGTSGLMVIADNHISQPKWCCSLDDGNGFFGDRYFDPEE 120
            HNPFVLNMTIFEAYEA +DVLGTSGLMVIADNHISQP+WCCSL+DGNGFFGDRYFD EE
Sbjct: 117 FHNPFVLNMTIFEAYEAVVDVLGTSGLMVIADNHISQPRWCCSLEDGNGFFGDRYFDTEE 176

Query: 121 WLEGLRLVARRFYNKSTVVAMSLRNELRGAKSKSKDWNKYITQGATTIHNINPNILVIIS 180
           WLEGLRLVARRFYNKS VVAMSLRNELRGA SKSKDWNKYITQGATTIHNINP ILVIIS
Sbjct: 177 WLEGLRLVARRFYNKSAVVAMSLRNELRGASSKSKDWNKYITQGATTIHNINPKILVIIS 236

Query: 181 GLNFDNDLRCQRQNPLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICSRIINGFVE 240
           GLNFDNDLRCQRQ PLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICS++INGFVE
Sbjct: 237 GLNFDNDLRCQRQYPLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICSKVINGFVE 296

Query: 241 RAEFVMEGTEAVPLFVSEFGFDQSGVNEADDRFLSCFSAHLAKEDLDWALWAWQGSYYYR 300
           RAEFVMEG EAVPLFVSEFG DQ GVNEADDRFLSCFSAHL ++DLDWALW WQGSYYYR
Sbjct: 297 RAEFVMEGAEAVPLFVSEFGLDQRGVNEADDRFLSCFSAHLVEKDLDWALWGWQGSYYYR 356

Query: 301 QGNVEPEEVFGVLNYNWSDVRNHRFSQMFGLLQTMLQDPNSNSSNSYLMYHPQSGQCVQV 360
           QG V PEEVFGVLNYNWSDVRN  FSQMF LLQTMLQDPNSNSSN+Y+MYHPQSGQCV V
Sbjct: 357 QGKVGPEEVFGVLNYNWSDVRNPHFSQMFQLLQTMLQDPNSNSSNTYVMYHPQSGQCVLV 416

Query: 361 KDKMDEQIYLNNCSSASHWSHEGDGTPIMLEATDFCLKANGNGLPPSLSRDCFGEQSVWT 420
           +D    QIYLN+CS+ASHWS+EGDGTPIML +T+FCLKA+G+GLPPSLSRDCFGEQSVWT
Sbjct: 417 QDMKHMQIYLNDCSNASHWSYEGDGTPIMLASTNFCLKASGDGLPPSLSRDCFGEQSVWT 476

Query: 421 AISDSKLHLATLTKQG-NGLCLEKESSNSTKIVMGRCVCVGNDSNCLQDTQAQWFELVVT 480
           AISDSKLHLATLTKQG NG+CLEKESSNS++I+M  CVCVGNDSNCLQDTQAQWF+LVVT
Sbjct: 477 AISDSKLHLATLTKQGNNGMCLEKESSNSSRILMRSCVCVGNDSNCLQDTQAQWFQLVVT 536

Query: 481 NTL 483
           NTL
Sbjct: 537 NTL 539

BLAST of Cla97C01G005230 vs. NCBI nr
Match: XP_023534098.1 (uncharacterized protein LOC111795760 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 855.5 bits (2209), Expect = 8.6e-245
Identity = 404/482 (83.82%), Postives = 439/482 (91.08%), Query Frame = 0

Query: 1   MLIEGLDRRPLKDLAHEVVRLRFNCVRLTYATHMFTRYANRTVEENFDILDLKASKAGLA 60
           MLIEGL  R LKDLA E+V L+FNCVRLTYATHMFTRYANRTVEENFD+LDL+ASKAGL 
Sbjct: 56  MLIEGLAHRSLKDLADELVSLKFNCVRLTYATHMFTRYANRTVEENFDLLDLRASKAGLV 115

Query: 61  LHNPFVLNMTIFEAYEAAIDVLGTSGLMVIADNHISQPKWCCSLDDGNGFFGDRYFDPEE 120
           LHNPFVLNMTIFEAYEA +DVLGTSGLMVIADNHISQP+WCCSL+DGNGFFGDRYFDPEE
Sbjct: 116 LHNPFVLNMTIFEAYEAVVDVLGTSGLMVIADNHISQPRWCCSLNDGNGFFGDRYFDPEE 175

Query: 121 WLEGLRLVARRFYNKSTVVAMSLRNELRGAKSKSKDWNKYITQGATTIHNINPNILVIIS 180
           WLEGLRLVARRF NKS VVAMSLRNELRGAKS SKDWNKY+TQGATTIH+INPN+LVI+S
Sbjct: 176 WLEGLRLVARRFTNKSNVVAMSLRNELRGAKSSSKDWNKYMTQGATTIHDINPNLLVIVS 235

Query: 181 GLNFDNDLRCQRQNPLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICSRIINGFVE 240
           GLNFDNDLRCQR NPL LNNLHNKLVFEVHLYSFSG +++KFI NPLNKICS IINGFVE
Sbjct: 236 GLNFDNDLRCQRHNPLLLNNLHNKLVFEVHLYSFSGATKTKFITNPLNKICSTIINGFVE 295

Query: 241 RAEFVMEGTEAVPLFVSEFGFDQSGVNEADDRFLSCFSAHLAKEDLDWALWAWQGSYYYR 300
           RAEFVM+G EAVPLFVSEFGFDQ G N ADDRFLSCF AHLAK DLDWALWAWQGSYYYR
Sbjct: 296 RAEFVMQGAEAVPLFVSEFGFDQRGTNVADDRFLSCFVAHLAKTDLDWALWAWQGSYYYR 355

Query: 301 QGNVEPEEVFGVLNYNWSDVRNHRFSQMFGLLQTMLQDPNSNSSNSYLMYHPQSGQCVQV 360
           QG  + +EVFGVLNYNWSDVRN RFS+ F LLQTML+DPNSN+ NSY+MYHPQSGQCV+V
Sbjct: 356 QGQAQSDEVFGVLNYNWSDVRNPRFSKTFQLLQTMLRDPNSNAPNSYVMYHPQSGQCVRV 415

Query: 361 KDKMDEQIYLNNCSSASHWSHEGDGTPIMLEATDFCLKANGNGLPPSLSRDCFGEQSVWT 420
           KD M ++IYLN+CS+ASHWSH GDGTPI LEAT  CLKA+G+GL P LSRDC   +S WT
Sbjct: 416 KDMMSKEIYLNDCSNASHWSHRGDGTPIELEATGLCLKADGDGLRPLLSRDCSSNESSWT 475

Query: 421 AISDSKLHLATLTKQGNGLCLEKESSNSTKIVMGRCVCVGNDSNCLQDTQAQWFELVVTN 480
            IS+SKLHLATLT+QGNGLCLEKESSNST+IVMGRCVCVG+DSNCL DT++QWFELV TN
Sbjct: 476 TISNSKLHLATLTRQGNGLCLEKESSNSTRIVMGRCVCVGDDSNCLDDTRSQWFELVATN 535

Query: 481 TL 483
           TL
Sbjct: 536 TL 537

BLAST of Cla97C01G005230 vs. NCBI nr
Match: XP_022958497.1 (uncharacterized protein LOC111459707 [Cucurbita moschata])

HSP 1 Score: 849.0 bits (2192), Expect = 8.1e-243
Identity = 399/482 (82.78%), Postives = 437/482 (90.66%), Query Frame = 0

Query: 1   MLIEGLDRRPLKDLAHEVVRLRFNCVRLTYATHMFTRYANRTVEENFDILDLKASKAGLA 60
           MLIEGL  RPLKDLA E+V L+FNCVRLTYATHMFTRYANRTVEENFD+LDL+ASK GLA
Sbjct: 56  MLIEGLANRPLKDLADELVSLKFNCVRLTYATHMFTRYANRTVEENFDLLDLRASKVGLA 115

Query: 61  LHNPFVLNMTIFEAYEAAIDVLGTSGLMVIADNHISQPKWCCSLDDGNGFFGDRYFDPEE 120
            HNPFVLNMTIFEAYE  +DVLGTSGLMVIADNHISQP+WCCSL+DGNGFFGDRYFDP+E
Sbjct: 116 SHNPFVLNMTIFEAYETVVDVLGTSGLMVIADNHISQPRWCCSLNDGNGFFGDRYFDPQE 175

Query: 121 WLEGLRLVARRFYNKSTVVAMSLRNELRGAKSKSKDWNKYITQGATTIHNINPNILVIIS 180
           WLEGLRLVARRF NKS VVAMSLRNELRGAKS SKDWNKY+TQGATTIH+INPN+LVI+S
Sbjct: 176 WLEGLRLVARRFTNKSNVVAMSLRNELRGAKSSSKDWNKYMTQGATTIHDINPNLLVIVS 235

Query: 181 GLNFDNDLRCQRQNPLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICSRIINGFVE 240
           GLNFDNDLRCQR NPL LNNLHNKLVFEVHLYSFSG +++KFI NPLNKICS IINGFVE
Sbjct: 236 GLNFDNDLRCQRHNPLPLNNLHNKLVFEVHLYSFSGATKTKFITNPLNKICSTIINGFVE 295

Query: 241 RAEFVMEGTEAVPLFVSEFGFDQSGVNEADDRFLSCFSAHLAKEDLDWALWAWQGSYYYR 300
           RAEFVM+G EAVPLFVSEFGFDQ G N ADDRFLSCF AHLAK DLDWALWAWQGSYYYR
Sbjct: 296 RAEFVMQGAEAVPLFVSEFGFDQRGTNVADDRFLSCFVAHLAKTDLDWALWAWQGSYYYR 355

Query: 301 QGNVEPEEVFGVLNYNWSDVRNHRFSQMFGLLQTMLQDPNSNSSNSYLMYHPQSGQCVQV 360
           QG  + +EVFG+LNYNWS VRN RFS+ F LLQTML+DPNSN+ NSY+MYHPQSGQCV+V
Sbjct: 356 QGQAQFDEVFGILNYNWSGVRNPRFSKTFQLLQTMLRDPNSNAPNSYVMYHPQSGQCVRV 415

Query: 361 KDKMDEQIYLNNCSSASHWSHEGDGTPIMLEATDFCLKANGNGLPPSLSRDCFGEQSVWT 420
           KD M ++IYLN+CS+ASHWSH GDGTPI LEAT  CLKA+G+GL P LSRDC   +S WT
Sbjct: 416 KDMMSKEIYLNDCSNASHWSHRGDGTPIELEATGLCLKADGDGLRPLLSRDCLSNESSWT 475

Query: 421 AISDSKLHLATLTKQGNGLCLEKESSNSTKIVMGRCVCVGNDSNCLQDTQAQWFELVVTN 480
            IS+SKLHLATLT+ GNGLCLEK+SSNST+IVMGRCVCVG+DSNCL DT++QWFELVVTN
Sbjct: 476 TISNSKLHLATLTRHGNGLCLEKDSSNSTRIVMGRCVCVGDDSNCLDDTRSQWFELVVTN 535

Query: 481 TL 483
           TL
Sbjct: 536 TL 537

BLAST of Cla97C01G005230 vs. NCBI nr
Match: XP_022995241.1 (uncharacterized protein LOC111490847 [Cucurbita maxima])

HSP 1 Score: 844.7 bits (2181), Expect = 1.5e-241
Identity = 396/482 (82.16%), Postives = 436/482 (90.46%), Query Frame = 0

Query: 1   MLIEGLDRRPLKDLAHEVVRLRFNCVRLTYATHMFTRYANRTVEENFDILDLKASKAGLA 60
           MLIEGL  RPLKDLA E+V L+FNCVRLTYATHMFTRYANRTVEENFD+LDL+ASKAGLA
Sbjct: 56  MLIEGLAHRPLKDLADELVNLKFNCVRLTYATHMFTRYANRTVEENFDLLDLRASKAGLA 115

Query: 61  LHNPFVLNMTIFEAYEAAIDVLGTSGLMVIADNHISQPKWCCSLDDGNGFFGDRYFDPEE 120
            HNPF+LNMTIF+AYEA +DVLGTSGLMVIADNHISQP+WCCSL+DGNGFFGDRYFDP+E
Sbjct: 116 SHNPFILNMTIFDAYEAVVDVLGTSGLMVIADNHISQPRWCCSLNDGNGFFGDRYFDPQE 175

Query: 121 WLEGLRLVARRFYNKSTVVAMSLRNELRGAKSKSKDWNKYITQGATTIHNINPNILVIIS 180
           WLEGLRLVARRF NK  VVAMSLRNELRGAKS SKDWNKY+TQGATTIH+INPN+LVI+S
Sbjct: 176 WLEGLRLVARRFTNKLNVVAMSLRNELRGAKSSSKDWNKYMTQGATTIHDINPNLLVIVS 235

Query: 181 GLNFDNDLRCQRQNPLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICSRIINGFVE 240
           GLNFDNDLRCQR NPL LNNLHNKLVFEVHLYSFSG +++KFI NPLNKICS IINGFVE
Sbjct: 236 GLNFDNDLRCQRHNPLPLNNLHNKLVFEVHLYSFSGATKTKFITNPLNKICSAIINGFVE 295

Query: 241 RAEFVMEGTEAVPLFVSEFGFDQSGVNEADDRFLSCFSAHLAKEDLDWALWAWQGSYYYR 300
           RAEFVM+G EAVPLFVSEFGFDQ G N ADDRF SCF AHLA+ DLDWALWAWQGSYYYR
Sbjct: 296 RAEFVMQGAEAVPLFVSEFGFDQRGTNVADDRFSSCFVAHLARTDLDWALWAWQGSYYYR 355

Query: 301 QGNVEPEEVFGVLNYNWSDVRNHRFSQMFGLLQTMLQDPNSNSSNSYLMYHPQSGQCVQV 360
           QG  + +EVFGVLNYNWSDVRN  FS+ F LLQTML+DPNSN+ NSY+MYHPQSGQCV+V
Sbjct: 356 QGQAQSDEVFGVLNYNWSDVRNPHFSKTFQLLQTMLRDPNSNAPNSYVMYHPQSGQCVRV 415

Query: 361 KDKMDEQIYLNNCSSASHWSHEGDGTPIMLEATDFCLKANGNGLPPSLSRDCFGEQSVWT 420
           KD M + IYLN+CS+ASHWSH GDGTPI LEAT  CLKA+ +GL P LSRDC G++S WT
Sbjct: 416 KDMMSKDIYLNDCSNASHWSHRGDGTPIELEATSLCLKADRDGLRPLLSRDCSGDESAWT 475

Query: 421 AISDSKLHLATLTKQGNGLCLEKESSNSTKIVMGRCVCVGNDSNCLQDTQAQWFELVVTN 480
            IS+SKLHLAT T+QGNGLCLEKESSNST+IVMGRC+CVG+DSNCL DT++QWFELV TN
Sbjct: 476 TISNSKLHLATFTRQGNGLCLEKESSNSTRIVMGRCLCVGDDSNCLDDTRSQWFELVGTN 535

Query: 481 TL 483
           TL
Sbjct: 536 TL 537

BLAST of Cla97C01G005230 vs. TrEMBL
Match: tr|A0A1S3CTF8|A0A1S3CTF8_CUCME (major extracellular endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103504686 PE=3 SV=1)

HSP 1 Score: 912.5 bits (2357), Expect = 3.9e-262
Identity = 433/483 (89.65%), Postives = 461/483 (95.45%), Query Frame = 0

Query: 1   MLIEGLDRRPLKDLAHEVVRLRFNCVRLTYATHMFTRYANRTVEENFDILDLKASKAGLA 60
           MLIEGLDRRPLKDLA+EV+RL+FNCVRLTYATHMFTRYANRTVEENFD+LDL+ASK GLA
Sbjct: 57  MLIEGLDRRPLKDLANEVMRLKFNCVRLTYATHMFTRYANRTVEENFDLLDLRASKVGLA 116

Query: 61  LHNPFVLNMTIFEAYEAAIDVLGTSGLMVIADNHISQPKWCCSLDDGNGFFGDRYFDPEE 120
           LHNPFVLNMTIFEAYEA +DVLGTSGLMVIADNHISQP+WCCSL+DGNGFFGDRYFD EE
Sbjct: 117 LHNPFVLNMTIFEAYEAVVDVLGTSGLMVIADNHISQPRWCCSLEDGNGFFGDRYFDSEE 176

Query: 121 WLEGLRLVARRFYNKSTVVAMSLRNELRGAKSKSKDWNKYITQGATTIHNINPNILVIIS 180
           WLEGLRLVARRFYNKS VVAMSLRNELRGA SKSKDWNKY+TQGATTIHNINPNILVIIS
Sbjct: 177 WLEGLRLVARRFYNKSAVVAMSLRNELRGASSKSKDWNKYVTQGATTIHNINPNILVIIS 236

Query: 181 GLNFDNDLRCQRQNPLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICSRIINGFVE 240
           GLNFDNDLRCQRQ PLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICS+IINGFV+
Sbjct: 237 GLNFDNDLRCQRQYPLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICSKIINGFVQ 296

Query: 241 RAEFVMEGTEAVPLFVSEFGFDQSGVNEADDRFLSCFSAHLAKEDLDWALWAWQGSYYYR 300
           RAEFVMEG EAVPLFVSEFG DQ+GVNEADDRFLSCFSAHL ++DLDWALW WQGSYYYR
Sbjct: 297 RAEFVMEGAEAVPLFVSEFGLDQTGVNEADDRFLSCFSAHLVEKDLDWALWGWQGSYYYR 356

Query: 301 QGNVEPEEVFGVLNYNWSDVRNHRFSQMFGLLQTMLQDPNSNSSNSYLMYHPQSGQCVQV 360
           QG VE EEVFGVLNYNWSDVRN RFSQMF LLQTMLQDPNSNSSN+YLMYHPQSGQCVQV
Sbjct: 357 QGKVELEEVFGVLNYNWSDVRNPRFSQMFQLLQTMLQDPNSNSSNTYLMYHPQSGQCVQV 416

Query: 361 KDKMDEQIYLNNCSSASHWSHEGDGTPIMLEATDFCLKANGNGLPPSLSRDCFGEQSVWT 420
            D   ++I+LNNCS+ASHWS+EGDGTPIML +T+FCLKANGNGLPPSLSRDCFGEQSVWT
Sbjct: 417 HDMKQKEIFLNNCSNASHWSYEGDGTPIMLASTNFCLKANGNGLPPSLSRDCFGEQSVWT 476

Query: 421 AISDSKLHLATLTKQG-NGLCLEKESSNSTKIVMGRCVCVGNDSNCLQDTQAQWFELVVT 480
           AISDSKLHLATLTKQG NG+CLEKESSNS++I+M  CVCVG+DSNCLQDTQAQWF+LVVT
Sbjct: 477 AISDSKLHLATLTKQGNNGMCLEKESSNSSRILMRSCVCVGSDSNCLQDTQAQWFQLVVT 536

Query: 481 NTL 483
           NTL
Sbjct: 537 NTL 539

BLAST of Cla97C01G005230 vs. TrEMBL
Match: tr|A0A0A0K853|A0A0A0K853_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G028440 PE=3 SV=1)

HSP 1 Score: 704.5 bits (1817), Expect = 1.6e-199
Identity = 323/483 (66.87%), Postives = 397/483 (82.19%), Query Frame = 0

Query: 1   MLIEGLDRRPLKDLAHEVVRLRFNCVRLTYATHMFTRYANRTVEENFDILDLKASKAGLA 60
           MLIEGL+ RPLK+LA E ++LRFNCVRLTYATHMFTRYANRTVEENFD+LDL+ +KAGLA
Sbjct: 57  MLIEGLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLEQAKAGLA 116

Query: 61  LHNPFVLNMTIFEAYEAAIDVLGTSGLMVIADNHISQPKWCCSLDDGNGFFGDRYFDPEE 120
            +NPFVLN TI EAYEA +DVLG SGLMVIADNH+SQP+WCCSLDDGNGFFG+RYFDP+E
Sbjct: 117 QYNPFVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNRYFDPQE 176

Query: 121 WLEGLRLVARRFYNKSTVVAMSLRNELRGAKSKSKDWNKYITQGATTIHNINPNILVIIS 180
           WL+GL LVA+RF NKSTVV MSLRNELRG    + DWN Y+TQG TTIH INP +LVI+S
Sbjct: 177 WLQGLSLVAQRFNNKSTVVGMSLRNELRGMMENANDWNNYVTQGVTTIHKINPAVLVIVS 236

Query: 181 GLNFDNDLRCQRQNPLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICSRIINGFVE 240
           GLN+DNDLRC +  PL ++ L NKL FEVHLYSFSG+S+SKF+  PLN IC++I++ F++
Sbjct: 237 GLNYDNDLRCLKDKPLNVSTLDNKLAFEVHLYSFSGDSESKFVQQPLNNICAKIMHEFID 296

Query: 241 RAEFVMEGTEAVPLFVSEFGFDQSGVNEADDRFLSCFSAHLAKEDLDWALWAWQGSYYYR 300
            AEFV+EG    PLFVSE+G+DQ  V++A++RF+SCF+AHLA++DLDWALW WQGSYYYR
Sbjct: 297 HAEFVIEGPNPFPLFVSEYGYDQREVDDAENRFMSCFTAHLAQKDLDWALWTWQGSYYYR 356

Query: 301 QGNVEPEEVFGVLNYNWSDVRNHRFSQMFGLLQTMLQDPNSNSSNSYLMYHPQSGQCVQV 360
           +G  E  E FGVL+ NW+ ++N  F Q F LLQTMLQDP SN+S SY++YH QSGQC++V
Sbjct: 357 EGQAELAETFGVLDSNWTQIKNPNFVQKFQLLQTMLQDPYSNASFSYVIYHVQSGQCIEV 416

Query: 361 KDKMDEQIYLNNCSSASHWSHEGDGTPIMLEATDFCLKANGNGLPPSLSRDCFGEQSVWT 420
            +  +++I+L NCS++S WSH+ D TPI + +T  CLKA+G GL  SLS DC G+QS+W+
Sbjct: 417 SND-NKEIFLTNCSTSSRWSHDNDSTPIKMSSTGLCLKASGEGLEASLSTDCIGKQSLWS 476

Query: 421 AISDSKLHLATLTKQGNGLCLE-KESSNSTKIVMGRCVCVGNDSNCLQDTQAQWFELVVT 480
           AIS+S LHL T+T+ G  LCL+  ESSNS+KIV   C+C  ND  CLQDTQ+QWFELV T
Sbjct: 477 AISNSNLHLGTVTEDGKSLCLQIIESSNSSKIVTNSCICTTNDPTCLQDTQSQWFELVAT 536

Query: 481 NTL 483
           NTL
Sbjct: 537 NTL 538

BLAST of Cla97C01G005230 vs. TrEMBL
Match: tr|A0A1S3BDI2|A0A1S3BDI2_CUCME (major extracellular endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103488703 PE=3 SV=1)

HSP 1 Score: 663.7 bits (1711), Expect = 3.2e-187
Identity = 302/449 (67.26%), Postives = 368/449 (81.96%), Query Frame = 0

Query: 34  MFTRYANRTVEENFDILDLKASKAGLALHNPFVLNMTIFEAYEAAIDVLGTSGLMVIADN 93
           MFTRYANRTVEENFD+LDL  +KAGL  +NPFVLN TI EAYEA +DVLG SGLMVIADN
Sbjct: 1   MFTRYANRTVEENFDLLDLGQAKAGLTQYNPFVLNKTIAEAYEAVVDVLGASGLMVIADN 60

Query: 94  HISQPKWCCSLDDGNGFFGDRYFDPEEWLEGLRLVARRFYNKSTVVAMSLRNELRGAKSK 153
           H+SQP+WCCSLDDGNGFFG+RYFDP+EWL+GL LVA+RF NKSTVV MSLRNE+RG    
Sbjct: 61  HMSQPRWCCSLDDGNGFFGNRYFDPQEWLQGLSLVAQRFNNKSTVVGMSLRNEIRGMMEN 120

Query: 154 SKDWNKYITQGATTIHNINPNILVIISGLNFDNDLRCQRQNPLQLNNLHNKLVFEVHLYS 213
           + DWN Y+TQG TTIHNINP +LVI+ GLN+DNDLRC ++ PL ++ L NKLVFEVHLYS
Sbjct: 121 ANDWNHYVTQGVTTIHNINPEVLVIVGGLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYS 180

Query: 214 FSGESQSKFIHNPLNKICSRIINGFVERAEFVMEGTEAVPLFVSEFGFDQSGVNEADDRF 273
           FSG S+SKF+  PLN IC++IIN F++ AEFV+EG+   PLFVSE+G+DQ  V++A++RF
Sbjct: 181 FSGASESKFVQQPLNNICAKIINEFIDHAEFVIEGSNPFPLFVSEYGYDQREVDDAENRF 240

Query: 274 LSCFSAHLAKEDLDWALWAWQGSYYYRQGNVEPEEVFGVLNYNWSDVRNHRFSQMFGLLQ 333
           +SCF+AHLA++DLDWALW WQGSYYYR+G  E  E FGVL  NW+ ++N  F Q F LLQ
Sbjct: 241 MSCFTAHLAQKDLDWALWTWQGSYYYREGQAELPETFGVLESNWTQIKNPNFVQKFQLLQ 300

Query: 334 TMLQDPNSNSSNSYLMYHPQSGQCVQVKDKMDEQIYLNNCSSASHWSHEGDGTPIMLEAT 393
           TMLQDPNSN+S SY++YHPQSGQC++V +  ++ I+L NCS++S WSH+ D TPI +  T
Sbjct: 301 TMLQDPNSNASFSYVIYHPQSGQCIEVSND-NKDIFLTNCSTSSRWSHDNDSTPIKMSNT 360

Query: 394 DFCLKANGNGLPPSLSRDCFGEQSVWTAISDSKLHLATLTKQGNGLCLEKESSNSTKIVM 453
             CLKA+G GL  SLS DC G+QSVW+AIS+SKLHLAT+T+ G  LCL+ ESSNS+KIV 
Sbjct: 361 GLCLKASGEGLAASLSNDCLGKQSVWSAISNSKLHLATVTENGKSLCLQIESSNSSKIVT 420

Query: 454 GRCVCVGNDSNCLQDTQAQWFELVVTNTL 483
             C+C  +D  CLQDTQ+QWFELV TNTL
Sbjct: 421 NSCICTTDDPTCLQDTQSQWFELVETNTL 448

BLAST of Cla97C01G005230 vs. TrEMBL
Match: tr|A0A0A0KKZ0|A0A0A0KKZ0_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G168960 PE=3 SV=1)

HSP 1 Score: 633.6 bits (1633), Expect = 3.5e-178
Identity = 302/343 (88.05%), Postives = 321/343 (93.59%), Query Frame = 0

Query: 141 MSLRNELRGAKSKSKDWNKYITQGATTIHNINPNILVIISGLNFDNDLRCQRQNPLQLNN 200
           MSLRNELRGA SKSKDWNKYITQGATTIHNINP ILVIISGLNFDNDLRCQRQ PLQLNN
Sbjct: 1   MSLRNELRGASSKSKDWNKYITQGATTIHNINPKILVIISGLNFDNDLRCQRQYPLQLNN 60

Query: 201 LHNKLVFEVHLYSFSGESQSKFIHNPLNKICSRIINGFVERAEFVMEGTEAVPLFVSEFG 260
           LHNKLVFEVHLYSFSGESQSKFIHNPLNKICS++INGFVERAEFVMEG EAVPLFVSEFG
Sbjct: 61  LHNKLVFEVHLYSFSGESQSKFIHNPLNKICSKVINGFVERAEFVMEGAEAVPLFVSEFG 120

Query: 261 FDQSGVNEADDRFLSCFSAHLAKEDLDWALWAWQGSYYYRQGNVEPEEVFGVLNYNWSDV 320
            DQ GVNEADDRFLSCFSAHL ++DLDWALW WQGSYYYRQG V PEEVFGVLNYNWSDV
Sbjct: 121 LDQRGVNEADDRFLSCFSAHLVEKDLDWALWGWQGSYYYRQGKVGPEEVFGVLNYNWSDV 180

Query: 321 RNHRFSQMFGLLQTMLQDPNSNSSNSYLMYHPQSGQCVQVKDKMDEQIYLNNCSSASHWS 380
           RN  FSQMF LLQTMLQDPNSNSSN+Y+MYHPQSGQCV V+D    QIYLN+CS+ASHWS
Sbjct: 181 RNPHFSQMFQLLQTMLQDPNSNSSNTYVMYHPQSGQCVLVQDMKHMQIYLNDCSNASHWS 240

Query: 381 HEGDGTPIMLEATDFCLKANGNGLPPSLSRDCFGEQSVWTAISDSKLHLATLTKQG-NGL 440
           +EGDGTPIML +T+FCLKA+G+GLPPSLSRDCFGEQSVWTAISDSKLHLATLTKQG NG+
Sbjct: 241 YEGDGTPIMLASTNFCLKASGDGLPPSLSRDCFGEQSVWTAISDSKLHLATLTKQGNNGM 300

Query: 441 CLEKESSNSTKIVMGRCVCVGNDSNCLQDTQAQWFELVVTNTL 483
           CLEKESSNS++I+M  CVCVGNDSNCLQDTQAQWF+LVVTNTL
Sbjct: 301 CLEKESSNSSRILMRSCVCVGNDSNCLQDTQAQWFQLVVTNTL 343

BLAST of Cla97C01G005230 vs. TrEMBL
Match: tr|A0A059CGH5|A0A059CGH5_EUCGR (Uncharacterized protein OS=Eucalyptus grandis OX=71139 GN=EUGRSUZ_D01800 PE=3 SV=1)

HSP 1 Score: 547.0 bits (1408), Expect = 4.3e-152
Identity = 261/483 (54.04%), Postives = 341/483 (70.60%), Query Frame = 0

Query: 1   MLIEGLDRRPLKDLAHEVVRLRFNCVRLTYATHMFTR--YANRTVEENFDILDLKASKAG 60
           ML EGLD++PL  +  E+ RLRFNCVRLT+AT+MFT+  + ++ VEE  D L L  +K G
Sbjct: 60  MLAEGLDKKPLGVIVAEIRRLRFNCVRLTWATYMFTQPGHGDQPVEETLDSLGLAEAKGG 119

Query: 61  LALHNPFVLNMTIFEAYEAAIDVLGTSGLMVIADNHISQPKWCCSLDDGNGFFGDRYFDP 120
           +A +NP VLNMT  EAY A +D LG  G+MV+ DNH+S+PKWCC+ DDGNGFFGD YFDP
Sbjct: 120 VARNNPLVLNMTHVEAYAAVVDELGKQGVMVVLDNHVSKPKWCCAYDDGNGFFGDEYFDP 179

Query: 121 EEWLEGLRLVARRFYNKSTVVAMSLRNELRGAKSKSKDWNKYITQGATTIHNINPNILVI 180
           EEWL GL  VA  F  KS VV MS+RNELRG +    DW +YI   AT +H  NPN+LVI
Sbjct: 180 EEWLRGLVAVAEHFNGKSQVVGMSVRNELRGPRQNDYDWYQYIRTAATKVHQANPNVLVI 239

Query: 181 ISGLNFDNDLRCQRQNPLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICSRIINGF 240
           +SGLN+ +DL   R+ P+ L +L  KLV+E H YSFSG+ +   +  P++++C+  +   
Sbjct: 240 LSGLNWASDLSFLRKRPVGL-SLGRKLVYEAHWYSFSGDRKIWEV-QPVDRVCANAVQRM 299

Query: 241 VERAEFVMEGTEAVPLFVSEFGFDQSGVNEADDRFLSCFSAHLAKEDLDWALWAWQGSYY 300
            ++A F+  G  AVPLF+ EFGFDQ+G ++ADDRFLSCF  + A +DLDWALWA QGSYY
Sbjct: 300 EDQAGFLSSGPGAVPLFLGEFGFDQTGKSQADDRFLSCFMGYAAGKDLDWALWALQGSYY 359

Query: 301 YRQGNVEPEEVFGVLNYNWSDVRNHRFSQMFGLLQTMLQDPNSNSSNSYLMYHPQSGQCV 360
           YRQG V PEE FGVL++NW  +RN +F + F L+QTM+QDP+SNS  SY+MYHPQSG C+
Sbjct: 360 YRQGVVGPEETFGVLDFNWDGLRNPKFKERFQLVQTMVQDPSSNSPMSYIMYHPQSGLCI 419

Query: 361 QVKDKMDEQIYLNNCSSASHWSHEGDGTPIMLEATDFCLKANGNGLPPSLSRDCFGEQSV 420
           +  +  + +I    C   S W H  DG+PI L  T  CLKA G+GLPP LS DC   +S 
Sbjct: 420 RANN--NHEIGTAECQHWSRWIHYRDGSPIRLMGTPLCLKALGDGLPPVLSNDCSNRRSA 479

Query: 421 WTAISDSKLHLATLTKQGNGLCLEKESSNSTKIVMGRCVCVGNDSNCLQDTQAQWFELVV 480
           W +IS+SKLH+A   + GN LCLEK+S+ S+ I+  +C+CV +DS C ++ Q QWF+ V 
Sbjct: 480 WRSISNSKLHVAATDEHGNRLCLEKKSNESSVILTRKCICVDDDSGCTENPQGQWFKFVP 538

Query: 481 TNT 482
           TNT
Sbjct: 540 TNT 538

BLAST of Cla97C01G005230 vs. Swiss-Prot
Match: sp|C0HLA0|GH5FP_CHAOB (Glycosyl hydrolase 5 family protein OS=Chamaecyparis obtusa OX=13415 PE=1 SV=1)

HSP 1 Score: 370.9 bits (951), Expect = 2.1e-101
Identity = 204/494 (41.30%), Postives = 287/494 (58.10%), Query Frame = 0

Query: 2   LIEGLDRRPLKDLAHEVVRLRFNCVRLTYATHMFTR--YANRTVEENFDILDLKASKAGL 61
           L EGL+R P+  +AH +  L FNCVRLTY+ HM TR  Y N TV + F  L+L  + +G+
Sbjct: 62  LPEGLNRLPVATVAHTISSLGFNCVRLTYSIHMLTRTSYTNATVAQTFARLNLTEAASGI 121

Query: 62  ALHNPFVLNMTIFEAYEAAIDVLGTSGLMVIADNHISQPKWCCSLDDGNGFFGDRYFDPE 121
             +NP +L++    AY   +  L  +G+MVI DNH+S+PKWCC++DDGNGFFGDRYF+P 
Sbjct: 122 EHNNPELLDLGHVAAYHHVVAALSEAGVMVILDNHVSKPKWCCAVDDGNGFFGDRYFNPN 181

Query: 122 EWLEGLRLVARRFYNKSTVVAMSLRNELRGAKSKSKDWNKYITQGATTIHNINPNILVII 181
            W+EGL L+A  F N   VVAMSLRNELRG +S    W++++  GA T+H  NP +LVI+
Sbjct: 182 TWVEGLGLMATYFNNTPNVVAMSLRNELRGNRSTPISWSRHMQWGAATVHKANPKVLVIL 241

Query: 182 SGLNFDNDLRCQRQNPLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICSRIINGFV 241
           SGL FD DL      P+ L     K+V+E H YSF    ++       N +C      F 
Sbjct: 242 SGLQFDTDLSFLPVLPVTL-PFKEKIVYEGHWYSFGVPWRTGL----PNDVCKNETGRFK 301

Query: 242 ERAEFVMEGTE--AVPLFVSEFGFDQSGVNEADDRFLSCFSAHLAKEDLDWALWAWQGSY 301
               FV       A PLF+SEFG DQ  VN+ D+R+L+C  A+LA+EDLDWALW   GSY
Sbjct: 302 SNVGFVTSSANATAAPLFMSEFGIDQRYVNDNDNRYLNCILAYLAEEDLDWALWTMGGSY 361

Query: 302 YYR---QGNVEPEEVFGVLNYNWSDVRNHRFSQMFGLLQTMLQDPNSNSSNSY-LMYHPQ 361
           YYR   Q   + EE +G  N++WS +RN  F      +Q  +QDP       Y ++YHP 
Sbjct: 362 YYRSDKQPVKDFEETYGFFNHDWSRIRNPDFISRLKEIQQPIQDPYLAPGPYYQIIYHPA 421

Query: 362 SGQCVQVKDKMDEQIYLNNCSSA-SHWSHEGD-GTPIMLEATDFCLKANGNGLPPSLSRD 421
           SG CV+    +   ++L +C S  S W+++     PI L  +  C+   GNGLP  ++ +
Sbjct: 422 SGLCVE--SGIGNTVHLGSCQSVRSRWNYDASVKGPIGLMGSSSCISTQGNGLPAIMTEN 481

Query: 422 CFG-EQSVWTAISDSKLHLAT--LTKQGNGLCLEKESSNSTKIVMGRCVCVGNDSNCLQ- 481
           C     ++W+ +S ++L L T  L K G    +  + S S  I    C+C+  DS+C   
Sbjct: 482 CSAPNNTLWSTVSSAQLQLGTRVLGKDGKEKWMCLDGSKSPLISTNECICI-TDSHCYPK 541

BLAST of Cla97C01G005230 vs. Swiss-Prot
Match: sp|P19487|GUNA_XANCP (Major extracellular endoglucanase OS=Xanthomonas campestris pv. campestris (strain ATCC 33913 / DSM 3586 / NCPPB 528 / LMG 568 / P 25) OX=190485 GN=engXCA PE=1 SV=2)

HSP 1 Score: 65.1 bits (157), Expect = 2.5e-09
Identity = 81/356 (22.75%), Postives = 136/356 (38.20%), Query Frame = 0

Query: 2   LIEGLDRRPLKDLAHEVVRLRFNCVRLTYATHMFTRYANRTVEENFDILDLKASKAGLAL 61
           ++ GL  R  KD+  ++  L FN VRL +         + T+  + D             
Sbjct: 58  VMHGLWARNWKDMIVQMQGLGFNAVRLPFCP---ATLRSDTMPASIDY-----------S 117

Query: 62  HNPFVLNMTIFEAYEAAIDVLGTSGLMVIADNHISQPKWCCSLDDGNGFFGDRYFDPEEW 121
            N  +  +T  +  +  I      G+ V+ D+H      C  + +    +    +   +W
Sbjct: 118 RNADLQGLTSLQILDKVIAEFNARGMYVLLDHHTPD---CAGISE---LWYTGSYTEAQW 177

Query: 122 LEGLRLVARRFYNKSTVVAMSLRNELRGAK-----SKSKDWNKYITQGATTIHNINPNIL 181
           L  LR VA R+ N   V+ + L+NE  GA      + + DWNK   +G+  +  + P  L
Sbjct: 178 LADLRFVANRYKNVPYVLGLDLKNEPHGAATWGTGNAATDWNKAAERGSAAVLAVAPKWL 237

Query: 182 VIISGLNFDNDLRCQRQ---------NPL---QLNNLHNKLVFEVHLYSFSGESQSKF-- 241
           + + G+  DN + C             PL    LN   N+L+   H+Y      QS F  
Sbjct: 238 IAVEGIT-DNPV-CSTNGGIFWGGNLQPLACTPLNIPANRLLLAPHVYGPDVFVQSYFND 297

Query: 242 --IHNPLNKICSRIINGFVERAEFVMEGTEAVPLFVSEFGFDQSGVNEADDRFLSCFSAH 301
               N +  I  R    F         GT A  L + EFG      +  D  +      +
Sbjct: 298 SNFPNNMPAIWERHFGQFA--------GTHA--LLLGEFGGKYGEGDARDKTWQDALVKY 357

Query: 302 LAKEDLDWAL-WAWQGSYYYRQGNVEPEEVFGVLNYNWSDVRNHRFSQMFGLLQTM 336
           L  + ++    W+W         N    +  G+L  +W+ VR  + +    LL+T+
Sbjct: 358 LRSKGINQGFYWSW---------NPNSGDTGGILRDDWTSVRQDKMT----LLRTL 368

BLAST of Cla97C01G005230 vs. TAIR10
Match: AT1G13130.1 (Cellulase (glycosyl hydrolase family 5) protein)

HSP 1 Score: 374.0 bits (959), Expect = 1.4e-103
Identity = 189/481 (39.29%), Postives = 285/481 (59.25%), Query Frame = 0

Query: 1   MLIEGLDRRPLKDLAHEVVRLRFNCVRLTYATHMFTRYA---NRTVEENFDILDLKASKA 60
           ++ EGL ++P+  +A ++V + FNCVRLT+   + T      N TV ++F  L L     
Sbjct: 64  VVAEGLSKQPVDAVAKKIVEMGFNCVRLTWPLDLMTNETLANNVTVRQSFQSLGLNDDIV 123

Query: 61  GLALHNPFVLNMTIFEAYEAAIDVLGTSGLMVIADNHISQPKWCCSLDDGNGFFGDRYFD 120
           G   +NP ++++ + EAY+  +  LG + +MVI DNH+++P WCC+ DDGNGFFGD++FD
Sbjct: 124 GFQTNNPSIIDLPLIEAYKTVVTTLGNNDVMVILDNHLTKPGWCCANDDGNGFFGDQFFD 183

Query: 121 PEEWLEGLRLVARRFYNKSTVVAMSLRNELRGAKSKSKDWNKYITQGATTIHNINPNILV 180
           P  W+  L+ +A  F   S VV MSLRNELRG K    DW KY+ QGA  +H+ N  +LV
Sbjct: 184 PTVWVAALKKMAATFNGVSNVVGMSLRNELRGPKQNVNDWFKYMQQGAEAVHSANNKVLV 243

Query: 181 IISGLNFDNDLRCQRQNPLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICSRIING 240
           I+SGL+FD DL   R  P++L +   KLVFE+H YSFS +  S   +NP N IC R++N 
Sbjct: 244 ILSGLSFDADLSFVRSRPVKL-SFTGKLVFELHWYSFS-DGNSWAANNP-NDICGRVLNR 303

Query: 241 FVERAEFVMEGTEAVPLFVSEFGFDQSGVNEADDRFLSCFSAHLAKEDLDWALWAWQGSY 300
                 +++   +  PLF+SEFG D+ GVN  D+R+  C +   A+ D+DW+LWA  GSY
Sbjct: 304 IGNGGGYLL--NQGFPLFLSEFGIDERGVNTNDNRYFGCLTGWAAENDVDWSLWALTGSY 363

Query: 301 YYRQGNVEPEEVFGVLNYNWSDVRNHRFSQMFGLLQTMLQDPNSNSSNSYLMYHPQSGQC 360
           Y RQG V   E +GVL+ +W  VRN  F Q    LQ+ LQ P   +    L++HP +G C
Sbjct: 364 YLRQGKVGMNEYYGVLDSDWISVRNSSFLQKISFLQSPLQGPGPRTDAYNLVFHPLTGLC 423

Query: 361 VQVKDKMDEQIYLNNCSSASHWSHEGDGTPIMLEATDFCLKANGNGLPPSLSR-DCFGEQ 420
           +       + + L  C+S+  WS+      + ++    CL++NG   P +++R  C    
Sbjct: 424 IVRSLDDPKMLTLGPCNSSEPWSYTKKA--LRIKDQQLCLQSNGPKNPVTMTRTSCSTSG 483

Query: 421 SVWTAISDSKLHLATLTKQGNGLCLEKESSNSTKIVMGRCVCVGNDSNCLQDTQAQWFEL 478
           S W  IS S++HLA+ T     LCL+ +++N+  +V   C C+  D +C  +  +QWF++
Sbjct: 484 SKWQTISASRMHLASTTSNKTSLCLDVDTANN--VVANACKCLSKDKSC--EPMSQWFKI 533

BLAST of Cla97C01G005230 vs. TAIR10
Match: AT3G26130.1 (Cellulase (glycosyl hydrolase family 5) protein)

HSP 1 Score: 364.4 bits (934), Expect = 1.1e-100
Identity = 193/482 (40.04%), Postives = 285/482 (59.13%), Query Frame = 0

Query: 4   EGLDRRPLKDLAHEVVRLRFNCVRLTYATHMFTR---YANRTVEENFDILDLKASKAGLA 63
           EGL ++PL  +A ++V + FNCVRLT+  ++ T     A  TV ++     L  + +G  
Sbjct: 57  EGLSKQPLDAIAEKIVSMGFNCVRLTWPLYLATDESFSAFMTVRQSLRKFRLFEAVSGFQ 116

Query: 64  LHNPFVLNMTIFEAYEAAIDVLGTSGLMVIADNHISQPKWCCSLDDGNGFFGDRYFDPEE 123
            HNP +L++ + +A++  +  L    +MVI DNHISQP WCCS +DGNGFFGD++ +P+ 
Sbjct: 117 THNPTILDLPLIKAFQEVVYCLEKHRVMVILDNHISQPGWCCSDNDGNGFFGDKHLNPQV 176

Query: 124 WLEGLRLVARRFYN-KSTVVAMSLRNELRGAKSKSKDWNKYITQGATTIHNINPNILVII 183
           W++GL+ +A  F N  S VV MSLRNELRG K   KDW KY+ +GA  +H++NPN+LVI+
Sbjct: 177 WIKGLKKMASMFANVSSNVVGMSLRNELRGPKQNIKDWYKYMREGAEAVHSVNPNVLVIV 236

Query: 184 SGLNFDNDLRCQRQNPLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICSRIINGFV 243
           SGLN+  DL   R+ P ++ +   K+VFE+H Y F    +     + LNKIC +     +
Sbjct: 237 SGLNYATDLSFLRERPFEV-SFRRKVVFEIHWYGFWNTWEG----DNLNKICGKETEKMM 296

Query: 244 ERAEFVMEGTEAVPLFVSEFGFDQSGVNEADDRFLSCFSAHLAKEDLDWALWAWQGSYYY 303
           + + F++E  + +PLFVSEFG DQ G N  D++FLSCF A  A  DLDW+LW   GSYY 
Sbjct: 297 KMSGFLLE--KGIPLFVSEFGIDQRGNNANDNKFLSCFMALAADRDLDWSLWTLAGSYYI 356

Query: 304 RQGNVEPEEVFGVLNYNWSDVRNHRFSQMFGLLQTMLQDPNSNSSNSYLMYHPQSGQCVQ 363
           R+ ++  +E +GVL++NWS +RN    QM   +QT             +M+HP +G C+ 
Sbjct: 357 REKSIGSDESYGVLDFNWSSIRNSTILQMISAIQTPFIGLMETQPKK-IMFHPSTGLCIV 416

Query: 364 VKDKMDEQIYLNNCSSASHWSHEGDGTPIMLEATDFCLKANGNGLPPSLSRDCFGEQ--S 423
            K     Q+ L +C+ +  W         + E    CLKA   G    L R  F E   S
Sbjct: 417 RKSLF--QLKLGSCNRSESWRLSSHRVLSLAEEQILCLKAYEKGKSVKL-RLFFSESYCS 476

Query: 424 VWTAISDSKLHLATLTKQGNGLCLEKESSNSTKIVMGRCVCVGNDSNCLQDTQAQWFELV 480
            W   SDSK+ L+++TK G  +CL+ ++ N+  IV   C C+  +S+C  D ++QWF+LV
Sbjct: 477 KWKLFSDSKMQLSSITKNGFSVCLDVDTENN-NIVTNSCKCLRGNSSC--DPRSQWFKLV 524

BLAST of Cla97C01G005230 vs. TAIR10
Match: AT3G26140.1 (Cellulase (glycosyl hydrolase family 5) protein)

HSP 1 Score: 336.7 bits (862), Expect = 2.4e-92
Identity = 183/483 (37.89%), Postives = 269/483 (55.69%), Query Frame = 0

Query: 1   MLIEGLDRRPLKDLAHEVVRLRFNCVRLTYATHMFTRYA---NRTVEENFDILDLKASKA 60
           ++ EGL ++ + DLA +++ + FNCVR T+   + T      N TV ++F  L L    +
Sbjct: 34  VVAEGLSKQSVDDLAKKIMAMGFNCVRFTWPLDLATNETLANNVTVRQSFQSLGLNDDIS 93

Query: 61  GLALHNPFVLNMTIFEAYEAAIDVLGTSGLMVIADNHISQPKWCCSLDDGNGFFGDRYFD 120
           G    NP ++++ + EAY+  +  LG + +MVI DNH+++P WCC  +DGNGFFGD +FD
Sbjct: 94  GFETKNPSMIDLPLIEAYKKVVAKLGNNNVMVILDNHVTKPGWCCGYNDGNGFFGDTFFD 153

Query: 121 PEEWLEGLRLVARRFYNKSTVVAMSLRNELRGAKSKSKDWNKYITQGATTIHNINPNILV 180
           P  W+ GL  +A  F   + VV MSLRNELRG K    DW KY+ QGA  +H  NPN+LV
Sbjct: 154 PTTWIAGLTKIAMTFKGATNVVGMSLRNELRGPKQNVDDWFKYMQQGAEAVHEANPNVLV 213

Query: 181 IISGLNFDNDLRCQRQNPLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICSRIING 240
           I+SGL++D DL   R   + L     KLVFE+H YSF+  + +    NP N+ C  I+  
Sbjct: 214 ILSGLSYDTDLSFVRSRHVNL-TFTRKLVFELHRYSFT-NTNTWSSKNP-NEACGEILKS 273

Query: 241 FVERAEFVMEGTEAVPLFVSEFGFDQSGVNEADDRFLSCFSAHLAKEDLDWALWAWQGSY 300
                 F +      P+F+SEFG D  G N  D+R++ C     A+ D+DW++W  QGSY
Sbjct: 274 IENGGGFNLRD---FPVFLSEFGIDLRGKNVNDNRYIGCILGWAAENDVDWSIWTLQGSY 333

Query: 301 YYRQGNVEPEEVFGVLNYNWSDVRNHRFSQMFGLLQTMLQDPNSNSSNSYLMYHPQSGQC 360
           Y R+G V   E +G+L+ +W  VR+  F Q   L+ + LQ P S S    L++HP +G C
Sbjct: 334 YLREGVVGMSEFYGILDSDWVRVRSQSFLQRLSLILSPLQGPGSQSKVYNLVFHPLTGLC 393

Query: 361 VQVKDKMDEQIYLNNCSSASHWSHEGDGTPIMLEATDFCLKANGNGLPPSLSR-DCFGEQ 420
           +        ++ L  C+ +  WS+    T + L+    CL++ G   P  LS   C    
Sbjct: 394 MLQSILDPTKVTLGLCNESQPWSYTPQNT-LTLKDKSLCLESTGPNAPVKLSETSCSSPN 453

Query: 421 -SVWTAISDSKLHLATLTKQGNGLCLEKESSNSTKIVMGRCVCV-GNDSNCLQDTQAQWF 478
            S W  IS S + LA      N LCL+ + +N+  ++   C CV G DS+C  D  +QWF
Sbjct: 454 LSEWETISASNMLLAA-KSTNNSLCLDVDETNN--LMASNCKCVKGEDSSC--DPISQWF 504

BLAST of Cla97C01G005230 vs. TAIR10
Match: AT5G17500.1 (Glycosyl hydrolase superfamily protein)

HSP 1 Score: 316.6 bits (810), Expect = 2.6e-86
Identity = 172/481 (35.76%), Postives = 269/481 (55.93%), Query Frame = 0

Query: 1   MLIEGLDRRPLKDLAHEVVRLRFNCVRLTYATHMF---TRYANRTVEENFDILDLKASKA 60
           ++ EGL  +P+  ++ ++  + FNCVRLT+   +    T   N TV+++F+   L     
Sbjct: 57  VVAEGLSSQPMDSISKKIKDMGFNCVRLTWPLELMINDTLAFNVTVKQSFERYGLDHELQ 116

Query: 61  GLALHNPFVLNMTIFEAYEAAIDVLGTSGLMVIADNHISQPKWCCSLDDGNGFFGDRYFD 120
           G+  HNP+++N  +   ++A +  LG   +MVI DNH + P WCCS DD + FFGD  F+
Sbjct: 117 GIYTHNPYIVNTPLINVFQAVVYSLGRHDVMVILDNHKTVPGWCCSNDDPDAFFGDPKFN 176

Query: 121 PEEWLEGLRLVARRFYNKSTVVAMSLRNELRGAKSKSKDWNKYITQGATTIHNINPNILV 180
           P+ W+ GL+ +A  F N   VV MSLRNELRG    SKDW KY+ +GA  +H  NPN+LV
Sbjct: 177 PDLWMLGLKKMATIFMNVKNVVGMSLRNELRGYNHTSKDWYKYMQKGAEAVHTSNPNVLV 236

Query: 181 IISGLNFDNDLRCQRQNPLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICSRIING 240
           I+SGLNFD DL   +  P+ L +   KLV E+H YSF+  +     HN +N  CS++ + 
Sbjct: 237 ILSGLNFDADLSFLKDRPVNL-SFKKKLVLELHWYSFTDGTGQWKSHN-VNDFCSQMFSK 296

Query: 241 FVERAEFVMEGTEAVPLFVSEFGFDQSGVNEADDRFLSCFSAHLAKEDLDWALWAWQGSY 300
                 FV++  +  PLF+SEFG DQ G +   +R+++C  A  A++DLDWA+WA  G Y
Sbjct: 297 ERRTGGFVLD--QGFPLFLSEFGTDQRGGDLEGNRYMNCMLAWAAEKDLDWAVWAVTGVY 356

Query: 301 YYRQGNVEPEEVFGVLNYNWSDVRNHRFSQMFGLLQTMLQDPNSNSSNSYLMYHPQSGQC 360
           Y+R+G     E +G+L+ NW +V N+ + +   ++Q     P    ++   ++HP +G C
Sbjct: 357 YFREGKRGVVEAYGMLDANWHNVHNYTYLRRLSVIQPPHTGPGVKHNHHKKIFHPLTGLC 416

Query: 361 VQVKDKMDE-QIYLNNCSSASHWSHEGDGTPIMLEATDFCLKA-NGNGLPPSLSRDCFGE 420
           +  K    E ++ L  C+    WS+   G   +      CL+     G    L R C   
Sbjct: 417 LVRKSHCHESELTLGPCTKDEPWSYSHGGILEIRRGHKSCLEGETAVGKSVKLGRICTKI 476

Query: 421 QSVWTAISDSKLHLATLTKQGNGLCLEKESSNSTKIVMGRCVCVGNDSNCLQDTQAQWFE 477
           +     IS +K+HL+  T  G+ +CL+ +S N+  +V   C C+  D+ C  +  +QWF+
Sbjct: 477 EQ----ISATKMHLSFNTSDGSLVCLDVDSDNN--VVANSCNCLTGDTTC--EPASQWFK 525

BLAST of Cla97C01G005230 vs. TAIR10
Match: AT5G16700.1 (Glycosyl hydrolase superfamily protein)

HSP 1 Score: 258.5 bits (659), Expect = 8.4e-69
Identity = 162/482 (33.61%), Postives = 251/482 (52.07%), Query Frame = 0

Query: 4   EGLDRRPLKDLAHEVVRLRFNCVRLTYATHMFTR---YANRTVEENFDILDLKASKAGLA 63
           EGL ++PL  ++ ++V + FNCVRLT+   + T        TV+++F+ L L     G+ 
Sbjct: 58  EGLSKQPLDSISKKIVSMGFNCVRLTWPLDLVTNDTLALKVTVKQSFESLKLFEDVLGIQ 117

Query: 64  LHNPFVLNMTIFEAYEAAIDVLGTSGLMVIADNHISQPKWCCSLDDGNGFFGDRYFDPEE 123
            HNP +L++ +F A++  +  LG +G+MVI DNH++ P WCC  +D + FFG  +FDP  
Sbjct: 118 THNPKLLHLPLFNAFQEVVSNLGENGVMVILDNHLTTPGWCCGDNDLDAFFGYPHFDPLV 177

Query: 124 WLEGLRLVARRFYNKSTVVAMSLRNELRGAKSKSKDWNKYITQGATTIHNINPNILVIIS 183
           W +GLR +A  F N + V+ MSLRNE RGA+     W +++ QGA  +H  NP +LVI+S
Sbjct: 178 WAKGLRKMATLFRNFTHVIGMSLRNEPRGARDYPDLWFRHMPQGAEAVHAANPKLLVILS 237

Query: 184 GLNFDNDLRCQRQNPLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICSRIINGFVE 243
           G++FD +L   R   + + +  +KLVFE+H YSFS    S   HN  N  C +II     
Sbjct: 238 GIDFDTNLSFLRDRSVNV-SFTDKLVFELHWYSFSDGRDSWRKHNS-NDFCVKIIEKVTH 297

Query: 244 RAEFVMEGTEAVPLFVSEFGFDQSGVNEADDRFLSCFSAHLAKEDLDWALWAWQGSYYYR 303
              F++      PL +SEFG DQ G + + +R+++C  A  A+ DLDWA+WA  G YY R
Sbjct: 298 NGGFLL--GRGFPLILSEFGTDQRGGDMSGNRYMNCLVAWAAENDLDWAVWALTGDYYLR 357

Query: 304 QGNVEPEEVFGVLNYNWSDVRNHRFSQMFGLLQTMLQDPNSNSSNSYLMYHPQSGQCVQV 363
            G                           GL       PN N     L++HP +G CV  
Sbjct: 358 TGP--------------------------GL------RPNKN-----LLFHPSTGLCVTN 417

Query: 364 KDKMD-EQIYLNNCSSASHWS-HEGDGTPIMLEATDFCLKAN---GNGLPPSLSRDCFGE 423
               +   + L  C  +  W+ +  +G   +L     C++A    G  +   +   C   
Sbjct: 418 NPSDNIPTLRLGPCPKSDPWTFNPSEG---ILWINKMCVEAPNVVGQKVKLGVGTKC--- 477

Query: 424 QSVWTAISDSKLHLATLTKQGNGLCLEKESSNSTKIVMGRCVCVGNDSNCLQDTQAQWFE 478
            S    IS +K+HL+  T  G  LCL+ +  +++ +V  RC  +  D++C  D  +QWF+
Sbjct: 478 -SKLGQISATKMHLSFKTSNGLLLCLDVDERDNS-VVANRCKFLTMDASC--DPASQWFK 488

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008467306.15.9e-26289.65PREDICTED: major extracellular endoglucanase-like [Cucumis melo][more]
XP_004143723.12.8e-25989.03PREDICTED: uncharacterized protein LOC101213113 [Cucumis sativus][more]
XP_023534098.18.6e-24583.82uncharacterized protein LOC111795760 [Cucurbita pepo subsp. pepo][more]
XP_022958497.18.1e-24382.78uncharacterized protein LOC111459707 [Cucurbita moschata][more]
XP_022995241.11.5e-24182.16uncharacterized protein LOC111490847 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
tr|A0A1S3CTF8|A0A1S3CTF8_CUCME3.9e-26289.65major extracellular endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103504686 P... [more]
tr|A0A0A0K853|A0A0A0K853_CUCSA1.6e-19966.87Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G028440 PE=3 SV=1[more]
tr|A0A1S3BDI2|A0A1S3BDI2_CUCME3.2e-18767.26major extracellular endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103488703 P... [more]
tr|A0A0A0KKZ0|A0A0A0KKZ0_CUCSA3.5e-17888.05Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G168960 PE=3 SV=1[more]
tr|A0A059CGH5|A0A059CGH5_EUCGR4.3e-15254.04Uncharacterized protein OS=Eucalyptus grandis OX=71139 GN=EUGRSUZ_D01800 PE=3 SV... [more]
Match NameE-valueIdentityDescription
sp|C0HLA0|GH5FP_CHAOB2.1e-10141.30Glycosyl hydrolase 5 family protein OS=Chamaecyparis obtusa OX=13415 PE=1 SV=1[more]
sp|P19487|GUNA_XANCP2.5e-0922.75Major extracellular endoglucanase OS=Xanthomonas campestris pv. campestris (stra... [more]
Match NameE-valueIdentityDescription
AT1G13130.11.4e-10339.29Cellulase (glycosyl hydrolase family 5) protein[more]
AT3G26130.11.1e-10040.04Cellulase (glycosyl hydrolase family 5) protein[more]
AT3G26140.12.4e-9237.89Cellulase (glycosyl hydrolase family 5) protein[more]
AT5G17500.12.6e-8635.76Glycosyl hydrolase superfamily protein[more]
AT5G16700.18.4e-6933.61Glycosyl hydrolase superfamily protein[more]
The following terms have been associated with this gene:
Vocabulary: Biological Process
TermDefinition
GO:0005975carbohydrate metabolic process
Vocabulary: Molecular Function
TermDefinition
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
Vocabulary: INTERPRO
TermDefinition
IPR017853Glycoside_hydrolase_SF
IPR035992Ricin_B-like_lectins
IPR000772Ricin_B_lectin
IPR001547Glyco_hydro_5
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0016798 hydrolase activity, acting on glycosyl bonds
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C01G005230.1Cla97C01G005230.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3DG3DSA:2.80.10.50coord: 350..472
e-value: 1.2E-5
score: 27.2
NoneNo IPR availableGENE3DG3DSA:3.20.20.80coord: 1..331
e-value: 1.8E-60
score: 206.9
NoneNo IPR availablePANTHERPTHR31263:SF10SUBFAMILY NOT NAMEDcoord: 2..478
NoneNo IPR availablePANTHERPTHR31263FAMILY NOT NAMEDcoord: 2..478
IPR001547Glycoside hydrolase, family 5PFAMPF00150Cellulasecoord: 10..294
e-value: 7.0E-24
score: 84.6
IPR000772Ricin B, lectin domainPROSITEPS50231RICIN_B_LECTINcoord: 344..476
score: 9.066
IPR035992Ricin B-like lectinsSUPERFAMILYSSF50370Ricin B-like lectinscoord: 342..462
IPR017853Glycoside hydrolase superfamilySUPERFAMILYSSF51445(Trans)glycosidasescoord: 3..329

The following gene(s) are paralogous to this gene:

None