HG10019301 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10019301
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Endoglucanase
Location: Chr04: 20004180 .. 20010329 (+)
RNA-Seq Expression: HG10019301
Synteny: HG10019301
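
The location string can be split into chromosome, coordinates, and strand to derive the span of the gene. The following is a minimal sketch, not part of the database itself; the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive at both ends, which this record does not state explicitly.

```python
import re


def parse_location(text):
    """Parse a location such as 'Chr04: 20004180 .. 20010329 (+)'."""
    pattern = r"\s*([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based, inclusive coordinates, so both ends count.
        "length": end - start + 1,
    }


loc = parse_location("Chr04: 20004180 .. 20010329 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the gene spans 6150 bp on the plus strand of Chr04.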
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTCTTTCTCCCTTCTCTTTCAAACTCATTGCCTTCTCTTTCCTTCTTCTCACCCTTTCCGACGCTTCTCCTGCCACCGTCGGCCACCATCGCCGCCCTCGTTTTACCCCTCATAACTATAGAGACGCTCTTGCCAAATCTATCCTCTTCTTTCAAGGCCAAAGGTCCGGGAAACTCCCTCCAAATCAGAAGATGACTTGGAGGAAAGATTCTGGTTTATTCGATGGCTCAACAATGAACGTAAGTCAAATTACTACTAAACTCAAAAGCTTAAATTGATAACGTTTTTTAACTTTTTTTTAAGGGTTGAAATGGTAGGTTGATTTGGTTGGTGGGTACTATGATGCTGGGGATAATGTGAAGTTTGGGTTTCCTATGGCTTTCACAACCACTATGCTTTCATGGAGTGTTATTGAGTTTGGTGGAGTAATGAAAAATGAGCTCAATAATGCTAAAGAAGCCATTCGTTGGGCCACTGATTATCTCCTCAAAGCCACTGCTCTTCCTGACACCATTTTTGTTCAGGTCTTCACTTCATTTTCTCCTATTTTCAAATATAATAAGTAATTTTGTGTGTTTGTTTGGGAATTCATTCTAAATTCTCATGATATATTAAAATGACTAAAATACCCTCACACTACAAATTTAAAGTTACTGGTTTATTTTTGTGTTTAATAGTAGTTTTTTATATAGTAATTTGACATTGGATAATTGATTGCACTTAATCTCACGCATCCTTCCTTTAGTACATAGTAAAAAATAAAATAAAACAATTCAATTTTTTAATGAAAAAAGATGTCTACATGGTAAAACTTTATTGGGCAACCGTATGTAGGCTGCACAAAGATAGATGTAGTCCAAACAGCATCCAACTTTTTTTTTTTTTTTTTGACATGGAAGCATTTTGAATTTAGGACAAACCAAACAAATTTGTGTATAGAAATACTTTCGAGAAATGTAAATGAACATATAAAAAAATGTTTTAAAAAATAGCAATACTTGGATGCATCCCAAAATGTACCTATCTATATGCATCACATTCTCATCACACACACACAAAAAAATTAAAATATTTTAAAAAAATATAAGAAGAATTTGTCATGGGCAAATTATGATTGGACAATTTTGTGGGGATGCGCAAAGATAGGTGTAGCCTAAGATGCACTTAAATATTTTTCTTAAAAAAAACGTTTTTTTTTTCACCTGCAATAATGTTCTAAATCACTCTTCAAAAACACTTCAAAAACATGTTCTTATTTTTCCACTATGACAAAATGATCTCCTCTCTGGATGGAATAATTCTAAAGTAAAACCGAAAATATAATATCATATCATCATCGTAGAGATATATAAAAAATGTAAACTTTCACCGTTTTGCAGGTGGGTGATGCTAACAGGGACCATGCTTGTTGGGAGAGACCAGAAGACATGGACACACCAAGAACTGTGTTGAAAATAGACAAGAACACCCCTGGTTCTGAAGTAGCAGCTGAAACTGCAGCTGCTCTTGCTTCTGCTTCATTAGTCTTCAAAAAATCTGATCCCACATACTCAAACCTCTTAATCAAAACAGCCATAAGGGTAAAACTTTGAGGGGGCAAAAAAAAAAAAAAACATCCAAAAAATGAGAAAATAAAAAATAGTTATCAAACGAAATGAGAAACAGAGAAACAATACTAATTAAACGAAACTTAATTTTTGCATGTTGAAATCTGAATTGGGTTTTGTTTTTTGGGGATTTAGGTGTTTGAATTTGCTGATAAGTACAGAGGATCCTACAGCAATGGATTGAAGAATTATGTCTGCCCTTTTTACTGCTCTTTCTCTGGTTATCAGGTAATTTTATTTTCACCCTACTTGACCATGGAATAATTGAAATTGTTTAAAGAAGGAAGATTGAATGATTTTAATATTAATTTGGATTGAGAATTAATGGCAGGATGAGCTATTGTGGGGTGCTGCTTGGCTACACAGAGCTACAAAGAATGGCAGCTTCT
TGAATTACATTCAAGAGAATGGGCAGAATCTTGGAGGTGGAGAGTTTGATAATACTTTTGGTTGGGATAACAAGCATGTTGGAGCAAGAATTCTTCTTTCCAAGGTTCCTTCTTTTATTTACTGTAAATTTTGTTGGGAAATTTTTATAAATGGAAAAAAAATATCCAACTATTTACAAATAATTTTTACTTTGAATCGCAGTCTAGCCCAGTATAGAAGTCTATAACGGTCACAATGAAATTTTTCATTATTTATAAATATTTTAACTCATTTTTCTATATTTGAAAACAACCCAATTTTGTATGTAAAATGTATTTTATTACACACAAAATTTAAAGTTTAACAAATTAAATAGACATGTTTTAAAGTTCAAGAACATATTAGCTAATATGAAATTGAAAGTTGATTTATCAATTAAAAACTTCTTAAGTTCAATGGTCTATTTAACATAAACTTAAAAGTTCTAACTCTAAACGTTTGATTTAACTTATTTCAATGTATGTTTCTAAATTAAGGGTTATCTTTACAATGATTTTTTTTTTTTTTTTTTTTTGTGATGCTCTTTATAATTATTCTATATTTTGTTTTAAAACTTATTTTTAAATTTTTAGTCAATTTAAGAAGTTAGTTAAGCTATTGAAATTATTTTTTTAAACAAAACTTTTTAAAAAAGAAATTAGTTAAGCTGTTGGAAATATTTTTAATTTTAAGTACTCTTTTTTTTTGTTCGAAGTGTAATTTAATGTACTCAATTCTAAGAGAAGCTGTAGTTTTAGGATTTTATAAAATAAATAAATGAGTCCAAAAATAATTGTAAAAGTGTAAAAAAAAAAATTTGGGATTGGAACTTTCTATTTATTTAATTTTAAAGCTGTTATGGAGAATTATTTTTAAATGTAGAGTTTGGTACTTACATCAGCTTTCCTATTTGTTAAGTATTTTATTTTCATTAATGCAGCTTTTTAAACAGTCTGACAGACTGACTGCTTTATCATGTGACAACTATTCTATTGTAACGACTTTTGTATTTCGCAAAATAAAAATAAAACTATTGTAACGATTTTTTTTTTTTTTTTTTAAGCAAAACACTTTTTCTTAAAGTGATTTTAAAATAATAAAATTACTTTTTAAACATATCTTAAACTATTTGAAATCAAATTTTGATAATATAAAAAATATCTTTTAGAAAATATAAACTCAAATCAATTAAGATGTTTATTAGTATCTATTTGTCTCTATAATTGATAGACAGTAATACTTTGCTATATTTGAAAGAAATAGTTGCAAATATAGCAATTAAATTTAAAGTATTAGCATAAATAGCAACATTTTAAAAAAATTGTAAATATAGCAAAATCTGTTAGTAATAGACTTCTATCCTTGATAGACTTTTATGGTTTATAGGTAGTCTATCACAGATAGAATATAAGAGTCTATCAGTGATAAACTTTGCTATATTTGTAATTTTTTTAAAATCCTGTTGTACACTTAGTTATTATCCTTAAAATTGCTACTCATTATAATTACCTTATTTGAAAATATTTTCAGCAGTTTTGTCGTTTAAATCAATTACTTAAAAAAGAATGTAAACTACTTTTTTTCTGTAACTATTGTTTAAAAGAATAATTTGCTTTTGTTTTAAAAAAAAAAAAAAACAAAGAATAATTTGCTTTAGAAATAGTTTAAGATGATGTAAATATTGTTTGAAAGAATAGTTTGGCTAAATAATATAGTTTAATACGATTTAAAGAATTAAGTACTTTTTATATCTTAGCCAAACAACTTACCTCATAAAAAATTTGAAAATAAATTCTTAATAGTCTAAAAAGATAGATAGTTATACAACTAAGTAACACATACAATAGGACATATCTTTTTTCTCTTTCTTTTTATTCATCAAATATTGGTTGAGATTAAACTTTGACTCCTATAAATAAAAATGTGTCTTATCTACTAAATTATAATAGAACGTGCACCAAATAACTTGTTTTTAAAAGAA
ATTAATAAAAATAAAAAAAATATCAAATTATTTATAAATATAGAAAAAATGCTAATAGACAGCAATAGAATTCTATCGTTTGATCGATGGACTTCTATCATTGTCTATCGCGGTCTATCGCTAATATATAATAAAAAATTTCTATATTTATAAATAGTTTAACTCATTTTTTTATATTTAAAAAAAATCCGTCAAATATCACATCCGTTTTTTTATAAAATGTGCCATAATTTATTAGGTCATATTGTCCTTCCTTGTACAAGACCATGTGTTTGGTTTAACCTTTGATTTATAAACCAATTTTCAGGCGTTCTTAATCCAAAATGAGAAGTCTTTTCATGATTATAAAAATCATGCAGATAGTTTTATATGTTCTCTCATACCCGGCGCTCCTTCCTCTTCTGCTCAATATACCCCAGGTCTCTGATCTCCTATTACTTTAACATATTTATTTATTAATTTTTTTTAATTAAAATACTATTTTGATTTATATATTATTTTTTTAATTTTTTTTTCATTTTAATTCATCATAATTTGGAAGATAGGAAAAAGAAGGTGTTTGAATTGAGAGAAATTGAATAATATATGAATATTAATTTGTAGGAGGTCTTTTATTTAAAATGGGAGATAGCAACATGCAGTATGTAACATCAACCTCATTTCTACTATTAACCTATGCCAAATACTTAACCTCTGCTCACACAACTGCATATTGCACCGCCCGAACCATCACTCCCAACATTCTACGAGCCATTGCCAAGAAACAGGTATCTACTTTTTTTTATTATTATTATTATATTTACTCATTTAAACTTATAACTCCAATAATTCTTCCCTTTTAGCTTCCTTAACCATTTCAATTTTGTTTACTATTTCCTATCTACTTTAAAATAGTCTACATTTCAAACACAAACTATTATGATCCACAAACTATAATAACCAAAACTCAGCGTCACAAACGACCCCATAGTTATTTTGAGATATGAACTTTCAATAAATTTATAAATAGTTTGGGCTATTGATTTGGTCTTTGAACTTTAAAAGATGTCTGAAAAGTCTTTAAACTTCTAGTTTTGTTTCTAGTAAAAACTAAAAAAAAAAAAAAAGAAAATTAAAGGAAACTGAATTACAAGTTTAGTAGCTAAAATTTAAAGTTTTTATATGTTTGGTCTTTAAAGTATCCAATAAATCACTAATGTTGGTTATAATAATCCATATGTTATAATAGTTTATGTTTGGAATGCAAACTATTTTAATCTGGGTTATAATAGTCTATATTTGGAGTTTAGACTATTTTAGTTTAGGTAGGAAATAGTAAACAATGTAACAAATTGGAATTTAAAATAGTAATAATGTAGTTAATTATTATAATACCTCACTCCACCCACCTGAAGTTGGGGCTCAAACACTCCTAACAAATTTGAAATTTTTTAAAATCATTTCATCTAATTAGATATAAATTTGAATGTTATGTTTAGTGGAATCTTAAACTTATAATTTTATATCTAATAGATCCACAAATTTTAAAAAATTTGAAATTTTAAGATTAAATTTATAATTTTGAATAGTGAGTAACGTTAAAAACTAAACTTATAATTTAACCTAAATTTATGAATAATATTGTTCCCAACTAATCTAGTTGAGCGTGGTTTCGATAGACATCAACATGTCAAAGACGACAAAAAAAAAAAAAACCAAAAAAAGAGTAAAATTTTCAAAAATCAAATAATTACCAAGCCTAAATTAATTATTTTGTAAAAAGAATGGAAGGATAAAAAGGATAGAGAATTTAAATAATTTTTTAATTTTGTTTGTAGATTGATTACCTGCTGGGAGAGAATCCATTGAAGATGTCATACATGGTCGGATATGGCGGCCGCTACCCACGGAGAATCCACCACAGAGGCTCATCGCTGCCGTCGATTGCAGAACATCCGGCCAAGATCGGCTGCTCCTCCGGCTTCTCCGCCATGGATTCCAATTCCCCCAACCCTAACATT
CTCGTCGGCGCGGTGGTCGGAGGGCCCGATCAGAACGACGAATTTCCAGATGAGCGATCGGATTTCGAGCAATCCGAACCGTCCACTTACATTAATGCCCCGCTCGTGGGATCGCTCGCCTATTTTGCGCACTCCTTCGGCCAGCTTTAA

mRNA sequence

ATGGCTCTTTCTCCCTTCTCTTTCAAACTCATTGCCTTCTCTTTCCTTCTTCTCACCCTTTCCGACGCTTCTCCTGCCACCGTCGGCCACCATCGCCGCCCTCGTTTTACCCCTCATAACTATAGAGACGCTCTTGCCAAATCTATCCTCTTCTTTCAAGGCCAAAGGTCCGGGAAACTCCCTCCAAATCAGAAGATGACTTGGAGGAAAGATTCTGGTTTATTCGATGGCTCAACAATGAACGTTGATTTGGTTGGTGGGTACTATGATGCTGGGGATAATGTGAAGTTTGGGTTTCCTATGGCTTTCACAACCACTATGCTTTCATGGAGTGTTATTGAGTTTGGTGGAGTAATGAAAAATGAGCTCAATAATGCTAAAGAAGCCATTCGTTGGGCCACTGATTATCTCCTCAAAGCCACTGCTCTTCCTGACACCATTTTTGTTCAGGTGGGTGATGCTAACAGGGACCATGCTTGTTGGGAGAGACCAGAAGACATGGACACACCAAGAACTGTGTTGAAAATAGACAAGAACACCCCTGGTTCTGAAGTAGCAGCTGAAACTGCAGCTGCTCTTGCTTCTGCTTCATTAGTCTTCAAAAAATCTGATCCCACATACTCAAACCTCTTAATCAAAACAGCCATAAGGGTGTTTGAATTTGCTGATAAGTACAGAGGATCCTACAGCAATGGATTGAAGAATTATGTCTGCCCTTTTTACTGCTCTTTCTCTGGTTATCAGGATGAGCTATTGTGGGGTGCTGCTTGGCTACACAGAGCTACAAAGAATGGCAGCTTCTTGAATTACATTCAAGAGAATGGGCAGAATCTTGGAGGTGGAGAGTTTGATAATACTTTTGGTTGGGATAACAAGCATGTTGGAGCAAGAATTCTTCTTTCCAAGGCGTTCTTAATCCAAAATGAGAAGTCTTTTCATGATTATAAAAATCATGCAGATAGTTTTATATGTTCTCTCATACCCGGCGCTCCTTCCTCTTCTGCTCAATATACCCCAGGAGGTCTTTTATTTAAAATGGGAGATAGCAACATGCAGTATGTAACATCAACCTCATTTCTACTATTAACCTATGCCAAATACTTAACCTCTGCTCACACAACTGCATATTGCACCGCCCGAACCATCACTCCCAACATTCTACGAGCCATTGCCAAGAAACAGATTGATTACCTGCTGGGAGAGAATCCATTGAAGATGTCATACATGGTCGGATATGGCGGCCGCTACCCACGGAGAATCCACCACAGAGGCTCATCGCTGCCGTCGATTGCAGAACATCCGGCCAAGATCGGCTGCTCCTCCGGCTTCTCCGCCATGGATTCCAATTCCCCCAACCCTAACATTCTCGTCGGCGCGGTGGTCGGAGGGCCCGATCAGAACGACGAATTTCCAGATGAGCGATCGGATTTCGAGCAATCCGAACCGTCCACTTACATTAATGCCCCGCTCGTGGGATCGCTCGCCTATTTTGCGCACTCCTTCGGCCAGCTTTAA

Coding sequence (CDS)

ATGGCTCTTTCTCCCTTCTCTTTCAAACTCATTGCCTTCTCTTTCCTTCTTCTCACCCTTTCCGACGCTTCTCCTGCCACCGTCGGCCACCATCGCCGCCCTCGTTTTACCCCTCATAACTATAGAGACGCTCTTGCCAAATCTATCCTCTTCTTTCAAGGCCAAAGGTCCGGGAAACTCCCTCCAAATCAGAAGATGACTTGGAGGAAAGATTCTGGTTTATTCGATGGCTCAACAATGAACGTTGATTTGGTTGGTGGGTACTATGATGCTGGGGATAATGTGAAGTTTGGGTTTCCTATGGCTTTCACAACCACTATGCTTTCATGGAGTGTTATTGAGTTTGGTGGAGTAATGAAAAATGAGCTCAATAATGCTAAAGAAGCCATTCGTTGGGCCACTGATTATCTCCTCAAAGCCACTGCTCTTCCTGACACCATTTTTGTTCAGGTGGGTGATGCTAACAGGGACCATGCTTGTTGGGAGAGACCAGAAGACATGGACACACCAAGAACTGTGTTGAAAATAGACAAGAACACCCCTGGTTCTGAAGTAGCAGCTGAAACTGCAGCTGCTCTTGCTTCTGCTTCATTAGTCTTCAAAAAATCTGATCCCACATACTCAAACCTCTTAATCAAAACAGCCATAAGGGTGTTTGAATTTGCTGATAAGTACAGAGGATCCTACAGCAATGGATTGAAGAATTATGTCTGCCCTTTTTACTGCTCTTTCTCTGGTTATCAGGATGAGCTATTGTGGGGTGCTGCTTGGCTACACAGAGCTACAAAGAATGGCAGCTTCTTGAATTACATTCAAGAGAATGGGCAGAATCTTGGAGGTGGAGAGTTTGATAATACTTTTGGTTGGGATAACAAGCATGTTGGAGCAAGAATTCTTCTTTCCAAGGCGTTCTTAATCCAAAATGAGAAGTCTTTTCATGATTATAAAAATCATGCAGATAGTTTTATATGTTCTCTCATACCCGGCGCTCCTTCCTCTTCTGCTCAATATACCCCAGGAGGTCTTTTATTTAAAATGGGAGATAGCAACATGCAGTATGTAACATCAACCTCATTTCTACTATTAACCTATGCCAAATACTTAACCTCTGCTCACACAACTGCATATTGCACCGCCCGAACCATCACTCCCAACATTCTACGAGCCATTGCCAAGAAACAGATTGATTACCTGCTGGGAGAGAATCCATTGAAGATGTCATACATGGTCGGATATGGCGGCCGCTACCCACGGAGAATCCACCACAGAGGCTCATCGCTGCCGTCGATTGCAGAACATCCGGCCAAGATCGGCTGCTCCTCCGGCTTCTCCGCCATGGATTCCAATTCCCCCAACCCTAACATTCTCGTCGGCGCGGTGGTCGGAGGGCCCGATCAGAACGACGAATTTCCAGATGAGCGATCGGATTTCGAGCAATCCGAACCGTCCACTTACATTAATGCCCCGCTCGTGGGATCGCTCGCCTATTTTGCGCACTCCTTCGGCCAGCTTTAA

Protein sequence

MALSPFSFKLIAFSFLLLTLSDASPATVGHHRRPRFTPHNYRDALAKSILFFQGQRSGKLPPNQKMTWRKDSGLFDGSTMNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGVMKNELNNAKEAIRWATDYLLKATALPDTIFVQVGDANRDHACWERPEDMDTPRTVLKIDKNTPGSEVAAETAAALASASLVFKKSDPTYSNLLIKTAIRVFEFADKYRGSYSNGLKNYVCPFYCSFSGYQDELLWGAAWLHRATKNGSFLNYIQENGQNLGGGEFDNTFGWDNKHVGARILLSKAFLIQNEKSFHDYKNHADSFICSLIPGAPSSSAQYTPGGLLFKMGDSNMQYVTSTSFLLLTYAKYLTSAHTTAYCTARTITPNILRAIAKKQIDYLLGENPLKMSYMVGYGGRYPRRIHHRGSSLPSIAEHPAKIGCSSGFSAMDSNSPNPNILVGAVVGGPDQNDEFPDERSDFEQSEPSTYINAPLVGSLAYFAHSFGQL
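
The protein sequence corresponds to the CDS read codon by codon. As a hedged illustration, the sketch below translates short fragments copied from the start and end of the CDS above using the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative, and a dedicated library would normally be used for full sequences.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last four codons of the CDS shown above.
start_fragment = translate("ATGGCTCTTTCTCCCTTC")
end_fragment = translate("GGCCAGCTTTAA")
print(start_fragment, end_fragment)  # MALSPF GQL
```

The two fragments reproduce the first six residues (MALSPF) and the last three residues (GQL) of the protein sequence; the terminal TAA is the stop codon and is not translated.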
Homology
BLAST of HG10019301 vs. NCBI nr
Match: XP_038905683.1 (endoglucanase 17-like [Benincasa hispida])

HSP 1 Score: 979.5 bits (2531), Expect = 1.0e-281
Identity = 475/504 (94.25%), Positives = 491/504 (97.42%), Query Frame = 0

Query: 1   MALSPFSFKLIAFSFLLLTLSDASPATVGHHRRPRFTPHNYRDALAKSILFFQGQRSGKL 60
           MALSP SFKLIAFSFLLLTLSDASPATVGHHRRP FTPHNYRDALAKSILFFQGQRSGKL
Sbjct: 1   MALSPLSFKLIAFSFLLLTLSDASPATVGHHRRPHFTPHNYRDALAKSILFFQGQRSGKL 60

Query: 61  PPNQKMTWRKDSGLFDGSTMNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGVMK 120
           PPNQKMTWRKDSGL DGSTMNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGVMK
Sbjct: 61  PPNQKMTWRKDSGLLDGSTMNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGVMK 120

Query: 121 NELNNAKEAIRWATDYLLKATALPDTIFVQVGDANRDHACWERPEDMDTPRTVLKIDKNT 180
           NELNN+KEAIRWATDYLLK+TALPDTIFVQVGDAN+DHACWERPEDMDTPRTVLKIDKNT
Sbjct: 121 NELNNSKEAIRWATDYLLKSTALPDTIFVQVGDANKDHACWERPEDMDTPRTVLKIDKNT 180

Query: 181 PGSEVAAETAAALASASLVFKKSDPTYSNLLIKTAIRVFEFADKYRGSYSNGLKNYVCPF 240
           PGSEVAAETAAALASASLVFK+SDPTYSN+LIK AIRVFEFADKYRGSYSNGLKNYVCPF
Sbjct: 181 PGSEVAAETAAALASASLVFKRSDPTYSNILIKRAIRVFEFADKYRGSYSNGLKNYVCPF 240

Query: 241 YCSFSGYQDELLWGAAWLHRATKNGSFLNYIQENGQNLGGGEFDNTFGWDNKHVGARILL 300
           YCSFSGYQDELLWGAAWL+RATKNGS+LNYIQENGQNLGG EFDN FGWDNKHVGARILL
Sbjct: 241 YCSFSGYQDELLWGAAWLYRATKNGSYLNYIQENGQNLGGVEFDNAFGWDNKHVGARILL 300

Query: 301 SKAFLIQNEKSFHDYKNHADSFICSLIPGAPSSSAQYTPGGLLFKMGDSNMQYVTSTSFL 360
           SKAFLIQNEKS H+YK HAD+FICSLIPGAP SSAQYTPGGLLFKMGDSNMQYVTSTSFL
Sbjct: 301 SKAFLIQNEKSLHEYKGHADNFICSLIPGAPFSSAQYTPGGLLFKMGDSNMQYVTSTSFL 360

Query: 361 LLTYAKYLTSAHTTAYCTARTITPNILRAIAKKQIDYLLGENPLKMSYMVGYGGRYPRRI 420
           +L+YAKYLTSAHTT  CT RTITPNILR+IAKKQIDYLLGENPLKMSYMVGYGGRYP+RI
Sbjct: 361 VLSYAKYLTSAHTTVDCTGRTITPNILRSIAKKQIDYLLGENPLKMSYMVGYGGRYPQRI 420

Query: 421 HHRGSSLPSIAEHPAKIGCSSGFSAMDSNSPNPNILVGAVVGGPDQNDEFPDERSDFEQS 480
           HHRGSSLPSIAEHPAKI CS+GFSAMDSNSPNPNILVGAVVGGPD+NDEFPD+RSDFEQS
Sbjct: 421 HHRGSSLPSIAEHPAKIDCSTGFSAMDSNSPNPNILVGAVVGGPDRNDEFPDDRSDFEQS 480

Query: 481 EPSTYINAPLVGSLAYFAHSFGQL 505
           EPSTYINAPLVGSLAYFAHSFGQL
Sbjct: 481 EPSTYINAPLVGSLAYFAHSFGQL 504
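
The percentages in each HSP summary line are the match counts divided by the alignment length. A minimal sketch of that arithmetic, using the counts reported for the Benincasa hispida hit above (the helper name `percent` is illustrative):

```python
def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage, rounded to 2 places."""
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the HSP summary of the XP_038905683.1 hit.
identity = percent(475, 504)
positives = percent(491, 504)
print(f"{identity:.2f}% identical, {positives:.2f}% similar")
```

The same calculation applies to the other hits; note that the denominator is the alignment length, which includes gap columns and so can exceed the query length, as in the 507-column Cucumis melo alignment.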

BLAST of HG10019301 vs. NCBI nr
Match: XP_022934344.1 (endoglucanase 17-like [Cucurbita moschata])

HSP 1 Score: 940.3 bits (2429), Expect = 7.1e-270
Identity = 453/504 (89.88%), Positives = 480/504 (95.24%), Query Frame = 0

Query: 1   MALSPFSFKLIAFSFLLLTLSDASPATVGHHRRPRFTPHNYRDALAKSILFFQGQRSGKL 60
           M  SP SFKLI F +LLLTLS AS ATVGHHRRPRF+PHNYRDALAKSILFFQGQRSGKL
Sbjct: 1   MPHSPLSFKLITFFYLLLTLSGASLATVGHHRRPRFSPHNYRDALAKSILFFQGQRSGKL 60

Query: 61  PPNQKMTWRKDSGLFDGSTMNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGVMK 120
           PP+QKM WRKDSGLFDGSTMNVDLVGGYYDAGDNVKFGFPMAF+TTMLSWSV+EFGGVM+
Sbjct: 61  PPSQKMAWRKDSGLFDGSTMNVDLVGGYYDAGDNVKFGFPMAFSTTMLSWSVVEFGGVMR 120

Query: 121 NELNNAKEAIRWATDYLLKATALPDTIFVQVGDANRDHACWERPEDMDTPRTVLKIDKNT 180
           +ELNNAKEAIRWATDYLLKATALPDT+FVQVGDAN+DH CWERPEDMDTPRTVLKID+N+
Sbjct: 121 DELNNAKEAIRWATDYLLKATALPDTVFVQVGDANKDHVCWERPEDMDTPRTVLKIDRNS 180

Query: 181 PGSEVAAETAAALASASLVFKKSDPTYSNLLIKTAIRVFEFADKYRGSYSNGLKNYVCPF 240
           PGSEVAAETAAALASASLVF++SDP+YS LLIK AIRVFEF DKYRGSYSNGLK++VCPF
Sbjct: 181 PGSEVAAETAAALASASLVFRRSDPSYSKLLIKRAIRVFEFGDKYRGSYSNGLKDFVCPF 240

Query: 241 YCSFSGYQDELLWGAAWLHRATKNGSFLNYIQENGQNLGGGEFDNTFGWDNKHVGARILL 300
           YCSFSGYQDELLWGAAWLHRATKNGS+LNYIQENGQ LGGGE DNTFGWDNKHVGARILL
Sbjct: 241 YCSFSGYQDELLWGAAWLHRATKNGSYLNYIQENGQILGGGELDNTFGWDNKHVGARILL 300

Query: 301 SKAFLIQNEKSFHDYKNHADSFICSLIPGAPSSSAQYTPGGLLFKMGDSNMQYVTSTSFL 360
           SKAFLIQN KS  DYK H+D+FICSL+PGAP SSA+YTPGGLL+KMGDSNMQYVTSTSFL
Sbjct: 301 SKAFLIQNVKSLRDYKGHSDNFICSLVPGAPFSSARYTPGGLLYKMGDSNMQYVTSTSFL 360

Query: 361 LLTYAKYLTSAHTTAYCTARTITPNILRAIAKKQIDYLLGENPLKMSYMVGYGGRYPRRI 420
           LLTYAKYLTSAHTTA+C  RTITPN LRAIA+KQIDYLLGENPLKMSYMVGYGGRYPRRI
Sbjct: 361 LLTYAKYLTSAHTTAHCAGRTITPNALRAIAQKQIDYLLGENPLKMSYMVGYGGRYPRRI 420

Query: 421 HHRGSSLPSIAEHPAKIGCSSGFSAMDSNSPNPNILVGAVVGGPDQNDEFPDERSDFEQS 480
           HHRGSSLPSIAEHPAKI CSSGFSAMDS+SPNPN+LVGAVVGGPDQND FPDERSD+EQS
Sbjct: 421 HHRGSSLPSIAEHPAKIDCSSGFSAMDSDSPNPNVLVGAVVGGPDQNDGFPDERSDYEQS 480

Query: 481 EPSTYINAPLVGSLAYFAHSFGQL 505
           EPSTYINAPLVGSLAYFAHSFGQL
Sbjct: 481 EPSTYINAPLVGSLAYFAHSFGQL 504

BLAST of HG10019301 vs. NCBI nr
Match: KAG6581281.1 (Endoglucanase 17, partial [Cucurbita argyrosperma subsp. sororia] >KAG7018003.1 Endoglucanase 17, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 939.9 bits (2428), Expect = 9.2e-270
Identity = 453/504 (89.88%), Positives = 480/504 (95.24%), Query Frame = 0

Query: 1   MALSPFSFKLIAFSFLLLTLSDASPATVGHHRRPRFTPHNYRDALAKSILFFQGQRSGKL 60
           M  SP SFKLI F +LLLTLS AS ATVGHHRRPRF+PHNYRDALAKSILFFQGQRSGKL
Sbjct: 1   MPHSPLSFKLITFFYLLLTLSGASLATVGHHRRPRFSPHNYRDALAKSILFFQGQRSGKL 60

Query: 61  PPNQKMTWRKDSGLFDGSTMNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGVMK 120
           PP+QKM WRKDSGLFDGSTMNVDLVGGYYDAGDNVKFGFPMAF+TTMLSWSV+EFGGVM+
Sbjct: 61  PPSQKMAWRKDSGLFDGSTMNVDLVGGYYDAGDNVKFGFPMAFSTTMLSWSVVEFGGVMR 120

Query: 121 NELNNAKEAIRWATDYLLKATALPDTIFVQVGDANRDHACWERPEDMDTPRTVLKIDKNT 180
           +ELNNAKEAIRWATDYLLKATALPDT+FVQVGDAN+DHACWERPEDMDTPRTVLKID+N+
Sbjct: 121 DELNNAKEAIRWATDYLLKATALPDTVFVQVGDANKDHACWERPEDMDTPRTVLKIDRNS 180

Query: 181 PGSEVAAETAAALASASLVFKKSDPTYSNLLIKTAIRVFEFADKYRGSYSNGLKNYVCPF 240
           PGSEVAAETAAALASASLVF++SDP+YS LLIK AIRVFEF DKYRGSYSNGLK++VCPF
Sbjct: 181 PGSEVAAETAAALASASLVFRRSDPSYSKLLIKRAIRVFEFGDKYRGSYSNGLKDFVCPF 240

Query: 241 YCSFSGYQDELLWGAAWLHRATKNGSFLNYIQENGQNLGGGEFDNTFGWDNKHVGARILL 300
           YCSFSGYQDELLWGAAWLHRATKNGS+LNYIQENGQ LGGGE DNTFGWDNKHVGARILL
Sbjct: 241 YCSFSGYQDELLWGAAWLHRATKNGSYLNYIQENGQILGGGELDNTFGWDNKHVGARILL 300

Query: 301 SKAFLIQNEKSFHDYKNHADSFICSLIPGAPSSSAQYTPGGLLFKMGDSNMQYVTSTSFL 360
           SKAFLIQN  S  DYK H+D+FICSL+PGAP SSAQYTPGGLL+KMGDSNMQYVTSTSFL
Sbjct: 301 SKAFLIQNVNSLRDYKGHSDNFICSLVPGAPFSSAQYTPGGLLYKMGDSNMQYVTSTSFL 360

Query: 361 LLTYAKYLTSAHTTAYCTARTITPNILRAIAKKQIDYLLGENPLKMSYMVGYGGRYPRRI 420
           LLTYAKYLTSAHTTA+C  RTITPN LRAIA+KQIDYLLGENPLKMSYMVGYGGRYP+RI
Sbjct: 361 LLTYAKYLTSAHTTAHCAGRTITPNALRAIAQKQIDYLLGENPLKMSYMVGYGGRYPQRI 420

Query: 421 HHRGSSLPSIAEHPAKIGCSSGFSAMDSNSPNPNILVGAVVGGPDQNDEFPDERSDFEQS 480
           HHRGSSLPSIAEHPAKI CSSGFSAMDS+SPNPN+LVGAVVGGPDQND FPDERSD+EQS
Sbjct: 421 HHRGSSLPSIAEHPAKIDCSSGFSAMDSDSPNPNVLVGAVVGGPDQNDGFPDERSDYEQS 480

Query: 481 EPSTYINAPLVGSLAYFAHSFGQL 505
           EPSTYINAPLVGSLAYFAHSFGQL
Sbjct: 481 EPSTYINAPLVGSLAYFAHSFGQL 504

BLAST of HG10019301 vs. NCBI nr
Match: XP_023528182.1 (endoglucanase 17-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 937.2 bits (2421), Expect = 6.0e-269
Identity = 452/504 (89.68%), Positives = 478/504 (94.84%), Query Frame = 0

Query: 1   MALSPFSFKLIAFSFLLLTLSDASPATVGHHRRPRFTPHNYRDALAKSILFFQGQRSGKL 60
           M  SP SFKLI F  LLLTLS AS A VGHHRRPRF+PHNYRDALAKSILFFQGQRSGKL
Sbjct: 1   MPHSPLSFKLITFFHLLLTLSGASLAAVGHHRRPRFSPHNYRDALAKSILFFQGQRSGKL 60

Query: 61  PPNQKMTWRKDSGLFDGSTMNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGVMK 120
           PP+QKM WRKDSGLFDGSTMNVDLVGGYYDAGDNVKFGFPMAF+TTMLSWSV+EFGGVM+
Sbjct: 61  PPSQKMAWRKDSGLFDGSTMNVDLVGGYYDAGDNVKFGFPMAFSTTMLSWSVVEFGGVMR 120

Query: 121 NELNNAKEAIRWATDYLLKATALPDTIFVQVGDANRDHACWERPEDMDTPRTVLKIDKNT 180
           +ELNNAKEAIRWATDYLLKATALPDT+FVQVGDAN+DHACWERPEDMDTPRTVLKID+N+
Sbjct: 121 DELNNAKEAIRWATDYLLKATALPDTVFVQVGDANKDHACWERPEDMDTPRTVLKIDRNS 180

Query: 181 PGSEVAAETAAALASASLVFKKSDPTYSNLLIKTAIRVFEFADKYRGSYSNGLKNYVCPF 240
           PGS+VAAETAAALASAS+VF+ SDP+YS LLIK AIRVFEF DKYRGSYSNGLK++VCPF
Sbjct: 181 PGSDVAAETAAALASASIVFRTSDPSYSKLLIKRAIRVFEFGDKYRGSYSNGLKDFVCPF 240

Query: 241 YCSFSGYQDELLWGAAWLHRATKNGSFLNYIQENGQNLGGGEFDNTFGWDNKHVGARILL 300
           YCSFSGYQDELLWGAAWLHRATKNGS+LNYIQENGQ LGGGE DNTFGWDNKHVGARILL
Sbjct: 241 YCSFSGYQDELLWGAAWLHRATKNGSYLNYIQENGQILGGGELDNTFGWDNKHVGARILL 300

Query: 301 SKAFLIQNEKSFHDYKNHADSFICSLIPGAPSSSAQYTPGGLLFKMGDSNMQYVTSTSFL 360
           SKAFLIQN KS  DYK H+D+FICSL+PGAP SSAQYTPGGLL+KMGDSNMQYVTSTSFL
Sbjct: 301 SKAFLIQNVKSLRDYKGHSDNFICSLVPGAPFSSAQYTPGGLLYKMGDSNMQYVTSTSFL 360

Query: 361 LLTYAKYLTSAHTTAYCTARTITPNILRAIAKKQIDYLLGENPLKMSYMVGYGGRYPRRI 420
           LLTYAKYLTSAHTTA+C  RTITPN LRAIA+KQIDYLLGENPLKMSYMVGYGGRYP+RI
Sbjct: 361 LLTYAKYLTSAHTTAHCAGRTITPNALRAIAQKQIDYLLGENPLKMSYMVGYGGRYPQRI 420

Query: 421 HHRGSSLPSIAEHPAKIGCSSGFSAMDSNSPNPNILVGAVVGGPDQNDEFPDERSDFEQS 480
           HHRGSSLPSIAEHPAKI CSSGFSAMDSNSPNPN+LVGAVVGGPDQND FPDERSD+EQS
Sbjct: 421 HHRGSSLPSIAEHPAKIDCSSGFSAMDSNSPNPNVLVGAVVGGPDQNDGFPDERSDYEQS 480

Query: 481 EPSTYINAPLVGSLAYFAHSFGQL 505
           EPSTYINAPLVGSLAYFAHSFGQL
Sbjct: 481 EPSTYINAPLVGSLAYFAHSFGQL 504

BLAST of HG10019301 vs. NCBI nr
Match: XP_008454321.1 (PREDICTED: endoglucanase 17 [Cucumis melo] >TYK29519.1 endoglucanase 17 [Cucumis melo var. makuwa])

HSP 1 Score: 933.3 bits (2411), Expect = 8.6e-268
Identity = 460/507 (90.73%), Positives = 480/507 (94.67%), Query Frame = 0

Query: 1   MALSPFSFKLIA-FSFLLLTLSDASPATVGHHRR-PRFTPHNYRDALAKSILFFQGQRSG 60
           MALSP SFKLIA  SFLLL+LS AS AT+G HRR PR+TPHNYRDALAKSILFFQGQRSG
Sbjct: 1   MALSPLSFKLIALISFLLLSLSKASLATLGQHRRHPRYTPHNYRDALAKSILFFQGQRSG 60

Query: 61  KLPPNQKMTWRKDSGLFDGSTMNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGV 120
           KLPPNQKM WRKDSGL DGS+MNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGV
Sbjct: 61  KLPPNQKMAWRKDSGLSDGSSMNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGV 120

Query: 121 MKNELNNAKEAIRWATDYLLKATALPDTIFVQVGDANRDHACWERPEDMDTPRTVLKIDK 180
           MKNELNNAK+AIRWATDYLLKATALPDTIFVQVGDAN+DHACWERPEDMDTPRTVLKIDK
Sbjct: 121 MKNELNNAKQAIRWATDYLLKATALPDTIFVQVGDANKDHACWERPEDMDTPRTVLKIDK 180

Query: 181 NTPGSEVAAETAAALASASLVFKKSDPTYSNLLIKTAIRVFEFADKYRGSYSNGLKNYVC 240
           N PGSEVAAETAAALASASLVFK SDPTYS LLIKTAIRVFEF DKYRGSYSNGLKN+VC
Sbjct: 181 NNPGSEVAAETAAALASASLVFKTSDPTYSKLLIKTAIRVFEFGDKYRGSYSNGLKNFVC 240

Query: 241 PFYCSFSGYQDELLWGAAWLHRATKNGSFLNYIQENGQNLGGGEFDNTFGWDNKHVGARI 300
           PFYCSFSGYQDELLWGAAWLHRATKN S+LNYIQENGQNLGG EFDNTFGWDNKHVGARI
Sbjct: 241 PFYCSFSGYQDELLWGAAWLHRATKNSSYLNYIQENGQNLGGVEFDNTFGWDNKHVGARI 300

Query: 301 LLSKAFLIQNEKSFHDYKNHADSFICSLIPGAPSSSA-QYTPGGLLFKMGDSNMQYVTST 360
           LLSKAFLIQN KS  +YK+HAD+FICSL+PGA SSS+ QYTPGGLLFKMGDSNMQYVTST
Sbjct: 301 LLSKAFLIQNVKSLQEYKDHADNFICSLVPGASSSSSVQYTPGGLLFKMGDSNMQYVTST 360

Query: 361 SFLLLTYAKYLTSAHTTAYCTARTITPNILRAIAKKQIDYLLGENPLKMSYMVGYGGRYP 420
           +FLLLTYAKYLTS+HTTA+C  RTITPN+LRAIAKKQIDYLLGENPLKMSYMVGYG RYP
Sbjct: 361 TFLLLTYAKYLTSSHTTAHCNGRTITPNVLRAIAKKQIDYLLGENPLKMSYMVGYGSRYP 420

Query: 421 RRIHHRGSSLPSIAEHPAKIGCSSGFSAMDSNSPNPNILVGAVVGGPDQNDEFPDERSDF 480
           +RIHHR SSLPSIAEHPAKI CSSGFSAM SNSPNPN+L+GAVVGGPDQND FPDERSDF
Sbjct: 421 QRIHHRASSLPSIAEHPAKIDCSSGFSAMHSNSPNPNVLIGAVVGGPDQNDGFPDERSDF 480

Query: 481 EQSEPSTYINAPLVGSLAYFAHSFGQL 505
           EQSEPSTYINAPLVGSLAYFAHSFGQL
Sbjct: 481 EQSEPSTYINAPLVGSLAYFAHSFGQL 507

BLAST of HG10019301 vs. ExPASy Swiss-Prot
Match: O81416 (Endoglucanase 17 OS=Arabidopsis thaliana OX=3702 GN=At4g02290 PE=2 SV=1)

HSP 1 Score: 778.1 bits (2008), Expect = 6.1e-224
Identity = 363/475 (76.42%), Positives = 418/475 (88.00%), Query Frame = 0

Query: 30  HHRRPRFTPHNYRDALAKSILFFQGQRSGKLPPNQKMTWRKDSGLFDGSTMNVDLVGGYY 89
           HH R     HNY+DAL KSILFF+GQRSGKLP NQ+M+WR+DSGL DGS ++VDLVGGYY
Sbjct: 42  HHHRHHLAKHNYKDALTKSILFFEGQRSGKLPSNQRMSWRRDSGLSDGSALHVDLVGGYY 101

Query: 90  DAGDNVKFGFPMAFTTTMLSWSVIEFGGVMKNELNNAKEAIRWATDYLLKATALPDTIFV 149
           DAGDN+KFGFPMAFTTTMLSWSVIEFGG+MK+EL NAK AIRWATDYLLKAT+ PDTI+V
Sbjct: 102 DAGDNIKFGFPMAFTTTMLSWSVIEFGGLMKSELQNAKIAIRWATDYLLKATSQPDTIYV 161

Query: 150 QVGDANRDHACWERPEDMDTPRTVLKIDKNTPGSEVAAETAAALASASLVFKKSDPTYSN 209
           QVGDAN+DH+CWERPEDMDT R+V K+DKN PGS+VAAETAAALA+A++VF+KSDP+YS 
Sbjct: 162 QVGDANKDHSCWERPEDMDTVRSVFKVDKNIPGSDVAAETAAALAAAAIVFRKSDPSYSK 221

Query: 210 LLIKTAIRVFEFADKYRGSYSNGLKNYVCPFYCSFSGYQDELLWGAAWLHRATKNGSFLN 269
           +L+K AI VF FADKYRG+YS GLK  VCPFYCS+SGYQDELLWGAAWL +ATKN  +LN
Sbjct: 222 VLLKRAISVFAFADKYRGTYSAGLKPDVCPFYCSYSGYQDELLWGAAWLQKATKNIKYLN 281

Query: 270 YIQENGQNLGGGEFDNTFGWDNKHVGARILLSKAFLIQNEKSFHDYKNHADSFICSLIPG 329
           YI+ NGQ LG  E+DNTFGWDNKH GARILL+KAFL+QN K+ H+YK HAD+FICS+IPG
Sbjct: 282 YIKINGQILGAAEYDNTFGWDNKHAGARILLTKAFLVQNVKTLHEYKGHADNFICSVIPG 341

Query: 330 APSSSAQYTPGGLLFKMGDSNMQYVTSTSFLLLTYAKYLTSAHTTAYCTARTITPNILRA 389
           AP SS QYTPGGLLFKM D+NMQYVTSTSFLLLTYAKYLTSA T  +C     TP  LR+
Sbjct: 342 APFSSTQYTPGGLLFKMADANMQYVTSTSFLLLTYAKYLTSAKTVVHCGGSVYTPGRLRS 401

Query: 390 IAKKQIDYLLGENPLKMSYMVGYGGRYPRRIHHRGSSLPSIAEHPAKIGCSSGFSAMDSN 449
           IAK+Q+DYLLG+NPL+MSYMVGYG ++PRRIHHRGSSLP +A HPAKI C  GF+ M+S 
Sbjct: 402 IAKRQVDYLLGDNPLRMSYMVGYGPKFPRRIHHRGSSLPCVASHPAKIQCHQGFAIMNSQ 461

Query: 450 SPNPNILVGAVVGGPDQNDEFPDERSDFEQSEPSTYINAPLVGSLAYFAHSFGQL 505
           SPNPN LVGAVVGGPDQ+D FPDERSD+EQSEP+TYIN+PLVG+LAYFAH++GQL
Sbjct: 462 SPNPNFLVGAVVGGPDQHDRFPDERSDYEQSEPATYINSPLVGALAYFAHAYGQL 516

BLAST of HG10019301 vs. ExPASy Swiss-Prot
Match: Q9SRX3 (Endoglucanase 1 OS=Arabidopsis thaliana OX=3702 GN=CEL2 PE=2 SV=1)

HSP 1 Score: 753.8 bits (1945), Expect = 1.2e-216
Identity = 369/503 (73.36%), Positives = 419/503 (83.30%), Query Frame = 0

Query: 1   MALSPFSFKLIAFSFLLLTLSD---ASPATVGHHRRPRFTPHNYRDALAKSILFFQGQRS 60
           MAL   S +LI F   +L LS+   +S +    H R     HNY+DAL+KSILFF+GQRS
Sbjct: 1   MALYLSSSRLITFLSFILLLSNGFSSSSSRPSIHHRHHLDNHNYKDALSKSILFFEGQRS 60

Query: 61  GKLPPNQKMTWRKDSGLFDGSTMNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGG 120
           GKLPPNQ+MTWR +SGL DGS +NVDLVGGYYDAGDN+KFGFPMAFTTTMLSWS+IEFGG
Sbjct: 61  GKLPPNQRMTWRSNSGLSDGSALNVDLVGGYYDAGDNMKFGFPMAFTTTMLSWSLIEFGG 120

Query: 121 VMKNELNNAKEAIRWATDYLLKATALPDTIFVQVGDANRDHACWERPEDMDTPRTVLKID 180
           +MK+EL NAK+AIRWATD+LLKAT+ PDTI+VQVGD N DHACWERPEDMDTPR+V K+D
Sbjct: 121 LMKSELPNAKDAIRWATDFLLKATSHPDTIYVQVGDPNMDHACWERPEDMDTPRSVFKVD 180

Query: 181 KNTPGSEVAAETAAALASASLVFKKSDPTYSNLLIKTAIRVFEFADKYRGSYSNGLKNYV 240
           KN PGS++A E AAALA+AS+VF+K DP+YSN L++ AI VF FADKYRG YS GL   V
Sbjct: 181 KNNPGSDIAGEIAAALAAASIVFRKCDPSYSNHLLQRAITVFTFADKYRGPYSAGLAPEV 240

Query: 241 CPFYCSFSGYQDELLWGAAWLHRATKNGSFLNYIQENGQNLGGGEFDNTFGWDNKHVGAR 300
           CPFYCS+SGYQDELLWGAAWL +AT N ++LNYI+ NGQ LG  EFDN F WDNKHVGAR
Sbjct: 241 CPFYCSYSGYQDELLWGAAWLQKATNNPTYLNYIKANGQILGADEFDNMFSWDNKHVGAR 300

Query: 301 ILLSKAFLIQNEKSFHDYKNHADSFICSLIPGAPSSSAQYTPGGLLFKMGDSNMQYVTST 360
           ILLSK FLIQ  KS  +YK HADSFICS++PGA  SS+QYTPGGLLFKMG+SNMQYVTST
Sbjct: 301 ILLSKEFLIQKVKSLEEYKEHADSFICSVLPGA--SSSQYTPGGLLFKMGESNMQYVTST 360

Query: 361 SFLLLTYAKYLTSAHTTAYCTARTITPNILRAIAKKQIDYLLGENPLKMSYMVGYGGRYP 420
           SFLLLTYAKYLTSA T AYC    +TP  LR+IAKKQ+DYLLG NPLKMSYMVGYG +YP
Sbjct: 361 SFLLLTYAKYLTSARTVAYCGGSVVTPARLRSIAKKQVDYLLGGNPLKMSYMVGYGLKYP 420

Query: 421 RRIHHRGSSLPSIAEHPAKIGCSSGFSAMDSNSPNPNILVGAVVGGPDQNDEFPDERSDF 480
           RRIHHRGSSLPS+A HP +I C  GFS   S SPNPN LVGAVVGGPDQND+FPDERSD+
Sbjct: 421 RRIHHRGSSLPSVAVHPTRIQCHDGFSLFTSQSPNPNDLVGAVVGGPDQNDQFPDERSDY 480

Query: 481 EQSEPSTYINAPLVGSLAYFAHS 501
            +SEP+TYINAPLVG+LAY A S
Sbjct: 481 GRSEPATYINAPLVGALAYLARS 501

BLAST of HG10019301 vs. ExPASy Swiss-Prot
Match: Q8LQ92 (Endoglucanase 3 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU8 PE=2 SV=1)

HSP 1 Score: 733.8 bits (1893), Expect = 1.3e-210
Identity = 345/495 (69.70%), Positives = 412/495 (83.23%), Query Frame = 0

Query: 10  LIAFSFLLLTLSDASPATVGHHRRPRFTPHNYRDALAKSILFFQGQRSGKLPPNQKMTWR 69
           L   + LL   + A  A    H  P   PH+YRDAL KSILFF+GQRSGKLPP+Q+++WR
Sbjct: 7   LFLLAVLLPHRNAAVVAAASPHHGP--APHDYRDALTKSILFFEGQRSGKLPPSQRVSWR 66

Query: 70  KDSGLFDGSTMNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGVMKNELNNAKEA 129
            DSGL DGS++ VDLVGGYYDAGDN+KFGFP+AF+ TML+WSV+EFGG+MK EL +A++A
Sbjct: 67  GDSGLSDGSSIKVDLVGGYYDAGDNMKFGFPLAFSMTMLAWSVVEFGGLMKGELQHARDA 126

Query: 130 IRWATDYLLKATALPDTIFVQVGDANRDHACWERPEDMDTPRTVLKIDKNTPGSEVAAET 189
           +RW +DYLLKATA PDT++VQVGDANRDHACWERPEDMDTPRTV K+D +TPG++VAAET
Sbjct: 127 VRWGSDYLLKATAHPDTVYVQVGDANRDHACWERPEDMDTPRTVYKVDPSTPGTDVAAET 186

Query: 190 AAALASASLVFKKSDPTYSNLLIKTAIRVFEFADKYRGSYSNGLKNYVCPFYCSFSGYQD 249
           AAALA+ASLVF+KSDP Y++ L+  A RVFEFADK+RG+YS  L  YVCP+YCS+SGYQD
Sbjct: 187 AAALAAASLVFRKSDPAYASRLVARAKRVFEFADKHRGTYSTRLSPYVCPYYCSYSGYQD 246

Query: 250 ELLWGAAWLHRATKNGSFLNYIQENGQNLGGGEFDNTFGWDNKHVGARILLSKAFLIQNE 309
           ELLWGAAWLHRATKN ++L+YIQ NGQ LG  E DNTFGWDNKH GARIL++KAFL+Q  
Sbjct: 247 ELLWGAAWLHRATKNPTYLSYIQMNGQVLGADEQDNTFGWDNKHAGARILIAKAFLVQKV 306

Query: 310 KSFHDYKNHADSFICSLIPGAPSSSAQYTPGGLLFKMGDSNMQYVTSTSFLLLTYAKYLT 369
            + H+YK HADSFICS++PG P+   QYT GGLLFK+ DSNMQYVTS+SFLLLTYAKYL 
Sbjct: 307 AALHEYKGHADSFICSMVPGTPTDQTQYTRGGLLFKLSDSNMQYVTSSSFLLLTYAKYLA 366

Query: 370 SAHTTAYCTARTITPNILRAIAKKQIDYLLGENPLKMSYMVGYGGRYPRRIHHRGSSLPS 429
            + TT  C    +TP  LRAIA++Q+DYLLG NP+ MSYMVGYG +YPRRIHHR SSLPS
Sbjct: 367 FSKTTVSCGGAAVTPARLRAIARQQVDYLLGSNPMGMSYMVGYGAKYPRRIHHRASSLPS 426

Query: 430 IAEHPAKIGCSSGFSAMDSNSPNPNILVGAVVGGPDQNDEFPDERSDFEQSEPSTYINAP 489
           +A HPA+IGCS GF+A+ S   NPN+LVGAVVGGP+  D+FPD+RSD E SEP+TYINAP
Sbjct: 427 VAAHPARIGCSQGFTALYSGVANPNVLVGAVVGGPNLQDQFPDQRSDHEHSEPATYINAP 486

Query: 490 LVGSLAYFAHSFGQL 505
           LVG+LAY AHS+GQL
Sbjct: 487 LVGALAYLAHSYGQL 499

BLAST of HG10019301 vs. ExPASy Swiss-Prot
Match: P05522 (Endoglucanase 1 OS=Persea americana OX=3435 GN=CEL1 PE=2 SV=1)

HSP 1 Score: 626.3 bits (1614), Expect = 3.0e-178
Identity = 293/460 (63.70%), Positives = 366/460 (79.57%), Query Frame = 0

Query: 40  NYRDALAKSILFFQGQRSGKLPPNQKMTWRKDSGLFDGSTMNVDLVGGYYDAGDNVKFGF 99
           +Y DAL KSILFF+GQRSGKLP NQ++TWR DSGL DGS+ +VDLVGGYYDAGDN+KFG 
Sbjct: 29  HYSDALEKSILFFEGQRSGKLPTNQRLTWRGDSGLSDGSSYHVDLVGGYYDAGDNLKFGL 88

Query: 100 PMAFTTTMLSWSVIEFGGVMKNELNNAKEAIRWATDYLLKA-TALPDTIFVQVGDANRDH 159
           PMAFTTTML+W +IEFG +M  ++ NA+ A+RW+TDYLLKA TA  ++++VQVG+ N DH
Sbjct: 89  PMAFTTTMLAWGIIEFGCLMPEQVENARAALRWSTDYLLKASTATSNSLYVQVGEPNADH 148

Query: 160 ACWERPEDMDTPRTVLKIDKNTPGSEVAAETAAALASASLVFKKSDPTYSNLLIKTAIRV 219
            CWERPEDMDTPR V K+    PGS+VAAETAAALA+AS+VF  SD +YS  L+ TA++V
Sbjct: 149 RCWERPEDMDTPRNVYKVSTQNPGSDVAAETAAALAAASIVFGDSDSSYSTKLLHTAVKV 208

Query: 220 FEFADKYRGSYSNGLKNYVCPFYCSFSGYQDELLWGAAWLHRATKNGSFLNYIQENGQNL 279
           FEFAD+YRGSYS+ L + VCPFYCS+SGY DELLWGA+WLHRA++N S++ YIQ NG  L
Sbjct: 209 FEFADQYRGSYSDSLGSVVCPFYCSYSGYNDELLWGASWLHRASQNASYMTYIQSNGHTL 268

Query: 280 GGGEFDNTFGWDNKHVGARILLSKAFLIQNEKSFHDYKNHADSFICSLIPGAPSSSAQYT 339
           G  + D +F WD+K VG ++LLSK FL    +    YK H D++ICSLIPG  S  AQYT
Sbjct: 269 GADDDDYSFSWDDKRVGTKVLLSKGFLQDRIEELQLYKVHTDNYICSLIPGTSSFQAQYT 328

Query: 340 PGGLLFKMGDSNMQYVTSTSFLLLTYAKYLTSAHTTAYCTARTITPNILRAIAKKQIDYL 399
           PGGLL+K   SN+QYVTST+FLLLTYA YL S+   A C   T+T   L ++AKKQ+DY+
Sbjct: 329 PGGLLYKGSASNLQYVTSTAFLLLTYANYLNSSGGHASCGTTTVTAKNLISLAKKQVDYI 388

Query: 400 LGENPLKMSYMVGYGGRYPRRIHHRGSSLPSIAEHPAKIGCSSGFSAMDSNSPNPNILVG 459
           LG+NP KMSYMVG+G RYP+ +HHRGSSLPS+  HP  I C++GF  + S+ PNPNILVG
Sbjct: 389 LGQNPAKMSYMVGFGERYPQHVHHRGSSLPSVQVHPNSIPCNAGFQYLYSSPPNPNILVG 448

Query: 460 AVVGGPDQNDEFPDERSDFEQSEPSTYINAPLVGSLAYFA 499
           A++GGPD  D F D+R++++QSEP+TYINAPLVG+LA+FA
Sbjct: 449 AILGGPDNRDSFSDDRNNYQQSEPATYINAPLVGALAFFA 488

BLAST of HG10019301 vs. ExPASy Swiss-Prot
Match: Q652F9 (Endoglucanase 17 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU13 PE=2 SV=1)

HSP 1 Score: 614.4 bits (1583), Expect = 1.2e-174
Identity = 295/485 (60.82%), Positives = 374/485 (77.11%), Query Frame = 0

Query: 16  LLLTLSDASPATVGHHRRPRFTPHNYRDALAKSILFFQGQRSGKLPPNQKMTWRKDSGLF 75
           LLL L+ A+  T           H+Y DAL KSILFF+GQRSG+LPP+Q++ WR+DS L 
Sbjct: 9   LLLVLATATSVT---------GQHDYSDALHKSILFFEGQRSGRLPPDQRLRWRRDSALN 68

Query: 76  DGSTMNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGVMKNELNNAKEAIRWATD 135
           DG+T  VDL GGYYDAGDNVKFGFPMAFT T++SW +I+FG         A+EA+RWATD
Sbjct: 69  DGATAGVDLTGGYYDAGDNVKFGFPMAFTATLMSWGLIDFGRSFGAHAAEAREAVRWATD 128

Query: 136 YLLKATALPDTIFVQVGDANRDHACWERPEDMDTPRTVLKIDKNTPGSEVAAETAAALAS 195
           YL+KATA P+T++VQVGDA RDH+CWERPEDMDTPRTV K+D + PGS+VAAETAAALA+
Sbjct: 129 YLMKATATPNTVYVQVGDAFRDHSCWERPEDMDTPRTVYKVDPSHPGSDVAAETAAALAA 188

Query: 196 ASLVFKKSDPTYSNLLIKTAIRVFEFADKYRGSYSNGLKNYVCPFYCSFSGYQDELLWGA 255
           AS+VF+ +DP YSN L+  AI+VFEFADKYRG YS+ L   VCP YC +SGY+DELLWGA
Sbjct: 189 ASIVFRDADPDYSNRLLDRAIQVFEFADKYRGPYSSSLHAAVCPCYCDYSGYKDELLWGA 248

Query: 256 AWLHRATKNGSFLNYIQENGQNLGGGEFDNTFGWDNKHVGARILLSKAFLIQNEKSFHDY 315
           AWLH+A++   + +YI+ N   LG  E  N FGWDNKH G  +L+SK  L+  ++ F  +
Sbjct: 249 AWLHKASRRREYRDYIKRNEVVLGASEAINEFGWDNKHAGINVLISKEVLMGKDEYFQSF 308

Query: 316 KNHADSFICSLIPGAPS-SSAQYTPGGLLFKMGDSNMQYVTSTSFLLLTYAKYLTSAHTT 375
           + +AD+FIC+L+PG  +    QY+PGGLLFK+G+SNMQ+VTS SFLLL Y+ YL+ A+  
Sbjct: 309 RVNADNFICTLLPGISNHPQIQYSPGGLLFKVGNSNMQHVTSLSFLLLAYSNYLSHANVR 368

Query: 376 AYCTARTITPNILRAIAKKQIDYLLGENPLKMSYMVGYGGRYPRRIHHRGSSLPSIAEHP 435
             C   + +P  LR +AK+Q+DY+LG+NPL+MSYMVGYG RYP RIHHRGSSLPS+A HP
Sbjct: 369 VPCGTSSASPVQLRRVAKRQVDYILGDNPLRMSYMVGYGSRYPLRIHHRGSSLPSVAAHP 428

Query: 436 AKIGCSSGFSAMDSNSPNPNILVGAVVGGP-DQNDEFPDERSDFEQSEPSTYINAPLVGS 495
           A+IGC +G +   S +PNPN+LVGAVVGGP + +D FPD R+ F+QSEP+TYINAPL+G 
Sbjct: 429 AQIGCKAGATYYASAAPNPNLLVGAVVGGPSNTSDAFPDARAVFQQSEPTTYINAPLLGL 484

Query: 496 LAYFA 499
           LAYF+
Sbjct: 489 LAYFS 484

BLAST of HG10019301 vs. ExPASy TrEMBL
Match: A0A6J1F2B2 (Endoglucanase OS=Cucurbita moschata OX=3662 GN=LOC111441537 PE=3 SV=1)

HSP 1 Score: 940.3 bits (2429), Expect = 3.4e-270
Identity = 453/504 (89.88%), Positives = 480/504 (95.24%), Query Frame = 0

Query: 1   MALSPFSFKLIAFSFLLLTLSDASPATVGHHRRPRFTPHNYRDALAKSILFFQGQRSGKL 60
           M  SP SFKLI F +LLLTLS AS ATVGHHRRPRF+PHNYRDALAKSILFFQGQRSGKL
Sbjct: 1   MPHSPLSFKLITFFYLLLTLSGASLATVGHHRRPRFSPHNYRDALAKSILFFQGQRSGKL 60

Query: 61  PPNQKMTWRKDSGLFDGSTMNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGVMK 120
           PP+QKM WRKDSGLFDGSTMNVDLVGGYYDAGDNVKFGFPMAF+TTMLSWSV+EFGGVM+
Sbjct: 61  PPSQKMAWRKDSGLFDGSTMNVDLVGGYYDAGDNVKFGFPMAFSTTMLSWSVVEFGGVMR 120

Query: 121 NELNNAKEAIRWATDYLLKATALPDTIFVQVGDANRDHACWERPEDMDTPRTVLKIDKNT 180
           +ELNNAKEAIRWATDYLLKATALPDT+FVQVGDAN+DH CWERPEDMDTPRTVLKID+N+
Sbjct: 121 DELNNAKEAIRWATDYLLKATALPDTVFVQVGDANKDHVCWERPEDMDTPRTVLKIDRNS 180

Query: 181 PGSEVAAETAAALASASLVFKKSDPTYSNLLIKTAIRVFEFADKYRGSYSNGLKNYVCPF 240
           PGSEVAAETAAALASASLVF++SDP+YS LLIK AIRVFEF DKYRGSYSNGLK++VCPF
Sbjct: 181 PGSEVAAETAAALASASLVFRRSDPSYSKLLIKRAIRVFEFGDKYRGSYSNGLKDFVCPF 240

Query: 241 YCSFSGYQDELLWGAAWLHRATKNGSFLNYIQENGQNLGGGEFDNTFGWDNKHVGARILL 300
           YCSFSGYQDELLWGAAWLHRATKNGS+LNYIQENGQ LGGGE DNTFGWDNKHVGARILL
Sbjct: 241 YCSFSGYQDELLWGAAWLHRATKNGSYLNYIQENGQILGGGELDNTFGWDNKHVGARILL 300

Query: 301 SKAFLIQNEKSFHDYKNHADSFICSLIPGAPSSSAQYTPGGLLFKMGDSNMQYVTSTSFL 360
           SKAFLIQN KS  DYK H+D+FICSL+PGAP SSA+YTPGGLL+KMGDSNMQYVTSTSFL
Sbjct: 301 SKAFLIQNVKSLRDYKGHSDNFICSLVPGAPFSSARYTPGGLLYKMGDSNMQYVTSTSFL 360

Query: 361 LLTYAKYLTSAHTTAYCTARTITPNILRAIAKKQIDYLLGENPLKMSYMVGYGGRYPRRI 420
           LLTYAKYLTSAHTTA+C  RTITPN LRAIA+KQIDYLLGENPLKMSYMVGYGGRYPRRI
Sbjct: 361 LLTYAKYLTSAHTTAHCAGRTITPNALRAIAQKQIDYLLGENPLKMSYMVGYGGRYPRRI 420

Query: 421 HHRGSSLPSIAEHPAKIGCSSGFSAMDSNSPNPNILVGAVVGGPDQNDEFPDERSDFEQS 480
           HHRGSSLPSIAEHPAKI CSSGFSAMDS+SPNPN+LVGAVVGGPDQND FPDERSD+EQS
Sbjct: 421 HHRGSSLPSIAEHPAKIDCSSGFSAMDSDSPNPNVLVGAVVGGPDQNDGFPDERSDYEQS 480

Query: 481 EPSTYINAPLVGSLAYFAHSFGQL 505
           EPSTYINAPLVGSLAYFAHSFGQL
Sbjct: 481 EPSTYINAPLVGSLAYFAHSFGQL 504
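
The percentages in each HSP header are the match counts divided by the aligned length. As a minimal sketch (the function name `parse_hsp_stats` and the regular expression are illustrative, not part of the database output), the header line of the Cucurbita moschata hit can be parsed and its percentages recomputed like this:

```python
import re

# Matches "Identity = 453/504 (89.88%)" and the Positives field; the optional
# "i" also accepts the "Postives" spelling seen in some exported headers.
HSP_STATS = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+) \(([\d.]+)%\)")

def parse_hsp_stats(line):
    """Return {label: (matches, aligned_length, recomputed_percent)}."""
    stats = {}
    for label, num, den, _reported in HSP_STATS.findall(line):
        key = "Identity" if label == "Identity" else "Positives"
        stats[key] = (int(num), int(den), round(100 * int(num) / int(den), 2))
    return stats

line = "Identity = 453/504 (89.88%), Positives = 480/504 (95.24%), Query Frame = 0"
stats = parse_hsp_stats(line)
for key, (matches, length, percent) in stats.items():
    print(f"{key}: {matches}/{length} = {percent}%")
```

The same function can be applied to the header of any other hit on this page, since they all share this layout.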

BLAST of HG10019301 vs. ExPASy TrEMBL
Match: A0A5D3E1B8 (Endoglucanase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold655G001220 PE=3 SV=1)

HSP 1 Score: 933.3 bits (2411), Expect = 4.2e-268
Identity = 460/507 (90.73%), Positives = 480/507 (94.67%), Query Frame = 0

Query: 1   MALSPFSFKLIA-FSFLLLTLSDASPATVGHHRR-PRFTPHNYRDALAKSILFFQGQRSG 60
           MALSP SFKLIA  SFLLL+LS AS AT+G HRR PR+TPHNYRDALAKSILFFQGQRSG
Sbjct: 1   MALSPLSFKLIALISFLLLSLSKASLATLGQHRRHPRYTPHNYRDALAKSILFFQGQRSG 60

Query: 61  KLPPNQKMTWRKDSGLFDGSTMNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGV 120
           KLPPNQKM WRKDSGL DGS+MNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGV
Sbjct: 61  KLPPNQKMAWRKDSGLSDGSSMNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGV 120

Query: 121 MKNELNNAKEAIRWATDYLLKATALPDTIFVQVGDANRDHACWERPEDMDTPRTVLKIDK 180
           MKNELNNAK+AIRWATDYLLKATALPDTIFVQVGDAN+DHACWERPEDMDTPRTVLKIDK
Sbjct: 121 MKNELNNAKQAIRWATDYLLKATALPDTIFVQVGDANKDHACWERPEDMDTPRTVLKIDK 180

Query: 181 NTPGSEVAAETAAALASASLVFKKSDPTYSNLLIKTAIRVFEFADKYRGSYSNGLKNYVC 240
           N PGSEVAAETAAALASASLVFK SDPTYS LLIKTAIRVFEF DKYRGSYSNGLKN+VC
Sbjct: 181 NNPGSEVAAETAAALASASLVFKTSDPTYSKLLIKTAIRVFEFGDKYRGSYSNGLKNFVC 240

Query: 241 PFYCSFSGYQDELLWGAAWLHRATKNGSFLNYIQENGQNLGGGEFDNTFGWDNKHVGARI 300
           PFYCSFSGYQDELLWGAAWLHRATKN S+LNYIQENGQNLGG EFDNTFGWDNKHVGARI
Sbjct: 241 PFYCSFSGYQDELLWGAAWLHRATKNSSYLNYIQENGQNLGGVEFDNTFGWDNKHVGARI 300

Query: 301 LLSKAFLIQNEKSFHDYKNHADSFICSLIPGAPSSSA-QYTPGGLLFKMGDSNMQYVTST 360
           LLSKAFLIQN KS  +YK+HAD+FICSL+PGA SSS+ QYTPGGLLFKMGDSNMQYVTST
Sbjct: 301 LLSKAFLIQNVKSLQEYKDHADNFICSLVPGASSSSSVQYTPGGLLFKMGDSNMQYVTST 360

Query: 361 SFLLLTYAKYLTSAHTTAYCTARTITPNILRAIAKKQIDYLLGENPLKMSYMVGYGGRYP 420
           +FLLLTYAKYLTS+HTTA+C  RTITPN+LRAIAKKQIDYLLGENPLKMSYMVGYG RYP
Sbjct: 361 TFLLLTYAKYLTSSHTTAHCNGRTITPNVLRAIAKKQIDYLLGENPLKMSYMVGYGSRYP 420

Query: 421 RRIHHRGSSLPSIAEHPAKIGCSSGFSAMDSNSPNPNILVGAVVGGPDQNDEFPDERSDF 480
           +RIHHR SSLPSIAEHPAKI CSSGFSAM SNSPNPN+L+GAVVGGPDQND FPDERSDF
Sbjct: 421 QRIHHRASSLPSIAEHPAKIDCSSGFSAMHSNSPNPNVLIGAVVGGPDQNDGFPDERSDF 480

Query: 481 EQSEPSTYINAPLVGSLAYFAHSFGQL 505
           EQSEPSTYINAPLVGSLAYFAHSFGQL
Sbjct: 481 EQSEPSTYINAPLVGSLAYFAHSFGQL 507

BLAST of HG10019301 vs. ExPASy TrEMBL
Match: A0A1S3BZ39 (Endoglucanase OS=Cucumis melo OX=3656 GN=LOC103494756 PE=3 SV=1)

HSP 1 Score: 933.3 bits (2411), Expect = 4.2e-268
Identity = 460/507 (90.73%), Positives = 480/507 (94.67%), Query Frame = 0

Query: 1   MALSPFSFKLIA-FSFLLLTLSDASPATVGHHRR-PRFTPHNYRDALAKSILFFQGQRSG 60
           MALSP SFKLIA  SFLLL+LS AS AT+G HRR PR+TPHNYRDALAKSILFFQGQRSG
Sbjct: 1   MALSPLSFKLIALISFLLLSLSKASLATLGQHRRHPRYTPHNYRDALAKSILFFQGQRSG 60

Query: 61  KLPPNQKMTWRKDSGLFDGSTMNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGV 120
           KLPPNQKM WRKDSGL DGS+MNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGV
Sbjct: 61  KLPPNQKMAWRKDSGLSDGSSMNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGV 120

Query: 121 MKNELNNAKEAIRWATDYLLKATALPDTIFVQVGDANRDHACWERPEDMDTPRTVLKIDK 180
           MKNELNNAK+AIRWATDYLLKATALPDTIFVQVGDAN+DHACWERPEDMDTPRTVLKIDK
Sbjct: 121 MKNELNNAKQAIRWATDYLLKATALPDTIFVQVGDANKDHACWERPEDMDTPRTVLKIDK 180

Query: 181 NTPGSEVAAETAAALASASLVFKKSDPTYSNLLIKTAIRVFEFADKYRGSYSNGLKNYVC 240
           N PGSEVAAETAAALASASLVFK SDPTYS LLIKTAIRVFEF DKYRGSYSNGLKN+VC
Sbjct: 181 NNPGSEVAAETAAALASASLVFKTSDPTYSKLLIKTAIRVFEFGDKYRGSYSNGLKNFVC 240

Query: 241 PFYCSFSGYQDELLWGAAWLHRATKNGSFLNYIQENGQNLGGGEFDNTFGWDNKHVGARI 300
           PFYCSFSGYQDELLWGAAWLHRATKN S+LNYIQENGQNLGG EFDNTFGWDNKHVGARI
Sbjct: 241 PFYCSFSGYQDELLWGAAWLHRATKNSSYLNYIQENGQNLGGVEFDNTFGWDNKHVGARI 300

Query: 301 LLSKAFLIQNEKSFHDYKNHADSFICSLIPGAPSSSA-QYTPGGLLFKMGDSNMQYVTST 360
           LLSKAFLIQN KS  +YK+HAD+FICSL+PGA SSS+ QYTPGGLLFKMGDSNMQYVTST
Sbjct: 301 LLSKAFLIQNVKSLQEYKDHADNFICSLVPGASSSSSVQYTPGGLLFKMGDSNMQYVTST 360

Query: 361 SFLLLTYAKYLTSAHTTAYCTARTITPNILRAIAKKQIDYLLGENPLKMSYMVGYGGRYP 420
           +FLLLTYAKYLTS+HTTA+C  RTITPN+LRAIAKKQIDYLLGENPLKMSYMVGYG RYP
Sbjct: 361 TFLLLTYAKYLTSSHTTAHCNGRTITPNVLRAIAKKQIDYLLGENPLKMSYMVGYGSRYP 420

Query: 421 RRIHHRGSSLPSIAEHPAKIGCSSGFSAMDSNSPNPNILVGAVVGGPDQNDEFPDERSDF 480
           +RIHHR SSLPSIAEHPAKI CSSGFSAM SNSPNPN+L+GAVVGGPDQND FPDERSDF
Sbjct: 421 QRIHHRASSLPSIAEHPAKIDCSSGFSAMHSNSPNPNVLIGAVVGGPDQNDGFPDERSDF 480

Query: 481 EQSEPSTYINAPLVGSLAYFAHSFGQL 505
           EQSEPSTYINAPLVGSLAYFAHSFGQL
Sbjct: 481 EQSEPSTYINAPLVGSLAYFAHSFGQL 507

BLAST of HG10019301 vs. ExPASy TrEMBL
Match: A0A5A7TS46 (Endoglucanase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold46G001200 PE=3 SV=1)

HSP 1 Score: 931.0 bits (2405), Expect = 2.1e-267
Identity = 459/507 (90.53%), Positives = 479/507 (94.48%), Query Frame = 0

Query: 1   MALSPFSFKLIA-FSFLLLTLSDASPATVGHHRR-PRFTPHNYRDALAKSILFFQGQRSG 60
           MALSP SFKLIA  SFLLL+LS AS AT+G HRR PR+TPHNYRDALAKSILFFQGQRSG
Sbjct: 1   MALSPLSFKLIALISFLLLSLSKASLATLGQHRRHPRYTPHNYRDALAKSILFFQGQRSG 60

Query: 61  KLPPNQKMTWRKDSGLFDGSTMNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGV 120
           KLPPNQKM WRKDSGL DGS+MNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGV
Sbjct: 61  KLPPNQKMAWRKDSGLSDGSSMNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGV 120

Query: 121 MKNELNNAKEAIRWATDYLLKATALPDTIFVQVGDANRDHACWERPEDMDTPRTVLKIDK 180
           MKNELNNAK+AIRWATDYLLKATALPDTIFVQVGDAN+DHACWERPEDMDTPRTVLKIDK
Sbjct: 121 MKNELNNAKQAIRWATDYLLKATALPDTIFVQVGDANKDHACWERPEDMDTPRTVLKIDK 180

Query: 181 NTPGSEVAAETAAALASASLVFKKSDPTYSNLLIKTAIRVFEFADKYRGSYSNGLKNYVC 240
           N PGSEVAAETAAALASASLVFK SDPTYS LLIKTAIRVFEF DKYRGSYSNGLKN+VC
Sbjct: 181 NNPGSEVAAETAAALASASLVFKTSDPTYSKLLIKTAIRVFEFGDKYRGSYSNGLKNFVC 240

Query: 241 PFYCSFSGYQDELLWGAAWLHRATKNGSFLNYIQENGQNLGGGEFDNTFGWDNKHVGARI 300
           PFYCSFSGYQDELLWGAAWLHRATKN S+LNYIQENGQNLGG EFDNTFGWDNKHVGARI
Sbjct: 241 PFYCSFSGYQDELLWGAAWLHRATKNSSYLNYIQENGQNLGGVEFDNTFGWDNKHVGARI 300

Query: 301 LLSKAFLIQNEKSFHDYKNHADSFICSLIPGAPSSSA-QYTPGGLLFKMGDSNMQYVTST 360
           LLSKAFLIQN KS  +YK+HAD+FICSL+PGA SSS+ QYTPGGLLFKMGDSNMQYVTST
Sbjct: 301 LLSKAFLIQNVKSLQEYKDHADNFICSLVPGASSSSSVQYTPGGLLFKMGDSNMQYVTST 360

Query: 361 SFLLLTYAKYLTSAHTTAYCTARTITPNILRAIAKKQIDYLLGENPLKMSYMVGYGGRYP 420
           +FLLLTYAKYLTS+HTTA+C  RTITPN+LRAIAKKQIDYLLGENPLKMSYMVGYG RYP
Sbjct: 361 TFLLLTYAKYLTSSHTTAHCNGRTITPNVLRAIAKKQIDYLLGENPLKMSYMVGYGSRYP 420

Query: 421 RRIHHRGSSLPSIAEHPAKIGCSSGFSAMDSNSPNPNILVGAVVGGPDQNDEFPDERSDF 480
           +RIHHR SSLPSIAEHPAKI CSSGFSAM SNSPNPN+L+GAVVGGPDQND FPDERSDF
Sbjct: 421 QRIHHRASSLPSIAEHPAKIDCSSGFSAMHSNSPNPNVLIGAVVGGPDQNDGFPDERSDF 480

Query: 481 EQSEPSTYINAPLVGSLAYFAHSFGQL 505
           EQSEPSTYINAPLVGSLAYFAHSF QL
Sbjct: 481 EQSEPSTYINAPLVGSLAYFAHSFSQL 507

BLAST of HG10019301 vs. ExPASy TrEMBL
Match: A0A0A0KWL5 (Endoglucanase OS=Cucumis sativus OX=3659 GN=Csa_4G001940 PE=3 SV=1)

HSP 1 Score: 851.3 bits (2198), Expect = 2.1e-243
Identity = 431/508 (84.84%), Positives = 449/508 (88.39%), Query Frame = 0

Query: 1   MALSPFSFKLIA-FSFLLLTLSDASPA-TVGHHRR-PRFTPHNYRDALAKSILFFQGQRS 60
           MALSP SFKLI   SFLLL+LS AS A T+GHHRR PR+TPHNYRDALAKSILFFQGQRS
Sbjct: 1   MALSPLSFKLITLISFLLLSLSKASLATTLGHHRRHPRYTPHNYRDALAKSILFFQGQRS 60

Query: 61  GKLPPNQKMTWRKDSGLFDGSTMNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGG 120
           GKLPPNQKM WRKDSGL DGS+MNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSV+EFGG
Sbjct: 61  GKLPPNQKMAWRKDSGLSDGSSMNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVVEFGG 120

Query: 121 VMKNELNNAKEAIRWATDYLLKATALPDTIFVQVGDANRDHACWERPEDMDTPRTVLKID 180
           VMKNELNNAK+AIRWATDYLLKATALPDTIFVQVGDAN+DHACWERPEDMDTPRTVLKID
Sbjct: 121 VMKNELNNAKQAIRWATDYLLKATALPDTIFVQVGDANKDHACWERPEDMDTPRTVLKID 180

Query: 181 KNTPGSEVAAETAAALASASLVFKKSDPTYSNLLIKTAIRVFEFADKYRGSYSNGLKNYV 240
           KN PGSEVAAETAAALASASLVFKKSDPTYS LLIKTAIRVFEF DKYRGSYSNGL N+V
Sbjct: 181 KNNPGSEVAAETAAALASASLVFKKSDPTYSKLLIKTAIRVFEFGDKYRGSYSNGLNNFV 240

Query: 241 CPFYCSFSGYQDELLWGAAWLHRATKNGSFLNYIQENGQNLGGGEFDNTFGWDNKHVGAR 300
           CPFYCSFSGY                            QNLGG EFDNTFGWDNKHVGAR
Sbjct: 241 CPFYCSFSGY----------------------------QNLGGVEFDNTFGWDNKHVGAR 300

Query: 301 ILLSKAFLIQNEKSFHDYKNHADSFICSLIPGAPSSSA-QYTPGGLLFKMGDSNMQYVTS 360
           ILLSKAFLIQN KS ++YK+HAD+FICSLIP APSSS+  YTPGGLLFKMGDSNMQYVTS
Sbjct: 301 ILLSKAFLIQNVKSLYEYKDHADNFICSLIPDAPSSSSVHYTPGGLLFKMGDSNMQYVTS 360

Query: 361 TSFLLLTYAKYLTSAHTTAYCTARTITPNILRAIAKKQIDYLLGENPLKMSYMVGYGGRY 420
           T+FLLLTYAKYLTSAHTTA C  R+ITPNILR IAKKQIDYLLGENPLKMSYMVGYG  Y
Sbjct: 361 TTFLLLTYAKYLTSAHTTANCNGRSITPNILRTIAKKQIDYLLGENPLKMSYMVGYGSHY 420

Query: 421 PRRIHHRGSSLPSIAEHPAKIGCSSGFSAMDSNSPNPNILVGAVVGGPDQNDEFPDERSD 480
           P+RIHHR SSLPSIAEHPAKI CSSGF  M SNSPNPN+L+GAVVGGPDQNDEFPDERSD
Sbjct: 421 PQRIHHRASSLPSIAEHPAKIDCSSGFFVMHSNSPNPNVLIGAVVGGPDQNDEFPDERSD 480

Query: 481 FEQSEPSTYINAPLVGSLAYFAHSFGQL 505
           FEQSEPSTYINAPLVGSLAYFAHSFGQL
Sbjct: 481 FEQSEPSTYINAPLVGSLAYFAHSFGQL 480

BLAST of HG10019301 vs. TAIR 10
Match: AT4G02290.1 (glycosyl hydrolase 9B13 )

HSP 1 Score: 778.1 bits (2008), Expect = 4.3e-225
Identity = 363/475 (76.42%), Positives = 418/475 (88.00%), Query Frame = 0

Query: 30  HHRRPRFTPHNYRDALAKSILFFQGQRSGKLPPNQKMTWRKDSGLFDGSTMNVDLVGGYY 89
           HH R     HNY+DAL KSILFF+GQRSGKLP NQ+M+WR+DSGL DGS ++VDLVGGYY
Sbjct: 42  HHHRHHLAKHNYKDALTKSILFFEGQRSGKLPSNQRMSWRRDSGLSDGSALHVDLVGGYY 101

Query: 90  DAGDNVKFGFPMAFTTTMLSWSVIEFGGVMKNELNNAKEAIRWATDYLLKATALPDTIFV 149
           DAGDN+KFGFPMAFTTTMLSWSVIEFGG+MK+EL NAK AIRWATDYLLKAT+ PDTI+V
Sbjct: 102 DAGDNIKFGFPMAFTTTMLSWSVIEFGGLMKSELQNAKIAIRWATDYLLKATSQPDTIYV 161

Query: 150 QVGDANRDHACWERPEDMDTPRTVLKIDKNTPGSEVAAETAAALASASLVFKKSDPTYSN 209
           QVGDAN+DH+CWERPEDMDT R+V K+DKN PGS+VAAETAAALA+A++VF+KSDP+YS 
Sbjct: 162 QVGDANKDHSCWERPEDMDTVRSVFKVDKNIPGSDVAAETAAALAAAAIVFRKSDPSYSK 221

Query: 210 LLIKTAIRVFEFADKYRGSYSNGLKNYVCPFYCSFSGYQDELLWGAAWLHRATKNGSFLN 269
           +L+K AI VF FADKYRG+YS GLK  VCPFYCS+SGYQDELLWGAAWL +ATKN  +LN
Sbjct: 222 VLLKRAISVFAFADKYRGTYSAGLKPDVCPFYCSYSGYQDELLWGAAWLQKATKNIKYLN 281

Query: 270 YIQENGQNLGGGEFDNTFGWDNKHVGARILLSKAFLIQNEKSFHDYKNHADSFICSLIPG 329
           YI+ NGQ LG  E+DNTFGWDNKH GARILL+KAFL+QN K+ H+YK HAD+FICS+IPG
Sbjct: 282 YIKINGQILGAAEYDNTFGWDNKHAGARILLTKAFLVQNVKTLHEYKGHADNFICSVIPG 341

Query: 330 APSSSAQYTPGGLLFKMGDSNMQYVTSTSFLLLTYAKYLTSAHTTAYCTARTITPNILRA 389
           AP SS QYTPGGLLFKM D+NMQYVTSTSFLLLTYAKYLTSA T  +C     TP  LR+
Sbjct: 342 APFSSTQYTPGGLLFKMADANMQYVTSTSFLLLTYAKYLTSAKTVVHCGGSVYTPGRLRS 401

Query: 390 IAKKQIDYLLGENPLKMSYMVGYGGRYPRRIHHRGSSLPSIAEHPAKIGCSSGFSAMDSN 449
           IAK+Q+DYLLG+NPL+MSYMVGYG ++PRRIHHRGSSLP +A HPAKI C  GF+ M+S 
Sbjct: 402 IAKRQVDYLLGDNPLRMSYMVGYGPKFPRRIHHRGSSLPCVASHPAKIQCHQGFAIMNSQ 461

Query: 450 SPNPNILVGAVVGGPDQNDEFPDERSDFEQSEPSTYINAPLVGSLAYFAHSFGQL 505
           SPNPN LVGAVVGGPDQ+D FPDERSD+EQSEP+TYIN+PLVG+LAYFAH++GQL
Sbjct: 462 SPNPNFLVGAVVGGPDQHDRFPDERSDYEQSEPATYINSPLVGALAYFAHAYGQL 516

BLAST of HG10019301 vs. TAIR 10
Match: AT1G02800.1 (cellulase 2 )

HSP 1 Score: 753.8 bits (1945), Expect = 8.7e-218
Identity = 369/503 (73.36%), Positives = 419/503 (83.30%), Query Frame = 0

Query: 1   MALSPFSFKLIAFSFLLLTLSD---ASPATVGHHRRPRFTPHNYRDALAKSILFFQGQRS 60
           MAL   S +LI F   +L LS+   +S +    H R     HNY+DAL+KSILFF+GQRS
Sbjct: 1   MALYLSSSRLITFLSFILLLSNGFSSSSSRPSIHHRHHLDNHNYKDALSKSILFFEGQRS 60

Query: 61  GKLPPNQKMTWRKDSGLFDGSTMNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGG 120
           GKLPPNQ+MTWR +SGL DGS +NVDLVGGYYDAGDN+KFGFPMAFTTTMLSWS+IEFGG
Sbjct: 61  GKLPPNQRMTWRSNSGLSDGSALNVDLVGGYYDAGDNMKFGFPMAFTTTMLSWSLIEFGG 120

Query: 121 VMKNELNNAKEAIRWATDYLLKATALPDTIFVQVGDANRDHACWERPEDMDTPRTVLKID 180
           +MK+EL NAK+AIRWATD+LLKAT+ PDTI+VQVGD N DHACWERPEDMDTPR+V K+D
Sbjct: 121 LMKSELPNAKDAIRWATDFLLKATSHPDTIYVQVGDPNMDHACWERPEDMDTPRSVFKVD 180

Query: 181 KNTPGSEVAAETAAALASASLVFKKSDPTYSNLLIKTAIRVFEFADKYRGSYSNGLKNYV 240
           KN PGS++A E AAALA+AS+VF+K DP+YSN L++ AI VF FADKYRG YS GL   V
Sbjct: 181 KNNPGSDIAGEIAAALAAASIVFRKCDPSYSNHLLQRAITVFTFADKYRGPYSAGLAPEV 240

Query: 241 CPFYCSFSGYQDELLWGAAWLHRATKNGSFLNYIQENGQNLGGGEFDNTFGWDNKHVGAR 300
           CPFYCS+SGYQDELLWGAAWL +AT N ++LNYI+ NGQ LG  EFDN F WDNKHVGAR
Sbjct: 241 CPFYCSYSGYQDELLWGAAWLQKATNNPTYLNYIKANGQILGADEFDNMFSWDNKHVGAR 300

Query: 301 ILLSKAFLIQNEKSFHDYKNHADSFICSLIPGAPSSSAQYTPGGLLFKMGDSNMQYVTST 360
           ILLSK FLIQ  KS  +YK HADSFICS++PGA  SS+QYTPGGLLFKMG+SNMQYVTST
Sbjct: 301 ILLSKEFLIQKVKSLEEYKEHADSFICSVLPGA--SSSQYTPGGLLFKMGESNMQYVTST 360

Query: 361 SFLLLTYAKYLTSAHTTAYCTARTITPNILRAIAKKQIDYLLGENPLKMSYMVGYGGRYP 420
           SFLLLTYAKYLTSA T AYC    +TP  LR+IAKKQ+DYLLG NPLKMSYMVGYG +YP
Sbjct: 361 SFLLLTYAKYLTSARTVAYCGGSVVTPARLRSIAKKQVDYLLGGNPLKMSYMVGYGLKYP 420

Query: 421 RRIHHRGSSLPSIAEHPAKIGCSSGFSAMDSNSPNPNILVGAVVGGPDQNDEFPDERSDF 480
           RRIHHRGSSLPS+A HP +I C  GFS   S SPNPN LVGAVVGGPDQND+FPDERSD+
Sbjct: 421 RRIHHRGSSLPSVAVHPTRIQCHDGFSLFTSQSPNPNDLVGAVVGGPDQNDQFPDERSDY 480

Query: 481 EQSEPSTYINAPLVGSLAYFAHS 501
            +SEP+TYINAPLVG+LAY A S
Sbjct: 481 GRSEPATYINAPLVGALAYLARS 501

BLAST of HG10019301 vs. TAIR 10
Match: AT1G70710.1 (glycosyl hydrolase 9B1 )

HSP 1 Score: 614.4 bits (1583), Expect = 8.3e-176
Identity = 291/463 (62.85%), Positives = 357/463 (77.11%), Query Frame = 0

Query: 39  HNYRDALAKSILFFQGQRSGKLPPNQKMTWRKDSGLFDGSTMNVDLVGGYYDAGDNVKFG 98
           H+YRDAL KSILFF+GQRSGKLPP+Q++ WR+DS L DGS+  VDL GGYYDAGDN+KFG
Sbjct: 27  HDYRDALRKSILFFEGQRSGKLPPDQRLKWRRDSALRDGSSAGVDLSGGYYDAGDNIKFG 86

Query: 99  FPMAFTTTMLSWSVIEFGGVMKNELNNAKEAIRWATDYLLKATALPDTIFVQVGDANRDH 158
           FPMAFTTTMLSWS+I+FG  M  EL NA +A++W TDYLLKATA+P  +FVQVGDA  DH
Sbjct: 87  FPMAFTTTMLSWSIIDFGKTMGPELRNAVKAVKWGTDYLLKATAIPGVVFVQVGDAYSDH 146

Query: 159 ACWERPEDMDTPRTVLKIDKNTPGSEVAAETAAALASASLVFKKSDPTYSNLLIKTAIRV 218
            CWERPEDMDT RTV KID+  PGS+VA ETAAALA+AS+VF+K DP YS LL+  A RV
Sbjct: 147 NCWERPEDMDTLRTVYKIDRAHPGSDVAGETAAALAAASIVFRKRDPAYSRLLLDRATRV 206

Query: 219 FEFADKYRGSYSNGLKNYVCPFYCSFSGYQDELLWGAAWLHRATKNGSFLNYIQENGQNL 278
           F FA++YRG+YSN L + VCPFYC F+GYQDELLWGAAWLH+A++  ++  +I +N   L
Sbjct: 207 FAFANRYRGAYSNSLYHAVCPFYCDFNGYQDELLWGAAWLHKASRKRAYREFIVKNEVIL 266

Query: 279 GGGEFDNTFGWDNKHVGARILLSKAFLIQNEKSFHDYKNHADSFICSLIPGAPSSSAQYT 338
             G+  N FGWDNKH G  +L+SK  L+   + F  +K +AD FICS++PG      QY+
Sbjct: 267 KAGDTINEFGWDNKHAGINVLISKEVLMGKAEYFESFKQNADGFICSILPGISHPQVQYS 326

Query: 339 PGGLLFKMGDSNMQYVTSTSFLLLTYAKYLTSAHTTAYCTARTITPNILRAIAKKQIDYL 398
            GGLL K G SNMQ+VTS SFLLL Y+ YL+ A     C   T +P++LR IAK+Q+DY+
Sbjct: 327 RGGLLVKTGGSNMQHVTSLSFLLLAYSNYLSHAKKVVPCGELTASPSLLRQIAKRQVDYI 386

Query: 399 LGENPLKMSYMVGYGGRYPRRIHHRGSSLPSIAEHPAKIGCSSGFSAMDSNSPNPNILVG 458
           LG+NP+ +SYMVGYG ++PRRIHHRGSS+PS++ HP+ IGC  G     S +PNPN+LVG
Sbjct: 387 LGDNPMGLSYMVGYGQKFPRRIHHRGSSVPSVSAHPSHIGCKEGSRYFLSPNPNPNLLVG 446

Query: 459 AVVGGPDQNDEFPDERSDFEQSEPSTYINAPLVGSLAYF-AHS 501
           AVVGGP+  D FPD R  F+QSEP+TYINAPLVG L YF AHS
Sbjct: 447 AVVGGPNVTDAFPDSRPYFQQSEPTTYINAPLVGLLGYFSAHS 489

BLAST of HG10019301 vs. TAIR 10
Match: AT1G23210.1 (glycosyl hydrolase 9B6 )

HSP 1 Score: 611.7 bits (1576), Expect = 5.4e-175
Identity = 297/498 (59.64%), Positives = 367/498 (73.69%), Query Frame = 0

Query: 1   MALSPFSFKLIAFSFLLLTLSDASPATVGHHRRPRFTPHNYRDALAKSILFFQGQRSGKL 60
           MA   F    I  + LLL     SP T        +  H+YRDAL KSILFF+GQRSGKL
Sbjct: 1   MAGKSFMTPAIMLAMLLL----ISPET--------YAGHDYRDALRKSILFFEGQRSGKL 60

Query: 61  PPNQKMTWRKDSGLFDGSTMNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGVMK 120
           PP+Q++ WR+DS L DGS+  VDL GGYYDAGDNVKFGFPMAFTTTM+SWSVI+FG  M 
Sbjct: 61  PPDQRLKWRRDSALRDGSSAGVDLTGGYYDAGDNVKFGFPMAFTTTMMSWSVIDFGKTMG 120

Query: 121 NELNNAKEAIRWATDYLLKATALPDTIFVQVGDANRDHACWERPEDMDTPRTVLKIDKNT 180
            EL NA +AI+W TDYL+KAT +PD +FVQVGDA  DH CWERPEDMDT RTV KIDK+ 
Sbjct: 121 PELENAVKAIKWGTDYLMKATQIPDVVFVQVGDAYSDHNCWERPEDMDTLRTVYKIDKDH 180

Query: 181 PGSEVAAETAAALASASLVFKKSDPTYSNLLIKTAIRVFEFADKYRGSYSNGLKNYVCPF 240
            GSEVA ETAAALA+AS+VF+K DP YS +L+  A RVF FA KYRG+YS+ L   VCPF
Sbjct: 181 SGSEVAGETAAALAAASIVFEKRDPVYSKMLLDRATRVFAFAQKYRGAYSDSLYQAVCPF 240

Query: 241 YCSFSGYQDELLWGAAWLHRATKNGSFLNYIQENGQNLGGGEFDNTFGWDNKHVGARILL 300
           YC F+GY+DELLWGAAWLH+A+K   +  +I +N   L  G+  + FGWDNKH G  +L+
Sbjct: 241 YCDFNGYEDELLWGAAWLHKASKKRVYREFIVKNQVILRAGDTIHEFGWDNKHAGINVLV 300

Query: 301 SKAFLIQNEKSFHDYKNHADSFICSLIPGAPSSSAQYTPGGLLFKMGDSNMQYVTSTSFL 360
           SK  L+   + F  +K +AD FICSL+PG      QY+ GGLL K G SNMQ+VTS SFL
Sbjct: 301 SKMVLMGKAEYFQSFKQNADEFICSLLPGISHPQVQYSQGGLLVKSGGSNMQHVTSLSFL 360

Query: 361 LLTYAKYLTSAHTTAYCTARTITPNILRAIAKKQIDYLLGENPLKMSYMVGYGGRYPRRI 420
           LLTY+ YL+ A+    C   T +P +LR +AK+Q+DY+LG+NP+KMSYMVGYG R+P++I
Sbjct: 361 LLTYSNYLSHANKVVPCGEFTASPALLRQVAKRQVDYILGDNPMKMSYMVGYGSRFPQKI 420

Query: 421 HHRGSSLPSIAEHPAKIGCSSGFSAMDSNSPNPNILVGAVVGGPDQNDEFPDERSDFEQS 480
           HHRGSS+PS+ +HP +IGC  G     SN+PNPN+L+GAVVGGP+  D+FPD R  F+ +
Sbjct: 421 HHRGSSVPSVVDHPDRIGCKDGSRYFFSNNPNPNLLIGAVVGGPNITDDFPDSRPYFQLT 480

Query: 481 EPSTYINAPLVGSLAYFA 499
           EP+TYINAPL+G L YF+
Sbjct: 481 EPTTYINAPLLGLLGYFS 486

BLAST of HG10019301 vs. TAIR 10
Match: AT1G22880.1 (cellulase 5 )

HSP 1 Score: 590.5 bits (1521), Expect = 1.3e-168
Identity = 288/498 (57.83%), Positives = 369/498 (74.10%), Query Frame = 0

Query: 4   SPFSFKLIAFSFLLLTLSDASPATVGHHRRPRFTPHNYRDALAKSILFFQGQRSGKLPPN 63
           SPF F +   S L L  + ASP              NYR+AL+KS+LFFQGQRSG+LP +
Sbjct: 3   SPFFF-VFLLSALSLENTYASP--------------NYREALSKSLLFFQGQRSGRLPSD 62

Query: 64  QKMTWRKDSGLFDGSTMNVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGVMKNEL 123
           Q+++WR  SGL DGS+ +VDL GGYYDAGDNVKF FPMAFTTTMLSWS +E+G  M  EL
Sbjct: 63  QQLSWRSSSGLSDGSSAHVDLTGGYYDAGDNVKFNFPMAFTTTMLSWSSLEYGKKMGPEL 122

Query: 124 NNAKEAIRWATDYLLK-ATALPDTIFVQVGDANRDHACWERPEDMDTPRTVLKIDKNTPG 183
            N++ AIRWATDYLLK A A P  ++V VGD N DH CWERPEDMDTPRTV  +  + PG
Sbjct: 123 QNSRVAIRWATDYLLKCARATPGKLYVGVGDPNGDHKCWERPEDMDTPRTVYSVSPSNPG 182

Query: 184 SEVAAETAAALASASLVFKKSDPTYSNLLIKTAIRVFEFADKYRGSYSNGLKNYVCPFYC 243
           S+VAAETAAALA++S+VF+K DP YS LL+ TA +V +FA +YRG+YSN L + VCPFYC
Sbjct: 183 SDVAAETAAALAASSMVFRKVDPKYSRLLLATAKKVMQFAIQYRGAYSNSLSSSVCPFYC 242

Query: 244 SFSGYQDELLWGAAWLHRATKNGSFLNYIQENGQNLGGGEFDNTFGWDNKHVGARILLSK 303
           S+SGY+DELLWGAAWLHRAT +  + N+I    ++LGGG+  + F WDNK+ GA +LLS+
Sbjct: 243 SYSGYKDELLWGAAWLHRATNDPYYTNFI----KSLGGGDQPDIFSWDNKYAGAYVLLSR 302

Query: 304 AFLIQNEKSFHDYKNHADSFICSLIPGAPSSSAQYTPGGLLFKMGDSNMQYVTSTSFLLL 363
             ++  + +F  YK  A++F+C ++P +PSSS +YT GGL++K+  SN+QYVTS +FLL 
Sbjct: 303 RAVLNKDNNFELYKQAAENFMCKILPNSPSSSTKYTKGGLMYKLPQSNLQYVTSITFLLT 362

Query: 364 TYAKYLTSAHTTAYCTARTITPNILRAIAKKQIDYLLGENPLKMSYMVGYGGRYPRRIHH 423
           TYAKY+ S   T  C    I PN L  ++K+Q+DY+LG NP+KMSYMVG+   +P+RIHH
Sbjct: 363 TYAKYMKSTKQTFNCGNSLIVPNALINLSKRQVDYVLGVNPMKMSYMVGFSSNFPKRIHH 422

Query: 424 RGSSLPSIAEHPAKIGCSSGFSAMDSNSPNPNILVGAVVGGPDQNDEFPDERSDFEQSEP 483
           RGSSLPS A     +GC+ GF +  + +PNPNIL GA+VGGP+QNDE+PD+R D+ +SEP
Sbjct: 423 RGSSLPSRAVRSNSLGCNGGFQSFRTQNPNPNILTGAIVGGPNQNDEYPDQRDDYTRSEP 481

Query: 484 STYINAPLVGSLAYFAHS 501
           +TYINA  VG LAYFA S
Sbjct: 483 ATYINAAFVGPLAYFAAS 481

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_038905683.1  1.0e-281  94.25         endoglucanase 17-like [Benincasa hispida]
XP_022934344.1  7.1e-270  89.88         endoglucanase 17-like [Cucurbita moschata]
KAG6581281.1    9.2e-270  89.88         Endoglucanase 17, partial [Cucurbita argyrosperma subsp. sororia] >KAG7018003.1 ...
XP_023528182.1  6.0e-269  89.68         endoglucanase 17-like [Cucurbita pepo subsp. pepo]
XP_008454321.1  8.6e-268  90.73         PREDICTED: endoglucanase 17 [Cucumis melo] >TYK29519.1 endoglucanase 17 [Cucumis...

Match Name      E-value   Identity (%)  Description
O81416          6.1e-224  76.42         Endoglucanase 17 OS=Arabidopsis thaliana OX=3702 GN=At4g02290 PE=2 SV=1
Q9SRX3          1.2e-216  73.36         Endoglucanase 1 OS=Arabidopsis thaliana OX=3702 GN=CEL2 PE=2 SV=1
Q8LQ92          1.3e-210  69.70         Endoglucanase 3 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU8 PE=2 SV=1
P05522          3.0e-178  63.70         Endoglucanase 1 OS=Persea americana OX=3435 GN=CEL1 PE=2 SV=1
Q652F9          1.2e-174  60.82         Endoglucanase 17 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU13 PE=2 SV=1

Match Name      E-value   Identity (%)  Description
A0A6J1F2B2      3.4e-270  89.88         Endoglucanase OS=Cucurbita moschata OX=3662 GN=LOC111441537 PE=3 SV=1
A0A5D3E1B8      4.2e-268  90.73         Endoglucanase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold655G001220...
A0A1S3BZ39      4.2e-268  90.73         Endoglucanase OS=Cucumis melo OX=3656 GN=LOC103494756 PE=3 SV=1
A0A5A7TS46      2.1e-267  90.53         Endoglucanase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold46G001200 ...
A0A0A0KWL5      2.1e-243  84.84         Endoglucanase OS=Cucumis sativus OX=3659 GN=Csa_4G001940 PE=3 SV=1

Match Name      E-value   Identity (%)  Description
AT4G02290.1     4.3e-225  76.42         glycosyl hydrolase 9B13
AT1G02800.1     8.7e-218  73.36         cellulase 2
AT1G70710.1     8.3e-176  62.85         glycosyl hydrolase 9B1
AT1G23210.1     5.4e-175  59.64         glycosyl hydrolase 9B6
AT1G22880.1     1.3e-168  57.83         cellulase 5

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                                     Source       Source Term      Source Description       Alignment
IPR001701  Glycoside hydrolase family 9                        PFAM         PF00759          Glyco_hydro_9            coord: 41..493; e-value: 1.1E-143; score: 479.8
IPR012341  Six-hairpin glycosidase-like superfamily            GENE3D       1.50.10.10       -                        coord: 37..504; e-value: 5.4E-170; score: 568.3
None       No IPR available                                    PANTHER      PTHR22298:SF159  ENDOGLUCANASE            coord: 8..504
None       No IPR available                                    PANTHER      PTHR22298        ENDO-1,4-BETA-GLUCANASE  coord: 8..504
IPR033126  Glycosyl hydrolases family 9, Asp/Glu active sites  PROSITE      PS00698          GH9_3                    coord: 470..488
IPR018221  Glycoside hydrolase family 9, His active site       PROSITE      PS00592          GH9_2                    coord: 397..423
IPR008928  Six-hairpin glycosidase superfamily                 SUPERFAMILY  48208            Six-hairpin glycosidases coord: 32..499
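
The `coord` values in the InterPro rows above read most naturally as 1-based, inclusive residue ranges on the predicted protein. Under that reading, a short sketch of how a span length and its share of the query could be derived follows. `QUERY_LENGTH = 505` is an assumption taken from the last query residue shown in the alignments on this page, and the names `DOMAINS` and `span_length` are illustrative only.

```python
# Coordinates copied from the InterPro rows (assumed 1-based, inclusive).
DOMAINS = {
    "PF00759 (Glyco_hydro_9)": (41, 493),
    "PS00592 (GH9_2)": (397, 423),
    "PS00698 (GH9_3)": (470, 488),
}

# Assumption: the query protein is 505 residues long, which is the last
# query position shown in the BLAST alignments on this page.
QUERY_LENGTH = 505

def span_length(start, end):
    """Number of residues in an inclusive coordinate range."""
    return end - start + 1

for name, (start, end) in DOMAINS.items():
    n = span_length(start, end)
    print(f"{name}: {n} residues, {100 * n / QUERY_LENGTH:.1f}% of query")
```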

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10019301.1  HG10019301.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0030245      cellulose catabolic process
biological_process  GO:0005975      carbohydrate metabolic process
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0008810      cellulase activity
molecular_function  GO:0004553      hydrolase activity, hydrolyzing O-glycosyl compounds
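
For downstream use, the GO assignments above can be grouped by category. The following is a minimal sketch that assumes the rows are available as whitespace-separated text in the order category, accession, term name. The names `GO_ROWS` and `group_go_terms` are illustrative and do not come from the database.

```python
from collections import defaultdict

# GO rows copied from the annotation table for HG10019301.
GO_ROWS = """\
biological_process GO:0030245 cellulose catabolic process
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0008810 cellulase activity
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
"""

def group_go_terms(text):
    """Group (accession, term name) pairs by GO category."""
    grouped = defaultdict(list)
    for row in text.strip().splitlines():
        # Split on the first two runs of whitespace; the term name keeps its spaces.
        category, accession, name = row.split(None, 2)
        grouped[category].append((accession, name))
    return dict(grouped)

grouped = group_go_terms(GO_ROWS)
for category, terms in grouped.items():
    print(category, [accession for accession, _ in terms])
```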