HG10003485 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10003485
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionEndoglucanase
LocationChr08: 2139782 .. 2142982 (+)
RNA-Seq ExpressionHG10003485
SyntenyHG10003485
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTGCTGCCATAACAAATACTTCAACTCTCTTCTTCTTTTTCTTTCTCTTCTTTTCATGCTCATTGGTCCAACGTGCTCGAGCCGACCTGAATTACAGAGATGCCTTGGCTAAGTCGATATTGTTCTTCGAAGGACAACGCTCCGGTAGGCTCCCGGCTGGCCAACGGATCACTTGGAGGTCCAACTCTGGCCTCTATGATGGTGAACTTGCTCACGTATGTGTTAATCATAATATTTCATAACTTTAAGAGTCCCCTTTTAATAATATTCGTATTTGTTTCTTAAGATATGTTCGGTAATTAATTTTTTATTAGTGTCTTTTATTTTATAGATGAAAAAAGAAAATTACTTTGGTAATGCCTTTTTAATTTACATCACATTAATTTTACAATAAAAATAAAATAATAATTGTCTTATCCTTAAACATGTTTGGTATTATTTTTGTTGTTAAAATCACTTTGAAACAAGTTTTTGATCATTCAAAATTAATTTGATGTTTAATTTTATACTTTTAATTGTATTTTTCATATTATAAAAATTGATTTCAAATAATTAAAAATATTTTTCGAATAATTTTTTAAAATGTCAAAAGTAATTTTAATTATTTTTAAATCACTCTCAAACCAGTCAAAACTAGCTAGATATTAATGTTTCAATTCATATTTTTTCAAAAAAAAAGTAAAAAAATCAATTGAACTCCAAAATATTAAGTATACATTTTGAAAAAGAAAAAAAAAACAAATAAAATCTATGAAATGCCCTCATAATTTTTTTTTAATCTTTTTTTTTAAGAAATTAAATTATTAAACAGTGATTTTGCTCAGTAAATTAAAAATTGAGTTTGTGAAAAATGGAATTGTTTTGTAATTGTTAGGTGGATTTAACCGGCGGGTACTACGACGCCGGCGACAATGTGAAATTCAACTTACCAATGGCATTCACAACCACAATGCTTTCATGGGGAGCACTCGAGTACGGCGCGCGTATGGGCACAGAATTACCCAACGCACGCGCCGCCATCCGTTGGGCCACCGATTACCTTCTCAAGTGCGCCACCGCCACTCCCGGCAAGCTCTACGTCGGCGTTGGCGACCCCCACGTCGACCACAAATGCTGGGAACGCCCTGAGGACATGGATACAGTTCGAACTGTCTACTCTGTTTCCGCCGGGAACCCTGGATCCGATGTTGCCGGCGAAACCGCCGCAGCCCTCGCCGCCGCATCGCTGGTGTTCCGCCGGGTTGACCGGAAATACTCCCGGCTGCTACTTGCGACGGCGAAGAAGGTGCTGCAGTTTGCTTTGGAGCACCGTGGATCGTACAGTGATTCGCTTTCCTCTGCTGTCTGTCCCTTTTATTGCTCTTATTCTGGATATAAGGTGAGCAATTTTTCAATAATAATTTCCTTTTTTAAAAAATTAAGCTTGTAAATAATAAAAAAAAATATTATACTATTTATTTTTATAAGTTTGTTTGTTATAGTAATATTAAAAATGTTTGATAAATGAAAATAGGATGAATTGGTGTGGGGAGCAACATGGCTACTAAGAGCAACAAATGATGTTCAATACTTCAATTTGTTGAAGTCATTGGGTGGTGATGATGTGACTGACATTTTCAGTTGGGACAACAAATATGCTGGTGCTCATGTCCTTTTGTCCAGGGTAAATTCAATATATAATACAGTTAAATTTTTCTATATTTATAAATAGTTTGATTCATTTTTTTATATTTGAAAACAAGTTAAAGAAAAATATGTTTGAATTTCCTCACTCTTATTGAAATTATTGTGGTAAATGTTGTGTGTAGCGAGCATTATTAAATAATGACAAGAACTTCGATTCATACAAACAAGAGGCTGAATCATTCATGTGTCGAATTCTACCAAATTCTCCTTACTCGACTACTCAATACACACAAGGTATGCAACCTAAATATCTTAAACATATTGACCAATTCAAGCATACTTCAATCAAGTGTTTTTATATTTAGCATACAATTGTAGTAAGTTTTGTAGAGTTAGAAACATGTTTTTTTTTTTGTGATATCTGTGAGTGTATGAGTTAGCTTACGCGCATGAACTGTGTCTCTTTCTTACCACTAGGTCAATCCATGGTGGTTAACATGTTTGAGAATGATTTTGAAATGTTTACAATCAAATTTAAAATTAACACACGATTTTTATATGATCAAAATTGATTAGGAAGGATTAACATCATGTGTTGCTTTGGAGTGATTTCAACTATTTTCAAAATCATTTCTAAATTTGAAAATAAATTTTATTAATTTGATTTTTTTTTCTTTTGTTGTGTAGGGGGATTGATGTTCAAATTGCCAGAAAGTAACCTACAATATGTGACATCCATAACGTTTTTGCTGAGCACATATTCCAAATACATGTCCGCCGCCAAACACACATTCAACTGCGGCAGCCTTCTTGTCACTCCGGCTTCCCTCAAGAACCTCGCTAAGAAACAGGTAATCCCCCCAAACGCTGTCGTTTTACACTCAATTTAACAATACATCTAACTCTATAATTAATTTTAAAATTTTCACCATTTTTGAAAAAAAAAATTAAAGTTAAAGTGTAATTTTGAAATCTATAGACCAAATCAAAACAAGACTTAAATGAAATCTATAACATTTTGAAATTTAGGAGACTAAATTGAAATTAAATTTCAAATTTAAACTAAAATTGTAATATTTTGAAACTTAGAATGGAAATTAAAACAAAAGTTTAAGACCAAAGAAGGTATTTTTTTTTCCTTCTTCAAACATCTTTCGAAAAATCAAAATAGTAATTTAAAAAAAGAAAAAAACTTATCCTATAGACCAATTTTCAAATTCTCATTCTTCTTTCTTGAAATTCAGGTGGATTATATATTGGGAGTGAACCCATTGAAAATGTCATACATGGTTGGATTTGGAAAAAACTTCCCAAGAAGAATTCACCACAGAGGATCTTCATTGCCTTCCAAGGCCACCCACCCTCAGGCCATCGCCTGCGACGGCGGCTTCCAACCCTTCTTCTACTCCTACAATCCCAACCCCAATATCTTAACCGGCGCCATTGTCGGTGGCCCCAACCAAAACGATGGCTTTCCTGACGACCGCACCGATTACAGCCACTCTGAGCCTGCCACATATATCAACGCCGCCCTTGTTGGCCCTCTTGCCTTCTTCTCTGGCAAGAATTGA

mRNA sequence

ATGGCTGCTGCCATAACAAATACTTCAACTCTCTTCTTCTTTTTCTTTCTCTTCTTTTCATGCTCATTGGTCCAACGTGCTCGAGCCGACCTGAATTACAGAGATGCCTTGGCTAAGTCGATATTGTTCTTCGAAGGACAACGCTCCGGTAGGCTCCCGGCTGGCCAACGGATCACTTGGAGGTCCAACTCTGGCCTCTATGATGGTGAACTTGCTCACGTGGATTTAACCGGCGGGTACTACGACGCCGGCGACAATGTGAAATTCAACTTACCAATGGCATTCACAACCACAATGCTTTCATGGGGAGCACTCGAGTACGGCGCGCGTATGGGCACAGAATTACCCAACGCACGCGCCGCCATCCGTTGGGCCACCGATTACCTTCTCAAGTGCGCCACCGCCACTCCCGGCAAGCTCTACGTCGGCGTTGGCGACCCCCACGTCGACCACAAATGCTGGGAACGCCCTGAGGACATGGATACAGTTCGAACTGTCTACTCTGTTTCCGCCGGGAACCCTGGATCCGATGTTGCCGGCGAAACCGCCGCAGCCCTCGCCGCCGCATCGCTGGTGTTCCGCCGGGTTGACCGGAAATACTCCCGGCTGCTACTTGCGACGGCGAAGAAGGTGCTGCAGTTTGCTTTGGAGCACCGTGGATCGTACAGTGATTCGCTTTCCTCTGCTGTCTGTCCCTTTTATTGCTCTTATTCTGGATATAAGGATGAATTGGTGTGGGGAGCAACATGGCTACTAAGAGCAACAAATGATGTTCAATACTTCAATTTGTTGAAGTCATTGGGTGGTGATGATGTGACTGACATTTTCAGTTGGGACAACAAATATGCTGGTGCTCATGTCCTTTTGTCCAGGCGAGCATTATTAAATAATGACAAGAACTTCGATTCATACAAACAAGAGGCTGAATCATTCATGTGTCGAATTCTACCAAATTCTCCTTACTCGACTACTCAATACACACAAGGGGGATTGATGTTCAAATTGCCAGAAAGTAACCTACAATATGTGACATCCATAACGTTTTTGCTGAGCACATATTCCAAATACATGTCCGCCGCCAAACACACATTCAACTGCGGCAGCCTTCTTGTCACTCCGGCTTCCCTCAAGAACCTCGCTAAGAAACAGGTGGATTATATATTGGGAGTGAACCCATTGAAAATGTCATACATGGTTGGATTTGGAAAAAACTTCCCAAGAAGAATTCACCACAGAGGATCTTCATTGCCTTCCAAGGCCACCCACCCTCAGGCCATCGCCTGCGACGGCGGCTTCCAACCCTTCTTCTACTCCTACAATCCCAACCCCAATATCTTAACCGGCGCCATTGTCGGTGGCCCCAACCAAAACGATGGCTTTCCTGACGACCGCACCGATTACAGCCACTCTGAGCCTGCCACATATATCAACGCCGCCCTTGTTGGCCCTCTTGCCTTCTTCTCTGGCAAGAATTGA

Coding sequence (CDS)

ATGGCTGCTGCCATAACAAATACTTCAACTCTCTTCTTCTTTTTCTTTCTCTTCTTTTCATGCTCATTGGTCCAACGTGCTCGAGCCGACCTGAATTACAGAGATGCCTTGGCTAAGTCGATATTGTTCTTCGAAGGACAACGCTCCGGTAGGCTCCCGGCTGGCCAACGGATCACTTGGAGGTCCAACTCTGGCCTCTATGATGGTGAACTTGCTCACGTGGATTTAACCGGCGGGTACTACGACGCCGGCGACAATGTGAAATTCAACTTACCAATGGCATTCACAACCACAATGCTTTCATGGGGAGCACTCGAGTACGGCGCGCGTATGGGCACAGAATTACCCAACGCACGCGCCGCCATCCGTTGGGCCACCGATTACCTTCTCAAGTGCGCCACCGCCACTCCCGGCAAGCTCTACGTCGGCGTTGGCGACCCCCACGTCGACCACAAATGCTGGGAACGCCCTGAGGACATGGATACAGTTCGAACTGTCTACTCTGTTTCCGCCGGGAACCCTGGATCCGATGTTGCCGGCGAAACCGCCGCAGCCCTCGCCGCCGCATCGCTGGTGTTCCGCCGGGTTGACCGGAAATACTCCCGGCTGCTACTTGCGACGGCGAAGAAGGTGCTGCAGTTTGCTTTGGAGCACCGTGGATCGTACAGTGATTCGCTTTCCTCTGCTGTCTGTCCCTTTTATTGCTCTTATTCTGGATATAAGGATGAATTGGTGTGGGGAGCAACATGGCTACTAAGAGCAACAAATGATGTTCAATACTTCAATTTGTTGAAGTCATTGGGTGGTGATGATGTGACTGACATTTTCAGTTGGGACAACAAATATGCTGGTGCTCATGTCCTTTTGTCCAGGCGAGCATTATTAAATAATGACAAGAACTTCGATTCATACAAACAAGAGGCTGAATCATTCATGTGTCGAATTCTACCAAATTCTCCTTACTCGACTACTCAATACACACAAGGGGGATTGATGTTCAAATTGCCAGAAAGTAACCTACAATATGTGACATCCATAACGTTTTTGCTGAGCACATATTCCAAATACATGTCCGCCGCCAAACACACATTCAACTGCGGCAGCCTTCTTGTCACTCCGGCTTCCCTCAAGAACCTCGCTAAGAAACAGGTGGATTATATATTGGGAGTGAACCCATTGAAAATGTCATACATGGTTGGATTTGGAAAAAACTTCCCAAGAAGAATTCACCACAGAGGATCTTCATTGCCTTCCAAGGCCACCCACCCTCAGGCCATCGCCTGCGACGGCGGCTTCCAACCCTTCTTCTACTCCTACAATCCCAACCCCAATATCTTAACCGGCGCCATTGTCGGTGGCCCCAACCAAAACGATGGCTTTCCTGACGACCGCACCGATTACAGCCACTCTGAGCCTGCCACATATATCAACGCCGCCCTTGTTGGCCCTCTTGCCTTCTTCTCTGGCAAGAATTGA

Protein sequence

MAAAITNTSTLFFFFFLFFSCSLVQRARADLNYRDALAKSILFFEGQRSGRLPAGQRITWRSNSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGTELPNARAAIRWATDYLLKCATATPGKLYVGVGDPHVDHKCWERPEDMDTVRTVYSVSAGNPGSDVAGETAAALAAASLVFRRVDRKYSRLLLATAKKVLQFALEHRGSYSDSLSSAVCPFYCSYSGYKDELVWGATWLLRATNDVQYFNLLKSLGGDDVTDIFSWDNKYAGAHVLLSRRALLNNDKNFDSYKQEAESFMCRILPNSPYSTTQYTQGGLMFKLPESNLQYVTSITFLLSTYSKYMSAAKHTFNCGSLLVTPASLKNLAKKQVDYILGVNPLKMSYMVGFGKNFPRRIHHRGSSLPSKATHPQAIACDGGFQPFFYSYNPNPNILTGAIVGGPNQNDGFPDDRTDYSHSEPATYINAALVGPLAFFSGKN
Homology
BLAST of HG10003485 vs. NCBI nr
Match: XP_038890642.1 (endoglucanase 9-like [Benincasa hispida])

HSP 1 Score: 965.7 bits (2495), Expect = 1.5e-277
Identity = 468/490 (95.51%), Postives = 479/490 (97.76%), Query Frame = 0

Query: 1   MAAAITNTSTLFFFFFLFFSCSLVQRARADLNYRDALAKSILFFEGQRSGRLPAGQRITW 60
           MAAAITN+STLFFFF L  S SLV   R D NYRDAL+KSILFFEGQRSGR+PA QRITW
Sbjct: 1   MAAAITNSSTLFFFFLLLLSFSLVDHTRGDPNYRDALSKSILFFEGQRSGRIPANQRITW 60

Query: 61  RSNSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGTELPNARA 120
           RSNSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGTELPNARA
Sbjct: 61  RSNSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGTELPNARA 120

Query: 121 AIRWATDYLLKCATATPGKLYVGVGDPHVDHKCWERPEDMDTVRTVYSVSAGNPGSDVAG 180
           AIRWATDYLLKCATATPGKLYVGVGDPHVDHKCWERPEDMDTVRTVYSVSAGNPGSDVA 
Sbjct: 121 AIRWATDYLLKCATATPGKLYVGVGDPHVDHKCWERPEDMDTVRTVYSVSAGNPGSDVAA 180

Query: 181 ETAAALAAASLVFRRVDRKYSRLLLATAKKVLQFALEHRGSYSDSLSSAVCPFYCSYSGY 240
           ETAAALAAASLVFRRVDRKYSR+LLATAKKV+QFALEHRGSYSDSLSSAVCPFYCSYSGY
Sbjct: 181 ETAAALAAASLVFRRVDRKYSRVLLATAKKVMQFALEHRGSYSDSLSSAVCPFYCSYSGY 240

Query: 241 KDELVWGATWLLRATNDVQYFNLLKSLGGDDVTDIFSWDNKYAGAHVLLSRRALLNNDKN 300
           KDELVWGATWLLRATNDVQYFNLLKSLGGDDVTDIFSWDNKYAGAHVLLSRRALLNNDKN
Sbjct: 241 KDELVWGATWLLRATNDVQYFNLLKSLGGDDVTDIFSWDNKYAGAHVLLSRRALLNNDKN 300

Query: 301 FDSYKQEAESFMCRILPNSPYSTTQYTQGGLMFKLPESNLQYVTSITFLLSTYSKYMSAA 360
           FDSYKQEAESFMCRILPNSPYS+TQYTQGGLMFKLPESNLQYVTSITFLL+TYSKYMSAA
Sbjct: 301 FDSYKQEAESFMCRILPNSPYSSTQYTQGGLMFKLPESNLQYVTSITFLLTTYSKYMSAA 360

Query: 361 KHTFNCGSLLVTPASLKNLAKKQVDYILGVNPLKMSYMVGFGKNFPRRIHHRGSSLPSKA 420
           KHTFNCGSL+VTPASLKNLAKKQVDYILGVNPLKMSYMVGFGK+FPRRIHHRGSSLPSKA
Sbjct: 361 KHTFNCGSLIVTPASLKNLAKKQVDYILGVNPLKMSYMVGFGKSFPRRIHHRGSSLPSKA 420

Query: 421 THPQAIACDGGFQPFFYSYNPNPNILTGAIVGGPNQNDGFPDDRTDYSHSEPATYINAAL 480
           +HPQAIACDGGFQPFFYSYNPNPNILTGA+VGGPNQNDGFPDDRTDYSHSEPATYINAAL
Sbjct: 421 SHPQAIACDGGFQPFFYSYNPNPNILTGAVVGGPNQNDGFPDDRTDYSHSEPATYINAAL 480

Query: 481 VGPLAFFSGK 491
           VGPLAFFSGK
Sbjct: 481 VGPLAFFSGK 490

BLAST of HG10003485 vs. NCBI nr
Match: XP_004141534.1 (endoglucanase 9 [Cucumis sativus] >KGN52647.1 hypothetical protein Csa_008478 [Cucumis sativus])

HSP 1 Score: 928.7 bits (2399), Expect = 2.1e-266
Identity = 448/491 (91.24%), Postives = 472/491 (96.13%), Query Frame = 0

Query: 1   MAAAITNTSTLFFFFFLFFSCSLVQRARADLNYRDALAKSILFFEGQRSGRLPAGQRITW 60
           MA+AI+N+STLF  FFL  S S   RA A  NYRDALAKSILFFEGQRSGR+PA QRITW
Sbjct: 1   MASAISNSSTLFLLFFLLLSFSFAGRALAGPNYRDALAKSILFFEGQRSGRIPANQRITW 60

Query: 61  RSNSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGTELPNARA 120
           RSNSGLYDGEL HVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMG+ELPN RA
Sbjct: 61  RSNSGLYDGELDHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRA 120

Query: 121 AIRWATDYLLKCATATPGKLYVGVGDPHVDHKCWERPEDMDTVRTVYSVSAGNPGSDVAG 180
           AIRWATDYLLKCATATPGKLYVGVG+PH DHKCWERPEDMDTVRTVYSVSAGNPGSDVAG
Sbjct: 121 AIRWATDYLLKCATATPGKLYVGVGEPHADHKCWERPEDMDTVRTVYSVSAGNPGSDVAG 180

Query: 181 ETAAALAAASLVFRRVDRKYSRLLLATAKKVLQFALEHRGSYSDSLSSAVCPFYCSYSGY 240
           ETAAALAAASLVFRRVDRKYS++LLATAKKV++FALEHRGSYSDSLSSAVCPFYCSYSGY
Sbjct: 181 ETAAALAAASLVFRRVDRKYSKVLLATAKKVMEFALEHRGSYSDSLSSAVCPFYCSYSGY 240

Query: 241 KDELVWGATWLLRATNDVQYFNLLKSLGGDDVTDIFSWDNKYAGAHVLLSRRALLNNDKN 300
           KDELVWGA WLLRATN+V+YFNLLKSLGGDDVTDIFSWDNK+AGAHVLLSRR+LLNNDKN
Sbjct: 241 KDELVWGAAWLLRATNNVKYFNLLKSLGGDDVTDIFSWDNKFAGAHVLLSRRSLLNNDKN 300

Query: 301 FDSYKQEAESFMCRILPNSPYSTTQYTQGGLMFKLPESNLQYVTSITFLLSTYSKYMSAA 360
           FDSYKQEAE+FMCRILPNSP S+TQYTQG LMFKLPESNLQYVTSITFLL+TYSKYMSAA
Sbjct: 301 FDSYKQEAEAFMCRILPNSPSSSTQYTQGRLMFKLPESNLQYVTSITFLLTTYSKYMSAA 360

Query: 361 KHTFNCGSLLVTPASLKNLAKKQVDYILGVNPLKMSYMVGFGKNFPRRIHHRGSSLPSKA 420
           KHTFNCG+L+VTPASLKNLAK QVDYILGVNPLKMSYMVGFGKN+P+RIHHRGSSLPSKA
Sbjct: 361 KHTFNCGNLVVTPASLKNLAKIQVDYILGVNPLKMSYMVGFGKNYPKRIHHRGSSLPSKA 420

Query: 421 THPQAIACDGGFQPFFYSYNPNPNILTGAIVGGPNQNDGFPDDRTDYSHSEPATYINAAL 480
           THPQAIACDGGFQPFFYSYNPNPNILTGA+VGGPNQ+DGFPDDRTDYSHSEPATYINAAL
Sbjct: 421 THPQAIACDGGFQPFFYSYNPNPNILTGAVVGGPNQSDGFPDDRTDYSHSEPATYINAAL 480

Query: 481 VGPLAFFSGKN 492
           VGPLAFFSGK+
Sbjct: 481 VGPLAFFSGKH 491

BLAST of HG10003485 vs. NCBI nr
Match: KAG6607530.1 (Endoglucanase 9, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 919.1 bits (2374), Expect = 1.6e-263
Identity = 443/488 (90.78%), Postives = 462/488 (94.67%), Query Frame = 0

Query: 3   AAITNTSTLFFFFFLFFSCSLVQRARADLNYRDALAKSILFFEGQRSGRLPAGQRITWRS 62
           AA TN  T FFF  L  S      ARA+ NYRDALAKS+LFF+GQRSGR+P GQ+I WRS
Sbjct: 2   AATTNAPTFFFFILLLSSSFSFYTARANPNYRDALAKSLLFFQGQRSGRIPNGQQIAWRS 61

Query: 63  NSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGTELPNARAAI 122
           NSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMG+ELPN RAAI
Sbjct: 62  NSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRAAI 121

Query: 123 RWATDYLLKCATATPGKLYVGVGDPHVDHKCWERPEDMDTVRTVYSVSAGNPGSDVAGET 182
           RWATDYLLKCATATPGKLYVGVGDP+VDHKCWERPEDMDTVRTVYSVSA NPGSDVAGET
Sbjct: 122 RWATDYLLKCATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVYSVSAANPGSDVAGET 181

Query: 183 AAALAAASLVFRRVDRKYSRLLLATAKKVLQFALEHRGSYSDSLSSAVCPFYCSYSGYKD 242
           AAALAAASLVFRRVDRKYS LLLATAKKV QFA+EHRGSYSDSL SAVCPFYCSYSGYKD
Sbjct: 182 AAALAAASLVFRRVDRKYSGLLLATAKKVFQFAVEHRGSYSDSLGSAVCPFYCSYSGYKD 241

Query: 243 ELVWGATWLLRATNDVQYFNLLKSLGGDDVTDIFSWDNKYAGAHVLLSRRALLNNDKNFD 302
           ELVWGATWLLRATNDV+YFNLLKSLGGDDV DIFSWDNKYAGAHVLLSRRALLNNDKNFD
Sbjct: 242 ELVWGATWLLRATNDVRYFNLLKSLGGDDVPDIFSWDNKYAGAHVLLSRRALLNNDKNFD 301

Query: 303 SYKQEAESFMCRILPNSPYSTTQYTQGGLMFKLPESNLQYVTSITFLLSTYSKYMSAAKH 362
           SYKQ+AESFMCRILPNSPYS+TQYTQGGLMFKLP+SNLQYVTSITFLL+TYSKYMSAAKH
Sbjct: 302 SYKQKAESFMCRILPNSPYSSTQYTQGGLMFKLPQSNLQYVTSITFLLTTYSKYMSAAKH 361

Query: 363 TFNCGSLLVTPASLKNLAKKQVDYILGVNPLKMSYMVGFGKNFPRRIHHRGSSLPSKATH 422
           TFNCG +LVTP SLKNLAK+QVDYILGVNPLKMSYMVGFGKNFP+RIHHRGSSLPSKA+H
Sbjct: 362 TFNCGGVLVTPTSLKNLAKQQVDYILGVNPLKMSYMVGFGKNFPKRIHHRGSSLPSKASH 421

Query: 423 PQAIACDGGFQPFFYSYNPNPNILTGAIVGGPNQNDGFPDDRTDYSHSEPATYINAALVG 482
           PQAI CDGGFQPFFYSYNPNPNILTGA+VGGPNQNDGFPDDR+DYSHSEPATYINAALVG
Sbjct: 422 PQAIGCDGGFQPFFYSYNPNPNILTGAVVGGPNQNDGFPDDRSDYSHSEPATYINAALVG 481

Query: 483 PLAFFSGK 491
           PLAFFSGK
Sbjct: 482 PLAFFSGK 489

BLAST of HG10003485 vs. NCBI nr
Match: KAG7037172.1 (Endoglucanase 9, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 918.3 bits (2372), Expect = 2.8e-263
Identity = 444/489 (90.80%), Postives = 463/489 (94.68%), Query Frame = 0

Query: 3   AAITNTSTLFFFFFLFFSCSL-VQRARADLNYRDALAKSILFFEGQRSGRLPAGQRITWR 62
           AA TN  T FFF  L  S S     ARA+ NYRDALAKS+LFF+GQRSGR+P GQ+I WR
Sbjct: 2   AATTNVPTFFFFLLLLLSSSFSFYTARANPNYRDALAKSLLFFQGQRSGRIPNGQQIAWR 61

Query: 63  SNSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGTELPNARAA 122
           SNSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMG+ELPN RAA
Sbjct: 62  SNSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRAA 121

Query: 123 IRWATDYLLKCATATPGKLYVGVGDPHVDHKCWERPEDMDTVRTVYSVSAGNPGSDVAGE 182
           IRWATDYLLKCATATPGKLYVGVGDP+VDHKCWERPEDMDTVRTVYSVSA NPGSDVAGE
Sbjct: 122 IRWATDYLLKCATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVYSVSAANPGSDVAGE 181

Query: 183 TAAALAAASLVFRRVDRKYSRLLLATAKKVLQFALEHRGSYSDSLSSAVCPFYCSYSGYK 242
           TAAALAAASLVFRRVDRKYS LLLATAKKV QFA+EHRGSYSDSL SAVCPFYCSYSGYK
Sbjct: 182 TAAALAAASLVFRRVDRKYSGLLLATAKKVFQFAVEHRGSYSDSLGSAVCPFYCSYSGYK 241

Query: 243 DELVWGATWLLRATNDVQYFNLLKSLGGDDVTDIFSWDNKYAGAHVLLSRRALLNNDKNF 302
           DELVWGATWLLRATNDV+YFNLLKSLGGDDV DIFSWDNKYAGAHVLLSRRALLNNDKNF
Sbjct: 242 DELVWGATWLLRATNDVRYFNLLKSLGGDDVPDIFSWDNKYAGAHVLLSRRALLNNDKNF 301

Query: 303 DSYKQEAESFMCRILPNSPYSTTQYTQGGLMFKLPESNLQYVTSITFLLSTYSKYMSAAK 362
           DSYKQ+AESFMCRILPNSPYS+TQYTQGGLMFKLP+SNLQYVTSITFLL+TYSKYMSAAK
Sbjct: 302 DSYKQKAESFMCRILPNSPYSSTQYTQGGLMFKLPQSNLQYVTSITFLLTTYSKYMSAAK 361

Query: 363 HTFNCGSLLVTPASLKNLAKKQVDYILGVNPLKMSYMVGFGKNFPRRIHHRGSSLPSKAT 422
           HTFNCG +LVTP SLKNLAK+QVDYILGVNPLKMSYMVGFGKNFP+RIHHRGSSLPSKA+
Sbjct: 362 HTFNCGGVLVTPTSLKNLAKQQVDYILGVNPLKMSYMVGFGKNFPKRIHHRGSSLPSKAS 421

Query: 423 HPQAIACDGGFQPFFYSYNPNPNILTGAIVGGPNQNDGFPDDRTDYSHSEPATYINAALV 482
           HPQAI CDGGFQPFFYSYNPNPNILTGA+VGGPNQNDGFPDDR+DYSHSEPATYINAALV
Sbjct: 422 HPQAIGCDGGFQPFFYSYNPNPNILTGAVVGGPNQNDGFPDDRSDYSHSEPATYINAALV 481

Query: 483 GPLAFFSGK 491
           GPLAFFSGK
Sbjct: 482 GPLAFFSGK 490

BLAST of HG10003485 vs. NCBI nr
Match: XP_023523831.1 (endoglucanase 9-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 918.3 bits (2372), Expect = 2.8e-263
Identity = 443/488 (90.78%), Postives = 461/488 (94.47%), Query Frame = 0

Query: 3   AAITNTSTLFFFFFLFFSCSLVQRARADLNYRDALAKSILFFEGQRSGRLPAGQRITWRS 62
           AA TN  T FFF  L  S      ARA+ NYRDALAKS+LFF+GQRSGR+P GQ+I WRS
Sbjct: 2   AATTNAPTFFFFLLLLSSSFSFYTARANPNYRDALAKSLLFFQGQRSGRIPNGQQIAWRS 61

Query: 63  NSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGTELPNARAAI 122
           NSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMG+ELPN RAAI
Sbjct: 62  NSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRAAI 121

Query: 123 RWATDYLLKCATATPGKLYVGVGDPHVDHKCWERPEDMDTVRTVYSVSAGNPGSDVAGET 182
           RWATDYLLKCATATPGKLYVGVGDP+VDHKCWERPEDMDTVRTVYSVSA NPGSDVAGET
Sbjct: 122 RWATDYLLKCATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVYSVSAANPGSDVAGET 181

Query: 183 AAALAAASLVFRRVDRKYSRLLLATAKKVLQFALEHRGSYSDSLSSAVCPFYCSYSGYKD 242
           AAALAAASLVFRRVDRKYS LLLATAKKV QFA+EHRGSYSDSL SAVCPFYCSYSGYKD
Sbjct: 182 AAALAAASLVFRRVDRKYSGLLLATAKKVFQFAVEHRGSYSDSLGSAVCPFYCSYSGYKD 241

Query: 243 ELVWGATWLLRATNDVQYFNLLKSLGGDDVTDIFSWDNKYAGAHVLLSRRALLNNDKNFD 302
           ELVWGATWLLRATNDV+YFNLLKSLGGDDV DIFSWDNKYAGAHVLLSRRALLNNDKNFD
Sbjct: 242 ELVWGATWLLRATNDVRYFNLLKSLGGDDVPDIFSWDNKYAGAHVLLSRRALLNNDKNFD 301

Query: 303 SYKQEAESFMCRILPNSPYSTTQYTQGGLMFKLPESNLQYVTSITFLLSTYSKYMSAAKH 362
           SYKQEAESFMCRILPNSPYS+TQYTQGGLMFKLP+SNLQYVTSITFLL+TYSKYMSAAKH
Sbjct: 302 SYKQEAESFMCRILPNSPYSSTQYTQGGLMFKLPQSNLQYVTSITFLLTTYSKYMSAAKH 361

Query: 363 TFNCGSLLVTPASLKNLAKKQVDYILGVNPLKMSYMVGFGKNFPRRIHHRGSSLPSKATH 422
           TFNCG +LVTP SLKNLAK+QVDYILGVNPLKMSYMVGFGK FP+RIHHRGSSLPSKA+H
Sbjct: 362 TFNCGGVLVTPTSLKNLAKQQVDYILGVNPLKMSYMVGFGKTFPKRIHHRGSSLPSKASH 421

Query: 423 PQAIACDGGFQPFFYSYNPNPNILTGAIVGGPNQNDGFPDDRTDYSHSEPATYINAALVG 482
           PQAI CDGGFQPFFYSYNPNPNILTGA+VGGPNQNDGFPDDR+DYSHSEPATYINAALVG
Sbjct: 422 PQAIGCDGGFQPFFYSYNPNPNILTGAVVGGPNQNDGFPDDRSDYSHSEPATYINAALVG 481

Query: 483 PLAFFSGK 491
           PLAFFSGK
Sbjct: 482 PLAFFSGK 489

BLAST of HG10003485 vs. ExPASy Swiss-Prot
Match: Q9C9H5 (Endoglucanase 9 OS=Arabidopsis thaliana OX=3702 GN=CEL3 PE=1 SV=1)

HSP 1 Score: 768.8 bits (1984), Expect = 3.6e-221
Identity = 366/479 (76.41%), Postives = 418/479 (87.27%), Query Frame = 0

Query: 10  TLFFFFFLFFSCSLVQRARADLNYRDALAKSILFFEGQRSGRLPAGQRITWRSNSGLYDG 69
           T  FFF L FS  L+    A+ NY++AL+KS+LFF+GQRSG LP GQ+I+WR++SGL DG
Sbjct: 2   TSLFFFVLLFSSLLISNGDANPNYKEALSKSLLFFQGQRSGPLPRGQQISWRASSGLSDG 61

Query: 70  ELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGTELPNARAAIRWATDYL 129
             AHVDLTGGYYDAGDNVKFNLPMAFTTTMLSW ALEYG RMG EL NAR  IRWATDYL
Sbjct: 62  SAAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWSALEYGKRMGPELENARVNIRWATDYL 121

Query: 130 LKCATATPGKLYVGVGDPHVDHKCWERPEDMDTVRTVYSVSAGNPGSDVAGETAAALAAA 189
           LKCA ATPGKLYVGVGDP+VDHKCWERPEDMDT RTVYSVSA NPGSDVA ETAAALAAA
Sbjct: 122 LKCARATPGKLYVGVGDPNVDHKCWERPEDMDTPRTVYSVSASNPGSDVAAETAAALAAA 181

Query: 190 SLVFRRVDRKYSRLLLATAKKVLQFALEHRGSYSDSLSSAVCPFYCSYSGYKDELVWGAT 249
           S+VFR+VD KYSRLLLATAK V+QFA++++G+YSDSLSS+VCPFYCSYSGYKDEL+WGA+
Sbjct: 182 SMVFRKVDSKYSRLLLATAKDVMQFAIQYQGAYSDSLSSSVCPFYCSYSGYKDELMWGAS 241

Query: 250 WLLRATNDVQYFNLLKSLGGDDVTDIFSWDNKYAGAHVLLSRRALLNNDKNFDSYKQEAE 309
           WLLRATN+  Y N +KSLGG D  DIFSWDNKYAGA+VLLSRRALLN D NF+ YKQ AE
Sbjct: 242 WLLRATNNPYYANFIKSLGGGDQPDIFSWDNKYAGAYVLLSRRALLNKDSNFEQYKQAAE 301

Query: 310 SFMCRILPNSPYSTTQYTQGGLMFKLPESNLQYVTSITFLLSTYSKYMSAAKHTFNCGSL 369
           +F+C+ILP+SP S+TQYTQGGLM+KLP+SNLQYVTSITFLL+TY+KYM A KHTFNCGS 
Sbjct: 302 NFICKILPDSPSSSTQYTQGGLMYKLPQSNLQYVTSITFLLTTYAKYMKATKHTFNCGSS 361

Query: 370 LVTPASLKNLAKKQVDYILGVNPLKMSYMVGFGKNFPRRIHHRGSSLPSKATHPQAIACD 429
           ++ P +L +L+K+QVDYILG NP+KMSYMVGF  NFP+RIHHR SSLPS A   Q++ C+
Sbjct: 362 VIVPNALISLSKRQVDYILGDNPIKMSYMVGFSSNFPKRIHHRASSLPSHALRSQSLGCN 421

Query: 430 GGFQPFFYSYNPNPNILTGAIVGGPNQNDGFPDDRTDYSHSEPATYINAALVGPLAFFS 489
           GGFQ  FY+ NPNPNILTGAIVGGPNQNDG+PD R DYSH+EPATYINAA VGPLA+F+
Sbjct: 422 GGFQS-FYTQNPNPNILTGAIVGGPNQNDGYPDQRDDYSHAEPATYINAAFVGPLAYFA 479

BLAST of HG10003485 vs. ExPASy Swiss-Prot
Match: Q2V4L8 (Endoglucanase 3 OS=Arabidopsis thaliana OX=3702 GN=CEL5 PE=2 SV=2)

HSP 1 Score: 747.7 bits (1929), Expect = 8.6e-215
Identity = 356/477 (74.63%), Postives = 413/477 (86.58%), Query Frame = 0

Query: 12  FFFFFLFFSCSLVQRARADLNYRDALAKSILFFEGQRSGRLPAGQRITWRSNSGLYDGEL 71
           FFF FL  + SL +   A  NYR+AL+KS+LFF+GQRSGRLP+ Q+++WRS+SGL DG  
Sbjct: 5   FFFVFLLSALSL-ENTYASPNYREALSKSLLFFQGQRSGRLPSDQQLSWRSSSGLSDGSS 64

Query: 72  AHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGTELPNARAAIRWATDYLLK 131
           AHVDLTGGYYDAGDNVKFN PMAFTTTMLSW +LEYG +MG EL N+R AIRWATDYLLK
Sbjct: 65  AHVDLTGGYYDAGDNVKFNFPMAFTTTMLSWSSLEYGKKMGPELQNSRVAIRWATDYLLK 124

Query: 132 CATATPGKLYVGVGDPHVDHKCWERPEDMDTVRTVYSVSAGNPGSDVAGETAAALAAASL 191
           CA ATPGKLYVGVGDP+ DHKCWERPEDMDT RTVYSVS  NPGSDVA ETAAALAA+S+
Sbjct: 125 CARATPGKLYVGVGDPNGDHKCWERPEDMDTPRTVYSVSPSNPGSDVAAETAAALAASSM 184

Query: 192 VFRRVDRKYSRLLLATAKKVLQFALEHRGSYSDSLSSAVCPFYCSYSGYKDELVWGATWL 251
           VFR+VD KYSRLLLATAKKV+QFA+++RG+YS+SLSS+VCPFYCSYSGYKDEL+WGA WL
Sbjct: 185 VFRKVDPKYSRLLLATAKKVMQFAIQYRGAYSNSLSSSVCPFYCSYSGYKDELLWGAAWL 244

Query: 252 LRATNDVQYFNLLKSLGGDDVTDIFSWDNKYAGAHVLLSRRALLNNDKNFDSYKQEAESF 311
            RATND  Y N +KSLGG D  DIFSWDNKYAGA+VLLSRRA+LN D NF+ YKQ AE+F
Sbjct: 245 HRATNDPYYTNFIKSLGGGDQPDIFSWDNKYAGAYVLLSRRAVLNKDNNFELYKQAAENF 304

Query: 312 MCRILPNSPYSTTQYTQGGLMFKLPESNLQYVTSITFLLSTYSKYMSAAKHTFNCGSLLV 371
           MC+ILPNSP S+T+YT+GGLM+KLP+SNLQYVTSITFLL+TY+KYM + K TFNCG+ L+
Sbjct: 305 MCKILPNSPSSSTKYTKGGLMYKLPQSNLQYVTSITFLLTTYAKYMKSTKQTFNCGNSLI 364

Query: 372 TPASLKNLAKKQVDYILGVNPLKMSYMVGFGKNFPRRIHHRGSSLPSKATHPQAIACDGG 431
            P +L NL+K+QVDY+LGVNP+KMSYMVGF  NFP+RIHHRGSSLPS+A    ++ C+GG
Sbjct: 365 VPNALINLSKRQVDYVLGVNPMKMSYMVGFSSNFPKRIHHRGSSLPSRAVRSNSLGCNGG 424

Query: 432 FQPFFYSYNPNPNILTGAIVGGPNQNDGFPDDRTDYSHSEPATYINAALVGPLAFFS 489
           FQ  F + NPNPNILTGAIVGGPNQND +PD R DY+ SEPATYINAA VGPLA+F+
Sbjct: 425 FQS-FRTQNPNPNILTGAIVGGPNQNDEYPDQRDDYTRSEPATYINAAFVGPLAYFA 479

BLAST of HG10003485 vs. ExPASy Swiss-Prot
Match: Q7XTH4 (Endoglucanase 11 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU4 PE=2 SV=3)

HSP 1 Score: 620.5 bits (1599), Expect = 1.6e-176
Identity = 298/459 (64.92%), Postives = 355/459 (77.34%), Query Frame = 0

Query: 32  NYRDALAKSILFFEGQRSGRLPAGQRITWRSNSGLYDGELAHVDLTGGYYDAGDNVKFNL 91
           +Y DALAKSILFF+GQRSGRLP  Q + WRSNSGL DG  A+VDLTGGYYD GDNVKF  
Sbjct: 38  DYADALAKSILFFQGQRSGRLPPDQAVKWRSNSGLSDGSAANVDLTGGYYDGGDNVKFGF 97

Query: 92  PMAFTTTMLSWGALEYGARM-GTELPNARAAIRWATDYLLKCATATPGKLYVGVGDPHVD 151
           PMAFTTTMLSWG +EYG RM G  L +AR A+RWA DYLL+ ATATPG LYVGVGDP  D
Sbjct: 98  PMAFTTTMLSWGVVEYGGRMRGRVLRDARDAVRWAADYLLRAATATPGVLYVGVGDPDAD 157

Query: 152 HKCWERPEDMDTVRTVYSVSAGNPGSDVAGETAAALAAASLVFRRVDRKYSRLLLATAKK 211
           H+CWERPEDMDT R VYSVSA +PGSDVA ETAAALAAASL  R  D  YSR LLA A+ 
Sbjct: 158 HRCWERPEDMDTPRAVYSVSASSPGSDVAAETAAALAAASLALRAADPGYSRRLLAAARD 217

Query: 212 VLQFALEHRGSYSDSLSSAVCPFYCSYSGYKDELVWGATWLLRATNDVQYFNLLKSLGGD 271
           V+ FA+ H+G YSD +   V  +Y SYSGY+DEL+WG+ WLL AT +  Y + L SLG +
Sbjct: 218 VMAFAVRHQGKYSDHVGGDVGAYYASYSGYQDELLWGSAWLLWATRNASYLDYLASLGAN 277

Query: 272 DVTDIFSWDNKYAGAHVLLSRRALLNNDKNFDSYKQEAESFMCRILPNSPYSTTQYTQGG 331
           D  D+FSWDNK AGA VLLSRRAL+N D+  D++++ AE F+CRILP SP STTQYT GG
Sbjct: 278 DGVDMFSWDNKLAGARVLLSRRALVNGDRRLDAFRRLAEDFICRILPGSPSSTTQYTPGG 337

Query: 332 LMFKLPESNLQYVTSITFLLSTYSKYMSAAKHTFNCGSLLVTPASLKNLAKKQVDYILGV 391
           +M+K   +NLQYVTS +FLL+T++KYM+ + HTF+C SL VT  +L+ LA+KQVDYILG 
Sbjct: 338 MMYKSGHANLQYVTSASFLLTTFAKYMAVSNHTFSCQSLPVTAKTLRALARKQVDYILGA 397

Query: 392 NPLKMSYMVGFGKNFPRRIHHRGSSLPSKATHPQAIACDGGFQPFFYSYNPNPNILTGAI 451
           NP  MSYMVG+G  FP+RIHHRG+S+PS A +P  I C  GF  +F +   NPN+ TGA+
Sbjct: 398 NPQGMSYMVGYGARFPQRIHHRGASMPSVAAYPAHIGCQEGFSGYFNAGGANPNVHTGAV 457

Query: 452 VGGPNQNDGFPDDRTDYSHSEPATYINAALVGPLAFFSG 490
           VGGP+Q+D FPD+R DY  SEP TY NAALVG LA+F+G
Sbjct: 458 VGGPDQHDAFPDERGDYDRSEPTTYTNAALVGCLAYFAG 496

BLAST of HG10003485 vs. ExPASy Swiss-Prot
Match: Q9SRX3 (Endoglucanase 1 OS=Arabidopsis thaliana OX=3702 GN=CEL2 PE=2 SV=1)

HSP 1 Score: 599.7 bits (1545), Expect = 2.9e-170
Identity = 288/461 (62.47%), Postives = 356/461 (77.22%), Query Frame = 0

Query: 32  NYRDALAKSILFFEGQRSGRLPAGQRITWRSNSGLYDGELAHVDLTGGYYDAGDNVKFNL 91
           NY+DAL+KSILFFEGQRSG+LP  QR+TWRSNSGL DG   +VDL GGYYDAGDN+KF  
Sbjct: 43  NYKDALSKSILFFEGQRSGKLPPNQRMTWRSNSGLSDGSALNVDLVGGYYDAGDNMKFGF 102

Query: 92  PMAFTTTMLSWGALEYGARMGTELPNARAAIRWATDYLLKCATATPGKLYVGVGDPHVDH 151
           PMAFTTTMLSW  +E+G  M +ELPNA+ AIRWATD+LLK AT+ P  +YV VGDP++DH
Sbjct: 103 PMAFTTTMLSWSLIEFGGLMKSELPNAKDAIRWATDFLLK-ATSHPDTIYVQVGDPNMDH 162

Query: 152 KCWERPEDMDTVRTVYSVSAGNPGSDVAGETAAALAAASLVFRRVDRKYSRLLLATAKKV 211
            CWERPEDMDT R+V+ V   NPGSD+AGE AAALAAAS+VFR+ D  YS  LL  A  V
Sbjct: 163 ACWERPEDMDTPRSVFKVDKNNPGSDIAGEIAAALAAASIVFRKCDPSYSNHLLQRAITV 222

Query: 212 LQFALEHRGSYSDSLSSAVCPFYCSYSGYKDELVWGATWLLRATNDVQYFNLLKS----L 271
             FA ++RG YS  L+  VCPFYCSYSGY+DEL+WGA WL +ATN+  Y N +K+    L
Sbjct: 223 FTFADKYRGPYSAGLAPEVCPFYCSYSGYQDELLWGAAWLQKATNNPTYLNYIKANGQIL 282

Query: 272 GGDDVTDIFSWDNKYAGAHVLLSRRALLNNDKNFDSYKQEAESFMCRILPNSPYSTTQYT 331
           G D+  ++FSWDNK+ GA +LLS+  L+   K+ + YK+ A+SF+C +LP +  S++QYT
Sbjct: 283 GADEFDNMFSWDNKHVGARILLSKEFLIQKVKSLEEYKEHADSFICSVLPGA--SSSQYT 342

Query: 332 QGGLMFKLPESNLQYVTSITFLLSTYSKYMSAAKHTFNCGSLLVTPASLKNLAKKQVDYI 391
            GGL+FK+ ESN+QYVTS +FLL TY+KY+++A+    CG  +VTPA L+++AKKQVDY+
Sbjct: 343 PGGLLFKMGESNMQYVTSTSFLLLTYAKYLTSARTVAYCGGSVVTPARLRSIAKKQVDYL 402

Query: 392 LGVNPLKMSYMVGFGKNFPRRIHHRGSSLPSKATHPQAIACDGGFQPFFYSYNPNPNILT 451
           LG NPLKMSYMVG+G  +PRRIHHRGSSLPS A HP  I C  GF   F S +PNPN L 
Sbjct: 403 LGGNPLKMSYMVGYGLKYPRRIHHRGSSLPSVAVHPTRIQCHDGFS-LFTSQSPNPNDLV 462

Query: 452 GAIVGGPNQNDGFPDDRTDYSHSEPATYINAALVGPLAFFS 489
           GA+VGGP+QND FPD+R+DY  SEPATYINA LVG LA+ +
Sbjct: 463 GAVVGGPDQNDQFPDERSDYGRSEPATYINAPLVGALAYLA 499

BLAST of HG10003485 vs. ExPASy Swiss-Prot
Match: O81416 (Endoglucanase 17 OS=Arabidopsis thaliana OX=3702 GN=At4g02290 PE=2 SV=1)

HSP 1 Score: 586.6 bits (1511), Expect = 2.5e-166
Identity = 286/500 (57.20%), Postives = 364/500 (72.80%), Query Frame = 0

Query: 4   AITNTSTLFFFFFL-----------FFSCSLVQRARADLNYRDALAKSILFFEGQRSGRL 63
           A+  T  L FFFFL            F+    +   A  NY+DAL KSILFFEGQRSG+L
Sbjct: 13  ALRVTIFLSFFFFLCNGFSYPTTSSLFNTHHHRHHLAKHNYKDALTKSILFFEGQRSGKL 72

Query: 64  PAGQRITWRSNSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMG 123
           P+ QR++WR +SGL DG   HVDL GGYYDAGDN+KF  PMAFTTTMLSW  +E+G  M 
Sbjct: 73  PSNQRMSWRRDSGLSDGSALHVDLVGGYYDAGDNIKFGFPMAFTTTMLSWSVIEFGGLMK 132

Query: 124 TELPNARAAIRWATDYLLKCATATPGKLYVGVGDPHVDHKCWERPEDMDTVRTVYSVSAG 183
           +EL NA+ AIRWATDYLLK AT+ P  +YV VGD + DH CWERPEDMDTVR+V+ V   
Sbjct: 133 SELQNAKIAIRWATDYLLK-ATSQPDTIYVQVGDANKDHSCWERPEDMDTVRSVFKVDKN 192

Query: 184 NPGSDVAGETAAALAAASLVFRRVDRKYSRLLLATAKKVLQFALEHRGSYSDSLSSAVCP 243
            PGSDVA ETAAALAAA++VFR+ D  YS++LL  A  V  FA ++RG+YS  L   VCP
Sbjct: 193 IPGSDVAAETAAALAAAAIVFRKSDPSYSKVLLKRAISVFAFADKYRGTYSAGLKPDVCP 252

Query: 244 FYCSYSGYKDELVWGATWLLRATNDVQYFNLLK----SLGGDDVTDIFSWDNKYAGAHVL 303
           FYCSYSGY+DEL+WGA WL +AT +++Y N +K     LG  +  + F WDNK+AGA +L
Sbjct: 253 FYCSYSGYQDELLWGAAWLQKATKNIKYLNYIKINGQILGAAEYDNTFGWDNKHAGARIL 312

Query: 304 LSRRALLNNDKNFDSYKQEAESFMCRILPNSPYSTTQYTQGGLMFKLPESNLQYVTSITF 363
           L++  L+ N K    YK  A++F+C ++P +P+S+TQYT GGL+FK+ ++N+QYVTS +F
Sbjct: 313 LTKAFLVQNVKTLHEYKGHADNFICSVIPGAPFSSTQYTPGGLLFKMADANMQYVTSTSF 372

Query: 364 LLSTYSKYMSAAKHTFNCGSLLVTPASLKNLAKKQVDYILGVNPLKMSYMVGFGKNFPRR 423
           LL TY+KY+++AK   +CG  + TP  L+++AK+QVDY+LG NPL+MSYMVG+G  FPRR
Sbjct: 373 LLLTYAKYLTSAKTVVHCGGSVYTPGRLRSIAKRQVDYLLGDNPLRMSYMVGYGPKFPRR 432

Query: 424 IHHRGSSLPSKATHPQAIACDGGFQPFFYSYNPNPNILTGAIVGGPNQNDGFPDDRTDYS 483
           IHHRGSSLP  A+HP  I C  GF     S +PNPN L GA+VGGP+Q+D FPD+R+DY 
Sbjct: 433 IHHRGSSLPCVASHPAKIQCHQGF-AIMNSQSPNPNFLVGAVVGGPDQHDRFPDERSDYE 492

Query: 484 HSEPATYINAALVGPLAFFS 489
            SEPATYIN+ LVG LA+F+
Sbjct: 493 QSEPATYINSPLVGALAYFA 510

BLAST of HG10003485 vs. ExPASy TrEMBL
Match: A0A0A0KW33 (Endoglucanase OS=Cucumis sativus OX=3659 GN=Csa_5G648680 PE=3 SV=1)

HSP 1 Score: 928.7 bits (2399), Expect = 1.0e-266
Identity = 448/491 (91.24%), Postives = 472/491 (96.13%), Query Frame = 0

Query: 1   MAAAITNTSTLFFFFFLFFSCSLVQRARADLNYRDALAKSILFFEGQRSGRLPAGQRITW 60
           MA+AI+N+STLF  FFL  S S   RA A  NYRDALAKSILFFEGQRSGR+PA QRITW
Sbjct: 1   MASAISNSSTLFLLFFLLLSFSFAGRALAGPNYRDALAKSILFFEGQRSGRIPANQRITW 60

Query: 61  RSNSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGTELPNARA 120
           RSNSGLYDGEL HVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMG+ELPN RA
Sbjct: 61  RSNSGLYDGELDHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRA 120

Query: 121 AIRWATDYLLKCATATPGKLYVGVGDPHVDHKCWERPEDMDTVRTVYSVSAGNPGSDVAG 180
           AIRWATDYLLKCATATPGKLYVGVG+PH DHKCWERPEDMDTVRTVYSVSAGNPGSDVAG
Sbjct: 121 AIRWATDYLLKCATATPGKLYVGVGEPHADHKCWERPEDMDTVRTVYSVSAGNPGSDVAG 180

Query: 181 ETAAALAAASLVFRRVDRKYSRLLLATAKKVLQFALEHRGSYSDSLSSAVCPFYCSYSGY 240
           ETAAALAAASLVFRRVDRKYS++LLATAKKV++FALEHRGSYSDSLSSAVCPFYCSYSGY
Sbjct: 181 ETAAALAAASLVFRRVDRKYSKVLLATAKKVMEFALEHRGSYSDSLSSAVCPFYCSYSGY 240

Query: 241 KDELVWGATWLLRATNDVQYFNLLKSLGGDDVTDIFSWDNKYAGAHVLLSRRALLNNDKN 300
           KDELVWGA WLLRATN+V+YFNLLKSLGGDDVTDIFSWDNK+AGAHVLLSRR+LLNNDKN
Sbjct: 241 KDELVWGAAWLLRATNNVKYFNLLKSLGGDDVTDIFSWDNKFAGAHVLLSRRSLLNNDKN 300

Query: 301 FDSYKQEAESFMCRILPNSPYSTTQYTQGGLMFKLPESNLQYVTSITFLLSTYSKYMSAA 360
           FDSYKQEAE+FMCRILPNSP S+TQYTQG LMFKLPESNLQYVTSITFLL+TYSKYMSAA
Sbjct: 301 FDSYKQEAEAFMCRILPNSPSSSTQYTQGRLMFKLPESNLQYVTSITFLLTTYSKYMSAA 360

Query: 361 KHTFNCGSLLVTPASLKNLAKKQVDYILGVNPLKMSYMVGFGKNFPRRIHHRGSSLPSKA 420
           KHTFNCG+L+VTPASLKNLAK QVDYILGVNPLKMSYMVGFGKN+P+RIHHRGSSLPSKA
Sbjct: 361 KHTFNCGNLVVTPASLKNLAKIQVDYILGVNPLKMSYMVGFGKNYPKRIHHRGSSLPSKA 420

Query: 421 THPQAIACDGGFQPFFYSYNPNPNILTGAIVGGPNQNDGFPDDRTDYSHSEPATYINAAL 480
           THPQAIACDGGFQPFFYSYNPNPNILTGA+VGGPNQ+DGFPDDRTDYSHSEPATYINAAL
Sbjct: 421 THPQAIACDGGFQPFFYSYNPNPNILTGAVVGGPNQSDGFPDDRTDYSHSEPATYINAAL 480

Query: 481 VGPLAFFSGKN 492
           VGPLAFFSGK+
Sbjct: 481 VGPLAFFSGKH 491

BLAST of HG10003485 vs. ExPASy TrEMBL
Match: A0A5A7TDF3 (Endoglucanase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold169G00910 PE=3 SV=1)

HSP 1 Score: 917.9 bits (2371), Expect = 1.8e-263
Identity = 448/494 (90.69%), Postives = 472/494 (95.55%), Query Frame = 0

Query: 1   MAAAITN-TSTLF--FFFFLFFSCSLVQRARADLNYRDALAKSILFFEGQRSGRLPAGQR 60
           MA+ I+N +STL+  FFF L  S S   RARA+ NYRDALAKSILFFEGQRSGR+PA QR
Sbjct: 1   MASPISNSSSTLYSLFFFGLLLSFSFAGRARANPNYRDALAKSILFFEGQRSGRIPANQR 60

Query: 61  ITWRSNSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGTELPN 120
           ITWRSNSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMG+EL N
Sbjct: 61  ITWRSNSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELGN 120

Query: 121 ARAAIRWATDYLLKCATATPGKLYVGVGDPHVDHKCWERPEDMDTVRTVYSVSAGNPGSD 180
            RAAIRWATDYLLKCATATPGKLYVGVGDPH DHKCWERPEDMDTVRTVYSVSAGNPGSD
Sbjct: 121 TRAAIRWATDYLLKCATATPGKLYVGVGDPHADHKCWERPEDMDTVRTVYSVSAGNPGSD 180

Query: 181 VAGETAAALAAASLVFRRVDRKYSRLLLATAKKVLQFALEHRGSYSDSLSSAVCPFYCSY 240
           VAGETAAALAAASLVFRRVDRKYSR+LLATAKKV++FALEHRGSYSDSLSSAVCPFYCSY
Sbjct: 181 VAGETAAALAAASLVFRRVDRKYSRVLLATAKKVMEFALEHRGSYSDSLSSAVCPFYCSY 240

Query: 241 SGYKDELVWGATWLLRATNDVQYFNLLKSLGGDDVTDIFSWDNKYAGAHVLLSRRALLNN 300
           SGYKDELVWGA WLLRATNDV+YFNLLKSLGGDDVTDIFSWDNK+AGAHVLLSRR+LLNN
Sbjct: 241 SGYKDELVWGAAWLLRATNDVKYFNLLKSLGGDDVTDIFSWDNKFAGAHVLLSRRSLLNN 300

Query: 301 DKNFDSYKQEAESFMCRILPNSPYSTTQYTQGGLMFKLPESNLQYVTSITFLLSTYSKYM 360
           DKNFD YKQEAE+FMCRILPNSP S+T+YTQG LMFKLPESNLQYVTSITFLL+TYSKYM
Sbjct: 301 DKNFDLYKQEAEAFMCRILPNSPSSSTKYTQGRLMFKLPESNLQYVTSITFLLTTYSKYM 360

Query: 361 SAAKHTFNCGSLLVTPASLKNLAKKQVDYILGVNPLKMSYMVGFGKNFPRRIHHRGSSLP 420
           SAAKHTFNCG+L+VTPASLKNLAK QVDYILGVNPLKMSYMVG+GKNFP+RIHHRGSSLP
Sbjct: 361 SAAKHTFNCGNLVVTPASLKNLAKIQVDYILGVNPLKMSYMVGYGKNFPKRIHHRGSSLP 420

Query: 421 SKATHPQAIACDGGFQPFFYSYNPNPNILTGAIVGGPNQNDGFPDDRTDYSHSEPATYIN 480
           SKATHPQAIACDGGFQPFFYSYNPNPNIL GA+VGGPNQ+DGFPDDRTDYSHSEPATYIN
Sbjct: 421 SKATHPQAIACDGGFQPFFYSYNPNPNILIGAVVGGPNQSDGFPDDRTDYSHSEPATYIN 480

Query: 481 AALVGPLAFFSGKN 492
           AALVGPLAFFSGK+
Sbjct: 481 AALVGPLAFFSGKH 494

BLAST of HG10003485 vs. ExPASy TrEMBL
Match: A0A6J1F1W4 (Endoglucanase OS=Cucurbita moschata OX=3662 GN=LOC111438904 PE=3 SV=1)

HSP 1 Score: 917.1 bits (2369), Expect = 3.0e-263
Identity = 444/488 (90.98%), Postives = 463/488 (94.88%), Query Frame = 0

Query: 3   AAITNTSTLFFFFFLFFSCSLVQRARADLNYRDALAKSILFFEGQRSGRLPAGQRITWRS 62
           AA TN  T FFFF L  S      ARA+ NYRDALAKS+LFF+GQRSGR+P GQ+I WRS
Sbjct: 2   AATTNAPT-FFFFLLLSSSFSFYTARANPNYRDALAKSLLFFQGQRSGRIPNGQQIAWRS 61

Query: 63  NSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGTELPNARAAI 122
           NSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMG+ELPN RAAI
Sbjct: 62  NSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRAAI 121

Query: 123 RWATDYLLKCATATPGKLYVGVGDPHVDHKCWERPEDMDTVRTVYSVSAGNPGSDVAGET 182
           RWATDYLLKCATATPGKLYVGVGDP+VDHKCWERPEDMDTVRTVYSVSA NPGSDVAGET
Sbjct: 122 RWATDYLLKCATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVYSVSAANPGSDVAGET 181

Query: 183 AAALAAASLVFRRVDRKYSRLLLATAKKVLQFALEHRGSYSDSLSSAVCPFYCSYSGYKD 242
           AAALAAASLVFRRVDRKYS LLLATAKKV QFA+EHRGSYSDSL SAVCPFYCSYSGYKD
Sbjct: 182 AAALAAASLVFRRVDRKYSGLLLATAKKVFQFAVEHRGSYSDSLGSAVCPFYCSYSGYKD 241

Query: 243 ELVWGATWLLRATNDVQYFNLLKSLGGDDVTDIFSWDNKYAGAHVLLSRRALLNNDKNFD 302
           ELVWGATWLLRATNDV+YFNLLKSLGGDDV DIFSWDNKYAGAHVLLSRRALLNNDKNFD
Sbjct: 242 ELVWGATWLLRATNDVRYFNLLKSLGGDDVPDIFSWDNKYAGAHVLLSRRALLNNDKNFD 301

Query: 303 SYKQEAESFMCRILPNSPYSTTQYTQGGLMFKLPESNLQYVTSITFLLSTYSKYMSAAKH 362
           SYKQ+AESFMCRILPNSPYS+TQYTQGGLMFKLP+SNLQYVTSITFLL+TYSKYMSAAKH
Sbjct: 302 SYKQKAESFMCRILPNSPYSSTQYTQGGLMFKLPQSNLQYVTSITFLLTTYSKYMSAAKH 361

Query: 363 TFNCGSLLVTPASLKNLAKKQVDYILGVNPLKMSYMVGFGKNFPRRIHHRGSSLPSKATH 422
           TFNCG +LVTP SLKNLAK+QVDYILGVNPLKMSYMVGFGKNFP+RIHHRGSSLPSKA+H
Sbjct: 362 TFNCGGVLVTPTSLKNLAKQQVDYILGVNPLKMSYMVGFGKNFPKRIHHRGSSLPSKASH 421

Query: 423 PQAIACDGGFQPFFYSYNPNPNILTGAIVGGPNQNDGFPDDRTDYSHSEPATYINAALVG 482
           PQAI CDGGFQPFFYSYNPNPNILTGA+VGGPNQNDGFPDDR+DYSHSEPATYINAALVG
Sbjct: 422 PQAIGCDGGFQPFFYSYNPNPNILTGAVVGGPNQNDGFPDDRSDYSHSEPATYINAALVG 481

Query: 483 PLAFFSGK 491
           PLAFFSGK
Sbjct: 482 PLAFFSGK 488

BLAST of HG10003485 vs. ExPASy TrEMBL
Match: A0A6J1ICX3 (Endoglucanase OS=Cucurbita maxima OX=3661 GN=LOC111471942 PE=3 SV=1)

HSP 1 Score: 916.8 bits (2368), Expect = 3.9e-263
Identity = 442/488 (90.57%), Postives = 460/488 (94.26%), Query Frame = 0

Query: 3   AAITNTSTLFFFFFLFFSCSLVQRARADLNYRDALAKSILFFEGQRSGRLPAGQRITWRS 62
           A  TN ST FFF FL  S       R + NYRDALAKS+LFF+GQRSGR+P GQ+I WRS
Sbjct: 2   ATTTNVSTFFFFLFLLSSSFSFNITRGNPNYRDALAKSLLFFQGQRSGRIPNGQQIAWRS 61

Query: 63  NSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGTELPNARAAI 122
           NSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMG+ELPN RAAI
Sbjct: 62  NSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGSELPNTRAAI 121

Query: 123 RWATDYLLKCATATPGKLYVGVGDPHVDHKCWERPEDMDTVRTVYSVSAGNPGSDVAGET 182
           RWATDYLLKCATATPGKLYVGVGDP+VDHKCWERPEDMDTVRTVYSVSA NPGSDVAGET
Sbjct: 122 RWATDYLLKCATATPGKLYVGVGDPNVDHKCWERPEDMDTVRTVYSVSAANPGSDVAGET 181

Query: 183 AAALAAASLVFRRVDRKYSRLLLATAKKVLQFALEHRGSYSDSLSSAVCPFYCSYSGYKD 242
           AAALAAASLVFRRVDRKYS LLLATAKKV QFA+EHRGSYSDSL SAVCPFYCSYSGYKD
Sbjct: 182 AAALAAASLVFRRVDRKYSGLLLATAKKVFQFAVEHRGSYSDSLGSAVCPFYCSYSGYKD 241

Query: 243 ELVWGATWLLRATNDVQYFNLLKSLGGDDVTDIFSWDNKYAGAHVLLSRRALLNNDKNFD 302
           ELVWGATWLLRATNDVQY NLLKSLGGDDV DIFSWDNKYAGAHVLLSRRALLNNDKNFD
Sbjct: 242 ELVWGATWLLRATNDVQYINLLKSLGGDDVPDIFSWDNKYAGAHVLLSRRALLNNDKNFD 301

Query: 303 SYKQEAESFMCRILPNSPYSTTQYTQGGLMFKLPESNLQYVTSITFLLSTYSKYMSAAKH 362
           SYKQ AESFMCRILPNSPYS+TQYTQGGLMFKLP+SNLQYVTSITFLL+TYSKYMSAAKH
Sbjct: 302 SYKQTAESFMCRILPNSPYSSTQYTQGGLMFKLPQSNLQYVTSITFLLTTYSKYMSAAKH 361

Query: 363 TFNCGSLLVTPASLKNLAKKQVDYILGVNPLKMSYMVGFGKNFPRRIHHRGSSLPSKATH 422
           TFNCG LLVTPASLKNLAK+QVDYILGVNPLKMSYMVGFG+NFP+RIHHRGSSLPSKA+H
Sbjct: 362 TFNCGGLLVTPASLKNLAKQQVDYILGVNPLKMSYMVGFGRNFPKRIHHRGSSLPSKASH 421

Query: 423 PQAIACDGGFQPFFYSYNPNPNILTGAIVGGPNQNDGFPDDRTDYSHSEPATYINAALVG 482
           PQAI CDGGFQPFFYS+NPNPNILTGA+VGGPNQNDGFPDDR+DYSHSEPATYINAALVG
Sbjct: 422 PQAIGCDGGFQPFFYSFNPNPNILTGAVVGGPNQNDGFPDDRSDYSHSEPATYINAALVG 481

Query: 483 PLAFFSGK 491
           PLAFFSGK
Sbjct: 482 PLAFFSGK 489

BLAST of HG10003485 vs. ExPASy TrEMBL
Match: A0A6J1CKC4 (Endoglucanase OS=Momordica charantia OX=3673 GN=LOC111012089 PE=3 SV=1)

HSP 1 Score: 893.3 bits (2307), Expect = 4.7e-256
Identity = 431/490 (87.96%), Postives = 459/490 (93.67%), Query Frame = 0

Query: 3   AAITNTSTLFFFF-FLFFSCSLVQRARADLNYRDALAKSILFFEGQRSGRLPAGQRITWR 62
           AA    STLFF F  L F  S +  AR D NYRDALAKSILFF+GQRSGR+PAG +I+WR
Sbjct: 2   AATPKASTLFFLFSLLLFPFSWMDGARGDPNYRDALAKSILFFQGQRSGRIPAGLQISWR 61

Query: 63  SNSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGTELPNARAA 122
           SNSGLYDGELAHVDLTGGYYDAGDNVKFN PMAFTTTMLSWGALE+GARMGT+L N RAA
Sbjct: 62  SNSGLYDGELAHVDLTGGYYDAGDNVKFNFPMAFTTTMLSWGALEHGARMGTQLANTRAA 121

Query: 123 IRWATDYLLKCATATPGKLYVGVGDPHVDHKCWERPEDMDTVRTVYSVSAGNPGSDVAGE 182
           IRWATDYLLKCATATPGK+YVGVGDP+VDH+CWERPEDMDTVRTVYSVSA NPGSDVAGE
Sbjct: 122 IRWATDYLLKCATATPGKVYVGVGDPNVDHRCWERPEDMDTVRTVYSVSAANPGSDVAGE 181

Query: 183 TAAALAAASLVFRRVDRKYSRLLLATAKKVLQFALEHRGSYSDSLSSAVCPFYCSYSGYK 242
           TAAALAAAS+VFR+VDRKYS LLLATAKKVLQFA++++GSYSDSL SAVCPFYCSYSGYK
Sbjct: 182 TAAALAAASMVFRKVDRKYSNLLLATAKKVLQFAVQYKGSYSDSLGSAVCPFYCSYSGYK 241

Query: 243 DELVWGATWLLRATNDVQYFNLLKSLGGDDVTDIFSWDNKYAGAHVLLSRRALLNNDKNF 302
           DELVWGA WLLRATNDVQYFNLLKSLGGDDVTDIFSWDNKYAGAHVLLSRRALLN DKNF
Sbjct: 242 DELVWGAAWLLRATNDVQYFNLLKSLGGDDVTDIFSWDNKYAGAHVLLSRRALLNKDKNF 301

Query: 303 DSYKQEAESFMCRILPNSPYSTTQYTQGGLMFKLPESNLQYVTSITFLLSTYSKYMSAAK 362
           DSYKQEAE+FMCRILPNSPYS+T YTQGGLMFKLPESNLQYVTSITFLL+TYSKYMSAAK
Sbjct: 302 DSYKQEAEAFMCRILPNSPYSSTHYTQGGLMFKLPESNLQYVTSITFLLATYSKYMSAAK 361

Query: 363 HTFNCGSLLVTPASLKNLAKKQVDYILGVNPLKMSYMVGFGKNFPRRIHHRGSSLPSKAT 422
           H+FNCGSLLVTPASLKNLAKKQVDYILG NPLKMSYMVG+G +FPRRIHHRGSSLPSKA+
Sbjct: 362 HSFNCGSLLVTPASLKNLAKKQVDYILGENPLKMSYMVGYGPHFPRRIHHRGSSLPSKAS 421

Query: 423 HPQAIACDGGFQPFFYSYNPNPNILTGAIVGGPNQNDGFPDDRTDYSHSEPATYINAALV 482
           HPQ I CDGGFQPFFYSYNPNPN+L GA+VGGPNQ+DGF DDR+DYSHSEPATYINAALV
Sbjct: 422 HPQTIGCDGGFQPFFYSYNPNPNLLIGAVVGGPNQSDGFSDDRSDYSHSEPATYINAALV 481

Query: 483 GPLAFFSGKN 492
           GPLAFFSGK+
Sbjct: 482 GPLAFFSGKS 491

BLAST of HG10003485 vs. TAIR 10
Match: AT1G71380.1 (cellulase 3 )

HSP 1 Score: 768.8 bits (1984), Expect = 2.6e-222
Identity = 366/479 (76.41%), Postives = 418/479 (87.27%), Query Frame = 0

Query: 10  TLFFFFFLFFSCSLVQRARADLNYRDALAKSILFFEGQRSGRLPAGQRITWRSNSGLYDG 69
           T  FFF L FS  L+    A+ NY++AL+KS+LFF+GQRSG LP GQ+I+WR++SGL DG
Sbjct: 2   TSLFFFVLLFSSLLISNGDANPNYKEALSKSLLFFQGQRSGPLPRGQQISWRASSGLSDG 61

Query: 70  ELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGTELPNARAAIRWATDYL 129
             AHVDLTGGYYDAGDNVKFNLPMAFTTTMLSW ALEYG RMG EL NAR  IRWATDYL
Sbjct: 62  SAAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWSALEYGKRMGPELENARVNIRWATDYL 121

Query: 130 LKCATATPGKLYVGVGDPHVDHKCWERPEDMDTVRTVYSVSAGNPGSDVAGETAAALAAA 189
           LKCA ATPGKLYVGVGDP+VDHKCWERPEDMDT RTVYSVSA NPGSDVA ETAAALAAA
Sbjct: 122 LKCARATPGKLYVGVGDPNVDHKCWERPEDMDTPRTVYSVSASNPGSDVAAETAAALAAA 181

Query: 190 SLVFRRVDRKYSRLLLATAKKVLQFALEHRGSYSDSLSSAVCPFYCSYSGYKDELVWGAT 249
           S+VFR+VD KYSRLLLATAK V+QFA++++G+YSDSLSS+VCPFYCSYSGYKDEL+WGA+
Sbjct: 182 SMVFRKVDSKYSRLLLATAKDVMQFAIQYQGAYSDSLSSSVCPFYCSYSGYKDELMWGAS 241

Query: 250 WLLRATNDVQYFNLLKSLGGDDVTDIFSWDNKYAGAHVLLSRRALLNNDKNFDSYKQEAE 309
           WLLRATN+  Y N +KSLGG D  DIFSWDNKYAGA+VLLSRRALLN D NF+ YKQ AE
Sbjct: 242 WLLRATNNPYYANFIKSLGGGDQPDIFSWDNKYAGAYVLLSRRALLNKDSNFEQYKQAAE 301

Query: 310 SFMCRILPNSPYSTTQYTQGGLMFKLPESNLQYVTSITFLLSTYSKYMSAAKHTFNCGSL 369
           +F+C+ILP+SP S+TQYTQGGLM+KLP+SNLQYVTSITFLL+TY+KYM A KHTFNCGS 
Sbjct: 302 NFICKILPDSPSSSTQYTQGGLMYKLPQSNLQYVTSITFLLTTYAKYMKATKHTFNCGSS 361

Query: 370 LVTPASLKNLAKKQVDYILGVNPLKMSYMVGFGKNFPRRIHHRGSSLPSKATHPQAIACD 429
           ++ P +L +L+K+QVDYILG NP+KMSYMVGF  NFP+RIHHR SSLPS A   Q++ C+
Sbjct: 362 VIVPNALISLSKRQVDYILGDNPIKMSYMVGFSSNFPKRIHHRASSLPSHALRSQSLGCN 421

Query: 430 GGFQPFFYSYNPNPNILTGAIVGGPNQNDGFPDDRTDYSHSEPATYINAALVGPLAFFS 489
           GGFQ  FY+ NPNPNILTGAIVGGPNQNDG+PD R DYSH+EPATYINAA VGPLA+F+
Sbjct: 422 GGFQS-FYTQNPNPNILTGAIVGGPNQNDGYPDQRDDYSHAEPATYINAAFVGPLAYFA 479

BLAST of HG10003485 vs. TAIR 10
Match: AT1G22880.1 (cellulase 5 )

HSP 1 Score: 747.7 bits (1929), Expect = 6.1e-216
Identity = 356/477 (74.63%), Postives = 413/477 (86.58%), Query Frame = 0

Query: 12  FFFFFLFFSCSLVQRARADLNYRDALAKSILFFEGQRSGRLPAGQRITWRSNSGLYDGEL 71
           FFF FL  + SL +   A  NYR+AL+KS+LFF+GQRSGRLP+ Q+++WRS+SGL DG  
Sbjct: 5   FFFVFLLSALSL-ENTYASPNYREALSKSLLFFQGQRSGRLPSDQQLSWRSSSGLSDGSS 64

Query: 72  AHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMGTELPNARAAIRWATDYLLK 131
           AHVDLTGGYYDAGDNVKFN PMAFTTTMLSW +LEYG +MG EL N+R AIRWATDYLLK
Sbjct: 65  AHVDLTGGYYDAGDNVKFNFPMAFTTTMLSWSSLEYGKKMGPELQNSRVAIRWATDYLLK 124

Query: 132 CATATPGKLYVGVGDPHVDHKCWERPEDMDTVRTVYSVSAGNPGSDVAGETAAALAAASL 191
           CA ATPGKLYVGVGDP+ DHKCWERPEDMDT RTVYSVS  NPGSDVA ETAAALAA+S+
Sbjct: 125 CARATPGKLYVGVGDPNGDHKCWERPEDMDTPRTVYSVSPSNPGSDVAAETAAALAASSM 184

Query: 192 VFRRVDRKYSRLLLATAKKVLQFALEHRGSYSDSLSSAVCPFYCSYSGYKDELVWGATWL 251
           VFR+VD KYSRLLLATAKKV+QFA+++RG+YS+SLSS+VCPFYCSYSGYKDEL+WGA WL
Sbjct: 185 VFRKVDPKYSRLLLATAKKVMQFAIQYRGAYSNSLSSSVCPFYCSYSGYKDELLWGAAWL 244

Query: 252 LRATNDVQYFNLLKSLGGDDVTDIFSWDNKYAGAHVLLSRRALLNNDKNFDSYKQEAESF 311
            RATND  Y N +KSLGG D  DIFSWDNKYAGA+VLLSRRA+LN D NF+ YKQ AE+F
Sbjct: 245 HRATNDPYYTNFIKSLGGGDQPDIFSWDNKYAGAYVLLSRRAVLNKDNNFELYKQAAENF 304

Query: 312 MCRILPNSPYSTTQYTQGGLMFKLPESNLQYVTSITFLLSTYSKYMSAAKHTFNCGSLLV 371
           MC+ILPNSP S+T+YT+GGLM+KLP+SNLQYVTSITFLL+TY+KYM + K TFNCG+ L+
Sbjct: 305 MCKILPNSPSSSTKYTKGGLMYKLPQSNLQYVTSITFLLTTYAKYMKSTKQTFNCGNSLI 364

Query: 372 TPASLKNLAKKQVDYILGVNPLKMSYMVGFGKNFPRRIHHRGSSLPSKATHPQAIACDGG 431
            P +L NL+K+QVDY+LGVNP+KMSYMVGF  NFP+RIHHRGSSLPS+A    ++ C+GG
Sbjct: 365 VPNALINLSKRQVDYVLGVNPMKMSYMVGFSSNFPKRIHHRGSSLPSRAVRSNSLGCNGG 424

Query: 432 FQPFFYSYNPNPNILTGAIVGGPNQNDGFPDDRTDYSHSEPATYINAALVGPLAFFS 489
           FQ  F + NPNPNILTGAIVGGPNQND +PD R DY+ SEPATYINAA VGPLA+F+
Sbjct: 425 FQS-FRTQNPNPNILTGAIVGGPNQNDEYPDQRDDYTRSEPATYINAAFVGPLAYFA 479

BLAST of HG10003485 vs. TAIR 10
Match: AT1G22880.2 (cellulase 5 )

HSP 1 Score: 638.6 bits (1646), Expect = 4.0e-183
Identity = 301/396 (76.01%), Postives = 347/396 (87.63%), Query Frame = 0

Query: 93  MAFTTTMLSWGALEYGARMGTELPNARAAIRWATDYLLKCATATPGKLYVGVGDPHVDHK 152
           MAFTTTMLSW +LEYG +MG EL N+R AIRWATDYLLKCA ATPGKLYVGVGDP+ DHK
Sbjct: 1   MAFTTTMLSWSSLEYGKKMGPELQNSRVAIRWATDYLLKCARATPGKLYVGVGDPNGDHK 60

Query: 153 CWERPEDMDTVRTVYSVSAGNPGSDVAGETAAALAAASLVFRRVDRKYSRLLLATAKKVL 212
           CWERPEDMDT RTVYSVS  NPGSDVA ETAAALAA+S+VFR+VD KYSRLLLATAKKV+
Sbjct: 61  CWERPEDMDTPRTVYSVSPSNPGSDVAAETAAALAASSMVFRKVDPKYSRLLLATAKKVM 120

Query: 213 QFALEHRGSYSDSLSSAVCPFYCSYSGYKDELVWGATWLLRATNDVQYFNLLKSLGGDDV 272
           QFA+++RG+YS+SLSS+VCPFYCSYSGYKDEL+WGA WL RATND  Y N +KSLGG D 
Sbjct: 121 QFAIQYRGAYSNSLSSSVCPFYCSYSGYKDELLWGAAWLHRATNDPYYTNFIKSLGGGDQ 180

Query: 273 TDIFSWDNKYAGAHVLLSRRALLNNDKNFDSYKQEAESFMCRILPNSPYSTTQYTQGGLM 332
            DIFSWDNKYAGA+VLLSRRA+LN D NF+ YKQ AE+FMC+ILPNSP S+T+YT+GGLM
Sbjct: 181 PDIFSWDNKYAGAYVLLSRRAVLNKDNNFELYKQAAENFMCKILPNSPSSSTKYTKGGLM 240

Query: 333 FKLPESNLQYVTSITFLLSTYSKYMSAAKHTFNCGSLLVTPASLKNLAKKQVDYILGVNP 392
           +KLP+SNLQYVTSITFLL+TY+KYM + K TFNCG+ L+ P +L NL+K+QVDY+LGVNP
Sbjct: 241 YKLPQSNLQYVTSITFLLTTYAKYMKSTKQTFNCGNSLIVPNALINLSKRQVDYVLGVNP 300

Query: 393 LKMSYMVGFGKNFPRRIHHRGSSLPSKATHPQAIACDGGFQPFFYSYNPNPNILTGAIVG 452
           +KMSYMVGF  NFP+RIHHRGSSLPS+A    ++ C+GGFQ  F + NPNPNILTGAIVG
Sbjct: 301 MKMSYMVGFSSNFPKRIHHRGSSLPSRAVRSNSLGCNGGFQS-FRTQNPNPNILTGAIVG 360

Query: 453 GPNQNDGFPDDRTDYSHSEPATYINAALVGPLAFFS 489
           GPNQND +PD R DY+ SEPATYINAA VGPLA+F+
Sbjct: 361 GPNQNDEYPDQRDDYTRSEPATYINAAFVGPLAYFA 395

BLAST of HG10003485 vs. TAIR 10
Match: AT1G02800.1 (cellulase 2 )

HSP 1 Score: 599.7 bits (1545), Expect = 2.0e-171
Identity = 288/461 (62.47%), Postives = 356/461 (77.22%), Query Frame = 0

Query: 32  NYRDALAKSILFFEGQRSGRLPAGQRITWRSNSGLYDGELAHVDLTGGYYDAGDNVKFNL 91
           NY+DAL+KSILFFEGQRSG+LP  QR+TWRSNSGL DG   +VDL GGYYDAGDN+KF  
Sbjct: 43  NYKDALSKSILFFEGQRSGKLPPNQRMTWRSNSGLSDGSALNVDLVGGYYDAGDNMKFGF 102

Query: 92  PMAFTTTMLSWGALEYGARMGTELPNARAAIRWATDYLLKCATATPGKLYVGVGDPHVDH 151
           PMAFTTTMLSW  +E+G  M +ELPNA+ AIRWATD+LLK AT+ P  +YV VGDP++DH
Sbjct: 103 PMAFTTTMLSWSLIEFGGLMKSELPNAKDAIRWATDFLLK-ATSHPDTIYVQVGDPNMDH 162

Query: 152 KCWERPEDMDTVRTVYSVSAGNPGSDVAGETAAALAAASLVFRRVDRKYSRLLLATAKKV 211
            CWERPEDMDT R+V+ V   NPGSD+AGE AAALAAAS+VFR+ D  YS  LL  A  V
Sbjct: 163 ACWERPEDMDTPRSVFKVDKNNPGSDIAGEIAAALAAASIVFRKCDPSYSNHLLQRAITV 222

Query: 212 LQFALEHRGSYSDSLSSAVCPFYCSYSGYKDELVWGATWLLRATNDVQYFNLLKS----L 271
             FA ++RG YS  L+  VCPFYCSYSGY+DEL+WGA WL +ATN+  Y N +K+    L
Sbjct: 223 FTFADKYRGPYSAGLAPEVCPFYCSYSGYQDELLWGAAWLQKATNNPTYLNYIKANGQIL 282

Query: 272 GGDDVTDIFSWDNKYAGAHVLLSRRALLNNDKNFDSYKQEAESFMCRILPNSPYSTTQYT 331
           G D+  ++FSWDNK+ GA +LLS+  L+   K+ + YK+ A+SF+C +LP +  S++QYT
Sbjct: 283 GADEFDNMFSWDNKHVGARILLSKEFLIQKVKSLEEYKEHADSFICSVLPGA--SSSQYT 342

Query: 332 QGGLMFKLPESNLQYVTSITFLLSTYSKYMSAAKHTFNCGSLLVTPASLKNLAKKQVDYI 391
            GGL+FK+ ESN+QYVTS +FLL TY+KY+++A+    CG  +VTPA L+++AKKQVDY+
Sbjct: 343 PGGLLFKMGESNMQYVTSTSFLLLTYAKYLTSARTVAYCGGSVVTPARLRSIAKKQVDYL 402

Query: 392 LGVNPLKMSYMVGFGKNFPRRIHHRGSSLPSKATHPQAIACDGGFQPFFYSYNPNPNILT 451
           LG NPLKMSYMVG+G  +PRRIHHRGSSLPS A HP  I C  GF   F S +PNPN L 
Sbjct: 403 LGGNPLKMSYMVGYGLKYPRRIHHRGSSLPSVAVHPTRIQCHDGFS-LFTSQSPNPNDLV 462

Query: 452 GAIVGGPNQNDGFPDDRTDYSHSEPATYINAALVGPLAFFS 489
           GA+VGGP+QND FPD+R+DY  SEPATYINA LVG LA+ +
Sbjct: 463 GAVVGGPDQNDQFPDERSDYGRSEPATYINAPLVGALAYLA 499

BLAST of HG10003485 vs. TAIR 10
Match: AT4G02290.1 (glycosyl hydrolase 9B13 )

HSP 1 Score: 586.6 bits (1511), Expect = 1.8e-167
Identity = 286/500 (57.20%), Postives = 364/500 (72.80%), Query Frame = 0

Query: 4   AITNTSTLFFFFFL-----------FFSCSLVQRARADLNYRDALAKSILFFEGQRSGRL 63
           A+  T  L FFFFL            F+    +   A  NY+DAL KSILFFEGQRSG+L
Sbjct: 13  ALRVTIFLSFFFFLCNGFSYPTTSSLFNTHHHRHHLAKHNYKDALTKSILFFEGQRSGKL 72

Query: 64  PAGQRITWRSNSGLYDGELAHVDLTGGYYDAGDNVKFNLPMAFTTTMLSWGALEYGARMG 123
           P+ QR++WR +SGL DG   HVDL GGYYDAGDN+KF  PMAFTTTMLSW  +E+G  M 
Sbjct: 73  PSNQRMSWRRDSGLSDGSALHVDLVGGYYDAGDNIKFGFPMAFTTTMLSWSVIEFGGLMK 132

Query: 124 TELPNARAAIRWATDYLLKCATATPGKLYVGVGDPHVDHKCWERPEDMDTVRTVYSVSAG 183
           +EL NA+ AIRWATDYLLK AT+ P  +YV VGD + DH CWERPEDMDTVR+V+ V   
Sbjct: 133 SELQNAKIAIRWATDYLLK-ATSQPDTIYVQVGDANKDHSCWERPEDMDTVRSVFKVDKN 192

Query: 184 NPGSDVAGETAAALAAASLVFRRVDRKYSRLLLATAKKVLQFALEHRGSYSDSLSSAVCP 243
            PGSDVA ETAAALAAA++VFR+ D  YS++LL  A  V  FA ++RG+YS  L   VCP
Sbjct: 193 IPGSDVAAETAAALAAAAIVFRKSDPSYSKVLLKRAISVFAFADKYRGTYSAGLKPDVCP 252

Query: 244 FYCSYSGYKDELVWGATWLLRATNDVQYFNLLK----SLGGDDVTDIFSWDNKYAGAHVL 303
           FYCSYSGY+DEL+WGA WL +AT +++Y N +K     LG  +  + F WDNK+AGA +L
Sbjct: 253 FYCSYSGYQDELLWGAAWLQKATKNIKYLNYIKINGQILGAAEYDNTFGWDNKHAGARIL 312

Query: 304 LSRRALLNNDKNFDSYKQEAESFMCRILPNSPYSTTQYTQGGLMFKLPESNLQYVTSITF 363
           L++  L+ N K    YK  A++F+C ++P +P+S+TQYT GGL+FK+ ++N+QYVTS +F
Sbjct: 313 LTKAFLVQNVKTLHEYKGHADNFICSVIPGAPFSSTQYTPGGLLFKMADANMQYVTSTSF 372

Query: 364 LLSTYSKYMSAAKHTFNCGSLLVTPASLKNLAKKQVDYILGVNPLKMSYMVGFGKNFPRR 423
           LL TY+KY+++AK   +CG  + TP  L+++AK+QVDY+LG NPL+MSYMVG+G  FPRR
Sbjct: 373 LLLTYAKYLTSAKTVVHCGGSVYTPGRLRSIAKRQVDYLLGDNPLRMSYMVGYGPKFPRR 432

Query: 424 IHHRGSSLPSKATHPQAIACDGGFQPFFYSYNPNPNILTGAIVGGPNQNDGFPDDRTDYS 483
           IHHRGSSLP  A+HP  I C  GF     S +PNPN L GA+VGGP+Q+D FPD+R+DY 
Sbjct: 433 IHHRGSSLPCVASHPAKIQCHQGF-AIMNSQSPNPNFLVGAVVGGPDQHDRFPDERSDYE 492

Query: 484 HSEPATYINAALVGPLAFFS 489
            SEPATYIN+ LVG LA+F+
Sbjct: 493 QSEPATYINSPLVGALAYFA 510

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038890642.11.5e-27795.51endoglucanase 9-like [Benincasa hispida][more]
XP_004141534.12.1e-26691.24endoglucanase 9 [Cucumis sativus] >KGN52647.1 hypothetical protein Csa_008478 [C... [more]
KAG6607530.11.6e-26390.78Endoglucanase 9, partial [Cucurbita argyrosperma subsp. sororia][more]
KAG7037172.12.8e-26390.80Endoglucanase 9, partial [Cucurbita argyrosperma subsp. argyrosperma][more]
XP_023523831.12.8e-26390.78endoglucanase 9-like [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
Q9C9H53.6e-22176.41Endoglucanase 9 OS=Arabidopsis thaliana OX=3702 GN=CEL3 PE=1 SV=1[more]
Q2V4L88.6e-21574.63Endoglucanase 3 OS=Arabidopsis thaliana OX=3702 GN=CEL5 PE=2 SV=2[more]
Q7XTH41.6e-17664.92Endoglucanase 11 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU4 PE=2 SV=3[more]
Q9SRX32.9e-17062.47Endoglucanase 1 OS=Arabidopsis thaliana OX=3702 GN=CEL2 PE=2 SV=1[more]
O814162.5e-16657.20Endoglucanase 17 OS=Arabidopsis thaliana OX=3702 GN=At4g02290 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KW331.0e-26691.24Endoglucanase OS=Cucumis sativus OX=3659 GN=Csa_5G648680 PE=3 SV=1[more]
A0A5A7TDF31.8e-26390.69Endoglucanase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold169G00910 ... [more]
A0A6J1F1W43.0e-26390.98Endoglucanase OS=Cucurbita moschata OX=3662 GN=LOC111438904 PE=3 SV=1[more]
A0A6J1ICX33.9e-26390.57Endoglucanase OS=Cucurbita maxima OX=3661 GN=LOC111471942 PE=3 SV=1[more]
A0A6J1CKC44.7e-25687.96Endoglucanase OS=Momordica charantia OX=3673 GN=LOC111012089 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT1G71380.12.6e-22276.41cellulase 3 [more]
AT1G22880.16.1e-21674.63cellulase 5 [more]
AT1G22880.24.0e-18376.01cellulase 5 [more]
AT1G02800.12.0e-17162.47cellulase 2 [more]
AT4G02290.11.8e-16757.20glycosyl hydrolase 9B13 [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR012341Six-hairpin glycosidase-like superfamilyGENE3D1.50.10.10coord: 30..489
e-value: 9.2E-161
score: 537.6
IPR001701Glycoside hydrolase family 9PFAMPF00759Glyco_hydro_9coord: 33..482
e-value: 2.1E-139
score: 465.8
NoneNo IPR availablePANTHERPTHR22298:SF54ENDOGLUCANASE 9coord: 16..489
NoneNo IPR availablePANTHERPTHR22298ENDO-1,4-BETA-GLUCANASEcoord: 16..489
IPR033126Glycosyl hydrolases family 9, Asp/Glu active sitesPROSITEPS00698GH9_3coord: 460..478
IPR018221Glycoside hydrolase family 9, His active sitePROSITEPS00592GH9_2coord: 386..412
IPR008928Six-hairpin glycosidase superfamilySUPERFAMILY48208Six-hairpin glycosidasescoord: 15..488

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10003485.1HG10003485.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0030245 cellulose catabolic process
biological_process GO:0005975 carbohydrate metabolic process
molecular_function GO:0008810 cellulase activity
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds