HG10023079 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10023079
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionEndoglucanase
LocationChr05: 31020799 .. 31025590 (-)
RNA-Seq ExpressionHG10023079
SyntenyHG10023079
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGCGTGGTTTCTTATTGGCTTTGGTCGCCGCCGTGCTCTATTTTGAAGCCGCCGCTGCCGGTGACTTCAACTATGGCGATGCTCTGGATTTGAGCTTCTTGTATCTGGAGGCGCAAAGGTCCGGTAAACTTCCGGCGGACCGCCGTGTTAAGTGGCGGGGGGATTCTGGCCTTAAGGATGGTCTTGCCCAAGGAGTAAGTTTCCCGGCCTTTTAAAAAAAAATCGATTTTTTTTAACTCTCTATGGCGCAAATTTGATTAATTATATTCTATAAATATGGAAATTAATTAAGTATAAAAAAAAGTTAAATTAAACTGTCCCTCATCTCATAAAATTGTTAGTCATTACATATTGTCAGATTGAATTACATTCCATAAACAAAACACATATTTAATACCTTAGGTAGATATTGTTTTTGTATTTTGTTGATACAAAACAAAAACTTTCAAGGATAAATTATAATAATTTTAAACTTGAGAGATGTTTTAAACACAAGCTTACAAGTTTAATAGACCTTTTCATAAAAAAAATCAAACGATTCGAGTTAACAGAAGGCATTTTGATCATAAATGGCAGGTGAATTTAGTGGGAGGGTACTATGATGCGGGAGACAATGTGAAATTCGGGCTACCAATGTCGTTCGTGACAACAATTCTTTCATGGGGAGCCATTGATTTCAAGAAGGAAATTACAGACATCAACCAAATGGACAACGCTTTGAAGGCCATTAAATGGGCCACTGATTACTTGATCAAGGCACACACTCAACCCAACGTTCTTTGGGCCCAGGTTGGCGATGGCGCCTCCGATCACTTCTGTTGGGAGCGCCCCGAGGATATGACTACTCCTCGGACTGCTTTCAAGATCGATGAGTCCCATCCTGGCTCTGACCTTGCCGGAGAAACGGCCGCTGCTTTGGCTGCAGCTTCCATTACTTTCAAGTCTTACAACTCTTCTTATTCTGATCTCTTGTTGGTTCATGCTAAAGAGGTTAGATAATTTCCCTCAACAAAATAATAATAATAATAATAGTAACTAACATACAATTTTGTCAAAAGGAAAAACTTAGGTGCAATCTCAATTAACATCTACTAAGTGGTAATTTTTATTTTTAAAAAAAAAAAAATTGATTTTTTTTCCTATTATTATATATGAAAAAGGGCGGTTATTGTAGCGGCCCAAGCTACACCTAATCATTGCCATTTTTAATTTTATTTTTTTGTCTAGTTTCAATTTGATTTCTATTTTTAAAATATTACACTTATATTCTTCAACTTTGTTTGATATTTTTTCTATTTACGGTAATTTTCCTATAAAAATTCATGTATGTTACAATGCCATAACTGTATTTTCTATATATACTTATCTCCTCAATCTAATGACTCTTATTTCTCACCATGTTTTTTTTTACTATATTGTCTTCAGCTGTTCACATTTGCAGACACATTTAGAGGTCTATACGACGATTCTATTCCATGTGCTTCAGAGTTCTACACTTCAACAGGATATTGGGTAAGAATTTTGAATCCCCCCCTTTCGTCTAATTGCTCAAATTCGATTTAATTAATTAATTAATAATTTTTATTTTTTAAATTGAAGGATGAACTGCTGTGGGCAGCGGCGTGGTTATTCAGAGCCACCGGAGACGAATCTTACTTGAAATACACGGTAGATAAAGCGGTGTCATTTGGTGGAACTGGATGGGCAATGAAAGAGTTCTCTTGGGACAACAAATATGCCGGTGTTCAAATTCTTTTAACTAATGTAAAGCTTAATTGATTAGATTATTTTAGATTGATTTGAGTTCGATTTTTCCTTCAAATATAAATTAAATTAAATGCCTAACACGTTTTTCTCGAATGTTTTTTAATGGAAATTATTTGATTTTGTAGGTGCTGTTGGAAGGACGTGGCGGTGGTTACGAGTCGACGTTGAAGCAATATCAAGCGAAAGCTAACTATTTTGCTTGCGCCTGTCTTCAGAAGAACGATGGATTCAACATTAACAAAACTCCCGGTAAATACAAATATTGCCAGATTGCCATATATATTATTATTATTATGGAAAAATAGTAAAAATCACACTTGTAATAGGGTATCAAACTTTCCATAGTAAAAATTTAGTCCTCAAACTAATACCAATGTTAAAATTAGATTCTCAAACTTATATAATTCTAGAAATTATAATGGTGCAAGTTTGAGAATTCAATTTTGATCATTTAGATATTCAATTTATACAATTATTGTAAGTTTGAGAGTTAATTTTAAATAAAAAGCTCAATTTTAAATTTTTAAACCCAAATATCCCCACATAAATTAATTTTTCCAATAAATGAGAATTGAAGTACAACTACTATTAAACTAGAATCATTATTTCTATGATTTATTTGTTAAAATATAAAGCATTAATGGGACTTATTTTTAACTTTCTAAGAAAAAGTTTTATTTTGATATTTAAACATATTTTGGTGTATGATATTTCAAAATATTTATTTACTGCTTTAAATTTTTTAATTTATTTTATTTTGATAATAAAATTTTGAAATGAATATTTTAATCTAAACTTTCCGAGAAAAGAAAATAAAGAAACAGTCTTTACTTTAAATTAAGTAATGATGCAACTTTCCTAAATCCTAGATGATCATATTTATAAAGAAATGCATTGAGAAGAAGTAGAAAGATTTATTTGTAATTTTAGTCTTCAACATTGAGCCTTGGGATTTTTTTTTTCTTTTTCCATTTGATTAATAGATTTAAAGTGTTACACATTTTAGTCTCTATATTTATAATTATTTGGTTTCAGTTTGTCTCTAAATTTTAAATTTTATATAATTAACCTTGTTTTTTTTATTAAAAGCTCAGCTAATATTCGCTGTTAATACGTATGAATTAATTTAAGACAATTATGAGTGAAAATTTAAAATTAATTTTAAAAGTGAAAATTAGTGACGGAAAACAAGGCTAATTTTTCGATTTGATTTTCGTGGGGGAAAAAAAACACAGGTGGATTGATGTACGCCCATGAATGGAACAATATGCAATACGCCTCCGCCGCCGCATTCCTCATGGCCGTCTACTCCGACTACCTCTCCAACGCCAACGCAAAGCTCACTTGTCCCGACGGCGTCTTCGAGCCTAAAGATCTCCTCAACTTCGCCCAATCTCAAGCCGATTACATCCTCGGCAAAAACCCTAACTCCCTCAGCTATTTAATCGGCTACGGTTCCAAATTCCCCCAAAAACTCCACCACAGAGGCTCTTCAATCGCTTCAATTTTCACCGATCCCTCTCCAATCGGCTGCGTTCAAGGCTTCGATTTCTGGTACCATCGCCCTCAAGGAAACCCTAACGTTCTTCACGGCGCTCTCGTCGGCGGTCCGGACAAGGACGACAGATTCGATGACGATCGATCTGATTATGAACAATCTGAGCCCACTCTCACCGCTTGTGCTCCTCTGGTTGGCCTCTTCTCCAAGTTGCAGAGCTCCGTTAATGGCTATTCTATTCCTGGTAATTCAGAGGCGGATTGGATTTTTAGGGTTCGAGATTATGTGAACTTCTTGCTGATTTTATGCTTGTCCGTTGAAAACAGGATCGCTGGGAAATAAGCCGCCGGTGAAACCTGAAAGGGAATCCCCTGACGCGAGCGCCCCTGTACCTGCAGGTGAGGGAAAAAAACAGGGTATATATATATATATATCATTATTACTATTTATAACGGTTTTATTTGCTTAATTTATCCATTTTAATTATAATCTTATTTTAATTAAATTAAAATTAACATTTGTGTTTATTTGATTTATGAAATTCACAAATTTATATTTAATAAATAATTTAAAATTAATTTTTTTTAACTTTTTTGAGATATATTAAATATAAAATTGAAAGTTCACAAATATGTATTTTTTTTAAAGTTTTACGTGCTAAATAAACACAAACTCTAAATTAAAATAAAATAAATAAAAAAAAAAAACTATGCAATTTAACATTTTATTTTCAATCGTTCCTTCATGCTATTGTGTTTTGTGAAAACCATTTCTTAGCTTTTTCTTGCTAGATACACAATTATTTAGTTGTGTGATTTTATTTACTTAGGTATCTTTAGTATAAGTTAAGACTTCGAAGTTGGAGATTTGATTATCTCAACCCATAAATTGTTTGAAAAAAATTGTAAATTCCTTTTTTGACACAAGACATTTTATAGAAAAATTATGTTGAAAATCATATGTTCGATACCCCTACCCACTTATTGTCAACATTTATTACTCTATTTGTTAAATACCATGTTAAAGTACAAACAAATATAAAAATAAAACTTAAGATAATATTCTTTTTTCTTCCTACAATTATGGGTTGAGTAACCAAACCATTAACCTCTAAAATGATAATTAGTGTCTTATCCACCAAGCTATAATCGGATTGACAATTAGAGATAGATAGTGAGAACTCCAATTTTTTTTCTTCACGTTCTTTAACAAGTCCTTGTGAACATTTTGCAGGGTCTCCAATTGAGTTTATCCACACAATAACAAGCACATGGACGACAAACAAGGAGAGTTATTACAGGCACCAAGTGAAGATAAAAAACACATCAGGAAAGCCGATCAACAATCTGAAACTCCAACTCGAAAACCTCTCAGGCCCTATTTGGGGACTTTCTCCAACAAAGCAGAAAGGGATTTATGAGCTTCCGGCGTGGCTAACAGTACTTCAGCCTGGCTCTGAATGCATTTTTATCTACATTCAGGAAGGTCCTCAAGCTAGAGTCACTGTTTCTAGCTACCATTAA

mRNA sequence

ATGGCGCGTGGTTTCTTATTGGCTTTGGTCGCCGCCGTGCTCTATTTTGAAGCCGCCGCTGCCGGTGACTTCAACTATGGCGATGCTCTGGATTTGAGCTTCTTGTATCTGGAGGCGCAAAGGTCCGGTAAACTTCCGGCGGACCGCCGTGTTAAGTGGCGGGGGGATTCTGGCCTTAAGGATGGTCTTGCCCAAGGAGTGAATTTAGTGGGAGGGTACTATGATGCGGGAGACAATGTGAAATTCGGGCTACCAATGTCGTTCGTGACAACAATTCTTTCATGGGGAGCCATTGATTTCAAGAAGGAAATTACAGACATCAACCAAATGGACAACGCTTTGAAGGCCATTAAATGGGCCACTGATTACTTGATCAAGGCACACACTCAACCCAACGTTCTTTGGGCCCAGGTTGGCGATGGCGCCTCCGATCACTTCTGTTGGGAGCGCCCCGAGGATATGACTACTCCTCGGACTGCTTTCAAGATCGATGAGTCCCATCCTGGCTCTGACCTTGCCGGAGAAACGGCCGCTGCTTTGGCTGCAGCTTCCATTACTTTCAAGTCTTACAACTCTTCTTATTCTGATCTCTTGTTGGTTCATGCTAAAGAGCTGTTCACATTTGCAGACACATTTAGAGGTCTATACGACGATTCTATTCCATGTGCTTCAGAGTTCTACACTTCAACAGGATATTGGGATGAACTGCTGTGGGCAGCGGCGTGGTTATTCAGAGCCACCGGAGACGAATCTTACTTGAAATACACGGTAGATAAAGCGGTGTCATTTGGTGGAACTGGATGGGCAATGAAAGAGTTCTCTTGGGACAACAAATATGCCGGTGTTCAAATTCTTTTAACTAATGTGCTGTTGGAAGGACGTGGCGGTGGTTACGAGTCGACGTTGAAGCAATATCAAGCGAAAGCTAACTATTTTGCTTGCGCCTGTCTTCAGAAGAACGATGGATTCAACATTAACAAAACTCCCGGTGGATTGATGTACGCCCATGAATGGAACAATATGCAATACGCCTCCGCCGCCGCATTCCTCATGGCCGTCTACTCCGACTACCTCTCCAACGCCAACGCAAAGCTCACTTGTCCCGACGGCGTCTTCGAGCCTAAAGATCTCCTCAACTTCGCCCAATCTCAAGCCGATTACATCCTCGGCAAAAACCCTAACTCCCTCAGCTATTTAATCGGCTACGGTTCCAAATTCCCCCAAAAACTCCACCACAGAGGCTCTTCAATCGCTTCAATTTTCACCGATCCCTCTCCAATCGGCTGCGTTCAAGGCTTCGATTTCTGGTACCATCGCCCTCAAGGAAACCCTAACGTTCTTCACGGCGCTCTCGTCGGCGGTCCGGACAAGGACGACAGATTCGATGACGATCGATCTGATTATGAACAATCTGAGCCCACTCTCACCGCTTGTGCTCCTCTGGTTGGCCTCTTCTCCAAGTTGCAGAGCTCCGTTAATGGCTATTCTATTCCTGGATCGCTGGGAAATAAGCCGCCGGTGAAACCTGAAAGGGAATCCCCTGACGCGAGCGCCCCTGTACCTGCAGGGTCTCCAATTGAGTTTATCCACACAATAACAAGCACATGGACGACAAACAAGGAGAGTTATTACAGGCACCAAGTGAAGATAAAAAACACATCAGGAAAGCCGATCAACAATCTGAAACTCCAACTCGAAAACCTCTCAGGCCCTATTTGGGGACTTTCTCCAACAAAGCAGAAAGGGATTTATGAGCTTCCGGCGTGGCTAACAGTACTTCAGCCTGGCTCTGAATGCATTTTTATCTACATTCAGGAAGGTCCTCAAGCTAGAGTCACTGTTTCTAGCTACCATTAA

Coding sequence (CDS)

ATGGCGCGTGGTTTCTTATTGGCTTTGGTCGCCGCCGTGCTCTATTTTGAAGCCGCCGCTGCCGGTGACTTCAACTATGGCGATGCTCTGGATTTGAGCTTCTTGTATCTGGAGGCGCAAAGGTCCGGTAAACTTCCGGCGGACCGCCGTGTTAAGTGGCGGGGGGATTCTGGCCTTAAGGATGGTCTTGCCCAAGGAGTGAATTTAGTGGGAGGGTACTATGATGCGGGAGACAATGTGAAATTCGGGCTACCAATGTCGTTCGTGACAACAATTCTTTCATGGGGAGCCATTGATTTCAAGAAGGAAATTACAGACATCAACCAAATGGACAACGCTTTGAAGGCCATTAAATGGGCCACTGATTACTTGATCAAGGCACACACTCAACCCAACGTTCTTTGGGCCCAGGTTGGCGATGGCGCCTCCGATCACTTCTGTTGGGAGCGCCCCGAGGATATGACTACTCCTCGGACTGCTTTCAAGATCGATGAGTCCCATCCTGGCTCTGACCTTGCCGGAGAAACGGCCGCTGCTTTGGCTGCAGCTTCCATTACTTTCAAGTCTTACAACTCTTCTTATTCTGATCTCTTGTTGGTTCATGCTAAAGAGCTGTTCACATTTGCAGACACATTTAGAGGTCTATACGACGATTCTATTCCATGTGCTTCAGAGTTCTACACTTCAACAGGATATTGGGATGAACTGCTGTGGGCAGCGGCGTGGTTATTCAGAGCCACCGGAGACGAATCTTACTTGAAATACACGGTAGATAAAGCGGTGTCATTTGGTGGAACTGGATGGGCAATGAAAGAGTTCTCTTGGGACAACAAATATGCCGGTGTTCAAATTCTTTTAACTAATGTGCTGTTGGAAGGACGTGGCGGTGGTTACGAGTCGACGTTGAAGCAATATCAAGCGAAAGCTAACTATTTTGCTTGCGCCTGTCTTCAGAAGAACGATGGATTCAACATTAACAAAACTCCCGGTGGATTGATGTACGCCCATGAATGGAACAATATGCAATACGCCTCCGCCGCCGCATTCCTCATGGCCGTCTACTCCGACTACCTCTCCAACGCCAACGCAAAGCTCACTTGTCCCGACGGCGTCTTCGAGCCTAAAGATCTCCTCAACTTCGCCCAATCTCAAGCCGATTACATCCTCGGCAAAAACCCTAACTCCCTCAGCTATTTAATCGGCTACGGTTCCAAATTCCCCCAAAAACTCCACCACAGAGGCTCTTCAATCGCTTCAATTTTCACCGATCCCTCTCCAATCGGCTGCGTTCAAGGCTTCGATTTCTGGTACCATCGCCCTCAAGGAAACCCTAACGTTCTTCACGGCGCTCTCGTCGGCGGTCCGGACAAGGACGACAGATTCGATGACGATCGATCTGATTATGAACAATCTGAGCCCACTCTCACCGCTTGTGCTCCTCTGGTTGGCCTCTTCTCCAAGTTGCAGAGCTCCGTTAATGGCTATTCTATTCCTGGATCGCTGGGAAATAAGCCGCCGGTGAAACCTGAAAGGGAATCCCCTGACGCGAGCGCCCCTGTACCTGCAGGGTCTCCAATTGAGTTTATCCACACAATAACAAGCACATGGACGACAAACAAGGAGAGTTATTACAGGCACCAAGTGAAGATAAAAAACACATCAGGAAAGCCGATCAACAATCTGAAACTCCAACTCGAAAACCTCTCAGGCCCTATTTGGGGACTTTCTCCAACAAAGCAGAAAGGGATTTATGAGCTTCCGGCGTGGCTAACAGTACTTCAGCCTGGCTCTGAATGCATTTTTATCTACATTCAGGAAGGTCCTCAAGCTAGAGTCACTGTTTCTAGCTACCATTAA

Protein sequence

MARGFLLALVAAVLYFEAAAAGDFNYGDALDLSFLYLEAQRSGKLPADRRVKWRGDSGLKDGLAQGVNLVGGYYDAGDNVKFGLPMSFVTTILSWGAIDFKKEITDINQMDNALKAIKWATDYLIKAHTQPNVLWAQVGDGASDHFCWERPEDMTTPRTAFKIDESHPGSDLAGETAAALAAASITFKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCASEFYTSTGYWDELLWAAAWLFRATGDESYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQILLTNVLLEGRGGGYESTLKQYQAKANYFACACLQKNDGFNINKTPGGLMYAHEWNNMQYASAAAFLMAVYSDYLSNANAKLTCPDGVFEPKDLLNFAQSQADYILGKNPNSLSYLIGYGSKFPQKLHHRGSSIASIFTDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDKDDRFDDDRSDYEQSEPTLTACAPLVGLFSKLQSSVNGYSIPGSLGNKPPVKPERESPDASAPVPAGSPIEFIHTITSTWTTNKESYYRHQVKIKNTSGKPINNLKLQLENLSGPIWGLSPTKQKGIYELPAWLTVLQPGSECIFIYIQEGPQARVTVSSYH
Homology
BLAST of HG10023079 vs. NCBI nr
Match: XP_038899109.1 (endoglucanase 5 [Benincasa hispida])

HSP 1 Score: 1183.7 bits (3061), Expect = 0.0e+00
Identity = 562/618 (90.94%), Postives = 590/618 (95.47%), Query Frame = 0

Query: 1   MARGFLLALVAAVLYFEAAAAGDFNYGDALDLSFLYLEAQRSGKLPADRRVKWRGDSGLK 60
           MARGFLLAL+AAVL FEAAAAG+FNYGDALDLSFLY+EAQRSGKLPADRRVKWRGDSGLK
Sbjct: 1   MARGFLLALIAAVLCFEAAAAGEFNYGDALDLSFLYMEAQRSGKLPADRRVKWRGDSGLK 60

Query: 61  DGLAQGVNLVGGYYDAGDNVKFGLPMSFVTTILSWGAIDFKKEITDINQMDNALKAIKWA 120
           DG AQGVNLVGGYYDAGDNVKFGLPM+FVTTILSWG IDF KEIT +N +DNALKAIKWA
Sbjct: 61  DGFAQGVNLVGGYYDAGDNVKFGLPMAFVTTILSWGVIDFNKEITHLNHIDNALKAIKWA 120

Query: 121 TDYLIKAHTQPNVLWAQVGDGASDHFCWERPEDMTTPRTAFKIDESHPGSDLAGETAAAL 180
           TDYLIKAH QPNVLW QVGDGASDHFCWERPEDM+TPRTAFKIDESHPGSDLAGET+AAL
Sbjct: 121 TDYLIKAHPQPNVLWGQVGDGASDHFCWERPEDMSTPRTAFKIDESHPGSDLAGETSAAL 180

Query: 181 AAASITFKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCASEFYTSTGYWDELLWAA 240
           AAASI FK+YNSSYSDLLLVHAKELFTFADTFRG+YDDSIPCAS FYTS+GYWDELLWAA
Sbjct: 181 AAASIAFKTYNSSYSDLLLVHAKELFTFADTFRGVYDDSIPCASGFYTSSGYWDELLWAA 240

Query: 241 AWLFRATGDESYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQILLTNVLLEGRGGGYES 300
           AWLFRATGDE YLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQ+LLT VLLEGRGGGYES
Sbjct: 241 AWLFRATGDEYYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQVLLTKVLLEGRGGGYES 300

Query: 301 TLKQYQAKANYFACACLQKNDGFNINKTPGGLMYAHEWNNMQYASAAAFLMAVYSDYLSN 360
           TL+QYQAKA+YFACACLQKNDGFNINKTPGGLMYAH+WNNMQY SAA+FLMAVYSDYLS+
Sbjct: 301 TLRQYQAKADYFACACLQKNDGFNINKTPGGLMYAHQWNNMQYVSAASFLMAVYSDYLSD 360

Query: 361 ANAKLTCPDGVFEPKDLLNFAQSQADYILGKNPNSLSYLIGYGSKFPQKLHHRGSSIASI 420
           ANAKLTCPDGVF+PKDLLNFAQSQ DYILGKNPNSLSYLIGYGSKFPQKLHHR SSIASI
Sbjct: 361 ANAKLTCPDGVFQPKDLLNFAQSQIDYILGKNPNSLSYLIGYGSKFPQKLHHRASSIASI 420

Query: 421 FTDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDKDDRFDDDRSDYEQSEPTLTACAP 480
           FT+PSPIGCVQGFDFWYHRPQGNPN+L GALVGGPDK+DRFDDDRSDYEQSEPTLTACAP
Sbjct: 421 FTNPSPIGCVQGFDFWYHRPQGNPNILLGALVGGPDKNDRFDDDRSDYEQSEPTLTACAP 480

Query: 481 LVGLFSKLQSSVNGYSIPGSLGNKPPVKPERESPDASAPVPAGSPIEFIHTITSTWTTNK 540
           L+GLFSKLQSSVNG SIPGS GNKPPVKPE ES D +APV AGSP+EFIHTITSTWT NK
Sbjct: 481 LIGLFSKLQSSVNGDSIPGSRGNKPPVKPENESHDGNAPVSAGSPVEFIHTITSTWTVNK 540

Query: 541 ESYYRHQVKIKNTSGKPINNLKLQLENLSGPIWGLSPTKQKGIYELPAWLTVLQPGSECI 600
           ESYYRHQVKIKN SGK INNL+LQLENLSGPIWGLSPT+ KGIYELP WL VLQPGSEC+
Sbjct: 541 ESYYRHQVKIKNISGKSINNLRLQLENLSGPIWGLSPTEHKGIYELPTWLRVLQPGSECV 600

Query: 601 FIYIQEGPQARVTVSSYH 619
           FIYIQEGPQA+VTVSSYH
Sbjct: 601 FIYIQEGPQAKVTVSSYH 618

BLAST of HG10023079 vs. NCBI nr
Match: XP_022991484.1 (endoglucanase 5 [Cucurbita maxima])

HSP 1 Score: 1173.7 bits (3035), Expect = 0.0e+00
Identity = 556/618 (89.97%), Postives = 593/618 (95.95%), Query Frame = 0

Query: 1   MARGFLLALVAAVLYFEAAAAGDFNYGDALDLSFLYLEAQRSGKLPADRRVKWRGDSGLK 60
           M RG+LLALVAAVL FE AAAGDFNYGDA+DLSFLYLEAQRSGKLPADRRVKWRGDSGLK
Sbjct: 2   MPRGYLLALVAAVLCFE-AAAGDFNYGDAVDLSFLYLEAQRSGKLPADRRVKWRGDSGLK 61

Query: 61  DGLAQGVNLVGGYYDAGDNVKFGLPMSFVTTILSWGAIDFKKEITDINQMDNALKAIKWA 120
           DGLAQGVNLVGGYYDAGDNVKFGLPM+FVTTILSWGAIDFKKEITD+N MDNALKAIKW 
Sbjct: 62  DGLAQGVNLVGGYYDAGDNVKFGLPMAFVTTILSWGAIDFKKEITDLNHMDNALKAIKWG 121

Query: 121 TDYLIKAHTQPNVLWAQVGDGASDHFCWERPEDMTTPRTAFKIDESHPGSDLAGETAAAL 180
           TDYLIKAH + NVLWAQVGDGASDHFCW+RPEDM+TPRTA+K+DESHPGSDLAGETAAAL
Sbjct: 122 TDYLIKAHPERNVLWAQVGDGASDHFCWQRPEDMSTPRTAYKLDESHPGSDLAGETAAAL 181

Query: 181 AAASITFKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCASEFYTSTGYWDELLWAA 240
           AAASI FKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCA++FY+S+GYWDELLWAA
Sbjct: 182 AAASIAFKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCAADFYSSSGYWDELLWAA 241

Query: 241 AWLFRATGDESYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQILLTNVLLEGRGGGYES 300
           AWLFRATGDE YLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQ+LLT VLLEGR   YES
Sbjct: 242 AWLFRATGDEYYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQVLLTKVLLEGRAAAYES 301

Query: 301 TLKQYQAKANYFACACLQKNDGFNINKTPGGLMYAHEWNNMQYASAAAFLMAVYSDYLSN 360
           TLKQYQAKA+YFACACLQKNDGFNINKTPGGLMY  EWNNMQYASAAAFLMAVYS YLS+
Sbjct: 302 TLKQYQAKADYFACACLQKNDGFNINKTPGGLMYVREWNNMQYASAAAFLMAVYSVYLSD 361

Query: 361 ANAKLTCPDGVFEPKDLLNFAQSQADYILGKNPNSLSYLIGYGSKFPQKLHHRGSSIASI 420
           ANAKLTCPDGVFEPK+LLNFAQSQADYILGKNPNS+SYLIGYGSKFPQKLHHRGSSI SI
Sbjct: 362 ANAKLTCPDGVFEPKELLNFAQSQADYILGKNPNSISYLIGYGSKFPQKLHHRGSSIDSI 421

Query: 421 FTDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDKDDRFDDDRSDYEQSEPTLTACAP 480
           FTDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDK+DRFDDDRS+YEQ+EPTLTACAP
Sbjct: 422 FTDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDKNDRFDDDRSEYEQTEPTLTACAP 481

Query: 481 LVGLFSKLQSSVNGYSIPGSLGNKPPVKPERESPDASAPVPAGSPIEFIHTITSTWTTNK 540
           L+GLFSKLQSSVNGY IPGS GN PPVKPE+ESP+A+APVP G+P+EFIHTITSTW  +K
Sbjct: 482 LLGLFSKLQSSVNGYRIPGSRGNPPPVKPEKESPNANAPVPVGAPVEFIHTITSTWMVSK 541

Query: 541 ESYYRHQVKIKNTSGKPINNLKLQLENLSGPIWGLSPTKQKGIYELPAWLTVLQPGSECI 600
           +SYYRHQVKIKNTSGKPI++LKL+LENLSGP+WGLSPT+QKGIYELP WL +LQPGSEC+
Sbjct: 542 DSYYRHQVKIKNTSGKPISDLKLRLENLSGPVWGLSPTQQKGIYELPPWLRLLQPGSECV 601

Query: 601 FIYIQEGPQARVTVSSYH 619
           FIYIQEGPQA+VTVSSYH
Sbjct: 602 FIYIQEGPQAKVTVSSYH 618

BLAST of HG10023079 vs. NCBI nr
Match: XP_022953304.1 (endoglucanase 5 [Cucurbita moschata])

HSP 1 Score: 1171.0 bits (3028), Expect = 0.0e+00
Identity = 556/618 (89.97%), Postives = 592/618 (95.79%), Query Frame = 0

Query: 1   MARGFLLALVAAVLYFEAAAAGDFNYGDALDLSFLYLEAQRSGKLPADRRVKWRGDSGLK 60
           M RG+LLALVAAVL FE AAAGDFNYGDA+DLSFLYLEAQRSGKLPADRRVKWRGDSGLK
Sbjct: 2   MPRGYLLALVAAVLCFE-AAAGDFNYGDAVDLSFLYLEAQRSGKLPADRRVKWRGDSGLK 61

Query: 61  DGLAQGVNLVGGYYDAGDNVKFGLPMSFVTTILSWGAIDFKKEITDINQMDNALKAIKWA 120
           DGLAQGVNLVGGYYDAGDNVKFGLPM+FVTTILSWGAIDFKKEITD+N MD ALKAIKW 
Sbjct: 62  DGLAQGVNLVGGYYDAGDNVKFGLPMAFVTTILSWGAIDFKKEITDLNHMDKALKAIKWG 121

Query: 121 TDYLIKAHTQPNVLWAQVGDGASDHFCWERPEDMTTPRTAFKIDESHPGSDLAGETAAAL 180
           TDYLIKAH + NVLWAQVGDGASDHFCW+RPEDM+TPRTA+K+DESHPGSDLAGETAAAL
Sbjct: 122 TDYLIKAHPERNVLWAQVGDGASDHFCWQRPEDMSTPRTAYKLDESHPGSDLAGETAAAL 181

Query: 181 AAASITFKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCASEFYTSTGYWDELLWAA 240
           AAASI FKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCA++FY+S+GYWDELLWAA
Sbjct: 182 AAASIAFKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCAADFYSSSGYWDELLWAA 241

Query: 241 AWLFRATGDESYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQILLTNVLLEGRGGGYES 300
           AWLFRATGDE YLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQ+LLT VLLEGR   Y S
Sbjct: 242 AWLFRATGDEYYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQVLLTKVLLEGRAAAYGS 301

Query: 301 TLKQYQAKANYFACACLQKNDGFNINKTPGGLMYAHEWNNMQYASAAAFLMAVYSDYLSN 360
           TLKQYQAKA+YFACACLQKNDGFNINKTPGGLMY  EWNNMQYASAAAFLMAVYS YLS+
Sbjct: 302 TLKQYQAKADYFACACLQKNDGFNINKTPGGLMYVREWNNMQYASAAAFLMAVYSVYLSD 361

Query: 361 ANAKLTCPDGVFEPKDLLNFAQSQADYILGKNPNSLSYLIGYGSKFPQKLHHRGSSIASI 420
           ANAKLTCPDGVFEPK+LLNFAQSQADYILGKNPNS+SYLIGYGSKFPQKLHHRGSSIASI
Sbjct: 362 ANAKLTCPDGVFEPKELLNFAQSQADYILGKNPNSISYLIGYGSKFPQKLHHRGSSIASI 421

Query: 421 FTDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDKDDRFDDDRSDYEQSEPTLTACAP 480
           FTDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDK+DRFDDDRS+YEQ+EPTLTACAP
Sbjct: 422 FTDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDKNDRFDDDRSEYEQTEPTLTACAP 481

Query: 481 LVGLFSKLQSSVNGYSIPGSLGNKPPVKPERESPDASAPVPAGSPIEFIHTITSTWTTNK 540
           L+GLFSKLQSSVNGY IPGS GN PPVKPE+ESP+A+APVP G+P+EFIHTITSTW  +K
Sbjct: 482 LLGLFSKLQSSVNGYRIPGSRGNPPPVKPEKESPNANAPVPVGAPVEFIHTITSTWMVSK 541

Query: 541 ESYYRHQVKIKNTSGKPINNLKLQLENLSGPIWGLSPTKQKGIYELPAWLTVLQPGSECI 600
           ESYYRHQVKIKNTSGKPI++LKL+LENLSGPIWGLSPT+Q+GIYELP WL +LQPGSEC+
Sbjct: 542 ESYYRHQVKIKNTSGKPISDLKLRLENLSGPIWGLSPTQQEGIYELPPWLRLLQPGSECV 601

Query: 601 FIYIQEGPQARVTVSSYH 619
           FIYIQEGPQA+VTVSSYH
Sbjct: 602 FIYIQEGPQAKVTVSSYH 618

BLAST of HG10023079 vs. NCBI nr
Match: XP_023548134.1 (endoglucanase 5 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1169.8 bits (3025), Expect = 0.0e+00
Identity = 555/619 (89.66%), Postives = 593/619 (95.80%), Query Frame = 0

Query: 1   MARGFLLALVAAVLYFEAAA-AGDFNYGDALDLSFLYLEAQRSGKLPADRRVKWRGDSGL 60
           M RG+LLALVAAVL FEAAA AG+FNYGDA+DLSFLYLEAQRSGKLPADRRVKWRGDSGL
Sbjct: 1   MPRGYLLALVAAVLCFEAAATAGEFNYGDAVDLSFLYLEAQRSGKLPADRRVKWRGDSGL 60

Query: 61  KDGLAQGVNLVGGYYDAGDNVKFGLPMSFVTTILSWGAIDFKKEITDINQMDNALKAIKW 120
           KDGLAQGVNLVGGYYDAGDNVKFGLPM+FVTTILSWGAIDFKKEITD+N MD ALKAIKW
Sbjct: 61  KDGLAQGVNLVGGYYDAGDNVKFGLPMAFVTTILSWGAIDFKKEITDLNHMDKALKAIKW 120

Query: 121 ATDYLIKAHTQPNVLWAQVGDGASDHFCWERPEDMTTPRTAFKIDESHPGSDLAGETAAA 180
            TDYLIKAH + NVLWAQVGDGASDHFCW+RPEDM+TPRTA+K+DESHPGSDLAGETAAA
Sbjct: 121 GTDYLIKAHPERNVLWAQVGDGASDHFCWQRPEDMSTPRTAYKLDESHPGSDLAGETAAA 180

Query: 181 LAAASITFKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCASEFYTSTGYWDELLWA 240
           LAAASI FK+YNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCA++FY+S+GYWDELLWA
Sbjct: 181 LAAASIAFKTYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCAADFYSSSGYWDELLWA 240

Query: 241 AAWLFRATGDESYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQILLTNVLLEGRGGGYE 300
           AAWLFRATGDE YLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQ+LLT VLLEGR   YE
Sbjct: 241 AAWLFRATGDEYYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQVLLTKVLLEGRAAAYE 300

Query: 301 STLKQYQAKANYFACACLQKNDGFNINKTPGGLMYAHEWNNMQYASAAAFLMAVYSDYLS 360
           STLKQYQAKA+YFACACLQKNDGFNINKTPGGLMY  EWNNMQYASAAAFLMAVYS YLS
Sbjct: 301 STLKQYQAKADYFACACLQKNDGFNINKTPGGLMYVREWNNMQYASAAAFLMAVYSVYLS 360

Query: 361 NANAKLTCPDGVFEPKDLLNFAQSQADYILGKNPNSLSYLIGYGSKFPQKLHHRGSSIAS 420
           +ANAKLTCPDGVFEPK+LLNFAQSQADYILGKNPNS+SYLIGYGSKFPQKLHHRGSSIAS
Sbjct: 361 DANAKLTCPDGVFEPKELLNFAQSQADYILGKNPNSISYLIGYGSKFPQKLHHRGSSIAS 420

Query: 421 IFTDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDKDDRFDDDRSDYEQSEPTLTACA 480
           IF+DPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDK+DRFDDDRS+YEQ+EPTLTACA
Sbjct: 421 IFSDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDKNDRFDDDRSEYEQTEPTLTACA 480

Query: 481 PLVGLFSKLQSSVNGYSIPGSLGNKPPVKPERESPDASAPVPAGSPIEFIHTITSTWTTN 540
           PL+GLFSKLQSSVNGY IPGS GN PPVKPE +SP+A+APVP G+P+EFIHTITSTW  +
Sbjct: 481 PLLGLFSKLQSSVNGYRIPGSRGNPPPVKPEEKSPNANAPVPVGAPVEFIHTITSTWMVS 540

Query: 541 KESYYRHQVKIKNTSGKPINNLKLQLENLSGPIWGLSPTKQKGIYELPAWLTVLQPGSEC 600
           KESYYRHQVKIKNTSGKPI++LKL+LENLSGPIWGLSPT+QKGIYELP WL +LQPGSEC
Sbjct: 541 KESYYRHQVKIKNTSGKPISDLKLRLENLSGPIWGLSPTQQKGIYELPPWLRLLQPGSEC 600

Query: 601 IFIYIQEGPQARVTVSSYH 619
           +FIYIQEGPQA+VTVSSYH
Sbjct: 601 VFIYIQEGPQAKVTVSSYH 619

BLAST of HG10023079 vs. NCBI nr
Match: KAG7014275.1 (Endoglucanase 5, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1169.5 bits (3024), Expect = 0.0e+00
Identity = 555/618 (89.81%), Postives = 592/618 (95.79%), Query Frame = 0

Query: 1   MARGFLLALVAAVLYFEAAAAGDFNYGDALDLSFLYLEAQRSGKLPADRRVKWRGDSGLK 60
           M RG+LLALVAAVL F+ AAAGDFNYGDA+DLSFLYLEAQRSGKLPADRRVKWRGDSGLK
Sbjct: 2   MPRGYLLALVAAVLCFD-AAAGDFNYGDAVDLSFLYLEAQRSGKLPADRRVKWRGDSGLK 61

Query: 61  DGLAQGVNLVGGYYDAGDNVKFGLPMSFVTTILSWGAIDFKKEITDINQMDNALKAIKWA 120
           DGLAQGVNLVGGYYDAGDNVKFGLPM+FVTTILSWGAIDFKKEITD+N MD ALKAIKW 
Sbjct: 62  DGLAQGVNLVGGYYDAGDNVKFGLPMAFVTTILSWGAIDFKKEITDLNNMDKALKAIKWG 121

Query: 121 TDYLIKAHTQPNVLWAQVGDGASDHFCWERPEDMTTPRTAFKIDESHPGSDLAGETAAAL 180
           TDYLIKAH + NVLWAQVGDGASDHFCW+RPEDM+TPRTA+K+DESHPGSDLAGETAAAL
Sbjct: 122 TDYLIKAHPERNVLWAQVGDGASDHFCWQRPEDMSTPRTAYKLDESHPGSDLAGETAAAL 181

Query: 181 AAASITFKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCASEFYTSTGYWDELLWAA 240
           AAASI FKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCA++FY+S+GYWDELLWAA
Sbjct: 182 AAASIAFKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCAADFYSSSGYWDELLWAA 241

Query: 241 AWLFRATGDESYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQILLTNVLLEGRGGGYES 300
           AWLFRATGDE YLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQ+LLT VLLEGR   Y S
Sbjct: 242 AWLFRATGDEYYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQVLLTKVLLEGRAAAYGS 301

Query: 301 TLKQYQAKANYFACACLQKNDGFNINKTPGGLMYAHEWNNMQYASAAAFLMAVYSDYLSN 360
           TLKQYQAKA+YFACACLQKNDGFNINKTPGGLMY  EWNNMQYASAAAFLMAVYS YLS+
Sbjct: 302 TLKQYQAKADYFACACLQKNDGFNINKTPGGLMYVREWNNMQYASAAAFLMAVYSVYLSD 361

Query: 361 ANAKLTCPDGVFEPKDLLNFAQSQADYILGKNPNSLSYLIGYGSKFPQKLHHRGSSIASI 420
           ANAKLTCPDGVFEPK+LLNFAQSQADYILGKNPNS+SYLIGYGSKFPQKLHHRGSSIASI
Sbjct: 362 ANAKLTCPDGVFEPKELLNFAQSQADYILGKNPNSISYLIGYGSKFPQKLHHRGSSIASI 421

Query: 421 FTDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDKDDRFDDDRSDYEQSEPTLTACAP 480
           FTDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDK+DRFDDDRS+YEQ+EPTLTACAP
Sbjct: 422 FTDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDKNDRFDDDRSEYEQTEPTLTACAP 481

Query: 481 LVGLFSKLQSSVNGYSIPGSLGNKPPVKPERESPDASAPVPAGSPIEFIHTITSTWTTNK 540
           L+GLFSKLQSSV+GY IPGS GN PPVKPE+ESP+A+APVP G+P+EFIHTITSTW  +K
Sbjct: 482 LLGLFSKLQSSVDGYRIPGSRGNPPPVKPEKESPNANAPVPVGAPVEFIHTITSTWMVSK 541

Query: 541 ESYYRHQVKIKNTSGKPINNLKLQLENLSGPIWGLSPTKQKGIYELPAWLTVLQPGSECI 600
           ESYYRHQVKIKNTSGKPI++LKL+LENLSGPIWGLSPT+QKGIYELP WL +LQPGSEC+
Sbjct: 542 ESYYRHQVKIKNTSGKPISDLKLRLENLSGPIWGLSPTQQKGIYELPPWLRLLQPGSECV 601

Query: 601 FIYIQEGPQARVTVSSYH 619
           FIYIQEGPQA+VTVSSYH
Sbjct: 602 FIYIQEGPQAKVTVSSYH 618

BLAST of HG10023079 vs. ExPASy Swiss-Prot
Match: Q9M995 (Endoglucanase 5 OS=Arabidopsis thaliana OX=3702 GN=At1g48930 PE=1 SV=1)

HSP 1 Score: 840.5 bits (2170), Expect = 1.2e-242
Identity = 402/619 (64.94%), Postives = 485/619 (78.35%), Query Frame = 0

Query: 5   FLLALVAAVLYFEAAAAGD-FNYGDALDLSFLYLEAQRSGKLPADRRVKWRGDSGLKDGL 64
           F ++L+ +VL   A AA + +NYG ALD +FL+ EAQRSGKLPA +RVKWRG SGLKDGL
Sbjct: 9   FGVSLLLSVLLAAATAAAEYYNYGSALDKTFLFFEAQRSGKLPAAQRVKWRGPSGLKDGL 68

Query: 65  AQGVNLVGGYYDAGDNVKFGLPMSFVTTILSWGAIDFKKEITDINQMDNALKAIKWATDY 124
           AQGV+L GGYYDAGD+VKFGLPM+F  T+LSW A+D +KE++  NQM   L +I+W TDY
Sbjct: 69  AQGVSLEGGYYDAGDHVKFGLPMAFAVTMLSWAAVDNRKELSSSNQMQQTLWSIRWGTDY 128

Query: 125 LIKAHTQPNVLWAQVGDGASDHFCWERPEDMTTPRTAFKIDESHPGSDLAGETAAALAAA 184
            IKAH QPNVLW QVGDG SDH+CWERPEDMTT RTA+K+D  HPGSDLAGETAAALAAA
Sbjct: 129 FIKAHPQPNVLWGQVGDGESDHYCWERPEDMTTSRTAYKLDPYHPGSDLAGETAAALAAA 188

Query: 185 SITFKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCASEFYTSTGYWDELLWAAAWL 244
           S+ FK +NSSYS LLL HAKELF+FAD +RGLY +SIP A  FY S+GY DELLWAAAWL
Sbjct: 189 SLAFKPFNSSYSALLLSHAKELFSFADKYRGLYTNSIPNAKAFYMSSGYSDELLWAAAWL 248

Query: 245 FRATGDESYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQILLTNVLLEGRGGGYESTLK 304
            RATGD+ YLKY +D +   GGTGW +KEFSWDNKYAGVQILL+ +LLEG+GG Y STLK
Sbjct: 249 HRATGDQYYLKYAMDNSGYMGGTGWGVKEFSWDNKYAGVQILLSKILLEGKGGIYTSTLK 308

Query: 305 QYQAKANYFACACLQKNDGFNINKTPGGLMYAHEWNNMQYASAAAFLMAVYSDYLSNANA 364
           QYQ KA+YFACACL+KN G+NI  TPGGLMY  EWNN+QYASAAA+L+AVYSDYLS ANA
Sbjct: 309 QYQTKADYFACACLKKNGGYNIQTTPGGLMYVREWNNLQYASAAAYLLAVYSDYLSAANA 368

Query: 365 KLTCPDGVFEPKDLLNFAQSQADYILGKNPNSLSYLIGYGSKFPQKLHHRGSSIASIFTD 424
           KL CPDG+ +P+ LL+FA+SQADYILGKN   +SY++GYG K+P ++HHRGSSI SIF  
Sbjct: 369 KLNCPDGLVQPQGLLDFARSQADYILGKNRQGMSYVVGYGPKYPIRVHHRGSSIPSIFAQ 428

Query: 425 PSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDKDDRFDDDRSDYEQSEPTLTACAPLVG 484
            S + CVQGFD WY R QG+PNV++GALVGGPD++D + DDRS+YEQSEPTL+  APLVG
Sbjct: 429 RSSVSCVQGFDSWYRRSQGDPNVIYGALVGGPDENDNYSDDRSNYEQSEPTLSGTAPLVG 488

Query: 485 LFSKLQ----SSVNGYSIPGSLGNKPPVKPERESPDASAPVPAGSPIEFIHTITSTWTTN 544
           LF+KL      S  G S       KP     + +P   +P  +G+ IEF+H+ITS W   
Sbjct: 489 LFAKLYGGSLGSYGGGSYKPYETTKPAASSYKATPTTYSPKQSGAQIEFLHSITSNWIAG 548

Query: 545 KESYYRHQVKIKNTSGKPINNLKLQLENLSGPIWGLSPTKQKGIYELPAWLTVLQPGSEC 604
              YYRH+V IKN S KPI++LKL++E+LSGPIWGL+PT QK  Y+LP W   L+ G   
Sbjct: 549 NTRYYRHKVIIKNNSQKPISDLKLKIEDLSGPIWGLNPTGQKYTYQLPQWQKTLRAGQAY 608

Query: 605 IFIYIQEGPQARVTVSSYH 619
            F+Y+Q GPQA+V+V SY+
Sbjct: 609 DFVYVQGGPQAKVSVLSYN 627

BLAST of HG10023079 vs. ExPASy Swiss-Prot
Match: A2XYW8 (Endoglucanase 13 OS=Oryza sativa subsp. indica OX=39946 GN=GLU6 PE=3 SV=2)

HSP 1 Score: 746.1 bits (1925), Expect = 3.1e-214
Identity = 352/619 (56.87%), Postives = 453/619 (73.18%), Query Frame = 0

Query: 2   ARGFLLALVAAVLYFEAAAAGDFNYGDALDLSFLYLEAQRSGKLPADRRVKWRGDSGLKD 61
           A   L+ L+AA    EA+A   F+Y  A D   L+ EAQRSGKLP DR V+WRGDS L D
Sbjct: 18  AAASLVLLLAAAASVEASA---FDYAGAFDKCLLFFEAQRSGKLPDDRLVRWRGDSALTD 77

Query: 62  GLAQGVNLVGGYYDAGDNVKFGLPMSFVTTILSWGAIDFKKEITDINQMDNALKAIKWAT 121
           G +QGV+LVGGYYD+GD+VKFGLPM++  T+LSWG ++F+KE+ D N++   L AI+W T
Sbjct: 78  GFSQGVDLVGGYYDSGDHVKFGLPMAYAVTMLSWGVVEFEKEMVDGNKLHRVLDAIRWGT 137

Query: 122 DYLIKAHTQPNVLWAQVGDGASDHFCWERPEDMTTPRTAFKIDESHPGSDLAGETAAALA 181
           +Y +KAHTQ N LW QVGDG SDH CWER EDM+TPRTAFKID ++PGS++AGETAAALA
Sbjct: 138 NYFVKAHTQHNALWVQVGDGDSDHLCWERAEDMSTPRTAFKIDINNPGSEVAGETAAALA 197

Query: 182 AASITFKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCASEFYTS-TGYWDELLWAA 241
           AA+  FK Y+  YSDLLL+H+K+LFTFADTFRG YDDS+  A +FY S +GY DELLWAA
Sbjct: 198 AAAKAFKPYDRMYSDLLLLHSKQLFTFADTFRGKYDDSLQSAKKFYPSASGYQDELLWAA 257

Query: 242 AWLFRATGDESYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQILLTNVLLE--GRGGGY 301
           AWL+ ATGDE YL+Y    A +FGGTGWA+ EFSWDNKYAG+Q+LL+ VL E  G   GY
Sbjct: 258 AWLYEATGDEQYLRYVSQNAEAFGGTGWAVTEFSWDNKYAGLQVLLSKVLFEQGGSAAGY 317

Query: 302 ESTLKQYQAKANYFACACLQKNDGFNINKTPGGLMYAHEWNNMQYASAAAFLMAVYSDYL 361
             TLKQYQAKA +F CACLQKN+G N+  TPGGLMY  +W+NMQY S++AFL+ VY+DYL
Sbjct: 318 ADTLKQYQAKAEFFLCACLQKNNGHNVKMTPGGLMYVSDWSNMQYVSSSAFLLTVYADYL 377

Query: 362 SNANAKLTCPDGVFEPKDLLNFAQSQADYILGKNPNSLSYLIGYGSKFPQKLHHRGSSIA 421
           + +   L CPDG  +P ++L FA+SQ DY+LGKNP  +SY++GYGS +P  +HHRG+SI 
Sbjct: 378 AESRGTLRCPDGEVKPAEILLFARSQVDYVLGKNPKGMSYMVGYGSYYPTHVHHRGASIP 437

Query: 422 SIFTDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDKDDRFDDDRSDYEQSEPTLTAC 481
           SI+   + +GC++GFD +Y+    +PNVLHGALVGGPD +D +DDDR +Y+ +EPTL   
Sbjct: 438 SIYAMNATVGCMEGFDKYYNSKNADPNVLHGALVGGPDANDAYDDDRCNYQHAEPTLAGN 497

Query: 482 APLVGLFSKLQSSVNGYSIPGSLGNKPPVKPERESPDASAPVPAGSPIEFIHTITSTWTT 541
           AP+ G+F++L +S           N P   P   +P+A +P   GSP+EF+HT+T+TW  
Sbjct: 498 APMSGVFARLAAS--------PADNTPEYTP---APNAPSPSNGGSPLEFVHTVTNTWKA 557

Query: 542 NKESYYRHQVKIKNTSGKPINNLKLQLENLSGPIWGLSPTKQKGIYELPAWLTVLQPGSE 601
           N   YYRH V  KNT G  I  LKLQ++ LSG I+G+S T  K +YE P+W+T L  G++
Sbjct: 558 NGVDYYRHVVTAKNTCGHAITYLKLQIKELSGEIYGVSRTNAKDMYEFPSWMTRLDAGAQ 617

Query: 602 CIFIYIQEGPQARVTVSSY 618
              +YIQ GP A++ V  Y
Sbjct: 618 LTIVYIQGGPAAKIAVVEY 622

BLAST of HG10023079 vs. ExPASy Swiss-Prot
Match: Q0J930 (Endoglucanase 13 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU6 PE=2 SV=1)

HSP 1 Score: 745.0 bits (1922), Expect = 7.0e-214
Identity = 351/619 (56.70%), Postives = 452/619 (73.02%), Query Frame = 0

Query: 2   ARGFLLALVAAVLYFEAAAAGDFNYGDALDLSFLYLEAQRSGKLPADRRVKWRGDSGLKD 61
           A   L+ L+AA    EA+A   F+Y  A D   L+ EAQRSGKLP DR V+WRGDS L D
Sbjct: 18  AAASLVLLLAAAASVEASA---FDYAGAFDKCLLFFEAQRSGKLPDDRLVRWRGDSALTD 77

Query: 62  GLAQGVNLVGGYYDAGDNVKFGLPMSFVTTILSWGAIDFKKEITDINQMDNALKAIKWAT 121
           G +QGV+LVGGYYD+GD+VKFGLPM++  T+LSWG ++F+KE+ D N++   L AI+W T
Sbjct: 78  GFSQGVDLVGGYYDSGDHVKFGLPMAYAVTMLSWGVVEFEKEMVDGNKLHRVLDAIRWGT 137

Query: 122 DYLIKAHTQPNVLWAQVGDGASDHFCWERPEDMTTPRTAFKIDESHPGSDLAGETAAALA 181
           +Y +KAHTQ N LW QVGDG SDH CWER EDM+TPRTAFKID ++PGS++AGETAAALA
Sbjct: 138 NYFVKAHTQHNALWVQVGDGDSDHLCWERAEDMSTPRTAFKIDINNPGSEVAGETAAALA 197

Query: 182 AASITFKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCASEFYTS-TGYWDELLWAA 241
           AA+  FK Y+  YSDLLL+H+K+LFTFADTFRG YDDS+  A +FY S +GY DELLWAA
Sbjct: 198 AAAKAFKPYDRMYSDLLLLHSKQLFTFADTFRGKYDDSLQSAKKFYPSASGYQDELLWAA 257

Query: 242 AWLFRATGDESYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQILLTNVLLE--GRGGGY 301
           AWL+ ATGDE YL+Y    A +FGGTGWA+ EFSWDNKYAG+Q+LL+ VL E  G   GY
Sbjct: 258 AWLYEATGDEQYLRYVSQNAEAFGGTGWAVTEFSWDNKYAGLQVLLSKVLFEQGGSAAGY 317

Query: 302 ESTLKQYQAKANYFACACLQKNDGFNINKTPGGLMYAHEWNNMQYASAAAFLMAVYSDYL 361
             TLKQYQAKA +F CACLQKN+G N+  TPGGLMY  +W+NMQY S++AFL+ VY+DYL
Sbjct: 318 ADTLKQYQAKAEFFLCACLQKNNGHNVKMTPGGLMYVSDWSNMQYVSSSAFLLTVYADYL 377

Query: 362 SNANAKLTCPDGVFEPKDLLNFAQSQADYILGKNPNSLSYLIGYGSKFPQKLHHRGSSIA 421
           + +   L CPDG  +P ++L FA+SQ DY+LGKNP  +SY++GYGS +P  +HHRG+SI 
Sbjct: 378 AESRGTLRCPDGEVKPAEILRFARSQVDYVLGKNPKGMSYMVGYGSYYPTHVHHRGASIP 437

Query: 422 SIFTDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDKDDRFDDDRSDYEQSEPTLTAC 481
           SI+   + +GC++ FD +Y+    +PNVLHGALVGGPD +D +DDDR +Y+ +EPTL   
Sbjct: 438 SIYAMNATVGCMESFDKYYNSKNADPNVLHGALVGGPDANDAYDDDRCNYQHAEPTLAGN 497

Query: 482 APLVGLFSKLQSSVNGYSIPGSLGNKPPVKPERESPDASAPVPAGSPIEFIHTITSTWTT 541
           AP+ G+F++L +S           N P   P   +P+A +P   GSP+EF+HT+T+TW  
Sbjct: 498 APMSGVFARLAAS--------PADNTPEYTP---APNAPSPSNGGSPLEFVHTVTNTWKA 557

Query: 542 NKESYYRHQVKIKNTSGKPINNLKLQLENLSGPIWGLSPTKQKGIYELPAWLTVLQPGSE 601
           N   YYRH V  KNT G  I  LKLQ++ LSG I+G+S T  K +YE P+W+T L  G++
Sbjct: 558 NGVDYYRHVVTAKNTCGHAITYLKLQIKELSGEIYGVSRTNAKDMYEFPSWMTRLDAGAQ 617

Query: 602 CIFIYIQEGPQARVTVSSY 618
              +YIQ GP A++ V  Y
Sbjct: 618 LTIVYIQGGPAAKIAVVEY 622

BLAST of HG10023079 vs. ExPASy Swiss-Prot
Match: Q42059 (Endoglucanase 6 OS=Arabidopsis thaliana OX=3702 GN=At1g64390 PE=2 SV=2)

HSP 1 Score: 624.0 bits (1608), Expect = 1.8e-177
Identity = 304/617 (49.27%), Postives = 414/617 (67.10%), Query Frame = 0

Query: 8   ALVAAVLYFEAAAAGDFNYGDALDLSFLYLEAQRSGKLPADRRVKWRGDSGLKDGLAQGV 67
           AL+  +L F  A +G  +YG AL  S L+ EAQRSG LP ++RV WR  SGL DG + GV
Sbjct: 9   ALLLLLLCFPVAFSG-HDYGQALSKSLLFFEAQRSGVLPRNQRVTWRSHSGLTDGKSSGV 68

Query: 68  NLVGGYYDAGDNVKFGLPMSFVTTILSWGAIDFKKEITDINQMDNALKAIKWATDYLIKA 127
           NLVGGYYDAGDNVKFGLPM+F  T+++W  I++  ++    ++ N++ AIKW TDY IKA
Sbjct: 69  NLVGGYYDAGDNVKFGLPMAFTVTMMAWSVIEYGNQLQANGELGNSIDAIKWGTDYFIKA 128

Query: 128 HTQPNVLWAQVGDGASDHFCWERPEDMTTPRTAFKIDESHPGSDLAGETAAALAAASITF 187
           H +PNVL+ +VGDG +DH+CW+RPE+MTT R A++ID S+PGSDLAGETAAA+AAASI F
Sbjct: 129 HPEPNVLYGEVGDGNTDHYCWQRPEEMTTDRKAYRIDPSNPGSDLAGETAAAMAAASIVF 188

Query: 188 KSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCASEFYTS-TGYWDELLWAAAWLFRA 247
           +  N  YS LLL HA +LF FAD +RG YD SI  A ++Y S +GY DELLWAAAWL++A
Sbjct: 189 RRSNPVYSRLLLTHAYQLFDFADKYRGKYDSSITVAQKYYRSVSGYNDELLWAAAWLYQA 248

Query: 248 TGDESYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQILLTNVLLEGRGGGYESTLKQYQ 307
           + ++ YL Y      + GGTGW+M EF WD KYAGVQ L+   L++G+ G +    ++YQ
Sbjct: 249 SNNQFYLDYLGRNGDAMGGTGWSMTEFGWDVKYAGVQTLVAKFLMQGKAGRHAPVFRKYQ 308

Query: 308 AKANYFACACLQKNDGFNINKTPGGLMYAHEWNNMQYASAAAFLMAVYSDYLSNANAKLT 367
            KA+ F C+ L K+   NI KTPGGL++   WNNMQ+ ++A+FL  VYSDYL+++ + L 
Sbjct: 309 EKADSFMCSLLGKSSR-NIQKTPGGLIFRQRWNNMQFVTSASFLTTVYSDYLTSSRSNLR 368

Query: 368 CPDGVFEPKDLLNFAQSQADYILGKNPNSLSYLIGYGSKFPQKLHHRGSSIASIFTDPSP 427
           C  G   P  LL+FA+SQ DYILG NP + SY++GYG+ FPQ++HHRGSSI S+  D + 
Sbjct: 369 CAAGNVAPSQLLSFAKSQVDYILGDNPRATSYMVGYGNNFPQRVHHRGSSIVSVKVDRTF 428

Query: 428 IGCVQGFDFWYHRPQGNPNVLHGALVGGPDKDDRFDDDRSDYEQSEPTLTACAPLVGLFS 487
           + C  G+  W+ R   +PN+L GA+VGGPD  D F D R +YEQ+EP     APL+G+ +
Sbjct: 429 VTCRGGYATWFSRKGSDPNLLTGAIVGGPDAYDNFADRRDNYEQTEPATYNNAPLLGVLA 488

Query: 488 KLQSSVNGYS-----IPGSLGNKP-PVKPERESPDASAPVPAGSPIEFIHTITSTWTTNK 547
           +L S  +GYS     +P  +  +P P++     P  + PV A  P+  +  ITS+W +  
Sbjct: 489 RLSSGHSGYSQFLPVVPAPVVRRPMPIR----RPKVTTPVRASGPVAIVQKITSSWVSKG 548

Query: 548 ESYYRHQVKIKNTSGKPINNLKLQLENLSGPIWGLSPTKQKGIYELPAWLTVLQPGSECI 607
            +YYR+   + N S +P+ +L L ++NL GPIWGLS  +    + LP+W+  L  G    
Sbjct: 549 RTYYRYSTTVINKSSRPLKSLNLSIKNLYGPIWGLS--RSGNSFGLPSWMHSLPSGKSLE 608

Query: 608 FIYIQEGPQARVTVSSY 618
           F+YI     A V VSSY
Sbjct: 609 FVYIHSTTPANVAVSSY 617

BLAST of HG10023079 vs. ExPASy Swiss-Prot
Match: Q5NAT0 (Endoglucanase 2 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU5 PE=1 SV=1)

HSP 1 Score: 602.1 bits (1551), Expect = 7.3e-171
Identity = 297/624 (47.60%), Postives = 409/624 (65.54%), Query Frame = 0

Query: 7   LALVAAVLYFEAAA------AGDFNYGDALDLSFLYLEAQRSGKLPADRRVKWRGDSGLK 66
           L +   VL F A A       G  +YG AL  S LY EAQRSG LP  +R+ WR +SGL 
Sbjct: 16  LGIALVVLVFAAMAQVARGGGGGHDYGMALSKSILYFEAQRSGVLPGSQRIAWRANSGLA 75

Query: 67  DGLAQGVNLVGGYYDAGDNVKFGLPMSFVTTILSWGAIDFKKEITDINQMDNALKAIKWA 126
           DG A GV+LVGGYYDAGDNVKFGLPM+F  T+++W  I++ +E+    ++ +A++AIKW 
Sbjct: 76  DGKANGVDLVGGYYDAGDNVKFGLPMAFTVTMMAWSVIEYGEEMAAAGELGHAVEAIKWG 135

Query: 127 TDYLIKAHTQPNVLWAQVGDGASDHFCWERPEDMTTPRTAFKIDESHPGSDLAGETAAAL 186
           TDY  KAH +PNVL+A+VGDG SDH CW+RPEDMTT R A+++D  +PGSDLAGETAAA+
Sbjct: 136 TDYFAKAHPEPNVLYAEVGDGDSDHNCWQRPEDMTTSRQAYRLDPQNPGSDLAGETAAAM 195

Query: 187 AAASITFKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCASEFYTS-TGYWDELLWA 246
           AAAS+ F+S N  Y+D LL H+K+LF FAD +RG YD+SI  A  +Y S +GY DELLWA
Sbjct: 196 AAASLVFRSSNPGYADQLLQHSKQLFDFADKYRGRYDNSITVARNYYGSFSGYGDELLWA 255

Query: 247 AAWLFRATGDESYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQILLTNVLLEGRGGGYE 306
           +AWL++A+ D  YL Y  + A + GGTGW++ +F WD KY GVQIL    LL+G+ G + 
Sbjct: 256 SAWLYQASDDRRYLDYLANNADALGGTGWSINQFGWDVKYPGVQILAAKFLLQGKAGEHA 315

Query: 307 STLKQYQAKANYFACACLQKNDGFNINKTPGGLMYAHEWNNMQYASAAAFLMAVYSDYLS 366
             L+ Y+ KA++FAC+CL K+   N+ +TPGG++Y   WNN+Q+ ++A+FL+AVYSD+L+
Sbjct: 316 GVLQGYRRKADFFACSCLGKDAADNVGRTPGGMLYHQRWNNIQFVTSASFLLAVYSDHLA 375

Query: 367 NANAKLTCPDG-VFEPKDLLNFAQSQADYILGKNPNSLSYLIGYGSKFPQKLHHRGSSIA 426
               + +   G V    +LL FA+SQ DYILG NP   SY++GYG+ +P++ HHRGSSIA
Sbjct: 376 GGAVRCSGGGGAVAGAAELLAFAKSQVDYILGSNPRGTSYMVGYGAVYPRQAHHRGSSIA 435

Query: 427 SIFTDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDKDDRFDDDRSDYEQSEPTLTAC 486
           SI   PS + C +G+  WY R  GNPN+L GA+VGGPD+ D F D+R++YEQ+E      
Sbjct: 436 SIRASPSFVSCREGYASWYGRRGGNPNLLDGAVVGGPDEHDDFADERNNYEQTEAATYNN 495

Query: 487 APLVGLFSKLQSSVNGYSIPGSLGN--KPPVKPERESPDASAPVPAGSPIEFIHTITSTW 546
           APL+G+ ++L +  +G    G LG   +  +     S    A     SP+E     T++W
Sbjct: 496 APLMGILARLAAG-HGARARGRLGQSLQHGIAANHTSLPHGANHQHASPVEIEQKATASW 555

Query: 547 TTNKESYYRHQVKIKNTS---GKPINNLKLQLENLSGPIWGLSPTKQKGIYELPAWLTVL 606
             +  +Y+R+ V + N S   GK +  L + +  L GP+WGL    + G Y LP+W   L
Sbjct: 556 EKDGRTYHRYAVTVSNRSPAGGKTVEELHIGIGKLYGPVWGLEKAARYG-YVLPSWTPSL 615

Query: 607 QPGSECIFIYIQEGPQARVTVSSY 618
             G    F+Y+   P A V V+ Y
Sbjct: 616 PAGESAAFVYVHAAPPADVWVTGY 637

BLAST of HG10023079 vs. ExPASy TrEMBL
Match: A0A6J1JUZ6 (Endoglucanase OS=Cucurbita maxima OX=3661 GN=LOC111488085 PE=3 SV=1)

HSP 1 Score: 1173.7 bits (3035), Expect = 0.0e+00
Identity = 556/618 (89.97%), Postives = 593/618 (95.95%), Query Frame = 0

Query: 1   MARGFLLALVAAVLYFEAAAAGDFNYGDALDLSFLYLEAQRSGKLPADRRVKWRGDSGLK 60
           M RG+LLALVAAVL FE AAAGDFNYGDA+DLSFLYLEAQRSGKLPADRRVKWRGDSGLK
Sbjct: 2   MPRGYLLALVAAVLCFE-AAAGDFNYGDAVDLSFLYLEAQRSGKLPADRRVKWRGDSGLK 61

Query: 61  DGLAQGVNLVGGYYDAGDNVKFGLPMSFVTTILSWGAIDFKKEITDINQMDNALKAIKWA 120
           DGLAQGVNLVGGYYDAGDNVKFGLPM+FVTTILSWGAIDFKKEITD+N MDNALKAIKW 
Sbjct: 62  DGLAQGVNLVGGYYDAGDNVKFGLPMAFVTTILSWGAIDFKKEITDLNHMDNALKAIKWG 121

Query: 121 TDYLIKAHTQPNVLWAQVGDGASDHFCWERPEDMTTPRTAFKIDESHPGSDLAGETAAAL 180
           TDYLIKAH + NVLWAQVGDGASDHFCW+RPEDM+TPRTA+K+DESHPGSDLAGETAAAL
Sbjct: 122 TDYLIKAHPERNVLWAQVGDGASDHFCWQRPEDMSTPRTAYKLDESHPGSDLAGETAAAL 181

Query: 181 AAASITFKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCASEFYTSTGYWDELLWAA 240
           AAASI FKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCA++FY+S+GYWDELLWAA
Sbjct: 182 AAASIAFKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCAADFYSSSGYWDELLWAA 241

Query: 241 AWLFRATGDESYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQILLTNVLLEGRGGGYES 300
           AWLFRATGDE YLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQ+LLT VLLEGR   YES
Sbjct: 242 AWLFRATGDEYYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQVLLTKVLLEGRAAAYES 301

Query: 301 TLKQYQAKANYFACACLQKNDGFNINKTPGGLMYAHEWNNMQYASAAAFLMAVYSDYLSN 360
           TLKQYQAKA+YFACACLQKNDGFNINKTPGGLMY  EWNNMQYASAAAFLMAVYS YLS+
Sbjct: 302 TLKQYQAKADYFACACLQKNDGFNINKTPGGLMYVREWNNMQYASAAAFLMAVYSVYLSD 361

Query: 361 ANAKLTCPDGVFEPKDLLNFAQSQADYILGKNPNSLSYLIGYGSKFPQKLHHRGSSIASI 420
           ANAKLTCPDGVFEPK+LLNFAQSQADYILGKNPNS+SYLIGYGSKFPQKLHHRGSSI SI
Sbjct: 362 ANAKLTCPDGVFEPKELLNFAQSQADYILGKNPNSISYLIGYGSKFPQKLHHRGSSIDSI 421

Query: 421 FTDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDKDDRFDDDRSDYEQSEPTLTACAP 480
           FTDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDK+DRFDDDRS+YEQ+EPTLTACAP
Sbjct: 422 FTDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDKNDRFDDDRSEYEQTEPTLTACAP 481

Query: 481 LVGLFSKLQSSVNGYSIPGSLGNKPPVKPERESPDASAPVPAGSPIEFIHTITSTWTTNK 540
           L+GLFSKLQSSVNGY IPGS GN PPVKPE+ESP+A+APVP G+P+EFIHTITSTW  +K
Sbjct: 482 LLGLFSKLQSSVNGYRIPGSRGNPPPVKPEKESPNANAPVPVGAPVEFIHTITSTWMVSK 541

Query: 541 ESYYRHQVKIKNTSGKPINNLKLQLENLSGPIWGLSPTKQKGIYELPAWLTVLQPGSECI 600
           +SYYRHQVKIKNTSGKPI++LKL+LENLSGP+WGLSPT+QKGIYELP WL +LQPGSEC+
Sbjct: 542 DSYYRHQVKIKNTSGKPISDLKLRLENLSGPVWGLSPTQQKGIYELPPWLRLLQPGSECV 601

Query: 601 FIYIQEGPQARVTVSSYH 619
           FIYIQEGPQA+VTVSSYH
Sbjct: 602 FIYIQEGPQAKVTVSSYH 618

BLAST of HG10023079 vs. ExPASy TrEMBL
Match: A0A6J1GP97 (Endoglucanase OS=Cucurbita moschata OX=3662 GN=LOC111455891 PE=3 SV=1)

HSP 1 Score: 1171.0 bits (3028), Expect = 0.0e+00
Identity = 556/618 (89.97%), Postives = 592/618 (95.79%), Query Frame = 0

Query: 1   MARGFLLALVAAVLYFEAAAAGDFNYGDALDLSFLYLEAQRSGKLPADRRVKWRGDSGLK 60
           M RG+LLALVAAVL FE AAAGDFNYGDA+DLSFLYLEAQRSGKLPADRRVKWRGDSGLK
Sbjct: 2   MPRGYLLALVAAVLCFE-AAAGDFNYGDAVDLSFLYLEAQRSGKLPADRRVKWRGDSGLK 61

Query: 61  DGLAQGVNLVGGYYDAGDNVKFGLPMSFVTTILSWGAIDFKKEITDINQMDNALKAIKWA 120
           DGLAQGVNLVGGYYDAGDNVKFGLPM+FVTTILSWGAIDFKKEITD+N MD ALKAIKW 
Sbjct: 62  DGLAQGVNLVGGYYDAGDNVKFGLPMAFVTTILSWGAIDFKKEITDLNHMDKALKAIKWG 121

Query: 121 TDYLIKAHTQPNVLWAQVGDGASDHFCWERPEDMTTPRTAFKIDESHPGSDLAGETAAAL 180
           TDYLIKAH + NVLWAQVGDGASDHFCW+RPEDM+TPRTA+K+DESHPGSDLAGETAAAL
Sbjct: 122 TDYLIKAHPERNVLWAQVGDGASDHFCWQRPEDMSTPRTAYKLDESHPGSDLAGETAAAL 181

Query: 181 AAASITFKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCASEFYTSTGYWDELLWAA 240
           AAASI FKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCA++FY+S+GYWDELLWAA
Sbjct: 182 AAASIAFKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCAADFYSSSGYWDELLWAA 241

Query: 241 AWLFRATGDESYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQILLTNVLLEGRGGGYES 300
           AWLFRATGDE YLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQ+LLT VLLEGR   Y S
Sbjct: 242 AWLFRATGDEYYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQVLLTKVLLEGRAAAYGS 301

Query: 301 TLKQYQAKANYFACACLQKNDGFNINKTPGGLMYAHEWNNMQYASAAAFLMAVYSDYLSN 360
           TLKQYQAKA+YFACACLQKNDGFNINKTPGGLMY  EWNNMQYASAAAFLMAVYS YLS+
Sbjct: 302 TLKQYQAKADYFACACLQKNDGFNINKTPGGLMYVREWNNMQYASAAAFLMAVYSVYLSD 361

Query: 361 ANAKLTCPDGVFEPKDLLNFAQSQADYILGKNPNSLSYLIGYGSKFPQKLHHRGSSIASI 420
           ANAKLTCPDGVFEPK+LLNFAQSQADYILGKNPNS+SYLIGYGSKFPQKLHHRGSSIASI
Sbjct: 362 ANAKLTCPDGVFEPKELLNFAQSQADYILGKNPNSISYLIGYGSKFPQKLHHRGSSIASI 421

Query: 421 FTDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDKDDRFDDDRSDYEQSEPTLTACAP 480
           FTDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDK+DRFDDDRS+YEQ+EPTLTACAP
Sbjct: 422 FTDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDKNDRFDDDRSEYEQTEPTLTACAP 481

Query: 481 LVGLFSKLQSSVNGYSIPGSLGNKPPVKPERESPDASAPVPAGSPIEFIHTITSTWTTNK 540
           L+GLFSKLQSSVNGY IPGS GN PPVKPE+ESP+A+APVP G+P+EFIHTITSTW  +K
Sbjct: 482 LLGLFSKLQSSVNGYRIPGSRGNPPPVKPEKESPNANAPVPVGAPVEFIHTITSTWMVSK 541

Query: 541 ESYYRHQVKIKNTSGKPINNLKLQLENLSGPIWGLSPTKQKGIYELPAWLTVLQPGSECI 600
           ESYYRHQVKIKNTSGKPI++LKL+LENLSGPIWGLSPT+Q+GIYELP WL +LQPGSEC+
Sbjct: 542 ESYYRHQVKIKNTSGKPISDLKLRLENLSGPIWGLSPTQQEGIYELPPWLRLLQPGSECV 601

Query: 601 FIYIQEGPQARVTVSSYH 619
           FIYIQEGPQA+VTVSSYH
Sbjct: 602 FIYIQEGPQAKVTVSSYH 618

BLAST of HG10023079 vs. ExPASy TrEMBL
Match: A0A5A7URX3 (Endoglucanase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold459G00120 PE=3 SV=1)

HSP 1 Score: 1165.6 bits (3014), Expect = 0.0e+00
Identity = 555/619 (89.66%), Postives = 588/619 (94.99%), Query Frame = 0

Query: 1   MARGFLLALVAAVLYFE-AAAAGDFNYGDALDLSFLYLEAQRSGKLPADRRVKWRGDSGL 60
           MA+GFL+ALVAAVL FE AAAAG+FNYGDALDLSFLYLEAQRSGKLPADRRVKWRGDSGL
Sbjct: 1   MAQGFLVALVAAVLCFEAAAAAGEFNYGDALDLSFLYLEAQRSGKLPADRRVKWRGDSGL 60

Query: 61  KDGLAQGVNLVGGYYDAGDNVKFGLPMSFVTTILSWGAIDFKKEITDINQMDNALKAIKW 120
           KDGLAQGVNLVGGYYDAGDNVKFGLPM+FVTTILSWGAIDF KEIT+ NQMDN LKAIKW
Sbjct: 61  KDGLAQGVNLVGGYYDAGDNVKFGLPMAFVTTILSWGAIDFNKEITNANQMDNTLKAIKW 120

Query: 121 ATDYLIKAHTQPNVLWAQVGDGASDHFCWERPEDMTTPRTAFKIDESHPGSDLAGETAAA 180
           ATDY +KAHT  NVLW QVGDG+SDHFCW+RPEDMTTPRTAFKIDESHPGSDLAGETAAA
Sbjct: 121 ATDYFLKAHTSRNVLWGQVGDGSSDHFCWQRPEDMTTPRTAFKIDESHPGSDLAGETAAA 180

Query: 181 LAAASITFKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCASEFYTSTGYWDELLWA 240
           LAAASI FK+YNS+YS+LLLVHAKELFTFADTFRGLYDDSIPCAS FYTS+GYWDELLWA
Sbjct: 181 LAAASIAFKTYNSAYSNLLLVHAKELFTFADTFRGLYDDSIPCASGFYTSSGYWDELLWA 240

Query: 241 AAWLFRATGDESYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQILLTNVLLEGRGGGYE 300
           AAWLFRATGDE YLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQ+LLT VLLEGR GGYE
Sbjct: 241 AAWLFRATGDEYYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQVLLTKVLLEGRSGGYE 300

Query: 301 STLKQYQAKANYFACACLQKNDGFNINKTPGGLMYAHEWNNMQYASAAAFLMAVYSDYLS 360
           STLKQYQAKA+YFACACLQKNDG+NINKTPGGL+YAHEWNNMQYASAAAFLMAVYSDYLS
Sbjct: 301 STLKQYQAKADYFACACLQKNDGYNINKTPGGLLYAHEWNNMQYASAAAFLMAVYSDYLS 360

Query: 361 NANAKLTCPDGVFEPKDLLNFAQSQADYILGKNPNSLSYLIGYGSKFPQKLHHRGSSIAS 420
            ANAKL CPDGVFEPK+LLNFAQSQADYILGKNPNSLSYLIGYG KFPQKLHHRGSSIAS
Sbjct: 361 AANAKLICPDGVFEPKELLNFAQSQADYILGKNPNSLSYLIGYGPKFPQKLHHRGSSIAS 420

Query: 421 IFTDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDKDDRFDDDRSDYEQSEPTLTACA 480
           IFTDP+PIGCVQGFD+WYHRPQGNPNVLHGALVGGPDK+DRF DDRS+YEQ+EPTLTA A
Sbjct: 421 IFTDPAPIGCVQGFDYWYHRPQGNPNVLHGALVGGPDKNDRFGDDRSEYEQTEPTLTASA 480

Query: 481 PLVGLFSKLQSSVNGYSIPGSLGNKPPVKPERESPDASAPVPAGSPIEFIHTITSTWTTN 540
           PL+GLFSKL SSVNG+ IPGS G++PPVK E ESPDA+ PV AGSP+EFIHTITSTWT N
Sbjct: 481 PLIGLFSKLHSSVNGHQIPGSRGHQPPVKRENESPDANVPVAAGSPVEFIHTITSTWTVN 540

Query: 541 KESYYRHQVKIKNTSGKPINNLKLQLENLSGPIWGLSPTKQKGIYELPAWLTVLQPGSEC 600
           KESYYRHQVKIKNTSGK I NLKL LENL+GPIWGLSPT+QKGIYELP WLTVLQPGSEC
Sbjct: 541 KESYYRHQVKIKNTSGKSIKNLKLHLENLTGPIWGLSPTQQKGIYELPEWLTVLQPGSEC 600

Query: 601 IFIYIQEGPQARVTVSSYH 619
           +FIYIQEGPQA+VT+SSYH
Sbjct: 601 VFIYIQEGPQAKVTISSYH 619

BLAST of HG10023079 vs. ExPASy TrEMBL
Match: A0A1S3CEZ8 (Endoglucanase OS=Cucumis melo OX=3656 GN=LOC103499972 PE=3 SV=1)

HSP 1 Score: 1165.6 bits (3014), Expect = 0.0e+00
Identity = 555/619 (89.66%), Postives = 588/619 (94.99%), Query Frame = 0

Query: 1   MARGFLLALVAAVLYFE-AAAAGDFNYGDALDLSFLYLEAQRSGKLPADRRVKWRGDSGL 60
           MA+GFL+ALVAAVL FE AAAAG+FNYGDALDLSFLYLEAQRSGKLPADRRVKWRGDSGL
Sbjct: 1   MAQGFLVALVAAVLCFEAAAAAGEFNYGDALDLSFLYLEAQRSGKLPADRRVKWRGDSGL 60

Query: 61  KDGLAQGVNLVGGYYDAGDNVKFGLPMSFVTTILSWGAIDFKKEITDINQMDNALKAIKW 120
           KDGLAQGVNLVGGYYDAGDNVKFGLPM+FVTTILSWGAIDF KEIT+ NQMDN LKAIKW
Sbjct: 61  KDGLAQGVNLVGGYYDAGDNVKFGLPMAFVTTILSWGAIDFNKEITNANQMDNTLKAIKW 120

Query: 121 ATDYLIKAHTQPNVLWAQVGDGASDHFCWERPEDMTTPRTAFKIDESHPGSDLAGETAAA 180
           ATDY +KAHT  NVLW QVGDG+SDHFCW+RPEDMTTPRTAFKIDESHPGSDLAGETAAA
Sbjct: 121 ATDYFLKAHTSRNVLWGQVGDGSSDHFCWQRPEDMTTPRTAFKIDESHPGSDLAGETAAA 180

Query: 181 LAAASITFKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCASEFYTSTGYWDELLWA 240
           LAAASI FK+YNS+YS+LLLVHAKELFTFADTFRGLYDDSIPCAS FYTS+GYWDELLWA
Sbjct: 181 LAAASIAFKTYNSAYSNLLLVHAKELFTFADTFRGLYDDSIPCASGFYTSSGYWDELLWA 240

Query: 241 AAWLFRATGDESYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQILLTNVLLEGRGGGYE 300
           AAWLFRATGDE YLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQ+LLT VLLEGR GGYE
Sbjct: 241 AAWLFRATGDEYYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQVLLTKVLLEGRSGGYE 300

Query: 301 STLKQYQAKANYFACACLQKNDGFNINKTPGGLMYAHEWNNMQYASAAAFLMAVYSDYLS 360
           STLKQYQAKA+YFACACLQKNDG+NINKTPGGL+YAHEWNNMQYASAAAFLMAVYSDYLS
Sbjct: 301 STLKQYQAKADYFACACLQKNDGYNINKTPGGLLYAHEWNNMQYASAAAFLMAVYSDYLS 360

Query: 361 NANAKLTCPDGVFEPKDLLNFAQSQADYILGKNPNSLSYLIGYGSKFPQKLHHRGSSIAS 420
            ANAKL CPDGVFEPK+LLNFAQSQADYILGKNPNSLSYLIGYG KFPQKLHHRGSSIAS
Sbjct: 361 AANAKLICPDGVFEPKELLNFAQSQADYILGKNPNSLSYLIGYGPKFPQKLHHRGSSIAS 420

Query: 421 IFTDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDKDDRFDDDRSDYEQSEPTLTACA 480
           IFTDP+PIGCVQGFD+WYHRPQGNPNVLHGALVGGPDK+DRF DDRS+YEQ+EPTLTA A
Sbjct: 421 IFTDPAPIGCVQGFDYWYHRPQGNPNVLHGALVGGPDKNDRFGDDRSEYEQTEPTLTASA 480

Query: 481 PLVGLFSKLQSSVNGYSIPGSLGNKPPVKPERESPDASAPVPAGSPIEFIHTITSTWTTN 540
           PL+GLFSKL SSVNG+ IPGS G++PPVK E ESPDA+ PV AGSP+EFIHTITSTWT N
Sbjct: 481 PLIGLFSKLHSSVNGHQIPGSRGHQPPVKRENESPDANVPVAAGSPVEFIHTITSTWTVN 540

Query: 541 KESYYRHQVKIKNTSGKPINNLKLQLENLSGPIWGLSPTKQKGIYELPAWLTVLQPGSEC 600
           KESYYRHQVKIKNTSGK I NLKL LENL+GPIWGLSPT+QKGIYELP WLTVLQPGSEC
Sbjct: 541 KESYYRHQVKIKNTSGKSIKNLKLHLENLTGPIWGLSPTQQKGIYELPEWLTVLQPGSEC 600

Query: 601 IFIYIQEGPQARVTVSSYH 619
           +FIYIQEGPQA+VT+SSYH
Sbjct: 601 VFIYIQEGPQAKVTISSYH 619

BLAST of HG10023079 vs. ExPASy TrEMBL
Match: A0A0A0K9M6 (Endoglucanase OS=Cucumis sativus OX=3659 GN=Csa_7G420700 PE=3 SV=1)

HSP 1 Score: 1153.3 bits (2982), Expect = 0.0e+00
Identity = 548/619 (88.53%), Postives = 578/619 (93.38%), Query Frame = 0

Query: 1   MARGFLLALVAAVLYFEAA-AAGDFNYGDALDLSFLYLEAQRSGKLPADRRVKWRGDSGL 60
           MA+GFLLALV AVL FEAA A G+FNYGDALDLSFLYLEAQRSGKLP DRRVKWRGDSGL
Sbjct: 1   MAQGFLLALVVAVLCFEAATAVGEFNYGDALDLSFLYLEAQRSGKLPVDRRVKWRGDSGL 60

Query: 61  KDGLAQGVNLVGGYYDAGDNVKFGLPMSFVTTILSWGAIDFKKEITDINQMDNALKAIKW 120
           KDG AQGVNLVGGYYDAGDNVKFGLPM+FV TILSWGAIDF KEIT+ NQMDN LKAIKW
Sbjct: 61  KDGFAQGVNLVGGYYDAGDNVKFGLPMAFVATILSWGAIDFNKEITNANQMDNTLKAIKW 120

Query: 121 ATDYLIKAHTQPNVLWAQVGDGASDHFCWERPEDMTTPRTAFKIDESHPGSDLAGETAAA 180
           ATDY +KAHTQ NVLW QVGDG+SDHFCWERPEDMTTPRTAFKIDESHPGSDLAGETAAA
Sbjct: 121 ATDYFLKAHTQRNVLWGQVGDGSSDHFCWERPEDMTTPRTAFKIDESHPGSDLAGETAAA 180

Query: 181 LAAASITFKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCASEFYTSTGYWDELLWA 240
           LAAASI FK+YNS+YS+LLL HAKELFTFADTFRGLYDDSIPC S FYTS+GYWDELLWA
Sbjct: 181 LAAASIAFKTYNSAYSNLLLAHAKELFTFADTFRGLYDDSIPCVSGFYTSSGYWDELLWA 240

Query: 241 AAWLFRATGDESYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQILLTNVLLEGRGGGYE 300
           A WLFRATGDE YLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQ+LLT VLLEGRGGGYE
Sbjct: 241 ATWLFRATGDEYYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQVLLTKVLLEGRGGGYE 300

Query: 301 STLKQYQAKANYFACACLQKNDGFNINKTPGGLMYAHEWNNMQYASAAAFLMAVYSDYLS 360
           STLKQYQAKA+YFACACL+KNDGFNINKTPGGL+YAHEWNNMQYAS AAFLMAVYSDYLS
Sbjct: 301 STLKQYQAKADYFACACLEKNDGFNINKTPGGLLYAHEWNNMQYASTAAFLMAVYSDYLS 360

Query: 361 NANAKLTCPDGVFEPKDLLNFAQSQADYILGKNPNSLSYLIGYGSKFPQKLHHRGSSIAS 420
            ANAKL CPDGVFEPK+LLNFAQSQADYILGKNPNSLSYLIGYG KFPQKLHHRGSSIAS
Sbjct: 361 TANAKLICPDGVFEPKELLNFAQSQADYILGKNPNSLSYLIGYGPKFPQKLHHRGSSIAS 420

Query: 421 IFTDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDKDDRFDDDRSDYEQSEPTLTACA 480
           IFTDP P+GCVQGFD WYHRPQGNPN+LHGALVGGPDK+DRF D+RSDYEQ+EPTLTA A
Sbjct: 421 IFTDPVPVGCVQGFDTWYHRPQGNPNILHGALVGGPDKNDRFGDERSDYEQTEPTLTASA 480

Query: 481 PLVGLFSKLQSSVNGYSIPGSLGNKPPVKPERESPDASAPVPAGSPIEFIHTITSTWTTN 540
           PL+GLFSKL SSVNG+ IPGS G +PPVK E ESPDA+ PV AGSP+EFIHTITSTWT N
Sbjct: 481 PLIGLFSKLHSSVNGHQIPGSRGYQPPVKREEESPDANVPVSAGSPVEFIHTITSTWTVN 540

Query: 541 KESYYRHQVKIKNTSGKPINNLKLQLENLSGPIWGLSPTKQKGIYELPAWLTVLQPGSEC 600
           KESYYRHQVKIKNTSGK I NLKLQL+NL+GPIWGLSPT+QKG+YELP WLTVLQPGSEC
Sbjct: 541 KESYYRHQVKIKNTSGKSIKNLKLQLDNLTGPIWGLSPTQQKGVYELPTWLTVLQPGSEC 600

Query: 601 IFIYIQEGPQARVTVSSYH 619
            FIYIQEGPQA+VTVSSYH
Sbjct: 601 AFIYIQEGPQAKVTVSSYH 619

BLAST of HG10023079 vs. TAIR 10
Match: AT1G48930.1 (glycosyl hydrolase 9C1 )

HSP 1 Score: 840.5 bits (2170), Expect = 8.7e-244
Identity = 402/619 (64.94%), Postives = 485/619 (78.35%), Query Frame = 0

Query: 5   FLLALVAAVLYFEAAAAGD-FNYGDALDLSFLYLEAQRSGKLPADRRVKWRGDSGLKDGL 64
           F ++L+ +VL   A AA + +NYG ALD +FL+ EAQRSGKLPA +RVKWRG SGLKDGL
Sbjct: 9   FGVSLLLSVLLAAATAAAEYYNYGSALDKTFLFFEAQRSGKLPAAQRVKWRGPSGLKDGL 68

Query: 65  AQGVNLVGGYYDAGDNVKFGLPMSFVTTILSWGAIDFKKEITDINQMDNALKAIKWATDY 124
           AQGV+L GGYYDAGD+VKFGLPM+F  T+LSW A+D +KE++  NQM   L +I+W TDY
Sbjct: 69  AQGVSLEGGYYDAGDHVKFGLPMAFAVTMLSWAAVDNRKELSSSNQMQQTLWSIRWGTDY 128

Query: 125 LIKAHTQPNVLWAQVGDGASDHFCWERPEDMTTPRTAFKIDESHPGSDLAGETAAALAAA 184
            IKAH QPNVLW QVGDG SDH+CWERPEDMTT RTA+K+D  HPGSDLAGETAAALAAA
Sbjct: 129 FIKAHPQPNVLWGQVGDGESDHYCWERPEDMTTSRTAYKLDPYHPGSDLAGETAAALAAA 188

Query: 185 SITFKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCASEFYTSTGYWDELLWAAAWL 244
           S+ FK +NSSYS LLL HAKELF+FAD +RGLY +SIP A  FY S+GY DELLWAAAWL
Sbjct: 189 SLAFKPFNSSYSALLLSHAKELFSFADKYRGLYTNSIPNAKAFYMSSGYSDELLWAAAWL 248

Query: 245 FRATGDESYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQILLTNVLLEGRGGGYESTLK 304
            RATGD+ YLKY +D +   GGTGW +KEFSWDNKYAGVQILL+ +LLEG+GG Y STLK
Sbjct: 249 HRATGDQYYLKYAMDNSGYMGGTGWGVKEFSWDNKYAGVQILLSKILLEGKGGIYTSTLK 308

Query: 305 QYQAKANYFACACLQKNDGFNINKTPGGLMYAHEWNNMQYASAAAFLMAVYSDYLSNANA 364
           QYQ KA+YFACACL+KN G+NI  TPGGLMY  EWNN+QYASAAA+L+AVYSDYLS ANA
Sbjct: 309 QYQTKADYFACACLKKNGGYNIQTTPGGLMYVREWNNLQYASAAAYLLAVYSDYLSAANA 368

Query: 365 KLTCPDGVFEPKDLLNFAQSQADYILGKNPNSLSYLIGYGSKFPQKLHHRGSSIASIFTD 424
           KL CPDG+ +P+ LL+FA+SQADYILGKN   +SY++GYG K+P ++HHRGSSI SIF  
Sbjct: 369 KLNCPDGLVQPQGLLDFARSQADYILGKNRQGMSYVVGYGPKYPIRVHHRGSSIPSIFAQ 428

Query: 425 PSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDKDDRFDDDRSDYEQSEPTLTACAPLVG 484
            S + CVQGFD WY R QG+PNV++GALVGGPD++D + DDRS+YEQSEPTL+  APLVG
Sbjct: 429 RSSVSCVQGFDSWYRRSQGDPNVIYGALVGGPDENDNYSDDRSNYEQSEPTLSGTAPLVG 488

Query: 485 LFSKLQ----SSVNGYSIPGSLGNKPPVKPERESPDASAPVPAGSPIEFIHTITSTWTTN 544
           LF+KL      S  G S       KP     + +P   +P  +G+ IEF+H+ITS W   
Sbjct: 489 LFAKLYGGSLGSYGGGSYKPYETTKPAASSYKATPTTYSPKQSGAQIEFLHSITSNWIAG 548

Query: 545 KESYYRHQVKIKNTSGKPINNLKLQLENLSGPIWGLSPTKQKGIYELPAWLTVLQPGSEC 604
              YYRH+V IKN S KPI++LKL++E+LSGPIWGL+PT QK  Y+LP W   L+ G   
Sbjct: 549 NTRYYRHKVIIKNNSQKPISDLKLKIEDLSGPIWGLNPTGQKYTYQLPQWQKTLRAGQAY 608

Query: 605 IFIYIQEGPQARVTVSSYH 619
            F+Y+Q GPQA+V+V SY+
Sbjct: 609 DFVYVQGGPQAKVSVLSYN 627

BLAST of HG10023079 vs. TAIR 10
Match: AT1G64390.1 (glycosyl hydrolase 9C2 )

HSP 1 Score: 624.0 bits (1608), Expect = 1.3e-178
Identity = 304/617 (49.27%), Postives = 414/617 (67.10%), Query Frame = 0

Query: 8   ALVAAVLYFEAAAAGDFNYGDALDLSFLYLEAQRSGKLPADRRVKWRGDSGLKDGLAQGV 67
           AL+  +L F  A +G  +YG AL  S L+ EAQRSG LP ++RV WR  SGL DG + GV
Sbjct: 9   ALLLLLLCFPVAFSG-HDYGQALSKSLLFFEAQRSGVLPRNQRVTWRSHSGLTDGKSSGV 68

Query: 68  NLVGGYYDAGDNVKFGLPMSFVTTILSWGAIDFKKEITDINQMDNALKAIKWATDYLIKA 127
           NLVGGYYDAGDNVKFGLPM+F  T+++W  I++  ++    ++ N++ AIKW TDY IKA
Sbjct: 69  NLVGGYYDAGDNVKFGLPMAFTVTMMAWSVIEYGNQLQANGELGNSIDAIKWGTDYFIKA 128

Query: 128 HTQPNVLWAQVGDGASDHFCWERPEDMTTPRTAFKIDESHPGSDLAGETAAALAAASITF 187
           H +PNVL+ +VGDG +DH+CW+RPE+MTT R A++ID S+PGSDLAGETAAA+AAASI F
Sbjct: 129 HPEPNVLYGEVGDGNTDHYCWQRPEEMTTDRKAYRIDPSNPGSDLAGETAAAMAAASIVF 188

Query: 188 KSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCASEFYTS-TGYWDELLWAAAWLFRA 247
           +  N  YS LLL HA +LF FAD +RG YD SI  A ++Y S +GY DELLWAAAWL++A
Sbjct: 189 RRSNPVYSRLLLTHAYQLFDFADKYRGKYDSSITVAQKYYRSVSGYNDELLWAAAWLYQA 248

Query: 248 TGDESYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQILLTNVLLEGRGGGYESTLKQYQ 307
           + ++ YL Y      + GGTGW+M EF WD KYAGVQ L+   L++G+ G +    ++YQ
Sbjct: 249 SNNQFYLDYLGRNGDAMGGTGWSMTEFGWDVKYAGVQTLVAKFLMQGKAGRHAPVFRKYQ 308

Query: 308 AKANYFACACLQKNDGFNINKTPGGLMYAHEWNNMQYASAAAFLMAVYSDYLSNANAKLT 367
            KA+ F C+ L K+   NI KTPGGL++   WNNMQ+ ++A+FL  VYSDYL+++ + L 
Sbjct: 309 EKADSFMCSLLGKSSR-NIQKTPGGLIFRQRWNNMQFVTSASFLTTVYSDYLTSSRSNLR 368

Query: 368 CPDGVFEPKDLLNFAQSQADYILGKNPNSLSYLIGYGSKFPQKLHHRGSSIASIFTDPSP 427
           C  G   P  LL+FA+SQ DYILG NP + SY++GYG+ FPQ++HHRGSSI S+  D + 
Sbjct: 369 CAAGNVAPSQLLSFAKSQVDYILGDNPRATSYMVGYGNNFPQRVHHRGSSIVSVKVDRTF 428

Query: 428 IGCVQGFDFWYHRPQGNPNVLHGALVGGPDKDDRFDDDRSDYEQSEPTLTACAPLVGLFS 487
           + C  G+  W+ R   +PN+L GA+VGGPD  D F D R +YEQ+EP     APL+G+ +
Sbjct: 429 VTCRGGYATWFSRKGSDPNLLTGAIVGGPDAYDNFADRRDNYEQTEPATYNNAPLLGVLA 488

Query: 488 KLQSSVNGYS-----IPGSLGNKP-PVKPERESPDASAPVPAGSPIEFIHTITSTWTTNK 547
           +L S  +GYS     +P  +  +P P++     P  + PV A  P+  +  ITS+W +  
Sbjct: 489 RLSSGHSGYSQFLPVVPAPVVRRPMPIR----RPKVTTPVRASGPVAIVQKITSSWVSKG 548

Query: 548 ESYYRHQVKIKNTSGKPINNLKLQLENLSGPIWGLSPTKQKGIYELPAWLTVLQPGSECI 607
            +YYR+   + N S +P+ +L L ++NL GPIWGLS  +    + LP+W+  L  G    
Sbjct: 549 RTYYRYSTTVINKSSRPLKSLNLSIKNLYGPIWGLS--RSGNSFGLPSWMHSLPSGKSLE 608

Query: 608 FIYIQEGPQARVTVSSY 618
           F+YI     A V VSSY
Sbjct: 609 FVYIHSTTPANVAVSSY 617

BLAST of HG10023079 vs. TAIR 10
Match: AT4G11050.1 (glycosyl hydrolase 9C3 )

HSP 1 Score: 597.8 bits (1540), Expect = 9.8e-171
Identity = 302/628 (48.09%), Postives = 399/628 (63.54%), Query Frame = 0

Query: 2   ARGFLLALVAAVLYFEAAAAGDFNYGDALDLSFLYLEAQRSGKLPADRRVKWRGDSGLKD 61
           +R  +  LV  +L     A    +Y  AL  S L+ EAQRSG LP ++RV WR  SGL D
Sbjct: 3   SRTTISILVVLLLGLVQLAISGHDYKQALSKSILFFEAQRSGHLPPNQRVSWRSHSGLYD 62

Query: 62  GLAQGVNLVGGYYDAGDNVKFGLPMSFVTTILSWGAIDFKKEITDINQMDNALKAIKWAT 121
           G + GV+LVGGYYDAGDNVKFGLPM+F  T + W  I++  ++    ++ +A+ A+KW T
Sbjct: 63  GKSSGVDLVGGYYDAGDNVKFGLPMAFTVTTMCWSIIEYGGQLESNGELGHAIDAVKWGT 122

Query: 122 DYLIKAHTQPNVLWAQVGDGASDHFCWERPEDMTTPRTAFKIDESHPGSDLAGETAAALA 181
           DY IKAH +PNVL+ +VGDG SDH+CW+RPE+MTT R A+KID ++PGSDLAGETAAA+A
Sbjct: 123 DYFIKAHPEPNVLYGEVGDGKSDHYCWQRPEEMTTDRRAYKIDRNNPGSDLAGETAAAMA 182

Query: 182 AASITFKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCASEFYTS-TGYWDELLWAA 241
           AASI F+  + SYS  LL HA +LF FAD +RG YD SI  A ++Y S +GY DELLWAA
Sbjct: 183 AASIVFRRSDPSYSAELLRHAHQLFEFADKYRGKYDSSITVAQKYYRSVSGYNDELLWAA 242

Query: 242 AWLFRATGDESYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQILLTNVLLEGRGGGYES 301
           AWL++AT D+ YL Y      S GGTGW+M EF WD KYAGVQ L+  VL++G+GG + +
Sbjct: 243 AWLYQATNDKYYLDYLGKNGDSMGGTGWSMTEFGWDVKYAGVQTLVAKVLMQGKGGEHTA 302

Query: 302 TLKQYQAKANYFACACLQKNDGFNINKTPGGLMYAHEWNNMQYASAAAFLMAVYSDYLSN 361
             ++YQ KA  F C+ L K+   NI KTPGGL++   WNNMQ+ ++A+FL  VYSDYLS 
Sbjct: 303 VFERYQQKAEQFMCSLLGKSTK-NIKKTPGGLIFRQSWNNMQFVTSASFLATVYSDYLSY 362

Query: 362 ANAKLTCPDGVFEPKDLLNFAQSQADYILGKNPNSLSYLIGYGSKFPQKLHHRGSSIASI 421
           +   L C  G   P  LL F++SQ DYILG NP + SY++GYG  +P+++HHRGSSI S 
Sbjct: 363 SKRDLLCSQGNISPSQLLEFSKSQVDYILGDNPRATSYMVGYGENYPRQVHHRGSSIVSF 422

Query: 422 FTDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDKDDRFDDDRSDYEQSEPTLTACAP 481
             D   + C  G+  W+ R   +PNVL GALVGGPD  D F D R +YEQ+EP     AP
Sbjct: 423 NVDQKFVTCRGGYATWFSRKGSDPNVLTGALVGGPDAYDNFADQRDNYEQTEPATYNNAP 482

Query: 482 LVGLFSKLQSSVNGYS--------IPGSLGNKPPVKPER---ESPDASAPVPAGSPIEFI 541
           L+G+ ++L S   G+          P  +  KP   P+R   + P AS+P    SPI   
Sbjct: 483 LLGVLARLISGSTGFDQLLPGVSPTPSPVIIKPAPVPQRKPTKPPAASSP----SPITIS 542

Query: 542 HTITSTWTTNKESYYRHQVKIKNTSGKPINNLKLQLENLSGPIWGLSPTKQKGIYELPAW 601
             +T++W    + YYR+   + N S K +  LK+ +  L GPIWG+  TK    +  P+W
Sbjct: 543 QKMTNSWKNEGKVYYRYSTILTNRSTKTLKILKISITKLYGPIWGV--TKTGNSFSFPSW 602

Query: 602 LTVLQPGSECIFIYIQEGPQARVTVSSY 618
           +  L  G    F+YI     A V VS+Y
Sbjct: 603 MQSLPSGKSMEFVYIHSASPADVLVSNY 623

BLAST of HG10023079 vs. TAIR 10
Match: AT2G32990.1 (glycosyl hydrolase 9B8 )

HSP 1 Score: 576.6 bits (1485), Expect = 2.3e-164
Identity = 276/494 (55.87%), Postives = 352/494 (71.26%), Query Frame = 0

Query: 5   FLLALVAAVLYFEAAA--------AGDFNYGDALDLSFLYLEAQRSGKLPADRRVKWRGD 64
           FLL L+  V  F AA          G F+YG+AL  S LY EAQRSG+LP ++RV WR  
Sbjct: 13  FLLLLLITV--FSAALDGVSSETDVGGFDYGEALSKSLLYFEAQRSGRLPYNQRVTWRDH 72

Query: 65  SGLKDGLAQGVNLVGGYYDAGDNVKFGLPMSFVTTILSWGAIDFKKEITDINQMDNALKA 124
           SGL DGL QGV+LVGGY+DAGD+VKFGLPM+F  T+LSW  I++   +    ++ +AL+A
Sbjct: 73  SGLTDGLEQGVDLVGGYHDAGDHVKFGLPMAFTVTMLSWSVIEYGDSLASTGELSHALEA 132

Query: 125 IKWATDYLIKAHTQPNVLWAQVGDGASDHFCWERPEDMTTPRTAFKIDESHPGSDLAGET 184
           IKW TDY IKAHT PNVLWA+VGDG +DH+CW+RPEDMTT R AFKIDE++PGSD+AGET
Sbjct: 133 IKWGTDYFIKAHTSPNVLWAEVGDGDTDHYCWQRPEDMTTSRRAFKIDENNPGSDIAGET 192

Query: 185 AAALAAASITFKSYNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCASEFYTS-TGYWDE 244
           AAA+AAASI F+S N  YS LLL HA++LF F D +RG YD+S+     +Y S +GY DE
Sbjct: 193 AAAMAAASIVFRSTNPHYSHLLLHHAQQLFEFGDKYRGKYDESLKVVKSYYASVSGYMDE 252

Query: 245 LLWAAAWLFRATGDESYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQILLTNVLLEGRG 304
           LLW A WL+RAT +E Y+ Y VD A   GG  WAM EFSWD K+AGVQ+L + +L E + 
Sbjct: 253 LLWGATWLYRATDNEHYMSYVVDMAHQLGGLSWAMSEFSWDVKFAGVQLLASMLLKEEKH 312

Query: 305 GGYESTLKQYQAKANYFACACLQKN-DGFNINKTPGGLMYAHEWNNMQYASAAAFLMAVY 364
             +   L+QY++KA+++ C+ L KN +G N+ +TP GL+Y  +WNNMQY S A+FL+ VY
Sbjct: 313 KQHSKVLQQYKSKADHYLCSILNKNINGTNVQRTPAGLLYVRQWNNMQYVSTASFLLTVY 372

Query: 365 SDYLSNANAKLTCPDGVFEPKDLLNFAQSQADYILGKNPNSLSYLIGYGSKFPQKLHHRG 424
           SD+L  +N  L C +G   P ++L FA+SQ DYILG NP   SYL+GYG K+P ++HHRG
Sbjct: 373 SDHLRKSNTDLECHEGTVTPDEMLGFAKSQIDYILGSNPMETSYLVGYGPKYPIRVHHRG 432

Query: 425 SSIASIFTDPSPIGCVQGFDFWYHRPQGNPNVLHGALVGGPDKDDRFDDDRSDYEQSEPT 484
           +SIAS       IGC QG+D WY R + NP+VL GALVGGPD  D FDD R +Y Q+E  
Sbjct: 433 ASIASFKEHKGFIGCTQGYDNWYGRSEPNPSVLVGALVGGPDHQDDFDDRRGNYVQTEAC 492

Query: 485 LTACAPLVGLFSKL 489
               APLVG+F++L
Sbjct: 493 TYNTAPLVGVFARL 504

BLAST of HG10023079 vs. TAIR 10
Match: AT2G44550.1 (glycosyl hydrolase 9B10 )

HSP 1 Score: 536.6 bits (1381), Expect = 2.7e-152
Identity = 252/477 (52.83%), Postives = 332/477 (69.60%), Query Frame = 0

Query: 10  VAAVLYFEAAAAGDFNYGDALDLSFLYLEAQRSGKLPADRRVKWRGDSGLKDGLAQGVNL 69
           +  ++   A  A   NY +AL  S LY EAQRSGKLP ++RV WRGDS L+DG    ++L
Sbjct: 18  IVLIVMSMAREAVSTNYAEALKNSLLYFEAQRSGKLPPNQRVTWRGDSALRDGSDAHIDL 77

Query: 70  VGGYYDAGDNVKFGLPMSFVTTILSWGAIDFKKEITDINQMDNALKAIKWATDYLIKAHT 129
            GGYYDAGDN+KFG P++F TT+L+W  I+   ++   ++  NAL+A+KWATDYLIKAH 
Sbjct: 78  TGGYYDAGDNMKFGFPLAFTTTMLAWSNIEMASQLRAHHEKGNALRALKWATDYLIKAHP 137

Query: 130 QPNVLWAQVGDGASDHFCWERPEDMTTPRTAFKIDESHPGSDLAGETAAALAAASITFKS 189
           QPNVL+ QVG+G SDH CW RPEDMTTPRT+++ID  HPGSDLAGETAAA+AAASI F  
Sbjct: 138 QPNVLYGQVGEGNSDHKCWMRPEDMTTPRTSYRIDAQHPGSDLAGETAAAMAAASIAFAP 197

Query: 190 YNSSYSDLLLVHAKELFTFADTFRGLYDDSIPCASEFYTSTGYWDELLWAAAWLFRATGD 249
            + +Y+++L+ HAK+LF FA   RGLY +SIP A  FY S+GY DELLWAAAWL RAT D
Sbjct: 198 SDKAYANILIGHAKDLFAFAKAHRGLYQNSIPNAGGFYASSGYEDELLWAAAWLHRATND 257

Query: 250 ESYLKYTVDKAVSFGGTGWAMKEFSWDNKYAGVQILLTNVLLEGRGGGYESTLKQYQAKA 309
           + YL Y  +       TG     F+WD+K+ G Q+L+  + LEG+    E  + +Y++ A
Sbjct: 258 QIYLDYLTE-----AETGGPRTVFAWDDKFVGAQVLVAKLALEGKVESSEQ-IVEYKSMA 317

Query: 310 NYFACACLQKNDGFNINKTPGGLMYAHEWNNMQYASAAAFLMAVYSDYLSNANAKLTCPD 369
             F C C QK D  N+ KTPGGL+Y   WNN+QY +AA F+++ YS YL  A A + CPD
Sbjct: 318 EQFICNCAQKGDN-NVKKTPGGLLYFLPWNNLQYTTAATFVLSAYSKYLEAAKASIDCPD 377

Query: 370 GVFEPKDLLNFAQSQADYILGKNPNSLSYLIGYGSKFPQKLHHRGSSIASIFTDPSPIGC 429
           G  +  DLL  A+SQ DYILG NP  +SY++G G+ +P+K HHR +SI SI  D +P+ C
Sbjct: 378 GALQASDLLQVARSQVDYILGSNPQKMSYMVGVGTNYPKKPHHRAASIVSIRQDKTPVTC 437

Query: 430 VQGFDFWYHRPQGNPNVLHGALVGGPDKDDRFDDDRSDYEQSEPTLTACAPLVGLFS 487
             G+D WY+ P  NPNVL GA+VGGPD +D + D+RS+++Q+EP     APLVG+ +
Sbjct: 438 SGGYDKWYNNPAPNPNVLAGAVVGGPDDNDVYGDERSNFQQAEPATVTTAPLVGVLA 487

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038899109.10.0e+0090.94endoglucanase 5 [Benincasa hispida][more]
XP_022991484.10.0e+0089.97endoglucanase 5 [Cucurbita maxima][more]
XP_022953304.10.0e+0089.97endoglucanase 5 [Cucurbita moschata][more]
XP_023548134.10.0e+0089.66endoglucanase 5 [Cucurbita pepo subsp. pepo][more]
KAG7014275.10.0e+0089.81Endoglucanase 5, partial [Cucurbita argyrosperma subsp. argyrosperma][more]
Match NameE-valueIdentityDescription
Q9M9951.2e-24264.94Endoglucanase 5 OS=Arabidopsis thaliana OX=3702 GN=At1g48930 PE=1 SV=1[more]
A2XYW83.1e-21456.87Endoglucanase 13 OS=Oryza sativa subsp. indica OX=39946 GN=GLU6 PE=3 SV=2[more]
Q0J9307.0e-21456.70Endoglucanase 13 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU6 PE=2 SV=1[more]
Q420591.8e-17749.27Endoglucanase 6 OS=Arabidopsis thaliana OX=3702 GN=At1g64390 PE=2 SV=2[more]
Q5NAT07.3e-17147.60Endoglucanase 2 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU5 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A6J1JUZ60.0e+0089.97Endoglucanase OS=Cucurbita maxima OX=3661 GN=LOC111488085 PE=3 SV=1[more]
A0A6J1GP970.0e+0089.97Endoglucanase OS=Cucurbita moschata OX=3662 GN=LOC111455891 PE=3 SV=1[more]
A0A5A7URX30.0e+0089.66Endoglucanase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold459G00120 ... [more]
A0A1S3CEZ80.0e+0089.66Endoglucanase OS=Cucumis melo OX=3656 GN=LOC103499972 PE=3 SV=1[more]
A0A0A0K9M60.0e+0088.53Endoglucanase OS=Cucumis sativus OX=3659 GN=Csa_7G420700 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT1G48930.18.7e-24464.94glycosyl hydrolase 9C1 [more]
AT1G64390.11.3e-17849.27glycosyl hydrolase 9C2 [more]
AT4G11050.19.8e-17148.09glycosyl hydrolase 9C3 [more]
AT2G32990.12.3e-16455.87glycosyl hydrolase 9B8 [more]
AT2G44550.12.7e-15252.83glycosyl hydrolase 9B10 [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR019028Carbohydrate binding domain CBM49SMARTSM01063CBM49_2coord: 526..608
e-value: 7.1E-17
score: 72.1
IPR019028Carbohydrate binding domain CBM49PFAMPF09478CBM49coord: 526..605
e-value: 2.1E-22
score: 79.2
IPR012341Six-hairpin glycosidase-like superfamilyGENE3D1.50.10.10coord: 23..491
e-value: 6.8E-170
score: 567.7
IPR001701Glycoside hydrolase family 9PFAMPF00759Glyco_hydro_9coord: 26..484
e-value: 2.7E-142
score: 475.3
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 497..523
NoneNo IPR availablePANTHERPTHR22298:SF41ENDOGLUCANASE 5coord: 10..575
NoneNo IPR availablePANTHERPTHR22298ENDO-1,4-BETA-GLUCANASEcoord: 10..575
IPR018221Glycoside hydrolase family 9, His active sitePROSITEPS00592GH9_2coord: 387..413
IPR008928Six-hairpin glycosidase superfamilySUPERFAMILY48208Six-hairpin glycosidasescoord: 22..494

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10023079.1HG10023079.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0030245 cellulose catabolic process
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0005576 extracellular region
molecular_function GO:0030246 carbohydrate binding
molecular_function GO:0008810 cellulase activity
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds