HG10005100 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10005100
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Alpha-galactosidase
Location: Chr08: 22875520 .. 22879689 (+)
RNA-Seq Expression: HG10005100
Synteny: HG10005100
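The Location field lists a start and an end coordinate on Chr08. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common genome-browser convention, not stated on this page), the genomic span of the feature can be computed as follows. The helper name `span_length` is hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an interval, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


# Coordinates copied from the Location field of this record.
print(span_length(22875520, 22879689))
```

Under that assumption the feature spans 4170 bp on the plus strand.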
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTTAGGATAAAGCCACTGGCTTCACTAATCATATGGTGCATACGAACTCTTATCTGTTTCAGGGTGTCCTCACAAACTGGACCAGAACGTGCTGCCCTCCCACCGAGAGGTTGGAACTCCTATGACTCATTTTCCTGGATTATTTCTGAAGAAGAATTCCTAAAAAATGTAGAGATTGTTGCCAATCGGCTTAAATCACAGGGATATGAGGTAGGAATTTGGTAACTGGCTGACAAATTTTGTTTGTAACATTGGTCATTGTATGATCTAATGTGGGGATTCCTTTAGCTTTATTGTAATGTAATGCAGTATAATGTAAGAAGATTAATCTTCGAGTTTTTTTTTTCACTTGATGCAGTACGTGGTTGTGGATTACCTCTGGTATAGGAAAAAGGTTCCAGGTGCTTACACTGATTCTCTTGGATTCGATGTGATTGATGAGTGGGGGAGGATGGTCCCGGACCCGGTTCGATGGCCTTCCTCTCAAGGTGGAAAGGGATTCTCTGAAGTTGCTAAGAAAGTTCATGATATGGGTTTGAAGTTTGGGATTCATGTCATGAGAGGAATAAGTACACAGGCAGTAAATGCCAATACTCCTATATTGGACATTTCAAAGGTACCTTGGGCAAGTTCTTTTATTTCTGCATACTGTGAGCATTGATAAATCGTCAGCACTATAAAACACAGGAGAAATGCTAATTGTGTAAAGTCCTTTTTTTGCATCTCTCTTTGGGATGAAAGATAGTGAGCCTTTAATGATTGTAAGTTTCTGCTTAAATTTTGTCAATTGCTTCTTAGGGAGACGCCTATGTAGAATCCGGGAAAAAGTGGTTTGCGAGTGATATAGGGATCAAGTCGAGGGCTTGTGGATGGATGCATAATGGTTTCATGAGTGTAGACGTTAAATCGGGGGCTGGAAAAGCCTTCTTAAGGTCCCTGTATCAACAATATGCTGATTGGGGTGTTGATTTTGGTAAGTTTGACTGTCATTAACAAAAAGTAAGGTCGCTTCCTACATGCTTACATATCTCTGTGAAGTATTTGATAGATTGTTATCAATTTCTTAAGAATTTCATCTTACTCTTCAACTTTAACATTTTAACTGATTATCTTTAGAGGACTTAATATGTTGGAAAATTCTGATATTTCAATAGTTTGATCGAAATTAGATCTAGGCATCTTTCATAGTTTCCCATGGTGAAATGCATTATTCATTGATACAAAGTTTAGTATGGTGTTAATCATGATCATAGATTTCAAATACACCGTATCTGCAAAGAAAATGTATCTCATACTTAGAAAATTAGATGCTGACTATAGCAATATATAGATCTACAATTTGCGTGTGAGATTATGCAATTTCTTTCCGCTCCCAACCATTTCCTGAAACCTGGATACTAACCTGAAATGATCATTCTTTATAACCTTATTTTTCTGGCAGTGAAACATGATTGTGTCTTTGGCGATGATTTGGATTTAGATGAAATAACCTTTGTCTCGGATGTATGCTCTCTCTCGTTCTCTCTCTCTCTCTCTCTCTCACACACACACACAACTGTTATTCATACGCTAAATGCTAATTCATTGAGTCTGCACAGGTTCTTAAACAACTTAACAGCCCAATAATGTATTCTCTGTCCCCTGGAACAAGTGTGACACCAGCCATGGCCAAGGCTGTAAGTGGGCTGGCCAATATGTATAGAATAACTGGTGATGATTGGGATACTTGGAACGACATCGTTTCTCACTTTGATGTCAGCCGGTAAATTAAATGTTTCAATTCAAGAGATTCAAATACAATATCATTTGATGTTGATGTCTTCTCAACTTGTATTTACAGTCTGTTTCATCTCTTCTTCAGGGATTTTTCTACTGCTAATATGATAGGTGCCGCAGGCTTGCTGGGTAAATCATGGCCTGATTTAGATATGCTTCCTCTGGGGTGGCTTACTGATCCAGGTTATACAATTTCATAAGTTATTTTTTCCTTTTCCTCT
CTGGGAGCTGGGCGGTAAGGTTGCAATAAGACTGACTAAACTAAACATGACTGTACAATTAGGTCGAGTAACAATCTCAAGTATTTTGATAACATGTTCTTGTTGCTTGTTATTTGCAAAGAAGCCTTACTAGATTAGACAAAGAATTACATGCAACTGTCTTTTACTTGCTTAATTAGATATTAAGTAATTATTAAATAAAATTCCCTAATATAAAATAAATAAGTGACCAGTATGAGGAGACTGAATAAAAATAGATGGAATTGGGTAAACCATTTTTTGGGCGCCTACAGAAGTCCAAATAATATTTCAAATTAAGCTTGTATAGATGTTATTAATAATGTGAAATAAAATGTTTAGGTTCAAATGATGGCCCACATAGGACAACCAACCTCAACATAAATGAACAAAGAACTCAGGTACTTTTTTTTTTTGAGTTCAACAATATGTGGGGTAAATAATTTCTGACTTCTTAGTCGAGGGTGCATGTTTTATGCTTGAGTTATACTCATGTTGGCAGTTACATTTGTATTTAAAGGAGGGAGAGTATATTATCTTAGTACTTTTCATTTGACGCAGAAATGATGTTTGACTTAATTTTTCAGATGACTTTATGGTCTATGTCCAAGTCTCCGATTATGTATGGAGGAGATTTGAGGAATATAGATAATACCACCTTTAGTATCATCACAAATCCTACCCTACTAGAGATTAACTCCTTTAGCTCAAATAATACGGAGGCAAGTTGAGCTTTTTTTCCATTTTTAAAACTTAGAAGTTGGGAAAGACTTTCTTGATATCAACATTTTTACCTTCTATTTTTGTTGCCTTTACAGTTTCTTAAAATTTCTACAACAGAAGTTTCAAACAGTAGAGAGCAGATTGTGAAATGGCATTCCAGACATGTGGAGGCATCAGCTTCACCTATACTGGGTCTCACCAAATGTGCATATTCAGACACAGCTGGTTGGATTATTGAAAGTCTCAATCGAGGTCTCGAAAAAATATGTTGGAAGGAAAATCCAGAACATGAATATCAAACACCATTTTGTCTATATAAACGAGGATCTCGAGTTGCAATGTAATTGTTTTGATGCTTTACTTTTATTTGGCTGTGTAAAGTTTTGATTTTGAGCATTGTAACAAAAATTATGTTCTGCTACAGAGATAAAGAGGCAACTACTCGCCATGATCAAGTGGAACTTCTATCATTTCCAACTAGCACGGTGGATGTTTGCCTGGATGCTACTCCCAAAAGAAAGCGTAGTTCTGAAGAGTTCACGAGGGGGTCATTTTTCCCCTGTAGAAGGCATTCGAATCAGGTTCATCTTTGGCTTTTGTTTGTTTAATGTCTTCTTTCCATTTATGAGAATCAATCTGCATAGCTACATAAATGAGCAATCTAGTTATATTAACGACAGTTCTACACATTCGGGATTAATATTGTTATTGACCTTTGCCAGTAGGTAGTTTCTCCGGGTTAACCAACTGGCTAAATGTCTTTGTTTTGGAGAGGATGAAGAAAACAGTCTTTTTCTGGACTTTACATCTGCATATATTCATTTTCTCACTCTTATTATCTTTCTAATGCTCTTATTTTCTTCCTGGAATGTTCACTCTGATAATAACAGAAGTGGGATTTATACGCTAATGGGACGTTGGCAAACCATTATTCAGGGCATTGTGCAATTGTGAAGTATAACCGAGGTAATGAATCCGAGATCGACTTGCATTATTAAAACGACTAATTGTCAAGATCTTGACTTTTTTGCCATTGTGATCATTCATGGAAATCATTCCTGCAGCCAATGCTCTTCCGACCGGAGTTCGCTCTTGGGTCGCAACAGGAAGAGGAGGTTAGTGTATCTCTGAGATTCTAAATGGATTTTCTCATAGATGAAGCATTGTTCTGTTAATATTTTACTGAAGAATATTTTGATCTAATTTAGGTGAGATATATGTTGCTTTCTTCAATCTGAACAATGTGAAGACCGTTATAT
CCGCGAAGATTTCGGACCTTGCCGAGGTACTTCCAGGCAAGAAATTGGGTCAGACTTCATGTAAATGCAGAGAGGAATGGACTGGAAAAGACTTTGGACTGATCTCAGAATCAATAGCTGCACCAGTTGAAGGCCATGGTTCTGCACTATTTATCATCAATTGTAACTAG

mRNA sequence

ATGGTTAGGATAAAGCCACTGGCTTCACTAATCATATGGTGCATACGAACTCTTATCTGTTTCAGGGTGTCCTCACAAACTGGACCAGAACGTGCTGCCCTCCCACCGAGAGGTTGGAACTCCTATGACTCATTTTCCTGGATTATTTCTGAAGAAGAATTCCTAAAAAATGTAGAGATTGTTGCCAATCGGCTTAAATCACAGGGATATGAGTACGTGGTTGTGGATTACCTCTGGTATAGGAAAAAGGTTCCAGGTGCTTACACTGATTCTCTTGGATTCGATGTGATTGATGAGTGGGGGAGGATGGTCCCGGACCCGGTTCGATGGCCTTCCTCTCAAGGTGGAAAGGGATTCTCTGAAGTTGCTAAGAAAGTTCATGATATGGGTTTGAAGTTTGGGATTCATGTCATGAGAGGAATAAGTACACAGGCAGTAAATGCCAATACTCCTATATTGGACATTTCAAAGGGAGACGCCTATGTAGAATCCGGGAAAAAGTGGTTTGCGAGTGATATAGGGATCAAGTCGAGGGCTTGTGGATGGATGCATAATGGTTTCATGAGTGTAGACGTTAAATCGGGGGCTGGAAAAGCCTTCTTAAGGTCCCTGTATCAACAATATGCTGATTGGGGTGTTGATTTTGTGAAACATGATTGTGTCTTTGGCGATGATTTGGATTTAGATGAAATAACCTTTGTCTCGGATGTTCTTAAACAACTTAACAGCCCAATAATGTATTCTCTGTCCCCTGGAACAAGTGTGACACCAGCCATGGCCAAGGCTGTAAGTGGGCTGGCCAATATGTATAGAATAACTGGTGATGATTGGGATACTTGGAACGACATCGTTTCTCACTTTGATGTCAGCCGGGATTTTTCTACTGCTAATATGATAGGTGCCGCAGGCTTGCTGGGTAAATCATGGCCTGATTTAGATATGCTTCCTCTGGGGTGGCTTACTGATCCAGAAGTTTCAAACAGTAGAGAGCAGATTGTGAAATGGCATTCCAGACATGTGGAGGCATCAGCTTCACCTATACTGGGTCTCACCAAATGTGCATATTCAGACACAGCTGGTTGGATTATTGAAAGTCTCAATCGAGGTCTCGAAAAAATATGTTGGAAGGAAAATCCAGAACATGAATATCAAACACCATTTTGTCTATATAAACGAGGATCTCGAGTTGCAATAGATAAAGAGGCAACTACTCGCCATGATCAAGTGGAACTTCTATCATTTCCAACTAGCACGGTGGATGTTTGCCTGGATGCTACTCCCAAAAGAAAGCGTAGTTCTGAAGAGTTCACGAGGGGGTCATTTTTCCCCTGTAGAAGGCATTCGAATCAGAAGTGGGATTTATACGCTAATGGGACGTTGGCAAACCATTATTCAGGGCATTGTGCAATTGTGAAGTATAACCGAGCCAATGCTCTTCCGACCGGAGTTCGCTCTTGGGTCGCAACAGGAAGAGGAGGTGAGATATATGTTGCTTTCTTCAATCTGAACAATGTGAAGACCGTTATATCCGCGAAGATTTCGGACCTTGCCGAGGTACTTCCAGGCAAGAAATTGGGTCAGACTTCATGTAAATGCAGAGAGGAATGGACTGGAAAAGACTTTGGACTGATCTCAGAATCAATAGCTGCACCAGTTGAAGGCCATGGTTCTGCACTATTTATCATCAATTGTAACTAG

Coding sequence (CDS)

ATGGTTAGGATAAAGCCACTGGCTTCACTAATCATATGGTGCATACGAACTCTTATCTGTTTCAGGGTGTCCTCACAAACTGGACCAGAACGTGCTGCCCTCCCACCGAGAGGTTGGAACTCCTATGACTCATTTTCCTGGATTATTTCTGAAGAAGAATTCCTAAAAAATGTAGAGATTGTTGCCAATCGGCTTAAATCACAGGGATATGAGTACGTGGTTGTGGATTACCTCTGGTATAGGAAAAAGGTTCCAGGTGCTTACACTGATTCTCTTGGATTCGATGTGATTGATGAGTGGGGGAGGATGGTCCCGGACCCGGTTCGATGGCCTTCCTCTCAAGGTGGAAAGGGATTCTCTGAAGTTGCTAAGAAAGTTCATGATATGGGTTTGAAGTTTGGGATTCATGTCATGAGAGGAATAAGTACACAGGCAGTAAATGCCAATACTCCTATATTGGACATTTCAAAGGGAGACGCCTATGTAGAATCCGGGAAAAAGTGGTTTGCGAGTGATATAGGGATCAAGTCGAGGGCTTGTGGATGGATGCATAATGGTTTCATGAGTGTAGACGTTAAATCGGGGGCTGGAAAAGCCTTCTTAAGGTCCCTGTATCAACAATATGCTGATTGGGGTGTTGATTTTGTGAAACATGATTGTGTCTTTGGCGATGATTTGGATTTAGATGAAATAACCTTTGTCTCGGATGTTCTTAAACAACTTAACAGCCCAATAATGTATTCTCTGTCCCCTGGAACAAGTGTGACACCAGCCATGGCCAAGGCTGTAAGTGGGCTGGCCAATATGTATAGAATAACTGGTGATGATTGGGATACTTGGAACGACATCGTTTCTCACTTTGATGTCAGCCGGGATTTTTCTACTGCTAATATGATAGGTGCCGCAGGCTTGCTGGGTAAATCATGGCCTGATTTAGATATGCTTCCTCTGGGGTGGCTTACTGATCCAGAAGTTTCAAACAGTAGAGAGCAGATTGTGAAATGGCATTCCAGACATGTGGAGGCATCAGCTTCACCTATACTGGGTCTCACCAAATGTGCATATTCAGACACAGCTGGTTGGATTATTGAAAGTCTCAATCGAGGTCTCGAAAAAATATGTTGGAAGGAAAATCCAGAACATGAATATCAAACACCATTTTGTCTATATAAACGAGGATCTCGAGTTGCAATAGATAAAGAGGCAACTACTCGCCATGATCAAGTGGAACTTCTATCATTTCCAACTAGCACGGTGGATGTTTGCCTGGATGCTACTCCCAAAAGAAAGCGTAGTTCTGAAGAGTTCACGAGGGGGTCATTTTTCCCCTGTAGAAGGCATTCGAATCAGAAGTGGGATTTATACGCTAATGGGACGTTGGCAAACCATTATTCAGGGCATTGTGCAATTGTGAAGTATAACCGAGCCAATGCTCTTCCGACCGGAGTTCGCTCTTGGGTCGCAACAGGAAGAGGAGGTGAGATATATGTTGCTTTCTTCAATCTGAACAATGTGAAGACCGTTATATCCGCGAAGATTTCGGACCTTGCCGAGGTACTTCCAGGCAAGAAATTGGGTCAGACTTCATGTAAATGCAGAGAGGAATGGACTGGAAAAGACTTTGGACTGATCTCAGAATCAATAGCTGCACCAGTTGAAGGCCATGGTTCTGCACTATTTATCATCAATTGTAACTAG

Protein sequence

MVRIKPLASLIIWCIRTLICFRVSSQTGPERAALPPRGWNSYDSFSWIISEEEFLKNVEIVANRLKSQGYEYVVVDYLWYRKKVPGAYTDSLGFDVIDEWGRMVPDPVRWPSSQGGKGFSEVAKKVHDMGLKFGIHVMRGISTQAVNANTPILDISKGDAYVESGKKWFASDIGIKSRACGWMHNGFMSVDVKSGAGKAFLRSLYQQYADWGVDFVKHDCVFGDDLDLDEITFVSDVLKQLNSPIMYSLSPGTSVTPAMAKAVSGLANMYRITGDDWDTWNDIVSHFDVSRDFSTANMIGAAGLLGKSWPDLDMLPLGWLTDPEVSNSREQIVKWHSRHVEASASPILGLTKCAYSDTAGWIIESLNRGLEKICWKENPEHEYQTPFCLYKRGSRVAIDKEATTRHDQVELLSFPTSTVDVCLDATPKRKRSSEEFTRGSFFPCRRHSNQKWDLYANGTLANHYSGHCAIVKYNRANALPTGVRSWVATGRGGEIYVAFFNLNNVKTVISAKISDLAEVLPGKKLGQTSCKCREEWTGKDFGLISESIAAPVEGHGSALFIINCN
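
The protein sequence corresponds to the CDS read codon by codon. The sketch below is one way to check that relationship, assuming the standard nuclear genetic code (NCBI translation table 1); the function name `translate` and the short test fragments are illustrative, with the fragments copied from the start and end of the CDS above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS above, then its last four codons.
print(translate("ATGGTTAGGATAAAGCCACTG"))
print(translate("AATTGTAACTAG"))
```

The first call reproduces the N-terminal residues MVRIKPL of the protein sequence, and the second reproduces its C-terminal residues NCN before the TAG stop codon.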
Homology
BLAST of HG10005100 vs. NCBI nr
Match: XP_038886461.1 (alpha-galactosidase mel1 [Benincasa hispida])

HSP 1 Score: 1050.8 bits (2716), Expect = 4.2e-303
Identity = 511/629 (81.24%), Positives = 529/629 (84.10%), Query Frame = 0

Query: 14  CIRTLICFR-----VSSQTGPERAALPPRGWNSYDSFSWIISEEEFLKNVEIVANRLKSQ 73
           C     CF      VSSQTGPERAALPPRGWNSYDSFSWIISEEEFL+N EIVANRLK  
Sbjct: 11  CFSLFFCFGFFFNWVSSQTGPERAALPPRGWNSYDSFSWIISEEEFLQNAEIVANRLKPM 70

Query: 74  GYEYVVVDYLWYRKKVPGAYTDSLGFDVIDEWGRMVPDPVRWPSSQGGKGFSEVAKKVHD 133
           GYEYVV DYLWYRKKVPGAYTDSLGFDVIDEWGRMVPDPVRWPSSQGGKGFS+VAKKVHD
Sbjct: 71  GYEYVVADYLWYRKKVPGAYTDSLGFDVIDEWGRMVPDPVRWPSSQGGKGFSKVAKKVHD 130

Query: 134 MGLKFGIHVMRGISTQAVNANTPILDISKGDAYVESGKKWFASDIGIKSRACGWMHNGFM 193
           MGLKFGIHVMRGISTQAVNANTPILDISKGDAYVESG+KWFASDIGIKSRACGWMHNGFM
Sbjct: 131 MGLKFGIHVMRGISTQAVNANTPILDISKGDAYVESGRKWFASDIGIKSRACGWMHNGFM 190

Query: 194 SVDVKSGAGKAFLRSLYQQYADWGVDFVKHDCVFGDDLDLDEITFVSDVLKQLNSPIMYS 253
           SV+VKSGAGKAFLRSLYQQYADWGVDFVKHDC+FGDDLDLDEITFVSDVLKQLNS I+YS
Sbjct: 191 SVNVKSGAGKAFLRSLYQQYADWGVDFVKHDCIFGDDLDLDEITFVSDVLKQLNSTILYS 250

Query: 254 LSPGTSVTPAMAKAVSGLANMYRITGDDWDTWNDIVSHFDVSRDFSTANMIGAAGLLGKS 313
           LSPGTSVTPAMAKAVSGLANMYRITGDDWD+WNDIVSHFDV+RDFSTANMIG AGLLGKS
Sbjct: 251 LSPGTSVTPAMAKAVSGLANMYRITGDDWDSWNDIVSHFDVTRDFSTANMIGTAGLLGKS 310

Query: 314 WPDLDMLPLGWLTDP--------------------------------------------- 373
           WPDLDMLPLGWLTDP                                             
Sbjct: 311 WPDLDMLPLGWLTDPGSNNGPHRTTNLNINEQITQMTLWSMSKSPIMYGGDLRNMDNITL 370

Query: 374 ---------------------------EVSNSREQIVKWHSRHVEASASPILGLTKCAYS 433
                                      +VSN REQI KWHSRH+EASASPILGLTKCAYS
Sbjct: 371 SIITNPTILEINSFSSNNMEFLKISTSKVSNRREQIEKWHSRHLEASASPILGLTKCAYS 430

Query: 434 DTAGWIIESLNRGLEKICWKENPEHEYQTPFCLYKRGSRVAIDKEATTRHDQVELLSFPT 493
           DTAGWI ESLNRG+EKICWK N EHEY+TPFCLYKRGSRVAIDKEATTRHDQVELLSFPT
Sbjct: 431 DTAGWISESLNRGVEKICWKANQEHEYRTPFCLYKRGSRVAIDKEATTRHDQVELLSFPT 490

Query: 494 STVDVCLDATPKRKRSSEEFTRGSFFPCRRHSNQKWDLYANGTLANHYSGHCAIVKYNRA 553
           STVDVCLDATPKRKRS EE TRGSFFPCRRH NQKWDLYANGTLANHYSGHCAIVKYN+A
Sbjct: 491 STVDVCLDATPKRKRSYEELTRGSFFPCRRHENQKWDLYANGTLANHYSGHCAIVKYNQA 550

Query: 554 NALPTGVRSWVATGRGGEIYVAFFNLNNVKTVISAKISDLAEVLPGKKLGQTSCKCREEW 566
            A+PTGVRSWVATGRGGEIYVAFFNLNNVKTVIS KI DLAEVLPGKKLGQTSCKCREEW
Sbjct: 551 KAIPTGVRSWVATGRGGEIYVAFFNLNNVKTVISTKILDLAEVLPGKKLGQTSCKCREEW 610
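
The percentages in each HSP summary line are the match counts divided by the alignment length. As a small sketch, the summary line of the hit above can be parsed and the percentages recomputed. The function name `hsp_percentages` and the regular expression are assumptions made for illustration; the pattern tolerates both spellings of the "Positives" label.

```python
import re

# Matches "Identity = a/b ... Positives = c/d"; also accepts the label
# spelled "Postives", which some report renderers emit.
HSP_RE = re.compile(r"Identity = (\d+)/(\d+).*?Pos\w*tives = (\d+)/(\d+)")


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return (round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2))


line = "Identity = 511/629 (81.24%), Positives = 529/629 (84.10%), Query Frame = 0"
print(hsp_percentages(line))
```

For this hit the recomputed values agree with the reported 81.24% identity and 84.10% positives.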

BLAST of HG10005100 vs. NCBI nr
Match: XP_011649499.1 (uncharacterized protein LOC101206292 [Cucumis sativus] >XP_031736692.1 uncharacterized protein LOC101206292 [Cucumis sativus] >KGN62330.1 hypothetical protein Csa_018547 [Cucumis sativus])

HSP 1 Score: 1007.7 bits (2604), Expect = 4.0e-290
Identity = 485/612 (79.25%), Positives = 508/612 (83.01%), Query Frame = 0

Query: 23  VSSQTGPERAALPPRGWNSYDSFSWIISEEEFLKNVEIVANRLKSQGYEYVVVDYLWYRK 82
           VSSQTGPERAALPPRGWNSYDSFSWIISEEEFLKN EIVAN+LKS+GYEYV+VDYLWYRK
Sbjct: 19  VSSQTGPERAALPPRGWNSYDSFSWIISEEEFLKNAEIVANQLKSKGYEYVIVDYLWYRK 78

Query: 83  KVPGAYTDSLGFDVIDEWGRMVPDPVRWPSSQGGKGFSEVAKKVHDMGLKFGIHVMRGIS 142
            VPGAYTDSLGFDVID+WGRM PDPVRWPSSQGGKGFSEVAKKVHDMGLKFGIHVMRGIS
Sbjct: 79  LVPGAYTDSLGFDVIDDWGRMAPDPVRWPSSQGGKGFSEVAKKVHDMGLKFGIHVMRGIS 138

Query: 143 TQAVNANTPILDISKGDAYVESGKKWFASDIGIKSRACGWMHNGFMSVDVKSGAGKAFLR 202
           TQAVNANTPILDISKGDAYVESGKKW ASDIGIKSRACGWMHNGFMSV+VKSGAGKAFLR
Sbjct: 139 TQAVNANTPILDISKGDAYVESGKKWLASDIGIKSRACGWMHNGFMSVNVKSGAGKAFLR 198

Query: 203 SLYQQYADWGVDFVKHDCVFGDDLDLDEITFVSDVLKQLNSPIMYSLSPGTSVTPAMAKA 262
           SLYQQYADWGVDFVKHDCVFGDDLDLDEITFVSDVLKQLNS I+YSLSPGTS TPAMAKA
Sbjct: 199 SLYQQYADWGVDFVKHDCVFGDDLDLDEITFVSDVLKQLNSTIVYSLSPGTSATPAMAKA 258

Query: 263 VSGLANMYRITGDDWDTWNDIVSHFDVSRDFSTANMIGAAGLLGKSWPDLDMLPLGWLTD 322
           VSGLANMYRITGDDWD+WNDIVSHFDV+RDF+TANMIG AGLLGKSWPDLDMLPLGWLTD
Sbjct: 259 VSGLANMYRITGDDWDSWNDIVSHFDVTRDFATANMIGTAGLLGKSWPDLDMLPLGWLTD 318

Query: 323 PEVSNS------------------------------------------------------ 382
           P  +N                                                       
Sbjct: 319 PGSNNGPHRTTNLNINEQRTQMTLWSISKSPIMFGGDLRNIDNTTFSIITNPTLLEINAF 378

Query: 383 ---------------REQIVKWHSRHVEASASPILGLTKCAYSDTAGWIIESLNRGLEKI 442
                          R++IVKWHSR +E SAS ILGLTKCAYSDT GWI ESLN GLEKI
Sbjct: 379 SSNNMEFLKIASTNFRKRIVKWHSRGLETSASRILGLTKCAYSDTTGWITESLNEGLEKI 438

Query: 443 CWKENPEHEYQTPFCLYKRGSRVAIDKEATTRHDQVELLSFPTSTVDVCLDATPKRKRSS 502
           CWKENPEHE QTPFCLYKRGSRVAIDKEA TR DQVELLSFPTS+VDVCLDATPKRK SS
Sbjct: 439 CWKENPEHESQTPFCLYKRGSRVAIDKEAATRRDQVELLSFPTSSVDVCLDATPKRKHSS 498

Query: 503 EEFTRGSFFPCRRHSNQKWDLYANGTLANHYSGHCAIVKYNRANALPTGVRSWVATGRGG 562
           E   RGSFFPC+ H NQKWDLYANGTLANHYSGHCAIVKYN+A ++PTG RSWVA GRGG
Sbjct: 499 EAIMRGSFFPCKGHENQKWDLYANGTLANHYSGHCAIVKYNKAKSIPTGARSWVAAGRGG 558

Query: 563 EIYVAFFNLNNVKTVISAKISDLAEVLPGKKLGQTSCKCREEWTGKDFGLISESIAAPVE 566
           E+YVAFFNLNN KTVIS KISDLA+ LPGKKLG  SCKCREEW+GKDFGL+S+ IAAPVE
Sbjct: 559 EVYVAFFNLNNAKTVISVKISDLAQALPGKKLGSNSCKCREEWSGKDFGLVSDLIAAPVE 618

BLAST of HG10005100 vs. NCBI nr
Match: XP_008444380.1 (PREDICTED: uncharacterized protein LOC103487727 [Cucumis melo] >XP_008444381.1 PREDICTED: uncharacterized protein LOC103487727 [Cucumis melo])

HSP 1 Score: 1003.0 bits (2592), Expect = 9.9e-289
Identity = 485/612 (79.25%), Positives = 511/612 (83.50%), Query Frame = 0

Query: 23  VSSQTGPERAALPPRGWNSYDSFSWIISEEEFLKNVEIVANRLKSQGYEYVVVDYLWYRK 82
           VSSQTGPERAALPPRGWNSYDSFSWIISEEEFL NVEIVAN+LKS+GYEYV+VDYLWYRK
Sbjct: 19  VSSQTGPERAALPPRGWNSYDSFSWIISEEEFLNNVEIVANKLKSKGYEYVIVDYLWYRK 78

Query: 83  KVPGAYTDSLGFDVIDEWGRMVPDPVRWPSSQGGKGFSEVAKKVHDMGLKFGIHVMRGIS 142
           KVPGAYTDSLGFDVIDEWGRM PDPVRWPSSQGGKGFSEVAKKVH MGLKFGIHVMRGIS
Sbjct: 79  KVPGAYTDSLGFDVIDEWGRMAPDPVRWPSSQGGKGFSEVAKKVHAMGLKFGIHVMRGIS 138

Query: 143 TQAVNANTPILDISKGDAYVESGKKWFASDIGIKSRACGWMHNGFMSVDVKSGAGKAFLR 202
           TQAVNANTPILDISKGDAYVESGKKW ASDIGIKSRACGWMHNGFMSV+VKSGAGKAFLR
Sbjct: 139 TQAVNANTPILDISKGDAYVESGKKWLASDIGIKSRACGWMHNGFMSVNVKSGAGKAFLR 198

Query: 203 SLYQQYADWGVDFVKHDCVFGDDLDLDEITFVSDVLKQLNSPIMYSLSPGTSVTPAMAKA 262
           SLYQQYADWGVDFVKHDCVFGDDLDLDEITFVSDVLKQLNS I+YSLSPGTS TPAMAKA
Sbjct: 199 SLYQQYADWGVDFVKHDCVFGDDLDLDEITFVSDVLKQLNSTIVYSLSPGTSATPAMAKA 258

Query: 263 VSGLANMYRITGDDWDTWNDIVSHFDVSRDFSTANMIGAAGLLGKSWPDLDMLPLGWLTD 322
           VSGLANMYRITGDDWDTWNDIVSHFDV+RDF+TANMIG AGLLGKSWPDLDMLPLGWLTD
Sbjct: 259 VSGLANMYRITGDDWDTWNDIVSHFDVTRDFATANMIGTAGLLGKSWPDLDMLPLGWLTD 318

Query: 323 PEVSNS------------------------------------------------------ 382
           P  +N                                                       
Sbjct: 319 PGSNNGPHRTTNLNIDEQRTQMTLWSISKSPIMFGGDLRNIDNTTFSIITNPTLLEINSF 378

Query: 383 ---------------REQIVKWHSRHVEASASPILGLTKCAYSDTAGWIIESLNRGLEKI 442
                          R++IVKWHSR +EASASPILGLTKCAYSDT GWI +S+++GLEKI
Sbjct: 379 SSNNMEFLKIASTNFRKRIVKWHSRGLEASASPILGLTKCAYSDTTGWITKSVDQGLEKI 438

Query: 443 CWKENPEHEYQTPFCLYKRGSRVAIDKEATTRHDQVELLSFPTSTVDVCLDATPKRKRSS 502
           CWK NPEHEYQTPFCLYKRGSRVAIDKEA T  DQVELLSF TS+V+VCLDATPKRK SS
Sbjct: 439 CWKANPEHEYQTPFCLYKRGSRVAIDKEAATHRDQVELLSFSTSSVEVCLDATPKRKHSS 498

Query: 503 EEFTRGSFFPCRRHSNQKWDLYANGTLANHYSGHCAIVKYNRANALPTGVRSWVATGRGG 562
           E   RGSFFPC+RH NQKWDLYANGTLANH SGHCAIVKY +A A+PTGVRSWVATGRGG
Sbjct: 499 EAIMRGSFFPCKRHENQKWDLYANGTLANHNSGHCAIVKYKQAKAIPTGVRSWVATGRGG 558

Query: 563 EIYVAFFNLNNVKTVISAKISDLAEVLPGKKLGQTSCKCREEWTGKDFGLISESIAAPVE 566
           E+YVAFFNLNNVKTVISAKISDLA+ LPGKKLG  SCK REEW+GKDFGL+S+ IAAPVE
Sbjct: 559 EVYVAFFNLNNVKTVISAKISDLAQALPGKKLGPNSCKYREEWSGKDFGLVSDLIAAPVE 618

BLAST of HG10005100 vs. NCBI nr
Match: KAA0061052.1 (Melibiase domain-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1001.5 bits (2588), Expect = 2.9e-288
Identity = 484/612 (79.08%), Positives = 510/612 (83.33%), Query Frame = 0

Query: 23  VSSQTGPERAALPPRGWNSYDSFSWIISEEEFLKNVEIVANRLKSQGYEYVVVDYLWYRK 82
           VSSQTGPERAALPPRGWNSYDSFSWIISEEEFL NVEIVAN+LKS+GYEYV+VDYLWYRK
Sbjct: 19  VSSQTGPERAALPPRGWNSYDSFSWIISEEEFLNNVEIVANKLKSKGYEYVIVDYLWYRK 78

Query: 83  KVPGAYTDSLGFDVIDEWGRMVPDPVRWPSSQGGKGFSEVAKKVHDMGLKFGIHVMRGIS 142
           KVPGAYTDSLGFDVIDEWGRM PDPVRWPSSQGGKGFSEVAKKVH MGLKFGIHVMRGIS
Sbjct: 79  KVPGAYTDSLGFDVIDEWGRMAPDPVRWPSSQGGKGFSEVAKKVHAMGLKFGIHVMRGIS 138

Query: 143 TQAVNANTPILDISKGDAYVESGKKWFASDIGIKSRACGWMHNGFMSVDVKSGAGKAFLR 202
           TQAVNANTPILDISKGDAYVESGKKW ASDIGIKSRACGWMHNGFMSV+VKSGAGKAFLR
Sbjct: 139 TQAVNANTPILDISKGDAYVESGKKWLASDIGIKSRACGWMHNGFMSVNVKSGAGKAFLR 198

Query: 203 SLYQQYADWGVDFVKHDCVFGDDLDLDEITFVSDVLKQLNSPIMYSLSPGTSVTPAMAKA 262
           SLYQQYADWGVDFVKHDCVFGDDLDLDEITFVSDVLKQLNS I+YSLSPGTS TPAMAKA
Sbjct: 199 SLYQQYADWGVDFVKHDCVFGDDLDLDEITFVSDVLKQLNSTIVYSLSPGTSATPAMAKA 258

Query: 263 VSGLANMYRITGDDWDTWNDIVSHFDVSRDFSTANMIGAAGLLGKSWPDLDMLPLGWLTD 322
           VSGLANMYRITGDDWDTWNDIVSHFDV+RDF+TANMIG  GLLGKSWPDLDMLPLGWLTD
Sbjct: 259 VSGLANMYRITGDDWDTWNDIVSHFDVTRDFATANMIGTTGLLGKSWPDLDMLPLGWLTD 318

Query: 323 PEVSNS------------------------------------------------------ 382
           P  +N                                                       
Sbjct: 319 PGSNNGPHRTTNLNIDEQRTQMTLWSISKSPIMFGGDLRNIDNTTFSIITNPTLLEINSF 378

Query: 383 ---------------REQIVKWHSRHVEASASPILGLTKCAYSDTAGWIIESLNRGLEKI 442
                          R++IVKWHSR +EASASPILGLTKCAYSDT GWI +S+++GLEKI
Sbjct: 379 SSNNMEFLKIASTNFRKRIVKWHSRGLEASASPILGLTKCAYSDTTGWITKSVDQGLEKI 438

Query: 443 CWKENPEHEYQTPFCLYKRGSRVAIDKEATTRHDQVELLSFPTSTVDVCLDATPKRKRSS 502
           CWK NPEHEYQTPFCLYKRGSRVAIDKEA T  DQVELLSF TS+V+VCLDATPKRK SS
Sbjct: 439 CWKANPEHEYQTPFCLYKRGSRVAIDKEAATHRDQVELLSFSTSSVEVCLDATPKRKHSS 498

Query: 503 EEFTRGSFFPCRRHSNQKWDLYANGTLANHYSGHCAIVKYNRANALPTGVRSWVATGRGG 562
           E   RGSFFPC+RH NQKWDLYANGTLANH SGHCAIVKY +A A+PTGVRSWVATGRGG
Sbjct: 499 EAIMRGSFFPCKRHENQKWDLYANGTLANHNSGHCAIVKYKQAKAIPTGVRSWVATGRGG 558

Query: 563 EIYVAFFNLNNVKTVISAKISDLAEVLPGKKLGQTSCKCREEWTGKDFGLISESIAAPVE 566
           E+YVAFFNLNNVKTVISAKISDLA+ LPGKKLG  SCK REEW+GKDFGL+S+ IAAPVE
Sbjct: 559 EVYVAFFNLNNVKTVISAKISDLAQALPGKKLGPNSCKYREEWSGKDFGLVSDLIAAPVE 618

BLAST of HG10005100 vs. NCBI nr
Match: XP_022951774.1 (uncharacterized protein LOC111454511 [Cucurbita moschata] >XP_022951775.1 uncharacterized protein LOC111454511 [Cucurbita moschata])

HSP 1 Score: 954.9 bits (2467), Expect = 3.1e-274
Identity = 465/630 (73.81%), Positives = 498/630 (79.05%), Query Frame = 0

Query: 14  CIRTLICF-----RVSSQTGPERAALPPRGWNSYDSFSWIISEEEFLKNVEIVANRLKSQ 73
           C     CF     RVSSQTGPERAALPPRGWNSYDSF W ISE+EFL N EIVA RL S+
Sbjct: 10  CFCLFFCFGLLFNRVSSQTGPERAALPPRGWNSYDSFCWTISEKEFLDNAEIVAKRLNSK 69

Query: 74  GYEYVVVDYLWYRKKVPGAYTDSLGFDVIDEWGRMVPDPVRWPSSQGGKGFSEVAKKVHD 133
           GYEYVVVDYLWYRKKVPGAY DSLGFDVIDEWGR+VPDPVRWPSSQGGKGF+EVAKKVHD
Sbjct: 70  GYEYVVVDYLWYRKKVPGAYVDSLGFDVIDEWGRIVPDPVRWPSSQGGKGFTEVAKKVHD 129

Query: 134 MGLKFGIHVMRGISTQAVNANTPILDISKGDAYVESGKKWFASDIGIKSRACGWMHNGFM 193
           MGLKFGIHVMRGISTQAVNANTPILD+SKG AYVESG+KWFASDIGIKSR+C WMHNGFM
Sbjct: 130 MGLKFGIHVMRGISTQAVNANTPILDVSKGGAYVESGRKWFASDIGIKSRSCAWMHNGFM 189

Query: 194 SVDVKSGAGKAFLRSLYQQYADWGVDFVKHDCVFGDDLDLDEITFVSDVLKQLNSPIMYS 253
           SV+V SGAGKAFLRSLYQQ+ADWGVDFVKHDCVFGDDLDL EI+FVSDVLKQLN PI+YS
Sbjct: 190 SVNVNSGAGKAFLRSLYQQFADWGVDFVKHDCVFGDDLDLPEISFVSDVLKQLNRPILYS 249

Query: 254 LSPGTSVTPAMAKAVSGLANMYRITGDDWDTWNDIVSHFDVSRDFSTANMIGAAGLLGKS 313
           LSPGTSVTPAMAKAVSGL NMYRITGDDWDTWNDIVSHFDV+RDFSTANMIG  GLLGKS
Sbjct: 250 LSPGTSVTPAMAKAVSGLVNMYRITGDDWDTWNDIVSHFDVTRDFSTANMIGTTGLLGKS 309

Query: 314 WPDLDMLPLGWLTD---------------------------------------------- 373
           WPDLDMLPLGWLTD                                              
Sbjct: 310 WPDLDMLPLGWLTDQGSNDGPHRRSNLNINEQRTQMTLWCMSKSPIMYGGDLRNIDDMTY 369

Query: 374 ---------------------------PEVSNSREQIVKWHSRHVEASASPILGLTKCAY 433
                                       +VS  REQI+KWH R+++ S SPILGLTKCA 
Sbjct: 370 SIITNPTLLEINSFSSNNMEFLKIAATTKVSKCREQIMKWHFRYLKPSVSPILGLTKCAD 429

Query: 434 SDTAGWIIESLNRGLEKICWKENPEHEYQTPFCLYKRGSRVAIDKEATTRHDQVELLSFP 493
           S+  GWI E L+RG+EKICWK NPE EYQTP CLYKRGSRVAIDKEATTRHDQVELLS  
Sbjct: 430 SNAVGWITERLDRGVEKICWKANPELEYQTPLCLYKRGSRVAIDKEATTRHDQVELLSSH 489

Query: 494 TSTVDVCLDATPKRKRSSEEFTRGSFFPCRRHSNQKWDLYANGTLANHYSGHCAIVKYNR 553
           TSTVDVCLDAT KRK SSEEF RGSFFPC RH NQKWDLY+NGTL NH+SGHCAIVK N+
Sbjct: 490 TSTVDVCLDATSKRKHSSEEFMRGSFFPCSRHENQKWDLYSNGTLGNHHSGHCAIVKTNQ 549

Query: 554 ANALPTGVRSWVATGRGGEIYVAFFNLNNVKTVISAKISDLAEVLPGKKLGQTSCKCREE 566
           A A PTGVRSW+ATGRGGEIYVAFFNLN+VKTVISAKISDL EV+PGKKL  +SCKC+EE
Sbjct: 550 AKASPTGVRSWIATGRGGEIYVAFFNLNDVKTVISAKISDLGEVVPGKKLDHSSCKCKEE 609

BLAST of HG10005100 vs. ExPASy Swiss-Prot
Match: Q9URZ0 (Alpha-galactosidase mel1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=mel1 PE=3 SV=1)

HSP 1 Score: 81.3 bits (199), Expect = 4.0e-14
Identity = 78/307 (25.41%), Positives = 119/307 (38.76%), Query Frame = 0

Query: 35  PPRGWNSYDSFSWIISEEEFLKNVEIVANR-LKSQGYEYVVVDYLWYRKKVPGAYTDSLG 94
           P  GWNS++ ++  I E   L N + +    L   GYEY+V+D  W  K    A T    
Sbjct: 33  PQMGWNSWNKYACDIDESIILNNAKAIKEEGLLDLGYEYIVMDDCW-SKHERNATT---- 92

Query: 95  FDVIDEWGRMVPDPVRWPSSQGGKGFSEVAKKVHDMGLKFGIHVMRGISTQAVNANTPIL 154
                  GR+  +P ++P+     G   +AKK+HDMG KFG+                  
Sbjct: 93  -------GRLEANPDKFPN-----GIGSMAKKLHDMGFKFGM------------------ 152

Query: 155 DISKGDAYVESGKKWFASDIGIKSRACGWMHNGFMSVDVKSGAGKAFLRSLYQQYADWGV 214
                  Y  +GK   A   G  +            +D  +             +ADWGV
Sbjct: 153 -------YSSAGKYTCAGFPGSLNHE---------QIDADT-------------FADWGV 212

Query: 215 DFVKHDCVFGDD------LDLDEITFVSDVLKQLNSPIMYSLSP-GTSVTPAMAKAVSGL 274
           D++K+D  F +       +  +    +SD L +   PI YSL   G          +   
Sbjct: 213 DYLKYDNCFNEGKSGVPLISYERYKRMSDALNKTGRPIFYSLCQWGEDFVWNWGNTI--- 272

Query: 275 ANMYRITGDDWDTWN--DI-------------VSHFDVSRDFSTANMIGAAGLLGKSWPD 319
           AN +RI+GD +DT++  D+               H  V    S A+ + +   +   W D
Sbjct: 273 ANSWRISGDIFDTFSRKDVRCPCETIECFALQGDHCSVMNIISKASFLSSKAGMNSGWND 272
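
The Swiss-Prot and TrEMBL match descriptions follow the UniProt FASTA header layout: a protein name followed by `OS=` (organism), `OX=` (taxonomy ID), an optional `GN=` (gene name), `PE=` (protein existence level) and `SV=` (sequence version). A minimal sketch for splitting such a description into fields is shown below; the function name `parse_uniprot_description` is hypothetical, and the input is the description of the Q9URZ0 match above.

```python
import re

# Split before each known UniProt key so values containing spaces stay intact.
FIELD_SPLIT = re.compile(r"\s(?=(?:OS|OX|GN|PE|SV)=)")


def parse_uniprot_description(desc: str) -> dict[str, str]:
    """Split 'Name OS=... OX=... [GN=...] PE=... SV=...' into a dict."""
    parts = FIELD_SPLIT.split(desc.strip())
    fields = {"name": parts[0]}
    for part in parts[1:]:
        key, value = part.split("=", 1)
        fields[key] = value
    return fields


desc = ("Alpha-galactosidase mel1 OS=Schizosaccharomyces pombe "
        "(strain 972 / ATCC 24843) OX=284812 GN=mel1 PE=3 SV=1")
print(parse_uniprot_description(desc))
```

Splitting on a lookahead for the key names, rather than on every space, keeps multi-word organism names together.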

BLAST of HG10005100 vs. ExPASy Swiss-Prot
Match: A7XZT2 (Probable alpha-galactosidase B OS=Talaromyces emersonii OX=68825 PE=3 SV=1)

HSP 1 Score: 69.3 bits (168), Expect = 1.6e-10
Identity = 42/113 (37.17%), Positives = 59/113 (52.21%), Query Frame = 0

Query: 34  LPPRGWNSYDSFSWIISEEEFLKNVEIVAN-RLKSQGYEYVVVDYLWYRKKVPGAYTDSL 93
           LP  GWNS+++F   I EE+ L     + N  LK  GYEYV +D  W  K    A T   
Sbjct: 34  LPALGWNSWNAFGCDIDEEKILTAANQIVNLGLKDLGYEYVNIDDCWSVKSGRNATT--- 93

Query: 94  GFDVIDEWGRMVPDPVRWPSSQGGKGFSEVAKKVHDMGLKFGIHVMRGISTQA 146
                   GR++PD  ++P      G S +A+K+H++GLK GI+   G +T A
Sbjct: 94  --------GRIMPDLTKFPD-----GISGLAEKIHNLGLKIGIYSSAGWTTCA 130

BLAST of HG10005100 vs. ExPASy Swiss-Prot
Match: B0Y224 (Probable alpha-galactosidase B OS=Neosartorya fumigata (strain CEA10 / CBS 144.89 / FGSC A1163) OX=451804 GN=aglB PE=3 SV=1)

HSP 1 Score: 68.9 bits (167), Expect = 2.0e-10
Identity = 40/122 (32.79%), Positives = 62/122 (50.82%), Query Frame = 0

Query: 25  SQTGPERAALPPRGWNSYDSFSWIISEEEFLKNV-EIVANRLKSQGYEYVVVDYLWYRKK 84
           S++   +  LP  GWN++++F   I   + +    E+V   LK  GYEY+ +D  W  K 
Sbjct: 2   SRSKTRQGKLPALGWNTWNAFGCDIDATKIMTAANEVVNLGLKDLGYEYINIDDCWSVKS 61

Query: 85  VPGAYTDSLGFDVIDEWGRMVPDPVRWPSSQGGKGFSEVAKKVHDMGLKFGIHVMRGIST 144
              A T            R++PDP ++P      G S VA ++HD+GLK GI+   G++T
Sbjct: 62  GRDASTQ-----------RIIPDPDKFPD-----GISGVADQIHDLGLKIGIYSSAGLTT 107

Query: 145 QA 146
            A
Sbjct: 122 CA 107

BLAST of HG10005100 vs. ExPASy Swiss-Prot
Match: A1D0A3 (Probable alpha-galactosidase B OS=Neosartorya fischeri (strain ATCC 1020 / DSM 3700 / CBS 544.65 / FGSC A1164 / JCM 1740 / NRRL 181 / WB 181) OX=331117 GN=aglB PE=3 SV=1)

HSP 1 Score: 68.2 bits (165), Expect = 3.5e-10
Identity = 41/113 (36.28%), Positives = 58/113 (51.33%), Query Frame = 0

Query: 34  LPPRGWNSYDSFSWIISEEEFLKNV-EIVANRLKSQGYEYVVVDYLWYRKKVPGAYTDSL 93
           LP  GWNS+++F   I   + +    E+V   LK  GYEY+ +D  W  K    A T   
Sbjct: 32  LPALGWNSWNAFGCDIDAAKIMTAANEVVNLGLKDLGYEYINIDDCWSVKSGRDASTQ-- 91

Query: 94  GFDVIDEWGRMVPDPVRWPSSQGGKGFSEVAKKVHDMGLKFGIHVMRGISTQA 146
                    RMVPDP ++P      G S +A ++HD+GLK GI+   G++T A
Sbjct: 92  ---------RMVPDPEKFPD-----GISGLADQIHDLGLKVGIYSSAGLTTCA 128

BLAST of HG10005100 vs. ExPASy Swiss-Prot
Match: A1C5D3 (Probable alpha-galactosidase B OS=Aspergillus clavatus (strain ATCC 1007 / CBS 513.65 / DSM 816 / NCTC 3887 / NRRL 1) OX=344612 GN=aglB PE=3 SV=1)

HSP 1 Score: 67.4 bits (163), Expect = 5.9e-10
Identity = 41/113 (36.28%), Positives = 60/113 (53.10%), Query Frame = 0

Query: 34  LPPRGWNSYDSFSWIISEEEFLKNV-EIVANRLKSQGYEYVVVDYLWYRKKVPGAYTDSL 93
           LP  GWNS+++F   I + + +    EIV   LK  GYEY+ +D  W  K          
Sbjct: 33  LPALGWNSWNAFGCDIDDAKIMTAAKEIVNLGLKDLGYEYINIDDCWSVKS--------- 92

Query: 94  GFDVIDEWGRMVPDPVRWPSSQGGKGFSEVAKKVHDMGLKFGIHVMRGISTQA 146
           G D   +  R+VPDP ++P      G + VA ++HD+GLK GI+   G++T A
Sbjct: 93  GRDKTTK--RIVPDPAKFPD-----GIAGVADRIHDLGLKVGIYSSAGLTTCA 129

BLAST of HG10005100 vs. ExPASy TrEMBL
Match: A0A0A0LK91 (Alpha-galactosidase OS=Cucumis sativus OX=3659 GN=Csa_2G349640 PE=3 SV=1)

HSP 1 Score: 1007.7 bits (2604), Expect = 2.0e-290
Identity = 485/612 (79.25%), Positives = 508/612 (83.01%), Query Frame = 0

Query: 23  VSSQTGPERAALPPRGWNSYDSFSWIISEEEFLKNVEIVANRLKSQGYEYVVVDYLWYRK 82
           VSSQTGPERAALPPRGWNSYDSFSWIISEEEFLKN EIVAN+LKS+GYEYV+VDYLWYRK
Sbjct: 19  VSSQTGPERAALPPRGWNSYDSFSWIISEEEFLKNAEIVANQLKSKGYEYVIVDYLWYRK 78

Query: 83  KVPGAYTDSLGFDVIDEWGRMVPDPVRWPSSQGGKGFSEVAKKVHDMGLKFGIHVMRGIS 142
            VPGAYTDSLGFDVID+WGRM PDPVRWPSSQGGKGFSEVAKKVHDMGLKFGIHVMRGIS
Sbjct: 79  LVPGAYTDSLGFDVIDDWGRMAPDPVRWPSSQGGKGFSEVAKKVHDMGLKFGIHVMRGIS 138

Query: 143 TQAVNANTPILDISKGDAYVESGKKWFASDIGIKSRACGWMHNGFMSVDVKSGAGKAFLR 202
           TQAVNANTPILDISKGDAYVESGKKW ASDIGIKSRACGWMHNGFMSV+VKSGAGKAFLR
Sbjct: 139 TQAVNANTPILDISKGDAYVESGKKWLASDIGIKSRACGWMHNGFMSVNVKSGAGKAFLR 198

Query: 203 SLYQQYADWGVDFVKHDCVFGDDLDLDEITFVSDVLKQLNSPIMYSLSPGTSVTPAMAKA 262
           SLYQQYADWGVDFVKHDCVFGDDLDLDEITFVSDVLKQLNS I+YSLSPGTS TPAMAKA
Sbjct: 199 SLYQQYADWGVDFVKHDCVFGDDLDLDEITFVSDVLKQLNSTIVYSLSPGTSATPAMAKA 258

Query: 263 VSGLANMYRITGDDWDTWNDIVSHFDVSRDFSTANMIGAAGLLGKSWPDLDMLPLGWLTD 322
           VSGLANMYRITGDDWD+WNDIVSHFDV+RDF+TANMIG AGLLGKSWPDLDMLPLGWLTD
Sbjct: 259 VSGLANMYRITGDDWDSWNDIVSHFDVTRDFATANMIGTAGLLGKSWPDLDMLPLGWLTD 318

Query: 323 PEVSNS------------------------------------------------------ 382
           P  +N                                                       
Sbjct: 319 PGSNNGPHRTTNLNINEQRTQMTLWSISKSPIMFGGDLRNIDNTTFSIITNPTLLEINAF 378

Query: 383 ---------------REQIVKWHSRHVEASASPILGLTKCAYSDTAGWIIESLNRGLEKI 442
                          R++IVKWHSR +E SAS ILGLTKCAYSDT GWI ESLN GLEKI
Sbjct: 379 SSNNMEFLKIASTNFRKRIVKWHSRGLETSASRILGLTKCAYSDTTGWITESLNEGLEKI 438

Query: 443 CWKENPEHEYQTPFCLYKRGSRVAIDKEATTRHDQVELLSFPTSTVDVCLDATPKRKRSS 502
           CWKENPEHE QTPFCLYKRGSRVAIDKEA TR DQVELLSFPTS+VDVCLDATPKRK SS
Sbjct: 439 CWKENPEHESQTPFCLYKRGSRVAIDKEAATRRDQVELLSFPTSSVDVCLDATPKRKHSS 498

Query: 503 EEFTRGSFFPCRRHSNQKWDLYANGTLANHYSGHCAIVKYNRANALPTGVRSWVATGRGG 562
           E   RGSFFPC+ H NQKWDLYANGTLANHYSGHCAIVKYN+A ++PTG RSWVA GRGG
Sbjct: 499 EAIMRGSFFPCKGHENQKWDLYANGTLANHYSGHCAIVKYNKAKSIPTGARSWVAAGRGG 558

Query: 563 EIYVAFFNLNNVKTVISAKISDLAEVLPGKKLGQTSCKCREEWTGKDFGLISESIAAPVE 566
           E+YVAFFNLNN KTVIS KISDLA+ LPGKKLG  SCKCREEW+GKDFGL+S+ IAAPVE
Sbjct: 559 EVYVAFFNLNNAKTVISVKISDLAQALPGKKLGSNSCKCREEWSGKDFGLVSDLIAAPVE 618

BLAST of HG10005100 vs. ExPASy TrEMBL
Match: A0A1S3B9Q1 (Alpha-galactosidase OS=Cucumis melo OX=3656 GN=LOC103487727 PE=3 SV=1)

HSP 1 Score: 1003.0 bits (2592), Expect = 4.8e-289
Identity = 485/612 (79.25%), Positives = 511/612 (83.50%), Query Frame = 0

Query: 23  VSSQTGPERAALPPRGWNSYDSFSWIISEEEFLKNVEIVANRLKSQGYEYVVVDYLWYRK 82
           VSSQTGPERAALPPRGWNSYDSFSWIISEEEFL NVEIVAN+LKS+GYEYV+VDYLWYRK
Sbjct: 19  VSSQTGPERAALPPRGWNSYDSFSWIISEEEFLNNVEIVANKLKSKGYEYVIVDYLWYRK 78

Query: 83  KVPGAYTDSLGFDVIDEWGRMVPDPVRWPSSQGGKGFSEVAKKVHDMGLKFGIHVMRGIS 142
           KVPGAYTDSLGFDVIDEWGRM PDPVRWPSSQGGKGFSEVAKKVH MGLKFGIHVMRGIS
Sbjct: 79  KVPGAYTDSLGFDVIDEWGRMAPDPVRWPSSQGGKGFSEVAKKVHAMGLKFGIHVMRGIS 138

Query: 143 TQAVNANTPILDISKGDAYVESGKKWFASDIGIKSRACGWMHNGFMSVDVKSGAGKAFLR 202
           TQAVNANTPILDISKGDAYVESGKKW ASDIGIKSRACGWMHNGFMSV+VKSGAGKAFLR
Sbjct: 139 TQAVNANTPILDISKGDAYVESGKKWLASDIGIKSRACGWMHNGFMSVNVKSGAGKAFLR 198

Query: 203 SLYQQYADWGVDFVKHDCVFGDDLDLDEITFVSDVLKQLNSPIMYSLSPGTSVTPAMAKA 262
           SLYQQYADWGVDFVKHDCVFGDDLDLDEITFVSDVLKQLNS I+YSLSPGTS TPAMAKA
Sbjct: 199 SLYQQYADWGVDFVKHDCVFGDDLDLDEITFVSDVLKQLNSTIVYSLSPGTSATPAMAKA 258

Query: 263 VSGLANMYRITGDDWDTWNDIVSHFDVSRDFSTANMIGAAGLLGKSWPDLDMLPLGWLTD 322
           VSGLANMYRITGDDWDTWNDIVSHFDV+RDF+TANMIG AGLLGKSWPDLDMLPLGWLTD
Sbjct: 259 VSGLANMYRITGDDWDTWNDIVSHFDVTRDFATANMIGTAGLLGKSWPDLDMLPLGWLTD 318

Query: 323 PEVSNS------------------------------------------------------ 382
           P  +N                                                       
Sbjct: 319 PGSNNGPHRTTNLNIDEQRTQMTLWSISKSPIMFGGDLRNIDNTTFSIITNPTLLEINSF 378

Query: 383 ---------------REQIVKWHSRHVEASASPILGLTKCAYSDTAGWIIESLNRGLEKI 442
                          R++IVKWHSR +EASASPILGLTKCAYSDT GWI +S+++GLEKI
Sbjct: 379 SSNNMEFLKIASTNFRKRIVKWHSRGLEASASPILGLTKCAYSDTTGWITKSVDQGLEKI 438

Query: 443 CWKENPEHEYQTPFCLYKRGSRVAIDKEATTRHDQVELLSFPTSTVDVCLDATPKRKRSS 502
           CWK NPEHEYQTPFCLYKRGSRVAIDKEA T  DQVELLSF TS+V+VCLDATPKRK SS
Sbjct: 439 CWKANPEHEYQTPFCLYKRGSRVAIDKEAATHRDQVELLSFSTSSVEVCLDATPKRKHSS 498

Query: 503 EEFTRGSFFPCRRHSNQKWDLYANGTLANHYSGHCAIVKYNRANALPTGVRSWVATGRGG 562
           E   RGSFFPC+RH NQKWDLYANGTLANH SGHCAIVKY +A A+PTGVRSWVATGRGG
Sbjct: 499 EAIMRGSFFPCKRHENQKWDLYANGTLANHNSGHCAIVKYKQAKAIPTGVRSWVATGRGG 558

Query: 563 EIYVAFFNLNNVKTVISAKISDLAEVLPGKKLGQTSCKCREEWTGKDFGLISESIAAPVE 566
           E+YVAFFNLNNVKTVISAKISDLA+ LPGKKLG  SCK REEW+GKDFGL+S+ IAAPVE
Sbjct: 559 EVYVAFFNLNNVKTVISAKISDLAQALPGKKLGPNSCKYREEWSGKDFGLVSDLIAAPVE 618

BLAST of HG10005100 vs. ExPASy TrEMBL
Match: A0A5A7UYS7 (Alpha-galactosidase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold501G001900 PE=3 SV=1)

HSP 1 Score: 1001.5 bits (2588), Expect = 1.4e-288
Identity = 484/612 (79.08%), Positives = 510/612 (83.33%), Query Frame = 0

Query: 23  VSSQTGPERAALPPRGWNSYDSFSWIISEEEFLKNVEIVANRLKSQGYEYVVVDYLWYRK 82
           VSSQTGPERAALPPRGWNSYDSFSWIISEEEFL NVEIVAN+LKS+GYEYV+VDYLWYRK
Sbjct: 19  VSSQTGPERAALPPRGWNSYDSFSWIISEEEFLNNVEIVANKLKSKGYEYVIVDYLWYRK 78

Query: 83  KVPGAYTDSLGFDVIDEWGRMVPDPVRWPSSQGGKGFSEVAKKVHDMGLKFGIHVMRGIS 142
           KVPGAYTDSLGFDVIDEWGRM PDPVRWPSSQGGKGFSEVAKKVH MGLKFGIHVMRGIS
Sbjct: 79  KVPGAYTDSLGFDVIDEWGRMAPDPVRWPSSQGGKGFSEVAKKVHAMGLKFGIHVMRGIS 138

Query: 143 TQAVNANTPILDISKGDAYVESGKKWFASDIGIKSRACGWMHNGFMSVDVKSGAGKAFLR 202
           TQAVNANTPILDISKGDAYVESGKKW ASDIGIKSRACGWMHNGFMSV+VKSGAGKAFLR
Sbjct: 139 TQAVNANTPILDISKGDAYVESGKKWLASDIGIKSRACGWMHNGFMSVNVKSGAGKAFLR 198

Query: 203 SLYQQYADWGVDFVKHDCVFGDDLDLDEITFVSDVLKQLNSPIMYSLSPGTSVTPAMAKA 262
           SLYQQYADWGVDFVKHDCVFGDDLDLDEITFVSDVLKQLNS I+YSLSPGTS TPAMAKA
Sbjct: 199 SLYQQYADWGVDFVKHDCVFGDDLDLDEITFVSDVLKQLNSTIVYSLSPGTSATPAMAKA 258

Query: 263 VSGLANMYRITGDDWDTWNDIVSHFDVSRDFSTANMIGAAGLLGKSWPDLDMLPLGWLTD 322
           VSGLANMYRITGDDWDTWNDIVSHFDV+RDF+TANMIG  GLLGKSWPDLDMLPLGWLTD
Sbjct: 259 VSGLANMYRITGDDWDTWNDIVSHFDVTRDFATANMIGTTGLLGKSWPDLDMLPLGWLTD 318

Query: 323 PEVSNS------------------------------------------------------ 382
           P  +N                                                       
Sbjct: 319 PGSNNGPHRTTNLNIDEQRTQMTLWSISKSPIMFGGDLRNIDNTTFSIITNPTLLEINSF 378

Query: 383 ---------------REQIVKWHSRHVEASASPILGLTKCAYSDTAGWIIESLNRGLEKI 442
                          R++IVKWHSR +EASASPILGLTKCAYSDT GWI +S+++GLEKI
Sbjct: 379 SSNNMEFLKIASTNFRKRIVKWHSRGLEASASPILGLTKCAYSDTTGWITKSVDQGLEKI 438

Query: 443 CWKENPEHEYQTPFCLYKRGSRVAIDKEATTRHDQVELLSFPTSTVDVCLDATPKRKRSS 502
           CWK NPEHEYQTPFCLYKRGSRVAIDKEA T  DQVELLSF TS+V+VCLDATPKRK SS
Sbjct: 439 CWKANPEHEYQTPFCLYKRGSRVAIDKEAATHRDQVELLSFSTSSVEVCLDATPKRKHSS 498

Query: 503 EEFTRGSFFPCRRHSNQKWDLYANGTLANHYSGHCAIVKYNRANALPTGVRSWVATGRGG 562
           E   RGSFFPC+RH NQKWDLYANGTLANH SGHCAIVKY +A A+PTGVRSWVATGRGG
Sbjct: 499 EAIMRGSFFPCKRHENQKWDLYANGTLANHNSGHCAIVKYKQAKAIPTGVRSWVATGRGG 558

Query: 563 EIYVAFFNLNNVKTVISAKISDLAEVLPGKKLGQTSCKCREEWTGKDFGLISESIAAPVE 566
           E+YVAFFNLNNVKTVISAKISDLA+ LPGKKLG  SCK REEW+GKDFGL+S+ IAAPVE
Sbjct: 559 EVYVAFFNLNNVKTVISAKISDLAQALPGKKLGPNSCKYREEWSGKDFGLVSDLIAAPVE 618
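
The percentages on each HSP summary line follow directly from the reported counts (for example, 484 identical residues over an alignment length of 612). The sketch below re-derives them from such a line. The function name `parse_hsp_summary` and the returned dictionary keys are illustrative assumptions, not part of any BLAST tool, and the label of the second field is matched loosely so that spelling variants in exported text are tolerated.

```python
import re

# Matches "Identity = a/b (p%), <Pos...> = c/d (q%)" on an HSP summary line.
# The second label is matched as "Pos" plus any word characters so that
# spelling variants of "Positives" in exported text are accepted.
HSP_LINE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Pos\w*\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return counts and recomputed percentages for one HSP summary line."""
    match = HSP_LINE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: " + line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        # Percentages recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages as printed on the line, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }
```

Comparing the recomputed and reported percentages is a quick consistency check when alignment summaries have been copied out of a web page.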

BLAST of HG10005100 vs. ExPASy TrEMBL
Match: A0A6J1GJU2 (Alpha-galactosidase OS=Cucurbita moschata OX=3662 GN=LOC111454511 PE=3 SV=1)

HSP 1 Score: 954.9 bits (2467), Expect = 1.5e-274
Identity = 465/630 (73.81%), Positives = 498/630 (79.05%), Query Frame = 0

Query: 14  CIRTLICF-----RVSSQTGPERAALPPRGWNSYDSFSWIISEEEFLKNVEIVANRLKSQ 73
           C     CF     RVSSQTGPERAALPPRGWNSYDSF W ISE+EFL N EIVA RL S+
Sbjct: 10  CFCLFFCFGLLFNRVSSQTGPERAALPPRGWNSYDSFCWTISEKEFLDNAEIVAKRLNSK 69

Query: 74  GYEYVVVDYLWYRKKVPGAYTDSLGFDVIDEWGRMVPDPVRWPSSQGGKGFSEVAKKVHD 133
           GYEYVVVDYLWYRKKVPGAY DSLGFDVIDEWGR+VPDPVRWPSSQGGKGF+EVAKKVHD
Sbjct: 70  GYEYVVVDYLWYRKKVPGAYVDSLGFDVIDEWGRIVPDPVRWPSSQGGKGFTEVAKKVHD 129

Query: 134 MGLKFGIHVMRGISTQAVNANTPILDISKGDAYVESGKKWFASDIGIKSRACGWMHNGFM 193
           MGLKFGIHVMRGISTQAVNANTPILD+SKG AYVESG+KWFASDIGIKSR+C WMHNGFM
Sbjct: 130 MGLKFGIHVMRGISTQAVNANTPILDVSKGGAYVESGRKWFASDIGIKSRSCAWMHNGFM 189

Query: 194 SVDVKSGAGKAFLRSLYQQYADWGVDFVKHDCVFGDDLDLDEITFVSDVLKQLNSPIMYS 253
           SV+V SGAGKAFLRSLYQQ+ADWGVDFVKHDCVFGDDLDL EI+FVSDVLKQLN PI+YS
Sbjct: 190 SVNVNSGAGKAFLRSLYQQFADWGVDFVKHDCVFGDDLDLPEISFVSDVLKQLNRPILYS 249

Query: 254 LSPGTSVTPAMAKAVSGLANMYRITGDDWDTWNDIVSHFDVSRDFSTANMIGAAGLLGKS 313
           LSPGTSVTPAMAKAVSGL NMYRITGDDWDTWNDIVSHFDV+RDFSTANMIG  GLLGKS
Sbjct: 250 LSPGTSVTPAMAKAVSGLVNMYRITGDDWDTWNDIVSHFDVTRDFSTANMIGTTGLLGKS 309

Query: 314 WPDLDMLPLGWLTD---------------------------------------------- 373
           WPDLDMLPLGWLTD                                              
Sbjct: 310 WPDLDMLPLGWLTDQGSNDGPHRRSNLNINEQRTQMTLWCMSKSPIMYGGDLRNIDDMTY 369

Query: 374 ---------------------------PEVSNSREQIVKWHSRHVEASASPILGLTKCAY 433
                                       +VS  REQI+KWH R+++ S SPILGLTKCA 
Sbjct: 370 SIITNPTLLEINSFSSNNMEFLKIAATTKVSKCREQIMKWHFRYLKPSVSPILGLTKCAD 429

Query: 434 SDTAGWIIESLNRGLEKICWKENPEHEYQTPFCLYKRGSRVAIDKEATTRHDQVELLSFP 493
           S+  GWI E L+RG+EKICWK NPE EYQTP CLYKRGSRVAIDKEATTRHDQVELLS  
Sbjct: 430 SNAVGWITERLDRGVEKICWKANPELEYQTPLCLYKRGSRVAIDKEATTRHDQVELLSSH 489

Query: 494 TSTVDVCLDATPKRKRSSEEFTRGSFFPCRRHSNQKWDLYANGTLANHYSGHCAIVKYNR 553
           TSTVDVCLDAT KRK SSEEF RGSFFPC RH NQKWDLY+NGTL NH+SGHCAIVK N+
Sbjct: 490 TSTVDVCLDATSKRKHSSEEFMRGSFFPCSRHENQKWDLYSNGTLGNHHSGHCAIVKTNQ 549

Query: 554 ANALPTGVRSWVATGRGGEIYVAFFNLNNVKTVISAKISDLAEVLPGKKLGQTSCKCREE 566
           A A PTGVRSW+ATGRGGEIYVAFFNLN+VKTVISAKISDL EV+PGKKL  +SCKC+EE
Sbjct: 550 AKASPTGVRSWIATGRGGEIYVAFFNLNDVKTVISAKISDLGEVVPGKKLDHSSCKCKEE 609

BLAST of HG10005100 vs. ExPASy TrEMBL
Match: A0A6J1KK35 (Alpha-galactosidase OS=Cucurbita maxima OX=3661 GN=LOC111496438 PE=3 SV=1)

HSP 1 Score: 954.1 bits (2465), Expect = 2.6e-274
Identity = 462/630 (73.33%), Positives = 499/630 (79.21%), Query Frame = 0

Query: 14  CIRTLICF-----RVSSQTGPERAALPPRGWNSYDSFSWIISEEEFLKNVEIVANRLKSQ 73
           C+    CF     RVSSQTGPERAALPPRGWNSYDSF W ISE+EFL N EIVA RL S+
Sbjct: 9   CVCLFFCFGLLFNRVSSQTGPERAALPPRGWNSYDSFCWTISEKEFLDNAEIVAKRLNSK 68

Query: 74  GYEYVVVDYLWYRKKVPGAYTDSLGFDVIDEWGRMVPDPVRWPSSQGGKGFSEVAKKVHD 133
           GYEYVVVDYLWYRKKVPGAY DSLGFDVIDEWGRMVPDPVRWPSSQGGKGF+EVAKKVHD
Sbjct: 69  GYEYVVVDYLWYRKKVPGAYVDSLGFDVIDEWGRMVPDPVRWPSSQGGKGFTEVAKKVHD 128

Query: 134 MGLKFGIHVMRGISTQAVNANTPILDISKGDAYVESGKKWFASDIGIKSRACGWMHNGFM 193
           MGLKFGIHVMRGISTQAVNANTPILD+SKG AYVESG+KWFASDIGIKSR+C WMHNGFM
Sbjct: 129 MGLKFGIHVMRGISTQAVNANTPILDVSKGGAYVESGRKWFASDIGIKSRSCAWMHNGFM 188

Query: 194 SVDVKSGAGKAFLRSLYQQYADWGVDFVKHDCVFGDDLDLDEITFVSDVLKQLNSPIMYS 253
           SV+V SGAGKAFLRSLYQQ+ADWGVDFVKHDCVFGDDLDL EI+FVSDVLKQLN PI+YS
Sbjct: 189 SVNVNSGAGKAFLRSLYQQFADWGVDFVKHDCVFGDDLDLPEISFVSDVLKQLNRPILYS 248

Query: 254 LSPGTSVTPAMAKAVSGLANMYRITGDDWDTWNDIVSHFDVSRDFSTANMIGAAGLLGKS 313
           LSPGTSVTPAMAKAVSGL NMYRITGDDWDTWNDIVSHFDV+RDFSTANMIG  GLLGKS
Sbjct: 249 LSPGTSVTPAMAKAVSGLVNMYRITGDDWDTWNDIVSHFDVTRDFSTANMIGTTGLLGKS 308

Query: 314 WPDLDMLPLGWLTD---------------------------------------------- 373
           WPDLDMLPLGWLTD                                              
Sbjct: 309 WPDLDMLPLGWLTDQGSNDGPHRRSNLNINEQRTQMTLWCMSKSPIMYGGDLRNIDDMTY 368

Query: 374 ---------------------------PEVSNSREQIVKWHSRHVEASASPILGLTKCAY 433
                                       +VS  REQI+KWH R+++ S SPIL LTKCA 
Sbjct: 369 SIITNPTLLEINSFSTNNMEFLKIAATTKVSKCREQIMKWHFRYLKPSVSPILSLTKCAD 428

Query: 434 SDTAGWIIESLNRGLEKICWKENPEHEYQTPFCLYKRGSRVAIDKEATTRHDQVELLSFP 493
           S+  GWI ++L+RG+EKICWK NPEHEYQTP CLYKRGSRVAIDKEATTRHDQVELLS  
Sbjct: 429 SNAVGWITKTLDRGVEKICWKANPEHEYQTPLCLYKRGSRVAIDKEATTRHDQVELLSSR 488

Query: 494 TSTVDVCLDATPKRKRSSEEFTRGSFFPCRRHSNQKWDLYANGTLANHYSGHCAIVKYNR 553
           TS VDVCLDAT KRKRSSEEF RGSFFPC RH NQKWDLY+NGTL NH+SGHCAIVK ++
Sbjct: 489 TSAVDVCLDATSKRKRSSEEFMRGSFFPCSRHENQKWDLYSNGTLGNHHSGHCAIVKTSQ 548

Query: 554 ANALPTGVRSWVATGRGGEIYVAFFNLNNVKTVISAKISDLAEVLPGKKLGQTSCKCREE 566
           A   PTGVRSW+ATGRGGEIYVAFFNLN+VKTVISAKISDL EV+PGKKL  +SCKC+EE
Sbjct: 549 AKGSPTGVRSWIATGRGGEIYVAFFNLNDVKTVISAKISDLGEVVPGKKLDHSSCKCKEE 608

BLAST of HG10005100 vs. TAIR 10
Match: AT3G26380.1 (Melibiase family protein)

HSP 1 Score: 646.4 bits (1666), Expect = 2.2e-185
Identity = 325/636 (51.10%), Positives = 409/636 (64.31%), Query Frame = 0

Query: 9   SLIIWCIRTLICFRVS--SQTGPERAALPPRGWNSYDSFSWIISEEEFLKNVEIVANRLK 68
           S + + I  L  F +S  +++  + A+ PPRGWNSYDSF W ISE EFL++ EIV+ RL 
Sbjct: 14  STVFFIIFNLSIFSLSIEARSRQQHASFPPRGWNSYDSFCWTISEAEFLQSAEIVSKRLL 73

Query: 69  SQGYEYVVVDYLWYRKKVPGAYTDSLGFDVIDEWGRMVPDPVRWPSSQGGKGFSEVAKKV 128
             GY+YVVVDYLWYRKKV GAY DSLGFDVIDEWGR+ PDP RWPSS+GGKGF+EVA+KV
Sbjct: 74  PHGYQYVVVDYLWYRKKVEGAYVDSLGFDVIDEWGRLHPDPGRWPSSRGGKGFTEVAEKV 133

Query: 129 HDMGLKFGIHVMRGISTQAVNANTPILDISKGDAYVESGKKWFASDIGIKSRACGWMHNG 188
           H MGLKFGIHVM GISTQA NAN+ ++D  KG AY ESG++W A DIGIK RAC WM +G
Sbjct: 134 HRMGLKFGIHVMGGISTQAYNANSLVMDSVKGGAYEESGRQWRAKDIGIKERACVWMSHG 193

Query: 189 FMSVDVKSGAGKAFLRSLYQQYADWGVDFVKHDCVFGDDLDLDEITFVSDVLKQLNSPIM 248
           FMSV+ K GAGKAFLRSLY+QYA+WGVDF+KHDCVFG D +++EIT+VS+VLK+L+ P++
Sbjct: 194 FMSVNTKLGAGKAFLRSLYRQYAEWGVDFIKHDCVFGTDFNIEEITYVSEVLKELDRPVL 253

Query: 249 YSLSPGTSVTPAMAKAVSGLANMYRITGDDWDTWNDIVSHFDVSRDFSTANMIGAAGLLG 308
           YS+SPGTSVTP MAK VS L NMYRITGDDWDTW D+ +HFD+SRD S ++MIGA GL G
Sbjct: 254 YSISPGTSVTPTMAKEVSQLVNMYRITGDDWDTWKDVTAHFDISRDLSASSMIGARGLQG 313

Query: 309 KSWPDLDMLPLGWLTDP------------EVSNSREQIVKW------------------- 368
           KSWPDLDMLPLGWLTD              +   + Q+  W                   
Sbjct: 314 KSWPDLDMLPLGWLTDQGSNVGPHRACNLNLEEQKSQMTLWSIAKSPLMFGGDVRNLDAT 373

Query: 369 -----------------------------------------HSRHVEASASPILGLTKCA 428
                                                    H      S     GLT C 
Sbjct: 374 TYNLITNPTLLEINSYSSNNKEFPYITATRVSRNKHKGYPHHPTGKNISTKHAFGLTSCK 433

Query: 429 YSDTAGWIIESLNRGLEKICWKENPEHEYQTPFCLYKRGSRVAIDKEATTRHDQV---EL 488
                 W I   NRG  +ICW ++   + + PFCLY R + +A DK+   +H+Q+   +L
Sbjct: 434 EQKANTWFIVDKNRG--QICWNQHSSEKLEKPFCLYNRKALLASDKK--LKHNQLYQGKL 493

Query: 489 LSFPTSTVDVCLDATPKRKRSSEEFTRGSFFPCRRHSNQKWDLYANGTLANHYSGHCAIV 548
                     CL A+ ++K +S+++++G+  PC+  +NQ W+L++NGTL N YSG CA++
Sbjct: 494 HLHTNDKAQSCLAASSQQKLTSKDYSQGALSPCKLDANQMWELHSNGTLENSYSGLCAVL 553

Query: 549 K-YNRANALPTGVRSWVATGRGGEIYVAFFNLNNVKTVISAKISDLAEVLPGKK-LGQTS 566
                A A   GVRSW+ATGR GE+YVAFFNLN  KT ISAKISD+A  L GKK L   S
Sbjct: 554 NPVKAAEASSNGVRSWIATGRRGEVYVAFFNLNQEKTKISAKISDIATALRGKKNLVGAS 613

BLAST of HG10005100 vs. TAIR 10
Match: AT5G08380.1 (alpha-galactosidase 1)

HSP 1 Score: 52.4 bits (124), Expect = 1.4e-06
Identity = 36/110 (32.73%), Positives = 46/110 (41.82%), Query Frame = 0

Query: 35  PPRGWNSYDSFSWIISEEEFLKNVE-IVANRLKSQGYEYVVVDYLWYRKKVPGAYTDSLG 94
           PP GWNS++ FS  I E+   +  + +V   L   GY YV +D  W              
Sbjct: 54  PPMGWNSWNHFSCNIDEKMIKETADALVTTGLSKLGYNYVNIDDCWAEIS---------- 113

Query: 95  FDVIDEWGRMVPDPVRWPSSQGGKGFSEVAKKVHDMGLKFGIHVMRGIST 144
               D  G +VP    +PS     G   VA  VH  GLK GI+   G  T
Sbjct: 114 ---RDSKGSLVPKKSTFPS-----GIKAVADYVHSKGLKLGIYSDAGYFT 145

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_038886461.1	4.2e-303	81.24	alpha-galactosidase mel1 [Benincasa hispida]
XP_011649499.1	4.0e-290	79.25	uncharacterized protein LOC101206292 [Cucumis sativus] >XP_031736692.1 uncharact...
XP_008444380.1	9.9e-289	79.25	PREDICTED: uncharacterized protein LOC103487727 [Cucumis melo] >XP_008444381.1 P...
KAA0061052.1	2.9e-288	79.08	Melibiase domain-containing protein [Cucumis melo var. makuwa]
XP_022951774.1	3.1e-274	73.81	uncharacterized protein LOC111454511 [Cucurbita moschata] >XP_022951775.1 unchar...

Match Name	E-value	Identity	Description
Q9URZ0	4.0e-14	25.41	Alpha-galactosidase mel1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) ...
A7XZT2	1.6e-10	37.17	Probable alpha-galactosidase B OS=Talaromyces emersonii OX=68825 PE=3 SV=1
B0Y224	2.0e-10	32.79	Probable alpha-galactosidase B OS=Neosartorya fumigata (strain CEA10 / CBS 144.8...
A1D0A3	3.5e-10	36.28	Probable alpha-galactosidase B OS=Neosartorya fischeri (strain ATCC 1020 / DSM 3...
A1C5D3	5.9e-10	36.28	Probable alpha-galactosidase B OS=Aspergillus clavatus (strain ATCC 1007 / CBS 5...

Match Name	E-value	Identity	Description
A0A0A0LK91	2.0e-290	79.25	Alpha-galactosidase OS=Cucumis sativus OX=3659 GN=Csa_2G349640 PE=3 SV=1
A0A1S3B9Q1	4.8e-289	79.25	Alpha-galactosidase OS=Cucumis melo OX=3656 GN=LOC103487727 PE=3 SV=1
A0A5A7UYS7	1.4e-288	79.08	Alpha-galactosidase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold501G...
A0A6J1GJU2	1.5e-274	73.81	Alpha-galactosidase OS=Cucurbita moschata OX=3662 GN=LOC111454511 PE=3 SV=1
A0A6J1KK35	2.6e-274	73.33	Alpha-galactosidase OS=Cucurbita maxima OX=3661 GN=LOC111496438 PE=3 SV=1

Match Name	E-value	Identity	Description
AT3G26380.1	2.2e-185	51.10	Melibiase family protein
AT5G08380.1	1.4e-06	32.73	alpha-galactosidase 1
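
The two TAIR 10 hits differ by roughly 180 orders of magnitude in E-value, so a simple threshold separates them. The sketch below ranks summary rows by E-value and drops those above a cutoff. The cutoff of 1e-10 and the names `TAIR_HITS` and `filter_hits` are assumptions made for illustration; this page does not state any significance threshold.

```python
# Rows copied from the TAIR 10 summary above: (match name, E-value, identity %).
TAIR_HITS = [
    ("AT5G08380.1", "1.4e-06", 32.73),
    ("AT3G26380.1", "2.2e-185", 51.10),
]


def filter_hits(hits, max_evalue=1e-10):
    """Keep hits with E-value <= max_evalue, most significant first.

    max_evalue is an illustrative cutoff chosen for this sketch only.
    """
    kept = [
        (name, float(evalue), identity)
        for name, evalue, identity in hits
        if float(evalue) <= max_evalue
    ]
    # A smaller E-value means a more significant hit, so sort ascending.
    return sorted(kept, key=lambda hit: hit[1])


for name, evalue, identity in filter_hits(TAIR_HITS):
    print(name, evalue, identity)
```

E-values are kept as strings in the input and converted with `float`, which handles the scientific notation used in the tables.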
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR013780	Glycosyl hydrolase, all-beta	GENE3D	2.60.40.1180		coord: 474..564, e-value: 8.3E-19, score: 69.3
IPR041233	Alpha galactosidase, C-terminal beta sandwich domain	PFAM	PF17801	Melibiase_C	coord: 483..561, e-value: 4.2E-7, score: 29.9
IPR002241	Glycoside hydrolase, family 27	PFAM	PF16499	Melibiase_2	coord: 48..144, e-value: 4.6E-5, score: 22.7; coord: 205..335, e-value: 5.7E-5, score: 22.4
IPR002241	Glycoside hydrolase, family 27	PANTHER	PTHR11452	ALPHA-GALACTOSIDASE/ALPHA-N-ACETYLGALACTOSAMINIDASE	coord: 9..325; coord: 442..562
IPR002241	Glycoside hydrolase, family 27	CDD	cd14792	GH27	coord: 35..320, e-value: 7.94156E-59, score: 195.082
IPR013785	Aldolase-type TIM barrel	GENE3D	3.20.20.70	Aldolase class I	coord: 26..342, e-value: 1.2E-88, score: 299.6
None	No IPR available	PANTHER	PTHR11452:SF42	ALPHA-GALACTOSIDASE	coord: 9..325; coord: 442..562
None	No IPR available	SUPERFAMILY	51011	Glycosyl hydrolase domain	coord: 482..562
IPR035992	Ricin B-like lectins	SUPERFAMILY	50370	Ricin B-like lectins	coord: 420..469
IPR017853	Glycoside hydrolase superfamily	SUPERFAMILY	51445	(Trans)glycosidases	coord: 29..321
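
Many of the InterPro alignments overlap, so the number of residues covered by at least one annotation is smaller than the sum of the individual ranges. The sketch below merges the `coord` ranges listed above into non-overlapping segments. The names `INTERPRO_COORDS`, `merge_ranges`, and `covered_residues` are illustrative, and the coordinates are assumed here to be 1-based and inclusive.

```python
# All "coord" ranges from the InterPro annotations above, as (start, end).
INTERPRO_COORDS = [
    (474, 564), (483, 561), (48, 144), (205, 335), (9, 325), (442, 562),
    (35, 320), (26, 342), (9, 325), (442, 562), (482, 562), (420, 469),
    (29, 321),
]


def merge_ranges(ranges):
    """Merge overlapping or directly adjacent inclusive ranges."""
    merged = []
    for start, end in sorted(ranges):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous segment: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def covered_residues(ranges):
    """Count positions covered by at least one range (inclusive ends)."""
    return sum(end - start + 1 for start, end in merge_ranges(ranges))


print(merge_ranges(INTERPRO_COORDS))      # [(9, 342), (420, 564)]
print(covered_residues(INTERPRO_COORDS))  # 479
```

Under these assumptions the annotations collapse into two segments, 9..342 and 420..564, which together span 479 positions.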

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
HG10005100.1	HG10005100.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0005975	carbohydrate metabolic process
cellular_component	GO:0016020	membrane
molecular_function	GO:0052692	raffinose alpha-galactosidase activity
molecular_function	GO:0003824	catalytic activity
molecular_function	GO:0004553	hydrolase activity, hydrolyzing O-glycosyl compounds