HG10003917 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10003917
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionBeta-galactosidase
LocationChr08: 11587341 .. 11594348 (+)
RNA-Seq ExpressionHG10003917
SyntenyHG10003917
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGAAATCAGAGTGCGGCTTCATCAGCTACAGATTACCGTGCATTGTTAATTCTGCTTTGGTATTGACGGCGGCGTTGTTTTTCCACTGCGTCCTCGGCGGCAACGACGACGGCAGCGGCAGCGTAACATACGATGGAAGATCCTTGATCGTCAATGGCGAACACAAACTTCTCTTCTCTGGTTCTATTCACTATCCGCGGAGTACTCCCGATGTAAGTTCTTCGATTAATTATTTCCTACTTTTGTTCATCCGTCTTTGATCTTCTGTGATGATTCCGATTGTTTCTGTTTCTAATTTGAGTTTAACCATTAATGGCGGTCGAATTTTCACTGTTTGAAATTTTTGGTGGGAAAATGAAAATTCTATTGGATCATTCGTTTCACATTGTAGTGTAAGAGGTGTTAGGTTAGTTCAAAAAGGGAAATCAAATGCGTTTGGTTATTGATAACCGTATTGGTTTCATCATTCAATGTTTGTTCGTTTTTATAAATTTACTTTTTTAAAAATGGAATTTCGGTGTCTTTTTTCCTCTTTTTCAGAAACGGTGGTGTTTAGATTGATAAAGTTTCCCGCCAACGAAGGAAGTAAATTACTGTTTTTAGCAGTATTTCAAATTTTTGGAGTAAAATTCACTTCACAAGGAATAAAATACTTTTAGCATTTTTATATATTTTGTTTCAGTGTGATTCAATTCAAATGAATTTCTTGGATTTAGGTACAATCAAACGCAGCAATTCTGAGATAAGTCTATATTGCATTCCCAAAAAGCATGTAGTTATAAATATAATAAGAATAATTATTTTTTTTAAAAAAAAATAAGAATTATAAATATAAACCAAGGCTTTCTTGCATATTGATACTATGGTTCACCTATGTTTTTAAGAATAATTTTTTCAAGTTTAACAAAGAAGGTAGACTTACTTTATCTACCGAACTATAATTGTTAGGATTTGATAATTATATATTCAAATATATTTTGGTTATACTTTTTTCAGGTTATTGTATTTTGAAATATTATGGTTTTTTTTTAATGTAATGAAATATTTTGTTTTGATCTACCAATTATGTATACAAATTTATTATGTTAAATTTATATTCTGCATTGAAAAATTAATTGAAATTTTAGTACAACATTAAATAATATTTGTATGTGTATATGTGGTTGTGTTTTGAATCACTTCAATTGTTAACGGATAGTCTAAAATTGAGAATTGAGCTAGAAAGTGAGTGATTTTGGGCCCGTTTGGATTGAATTTCTAAGTGTTTATAAATATATATATATATATTTTTTTGCACTTATAAACACTTTCTTCTTCTTCCTCTTTATTTAGTTATTTTGACTAGCTATATGGGGCCGAAGAATCAAATCTTTTACTTTGAAATCAATAATATAAACTTTATATCAATTTTATTAATCATGCTCATGTTGGTATAATGATTTCATTTTAGTTTTTAAGATTTATGTTGGTTTCTTTTCAACTTCTTTTTAGTATGATTTTCTTTTTTATAAAAATGAAATAAATAGATTCTTAGTCAAATTTAAAAAATATATGTATGGCAAAACTATGACTCTAGTTGAAAACCATTTAGTTTTAAATTTTAAGCATATAAATACTACATCCACTTCTAAGTTTCTTTGTTTTGTCATCAATAAAGTTTTAAAAAATAAAGTCAAATTTAAAAAATTTTAAAAAATAATTGTGATTAAATTTTAAAAAACAAGATTAAAAACAAAATCGTTATCAATTTGAGTATACTCTTTTTTAGCTTTTACAACTTTGTATAAATTGTGAAAACTTTAGTTGAAATTAGATAATAAAACAAAGGTAATACTTAAAAGAAAATTACCACATAAAATGGGATCCATCTTTATACATCTTCTTTGATCATTCGCATGAAAAACAAATTAAAAGGAAATATTTTGAAAAGGAAATAACAAGAATTTCCACGTGAAAATTATTATTCAACAAATTTATGATGCACTTAGTATTTTTGGTAAAAATAAAATAACAGTGAAAATGCTGTTTATATACATCATGTGATTCACGTCACCGCACAATGGGCGAATGGGAAAGCTCTCCGTGAAATTCAAAAATCAAGCGTAAAAAGGCTGGATCCCAATAAGAGCTCAATTCCCTACCCAGTTCAATTGGGCCTGGGTCAACCCTAAAGCCCACAAGAATTTTTATCTGAGCTTCAGTTGTTTGAAAGAGTTTGAATTAAGACAATGTTTTTTTTTTTTTTTTTTTTTTTTGTTAATTTGTTATACAGATGTGGCCTTCTTTGATAGCCAAAGCAAAGGAAGGTGGAATAGACGTTATACAAACCTACGTGTTTTGGAACCTTCACGAACCCCAACAAGGAACGGTAACTCAATCTCTCTCTACTTTCTAATATATCTCTAAAAGTTTAGGTGACATGGCAATGCATACAAATTAAAATCTTTTTTTTTTTTTTTTTTTTTTTTCAGTATGAATTTAGTGGAAGACGTGATATAGTAAGATTTGTGAAGGAAATACAAGCACAAGGATTATATGCTTGCCTTAGGATTGGACCCTTCATTGAGGCGGAATGGAGTTATGGGTAAAGACAAATTAAACTTTTTGTCACTACAATTACCTTATAAATAACCATTTCTTAGTCCTTGTCTTAACTTTCTAATACAGTGGTCTACCATTTTGGTTACATGATGTTCCTGGAATTGTTTATCGATCTGACAACGAACCATTCAAGGTATATACATATATATATTTTCTAGTGATATAATTAAATGAAAAAAAAAATTAGCTGCTGAAAAATGCATGGAATTAGGTGCTAATCATTGCCATAATTAAATGAGCAATATTTAATGCATTTAAGTTATACCTGTGTATGTTTATGCAATCAAACTCCAATCATACAATGAAAAAAATTTTAAAAATTAAAAAAGATTAAATTTTTATCACGTGAGAAATTGTTAAATCGCCGATTGACCAAAATTTTAAGTCGATGGTTAGGCAAATTTAAGATCATATTATCAGTGTGGTATAAGATGCACCTGAATATTTTGAGTTTGAATAACCAAGTGTGAGGCGTTACGTTTTCATGGTCACTTTGCTACTTTAATGTCAGCTTCACATGCAAAACTTCACCACCAAGATCGTGAATCTGATGAAGTCGGAAGGCCTATATGCTTCACAAGGAGGACCGATTATACTTTCACAGGTTTGAGCCGATGGGCAATAGCAAAAACATTATTGAATGTCTTTTCAATTCATGTTCCAACCTTTTATTCTTTGTGCACCTTGTTAAGAGAACTCATGGATGGAATTGCAGATTGAGAATGAATACACATTGGTGGAGGCAGCCTTTCGTGAGAAGGGACCGCCTTATGTTCTATGGGCAGCAGACATGGCGGTCAGCCTAAAGACTGGTGTGCCATGGAGCATGTGCAAGCAAAACGATGCACCTGACCCTGTGGTATATATGTAAAAATAACTCAATTATACAGTAATTATTTTTCCTCTCTGTCTCTCACGTGTTCATTTCTTATATTACTCATGGCAGATAAATACTTGTAATGGGATGAGATGTGGAGAAACATTCACAGGACCCAACTCGCCTAATAAGCCATCTATTTGGACTGAGAATTGGACTAGTTTGTAAGTTTTGTGGTTGCGTCGTATCTAAATATTTTGCATAAATTTTTTCACAAAAATGGAGGAAGAATTTGCCTTTTCTTTGAATGTTTGAGATTGTTAATTACTTGAATATCGGATTGAAATTTGTTGTTAAAAGATGGATACACTTTATGTTTGTTTGGGGCGTGGGATTACAGTTATCAAACATATGGTGAGGAACCGTACATAAGATCAGCAGAGGAGATTGCATTTCATGTTGCTCTTTTCATTGCTGCAAAGAATGGGACTTATGTTAATTATTATATGGTATTAATACAAATCTCCCTCTCAATTTTTTGTTTGTTCCTATGAACCTAACTAATGATGTATGTTTAACTTTAACCAAAATCAATGTTCATTTTTTCCCCACTAGTATCATGGAGGAACCAACTTCGGAAGATCAGCCTCTGCATTTATGATCACAGGTTACTATGATCAAGCCCCCCTGGATGAATATGGTATGGCATTTTCTAAAATTATCCATCTTAATTAAATTAGCTCAACTCTTCGAATTTTGCATCAACTGAAAACTGTTTACCACATTTTTACTGCAGGTTTAACTAGGGAACCGAAATGGGGTCATCTTAAAGAATTACATGCTGCAGTTAAGCTATGCTCCACGCCTTTGCTCACTGGAACTAAATCCAATTTCTCATTGGGCCAATCACTAGAAGTAAGTTTTAATTTTAATGTTTCTAACTCCATTCCATGCAAAATGAAAAGCTGTTCTAGGATAATGGATAAGAATGAAAATTATTACTTATTGAGGTTTCAATTAGGGAAGATTATAGATGACTTACAGTCCATTTGATTTTTAAAAATGGTGCATAATTTCTCACAATTTCTTAATCATAGTATTCATCTTTACTAAGTCCAACATTTGAGTTCTTGACAAGATACTTTTAAAATACTTTTAAGGAGGAACGAGGAGAGGAAAATAATGTTTATAAGTTTAATTTTTTCAAAAAAGCTAAAAAATAAATTATTACTAAAGAAAACTTTAGTCATGATTTTTTTTTTCCTTTTTAATGAAATGATGATTACCAAACAGAACTTTATTTATCGGTTTTTTTTTTTCTTACGGCAGGCAATTGTGTTCAAAACGGAATCAGGAGAATGTGCTGCCTTTTTGGTGAATAAGGGAGCTACAGATACGAATGTCCTCTTTCAAAATATTACTTACGAGTTACCTCTCAGTTCCATCAGCATATTACCAGATTGTAAAAATGTGGCCTTCAATACTAGAAGGGTAAGATTTACACCGACAACTAGCGGGAAGTTTTAGCAGCAATTTTATTTTTCCTCATCAATTTCAACTGTTGAAATATATTTGATCAGGTAAGCGTACAACATAATACAAGATCAATGAAGGTAGTACAAAAGTTTGGGTCGTCTGAAGAATGGCAAGAGTTCAAGGAACCGATACCTAACTTTGATGAAACCAAATTAAGGGCAAGCGAGTTATTAGAGCAGATGGGTACTACAAAAGACAGATCAGATTATCTTTGGTACACTTTTAGGTAAATTATATGTGAGAAATCAAGCGGCTGATATTAACCTCAATTAATCAAAAGAAAAAGTCCGAATGAATTAGTTCAATTTTTGTTGCTTGTCAATAATGGTTGTTGAAATGCAGGGTTCAACAGGATTCTCAGGACTCTCAACAAACACTTGAAGTGGATTCTCGAGCTCATGCCTTGCATGCATTTGTCAATGGGGTTTATGCAGGCTAGTCTTATCTTCCTTTTTATGGTTGAAACTCGAAGAAACCACAATATCGAGCATTTTACTATTGATGTTTTATAGGCTCCGCCCATGGAACTTACAAAGAAAAAGGTTTCTCTCTGGAGAATAATATTACATTGAGAAATGGCATCAACAACATCTCATTGCTCAGTGTGATGGTTGGGCTACCGGTACGGATCTTTTAATCTTCCCTTCCATAGTTTATGTAATTTTTTTTTGAAACACCATAGTTTATGTATTATAACTTACGCTAATAGGTATTTGTAACGTAATGCACCACACATCATAGTGAATTCTGATAGTTTTAATTTGAAAAAGGATTCTGGAGCATTTCTTGAGACCAGAGTTGCTGGACTACGAAGAGTGAGAATTCAAGGCGAGGATTTCTCGGAACAACCTTGGGGATACAAGGTTCTTTAATTTCTAAACTCCTAATATGAGTACATAAACACTTAAGCTGGTGTAATATTCATAAAAAAATACACATGCACAAACACATACTTAAAACTAACCTCTTTTTTATATTTTTGTTGATTTTTCCAAACAAATTTGCAAAGGTTGGCCTATCAGGAGAGCAATCACAAATATTTTTAGACACGGGGTCAAGCGATGTTCAGTGGAGCAGGTTAGGAAACTCTTCTCAGCCGCTCACATGGTACAAGGTACTTTCGATAATCAATTGTTTAGACTACTTTCTACAGCAACTATCTTGAACTCAAATGTTCATTAACTGACCTTTGCCATTGTCATTCATCGCTAATTGAACAGACTCAGTTTGATGCGCCTCCTGGTGATGACCCCATTGCACTGAATCTTGGTTCAATGGGGAAGGGTGCAGCGTGGGTTAACGGCTGGGGCATTGGCCGGTACTGGGTCTCCTTCCTCACCCCAAAAGGGGAGCCTTCACAGAAATGGTAAACAGTTTACTCAGCTTAATATTATACCTTATCTTTCAAATTTCCTTCCACTAGTTTCCTGTATCGATAACACAATTGTCTCTCCTTTCAATCCAGGTATAATGTACCACGTTCCTTCCTCAAGCCAGCTGGAAACCAGTTGGTTATTCTTGAAGAAGAAACAGGAAACCCGATTGGGATATCTTTGGATTCTGTTTCAATTACCAAAACATGTGGGCAAGTATCTGAATCACATTATCCTTTAGTAGCTTCATGGATGGGTGCAAAGAAACAGACGGCGAGTGGTACAAAGAACAGAACCAGAAGGCCTAAGGTTCGATTAAGCTGCCCTACAAACAAGAACATCTCCAACATCTTATTTGCAAGCTTTGGGACCCCGTCCGGTGACTGTCAAAGCTACGCTATTGGAATGTGTCACTCGCCAAACTCCAGAGCCATTGTAGAGCATGTGAGTTTTTTCCTGAAACGCATTAGAAAATCCTATGAAGAAAGTGGCATCTTGCATTTAATTAAATAAAAGGTAATCAAGACAATATTTACTCACATGAAAATTGTGTTGTGATGTTTCAGGCGTGTCTAGGGAGGGCCAAATGCTCGATTCCAATCTCAAATCTGAACTTTAGAGGCGATCCATGTCCACATGTCACCAAAACATTGTTGGTCGATGCACAATGCACATAA

mRNA sequence

ATGGCGAAATCAGAGTGCGGCTTCATCAGCTACAGATTACCGTGCATTGTTAATTCTGCTTTGGTATTGACGGCGGCGTTGTTTTTCCACTGCGTCCTCGGCGGCAACGACGACGGCAGCGGCAGCGTAACATACGATGGAAGATCCTTGATCGTCAATGGCGAACACAAACTTCTCTTCTCTGGTTCTATTCACTATCCGCGGAGTACTCCCGATTATGAATTTAGTGGAAGACGTGATATAGTAAGATTTGTGAAGGAAATACAAGCACAAGGATTATATGCTTGCCTTAGGATTGGACCCTTCATTGAGGCGGAATGGAGTTATGGTGGTCTACCATTTTGGTTACATGATGTTCCTGGAATTGTTTATCGATCTGACAACGAACCATTCAAGCTTCACATGCAAAACTTCACCACCAAGATCGTGAATCTGATGAAGTCGGAAGGCCTATATGCTTCACAAGGAGGACCGATTATACTTTCACAGATTGAGAATGAATACACATTGGTGGAGGCAGCCTTTCGTGAGAAGGGACCGCCTTATGTTCTATGGGCAGCAGACATGGCGGTCAGCCTAAAGACTGGTGTGCCATGGAGCATGTGCAAGCAAAACGATGCACCTGACCCTGTGATAAATACTTGTAATGGGATGAGATGTGGAGAAACATTCACAGGACCCAACTCGCCTAATAAGCCATCTATTTGGACTGAGAATTGGACTAGTTTTTATCAAACATATGGTGAGGAACCGTACATAAGATCAGCAGAGGAGATTGCATTTCATGTTGCTCTTTTCATTGCTGCAAAGAATGGGACTTATGTTAATTATTATATGTATCATGGAGGAACCAACTTCGGAAGATCAGCCTCTGCATTTATGATCACAGGTTACTATGATCAAGCCCCCCTGGATGAATATGGTTTAACTAGGGAACCGAAATGGGGTCATCTTAAAGAATTACATGCTGCAGTTAAGCTATGCTCCACGCCTTTGCTCACTGGAACTAAATCCAATTTCTCATTGGGCCAATCACTAGAAGCAATTGTGTTCAAAACGGAATCAGGAGAATGTGCTGCCTTTTTGGTGAATAAGGGAGCTACAGATACGAATGTCCTCTTTCAAAATATTACTTACGAGTTACCTCTCAGTTCCATCAGCATATTACCAGATTGTAAAAATGTGGCCTTCAATACTAGAAGGGTAAGCGTACAACATAATACAAGATCAATGAAGGTAGTACAAAAGTTTGGGTCGTCTGAAGAATGGCAAGAGTTCAAGGAACCGATACCTAACTTTGATGAAACCAAATTAAGGGCAAGCGAGTTATTAGAGCAGATGGGTACTACAAAAGACAGATCAGATTATCTTTGGTACACTTTTAGGGTTCAACAGGATTCTCAGGACTCTCAACAAACACTTGAAGTGGATTCTCGAGCTCATGCCTTGCATGCATTTGTCAATGGGGATTCTGGAGCATTTCTTGAGACCAGAGTTGCTGGACTACGAAGAGTGAGAATTCAAGGCGAGGATTTCTCGGAACAACCTTGGGGATACAAGGTTGGCCTATCAGGAGAGCAATCACAAATATTTTTAGACACGGGGTCAAGCGATGTTCAGTGGAGCAGGTTAGGAAACTCTTCTCAGCCGCTCACATGGTACAAGACTCAGTTTGATGCGCCTCCTGGTGATGACCCCATTGCACTGAATCTTGGTTCAATGGGGAAGGGTGCAGCGTGGGTTAACGGCTGGGGCATTGGCCGGTACTGGGTCTCCTTCCTCACCCCAAAAGGGGAGCCTTCACAGAAATGGTATAATGTACCACGTTCCTTCCTCAAGCCAGCTGGAAACCAGTTGGTTATTCTTGAAGAAGAAACAGGAAACCCGATTGGGATATCTTTGGATTCTGTTTCAATTACCAAAACATGTGGGCAAGTATCTGAATCACATTATCCTTTAGTAGCTTCATGGATGGGTGCAAAGAAACAGACGGCGAGTGGTACAAAGAACAGAACCAGAAGGCCTAAGGTTCGATTAAGCTGCCCTACAAACAAGAACATCTCCAACATCTTATTTGCAAGCTTTGGGACCCCGTCCGGTGACTGTCAAAGCTACGCTATTGGAATGTGTCACTCGCCAAACTCCAGAGCCATTGTAGAGCATGCGTGTCTAGGGAGGGCCAAATGCTCGATTCCAATCTCAAATCTGAACTTTAGAGGCGATCCATGTCCACATGTCACCAAAACATTGTTGGTCGATGCACAATGCACATAA

Coding sequence (CDS)

ATGGCGAAATCAGAGTGCGGCTTCATCAGCTACAGATTACCGTGCATTGTTAATTCTGCTTTGGTATTGACGGCGGCGTTGTTTTTCCACTGCGTCCTCGGCGGCAACGACGACGGCAGCGGCAGCGTAACATACGATGGAAGATCCTTGATCGTCAATGGCGAACACAAACTTCTCTTCTCTGGTTCTATTCACTATCCGCGGAGTACTCCCGATTATGAATTTAGTGGAAGACGTGATATAGTAAGATTTGTGAAGGAAATACAAGCACAAGGATTATATGCTTGCCTTAGGATTGGACCCTTCATTGAGGCGGAATGGAGTTATGGTGGTCTACCATTTTGGTTACATGATGTTCCTGGAATTGTTTATCGATCTGACAACGAACCATTCAAGCTTCACATGCAAAACTTCACCACCAAGATCGTGAATCTGATGAAGTCGGAAGGCCTATATGCTTCACAAGGAGGACCGATTATACTTTCACAGATTGAGAATGAATACACATTGGTGGAGGCAGCCTTTCGTGAGAAGGGACCGCCTTATGTTCTATGGGCAGCAGACATGGCGGTCAGCCTAAAGACTGGTGTGCCATGGAGCATGTGCAAGCAAAACGATGCACCTGACCCTGTGATAAATACTTGTAATGGGATGAGATGTGGAGAAACATTCACAGGACCCAACTCGCCTAATAAGCCATCTATTTGGACTGAGAATTGGACTAGTTTTTATCAAACATATGGTGAGGAACCGTACATAAGATCAGCAGAGGAGATTGCATTTCATGTTGCTCTTTTCATTGCTGCAAAGAATGGGACTTATGTTAATTATTATATGTATCATGGAGGAACCAACTTCGGAAGATCAGCCTCTGCATTTATGATCACAGGTTACTATGATCAAGCCCCCCTGGATGAATATGGTTTAACTAGGGAACCGAAATGGGGTCATCTTAAAGAATTACATGCTGCAGTTAAGCTATGCTCCACGCCTTTGCTCACTGGAACTAAATCCAATTTCTCATTGGGCCAATCACTAGAAGCAATTGTGTTCAAAACGGAATCAGGAGAATGTGCTGCCTTTTTGGTGAATAAGGGAGCTACAGATACGAATGTCCTCTTTCAAAATATTACTTACGAGTTACCTCTCAGTTCCATCAGCATATTACCAGATTGTAAAAATGTGGCCTTCAATACTAGAAGGGTAAGCGTACAACATAATACAAGATCAATGAAGGTAGTACAAAAGTTTGGGTCGTCTGAAGAATGGCAAGAGTTCAAGGAACCGATACCTAACTTTGATGAAACCAAATTAAGGGCAAGCGAGTTATTAGAGCAGATGGGTACTACAAAAGACAGATCAGATTATCTTTGGTACACTTTTAGGGTTCAACAGGATTCTCAGGACTCTCAACAAACACTTGAAGTGGATTCTCGAGCTCATGCCTTGCATGCATTTGTCAATGGGGATTCTGGAGCATTTCTTGAGACCAGAGTTGCTGGACTACGAAGAGTGAGAATTCAAGGCGAGGATTTCTCGGAACAACCTTGGGGATACAAGGTTGGCCTATCAGGAGAGCAATCACAAATATTTTTAGACACGGGGTCAAGCGATGTTCAGTGGAGCAGGTTAGGAAACTCTTCTCAGCCGCTCACATGGTACAAGACTCAGTTTGATGCGCCTCCTGGTGATGACCCCATTGCACTGAATCTTGGTTCAATGGGGAAGGGTGCAGCGTGGGTTAACGGCTGGGGCATTGGCCGGTACTGGGTCTCCTTCCTCACCCCAAAAGGGGAGCCTTCACAGAAATGGTATAATGTACCACGTTCCTTCCTCAAGCCAGCTGGAAACCAGTTGGTTATTCTTGAAGAAGAAACAGGAAACCCGATTGGGATATCTTTGGATTCTGTTTCAATTACCAAAACATGTGGGCAAGTATCTGAATCACATTATCCTTTAGTAGCTTCATGGATGGGTGCAAAGAAACAGACGGCGAGTGGTACAAAGAACAGAACCAGAAGGCCTAAGGTTCGATTAAGCTGCCCTACAAACAAGAACATCTCCAACATCTTATTTGCAAGCTTTGGGACCCCGTCCGGTGACTGTCAAAGCTACGCTATTGGAATGTGTCACTCGCCAAACTCCAGAGCCATTGTAGAGCATGCGTGTCTAGGGAGGGCCAAATGCTCGATTCCAATCTCAAATCTGAACTTTAGAGGCGATCCATGTCCACATGTCACCAAAACATTGTTGGTCGATGCACAATGCACATAA

Protein sequence

MAKSECGFISYRLPCIVNSALVLTAALFFHCVLGGNDDGSGSVTYDGRSLIVNGEHKLLFSGSIHYPRSTPDYEFSGRRDIVRFVKEIQAQGLYACLRIGPFIEAEWSYGGLPFWLHDVPGIVYRSDNEPFKLHMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYTLVEAAFREKGPPYVLWAADMAVSLKTGVPWSMCKQNDAPDPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIAAKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQAPLDEYGLTREPKWGHLKELHAAVKLCSTPLLTGTKSNFSLGQSLEAIVFKTESGECAAFLVNKGATDTNVLFQNITYELPLSSISILPDCKNVAFNTRRVSVQHNTRSMKVVQKFGSSEEWQEFKEPIPNFDETKLRASELLEQMGTTKDRSDYLWYTFRVQQDSQDSQQTLEVDSRAHALHAFVNGDSGAFLETRVAGLRRVRIQGEDFSEQPWGYKVGLSGEQSQIFLDTGSSDVQWSRLGNSSQPLTWYKTQFDAPPGDDPIALNLGSMGKGAAWVNGWGIGRYWVSFLTPKGEPSQKWYNVPRSFLKPAGNQLVILEEETGNPIGISLDSVSITKTCGQVSESHYPLVASWMGAKKQTASGTKNRTRRPKVRLSCPTNKNISNILFASFGTPSGDCQSYAIGMCHSPNSRAIVEHACLGRAKCSIPISNLNFRGDPCPHVTKTLLVDAQCT
Homology
BLAST of HG10003917 vs. NCBI nr
Match: XP_038885233.1 (beta-galactosidase 16 isoform X1 [Benincasa hispida])

HSP 1 Score: 1428.3 bits (3696), Expect = 0.0e+00
Identity = 709/829 (85.52%), Postives = 723/829 (87.21%), Query Frame = 0

Query: 1   MAKSECGFISYRLPCIVNSALVLTAALFFHCVLGGNDDGSGSVTYDGRSLIVNGEHKLLF 60
           MAKSE G I       +NSALVLTAAL  HCVLGGND GSG VTYDGRSLIVNGEHKLLF
Sbjct: 1   MAKSEYGII-----VCINSALVLTAAL-LHCVLGGNDGGSGGVTYDGRSLIVNGEHKLLF 60

Query: 61  SGSIHYPRSTPD--------------------------------YEFSGRRDIVRFVKEI 120
           SGSIHYPRSTPD                                YEFSGRRDIVRFVKEI
Sbjct: 61  SGSIHYPRSTPDMWPSLIAKAKEGGIDVIQTYVFWNLHEPQQGTYEFSGRRDIVRFVKEI 120

Query: 121 QAQGLYACLRIGPFIEAEWSYGGLPFWLHDVPGIVYRSDNEPFKLHMQNFTTKIVNLMKS 180
           QAQGLYA LRIGPFIEAEW+YGGLPFWLHDVPGIVYRSDNEPFK HMQNFTTKIVN+MKS
Sbjct: 121 QAQGLYATLRIGPFIEAEWNYGGLPFWLHDVPGIVYRSDNEPFKFHMQNFTTKIVNMMKS 180

Query: 181 EGLYASQGGPIILSQIENEYTLVEAAFREKGPPYVLWAADMAVSLKTGVPWSMCKQNDAP 240
           EGLYASQGGPIILSQIENEYTLVEAAFREKGPPYVLWAA+MAVSL+TGVPWSMCKQNDAP
Sbjct: 181 EGLYASQGGPIILSQIENEYTLVEAAFREKGPPYVLWAANMAVSLQTGVPWSMCKQNDAP 240

Query: 241 DPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIA 300
           DPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGEEPY+RSAEEIAFHVALFIA
Sbjct: 241 DPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGEEPYMRSAEEIAFHVALFIA 300

Query: 301 AKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQAPLDEYGLTREPKWGHLKELHAAVKLC 360
           AKNGTYVNYYMYHGGTNFGRSASA MITGYYDQAPLDEYGL REPKWGHLKELHAAVKLC
Sbjct: 301 AKNGTYVNYYMYHGGTNFGRSASALMITGYYDQAPLDEYGLIREPKWGHLKELHAAVKLC 360

Query: 361 STPLLTGTKSNFSLGQSLEAIVFKTESGECAAFLVNKGATDTNVLFQNITYELPLSSISI 420
           STPLLTGTKSNFSLGQSLEAIVF+TES ECAAFLVNKGATDTNVLFQN+TYELPLSSISI
Sbjct: 361 STPLLTGTKSNFSLGQSLEAIVFETESEECAAFLVNKGATDTNVLFQNVTYELPLSSISI 420

Query: 421 LPDCKNVAFNTRRVSVQHNTRSMKVVQKFGSSEEWQEFKEPIPNFDETKLRASELLEQMG 480
           LPDCKNVAFNTRRVSV HNTRSMK VQKF  SEEWQEFKE IPNFDET+LRA+ELLE  G
Sbjct: 421 LPDCKNVAFNTRRVSVHHNTRSMKAVQKFELSEEWQEFKELIPNFDETELRANELLEHTG 480

Query: 481 TTKDRSDYLWYTFRVQQDSQDSQQTLEVDSRAHALHAFVNG------------------- 540
           TTKDRSDYLWYTFR+QQDS DSQQTLEVDSRAHALHAFVNG                   
Sbjct: 481 TTKDRSDYLWYTFRIQQDSPDSQQTLEVDSRAHALHAFVNGVYAGSAHGTYKERSFSLEN 540

Query: 541 ---------------------DSGAFLETRVAGLRRVRIQGEDFSEQPWGYKVGLSGEQS 600
                                DSGAFLE+RVAGLRRVRIQGEDFSEQ WGYKVGLSGEQS
Sbjct: 541 NITLTNGINNISLLSVMVGLPDSGAFLESRVAGLRRVRIQGEDFSEQSWGYKVGLSGEQS 600

Query: 601 QIFLDTGSSDVQWSRLGNSSQPLTWYKTQFDAPPGDDPIALNLGSMGKGAAWVNGWGIGR 660
           QIFLDTGSS VQWSRLGNSSQPLTWYKTQFDAPPGDDPIALNLGSMGKGA WVNG GIGR
Sbjct: 601 QIFLDTGSSSVQWSRLGNSSQPLTWYKTQFDAPPGDDPIALNLGSMGKGAVWVNGRGIGR 660

Query: 661 YWVSFLTPKGEPSQKWYNVPRSFLKPAGNQLVILEEETGNPIGISLDSVSITKTCGQVSE 720
           YWVSFLTPKGEPSQKWYNVPRSFL P GNQLVILEEETGNPIGISLDSVSITKTCGQVSE
Sbjct: 661 YWVSFLTPKGEPSQKWYNVPRSFLMPTGNQLVILEEETGNPIGISLDSVSITKTCGQVSE 720

Query: 721 SHYPLVASWMGAKKQTASGTKNRTRRPKVRLSCPTNKNISNILFASFGTPSGDCQSYAIG 758
           SHYPLVASWMGAKKQ  SGTKNRTRRPKVRLSCPTNKNISNILFASFGTPSGDCQSYAIG
Sbjct: 721 SHYPLVASWMGAKKQRTSGTKNRTRRPKVRLSCPTNKNISNILFASFGTPSGDCQSYAIG 780

BLAST of HG10003917 vs. NCBI nr
Match: XP_011657429.1 (beta-galactosidase 16 isoform X2 [Cucumis sativus])

HSP 1 Score: 1381.7 bits (3575), Expect = 0.0e+00
Identity = 691/829 (83.35%), Postives = 713/829 (86.01%), Query Frame = 0

Query: 1   MAKSECGFISYRLPCIVNSALVLTAALFFHCVLGGNDDGSGSVTYDGRSLIVNGEHKLLF 60
           MAKSE G +   + CI ++ L+LTA L FHCVLGGN DG G+ TYDGRSLIVNGEHKLLF
Sbjct: 1   MAKSESGIMI--IICIYSAYLLLTAPL-FHCVLGGN-DGIGA-TYDGRSLIVNGEHKLLF 60

Query: 61  SGSIHYPRSTPD--------------------------------YEFSGRRDIVRFVKEI 120
           SGSIHYPRSTPD                                YEFSGRRDIVRFVKEI
Sbjct: 61  SGSIHYPRSTPDMWPSLIAKAKEGGIDVIQTYVFWNLHEPQQGTYEFSGRRDIVRFVKEI 120

Query: 121 QAQGLYACLRIGPFIEAEWSYGGLPFWLHDVPGIVYRSDNEPFKLHMQNFTTKIVNLMKS 180
           QAQGLYACLRIGPFIEAEWSYGGLPFWLHDV GIVYRSDNEPFKLHMQNFTTKIVN+MKS
Sbjct: 121 QAQGLYACLRIGPFIEAEWSYGGLPFWLHDVLGIVYRSDNEPFKLHMQNFTTKIVNMMKS 180

Query: 181 EGLYASQGGPIILSQIENEYTLVEAAFREKGPPYVLWAADMAVSLKTGVPWSMCKQNDAP 240
           EGLYASQGGPIILSQIENEYTLVEAAF EKGPPYV WAA MAVSL+TGVPWSMCKQNDAP
Sbjct: 181 EGLYASQGGPIILSQIENEYTLVEAAFGEKGPPYVQWAAKMAVSLQTGVPWSMCKQNDAP 240

Query: 241 DPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIA 300
           DPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIA
Sbjct: 241 DPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIA 300

Query: 301 AKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQAPLDEYGLTREPKWGHLKELHAAVKLC 360
           AKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQ+PLDEYGLTREPKWGHLKELHAAVKLC
Sbjct: 301 AKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQSPLDEYGLTREPKWGHLKELHAAVKLC 360

Query: 361 STPLLTGTKSNFSLGQSLEAIVFKTESGECAAFLVNKGATDTNVLFQNITYELPLSSISI 420
           STPLLTGTKSNFSLGQS+EAIVFKTES ECAAFLVN+GA D+NVLFQN+TYELPL SISI
Sbjct: 361 STPLLTGTKSNFSLGQSVEAIVFKTESNECAAFLVNRGAIDSNVLFQNVTYELPLGSISI 420

Query: 421 LPDCKNVAFNTRRVSVQHNTRSMKVVQKFGSSEEWQEFKEPIPNFDETKLRASELLEQMG 480
           LPDCKNVAFNTRRVSVQHNTRSM  VQKF    EW+EFKEPIPN D+T+LRA+ELLE MG
Sbjct: 421 LPDCKNVAFNTRRVSVQHNTRSMMAVQKF-DLLEWEEFKEPIPNIDDTELRANELLEHMG 480

Query: 481 TTKDRSDYLWYTFRVQQDSQDSQQTLEVDSRAHALHAFVNG------------------- 540
           TTKDRSDYLWYTFRVQQDS DSQQTLEVDSRAHALHAFVNG                   
Sbjct: 481 TTKDRSDYLWYTFRVQQDSPDSQQTLEVDSRAHALHAFVNGDYAGSAHGIYKEKGFSLAK 540

Query: 541 ---------------------DSGAFLETRVAGLRRVRIQGEDFSEQPWGYKVGLSGEQS 600
                                DSGAFLETRVAGLRRV IQGEDFSEQ WGYKVGLSGEQS
Sbjct: 541 NITLRNGINNISLLSVMVGLPDSGAFLETRVAGLRRVGIQGEDFSEQHWGYKVGLSGEQS 600

Query: 601 QIFLDTGSSDVQWSRLGNSSQPLTWYKTQFDAPPGDDPIALNLGSMGKGAAWVNGWGIGR 660
           QIFLDTGSS+VQWSRLGNSSQPLTWYKTQFDAPPGDDPIALNLGSMGKGA WVNG GIGR
Sbjct: 601 QIFLDTGSSNVQWSRLGNSSQPLTWYKTQFDAPPGDDPIALNLGSMGKGAVWVNGRGIGR 660

Query: 661 YWVSFLTPKGEPSQKWYNVPRSFLKPAGNQLVILEEETGNPIGISLDSVSITKTCGQVSE 720
           YWVSFLTPKGEPSQKWYNVPRSFLKP  NQLVILEEETGNP+ ISLDSV ITKTCGQVSE
Sbjct: 661 YWVSFLTPKGEPSQKWYNVPRSFLKPTDNQLVILEEETGNPVEISLDSVLITKTCGQVSE 720

Query: 721 SHYPLVASWMGAKKQTASGTKNRTRRPKVRLSCPTNKNISNILFASFGTPSGDCQSYAIG 758
           SHYPLVASWMGAKKQ     KNRTRRPKV+LSCP+ K ISNILFASFGTPSGDCQSYAIG
Sbjct: 721 SHYPLVASWMGAKKQKVRRVKNRTRRPKVQLSCPSKKKISNILFASFGTPSGDCQSYAIG 780

BLAST of HG10003917 vs. NCBI nr
Match: XP_008466742.1 (PREDICTED: beta-galactosidase 16 isoform X2 [Cucumis melo])

HSP 1 Score: 1378.2 bits (3566), Expect = 0.0e+00
Identity = 687/829 (82.87%), Postives = 708/829 (85.40%), Query Frame = 0

Query: 1   MAKSECGFISYRLPCIVNSALVLTAALFFHCVLGGNDDGSGSVTYDGRSLIVNGEHKLLF 60
           MAKSE   +       + SAL  TA L FHCVLGGN DG G VTYDGRSLIVNGEHKLLF
Sbjct: 1   MAKSESCIV-----ICIYSALFFTAPL-FHCVLGGN-DGIG-VTYDGRSLIVNGEHKLLF 60

Query: 61  SGSIHYPRSTPD--------------------------------YEFSGRRDIVRFVKEI 120
           SGSIHYPRSTPD                                YEFSGRRDIV+FVKEI
Sbjct: 61  SGSIHYPRSTPDMWPSLIAKAKEGGIDVIQTYVFWNLHEPQQGTYEFSGRRDIVQFVKEI 120

Query: 121 QAQGLYACLRIGPFIEAEWSYGGLPFWLHDVPGIVYRSDNEPFKLHMQNFTTKIVNLMKS 180
           QAQGLYACLRIGPFIEAEWSYGGLPFWLHDVPGIVYRSDNEPFKLHMQNFTTKIVN+MKS
Sbjct: 121 QAQGLYACLRIGPFIEAEWSYGGLPFWLHDVPGIVYRSDNEPFKLHMQNFTTKIVNMMKS 180

Query: 181 EGLYASQGGPIILSQIENEYTLVEAAFREKGPPYVLWAADMAVSLKTGVPWSMCKQNDAP 240
           EGLYASQGGPIILSQIENEYTLVEAAF EKGPPYV WAA MAVSL+TGVPWSMCKQNDAP
Sbjct: 181 EGLYASQGGPIILSQIENEYTLVEAAFGEKGPPYVQWAAKMAVSLQTGVPWSMCKQNDAP 240

Query: 241 DPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIA 300
           DPVINTCNGMRCGETFTGPNSP KPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIA
Sbjct: 241 DPVINTCNGMRCGETFTGPNSPTKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIA 300

Query: 301 AKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQAPLDEYGLTREPKWGHLKELHAAVKLC 360
           AKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQ PLDEYGLTREPKWGHLKELHAAVKLC
Sbjct: 301 AKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQGPLDEYGLTREPKWGHLKELHAAVKLC 360

Query: 361 STPLLTGTKSNFSLGQSLEAIVFKTESGECAAFLVNKGATDTNVLFQNITYELPLSSISI 420
           STPLLTGTK NFSLGQSLEAIVFKTES ECAAFLVN+GA DT+VLFQN+TYELPL SISI
Sbjct: 361 STPLLTGTKFNFSLGQSLEAIVFKTESDECAAFLVNRGAIDTDVLFQNVTYELPLGSISI 420

Query: 421 LPDCKNVAFNTRRVSVQHNTRSMKVVQKFGSSEEWQEFKEPIPNFDETKLRASELLEQMG 480
           LPDCKNVAFNTRRVSVQ NTRSM  VQKF SSEEW+EFKEPIPNF++T+LRA++LLE MG
Sbjct: 421 LPDCKNVAFNTRRVSVQRNTRSMMTVQKFDSSEEWEEFKEPIPNFEDTELRANKLLEHMG 480

Query: 481 TTKDRSDYLWYTFRVQQDSQDSQQTLEVDSRAHALHAFVNG------------------- 540
           TTKDRSDYLWYTFRVQQDS DSQQT EVDSRAHALHAFVNG                   
Sbjct: 481 TTKDRSDYLWYTFRVQQDSPDSQQTFEVDSRAHALHAFVNGDYAGSAHGTYKEKGFSLVN 540

Query: 541 ---------------------DSGAFLETRVAGLRRVRIQGEDFSEQPWGYKVGLSGEQS 600
                                DSGAFLETRVAGLRRV IQGEDFSEQPWGYKVGLSGEQS
Sbjct: 541 NITLRNGINNISLLSVMVGLPDSGAFLETRVAGLRRVGIQGEDFSEQPWGYKVGLSGEQS 600

Query: 601 QIFLDTGSSDVQWSRLGNSSQPLTWYKTQFDAPPGDDPIALNLGSMGKGAAWVNGWGIGR 660
           QIFLDTGSS+VQWSRLGNSSQPLTWYKT+FDAPPGDDPIALNLGSMGKGAAWVNG GIGR
Sbjct: 601 QIFLDTGSSNVQWSRLGNSSQPLTWYKTRFDAPPGDDPIALNLGSMGKGAAWVNGRGIGR 660

Query: 661 YWVSFLTPKGEPSQKWYNVPRSFLKPAGNQLVILEEETGNPIGISLDSVSITKTCGQVSE 720
           YWVSFLTPKGEPSQKWYNVPRSFLKP  NQLVILEEETGNP+ ISLDSV ITKTCGQVSE
Sbjct: 661 YWVSFLTPKGEPSQKWYNVPRSFLKPTDNQLVILEEETGNPVEISLDSVLITKTCGQVSE 720

Query: 721 SHYPLVASWMGAKKQTASGTKNRTRRPKVRLSCPTNKNISNILFASFGTPSGDCQSYAIG 758
           SHYPLVASWMGAKKQ     KNRTRRPKV+LSCP+ K ISNILFASFGTPSGDCQSYAIG
Sbjct: 721 SHYPLVASWMGAKKQKVRSAKNRTRRPKVQLSCPSKKKISNILFASFGTPSGDCQSYAIG 780

BLAST of HG10003917 vs. NCBI nr
Match: XP_022993850.1 (beta-galactosidase 16 [Cucurbita maxima])

HSP 1 Score: 1368.2 bits (3540), Expect = 0.0e+00
Identity = 677/829 (81.66%), Postives = 710/829 (85.65%), Query Frame = 0

Query: 1   MAKSECGFISYRLPCIVNSALVLTAALFFHCVLGGNDDGSGSVTYDGRSLIVNGEHKLLF 60
           MAKS+ G +  RL C+  SALV TAAL FHCVLGGN DGS  V+YDGRSLIVNGEHKL F
Sbjct: 1   MAKSKYGTV--RLLCV--SALVFTAAL-FHCVLGGN-DGSDGVSYDGRSLIVNGEHKLFF 60

Query: 61  SGSIHYPRSTPD--------------------------------YEFSGRRDIVRFVKEI 120
           SGSIHYPRSTPD                                YEFSGRRDIV+FVKEI
Sbjct: 61  SGSIHYPRSTPDMWPSLIAKAKEGGIDVIQTYVFWNLHEPHQGRYEFSGRRDIVKFVKEI 120

Query: 121 QAQGLYACLRIGPFIEAEWSYGGLPFWLHDVPGIVYRSDNEPFKLHMQNFTTKIVNLMKS 180
           QAQGLYACLRIGPFIEAEW+YGGLPFWLHD+ GIVYRSDNEPFK +MQNFTTKIVN+MKS
Sbjct: 121 QAQGLYACLRIGPFIEAEWNYGGLPFWLHDISGIVYRSDNEPFKFYMQNFTTKIVNMMKS 180

Query: 181 EGLYASQGGPIILSQIENEYTLVEAAFREKGPPYVLWAADMAVSLKTGVPWSMCKQNDAP 240
           EGLYASQGGPIILSQIENEYTLVEAAFREKGPPYV WAADMAVSL+TGVPWSMCKQNDAP
Sbjct: 181 EGLYASQGGPIILSQIENEYTLVEAAFREKGPPYVRWAADMAVSLQTGVPWSMCKQNDAP 240

Query: 241 DPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIA 300
           DPVINTCNGMRCGETF GPN+PNKPSIWTENWTSFYQTYG EPYIRSAEEIAFHVALFIA
Sbjct: 241 DPVINTCNGMRCGETFPGPNTPNKPSIWTENWTSFYQTYGGEPYIRSAEEIAFHVALFIA 300

Query: 301 AKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQAPLDEYGLTREPKWGHLKELHAAVKLC 360
           AKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQAPLDEYGL REPKWGHLKELHAA+KLC
Sbjct: 301 AKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQAPLDEYGLIREPKWGHLKELHAAIKLC 360

Query: 361 STPLLTGTKSNFSLGQSLEAIVFKTESGECAAFLVNKGATDTNVLFQNITYELPLSSISI 420
           S PLLTGTKSNFSLG+S+EAIVFKT+SGECAAFLVNKGATD NVLFQ++TYELPLSSISI
Sbjct: 361 SKPLLTGTKSNFSLGKSIEAIVFKTKSGECAAFLVNKGATDMNVLFQSVTYELPLSSISI 420

Query: 421 LPDCKNVAFNTRRVSVQHNTRSMKVVQKFGSSEEWQEFKEPIPNFDETKLRASELLEQMG 480
           LPDCKNVAFNTRRVSVQ+NTRSM  VQKF S+ EWQEFKE IP+FDET LRA+ELLE   
Sbjct: 421 LPDCKNVAFNTRRVSVQYNTRSMNAVQKFDSNVEWQEFKESIPSFDETDLRANELLEHTD 480

Query: 481 TTKDRSDYLWYTFRVQQDSQDSQQTLEVDSRAHALHAFVNG------------------- 540
           TTKD SDYLWYT RV+ DS DSQQTL+VDS AHA+HAFVNG                   
Sbjct: 481 TTKDSSDYLWYTLRVEADSPDSQQTLKVDSLAHAMHAFVNGLYAGSAHGTYKEKGFSLEN 540

Query: 541 ---------------------DSGAFLETRVAGLRRVRIQGEDFSEQPWGYKVGLSGEQS 600
                                DSGAFLE+RVAGLRRVRIQGEDFS QPWGYKVGLSGEQS
Sbjct: 541 NITLRNGINNISLLSVMVGLPDSGAFLESRVAGLRRVRIQGEDFSTQPWGYKVGLSGEQS 600

Query: 601 QIFLDTGSSDVQWSRLGNSSQPLTWYKTQFDAPPGDDPIALNLGSMGKGAAWVNGWGIGR 660
           QIFLD+GSS+ QWSRLG+SSQPLTWYKTQFDAPPGDDPIALNLGSMGKGA WVNGWGIGR
Sbjct: 601 QIFLDSGSSNAQWSRLGDSSQPLTWYKTQFDAPPGDDPIALNLGSMGKGAVWVNGWGIGR 660

Query: 661 YWVSFLTPKGEPSQKWYNVPRSFLKPAGNQLVILEEETGNPIGISLDSVSITKTCGQVSE 720
           YWVSFLTP GEPSQKWYNVPRSFLKP GN LVILEEETGNP+GISLDSVSI+KTCGQVSE
Sbjct: 661 YWVSFLTPTGEPSQKWYNVPRSFLKPTGNLLVILEEETGNPVGISLDSVSISKTCGQVSE 720

Query: 721 SHYPLVASWMGAKKQTASGTKNRTRRPKVRLSCPTNKNISNILFASFGTPSGDCQSYAIG 758
           SHYPLVASWM AKKQ AS TKN++RRPKVRLSCPTNKNISNILFASFGTPSGDCQSYA+G
Sbjct: 721 SHYPLVASWMSAKKQRASRTKNKSRRPKVRLSCPTNKNISNILFASFGTPSGDCQSYAVG 780

BLAST of HG10003917 vs. NCBI nr
Match: XP_031742962.1 (beta-galactosidase 16 isoform X1 [Cucumis sativus])

HSP 1 Score: 1366.3 bits (3535), Expect = 0.0e+00
Identity = 691/858 (80.54%), Postives = 713/858 (83.10%), Query Frame = 0

Query: 1   MAKSECGFISYRLPCIVNSALVLTAALFFHCVLGGNDDGSGSVTYDGRSLIVNGEHKLLF 60
           MAKSE G +   + CI ++ L+LTA L FHCVLGGN DG G+ TYDGRSLIVNGEHKLLF
Sbjct: 1   MAKSESGIMI--IICIYSAYLLLTAPL-FHCVLGGN-DGIGA-TYDGRSLIVNGEHKLLF 60

Query: 61  SGSIHYPRSTPD--------------------------------YEFSGRRDIVRFVKEI 120
           SGSIHYPRSTPD                                YEFSGRRDIVRFVKEI
Sbjct: 61  SGSIHYPRSTPDMWPSLIAKAKEGGIDVIQTYVFWNLHEPQQGTYEFSGRRDIVRFVKEI 120

Query: 121 QAQGLYACLRIGPFIEAEWSYGGLPFWLHDVPGIVYRSDNEPFKLHMQNFTTKIVNLMKS 180
           QAQGLYACLRIGPFIEAEWSYGGLPFWLHDV GIVYRSDNEPFKLHMQNFTTKIVN+MKS
Sbjct: 121 QAQGLYACLRIGPFIEAEWSYGGLPFWLHDVLGIVYRSDNEPFKLHMQNFTTKIVNMMKS 180

Query: 181 EGLYASQGGPIILSQIENEYTLVEAAFREKGPPYVLWAADMAVSLKTGVPWSMCKQNDAP 240
           EGLYASQGGPIILSQIENEYTLVEAAF EKGPPYV WAA MAVSL+TGVPWSMCKQNDAP
Sbjct: 181 EGLYASQGGPIILSQIENEYTLVEAAFGEKGPPYVQWAAKMAVSLQTGVPWSMCKQNDAP 240

Query: 241 DPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIA 300
           DPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIA
Sbjct: 241 DPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIA 300

Query: 301 AKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQAPLDEYGLTREPKWGHLKELHAAVKLC 360
           AKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQ+PLDEYGLTREPKWGHLKELHAAVKLC
Sbjct: 301 AKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQSPLDEYGLTREPKWGHLKELHAAVKLC 360

Query: 361 STPLLTGTKSNFSLGQSLEAIVFKTESGECAAFLVNKGATDTNVLFQNITYELPLSSISI 420
           STPLLTGTKSNFSLGQS+EAIVFKTES ECAAFLVN+GA D+NVLFQN+TYELPL SISI
Sbjct: 361 STPLLTGTKSNFSLGQSVEAIVFKTESNECAAFLVNRGAIDSNVLFQNVTYELPLGSISI 420

Query: 421 LPDCKNVAFNTRRVSVQHNTRSMKVVQKFGSSEEWQEFKEPIPNFDETKLRASELLEQMG 480
           LPDCKNVAFNTRRVSVQHNTRSM  VQKF    EW+EFKEPIPN D+T+LRA+ELLE MG
Sbjct: 421 LPDCKNVAFNTRRVSVQHNTRSMMAVQKF-DLLEWEEFKEPIPNIDDTELRANELLEHMG 480

Query: 481 TTKDRSDYLWYTFRVQQDSQDSQQTLEVDSRAHALHAFVNG------------------- 540
           TTKDRSDYLWYTFRVQQDS DSQQTLEVDSRAHALHAFVNG                   
Sbjct: 481 TTKDRSDYLWYTFRVQQDSPDSQQTLEVDSRAHALHAFVNGDYAGSAHGIYKEKGFSLAK 540

Query: 541 ---------------------DSGAFLETRVAGLRRVRIQGEDFSEQPWGYKVGLSGEQS 600
                                DSGAFLETRVAGLRRV IQGEDFSEQ WGYKVGLSGEQS
Sbjct: 541 NITLRNGINNISLLSVMVGLPDSGAFLETRVAGLRRVGIQGEDFSEQHWGYKVGLSGEQS 600

Query: 601 QIFLDTGSSDVQWSRLGNSSQPLTWYK-----------------------------TQFD 660
           QIFLDTGSS+VQWSRLGNSSQPLTWYK                             TQFD
Sbjct: 601 QIFLDTGSSNVQWSRLGNSSQPLTWYKVLPIINSNYLEFKCSLTDLCHCHSLLIGQTQFD 660

Query: 661 APPGDDPIALNLGSMGKGAAWVNGWGIGRYWVSFLTPKGEPSQKWYNVPRSFLKPAGNQL 720
           APPGDDPIALNLGSMGKGA WVNG GIGRYWVSFLTPKGEPSQKWYNVPRSFLKP  NQL
Sbjct: 661 APPGDDPIALNLGSMGKGAVWVNGRGIGRYWVSFLTPKGEPSQKWYNVPRSFLKPTDNQL 720

Query: 721 VILEEETGNPIGISLDSVSITKTCGQVSESHYPLVASWMGAKKQTASGTKNRTRRPKVRL 758
           VILEEETGNP+ ISLDSV ITKTCGQVSESHYPLVASWMGAKKQ     KNRTRRPKV+L
Sbjct: 721 VILEEETGNPVEISLDSVLITKTCGQVSESHYPLVASWMGAKKQKVRRVKNRTRRPKVQL 780

BLAST of HG10003917 vs. ExPASy Swiss-Prot
Match: Q8GX69 (Beta-galactosidase 16 OS=Arabidopsis thaliana OX=3702 GN=BGAL16 PE=2 SV=2)

HSP 1 Score: 923.3 bits (2385), Expect = 1.8e-267
Identity = 456/799 (57.07%), Postives = 559/799 (69.96%), Query Frame = 0

Query: 42  SVTYDGRSLIVNGEHKLLFSGSIHYPRSTP------------------------------ 101
           +VTYDGRSLI++GEHK+LFSGSIHY RSTP                              
Sbjct: 24  NVTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDVVDTYVFWNVHEPQ 83

Query: 102 --DYEFSGRRDIVRFVKEIQAQGLYACLRIGPFIEAEWSYGGLPFWLHDVPGIVYRSDNE 161
              ++FSG RDIV+F+KE++  GLY CLRIGPFI+ EWSYGGLPFWLH+V GIV+R+DNE
Sbjct: 84  QGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLPFWLHNVQGIVFRTDNE 143

Query: 162 PFKLHMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYTLVEAAFREKGPPYVLWAADM 221
           PFK HM+ +   IV LMKSE LYASQGGPIILSQIENEY +V  AFR++G  YV W A +
Sbjct: 144 PFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVGRAFRQEGKSYVKWTAKL 203

Query: 222 AVSLKTGVPWSMCKQNDAPDPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGE 281
           AV L TGVPW MCKQ+DAPDP++N CNG +CGETF GPNSPNKP+IWTENWTSFYQTYGE
Sbjct: 204 AVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIWTENWTSFYQTYGE 263

Query: 282 EPYIRSAEEIAFHVALFIAAKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQAPLDEYGL 341
           EP IRSAE+IAFHVALFI AKNG++VNYYMYHGGTNFGR+AS F+IT YYDQAPLDEYGL
Sbjct: 264 EPLIRSAEDIAFHVALFI-AKNGSFVNYYMYHGGTNFGRNASQFVITSYYDQAPLDEYGL 323

Query: 342 TREPKWGHLKELHAAVKLCSTPLLTGTKSNFSLGQSLEAIVFKTESGECAAFLVNKGATD 401
            R+PKWGHLKELHAAVKLC  PLL+G ++  SLG+   A VF  ++  CAA LVN+   +
Sbjct: 324 LRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFGKKANLCAAILVNQDKCE 383

Query: 402 TNVLFQNITYELPLSSISILPDCKNVAFNTRRVSVQHNTRSMKVVQKFGSSEEWQEFKEP 461
           + V F+N +Y L   S+S+LPDCKNVAFNT +V+ Q+NTR+ K  Q   S + W+EF E 
Sbjct: 384 STVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYNTRTRKARQNLSSPQMWEEFTET 443

Query: 462 IPNFDETKLRASELLEQMGTTKDRSDYLWYTFRVQQDSQDSQQTLEVDSRAHALHAFVNG 521
           +P+F ET +R+  LLE M TT+D SDYLW T R QQ S+ +   L+V+   HALHAFVNG
Sbjct: 444 VPSFSETSIRSESLLEHMNTTQDTSDYLWQTTRFQQ-SEGAPSVLKVNHLGHALHAFVNG 503

Query: 522 ----------------------------------------DSGAFLETRVAGLRRVRIQG 581
                                                   +SGA LE RV G R V+I  
Sbjct: 504 RFIGSMHGTFKAHRFLLEKNMSLNNGTNNLALLSVMVGLPNSGAHLERRVVGSRSVKIWN 563

Query: 582 ED----FSEQPWGYKVGLSGEQSQIFLDTGSSDVQWSRLGNS-SQPLTWYKTQFDAPPGD 641
                 F+   WGY+VGL GE+  ++ + GS+ VQW +  +S SQPLTWYK  FD P G+
Sbjct: 564 GRYQLYFNNYSWGYQVGLKGEKFHVYTEDGSAKVQWKQYRDSKSQPLTWYKASFDTPEGE 623

Query: 642 DPIALNLGSMGKGAAWVNGWGIGRYWVSFLTPKGEPSQKWYNVPRSFLKPAGNQLVILEE 701
           DP+ALNLGSMGKG AWVNG  IGRYWVSF T KG PSQ WY++PRSFLKP  N LVILEE
Sbjct: 624 DPVALNLGSMGKGEAWVNGQSIGRYWVSFHTYKGNPSQIWYHIPRSFLKPNSNLLVILEE 683

Query: 702 E-TGNPIGISLDSVSITKTCGQVSESH-YPLVASWMGAKKQTASGTKNRT----RRPKVR 758
           E  GNP+GI++D+VS+T+ CG VS ++ +P++     + ++     KN T    R+PKV+
Sbjct: 684 EREGNPLGITIDTVSVTEVCGHVSNTNPHPVI-----SPRKKGLNRKNLTYRYDRKPKVQ 743

BLAST of HG10003917 vs. ExPASy Swiss-Prot
Match: Q9FFN4 (Beta-galactosidase 6 OS=Arabidopsis thaliana OX=3702 GN=BGAL6 PE=2 SV=1)

HSP 1 Score: 760.8 bits (1963), Expect = 1.5e-218
Identity = 377/699 (53.93%), Postives = 466/699 (66.67%), Query Frame = 0

Query: 21  LVLTAALFFHCVLGGNDDGSGSVTYDGRSLIVNGEHKLLFSGSIHYPRSTPD-------- 80
           L+L    F      G    +  VTYDGRSLI++G+ KLLFSGSIHYPRSTP+        
Sbjct: 12  LILIVGTFLE--FSGGATAAKGVTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPSLIKK 71

Query: 81  ------------------------YEFSGRRDIVRFVKEIQAQGLYACLRIGPFIEAEWS 140
                                   Y+FSGR D+V+F+KEI++QGLY CLRIGPFIEAEW+
Sbjct: 72  TKEGGIDVIQTYVFWNLHEPKLGQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIEAEWN 131

Query: 141 YGGLPFWLHDVPGIVYRSDNEPFKLHMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEY 200
           YGGLPFWL DVPG+VYR+DNEPFK HMQ FT KIV+LMKSEGLYASQGGPIILSQIENEY
Sbjct: 132 YGGLPFWLRDVPGMVYRTDNEPFKFHMQKFTAKIVDLMKSEGLYASQGGPIILSQIENEY 191

Query: 201 TLVEAAFREKGPPYVLWAADMAVSLKTGVPWSMCKQNDAPDPVINTCNGMRCGETFTGPN 260
             VE AF EKG  Y+ WA  MAV LKTGVPW MCK  DAPDPVINTCNGM+CGETF GPN
Sbjct: 192 ANVEGAFHEKGASYIKWAGQMAVGLKTGVPWIMCKSPDAPDPVINTCNGMKCGETFPGPN 251

Query: 261 SPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIAAKNGTYVNYYMYHGGTNFGR 320
           SPNKP +WTE+WTSF+Q YG+EPYIRSAE+IAFH ALF+ AKNG+Y+NYYMYHGGTNFGR
Sbjct: 252 SPNKPKMWTEDWTSFFQVYGKEPYIRSAEDIAFHAALFV-AKNGSYINYYMYHGGTNFGR 311

Query: 321 SASAFMITGYYDQAPLDEYGLTREPKWGHLKELHAAVKLCSTPLLTGTKSNFSLGQSLEA 380
           ++S++ ITGYYDQAPLDEYGL R+PK+GHLKELHAA+K  + PLL G ++  SLG   +A
Sbjct: 312 TSSSYFITGYYDQAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQTILSLGPMQQA 371

Query: 381 IVFKTESGECAAFLVNKGATDTNVLFQNITYELPLSSISILPDCKNVAFNTRRVSVQHNT 440
            VF+  +  C AFLVN  A  + + F+N  Y L   SI IL +CKN+ + T +V+V+ NT
Sbjct: 372 YVFEDANNGCVAFLVNNDAKASQIQFRNNAYSLSPKSIGILQNCKNLIYETAKVNVKMNT 431

Query: 441 RSMKVVQKFGSSEEWQEFKEPIPNFDETKLRASELLEQMGTTKDRSDYLWYTFRVQQDSQ 500
           R    VQ F   + W  F+E IP F  T L+ + LLE    TKD++DYLWYT   + DS 
Sbjct: 432 RVTTPVQVFNVPDNWNLFRETIPAFPGTSLKTNALLEHTNLTKDKTDYLWYTSSFKLDSP 491

Query: 501 DSQQTLEVDSRAHALHAFVNG--------------------------------------- 560
            +  ++  +S  H +H FVN                                        
Sbjct: 492 CTNPSIYTESSGHVVHVFVNNALAGSGHGSRDIRVVKLQAPVSLINGQNNISILSGMVGL 551

Query: 561 -DSGAFLETRVAGLRRVRI-----QGEDFSEQPWGYKVGLSGEQSQIFLDTGSSDVQWS- 620
            DSGA++E R  GL +V+I     +  D S   WGY VGL GE+ +++     + V+WS 
Sbjct: 552 PDSGAYMERRSYGLTKVQISCGGTKPIDLSRSQWGYSVGLLGEKVRLYQWKNLNRVKWSM 611

Query: 621 -RLG-NSSQPLTWYKTQFDAPPGDDPIALNLGSMGKGAAWVNGWGIGRYWVSFLTPKGEP 640
            + G   ++PL WYKT FD P GD P+ L++ SMGKG  WVNG  IGRYWVSFLTP G+P
Sbjct: 612 NKAGLIKNRPLAWYKTTFDGPNGDGPVGLHMSSMGKGEIWVNGESIGRYWVSFLTPAGQP 671

BLAST of HG10003917 vs. ExPASy Swiss-Prot
Match: Q75HQ3 (Beta-galactosidase 7 OS=Oryza sativa subsp. japonica OX=39947 GN=Os05g0428100 PE=3 SV=1)

HSP 1 Score: 714.9 bits (1844), Expect = 9.5e-205
Identity = 371/807 (45.97%), Postives = 470/807 (58.24%), Query Frame = 0

Query: 43  VTYDGRSLIVNGEHKLLFSGSIHYPRSTPD------------------------------ 102
           +TYDGR+L+V+G  ++ FSG +HY RSTP+                              
Sbjct: 29  ITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVHEPIQ 88

Query: 103 --YEFSGRRDIVRFVKEIQAQGLYACLRIGPFIEAEWSYGGLPFWLHDVPGIVYRSDNEP 162
             Y F GR D+V+F++EIQAQGLY  LRIGPF+EAEW YGG PFWLHDVP I +RSDNEP
Sbjct: 89  GQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRSDNEP 148

Query: 163 FKLHMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYTLVEAAFREKGPPYVLWAADMA 222
           FK HMQNF TKIV +MK EGLY  QGGPII+SQIENEY ++E AF   GP YV WAA MA
Sbjct: 149 FKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWAAAMA 208

Query: 223 VSLKTGVPWSMCKQNDAPDPVINTCNGMRCGETFTGPNSPNKPSIWTENWTS-------- 282
           V L+TGVPW MCKQNDAPDPVINTCNG+ CGETF GPNSPNKP++WTENWTS        
Sbjct: 209 VGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRSNGQNNS 268

Query: 283 --FYQTYGEEPYIRSAEEIAFHVALFIAAKNGTYVNYYMYHGGTNFGRSASAFMITGYYD 342
              Y  YG +  +R+ E+IAF VALFIA K G++V+YYMYHGGTNFGR A++++ T YYD
Sbjct: 269 AFSYPIYGNDTKLRAPEDIAFAVALFIARKKGSFVSYYMYHGGTNFGRFAASYVTTSYYD 328

Query: 343 QAPLDEYGLTREPKWGHLKELHAAVKLCSTPLLTGTKSNFSLGQSLEAIVFKTESGECAA 402
            APLDEY                                                 +C A
Sbjct: 329 GAPLDEYDF-----------------------------------------------KCVA 388

Query: 403 FLVNKGATDT-NVLFQNITYELPLSSISILPDCKNVAFNTRRVSVQHNTRSMKVVQKFGS 462
           FLVN    +T  V F+NI+ EL   SIS+L DC+NV F T +V+ QH +R+   VQ    
Sbjct: 389 FLVNFDQHNTPKVEFRNISLELAPKSISVLSDCRNVVFETAKVNAQHGSRTANAVQSLND 448

Query: 463 SEEWQEFKEPIP-NFDETKLRASELLEQMGTTKDRSDYLWYTFRVQQDSQDSQQT--LEV 522
              W+ F EP+P +  ++    ++L EQ+ TTKD +DYLWY    +  + D  Q   L V
Sbjct: 449 INNWKAFIEPVPQDLSKSTYTGNQLFEQLTTTKDETDYLWYIVSYKNRASDGNQIAHLYV 508

Query: 523 DSRAHALHAFVNG-----------------------------------------DSGAFL 582
            S AH LHAFVN                                          DSGA++
Sbjct: 509 KSLAHILHAFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEGDNTISLLSVMVGSPDSGAYM 568

Query: 583 ETRVAGLRRVRI-QGED----FSEQPWGYKVGLSGEQSQIFLDTGSSDVQWSRLGN-SSQ 642
           E R  G++ V I QG+      +   WGY+VGL GE+  I+   G++ V+W  + N    
Sbjct: 569 ERRTFGIQTVGIQQGQQPMHLLNNDLWGYQVGLFGEKDSIYTQEGTNSVRWMDINNLIYH 628

Query: 643 PLTWYKTQFDAPPGDDPIALNLGSMGKGAAWVNGWGIGRYWVSFLTPKGEPSQKWYNVPR 702
           PLTWYKT F  PPG+D + LNL SMGKG  WVNG  IGRYWVSF  P G+PSQ  Y++PR
Sbjct: 629 PLTWYKTTFSTPPGNDAVTLNLTSMGKGEVWVNGESIGRYWVSFKAPSGQPSQSLYHIPR 688

Query: 703 SFLKPAGNQLVILEEETGNPIGISLDSVSITKTCGQVSESHYPLVASWMGAKKQTASGTK 757
            FL P  N LV++EE  G+P+ I+++++S+T  CG V E   P + S             
Sbjct: 689 GFLTPKDNLLVLVEEMGGDPLQITVNTMSVTTVCGNVDEFSVPPLQS------------- 748

BLAST of HG10003917 vs. ExPASy Swiss-Prot
Match: Q6ZJJ0 (Beta-galactosidase 11 OS=Oryza sativa subsp. japonica OX=39947 GN=Os08g0549200 PE=2 SV=1)

HSP 1 Score: 670.6 bits (1729), Expect = 2.1e-191
Identity = 351/808 (43.44%), Postives = 476/808 (58.91%), Query Frame = 0

Query: 43  VTYDGRSLIVNGEHKLLFSGSIHYPRSTPD------------------------------ 102
           +TYD RSLI++G  ++ FSGSIHYPRS PD                              
Sbjct: 33  ITYDRRSLIIDGHREIFFSGSIHYPRSPPDTWPDLISKAKEGGLNVIESYVFWNGHEPEQ 92

Query: 103 --YEFSGRRDIVRFVKEIQAQGLYACLRIGPFIEAEWSYGGLPFWLHDVPGIVYRSDNEP 162
             Y F GR D+++F K IQ + +YA +RIGPF++AEW++GGLP+WL ++P I++R++NEP
Sbjct: 93  GVYNFEGRYDLIKFFKLIQEKEMYAIVRIGPFVQAEWNHGGLPYWLREIPDIIFRTNNEP 152

Query: 163 FKLHMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYTLVEAAFREKGPPYVLWAADMA 222
           FK +M+ F T IVN +K   L+ASQGGPIIL+QIENEY  +E AF+E G  Y+ WAA MA
Sbjct: 153 FKKYMKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYINWAAKMA 212

Query: 223 VSLKTGVPWSMCKQNDAPDPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGEE 282
           ++  TGVPW MCKQ  AP  VI TCNG  CG+T+ GP    KP +WTENWT+ Y+ +G+ 
Sbjct: 213 IATNTGVPWIMCKQTKAPGEVIPTCNGRHCGDTWPGPADKKKPLLWTENWTAQYRVFGDP 272

Query: 283 PYIRSAEEIAFHVALFIAAKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQAPLDEYGLT 342
           P  RSAE+IAF VA F +   GT  NYYMYHGGTNFGR+ +AF++  YYD+APLDE+GL 
Sbjct: 273 PSQRSAEDIAFSVARFFSV-GGTMANYYMYHGGTNFGRNGAAFVMPRYYDEAPLDEFGLY 332

Query: 343 REPKWGHLKELHAAVKLCSTPLLTGTKSNFSLGQSLEAIVFK-TESGECAAFLVNKGA-T 402
           +EPKWGHL++LH A++ C   LL G  S   LG+  EA VF+  E   C AFL N     
Sbjct: 333 KEPKWGHLRDLHHALRHCKKALLWGNPSVQPLGKLYEARVFEMKEKNVCVAFLSNHNTKE 392

Query: 403 DTNVLFQNITYELPLSSISILPDCKNVAFNTRRVSVQHNTRSMKVVQKFGSSEEWQEF-K 462
           D  V F+   Y +   SISIL DCK V F+T+ V+ QHN R+     +      W+ + +
Sbjct: 393 DGTVTFRGQKYFVARRSISILADCKTVVFSTQHVNSQHNQRTFHFADQTVQDNVWEMYSE 452

Query: 463 EPIPNFDETKLRASELLEQMGTTKDRSDYLWYT--FRVQQDS----QDSQQTLEVDSRAH 522
           E IP + +T +R    LEQ   TKD++DYLWYT  FR++ D     ++ +  LEV S  H
Sbjct: 453 EKIPRYSKTSIRTQRPLEQYNQTKDKTDYLWYTTSFRLETDDLPYRKEVKPVLEVSSHGH 512

Query: 523 ALHAFVNG----------------------------------------DSGAFLETRVAG 582
           A+ AFVN                                         DSG++LE R+AG
Sbjct: 513 AIVAFVNDAFVGCGHGTKINKAFTMEKAMDLKVGVNHVAILSSTLGLMDSGSYLEHRMAG 572

Query: 583 LRRVRIQG-----EDFSEQPWGYKVGLSGEQSQIFLDTGSSDVQWSRLGNSSQPLTWYKT 642
           +  V I+G      D +   WG+ VGL GE+ ++  + G   V W + G  +QPLTWY+ 
Sbjct: 573 VYTVTIRGLNTGTLDLTTNGWGHVVGLDGERRRVHSEQGMGAVAW-KPGKDNQPLTWYRR 632

Query: 643 QFDAPPGDDPIALNLGSMGKGAAWVNGWGIGRYWVSFLTPKGEPSQKWYNVPRSFLKPAG 702
           +FD P G DP+ ++L  MGKG  +VNG G+GRYWVS+    G+PSQ  Y+VPRS L+P G
Sbjct: 633 RFDPPSGTDPVVIDLTPMGKGFLFVNGEGLGRYWVSYHHALGKPSQYLYHVPRSLLRPKG 692

Query: 703 NQLVILEEETGNPIGISLDSVSITKTCGQVSESHYPLVASWMGAKKQT------ASGTKN 758
           N L+  EEE G P  I + +V     C  ++E + P    W    K +       +G   
Sbjct: 693 NTLMFFEEEGGKPDAIMILTVKRDNICTFMTEKN-PAHVRWSWESKDSQPKAVAGAGAGA 752

BLAST of HG10003917 vs. ExPASy Swiss-Prot
Match: Q9SCV9 (Beta-galactosidase 3 OS=Arabidopsis thaliana OX=3702 GN=BGAL3 PE=2 SV=1)

HSP 1 Score: 667.5 bits (1721), Expect = 1.7e-190
Identity = 367/824 (44.54%), Postives = 481/824 (58.37%), Query Frame = 0

Query: 43  VTYDGRSLIVNGEHKLLFSGSIHYPRSTPD------------------------------ 102
           VTYD ++L++NG+ ++LFSGSIHYPRSTPD                              
Sbjct: 33  VTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGIDVIETYVFWNLHEPSP 92

Query: 103 --YEFSGRRDIVRFVKEIQAQGLYACLRIGPFIEAEWSYGGLPFWLHDVPGIVYRSDNEP 162
             Y+F GR D+VRFVK I   GLYA LRIGP++ AEW++GG P WL  VPGI +R+DNEP
Sbjct: 93  GKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEP 152

Query: 163 FKLHMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYTLVEAAFREKGPPYVLWAADMA 222
           FK  M+ FT +IV LMKSE L+ SQGGPIILSQIENEY         +G  Y+ WAA MA
Sbjct: 153 FKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGRQGQLLGAEGHNYMTWAAKMA 212

Query: 223 VSLKTGVPWSMCKQNDAPDPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGEE 282
           ++ +TGVPW MCK++DAPDPVINTCNG  C ++F  PN P KP IWTE W+ ++  +G  
Sbjct: 213 IATETGVPWVMCKEDDAPDPVINTCNGFYC-DSF-APNKPYKPLIWTEAWSGWFTEFGGP 272

Query: 283 PYIRSAEEIAFHVALFIAAKNGTYVNYYMYHGGTNFGRSASAFMITGYYD-QAPLDEYGL 342
            + R  +++AF VA FI  K G++VNYYMYHGGTNFGR+A    +T  YD  AP+DEYGL
Sbjct: 273 MHHRPVQDLAFGVARFI-QKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYGL 332

Query: 343 TREPKWGHLKELHAAVKLCSTPLLTGTKSNFSLGQSLEAIVFKTESGECAAFLVNKGA-T 402
            R+PK+GHLKELH A+K+C   L++      S+G   +A V+  ESG+C+AFL N    +
Sbjct: 333 IRQPKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQQAHVYSAESGDCSAFLANYDTES 392

Query: 403 DTNVLFQNITYELPLSSISILPDCKNVAFNTRRVSVQHNTRSMKVVQKFGSSEEWQEFKE 462
              VLF N+ Y LP  SISILPDC+N  FNT +V VQ  T  M+++     + +W+ + E
Sbjct: 393 AARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVGVQ--TSQMEMLPTDTKNFQWESYLE 452

Query: 463 PIPNFDETKLRASE-LLEQMGTTKDRSDYLWYTFRVQQDSQDS------QQTLEVDSRAH 522
            + + D++    +  LLEQ+  T+D SDYLWY   V     +S        TL + S  H
Sbjct: 453 DLSSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSVDIGDSESFLHGGELPTLIIQSTGH 512

Query: 523 ALHAFVNGD-SGAFLETRV-------------AGLRRVRI-------------------- 582
           A+H FVNG  SG+   TR              +G  R+ +                    
Sbjct: 513 AVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLSVAVGLPNVGGHFESWNTG 572

Query: 583 -----------QGE-DFSEQPWGYKVGLSGEQSQIFLDTGSSDVQW---SRLGNSSQPLT 642
                      QG+ D S Q W Y+VGL GE   +   T +  + W   S      QPLT
Sbjct: 573 ILGPVALHGLSQGKMDLSWQKWTYQVGLKGEAMNLAFPTNTPSIGWMDASLTVQKPQPLT 632

Query: 643 WYKTQFDAPPGDDPIALNLGSMGKGAAWVNGWGIGRYWVSFLTPK--------------- 702
           W+KT FDAP G++P+AL++  MGKG  WVNG  IGRYW +F T                 
Sbjct: 633 WHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYWTAFATGDCSHCSYTGTYKPNKC 692

Query: 703 ----GEPSQKWYNVPRSFLKPAGNQLVILEEETGNPIGISLDSVSITKTCGQVSESHYPL 757
               G+P+Q+WY+VPR++LKP+ N LVI EE  GNP  +SL   S++  C +VSE H P 
Sbjct: 693 QTGCGQPTQRWYHVPRAWLKPSQNLLVIFEELGGNPSTVSLVKRSVSGVCAEVSEYH-PN 752

BLAST of HG10003917 vs. ExPASy TrEMBL
Match: A0A1S3CS54 (Beta-galactosidase OS=Cucumis melo OX=3656 GN=LOC103504084 PE=3 SV=1)

HSP 1 Score: 1378.2 bits (3566), Expect = 0.0e+00
Identity = 687/829 (82.87%), Postives = 708/829 (85.40%), Query Frame = 0

Query: 1   MAKSECGFISYRLPCIVNSALVLTAALFFHCVLGGNDDGSGSVTYDGRSLIVNGEHKLLF 60
           MAKSE   +       + SAL  TA L FHCVLGGN DG G VTYDGRSLIVNGEHKLLF
Sbjct: 1   MAKSESCIV-----ICIYSALFFTAPL-FHCVLGGN-DGIG-VTYDGRSLIVNGEHKLLF 60

Query: 61  SGSIHYPRSTPD--------------------------------YEFSGRRDIVRFVKEI 120
           SGSIHYPRSTPD                                YEFSGRRDIV+FVKEI
Sbjct: 61  SGSIHYPRSTPDMWPSLIAKAKEGGIDVIQTYVFWNLHEPQQGTYEFSGRRDIVQFVKEI 120

Query: 121 QAQGLYACLRIGPFIEAEWSYGGLPFWLHDVPGIVYRSDNEPFKLHMQNFTTKIVNLMKS 180
           QAQGLYACLRIGPFIEAEWSYGGLPFWLHDVPGIVYRSDNEPFKLHMQNFTTKIVN+MKS
Sbjct: 121 QAQGLYACLRIGPFIEAEWSYGGLPFWLHDVPGIVYRSDNEPFKLHMQNFTTKIVNMMKS 180

Query: 181 EGLYASQGGPIILSQIENEYTLVEAAFREKGPPYVLWAADMAVSLKTGVPWSMCKQNDAP 240
           EGLYASQGGPIILSQIENEYTLVEAAF EKGPPYV WAA MAVSL+TGVPWSMCKQNDAP
Sbjct: 181 EGLYASQGGPIILSQIENEYTLVEAAFGEKGPPYVQWAAKMAVSLQTGVPWSMCKQNDAP 240

Query: 241 DPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIA 300
           DPVINTCNGMRCGETFTGPNSP KPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIA
Sbjct: 241 DPVINTCNGMRCGETFTGPNSPTKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIA 300

Query: 301 AKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQAPLDEYGLTREPKWGHLKELHAAVKLC 360
           AKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQ PLDEYGLTREPKWGHLKELHAAVKLC
Sbjct: 301 AKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQGPLDEYGLTREPKWGHLKELHAAVKLC 360

Query: 361 STPLLTGTKSNFSLGQSLEAIVFKTESGECAAFLVNKGATDTNVLFQNITYELPLSSISI 420
           STPLLTGTK NFSLGQSLEAIVFKTES ECAAFLVN+GA DT+VLFQN+TYELPL SISI
Sbjct: 361 STPLLTGTKFNFSLGQSLEAIVFKTESDECAAFLVNRGAIDTDVLFQNVTYELPLGSISI 420

Query: 421 LPDCKNVAFNTRRVSVQHNTRSMKVVQKFGSSEEWQEFKEPIPNFDETKLRASELLEQMG 480
           LPDCKNVAFNTRRVSVQ NTRSM  VQKF SSEEW+EFKEPIPNF++T+LRA++LLE MG
Sbjct: 421 LPDCKNVAFNTRRVSVQRNTRSMMTVQKFDSSEEWEEFKEPIPNFEDTELRANKLLEHMG 480

Query: 481 TTKDRSDYLWYTFRVQQDSQDSQQTLEVDSRAHALHAFVNG------------------- 540
           TTKDRSDYLWYTFRVQQDS DSQQT EVDSRAHALHAFVNG                   
Sbjct: 481 TTKDRSDYLWYTFRVQQDSPDSQQTFEVDSRAHALHAFVNGDYAGSAHGTYKEKGFSLVN 540

Query: 541 ---------------------DSGAFLETRVAGLRRVRIQGEDFSEQPWGYKVGLSGEQS 600
                                DSGAFLETRVAGLRRV IQGEDFSEQPWGYKVGLSGEQS
Sbjct: 541 NITLRNGINNISLLSVMVGLPDSGAFLETRVAGLRRVGIQGEDFSEQPWGYKVGLSGEQS 600

Query: 601 QIFLDTGSSDVQWSRLGNSSQPLTWYKTQFDAPPGDDPIALNLGSMGKGAAWVNGWGIGR 660
           QIFLDTGSS+VQWSRLGNSSQPLTWYKT+FDAPPGDDPIALNLGSMGKGAAWVNG GIGR
Sbjct: 601 QIFLDTGSSNVQWSRLGNSSQPLTWYKTRFDAPPGDDPIALNLGSMGKGAAWVNGRGIGR 660

Query: 661 YWVSFLTPKGEPSQKWYNVPRSFLKPAGNQLVILEEETGNPIGISLDSVSITKTCGQVSE 720
           YWVSFLTPKGEPSQKWYNVPRSFLKP  NQLVILEEETGNP+ ISLDSV ITKTCGQVSE
Sbjct: 661 YWVSFLTPKGEPSQKWYNVPRSFLKPTDNQLVILEEETGNPVEISLDSVLITKTCGQVSE 720

Query: 721 SHYPLVASWMGAKKQTASGTKNRTRRPKVRLSCPTNKNISNILFASFGTPSGDCQSYAIG 758
           SHYPLVASWMGAKKQ     KNRTRRPKV+LSCP+ K ISNILFASFGTPSGDCQSYAIG
Sbjct: 721 SHYPLVASWMGAKKQKVRSAKNRTRRPKVQLSCPSKKKISNILFASFGTPSGDCQSYAIG 780

BLAST of HG10003917 vs. ExPASy TrEMBL
Match: A0A6J1JZN0 (Beta-galactosidase OS=Cucurbita maxima OX=3661 GN=LOC111489734 PE=3 SV=1)

HSP 1 Score: 1368.2 bits (3540), Expect = 0.0e+00
Identity = 677/829 (81.66%), Postives = 710/829 (85.65%), Query Frame = 0

Query: 1   MAKSECGFISYRLPCIVNSALVLTAALFFHCVLGGNDDGSGSVTYDGRSLIVNGEHKLLF 60
           MAKS+ G +  RL C+  SALV TAAL FHCVLGGN DGS  V+YDGRSLIVNGEHKL F
Sbjct: 1   MAKSKYGTV--RLLCV--SALVFTAAL-FHCVLGGN-DGSDGVSYDGRSLIVNGEHKLFF 60

Query: 61  SGSIHYPRSTPD--------------------------------YEFSGRRDIVRFVKEI 120
           SGSIHYPRSTPD                                YEFSGRRDIV+FVKEI
Sbjct: 61  SGSIHYPRSTPDMWPSLIAKAKEGGIDVIQTYVFWNLHEPHQGRYEFSGRRDIVKFVKEI 120

Query: 121 QAQGLYACLRIGPFIEAEWSYGGLPFWLHDVPGIVYRSDNEPFKLHMQNFTTKIVNLMKS 180
           QAQGLYACLRIGPFIEAEW+YGGLPFWLHD+ GIVYRSDNEPFK +MQNFTTKIVN+MKS
Sbjct: 121 QAQGLYACLRIGPFIEAEWNYGGLPFWLHDISGIVYRSDNEPFKFYMQNFTTKIVNMMKS 180

Query: 181 EGLYASQGGPIILSQIENEYTLVEAAFREKGPPYVLWAADMAVSLKTGVPWSMCKQNDAP 240
           EGLYASQGGPIILSQIENEYTLVEAAFREKGPPYV WAADMAVSL+TGVPWSMCKQNDAP
Sbjct: 181 EGLYASQGGPIILSQIENEYTLVEAAFREKGPPYVRWAADMAVSLQTGVPWSMCKQNDAP 240

Query: 241 DPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIA 300
           DPVINTCNGMRCGETF GPN+PNKPSIWTENWTSFYQTYG EPYIRSAEEIAFHVALFIA
Sbjct: 241 DPVINTCNGMRCGETFPGPNTPNKPSIWTENWTSFYQTYGGEPYIRSAEEIAFHVALFIA 300

Query: 301 AKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQAPLDEYGLTREPKWGHLKELHAAVKLC 360
           AKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQAPLDEYGL REPKWGHLKELHAA+KLC
Sbjct: 301 AKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQAPLDEYGLIREPKWGHLKELHAAIKLC 360

Query: 361 STPLLTGTKSNFSLGQSLEAIVFKTESGECAAFLVNKGATDTNVLFQNITYELPLSSISI 420
           S PLLTGTKSNFSLG+S+EAIVFKT+SGECAAFLVNKGATD NVLFQ++TYELPLSSISI
Sbjct: 361 SKPLLTGTKSNFSLGKSIEAIVFKTKSGECAAFLVNKGATDMNVLFQSVTYELPLSSISI 420

Query: 421 LPDCKNVAFNTRRVSVQHNTRSMKVVQKFGSSEEWQEFKEPIPNFDETKLRASELLEQMG 480
           LPDCKNVAFNTRRVSVQ+NTRSM  VQKF S+ EWQEFKE IP+FDET LRA+ELLE   
Sbjct: 421 LPDCKNVAFNTRRVSVQYNTRSMNAVQKFDSNVEWQEFKESIPSFDETDLRANELLEHTD 480

Query: 481 TTKDRSDYLWYTFRVQQDSQDSQQTLEVDSRAHALHAFVNG------------------- 540
           TTKD SDYLWYT RV+ DS DSQQTL+VDS AHA+HAFVNG                   
Sbjct: 481 TTKDSSDYLWYTLRVEADSPDSQQTLKVDSLAHAMHAFVNGLYAGSAHGTYKEKGFSLEN 540

Query: 541 ---------------------DSGAFLETRVAGLRRVRIQGEDFSEQPWGYKVGLSGEQS 600
                                DSGAFLE+RVAGLRRVRIQGEDFS QPWGYKVGLSGEQS
Sbjct: 541 NITLRNGINNISLLSVMVGLPDSGAFLESRVAGLRRVRIQGEDFSTQPWGYKVGLSGEQS 600

Query: 601 QIFLDTGSSDVQWSRLGNSSQPLTWYKTQFDAPPGDDPIALNLGSMGKGAAWVNGWGIGR 660
           QIFLD+GSS+ QWSRLG+SSQPLTWYKTQFDAPPGDDPIALNLGSMGKGA WVNGWGIGR
Sbjct: 601 QIFLDSGSSNAQWSRLGDSSQPLTWYKTQFDAPPGDDPIALNLGSMGKGAVWVNGWGIGR 660

Query: 661 YWVSFLTPKGEPSQKWYNVPRSFLKPAGNQLVILEEETGNPIGISLDSVSITKTCGQVSE 720
           YWVSFLTP GEPSQKWYNVPRSFLKP GN LVILEEETGNP+GISLDSVSI+KTCGQVSE
Sbjct: 661 YWVSFLTPTGEPSQKWYNVPRSFLKPTGNLLVILEEETGNPVGISLDSVSISKTCGQVSE 720

Query: 721 SHYPLVASWMGAKKQTASGTKNRTRRPKVRLSCPTNKNISNILFASFGTPSGDCQSYAIG 758
           SHYPLVASWM AKKQ AS TKN++RRPKVRLSCPTNKNISNILFASFGTPSGDCQSYA+G
Sbjct: 721 SHYPLVASWMSAKKQRASRTKNKSRRPKVRLSCPTNKNISNILFASFGTPSGDCQSYAVG 780

BLAST of HG10003917 vs. ExPASy TrEMBL
Match: A0A6J1FGB8 (Beta-galactosidase OS=Cucurbita moschata OX=3662 GN=LOC111445236 PE=3 SV=1)

HSP 1 Score: 1365.5 bits (3533), Expect = 0.0e+00
Identity = 677/829 (81.66%), Postives = 708/829 (85.40%), Query Frame = 0

Query: 1   MAKSECGFISYRLPCIVNSALVLTAALFFHCVLGGNDDGSGSVTYDGRSLIVNGEHKLLF 60
           MAKSE G +   L C+  SALV TAAL FHCVLGGN DGS  V+YDGRSLIVNGEHKLLF
Sbjct: 1   MAKSEYGTVG--LLCV--SALVFTAAL-FHCVLGGN-DGSDGVSYDGRSLIVNGEHKLLF 60

Query: 61  SGSIHYPRSTPD--------------------------------YEFSGRRDIVRFVKEI 120
           SGSIHYPRSTPD                                YEFSGRRD+V+FVKEI
Sbjct: 61  SGSIHYPRSTPDMWPSLIAKAKEGGIDVIQTYVFWNLHEPHQGRYEFSGRRDVVKFVKEI 120

Query: 121 QAQGLYACLRIGPFIEAEWSYGGLPFWLHDVPGIVYRSDNEPFKLHMQNFTTKIVNLMKS 180
           QAQGLYACLRIGPFIEAEW+YGGLPFWLHDVP IVYRSDNEPFK +MQNFTTKIVNLMKS
Sbjct: 121 QAQGLYACLRIGPFIEAEWNYGGLPFWLHDVPEIVYRSDNEPFKFYMQNFTTKIVNLMKS 180

Query: 181 EGLYASQGGPIILSQIENEYTLVEAAFREKGPPYVLWAADMAVSLKTGVPWSMCKQNDAP 240
           EGLYASQGGPIILSQIENEYTLVEAAFREKGPPYV WAADMAVSL+TGVPWSMCKQNDAP
Sbjct: 181 EGLYASQGGPIILSQIENEYTLVEAAFREKGPPYVRWAADMAVSLQTGVPWSMCKQNDAP 240

Query: 241 DPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIA 300
           DP+INTCNGMRCGETF GPNSPNKPS+WTENWTSFYQTYG EPYIRSAEEIAFHVALFIA
Sbjct: 241 DPMINTCNGMRCGETFLGPNSPNKPSMWTENWTSFYQTYGGEPYIRSAEEIAFHVALFIA 300

Query: 301 AKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQAPLDEYGLTREPKWGHLKELHAAVKLC 360
           AKNGTYVNYYMYHGGTNFGRS SAFMITGYYDQAPLDEYGL REPKWGHLKELHAA+KLC
Sbjct: 301 AKNGTYVNYYMYHGGTNFGRSTSAFMITGYYDQAPLDEYGLIREPKWGHLKELHAAIKLC 360

Query: 361 STPLLTGTKSNFSLGQSLEAIVFKTESGECAAFLVNKGATDTNVLFQNITYELPLSSISI 420
           S PLLTGTKSNFSLG+SLEAIVFKTESGECAAFLVNKGATDTNVLFQ +TYELPLSSISI
Sbjct: 361 SKPLLTGTKSNFSLGKSLEAIVFKTESGECAAFLVNKGATDTNVLFQGVTYELPLSSISI 420

Query: 421 LPDCKNVAFNTRRVSVQHNTRSMKVVQKFGSSEEWQEFKEPIPNFDETKLRASELLEQMG 480
           LPDCKNVAFNTRRVSVQ+NTRSMK VQKF SSEEWQEFKE IP+F+ET LRA+ELLE  G
Sbjct: 421 LPDCKNVAFNTRRVSVQYNTRSMKTVQKFDSSEEWQEFKESIPSFNETDLRANELLEHTG 480

Query: 481 TTKDRSDYLWYTFRVQQDSQDSQQTLEVDSRAHALHAFVNG------------------- 540
           TTKD SDYLWYT RV+ DS DSQQTL+VDS AHA+HAFVNG                   
Sbjct: 481 TTKDSSDYLWYTLRVEADSPDSQQTLKVDSLAHAMHAFVNGLYAGSAHGTYKEKGFSLEN 540

Query: 541 ---------------------DSGAFLETRVAGLRRVRIQGEDFSEQPWGYKVGLSGEQS 600
                                DSGAFLE R+AGLRRVRIQ EDFS QPWGYKVGLSGEQS
Sbjct: 541 NITLRNGINNISLLSVMVGLPDSGAFLERRIAGLRRVRIQDEDFSAQPWGYKVGLSGEQS 600

Query: 601 QIFLDTGSSDVQWSRLGNSSQPLTWYKTQFDAPPGDDPIALNLGSMGKGAAWVNGWGIGR 660
           QIFLD GSS+VQWSRLG+SSQPLTWYKTQFDAPPGDDPIALNLGSMGKGA WVNGWGIGR
Sbjct: 601 QIFLDNGSSNVQWSRLGDSSQPLTWYKTQFDAPPGDDPIALNLGSMGKGAVWVNGWGIGR 660

Query: 661 YWVSFLTPKGEPSQKWYNVPRSFLKPAGNQLVILEEETGNPIGISLDSVSITKTCGQVSE 720
           YWVSFLTP GEPSQKWYNVPRSFLKP  N LVILEEETG+P+GISLDSVSI+KTCGQVSE
Sbjct: 661 YWVSFLTPTGEPSQKWYNVPRSFLKPTENLLVILEEETGSPVGISLDSVSISKTCGQVSE 720

Query: 721 SHYPLVASWMGAKKQTASGTKNRTRRPKVRLSCPTNKNISNILFASFGTPSGDCQSYAIG 758
           SHYPLVASWM AKKQ AS TKN++RRPKVRLSCPTNKNIS ILFASFGTPSGDCQSYA+G
Sbjct: 721 SHYPLVASWMSAKKQRASRTKNKSRRPKVRLSCPTNKNISKILFASFGTPSGDCQSYAVG 780

BLAST of HG10003917 vs. ExPASy TrEMBL
Match: A0A1S3CRZ8 (Beta-galactosidase OS=Cucumis melo OX=3656 GN=LOC103504084 PE=3 SV=1)

HSP 1 Score: 1362.8 bits (3526), Expect = 0.0e+00
Identity = 687/858 (80.07%), Postives = 708/858 (82.52%), Query Frame = 0

Query: 1   MAKSECGFISYRLPCIVNSALVLTAALFFHCVLGGNDDGSGSVTYDGRSLIVNGEHKLLF 60
           MAKSE   +       + SAL  TA L FHCVLGGN DG G VTYDGRSLIVNGEHKLLF
Sbjct: 1   MAKSESCIV-----ICIYSALFFTAPL-FHCVLGGN-DGIG-VTYDGRSLIVNGEHKLLF 60

Query: 61  SGSIHYPRSTPD--------------------------------YEFSGRRDIVRFVKEI 120
           SGSIHYPRSTPD                                YEFSGRRDIV+FVKEI
Sbjct: 61  SGSIHYPRSTPDMWPSLIAKAKEGGIDVIQTYVFWNLHEPQQGTYEFSGRRDIVQFVKEI 120

Query: 121 QAQGLYACLRIGPFIEAEWSYGGLPFWLHDVPGIVYRSDNEPFKLHMQNFTTKIVNLMKS 180
           QAQGLYACLRIGPFIEAEWSYGGLPFWLHDVPGIVYRSDNEPFKLHMQNFTTKIVN+MKS
Sbjct: 121 QAQGLYACLRIGPFIEAEWSYGGLPFWLHDVPGIVYRSDNEPFKLHMQNFTTKIVNMMKS 180

Query: 181 EGLYASQGGPIILSQIENEYTLVEAAFREKGPPYVLWAADMAVSLKTGVPWSMCKQNDAP 240
           EGLYASQGGPIILSQIENEYTLVEAAF EKGPPYV WAA MAVSL+TGVPWSMCKQNDAP
Sbjct: 181 EGLYASQGGPIILSQIENEYTLVEAAFGEKGPPYVQWAAKMAVSLQTGVPWSMCKQNDAP 240

Query: 241 DPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIA 300
           DPVINTCNGMRCGETFTGPNSP KPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIA
Sbjct: 241 DPVINTCNGMRCGETFTGPNSPTKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIA 300

Query: 301 AKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQAPLDEYGLTREPKWGHLKELHAAVKLC 360
           AKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQ PLDEYGLTREPKWGHLKELHAAVKLC
Sbjct: 301 AKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQGPLDEYGLTREPKWGHLKELHAAVKLC 360

Query: 361 STPLLTGTKSNFSLGQSLEAIVFKTESGECAAFLVNKGATDTNVLFQNITYELPLSSISI 420
           STPLLTGTK NFSLGQSLEAIVFKTES ECAAFLVN+GA DT+VLFQN+TYELPL SISI
Sbjct: 361 STPLLTGTKFNFSLGQSLEAIVFKTESDECAAFLVNRGAIDTDVLFQNVTYELPLGSISI 420

Query: 421 LPDCKNVAFNTRRVSVQHNTRSMKVVQKFGSSEEWQEFKEPIPNFDETKLRASELLEQMG 480
           LPDCKNVAFNTRRVSVQ NTRSM  VQKF SSEEW+EFKEPIPNF++T+LRA++LLE MG
Sbjct: 421 LPDCKNVAFNTRRVSVQRNTRSMMTVQKFDSSEEWEEFKEPIPNFEDTELRANKLLEHMG 480

Query: 481 TTKDRSDYLWYTFRVQQDSQDSQQTLEVDSRAHALHAFVNG------------------- 540
           TTKDRSDYLWYTFRVQQDS DSQQT EVDSRAHALHAFVNG                   
Sbjct: 481 TTKDRSDYLWYTFRVQQDSPDSQQTFEVDSRAHALHAFVNGDYAGSAHGTYKEKGFSLVN 540

Query: 541 ---------------------DSGAFLETRVAGLRRVRIQGEDFSEQPWGYKVGLSGEQS 600
                                DSGAFLETRVAGLRRV IQGEDFSEQPWGYKVGLSGEQS
Sbjct: 541 NITLRNGINNISLLSVMVGLPDSGAFLETRVAGLRRVGIQGEDFSEQPWGYKVGLSGEQS 600

Query: 601 QIFLDTGSSDVQWSRLGNSSQPLTWYK-----------------------------TQFD 660
           QIFLDTGSS+VQWSRLGNSSQPLTWYK                             T+FD
Sbjct: 601 QIFLDTGSSNVQWSRLGNSSQPLTWYKVLPIINSNYLEFKCSRTDLCYCHSLLIGQTRFD 660

Query: 661 APPGDDPIALNLGSMGKGAAWVNGWGIGRYWVSFLTPKGEPSQKWYNVPRSFLKPAGNQL 720
           APPGDDPIALNLGSMGKGAAWVNG GIGRYWVSFLTPKGEPSQKWYNVPRSFLKP  NQL
Sbjct: 661 APPGDDPIALNLGSMGKGAAWVNGRGIGRYWVSFLTPKGEPSQKWYNVPRSFLKPTDNQL 720

Query: 721 VILEEETGNPIGISLDSVSITKTCGQVSESHYPLVASWMGAKKQTASGTKNRTRRPKVRL 758
           VILEEETGNP+ ISLDSV ITKTCGQVSESHYPLVASWMGAKKQ     KNRTRRPKV+L
Sbjct: 721 VILEEETGNPVEISLDSVLITKTCGQVSESHYPLVASWMGAKKQKVRSAKNRTRRPKVQL 780

BLAST of HG10003917 vs. ExPASy TrEMBL
Match: A0A6J1CI46 (Beta-galactosidase OS=Momordica charantia OX=3673 GN=LOC111011842 PE=3 SV=1)

HSP 1 Score: 1323.9 bits (3425), Expect = 0.0e+00
Identity = 656/829 (79.13%), Postives = 702/829 (84.68%), Query Frame = 0

Query: 1   MAKSECGFISYRLPCIVNSALVLTAALFFHCVLGGNDDGSGSVTYDGRSLIVNGEHKLLF 60
           MA SE G +  RL C+  SALVLT AL F  VLGGN++  G V+YDGRSLI+NGE KLLF
Sbjct: 1   MANSEYGVV--RLLCV--SALVLTTAL-FDSVLGGNEN-DGGVSYDGRSLIINGEQKLLF 60

Query: 61  SGSIHYPRSTPD--------------------------------YEFSGRRDIVRFVKEI 120
           SGSIHYPRSTPD                                YEFSGRRDIVRF+KEI
Sbjct: 61  SGSIHYPRSTPDMWPSLIAKAKEGGLDVIQTYVFWNLHEPQQGKYEFSGRRDIVRFLKEI 120

Query: 121 QAQGLYACLRIGPFIEAEWSYGGLPFWLHDVPGIVYRSDNEPFKLHMQNFTTKIVNLMKS 180
           QAQGL+ACLRIGPFIEAEWSYGGLPFWLHDVPGIVYRSDNEPFKLHMQNFTTKIVN+MKS
Sbjct: 121 QAQGLHACLRIGPFIEAEWSYGGLPFWLHDVPGIVYRSDNEPFKLHMQNFTTKIVNMMKS 180

Query: 181 EGLYASQGGPIILSQIENEYTLVEAAFREKGPPYVLWAADMAVSLKTGVPWSMCKQNDAP 240
           EGLYASQGGPIILSQIENEYTLVEAAF EKGPPYVLWAA+MAVSL+TGVPWSMC+QNDAP
Sbjct: 181 EGLYASQGGPIILSQIENEYTLVEAAFHEKGPPYVLWAANMAVSLQTGVPWSMCRQNDAP 240

Query: 241 DPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIA 300
           DPVINTCNGMRCGETFTGPNSPNKPS+WTENWTSFYQTYG EPYIRSAEEIAFHVALFIA
Sbjct: 241 DPVINTCNGMRCGETFTGPNSPNKPSMWTENWTSFYQTYGGEPYIRSAEEIAFHVALFIA 300

Query: 301 AKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQAPLDEYGLTREPKWGHLKELHAAVKLC 360
           AKNGTYVNYYMYHGGTNFGR+ASA++ITGYYDQAPLDEYGL REPKWGHLKELHAAVKLC
Sbjct: 301 AKNGTYVNYYMYHGGTNFGRTASAYVITGYYDQAPLDEYGLMREPKWGHLKELHAAVKLC 360

Query: 361 STPLLTGTKSNFSLGQSLEAIVFKTESGECAAFLVNKGATDTNVLFQNITYELPLSSISI 420
           S PLL+GTKSNFSLGQS EA VFKTESGECAAFLVN+GATD N+LFQN++Y+LPLSSISI
Sbjct: 361 SKPLLSGTKSNFSLGQSQEAYVFKTESGECAAFLVNRGATDVNILFQNVSYKLPLSSISI 420

Query: 421 LPDCKNVAFNTRRVSVQHNTRSMKVVQKFGSSEEWQEFKEPIPNFDETKLRASELLEQMG 480
           LPDCK VAFNTR VSVQHNTRSM+ VQ FGSSEEWQEFKE IP+F+ET+LRA ELLE MG
Sbjct: 421 LPDCKTVAFNTRMVSVQHNTRSMRAVQTFGSSEEWQEFKEMIPSFNETELRADELLEHMG 480

Query: 481 TTKDRSDYLWYTFRVQQDSQDSQQTLEVDSRAHALHAFVNG------------------- 540
           TTKD SDYLWYT RVQ DS DS  TLEVDSRAHALHAFVNG                   
Sbjct: 481 TTKDSSDYLWYTLRVQHDSPDSLLTLEVDSRAHALHAFVNGVYAGSAHGTFKERSFSLEK 540

Query: 541 ---------------------DSGAFLETRVAGLRRVRIQGEDFSEQPWGYKVGLSGEQS 600
                                DSGA+LE RVAGLRRV+IQGEDFS + WGYKVGL GEQS
Sbjct: 541 SITLRNGINNISLLSVMVGLPDSGAYLERRVAGLRRVQIQGEDFSAKSWGYKVGLVGEQS 600

Query: 601 QIFLDTGSSDVQWSRLGNSSQPLTWYKTQFDAPPGDDPIALNLGSMGKGAAWVNGWGIGR 660
            IFLDTGSS++QWSRLGNSSQPLTWYKT+FDAPPGDDPIALNLGSMGKGAAWVNG GIGR
Sbjct: 601 LIFLDTGSSEIQWSRLGNSSQPLTWYKTRFDAPPGDDPIALNLGSMGKGAAWVNGRGIGR 660

Query: 661 YWVSFLTPKGEPSQKWYNVPRSFLKPAGNQLVILEEETGNPIGISLDSVSITKTCGQVSE 720
           YWVSFLT KGEPSQKWYNVPRSFL+P GNQL+ILEEETGNP+GISLD+VSI+KTCGQVSE
Sbjct: 661 YWVSFLTSKGEPSQKWYNVPRSFLEPTGNQLIILEEETGNPLGISLDAVSISKTCGQVSE 720

Query: 721 SHYPLVASWMGAKKQTASGTKNRTRRPKVRLSCPTNKNISNILFASFGTPSGDCQSYAIG 758
           SHYP VASW+GAKKQ A+  KNR+R+PK+ LSCP +KNISNILFASFGTP+GDCQSYA G
Sbjct: 721 SHYPQVASWIGAKKQRAN-RKNRSRKPKLLLSCPHDKNISNILFASFGTPTGDCQSYATG 780

BLAST of HG10003917 vs. TAIR 10
Match: AT1G77410.1 (beta-galactosidase 16 )

HSP 1 Score: 923.3 bits (2385), Expect = 1.2e-268
Identity = 456/799 (57.07%), Postives = 559/799 (69.96%), Query Frame = 0

Query: 42  SVTYDGRSLIVNGEHKLLFSGSIHYPRSTP------------------------------ 101
           +VTYDGRSLI++GEHK+LFSGSIHY RSTP                              
Sbjct: 24  NVTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDVVDTYVFWNVHEPQ 83

Query: 102 --DYEFSGRRDIVRFVKEIQAQGLYACLRIGPFIEAEWSYGGLPFWLHDVPGIVYRSDNE 161
              ++FSG RDIV+F+KE++  GLY CLRIGPFI+ EWSYGGLPFWLH+V GIV+R+DNE
Sbjct: 84  QGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLPFWLHNVQGIVFRTDNE 143

Query: 162 PFKLHMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYTLVEAAFREKGPPYVLWAADM 221
           PFK HM+ +   IV LMKSE LYASQGGPIILSQIENEY +V  AFR++G  YV W A +
Sbjct: 144 PFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVGRAFRQEGKSYVKWTAKL 203

Query: 222 AVSLKTGVPWSMCKQNDAPDPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGE 281
           AV L TGVPW MCKQ+DAPDP++N CNG +CGETF GPNSPNKP+IWTENWTSFYQTYGE
Sbjct: 204 AVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIWTENWTSFYQTYGE 263

Query: 282 EPYIRSAEEIAFHVALFIAAKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQAPLDEYGL 341
           EP IRSAE+IAFHVALFI AKNG++VNYYMYHGGTNFGR+AS F+IT YYDQAPLDEYGL
Sbjct: 264 EPLIRSAEDIAFHVALFI-AKNGSFVNYYMYHGGTNFGRNASQFVITSYYDQAPLDEYGL 323

Query: 342 TREPKWGHLKELHAAVKLCSTPLLTGTKSNFSLGQSLEAIVFKTESGECAAFLVNKGATD 401
            R+PKWGHLKELHAAVKLC  PLL+G ++  SLG+   A VF  ++  CAA LVN+   +
Sbjct: 324 LRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFGKKANLCAAILVNQDKCE 383

Query: 402 TNVLFQNITYELPLSSISILPDCKNVAFNTRRVSVQHNTRSMKVVQKFGSSEEWQEFKEP 461
           + V F+N +Y L   S+S+LPDCKNVAFNT +V+ Q+NTR+ K  Q   S + W+EF E 
Sbjct: 384 STVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYNTRTRKARQNLSSPQMWEEFTET 443

Query: 462 IPNFDETKLRASELLEQMGTTKDRSDYLWYTFRVQQDSQDSQQTLEVDSRAHALHAFVNG 521
           +P+F ET +R+  LLE M TT+D SDYLW T R QQ S+ +   L+V+   HALHAFVNG
Sbjct: 444 VPSFSETSIRSESLLEHMNTTQDTSDYLWQTTRFQQ-SEGAPSVLKVNHLGHALHAFVNG 503

Query: 522 ----------------------------------------DSGAFLETRVAGLRRVRIQG 581
                                                   +SGA LE RV G R V+I  
Sbjct: 504 RFIGSMHGTFKAHRFLLEKNMSLNNGTNNLALLSVMVGLPNSGAHLERRVVGSRSVKIWN 563

Query: 582 ED----FSEQPWGYKVGLSGEQSQIFLDTGSSDVQWSRLGNS-SQPLTWYKTQFDAPPGD 641
                 F+   WGY+VGL GE+  ++ + GS+ VQW +  +S SQPLTWYK  FD P G+
Sbjct: 564 GRYQLYFNNYSWGYQVGLKGEKFHVYTEDGSAKVQWKQYRDSKSQPLTWYKASFDTPEGE 623

Query: 642 DPIALNLGSMGKGAAWVNGWGIGRYWVSFLTPKGEPSQKWYNVPRSFLKPAGNQLVILEE 701
           DP+ALNLGSMGKG AWVNG  IGRYWVSF T KG PSQ WY++PRSFLKP  N LVILEE
Sbjct: 624 DPVALNLGSMGKGEAWVNGQSIGRYWVSFHTYKGNPSQIWYHIPRSFLKPNSNLLVILEE 683

Query: 702 E-TGNPIGISLDSVSITKTCGQVSESH-YPLVASWMGAKKQTASGTKNRT----RRPKVR 758
           E  GNP+GI++D+VS+T+ CG VS ++ +P++     + ++     KN T    R+PKV+
Sbjct: 684 EREGNPLGITIDTVSVTEVCGHVSNTNPHPVI-----SPRKKGLNRKNLTYRYDRKPKVQ 743

BLAST of HG10003917 vs. TAIR 10
Match: AT5G63800.1 (Glycosyl hydrolase family 35 protein )

HSP 1 Score: 760.8 bits (1963), Expect = 1.1e-219
Identity = 377/699 (53.93%), Postives = 466/699 (66.67%), Query Frame = 0

Query: 21  LVLTAALFFHCVLGGNDDGSGSVTYDGRSLIVNGEHKLLFSGSIHYPRSTPD-------- 80
           L+L    F      G    +  VTYDGRSLI++G+ KLLFSGSIHYPRSTP+        
Sbjct: 12  LILIVGTFLE--FSGGATAAKGVTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPSLIKK 71

Query: 81  ------------------------YEFSGRRDIVRFVKEIQAQGLYACLRIGPFIEAEWS 140
                                   Y+FSGR D+V+F+KEI++QGLY CLRIGPFIEAEW+
Sbjct: 72  TKEGGIDVIQTYVFWNLHEPKLGQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIEAEWN 131

Query: 141 YGGLPFWLHDVPGIVYRSDNEPFKLHMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEY 200
           YGGLPFWL DVPG+VYR+DNEPFK HMQ FT KIV+LMKSEGLYASQGGPIILSQIENEY
Sbjct: 132 YGGLPFWLRDVPGMVYRTDNEPFKFHMQKFTAKIVDLMKSEGLYASQGGPIILSQIENEY 191

Query: 201 TLVEAAFREKGPPYVLWAADMAVSLKTGVPWSMCKQNDAPDPVINTCNGMRCGETFTGPN 260
             VE AF EKG  Y+ WA  MAV LKTGVPW MCK  DAPDPVINTCNGM+CGETF GPN
Sbjct: 192 ANVEGAFHEKGASYIKWAGQMAVGLKTGVPWIMCKSPDAPDPVINTCNGMKCGETFPGPN 251

Query: 261 SPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIAAKNGTYVNYYMYHGGTNFGR 320
           SPNKP +WTE+WTSF+Q YG+EPYIRSAE+IAFH ALF+ AKNG+Y+NYYMYHGGTNFGR
Sbjct: 252 SPNKPKMWTEDWTSFFQVYGKEPYIRSAEDIAFHAALFV-AKNGSYINYYMYHGGTNFGR 311

Query: 321 SASAFMITGYYDQAPLDEYGLTREPKWGHLKELHAAVKLCSTPLLTGTKSNFSLGQSLEA 380
           ++S++ ITGYYDQAPLDEYGL R+PK+GHLKELHAA+K  + PLL G ++  SLG   +A
Sbjct: 312 TSSSYFITGYYDQAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQTILSLGPMQQA 371

Query: 381 IVFKTESGECAAFLVNKGATDTNVLFQNITYELPLSSISILPDCKNVAFNTRRVSVQHNT 440
            VF+  +  C AFLVN  A  + + F+N  Y L   SI IL +CKN+ + T +V+V+ NT
Sbjct: 372 YVFEDANNGCVAFLVNNDAKASQIQFRNNAYSLSPKSIGILQNCKNLIYETAKVNVKMNT 431

Query: 441 RSMKVVQKFGSSEEWQEFKEPIPNFDETKLRASELLEQMGTTKDRSDYLWYTFRVQQDSQ 500
           R    VQ F   + W  F+E IP F  T L+ + LLE    TKD++DYLWYT   + DS 
Sbjct: 432 RVTTPVQVFNVPDNWNLFRETIPAFPGTSLKTNALLEHTNLTKDKTDYLWYTSSFKLDSP 491

Query: 501 DSQQTLEVDSRAHALHAFVNG--------------------------------------- 560
            +  ++  +S  H +H FVN                                        
Sbjct: 492 CTNPSIYTESSGHVVHVFVNNALAGSGHGSRDIRVVKLQAPVSLINGQNNISILSGMVGL 551

Query: 561 -DSGAFLETRVAGLRRVRI-----QGEDFSEQPWGYKVGLSGEQSQIFLDTGSSDVQWS- 620
            DSGA++E R  GL +V+I     +  D S   WGY VGL GE+ +++     + V+WS 
Sbjct: 552 PDSGAYMERRSYGLTKVQISCGGTKPIDLSRSQWGYSVGLLGEKVRLYQWKNLNRVKWSM 611

Query: 621 -RLG-NSSQPLTWYKTQFDAPPGDDPIALNLGSMGKGAAWVNGWGIGRYWVSFLTPKGEP 640
            + G   ++PL WYKT FD P GD P+ L++ SMGKG  WVNG  IGRYWVSFLTP G+P
Sbjct: 612 NKAGLIKNRPLAWYKTTFDGPNGDGPVGLHMSSMGKGEIWVNGESIGRYWVSFLTPAGQP 671

BLAST of HG10003917 vs. TAIR 10
Match: AT4G36360.1 (beta-galactosidase 3 )

HSP 1 Score: 667.5 bits (1721), Expect = 1.2e-191
Identity = 367/824 (44.54%), Postives = 481/824 (58.37%), Query Frame = 0

Query: 43  VTYDGRSLIVNGEHKLLFSGSIHYPRSTPD------------------------------ 102
           VTYD ++L++NG+ ++LFSGSIHYPRSTPD                              
Sbjct: 33  VTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGIDVIETYVFWNLHEPSP 92

Query: 103 --YEFSGRRDIVRFVKEIQAQGLYACLRIGPFIEAEWSYGGLPFWLHDVPGIVYRSDNEP 162
             Y+F GR D+VRFVK I   GLYA LRIGP++ AEW++GG P WL  VPGI +R+DNEP
Sbjct: 93  GKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEP 152

Query: 163 FKLHMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYTLVEAAFREKGPPYVLWAADMA 222
           FK  M+ FT +IV LMKSE L+ SQGGPIILSQIENEY         +G  Y+ WAA MA
Sbjct: 153 FKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGRQGQLLGAEGHNYMTWAAKMA 212

Query: 223 VSLKTGVPWSMCKQNDAPDPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGEE 282
           ++ +TGVPW MCK++DAPDPVINTCNG  C ++F  PN P KP IWTE W+ ++  +G  
Sbjct: 213 IATETGVPWVMCKEDDAPDPVINTCNGFYC-DSF-APNKPYKPLIWTEAWSGWFTEFGGP 272

Query: 283 PYIRSAEEIAFHVALFIAAKNGTYVNYYMYHGGTNFGRSASAFMITGYYD-QAPLDEYGL 342
            + R  +++AF VA FI  K G++VNYYMYHGGTNFGR+A    +T  YD  AP+DEYGL
Sbjct: 273 MHHRPVQDLAFGVARFI-QKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYGL 332

Query: 343 TREPKWGHLKELHAAVKLCSTPLLTGTKSNFSLGQSLEAIVFKTESGECAAFLVNKGA-T 402
            R+PK+GHLKELH A+K+C   L++      S+G   +A V+  ESG+C+AFL N    +
Sbjct: 333 IRQPKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQQAHVYSAESGDCSAFLANYDTES 392

Query: 403 DTNVLFQNITYELPLSSISILPDCKNVAFNTRRVSVQHNTRSMKVVQKFGSSEEWQEFKE 462
              VLF N+ Y LP  SISILPDC+N  FNT +V VQ  T  M+++     + +W+ + E
Sbjct: 393 AARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVGVQ--TSQMEMLPTDTKNFQWESYLE 452

Query: 463 PIPNFDETKLRASE-LLEQMGTTKDRSDYLWYTFRVQQDSQDS------QQTLEVDSRAH 522
            + + D++    +  LLEQ+  T+D SDYLWY   V     +S        TL + S  H
Sbjct: 453 DLSSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSVDIGDSESFLHGGELPTLIIQSTGH 512

Query: 523 ALHAFVNGD-SGAFLETRV-------------AGLRRVRI-------------------- 582
           A+H FVNG  SG+   TR              +G  R+ +                    
Sbjct: 513 AVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLSVAVGLPNVGGHFESWNTG 572

Query: 583 -----------QGE-DFSEQPWGYKVGLSGEQSQIFLDTGSSDVQW---SRLGNSSQPLT 642
                      QG+ D S Q W Y+VGL GE   +   T +  + W   S      QPLT
Sbjct: 573 ILGPVALHGLSQGKMDLSWQKWTYQVGLKGEAMNLAFPTNTPSIGWMDASLTVQKPQPLT 632

Query: 643 WYKTQFDAPPGDDPIALNLGSMGKGAAWVNGWGIGRYWVSFLTPK--------------- 702
           W+KT FDAP G++P+AL++  MGKG  WVNG  IGRYW +F T                 
Sbjct: 633 WHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYWTAFATGDCSHCSYTGTYKPNKC 692

Query: 703 ----GEPSQKWYNVPRSFLKPAGNQLVILEEETGNPIGISLDSVSITKTCGQVSESHYPL 757
               G+P+Q+WY+VPR++LKP+ N LVI EE  GNP  +SL   S++  C +VSE H P 
Sbjct: 693 QTGCGQPTQRWYHVPRAWLKPSQNLLVIFEELGGNPSTVSLVKRSVSGVCAEVSEYH-PN 752

BLAST of HG10003917 vs. TAIR 10
Match: AT4G36360.2 (beta-galactosidase 3 )

HSP 1 Score: 663.3 bits (1710), Expect = 2.3e-190
Identity = 367/824 (44.54%), Postives = 481/824 (58.37%), Query Frame = 0

Query: 43  VTYDGRSLIVNGEHKLLFSGSIHYPRSTPD------------------------------ 102
           VTYD ++L++NG+ ++LFSGSIHYPRSTPD                              
Sbjct: 33  VTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGIDVIETYVFWNLHEPSP 92

Query: 103 --YEFSGRRDIVRFVKEIQAQGLYACLRIGPFIEAEWSYGGLPFWLHDVPGIVYRSDNEP 162
             Y+F GR D+VRFVK I   GLYA LRIGP++ AEW++GG P WL  VPGI +R+DNEP
Sbjct: 93  GKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEP 152

Query: 163 FKLHMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYTLVEAAFREKGPPYVLWAADMA 222
           FK  M+ FT +IV LMKSE L+ SQGGPIILSQIENEY         +G  Y+ WAA MA
Sbjct: 153 FKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGRQGQLLGAEGHNYMTWAAKMA 212

Query: 223 VSLKTGVPWSMCKQNDAPDPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGEE 282
           ++ +TGVPW MCK++DAPDPVINTCNG  C ++F  PN P KP IWTE W+ ++  +G  
Sbjct: 213 IATETGVPWVMCKEDDAPDPVINTCNGFYC-DSF-APNKPYKPLIWTEAWSGWFTEFGGP 272

Query: 283 PYIRSAEEIAFHVALFIAAKNGTYVNYYMYHGGTNFGRSASAFMITGYYD-QAPLDEYGL 342
            + R  +++AF VA FI  K G++VNYYMYHGGTNFGR+A    +T  YD  AP+DEYGL
Sbjct: 273 MHHRPVQDLAFGVARFI-QKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYGL 332

Query: 343 TREPKWGHLKELHAAVKLCSTPLLTGTKSNFSLGQSLEAIVFKTESGECAAFLVNKGA-T 402
            R+PK+GHLKELH A+K+C   L++      S+G   +A V+  ESG+C+AFL N    +
Sbjct: 333 IRQPKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQQAHVYSAESGDCSAFLANYDTES 392

Query: 403 DTNVLFQNITYELPLSSISILPDCKNVAFNTRRVSVQHNTRSMKVVQKFGSSEEWQEFKE 462
              VLF N+ Y LP  SISILPDC+N  FNT +V VQ  T  M+++     + +W+ + E
Sbjct: 393 AARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVGVQ--TSQMEMLPTDTKNFQWESYLE 452

Query: 463 PIPNFDETKLRASE-LLEQMGTTKDRSDYLWYTFRVQQDSQDS------QQTLEVDSRAH 522
            + + D++    +  LLEQ+  T+D SDYLWY   V     +S        TL + S  H
Sbjct: 453 DLSSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSVDIGDSESFLHGGELPTLIIQSTGH 512

Query: 523 ALHAFVNGD-SGAFLETRV-------------AGLRRVRI-------------------- 582
           A+H FVNG  SG+   TR              +G  R+ +                    
Sbjct: 513 AVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLSVAVGLPNVGGHFESWNTG 572

Query: 583 -----------QGE-DFSEQPWGYKVGLSGEQSQIFLDTGSSDVQW---SRLGNSSQPLT 642
                      QG+ D S Q W Y+VGL GE   +   T +  + W   S      QPLT
Sbjct: 573 ILGPVALHGLSQGKMDLSWQKWTYQVGLKGEAMNLAFPTNTPSIGWMDASLTVQKPQPLT 632

Query: 643 WYKTQFDAPPGDDPIALNLGSMGKGAAWVNGWGIGRYWVSFLTPK--------------- 702
           W+KT FDAP G++P+AL++  MGKG  WVNG  IGRYW +F T                 
Sbjct: 633 WHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYWTAFATGDCSHCSYTGTYKPNKC 692

Query: 703 ----GEPSQKWYNVPRSFLKPAGNQLVILEEETGNPIGISLDSVSITKTCGQVSESHYPL 757
               G+P+Q+WY+VPR++LKP+ N LVI EE  GNP  +SL   S++  C +VSE H P 
Sbjct: 693 QTGCGQPTQRWYHVPRAWLKPSQNLLVIFEELGGNPSTVSLVKRSVSGVCAEVSEYH-PN 752

BLAST of HG10003917 vs. TAIR 10
Match: AT2G16730.1 (glycosyl hydrolase family 35 protein )

HSP 1 Score: 655.6 bits (1690), Expect = 4.9e-188
Identity = 347/835 (41.56%), Postives = 467/835 (55.93%), Query Frame = 0

Query: 22  VLTAALFFHCVLGGND--------DGSGSVTYDGRSLIVNGEHKLLFSGSIHYPRSTPD- 81
           VL   L F   L  +D        D    VTYDG SLI+NG  +LL+SGSIHYPRSTP+ 
Sbjct: 15  VLVILLSFSGALSSDDKEKKTKSVDKKKEVTYDGTSLIINGNRELLYSGSIHYPRSTPEM 74

Query: 82  -------------------------------YEFSGRRDIVRFVKEIQAQGLYACLRIGP 141
                                          + FSGR D+V+F+K I+  GLY  LR+GP
Sbjct: 75  WPNIIKRAKQGGLNTIQTYVFWNVHEPEQGKFNFSGRADLVKFIKLIEKNGLYVTLRLGP 134

Query: 142 FIEAEWSYGGLPFWLHDVPGIVYRSDNEPFKLHMQNFTTKIVNLMKSEGLYASQGGPIIL 201
           FI+AEW++GGLP+WL +VPGI +R+DNEPFK H + +   ++++MK E L+ASQGGPIIL
Sbjct: 135 FIQAEWTHGGLPYWLREVPGIFFRTDNEPFKEHTERYVKVVLDMMKEEKLFASQGGPIIL 194

Query: 202 SQIENEYTLVEAAFREKGPPYVLWAADMAVSLKTGVPWSMCKQNDAPDPVINTCNGMRCG 261
            QIENEY+ V+ A++E G  Y+ WA+ +  S+  G+PW MCKQNDAPDP+IN CNG  CG
Sbjct: 195 GQIENEYSAVQRAYKEDGLNYIKWASKLVHSMDLGIPWVMCKQNDAPDPMINACNGRHCG 254

Query: 262 ETFTGPNSPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIAAKNGTYVNYYMYH 321
           +TF GPN  NKPS+WTENWT+ ++ +G+ P  RS E+IA+ VA F  +KNGT+VNYYMYH
Sbjct: 255 DTFPGPNKDNKPSLWTENWTTQFRVFGDPPAQRSVEDIAYSVARFF-SKNGTHVNYYMYH 314

Query: 322 GGTNFGRSASAFMITGYYDQAPLDEYGLTREPKWGHLKELHAAVKLCSTPLLTGTKSNFS 381
           GGTNFGR+++ ++ T YYD APLDE+GL REPK+GHLK LH A+ LC   LL G      
Sbjct: 315 GGTNFGRTSAHYVTTRYYDDAPLDEFGLEREPKYGHLKHLHNALNLCKKALLWGQPRVEK 374

Query: 382 LGQSLEAIVFKTESGE-CAAFLVNKGA-TDTNVLFQNITYELPLSSISILPDCKNVAFNT 441
                E   ++    + CAAFL N        + F+   Y +P  SISILPDCK V +NT
Sbjct: 375 PSNETEIRYYEQPGTKVCAAFLANNNTEAAEKIKFRGKEYLIPHRSISILPDCKTVVYNT 434

Query: 442 RRVSVQHNTRSMKVVQKFGSSEEWQEFKEPIPNFDETKLRASELL--EQMGTTKDRSDYL 501
             +   H +R+    +K   + +++ F E +P    +K++    +  E  G TKD SDY 
Sbjct: 435 GEIISHHTSRNFMKSKKANKNFDFKVFTESVP----SKIKGDSFIPVELYGLTKDESDYG 494

Query: 502 WYTFRVQQDSQD------SQQTLEVDSRAHALHAFVNG---------------------- 561
           WYT   + D  D       +  L + S  HALH ++NG                      
Sbjct: 495 WYTTSFKIDDNDLSKKKGGKPNLRIASLGHALHVWLNGEYLGNGHGSHEEKSFVFQKPVT 554

Query: 562 ------------------DSGAFLETRVAGLRRVRIQG------EDFSEQPWGYKVGLSG 621
                             DSG+++E R  G R V I G      +   E  WG KVG+ G
Sbjct: 555 LKEGENHLTMLGVLTGFPDSGSYMEHRYTGPRSVSILGLGSGTLDLTEENKWGNKVGMEG 614

Query: 622 EQSQIFLDTGSSDVQWSRLGNSSQPLTWYKTQFDAPPGDDPIALNLGSMGKGAAWVNGWG 681
           E+  I  + G   V+W +       +TWY+T FDAP      A+ +  MGKG  WVNG G
Sbjct: 615 ERLGIHAEEGLKKVKWEKASGKEPGMTWYQTYFDAPESQSAAAIRMNGMGKGLIWVNGEG 674

Query: 682 IGRYWVSFLTPKGEPSQKWYNVPRSFLKPAGNQLVILEEETG-NPIGISLDSVSITKTCG 741
           +GRYW+SFL+P G+P+Q  Y++PRSFLKP  N LVI EEE    P  I    V+    C 
Sbjct: 675 VGRYWMSFLSPLGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKPELIDFVIVNRDTVCS 734

Query: 742 QVSESHYPLVASWMGAKKQTASGTKNRTRRPKVRLSCPTNKNISNILFASFGTPSGDCQS 757
            + E++ P V  W     Q  + T +        L C   K IS + FASFG P+G C +
Sbjct: 735 YIGENYTPSVRHWTRKNDQVQAITDD--VHLTANLKCSGTKKISAVEFASFGNPNGTCGN 794

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038885233.10.0e+0085.52beta-galactosidase 16 isoform X1 [Benincasa hispida][more]
XP_011657429.10.0e+0083.35beta-galactosidase 16 isoform X2 [Cucumis sativus][more]
XP_008466742.10.0e+0082.87PREDICTED: beta-galactosidase 16 isoform X2 [Cucumis melo][more]
XP_022993850.10.0e+0081.66beta-galactosidase 16 [Cucurbita maxima][more]
XP_031742962.10.0e+0080.54beta-galactosidase 16 isoform X1 [Cucumis sativus][more]
Match NameE-valueIdentityDescription
Q8GX691.8e-26757.07Beta-galactosidase 16 OS=Arabidopsis thaliana OX=3702 GN=BGAL16 PE=2 SV=2[more]
Q9FFN41.5e-21853.93Beta-galactosidase 6 OS=Arabidopsis thaliana OX=3702 GN=BGAL6 PE=2 SV=1[more]
Q75HQ39.5e-20545.97Beta-galactosidase 7 OS=Oryza sativa subsp. japonica OX=39947 GN=Os05g0428100 PE... [more]
Q6ZJJ02.1e-19143.44Beta-galactosidase 11 OS=Oryza sativa subsp. japonica OX=39947 GN=Os08g0549200 P... [more]
Q9SCV91.7e-19044.54Beta-galactosidase 3 OS=Arabidopsis thaliana OX=3702 GN=BGAL3 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A1S3CS540.0e+0082.87Beta-galactosidase OS=Cucumis melo OX=3656 GN=LOC103504084 PE=3 SV=1[more]
A0A6J1JZN00.0e+0081.66Beta-galactosidase OS=Cucurbita maxima OX=3661 GN=LOC111489734 PE=3 SV=1[more]
A0A6J1FGB80.0e+0081.66Beta-galactosidase OS=Cucurbita moschata OX=3662 GN=LOC111445236 PE=3 SV=1[more]
A0A1S3CRZ80.0e+0080.07Beta-galactosidase OS=Cucumis melo OX=3656 GN=LOC103504084 PE=3 SV=1[more]
A0A6J1CI460.0e+0079.13Beta-galactosidase OS=Momordica charantia OX=3673 GN=LOC111011842 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT1G77410.11.2e-26857.07beta-galactosidase 16 [more]
AT5G63800.11.1e-21953.93Glycosyl hydrolase family 35 protein [more]
AT4G36360.11.2e-19144.54beta-galactosidase 3 [more]
AT4G36360.22.3e-19044.54beta-galactosidase 3 [more]
AT2G16730.14.9e-18841.56glycosyl hydrolase family 35 protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001944Glycoside hydrolase, family 35PRINTSPR00742GLHYDRLASE35coord: 549..563
score: 57.65
coord: 53..70
score: 61.44
coord: 292..308
score: 49.48
coord: 576..592
score: 54.33
coord: 273..288
score: 73.16
IPR001944Glycoside hydrolase, family 35PANTHERPTHR23421BETA-GALACTOSIDASE RELATEDcoord: 73..489
coord: 39..73
IPR001944Glycoside hydrolase, family 35PANTHERPTHR23421BETA-GALACTOSIDASE RELATEDcoord: 490..750
IPR041392Beta-galactosidase, beta-sandwich domainPFAMPF17834GHDcoord: 332..402
e-value: 7.7E-24
score: 83.5
IPR031330Glycoside hydrolase 35, catalytic domainPFAMPF01301Glyco_hydro_35coord: 72..324
e-value: 7.1E-80
score: 269.0
IPR000922D-galactoside/L-rhamnose binding SUEL lectin domainPFAMPF02140Gal_Lectincoord: 679..756
e-value: 4.3E-19
score: 68.6
IPR000922D-galactoside/L-rhamnose binding SUEL lectin domainPROSITEPS50228SUEL_LECTINcoord: 671..757
score: 15.6759
NoneNo IPR availableGENE3D3.20.20.80Glycosidasescoord: 73..329
e-value: 1.6E-69
score: 237.0
NoneNo IPR availableGENE3D3.20.20.80Glycosidasescoord: 35..72
e-value: 8.8E-6
score: 27.4
NoneNo IPR availableGENE3D2.60.120.260coord: 524..642
e-value: 2.2E-17
score: 65.6
NoneNo IPR availablePANTHERPTHR23421:SF154BETA-GALACTOSIDASEcoord: 490..750
NoneNo IPR availablePANTHERPTHR23421:SF154BETA-GALACTOSIDASEcoord: 73..489
NoneNo IPR availablePANTHERPTHR23421:SF154BETA-GALACTOSIDASEcoord: 39..73
IPR043159D-galactoside/L-rhamnose binding SUEL lectin domain superfamilyGENE3D2.60.120.740coord: 661..757
e-value: 6.7E-14
score: 54.0
IPR025300Beta-galactosidase jelly roll domainPFAMPF13364BetaGal_dom4_5coord: 549..623
e-value: 2.5E-4
score: 21.6
IPR019801Glycoside hydrolase, family 35, conserved sitePROSITEPS01182GLYCOSYL_HYDROL_F35coord: 156..168
IPR008979Galactose-binding-like domain superfamilySUPERFAMILY49785Galactose-binding domain-likecoord: 516..661
IPR008979Galactose-binding-like domain superfamilySUPERFAMILY49785Galactose-binding domain-likecoord: 386..490
IPR017853Glycoside hydrolase superfamilySUPERFAMILY51445(Trans)glycosidasescoord: 42..328

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10003917.1HG10003917.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0048046 apoplast
molecular_function GO:0004565 beta-galactosidase activity
molecular_function GO:0030246 carbohydrate binding
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds