Sgr028903 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr028903
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: beta-glucosidase BoGH3B isoform X1
Location: tig00153210: 1356889 .. 1377676 (-)
RNA-Seq Expression: Sgr028903
Synteny: Sgr028903
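
The Location field above packs the contig name, start and end coordinates, and strand into a single string. The following is a minimal sketch of how such a string could be parsed. It assumes the coordinates are 1-based and inclusive (a common convention for genome annotations that this page does not state), and the names `Location` and `parse_location` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed feature location (hypothetical helper type)."""
    contig: str
    start: int
    end: int
    strand: str

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates.
        return self.end - self.start + 1


# Matches strings such as "tig00153210: 1356889 .. 1377676 (-)".
_LOCATION_RE = re.compile(
    r"^\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$"
)


def parse_location(text: str) -> Location:
    """Split a 'contig: start .. end (strand)' string into its parts."""
    match = _LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return Location(contig, int(start), int(end), strand)


loc = parse_location("tig00153210: 1356889 .. 1377676 (-)")
print(loc.contig, loc.strand, loc.span)
```

Under the inclusive-coordinate assumption, the feature spans 20788 positions on the minus strand of contig tig00153210.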
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCAAATTTCGGCTTTATTTCGTCGCGAGCAATCCATAATAAGCATATGCGAAGTATGCTGCTACGACTGTCACAACCGCCGCAGCCAGACTCGCCGTCACCGGCAACGGCATCCTTCCGACGGTGAAGCGGCCGGCAAGTGTCGAAGCTCTGATCGCTCTCGGCAATGGAGTTCCAATGCTCACCTTGGGCATTTCTTGATTCTTACCCACCTCCTCTGTTTTGGATTCCTTCGCCTTCGCGTCTTCTTTTTCCTTTCCCTTTTCTGGGTTTTGATTTTCGATTTCTTTCTTCTCCGTTGGCTTGTGATCGGGGCTGCTGATCATCTTCTGATCCTGCGTCGTTTTAAGCTCGCCGGAATCTCCTTGTACCGGCTTTGGCTCGGGCTCCGGACCCTTTTCAGCCTTTGGAGCCGCCGCCTCCTGCGTCGGACCTTCTTCCTTTACCTCCACCGGGGCTTGCTTTTGGAGTTCTGCGGATTTTCCCTTATCGTTATCCGCTGGCTTCACTGAATTAGAATCGCCTTTCGGAAGCGCCTCCTCCGGTCCCTTAGTTTCAGCCTTCGCCTCCGGCCGGGCTGCGTTTTCCGGCGAGATATCTTCCCTCCCCTTTTCAGGTGTTGGTCGTTCTGGGTCTTTGGGAGGCGGAGCCGCCGCCGTCCCTTGTCCGATATTTTCCGGGGAGATATCTTCCCTTCCCTTTTCAGGGCTTGGTTGTTCTGGTTCTTTGGGAGCCGCCGCCGCCTCCTCCGCCGTCCCTTGTCCGCTAGTTTCTTTGGGCATTGTAATGGTGAGAATTCCATCTTGAAATTTATGATAAACTTTGTCGATCATGCAATTTTCTGGAAGTGGGTAAGTTTTATCCAAACGGAACCATCTGTTGTTTCCCACATACCGATCTCCTGCCACCGCCACTGTCCGTGTCCCTTCTTCAACCTTAACCTTCACTTGCTGTGGGTTGAACTCTGAGAAACAATTTCAACCACAACAAACCCAATAAGAACAAAAAAAGATCAGACTCCAAAAAAACAAGACCCATCAGAACAATCAACACGACGGGCGACGGCGACATTTCAGAACACGTTAATGGAAGAGAAAGAAAGAAAGACGGCAAAATGTGGTGGAGGTGGCCGGACCTGGAAGTTGGAGCTGTAGAATATGAGCTTCGTTTTCTTCTCTTTCTTCAACCTTCGGCGTGAATGGCTCGTAGAAAGGGCGGAAGGACTGGCGGCGGAGAGCGTCAAGTCCGGCGATCCTTGGCCTCGCTGTCGTCATTTCTGGGTGGCCGAAAAAAATGGCGGTTTTTTTCGGGTCGGGTTTGTTGGGTCGGGTTTTGTGGGTAAAATGTAGGGCGTGTTTGGTGGTATTTATAGGGTTTGCAATGGGTTGCTTGGAGTTGAGATTATACCTTTTTTGATTTCTTTGATTAAGGGGACATGTATTACCCCTTTTTTGTAGTAAAGTATAACTTTCCCAACTCCATTTTGCTTCCAATTTGCTTCAATTATTTATTTATTATTATTATTTTTTGTCAATTACAAAGTTTTTGGAAAATGTTGGGCTAACATTCCTAATAAGCCTACATTCGTGTAGTCGAAGAGCGTTGAGAAGTTATGTAACTTGTGTTTGAATTATGTTGGGAAATTGTATTACAAAAATCTTAATTTATGTTTTGTTTATAAGTTTGCAGCTAATTTCAGTATAAATGGACTGGTTAAGACATTAATAATTTACTTTATAAGTAAAAGTTTTAAATTTCACTTCACATTTGTACTACTATATTCTAAACAAAAAGCTTGCAATTTGTAACAATAAGTGTGTAGCTTGAAAAATGTTGGGTTTTTAAGAAGTTTAGGCTTATGTAATTTATGGAGATTACAAAATTTATAGCATAGTATAAACAAATCCACGTACCAAAAATCCATGAAAAGTTCTTTTCAAAAAAGGAAATTGAAAAGAAAGATATGTGTGATTGAACTTTTAAGCATTTAATATCTATA
CCCATCAAATAACATCCGCTGTCCCAAATCAACTATAGATCAATTTTTCCACATTATAGAAATAAAAAAAAATTAATGACAAGTTTTTTTTTTGACAAAAAATGACAAGAAAAGAAAGGAACACGTGTGTCCATGAAGAGATAATTGAGCATAAAATTACAAAGCTTAATTAAAAAAATTGTGAAATCACAAATTTGGTAATTAAACTTTCAAGATATGTGTAATAAGTCCTCCTATTTTAAAAATTTTTAAGAGGTTTCTAAATTTTAAAAGTTGTATCTATAAGTCTCTATGTTTTAGAAATTTTCTAATAAGTCTATGAACTTTCAAATTTGTATTTTAACAGATCTTTATCATTGATTTCATCAATTCAACACTTATGTCATTAAACCCTTGAACTTATCTCAAATTAGTCTACTACATAATCTTTAAATAATAAAATCAACATGCAACATAAGCATTAAACTTATGGAGTTAATGACAAAAACCTATTAGACACAAAATTAAAAATTTAAGAATTTATTAGAAACTTTTGAAAGTTCAAATGCCTATTTTAAATTTTTTAAAGTATAAAAATCTATTAGAGGGATCAAATTTGTAATTTAACTTAAAAAAATTAATTAAGCTTAATTAAGAAGCCTAACCTTCTAAATCTTGATATTAAATAATGTGAAAGTAATGAGAAGAGAATCCTCCAAGTTTAGTATCTATGAATAGCGGCGTCCATTTCCCACGTTTTGCTTGGGTATTTTTGTAATTTTCTTGTCAAGGACTTTTTTCTTCGGTTTATTTATTTTTTAAAGAAAAGGGTATAGAAAATGTTTGGAAATATAGTAGAAATATAGTTTTCTATATTTTAAAATAAAAATTCAATGAAAAACATATTTGGATATTTATTTTTAAAAATTATTTTTATCTACTTTTACTGAAATGTTTTTATTTTATTTAATAAATATGTAAATTTTACTAATCTTTTGATAGAAGCCAGAAAATAAGATTCACACAAACTCGTACGTAGGTTGAGGACAAAGATTGTTTGAATAGTTGTTAAATTAGGTAGAATACTTTCAAGAATGATAACTAGATTAATGAATTATGGTCAATGTCGTATAAGTTGCAATTTTGAATTGGTAAAGTTTTTAAGATTAATTTATTTCATGTATTAATAGTTTGTAACCAAGTTAATCGTTGGCTAGTTGGGTCATTGTCAAATTTTATTTATGAAATGACTTGGAGCAACGATGATGAAAGTAAATATATTTTAGATGATCACAAATTGAAAGAAGTACCAACAAGACCAATAATGGCTAGGAAGAGAATGTTTTTTTTTAAGGTACCCTAAAGACTATATTTAATAAAGACGAGGTGGGGCTAGTAGCTTAAAGATTAAAGCACGTGGCTATAAACTAGGGCGTTGGGTGTTAAAATCCATCTTCACCCACAATCAACCGAAAAGGAAAGGATCCAAGTCTTAGAACTTGAGTATGGGTATTTTGTTGAAATGAAATGACCTTATCTTTTTATTTATTTATTTTTCTATTATTGGGTTAAAGGTGGAATTTTCTTATAAATTAAATATAATATACCCAACTCTAATCAACTTTTTTTTTTTTTTTCATTTAGAAGCCTACCACTTTTAAATTATATAACTATGAAAAAAAATTGTACGTCACACATTCATTCTATTTTATCATTTGAAATTTCGGTATAAATTTTATGAAAGTTTGTTAAGAAATTATCTTCATAATTAAAATCTTATCGAACTGCTACTTTTGTTCTTTTGAAAATATTACATCACATATATGGAATGAAGAATGAGGATCGAACGTACAATCAGAGATAGATATATCTTAATTACTAAACTATGTTCAAATTGTGCTAAATTTTATGGAGTATATGTTGAAGTTATTTTAAACATTTACACTTTTGTTTATAGTATAAATTATTTTCTTCTTTTCTATTTTCATTTAATAACCATCACAAATAGAATAGGGGTATGA
AAGAAGGAACCAAAATATGGATTAACCAAATTGAATCATTTATAGTTGGACAATGGTTCTCAATAACGATTTATGGTTTCATTTTTAGAGGTGTATTTAAAAAAAAAATAATTAATGACTTATAAAAATAAAAACCATGTAATAATTTTATAATTAAAAAAAATAACTATTTTAAACTATAACTCATTTCGTGAAAATAACGCCATAAAAAAATTGTCCTCCAAATATATTTTCACCATTTTTCTTGTTTTAAATAAGCCCCTAAATCAAATTCGTTCAATCCTCCAGTATTATAATTTATTTTCTTCAATTATTATATGCATTAAAAGTTTGTACCACCAACACCAATTCATTTTATAATTTATTGTTTTTTTTAGAATATCAATAATTTATTGTAATTTAGACCTTGAAGATTGAGTTGATTGATTGCTTTTCTATGTCCTGTGCATTTATTTCCTACCTTATTTCTCCCATCATCTCAATTTTCTACATGAATCCTATATATTTTCAAGTTTATCCTTCTAATTTTCCTTTTTCCCTTTCATCTTTTTCCATTTTGTTTTTCTTAAATCCCTTCTGGATTATTGTTAACTCTATTTTTTTATCATTCAATATTGGTTTTATTACTTTATTTTGATCTGCAGTTATTGAATGCATAGATGAAAAAATGACGAAGTCCATTCGATCAAAACATTTTTATATGTATATATAAAAGTGACATTTTTGGTAAAATATTATGGGTTTTGAAAGTTGTTAAAAATTTAAATAGATGCCCTAAAACTTATTTGAAAAAAATATAATACTTTTTTGAAAAAAAATGGTACAATTTTTTAAGAGACATGTTTTCTAAGTCAATGTGTTTAACTCTTTCATACAATGCATTAGGCTCGGTCAAACATGTCTCTTAAAATAACTAACAAAAAAAAAAAAAAGTAACTACATGGAAGTAAATAGAATTATTTCTTAAAATTTAACGGTTAAATTATATGTTTTAAAAACCAGGTACTAATTAAGTAGACACAAATGTAAAAATTGATGGAAAATATAATATATCTTAAATAATATTATAAAATTATGTGATTATTGCCATGTCAATATAAAATTAATAGAAAAAAGGTGAAGTCAACTTTTTGCAACAGAAACTAATAGAAAATTAAAGACACGTAGGATTTTAAACTAGTTTGAAATTTTCAAAATTTAAATCGTACGAATTTGAAACTTAAAAGGTAAGATAAATATTTGTCTAAATATCTACTCTTCATTTGAAATAAAATGTGATAAAAAAGTTTTCTATTGAGAGTGACTGCATCCATCATCTTTTTGTTTGTTGCCACTAAGAAGTCAAGTTTTAGAACTCTTGAAGAAGATGTAATTCCTACTTCCTTTGTGATAAATGTTCAAATTTCAGTAGTTTCTCTTTTACTTTTTCTAAAAAAGGGTTGAAAATGATTTTTAAGTAGAAGACAATAAAATAATAATAATAATAACACTAACATTACAATTTTTACAATATAGAATTTTTAAAAATTTAAGAAACAAATTAAAAATGATATATACTCGCCAAAAACAATTGCCTTTACTTTACATAAGAGGGTAACTACTGGAAAAAAATTAAAATTTTCAAAAATTTAACATGTTATAAATTTTCTCTGTCTGTCTTTTTGAAATATAGCATAAGTTTGATTTTTATGATTTTGTAAAGTGATACATATTTATATTATATATTTGAATTTGAATTACATTTAGATTGTGTTATTTTTTAGAATTTAAAACTTCTTGTCATATTCAAAAATACATATTTCTATTATATGATACTAATTCTGTAAAGGTTAAACATATATTTTTAGTTTCTAATGTTTAAATAATTTCAATTTAATTTTTTATGTTTCAACATTTTTTAATTATACAATTTTTATTGGTCATATTAAAATTAGCTAATGATAAACTGATATTATATAATACTATTTATGTAAAAGCCTCCAACAAATTAACTTATCAAT
TTTAATAATTGTTTAAAATTTTTGAATTAAATTAAAACAATAAAAAAAGTCACCATCTATTTCTTTTAGAATATTACAATCACATTGTTTTAAAAAAAGAAATAATATTTTAATGTAAAGAAAATCATTCCGACGGATTTGGCACGAAAACGAATGACGATCGATAATTAGTCCGCGGGACGCTAACAGAAGGATGAAAAAACCGAAGATTCGAATGGAACGCTCAAGATTTCCATCGTCTTAGAGTGCTTACCTGCTTCTGCGACTACTCCGCAGCCGTACGGCGATCGAAGTGTTTGGAAACTTTCTCCTTCTTCGAAATTTCAACGAGATAATTGGAAAATCGCAGGTAATTAATCGCATCAACGCTGTTACCCGTAGTGATAGAATACTGATTGCTTGTGGTTCTGGCAAACTTGAAGTCGGCGATTGTCTCCAAAGATAAGAACGATAACGTGGCTTTGATTTCGATATTCATCCGAGAAATTTGATGCCAAATTGCTTTAAAGGACGCACCAGAGAGCCGATCGGATCGGATCTGTTTTCCTCTTCTCTTCTTGTTCGCTACCTGTTCGGCCATACCAGAGAGCCGCCGGAAGATCTCCGATTTCAGGTAAGCAGGTTTCTGCATAACATTTGTTCTCGATTTTCAGTTTTGCGAGCATTTCCACTTTCCTCCTTTTCTTCTTCTGTTCATACAAAATATCTCACAGGAAATTAAAATGTGAAAAGATTTTCTTACATTGATCGTAATGAAATCACCTGTGATTTTTTTTTAATACAGTATCATTTTTCTTATGAATTTGGTACTTCTCATGATGGAGAATGGAGTTTGATGTAGATCTGGATTTTTTTTTTTTTTATTAATATTGTATTTAAAAAAATATCCAAAAATATTACTTTCAAACTTAAATTTATGATTTAATATAGTTGTTTTAATTAGTTGGACAATATCGTCATTTGAACAGACGACCTTTCAGTACAACGATATTGTCCAACTAATTAAAACAACTATATTAAAATTATAAATTTGAAAAGTAATATATTTTCGTAAATTTTTTTAAAAATATAATATTAAAAAAAAACTCCGAGATTTGGGTGGTATTATTTGAATAACATGAATTTGAGTCGAGTTTTAAAATGCTATTTTAGTTGAAAAGCAAAAAAGAGAACTAGCGTTACTCCTATTTTGGAGTACAAGAAAAAAATCTATTTGATTTTTGCTGCTTATTTTAGTGTGTGTAATTAAAAATAATTTGGTTCAATTACAAATTTAGTCCCTGAACTTTCAGAATTATATCAAATGTGCATTGAACTTTAAAATGTGTCAAATAGATTTATGAAATTTCAATTTTGTGTCAAATAAGTTTCTGAACTTTAAAATGTTTAATAGGTCTCTAAACTTTCAATTTTGTGTCTAACATGTCTCTGATGTATTAGATTTTTTTTTTAATTTACAAATCTTTTAGACACAATATTGAAAGATCACAGACTTATTAGATACAAAATTGAATTTTGTGTATAATAGGTCTGTCAATTTTTAAAAATGTCCAAGAAATTAAGGACGTATTAGATAGTTTTAAAGTTTAGGAACCTATTTGATATGTTTTAAAGTTTATGAACTTATTTGACACAATCATGAAAGTTCAAGGACTAAACTTATAATTTAACTAATAATTTTCAATCATGAATTAGGAGTGAACATGATCATGAAATTTTTAAAAGTAAACAGACTAAATTGAAATTAAGATTAAAATTCAAGAACTAAAAAAAAAAAAAAAAATTTGACCTACAGTATACTAATCAATAAAATTTTATAATAATCTATTCATGTCTATAAATATATAATGGGATCGGGACCCTTTTAATTATTTTTTAAAAAAGTATTCCAAGCTACCTTTTTTTTTTTTTTTTTTTTTTCCCGTTCAGTATTCTAAGCTACTTAATAATAATTTATTCGTTTTCTTTCAGATAACCCATATAATTGAAGTGTTTATATC
TTTATGTCGTTTTCCAGTATACTATTTGGTGTTATATATACACCTGGTTCGATCATTTATGTCCTATCCAAATGTAGGTTAGAAATGAAATATGAGATGTTTGCTCGTGTAATTTGTGTTTGAATTATTCAAATGAATAGATTAATTTAGGTATGTGTAATTTGGGTTTGATTTGTAAATTTATGGTTTATAATAGTTTATATATAATATAAGATGTGAGTTGCATGAACCTAATTTTATATAATATATAAGTTTATTAACTAATTTCAAATAAATAAATCCACCTGCTAAATCATTGAATCCAGTTTAATTAATAAGTCAAACAATCCTTTATCTTGCCAAATATTTTATTTCAAATTTTGGCCCCATCAATCGTGTAAAGGAACTTAGATCATCGACAGTTTTTTTTTTGTTAAAAAAACATTTTGGAAAATATCTAGATATTCTAGATTGTAGTTTTTTTTTTTTTTTTTTAACTGAAAGTAGACTCAGTATTATAGAGCTTAATTGATTTTTTTTTCTTGGAAAAAGAAAAGCTTTTTAGTATTAAATTCTTTTTTTTTTACTGGTTTTTAAAATATTTTCTTAAGAAATAAAAGAAAAGAAAAACCTATTTGATGACAATTTTATTTGTAACTTTTTTTTTTTAGGATATAGTATCACAATTATAAGAGTGAGAATTTAAACCTTCAACCTATGAAAATAGGTTATTAGTGGTTTAACCATTGAGATATACTCGTGTTGACTACAGTTGTATAAAAATATTACATCCCAAATTAAAAATAAATAAATAAAGTGCTAATTTTATGAAATATAAAAAAATAATGTTATTTATAACAACTCAATATGTTATTATGTTATTTGCCAATCCGACCATAACTTAGTGAATAAAAGCACTTATTAACTTTCTTAAAGTTAAAGGTTGTATCTCCCACCTCATATTTGTTGAACTAAAAAAAATATATATGTTATTATGTTCTTTACTCAAAATTTTATTTTTAAAATAACCTTTCTTTTTCTATTGGTTTTCATTTTTGTGTTTATGTACAGGATCGTCGATTCATGATGCTTAAATTAAAGCTGTTGTGGAACTGGAAACAGGTCTGTTTCCTTTTTCTTTTCCTTTTTTTCTCTCTTATAAATAATTTCTCTCGATTTCTCTTTCAAAAAAAAAAAAAAAAACTCTAAATTGAAAAATTCTAAATATATGCATATTTCAAAACCTTCATTATTTTTCCCCCTTTATCACCCAAACTCCAAAAAGTAAAATTAATTTTAATAATTTGTGTCATGTATATATAAATTTGCCTGAACCCAACTAACCCACCTGGCAACTTCTCAAGACCTTTTGACTTTGATTTCGTCCTCCATTGATTGATAATAATTTATCCATTATGAGGATTTTTTAAGTAAATAGTTTATCATTAATGGTAGTTGATTTAGTAATATAAGTATATTAAGTAACGATGGCTCTATAATATTTTTACTATTTTTATACAAATTACTTCTTAAATTTCCACTTCTATAAATGTTTTGTTGTTTCACAAATATTAGTATTATTTTCAAATTTTGGCAAGCGCGTCTTCTATTTTTTTTTTTTTTTTAAGAGGGAGATAATATTTTAATTTATGTTCTTACTTATACTATCCTCTTCTTTTGTATAAAAATTATTAGAGATGAGATAATTTAAACTCGAGATAAATATTATTAACTAACTAGCCTTTGTCAATTGAGTTGTTTCTACGAGACCAATCTTACTTATTATATGGATATATTAAAAATATTGTACAATGTATCAAAACTACTATAAAAATTTTACTCTGGATGAATATGGGGCGTGCTAACATTATAAAAAAAAAACTTTAGTTTTAATGCCTTTATTACACTCTGAATGAGCTAAAAATCTCATGTACAATTCTACGAGTAATAATGGTAAGGGTGACTTCTTTATCAATTTTCTATTATTACTCCCAACTCTTATTCTATATAGGATTAAGAAA
GTCCTAACTAGAGAACATACCCACTTCAAAATTTCCCGCATCCTACTATTCTTTCTTGTTTGAATTAATTAATGTAAGAGTACTAGTCTAAGGCTTAGTTTGGGATTGCTGTGATTGTAAAAATAATTGCTAGTAATTATGCTTTTAAAAAAATTAGCTGTATTATGAAGTAAATTAGTGTTAGGTAAATTTCAATTTTAAACCTGAAAAACCAGAATATAGTGTTTGGTAAATTAATACTAAATTGCTGTGATTTAAAGTTCAATTATTTTTTTTTATATTGATACTCAATATCTAAATTTATTATAAGATAATTATTTAAAATTTTATATGATTTAAATTACAAATTATTATGTTTACTCTAAATTATTTGACAAAAGATTAGGAAAATAATTGTTATAAAAATTGAATTGATCAAAATTTCGTTTCAATTTAAAATATAAAACATTATAAAACCCTACAAAAGAAATCCCAAACGAGCCCTAACAATCTTAGAATGGACAATATTTAGTCAATTTAAAATTTCCATTTAAAGTATATACAATATCTCTATCTCACTTGTAATTTGTTTTTAAAAAATTCTAAAATAAATATACATAGTTTTAAGTAGATATAAACTTTGCAACGAGTGTCAACTATTATAAATTTTAGGATAAGTATTAAAATCTTATGAGTATTTTGATATTGGTTACTAAAATTAATTAATATGAATTTTAGTAAATAATTAAAAACATTGTCCTCTTTTCTAGATTTGAAACATCCAACTAAATAGTTTTGTTTGGTCCATAAACTTTAAAATATTTTTATTTTAGTCCATAAACCTTACACAAAAATTATTTTAGTCCTTGTTTATTTATTTAGTTTTCTTAAATAAAATTTTAATATAAACCCAAATTTTGTTCATTTATCTAATAAATAAATATTAAGCTACGTGTTATAAATTAGGTATCAAATTACAAGATATAAAATTCAAAAACTAAAATAAAAAAAGTTTAGAAACTAAAATAATACTTTCAAAATTTAGGATCGAACACAAAATTTGAACAGAGAAAAATTGTCCATCACATGTTAGTTCCCAAACACGTCATCTCTTGGTTCATTTTTTTAAAAGCAGCTATCGGTTTTTATTTATTATATTTTTATTTGATATTTGTAAATTAAAAAAGGTTAGAGTGAAGGGCAGAGCTCGTCCAATAATATGATAATATTGAAGAGAATCTGAAAAGCACTGCAAAAAGCCATTTTTCAAAATAACTCTCTCTTGGCTAAAATGTTTTTTTTTCTTTTTTGGAGAAAAGCTAAAACCTTTTTTTTTTCGAGTTCTGTTTTTCTTTCTTTATTTATTTTTAATCGCACGATCGCATAAACAGTGATGACTGCTTTTTTTTTTTTAATTATTTTTTTAATATTTTAACCAAAGTTAATTTTTTTTTTGGGTGATTTTGAAAAAGTGATTGGGGGAATTTTTTAAAATATTATTATTTTTTTAAAAAAAAATTTCATTCTTAATTCTTGTGGAGTCATGTGGAGGTTTTTTTTTTGAGTTGGAGTTATGCGGAGTTTAAATCTCACAAATCGTTTGTCTTTTACATATAAAAAAACTCACTTCATTTCTCTCATTTTTTTTATTCAACTCTAAATCCTAACACAATCAACTAATATAGTATAAGCTCATTATTTGACTAAATAATAAGAATTAGATTGAAAATTTTAAACCTATAATGACCAAGTTGAAACTAAGCTTAATTATAGAAACTAAATTGAAAATTTAAAGATCATACCAGTCAAATATAATAGAAATTATATCAAAATTATATGGGCTAAAATTTTAATTTAATTTTTTTAAAAAAAACTTAAACAAAAATTGAAAAATCATTTTTCTTTTTTTTTAAATCGGGCGCACGATGAATTTGCCACTCAACTGTGACTCAAAAGACACGTTCAGTAATGGCGTAAAAAATCTCTCTCTCTCACTCACTTTACCCAAGAGCAGCCCACGT
TTTAGCTTTCTCGCTCCCAGCTCTCCACTTTTACGGATTCGCCATTCCCGAGCTTCACCACTGCAAAAGTTCTCTTCCCCTTTTGCAAAGCTCCTCCACTTCTTCCTCCTCACTAAGCTCAATTGTAACACTCTCCCATTTCCAAGTTTCCATTGGCTTGAATTACTTTTACGTCTTTTTGCTTTCTGGCACTCTCATTCACACTTTCAGGTAACCCCTCTCTCTCTCTTGATGTTTGTCGCTGGATTGCTTTTTCTCCTAGTCTGTATTTGATTGAGATACTGAAAATAAATGGGATTGAGCGGGAACAAAAAGTAGTGTTCAATCTTCCTTGATTCTAGTTTGGGATTGTTTTTTGTGATGCCTGTTCTTCAATTTACCTCAAGAAACCGATTGCAGCCAATTGAAACTGAGACGGAATGAACAAAATCTTAAAGAAAACAGACGCAACGACGCTGGATTGGGAGGGGTTCAGAATAGAGAAAAGAACATGAAAAAGAACGAAGAGATACTATCAACTCGCTCTCTTTGGTCACTTCTAACTAACCCATCAAATGGAAACTGAAAGTTAACTGAAAAATTTTCATAACTAAGCTAGGAGATCTTAGGGTCTCATCACCAATTTTAACACTGAAATATCTGATATAGCATTCGAAACTAGCGATTTCGGGCAACTCCGATCGTAAGAGAGAATCTTCTTTCTTCTTTCTTCCATTTCCTGCATTTTCTTGGCAACTAAGTACATTGTACACCAAATGGGTTCAATTAGTGTTGATTTTTGGTCTTTTCAGTGTGGGTTCACCTCTCAGAAAAGGAAGATGGCCAAGATTTTTGTTCAGGTGGTTGCGATTCTGTGCTTGGGTTGGTGGTGGTGGGCAACGATGGTGGACGCGGAGTACTTGAAATACAAAGACCCTAACCAACCAGTTTCTGTTCGAGTTAAGGACCTTCTTGGCCGCATGACTCTAGAAGAGAAAATCGGTCAGATGGTTCAGATTGACAGGAGCGTTGCCAATGTTACAGTTATGAAAGATTATTTCATTGGTAAAGCAATAATCTCTCTCCTTTTCTTTTACCTTCTCAGTTTTATGTACATGCTCGACTATTTTTTTTGTTAAGTTTTGGCCTAGTGGTAGTTCAGAGTAGGGTTTGGCGTTTTTTTCATTTTTGCGAGTGCCATGTTCTTGACCTGGTTTACATCTATTCTGAAAGGAAGTTGTTCTAGGACCCAACTACATTCGATTCCTGTGGCTAAAAGTTGAGATGGCAAATAATAAATGATTGTGTTCCATTATGACTTGTCACAAGTAGTAAATGATTGAAGAACTTCTCATTCATGCAAAACCCAAACAATAGCATTTTCAAAATTTGATAATCAAAGTCAAATGACAACTTTTCTGAAAGGGAAGCATAAACTGCGAAATTTGTACAGGAAGTGTGCTAAGTGGCGGTGGAAGTGTGCCGCTTCCAGATGCTAGTGCTCAGGATTGGATTAACATGATTAACGATTTCCAGAAGGGTTCTCTTTCTAGTAGATTGGGCATCCCAATGATGTATGGCATTGATGCTGTTCATGGCCATAACACTGTTTACAATGCTACCATATTTCCTCATAATGTTGGACTTGGAGCTACCAGGCAAGTGATTTTCTTTTCCTTATAGCCCTGCACATATAAATTTGTACTATACTATGTCAGTGTTTGTACAGATCCTTTGCTTCATATATCTCTTTGACGTAGTCTTTAGTTTAACCAAAACTTATGCTTAATTGAGCTGTGTTGAAGTTTGTCCAGCTAAATAACTCATTTTCTTTTGAACAATAAAAATTTATTTTGATGGGTCATGGTAGAATGCTCTCCTAAGTACCATTCAGCCGTCCAGTGGTTGGATCAAATTCATGTTGTACAAATTATTTCCTTGTGCTGCTAGTTTACTTTAGTAGATCCACTTCCAAATGTCACAACTTCTTTATATTTGTTCTTTCCCTCTCCCACACAGAG
ACCCTGGCCTAGTTCGAAGGATTGGTGCTGCAACAGCACTTGAAGTTAGAGCGACGGGGATTTCTTTTGCCTTTGCTCCATGCATTGCGGTAAACACATTTTTGATTGAATACCTCACTATTTGTTTTAGTGGAGAGTTTGATATATAACAGTCTCTTTCGCTGGAAATCTTCTTTTAGGTTTGTAGGGACCCAAGGTGGGGGCGGTGTTATGAAAGCTACAGCGAGGATCCAAAAATTGTGCAAGAAATGACCGAGATTATACCTGGTCTGCAGGGAGAGCCTCCTGCTAATTATCGGAAGGGGATTCCATATGTTGGTGGAACGTATGGCCTTTGAAGATGTAGAAGTTTGTGTTGTTTTTAAATATAAAATTGTGAATCCATGCAAAATATTTCCTAAAAGGCTTTGCACGCCAGTTAGGAAATGTTTTACCTTCCAAAATTATGCAGTGGAGTCAACAATTTCAAGTTTTAATCTTCTGATCAAGAGAAATCATATGGCATATAAATGTGTTGGTCCATCTTAATATAACATTCTGGAAGTTGAATGTGGAAATGTTGCAAATTTAGGAATGGAGCTCTGCACATGCTATGCATTCTGCGAAACTGTTATTCTTCAACCAAATTACTGAAACACTCTGGTTTCTTTGTAAACTCTCTCTCCCCCTCTCTCCAAAATTATTTAAATATACTTGTAATTTTTCTTGCAGTAAGAAGGTTATCGCCTGTGCAAAGCACTTTGTAGGAGATGGTGGGACAACTAATGGCATCAATGAAAATAATACCGTTATTGACAAGCATGGACTGCTCAGCATTCACATGCCTGCCTATTTAGATTCGATCATCAAGGGCGTTTCAACGGTAATGGCTTCCTATTCAAGTTGGAATGGTGTAAAGATGCATGCAAACCATGAGCTGATTACTGGCTTCCTCAAGGGCACTCTTAAATTTAAGGTTTGTCTGACAATTTAAATTTGTTTGGAATTTGTTTTTAGATACCATTTTGCTGATATTTGTCTTCTGCAATTATGCCACTATCTGATGTTTACTAATTTTTCTTTGTGCATTTTTAGGGTTTTGTCATCTCAGATTGGGAGGGTCTGGACAGAATTACTTCTCCGCCACATGCTAATTACACCTACTCTGTCCAAGCTGCAATTTTAGCTGGCATTGACATGGTTGGTGGCCTCTCTTTAAGGAAACCTTACATTAAATTTGGACTCTGTATATATGACTGATGCATTGTTTTGAATGTTGGAATTGAGTGCAAAATTTGATAAAGGAAGTTTGTGTATTTAGGCTTTGGTTCTGTGCTCTGCAGGTCATGGTTCCTTACAAGTATACAGAGTTCATTGATGATCTTACGTATCTAGTCAAGAGCAATGTCATTCCAATGGATCGCATTGATGATGCTGTTGGGAGAATTTTATCAGTCAAATTTACAATGGGTCTTTTTGAAAGCCCCTTAGGCGATTACAGCCTTGTCAATGAACTTGGGAGCCAGGTCTGACTGCAAACTATAGCAAAATATATAGATAACTTAATTTTTAATTGGTTATTCAATTTACATGAAGTCGATCTTTAGTTAGAAAATCAGATGGGGGTCATTTCAGCATCTTAAAAGCAATTATATGTCATAACTTATCTTCTTTTCATTTTCAATATCTGAGAACGTTTCTCATGCTTACACTTCTGTAAGTTGGAGTAAGAAGTCTCATGGACGTAGTCACGTAATCTTGCATCTATAATTTATAGGCACATAGAGACTTGGCAAGAGAAGCTGTGAGGCAGTCGCTCGTACTGCTGAAGAATGGGAAAAATGATAGTCGCCCATTGCTGCCCCTTCCAAAGAAGGCCCCAAAGATCCTGGTTGCTGGCACTCATGCTGATAATTTAGGATATCAATGTGGTGGATGGACAATTGAATGGCAAGGGTTCAGTGGCAACAATGGTACAAGGGGTATGTTATTTTCACCCATCCTTCGGAACCATAAATCAA
ACATTGAATTCTTCCTCAAACATGAGTTTTAAAAACTTCCCCAAAAAATAAAATTTATTGCTTTTTGAGAGCTATCAGAATTTAAAATTTTATGCATAAAAATTTTTTCCCAGCTAGTTGGGAAGATCTCTGGGGGGAATTTTGAAACTGAAGTATCCTTCTTGAGTTGTTCTAATAATTTGGAGATGTGTCTTAGCATTCTTATAGAATGTATATATCTGTAAATAAATGTAGAATTTTTTTATTGGGAGTTTAGCTCTGCAATAATTAGCAGTAGATAACAAAGAGTTTCCCTTTTGGACTTCTTAGACGACAGAAATTCAGTGGAATTCTATTAACGAAAAATGTAGTCTATTTAGGCACATATATTTAGTATAGCCAGATTATCCTCTCCTTTATTTGTTCACTCTTCAAGTTAATAATAGACCTCTTTGATATTAGAGTAGAAATCAATGTACATTTCATTTTCCCAAATGGCAGGAACTAGCATCCTCGCTGCTATCAAATCAATGGTCGATCCAAGCACAGAAGTTGTATTCCGTGAGGATCCTGACAGTGACTTTGTTAAGTCCAATGACTTTTCATACGCCATTGTTGTCGTTGGCGAAACCCCATATGCTGAGACTGAAGGGGATAGTACAACGCTTACCATGTTGGATCCTGGTCCAAGCATCATCAAAAACGTTTGTGATTCTGTAAAGTGCGTGGTGGTAGTCATTTCTGGAAGGCCAATAGTGATCGAACCATACATTTCATCAATTGATGCTCTTGTAGCAGCTTGGTTACCTGGCTCTGAAGGCCTAGGAGTCACTGATGTCCTTTATGGAGACTATGGGTTTAGTGGGAAGCTTCCAAGAACATGGTTCAAATCTGTAGATCAGCTACCAATGAATGTTGGAGATCCACACTATGATCCACTTTTTCCTTTTGGGTTCGGACTCACAACTGAATCGGTTAAGGACCTTGTCTCGAGGTAAGTACTAATTACTCCATATTATCTGATGTGCATTATTTGACCAATATCCACTTGTGTTCTGTGAATTTCTGAATGAAGCCAAGAACTCAGTTCTGTTTTGCAACACCATCTATATTGTTTCCTTTCACCATATTTCATGCCAAGTCACGTTCTAATTTCCATTTTGAAATTAAGAAATATAACCCAACTGATATTTGAGTTATATGCTCAGGCTGGGACAGTATATTCTTTTCCATCTCTTTTTCTTCCATGTTAAAATCTTACTGCATCTTGTGAATTGTGATCGCTTCCCACTTTCTCAAATGGAAAATGATATTTGTGTGAATTAAACTTGGTGATCTATCTTCAGGCTTCCTCCTGTTCTTATGTGTTAATCCCTTTCAACTGAATTCTGATTCTGTCGTTTTCATTTCATCATCCTTAGGTCTACATCGCCCGGAATTCGTGAAACTCCATCCTTTCTTGCAATGATCGTTGCTACAATTGCCATTTGTATATTGCAGGTACACTTCTAATAAGTTCTAACCAGCAGTCAGCCATAGGCATTAGTTTGTTCAAAACTTTAGCAATTATTTGTAGGAAATTAGAAGCTCTGAGGTTTAAGATATCAGCCTTGTAGGTTGAAAGAGCCTCAAATGCATGGCCATTTATTCAGTACCCTTGATTTTATGGATTGTAATAGCAAAAGCATCTCAGTTTTATTCAAATTAATTTATTTTCTCTATCATGATGTCCTCTGGCAGTATAAAAACAACTCTTGGGATGATTCCTCCACTGATAAAAGACTCCTCAGATAATTATTTTCTCAGTTTAATTATCAGATTATTTTTTTAATTATATAAGCCAAATGTAATTTTTATTTTATAAGCATAATTTTCAAATTTATTGTTATATATGTATATATATTTATAAAATATTATAGAGGTAAAAAAAATTGAGAGAAAAATAAGTTAGAAGAGAGAAAAAAAAATTTAGAGAGCCAAATCAACCGGGGGCGGGAATTGATTCAACAGAGAAATTCCAA
TTGAAAGCATAGAAAAAATAAAAGAAATATTGGTGAAATTAAAGTTGAATTGAGGGGTGAGATGTGAAATCATTGCATGCAACAAATGTAAAAATATAACGGTCGGAAATATAATGTATGTATAAAGTAATTAGAAGAAAGTGTTGTTAAGGATATTAGTTCGGTGAGATTCAGAAGAAGAAGCATCCTCCAACTCCAAGCATGTGTAATGCAGGTGCATCCATATCAAACCACCCTCGCCCTCTTCCTCTTCCTCTTCCTCTCCCACTTTCCTCTAATAATATTCAACACCAGGTGTCACGTCAGCATGGAATGCAACTTTATAAATAACACGATGTCTAGCAAATATTTAATTAATACCTGTAAGGTCATTAGAACTGTGGTTAAGCAGCCCACAAACTCCTTCCCTGACACATCCATCAAGTAATTCATGTATACATACTAGAGTAGAGTGTTGAGTGTTGAGTGTTTGTGATTTAGGAAGAGGCCTAGGAGAGGGTGGTGTAATGGTATGTAGTAATTTTTTTCTATTTGGAGAAGTTACCAGATGACGATGACGATGAAGATGCAGATGATTCCCATTTATGGAACCTGACCAGCCATGAATCGGAAGCGATCTGTACGGAGGAGAGTCCATCTACCCAGCTTCTGAATGGGGTCCCCTTCTCCTCCCCATATAACCCTCCCATTCCCTCTATCCATTCCTCCATACCTCTCTCATACCCACTCCCATCCACTACTATTCCCCCTGCATCCTGCATTTCCAAACAATACAACTTTCAAGATAACATCTCCCACTTATACTCCTAATTCTAACCATTTTACACTTACTAACATCGACTTCAGATTTTCAAGATACTGCTCTGACTTCTCCACTTCCCCGCGTCTCCATCTCTCGTAAAGCACATAAAACTTCACGACTTCGAAACAAGGATCAATACTCTCCACTCTGCAGCATCCTCTGCAATCTACCATGTCTCTGGGAGACATGTTTGGACCGAGTTGGAAGGTCCCAATAGCCTCTATGATCCCACATGCACATCTCTCTTTTGCATGAATTATTTTCGGGTTCTCCTTCGCATTCTCAACGTACCATTGCAGTAGCTCTTCCTGTGCATTGCTTACCTGCACACCAAGGAAAAAAGATCACACCATAGATCCTACTGTTCACAAGGATATGTTGATGATAATGCATCATTGGTTATGTGCTTGGGAGATGAAGGCTTTTTAGTTAAGGCCACAGACCATCACACCATAGACGTCGGGGACACTGAAAAGCTCGGCGTCATTTCCAGAGTCGCCACAAACTAGAACATTGGCTGGAACTTTCCCATCTGCTTTGAACTTCTTGAGAACATATGCAAGTGCCTCGCCTTTGCCAGCACCTTTTGGCAATATGTCCAATGCAATTCCACTGCTGTAGATTACTTTTACATCTAACTGATAAAGTATATTCAATGAGTTAAAAGAACACTTAACAACCTTCTTCCCTGCTCTATGCACAATAAAAGGGAAACAAAACCAGATGAGTGTAGAAGAGAAAGATAAAGAGCATAGTTTTGTTTTTTTCTTTGTTTTTCCTTTGTATCGGGATGTCGGGATTTACTCCTTGCTCATGACCCACCTTCCACTAATCTCGGGAGGCTTCGCCCAGCCCGAGGCTAACTCAGGGAGAACAAACAGAGTGTTTATGATAGCTAAAATTGAATTCAGCAAGTCCAGTTCAGACCAACAAAAACAGACTTACCCCTCGTTTCTCTAAACATTCAGACAGAGAAGTCGCCACTTCTGGGGCTTTATCCTTCTCTATGAAGAAGCTAACTTTATGGGGTCGTTGCTCCGTTTCTGACTGCGATATCCTCATTTTGTCGTGCAGGTTGAAGAAATATCCCAATCAAAACAATTAGGGCTAAAAAGCATCCTCAACAAGCAATGAGAATAAAGAATCCCCATTCGATATACAACTAAGGAATGAAAATCTCAAGCAAATTACATCAAAC
TGGCGTGCGTCTTTACCTGCGGCTTGAGTTCTGGAAATTTTACGATTTCCTCCACGACCACATCTCGATTCCACTTCTGATTCAGAAATTGTTCCCATTCCTCGTCTGTTACCATCGGATCATCATAGGCAATCTCCGTTCCCACAGACATAATTGTTATATCTGGTGTTAACAGAGGCTTTGTCCTCCTTAACCTTCTGTAACTTGTCGGCGATCTTCCGGTTGAGAATACGAGCAGAGAATTCCGACGGTAGTAGGCTTCCCAGAGGGCATTAAACTTGAGAAGCGAGGTGTTCTCGTTATCATCATGATCGACCTGTAATTATAATCCCCATTTCGCTAAAATTTCCGAATGAAAAAAATAGGGGGGAGAGGAATGGTGATTTCGCGAGGTTGGAGAAATTCACCATTGTGAGATCAAGATCGGAAACAATCATGAGGTTGGCTGAACCATCTAGTCGATCCATAGATGACGGGTCCCTTCTTCAAATTCAGATTAGGGCAACTCAGGTTCCAAAAAGTGAAGAAACGAAAAATTGATGGGTGTCCACTGAATCCCTTTTCCTCCTGAACGACGAAATTCTTGTCTTCGGCGTCGAATCAGAAGAAACCTTTTAGCTTTCTATCAGGATACGCTGACTTCACCGATAAAAGTAGATCTTTTACAATGTTCAAATGGCAGACTGCTGAATTATCAAGCAATCCGTAGGCTTAAATACGACACACAAATGCCAACTTTGAAGTTTGGGATTTCAATTGAACCGCCTAACCACATGAAGATGAACTGA

mRNA sequence

ATGCAAATTTCGGCTTTATTTCGTCGCGAGCAATCCATAATAAGCATATGCGAAGTATGCTGCTACGACTGTCACAACCGCCGCAGCCAGACTCGCCGTCACCGGCAACGGCATCCTTCCGACGGTGAAGCGGCCGGCAAGTGTCGAAGCTCTGATCGCTCTCGGCAATGGAGTTCCAATGCTCACCTTGGGCATTTCTTGATTCTTACCCACCTCCTCTGTTTTGGATTCCTTCGCCTTCGCGTCTTCTTTTTCCTTTCCCTTTTCTGGGTTTTGATTTTCGATTTCTTTCTTCTCCGTTGGCTTGTGATCGGGGCTGCTGATCATCTTCTGATCCTGCGTCGTTTTAAGCTCGCCGGAATCTCCTTTTCTGCGGATTTTCCCTTATCGTTATCCGCTGGCTTCACTGAATTAGAATCGCCTTTCGGAAGCGCCTCCTCCGGTCCCTTAGTTTCAGCCTTCGCCTCCGGCCGGGCTGCGTTTTCCGGCGAGATATCTTCCCTCCCCTTTTCAGGTGTTGGTCGTTCTGGGTCTTTGGGAGGCGGAGCCGCCGCCGTCCCTTGTCCGATATTTTCCGGGGAGATATCTTCCCTTCCCTTTTCAGGGCTTGGTTGTTCTGGTTCTTTGGGAGCCGCCGCCGCCTCCTCCGCCGTCCCTTGTCCGCTAGTTTCTTTGGGCATTGTAATGAAGGATGAAAAAACCGAAGATTCGAATGGAACGCTCAAGATTTCCATCGTCTTAGAGTGCTTACCTGCTTCTGCGACTACTCCGCAGCCGTACGGCGATCGAAGTGTTTGGAAACTTTCTCCTTCTTCGAAATTTCAACGAGATAATTGGAAAATCGCAGGACGCACCAGAGAGCCGATCGGATCGGATCTGTTTTCCTCTTCTCTTCTTGTTCGCTACCTGTTCGGCCATACCAGAGAGCCGCCGGAAGATCTCCGATTTCAGGATCGTCGATTCATGATGCTTAAATTAAAGCTGTTGTGGAACTGGAAACAGTGTGGGTTCACCTCTCAGAAAAGGAAGATGGCCAAGATTTTTGTTCAGGTGGTTGCGATTCTGTGCTTGGGTTGGTGGTGGTGGGCAACGATGGTGGACGCGGAGTACTTGAAATACAAAGACCCTAACCAACCAGTTTCTGTTCGAGTTAAGGACCTTCTTGGCCGCATGACTCTAGAAGAGAAAATCGGTCAGATGGTTCAGATTGACAGGAGCGTTGCCAATGTTACAGTTATGAAAGATTATTTCATTGGAAGTGTGCTAAGTGGCGGTGGAAGTGTGCCGCTTCCAGATGCTAGTGCTCAGGATTGGATTAACATGATTAACGATTTCCAGAAGGGTTCTCTTTCTAGTAGATTGGGCATCCCAATGATGTATGGCATTGATGCTGTTCATGGCCATAACACTGTTTACAATGCTACCATATTTCCTCATAATGTTGGACTTGGAGCTACCAGAGACCCTGGCCTAGTTCGAAGGATTGGTGCTGCAACAGCACTTGAAGTTAGAGCGACGGGGATTTCTTTTGCCTTTGCTCCATGCATTGCGGTTTGTAGGGACCCAAGGTGGGGGCGGTGTTATGAAAGCTACAGCGAGGATCCAAAAATTGTGCAAGAAATGACCGAGATTATACCTGGTCTGCAGGGAGAGCCTCCTGCTAATTATCGGAAGGGGATTCCATATGTTGGTGGAACTAAGAAGGTTATCGCCTGTGCAAAGCACTTTGTAGGAGATGGTGGGACAACTAATGGCATCAATGAAAATAATACCGTTATTGACAAGCATGGACTGCTCAGCATTCACATGCCTGCCTATTTAGATTCGATCATCAAGGGCGTTTCAACGGTAATGGCTTCCTATTCAAGTTGGAATGGTGTAAAGATGCATGCAAACCATGAGCTGATTACTGGCTTCCTCAAGGGCACTCTTAAATTTAAGGGTTTTGTCATCTCAGATTGGGAGGGTCTGGACAGAATTACTTCTCCGCCACATGCTAA
TTACACCTACTCTGTCCAAGCTGCAATTTTAGCTGGCATTGACATGGTCATGGTTCCTTACAAGTATACAGAGTTCATTGATGATCTTACGTATCTAGTCAAGAGCAATGTCATTCCAATGGATCGCATTGATGATGCTGTTGGGAGAATTTTATCAGTCAAATTTACAATGGGTCTTTTTGAAAGCCCCTTAGGCGATTACAGCCTTGTCAATGAACTTGGGAGCCAGGCACATAGAGACTTGGCAAGAGAAGCTGTGAGGCAGTCGCTCGTACTGCTGAAGAATGGGAAAAATGATAGTCGCCCATTGCTGCCCCTTCCAAAGAAGGCCCCAAAGATCCTGGTTGCTGGCACTCATGCTGATAATTTAGGATATCAATGTGGTGGATGGACAATTGAATGGCAAGGGTTCAGTGGCAACAATGGTACAAGGGGAACTAGCATCCTCGCTGCTATCAAATCAATGGTCGATCCAAGCACAGAAGTTGTATTCCGTGAGGATCCTGACAGTGACTTTGTTAAGTCCAATGACTTTTCATACGCCATTGTTGTCGTTGGCGAAACCCCATATGCTGAGACTGAAGGGGATAGTACAACGCTTACCATGTTGGATCCTGGTCCAAGCATCATCAAAAACGTTTGTGATTCTGTAAAGTGCGTGGTGGTAGTCATTTCTGGAAGGCCAATAGTGATCGAACCATACATTTCATCAATTGATGCTCTTGTAGCAGCTTGGTTACCTGGCTCTGAAGGCCTAGGAGTCACTGATGTCCTTTATGGAGACTATGGGTTTAGTGGGAAGCTTCCAAGAACATGGTTCAAATCTGTAGATCAGCTACCAATGAATGTTGGAGATCCACACTATGATCCACTTTTTCCTTTTGGGTTCGGACTCACAACTGAATCGGTTAAGGACCTTGTCTCGAGGTCTACATCGCCCGGAATTCGTGAAACTCCATCCTTTCTTGCAATGATCGTTGCTACAATTGCCATTTGTATATTGCAGAAGTTACCAGATGACGATGACGATGAAGATGCAGATGATTCCCATTTATGGAACCTGACCAGCCATGAATCGGAAGCGATCTCAAGTCCAGTTCAGACCAACAAAAACAGACTTACCCCTCGTTTCTCTAAACATTCAGACAGAGAAGTCGCCACTTCTGGGGCTTTATCCTTCTCTATGAAGAAGCTAACTTTATGGGGTCGTTGCTCCGTTTCTGACTGCGATATCCTCATTTTGTCGTGCAGGGGGGAGAGGAATGGTGATTTCGCGAGGTTGGAGAAATTCACCATTGTGAGATCAAGATCGGAAACAATCATGAGGTTGGCTGAACCATCTAGTCGATCCATAGATGACGGGTCCCTTCTTCAAATTCAGATTAGGGCAACTCAGGATACGCTGACTTCACCGATAAAAGTAGATCTTTTACAATGTTCAAATGGCAGACTGCTGAATTATCAAGCAATCCGTAGGCTTAAATACGACACACAAATGCCAACTTTGAAGTTTGGGATTTCAATTGAACCGCCTAACCACATGAAGATGAACTGA

Coding sequence (CDS)

ATGCAAATTTCGGCTTTATTTCGTCGCGAGCAATCCATAATAAGCATATGCGAAGTATGCTGCTACGACTGTCACAACCGCCGCAGCCAGACTCGCCGTCACCGGCAACGGCATCCTTCCGACGGTGAAGCGGCCGGCAAGTGTCGAAGCTCTGATCGCTCTCGGCAATGGAGTTCCAATGCTCACCTTGGGCATTTCTTGATTCTTACCCACCTCCTCTGTTTTGGATTCCTTCGCCTTCGCGTCTTCTTTTTCCTTTCCCTTTTCTGGGTTTTGATTTTCGATTTCTTTCTTCTCCGTTGGCTTGTGATCGGGGCTGCTGATCATCTTCTGATCCTGCGTCGTTTTAAGCTCGCCGGAATCTCCTTTTCTGCGGATTTTCCCTTATCGTTATCCGCTGGCTTCACTGAATTAGAATCGCCTTTCGGAAGCGCCTCCTCCGGTCCCTTAGTTTCAGCCTTCGCCTCCGGCCGGGCTGCGTTTTCCGGCGAGATATCTTCCCTCCCCTTTTCAGGTGTTGGTCGTTCTGGGTCTTTGGGAGGCGGAGCCGCCGCCGTCCCTTGTCCGATATTTTCCGGGGAGATATCTTCCCTTCCCTTTTCAGGGCTTGGTTGTTCTGGTTCTTTGGGAGCCGCCGCCGCCTCCTCCGCCGTCCCTTGTCCGCTAGTTTCTTTGGGCATTGTAATGAAGGATGAAAAAACCGAAGATTCGAATGGAACGCTCAAGATTTCCATCGTCTTAGAGTGCTTACCTGCTTCTGCGACTACTCCGCAGCCGTACGGCGATCGAAGTGTTTGGAAACTTTCTCCTTCTTCGAAATTTCAACGAGATAATTGGAAAATCGCAGGACGCACCAGAGAGCCGATCGGATCGGATCTGTTTTCCTCTTCTCTTCTTGTTCGCTACCTGTTCGGCCATACCAGAGAGCCGCCGGAAGATCTCCGATTTCAGGATCGTCGATTCATGATGCTTAAATTAAAGCTGTTGTGGAACTGGAAACAGTGTGGGTTCACCTCTCAGAAAAGGAAGATGGCCAAGATTTTTGTTCAGGTGGTTGCGATTCTGTGCTTGGGTTGGTGGTGGTGGGCAACGATGGTGGACGCGGAGTACTTGAAATACAAAGACCCTAACCAACCAGTTTCTGTTCGAGTTAAGGACCTTCTTGGCCGCATGACTCTAGAAGAGAAAATCGGTCAGATGGTTCAGATTGACAGGAGCGTTGCCAATGTTACAGTTATGAAAGATTATTTCATTGGAAGTGTGCTAAGTGGCGGTGGAAGTGTGCCGCTTCCAGATGCTAGTGCTCAGGATTGGATTAACATGATTAACGATTTCCAGAAGGGTTCTCTTTCTAGTAGATTGGGCATCCCAATGATGTATGGCATTGATGCTGTTCATGGCCATAACACTGTTTACAATGCTACCATATTTCCTCATAATGTTGGACTTGGAGCTACCAGAGACCCTGGCCTAGTTCGAAGGATTGGTGCTGCAACAGCACTTGAAGTTAGAGCGACGGGGATTTCTTTTGCCTTTGCTCCATGCATTGCGGTTTGTAGGGACCCAAGGTGGGGGCGGTGTTATGAAAGCTACAGCGAGGATCCAAAAATTGTGCAAGAAATGACCGAGATTATACCTGGTCTGCAGGGAGAGCCTCCTGCTAATTATCGGAAGGGGATTCCATATGTTGGTGGAACTAAGAAGGTTATCGCCTGTGCAAAGCACTTTGTAGGAGATGGTGGGACAACTAATGGCATCAATGAAAATAATACCGTTATTGACAAGCATGGACTGCTCAGCATTCACATGCCTGCCTATTTAGATTCGATCATCAAGGGCGTTTCAACGGTAATGGCTTCCTATTCAAGTTGGAATGGTGTAAAGATGCATGCAAACCATGAGCTGATTACTGGCTTCCTCAAGGGCACTCTTAAATTTAAGGGTTTTGTCATCTCAGATTGGGAGGGTCTGGACAGAATTACTTCTCCGCCACATGCTAA
TTACACCTACTCTGTCCAAGCTGCAATTTTAGCTGGCATTGACATGGTCATGGTTCCTTACAAGTATACAGAGTTCATTGATGATCTTACGTATCTAGTCAAGAGCAATGTCATTCCAATGGATCGCATTGATGATGCTGTTGGGAGAATTTTATCAGTCAAATTTACAATGGGTCTTTTTGAAAGCCCCTTAGGCGATTACAGCCTTGTCAATGAACTTGGGAGCCAGGCACATAGAGACTTGGCAAGAGAAGCTGTGAGGCAGTCGCTCGTACTGCTGAAGAATGGGAAAAATGATAGTCGCCCATTGCTGCCCCTTCCAAAGAAGGCCCCAAAGATCCTGGTTGCTGGCACTCATGCTGATAATTTAGGATATCAATGTGGTGGATGGACAATTGAATGGCAAGGGTTCAGTGGCAACAATGGTACAAGGGGAACTAGCATCCTCGCTGCTATCAAATCAATGGTCGATCCAAGCACAGAAGTTGTATTCCGTGAGGATCCTGACAGTGACTTTGTTAAGTCCAATGACTTTTCATACGCCATTGTTGTCGTTGGCGAAACCCCATATGCTGAGACTGAAGGGGATAGTACAACGCTTACCATGTTGGATCCTGGTCCAAGCATCATCAAAAACGTTTGTGATTCTGTAAAGTGCGTGGTGGTAGTCATTTCTGGAAGGCCAATAGTGATCGAACCATACATTTCATCAATTGATGCTCTTGTAGCAGCTTGGTTACCTGGCTCTGAAGGCCTAGGAGTCACTGATGTCCTTTATGGAGACTATGGGTTTAGTGGGAAGCTTCCAAGAACATGGTTCAAATCTGTAGATCAGCTACCAATGAATGTTGGAGATCCACACTATGATCCACTTTTTCCTTTTGGGTTCGGACTCACAACTGAATCGGTTAAGGACCTTGTCTCGAGGTCTACATCGCCCGGAATTCGTGAAACTCCATCCTTTCTTGCAATGATCGTTGCTACAATTGCCATTTGTATATTGCAGAAGTTACCAGATGACGATGACGATGAAGATGCAGATGATTCCCATTTATGGAACCTGACCAGCCATGAATCGGAAGCGATCTCAAGTCCAGTTCAGACCAACAAAAACAGACTTACCCCTCGTTTCTCTAAACATTCAGACAGAGAAGTCGCCACTTCTGGGGCTTTATCCTTCTCTATGAAGAAGCTAACTTTATGGGGTCGTTGCTCCGTTTCTGACTGCGATATCCTCATTTTGTCGTGCAGGGGGGAGAGGAATGGTGATTTCGCGAGGTTGGAGAAATTCACCATTGTGAGATCAAGATCGGAAACAATCATGAGGTTGGCTGAACCATCTAGTCGATCCATAGATGACGGGTCCCTTCTTCAAATTCAGATTAGGGCAACTCAGGATACGCTGACTTCACCGATAAAAGTAGATCTTTTACAATGTTCAAATGGCAGACTGCTGAATTATCAAGCAATCCGTAGGCTTAAATACGACACACAAATGCCAACTTTGAAGTTTGGGATTTCAATTGAACCGCCTAACCACATGAAGATGAACTGA

Protein sequence

MQISALFRREQSIISICEVCCYDCHNRRSQTRRHRQRHPSDGEAAGKCRSSDRSRQWSSNAHLGHFLILTHLLCFGFLRLRVFFFLSLFWVLIFDFFLLRWLVIGAADHLLILRRFKLAGISFSADFPLSLSAGFTELESPFGSASSGPLVSAFASGRAAFSGEISSLPFSGVGRSGSLGGGAAAVPCPIFSGEISSLPFSGLGCSGSLGAAAASSAVPCPLVSLGIVMKDEKTEDSNGTLKISIVLECLPASATTPQPYGDRSVWKLSPSSKFQRDNWKIAGRTREPIGSDLFSSSLLVRYLFGHTREPPEDLRFQDRRFMMLKLKLLWNWKQCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVTDVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLAMIVATIAICILQKLPDDDDDEDADDSHLWNLTSHESEAISSPVQTNKNRLTPRFSKHSDREVATSGALSFSMKKLTLWGRCSVSDCDILILSCRGERNGDFARLEKFTIVRSRSETIMRLAEPSSRSIDDGSLLQIQIRATQDTLTSPIKVDLLQCSNGRLLNYQAIRRLKYDTQMPTLKFGISIEPPNHMKMN
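
The relationship between the coding sequence and the protein sequence above can be checked on a short prefix. This is a minimal sketch that assumes the standard nuclear genetic code (the page does not say which translation table was used); `translate` is a hypothetical helper written for this illustration.

```python
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
cds_prefix = "ATGCAAATTTCGGCTTTATTTCGTCGCGAG"
print(translate(cds_prefix))
```

The first ten codons of the CDS translate to MQISALFRRE, which matches the start of the protein sequence shown above.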
Homology
BLAST of Sgr028903 vs. NCBI nr
Match: XP_022150694.1 (uncharacterized protein LOC111018764 isoform X1 [Momordica charantia] >XP_022150696.1 uncharacterized protein LOC111018764 isoform X1 [Momordica charantia])

HSP 1 Score: 1288.5 bits (3333), Expect = 0.0e+00
Identity = 624/681 (91.63%), Positives = 653/681 (95.89%), Query Frame = 0

Query: 323  MLKLKLLWNWKQCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSV 382
            MLKLKLLW W+QCGFTSQKRKMA+IFVQVVAILCLGWWWWAT VDAEYLKYKDP QPV+V
Sbjct: 1    MLKLKLLWKWRQCGFTSQKRKMAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAV 60

Query: 383  RVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDASAQDWINMI 442
            RV DLLGRMTLEEKIGQMVQIDRSVANVTVMKDY IGSVLSGGGSVPLPDA A+DW+NMI
Sbjct: 61   RVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMI 120

Query: 443  NDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALE 502
            N+FQKGSLSSRLGIPMMYGIDAVHGHN VYNAT+FPHNVGLGATR+P LVRRIGAATALE
Sbjct: 121  NEFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALE 180

Query: 503  VRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPY 562
            VRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPY
Sbjct: 181  VRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPY 240

Query: 563  VGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYS 622
            VGGTKKVIACAKHFVGDGGTTNGINENNTVID HGLLSIHMPAY DSIIKGVS+VM SYS
Sbjct: 241  VGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSVMISYS 300

Query: 623  SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDM 682
            SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPH+NYTYSVQAAILAGIDM
Sbjct: 301  SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDM 360

Query: 683  VMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGS 742
            VMVPYKYTEFIDDLT+LV+SNVIPMDRIDDA GRILSVKF+MGLFE+P+GDYSLVNELGS
Sbjct: 361  VMVPYKYTEFIDDLTHLVQSNVIPMDRIDDAAGRILSVKFSMGLFENPMGDYSLVNELGS 420

Query: 743  QAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQ 802
            Q HRDLAREAVRQSLVLLKNGKNDS+P+LPLPKKAPKILVAGTH DNLGYQCGGWTI WQ
Sbjct: 421  QEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHVDNLGYQCGGWTIAWQ 480

Query: 803  GFSGNNGTRGTSILAAIKSMVDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEG 862
            GFSGNNGTRGTSILAAIKS VDPSTEVVF EDPDS+FVKSNDFSYAIVVVGE PYAE+ G
Sbjct: 481  GFSGNNGTRGTSILAAIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVG 540

Query: 863  DSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVT 922
            DSTTLTMLDPGP+ IKNVCDSVKCVVVV+SGRPIV+EPYISSIDALVAAWLPG+EGLGVT
Sbjct: 541  DSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTEGLGVT 600

Query: 923  DVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGI 982
            D LYGD+GFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGL T+SVKDLV+RSTS GI
Sbjct: 601  DCLYGDHGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGI 660

Query: 983  RETPSFLA-MIVATIAICILQ 1003
            R T S +A +IVA +AICILQ
Sbjct: 661  RGTASVIATIIVAALAICILQ 681

BLAST of Sgr028903 vs. NCBI nr
Match: XP_022150698.1 (uncharacterized protein LOC111018764 isoform X3 [Momordica charantia])

HSP 1 Score: 1288.5 bits (3333), Expect = 0.0e+00
Identity = 624/681 (91.63%), Positives = 653/681 (95.89%), Query Frame = 0

Query: 323  MLKLKLLWNWKQCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSV 382
            MLKLKLLW W+QCGFTSQKRKMA+IFVQVVAILCLGWWWWAT VDAEYLKYKDP QPV+V
Sbjct: 1    MLKLKLLWKWRQCGFTSQKRKMAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAV 60

Query: 383  RVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDASAQDWINMI 442
            RV DLLGRMTLEEKIGQMVQIDRSVANVTVMKDY IGSVLSGGGSVPLPDA A+DW+NMI
Sbjct: 61   RVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMI 120

Query: 443  NDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALE 502
            N+FQKGSLSSRLGIPMMYGIDAVHGHN VYNAT+FPHNVGLGATR+P LVRRIGAATALE
Sbjct: 121  NEFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALE 180

Query: 503  VRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPY 562
            VRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPY
Sbjct: 181  VRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPY 240

Query: 563  VGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYS 622
            VGGTKKVIACAKHFVGDGGTTNGINENNTVID HGLLSIHMPAY DSIIKGVS+VM SYS
Sbjct: 241  VGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSVMISYS 300

Query: 623  SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDM 682
            SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPH+NYTYSVQAAILAGIDM
Sbjct: 301  SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDM 360

Query: 683  VMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGS 742
            VMVPYKYTEFIDDLT+LV+SNVIPMDRIDDA GRILSVKF+MGLFE+P+GDYSLVNELGS
Sbjct: 361  VMVPYKYTEFIDDLTHLVQSNVIPMDRIDDAAGRILSVKFSMGLFENPMGDYSLVNELGS 420

Query: 743  QAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQ 802
            Q HRDLAREAVRQSLVLLKNGKNDS+P+LPLPKKAPKILVAGTH DNLGYQCGGWTI WQ
Sbjct: 421  QEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHVDNLGYQCGGWTIAWQ 480

Query: 803  GFSGNNGTRGTSILAAIKSMVDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEG 862
            GFSGNNGTRGTSILAAIKS VDPSTEVVF EDPDS+FVKSNDFSYAIVVVGE PYAE+ G
Sbjct: 481  GFSGNNGTRGTSILAAIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVG 540

Query: 863  DSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVT 922
            DSTTLTMLDPGP+ IKNVCDSVKCVVVV+SGRPIV+EPYISSIDALVAAWLPG+EGLGVT
Sbjct: 541  DSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTEGLGVT 600

Query: 923  DVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGI 982
            D LYGD+GFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGL T+SVKDLV+RSTS GI
Sbjct: 601  DCLYGDHGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGI 660

Query: 983  RETPSFLA-MIVATIAICILQ 1003
            R T S +A +IVA +AICILQ
Sbjct: 661  RGTASVIATIIVAALAICILQ 681

BLAST of Sgr028903 vs. NCBI nr
Match: XP_022150697.1 (uncharacterized protein LOC111018764 isoform X2 [Momordica charantia])

HSP 1 Score: 1266.9 bits (3277), Expect = 0.0e+00
Identity = 614/670 (91.64%), Positives = 643/670 (95.97%), Query Frame = 0

Query: 334  QCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTL 393
            +CGFTSQKRKMA+IFVQVVAILCLGWWWWAT VDAEYLKYKDP QPV+VRV DLLGRMTL
Sbjct: 4    ECGFTSQKRKMAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTL 63

Query: 394  EEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDASAQDWINMINDFQKGSLSSR 453
            EEKIGQMVQIDRSVANVTVMKDY IGSVLSGGGSVPLPDA A+DW+NMIN+FQKGSLSSR
Sbjct: 64   EEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMINEFQKGSLSSR 123

Query: 454  LGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFA 513
            LGIPMMYGIDAVHGHN VYNAT+FPHNVGLGATR+P LVRRIGAATALEVRATGISFAFA
Sbjct: 124  LGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISFAFA 183

Query: 514  PCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACA 573
            PCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACA
Sbjct: 184  PCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACA 243

Query: 574  KHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANH 633
            KHFVGDGGTTNGINENNTVID HGLLSIHMPAY DSIIKGVS+VM SYSSWNGVKMHANH
Sbjct: 244  KHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSVMISYSSWNGVKMHANH 303

Query: 634  ELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFI 693
            ELITGFLKGTLKFKGFVISDWEGLDRITSPPH+NYTYSVQAAILAGIDMVMVPYKYTEFI
Sbjct: 304  ELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFI 363

Query: 694  DDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQAHRDLAREAV 753
            DDLT+LV+SNVIPMDRIDDA GRILSVKF+MGLFE+P+GDYSLVNELGSQ HRDLAREAV
Sbjct: 364  DDLTHLVQSNVIPMDRIDDAAGRILSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAV 423

Query: 754  RQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGT 813
            RQSLVLLKNGKNDS+P+LPLPKKAPKILVAGTH DNLGYQCGGWTI WQGFSGNNGTRGT
Sbjct: 424  RQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHVDNLGYQCGGWTIAWQGFSGNNGTRGT 483

Query: 814  SILAAIKSMVDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPG 873
            SILAAIKS VDPSTEVVF EDPDS+FVKSNDFSYAIVVVGE PYAE+ GDSTTLTMLDPG
Sbjct: 484  SILAAIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPG 543

Query: 874  PSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVTDVLYGDYGFSG 933
            P+ IKNVCDSVKCVVVV+SGRPIV+EPYISSIDALVAAWLPG+EGLGVTD LYGD+GFSG
Sbjct: 544  PNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTEGLGVTDCLYGDHGFSG 603

Query: 934  KLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLA-MI 993
            KLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGL T+SVKDLV+RSTS GIR T S +A +I
Sbjct: 604  KLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATII 663

Query: 994  VATIAICILQ 1003
            VA +AICILQ
Sbjct: 664  VAALAICILQ 673

BLAST of Sgr028903 vs. NCBI nr
Match: XP_008446716.1 (PREDICTED: beta-glucosidase BoGH3B isoform X1 [Cucumis melo])

HSP 1 Score: 1254.6 bits (3245), Expect = 0.0e+00
Identity = 601/681 (88.25%), Positives = 639/681 (93.83%), Query Frame = 0

Query: 321  FMMLKLKLLWNWKQCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPV 380
            FM+L LKLLW WK+CG  +Q +KMAKIFVQVV ILCLGW WWATMVDAE LKYKDP QPV
Sbjct: 97   FMVLNLKLLWKWKECGLNTQAKKMAKIFVQVVVILCLGWLWWATMVDAENLKYKDPKQPV 156

Query: 381  SVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDASAQDWIN 440
             VRVKDLLGRMTLEEKIGQMVQIDRSVAN TVMKDYFIGS+LSGGGSVPLPDA A+DW++
Sbjct: 157  GVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSILSGGGSVPLPDARAEDWVD 216

Query: 441  MINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATA 500
            MINDFQKGSLSSRLGIPM YGIDAVHGHN VYNAT+FPHNVGLGATR+P L RRIGAATA
Sbjct: 217  MINDFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLARRIGAATA 276

Query: 501  LEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGI 560
            LEVRATGIS+ FAPC+AVCRDPRWGRCYESYSEDPKIV+EMTEII GLQGEPPANYRKG 
Sbjct: 277  LEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKIVKEMTEIIIGLQGEPPANYRKGT 336

Query: 561  PYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMAS 620
            PYVGGTKKVIACAKHFVGDGGTT+GINENNTVI++HGLLSIHMPAYLDSIIKGVS+VMAS
Sbjct: 337  PYVGGTKKVIACAKHFVGDGGTTHGINENNTVINRHGLLSIHMPAYLDSIIKGVSSVMAS 396

Query: 621  YSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGI 680
            YSSWNGVKMHAN ELIT FLKG LKFKGFVISDWEGLDRITS PH+NYTYSVQAAILAGI
Sbjct: 397  YSSWNGVKMHANRELITDFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSVQAAILAGI 456

Query: 681  DMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNEL 740
            DMVM+PYKY EFIDDL +LVKSNVIPMDRIDDAVGRIL+VKFTMGLFESP+ DYSLVNEL
Sbjct: 457  DMVMIPYKYAEFIDDLKFLVKSNVIPMDRIDDAVGRILTVKFTMGLFESPMADYSLVNEL 516

Query: 741  GSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIE 800
            GSQAHRDLAR+AVRQSLVLLKNGKNDS+PLLPL KK+PKILVAGTHADNLGYQCGGWTI 
Sbjct: 517  GSQAHRDLARDAVRQSLVLLKNGKNDSKPLLPLSKKSPKILVAGTHADNLGYQCGGWTIA 576

Query: 801  WQGFSGNNGTRGTSILAAIKSMVDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAET 860
            WQGFSGNNGTRGT+ILAAIKS VDPSTEVVFREDPDSDFVKSNDFSYAIVV+GE PYAET
Sbjct: 577  WQGFSGNNGTRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSNDFSYAIVVIGEAPYAET 636

Query: 861  EGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLG 920
             GDSTTLTMLDPGP+IIKNVCD V+CVV++ISGRPIVIEPYISSIDALVAAWLPG+EG G
Sbjct: 637  GGDSTTLTMLDPGPNIIKNVCDHVECVVILISGRPIVIEPYISSIDALVAAWLPGTEGQG 696

Query: 921  VTDVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSP 980
            VTD LYGD+GFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTT SVKD+++RSTS 
Sbjct: 697  VTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTGSVKDIIARSTSA 756

Query: 981  GIRETPSFLAMIVATIAICIL 1002
            GIR TPS +A IV  I +CIL
Sbjct: 757  GIRGTPSLIASIVVAITLCIL 777

BLAST of Sgr028903 vs. NCBI nr
Match: XP_038892436.1 (beta-glucosidase BoGH3B-like isoform X1 [Benincasa hispida])

HSP 1 Score: 1250.7 bits (3235), Expect = 0.0e+00
Identity = 606/677 (89.51%), Positives = 637/677 (94.09%), Query Frame = 0

Query: 322 MMLKLKLLWNWKQCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVS 381
           MML LKLLW WK+CG  +Q +KMAKIFVQVV ILCLGW WW TMVDAE LKYKDP Q V+
Sbjct: 1   MMLNLKLLWKWKECGLNTQGKKMAKIFVQVVVILCLGWLWWVTMVDAENLKYKDPKQSVA 60

Query: 382 VRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDASAQDWINM 441
           VRVKDLLGRMT+EEKIGQM+QIDRSVAN TVMKDYFIGSVLSGGGSVPLPDA A+DW+NM
Sbjct: 61  VRVKDLLGRMTVEEKIGQMIQIDRSVANATVMKDYFIGSVLSGGGSVPLPDARAEDWVNM 120

Query: 442 INDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATAL 501
           INDFQKGSLSSRLGIPM YGIDAVHGHN VYNAT+FPHNVGLGATR+P L RRIGAATAL
Sbjct: 121 INDFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLARRIGAATAL 180

Query: 502 EVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIP 561
           EVRATGIS+ FAPC+AVCRDPRWGRCYESYSEDPKIV+EMTEII GLQGEPPANYRKGIP
Sbjct: 181 EVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKIVKEMTEIIIGLQGEPPANYRKGIP 240

Query: 562 YVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASY 621
           YVGGTKKVIACAKHFVGDGGTT+GINENNTVI++HGLLSIHMPAYLDSIIKGVS+VMASY
Sbjct: 241 YVGGTKKVIACAKHFVGDGGTTHGINENNTVINRHGLLSIHMPAYLDSIIKGVSSVMASY 300

Query: 622 SSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGID 681
           SSWNGVKMHAN ELIT FLKGTLK+KGFVISDWEGLDRITS PH+NYTYS+QAAILAGID
Sbjct: 301 SSWNGVKMHANRELITDFLKGTLKYKGFVISDWEGLDRITSTPHSNYTYSIQAAILAGID 360

Query: 682 MVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELG 741
           MVM+PYKY EFIDDLT+LVKSNVIPMDRIDDAVGRILSVKFTMGLFESPL DYSLVNELG
Sbjct: 361 MVMIPYKYAEFIDDLTFLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLSDYSLVNELG 420

Query: 742 SQAHRDLAREAVRQSLVLLKNGKNDSRP-LLPLPKKAPKILVAGTHADNLGYQCGGWTIE 801
           SQAHRDLAR+AVRQSLVLLKNGKNDS P LLPL KKAPKILVAGTHADNLGYQCGGWTI 
Sbjct: 421 SQAHRDLARDAVRQSLVLLKNGKNDSNPLLLPLSKKAPKILVAGTHADNLGYQCGGWTIA 480

Query: 802 WQGFSGNNGTRGTSILAAIKSMVDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAET 861
           WQGFSGNNGTRGT+ILAAIKS VDPSTEVVFREDPDSDFVKSNDFSYAIVV+GE PYAET
Sbjct: 481 WQGFSGNNGTRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSNDFSYAIVVIGEAPYAET 540

Query: 862 EGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLG 921
            GDSTTLTMLDPGPSIIKNVC SVKCVVVVISGRPIV+EPYISS+DALVAAWLPG+EGLG
Sbjct: 541 GGDSTTLTMLDPGPSIIKNVCGSVKCVVVVISGRPIVMEPYISSVDALVAAWLPGTEGLG 600

Query: 922 VTDVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSP 981
           VTD LYGD+GFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTT SVKD V+RSTS 
Sbjct: 601 VTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTGSVKDFVARSTSA 660

Query: 982 GIRETPSFLAMIVATIA 998
           GI  TPS +A IVATIA
Sbjct: 661 GICGTPSLIATIVATIA 677

BLAST of Sgr028903 vs. ExPASy Swiss-Prot
Match: A7LXU3 (Beta-glucosidase BoGH3B OS=Bacteroides ovatus (strain ATCC 8483 / DSM 1896 / JCM 5824 / NCTC 11153) OX=411476 GN=BACOVA_02659 PE=1 SV=1)

HSP 1 Score: 297.7 bits (761), Expect = 5.7e-79
Identity = 205/653 (31.39%), Positives = 331/653 (50.69%), Query Frame = 0

Query: 376 PNQP-VSVRVKDLLGRMTLEEKIGQMVQIDRSVAN-----------------VTVMKDYF 435
           P  P +   +++ L +MTLE+KIGQM +I   V +                  TV+  Y 
Sbjct: 30  PTDPAIETHIREWLQKMTLEQKIGQMCEITIDVVSDLETSRKKGFCLSEAMLDTVIGKYK 89

Query: 436 IGSVLSGGGSVPLPDASAQD-WINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATI 495
           +GS+L    +VPL  A  ++ W   I   Q+ S+   +GIP +YG+D +HG     + T+
Sbjct: 90  VGSLL----NVPLGVAQKKEKWAEAIKQIQEKSM-KEIGIPCIYGVDQIHGTTYTLDGTM 149

Query: 496 FPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPK 555
           FP  + +GAT +  L RR    +A E +A  I + FAP + + RDPRW R +E+Y ED  
Sbjct: 150 FPQGINMGATFNRELTRRGAKISAYETKAGCIPWTFAPVVDLGRDPRWARMWENYGEDCY 209

Query: 556 IVQEM-TEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDK 615
           +  EM    + G QGE P           G   V AC KH++G G   +G +   + I +
Sbjct: 210 VNAEMGVSAVKGFQGEDPNRI--------GEYNVAACMKHYMGYGVPVSGKDRTPSSISR 269

Query: 616 HGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWE 675
             +   H   +L ++ +G  +VM +    NG+  HAN EL+T +LK  L + G +++DW 
Sbjct: 270 SDMREKHFAPFLAAVRQGALSVMVNSGVDNGLPFHANRELLTEWLKEDLNWDGLIVTDWA 329

Query: 676 GLDRITSPPH--ANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDA 735
            ++ + +  H  A    +V+  I AGIDM MVPY+   F D L  LV+   + M+RIDDA
Sbjct: 330 DINNLCTRDHIAATKKEAVKIVINAGIDMSMVPYE-VSFCDYLKELVEEGEVSMERIDDA 389

Query: 736 VGRILSVKFTMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPL 795
           V R+L +K+ +GLF+ P  D    ++ GS+    +A +A  +S VLLKN  N    +LP+
Sbjct: 390 VARVLRLKYRLGLFDHPYWDIKKYDKFGSKEFAAVALQAAEESEVLLKNDGN----ILPI 449

Query: 796 PKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRG-TSILAAI-------KSMVDP 855
             K  KIL+ G +A+++    GGW+  WQG   +   +   +I  A+         + +P
Sbjct: 450 -AKGKKILLTGPNANSMRCLNGGWSYSWQGHVADEYAQAYHTIYEALCEKYGKENIIYEP 509

Query: 856 S-TEVVFRED-------PDSD--FVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPS 915
             T   ++ D       P+++     +      I  +GE  Y ET G+ T LT+ +   +
Sbjct: 510 GVTYASYKNDNWWEENKPETEKPVAAAAQADIIITCIGENSYCETPGNLTDLTLSENQRN 569

Query: 916 IIKNVCDSVKCVVVVIS-GRPIVIEPYISSIDALVAAWLPGS-EGLGVTDVLYGDYGFSG 972
           ++K +  + K +V+V++ GRP +I   +    A+V   LP +  G  + ++L GD  FSG
Sbjct: 570 LVKALAATGKPIVLVLNQGRPRIINDIVPLAKAVVNIMLPSNYGGDALANLLAGDANFSG 629

BLAST of Sgr028903 vs. ExPASy Swiss-Prot
Match: Q23892 (Lysosomal beta glucosidase OS=Dictyostelium discoideum OX=44689 GN=gluA PE=1 SV=2)

HSP 1 Score: 278.9 bits (712), Expect = 2.7e-73
Identity = 201/656 (30.64%), Positives = 332/656 (50.61%), Query Frame = 0

Query: 384 VKDLLGRMTLEEKIGQMVQID----RSVANVTV--------MKDYFIGSVL----SGGGS 443
           V +L+ +M++ EKIGQM Q+D     S   +T+         K Y+IGS L    SGG +
Sbjct: 80  VDNLMSKMSITEKIGQMTQLDITTLTSPNTITINETTLAYYAKTYYIGSYLNSPVSGGLA 139

Query: 444 VPLPDASAQDWINMINDFQKGSL-SSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGAT 503
             +   ++  W++MIN  Q   +  S   IPM+YG+D+VHG N V+ AT+FPHN GL AT
Sbjct: 140 GDIHHINSSVWLDMINTIQTIVIEGSPNKIPMIYGLDSVHGANYVHKATLFPHNTGLAAT 199

Query: 504 RDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEM-TEII 563
            +          T+ +  A GI + FAP + +   P W R YE++ EDP +   M    +
Sbjct: 200 FNIEHATTAAQITSKDTVAVGIPWVFAPVLGIGVQPLWSRIYETFGEDPYVASMMGAAAV 259

Query: 564 PGLQG-----EPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLS 623
            G QG     + P N              +  AKH+ G    T+G +     I +  L  
Sbjct: 260 RGFQGGNNSFDGPIN----------APSAVCTAKHYFGYSDPTSGKDRTAAWIPERMLRR 319

Query: 624 IHMPAYLDSII-KGVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDR 683
             +P++ ++I   G  T+M +    NGV MH +++ +T  L+G L+F+G  ++DW+ +++
Sbjct: 320 YFLPSFAEAITGAGAGTIMINSGEVNGVPMHTSYKYLTEVLRGELQFEGVAVTDWQDIEK 379

Query: 684 ITSPPH--ANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRI 743
           +    H   +   ++  A+ AGIDM MVP   + F   L  +V +  +P  R+D +V RI
Sbjct: 380 LVYFHHTAGSAEEAILQALDAGIDMSMVPLDLS-FPIILAEMVAAGTVPESRLDLSVRRI 439

Query: 744 LSVKFTMGLFESPL--GDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPK 803
           L++K+ +GLF +P    + ++V+ +G    R+ A     +S+ LL+N  N    +LPL  
Sbjct: 440 LNLKYALGLFSNPYPNPNAAIVDTIGQVQDREAAAATAEESITLLQNKNN----ILPLNT 499

Query: 804 KAPK-ILVAGTHADNLGYQCGGWTIEWQG-FSGNNGTRGTSILAAIKSMVDPSTEVVFRE 863
              K +L+ G  AD++    GGW++ WQG +  +    GTSIL  ++ + + + +   + 
Sbjct: 500 NTIKNVLLTGPSADSIRNLNGGWSVHWQGAYEDSEFPFGTSILTGLREITNDTADFNIQY 559

Query: 864 ---------------DPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIK 923
                          D   +  +S+D    +VV+GE P AET GD   L+M      +++
Sbjct: 560 TIGHEIGVPTNQTSIDEAVELAQSSD--VVVVVIGELPEAETPGDIYDLSMDPNEVLLLQ 619

Query: 924 NVCDSVKCVV-VVISGRPIVIEP-YISSIDALVAAWLPGSE-GLGVTDVLYGDYGFSGKL 981
            + D+ K VV +++  RP ++ P  + S  A++ A+LPGSE G  + ++L G+   SG+L
Sbjct: 620 QLVDTGKPVVLILVEARPRILPPDLVYSCAAVLMAYLPGSEGGKPIANILMGNVNPSGRL 679

BLAST of Sgr028903 vs. ExPASy Swiss-Prot
Match: Q56078 (Periplasmic beta-glucosidase OS=Salmonella typhimurium (strain LT2 / SGSC1412 / ATCC 700720) OX=99287 GN=bglX PE=3 SV=2)

HSP 1 Score: 232.3 bits (591), Expect = 2.9e-59
Identity = 193/663 (29.11%), Positives = 305/663 (46.00%), Query Frame = 0

Query: 384 VKDLLGRMTLEEKIGQMVQIDRSVANV-----TVMKDYFIGSVLSGGGSVPLPDASAQDW 443
           V DLL +MT++EKIGQ+  I     N       ++KD  +G++ +          + QD 
Sbjct: 38  VTDLLKKMTVDEKIGQLRLISVGPDNPKEAIREMIKDGQVGAIFN--------TVTRQD- 97

Query: 444 INMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAA 503
           I  + D  +    SRL IP+ +  D VHG  TV     FP ++GL ++ +   VR +G  
Sbjct: 98  IRQMQD--QVMALSRLKIPLFFAYDVVHGQRTV-----FPISLGLASSFNLDAVRTVGRV 157

Query: 504 TALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTE-IIPGLQGEPPANYR 563
           +A E    G++  +AP + V RDPRWGR  E + ED  +   M E ++  +QG+ PA+  
Sbjct: 158 SAYEAADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQGKSPAD-- 217

Query: 564 KGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTV 623
                      V+   KHF   G    G   N   +    L + +MP Y   +  G   V
Sbjct: 218 --------RYSVMTSVKHFAAYGAVEGGKEYNTVDMSSQRLFNDYMPPYKAGLDAGSGAV 277

Query: 624 MASYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGL-DRITSPPHANYTYSVQAAI 683
           M + +S NG    ++  L+   L+    FKG  +SD   + + I     A+   +V+ A+
Sbjct: 278 MVALNSLNGTPATSDSWLLKDVLRDEWGFKGITVSDHGAIKELIKHGTAADPEDAVRVAL 337

Query: 684 LAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYS- 743
            AG+DM M    Y+++   L  L+KS  + M  +DDA   +L+VK+ MGLF  P      
Sbjct: 338 KAGVDMSMADEYYSKY---LPGLIKSGKVTMAELDDATRHVLNVKYDMGLFNDPYSHLGP 397

Query: 744 -----LVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNL 803
                +     S+ HR  ARE  R+S+VLLKN + ++ PL    KK+  I V G  AD+ 
Sbjct: 398 KESDPVDTNAESRLHRKEAREVARESVVLLKN-RLETLPL----KKSGTIAVVGPLADSQ 457

Query: 804 GYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVFRE----------------- 863
               G W+      +     +  ++LA I++ V    ++++ +                 
Sbjct: 458 RDVMGSWS------AAGVANQSVTVLAGIQNAVGDGAKILYAKGANITNDKGIVDFLNLY 517

Query: 864 ------DP-------DSDFVKSNDFSYAIVVVGETPYAETEGDS-TTLTMLDPGPSIIKN 923
                 DP       D     +      + VVGE+     E  S T +T+      +I  
Sbjct: 518 EEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITA 577

Query: 924 VCDSVK-CVVVVISGRPIVIEPYISSIDALVAAWLPGSE-GLGVTDVLYGDYGFSGKLPR 978
           +  + K  V+V+++GRP+ +       DA++  W  G+E G  + DVL+GDY  SGKLP 
Sbjct: 578 LKATGKPLVLVLMNGRPLALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPI 637

BLAST of Sgr028903 vs. ExPASy Swiss-Prot
Match: P33363 (Periplasmic beta-glucosidase OS=Escherichia coli (strain K12) OX=83333 GN=bglX PE=3 SV=2)

HSP 1 Score: 223.0 bits (567), Expect = 1.8e-56
Identity = 192/682 (28.15%), Positives = 312/682 (45.75%), Query Frame = 0

Query: 384 VKDLLGRMTLEEKIGQMVQIDRSVANV-----TVMKDYFIGSVLSGGGSVPLPDASA-QD 443
           V +LL +MT++EKIGQ+  I     N       ++KD  +G++ +   +V   D  A QD
Sbjct: 38  VTELLKKMTVDEKIGQLRLISVGPDNPKEAIREMIKDGQVGAIFN---TVTRQDIRAMQD 97

Query: 444 WINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGA 503
            +  +         SRL IP+ +  D +HG  TV     FP ++GL ++ +   V+ +G 
Sbjct: 98  QVMEL---------SRLKIPLFFAYDVLHGQRTV-----FPISLGLASSFNLDAVKTVGR 157

Query: 504 ATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTE-IIPGLQGEPPANY 563
            +A E    G++  +AP + V RDPRWGR  E + ED  +   M + ++  +QG+ PA+ 
Sbjct: 158 VSAYEAADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQGKSPAD- 217

Query: 564 RKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVST 623
                       V+   KHF   G    G   N   +    L + +MP Y   +  G   
Sbjct: 218 ---------RYSVMTSVKHFAAYGAVEGGKEYNTVDMSPQRLFNDYMPPYKAGLDAGSGA 277

Query: 624 VMASYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGL-DRITSPPHANYTYSVQAA 683
           VM + +S NG    ++  L+   L+    FKG  +SD   + + I     A+   +V+ A
Sbjct: 278 VMVALNSLNGTPATSDSWLLKDVLRDQWGFKGITVSDHGAIKELIKHGTAADPEDAVRVA 337

Query: 684 ILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYS 743
           + +GI+M M    Y+++   L  L+KS  + M  +DDA   +L+VK+ MGLF  P     
Sbjct: 338 LKSGINMSMSDEYYSKY---LPGLIKSGKVTMAELDDAARHVLNVKYDMGLFNDPYSHLG 397

Query: 744 ------LVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADN 803
                 +     S+ HR  ARE  R+SLVLLKN + ++ PL    KK+  I V G  AD+
Sbjct: 398 PKESDPVDTNAESRLHRKEAREVARESLVLLKN-RLETLPL----KKSATIAVVGPLADS 457

Query: 804 LGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVF------------------ 863
                G W+      +     +  ++L  IK+ V  + +V++                  
Sbjct: 458 KRDVMGSWS------AAGVADQSVTVLTGIKNAVGENGKVLYAKGANVTSDKGIIDFLNQ 517

Query: 864 -----REDP-------DSDFVKSNDFSYAIVVVGETPYAETEGDS-TTLTMLDPGPSIIK 923
                + DP       D     +      + VVGE      E  S T +T+      +I 
Sbjct: 518 YEEAVKVDPRSPQEMIDEAVQTAKQSDVVVAVVGEAQGMAHEASSRTDITIPQSQRDLIA 577

Query: 924 NVCDSVK-CVVVVISGRPIVIEPYISSIDALVAAWLPGSE-GLGVTDVLYGDYGFSGKLP 983
            +  + K  V+V+++GRP+ +       DA++  W  G+E G  + DVL+GDY  SGKLP
Sbjct: 578 ALKATGKPLVLVLMNGRPLALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNPSGKLP 637

Query: 984 RTWFKSVDQLP-----MNVGDP------------HYD----PLFPFGFGL--TTESVKDL 996
            ++ +SV Q+P     +N G P            ++D     L+PFG+GL  TT +V D+
Sbjct: 638 MSFPRSVGQIPVYYSHLNTGRPYNADKPNKYTSRYFDEANGALYPFGYGLSYTTFTVSDV 676

BLAST of Sgr028903 vs. ExPASy Swiss-Prot
Match: T2KMH0 (Beta-xylosidase OS=Formosa agariphila (strain DSM 15362 / KCTC 12365 / LMG 23005 / KMM 3901) OX=1347342 GN=BN863_22130 PE=1 SV=1)

HSP 1 Score: 210.7 bits (535), Expect = 9.1e-53
Identity = 187/638 (29.31%), Positives = 292/638 (45.77%), Query Frame = 0

Query: 377 NQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDASAQ 436
           ++ +  +V  L+ +MTL+EKI +M Q                             DA A 
Sbjct: 30  DEEIDKKVATLISQMTLDEKIAEMTQ-----------------------------DAPAN 89

Query: 437 DWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVY----NATIFPHNVGLGATRDPGLV 496
           +               RLGIP M   +A+HG   V     N T++P  V   +T +P L+
Sbjct: 90  E---------------RLGIPSMKYGEALHGLWLVLDYYGNTTVYPQAVAAASTWEPELI 149

Query: 497 RRIGAATALEVRATGISFAFAPCIAV-CRDPRWGRCYESYSEDPKIVQEM-TEIIPGLQG 556
           +++ + TA E RA G++  ++P + V   D R+GR  ESY EDP +V  M    I GLQG
Sbjct: 150 KKMASQTAREARALGVTHCYSPNLDVYAGDARYGRVEESYGEDPYLVSRMGVAFIEGLQG 209

Query: 557 EPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSI 616
                + +          VIA AKHFVG      GIN   + + +  L  +++P +  ++
Sbjct: 210 TGEEQFDE--------NHVIATAKHFVGYPENRRGINGGFSDMSERRLREVYLPPFEAAV 269

Query: 617 IK-GVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYT 676
            + GV +VM  +  +NGV  H N  L+   L+  L F GF++SD   + R+ +  H    
Sbjct: 270 KEAGVGSVMPGHQDFNGVPCHMNTWLLKDILRDELGFDGFIVSDNNDVGRLET-MHFIAE 329

Query: 677 YSVQAAIL---AGIDMVMVPYKYTEFIDDLTYLVKSNVIP----MDRIDDAVGRILSVKF 736
              +AAIL   AG+DM +V  K  E     T ++K  ++     M  ID A  RIL+ K+
Sbjct: 330 NRTEAAILGLKAGVDMDLVIGKNVELATYHTNILKDTILKNPALMKYIDQATSRILTAKY 389

Query: 737 TMGLFES-PLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLP-KKAPKI 796
            +GLF++ P    +   E G+  HR+ A E   +S+++LKN  N    LLPL   K   +
Sbjct: 390 KLGLFDAKPKKIDTETVETGTDEHREFALELAEKSIIMLKNDNN----LLPLDVSKIKSL 449

Query: 797 LVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVFREDPDSDFV 856
            V G +A     + G + +   G+SG       S+L  +K  V    ++ + +  D D  
Sbjct: 450 AVIGPNAHEERPKKGTYKL-LGGYSG-LPPYYVSVLDGLKKKVGEHVKINYAKGCDIDSF 509

Query: 857 KSNDFSYAI----------VVVGETPYAETE-GDSTTLTMLDPGPSIIKNVCDSVKCVVV 916
               F  AI          +VVG +     E GD   L +      +++ +  + K V+V
Sbjct: 510 SKEGFPEAISAAKNSDAVVLVVGSSHKTCGEGGDRADLDLYGVQKELVEAIHKTGKPVIV 569

Query: 917 V-ISGRPIVIEPYISSIDALVAAWLPGSE-GLGVTDVLYGDYGFSGKLPRTWFKSVDQLP 972
           V I+GRP+ I     +I +++  W  G   G  V +V++GD    GKL  ++ + V Q+P
Sbjct: 570 VLINGRPLSINYIAENIPSILETWYGGMRAGDAVANVIFGDVNPGGKLTMSFPRDVGQVP 608

BLAST of Sgr028903 vs. ExPASy TrEMBL
Match: A0A6J1DCA3 (uncharacterized protein LOC111018764 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111018764 PE=3 SV=1)

HSP 1 Score: 1288.5 bits (3333), Expect = 0.0e+00
Identity = 624/681 (91.63%), Positives = 653/681 (95.89%), Query Frame = 0

Query: 323  MLKLKLLWNWKQCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSV 382
            MLKLKLLW W+QCGFTSQKRKMA+IFVQVVAILCLGWWWWAT VDAEYLKYKDP QPV+V
Sbjct: 1    MLKLKLLWKWRQCGFTSQKRKMAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAV 60

Query: 383  RVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDASAQDWINMI 442
            RV DLLGRMTLEEKIGQMVQIDRSVANVTVMKDY IGSVLSGGGSVPLPDA A+DW+NMI
Sbjct: 61   RVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMI 120

Query: 443  NDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALE 502
            N+FQKGSLSSRLGIPMMYGIDAVHGHN VYNAT+FPHNVGLGATR+P LVRRIGAATALE
Sbjct: 121  NEFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALE 180

Query: 503  VRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPY 562
            VRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPY
Sbjct: 181  VRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPY 240

Query: 563  VGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYS 622
            VGGTKKVIACAKHFVGDGGTTNGINENNTVID HGLLSIHMPAY DSIIKGVS+VM SYS
Sbjct: 241  VGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSVMISYS 300

Query: 623  SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDM 682
            SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPH+NYTYSVQAAILAGIDM
Sbjct: 301  SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDM 360

Query: 683  VMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGS 742
            VMVPYKYTEFIDDLT+LV+SNVIPMDRIDDA GRILSVKF+MGLFE+P+GDYSLVNELGS
Sbjct: 361  VMVPYKYTEFIDDLTHLVQSNVIPMDRIDDAAGRILSVKFSMGLFENPMGDYSLVNELGS 420

Query: 743  QAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQ 802
            Q HRDLAREAVRQSLVLLKNGKNDS+P+LPLPKKAPKILVAGTH DNLGYQCGGWTI WQ
Sbjct: 421  QEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHVDNLGYQCGGWTIAWQ 480

Query: 803  GFSGNNGTRGTSILAAIKSMVDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEG 862
            GFSGNNGTRGTSILAAIKS VDPSTEVVF EDPDS+FVKSNDFSYAIVVVGE PYAE+ G
Sbjct: 481  GFSGNNGTRGTSILAAIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVG 540

Query: 863  DSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVT 922
            DSTTLTMLDPGP+ IKNVCDSVKCVVVV+SGRPIV+EPYISSIDALVAAWLPG+EGLGVT
Sbjct: 541  DSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTEGLGVT 600

Query: 923  DVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGI 982
            D LYGD+GFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGL T+SVKDLV+RSTS GI
Sbjct: 601  DCLYGDHGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGI 660

Query: 983  RETPSFLA-MIVATIAICILQ 1003
            R T S +A +IVA +AICILQ
Sbjct: 661  RGTASVIATIIVAALAICILQ 681

BLAST of Sgr028903 vs. ExPASy TrEMBL
Match: A0A6J1DA47 (uncharacterized protein LOC111018764 isoform X3 OS=Momordica charantia OX=3673 GN=LOC111018764 PE=3 SV=1)

HSP 1 Score: 1288.5 bits (3333), Expect = 0.0e+00
Identity = 624/681 (91.63%), Positives = 653/681 (95.89%), Query Frame = 0

Query: 323  MLKLKLLWNWKQCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSV 382
            MLKLKLLW W+QCGFTSQKRKMA+IFVQVVAILCLGWWWWAT VDAEYLKYKDP QPV+V
Sbjct: 1    MLKLKLLWKWRQCGFTSQKRKMAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAV 60

Query: 383  RVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDASAQDWINMI 442
            RV DLLGRMTLEEKIGQMVQIDRSVANVTVMKDY IGSVLSGGGSVPLPDA A+DW+NMI
Sbjct: 61   RVMDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMI 120

Query: 443  NDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALE 502
            N+FQKGSLSSRLGIPMMYGIDAVHGHN VYNAT+FPHNVGLGATR+P LVRRIGAATALE
Sbjct: 121  NEFQKGSLSSRLGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALE 180

Query: 503  VRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPY 562
            VRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPY
Sbjct: 181  VRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPY 240

Query: 563  VGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYS 622
            VGGTKKVIACAKHFVGDGGTTNGINENNTVID HGLLSIHMPAY DSIIKGVS+VM SYS
Sbjct: 241  VGGTKKVIACAKHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSVMISYS 300

Query: 623  SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDM 682
            SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPH+NYTYSVQAAILAGIDM
Sbjct: 301  SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDM 360

Query: 683  VMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGS 742
            VMVPYKYTEFIDDLT+LV+SNVIPMDRIDDA GRILSVKF+MGLFE+P+GDYSLVNELGS
Sbjct: 361  VMVPYKYTEFIDDLTHLVQSNVIPMDRIDDAAGRILSVKFSMGLFENPMGDYSLVNELGS 420

Query: 743  QAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQ 802
            Q HRDLAREAVRQSLVLLKNGKNDS+P+LPLPKKAPKILVAGTH DNLGYQCGGWTI WQ
Sbjct: 421  QEHRDLAREAVRQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHVDNLGYQCGGWTIAWQ 480

Query: 803  GFSGNNGTRGTSILAAIKSMVDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEG 862
            GFSGNNGTRGTSILAAIKS VDPSTEVVF EDPDS+FVKSNDFSYAIVVVGE PYAE+ G
Sbjct: 481  GFSGNNGTRGTSILAAIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVG 540

Query: 863  DSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVT 922
            DSTTLTMLDPGP+ IKNVCDSVKCVVVV+SGRPIV+EPYISSIDALVAAWLPG+EGLGVT
Sbjct: 541  DSTTLTMLDPGPNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTEGLGVT 600

Query: 923  DVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGI 982
            D LYGD+GFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGL T+SVKDLV+RSTS GI
Sbjct: 601  DCLYGDHGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGI 660

Query: 983  RETPSFLA-MIVATIAICILQ 1003
            R T S +A +IVA +AICILQ
Sbjct: 661  RGTASVIATIIVAALAICILQ 681

BLAST of Sgr028903 vs. ExPASy TrEMBL
Match: A0A6J1DBF5 (uncharacterized protein LOC111018764 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111018764 PE=3 SV=1)

HSP 1 Score: 1266.9 bits (3277), Expect = 0.0e+00
Identity = 614/670 (91.64%), Positives = 643/670 (95.97%), Query Frame = 0

Query: 334  QCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTL 393
            +CGFTSQKRKMA+IFVQVVAILCLGWWWWAT VDAEYLKYKDP QPV+VRV DLLGRMTL
Sbjct: 4    ECGFTSQKRKMAQIFVQVVAILCLGWWWWATTVDAEYLKYKDPKQPVAVRVMDLLGRMTL 63

Query: 394  EEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDASAQDWINMINDFQKGSLSSR 453
            EEKIGQMVQIDRSVANVTVMKDY IGSVLSGGGSVPLPDA A+DW+NMIN+FQKGSLSSR
Sbjct: 64   EEKIGQMVQIDRSVANVTVMKDYSIGSVLSGGGSVPLPDARAEDWVNMINEFQKGSLSSR 123

Query: 454  LGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFA 513
            LGIPMMYGIDAVHGHN VYNAT+FPHNVGLGATR+P LVRRIGAATALEVRATGISFAFA
Sbjct: 124  LGIPMMYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLVRRIGAATALEVRATGISFAFA 183

Query: 514  PCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACA 573
            PCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACA
Sbjct: 184  PCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACA 243

Query: 574  KHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANH 633
            KHFVGDGGTTNGINENNTVID HGLLSIHMPAY DSIIKGVS+VM SYSSWNGVKMHANH
Sbjct: 244  KHFVGDGGTTNGINENNTVIDWHGLLSIHMPAYYDSIIKGVSSVMISYSSWNGVKMHANH 303

Query: 634  ELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFI 693
            ELITGFLKGTLKFKGFVISDWEGLDRITSPPH+NYTYSVQAAILAGIDMVMVPYKYTEFI
Sbjct: 304  ELITGFLKGTLKFKGFVISDWEGLDRITSPPHSNYTYSVQAAILAGIDMVMVPYKYTEFI 363

Query: 694  DDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQAHRDLAREAV 753
            DDLT+LV+SNVIPMDRIDDA GRILSVKF+MGLFE+P+GDYSLVNELGSQ HRDLAREAV
Sbjct: 364  DDLTHLVQSNVIPMDRIDDAAGRILSVKFSMGLFENPMGDYSLVNELGSQEHRDLAREAV 423

Query: 754  RQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGT 813
            RQSLVLLKNGKNDS+P+LPLPKKAPKILVAGTH DNLGYQCGGWTI WQGFSGNNGTRGT
Sbjct: 424  RQSLVLLKNGKNDSQPVLPLPKKAPKILVAGTHVDNLGYQCGGWTIAWQGFSGNNGTRGT 483

Query: 814  SILAAIKSMVDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPG 873
            SILAAIKS VDPSTEVVF EDPDS+FVKSNDFSYAIVVVGE PYAE+ GDSTTLTMLDPG
Sbjct: 484  SILAAIKSTVDPSTEVVFSEDPDSNFVKSNDFSYAIVVVGEMPYAESVGDSTTLTMLDPG 543

Query: 874  PSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVTDVLYGDYGFSG 933
            P+ IKNVCDSVKCVVVV+SGRPIV+EPYISSIDALVAAWLPG+EGLGVTD LYGD+GFSG
Sbjct: 544  PNTIKNVCDSVKCVVVVVSGRPIVMEPYISSIDALVAAWLPGTEGLGVTDCLYGDHGFSG 603

Query: 934  KLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLA-MI 993
            KLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGL T+SVKDLV+RSTS GIR T S +A +I
Sbjct: 604  KLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLGTDSVKDLVARSTSSGIRGTASVIATII 663

Query: 994  VATIAICILQ 1003
            VA +AICILQ
Sbjct: 664  VAALAICILQ 673

BLAST of Sgr028903 vs. ExPASy TrEMBL
Match: A0A1S3BGE4 (beta-glucosidase BoGH3B isoform X1 OS=Cucumis melo OX=3656 GN=LOC103489355 PE=3 SV=1)

HSP 1 Score: 1254.6 bits (3245), Expect = 0.0e+00
Identity = 601/681 (88.25%), Positives = 639/681 (93.83%), Query Frame = 0

Query: 321  FMMLKLKLLWNWKQCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPV 380
            FM+L LKLLW WK+CG  +Q +KMAKIFVQVV ILCLGW WWATMVDAE LKYKDP QPV
Sbjct: 97   FMVLNLKLLWKWKECGLNTQAKKMAKIFVQVVVILCLGWLWWATMVDAENLKYKDPKQPV 156

Query: 381  SVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDASAQDWIN 440
             VRVKDLLGRMTLEEKIGQMVQIDRSVAN TVMKDYFIGS+LSGGGSVPLPDA A+DW++
Sbjct: 157  GVRVKDLLGRMTLEEKIGQMVQIDRSVANATVMKDYFIGSILSGGGSVPLPDARAEDWVD 216

Query: 441  MINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATA 500
            MINDFQKGSLSSRLGIPM YGIDAVHGHN VYNAT+FPHNVGLGATR+P L RRIGAATA
Sbjct: 217  MINDFQKGSLSSRLGIPMFYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLARRIGAATA 276

Query: 501  LEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGI 560
            LEVRATGIS+ FAPC+AVCRDPRWGRCYESYSEDPKIV+EMTEII GLQGEPPANYRKG 
Sbjct: 277  LEVRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKIVKEMTEIIIGLQGEPPANYRKGT 336

Query: 561  PYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMAS 620
            PYVGGTKKVIACAKHFVGDGGTT+GINENNTVI++HGLLSIHMPAYLDSIIKGVS+VMAS
Sbjct: 337  PYVGGTKKVIACAKHFVGDGGTTHGINENNTVINRHGLLSIHMPAYLDSIIKGVSSVMAS 396

Query: 621  YSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGI 680
            YSSWNGVKMHAN ELIT FLKG LKFKGFVISDWEGLDRITS PH+NYTYSVQAAILAGI
Sbjct: 397  YSSWNGVKMHANRELITDFLKGALKFKGFVISDWEGLDRITSTPHSNYTYSVQAAILAGI 456

Query: 681  DMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNEL 740
            DMVM+PYKY EFIDDL +LVKSNVIPMDRIDDAVGRIL+VKFTMGLFESP+ DYSLVNEL
Sbjct: 457  DMVMIPYKYAEFIDDLKFLVKSNVIPMDRIDDAVGRILTVKFTMGLFESPMADYSLVNEL 516

Query: 741  GSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIE 800
            GSQAHRDLAR+AVRQSLVLLKNGKNDS+PLLPL KK+PKILVAGTHADNLGYQCGGWTI 
Sbjct: 517  GSQAHRDLARDAVRQSLVLLKNGKNDSKPLLPLSKKSPKILVAGTHADNLGYQCGGWTIA 576

Query: 801  WQGFSGNNGTRGTSILAAIKSMVDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAET 860
            WQGFSGNNGTRGT+ILAAIKS VDPSTEVVFREDPDSDFVKSNDFSYAIVV+GE PYAET
Sbjct: 577  WQGFSGNNGTRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSNDFSYAIVVIGEAPYAET 636

Query: 861  EGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLG 920
             GDSTTLTMLDPGP+IIKNVCD V+CVV++ISGRPIVIEPYISSIDALVAAWLPG+EG G
Sbjct: 637  GGDSTTLTMLDPGPNIIKNVCDHVECVVILISGRPIVIEPYISSIDALVAAWLPGTEGQG 696

Query: 921  VTDVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSP 980
            VTD LYGD+GFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTT SVKD+++RSTS 
Sbjct: 697  VTDALYGDHGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTGSVKDIIARSTSA 756

Query: 981  GIRETPSFLAMIVATIAICIL 1002
            GIR TPS +A IV  I +CIL
Sbjct: 757  GIRGTPSLIASIVVAITLCIL 777

BLAST of Sgr028903 vs. ExPASy TrEMBL
Match: A0A6J1HX36 (uncharacterized protein LOC111467651 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111467651 PE=3 SV=1)

HSP 1 Score: 1250.0 bits (3233), Expect = 0.0e+00
Identity = 603/680 (88.68%), Positives = 635/680 (93.38%), Query Frame = 0

Query: 323  MLKLKLLWNWKQCGFTSQKRKMAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSV 382
            ML LKL W WK+CG  S  +KMAKIFVQVV ILCLGWWWWA MVDAE LKYKDP QPVSV
Sbjct: 1    MLNLKLPWKWKECGLNSSGKKMAKIFVQVVVILCLGWWWWAIMVDAENLKYKDPKQPVSV 60

Query: 383  RVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPLPDASAQDWINMI 442
            RVKDLLGRMTLEEKIGQMVQIDRSVAN TVMK+YFIGSVLSGGGSVPLPDA AQDW++MI
Sbjct: 61   RVKDLLGRMTLEEKIGQMVQIDRSVANATVMKNYFIGSVLSGGGSVPLPDARAQDWVDMI 120

Query: 443  NDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALE 502
            NDFQKGSLSSRLGIPM+YGIDAVHGHN VYNAT+FPHNVGLGATR+P L+RRIGAATALE
Sbjct: 121  NDFQKGSLSSRLGIPMIYGIDAVHGHNNVYNATVFPHNVGLGATRNPDLLRRIGAATALE 180

Query: 503  VRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPY 562
            VRATGIS+ FAPC+AVCRDPRWGRCYESYSEDPK+VQ MTEII GLQGEPPANYRKGIPY
Sbjct: 181  VRATGISYTFAPCLAVCRDPRWGRCYESYSEDPKLVQNMTEIIIGLQGEPPANYRKGIPY 240

Query: 563  VGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYS 622
            VGGTKKVIACAKHFVGDGGTT+GINENNTVID+HGLL IHMPAYLDSIIKGVS+VM SYS
Sbjct: 241  VGGTKKVIACAKHFVGDGGTTHGINENNTVIDRHGLLGIHMPAYLDSIIKGVSSVMVSYS 300

Query: 623  SWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDM 682
            SWNGVKMHAN +LIT FLKGTLKFKGFVISDWEGLDRITS PH+NYTYSVQAAI AGIDM
Sbjct: 301  SWNGVKMHANRDLITRFLKGTLKFKGFVISDWEGLDRITSTPHSNYTYSVQAAISAGIDM 360

Query: 683  VMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGS 742
            VMVPYKY EFIDDL  LVK+NV+PMDRIDDAV RILSVKFTMGLFESPLGDYSLVNELGS
Sbjct: 361  VMVPYKYAEFIDDLKLLVKNNVVPMDRIDDAVARILSVKFTMGLFESPLGDYSLVNELGS 420

Query: 743  QAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQ 802
            QAHRDLAR+AVRQSLVLLKNGKNDS PLLPL KKAPKILV GTHADNLGYQCGGWTI WQ
Sbjct: 421  QAHRDLARDAVRQSLVLLKNGKNDSDPLLPLSKKAPKILVVGTHADNLGYQCGGWTIAWQ 480

Query: 803  GFSGNNGTRGTSILAAIKSMVDPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEG 862
            GFSGNN TRGT+ILAAIKS VDPSTEVVFREDPDSDFVKSN FSYAIVV+GE PYAET G
Sbjct: 481  GFSGNNATRGTTILAAIKSTVDPSTEVVFREDPDSDFVKSNGFSYAIVVIGEAPYAETGG 540

Query: 863  DSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVT 922
            DSTTLTMLDPGPSIIKNVC+SVKCVVVVISGRPIV+EPYISS+DALVAAWLPG+EGLGVT
Sbjct: 541  DSTTLTMLDPGPSIIKNVCESVKCVVVVISGRPIVMEPYISSMDALVAAWLPGTEGLGVT 600

Query: 923  DVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGI 982
            D LYGD+GFSGKLPRTWFKSVDQLPMN GD HYDPLFP GFGLTT SVKD+V+RSTS G 
Sbjct: 601  DALYGDHGFSGKLPRTWFKSVDQLPMNFGDRHYDPLFPLGFGLTTGSVKDIVARSTSAGT 660

Query: 983  RETPSFLAMIVATIAICILQ 1003
            R TPSF+AMIVATIA+C+LQ
Sbjct: 661  RGTPSFIAMIVATIAVCVLQ 680

BLAST of Sgr028903 vs. TAIR 10
Match: AT5G04885.1 (Glycosyl hydrolase family protein)

HSP 1 Score: 1007.3 bits (2603), Expect = 1.0e-293
Identity = 475/657 (72.30%), Positives = 561/657 (85.39%), Query Frame = 0

Query: 344  MAKIFVQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQI 403
            M++  V++V +L     W     D EYL YKDP Q VS RV DL GRMTLEEKIGQMVQI
Sbjct: 1    MSRDSVRIVGVLLWMCMWVCCYGDGEYLLYKDPKQTVSDRVADLFGRMTLEEKIGQMVQI 60

Query: 404  DRSVANVTVMKDYFIGSVLSGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGID 463
            DRSVA V +M+DYFIGSVLSGGGS PLP+ASAQ+W++MIN++QKG+L SRLGIPM+YGID
Sbjct: 61   DRSVATVNIMRDYFIGSVLSGGGSAPLPEASAQNWVDMINEYQKGALVSRLGIPMIYGID 120

Query: 464  AVHGHNTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPR 523
            AVHGHN VYNATIFPHNVGLGATRDP LV+RIGAATA+EVRATGI + FAPCIAVCRDPR
Sbjct: 121  AVHGHNNVYNATIFPHNVGLGATRDPDLVKRIGAATAVEVRATGIPYTFAPCIAVCRDPR 180

Query: 524  WGRCYESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTT 583
            WGRCYESYSED K+V++MT++I GLQGEPP+NY+ G+P+VGG  KV ACAKH+VGDGGTT
Sbjct: 181  WGRCYESYSEDHKVVEDMTDVILGLQGEPPSNYKHGVPFVGGRDKVAACAKHYVGDGGTT 240

Query: 584  NGINENNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANHELITGFLKGT 643
             G+NENNTV D HGLLS+HMPAY D++ KGVSTVM SYSSWNG KMHAN ELITG+LKGT
Sbjct: 241  RGVNENNTVTDLHGLLSVHMPAYADAVYKGVSTVMVSYSSWNGEKMHANTELITGYLKGT 300

Query: 644  LKFKGFVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSN 703
            LKFKGFVISDW+G+D+I++PPH +YT SV+AAI AGIDMVMVP+ +TEF++DLT LVK+N
Sbjct: 301  LKFKGFVISDWQGVDKISTPPHTHYTASVRAAIQAGIDMVMVPFNFTEFVNDLTTLVKNN 360

Query: 704  VIPMDRIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNG 763
             IP+ RIDDAV RIL VKFTMGLFE+PL DYS  +ELGSQAHRDLAREAVR+SLVLLKNG
Sbjct: 361  SIPVTRIDDAVRRILLVKFTMGLFENPLADYSFSSELGSQAHRDLAREAVRKSLVLLKNG 420

Query: 764  KNDSRPLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMV 823
             N + P+LPLP+K  KILVAGTHADNLGYQCGGWTI WQGFSGN  TRGT++L+A+KS V
Sbjct: 421  -NKTNPMLPLPRKTSKILVAGTHADNLGYQCGGWTITWQGFSGNKNTRGTTLLSAVKSAV 480

Query: 824  DPSTEVVFREDPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDS 883
            D STEVVFRE+PD++F+KSN+F+YAI+ VGE PYAET GDS  LTMLDPGP+II + C +
Sbjct: 481  DQSTEVVFRENPDAEFIKSNNFAYAIIAVGEPPYAETAGDSDKLTMLDPGPAIISSTCQA 540

Query: 884  VKCVVVVISGRPIVIEPYISSIDALVAAWLPGSEGLGVTDVLYGDYGFSGKLPRTWFKSV 943
            VKCVVVVISGRP+V+EPY++SIDALVAAWLPG+EG G+TD L+GD+GFSGKLP TWF++ 
Sbjct: 541  VKCVVVVISGRPLVMEPYVASIDALVAAWLPGTEGQGITDALFGDHGFSGKLPVTWFRNT 600

Query: 944  DQLPMNVGDPHYDPLFPFGFGLTTESVKDLVSRSTSPGIRETPSFLAMIVATIAICI 1001
            +QLPM+ GD HYDPLF +G GL TESV  +V+RSTS     T   L  ++ +  +C+
Sbjct: 601  EQLPMSYGDTHYDPLFAYGSGLETESVASIVARSTSASATNTKPCLYTVLVSATLCL 656

BLAST of Sgr028903 vs. TAIR 10
Match: AT5G20950.1 (Glycosyl hydrolase family protein)

HSP 1 Score: 934.5 bits (2414), Expect = 8.5e-272
Identity = 438/601 (72.88%), Positives = 513/601 (85.36%), Query Frame = 0

Query: 371 LKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPL 430
           LKYKDP QP+  R++DL+ RMTL+EKIGQMVQI+RSVA   VMK YFIGSVLSGGGSVP 
Sbjct: 24  LKYKDPKQPLGARIRDLMNRMTLQEKIGQMVQIERSVATPEVMKKYFIGSVLSGGGSVPS 83

Query: 431 PDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPG 490
             A+ + W+NM+N+ QK SLS+RLGIPM+YGIDAVHGHN VY ATIFPHNVGLG TRDP 
Sbjct: 84  EKATPETWVNMVNEIQKASLSTRLGIPMIYGIDAVHGHNNVYGATIFPHNVGLGVTRDPN 143

Query: 491 LVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQG 550
           LV+RIGAATALEVRATGI +AFAPCIAVCRDPRWGRCYESYSED +IVQ+MTEIIPGLQG
Sbjct: 144 LVKRIGAATALEVRATGIPYAFAPCIAVCRDPRWGRCYESYSEDYRIVQQMTEIIPGLQG 203

Query: 551 EPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSI 610
           + P   RKG+P+VGG  KV ACAKHFVGDGGT  GI+ENNTVID  GL  IHMP Y +++
Sbjct: 204 DLPTK-RKGVPFVGGKTKVAACAKHFVGDGGTVRGIDENNTVIDSKGLFGIHMPGYYNAV 263

Query: 611 IKGVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTY 670
            KGV+T+M SYS+WNG++MHAN EL+TGFLK  LKF+GFVISDW+G+DRIT+PPH NY+Y
Sbjct: 264 NKGVATIMVSYSAWNGLRMHANKELVTGFLKNKLKFRGFVISDWQGIDRITTPPHLNYSY 323

Query: 671 SVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESP 730
           SV A I AGIDM+MVPY YTEFID+++  ++  +IP+ RIDDA+ RIL VKFTMGLFE P
Sbjct: 324 SVYAGISAGIDMIMVPYNYTEFIDEISSQIQKKLIPISRIDDALKRILRVKFTMGLFEEP 383

Query: 731 LGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNL 790
           L D S  N+LGS+ HR+LAREAVR+SLVLLKNGK  ++PLLPLPKK+ KILVAG HADNL
Sbjct: 384 LADLSFANQLGSKEHRELAREAVRKSLVLLKNGKTGAKPLLPLPKKSGKILVAGAHADNL 443

Query: 791 GYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVFREDPDSDFVKSNDFSYAIV 850
           GYQCGGWTI WQG +GN+ T GT+ILAA+K+ V P+T+VV+ ++PD++FVKS  F YAIV
Sbjct: 444 GYQCGGWTITWQGLNGNDHTVGTTILAAVKNTVAPTTQVVYSQNPDANFVKSGKFDYAIV 503

Query: 851 VVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVA 910
           VVGE PYAE  GD+T LT+ DPGPSII NVC SVKCVVVV+SGRP+VI+PY+S+IDALVA
Sbjct: 504 VVGEPPYAEMFGDTTNLTISDPGPSIIGNVCGSVKCVVVVVSGRPVVIQPYVSTIDALVA 563

Query: 911 AWLPGSEGLGVTDVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESV 970
           AWLPG+EG GV D L+GDYGF+GKL RTWFKSV QLPMNVGD HYDPL+PFGFGLTT+  
Sbjct: 564 AWLPGTEGQGVADALFGDYGFTGKLARTWFKSVKQLPMNVGDRHYDPLYPFGFGLTTKPY 623

Query: 971 K 972
           K
Sbjct: 624 K 623

BLAST of Sgr028903 vs. TAIR 10
Match: AT5G20950.2 (Glycosyl hydrolase family protein)

HSP 1 Score: 934.5 bits (2414), Expect = 8.5e-272
Identity = 438/601 (72.88%), Positives = 513/601 (85.36%), Query Frame = 0

Query: 371 LKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSGGGSVPL 430
           LKYKDP QP+  R++DL+ RMTL+EKIGQMVQI+RSVA   VMK YFIGSVLSGGGSVP 
Sbjct: 24  LKYKDPKQPLGARIRDLMNRMTLQEKIGQMVQIERSVATPEVMKKYFIGSVLSGGGSVPS 83

Query: 431 PDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLGATRDPG 490
             A+ + W+NM+N+ QK SLS+RLGIPM+YGIDAVHGHN VY ATIFPHNVGLG TRDP 
Sbjct: 84  EKATPETWVNMVNEIQKASLSTRLGIPMIYGIDAVHGHNNVYGATIFPHNVGLGVTRDPN 143

Query: 491 LVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEIIPGLQG 550
           LV+RIGAATALEVRATGI +AFAPCIAVCRDPRWGRCYESYSED +IVQ+MTEIIPGLQG
Sbjct: 144 LVKRIGAATALEVRATGIPYAFAPCIAVCRDPRWGRCYESYSEDYRIVQQMTEIIPGLQG 203

Query: 551 EPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMPAYLDSI 610
           + P   RKG+P+VGG  KV ACAKHFVGDGGT  GI+ENNTVID  GL  IHMP Y +++
Sbjct: 204 DLPTK-RKGVPFVGGKTKVAACAKHFVGDGGTVRGIDENNTVIDSKGLFGIHMPGYYNAV 263

Query: 611 IKGVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPPHANYTY 670
            KGV+T+M SYS+WNG++MHAN EL+TGFLK  LKF+GFVISDW+G+DRIT+PPH NY+Y
Sbjct: 264 NKGVATIMVSYSAWNGLRMHANKELVTGFLKNKLKFRGFVISDWQGIDRITTPPHLNYSY 323

Query: 671 SVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTMGLFESP 730
           SV A I AGIDM+MVPY YTEFID+++  ++  +IP+ RIDDA+ RIL VKFTMGLFE P
Sbjct: 324 SVYAGISAGIDMIMVPYNYTEFIDEISSQIQKKLIPISRIDDALKRILRVKFTMGLFEEP 383

Query: 731 LGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAGTHADNL 790
           L D S  N+LGS+ HR+LAREAVR+SLVLLKNGK  ++PLLPLPKK+ KILVAG HADNL
Sbjct: 384 LADLSFANQLGSKEHRELAREAVRKSLVLLKNGKTGAKPLLPLPKKSGKILVAGAHADNL 443

Query: 791 GYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVFREDPDSDFVKSNDFSYAIV 850
           GYQCGGWTI WQG +GN+ T GT+ILAA+K+ V P+T+VV+ ++PD++FVKS  F YAIV
Sbjct: 444 GYQCGGWTITWQGLNGNDHTVGTTILAAVKNTVAPTTQVVYSQNPDANFVKSGKFDYAIV 503

Query: 851 VVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEPYISSIDALVA 910
           VVGE PYAE  GD+T LT+ DPGPSII NVC SVKCVVVV+SGRP+VI+PY+S+IDALVA
Sbjct: 504 VVGEPPYAEMFGDTTNLTISDPGPSIIGNVCGSVKCVVVVVSGRPVVIQPYVSTIDALVA 563

Query: 911 AWLPGSEGLGVTDVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFGFGLTTESV 970
           AWLPG+EG GV D L+GDYGF+GKL RTWFKSV QLPMNVGD HYDPL+PFGFGLTT+  
Sbjct: 564 AWLPGTEGQGVADALFGDYGFTGKLARTWFKSVKQLPMNVGDRHYDPLYPFGFGLTTKPY 623

Query: 971 K 972
           K
Sbjct: 624 K 623

BLAST of Sgr028903 vs. TAIR 10
Match: AT5G20940.1 (Glycosyl hydrolase family protein)

HSP 1 Score: 881.3 bits (2276), Expect = 8.5e-256
Identity = 428/620 (69.03%), Positives = 506/620 (81.61%), Query Frame = 0

Query: 349 VQVVAILCLGWWWWATMVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVA 408
           +Q + +L L     A  V     KYKDP +P+ VR+K+L+  MTLEEKIGQMVQ++R  A
Sbjct: 8   LQTLGLLLLCCTVAANKVPLANAKYKDPKEPLGVRIKNLMSHMTLEEKIGQMVQVERVNA 67

Query: 409 NVTVMKDYFIGSVLSGGGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGH 468
              VM+ YF+GSV SGGGSVP P    + W+NM+N+ QK +LS+RLGIP++YGIDAVHGH
Sbjct: 68  TTEVMQKYFVGSVFSGGGSVPKPYIGPEAWVNMVNEVQKKALSTRLGIPIIYGIDAVHGH 127

Query: 469 NTVYNATIFPHNVGLGATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCY 528
           NTVYNATIFPHNVGLG TRDPGLV+RIG ATALEVRATGI + FAPCIAVCRDPRWGRCY
Sbjct: 128 NTVYNATIFPHNVGLGVTRDPGLVKRIGEATALEVRATGIQYVFAPCIAVCRDPRWGRCY 187

Query: 529 ESYSEDPKIVQEMTEIIPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINE 588
           ESYSED KIVQ+MTEIIPGLQG+ P   +KG+P+V G  KV ACAKHFVGDGGT  G+N 
Sbjct: 188 ESYSEDHKIVQQMTEIIPGLQGDLPTG-QKGVPFVAGKTKVAACAKHFVGDGGTLRGMNA 247

Query: 589 NNTVIDKHGLLSIHMPAYLDSIIKGVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKG 648
           NNTVI+ +GLL IHMPAY D++ KGV+TVM SYSS NG+KMHAN +LITGFLK  LKF+G
Sbjct: 248 NNTVINSNGLLGIHMPAYHDAVNKGVATVMVSYSSINGLKMHANKKLITGFLKNKLKFRG 307

Query: 649 FVISDWEGLDRITSPPHANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMD 708
            VISD+ G+D+I +P  ANY++SV AA  AG+DM M     T+ ID+LT  VK   IPM 
Sbjct: 308 IVISDYLGVDQINTPLGANYSHSVYAATTAGLDMFMGSSNLTKLIDELTSQVKRKFIPMS 367

Query: 709 RIDDAVGRILSVKFTMGLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSR 768
           RIDDAV RIL VKFTMGLFE+P+ D+SL  +LGS+ HR+LAREAVR+SLVLLKNG+N  +
Sbjct: 368 RIDDAVKRILRVKFTMGLFENPIADHSLAKKLGSKEHRELAREAVRKSLVLLKNGENADK 427

Query: 769 PLLPLPKKAPKILVAGTHADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTE 828
           PLLPLPKKA KILVAGTHADNLGYQCGGWTI WQG +GNN T GT+ILAA+K  VDP T+
Sbjct: 428 PLLPLPKKANKILVAGTHADNLGYQCGGWTITWQGLNGNNLTIGTTILAAVKKTVDPKTQ 487

Query: 829 VVFREDPDSDFVKSNDFSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVV 888
           V++ ++PD++FVK+ DF YAIV VGE PYAE  GDST LT+ +PGPS I NVC SVKCVV
Sbjct: 488 VIYNQNPDTNFVKAGDFDYAIVAVGEKPYAEGFGDSTNLTISEPGPSTIGNVCASVKCVV 547

Query: 889 VVISGRPIVIEPYISSIDALVAAWLPGSEGLGVTDVLYGDYGFSGKLPRTWFKSVDQLPM 948
           VV+SGRP+V++  IS+IDALVAAWLPG+EG GV DVL+GDYGF+GKL RTWFK+VDQLPM
Sbjct: 548 VVVSGRPVVMQ--ISNIDALVAAWLPGTEGQGVADVLFGDYGFTGKLARTWFKTVDQLPM 607

Query: 949 NVGDPHYDPLFPFGFGLTTE 969
           NVGDPHYDPL+PFGFGL T+
Sbjct: 608 NVGDPHYDPLYPFGFGLITK 624

BLAST of Sgr028903 vs. TAIR 10
Match: AT3G47000.1 (Glycosyl hydrolase family protein)

HSP 1 Score: 735.3 bits (1897), Expect = 7.5e-212
Identity = 356/608 (58.55%), Positives = 451/608 (74.18%), Query Frame = 0

Query: 365 MVDAEYLKYKDPNQPVSVRVKDLLGRMTLEEKIGQMVQIDRSVANVTVMKDYFIGSVLSG 424
           +V+     YK+ + PV  RVKDLL RMTL EKIGQM QI+R VA+ +   D+FIGSVL+ 
Sbjct: 2   VVEESSCVYKNGDAPVEARVKDLLSRMTLPEKIGQMTQIERRVASPSAFTDFFIGSVLNA 61

Query: 425 GGSVPLPDASAQDWINMINDFQKGSLSSRLGIPMMYGIDAVHGHNTVYNATIFPHNVGLG 484
           GGSVP  DA + DW +MI+ FQ+ +L+SRLGIP++YG DAVHG+N VY AT+FPHN+GLG
Sbjct: 62  GGSVPFEDAKSSDWADMIDGFQRSALASRLGIPIIYGTDAVHGNNNVYGATVFPHNIGLG 121

Query: 485 ATRDPGLVRRIGAATALEVRATGISFAFAPCIAVCRDPRWGRCYESYSEDPKIVQEMTEI 544
           ATRD  LVRRIGAATALEVRA+G+ +AF+PC+AV RDPRWGRCYESY EDP++V EMT +
Sbjct: 122 ATRDADLVRRIGAATALEVRASGVHWAFSPCVAVLRDPRWGRCYESYGEDPELVCEMTSL 181

Query: 545 IPGLQGEPPANYRKGIPYVGGTKKVIACAKHFVGDGGTTNGINENNTVIDKHGLLSIHMP 604
           + GLQG PP  +  G P+V G   V+AC KHFVGDGGT  GINE NT+     L  IH+P
Sbjct: 182 VSGLQGVPPEEHPNGYPFVAGRNNVVACVKHFVGDGGTDKGINEGNTIASYEELEKIHIP 241

Query: 605 AYLDSIIKGVSTVMASYSSWNGVKMHANHELITGFLKGTLKFKGFVISDWEGLDRITSPP 664
            YL  + +GVSTVMASYSSWNG ++HA+  L+T  LK  L FKGF++SDWEGLDR++ P 
Sbjct: 242 PYLKCLAQGVSTVMASYSSWNGTRLHADRFLLTEILKEKLGFKGFLVSDWEGLDRLSEPQ 301

Query: 665 HANYTYSVQAAILAGIDMVMVPYKYTEFIDDLTYLVKSNVIPMDRIDDAVGRILSVKFTM 724
            +NY Y ++ A+ AGIDMVMVP+KY +FI D+T LV+S  IPM RI+DAV RIL VKF  
Sbjct: 302 GSNYRYCIKTAVNAGIDMVMVPFKYEQFIQDMTDLVESGEIPMARINDAVERILRVKFVA 361

Query: 725 GLFESPLGDYSLVNELGSQAHRDLAREAVRQSLVLLKNGKNDSRPLLPLPKKAPKILVAG 784
           GLF  PL D SL+  +G + HR+LA+EAVR+SLVLLK+GKN  +P LPL + A +ILV G
Sbjct: 362 GLFGHPLTDRSLLPTVGCKEHRELAQEAVRKSLVLLKSGKNADKPFLPLDRNAKRILVTG 421

Query: 785 THADNLGYQCGGWTIEWQGFSGNNGTRGTSILAAIKSMVDPSTEVVFREDPDSDFVKSND 844
           THAD+LGYQCGGWT  W G SG   T GT++L AIK  V   TEV++ + P  + + S++
Sbjct: 422 THADDLGYQCGGWTKTWFGLSGRI-TIGTTLLDAIKEAVGDETEVIYEKTPSKETLASSE 481

Query: 845 -FSYAIVVVGETPYAETEGDSTTLTMLDPGPSIIKNVCDSVKCVVVVISGRPIVIEP-YI 904
            FSYAIV VGE PYAET GD++ L +   G  I+  V + +  +V++ISGRP+V+EP  +
Sbjct: 482 GFSYAIVAVGEPPYAETMGDNSELRIPFNGTDIVTAVAEIIPTLVILISGRPVVLEPTVL 541

Query: 905 SSIDALVAAWLPGSEGLGVTDVLYGDYGFSGKLPRTWFKSVDQLPMNVGDPHYDPLFPFG 964
              +ALVAAWLPG+EG GV DV++GDY F GKLP +WFK V+ LP++     YDPLFPFG
Sbjct: 542 EKTEALVAAWLPGTEGQGVADVVFGDYDFKGKLPVSWFKHVEHLPLDAHANSYDPLFPFG 601

Query: 965 FGLTTESV 971
           FGL ++ V
Sbjct: 602 FGLNSKPV 608
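
The percentages on each HSP summary line follow directly from the match counts: for example, 356 identical positions over an alignment length of 608 gives 58.55%. The Python sketch below is not part of the BLAST output. It parses one summary block and recomputes both percentages. The function name `parse_hsp` and the regular expression are illustrative assumptions based only on the text layout of this page, not a general BLAST parser.

```python
import re

# Assumed layout: the two-line HSP summary as printed on this page.
# "Posi?tives" accepts both "Positives" and the "Postives" misspelling
# that some BLAST report pages emit.
HSP_RE = re.compile(
    r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)\s+"
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp(text):
    """Parse one HSP summary and recompute the percentages from the counts."""
    m = HSP_RE.search(text)
    if m is None:
        raise ValueError("no HSP summary found")
    bits, raw, evalue, ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "bits": float(bits),
        "raw_score": int(raw),
        "evalue": float(evalue),
        # Recomputed values: matches / alignment length, as a percentage.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


example = (
    "HSP 1 Score: 735.3 bits (1897), Expect = 7.5e-212\n"
    "Identity = 356/608 (58.55%), Positives = 451/608 (74.18%), Query Frame = 0"
)
stats = parse_hsp(example)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed and reported values is a quick consistency check when hit statistics are copied from a page like this one into a spreadsheet or script.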

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022150694.1 | 0.0e+00 | 91.63 | uncharacterized protein LOC111018764 isoform X1 [Momordica charantia] >XP_022150...
XP_022150698.1 | 0.0e+00 | 91.63 | uncharacterized protein LOC111018764 isoform X3 [Momordica charantia]
XP_022150697.1 | 0.0e+00 | 91.64 | uncharacterized protein LOC111018764 isoform X2 [Momordica charantia]
XP_008446716.1 | 0.0e+00 | 88.25 | PREDICTED: beta-glucosidase BoGH3B isoform X1 [Cucumis melo]
XP_038892436.1 | 0.0e+00 | 89.51 | beta-glucosidase BoGH3B-like isoform X1 [Benincasa hispida]
Match Name | E-value | Identity (%) | Description
A7LXU3 | 5.7e-79 | 31.39 | Beta-glucosidase BoGH3B OS=Bacteroides ovatus (strain ATCC 8483 / DSM 1896 / JCM...
Q23892 | 2.7e-73 | 30.64 | Lysosomal beta glucosidase OS=Dictyostelium discoideum OX=44689 GN=gluA PE=1 SV=...
Q56078 | 2.9e-59 | 29.11 | Periplasmic beta-glucosidase OS=Salmonella typhimurium (strain LT2 / SGSC1412 / ...
P33363 | 1.8e-56 | 28.15 | Periplasmic beta-glucosidase OS=Escherichia coli (strain K12) OX=83333 GN=bglX P...
T2KMH0 | 9.1e-53 | 29.31 | Beta-xylosidase OS=Formosa agariphila (strain DSM 15362 / KCTC 12365 / LMG 23005...
Match Name | E-value | Identity (%) | Description
A0A6J1DCA3 | 0.0e+00 | 91.63 | uncharacterized protein LOC111018764 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1DA47 | 0.0e+00 | 91.63 | uncharacterized protein LOC111018764 isoform X3 OS=Momordica charantia OX=3673 G...
A0A6J1DBF5 | 0.0e+00 | 91.64 | uncharacterized protein LOC111018764 isoform X2 OS=Momordica charantia OX=3673 G...
A0A1S3BGE4 | 0.0e+00 | 88.25 | beta-glucosidase BoGH3B isoform X1 OS=Cucumis melo OX=3656 GN=LOC103489355 PE=3 ...
A0A6J1HX36 | 0.0e+00 | 88.68 | uncharacterized protein LOC111467651 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
Match Name | E-value | Identity (%) | Description
AT5G04885.1 | 1.0e-293 | 72.30 | Glycosyl hydrolase family protein
AT5G20950.1 | 8.5e-272 | 72.88 | Glycosyl hydrolase family protein
AT5G20950.2 | 8.5e-272 | 72.88 | Glycosyl hydrolase family protein
AT5G20940.1 | 8.5e-256 | 69.03 | Glycosyl hydrolase family protein
AT3G47000.1 | 7.5e-212 | 58.55 | Glycosyl hydrolase family protein
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001764 | Glycoside hydrolase, family 3, N-terminal | PRINTS | PR00133 | GLHYDRLASE3 | coord: 477..496, score: 37.74; coord: 639..657, score: 45.49; coord: 453..469, score: 39.64; coord: 569..585, score: 41.04; coord: 523..539, score: 44.12
IPR001764 | Glycoside hydrolase, family 3, N-terminal | PFAM | PF00933 | Glyco_hydro_3 | coord: 392..720, e-value: 1.3E-69, score: 235.1
IPR036881 | Glycoside hydrolase family 3 C-terminal domain superfamily | GENE3D | 3.40.50.1700 | | coord: 742..969, e-value: 3.6E-71, score: 241.6
IPR036881 | Glycoside hydrolase family 3 C-terminal domain superfamily | SUPERFAMILY | 52279 | Beta-D-glucan exohydrolase, C-terminal domain | coord: 757..966
IPR036962 | Glycoside hydrolase, family 3, N-terminal domain superfamily | GENE3D | 3.20.20.300 | | coord: 366..741, e-value: 2.2E-133, score: 446.8
IPR002772 | Glycoside hydrolase family 3 C-terminal domain | PFAM | PF01915 | Glyco_hydro_3_C | coord: 758..966, e-value: 1.2E-35, score: 123.3
None | No IPR available | PANTHER | PTHR30620:SF35 | GLYCOSYL HYDROLASE FAMILY PROTEIN | coord: 360..992
None | No IPR available | PANTHER | PTHR30620 | PERIPLASMIC BETA-GLUCOSIDASE-RELATED | coord: 360..992
IPR017853 | Glycoside hydrolase superfamily | SUPERFAMILY | 51445 | (Trans)glycosidases | coord: 371..756
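
Each `coord: start..end` entry above gives the span of a signature hit on the protein. Assuming the coordinates are 1-based and inclusive (the usual convention for InterProScan output, but an assumption here), the span length is end - start + 1. A minimal Python sketch, using a hypothetical helper named `span_length`:

```python
import re

# Matches the "coord: start..end" notation used in the Alignment column.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(coord_text):
    """Length of a hit in residues, assuming 1-based inclusive coordinates."""
    m = COORD_RE.search(coord_text)
    if m is None:
        raise ValueError("no coord entry found")
    start, end = int(m.group(1)), int(m.group(2))
    return end - start + 1


# The two Pfam hits listed above (PF00933 and PF01915).
n_terminal = span_length("coord: 392..720")
c_terminal = span_length("coord: 758..966")
print(n_terminal, c_terminal)
```

Under that assumption the Pfam N-terminal hit covers 329 residues and the C-terminal hit covers 209 residues.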

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sgr028903.1 | Sgr028903.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0009251 | glucan catabolic process
biological_process | GO:0005975 | carbohydrate metabolic process
cellular_component | GO:0005576 | extracellular region
cellular_component | GO:0016021 | integral component of membrane
molecular_function | GO:0008422 | beta-glucosidase activity
molecular_function | GO:0004553 | hydrolase activity, hydrolyzing O-glycosyl compounds