Sgr011838 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr011838
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Glycosyl hydrolase family protein
Location: tig00153092: 38578 .. 41770 (+)
RNA-Seq Expression: Sgr011838
Synteny: Sgr011838
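
The Location field above can be split into contig, start, end, and strand. The sketch below is illustrative only: the `Location` type and `parse_location` helper are hypothetical names, and the span length assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed Location field."""
    contig: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a string shaped like 'contig: start .. end (strand)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return Location(m.group(1), int(m.group(2)), int(m.group(3)), m.group(4))


loc = parse_location("tig00153092: 38578 .. 41770 (+)")
# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = loc.end - loc.start + 1
print(loc, span)
```

Under that assumption the feature spans 3193 positions on the forward strand of contig tig00153092.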
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCAAAGTTCTCATCATTTTGATGGGGCTTTTGCTCATTTGTTTCTCTGAAGCATTGACAAAAGCTGAGTACTTGAGATATAGAGATCCAAAACAACCATTAAGAGCTCGAATCAATGACCTCCTTGGTCGAATGACTCTTGAGGAAAAGATAGGTCAAATGGTGCAAATTGAAAGGACTAATGCTTCAACTACGATTATGAAAAAGTATTTCATTGGTAAATATCTTACTTTAATTTTCTCCTTTCCTTTTAAGTATCGGAAAAGAAAAGATATTAAATCGTTTATCTTGTATAGAGAAATTATTATCAATAAAAATGTAACCTTATTTCATAGGGAGTGTTCTAAGTGGTGGAGGTAGTGCTCCTTCAAAGAAAGCTTCCGCTAAGGATTGGATCCACATGGTCAATAAAATCCAAAAAGGGGCTTTGTCAACTAGGCTTGGCATTCCAATGATATATGGAATTGATGCTGTACACGGTCACAACAATGTCTATAATGCAACGATCTTCCCTCACAATATTGGTCTTGGAGCGACAAGGCAAGACTAACAATGCCTACTTATGCTAAATTTTTCTTCAAAACTCTAATCGTATACTCTTCATGTAACTCATATTTACTGGAATTGATATATCTATATGTGGGTGGTTGAGTTATTTAGAGTATTTACCCACTAAATAAGTCTGCACATTTAATTGTATCCCTTGTATGCTAAAAATAAAATAAGAAAACACTATACATTTCACCCTTCATTAGCTAGCTCTTCTCTCAACTGAAAGAGTTATAACTTGCAAATTAGAAGTTCTAAGTTTTTAACTTTTTAACTTTCCAAATTTAGTACTTTGAAATTTCCCTTCAATTAAACAGGGATCCTCAACTTGTAAAGAAGATTGGTGTTGCTACTGCACTTGAAGTTAGAGCTACCGGAATTCCTTATGCTTTTGCACCTTGTGTAGCGGTAACTAACTAGTTCTATTTCAATACTTCAATTGATGAAGAATATTATAACCTTTCTAATGTCAATTTGAGATATATTACTATTTTTTGTTTTTTATATATGAGATGTGTTTTCTTTTGTGGTTTTTTTTTCTTACTTAGGTTTGCAGAGATCCACGATGGGGTCGATGTTATGAAAGCTACAGCGAAGATCCTAAAATTGTTCAAGCAATGACTGAGATCATATCAGGATTACAAGGAGAGATTCCACCAAATTCTCGCAAAGGTGTTCCTCATGTTGCTGGAAAGTAAGTCCATAAAAAAATTATAGCATCATATATTTAGTCTACTTTTTTGATAAATTTAACTTATTGGCTAGAAAATAATATTAAAAAAAAAAATTATAGTCATTAAACATAAAATTCATGTTTACCGGTGGTTAACCTCTAAATGCATATTGCTTCATCCCTATATTAGTTGATATTGAATTAAACTTCTGCTCATGACATGATGACTGGTTGATATAATTTGGAACGTAAAAGAAAAGCTCTAAATGTTTAATTTTGATGATGTTTTGTTTGTTCAAAATAGTAAGAGTAGTATCTTAGTGAAATTGCCCATAAGTCACCCTTTTCTATTATTTATATAGAGAAAAGGTTGCAGCTTGTGCAAAGCACTTTGTGGGCGATGGTGGAACGACTAAGGGTATTAATGAGAACAACACAGTGATAGATAGACATGGATTACTTAGCATTCACATGCCAGGTTACTATAACTCAATCATTAAGGGAGTTGCAACCATTATGGTTTCTTATTCAAGTTGGAATGGAGAGAAGATGCATGCAAACAAAAATCTTATAACCGACTTTCTTAAGAACACTCTTCATTTTCAGGTAATATGGCTTCTTTATTGTATTTTGTGGTTGTCTCTTTGGTACTTAATATATTCGTTATTTGTAGGAAAAACTATAAAAGAGTAACTTAGATCTCTTGTGAATTCTTTCAGGGTTTTGTAATCTCAGATTGGCAAGGTATTGATAGGATTACCACTCCACCTCATGC
TAACTATACGTATTCCATTTTAGCAAGTGTTACTGCTGGTATTGACATGGTTGGTATTGTGTTGAACATTTACGGGTGTATTTCAACAAATAACAATTCTGATTTAGATTTTTTTGCTACAGATAATGGTTCCATACAACTACACAGAGTTCATTGATGCATCATCTACTTGGTAAAAAATAATATAATTCCTATTAGTCGAATTAATGATGCAGTGAAGAGAATATTGAGAGTTAAATTTGTCATGGGTTTATTTGAGAACTCATTAGCTGACCTAAGCTTGGTTAATGAGCTTGGTAAAAAGGTATTTATATGTAATGTCTTTAATATTATAAAACAAAAAAAAAAGAAAAGTTCTTTGCCTTTTTTTTATTATCATTATATTTTGTTTCACAATAGGAGCATAGAGAACTCGCTAGAGAAGCTGTAAGGAAATCATTAGTGTTGTTAAAGAATGGAGAATCAGCTGACAAACCATTGCTACCACTTCCAAAGAAGGCACAAAAATACTTGTTGCTGGTAGCCATGCAAACAACCTTGGATATCAATGTGGCGGTTGGACTATGGAATGGCAAGGACTTAGTGGCAACAACCTTACTATTGGTATAAAAATAACACAATATTTCCTTAAATATTATTTCTCTACGAATTTATGTTTTGGGCATTATGAACTAGCACAATATTTATATCAACTTCCTTTTGCTAGGTACAACCGTGCTTGCAGCTATAAAAGATACAATTGATCCTGAAACTGAAGTTATATTTAAGGAGAATCCAGATAAGGAGTTTTTTCAATCACACAAATTTTCTTATGCCATTGTTGTAGTGGGAGAATATCCATATGCAGAAACCAATGGTGATAGCTTGAATTTGACAATTCCCCACCCTAGTCCAAGCACCATCACAAATGTTTGTGGAGCTGTGAAATGTGTAGTATAATAATCTCAGGGCGGCCTGTAGTAATCCAACCTTATATTGCTTCAATTGATGCACTTGTTGCTGCTTGGCTTCCTGGAACTGAAGGCAAAGGCATTACTGATGTGTTATTTGGGGACTATGGCTTTACCGGCAAGCTTTCGCAAACATGGTTCAAGACTGTTGATCAGCTGCCAATGAATTTTGGAGATCCACATTACGATCCCCTTTTTCCGCTTGGATATGGTCTTACTACTAAGCCTATCATAACCAAATGA

mRNA sequence

ATGGCCAAAGTTCTCATCATTTTGATGGGGCTTTTGCTCATTTGTTTCTCTGAAGCATTGACAAAAGCTGAGTACTTGAGATATAGAGATCCAAAACAACCATTAAGAGCTCGAATCAATGACCTCCTTGGTCGAATGACTCTTGAGGAAAAGATAGGTCAAATGGTGCAAATTGAAAGGACTAATGCTTCAACTACGATTATGAAAAAGGATCCTCAACTTGTAAAGAAGATTGGTGTTGCTACTGCACTTGAAGTTAGAGCTACCGGAATTCCTTATGCTTTTGCACCTTGTGTAGCGGTTTGCAGAGATCCACGATGGGGTCGATGTTATGAAAGCTACAGCGAAGATCCTAAAATTGTTCAAGCAATGACTGAGATCATATCAGGATTACAAGGAGAGATTCCACCAAATTCTCGCAAAGGTGTTCCTCATGTTGCTGGAAAAGAAAAGGTTGCAGCTTGTGCAAAGCACTTTGTGGGCGATGGTGGAACGACTAAGGGTATTAATGAGAACAACACAGTGATAGATAGACATGGATTACTTAGCATTCACATGCCAGGTTACTATAACTCAATCATTAAGGGAGTTGCAACCATTATGGTTTCTTATTCAAGTTGGAATGGAGAGAAGATGCATGCAAACAAAAATCTTATAACCGACTTTCTTAAGAACACTCTTCATTTTCAGGGTTTTGTAATCTCAGATTGGCAAGGTATTGATAGGATTACCACTCCACCTCATGCTAACTATACGTATTCCATTTTAGCAAGTGTTACTGCTGGTATTGACATGGTTGGTATTGTGTTGAACATTTACGGGTTGAAGAGAATATTGAGAGTTAAATTTGTCATGGGTTTATTTGAGAACTCATTAGCTGACCTAAGCTTGGTTAATGAGCTTGGTAAAAAGGAGCATAGAGAACTCGCTAGAGAAGCTGTAAGGAAATCATTAGTGTTGTTAAAGAATGGAGAATCAGCTGACAAACCATTGCTACCACTTCCAAAGAAGGCACAAAAATACTTGTTGCTGGTAGCCATGCAAACAACCTTGGATATCAATGTGGCGGTTGGACTATGGAATGGCAAGGACTTAGTGGCAACAACCTTACTATTGCACAATATTTATATCAACTTCCTTTTGCTAGGTACAACCGTGCTTGCAGCTATAAAAGATACAATTGATCCTGAAACTGAAGTTATATTTAAGGAGAATCCAGATAAGGAGTTTTTTCAATCACACAAATTTTCTTATGCCATTGTTGTAGTGGGAGAATATCCATATGCAGAAACCAATGGGCGGCCTGTAGTAATCCAACCTTATATTGCTTCAATTGATGCACTTGTTGCTGCTTGGCTTCCTGGAACTGAAGGCAAAGGCATTACTGATGTGTTATTTGGGGACTATGGCTTTACCGGCAAGCTTTCGCAAACATGGTTCAAGACTGTTGATCAGCTGCCAATGAATTTTGGAGATCCACATTACGATCCCCTTTTTCCGCTTGGATATGGTCTTACTACTAAGCCTATCATAACCAAATGA

Coding sequence (CDS)

ATGGCCAAAGTTCTCATCATTTTGATGGGGCTTTTGCTCATTTGTTTCTCTGAAGCATTGACAAAAGCTGAGTACTTGAGATATAGAGATCCAAAACAACCATTAAGAGCTCGAATCAATGACCTCCTTGGTCGAATGACTCTTGAGGAAAAGATAGGTCAAATGGTGCAAATTGAAAGGACTAATGCTTCAACTACGATTATGAAAAAGGATCCTCAACTTGTAAAGAAGATTGGTGTTGCTACTGCACTTGAAGTTAGAGCTACCGGAATTCCTTATGCTTTTGCACCTTGTGTAGCGGTTTGCAGAGATCCACGATGGGGTCGATGTTATGAAAGCTACAGCGAAGATCCTAAAATTGTTCAAGCAATGACTGAGATCATATCAGGATTACAAGGAGAGATTCCACCAAATTCTCGCAAAGGTGTTCCTCATGTTGCTGGAAAAGAAAAGGTTGCAGCTTGTGCAAAGCACTTTGTGGGCGATGGTGGAACGACTAAGGGTATTAATGAGAACAACACAGTGATAGATAGACATGGATTACTTAGCATTCACATGCCAGGTTACTATAACTCAATCATTAAGGGAGTTGCAACCATTATGGTTTCTTATTCAAGTTGGAATGGAGAGAAGATGCATGCAAACAAAAATCTTATAACCGACTTTCTTAAGAACACTCTTCATTTTCAGGGTTTTGTAATCTCAGATTGGCAAGGTATTGATAGGATTACCACTCCACCTCATGCTAACTATACGTATTCCATTTTAGCAAGTGTTACTGCTGGTATTGACATGGTTGGTATTGTGTTGAACATTTACGGGTTGAAGAGAATATTGAGAGTTAAATTTGTCATGGGTTTATTTGAGAACTCATTAGCTGACCTAAGCTTGGTTAATGAGCTTGGTAAAAAGGAGCATAGAGAACTCGCTAGAGAAGCTGTAAGGAAATCATTAGTGTTGTTAAAGAATGGAGAATCAGCTGACAAACCATTGCTACCACTTCCAAAGAAGGCACAAAAATACTTGTTGCTGGTAGCCATGCAAACAACCTTGGATATCAATGTGGCGGTTGGACTATGGAATGGCAAGGACTTAGTGGCAACAACCTTACTATTGCACAATATTTATATCAACTTCCTTTTGCTAGGTACAACCGTGCTTGCAGCTATAAAAGATACAATTGATCCTGAAACTGAAGTTATATTTAAGGAGAATCCAGATAAGGAGTTTTTTCAATCACACAAATTTTCTTATGCCATTGTTGTAGTGGGAGAATATCCATATGCAGAAACCAATGGGCGGCCTGTAGTAATCCAACCTTATATTGCTTCAATTGATGCACTTGTTGCTGCTTGGCTTCCTGGAACTGAAGGCAAAGGCATTACTGATGTGTTATTTGGGGACTATGGCTTTACCGGCAAGCTTTCGCAAACATGGTTCAAGACTGTTGATCAGCTGCCAATGAATTTTGGAGATCCACATTACGATCCCCTTTTTCCGCTTGGATATGGTCTTACTACTAAGCCTATCATAACCAAATGA

Protein sequence

MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNASTTIMKKDPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEIISGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLNIYGLKRILRVKFVMGLFENSLADLSLVNELGKKEHRELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENPDKEFFQSHKFSYAIVVVGEYPYAETNGRPVVIQPYIASIDALVAAWLPGTEGKGITDVLFGDYGFTGKLSQTWFKTVDQLPMNFGDPHYDPLFPLGYGLTTKPIITK
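
The protein sequence should follow from the CDS by codon-wise translation. A minimal sketch, assuming the standard genetic code (the record does not say which translation table was used); `translate` is a hypothetical helper, and only the first 30 and last 15 nucleotides of the CDS are checked here to keep the example short.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGCCAAAGTTCTCATCATTTTGATGGGG"  # first 30 nt of the CDS
cds_end = "ATCATAACCAAATGA"                  # last 15 nt of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten residues (MAKVLIILMG) and the last four residues (IITK) of the protein sequence, with the final TGA read as the stop codon.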
Homology
BLAST of Sgr011838 vs. NCBI nr
Match: TYK19869.1 (beta-glucosidase BoGH3B-like [Cucumis melo var. makuwa])

HSP 1 Score: 760.4 bits (1962), Expect = 1.0e-215
Identity = 406/600 (67.67%), Positives = 443/600 (73.83%), Query Frame = 0

Query: 1   MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIER 60
           MAK + IL+GLLL+CF E   KAE L+Y+DPKQPL  RI DLLGRMTLEEKIGQM QIER
Sbjct: 1   MAKAINILIGLLLLCFFETWAKAENLKYKDPKQPLNVRIKDLLGRMTLEEKIGQMTQIER 60

Query: 61  TNASTTIMKK-----------------------------------DPQLVKKIGVATALE 120
            NAST +MKK                                   DPQL+K+IG A+ALE
Sbjct: 61  VNASTDVMKKYFIGSVLSGGGSVPSKEASAQDWVQMVNEIQQGALDPQLLKRIGEASALE 120

Query: 121 VRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEIISGLQGEIPPNSRKGVPH 180
           +RATGIPYAFAPC+AVCRDPRWGRCYESY EDPK+VQ MTEII GLQGEIPPNSRKGVP+
Sbjct: 121 IRATGIPYAFAPCIAVCRDPRWGRCYESYGEDPKLVQEMTEIIPGLQGEIPPNSRKGVPY 180

Query: 181 VAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYS 240
           VAGKEKV ACAKH+VGDGGTTKGI+ENNTVIDRHGLLSIHMPGYY+SIIKGVAT+MVSYS
Sbjct: 181 VAGKEKVVACAKHYVGDGGTTKGIDENNTVIDRHGLLSIHMPGYYHSIIKGVATVMVSYS 240

Query: 241 SWNGEKMHANKNLITDFLKNTLHFQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDM 300
           SWNG KMHANK L+TDFLKNTLHFQGFVISDWQ IDRIT PPHANYTYSILASVTAG+DM
Sbjct: 241 SWNGVKMHANKELVTDFLKNTLHFQGFVISDWQAIDRITDPPHANYTYSILASVTAGLDM 300

Query: 301 VGIVLN----IYGL------------------KRILRVKFVMGLFENSLADLSLVNELGK 360
           + +  N    I GL                  KRILRVKF+MGLFEN +ADLSLVNELGK
Sbjct: 301 IMVPYNYTEFIDGLTYLVNNNFIPITRIDDAVKRILRVKFIMGLFENPIADLSLVNELGK 360

Query: 361 KEHRELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGK 420
           +EHRELAREAVRKSLVLLKNG+SADKPLLPL KK QK  +LVA     ++    G W   
Sbjct: 361 QEHRELAREAVRKSLVLLKNGKSADKPLLPLEKKTQK--ILVAGSHADNLGYQCGGW--- 420

Query: 421 DLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENPDKEFFQSHKFSYAIVVV 480
                T+    +  N L  GTTVL AIKDT+DP TEVIF ENPDK F QS  FSYAIVVV
Sbjct: 421 -----TIEWQGLSGNNLTSGTTVLDAIKDTVDPSTEVIFNENPDKGFLQSGTFSYAIVVV 480

Query: 481 GEYPYAE-------------------------------TNGRPVVIQPYIASIDALVAAW 513
           GE+PYAE                                +GRPVVIQPY+ S+DALVAAW
Sbjct: 481 GEHPYAEMMGDSLNLTIPDPGPSTITNVCGVIKCVVVIISGRPVVIQPYVDSVDALVAAW 540
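
The percentages on each HSP statistics line are the match count divided by the aligned length, which includes gap columns. A small sketch for recomputing them from a statistics line; `hsp_fractions` is a hypothetical helper, and the regular expression only assumes the `Label = a/b` layout seen in this report.

```python
import re


def hsp_fractions(stats_line: str) -> dict:
    """Return {label: (matches, aligned_length, percent)} for each 'Label = a/b' field."""
    out = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        matches, length = int(num), int(den)
        out[label] = (matches, length, round(100 * matches / length, 2))
    return out


line = "Identity = 406/600 (67.67%), Positives = 443/600 (73.83%), Query Frame = 0"
stats = hsp_fractions(line)
print(stats)
```

For the first hit this reproduces 67.67% identity and 73.83% positives over 600 aligned columns. The query protein is shorter than the aligned length because the alignment contains gap columns.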

BLAST of Sgr011838 vs. NCBI nr
Match: XP_008443733.1 (PREDICTED: beta-glucosidase BoGH3B-like [Cucumis melo] >KAA0038317.1 beta-glucosidase BoGH3B-like [Cucumis melo var. makuwa])

HSP 1 Score: 746.1 bits (1925), Expect = 2.0e-211
Identity = 406/637 (63.74%), Positives = 443/637 (69.54%), Query Frame = 0

Query: 1   MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIER 60
           MAK + IL+GLLL+CF E   KAE L+Y+DPKQPL  RI DLLGRMTLEEKIGQM QIER
Sbjct: 1   MAKAINILIGLLLLCFFETWAKAENLKYKDPKQPLNVRIKDLLGRMTLEEKIGQMTQIER 60

Query: 61  TNASTTIMKK-------------------------------------------------- 120
            NAST +MKK                                                  
Sbjct: 61  VNASTDVMKKYFIGSVLSGGGSVPSKEASAQDWVQMVNEIQQGALSTRLGIPMIYGIDAV 120

Query: 121 ----------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWG 180
                                 DPQL+K+IG A+ALE+RATGIPYAFAPC+AVCRDPRWG
Sbjct: 121 HGHNNVYNATIFPHNIGLGATRDPQLLKRIGEASALEIRATGIPYAFAPCIAVCRDPRWG 180

Query: 181 RCYESYSEDPKIVQAMTEIISGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKG 240
           RCYESY EDPK+VQ MTEII GLQGEIPPNSRKGVP+VAGKEKV ACAKH+VGDGGTTKG
Sbjct: 181 RCYESYGEDPKLVQEMTEIIPGLQGEIPPNSRKGVPYVAGKEKVVACAKHYVGDGGTTKG 240

Query: 241 INENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH 300
           I+ENNTVIDRHGLLSIHMPGYY+SIIKGVAT+MVSYSSWNG KMHANK L+TDFLKNTLH
Sbjct: 241 IDENNTVIDRHGLLSIHMPGYYHSIIKGVATVMVSYSSWNGVKMHANKELVTDFLKNTLH 300

Query: 301 FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLN----IYGL--------- 360
           FQGFVISDWQ IDRIT PPHANYTYSILASVTAG+DM+ +  N    I GL         
Sbjct: 301 FQGFVISDWQAIDRITDPPHANYTYSILASVTAGLDMIMVPYNYTEFIDGLTYLVNNNFI 360

Query: 361 ---------KRILRVKFVMGLFENSLADLSLVNELGKKEHRELAREAVRKSLVLLKNGES 420
                    KRILRVKF+MGLFEN +ADLSLVNELGK+EHRELAREAVRKSLVLLKNG+S
Sbjct: 361 PITRIDDAVKRILRVKFIMGLFENPIADLSLVNELGKQEHRELAREAVRKSLVLLKNGKS 420

Query: 421 ADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTV 480
           ADKPLLPL KK QK  +LVA     ++    G W        T+    +  N L  GTTV
Sbjct: 421 ADKPLLPLEKKTQK--ILVAGSHADNLGYQCGGW--------TIEWQGLSGNNLTSGTTV 480

Query: 481 LAAIKDTIDPETEVIFKENPDKEFFQSHKFSYAIVVVGEYPYAE---------------- 513
           L AIKDT+DP TEVIF ENPDK F QS  FSYAIVVVGE+PYAE                
Sbjct: 481 LDAIKDTVDPSTEVIFNENPDKGFLQSGTFSYAIVVVGEHPYAEMMGDSLNLTIPDPGPS 540

BLAST of Sgr011838 vs. NCBI nr
Match: XP_022155346.1 (uncharacterized protein LOC111022483 [Momordica charantia])

HSP 1 Score: 744.2 bits (1920), Expect = 7.5e-211
Identity = 401/635 (63.15%), Positives = 445/635 (70.08%), Query Frame = 0

Query: 1   MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIER 60
           MAK+ I LMG+LL+CFSEAL K  YLRY+DPKQPL  RI DLLGRMTLEEKIGQMVQI+R
Sbjct: 1   MAKIPIFLMGVLLLCFSEALAKPHYLRYKDPKQPLNVRIRDLLGRMTLEEKIGQMVQIDR 60

Query: 61  TNASTTIMKK-------------------------------------------------- 120
           T AS  +MKK                                                  
Sbjct: 61  TVASKEVMKKYLIGSVLSGGGSVPAKEASPKVWIDMVNEFQEGCLSTRLGIPMIYGIDAV 120

Query: 121 ----------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWG 180
                                 DP+LVK+IGVATALEVRATGI Y FAPC+AVCRDPRWG
Sbjct: 121 HGHNNVYKATIFPHNVGLGATRDPELVKRIGVATALEVRATGISYVFAPCIAVCRDPRWG 180

Query: 181 RCYESYSEDPKIVQAMTEIISGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKG 240
           RCYESYSEDPKIVQ+MTEIISGLQGEIP NSRKGVP+VAG+EKVAACAKHFVGDGGTTKG
Sbjct: 181 RCYESYSEDPKIVQSMTEIISGLQGEIPANSRKGVPYVAGREKVAACAKHFVGDGGTTKG 240

Query: 241 INENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH 300
           INENNTVI+RHGLLS HMPGYYNSIIKGV+TIM+SYSSWNG+KMHAN+ LITDFLKNTL 
Sbjct: 241 INENNTVINRHGLLSTHMPGYYNSIIKGVSTIMISYSSWNGKKMHANRELITDFLKNTLR 300

Query: 301 FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLN----IYGL--------- 360
           F+GFVISDWQGIDRIT+PPHANYTYSI+  VTAGIDM+ +  N    I GL         
Sbjct: 301 FRGFVISDWQGIDRITSPPHANYTYSIITGVTAGIDMIMVPFNYTEFIDGLTYLVKTNVI 360

Query: 361 ---------KRILRVKFVMGLFENSLADLSLVNELGKKEHRELAREAVRKSLVLLKNGES 420
                    KRILRVKF+MGLFEN LAD S +++LGKKEHRELAREAVRKSLVLLKNGES
Sbjct: 361 PMSRIDDAVKRILRVKFIMGLFENPLADQSFIHQLGKKEHRELAREAVRKSLVLLKNGES 420

Query: 421 ADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTV 480
           ADKP+LPLPKKA K  +LVA     ++    G W        T+    +  N L  GTT+
Sbjct: 421 ADKPVLPLPKKAPK--ILVAGSHANNLGFQCGGW--------TIEWQGLGGNNLTTGTTI 480

Query: 481 LAAIKDTIDPETEVIFKENPDKEFFQSHKFSYAIVVVGEYPYAET--------------- 511
           L+AIKDT+DP+TEV+F ENPD EF +S+KFSYAIVVVGE+PYAET               
Sbjct: 481 LSAIKDTVDPKTEVVFDENPDAEFVKSNKFSYAIVVVGEHPYAETFGDSLNLTIAEPGPS 540

BLAST of Sgr011838 vs. NCBI nr
Match: XP_038905524.1 (LOW QUALITY PROTEIN: beta-glucosidase BoGH3B-like [Benincasa hispida])

HSP 1 Score: 735.7 bits (1898), Expect = 2.7e-208
Identity = 396/635 (62.36%), Positives = 443/635 (69.76%), Query Frame = 0

Query: 1   MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIER 60
           MA+VLI L+GLL +CFSE L +AEYL+Y+DPKQPL  RI DLLGRMT EEKIGQMVQIER
Sbjct: 1   MARVLITLVGLLFLCFSETLARAEYLKYKDPKQPLNVRIKDLLGRMTFEEKIGQMVQIER 60

Query: 61  TNASTTIMKK-------------------------------------------------- 120
            NA+  +M+K                                                  
Sbjct: 61  VNATFEVMQKYFIGSVLSGGGSVPSKKASAKDWVHMVNKIQKGALSTRLGIPMIYGVDAV 120

Query: 121 ----------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWG 180
                                 DPQLVK+ G+ATALEVRATGIPY FAPC+AVCRDPRWG
Sbjct: 121 HGHNNVYKATIFPHNIGLGATRDPQLVKRXGIATALEVRATGIPYTFAPCIAVCRDPRWG 180

Query: 181 RCYESYSEDPKIVQAMTEIISGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKG 240
           RCYESY EDPKI+QAM EII GLQG+IPPNSRKGVP+VAGK+ VAACAKHFVGDGGTTKG
Sbjct: 181 RCYESYGEDPKIIQAMXEIILGLQGDIPPNSRKGVPYVAGKKNVAACAKHFVGDGGTTKG 240

Query: 241 INENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH 300
           INENNTVIDRH LLSIHMPGYYNSIIKGVAT+MVSYSS NGEKMHAN+NL+T+FLKNTL+
Sbjct: 241 INENNTVIDRHSLLSIHMPGYYNSIIKGVATLMVSYSSVNGEKMHANQNLVTNFLKNTLN 300

Query: 301 FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLN----IYGL--------- 360
           F+GFVISDWQGID+IT+PPH+NYTYSI+ASV AG+DM+ +  N    I GL         
Sbjct: 301 FRGFVISDWQGIDKITSPPHSNYTYSIMASVNAGVDMIMVPYNYTEFIDGLTYLVKNNAI 360

Query: 361 ---------KRILRVKFVMGLFENSLADLSLVNELGKKEHRELAREAVRKSLVLLKNGES 420
                    KRILRVKF+MGLFEN LADLSL+NELGK+EHRELAREAVRKSLVLLKNG+ 
Sbjct: 361 PISRIDDAVKRILRVKFIMGLFENPLADLSLINELGKQEHRELAREAVRKSLVLLKNGKF 420

Query: 421 ADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTV 480
            ++PLLPLPKKA K  +LVA     ++    G W        T+       N L +GT +
Sbjct: 421 PNQPLLPLPKKAPK--ILVAGSHANNLGNQCGGW--------TMEWQGFSGNNLTIGTPI 480

Query: 481 LAAIKDTIDPETEVIFKENPDKEFFQSHKFSYAIVVVGEYPYAETN-------------- 511
           LAAIKDT+DPET+VIF+ENP  EF +SH FSYAIVVVGE PYAETN              
Sbjct: 481 LAAIKDTVDPETKVIFEENPSVEFLKSHDFSYAIVVVGENPYAETNGDSLNLTIPHPGPE 540

BLAST of Sgr011838 vs. NCBI nr
Match: XP_011648555.1 (uncharacterized protein LOC101211593 [Cucumis sativus] >KGN66708.1 hypothetical protein Csa_006895 [Cucumis sativus])

HSP 1 Score: 733.0 bits (1891), Expect = 1.7e-207
Identity = 406/635 (63.94%), Positives = 439/635 (69.13%), Query Frame = 0

Query: 1   MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIER 60
           MAK  IIL+ LLLIC  E   KAE  +Y+DP Q L  RI DLLGRMTLEEKIGQMVQIER
Sbjct: 2   MAKA-IILIALLLICCFETGAKAENFKYKDPTQRLNVRIKDLLGRMTLEEKIGQMVQIER 61

Query: 61  TNASTTIMKK-------------------------------------------------- 120
            NAST +MKK                                                  
Sbjct: 62  VNASTEVMKKYFIGSVLSGGGSVPSKQASAQDWINMVNEIQKGALSTRLGIPMIYGIDAV 121

Query: 121 ----------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWG 180
                                 DPQL+K+IGVA+A E+RATGIPYAFAPCVAVCRDPRWG
Sbjct: 122 HGHNNVYNATIFPHNIGLGATRDPQLLKRIGVASAREIRATGIPYAFAPCVAVCRDPRWG 181

Query: 181 RCYESYSEDPKIVQAMTEIISGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKG 240
           RCYESY EDPKIVQ MTEII GLQGEIPPNSRKGVP+VAGKE V ACAKH+VGDGGTTKG
Sbjct: 182 RCYESYGEDPKIVQEMTEIIPGLQGEIPPNSRKGVPYVAGKENVVACAKHYVGDGGTTKG 241

Query: 241 INENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH 300
           I+ENNTVIDRHGLLSIHMPGYY+SIIKGVATIMVSYSSWNGEKMHANKNL+TDFLKNTLH
Sbjct: 242 IDENNTVIDRHGLLSIHMPGYYHSIIKGVATIMVSYSSWNGEKMHANKNLVTDFLKNTLH 301

Query: 301 FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLN----IYGL--------- 360
           FQGFVISDW+ IDRIT PPHANYTYSILAS+TAG+DM+ I  N    I GL         
Sbjct: 302 FQGFVISDWEAIDRITDPPHANYTYSILASITAGLDMIMIPYNYPEFIDGLTNLVKSNYI 361

Query: 361 ---------KRILRVKFVMGLFENSLADLSLVNELGKKEHRELAREAVRKSLVLLKNGES 420
                    KRILRVKFVMGLFEN +ADLSLVNELGK+EHRELAREAVRKSLVLLKNG+S
Sbjct: 362 PISRIDDAVKRILRVKFVMGLFENPIADLSLVNELGKQEHRELAREAVRKSLVLLKNGKS 421

Query: 421 ADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTV 480
           ADKPLLPL KK QK  +LVA     ++    G W        T+    +  N L  GTTV
Sbjct: 422 ADKPLLPLEKKTQK--ILVAGSHANNLGYQCGGW--------TIEWQGLSGNNLTSGTTV 481

Query: 481 LAAIKDTIDPETEVIFKENPDKEFFQSHKFSYAIVVVGEYPYAETN-------------- 511
           L AIKDT+DP TEVIF ENPDK+  QS  FSYAIVVVGE+PYAE N              
Sbjct: 482 LDAIKDTVDPTTEVIFNENPDKKSLQSDTFSYAIVVVGEHPYAELNGDSLNLTIPDPGPN 541

BLAST of Sgr011838 vs. ExPASy Swiss-Prot
Match: T2KMH0 (Beta-xylosidase OS=Formosa agariphila (strain DSM 15362 / KCTC 12365 / LMG 23005 / KMM 3901) OX=1347342 GN=BN863_22130 PE=1 SV=1)

HSP 1 Score: 148.3 bits (373), Expect = 2.4e-34
Identity = 160/626 (25.56%), Positives = 252/626 (40.26%), Query Frame = 0

Query: 8   LMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTN----- 67
           LMGLLL  F   + +       +  + +  ++  L+ +MTL+EKI +M Q    N     
Sbjct: 6   LMGLLLASFFTTVAQNNAQTKSNSDEEIDKKVATLISQMTLDEKIAEMTQDAPANERLGI 65

Query: 68  -------------------ASTTIMKK--------DPQLVKKIGVATALEVRATGIPYAF 127
                               +TT+  +        +P+L+KK+   TA E RA G+ + +
Sbjct: 66  PSMKYGEALHGLWLVLDYYGNTTVYPQAVAAASTWEPELIKKMASQTAREARALGVTHCY 125

Query: 128 APCVAV-CRDPRWGRCYESYSEDPKIVQAM-TEIISGLQGEIPPNSRKGVPHVAGKEK-- 187
           +P + V   D R+GR  ESY EDP +V  M    I GLQG              G+E+  
Sbjct: 126 SPNLDVYAGDARYGRVEESYGEDPYLVSRMGVAFIEGLQG-------------TGEEQFD 185

Query: 188 ---VAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIK-GVATIMVSYSSW 247
              V A AKHFVG     +GIN   + +    L  +++P +  ++ + GV ++M  +  +
Sbjct: 186 ENHVIATAKHFVGYPENRRGINGGFSDMSERRLREVYLPPFEAAVKEAGVGSVMPGHQDF 245

Query: 248 NGEKMHANKNLITDFLKNTLHFQGFVISDWQGIDRITTPPH--ANYTYSILASVTAGIDM 307
           NG   H N  L+ D L++ L F GF++SD   + R+ T      N T + +  + AG+DM
Sbjct: 246 NGVPCHMNTWLLKDILRDELGFDGFIVSDNNDVGRLETMHFIAENRTEAAILGLKAGVDM 305

Query: 308 VGIV----------LNI----------------YGLKRILRVKFVMGLFENSLADLSLVN 367
             ++           NI                    RIL  K+ +GLF+     +    
Sbjct: 306 DLVIGKNVELATYHTNILKDTILKNPALMKYIDQATSRILTAKYKLGLFDAKPKKIDTET 365

Query: 368 -ELGKKEHRELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVG 427
            E G  EHRE A E   KS+++LKN    D  LLPL     K L ++      +      
Sbjct: 366 VETGTDEHREFALELAEKSIIMLKN----DNNLLPLDVSKIKSLAVIGPNAHEE------ 425

Query: 428 LWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENPDKEFFQSHKFSY 487
               +    T  LL   Y        +VL  +K  +    ++ + +  D + F    F  
Sbjct: 426 ----RPKKGTYKLLGG-YSGLPPYYVSVLDGLKKKVGEHVKINYAKGCDIDSFSKEGFPE 485

Query: 488 AI----------VVVGE----------------YPYAE-----------------TNGRP 507
           AI          +VVG                 Y   +                  NGRP
Sbjct: 486 AISAAKNSDAVVLVVGSSHKTCGEGGDRADLDLYGVQKELVEAIHKTGKPVIVVLINGRP 545

BLAST of Sgr011838 vs. ExPASy Swiss-Prot
Match: A7LXU3 (Beta-glucosidase BoGH3B OS=Bacteroides ovatus (strain ATCC 8483 / DSM 1896 / JCM 5824 / NCTC 11153) OX=411476 GN=BACOVA_02659 PE=1 SV=1)

HSP 1 Score: 145.2 bits (365), Expect = 2.0e-33
Identity = 130/516 (25.19%), Positives = 214/516 (41.47%), Query Frame = 0

Query: 73  QLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAM-TEIISGL 132
           +L ++    +A E +A  IP+ FAP V + RDPRW R +E+Y ED  +   M    + G 
Sbjct: 158 ELTRRGAKISAYETKAGCIPWTFAPVVDLGRDPRWARMWENYGEDCYVNAEMGVSAVKGF 217

Query: 133 QGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYN 192
           QGE         P+  G+  VAAC KH++G G    G +   + I R  +   H   +  
Sbjct: 218 QGE--------DPNRIGEYNVAACMKHYMGYGVPVSGKDRTPSSISRSDMREKHFAPFLA 277

Query: 193 SIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQGFVISDWQGIDRITTPPH--A 252
           ++ +G  ++MV+    NG   HAN+ L+T++LK  L++ G +++DW  I+ + T  H  A
Sbjct: 278 AVRQGALSVMVNSGVDNGLPFHANRELLTEWLKEDLNWDGLIVTDWADINNLCTRDHIAA 337

Query: 253 NYTYSILASVTAGIDMVGIVLNIY---------------------GLKRILRVKFVMGLF 312
               ++   + AGIDM  +   +                       + R+LR+K+ +GLF
Sbjct: 338 TKKEAVKIVINAGIDMSMVPYEVSFCDYLKELVEEGEVSMERIDDAVARVLRLKYRLGLF 397

Query: 313 ENSLADLSLVNELGKKEHRELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQ 372
           ++   D+   ++ G KE   +A +A  +S VLLKN    D  +LP+ K  +  L      
Sbjct: 398 DHPYWDIKKYDKFGSKEFAAVALQAAEESEVLLKN----DGNILPIAKGKKILLTGPNAN 457

Query: 373 TTLDINVAVGL-WNGKDLVATTLLLHNIY---------INFLLLGTTVLAAIKDTIDPET 432
           +   +N      W G          H IY          N +       A+ K+    E 
Sbjct: 458 SMRCLNGGWSYSWQGHVADEYAQAYHTIYEALCEKYGKENIIYEPGVTYASYKNDNWWEE 517

Query: 433 EVIFKENPDKEFFQSHKFSYAIVVVGEYPYAET--------------------------- 492
               K   +K    + +    I  +GE  Y ET                           
Sbjct: 518 N---KPETEKPVAAAAQADIIITCIGENSYCETPGNLTDLTLSENQRNLVKALAATGKPI 577

Query: 493 -----NGRPVVIQPYIASIDALVAAWLPGT-EGKGITDVLFGDYGFTGKLSQTW------ 507
                 GRP +I   +    A+V   LP    G  + ++L GD  F+GK+  T+      
Sbjct: 578 VLVLNQGRPRIINDIVPLAKAVVNIMLPSNYGGDALANLLAGDANFSGKMPFTYPRLINA 637

BLAST of Sgr011838 vs. ExPASy Swiss-Prot
Match: Q23892 (Lysosomal beta glucosidase OS=Dictyostelium discoideum OX=44689 GN=gluA PE=1 SV=2)

HSP 1 Score: 132.9 bits (333), Expect = 1.0e-29
Identity = 134/504 (26.59%), Positives = 213/504 (42.26%), Query Frame = 0

Query: 82  TALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAM-TEIISGLQGEIPPNSR 141
           T+ +  A GIP+ FAP + +   P W R YE++ EDP +   M    + G QG    NS 
Sbjct: 212 TSKDTVAVGIPWVFAPVLGIGVQPLWSRIYETFGEDPYVASMMGAAAVRGFQG--GNNSF 271

Query: 142 KGVPHVAGKEKVAAC-AKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSII-KGVA 201
            G  +       A C AKH+ G    T G +     I    L    +P +  +I   G  
Sbjct: 272 DGPINAPS----AVCTAKHYFGYSDPTSGKDRTAAWIPERMLRRYFLPSFAEAITGAGAG 331

Query: 202 TIMVSYSSWNGEKMHANKNLITDFLKNTLHFQGFVISDWQGIDRITTPPH--ANYTYSIL 261
           TIM++    NG  MH +   +T+ L+  L F+G  ++DWQ I+++    H   +   +IL
Sbjct: 332 TIMINSGEVNGVPMHTSYKYLTEVLRGELQFEGVAVTDWQDIEKLVYFHHTAGSAEEAIL 391

Query: 262 ASVTAGIDMVGIVLNI---------------------YGLKRILRVKFVMGLFENSL--A 321
            ++ AGIDM  + L++                       ++RIL +K+ +GLF N     
Sbjct: 392 QALDAGIDMSMVPLDLSFPIILAEMVAAGTVPESRLDLSVRRILNLKYALGLFSNPYPNP 451

Query: 322 DLSLVNELGKKEHRELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDI 381
           + ++V+ +G+ + RE A     +S+ LL+N  +    +LPL     K +LL         
Sbjct: 452 NAAIVDTIGQVQDREAAAATAEESITLLQNKNN----ILPLNTNTIKNVLLTGPSADSIR 511

Query: 382 NVAVGL---WNG----KDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENP 441
           N+  G    W G     +    T +L  +     +   T    I+ TI  E  V   +  
Sbjct: 512 NLNGGWSVHWQGAYEDSEFPFGTSILTGLR---EITNDTADFNIQYTIGHEIGVPTNQTS 571

Query: 442 -DKEFFQSHKFSYAIVVVGEYPYAETNG-------------------------------- 501
            D+    +      +VV+GE P AET G                                
Sbjct: 572 IDEAVELAQSSDVVVVVIGELPEAETPGDIYDLSMDPNEVLLLQQLVDTGKPVVLILVEA 631

Query: 502 RPVVIQP-YIASIDALVAAWLPGTE-GKGITDVLFGDYGFTGKLSQTWFKTVDQLPMNFG 507
           RP ++ P  + S  A++ A+LPG+E GK I ++L G+   +G+L  T+  T   +    G
Sbjct: 632 RPRILPPDLVYSCAAVLMAYLPGSEGGKPIANILMGNVNPSGRLPLTYPGTTGDI----G 691

BLAST of Sgr011838 vs. ExPASy Swiss-Prot
Match: Q56078 (Periplasmic beta-glucosidase OS=Salmonella typhimurium (strain LT2 / SGSC1412 / ATCC 700720) OX=99287 GN=bglX PE=3 SV=2)

HSP 1 Score: 126.7 bits (317), Expect = 7.5e-28
Identity = 133/530 (25.09%), Positives = 214/530 (40.38%), Query Frame = 0

Query: 75  VKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTE-IISGLQG 134
           V+ +G  +A E    G+   +AP V V RDPRWGR  E + ED  +   M E ++  +QG
Sbjct: 135 VRTVGRVSAYEAADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSIMGETMVKAMQG 194

Query: 135 EIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSI 194
           + P          A +  V    KHF   G    G   N   +    L + +MP Y   +
Sbjct: 195 KSP----------ADRYSVMTSVKHFAAYGAVEGGKEYNTVDMSSQRLFNDYMPPYKAGL 254

Query: 195 IKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQGFVISDWQGI-DRITTPPHANYT 254
             G   +MV+ +S NG    ++  L+ D L++   F+G  +SD   I + I     A+  
Sbjct: 255 DAGSGAVMVALNSLNGTPATSDSWLLKDVLRDEWGFKGITVSDHGAIKELIKHGTAADPE 314

Query: 255 YSILASVTAGIDM-------------------VGIVLNIYGLKRILRVKFVMGLFENSLA 314
            ++  ++ AG+DM                   V +       + +L VK+ MGLF +  +
Sbjct: 315 DAVRVALKAGVDMSMADEYYSKYLPGLIKSGKVTMAELDDATRHVLNVKYDMGLFNDPYS 374

Query: 315 DLS------LVNELGKKEHRELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAM 374
            L       +      + HR+ ARE  R+S+VLLKN        LPL K     ++    
Sbjct: 375 HLGPKESDPVDTNAESRLHRKEAREVARESVVLLKNRLET----LPLKKSGTIAVVGPLA 434

Query: 375 QTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVI------ 434
            +  D+   +G W+   +   ++ +     N +  G  +L A    I  +  ++      
Sbjct: 435 DSQRDV---MGSWSAAGVANQSVTVLAGIQNAVGDGAKILYAKGANITNDKGIVDFLNLY 494

Query: 435 ---FKENP-------DKEFFQSHKFSYAIVVVGE---------------YPYAE------ 494
               K +P       D+    + +    + VVGE                P ++      
Sbjct: 495 EEAVKIDPRSPQAMIDEAVQAAKQADVVVAVVGESQGMAHEASSRTNITIPQSQRDLITA 554

Query: 495 ------------TNGRPVVIQPYIASIDALVAAWLPGTE-GKGITDVLFGDYGFTGKLSQ 507
                        NGRP+ +       DA++  W  GTE G  I DVLFGDY  +GKL  
Sbjct: 555 LKATGKPLVLVLMNGRPLALVKEDQQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPI 614

BLAST of Sgr011838 vs. ExPASy Swiss-Prot
Match: P33363 (Periplasmic beta-glucosidase OS=Escherichia coli (strain K12) OX=83333 GN=bglX PE=3 SV=2)

HSP 1 Score: 125.6 bits (314), Expect = 1.7e-27
Identity = 136/544 (25.00%), Positives = 215/544 (39.52%), Query Frame = 0

Query: 75  VKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTE-IISGLQG 134
           VK +G  +A E    G+   +AP V V RDPRWGR  E + ED  +   M + ++  +QG
Sbjct: 135 VKTVGRVSAYEAADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSTMGKTMVEAMQG 194

Query: 135 EIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSI 194
           + P          A +  V    KHF   G    G   N   +    L + +MP Y   +
Sbjct: 195 KSP----------ADRYSVMTSVKHFAAYGAVEGGKEYNTVDMSPQRLFNDYMPPYKAGL 254

Query: 195 IKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQGFVISDWQGI-DRITTPPHANYT 254
             G   +MV+ +S NG    ++  L+ D L++   F+G  +SD   I + I     A+  
Sbjct: 255 DAGSGAVMVALNSLNGTPATSDSWLLKDVLRDQWGFKGITVSDHGAIKELIKHGTAADPE 314

Query: 255 YSILASVTAGIDM-------------------VGIVLNIYGLKRILRVKFVMGLFENSLA 314
            ++  ++ +GI+M                   V +       + +L VK+ MGLF +  +
Sbjct: 315 DAVRVALKSGINMSMSDEYYSKYLPGLIKSGKVTMAELDDAARHVLNVKYDMGLFNDPYS 374

Query: 315 DLS------LVNELGKKEHRELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAM 374
            L       +      + HR+ ARE  R+SLVLLKN        LPL K A   ++    
Sbjct: 375 HLGPKESDPVDTNAESRLHRKEAREVARESLVLLKNRLET----LPLKKSATIAVVGPLA 434

Query: 375 QTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENPD 434
            +  D+   +G W+   +   ++              TVL  IK+ +    +V++ +  +
Sbjct: 435 DSKRDV---MGSWSAAGVADQSV--------------TVLTGIKNAVGENGKVLYAKGAN 494

Query: 435 -----------------------------KEFFQSHKFSYAIV-VVGE------------ 494
                                         E  Q+ K S  +V VVGE            
Sbjct: 495 VTSDKGIIDFLNQYEEAVKVDPRSPQEMIDEAVQTAKQSDVVVAVVGEAQGMAHEASSRT 554

Query: 495 ---YPYAE------------------TNGRPVVIQPYIASIDALVAAWLPGTE-GKGITD 507
               P ++                   NGRP+ +       DA++  W  GTE G  I D
Sbjct: 555 DITIPQSQRDLIAALKATGKPLVLVLMNGRPLALVKEDQQADAILETWFAGTEGGNAIAD 614

BLAST of Sgr011838 vs. ExPASy TrEMBL
Match: A0A5D3D8S5 (Beta-glucosidase BoGH3B-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold811G00830 PE=3 SV=1)

HSP 1 Score: 760.4 bits (1962), Expect = 4.9e-216
Identity = 406/600 (67.67%), Positives = 443/600 (73.83%), Query Frame = 0

Query: 1   MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIER 60
           MAK + IL+GLLL+CF E   KAE L+Y+DPKQPL  RI DLLGRMTLEEKIGQM QIER
Sbjct: 1   MAKAINILIGLLLLCFFETWAKAENLKYKDPKQPLNVRIKDLLGRMTLEEKIGQMTQIER 60

Query: 61  TNASTTIMKK-----------------------------------DPQLVKKIGVATALE 120
            NAST +MKK                                   DPQL+K+IG A+ALE
Sbjct: 61  VNASTDVMKKYFIGSVLSGGGSVPSKEASAQDWVQMVNEIQQGALDPQLLKRIGEASALE 120

Query: 121 VRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMTEIISGLQGEIPPNSRKGVPH 180
           +RATGIPYAFAPC+AVCRDPRWGRCYESY EDPK+VQ MTEII GLQGEIPPNSRKGVP+
Sbjct: 121 IRATGIPYAFAPCIAVCRDPRWGRCYESYGEDPKLVQEMTEIIPGLQGEIPPNSRKGVPY 180

Query: 181 VAGKEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYS 240
           VAGKEKV ACAKH+VGDGGTTKGI+ENNTVIDRHGLLSIHMPGYY+SIIKGVAT+MVSYS
Sbjct: 181 VAGKEKVVACAKHYVGDGGTTKGIDENNTVIDRHGLLSIHMPGYYHSIIKGVATVMVSYS 240

Query: 241 SWNGEKMHANKNLITDFLKNTLHFQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDM 300
           SWNG KMHANK L+TDFLKNTLHFQGFVISDWQ IDRIT PPHANYTYSILASVTAG+DM
Sbjct: 241 SWNGVKMHANKELVTDFLKNTLHFQGFVISDWQAIDRITDPPHANYTYSILASVTAGLDM 300

Query: 301 VGIVLN----IYGL------------------KRILRVKFVMGLFENSLADLSLVNELGK 360
           + +  N    I GL                  KRILRVKF+MGLFEN +ADLSLVNELGK
Sbjct: 301 IMVPYNYTEFIDGLTYLVNNNFIPITRIDDAVKRILRVKFIMGLFENPIADLSLVNELGK 360

Query: 361 KEHRELAREAVRKSLVLLKNGESADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGK 420
           +EHRELAREAVRKSLVLLKNG+SADKPLLPL KK QK  +LVA     ++    G W   
Sbjct: 361 QEHRELAREAVRKSLVLLKNGKSADKPLLPLEKKTQK--ILVAGSHADNLGYQCGGW--- 420

Query: 421 DLVATTLLLHNIYINFLLLGTTVLAAIKDTIDPETEVIFKENPDKEFFQSHKFSYAIVVV 480
                T+    +  N L  GTTVL AIKDT+DP TEVIF ENPDK F QS  FSYAIVVV
Sbjct: 421 -----TIEWQGLSGNNLTSGTTVLDAIKDTVDPSTEVIFNENPDKGFLQSGTFSYAIVVV 480

Query: 481 GEYPYAE-------------------------------TNGRPVVIQPYIASIDALVAAW 513
           GE+PYAE                                +GRPVVIQPY+ S+DALVAAW
Sbjct: 481 GEHPYAEMMGDSLNLTIPDPGPSTITNVCGVIKCVVVIISGRPVVIQPYVDSVDALVAAW 540

BLAST of Sgr011838 vs. ExPASy TrEMBL
Match: A0A5A7T9L3 (Beta-glucosidase BoGH3B-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold270G001870 PE=3 SV=1)

HSP 1 Score: 746.1 bits (1925), Expect = 9.6e-212
Identity = 406/637 (63.74%), Positives = 443/637 (69.54%), Query Frame = 0

Query: 1   MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIER 60
           MAK + IL+GLLL+CF E   KAE L+Y+DPKQPL  RI DLLGRMTLEEKIGQM QIER
Sbjct: 1   MAKAINILIGLLLLCFFETWAKAENLKYKDPKQPLNVRIKDLLGRMTLEEKIGQMTQIER 60

Query: 61  TNASTTIMKK-------------------------------------------------- 120
            NAST +MKK                                                  
Sbjct: 61  VNASTDVMKKYFIGSVLSGGGSVPSKEASAQDWVQMVNEIQQGALSTRLGIPMIYGIDAV 120

Query: 121 ----------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWG 180
                                 DPQL+K+IG A+ALE+RATGIPYAFAPC+AVCRDPRWG
Sbjct: 121 HGHNNVYNATIFPHNIGLGATRDPQLLKRIGEASALEIRATGIPYAFAPCIAVCRDPRWG 180

Query: 181 RCYESYSEDPKIVQAMTEIISGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKG 240
           RCYESY EDPK+VQ MTEII GLQGEIPPNSRKGVP+VAGKEKV ACAKH+VGDGGTTKG
Sbjct: 181 RCYESYGEDPKLVQEMTEIIPGLQGEIPPNSRKGVPYVAGKEKVVACAKHYVGDGGTTKG 240

Query: 241 INENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH 300
           I+ENNTVIDRHGLLSIHMPGYY+SIIKGVAT+MVSYSSWNG KMHANK L+TDFLKNTLH
Sbjct: 241 IDENNTVIDRHGLLSIHMPGYYHSIIKGVATVMVSYSSWNGVKMHANKELVTDFLKNTLH 300

Query: 301 FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLN----IYGL--------- 360
           FQGFVISDWQ IDRIT PPHANYTYSILASVTAG+DM+ +  N    I GL         
Sbjct: 301 FQGFVISDWQAIDRITDPPHANYTYSILASVTAGLDMIMVPYNYTEFIDGLTYLVNNNFI 360

Query: 361 ---------KRILRVKFVMGLFENSLADLSLVNELGKKEHRELAREAVRKSLVLLKNGES 420
                    KRILRVKF+MGLFEN +ADLSLVNELGK+EHRELAREAVRKSLVLLKNG+S
Sbjct: 361 PITRIDDAVKRILRVKFIMGLFENPIADLSLVNELGKQEHRELAREAVRKSLVLLKNGKS 420

Query: 421 ADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTV 480
           ADKPLLPL KK QK  +LVA     ++    G W        T+    +  N L  GTTV
Sbjct: 421 ADKPLLPLEKKTQK--ILVAGSHADNLGYQCGGW--------TIEWQGLSGNNLTSGTTV 480

Query: 481 LAAIKDTIDPETEVIFKENPDKEFFQSHKFSYAIVVVGEYPYAE---------------- 513
           L AIKDT+DP TEVIF ENPDK F QS  FSYAIVVVGE+PYAE                
Sbjct: 481 LDAIKDTVDPSTEVIFNENPDKGFLQSGTFSYAIVVVGEHPYAEMMGDSLNLTIPDPGPS 540

BLAST of Sgr011838 vs. ExPASy TrEMBL
Match: A0A1S3B892 (beta-glucosidase BoGH3B-like OS=Cucumis melo OX=3656 GN=LOC103487249 PE=3 SV=1)

HSP 1 Score: 746.1 bits (1925), Expect = 9.6e-212
Identity = 406/637 (63.74%), Positives = 443/637 (69.54%), Query Frame = 0

Query: 1   MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIER 60
           MAK + IL+GLLL+CF E   KAE L+Y+DPKQPL  RI DLLGRMTLEEKIGQM QIER
Sbjct: 1   MAKAINILIGLLLLCFFETWAKAENLKYKDPKQPLNVRIKDLLGRMTLEEKIGQMTQIER 60

Query: 61  TNASTTIMKK-------------------------------------------------- 120
            NAST +MKK                                                  
Sbjct: 61  VNASTDVMKKYFIGSVLSGGGSVPSKEASAQDWVQMVNEIQQGALSTRLGIPMIYGIDAV 120

Query: 121 ----------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWG 180
                                 DPQL+K+IG A+ALE+RATGIPYAFAPC+AVCRDPRWG
Sbjct: 121 HGHNNVYNATIFPHNIGLGATRDPQLLKRIGEASALEIRATGIPYAFAPCIAVCRDPRWG 180

Query: 181 RCYESYSEDPKIVQAMTEIISGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKG 240
           RCYESY EDPK+VQ MTEII GLQGEIPPNSRKGVP+VAGKEKV ACAKH+VGDGGTTKG
Sbjct: 181 RCYESYGEDPKLVQEMTEIIPGLQGEIPPNSRKGVPYVAGKEKVVACAKHYVGDGGTTKG 240

Query: 241 INENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH 300
           I+ENNTVIDRHGLLSIHMPGYY+SIIKGVAT+MVSYSSWNG KMHANK L+TDFLKNTLH
Sbjct: 241 IDENNTVIDRHGLLSIHMPGYYHSIIKGVATVMVSYSSWNGVKMHANKELVTDFLKNTLH 300

Query: 301 FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLN----IYGL--------- 360
           FQGFVISDWQ IDRIT PPHANYTYSILASVTAG+DM+ +  N    I GL         
Sbjct: 301 FQGFVISDWQAIDRITDPPHANYTYSILASVTAGLDMIMVPYNYTEFIDGLTYLVNNNFI 360

Query: 361 ---------KRILRVKFVMGLFENSLADLSLVNELGKKEHRELAREAVRKSLVLLKNGES 420
                    KRILRVKF+MGLFEN +ADLSLVNELGK+EHRELAREAVRKSLVLLKNG+S
Sbjct: 361 PITRIDDAVKRILRVKFIMGLFENPIADLSLVNELGKQEHRELAREAVRKSLVLLKNGKS 420

Query: 421 ADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTV 480
           ADKPLLPL KK QK  +LVA     ++    G W        T+    +  N L  GTTV
Sbjct: 421 ADKPLLPLEKKTQK--ILVAGSHADNLGYQCGGW--------TIEWQGLSGNNLTSGTTV 480

Query: 481 LAAIKDTIDPETEVIFKENPDKEFFQSHKFSYAIVVVGEYPYAE---------------- 513
           L AIKDT+DP TEVIF ENPDK F QS  FSYAIVVVGE+PYAE                
Sbjct: 481 LDAIKDTVDPSTEVIFNENPDKGFLQSGTFSYAIVVVGEHPYAEMMGDSLNLTIPDPGPS 540

BLAST of Sgr011838 vs. ExPASy TrEMBL
Match: A0A6J1DRG0 (uncharacterized protein LOC111022483 OS=Momordica charantia OX=3673 GN=LOC111022483 PE=3 SV=1)

HSP 1 Score: 744.2 bits (1920), Expect = 3.7e-211
Identity = 401/635 (63.15%), Positives = 445/635 (70.08%), Query Frame = 0

Query: 1   MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIER 60
           MAK+ I LMG+LL+CFSEAL K  YLRY+DPKQPL  RI DLLGRMTLEEKIGQMVQI+R
Sbjct: 1   MAKIPIFLMGVLLLCFSEALAKPHYLRYKDPKQPLNVRIRDLLGRMTLEEKIGQMVQIDR 60

Query: 61  TNASTTIMKK-------------------------------------------------- 120
           T AS  +MKK                                                  
Sbjct: 61  TVASKEVMKKYLIGSVLSGGGSVPAKEASPKVWIDMVNEFQEGCLSTRLGIPMIYGIDAV 120

Query: 121 ----------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWG 180
                                 DP+LVK+IGVATALEVRATGI Y FAPC+AVCRDPRWG
Sbjct: 121 HGHNNVYKATIFPHNVGLGATRDPELVKRIGVATALEVRATGISYVFAPCIAVCRDPRWG 180

Query: 181 RCYESYSEDPKIVQAMTEIISGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKG 240
           RCYESYSEDPKIVQ+MTEIISGLQGEIP NSRKGVP+VAG+EKVAACAKHFVGDGGTTKG
Sbjct: 181 RCYESYSEDPKIVQSMTEIISGLQGEIPANSRKGVPYVAGREKVAACAKHFVGDGGTTKG 240

Query: 241 INENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH 300
           INENNTVI+RHGLLS HMPGYYNSIIKGV+TIM+SYSSWNG+KMHAN+ LITDFLKNTL 
Sbjct: 241 INENNTVINRHGLLSTHMPGYYNSIIKGVSTIMISYSSWNGKKMHANRELITDFLKNTLR 300

Query: 301 FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLN----IYGL--------- 360
           F+GFVISDWQGIDRIT+PPHANYTYSI+  VTAGIDM+ +  N    I GL         
Sbjct: 301 FRGFVISDWQGIDRITSPPHANYTYSIITGVTAGIDMIMVPFNYTEFIDGLTYLVKTNVI 360

Query: 361 ---------KRILRVKFVMGLFENSLADLSLVNELGKKEHRELAREAVRKSLVLLKNGES 420
                    KRILRVKF+MGLFEN LAD S +++LGKKEHRELAREAVRKSLVLLKNGES
Sbjct: 361 PMSRIDDAVKRILRVKFIMGLFENPLADQSFIHQLGKKEHRELAREAVRKSLVLLKNGES 420

Query: 421 ADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTV 480
           ADKP+LPLPKKA K  +LVA     ++    G W        T+    +  N L  GTT+
Sbjct: 421 ADKPVLPLPKKAPK--ILVAGSHANNLGFQCGGW--------TIEWQGLGGNNLTTGTTI 480

Query: 481 LAAIKDTIDPETEVIFKENPDKEFFQSHKFSYAIVVVGEYPYAET--------------- 511
           L+AIKDT+DP+TEV+F ENPD EF +S+KFSYAIVVVGE+PYAET               
Sbjct: 481 LSAIKDTVDPKTEVVFDENPDAEFVKSNKFSYAIVVVGEHPYAETFGDSLNLTIAEPGPS 540

BLAST of Sgr011838 vs. ExPASy TrEMBL
Match: A0A0A0LY55 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G661750 PE=3 SV=1)

HSP 1 Score: 733.0 bits (1891), Expect = 8.4e-208
Identity = 406/635 (63.94%), Positives = 439/635 (69.13%), Query Frame = 0

Query: 1   MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIER 60
           MAK  IIL+ LLLIC  E   KAE  +Y+DP Q L  RI DLLGRMTLEEKIGQMVQIER
Sbjct: 2   MAKA-IILIALLLICCFETGAKAENFKYKDPTQRLNVRIKDLLGRMTLEEKIGQMVQIER 61

Query: 61  TNASTTIMKK-------------------------------------------------- 120
            NAST +MKK                                                  
Sbjct: 62  VNASTEVMKKYFIGSVLSGGGSVPSKQASAQDWINMVNEIQKGALSTRLGIPMIYGIDAV 121

Query: 121 ----------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWG 180
                                 DPQL+K+IGVA+A E+RATGIPYAFAPCVAVCRDPRWG
Sbjct: 122 HGHNNVYNATIFPHNIGLGATRDPQLLKRIGVASAREIRATGIPYAFAPCVAVCRDPRWG 181

Query: 181 RCYESYSEDPKIVQAMTEIISGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKG 240
           RCYESY EDPKIVQ MTEII GLQGEIPPNSRKGVP+VAGKE V ACAKH+VGDGGTTKG
Sbjct: 182 RCYESYGEDPKIVQEMTEIIPGLQGEIPPNSRKGVPYVAGKENVVACAKHYVGDGGTTKG 241

Query: 241 INENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH 300
           I+ENNTVIDRHGLLSIHMPGYY+SIIKGVATIMVSYSSWNGEKMHANKNL+TDFLKNTLH
Sbjct: 242 IDENNTVIDRHGLLSIHMPGYYHSIIKGVATIMVSYSSWNGEKMHANKNLVTDFLKNTLH 301

Query: 301 FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLN----IYGL--------- 360
           FQGFVISDW+ IDRIT PPHANYTYSILAS+TAG+DM+ I  N    I GL         
Sbjct: 302 FQGFVISDWEAIDRITDPPHANYTYSILASITAGLDMIMIPYNYPEFIDGLTNLVKSNYI 361

Query: 361 ---------KRILRVKFVMGLFENSLADLSLVNELGKKEHRELAREAVRKSLVLLKNGES 420
                    KRILRVKFVMGLFEN +ADLSLVNELGK+EHRELAREAVRKSLVLLKNG+S
Sbjct: 362 PISRIDDAVKRILRVKFVMGLFENPIADLSLVNELGKQEHRELAREAVRKSLVLLKNGKS 421

Query: 421 ADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTV 480
           ADKPLLPL KK QK  +LVA     ++    G W        T+    +  N L  GTTV
Sbjct: 422 ADKPLLPLEKKTQK--ILVAGSHANNLGYQCGGW--------TIEWQGLSGNNLTSGTTV 481

Query: 481 LAAIKDTIDPETEVIFKENPDKEFFQSHKFSYAIVVVGEYPYAETN-------------- 511
           L AIKDT+DP TEVIF ENPDK+  QS  FSYAIVVVGE+PYAE N              
Sbjct: 482 LDAIKDTVDPTTEVIFNENPDKKSLQSDTFSYAIVVVGEHPYAELNGDSLNLTIPDPGPN 541

BLAST of Sgr011838 vs. TAIR 10
Match: AT5G20950.1 (Glycosyl hydrolase family protein )

HSP 1 Score: 622.5 bits (1604), Expect = 3.1e-178
Identity = 341/634 (53.79%), Positives = 405/634 (63.88%), Query Frame = 0

Query: 1   MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIER 60
           ++KVL +++   ++  +E       L+Y+DPKQPL ARI DL+ RMTL+EKIGQMVQIER
Sbjct: 4   LSKVLCLMLLCCIVAAAEGT-----LKYKDPKQPLGARIRDLMNRMTLQEKIGQMVQIER 63

Query: 61  TNASTTIMKK-------------------------------------------------- 120
           + A+  +MKK                                                  
Sbjct: 64  SVATPEVMKKYFIGSVLSGGGSVPSEKATPETWVNMVNEIQKASLSTRLGIPMIYGIDAV 123

Query: 121 ----------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWG 180
                                 DP LVK+IG ATALEVRATGIPYAFAPC+AVCRDPRWG
Sbjct: 124 HGHNNVYGATIFPHNVGLGVTRDPNLVKRIGAATALEVRATGIPYAFAPCIAVCRDPRWG 183

Query: 181 RCYESYSEDPKIVQAMTEIISGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKG 240
           RCYESYSED +IVQ MTEII GLQG++ P  RKGVP V GK KVAACAKHFVGDGGT +G
Sbjct: 184 RCYESYSEDYRIVQQMTEIIPGLQGDL-PTKRKGVPFVGGKTKVAACAKHFVGDGGTVRG 243

Query: 241 INENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH 300
           I+ENNTVID  GL  IHMPGYYN++ KGVATIMVSYS+WNG +MHANK L+T FLKN L 
Sbjct: 244 IDENNTVIDSKGLFGIHMPGYYNAVNKGVATIMVSYSAWNGLRMHANKELVTGFLKNKLK 303

Query: 301 FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLNIY--------------- 360
           F+GFVISDWQGIDRITTPPH NY+YS+ A ++AGIDM+ +  N                 
Sbjct: 304 FRGFVISDWQGIDRITTPPHLNYSYSVYAGISAGIDMIMVPYNYTEFIDEISSQIQKKLI 363

Query: 361 -------GLKRILRVKFVMGLFENSLADLSLVNELGKKEHRELAREAVRKSLVLLKNGES 420
                   LKRILRVKF MGLFE  LADLS  N+LG KEHRELAREAVRKSLVLLKNG++
Sbjct: 364 PISRIDDALKRILRVKFTMGLFEEPLADLSFANQLGSKEHRELAREAVRKSLVLLKNGKT 423

Query: 421 ADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTV 480
             KPLLPLPKK+ K  +LVA     ++    G W        T+    +  N   +GTT+
Sbjct: 424 GAKPLLPLPKKSGK--ILVAGAHADNLGYQCGGW--------TITWQGLNGNDHTVGTTI 483

Query: 481 LAAIKDTIDPETEVIFKENPDKEFFQSHKFSYAIVVVGEYPYAE---------------- 510
           LAA+K+T+ P T+V++ +NPD  F +S KF YAIVVVGE PYAE                
Sbjct: 484 LAAVKNTVAPTTQVVYSQNPDANFVKSGKFDYAIVVVGEPPYAEMFGDTTNLTISDPGPS 543

BLAST of Sgr011838 vs. TAIR 10
Match: AT5G20950.2 (Glycosyl hydrolase family protein )

HSP 1 Score: 622.5 bits (1604), Expect = 3.1e-178
Identity = 341/634 (53.79%), Positives = 405/634 (63.88%), Query Frame = 0

Query: 1   MAKVLIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIER 60
           ++KVL +++   ++  +E       L+Y+DPKQPL ARI DL+ RMTL+EKIGQMVQIER
Sbjct: 4   LSKVLCLMLLCCIVAAAEGT-----LKYKDPKQPLGARIRDLMNRMTLQEKIGQMVQIER 63

Query: 61  TNASTTIMKK-------------------------------------------------- 120
           + A+  +MKK                                                  
Sbjct: 64  SVATPEVMKKYFIGSVLSGGGSVPSEKATPETWVNMVNEIQKASLSTRLGIPMIYGIDAV 123

Query: 121 ----------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWG 180
                                 DP LVK+IG ATALEVRATGIPYAFAPC+AVCRDPRWG
Sbjct: 124 HGHNNVYGATIFPHNVGLGVTRDPNLVKRIGAATALEVRATGIPYAFAPCIAVCRDPRWG 183

Query: 181 RCYESYSEDPKIVQAMTEIISGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKG 240
           RCYESYSED +IVQ MTEII GLQG++ P  RKGVP V GK KVAACAKHFVGDGGT +G
Sbjct: 184 RCYESYSEDYRIVQQMTEIIPGLQGDL-PTKRKGVPFVGGKTKVAACAKHFVGDGGTVRG 243

Query: 241 INENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLH 300
           I+ENNTVID  GL  IHMPGYYN++ KGVATIMVSYS+WNG +MHANK L+T FLKN L 
Sbjct: 244 IDENNTVIDSKGLFGIHMPGYYNAVNKGVATIMVSYSAWNGLRMHANKELVTGFLKNKLK 303

Query: 301 FQGFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLNIY--------------- 360
           F+GFVISDWQGIDRITTPPH NY+YS+ A ++AGIDM+ +  N                 
Sbjct: 304 FRGFVISDWQGIDRITTPPHLNYSYSVYAGISAGIDMIMVPYNYTEFIDEISSQIQKKLI 363

Query: 361 -------GLKRILRVKFVMGLFENSLADLSLVNELGKKEHRELAREAVRKSLVLLKNGES 420
                   LKRILRVKF MGLFE  LADLS  N+LG KEHRELAREAVRKSLVLLKNG++
Sbjct: 364 PISRIDDALKRILRVKFTMGLFEEPLADLSFANQLGSKEHRELAREAVRKSLVLLKNGKT 423

Query: 421 ADKPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTV 480
             KPLLPLPKK+ K  +LVA     ++    G W        T+    +  N   +GTT+
Sbjct: 424 GAKPLLPLPKKSGK--ILVAGAHADNLGYQCGGW--------TITWQGLNGNDHTVGTTI 483

Query: 481 LAAIKDTIDPETEVIFKENPDKEFFQSHKFSYAIVVVGEYPYAE---------------- 510
           LAA+K+T+ P T+V++ +NPD  F +S KF YAIVVVGE PYAE                
Sbjct: 484 LAAVKNTVAPTTQVVYSQNPDANFVKSGKFDYAIVVVGEPPYAEMFGDTTNLTISDPGPS 543

BLAST of Sgr011838 vs. TAIR 10
Match: AT5G20940.1 (Glycosyl hydrolase family protein )

HSP 1 Score: 590.1 bits (1520), Expect = 1.7e-168
Identity = 333/632 (52.69%), Positives = 398/632 (62.97%), Query Frame = 0

Query: 5   LIILMGLLLICFSEALTKAEY--LRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTN 64
           L+  +GLLL+C + A  K      +Y+DPK+PL  RI +L+  MTLEEKIGQMVQ+ER N
Sbjct: 7   LLQTLGLLLLCCTVAANKVPLANAKYKDPKEPLGVRIKNLMSHMTLEEKIGQMVQVERVN 66

Query: 65  ASTTIMKK---------------------------------------------------- 124
           A+T +M+K                                                    
Sbjct: 67  ATTEVMQKYFVGSVFSGGGSVPKPYIGPEAWVNMVNEVQKKALSTRLGIPIIYGIDAVHG 126

Query: 125 --------------------DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRC 184
                               DP LVK+IG ATALEVRATGI Y FAPC+AVCRDPRWGRC
Sbjct: 127 HNTVYNATIFPHNVGLGVTRDPGLVKRIGEATALEVRATGIQYVFAPCIAVCRDPRWGRC 186

Query: 185 YESYSEDPKIVQAMTEIISGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGIN 244
           YESYSED KIVQ MTEII GLQG++ P  +KGVP VAGK KVAACAKHFVGDGGT +G+N
Sbjct: 187 YESYSEDHKIVQQMTEIIPGLQGDL-PTGQKGVPFVAGKTKVAACAKHFVGDGGTLRGMN 246

Query: 245 ENNTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQ 304
            NNTVI+ +GLL IHMP Y++++ KGVAT+MVSYSS NG KMHANK LIT FLKN L F+
Sbjct: 247 ANNTVINSNGLLGIHMPAYHDAVNKGVATVMVSYSSINGLKMHANKKLITGFLKNKLKFR 306

Query: 305 GFVISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLNIYGL--------------- 364
           G VISD+ G+D+I TP  ANY++S+ A+ TAG+DM     N+  L               
Sbjct: 307 GIVISDYLGVDQINTPLGANYSHSVYAATTAGLDMFMGSSNLTKLIDELTSQVKRKFIPM 366

Query: 365 -------KRILRVKFVMGLFENSLADLSLVNELGKKEHRELAREAVRKSLVLLKNGESAD 424
                  KRILRVKF MGLFEN +AD SL  +LG KEHRELAREAVRKSLVLLKNGE+AD
Sbjct: 367 SRIDDAVKRILRVKFTMGLFENPIADHSLAKKLGSKEHRELAREAVRKSLVLLKNGENAD 426

Query: 425 KPLLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLA 484
           KPLLPLPKKA K  +LVA     ++    G W        T+    +  N L +GTT+LA
Sbjct: 427 KPLLPLPKKANK--ILVAGTHADNLGYQCGGW--------TITWQGLNGNNLTIGTTILA 486

Query: 485 AIKDTIDPETEVIFKENPDKEFFQSHKFSYAIVVVGEYPYAE------------------ 510
           A+K T+DP+T+VI+ +NPD  F ++  F YAIV VGE PYAE                  
Sbjct: 487 AVKKTVDPKTQVIYNQNPDTNFVKAGDFDYAIVAVGEKPYAEGFGDSTNLTISEPGPSTI 546

BLAST of Sgr011838 vs. TAIR 10
Match: AT5G04885.1 (Glycosyl hydrolase family protein )

HSP 1 Score: 574.7 bits (1480), Expect = 7.4e-164
Identity = 310/631 (49.13%), Positives = 390/631 (61.81%), Query Frame = 0

Query: 5   LIILMGLLLICFSEALTKAEYLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNAS 64
           +++ M + + C+ +     EYL Y+DPKQ +  R+ DL GRMTLEEKIGQMVQI+R+ A+
Sbjct: 11  VLLWMCMWVCCYGD----GEYLLYKDPKQTVSDRVADLFGRMTLEEKIGQMVQIDRSVAT 70

Query: 65  TTIMK------------------------------------------------------- 124
             IM+                                                       
Sbjct: 71  VNIMRDYFIGSVLSGGGSAPLPEASAQNWVDMINEYQKGALVSRLGIPMIYGIDAVHGHN 130

Query: 125 -----------------KDPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYE 184
                            +DP LVK+IG ATA+EVRATGIPY FAPC+AVCRDPRWGRCYE
Sbjct: 131 NVYNATIFPHNVGLGATRDPDLVKRIGAATAVEVRATGIPYTFAPCIAVCRDPRWGRCYE 190

Query: 185 SYSEDPKIVQAMTEIISGLQGEIPPNSRKGVPHVAGKEKVAACAKHFVGDGGTTKGINEN 244
           SYSED K+V+ MT++I GLQGE P N + GVP V G++KVAACAKH+VGDGGTT+G+NEN
Sbjct: 191 SYSEDHKVVEDMTDVILGLQGEPPSNYKHGVPFVGGRDKVAACAKHYVGDGGTTRGVNEN 250

Query: 245 NTVIDRHGLLSIHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQGF 304
           NTV D HGLLS+HMP Y +++ KGV+T+MVSYSSWNGEKMHAN  LIT +LK TL F+GF
Sbjct: 251 NTVTDLHGLLSVHMPAYADAVYKGVSTVMVSYSSWNGEKMHANTELITGYLKGTLKFKGF 310

Query: 305 VISDWQGIDRITTPPHANYTYSILASVTAGIDMVGIVLNIY------------------- 364
           VISDWQG+D+I+TPPH +YT S+ A++ AGIDMV +  N                     
Sbjct: 311 VISDWQGVDKISTPPHTHYTASVRAAIQAGIDMVMVPFNFTEFVNDLTTLVKNNSIPVTR 370

Query: 365 ---GLKRILRVKFVMGLFENSLADLSLVNELGKKEHRELAREAVRKSLVLLKNGESADKP 424
               ++RIL VKF MGLFEN LAD S  +ELG + HR+LAREAVRKSLVLLKNG   + P
Sbjct: 371 IDDAVRRILLVKFTMGLFENPLADYSFSSELGSQAHRDLAREAVRKSLVLLKNGNKTN-P 430

Query: 425 LLPLPKKAQKYLLLVAMQTTLDINVAVGLWNGKDLVATTLLLHNIYINFLLLGTTVLAAI 484
           +LPLP+K  K  +LVA     ++    G W        T+       N    GTT+L+A+
Sbjct: 431 MLPLPRKTSK--ILVAGTHADNLGYQCGGW--------TITWQGFSGNKNTRGTTLLSAV 490

Query: 485 KDTIDPETEVIFKENPDKEFFQSHKFSYAIVVVGEYPYAET------------------- 511
           K  +D  TEV+F+ENPD EF +S+ F+YAI+ VGE PYAET                   
Sbjct: 491 KSAVDQSTEVVFRENPDAEFIKSNNFAYAIIAVGEPPYAETAGDSDKLTMLDPGPAIISS 550

BLAST of Sgr011838 vs. TAIR 10
Match: AT3G62710.1 (Glycosyl hydrolase family protein )

HSP 1 Score: 466.8 bits (1200), Expect = 2.2e-131
Identity = 288/627 (45.93%), Positives = 350/627 (55.82%), Query Frame = 0

Query: 25  YLRYRDPKQPLRARINDLLGRMTLEEKIGQMVQIERTNAS-------------------- 84
           Y++Y+DPK  +  R+ DLL RMTL EK+GQM QI+R N S                    
Sbjct: 35  YIKYKDPKVAVEERVEDLLIRMTLPEKLGQMCQIDRFNFSQVTGGVATVVPEIFTKYMIG 94

Query: 85  -------------------TTIMKK----------------------------------- 144
                              T  MKK                                   
Sbjct: 95  SVLSNPYDTGKDIAKRIFQTNAMKKLSLSTRLGIPLLYAVDAVHGHNTFIDATIFPHNVG 154

Query: 145 -----DPQLVKKIGVATALEVRATGIPYAFAPCVAVCRDPRWGRCYESYSEDPKIVQAMT 204
                DPQLVKKIG  TA EVRATG+  AFAPCVAVCRDPRWGRCYESYSEDP +V  MT
Sbjct: 155 LGATRDPQLVKKIGAITAQEVRATGVAQAFAPCVAVCRDPRWGRCYESYSEDPAVVNMMT 214

Query: 205 E-IISGLQGEIPPNSRKGVPHVAG-KEKVAACAKHFVGDGGTTKGINENNTVIDRHGLLS 264
           E II GLQG          P++A  K  VA CAKHFVGDGGT  GINENNTV D   L  
Sbjct: 215 ESIIDGLQG--------NAPYLADPKINVAGCAKHFVGDGGTINGINENNTVADNATLFG 274

Query: 265 IHMPGYYNSIIKGVATIMVSYSSWNGEKMHANKNLITDFLKNTLHFQGFVISDWQGIDRI 324
           IHMP +  ++ KG+A+IM SYSS NG KMHAN+ +ITD+LKNTL FQGFVISDW GID+I
Sbjct: 275 IHMPPFEIAVKKGIASIMASYSSLNGVKMHANRAMITDYLKNTLKFQGFVISDWLGIDKI 334

Query: 325 TTPPHANYTYSILASVTAGIDMVGI----------VLNIY------------GLKRILRV 384
           T    +NYTYSI AS+ AGIDMV +          + N+              ++RILRV
Sbjct: 335 TPIEKSNYTYSIEASINAGIDMVMVPWAYPEYLEKLTNLVNGGYIPMSRIDDAVRRILRV 394

Query: 385 KFVMGLFENSLADLSL-VNELGKKEHRELAREAVRKSLVLLKNGESADKPLLPLPKKAQK 444
           KF +GLFENSLAD  L   E G + HRE+ REAVRKS+VLLKNG++    ++PLPKK +K
Sbjct: 395 KFSIGLFENSLADEKLPTTEFGSEAHREVGREAVRKSMVLLKNGKTDADKIVPLPKKVKK 454

Query: 445 YLLLVAMQTTLDINVAVG----LW---NGKDLVATTLLLHNIYINFLLLGTTVLAAIKDT 504
             ++VA +   D+    G     W   NG      T   H +     + GTT+L AI+  
Sbjct: 455 --IVVAGRHANDMGWQCGGFSLTWQGFNGTGEDMPTNTKHGLPTG-KIKGTTILEAIQKA 514

Query: 505 IDPETEVIFKENPDKEFFQSH-KFSYAIVVVGEYPYAET--------------------- 508
           +DP TEV++ E P+++  + H   +Y IVVVGE PYAET                     
Sbjct: 515 VDPTTEVVYVEEPNQDTAKLHADAAYTIVVVGETPYAETFGDSPTLGITKPGPDTLSHTC 574
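
Each HSP header above reports identity and positives as a count over the alignment length, followed by a percentage. The sketch below is a minimal, illustrative way to recompute those percentages from the raw counts as a sanity check. The function name `recompute_hsp_percentages` and the regular expression are assumptions made for this example, not part of any BLAST tool.

```python
import re

# Matches e.g. "Identity = 406/637 (63.74%), Positives = 443/637 (69.54%)".
# "Posi?tives" also tolerates the "Postives" spelling seen in some report exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def recompute_hsp_percentages(line):
    """Return (identity_pct, positives_pct) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return (
        round(100 * int(ident) / int(ident_len), 2),
        round(100 * int(pos) / int(pos_len), 2),
    )

# Statistics line of the first TrEMBL hit (A0A5A7T9L3).
line = "Identity = 406/637 (63.74%), Positives = 443/637 (69.54%), Query Frame = 0"
identity_pct, positives_pct = recompute_hsp_percentages(line)
print(identity_pct, positives_pct)
```

For the line shown, the recomputed values agree with the reported 63.74% identity and 69.54% positives.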

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
TYK19869.1 | 1.0e-215 | 67.67 | beta-glucosidase BoGH3B-like [Cucumis melo var. makuwa]
XP_008443733.1 | 2.0e-211 | 63.74 | PREDICTED: beta-glucosidase BoGH3B-like [Cucumis melo] >KAA0038317.1 beta-glucos...
XP_022155346.1 | 7.5e-211 | 63.15 | uncharacterized protein LOC111022483 [Momordica charantia]
XP_038905524.1 | 2.7e-208 | 62.36 | LOW QUALITY PROTEIN: beta-glucosidase BoGH3B-like [Benincasa hispida]
XP_011648555.1 | 1.7e-207 | 63.94 | uncharacterized protein LOC101211593 [Cucumis sativus] >KGN66708.1 hypothetical ...

Match Name | E-value | Identity (%) | Description
T2KMH0 | 2.4e-34 | 25.56 | Beta-xylosidase OS=Formosa agariphila (strain DSM 15362 / KCTC 12365 / LMG 23005...
A7LXU3 | 2.0e-33 | 25.19 | Beta-glucosidase BoGH3B OS=Bacteroides ovatus (strain ATCC 8483 / DSM 1896 / JCM...
Q23892 | 1.0e-29 | 26.59 | Lysosomal beta glucosidase OS=Dictyostelium discoideum OX=44689 GN=gluA PE=1 SV=...
Q56078 | 7.5e-28 | 25.09 | Periplasmic beta-glucosidase OS=Salmonella typhimurium (strain LT2 / SGSC1412 / ...
P33363 | 1.7e-27 | 25.00 | Periplasmic beta-glucosidase OS=Escherichia coli (strain K12) OX=83333 GN=bglX P...

Match Name | E-value | Identity (%) | Description
A0A5D3D8S5 | 4.9e-216 | 67.67 | Beta-glucosidase BoGH3B-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_sca...
A0A5A7T9L3 | 9.6e-212 | 63.74 | Beta-glucosidase BoGH3B-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_sca...
A0A1S3B892 | 9.6e-212 | 63.74 | beta-glucosidase BoGH3B-like OS=Cucumis melo OX=3656 GN=LOC103487249 PE=3 SV=1
A0A6J1DRG0 | 3.7e-211 | 63.15 | uncharacterized protein LOC111022483 OS=Momordica charantia OX=3673 GN=LOC111022...
A0A0A0LY55 | 8.4e-208 | 63.94 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G661750 PE=3 SV=1

Match Name | E-value | Identity (%) | Description
AT5G20950.1 | 3.1e-178 | 53.79 | Glycosyl hydrolase family protein
AT5G20950.2 | 3.1e-178 | 53.79 | Glycosyl hydrolase family protein
AT5G20940.1 | 1.7e-168 | 52.69 | Glycosyl hydrolase family protein
AT5G04885.1 | 7.4e-164 | 49.13 | Glycosyl hydrolase family protein
AT3G62710.1 | 2.2e-131 | 45.93 | Glycosyl hydrolase family protein
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001764 | Glycoside hydrolase, family 3, N-terminal | PRINTS | PR00133 | GLHYDRLASE3 | coord: 152..168, score: 39.22; coord: 222..240, score: 47.24; coord: 106..122, score: 44.12
IPR001764 | Glycoside hydrolase, family 3, N-terminal | PFAM | PF00933 | Glyco_hydro_3 | coord: 69..266, e-value: 3.2E-43, score: 148.4
IPR036881 | Glycoside hydrolase family 3 C-terminal domain superfamily | GENE3D | 3.40.50.1700 | | coord: 433..508, e-value: 2.9E-24, score: 88.1; coord: 303..432, e-value: 2.0E-17, score: 65.8
IPR036881 | Glycoside hydrolase family 3 C-terminal domain superfamily | SUPERFAMILY | 52279 | Beta-D-glucan exohydrolase, C-terminal domain | coord: 318..506
IPR036962 | Glycoside hydrolase, family 3, N-terminal domain superfamily | GENE3D | 3.20.20.300 | | coord: 21..67, e-value: 1.9E-9, score: 38.9; coord: 68..302, e-value: 1.5E-72, score: 246.6
IPR002772 | Glycoside hydrolase family 3 C-terminal domain | PFAM | PF01915 | Glyco_hydro_3_C | coord: 432..506, e-value: 9.2E-16, score: 58.4
None | No IPR available | PANTHER | PTHR30620:SF91 | BETA-D-GLUCOSIDE GLUCOHYDROLASE | coord: 432..510; coord: 12..70
None | No IPR available | PANTHER | PTHR30620:SF91 | BETA-D-GLUCOSIDE GLUCOHYDROLASE | coord: 70..433
None | No IPR available | PANTHER | PTHR30620 | PERIPLASMIC BETA-GLUCOSIDASE-RELATED | coord: 432..510; coord: 12..70; coord: 70..433
IPR017853 | Glycoside hydrolase superfamily | SUPERFAMILY | 51445 | (Trans)glycosidases | coord: 26..317
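
The `coord` ranges in the InterPro table can be combined to see how much of the protein a given source annotates. The sketch below is illustrative only: it assumes the coordinates are 1-based, inclusive residue positions, and the helper name `merge_ranges` is hypothetical. It merges the four GENE3D ranges listed above and counts the residues they cover.

```python
def merge_ranges(ranges):
    """Merge inclusive (start, end) ranges that overlap or directly touch."""
    merged = []
    for start, end in sorted(ranges):
        if merged and start <= merged[-1][1] + 1:
            # Extend the previous range when this one overlaps or abuts it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

# GENE3D coordinates copied from the table (assumed 1-based, inclusive).
gene3d = [(433, 508), (303, 432), (21, 67), (68, 302)]
merged = merge_ranges(gene3d)
covered = sum(end - start + 1 for start, end in merged)
print(merged, covered)
```

Under that assumption, the four GENE3D ranges abut one another and collapse into a single block spanning residues 21 to 508.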

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sgr011838.1 | Sgr011838.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0009251 | glucan catabolic process
biological_process | GO:0005975 | carbohydrate metabolic process
molecular_function | GO:0008422 | beta-glucosidase activity
molecular_function | GO:0004553 | hydrolase activity, hydrolyzing O-glycosyl compounds