Sgr028610 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr028610
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Domain of unknown function (DUF303)
Location: tig00153204: 2789192 .. 2790058 (-)
RNA-Seq Expression: Sgr028610
Synteny: Sgr028610
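The Location field gives enough information to work out the feature length. The sketch below assumes the coordinates are 1-based and inclusive (an assumption; the page does not state its convention) and checks whether the span divides evenly into codons. The names `START`, `END` and `feature_length` are illustrative only.

```python
# Coordinates copied from the Location field of this record.
# Assumption: 1-based, inclusive coordinates on contig tig00153204.
START, END = 2789192, 2790058


def feature_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based interval."""
    return end - start + 1


length_bp = feature_length(START, END)
codons, remainder = divmod(length_bp, 3)

print(length_bp)   # 867
print(codons)      # 289 (288 residues plus one stop codon if the span is all CDS)
print(remainder)   # 0
```

Under that assumption the span is 867 bp, which matches a 288-residue protein plus a stop codon, as expected for a gene whose genomic, mRNA and coding sequences are identical.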
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTTTGTTGAGACTATCAATCTTGCTTTGTATGATGCTATTTGGCCCTTCCCTTTCTGGGGCTACTTCTCCTAAAAACATATTCATCCTCGCCGGTCAGAGCAACATGGCTGGCCGAGGCGGCGTTGAGAAAGACCGATGGGGCAAAGATTTTGTGTGGGATGGATATGTCCCACCCGAGGCTCAGCCTGACCCATCCATCCTCCGATTGAACCCTGAGCGCCAATGGGAGGTAGCACGAGAGCCCGTCCATGAGGGCATTGATATCAACAAGACGGTTGGGGTTGGCTCGGCAATTGCATTTGCTCACCAACTGCAGGCGAAAGGTGGGTTAAAGGTCGGCTCTGTGGGTTTAGTTCCTTGTGCTAGAGGTGGCACTCTGATCCAACAATGGATAAAAAACCCTAGCAATCCTAATGGGACTTTTTATAAAAATTTCATTGAACGAATCAAAGCTTCAGATAGAGAAGGTGGGGTGGTGCGTGCTCTTTTCTGGTTGCAAGGAGAAAGTGATGCAGCCAGTAGTGGCACGGCTGGTAGATACAAAGATAACTTGAAGAAGTTCTTCACCGACATCCGCAATGATATCAAGCCCAGATTTTTACCCATCATTCTTGTGAAAATAGCTGTCTATGACACATATATGAAGCATGATACTCATGATTTGCCAGCAGTGAGGGCCGCAGAAGATGCAGTCCAGCGTGAGTTGCCAGATGTTGTGACCATCGACTCCTTACAATTGGTGAACACTGCCACCGGAGAAGGCTTTAACCAGGATCATGGTCATTTTAATGTCAAGACTGAGATTGCATTGGGTAAATGGTTGGCTGATACCTACCTCTCCCATTATGGCCACTTGCTTTGA

mRNA sequence

ATGGCTTTGTTGAGACTATCAATCTTGCTTTGTATGATGCTATTTGGCCCTTCCCTTTCTGGGGCTACTTCTCCTAAAAACATATTCATCCTCGCCGGTCAGAGCAACATGGCTGGCCGAGGCGGCGTTGAGAAAGACCGATGGGGCAAAGATTTTGTGTGGGATGGATATGTCCCACCCGAGGCTCAGCCTGACCCATCCATCCTCCGATTGAACCCTGAGCGCCAATGGGAGGTAGCACGAGAGCCCGTCCATGAGGGCATTGATATCAACAAGACGGTTGGGGTTGGCTCGGCAATTGCATTTGCTCACCAACTGCAGGCGAAAGGTGGGTTAAAGGTCGGCTCTGTGGGTTTAGTTCCTTGTGCTAGAGGTGGCACTCTGATCCAACAATGGATAAAAAACCCTAGCAATCCTAATGGGACTTTTTATAAAAATTTCATTGAACGAATCAAAGCTTCAGATAGAGAAGGTGGGGTGGTGCGTGCTCTTTTCTGGTTGCAAGGAGAAAGTGATGCAGCCAGTAGTGGCACGGCTGGTAGATACAAAGATAACTTGAAGAAGTTCTTCACCGACATCCGCAATGATATCAAGCCCAGATTTTTACCCATCATTCTTGTGAAAATAGCTGTCTATGACACATATATGAAGCATGATACTCATGATTTGCCAGCAGTGAGGGCCGCAGAAGATGCAGTCCAGCGTGAGTTGCCAGATGTTGTGACCATCGACTCCTTACAATTGGTGAACACTGCCACCGGAGAAGGCTTTAACCAGGATCATGGTCATTTTAATGTCAAGACTGAGATTGCATTGGGTAAATGGTTGGCTGATACCTACCTCTCCCATTATGGCCACTTGCTTTGA

Coding sequence (CDS)

ATGGCTTTGTTGAGACTATCAATCTTGCTTTGTATGATGCTATTTGGCCCTTCCCTTTCTGGGGCTACTTCTCCTAAAAACATATTCATCCTCGCCGGTCAGAGCAACATGGCTGGCCGAGGCGGCGTTGAGAAAGACCGATGGGGCAAAGATTTTGTGTGGGATGGATATGTCCCACCCGAGGCTCAGCCTGACCCATCCATCCTCCGATTGAACCCTGAGCGCCAATGGGAGGTAGCACGAGAGCCCGTCCATGAGGGCATTGATATCAACAAGACGGTTGGGGTTGGCTCGGCAATTGCATTTGCTCACCAACTGCAGGCGAAAGGTGGGTTAAAGGTCGGCTCTGTGGGTTTAGTTCCTTGTGCTAGAGGTGGCACTCTGATCCAACAATGGATAAAAAACCCTAGCAATCCTAATGGGACTTTTTATAAAAATTTCATTGAACGAATCAAAGCTTCAGATAGAGAAGGTGGGGTGGTGCGTGCTCTTTTCTGGTTGCAAGGAGAAAGTGATGCAGCCAGTAGTGGCACGGCTGGTAGATACAAAGATAACTTGAAGAAGTTCTTCACCGACATCCGCAATGATATCAAGCCCAGATTTTTACCCATCATTCTTGTGAAAATAGCTGTCTATGACACATATATGAAGCATGATACTCATGATTTGCCAGCAGTGAGGGCCGCAGAAGATGCAGTCCAGCGTGAGTTGCCAGATGTTGTGACCATCGACTCCTTACAATTGGTGAACACTGCCACCGGAGAAGGCTTTAACCAGGATCATGGTCATTTTAATGTCAAGACTGAGATTGCATTGGGTAAATGGTTGGCTGATACCTACCTCTCCCATTATGGCCACTTGCTTTGA

Protein sequence

MALLRLSILLCMMLFGPSLSGATSPKNIFILAGQSNMAGRGGVEKDRWGKDFVWDGYVPPEAQPDPSILRLNPERQWEVAREPVHEGIDINKTVGVGSAIAFAHQLQAKGGLKVGSVGLVPCARGGTLIQQWIKNPSNPNGTFYKNFIERIKASDREGGVVRALFWLQGESDAASSGTAGRYKDNLKKFFTDIRNDIKPRFLPIILVKIAVYDTYMKHDTHDLPAVRAAEDAVQRELPDVVTIDSLQLVNTATGEGFNQDHGHFNVKTEIALGKWLADTYLSHYGHLL
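The relationship between the coding sequence and the protein sequence can be spot-checked by translating short stretches of the CDS. The following is a minimal sketch using the standard genetic code (an assumption; the page does not name the translation table), applied only to the first 30 and last 15 nucleotides of the CDS listed above. The function name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt and last 15 nt of the CDS shown on this page.
print(translate("ATGGCTTTGTTGAGACTATCAATCTTGCTT"))  # MALLRLSILL
print(translate("GGCCACTTGCTTTGA"))                 # GHLL*
```

Both fragments agree with the start (MALLRLSILL) and end (GHLL) of the protein sequence above.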
Homology
BLAST of Sgr028610 vs. NCBI nr
Match: XP_022131651.1 (probable carbohydrate esterase At4g34215 [Momordica charantia])

HSP 1 Score: 526.6 bits (1355), Expect = 1.4e-145
Identity = 259/288 (89.93%), Positives = 272/288 (94.44%), Query Frame = 0

Query: 1   MALLRLSILLCMMLFGPSLSGATSPKNIFILAGQSNMAGRGGVEKDRWGKDFVWDGYVPP 60
           MALLRLSI+LCMMLFGPSLSGATSPKNIFILAGQSNMAGRGGVEK+R G D  WDGYVPP
Sbjct: 1   MALLRLSIMLCMMLFGPSLSGATSPKNIFILAGQSNMAGRGGVEKNRTG-DLEWDGYVPP 60

Query: 61  EAQPDPSILRLNPERQWEVAREPVHEGIDINKTVGVGSAIAFAHQLQAKGGLKVGSVGLV 120
           E+QPDPSILRLNPERQWEVAREPVH GIDI KTVGVG AIAFAHQLQAKGG KVGSVGLV
Sbjct: 61  ESQPDPSILRLNPERQWEVAREPVHRGIDIGKTVGVGPAIAFAHQLQAKGGSKVGSVGLV 120

Query: 121 PCARGGTLIQQWIKNPSNPNGTFYKNFIERIKASDREGGVVRALFWLQGESDAASSGTAG 180
           PCARGGTLI+QW+KNPSNPN TFYKNFIERI+ASDREGGVVRALFWLQGESDAASS TA 
Sbjct: 121 PCARGGTLIEQWVKNPSNPNATFYKNFIERIQASDREGGVVRALFWLQGESDAASSDTAE 180

Query: 181 RYKDNLKKFFTDIRNDIKPRFLPIILVKIAVYDTYMKHDTHDLPAVRAAEDAVQRELPDV 240
           RYK+NLKKFFTDIRNDIKPR LPIILVKIAVYDT+MKHDTHDLPAVRAAEDAVQRELP+V
Sbjct: 181 RYKNNLKKFFTDIRNDIKPRVLPIILVKIAVYDTFMKHDTHDLPAVRAAEDAVQRELPNV 240

Query: 241 VTIDSLQLVNTATGEGFNQDHGHFNVKTEIALGKWLADTYLSHYGHLL 289
           VTID+L+LVNT T EGFN D GHFN+KTEIALGKWLADTYLS+YGHLL
Sbjct: 241 VTIDALKLVNTTTAEGFNLDRGHFNIKTEIALGKWLADTYLSNYGHLL 287
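The percentages reported for this HSP follow directly from the raw counts over the alignment length. As a small worked example (the helper name `percent` is illustrative, and the counts are copied from the XP_022131651.1 HSP above):

```python
def percent(count: int, alignment_length: int) -> float:
    """Return count / alignment_length as a percentage, rounded to 2 decimals."""
    return round(100.0 * count / alignment_length, 2)


# Counts copied from the XP_022131651.1 HSP.
identical, positives, aligned = 259, 272, 288

print(percent(identical, aligned))  # 89.93
print(percent(positives, aligned))  # 94.44

# Positive-scoring columns include the identical ones, so the difference
# is the number of similar-but-not-identical columns.
print(positives - identical)        # 13
```

The same calculation applies to every HSP on this page; only the three counts change.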

BLAST of Sgr028610 vs. NCBI nr
Match: XP_022157447.1 (probable carbohydrate esterase At4g34215 [Momordica charantia])

HSP 1 Score: 516.2 bits (1328), Expect = 1.9e-142
Identity = 257/288 (89.24%), Positives = 267/288 (92.71%), Query Frame = 0

Query: 1   MALLRLSILLCMMLFGPSLSGATSPKNIFILAGQSNMAGRGGVEKDRWGKDFVWDGYVPP 60
           M +LRLSILLCMML GPSLS ATSPKNIFILAGQSNMAGRGGVE DR G   VWDGYVPP
Sbjct: 1   MVILRLSILLCMMLCGPSLSRATSPKNIFILAGQSNMAGRGGVENDRPG-HLVWDGYVPP 60

Query: 61  EAQPDPSILRLNPERQWEVAREPVHEGIDINKTVGVGSAIAFAHQLQAKGGLKVGSVGLV 120
           EAQPDPSILRLNPERQWEVAREPVHEGIDINKTVGVG AIAFA QLQAKGG KVGSVGLV
Sbjct: 61  EAQPDPSILRLNPERQWEVAREPVHEGIDINKTVGVGPAIAFARQLQAKGGSKVGSVGLV 120

Query: 121 PCARGGTLIQQWIKNPSNPNGTFYKNFIERIKASDREGGVVRALFWLQGESDAASSGTAG 180
           PCARGGTLI+QW+KNPSN + TFYKNFIERI+ASDREGGVVRALFWLQGESDAAS  TA 
Sbjct: 121 PCARGGTLIEQWVKNPSNTSATFYKNFIERIQASDREGGVVRALFWLQGESDAASRDTAN 180

Query: 181 RYKDNLKKFFTDIRNDIKPRFLPIILVKIAVYDTYMKHDTHDLPAVRAAEDAVQRELPDV 240
           RYK+NLKKFFTDIRNDIKPRFLPIILVKIAVYDT+MKHDTHDLPAVRAAEDAVQRELP+V
Sbjct: 181 RYKNNLKKFFTDIRNDIKPRFLPIILVKIAVYDTFMKHDTHDLPAVRAAEDAVQRELPNV 240

Query: 241 VTIDSLQLVNTATGEGFNQDHGHFNVKTEIALGKWLADTYLSHYGHLL 289
           VTIDSL+LVNT T EGFN D GHFN+KTEIALGKWLADTYLS YGHLL
Sbjct: 241 VTIDSLKLVNTTTVEGFNLDRGHFNIKTEIALGKWLADTYLSQYGHLL 287

BLAST of Sgr028610 vs. NCBI nr
Match: XP_022158585.1 (probable carbohydrate esterase At4g34215 [Momordica charantia])

HSP 1 Score: 516.2 bits (1328), Expect = 1.9e-142
Identity = 254/288 (88.19%), Positives = 271/288 (94.10%), Query Frame = 0

Query: 1   MALLRLSILLCMMLFGPSLSGATSPKNIFILAGQSNMAGRGGVEKDRWGKDFVWDGYVPP 60
           MALLRLSILLCM+L GPSLSGATSPKNIFILAGQSNMAGRGGVEKDR G + VWDGYVPP
Sbjct: 1   MALLRLSILLCMILCGPSLSGATSPKNIFILAGQSNMAGRGGVEKDRSG-NLVWDGYVPP 60

Query: 61  EAQPDPSILRLNPERQWEVAREPVHEGIDINKTVGVGSAIAFAHQLQAKGGLKVGSVGLV 120
           EAQPDPSILRLNPERQWEVAREPVHEGIDINKTVGVGSAIAFAHQLQAKGG KVGSVGLV
Sbjct: 61  EAQPDPSILRLNPERQWEVAREPVHEGIDINKTVGVGSAIAFAHQLQAKGGSKVGSVGLV 120

Query: 121 PCARGGTLIQQWIKNPSNPNGTFYKNFIERIKASDREGGVVRALFWLQGESDAASSGTAG 180
           PCARGGTLI+QW+KNPSN + TFYKNFIERI+ASDREGGVVRALFWLQGESDAAS  TA 
Sbjct: 121 PCARGGTLIEQWVKNPSNTSATFYKNFIERIQASDREGGVVRALFWLQGESDAASRDTAS 180

Query: 181 RYKDNLKKFFTDIRNDIKPRFLPIILVKIAVYDTYMKHDTHDLPAVRAAEDAVQRELPDV 240
           RYK+NLKKFFTDIRNDIKPRFLPII++KIA YDT+++HDTHDLP VRAAEDAVQREL ++
Sbjct: 181 RYKNNLKKFFTDIRNDIKPRFLPIIVIKIAGYDTFIEHDTHDLPTVRAAEDAVQRELLNI 240

Query: 241 VTIDSLQLVNTATGEGFNQDHGHFNVKTEIALGKWLADTYLSHYGHLL 289
           VTID+L+LVNT TGEGFN D GH+NVKTEIALGKWLADTYLSHYG LL
Sbjct: 241 VTIDALKLVNTITGEGFNLDRGHYNVKTEIALGKWLADTYLSHYGQLL 287

BLAST of Sgr028610 vs. NCBI nr
Match: XP_022158593.1 (probable carbohydrate esterase At4g34215 [Momordica charantia])

HSP 1 Score: 506.1 bits (1302), Expect = 1.9e-139
Identity = 246/277 (88.81%), Positives = 261/277 (94.22%), Query Frame = 0

Query: 12  MMLFGPSLSGATSPKNIFILAGQSNMAGRGGVEKDRWGKDFVWDGYVPPEAQPDPSILRL 71
           M+L GPSLSGATSPKNIFILAGQSNMAGRGGVEKDR G + VWDGYVPPEAQPDPSILRL
Sbjct: 1   MILCGPSLSGATSPKNIFILAGQSNMAGRGGVEKDRSG-NLVWDGYVPPEAQPDPSILRL 60

Query: 72  NPERQWEVAREPVHEGIDINKTVGVGSAIAFAHQLQAKGGLKVGSVGLVPCARGGTLIQQ 131
           NPERQWEVAREPVHEGIDINKTVGVGSAIAFAHQLQAKGG KVGSVGLVPCARGGTLI+Q
Sbjct: 61  NPERQWEVAREPVHEGIDINKTVGVGSAIAFAHQLQAKGGSKVGSVGLVPCARGGTLIEQ 120

Query: 132 WIKNPSNPNGTFYKNFIERIKASDREGGVVRALFWLQGESDAASSGTAGRYKDNLKKFFT 191
           W+KNPSN + TFYKNFIERI+ASDREGGVVRALFWLQGESDAAS  TA RYK+NLKKFFT
Sbjct: 121 WVKNPSNTSATFYKNFIERIQASDREGGVVRALFWLQGESDAASRDTASRYKNNLKKFFT 180

Query: 192 DIRNDIKPRFLPIILVKIAVYDTYMKHDTHDLPAVRAAEDAVQRELPDVVTIDSLQLVNT 251
           DIRNDIKPRFLPII+VKIA YDT+M+HDTHDLPAVRAAEDAVQRELP++VTID+L+LVNT
Sbjct: 181 DIRNDIKPRFLPIIVVKIAGYDTFMEHDTHDLPAVRAAEDAVQRELPNIVTIDALKLVNT 240

Query: 252 ATGEGFNQDHGHFNVKTEIALGKWLADTYLSHYGHLL 289
            TGEGFN D GH+N KTEIALGKWLADTYLSHYG LL
Sbjct: 241 VTGEGFNSDRGHYNFKTEIALGKWLADTYLSHYGQLL 276

BLAST of Sgr028610 vs. NCBI nr
Match: XP_022158365.1 (probable carbohydrate esterase At4g34215 [Momordica charantia])

HSP 1 Score: 499.2 bits (1284), Expect = 2.4e-137
Identity = 247/288 (85.76%), Positives = 262/288 (90.97%), Query Frame = 0

Query: 1   MALLRLSILLCMMLFGPSLSGATSPKNIFILAGQSNMAGRGGVEKDRWGKDFVWDGYVPP 60
           MALLRL ILLCMML G SLSGATSPKNIFILAGQSNMAGRGGVE DR G   VWD YVPP
Sbjct: 1   MALLRLLILLCMMLRGTSLSGATSPKNIFILAGQSNMAGRGGVENDRPG-HLVWDRYVPP 60

Query: 61  EAQPDPSILRLNPERQWEVAREPVHEGIDINKTVGVGSAIAFAHQLQAKGGLKVGSVGLV 120
           EAQPDPSILRLNP+RQWEVAREPVHEGIDINKTVGVG AI+FAHQLQAKGG KVGSVGLV
Sbjct: 61  EAQPDPSILRLNPDRQWEVAREPVHEGIDINKTVGVGPAISFAHQLQAKGGSKVGSVGLV 120

Query: 121 PCARGGTLIQQWIKNPSNPNGTFYKNFIERIKASDREGGVVRALFWLQGESDAASSGTAG 180
           PCARGGTLI+QW+KNP N + TFYKNFIERI+ASDREGGVVRAL W QG SDAAS  TA 
Sbjct: 121 PCARGGTLIEQWLKNPGNTSATFYKNFIERIQASDREGGVVRALLWFQGASDAASRDTAN 180

Query: 181 RYKDNLKKFFTDIRNDIKPRFLPIILVKIAVYDTYMKHDTHDLPAVRAAEDAVQRELPDV 240
           RYK+NLKKFFTDIRND+KPRFLPII+V+ AVYDT+MKHDTHDLPAVRAA+DAVQRELP+V
Sbjct: 181 RYKNNLKKFFTDIRNDVKPRFLPIIVVQSAVYDTFMKHDTHDLPAVRAAQDAVQRELPNV 240

Query: 241 VTIDSLQLVNTATGEGFNQDHGHFNVKTEIALGKWLADTYLSHYGHLL 289
           VTIDSL+LVNT T EGFN D GHFN KTEIALGKWLADTYLSHYG+LL
Sbjct: 241 VTIDSLKLVNTTTSEGFNLDRGHFNTKTEIALGKWLADTYLSHYGNLL 287

BLAST of Sgr028610 vs. ExPASy Swiss-Prot
Match: Q8L9J9 (Probable carbohydrate esterase At4g34215 OS=Arabidopsis thaliana OX=3702 GN=At4g34215 PE=1 SV=2)

HSP 1 Score: 175.6 bits (444), Expect = 7.9e-43
Identity = 98/268 (36.57%), Positives = 143/268 (53.36%), Query Frame = 0

Query: 17  PSLSGATSPKNIFILAGQSNMAGRGGVEKDRWGKDFVWDGYVPPEAQPDPSILRLNPERQ 76
           P +     P  IFIL+GQSNMAGRGGV KD     +VWD  +PPE  P+ SILRL+ + +
Sbjct: 13  PEIQSPIPPNQIFILSGQSNMAGRGGVFKDHHNNRWVWDKILPPECAPNSSILRLSADLR 72

Query: 77  WEVAREPVHEGIDINKTVGVGSAIAFAHQLQAKGGLKVGSVGLVPCARGGTLIQQWIKNP 136
           WE A EP+H  ID  K  GVG  +AFA+ ++ +       +GLVPCA GGT I++W +  
Sbjct: 73  WEEAHEPLHVDIDTGKVCGVGPGMAFANAVKNRLETDSAVIGLVPCASGGTAIKEWER-- 132

Query: 137 SNPNGTFYKNFIERIKASDREGGVVRALFWLQGESDAASSGTAGRYKDNLKKFFTDIRND 196
                  Y+  ++R + S + GG ++A+ W QGESD      A  Y +N+ +   ++R+D
Sbjct: 133 ---GSHLYERMVKRTEESRKCGGEIKAVLWYQGESDVLDIHDAESYGNNMDRLIKNLRHD 192

Query: 197 IKPRFLPIILVKIAVYDTYMKHDTHDLPAVRAAEDAVQRELPDVVTIDSLQLVNTATGEG 256
           +    LPII V IA    Y+          +  E  +  +L +VV +D       A G  
Sbjct: 193 LNLPSLPIIQVAIASGGGYID---------KVREAQLGLKLSNVVCVD-------AKGLP 252

Query: 257 FNQDHGHFNVKTEIALGKWLADTYLSHY 285
              D+ H   + ++ LG  LA  YLS++
Sbjct: 253 LKSDNLHLTTEAQVQLGLSLAQAYLSNF 259

BLAST of Sgr028610 vs. ExPASy TrEMBL
Match: A0A6J1BQ38 (probable carbohydrate esterase At4g34215 OS=Momordica charantia OX=3673 GN=LOC111004778 PE=4 SV=1)

HSP 1 Score: 526.6 bits (1355), Expect = 6.7e-146
Identity = 259/288 (89.93%), Positives = 272/288 (94.44%), Query Frame = 0

Query: 1   MALLRLSILLCMMLFGPSLSGATSPKNIFILAGQSNMAGRGGVEKDRWGKDFVWDGYVPP 60
           MALLRLSI+LCMMLFGPSLSGATSPKNIFILAGQSNMAGRGGVEK+R G D  WDGYVPP
Sbjct: 1   MALLRLSIMLCMMLFGPSLSGATSPKNIFILAGQSNMAGRGGVEKNRTG-DLEWDGYVPP 60

Query: 61  EAQPDPSILRLNPERQWEVAREPVHEGIDINKTVGVGSAIAFAHQLQAKGGLKVGSVGLV 120
           E+QPDPSILRLNPERQWEVAREPVH GIDI KTVGVG AIAFAHQLQAKGG KVGSVGLV
Sbjct: 61  ESQPDPSILRLNPERQWEVAREPVHRGIDIGKTVGVGPAIAFAHQLQAKGGSKVGSVGLV 120

Query: 121 PCARGGTLIQQWIKNPSNPNGTFYKNFIERIKASDREGGVVRALFWLQGESDAASSGTAG 180
           PCARGGTLI+QW+KNPSNPN TFYKNFIERI+ASDREGGVVRALFWLQGESDAASS TA 
Sbjct: 121 PCARGGTLIEQWVKNPSNPNATFYKNFIERIQASDREGGVVRALFWLQGESDAASSDTAE 180

Query: 181 RYKDNLKKFFTDIRNDIKPRFLPIILVKIAVYDTYMKHDTHDLPAVRAAEDAVQRELPDV 240
           RYK+NLKKFFTDIRNDIKPR LPIILVKIAVYDT+MKHDTHDLPAVRAAEDAVQRELP+V
Sbjct: 181 RYKNNLKKFFTDIRNDIKPRVLPIILVKIAVYDTFMKHDTHDLPAVRAAEDAVQRELPNV 240

Query: 241 VTIDSLQLVNTATGEGFNQDHGHFNVKTEIALGKWLADTYLSHYGHLL 289
           VTID+L+LVNT T EGFN D GHFN+KTEIALGKWLADTYLS+YGHLL
Sbjct: 241 VTIDALKLVNTTTAEGFNLDRGHFNIKTEIALGKWLADTYLSNYGHLL 287

BLAST of Sgr028610 vs. ExPASy TrEMBL
Match: A0A6J1DUH7 (probable carbohydrate esterase At4g34215 OS=Momordica charantia OX=3673 GN=LOC111024143 PE=4 SV=1)

HSP 1 Score: 516.2 bits (1328), Expect = 9.1e-143
Identity = 257/288 (89.24%), Positives = 267/288 (92.71%), Query Frame = 0

Query: 1   MALLRLSILLCMMLFGPSLSGATSPKNIFILAGQSNMAGRGGVEKDRWGKDFVWDGYVPP 60
           M +LRLSILLCMML GPSLS ATSPKNIFILAGQSNMAGRGGVE DR G   VWDGYVPP
Sbjct: 1   MVILRLSILLCMMLCGPSLSRATSPKNIFILAGQSNMAGRGGVENDRPG-HLVWDGYVPP 60

Query: 61  EAQPDPSILRLNPERQWEVAREPVHEGIDINKTVGVGSAIAFAHQLQAKGGLKVGSVGLV 120
           EAQPDPSILRLNPERQWEVAREPVHEGIDINKTVGVG AIAFA QLQAKGG KVGSVGLV
Sbjct: 61  EAQPDPSILRLNPERQWEVAREPVHEGIDINKTVGVGPAIAFARQLQAKGGSKVGSVGLV 120

Query: 121 PCARGGTLIQQWIKNPSNPNGTFYKNFIERIKASDREGGVVRALFWLQGESDAASSGTAG 180
           PCARGGTLI+QW+KNPSN + TFYKNFIERI+ASDREGGVVRALFWLQGESDAAS  TA 
Sbjct: 121 PCARGGTLIEQWVKNPSNTSATFYKNFIERIQASDREGGVVRALFWLQGESDAASRDTAN 180

Query: 181 RYKDNLKKFFTDIRNDIKPRFLPIILVKIAVYDTYMKHDTHDLPAVRAAEDAVQRELPDV 240
           RYK+NLKKFFTDIRNDIKPRFLPIILVKIAVYDT+MKHDTHDLPAVRAAEDAVQRELP+V
Sbjct: 181 RYKNNLKKFFTDIRNDIKPRFLPIILVKIAVYDTFMKHDTHDLPAVRAAEDAVQRELPNV 240

Query: 241 VTIDSLQLVNTATGEGFNQDHGHFNVKTEIALGKWLADTYLSHYGHLL 289
           VTIDSL+LVNT T EGFN D GHFN+KTEIALGKWLADTYLS YGHLL
Sbjct: 241 VTIDSLKLVNTTTVEGFNLDRGHFNIKTEIALGKWLADTYLSQYGHLL 287

BLAST of Sgr028610 vs. ExPASy TrEMBL
Match: A0A6J1DW87 (probable carbohydrate esterase At4g34215 OS=Momordica charantia OX=3673 GN=LOC111025037 PE=4 SV=1)

HSP 1 Score: 516.2 bits (1328), Expect = 9.1e-143
Identity = 254/288 (88.19%), Positives = 271/288 (94.10%), Query Frame = 0

Query: 1   MALLRLSILLCMMLFGPSLSGATSPKNIFILAGQSNMAGRGGVEKDRWGKDFVWDGYVPP 60
           MALLRLSILLCM+L GPSLSGATSPKNIFILAGQSNMAGRGGVEKDR G + VWDGYVPP
Sbjct: 1   MALLRLSILLCMILCGPSLSGATSPKNIFILAGQSNMAGRGGVEKDRSG-NLVWDGYVPP 60

Query: 61  EAQPDPSILRLNPERQWEVAREPVHEGIDINKTVGVGSAIAFAHQLQAKGGLKVGSVGLV 120
           EAQPDPSILRLNPERQWEVAREPVHEGIDINKTVGVGSAIAFAHQLQAKGG KVGSVGLV
Sbjct: 61  EAQPDPSILRLNPERQWEVAREPVHEGIDINKTVGVGSAIAFAHQLQAKGGSKVGSVGLV 120

Query: 121 PCARGGTLIQQWIKNPSNPNGTFYKNFIERIKASDREGGVVRALFWLQGESDAASSGTAG 180
           PCARGGTLI+QW+KNPSN + TFYKNFIERI+ASDREGGVVRALFWLQGESDAAS  TA 
Sbjct: 121 PCARGGTLIEQWVKNPSNTSATFYKNFIERIQASDREGGVVRALFWLQGESDAASRDTAS 180

Query: 181 RYKDNLKKFFTDIRNDIKPRFLPIILVKIAVYDTYMKHDTHDLPAVRAAEDAVQRELPDV 240
           RYK+NLKKFFTDIRNDIKPRFLPII++KIA YDT+++HDTHDLP VRAAEDAVQREL ++
Sbjct: 181 RYKNNLKKFFTDIRNDIKPRFLPIIVIKIAGYDTFIEHDTHDLPTVRAAEDAVQRELLNI 240

Query: 241 VTIDSLQLVNTATGEGFNQDHGHFNVKTEIALGKWLADTYLSHYGHLL 289
           VTID+L+LVNT TGEGFN D GH+NVKTEIALGKWLADTYLSHYG LL
Sbjct: 241 VTIDALKLVNTITGEGFNLDRGHYNVKTEIALGKWLADTYLSHYGQLL 287

BLAST of Sgr028610 vs. ExPASy TrEMBL
Match: A0A6J1DWJ1 (probable carbohydrate esterase At4g34215 OS=Momordica charantia OX=3673 GN=LOC111025047 PE=4 SV=1)

HSP 1 Score: 506.1 bits (1302), Expect = 9.4e-140
Identity = 246/277 (88.81%), Positives = 261/277 (94.22%), Query Frame = 0

Query: 12  MMLFGPSLSGATSPKNIFILAGQSNMAGRGGVEKDRWGKDFVWDGYVPPEAQPDPSILRL 71
           M+L GPSLSGATSPKNIFILAGQSNMAGRGGVEKDR G + VWDGYVPPEAQPDPSILRL
Sbjct: 1   MILCGPSLSGATSPKNIFILAGQSNMAGRGGVEKDRSG-NLVWDGYVPPEAQPDPSILRL 60

Query: 72  NPERQWEVAREPVHEGIDINKTVGVGSAIAFAHQLQAKGGLKVGSVGLVPCARGGTLIQQ 131
           NPERQWEVAREPVHEGIDINKTVGVGSAIAFAHQLQAKGG KVGSVGLVPCARGGTLI+Q
Sbjct: 61  NPERQWEVAREPVHEGIDINKTVGVGSAIAFAHQLQAKGGSKVGSVGLVPCARGGTLIEQ 120

Query: 132 WIKNPSNPNGTFYKNFIERIKASDREGGVVRALFWLQGESDAASSGTAGRYKDNLKKFFT 191
           W+KNPSN + TFYKNFIERI+ASDREGGVVRALFWLQGESDAAS  TA RYK+NLKKFFT
Sbjct: 121 WVKNPSNTSATFYKNFIERIQASDREGGVVRALFWLQGESDAASRDTASRYKNNLKKFFT 180

Query: 192 DIRNDIKPRFLPIILVKIAVYDTYMKHDTHDLPAVRAAEDAVQRELPDVVTIDSLQLVNT 251
           DIRNDIKPRFLPII+VKIA YDT+M+HDTHDLPAVRAAEDAVQRELP++VTID+L+LVNT
Sbjct: 181 DIRNDIKPRFLPIIVVKIAGYDTFMEHDTHDLPAVRAAEDAVQRELPNIVTIDALKLVNT 240

Query: 252 ATGEGFNQDHGHFNVKTEIALGKWLADTYLSHYGHLL 289
            TGEGFN D GH+N KTEIALGKWLADTYLSHYG LL
Sbjct: 241 VTGEGFNSDRGHYNFKTEIALGKWLADTYLSHYGQLL 276

BLAST of Sgr028610 vs. ExPASy TrEMBL
Match: A0A6J1DVM5 (probable carbohydrate esterase At4g34215 OS=Momordica charantia OX=3673 GN=LOC111024870 PE=4 SV=1)

HSP 1 Score: 499.2 bits (1284), Expect = 1.1e-137
Identity = 247/288 (85.76%), Positives = 262/288 (90.97%), Query Frame = 0

Query: 1   MALLRLSILLCMMLFGPSLSGATSPKNIFILAGQSNMAGRGGVEKDRWGKDFVWDGYVPP 60
           MALLRL ILLCMML G SLSGATSPKNIFILAGQSNMAGRGGVE DR G   VWD YVPP
Sbjct: 1   MALLRLLILLCMMLRGTSLSGATSPKNIFILAGQSNMAGRGGVENDRPG-HLVWDRYVPP 60

Query: 61  EAQPDPSILRLNPERQWEVAREPVHEGIDINKTVGVGSAIAFAHQLQAKGGLKVGSVGLV 120
           EAQPDPSILRLNP+RQWEVAREPVHEGIDINKTVGVG AI+FAHQLQAKGG KVGSVGLV
Sbjct: 61  EAQPDPSILRLNPDRQWEVAREPVHEGIDINKTVGVGPAISFAHQLQAKGGSKVGSVGLV 120

Query: 121 PCARGGTLIQQWIKNPSNPNGTFYKNFIERIKASDREGGVVRALFWLQGESDAASSGTAG 180
           PCARGGTLI+QW+KNP N + TFYKNFIERI+ASDREGGVVRAL W QG SDAAS  TA 
Sbjct: 121 PCARGGTLIEQWLKNPGNTSATFYKNFIERIQASDREGGVVRALLWFQGASDAASRDTAN 180

Query: 181 RYKDNLKKFFTDIRNDIKPRFLPIILVKIAVYDTYMKHDTHDLPAVRAAEDAVQRELPDV 240
           RYK+NLKKFFTDIRND+KPRFLPII+V+ AVYDT+MKHDTHDLPAVRAA+DAVQRELP+V
Sbjct: 181 RYKNNLKKFFTDIRNDVKPRFLPIIVVQSAVYDTFMKHDTHDLPAVRAAQDAVQRELPNV 240

Query: 241 VTIDSLQLVNTATGEGFNQDHGHFNVKTEIALGKWLADTYLSHYGHLL 289
           VTIDSL+LVNT T EGFN D GHFN KTEIALGKWLADTYLSHYG+LL
Sbjct: 241 VTIDSLKLVNTTTSEGFNLDRGHFNTKTEIALGKWLADTYLSHYGNLL 287

BLAST of Sgr028610 vs. TAIR 10
Match: AT3G53010.1 (Domain of unknown function (DUF303) )

HSP 1 Score: 179.5 bits (454), Expect = 3.9e-45
Identity = 113/272 (41.54%), Positives = 153/272 (56.25%), Query Frame = 0

Query: 17  PSLSGATSPKN--IFILAGQSNMAGRGGVEKDRWGKDFVWDGYVPPEAQPDPSILRLNPE 76
           P L   T  +N  IFILAGQSNMAGRGGV  D      VWDG +PPE + +PSILRL  +
Sbjct: 18  PHLQSQTITRNISIFILAGQSNMAGRGGVYNDTATNTTVWDGVIPPECRSNPSILRLTSK 77

Query: 77  RQWEVAREPVHEGIDINKTVGVGSAIAFAHQLQAKGGLKVGSVGLVPCARGGTLIQQWIK 136
            +W+ A+EP+H  IDINKT GVG  + FA+++      + G VGLVPC+ GGT + QW K
Sbjct: 78  LEWKEAKEPLHVDIDINKTNGVGPGMPFANRVVN----RFGQVGLVPCSIGGTKLSQWQK 137

Query: 137 NPSNPNGTF-YKNFIERIKA--SDREGGVVRALFWLQGESDAASSGTAGRYKDNLKKFFT 196
                 G F Y+  ++R KA  +   GG  RA+ W QGESD      A  YK  L KFF+
Sbjct: 138 ------GEFLYEETVKRAKAAMASGGGGSYRAVLWYQGESDTVDMVDASVYKKRLVKFFS 197

Query: 197 DIRNDIKPRFLPIILVKIAV-YDTYMKHDTHDLPAVRAAEDAVQRELPDVVTIDSLQLVN 256
           D+RND++   LPII V +A     Y       L AVR A+  ++ +L +V  +D      
Sbjct: 198 DLRNDLQHPNLPIIQVALATGAGPY-------LDAVRKAQ--LKTDLENVYCVD------ 257

Query: 257 TATGEGFNQDHGHFNVKTEIALGKWLADTYLS 283
            A G     D  H    +++ LG  +A+++L+
Sbjct: 258 -ARGLPLEPDGLHLTTSSQVQLGHMIAESFLA 263

BLAST of Sgr028610 vs. TAIR 10
Match: AT4G34215.1 (Domain of unknown function (DUF303) )

HSP 1 Score: 175.6 bits (444), Expect = 5.6e-44
Identity = 98/268 (36.57%), Positives = 143/268 (53.36%), Query Frame = 0

Query: 17  PSLSGATSPKNIFILAGQSNMAGRGGVEKDRWGKDFVWDGYVPPEAQPDPSILRLNPERQ 76
           P +     P  IFIL+GQSNMAGRGGV KD     +VWD  +PPE  P+ SILRL+ + +
Sbjct: 13  PEIQSPIPPNQIFILSGQSNMAGRGGVFKDHHNNRWVWDKILPPECAPNSSILRLSADLR 72

Query: 77  WEVAREPVHEGIDINKTVGVGSAIAFAHQLQAKGGLKVGSVGLVPCARGGTLIQQWIKNP 136
           WE A EP+H  ID  K  GVG  +AFA+ ++ +       +GLVPCA GGT I++W +  
Sbjct: 73  WEEAHEPLHVDIDTGKVCGVGPGMAFANAVKNRLETDSAVIGLVPCASGGTAIKEWER-- 132

Query: 137 SNPNGTFYKNFIERIKASDREGGVVRALFWLQGESDAASSGTAGRYKDNLKKFFTDIRND 196
                  Y+  ++R + S + GG ++A+ W QGESD      A  Y +N+ +   ++R+D
Sbjct: 133 ---GSHLYERMVKRTEESRKCGGEIKAVLWYQGESDVLDIHDAESYGNNMDRLIKNLRHD 192

Query: 197 IKPRFLPIILVKIAVYDTYMKHDTHDLPAVRAAEDAVQRELPDVVTIDSLQLVNTATGEG 256
           +    LPII V IA    Y+          +  E  +  +L +VV +D       A G  
Sbjct: 193 LNLPSLPIIQVAIASGGGYID---------KVREAQLGLKLSNVVCVD-------AKGLP 252

Query: 257 FNQDHGHFNVKTEIALGKWLADTYLSHY 285
              D+ H   + ++ LG  LA  YLS++
Sbjct: 253 LKSDNLHLTTEAQVQLGLSLAQAYLSNF 259

BLAST of Sgr028610 vs. TAIR 10
Match: AT4G34215.2 (Domain of unknown function (DUF303) )

HSP 1 Score: 175.6 bits (444), Expect = 5.6e-44
Identity = 98/268 (36.57%), Positives = 143/268 (53.36%), Query Frame = 0

Query: 17  PSLSGATSPKNIFILAGQSNMAGRGGVEKDRWGKDFVWDGYVPPEAQPDPSILRLNPERQ 76
           P +     P  IFIL+GQSNMAGRGGV KD     +VWD  +PPE  P+ SILRL+ + +
Sbjct: 13  PEIQSPIPPNQIFILSGQSNMAGRGGVFKDHHNNRWVWDKILPPECAPNSSILRLSADLR 72

Query: 77  WEVAREPVHEGIDINKTVGVGSAIAFAHQLQAKGGLKVGSVGLVPCARGGTLIQQWIKNP 136
           WE A EP+H  ID  K  GVG  +AFA+ ++ +       +GLVPCA GGT I++W +  
Sbjct: 73  WEEAHEPLHVDIDTGKVCGVGPGMAFANAVKNRLETDSAVIGLVPCASGGTAIKEWER-- 132

Query: 137 SNPNGTFYKNFIERIKASDREGGVVRALFWLQGESDAASSGTAGRYKDNLKKFFTDIRND 196
                  Y+  ++R + S + GG ++A+ W QGESD      A  Y +N+ +   ++R+D
Sbjct: 133 ---GSHLYERMVKRTEESRKCGGEIKAVLWYQGESDVLDIHDAESYGNNMDRLIKNLRHD 192

Query: 197 IKPRFLPIILVKIAVYDTYMKHDTHDLPAVRAAEDAVQRELPDVVTIDSLQLVNTATGEG 256
           +    LPII V IA    Y+          +  E  +  +L +VV +D       A G  
Sbjct: 193 LNLPSLPIIQVAIASGGGYID---------KVREAQLGLKLSNVVCVD-------AKGLP 252

Query: 257 FNQDHGHFNVKTEIALGKWLADTYLSHY 285
              D+ H   + ++ LG  LA  YLS++
Sbjct: 253 LKSDNLHLTTEAQVQLGLSLAQAYLSNF 259

The following BLAST results are available for this feature:
BLAST vs. NCBI nr
Match Name      E-value   Identity (%)  Description
XP_022131651.1  1.4e-145  89.93         probable carbohydrate esterase At4g34215 [Momordica charantia]
XP_022157447.1  1.9e-142  89.24         probable carbohydrate esterase At4g34215 [Momordica charantia]
XP_022158585.1  1.9e-142  88.19         probable carbohydrate esterase At4g34215 [Momordica charantia]
XP_022158593.1  1.9e-139  88.81         probable carbohydrate esterase At4g34215 [Momordica charantia]
XP_022158365.1  2.4e-137  85.76         probable carbohydrate esterase At4g34215 [Momordica charantia]

BLAST vs. ExPASy Swiss-Prot
Match Name      E-value   Identity (%)  Description
Q8L9J9          7.9e-43   36.57         Probable carbohydrate esterase At4g34215 OS=Arabidopsis thaliana OX=3702 GN=At4g...

BLAST vs. ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A6J1BQ38      6.7e-146  89.93         probable carbohydrate esterase At4g34215 OS=Momordica charantia OX=3673 GN=LOC11...
A0A6J1DUH7      9.1e-143  89.24         probable carbohydrate esterase At4g34215 OS=Momordica charantia OX=3673 GN=LOC11...
A0A6J1DW87      9.1e-143  88.19         probable carbohydrate esterase At4g34215 OS=Momordica charantia OX=3673 GN=LOC11...
A0A6J1DWJ1      9.4e-140  88.81         probable carbohydrate esterase At4g34215 OS=Momordica charantia OX=3673 GN=LOC11...
A0A6J1DVM5      1.1e-137  85.76         probable carbohydrate esterase At4g34215 OS=Momordica charantia OX=3673 GN=LOC11...

BLAST vs. TAIR 10
Match Name      E-value   Identity (%)  Description
AT3G53010.1     3.9e-45   41.54         Domain of unknown function (DUF303)
AT4G34215.1     5.6e-44   36.57         Domain of unknown function (DUF303)
AT4G34215.2     5.6e-44   36.57         Domain of unknown function (DUF303)

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                  Source       Source Term     Source Description                   Alignment
IPR005181  Sialate O-acetylesterase domain  PFAM         PF03629         SASA                                 coord: 26..281; e-value: 4.8E-65; score: 219.3
IPR036514  SGNH hydrolase superfamily       GENE3D       3.40.50.1110    SGNH hydrolase                       coord: 24..285; e-value: 7.1E-56; score: 191.7
None       No IPR available                 PANTHER      PTHR31988:SF19  BNACNNG62850D PROTEIN                coord: 23..284
None       No IPR available                 PANTHER      PTHR31988       ESTERASE, PUTATIVE (DUF303)-RELATED  coord: 23..284
None       No IPR available                 SUPERFAMILY  52266           SGNH hydrolase                       coord: 27..285
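
The coord ranges in the InterPro annotations can be turned into a rough measure of how much of the protein each signature covers. This is a sketch only: it assumes the ranges are 1-based and inclusive, and it takes the protein length as 288 residues (the length of the protein sequence on this page). The names `HITS` and `coverage` are illustrative.

```python
PROTEIN_LENGTH = 288  # residues in the protein sequence listed on this page

# (source, source term, start, end) copied from the "coord" fields above.
# Assumption: coordinates are 1-based and inclusive.
HITS = [
    ("PFAM", "PF03629", 26, 281),
    ("GENE3D", "3.40.50.1110", 24, 285),
    ("PANTHER", "PTHR31988", 23, 284),
    ("SUPERFAMILY", "52266", 27, 285),
]


def coverage(start: int, end: int, length: int = PROTEIN_LENGTH):
    """Return (span in residues, percent of the protein covered)."""
    span = end - start + 1
    return span, round(100.0 * span / length, 1)


for source, term, start, end in HITS:
    span, pct = coverage(start, end)
    print(f"{source:<12}{term:<14}{start:>3}..{end:<4}{span:>4} aa {pct:>5}%")
```

Under these assumptions every signature spans roughly 89 to 91% of the protein, leaving only about 25 residues at the N-terminus uncovered.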

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr028610.1   Sgr028610.1  mRNA