Sgr016842 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr016842
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: SASA domain-containing protein
Location: tig00153010: 1846603 .. 1847471 (-)
RNA-Seq Expression: Sgr016842
Synteny: Sgr016842
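
The location field above can be read programmatically. The sketch below is only an illustration: the function name `parse_location` and the regular expression are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state explicitly.

```python
import re

# Hypothetical pattern for strings such as "tig00153010: 1846603 .. 1847471 (-)".
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into (contig, start, end, strand)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    contig, start, end, strand = match.groups()
    return contig, int(start), int(end), strand


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end, strand = parse_location("tig00153010: 1846603 .. 1847471 (-)")
print(contig, strand, span_length(start, end))  # tig00153010 - 869
```

Under that assumption the gene spans 869 bp on the minus strand of contig tig00153010.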
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACTGATTTGAAATATCCAGACAACATTTTCATTCTCGGTGGCCAGAGTAACATGGCCGGCCGAGGTGGTGTATCGAAAGACCCAATCACAGACAAGAATAAATGGGACGGATATATCCCACCAGAGTCTCAATCTCACAAGTCAATCCTTCGATTGAATGCTGATTTGAAATGGGAACAAGCTCGGGAACCACTTCATTGGGATATTGATTACAACAAGACCAATGGGGTTGGACCGGGAATGGCTTTTGCCAATGAGCTTTTGGCCAAAGCTGGCGAGAGCATCGGTGTCATCGGTCTCGTTCCGTGTGCCATTGGAGGAACTCACTTGAGAGAATGGATTAAAGGAACTGTTTATTACACCAGATTGGTCAACCGAATTAAAGCTTCGGAAAAACATGGGGGAAAAGTCCAAGCATTTTTCTGGTATCAAGGAGAGTCTGATGCTTCAGTGGAAGTAGAATCTAAGTTGTATGAAGAAAACCTTACTAAGTTCTTCACTGACCTGCGTAAAGACTTGAACCACCCAGAACTACCCATCATCCTGGTTTGTCATCTTTGTCTCTCTCAATATCTTTGTGTTTTTCAGTGCTAAAGTCTTTGTAAAATATGCAGATGAAGATAGTAACTCACGATATTTTTACAAGTCCAATTATAAACTACAAGGAAGATGTATGGAAGGCTCAGGAGGCAGTCACACACAAGCTACTGAATGTAAGAATGGTGGACGCCATGGAAGCAGTTGGCAACCTTGAGCAAGGGCTTAACGAAGATAAAGGTCATCTCAATGTCAAATCTGAAGTGAAATTGGGCAAAATGTTAGCTCATGACTTCTACTCAAATTTCGACCACAAGCTCACTTGCTAA

mRNA sequence

ATGACTGATTTGAAATATCCAGACAACATTTTCATTCTCGGTGGCCAGAGTAACATGGCCGGCCGAGGTGGTGTATCGAAAGACCCAATCACAGACAAGAATAAATGGGACGGATATATCCCACCAGAGTCTCAATCTCACAAGTCAATCCTTCGATTGAATGCTGATTTGAAATGGGAACAAGCTCGGGAACCACTTCATTGGGATATTGATTACAACAAGACCAATGGGGTTGGACCGGGAATGGCTTTTGCCAATGAGCTTTTGGCCAAAGCTGGCGAGAGCATCGGTGTCATCGGTCTCGTTCCGTGTGCCATTGGAGGAACTCACTTGAGAGAATGGATTAAAGGAACTGTTTATTACACCAGATTGGTCAACCGAATTAAAGCTTCGGAAAAACATGGGGGAAAAGTCCAAGCATTTTTCTGGTATCAAGGAGAGTCTGATGCTTCAGTGGAAGTAGAATCTAAGTTGTATGAAGAAAACCTTACTAAGTTCTTCACTGACCTGCGTAAAGACTTGAACCACCCAGAACTACCCATCATCCTGATGAAGATAGTAACTCACGATATTTTTACAAGTCCAATTATAAACTACAAGGAAGATGTATGGAAGGCTCAGGAGGCAGTCACACACAAGCTACTGAATGTAAGAATGGTGGACGCCATGGAAGCAGTTGGCAACCTTGAGCAAGGGCTTAACGAAGATAAAGGTCATCTCAATGTCAAATCTGAAGTGAAATTGGGCAAAATGTTAGCTCATGACTTCTACTCAAATTTCGACCACAAGCTCACTTGCTAA
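
The gene sequence and the mRNA sequence above differ by one internal segment, which can be recovered by comparing the two. The sketch below is a simplified illustration, not the annotation pipeline's method: `find_single_intron` is a hypothetical helper that assumes exactly one intron, and the two inputs are short windows copied from around the junction in the sequences above rather than the full sequences.

```python
def find_single_intron(genomic, mrna):
    """Return the single genomic segment missing from the mRNA.

    Assumes the mRNA equals the genomic sequence with exactly one
    internal segment removed.
    """
    # Length of the shared prefix (end of the upstream exon).
    prefix = 0
    while prefix < len(mrna) and genomic[prefix] == mrna[prefix]:
        prefix += 1
    # Length of the shared suffix, capped so that prefix and suffix
    # never overlap within the mRNA.
    max_suffix = len(mrna) - prefix
    suffix = 0
    while suffix < max_suffix and genomic[-1 - suffix] == mrna[-1 - suffix]:
        suffix += 1
    if prefix + suffix != len(mrna):
        raise ValueError("sequences do not fit a single-intron model")
    return genomic[prefix:len(genomic) - suffix]


# Windows around the junction, copied from the gene and mRNA sequences above.
genomic_window = (
    "CCCATCATCCTG"
    "GTTTGTCATCTTTGTCTCTCTCAATATCTTTGTGTTTTTCAGTGCTAAAGTCTTTGTAAAATATGCAG"
    "ATGAAGATAGTAACTCAC"
)
mrna_window = "CCCATCATCCTGATGAAGATAGTAACTCAC"

intron = find_single_intron(genomic_window, mrna_window)
print(intron[:2], intron[-2:], len(intron))
```

The recovered segment starts with GT and ends with AG, the usual intron boundary dinucleotides. The cap on the suffix length matters here because the base before the intron and the last base of the intron are both G, so an uncapped comparison would overrun the junction.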

Coding sequence (CDS)

ATGACTGATTTGAAATATCCAGACAACATTTTCATTCTCGGTGGCCAGAGTAACATGGCCGGCCGAGGTGGTGTATCGAAAGACCCAATCACAGACAAGAATAAATGGGACGGATATATCCCACCAGAGTCTCAATCTCACAAGTCAATCCTTCGATTGAATGCTGATTTGAAATGGGAACAAGCTCGGGAACCACTTCATTGGGATATTGATTACAACAAGACCAATGGGGTTGGACCGGGAATGGCTTTTGCCAATGAGCTTTTGGCCAAAGCTGGCGAGAGCATCGGTGTCATCGGTCTCGTTCCGTGTGCCATTGGAGGAACTCACTTGAGAGAATGGATTAAAGGAACTGTTTATTACACCAGATTGGTCAACCGAATTAAAGCTTCGGAAAAACATGGGGGAAAAGTCCAAGCATTTTTCTGGTATCAAGGAGAGTCTGATGCTTCAGTGGAAGTAGAATCTAAGTTGTATGAAGAAAACCTTACTAAGTTCTTCACTGACCTGCGTAAAGACTTGAACCACCCAGAACTACCCATCATCCTGATGAAGATAGTAACTCACGATATTTTTACAAGTCCAATTATAAACTACAAGGAAGATGTATGGAAGGCTCAGGAGGCAGTCACACACAAGCTACTGAATGTAAGAATGGTGGACGCCATGGAAGCAGTTGGCAACCTTGAGCAAGGGCTTAACGAAGATAAAGGTCATCTCAATGTCAAATCTGAAGTGAAATTGGGCAAAATGTTAGCTCATGACTTCTACTCAAATTTCGACCACAAGCTCACTTGCTAA

Protein sequence

MTDLKYPDNIFILGGQSNMAGRGGVSKDPITDKNKWDGYIPPESQSHKSILRLNADLKWEQAREPLHWDIDYNKTNGVGPGMAFANELLAKAGESIGVIGLVPCAIGGTHLREWIKGTVYYTRLVNRIKASEKHGGKVQAFFWYQGESDASVEVESKLYEENLTKFFTDLRKDLNHPELPIILMKIVTHDIFTSPIINYKEDVWKAQEAVTHKLLNVRMVDAMEAVGNLEQGLNEDKGHLNVKSEVKLGKMLAHDFYSNFDHKLTC
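
The protein sequence corresponds to a codon-by-codon translation of the CDS. The sketch below illustrates this with the standard genetic code on the first and last few codons of the CDS above; the `translate` helper is hypothetical, and using the standard code is an assumption for a nuclear plant gene rather than something stated on this page.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        residues.append(aa)
    return "".join(residues)


# First nine codons and last five codons of the CDS shown above.
print(translate("ATGACTGATTTGAAATATCCAGACAAC"))  # MTDLKYPDN
print(translate("AAGCTCACTTGCTAA"))              # KLTC
```

Both fragments agree with the start (MTDLKYPDN) and end (KLTC) of the protein sequence listed above.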
Homology
BLAST of Sgr016842 vs. NCBI nr
Match: XP_022141681.1 (probable carbohydrate esterase At4g34215 [Momordica charantia])

HSP 1 Score: 455.3 bits (1170), Expect = 3.6e-124
Identity = 217/259 (83.78%), Positives = 233/259 (89.96%), Query Frame = 0

Query: 7   PDNIFILGGQSNMAGRGGVSKDPITDKNKWDGYIPPESQSHKSILRLNADLKWEQAREPL 66
           P NIFIL GQSNMAGRGGVSKDPIT+KN WDGYIPPESQS++SILRL ADL+WEQA EPL
Sbjct: 13  PGNIFILAGQSNMAGRGGVSKDPITEKNIWDGYIPPESQSNESILRLTADLRWEQASEPL 72

Query: 67  HWDIDYNKTNGVGPGMAFANELLAKAGESIGVIGLVPCAIGGTHLREWIKGTVYYTRLVN 126
           HWDIDY+KTNG+GPGMAFANEL  + G+SIGVIGLVPCAIGGTHLREWIKGT YYT+L++
Sbjct: 73  HWDIDYHKTNGIGPGMAFANELSVQVGKSIGVIGLVPCAIGGTHLREWIKGTAYYTKLID 132

Query: 127 RIKASEKHGGKVQAFFWYQGESDASVEVESKLYEENLTKFFTDLRKDLNHPELPIILMKI 186
           RIKASEKHGGKVQ F WYQGESDASVE ESK YE  LTKFFTDLR D N+ ELPIIL+KI
Sbjct: 133 RIKASEKHGGKVQGFLWYQGESDASVEEESKSYETELTKFFTDLRTDSNNLELPIILVKI 192

Query: 187 VTHDIFTSPIINYKEDVWKAQEAVTHKLLNVRMVDAMEAVGNLEQGLNEDKGHLNVKSEV 246
           VTHDIFTSPIIN+KEDVWKAQE VT KL NVRMVD  EAVGN E+GLNEDKGHLNVKSEV
Sbjct: 193 VTHDIFTSPIINFKEDVWKAQETVTEKLSNVRMVDGSEAVGNFEEGLNEDKGHLNVKSEV 252

Query: 247 KLGKMLAHDFYSNFDHKLT 266
           KLGKMLAH FYSNF H+LT
Sbjct: 253 KLGKMLAHAFYSNFSHRLT 271
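
The Identity and Positives fields in these reports can be related to the alignment's middle line: a letter marks an identical residue and `+` marks a conservative substitution. The sketch below is an illustration under that reading of the BLAST pairwise format; the helper names are hypothetical, and the midline string is the last block of the alignment above.

```python
def midline_counts(midline):
    """Count identities (letters) and positives (letters plus '+') in a midline."""
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives


def percent(matches, aligned_length):
    """Express a match count as a percentage of the alignment length."""
    return round(100.0 * matches / aligned_length, 2)


# Midline of the final block (query residues starting at 247).
print(midline_counts("KLGKMLAH FYSNF H+LT"))  # (16, 17)

# Totals reported for the whole HSP against XP_022141681.1.
print(percent(217, 259))  # 83.78
print(percent(233, 259))  # 89.96
```

Summing such per-block counts over all blocks of an HSP would give the totals reported in its Identity and Positives fields.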

BLAST of Sgr016842 vs. NCBI nr
Match: KAG7016422.1 (putative carbohydrate esterase, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 414.5 bits (1064), Expect = 7.1e-112
Identity = 195/256 (76.17%), Positives = 216/256 (84.38%), Query Frame = 0

Query: 7   PDNIFILGGQSNMAGRGGVSKDPITDKNKWDGYIPPESQSHKSILRLNADLKWEQAREPL 66
           P NIFIL GQSNMAGRGGVSKDP TDKN WDGYIPPESQ ++SI R  AD+ WEQAREPL
Sbjct: 6   PANIFILAGQSNMAGRGGVSKDPTTDKNVWDGYIPPESQPNQSIFRFTADMVWEQAREPL 65

Query: 67  HWDIDYNKTNGVGPGMAFANELLAKAGESIGVIGLVPCAIGGTHLREWIKGTVYYTRLVN 126
           HWDID  KTNGVGPGM FANELLAKAG SIG IGLVPCAIGG+HLREW+KGT  YT+LV 
Sbjct: 66  HWDIDVVKTNGVGPGMPFANELLAKAGPSIGTIGLVPCAIGGSHLREWVKGTDRYTKLVE 125

Query: 127 RIKASEKHGGKVQAFFWYQGESDASVEVESKLYEENLTKFFTDLRKDLNHPELPIILMKI 186
           R+K SE+HGGKV+ FFWYQGESDA+VE E+K YE  L+KFFTDLR D+NHP+LPIIL+KI
Sbjct: 126 RMKRSEEHGGKVKGFFWYQGESDAAVEEEAKSYERELSKFFTDLRADMNHPDLPIILVKI 185

Query: 187 VTHDIFTSPIINYKEDVWKAQEAVTHKLLNVRMVDAMEAVGNLEQGLNEDKGHLNVKSEV 246
           VTHD F SP   +KE+VW AQEAVT KL NVRMVD   AVGN ++GLNED+GHLNVKSEV
Sbjct: 186 VTHDFFISPDFEFKEEVWNAQEAVTQKLPNVRMVDGRVAVGNFDEGLNEDRGHLNVKSEV 245

Query: 247 KLGKMLAHDFYSNFDH 263
            LGKM AH +YSNF H
Sbjct: 246 NLGKMFAHSYYSNFAH 261

BLAST of Sgr016842 vs. NCBI nr
Match: XP_022939276.1 (probable carbohydrate esterase At4g34215 [Cucurbita moschata])

HSP 1 Score: 414.1 bits (1063), Expect = 9.3e-112
Identity = 194/256 (75.78%), Positives = 216/256 (84.38%), Query Frame = 0

Query: 7   PDNIFILGGQSNMAGRGGVSKDPITDKNKWDGYIPPESQSHKSILRLNADLKWEQAREPL 66
           P NIFIL GQSNMAGRGGVSKDP TDKN WDGYIPPESQ ++SI R  AD+ WEQAREPL
Sbjct: 6   PANIFILAGQSNMAGRGGVSKDPTTDKNVWDGYIPPESQPNQSIFRFTADMVWEQAREPL 65

Query: 67  HWDIDYNKTNGVGPGMAFANELLAKAGESIGVIGLVPCAIGGTHLREWIKGTVYYTRLVN 126
           HWDID  KTNGVGPGM FANELLAKAG SIG IGLVPCAIGG+HLREW+KGT  YT+LV 
Sbjct: 66  HWDIDVVKTNGVGPGMPFANELLAKAGPSIGTIGLVPCAIGGSHLREWVKGTNRYTKLVE 125

Query: 127 RIKASEKHGGKVQAFFWYQGESDASVEVESKLYEENLTKFFTDLRKDLNHPELPIILMKI 186
           R+K SE+HGGKV+ FFWYQGESDA+VE E+K YE  L+KFFTDLR D+NHP+LPIIL+KI
Sbjct: 126 RMKRSEEHGGKVKGFFWYQGESDAAVEEEAKSYERELSKFFTDLRADMNHPDLPIILVKI 185

Query: 187 VTHDIFTSPIINYKEDVWKAQEAVTHKLLNVRMVDAMEAVGNLEQGLNEDKGHLNVKSEV 246
           VTHD F SP   +KE+VW AQEAVT KL N+RMVD   AVGN ++GLNED+GHLNVKSEV
Sbjct: 186 VTHDFFISPDFEFKEEVWNAQEAVTQKLPNIRMVDGRVAVGNFDEGLNEDRGHLNVKSEV 245

Query: 247 KLGKMLAHDFYSNFDH 263
            LGKM AH +YSNF H
Sbjct: 246 NLGKMFAHSYYSNFAH 261

BLAST of Sgr016842 vs. NCBI nr
Match: KAG6578894.1 (putative carbohydrate esterase, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 413.3 bits (1061), Expect = 1.6e-111
Identity = 194/256 (75.78%), Positives = 216/256 (84.38%), Query Frame = 0

Query: 7   PDNIFILGGQSNMAGRGGVSKDPITDKNKWDGYIPPESQSHKSILRLNADLKWEQAREPL 66
           P NIFIL GQSNMAGRGGVSKDP TDKN WDGYIPPESQ ++SI R  AD+ WEQAREPL
Sbjct: 6   PANIFILAGQSNMAGRGGVSKDPTTDKNVWDGYIPPESQPNQSIFRFTADMVWEQAREPL 65

Query: 67  HWDIDYNKTNGVGPGMAFANELLAKAGESIGVIGLVPCAIGGTHLREWIKGTVYYTRLVN 126
           HWDID  KTNGVGPGM FANELLAKAG SIG IGLVPCAIGG+HLREW+KGT  YT+LV 
Sbjct: 66  HWDIDVVKTNGVGPGMPFANELLAKAGPSIGTIGLVPCAIGGSHLREWVKGTDRYTKLVE 125

Query: 127 RIKASEKHGGKVQAFFWYQGESDASVEVESKLYEENLTKFFTDLRKDLNHPELPIILMKI 186
           R+K SE+HGGKV+ FFWYQGESDA+VE E+K YE  L+KFFTDLR D+NHP+LPIIL+KI
Sbjct: 126 RMKRSEEHGGKVKGFFWYQGESDAAVEEEAKSYERELSKFFTDLRADMNHPDLPIILVKI 185

Query: 187 VTHDIFTSPIINYKEDVWKAQEAVTHKLLNVRMVDAMEAVGNLEQGLNEDKGHLNVKSEV 246
           VTHD F SP   +KE+VW AQEAVT KL NVRMVD   AVGN ++GLNED+GHLNV+SEV
Sbjct: 186 VTHDFFISPDFEFKEEVWNAQEAVTQKLPNVRMVDGRVAVGNFDEGLNEDRGHLNVRSEV 245

Query: 247 KLGKMLAHDFYSNFDH 263
            LGKM AH +YSNF H
Sbjct: 246 NLGKMFAHSYYSNFAH 261

BLAST of Sgr016842 vs. NCBI nr
Match: XP_022993914.1 (probable carbohydrate esterase At4g34215 [Cucurbita maxima])

HSP 1 Score: 412.9 bits (1060), Expect = 2.1e-111
Identity = 194/256 (75.78%), Positives = 216/256 (84.38%), Query Frame = 0

Query: 7   PDNIFILGGQSNMAGRGGVSKDPITDKNKWDGYIPPESQSHKSILRLNADLKWEQAREPL 66
           P NIFIL GQSNMAGRGGVSKDP TDKN WDGYIPPESQ ++SI R  AD+ WEQAREPL
Sbjct: 6   PANIFILAGQSNMAGRGGVSKDPTTDKNVWDGYIPPESQPNQSIFRFTADMVWEQAREPL 65

Query: 67  HWDIDYNKTNGVGPGMAFANELLAKAGESIGVIGLVPCAIGGTHLREWIKGTVYYTRLVN 126
           HWDID  KTNGVGPGM FANELLAKAG SIG IGLVPCAIGG+HLREW+KGT  YT+LV 
Sbjct: 66  HWDIDVVKTNGVGPGMPFANELLAKAGPSIGTIGLVPCAIGGSHLREWVKGTDRYTKLVE 125

Query: 127 RIKASEKHGGKVQAFFWYQGESDASVEVESKLYEENLTKFFTDLRKDLNHPELPIILMKI 186
           R+K SE+HGGKV+ FFWYQGESDA+VE E+K YE  L+KFFTDLR D+NHP+LPIIL+KI
Sbjct: 126 RMKRSEEHGGKVKGFFWYQGESDAAVEEEAKSYERELSKFFTDLRADVNHPDLPIILVKI 185

Query: 187 VTHDIFTSPIINYKEDVWKAQEAVTHKLLNVRMVDAMEAVGNLEQGLNEDKGHLNVKSEV 246
           VTHD F SP   +K++VW AQEAVT KL NVRMVD   AVGN ++GLNED+GHLNVKSEV
Sbjct: 186 VTHDFFISPDFEFKDEVWNAQEAVTQKLPNVRMVDGRVAVGNFDEGLNEDRGHLNVKSEV 245

Query: 247 KLGKMLAHDFYSNFDH 263
            LGKM AH +YSNF H
Sbjct: 246 NLGKMFAHSYYSNFAH 261

BLAST of Sgr016842 vs. ExPASy Swiss-Prot
Match: Q8L9J9 (Probable carbohydrate esterase At4g34215 OS=Arabidopsis thaliana OX=3702 GN=At4g34215 PE=1 SV=2)

HSP 1 Score: 209.9 bits (533), Expect = 3.5e-53
Identity = 118/254 (46.46%), Positives = 153/254 (60.24%), Query Frame = 0

Query: 7   PDNIFILGGQSNMAGRGGVSKDPITDKNKWDGYIPPESQSHKSILRLNADLKWEQAREPL 66
           P+ IFIL GQSNMAGRGGV KD   ++  WD  +PPE   + SILRL+ADL+WE+A EPL
Sbjct: 21  PNQIFILSGQSNMAGRGGVFKDHHNNRWVWDKILPPECAPNSSILRLSADLRWEEAHEPL 80

Query: 67  HWDIDYNKTNGVGPGMAFANELLAKAGESIGVIGLVPCAIGGTHLREWIKGTVYYTRLVN 126
           H DID  K  GVGPGMAFAN +  +      VIGLVPCA GGT ++EW +G+  Y R+V 
Sbjct: 81  HVDIDTGKVCGVGPGMAFANAVKNRLETDSAVIGLVPCASGGTAIKEWERGSHLYERMVK 140

Query: 127 RIKASEKHGGKVQAFFWYQGESDASVEVESKLYEENLTKFFTDLRKDLNHPELPIILMKI 186
           R + S K GG+++A  WYQGESD     +++ Y  N+ +   +LR DLN P LPII + I
Sbjct: 141 RTEESRKCGGEIKAVLWYQGESDVLDIHDAESYGNNMDRLIKNLRHDLNLPSLPIIQVAI 200

Query: 187 VTHDIFTSPIINYKEDVWKAQEAVTHKLLNVRMVDAMEAVGNLEQGLNEDKGHLNVKSEV 246
            +          Y + V +AQ  +  KL NV  VDA          L  D  HL  +++V
Sbjct: 201 ASGG-------GYIDKVREAQLGL--KLSNVVCVDAKGL------PLKSDNLHLTTEAQV 259

Query: 247 KLGKMLAHDFYSNF 261
           +LG  LA  + SNF
Sbjct: 261 QLGLSLAQAYLSNF 259

BLAST of Sgr016842 vs. ExPASy TrEMBL
Match: A0A6J1CJZ1 (probable carbohydrate esterase At4g34215 OS=Momordica charantia OX=3673 GN=LOC111011984 PE=4 SV=1)

HSP 1 Score: 455.3 bits (1170), Expect = 1.8e-124
Identity = 217/259 (83.78%), Positives = 233/259 (89.96%), Query Frame = 0

Query: 7   PDNIFILGGQSNMAGRGGVSKDPITDKNKWDGYIPPESQSHKSILRLNADLKWEQAREPL 66
           P NIFIL GQSNMAGRGGVSKDPIT+KN WDGYIPPESQS++SILRL ADL+WEQA EPL
Sbjct: 13  PGNIFILAGQSNMAGRGGVSKDPITEKNIWDGYIPPESQSNESILRLTADLRWEQASEPL 72

Query: 67  HWDIDYNKTNGVGPGMAFANELLAKAGESIGVIGLVPCAIGGTHLREWIKGTVYYTRLVN 126
           HWDIDY+KTNG+GPGMAFANEL  + G+SIGVIGLVPCAIGGTHLREWIKGT YYT+L++
Sbjct: 73  HWDIDYHKTNGIGPGMAFANELSVQVGKSIGVIGLVPCAIGGTHLREWIKGTAYYTKLID 132

Query: 127 RIKASEKHGGKVQAFFWYQGESDASVEVESKLYEENLTKFFTDLRKDLNHPELPIILMKI 186
           RIKASEKHGGKVQ F WYQGESDASVE ESK YE  LTKFFTDLR D N+ ELPIIL+KI
Sbjct: 133 RIKASEKHGGKVQGFLWYQGESDASVEEESKSYETELTKFFTDLRTDSNNLELPIILVKI 192

Query: 187 VTHDIFTSPIINYKEDVWKAQEAVTHKLLNVRMVDAMEAVGNLEQGLNEDKGHLNVKSEV 246
           VTHDIFTSPIIN+KEDVWKAQE VT KL NVRMVD  EAVGN E+GLNEDKGHLNVKSEV
Sbjct: 193 VTHDIFTSPIINFKEDVWKAQETVTEKLSNVRMVDGSEAVGNFEEGLNEDKGHLNVKSEV 252

Query: 247 KLGKMLAHDFYSNFDHKLT 266
           KLGKMLAH FYSNF H+LT
Sbjct: 253 KLGKMLAHAFYSNFSHRLT 271

BLAST of Sgr016842 vs. ExPASy TrEMBL
Match: A0A6J1FFF9 (probable carbohydrate esterase At4g34215 OS=Cucurbita moschata OX=3662 GN=LOC111445241 PE=4 SV=1)

HSP 1 Score: 414.1 bits (1063), Expect = 4.5e-112
Identity = 194/256 (75.78%), Positives = 216/256 (84.38%), Query Frame = 0

Query: 7   PDNIFILGGQSNMAGRGGVSKDPITDKNKWDGYIPPESQSHKSILRLNADLKWEQAREPL 66
           P NIFIL GQSNMAGRGGVSKDP TDKN WDGYIPPESQ ++SI R  AD+ WEQAREPL
Sbjct: 6   PANIFILAGQSNMAGRGGVSKDPTTDKNVWDGYIPPESQPNQSIFRFTADMVWEQAREPL 65

Query: 67  HWDIDYNKTNGVGPGMAFANELLAKAGESIGVIGLVPCAIGGTHLREWIKGTVYYTRLVN 126
           HWDID  KTNGVGPGM FANELLAKAG SIG IGLVPCAIGG+HLREW+KGT  YT+LV 
Sbjct: 66  HWDIDVVKTNGVGPGMPFANELLAKAGPSIGTIGLVPCAIGGSHLREWVKGTNRYTKLVE 125

Query: 127 RIKASEKHGGKVQAFFWYQGESDASVEVESKLYEENLTKFFTDLRKDLNHPELPIILMKI 186
           R+K SE+HGGKV+ FFWYQGESDA+VE E+K YE  L+KFFTDLR D+NHP+LPIIL+KI
Sbjct: 126 RMKRSEEHGGKVKGFFWYQGESDAAVEEEAKSYERELSKFFTDLRADMNHPDLPIILVKI 185

Query: 187 VTHDIFTSPIINYKEDVWKAQEAVTHKLLNVRMVDAMEAVGNLEQGLNEDKGHLNVKSEV 246
           VTHD F SP   +KE+VW AQEAVT KL N+RMVD   AVGN ++GLNED+GHLNVKSEV
Sbjct: 186 VTHDFFISPDFEFKEEVWNAQEAVTQKLPNIRMVDGRVAVGNFDEGLNEDRGHLNVKSEV 245

Query: 247 KLGKMLAHDFYSNFDH 263
            LGKM AH +YSNF H
Sbjct: 246 NLGKMFAHSYYSNFAH 261

BLAST of Sgr016842 vs. ExPASy TrEMBL
Match: A0A6J1K1G7 (probable carbohydrate esterase At4g34215 OS=Cucurbita maxima OX=3661 GN=LOC111489769 PE=4 SV=1)

HSP 1 Score: 412.9 bits (1060), Expect = 1.0e-111
Identity = 194/256 (75.78%), Positives = 216/256 (84.38%), Query Frame = 0

Query: 7   PDNIFILGGQSNMAGRGGVSKDPITDKNKWDGYIPPESQSHKSILRLNADLKWEQAREPL 66
           P NIFIL GQSNMAGRGGVSKDP TDKN WDGYIPPESQ ++SI R  AD+ WEQAREPL
Sbjct: 6   PANIFILAGQSNMAGRGGVSKDPTTDKNVWDGYIPPESQPNQSIFRFTADMVWEQAREPL 65

Query: 67  HWDIDYNKTNGVGPGMAFANELLAKAGESIGVIGLVPCAIGGTHLREWIKGTVYYTRLVN 126
           HWDID  KTNGVGPGM FANELLAKAG SIG IGLVPCAIGG+HLREW+KGT  YT+LV 
Sbjct: 66  HWDIDVVKTNGVGPGMPFANELLAKAGPSIGTIGLVPCAIGGSHLREWVKGTDRYTKLVE 125

Query: 127 RIKASEKHGGKVQAFFWYQGESDASVEVESKLYEENLTKFFTDLRKDLNHPELPIILMKI 186
           R+K SE+HGGKV+ FFWYQGESDA+VE E+K YE  L+KFFTDLR D+NHP+LPIIL+KI
Sbjct: 126 RMKRSEEHGGKVKGFFWYQGESDAAVEEEAKSYERELSKFFTDLRADVNHPDLPIILVKI 185

Query: 187 VTHDIFTSPIINYKEDVWKAQEAVTHKLLNVRMVDAMEAVGNLEQGLNEDKGHLNVKSEV 246
           VTHD F SP   +K++VW AQEAVT KL NVRMVD   AVGN ++GLNED+GHLNVKSEV
Sbjct: 186 VTHDFFISPDFEFKDEVWNAQEAVTQKLPNVRMVDGRVAVGNFDEGLNEDRGHLNVKSEV 245

Query: 247 KLGKMLAHDFYSNFDH 263
            LGKM AH +YSNF H
Sbjct: 246 NLGKMFAHSYYSNFAH 261

BLAST of Sgr016842 vs. ExPASy TrEMBL
Match: A0A6J1CKF9 (probable carbohydrate esterase At4g34215 OS=Momordica charantia OX=3673 GN=LOC111012113 PE=4 SV=1)

HSP 1 Score: 399.4 bits (1025), Expect = 1.1e-107
Identity = 186/259 (71.81%), Positives = 215/259 (83.01%), Query Frame = 0

Query: 7   PDNIFILGGQSNMAGRGGVSKDPITDKNKWDGYIPPESQSHKSILRLNADLKWEQAREPL 66
           P+NIFILGGQSNMAGRGGV KDP T K  WDG +PP+ Q +KSILR +A+  WE+A EPL
Sbjct: 27  PNNIFILGGQSNMAGRGGVEKDPNTQKMVWDGIVPPKCQPNKSILRFSANSVWEEALEPL 86

Query: 67  HWDIDYNKTNGVGPGMAFANELLAKAGESIGVIGLVPCAIGGTHLREWIKGTVYYTRLVN 126
           HWDID NKTNG+GPGM FA+E+LAKAG   GVIGLVPCAIGGTHLREW+KGT  YTRLVN
Sbjct: 87  HWDIDVNKTNGIGPGMPFAHEILAKAGNKSGVIGLVPCAIGGTHLREWVKGTQNYTRLVN 146

Query: 127 RIKASEKHGGKVQAFFWYQGESDASVEVESKLYEENLTKFFTDLRKDLNHPELPIILMKI 186
           RIKASE  GGK+Q   WYQGESDA+VE ESK YE NLTKF+TDLR D NHP+LPIIL+KI
Sbjct: 147 RIKASEAQGGKIQGLLWYQGESDAAVEEESKFYESNLTKFYTDLRTDTNHPDLPIILVKI 206

Query: 187 VTHDIFTSPIINYKEDVWKAQEAVTHKLLNVRMVDAMEAVGNLEQGLNEDKGHLNVKSEV 246
           VTHD F SP+IN+ +DVWKAQE +T  L+NVR+VD  +AVGN + G+N+D GHL+ KSEV
Sbjct: 207 VTHDFFISPLINFLKDVWKAQEDITRDLVNVRIVDGKQAVGNFDTGMNQDGGHLSTKSEV 266

Query: 247 KLGKMLAHDFYSNFDHKLT 266
           KLGKMLA  FYSNF ++LT
Sbjct: 267 KLGKMLADSFYSNFGNRLT 285

BLAST of Sgr016842 vs. ExPASy TrEMBL
Match: A0A0A0K9J9 (SASA domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G058180 PE=4 SV=1)

HSP 1 Score: 386.3 bits (991), Expect = 1.0e-103
Identity = 186/256 (72.66%), Positives = 206/256 (80.47%), Query Frame = 0

Query: 7   PDNIFILGGQSNMAGRGGVSKDPITDKNKWDGYIPPESQSHKSILRLNADLKWEQAREPL 66
           P+NIFIL GQSNMAGRGGVS DP TDK  WDGYIP E +S+ SI RLNAD+ WEQA EPL
Sbjct: 14  PNNIFILAGQSNMAGRGGVSLDPTTDKMVWDGYIPLECESNDSIFRLNADMVWEQAHEPL 73

Query: 67  HWDIDYNKTNGVGPGMAFANELLAKAGESIGVIGLVPCAIGGTHLREWIKGTVYYTRLVN 126
           HWDID  KTNG+GPGMAFANELLA  G+ IG IGLVPCAIGG+HL+EW+KGT  Y  LV 
Sbjct: 74  HWDIDVVKTNGIGPGMAFANELLAIGGKRIGAIGLVPCAIGGSHLKEWVKGTNRYDNLVE 133

Query: 127 RIKASEKHGGKVQAFFWYQGESDASVEVESKLYEENLTKFFTDLRKDLNHPELPIILMKI 186
           RI+ASEK+GG VQ   WYQGESDA+VE E+  YE  LTKFF DLR D NHPELPIIL+K+
Sbjct: 134 RIRASEKNGGTVQGILWYQGESDAAVEEEAMCYERELTKFFIDLRADTNHPELPIILVKL 193

Query: 187 VTHDIFTSPIINYKEDVWKAQEAVTHKLLNVRMVDAMEAVGNLEQGLNEDKGHLNVKSEV 246
           VTHD F SP I++KE+V  A EAVTH+L NV MVD   AVGN + GLNEDKGHLNVKSEV
Sbjct: 194 VTHDFFLSPNISFKEEVCNALEAVTHRLPNVTMVDGPMAVGNFDDGLNEDKGHLNVKSEV 253

Query: 247 KLGKMLAHDFYSNFDH 263
           KLGKM AH FYSNF H
Sbjct: 254 KLGKMFAHSFYSNFAH 269

BLAST of Sgr016842 vs. TAIR 10
Match: AT3G53010.1 (Domain of unknown function (DUF303) )

HSP 1 Score: 217.6 bits (553), Expect = 1.2e-56
Identity = 123/250 (49.20%), Positives = 154/250 (61.60%), Query Frame = 0

Query: 9   NIFILGGQSNMAGRGGVSKDPITDKNKWDGYIPPESQSHKSILRLNADLKWEQAREPLHW 68
           +IFIL GQSNMAGRGGV  D  T+   WDG IPPE +S+ SILRL + L+W++A+EPLH 
Sbjct: 30  SIFILAGQSNMAGRGGVYNDTATNTTVWDGVIPPECRSNPSILRLTSKLEWKEAKEPLHV 89

Query: 69  DIDYNKTNGVGPGMAFANELLAKAGESIGVIGLVPCAIGGTHLREWIKGTVYYTRLVNRI 128
           DID NKTNGVGPGM FAN ++ + G+    +GLVPC+IGGT L +W KG   Y   V R 
Sbjct: 90  DIDINKTNGVGPGMPFANRVVNRFGQ----VGLVPCSIGGTKLSQWQKGEFLYEETVKRA 149

Query: 129 KA--SEKHGGKVQAFFWYQGESDASVEVESKLYEENLTKFFTDLRKDLNHPELPIILMKI 188
           KA  +   GG  +A  WYQGESD    V++ +Y++ L KFF+DLR DL HP LPII + +
Sbjct: 150 KAAMASGGGGSYRAVLWYQGESDTVDMVDASVYKKRLVKFFSDLRNDLQHPNLPIIQVAL 209

Query: 189 VTHDIFTSPIINYKEDVWKAQEAVTHKLLNVRMVDAMEAVGNLEQGLNEDKGHLNVKSEV 248
            T      P   Y + V KAQ  +   L NV  VDA          L  D  HL   S+V
Sbjct: 210 ATG---AGP---YLDAVRKAQ--LKTDLENVYCVDARGL------PLEPDGLHLTTSSQV 261

Query: 249 KLGKMLAHDF 257
           +LG M+A  F
Sbjct: 270 QLGHMIAESF 261

BLAST of Sgr016842 vs. TAIR 10
Match: AT4G34215.1 (Domain of unknown function (DUF303) )

HSP 1 Score: 209.9 bits (533), Expect = 2.5e-54
Identity = 118/254 (46.46%), Positives = 153/254 (60.24%), Query Frame = 0

Query: 7   PDNIFILGGQSNMAGRGGVSKDPITDKNKWDGYIPPESQSHKSILRLNADLKWEQAREPL 66
           P+ IFIL GQSNMAGRGGV KD   ++  WD  +PPE   + SILRL+ADL+WE+A EPL
Sbjct: 21  PNQIFILSGQSNMAGRGGVFKDHHNNRWVWDKILPPECAPNSSILRLSADLRWEEAHEPL 80

Query: 67  HWDIDYNKTNGVGPGMAFANELLAKAGESIGVIGLVPCAIGGTHLREWIKGTVYYTRLVN 126
           H DID  K  GVGPGMAFAN +  +      VIGLVPCA GGT ++EW +G+  Y R+V 
Sbjct: 81  HVDIDTGKVCGVGPGMAFANAVKNRLETDSAVIGLVPCASGGTAIKEWERGSHLYERMVK 140

Query: 127 RIKASEKHGGKVQAFFWYQGESDASVEVESKLYEENLTKFFTDLRKDLNHPELPIILMKI 186
           R + S K GG+++A  WYQGESD     +++ Y  N+ +   +LR DLN P LPII + I
Sbjct: 141 RTEESRKCGGEIKAVLWYQGESDVLDIHDAESYGNNMDRLIKNLRHDLNLPSLPIIQVAI 200

Query: 187 VTHDIFTSPIINYKEDVWKAQEAVTHKLLNVRMVDAMEAVGNLEQGLNEDKGHLNVKSEV 246
            +          Y + V +AQ  +  KL NV  VDA          L  D  HL  +++V
Sbjct: 201 ASGG-------GYIDKVREAQLGL--KLSNVVCVDAKGL------PLKSDNLHLTTEAQV 259

Query: 247 KLGKMLAHDFYSNF 261
           +LG  LA  + SNF
Sbjct: 261 QLGLSLAQAYLSNF 259

BLAST of Sgr016842 vs. TAIR 10
Match: AT4G34215.2 (Domain of unknown function (DUF303) )

HSP 1 Score: 209.9 bits (533), Expect = 2.5e-54
Identity = 118/254 (46.46%), Positives = 153/254 (60.24%), Query Frame = 0

Query: 7   PDNIFILGGQSNMAGRGGVSKDPITDKNKWDGYIPPESQSHKSILRLNADLKWEQAREPL 66
           P+ IFIL GQSNMAGRGGV KD   ++  WD  +PPE   + SILRL+ADL+WE+A EPL
Sbjct: 21  PNQIFILSGQSNMAGRGGVFKDHHNNRWVWDKILPPECAPNSSILRLSADLRWEEAHEPL 80

Query: 67  HWDIDYNKTNGVGPGMAFANELLAKAGESIGVIGLVPCAIGGTHLREWIKGTVYYTRLVN 126
           H DID  K  GVGPGMAFAN +  +      VIGLVPCA GGT ++EW +G+  Y R+V 
Sbjct: 81  HVDIDTGKVCGVGPGMAFANAVKNRLETDSAVIGLVPCASGGTAIKEWERGSHLYERMVK 140

Query: 127 RIKASEKHGGKVQAFFWYQGESDASVEVESKLYEENLTKFFTDLRKDLNHPELPIILMKI 186
           R + S K GG+++A  WYQGESD     +++ Y  N+ +   +LR DLN P LPII + I
Sbjct: 141 RTEESRKCGGEIKAVLWYQGESDVLDIHDAESYGNNMDRLIKNLRHDLNLPSLPIIQVAI 200

Query: 187 VTHDIFTSPIINYKEDVWKAQEAVTHKLLNVRMVDAMEAVGNLEQGLNEDKGHLNVKSEV 246
            +          Y + V +AQ  +  KL NV  VDA          L  D  HL  +++V
Sbjct: 201 ASGG-------GYIDKVREAQLGL--KLSNVVCVDAKGL------PLKSDNLHLTTEAQV 259

Query: 247 KLGKMLAHDFYSNF 261
           +LG  LA  + SNF
Sbjct: 261 QLGLSLAQAYLSNF 259

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value   Identity  Description
XP_022141681.1  3.6e-124  83.78     probable carbohydrate esterase At4g34215 [Momordica charantia]
KAG7016422.1    7.1e-112  76.17     putative carbohydrate esterase, partial [Cucurbita argyrosperma subsp. argyrospe...
XP_022939276.1  9.3e-112  75.78     probable carbohydrate esterase At4g34215 [Cucurbita moschata]
KAG6578894.1    1.6e-111  75.78     putative carbohydrate esterase, partial [Cucurbita argyrosperma subsp. sororia]
XP_022993914.1  2.1e-111  75.78     probable carbohydrate esterase At4g34215 [Cucurbita maxima]

ExPASy Swiss-Prot
Match Name      E-value   Identity  Description
Q8L9J9          3.5e-53   46.46     Probable carbohydrate esterase At4g34215 OS=Arabidopsis thaliana OX=3702 GN=At4g...

ExPASy TrEMBL
Match Name      E-value   Identity  Description
A0A6J1CJZ1      1.8e-124  83.78     probable carbohydrate esterase At4g34215 OS=Momordica charantia OX=3673 GN=LOC11...
A0A6J1FFF9      4.5e-112  75.78     probable carbohydrate esterase At4g34215 OS=Cucurbita moschata OX=3662 GN=LOC111...
A0A6J1K1G7      1.0e-111  75.78     probable carbohydrate esterase At4g34215 OS=Cucurbita maxima OX=3661 GN=LOC11148...
A0A6J1CKF9      1.1e-107  71.81     probable carbohydrate esterase At4g34215 OS=Momordica charantia OX=3673 GN=LOC11...
A0A0A0K9J9      1.0e-103  72.66     SASA domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G058180 PE=4 S...

TAIR 10
Match Name      E-value   Identity  Description
AT3G53010.1     1.2e-56   49.20     Domain of unknown function (DUF303)
AT4G34215.1     2.5e-54   46.46     Domain of unknown function (DUF303)
AT4G34215.2     2.5e-54   46.46     Domain of unknown function (DUF303)

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                  Source       Source Term     Source Description                   Alignment
IPR036514  SGNH hydrolase superfamily       GENE3D       3.40.50.1110    SGNH hydrolase                       coord: 7..260, e-value: 4.6E-60, score: 205.5
IPR005181  Sialate O-acetylesterase domain  PFAM         PF03629         SASA                                 coord: 9..257, e-value: 4.8E-74, score: 248.8
None       No IPR available                 PANTHER      PTHR31988:SF19  BNACNNG62850D PROTEIN                coord: 6..260
None       No IPR available                 PANTHER      PTHR31988       ESTERASE, PUTATIVE (DUF303)-RELATED  coord: 6..260
None       No IPR available                 SUPERFAMILY  52266           SGNH hydrolase                       coord: 9..257

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr016842.1   Sgr016842.1  mRNA