Sgr000124 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr000124
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Expression protein
Location: tig00000049: 26519 .. 26848 (-)
RNA-Seq Expression: Sgr000124
Synteny: Sgr000124
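
The Location field packs the contig name, start, end, and strand into one string. A minimal Python sketch for pulling these apart and checking the span length follows; the `parse_location` helper and its regular expression are illustrative assumptions, not part of the database, and the coordinates are assumed to be 1-based and inclusive.

```python
import re

def parse_location(text):
    """Parse 'contig: start .. end (strand)' into its parts.

    Assumes 1-based inclusive coordinates (an assumption, not stated
    on the page), so length = end - start + 1.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {"contig": m.group(1), "start": start, "end": end,
            "strand": m.group(4), "length": end - start + 1}

loc = parse_location("tig00000049: 26519 .. 26848 (-)")
print(loc["contig"], loc["strand"], loc["length"])  # tig00000049 - 330
```

Under that assumption the span is 330 bp, which agrees with a 109-residue protein plus a stop codon (110 codons).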
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGTAACTGTCTCCGGCGGGACGACTCGGCTGCGCGGTGGGCCGGCGAGGAGTGGGACTTTCTGGCGGCGGACGCCCGGGGACGACGCGCAAAGCTGATGGACGAGGACGAGATCGACGAAGGGCTTCTCGGGTGCGGCTCCACGGCGGCGACGGTGAAGATCAAGATCACGAAGAAGCAGCTGGAGGAGCTGCTGGGGAAGGTGGACATCAAGGAGCTCTCCGTGCAGCAGGTTCTGACTCAGTTGATCAGCGTCGGCGATCAGTTTCACGACTCTCGCCACCGGTCGTGGCGGCCGGTGTTACAGAGCATCCCTGAAGTGGACTGA

mRNA sequence

ATGGGTAACTGTCTCCGGCGGGACGACTCGGCTGCGCGGTGGGCCGGCGAGGAGTGGGACTTTCTGGCGGCGGACGCCCGGGGACGACGCGCAAAGCTGATGGACGAGGACGAGATCGACGAAGGGCTTCTCGGGTGCGGCTCCACGGCGGCGACGGTGAAGATCAAGATCACGAAGAAGCAGCTGGAGGAGCTGCTGGGGAAGGTGGACATCAAGGAGCTCTCCGTGCAGCAGGTTCTGACTCAGTTGATCAGCGTCGGCGATCAGTTTCACGACTCTCGCCACCGGTCGTGGCGGCCGGTGTTACAGAGCATCCCTGAAGTGGACTGA

Coding sequence (CDS)

ATGGGTAACTGTCTCCGGCGGGACGACTCGGCTGCGCGGTGGGCCGGCGAGGAGTGGGACTTTCTGGCGGCGGACGCCCGGGGACGACGCGCAAAGCTGATGGACGAGGACGAGATCGACGAAGGGCTTCTCGGGTGCGGCTCCACGGCGGCGACGGTGAAGATCAAGATCACGAAGAAGCAGCTGGAGGAGCTGCTGGGGAAGGTGGACATCAAGGAGCTCTCCGTGCAGCAGGTTCTGACTCAGTTGATCAGCGTCGGCGATCAGTTTCACGACTCTCGCCACCGGTCGTGGCGGCCGGTGTTACAGAGCATCCCTGAAGTGGACTGA

Protein sequence

MGNCLRRDDSAARWAGEEWDFLAADARGRRAKLMDEDEIDEGLLGCGSTAATVKIKITKKQLEELLGKVDIKELSVQQVLTQLISVGDQFHDSRHRSWRPVLQSIPEVD
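
The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1) and reading frame 0; the `translate` helper is illustrative and not something the database provides.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First six codons and last three codons of the CDS listed above.
print(translate("ATGGGTAACTGTCTCCGG"))  # MGNCLR
print(translate("GTGGACTGA"))           # VD
```

Passing the full 330-nucleotide CDS to `translate` should reproduce the 109-residue protein sequence shown above, which begins with MGNCLR and ends with VD.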
Homology
BLAST of Sgr000124 vs. NCBI nr
Match: XP_038896531.1 (uncharacterized protein LOC120084782 [Benincasa hispida])

HSP 1 Score: 154.1 bits (388), Expect = 7.1e-34
Identity = 77/110 (70.00%), Positives = 88/110 (80.00%), Query Frame = 0

Query: 1   MGNCLRRDDSAARWAGEEWDFLAADAR-GRRAKLMDEDEIDEGLLGCGSTAATVKIKITK 60
           MGNCLRRDD+ A+WAGE+WDFLAA+   G+   L D ++         +TAATVKIKITK
Sbjct: 1   MGNCLRRDDAVAQWAGEDWDFLAAEGHDGQEGLLRDGEKKKCSAAVASTTAATVKIKITK 60

Query: 61  KQLEELLGKVDIKELSVQQVLTQLISVGDQFHDSRHRSWRPVLQSIPEVD 110
           +QLEELLGKVDIKE+SVQQVL QLI VGDQFH+SRHR WRPVLQ IPEVD
Sbjct: 61  RQLEELLGKVDIKEISVQQVLAQLIGVGDQFHESRHRHWRPVLQCIPEVD 110
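
Each HSP summary line reports counts as a fraction of the alignment length (gap columns included) together with a percentage. A small Python sketch for parsing such a line and recomputing the percentages is given below; the `parse_hsp_stats` helper is a hypothetical name introduced for illustration, and the pattern assumes the `Label = n/m (p%)` layout seen in these records.

```python
import re

def parse_hsp_stats(line):
    """Extract 'Label = matches/length (pct%)' pairs from an HSP summary line."""
    stats = {}
    for label, hits, length, pct in re.findall(
            r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)", line):
        stats[label] = {
            "count": int(hits),
            "length": int(length),
            "reported_pct": float(pct),
            # Recompute from the fraction as a consistency check.
            "computed_pct": round(100 * int(hits) / int(length), 2),
        }
    return stats

stats = parse_hsp_stats("Identity = 77/110 (70.00%), Query Frame = 0")
print(stats["Identity"]["computed_pct"])  # 70.0
```

Fields without a fraction, such as `Query Frame = 0`, do not match the pattern and are skipped.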

BLAST of Sgr000124 vs. NCBI nr
Match: XP_022936225.1 (uncharacterized protein LOC111442898 [Cucurbita moschata] >KAG6592042.1 hypothetical protein SDJN03_14388, partial [Cucurbita argyrosperma subsp. sororia] >KAG7024918.1 hypothetical protein SDJN02_13738, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 152.9 bits (385), Expect = 1.6e-33
Identity = 84/115 (73.04%), Positives = 90/115 (78.26%), Query Frame = 0

Query: 1   MGNCLRRDDSAARWAGEEWDFLAADARGRRAKLMDEDEIDEGLLG-----CGSTAAT-VK 60
           MGNCLRRD +AA+WAGEEWDFLAA+  GR           EGLLG       STAAT VK
Sbjct: 1   MGNCLRRDGAAAQWAGEEWDFLAAEDDGR-----------EGLLGEIEKKKSSTAATEVK 60

Query: 61  IKITKKQLEELLGKVDIKELSVQQVLTQLISVGDQFHDSRHRSWRPVLQSIPEVD 110
           IKITK+QLEELLGKVDIKE+SVQQVL QLI VGDQFH+SRHR WRPVLQSIPEVD
Sbjct: 61  IKITKRQLEELLGKVDIKEMSVQQVLAQLIGVGDQFHESRHRHWRPVLQSIPEVD 104

BLAST of Sgr000124 vs. NCBI nr
Match: XP_022975730.1 (uncharacterized protein LOC111475920 [Cucurbita maxima])

HSP 1 Score: 151.0 bits (380), Expect = 6.0e-33
Identity = 78/109 (71.56%), Positives = 88/109 (80.73%), Query Frame = 0

Query: 1   MGNCLRRDDSAARWAGEEWDFLAADARGRRAKLMDEDEIDEGLLGCGSTAATVKIKITKK 60
           MGNCLRRD +AA+WAGEEWD LAA+  GR   L+ E+E  +      +TA  VKIKITK+
Sbjct: 1   MGNCLRRDGAAAQWAGEEWDLLAAEDDGREG-LLGENEKKKS----STTATEVKIKITKR 60

Query: 61  QLEELLGKVDIKELSVQQVLTQLISVGDQFHDSRHRSWRPVLQSIPEVD 110
           QLEELLGKVDIKE+SVQQVL QLI VGDQFH+SRHR WRPVLQSIPEVD
Sbjct: 61  QLEELLGKVDIKEMSVQQVLAQLIGVGDQFHESRHRHWRPVLQSIPEVD 104

BLAST of Sgr000124 vs. NCBI nr
Match: XP_023536165.1 (uncharacterized protein LOC111797410 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 149.4 bits (376), Expect = 1.7e-32
Identity = 82/115 (71.30%), Positives = 89/115 (77.39%), Query Frame = 0

Query: 1   MGNCLRRDDSAARWAGEEWDFLAADARGRRAKLMDEDEIDEGLLG-----CGSTAAT-VK 60
           MGNC+RRD +AA+WAGEEWDFLAA+  GR           EGLLG       STAAT VK
Sbjct: 1   MGNCIRRDGAAAQWAGEEWDFLAAEDDGR-----------EGLLGEIEKKKSSTAATEVK 60

Query: 61  IKITKKQLEELLGKVDIKELSVQQVLTQLISVGDQFHDSRHRSWRPVLQSIPEVD 110
           IKITK+QLEELLGKVDIKE+SVQQVL QLI V DQFH+SRHR WRPVLQSIPEVD
Sbjct: 61  IKITKRQLEELLGKVDIKEMSVQQVLAQLIGVDDQFHESRHRHWRPVLQSIPEVD 104

BLAST of Sgr000124 vs. NCBI nr
Match: XP_022139678.1 (uncharacterized protein LOC111010526 [Momordica charantia])

HSP 1 Score: 137.5 bits (345), Expect = 6.9e-29
Identity = 71/109 (65.14%), Positives = 79/109 (72.48%), Query Frame = 0

Query: 1   MGNCLRRDDSAARWAGEEWDFLAADARGRRAKLMDEDEIDEGLLGCGSTAATVKIKITKK 60
           MGNCLRRD +A RWAGEEW FLA + +G  A                + A  VKIKITKK
Sbjct: 1   MGNCLRRDGAAERWAGEEWGFLAEEKQGSAA-------------APTAAATEVKIKITKK 60

Query: 61  QLEELLGKVDIKELSVQQVLTQLISVGDQFHDSRHRSWRPVLQSIPEVD 110
           QLEELLGK D+K LSVQQVL QLI+VGDQFH+SRHR WRPVLQSIPE+D
Sbjct: 61  QLEELLGKADVKGLSVQQVLAQLIAVGDQFHESRHRPWRPVLQSIPEMD 96

BLAST of Sgr000124 vs. ExPASy TrEMBL
Match: A0A6J1FD17 (uncharacterized protein LOC111442898 OS=Cucurbita moschata OX=3662 GN=LOC111442898 PE=4 SV=1)

HSP 1 Score: 152.9 bits (385), Expect = 7.6e-34
Identity = 84/115 (73.04%), Positives = 90/115 (78.26%), Query Frame = 0

Query: 1   MGNCLRRDDSAARWAGEEWDFLAADARGRRAKLMDEDEIDEGLLG-----CGSTAAT-VK 60
           MGNCLRRD +AA+WAGEEWDFLAA+  GR           EGLLG       STAAT VK
Sbjct: 1   MGNCLRRDGAAAQWAGEEWDFLAAEDDGR-----------EGLLGEIEKKKSSTAATEVK 60

Query: 61  IKITKKQLEELLGKVDIKELSVQQVLTQLISVGDQFHDSRHRSWRPVLQSIPEVD 110
           IKITK+QLEELLGKVDIKE+SVQQVL QLI VGDQFH+SRHR WRPVLQSIPEVD
Sbjct: 61  IKITKRQLEELLGKVDIKEMSVQQVLAQLIGVGDQFHESRHRHWRPVLQSIPEVD 104

BLAST of Sgr000124 vs. ExPASy TrEMBL
Match: A0A6J1IHJ3 (uncharacterized protein LOC111475920 OS=Cucurbita maxima OX=3661 GN=LOC111475920 PE=4 SV=1)

HSP 1 Score: 151.0 bits (380), Expect = 2.9e-33
Identity = 78/109 (71.56%), Positives = 88/109 (80.73%), Query Frame = 0

Query: 1   MGNCLRRDDSAARWAGEEWDFLAADARGRRAKLMDEDEIDEGLLGCGSTAATVKIKITKK 60
           MGNCLRRD +AA+WAGEEWD LAA+  GR   L+ E+E  +      +TA  VKIKITK+
Sbjct: 1   MGNCLRRDGAAAQWAGEEWDLLAAEDDGREG-LLGENEKKKS----STTATEVKIKITKR 60

Query: 61  QLEELLGKVDIKELSVQQVLTQLISVGDQFHDSRHRSWRPVLQSIPEVD 110
           QLEELLGKVDIKE+SVQQVL QLI VGDQFH+SRHR WRPVLQSIPEVD
Sbjct: 61  QLEELLGKVDIKEMSVQQVLAQLIGVGDQFHESRHRHWRPVLQSIPEVD 104

BLAST of Sgr000124 vs. ExPASy TrEMBL
Match: A0A6J1CG88 (uncharacterized protein LOC111010526 OS=Momordica charantia OX=3673 GN=LOC111010526 PE=4 SV=1)

HSP 1 Score: 137.5 bits (345), Expect = 3.3e-29
Identity = 71/109 (65.14%), Positives = 79/109 (72.48%), Query Frame = 0

Query: 1   MGNCLRRDDSAARWAGEEWDFLAADARGRRAKLMDEDEIDEGLLGCGSTAATVKIKITKK 60
           MGNCLRRD +A RWAGEEW FLA + +G  A                + A  VKIKITKK
Sbjct: 1   MGNCLRRDGAAERWAGEEWGFLAEEKQGSAA-------------APTAAATEVKIKITKK 60

Query: 61  QLEELLGKVDIKELSVQQVLTQLISVGDQFHDSRHRSWRPVLQSIPEVD 110
           QLEELLGK D+K LSVQQVL QLI+VGDQFH+SRHR WRPVLQSIPE+D
Sbjct: 61  QLEELLGKADVKGLSVQQVLAQLIAVGDQFHESRHRPWRPVLQSIPEMD 96

BLAST of Sgr000124 vs. ExPASy TrEMBL
Match: A0A5B6U831 (Expression protein OS=Gossypium australe OX=47621 GN=EPI10_009028 PE=4 SV=1)

HSP 1 Score: 111.3 bits (277), Expect = 2.6e-21
Identity = 61/120 (50.83%), Positives = 81/120 (67.50%), Query Frame = 0

Query: 1   MGNCLRRDDSAARWAGEEWDFLAA---------DARGRRAKLMDEDEIDEGLLGCGS--T 60
           MGNCLR   S+ +WAG++W   AA         D + +   ++ +    +G +   S  T
Sbjct: 1   MGNCLRH-QSSTQWAGDDWGTTAADYGDDDGFSDIKTKGKGIVGDHRQKDGFITTSSTPT 60

Query: 61  AATVKIKITKKQLEELLGKVDIKELSVQQVLTQLISVGDQFHDSRHRSWRPVLQSIPEVD 110
           A  VK+KITKKQLEELLG+VD+KELSVQQVL+QLI+V +QF ++  RSWRP LQSIPEV+
Sbjct: 61  AHEVKVKITKKQLEELLGRVDVKELSVQQVLSQLINVSNQFDETNQRSWRPALQSIPEVN 119

BLAST of Sgr000124 vs. ExPASy TrEMBL
Match: A0A5D2QUV4 (Uncharacterized protein OS=Gossypium tomentosum OX=34277 GN=ES332_A04G022400v1 PE=4 SV=1)

HSP 1 Score: 110.9 bits (276), Expect = 3.3e-21
Identity = 60/120 (50.00%), Positives = 79/120 (65.83%), Query Frame = 0

Query: 1   MGNCLRRDDSAARWAGEEWDFLAA---------DARGRRAKLMDEDEIDEGLLGCGSTAA 60
           MGNCLR   S+ +WAG++W   AA         D + +   ++ +    +G +   ST  
Sbjct: 1   MGNCLRH-QSSTQWAGDDWGTTAADYGDDDGFSDIKTKEKGIVGDHHQKDGFITTSSTPT 60

Query: 61  T--VKIKITKKQLEELLGKVDIKELSVQQVLTQLISVGDQFHDSRHRSWRPVLQSIPEVD 110
              VK+KITKKQLEELLG+VD+KELSVQQVL QLI+V +QF ++  RSWRP LQSIPEV+
Sbjct: 61  VHEVKVKITKKQLEELLGRVDVKELSVQQVLAQLINVSNQFDETNQRSWRPALQSIPEVN 119

BLAST of Sgr000124 vs. TAIR 10
Match: AT3G20340.1 (Expression of the gene is downregulated in the presence of paraquat, an inducer of photoxidative stress. )

HSP 1 Score: 68.6 bits (166), Expect = 3.6e-12
Identity = 42/116 (36.21%), Positives = 66/116 (56.90%), Query Frame = 0

Query: 1   MGNCLRRDDSAARWAGEEWD-FLAADARGRRAKLMDEDEIDEGLLGCGSTAAT----VKI 60
           MGNCLR  +S   WAGE+WD F+  D            +    ++   S ++     +KI
Sbjct: 1   MGNCLRH-ESEMHWAGEDWDEFITEDEEDHHYSSKTTRDGKPVIVTRDSKSSVPSHEIKI 60

Query: 61  KITKKQLEELLGKVDIKELSVQQVLTQLISVGDQFHD--SRHRSWRPVLQSIPEVD 110
           ++TKKQL +LL KV++ +L+ QQ       + ++ ++  ++ R WRPVLQSIPEV+
Sbjct: 61  RLTKKQLHDLLSKVNVHDLTFQQQTFSCPILNNRGYEEANQQRLWRPVLQSIPEVN 115

BLAST of Sgr000124 vs. TAIR 10
Match: AT4G21920.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-terminal protein myristoylation; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G20340.1); Has 40 Blast hits to 40 proteins in 10 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 40; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 68.2 bits (165), Expect = 4.8e-12
Identity = 45/129 (34.88%), Positives = 71/129 (55.04%), Query Frame = 0

Query: 1   MGNCL-RRDDSAARWAGEEWDFLAADARGRRAKLMDEDEID-EGLLG-----------CG 60
           MGNC+   + +   W+G++        R RR+ ++ +D  D E LLG             
Sbjct: 1   MGNCICVTEKTTTSWSGDDNGSYNKRRRRRRSTVVHDDNDDGEKLLGETSNVTSTSSSSS 60

Query: 61  STAATVKIKITKKQLEELLGKVDIKELSVQQVLTQLI-SVGDQFHDS------RHRSWRP 110
           S    +KI+ITKK+LE+L+  + +K L+ +++L++LI   GDQ   S       H+ W+P
Sbjct: 61  SERREIKIRITKKELEDLMRNIGLKSLTAEEILSKLIFEGGDQIGFSAVDVTNHHQPWKP 120

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity (%)  Description
XP_038896531.1  7.1e-34  70.00         uncharacterized protein LOC120084782 [Benincasa hispida]
XP_022936225.1  1.6e-33  73.04         uncharacterized protein LOC111442898 [Cucurbita moschata] >KAG6592042.1 hypothet...
XP_022975730.1  6.0e-33  71.56         uncharacterized protein LOC111475920 [Cucurbita maxima]
XP_023536165.1  1.7e-32  71.30         uncharacterized protein LOC111797410 [Cucurbita pepo subsp. pepo]
XP_022139678.1  6.9e-29  65.14         uncharacterized protein LOC111010526 [Momordica charantia]

ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A6J1FD17      7.6e-34  73.04         uncharacterized protein LOC111442898 OS=Cucurbita moschata OX=3662 GN=LOC1114428...
A0A6J1IHJ3      2.9e-33  71.56         uncharacterized protein LOC111475920 OS=Cucurbita maxima OX=3661 GN=LOC111475920...
A0A6J1CG88      3.3e-29  65.14         uncharacterized protein LOC111010526 OS=Momordica charantia OX=3673 GN=LOC111010...
A0A5B6U831      2.6e-21  50.83         Expression protein OS=Gossypium australe OX=47621 GN=EPI10_009028 PE=4 SV=1
A0A5D2QUV4      3.3e-21  50.00         Uncharacterized protein OS=Gossypium tomentosum OX=34277 GN=ES332_A04G022400v1 P...

TAIR 10
Match Name      E-value  Identity (%)  Description
AT3G20340.1     3.6e-12  36.21         Expression of the gene is downregulated in the presence of paraquat, an inducer ...
AT4G21920.1     4.8e-12  34.88         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-termin...

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source   Source Term    Source Description    Alignment
None      No IPR available  PANTHER  PTHR33647:SF5  OS01G0793900 PROTEIN  coord: 1..108
None      No IPR available  PANTHER  PTHR33647      OS01G0793900 PROTEIN  coord: 1..108

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr000124.1   Sgr000124.1  mRNA