Sgr014493 (gene) Monk fruit (Qingpiguo) v1

Overview
NameSgr014493
Typegene
OrganismSiraitia grosvenorii (Monk fruit (Qingpiguo) v1)
DescriptionUnknown protein
Locationtig00000589: 836046 .. 837451 (+)
RNA-Seq ExpressionSgr014493
SyntenySgr014493
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTCCAACGCTTCCCCCACCGACGGGACTTTTTCGACTCCTCTTTCCGGCTGCCGGTCCCTAGGACTGATGCAGCGTCGTTGAAAGCCGAGTCGAAGGACCCGGAGCAATCGTCGCGCAGGGAATTGGAGTTGGAGTCTCTATCCGGGATGGCCAGCTTGGGCCCATGGCAGATATCGGTGGTTGCAGGAGACACTTTCGCGTTCGAAACCGACTTCATCTCATCAATCTCAGCCACCACCGCCGCCGCAGTTTTCCTGATAGAGTTGGACCTGTCGAGGCTCTTCCGCCGACGAGAAGAAGAGTCCGAATAGTAGTCCCGGGTCTGGGACGACCCGCCGGGGATATTCTCTTCTTCATTGGTAGAATTATTGGATTCTTCCACAGGAATTTGGGTATCGGAACGGAAGACATTGATTGGAGCATCTTCAACGACAGAAAGCATGGTGGGCATTCTTGGAAAGGTTCTGCTGATCAAATACCCATCCCAAGAAGCCCGAGGCTCGTCGAAAGAATAACGGGGATCGTCGAAGGACATCCGACCAGTATCGAGCGAGAACCTCGGATCAGTATCGCACGATCGTCGACCGAACCCGTAATCGGCAATCTCCGACTGTGTTTCTCTAAAATGACGTCCGATAGGCTTCTCCACAGGCAATGTCGTAGATGCACCGCCATTTCTCTGCTTCTTCTCCTTCTGTTTGTCCCTCCACTTCTGCAGCTTCTTGCTGAATACTGAAGCGGCGGACCAGAAGCTACCGGCAATCTCCTTCAAGTCCCTTCCAGAGGGCTTCTTGGTGTACGAATCGAGATCTATGTGATCTTTCATGGTCTTCAAGTGCTCTTGTACTGATTCTGGCTCTAATTCTTCCTGAATTTCTTCTTCTTCGACAATCTCCTGAACTCCGTTTTCAATTACAACATCCACCACATTGGGTTCCTCACAAACCCTAATTTCTCCGTGGTCGGTCGTGGGGTCTCCTGCTTCAGCCTCGAGAATGGGACCTCGAAGGCTCGAAGTGGACTCCTGTAGATTTTTATTCTCAAGAGTAATCTCAGCACCAGCGACCCTAGAAGTAGAAGATTCCTTGAGAGACGGATGACGGGCGTCGTCCTGGGAAAAGAGAGTGCAGAGAGTGTTACGAACTCTAACGTCGCAAGATCTCCTCTGGGGCTCAAAGACGCCGGAGAAGGCCTCGTTCTTGGAGGCAGAGAAGGATTTGGTACGACGAAGCTCAGGAAAGAAAGAGGAAGGCTTAGTGCGGTTGGAAGAGGAGGGGAGTCCAGCAGCAGAACCAAGTCCGGGCCTGAAGATAGCCTTAAGAGCAGCGGCAGCAGTGGAAGGGGGTTTCTTGGCAGAAGAAGAAGAGGAATCGAGGAGAGAGAGGCGCTCGCAGAGGCATAA

mRNA sequence

ATGTTCCAACGCTTCCCCCACCGACGGGACTTTTTCGACTCCTCTTTCCGGCTGCCGGTCCCTAGGACTGATGCAGCGTCGTTGAAAGCCGAGTCGAAGGACCCGGAGCAATCGTCGCGCAGGGAATTGGAGTTGGAGTCTCTATCCGGGATGGCCAGCTTGGGCCCATGGCAGATATCGGTGGTTGCAGGAGACACTTTCGCGTTCGAAACCGACTTCATCTCATCAATCTCAGCCACCACCGCCGCCGCAGTTTTCCTGATAGAGTTGGACCTGTCGAGGCTCTTCCGCCGACGAGAAGAAGAGAATTTGGGTATCGGAACGGAAGACATTGATTGGAGCATCTTCAACGACAGAAAGCATGGTGGGCATTCTTGGAAAGGTTCTGCTGATCAAATACCCATCCCAAGAAGCCCGAGGCTCGTCGAAAGAATAACGGGGATCGTCGAAGGACATCCGACCAGTATCGAGCGAGAACCTCGGATCAGCTTCTCCACAGGCAATGTCGTAGATGCACCGCCATTTCTCTGCTTCTTCTCCTTCTGTTTGTCCCTCCACTTCTGCAGCTTCTTGCTGAATACTGAAGCGGCGGACCAGAAGCTACCGGCAATCTCCTTCAAGTCCCTTCCAGAGGGCTTCTTGGTCACCAGCGACCCTAGAAGTAGAAGATTCCTTGAGAGACGGATGACGGGCGTCGTCCTGGGAAAAGAGAGTGCAGAGAGTGTTACGAACTCTAACGTCGCAAGATCTCCTCTGGGGCTCAAAGACGCCGGAGAAGGCCTCGTTCTTGGAGGCAGAGAAGGATTTGGTACGACGAAGCTCAGGAAAGAAAGAGGAAGGCTTAGTGCGGTTGGAAGAGGAGGGGAGTCCAGCAGCAGAACCAAGTCCGGGCCTGAAGATAGCCTTAAGAGCAGCGGCAGCAGTGGAAGGGGGTTTCTTGGCAGAAGAAGAAGAGGAATCGAGGAGAGAGAGGCGCTCGCAGAGGCATAA

Coding sequence (CDS)

ATGTTCCAACGCTTCCCCCACCGACGGGACTTTTTCGACTCCTCTTTCCGGCTGCCGGTCCCTAGGACTGATGCAGCGTCGTTGAAAGCCGAGTCGAAGGACCCGGAGCAATCGTCGCGCAGGGAATTGGAGTTGGAGTCTCTATCCGGGATGGCCAGCTTGGGCCCATGGCAGATATCGGTGGTTGCAGGAGACACTTTCGCGTTCGAAACCGACTTCATCTCATCAATCTCAGCCACCACCGCCGCCGCAGTTTTCCTGATAGAGTTGGACCTGTCGAGGCTCTTCCGCCGACGAGAAGAAGAGAATTTGGGTATCGGAACGGAAGACATTGATTGGAGCATCTTCAACGACAGAAAGCATGGTGGGCATTCTTGGAAAGGTTCTGCTGATCAAATACCCATCCCAAGAAGCCCGAGGCTCGTCGAAAGAATAACGGGGATCGTCGAAGGACATCCGACCAGTATCGAGCGAGAACCTCGGATCAGCTTCTCCACAGGCAATGTCGTAGATGCACCGCCATTTCTCTGCTTCTTCTCCTTCTGTTTGTCCCTCCACTTCTGCAGCTTCTTGCTGAATACTGAAGCGGCGGACCAGAAGCTACCGGCAATCTCCTTCAAGTCCCTTCCAGAGGGCTTCTTGGTCACCAGCGACCCTAGAAGTAGAAGATTCCTTGAGAGACGGATGACGGGCGTCGTCCTGGGAAAAGAGAGTGCAGAGAGTGTTACGAACTCTAACGTCGCAAGATCTCCTCTGGGGCTCAAAGACGCCGGAGAAGGCCTCGTTCTTGGAGGCAGAGAAGGATTTGGTACGACGAAGCTCAGGAAAGAAAGAGGAAGGCTTAGTGCGGTTGGAAGAGGAGGGGAGTCCAGCAGCAGAACCAAGTCCGGGCCTGAAGATAGCCTTAAGAGCAGCGGCAGCAGTGGAAGGGGGTTTCTTGGCAGAAGAAGAAGAGGAATCGAGGAGAGAGAGGCGCTCGCAGAGGCATAA

Protein sequence

MFQRFPHRRDFFDSSFRLPVPRTDAASLKAESKDPEQSSRRELELESLSGMASLGPWQISVVAGDTFAFETDFISSISATTAAAVFLIELDLSRLFRRREEENLGIGTEDIDWSIFNDRKHGGHSWKGSADQIPIPRSPRLVERITGIVEGHPTSIEREPRISFSTGNVVDAPPFLCFFSFCLSLHFCSFLLNTEAADQKLPAISFKSLPEGFLVTSDPRSRRFLERRMTGVVLGKESAESVTNSNVARSPLGLKDAGEGLVLGGREGFGTTKLRKERGRLSAVGRGGESSSRTKSGPEDSLKSSGSSGRGFLGRRRRGIEEREALAEA
Homology
BLAST of Sgr014493 vs. NCBI nr
Match: ACR35350.1 (unknown [Zea mays])

HSP 1 Score: 60.1 bits (144), Expect = 4.2e-05
Identity = 44/92 (47.83%), Postives = 46/92 (50.00%), Query Frame = 0

Query: 136 PRSPRLVERITGIVEGHPTSIEREPRISFSTGNVVDA----PPFLCFFSFCLSLHFCSFL 195
           PR  RL  R  G+ EG            FS G +  A    P   CF SFCL L FC FL
Sbjct: 32  PRRKRLRRRNDGL-EG------------FSGGGIAAAAALLPAASCFLSFCLRLQFCHFL 91

Query: 196 LNTEAADQKLPAISFKSLPEGFLVTSDPRSRR 224
           LNTEAA QKLPAIS  SL  GFL   D    R
Sbjct: 92  LNTEAASQKLPAISI-SLGGGFLGCCDCEDSR 109

BLAST of Sgr014493 vs. NCBI nr
Match: KAF7024648.1 (hypothetical protein CFC21_036961 [Triticum aestivum])

HSP 1 Score: 59.7 bits (143), Expect = 5.5e-05
Identity = 38/78 (48.72%), Postives = 43/78 (55.13%), Query Frame = 0

Query: 153 PTSIEREPRI------SFSTGNVVDAPPFL---CFFSFCLSLHFCSFLLNTEAADQKLPA 212
           P S  R  R+       FS G +  A   L    F SFCL LHFC FLLNTE A +KLPA
Sbjct: 24  PASPRRRRRLRNDGLDGFSGGGIAAAAALLTAASFLSFCLRLHFCHFLLNTE-APKKLPA 83

Query: 213 ISFKSLPEGFLVTSDPRS 222
           ISF+S   GF  + D RS
Sbjct: 84  ISFRSFGGGFTGSDDSRS 100

BLAST of Sgr014493 vs. NCBI nr
Match: KAF7038629.1 (hypothetical protein CFC21_048789 [Triticum aestivum])

HSP 1 Score: 58.9 bits (141), Expect = 9.4e-05
Identity = 34/61 (55.74%), Postives = 38/61 (62.30%), Query Frame = 0

Query: 164 FSTGNVVDAPPFL---CFFSFCLSLHFCSFLLNTEAADQKLPAISFKSLPEGFLVTSDPR 222
           FS G +  A   L    F SFCL LHFC FLLNTE A +KLPAISF+S   GF  + D R
Sbjct: 41  FSGGGIAAAAALLTAASFLSFCLRLHFCHFLLNTE-APKKLPAISFRSFGGGFTGSDDSR 100

BLAST of Sgr014493 vs. NCBI nr
Match: KAE8098808.1 (hypothetical protein FH972_016845 [Carpinus fangiana])

HSP 1 Score: 56.6 bits (135), Expect = 4.7e-04
Identity = 31/56 (55.36%), Postives = 35/56 (62.50%), Query Frame = 0

Query: 6  PHRRDFFDSSFRLPVPRTDAASLKAESKDPEQSSRRELE---LESLSGMASLGPWQ 59
          P R DF D  FR P P T+A SL   SK  EQSSR+E E   L+SLSG  S GPW+
Sbjct: 12 PFRLDFLDDPFRSPFPTTEAVSLNPISKVSEQSSRKEFEFESLDSLSGTTSFGPWK 67

BLAST of Sgr014493 vs. NCBI nr
Match: ONM01077.1 (hypothetical protein ZEAMMB73_Zm00001d030535 [Zea mays])

HSP 1 Score: 56.2 bits (134), Expect = 6.1e-04
Identity = 39/100 (39.00%), Postives = 46/100 (46.00%), Query Frame = 0

Query: 126 WKGSADQIPIPRSPRLVERITGIVEGHPTSIEREPRISFSTGNVVDA--PPFLCFFSFCL 185
           W  +  ++  PR  RL  R  G+               FS G +V A  P    F SF L
Sbjct: 22  WPPANSELASPRRKRLRRRNDGL-------------DGFSGGGIVAALLPTASYFLSFYL 81

Query: 186 SLHFCSFLLNTEAADQKLPAISFKSLPEGFLVTSDPRSRR 224
            L FC FLLN EA  QKLPAIS +SL   FL   D  + R
Sbjct: 82  PLQFCHFLLNIEATSQKLPAISLRSLGGSFLGCCDCEASR 108

BLAST of Sgr014493 vs. ExPASy TrEMBL
Match: A0A0A0LCW8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G732515 PE=4 SV=1)

HSP 1 Score: 77.4 bits (189), Expect = 1.2e-10
Identity = 34/55 (61.82%), Postives = 41/55 (74.55%), Query Frame = 0

Query: 111 IDWSIFNDRKHGGHSWKGSADQIPIPRSPRLVERITGIVEGHPTSIEREPRISFS 166
           +D  IFND KHGGHSW+ S DQIP+P   R +E ITGIV+GH  SIERE RI+ +
Sbjct: 1   MDRGIFNDGKHGGHSWERSTDQIPVPGCSRFIEGITGIVKGHSASIERESRINIA 55

BLAST of Sgr014493 vs. ExPASy TrEMBL
Match: A0A0A9K6X0 (Uncharacterized protein OS=Arundo donax OX=35708 PE=4 SV=1)

HSP 1 Score: 67.8 bits (164), Expect = 9.8e-08
Identity = 35/53 (66.04%), Postives = 37/53 (69.81%), Query Frame = 0

Query: 164 FSTGNVVDA--PPFLCFFSFCLSLHFCSFLLNTEAADQKLPAISFKSLPEGFL 215
           FS G +  A  P   CF SFCL LHFC FLLNTEAA QKLPAIS +SL  GFL
Sbjct: 42  FSGGGIAAALLPAASCFLSFCLRLHFCHFLLNTEAASQKLPAISLRSLVGGFL 94

BLAST of Sgr014493 vs. ExPASy TrEMBL
Match: A0A0A9J1C3 (Uncharacterized protein OS=Arundo donax OX=35708 PE=4 SV=1)

HSP 1 Score: 67.0 bits (162), Expect = 1.7e-07
Identity = 49/118 (41.53%), Postives = 58/118 (49.15%), Query Frame = 0

Query: 97  RRREEENLGIGTEDIDWSIFNDRKHGGHSWKGSADQIPIPRSPRLVERITGIVEGHPTSI 156
           R    ENLG        S+ ++R+   +S      + P PR  RL  R  G+        
Sbjct: 2   RPASRENLG--------SVSHERRPPSNS------EPPSPRWNRLRRRNEGL-------- 61

Query: 157 EREPRISFSTGNVVDAPPFLCFFSFCLSLHFCSFLLNTEAADQKLPAISFKSLPEGFL 215
                  FS G +  A    CF SFCL LHFC FLLNTEAA QKLPAIS +SL  GFL
Sbjct: 62  -----DGFSGGGIA-AAAASCFLSFCLRLHFCHFLLNTEAATQKLPAISLRSLGGGFL 91

BLAST of Sgr014493 vs. ExPASy TrEMBL
Match: C4J2F0 (Uncharacterized protein OS=Zea mays OX=4577 PE=2 SV=1)

HSP 1 Score: 60.1 bits (144), Expect = 2.0e-05
Identity = 44/92 (47.83%), Postives = 46/92 (50.00%), Query Frame = 0

Query: 136 PRSPRLVERITGIVEGHPTSIEREPRISFSTGNVVDA----PPFLCFFSFCLSLHFCSFL 195
           PR  RL  R  G+ EG            FS G +  A    P   CF SFCL L FC FL
Sbjct: 32  PRRKRLRRRNDGL-EG------------FSGGGIAAAAALLPAASCFLSFCLRLQFCHFL 91

Query: 196 LNTEAADQKLPAISFKSLPEGFLVTSDPRSRR 224
           LNTEAA QKLPAIS  SL  GFL   D    R
Sbjct: 92  LNTEAASQKLPAISI-SLGGGFLGCCDCEDSR 109

BLAST of Sgr014493 vs. ExPASy TrEMBL
Match: A0A3B6EMA8 (Uncharacterized protein OS=Triticum aestivum OX=4565 PE=4 SV=1)

HSP 1 Score: 59.7 bits (143), Expect = 2.7e-05
Identity = 38/78 (48.72%), Postives = 43/78 (55.13%), Query Frame = 0

Query: 153 PTSIEREPRI------SFSTGNVVDAPPFL---CFFSFCLSLHFCSFLLNTEAADQKLPA 212
           P S  R  R+       FS G +  A   L    F SFCL LHFC FLLNTE A +KLPA
Sbjct: 24  PASPRRRRRLRNDGLDGFSGGGIAAAAALLTAASFLSFCLRLHFCHFLLNTE-APKKLPA 83

Query: 213 ISFKSLPEGFLVTSDPRS 222
           ISF+S   GF  + D RS
Sbjct: 84  ISFRSFGGGFTGSDDSRS 100

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
ACR35350.14.2e-0547.83unknown [Zea mays][more]
KAF7024648.15.5e-0548.72hypothetical protein CFC21_036961 [Triticum aestivum][more]
KAF7038629.19.4e-0555.74hypothetical protein CFC21_048789 [Triticum aestivum][more]
KAE8098808.14.7e-0455.36hypothetical protein FH972_016845 [Carpinus fangiana][more]
ONM01077.16.1e-0439.00hypothetical protein ZEAMMB73_Zm00001d030535 [Zea mays][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0LCW81.2e-1061.82Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G732515 PE=4 SV=1[more]
A0A0A9K6X09.8e-0866.04Uncharacterized protein OS=Arundo donax OX=35708 PE=4 SV=1[more]
A0A0A9J1C31.7e-0741.53Uncharacterized protein OS=Arundo donax OX=35708 PE=4 SV=1[more]
C4J2F02.0e-0547.83Uncharacterized protein OS=Zea mays OX=4577 PE=2 SV=1[more]
A0A3B6EMA82.7e-0548.72Uncharacterized protein OS=Triticum aestivum OX=4565 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 275..329
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 314..329
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 287..308

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sgr014493.1Sgr014493.1mRNA