Sgr019862 (gene) Monk fruit (Qingpiguo) v1

Overview
NameSgr019862
Typegene
OrganismSiraitia grosvenorii (Monk fruit (Qingpiguo) v1)
DescriptionUnknown protein
Locationtig00153424: 199404 .. 202678 (+)
RNA-Seq ExpressionSgr019862
SyntenySgr019862
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTCTGCGAGTAAGTCTTACGATAGGTTGCAACTGTGAAACAACTCTCTGTGAATCGATGAACGTTCTGCCTGCAGGATAGAAGAGCTGCCACAGCGTGAGCACACGGTAGACCATACAGCTGCCAGCCCCGACAAAGGCAGCAACGGTTACGAATGTCCACGATGTTCGTTCCTTCGTGAGATATAACTTCAAATTCAGCTTCGTTTGCTCGAAGCACTTGATAGGTACGTGCGTGCTCGAGAGCCTCGGCAACACGCCTCTCGGCAGTAGGCACAAGTATTGACGTCCACTGCATGCTGGTCTCTCGCCTTTCATTGAACCATGTCATTAGCTGCCTCCTAATGCACTCCATCATTTGAATTATAGGAAGCCCAGAGGCTTCTGAAATCCAACTGTTTAATGATTCGATGATGTTTGCAGTCAAATGCCCAAACCTCGTTCCCTCGAAGTAAGCTGTAGCCCATAAGCGAGGGGGAATTCGGCGTATCCAGTATGCAGCATCTTGTGATATCTCTTCAATTTCCAAAACTTTTGCTTCAAATTCAATCACTGTGAGAGCATATGCAGCATCCCAGAGAAGTTTGACAAGCATTGGATTATTGAACTCTTTACGGAAGCTTTCACTCAAATGCCGCATGCAAAACCCATGGAAAGCAGTTGGAAAATTCGCTTCCACTCCATCCACGATGCATTTTAGCCTGTCTGATAAGATTGTAAGCCTCGGCATATTTTCAGTGTTAATCTCAAGCAGGTTATGAAGCTCAGAAAGAAACCACATCCAATTGTCATCATTTTCCTCATCAACAACACCAAACGCCAAAGGGAACAAAGCACCATCACCATCAAAACCAGTGGCAAGAAGTAAAGTACCGAGATATTTGCTTTTCAAGTATGTTCTATCAAGCCCGAGTAAAGGCCGACAAGCATTCAAAAAGCCATAAATTGATGCCTGGAAAGATATGAAGAGACGTTGGAAGCAGTTATCAGTTGAATTTCCATAAACTGATGCAATGCTTCCTGGGTTTGTTCTTTTAACCTGTTCGCAATACTGTGGAAGCAAGCGGTATCCTTCTTCAAATGAACCGCGCATGGCAGCCATGATACGCTCCTTGCCCCTCCAAGCTTGTTTGTATGACAAGGTAATACCATGAACACGGTGAATCTCCTCAAGTATCTCCTTTGGTTTATAATTAGGATTTTCTCGCAGTCGTTGCTCCATGGAGCTTGCAACCCACTGAACAGAGGCTTGCTGGTGGCCGAGATGATTAATCCCACCACACGTATGAGTGTCATGAATTGTCCTGATTGTGAAGGTAGGGACACCCGGAAGCTTTGCAGCATGAATACGCCATGGACATCCGTCAGCAGCACATTTGGCCGTAAAGCGTGTTTTATCAGATTTGATGGTTTGCACTTCAAAGTGCAAGGCAATAGCAGTGTCCCTCAGTGCCCTCCTACAGCTCTTAACGTCAGGGAACTCTTGTCCCACTGAAAGCTCATAACTAGGAGCTGTAATGATCGTGCGAGCCTGAAGCACAGAAGCAGGTGTAACCACAAGTTGAGACTGCTCGGCTGCCATATCCTCAACAGCTTGAATTGCCAATTCATCATTCTGATCCACAGATAGTTCAAGATTCTCATCAAGTTCTTGGTTTTCTGAAACAACCAACTGATTGTTATCGGACAAGGCTAACTCATGACCCTGTACAGGAAGAGACAACTGATGGCCAACTTGGTCAGGTTTCCTATCCATACCTAAATCACTTTCATGCGCATAATCGTGATCATCGTCCCCTCCTTGGTCATGACTTTGCCCTAAACCCAACTCATGATCGTGGGCATGCCCCAAACCATCGGGATCATGGGAATGTACCAAACCCTCAGGATGATCATGGGCATGTCCCAATCCCAATTCGTGGTCATGGTTCGGCCCCAAACCCAGATGATGCTCATGTGCTTGTCCTAGGTCCAAATTGTGACCCTGACCAAGACCCATATTATGATTGTGCCCTAACATCAACTGTTGGTTCTGCCCCAGAGCTAAATTGTGGTTTTGACCAAGTATCAGGTCATGGTTTGCCATTGAAATAGAAAATGAAAATCACTTGAACAAAACAGATGTTCCACAGTCCACAAAGGCAGCTACAAATTCAAAGGCCAGAAGACATCTGAACTGTATCATGCCACACCTGTTAAGATAAAAAACAGCTATTGCTCAGAAGATACCACCTACTAAATAACCTGTCTAAATATTATAAGCAACAACATTGATACAATACAAAACATAACAAAAATATAAATTCCCCATGAGATAACAGACCAAACTGAGACATCAGCTTCTTCAAAATTTGACACCCTGGGTCATTTGATATCATAAAACAGATGGATTATCAAATTGGAAGCCATTAGTAACAAAACTCAAACAGGCCTTTCAAGAATAAATGTGCTTCCAAATAATTAAATAACAAGAATAAACCAGTGAGGATCTGTAATCTTAAAAGAAGATCCATCATTATCTATGTATAAACAAACATACACAACTCAAATAGAAAAACAGTAAATGTCCAAATTTCCCATATAGTCACTAAACAAGATTTAACATCATTGACATCCCAAATTTCATGCATTATTCAACAGTTATCCAATATGAATATAAAGATAATAGTAAATGTAGTTTCATAACTTTCCTGACTCTCCATATAGACTTGAAACAAATATCTATAAGACACCCTCTCAAACGAACCTCAACACTACCAGTTACTCTTAGTTTTAAACATTTTCCCAACTGCTCAAACCTCCATTAAAATAGCAACTGAAAGTCTTGAACTGAAAAATCTTCAATGGCATATAATTGGCACACTCGAACATGACAACAATGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAATCACCTGAAGCAAACCCAGAGAGCGAATTCCAAATCAAGCACATTCAAGAAACCAGAAGAAGAAAAACTCAAAAAAGAAGAACTGGGTGTTCCTCCAAACTACAGGATGAAGAAATCCCAGAAGCGAAACGGCAGAAATCGACCTGGAGACTAGGGTTTGGGAATCGCGGAGGTTGAAAGGAAGAGGATGTGAATTTCGAAAAGGGGTTCGAAGGGGAAGGTTTTGGCGGGTAATTCAGTTGGAGAAAATCGGAAAAGAGAGAGAGAGGAGAGGGGCAAAGAAGGGAGGAGGGGAGGATGGGGGCTGGGGCGATGGGGAATTACAAGGGCAGAAGAGGGAGAGAGAGTTTGAGGGTTGA

mRNA sequence

ATGGTCTGCGAGATAGAAGAGCTGCCACAGCGTGAGCACACGGTAGACCATACAGCTGCCAGCCCCGACAAAGGCAGCAACGGTTACGAATGTCCACGATGTTCGTTCCTTCCTGTAGCCCATAAGCGAGGGGGAATTCGGCGTATCCAGTATGCAGCATCTTGTGATATCTCTTCAATTTCCAAAACTTTTGCTTCAAATTCAATCACTGTGAGAGCATATGCAGCATCCCAGAGAAGTTTGACAAGCATTGGATTATTGAACTCTTTACGGAAGCTTTCACTCAAATGCCGCATGCAAAACCCATGGAAAGCAGTTGGAAAATTCGCTTCCACTCCATCCACGATGCATTTTAGCCTGTCTGATAAGATTGTAAGCCTCGGCATATTTTCAGTGTTAATCTCAAGCAGCAAGCGGTATCCTTCTTCAAATGAACCGCGCATGGCAGCCATGATACGCTCCTTGCCCCTCCAAGCTTGTTTGTATGACAAGTCGTTGCTCCATGGAGCTTGCAACCCACTGAACAGAGGCTTGCTGGTGGCCGAGATGATTAATCCCACCACACGTATGAGTGTCATGAATTGTCCTGATTGTGAAGGTAGGGACACCCGGAAGCTTTGCAGCATGAATACGCCATGGACATCCGTCAGCAGCACATTTGGCCGTAAAGCGTGTTTTATCAGATTTGATGGTTTGCACTTCAAAGTGCAAGGCAATAGCAGTGTCCCTCAGTGCCCTCCTACAGCTCTTAACCTTGAATTGCCAATTCATCATTCTGATCCACAGATAGTTCAAGATTCTCATCAAGTTCTTGGTTTTCTGAAACAACCAACTGATTGTTATCGGACAAGGCTAACTCATGACCCTGTACAGGAAGAGACAACTGATGGCCAACTTGGTCAGGTTTCCTATCCATACCTAAATCACTTTCATGCGCATAATCGTGATCATCGTCCCCTCCTTGGTCATGACTTTGCCCTAAACCCAACTCATGATCGTGGGCATGCCCCAAACCATCGGGATCATGGGAATGTACCAAACCCTCAGGATGATCATGGGCATGTCCCAATCCCAATTCGTGGTCATGGTTCGGCCCCAAACCCAGATGATGCTCATGTGCTTGTCCTAGGTCCAAATTGTGACCCTGACCAAGACCCATATTATGATTGTGCCCTAACATCAACTGTTGGTTCTGCCCCAGAGCTAAATTGTGGTTTTGACCAAGTATCAGAAATCGACCTGGAGACTAGGGTTTGGGAATCGCGGAGGTTGAAAGGAAGAGGATGTGAATTTCGAAAAGGGGTTCGAAGGGGAAGGTTTTGGCGGGTAATTCAGTTGGAGAAAATCGGAAAAGAGAGAGAGAGGAGAGGGGCAAAGAAGGGAGGAGGGGAGGATGGGGGCTGGGGCGATGGGGAATTACAAGGGCAGAAGAGGGAGAGAGAGTTTGAGGGTTGA

Coding sequence (CDS)

ATGGTCTGCGAGATAGAAGAGCTGCCACAGCGTGAGCACACGGTAGACCATACAGCTGCCAGCCCCGACAAAGGCAGCAACGGTTACGAATGTCCACGATGTTCGTTCCTTCCTGTAGCCCATAAGCGAGGGGGAATTCGGCGTATCCAGTATGCAGCATCTTGTGATATCTCTTCAATTTCCAAAACTTTTGCTTCAAATTCAATCACTGTGAGAGCATATGCAGCATCCCAGAGAAGTTTGACAAGCATTGGATTATTGAACTCTTTACGGAAGCTTTCACTCAAATGCCGCATGCAAAACCCATGGAAAGCAGTTGGAAAATTCGCTTCCACTCCATCCACGATGCATTTTAGCCTGTCTGATAAGATTGTAAGCCTCGGCATATTTTCAGTGTTAATCTCAAGCAGCAAGCGGTATCCTTCTTCAAATGAACCGCGCATGGCAGCCATGATACGCTCCTTGCCCCTCCAAGCTTGTTTGTATGACAAGTCGTTGCTCCATGGAGCTTGCAACCCACTGAACAGAGGCTTGCTGGTGGCCGAGATGATTAATCCCACCACACGTATGAGTGTCATGAATTGTCCTGATTGTGAAGGTAGGGACACCCGGAAGCTTTGCAGCATGAATACGCCATGGACATCCGTCAGCAGCACATTTGGCCGTAAAGCGTGTTTTATCAGATTTGATGGTTTGCACTTCAAAGTGCAAGGCAATAGCAGTGTCCCTCAGTGCCCTCCTACAGCTCTTAACCTTGAATTGCCAATTCATCATTCTGATCCACAGATAGTTCAAGATTCTCATCAAGTTCTTGGTTTTCTGAAACAACCAACTGATTGTTATCGGACAAGGCTAACTCATGACCCTGTACAGGAAGAGACAACTGATGGCCAACTTGGTCAGGTTTCCTATCCATACCTAAATCACTTTCATGCGCATAATCGTGATCATCGTCCCCTCCTTGGTCATGACTTTGCCCTAAACCCAACTCATGATCGTGGGCATGCCCCAAACCATCGGGATCATGGGAATGTACCAAACCCTCAGGATGATCATGGGCATGTCCCAATCCCAATTCGTGGTCATGGTTCGGCCCCAAACCCAGATGATGCTCATGTGCTTGTCCTAGGTCCAAATTGTGACCCTGACCAAGACCCATATTATGATTGTGCCCTAACATCAACTGTTGGTTCTGCCCCAGAGCTAAATTGTGGTTTTGACCAAGTATCAGAAATCGACCTGGAGACTAGGGTTTGGGAATCGCGGAGGTTGAAAGGAAGAGGATGTGAATTTCGAAAAGGGGTTCGAAGGGGAAGGTTTTGGCGGGTAATTCAGTTGGAGAAAATCGGAAAAGAGAGAGAGAGGAGAGGGGCAAAGAAGGGAGGAGGGGAGGATGGGGGCTGGGGCGATGGGGAATTACAAGGGCAGAAGAGGGAGAGAGAGTTTGAGGGTTGA

Protein sequence

MVCEIEELPQREHTVDHTAASPDKGSNGYECPRCSFLPVAHKRGGIRRIQYAASCDISSISKTFASNSITVRAYAASQRSLTSIGLLNSLRKLSLKCRMQNPWKAVGKFASTPSTMHFSLSDKIVSLGIFSVLISSSKRYPSSNEPRMAAMIRSLPLQACLYDKSLLHGACNPLNRGLLVAEMINPTTRMSVMNCPDCEGRDTRKLCSMNTPWTSVSSTFGRKACFIRFDGLHFKVQGNSSVPQCPPTALNLELPIHHSDPQIVQDSHQVLGFLKQPTDCYRTRLTHDPVQEETTDGQLGQVSYPYLNHFHAHNRDHRPLLGHDFALNPTHDRGHAPNHRDHGNVPNPQDDHGHVPIPIRGHGSAPNPDDAHVLVLGPNCDPDQDPYYDCALTSTVGSAPELNCGFDQVSEIDLETRVWESRRLKGRGCEFRKGVRRGRFWRVIQLEKIGKERERRGAKKGGGEDGGWGDGELQGQKREREFEG
Homology
BLAST of Sgr019862 vs. NCBI nr
Match: KAE8637260.1 (hypothetical protein CSA_017676, partial [Cucumis sativus])

HSP 1 Score: 167.2 bits (422), Expect = 3.6e-37
Identity = 77/105 (73.33%), Postives = 80/105 (76.19%), Query Frame = 0

Query: 307 LNHFHAHNRDHRPLLGHDFALNPTHDRGHAPNHRDHGNVPNPQDDHGHVPIPIRGHGSAP 366
           LN FH HN DH PLLGHDFALNPT D G  PNH DHGNVP P+ DHGHVP P  GHG  P
Sbjct: 1   LNRFHVHNCDHCPLLGHDFALNPTRDHGRVPNHHDHGNVPTPRCDHGHVPSPTHGHGLTP 60

Query: 367 NPDDAHVLVLGPNCDPDQDPYYDCALTSTVGSAPELNCGFDQVSE 412
           N DDAHVLV GPNCDP QDPYYDC LTSTVGS  ELNCG   +S+
Sbjct: 61  NLDDAHVLVRGPNCDPGQDPYYDCGLTSTVGSVQELNCGSVPISD 105

BLAST of Sgr019862 vs. NCBI nr
Match: KZV24787.1 (hypothetical protein F511_34760 [Dorcoceras hygrometricum])

HSP 1 Score: 60.8 bits (146), Expect = 3.6e-05
Identity = 31/56 (55.36%), Postives = 38/56 (67.86%), Query Frame = 0

Query: 99  MQNPWKAVGKFASTPSTMHFSLSDKIVSLGIFSVLISSSKRYPSSNEPRMAAMIRS 155
           MQNPWKAVGK  ST STM+F L DKIV LG+FSVL+S   R    N  + +++  S
Sbjct: 1   MQNPWKAVGKLDSTASTMYFCLLDKIVILGMFSVLVSRRLRSSERNHIQFSSLSSS 56

BLAST of Sgr019862 vs. ExPASy TrEMBL
Match: A0A7C9EQX9 (Uncharacterized protein OS=Opuntia streptacantha OX=393608 PE=4 SV=1)

HSP 1 Score: 77.0 bits (188), Expect = 2.4e-10
Identity = 39/54 (72.22%), Postives = 44/54 (81.48%), Query Frame = 0

Query: 83  SIGLLNSLRKLSLKCRMQNPWKAVGKFASTPSTMHFSLSDKIVSLGIFSVLISS 137
           S+  LNSLRKLSL+C MQNPWKAVGKFASTPS M   L DK+VS+G+FSV  SS
Sbjct: 2   SVVSLNSLRKLSLRCLMQNPWKAVGKFASTPSIMPLCLLDKMVSIGMFSVFSSS 55

BLAST of Sgr019862 vs. ExPASy TrEMBL
Match: A0A2Z7AZU1 (Uncharacterized protein OS=Dorcoceras hygrometricum OX=472368 GN=F511_34760 PE=4 SV=1)

HSP 1 Score: 60.8 bits (146), Expect = 1.8e-05
Identity = 31/56 (55.36%), Postives = 38/56 (67.86%), Query Frame = 0

Query: 99  MQNPWKAVGKFASTPSTMHFSLSDKIVSLGIFSVLISSSKRYPSSNEPRMAAMIRS 155
           MQNPWKAVGK  ST STM+F L DKIV LG+FSVL+S   R    N  + +++  S
Sbjct: 1   MQNPWKAVGKLDSTASTMYFCLLDKIVILGMFSVLVSRRLRSSERNHIQFSSLSSS 56

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAE8637260.13.6e-3773.33hypothetical protein CSA_017676, partial [Cucumis sativus][more]
KZV24787.13.6e-0555.36hypothetical protein F511_34760 [Dorcoceras hygrometricum][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A7C9EQX92.4e-1072.22Uncharacterized protein OS=Opuntia streptacantha OX=393608 PE=4 SV=1[more]
A0A2Z7AZU11.8e-0555.36Uncharacterized protein OS=Dorcoceras hygrometricum OX=472368 GN=F511_34760 PE=4... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 452..484
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 329..353
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 329..348

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sgr019862.1Sgr019862.1mRNA