Sgr019794.1 (mRNA) Monk fruit (Qingpiguo) v1

Overview
NameSgr019794.1
TypemRNA
OrganismSiraitia grosvenorii (Monk fruit (Qingpiguo) v1)
DescriptionUnknown protein
Locationtig00153414: 341860 .. 344705 (+)
Sequence length1290
RNA-Seq ExpressionSgr019794.1
SyntenySgr019794.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGATATCTGTCCTCATTTATAGAAGTGCATTTTCACACATCCTTATTCTGGTGAACTTTGGTTGCCCCCACATCTTTCAACCAATCAACAACTTCCTTGATGGTTGGTCTTTTAAAGGGATTCTGACTGACACACATGCAGGCAACATCGAGAACTTGGAGCATCTCGTCTTCAAAGCCCTTCCTCTCAGGATTGGGTCAAAGACTTCATCTTGTTTGCCCTCATTCCTCATTTGTTGCACCCAGCCAACCAACTCTCGCGATGCCTTTGGCTTTGATATTTCTACAGGTCTTTTACCAGTAAGCAACTCAAGCACGACCACACCAAAACTGTACATATCTCCCCTCAAAGTGGCGACCCATGCTTGTCCATACTCTGGAGGAATATAGCCTAAGGTGCCAACAAGTTCGGTTGTAACATGAGTCTGGTAAGGATTGATCAATCTAGATAGTCCAAAATCTGCAACATGAGCTTCAAATTTCTCATCGAGCAGGATGTTACTGGACTTTATATCACGGTGTACAATATGTGGCTCACATATTTGGTGCATGTAAGCCAATCCAGAACTTGCTCCTCGCAAAATCTTCAATCGAGTTGGCCAATCAAGTTGGATGCACCGTCACCTTTTCATGCAACCAGTAATCTAAACTTCCATTTTCCATGTATGAATACATTAGCAGCCGACTGCCCTCATGCACACAGTAACCTTGAAGAGTAACCAGGTTCTGGTGTTTCGCTGCAGACAGAGCCTCTACTTCTGCTTTAAATTCCCTTTCCATCAGTCCCAAATCTCCCGAGAGTTTCTTCACAGCAAGTCGCGTCCCATTTGCTAATGTTGCTTTGTAAACCAATCCAAAACCACCGCAGCCAATTATGTTTCTTGATTGAAATCATCAGTGGCCTTCATTATGTCAGAAACGGTAAGCTCCTTGATGTTGTTGGCATTGTTTGGAAACAAAATGACTATGCTAGTGTTGTTGTCAGGATTGTAAGTAGAACTAATTGAAATTATGTCCAAATCCATTTTGTCGGTATCCCCTCTTGGATCAATCCTCCTCTTGGATAATATCCATAACGCTAGCAGGGTGATGATGAAAGCAATACCAAACAGGTACCCAAGACTAATCCTATGGCAAGTTTTTTACTTGAGCTTTTATTTTGAGCAGTGGAATGGGTTACTTTAGTCTGGTTAGAGCATGAGCGCTGCACTATAGGAGGACCACACAACCCTGAATTTCCTTCATAGCTAGAGCTAGGAAAGGTATCAAACTGACCCCCAGTTGGTATTGGTCCTTGAAGGTCGTTAAAGGCCACGCTAAACCAAGACAAGAAATGGAGACCTCTGAGGGAATGGGGGATTTCACCAGTTAGGTGATTATGAGAGAGGTCCAATCTCTCCAAGTTGCTGAGATTGGAAAGCGTATCTGGTATGCTGCCGGAGAAGCTGTTGTTGCTCAGGTCTAATACATGTATAACCTTCAATTGCCCAATCTCCAAGGGGATGGTGCCACTTATGGTGTTGTTGCCCAGGTATATAGCCGGTGGAAGGCTGGAGAGCTGATTGTATTGCTGATTGGTAGCATTGTTGGTGCAACAAAGACGGGCAATGCAAGAAAACTCTGCTTTGCCGGATCTAGAATCTGCTGCGACATCAAAGCCTGTAGTCTGCAAAGCTGGGTGGGGAATTCTCCGGAAATTCTGTTATTAGACAAGTCAACATAAAATAGACTCGGCAGATTGCCTAACCATTCAGGTATGGAACCCACAAGACGATTAAAAGACAGGTCCAAGACCTCTAGATTCCTGAGCTTTTCTATCCATGAGGGTACTTTACCAGTTAGCTGACAAGCTCCAAGAGCCAAAACCTGGATATTTTGGAATGTGTTTGCATCTACAATTATATCCCCATCCGGCAATGCTTCGCCATTATAACTATTGGAGAGGACAAGAGTTCCAAGGTTTCCGCAACCCATCAAATTCCTTAGTGCTCCAGTTAAGTTGGTTAGATTATTCTTGGAAATTGAGAGGAACGACAGTGATTGAAGTGCTGCAATTTCGTGAGAGATCGCTCCAGAAAACTGATTACTGGCCAATCTAACTGCCTTTAGAGACCTGCACGAGTAAAGAGTTGATGGTATACTACCTGTGAACATATTGTTGCCAAGGTCAAGTGTGGTGAGCCTAAGGAGGCGAGAGAAATTAAGATTAGAGAGATCTCCTTGCAGCTTATTGACCCGCAAATTCAACAAAGTAAGATTGGTGCAATTCATCAGTGATGTGGGCAGAGTGCCTGTGAGATTGTTAATGTGAAGCGATATCTGTTCCAGGTTGGAGAGCTTTCCAATATCGGTGGGATGGGGCCAATCAAAGAATTGGAGTACAATTCTAGGATTCTGAGGTCGGTGAGGTTAACAATTCCCTCGCCGATTTTCCCCGAAAGTGATTAACATGTAAAGAAAGCTCCTTTAGAGTGAAAACGTTGTAAAGGTCGGAGGGAATAGAACCAGAGAGACTATTGAAACCCGCTCGAAAAACCTCCAATTGGGACAGTTCCCCAGCCCCTGAGGTATTCCACCGCCGAATTCATTATAGGAAAAATCCAATAACCTGACGGAACTTATGGATGTAGTATTAACACAAAAGGAGGTTGGGATGAGACCTGTAAAACTATTGTTGCTGACATTGAAACTGGTCAAACTCCCCGAAATTGCCACTTGCTGAATAAACGAAGCGGTTATTGTCCCAGAGAACTGATTGCTGGACAAATCCAGAGTCTCAATCACGAGTCCGGACGATGAAGACGGCGGCGGGCGGCGGCAGCAAGTGGCCGGCAAGAAGGTTGTAG

mRNA sequence

ATGATATCTGTCCTCATTTATAGAAGTGCATTTTCACACATCCTTATTCTGGTGAACTTTGGTTGCCCCCACATCTTTCAACCAATCAACAACTTCCTTGATGGTTGGTCTTTTAAAGGGATTCTGACTGACACACATGCAGGCAACATCGAGAACTTGGAGCATCTCGTCTTCAAAGCCCTTCCTCTCAGGATTGGGTCAAAGACTTCATCTTGTTTGCCCTCATTCCTCATTTGTTGCACCCAGCCAACCAACTCTCGCGATGCCTTTGGCTTTGATATTTCTACAGGTCTTTTACCACAGCCGACTGCCCTCATGCACACAGTAACCTTGAAGAGTAACCAGGTTCTGGTGTTTCGCTGCAGACAGAGCCTCTACTTCTGCTTTAAATTCCCTTTCCATCAGTCCCAAATCTCCCGAGAGTTTCTTCACAGCAAAAACGGTAAGCTCCTTGATGTTGTTGGCATTGTTTGGAAACAAAATGACTATGCTAGTGTTGTTGTCAGGATTGTGATTATGAGAGAGGTCCAATCTCTCCAAGTTGCTGAGATTGGAAAGCGTATCTGGTATGCTGCCGGAGAAGCTGTTGTTGCTCAGGTATATAGCCGGTGGAAGGCTGGAGAGCTGATTGTATTGCTGATTGGTAGCATTGTTGGTGCAACAAAGACGGGCAATGCAAGAAAACTCTGCTTTGCCGGATCTAGAATCTGCTGCGACATCAAAGCCTGTAGTCTGCAAAGCTGGGTGGGGAATTCTCCGGAAATTCTGTTATTAGACAAGGTACTTTACCAGTTAGCTGACAAGCTCCAAGAGCCAAAACCTGGATATTTTGGAATGTGTTTGCATCTACAATTATATCCCCATCCGGCAATGCTTCGCCATTATAACTATTGGAGAGGACAAGAGTTCCAAGGTTTCCGCAACCCATCAAATTCCTTAGTGCTCCAGTTAAGTTGGTCAAGTGTGGTGAGCCTAAGGAGGCGAGAGAAATTAAGATTAGAGAGATCTCCTTGCAGCTTATTGACCCGCAAATTCAACAAAGTTGGAGAGCTTTCCAATATCGGTGGGATGGGGCCAATCAAAGAATTGGAGTACAATTCTAGGATTCTGAGGTCGGAGGGAATAGAACCAGAGAGACTATTGAAACCCGCTCGAAAAACCTCCAATTGGGACAGTTCCCCAGCCCCTGAGAGAACTGATTGCTGGACAAATCCAGAGTCTCAATCACGAGTCCGGACGATGAAGACGGCGGCGGGCGGCGGCAGCAAGTGGCCGGCAAGAAGGTTGTAG

Coding sequence (CDS)

ATGATATCTGTCCTCATTTATAGAAGTGCATTTTCACACATCCTTATTCTGGTGAACTTTGGTTGCCCCCACATCTTTCAACCAATCAACAACTTCCTTGATGGTTGGTCTTTTAAAGGGATTCTGACTGACACACATGCAGGCAACATCGAGAACTTGGAGCATCTCGTCTTCAAAGCCCTTCCTCTCAGGATTGGGTCAAAGACTTCATCTTGTTTGCCCTCATTCCTCATTTGTTGCACCCAGCCAACCAACTCTCGCGATGCCTTTGGCTTTGATATTTCTACAGGTCTTTTACCACAGCCGACTGCCCTCATGCACACAGTAACCTTGAAGAGTAACCAGGTTCTGGTGTTTCGCTGCAGACAGAGCCTCTACTTCTGCTTTAAATTCCCTTTCCATCAGTCCCAAATCTCCCGAGAGTTTCTTCACAGCAAAAACGGTAAGCTCCTTGATGTTGTTGGCATTGTTTGGAAACAAAATGACTATGCTAGTGTTGTTGTCAGGATTGTGATTATGAGAGAGGTCCAATCTCTCCAAGTTGCTGAGATTGGAAAGCGTATCTGGTATGCTGCCGGAGAAGCTGTTGTTGCTCAGGTATATAGCCGGTGGAAGGCTGGAGAGCTGATTGTATTGCTGATTGGTAGCATTGTTGGTGCAACAAAGACGGGCAATGCAAGAAAACTCTGCTTTGCCGGATCTAGAATCTGCTGCGACATCAAAGCCTGTAGTCTGCAAAGCTGGGTGGGGAATTCTCCGGAAATTCTGTTATTAGACAAGGTACTTTACCAGTTAGCTGACAAGCTCCAAGAGCCAAAACCTGGATATTTTGGAATGTGTTTGCATCTACAATTATATCCCCATCCGGCAATGCTTCGCCATTATAACTATTGGAGAGGACAAGAGTTCCAAGGTTTCCGCAACCCATCAAATTCCTTAGTGCTCCAGTTAAGTTGGTCAAGTGTGGTGAGCCTAAGGAGGCGAGAGAAATTAAGATTAGAGAGATCTCCTTGCAGCTTATTGACCCGCAAATTCAACAAAGTTGGAGAGCTTTCCAATATCGGTGGGATGGGGCCAATCAAAGAATTGGAGTACAATTCTAGGATTCTGAGGTCGGAGGGAATAGAACCAGAGAGACTATTGAAACCCGCTCGAAAAACCTCCAATTGGGACAGTTCCCCAGCCCCTGAGAGAACTGATTGCTGGACAAATCCAGAGTCTCAATCACGAGTCCGGACGATGAAGACGGCGGCGGGCGGCGGCAGCAAGTGGCCGGCAAGAAGGTTGTAG

Protein sequence

MISVLIYRSAFSHILILVNFGCPHIFQPINNFLDGWSFKGILTDTHAGNIENLEHLVFKALPLRIGSKTSSCLPSFLICCTQPTNSRDAFGFDISTGLLPQPTALMHTVTLKSNQVLVFRCRQSLYFCFKFPFHQSQISREFLHSKNGKLLDVVGIVWKQNDYASVVVRIVIMREVQSLQVAEIGKRIWYAAGEAVVAQVYSRWKAGELIVLLIGSIVGATKTGNARKLCFAGSRICCDIKACSLQSWVGNSPEILLLDKVLYQLADKLQEPKPGYFGMCLHLQLYPHPAMLRHYNYWRGQEFQGFRNPSNSLVLQLSWSSVVSLRRREKLRLERSPCSLLTRKFNKVGELSNIGGMGPIKELEYNSRILRSEGIEPERLLKPARKTSNWDSSPAPERTDCWTNPESQSRVRTMKTAAGGGSKWPARRL
Homology
BLAST of Sgr019794.1 vs. NCBI nr
Match: XP_021912785.1 (uncharacterized protein LOC110826446 [Carica papaya])

HSP 1 Score: 66.2 bits (160), Expect = 7.7e-07
Identity = 35/60 (58.33%), Postives = 41/60 (68.33%), Query Frame = 0

Query: 59  KALPLRIGSKTSSCLPSFLICCTQPTNSRDAFGFDISTGLLPQPTALMHTVTLKSNQVLV 118
           K LPLR GSKTSS LPS LI CTQPTNS D FG + STGL P  ++ M T  L ++ + V
Sbjct: 39  KPLPLRRGSKTSSSLPSLLISCTQPTNSLDIFGLNTSTGLFPVSSSSMTTPKLYTSPLNV 98

BLAST of Sgr019794.1 vs. NCBI nr
Match: TYH47497.1 (hypothetical protein ES332_D10G001800v1 [Gossypium tomentosum])

HSP 1 Score: 56.6 bits (135), Expect = 6.1e-04
Identity = 23/49 (46.94%), Postives = 33/49 (67.35%), Query Frame = 0

Query: 11  FSHILILVNFGCPHIFQPINNFLDGWSFKGILTDTHAGNIENLEHLVFK 60
           FS I ILV+  C +  +P+NNFLD W  + IL D  AG+I++L+HL  +
Sbjct: 60  FSRIFILVSIDCSNTLEPLNNFLDSWPLERILVDAQAGHIQHLQHLFIR 108

BLAST of Sgr019794.1 vs. ExPASy TrEMBL
Match: A0A5D2IY05 (Uncharacterized protein OS=Gossypium tomentosum OX=34277 GN=ES332_D10G001800v1 PE=4 SV=1)

HSP 1 Score: 56.6 bits (135), Expect = 2.9e-04
Identity = 23/49 (46.94%), Postives = 33/49 (67.35%), Query Frame = 0

Query: 11  FSHILILVNFGCPHIFQPINNFLDGWSFKGILTDTHAGNIENLEHLVFK 60
           FS I ILV+  C +  +P+NNFLD W  + IL D  AG+I++L+HL  +
Sbjct: 60  FSRIFILVSIDCSNTLEPLNNFLDSWPLERILVDAQAGHIQHLQHLFIR 108

BLAST of Sgr019794.1 vs. ExPASy TrEMBL
Match: A0A5D2NJ87 (Uncharacterized protein OS=Gossypium tomentosum OX=34277 GN=ES332_A10G001100v1 PE=4 SV=1)

HSP 1 Score: 55.5 bits (132), Expect = 6.5e-04
Identity = 23/49 (46.94%), Postives = 33/49 (67.35%), Query Frame = 0

Query: 11 FSHILILVNFGCPHIFQPINNFLDGWSFKGILTDTHAGNIENLEHLVFK 60
          FS I ILV+  C +  +P+NNFLD W  + IL D  AG+I++L+HL  +
Sbjct: 50 FSRIFILVSIDCSNTREPLNNFLDSWPLERILVDAQAGHIQHLQHLFIR 98

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_021912785.17.7e-0758.33uncharacterized protein LOC110826446 [Carica papaya][more]
TYH47497.16.1e-0446.94hypothetical protein ES332_D10G001800v1 [Gossypium tomentosum][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A5D2IY052.9e-0446.94Uncharacterized protein OS=Gossypium tomentosum OX=34277 GN=ES332_D10G001800v1 P... [more]
A0A5D2NJ876.5e-0446.94Uncharacterized protein OS=Gossypium tomentosum OX=34277 GN=ES332_A10G001100v1 P... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 386..415
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 381..429

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Sgr019794Sgr019794gene


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Sgr019794.1.exon1Sgr019794.1.exon1exon
Sgr019794.1.exon2Sgr019794.1.exon2exon
Sgr019794.1.exon3Sgr019794.1.exon3exon
Sgr019794.1.exon4Sgr019794.1.exon4exon
Sgr019794.1.exon5Sgr019794.1.exon5exon
Sgr019794.1.exon6Sgr019794.1.exon6exon
Sgr019794.1.exon7Sgr019794.1.exon7exon
Sgr019794.1.exon8Sgr019794.1.exon8exon
Sgr019794.1.exon9Sgr019794.1.exon9exon
Sgr019794.1.exon10Sgr019794.1.exon10exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
cds.Sgr019794.1cds.Sgr019794.1CDS
cds.Sgr019794.1cds.Sgr019794.1_2CDS
cds.Sgr019794.1cds.Sgr019794.1_3CDS
cds.Sgr019794.1cds.Sgr019794.1_4CDS
cds.Sgr019794.1cds.Sgr019794.1_5CDS
cds.Sgr019794.1cds.Sgr019794.1_6CDS
cds.Sgr019794.1cds.Sgr019794.1_7CDS
cds.Sgr019794.1cds.Sgr019794.1_8CDS
cds.Sgr019794.1cds.Sgr019794.1_9CDS
cds.Sgr019794.1cds.Sgr019794.1_10CDS


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Sgr019794.1Sgr019794.1-proteinpolypeptide