Sgr019794 (gene) Monk fruit (Qingpiguo) v1

Overview
NameSgr019794
Typegene
OrganismSiraitia grosvenorii (Monk fruit (Qingpiguo) v1)
DescriptionUnknown protein
Locationtig00153414: 341860 .. 344705 (+)
RNA-Seq ExpressionSgr019794
SyntenySgr019794
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGATATCTGTCCTCATTTATAGAAGTGCATTTTCACACATCCTTATTCTGGTGAACTTTGGTTGCCCCCACATCTTTCAACCAATCAACAACTTCCTTGATGGTTGGTCTTTTAAAGGGATTCTGACTGACACACATGCAGGCAACATCGAGAACTTGGAGCATCTCGTCTTCAAAGCCCTTCCTCTCAGGATTGGGTCAAAGACTTCATCTTGTTTGCCCTCATTCCTCATTTGTTGCACCCAGCCAACCAACTCTCGCGATGCCTTTGGCTTTGATATTTCTACAGGTCTTTTACCAGTAAGCAACTCAAGCACGACCACACCAAAACTGTACATATCTCCCCTCAAAGTGGCGACCCATGCTTGTCCATACTCTGGAGGAATATAGCCTAAGGTGCCAACAAGTTCGGTTGTAACATGAGTCTGGTAAGGATTGATCAATCTAGATAGTCCAAAATCTGCAACATGAGCTTCAAATTTCTCATCGAGCAGGATGTTACTGGACTTTATATCACGGTGTACAATATGTGGCTCACATATTTGGTGCATGTAAGCCAATCCAGAACTTGCTCCTCGCAAAATCTTCAATCGAGTTGGCCAATCAAGTTGGATGCACCGTCACCTTTTCATGCAACCAGTAATCTAAACTTCCATTTTCCATGTATGAATACATTAGCAGCCGACTGCCCTCATGCACACAGTAACCTTGAAGAGTAACCAGGTTCTGGTGTTTCGCTGCAGACAGAGCCTCTACTTCTGCTTTAAATTCCCTTTCCATCAGTCCCAAATCTCCCGAGAGTTTCTTCACAGCAAGTCGCGTCCCATTTGCTAATGTTGCTTTGTAAACCAATCCAAAACCACCGCAGCCAATTATGTTTCTTGATTGAAATCATCAGTGGCCTTCATTATGTCAGAAACGGTAAGCTCCTTGATGTTGTTGGCATTGTTTGGAAACAAAATGACTATGCTAGTGTTGTTGTCAGGATTGTAAGTAGAACTAATTGAAATTATGTCCAAATCCATTTTGTCGGTATCCCCTCTTGGATCAATCCTCCTCTTGGATAATATCCATAACGCTAGCAGGGTGATGATGAAAGCAATACCAAACAGGTACCCAAGACTAATCCTATGGCAAGTTTTTTACTTGAGCTTTTATTTTGAGCAGTGGAATGGGTTACTTTAGTCTGGTTAGAGCATGAGCGCTGCACTATAGGAGGACCACACAACCCTGAATTTCCTTCATAGCTAGAGCTAGGAAAGGTATCAAACTGACCCCCAGTTGGTATTGGTCCTTGAAGGTCGTTAAAGGCCACGCTAAACCAAGACAAGAAATGGAGACCTCTGAGGGAATGGGGGATTTCACCAGTTAGGTGATTATGAGAGAGGTCCAATCTCTCCAAGTTGCTGAGATTGGAAAGCGTATCTGGTATGCTGCCGGAGAAGCTGTTGTTGCTCAGGTCTAATACATGTATAACCTTCAATTGCCCAATCTCCAAGGGGATGGTGCCACTTATGGTGTTGTTGCCCAGGTATATAGCCGGTGGAAGGCTGGAGAGCTGATTGTATTGCTGATTGGTAGCATTGTTGGTGCAACAAAGACGGGCAATGCAAGAAAACTCTGCTTTGCCGGATCTAGAATCTGCTGCGACATCAAAGCCTGTAGTCTGCAAAGCTGGGTGGGGAATTCTCCGGAAATTCTGTTATTAGACAAGTCAACATAAAATAGACTCGGCAGATTGCCTAACCATTCAGGTATGGAACCCACAAGACGATTAAAAGACAGGTCCAAGACCTCTAGATTCCTGAGCTTTTCTATCCATGAGGGTACTTTACCAGTTAGCTGACAAGCTCCAAGAGCCAAAACCTGGATATTTTGGAATGTGTTTGCATCTACAATTATATCCCCATCCGGCAATGCTTCGCCATTATAACTATTGGAGAGGACAAGAGTTCCAAGGTTTCCGCAACCCATCAAATTCCTTAGTGCTCCAGTTAAGTTGGTTAGATTATTCTTGGAAATTGAGAGGAACGACAGTGATTGAAGTGCTGCAATTTCGTGAGAGATCGCTCCAGAAAACTGATTACTGGCCAATCTAACTGCCTTTAGAGACCTGCACGAGTAAAGAGTTGATGGTATACTACCTGTGAACATATTGTTGCCAAGGTCAAGTGTGGTGAGCCTAAGGAGGCGAGAGAAATTAAGATTAGAGAGATCTCCTTGCAGCTTATTGACCCGCAAATTCAACAAAGTAAGATTGGTGCAATTCATCAGTGATGTGGGCAGAGTGCCTGTGAGATTGTTAATGTGAAGCGATATCTGTTCCAGGTTGGAGAGCTTTCCAATATCGGTGGGATGGGGCCAATCAAAGAATTGGAGTACAATTCTAGGATTCTGAGGTCGGTGAGGTTAACAATTCCCTCGCCGATTTTCCCCGAAAGTGATTAACATGTAAAGAAAGCTCCTTTAGAGTGAAAACGTTGTAAAGGTCGGAGGGAATAGAACCAGAGAGACTATTGAAACCCGCTCGAAAAACCTCCAATTGGGACAGTTCCCCAGCCCCTGAGGTATTCCACCGCCGAATTCATTATAGGAAAAATCCAATAACCTGACGGAACTTATGGATGTAGTATTAACACAAAAGGAGGTTGGGATGAGACCTGTAAAACTATTGTTGCTGACATTGAAACTGGTCAAACTCCCCGAAATTGCCACTTGCTGAATAAACGAAGCGGTTATTGTCCCAGAGAACTGATTGCTGGACAAATCCAGAGTCTCAATCACGAGTCCGGACGATGAAGACGGCGGCGGGCGGCGGCAGCAAGTGGCCGGCAAGAAGGTTGTAG

mRNA sequence

ATGATATCTGTCCTCATTTATAGAAGTGCATTTTCACACATCCTTATTCTGGTGAACTTTGGTTGCCCCCACATCTTTCAACCAATCAACAACTTCCTTGATGGTTGGTCTTTTAAAGGGATTCTGACTGACACACATGCAGGCAACATCGAGAACTTGGAGCATCTCGTCTTCAAAGCCCTTCCTCTCAGGATTGGGTCAAAGACTTCATCTTGTTTGCCCTCATTCCTCATTTGTTGCACCCAGCCAACCAACTCTCGCGATGCCTTTGGCTTTGATATTTCTACAGGTCTTTTACCACAGCCGACTGCCCTCATGCACACAGTAACCTTGAAGAGTAACCAGGTTCTGGTGTTTCGCTGCAGACAGAGCCTCTACTTCTGCTTTAAATTCCCTTTCCATCAGTCCCAAATCTCCCGAGAGTTTCTTCACAGCAAAAACGGTAAGCTCCTTGATGTTGTTGGCATTGTTTGGAAACAAAATGACTATGCTAGTGTTGTTGTCAGGATTGTGATTATGAGAGAGGTCCAATCTCTCCAAGTTGCTGAGATTGGAAAGCGTATCTGGTATGCTGCCGGAGAAGCTGTTGTTGCTCAGGTATATAGCCGGTGGAAGGCTGGAGAGCTGATTGTATTGCTGATTGGTAGCATTGTTGGTGCAACAAAGACGGGCAATGCAAGAAAACTCTGCTTTGCCGGATCTAGAATCTGCTGCGACATCAAAGCCTGTAGTCTGCAAAGCTGGGTGGGGAATTCTCCGGAAATTCTGTTATTAGACAAGGTACTTTACCAGTTAGCTGACAAGCTCCAAGAGCCAAAACCTGGATATTTTGGAATGTGTTTGCATCTACAATTATATCCCCATCCGGCAATGCTTCGCCATTATAACTATTGGAGAGGACAAGAGTTCCAAGGTTTCCGCAACCCATCAAATTCCTTAGTGCTCCAGTTAAGTTGGTCAAGTGTGGTGAGCCTAAGGAGGCGAGAGAAATTAAGATTAGAGAGATCTCCTTGCAGCTTATTGACCCGCAAATTCAACAAAGTTGGAGAGCTTTCCAATATCGGTGGGATGGGGCCAATCAAAGAATTGGAGTACAATTCTAGGATTCTGAGGTCGGAGGGAATAGAACCAGAGAGACTATTGAAACCCGCTCGAAAAACCTCCAATTGGGACAGTTCCCCAGCCCCTGAGAGAACTGATTGCTGGACAAATCCAGAGTCTCAATCACGAGTCCGGACGATGAAGACGGCGGCGGGCGGCGGCAGCAAGTGGCCGGCAAGAAGGTTGTAG

Coding sequence (CDS)

ATGATATCTGTCCTCATTTATAGAAGTGCATTTTCACACATCCTTATTCTGGTGAACTTTGGTTGCCCCCACATCTTTCAACCAATCAACAACTTCCTTGATGGTTGGTCTTTTAAAGGGATTCTGACTGACACACATGCAGGCAACATCGAGAACTTGGAGCATCTCGTCTTCAAAGCCCTTCCTCTCAGGATTGGGTCAAAGACTTCATCTTGTTTGCCCTCATTCCTCATTTGTTGCACCCAGCCAACCAACTCTCGCGATGCCTTTGGCTTTGATATTTCTACAGGTCTTTTACCACAGCCGACTGCCCTCATGCACACAGTAACCTTGAAGAGTAACCAGGTTCTGGTGTTTCGCTGCAGACAGAGCCTCTACTTCTGCTTTAAATTCCCTTTCCATCAGTCCCAAATCTCCCGAGAGTTTCTTCACAGCAAAAACGGTAAGCTCCTTGATGTTGTTGGCATTGTTTGGAAACAAAATGACTATGCTAGTGTTGTTGTCAGGATTGTGATTATGAGAGAGGTCCAATCTCTCCAAGTTGCTGAGATTGGAAAGCGTATCTGGTATGCTGCCGGAGAAGCTGTTGTTGCTCAGGTATATAGCCGGTGGAAGGCTGGAGAGCTGATTGTATTGCTGATTGGTAGCATTGTTGGTGCAACAAAGACGGGCAATGCAAGAAAACTCTGCTTTGCCGGATCTAGAATCTGCTGCGACATCAAAGCCTGTAGTCTGCAAAGCTGGGTGGGGAATTCTCCGGAAATTCTGTTATTAGACAAGGTACTTTACCAGTTAGCTGACAAGCTCCAAGAGCCAAAACCTGGATATTTTGGAATGTGTTTGCATCTACAATTATATCCCCATCCGGCAATGCTTCGCCATTATAACTATTGGAGAGGACAAGAGTTCCAAGGTTTCCGCAACCCATCAAATTCCTTAGTGCTCCAGTTAAGTTGGTCAAGTGTGGTGAGCCTAAGGAGGCGAGAGAAATTAAGATTAGAGAGATCTCCTTGCAGCTTATTGACCCGCAAATTCAACAAAGTTGGAGAGCTTTCCAATATCGGTGGGATGGGGCCAATCAAAGAATTGGAGTACAATTCTAGGATTCTGAGGTCGGAGGGAATAGAACCAGAGAGACTATTGAAACCCGCTCGAAAAACCTCCAATTGGGACAGTTCCCCAGCCCCTGAGAGAACTGATTGCTGGACAAATCCAGAGTCTCAATCACGAGTCCGGACGATGAAGACGGCGGCGGGCGGCGGCAGCAAGTGGCCGGCAAGAAGGTTGTAG

Protein sequence

MISVLIYRSAFSHILILVNFGCPHIFQPINNFLDGWSFKGILTDTHAGNIENLEHLVFKALPLRIGSKTSSCLPSFLICCTQPTNSRDAFGFDISTGLLPQPTALMHTVTLKSNQVLVFRCRQSLYFCFKFPFHQSQISREFLHSKNGKLLDVVGIVWKQNDYASVVVRIVIMREVQSLQVAEIGKRIWYAAGEAVVAQVYSRWKAGELIVLLIGSIVGATKTGNARKLCFAGSRICCDIKACSLQSWVGNSPEILLLDKVLYQLADKLQEPKPGYFGMCLHLQLYPHPAMLRHYNYWRGQEFQGFRNPSNSLVLQLSWSSVVSLRRREKLRLERSPCSLLTRKFNKVGELSNIGGMGPIKELEYNSRILRSEGIEPERLLKPARKTSNWDSSPAPERTDCWTNPESQSRVRTMKTAAGGGSKWPARRL
Homology
BLAST of Sgr019794 vs. NCBI nr
Match: XP_021912785.1 (uncharacterized protein LOC110826446 [Carica papaya])

HSP 1 Score: 66.2 bits (160), Expect = 7.7e-07
Identity = 35/60 (58.33%), Postives = 41/60 (68.33%), Query Frame = 0

Query: 59  KALPLRIGSKTSSCLPSFLICCTQPTNSRDAFGFDISTGLLPQPTALMHTVTLKSNQVLV 118
           K LPLR GSKTSS LPS LI CTQPTNS D FG + STGL P  ++ M T  L ++ + V
Sbjct: 39  KPLPLRRGSKTSSSLPSLLISCTQPTNSLDIFGLNTSTGLFPVSSSSMTTPKLYTSPLNV 98

BLAST of Sgr019794 vs. NCBI nr
Match: TYH47497.1 (hypothetical protein ES332_D10G001800v1 [Gossypium tomentosum])

HSP 1 Score: 56.6 bits (135), Expect = 6.1e-04
Identity = 23/49 (46.94%), Postives = 33/49 (67.35%), Query Frame = 0

Query: 11  FSHILILVNFGCPHIFQPINNFLDGWSFKGILTDTHAGNIENLEHLVFK 60
           FS I ILV+  C +  +P+NNFLD W  + IL D  AG+I++L+HL  +
Sbjct: 60  FSRIFILVSIDCSNTLEPLNNFLDSWPLERILVDAQAGHIQHLQHLFIR 108

BLAST of Sgr019794 vs. ExPASy TrEMBL
Match: A0A5D2IY05 (Uncharacterized protein OS=Gossypium tomentosum OX=34277 GN=ES332_D10G001800v1 PE=4 SV=1)

HSP 1 Score: 56.6 bits (135), Expect = 2.9e-04
Identity = 23/49 (46.94%), Postives = 33/49 (67.35%), Query Frame = 0

Query: 11  FSHILILVNFGCPHIFQPINNFLDGWSFKGILTDTHAGNIENLEHLVFK 60
           FS I ILV+  C +  +P+NNFLD W  + IL D  AG+I++L+HL  +
Sbjct: 60  FSRIFILVSIDCSNTLEPLNNFLDSWPLERILVDAQAGHIQHLQHLFIR 108

BLAST of Sgr019794 vs. ExPASy TrEMBL
Match: A0A5D2NJ87 (Uncharacterized protein OS=Gossypium tomentosum OX=34277 GN=ES332_A10G001100v1 PE=4 SV=1)

HSP 1 Score: 55.5 bits (132), Expect = 6.5e-04
Identity = 23/49 (46.94%), Postives = 33/49 (67.35%), Query Frame = 0

Query: 11 FSHILILVNFGCPHIFQPINNFLDGWSFKGILTDTHAGNIENLEHLVFK 60
          FS I ILV+  C +  +P+NNFLD W  + IL D  AG+I++L+HL  +
Sbjct: 50 FSRIFILVSIDCSNTREPLNNFLDSWPLERILVDAQAGHIQHLQHLFIR 98

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_021912785.17.7e-0758.33uncharacterized protein LOC110826446 [Carica papaya][more]
TYH47497.16.1e-0446.94hypothetical protein ES332_D10G001800v1 [Gossypium tomentosum][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A5D2IY052.9e-0446.94Uncharacterized protein OS=Gossypium tomentosum OX=34277 GN=ES332_D10G001800v1 P... [more]
A0A5D2NJ876.5e-0446.94Uncharacterized protein OS=Gossypium tomentosum OX=34277 GN=ES332_A10G001100v1 P... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 386..415
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 381..429

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sgr019794.1Sgr019794.1mRNA