Sgr015000 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr015000
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Unknown protein
Location: tig00002486: 1002294 .. 1006331 (+)
RNA-Seq Expression: Sgr015000
Synteny: Sgr015000
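
As a quick check on the Location field, the span of the gene on the contig can be computed from its start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF-style convention; the page itself does not state this). `parse_location` and `span_length` are hypothetical helpers written for illustration, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'contig: start .. end (strand)' into its parts (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    contig, start, end, strand = m.groups()
    return contig, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

contig, start, end, strand = parse_location("tig00002486: 1002294 .. 1006331 (+)")
print(contig, strand, span_length(start, end))
```

Under that assumption the gene spans 4,038 bp on the plus strand of tig00002486.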
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGGGATGGCGAATGGGTCTTCAAGAATTTCTATCCTACTGCTGGAAACAGTGGACGATGAACTTGGGCGTTGATACGGGGCTTTAATCCACTGAAACCAAGATGAACTCGAGGAAGTGTCCCGGAAGGGCTCTCTCTGCCTGCTGCAGCCGTAGGGCTCGATGGGGGCGTCGCGAACTCGTCCACTGGTTGTAATTCTTCATATTTTGTGAAGGTAACAATCACTCTGATGGTTGGAACCACAGGGATGGCAACCTGTGCAAAAAATATAGCATCAATAACCAATTTAACTGAAGAACGCCAATACTACTTCAAGTTAATTAAACAGTCCAACACACTTAATATGGTAACATGGCAGATATTGCATTACCCTACTTACTGACTCTGGCTCGGAACCGACACTGCTTTCTATTGAAAAGAACAAAAACTTCACCATGTTTAGAACTAATAATCACGAATAGAATAAAATGAAAAAGAATCCATTGACTCAACCGAGCTGATAAAGGCCAGAAGGGATATTAGCTAACTTAGTCCTCAGAACCGAAATAGGCAATCGAAGGACTGATAGTCCCATCAACTACACGATGAAGTGCAATTCAAAATTTTGTATTTATTTCGTGAGATAGAATCATTCTCTATCTGTTTCGTGCTTTAATATGGAGTTTGCATAGATTAGGGGTACAATGACCAAGCATCATCTTTCAATAACTTGCTCATCACAGAACAGATCGATTGATAGTGATACATTTCTTATTAAAAATAAAAAAACAAAAACATTATTGGCTAACAAAGAATCATCAGTTCCCAATCATCATTTTACCAATCAAAGCAAGAATACGGTACAAGATTTATAGGTAGGCATAGGAAATAAGGTCAATCAAATAATTAGTAAACATACCATAGATTCATAGAGCTAAGGAAATGAAAAAGAAGCTACATGCAGATCAGATTTTCAAAGCTGATGCCAGTACTTCAAATCCAAACAGCCTAAGTTGTTCTCATAAGAAACCAAACTCTCAAAAAAGCTTTAATCAATTGGTAACGGTCCAACACTACTGCCAACCTTGATTTACTTGGGATTGAATTCATCAACAATAAAAGTTGTTATAGCCCAAATATTGGTGAGATGGAGCCCTACCTAAAACCATTGTCCTAACCATTAATGAGGGCCGATAGCTCAAATGCGTTATGCACACACAAAAATTGAGAAAGAATTCTTTCACATCCACACCACTGTCACTTTAGGTATAGTTTTTGTGTTACTTATTACTAGGCTTGTCCTTGGTGTGTATTCGTAAGTTTTCACATACCAACAAAAAAAAAATTGTTTCCCGTTATAAAATTTTGTTTAAAAAAGAAGGGGAAAAAAAAGTAAACCCATAGTGAACATCAAGTTAAGCTGATGTCTTCGCATGCTTGGAGTATTGAAATTGTTAAGCTACAACAAATTCCAAAGTTTAGTTTCTTCGATCTAATACTAGCAAACTTCTGAGCACATGTCATATGCACAATTTAGATCGTTGACTATAATGTGACACTTAAAACAATGGACCCCTCATGGGGGAATCCTGCAAGCATTCTCTTTCCAAATCACCTACACTCCAGCTACGGAGTATTGGTGTTGAATCCTACTGCTAAAGATGAGCACAGCAACACAGAACACAGCCAAGAAGGCAACAGGCAGTGATTCAAACTAGGCTAGAAGGCCTCTTCTGCTTCTTTTTTTTGTGGCTGGGGGGGGGGGGGGGGGGGGTGGAAGTAAACAATTAAGACAGAAGAAAAAGAAAATTATCCTCCAGTTAGCTTCGTTAAATATAAATTTCAGCTAATAGTACAAGTATCATACCTTGACTGGAAAGGTTCCCATTGGGAGTTTTGTGGTAAGCAGTTCTCTCAAACGACGAACTGCCTTGACTTTGTTAGCTAAAATATCCAGCAATGGCAATAGTTCATCAGTTTGCAGAGGAAAGTTTTGAGAAAGCCAAAGAACGGGTCTCA
ACCCTTTCTTATACTCATTTTCATGACTCCCATCCTTGTGGCGATTTTTGCTCTCGGACGCCGAAGAGGTACTTCTGATTTCCCTCCCTCTTCGGTGCTCGGAAATTATCTCTACAGAATGTCTTCCAGGTCTGCTGCTATGACTTTCAGGAGGACAATCCCCTAGAAGATCACCTACCTTATCATCCATAGAAAGGGAGCTTCTTGGAGGAGCAATCTTCTTCTGTCCTTCATGTTTGCTATCCCGTTTCCTCCATGCCCCAAACCATCCTTTCTTTTCCTGCCTAATCTCTCCATTTCTACATCCATTTGCATCCTCAACAGGAACTTCCCTTTGCTCAAAACAACTATGGCGATGCCCAACAATCCCATCACCATTCTCAGAAGTTAATTCAGATGAATCTAATTTAAGTGCATTCTCAAGTTGCTTTTTTTCATCCTCTGTTAAAATGTCATCAAGCTCCTCGCTCTCAGTTTCATTTTCATTACAAGATGAGAAGAACTCATCATCCGTCATAGCCCCAGGGACCCTCCTAGATTTGATACTTACAACCACATTATGCATGTCATATACTTTAGCTTTCCAGGCACCCACCATTTCGGTTTTCTCCTGGCGCCTCCAAGTCAACTGGGGTAAAAGAACGGCCTGAGTGACATCAATTCCAGGTCTAAATATATTGGTCTGAGACATTGCAATCACTTCTTGTCTAATCTCCTCTCAGTTGCCTGAGCTCCAGCACCATCCAAAGCATTCATTACCTCCTTCTCCTTGTGTGAGACCATACAGAGTGAACCAGAAGGTACTTTCCCATCCTCTGTCCCATCACCAAGAAAAAGAATACTCTGCTCAGCACGTTGGATCCTAAATCCATCAAACCCAGCCAGTGTCATATCTGCCCTCAAGTTAGCACCTCTCTTCCAGATCTTGTAAGTATCTGAAGGGGCAATCCTAGAAATGAAAGGAATCACCGAACTCTCAAAGTGGAATGTGATTTCCATGTAGAAATCCCTCATTCTTCGCATGGTCTCGATCAAGCGAGGCAATCTCCGACACCATTTTGCCCAAGCCAATGGCTGGTAATGCCTAACTATAATCATAGCAATCCCTTCTTGTCTACTGCAAATTGCTTCTTGAAGTGCACTCCAACCATGCTCATTCTGCAAGCTCCAATCTGCCCCAGCAACCATAAGCATTTCGGTAGCAGTCTCATCCCCGAGCTTTACAGCCAAATGCAGTGGGGTGTCCCGATTTGGGACATCCCGACGATCAACCATGGCAGAAATAGCATCAGCCTTGTTTTCCTCAGCTAATGAAGCAGCCTCAGTGCGAATCTCAGATGGATTACAAAGCCGAGGAAGGCTTGCAAGGATTCTCTTGAGATTGGTATAATCCCTGGAGGCCACAGCCTTGTGTACGGGACTATGCGCATACTTCGAAACATCAATACCGGCCATGGCTCAGGAGTATTCAAATCAAGAGTCATTCAACTCAACAACTCAAGAAAGATGGAGGAAGTCCACTGTTTGGAAATTAAACTCAATAAGCACAGAGGAGCACACGAGCCACGACATAATGATTAAGGAGAAGCAAATTCAACCAAGACGAATAAGCCCAGATCATAATTAGCCAAGAAAAAGAAGAATTTCTTGAATCCTTCCCGGGGTTCCAGCAGACTCAATTGACTCTGGACCCGGCCGTCACGAGAAGACAGACGACAGCCCAATAGAGAAGGTTGACGCCAATATCAAACAAAGAAGAAAGGTATCAGAGGTGGGCCACCGCGAATATCCTTCAAGTCAAGAAATCTCACCTTTTCCTCCTCCTGCTTGCTTCTTCCATGACCCCTCCAGAGACCAAGATTACAGGAAAAGAGACTCAACCCCACCCTCTACTCTCTCTCTCCTCCCTCCCTCCCTCCCTCCCTCTTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTGCAAGTAGAAAAAGAACAAAAGATCCTTGTGATCCAATCCA
GATCAGACAAACCCAAAAACATGCATGTGGGTGCGTGA

mRNA sequence

ATGGGGGATGGCGAATGGGTCTTCAAGAATTTCTATCCTACTGCTGGAAACACCGTAGGGCTCGATGGGGGCGTCGCGAACTCGTCCACTGGTTGTAATTCTTCATATTTTGTGAAGATCGTTGACTATAATGTGACACTTAAAACAATGGACCCCTCATGGGGGAATCCTGCAAGCATTCTCTTTCCAAATCACCTACACTCCAGCTACGGAGTATTGGTGTTGAATCCTACTGCTAAAGATGAGCACAGCAACACAGAACACAGCCAAGAAGGCAACAGGCATTTGCAGAGGAAAGTTTTGAGAAAGCCAAAGAACGGGTCTCAACCCTTTCTTATACTCATTTTCATGACTCCCATCCTTGTGGCGATTTTTGCTCTCGGACGCCGAAGAGGAGGACAATCCCCTAGAAGATCACCTACCTTATCATCCATAGAAAGGGAGCTTCTTGGAGGAGCAATCTTCTTCTGTCCTTCATGTTTGCTATCCCGTTTCCTCCATGCCCCAAACCATCCTTTCTTTTCCTGCCTAATCTCTCCATTTCTACATCCATTTGCATCCTCAACAGGAACTTCCCTTTGCTCAAAACAACTATGGCGATGCCCAACAATCCCATCACCATTCTCAGAAGTTAATTCAGATGAATCTAATTTAAGTGCATTCTCAAGTTGCTTTTTTTCATCCTCTGTTAAAATGTCATCAAGCTCCTCGCTCTCAGTTTCATTTTCATTACAAGATGAGAAGAACTCATCATCCGTCATAGCCCCAGGGACCCTCCTAGATTTGATACTTACAACCACATTATGCATGTCATATACTTTAGCTTTCCAGGCACCCACCATTTCGGTTTTCTCCTGGCGCCTCCAAGTCAACTGGGCACCATCCAAAGCATTCATTACCTCCTTCTCCTTGTGTGAGACCATACAGAGTGAACCAGAAGGTACTTTCCCATCCTCTGTCCCATCACCAAGAAAAAGAATACTCTGCTCAGCACGTTGGATCCTAAATCCATCAAACCCAGCCAGTGTCATATCTGCCCTCAAGTTAGCACCTCTCTTCCAGATCTTAAATCCCTCATTCTTCGCATGGTCTCGATCAAGCGAGGCAATCTCCGACACCATTTTGCCCAAGCCAATGGCTGCAACCATAAGCATTTCGGTAGCAGTCTCATCCCCGAGCTTTACAGCCAAATGCAGTGGGGTGTCCCGATTTGGGACATCCCGACGATCAACCATGGCAGAAATAGCATCAGCCTTGTTTTCCTCAGCTAATGAAGCAGCCTCACCTTGTGTACGGGACTATGCGCATACTTCGAAACATCAATACCGGCCATGGCTCAGGAGTATTCAAATCAAGAGTCATTCAACTCAACAACTCAAGAAAGATGGAGGAACAGACTCAATTGACTCTGGACCCGGCCGTCACGAGAAGACAGACGACAGCCCAATAGAGAAGGTTGACGCCAATATCAAACAAAGAAGAAAGGTATCAGAGGTGGGCCACCGCGAATATCCTTCAAGTCAAGAAATCTCACCTTTTCCTCCTCCTGCTTGCTTCTTCCATGACCCCTCCAGAGACCAAGATTACAGGAAAAGAGACTCAACCCCACCCTCTACTCTCTCTCTCCTCCCTCCCTCCCTCCCTCCCTCTTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTGCAAGTAGAAAAAGAACAAAAGATCCTTGTGATCCAATCCAGATCAGACAAACCCAAAAACATGCATGTGGGTGCGTGA

Coding sequence (CDS)

ATGGGGGATGGCGAATGGGTCTTCAAGAATTTCTATCCTACTGCTGGAAACACCGTAGGGCTCGATGGGGGCGTCGCGAACTCGTCCACTGGTTGTAATTCTTCATATTTTGTGAAGATCGTTGACTATAATGTGACACTTAAAACAATGGACCCCTCATGGGGGAATCCTGCAAGCATTCTCTTTCCAAATCACCTACACTCCAGCTACGGAGTATTGGTGTTGAATCCTACTGCTAAAGATGAGCACAGCAACACAGAACACAGCCAAGAAGGCAACAGGCATTTGCAGAGGAAAGTTTTGAGAAAGCCAAAGAACGGGTCTCAACCCTTTCTTATACTCATTTTCATGACTCCCATCCTTGTGGCGATTTTTGCTCTCGGACGCCGAAGAGGAGGACAATCCCCTAGAAGATCACCTACCTTATCATCCATAGAAAGGGAGCTTCTTGGAGGAGCAATCTTCTTCTGTCCTTCATGTTTGCTATCCCGTTTCCTCCATGCCCCAAACCATCCTTTCTTTTCCTGCCTAATCTCTCCATTTCTACATCCATTTGCATCCTCAACAGGAACTTCCCTTTGCTCAAAACAACTATGGCGATGCCCAACAATCCCATCACCATTCTCAGAAGTTAATTCAGATGAATCTAATTTAAGTGCATTCTCAAGTTGCTTTTTTTCATCCTCTGTTAAAATGTCATCAAGCTCCTCGCTCTCAGTTTCATTTTCATTACAAGATGAGAAGAACTCATCATCCGTCATAGCCCCAGGGACCCTCCTAGATTTGATACTTACAACCACATTATGCATGTCATATACTTTAGCTTTCCAGGCACCCACCATTTCGGTTTTCTCCTGGCGCCTCCAAGTCAACTGGGCACCATCCAAAGCATTCATTACCTCCTTCTCCTTGTGTGAGACCATACAGAGTGAACCAGAAGGTACTTTCCCATCCTCTGTCCCATCACCAAGAAAAAGAATACTCTGCTCAGCACGTTGGATCCTAAATCCATCAAACCCAGCCAGTGTCATATCTGCCCTCAAGTTAGCACCTCTCTTCCAGATCTTAAATCCCTCATTCTTCGCATGGTCTCGATCAAGCGAGGCAATCTCCGACACCATTTTGCCCAAGCCAATGGCTGCAACCATAAGCATTTCGGTAGCAGTCTCATCCCCGAGCTTTACAGCCAAATGCAGTGGGGTGTCCCGATTTGGGACATCCCGACGATCAACCATGGCAGAAATAGCATCAGCCTTGTTTTCCTCAGCTAATGAAGCAGCCTCACCTTGTGTACGGGACTATGCGCATACTTCGAAACATCAATACCGGCCATGGCTCAGGAGTATTCAAATCAAGAGTCATTCAACTCAACAACTCAAGAAAGATGGAGGAACAGACTCAATTGACTCTGGACCCGGCCGTCACGAGAAGACAGACGACAGCCCAATAGAGAAGGTTGACGCCAATATCAAACAAAGAAGAAAGGTATCAGAGGTGGGCCACCGCGAATATCCTTCAAGTCAAGAAATCTCACCTTTTCCTCCTCCTGCTTGCTTCTTCCATGACCCCTCCAGAGACCAAGATTACAGGAAAAGAGACTCAACCCCACCCTCTACTCTCTCTCTCCTCCCTCCCTCCCTCCCTCCCTCTTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTGCAAGTAGAAAAAGAACAAAAGATCCTTGTGATCCAATCCAGATCAGACAAACCCAAAAACATGCATGTGGGTGCGTGA

Protein sequence

MGDGEWVFKNFYPTAGNTVGLDGGVANSSTGCNSSYFVKIVDYNVTLKTMDPSWGNPASILFPNHLHSSYGVLVLNPTAKDEHSNTEHSQEGNRHLQRKVLRKPKNGSQPFLILIFMTPILVAIFALGRRRGGQSPRRSPTLSSIERELLGGAIFFCPSCLLSRFLHAPNHPFFSCLISPFLHPFASSTGTSLCSKQLWRCPTIPSPFSEVNSDESNLSAFSSCFFSSSVKMSSSSSLSVSFSLQDEKNSSSVIAPGTLLDLILTTTLCMSYTLAFQAPTISVFSWRLQVNWAPSKAFITSFSLCETIQSEPEGTFPSSVPSPRKRILCSARWILNPSNPASVISALKLAPLFQILNPSFFAWSRSSEAISDTILPKPMAATISISVAVSSPSFTAKCSGVSRFGTSRRSTMAEIASALFSSANEAASPCVRDYAHTSKHQYRPWLRSIQIKSHSTQQLKKDGGTDSIDSGPGRHEKTDDSPIEKVDANIKQRRKVSEVGHREYPSSQEISPFPPPACFFHDPSRDQDYRKRDSTPPSTLSLLPPSLPPSFSLSLSLSLSLQVEKEQKILVIQSRSDKPKNMHVGA
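
The protein sequence above should correspond to a codon-by-codon translation of the CDS. A minimal sketch of that translation follows, assuming the standard nuclear genetic code (NCBI translation table 1), which the page does not state explicitly. The `translate` function is a hypothetical helper, and only the first and last few codons of the CDS are used as input here.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds, to_stop=True):
    """Translate a coding sequence in frame 0 (hypothetical helper).

    Unknown codons (e.g. containing N) become 'X'; a trailing partial
    codon is ignored. With to_stop=True, translation ends before the
    first stop codon; otherwise stops are written as '*'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)

# First 17 codons of the CDS listed above.
print(translate("ATGGGGGATGGCGAATGGGTCTTCAAGAATTTCTATCCTACTGCTGGAAAC"))
# Last 7 codons of the CDS, including the TGA stop.
print(translate("AACATGCATGTGGGTGCGTGA", to_stop=False))
```

The first call reproduces the start of the listed protein (MGDGEWVFKNFYPTAGN) and the second reproduces its end (NMHVGA) followed by the stop.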
Homology
BLAST of Sgr015000 vs. ExPASy TrEMBL
Match: A0A0A9P8Q3 (Uncharacterized protein OS=Arundo donax OX=35708 PE=4 SV=1)

HSP 1 Score: 99.8 bits (247), Expect = 4.1e-17
Identity = 64/138 (46.38%), Positives = 73/138 (52.90%), Query Frame = 0

Query: 254 IAPGTLLDLILTTTLCMSYTLAFQAPTISVFSWRLQVN---------------------- 313
           +APGT LD  LTTT C+SYTLAF  PT SV S R  V+                      
Sbjct: 1   MAPGTFLDFTLTTTWCISYTLAFHGPTASVLSCRRHVSCGSSTAWVTSIPGRKMLVCDIA 60

Query: 314 ----W---------APSKAFITSFSLCETIQSEPEGTFPSSVPSPRKRILCSARWILNPS 357
               W         APS AFI S SL   + S+P GT PSS PSPRK ++ S RWIL PS
Sbjct: 61  VTSCWTSASEAGAPAPSSAFIISLSLWLIMHSDPGGTLPSSEPSPRKSMVWSERWILKPS 120

BLAST of Sgr015000 vs. ExPASy TrEMBL
Match: A0A0A9K0K9 (Uncharacterized protein OS=Arundo donax OX=35708 PE=4 SV=1)

HSP 1 Score: 99.8 bits (247), Expect = 4.1e-17
Identity = 64/138 (46.38%), Positives = 73/138 (52.90%), Query Frame = 0

Query: 254 IAPGTLLDLILTTTLCMSYTLAFQAPTISVFSWRLQVN---------------------- 313
           +APGT LD  LTTT C+SYTLAF  PT SV S R  V+                      
Sbjct: 1   MAPGTFLDFTLTTTWCISYTLAFHGPTASVLSCRRHVSCGSSTAWVTSIPGRKMLVCDIA 60

Query: 314 ----W---------APSKAFITSFSLCETIQSEPEGTFPSSVPSPRKRILCSARWILNPS 357
               W         APS AFI S SL   + S+P GT PSS PSPRK ++ S RWIL PS
Sbjct: 61  VTSCWISASEAGAPAPSSAFIISLSLWLIMHSDPGGTLPSSEPSPRKSMVWSERWILKPS 120

BLAST of Sgr015000 vs. ExPASy TrEMBL
Match: A0A0A9M313 (Uncharacterized protein OS=Arundo donax OX=35708 PE=4 SV=1)

HSP 1 Score: 75.1 bits (183), Expect = 1.1e-09
Identity = 39/64 (60.94%), Positives = 45/64 (70.31%), Query Frame = 0

Query: 293 APSKAFITSFSLCETIQSEPEGTFPSSVPSPRKRILCSARWILNPSNPASVISALKLAPL 352
           APS AFI S SL  T+ S+P  T PSS PSPRK ++ S RWIL PS PASV+SA +LAP 
Sbjct: 22  APSSAFIISLSLWLTMHSDPGATLPSSEPSPRKSMVWSERWILRPSKPASVMSARRLAPR 81

Query: 353 FQIL 357
           F  L
Sbjct: 82  FHTL 85

BLAST of Sgr015000 vs. ExPASy TrEMBL
Match: A0A0A9RY64 (Uncharacterized protein OS=Arundo donax OX=35708 PE=4 SV=1)

HSP 1 Score: 57.4 bits (137), Expect = 2.4e-04
Identity = 33/53 (62.26%), Positives = 37/53 (69.81%), Query Frame = 0

Query: 249 NSSSVIAPGTLLDLILTTTLCMSYTLAFQAPTISVFSWRLQVNWAPSKAFITS 302
           +SSSVIAPGTLLD  +T T  MSYTLAFQ+PT SV S R Q     S A +TS
Sbjct: 4   SSSSVIAPGTLLDFTVTRTFNMSYTLAFQSPTASVLSCRRQFKCGTSSACVTS 56
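
The Identity and Positives percentages in the HSP headers above are the match counts divided by the alignment length. The sketch below recomputes them from the counts reported on this page. The `percent` helper and the `hsps` dictionary are illustrative names, not BLAST output fields.

```python
def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage with two decimals."""
    return f"{100 * matches / aligned_length:.2f}"

# (identities, positives, alignment length) as reported for each HSP above.
hsps = {
    "A0A0A9P8Q3": (64, 73, 138),
    "A0A0A9K0K9": (64, 73, 138),
    "A0A0A9M313": (39, 45, 64),
    "A0A0A9RY64": (33, 37, 53),
}

for accession, (identities, positives, length) in hsps.items():
    print(accession, percent(identities, length), percent(positives, length))
```

Note that a higher percent identity does not imply a stronger hit: A0A0A9RY64 has 62.26% identity over only 53 aligned positions and the weakest E-value (2.4e-04) of the four matches.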

The following BLAST results are available for this feature:
Match Name    E-value    Identity (%)    Description
A0A0A9P8Q3    4.1e-17    46.38           Uncharacterized protein OS=Arundo donax OX=35708 PE=4 SV=1
A0A0A9K0K9    4.1e-17    46.38           Uncharacterized protein OS=Arundo donax OX=35708 PE=4 SV=1
A0A0A9M313    1.1e-09    60.94           Uncharacterized protein OS=Arundo donax OX=35708 PE=4 SV=1
A0A0A9RY64    2.4e-04    62.26           Uncharacterized protein OS=Arundo donax OX=35708 PE=4 SV=1
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term    IPR Description     Source         Source Term    Source Description     Alignment
None        No IPR available    MOBIDB_LITE    mobidb-lite    disorder_prediction    coord: 467..502
None        No IPR available    MOBIDB_LITE    mobidb-lite    disorder_prediction    coord: 453..546

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name    Type
Sgr015000.1     Sgr015000.1    mRNA