Sgr025423 (gene) Monk fruit (Qingpiguo) v1

Overview
NameSgr025423
Typegene
OrganismSiraitia grosvenorii (Monk fruit (Qingpiguo) v1)
DescriptionUnknown protein
Locationtig00006406: 1208201 .. 1209787 (+)
RNA-Seq ExpressionSgr025423
SyntenySgr025423
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCTGGGAAAAAAGGAAGAAAAGAAAAGGGTTTCGGAGGAGACGGCGGTGGGGGAGTTTTTCCGACCACCGCATTACGTCCGTCCGTCGAAATCAGGCGCACCCACGTGGAGAGAATCCGACCCGAGAACCCGCCAAATCGTAGACAACCCGAATCCTTGCTGCTGGATGTTGCCGATAATCGACAGCCCGCTGGTTGTACCGGCGAAGGCGAAGCAGAAGCGGCCGCTGCCGTCCACCGGTATCAGATAATTGGACGCTGGTAATGACACGTCAGCGTCCCTGAAATGCAACACCACCGTCGGAACCTTCACCGTCGTCTTCCCCGACAGGTCGTAGCACGTGTCGAAGAGAGAAAACTCAGGCGCTGATTTCAAACTCGAAGCTCCTGCACGGAAGGCGTCGCGCAAGGCAATGTACGCCGGCCGGTTTATCGGGTAACGGATGTACCGCAATCGATGATGACGCCGCCGTTGCCGGCCGGATCGAGCTTGAAATGTGATGCTGAGATGCCGGAGACGGGCGTGCCTCCGACGCTGATCCCTAACAGTTCGACGTAGTAAAAGGTGTCCAGCCTAGGGTTTGTGAGCAGAGGAGTGAACCGGGCGGTTCGCGAGACGGCGGAGTCGCCGAAGACGACGGAGGACGGTTTGGAAGAGGCGGACCGGTCCACCAAGCAGTAGGAGAACTTCTGGTTGAAACTCCGTCCGGATTGCGAAGGGAACGATAACCCTCCCCGGCCGAGACCTAGAAGCCCCGCAGCACCAACGAACAAACCCTCATTATCGTGGCCACAGCCAAGGGCGACACGTTCCACTTTGGTCCGCCGGAAGGTGAGGGTTTCGGTTACGAACTCCCCGGTGGTGTACGAACCGTCGCCGTAAGAAACCTGGTAGAGGCATGTCTGCCGTTGGTTGCATCCCGGAGATTCGAGGCGGCGACACAGAGGCGTCCGGCAAGGGACTTTGGCGAAGGATCCGGATTTGACCGGGTTGAAAACCGGGTCGGCCTGGGAGTAGCAATTCTTGCAGGGGGCGCACTGCAGCCAGACGATGTCGCTTCCGGTGTCCAACACCATGTAAACATACTTGGGAGGCGTGCCGACGCCGATGCGCGTGAAGTACTCGCCGCTGCCCTGAGCGAGTCCCGAGATCACCGAGCTACTGAACCCGGTGCCGGTCCCACTCCCACTCGCTTGGCTCACATTCCGAGAACCACCCTGACCCAGTGAACTCAGCTTCGTGACTCGGAGCGCGTCTCTCTGAAGCCTTAGCTGGAAGAGCTCCTCCGGCGTTCTGTTCAGGGACAGAGCGTCCAAATGGTGAAGCTGCAACGTGAAGCCGGTCTCCGCGTCGCCGCCCTCCGAAGAGAAAAACGACTCGAAACCGTCGTCGGATTCAGGCCGTGAAAGGGTAGGTGAGGTGGGAAGAGGTCTGGGTATTAGGGTTTGGAAGTCGGAGACAGCGGTGGAGAGGGAGAGAAGAGTGAGAAGGAAGGAGATAAAGGGAAATGCAGTAGCTTTTGCCTCCATTTTAGAGAGAGAGAGAAGACGTACATGTTCTGCAGAGGTGAGAGAGTATAAATAA

mRNA sequence

ATGTCTGGGAAAAAAGGAAGAAAAGAAAAGGGTTTCGGAGGAGACGGCGGTGGGGGAGTTTTTCCGACCACCGCATTACGTCCGTCCGTCGAAATCAGGCGCACCCACGTGGAGAGAATCCGACCCGAGAACCCGCCAAATCGTAGACAACCCGAATCCTTGCTGCTGGATGTTGCCGATAATCGACAGCCCGCTGGTTGTACCGGCGAAGGCGAAGCAGAAGCGGCCGCTGCCGTCCACCGGTATCAGATAATTGGACGCTGGCGTCGCGCAAGGCAATGTACGCCGGCCGGTTTATCGGGTAACGGATGTACCGCAATCGATGATGACGCCGCCGTTGCCGGCCGGATCGAGCTTGAAATCCTAGGGTTTGTGAGCAGAGGAGTGAACCGGGCGGTTCGCGAGACGGCGGAGTCGCCGAAGACGACGGAGGACGGTTTGGAAGAGGCGGACCGGTCCACCAAGCACACCAACGAACAAACCCTCATTATCGTGGCCACAGCCAAGGGCGACACGTTCCACTTTGGTCCGCCGGAAGGTGAGGGTTTCGGTTACGAACTCCCCGGTGGTGTACGAACCGTCGCCGTAAGAAACCTGGTAGAGGCATGTCTGCCGTTGGTTGCATCCCGGAGATTCGAGGCGGCGACACAGAGGCGTCCGGCAAGGGACTTTGGCGAAGGATCCGGATTTGACCGGGTTGAAAACCGGGTCGGCCTGGGAGTAGCAATTCTTGCAGGGGGCGCACTGCAGCCAGACGATGTCGCTTCCGGTGTCCAACACCATGTAAACATACTTGGGAGGCGTGCCGACGCCGATGCGCGTGAAGTACTCGCCGCTGCCCTGAGCGAGTCCCGAGATCACCGAGCTACTGAACCCGGTGCCGGTCCCACTCCCACTCGCTTGGCTCACATTCCGAGAACCACCCTGACCCACTGCAACGTGAAGCCGGTCTCCGCGTCGCCGCCCTCCGAAGAGAAAAACGACTCGAAACCGTCGTCGGATTCAGGCCGTGAAAGGGTAGGTGAGGTGGGAAGAGGTCTGGGTATTAGGGTTTGGAAGTCGGAGACAGCGGTGGAGAGGGAGAGAAGAGTGAGAAGGAAGGAGATAAAGGGAAATGCAGTAGCTTTTGCCTCCATTTTAGAGAGAGAGAGAAGACGTACATGTTCTGCAGAGGTGAGAGAGTATAAATAA

Coding sequence (CDS)

ATGTCTGGGAAAAAAGGAAGAAAAGAAAAGGGTTTCGGAGGAGACGGCGGTGGGGGAGTTTTTCCGACCACCGCATTACGTCCGTCCGTCGAAATCAGGCGCACCCACGTGGAGAGAATCCGACCCGAGAACCCGCCAAATCGTAGACAACCCGAATCCTTGCTGCTGGATGTTGCCGATAATCGACAGCCCGCTGGTTGTACCGGCGAAGGCGAAGCAGAAGCGGCCGCTGCCGTCCACCGGTATCAGATAATTGGACGCTGGCGTCGCGCAAGGCAATGTACGCCGGCCGGTTTATCGGGTAACGGATGTACCGCAATCGATGATGACGCCGCCGTTGCCGGCCGGATCGAGCTTGAAATCCTAGGGTTTGTGAGCAGAGGAGTGAACCGGGCGGTTCGCGAGACGGCGGAGTCGCCGAAGACGACGGAGGACGGTTTGGAAGAGGCGGACCGGTCCACCAAGCACACCAACGAACAAACCCTCATTATCGTGGCCACAGCCAAGGGCGACACGTTCCACTTTGGTCCGCCGGAAGGTGAGGGTTTCGGTTACGAACTCCCCGGTGGTGTACGAACCGTCGCCGTAAGAAACCTGGTAGAGGCATGTCTGCCGTTGGTTGCATCCCGGAGATTCGAGGCGGCGACACAGAGGCGTCCGGCAAGGGACTTTGGCGAAGGATCCGGATTTGACCGGGTTGAAAACCGGGTCGGCCTGGGAGTAGCAATTCTTGCAGGGGGCGCACTGCAGCCAGACGATGTCGCTTCCGGTGTCCAACACCATGTAAACATACTTGGGAGGCGTGCCGACGCCGATGCGCGTGAAGTACTCGCCGCTGCCCTGAGCGAGTCCCGAGATCACCGAGCTACTGAACCCGGTGCCGGTCCCACTCCCACTCGCTTGGCTCACATTCCGAGAACCACCCTGACCCACTGCAACGTGAAGCCGGTCTCCGCGTCGCCGCCCTCCGAAGAGAAAAACGACTCGAAACCGTCGTCGGATTCAGGCCGTGAAAGGGTAGGTGAGGTGGGAAGAGGTCTGGGTATTAGGGTTTGGAAGTCGGAGACAGCGGTGGAGAGGGAGAGAAGAGTGAGAAGGAAGGAGATAAAGGGAAATGCAGTAGCTTTTGCCTCCATTTTAGAGAGAGAGAGAAGACGTACATGTTCTGCAGAGGTGAGAGAGTATAAATAA

Protein sequence

MSGKKGRKEKGFGGDGGGGVFPTTALRPSVEIRRTHVERIRPENPPNRRQPESLLLDVADNRQPAGCTGEGEAEAAAAVHRYQIIGRWRRARQCTPAGLSGNGCTAIDDDAAVAGRIELEILGFVSRGVNRAVRETAESPKTTEDGLEEADRSTKHTNEQTLIIVATAKGDTFHFGPPEGEGFGYELPGGVRTVAVRNLVEACLPLVASRRFEAATQRRPARDFGEGSGFDRVENRVGLGVAILAGGALQPDDVASGVQHHVNILGRRADADAREVLAAALSESRDHRATEPGAGPTPTRLAHIPRTTLTHCNVKPVSASPPSEEKNDSKPSSDSGRERVGEVGRGLGIRVWKSETAVERERRVRRKEIKGNAVAFASILERERRRTCSAEVREYK
Homology
BLAST of Sgr025423 vs. NCBI nr
Match: XP_011465823.1 (PREDICTED: glycine-rich cell wall structural protein 1-like [Fragaria vesca subsp. vesca])

HSP 1 Score: 114.4 bits (285), Expect = 2.3e-21
Identity = 67/144 (46.53%), Postives = 83/144 (57.64%), Query Frame = 0

Query: 154 TKHTNEQTLIIVATAKGDTFHFGPPEGEGFGYEL--PGGVRTVAVRNLVEACLPLVASRR 213
           T   +E+  ++VA A+GD   FG  EGEGFG E+   GGV                    
Sbjct: 13  TGGADEEAFVVVAVAEGDFGDFGAAEGEGFGGEVAEAGGV-------------------- 72

Query: 214 FEAATQRRPARDFGEGSGFDRVENRVGLGVAILAGGALQPDDVASGVQHHVNILGRRADA 273
            E A +R  AR+ GEG GF RVENRV LGVA+   GAL+PDD+ +GV+ HV +   RADA
Sbjct: 73  -EVAAERGGAREGGEGGGFSRVENRVCLGVAVFTRGALEPDDIGAGVEDHVKVFWGRADA 132

Query: 274 DAREVLAAALSESRDHRATEPGAG 296
           DA EVL+AAL E+ D    EPG G
Sbjct: 133 DAGEVLSAALGEAGDDGGAEPGGG 135

BLAST of Sgr025423 vs. NCBI nr
Match: XP_011463556.1 (PREDICTED: spidroin-1-like [Fragaria vesca subsp. vesca])

HSP 1 Score: 105.5 bits (262), Expect = 1.1e-18
Identity = 63/142 (44.37%), Postives = 78/142 (54.93%), Query Frame = 0

Query: 154 TKHTNEQTLIIVATAKGDTFHFGPPEGEGFGYELPGGVRTVAVRNLVEACLPLVASRRFE 213
           T   +E+  ++VA A+GD   FG  E         GGV+                     
Sbjct: 13  TDGADEEAFVVVAAAEGDFGDFGAAEA--------GGVK--------------------- 72

Query: 214 AATQRRPARDFGEGSGFDRVENRVGLGVAILAGGALQPDDVASGVQHHVNILGRRADADA 273
            AT+R  AR+ GEG GF RV+NRVGLGVA    GAL+PDD+ +GV+ HV + G RADADA
Sbjct: 73  VATERGGAREGGEGGGFSRVKNRVGLGVAAFTRGALEPDDIGAGVEDHVEVFGGRADADA 125

Query: 274 REVLAAALSESRDHRATEPGAG 296
            EVLAAAL E+ D    EPG G
Sbjct: 133 GEVLAAALGEAGDDGGAEPGGG 125

BLAST of Sgr025423 vs. NCBI nr
Match: KYP71940.1 (hypothetical protein KK1_011220, partial [Cajanus cajan])

HSP 1 Score: 78.2 bits (191), Expect = 1.8e-10
Identity = 46/76 (60.53%), Postives = 52/76 (68.42%), Query Frame = 0

Query: 220 PARDFGEGSGFDRVENRVGLGVAILAGGALQPDDVASGVQHHVNILGRRADADAREVLAA 279
           PA D GEGSGF  VEN VGLGVA  A GAL+PD VAS VQ HV++    + A AREVLAA
Sbjct: 5   PAGDSGEGSGFGGVENGVGLGVAFAARGALEPDYVASRVQDHVHVSSGCSYAYAREVLAA 64

Query: 280 ALSESRDHRATEPGAG 296
           AL ++R   A E G G
Sbjct: 65  ALGQARYDGAAEAGPG 80

BLAST of Sgr025423 vs. NCBI nr
Match: EFJ04226.1 (hypothetical protein SELMODRAFT_432611, partial [Selaginella moellendorffii])

HSP 1 Score: 77.8 bits (190), Expect = 2.3e-10
Identity = 49/113 (43.36%), Postives = 68/113 (60.18%), Query Frame = 0

Query: 179 EGEGFGYELPGGVRTVAVRNLVEACLPLVASRRFEAATQRRPARDFGEGSGFDRVENRVG 238
           EG+  G EL      +AVR+LV+A +P  A+ + E   QR PA D  E  G  R+E RV 
Sbjct: 11  EGKSLGGELTDSEGAIAVRDLVQALVPPAAADQ-ELLAQRAPAGDGLESRGEGRIEQRVR 70

Query: 239 LGVAILAGGALQPDDVASGVQHHVNILGRRADADAREVLAAALSESRDHRATE 292
           L VA LA  AL+P+++ASGV +HV+   R +D++  EVLA A+ E+   R  E
Sbjct: 71  LAVAALARQALEPENIASGVGNHVDGSRRCSDSERDEVLATAVGEAGSQRRLE 122

BLAST of Sgr025423 vs. NCBI nr
Match: RDX95927.1 (hypothetical protein CR513_21477, partial [Mucuna pruriens])

HSP 1 Score: 68.9 bits (167), Expect = 1.1e-07
Identity = 48/99 (48.48%), Postives = 59/99 (59.60%), Query Frame = 0

Query: 193 TVAVRNLVEACLPLVASRRFEAATQRRPARDFGEGSGFDRVENRVGLGVAILAGGALQPD 252
           TV+V N   A   + A    +  T RR A D   G G +RVENRVGL VA  A  AL P 
Sbjct: 13  TVSVANSPTA-RSVAALGEVQGFTLRRVAVDRRIGRGQNRVENRVGLLVAFGAWRALDPT 72

Query: 253 DVASGVQHHVNILGRRADADAREVLAAALSESRDHRATE 292
           DVAS V+HHV +    +D+D+ EVLAA+L+ SRD  A E
Sbjct: 73  DVASCVEHHVGLTRWLSDSDSEEVLAASLTRSRDDWALE 110

BLAST of Sgr025423 vs. ExPASy TrEMBL
Match: A0A151TY14 (Uncharacterized protein (Fragment) OS=Cajanus cajan OX=3821 GN=KK1_011220 PE=4 SV=1)

HSP 1 Score: 78.2 bits (191), Expect = 8.7e-11
Identity = 46/76 (60.53%), Postives = 52/76 (68.42%), Query Frame = 0

Query: 220 PARDFGEGSGFDRVENRVGLGVAILAGGALQPDDVASGVQHHVNILGRRADADAREVLAA 279
           PA D GEGSGF  VEN VGLGVA  A GAL+PD VAS VQ HV++    + A AREVLAA
Sbjct: 5   PAGDSGEGSGFGGVENGVGLGVAFAARGALEPDYVASRVQDHVHVSSGCSYAYAREVLAA 64

Query: 280 ALSESRDHRATEPGAG 296
           AL ++R   A E G G
Sbjct: 65  ALGQARYDGAAEAGPG 80

BLAST of Sgr025423 vs. ExPASy TrEMBL
Match: D8TGJ2 (Uncharacterized protein (Fragment) OS=Selaginella moellendorffii OX=88036 GN=SELMODRAFT_432611 PE=4 SV=1)

HSP 1 Score: 77.8 bits (190), Expect = 1.1e-10
Identity = 49/113 (43.36%), Postives = 68/113 (60.18%), Query Frame = 0

Query: 179 EGEGFGYELPGGVRTVAVRNLVEACLPLVASRRFEAATQRRPARDFGEGSGFDRVENRVG 238
           EG+  G EL      +AVR+LV+A +P  A+ + E   QR PA D  E  G  R+E RV 
Sbjct: 11  EGKSLGGELTDSEGAIAVRDLVQALVPPAAADQ-ELLAQRAPAGDGLESRGEGRIEQRVR 70

Query: 239 LGVAILAGGALQPDDVASGVQHHVNILGRRADADAREVLAAALSESRDHRATE 292
           L VA LA  AL+P+++ASGV +HV+   R +D++  EVLA A+ E+   R  E
Sbjct: 71  LAVAALARQALEPENIASGVGNHVDGSRRCSDSERDEVLATAVGEAGSQRRLE 122

BLAST of Sgr025423 vs. ExPASy TrEMBL
Match: A0A371GZF0 (Uncharacterized protein (Fragment) OS=Mucuna pruriens OX=157652 GN=CR513_21477 PE=4 SV=1)

HSP 1 Score: 68.9 bits (167), Expect = 5.3e-08
Identity = 48/99 (48.48%), Postives = 59/99 (59.60%), Query Frame = 0

Query: 193 TVAVRNLVEACLPLVASRRFEAATQRRPARDFGEGSGFDRVENRVGLGVAILAGGALQPD 252
           TV+V N   A   + A    +  T RR A D   G G +RVENRVGL VA  A  AL P 
Sbjct: 13  TVSVANSPTA-RSVAALGEVQGFTLRRVAVDRRIGRGQNRVENRVGLLVAFGAWRALDPT 72

Query: 253 DVASGVQHHVNILGRRADADAREVLAAALSESRDHRATE 292
           DVAS V+HHV +    +D+D+ EVLAA+L+ SRD  A E
Sbjct: 73  DVASCVEHHVGLTRWLSDSDSEEVLAASLTRSRDDWALE 110

BLAST of Sgr025423 vs. ExPASy TrEMBL
Match: A0A498IZU6 (Uncharacterized protein OS=Malus domestica OX=3750 GN=DVH24_034524 PE=4 SV=1)

HSP 1 Score: 68.6 bits (166), Expect = 6.9e-08
Identity = 38/62 (61.29%), Postives = 45/62 (72.58%), Query Frame = 0

Query: 227 GSGFDRVENRVGLGVAILAGGALQPDDVASGVQHHVNILGRRADADAREVLAAALSESRD 286
           G   DRVENRV L VAI AG AL+P DVA+GV+HHV  L R  +AD+ EVL AAL+ +RD
Sbjct: 4   GGRSDRVENRVRLLVAIGAGAALEPVDVAAGVEHHVERLRRSPEADSGEVLVAALANARD 63

Query: 287 HR 289
            R
Sbjct: 64  DR 65

BLAST of Sgr025423 vs. ExPASy TrEMBL
Match: J3MYK1 (Uncharacterized protein OS=Oryza brachyantha OX=4533 PE=4 SV=1)

HSP 1 Score: 59.7 bits (143), Expect = 3.2e-05
Identity = 48/127 (37.80%), Postives = 66/127 (51.97%), Query Frame = 0

Query: 159 EQTLIIVATAKGDTFHF-GPPEGEGFGYELPGGVRTVAVRNLVEACLPLVASRRFEAATQ 218
           EQ  + VA A+ +     G  + E   +E+  G+R VAVR+LV A L   A  R E    
Sbjct: 268 EQPGVGVAAAEDEARELGGRRQRERVTHEVAVGLRLVAVRDLVPA-LVRRAGPRVELLAL 327

Query: 219 RRPARDFGEGSGFDRVENRVGLGVAILAGGALQPDDVASGVQHHVNILGRRADADAREVL 278
            R A   G      RVE RV L VA+ A  AL P  VA+GV+HH  +L RR++    +V+
Sbjct: 328 LRAAGHGGVRRRLRRVEQRVVLLVAVAARRALHPRQVAAGVEHHRVLLRRRSEPHGHDVV 387

Query: 279 AAALSES 285
           A A  ++
Sbjct: 388 AGAKGDA 393

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_011465823.12.3e-2146.53PREDICTED: glycine-rich cell wall structural protein 1-like [Fragaria vesca subs... [more]
XP_011463556.11.1e-1844.37PREDICTED: spidroin-1-like [Fragaria vesca subsp. vesca][more]
KYP71940.11.8e-1060.53hypothetical protein KK1_011220, partial [Cajanus cajan][more]
EFJ04226.12.3e-1043.36hypothetical protein SELMODRAFT_432611, partial [Selaginella moellendorffii][more]
RDX95927.11.1e-0748.48hypothetical protein CR513_21477, partial [Mucuna pruriens][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A151TY148.7e-1160.53Uncharacterized protein (Fragment) OS=Cajanus cajan OX=3821 GN=KK1_011220 PE=4 S... [more]
D8TGJ21.1e-1043.36Uncharacterized protein (Fragment) OS=Selaginella moellendorffii OX=88036 GN=SEL... [more]
A0A371GZF05.3e-0848.48Uncharacterized protein (Fragment) OS=Mucuna pruriens OX=157652 GN=CR513_21477 P... [more]
A0A498IZU66.9e-0861.29Uncharacterized protein OS=Malus domestica OX=3750 GN=DVH24_034524 PE=4 SV=1[more]
J3MYK13.2e-0537.80Uncharacterized protein OS=Oryza brachyantha OX=4533 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 327..344
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 137..156
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..28
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 135..156
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 315..344
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 35..54
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 281..302

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sgr025423.1Sgr025423.1mRNA