Sgr020029 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr020029
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Reverse transcriptase domain-containing protein
Location: tig00153446: 1169940 .. 1170901 (+)
RNA-Seq Expression: Sgr020029
Synteny: Sgr020029
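
The Location field gives a contig name, start and end coordinates, and a strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state), the length of the genomic span can be computed as below. `parse_location` and `span_length` are hypothetical helper names for illustration, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'contig: start .. end (strand)' into (contig, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    contig, start, end, strand = m.groups()
    return contig, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

contig, start, end, strand = parse_location("tig00153446: 1169940 .. 1170901 (+)")
print(contig, strand, span_length(start, end))  # tig00153446 + 962
```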
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGCCCGAAAATTAAGTCAAGCCTAGTAGAAGAGGATGACAGGAAGAAGGAAGGGACATGGAAGCGTGTGAGTCGGAATATAAATCAAAATGATGAGCACGAGTTCATGCTCCATGAAGAAGCAAAGGGTATCATCTGCAATTTGGCAGGGACTAAACATCAGATACAACACTTAGAAGTACAAGTAACAAAAAAACAAGCAGGAACAAGAAAATGCACTGATGGTTCAGAGGATAAGTCGGGGCTGATGAATATTTCAATGGTGGAGGCTGCTCGGCAGCCCCGCCGCCAACAATGTAAATCTTAAGTTGGAATGTTCGAGGGTTGGGGAACCCTCGAGCATTCCGGGCGCTATGCCAAGTAGTGCATAGTAATAAACTCGACCTAGTGTTCTTAATTGAGACAAAGTACAAAAAGAAACTAGGAGACCAAATGCAGTCAAGTCTTCGTTATTATTGTTGTTTCACTATTCCTAGCAGGGGGAATAGTGGTGGTTTAATGTTGTTGTGGTTTAATTTTGTTATGGTTTAAAGAATTGAATGTTAACATTATGTCTTATTCGATAGGTCATATCGACACAATTATTAAAGATGGCAATGGAAGCTGGCAATTTACAGGTTTTTATGGTAATCCAGCTACGGAGCTTAGAAGCGAATCATGGGCTCTTTTGGAGAGGTTACATGCAATATTCGGTCTTCCTTGATTGATTGGGGGGGATTTCAATGAGATCACAATAGATTCAGAGAAGAGTGGTGGATCAAATAGAAATCAAAATCAAATGAAATCCTTCAGAGATGTGATAGATAATTGTAACCTCATGGACCCGAGTTATAGAGGAGATGATTTCACTTGGACCAACAGACAATTTACAGGTTACCTTGTCTGGGAAAGACTTGATAGATTTTTAATGAATTTCGATATGTTGTGCAGGTGTGGTCATATTATCGTGGAGCACTTGAG

mRNA sequence

ATGGGCCCGAAAATTAAGTCAAGCCTAGTAGAAGAGGATGACAGGAAGAAGGAAGGGACATGGAAGCGTGTGAGTCGGAATATAAATCAAAATGATGAGCACGAGTTCATGCTCCATGAAGAAGCAAAGGGTATCATCTGCAATTTGGCAGGGACTAAACATCAGATACAACACTTAGAAGTACAAGTAACAAAAAAACAAGCAGGAACAAGAAAATGCACTGATGGTCATATCGACACAATTATTAAAGATGGCAATGGAAGCTGGCAATTTACAGGTTTTTATGGTAATCCAGCTACGGAGCTTAGAAGCGAATCATGGGCTCTTTTGGAGAGGTTACATGCAATATTCGATTCAGAGAAGAGTGGTGGATCAAATAGAAATCAAAATCAAATGAAATCCTTCAGAGATGTGATAGATAATTGTAACCTCATGGACCCGAGTTATAGAGGAGATGATTTCACTTGGACCAACAGACAATTTACAGGTTACCTTGTCTGGGAAAGACTTGATAGATTTTTAATGAATTTCGATATGTTGTGCAGGTGTGGTCATATTATCGTGGAGCACTTGAG

Coding sequence (CDS)

ATGGGCCCGAAAATTAAGTCAAGCCTAGTAGAAGAGGATGACAGGAAGAAGGAAGGGACATGGAAGCGTGTGAGTCGGAATATAAATCAAAATGATGAGCACGAGTTCATGCTCCATGAAGAAGCAAAGGGTATCATCTGCAATTTGGCAGGGACTAAACATCAGATACAACACTTAGAAGTACAAGTAACAAAAAAACAAGCAGGAACAAGAAAATGCACTGATGGTCATATCGACACAATTATTAAAGATGGCAATGGAAGCTGGCAATTTACAGGTTTTTATGGTAATCCAGCTACGGAGCTTAGAAGCGAATCATGGGCTCTTTTGGAGAGGTTACATGCAATATTCGATTCAGAGAAGAGTGGTGGATCAAATAGAAATCAAAATCAAATGAAATCCTTCAGAGATGTGATAGATAATTGTAACCTCATGGACCCGAGTTATAGAGGAGATGATTTCACTTGGACCAACAGACAATTTACAGGTTACCTTGTCTGGGAAAGACTTGATAGATTTTTAATGAATTTCGATATGTTGTGCAGGTGTGGTCATATTATCGTGGAGCACTTGAG

Protein sequence

MGPKIKSSLVEEDDRKKEGTWKRVSRNINQNDEHEFMLHEEAKGIICNLAGTKHQIQHLEVQVTKKQAGTRKCTDGHIDTIIKDGNGSWQFTGFYGNPATELRSESWALLERLHAIFDSEKSGGSNRNQNQMKSFRDVIDNCNLMDPSYRGDDFTWTNRQFTGYLVWERLDRFLMNFDMLCRCGHIIVEHLX
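
The protein sequence corresponds to a codon-by-codon translation of the CDS. The sketch below assumes the standard genetic code and assumes that an incomplete or unrecognized final codon is written as `X`, which would explain the trailing `X` in the protein above; the annotation pipeline's actual behavior is not documented on this page. `translate` is a hypothetical helper written for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence; unknown or partial codons become 'X'."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        protein.append(CODON_TABLE.get(codon, "X"))
    return "".join(protein)

# First six codons of the CDS above.
print(translate("ATGGGCCCGAAAATTAAG"))  # MGPKIK
# Last eleven bases of the CDS above: three codons plus two leftover bases.
print(translate("GAGCACTTGAG"))  # EHLX
```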
Homology
BLAST of Sgr020029 vs. NCBI nr
Match: XP_023912780.1 (uncharacterized protein LOC112024376, partial [Quercus suber])

HSP 1 Score: 114.4 bits (285), Expect = 1.1e-21
Identity = 58/131 (44.27%), Positives = 76/131 (58.02%), Query Frame = 0

Query: 77  HIDTIIKDG-NGSWQFTGFYGNPATELRSESWALLERLHAIFD---------------SE 136
           HID II+ G   +W+FTGFYG P T  R ESW LL++LH+ F+               +E
Sbjct: 87  HIDCIIRGGTEEAWRFTGFYGEPVTHKRHESWELLQQLHSQFNLPWLCAGDFNEIVKGAE 146

Query: 137 KSGGSNRNQNQMKSFRDVIDNCNLMDPSYRGDDFTWTNRQFTGYLVWERLDRFLMNFDML 192
           K GGSNR+ +QM+ FRD +D C  +D  ++G+ FTW      G  +WERLDR L N + L
Sbjct: 147 KQGGSNRSHSQMQLFRDTMDRCGFIDLGFKGNPFTWKKYYRDGQTLWERLDRGLANNEWL 206

BLAST of Sgr020029 vs. NCBI nr
Match: XP_023911264.1 (uncharacterized protein LOC112022870, partial [Quercus suber])

HSP 1 Score: 113.2 bits (282), Expect = 2.4e-21
Identity = 58/131 (44.27%), Positives = 75/131 (57.25%), Query Frame = 0

Query: 77  HIDTIIKDG-NGSWQFTGFYGNPATELRSESWALLERLHAIFD---------------SE 136
           HID II+ G   +W+FTGFYG P T  R ESW LL++LH+ F+                E
Sbjct: 87  HIDCIIRGGTEEAWRFTGFYGEPVTHKRHESWELLQQLHSQFNLPWLCAGDFNEIVKGVE 146

Query: 137 KSGGSNRNQNQMKSFRDVIDNCNLMDPSYRGDDFTWTNRQFTGYLVWERLDRFLMNFDML 192
           K GGSNR+ +QM+ FRD +D C  +D  ++G+ FTW      G  +WERLDR L N + L
Sbjct: 147 KQGGSNRSHSQMQLFRDTMDRCGFIDLGFKGNPFTWKKYYRDGQTLWERLDRGLANNEWL 206

BLAST of Sgr020029 vs. NCBI nr
Match: XP_030969978.1 (uncharacterized protein LOC115990270 [Quercus lobata])

HSP 1 Score: 110.5 bits (275), Expect = 1.6e-20
Identity = 59/131 (45.04%), Positives = 73/131 (55.73%), Query Frame = 0

Query: 77  HIDTIIKDGNG-SWQFTGFYGNPATELRSESWALLERLHAIFD---------------SE 136
           HID+II  GN  +W+FTGFYG PAT +R E+W  L  L+   +               SE
Sbjct: 88  HIDSIINKGNDEAWRFTGFYGEPATHMRIEAWNKLRLLNTKHNLPWLCAGDFNEITRHSE 147

Query: 137 KSGGSNRNQNQMKSFRDVIDNCNLMDPSYRGDDFTWTNRQFTGYLVWERLDRFLMNFDML 192
           K GG+NR+Q QM+ FRDVID C  +D  Y GD FTW      G+ +WERLDR L N D  
Sbjct: 148 KLGGNNRSQAQMQLFRDVIDECGFLDLGYVGDQFTWRKHFADGHSLWERLDRGLANHDWF 207

BLAST of Sgr020029 vs. NCBI nr
Match: XP_023886153.1 (uncharacterized protein LOC111998282 [Quercus suber])

HSP 1 Score: 108.6 bits (270), Expect = 6.0e-20
Identity = 58/131 (44.27%), Positives = 70/131 (53.44%), Query Frame = 0

Query: 77  HIDTIIKDG-NGSWQFTGFYGNPATELRSESWALLERLHAIF---------------DSE 136
           HID ++  G  G+W+FTGFYG P T  R ESW LL  L++                  SE
Sbjct: 511 HIDAVVGKGKEGAWRFTGFYGEPVTHKRLESWNLLRELNSRMTLPWICMGDFNEITRQSE 570

Query: 137 KSGGSNRNQNQMKSFRDVIDNCNLMDPSYRGDDFTWTNRQFTGYLVWERLDRFLMNFDML 192
           K GGS R+ +QM+ FRD ID C  MD  + G  FTW      G+ VWERLDR L N + L
Sbjct: 571 KLGGSVRSHSQMQLFRDAIDECGFMDLGFTGSQFTWKKHFNDGHSVWERLDRGLANSEWL 630

BLAST of Sgr020029 vs. NCBI nr
Match: XP_030922974.1 (uncharacterized protein LOC115949843 [Quercus lobata])

HSP 1 Score: 106.7 bits (265), Expect = 2.3e-19
Identity = 57/131 (43.51%), Positives = 71/131 (54.20%), Query Frame = 0

Query: 77  HIDTIIKDGNG-SWQFTGFYGNPATELRSESWALLERLHAIFD---------------SE 136
           HID+II  GN  +W+F GFYG PAT  R E W+ L  L+   +               SE
Sbjct: 88  HIDSIINKGNDEAWRFIGFYGEPATHKRIEVWSKLRLLNTKHNLPWLCAGDFNEITRHSE 147

Query: 137 KSGGSNRNQNQMKSFRDVIDNCNLMDPSYRGDDFTWTNRQFTGYLVWERLDRFLMNFDML 192
           K GG+NR+Q QM+ FRDVID C  +D  Y GD FTW      G+ +WERLDR L N D  
Sbjct: 148 KLGGNNRSQAQMQLFRDVIDECGFLDLEYVGDQFTWRKHFVDGHSLWERLDRGLANHDWF 207

BLAST of Sgr020029 vs. ExPASy TrEMBL
Match: A0A2N9IIR5 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS51703 PE=4 SV=1)

HSP 1 Score: 104.8 bits (260), Expect = 4.2e-19
Identity = 54/131 (41.22%), Positives = 69/131 (52.67%), Query Frame = 0

Query: 77  HIDTIIKDGNG-SWQFTGFYGNPATELRSESWALLERLHAIFD---------------SE 136
           HID+II +G   +W+FTGFYG P T  R  SW +L  L   F                SE
Sbjct: 502 HIDSIINEGTADAWRFTGFYGAPETHNRHHSWDMLRTLSRQFSLPWCCAGDFNELVSLSE 561

Query: 137 KSGGSNRNQNQMKSFRDVIDNCNLMDPSYRGDDFTWTNRQFTGYLVWERLDRFLMNFDML 192
           K GG  R   QM++FRDV+D C   D  + G +FTW N +  G  VWERLDR ++N + L
Sbjct: 562 KRGGRPRPDAQMQAFRDVLDECGFQDLGFHGPEFTWCNNRINGATVWERLDRMVVNSEWL 621

BLAST of Sgr020029 vs. ExPASy TrEMBL
Match: A0A2N9I6L8 (Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS49538 PE=4 SV=1)

HSP 1 Score: 104.4 bits (259), Expect = 5.5e-19
Identity = 54/131 (41.22%), Positives = 71/131 (54.20%), Query Frame = 0

Query: 77  HIDTIIKDGNG-SWQFTGFYGNPATELRSESWALLERLHAIFD---------------SE 136
           HIDTII +G   +W+FTGFYG P T+ R+ SW +L  LH  F                 E
Sbjct: 268 HIDTIINEGTELAWRFTGFYGAPETQNRAHSWNVLRTLHQQFSLPWCCAGDFNELVSGEE 327

Query: 137 KSGGSNRNQNQMKSFRDVIDNCNLMDPSYRGDDFTWTNRQFTGYLVWERLDRFLMNFDML 192
           K GG  R   QM++FR V+D+C   D  + G +FTW N +  G  VW RLDRF++N + L
Sbjct: 328 KKGGRPRPDAQMQAFRSVLDDCGFQDLGFNGPEFTWCNNRQYGATVWARLDRFVVNTEWL 387

BLAST of Sgr020029 vs. ExPASy TrEMBL
Match: A0A2N9HKV4 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS40213 PE=4 SV=1)

HSP 1 Score: 104.0 bits (258), Expect = 7.2e-19
Identity = 53/131 (40.46%), Positives = 71/131 (54.20%), Query Frame = 0

Query: 77  HIDTIIKDGNG-SWQFTGFYGNPATELRSESWALLERLHAIFD---------------SE 136
           HIDTII +G   +W+FTGFYG P T+ R+ SW +L  LH  F                 E
Sbjct: 428 HIDTIINEGTELAWRFTGFYGAPETQNRAHSWNVLRTLHQQFSLPWCCAGDFNELVSGEE 487

Query: 137 KSGGSNRNQNQMKSFRDVIDNCNLMDPSYRGDDFTWTNRQFTGYLVWERLDRFLMNFDML 192
           K GG  R   QM++FR V+D+C   D  + G +FTW N +  G  +W RLDRF++N + L
Sbjct: 488 KKGGRPRPDAQMQAFRSVLDDCGFQDLGFNGPEFTWCNNRQYGATIWARLDRFVVNTEWL 547

BLAST of Sgr020029 vs. ExPASy TrEMBL
Match: A0A5C7H9Y2 (CCHC-type domain-containing protein OS=Acer yangbiense OX=1000413 GN=EZV62_019269 PE=4 SV=1)

HSP 1 Score: 103.6 bits (257), Expect = 9.4e-19
Identity = 52/122 (42.62%), Positives = 68/122 (55.74%), Query Frame = 0

Query: 67  QAGTRKCTDGHIDTIIKDGNG-SWQFTGFYGNPATELRSESWALLERLHAIFD------- 126
           +   R  T GHID +IKD +   W+FTGFYG P    R  SW+LL RL  + +       
Sbjct: 447 EVSIRSFTKGHIDAVIKDSDSLVWRFTGFYGEPIPSFRMHSWSLLRRLGRMSNLPWIVVG 506

Query: 127 --------SEKSGGSNRNQNQMKSFRDVIDNCNLMDPSYRGDDFTWTNRQFTGYLVWERL 173
                    EK GG  R+   M SFR+ +D+C LMD  Y G+ +TW+NRQF G L+ ER+
Sbjct: 507 DFNEILQLDEKKGGVIRSNTTMSSFREAVDDCALMDMGYVGNKYTWSNRQFKGELIQERI 566

BLAST of Sgr020029 vs. ExPASy TrEMBL
Match: A0A2N9FVV5 (Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS22618 PE=4 SV=1)

HSP 1 Score: 102.1 bits (253), Expect = 2.7e-18
Identity = 50/131 (38.17%), Positives = 71/131 (54.20%), Query Frame = 0

Query: 77  HIDTIIKDGNGS-WQFTGFYGNPATELRSESWALLERLHA---------------IFDSE 136
           HIDT+I +G    W+F GFYG P T+ R  SW +L  LH+               +   E
Sbjct: 323 HIDTLINEGTDEIWRFMGFYGAPETQHRMNSWNILRLLHSQSSLPWCCAGDFNELVSLDE 382

Query: 137 KSGGSNRNQNQMKSFRDVIDNCNLMDPSYRGDDFTWTNRQFTGYLVWERLDRFLMNFDML 192
           K GG  R ++QM++FRDV+D+C   D  + G  FTW N +  G  VWE+LDR ++N + L
Sbjct: 383 KRGGRPRTESQMQAFRDVLDDCGFQDLGFHGPKFTWCNNRLNGVTVWEQLDRVVVNSEWL 442
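
The percentages in each HSP header are the matched counts divided by the alignment length, which includes gap columns (for the first hit, 116 query residues plus 15 gap positions gives 131). A minimal sketch that recomputes them from a header line follows; `hsp_percentages` is a hypothetical helper, and the regular expression assumes the `Label = matched/length` layout shown in this record.

```python
import re

def hsp_percentages(header):
    """Recompute 'Label = matched/length' ratios in an HSP header as percentages."""
    result = {}
    for label, matched, length in re.findall(r"(\w+) = (\d+)/(\d+)", header):
        result[label] = round(100.0 * int(matched) / int(length), 2)
    return result

# Counts taken from the first NCBI nr hit (XP_023912780.1).
print(hsp_percentages("Identity = 58/131 (44.27%), Positives = 76/131 (58.02%)"))
# {'Identity': 44.27, 'Positives': 58.02}
```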

The following BLAST results are available for this feature:

BLAST of Sgr020029 vs. NCBI nr
Match Name      E-value  Identity (%)  Description
XP_023912780.1  1.1e-21  44.27         uncharacterized protein LOC112024376, partial [Quercus suber]
XP_023911264.1  2.4e-21  44.27         uncharacterized protein LOC112022870, partial [Quercus suber]
XP_030969978.1  1.6e-20  45.04         uncharacterized protein LOC115990270 [Quercus lobata]
XP_023886153.1  6.0e-20  44.27         uncharacterized protein LOC111998282 [Quercus suber]
XP_030922974.1  2.3e-19  43.51         uncharacterized protein LOC115949843 [Quercus lobata]

BLAST of Sgr020029 vs. ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A2N9IIR5      4.2e-19  41.22         Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS51703 PE=4 SV=1
A0A2N9I6L8      5.5e-19  41.22         Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=F...
A0A2N9HKV4      7.2e-19  40.46         Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS40213 PE=4 SV=1
A0A5C7H9Y2      9.4e-19  42.62         CCHC-type domain-containing protein OS=Acer yangbiense OX=1000413 GN=EZV62_01926...
A0A2N9FVV5      2.7e-18  38.17         Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=F...

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                                   Source       Source Term  Source Description   Alignment
None       No IPR available                                  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 1..20
IPR036691  Endonuclease/exonuclease/phosphatase superfamily  SUPERFAMILY  56219        DNase I-like         coord: 71..185

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr020029.1   Sgr020029.1  mRNA