Sgr018364 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr018364
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Retrotran_gag_3 domain-containing protein
Location: tig00153197: 535207 .. 536252 (-)
RNA-Seq Expression: Sgr018364
Synteny: Sgr018364
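
The Location field packs the contig, start, end, and strand into a single string. The sketch below shows one way to split it apart and compute the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state, and `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'contig: start .. end (strand)' into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "contig": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("tig00153197: 535207 .. 536252 (-)")
print(loc["contig"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 1046 bases on the minus strand of contig tig00153197.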
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCCTACCTTTGGCCACTTGAAAGTAATGCAACATTCGGCCTCCGCCATCCAGAGGCACATGTCCTCGACTCTTCTAAGATTGGCACTCACCCTTGGCAAGTCTTACAACCGAAAGCAACGCCCTATGTTGACCCTCTGTTCATCCTGTTGCCATATATAGCAAACATTTATTTTGTGCAACCTATTTATTTTGTATTCATATTCTCCTTCCATAAGTAGATTGTATAGTTATAGGTGGAGATGCATTAGCCTCAAATCTCTGTAAATCAGCCTTTAATTACTTTATATTCAGCTATATATTCAAGGAAATCAGAATGAACAGGCATTGAGCCATTCAATCAAAATTCTCTCACAACAAAGGTTATCATGGTATCAGAGCTCTGGTCTTAGAGTTTTGAAATCTTCGGTCATTCTTGGGTGTTCTTGGAACTTCAGGTTGCTATAGATTGTAAGTTTCATCTGTGTGATTCGTTTTTTGGAAGCTCACGTATTTTGATAATTTTGTTCTAGTCATATTTTGGGTTGCTTATATTGTTTTCTGCTGAAAATGGTTGGTAACCATTCTGAATTTTTGTGTTAAATTTACGGGAAAAAATTATTCGGCATGGGAATTTCAATTCTGTTTATATGTTATTGGAAAAGAGTTATGGGAACATATTGATGGTACTACTCCAGCACCAACAGATGCTACTCAGTTGGCTCAATGGAAGATCAAAGATGCTAGGGTGATGTCTTGGATTACTGGGTCATGTGATCCTCAAATTGTTCTTAATTTACGTTCCTATAGCAGCGCTCAAGCCATGTGGAACTATTTAAAAAAGATTTATGCTCAAACAAATTCAGCCAGGAGATTTCAATTGGAGTGTGAAATTTCAAATTATACACAGGGGAGTCTCTCTATTCAGGATTACTATTCTGGTTTTCAAAATTTATGGGCTGAATTTTCTGATATAGTGTGTGCTACAGTGTCTAAAGAATCTCTTACTGATGTTTTGGCTATTCATGAGATTAACAAGCGTGATCAGTTCTTGATGAAGCTATGA

mRNA sequence

ATGCCCTACCTTTGGCCACTTGAAAGTAATGCAACATTCGGCCTCCGCCATCCAGAGGCACATGTCCTCGACTCTTCTAAGATTGGCACTCACCCTTGGCAATTATGGGAACATATTGATGGTACTACTCCAGCACCAACAGATGCTACTCAGTTGGCTCAATGGAAGATCAAAGATGCTAGGGTGATGTCTTGGATTACTGGGTCATGTGATCCTCAAATTGTTCTTAATTTACGTTCCTATAGCAGCGCTCAAGCCATGTGGAACTATTTAAAAAAGATTTATGCTCAAACAAATTCAGCCAGGAGATTTCAATTGGAGTGTGAAATTTCAAATTATACACAGGGGAGTCTCTCTATTCAGGATTACTATTCTGGTTTTCAAAATTTATGGGCTGAATTTTCTGATATAGTGTGTGCTACAGTGTCTAAAGAATCTCTTACTGATGTTTTGGCTATTCATGAGATTAACAAGCGTGATCAGTTCTTGATGAAGCTATGA

Coding sequence (CDS)

ATGCCCTACCTTTGGCCACTTGAAAGTAATGCAACATTCGGCCTCCGCCATCCAGAGGCACATGTCCTCGACTCTTCTAAGATTGGCACTCACCCTTGGCAATTATGGGAACATATTGATGGTACTACTCCAGCACCAACAGATGCTACTCAGTTGGCTCAATGGAAGATCAAAGATGCTAGGGTGATGTCTTGGATTACTGGGTCATGTGATCCTCAAATTGTTCTTAATTTACGTTCCTATAGCAGCGCTCAAGCCATGTGGAACTATTTAAAAAAGATTTATGCTCAAACAAATTCAGCCAGGAGATTTCAATTGGAGTGTGAAATTTCAAATTATACACAGGGGAGTCTCTCTATTCAGGATTACTATTCTGGTTTTCAAAATTTATGGGCTGAATTTTCTGATATAGTGTGTGCTACAGTGTCTAAAGAATCTCTTACTGATGTTTTGGCTATTCATGAGATTAACAAGCGTGATCAGTTCTTGATGAAGCTATGA

Protein sequence

MPYLWPLESNATFGLRHPEAHVLDSSKIGTHPWQLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKKIYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAEFSDIVCATVSKESLTDVLAIHEINKRDQFLMKL
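
The protein sequence corresponds to the CDS read codon by codon. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1), the first and last codons of the CDS above can be translated as follows. The `translate` helper and the variable names are illustrative only.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGCCCTACCTTTGGCCACTTGAAAGTAAT"  # first 30 nt of the CDS
cds_end = "AAGCTATGA"                          # last 9 nt of the CDS

print(translate(cds_start))  # MPYLWPLESN
print(translate(cds_end))    # KL
```

The two results match the first ten and last two residues of the protein sequence listed above.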
Homology
BLAST of Sgr018364 vs. NCBI nr
Match: DAD32765.1 (TPA_asm: hypothetical protein HUJ06_011616 [Nelumbo nucifera])

HSP 1 Score: 239.6 bits (610), Expect = 2.0e-59
Identity = 113/133 (84.96%), Positives = 119/133 (89.47%), Query Frame = 0

Query: 34  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKK 93
           +LW HI GTTP P DATQL QWKIKDARVMSWITGSCD QIVLNLR Y SAQ MW YLKK
Sbjct: 32  ELWGHIGGTTPVPADATQLTQWKIKDARVMSWITGSCDSQIVLNLRPYRSAQTMWEYLKK 91

Query: 94  IYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAEFSDIVCATVSKESLTDVLAI 153
           +Y QTNSARRFQLECEI+NYTQGSLSIQDYYSGFQNLWAEFSDIVCA VSK+SL DVL +
Sbjct: 92  LYNQTNSARRFQLECEIANYTQGSLSIQDYYSGFQNLWAEFSDIVCAAVSKDSLADVLVV 151

Query: 154 HEINKRDQFLMKL 167
           HEI+KRDQFLMKL
Sbjct: 152 HEISKRDQFLMKL 164

BLAST of Sgr018364 vs. NCBI nr
Match: DAD42694.1 (TPA_asm: hypothetical protein HUJ06_000924 [Nelumbo nucifera])

HSP 1 Score: 231.1 bits (588), Expect = 7.0e-57
Identity = 109/133 (81.95%), Positives = 119/133 (89.47%), Query Frame = 0

Query: 34  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKK 93
           +LW HIDGTTPAP DAT+LA+WKIKDARVM WITGSCD +IVLNLR Y SAQ MW YLKK
Sbjct: 337 ELWGHIDGTTPAPADATKLAKWKIKDARVMPWITGSCDSKIVLNLRPYRSAQTMWEYLKK 396

Query: 94  IYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAEFSDIVCATVSKESLTDVLAI 153
           +Y QTNSARRFQLECEI++YTQGSLSIQDYYS FQNLWAEFSDIVCA VSK SL DVL +
Sbjct: 397 VYNQTNSARRFQLECEIADYTQGSLSIQDYYSSFQNLWAEFSDIVCAAVSKVSLADVLVV 456

Query: 154 HEINKRDQFLMKL 167
           +EI+KRDQFLMKL
Sbjct: 457 YEISKRDQFLMKL 469

BLAST of Sgr018364 vs. NCBI nr
Match: XP_010266766.1 (PREDICTED: uncharacterized protein LOC104604200 [Nelumbo nucifera])

HSP 1 Score: 221.5 bits (563), Expect = 5.5e-54
Identity = 107/133 (80.45%), Positives = 116/133 (87.22%), Query Frame = 0

Query: 34  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKK 93
           +LW HIDGTTPAP DAT+LA+WKIKDARVMSWITGSCD +IVLNL  Y SAQ M  YLKK
Sbjct: 3   ELWGHIDGTTPAPADATKLAEWKIKDARVMSWITGSCDSKIVLNLCPYRSAQTMREYLKK 62

Query: 94  IYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAEFSDIVCATVSKESLTDVLAI 153
           +Y QTNSARRFQLECEI +YTQGSLSIQDYYS FQNLW EFSDIVCA VSK SL DVL +
Sbjct: 63  VYNQTNSARRFQLECEIVDYTQGSLSIQDYYSRFQNLWVEFSDIVCAAVSKVSLADVLVV 122

Query: 154 HEINKRDQFLMKL 167
           +EI+KRDQFLMKL
Sbjct: 123 YEISKRDQFLMKL 135

BLAST of Sgr018364 vs. NCBI nr
Match: CAD1825734.1 (unnamed protein product [Ananas comosus var. bracteatus])

HSP 1 Score: 211.8 bits (538), Expect = 4.4e-51
Identity = 97/133 (72.93%), Positives = 114/133 (85.71%), Query Frame = 0

Query: 34  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKK 93
           +LW HIDG  PAP D TQL+QWK+KDARVMSWI GSCD Q+VLNLR Y +A+ MW YLKK
Sbjct: 32  ELWGHIDGIVPAPIDVTQLSQWKVKDARVMSWIIGSCDSQLVLNLRPYKTAKDMWEYLKK 91

Query: 94  IYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAEFSDIVCATVSKESLTDVLAI 153
           +Y QT+SARRFQLECEI+NYTQ  LSIQDY+S FQ LWAEFSDIVCAT+SK+S  DVLA+
Sbjct: 92  VYNQTHSARRFQLECEITNYTQRGLSIQDYFSVFQKLWAEFSDIVCATMSKDSQKDVLAV 151

Query: 154 HEINKRDQFLMKL 167
           ++++K DQFLMKL
Sbjct: 152 YDVSKHDQFLMKL 164

BLAST of Sgr018364 vs. NCBI nr
Match: XP_006346877.1 (PREDICTED: uncharacterized protein LOC102591997 [Solanum tuberosum])

HSP 1 Score: 203.0 bits (515), Expect = 2.0e-48
Identity = 94/133 (70.68%), Positives = 114/133 (85.71%), Query Frame = 0

Query: 34  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKK 93
           +LW HIDG+ PAPTDAT+L +WKIKDARVM+WI GS DP IVLNLR Y +A+AMW+YL+K
Sbjct: 33  ELWGHIDGSDPAPTDATKLGEWKIKDARVMTWILGSIDPLIVLNLRPYKTAKAMWDYLQK 92

Query: 94  IYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAEFSDIVCATVSKESLTDVLAI 153
           +Y Q NSARRFQLE EI+NY+QG LS+QDY+SGFQNLWAEF+DIV A +  ESL+ + A+
Sbjct: 93  VYNQDNSARRFQLEYEIANYSQGGLSVQDYFSGFQNLWAEFTDIVYAKIPTESLSVIQAV 152

Query: 154 HEINKRDQFLMKL 167
           HE +KRDQFLMKL
Sbjct: 153 HEQSKRDQFLMKL 165

BLAST of Sgr018364 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 48.1 bits (113), Expect = 1.1e-04
Identity = 24/86 (27.91%), Positives = 42/86 (48.84%), Query Frame = 0

Query: 41  GTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKKIYAQTNS 100
           GT  AP       +WK +D  + S + G+    +   +   ++A  +W  L+KIYA  + 
Sbjct: 63  GTDAAPRVNPDYTRWKRQDKLIYSAVLGAISMSVQPAVSRATTAAQIWETLRKIYANPSY 122

Query: 101 ARRFQLECEISNYTQGSLSIQDYYSG 127
               QL  ++  +T+G+ +I DY  G
Sbjct: 123 GHVTQLRTQLKQWTKGTKTIDDYMQG 148

BLAST of Sgr018364 vs. ExPASy TrEMBL
Match: A0A1U8AL74 (uncharacterized protein LOC104604200 OS=Nelumbo nucifera OX=4432 GN=LOC104604200 PE=4 SV=1)

HSP 1 Score: 221.5 bits (563), Expect = 2.7e-54
Identity = 107/133 (80.45%), Positives = 116/133 (87.22%), Query Frame = 0

Query: 34  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKK 93
           +LW HIDGTTPAP DAT+LA+WKIKDARVMSWITGSCD +IVLNL  Y SAQ M  YLKK
Sbjct: 3   ELWGHIDGTTPAPADATKLAEWKIKDARVMSWITGSCDSKIVLNLCPYRSAQTMREYLKK 62

Query: 94  IYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAEFSDIVCATVSKESLTDVLAI 153
           +Y QTNSARRFQLECEI +YTQGSLSIQDYYS FQNLW EFSDIVCA VSK SL DVL +
Sbjct: 63  VYNQTNSARRFQLECEIVDYTQGSLSIQDYYSRFQNLWVEFSDIVCAAVSKVSLADVLVV 122

Query: 154 HEINKRDQFLMKL 167
           +EI+KRDQFLMKL
Sbjct: 123 YEISKRDQFLMKL 135

BLAST of Sgr018364 vs. ExPASy TrEMBL
Match: A0A6V7P508 (Uncharacterized protein OS=Ananas comosus var. bracteatus OX=296719 GN=CB5_LOCUS8945 PE=4 SV=1)

HSP 1 Score: 211.8 bits (538), Expect = 2.1e-51
Identity = 97/133 (72.93%), Positives = 114/133 (85.71%), Query Frame = 0

Query: 34  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKK 93
           +LW HIDG  PAP D TQL+QWK+KDARVMSWI GSCD Q+VLNLR Y +A+ MW YLKK
Sbjct: 32  ELWGHIDGIVPAPIDVTQLSQWKVKDARVMSWIIGSCDSQLVLNLRPYKTAKDMWEYLKK 91

Query: 94  IYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAEFSDIVCATVSKESLTDVLAI 153
           +Y QT+SARRFQLECEI+NYTQ  LSIQDY+S FQ LWAEFSDIVCAT+SK+S  DVLA+
Sbjct: 92  VYNQTHSARRFQLECEITNYTQRGLSIQDYFSVFQKLWAEFSDIVCATMSKDSQKDVLAV 151

Query: 154 HEINKRDQFLMKL 167
           ++++K DQFLMKL
Sbjct: 152 YDVSKHDQFLMKL 164

BLAST of Sgr018364 vs. ExPASy TrEMBL
Match: A0A5J5AIJ4 (Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_034198 PE=4 SV=1)

HSP 1 Score: 190.7 bits (483), Expect = 5.0e-45
Identity = 85/133 (63.91%), Positives = 111/133 (83.46%), Query Frame = 0

Query: 34  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKK 93
           +LW H+DG+ PAPTD  +L QWK+KDARVM+WI GS DP ++LNL+ + +A++MW YLKK
Sbjct: 4   ELWGHVDGSDPAPTDPMKLVQWKVKDARVMTWILGSVDPLLILNLKPHKTAKSMWEYLKK 63

Query: 94  IYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAEFSDIVCATVSKESLTDVLAI 153
           +Y Q +SARRFQLE +++ Y+QG+LS+Q+Y+ GFQNLWAEFSDIV A VS ESL+ V A+
Sbjct: 64  VYHQDHSARRFQLETDLAAYSQGTLSVQEYFCGFQNLWAEFSDIVYANVSAESLSAVQAV 123

Query: 154 HEINKRDQFLMKL 167
           HE +KRDQFLMKL
Sbjct: 124 HEASKRDQFLMKL 136

BLAST of Sgr018364 vs. ExPASy TrEMBL
Match: A0A5C7IEH8 (Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_008644 PE=4 SV=1)

HSP 1 Score: 189.1 bits (479), Expect = 1.5e-44
Identity = 90/133 (67.67%), Positives = 106/133 (79.70%), Query Frame = 0

Query: 34  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKK 93
           +LW HIDG+ PAPT+  +LA WK+KDARVMSWI GS DP IVLNLR Y +A+ MW YL K
Sbjct: 33  ELWGHIDGSDPAPTEPKELANWKVKDARVMSWILGSVDPLIVLNLRPYKTAKTMWEYLLK 92

Query: 94  IYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAEFSDIVCATVSKESLTDVLAI 153
           +Y Q N+A RFQLE EI+NYTQG+LSIQDY+S FQNLW EFSD+V A V   SL+ V A+
Sbjct: 93  VYHQDNTACRFQLEYEIANYTQGNLSIQDYFSSFQNLWGEFSDMVYAKVPAASLSAVQAV 152

Query: 154 HEINKRDQFLMKL 167
           HE +KRDQFLMKL
Sbjct: 153 HEQSKRDQFLMKL 165

BLAST of Sgr018364 vs. ExPASy TrEMBL
Match: A0A5C7HJ24 (CCHC-type domain-containing protein OS=Acer yangbiense OX=1000413 GN=EZV62_018177 PE=4 SV=1)

HSP 1 Score: 181.0 bits (458), Expect = 4.0e-42
Identity = 88/133 (66.17%), Positives = 104/133 (78.20%), Query Frame = 0

Query: 34  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKK 93
           +L  HIDG+  APT+  +LA WK+KDARVMSWI G  DP IVLNLR Y +A+ MW YL K
Sbjct: 169 ELCGHIDGSDLAPTEPKELANWKVKDARVMSWILGFVDPLIVLNLRPYKTAKTMWEYLLK 228

Query: 94  IYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAEFSDIVCATVSKESLTDVLAI 153
           +Y Q N+ARRFQLE EI+NYTQG+LSIQDY+S FQNLW EFSD+V A V   SL+ V A+
Sbjct: 229 VYHQDNTARRFQLEYEIANYTQGNLSIQDYFSSFQNLWGEFSDMVYAKVPATSLSAVQAV 288

Query: 154 HEINKRDQFLMKL 167
           HE +KRDQFLMKL
Sbjct: 289 HEQSKRDQFLMKL 301
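
The percentages in each HSP header are the match counts divided by the alignment length. A short sketch of that arithmetic, using counts taken from the headers above (`percent` is an illustrative helper, not a BLAST function):

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)

# Counts from the first NCBI nr hit (DAD32765.1): 133 aligned positions.
identity = percent(113, 133)
positives = percent(119, 133)

# Counts from the Swiss-Prot hit (Q94HW2): 86 aligned positions.
swissprot_identity = percent(24, 86)

print(identity, positives, swissprot_identity)
```

Identity counts positions where the two residues are identical. The second figure also includes conservative substitutions, marked `+` in the middle line of each alignment.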

The following BLAST results are available for this feature:

NCBI nr
Match Name       E-value   Identity (%)  Description
DAD32765.1       2.0e-59   84.96         TPA_asm: hypothetical protein HUJ06_011616 [Nelumbo nucifera]
DAD42694.1       7.0e-57   81.95         TPA_asm: hypothetical protein HUJ06_000924 [Nelumbo nucifera]
XP_010266766.1   5.5e-54   80.45         PREDICTED: uncharacterized protein LOC104604200 [Nelumbo nucifera]
CAD1825734.1     4.4e-51   72.93         unnamed protein product [Ananas comosus var. bracteatus]
XP_006346877.1   2.0e-48   70.68         PREDICTED: uncharacterized protein LOC102591997 [Solanum tuberosum]

ExPASy Swiss-Prot
Match Name       E-value   Identity (%)  Description
Q94HW2           1.1e-04   27.91         Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...

ExPASy TrEMBL
Match Name       E-value   Identity (%)  Description
A0A1U8AL74       2.7e-54   80.45         uncharacterized protein LOC104604200 OS=Nelumbo nucifera OX=4432 GN=LOC104604200...
A0A6V7P508       2.1e-51   72.93         Uncharacterized protein OS=Ananas comosus var. bracteatus OX=296719 GN=CB5_LOCUS...
A0A5J5AIJ4       5.0e-45   63.91         Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_034198 PE=4 SV=1
A0A5C7IEH8       1.5e-44   67.67         Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_008644 PE=4 SV=1
A0A5C7HJ24       4.0e-42   66.17         CCHC-type domain-containing protein OS=Acer yangbiense OX=1000413 GN=EZV62_01817...
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source   Source Term     Source Description   Alignment
None      No IPR available  PFAM     PF14223         Retrotran_gag_2      coord: 55..137; e-value: 1.1E-10; score: 41.5
None      No IPR available  PANTHER  PTHR37610:SF36  SUBFAMILY NOT NAMED  coord: 34..147
None      No IPR available  PANTHER  PTHR37610       FAMILY NOT NAMED     coord: 34..147

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name   Type
Sgr018364.1    Sgr018364.1   mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008270      zinc ion binding