Sgr019549 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr019549
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: RNase H domain-containing protein
Location: tig00153348: 693633 .. 694067 (+)
RNA-Seq Expression: Sgr019549
Synteny: Sgr019549
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAATCCCCAAGTGACAATTTATATAAAATCAATACTGATGCTTCTTTCCTTAAAGATCTCTATGATGCAAGTATTGGAGTGATGATCAAAAACCACAGTGGGGAGGTGATGCTTACCGCGTCTATTTACATTGCCTTGGTAGTTGACTTTGACACTGTTGAAGTTATGGCAGCTCAAGAGGGAGTGCAGTTGGGAGTGGAAATGGACTTATGGTTGGCCATCCTCGAGATTGATTCTCTGCGTATCTACAAGCAGTTAAGGGAAGAGGTAGAGGATGTATCCAAAATTGGATGCATGCTCTTAGATGTAAAAGCCCAAATGAAGATGGAGAACAAAATGGAGGCTAGCTTCACACTTCACACACAGAGAGGGGAATCGAGCTGCCTATGTGCTAGCCTAGAGAGCAGTGGTAACGAAAGCATTTGGAACTAG

mRNA sequence

ATGGAATCCCCAAGTGACAATTTATATAAAATCAATACTGATGCTTCTTTCCTTAAAGATCTCTATGATGCAAGTATTGGAGTGATGATCAAAAACCACAGTGGGGAGGTGATGCTTACCGCGTCTATTTACATTGCCTTGGTAGTTGACTTTGACACTGTTGAAGTTATGGCAGCTCAAGAGGGAGTGCAGTTGGGAGTGGAAATGGACTTATGGTTGGCCATCCTCGAGATTGATTCTCTGCGTATCTACAAGCAGTTAAGGGAAGAGGTAGAGGATGTATCCAAAATTGGATGCATGCTCTTAGATGTAAAAGCCCAAATGAAGATGGAGAACAAAATGGAGGCTAGCTTCACACTTCACACACAGAGAGGGGAATCGAGCTGCCTATGTGCTAGCCTAGAGAGCAGTGGTAACGAAAGCATTTGGAACTAG

Coding sequence (CDS)

ATGGAATCCCCAAGTGACAATTTATATAAAATCAATACTGATGCTTCTTTCCTTAAAGATCTCTATGATGCAAGTATTGGAGTGATGATCAAAAACCACAGTGGGGAGGTGATGCTTACCGCGTCTATTTACATTGCCTTGGTAGTTGACTTTGACACTGTTGAAGTTATGGCAGCTCAAGAGGGAGTGCAGTTGGGAGTGGAAATGGACTTATGGTTGGCCATCCTCGAGATTGATTCTCTGCGTATCTACAAGCAGTTAAGGGAAGAGGTAGAGGATGTATCCAAAATTGGATGCATGCTCTTAGATGTAAAAGCCCAAATGAAGATGGAGAACAAAATGGAGGCTAGCTTCACACTTCACACACAGAGAGGGGAATCGAGCTGCCTATGTGCTAGCCTAGAGAGCAGTGGTAACGAAAGCATTTGGAACTAG

Protein sequence

MESPSDNLYKINTDASFLKDLYDASIGVMIKNHSGEVMLTASIYIALVVDFDTVEVMAAQEGVQLGVEMDLWLAILEIDSLRIYKQLREEVEDVSKIGCMLLDVKAQMKMENKMEASFTLHTQRGESSCLCASLESSGNESIWN
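
As a quick consistency check on the sequences above, the following Python sketch parses the Location string, confirms that the span covers a whole number of codons, and translates the first ten codons of the CDS with the standard genetic code. The names `parse_location` and `translate` are hypothetical helpers introduced here for illustration; they are not part of the database or of any library.

```python
import re

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def parse_location(text):
    """Parse 'contig: start .. end (strand)' into its parts (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    contig, start, end, strand = m.groups()
    return contig, int(start), int(end), strand

def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

contig, start, end, strand = parse_location("tig00153348: 693633 .. 694067 (+)")
span = end - start + 1   # assumes inclusive coordinates, as the 435-nt gene sequence suggests
codons = span // 3       # count includes the terminal stop codon

cds_prefix = "ATGGAATCCCCAAGTGACAATTTATATAAA"  # first 30 nt of the CDS above
protein_prefix = translate(cds_prefix)
print(contig, span, codons, protein_prefix)
```

A span of 435 nt corresponds to 145 codons, that is, 144 amino acids plus the stop codon, which agrees with the 144-residue protein sequence listed above.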
Homology
BLAST of Sgr019549 vs. NCBI nr
Match: XP_022140628.1 (uncharacterized protein LOC111011237 [Momordica charantia])

HSP 1 Score: 73.2 bits (178), Expect = 2.1e-09
Identity = 45/125 (36.00%), Positives = 70/125 (56.00%), Query Frame = 0

Query: 4   PSDNLYKINTDASFLKDLYDASIGVMIKNHSGEVMLTASIYIALVVDFDTVEVMAAQEGV 63
           P   +YKINTDASFL     A +G++I+N  G+VM +A+ Y+  +   D  E + A EG+
Sbjct: 37  PDKRIYKINTDASFLASDQHAGLGIIIRNDRGQVMASATKYLENIQSVDMAEAIVAVEGL 96

Query: 64  QLGVEMDLWLAILEIDSLRIYKQLREEVEDVSKIGCMLLDVKAQMKMENKMEASFTLHTQ 123
           QL  ++ +   ILE DS RI+    +  ED+S+ G ++L  KA+      + ASF    +
Sbjct: 97  QLASKIGVNPVILETDSSRIFNLFSQPSEDLSETGEIVL--KAKNFWTQSLHASFNFVKR 156

Query: 124 RGESS 129
            G  +
Sbjct: 157 EGNKA 159
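
The percentages in an HSP header are the identity and positive counts divided by the alignment length. A minimal Python sketch of that arithmetic follows; `percent` is a hypothetical helper, and the counts are copied from the HSP above.

```python
def percent(count, alignment_length):
    """Return count / alignment_length as a percentage rounded to two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * count / alignment_length, 2)

# Values from the first HSP against XP_022140628.1: 45 identities and
# 70 positives over an alignment of 125 columns.
identity_pct = percent(45, 125)
positive_pct = percent(70, 125)
print(f"{identity_pct:.2f} {positive_pct:.2f}")
```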

BLAST of Sgr019549 vs. NCBI nr
Match: XP_022139684.1 (uncharacterized protein LOC111010533 [Momordica charantia])

HSP 1 Score: 70.9 bits (172), Expect = 1.0e-08
Identity = 41/118 (34.75%), Positives = 70/118 (59.32%), Query Frame = 0

Query: 6   DNLYKINTDASFLKDLYDASIGV-MIKNHSGEVMLTASIYIALVVDFDTVEVMAAQEGVQ 65
           + ++K+ TDASF    ++A +GV +I++H G+V+ +A+ Y+  V   D  E +AA EG++
Sbjct: 95  EGVFKLKTDASFSSIDFNAGLGVIIIRDHRGQVLASATKYLEHVASVDDAEALAAVEGLR 154

Query: 66  LGVEMDLWLAILEIDSLRIYKQLREEVEDVSKIGCMLLDVKAQMKMENKMEASFTLHT 123
           + +E  +   +LE DSLRIY     + E +SK G ++  VK  +    ++  SFT  T
Sbjct: 155 VAMETGISPILLETDSLRIYNLFARDKEGLSKTGSIIEYVKTHLATRLQVSYSFTKRT 212

BLAST of Sgr019549 vs. NCBI nr
Match: XP_028115288.1 (uncharacterized protein LOC114313136 isoform X3 [Camellia sinensis])

HSP 1 Score: 70.1 bits (170), Expect = 1.8e-08
Identity = 48/141 (34.04%), Positives = 68/141 (48.23%), Query Frame = 0

Query: 3   SPSDNLYKINTDASFLKDLYDASIGVMIKNHSGEVMLTASIYIALVVDFDTVEVMAAQEG 62
           +P++   KIN D +  K+L    +GV+I+NH GEVM   S  +   VD D  E  AA + 
Sbjct: 661 APANGWLKINFDGALFKELKAVGVGVVIRNHLGEVMAALSERLPFWVDSDCAEAYAATKA 720

Query: 63  VQLGVEMDLWLAILEIDSLRIYKQLREEVEDVSKIGCMLLDVKAQMKMENKMEASFTLHT 122
           V+L  ++ L    LE DSLRI K LREE E +S+ G +LL      K  ++   S     
Sbjct: 721 VELARDLGLSDIHLEGDSLRIVKVLREEAEFMSEYGHILLQTVGVCKSFSRFHVSHVGRQ 780

Query: 123 QRGESSCLCASLESSGNESIW 144
             G +  L           +W
Sbjct: 781 GNGLAHGLARMARHQNYHQVW 801

BLAST of Sgr019549 vs. NCBI nr
Match: XP_028115297.1 (uncharacterized protein LOC114313136 isoform X6 [Camellia sinensis] >XP_028115298.1 uncharacterized protein LOC114313136 isoform X6 [Camellia sinensis] >XP_028115299.1 uncharacterized protein LOC114313136 isoform X6 [Camellia sinensis] >XP_028115300.1 uncharacterized protein LOC114313136 isoform X6 [Camellia sinensis] >XP_028115301.1 uncharacterized protein LOC114313136 isoform X6 [Camellia sinensis])

HSP 1 Score: 70.1 bits (170), Expect = 1.8e-08
Identity = 48/141 (34.04%), Positives = 68/141 (48.23%), Query Frame = 0

Query: 3   SPSDNLYKINTDASFLKDLYDASIGVMIKNHSGEVMLTASIYIALVVDFDTVEVMAAQEG 62
           +P++   KIN D +  K+L    +GV+I+NH GEVM   S  +   VD D  E  AA + 
Sbjct: 671 APANGWLKINFDGALFKELKAVGVGVVIRNHLGEVMAALSERLPFWVDSDCAEAYAATKA 730

Query: 63  VQLGVEMDLWLAILEIDSLRIYKQLREEVEDVSKIGCMLLDVKAQMKMENKMEASFTLHT 122
           V+L  ++ L    LE DSLRI K LREE E +S+ G +LL      K  ++   S     
Sbjct: 731 VELARDLGLSDIHLEGDSLRIVKVLREEAEFMSEYGHILLQTVGVCKSFSRFHVSHVGRQ 790

Query: 123 QRGESSCLCASLESSGNESIW 144
             G +  L           +W
Sbjct: 791 GNGLAHGLARMARHQNYHQVW 811

BLAST of Sgr019549 vs. NCBI nr
Match: XP_028115286.1 (uncharacterized protein LOC114313136 isoform X1 [Camellia sinensis])

HSP 1 Score: 70.1 bits (170), Expect = 1.8e-08
Identity = 48/141 (34.04%), Positives = 68/141 (48.23%), Query Frame = 0

Query: 3   SPSDNLYKINTDASFLKDLYDASIGVMIKNHSGEVMLTASIYIALVVDFDTVEVMAAQEG 62
           +P++   KIN D +  K+L    +GV+I+NH GEVM   S  +   VD D  E  AA + 
Sbjct: 704 APANGWLKINFDGALFKELKAVGVGVVIRNHLGEVMAALSERLPFWVDSDCAEAYAATKA 763

Query: 63  VQLGVEMDLWLAILEIDSLRIYKQLREEVEDVSKIGCMLLDVKAQMKMENKMEASFTLHT 122
           V+L  ++ L    LE DSLRI K LREE E +S+ G +LL      K  ++   S     
Sbjct: 764 VELARDLGLSDIHLEGDSLRIVKVLREEAEFMSEYGHILLQTVGVCKSFSRFHVSHVGRQ 823

Query: 123 QRGESSCLCASLESSGNESIW 144
             G +  L           +W
Sbjct: 824 GNGLAHGLARMARHQNYHQVW 844

BLAST of Sgr019549 vs. ExPASy TrEMBL
Match: A0A6J1CIF1 (uncharacterized protein LOC111011237 OS=Momordica charantia OX=3673 GN=LOC111011237 PE=4 SV=1)

HSP 1 Score: 73.2 bits (178), Expect = 1.0e-09
Identity = 45/125 (36.00%), Positives = 70/125 (56.00%), Query Frame = 0

Query: 4   PSDNLYKINTDASFLKDLYDASIGVMIKNHSGEVMLTASIYIALVVDFDTVEVMAAQEGV 63
           P   +YKINTDASFL     A +G++I+N  G+VM +A+ Y+  +   D  E + A EG+
Sbjct: 37  PDKRIYKINTDASFLASDQHAGLGIIIRNDRGQVMASATKYLENIQSVDMAEAIVAVEGL 96

Query: 64  QLGVEMDLWLAILEIDSLRIYKQLREEVEDVSKIGCMLLDVKAQMKMENKMEASFTLHTQ 123
           QL  ++ +   ILE DS RI+    +  ED+S+ G ++L  KA+      + ASF    +
Sbjct: 97  QLASKIGVNPVILETDSSRIFNLFSQPSEDLSETGEIVL--KAKNFWTQSLHASFNFVKR 156

Query: 124 RGESS 129
            G  +
Sbjct: 157 EGNKA 159

BLAST of Sgr019549 vs. ExPASy TrEMBL
Match: A0A6J1CDQ4 (uncharacterized protein LOC111010533 OS=Momordica charantia OX=3673 GN=LOC111010533 PE=4 SV=1)

HSP 1 Score: 70.9 bits (172), Expect = 5.0e-09
Identity = 41/118 (34.75%), Positives = 70/118 (59.32%), Query Frame = 0

Query: 6   DNLYKINTDASFLKDLYDASIGV-MIKNHSGEVMLTASIYIALVVDFDTVEVMAAQEGVQ 65
           + ++K+ TDASF    ++A +GV +I++H G+V+ +A+ Y+  V   D  E +AA EG++
Sbjct: 95  EGVFKLKTDASFSSIDFNAGLGVIIIRDHRGQVLASATKYLEHVASVDDAEALAAVEGLR 154

Query: 66  LGVEMDLWLAILEIDSLRIYKQLREEVEDVSKIGCMLLDVKAQMKMENKMEASFTLHT 123
           + +E  +   +LE DSLRIY     + E +SK G ++  VK  +    ++  SFT  T
Sbjct: 155 VAMETGISPILLETDSLRIYNLFARDKEGLSKTGSIIEYVKTHLATRLQVSYSFTKRT 212

BLAST of Sgr019549 vs. ExPASy TrEMBL
Match: A0A7J7GYW5 (Uncharacterized protein OS=Camellia sinensis OX=4442 GN=HYC85_016191 PE=3 SV=1)

HSP 1 Score: 70.1 bits (170), Expect = 8.6e-09
Identity = 47/141 (33.33%), Positives = 68/141 (48.23%), Query Frame = 0

Query: 3   SPSDNLYKINTDASFLKDLYDASIGVMIKNHSGEVMLTASIYIALVVDFDTVEVMAAQEG 62
           +P++   KIN D +  K+L    +GV+I+NH GEVM   S  +   VD D  E  AA + 
Sbjct: 670 APANGWLKINFDGALFKELKAVGVGVVIRNHLGEVMAALSERLPFWVDSDCAEAYAATKA 729

Query: 63  VQLGVEMDLWLAILEIDSLRIYKQLREEVEDVSKIGCMLLDVKAQMKMENKMEASFTLHT 122
           V+L  ++      LE DSLRI K LREE E +S+ G +LL      K  ++   S     
Sbjct: 730 VELARDLGFSDIHLEGDSLRIVKALREEAEFMSEYGHILLQTVGACKSFSRFHVSHVGRQ 789

Query: 123 QRGESSCLCASLESSGNESIW 144
             G +  L         + +W
Sbjct: 790 GNGLAHGLARMARHQNYQQVW 810

BLAST of Sgr019549 vs. ExPASy TrEMBL
Match: A0A5B6V0L8 (Reverse transcriptase OS=Gossypium australe OX=47621 GN=EPI10_029142 PE=4 SV=1)

HSP 1 Score: 65.1 bits (157), Expect = 2.8e-07
Identity = 36/120 (30.00%), Positives = 66/120 (55.00%), Query Frame = 0

Query: 2   ESPSDNLYKINTDASFLKDLYDASIGVMIKNHSGEVMLTASIYIALVVDFDTVEVMAAQE 61
           +SP   + KIN D +F KD + ++ G++ +N  G+V+++++ +  +V      E +A +E
Sbjct: 402 QSPPQQVVKINFDDAFDKDRHQSASGIVARNSEGKVLVSSTSFHKMVDSAFAAEAIACRE 461

Query: 62  GVQLGVEMDLWLAILEIDSLRIYKQLREEVEDVSKIGCMLLDVKAQMKMENKMEASFTLH 121
            VQ+G+ M      +E DSL + K+ R  V D S+IG  + D+     +  K+   F L+
Sbjct: 462 AVQIGINMQKEEIFVEGDSLTVIKKCRNSVADKSQIGAYINDIHQMKTIFKKLRFDFILN 521

BLAST of Sgr019549 vs. ExPASy TrEMBL
Match: A0A6J1DBJ7 (uncharacterized protein LOC111018973 OS=Momordica charantia OX=3673 GN=LOC111018973 PE=4 SV=1)

HSP 1 Score: 64.7 bits (156), Expect = 3.6e-07
Identity = 46/141 (32.62%), Positives = 71/141 (50.35%), Query Frame = 0

Query: 4   PSDNLYKINTDASFLKDLYDASIGVMIKNHSGEVMLTASIYIALVVDFDTVEVMAAQEGV 63
           P+  L K+N DA+F K+ + A +GV+I++ +G V LTA   +A   D D VE  A  EG+
Sbjct: 107 PAAPLLKVNVDAAFRKESFVAGVGVIIRDSTGLVYLTAIRLLARASDVDWVEGFAVYEGI 166

Query: 64  QLGVEMDLWLAILEIDSLRIYKQLREEVEDVSKIGCMLLDVKAQMKME-NKMEASFTLHT 123
            L VE       +E DSLRI+  L  +  D S++G +   +K  +     ++  SFT   
Sbjct: 167 LLAVEAGFIRFQIETDSLRIFNLLTTDCVDDSEVGVLCSVIKLFLSSHAERVSFSFTHRN 226

Query: 124 QRGESSCLCASLESSGNESIW 144
               +  L     +S +  IW
Sbjct: 227 GNAXAHLLAQLALTSPHLQIW 247

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity  Description
XP_022140628.1  2.1e-09  36.00     uncharacterized protein LOC111011237 [Momordica charantia]
XP_022139684.1  1.0e-08  34.75     uncharacterized protein LOC111010533 [Momordica charantia]
XP_028115288.1  1.8e-08  34.04     uncharacterized protein LOC114313136 isoform X3 [Camellia sinensis]
XP_028115297.1  1.8e-08  34.04     uncharacterized protein LOC114313136 isoform X6 [Camellia sinensis] >XP_02811529...
XP_028115286.1  1.8e-08  34.04     uncharacterized protein LOC114313136 isoform X1 [Camellia sinensis]

ExPASy TrEMBL
Match Name  E-value  Identity  Description
A0A6J1CIF1  1.0e-09  36.00     uncharacterized protein LOC111011237 OS=Momordica charantia OX=3673 GN=LOC111011...
A0A6J1CDQ4  5.0e-09  34.75     uncharacterized protein LOC111010533 OS=Momordica charantia OX=3673 GN=LOC111010...
A0A7J7GYW5  8.6e-09  33.33     Uncharacterized protein OS=Camellia sinensis OX=4442 GN=HYC85_016191 PE=3 SV=1
A0A5B6V0L8  2.8e-07  30.00     Reverse transcriptase OS=Gossypium australe OX=47621 GN=EPI10_029142 PE=4 SV=1
A0A6J1DBJ7  3.6e-07  32.62     uncharacterized protein LOC111018973 OS=Momordica charantia OX=3673 GN=LOC111018...

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description        Source   Source Term     Source Description     Alignment
IPR002156  Ribonuclease H domain  PFAM     PF13456         RVT_3                  coord: 12..121; e-value: 3.3E-9; score: 36.6
None       No IPR available       PANTHER  PTHR47074:SF25  SUBFAMILY NOT NAMED    coord: 4..121
None       No IPR available       PANTHER  PTHR47074       BNAC02G40300D PROTEIN  coord: 4..121

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr019549.1   Sgr019549.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008152      metabolic process
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0004523      RNA-DNA hybrid ribonuclease activity