Sgr013235 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr013235
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Reverse transcriptase
Location: tig00153764: 69620 .. 70000 (+)
RNA-Seq Expression: Sgr013235
Synteny: Sgr013235
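
The Location field can be unpacked into contig, start, end, and strand. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state explicitly). The names `LOCATION_RE` and `parse_location` are hypothetical and not part of any database API.

```python
import re

# Pattern for a location string such as "tig00153764: 69620 .. 70000 (+)".
LOCATION_RE = re.compile(r"^\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")

def parse_location(text):
    """Return (contig, start, end, strand, length) for a location string."""
    match = LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    contig, start, end, strand = match.groups()
    start, end = int(start), int(end)
    # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
    return contig, start, end, strand, end - start + 1

print(parse_location("tig00153764: 69620 .. 70000 (+)"))
```

Under that assumption the feature spans 381 bp, which is consistent with a 126-residue protein plus a stop codon (126 × 3 + 3 = 381).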
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAACAAATTAAATTCAAGTTTGAGGTATTTTGGTTGTTTCCCTGTGAGTAGTATTGGTTCGAAGGGAGGTATTTGTTTGTTGTGGAAAGAAAATGTGGATGTTTCGATTCGCTCCTTTTCGGGGGCTCATATTGATGCGATGATTTGTTCTGATGGTAAACGATGGAGATTCATAGGTTTATATGGCCAATCCAATGCTAGTAATCGCAAGTTTACTTGGGAACTACTTCGTTGTTTGCATGGTTTTGATGACTCAGCATGGGCGATTGGTGGGGATTTAAATGAAATATTATGGGACCTTGAGAAGTGTGGTGAGGCCAATAAAGATGAGTATTTGATGCCGGCATTTAGAGAGGTTTTGGATGATTGTCGTTTATAG

mRNA sequence

ATGAACAAATTAAATTCAAGTTTGAGGTATTTTGGTTGTTTCCCTGTGAGTAGTATTGGTTCGAAGGGAGGTATTTGTTTGTTGTGGAAAGAAAATGTGGATGTTTCGATTCGCTCCTTTTCGGGGGCTCATATTGATGCGATGATTTGTTCTGATGGTAAACGATGGAGATTCATAGGTTTATATGGCCAATCCAATGCTAGTAATCGCAAGTTTACTTGGGAACTACTTCGTTGTTTGCATGGTTTTGATGACTCAGCATGGGCGATTGGTGGGGATTTAAATGAAATATTATGGGACCTTGAGAAGTGTGGTGAGGCCAATAAAGATGAGTATTTGATGCCGGCATTTAGAGAGGTTTTGGATGATTGTCGTTTATAG

Coding sequence (CDS)

ATGAACAAATTAAATTCAAGTTTGAGGTATTTTGGTTGTTTCCCTGTGAGTAGTATTGGTTCGAAGGGAGGTATTTGTTTGTTGTGGAAAGAAAATGTGGATGTTTCGATTCGCTCCTTTTCGGGGGCTCATATTGATGCGATGATTTGTTCTGATGGTAAACGATGGAGATTCATAGGTTTATATGGCCAATCCAATGCTAGTAATCGCAAGTTTACTTGGGAACTACTTCGTTGTTTGCATGGTTTTGATGACTCAGCATGGGCGATTGGTGGGGATTTAAATGAAATATTATGGGACCTTGAGAAGTGTGGTGAGGCCAATAAAGATGAGTATTTGATGCCGGCATTTAGAGAGGTTTTGGATGATTGTCGTTTATAG

Protein sequence

MNKLNSSLRYFGCFPVSSIGSKGGICLLWKENVDVSIRSFSGAHIDAMICSDGKRWRFIGLYGQSNASNRKFTWELLRCLHGFDDSAWAIGGDLNEILWDLEKCGEANKDEYLMPAFREVLDDCRL
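
The relationship between the coding sequence and the protein sequence can be checked by translating the CDS in frame 0. The sketch below uses the standard genetic code in plain Python; the helper `translate` is hypothetical, and the assumption that the standard nuclear code applies to this gene is ours, not the page's. Only the first seven and last six codons of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# Start and end of the CDS listed above (middle omitted for brevity).
print(translate("ATGAACAAATTAAATTCAAGT"))  # MNKLNSS
print(translate("GATGATTGTCGTTTATAG"))     # DDCRL
```

The two fragments reproduce the first seven and last five residues of the protein sequence, with the final TAG codon read as the stop.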
Homology
BLAST of Sgr013235 vs. NCBI nr
Match: XP_040956169.1 (uncharacterized protein LOC107892503 [Gossypium hirsutum])

HSP 1 Score: 120.2 bits (300), Expect = 1.3e-23
Identity = 55/128 (42.97%), Positives = 78/128 (60.94%), Query Frame = 0

Query: 1   MNKLNSSLRYFGCFPVSSIGSKGGICLLWKENVDVSIRSFSGAHIDAMICSDGK--RWRF 60
           M K+  S  ++    V SIGS+GG+CL W+ N  ++++SFS  HID +I  +G+  +WRF
Sbjct: 1   MEKIRRSCGFYFGIDVDSIGSRGGLCLAWRGNAKIALQSFSNRHIDVIIEEEGEGVKWRF 60

Query: 61  IGLYGQSNASNRKFTWELLRCLHGFDDSAWAIGGDLNEILWDLEKCGEANKDEYLMPAFR 120
            G YG   + +R+ +W LLR L    D  W + GD NEIL+  EK G   + E  M AFR
Sbjct: 61  TGFYGSPYSYDREHSWNLLRQLKNQGDDPWLVCGDFNEILYSFEKKGGLPRKERRMEAFR 120

Query: 121 EVLDDCRL 127
           + L+DCRL
Sbjct: 121 KALEDCRL 128
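
The percentages on the Identity line of each HSP are the match counts divided by the alignment length, gaps included. A minimal sketch of that arithmetic, with a hypothetical helper `percent` and the counts copied from the HSP above:

```python
def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage rounded to two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)

# Counts from the HSP above: 55 identical and 78 positive columns out of 128.
print(percent(55, 128))  # 42.97
print(percent(78, 128))  # 60.94
```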

BLAST of Sgr013235 vs. NCBI nr
Match: XP_024039545.1 (uncharacterized protein LOC112098147 [Citrus clementina])

HSP 1 Score: 118.6 bits (296), Expect = 3.8e-23
Identity = 59/127 (46.46%), Positives = 74/127 (58.27%), Query Frame = 0

Query: 1   MNKLNSSLRYFGCFPVSSIGSKGGICLLWKENVDVSIRSFSGAHIDA-MICSDGKRWRFI 60
           MN+++  L Y  CF VSSIG  GG+ LLWK   +V I+SF+  HIDA ++  +GK  R  
Sbjct: 44  MNEVSRKLNYENCFAVSSIGKGGGLALLWKSETNVQIKSFNQHHIDAEVVMENGKLIRCT 103

Query: 61  GLYGQSNASNRKFTWELLRCLHGFDDSAWAIGGDLNEILWDLEKCGEANKDEYLMPAFRE 120
           G+YG  +   RK TW LLR L GF  + W   GD NEIL   EK G   +   L+  FRE
Sbjct: 104 GVYGHPDMRQRKHTWTLLRRLSGFSSTPWTCFGDFNEILHPFEKSGGNERQVSLITDFRE 163

Query: 121 VLDDCRL 127
            L DC L
Sbjct: 164 ALRDCDL 170

BLAST of Sgr013235 vs. NCBI nr
Match: KAA3479129.1 (reverse transcriptase [Gossypium australe])

HSP 1 Score: 117.5 bits (293), Expect = 8.5e-23
Identity = 55/128 (42.97%), Positives = 74/128 (57.81%), Query Frame = 0

Query: 1   MNKLNSSLRYFGCFPVSSIGSKGGICLLWKENVDVSIRSFSGAHIDAMICSD--GKRWRF 60
           M ++  S  +     V S GSKGG+C+ WKENV +S RS+S  HIDA +     G +WRF
Sbjct: 292 MKRVRRSYGFHNGVEVDSDGSKGGLCMAWKENVPISARSYSRRHIDAFVDDQIHGNKWRF 351

Query: 61  IGLYGQSNASNRKFTWELLRCLHGFDDSAWAIGGDLNEILWDLEKCGEANKDEYLMPAFR 120
            G YG   A  R+ +W LL+ L   +  +W + GD NEIL+  EK G   ++E  M  FR
Sbjct: 352 TGFYGSPYAREREESWNLLKSLREDEGQSWLVYGDFNEILYSFEKKGGLPREEQRMEDFR 411

Query: 121 EVLDDCRL 127
            VL DC+L
Sbjct: 412 NVLQDCQL 419

BLAST of Sgr013235 vs. NCBI nr
Match: XP_017636142.1 (PREDICTED: uncharacterized protein LOC108478216 [Gossypium arboreum])

HSP 1 Score: 117.1 bits (292), Expect = 1.1e-22
Identity = 53/128 (41.41%), Positives = 74/128 (57.81%), Query Frame = 0

Query: 1   MNKLNSSLRYFGCFPVSSIGSKGGICLLWKENVDVSIRSFSGAHIDAMICSDG--KRWRF 60
           M K+     +     VS+ GS+GGICL WKE++ VS++ FS  HID ++  +     WRF
Sbjct: 44  MKKIRRRCGFGNGIDVSAEGSRGGICLAWKEDIQVSLKIFSLTHIDVLVKGENITDEWRF 103

Query: 61  IGLYGQSNASNRKFTWELLRCLHGFDDSAWAIGGDLNEILWDLEKCGEANKDEYLMPAFR 120
            G YG     N+  +W LLR L    D  W +GGD NEI++  E  G   ++E +M AFR
Sbjct: 104 KGFYGSPYIQNKNVSWNLLRNLGKESDHPWLVGGDFNEIMYSFENSGGQQREEKMMEAFR 163

Query: 121 EVLDDCRL 127
           EVL++C L
Sbjct: 164 EVLEECHL 171

BLAST of Sgr013235 vs. NCBI nr
Match: KAA3466735.1 (reverse transcriptase [Gossypium australe])

HSP 1 Score: 116.3 bits (290), Expect = 1.9e-22
Identity = 51/109 (46.79%), Positives = 70/109 (64.22%), Query Frame = 0

Query: 20  GSKGGICLLWKENVDVSIRSFSGAHIDAMICSD--GKRWRFIGLYGQSNASNRKFTWELL 79
           GS+GGICL WKE + VS+++FS +HID MI  D   + W F+G YG    SN+  +W LL
Sbjct: 33  GSRGGICLAWKEEITVSLKNFSKSHIDVMINEDSVNEEWEFMGFYGSPYVSNKSASWNLL 92

Query: 80  RCLHGFDDSAWAIGGDLNEILWDLEKCGEANKDEYLMPAFREVLDDCRL 127
           R L    +  W + GD NEI++  EK G   ++E  M AFREVL++C+L
Sbjct: 93  RILGQEQNHPWLVSGDFNEIMYSFEKSGGQPREEKKMEAFREVLEECQL 141

BLAST of Sgr013235 vs. ExPASy TrEMBL
Match: A0A2N9EK24 (CCHC-type domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS7108 PE=4 SV=1)

HSP 1 Score: 118.6 bits (296), Expect = 1.8e-23
Identity = 54/116 (46.55%), Positives = 76/116 (65.52%), Query Frame = 0

Query: 12  GCFPVSSIGSKGGICLLWKENVDVSIRSFSGAHIDA-MICSDGKRWRFIGLYGQSNASNR 71
           G F V   G+ GG+ L+W ++ DV+I+SFS +HIDA +I ++G+ WR  G YGQ +AS R
Sbjct: 464 GAFAVDRHGTGGGLALMWADDYDVNIQSFSHSHIDAWIIDNEGRNWRLTGFYGQPDASKR 523

Query: 72  KFTWELLRCLHGFDDSAWAIGGDLNEILWDLEKCGEANKDEYLMPAFREVLDDCRL 127
             +W LLR LHG     W + GDLNEI+ + E  G+ ++  + M AFR+ LDDC L
Sbjct: 524 HESWSLLRHLHGTSILPWLVMGDLNEIVANSESTGQWDRQPHFMQAFRDALDDCNL 579

BLAST of Sgr013235 vs. ExPASy TrEMBL
Match: A0A5B6WBN2 (Reverse transcriptase OS=Gossypium australe OX=47621 GN=EPI10_019672 PE=4 SV=1)

HSP 1 Score: 117.5 bits (293), Expect = 4.1e-23
Identity = 55/128 (42.97%), Positives = 74/128 (57.81%), Query Frame = 0

Query: 1   MNKLNSSLRYFGCFPVSSIGSKGGICLLWKENVDVSIRSFSGAHIDAMICSD--GKRWRF 60
           M ++  S  +     V S GSKGG+C+ WKENV +S RS+S  HIDA +     G +WRF
Sbjct: 292 MKRVRRSYGFHNGVEVDSDGSKGGLCMAWKENVPISARSYSRRHIDAFVDDQIHGNKWRF 351

Query: 61  IGLYGQSNASNRKFTWELLRCLHGFDDSAWAIGGDLNEILWDLEKCGEANKDEYLMPAFR 120
            G YG   A  R+ +W LL+ L   +  +W + GD NEIL+  EK G   ++E  M  FR
Sbjct: 352 TGFYGSPYAREREESWNLLKSLREDEGQSWLVYGDFNEILYSFEKKGGLPREEQRMEDFR 411

Query: 121 EVLDDCRL 127
            VL DC+L
Sbjct: 412 NVLQDCQL 419

BLAST of Sgr013235 vs. ExPASy TrEMBL
Match: A0A6P4PK91 (uncharacterized protein LOC108478216 OS=Gossypium arboreum OX=29729 GN=LOC108478216 PE=4 SV=1)

HSP 1 Score: 117.1 bits (292), Expect = 5.4e-23
Identity = 53/128 (41.41%), Positives = 74/128 (57.81%), Query Frame = 0

Query: 1   MNKLNSSLRYFGCFPVSSIGSKGGICLLWKENVDVSIRSFSGAHIDAMICSDG--KRWRF 60
           M K+     +     VS+ GS+GGICL WKE++ VS++ FS  HID ++  +     WRF
Sbjct: 44  MKKIRRRCGFGNGIDVSAEGSRGGICLAWKEDIQVSLKIFSLTHIDVLVKGENITDEWRF 103

Query: 61  IGLYGQSNASNRKFTWELLRCLHGFDDSAWAIGGDLNEILWDLEKCGEANKDEYLMPAFR 120
            G YG     N+  +W LLR L    D  W +GGD NEI++  E  G   ++E +M AFR
Sbjct: 104 KGFYGSPYIQNKNVSWNLLRNLGKESDHPWLVGGDFNEIMYSFENSGGQQREEKMMEAFR 163

Query: 121 EVLDDCRL 127
           EVL++C L
Sbjct: 164 EVLEECHL 171

BLAST of Sgr013235 vs. ExPASy TrEMBL
Match: A0A5B6VC18 (Reverse transcriptase OS=Gossypium australe OX=47621 GN=EPI10_001806 PE=4 SV=1)

HSP 1 Score: 116.3 bits (290), Expect = 9.2e-23
Identity = 51/109 (46.79%), Positives = 70/109 (64.22%), Query Frame = 0

Query: 20  GSKGGICLLWKENVDVSIRSFSGAHIDAMICSD--GKRWRFIGLYGQSNASNRKFTWELL 79
           GS+GGICL WKE + VS+++FS +HID MI  D   + W F+G YG    SN+  +W LL
Sbjct: 33  GSRGGICLAWKEEITVSLKNFSKSHIDVMINEDSVNEEWEFMGFYGSPYVSNKSASWNLL 92

Query: 80  RCLHGFDDSAWAIGGDLNEILWDLEKCGEANKDEYLMPAFREVLDDCRL 127
           R L    +  W + GD NEI++  EK G   ++E  M AFREVL++C+L
Sbjct: 93  RILGQEQNHPWLVSGDFNEIMYSFEKSGGQPREEKKMEAFREVLEECQL 141

BLAST of Sgr013235 vs. ExPASy TrEMBL
Match: A0A803P3X8 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 116.3 bits (290), Expect = 9.2e-23
Identity = 54/125 (43.20%), Positives = 75/125 (60.00%), Query Frame = 0

Query: 1   MNKLNSSLRYFGCFPVSSIGSKGGICLLWKENVDVSIRSFSGAHIDAMICSD-GKRWRFI 60
           M ++   LRY GCF V++ G  GG+ LLWK+  +VSI+S++ +HIDA++ +  G  WRF 
Sbjct: 520 MERIRVVLRYDGCFVVAANGKSGGLALLWKDPYEVSIKSYTVSHIDALVENGLGFTWRFT 579

Query: 61  GLYGQSNASNRKFTWELLRCLHGFDDSAWAIGGDLNEILWDLEKCGEANKDEYLMPAFRE 120
           G YG  +   RKF+W+L+  L    + AW  GGD NEI+   EK G   K E  M AFR 
Sbjct: 580 GFYGSPDPGGRKFSWQLMEKLRNMVNGAWICGGDFNEIVKGSEKKGGGPKQESQMSAFRR 639

Query: 121 VLDDC 125
            +  C
Sbjct: 640 AISYC 644

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity (%)  Description
XP_040956169.1  1.3e-23  42.97         uncharacterized protein LOC107892503 [Gossypium hirsutum]
XP_024039545.1  3.8e-23  46.46         uncharacterized protein LOC112098147 [Citrus clementina]
KAA3479129.1    8.5e-23  42.97         reverse transcriptase [Gossypium australe]
XP_017636142.1  1.1e-22  41.41         PREDICTED: uncharacterized protein LOC108478216 [Gossypium arboreum]
KAA3466735.1    1.9e-22  46.79         reverse transcriptase [Gossypium australe]

ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A2N9EK24      1.8e-23  46.55         CCHC-type domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS7108...
A0A5B6WBN2      4.1e-23  42.97         Reverse transcriptase OS=Gossypium australe OX=47621 GN=EPI10_019672 PE=4 SV=1
A0A6P4PK91      5.4e-23  41.41         uncharacterized protein LOC108478216 OS=Gossypium arboreum OX=29729 GN=LOC108478...
A0A5B6VC18      9.2e-23  46.79         Reverse transcriptase OS=Gossypium australe OX=47621 GN=EPI10_001806 PE=4 SV=1
A0A803P3X8      9.2e-23  43.20         Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term: IPR036691
IPR Description: Endonuclease/exonuclease/phosphatase superfamily
Source: GENE3D
Source Term: 3.60.10.10
Source Description: Endonuclease/exonuclease/phosphatase
Alignment: coord: 1..126, e-value: 3.1E-12, score: 48.7

IPR Term: IPR036691
IPR Description: Endonuclease/exonuclease/phosphatase superfamily
Source: SUPERFAMILY
Source Term: 56219
Source Description: DNase I-like
Alignment: coord: 5..125

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr013235.1   Sgr013235.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0003824      catalytic activity