Sgr024673 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr024673
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Unknown protein
Location: tig00002486: 1748414 .. 1748749 (+)
RNA-Seq Expression: Sgr024673
Synteny: Sgr024673
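
The Location field can be checked against the sequence lengths listed further down. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this); `parse_location` is a hypothetical helper, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'contig: start .. end (strand)' into its parts.

    Assumes 1-based, inclusive coordinates; the record does not say so explicitly.
    """
    pattern = r"\s*([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return contig, int(start), int(end), strand

contig, start, end, strand = parse_location("tig00002486: 1748414 .. 1748749 (+)")

# Inclusive span of the feature on the contig, in nucleotides.
span = end - start + 1

# Number of whole codons in the span; one of them would be the stop codon.
codons = span // 3

print(contig, strand, span, codons)
```

Under that assumption the span is 336 nt, i.e. 112 codons, which is consistent with a 111-residue protein plus a stop codon and with the `coord: 1..111` PANTHER rows in the InterPro section.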
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGGATGCAGAGAGAACAACCGGCGCCGGTTCCCGGAGACAGCCTCCCACCCGACTACAGAGGCGGGCTCCGGCGTCGATTCAGATCAGTCGCCCGTCGAACTGGAACGTGGCCATCCCGTTGTTATCCCCCCTCGTGTCTCCTTCTTCTTCAGACCAAGCCGATCTGCTGATGGCTGATAACAAGCCGAGAGAGGAAGCGAGGTCGCTGGAGCAGAGGCATCGGCGTAGCTTGGAGATGGAGAAGGCGCCCACCTTCGCGAAGTGGCAGCACCCTGCGGCTCCATTCTATTACGGGCCGGTCCCAAGGGCCACCCCCTTTGTGCCTGTGTGA

mRNA sequence

ATGGCGGATGCAGAGAGAACAACCGGCGCCGGTTCCCGGAGACAGCCTCCCACCCGACTACAGAGGCGGGCTCCGGCGTCGATTCAGATCAGTCGCCCGTCGAACTGGAACGTGGCCATCCCGTTGTTATCCCCCCTCGTGTCTCCTTCTTCTTCAGACCAAGCCGATCTGCTGATGGCTGATAACAAGCCGAGAGAGGAAGCGAGGTCGCTGGAGCAGAGGCATCGGCGTAGCTTGGAGATGGAGAAGGCGCCCACCTTCGCGAAGTGGCAGCACCCTGCGGCTCCATTCTATTACGGGCCGGTCCCAAGGGCCACCCCCTTTGTGCCTGTGTGA

Coding sequence (CDS)

ATGGCGGATGCAGAGAGAACAACCGGCGCCGGTTCCCGGAGACAGCCTCCCACCCGACTACAGAGGCGGGCTCCGGCGTCGATTCAGATCAGTCGCCCGTCGAACTGGAACGTGGCCATCCCGTTGTTATCCCCCCTCGTGTCTCCTTCTTCTTCAGACCAAGCCGATCTGCTGATGGCTGATAACAAGCCGAGAGAGGAAGCGAGGTCGCTGGAGCAGAGGCATCGGCGTAGCTTGGAGATGGAGAAGGCGCCCACCTTCGCGAAGTGGCAGCACCCTGCGGCTCCATTCTATTACGGGCCGGTCCCAAGGGCCACCCCCTTTGTGCCTGTGTGA

Protein sequence

MADAERTTGAGSRRQPPTRLQRRAPASIQISRPSNWNVAIPLLSPLVSPSSSDQADLLMADNKPREEARSLEQRHRRSLEMEKAPTFAKWQHPAAPFYYGPVPRATPFVPV
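
The relationship between the coding sequence and the protein sequence can be spot-checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1); the record does not say which table was used. `translate` is an illustrative helper, and only the first 30 and last 12 nucleotides of the CDS are used as input.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds, to_stop=True):
    """Translate an in-frame DNA sequence; stop codons are rendered as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*" and to_stop:
            break  # stop translating at the first stop codon
        protein.append(amino_acid)
    return "".join(protein)

# First 30 nt and last 12 nt of the CDS listed above.
cds_start = "ATGGCGGATGCAGAGAGAACAACCGGCGCC"
cds_end = "GTGCCTGTGTGA"

print(translate(cds_start))  # MADAERTTGA
print(translate(cds_end))    # VPV
```

Both fragments agree with the start (`MADAERTTGA`) and end (`VPV`) of the protein sequence shown above; the final `TGA` is a stop codon and is not part of the protein.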
Homology
BLAST of Sgr024673 vs. NCBI nr
Match: XP_022143991.1 (uncharacterized protein At4g14450, chloroplastic-like [Momordica charantia])

HSP 1 Score: 124.8 bits (312), Expect = 4.7e-25
Identity = 70/111 (63.06%), Positives = 79/111 (71.17%), Query Frame = 0

Query: 1   MADAERTTGAGSRRQPPTRLQRRAPASIQISRPSNWNVAIPLLSPLVSPSSSDQADLLMA 60
           M DAER     S  Q  TRLQRRAP+SIQISRP++WNVAIPLLSPLVSP   +Q D+LM 
Sbjct: 1   MGDAERRGAGDSVSQEATRLQRRAPSSIQISRPASWNVAIPLLSPLVSP-CLEQVDVLMG 60

Query: 61  DNKPREEARSLEQRHRRSLEMEKAPTFAKWQHPAAPFYYGPVPRATPFVPV 112
           +NK REEARS           +K  TF +W+HPAAPFYYGPV R TPFVPV
Sbjct: 61  ENKAREEARS----------RDKPATFTRWKHPAAPFYYGPVQRTTPFVPV 100
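
The percentages in the HSP header follow from the reported counts: each is the number of matching columns divided by the alignment length, which includes gap columns. A minimal sketch of that arithmetic, using the counts from the first HSP above (`percent` is an illustrative helper name):

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / aligned_length, 2)

# Counts reported for the XP_022143991.1 HSP: 70 identical and
# 79 positive-scoring columns out of 111 alignment columns.
identity = percent(70, 111)
positives = percent(79, 111)

print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
```

The same calculation reproduces the values for the other hits, for example 66/115 for the Cucumis sativus match.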

BLAST of Sgr024673 vs. NCBI nr
Match: KAE8650334.1 (hypothetical protein Csa_009641 [Cucumis sativus])

HSP 1 Score: 108.2 bits (269), Expect = 4.5e-20
Identity = 66/115 (57.39%), Positives = 74/115 (64.35%), Query Frame = 0

Query: 1   MADAERTTGAGSRRQPPTRLQRRAPASIQISRPSNWNVAIPLLSPLVSPS----SSDQAD 60
           M+DAER T   +    PTRLQ +APASI+I R  NWNVAIPLLSPLVSPS    S+ +  
Sbjct: 1   MSDAERKT---ATTVAPTRLQSQAPASIEIKRALNWNVAIPLLSPLVSPSSCGNSAPEKM 60

Query: 61  LLMADNKPREEARSLEQRHRRSLEMEKAPTFAKWQHPAAPFYYGPVPRATPFVPV 112
           L MA+N  REE + L              TF KWQHPAAPFYY PVPRA PFVPV
Sbjct: 61  LSMAENNAREETKGL--------------TFTKWQHPAAPFYYEPVPRANPFVPV 98

BLAST of Sgr024673 vs. NCBI nr
Match: PON32321.1 (hypothetical protein PanWU01x14_362290 [Parasponia andersonii])

HSP 1 Score: 105.1 bits (261), Expect = 3.8e-19
Identity = 63/111 (56.76%), Positives = 71/111 (63.96%), Query Frame = 0

Query: 1   MADAERTTGAGSRRQPPTRLQRRAPASIQISRPSNWNVAIPLLSPLVSPSSSDQADLLMA 60
           MAD    T     R+ P+RLQRRAPAS+QIS  S+WNVAIPLLSPL SPSSS +A    A
Sbjct: 1   MADVTIPTSGACLRRQPSRLQRRAPASLQISPVSDWNVAIPLLSPLASPSSSPKALDRTA 60

Query: 61  DNKPREEARSLEQRHRRSLEMEKAPTFAKWQHPAAPFYYGPVPRATPFVPV 112
           +NK RE+ R      R   E +K   F KWQHPAAPF Y P P   PFVPV
Sbjct: 61  ENKSREDQR------RLVAEPDKPIVFKKWQHPAAPFCYEPAPLVRPFVPV 105

BLAST of Sgr024673 vs. NCBI nr
Match: PON98714.1 (hypothetical protein TorRG33x02_056140 [Trema orientale])

HSP 1 Score: 104.0 bits (258), Expect = 8.6e-19
Identity = 62/111 (55.86%), Positives = 71/111 (63.96%), Query Frame = 0

Query: 1   MADAERTTGAGSRRQPPTRLQRRAPASIQISRPSNWNVAIPLLSPLVSPSSSDQADLLMA 60
           MAD  R T     R+ P+RLQRRAPAS+QI   S+WNVAIPLLSPL SPSSS +A    A
Sbjct: 1   MADVTRPTSGACPRRQPSRLQRRAPASLQIRPVSDWNVAIPLLSPLASPSSSPKALDRTA 60

Query: 61  DNKPREEARSLEQRHRRSLEMEKAPTFAKWQHPAAPFYYGPVPRATPFVPV 112
           + K RE+ R      R+  E +K   F KWQHPAAPF Y P P   PFVPV
Sbjct: 61  EIKSREDQR------RQVAEPDKPIVFKKWQHPAAPFCYEPAPLVRPFVPV 105

BLAST of Sgr024673 vs. NCBI nr
Match: XP_030506624.1 (uncharacterized protein At4g14450, chloroplastic [Cannabis sativa])

HSP 1 Score: 104.0 bits (258), Expect = 8.6e-19
Identity = 62/113 (54.87%), Positives = 72/113 (63.72%), Query Frame = 0

Query: 1   MADAERTTGAGSRRQPPTRLQRRAPASIQISRPSNWNVAIPLLSPLVSPSSSDQADLLMA 60
           MA+  R +     R+ P+RLQRRAPAS+QIS  S+WNVAIPLLSPLVSPSS         
Sbjct: 1   MAEVSRPSSVPGTRRQPSRLQRRAPASLQISPVSDWNVAIPLLSPLVSPSSPKTLVDRTV 60

Query: 61  DNKPREEARSL--EQRHRRSLEMEKAPTFAKWQHPAAPFYYGPVPRATPFVPV 112
           +NK R++   L   QR R+S E EK   F KWQHPAAPF Y   P   PFVPV
Sbjct: 61  ENKSRDDHHHLHHHQRLRQSSESEKPIVFKKWQHPAAPFCYDSTPLVRPFVPV 113

BLAST of Sgr024673 vs. ExPASy Swiss-Prot
Match: Q6NN02 (Uncharacterized protein At4g14450, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At4g14450 PE=2 SV=1)

HSP 1 Score: 66.2 bits (160), Expect = 2.6e-10
Identity = 48/111 (43.24%), Positives = 62/111 (55.86%), Query Frame = 0

Query: 7   TTGAGSRRQPPTRLQRRAPA-SIQISRPSNWNVAIPLLSPLVS--PSSSDQADLLMADNK 66
           ++ A S  +  ++LQRRAP+  I+ +  SNWNVAIPLLSPL     SS DQ+ +    NK
Sbjct: 27  SSSAASNERQLSQLQRRAPSLMIKPTSFSNWNVAIPLLSPLAPSLTSSFDQSHVPPPQNK 86

Query: 67  ---PREEARSLEQRHRRSLEMEKAPTFAKWQHPAAPFYYGPVPRATPFVPV 112
              P EE            E++K P F KWQHPA+PF Y P     PF+ V
Sbjct: 87  TEIPVEE------------EVKKTPVFKKWQHPASPFCYEPTTFVPPFIQV 125

BLAST of Sgr024673 vs. ExPASy TrEMBL
Match: A0A6J1CRZ0 (uncharacterized protein At4g14450, chloroplastic-like OS=Momordica charantia OX=3673 GN=LOC111013769 PE=4 SV=1)

HSP 1 Score: 124.8 bits (312), Expect = 2.3e-25
Identity = 70/111 (63.06%), Positives = 79/111 (71.17%), Query Frame = 0

Query: 1   MADAERTTGAGSRRQPPTRLQRRAPASIQISRPSNWNVAIPLLSPLVSPSSSDQADLLMA 60
           M DAER     S  Q  TRLQRRAP+SIQISRP++WNVAIPLLSPLVSP   +Q D+LM 
Sbjct: 1   MGDAERRGAGDSVSQEATRLQRRAPSSIQISRPASWNVAIPLLSPLVSP-CLEQVDVLMG 60

Query: 61  DNKPREEARSLEQRHRRSLEMEKAPTFAKWQHPAAPFYYGPVPRATPFVPV 112
           +NK REEARS           +K  TF +W+HPAAPFYYGPV R TPFVPV
Sbjct: 61  ENKAREEARS----------RDKPATFTRWKHPAAPFYYGPVQRTTPFVPV 100

BLAST of Sgr024673 vs. ExPASy TrEMBL
Match: A0A0A0L8D3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G141830 PE=4 SV=1)

HSP 1 Score: 108.2 bits (269), Expect = 2.2e-20
Identity = 66/115 (57.39%), Positives = 74/115 (64.35%), Query Frame = 0

Query: 1   MADAERTTGAGSRRQPPTRLQRRAPASIQISRPSNWNVAIPLLSPLVSPS----SSDQAD 60
           M+DAER T   +    PTRLQ +APASI+I R  NWNVAIPLLSPLVSPS    S+ +  
Sbjct: 1   MSDAERKT---ATTVAPTRLQSQAPASIEIKRALNWNVAIPLLSPLVSPSSCGNSAPEKM 60

Query: 61  LLMADNKPREEARSLEQRHRRSLEMEKAPTFAKWQHPAAPFYYGPVPRATPFVPV 112
           L MA+N  REE + L              TF KWQHPAAPFYY PVPRA PFVPV
Sbjct: 61  LSMAENNAREETKGL--------------TFTKWQHPAAPFYYEPVPRANPFVPV 98

BLAST of Sgr024673 vs. ExPASy TrEMBL
Match: A0A2P5A713 (Uncharacterized protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_362290 PE=4 SV=1)

HSP 1 Score: 105.1 bits (261), Expect = 1.9e-19
Identity = 63/111 (56.76%), Positives = 71/111 (63.96%), Query Frame = 0

Query: 1   MADAERTTGAGSRRQPPTRLQRRAPASIQISRPSNWNVAIPLLSPLVSPSSSDQADLLMA 60
           MAD    T     R+ P+RLQRRAPAS+QIS  S+WNVAIPLLSPL SPSSS +A    A
Sbjct: 1   MADVTIPTSGACLRRQPSRLQRRAPASLQISPVSDWNVAIPLLSPLASPSSSPKALDRTA 60

Query: 61  DNKPREEARSLEQRHRRSLEMEKAPTFAKWQHPAAPFYYGPVPRATPFVPV 112
           +NK RE+ R      R   E +K   F KWQHPAAPF Y P P   PFVPV
Sbjct: 61  ENKSREDQR------RLVAEPDKPIVFKKWQHPAAPFCYEPAPLVRPFVPV 105

BLAST of Sgr024673 vs. ExPASy TrEMBL
Match: A0A803Q755 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 104.0 bits (258), Expect = 4.1e-19
Identity = 62/113 (54.87%), Positives = 72/113 (63.72%), Query Frame = 0

Query: 1   MADAERTTGAGSRRQPPTRLQRRAPASIQISRPSNWNVAIPLLSPLVSPSSSDQADLLMA 60
           MA+  R +     R+ P+RLQRRAPAS+QIS  S+WNVAIPLLSPLVSPSS         
Sbjct: 1   MAEVSRPSSVPGTRRQPSRLQRRAPASLQISPVSDWNVAIPLLSPLVSPSSPKTLVDRTV 60

Query: 61  DNKPREEARSL--EQRHRRSLEMEKAPTFAKWQHPAAPFYYGPVPRATPFVPV 112
           +NK R++   L   QR R+S E EK   F KWQHPAAPF Y   P   PFVPV
Sbjct: 61  ENKSRDDHHHLHHHQRLRQSSESEKPIVFKKWQHPAAPFCYDSTPLVRPFVPV 113

BLAST of Sgr024673 vs. ExPASy TrEMBL
Match: A0A2P5FLQ7 (Uncharacterized protein OS=Trema orientale OX=63057 GN=TorRG33x02_056140 PE=4 SV=1)

HSP 1 Score: 104.0 bits (258), Expect = 4.1e-19
Identity = 62/111 (55.86%), Positives = 71/111 (63.96%), Query Frame = 0

Query: 1   MADAERTTGAGSRRQPPTRLQRRAPASIQISRPSNWNVAIPLLSPLVSPSSSDQADLLMA 60
           MAD  R T     R+ P+RLQRRAPAS+QI   S+WNVAIPLLSPL SPSSS +A    A
Sbjct: 1   MADVTRPTSGACPRRQPSRLQRRAPASLQIRPVSDWNVAIPLLSPLASPSSSPKALDRTA 60

Query: 61  DNKPREEARSLEQRHRRSLEMEKAPTFAKWQHPAAPFYYGPVPRATPFVPV 112
           + K RE+ R      R+  E +K   F KWQHPAAPF Y P P   PFVPV
Sbjct: 61  EIKSREDQR------RQVAEPDKPIVFKKWQHPAAPFCYEPAPLVRPFVPV 105

BLAST of Sgr024673 vs. TAIR 10
Match: AT1G04330.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G23170.1); Has 74 Blast hits to 74 proteins in 6 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 74; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 69.7 bits (169), Expect = 1.7e-12
Identity = 49/107 (45.79%), Positives = 61/107 (57.01%), Query Frame = 0

Query: 4   AERTTGAGSRRQPPTRLQRRAPASIQISR-PSNWNVAIPLLSPLVSPSSSDQADLLMADN 63
           A   +G+G+R+   +RLQRRAP  ++I+   +NW VAIPLLSP  SP             
Sbjct: 5   ARNISGSGNRKS--SRLQRRAPPPLKINPCEANWKVAIPLLSPTESP-----------PQ 64

Query: 64  KPREEARSLEQRHRRSLEMEKAPTFAKWQHPAAPFYYGPVPRAT-PF 109
           KP    +  EQR  +  E EK P F KWQHPAAPFYY P P +  PF
Sbjct: 65  KPPAVMKREEQRWGK--EAEKPPVFKKWQHPAAPFYYQPAPSSNQPF 96

BLAST of Sgr024673 vs. TAIR 10
Match: AT3G23170.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G14450.1); Has 74 Blast hits to 74 proteins in 6 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 74; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 69.3 bits (168), Expect = 2.2e-12
Identity = 48/109 (44.04%), Positives = 62/109 (56.88%), Query Frame = 0

Query: 7   TTGAGSRRQPPTRLQRRAPA-SIQISRP--SNWNVAIPLLSPL-VSPSSSDQADLLMADN 66
           ++ +G  R+ P+RL +R PA  I  + P  +NWN AIPLLSPL +SP SS        D 
Sbjct: 11  SSDSGDLRRQPSRLLKRPPALKIVPATPAANNWNTAIPLLSPLALSPESSP------VDQ 70

Query: 67  KPREEARSLEQRHRRSLEMEKAPTFAKWQHPAAPFYYGPVPRATPFVPV 112
            P      +E+    ++ +EK P F KWQHPAAPFYY       PFVPV
Sbjct: 71  PP------VEKNQSTAVAVEKTPVFKKWQHPAAPFYYESSTFVPPFVPV 107

BLAST of Sgr024673 vs. TAIR 10
Match: AT4G14450.1 (unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G23170.1); Has 74 Blast hits to 74 proteins in 6 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 74; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 66.2 bits (160), Expect = 1.8e-11
Identity = 48/111 (43.24%), Positives = 62/111 (55.86%), Query Frame = 0

Query: 7   TTGAGSRRQPPTRLQRRAPA-SIQISRPSNWNVAIPLLSPLVS--PSSSDQADLLMADNK 66
           ++ A S  +  ++LQRRAP+  I+ +  SNWNVAIPLLSPL     SS DQ+ +    NK
Sbjct: 27  SSSAASNERQLSQLQRRAPSLMIKPTSFSNWNVAIPLLSPLAPSLTSSFDQSHVPPPQNK 86

Query: 67  ---PREEARSLEQRHRRSLEMEKAPTFAKWQHPAAPFYYGPVPRATPFVPV 112
              P EE            E++K P F KWQHPA+PF Y P     PF+ V
Sbjct: 87  TEIPVEE------------EVKKTPVFKKWQHPASPFCYEPTTFVPPFIQV 125

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity (%)  Description
XP_022143991.1  4.7e-25  63.06         uncharacterized protein At4g14450, chloroplastic-like [Momordica charantia]
KAE8650334.1    4.5e-20  57.39         hypothetical protein Csa_009641 [Cucumis sativus]
PON32321.1      3.8e-19  56.76         hypothetical protein PanWU01x14_362290 [Parasponia andersonii]
PON98714.1      8.6e-19  55.86         hypothetical protein TorRG33x02_056140 [Trema orientale]
XP_030506624.1  8.6e-19  54.87         uncharacterized protein At4g14450, chloroplastic [Cannabis sativa]

ExPASy Swiss-Prot
Match Name      E-value  Identity (%)  Description
Q6NN02          2.6e-10  43.24         Uncharacterized protein At4g14450, chloroplastic OS=Arabidopsis thaliana OX=3702...

ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A6J1CRZ0      2.3e-25  63.06         uncharacterized protein At4g14450, chloroplastic-like OS=Momordica charantia OX=...
A0A0A0L8D3      2.2e-20  57.39         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G141830 PE=4 SV=1
A0A2P5A713      1.9e-19  56.76         Uncharacterized protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_362290 PE...
A0A803Q755      4.1e-19  54.87         Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
A0A2P5FLQ7      4.1e-19  55.86         Uncharacterized protein OS=Trema orientale OX=63057 GN=TorRG33x02_056140 PE=4 SV...

TAIR 10
Match Name      E-value  Identity (%)  Description
AT1G04330.1     1.7e-12  45.79         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G23170.1     2.2e-12  44.04         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G14450.1     1.8e-11  43.24         unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein matc...

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                         Source       Source Term    Source Description    Alignment
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 1..28
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 61..80
None       No IPR available                        PANTHER      PTHR33912:SF2  PUTATIVE-RELATED      coord: 1..111
IPR040381  Uncharacterized protein At4g14450-like  PANTHER      PTHR33912      OS01G0939400 PROTEIN  coord: 1..111

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr024673.1   Sgr024673.1  mRNA