Sgr017215 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr017215
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Unknown protein
Location: tig00153033: 659187 .. 662808 (+)
RNA-Seq Expression: Sgr017215
Synteny: Sgr017215
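
The Location field above packs a contig name, a start and end coordinate, and a strand into one string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common genome-browser convention, but not stated on this page), the field can be split and the genomic span computed as follows. The function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'contig: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    contig, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
    return {"contig": contig, "start": start, "end": end,
            "strand": strand, "length": end - start + 1}

loc = parse_location("tig00153033: 659187 .. 662808 (+)")
print(loc["contig"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 3622 bp on the forward strand of the contig.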
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATCGACGGCAGATCGCAATTCGGTTACCGGTGCGAAACCCAGCCTTTCTGATTACGCCACGCAACCACATACCCGATGATGTGGAAATCACATTACGTGGCACGAATCTGGCGCAGTAACTGGTGCGCGTTGGTGTAAGCGTGTGACAGAGCATATCCCCCAGTCCCTCCATTACCCCCACAGGTGCGCTCGCACGCAAAGGAGCCAAACCGCCACAACCTTGAAGGTTCCGTTTAATTGAATCGGACTGCAAATCCGGCGTCGACCACCGCTCTCTCTCACAGGTTCTCTGATCGATCCCGTCGTTTGAACTTCTCTGCGATATTTGTTATTTGGCTTTTTCACGATTGATTCAAACGGGTTGAGCACAGAGTTTTAATTAATTCGCGATATGCCGATTCGTATACTCCGAATTCCAGAGTTTCGACGGTCTTTTTTATAGTTTCGGCATTCCTGAGCTGCTTTCCAGGTAAAATTTTTTCCGAAGAGATGTAAAATTATGGTTTTGATATTGCTGTTCTTGCTTTGATTTGTTTTCTATGTTAATTGCTATTTGCAATTTTTACAAAATCACGAATTAAATGTAGGTAATGTTTTCAGGGAACGTAGAATCGGCGGTTGTTACGCTTGATTCAATCGAAATATACAAGACACACGAGTGGTTGGCATCCAAACCGACAGTTTATTTTCACTGTCAAGGGGAAAACAAAACGAAATTGCCTGATGTACAGAAAGAGCATGTTTTATACAGCTTCAATGGGGAAGAATCTTGGCAGGTTTGTGGCTTTTCAAGTTGTTTTGAACAGCGTTGAAGCTTTTCCACCTTTTTCCTTCTTTTCATCTCGACTTTATTGGGTGTTGTAATTCACTTGAATGACTTAGAGATTAAATTTTACAGCCCCAAAAAATCGCTCGATGGAACAAAGTATCCTGTTTGCTTCTAATATGTGTAATGATTGATACTCTTCACACAAATTGGAAAATAACATATCATTAGGACATTCAGATATAAGTTTTCTTTGTCTCTATGTCTCATTCAGAAAGAACAAAGACCTTTTGAATTATTTGTTGCACAGAAACATATTCAAAGTAATGAGGGTGGTTATAATTTGGTGTGCATTTGTGAATGGATAAAAGGGCCATACCATCTTCTCATTGATCTCTGCTGAGGAAAGGGGAATGGCATACACAGTAATTAATCGCATTGTTTATTTTTGGTGATTTGGTTCCAATGTTGAAATGCTATCTTTGTGCTCTCAGCCAATGACTGAATTTGGAAGTAAAAAGTGTAAGCGATGTGGGTTCTTTGAGGAGGACAGGATTAAATCTGATGATGTATTTGAAGAGTGGGAATTTTGTCCGACTGATTTTACAGCTCCTGCTGGAAAATATGTACGATTCAACAAAAACGAGTTCAATGCCACTTTTGTGTGCTTGAAGTGCACAGCTATTCCGATGGTGAGTAGCGATTTGCTATAGTTTATGATTCTGCATTTAGTTGAACTAATACGGCCTTCATACAGCCAATTAAGTATATTTCTACTGTTTATGATTTGAGCCTTTTGCGAGTTTTAAAATACTAAAGAAATTAGTCGAGGTAGCCAACCTAAGATTTAATATCCTACGAGTTTCCTTGGCAACCAAATGTAGTAAGGTCAGGTGATTGTTTCGTGAGATTAGTTGTGGTGTGCGTAAGCTAGCTCGGACACTCATGGATATAAAAAATATATATAGTTCTCTTCTACATGTGAATGATTATATGGCCTTCATCTACTAATGCTGTTTCCTTGGACATTTGCATACTTTTGAGTTTTAACATTTAAATTGCACGTGAAGAAAAATGAAATCATCTTACTCAGTTCACTCCCTGGGATATTTTAATTATTTATCATGGGAACTTTTGAGTTCGAAGAAAAACAGGATTTGTTAACATAAATGGAAGCTCTGTTGTTCTTAGAATCATTGAAGTTCTGCATCTTTTTTTTCCCTTCAAATTT
CGCCTCTTCTAGGCCGATGAATGTAATTATTTAAACCTTAATGCACACCTTCTGAGGGAACCCTTTTTAGCATACATTTTTTCTGGATTTTTTGGGAAATTTTAAATCCTTAAATGAGGATTCTAAATGAGGTATTAGCACTAGGTAGATAATTGATGAAACAAAAAAGCTCCCCCAAGAACGTTTAGAGTTTTAAGAACATAATGAATAGCCTAATCTATGCACTTGCACTCATGCTTCCTGCTTCTTAATTATCTCGGATAAATCTGTGACCATGACTCTGTTGACTTGTCCATGTGTTCTGGAAGTGTTGCAATTTGCATTTAAGATTTTGTCTAAAACTGAAACTTGCAGGTTCCAGTTCAAGTTCACATTCGCCTGGTGGAGGAAAGGGATTGCATGTTGCTGTAGTCATAGTTATAATGTCTGATGGTAGTTGTACATTGAAGACGGGAGCAAGATCAGGCTAGATTTCTGAAGCTGTTTGAAGATGGGGATGACATTGAGGATGAATTGGGCCTTGGCGATGTAATATGAGATGAGTTGGTGTGCTTCAAGAAAGAGAAATATTAATGTACAGCGATACGATTTTGCATAATACATTTACCTTTGCGGTTCGAATAGAAATAGTATGAAATTTCTGCTTCATTCATTGATCACAGGGAGAACATCGTGGAAAATGCTTAGAGGAGGGGGAGGGAGGGAAAATGCTTAGATGTCTGAATGATTTTTGTTTATTTCTTTAGTGTAACAAACAAGGGGAAAGAGAGAGAGAACCAATTAGATGTCTGAATGATTTTTGTTTATTTCTTTGGTGTAACAAAGAAAGGGAAATTCTTTTTATTGCTTTTGGTCGAAACATTTGAAGAGAAAATAAACAACTCTGATAGTCTGATTACATCACATCCCAGCAAATTGCTGTCTGATGGAAAGTAAAGAACGATTAGTCTAAATTCACTGACGCTCATCGGTGCCATGAGGAATCTGAGATACAAGCAAGGTCAGTACCCCAGGATGGCATCAAGGAGTGCCATAGCATCAGAACTGCAGTCATGTTCAAGTTTCAACTCTTGCCCACTCCAGTCATGACCATGCTCATCTTCTCCAAGTCCAAACCAAGATTGAAACCAATCACAAGCACATTCATAGCCCAAATTTCTAGAATCTCCATTATAATCCTCATTGTGGAGAATGTTTTCTGAATTCTCCATTCTGCAGCCATTTTGATTGCCGCAAGGGTCTGGGCAATTGAAGAAGAGATCTCTGCAAATTTCACTGTAAGAAGTGGAATCGAGCTCATTCAGCAAATACCCAAAACTATGGTAAGTCTGCTCACCCTCAGAATTGTCTCCAATTGGGTCTCCATCAGACTGGTGTCTGCTTGGGAAACTGATCTCTGGGGTCCAAAATGGACGAGAATGATGTTCTGAAGAGAGTTCCTCATTGAAAGGAAAAGAGGGGTTCAGACCAGGGTAGTTGTTGTAGCTGTTTCCATATGGCATGAGCAGGCCAATGGAGTCAGAGCCACTGTAACTCATAGCAGATATTCAAGCTCTAAATCTGTTGTTTACAACGTACTGGATGTAAGGGAATTACAGGCCACAGTACTATGGGTGTCCTAG

mRNA sequence

ATGATCGACGGCAGATCGCAATTCGGTTACCGAGCATATCCCCCAGTCCCTCCATTACCCCCACAGGTGCGCTCGCACGCAAAGGAGCCAAACCGCCACAACCTTGAAGGGAACGTAGAATCGGCGGTTGTTACGCTTGATTCAATCGAAATATACAAGACACACGAGTGGTTGGCATCCAAACCGACAGTTTATTTTCACTGTCAAGGGGAAAACAAAACGAAATTGCCTGATGTACAGAAAGAGCATGTTTTATACAGCTTCAATGGGGAAGAATCTTGGCAGCCAATGACTGAATTTGGAAGTAAAAAGTGTAAGCGATGTGGGTTCTTTGAGGAGGACAGGATTAAATCTGATGATGTATTTGAAGAGTGGGAATTTTGTCCGACTGATTTTACAGCTCCTGCTGGAAAATATGTACGATTCAACAAAAACGAGTTCAATGCCACTTTTGTGTGCTTGAAGTGCACAGCTATTCCGATGGTTCCAGTTCAAGTTCACATTCGCCTGGTGGAGGAAAGGGATTGCATGTTGCTACGGGAGCAAGATCAGGCTAGATTTCTGAAGCTGTTTGAAGATGGGGATGACATTGAGGATGAATTGGGCCTTGGCGATGGAGAACATCGTGGAAAATGCTTAGAGGAGGGGGAGGGAGGGAAAATGCTTAGATCCATTTTGATTGCCGCAAGGGTCTGGGCAATTGAAGAAGAGATCTCTGCAAATTTCACTGTAAGAAGTGGAATCGAGCTCATTCAGCAAATACCCAAAACTATGACCAGGGTAGTTGTTGTAGCTGTTTCCATATGGCATGAGCAGGCCAATGGAGTCAGAGCCACTGTAACTCATAGCAGATATTCAAGCTCTAAATCTGTTGTTTACAACGTACTGGATGTAAGGGAATTACAGGCCACAGTACTATGGGTGTCCTAG

Coding sequence (CDS)

ATGATCGACGGCAGATCGCAATTCGGTTACCGAGCATATCCCCCAGTCCCTCCATTACCCCCACAGGTGCGCTCGCACGCAAAGGAGCCAAACCGCCACAACCTTGAAGGGAACGTAGAATCGGCGGTTGTTACGCTTGATTCAATCGAAATATACAAGACACACGAGTGGTTGGCATCCAAACCGACAGTTTATTTTCACTGTCAAGGGGAAAACAAAACGAAATTGCCTGATGTACAGAAAGAGCATGTTTTATACAGCTTCAATGGGGAAGAATCTTGGCAGCCAATGACTGAATTTGGAAGTAAAAAGTGTAAGCGATGTGGGTTCTTTGAGGAGGACAGGATTAAATCTGATGATGTATTTGAAGAGTGGGAATTTTGTCCGACTGATTTTACAGCTCCTGCTGGAAAATATGTACGATTCAACAAAAACGAGTTCAATGCCACTTTTGTGTGCTTGAAGTGCACAGCTATTCCGATGGTTCCAGTTCAAGTTCACATTCGCCTGGTGGAGGAAAGGGATTGCATGTTGCTACGGGAGCAAGATCAGGCTAGATTTCTGAAGCTGTTTGAAGATGGGGATGACATTGAGGATGAATTGGGCCTTGGCGATGGAGAACATCGTGGAAAATGCTTAGAGGAGGGGGAGGGAGGGAAAATGCTTAGATCCATTTTGATTGCCGCAAGGGTCTGGGCAATTGAAGAAGAGATCTCTGCAAATTTCACTGTAAGAAGTGGAATCGAGCTCATTCAGCAAATACCCAAAACTATGACCAGGGTAGTTGTTGTAGCTGTTTCCATATGGCATGAGCAGGCCAATGGAGTCAGAGCCACTGTAACTCATAGCAGATATTCAAGCTCTAAATCTGTTGTTTACAACGTACTGGATGTAAGGGAATTACAGGCCACAGTACTATGGGTGTCCTAG

Protein sequence

MIDGRSQFGYRAYPPVPPLPPQVRSHAKEPNRHNLEGNVESAVVTLDSIEIYKTHEWLASKPTVYFHCQGENKTKLPDVQKEHVLYSFNGEESWQPMTEFGSKKCKRCGFFEEDRIKSDDVFEEWEFCPTDFTAPAGKYVRFNKNEFNATFVCLKCTAIPMVPVQVHIRLVEERDCMLLREQDQARFLKLFEDGDDIEDELGLGDGEHRGKCLEEGEGGKMLRSILIAARVWAIEEEISANFTVRSGIELIQQIPKTMTRVVVVAVSIWHEQANGVRATVTHSRYSSSKSVVYNVLDVRELQATVLWVS
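
The protein sequence above should be the frame-0 translation of the CDS. The sketch below checks this on two short fragments copied from the start and the end of the CDS, assuming the standard nuclear genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are illustrative only; a library such as Biopython could be used instead.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1); codons are ordered
# TTT, TTC, TTA, TTG, TCT, ... to line up with the amino-acid string.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

cds_start = "ATGATCGACGGCAGATCGCAATTCGGTTAC"   # first 30 nt of the CDS
cds_end = "ACAGTACTATGGGTGTCCTAG"              # last 21 nt of the CDS

print(translate(cds_start))  # compare with the first residues of the protein
print(translate(cds_end))    # compare with the last residues plus the stop
```

The first fragment gives `MIDGRSQFGY` and the second gives `TVLWVS*`, matching the start and end of the protein sequence listed above.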
Homology
BLAST of Sgr017215 vs. NCBI nr
Match: XP_022143802.1 (uncharacterized protein LOC111013628 [Momordica charantia])

HSP 1 Score: 271.2 bits (692), Expect = 1.1e-68
Identity = 135/197 (68.53%), Positives = 147/197 (74.62%), Query Frame = 0

Query: 37  GNVESAVVTLDSIEIYKTHEWLASKPTVYFHCQGENKTKLPDVQKEHVLYSFNGEESWQP 96
           G  ESAVVTLDS+ IYKTHEWLASKPTVYF CQG NKTKLPDVQKEHVLYSFNGEESWQP
Sbjct: 27  GGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQP 86

Query: 97  MTEFGSKKCKRCGFFEEDRIKSDDVFEEWEFCPTDFTAPAGKYVRFNKNEFNATFVCLKC 156
           +TEF SKKCKRCGF+EED IKSDDVFEEWEFCP+DFTAPAGKYVRFN+ EFNATF+CL+C
Sbjct: 87  LTEFESKKCKRCGFYEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQC 146

Query: 157 TAIPMVPVQ----------VHIRLVEERDCML------------------LREQDQARFL 206
           TA   V             +H+  +     ++                   REQDQARFL
Sbjct: 147 TAYSNVSSSSAPTHNSEQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKKRREQDQARFL 206

BLAST of Sgr017215 vs. NCBI nr
Match: XP_038883476.1 (uncharacterized protein LOC120074430 isoform X1 [Benincasa hispida])

HSP 1 Score: 267.7 bits (683), Expect = 1.2e-67
Identity = 137/197 (69.54%), Positives = 147/197 (74.62%), Query Frame = 0

Query: 37  GNVESAVVTLDSIEIYKTHEWLASKPTVYFHCQGENKTKLPDVQKEHVLYSFNGEESWQP 96
           G VESAVVTLDSI IYKTHEWLAS+PTVYF CQG NKTKLPDVQKEHVLYSFNGEESWQP
Sbjct: 27  GGVESAVVTLDSIVIYKTHEWLASEPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQP 86

Query: 97  MTEFGSKKCKRCGFFEEDRIKSDDVFEEWEFCPTDFTAPAGKYVRFNKNEFNATFVCLKC 156
           +TEF SKKCKRCGF+EED IKSDDVFEEWEFCP+DFTAPAGKYVRFN  EFNATF+CL+C
Sbjct: 87  LTEFESKKCKRCGFYEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNAEEFNATFLCLQC 146

Query: 157 TAIPMV----------------PVQVHIRLVEERDCML------------LREQDQARFL 206
           TA   V                 V + I +V     +L             REQDQARFL
Sbjct: 147 TAYSNVTSSSFPSYDGEKGMHSAVIIVISVVASTVLILGMVVGYKYWQQKRREQDQARFL 206

BLAST of Sgr017215 vs. NCBI nr
Match: KAA0047322.1 (putative transmembrane protein [Cucumis melo var. makuwa])

HSP 1 Score: 266.2 bits (679), Expect = 3.6e-67
Identity = 133/186 (71.51%), Positives = 144/186 (77.42%), Query Frame = 0

Query: 37  GNVESAVVTLDSIEIYKTHEWLASKPTVYFHCQGENKTKLPDVQKEHVLYSFNGEESWQP 96
           G  ESAVVTLDSI IYKTHEWLA+KPTVYFHC G NKT LPDVQKEHVLYSFNGEESWQP
Sbjct: 27  GGAESAVVTLDSIVIYKTHEWLAAKPTVYFHCLGGNKTTLPDVQKEHVLYSFNGEESWQP 86

Query: 97  MTEFGSKKCKRCGFFEEDRIKSDDVFEEWEFCPTDFTAPAGKYVRFNKNEFNATFVCLKC 156
           +TEF SKKCKRCGF+EED IKSDDVFEEWEFCP+DFT+PAGKYVRFN  EFNATF+CL+C
Sbjct: 87  LTEFKSKKCKRCGFYEEDSIKSDDVFEEWEFCPSDFTSPAGKYVRFNPKEFNATFLCLQC 146

Query: 157 TAIP-----MVPVQVHIRLVEERDCML------------LREQDQARFLKLFEDGDDIED 206
           TA          V + I +V     +L             R+QDQARFLKLFEDGDDIED
Sbjct: 147 TAYSNEKGMHAAVIIVISIVASIVLILGMVVGYKYWQKKRRQQDQARFLKLFEDGDDIED 206

BLAST of Sgr017215 vs. NCBI nr
Match: XP_022963068.1 (uncharacterized protein LOC111463378 isoform X2 [Cucurbita moschata])

HSP 1 Score: 265.0 bits (676), Expect = 8.1e-67
Identity = 130/166 (78.31%), Positives = 140/166 (84.34%), Query Frame = 0

Query: 40  ESAVVTLDSIEIYKTHEWLASKPTVYFHCQGENKTKLPDVQKEHVLYSFNGEESWQPMTE 99
           ESAVVTLDSI IYKTHEWLASKPTVYF CQG NKTKLPDVQKEHVLYSFNGEESWQP+TE
Sbjct: 30  ESAVVTLDSIVIYKTHEWLASKPTVYFKCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTE 89

Query: 100 FGSKKCKRCGFFEEDRIKSDDVFEEWEFCPTDFTAPAGKYVRFNKNEFNATFVCLKCTAI 159
           F SKKCKRCGF+EED IKSDDVFEEWE CP+DFTAP G+YVR+NK EFNATF+CL+CTA 
Sbjct: 90  FKSKKCKRCGFYEEDSIKSDDVFEEWELCPSDFTAPDGEYVRYNKKEFNATFLCLECTAY 149

Query: 160 PMVPVQVHIRLVEERDCMLLREQDQARFLKLFEDGDDIEDELGLGD 206
                  +    ++R     REQDQARFLKLFEDGDDIEDELGL D
Sbjct: 150 S----NGYKYWQKKR-----REQDQARFLKLFEDGDDIEDELGLND 186

BLAST of Sgr017215 vs. NCBI nr
Match: XP_004142096.1 (uncharacterized protein LOC101220441 [Cucumis sativus] >KGN54203.1 hypothetical protein Csa_018036 [Cucumis sativus])

HSP 1 Score: 262.7 bits (670), Expect = 4.0e-66
Identity = 133/198 (67.17%), Positives = 145/198 (73.23%), Query Frame = 0

Query: 40  ESAVVTLDSIEIYKTHEWLASKPTVYFHCQGENKTKLPDVQKEHVLYSFNGEESWQPMTE 99
           ESAVVTLDSI IYKTHEWLA+KPTVYFHCQG N+T LPDVQKEHVLYSFNGEESWQP+TE
Sbjct: 30  ESAVVTLDSIVIYKTHEWLAAKPTVYFHCQGGNRTTLPDVQKEHVLYSFNGEESWQPLTE 89

Query: 100 FGSKKCKRCGFFEEDRIKSDDVFEEWEFCPTDFTAPAGKYVRFNKNEFNATFVCLKCTAI 159
           F SKKCKRCGF+EED IKSDDVFEEWEFCP+DFTAPAGKYVRFN  EFNATF+CLKCTA 
Sbjct: 90  FKSKKCKRCGFYEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNPKEFNATFLCLKCTAY 149

Query: 160 PMV--------------------PVQVHIRLVEERDCML------------LREQDQARF 206
             V                     + + I +V     ++             R+QDQARF
Sbjct: 150 SNVTSTSSSTSSITDGGEKGMQSAIIIVISIVASVVLIIGMVVGYKYWQKKRRQQDQARF 209

BLAST of Sgr017215 vs. ExPASy TrEMBL
Match: A0A6J1CRX4 (uncharacterized protein LOC111013628 OS=Momordica charantia OX=3673 GN=LOC111013628 PE=4 SV=1)

HSP 1 Score: 271.2 bits (692), Expect = 5.5e-69
Identity = 135/197 (68.53%), Positives = 147/197 (74.62%), Query Frame = 0

Query: 37  GNVESAVVTLDSIEIYKTHEWLASKPTVYFHCQGENKTKLPDVQKEHVLYSFNGEESWQP 96
           G  ESAVVTLDS+ IYKTHEWLASKPTVYF CQG NKTKLPDVQKEHVLYSFNGEESWQP
Sbjct: 27  GGAESAVVTLDSVMIYKTHEWLASKPTVYFQCQGGNKTKLPDVQKEHVLYSFNGEESWQP 86

Query: 97  MTEFGSKKCKRCGFFEEDRIKSDDVFEEWEFCPTDFTAPAGKYVRFNKNEFNATFVCLKC 156
           +TEF SKKCKRCGF+EED IKSDDVFEEWEFCP+DFTAPAGKYVRFN+ EFNATF+CL+C
Sbjct: 87  LTEFESKKCKRCGFYEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNEKEFNATFMCLQC 146

Query: 157 TAIPMVPVQ----------VHIRLVEERDCML------------------LREQDQARFL 206
           TA   V             +H+  +     ++                   REQDQARFL
Sbjct: 147 TAYSNVSSSSAPTHNSEQGMHVAAIIVISALVSTVLIIGIVVGYKYWQKKRREQDQARFL 206

BLAST of Sgr017215 vs. ExPASy TrEMBL
Match: A0A5A7TV39 (Putative transmembrane protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold908G001430 PE=4 SV=1)

HSP 1 Score: 266.2 bits (679), Expect = 1.8e-67
Identity = 133/186 (71.51%), Positives = 144/186 (77.42%), Query Frame = 0

Query: 37  GNVESAVVTLDSIEIYKTHEWLASKPTVYFHCQGENKTKLPDVQKEHVLYSFNGEESWQP 96
           G  ESAVVTLDSI IYKTHEWLA+KPTVYFHC G NKT LPDVQKEHVLYSFNGEESWQP
Sbjct: 27  GGAESAVVTLDSIVIYKTHEWLAAKPTVYFHCLGGNKTTLPDVQKEHVLYSFNGEESWQP 86

Query: 97  MTEFGSKKCKRCGFFEEDRIKSDDVFEEWEFCPTDFTAPAGKYVRFNKNEFNATFVCLKC 156
           +TEF SKKCKRCGF+EED IKSDDVFEEWEFCP+DFT+PAGKYVRFN  EFNATF+CL+C
Sbjct: 87  LTEFKSKKCKRCGFYEEDSIKSDDVFEEWEFCPSDFTSPAGKYVRFNPKEFNATFLCLQC 146

Query: 157 TAIP-----MVPVQVHIRLVEERDCML------------LREQDQARFLKLFEDGDDIED 206
           TA          V + I +V     +L             R+QDQARFLKLFEDGDDIED
Sbjct: 147 TAYSNEKGMHAAVIIVISIVASIVLILGMVVGYKYWQKKRRQQDQARFLKLFEDGDDIED 206

BLAST of Sgr017215 vs. ExPASy TrEMBL
Match: A0A6J1HGN3 (uncharacterized protein LOC111463378 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111463378 PE=4 SV=1)

HSP 1 Score: 265.0 bits (676), Expect = 3.9e-67
Identity = 130/166 (78.31%), Positives = 140/166 (84.34%), Query Frame = 0

Query: 40  ESAVVTLDSIEIYKTHEWLASKPTVYFHCQGENKTKLPDVQKEHVLYSFNGEESWQPMTE 99
           ESAVVTLDSI IYKTHEWLASKPTVYF CQG NKTKLPDVQKEHVLYSFNGEESWQP+TE
Sbjct: 30  ESAVVTLDSIVIYKTHEWLASKPTVYFKCQGGNKTKLPDVQKEHVLYSFNGEESWQPLTE 89

Query: 100 FGSKKCKRCGFFEEDRIKSDDVFEEWEFCPTDFTAPAGKYVRFNKNEFNATFVCLKCTAI 159
           F SKKCKRCGF+EED IKSDDVFEEWE CP+DFTAP G+YVR+NK EFNATF+CL+CTA 
Sbjct: 90  FKSKKCKRCGFYEEDSIKSDDVFEEWELCPSDFTAPDGEYVRYNKKEFNATFLCLECTAY 149

Query: 160 PMVPVQVHIRLVEERDCMLLREQDQARFLKLFEDGDDIEDELGLGD 206
                  +    ++R     REQDQARFLKLFEDGDDIEDELGL D
Sbjct: 150 S----NGYKYWQKKR-----REQDQARFLKLFEDGDDIEDELGLND 186

BLAST of Sgr017215 vs. ExPASy TrEMBL
Match: A0A0A0KXD0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G293130 PE=4 SV=1)

HSP 1 Score: 262.7 bits (670), Expect = 1.9e-66
Identity = 133/198 (67.17%), Positives = 145/198 (73.23%), Query Frame = 0

Query: 40  ESAVVTLDSIEIYKTHEWLASKPTVYFHCQGENKTKLPDVQKEHVLYSFNGEESWQPMTE 99
           ESAVVTLDSI IYKTHEWLA+KPTVYFHCQG N+T LPDVQKEHVLYSFNGEESWQP+TE
Sbjct: 30  ESAVVTLDSIVIYKTHEWLAAKPTVYFHCQGGNRTTLPDVQKEHVLYSFNGEESWQPLTE 89

Query: 100 FGSKKCKRCGFFEEDRIKSDDVFEEWEFCPTDFTAPAGKYVRFNKNEFNATFVCLKCTAI 159
           F SKKCKRCGF+EED IKSDDVFEEWEFCP+DFTAPAGKYVRFN  EFNATF+CLKCTA 
Sbjct: 90  FKSKKCKRCGFYEEDSIKSDDVFEEWEFCPSDFTAPAGKYVRFNPKEFNATFLCLKCTAY 149

Query: 160 PMV--------------------PVQVHIRLVEERDCML------------LREQDQARF 206
             V                     + + I +V     ++             R+QDQARF
Sbjct: 150 SNVTSTSSSTSSITDGGEKGMQSAIIIVISIVASVVLIIGMVVGYKYWQKKRRQQDQARF 209

BLAST of Sgr017215 vs. ExPASy TrEMBL
Match: A0A6J1I9J5 (uncharacterized protein LOC111471262 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111471262 PE=4 SV=1)

HSP 1 Score: 262.3 bits (669), Expect = 2.5e-66
Identity = 129/166 (77.71%), Positives = 139/166 (83.73%), Query Frame = 0

Query: 40  ESAVVTLDSIEIYKTHEWLASKPTVYFHCQGENKTKLPDVQKEHVLYSFNGEESWQPMTE 99
           ESAVVTLDSI IYKTHEWLASKPTVYF CQG NKTKLPDVQK HVLYSFNGEESWQP+TE
Sbjct: 30  ESAVVTLDSIVIYKTHEWLASKPTVYFKCQGGNKTKLPDVQKVHVLYSFNGEESWQPLTE 89

Query: 100 FGSKKCKRCGFFEEDRIKSDDVFEEWEFCPTDFTAPAGKYVRFNKNEFNATFVCLKCTAI 159
           F SKKCKRCGF+EED IKSDDVFEEWE CP+DFTAP G+YVR+NK EFNATF+CL+CTA 
Sbjct: 90  FKSKKCKRCGFYEEDSIKSDDVFEEWELCPSDFTAPDGEYVRYNKKEFNATFLCLECTAY 149

Query: 160 PMVPVQVHIRLVEERDCMLLREQDQARFLKLFEDGDDIEDELGLGD 206
                  +    ++R     REQDQARFLKLFEDGDDIEDELGL D
Sbjct: 150 S----NGYKYWQKKR-----REQDQARFLKLFEDGDDIEDELGLND 186

BLAST of Sgr017215 vs. TAIR 10
Match: AT3G53490.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G02720.1); Has 70 Blast hits to 70 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 70; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 191.8 bits (486), Expect = 8.1e-49
Identity = 99/200 (49.50%), Positives = 129/200 (64.50%), Query Frame = 0

Query: 34  NLEGNVESAVVTLDSIEIYKTHEWLASKPTVYFHCQGENKTKLPDVQKEHVLYSFNGEES 93
           +L G + +  VTLDS++I+ TH+W ++KPTV+F C+GENKT LPDV++ +V YSFNGEES
Sbjct: 24  SLPGTILTQEVTLDSVQIFTTHDWFSTKPTVFFQCKGENKTVLPDVKRTNVSYSFNGEES 83

Query: 94  WQPMTEFGSKKCKRCGFFEEDRIKSDDVFEEWEFCPTDFTAPAGKYVRFNKNEFNATFVC 153
           WQP+TE    KCKRCG +E+D +K  D F+EWE CP+DFTA  G Y RF + EFNATF+C
Sbjct: 84  WQPLTELQGTKCKRCGIYEDDPLKY-DTFDEWELCPSDFTA-EGSYKRFKEKEFNATFLC 143

Query: 154 LKCTAI--------------------PMVPVQVHIRLVEERDCMLL----------REQD 204
             C+ +                    P + V + + L+      LL          R+Q+
Sbjct: 144 HGCSQVGAGSNKESGTEKEEQKGGMHPGIVVLIVVLLLGVVAVGLLVGYKYWRKKKRQQE 203

BLAST of Sgr017215 vs. TAIR 10
Match: AT5G02720.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G53490.1); Has 47 Blast hits to 47 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 47; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 84.3 bits (207), Expect = 1.8e-16
Identity = 51/135 (37.78%), Positives = 71/135 (52.59%), Query Frame = 0

Query: 97  MTEFGSKKCKRCGFFEEDRIKSDDVFEEWEFCPTDFTAPAGKYVRFNKNEFNATFVCLKC 156
           MT  G +KCKRCG +E+  + SD  F+ WE CPTDF+A +  Y+ F + E NATFVC  C
Sbjct: 1   MTGIGGEKCKRCGIYEQGSLVSDKEFDVWEVCPTDFSA-SQVYMHFKEKEINATFVCHGC 60

Query: 157 ----TAIPMVPVQVH--------IRLVEERDCMLL----------------REQDQARFL 204
               +A+     Q          I ++    C  L                +++DQARF+
Sbjct: 61  AKFHSAVAASSPQEEGYNGLTFMIAIIAGVLCTTLVVVGGVFMFKHTQRMKKQRDQARFM 120
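
Each HSP block above reports identities and positives as a count over the alignment length, followed by a rounded percentage. As a hedged sketch, the statistics line can be parsed and the percentages recomputed as follows. The function name `parse_hsp_stats` is hypothetical, and the pattern accepts both "Positives" and the variant spelling "Postives" that some report templates emit.

```python
import re

# Matches e.g. "Identity = 135/197 (68.53%), Positives = 147/197 (74.62%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Extract counts from an HSP statistics line and recompute percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(length),
        # Percentage = 100 * matches / alignment length, rounded to 2 places.
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positive_pct": round(100 * int(pos) / int(pos_length), 2),
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }

stats = parse_hsp_stats(
    "Identity = 135/197 (68.53%), Positives = 147/197 (74.62%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positive_pct"])
```

For the Momordica charantia hit, 135 identities over 197 aligned columns reproduces the reported 68.53%, and 147 positives reproduces 74.62%.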

The following BLAST results are available for this feature:
NCBI nr
Match Name        E-value   Identity (%)   Description
XP_022143802.1    1.1e-68   68.53          uncharacterized protein LOC111013628 [Momordica charantia]
XP_038883476.1    1.2e-67   69.54          uncharacterized protein LOC120074430 isoform X1 [Benincasa hispida]
KAA0047322.1      3.6e-67   71.51          putative transmembrane protein [Cucumis melo var. makuwa]
XP_022963068.1    8.1e-67   78.31          uncharacterized protein LOC111463378 isoform X2 [Cucurbita moschata]
XP_004142096.1    4.0e-66   67.17          uncharacterized protein LOC101220441 [Cucumis sativus] >KGN54203.1 hypothetical ...

ExPASy TrEMBL
Match Name        E-value   Identity (%)   Description
A0A6J1CRX4        5.5e-69   68.53          uncharacterized protein LOC111013628 OS=Momordica charantia OX=3673 GN=LOC111013...
A0A5A7TV39        1.8e-67   71.51          Putative transmembrane protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_s...
A0A6J1HGN3        3.9e-67   78.31          uncharacterized protein LOC111463378 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A0A0KXD0        1.9e-66   67.17          Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G293130 PE=4 SV=1
A0A6J1I9J5        2.5e-66   77.71          uncharacterized protein LOC111471262 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...

TAIR 10
Match Name        E-value   Identity (%)   Description
AT3G53490.1       8.1e-49   49.50          unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G02720.1       1.8e-16   37.78          unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description    Source        Source Term     Source Description    Alignment
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction   coord: 13..32
None       No IPR available   PANTHER       PTHR33780:SF3   EXPRESSED PROTEIN     coord: 35..160, coord: 180..205
None       No IPR available   PANTHER       PTHR33780       EXPRESSED PROTEIN     coord: 35..160, coord: 180..205

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name   Type
Sgr017215.1    Sgr017215.1   mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
cellular_component   GO:0016020       membrane