Sgr022093 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr022093
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Transmembrane protein
Location: tig00153874: 1101685 .. 1105554 (+)
RNA-Seq Expression: Sgr022093
Synteny: Sgr022093
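
The Location field packs the contig name, start, end, and strand into one string. A minimal Python sketch for splitting it follows; `parse_location` is a hypothetical helper name, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re

# Pattern for strings such as "tig00153874: 1101685 .. 1105554 (+)".
LOCATION_PATTERN = re.compile(
    r"\s*([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
)


def parse_location(text):
    """Split a location string into contig, start, end, strand, and length."""
    match = LOCATION_PATTERN.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "contig": contig,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: 1-based inclusive coordinates, so both ends count.
        "length": end - start + 1,
    }


location = parse_location("tig00153874: 1101685 .. 1105554 (+)")
print(location["contig"], location["strand"], location["length"])
```

Under the inclusive-coordinate assumption, the gene spans 3870 bases on the plus strand of the contig.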
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGTTTCTCCATGAACGGCGACCCATCAAGCTCCATGACTACCCGATCCCACTACCACACCCACAAGATCTTCCTCTACTGCAACTACATCCTCCTCGGTGCCGCCTCCAGCTGCATCTTCCTGACGCTCTCCCTCCGCCTGGTCCCCTCCCTGTGCGGCGTCTCCATCGTCTTCCTCCACATCCTCACCATCGCCAGCGCCGTCTCGGGGTGTGCCATGGTGGCCTCCGCCGGCGCCACCCGGTGGTTCGGAGTGCACATGGTGTTCACCGTCCTCACCGCTATCTTCCAAGGGTCGGTGGCGGTGCTGATCTTCACGAGAACGGGGGAGTTTCTGGGGCAGCTGAAGTCGTACGTGAGGGAGGAGGATGGGGCGGTGATAGTGAAGTTGGCGGGGGGGCTGAGCGTGGTGATGTTCTGGTTGGAGTGGGTGGTGCTCACGCTGGCTTTTTTCTTGAGGTATTATGCGTATGTTGAAGGAGAGGGAGTGAATAATGGGGTGGCGATGAGGAGTGCGAAAGTGCAGCAGGATGAGGATTTGAAGGACTGGCCATGGCCATTCCAAGTTTGATCTGATCAAGAAGGAAGGGAATCATTTTGGGTGTTTTCTGTTTTTTGGATTATTGTGATTCGTTCGTTCATTTGGTTGATTCTTTTTTTCTCGGATCTGGGTTTTTTTTTTTTTTGTTGGTATTATATATATGTTTATGTGTACTAGTATACTTATCGGGAAATTTCTTAAGTGAAATGTTGAATTTACGGTTACGGGTAAATGGTGCTGGCCCGCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTGTGTTTGAACTCTCCTCTCTCTTTCTATCTTTTTTCTATCTTCTTCGCGGTTAAATTTATGAGAAGTTTTTTTTTTTTTTTTTTAAAAAATTGTATATATACATACATTGAAACAGTCATATTTTTTTGTCGGTGAAGCAAACAATAAAATTCCAACTACAAAAACACTCAAAATCATAACAAATAAATGTCGAAACCCAATGCTTTTTTCTTTTTAAAAAAAAAATGTCCAAACCCAATCACAAAAATTTCCAAACTTAACCACAAAATCGAGCATAGCATATAAAAAACACTCAAACTTAACAACAAAATGCCTAAATTCAATTAATTAAAAATATCCAAATATCAAAACATACAAAAACTATCTCATAGCTTATGAAAACTTAAACTAATTAAGGGTGTAGGGATTTAGCAACACTTATTTCAAGTTCAAATTTTAGTTTGAGGTTAAAATTGACTCACACCCTCAATCTTTCAAAATTTTTGATGTACATTTTTTTTATAAAGAAAAAAAATTGTGGAATTCTCTATGAACCGGGAATTGAATAAATCACAAGGATGTTTCCATTGGTGATATATCAATGAAAATGCCAAAAAAAAAAAAGAAAATAAATACTAATGGAACTTATATTAGTAAAGAAGTCATAGGTTCAAATTTTTATCTTATTTGTGATGTGATATTTTCAAAAAAATAATTACTAATATGAGTAGCAACGAGTTTTATTTATTTAATTTTTTTTATATAAAAAGTCGCAAGGAGTTTTAAGTTTATTAATACAAGTGGGAAAAAATTGCTTATATAAGTAGAGGAAGCTCTAATATAATGTAGTTTTAACTTATCGTCCTCATAGTTGATATTAGTGGCATTGGTCTAATTTATAAGTTGATATTTACTCTAAAAAATTAAAATAATAATAAGTGAACTCCAGTAGATAAGTTTTATTCATAAGAGTGTACAACGTTTATTCATTCATTCAAATATTTTTATGAATTTTTTAATTCTCATAATATTGGTAGAAATTTAAAGGAAAAATTAGACCACCTTAGAGAAGAAAGTTTCCAATTTCTCCTTCAATCTTTTAAAAAATTGACAAAAATTTCAACTTTTCTTTTTTTAGATGGCCTAAATGCTTCTCTAAATTTGTGTCAACTTATTAAGTATCAATGTCATGTAAACAGT
GGTAGATTTAGAACTTAATGTCAGAGGGGTCACAATCTCGTATAAATATAATTTTAGATTTATTAATAAATATCATCTAGATTAAGTAATATTGATCTAAATTATAATAAATCATTATCAATAATATTAGTTTGTTTAGTATAAATACTAAAAAATATGACTTTTTATATATATAATTTGATATTTATTTGTTCAAAAGAAAATATGACTAAATTATATATATATATATATATATATATATATATATATAATTTAAATTTATTAAAGAAAATATGATTTACAGGGGTAAACTTACTCGAGATAAAAGTTTAAATTATTTATTTTAAAAGGAAAAAAATATGAATTTCTATATAAAAAAATAAAAAATACTTCTTTGGCAAAAATATACTAAAATTAGAAAATTTGGAACCAGAAGCCTCGAATTACTTACTTTAAAAGGGAAAAAGATGAGTTTTTATATAATTTTTATTTTAAAAAAATAATTTTCTTTTGCTAAAATATACTACAAAATAGTAGACAATCTGCGAACTAGTTTTAGCTTCGAACCTAAAAACTGGTGTTTCACACCTCAAGACTACAGCCAACTAGGGTGCACATCGGTTCGGTTTCTTTGGTTTTGGGCCAAACCGAGGACCAAATAGATGTGCTCGGTTTTTGCAAATGAAGAACCGACCAATGTCCAATAAGGACAAGAACCGACCGATTTGGTTCTTCTCGATTCAGTTCTTCTTGGTCCGTTTCTTCTTGATTTTGGGCCTTGTTGGGCATTTTCCTATTTTGTGACTTTGGCCCACTTTTTCATTTTAAATTTCAGATTTTCATTTTATTTTTGCCCACTTTTTCATTTAGAATTTCATATTATAATTTTAAATCCCGAAATGCATATTCCATAAAAAATTTTAAATATCACTCGTTCAATTTTTCAGTTTTTTTATTCTCACTAACCACTTCAAATGATTAAAAAAACTCTTAAAATGATTTTAAAAAATAAAAGATACATGTCAAAATAATAAACATTCATGTCCCAATACCTAATGAAATGTCCAAACAAATTAAAAATAAGTTCAAACAAATAAAAAATGCAAGTCCAAATCCAAGTCTAAACATATTAAAAATACTAAGTCCAAGTATAAACAAATTAAAAATACAAAGTCCAAGTCCAAACATATTAAAATTACAAGTTCAAGTCCAAACATATCAAAACTACAAGTTCAAGTCCAAATGAAGATCTACCATTCCATCAGTAACTTTTTCCCTTCTATTTGATGCGTATTGGACAAGCCAATTCTAAATATATATATATATATATTTGATGCATATTGGACAAGCCAATACTAAACATGTGTGTGTGTGTGTGTGTAAAATTAAAAACCAACATTAGGTAACTCCTACAAATAAAAGTGAATAAGAATATAATACAGTATTATTAATTATAAAATATAACTATAATCCCAAAGTAATTAATTTTTATTACCTTTAGAAACCTTTGTTATTCATCACGATCACCATCGGCAATATCTTTAATGTCACTAATAGGGGATGAACGTATACAATTTTGTAAGCAAATGAGAGTTTCAACTGTTTGTGGAGATAGAGTGGTGGTAAGGGTGTAGGCGATGGCCGACGAGCGAGGGTGTAGGCAACGGCCGACAGACGACATACAAGTGAGGGTGTAGGCAACAGTAAAGGGGAGGCTAAAGGCGACACTGCGATGGGTGAGGCTGAAGGTGTTGCGGCAGTGATGGCAAAGGCGAGGTTGACAGCAATGGAGACAGGTGAGGCTGAAGGTCTTGCAATGGTGACAGGTGAGGCAATTGCGACAGCGAGGAGTGGTCTCGTGGTCATGGCGTGGGCAACGAAAAGCAGTTGA

mRNA sequence

ATGGGTTTCTCCATGAACGGCGACCCATCAAGCTCCATGACTACCCGATCCCACTACCACACCCACAAGATCTTCCTCTACTGCAACTACATCCTCCTCGGTGCCGCCTCCAGCTGCATCTTCCTGACGCTCTCCCTCCGCCTGGTCCCCTCCCTGTGCGGCGTCTCCATCGTCTTCCTCCACATCCTCACCATCGCCAGCGCCGTCTCGGGGTGTGCCATGGTGGCCTCCGCCGGCGCCACCCGGTGGTTCGGAGTGCACATGGTGTTCACCGTCCTCACCGCTATCTTCCAAGGGTCGGTGGCGGTGCTGATCTTCACGAGAACGGGGGAGTTTCTGGGGCAGCTGAAGTCGTACGTGAGGGAGGAGGATGGGGCGGTGATAGTGAAGTTGGCGGGGGGGCTGAGCGTGGTGATGTTCTGGTTGGAGTGGGTGGTGCTCACGCTGGCTTTTTTCTTGAGGTATTATGCGTATGTTGAAGGAGAGGGAGTGAATAATGGGGTGGCGATGAGGAGTGCGAAAGTGCAGCAGGATGAGGATTTGAAGGACTGGCCATGGCCATTCCAAGCAACGGCCGACAGACGACATACAAGTGAGGGTGTAGGCAACAGTAAAGGGGAGGCTAAAGGCGACACTGCGATGGGTGAGGCTGAAGGTGTTGCGGCAGTGATGGCAAAGGCGAGGTTGACAGCAATGGAGACAGGTGAGGCTGAAGGTCTTGCAATGGTGACAGGTGAGGCAATTGCGACAGCGAGGAGTGGTCTCGTGGTCATGGCGTGGGCAACGAAAAGCAGTTGA

Coding sequence (CDS)

ATGGGTTTCTCCATGAACGGCGACCCATCAAGCTCCATGACTACCCGATCCCACTACCACACCCACAAGATCTTCCTCTACTGCAACTACATCCTCCTCGGTGCCGCCTCCAGCTGCATCTTCCTGACGCTCTCCCTCCGCCTGGTCCCCTCCCTGTGCGGCGTCTCCATCGTCTTCCTCCACATCCTCACCATCGCCAGCGCCGTCTCGGGGTGTGCCATGGTGGCCTCCGCCGGCGCCACCCGGTGGTTCGGAGTGCACATGGTGTTCACCGTCCTCACCGCTATCTTCCAAGGGTCGGTGGCGGTGCTGATCTTCACGAGAACGGGGGAGTTTCTGGGGCAGCTGAAGTCGTACGTGAGGGAGGAGGATGGGGCGGTGATAGTGAAGTTGGCGGGGGGGCTGAGCGTGGTGATGTTCTGGTTGGAGTGGGTGGTGCTCACGCTGGCTTTTTTCTTGAGGTATTATGCGTATGTTGAAGGAGAGGGAGTGAATAATGGGGTGGCGATGAGGAGTGCGAAAGTGCAGCAGGATGAGGATTTGAAGGACTGGCCATGGCCATTCCAAGCAACGGCCGACAGACGACATACAAGTGAGGGTGTAGGCAACAGTAAAGGGGAGGCTAAAGGCGACACTGCGATGGGTGAGGCTGAAGGTGTTGCGGCAGTGATGGCAAAGGCGAGGTTGACAGCAATGGAGACAGGTGAGGCTGAAGGTCTTGCAATGGTGACAGGTGAGGCAATTGCGACAGCGAGGAGTGGTCTCGTGGTCATGGCGTGGGCAACGAAAAGCAGTTGA

Protein sequence

MGFSMNGDPSSSMTTRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGSVAVLIFTRTGEFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGVNNGVAMRSAKVQQDEDLKDWPWPFQATADRRHTSEGVGNSKGEAKGDTAMGEAEGVAAVMAKARLTAMETGEAEGLAMVTGEAIATARSGLVVMAWATKSS
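
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1); the record does not say which table the annotation pipeline used, and `translate` is a hypothetical helper name. Only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks unknown codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons of the CDS above.
print(translate("ATGGGTTTCTCCATGAACGGCGAC"))
# Last four codons of the CDS above, including the TGA stop.
print(translate("AAAAGCAGTTGA"))
```

The two fragments translate to `MGFSMNGD` and `KSS`, which match the start and end of the protein sequence listed above.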
Homology
BLAST of Sgr022093 vs. NCBI nr
Match: XP_008449000.1 (PREDICTED: uncharacterized protein LOC103491004 [Cucumis melo] >TYK19381.1 uncharacterized protein E5676_scaffold443G00100 [Cucumis melo var. makuwa])

HSP 1 Score: 308.5 bits (789), Expect = 5.5e-80
Identity = 160/190 (84.21%), Positives = 178/190 (93.68%), Query Frame = 0

Query: 1   MGFSMNGDPSSSMTTRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFL 60
           MGFS+   PS+SM TRSHYHTHK+FLY NYILLGAASSCIFLTLSLRL+PSLCG+S++FL
Sbjct: 1   MGFSITSTPSTSM-TRSHYHTHKLFLYTNYILLGAASSCIFLTLSLRLLPSLCGLSLIFL 60

Query: 61  HILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGSVAVLIFTRTGEFLGQLKSYV 120
           HILTIA+AVSGCAM A+A +TRWFGVHMVFTVLTAIFQGSVA+L++TRTG+FL +LKSYV
Sbjct: 61  HILTIAAAVSGCAM-AAASSTRWFGVHMVFTVLTAIFQGSVAMLVYTRTGDFLLELKSYV 120

Query: 121 REEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGV-NNGVAMRSAKVQQDE 180
           REEDGAVI+KLAGGLSVVMF LEWVVLTLAFFLRYYA+VEG+G  NNG AMRSAKVQQDE
Sbjct: 121 REEDGAVILKLAGGLSVVMFVLEWVVLTLAFFLRYYAFVEGDGSNNNGAAMRSAKVQQDE 180

Query: 181 DLKDWPWPFQ 190
           DLKDWPWPFQ
Sbjct: 181 DLKDWPWPFQ 188
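
The percentages on each HSP summary line are the identity and positive counts divided by the alignment length (gaps included). A minimal Python sketch that recomputes them from the summary line follows; `hsp_percentages` is a hypothetical helper, and the pattern accepts the label spelled either "Positives" or "Postives", since exports of this report vary.

```python
import re

# Captures "Identity = a/b" and "Positives = c/d" from an HSP summary line.
HSP_PATTERN = re.compile(
    r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) rounded to two decimals."""
    match = HSP_PATTERN.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )


line = "Identity = 160/190 (84.21%), Positives = 178/190 (93.68%), Query Frame = 0"
print(hsp_percentages(line))
```

For the Cucumis melo hit above, 160 identical and 178 positive positions over a 190-column alignment reproduce the reported 84.21% and 93.68%.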

BLAST of Sgr022093 vs. NCBI nr
Match: XP_038904159.1 (uncharacterized protein LOC120090516 [Benincasa hispida])

HSP 1 Score: 308.1 bits (788), Expect = 7.1e-80
Identity = 160/191 (83.77%), Positives = 178/191 (93.19%), Query Frame = 0

Query: 1   MGFSMNGDPSSSMTTRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFL 60
           MGFS+  +PS+SM +RSHYHTHK+FLY NYILLGAASSCIFLTLSLRL+PSLCG+S++FL
Sbjct: 1   MGFSITANPSTSM-SRSHYHTHKLFLYTNYILLGAASSCIFLTLSLRLLPSLCGLSLIFL 60

Query: 61  HILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGSVAVLIFTRTGEFLGQLKSYV 120
           HILTIA+AVSGCAM A+A A RWFGVHMVFTVLTAIFQGSVA+L+FTRTG+FL +LKSYV
Sbjct: 61  HILTIAAAVSGCAMAAAASA-RWFGVHMVFTVLTAIFQGSVAMLVFTRTGDFLWELKSYV 120

Query: 121 REEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGVN--NGVAMRSAKVQQD 180
           REEDGAVI+KLAGGLSVVMF LEWVVLTLAFFL+YYA+VEGEG N  NGV MRSAKVQQD
Sbjct: 121 REEDGAVILKLAGGLSVVMFVLEWVVLTLAFFLKYYAFVEGEGSNSSNGVGMRSAKVQQD 180

Query: 181 EDLKDWPWPFQ 190
           EDLKDWPWPFQ
Sbjct: 181 EDLKDWPWPFQ 189

BLAST of Sgr022093 vs. NCBI nr
Match: XP_004148268.1 (uncharacterized protein LOC101206234 [Cucumis sativus])

HSP 1 Score: 307.4 bits (786), Expect = 1.2e-79
Identity = 158/189 (83.60%), Positives = 175/189 (92.59%), Query Frame = 0

Query: 1   MGFSMNGDPSSSMTTRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFL 60
           MGFS+   PS+SM TRSHYHTHK+FLY NYILLGAASSCIFLTLSLRL+PSLCG+S++FL
Sbjct: 1   MGFSITSTPSTSM-TRSHYHTHKLFLYTNYILLGAASSCIFLTLSLRLLPSLCGLSLIFL 60

Query: 61  HILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGSVAVLIFTRTGEFLGQLKSYV 120
           HILTIA+AVSGCAM A+A +TRWFGVHMVFTVLTAIFQGSVA+L++TRTG+FL +LKSYV
Sbjct: 61  HILTIAAAVSGCAM-AAASSTRWFGVHMVFTVLTAIFQGSVAMLVYTRTGDFLSELKSYV 120

Query: 121 REEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGVNNGVAMRSAKVQQDED 180
           REEDGAVI+KLAGGLSVVMF LEWVVLTLAFFLRYYA+VE    NNG AMRSAKVQQDED
Sbjct: 121 REEDGAVILKLAGGLSVVMFVLEWVVLTLAFFLRYYAFVEEGSNNNGAAMRSAKVQQDED 180

Query: 181 LKDWPWPFQ 190
           LKDWPWPFQ
Sbjct: 181 LKDWPWPFQ 187

BLAST of Sgr022093 vs. NCBI nr
Match: KAE8650098.1 (hypothetical protein Csa_010636 [Cucumis sativus])

HSP 1 Score: 307.4 bits (786), Expect = 1.2e-79
Identity = 158/189 (83.60%), Positives = 175/189 (92.59%), Query Frame = 0

Query: 1   MGFSMNGDPSSSMTTRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFL 60
           MGFS+   PS+SM TRSHYHTHK+FLY NYILLGAASSCIFLTLSLRL+PSLCG+S++FL
Sbjct: 1   MGFSITSTPSTSM-TRSHYHTHKLFLYTNYILLGAASSCIFLTLSLRLLPSLCGLSLIFL 60

Query: 61  HILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGSVAVLIFTRTGEFLGQLKSYV 120
           HILTIA+AVSGCAM A+A +TRWFGVHMVFTVLTAIFQGSVA+L++TRTG+FL +LKSYV
Sbjct: 61  HILTIAAAVSGCAM-AAASSTRWFGVHMVFTVLTAIFQGSVAMLVYTRTGDFLSELKSYV 120

Query: 121 REEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGVNNGVAMRSAKVQQDED 180
           REEDGAVI+KLAGGLSVVMF LEWVVLTLAFFLRYYA+VE    NNG AMRSAKVQQDED
Sbjct: 121 REEDGAVILKLAGGLSVVMFVLEWVVLTLAFFLRYYAFVEEGSNNNGAAMRSAKVQQDED 180

Query: 181 LKDWPWPFQ 190
           LKDWPWPFQ
Sbjct: 181 LKDWPWPFQ 187

BLAST of Sgr022093 vs. NCBI nr
Match: XP_023540529.1 (uncharacterized protein LOC111800866 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 299.7 bits (766), Expect = 2.5e-77
Identity = 155/192 (80.73%), Positives = 175/192 (91.15%), Query Frame = 0

Query: 1   MGFSMNGDPSSSMT-TRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVF 60
           MGFS++G P SS + +RSHYHTHK+FLY NYILLGAASSCIFLTLSLRL+PSLCG+S++F
Sbjct: 1   MGFSISGHPQSSTSMSRSHYHTHKLFLYTNYILLGAASSCIFLTLSLRLLPSLCGLSLIF 60

Query: 61  LHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGSVAVLIFTRTGEFLGQLKSY 120
           LHILTIA+AVSGCAM AS  + RWFG+HMVFTVLTAIFQGSVAVL++TRTG+FLG+LKSY
Sbjct: 61  LHILTIAAAVSGCAM-ASTASGRWFGIHMVFTVLTAIFQGSVAVLVYTRTGDFLGELKSY 120

Query: 121 VREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEG--VNNGVAMRSAKVQQ 180
           VREEDGAVI+KLAGGLSVVMF LEWVVLTLAFFL+YYA+VEG G   N   AMRSAKV+Q
Sbjct: 121 VREEDGAVILKLAGGLSVVMFVLEWVVLTLAFFLKYYAFVEGGGGSGNGAAAMRSAKVEQ 180

Query: 181 DEDLKDWPWPFQ 190
           DEDLKDWPWPFQ
Sbjct: 181 DEDLKDWPWPFQ 191

BLAST of Sgr022093 vs. ExPASy TrEMBL
Match: A0A5D3D734 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold443G00100 PE=4 SV=1)

HSP 1 Score: 308.5 bits (789), Expect = 2.6e-80
Identity = 160/190 (84.21%), Positives = 178/190 (93.68%), Query Frame = 0

Query: 1   MGFSMNGDPSSSMTTRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFL 60
           MGFS+   PS+SM TRSHYHTHK+FLY NYILLGAASSCIFLTLSLRL+PSLCG+S++FL
Sbjct: 1   MGFSITSTPSTSM-TRSHYHTHKLFLYTNYILLGAASSCIFLTLSLRLLPSLCGLSLIFL 60

Query: 61  HILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGSVAVLIFTRTGEFLGQLKSYV 120
           HILTIA+AVSGCAM A+A +TRWFGVHMVFTVLTAIFQGSVA+L++TRTG+FL +LKSYV
Sbjct: 61  HILTIAAAVSGCAM-AAASSTRWFGVHMVFTVLTAIFQGSVAMLVYTRTGDFLLELKSYV 120

Query: 121 REEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGV-NNGVAMRSAKVQQDE 180
           REEDGAVI+KLAGGLSVVMF LEWVVLTLAFFLRYYA+VEG+G  NNG AMRSAKVQQDE
Sbjct: 121 REEDGAVILKLAGGLSVVMFVLEWVVLTLAFFLRYYAFVEGDGSNNNGAAMRSAKVQQDE 180

Query: 181 DLKDWPWPFQ 190
           DLKDWPWPFQ
Sbjct: 181 DLKDWPWPFQ 188

BLAST of Sgr022093 vs. ExPASy TrEMBL
Match: A0A1S3BL21 (uncharacterized protein LOC103491004 OS=Cucumis melo OX=3656 GN=LOC103491004 PE=4 SV=1)

HSP 1 Score: 308.5 bits (789), Expect = 2.6e-80
Identity = 160/190 (84.21%), Positives = 178/190 (93.68%), Query Frame = 0

Query: 1   MGFSMNGDPSSSMTTRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFL 60
           MGFS+   PS+SM TRSHYHTHK+FLY NYILLGAASSCIFLTLSLRL+PSLCG+S++FL
Sbjct: 1   MGFSITSTPSTSM-TRSHYHTHKLFLYTNYILLGAASSCIFLTLSLRLLPSLCGLSLIFL 60

Query: 61  HILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGSVAVLIFTRTGEFLGQLKSYV 120
           HILTIA+AVSGCAM A+A +TRWFGVHMVFTVLTAIFQGSVA+L++TRTG+FL +LKSYV
Sbjct: 61  HILTIAAAVSGCAM-AAASSTRWFGVHMVFTVLTAIFQGSVAMLVYTRTGDFLLELKSYV 120

Query: 121 REEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGV-NNGVAMRSAKVQQDE 180
           REEDGAVI+KLAGGLSVVMF LEWVVLTLAFFLRYYA+VEG+G  NNG AMRSAKVQQDE
Sbjct: 121 REEDGAVILKLAGGLSVVMFVLEWVVLTLAFFLRYYAFVEGDGSNNNGAAMRSAKVQQDE 180

Query: 181 DLKDWPWPFQ 190
           DLKDWPWPFQ
Sbjct: 181 DLKDWPWPFQ 188

BLAST of Sgr022093 vs. ExPASy TrEMBL
Match: A0A0A0L253 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G045140 PE=4 SV=1)

HSP 1 Score: 307.4 bits (786), Expect = 5.9e-80
Identity = 158/189 (83.60%), Positives = 175/189 (92.59%), Query Frame = 0

Query: 1   MGFSMNGDPSSSMTTRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFL 60
           MGFS+   PS+SM TRSHYHTHK+FLY NYILLGAASSCIFLTLSLRL+PSLCG+S++FL
Sbjct: 1   MGFSITSTPSTSM-TRSHYHTHKLFLYTNYILLGAASSCIFLTLSLRLLPSLCGLSLIFL 60

Query: 61  HILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGSVAVLIFTRTGEFLGQLKSYV 120
           HILTIA+AVSGCAM A+A +TRWFGVHMVFTVLTAIFQGSVA+L++TRTG+FL +LKSYV
Sbjct: 61  HILTIAAAVSGCAM-AAASSTRWFGVHMVFTVLTAIFQGSVAMLVYTRTGDFLSELKSYV 120

Query: 121 REEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGVNNGVAMRSAKVQQDED 180
           REEDGAVI+KLAGGLSVVMF LEWVVLTLAFFLRYYA+VE    NNG AMRSAKVQQDED
Sbjct: 121 REEDGAVILKLAGGLSVVMFVLEWVVLTLAFFLRYYAFVEEGSNNNGAAMRSAKVQQDED 180

Query: 181 LKDWPWPFQ 190
           LKDWPWPFQ
Sbjct: 181 LKDWPWPFQ 187

BLAST of Sgr022093 vs. ExPASy TrEMBL
Match: A0A6J1L2J4 (uncharacterized protein LOC111498512 OS=Cucurbita maxima OX=3661 GN=LOC111498512 PE=4 SV=1)

HSP 1 Score: 298.5 bits (763), Expect = 2.7e-77
Identity = 155/192 (80.73%), Positives = 174/192 (90.62%), Query Frame = 0

Query: 1   MGFSMNGDPSSSMT-TRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVF 60
           MGFS++G P SS + +RSHYHTHK+FLY NYILLGAASSCIFLTLSLRL+PSLCG+S++F
Sbjct: 1   MGFSISGHPQSSTSMSRSHYHTHKLFLYTNYILLGAASSCIFLTLSLRLLPSLCGLSLIF 60

Query: 61  LHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGSVAVLIFTRTGEFLGQLKSY 120
           LHILTIA+AVSGCAM AS  + RWFG HMVFTVLTAIFQGSVAVL++TRTG+FLG+LKSY
Sbjct: 61  LHILTIAAAVSGCAM-ASTASGRWFGTHMVFTVLTAIFQGSVAVLVYTRTGDFLGELKSY 120

Query: 121 VREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEG--VNNGVAMRSAKVQQ 180
           VREEDGAVI+KLAGGLSVVMF LEWVVLTLAFFL+YYA+VEG G   N   AMRSAKV+Q
Sbjct: 121 VREEDGAVILKLAGGLSVVMFVLEWVVLTLAFFLKYYAFVEGGGGSGNGAAAMRSAKVEQ 180

Query: 181 DEDLKDWPWPFQ 190
           DEDLKDWPWPFQ
Sbjct: 181 DEDLKDWPWPFQ 191

BLAST of Sgr022093 vs. ExPASy TrEMBL
Match: A0A6J1H1G8 (uncharacterized protein LOC111459467 OS=Cucurbita moschata OX=3662 GN=LOC111459467 PE=4 SV=1)

HSP 1 Score: 298.1 bits (762), Expect = 3.6e-77
Identity = 154/192 (80.21%), Positives = 174/192 (90.62%), Query Frame = 0

Query: 1   MGFSMNGDPSSSMT-TRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVF 60
           MGFS++G P SS + +RSHYHTHK+FLY NYILLGAASSCIFLTLSLRL+PSLCG+S++F
Sbjct: 1   MGFSISGHPQSSTSMSRSHYHTHKLFLYTNYILLGAASSCIFLTLSLRLLPSLCGLSLIF 60

Query: 61  LHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGSVAVLIFTRTGEFLGQLKSY 120
           LHILTIA+AVSGCAM AS  + RWFG+HMVFTVLTAIFQGSVAVL++TRTG+FLG+LKSY
Sbjct: 61  LHILTIAAAVSGCAM-ASTASGRWFGIHMVFTVLTAIFQGSVAVLVYTRTGDFLGELKSY 120

Query: 121 VREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEG--VNNGVAMRSAKVQQ 180
           VREEDGAVI+KLAGGLSV MF LEWVVLTLAFFL+YYA+VEG G   N   AMRSAKV+Q
Sbjct: 121 VREEDGAVILKLAGGLSVTMFVLEWVVLTLAFFLKYYAFVEGGGGSGNGAAAMRSAKVEQ 180

Query: 181 DEDLKDWPWPFQ 190
           DEDLKDWPWPFQ
Sbjct: 181 DEDLKDWPWPFQ 191

BLAST of Sgr022093 vs. TAIR 10
Match: AT5G16250.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G02640.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 240.7 bits (613), Expect = 1.3e-63
Identity = 126/181 (69.61%), Positives = 146/181 (80.66%), Query Frame = 0

Query: 10  SSSMTTRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAV 69
           SSS    SHYHTHKIFL+ NYILLGAASSCIFLTLSLRL+PS+CG  ++ LH  TIA+AV
Sbjct: 6   SSSPVEESHYHTHKIFLFSNYILLGAASSCIFLTLSLRLIPSICGFLLILLHATTIAAAV 65

Query: 70  SGCAMVASAGATRWFGVHMVFTVLTAIFQGSVAVLIFTRTGEFLGQLKSYVREEDGAVIV 129
           SGCA  AS G  RW+  HMV TVLTAIFQGSV+VLIFT T +FLG LKSYVREED AVI+
Sbjct: 66  SGCA-AASCGRNRWYAAHMVATVLTAIFQGSVSVLIFTNTSKFLGSLKSYVREEDAAVIL 125

Query: 130 KLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGVNNGVAM-RSAKVQQDEDLKDWPWPF 189
           KL GGL +V+F L+W+VL  AFFL+YYAYV+G    +GVAM R+ KVQ +E+ KDWPWPF
Sbjct: 126 KLGGGLCIVIFCLDWIVLVCAFFLKYYAYVDG---GDGVAMKRTGKVQSEENPKDWPWPF 182

BLAST of Sgr022093 vs. TAIR 10
Match: AT3G02640.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G16250.1); Has 96 Blast hits to 96 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 95; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 221.9 bits (564), Expect = 6.3e-58
Identity = 117/174 (67.24%), Positives = 141/174 (81.03%), Query Frame = 0

Query: 17  SHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVA 76
           SHY+THK+FL  NY+LLGA+SSCIFLTLSLRL+PSLCG  ++ LH  TIA+AVSGCA  A
Sbjct: 14  SHYYTHKLFLTANYVLLGASSSCIFLTLSLRLIPSLCGFFLILLHATTIAAAVSGCA-AA 73

Query: 77  SAGATRWFGVHMVFTVLTAIFQGSVAVLIFTRTGEFLGQLKSYVREEDGAVIVKLAGGLS 136
           S G  RW+  HM+ TVLTAIFQGSV+VLIFT T  FL  L SYVRE++ ++I+KLAGGL 
Sbjct: 74  SYGKNRWYAAHMIATVLTAIFQGSVSVLIFTNTSNFLESLNSYVREKEASMILKLAGGLC 133

Query: 137 VVMFWLEWVVLTLAFFLRYYAYVEGEGVNNGVAM-RSAKVQQDEDLKDWPWPFQ 190
           VV+F LEW+VL LAFFL+YYAYV+G+  NNGVAM R+ KVQ +E LK+ PW FQ
Sbjct: 134 VVIFCLEWIVLVLAFFLKYYAYVDGD--NNGVAMKRTGKVQSEETLKNSPWAFQ 184

BLAST of Sgr022093 vs. TAIR 10
Match: AT5G36800.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-terminal protein myristoylation; LOCATED IN: endomembrane system; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G36710.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 197.6 bits (501), Expect = 1.3e-50
Identity = 113/181 (62.43%), Positives = 143/181 (79.01%), Query Frame = 0

Query: 15  TRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAM 74
           ++S  +TH IFL CNYILLG+ASSCIFLT+SLRL PSL G+S++FL+ LTIA+AVSGC++
Sbjct: 4   SKSKGNTHNIFLLCNYILLGSASSCIFLTISLRLFPSLSGLSLIFLYTLTIATAVSGCSI 63

Query: 75  VASA----GATRWFGVHMVFTVLTAIFQGSVAVLIFTRTGEFLGQLKSYVREEDGAVIVK 134
            AS+     + R +G HMV TVLTAIFQG+V+VLIFTRTG+FL  LKSYVREEDG VI+K
Sbjct: 64  FASSTSATASDRLYGSHMVATVLTAIFQGAVSVLIFTRTGDFLRFLKSYVREEDGEVILK 123

Query: 135 LAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGVNNGVAMRSAKV-QQDEDLKDWP-WPF 190
           L+GGL V+MF LEW+VL LAF L+Y  Y++   V++       KV +Q+EDLKDWP +PF
Sbjct: 124 LSGGLCVLMFCLEWIVLVLAFLLKYSDYLDESVVDDD----DFKVRRQEEDLKDWPSYPF 180

BLAST of Sgr022093 vs. TAIR 10
Match: AT5G36710.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G36800.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 197.6 bits (501), Expect = 1.3e-50
Identity = 113/181 (62.43%), Positives = 143/181 (79.01%), Query Frame = 0

Query: 15  TRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAM 74
           ++S  +TH IFL CNYILLG+ASSCIFLT+SLRL PSL G+S++FL+ LTIA+AVSGC++
Sbjct: 4   SKSKGNTHNIFLLCNYILLGSASSCIFLTISLRLFPSLSGLSLIFLYTLTIATAVSGCSI 63

Query: 75  VASA----GATRWFGVHMVFTVLTAIFQGSVAVLIFTRTGEFLGQLKSYVREEDGAVIVK 134
            AS+     + R +G HMV TVLTAIFQG+V+VLIFTRTG+FL  LKSYVREEDG VI+K
Sbjct: 64  FASSTSATASDRLYGSHMVATVLTAIFQGAVSVLIFTRTGDFLRFLKSYVREEDGEVILK 123

Query: 135 LAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGVNNGVAMRSAKV-QQDEDLKDWP-WPF 190
           L+GGL V+MF LEW+VL LAF L+Y  Y++   V++       KV +Q+EDLKDWP +PF
Sbjct: 124 LSGGLCVLMFCLEWIVLVLAFLLKYSDYLDESVVDDD----DFKVRRQEEDLKDWPSYPF 180

The following BLAST results are available for this feature:

BLAST of Sgr022093 vs. NCBI nr
Match Name       E-value   Identity   Description
XP_008449000.1   5.5e-80   84.21      PREDICTED: uncharacterized protein LOC103491004 [Cucumis melo] >TYK19381.1 uncha...
XP_038904159.1   7.1e-80   83.77      uncharacterized protein LOC120090516 [Benincasa hispida]
XP_004148268.1   1.2e-79   83.60      uncharacterized protein LOC101206234 [Cucumis sativus]
KAE8650098.1     1.2e-79   83.60      hypothetical protein Csa_010636 [Cucumis sativus]
XP_023540529.1   2.5e-77   80.73      uncharacterized protein LOC111800866 [Cucurbita pepo subsp. pepo]

BLAST of Sgr022093 vs. ExPASy TrEMBL
Match Name       E-value   Identity   Description
A0A5D3D734       2.6e-80   84.21      Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3BL21       2.6e-80   84.21      uncharacterized protein LOC103491004 OS=Cucumis melo OX=3656 GN=LOC103491004 PE=...
A0A0A0L253       5.9e-80   83.60      Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G045140 PE=4 SV=1
A0A6J1L2J4       2.7e-77   80.73      uncharacterized protein LOC111498512 OS=Cucurbita maxima OX=3661 GN=LOC111498512...
A0A6J1H1G8       3.6e-77   80.21      uncharacterized protein LOC111459467 OS=Cucurbita moschata OX=3662 GN=LOC1114594...

BLAST of Sgr022093 vs. TAIR 10
Match Name       E-value   Identity   Description
AT5G16250.1      1.3e-63   69.61      unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT3G02640.1      6.3e-58   67.24      unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G36800.1      1.3e-50   62.43      unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-termin...
AT5G36710.1      1.3e-50   62.43      unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description    Source    Source Term     Source Description         Alignment
None       No IPR available   PANTHER   PTHR34124       F16B3.27 PROTEIN-RELATED   coord: 7..189
None       No IPR available   PANTHER   PTHR34124:SF2   F16B3.27 PROTEIN-RELATED   coord: 7..189

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name   Type
Sgr022093.1    Sgr022093.1   mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
cellular_component   GO:0016021       integral component of membrane