Sgr027939 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr027939
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: hydroxyproline-rich glycoprotein family protein
Location: tig00153056: 1685362 .. 1685883 (+)
RNA-Seq Expression: Sgr027939
Synteny: Sgr027939
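
The Location field packs the contig, start, end, and strand into one string. The following is a minimal sketch of how it could be split into parts and the feature length derived. The `Location` type, the `parse_location` helper, and the regular expression are illustrative names, not part of the database, and the sketch assumes the coordinates are 1-based and inclusive.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed location string."""
    contig: str
    start: int   # assumed 1-based, inclusive
    end: int     # assumed 1-based, inclusive
    strand: str  # "+" or "-"

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the length.
        return self.end - self.start + 1


# Assumed layout: "<contig>: <start> .. <end> (<strand>)"
_LOCATION_RE = re.compile(r"^\s*(\S+?):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end, and strand."""
    match = _LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    contig, start, end, strand = match.groups()
    return Location(contig, int(start), int(end), strand)


loc = parse_location("tig00153056: 1685362 .. 1685883 (+)")
print(loc.contig, loc.length, loc.strand)
```

Under the inclusive-coordinate assumption, the reported span covers 522 bases on the plus strand.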
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGTACGATGATCGCGAAGCACAGGATCCAGCCTCCCTAAGCTTCCCCGTTGGGTTGGTTCTTCTTCTGACATTTTTATTCTGCATGTGCTGCTTCTTTTCTTGTTGCCTCCACTGGGAAAAACTCCGATCATTCCTCGGCTGTCCCGATCATCTTAATCACCGGCATCCTCCTATTCCGCCGGAAAGCTCCGAATTCTTGGCACCGGAGAAGATTGTGCCTTTGCATACGGTGCGTTTTGTTTCTTTTCTTCTTCGTCGTTTTTCTGTTAATTAATTTCATGACGGTTGCGCCTTTGTGCTGTTTGTTTTGATGATCTTCTTTATCTTGTTTCTGAACGAGAAGCTGTGGAAGCAGAATCAGTCGCAGAGCCTGCCGGTGTTGATGCCCGGCGACGAGGTTCCGAGATTCATAGCAATGGCTTGTCCTCCATGTGCGGCGGCGGCGGCTTCGTTTGTGGAAGTTTTAGTGCAGCAACCTCCACAGACTATTCCGGATGCTTCAGGAGCTCTCTCTTAA

mRNA sequence

ATGGAGTACGATGATCGCGAAGCACAGGATCCAGCCTCCCTAAGCTTCCCCGTTGGGTTGGTTCTTCTTCTGACATTTTTATTCTGCATGTGCTGCTTCTTTTCTTGTTGCCTCCACTGGGAAAAACTCCGATCATTCCTCGGCTGTCCCGATCATCTTAATCACCGGCATCCTCCTATTCCGCCGGAAAGCTCCGAATTCTTGGCACCGGAGAAGATTGTGCCTTTGCATACGAAGCTGTGGAAGCAGAATCAGTCGCAGAGCCTGCCGGTGTTGATGCCCGGCGACGAGGTTCCGAGATTCATAGCAATGGCTTGTCCTCCATGTGCGGCGGCGGCGGCTTCGTTTGTGGAAGTTTTAGTGCAGCAACCTCCACAGACTATTCCGGATGCTTCAGGAGCTCTCTCTTAA
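
The gene sequence (with intron) and the mRNA sequence differ only by the spliced-out region, so the intron can be located by comparing the two strings. The sketch below assumes a single intron and identical exon bases in both sequences; `find_single_intron` is a hypothetical helper, and the short sequences in the usage lines are made up for illustration rather than taken from this record.

```python
from typing import Tuple


def find_single_intron(gene: str, mrna: str) -> Tuple[int, int]:
    """Return 0-based, half-open (start, end) of the gene segment missing from mrna.

    Assumes the mRNA equals the gene with exactly one contiguous block removed.
    """
    if len(mrna) > len(gene):
        raise ValueError("mRNA is longer than the gene sequence")

    # Length of the shared prefix (the first exon).
    prefix = 0
    while prefix < len(mrna) and gene[prefix] == mrna[prefix]:
        prefix += 1

    # Length of the shared suffix (the last exon), capped so that the
    # prefix and suffix never overlap inside the mRNA.
    max_suffix = len(mrna) - prefix
    suffix = 0
    while suffix < max_suffix and gene[-1 - suffix] == mrna[-1 - suffix]:
        suffix += 1

    if prefix + suffix != len(mrna):
        raise ValueError("sequences differ by more than one contiguous insertion")
    return prefix, len(gene) - suffix


# Made-up example: two 6-base exons around an 11-base intron.
toy_gene = "ATGGCC" + "GTAAGTTTCAG" + "AAGTAA"
toy_mrna = "ATGGCC" + "AAGTAA"
start, end = find_single_intron(toy_gene, toy_mrna)
print(start, end, toy_gene[start:end])
```

Extending the prefix first is a design choice: when bases at the two splice boundaries happen to match, the junction can slide by a few positions, so the reported coordinates are worth checking against the expected splice-site motif.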

Coding sequence (CDS)

ATGGAGTACGATGATCGCGAAGCACAGGATCCAGCCTCCCTAAGCTTCCCCGTTGGGTTGGTTCTTCTTCTGACATTTTTATTCTGCATGTGCTGCTTCTTTTCTTGTTGCCTCCACTGGGAAAAACTCCGATCATTCCTCGGCTGTCCCGATCATCTTAATCACCGGCATCCTCCTATTCCGCCGGAAAGCTCCGAATTCTTGGCACCGGAGAAGATTGTGCCTTTGCATACGAAGCTGTGGAAGCAGAATCAGTCGCAGAGCCTGCCGGTGTTGATGCCCGGCGACGAGGTTCCGAGATTCATAGCAATGGCTTGTCCTCCATGTGCGGCGGCGGCGGCTTCGTTTGTGGAAGTTTTAGTGCAGCAACCTCCACAGACTATTCCGGATGCTTCAGGAGCTCTCTCTTAA

Protein sequence

MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSFLGCPDHLNHRHPPIPPESSEFLAPEKIVPLHTKLWKQNQSQSLPVLMPGDEVPRFIAMACPPCAAAAASFVEVLVQQPPQTIPDASGALS
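
The protein sequence is the translation of the coding sequence. The sketch below shows one way to reproduce that step, assuming the standard genetic code and a CDS that starts in frame; `translate` and `CODON_TABLE` are illustrative names. The usage line translates only the first ten codons of the CDS listed above.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        # Codons with ambiguous bases are reported as "X".
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases (10 codons) of the CDS shown above.
print(translate("ATGGAGTACGATGATCGCGAAGCACAGGAT"))
```

The printed residues can be compared with the start of the protein sequence above; passing the full CDS string works the same way.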
Homology
BLAST of Sgr027939 vs. NCBI nr
Match: XP_038899315.1 (uncharacterized protein At5g65660-like [Benincasa hispida])

HSP 1 Score: 228.8 bits (582), Expect = 2.8e-56
Identity = 104/132 (78.79%), Positives = 120/132 (90.91%), Query Frame = 0

Query: 1   MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSFLGCPDHLNHRHPPI 60
           MEYD+REAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSFLGCPDHL+H HPPI
Sbjct: 1   MEYDNREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSFLGCPDHLHHHHPPI 60

Query: 61  PPESSEFLAPEKIVPLHTKLWKQNQSQSLPVLMPGDEVPRFIAMACPPCAAAAASFVEVL 120
           PP+++ F  P+KI P+HTK+W++N+ QSL VLMPGDEVPRFIAMACPPCAAAA   VE++
Sbjct: 61  PPQTAAFSPPDKISPIHTKIWRENRPQSLSVLMPGDEVPRFIAMACPPCAAAA--LVEIV 120

Query: 121 VQQPPQTIPDAS 133
           VQ+P Q+ PD+S
Sbjct: 121 VQKPSQSFPDSS 130

BLAST of Sgr027939 vs. NCBI nr
Match: XP_022140744.1 (uncharacterized protein At5g65660-like [Momordica charantia])

HSP 1 Score: 216.1 bits (549), Expect = 1.9e-52
Identity = 109/139 (78.42%), Positives = 120/139 (86.33%), Query Frame = 0

Query: 1   MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSFLGCPDHLNHRH-PP 60
           M+YDDREAQDPASLSFPVGLVLLLTF FCMCCFFSCCLHW+KLRSFLGCPD   HRH PP
Sbjct: 1   MDYDDREAQDPASLSFPVGLVLLLTFFFCMCCFFSCCLHWDKLRSFLGCPD---HRHPPP 60

Query: 61  IPPESSEFLAPEKIVPLHTKLWKQNQSQSLPVLMPGDEVPRFIAMACPPCAA--AAASFV 120
           IPPE++    P+KI+PLHT     NQSQSLPV+MPGDE PRFIAMACPPCAA  AAA FV
Sbjct: 61  IPPENAASSPPDKILPLHT-----NQSQSLPVVMPGDEFPRFIAMACPPCAAAVAAAPFV 120

Query: 121 EVLVQQPPQTIPDASGALS 137
           +VLVQ+PPQ+ PD+SGALS
Sbjct: 121 DVLVQKPPQSTPDSSGALS 131

BLAST of Sgr027939 vs. NCBI nr
Match: XP_008466113.1 (PREDICTED: uncharacterized protein At5g65660-like [Cucumis melo] >KAA0038642.1 uncharacterized protein E6C27_scaffold92G001760 [Cucumis melo var. makuwa] >TYK31243.1 uncharacterized protein E5676_scaffold455G005320 [Cucumis melo var. makuwa])

HSP 1 Score: 213.4 bits (542), Expect = 1.2e-51
Identity = 100/134 (74.63%), Positives = 115/134 (85.82%), Query Frame = 0

Query: 1   MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSFLGCPDHLNHRHPPI 60
           MEYD+REAQDPASLSFPVGLVLLLTFLFCMCCFF CCLHWEKLRSFLGCPDHL+H HPPI
Sbjct: 1   MEYDNREAQDPASLSFPVGLVLLLTFLFCMCCFFCCCLHWEKLRSFLGCPDHLHHHHPPI 60

Query: 61  PPESS--EFLAPEKIVPLHTKLWKQNQSQSLPVLMPGDEVPRFIAMACPPCAAAAASFVE 120
           PP  S      P+K  P+HT +WK+N+ QS+ VLMPGDEVPRFIA+ACPPCA AAA+ VE
Sbjct: 61  PPPHSPAALSPPDKFSPIHT-IWKENRPQSVSVLMPGDEVPRFIALACPPCATAAAALVE 120

Query: 121 VLVQQPPQTIPDAS 133
           ++VQ+P Q+I D+S
Sbjct: 121 IVVQKPSQSISDSS 133

BLAST of Sgr027939 vs. NCBI nr
Match: KAG6591711.1 (hypothetical protein SDJN03_14057, partial [Cucurbita argyrosperma subsp. sororia] >KAG7024594.1 hypothetical protein SDJN02_13412 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 207.6 bits (527), Expect = 6.7e-50
Identity = 97/131 (74.05%), Positives = 111/131 (84.73%), Query Frame = 0

Query: 1   MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSFLGCPDHLNHRHPPI 60
           MEYDDREAQD ASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRS LGCPDH +HRHP I
Sbjct: 1   MEYDDREAQDSASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSILGCPDHADHRHPSI 60

Query: 61  PPESSEFLAPEKIVPLHTKLWKQNQSQSLPVLMPGDEVPRFIAMACPPCAAAAASFVEVL 120
           PPE+  F  P+K  P+HT +WK+N+ +SL VLMPGDEVPRFIAMACPPC   AA  VE++
Sbjct: 61  PPETPAFSPPDKFSPIHT-IWKENRPESLTVLMPGDEVPRFIAMACPPC--GAAPLVEIV 120

Query: 121 VQQPPQTIPDA 132
           +Q+P Q+I +A
Sbjct: 121 IQKPSQSISEA 128

BLAST of Sgr027939 vs. NCBI nr
Match: XP_022936981.1 (uncharacterized protein At5g65660-like [Cucurbita moschata])

HSP 1 Score: 207.6 bits (527), Expect = 6.7e-50
Identity = 100/135 (74.07%), Positives = 113/135 (83.70%), Query Frame = 0

Query: 1   MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSFLGCPDHLNHRHPPI 60
           MEYDDREAQD ASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRS LGCPDH +HRHP I
Sbjct: 1   MEYDDREAQDSASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSILGCPDHADHRHPSI 60

Query: 61  PPESSEFLAPEKIVPLHTKLWKQNQSQSLPVLMPGDEVPRFIAMACPPCAAAAASFVEVL 120
           PPE+  F  P+K  P+HT +WK+N+ +SL VLMPGDEVPRFIAMACPPC   AA  VE++
Sbjct: 61  PPETPAFSPPDKFSPIHT-IWKENRPESLTVLMPGDEVPRFIAMACPPC--GAAPLVEIV 120

Query: 121 VQQPPQTIPDASGAL 136
           +Q+P Q+I   SGAL
Sbjct: 121 IQKPSQSI---SGAL 129

BLAST of Sgr027939 vs. ExPASy Swiss-Prot
Match: Q9LSK9 (Uncharacterized protein At5g65660 OS=Arabidopsis thaliana OX=3702 GN=At5g65660 PE=2 SV=1)

HSP 1 Score: 73.6 bits (179), Expect = 2.0e-12
Identity = 50/117 (42.74%), Positives = 65/117 (55.56%), Query Frame = 0

Query: 13  SLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSFLGCPDHLNHRHPPIPPESSEFLAPEK 72
           SL FP+G  LLL  +F +   FSCC HW+K RS       L +  P    ES+    P K
Sbjct: 18  SLGFPLGTALLLIIIFSLSGIFSCCYHWDKHRSL---RRSLANGRPSADIESN----PYK 77

Query: 73  IVPLHTKLWKQNQSQSLPVLMPGDEVPRFIAMACPPCAAAAASFVEVLVQQPPQTIP 130
             P   ++ K+ Q+ S+PVLMPGD  P+FIA+ C PCA      + V VQ PPQ+ P
Sbjct: 78  PKPPFPEM-KKPQNLSVPVLMPGDNTPKFIALPC-PCAPPRPEKLTVDVQTPPQSPP 125

BLAST of Sgr027939 vs. ExPASy TrEMBL
Match: A0A6J1CGJ4 (uncharacterized protein At5g65660-like OS=Momordica charantia OX=3673 GN=LOC111011330 PE=4 SV=1)

HSP 1 Score: 216.1 bits (549), Expect = 9.2e-53
Identity = 109/139 (78.42%), Positives = 120/139 (86.33%), Query Frame = 0

Query: 1   MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSFLGCPDHLNHRH-PP 60
           M+YDDREAQDPASLSFPVGLVLLLTF FCMCCFFSCCLHW+KLRSFLGCPD   HRH PP
Sbjct: 1   MDYDDREAQDPASLSFPVGLVLLLTFFFCMCCFFSCCLHWDKLRSFLGCPD---HRHPPP 60

Query: 61  IPPESSEFLAPEKIVPLHTKLWKQNQSQSLPVLMPGDEVPRFIAMACPPCAA--AAASFV 120
           IPPE++    P+KI+PLHT     NQSQSLPV+MPGDE PRFIAMACPPCAA  AAA FV
Sbjct: 61  IPPENAASSPPDKILPLHT-----NQSQSLPVVMPGDEFPRFIAMACPPCAAAVAAAPFV 120

Query: 121 EVLVQQPPQTIPDASGALS 137
           +VLVQ+PPQ+ PD+SGALS
Sbjct: 121 DVLVQKPPQSTPDSSGALS 131

BLAST of Sgr027939 vs. ExPASy TrEMBL
Match: A0A5D3E545 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold455G005320 PE=4 SV=1)

HSP 1 Score: 213.4 bits (542), Expect = 5.9e-52
Identity = 100/134 (74.63%), Positives = 115/134 (85.82%), Query Frame = 0

Query: 1   MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSFLGCPDHLNHRHPPI 60
           MEYD+REAQDPASLSFPVGLVLLLTFLFCMCCFF CCLHWEKLRSFLGCPDHL+H HPPI
Sbjct: 1   MEYDNREAQDPASLSFPVGLVLLLTFLFCMCCFFCCCLHWEKLRSFLGCPDHLHHHHPPI 60

Query: 61  PPESS--EFLAPEKIVPLHTKLWKQNQSQSLPVLMPGDEVPRFIAMACPPCAAAAASFVE 120
           PP  S      P+K  P+HT +WK+N+ QS+ VLMPGDEVPRFIA+ACPPCA AAA+ VE
Sbjct: 61  PPPHSPAALSPPDKFSPIHT-IWKENRPQSVSVLMPGDEVPRFIALACPPCATAAAALVE 120

Query: 121 VLVQQPPQTIPDAS 133
           ++VQ+P Q+I D+S
Sbjct: 121 IVVQKPSQSISDSS 133

BLAST of Sgr027939 vs. ExPASy TrEMBL
Match: A0A1S3CQS4 (uncharacterized protein At5g65660-like OS=Cucumis melo OX=3656 GN=LOC103503633 PE=4 SV=1)

HSP 1 Score: 213.4 bits (542), Expect = 5.9e-52
Identity = 100/134 (74.63%), Positives = 115/134 (85.82%), Query Frame = 0

Query: 1   MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSFLGCPDHLNHRHPPI 60
           MEYD+REAQDPASLSFPVGLVLLLTFLFCMCCFF CCLHWEKLRSFLGCPDHL+H HPPI
Sbjct: 1   MEYDNREAQDPASLSFPVGLVLLLTFLFCMCCFFCCCLHWEKLRSFLGCPDHLHHHHPPI 60

Query: 61  PPESS--EFLAPEKIVPLHTKLWKQNQSQSLPVLMPGDEVPRFIAMACPPCAAAAASFVE 120
           PP  S      P+K  P+HT +WK+N+ QS+ VLMPGDEVPRFIA+ACPPCA AAA+ VE
Sbjct: 61  PPPHSPAALSPPDKFSPIHT-IWKENRPQSVSVLMPGDEVPRFIALACPPCATAAAALVE 120

Query: 121 VLVQQPPQTIPDAS 133
           ++VQ+P Q+I D+S
Sbjct: 121 IVVQKPSQSISDSS 133

BLAST of Sgr027939 vs. ExPASy TrEMBL
Match: A0A6J1F906 (uncharacterized protein At5g65660-like OS=Cucurbita moschata OX=3662 GN=LOC111443410 PE=4 SV=1)

HSP 1 Score: 207.6 bits (527), Expect = 3.3e-50
Identity = 100/135 (74.07%), Positives = 113/135 (83.70%), Query Frame = 0

Query: 1   MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSFLGCPDHLNHRHPPI 60
           MEYDDREAQD ASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRS LGCPDH +HRHP I
Sbjct: 1   MEYDDREAQDSASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSILGCPDHADHRHPSI 60

Query: 61  PPESSEFLAPEKIVPLHTKLWKQNQSQSLPVLMPGDEVPRFIAMACPPCAAAAASFVEVL 120
           PPE+  F  P+K  P+HT +WK+N+ +SL VLMPGDEVPRFIAMACPPC   AA  VE++
Sbjct: 61  PPETPAFSPPDKFSPIHT-IWKENRPESLTVLMPGDEVPRFIAMACPPC--GAAPLVEIV 120

Query: 121 VQQPPQTIPDASGAL 136
           +Q+P Q+I   SGAL
Sbjct: 121 IQKPSQSI---SGAL 129

BLAST of Sgr027939 vs. ExPASy TrEMBL
Match: A0A6J1IEI5 (uncharacterized protein At5g65660-like OS=Cucurbita maxima OX=3661 GN=LOC111476516 PE=4 SV=1)

HSP 1 Score: 204.5 bits (519), Expect = 2.8e-49
Identity = 99/135 (73.33%), Positives = 112/135 (82.96%), Query Frame = 0

Query: 1   MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSFLGCPDHLNHRHPPI 60
           MEYDDREAQD ASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRS LGCPDH +H HP I
Sbjct: 1   MEYDDREAQDSASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSILGCPDHSDHLHPSI 60

Query: 61  PPESSEFLAPEKIVPLHTKLWKQNQSQSLPVLMPGDEVPRFIAMACPPCAAAAASFVEVL 120
           PPE+  F  P+K  P+HT +WK+N+ +SL VLMPGDEVPRFIAMACPPC   AA  VE++
Sbjct: 61  PPETPAFSPPDKFSPIHT-IWKENRPESLTVLMPGDEVPRFIAMACPPC--GAAPLVEIV 120

Query: 121 VQQPPQTIPDASGAL 136
           +Q+P Q+I   SGAL
Sbjct: 121 IQKPSQSI---SGAL 129

BLAST of Sgr027939 vs. TAIR 10
Match: AT5G65660.1 (hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 73.6 bits (179), Expect = 1.4e-13
Identity = 50/117 (42.74%), Positives = 65/117 (55.56%), Query Frame = 0

Query: 13  SLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSFLGCPDHLNHRHPPIPPESSEFLAPEK 72
           SL FP+G  LLL  +F +   FSCC HW+K RS       L +  P    ES+    P K
Sbjct: 18  SLGFPLGTALLLIIIFSLSGIFSCCYHWDKHRSL---RRSLANGRPSADIESN----PYK 77

Query: 73  IVPLHTKLWKQNQSQSLPVLMPGDEVPRFIAMACPPCAAAAASFVEVLVQQPPQTIP 130
             P   ++ K+ Q+ S+PVLMPGD  P+FIA+ C PCA      + V VQ PPQ+ P
Sbjct: 78  PKPPFPEM-KKPQNLSVPVLMPGDNTPKFIALPC-PCAPPRPEKLTVDVQTPPQSPP 125

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity (%)  Description
XP_038899315.1  2.8e-56  78.79         uncharacterized protein At5g65660-like [Benincasa hispida]
XP_022140744.1  1.9e-52  78.42         uncharacterized protein At5g65660-like [Momordica charantia]
XP_008466113.1  1.2e-51  74.63         PREDICTED: uncharacterized protein At5g65660-like [Cucumis melo] >KAA0038642.1 u...
KAG6591711.1    6.7e-50  74.05         hypothetical protein SDJN03_14057, partial [Cucurbita argyrosperma subsp. sorori...
XP_022936981.1  6.7e-50  74.07         uncharacterized protein At5g65660-like [Cucurbita moschata]

ExPASy Swiss-Prot
Match Name      E-value  Identity (%)  Description
Q9LSK9          2.0e-12  42.74         Uncharacterized protein At5g65660 OS=Arabidopsis thaliana OX=3702 GN=At5g65660 P...

ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A6J1CGJ4      9.2e-53  78.42         uncharacterized protein At5g65660-like OS=Momordica charantia OX=3673 GN=LOC1110...
A0A5D3E545      5.9e-52  74.63         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3CQS4      5.9e-52  74.63         uncharacterized protein At5g65660-like OS=Cucumis melo OX=3656 GN=LOC103503633 P...
A0A6J1F906      3.3e-50  74.07         uncharacterized protein At5g65660-like OS=Cucurbita moschata OX=3662 GN=LOC11144...
A0A6J1IEI5      2.8e-49  73.33         uncharacterized protein At5g65660-like OS=Cucurbita maxima OX=3661 GN=LOC1114765...

TAIR 10
Match Name      E-value  Identity (%)  Description
AT5G65660.1     1.4e-13  42.74         hydroxyproline-rich glycoprotein family protein
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                         Source   Source Term    Source Description                               Alignment
IPR037699  Uncharacterized protein At5g65660-like  PANTHER  PTHR34291      HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY PROTEIN  coord: 1..128
None       No IPR available                        PANTHER  PTHR34291:SF7  PROTEIN, PUTATIVE-RELATED                        coord: 1..128

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr027939.1   Sgr027939.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane