Sgr020898 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr020898
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
Location: tig00153577: 128520 .. 129113 (-)
RNA-Seq Expression: Sgr020898
Synteny: Sgr020898
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCAGGTCCTCTGCAGCCACCCCCGCCGCAGCCAGGCCGCCCAAGGATACTGCGATACGTCGCCTTGGTCCTGCTGGCACTGATTGTACTCGTTGGCCTCACCGTGCTCATCATCTGGCTGACCGTCAGGCCGAAACGGCTAAGCTACACGGTCGAAAGCGCCTCGGTCCAAAACTTCGACCTGAGCAACACCCAACTCAATGCCTCCTTTAGCTTTGGGGTAAGAGCATATAATCCCAACACCAAAGTCTCGGTGTACTACGATTCCATCTCCGTCACGGTCGGCTTCGGCGATCAAGACTTGGCGTTCGGAGTGATCAATCCCTTCTACCAACGTCACGAAGAAGTGAAATGGTTGGACGTGAACCTCGCCACTGAAAACATTCCTCTTCATGACTCCGTATCCAAGGATCTAAGGCTCGAAAAGGCGGCGGGAGAGATTGATTTAGACCTTTGGATCAAGGCGAGAATAAGGTTTAAGGTTGGGATATGGAAGTCGCACCGGACGCTCCGAATTCGGTGTTCGCCGGTGATTGTCTACTTCTCTAATGGCAAGAGTTTCAAGAAGACTACTTGCTTTGCAGAAGTCTAA

mRNA sequence

ATGGCAGGTCCTCTGCAGCCACCCCCGCCGCAGCCAGGCCGCCCAAGGATACTGCGATACGTCGCCTTGGTCCTGCTGGCACTGATTGTACTCGTTGGCCTCACCGTGCTCATCATCTGGCTGACCGTCAGGCCGAAACGGCTAAGCTACACGGTCGAAAGCGCCTCGGTCCAAAACTTCGACCTGAGCAACACCCAACTCAATGCCTCCTTTAGCTTTGGGGTAAGAGCATATAATCCCAACACCAAAGTCTCGGTGTACTACGATTCCATCTCCGTCACGGTCGGCTTCGGCGATCAAGACTTGGCGTTCGGAGTGATCAATCCCTTCTACCAACGTCACGAAGAAGTGAAATGGTTGGACGTGAACCTCGCCACTGAAAACATTCCTCTTCATGACTCCGTATCCAAGGATCTAAGGCTCGAAAAGGCGGCGGGAGAGATTGATTTAGACCTTTGGATCAAGGCGAGAATAAGGTTTAAGGTTGGGATATGGAAGTCGCACCGGACGCTCCGAATTCGGTGTTCGCCGGTGATTGTCTACTTCTCTAATGGCAAGAGTTTCAAGAAGACTACTTGCTTTGCAGAAGTCTAA

Coding sequence (CDS)

ATGGCAGGTCCTCTGCAGCCACCCCCGCCGCAGCCAGGCCGCCCAAGGATACTGCGATACGTCGCCTTGGTCCTGCTGGCACTGATTGTACTCGTTGGCCTCACCGTGCTCATCATCTGGCTGACCGTCAGGCCGAAACGGCTAAGCTACACGGTCGAAAGCGCCTCGGTCCAAAACTTCGACCTGAGCAACACCCAACTCAATGCCTCCTTTAGCTTTGGGGTAAGAGCATATAATCCCAACACCAAAGTCTCGGTGTACTACGATTCCATCTCCGTCACGGTCGGCTTCGGCGATCAAGACTTGGCGTTCGGAGTGATCAATCCCTTCTACCAACGTCACGAAGAAGTGAAATGGTTGGACGTGAACCTCGCCACTGAAAACATTCCTCTTCATGACTCCGTATCCAAGGATCTAAGGCTCGAAAAGGCGGCGGGAGAGATTGATTTAGACCTTTGGATCAAGGCGAGAATAAGGTTTAAGGTTGGGATATGGAAGTCGCACCGGACGCTCCGAATTCGGTGTTCGCCGGTGATTGTCTACTTCTCTAATGGCAAGAGTTTCAAGAAGACTACTTGCTTTGCAGAAGTCTAA

Protein sequence

MAGPLQPPPPQPGRPRILRYVALVLLALIVLVGLTVLIIWLTVRPKRLSYTVESASVQNFDLSNTQLNASFSFGVRAYNPNTKVSVYYDSISVTVGFGDQDLAFGVINPFYQRHEEVKWLDVNLATENIPLHDSVSKDLRLEKAAGEIDLDLWIKARIRFKVGIWKSHRTLRIRCSPVIVYFSNGKSFKKTTCFAEV
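
The fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Location coordinates are 1-based and inclusive and that the gene is a single exon whose CDS ends in a stop codon (the gene, mRNA, and CDS sequences listed on this page are identical and end in TAA). The helper names `span_length` and `expected_protein_length` are hypothetical and are not part of any database API.

```python
# Coordinates copied from the Location field of this record
# (assumed to be 1-based and inclusive).
START, END = 128520, 129113

# Protein sequence copied from the "Protein sequence" field of this record.
PROTEIN = (
    "MAGPLQPPPPQPGRPRILRYVALVLLALIVLVGLTVLIIWLTVRPKRLSYTVESASVQNF"
    "DLSNTQLNASFSFGVRAYNPNTKVSVYYDSISVTVGFGDQDLAFGVINPFYQRHEEVKWL"
    "DVNLATENIPLHDSVSKDLRLEKAAGEIDLDLWIKARIRFKVGIWKSHRTLRIRCSPVIV"
    "YFSNGKSFKKTTCFAEV"
)


def span_length(start: int, end: int) -> int:
    """Length in bases of an inclusive, 1-based coordinate span."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS that includes its stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length // 3 - 1  # the final codon is the stop codon


gene_length = span_length(START, END)
print("span length (bp):", gene_length)
print("expected protein length (aa):", expected_protein_length(gene_length))
print("listed protein length (aa):", len(PROTEIN))
```

If the two protein lengths agree, the span is consistent with an intronless gene whose CDS covers the whole feature.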
Homology
BLAST of Sgr020898 vs. NCBI nr
Match: XP_022152939.1 (uncharacterized protein At1g08160 [Momordica charantia])

HSP 1 Score: 301.6 bits (771), Expect = 5.0e-78
Identity = 157/198 (79.29%), Positives = 173/198 (87.37%), Query Frame = 0

Query: 1   MAGPLQPPPPQPGRPRILRYVALVLLALIVLVGLTVLIIWLTVRPKRLSYTVESASVQNF 60
           MAGP QPPP   GR R+LR VALVLLALIVLVGL VLIIWLTVRPKRLSYTVESASVQNF
Sbjct: 1   MAGPPQPPP--SGRSRVLRCVALVLLALIVLVGLAVLIIWLTVRPKRLSYTVESASVQNF 60

Query: 61  DLSNTQLNASFSFGVRAYNPNTKVSVYYDSISVTVGFGDQDLAFGVINPFYQRHEEVKWL 120
           DLSNTQLNASF+F VRAYNPN++VSVYYD I VTVGFGDQDLA+G INPFYQ H+ V  L
Sbjct: 61  DLSNTQLNASFNFRVRAYNPNSRVSVYYDKILVTVGFGDQDLAYGTINPFYQPHKGVTRL 120

Query: 121 DVNLATENIPLHDSVSKDLRLEKAAGEIDLDLWIKARIRFKVGIWKS-HRTLRIRCSPVI 180
           D+N A +N+PL++SVSKDL LEKAAGE+DLDLWIKA+IRFKVGIWKS H+TLRI CSPVI
Sbjct: 121 DINPAAQNVPLYNSVSKDLGLEKAAGEMDLDLWIKAKIRFKVGIWKSGHQTLRIHCSPVI 180

Query: 181 VYFSNGKSFKKTTCFAEV 198
           +Y S  K F +TTCFAEV
Sbjct: 181 IYLSKSKPFNETTCFAEV 196
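
The statistics lines that head each HSP can be read programmatically. The sketch below parses the score, E-value, and the identity and positives ratios from the first hit, then recomputes the percentages from the raw counts. The regular expressions and the `parse_hsp` and `percent` helpers are illustrative assumptions about this page's text layout, not an official BLAST parser.

```python
import re

# Statistics lines for the first NCBI nr hit, as reported on this page.
HSP_LINES = """\
HSP 1 Score: 301.6 bits (771), Expect = 5.0e-78
Identity = 157/198 (79.29%), Positives = 173/198 (87.37%), Query Frame = 0
"""

SCORE_RE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = ([\d.eE+-]+)")
# The label pattern tolerates spelling variants of "Positives".
RATIO_RE = re.compile(r"(Identity|Pos\w*tives) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp(text: str) -> dict:
    """Extract bit score, raw score, E-value, and ratio fields from HSP text."""
    match = SCORE_RE.search(text)
    if match is None:
        raise ValueError("no HSP score line found")
    hsp = {
        "bits": float(match.group(1)),
        "raw_score": int(match.group(2)),
        "evalue": float(match.group(3)),
    }
    for label, num, den, pct in RATIO_RE.findall(text):
        key = "identity" if label == "Identity" else "positives"
        hsp[key] = (int(num), int(den), float(pct))
    return hsp


def percent(num: int, den: int) -> float:
    """Ratio expressed as a percentage rounded to two decimals."""
    return round(100.0 * num / den, 2)


hsp = parse_hsp(HSP_LINES)
for key in ("identity", "positives"):
    num, den, reported = hsp[key]
    print(key, "reported:", reported, "recomputed:", percent(num, den))
```

The denominator in both ratios is the alignment length, which counts gap columns as well as residues.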

BLAST of Sgr020898 vs. NCBI nr
Match: XP_038894942.1 (uncharacterized protein At1g08160-like [Benincasa hispida])

HSP 1 Score: 298.9 bits (764), Expect = 3.2e-77
Identity = 147/198 (74.24%), Positives = 175/198 (88.38%), Query Frame = 0

Query: 1   MAGPLQPPPPQPGRPRILRYVALVLLALIVLVGLTVLIIWLTVRPKRLSYTVESASVQNF 60
           MAGP Q PPP+ G  RILR+V + ++ALI+LVGL VLIIWLT+RPKRLSYTVESASV NF
Sbjct: 1   MAGPPQ-PPPRAGPSRILRFVLMFIVALIILVGLAVLIIWLTIRPKRLSYTVESASVHNF 60

Query: 61  DLSNTQLNASFSFGVRAYNPNTKVSVYYDSISVTVGFGDQDLAFGVINPFYQRHEEVKWL 120
           D+++TQLNASFSFGVRAYNPN +V++YYDSI+ TVGFGDQDLAFGV++PFYQ H++ +WL
Sbjct: 61  DMTSTQLNASFSFGVRAYNPNKRVAIYYDSITATVGFGDQDLAFGVLSPFYQPHKDERWL 120

Query: 121 DVNLATENIPLHDSVSKDLRLEKAAGEIDLDLWIKARIRFKVGIWKS-HRTLRIRCSPVI 180
           +++L  +N  LHDSVSKDL LE+AAGE+DLDLWIKARIRFKVG+WKS HRTLRIRCSPVI
Sbjct: 121 NIHLDAQNFLLHDSVSKDLALERAAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVI 180

Query: 181 VYFSNGKSFKKTTCFAEV 198
           VY S  ++FKKTTCF EV
Sbjct: 181 VYLSKSQTFKKTTCFTEV 197

BLAST of Sgr020898 vs. NCBI nr
Match: XP_008464346.1 (PREDICTED: uncharacterized protein At1g08160 [Cucumis melo] >KAA0032608.1 uncharacterized protein E6C27_scaffold43053G00810 [Cucumis melo var. makuwa] >TYK20878.1 uncharacterized protein E5676_scaffold284G00070 [Cucumis melo var. makuwa])

HSP 1 Score: 298.5 bits (763), Expect = 4.2e-77
Identity = 149/198 (75.25%), Positives = 173/198 (87.37%), Query Frame = 0

Query: 1   MAGPLQPPPPQPGRPRILRYVALVLLALIVLVGLTVLIIWLTVRPKRLSYTVESASVQNF 60
           MAGP Q PP + G  RILR+V + L+ALI+LVGL VLIIWLT+RPKRLSYTVESA V NF
Sbjct: 1   MAGPPQ-PPSRAGPSRILRFVIIFLVALIILVGLAVLIIWLTIRPKRLSYTVESAEVHNF 60

Query: 61  DLSNTQLNASFSFGVRAYNPNTKVSVYYDSISVTVGFGDQDLAFGVINPFYQRHEEVKWL 120
           D++NTQLNASFSFGVRAYNPN +VSVYYDSI+ TVGFGDQDLAFGV++PFYQ H++ +WL
Sbjct: 61  DMTNTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVLSPFYQPHKDEQWL 120

Query: 121 DVNLATENIPLHDSVSKDLRLEKAAGEIDLDLWIKARIRFKVGIWKS-HRTLRIRCSPVI 180
           +++L  +N  LHDSVSKDL LE++AGE+DLDLWIKARIRFKVG+WKS HRTLRIRCSPVI
Sbjct: 121 NIHLNAQNFLLHDSVSKDLALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVI 180

Query: 181 VYFSNGKSFKKTTCFAEV 198
           VY S  K+FKKTTCF EV
Sbjct: 181 VYLSKSKTFKKTTCFTEV 197

BLAST of Sgr020898 vs. NCBI nr
Match: XP_022937555.1 (uncharacterized protein At1g08160 [Cucurbita moschata] >XP_023537506.1 uncharacterized protein At1g08160 [Cucurbita pepo subsp. pepo] >KAG6586016.1 hypothetical protein SDJN03_18749, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 295.4 bits (755), Expect = 3.6e-76
Identity = 152/198 (76.77%), Positives = 169/198 (85.35%), Query Frame = 0

Query: 1   MAGPLQPPPPQPGRPRILRYVALVLLALIVLVGLTVLIIWLTVRPKRLSYTVESASVQNF 60
           MAGP Q    +P RP ILRYV LVL+ALIVLVGL VLIIWLTVRPKRLSYTVESA+V NF
Sbjct: 1   MAGPPQ-LSSRPARPNILRYVILVLVALIVLVGLVVLIIWLTVRPKRLSYTVESAAVHNF 60

Query: 61  DLSNTQLNASFSFGVRAYNPNTKVSVYYDSISVTVGFGDQDLAFGVINPFYQRHEEVKWL 120
           D+S TQLNASF+FGV+AYNPN  VSVYYD ++VTVGFGDQDLAFGVI PFYQ H++V WL
Sbjct: 61  DMSTTQLNASFNFGVKAYNPNRHVSVYYDHVTVTVGFGDQDLAFGVIKPFYQPHKDVTWL 120

Query: 121 DVNLATENIPLHDSVSKDLRLEKAAGEIDLDLWIKARIRFKVGIWKS-HRTLRIRCSPVI 180
           +++L  +N  LHDSVSKDL LEKAAGE+DLDLWIKARIRFKVG+WKS HRTLRIRCSPVI
Sbjct: 121 NMDLNAKNFLLHDSVSKDLALEKAAGEMDLDLWIKARIRFKVGVWKSGHRTLRIRCSPVI 180

Query: 181 VYFSNGKSFKKTTCFAEV 198
           VY S  K FK+T CF EV
Sbjct: 181 VYLSKDKEFKRTACFTEV 197

BLAST of Sgr020898 vs. NCBI nr
Match: XP_022969533.1 (uncharacterized protein At1g08160 [Cucurbita maxima])

HSP 1 Score: 293.9 bits (751), Expect = 1.0e-75
Identity = 150/198 (75.76%), Positives = 168/198 (84.85%), Query Frame = 0

Query: 1   MAGPLQPPPPQPGRPRILRYVALVLLALIVLVGLTVLIIWLTVRPKRLSYTVESASVQNF 60
           MAGP Q   P+P RP ILRYV LVL+ALIVLVGL VLIIWLTVRPKRL YTVESA+V NF
Sbjct: 1   MAGPPQ-LSPRPARPNILRYVILVLVALIVLVGLVVLIIWLTVRPKRLRYTVESAAVHNF 60

Query: 61  DLSNTQLNASFSFGVRAYNPNTKVSVYYDSISVTVGFGDQDLAFGVINPFYQRHEEVKWL 120
           D+S TQLNASF+FGV+AYNPN  VSVYYD ++VTVGFGDQDLAFGVI PFYQ H++V WL
Sbjct: 61  DMSTTQLNASFNFGVKAYNPNRHVSVYYDHVTVTVGFGDQDLAFGVIKPFYQPHKDVTWL 120

Query: 121 DVNLATENIPLHDSVSKDLRLEKAAGEIDLDLWIKARIRFKVGIWK-SHRTLRIRCSPVI 180
           +++L  +N  LHDSVSKDL LEKAAGE+DLDLWIKARIR+KVG+WK  HRTLRIRCSPVI
Sbjct: 121 NMDLNAKNFLLHDSVSKDLALEKAAGEMDLDLWIKARIRYKVGVWKLGHRTLRIRCSPVI 180

Query: 181 VYFSNGKSFKKTTCFAEV 198
           VY S  K FK+T CF EV
Sbjct: 181 VYLSKDKEFKRTACFTEV 197

BLAST of Sgr020898 vs. ExPASy Swiss-Prot
Match: Q8VZ13 (Uncharacterized protein At1g08160 OS=Arabidopsis thaliana OX=3702 GN=At1g08160 PE=2 SV=1)

HSP 1 Score: 128.6 bits (322), Expect = 7.6e-29
Identity = 77/193 (39.90%), Positives = 120/193 (62.18%), Query Frame = 0

Query: 6   QPPPPQPGRPRILRYVALVLLALIVLVGLTVLIIWLTVRPKRLSYTVESASVQNFDLSNT 65
           QP P +   P +   VALVLL L  LVGL +LI +LT+RPKRL YTVE+ASVQ F + N 
Sbjct: 27  QPLPGRRMNPVLCIIVALVLLGL--LVGLAILITYLTLRPKRLIYTVEAASVQEFAIGNN 86

Query: 66  --QLNASFSFGVRAYNPNTKVSVYYDSISVTVGFGDQDLAFGVINPFYQRHEEVKWLDVN 125
              +NA FS+ +++YNP   VSV Y S+ ++    +Q +A   I+PF QR +    ++  
Sbjct: 87  DDHINAKFSYVIKSYNPEKHVSVRYHSMRISTAHHNQSVAHKNISPFKQRPKNETRIETQ 146

Query: 126 LATENIPLHDSVSKDLRLEKAAGEIDLDLWIKARIRFKVGIWKS-HRTLRIRCSPVIVYF 185
           L + N+ L    ++DLR EK+ G I+++++I AR+ +K  I++S  RTL+  C+PV++  
Sbjct: 147 LVSHNVALSKFNARDLRAEKSKGTIEMEVYITARVSYKTWIFRSRRRTLKAVCTPVMINV 206

Query: 186 SNGK--SFKKTTC 194
           ++     F++  C
Sbjct: 207 TSSSLDGFQRVLC 217

BLAST of Sgr020898 vs. ExPASy Swiss-Prot
Match: Q9FNH6 (NDR1/HIN1-like protein 3 OS=Arabidopsis thaliana OX=3702 GN=NHL3 PE=1 SV=1)

HSP 1 Score: 83.6 bits (205), Expect = 2.8e-15
Identity = 63/188 (33.51%), Positives = 96/188 (51.06%), Query Frame = 0

Query: 17  ILRYVALVLLALIVLVGLTVLIIWLTVRPKRLSYTVESASVQNFDL---SNTQLNASFSF 76
           IL  +  +L+ + VL+G+  LIIWL  RP  + + V  A +  F L   +N + N   +F
Sbjct: 44  ILSVIFNILITIAVLLGIAALIIWLIFRPNAIKFHVTDAKLTEFTLDPTNNLRYNLDLNF 103

Query: 77  GVRAYNPNTKVSVYYDSISVTVGFGDQDLAFGV---INPFYQRHEEVKWLDVNLATENIP 136
            +R  NPN ++ VYYD I V   +GDQ   FG+   I+ FYQ H+    +   L  + + 
Sbjct: 104 TIR--NPNRRIGVYYDEIEVRGYYGDQ--RFGMSNNISKFYQGHKNTTVVGTKLVGQQLV 163

Query: 137 LHD-SVSKDLRLEKAAGEIDLDLWIKARIRFKVGIWKSHR-TLRIRCSPVIVYFSNGKS- 194
           L D    KDL  +  +    +D  ++ +IRFK G+ KS R   +I+C   +   SN  S 
Sbjct: 164 LLDGGERKDLNEDVNSQIYRIDAKLRLKIRFKFGLIKSWRFKPKIKCDLKVPLTSNSTSG 223

BLAST of Sgr020898 vs. ExPASy Swiss-Prot
Match: Q9FI03 (NDR1/HIN1-like protein 26 OS=Arabidopsis thaliana OX=3702 GN=NHL26 PE=2 SV=1)

HSP 1 Score: 81.3 bits (199), Expect = 1.4e-14
Identity = 42/156 (26.92%), Positives = 81/156 (51.92%), Query Frame = 0

Query: 34  LTVLIIWLTVRPKRLSYTVESASVQNFDLSNTQ---LNASFSFGVRAYNPNTKVSVYYDS 93
           L + ++WL + P+R  +++  A + + +L+ +    LN+S    + + NPN KV +YYD 
Sbjct: 40  LIIFLVWLILHPERPEFSLTEADIYSLNLTTSSTHLLNSSVQLTLFSKNPNKKVGIYYDK 99

Query: 94  ISVTVGF-GDQDLAFGVINPFYQRHEEVKWLDVNLATENIPLHDSVSKDLRLEKAAGEID 153
           + V   + G Q  +   + PFYQ HEE+  L   L    +P+  S    +  E++ G+I 
Sbjct: 100 LLVYAAYRGQQITSEASLPPFYQSHEEINLLTAFLQGTELPVAQSFGYQISRERSTGKII 159

Query: 154 LDLWIKARIRFKVGIWKSHR-TLRIRCSPVIVYFSN 185
           + + +  ++R+K+G W S      + C  ++ +  N
Sbjct: 160 IGMKMDGKLRWKIGTWVSGAYRFNVNCLAIVAFGMN 195

BLAST of Sgr020898 vs. ExPASy Swiss-Prot
Match: Q9SJ52 (NDR1/HIN1-like protein 10 OS=Arabidopsis thaliana OX=3702 GN=NHL10 PE=2 SV=1)

HSP 1 Score: 79.7 bits (195), Expect = 4.0e-14
Identity = 52/180 (28.89%), Positives = 90/180 (50.00%), Query Frame = 0

Query: 3   GPLQPPPPQPGRPR----------ILRYVALVLLALIVLVGLTVLIIWLTVRPKRLSYTV 62
           GP  PPP   G  R          +L     V+++LIV++G+  LI WL VRP+ + + V
Sbjct: 13  GPSVPPPAPKGYYRRGHGRGCGCCLLSLFVKVIISLIVILGVAALIFWLIVRPRAIKFHV 72

Query: 63  ESASVQNFDLSNTQ--LNASFSFGVRAYNPNTKVSVYYDSISVTVGFGDQDLAFGVINPF 122
             AS+  FD ++    L  + +  V   NPN ++ +YYD I     +  +  +   + PF
Sbjct: 73  TDASLTRFDHTSPDNILRYNLALTVPVRNPNKRIGLYYDRIEAHAYYEGKRFSTITLTPF 132

Query: 123 YQRHEEVKWLDVNLATENIPLHDS-VSKDLRLEKAAGEIDLDLWIKARIRFKVGIWKSHR 170
           YQ H+    L      +N+ + ++  S+ L  E+ +G  ++++  + R+RFK+G  K  R
Sbjct: 133 YQGHKNTTVLTPTFQGQNLVIFNAGQSRTLNAERISGVYNIEIKFRLRVRFKLGDLKFRR 192

BLAST of Sgr020898 vs. ExPASy Swiss-Prot
Match: Q9SRN1 (NDR1/HIN1-like protein 2 OS=Arabidopsis thaliana OX=3702 GN=NHL2 PE=2 SV=1)

HSP 1 Score: 77.8 bits (190), Expect = 1.5e-13
Identity = 56/211 (26.54%), Positives = 95/211 (45.02%), Query Frame = 0

Query: 3   GPLQPPPPQPGRPR-----------------------ILRYVALVLLALIVLVGLTVLII 62
           GP  PPPP+  R                         IL  +  +L+A+ V++G+  LI+
Sbjct: 14  GPSIPPPPKAHRSYNSPGFGCCCFSCLGSCLRCCGCCILSLICNILIAVAVILGVAALIL 73

Query: 63  WLTVRPKRLSYTVESASVQNFDLS-NTQLNASFSFGVRAYNPNTKVSVYYDSISVTVGFG 122
           WL  RP  + + V  A++  F    N  L+ S        NPN +V VYYD  SV+  +G
Sbjct: 74  WLIFRPNAVKFYVADANLNRFSFDPNNNLHYSLDLNFTIRNPNQRVGVYYDEFSVSGYYG 133

Query: 123 DQDLAFGVINPFYQRHEEVKWLDVNLATEN-IPLHDSVSKDLRLEKAAGEIDLDLWIKAR 182
           DQ      ++ FYQ H+    +   +  +N + L D    DL+ ++ +G   ++  ++  
Sbjct: 134 DQRFGSANVSSFYQGHKNTTVILTKIEGQNLVVLGDGARTDLKDDEKSGIYRINAKLRLS 193

Query: 183 IRFKVGIWKSHRTL-RIRCSPVIVYFSNGKS 188
           +RFK    KS +   +I+C  + +   +  S
Sbjct: 194 VRFKFWFIKSWKLKPKIKCDDLKIPLGSSNS 224

BLAST of Sgr020898 vs. ExPASy TrEMBL
Match: A0A6J1DHJ8 (uncharacterized protein At1g08160 OS=Momordica charantia OX=3673 GN=LOC111020551 PE=4 SV=1)

HSP 1 Score: 301.6 bits (771), Expect = 2.4e-78
Identity = 157/198 (79.29%), Positives = 173/198 (87.37%), Query Frame = 0

Query: 1   MAGPLQPPPPQPGRPRILRYVALVLLALIVLVGLTVLIIWLTVRPKRLSYTVESASVQNF 60
           MAGP QPPP   GR R+LR VALVLLALIVLVGL VLIIWLTVRPKRLSYTVESASVQNF
Sbjct: 1   MAGPPQPPP--SGRSRVLRCVALVLLALIVLVGLAVLIIWLTVRPKRLSYTVESASVQNF 60

Query: 61  DLSNTQLNASFSFGVRAYNPNTKVSVYYDSISVTVGFGDQDLAFGVINPFYQRHEEVKWL 120
           DLSNTQLNASF+F VRAYNPN++VSVYYD I VTVGFGDQDLA+G INPFYQ H+ V  L
Sbjct: 61  DLSNTQLNASFNFRVRAYNPNSRVSVYYDKILVTVGFGDQDLAYGTINPFYQPHKGVTRL 120

Query: 121 DVNLATENIPLHDSVSKDLRLEKAAGEIDLDLWIKARIRFKVGIWKS-HRTLRIRCSPVI 180
           D+N A +N+PL++SVSKDL LEKAAGE+DLDLWIKA+IRFKVGIWKS H+TLRI CSPVI
Sbjct: 121 DINPAAQNVPLYNSVSKDLGLEKAAGEMDLDLWIKAKIRFKVGIWKSGHQTLRIHCSPVI 180

Query: 181 VYFSNGKSFKKTTCFAEV 198
           +Y S  K F +TTCFAEV
Sbjct: 181 IYLSKSKPFNETTCFAEV 196

BLAST of Sgr020898 vs. ExPASy TrEMBL
Match: A0A5D3DBK2 (LEA_2 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold284G00070 PE=4 SV=1)

HSP 1 Score: 298.5 bits (763), Expect = 2.0e-77
Identity = 149/198 (75.25%), Positives = 173/198 (87.37%), Query Frame = 0

Query: 1   MAGPLQPPPPQPGRPRILRYVALVLLALIVLVGLTVLIIWLTVRPKRLSYTVESASVQNF 60
           MAGP Q PP + G  RILR+V + L+ALI+LVGL VLIIWLT+RPKRLSYTVESA V NF
Sbjct: 1   MAGPPQ-PPSRAGPSRILRFVIIFLVALIILVGLAVLIIWLTIRPKRLSYTVESAEVHNF 60

Query: 61  DLSNTQLNASFSFGVRAYNPNTKVSVYYDSISVTVGFGDQDLAFGVINPFYQRHEEVKWL 120
           D++NTQLNASFSFGVRAYNPN +VSVYYDSI+ TVGFGDQDLAFGV++PFYQ H++ +WL
Sbjct: 61  DMTNTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVLSPFYQPHKDEQWL 120

Query: 121 DVNLATENIPLHDSVSKDLRLEKAAGEIDLDLWIKARIRFKVGIWKS-HRTLRIRCSPVI 180
           +++L  +N  LHDSVSKDL LE++AGE+DLDLWIKARIRFKVG+WKS HRTLRIRCSPVI
Sbjct: 121 NIHLNAQNFLLHDSVSKDLALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVI 180

Query: 181 VYFSNGKSFKKTTCFAEV 198
           VY S  K+FKKTTCF EV
Sbjct: 181 VYLSKSKTFKKTTCFTEV 197

BLAST of Sgr020898 vs. ExPASy TrEMBL
Match: A0A1S3CLP6 (uncharacterized protein At1g08160 OS=Cucumis melo OX=3656 GN=LOC103502251 PE=4 SV=1)

HSP 1 Score: 298.5 bits (763), Expect = 2.0e-77
Identity = 149/198 (75.25%), Positives = 173/198 (87.37%), Query Frame = 0

Query: 1   MAGPLQPPPPQPGRPRILRYVALVLLALIVLVGLTVLIIWLTVRPKRLSYTVESASVQNF 60
           MAGP Q PP + G  RILR+V + L+ALI+LVGL VLIIWLT+RPKRLSYTVESA V NF
Sbjct: 1   MAGPPQ-PPSRAGPSRILRFVIIFLVALIILVGLAVLIIWLTIRPKRLSYTVESAEVHNF 60

Query: 61  DLSNTQLNASFSFGVRAYNPNTKVSVYYDSISVTVGFGDQDLAFGVINPFYQRHEEVKWL 120
           D++NTQLNASFSFGVRAYNPN +VSVYYDSI+ TVGFGDQDLAFGV++PFYQ H++ +WL
Sbjct: 61  DMTNTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVLSPFYQPHKDEQWL 120

Query: 121 DVNLATENIPLHDSVSKDLRLEKAAGEIDLDLWIKARIRFKVGIWKS-HRTLRIRCSPVI 180
           +++L  +N  LHDSVSKDL LE++AGE+DLDLWIKARIRFKVG+WKS HRTLRIRCSPVI
Sbjct: 121 NIHLNAQNFLLHDSVSKDLALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVI 180

Query: 181 VYFSNGKSFKKTTCFAEV 198
           VY S  K+FKKTTCF EV
Sbjct: 181 VYLSKSKTFKKTTCFTEV 197

BLAST of Sgr020898 vs. ExPASy TrEMBL
Match: A0A6J1FH28 (uncharacterized protein At1g08160 OS=Cucurbita moschata OX=3662 GN=LOC111443929 PE=4 SV=1)

HSP 1 Score: 295.4 bits (755), Expect = 1.7e-76
Identity = 152/198 (76.77%), Positives = 169/198 (85.35%), Query Frame = 0

Query: 1   MAGPLQPPPPQPGRPRILRYVALVLLALIVLVGLTVLIIWLTVRPKRLSYTVESASVQNF 60
           MAGP Q    +P RP ILRYV LVL+ALIVLVGL VLIIWLTVRPKRLSYTVESA+V NF
Sbjct: 1   MAGPPQ-LSSRPARPNILRYVILVLVALIVLVGLVVLIIWLTVRPKRLSYTVESAAVHNF 60

Query: 61  DLSNTQLNASFSFGVRAYNPNTKVSVYYDSISVTVGFGDQDLAFGVINPFYQRHEEVKWL 120
           D+S TQLNASF+FGV+AYNPN  VSVYYD ++VTVGFGDQDLAFGVI PFYQ H++V WL
Sbjct: 61  DMSTTQLNASFNFGVKAYNPNRHVSVYYDHVTVTVGFGDQDLAFGVIKPFYQPHKDVTWL 120

Query: 121 DVNLATENIPLHDSVSKDLRLEKAAGEIDLDLWIKARIRFKVGIWKS-HRTLRIRCSPVI 180
           +++L  +N  LHDSVSKDL LEKAAGE+DLDLWIKARIRFKVG+WKS HRTLRIRCSPVI
Sbjct: 121 NMDLNAKNFLLHDSVSKDLALEKAAGEMDLDLWIKARIRFKVGVWKSGHRTLRIRCSPVI 180

Query: 181 VYFSNGKSFKKTTCFAEV 198
           VY S  K FK+T CF EV
Sbjct: 181 VYLSKDKEFKRTACFTEV 197

BLAST of Sgr020898 vs. ExPASy TrEMBL
Match: A0A6J1I194 (uncharacterized protein At1g08160 OS=Cucurbita maxima OX=3661 GN=LOC111468522 PE=4 SV=1)

HSP 1 Score: 293.9 bits (751), Expect = 5.0e-76
Identity = 150/198 (75.76%), Positives = 168/198 (84.85%), Query Frame = 0

Query: 1   MAGPLQPPPPQPGRPRILRYVALVLLALIVLVGLTVLIIWLTVRPKRLSYTVESASVQNF 60
           MAGP Q   P+P RP ILRYV LVL+ALIVLVGL VLIIWLTVRPKRL YTVESA+V NF
Sbjct: 1   MAGPPQ-LSPRPARPNILRYVILVLVALIVLVGLVVLIIWLTVRPKRLRYTVESAAVHNF 60

Query: 61  DLSNTQLNASFSFGVRAYNPNTKVSVYYDSISVTVGFGDQDLAFGVINPFYQRHEEVKWL 120
           D+S TQLNASF+FGV+AYNPN  VSVYYD ++VTVGFGDQDLAFGVI PFYQ H++V WL
Sbjct: 61  DMSTTQLNASFNFGVKAYNPNRHVSVYYDHVTVTVGFGDQDLAFGVIKPFYQPHKDVTWL 120

Query: 121 DVNLATENIPLHDSVSKDLRLEKAAGEIDLDLWIKARIRFKVGIWK-SHRTLRIRCSPVI 180
           +++L  +N  LHDSVSKDL LEKAAGE+DLDLWIKARIR+KVG+WK  HRTLRIRCSPVI
Sbjct: 121 NMDLNAKNFLLHDSVSKDLALEKAAGEMDLDLWIKARIRYKVGVWKLGHRTLRIRCSPVI 180

Query: 181 VYFSNGKSFKKTTCFAEV 198
           VY S  K FK+T CF EV
Sbjct: 181 VYLSKDKEFKRTACFTEV 197

BLAST of Sgr020898 vs. TAIR 10
Match: AT5G22870.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 171.0 bits (432), Expect = 9.4e-43
Identity = 88/197 (44.67%), Positives = 132/197 (67.01%), Query Frame = 0

Query: 4   PLQPPPPQP-GRPRILRYVALVLLALIVLVGLTVLIIWLTVRPKRLSYTVESASVQNFDL 63
           P++  P QP  RP ++ Y+ LV+L LI +  +  LI WL  +PK+L YTVE+ASVQNF+L
Sbjct: 11  PMETSPAQPLRRPSLICYIFLVILTLIFMAAVGFLITWLETKPKKLRYTVENASVQNFNL 70

Query: 64  SN-TQLNASFSFGVRAYNPNTKVSVYYDSISVTVGFGDQDLAFGVINPFYQRHEEVKWLD 123
           +N   ++A+F F ++++NPN ++SVYY S+ + V F DQ LAF  + PF+Q    VK +D
Sbjct: 71  TNDNHMSATFQFTIQSHNPNHRISVYYSSVEIFVKFKDQTLAFDTVEPFHQPRMNVKQID 130

Query: 124 VNLATENIPLHDSVSKDLRLEKAAGEIDLDLWIKARIRFKVGIWK-SHRTLRIRCSPVIV 183
             L  EN+ +  S  KDLR + + G+I  ++++KAR+RFKVGIWK SHRT +I+CS V V
Sbjct: 131 ETLIAENVAVSKSNGKDLRSQNSLGKIGFEVFVKARVRFKVGIWKSSHRTAKIKCSHVTV 190

Query: 184 YFSNGKSFKKTTCFAEV 198
             S     + ++C A++
Sbjct: 191 SLSQPNKSQNSSCDADI 207

BLAST of Sgr020898 vs. TAIR 10
Match: AT1G08160.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 128.6 bits (322), Expect = 5.4e-30
Identity = 77/193 (39.90%), Positives = 120/193 (62.18%), Query Frame = 0

Query: 6   QPPPPQPGRPRILRYVALVLLALIVLVGLTVLIIWLTVRPKRLSYTVESASVQNFDLSNT 65
           QP P +   P +   VALVLL L  LVGL +LI +LT+RPKRL YTVE+ASVQ F + N 
Sbjct: 27  QPLPGRRMNPVLCIIVALVLLGL--LVGLAILITYLTLRPKRLIYTVEAASVQEFAIGNN 86

Query: 66  --QLNASFSFGVRAYNPNTKVSVYYDSISVTVGFGDQDLAFGVINPFYQRHEEVKWLDVN 125
              +NA FS+ +++YNP   VSV Y S+ ++    +Q +A   I+PF QR +    ++  
Sbjct: 87  DDHINAKFSYVIKSYNPEKHVSVRYHSMRISTAHHNQSVAHKNISPFKQRPKNETRIETQ 146

Query: 126 LATENIPLHDSVSKDLRLEKAAGEIDLDLWIKARIRFKVGIWKS-HRTLRIRCSPVIVYF 185
           L + N+ L    ++DLR EK+ G I+++++I AR+ +K  I++S  RTL+  C+PV++  
Sbjct: 147 LVSHNVALSKFNARDLRAEKSKGTIEMEVYITARVSYKTWIFRSRRRTLKAVCTPVMINV 206

Query: 186 SNGK--SFKKTTC 194
           ++     F++  C
Sbjct: 207 TSSSLDGFQRVLC 217

BLAST of Sgr020898 vs. TAIR 10
Match: AT5G06320.1 (NDR1/HIN1-like 3 )

HSP 1 Score: 83.6 bits (205), Expect = 2.0e-16
Identity = 63/188 (33.51%), Positives = 96/188 (51.06%), Query Frame = 0

Query: 17  ILRYVALVLLALIVLVGLTVLIIWLTVRPKRLSYTVESASVQNFDL---SNTQLNASFSF 76
           IL  +  +L+ + VL+G+  LIIWL  RP  + + V  A +  F L   +N + N   +F
Sbjct: 44  ILSVIFNILITIAVLLGIAALIIWLIFRPNAIKFHVTDAKLTEFTLDPTNNLRYNLDLNF 103

Query: 77  GVRAYNPNTKVSVYYDSISVTVGFGDQDLAFGV---INPFYQRHEEVKWLDVNLATENIP 136
            +R  NPN ++ VYYD I V   +GDQ   FG+   I+ FYQ H+    +   L  + + 
Sbjct: 104 TIR--NPNRRIGVYYDEIEVRGYYGDQ--RFGMSNNISKFYQGHKNTTVVGTKLVGQQLV 163

Query: 137 LHD-SVSKDLRLEKAAGEIDLDLWIKARIRFKVGIWKSHR-TLRIRCSPVIVYFSNGKS- 194
           L D    KDL  +  +    +D  ++ +IRFK G+ KS R   +I+C   +   SN  S 
Sbjct: 164 LLDGGERKDLNEDVNSQIYRIDAKLRLKIRFKFGLIKSWRFKPKIKCDLKVPLTSNSTSG 223

BLAST of Sgr020898 vs. TAIR 10
Match: AT5G53730.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 81.3 bits (199), Expect = 9.8e-16
Identity = 42/156 (26.92%), Positives = 81/156 (51.92%), Query Frame = 0

Query: 34  LTVLIIWLTVRPKRLSYTVESASVQNFDLSNTQ---LNASFSFGVRAYNPNTKVSVYYDS 93
           L + ++WL + P+R  +++  A + + +L+ +    LN+S    + + NPN KV +YYD 
Sbjct: 40  LIIFLVWLILHPERPEFSLTEADIYSLNLTTSSTHLLNSSVQLTLFSKNPNKKVGIYYDK 99

Query: 94  ISVTVGF-GDQDLAFGVINPFYQRHEEVKWLDVNLATENIPLHDSVSKDLRLEKAAGEID 153
           + V   + G Q  +   + PFYQ HEE+  L   L    +P+  S    +  E++ G+I 
Sbjct: 100 LLVYAAYRGQQITSEASLPPFYQSHEEINLLTAFLQGTELPVAQSFGYQISRERSTGKII 159

Query: 154 LDLWIKARIRFKVGIWKSHR-TLRIRCSPVIVYFSN 185
           + + +  ++R+K+G W S      + C  ++ +  N
Sbjct: 160 IGMKMDGKLRWKIGTWVSGAYRFNVNCLAIVAFGMN 195

BLAST of Sgr020898 vs. TAIR 10
Match: AT2G35980.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 79.7 bits (195), Expect = 2.9e-15
Identity = 52/180 (28.89%), Positives = 90/180 (50.00%), Query Frame = 0

Query: 3   GPLQPPPPQPGRPR----------ILRYVALVLLALIVLVGLTVLIIWLTVRPKRLSYTV 62
           GP  PPP   G  R          +L     V+++LIV++G+  LI WL VRP+ + + V
Sbjct: 13  GPSVPPPAPKGYYRRGHGRGCGCCLLSLFVKVIISLIVILGVAALIFWLIVRPRAIKFHV 72

Query: 63  ESASVQNFDLSNTQ--LNASFSFGVRAYNPNTKVSVYYDSISVTVGFGDQDLAFGVINPF 122
             AS+  FD ++    L  + +  V   NPN ++ +YYD I     +  +  +   + PF
Sbjct: 73  TDASLTRFDHTSPDNILRYNLALTVPVRNPNKRIGLYYDRIEAHAYYEGKRFSTITLTPF 132

Query: 123 YQRHEEVKWLDVNLATENIPLHDS-VSKDLRLEKAAGEIDLDLWIKARIRFKVGIWKSHR 170
           YQ H+    L      +N+ + ++  S+ L  E+ +G  ++++  + R+RFK+G  K  R
Sbjct: 133 YQGHKNTTVLTPTFQGQNLVIFNAGQSRTLNAERISGVYNIEIKFRLRVRFKLGDLKFRR 192

The following BLAST results are available for this feature:

BLAST of Sgr020898 vs. NCBI nr
Match Name      E-value   Identity (%)  Description
XP_022152939.1  5.0e-78   79.29         uncharacterized protein At1g08160 [Momordica charantia]
XP_038894942.1  3.2e-77   74.24         uncharacterized protein At1g08160-like [Benincasa hispida]
XP_008464346.1  4.2e-77   75.25         PREDICTED: uncharacterized protein At1g08160 [Cucumis melo] >KAA0032608.1 unchar...
XP_022937555.1  3.6e-76   76.77         uncharacterized protein At1g08160 [Cucurbita moschata] >XP_023537506.1 uncharact...
XP_022969533.1  1.0e-75   75.76         uncharacterized protein At1g08160 [Cucurbita maxima]

BLAST of Sgr020898 vs. ExPASy Swiss-Prot
Match Name      E-value   Identity (%)  Description
Q8VZ13          7.6e-29   39.90         Uncharacterized protein At1g08160 OS=Arabidopsis thaliana OX=3702 GN=At1g08160 P...
Q9FNH6          2.8e-15   33.51         NDR1/HIN1-like protein 3 OS=Arabidopsis thaliana OX=3702 GN=NHL3 PE=1 SV=1
Q9FI03          1.4e-14   26.92         NDR1/HIN1-like protein 26 OS=Arabidopsis thaliana OX=3702 GN=NHL26 PE=2 SV=1
Q9SJ52          4.0e-14   28.89         NDR1/HIN1-like protein 10 OS=Arabidopsis thaliana OX=3702 GN=NHL10 PE=2 SV=1
Q9SRN1          1.5e-13   26.54         NDR1/HIN1-like protein 2 OS=Arabidopsis thaliana OX=3702 GN=NHL2 PE=2 SV=1

BLAST of Sgr020898 vs. ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A6J1DHJ8      2.4e-78   79.29         uncharacterized protein At1g08160 OS=Momordica charantia OX=3673 GN=LOC111020551...
A0A5D3DBK2      2.0e-77   75.25         LEA_2 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_...
A0A1S3CLP6      2.0e-77   75.25         uncharacterized protein At1g08160 OS=Cucumis melo OX=3656 GN=LOC103502251 PE=4 S...
A0A6J1FH28      1.7e-76   76.77         uncharacterized protein At1g08160 OS=Cucurbita moschata OX=3662 GN=LOC111443929 ...
A0A6J1I194      5.0e-76   75.76         uncharacterized protein At1g08160 OS=Cucurbita maxima OX=3661 GN=LOC111468522 PE...

BLAST of Sgr020898 vs. TAIR 10
Match Name      E-value   Identity (%)  Description
AT5G22870.1     9.4e-43   44.67         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
AT1G08160.1     5.4e-30   39.90         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
AT5G06320.1     2.0e-16   33.51         NDR1/HIN1-like 3
AT5G53730.1     9.8e-16   26.92         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
AT2G35980.1     2.9e-15   28.89         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                                      Source   Source Term    Source Description                                                        Alignment
IPR004864  Late embryogenesis abundant protein, LEA_2 subgroup  PFAM     PF03168        LEA_2                                                                     coord: 75..175; e-value: 2.0E-13; score: 50.7
None       No IPR available                                     PANTHER  PTHR31852      LATE EMBRYOGENESIS ABUNDANT (LEA) HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY  coord: 12..196
None       No IPR available                                     PANTHER  PTHR31852:SF5  GB|AAF18257.1                                                             coord: 12..196

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr020898.1   Sgr020898.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0046658      anchored component of plasma membrane
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0009506      plasmodesma