Sgr029645 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr029645
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: glycine-rich cell wall structural protein 1.8-like
Location: tig00153449: 1346571 .. 1347305 (-)
RNA-Seq Expression: Sgr029645
Synteny: Sgr029645
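The Location field packs a contig name, start and end coordinates, and a strand into one string. The sketch below shows one way such a string could be parsed and the feature length derived. The helper name `parse_location` and the regular expression are illustrative assumptions, not part of the database's own tooling, and the coordinates are assumed to be 1-based and inclusive.

```python
import re

# Hypothetical pattern for strings like "tig00153449: 1346571 .. 1347305 (-)".
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into contig, start, end, strand and length."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    start, end = int(start), int(end)
    if start > end:
        raise ValueError("start must not exceed end")
    return {
        "contig": contig,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }


loc = parse_location("tig00153449: 1346571 .. 1347305 (-)")
print(loc["contig"], loc["strand"], loc["length"])  # tig00153449 - 735
```

Under that assumption the feature spans 735 nt, which is 245 codons including the stop codon. That is consistent with a 244-residue protein if the gene has no introns, as the apparently identical gene, mRNA and CDS sequences listed under Sequences suggest.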
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTATCTCTAGAGCTTTCTCCCTTGGTTTTCTTCTGCTGGTGGGTTTTGGCTTAGCTTCTGCTGCCAGAACCCTTCTCAGTTATGATCCTCCTGCACGTCACCCAGTGGTAGGATATGATTACGATCGTCCTGTGCATAACCCGAGAGTAGGGTATGATCCTGACCATCATGATGGACCCTATGGTGGATATGGTGGTGGTGCTGGTGGAGGATATGGGGGCGGAGCTGGCTCTGCTCTTGGAGGATCGGGATATGGAAGTGGTGGCGGGGAAGGAGGTGGTTCTGGATATGGAAGTGTAGGAGATCACGGGGTTGGTTATGGTAGTGGTGGAGGGGCTGGTTCTGGATATGGAGATGTAGGAGGGCATGGAAAAGGCTATGGTAGCGGTGGTGGTGGAGGACACGGGAGTGGATACGGAGGCGAGGCGGAGCATGGGGTTGGTTATGGTGGTGGTGCAGGTGGAGGATATGGAAGTGGAGGCGGCACCGGATATGGCCCAGGAGGAGAGCATGGGGTTGGCTATGGAAGTGGAGGAGGAGCTGGGTCTGGTAGTGGTTACGGTGGTGGAGCTAAAGGATATGGAGGAGGAAGCGGTGGTGGAAAAGGTGGTGGTGGTGGAGCAGGTTACGGTCCTGGAGGAGGACATGGAAGCGGATATGGTGGTGGTGAAGGAGCAGGAAGTGGCTATGGCGGCAGTGATGGAGGATATGATGGTGGATATGCACCTTAA

mRNA sequence

ATGGCTATCTCTAGAGCTTTCTCCCTTGGTTTTCTTCTGCTGGTGGGTTTTGGCTTAGCTTCTGCTGCCAGAACCCTTCTCAGTTATGATCCTCCTGCACGTCACCCAGTGGTAGGATATGATTACGATCGTCCTGTGCATAACCCGAGAGTAGGGTATGATCCTGACCATCATGATGGACCCTATGGTGGATATGGTGGTGGTGCTGGTGGAGGATATGGGGGCGGAGCTGGCTCTGCTCTTGGAGGATCGGGATATGGAAGTGGTGGCGGGGAAGGAGGTGGTTCTGGATATGGAAGTGTAGGAGATCACGGGGTTGGTTATGGTAGTGGTGGAGGGGCTGGTTCTGGATATGGAGATGTAGGAGGGCATGGAAAAGGCTATGGTAGCGGTGGTGGTGGAGGACACGGGAGTGGATACGGAGGCGAGGCGGAGCATGGGGTTGGTTATGGTGGTGGTGCAGGTGGAGGATATGGAAGTGGAGGCGGCACCGGATATGGCCCAGGAGGAGAGCATGGGGTTGGCTATGGAAGTGGAGGAGGAGCTGGGTCTGGTAGTGGTTACGGTGGTGGAGCTAAAGGATATGGAGGAGGAAGCGGTGGTGGAAAAGGTGGTGGTGGTGGAGCAGGTTACGGTCCTGGAGGAGGACATGGAAGCGGATATGGTGGTGGTGAAGGAGCAGGAAGTGGCTATGGCGGCAGTGATGGAGGATATGATGGTGGATATGCACCTTAA

Coding sequence (CDS)

ATGGCTATCTCTAGAGCTTTCTCCCTTGGTTTTCTTCTGCTGGTGGGTTTTGGCTTAGCTTCTGCTGCCAGAACCCTTCTCAGTTATGATCCTCCTGCACGTCACCCAGTGGTAGGATATGATTACGATCGTCCTGTGCATAACCCGAGAGTAGGGTATGATCCTGACCATCATGATGGACCCTATGGTGGATATGGTGGTGGTGCTGGTGGAGGATATGGGGGCGGAGCTGGCTCTGCTCTTGGAGGATCGGGATATGGAAGTGGTGGCGGGGAAGGAGGTGGTTCTGGATATGGAAGTGTAGGAGATCACGGGGTTGGTTATGGTAGTGGTGGAGGGGCTGGTTCTGGATATGGAGATGTAGGAGGGCATGGAAAAGGCTATGGTAGCGGTGGTGGTGGAGGACACGGGAGTGGATACGGAGGCGAGGCGGAGCATGGGGTTGGTTATGGTGGTGGTGCAGGTGGAGGATATGGAAGTGGAGGCGGCACCGGATATGGCCCAGGAGGAGAGCATGGGGTTGGCTATGGAAGTGGAGGAGGAGCTGGGTCTGGTAGTGGTTACGGTGGTGGAGCTAAAGGATATGGAGGAGGAAGCGGTGGTGGAAAAGGTGGTGGTGGTGGAGCAGGTTACGGTCCTGGAGGAGGACATGGAAGCGGATATGGTGGTGGTGAAGGAGCAGGAAGTGGCTATGGCGGCAGTGATGGAGGATATGATGGTGGATATGCACCTTAA

Protein sequence

MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDGPYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEGGGSGYGSVGDHGVGYGSGGGAGSGYGDVGGHGKGYGSGGGGGHGSGYGGEAEHGVGYGGGAGGGYGSGGGTGYGPGGEHGVGYGSGGGAGSGSGYGGGAKGYGGGSGGGKGGGGGAGYGPGGGHGSGYGGGEGAGSGYGGSDGGYDGGYAP
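The protein sequence corresponds to a translation of the CDS. The sketch below is a minimal translator that assumes the standard genetic code and does not use any external library. The names `CODON_TABLE` and `translate` are illustrative, and only the first and last codons of the CDS above are used as input.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the polypeptide
            break
        residues.append(amino_acid)
    return "".join(residues)


# First six codons and last three codons of the CDS listed above.
print(translate("ATGGCTATCTCTAGAGCT"))  # MAISRA
print(translate("GCACCTTAA"))           # AP
```

The two outputs match the first six and last two residues of the protein sequence shown above.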
Homology
BLAST of Sgr029645 vs. NCBI nr
Match: XP_038905926.1 (glycine-rich cell wall structural protein 1.8-like [Benincasa hispida])

HSP 1 Score: 208.8 bits (530), Expect = 5.4e-50
Identity = 192/271 (70.85%), Positives = 200/271 (73.80%), Query Frame = 0

Query: 1   MAISRAFSLGFLLLVGFGLASAARTLLSYD-PPARHPVVGYD-YDRPVHNPRVGYDPDHH 60
           MAI R  S GFLLLVG GLASA R LLSYD PP R    GYD YD PVHNP+VGY+ DHH
Sbjct: 9   MAILRTLSFGFLLLVGLGLASATRALLSYDIPPHRS---GYDNYDHPVHNPKVGYERDHH 68

Query: 61  DGPYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEGGGSGYGSVGDHGVGYGS-------- 120
           DGP   YGGGAGGGYGGGAGS+LGGSGYGSGG  GGGSGY   GDHGVGYGS        
Sbjct: 69  DGP---YGGGAGGGYGGGAGSSLGGSGYGSGG--GGGSGYAGAGDHGVGYGSGGGGGYGA 128

Query: 121 ---------------GGGAGSGYGDVGGHGKGYGSGGGGGHGSGYGGEAEHGVGYGGGAG 180
                          GGG+G GYGD+GGHGKGYGSGGGG  GSGYGG  +HGVGYG G G
Sbjct: 129 GVGSDLGGSGYGSGGGGGSGGGYGDLGGHGKGYGSGGGG--GSGYGGRGDHGVGYGSGGG 188

Query: 181 GGYGSGGGTGYGPGGEHGVGYGSGGGAGSGSGYGGGAKGYGGGSGGGKGGGGGAGYGPGG 240
           GGYGSGGG GYGPG EHGVGYGSGGG GSGSGY GG+KGYGGGS    GGGGGAGYG GG
Sbjct: 189 GGYGSGGGAGYGPGVEHGVGYGSGGGGGSGSGY-GGSKGYGGGS----GGGGGAGYG-GG 248

Query: 241 GHGSGYGGGEGAGSGYGGS--DGGYDGGYAP 245
            HGS    G GAGSGYGGS  +GGYDGGYAP
Sbjct: 249 AHGS----GGGAGSGYGGSGEEGGYDGGYAP 259

BLAST of Sgr029645 vs. NCBI nr
Match: XP_023539508.1 (glycine-rich cell wall structural protein 1.8-like [Cucurbita pepo subsp. pepo] >XP_023539510.1 glycine-rich cell wall structural protein 1.8-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 207.6 bits (527), Expect = 1.2e-49
Identity = 182/267 (68.16%), Positives = 193/267 (72.28%), Query Frame = 0

Query: 1   MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDG 60
           MAIS++ SL FLLL+G GLASAARTLLSYDPP  H  VGY Y    HNPRVGYD DHHDG
Sbjct: 1   MAISKSLSLTFLLLLGLGLASAARTLLSYDPP-HHSDVGYGYQ---HNPRVGYDHDHHDG 60

Query: 61  PYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEGGGSGYGSVGD----------------- 120
           PYG YGGG+GGGYG GAGSALGGSGYGSGGG GGGSGY  VGD                 
Sbjct: 61  PYGAYGGGSGGGYGAGAGSALGGSGYGSGGGGGGGSGYAGVGDLGGSGYGSGGGGGSGVG 120

Query: 121 ------HGVGYGSGGGAGS--GYGDVGGHGKGYGSGGGGGHGSGYGGEAEHGVGYGGGAG 180
                 HG GYGSGGG GS  GYGD+GGHGKGYGSGGGG  GSGYGG A+HGVGYG G G
Sbjct: 121 YGDLGGHGKGYGSGGGGGSGVGYGDLGGHGKGYGSGGGG--GSGYGGGADHGVGYGSGGG 180

Query: 181 GGYGSGGGTGYGPGGEHGVGYGSGGGAGSGSGYGGGAKGYGGGSGGGKGGGGGAGYGPGG 240
           GGYG+GGG GYGPGG+ GVGYGSGGG GSG GYGGGAKGYGGG+        G+GYG GG
Sbjct: 181 GGYGAGGGAGYGPGGDRGVGYGSGGGGGSGGGYGGGAKGYGGGA-------KGSGYGSGG 240

Query: 241 GHGSGYGGGEGAGSGYGGSDGGYDGGY 243
           G GSGYGG         G +GGYDGGY
Sbjct: 241 GAGSGYGG--------VGGEGGYDGGY 246

BLAST of Sgr029645 vs. NCBI nr
Match: XP_022924241.1 (glycine-rich cell wall structural protein 1.8-like [Cucurbita moschata])

HSP 1 Score: 203.0 bits (515), Expect = 3.0e-48
Identity = 184/285 (64.56%), Positives = 197/285 (69.12%), Query Frame = 0

Query: 1   MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDG 60
           MAIS++ SL FLLL+G GLASAARTLLSYDPP  H  VGY Y    HNPRVGYD DHHDG
Sbjct: 1   MAISKSLSLTFLLLLGLGLASAARTLLSYDPP-HHSDVGYGYQ---HNPRVGYDHDHHDG 60

Query: 61  PYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEG---------GGSGYGS----------- 120
           PYG YGGG+GGGYG GAGS+LGGSGYGSGGG G         GGSGYGS           
Sbjct: 61  PYGAYGGGSGGGYGAGAGSSLGGSGYGSGGGGGSGYAGVGDVGGSGYGSGGGGGSGVGYG 120

Query: 121 -VGDHGVGYGSGGGAGS--GYGDVGGHGKGYGSGGGGGH--------------------G 180
            +G HG GYGSGGG GS  GYGD+GGHGKGYGSGGGGG                     G
Sbjct: 121 DLGGHGKGYGSGGGGGSGVGYGDLGGHGKGYGSGGGGGSGVGYGDLGGHGKGYGSGGGGG 180

Query: 181 SGYGGEAEHGVGYGGGAGGGYGSGGGTGYGPGGEHGVGYGSGGGAGSGSGYGGGAKGYGG 240
           SGYGG A+HGVGYG G GGGYG+GGG GYGPGG+HGVGYGSGGG GSG GYGGGAKGYGG
Sbjct: 181 SGYGGGADHGVGYGSGGGGGYGAGGGAGYGPGGDHGVGYGSGGGGGSGGGYGGGAKGYGG 240

Query: 241 GSGGGKGGGGGAGYGPGGGHGSGYGGGEGAGSGYGGSDGGYDGGY 243
           G+        G+GYG GGG GSGYG       G GG +GGYDGGY
Sbjct: 241 GA-------KGSGYGSGGGAGSGYG-------GVGGEEGGYDGGY 267

BLAST of Sgr029645 vs. NCBI nr
Match: XP_023005512.1 (glycine-rich cell wall structural protein 1.8-like [Cucurbita maxima])

HSP 1 Score: 201.4 bits (511), Expect = 8.7e-48
Identity = 185/267 (69.29%), Positives = 195/267 (73.03%), Query Frame = 0

Query: 1   MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDG 60
           MAIS++ SL FLLL+G GLASAARTLLSYDPP  H  VGY Y    HNPRVGYD DHHDG
Sbjct: 1   MAISKSLSLTFLLLLGLGLASAARTLLSYDPP-HHSDVGYGYQ---HNPRVGYDHDHHDG 60

Query: 61  PYGGYGGGAGGGY--GGGAGSALGGSGYGSGGGEGGGSGYGSVGD-HGVGYGSGGGAGS- 120
           PYG YGGG+GGGY  G GAGSALGGSGYGSGGG GGGSGY  VGD  G GYGSGGG GS 
Sbjct: 61  PYGAYGGGSGGGYGAGAGAGSALGGSGYGSGGGGGGGSGYAGVGDLGGSGYGSGGGGGSG 120

Query: 121 -GYGDVGGHGKGYGSGGGGGH--------------------GSGYGGEAEHGVGYGGGAG 180
            GYGD+GGHGKGYGSGGGGG                     GSGYGG A+HGVGYG G G
Sbjct: 121 VGYGDLGGHGKGYGSGGGGGSGVGYGDLSGHGKGYGSGGGGGSGYGGGADHGVGYGSGGG 180

Query: 181 GGYGSGGGTGYGPGGEHGVGYGSGGGAGSGSGYGGGAKGYGGGSGGGKGGGGGAGYGPGG 240
           GGYG+GGG GYGPGG+HGVGYGSGGG GSG GYGG  KGYGGG  G  GG  G+GYG GG
Sbjct: 181 GGYGAGGGAGYGPGGDHGVGYGSGGGGGSGGGYGGRDKGYGGGPKGYGGGAKGSGYGSGG 240

Query: 241 GHGSGYGGGEGAGSGYGGSDGGYDGGY 243
           G GSGYG       G GG +GGYDGGY
Sbjct: 241 GAGSGYG-------GVGGEEGGYDGGY 256

BLAST of Sgr029645 vs. NCBI nr
Match: KAG6596726.1 (hypothetical protein SDJN03_09906, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 198.0 bits (502), Expect = 9.6e-47
Identity = 184/309 (59.55%), Positives = 196/309 (63.43%), Query Frame = 0

Query: 1   MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDG 60
           MAIS++ SL FLLL+G GLASAARTLLSYDPP  H  VGY Y    HNPRVGYD DHHDG
Sbjct: 1   MAISKSLSLTFLLLLGLGLASAARTLLSYDPP-HHSDVGYGYQ---HNPRVGYDHDHHDG 60

Query: 61  PYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEGGGSGYGSVGD----------------- 120
           PYG YGGG+GGGYG GAGS+LGGSGYGSGGG GGGSGY  VGD                 
Sbjct: 61  PYGAYGGGSGGGYGAGAGSSLGGSGYGSGGGGGGGSGYAGVGDVGGSGYGSGGGGGSGVG 120

Query: 121 ----------------------------HGVGYGSGGGAGS--GYGDVGGHGKGYGSGGG 180
                                       HG GYGSGGG GS  GYGD+GGHGKGYGSGGG
Sbjct: 121 YGDLGGHGKGYGSGGGGGSGVGYGDLGGHGKGYGSGGGGGSGVGYGDLGGHGKGYGSGGG 180

Query: 181 GGH--------------------GSGYGGEAEHGVGYGGGAGGGYGSGGGTGYGPGGEHG 240
           GG                     GSGYGG A+HGVGYG G GGGYG+GGG GYGPGG+HG
Sbjct: 181 GGSGVGYGDLGGHGKGYGSGGGGGSGYGGGADHGVGYGSGGGGGYGAGGGAGYGPGGDHG 240

Query: 241 VGYGSGGGAGSGSGYGGGAKGYGGGSGGGKGGGGGAGYGPGGGHGSGYGGGEGAGSGYGG 243
           VGYGSGGG GSG GYGGGAKGYGGG+        G+GYG GGG GSGYG       G GG
Sbjct: 241 VGYGSGGGGGSGGGYGGGAKGYGGGA-------KGSGYGSGGGAGSGYG-------GVGG 291

BLAST of Sgr029645 vs. ExPASy Swiss-Prot
Match: P10496 (Glycine-rich cell wall structural protein 1.8 OS=Phaseolus vulgaris OX=3885 PE=2 SV=1)

HSP 1 Score: 64.7 bits (156), Expect = 1.7e-09
Identity = 118/195 (60.51%), Positives = 120/195 (61.54%), Query Frame = 0

Query: 52  GYDPDHHDGPYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEGGGSGYGSVGDHGVGYGSG 111
           G   +H  G  GG GGGAGGGYG G G   GG+G G GGG GGG G G     G G G G
Sbjct: 266 GAGGEHGGGAGGGQGGGAGGGYGAG-GEHGGGAGGGQGGGAGGGYGAGGEHGGGGGGGQG 325

Query: 112 GGAGSGYGDVGGHGKGYGSGGGGGHGSGYGGEAEHGVGYGGGAGGGYGSGGGTGYGPGGE 171
           GGAG GY  VG HG GYG G GGG G GYG   EHG GYGGG GG    G G GYG GGE
Sbjct: 326 GGAGGGYAAVGEHGGGYGGGQGGGDGGGYGTGGEHGGGYGGGQGG----GAGGGYGTGGE 385

Query: 172 HGVGYGSGGGAGSGSGYGG--GAKGYGGGSGGGKGGGGGAGYGPGGGHGSGYGGGEGAGS 231
           HG GYG G G G G G GG  GA GYGGG GG  GGG G GYG GG HG GYGGG G G 
Sbjct: 386 HGGGYGGGQGGGGGYGAGGDHGAAGYGGGEGG--GGGSGGGYGDGGAHGGGYGGGAGGGG 445

Query: 232 GYGGS---DGGYDGG 242
           GYG      GGY GG
Sbjct: 446 GYGAGGAHGGGYGGG 453

BLAST of Sgr029645 vs. ExPASy Swiss-Prot
Match: P0C5C7 (Glycine-rich cell wall structural protein 2 OS=Oryza sativa subsp. indica OX=39946 GN=GRP0.9 PE=2 SV=1)

HSP 1 Score: 47.8 bits (112), Expect = 2.1e-04
Identity = 111/231 (48.05%), Positives = 125/231 (54.11%), Query Frame = 0

Query: 1   MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDG 60
           MA ++  +L  L+L+  G+ ++ARTLL Y P                             
Sbjct: 1   MATTKHLALAILVLLSIGMTTSARTLLGYGP----------------------------- 60

Query: 61  PYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEGGGSGYGSVGDHGVGYGSGGGAGSGYGD 120
             GG GGG GGG GGG G   GGSGYGSG G G G G G  G  G GYG GGG G G G+
Sbjct: 61  --GGGGGGGGGGEGGGGG--YGGSGYGSGSGYGEGGGSG--GAAGGGYGRGGGGGGGGGE 120

Query: 121 VGGHGKGYGSGGGGGHGSGYGGEAEHGVGYGGGAGGGYGSGGGTGYGPGGEHGVGYGSGG 180
            GG G GYGSG G G+G+G GG    G G GGG GGG G G G GYG G  +G GYGSG 
Sbjct: 121 GGGSGSGYGSGQGSGYGAGVGGAG--GYGSGGGGGGGQGGGAG-GYGQGSGYGSGYGSGA 180

Query: 181 GAGSGSGYGGGAKGYGGGSGGGKGGGGGAGYGPGGGHGSGYGGGEGAGSGY 232
           G   G GY         GSGGG GGGGG G G G G GSGYG G G G+G+
Sbjct: 181 GGAHGGGY---------GSGGGGGGGGGQGGGSGSGSGSGYGSGSGGGNGH 184

BLAST of Sgr029645 vs. ExPASy Swiss-Prot
Match: A3C5A7 (Glycine-rich cell wall structural protein 2 OS=Oryza sativa subsp. japonica OX=39947 GN=GRP0.9 PE=2 SV=1)

HSP 1 Score: 47.8 bits (112), Expect = 2.1e-04
Identity = 111/231 (48.05%), Positives = 125/231 (54.11%), Query Frame = 0

Query: 1   MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDG 60
           MA ++  +L  L+L+  G+ ++ARTLL Y P                             
Sbjct: 1   MATTKHLALAILVLLSIGMTTSARTLLGYGP----------------------------- 60

Query: 61  PYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEGGGSGYGSVGDHGVGYGSGGGAGSGYGD 120
             GG GGG GGG GGG G   GGSGYGSG G G G G G  G  G GYG GGG G G G+
Sbjct: 61  --GGGGGGGGGGEGGGGG--YGGSGYGSGSGYGEGGGSG--GAAGGGYGRGGGGGGGGGE 120

Query: 121 VGGHGKGYGSGGGGGHGSGYGGEAEHGVGYGGGAGGGYGSGGGTGYGPGGEHGVGYGSGG 180
            GG G GYGSG G G+G+G GG    G G GGG GGG G G G GYG G  +G GYGSG 
Sbjct: 121 GGGSGSGYGSGQGSGYGAGVGGAG--GYGSGGGGGGGQGGGAG-GYGQGSGYGSGYGSGA 180

Query: 181 GAGSGSGYGGGAKGYGGGSGGGKGGGGGAGYGPGGGHGSGYGGGEGAGSGY 232
           G   G GY         GSGGG GGGGG G G G G GSGYG G G G+G+
Sbjct: 181 GGAHGGGY---------GSGGGGGGGGGQGGGSGSGSGSGYGSGSGGGNGH 184

BLAST of Sgr029645 vs. ExPASy Swiss-Prot
Match: Q9SIH2 (Glycine-rich protein DOT1 OS=Arabidopsis thaliana OX=3702 GN=DOT1 PE=2 SV=1)

HSP 1 Score: 47.4 bits (111), Expect = 2.7e-04
Identity = 125/252 (49.60%), Positives = 135/252 (53.57%), Query Frame = 0

Query: 14  LVGFGLASAARTLLSYDPPARHPVVGYDYDRPVH---NPRVGYDPDHHDGPYGG------ 73
           L+G GL SA R LLS    +   V  Y  +  +       +G  P    G  GG      
Sbjct: 13  LIGLGLCSARRALLS-SSESEAEVAAYGVNSGLSAGLGVGIGGGPGGGSGYGGGSGEGGG 72

Query: 74  --------YGGGAGGGYGGGAGSALG---GSGYGSGGGEGGGSGYGSVGDHGVGYGSGGG 133
                    GGG GGG+GGGAG   G   G GYG G GEGGG+GYG     G G G GGG
Sbjct: 73  AGGHGEGHIGGGGGGGHGGGAGGGGGGGPGGGYGGGSGEGGGAGYGGGEAGGHGGGGGGG 132

Query: 134 AGSGYGDVGG-HGKGYGSGGGGGHGSGYGGEAEHGVGYGGGAGGGYGSGGGTGYGPGGEH 193
           AG G G  GG HG GYG G G G G GYGG      G+GGG GGG G GGG G G GG H
Sbjct: 133 AGGGGGGGGGAHGGGYGGGQGAGAGGGYGGGG--AGGHGGGGGGGNGGGGGGGSGEGGAH 192

Query: 194 GVGYGSGGGAGSGSGYGGGAKGYGGGSGGGKGGGGGAGYGPGGGHGSGYGGGEGAGSGYG 245
           G GYG+GGGAG G G G GA G+GGG GGG G GGG G G G    SGYG G GAG G G
Sbjct: 193 GGGYGAGGGAGEGYGGGAGAGGHGGGGGGGGGSGGGGGGGGGYAAASGYGHGGGAGGGEG 252

BLAST of Sgr029645 vs. ExPASy TrEMBL
Match: A0A6J1E911 (glycine-rich cell wall structural protein 1.8-like OS=Cucurbita moschata OX=3662 GN=LOC111431731 PE=4 SV=1)

HSP 1 Score: 203.0 bits (515), Expect = 1.4e-48
Identity = 184/285 (64.56%), Positives = 197/285 (69.12%), Query Frame = 0

Query: 1   MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDG 60
           MAIS++ SL FLLL+G GLASAARTLLSYDPP  H  VGY Y    HNPRVGYD DHHDG
Sbjct: 1   MAISKSLSLTFLLLLGLGLASAARTLLSYDPP-HHSDVGYGYQ---HNPRVGYDHDHHDG 60

Query: 61  PYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEG---------GGSGYGS----------- 120
           PYG YGGG+GGGYG GAGS+LGGSGYGSGGG G         GGSGYGS           
Sbjct: 61  PYGAYGGGSGGGYGAGAGSSLGGSGYGSGGGGGSGYAGVGDVGGSGYGSGGGGGSGVGYG 120

Query: 121 -VGDHGVGYGSGGGAGS--GYGDVGGHGKGYGSGGGGGH--------------------G 180
            +G HG GYGSGGG GS  GYGD+GGHGKGYGSGGGGG                     G
Sbjct: 121 DLGGHGKGYGSGGGGGSGVGYGDLGGHGKGYGSGGGGGSGVGYGDLGGHGKGYGSGGGGG 180

Query: 181 SGYGGEAEHGVGYGGGAGGGYGSGGGTGYGPGGEHGVGYGSGGGAGSGSGYGGGAKGYGG 240
           SGYGG A+HGVGYG G GGGYG+GGG GYGPGG+HGVGYGSGGG GSG GYGGGAKGYGG
Sbjct: 181 SGYGGGADHGVGYGSGGGGGYGAGGGAGYGPGGDHGVGYGSGGGGGSGGGYGGGAKGYGG 240

Query: 241 GSGGGKGGGGGAGYGPGGGHGSGYGGGEGAGSGYGGSDGGYDGGY 243
           G+        G+GYG GGG GSGYG       G GG +GGYDGGY
Sbjct: 241 GA-------KGSGYGSGGGAGSGYG-------GVGGEEGGYDGGY 267

BLAST of Sgr029645 vs. ExPASy TrEMBL
Match: A0A6J1KTC0 (glycine-rich cell wall structural protein 1.8-like OS=Cucurbita maxima OX=3661 GN=LOC111498476 PE=4 SV=1)

HSP 1 Score: 201.4 bits (511), Expect = 4.2e-48
Identity = 185/267 (69.29%), Positives = 195/267 (73.03%), Query Frame = 0

Query: 1   MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDG 60
           MAIS++ SL FLLL+G GLASAARTLLSYDPP  H  VGY Y    HNPRVGYD DHHDG
Sbjct: 1   MAISKSLSLTFLLLLGLGLASAARTLLSYDPP-HHSDVGYGYQ---HNPRVGYDHDHHDG 60

Query: 61  PYGGYGGGAGGGY--GGGAGSALGGSGYGSGGGEGGGSGYGSVGD-HGVGYGSGGGAGS- 120
           PYG YGGG+GGGY  G GAGSALGGSGYGSGGG GGGSGY  VGD  G GYGSGGG GS 
Sbjct: 61  PYGAYGGGSGGGYGAGAGAGSALGGSGYGSGGGGGGGSGYAGVGDLGGSGYGSGGGGGSG 120

Query: 121 -GYGDVGGHGKGYGSGGGGGH--------------------GSGYGGEAEHGVGYGGGAG 180
            GYGD+GGHGKGYGSGGGGG                     GSGYGG A+HGVGYG G G
Sbjct: 121 VGYGDLGGHGKGYGSGGGGGSGVGYGDLSGHGKGYGSGGGGGSGYGGGADHGVGYGSGGG 180

Query: 181 GGYGSGGGTGYGPGGEHGVGYGSGGGAGSGSGYGGGAKGYGGGSGGGKGGGGGAGYGPGG 240
           GGYG+GGG GYGPGG+HGVGYGSGGG GSG GYGG  KGYGGG  G  GG  G+GYG GG
Sbjct: 181 GGYGAGGGAGYGPGGDHGVGYGSGGGGGSGGGYGGRDKGYGGGPKGYGGGAKGSGYGSGG 240

Query: 241 GHGSGYGGGEGAGSGYGGSDGGYDGGY 243
           G GSGYG       G GG +GGYDGGY
Sbjct: 241 GAGSGYG-------GVGGEEGGYDGGY 256

BLAST of Sgr029645 vs. ExPASy TrEMBL
Match: A0A6J1CWP3 (glycine-rich cell wall structural protein 1.8-like isoform X1 OS=Momordica charantia OX=3673 GN=LOC111014911 PE=4 SV=1)

HSP 1 Score: 193.4 bits (490), Expect = 1.1e-45
Identity = 183/269 (68.03%), Positives = 194/269 (72.12%), Query Frame = 0

Query: 1   MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYDRPVHNPRVGYDPDHHDG 60
           MAIS+AFSL FLLL+GFGLASAARTLL ++P A +P     YDR     RVGYD DHHD 
Sbjct: 1   MAISKAFSLSFLLLLGFGLASAARTLLGHEPDAYNP-----YDR-----RVGYDRDHHD- 60

Query: 61  PYGGYGGGAGGGYGGGAGSALGGSGYGSGGGEGGGSGYGSVGDHGVGYGS---------- 120
             G YGGG+GGGYG GAGS+LGGSGYGSGGG GGGSGYG  GDHGVGYGS          
Sbjct: 61  --GAYGGGSGGGYGAGAGSSLGGSGYGSGGGGGGGSGYGGAGDHGVGYGSGGGGGYGGGM 120

Query: 121 -------------GGGAGSGYGDVGGHGKGYGSGGGGGHGSGYGGEAEHGVGYGGGAGGG 180
                        GGG+G GYGD GG GKGYGSGGGG  GSGYGG  +HG GYG G GGG
Sbjct: 121 GSGLGGAGYGSGGGGGSGGGYGDAGGRGKGYGSGGGG--GSGYGGGGDHGAGYGSGGGGG 180

Query: 181 YGS--GGGTGYGPGGEHGVGYGSGGGAGSGSGYGGGAKGYGGGSGGGKGGGGGAGYGPGG 240
           YGS  GGG GYGP GEHGVGYGSGGG GSG GYGG   GYGGGSGGGKGGGG  GYG GG
Sbjct: 181 YGSGGGGGAGYGP-GEHGVGYGSGGGGGSGGGYGGSKGGYGGGSGGGKGGGG--GYGAGG 240

Query: 241 GHGSGYGGGEGAGSGYG--GSDGGYDGGY 243
            HG GYGGG G+G GYG  G +GGYDGGY
Sbjct: 241 AHGGGYGGGGGSGGGYGSSGEEGGYDGGY 251

BLAST of Sgr029645 vs. ExPASy TrEMBL
Match: A0A1S3BJM7 (glycine-rich cell wall structural protein 2-like OS=Cucumis melo OX=3656 GN=LOC103490774 PE=4 SV=1)

HSP 1 Score: 168.7 bits (426), Expect = 3.0e-38
Identity = 177/277 (63.90%), Positives = 192/277 (69.31%), Query Frame = 0

Query: 1   MAISRAFSLGFLLLVGFGLASAARTLLSYDPPARHPVVGYDYD-RPVHNPRVGYDPDHHD 60
           MAIS+  S GFLLLV  GLASAAR+LL YD P        +YD  PV NP+VGY+ DHHD
Sbjct: 1   MAISKTLSFGFLLLVSLGLASAARSLLYYDMPPHRSGYDNNYDNHPVVNPKVGYEHDHHD 60

Query: 61  GPY-------GGYGGGAGGGY--GGGAGSALGGSGYGSGGGEGGGSGYGSVGDHGVGYGS 120
           G Y         YGGGAGGGY  GGGAGS+LGGSGYGSGG  GGGSGYG VG+HGVGYGS
Sbjct: 61  GYYHDRDHHDAPYGGGAGGGYGAGGGAGSSLGGSGYGSGG--GGGSGYGGVGNHGVGYGS 120

Query: 121 -----------------------GGGAGSGYGDVGGHGKGYGSGGGGGHGSGYGGEAEHG 180
                                  GGG+G GYGD+GGHGKGYGSGGGG  GSGYGG  +HG
Sbjct: 121 GGGGGYGAGVGSDLGGSGYGSGGGGGSGGGYGDLGGHGKGYGSGGGG--GSGYGGRGDHG 180

Query: 181 VGYGGGAGGGYGSGGGTGYGPGGEHGVGYGSGGGAGSGSGYGGGAKGYGGGSGGGKGGGG 240
           VGYG G GGGYGSG G   G G +HGVGYGSGGG G+G GY GG+KGYG GS    GGGG
Sbjct: 181 VGYGSGGGGGYGSGVG---GAGVDHGVGYGSGGGGGAGGGY-GGSKGYGAGS----GGGG 240

Query: 241 GAGYGPGGGHGSGYGGGEGAGSGYGGSDGGYDGGYAP 245
           GAGYG GG HGSGYG G GAG+G  G +GGYDGGYAP
Sbjct: 241 GAGYG-GGAHGSGYGSGGGAGAG-SGEEGGYDGGYAP 263

BLAST of Sgr029645 vs. ExPASy TrEMBL
Match: A0A0A0L1U7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G009540 PE=4 SV=1)

HSP 1 Score: 168.3 bits (425), Expect = 3.9e-38
Identity = 182/281 (64.77%), Positives = 194/281 (69.04%), Query Frame = 0

Query: 1   MAISRAFSLGFLLLVGFGLASAARTLLSYD-PPARHPVVGYD-YDRPVHNPRVGY----- 60
           MAIS+  S GFLLLV  GLASAAR+LLSYD PP R    GYD YD PV NP+VGY     
Sbjct: 1   MAISKTLSFGFLLLVSLGLASAARSLLSYDIPPHRS---GYDNYDHPVVNPKVGYEHDRR 60

Query: 61  -----DPDHHDGPYGGYGGGAGGGY--GGGAGSALGGSGYGSGGGEGGGSGYGSVGDHGV 120
                D DHHD P   YGGGAGGGY  G GAGS+LGGSGYGSGG  GGGSGYG VG+H V
Sbjct: 61  DGYYHDRDHHDAP---YGGGAGGGYGAGAGAGSSLGGSGYGSGG--GGGSGYGGVGNHEV 120

Query: 121 GYGS-----------------------GGGAGSGYGDVGGHGKGYGSGGGGGHGSGYGGE 180
           GYGS                       GGG+G GYGD+GG GKGYGSGGGG  GSGYGG 
Sbjct: 121 GYGSGGGGGYGAGVGSDLGGSGYGSGGGGGSGGGYGDLGGRGKGYGSGGGG--GSGYGGR 180

Query: 181 AEHGVGYGGGAGGGYGSGGGTGYGPGGEHGVGYGSGGGAGSGSGYGGGAKGYGGGSGGGK 240
            +HGVGYG G GGGYGSG G G G   +HGVGYGSGGG G+GSGY GG+KGYGGGS    
Sbjct: 181 GDHGVGYGSGGGGGYGSGVGGGAGV-VDHGVGYGSGGGGGAGSGY-GGSKGYGGGS---- 240

Query: 241 GGGGGAGYGPGGGHGSGYGGGEGAGSGYGGSDGGYDGGYAP 245
           GGGGGAGYG GG HGSGYG G GAGS   G +GGYDGGYAP
Sbjct: 241 GGGGGAGYG-GGAHGSGYGSGGGAGS---GEEGGYDGGYAP 261

BLAST of Sgr029645 vs. TAIR 10
Match: AT2G36120.1 (Glycine-rich protein family )

HSP 1 Score: 47.4 bits (111), Expect = 1.9e-05
Identity = 125/252 (49.60%), Positives = 135/252 (53.57%), Query Frame = 0

Query: 14  LVGFGLASAARTLLSYDPPARHPVVGYDYDRPVH---NPRVGYDPDHHDGPYGG------ 73
           L+G GL SA R LLS    +   V  Y  +  +       +G  P    G  GG      
Sbjct: 13  LIGLGLCSARRALLS-SSESEAEVAAYGVNSGLSAGLGVGIGGGPGGGSGYGGGSGEGGG 72

Query: 74  --------YGGGAGGGYGGGAGSALG---GSGYGSGGGEGGGSGYGSVGDHGVGYGSGGG 133
                    GGG GGG+GGGAG   G   G GYG G GEGGG+GYG     G G G GGG
Sbjct: 73  AGGHGEGHIGGGGGGGHGGGAGGGGGGGPGGGYGGGSGEGGGAGYGGGEAGGHGGGGGGG 132

Query: 134 AGSGYGDVGG-HGKGYGSGGGGGHGSGYGGEAEHGVGYGGGAGGGYGSGGGTGYGPGGEH 193
           AG G G  GG HG GYG G G G G GYGG      G+GGG GGG G GGG G G GG H
Sbjct: 133 AGGGGGGGGGAHGGGYGGGQGAGAGGGYGGGG--AGGHGGGGGGGNGGGGGGGSGEGGAH 192

Query: 194 GVGYGSGGGAGSGSGYGGGAKGYGGGSGGGKGGGGGAGYGPGGGHGSGYGGGEGAGSGYG 245
           G GYG+GGGAG G G G GA G+GGG GGG G GGG G G G    SGYG G GAG G G
Sbjct: 193 GGGYGAGGGAGEGYGGGAGAGGHGGGGGGGGGSGGGGGGGGGYAAASGYGHGGGAGGGEG 252

BLAST of Sgr029645 vs. TAIR 10
Match: AT5G46730.1 (glycine-rich protein )

HSP 1 Score: 42.0 bits (97), Expect = 8.2e-04
Identity = 114/203 (56.16%), Positives = 120/203 (59.11%), Query Frame = 0

Query: 52  GYDPDHHDGPYGGYGG------GAGGGYGGGAGSALGGSGYGSGGGEGGGSGY-GSVGDH 111
           GY     +G  GGYGG      G G G+GGG G A    GY SG GEGGG GY G+ G H
Sbjct: 60  GYGGGSGEGAGGGYGGAEGYASGGGSGHGGGGGGAASSGGYASGAGEGGGGGYGGAAGGH 119

Query: 112 --GVGYGSGGGAGSGYGDVGGHGKGYGSG---GGGGHGSGYGGEAEHGVGYGGGAGGGYG 171
             G G GSGGG GS YG  G H  GYG+G   GGG   SGYGG A     YGGG G G G
Sbjct: 120 AGGGGGGSGGGGGSAYGAGGEHASGYGNGAGEGGGAGASGYGGGA-----YGGGGGHGGG 179

Query: 172 SGGGTGYGPGGEHGVGYGSGGGAGSGSGYGGGAKGYGGGSGGGKGGGGGAGYGPGGGHGS 231
            GGG+  G  G  G G G GGGAG G G  GGA GYGGG GGG GGGG   YG GG HG 
Sbjct: 180 GGGGSAGGAHGGSGYGGGEGGGAG-GGGSHGGAGGYGGGGGGGSGGGG--AYGGGGAHGG 239

Query: 232 GYGGGEGAGSGY-GGSDGGYDGG 242
           GYG G G G GY GG+ GGY GG
Sbjct: 240 GYGSGGGEGGGYGGGAAGGYGGG 254

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value   Identity (%)  Description
XP_038905926.1  5.4e-50   70.85         glycine-rich cell wall structural protein 1.8-like [Benincasa hispida]
XP_023539508.1  1.2e-49   68.16         glycine-rich cell wall structural protein 1.8-like [Cucurbita pepo subsp. pepo] ...
XP_022924241.1  3.0e-48   64.56         glycine-rich cell wall structural protein 1.8-like [Cucurbita moschata]
XP_023005512.1  8.7e-48   69.29         glycine-rich cell wall structural protein 1.8-like [Cucurbita maxima]
KAG6596726.1    9.6e-47   59.55         hypothetical protein SDJN03_09906, partial [Cucurbita argyrosperma subsp. sorori...

ExPASy Swiss-Prot
Match Name      E-value   Identity (%)  Description
P10496          1.7e-09   60.51         Glycine-rich cell wall structural protein 1.8 OS=Phaseolus vulgaris OX=3885 PE=2...
P0C5C7          2.1e-04   48.05         Glycine-rich cell wall structural protein 2 OS=Oryza sativa subsp. indica OX=399...
A3C5A7          2.1e-04   48.05         Glycine-rich cell wall structural protein 2 OS=Oryza sativa subsp. japonica OX=3...
Q9SIH2          2.7e-04   49.60         Glycine-rich protein DOT1 OS=Arabidopsis thaliana OX=3702 GN=DOT1 PE=2 SV=1

ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A6J1E911      1.4e-48   64.56         glycine-rich cell wall structural protein 1.8-like OS=Cucurbita moschata OX=3662...
A0A6J1KTC0      4.2e-48   69.29         glycine-rich cell wall structural protein 1.8-like OS=Cucurbita maxima OX=3661 G...
A0A6J1CWP3      1.1e-45   68.03         glycine-rich cell wall structural protein 1.8-like isoform X1 OS=Momordica chara...
A0A1S3BJM7      3.0e-38   63.90         glycine-rich cell wall structural protein 2-like OS=Cucumis melo OX=3656 GN=LOC1...
A0A0A0L1U7      3.9e-38   64.77         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G009540 PE=4 SV=1

TAIR 10
Match Name      E-value   Identity (%)  Description
AT2G36120.1     1.9e-05   49.60         Glycine-rich protein family
AT5G46730.1     8.2e-04   56.16         glycine-rich protein

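The Identity and Positives percentages reported for each HSP are the match counts divided by the alignment length, which includes gap columns. The sketch below reproduces that arithmetic for the first NCBI nr hit (192 identities and 200 positives over 271 columns). The function name `percent` is an illustrative assumption.

```python
def percent(matches, alignment_length):
    """Return matches / alignment_length as a percentage with two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    if not 0 <= matches <= alignment_length:
        raise ValueError("matches must lie between 0 and the alignment length")
    return f"{100 * matches / alignment_length:.2f}"


# Values taken from the XP_038905926.1 HSP header.
print(percent(192, 271))  # 70.85
print(percent(200, 271))  # 73.80
```

Because the denominator is the alignment length rather than the query length, hits with long gapped regions (for example the 309-column alignment to KAG6596726.1) can show a lower percentage despite a similar number of identical residues.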
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source       Source Term  Source Description                      Alignment
None      No IPR available  PRINTS       PR01228      EGGSHELL                                coord: 200..218, score: 48.68
                                                                                              coord: 130..145, score: 42.19
                                                                                              coord: 4..20, score: 29.41
                                                                                              coord: 159..169, score: 35.23
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction                     coord: 206..244
None      No IPR available  PANTHER      PTHR37612    FIBROIN HEAVY CHAIN FIB-H LIKE PROTEIN  coord: 1..236
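
The alignment coordinates above refer to positions in the protein sequence. The sketch below shows how a coordinate pair could be mapped to a subsequence, assuming the coordinates are 1-based and inclusive (an assumption about the annotation output, not something this page states). The helper name `segment` is hypothetical, and only the first 24 residues of the protein are used as input.

```python
def segment(protein, start, end):
    """Return residues start..end of a protein (1-based, inclusive)."""
    if not 1 <= start <= end <= len(protein):
        raise ValueError("coordinates fall outside the sequence")
    # Convert the 1-based inclusive range to a Python slice.
    return protein[start - 1:end]


# First 24 residues of the Sgr029645 protein sequence.
n_terminus = "MAISRAFSLGFLLLVGFGLASAAR"

# PRINTS PR01228 match reported at coord 4..20.
print(segment(n_terminus, 4, 20))  # SRAFSLGFLLLVGFGLA
```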

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr029645.1   Sgr029645.1  mRNA