Clc03G04870 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc03G04870
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: glycine-rich cell wall structural protein 1-like
Location: ClcChr03: 4718210 .. 4719566 (-)
RNA-Seq Expression: Clc03G04870
Synteny: Clc03G04870
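
The location field above can be split into chromosome, coordinates and strand. The sketch below is a minimal illustration, assuming the coordinates are 1-based and inclusive (the usual convention for this kind of record, but not stated on the page); `Location` and `parse_location` are hypothetical helper names, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed feature location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'ClcChr03: 4718210 .. 4719566 (-)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("ClcChr03: 4718210 .. 4719566 (-)")
print(loc.seqid, loc.start, loc.end, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 1357 bp on the minus strand of ClcChr03.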
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GGGAAAGGGAGATGGGTTTTAGGCTGCGGCTTCCTCTTTATAAGGCACAAAGAACCAGCGGAAACCCATGCAAAAGTTCTTTCCCACTTTTCTTCTAATAATTAATAATGCCTCAATCTTGTTTTACATTTCTATGTCTTATAGTTCTTGTTGTAAGCCGACTTAGTTTTGGCTATGGAGGCGAAACAGGGTGGGGATTTAACAATGAAGAATCTTGCCGGTATTGGAGGGGATGTGGAAGTTTCTTTGGGTGGCCGGCTGACAAGGGTTCTTCACCTGGAGGTGCTGGTGGTGGAGGAGGCGGCGGTGGCGGCGGAGGAAATTCAGGAGGTGGAGTTGGGTCCGGCCATGGTGAAGGTTATGGAGCTGGGTTTGGTGTAGGTAGTAATGGCGGAGGCGGTGGTGGCGGTGGGGGAAGTGGGTCTGGGTCTGGAGAAGGGTCTGGTCATGGTAGTGGATATGGAGCTGGCGGCGGTGGTGGGATTGGTGGAGGCGGAGGTGGTGGTGGGGGTGGTGGTAGTGCCAGTGGTAGTAATGGTTATGGCCAAGGAATGGGATTCGGAGCAGGGTTTGGGCTTGGCGGAGGAGGTGGCGGTGGAGGAGGTGGCGGCGGTGGTGGAAGTAGCAATTCCGTTTGGGCAGGGGAGGCGTATGGACATGGAAGTGGGTTTGGTGGAGGTGGAGGTATGGGTGGCGGGGGAGCTGGAGGAGGAGGTGGCGGCGGTGGTGGTAGCGGAGGAGGAGGAGGTACAAATGGAGGAAATGGCTATGGTAGTGGTTTTGGAGGTGGCGTAGGTAGCGGAGGCAGTGGTGGTGGCGGTGGCGGTGGTGGAGGCGGAGGAAGAGGGAGTTCAAACGGAGGAAATAACGGTCATGGTAGTGGTTTTGGTGGTGGAGCAGGTAACGGTGTAGGCGGCGTCGGCGGCGGTGGGGGAGGCGGAGGGGGTGGAGTAAGTAGTCCAAATGGAGGATATGGTAAAGGGGAGGGCGGTGGATTTGGAGGCGGAGGAGGGAATGCAAATAGCTTTGGCGGCGGTGGAAAGGGAGGACAAGGCATGGGAATGAGTTTTGGAATGGGGTTTGGTATGGGAATTGGGTTTGGGATGGGAAGCAACAATAATGGAGCTGATGAGTCCAATGATCATACTCAAGCCAAAACCACCACTGCCCAACCCTAGACATCTTTTGCCATTACCATGCAATTTAAAGATACACAAAATTTGTTTGTAATCTTAATATGCCATCATTTTCCATGTATTTATCACATATATTGTTTCCCTTTCAAATATTCAAGTTCAAGTTTATGATTTCATGTCCAAAACTTTGATGATCATAAAATTACTATTTTACTTCGGTA

mRNA sequence

GGGAAAGGGAGATGGGTTTTAGGCTGCGGCTTCCTCTTTATAAGGCACAAAGAACCAGCGGAAACCCATGCAAAAGTTCTTTCCCACTTTTCTTCTAATAATTAATAATGCCTCAATCTTGTTTTACATTTCTATGTCTTATAGTTCTTGTTGTAAGCCGACTTAGTTTTGGCTATGGAGGCGAAACAGGGTGGGGATTTAACAATGAAGAATCTTGCCGGTATTGGAGGGGATGTGGAAGTTTCTTTGGGTGGCCGGCTGACAAGGGTTCTTCACCTGGAGGTGCTGGTGGTGGAGGAGGCGGCGGTGGCGGCGGAGGAAATTCAGGAGGTGGAGTTGGGTCCGGCCATGGTGAAGGTTATGGAGCTGGGTTTGGTGTAGGTAGTAATGGCGGAGGCGGTGGTGGCGGTGGGGGAAGTGGGTCTGGGTCTGGAGAAGGGTCTGGTCATGGTAGTGGATATGGAGCTGGCGGCGGTGGTGGGATTGGTGGAGGCGGAGGTGGTGGTGGGGGTGGTGGTAGTGCCAGTGGTAGTAATGGTTATGGCCAAGGAATGGGATTCGGAGCAGGGTTTGGGCTTGGCGGAGGAGGTGGCGGTGGAGGAGGTGGCGGCGGTGGTGGAAGTAGCAATTCCGTTTGGGCAGGGGAGGCGTATGGACATGGAAGTGGGTTTGGTGGAGGTGGAGGTATGGGTGGCGGGGGAGCTGGAGGAGGAGGTGGCGGCGGTGGTGGTAGCGGAGGAGGAGGAGGTACAAATGGAGGAAATGGCTATGGTAGTGGTTTTGGAGGTGGCGTAGGTAGCGGAGGCAGTGGTGGTGGCGGTGGCGGTGGTGGAGGCGGAGGAAGAGGGAGTTCAAACGGAGGAAATAACGGTCATGGTAGTGGTTTTGGTGGTGGAGCAGGTAACGGTGTAGGCGGCGTCGGCGGCGGTGGGGGAGGCGGAGGGGGTGGAGTAAGTAGTCCAAATGGAGGATATGGTAAAGGGGAGGGCGGTGGATTTGGAGGCGGAGGAGGGAATGCAAATAGCTTTGGCGGCGGTGGAAAGGGAGGACAAGGCATGGGAATGAGTTTTGGAATGGGGTTTGGTATGGGAATTGGGTTTGGGATGGGAAGCAACAATAATGGAGCTGATGAGTCCAATGATCATACTCAAGCCAAAACCACCACTGCCCAACCCTAGACATCTTTTGCCATTACCATGCAATTTAAAGATACACAAAATTTGTTTGTAATCTTAATATGCCATCATTTTCCATGTATTTATCACATATATTGTTTCCCTTTCAAATATTCAAGTTCAAGTTTATGATTTCATGTCCAAAACTTTGATGATCATAAAATTACTATTTTACTTCGGTA

Coding sequence (CDS)

ATGCCTCAATCTTGTTTTACATTTCTATGTCTTATAGTTCTTGTTGTAAGCCGACTTAGTTTTGGCTATGGAGGCGAAACAGGGTGGGGATTTAACAATGAAGAATCTTGCCGGTATTGGAGGGGATGTGGAAGTTTCTTTGGGTGGCCGGCTGACAAGGGTTCTTCACCTGGAGGTGCTGGTGGTGGAGGAGGCGGCGGTGGCGGCGGAGGAAATTCAGGAGGTGGAGTTGGGTCCGGCCATGGTGAAGGTTATGGAGCTGGGTTTGGTGTAGGTAGTAATGGCGGAGGCGGTGGTGGCGGTGGGGGAAGTGGGTCTGGGTCTGGAGAAGGGTCTGGTCATGGTAGTGGATATGGAGCTGGCGGCGGTGGTGGGATTGGTGGAGGCGGAGGTGGTGGTGGGGGTGGTGGTAGTGCCAGTGGTAGTAATGGTTATGGCCAAGGAATGGGATTCGGAGCAGGGTTTGGGCTTGGCGGAGGAGGTGGCGGTGGAGGAGGTGGCGGCGGTGGTGGAAGTAGCAATTCCGTTTGGGCAGGGGAGGCGTATGGACATGGAAGTGGGTTTGGTGGAGGTGGAGGTATGGGTGGCGGGGGAGCTGGAGGAGGAGGTGGCGGCGGTGGTGGTAGCGGAGGAGGAGGAGGTACAAATGGAGGAAATGGCTATGGTAGTGGTTTTGGAGGTGGCGTAGGTAGCGGAGGCAGTGGTGGTGGCGGTGGCGGTGGTGGAGGCGGAGGAAGAGGGAGTTCAAACGGAGGAAATAACGGTCATGGTAGTGGTTTTGGTGGTGGAGCAGGTAACGGTGTAGGCGGCGTCGGCGGCGGTGGGGGAGGCGGAGGGGGTGGAGTAAGTAGTCCAAATGGAGGATATGGTAAAGGGGAGGGCGGTGGATTTGGAGGCGGAGGAGGGAATGCAAATAGCTTTGGCGGCGGTGGAAAGGGAGGACAAGGCATGGGAATGAGTTTTGGAATGGGGTTTGGTATGGGAATTGGGTTTGGGATGGGAAGCAACAATAATGGAGCTGATGAGTCCAATGATCATACTCAAGCCAAAACCACCACTGCCCAACCCTAG

Protein sequence

MPQSCFTFLCLIVLVVSRLSFGYGGETGWGFNNEESCRYWRGCGSFFGWPADKGSSPGGAGGGGGGGGGGGNSGGGVGSGHGEGYGAGFGVGSNGGGGGGGGGSGSGSGEGSGHGSGYGAGGGGGIGGGGGGGGGGGSASGSNGYGQGMGFGAGFGLGGGGGGGGGGGGGGSSNSVWAGEAYGHGSGFGGGGGMGGGGAGGGGGGGGGSGGGGGTNGGNGYGSGFGGGVGSGGSGGGGGGGGGGGRGSSNGGNNGHGSGFGGGAGNGVGGVGGGGGGGGGGVSSPNGGYGKGEGGGFGGGGGNANSFGGGGKGGQGMGMSFGMGFGMGIGFGMGSNNNGADESNDHTQAKTTTAQP
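
The protein sequence is the translation of the coding sequence. As a rough sketch of that relationship, the snippet below translates the first and last few codons of the CDS with the standard genetic code; `translate` is a hypothetical helper written for illustration, and the compact codon table relies on the conventional T, C, A, G ordering.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First seven codons and last four codons of the CDS shown above.
print(translate("ATGCCTCAATCTTGTTTTACA"))  # start of the protein
print(translate("GCCCAACCCTAG"))           # end of the protein plus stop
```

The two fragments translate to `MPQSCFT` and `AQP*`, matching the start and end of the protein sequence listed above.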
Homology
BLAST of Clc03G04870 vs. NCBI nr
Match: KAG6584096.1 (hypothetical protein SDJN03_20028, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 288.1 bits (736), Expect = 1.0e-73
Identity = 266/355 (74.93%), Positives = 276/355 (77.75%), Query Frame = 0

Query: 1   MPQSCFTFLCLIVLVVSRLSFGYGGETGWGFNNEESCRYWRGCGSFFGWPADKGSSPGGA 60
           M   CF FLCLI+LV+ R SFGYGG+TGWGFNNEESCRYWRGCGSFFGWP DKGS+    
Sbjct: 1   MAHFCFRFLCLILLVLGRFSFGYGGDTGWGFNNEESCRYWRGCGSFFGWPVDKGSA---- 60

Query: 61  GGGGGGGGGGGNSGGGVGSGHGEGYGAGFGVGSNGGGGGGGGG--SGSGSGEGSGHGSGY 120
            GGGGGGGGGG+SGGGVGSG GEGYGAGFGVGS+GG GGGGGG   G GS EGSGHGSGY
Sbjct: 61  -GGGGGGGGGGHSGGGVGSGFGEGYGAGFGVGSHGGSGGGGGGGYGGGGSEEGSGHGSGY 120

Query: 121 GAGGGGGIGGGGGGGGGGGSASGSNGYGQGMGFGAGFGLGGGGGGGGGGGGGGSSNSVWA 180
           GAGGGGG GGGGGGGGGGG   GSNGYGQGMGFGAGFG GGGGGGGGGGGGGG  NSV  
Sbjct: 121 GAGGGGGAGGGGGGGGGGG---GSNGYGQGMGFGAGFGFGGGGGGGGGGGGGGGGNSVRG 180

Query: 181 GEAYGHGSGFGGGGGMGGGGAGGGGGGGGGSGGGGGTNGGNGYGSGFGGGVGSGGSGGGG 240
           GEAYGHGSGFGGGGG+GGGGAGGGGG     GG GGTNGGNGYGSGFGGGVG+GGSG GG
Sbjct: 181 GEAYGHGSGFGGGGGIGGGGAGGGGG-----GGSGGTNGGNGYGSGFGGGVGTGGSGSGG 240

Query: 241 GGGGGGGRGSSNGGNNGHGSGFGGGAGNGVGGVGGGGGGGGGGVSSPNGGYGKGEGGGFG 300
                                            GGGGGGGGGG S PNGGYGKGEGGGFG
Sbjct: 241 ---------------------------------GGGGGGGGGGESGPNGGYGKGEGGGFG 300

Query: 301 GGGGNANSF-GGGGKGGQGMGMSFGMGFGMGIGFGMGSNNNGADESNDHTQAKTT 353
           GG G  NS  GGGGKGGQGMGM FGMGFGMG+GFGMG+ NNGADESN H QAKTT
Sbjct: 301 GGEGFTNSVGGGGGKGGQGMGMGFGMGFGMGVGFGMGNTNNGADESNGHAQAKTT 309

BLAST of Clc03G04870 vs. NCBI nr
Match: KGN64507.1 (hypothetical protein Csa_013094 [Cucumis sativus])

HSP 1 Score: 284.6 bits (727), Expect = 1.1e-72
Identity = 286/368 (77.72%), Positives = 304/368 (82.61%), Query Frame = 0

Query: 1   MPQSCFTFLCLIVLVVSRLSFGYGGETGWGFNNEESCRYWRGCGSFFGWPADKGSSPGGA 60
           MPQS F FLC I+LV+++LSFGY    GWGF+NE+SCRYWRGCG+FFGWPADK  S  G 
Sbjct: 2   MPQSSFKFLCFILLVLTQLSFGY----GWGFDNEDSCRYWRGCGTFFGWPADKPGS--GG 61

Query: 61  GGGGGGGGGGGNSGGGVGSGHGEGYGAGFGVGSNGGGGGGGGGSGSGSGEGSGHGSGYGA 120
           GGGGG G GGGNSG GVG GHGEGYGAGFGVG NGGGGGGGGG G GSGEG GHGSGYGA
Sbjct: 62  GGGGGSGSGGGNSGDGVGFGHGEGYGAGFGVGGNGGGGGGGGGGGGGSGEGYGHGSGYGA 121

Query: 121 GGGGGIGGG--GGGGGGGGSASGS----NGYGQGMGFGAGFGL-GGGGGGGGGGGGGGSS 180
           GGGG  GGG  GGGGGGGGS SGS    NG+GQGMGFGAGFGL GGGGGGGGGGGGGG  
Sbjct: 122 GGGGVTGGGAAGGGGGGGGSGSGSGNGGNGFGQGMGFGAGFGLGGGGGGGGGGGGGGGGG 181

Query: 181 NSVWAGEAYGHGSGFGGGGGMGGGGAGGGGGGGGGSGGGGGTNGGNGYGSGFGGGVGSGG 240
           NSVW GEAYGHGSGFG     GGGGAG GGGGGGG  GGGGTNGGNGYGSGFGGG+GSG 
Sbjct: 182 NSVWGGEAYGHGSGFG-----GGGGAGAGGGGGGGGSGGGGTNGGNGYGSGFGGGIGSGS 241

Query: 241 SGGGGGGGGGGGRGSSNGGNN-GHGSGFGGGAGNGVGGVGGGGGGGGGGVSSPNGGYGKG 300
           S  GGGGGGGGGR S+NGG++ G GSGFGGG GNG GG GGGG GGGGG+SS NGGYGKG
Sbjct: 242 S--GGGGGGGGGRSSTNGGSSKGDGSGFGGGVGNGAGG-GGGGSGGGGGISSSNGGYGKG 301

Query: 301 EGGGFGGGGGNANSFGGGGKGGQGMGMSFGMGFGMGIGFGMG---SNNNGADE-SNDHTQ 357
           EG GFG GGGN N+FGGGGKGG+GMGM FGMGFGMGIGFGMG   SNNNGAD+  ND T+
Sbjct: 302 EGSGFGVGGGNTNNFGGGGKGGEGMGMGFGMGFGMGIGFGMGNSNSNNNGADDYKNDQTK 355

BLAST of Clc03G04870 vs. NCBI nr
Match: XP_011648420.1 (glycine-rich protein DOT1 [Cucumis sativus])

HSP 1 Score: 284.6 bits (727), Expect = 1.1e-72
Identity = 286/368 (77.72%), Positives = 304/368 (82.61%), Query Frame = 0

Query: 1   MPQSCFTFLCLIVLVVSRLSFGYGGETGWGFNNEESCRYWRGCGSFFGWPADKGSSPGGA 60
           MPQS F FLC I+LV+++LSFGY    GWGF+NE+SCRYWRGCG+FFGWPADK  S  G 
Sbjct: 1   MPQSSFKFLCFILLVLTQLSFGY----GWGFDNEDSCRYWRGCGTFFGWPADKPGS--GG 60

Query: 61  GGGGGGGGGGGNSGGGVGSGHGEGYGAGFGVGSNGGGGGGGGGSGSGSGEGSGHGSGYGA 120
           GGGGG G GGGNSG GVG GHGEGYGAGFGVG NGGGGGGGGG G GSGEG GHGSGYGA
Sbjct: 61  GGGGGSGSGGGNSGDGVGFGHGEGYGAGFGVGGNGGGGGGGGGGGGGSGEGYGHGSGYGA 120

Query: 121 GGGGGIGGG--GGGGGGGGSASGS----NGYGQGMGFGAGFGL-GGGGGGGGGGGGGGSS 180
           GGGG  GGG  GGGGGGGGS SGS    NG+GQGMGFGAGFGL GGGGGGGGGGGGGG  
Sbjct: 121 GGGGVTGGGAAGGGGGGGGSGSGSGNGGNGFGQGMGFGAGFGLGGGGGGGGGGGGGGGGG 180

Query: 181 NSVWAGEAYGHGSGFGGGGGMGGGGAGGGGGGGGGSGGGGGTNGGNGYGSGFGGGVGSGG 240
           NSVW GEAYGHGSGFG     GGGGAG GGGGGGG  GGGGTNGGNGYGSGFGGG+GSG 
Sbjct: 181 NSVWGGEAYGHGSGFG-----GGGGAGAGGGGGGGGSGGGGTNGGNGYGSGFGGGIGSGS 240

Query: 241 SGGGGGGGGGGGRGSSNGGNN-GHGSGFGGGAGNGVGGVGGGGGGGGGGVSSPNGGYGKG 300
           S  GGGGGGGGGR S+NGG++ G GSGFGGG GNG GG GGGG GGGGG+SS NGGYGKG
Sbjct: 241 S--GGGGGGGGGRSSTNGGSSKGDGSGFGGGVGNGAGG-GGGGSGGGGGISSSNGGYGKG 300

Query: 301 EGGGFGGGGGNANSFGGGGKGGQGMGMSFGMGFGMGIGFGMG---SNNNGADE-SNDHTQ 357
           EG GFG GGGN N+FGGGGKGG+GMGM FGMGFGMGIGFGMG   SNNNGAD+  ND T+
Sbjct: 301 EGSGFGVGGGNTNNFGGGGKGGEGMGMGFGMGFGMGIGFGMGNSNSNNNGADDYKNDQTK 354

BLAST of Clc03G04870 vs. NCBI nr
Match: XP_023001646.1 (glycine-rich cell wall structural protein 1-like [Cucurbita maxima])

HSP 1 Score: 278.5 bits (711), Expect = 8.1e-71
Identity = 264/359 (73.54%), Positives = 277/359 (77.16%), Query Frame = 0

Query: 1   MPQSCFTFLCLIVLVVSRLSFGYGGETGWGFNNEESCRYWRGCGSFFGWPADKGSSPGGA 60
           M   CFTFLCLI+LV+ R SFG+GG+ GWGFNN+ESCRYWRGCGSFFGWP DKGS+    
Sbjct: 1   MAHFCFTFLCLILLVLGRFSFGHGGDPGWGFNNDESCRYWRGCGSFFGWPVDKGSA---- 60

Query: 61  GGGGGGGGGGGNSGGGVGSGHGEGYGAGFGVGSNGGGGGGGGG--SGSGSGEGSGHGSGY 120
            GGGGGGGGGG+SGGGVGSG+GEGYGAGFGVGS+GG GGGGGG   G GS EGSGHGSGY
Sbjct: 61  -GGGGGGGGGGHSGGGVGSGYGEGYGAGFGVGSHGGSGGGGGGGYGGGGSEEGSGHGSGY 120

Query: 121 GAGGGGGIGGGGGGGGGGGSASGSNGYGQGMGFGAGFGLGGGGGGGGGGGGGGSSNSVWA 180
           GAGGGGG GGGGGGGGGGG   G NGYGQGMGFGAGFG GGGGGGGGGGGGGG  NSV  
Sbjct: 121 GAGGGGGAGGGGGGGGGGG---GGNGYGQGMGFGAGFGFGGGGGGGGGGGGGG--NSVRG 180

Query: 181 GEAYGHGSGFGGGGGMGGGGAGGGGGGGGGSGGGGGTNGGNGYGSGFGGGVGSGGSGGGG 240
           GEAYGHGSGFGGGGG+GGGGAGGGGGGGGGS   GGTNGGNGYGSGFGGGVG GGSG   
Sbjct: 181 GEAYGHGSGFGGGGGIGGGGAGGGGGGGGGS---GGTNGGNGYGSGFGGGVGIGGSGS-- 240

Query: 241 GGGGGGGRGSSNGGNNGHGSGFGGGAGNGVGGVGGGGGGGGGGVSSPNGGYGKGEGGGFG 300
                                            GGGGGGGGGG S P+GGYGKGEGGGFG
Sbjct: 241 ---------------------------------GGGGGGGGGGESGPHGGYGKGEGGGFG 300

Query: 301 GGGGNANSFG-GGGKGGQGMGMSFGMGFGMGIGFGMGSNNNGADESNDHTQAKTTTAQP 357
           GG G  NS G  GGKGGQGMGM FGMGFGMGIGFGMG+ NNGADESN   QAKTT A+P
Sbjct: 301 GGEGFTNSVGSSGGKGGQGMGMGFGMGFGMGIGFGMGNTNNGADESNGQAQAKTTIAKP 311

BLAST of Clc03G04870 vs. NCBI nr
Match: ADN34063.1 (hypothetical protein [Cucumis melo subsp. melo] >KAA0056844.1 hypothetical protein E6C27_scaffold96G00120 [Cucumis melo var. makuwa] >TYJ99347.1 hypothetical protein E5676_scaffold248G005640 [Cucumis melo var. makuwa])

HSP 1 Score: 265.8 bits (678), Expect = 5.5e-67
Identity = 277/367 (75.48%), Positives = 293/367 (79.84%), Query Frame = 0

Query: 1   MPQSCFTFLCLIVLVVSRLSFGYGGETGWGFNNEESCRYWRGCGSFFGWPADKGSSPGGA 60
           MP+S F F C I+LV+++LSFGY    GWGF+NE+SCRYWRGCG+FFGWPADK  S GG 
Sbjct: 1   MPRSSFNFFCFILLVLTQLSFGY----GWGFDNEDSCRYWRGCGTFFGWPADKPGS-GGG 60

Query: 61  GGGGGGGGGGGNSGGGVGSGHGEGYGAGFGVGSNGGGGGGGGGSGSGSGEGSGHGSGYGA 120
           GGGGG GGGGGNSG GVG GHGEGYGAGFGVG NGGGGGGGGG G GS EG GHGSGYGA
Sbjct: 61  GGGGGSGGGGGNSGDGVGFGHGEGYGAGFGVGGNGGGGGGGGGGGGGSREGYGHGSGYGA 120

Query: 121 GG----GGGIGGGGGGGGGG-GSASGSNGYGQGMGFGAGFGLGGGGGGGGGGGGGGSSNS 180
           GG    GGG GGGGGGGGGG GS +G NGYGQGMGFGAGFGLGGGGGGGGGGGGG   NS
Sbjct: 121 GGGGVAGGGAGGGGGGGGGGSGSGNGGNGYGQGMGFGAGFGLGGGGGGGGGGGGG---NS 180

Query: 181 VWAGEAYGHGSGFGGGGGMGGGGAGGGGGGGGGSGGGGGTNGGNGYGSGFGGGVGSGGSG 240
           V  GEAYGHGSGFGGGGGMGGGGAGGGGGGGG   GGGG NGGNGYGSGFGGG+GSGGSG
Sbjct: 181 VLGGEAYGHGSGFGGGGGMGGGGAGGGGGGGG--SGGGGANGGNGYGSGFGGGIGSGGSG 240

Query: 241 GGGGGGGGGGRGSSNGGNNGHGSGFGGGAGNGVGGVGGGGGGGGGGVSSPNGGYGKGEGG 300
           GGGGGGGGG  G   G +NGHGSGFGGG GNG GG GGGG GGGGG+S+ NGGYGKGEG 
Sbjct: 241 GGGGGGGGGSNG---GSSNGHGSGFGGGVGNGAGG-GGGGSGGGGGISNSNGGYGKGEGS 300

Query: 301 GFGGGGGNANSFGGGGKGGQGMGMSFGMGFGMGIGFGMG-----SNNNGA-DESNDHTQA 357
           G           GG GKGG+GMGM FGMGFGMGIGFGMG     +NNNGA D  ND T+ 
Sbjct: 301 G----------IGGSGKGGEGMGMGFGMGFGMGIGFGMGNNNNNNNNNGAYDYKNDETKV 343

BLAST of Clc03G04870 vs. ExPASy TrEMBL
Match: A0A0A0LUS3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G062330 PE=4 SV=1)

HSP 1 Score: 284.6 bits (727), Expect = 5.5e-73
Identity = 286/368 (77.72%), Positives = 304/368 (82.61%), Query Frame = 0

Query: 1   MPQSCFTFLCLIVLVVSRLSFGYGGETGWGFNNEESCRYWRGCGSFFGWPADKGSSPGGA 60
           MPQS F FLC I+LV+++LSFGY    GWGF+NE+SCRYWRGCG+FFGWPADK  S  G 
Sbjct: 2   MPQSSFKFLCFILLVLTQLSFGY----GWGFDNEDSCRYWRGCGTFFGWPADKPGS--GG 61

Query: 61  GGGGGGGGGGGNSGGGVGSGHGEGYGAGFGVGSNGGGGGGGGGSGSGSGEGSGHGSGYGA 120
           GGGGG G GGGNSG GVG GHGEGYGAGFGVG NGGGGGGGGG G GSGEG GHGSGYGA
Sbjct: 62  GGGGGSGSGGGNSGDGVGFGHGEGYGAGFGVGGNGGGGGGGGGGGGGSGEGYGHGSGYGA 121

Query: 121 GGGGGIGGG--GGGGGGGGSASGS----NGYGQGMGFGAGFGL-GGGGGGGGGGGGGGSS 180
           GGGG  GGG  GGGGGGGGS SGS    NG+GQGMGFGAGFGL GGGGGGGGGGGGGG  
Sbjct: 122 GGGGVTGGGAAGGGGGGGGSGSGSGNGGNGFGQGMGFGAGFGLGGGGGGGGGGGGGGGGG 181

Query: 181 NSVWAGEAYGHGSGFGGGGGMGGGGAGGGGGGGGGSGGGGGTNGGNGYGSGFGGGVGSGG 240
           NSVW GEAYGHGSGFG     GGGGAG GGGGGGG  GGGGTNGGNGYGSGFGGG+GSG 
Sbjct: 182 NSVWGGEAYGHGSGFG-----GGGGAGAGGGGGGGGSGGGGTNGGNGYGSGFGGGIGSGS 241

Query: 241 SGGGGGGGGGGGRGSSNGGNN-GHGSGFGGGAGNGVGGVGGGGGGGGGGVSSPNGGYGKG 300
           S  GGGGGGGGGR S+NGG++ G GSGFGGG GNG GG GGGG GGGGG+SS NGGYGKG
Sbjct: 242 S--GGGGGGGGGRSSTNGGSSKGDGSGFGGGVGNGAGG-GGGGSGGGGGISSSNGGYGKG 301

Query: 301 EGGGFGGGGGNANSFGGGGKGGQGMGMSFGMGFGMGIGFGMG---SNNNGADE-SNDHTQ 357
           EG GFG GGGN N+FGGGGKGG+GMGM FGMGFGMGIGFGMG   SNNNGAD+  ND T+
Sbjct: 302 EGSGFGVGGGNTNNFGGGGKGGEGMGMGFGMGFGMGIGFGMGNSNSNNNGADDYKNDQTK 355

BLAST of Clc03G04870 vs. ExPASy TrEMBL
Match: A0A6J1KJ79 (glycine-rich cell wall structural protein 1-like OS=Cucurbita maxima OX=3661 GN=LOC111495720 PE=4 SV=1)

HSP 1 Score: 278.5 bits (711), Expect = 3.9e-71
Identity = 264/359 (73.54%), Positives = 277/359 (77.16%), Query Frame = 0

Query: 1   MPQSCFTFLCLIVLVVSRLSFGYGGETGWGFNNEESCRYWRGCGSFFGWPADKGSSPGGA 60
           M   CFTFLCLI+LV+ R SFG+GG+ GWGFNN+ESCRYWRGCGSFFGWP DKGS+    
Sbjct: 1   MAHFCFTFLCLILLVLGRFSFGHGGDPGWGFNNDESCRYWRGCGSFFGWPVDKGSA---- 60

Query: 61  GGGGGGGGGGGNSGGGVGSGHGEGYGAGFGVGSNGGGGGGGGG--SGSGSGEGSGHGSGY 120
            GGGGGGGGGG+SGGGVGSG+GEGYGAGFGVGS+GG GGGGGG   G GS EGSGHGSGY
Sbjct: 61  -GGGGGGGGGGHSGGGVGSGYGEGYGAGFGVGSHGGSGGGGGGGYGGGGSEEGSGHGSGY 120

Query: 121 GAGGGGGIGGGGGGGGGGGSASGSNGYGQGMGFGAGFGLGGGGGGGGGGGGGGSSNSVWA 180
           GAGGGGG GGGGGGGGGGG   G NGYGQGMGFGAGFG GGGGGGGGGGGGGG  NSV  
Sbjct: 121 GAGGGGGAGGGGGGGGGGG---GGNGYGQGMGFGAGFGFGGGGGGGGGGGGGG--NSVRG 180

Query: 181 GEAYGHGSGFGGGGGMGGGGAGGGGGGGGGSGGGGGTNGGNGYGSGFGGGVGSGGSGGGG 240
           GEAYGHGSGFGGGGG+GGGGAGGGGGGGGGS   GGTNGGNGYGSGFGGGVG GGSG   
Sbjct: 181 GEAYGHGSGFGGGGGIGGGGAGGGGGGGGGS---GGTNGGNGYGSGFGGGVGIGGSGS-- 240

Query: 241 GGGGGGGRGSSNGGNNGHGSGFGGGAGNGVGGVGGGGGGGGGGVSSPNGGYGKGEGGGFG 300
                                            GGGGGGGGGG S P+GGYGKGEGGGFG
Sbjct: 241 ---------------------------------GGGGGGGGGGESGPHGGYGKGEGGGFG 300

Query: 301 GGGGNANSFG-GGGKGGQGMGMSFGMGFGMGIGFGMGSNNNGADESNDHTQAKTTTAQP 357
           GG G  NS G  GGKGGQGMGM FGMGFGMGIGFGMG+ NNGADESN   QAKTT A+P
Sbjct: 301 GGEGFTNSVGSSGGKGGQGMGMGFGMGFGMGIGFGMGNTNNGADESNGQAQAKTTIAKP 311

BLAST of Clc03G04870 vs. ExPASy TrEMBL
Match: A0A5D3BJP2 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G005640 PE=4 SV=1)

HSP 1 Score: 265.8 bits (678), Expect = 2.6e-67
Identity = 277/367 (75.48%), Positives = 293/367 (79.84%), Query Frame = 0

Query: 1   MPQSCFTFLCLIVLVVSRLSFGYGGETGWGFNNEESCRYWRGCGSFFGWPADKGSSPGGA 60
           MP+S F F C I+LV+++LSFGY    GWGF+NE+SCRYWRGCG+FFGWPADK  S GG 
Sbjct: 1   MPRSSFNFFCFILLVLTQLSFGY----GWGFDNEDSCRYWRGCGTFFGWPADKPGS-GGG 60

Query: 61  GGGGGGGGGGGNSGGGVGSGHGEGYGAGFGVGSNGGGGGGGGGSGSGSGEGSGHGSGYGA 120
           GGGGG GGGGGNSG GVG GHGEGYGAGFGVG NGGGGGGGGG G GS EG GHGSGYGA
Sbjct: 61  GGGGGSGGGGGNSGDGVGFGHGEGYGAGFGVGGNGGGGGGGGGGGGGSREGYGHGSGYGA 120

Query: 121 GG----GGGIGGGGGGGGGG-GSASGSNGYGQGMGFGAGFGLGGGGGGGGGGGGGGSSNS 180
           GG    GGG GGGGGGGGGG GS +G NGYGQGMGFGAGFGLGGGGGGGGGGGGG   NS
Sbjct: 121 GGGGVAGGGAGGGGGGGGGGSGSGNGGNGYGQGMGFGAGFGLGGGGGGGGGGGGG---NS 180

Query: 181 VWAGEAYGHGSGFGGGGGMGGGGAGGGGGGGGGSGGGGGTNGGNGYGSGFGGGVGSGGSG 240
           V  GEAYGHGSGFGGGGGMGGGGAGGGGGGGG   GGGG NGGNGYGSGFGGG+GSGGSG
Sbjct: 181 VLGGEAYGHGSGFGGGGGMGGGGAGGGGGGGG--SGGGGANGGNGYGSGFGGGIGSGGSG 240

Query: 241 GGGGGGGGGGRGSSNGGNNGHGSGFGGGAGNGVGGVGGGGGGGGGGVSSPNGGYGKGEGG 300
           GGGGGGGGG  G   G +NGHGSGFGGG GNG GG GGGG GGGGG+S+ NGGYGKGEG 
Sbjct: 241 GGGGGGGGGSNG---GSSNGHGSGFGGGVGNGAGG-GGGGSGGGGGISNSNGGYGKGEGS 300

Query: 301 GFGGGGGNANSFGGGGKGGQGMGMSFGMGFGMGIGFGMG-----SNNNGA-DESNDHTQA 357
           G           GG GKGG+GMGM FGMGFGMGIGFGMG     +NNNGA D  ND T+ 
Sbjct: 301 G----------IGGSGKGGEGMGMGFGMGFGMGIGFGMGNNNNNNNNNGAYDYKNDETKV 343

BLAST of Clc03G04870 vs. ExPASy TrEMBL
Match: E5GC64 (Uncharacterized protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1)

HSP 1 Score: 265.8 bits (678), Expect = 2.6e-67
Identity = 277/367 (75.48%), Positives = 293/367 (79.84%), Query Frame = 0

Query: 1   MPQSCFTFLCLIVLVVSRLSFGYGGETGWGFNNEESCRYWRGCGSFFGWPADKGSSPGGA 60
           MP+S F F C I+LV+++LSFGY    GWGF+NE+SCRYWRGCG+FFGWPADK  S GG 
Sbjct: 1   MPRSSFNFFCFILLVLTQLSFGY----GWGFDNEDSCRYWRGCGTFFGWPADKPGS-GGG 60

Query: 61  GGGGGGGGGGGNSGGGVGSGHGEGYGAGFGVGSNGGGGGGGGGSGSGSGEGSGHGSGYGA 120
           GGGGG GGGGGNSG GVG GHGEGYGAGFGVG NGGGGGGGGG G GS EG GHGSGYGA
Sbjct: 61  GGGGGSGGGGGNSGDGVGFGHGEGYGAGFGVGGNGGGGGGGGGGGGGSREGYGHGSGYGA 120

Query: 121 GG----GGGIGGGGGGGGGG-GSASGSNGYGQGMGFGAGFGLGGGGGGGGGGGGGGSSNS 180
           GG    GGG GGGGGGGGGG GS +G NGYGQGMGFGAGFGLGGGGGGGGGGGGG   NS
Sbjct: 121 GGGGVAGGGAGGGGGGGGGGSGSGNGGNGYGQGMGFGAGFGLGGGGGGGGGGGGG---NS 180

Query: 181 VWAGEAYGHGSGFGGGGGMGGGGAGGGGGGGGGSGGGGGTNGGNGYGSGFGGGVGSGGSG 240
           V  GEAYGHGSGFGGGGGMGGGGAGGGGGGGG   GGGG NGGNGYGSGFGGG+GSGGSG
Sbjct: 181 VLGGEAYGHGSGFGGGGGMGGGGAGGGGGGGG--SGGGGANGGNGYGSGFGGGIGSGGSG 240

Query: 241 GGGGGGGGGGRGSSNGGNNGHGSGFGGGAGNGVGGVGGGGGGGGGGVSSPNGGYGKGEGG 300
           GGGGGGGGG  G   G +NGHGSGFGGG GNG GG GGGG GGGGG+S+ NGGYGKGEG 
Sbjct: 241 GGGGGGGGGSNG---GSSNGHGSGFGGGVGNGAGG-GGGGSGGGGGISNSNGGYGKGEGS 300

Query: 301 GFGGGGGNANSFGGGGKGGQGMGMSFGMGFGMGIGFGMG-----SNNNGA-DESNDHTQA 357
           G           GG GKGG+GMGM FGMGFGMGIGFGMG     +NNNGA D  ND T+ 
Sbjct: 301 G----------IGGSGKGGEGMGMGFGMGFGMGIGFGMGNNNNNNNNNGAYDYKNDETKV 343

BLAST of Clc03G04870 vs. ExPASy TrEMBL
Match: A0A6J1EAJ0 (glycine-rich cell wall structural protein 2-like OS=Cucurbita moschata OX=3662 GN=LOC111431377 PE=4 SV=1)

HSP 1 Score: 219.5 bits (558), Expect = 2.2e-53
Identity = 222/353 (62.89%), Positives = 232/353 (65.72%), Query Frame = 0

Query: 1   MPQSCFTFLCLIVLVVSRLSFGYGGETGWGFNNEESCRYWRGCGSFFGWPADKGSSPGGA 60
           M   CF FLCLI+LV+ R SFGYGG+TGWGFNNEESCRYWRGCGSFFGWP DKGS+    
Sbjct: 1   MAHFCFRFLCLILLVLGRFSFGYGGDTGWGFNNEESCRYWRGCGSFFGWPVDKGSA---- 60

Query: 61  GGGGGGGGGGGNSGGGVGSGHGEGYGAGFGVGSNGGGGGGGGGSGSGSGEGSGHGSGYGA 120
            GGGGGGGGGG+SGGGVGSG GEGYGAGFGVGS+GG G                      
Sbjct: 61  -GGGGGGGGGGHSGGGVGSGFGEGYGAGFGVGSHGGSG---------------------- 120

Query: 121 GGGGGIGGGGGGGGGGGSASGSNGYGQGMGFGAGFGLGGGGGGGGGGGGGGSSNSVWAGE 180
                 GGGGGG GGGGS  GS                   G GGGGGGGG  NSV  GE
Sbjct: 121 ------GGGGGGYGGGGSEEGS-------------------GHGGGGGGGG--NSVRGGE 180

Query: 181 AYGHGSGFGGGGGMGGGGAGGGGGGGGGSGGGGGTNGGNGYGSGFGGGVGSGGSGGGGGG 240
           AYGHGSGFGGGGG+GGGGAGGGGG     GG GGTNGGNGYGSGFGGGVG+GGSG G   
Sbjct: 181 AYGHGSGFGGGGGIGGGGAGGGGG-----GGSGGTNGGNGYGSGFGGGVGTGGSGSG--- 240

Query: 241 GGGGGRGSSNGGNNGHGSGFGGGAGNGVGGVGGGGGGGGGGVSSPNGGYGKGEGGGFGGG 300
                                          GGGGGGGGGG S PNGGYGKGEGGGFGGG
Sbjct: 241 -------------------------------GGGGGGGGGGESGPNGGYGKGEGGGFGGG 260

Query: 301 GGNANSF-GGGGKGGQGMGMSFGMGFGMGIGFGMGSNNNGADESNDHTQAKTT 353
            G  NS  GGGGKGGQGMGM FGMGFGMG+GFGMG+ NNGADESN H QAKTT
Sbjct: 301 EGFTNSVGGGGGKGGQGMGMGFGMGFGMGVGFGMGNTNNGADESNGHAQAKTT 260

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity (%)  Description
KAG6584096.1    1.0e-73  74.93         hypothetical protein SDJN03_20028, partial [Cucurbita argyrosperma subsp. sorori...
KGN64507.1      1.1e-72  77.72         hypothetical protein Csa_013094 [Cucumis sativus]
XP_011648420.1  1.1e-72  77.72         glycine-rich protein DOT1 [Cucumis sativus]
XP_023001646.1  8.1e-71  73.54         glycine-rich cell wall structural protein 1-like [Cucurbita maxima]
ADN34063.1      5.5e-67  75.48         hypothetical protein [Cucumis melo subsp. melo] >KAA0056844.1 hypothetical prote...

ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A0A0LUS3      5.5e-73  77.72         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G062330 PE=4 SV=1
A0A6J1KJ79      3.9e-71  73.54         glycine-rich cell wall structural protein 1-like OS=Cucurbita maxima OX=3661 GN=...
A0A5D3BJP2      2.6e-67  75.48         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
E5GC64          2.6e-67  75.48         Uncharacterized protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1
A0A6J1EAJ0      2.2e-53  62.89         glycine-rich cell wall structural protein 2-like OS=Cucurbita moschata OX=3662 G...

InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term  IPR Description   Source       Source Term  Source Description   Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 331..356
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 336..356
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 54..77
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 235..258
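
Two of the predicted disorder regions overlap (331..356 and 336..356). A minimal sketch of collapsing them into non-overlapping regions follows. It assumes 1-based inclusive residue coordinates and also merges directly adjacent intervals, which is a design choice rather than anything the annotation specifies; `merge_intervals` is a hypothetical helper.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive (start, end) intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous region: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# mobidb-lite disorder predictions listed in the table above.
regions = [(331, 356), (336, 356), (54, 77), (235, 258)]
merged = merge_intervals(regions)
covered = sum(end - start + 1 for start, end in merged)
print(merged, covered)
```

With these inputs the four rows reduce to three distinct regions covering 74 residues in total.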

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Clc03G04870.1  Clc03G04870.1  mRNA