CcUC08G146950 (gene) Watermelon (PI 537277) v1

Overview
Name: CcUC08G146950
Type: gene
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: keratin, type II cytoskeletal 3 isoform X1
Location: CicolChr08: 4558354 .. 4562101 (-)
RNA-Seq Expression: CcUC08G146950
Synteny: CcUC08G146950
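The Location field can be turned into a feature length, and the "(-)" strand marker means the feature lies on the reverse strand. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (a common convention for genome browsers, not stated on this page), and the helper names `span_length` and `reverse_complement` are hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def reverse_complement(seq: str) -> str:
    """Reverse-complement a DNA string (A/C/G/T only, case preserved)."""
    complement = str.maketrans("ACGTacgt", "TGCAtgca")
    return seq.translate(complement)[::-1]


# Coordinates copied from the Location field above.
GENE_START, GENE_END = 4558354, 4562101

print(span_length(GENE_START, GENE_END))  # 3748
print(reverse_complement("ATGCCT"))       # AGGCAT
```

Whether the sequences shown on this page are already given in the gene's own orientation is not stated here, so check that before applying `reverse_complement` to them.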
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AGAGTATTTTCTATTCATATTCCCGAATTACTTGTTCATATCATCTTCTTCTTATTCGTCCCGACGAGGAAGAAATCAGAAATGCCTCGAAATATGAACGGAAACAGACACCAAGGCCGAGAAGATGACAAAGGATTACTCTGGAATCTTCCAGTTCTTAAATCTTCCAGAATCGGAAAGTTAGGTCCCGCCTTCGGTCTGGGCGTTGGTTGTGGCGTCGGCTTTGGCATCGGCCTCATCGGAGGTCAATTCTTCATCATCTCTCTCTCTTTTCCTTTTTGTTTCAGTCGTAACTAACTTGTTATTCCCCTCGTTGGGTTCCTTTCATTTGACTCTTTCAATTATGTTACTTAAATTCAATTTTTCTTGTTCCACCCTGCGACTGATTCGGAGGTATTCTTGTTAAATTTGTTTGGTTTAGGCTGATTATCATTTTTAATTTGTATTTGTGCCATCTACTGTTTTTGACTCTTATCCTTCAATATTTCATGTTGCTTGCATTCTGGTTGAGAGTGTTTTTGGATGTATATTTTCCTACAAAATAGCTCAAATATGAATCCCCAAATAGACCCAGATGTAGTTCATTCAGGTGTTTCTTCTTTTTCTTTTAGTCTATCTGAGTGTTCATCTTTAAATTTCTTCTTGTCCTTTGGGGAGTACAATGTATTTTTTTTATTTCCGAGTGGAGGGAGGATGGTATGAGTGGTATACTTCTGGATGCTGTTGACTATTTCATGCTTCATCTTTACTATTCAACAATTTAGACATGCCTTTTGATGGATACATTCAAGAAGAACAGTAAATCTGTATACGGTTGTGAAGTTTACGGGCCCTTGACACTCAAATGTTGTAGGGTCAGACGGGTTCTCCCATGAGGTTAGTCGAGGTATGCGTTAAGCTGGCCTAGATGCTCACGGATATAAAAGAAAATGTCATTTATATGTGTATATATGTGAGGCCTTAGCCTCAATTTTATTTATCAACAATGGGGGAAGAATTCATGAGCTTCCCAATCTCTCTCTAACCGTGTGATTACAATAAGAGAAATCATTCATACCTCCTCAACAGCCTCTCATTCTCTATATACTCAAACTTCCCACATAACTAACAAATATTCCCCATTCCACCTCACTAATCAACTCATAACCAACTTATATTTTTACCCAAGTGTACACTTTTACCCCTGAGATCCTATCAATATGTTTTGTTTGTATGATTGTAATCAACATGAAAAGCTGTTATCATTTTCACAAGTTTATTATCCAACAACTTGTTTCTCTTTCTTTGCATTTCTTTTTTCCTGTAAATATATATATTCAGTTAACTTTTATGGTGGATCACTTTTTAGTCAGGAGGGGTGGGTGTTGAATTTTATTTATCTCAATTTTGAAGTCTACAAGCGTTACTTGACTACATGTTTTAAGTTCATCAATCGGTTTAGAAGGATTTTCAGGTGTCAAAAGCTTGAAATTTACTAGGCTTTGAAATTGGACTTCACAAAGTACTTCCATGTTGATGAGGGTTAAATTCTCTCTATCCTTTGTGATAGTATGGTTTTTAGTCTATAAGAAACGCCATTGATTTTCCTTTTGATGGTAGACATTATATGGATGAGAAAAAAGAGAACAAAAGGAACTGGGAAGATAAAGCTTCCTTGCAACTCCCACATGCACACTTACGCACACACATAGAAGGAGAGCTATCATGGATTTTAGGTTTCGGTGGAAGGTACGGCTTGATTGCCAACTAATAAATATTATCATTTTGTAAATTTTATTTTGAACTTTATACTGTGGTGAACTATCCTTGTTTAGTATGATTGAAACTACTCAAATATGAATTTGCCAAAAATTTTCTCCACTCTAGGTGCTGGATTTGGTCCAGGAATTCCTGGCTTACAACTTGGCTTTGGTCTTGGTGCTGGATGTGGAATTGGCTTAGGATTTGGCTATGGTGTTGGCAGGGGCATTGCTCAAGATGACAAACGGAGATACTCTAAC
GTCGGGGATCTATTAAATGGTCATCAAAGTCTTTTTCCTCAGTAAGTTCTAAAACCAACTGTTTTCAACTTGTGACCATCATGTTCTGTTAGTGATGCATTATTACACGTCAGGGATAATTAACTTTGTTGTGGGTTTGGGGACCTGGCTTTTTCCTAGCTAAGATGACAATTTTCTATGACGGATGATTCAAAATACTCCCATTCTTTATCTTTTCTCTCCTCTCTCTTCCAACTCTCTGTCCCATTTCTCTCTCTTTTTGCATAGTTTCTCCCTTCATGCTCTCTCTCCTTCCCTATTTTTCTCAATCTTATCAGTACCTTCGCTCTCTATATTGTGATTCACTTTCCACCTGAAAAGCACAAATACAAATACAAGACATGGATCCGACACGACAGGGACACGGCGACCCGTCATATTTTAAAAATCTAAGACATGACACAACAAGGACACATTTGTTAAAAAATACATTTAAAGAAATATACATCACTTTTATATCAAAAGAAAATTCAAAGTAAATGGATTGATGCATTTATATGTTTAAAAAACTTAGCTTGATGTATTTTGCACTAAAAAACTATTACTATTGTTGCATATGTATCTTTTTAGTCTATTCAACAAGTGTTCAATGTATGTCTAACTAACTAGTGTACGATACGTGTCCAACAAGTAATAGAGTGTCCAAGTGTCGGACACGAACATGTTAGTCAGACTAAAGTGTTTGTGCTTCTTAGCTTTCTGCCTTCACCCTTCCAACAGGACTTCTTACATTTAAAATATCAGTGGTCAGCTCATTTTCTTTCAATAAGGTAGAACAATTGGGTCCAGCTAAGATGTTGGTCTTGAAAAATTGCCAATTGTTTTTAAAATATTCATGTTTTTTCAAAATTGCAATATTACCATTGACCTTTTAAAATGTTTTTTTTTTTTTTTTCTGCTCAAATTACTAGCATAAATCTTTCTTGTTCCCGTTTGACTAGGGGGATGCTATCTTGTCCACCTCAACTGTCTTGGTGAAATCATTAAACTTAACTGTTAATTGTCCAGATCGTTAAACCACCCTATGTTTTACCATCTTGATTACATTTATAGGATTGTCAGAATAATAATTCCTTTTGAAATCATCTTCTAAGTTGCACAGGATTAAAATAGTGTTTTTCTTGAGCTGTAATAAGTTGCACAGAAACTAATTGATGCATTTCCACGATTTTCATATTAATATATAATCTGTGCATCTGTAGAACTTAAATATTCTGACATCAAGGAACATTTCTTGATTTTCATTTTTACTTGTCATCAGTAGGGACGATGTTGGCGCGCTTGTTGATGAGCTTGTCTTAAATACAAAGAGGCTTATACGAGCTACTTCAAGGGAGATTGACAAGTGGAAAAGATGAAATTACTTATCCTGTTTTTACAATGTACAAGTCCAATCCCCTGCCCCCATCATTAGAATGAACCAGAAAAAAAACATAGAAGAAAGAAACATGCTGGCTGTCTGTCTATATGGAGTTATATTACTCTGTGGCTTTTAGAGGCAAAAGTTATGTACAATTGTTTCGAAAGATGTATTTAGTTGCTCGAAATCTCATGTGCAGTGGATTTTGAGCAGTCATAAAATGTCGAATACTTGATGAATTAGTTTAGATGGTGTCTTTAAGGCTGATTTATCATGTCCATACTACAGAAAAAGCTGTTTCATGAGAGGATACTTGTTTCATTTTAAGCTTTTTCTGGCTCATTCTGAT

mRNA sequence

AGAGTATTTTCTATTCATATTCCCGAATTACTTGTTCATATCATCTTCTTCTTATTCGTCCCGACGAGGAAGAAATCAGAAATGCCTCGAAATATGAACGGAAACAGACACCAAGGCCGAGAAGATGACAAAGGATTACTCTGGAATCTTCCAGTTCTTAAATCTTCCAGAATCGGAAAGTTAGGTCCCGCCTTCGGTCTGGGCGTTGGTTGTGGCGTCGGCTTTGGCATCGGCCTCATCGGAGGTGCTGGATTTGGTCCAGGAATTCCTGGCTTACAACTTGGCTTTGGTCTTGGTGCTGGATGTGGAATTGGCTTAGGATTTGGCTATGGTGTTGGCAGGGGCATTGCTCAAGATGACAAACGGAGATACTCTAACGTCGGGGATCTATTAAATGGTCATCAAAGTCTTTTTCCTCATAGGGACGATGTTGGCGCGCTTGTTGATGAGCTTGTCTTAAATACAAAGAGGCTTATACGAGCTACTTCAAGGGAGATTGACAAGTGGAAAAGATGAAATTACTTATCCTGTTTTTACAATGTACAAGTCCAATCCCCTGCCCCCATCATTAGAATGAACCAGAAAAAAAACATAGAAGAAAGAAACATGCTGGCTGTCTGTCTATATGGAGTTATATTACTCTGTGGCTTTTAGAGGCAAAAGTTATGTACAATTGTTTCGAAAGATGTATTTAGTTGCTCGAAATCTCATGTGCAGTGGATTTTGAGCAGTCATAAAATGTCGAATACTTGATGAATTAGTTTAGATGGTGTCTTTAAGGCTGATTTATCATGTCCATACTACAGAAAAAGCTGTTTCATGAGAGGATACTTGTTTCATTTTAAGCTTTTTCTGGCTCATTCTGAT

Coding sequence (CDS)

ATGCCTCGAAATATGAACGGAAACAGACACCAAGGCCGAGAAGATGACAAAGGATTACTCTGGAATCTTCCAGTTCTTAAATCTTCCAGAATCGGAAAGTTAGGTCCCGCCTTCGGTCTGGGCGTTGGTTGTGGCGTCGGCTTTGGCATCGGCCTCATCGGAGGTGCTGGATTTGGTCCAGGAATTCCTGGCTTACAACTTGGCTTTGGTCTTGGTGCTGGATGTGGAATTGGCTTAGGATTTGGCTATGGTGTTGGCAGGGGCATTGCTCAAGATGACAAACGGAGATACTCTAACGTCGGGGATCTATTAAATGGTCATCAAAGTCTTTTTCCTCATAGGGACGATGTTGGCGCGCTTGTTGATGAGCTTGTCTTAAATACAAAGAGGCTTATACGAGCTACTTCAAGGGAGATTGACAAGTGGAAAAGATGA

Protein sequence

MPRNMNGNRHQGREDDKGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLIGGAGFGPGIPGLQLGFGLGAGCGIGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSLFPHRDDVGALVDELVLNTKRLIRATSREIDKWKR
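The protein sequence above should follow from the CDS under the standard genetic code. The following is a minimal sketch of that translation, written without external libraries; the function name `translate` and the compact codon-table construction are choices made for this example, not something the database provides.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight and last six codons of the CDS shown above.
print(translate("ATGCCTCGAAATATGAACGGAAAC"))  # MPRNMNGN
print(translate("GACAAGTGGAAAAGATGA"))        # DKWKR
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence is a quick consistency check on the record.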
Homology
BLAST of CcUC08G146950 vs. NCBI nr
Match: XP_023521208.1 (ATP-dependent RNA helicase glh-2 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 269.2 bits (687), Expect = 2.0e-68
Identity = 127/144 (88.19%), Positives = 139/144 (96.53%), Query Frame = 0

Query: 1   MPRNMNGNRHQGREDDKGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLIGGAGFGP 60
           MPRNMNG+RH+GRED+ G+LW LPVLKSSRIG+LGPAFGLGVGCGVGFGIGL+GGAGFGP
Sbjct: 1   MPRNMNGSRHEGREDESGMLWKLPVLKSSRIGRLGPAFGLGVGCGVGFGIGLVGGAGFGP 60

Query: 61  GIPGLQLGFGLGAGCGIGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSLFPHRDDVGAL 120
           GIPGLQLGFGLGAGCG+GLGFGYG GRGIAQDDKRRYSNVGDLLNGHQS+FPHRD++GAL
Sbjct: 61  GIPGLQLGFGLGAGCGVGLGFGYGAGRGIAQDDKRRYSNVGDLLNGHQSIFPHRDEIGAL 120

Query: 121 VDELVLNTKRLIRATSREIDKWKR 145
           VDEL LNTK+LIR T+REIDKWKR
Sbjct: 121 VDELALNTKKLIRVTAREIDKWKR 144
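The Identity figure in each HSP header is the number of identical aligned positions divided by the alignment length. As a rough illustration, the sketch below recomputes the percentages from the header counts and counts identities in one aligned segment; the helper names `percent` and `count_identities` are hypothetical, and gap handling is simplified to treating "-" as a non-match.

```python
def percent(matches: int, length: int) -> float:
    """Percentage rounded to two decimals, as shown in the HSP header."""
    return round(100.0 * matches / length, 2)


def count_identities(query: str, subject: str) -> int:
    """Count identical, non-gap positions in two equal-length aligned rows."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    return sum(q == s and q != "-" for q, s in zip(query, subject))


# Counts from the HSP header above.
print(percent(127, 144))  # 88.19
print(percent(139, 144))  # 96.53

# Last aligned segment of the same HSP (query row, then subject row).
print(count_identities("VDELVLNTKRLIRATSREIDKWKR",
                       "VDELALNTKKLIRVTAREIDKWKR"))  # 20
```

Summing `count_identities` over all segments of an HSP should reproduce the numerator of the Identity field.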

BLAST of CcUC08G146950 vs. NCBI nr
Match: XP_038889093.1 (ctenidin-1 isoform X1 [Benincasa hispida])

HSP 1 Score: 266.5 bits (680), Expect = 1.3e-67
Identity = 130/144 (90.28%), Positives = 138/144 (95.83%), Query Frame = 0

Query: 1   MPRNMNGNRHQGREDDKGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLIGGAGFGP 60
           MPRN NGNRHQG ED+ GLLWNLPVLKSSR GKLGPAFGLGVGCGVGFGIGL+GGAGFGP
Sbjct: 1   MPRNTNGNRHQGGEDEGGLLWNLPVLKSSRFGKLGPAFGLGVGCGVGFGIGLVGGAGFGP 60

Query: 61  GIPGLQLGFGLGAGCGIGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSLFPHRDDVGAL 120
           GIPGLQLGFGLGAGCG+GLGFGYGVGRGIAQDDKRRYSNVGDLL+GHQS+FP +DD+ AL
Sbjct: 61  GIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLLHGHQSIFPQKDDIVAL 120

Query: 121 VDELVLNTKRLIRATSREIDKWKR 145
           VDELVLN+KRLIRATSREIDKWKR
Sbjct: 121 VDELVLNSKRLIRATSREIDKWKR 144

BLAST of CcUC08G146950 vs. NCBI nr
Match: XP_016902400.1 (PREDICTED: glycine-rich cell wall structural protein 1 [Cucumis melo] >XP_016902401.1 PREDICTED: glycine-rich cell wall structural protein 1 [Cucumis melo])

HSP 1 Score: 266.5 bits (680), Expect = 1.3e-67
Identity = 128/144 (88.89%), Positives = 139/144 (96.53%), Query Frame = 0

Query: 1   MPRNMNGNRHQGREDDKGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLIGGAGFGP 60
           MPRNMNGNR++  ED++GLLWNLPVLKSSR G LGPAFGLGVGCGVGFGIGL+GGAGFGP
Sbjct: 1   MPRNMNGNRNRDGEDERGLLWNLPVLKSSRFGNLGPAFGLGVGCGVGFGIGLVGGAGFGP 60

Query: 61  GIPGLQLGFGLGAGCGIGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSLFPHRDDVGAL 120
           GIPGLQLGFGLGAGCG+GLGFGYGVGRGIAQDDKRRYSNVGD+L GHQS+FPH+DD+GAL
Sbjct: 61  GIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDVLRGHQSIFPHQDDIGAL 120

Query: 121 VDELVLNTKRLIRATSREIDKWKR 145
           VD+LVLNTKRLIRATSREIDKWKR
Sbjct: 121 VDDLVLNTKRLIRATSREIDKWKR 144

BLAST of CcUC08G146950 vs. NCBI nr
Match: XP_004141514.1 (glycine-rich cell wall structural protein 1 [Cucumis sativus] >KGN52595.1 hypothetical protein Csa_008740 [Cucumis sativus])

HSP 1 Score: 263.8 bits (673), Expect = 8.4e-67
Identity = 127/144 (88.19%), Positives = 138/144 (95.83%), Query Frame = 0

Query: 1   MPRNMNGNRHQGREDDKGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLIGGAGFGP 60
           MPRNMNGNR+Q  ED++GLLWNLPVLKSSR G LGPAFGLGVGCGVGFGIGL+GGAGFGP
Sbjct: 1   MPRNMNGNRNQDGEDERGLLWNLPVLKSSRFGNLGPAFGLGVGCGVGFGIGLVGGAGFGP 60

Query: 61  GIPGLQLGFGLGAGCGIGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSLFPHRDDVGAL 120
           GIPGLQLGFGLGAGCG+GLGFGYGVGRGIAQDDKRRYSNVGD+L G QS+FPH+DD+GAL
Sbjct: 61  GIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDVLRGRQSIFPHQDDIGAL 120

Query: 121 VDELVLNTKRLIRATSREIDKWKR 145
           VD+LVLNTKRLIRATS+EIDKWKR
Sbjct: 121 VDDLVLNTKRLIRATSKEIDKWKR 144

BLAST of CcUC08G146950 vs. NCBI nr
Match: XP_022932819.1 (keratin, type II cytoskeletal 2 epidermal isoform X1 [Cucurbita moschata])

HSP 1 Score: 261.9 bits (668), Expect = 3.2e-66
Identity = 124/144 (86.11%), Positives = 137/144 (95.14%), Query Frame = 0

Query: 1   MPRNMNGNRHQGREDDKGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLIGGAGFGP 60
           MPRNMNG+RH+G ED+ G+LW LPVLKSSRIG+LGPAFGLGVGCGVGFGIGL+GGAGFGP
Sbjct: 1   MPRNMNGSRHEGGEDESGMLWKLPVLKSSRIGRLGPAFGLGVGCGVGFGIGLVGGAGFGP 60

Query: 61  GIPGLQLGFGLGAGCGIGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSLFPHRDDVGAL 120
           GIPGLQLGFGLGAGCG+GLGFGYG GRGIAQDDKRRYSNVGDLLNG QS+FPHRD++GAL
Sbjct: 61  GIPGLQLGFGLGAGCGVGLGFGYGAGRGIAQDDKRRYSNVGDLLNGRQSIFPHRDEIGAL 120

Query: 121 VDELVLNTKRLIRATSREIDKWKR 145
           VDEL LNTK+LIR T++EIDKWKR
Sbjct: 121 VDELALNTKKLIRVTAQEIDKWKR 144

BLAST of CcUC08G146950 vs. ExPASy TrEMBL
Match: A0A1S4E347 (glycine-rich cell wall structural protein 1 OS=Cucumis melo OX=3656 GN=LOC103498610 PE=4 SV=1)

HSP 1 Score: 266.5 bits (680), Expect = 6.3e-68
Identity = 128/144 (88.89%), Positives = 139/144 (96.53%), Query Frame = 0

Query: 1   MPRNMNGNRHQGREDDKGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLIGGAGFGP 60
           MPRNMNGNR++  ED++GLLWNLPVLKSSR G LGPAFGLGVGCGVGFGIGL+GGAGFGP
Sbjct: 1   MPRNMNGNRNRDGEDERGLLWNLPVLKSSRFGNLGPAFGLGVGCGVGFGIGLVGGAGFGP 60

Query: 61  GIPGLQLGFGLGAGCGIGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSLFPHRDDVGAL 120
           GIPGLQLGFGLGAGCG+GLGFGYGVGRGIAQDDKRRYSNVGD+L GHQS+FPH+DD+GAL
Sbjct: 61  GIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDVLRGHQSIFPHQDDIGAL 120

Query: 121 VDELVLNTKRLIRATSREIDKWKR 145
           VD+LVLNTKRLIRATSREIDKWKR
Sbjct: 121 VDDLVLNTKRLIRATSREIDKWKR 144

BLAST of CcUC08G146950 vs. ExPASy TrEMBL
Match: A0A0A0KSM8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G646690 PE=4 SV=1)

HSP 1 Score: 263.8 bits (673), Expect = 4.1e-67
Identity = 127/144 (88.19%), Positives = 138/144 (95.83%), Query Frame = 0

Query: 1   MPRNMNGNRHQGREDDKGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLIGGAGFGP 60
           MPRNMNGNR+Q  ED++GLLWNLPVLKSSR G LGPAFGLGVGCGVGFGIGL+GGAGFGP
Sbjct: 1   MPRNMNGNRNQDGEDERGLLWNLPVLKSSRFGNLGPAFGLGVGCGVGFGIGLVGGAGFGP 60

Query: 61  GIPGLQLGFGLGAGCGIGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSLFPHRDDVGAL 120
           GIPGLQLGFGLGAGCG+GLGFGYGVGRGIAQDDKRRYSNVGD+L G QS+FPH+DD+GAL
Sbjct: 61  GIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDVLRGRQSIFPHQDDIGAL 120

Query: 121 VDELVLNTKRLIRATSREIDKWKR 145
           VD+LVLNTKRLIRATS+EIDKWKR
Sbjct: 121 VDDLVLNTKRLIRATSKEIDKWKR 144

BLAST of CcUC08G146950 vs. ExPASy TrEMBL
Match: A0A6J1EY31 (keratin, type II cytoskeletal 2 epidermal isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111439279 PE=4 SV=1)

HSP 1 Score: 261.9 bits (668), Expect = 1.5e-66
Identity = 124/144 (86.11%), Positives = 137/144 (95.14%), Query Frame = 0

Query: 1   MPRNMNGNRHQGREDDKGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLIGGAGFGP 60
           MPRNMNG+RH+G ED+ G+LW LPVLKSSRIG+LGPAFGLGVGCGVGFGIGL+GGAGFGP
Sbjct: 1   MPRNMNGSRHEGGEDESGMLWKLPVLKSSRIGRLGPAFGLGVGCGVGFGIGLVGGAGFGP 60

Query: 61  GIPGLQLGFGLGAGCGIGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSLFPHRDDVGAL 120
           GIPGLQLGFGLGAGCG+GLGFGYG GRGIAQDDKRRYSNVGDLLNG QS+FPHRD++GAL
Sbjct: 61  GIPGLQLGFGLGAGCGVGLGFGYGAGRGIAQDDKRRYSNVGDLLNGRQSIFPHRDEIGAL 120

Query: 121 VDELVLNTKRLIRATSREIDKWKR 145
           VDEL LNTK+LIR T++EIDKWKR
Sbjct: 121 VDELALNTKKLIRVTAQEIDKWKR 144

BLAST of CcUC08G146950 vs. ExPASy TrEMBL
Match: A0A6J1ICW2 (keratin, type II cytoskeletal 3 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111472645 PE=4 SV=1)

HSP 1 Score: 256.5 bits (654), Expect = 6.5e-65
Identity = 121/144 (84.03%), Positives = 136/144 (94.44%), Query Frame = 0

Query: 1   MPRNMNGNRHQGREDDKGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLIGGAGFGP 60
           MPRNMNG+R +G ED+ G+LW LPVLKSSRIG+LGPAFGLGVGCGVGFGIGL+GGAGFGP
Sbjct: 1   MPRNMNGSRREGGEDESGMLWKLPVLKSSRIGRLGPAFGLGVGCGVGFGIGLVGGAGFGP 60

Query: 61  GIPGLQLGFGLGAGCGIGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSLFPHRDDVGAL 120
           GIPGLQLGFGLGAGCG+GLGFGYG GRGIAQDDKRRYSNVGDLLNG QS+FPHRD++GA+
Sbjct: 61  GIPGLQLGFGLGAGCGVGLGFGYGAGRGIAQDDKRRYSNVGDLLNGRQSIFPHRDEIGAV 120

Query: 121 VDELVLNTKRLIRATSREIDKWKR 145
           VDEL LNTK+LI+ T++EIDKWKR
Sbjct: 121 VDELALNTKKLIQVTAKEIDKWKR 144

BLAST of CcUC08G146950 vs. ExPASy TrEMBL
Match: A0A6J1F360 (keratin, type II cytoskeletal 2 epidermal isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111439279 PE=4 SV=1)

HSP 1 Score: 252.7 bits (644), Expect = 9.4e-64
Identity = 122/144 (84.72%), Positives = 136/144 (94.44%), Query Frame = 0

Query: 1   MPRNMNGNRHQGREDDKGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLIGGAGFGP 60
           MPRNMNG+RH+G ED+ G+LW LPVLKSSRIG+LGPAFGLGVGCGVGFGIGL+GGAGFGP
Sbjct: 1   MPRNMNGSRHEGGEDESGMLWKLPVLKSSRIGRLGPAFGLGVGCGVGFGIGLVGGAGFGP 60

Query: 61  GIPGLQLGFGLGAGCGIGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSLFPHRDDVGAL 120
           GIPGLQLGFGLGAGCG+GLGFGYG GRGIAQDDKRRYSNVGDLLNG QS+FP +D++GAL
Sbjct: 61  GIPGLQLGFGLGAGCGVGLGFGYGAGRGIAQDDKRRYSNVGDLLNGRQSIFP-QDEIGAL 120

Query: 121 VDELVLNTKRLIRATSREIDKWKR 145
           VDEL LNTK+LIR T++EIDKWKR
Sbjct: 121 VDELALNTKKLIRVTAQEIDKWKR 143

BLAST of CcUC08G146950 vs. TAIR 10
Match: AT4G10330.1 (glycine-rich protein)

HSP 1 Score: 167.9 bits (424), Expect = 5.8e-42
Identity = 84/135 (62.22%), Positives = 101/135 (74.81%), Query Frame = 0

Query: 8   NRHQGREDDKGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLIGGAGFGPGIPGLQL 67
           NR +  +DDKGLLW LP ++   IGK+GPAFGLGVGCG GFG GLIGG GFGPG+PGLQ 
Sbjct: 3   NRRRTGDDDKGLLWKLPQVRIRDIGKVGPAFGLGVGCGFGFGAGLIGGVGFGPGVPGLQF 62

Query: 68  GFGLGAGCGIGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSLFPHRDDVGALVDELVLN 127
           G G GAGCGIG+GFGYGVGRG A D  R Y NVG          P  ++V +L+DELV++
Sbjct: 63  GLGFGAGCGIGVGFGYGVGRGAAYDHSRSYYNVGK---------PSLNEVDSLIDELVVS 122

Query: 128 TKRLIRATSREIDKW 143
           TK+L++AT+ EIDKW
Sbjct: 123 TKKLVKATTNEIDKW 128

BLAST of CcUC08G146950 vs. TAIR 10
Match: AT1G66820.1 (glycine-rich protein)

HSP 1 Score: 55.5 bits (132), Expect = 4.2e-08
Identity = 36/70 (51.43%), Positives = 43/70 (61.43%), Query Frame = 0

Query: 34  LGPAFGLGVGCGVGFGIGLIGGAGFGPGIPGLQ-----LGFGLGAGCGIGLGFGYGVGRG 93
           +GP  G G+GCG G GIGL GG G G    GL      LGFG+G G G G G+G+GVG G
Sbjct: 39  VGPGIGGGIGCGAGIGIGLSGGLGIGAS-EGLDHSNVVLGFGIGCGIGFGFGYGFGVGGG 98

Query: 94  IAQDD-KRRY 98
            + DD K R+
Sbjct: 99  YSFDDIKERF 107

BLAST of CcUC08G146950 vs. TAIR 10
Match: AT1G27695.1 (glycine-rich protein)

HSP 1 Score: 42.0 bits (97), Expect = 4.8e-04
Identity = 29/53 (54.72%), Positives = 34/53 (64.15%), Query Frame = 0

Query: 34 LGPAFGLGVGCGVGFGIGLIGGAGFGPGIPGLQLGFGLGAGCGIGLGFGYGVG 87
          +G  FG GVGC  GFG+G     GFG G+P   LG G G GCG+GLG G+G G
Sbjct: 9  VGVGFGFGVGC--GFGVGW----GFG-GMPMNILGVGAGGGCGVGLGLGWGFG 54

BLAST of CcUC08G146950 vs. TAIR 10
Match: AT1G27695.2 (glycine-rich protein)

HSP 1 Score: 42.0 bits (97), Expect = 4.8e-04
Identity = 29/53 (54.72%), Positives = 34/53 (64.15%), Query Frame = 0

Query: 34 LGPAFGLGVGCGVGFGIGLIGGAGFGPGIPGLQLGFGLGAGCGIGLGFGYGVG 87
          +G  FG GVGC  GFG+G     GFG G+P   LG G G GCG+GLG G+G G
Sbjct: 9  VGVGFGFGVGC--GFGVGW----GFG-GMPMNILGVGAGGGCGVGLGLGWGFG 54

BLAST of CcUC08G146950 vs. TAIR 10
Match: AT4G14301.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G23450.1); Has 101 Blast hits to 93 proteins in 34 species: Archae - 0; Bacteria - 27; Metazoa - 16; Fungi - 5; Plants - 34; Viruses - 3; Other Eukaryotes - 16 (source: NCBI BLink). )

HSP 1 Score: 41.6 bits (96), Expect = 6.3e-04
Identity = 32/58 (55.17%), Positives = 36/58 (62.07%), Query Frame = 0

Query: 31  IGKLGPAFGLGVGCGVGFGIGLIGGAGFGPGIPGLQLGFGLGAGCGIGLGFGYGVGRG 89
           IGK G  FG G+G G GFG G+  G GFG GI G   G G G G G G GFG G+G+G
Sbjct: 53  IGKGGGIFGHGIGKGGGFGGGISKGGGFGGGI-GKGGGIGGGIGKGKGWGFGGGIGKG 109

The following BLAST results are available for this feature:

NCBI nr
Match Name       E-value   Identity (%)   Description
XP_023521208.1   2.0e-68   88.19          ATP-dependent RNA helicase glh-2 isoform X1 [Cucurbita pepo subsp. pepo]
XP_038889093.1   1.3e-67   90.28          ctenidin-1 isoform X1 [Benincasa hispida]
XP_016902400.1   1.3e-67   88.89          PREDICTED: glycine-rich cell wall structural protein 1 [Cucumis melo] >XP_016902...
XP_004141514.1   8.4e-67   88.19          glycine-rich cell wall structural protein 1 [Cucumis sativus] >KGN52595.1 hypoth...
XP_022932819.1   3.2e-66   86.11          keratin, type II cytoskeletal 2 epidermal isoform X1 [Cucurbita moschata]

ExPASy TrEMBL
Match Name       E-value   Identity (%)   Description
A0A1S4E347       6.3e-68   88.89          glycine-rich cell wall structural protein 1 OS=Cucumis melo OX=3656 GN=LOC103498...
A0A0A0KSM8       4.1e-67   88.19          Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G646690 PE=4 SV=1
A0A6J1EY31       1.5e-66   86.11          keratin, type II cytoskeletal 2 epidermal isoform X1 OS=Cucurbita moschata OX=36...
A0A6J1ICW2       6.5e-65   84.03          keratin, type II cytoskeletal 3 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111...
A0A6J1F360       9.4e-64   84.72          keratin, type II cytoskeletal 2 epidermal isoform X2 OS=Cucurbita moschata OX=36...

TAIR 10
Match Name       E-value   Identity (%)   Description
AT4G10330.1      5.8e-42   62.22          glycine-rich protein
AT1G66820.1      4.2e-08   51.43          glycine-rich protein
AT1G27695.1      4.8e-04   54.72          glycine-rich protein
AT1G27695.2      4.8e-04   54.72          glycine-rich protein
AT4G14301.1      6.3e-04   55.17          unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Watermelon (PI 537277) v1
Date Performed: 2022-01-31
IPR Term   IPR Description    Source    Source Term     Source Description     Alignment
None       No IPR available   PANTHER   PTHR34201       GLYCINE-RICH PROTEIN   coord: 5..144
None       No IPR available   PANTHER   PTHR34201:SF1   GLYCINE-RICH PROTEIN   coord: 5..144
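The PANTHER family assignment (GLYCINE-RICH PROTEIN) can be sanity-checked against the residue composition of the protein sequence listed on this page. The sketch below is only an illustration; `residue_fraction` is a hypothetical helper, and no threshold for calling a protein "glycine-rich" is implied.

```python
from collections import Counter


def residue_fraction(protein: str, residue: str = "G") -> float:
    """Fraction of positions in `protein` occupied by `residue`."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty protein sequence")
    return Counter(protein)[residue.upper()] / len(protein)


# Protein sequence copied from the Sequences section of this page.
PROTEIN = (
    "MPRNMNGNRHQGREDDKGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLIGGAGFGP"
    "GIPGLQLGFGLGAGCGIGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSLFPHRDDVGAL"
    "VDELVLNTKRLIRATSREIDKWKR"
)

print(f"Glycine fraction: {residue_fraction(PROTEIN):.2%}")
```

The same function can be pointed at any other residue, for example `residue_fraction(PROTEIN, "L")`.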

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CcUC08G146950.1   CcUC08G146950.1   mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0060429       epithelium development
biological_process   GO:0043588       skin development
cellular_component   GO:0016021       integral component of membrane