ClCG03G010940 (gene) Watermelon (Charleston Gray) v2.5

Overview
NameClCG03G010940
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC111022007
LocationCG_Chr03: 20139401 .. 20140447 (+)
RNA-Seq ExpressionClCG03G010940
SyntenyClCG03G010940
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCAAGGAAGATCTCTTGGAAAAAGAAGACTATTGGTTCCAGGAGAAGATTAAAATTATAGATCGACAAAGGTATGATGTTTAACATTTTCTTAACTTTTTGTATGTTTAATTCTTTTAAGAAATCAAAGAACAGAATCACACGCTTCCGCCGGGATTCAATTCCCTTCAATGAATTTTATTTGAGGATGCAACTCAAGAGAAGCATCTCAGTGGTCTAAGGAGTCCAGGTCAACTTCTCTGCTGGGGTGATTAATGAAGTATACATGGTTCCCAACGAGGAGGGGGACAAATGCAAAAGGAGGATGTATGCTCCATTTGAGGAGCAAGTGGCGGATGCACTGAAGTTTGTGGCCATCAGAGGAGTTGAGTGGGTTATGTCGCCTATTGGATGTAGGACATTGAGACCGGGAGATATTAGGGAGAACCTGGCCATTTGGGTTTACTTTATCAAGCGTCGGATTATGCCTACGACTTATGACACCACCATTGCTGTCGAAAGGATTATGCTGTTGTACAACATCTGATGAGTGCTCCCTATCAATTTGGGAGTCATCATAAGGAGAGACCTCATGGAGTGCAGCCTGTGCACCATAGGAAGACTGTTCTTCCCCCCTTTGATTGACAAACTTTGTGCCAAAGCCAGAGTGGTGGTTGCTGAGGAGGAGAACTGTAAGGTCAAGCTTGCCATTGATCTTGTATTGATCAGAAAGCTCCAAGGCAACTTGGCGTAGAAGAAAAGCAAAGTCTTTGGCAACCAGAGTGTTCACTCACCAACTAGAGCCACTCCCACATCCATTAGGACGTGCACTCTAAGAGCGGATGACATATTCTCTCCCTTGGAGCCAAAGGTGTTGCCACTAGAGCAAGAAGTGTCACCACCTATTGCTGAAGAGCAGAGAGAACCTAAGGCACCACCCGCATTGCCCGACAGGAACCAACTAAAGCTGCAACACCATCTATGCTCACCATCGTTGCTATTAGCTATCTGGCCGACCAGTTCGAGAGATTCATGGTGTAGTCTCGTGACTATTGGCACTACGTGA

mRNA sequence

ATGGCAAGGAAGATCTCTTGGAAAAAGAAGACTATTGGAGTCCAGGTCAACTTCTCTGCTGGGGTGATTAATGAAGTATACATGGTTCCCAACGAGGAGGGGGACAAATGCAAAAGGAGGATGTATGCTCCATTTGAGGAGCAAGTGGCGGATGCACTGAAGTTTGTGGCCATCAGAGGAGTTGAGTGGGTTATGTCGCCTATTGGATGTAGGACATTGAGACCGGGAGATATTAGGGAGAACCTGGCCATTTGGGTTTACTTTATCAAGCGTCGGATTATGCCTACGACTTATGACACCACCATTGCTGTCGAAAGGATTATGCTGAGAGACCTCATGGAGTGCAGCCTGTGCACCATAGGAAGACTGTTCTTCCCCCCTTTGATTGACAAACTTTGTGCCAAAGCCAGAGTGGTGGTTGCTGAGGAGGAGAACTGTAAGGTCAAGCTTGCCATTGATCTTAAGAAAAGCAAAGTCTTTGGCAACCAGAGTGTTCACTCACCAACTAGAGCCACTCCCACATCCATTAGGACGTGCACTCTAAGAGCGGATGACATATTCTCTCCCTTGGAGCCAAAGGTGTTGCCACTAGAGCAAGAAGTGTCACCACCTATTGCTGAAGAGCAGAGAGAACCTAAGGCACCACCCGCATTGCCCGACAGGAACCAACTAAAGCTGCAACACCATCTATGCTCACCATCGTTGCTATTAGCTATCTGGCCGACCAGTTCGAGAGATTCATGGTGTAGTCTCGTGACTATTGGCACTACGTGA

Coding sequence (CDS)

ATGGCAAGGAAGATCTCTTGGAAAAAGAAGACTATTGGAGTCCAGGTCAACTTCTCTGCTGGGGTGATTAATGAAGTATACATGGTTCCCAACGAGGAGGGGGACAAATGCAAAAGGAGGATGTATGCTCCATTTGAGGAGCAAGTGGCGGATGCACTGAAGTTTGTGGCCATCAGAGGAGTTGAGTGGGTTATGTCGCCTATTGGATGTAGGACATTGAGACCGGGAGATATTAGGGAGAACCTGGCCATTTGGGTTTACTTTATCAAGCGTCGGATTATGCCTACGACTTATGACACCACCATTGCTGTCGAAAGGATTATGCTGAGAGACCTCATGGAGTGCAGCCTGTGCACCATAGGAAGACTGTTCTTCCCCCCTTTGATTGACAAACTTTGTGCCAAAGCCAGAGTGGTGGTTGCTGAGGAGGAGAACTGTAAGGTCAAGCTTGCCATTGATCTTAAGAAAAGCAAAGTCTTTGGCAACCAGAGTGTTCACTCACCAACTAGAGCCACTCCCACATCCATTAGGACGTGCACTCTAAGAGCGGATGACATATTCTCTCCCTTGGAGCCAAAGGTGTTGCCACTAGAGCAAGAAGTGTCACCACCTATTGCTGAAGAGCAGAGAGAACCTAAGGCACCACCCGCATTGCCCGACAGGAACCAACTAAAGCTGCAACACCATCTATGCTCACCATCGTTGCTATTAGCTATCTGGCCGACCAGTTCGAGAGATTCATGGTGTAGTCTCGTGACTATTGGCACTACGTGA

Protein sequence

MARKISWKKKTIGVQVNFSAGVINEVYMVPNEEGDKCKRRMYAPFEEQVADALKFVAIRGVEWVMSPIGCRTLRPGDIRENLAIWVYFIKRRIMPTTYDTTIAVERIMLRDLMECSLCTIGRLFFPPLIDKLCAKARVVVAEEENCKVKLAIDLKKSKVFGNQSVHSPTRATPTSIRTCTLRADDIFSPLEPKVLPLEQEVSPPIAEEQREPKAPPALPDRNQLKLQHHLCSPSLLLAIWPTSSRDSWCSLVTIGTT
Homology
BLAST of ClCG03G010940 vs. NCBI nr
Match: PON50458.1 (hypothetical protein PanWU01x14_223230, partial [Parasponia andersonii])

HSP 1 Score: 72.8 bits (177), Expect = 4.9e-09
Identity = 60/212 (28.30%), Postives = 101/212 (47.64%), Query Frame = 0

Query: 13  GVQVNFSAGVINEVYMVPNEEGDKCKRRMYAPFEEQVADALKFVAIRGVEWVMSPIGCRT 72
           GVQV  SA  IN +Y +  +  D+    +    E ++A  L+ VAI G EW +S  G  T
Sbjct: 23  GVQVPLSAEAINTIYGL-GDLVDEHSEFVEDITEPELAMVLETVAIAGAEWNVSSQGVYT 82

Query: 73  LRPGDIRENLAIWVYFIKRRIMPTTYDTTIAVERIML-----------------RDLMEC 132
                +     IW +F+K R++PTT+   ++ ER++L                 R++  C
Sbjct: 83  CLRSSLNPPAKIWYHFLKSRLLPTTHGKIVSKERVLLLYSMLTGKSINMGRMIHREICAC 142

Query: 133 SLCTIGRLFFPPLIDKLCAKARV--VVAEEE--NCKVKLAIDLKKSKVFG-NQSVHSPTR 192
           +    G LFFP LI ++C  AR   +V EE+  N     AI + +    G  +  H P+ 
Sbjct: 143 AARKSGALFFPSLIIRMCRNARAPYLVNEEKLHNTGEIDAIAVARIAQEGPAEPSHQPSS 202

Query: 193 ATPTSIRTCTLRADDIFSPLEPKVLPLEQEVS 203
           + PT++ + +  ++ +      ++  LEQ +S
Sbjct: 203 SRPTAVSSSSTTSETL-----QQLKSLEQHIS 228

BLAST of ClCG03G010940 vs. NCBI nr
Match: PON62892.1 (hypothetical protein PanWU01x14_135680 [Parasponia andersonii])

HSP 1 Score: 72.8 bits (177), Expect = 4.9e-09
Identity = 47/151 (31.13%), Postives = 74/151 (49.01%), Query Frame = 0

Query: 13  GVQVNFSAGVINEVYMVPNEEGDKCKRRMYAPFEEQVADALKFVAIRGVEWVMSPIGCRT 72
           GVQV  S   IN +Y +  +  D+    + A  E ++A  L+ VAI G EW +S  G  T
Sbjct: 13  GVQVPLSTEAINTIYGL-GDPVDEHSEFVEAITEPELATVLETVAIAGAEWNVSSQGAYT 72

Query: 73  LRPGDIRENLAIWVYFIKRRIMPTTYDTTIAVERIML-----------------RDLMEC 132
                +     +W +F+K R++PTT+  T++ ER++L                 R++   
Sbjct: 73  CLRSSLNPPAKVWYHFLKSRLLPTTHGKTVSKERVLLLYSMLTGKSINVGQIIHREICAY 132

Query: 133 SLCTIGRLFFPPLIDKLCAKARV--VVAEEE 145
           +    G LFFP LI ++C  AR   +V EE+
Sbjct: 133 AARKSGALFFPSLITRMCCNARAPYLVNEEK 162

BLAST of ClCG03G010940 vs. NCBI nr
Match: XP_038904385.1 (uncharacterized protein LOC120090747 [Benincasa hispida])

HSP 1 Score: 71.6 bits (174), Expect = 1.1e-08
Identity = 38/108 (35.19%), Postives = 62/108 (57.41%), Query Frame = 0

Query: 13  GVQVNFSAGVINEVYM---VPNEEGDKCKRRMYAPFEEQVADALKFVAIRGVEWVMSPIG 72
           G  V FSA  INE+Y    +P+  G+K    +  P EE++ DAL+ +   G +W +S  G
Sbjct: 235 GCIVPFSARDINELYKMKDIPDASGNKI---IDDPQEEKMEDALRTLTQSGTQWSVSLKG 294

Query: 73  CRTLRPGDIRENLAIWVYFIKRRIMPTTYDTTIAVERIMLRDLMECSL 118
            +TL    +     +WVY +KRRI+PT++D T++ +R+M    + C +
Sbjct: 295 IKTLASSKLLPEARLWVYLVKRRIIPTSHDKTVSRDRVMAAYCIACGI 339

BLAST of ClCG03G010940 vs. NCBI nr
Match: EXB49850.1 (hypothetical protein L484_000844 [Morus notabilis])

HSP 1 Score: 67.4 bits (163), Expect = 2.1e-07
Identity = 42/159 (26.42%), Postives = 79/159 (49.69%), Query Frame = 0

Query: 14  VQVNFSAGVINEVYMVPNEEGDKCKRRMYAPFEEQVADALKFVAIRGVEWVMSPIGCRTL 73
           + + F++  IN V  +PN++ D+    +    EEQ+ + LK +AI G +W++S  G  T 
Sbjct: 123 IDITFTSNYINGVLGIPNQD-DEFVELITDAIEEQLKEVLKTIAILGAQWLLSAKGSYTC 182

Query: 74  RPGDIRENLAIWVYFIKRRIMPTTYDTTIA-----------------VERIMLRDLMECS 133
              +++    +W +F+  R++ +T+  TI+                 V R+++  +  C+
Sbjct: 183 NRHELQPAAKVWYHFLASRLLLSTHGKTISRNRAILLYAVLVGKPINVGRLIIDQIRACA 242

Query: 134 LCTIGRLFFPPLIDKLCAKARVV-VAEEENCKVKLAIDL 155
               G L+FP LI +LC ++ V   A E   +   A+DL
Sbjct: 243 EKGKGGLYFPSLISELCIQSHVAWEASEPRLRNTGAMDL 280

BLAST of ClCG03G010940 vs. NCBI nr
Match: PON46472.1 (hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii])

HSP 1 Score: 67.0 bits (162), Expect = 2.7e-07
Identity = 55/210 (26.19%), Postives = 94/210 (44.76%), Query Frame = 0

Query: 13  GVQVNFSAGVINEVYMVPNEEGDKCKRRMYAPFEEQVADALKFVAIRGVEWVMSPIGCRT 72
           GVQV++S   IN V+ +  +  D+    +    ++ +   L+ VA  G EW +S  G  T
Sbjct: 106 GVQVSWSEEAINAVFGL-GDPVDEHSEFIQNITQQDLITVLETVAAAGAEWNVSAQGAYT 165

Query: 73  LRPGDIRENLAIWVYFIKRRIMPTTYDTTIAVERIML-----------------RDLMEC 132
                +     +W +F+K R++PTT+  T++ +R++L                  ++  C
Sbjct: 166 CIRSALTPAAKVWYHFLKSRLLPTTHGKTVSKDRMLLLHSMLIGKSINVGRMIHSEIRAC 225

Query: 133 SLCTIGRLFFPPLIDKLCAKARV-VVAEEENCKVKLAIDLKKSKVFGNQSVHSPTRAT-- 192
           +    G LFFP LI +LC  AR   +  EE       ID   +      +   PT +T  
Sbjct: 226 AARKTGALFFPSLITRLCRNARAPFLVNEEKLHNTGEID---AIAVARIAQEGPTESTQQ 285

Query: 193 PTSIRTCTLRADDIFSPLEPKVLPLEQEVS 203
           P+S R  T  ++     +  ++  LEQ +S
Sbjct: 286 PSSSRPATASSNRTNGDILQQLKALEQRLS 311

BLAST of ClCG03G010940 vs. ExPASy TrEMBL
Match: A0A2P5BNT0 (Uncharacterized protein (Fragment) OS=Parasponia andersonii OX=3476 GN=PanWU01x14_223230 PE=4 SV=1)

HSP 1 Score: 72.8 bits (177), Expect = 2.4e-09
Identity = 60/212 (28.30%), Postives = 101/212 (47.64%), Query Frame = 0

Query: 13  GVQVNFSAGVINEVYMVPNEEGDKCKRRMYAPFEEQVADALKFVAIRGVEWVMSPIGCRT 72
           GVQV  SA  IN +Y +  +  D+    +    E ++A  L+ VAI G EW +S  G  T
Sbjct: 23  GVQVPLSAEAINTIYGL-GDLVDEHSEFVEDITEPELAMVLETVAIAGAEWNVSSQGVYT 82

Query: 73  LRPGDIRENLAIWVYFIKRRIMPTTYDTTIAVERIML-----------------RDLMEC 132
                +     IW +F+K R++PTT+   ++ ER++L                 R++  C
Sbjct: 83  CLRSSLNPPAKIWYHFLKSRLLPTTHGKIVSKERVLLLYSMLTGKSINMGRMIHREICAC 142

Query: 133 SLCTIGRLFFPPLIDKLCAKARV--VVAEEE--NCKVKLAIDLKKSKVFG-NQSVHSPTR 192
           +    G LFFP LI ++C  AR   +V EE+  N     AI + +    G  +  H P+ 
Sbjct: 143 AARKSGALFFPSLIIRMCRNARAPYLVNEEKLHNTGEIDAIAVARIAQEGPAEPSHQPSS 202

Query: 193 ATPTSIRTCTLRADDIFSPLEPKVLPLEQEVS 203
           + PT++ + +  ++ +      ++  LEQ +S
Sbjct: 203 SRPTAVSSSSTTSETL-----QQLKSLEQHIS 228

BLAST of ClCG03G010940 vs. ExPASy TrEMBL
Match: A0A2P5CPE8 (Uncharacterized protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_135680 PE=4 SV=1)

HSP 1 Score: 72.8 bits (177), Expect = 2.4e-09
Identity = 47/151 (31.13%), Postives = 74/151 (49.01%), Query Frame = 0

Query: 13  GVQVNFSAGVINEVYMVPNEEGDKCKRRMYAPFEEQVADALKFVAIRGVEWVMSPIGCRT 72
           GVQV  S   IN +Y +  +  D+    + A  E ++A  L+ VAI G EW +S  G  T
Sbjct: 13  GVQVPLSTEAINTIYGL-GDPVDEHSEFVEAITEPELATVLETVAIAGAEWNVSSQGAYT 72

Query: 73  LRPGDIRENLAIWVYFIKRRIMPTTYDTTIAVERIML-----------------RDLMEC 132
                +     +W +F+K R++PTT+  T++ ER++L                 R++   
Sbjct: 73  CLRSSLNPPAKVWYHFLKSRLLPTTHGKTVSKERVLLLYSMLTGKSINVGQIIHREICAY 132

Query: 133 SLCTIGRLFFPPLIDKLCAKARV--VVAEEE 145
           +    G LFFP LI ++C  AR   +V EE+
Sbjct: 133 AARKSGALFFPSLITRMCCNARAPYLVNEEK 162

BLAST of ClCG03G010940 vs. ExPASy TrEMBL
Match: W9RBS1 (Uncharacterized protein OS=Morus notabilis OX=981085 GN=L484_000844 PE=4 SV=1)

HSP 1 Score: 67.4 bits (163), Expect = 1.0e-07
Identity = 42/159 (26.42%), Postives = 79/159 (49.69%), Query Frame = 0

Query: 14  VQVNFSAGVINEVYMVPNEEGDKCKRRMYAPFEEQVADALKFVAIRGVEWVMSPIGCRTL 73
           + + F++  IN V  +PN++ D+    +    EEQ+ + LK +AI G +W++S  G  T 
Sbjct: 123 IDITFTSNYINGVLGIPNQD-DEFVELITDAIEEQLKEVLKTIAILGAQWLLSAKGSYTC 182

Query: 74  RPGDIRENLAIWVYFIKRRIMPTTYDTTIA-----------------VERIMLRDLMECS 133
              +++    +W +F+  R++ +T+  TI+                 V R+++  +  C+
Sbjct: 183 NRHELQPAAKVWYHFLASRLLLSTHGKTISRNRAILLYAVLVGKPINVGRLIIDQIRACA 242

Query: 134 LCTIGRLFFPPLIDKLCAKARVV-VAEEENCKVKLAIDL 155
               G L+FP LI +LC ++ V   A E   +   A+DL
Sbjct: 243 EKGKGGLYFPSLISELCIQSHVAWEASEPRLRNTGAMDL 280

BLAST of ClCG03G010940 vs. ExPASy TrEMBL
Match: A0A2P5BCG4 (Uncharacterized protein (Fragment) OS=Parasponia andersonii OX=3476 GN=PanWU01x14_251180 PE=4 SV=1)

HSP 1 Score: 67.0 bits (162), Expect = 1.3e-07
Identity = 55/210 (26.19%), Postives = 94/210 (44.76%), Query Frame = 0

Query: 13  GVQVNFSAGVINEVYMVPNEEGDKCKRRMYAPFEEQVADALKFVAIRGVEWVMSPIGCRT 72
           GVQV++S   IN V+ +  +  D+    +    ++ +   L+ VA  G EW +S  G  T
Sbjct: 106 GVQVSWSEEAINAVFGL-GDPVDEHSEFIQNITQQDLITVLETVAAAGAEWNVSAQGAYT 165

Query: 73  LRPGDIRENLAIWVYFIKRRIMPTTYDTTIAVERIML-----------------RDLMEC 132
                +     +W +F+K R++PTT+  T++ +R++L                  ++  C
Sbjct: 166 CIRSALTPAAKVWYHFLKSRLLPTTHGKTVSKDRMLLLHSMLIGKSINVGRMIHSEIRAC 225

Query: 133 SLCTIGRLFFPPLIDKLCAKARV-VVAEEENCKVKLAIDLKKSKVFGNQSVHSPTRAT-- 192
           +    G LFFP LI +LC  AR   +  EE       ID   +      +   PT +T  
Sbjct: 226 AARKTGALFFPSLITRLCRNARAPFLVNEEKLHNTGEID---AIAVARIAQEGPTESTQQ 285

Query: 193 PTSIRTCTLRADDIFSPLEPKVLPLEQEVS 203
           P+S R  T  ++     +  ++  LEQ +S
Sbjct: 286 PSSSRPATASSNRTNGDILQQLKALEQRLS 311

BLAST of ClCG03G010940 vs. ExPASy TrEMBL
Match: A0A2P5DXM3 (Uncharacterized protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_023740 PE=4 SV=1)

HSP 1 Score: 65.5 bits (158), Expect = 3.8e-07
Identity = 42/149 (28.19%), Postives = 71/149 (47.65%), Query Frame = 0

Query: 13  GVQVNFSAGVINEVYMVPNEEGDKCKRRMYAPFEEQVADALKFVAIRGVEWVMSPIGCRT 72
           GVQV++S   IN V+ +  +  D+    +    E ++   L+ VA  G EW +S  G  T
Sbjct: 36  GVQVSWSEEAINAVFGL-GDPVDEHSEFIENITEPELITVLETVAAAGAEWNVSAQGAYT 95

Query: 73  LRPGDIRENLAIWVYFIKRRIMPTTYDTTIAVERIML-----------------RDLMEC 132
                +     +W +F+K R++PTT+   ++ +R++L                  ++  C
Sbjct: 96  CIRSALTPAAKVWYHFLKSRLLPTTHGKIVSKDRMLLLHSMLNGKSINVGRMIHSEIRAC 155

Query: 133 SLCTIGRLFFPPLIDKLCAKARVVVAEEE 145
           +    G LFFP LI +LC  A  +V EE+
Sbjct: 156 AAQKTGALFFPSLITRLCRNAPFLVNEEK 183

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PON50458.14.9e-0928.30hypothetical protein PanWU01x14_223230, partial [Parasponia andersonii][more]
PON62892.14.9e-0931.13hypothetical protein PanWU01x14_135680 [Parasponia andersonii][more]
XP_038904385.11.1e-0835.19uncharacterized protein LOC120090747 [Benincasa hispida][more]
EXB49850.12.1e-0726.42hypothetical protein L484_000844 [Morus notabilis][more]
PON46472.12.7e-0726.19hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A2P5BNT02.4e-0928.30Uncharacterized protein (Fragment) OS=Parasponia andersonii OX=3476 GN=PanWU01x1... [more]
A0A2P5CPE82.4e-0931.13Uncharacterized protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_135680 PE... [more]
W9RBS11.0e-0726.42Uncharacterized protein OS=Morus notabilis OX=981085 GN=L484_000844 PE=4 SV=1[more]
A0A2P5BCG41.3e-0726.19Uncharacterized protein (Fragment) OS=Parasponia andersonii OX=3476 GN=PanWU01x1... [more]
A0A2P5DXM33.8e-0728.19Uncharacterized protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_023740 PE... [more]
Match NameE-valueIdentityDescription
Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
ClCG03G010940.1ClCG03G010940.1mRNA