ClCG02G009235 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG02G009235
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Retrotransposon protein
Location: CG_Chr02: 13218070 .. 13218645 (+)
RNA-Seq Expression: ClCG02G009235
Synteny: ClCG02G009235
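
The Location field combines a sequence ID, start and end coordinates, and a strand in one string. A minimal Python sketch of unpacking it follows, assuming the `ID: start .. end (strand)` layout shown in this record and assuming the coordinates are inclusive. The names `Location` and `parse_location` are hypothetical helpers for illustration, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a string such as 'CG_Chr02: 13218070 .. 13218645 (+)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(m.group(1), int(m.group(2)), int(m.group(3)), m.group(4))


loc = parse_location("CG_Chr02: 13218070 .. 13218645 (+)")
# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span = loc.end - loc.start + 1
print(loc.seqid, loc.strand, span)
```

Under the inclusive-coordinate assumption, the span is 576 bp.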
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCATGAGAAGGTGCCTGGGTGCGCACTTAATCAAAACACGATCGAATGCAAGGTGAGGACTCTTAAGAAACAATACAATGCAGTATCAAAGATGTTATGTCAATCGGGATTTGGCTGGAACGAGGAGTTCAAATTTGTCCAGGTCGAGAGGGAGATTTTCGATATCTGGGTTCAGGTAAAATTCTTGAAAACAGAATATACTGGTATGTCTTAAATATGTAAATATTAATAATCTGTACGTGTATAAATGCAGAGTCATCCTAGCGCAAAGGGGATGTGGAACAAGCCGTTTCCCCATTACGATGACCTCTCCACCGTCTTTGGGAGAGATAGAGCTGTAGGACAATCAAGTGAGGACCCACACGTGATGACGAGTAATGCATTCAGAGAGTTTGAAGATGAGATTCGACTTGGATTGCAGAACTGTCCCACACCTGATGTTTGCCAAACAGATTCACCATCAAATCCGGATGGAACGATGAAGATACAACAGACCAATCTACAGGTAGAGCAACACCTGCGGAGTCATCTCGAGGAAGCAAGAGGAAGAGGTCATCGTTCCAAGCTGAAATGA

mRNA sequence

ATGCATGAGAAGGTGCCTGGGTGCGCACTTAATCAAAACACGATCGAATGCAAGGTGAGGACTCTTAAGAAACAATACAATGCAGTATCAAAGATGTTATGTCAATCGGGATTTGGCTGGAACGAGGAGTTCAAATTTGTCCAGGTCGAGAGGGAGATTTTCGATATCTGGGTTCAGAGTCATCCTAGCGCAAAGGGGATGTGGAACAAGCCGTTTCCCCATTACGATGACCTCTCCACCGTCTTTGGGAGAGATAGAGCTGTAGGACAATCAAGTGAGGACCCACACGTGATGACGAGTAATGCATTCAGAGAGTTTGAAGATGAGATTCGACTTGGATTGCAGAACTGTCCCACACCTGATGTTTGCCAAACAGATTCACCATCAAATCCGGATGGAACGATGAAGATACAACAGACCAATCTACAGGTAGAGCAACACCTGCGGAGTCATCTCGAGGAAGCAAGAGGAAGAGGTCATCGTTCCAAGCTGAAATGA
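
The gene and mRNA sequences differ by a single internal segment, the intron. The sketch below shows one way to locate such a segment by comparing the two sequences from both ends. It uses only short fragments copied from around the splice junction (the fragment variable names and the function `find_single_intron` are hypothetical), and it assumes exactly one intron and no other differences between the sequences.

```python
from typing import Tuple


def find_single_intron(gene: str, mrna: str) -> Tuple[int, int]:
    """Return the 0-based [start, end) of the one segment in gene absent from mrna."""
    # Longest common prefix of the two sequences.
    prefix = 0
    while prefix < len(mrna) and gene[prefix] == mrna[prefix]:
        prefix += 1
    # Longest common suffix, capped so it cannot overlap the prefix.
    suffix = 0
    max_suffix = len(mrna) - prefix
    while suffix < max_suffix and gene[-1 - suffix] == mrna[-1 - suffix]:
        suffix += 1
    if prefix + suffix != len(mrna):
        raise ValueError("sequences differ by more than one inserted segment")
    return prefix, len(gene) - suffix


# Fragments around the junction, copied from the sequences in this record.
exon1_tail = "GGGTTCAG"
intron_frag = ("GTAAAATTCTTGAAAACAGAATATACTGGTATGTCTTAAATATGTAAATATTAATAATC"
               "TGTACGTGTATAAATGCAG")
exon2_head = "AGTCATCC"

gene_frag = exon1_tail + intron_frag + exon2_head
mrna_frag = exon1_tail + exon2_head

start, end = find_single_intron(gene_frag, mrna_frag)
print(gene_frag[start:end][:2], gene_frag[start:end][-2:])
```

When the bases flanking an intron repeat, the boundary found this way can shift by a few positions, so the result is best read as one consistent placement rather than a confirmed splice site. Here the recovered segment begins with GT and ends with AG.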

Coding sequence (CDS)

ATGCATGAGAAGGTGCCTGGGTGCGCACTTAATCAAAACACGATCGAATGCAAGGTGAGGACTCTTAAGAAACAATACAATGCAGTATCAAAGATGTTATGTCAATCGGGATTTGGCTGGAACGAGGAGTTCAAATTTGTCCAGGTCGAGAGGGAGATTTTCGATATCTGGGTTCAGAGTCATCCTAGCGCAAAGGGGATGTGGAACAAGCCGTTTCCCCATTACGATGACCTCTCCACCGTCTTTGGGAGAGATAGAGCTGTAGGACAATCAAGTGAGGACCCACACGTGATGACGAGTAATGCATTCAGAGAGTTTGAAGATGAGATTCGACTTGGATTGCAGAACTGTCCCACACCTGATGTTTGCCAAACAGATTCACCATCAAATCCGGATGGAACGATGAAGATACAACAGACCAATCTACAGGTAGAGCAACACCTGCGGAGTCATCTCGAGGAAGCAAGAGGAAGAGGTCATCGTTCCAAGCTGAAATGA

Protein sequence

MHEKVPGCALNQNTIECKVRTLKKQYNAVSKMLCQSGFGWNEEFKFVQVEREIFDIWVQSHPSAKGMWNKPFPHYDDLSTVFGRDRAVGQSSEDPHVMTSNAFREFEDEIRLGLQNCPTPDVCQTDSPSNPDGTMKIQQTNLQVEQHLRSHLEEARGRGHRSKLK
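
The protein sequence corresponds to the CDS read codon by codon. As an illustration, the sketch below translates the first 30 and last 24 nucleotides of the CDS, assuming the standard genetic code; `translate` and the fragment variable names are hypothetical and written only for this example.

```python
from itertools import product

# Standard genetic code, with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


cds_head = "ATGCATGAGAAGGTGCCTGGGTGCGCACTT"  # first 30 nt of the CDS
cds_tail = "GGTCATCGTTCCAAGCTGAAATGA"        # last 24 nt, ending in the stop codon

print(translate(cds_head))  # MHEKVPGCAL
print(translate(cds_tail))  # GHRSKLK
```

Both fragments match the start and end of the protein sequence listed above.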
Homology
BLAST of ClCG02G009235 vs. NCBI nr
Match: XP_038902479.1 (uncharacterized protein At2g29880-like [Benincasa hispida])

HSP 1 Score: 231.9 bits (590), Expect = 4.1e-57
Identity = 105/132 (79.55%), Positives = 117/132 (88.64%), Query Frame = 0

Query: 1   MHEKVPGCALNQNTIECKVRTLKKQYNAVSKMLCQSGFGWNEEFKFVQVEREIFDIWVQS 60
           +HEKVPGC LNQNTIECKVR+LKKQYN VS+ML QSGF WNEEFK VQVEREIFD+WV S
Sbjct: 51  LHEKVPGCTLNQNTIECKVRSLKKQYNIVSEMLSQSGFDWNEEFKCVQVEREIFDLWVLS 110

Query: 61  HPSAKGMWNKPFPHYDDLSTVFGRDRAVGQSSEDPHVMTSNAFREFEDEIRLGLQNCPTP 120
           HP+AK MWNKPFPHYDD STVFG+DR VG+SSEDP+VM +NAFREFEDEIRLG Q+C TP
Sbjct: 111 HPNAKRMWNKPFPHYDDFSTVFGKDRVVGKSSEDPYVMATNAFREFEDEIRLGSQDCQTP 170

Query: 121 DVCQTDSPSNPD 133
           +V QT+SP N D
Sbjct: 171 EVRQTESPLNQD 182
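
Each HSP summary line reports identity and positive counts as fractions of the alignment length, followed by the percentage. A small sketch of recovering those percentages from the counts is shown below. The function names are hypothetical, and the regular expression simply assumes the `Label = n/d` layout used in this report.

```python
import re
from typing import Dict, Tuple


def parse_hsp_fractions(line: str) -> Dict[str, Tuple[int, int]]:
    """Collect every 'Label = n/d' pair found on an HSP summary line."""
    return {
        label: (int(num), int(den))
        for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line)
    }


def percent(num: int, den: int) -> float:
    """Percentage rounded to two decimals, as printed in the report."""
    return round(100.0 * num / den, 2)


line = "Identity = 105/132 (79.55%), Positives = 117/132 (88.64%), Query Frame = 0"
fractions = parse_hsp_fractions(line)
for label, (num, den) in fractions.items():
    print(label, percent(num, den))
```

For the first hit, 105/132 and 117/132 give 79.55 and 88.64, in agreement with the values in parentheses.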

BLAST of ClCG02G009235 vs. NCBI nr
Match: XP_038880837.1 (uncharacterized protein LOC120072528 [Benincasa hispida])

HSP 1 Score: 227.3 bits (578), Expect = 1.0e-55
Identity = 104/125 (83.20%), Positives = 114/125 (91.20%), Query Frame = 0

Query: 1   MHEKVPGCALNQNTIECKVRTLKKQYNAVSKMLCQSGFGWNEEFKFVQVEREIFDIWVQS 60
           +HEKVPGCALNQNTIECKVR+LKKQYNAVS+ML QSGFGWNEEFK VQVEREI D+WV+S
Sbjct: 14  LHEKVPGCALNQNTIECKVRSLKKQYNAVSEMLSQSGFGWNEEFKCVQVEREILDLWVRS 73

Query: 61  HPSAKGMWNKPFPHYDDLSTVFGRDRAVGQSSEDPHVMTSNAFREFEDEIRLGLQNCPTP 120
           HP+AK MWNK F HYDDLSTVFG+DR VGQSSEDP+VM +NAFREFEDEIRLG Q+C TP
Sbjct: 74  HPNAKEMWNKSFSHYDDLSTVFGKDRVVGQSSEDPYVMATNAFREFEDEIRLGSQDCRTP 133

Query: 121 DVCQT 126
           DV QT
Sbjct: 134 DVRQT 138

BLAST of ClCG02G009235 vs. NCBI nr
Match: XP_038875070.1 (uncharacterized protein LOC120067596 [Benincasa hispida])

HSP 1 Score: 226.1 bits (575), Expect = 2.2e-55
Identity = 104/132 (78.79%), Positives = 117/132 (88.64%), Query Frame = 0

Query: 1   MHEKVPGCALNQNTIECKVRTLKKQYNAVSKMLCQSGFGWNEEFKFVQVEREIFDIWVQS 60
           +HEKVPGC LNQNTI+CKVR+LKKQYNAVS+ML QS F WNEEFK VQVEREIF++WVQS
Sbjct: 51  LHEKVPGCTLNQNTIKCKVRSLKKQYNAVSEMLSQSRFDWNEEFKCVQVEREIFNLWVQS 110

Query: 61  HPSAKGMWNKPFPHYDDLSTVFGRDRAVGQSSEDPHVMTSNAFREFEDEIRLGLQNCPTP 120
           HP+ KGMWNK F HYDDLSTVF +DRAVGQSSEDP+VM +NAFREFEDEIRLG Q+C TP
Sbjct: 111 HPNLKGMWNKSFSHYDDLSTVFRKDRAVGQSSEDPYVMATNAFREFEDEIRLGSQDCHTP 170

Query: 121 DVCQTDSPSNPD 133
           +V QT+SP N D
Sbjct: 171 EVRQTESPLNQD 182

BLAST of ClCG02G009235 vs. NCBI nr
Match: XP_038895773.1 (uncharacterized protein LOC120083935 [Benincasa hispida])

HSP 1 Score: 218.8 bits (556), Expect = 3.5e-53
Identity = 101/120 (84.17%), Positives = 111/120 (92.50%), Query Frame = 0

Query: 2   HEKVPGCALNQNTIECKVRTLKKQYNAVSKMLCQSGFGWNEEFKFVQVEREIFDIWVQSH 61
           HEKV GCALNQNTIECKVR+LKKQ NAVS+ML QSGF WNEEFK VQVEREIFD WV+SH
Sbjct: 90  HEKVLGCALNQNTIECKVRSLKKQCNAVSEMLSQSGFDWNEEFKCVQVEREIFDPWVRSH 149

Query: 62  PSAKGMWNKPFPHYDDLSTVFGRDRAVGQSSEDPHVMTSNAFREFEDEIRLGLQNCPTPD 121
           P+AKGMWNKPFPHYDDLSTVFG+ +AVGQSSEDP+VMT+NAFREFEDEIRLG Q+C TP+
Sbjct: 150 PNAKGMWNKPFPHYDDLSTVFGKYKAVGQSSEDPYVMTTNAFREFEDEIRLGSQDCHTPE 209

BLAST of ClCG02G009235 vs. NCBI nr
Match: XP_038895852.1 (uncharacterized protein LOC120084021 [Benincasa hispida])

HSP 1 Score: 197.6 bits (501), Expect = 8.5e-47
Identity = 96/133 (72.18%), Positives = 108/133 (81.20%), Query Frame = 0

Query: 1   MHEKVPGCALNQNTIECKVRTLKKQYNAVSKMLCQSGFGWNEEFKFVQVEREIFDIWVQS 60
           +HEKV  CALNQNTIECKVR+LKKQYNAVS+ML QSGF WNEEFK            VQS
Sbjct: 51  LHEKVLECALNQNTIECKVRSLKKQYNAVSEMLSQSGFNWNEEFK-----------CVQS 110

Query: 61  HPSAKGMWNKPFPHYDDLSTVFGRDRAVGQSSEDPHVMTSNAFREFEDEIRLGLQNCPTP 120
           HP+AKGMWNK FPHYDDLSTVFG+DRAVGQSSE+P++M +NAFREFED+IRLG Q+  TP
Sbjct: 111 HPNAKGMWNKSFPHYDDLSTVFGKDRAVGQSSENPYMMATNAFREFEDDIRLGSQDYHTP 170

Query: 121 DVCQTDSPSNPDG 134
           +V QT SP N DG
Sbjct: 171 EVRQTKSPLNQDG 172

BLAST of ClCG02G009235 vs. ExPASy TrEMBL
Match: A0A5A7U0H7 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold648G002060 PE=4 SV=1)

HSP 1 Score: 122.5 bits (306), Expect = 1.7e-24
Identity = 59/116 (50.86%), Positives = 82/116 (70.69%), Query Frame = 0

Query: 1   MHEKVPGCALNQ-NTIECKVRTLKKQYNAVSKML--CQSGFGWNEEFKFVQVEREIFDIW 60
           M EK+PG  + + +TI+C V++LKK Y+A+++M     SGFGWNEEF+ +  ER++FD W
Sbjct: 52  MAEKLPGTNIQESSTIDCHVKSLKKTYHAIAEMRGPSCSGFGWNEEFQCIIAERDLFDSW 111

Query: 61  VQSHPSAKGMWNKPFPHYDDLSTVFGRDRAVGQSSEDPHVMTSNAFREFEDEIRLG 114
           ++SHP+AKG+ +K FP+YDDLS VFG+DRA G  SE    + SN    F D I LG
Sbjct: 112 IKSHPAAKGLLHKSFPYYDDLSYVFGKDRATGARSETFPNVGSNVSNMFNDTIPLG 167

BLAST of ClCG02G009235 vs. ExPASy TrEMBL
Match: A0A1S3B4L3 (uncharacterized protein LOC103485953 OS=Cucumis melo OX=3656 GN=LOC103485953 PE=4 SV=1)

HSP 1 Score: 122.5 bits (306), Expect = 1.7e-24
Identity = 59/116 (50.86%), Positives = 82/116 (70.69%), Query Frame = 0

Query: 1   MHEKVPGCALNQ-NTIECKVRTLKKQYNAVSKML--CQSGFGWNEEFKFVQVEREIFDIW 60
           M EK+PG  + + +TI+C V++LKK Y+A+++M     SGFGWNEEF+ +  ER++FD W
Sbjct: 52  MAEKLPGTNIQESSTIDCHVKSLKKTYHAIAEMRGPSCSGFGWNEEFQCIIAERDLFDSW 111

Query: 61  VQSHPSAKGMWNKPFPHYDDLSTVFGRDRAVGQSSEDPHVMTSNAFREFEDEIRLG 114
           ++SHP+AKG+ +K FP+YDDLS VFG+DRA G  SE    + SN    F D I LG
Sbjct: 112 IKSHPAAKGLLHKSFPYYDDLSYVFGKDRATGARSETFPNVGSNVSNMFNDTIPLG 167

BLAST of ClCG02G009235 vs. ExPASy TrEMBL
Match: A0A5A7VKT2 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold538G001150 PE=4 SV=1)

HSP 1 Score: 120.2 bits (300), Expect = 8.3e-24
Identity = 63/159 (39.62%), Positives = 94/159 (59.12%), Query Frame = 0

Query: 1   MHEKVPGCALNQNT-IECKVRTLKKQYNAVSKML--CQSGFGWNEEFKFVQVEREIFDIW 60
           M EK+PGC +   T I+C+++TLK+ + A+++M     SGFGWN+E K +  E+E+FD W
Sbjct: 52  MAEKLPGCQVRATTVIDCRIKTLKRTFQAIAEMRGPACSGFGWNDEVKCIIAEKELFDNW 111

Query: 61  VQSHPSAKGMWNKPFPHYDDLSTVFGRDRAVGQSSEDPHVMTSNAFREFEDEIRLGLQNC 120
           V+SHP+AKG+ NKPFP+YD+L+ VFGRDRA G+ +E    + SN      D   +G  N 
Sbjct: 112 VRSHPAAKGLLNKPFPYYDELTYVFGRDRATGRFAETFADVGSNELGGGYDRFDIGDGNE 171

Query: 121 PTPDVCQTDSPSNPDGTMK----------IQQTNLQVEQ 147
             P +  +       G+ +          + QTN Q+ Q
Sbjct: 172 DFPPMTGSSGSKRKRGSQRDFDVEAIHLALDQTNEQLRQ 210

BLAST of ClCG02G009235 vs. ExPASy TrEMBL
Match: A0A5A7SS81 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1327G00060 PE=4 SV=1)

HSP 1 Score: 119.4 bits (298), Expect = 1.4e-23
Identity = 69/165 (41.82%), Positives = 103/165 (62.42%), Query Frame = 0

Query: 1   MHEKVPGCALNQNT-IECKVRTLKKQYNAVSKM--LCQSGFGWNEEFKFVQVEREIFDIW 60
           M EK+PGC ++  T I+ +++TLK+ + A+++M  L  SGFGWN+E K +  ++E+FD W
Sbjct: 2   MAEKLPGCQVHTTTIIDYRIKTLKQTFQAIAEMRELACSGFGWNDEQKCIIAKKELFDNW 61

Query: 61  VQSHPSAKGMWNKPFPHYDDLSTVFGRDRAVGQSSEDPHVMTSNAFREFEDEIRLGLQNC 120
           V+SHP+AKG  NKPFP+YD+L+ VFGRDRA G+ +E    + SN   E+ DE  +G  N 
Sbjct: 62  VRSHPAAKGFLNKPFPYYDELTYVFGRDRATGRFAETFADVGSNEPSEY-DEFDMGDGNE 121

Query: 121 PTPDV-CQTDSPSNPDGTMKIQQTNLQVEQHLRSHLEEARGRGHR 162
             P V  Q   PS  D  ++  + +L +E   RS   + +   HR
Sbjct: 122 EFPPVYSQGIDPSQDD--IRASRPSLALEGRTRSSGSKRKRESHR 163

BLAST of ClCG02G009235 vs. ExPASy TrEMBL
Match: A0A5A7VCA3 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G001180 PE=4 SV=1)

HSP 1 Score: 118.2 bits (295), Expect = 3.2e-23
Identity = 69/168 (41.07%), Positives = 99/168 (58.93%), Query Frame = 0

Query: 1   MHEKVPGCALNQNT-IECKVRTLKKQYNAVSKML--CQSGFGWNEEFKFVQVEREIFDIW 60
           M EK+PGC +   T I+C+++TLK+ + A++KM     SGFGWN+E K +  E+E+FD W
Sbjct: 52  MAEKLPGCQVRATTVIDCRIKTLKRTFQAIAKMRGPACSGFGWNDEEKCIVAEKELFDNW 111

Query: 61  VQSHPSAKGMWNKPFPHYDDLSTVFGRDRAVGQSSEDPHVMTSNAFREFEDEIRLGLQNC 120
           V+SHP+AKG+ NKPFP+YD+L+ VFGRDRA G+ +E    + SN      D   +G  N 
Sbjct: 112 VRSHPAAKGLLNKPFPYYDELTYVFGRDRATGRFAETFADVGSNEPGGGYDRFDMGDGN- 171

Query: 121 PTPDVCQTDSPSNPDGTMKIQQTNLQVEQHLRSHLEEARGRGHRSKLK 166
                   D PS  +  + I Q +++  +   SH  E R     SK K
Sbjct: 172 -------EDFPSVYNQGVDISQDDVRASR--PSHASEGRTGSSGSKRK 209

BLAST of ClCG02G009235 vs. TAIR 10
Match: AT2G24960.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 12 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G02210.2); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 56.6 bits (135), Expect = 2.2e-08
Identity = 23/84 (27.38%), Positives = 49/84 (58.33%), Query Frame = 0

Query: 7   GCALNQNTIECKVRTLKKQYNAVSKMLCQSGFGWNEEFKFVQVEREIFDIWVQSHPSAKG 66
           G   N++ ++ + + L++ YN +  +L Q+GF W+     V  + +I++ ++Q+HP A+ 
Sbjct: 370 GSQHNKDVLKNRYKHLRRLYNDIKFLLEQNGFSWDARRDMVIADDDIWNTYIQAHPEARS 429

Query: 67  MWNKPFPHYDDLSTVFGRDRAVGQ 91
              K  P Y +L  +FG++ + G+
Sbjct: 430 YRVKTIPSYPNLCFIFGKETSDGR 453

BLAST of ClCG02G009235 vs. TAIR 10
Match: AT2G24960.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G02210.2); Has 1453 Blast hits to 509 proteins in 26 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 39; Plants - 1363; Viruses - 0; Other Eukaryotes - 50 (source: NCBI BLink). )

HSP 1 Score: 55.1 bits (131), Expect = 6.3e-08
Identity = 29/104 (27.88%), Positives = 54/104 (51.92%), Query Frame = 0

Query: 7   GCALNQNTIECKVRTLKKQYNAVSKMLCQSGFGWNEEFKFVQVEREIFDIWVQSHPSAKG 66
           G   +++ ++ +   L KQYN V  +L   GF W++  + V  +  ++ +++++HP A+ 
Sbjct: 58  GSQYDKDVLKSRYTNLWKQYNDVKCLLDHGGFVWDQTHQTVIGDDSLWSLYLKAHPEARV 117

Query: 67  MWNKPFPHYDDLSTVFGRDRAVGQSSEDPHVMTSNAFREFEDEI 111
              KP  ++ DL  ++G   A G+ S   H +      E EDEI
Sbjct: 118 YKTKPVLNFSDLCLIYGYTVADGRYSMSSHDL------EIEDEI 155

The following BLAST results are available for this feature:
NCBI nr
Match Name      E-value  Identity (%)  Description
XP_038902479.1  4.1e-57  79.55         uncharacterized protein At2g29880-like [Benincasa hispida]
XP_038880837.1  1.0e-55  83.20         uncharacterized protein LOC120072528 [Benincasa hispida]
XP_038875070.1  2.2e-55  78.79         uncharacterized protein LOC120067596 [Benincasa hispida]
XP_038895773.1  3.5e-53  84.17         uncharacterized protein LOC120083935 [Benincasa hispida]
XP_038895852.1  8.5e-47  72.18         uncharacterized protein LOC120084021 [Benincasa hispida]

ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A5A7U0H7      1.7e-24  50.86         Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3B4L3      1.7e-24  50.86         uncharacterized protein LOC103485953 OS=Cucumis melo OX=3656 GN=LOC103485953 PE=...
A0A5A7VKT2      8.3e-24  39.62         Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A5A7SS81      1.4e-23  41.82         Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A5A7VCA3      3.2e-23  41.07         Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...

TAIR 10
Match Name      E-value  Identity (%)  Description
AT2G24960.2     2.2e-08  27.38         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G24960.1     6.3e-08  27.88         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term  IPR Description   Source   Source Term  Source Description                                Alignment
None      No IPR available  PANTHER  PTHR46250    MYB/SANT-LIKE DNA-BINDING DOMAIN PROTEIN-RELATED  coord: 1..136

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG02G009235.1  ClCG02G009235.1  mRNA