ClCG01G013520 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG01G013520
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Retrovirus-related Pol polyprotein from transposon 412 family
Location: CG_Chr01: 26719480 .. 26721187 (-)
RNA-Seq Expression: ClCG01G013520
Synteny: ClCG01G013520
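
The Location field can be split into its parts and used to compute the span of the feature on the chromosome. The following is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Split 'SEQID: START .. END (STRAND)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1

seqid, start, end, strand = parse_location("CG_Chr01: 26719480 .. 26721187 (-)")
length = span_length(start, end)
print(seqid, strand, length)
```

Under that assumption the feature spans 1708 bases on the minus strand of CG_Chr01.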
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTGCACGAGGCACCAAAACTGCTGGTAATGTGGCTACATCAAGTCAGATGCATCATCAGACATAGGTTGCAGAAGAAAAGGGCAATGTTGCCGCATTATTTGTCCAGCATCGTGAGGGGAAAGCCCCCGTGGTTGAAGAGGAGGACATTGCTCCTCTTTTCGAGTATGAGACAAGAGGGTGAACACGTGAAGAAAGTTTGTTGGAGTCCACGTCAACTGAAGATTTGGAAGAGATTGAGGGGGCCTCGAGGCCTAAGGTGATGGCTCCAAGGAGAGTCACTAGATCTGCCTCTACAGCTAAGAGGAAGGCAAATTCGAGCTTAGCTGCCCTGACTCAGAAAAGGGTAAAAGCAAAGCATGCACAACGCATTGTGCTTTCTTCGTCTAGTGAATTTGAAGGCGAGAACGAGGTTGAGTAGCGAAAGGAGGCCCTTAAAAGGAGGAAAATTTTTTATGAATACAAATTCTCAGGGAGGAACGACACCCTACCCGAGTATGTGAGGATCTTAATTAAGCATTATGAGTGGGAAGCACTAACCAGCCCACCTACACCTGCCAGCAGATCTCTGGTTGAAGACTTCTACCTTGGGTTGCGGCCTGAGAAGGACATCTTTGTGGTCCAAGTAGTCCAAGTGGATTTCTCTCTTGAGGCCATCAATGAGGTGTATATGGTCCCTGATGAAGGAAGGGATGAGTGCAGAAAACGAATGTACACTCCCACAGAGGAGCAGGTGGCCAAGGCACTCAAGCTGGTGGCGCTTAAAGGAGCTAAGTGGGTCATATCGCCCACAGGATGCAGGACACTACGACCAGATGTCATAAGAGACAACCTTGCCATCTGGCTATATTTCGTCAAGCATCGCATCATGCCCACCACACATGACTCCACTATATCCTTGAAACGAGTCATGTTCCTCTACAACTTAAGAAAGGTCCTCCCAATTGTTGGGATTTATGTCCTAAATCTTGTAGTCTATGTAATTTTGTAAATAAATTAATAAATAAAGTATTTTTTAATTGTTATTTGTTTGCATAACTCAAAATCCAATAAACTAAGATCCTAGGTTATTTAATGTATCTTGAACGATATGTGGTTGACATATCAATGGATCATGTTCAAGAAATAACCTAAAAGATCTATAGTATATGGGTGAGATTGGGTGCCTCATCCTAGTACTATGGATACAACCTACTTTGTAAGTGTTACAAATGTTGTAAGTGTTACAAATGGTTTGATCCAAATCATTCGTGCAATGACATGTAAGTGGGGGTATCCTATACAGTGAGTCTGTATAAGATTGGACCACAAAATTAAATCTCTCTTTATAAATCCATTAACTGAAGAGTTTTATATTTCAAATGAGAACCATGTAACTTGATCTCAATCCTTAGTGAATTATGAACTCCTGTTCATGAGGATCATTTTTTGACTTGTATGGATGAGAGTGGTCTTAGCTACCAACTTAATATGTCTACCATTTTGGGAATGATTCCGAGTGAGGAGTTGGGAACATAAATTCACAAGATGAATTCACTTCTTCCCTACTCTAGGGAAAGTAGATAGGTTGTTCTCTTAAGTGATGATTTCGAAACTTGAACAATGAGGCCTTACCCTCTCACTGATTTGAGATGAACTTTGTTTATGGTTGGACCATAGACAGTATTGTTCATTAGAGGATCAGTGGTACTTAAGGTGCAGAGGTAA

mRNA sequence

ATGGTGCACGAGGCACCAAAACTGCTGGTTGCAGAAGAAAAGGGCAATGTTGCCGCATTATTTGTCCAGCATCGTGAGGGGAAAGCCCCCGTGGTTGAAGAGGAGGACATTGCTCCTCTTTTCGATTTGTTGGAGTCCACGTCAACTGAAGATTTGGAAGAGATTGAGGGGGCCTCGAGGCCTAAGGTGATGGCTCCAAGGAGAGTCACTAGATCTGCCTCTACAGCTAAGAGGAAGGCAAATTCGAGCTTAGCTGCCCTGACTCAGAAAAGGGTAAAAGCAAAGCATGCACAACGCATTGTGCTTTCTTCGTCTAGTGAATTTGAAGGCGAGAACGAGGTTGACCCACCTACACCTGCCAGCAGATCTCTGGTTGAAGACTTCTACCTTGGGTTGCGGCCTGAGAAGGACATCTTTGTGGTCCAAGTAGTCCAAGTGGATTTCTCTCTTGAGGCCATCAATGAGGTGTATATGGTCCCTGATGAAGGAAGGGATGAGTGCAGAAAACGAATGTACACTCCCACAGAGGAGCAGGTGGCCAAGGCACTCAAGCTGGTGGCGCTTAAAGGAGCTAAGTGGGTCATATCGCCCACAGGATGCAGGACACTACGACCAGATGTCATAAGAGACAACCTTGCCATCTGGCTATATTTCGTCAAGCATCGCATCATGCCCACCACACATGACTCCACTATATCCTTGAAACGAGTCATGTTCCTCTACAACTTAAGAAAGACAGTATTGTTCATTAGAGGATCAGTGGTACTTAAGGTGCAGAGGTAA

Coding sequence (CDS)

ATGGTGCACGAGGCACCAAAACTGCTGGTTGCAGAAGAAAAGGGCAATGTTGCCGCATTATTTGTCCAGCATCGTGAGGGGAAAGCCCCCGTGGTTGAAGAGGAGGACATTGCTCCTCTTTTCGATTTGTTGGAGTCCACGTCAACTGAAGATTTGGAAGAGATTGAGGGGGCCTCGAGGCCTAAGGTGATGGCTCCAAGGAGAGTCACTAGATCTGCCTCTACAGCTAAGAGGAAGGCAAATTCGAGCTTAGCTGCCCTGACTCAGAAAAGGGTAAAAGCAAAGCATGCACAACGCATTGTGCTTTCTTCGTCTAGTGAATTTGAAGGCGAGAACGAGGTTGACCCACCTACACCTGCCAGCAGATCTCTGGTTGAAGACTTCTACCTTGGGTTGCGGCCTGAGAAGGACATCTTTGTGGTCCAAGTAGTCCAAGTGGATTTCTCTCTTGAGGCCATCAATGAGGTGTATATGGTCCCTGATGAAGGAAGGGATGAGTGCAGAAAACGAATGTACACTCCCACAGAGGAGCAGGTGGCCAAGGCACTCAAGCTGGTGGCGCTTAAAGGAGCTAAGTGGGTCATATCGCCCACAGGATGCAGGACACTACGACCAGATGTCATAAGAGACAACCTTGCCATCTGGCTATATTTCGTCAAGCATCGCATCATGCCCACCACACATGACTCCACTATATCCTTGAAACGAGTCATGTTCCTCTACAACTTAAGAAAGACAGTATTGTTCATTAGAGGATCAGTGGTACTTAAGGTGCAGAGGTAA

Protein sequence

MVHEAPKLLVAEEKGNVAALFVQHREGKAPVVEEEDIAPLFDLLESTSTEDLEEIEGASRPKVMAPRRVTRSASTAKRKANSSLAALTQKRVKAKHAQRIVLSSSSEFEGENEVDPPTPASRSLVEDFYLGLRPEKDIFVVQVVQVDFSLEAINEVYMVPDEGRDECRKRMYTPTEEQVAKALKLVALKGAKWVISPTGCRTLRPDVIRDNLAIWLYFVKHRIMPTTHDSTISLKRVMFLYNLRKTVLFIRGSVVLKVQR
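
The protein sequence above should follow from the coding sequence under the standard genetic code. The sketch below is illustrative only: `translate` is a hypothetical helper, the codon table is the standard nuclear code (assumed to apply here), and only the first 45 and last 12 nucleotides of the CDS listing are used rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGTGCACGAGGCACCAAAACTGCTGGTTGCAGAAGAAAAGGGC"  # first 45 nt of the CDS
cds_end = "GTGCAGAGGTAA"  # last 12 nt of the CDS, assumed to be in frame

n_term = translate(cds_start)
c_term = translate(cds_end)
print(n_term, c_term)
```

The two fragments translate to MVHEAPKLLVAEEKG and VQR, matching the beginning and end of the listed protein.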
Homology
BLAST of ClCG01G013520 vs. NCBI nr
Match: PON50458.1 (hypothetical protein PanWU01x14_223230, partial [Parasponia andersonii])

HSP 1 Score: 79.3 bits (194), Expect = 5.3e-11
Identity = 46/121 (38.02%), Positives = 69/121 (57.02%), Query Frame = 0

Query: 124 LVEDFYLGL-RPEKDIFVVQVVQVDFSLEAINEVYMVPDEGRDECRKRMYTPTEEQVAKA 183
           LV +FY  L  P+ D   V+ VQV  S EAIN +Y + D   DE  + +   TE ++A  
Sbjct: 3   LVREFYTNLTNPDDDTVYVRGVQVPLSAEAINTIYGLGDL-VDEHSEFVEDITEPELAMV 62

Query: 184 LKLVALKGAKWVISPTGCRTLRPDVIRDNLAIWLYFVKHRIMPTTHDSTISLKRVMFLYN 243
           L+ VA+ GA+W +S  G  T     +     IW +F+K R++PTTH   +S +RV+ LY+
Sbjct: 63  LETVAIAGAEWNVSSQGVYTCLRSSLNPPAKIWYHFLKSRLLPTTHGKIVSKERVLLLYS 122

BLAST of ClCG01G013520 vs. NCBI nr
Match: XP_038904385.1 (uncharacterized protein LOC120090747 [Benincasa hispida])

HSP 1 Score: 77.0 bits (188), Expect = 2.6e-10
Identity = 43/120 (35.83%), Positives = 64/120 (53.33%), Query Frame = 0

Query: 123 SLVEDFYLG-LRPEKDIFVVQVVQVDFSLEAINEVYMVPDEGRDECRKRMYTPTEEQVAK 182
           ++V  FY G L   KD  +++   V FS   INE+Y + D       K +  P EE++  
Sbjct: 214 TVVRAFYKGRLHGTKDAVIMKGCIVPFSARDINELYKMKDIPDASGNKIIDDPQEEKMED 273

Query: 183 ALKLVALKGAKWVISPTGCRTLRPDVIRDNLAIWLYFVKHRIMPTTHDSTISLKRVMFLY 242
           AL+ +   G +W +S  G +TL    +     +W+Y VK RI+PT+HD T+S  RVM  Y
Sbjct: 274 ALRTLTQSGTQWSVSLKGIKTLASSKLLPEARLWVYLVKRRIIPTSHDKTVSRDRVMAAY 333

BLAST of ClCG01G013520 vs. NCBI nr
Match: PON62892.1 (hypothetical protein PanWU01x14_135680 [Parasponia andersonii])

HSP 1 Score: 77.0 bits (188), Expect = 2.6e-10
Identity = 41/110 (37.27%), Positives = 64/110 (58.18%), Query Frame = 0

Query: 134 PEKDIFVVQVVQVDFSLEAINEVYMVPDEGRDECRKRMYTPTEEQVAKALKLVALKGAKW 193
           P+ D   V+ VQV  S EAIN +Y + D   DE  + +   TE ++A  L+ VA+ GA+W
Sbjct: 4   PDDDTVYVRGVQVPLSTEAINTIYGLGDP-VDEHSEFVEAITEPELATVLETVAIAGAEW 63

Query: 194 VISPTGCRTLRPDVIRDNLAIWLYFVKHRIMPTTHDSTISLKRVMFLYNL 244
            +S  G  T     +     +W +F+K R++PTTH  T+S +RV+ LY++
Sbjct: 64  NVSSQGAYTCLRSSLNPPAKVWYHFLKSRLLPTTHGKTVSKERVLLLYSM 112

BLAST of ClCG01G013520 vs. NCBI nr
Match: XP_038876674.1 (chromatin assembly factor 1 subunit A-like, partial [Benincasa hispida])

HSP 1 Score: 77.0 bits (188), Expect = 2.6e-10
Identity = 67/250 (26.80%), Positives = 112/250 (44.80%), Query Frame = 0

Query: 12  EEKGNVAALFVQHREGKAPVVEEEDIAPLFDLLESTSTEDLEEIEGASRPKVMAPRRVTR 71
           + K   A  FV+  E +    EEE+        ++   E+   ++     K  A RR  +
Sbjct: 177 KSKRRKAGEFVEEFEREEKEEEEEE--------KNKKEEEERRLKEEKEKKCEANRRCLQ 236

Query: 72  SASTAKRKANSSLAALTQKRVKAKHAQRIVLSSSSEFEGENEVDPPTPASRSLVEDFYLG 131
                +           ++  K + A R+ L+     +G++      PA  S+V DFY G
Sbjct: 237 KKKGKELAEREKEEQRKEREKKQRQAARLALAKE---KGKSIEKSSKPA--SVVRDFYRG 296

Query: 132 -LRPEKDIFVVQVVQVDFSLEAINEVYMVPDEGRDECRKRMYTPTEEQVAKALKLVALKG 191
            L   +D   ++   V FS   INE+Y + D       K +  PTE+Q+  AL+++   G
Sbjct: 297 RLHGTRDAVTLKRETVSFSARDINEIYQMKDNPYASGNKIIDDPTEQQMEDALQVLMQLG 356

Query: 192 AKWVISPTGCRTLRPDVIRDNLAIWLYFVKHRIMPTTHDSTISLKRVMFLYNLRKTVLFI 251
            KW +S  G  TL    +     +W+Y VK R++ TTHD T+S  RVM  Y + +++   
Sbjct: 357 MKWSVSLKGVSTLASKSLLLEGRLWVYLVKKRLISTTHDKTVSRDRVMATYCIVRSIPID 413

Query: 252 RGSVVLKVQR 261
            G ++ +  R
Sbjct: 417 VGQLIARQLR 413

BLAST of ClCG01G013520 vs. NCBI nr
Match: PON70375.1 (hypothetical protein PanWU01x14_080440 [Parasponia andersonii])

HSP 1 Score: 74.3 bits (181), Expect = 1.7e-09
Identity = 43/130 (33.08%), Positives = 71/130 (54.62%), Query Frame = 0

Query: 115 DPPTPASRSLVEDFYLGL-RPEKDIFVVQVVQVDFSLEAINEVYMVPDEGRDECRKRMYT 174
           DP  P    LV +FY  +  P+ D   ++ VQV  S+EAIN ++ + D   DE  + +  
Sbjct: 102 DPIVP----LVREFYTNMTNPDDDTVYIRGVQVPLSVEAINTIFSLGDP-IDEHSEFVED 161

Query: 175 PTEEQVAKALKLVALKGAKWVISPTGCRTLRPDVIRDNLAIWLYFVKHRIMPTTHDSTIS 234
            T+ ++   L+ VA+ GA+W +S  G  T     +     +W +F+K R++PTTH  T+S
Sbjct: 162 ITKPELVIVLETVAIVGAEWNVSSQGAYTCLRSSLNPPAKVWYHFLKSRLLPTTHGKTVS 221

Query: 235 LKRVMFLYNL 244
            + V  LY++
Sbjct: 222 KEHVSLLYSM 226

BLAST of ClCG01G013520 vs. ExPASy TrEMBL
Match: A0A2P5BNT0 (Uncharacterized protein (Fragment) OS=Parasponia andersonii OX=3476 GN=PanWU01x14_223230 PE=4 SV=1)

HSP 1 Score: 79.3 bits (194), Expect = 2.6e-11
Identity = 46/121 (38.02%), Positives = 69/121 (57.02%), Query Frame = 0

Query: 124 LVEDFYLGL-RPEKDIFVVQVVQVDFSLEAINEVYMVPDEGRDECRKRMYTPTEEQVAKA 183
           LV +FY  L  P+ D   V+ VQV  S EAIN +Y + D   DE  + +   TE ++A  
Sbjct: 3   LVREFYTNLTNPDDDTVYVRGVQVPLSAEAINTIYGLGDL-VDEHSEFVEDITEPELAMV 62

Query: 184 LKLVALKGAKWVISPTGCRTLRPDVIRDNLAIWLYFVKHRIMPTTHDSTISLKRVMFLYN 243
           L+ VA+ GA+W +S  G  T     +     IW +F+K R++PTTH   +S +RV+ LY+
Sbjct: 63  LETVAIAGAEWNVSSQGVYTCLRSSLNPPAKIWYHFLKSRLLPTTHGKIVSKERVLLLYS 122

BLAST of ClCG01G013520 vs. ExPASy TrEMBL
Match: A0A2P5CPE8 (Uncharacterized protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_135680 PE=4 SV=1)

HSP 1 Score: 77.0 bits (188), Expect = 1.3e-10
Identity = 41/110 (37.27%), Positives = 64/110 (58.18%), Query Frame = 0

Query: 134 PEKDIFVVQVVQVDFSLEAINEVYMVPDEGRDECRKRMYTPTEEQVAKALKLVALKGAKW 193
           P+ D   V+ VQV  S EAIN +Y + D   DE  + +   TE ++A  L+ VA+ GA+W
Sbjct: 4   PDDDTVYVRGVQVPLSTEAINTIYGLGDP-VDEHSEFVEAITEPELATVLETVAIAGAEW 63

Query: 194 VISPTGCRTLRPDVIRDNLAIWLYFVKHRIMPTTHDSTISLKRVMFLYNL 244
            +S  G  T     +     +W +F+K R++PTTH  T+S +RV+ LY++
Sbjct: 64  NVSSQGAYTCLRSSLNPPAKVWYHFLKSRLLPTTHGKTVSKERVLLLYSM 112

BLAST of ClCG01G013520 vs. ExPASy TrEMBL
Match: A0A2P5BCG4 (Uncharacterized protein (Fragment) OS=Parasponia andersonii OX=3476 GN=PanWU01x14_251180 PE=4 SV=1)

HSP 1 Score: 74.3 bits (181), Expect = 8.2e-10
Identity = 45/130 (34.62%), Positives = 72/130 (55.38%), Query Frame = 0

Query: 115 DPPTPASRSLVEDFYLGLR-PEKDIFVVQVVQVDFSLEAINEVYMVPDEGRDECRKRMYT 174
           DP  P    LV +FY  L  PE++   V+ VQV +S EAIN V+ + D   DE  + +  
Sbjct: 81  DPIVP----LVREFYANLTDPEENTVYVRGVQVSWSEEAINAVFGLGDP-VDEHSEFIQN 140

Query: 175 PTEEQVAKALKLVALKGAKWVISPTGCRTLRPDVIRDNLAIWLYFVKHRIMPTTHDSTIS 234
            T++ +   L+ VA  GA+W +S  G  T     +     +W +F+K R++PTTH  T+S
Sbjct: 141 ITQQDLITVLETVAAAGAEWNVSAQGAYTCIRSALTPAAKVWYHFLKSRLLPTTHGKTVS 200

Query: 235 LKRVMFLYNL 244
             R++ L+++
Sbjct: 201 KDRMLLLHSM 205

BLAST of ClCG01G013520 vs. ExPASy TrEMBL
Match: A0A2P5DAQ2 (Uncharacterized protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_080440 PE=4 SV=1)

HSP 1 Score: 74.3 bits (181), Expect = 8.2e-10
Identity = 43/130 (33.08%), Positives = 71/130 (54.62%), Query Frame = 0

Query: 115 DPPTPASRSLVEDFYLGL-RPEKDIFVVQVVQVDFSLEAINEVYMVPDEGRDECRKRMYT 174
           DP  P    LV +FY  +  P+ D   ++ VQV  S+EAIN ++ + D   DE  + +  
Sbjct: 102 DPIVP----LVREFYTNMTNPDDDTVYIRGVQVPLSVEAINTIFSLGDP-IDEHSEFVED 161

Query: 175 PTEEQVAKALKLVALKGAKWVISPTGCRTLRPDVIRDNLAIWLYFVKHRIMPTTHDSTIS 234
            T+ ++   L+ VA+ GA+W +S  G  T     +     +W +F+K R++PTTH  T+S
Sbjct: 162 ITKPELVIVLETVAIVGAEWNVSSQGAYTCLRSSLNPPAKVWYHFLKSRLLPTTHGKTVS 221

Query: 235 LKRVMFLYNL 244
            + V  LY++
Sbjct: 222 KEHVSLLYSM 226

BLAST of ClCG01G013520 vs. ExPASy TrEMBL
Match: A0A5D2FHZ7 (Uncharacterized protein OS=Gossypium darwinii OX=34276 GN=ES288_A08G093000v1 PE=4 SV=1)

HSP 1 Score: 73.6 bits (179), Expect = 1.4e-09
Identity = 38/135 (28.15%), Positives = 74/135 (54.81%), Query Frame = 0

Query: 124 LVEDFYLGLRPEKDIFVV-QVVQVDFSLEAINEVYMVPDEGRDECRKRMYTPTEEQVAKA 183
           L+++FY  L  +    V+ +  +V F+  +IN+++ +PD   DE    M     + + + 
Sbjct: 77  LLQEFYASLTTQDANKVINRKKKVPFTSMSINDLFNLPDAEEDEHYPMMNNINWDFLQQV 136

Query: 184 LKLVALKGAKWVISPTGCRTLRPDVIRDNLAIWLYFVKHRIMPTTHDSTISLKRVMFLYN 243
           L +V   G++W+I   G  + R + ++    +W YFV++  MP +H STIS++R++ LY 
Sbjct: 137 LDVVTNSGSQWIIRKYGSHSCRREYLKSIAKVWFYFVRYSFMPISHSSTISMERMLLLYA 196

Query: 244 LRKTVLFIRGSVVLK 258
           + +      G ++LK
Sbjct: 197 ILREKYINVGEIILK 211

The following BLAST results are available for this feature:

NCBI nr
Match Name        E-value    Identity (%)   Description
PON50458.1        5.3e-11    38.02          hypothetical protein PanWU01x14_223230, partial [Parasponia andersonii]
XP_038904385.1    2.6e-10    35.83          uncharacterized protein LOC120090747 [Benincasa hispida]
PON62892.1        2.6e-10    37.27          hypothetical protein PanWU01x14_135680 [Parasponia andersonii]
XP_038876674.1    2.6e-10    26.80          chromatin assembly factor 1 subunit A-like, partial [Benincasa hispida]
PON70375.1        1.7e-09    33.08          hypothetical protein PanWU01x14_080440 [Parasponia andersonii]

ExPASy TrEMBL
Match Name        E-value    Identity (%)   Description
A0A2P5BNT0        2.6e-11    38.02          Uncharacterized protein (Fragment) OS=Parasponia andersonii OX=3476 GN=PanWU01x1...
A0A2P5CPE8        1.3e-10    37.27          Uncharacterized protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_135680 PE...
A0A2P5BCG4        8.2e-10    34.62          Uncharacterized protein (Fragment) OS=Parasponia andersonii OX=3476 GN=PanWU01x1...
A0A2P5DAQ2        8.2e-10    33.08          Uncharacterized protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_080440 PE...
A0A5D2FHZ7        1.4e-09    28.15          Uncharacterized protein OS=Gossypium darwinii OX=34276 GN=ES288_A08G093000v1 PE=...
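
The Identity column in the summary is the percentage form of the match counts reported on each HSP statistics line. As a rough sketch of that arithmetic, the hypothetical helper `parse_hsp_stats` below reads one such line (copied from the first NCBI nr hit) and recomputes both percentages; the regular expression is an assumption about the line layout shown on this page, not a general BLAST parser.

```python
import re

def parse_hsp_stats(line):
    """Extract identity and positive counts from an HSP statistics line."""
    m = re.search(
        r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)",
        line,
    )
    if m is None:
        raise ValueError(f"unrecognized HSP line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }

def percent(matches, aligned_length):
    # Percentage of aligned columns, rounded to two decimals as on the page.
    return round(100.0 * matches / aligned_length, 2)

stats = parse_hsp_stats(
    "Identity = 46/121 (38.02%), Positives = 69/121 (57.02%), Query Frame = 0"
)
identity_pct = percent(*stats["identity"][:2])
positives_pct = percent(*stats["positives"][:2])
print(identity_pct, positives_pct)
```

For this hit the recomputed values, 38.02 and 57.02, agree with the percentages printed on the statistics line and with the Identity column.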
Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
ClCG01G013520.1    ClCG01G013520.1    mRNA