ClCG01G011330 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG01G011330
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Reverse transcriptase
Location: CG_Chr01: 18600195 .. 18601150 (-)
RNA-Seq Expression: ClCG01G011330
Synteny: ClCG01G011330
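The Location field in the overview can be split into chromosome, coordinates and strand, and the span length derived from it. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention; the page itself does not state this). The function name `parse_location` is hypothetical and used only for illustration.

```python
import re

# Pattern for strings such as "CG_Chr01: 18600195 .. 18601150 (-)".
LOCATION = re.compile(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")

def parse_location(text):
    """Split a location string into its parts and compute the span length.

    Assumes 1-based, inclusive coordinates, so length = end - start + 1.
    """
    match = LOCATION.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("CG_Chr01: 18600195 .. 18601150 (-)")
print(loc["seqid"], loc["strand"], loc["length"])  # CG_Chr01 - 956
```

Under that assumption the feature spans 956 bp on the minus strand of CG_Chr01.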
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCCGACAAACCAAACGAGGTAGGCAAGCAGACGCCGAAACATCGCATGTTATCGGTGGATTTCGAGAAGGTTCGGAGGGAGATTCTAGTAACCCTCAGGTAAATATGGAAGATCAAATCTTTAATAGAATAGCCCAAGGGTTGGCATCTAGTGCTGAAACAGCTCAAGGAGATCCCGAGAGAAAGTATGAGATCAAGAGGTTCAAAGCCTTAGAAGCTCAGGTATTTGAGGGAATCACTGATCCTGAAGATGTCGAAGTATGACTAAACCAGATAGAGAAATGCTTCAGGGTCATGCGTTGTCCTGAAGAGAGGAAACTTGATTTGGTTACTTATTTGCTCTAGAAAGGAGTGGAGGATTGGTGGAAGTTGATAGAAAATAGGGGAAATGATGCAGATCCTCTAACGTGGGTAGCCTTTAAGAAGGCTTTTCAAGATAAATATTACCCACATACATATTGTGACGAGAAGAGGGATGAGTTTTTAAGATTGGTGCAAGGGATATGACTGTAGCTGAGTACGAAAAGAAGTTTACAAAGCTAGCTAAGTACGCTCTAACAATGATTGTGGATGAAGCTGATAAGTGCAAGTGTTTTGTGTTAGGTTTGCATAGTAAAATCCGTACTCCAGTTACAACAAGTATAGTGTGGACAAACTTTGCACGAATGATTGAGGCTGCCATGAGAGCTAAAAAGAGTATAACTGGTGGGAAGCCTGAGAGGGAAGGAAATAAAGGAGATTATAGAACGGGACAAGCTTTGGGTACATCTCGGGAACAGTCTTACCGAGGTGATAGTAGAAGGTTTGTACCCAGTGTTTCTAGTAATAGAAGTTTTAAAGCTTGATTTGGCGGATCATACTTTACAAGACCTAGAGGTGGTGGTTCAAGACAAACTCAGAGGGGTTTTCCTCAGACTTTGGGACCTCCAAGTGGTTCATAGTTTAGGCAACTGA

mRNA sequence

ATGCCCGACAAACCAAACGAGGTAGGCAAGCAGACGCCGAAACATCGCATGTTATCGGTGGATTTCGAGAAGGTTCGGAGGGAGATTCTAGTAACCCTCAGTGCTGAAACAGCTCAAGGAGATCCCGAGAGAAAGTATGAGATCAAGAGGTTCAAAGCCTTAGAAGCTCAGGTATTTGAGGGAATCACTGATCCTGAAGATGTCGAAAAAGGAGTGGAGGATTGGTGGAAGTTGATAGAAAATAGGGGAAATGATGCAGATCCTCTAACGTGGATTGGTGCAAGGGATATGACTGTAGCTGAGTACGAAAAGAAGTTTACAAAGCTAGCTAAGTACGCTCTAACAATGATTGTGGATGAAGCTGATAAGTGCAAGTGTTTTGTGTTAGGTTTGCATAGTAAAATCCGTACTCCAGTTACAACAAGTATAGTGTGGACAAACTTTGCACGAATGATTGAGGCTGCCATGAGAGCTAAAAAGAGTATAACTGGTGGGAAGCCTGAGAGGGAAGGAAATAAAGGAGATTATAGAACGGGACAAGCTTTGGGTACATCTCGGGAACAGTCTTACCGAGGTGATAGTAGAAGGTTTGTACCCAGTGTTTCTAACCTAGAGGTGGTGGTTCAAGACAAACTCAGAGGGGTTTTCCTCAGACTTTGGGACCTCCAAGTGGTTCATAGTTTAGGCAACTGA

Coding sequence (CDS)

ATGCCCGACAAACCAAACGAGGTAGGCAAGCAGACGCCGAAACATCGCATGTTATCGGTGGATTTCGAGAAGGTTCGGAGGGAGATTCTAGTAACCCTCAGTGCTGAAACAGCTCAAGGAGATCCCGAGAGAAAGTATGAGATCAAGAGGTTCAAAGCCTTAGAAGCTCAGGTATTTGAGGGAATCACTGATCCTGAAGATGTCGAAAAAGGAGTGGAGGATTGGTGGAAGTTGATAGAAAATAGGGGAAATGATGCAGATCCTCTAACGTGGATTGGTGCAAGGGATATGACTGTAGCTGAGTACGAAAAGAAGTTTACAAAGCTAGCTAAGTACGCTCTAACAATGATTGTGGATGAAGCTGATAAGTGCAAGTGTTTTGTGTTAGGTTTGCATAGTAAAATCCGTACTCCAGTTACAACAAGTATAGTGTGGACAAACTTTGCACGAATGATTGAGGCTGCCATGAGAGCTAAAAAGAGTATAACTGGTGGGAAGCCTGAGAGGGAAGGAAATAAAGGAGATTATAGAACGGGACAAGCTTTGGGTACATCTCGGGAACAGTCTTACCGAGGTGATAGTAGAAGGTTTGTACCCAGTGTTTCTAACCTAGAGGTGGTGGTTCAAGACAAACTCAGAGGGGTTTTCCTCAGACTTTGGGACCTCCAAGTGGTTCATAGTTTAGGCAACTGA

Protein sequence

MPDKPNEVGKQTPKHRMLSVDFEKVRREILVTLSAETAQGDPERKYEIKRFKALEAQVFEGITDPEDVEKGVEDWWKLIENRGNDADPLTWIGARDMTVAEYEKKFTKLAKYALTMIVDEADKCKCFVLGLHSKIRTPVTTSIVWTNFARMIEAAMRAKKSITGGKPEREGNKGDYRTGQALGTSREQSYRGDSRRFVPSVSNLEVVVQDKLRGVFLRLWDLQVVHSLGN
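
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1); the page does not say which table was used, and the helper name `translate` is illustrative only. It is applied to the first seven and the last three codons of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the polypeptide
            break
        protein.append(amino_acid)
    return "".join(protein)

print(translate("ATGCCCGACAAACCAAACGAG"))  # first seven codons -> MPDKPNE
print(translate("GGCAACTGA"))              # last three codons  -> GN
```

Both fragments agree with the start (MPDKPNE) and end (GN) of the listed protein sequence; the final TGA is a stop codon and is not represented in the protein.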
Homology
BLAST of ClCG01G011330 vs. NCBI nr
Match: KAA0060484.1 (Gag protease polyprotein-like protein [Cucumis melo var. makuwa] >TYK18569.1 Gag protease polyprotein-like protein [Cucumis melo var. makuwa])

HSP 1 Score: 123.2 bits (308), Expect = 2.8e-24
Identity = 76/225 (33.78%), Positives = 108/225 (48.00%), Query Frame = 0

Query: 34  SAETAQGDPERKYEIKRFKALEAQVFEGITDPEDVE------------------------ 93
           S E+ Q DPE+KY I+R KAL A  F G T+P D E                        
Sbjct: 44  SGESTQSDPEKKYGIERLKALGATTFAGTTNPADAEAWLTLIEKCFRVTRCPEDRKVELA 103

Query: 94  -----KGVEDWWKLIENRGNDADPLTW--------------------------IGARDMT 153
                 G EDWW++ E+R      ++W                          +    MT
Sbjct: 104 AFLLQNGAEDWWRMEESRRRTTGDISWNEFKKAFFDKFYPRSFRDAKRNEFLRLTQGSMT 163

Query: 154 VAEYEKKFTKLAKYALTMIVDEADKCKCFVLGLHSKIRTPVTTSIVWTNFARMIEAAMRA 204
           +AEYEKK+T+L+ YA  +I DE ++CK F  GL  +IRTPVT    W +F++++EAA+R 
Sbjct: 164 IAEYEKKYTELSMYATRVIEDEVERCKRFEEGLREEIRTPVTACADWNDFSKLVEAALRV 223
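
The percentages in an HSP summary line such as the one for this hit are the match counts divided by the alignment length. The sketch below parses a line of that shape and recomputes the percentages. The regular expression and the function name `parse_hsp_stats` are assumptions made for illustration, not part of any BLAST tool, and the label is read generically so its spelling does not matter.

```python
import re

# Matches "<label> = <count>/<aligned length> (<percent>%)".
HSP_STAT = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def parse_hsp_stats(line):
    """Return {label: (matches, aligned_length, reported_percent)}."""
    return {
        label.lower(): (int(count), int(length), float(percent))
        for label, count, length, percent in HSP_STAT.findall(line)
    }

line = "Identity = 76/225 (33.78%), Positives = 108/225 (48.00%), Query Frame = 0"
stats = parse_hsp_stats(line)
for label, (count, length, reported) in stats.items():
    recomputed = round(100 * count / length, 2)
    print(f"{label}: {count}/{length} reported={reported} recomputed={recomputed}")
```

For this hit the recomputed values (33.78 and 48.0) match the reported ones, which confirms that 225 is the alignment length including gaps rather than the length of the query.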

BLAST of ClCG01G011330 vs. NCBI nr
Match: KAA0036813.1 (DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa])

HSP 1 Score: 122.5 bits (306), Expect = 4.8e-24
Identity = 75/196 (38.27%), Positives = 102/196 (52.04%), Query Frame = 0

Query: 37  TAQGDPERKYEIKRFKALEAQVFEGITDPEDVE--------------------------- 96
           +AQ DPE+KY  +R KAL A  F G T+P DVE                           
Sbjct: 59  SAQSDPEKKYGFERLKALGATTFAGTTNPTDVEAWLTLIEKCFRVTRYLEDRKVELAAFL 118

Query: 97  --KGVEDWWKLIENRGNDADPLTWIGARDMTVAEYEKKFTKLAKYALTMIVDEADKCKCF 156
                EDWW++ E+R              MTVAEYEKK+T+L+KYA  +IVDE ++CK F
Sbjct: 119 LQNDAEDWWRMEESRRRTTG--------TMTVAEYEKKYTELSKYATRVIVDEGERCKRF 178

Query: 157 VLGLHSKIRTPVTTSIVWTNFARMIEAAMRAKKSITGGKPEREGNKGDYRTGQALGTSRE 204
             GL  +IRTPVT    W +F++++E A+R +KS+   K ERE +K       ++  +R 
Sbjct: 179 EEGLREEIRTPVTACADWNDFSKLVEVALRVEKSLNERKREREASKNLRTFSSSMHRNRP 238

BLAST of ClCG01G011330 vs. NCBI nr
Match: TYJ95881.1 (retrotransposon protein, putative, Ty3-gypsy subclass [Cucumis melo var. makuwa])

HSP 1 Score: 120.2 bits (300), Expect = 2.4e-23
Identity = 76/222 (34.23%), Positives = 106/222 (47.75%), Query Frame = 0

Query: 37  TAQGDPERKYEIKRFKALEAQVFEGITDPEDVE--------------------------- 96
           +AQ DPE+KY  +R KAL A  F G T+P DVE                           
Sbjct: 59  SAQSDPEKKYGFERLKALGATTFAGTTNPTDVEAWLTLIEKCFRVTRYLEDRKVELAAFL 118

Query: 97  --KGVEDWWKLIENRGNDADPLTW--------------------------IGARDMTVAE 156
                EDWW++ E+R      ++W                          +    MTVAE
Sbjct: 119 LQNDAEDWWRMEESRRRTTGDMSWDEFKKAFFDKFYPRSFRDAKHNEFVRLTQGTMTVAE 178

Query: 157 YEKKFTKLAKYALTMIVDEADKCKCFVLGLHSKIRTPVTTSIVWTNFARMIEAAMRAKKS 204
           YEKK+T+L+KYA  +IVDE ++CK F  GL  +IRTPVT    W +F++++E A+R +KS
Sbjct: 179 YEKKYTELSKYATRVIVDEGERCKRFEEGLREEIRTPVTACADWNDFSKLVEVALRVEKS 238

BLAST of ClCG01G011330 vs. NCBI nr
Match: KAA0035225.1 (DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] >TYK21839.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa])

HSP 1 Score: 115.9 bits (289), Expect = 4.5e-22
Identity = 68/171 (39.77%), Positives = 93/171 (54.39%), Query Frame = 0

Query: 37  TAQGDPERKYEIKRFKALEAQVFEGITDPEDVE--------------------------- 96
           +AQ DPE+KY I+R KAL A  F G T+P D E                           
Sbjct: 78  SAQSDPEKKYGIERLKALGATTFVGTTNPADAEAWLTLIEKCFRVTRCPEDRKVELASFL 137

Query: 97  -------KGVEDWWKLIENRGNDADPLTWIGARDMTVAEYEKKFTKLAKYALTMIVDEAD 156
                  KG EDWW++ E+R      +     R MTV +YEKK+T+L+KYA  +I DE +
Sbjct: 138 LQNGGGAKG-EDWWRMEESRRRITSDI-----RSMTVTKYEKKYTELSKYATRVIEDEVE 197

Query: 157 KCKCFVLGLHSKIRTPVTTSIVWTNFARMIEAAMRAKKSITGGKPEREGNK 174
           +CK F  GL  +IRTPVTT   W +F++++EAA+R +KS+   K ERE +K
Sbjct: 198 RCKRFEEGLQEEIRTPVTTCADWNDFSKLVEAALRVEKSLNERKRERETSK 242

BLAST of ClCG01G011330 vs. NCBI nr
Match: TYK15233.1 (uncharacterized protein E5676_scaffold892G00030 [Cucumis melo var. makuwa])

HSP 1 Score: 114.4 bits (285), Expect = 1.3e-21
Identity = 76/223 (34.08%), Positives = 105/223 (47.09%), Query Frame = 0

Query: 34  SAETAQGDPERKYEIKRFKALEAQVFEGITDPEDVE------------------------ 93
           S E+AQ DP++KY I+R KAL A  F G T+P DVE                        
Sbjct: 174 SGESAQSDPKKKYGIERLKALGATTFAGTTNPTDVEAWLTLIEKCFRVTRCPEDRKVELA 233

Query: 94  -----KGVEDWWKLIENRGNDADPLTW--------------------------IGARDMT 153
                 G EDWW++ E+R      ++W                          +    MT
Sbjct: 234 AFLLQNGAEDWWRMEESRRRTTGDISWDEFKKAFFDKFYPRSFRDAKRNEFLRLTQGSMT 293

Query: 154 VAEYEKKFTKLAKYALTMIVDEADKCKCFVLGLHSKIRTPVTTSIVWTNFARMIEAAMRA 202
           VAEYEKK+T+L+KYA  +I DE ++ K F  GL  +IRT VT    W +F++++EAA+R 
Sbjct: 294 VAEYEKKYTELSKYATRVIEDEVERYKRFEEGLREEIRTSVTACADWNDFSKLVEAALRV 353

BLAST of ClCG01G011330 vs. ExPASy TrEMBL
Match: A0A5A7UZM6 (Gag protease polyprotein-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G00750 PE=4 SV=1)

HSP 1 Score: 123.2 bits (308), Expect = 1.4e-24
Identity = 76/225 (33.78%), Positives = 108/225 (48.00%), Query Frame = 0

Query: 34  SAETAQGDPERKYEIKRFKALEAQVFEGITDPEDVE------------------------ 93
           S E+ Q DPE+KY I+R KAL A  F G T+P D E                        
Sbjct: 44  SGESTQSDPEKKYGIERLKALGATTFAGTTNPADAEAWLTLIEKCFRVTRCPEDRKVELA 103

Query: 94  -----KGVEDWWKLIENRGNDADPLTW--------------------------IGARDMT 153
                 G EDWW++ E+R      ++W                          +    MT
Sbjct: 104 AFLLQNGAEDWWRMEESRRRTTGDISWNEFKKAFFDKFYPRSFRDAKRNEFLRLTQGSMT 163

Query: 154 VAEYEKKFTKLAKYALTMIVDEADKCKCFVLGLHSKIRTPVTTSIVWTNFARMIEAAMRA 204
           +AEYEKK+T+L+ YA  +I DE ++CK F  GL  +IRTPVT    W +F++++EAA+R 
Sbjct: 164 IAEYEKKYTELSMYATRVIEDEVERCKRFEEGLREEIRTPVTACADWNDFSKLVEAALRV 223

BLAST of ClCG01G011330 vs. ExPASy TrEMBL
Match: A0A5A7T1M0 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold20G001070 PE=4 SV=1)

HSP 1 Score: 122.5 bits (306), Expect = 2.3e-24
Identity = 75/196 (38.27%), Positives = 102/196 (52.04%), Query Frame = 0

Query: 37  TAQGDPERKYEIKRFKALEAQVFEGITDPEDVE--------------------------- 96
           +AQ DPE+KY  +R KAL A  F G T+P DVE                           
Sbjct: 59  SAQSDPEKKYGFERLKALGATTFAGTTNPTDVEAWLTLIEKCFRVTRYLEDRKVELAAFL 118

Query: 97  --KGVEDWWKLIENRGNDADPLTWIGARDMTVAEYEKKFTKLAKYALTMIVDEADKCKCF 156
                EDWW++ E+R              MTVAEYEKK+T+L+KYA  +IVDE ++CK F
Sbjct: 119 LQNDAEDWWRMEESRRRTTG--------TMTVAEYEKKYTELSKYATRVIVDEGERCKRF 178

Query: 157 VLGLHSKIRTPVTTSIVWTNFARMIEAAMRAKKSITGGKPEREGNKGDYRTGQALGTSRE 204
             GL  +IRTPVT    W +F++++E A+R +KS+   K ERE +K       ++  +R 
Sbjct: 179 EEGLREEIRTPVTACADWNDFSKLVEVALRVEKSLNERKREREASKNLRTFSSSMHRNRP 238

BLAST of ClCG01G011330 vs. ExPASy TrEMBL
Match: A0A5D3DES5 (DNA/RNA polymerases superfamily protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold991G00660 PE=4 SV=1)

HSP 1 Score: 115.9 bits (289), Expect = 2.2e-22
Identity = 68/171 (39.77%), Positives = 93/171 (54.39%), Query Frame = 0

Query: 37  TAQGDPERKYEIKRFKALEAQVFEGITDPEDVE--------------------------- 96
           +AQ DPE+KY I+R KAL A  F G T+P D E                           
Sbjct: 78  SAQSDPEKKYGIERLKALGATTFVGTTNPADAEAWLTLIEKCFRVTRCPEDRKVELASFL 137

Query: 97  -------KGVEDWWKLIENRGNDADPLTWIGARDMTVAEYEKKFTKLAKYALTMIVDEAD 156
                  KG EDWW++ E+R      +     R MTV +YEKK+T+L+KYA  +I DE +
Sbjct: 138 LQNGGGAKG-EDWWRMEESRRRITSDI-----RSMTVTKYEKKYTELSKYATRVIEDEVE 197

Query: 157 KCKCFVLGLHSKIRTPVTTSIVWTNFARMIEAAMRAKKSITGGKPEREGNK 174
           +CK F  GL  +IRTPVTT   W +F++++EAA+R +KS+   K ERE +K
Sbjct: 198 RCKRFEEGLQEEIRTPVTTCADWNDFSKLVEAALRVEKSLNERKRERETSK 242

BLAST of ClCG01G011330 vs. ExPASy TrEMBL
Match: A0A5A7TBS0 (CCHC-type domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold64G002900 PE=4 SV=1)

HSP 1 Score: 114.4 bits (285), Expect = 6.4e-22
Identity = 76/223 (34.08%), Positives = 105/223 (47.09%), Query Frame = 0

Query: 34  SAETAQGDPERKYEIKRFKALEAQVFEGITDPEDVE------------------------ 93
           S E+AQ DP++KY I+R KAL A  F G T+P DVE                        
Sbjct: 138 SGESAQSDPKKKYGIERLKALGATTFAGTTNPTDVEAWLTLIEKCFRVTRCPEDRKVELA 197

Query: 94  -----KGVEDWWKLIENRGNDADPLTW--------------------------IGARDMT 153
                 G EDWW++ E+R      ++W                          +    MT
Sbjct: 198 AFLLQNGAEDWWRMEESRRRTTGDISWDEFKKAFFDKFYPRSFRDAKRNEFLRLTQGSMT 257

Query: 154 VAEYEKKFTKLAKYALTMIVDEADKCKCFVLGLHSKIRTPVTTSIVWTNFARMIEAAMRA 202
           VAEYEKK+T+L+KYA  +I DE ++ K F  GL  +IRT VT    W +F++++EAA+R 
Sbjct: 258 VAEYEKKYTELSKYATRVIEDEVERYKRFEEGLREEIRTSVTACADWNDFSKLVEAALRV 317

BLAST of ClCG01G011330 vs. ExPASy TrEMBL
Match: A0A5D3CTK6 (CCHC-type domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold892G00030 PE=4 SV=1)

HSP 1 Score: 114.4 bits (285), Expect = 6.4e-22
Identity = 76/223 (34.08%), Positives = 105/223 (47.09%), Query Frame = 0

Query: 34  SAETAQGDPERKYEIKRFKALEAQVFEGITDPEDVE------------------------ 93
           S E+AQ DP++KY I+R KAL A  F G T+P DVE                        
Sbjct: 174 SGESAQSDPKKKYGIERLKALGATTFAGTTNPTDVEAWLTLIEKCFRVTRCPEDRKVELA 233

Query: 94  -----KGVEDWWKLIENRGNDADPLTW--------------------------IGARDMT 153
                 G EDWW++ E+R      ++W                          +    MT
Sbjct: 234 AFLLQNGAEDWWRMEESRRRTTGDISWDEFKKAFFDKFYPRSFRDAKRNEFLRLTQGSMT 293

Query: 154 VAEYEKKFTKLAKYALTMIVDEADKCKCFVLGLHSKIRTPVTTSIVWTNFARMIEAAMRA 202
           VAEYEKK+T+L+KYA  +I DE ++ K F  GL  +IRT VT    W +F++++EAA+R 
Sbjct: 294 VAEYEKKYTELSKYATRVIEDEVERYKRFEEGLREEIRTSVTACADWNDFSKLVEAALRV 353

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value   Identity (%)   Description
KAA0060484.1    2.8e-24   33.78          Gag protease polyprotein-like protein [Cucumis melo var. makuwa] >TYK18569.1 Gag...
KAA0036813.1    4.8e-24   38.27          DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]
TYJ95881.1      2.4e-23   34.23          retrotransposon protein, putative, Ty3-gypsy subclass [Cucumis melo var. makuwa]
KAA0035225.1    4.5e-22   39.77          DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] >TYK21839.1 D...
TYK15233.1      1.3e-21   34.08          uncharacterized protein E5676_scaffold892G00030 [Cucumis melo var. makuwa]

ExPASy TrEMBL
Match Name      E-value   Identity (%)   Description
A0A5A7UZM6      1.4e-24   33.78          Gag protease polyprotein-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=...
A0A5A7T1M0      2.3e-24   38.27          Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold20...
A0A5D3DES5      2.2e-22   39.77          DNA/RNA polymerases superfamily protein OS=Cucumis melo var. makuwa OX=1194695 G...
A0A5A7TBS0      6.4e-22   34.08          CCHC-type domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6...
A0A5D3CTK6      6.4e-22   34.08          CCHC-type domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5...

InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term   IPR Description    Source        Source Term   Source Description    Alignment
None       No IPR available   MOBIDB_LITE   mobidb-lite   disorder_prediction   coord: 160..194

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
ClCG01G011330.1   ClCG01G011330.1   mRNA