ClCG04G004140 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG04G004140
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Retrovirus-related Pol polyprotein from transposon 17.6
Location: CG_Chr04: 15770887 .. 15771855 (-)
RNA-Seq Expression: ClCG04G004140
Synteny: ClCG04G004140
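
The Location field gives a start and end coordinate on CG_Chr04 plus a strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page), the span of the feature can be computed as follows. The function name `span_length` is hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field of this record.
gene_span = span_length(15770887, 15771855)
print(gene_span)
```

Under a 0-based, half-open convention the result would be one base shorter, so the assumption matters when comparing against the gene sequence length.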
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACATTCCATAAGGAAATTTTGGAGTTTCAAAAAGAAACTTCAATTGGTATGCAACAGTTGGCAAACCAAATTACTCAATTGGCCACCACGGTGAACAAGCTTGACACCCAAAGTGGTAAAATCTCAACCCAACCTGAGCCCCATGTGAGGAATGTAACTGTTGTAACTACTGTAAATCCATTTGATTTCCCTGAATCCTCTGAATTTATGAACTTTGTGACTAATAAAGAATCCAATGAATCATTTGGTTTTGATAAGAAATGACATAGCCCCGAACACCCACGAGCCTTCAACAGGTTAGTACTGATCCATTGAATCAGTCTGCTTTGAATGCTTACATTATGTTGCCCCTTTTCCAAGAAGATTCCTCAATCCTAAGAGGAATGAAGAAAAAGTACTCACTCAAACCCTTGGAATTGTGACTGCTAGCTTCCCACAGTGGGTCAAAAAGGAGGAAGTAATACATATAGTGCAGCAACCATGGGCACTCCAAGTAAACTCTCAGTGAAACACGACAATCCAAGTATGTTTTTCTTACCTTGCAAAATAGGTAAGAAGAAGTTGTTAGATGCAATAATTGACCTAGGAGCTTCTGTGAACATCATACATACATCGTTGTTTTATGAGCTTAGACTGTTAGATCTTTGTAGTACAAATGTGGTGGTTCAACTAGCAAACCCTAGTTTAACTAAGCCATTAGGATTTTTAAAAGATGTGTATGTGAATGTAGATGACTTAGTTTTCCTTGTTGATTTCTATGTCCTGAATTAATCTTGCTTTAAAAAAGATTGATGTAGGAGAAGGGTCCCTATCTATGGAATTTAATGGGGATGTTAAAAATTCTTTGTCTTGGATAATTTTGCTTCCATTAATTCTTTTGATTCCTGTCAGTTACTACCTTTGCAGGATAATGGTTCAGTAAATTCTGGAGAGATGGATGGAAAGGAGGAGTCGATCAACTCCTAG

mRNA sequence

ATGACATTCCATAAGGAAATTTTGGAGTTTCAAAAAGAAACTTCAATTGGTATGCAACAGTTGGCAAACCAAATTACTCAATTGGCCACCACGGTGAACAAGCTTGACACCCAAAGTGGTAAAATCTCAACCCAACCTGAGCCCCATGTGAGGAATGTAACTGTTGTAACTACTGTAAATCCATTTGATTTCCCTGAATCCTCTGAATTTATGAACTTTCTTAGACTGTTAGATCTTTGTAGTACAAATGTGGTGGTTCAACTAGCAAACCCTAGTTTAACTAAGCCATTAGGATTTTTAAAAGATGTGTATGTGAATGTAGATGACTTAGTTTTCCTTGTTGATTTCTATTTACTACCTTTGCAGGATAATGGTTCAGTAAATTCTGGAGAGATGGATGGAAAGGAGGAGTCGATCAACTCCTAG

Coding sequence (CDS)

ATGACATTCCATAAGGAAATTTTGGAGTTTCAAAAAGAAACTTCAATTGGTATGCAACAGTTGGCAAACCAAATTACTCAATTGGCCACCACGGTGAACAAGCTTGACACCCAAAGTGGTAAAATCTCAACCCAACCTGAGCCCCATGTGAGGAATGTAACTGTTGTAACTACTGTAAATCCATTTGATTTCCCTGAATCCTCTGAATTTATGAACTTTCTTAGACTGTTAGATCTTTGTAGTACAAATGTGGTGGTTCAACTAGCAAACCCTAGTTTAACTAAGCCATTAGGATTTTTAAAAGATGTGTATGTGAATGTAGATGACTTAGTTTTCCTTGTTGATTTCTATTTACTACCTTTGCAGGATAATGGTTCAGTAAATTCTGGAGAGATGGATGGAAAGGAGGAGTCGATCAACTCCTAG

Protein sequence

MTFHKEILEFQKETSIGMQQLANQITQLATTVNKLDTQSGKISTQPEPHVRNVTVVTTVNPFDFPESSEFMNFLRLLDLCSTNVVVQLANPSLTKPLGFLKDVYVNVDDLVFLVDFYLLPLQDNGSVNSGEMDGKEESINS
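
The protein sequence above should correspond to a translation of the CDS. The sketch below shows one way to check this, assuming the standard genetic code (NCBI translation table 1); the page does not say which table the annotation pipeline used. The names `CODON_TABLE` and `translate` are illustrative, not part of any database API.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS listed in this record.
print(translate("ATGACATTCCATAAGGAAATTTTGGAG"))
```

Passing the full CDS string from this record to `translate` and comparing the result with the listed protein sequence is a quick consistency check.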
Homology
BLAST of ClCG04G004140 vs. NCBI nr
Match: RDY06300.1 (hypothetical protein CR513_09736, partial [Mucuna pruriens])

HSP 1 Score: 72.8 bits (177), Expect = 2.7e-09
Identity = 52/139 (37.41%), Positives = 77/139 (55.40%), Query Frame = 0

Query: 2   TFHKEILEFQKETSIGMQQLANQITQLATTVNKL-DTQSGKISTQPEPHVRNVTVV---T 61
           T +   L+FQ+  S  +Q    Q++QLAT++++L  T+SG + +Q  P+ R    V   T
Sbjct: 32  TMNSSNLQFQQNMSAIVQDFKMQVSQLATSISQLQSTESGNLPSQTIPNPRGNASVHHST 91

Query: 62  TVN--PF-----DFPESSEFM-----NFLRLLDLCSTNVVVQLANPSLTKPLGFLKDVYV 121
           T+   PF     D   S  FM       L   DL  T + +QLAN S+ +PL  L+DV V
Sbjct: 92  TIGDCPFADAMLDLGASINFMLTSIYKSLNCGDLEPTGITIQLANRSVVQPLSVLEDVLV 151

Query: 122 NVDDLVFLVDFYLLPLQDN 125
            VD L+FL DFY+L ++D+
Sbjct: 152 QVDKLIFLADFYVLDMEDD 170
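
Each HSP header reports identities and positives as a count over the alignment length, followed by a percentage. A minimal sketch of how those percentages follow from the counts is given below; the helper name `percent` is hypothetical, and the two-decimal formatting is simply chosen to match what this page displays.

```python
def percent(matches: int, aligned: int) -> str:
    """Return matches/aligned as a percentage string with two decimals."""
    if aligned <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100 * matches / aligned:.2f}"


# Counts taken from the Mucuna pruriens HSP above.
print(percent(52, 139))  # identical residues
print(percent(77, 139))  # identical plus conservative substitutions
```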

BLAST of ClCG04G004140 vs. NCBI nr
Match: XP_012453231.1 (PREDICTED: uncharacterized protein LOC105775248 [Gossypium raimondii])

HSP 1 Score: 71.6 bits (174), Expect = 6.0e-09
Identity = 48/160 (30.00%), Positives = 81/160 (50.62%), Query Frame = 0

Query: 9   EFQKETSIGMQQLANQITQLATTVNKLDTQSGKISTQPEPHVR-NVTVVTTVNPFDFPES 68
           ++Q+ T   +Q+L NQ+++L+  VN+L++Q GK+ +Q EP+ R NV+ +T  +     +S
Sbjct: 10  KYQQRTDASIQELTNQVSKLSMAVNRLESQ-GKLPSQTEPNPRLNVSAITLRSEKVLDKS 69

Query: 69  SEFMNFLRLLDLC-----------------------------------STNVVVQLANPS 128
               NF R   L                                     T V++QLA+ S
Sbjct: 70  LAMQNFSRNCALARGGNVGIKKAMCDLGASINVMSYHIYKLINVGHLKKTGVIIQLADRS 129

Query: 129 LTKPLGFLKDVYVNVDDLVFLVDFYLLPLQDNGSVNSGEM 133
           +  P G L+DV V V++LVF  DFY++ ++D+ S NS ++
Sbjct: 130 VIYPEGLLEDVLVKVNELVFPADFYIINMEDDNSTNSSDI 168

BLAST of ClCG04G004140 vs. NCBI nr
Match: XP_027109017.1 (uncharacterized protein LOC113728855 [Coffea arabica])

HSP 1 Score: 62.8 bits (151), Expect = 2.8e-06
Identity = 44/142 (30.99%), Positives = 72/142 (50.70%), Query Frame = 0

Query: 9   EFQKETSIGMQQLANQITQLATTVNKLDTQS-GKISTQPEPHVRNVTVVT------TVNP 68
           +FQ++T  GM+ +  +I+Q+AT +N+L++ + GK+ +QPE + RNV           VN 
Sbjct: 433 QFQQDTKAGMKDMEARISQMATAINRLESHAYGKLPSQPEVNPRNVLKYAKFLKDLCVNK 492

Query: 69  FDFPESSEFM---NFLRLLD------------LCSTNVVVQLANPSLTKPLGFLKDVYVN 128
                    M   N   +L             L  T +++QLA+ +   P G ++DV V 
Sbjct: 493 RKLRGDERVMVGENVSAVLQRKLPPKCGDPGPLKETGIIIQLADRTCAYPDGIVEDVLVQ 552

BLAST of ClCG04G004140 vs. NCBI nr
Match: XP_027109421.1 (uncharacterized protein LOC113729311 [Coffea arabica])

HSP 1 Score: 62.8 bits (151), Expect = 2.8e-06
Identity = 44/142 (30.99%), Positives = 72/142 (50.70%), Query Frame = 0

Query: 9   EFQKETSIGMQQLANQITQLATTVNKLDTQS-GKISTQPEPHVRNVTVVT------TVNP 68
           +FQ++T  GM+ +  +I+Q+AT +N+L++ + GK+ +QPE + RNV           VN 
Sbjct: 440 QFQQDTKAGMKDMEARISQMATAINRLESHAYGKLPSQPEVNPRNVPKYAKFLKDLCVNK 499

Query: 69  FDFPESSEFM---NFLRLLD------------LCSTNVVVQLANPSLTKPLGFLKDVYVN 128
                    M   N   +L             L  T +++QLA+ +   P G ++DV V 
Sbjct: 500 RKLRGDERVMVGENVSAVLQRKLPPKCGDPGPLKETGIIIQLADRTCAYPDGIVEDVLVQ 559

BLAST of ClCG04G004140 vs. NCBI nr
Match: XP_027082556.1 (uncharacterized protein LOC113704887 [Coffea arabica])

HSP 1 Score: 62.8 bits (151), Expect = 2.8e-06
Identity = 44/142 (30.99%), Positives = 72/142 (50.70%), Query Frame = 0

Query: 9   EFQKETSIGMQQLANQITQLATTVNKLDTQS-GKISTQPEPHVRNVTVVT------TVNP 68
           +FQ++T  GM+ +  +I+Q+AT +N+L++ + GK+ +QPE + RNV           VN 
Sbjct: 440 QFQQDTKAGMKDMEARISQMATAINRLESHAYGKLPSQPEVNPRNVPKYAKFLKDLCVNK 499

Query: 69  FDFPESSEFM---NFLRLLD------------LCSTNVVVQLANPSLTKPLGFLKDVYVN 128
                    M   N   +L             L  T +++QLA+ +   P G ++DV V 
Sbjct: 500 RKLRGDERVMVGENVSAVLQRKLPPKCGDPGPLKETGIIIQLADRTCAYPDGIVEDVLVQ 559

BLAST of ClCG04G004140 vs. ExPASy TrEMBL
Match: A0A371HU36 (Uncharacterized protein (Fragment) OS=Mucuna pruriens OX=157652 GN=CR513_09736 PE=4 SV=1)

HSP 1 Score: 72.8 bits (177), Expect = 1.3e-09
Identity = 52/139 (37.41%), Positives = 77/139 (55.40%), Query Frame = 0

Query: 2   TFHKEILEFQKETSIGMQQLANQITQLATTVNKL-DTQSGKISTQPEPHVRNVTVV---T 61
           T +   L+FQ+  S  +Q    Q++QLAT++++L  T+SG + +Q  P+ R    V   T
Sbjct: 32  TMNSSNLQFQQNMSAIVQDFKMQVSQLATSISQLQSTESGNLPSQTIPNPRGNASVHHST 91

Query: 62  TVN--PF-----DFPESSEFM-----NFLRLLDLCSTNVVVQLANPSLTKPLGFLKDVYV 121
           T+   PF     D   S  FM       L   DL  T + +QLAN S+ +PL  L+DV V
Sbjct: 92  TIGDCPFADAMLDLGASINFMLTSIYKSLNCGDLEPTGITIQLANRSVVQPLSVLEDVLV 151

Query: 122 NVDDLVFLVDFYLLPLQDN 125
            VD L+FL DFY+L ++D+
Sbjct: 152 QVDKLIFLADFYVLDMEDD 170

BLAST of ClCG04G004140 vs. ExPASy TrEMBL
Match: A0A6P6TXT1 (uncharacterized protein LOC113704887 OS=Coffea arabica OX=13443 GN=LOC113704887 PE=4 SV=1)

HSP 1 Score: 62.8 bits (151), Expect = 1.3e-06
Identity = 44/142 (30.99%), Positives = 72/142 (50.70%), Query Frame = 0

Query: 9   EFQKETSIGMQQLANQITQLATTVNKLDTQS-GKISTQPEPHVRNVTVVT------TVNP 68
           +FQ++T  GM+ +  +I+Q+AT +N+L++ + GK+ +QPE + RNV           VN 
Sbjct: 440 QFQQDTKAGMKDMEARISQMATAINRLESHAYGKLPSQPEVNPRNVPKYAKFLKDLCVNK 499

Query: 69  FDFPESSEFM---NFLRLLD------------LCSTNVVVQLANPSLTKPLGFLKDVYVN 128
                    M   N   +L             L  T +++QLA+ +   P G ++DV V 
Sbjct: 500 RKLRGDERVMVGENVSAVLQRKLPPKCGDPGPLKETGIIIQLADRTCAYPDGIVEDVLVQ 559

BLAST of ClCG04G004140 vs. ExPASy TrEMBL
Match: A0A6P6W4E6 (uncharacterized protein LOC113728855 OS=Coffea arabica OX=13443 GN=LOC113728855 PE=4 SV=1)

HSP 1 Score: 62.8 bits (151), Expect = 1.3e-06
Identity = 44/142 (30.99%), Positives = 72/142 (50.70%), Query Frame = 0

Query: 9   EFQKETSIGMQQLANQITQLATTVNKLDTQS-GKISTQPEPHVRNVTVVT------TVNP 68
           +FQ++T  GM+ +  +I+Q+AT +N+L++ + GK+ +QPE + RNV           VN 
Sbjct: 433 QFQQDTKAGMKDMEARISQMATAINRLESHAYGKLPSQPEVNPRNVLKYAKFLKDLCVNK 492

Query: 69  FDFPESSEFM---NFLRLLD------------LCSTNVVVQLANPSLTKPLGFLKDVYVN 128
                    M   N   +L             L  T +++QLA+ +   P G ++DV V 
Sbjct: 493 RKLRGDERVMVGENVSAVLQRKLPPKCGDPGPLKETGIIIQLADRTCAYPDGIVEDVLVQ 552

BLAST of ClCG04G004140 vs. ExPASy TrEMBL
Match: A0A6P6S520 (uncharacterized protein LOC113687487 OS=Coffea arabica OX=13443 GN=LOC113687487 PE=4 SV=1)

HSP 1 Score: 62.8 bits (151), Expect = 1.3e-06
Identity = 44/142 (30.99%), Positives = 72/142 (50.70%), Query Frame = 0

Query: 9   EFQKETSIGMQQLANQITQLATTVNKLDTQS-GKISTQPEPHVRNVTVVT------TVNP 68
           +FQ++T  GM+ +  +I+Q+AT +N+L++ + GK+ +QPE + RNV           VN 
Sbjct: 440 QFQQDTKAGMKDMEARISQMATAINRLESHAYGKLPSQPEVNPRNVPKYAKFLKDLCVNK 499

Query: 69  FDFPESSEFM---NFLRLLD------------LCSTNVVVQLANPSLTKPLGFLKDVYVN 128
                    M   N   +L             L  T +++QLA+ +   P G ++DV V 
Sbjct: 500 RKLRGDERVMVGENVSAVLQRKLPPKCGDPGPLKETGIIIQLADRTCAYPDGIVEDVLVQ 559

BLAST of ClCG04G004140 vs. ExPASy TrEMBL
Match: A0A6P6W4D6 (uncharacterized protein LOC113729311 OS=Coffea arabica OX=13443 GN=LOC113729311 PE=4 SV=1)

HSP 1 Score: 62.8 bits (151), Expect = 1.3e-06
Identity = 44/142 (30.99%), Positives = 72/142 (50.70%), Query Frame = 0

Query: 9   EFQKETSIGMQQLANQITQLATTVNKLDTQS-GKISTQPEPHVRNVTVVT------TVNP 68
           +FQ++T  GM+ +  +I+Q+AT +N+L++ + GK+ +QPE + RNV           VN 
Sbjct: 440 QFQQDTKAGMKDMEARISQMATAINRLESHAYGKLPSQPEVNPRNVPKYAKFLKDLCVNK 499

Query: 69  FDFPESSEFM---NFLRLLD------------LCSTNVVVQLANPSLTKPLGFLKDVYVN 128
                    M   N   +L             L  T +++QLA+ +   P G ++DV V 
Sbjct: 500 RKLRGDERVMVGENVSAVLQRKLPPKCGDPGPLKETGIIIQLADRTCAYPDGIVEDVLVQ 559

The following BLAST results are available for this feature:

NCBI nr
Match Name | E-value | Identity | Description
RDY06300.1 | 2.7e-09 | 37.41 | hypothetical protein CR513_09736, partial [Mucuna pruriens]
XP_012453231.1 | 6.0e-09 | 30.00 | PREDICTED: uncharacterized protein LOC105775248 [Gossypium raimondii]
XP_027109017.1 | 2.8e-06 | 30.99 | uncharacterized protein LOC113728855 [Coffea arabica]
XP_027109421.1 | 2.8e-06 | 30.99 | uncharacterized protein LOC113729311 [Coffea arabica]
XP_027082556.1 | 2.8e-06 | 30.99 | uncharacterized protein LOC113704887 [Coffea arabica]

ExPASy TrEMBL
Match Name | E-value | Identity | Description
A0A371HU36 | 1.3e-09 | 37.41 | Uncharacterized protein (Fragment) OS=Mucuna pruriens OX=157652 GN=CR513_09736 P...
A0A6P6TXT1 | 1.3e-06 | 30.99 | uncharacterized protein LOC113704887 OS=Coffea arabica OX=13443 GN=LOC113704887 ...
A0A6P6W4E6 | 1.3e-06 | 30.99 | uncharacterized protein LOC113728855 OS=Coffea arabica OX=13443 GN=LOC113728855 ...
A0A6P6S520 | 1.3e-06 | 30.99 | uncharacterized protein LOC113687487 OS=Coffea arabica OX=13443 GN=LOC113687487 ...
A0A6P6W4D6 | 1.3e-06 | 30.99 | uncharacterized protein LOC113729311 OS=Coffea arabica OX=13443 GN=LOC113729311 ...

InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 18..38

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
ClCG04G004140.1 | ClCG04G004140.1 | mRNA