Clc11G09065 (gene) Watermelon (cordophanus) v2

Overview
NameClc11G09065
Typegene
OrganismCitrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrotransposon protein
LocationClcChr11: 11285625 .. 11286033 (-)
RNA-Seq ExpressionClc11G09065
SyntenyClc11G09065
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCAGGTAAGCCCACATCTGGAGTCAAGGGTCAGGACCCTGAAGAGACAGTACAACGTGATTGCTGAAATGTTGGGCTCAGGCTGTAGTGGGTTTGGTTGGAATGCAGAGCGCAAATATATTGACTGTGAGGCGAAGATATTTGACATATGGGTCAAGGTAATTTCATTACTTATTTTATTTATTTCTTCTTCTTTTAAATTTGGCTTGGCCATAACATAACACATCTCACCTTTAACAGAGTCATCCGAGTGCGAAAGGATTGTGCCATAAGTCATTTCCGTACTATGACGACTTGACCATAGTATTCGGCAAAGACAGAGCCACAAGGAGTCATGCAACCACCACTGCAGAGGTCGGATCTGAACCTGTTATGGAAGAGAAGAACGAGGACATCCTGAATAACTAG

mRNA sequence

ATGCAGGTAAGCCCACATCTGGAGTCAAGGGTCAGGACCCTGAAGAGACAGTACAACGTGATTGCTGAAATGTTGGGCTCAGGCTGTAGTGGGTTTGGTTGGAATGCAGAGCGCAAATATATTGACTGTGAGGCGAAGATATTTGACATATGGGTCAAGAGTCATCCGAGTGCGAAAGGATTGTGCCATAAGTCATTTCCGTACTATGACGACTTGACCATAGTATTCGGCAAAGACAGAGCCACAAGGAGTCATGCAACCACCACTGCAGAGGTCGGATCTGAACCTGTTATGGAAGAGAAGAACGAGGACATCCTGAATAACTAG

Coding sequence (CDS)

ATGCAGGTAAGCCCACATCTGGAGTCAAGGGTCAGGACCCTGAAGAGACAGTACAACGTGATTGCTGAAATGTTGGGCTCAGGCTGTAGTGGGTTTGGTTGGAATGCAGAGCGCAAATATATTGACTGTGAGGCGAAGATATTTGACATATGGGTCAAGAGTCATCCGAGTGCGAAAGGATTGTGCCATAAGTCATTTCCGTACTATGACGACTTGACCATAGTATTCGGCAAAGACAGAGCCACAAGGAGTCATGCAACCACCACTGCAGAGGTCGGATCTGAACCTGTTATGGAAGAGAAGAACGAGGACATCCTGAATAACTAG

Protein sequence

MQVSPHLESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKGLCHKSFPYYDDLTIVFGKDRATRSHATTTAEVGSEPVMEEKNEDILNN
Homology
BLAST of Clc11G09065 vs. NCBI nr
Match: KAA0064022.1 (retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 121.7 bits (304), Expect = 3.9e-24
Identity = 54/88 (61.36%), Postives = 67/88 (76.14%), Query Frame = 0

Query: 7   LESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKGLCHKSF 66
           ++ R++TLKR +  IAEM G  CSGFGWN E K I  E ++FD WV+SHP+AKGL +KSF
Sbjct: 67  IDCRIKTLKRTFQAIAEMRGPACSGFGWNDEEKCIVAEKELFDNWVRSHPAAKGLLNKSF 126

Query: 67  PYYDDLTIVFGKDRATRSHATTTAEVGS 95
           PYYD+LT VFG+DRAT   A T A+VGS
Sbjct: 127 PYYDELTYVFGRDRATGRFAETFADVGS 154

BLAST of Clc11G09065 vs. NCBI nr
Match: XP_008441954.1 (PREDICTED: uncharacterized protein LOC103485953 [Cucumis melo] >KAA0047736.1 retrotransposon protein [Cucumis melo var. makuwa] >TYK08388.1 retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 120.6 bits (301), Expect = 8.6e-24
Identity = 58/114 (50.88%), Postives = 75/114 (65.79%), Query Frame = 0

Query: 1   MQVSPHLESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKG 60
           +Q S  ++  V++LK+ Y+ IAEM G  CSGFGWN E + I  E  +FD W+KSHP+AKG
Sbjct: 61  IQESSTIDCHVKSLKKTYHAIAEMRGPSCSGFGWNEEFQCIIAERDLFDSWIKSHPAAKG 120

Query: 61  LCHKSFPYYDDLTIVFGKDRATRSHATTTAEVGSE---------PVMEEKNEDI 106
           L HKSFPYYDDL+ VFGKDRAT + + T   VGS          P+ +  +EDI
Sbjct: 121 LLHKSFPYYDDLSYVFGKDRATGARSETFPNVGSNVSNMFNDTIPLGDSHDEDI 174

BLAST of Clc11G09065 vs. NCBI nr
Match: XP_038987004.1 (uncharacterized protein At2g29880-like [Phoenix dactylifera])

HSP 1 Score: 120.6 bits (301), Expect = 8.6e-24
Identity = 53/101 (52.48%), Postives = 72/101 (71.29%), Query Frame = 0

Query: 1   MQVSPHLESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKG 60
           ++ +PH+ESRV+ LK+QYN IAEMLG  CSGFGW+   K + CE   F  WVKSHP+A+G
Sbjct: 71  LKATPHIESRVKLLKKQYNAIAEMLGPNCSGFGWDDINKCVTCEEDTFKEWVKSHPNAQG 130

Query: 61  LCHKSFPYYDDLTIVFGKDRATRSHATTTAEVGSEPVMEEK 102
           + +K FP+++DLT +FG+DRAT   A   A+   E  MEE+
Sbjct: 131 MRNKPFPHFEDLTNIFGRDRATGMGAEAPADAVEEIEMEEQ 171

BLAST of Clc11G09065 vs. NCBI nr
Match: XP_017251089.1 (PREDICTED: uncharacterized protein At2g29880-like [Daucus carota subsp. sativus] >KZM95656.1 hypothetical protein DCAR_018898 [Daucus carota subsp. sativus])

HSP 1 Score: 120.2 bits (300), Expect = 1.1e-23
Identity = 56/102 (54.90%), Postives = 69/102 (67.65%), Query Frame = 0

Query: 1   MQVSPHLESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKG 60
           M+  PH+ESRVR  ++QY  I EM G  CSGFGWN   K I CE  IF+ W+KSHP+AKG
Sbjct: 70  MKARPHIESRVRLWRKQYFAIEEMRGPNCSGFGWNELDKSITCEKSIFEDWLKSHPNAKG 129

Query: 61  LCHKSFPYYDDLTIVFGKDRATRSHATTTAEVGSEPVMEEKN 103
           L +KSFPYYD+L+ VFGKDRA      + A+   E   EE+N
Sbjct: 130 LRNKSFPYYDELSQVFGKDRANGECVESPADAVEEIANEEEN 171

BLAST of Clc11G09065 vs. NCBI nr
Match: KAA0057610.1 (retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 120.2 bits (300), Expect = 1.1e-23
Identity = 60/108 (55.56%), Postives = 74/108 (68.52%), Query Frame = 0

Query: 7   LESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKGLCHKSF 66
           ++ R++TLKR +  IAEM G  CSGFGWN E K I  E ++FD WV+SHP+AKGL +K F
Sbjct: 16  IDCRIKTLKRTFQAIAEMRGPACSGFGWNDEEKCIVAEKELFDNWVRSHPAAKGLLNKPF 75

Query: 67  PYYDDLTIVFGKDRATRSHATTTAEVGS-EP-------VMEEKNEDIL 107
           PYYD+LT VFG+DRAT   A T A+VGS EP        M + NED L
Sbjct: 76  PYYDELTYVFGRDRATGRFAETFADVGSNEPGGGYDRFDMGDGNEDFL 123

BLAST of Clc11G09065 vs. ExPASy TrEMBL
Match: A0A5A7V6Q9 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold99G00200 PE=4 SV=1)

HSP 1 Score: 121.7 bits (304), Expect = 1.9e-24
Identity = 54/88 (61.36%), Postives = 67/88 (76.14%), Query Frame = 0

Query: 7   LESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKGLCHKSF 66
           ++ R++TLKR +  IAEM G  CSGFGWN E K I  E ++FD WV+SHP+AKGL +KSF
Sbjct: 67  IDCRIKTLKRTFQAIAEMRGPACSGFGWNDEEKCIVAEKELFDNWVRSHPAAKGLLNKSF 126

Query: 67  PYYDDLTIVFGKDRATRSHATTTAEVGS 95
           PYYD+LT VFG+DRAT   A T A+VGS
Sbjct: 127 PYYDELTYVFGRDRATGRFAETFADVGS 154

BLAST of Clc11G09065 vs. ExPASy TrEMBL
Match: A0A5A7U0H7 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold648G002060 PE=4 SV=1)

HSP 1 Score: 120.6 bits (301), Expect = 4.2e-24
Identity = 58/114 (50.88%), Postives = 75/114 (65.79%), Query Frame = 0

Query: 1   MQVSPHLESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKG 60
           +Q S  ++  V++LK+ Y+ IAEM G  CSGFGWN E + I  E  +FD W+KSHP+AKG
Sbjct: 61  IQESSTIDCHVKSLKKTYHAIAEMRGPSCSGFGWNEEFQCIIAERDLFDSWIKSHPAAKG 120

Query: 61  LCHKSFPYYDDLTIVFGKDRATRSHATTTAEVGSE---------PVMEEKNEDI 106
           L HKSFPYYDDL+ VFGKDRAT + + T   VGS          P+ +  +EDI
Sbjct: 121 LLHKSFPYYDDLSYVFGKDRATGARSETFPNVGSNVSNMFNDTIPLGDSHDEDI 174

BLAST of Clc11G09065 vs. ExPASy TrEMBL
Match: A0A1S3B4L3 (uncharacterized protein LOC103485953 OS=Cucumis melo OX=3656 GN=LOC103485953 PE=4 SV=1)

HSP 1 Score: 120.6 bits (301), Expect = 4.2e-24
Identity = 58/114 (50.88%), Postives = 75/114 (65.79%), Query Frame = 0

Query: 1   MQVSPHLESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKG 60
           +Q S  ++  V++LK+ Y+ IAEM G  CSGFGWN E + I  E  +FD W+KSHP+AKG
Sbjct: 61  IQESSTIDCHVKSLKKTYHAIAEMRGPSCSGFGWNEEFQCIIAERDLFDSWIKSHPAAKG 120

Query: 61  LCHKSFPYYDDLTIVFGKDRATRSHATTTAEVGSE---------PVMEEKNEDI 106
           L HKSFPYYDDL+ VFGKDRAT + + T   VGS          P+ +  +EDI
Sbjct: 121 LLHKSFPYYDDLSYVFGKDRATGARSETFPNVGSNVSNMFNDTIPLGDSHDEDI 174

BLAST of Clc11G09065 vs. ExPASy TrEMBL
Match: A0A161XV48 (Myb_DNA-bind_3 domain-containing protein OS=Daucus carota subsp. sativus OX=79200 GN=DCAR_018898 PE=4 SV=1)

HSP 1 Score: 120.2 bits (300), Expect = 5.4e-24
Identity = 56/102 (54.90%), Postives = 69/102 (67.65%), Query Frame = 0

Query: 1   MQVSPHLESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKG 60
           M+  PH+ESRVR  ++QY  I EM G  CSGFGWN   K I CE  IF+ W+KSHP+AKG
Sbjct: 70  MKARPHIESRVRLWRKQYFAIEEMRGPNCSGFGWNELDKSITCEKSIFEDWLKSHPNAKG 129

Query: 61  LCHKSFPYYDDLTIVFGKDRATRSHATTTAEVGSEPVMEEKN 103
           L +KSFPYYD+L+ VFGKDRA      + A+   E   EE+N
Sbjct: 130 LRNKSFPYYDELSQVFGKDRANGECVESPADAVEEIANEEEN 171

BLAST of Clc11G09065 vs. ExPASy TrEMBL
Match: A0A5A7UR77 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold497G00880 PE=4 SV=1)

HSP 1 Score: 120.2 bits (300), Expect = 5.4e-24
Identity = 60/108 (55.56%), Postives = 74/108 (68.52%), Query Frame = 0

Query: 7   LESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKGLCHKSF 66
           ++ R++TLKR +  IAEM G  CSGFGWN E K I  E ++FD WV+SHP+AKGL +K F
Sbjct: 16  IDCRIKTLKRTFQAIAEMRGPACSGFGWNDEEKCIVAEKELFDNWVRSHPAAKGLLNKPF 75

Query: 67  PYYDDLTIVFGKDRATRSHATTTAEVGS-EP-------VMEEKNEDIL 107
           PYYD+LT VFG+DRAT   A T A+VGS EP        M + NED L
Sbjct: 76  PYYDELTYVFGRDRATGRFAETFADVGSNEPGGGYDRFDMGDGNEDFL 123

BLAST of Clc11G09065 vs. TAIR 10
Match: AT5G27260.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G29880.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 57.0 bits (136), Expect = 1.1e-08
Identity = 28/81 (34.57%), Postives = 47/81 (58.02%), Query Frame = 0

Query: 6   HLESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKGLCHKS 65
           H  SR++ LK QY    + L    SGFGW+   K      +++  ++K+HP+ K L + +
Sbjct: 69  HYLSRMKYLKIQYQSCLD-LQRFSSGFGWDPLTKRFTASDEVWSDYLKAHPNNKQLRYDT 128

Query: 66  FPYYDDLTIVFGKDRATRSHA 87
           F ++D+L I+FG+  AT  +A
Sbjct: 129 FEFFDELQIIFGEGVATGKNA 148

BLAST of Clc11G09065 vs. TAIR 10
Match: AT1G30140.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G27260.1); Has 313 Blast hits to 256 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 8; Plants - 295; Viruses - 0; Other Eukaryotes - 10 (source: NCBI BLink). )

HSP 1 Score: 52.0 bits (123), Expect = 3.5e-07
Identity = 27/78 (34.62%), Postives = 44/78 (56.41%), Query Frame = 0

Query: 9   SRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKGLCHKSFPY 68
           SR++ LK  Y    + L    SGFGW+ E K      +++  ++K+HP+ K +  +S  +
Sbjct: 68  SRLKFLKNLYQSYLD-LKRFSSGFGWDPETKKFTAPDEVWRDYLKAHPNHKHMQTESIDH 127

Query: 69  YDDLTIVFGKDRATRSHA 87
           ++DL I+FG   AT S A
Sbjct: 128 FEDLQIIFGDVVATGSFA 144

BLAST of Clc11G09065 vs. TAIR 10
Match: AT2G24960.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G02210.2); Has 1453 Blast hits to 509 proteins in 26 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 39; Plants - 1363; Viruses - 0; Other Eukaryotes - 50 (source: NCBI BLink). )

HSP 1 Score: 47.4 bits (111), Expect = 8.6e-06
Identity = 30/104 (28.85%), Postives = 46/104 (44.23%), Query Frame = 0

Query: 7   LESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKGLCHKSF 66
           L  R   L + Y  +  +L     GF W+  R  I  +  ++D ++K HP A+    KS 
Sbjct: 223 LRHRYNKLLKYYKDMEAILKE--DGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSL 282

Query: 67  PYYDDLTIVF------GKDRATRSHATTTAEVGSEPVMEEKNED 105
           P Y+DL  +F      G D      A  T+E  +    +E+N D
Sbjct: 283 PSYNDLDTIFACQAEQGTDHRDDGSAAQTSETKAS---QEQNSD 321

BLAST of Clc11G09065 vs. TAIR 10
Match: AT2G24960.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 12 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G02210.2); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 47.4 bits (111), Expect = 8.6e-06
Identity = 30/104 (28.85%), Postives = 46/104 (44.23%), Query Frame = 0

Query: 7   LESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKGLCHKSF 66
           L  R   L + Y  +  +L     GF W+  R  I  +  ++D ++K HP A+    KS 
Sbjct: 223 LRHRYNKLLKYYKDMEAILKE--DGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSL 282

Query: 67  PYYDDLTIVF------GKDRATRSHATTTAEVGSEPVMEEKNED 105
           P Y+DL  +F      G D      A  T+E  +    +E+N D
Sbjct: 283 PSYNDLDTIFACQAEQGTDHRDDGSAAQTSETKAS---QEQNSD 321

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAA0064022.13.9e-2461.36retrotransposon protein [Cucumis melo var. makuwa][more]
XP_008441954.18.6e-2450.88PREDICTED: uncharacterized protein LOC103485953 [Cucumis melo] >KAA0047736.1 ret... [more]
XP_038987004.18.6e-2452.48uncharacterized protein At2g29880-like [Phoenix dactylifera][more]
XP_017251089.11.1e-2354.90PREDICTED: uncharacterized protein At2g29880-like [Daucus carota subsp. sativus]... [more]
KAA0057610.11.1e-2355.56retrotransposon protein [Cucumis melo var. makuwa][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A5A7V6Q91.9e-2461.36Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold... [more]
A0A5A7U0H74.2e-2450.88Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A1S3B4L34.2e-2450.88uncharacterized protein LOC103485953 OS=Cucumis melo OX=3656 GN=LOC103485953 PE=... [more]
A0A161XV485.4e-2454.90Myb_DNA-bind_3 domain-containing protein OS=Daucus carota subsp. sativus OX=7920... [more]
A0A5A7UR775.4e-2455.56Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold... [more]
Match NameE-valueIdentityDescription
AT5G27260.11.1e-0834.57unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G30140.13.5e-0734.62unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT2G24960.18.6e-0628.85unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT2G24960.28.6e-0628.85unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 86..108
NoneNo IPR availablePANTHERPTHR46250MYB/SANT-LIKE DNA-BINDING DOMAIN PROTEIN-RELATEDcoord: 2..97

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Clc11G09065.1Clc11G09065.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016020 membrane