Cla97C11G215120 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C11G215120
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Retrotransposon protein
Location: Cla97Chr11: 11246756 .. 11247163 (-)
RNA-Seq Expression: Cla97C11G215120
Synteny: Cla97C11G215120
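
The Location field packs the chromosome, start and end coordinates, and strand into one string. The following is a minimal Python sketch of how such a string could be parsed and the gene span derived. The helper name `parse_location` and the regular expression are illustrative assumptions rather than part of the database's tooling, and the coordinates are assumed to be 1-based and inclusive, as in GFF-style annotation.

```python
import re

# Hypothetical pattern for strings such as "Cla97Chr11: 11246756 .. 11247163 (-)".
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into its parts and compute the span length.

    Assumption: start and end are 1-based and inclusive, so the
    length is end - start + 1.
    """
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


loc = parse_location("Cla97Chr11: 11246756 .. 11247163 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the span for this gene comes out at 408 bp on the minus strand.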
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCAGGTAAGCCCACATCTGGAGTCAAGGGTCAGGACCCTGAAGAGACAGTACAACGTGATTGCTGAAATGTTGGGCTCAGGCTGTAGTGGGTTTGGTTGGAATGCAGAGCGCAAATATATTGACTGTGAGGCGAAGATATTTGACATATGGGTCAAGGTAATTTCATTACTTATTTTATTTATTTCTTCTTCTTTTAATTTGGCTTGGCCATAACATAACACATCTCACCTTTAACAGAGTCATCCGAGTGCGAAAGAACTGTGCCATAAGTCATTTCCGTACTATGACGACTTGACCATAGTATTCGGCAAAGACAGAGCCACAAGGAGTCATGCAACCACCACTGCAGAGGTCGGATCTGAACCTGTTATGGAAGAGAAGAACGAGGACATCCTGAATAACTAG

mRNA sequence

ATGCAGGTAAGCCCACATCTGGAGTCAAGGGTCAGGACCCTGAAGAGACAGTACAACGTGATTGCTGAAATGTTGGGCTCAGGCTGTAGTGGGTTTGGTTGGAATGCAGAGCGCAAATATATTGACTGTGAGGCGAAGATATTTGACATATGGGTCAAGAGTCATCCGAGTGCGAAAGAACTGTGCCATAAGTCATTTCCGTACTATGACGACTTGACCATAGTATTCGGCAAAGACAGAGCCACAAGGAGTCATGCAACCACCACTGCAGAGGTCGGATCTGAACCTGTTATGGAAGAGAAGAACGAGGACATCCTGAATAACTAG
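
The gene sequence and the mRNA sequence differ only by the intron that is spliced out. As a rough sketch, the intron can be located by comparing the two sequences from both ends. The function name `infer_single_intron` is hypothetical, and the approach assumes exactly one intron and no other differences. The two strings below are short excerpts around the splice junction, copied from the sequences above.

```python
def infer_single_intron(gene, mrna):
    """Return (start, end, intron) using 0-based, end-exclusive gene offsets.

    Assumption: the mRNA equals the gene sequence with one contiguous
    segment (the intron) removed.
    """
    # Longest common prefix: the first exon, as far as the sequences agree.
    prefix = 0
    while prefix < len(mrna) and gene[prefix] == mrna[prefix]:
        prefix += 1
    # Longest common suffix, capped so the two exons do not overlap in the mRNA.
    suffix = 0
    while (suffix < len(mrna) - prefix
           and gene[len(gene) - 1 - suffix] == mrna[len(mrna) - 1 - suffix]):
        suffix += 1
    if prefix + suffix != len(mrna):
        raise ValueError("sequences differ by more than one removed segment")
    end = len(gene) - suffix
    return prefix, end, gene[prefix:end]


# Excerpts spanning the exon 1 / intron / exon 2 junction.
gene_excerpt = (
    "TGGGTCAAGGTAATTTCATTACTTATTTTATTTATTTCTTCTTCTTTTAATTTGGCTTGG"
    "CCATAACATAACACATCTCACCTTTAACAGAGTCATCCG"
)
mrna_excerpt = "TGGGTCAAGAGTCATCCG"

start, end, intron = infer_single_intron(gene_excerpt, mrna_excerpt)
print(start, end, intron[:2], intron[-2:])
```

When a base is shared across a junction, the inferred boundary can shift by a position or two, so the result is best read as one consistent placement rather than a confirmed splice site. For this excerpt the inferred intron starts with GT and ends with AG, the canonical splice-site pattern.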

Coding sequence (CDS)

ATGCAGGTAAGCCCACATCTGGAGTCAAGGGTCAGGACCCTGAAGAGACAGTACAACGTGATTGCTGAAATGTTGGGCTCAGGCTGTAGTGGGTTTGGTTGGAATGCAGAGCGCAAATATATTGACTGTGAGGCGAAGATATTTGACATATGGGTCAAGAGTCATCCGAGTGCGAAAGAACTGTGCCATAAGTCATTTCCGTACTATGACGACTTGACCATAGTATTCGGCAAAGACAGAGCCACAAGGAGTCATGCAACCACCACTGCAGAGGTCGGATCTGAACCTGTTATGGAAGAGAAGAACGAGGACATCCTGAATAACTAG

Protein sequence

MQVSPHLESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKELCHKSFPYYDDLTIVFGKDRATRSHATTTAEVGSEPVMEEKNEDILNN
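
The protein sequence follows from the coding sequence by reading it codon by codon. The sketch below shows that step in Python, assuming the standard nuclear genetic code. The `translate` helper is written for illustration and is not the pipeline that produced this annotation. The two inputs are the first 24 and the last 21 nucleotides of the CDS above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGCAGGTAAGCCCACATCTGGAG"))  # start of the CDS
print(translate("GAGGACATCCTGAATAACTAG"))     # end of the CDS, including TAG
```

The two fragments translate to MQVSPHLE and EDILNN, which match the first and last residues of the protein sequence shown above.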
Homology
BLAST of Cla97C11G215120 vs. NCBI nr
Match: KAA0064022.1 (retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 118.2 bits (295), Expect = 4.3e-23
Identity = 53/88 (60.23%), Positives = 66/88 (75.00%), Query Frame = 0

Query: 7   LESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKELCHKSF 66
           ++ R++TLKR +  IAEM G  CSGFGWN E K I  E ++FD WV+SHP+AK L +KSF
Sbjct: 67  IDCRIKTLKRTFQAIAEMRGPACSGFGWNDEEKCIVAEKELFDNWVRSHPAAKGLLNKSF 126

Query: 67  PYYDDLTIVFGKDRATRSHATTTAEVGS 95
           PYYD+LT VFG+DRAT   A T A+VGS
Sbjct: 127 PYYDELTYVFGRDRATGRFAETFADVGS 154
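
The Identity and Positives figures in each HSP header are ratios over the alignment length, and they can be recovered from the middle line of the alignment. In standard BLAST output a letter on that line marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The sketch below applies this in Python; the helper names are illustrative assumptions.

```python
def midline_counts(midline):
    """Count identities and positives on a BLAST alignment midline."""
    identities = sum(ch.isalpha() for ch in midline)
    # Positives include identical residues plus conservative substitutions.
    positives = identities + midline.count("+")
    return identities, positives


def percent(part, whole):
    """Format a ratio as a percentage with two decimals, as in the HSP header."""
    return f"{100 * part / whole:.2f}"


# Midline of the second alignment block of the HSP above (28 columns).
block_identities, block_positives = midline_counts("PYYD+LT VFG+DRAT   A T A+VGS")
print(block_identities, block_positives)

# Totals reported for the whole HSP: 53 identities and 66 positives over 88 columns.
print(percent(53, 88), percent(66, 88))
```

For this HSP the header totals give 60.23 and 75.00, matching the percentages in the header line.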

BLAST of Cla97C11G215120 vs. NCBI nr
Match: XP_038987004.1 (uncharacterized protein At2g29880-like [Phoenix dactylifera])

HSP 1 Score: 117.1 bits (292), Expect = 9.5e-23
Identity = 52/101 (51.49%), Positives = 71/101 (70.30%), Query Frame = 0

Query: 1   MQVSPHLESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKE 60
           ++ +PH+ESRV+ LK+QYN IAEMLG  CSGFGW+   K + CE   F  WVKSHP+A+ 
Sbjct: 71  LKATPHIESRVKLLKKQYNAIAEMLGPNCSGFGWDDINKCVTCEEDTFKEWVKSHPNAQG 130

Query: 61  LCHKSFPYYDDLTIVFGKDRATRSHATTTAEVGSEPVMEEK 102
           + +K FP+++DLT +FG+DRAT   A   A+   E  MEE+
Sbjct: 131 MRNKPFPHFEDLTNIFGRDRATGMGAEAPADAVEEIEMEEQ 171

BLAST of Cla97C11G215120 vs. NCBI nr
Match: XP_008441954.1 (PREDICTED: uncharacterized protein LOC103485953 [Cucumis melo] >KAA0047736.1 retrotransposon protein [Cucumis melo var. makuwa] >TYK08388.1 retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 116.7 bits (291), Expect = 1.2e-22
Identity = 57/114 (50.00%), Positives = 74/114 (64.91%), Query Frame = 0

Query: 1   MQVSPHLESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKE 60
           +Q S  ++  V++LK+ Y+ IAEM G  CSGFGWN E + I  E  +FD W+KSHP+AK 
Sbjct: 61  IQESSTIDCHVKSLKKTYHAIAEMRGPSCSGFGWNEEFQCIIAERDLFDSWIKSHPAAKG 120

Query: 61  LCHKSFPYYDDLTIVFGKDRATRSHATTTAEVGSE---------PVMEEKNEDI 106
           L HKSFPYYDDL+ VFGKDRAT + + T   VGS          P+ +  +EDI
Sbjct: 121 LLHKSFPYYDDLSYVFGKDRATGARSETFPNVGSNVSNMFNDTIPLGDSHDEDI 174

BLAST of Cla97C11G215120 vs. NCBI nr
Match: KAA0057610.1 (retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 116.7 bits (291), Expect = 1.2e-22
Identity = 59/108 (54.63%), Positives = 73/108 (67.59%), Query Frame = 0

Query: 7   LESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKELCHKSF 66
           ++ R++TLKR +  IAEM G  CSGFGWN E K I  E ++FD WV+SHP+AK L +K F
Sbjct: 16  IDCRIKTLKRTFQAIAEMRGPACSGFGWNDEEKCIVAEKELFDNWVRSHPAAKGLLNKPF 75

Query: 67  PYYDDLTIVFGKDRATRSHATTTAEVGS-EP-------VMEEKNEDIL 107
           PYYD+LT VFG+DRAT   A T A+VGS EP        M + NED L
Sbjct: 76  PYYDELTYVFGRDRATGRFAETFADVGSNEPGGGYDRFDMGDGNEDFL 123

BLAST of Cla97C11G215120 vs. NCBI nr
Match: KAA0038975.1 (retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 116.3 bits (290), Expect = 1.6e-22
Identity = 52/88 (59.09%), Positives = 65/88 (73.86%), Query Frame = 0

Query: 7   LESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKELCHKSF 66
           ++ R++TLKR +  IAEM G  CSGFGWN E K I  E ++FD WV+SHP+AK L +K F
Sbjct: 67  IDCRIKTLKRTFQAIAEMRGPACSGFGWNDEEKCIVAEKELFDNWVRSHPAAKGLLNKPF 126

Query: 67  PYYDDLTIVFGKDRATRSHATTTAEVGS 95
           PYYD+LT VFG+DRAT   A T A+VGS
Sbjct: 127 PYYDELTYVFGRDRATGRFAETFADVGS 154

BLAST of Cla97C11G215120 vs. ExPASy TrEMBL
Match: A0A5A7V6Q9 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold99G00200 PE=4 SV=1)

HSP 1 Score: 118.2 bits (295), Expect = 2.1e-23
Identity = 53/88 (60.23%), Positives = 66/88 (75.00%), Query Frame = 0

Query: 7   LESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKELCHKSF 66
           ++ R++TLKR +  IAEM G  CSGFGWN E K I  E ++FD WV+SHP+AK L +KSF
Sbjct: 67  IDCRIKTLKRTFQAIAEMRGPACSGFGWNDEEKCIVAEKELFDNWVRSHPAAKGLLNKSF 126

Query: 67  PYYDDLTIVFGKDRATRSHATTTAEVGS 95
           PYYD+LT VFG+DRAT   A T A+VGS
Sbjct: 127 PYYDELTYVFGRDRATGRFAETFADVGS 154

BLAST of Cla97C11G215120 vs. ExPASy TrEMBL
Match: A0A5A7U0H7 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold648G002060 PE=4 SV=1)

HSP 1 Score: 116.7 bits (291), Expect = 6.0e-23
Identity = 57/114 (50.00%), Positives = 74/114 (64.91%), Query Frame = 0

Query: 1   MQVSPHLESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKE 60
           +Q S  ++  V++LK+ Y+ IAEM G  CSGFGWN E + I  E  +FD W+KSHP+AK 
Sbjct: 61  IQESSTIDCHVKSLKKTYHAIAEMRGPSCSGFGWNEEFQCIIAERDLFDSWIKSHPAAKG 120

Query: 61  LCHKSFPYYDDLTIVFGKDRATRSHATTTAEVGSE---------PVMEEKNEDI 106
           L HKSFPYYDDL+ VFGKDRAT + + T   VGS          P+ +  +EDI
Sbjct: 121 LLHKSFPYYDDLSYVFGKDRATGARSETFPNVGSNVSNMFNDTIPLGDSHDEDI 174

BLAST of Cla97C11G215120 vs. ExPASy TrEMBL
Match: A0A5A7UR77 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold497G00880 PE=4 SV=1)

HSP 1 Score: 116.7 bits (291), Expect = 6.0e-23
Identity = 59/108 (54.63%), Positives = 73/108 (67.59%), Query Frame = 0

Query: 7   LESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKELCHKSF 66
           ++ R++TLKR +  IAEM G  CSGFGWN E K I  E ++FD WV+SHP+AK L +K F
Sbjct: 16  IDCRIKTLKRTFQAIAEMRGPACSGFGWNDEEKCIVAEKELFDNWVRSHPAAKGLLNKPF 75

Query: 67  PYYDDLTIVFGKDRATRSHATTTAEVGS-EP-------VMEEKNEDIL 107
           PYYD+LT VFG+DRAT   A T A+VGS EP        M + NED L
Sbjct: 76  PYYDELTYVFGRDRATGRFAETFADVGSNEPGGGYDRFDMGDGNEDFL 123

BLAST of Cla97C11G215120 vs. ExPASy TrEMBL
Match: A0A1S3B4L3 (uncharacterized protein LOC103485953 OS=Cucumis melo OX=3656 GN=LOC103485953 PE=4 SV=1)

HSP 1 Score: 116.7 bits (291), Expect = 6.0e-23
Identity = 57/114 (50.00%), Positives = 74/114 (64.91%), Query Frame = 0

Query: 1   MQVSPHLESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKE 60
           +Q S  ++  V++LK+ Y+ IAEM G  CSGFGWN E + I  E  +FD W+KSHP+AK 
Sbjct: 61  IQESSTIDCHVKSLKKTYHAIAEMRGPSCSGFGWNEEFQCIIAERDLFDSWIKSHPAAKG 120

Query: 61  LCHKSFPYYDDLTIVFGKDRATRSHATTTAEVGSE---------PVMEEKNEDI 106
           L HKSFPYYDDL+ VFGKDRAT + + T   VGS          P+ +  +EDI
Sbjct: 121 LLHKSFPYYDDLSYVFGKDRATGARSETFPNVGSNVSNMFNDTIPLGDSHDEDI 174

BLAST of Cla97C11G215120 vs. ExPASy TrEMBL
Match: A0A5D3CWL2 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G00640 PE=4 SV=1)

HSP 1 Score: 116.3 bits (290), Expect = 7.9e-23
Identity = 52/88 (59.09%), Positives = 65/88 (73.86%), Query Frame = 0

Query: 7   LESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKELCHKSF 66
           ++ R++TLKR +  IAEM G  CSGFGWN E K I  E ++FD WV+SHP+AK L +K F
Sbjct: 67  IDCRIKTLKRTFQAIAEMRGPACSGFGWNDEEKCIVAEKELFDNWVRSHPAAKGLLNKPF 126

Query: 67  PYYDDLTIVFGKDRATRSHATTTAEVGS 95
           PYYD+LT VFG+DRAT   A T A+VGS
Sbjct: 127 PYYDELTYVFGRDRATGRFAETFADVGS 154

BLAST of Cla97C11G215120 vs. TAIR 10
Match: AT5G27260.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G29880.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 58.2 bits (139), Expect = 4.9e-09
Identity = 28/81 (34.57%), Positives = 48/81 (59.26%), Query Frame = 0

Query: 6   HLESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKELCHKS 65
           H  SR++ LK QY    + L    SGFGW+   K      +++  ++K+HP+ K+L + +
Sbjct: 69  HYLSRMKYLKIQYQSCLD-LQRFSSGFGWDPLTKRFTASDEVWSDYLKAHPNNKQLRYDT 128

Query: 66  FPYYDDLTIVFGKDRATRSHA 87
           F ++D+L I+FG+  AT  +A
Sbjct: 129 FEFFDELQIIFGEGVATGKNA 148

BLAST of Cla97C11G215120 vs. TAIR 10
Match: AT1G30140.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G27260.1); Has 313 Blast hits to 256 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 8; Plants - 295; Viruses - 0; Other Eukaryotes - 10 (source: NCBI BLink). )

HSP 1 Score: 52.0 bits (123), Expect = 3.5e-07
Identity = 27/78 (34.62%), Positives = 44/78 (56.41%), Query Frame = 0

Query: 9   SRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKELCHKSFPY 68
           SR++ LK  Y    + L    SGFGW+ E K      +++  ++K+HP+ K +  +S  +
Sbjct: 68  SRLKFLKNLYQSYLD-LKRFSSGFGWDPETKKFTAPDEVWRDYLKAHPNHKHMQTESIDH 127

Query: 69  YDDLTIVFGKDRATRSHA 87
           ++DL I+FG   AT S A
Sbjct: 128 FEDLQIIFGDVVATGSFA 144

BLAST of Cla97C11G215120 vs. TAIR 10
Match: AT2G24960.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G02210.2); Has 1453 Blast hits to 509 proteins in 26 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 39; Plants - 1363; Viruses - 0; Other Eukaryotes - 50 (source: NCBI BLink). )

HSP 1 Score: 47.0 bits (110), Expect = 1.1e-05
Identity = 30/104 (28.85%), Positives = 46/104 (44.23%), Query Frame = 0

Query: 7   LESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKELCHKSF 66
           L  R   L + Y  +  +L     GF W+  R  I  +  ++D ++K HP A+    KS 
Sbjct: 223 LRHRYNKLLKYYKDMEAILKE--DGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSL 282

Query: 67  PYYDDLTIVF------GKDRATRSHATTTAEVGSEPVMEEKNED 105
           P Y+DL  +F      G D      A  T+E  +    +E+N D
Sbjct: 283 PSYNDLDTIFACQAEQGTDHRDDGSAAQTSETKAS---QEQNSD 321

BLAST of Cla97C11G215120 vs. TAIR 10
Match: AT2G24960.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 12 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G02210.2); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 47.0 bits (110), Expect = 1.1e-05
Identity = 30/104 (28.85%), Positives = 46/104 (44.23%), Query Frame = 0

Query: 7   LESRVRTLKRQYNVIAEMLGSGCSGFGWNAERKYIDCEAKIFDIWVKSHPSAKELCHKSF 66
           L  R   L + Y  +  +L     GF W+  R  I  +  ++D ++K HP A+    KS 
Sbjct: 223 LRHRYNKLLKYYKDMEAILKE--DGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSL 282

Query: 67  PYYDDLTIVF------GKDRATRSHATTTAEVGSEPVMEEKNED 105
           P Y+DL  +F      G D      A  T+E  +    +E+N D
Sbjct: 283 PSYNDLDTIFACQAEQGTDHRDDGSAAQTSETKAS---QEQNSD 321

The following BLAST results are available for this feature:

NCBI nr
Match Name | E-value | Identity (%) | Description
KAA0064022.1 | 4.3e-23 | 60.23 | retrotransposon protein [Cucumis melo var. makuwa]
XP_038987004.1 | 9.5e-23 | 51.49 | uncharacterized protein At2g29880-like [Phoenix dactylifera]
XP_008441954.1 | 1.2e-22 | 50.00 | PREDICTED: uncharacterized protein LOC103485953 [Cucumis melo] >KAA0047736.1 ret...
KAA0057610.1 | 1.2e-22 | 54.63 | retrotransposon protein [Cucumis melo var. makuwa]
KAA0038975.1 | 1.6e-22 | 59.09 | retrotransposon protein [Cucumis melo var. makuwa]

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A5A7V6Q9 | 2.1e-23 | 60.23 | Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A5A7U0H7 | 6.0e-23 | 50.00 | Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A5A7UR77 | 6.0e-23 | 54.63 | Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A1S3B4L3 | 6.0e-23 | 50.00 | uncharacterized protein LOC103485953 OS=Cucumis melo OX=3656 GN=LOC103485953 PE=...
A0A5D3CWL2 | 7.9e-23 | 59.09 | Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...

TAIR 10
Match Name | E-value | Identity (%) | Description
AT5G27260.1 | 4.9e-09 | 34.57 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G30140.1 | 3.5e-07 | 34.62 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G24960.1 | 1.1e-05 | 28.85 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G24960.2 | 1.1e-05 | 28.85 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 86..108
None | No IPR available | PANTHER | PTHR46250 | MYB/SANT-LIKE DNA-BINDING DOMAIN PROTEIN-RELATED | coord: 2..97

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cla97C11G215120.1 | Cla97C11G215120.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0016020 | membrane