Cla001862 (gene) Watermelon (97103) v1

Name: Cla001862
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Retrotransposon protein (AHRD V1 *-*- E5GBB2_CUCME)
Location: Chr1 : 12011004 .. 12011654 (+)
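
The location field can be read programmatically. The sketch below is illustrative only: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Parse a location such as 'Chr1 : 12011004 .. 12011654 (+)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)"
    m = re.match(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Chr1 : 12011004 .. 12011654 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 651 bases on the plus strand of Chr1.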
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATGCAGCAAAGGATTCCAGGGTGTTCCATACAGGTAAGCCCGAATTTGGAGTCCAGGGTCAGGATGTTGAAGAGACAGTACAGCGTGATCGTTGAAATGTTGGGCCTAGGATATAGTGGGTTTGGTTGGAATGCGGAGCGCAAATGTAATGACTGTGAGCCGAAGATATTTGACGCATGGGTCAAGGTAATTTTATTATTTTTCTATTTATTCTAGGTTGTTTTAAATATGTTTTAGCCATTACATAACACATTTAATCTTTAACAGAGTCATCCGAGTGCAAAAGGACTGCGCCATAATTCATTTCCGTTTTATGACGACTTGGCCATTATATTTGGCAAAGACAAAGCAACAGGGAGTCGTGCCACTACCATTGCAGAGGTCGGATCTAAACCTGTTGTGGAAGAGGAGAACGAGGACATCTTGAATAACCAACCCCCGGACTTTGAGAACTTCTATATTCCCGATCCACCGTTCACCAGCTCGACCACATTAGAGGACCTTCCAACTACCCTCGGCGATAGAGGGTCTGGGAGTAGCATGTCAACAAGAAGTAGGAGGTCCCGAAGTTCCTCAATTGGAGAGTATAGCGAGGTGGTTCGAGATGGCTTCCAACTTCTGATGAAGTCCATTGACGGCATTGCATAG

mRNA sequence

ATGATGCAGCAAAGGATTCCAGGGTGTTCCATACAGGTAAGCCCGAATTTGGAGTCCAGGGTCAGGATGTTGAAGAGACAGTACAGCGTGATCGTTGAAATGTTGGGCCTAGGATATAGTGGGTTTGGTTGGAATGCGGAGCGCAAATGTAATGACTGTGAGCCGAAGATATTTGACGCATGGGTCAAGAGTCATCCGAGTGCAAAAGGACTGCGCCATAATTCATTTCCGTTTTATGACGACTTGGCCATTATATTTGGCAAAGACAAAGCAACAGGGAGTCGTGCCACTACCATTGCAGAGGTCGGATCTAAACCTGTTGTGGAAGAGGAGAACGAGGACATCTTGAATAACCAACCCCCGGACTTTGAGAACTTCTATATTCCCGATCCACCGTTCACCAGCTCGACCACATTAGAGGACCTTCCAACTACCCTCGGCGATAGAGGGTCTGGGAGTAGCATGTCAACAAGAAGTAGGAGGTCCCGAAGTTCCTCAATTGGAGAGTATAGCGAGGTGGTTCGAGATGGCTTCCAACTTCTGATGAAGTCCATTGACGGCATTGCATAG

Coding sequence (CDS)

ATGATGCAGCAAAGGATTCCAGGGTGTTCCATACAGGTAAGCCCGAATTTGGAGTCCAGGGTCAGGATGTTGAAGAGACAGTACAGCGTGATCGTTGAAATGTTGGGCCTAGGATATAGTGGGTTTGGTTGGAATGCGGAGCGCAAATGTAATGACTGTGAGCCGAAGATATTTGACGCATGGGTCAAGAGTCATCCGAGTGCAAAAGGACTGCGCCATAATTCATTTCCGTTTTATGACGACTTGGCCATTATATTTGGCAAAGACAAAGCAACAGGGAGTCGTGCCACTACCATTGCAGAGGTCGGATCTAAACCTGTTGTGGAAGAGGAGAACGAGGACATCTTGAATAACCAACCCCCGGACTTTGAGAACTTCTATATTCCCGATCCACCGTTCACCAGCTCGACCACATTAGAGGACCTTCCAACTACCCTCGGCGATAGAGGGTCTGGGAGTAGCATGTCAACAAGAAGTAGGAGGTCCCGAAGTTCCTCAATTGGAGAGTATAGCGAGGTGGTTCGAGATGGCTTCCAACTTCTGATGAAGTCCATTGACGGCATTGCATAG

Protein sequence

MMQQRIPGCSIQVSPNLESRVRMLKRQYSVIVEMLGLGYSGFGWNAERKCNDCEPKIFDAWVKSHPSAKGLRHNSFPFYDDLAIIFGKDKATGSRATTIAEVGSKPVVEEENEDILNNQPPDFENFYIPDPPFTSSTTLEDLPTTLGDRGSGSSMSTRSRRSRSSSIGEYSEVVRDGFQLLMKSIDGIA
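
The sequences above are related in the usual way: removing the intron from the gene sequence gives the mRNA/CDS, and translating the CDS gives the protein. The following is a minimal sketch of that relationship, assuming the standard genetic code. The helper names `splice` and `translate` are hypothetical, and the intron coordinates in the toy example are invented for illustration, since this page does not list exon boundaries.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def splice(gene, introns):
    """Remove introns given as 0-based, half-open (start, end) intervals."""
    pieces, pos = [], 0
    for start, end in sorted(introns):
        pieces.append(gene[pos:start])
        pos = end
    pieces.append(gene[pos:])
    return "".join(pieces)

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# Toy gene (hypothetical): two exons around a GT...AG intron.
toy_gene = "ATGAAA" + "GTAAAG" + "CCCTAG"
print(translate(splice(toy_gene, [(6, 12)])))

# First 24 bases of the CDS listed above.
print(translate("ATGATGCAGCAAAGGATTCCAGGG"))
```

The second call prints `MMQQRIPG`, which agrees with the first eight residues of the protein sequence shown above.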
BLAST of Cla001862 vs. TrEMBL
Match: E5GBB2_CUCME (Retrotransposon protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 122.1 bits (305), Expect = 7.3e-25
Identity = 62/163 (38.04%), Positives = 91/163 (55.83%), Query Frame = 1

Query: 1   MMQQRIPGCSIQVSPNLESRVRMLKRQYSVIVEMLGLGYSGFGWNAERKCNDCEPKIFDA 60
           MM +++ GC ++ +  ++ R++ LKR +  I EMLG   SGFGWN E KC   E ++FD 
Sbjct: 368 MMAEKLSGCQVRATTVIDCRIKTLKRTFQAIAEMLGPACSGFGWNDEEKCIVAEKELFDN 427

Query: 61  WVKSHPSAKGLRHNSFPFYDDLAIIFGKDKATGSRATTIAEVGSKPVVEEENEDILNNQP 120
           WV+S P+AKGL +N FP+YD+L  +FG+D+ATG  A T A+VGS       +   + +  
Sbjct: 428 WVRSPPAAKGLLNNPFPYYDELTYVFGRDRATGRFAETFADVGSNEPGGGYDRFDMGDGN 487

Query: 121 PDFENFYIPDPPFTSSTTLEDLPTTLGDRGSGSSMSTRSRRSR 164
            DF   Y               P+   +  +GSS S R R S+
Sbjct: 488 EDFPPVYSRGVDILQDDVRASRPSRASEGKTGSSGSKRKRGSQ 530
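
The percentages in each HSP header are the match counts divided by the alignment length. A small sketch, with the hypothetical helper `percent`, reproduces the figures reported for the HSP above (62 identical and 91 positive positions out of 163 aligned columns):

```python
def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage rounded to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)

identity = percent(62, 163)   # identical residues
positives = percent(91, 163)  # identical plus conservative substitutions
print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
```

The same arithmetic applies to every HSP header in this record.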

BLAST of Cla001862 vs. TrEMBL
Match: A0A162AHN3_DAUCA (Uncharacterized protein OS=Daucus carota subsp. sativus GN=DCAR_009785 PE=4 SV=1)

HSP 1 Score: 119.0 bits (297), Expect = 6.2e-24
Identity = 54/112 (48.21%), Positives = 77/112 (68.75%), Query Frame = 1

Query: 1   MMQQRIPGCSIQVSPNLESRVRMLKRQYSVIVEMLGLGYSGFGWNAERKCNDCEPKIFDA 60
           +M + +PGC ++  P++ESRVR+L++Q+  I EM G   SGFGWN   K   CE  IF+ 
Sbjct: 62  IMGELLPGCGMKARPHIESRVRLLRKQFFAIEEMRGPNCSGFGWNELEKSITCEKSIFEE 121

Query: 61  WVKSHPSAKGLRHNSFPFYDDLAIIFGKDKATGSRATTIAEVGSKPVVEEEN 113
           W+KSHP+AKGLR+ SFPFYD+LA +FGKD+A G    + A+   +   +EE+
Sbjct: 122 WLKSHPNAKGLRNKSFPFYDELAQVFGKDRANGEGVESPADAVEEIANDEES 173

BLAST of Cla001862 vs. TrEMBL
Match: A0A0K9RVV5_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_028730 PE=4 SV=1)

HSP 1 Score: 116.7 bits (291), Expect = 3.1e-23
Identity = 58/164 (35.37%), Positives = 95/164 (57.93%), Query Frame = 1

Query: 1   MMQQRIPGCSIQVSPNLESRVRMLKRQYSVIVEMLGLGYSGFGWNAERKCNDCEPKIFDA 60
           +M++++P C  +  P++ESRV++L++QY  I EML    SGFGWN E K   C   ++D 
Sbjct: 63  LMKEKLPECDKKAKPHIESRVKLLRKQYDAIPEMLSPSASGFGWNDEGKFVTCPQSVWDE 122

Query: 61  WVKSHPSAKGLRHNSFPFYDDLAIIFGKDKATGSRATTIAEVGSKPVVEEENEDILNNQP 120
           W+KSH +A GLR+  FPF+DDL  IFGKD+  G+ AT + +V  +  ++EE +++     
Sbjct: 123 WIKSHKNAAGLRNKPFPFFDDLGKIFGKDRDVGNEATNVYDVLEE--MDEEEQEV----- 182

Query: 121 PDFENFYIPDPPFTSSTTLEDLPTTLGDRGSGSSMSTRSRRSRS 165
                     P    S  L  +    G   S ++ ++R++R+R+
Sbjct: 183 ----------PESIESINLTPIGNPTGHSSSPTTTTSRTKRART 209

BLAST of Cla001862 vs. TrEMBL
Match: E5GCB5_CUCME (Retrotransposon protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 116.3 bits (290), Expect = 4.0e-23
Identity = 65/171 (38.01%), Positives = 98/171 (57.31%), Query Frame = 1

Query: 1   MMQQRIPGCSIQVSPNLESRVRMLKRQYSVIVEMLGLGYSGFGWNAERKCNDCEPKIFDA 60
           MM  +IPG +I  S  ++SR++++KR +  + EM G   SGFGWN E+KC   E ++FD 
Sbjct: 406 MMAFKIPGSNIHAS-TIDSRIKLMKRMFHALAEMRGPNCSGFGWNDEKKCIVAEKEVFDD 465

Query: 61  WVKSHPSAKGLRHNSFPFYDDLAIIFGKDKATGSRATTIAEVGSK--PVVEEENEDILNN 120
           W  SHP+AKGL + SF  YD+L+ +FGKD+ATG RA + A++GS   P  +    D + +
Sbjct: 466 W--SHPAAKGLLNKSFVHYDELSYVFGKDRATGGRAESFADIGSNDPPGYDAGAADAMPD 525

Query: 121 QPPDFENFYIPDPPFTSSTTLEDLPTTLGDRGSGSSMSTRSRRSRSSSIGE 170
              DF   Y P    +    +E     + +R + SS S R R   ++  G+
Sbjct: 526 --TDFPPMYSPGLNMSPDDLMETRTARVSERRNVSSGSKRKRPGHATDSGD 571

BLAST of Cla001862 vs. TrEMBL
Match: A0A161XV48_DAUCA (Uncharacterized protein OS=Daucus carota subsp. sativus GN=DCAR_018898 PE=4 SV=1)

HSP 1 Score: 115.9 bits (289), Expect = 5.2e-23
Identity = 54/112 (48.21%), Positives = 75/112 (66.96%), Query Frame = 1

Query: 1   MMQQRIPGCSIQVSPNLESRVRMLKRQYSVIVEMLGLGYSGFGWNAERKCNDCEPKIFDA 60
           +M+   PGC ++  P++ESRVR+ ++QY  I EM G   SGFGWN   K   CE  IF+ 
Sbjct: 60  IMEDIQPGCGMKARPHIESRVRLWRKQYFAIEEMRGPNCSGFGWNELDKSITCEKSIFED 119

Query: 61  WVKSHPSAKGLRHNSFPFYDDLAIIFGKDKATGSRATTIAEVGSKPVVEEEN 113
           W+KSHP+AKGLR+ SFP+YD+L+ +FGKD+A G    + A+   +   EEEN
Sbjct: 120 WLKSHPNAKGLRNKSFPYYDELSQVFGKDRANGECVESPADAVEEIANEEEN 171

BLAST of Cla001862 vs. TAIR10
Match: AT5G27260.1 (AT5G27260.1 unknown protein)

HSP 1 Score: 63.5 bits (153), Expect = 1.6e-10
Identity = 35/109 (32.11%), Positives = 58/109 (53.21%), Query Frame = 1

Query: 19  SRVRMLKRQYSVIVEMLGLGYSGFGWNAERKCNDCEPKIFDAWVKSHPSAKGLRHNSFPF 78
           SR++ LK QY   +++     SGFGW+   K      +++  ++K+HP+ K LR+++F F
Sbjct: 72  SRMKYLKIQYQSCLDLQRFS-SGFGWDPLTKRFTASDEVWSDYLKAHPNNKQLRYDTFEF 131

Query: 79  YDDLAIIFGKDKATGSRATTIAEVGSKPVVEEENEDILNNQPPDFENFY 128
           +D+L IIFG+  ATG  A  + +  +  +     E+       DF+N Y
Sbjct: 132 FDELQIIFGEGVATGKNAIGLCD-STDGLTYRAGENPRKEYVDDFDNVY 178

BLAST of Cla001862 vs. TAIR10
Match: AT1G30140.1 (AT1G30140.1 unknown protein)

HSP 1 Score: 53.9 bits (128), Expect = 1.2e-07
Identity = 29/86 (33.72%), Positives = 47/86 (54.65%), Query Frame = 1

Query: 16  NLESRVRMLKRQYSVIVEMLGLGYSGFGWNAERKCNDCEPKIFDAWVKSHPSAKGLRHNS 75
           N  SR++ LK  Y   +++     SGFGW+ E K      +++  ++K+HP+ K ++  S
Sbjct: 65  NYMSRLKFLKNLYQSYLDLKRFS-SGFGWDPETKKFTAPDEVWRDYLKAHPNHKHMQTES 124

Query: 76  FPFYDDLAIIFGKDKATGSRATTIAE 102
              ++DL IIFG   ATGS A  +++
Sbjct: 125 IDHFEDLQIIFGDVVATGSFAVGMSD 149

BLAST of Cla001862 vs. NCBI nr
Match: gi|659111294|ref|XP_008455678.1| (PREDICTED: uncharacterized protein At2g29880-like [Cucumis melo])

HSP 1 Score: 142.5 bits (358), Expect = 7.5e-31
Identity = 79/191 (41.36%), Positives = 114/191 (59.69%), Query Frame = 1

Query: 1   MMQQRIPGCSIQVSPNLESRVRMLKRQYSVIVEMLGLGYSGFGWNAERKCNDCEPKIFDA 60
           +M+++IP  +IQV+ NLESRV+ LK+QY+ I +M+G   S FGWN ERKC + E  +FD 
Sbjct: 84  LMKEKIPRSNIQVTLNLESRVKFLKKQYTAIAKMMGPACSRFGWNEERKCIEAEKSVFDD 143

Query: 61  WVKSHPSAKGLRHNSFPFYDDLAIIFGKDKATGSRATTIAEVGSKPVVEEENEDILNNQP 120
           WVK HP+A+GL +  F ++ DL I+FG+DKATG R     E+ S+   + E +D+  N  
Sbjct: 144 WVKGHPNARGLLNKPFAYFYDLEIVFGRDKATGGRCKPFVEMASQTARDTEEDDMDIN-- 203

Query: 121 PDFENFYIPDPPFTSSTTLEDLPTTL--GDRGSGSSMSTRSRRSRSSSIGEYSEVVRDGF 180
              E+F IP+P      + ED+P+TL      +GSS  ++ RRS     G+  +  R   
Sbjct: 204 --LEDFDIPNPHGLEPPSGEDMPSTLISMTHDAGSSRPSKKRRSYP---GDLMDTFRASM 263

Query: 181 QLLMKSIDGIA 190
           Q   K I  IA
Sbjct: 264 QETSKEIGKIA 267

BLAST of Cla001862 vs. NCBI nr
Match: gi|659111565|ref|XP_008455793.1| (PREDICTED: uncharacterized protein At2g29880-like [Cucumis melo])

HSP 1 Score: 128.3 bits (321), Expect = 1.5e-26
Identity = 60/123 (48.78%), Positives = 81/123 (65.85%), Query Frame = 1

Query: 2   MQQRIPGCSIQVSPNLESRVRMLKRQYSVIVEMLGLGYSGFGWNAERKCNDCEPKIFDAW 61
           M +++PG +IQ SP ++ RV+ LK+ Y  I EM G   SGFGWN E +C   E  +FD+W
Sbjct: 1   MAEKLPGTNIQASPTIDCRVKSLKKTYHAIAEMRGPSCSGFGWNEEFQCIIVERDLFDSW 60

Query: 62  VKSHPSAKGLRHNSFPFYDDLAIIFGKDKATGSRATTIAEVGSK---------PVVEEEN 116
           VKSHP+ KGL H SFP+YDDL  +FGKD+ATG+R+ T  +VGS          P+ +  +
Sbjct: 61  VKSHPATKGLLHKSFPYYDDLTYVFGKDRATGARSETFVDVGSNVPNMFNDTIPLGDSHD 120

BLAST of Cla001862 vs. NCBI nr
Match: gi|659082641|ref|XP_008441954.1| (PREDICTED: uncharacterized protein At2g29880-like [Cucumis melo])

HSP 1 Score: 127.9 bits (320), Expect = 1.9e-26
Identity = 73/194 (37.63%), Positives = 104/194 (53.61%), Query Frame = 1

Query: 1   MMQQRIPGCSIQVSPNLESRVRMLKRQYSVIVEMLGLGYSGFGWNAERKCNDCEPKIFDA 60
           MM +++PG +IQ S  ++  V+ LK+ Y  I EM G   SGFGWN E +C   E  +FD+
Sbjct: 51  MMAEKLPGTNIQESSTIDCHVKSLKKTYHAIAEMRGPSCSGFGWNEEFQCIIAERDLFDS 110

Query: 61  WVKSHPSAKGLRHNSFPFYDDLAIIFGKDKATGSRATTIAEVGSKPVVEEENEDILNNQP 120
           W+KSHP+AKGL H SFP+YDDL+ +FGKD+ATG+R+ T   VGS       N   + N  
Sbjct: 111 WIKSHPAAKGLLHKSFPYYDDLSYVFGKDRATGARSETFPNVGS-------NVSNMFNDT 170

Query: 121 PDFENFYIPDPPFTSSTTLEDLPTTL-----GDRGSGSSMSTRSRRSRSSSIGEYSEVVR 180
               + +  D P   S  +   P  +     G      + S+ S+R R S   E  EV+R
Sbjct: 171 IPLGDSHDEDIPTMYSQGVHMSPDEMFGIRAGQASERRNCSSVSKRKRGSERYETVEVIR 230

Query: 181 DGFQLLMKSIDGIA 190
              +   + +  IA
Sbjct: 231 SVMEFGNEQLKAIA 237

BLAST of Cla001862 vs. NCBI nr
Match: gi|720026936|ref|XP_010264414.1| (PREDICTED: uncharacterized protein LOC104602427 [Nelumbo nucifera])

HSP 1 Score: 123.2 bits (308), Expect = 4.7e-25
Identity = 54/111 (48.65%), Positives = 76/111 (68.47%), Query Frame = 1

Query: 2   MQQRIPGCSIQVSPNLESRVRMLKRQYSVIVEMLGLGYSGFGWNAERKCNDCEPKIFDAW 61
           M+  +PGC ++ +P++ESRV+ +KRQY+ I EML    SGFGW+  +KC  C+ ++F+ W
Sbjct: 58  METNLPGCGLKANPHIESRVKHMKRQYAAICEMLSPSCSGFGWDDVKKCITCKDEVFNGW 117

Query: 62  VKSHPSAKGLRHNSFPFYDDLAIIFGKDKATGSRATTIAEVGSKPVVEEEN 113
           VKSHP AKGLR+  FPF+D L+IIFG+D+ATG      A+       EE N
Sbjct: 118 VKSHPHAKGLRNKPFPFFDGLSIIFGRDRATGESVEAPADAAENVEREEYN 168

BLAST of Cla001862 vs. NCBI nr
Match: gi|307135889|gb|ADN33754.1| (retrotransposon protein [Cucumis melo subsp. melo])

HSP 1 Score: 122.1 bits (305), Expect = 1.0e-24
Identity = 62/163 (38.04%), Positives = 91/163 (55.83%), Query Frame = 1

Query: 1   MMQQRIPGCSIQVSPNLESRVRMLKRQYSVIVEMLGLGYSGFGWNAERKCNDCEPKIFDA 60
           MM +++ GC ++ +  ++ R++ LKR +  I EMLG   SGFGWN E KC   E ++FD 
Sbjct: 368 MMAEKLSGCQVRATTVIDCRIKTLKRTFQAIAEMLGPACSGFGWNDEEKCIVAEKELFDN 427

Query: 61  WVKSHPSAKGLRHNSFPFYDDLAIIFGKDKATGSRATTIAEVGSKPVVEEENEDILNNQP 120
           WV+S P+AKGL +N FP+YD+L  +FG+D+ATG  A T A+VGS       +   + +  
Sbjct: 428 WVRSPPAAKGLLNNPFPYYDELTYVFGRDRATGRFAETFADVGSNEPGGGYDRFDMGDGN 487

Query: 121 PDFENFYIPDPPFTSSTTLEDLPTTLGDRGSGSSMSTRSRRSR 164
            DF   Y               P+   +  +GSS S R R S+
Sbjct: 488 EDFPPVYSRGVDILQDDVRASRPSRASEGKTGSSGSKRKRGSQ 530

The following BLAST results are available for this feature:
Match Name          E-value   Identity (%)   Description
E5GBB2_CUCME        7.3e-25   38.04          Retrotransposon protein OS=Cucumis melo subsp. melo PE=4 SV=1
A0A162AHN3_DAUCA    6.2e-24   48.21          Uncharacterized protein OS=Daucus carota subsp. sativus GN=DCAR_009785 PE=4 SV=1
A0A0K9RVV5_SPIOL    3.1e-23   35.37          Uncharacterized protein OS=Spinacia oleracea GN=SOVF_028730 PE=4 SV=1
E5GCB5_CUCME        4.0e-23   38.01          Retrotransposon protein OS=Cucumis melo subsp. melo PE=4 SV=1
A0A161XV48_DAUCA    5.2e-23   48.21          Uncharacterized protein OS=Daucus carota subsp. sativus GN=DCAR_018898 PE=4 SV=1

Match Name          E-value   Identity (%)   Description
AT5G27260.1         1.6e-10   32.11          unknown protein
AT1G30140.1         1.2e-07   33.72          unknown protein

Match Name                          E-value   Identity (%)   Description
gi|659111294|ref|XP_008455678.1|    7.5e-31   41.36          PREDICTED: uncharacterized protein At2g29880-like [Cucumis melo]
gi|659111565|ref|XP_008455793.1|    1.5e-26   48.78          PREDICTED: uncharacterized protein At2g29880-like [Cucumis melo]
gi|659082641|ref|XP_008441954.1|    1.9e-26   37.63          PREDICTED: uncharacterized protein At2g29880-like [Cucumis melo]
gi|720026936|ref|XP_010264414.1|    4.7e-25   48.65          PREDICTED: uncharacterized protein LOC104602427 [Nelumbo nucifera]
gi|307135889|gb|ADN33754.1|         1.0e-24   38.04          retrotransposon protein [Cucumis melo subsp. melo]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term   Definition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name   Type
Cla001862      Cla001862.1   mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term   IPR Description    Source    Source Term      Source Description    Alignment
None       No IPR available   PANTHER   PTHR31704        FAMILY NOT NAMED      coord: 1..115; score: 4.3
None       No IPR available   PANTHER   PTHR31704:SF16   SUBFAMILY NOT NAMED   coord: 1..115; score: 4.3

The following gene(s) are orthologous to this gene:
Gene        Orthologue        Organism                Block
Cla001862   Cla97C01G012810   Watermelon (97103) v2   wmwmbB248
The following gene(s) are paralogous to this gene:

None