Cla011607 (gene) Watermelon (97103) v1

Name: Cla011607
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Retrotransposon protein (AHRD V1 ***- E5GBB2_CUCME)
Location: Chr4 : 3961717 .. 3962286 (+)
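
The location field can be read as a chromosome, an inclusive start and end coordinate, and a strand. The following is a minimal sketch of parsing it and computing the span; the function names `parse_location` and `span_length` are hypothetical, and the inclusive, 1-based reading of the coordinates is an assumption based on common genome-annotation conventions rather than something this record states.

```python
import re

# Pattern for a location such as "Chr4 : 3961717 .. 3962286 (+)".
LOCATION_RE = re.compile(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text: str) -> tuple:
    """Split a location string into (chromosome, start, end, strand)."""
    match = LOCATION_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start: int, end: int) -> int:
    """Length of an interval whose start and end are both included (assumed)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


chrom, start, end, strand = parse_location("Chr4 : 3961717 .. 3962286 (+)")
print(chrom, strand, span_length(start, end))  # Chr4 + 570
```

Under that assumption the feature spans 570 bp on the forward strand of Chr4.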
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATGCAACAAAAAATTTCAGGGTGTTCCATACAGGTAAACCCAAATCTAGAGTCCAGGGTCAAGATGTTGAAGAGACAATACAGTGCGATCGCTGAAATGTTGTGCCCAGGATGTAGTAGGTTTGGATGGAACGCAGAGCGAAAGTGTACTGACTGTGAGCCTAAGATATTCGATGCGTGGGTCAAGGTAATGAACAAAATTTATTATTTGTTATATGTTTTTTTAAATATATTTTAGCCAAAACATAACACATTTCATCTCATACAGAGTCATCCGAGTGCAAAAGGACTGAGGCATAAGTCATTTCCATTTTATGATGACTTGGCGATTGTATCTAGCAAAGATAGAGCAACAGGGAGTCGTGTCACTACCACTGCTGAGGTCGGATCTGAACCTGTAGTGGAAGAGAAGAATGAGGACATTTTAAATAACCAATCCCCATACTTTGAAATTTTTTATATTCCTGATCCACCGTTCGCCAGCTCGCCCACATCAGAAGACCTTCCAACTACCCCGGTCGGTAGAGGGGCTGGGAGTAGCATGCCAACAAGAAGTAGGAGATCTTGA

mRNA sequence

ATGATGCAACAAAAAATTTCAGGGTGTTCCATACAGGTAAACCCAAATCTAGAGTCCAGGGTCAAGATGTTGAAGAGACAATACAGTGCGATCGCTGAAATGTTGTGCCCAGGATGTAGTAGGTTTGGATGGAACGCAGAGCGAAAGTGTACTGACTGTGAGCCTAAGATATTCGATGCGTGGGTCAAGAGTCATCCGAGTGCAAAAGGACTGAGGCATAAGTCATTTCCATTTTATGATGACTTGGCGATTGTATCTAGCAAAGATAGAGCAACAGGGAGTCGTGTCACTACCACTGCTGAGGTCGGATCTGAACCTGTAGTGGAAGAGAAGAATGAGGACATTTTAAATAACCAATCCCCATACTTTGAAATTTTTTATATTCCTGATCCACCGTTCGCCAGCTCGCCCACATCAGAAGACCTTCCAACTACCCCGGTCGGTAGAGGGGCTGGGAGTAGCATGCCAACAAGAAGTAGGAGATCTTGA
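
The gene sequence and the mRNA sequence differ only by the intron that is spliced out. A rough way to recover that intron from the two strings is to strip their shared prefix and shared suffix. The sketch below assumes exactly one intron and otherwise identical sequences; `find_intron` is a hypothetical helper, not part of any database tooling. When bases repeat across the splice junction the exact boundary is ambiguous, and this approach simply places the intron as far toward the 3' end as the prefix match allows.

```python
def find_intron(gene: str, mrna: str) -> str:
    """Return the single gene segment that is absent from the mRNA.

    Assumes the gene equals the mRNA with one contiguous insertion.
    """
    if len(gene) < len(mrna):
        raise ValueError("gene sequence must be at least as long as the mRNA")

    # Longest prefix shared by both sequences.
    prefix = 0
    while prefix < len(mrna) and gene[prefix] == mrna[prefix]:
        prefix += 1

    # Longest shared suffix, capped so that prefix and suffix
    # never cover the same mRNA position twice.
    suffix = 0
    while suffix < len(mrna) - prefix and gene[-1 - suffix] == mrna[-1 - suffix]:
        suffix += 1

    if prefix + suffix != len(mrna):
        raise ValueError("sequences differ by more than one contiguous insertion")
    return gene[prefix:len(gene) - suffix]


# Toy example (not the real record): exons "AAA" and "TTT" around one intron.
print(find_intron("AAAGTCCCAGTTT", "AAATTT"))  # GTCCCAG
```

Passing the gene and mRNA strings from this record to the same function should return the intronic segment, provided the single-intron assumption holds.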

Coding sequence (CDS)

ATGATGCAACAAAAAATTTCAGGGTGTTCCATACAGGTAAACCCAAATCTAGAGTCCAGGGTCAAGATGTTGAAGAGACAATACAGTGCGATCGCTGAAATGTTGTGCCCAGGATGTAGTAGGTTTGGATGGAACGCAGAGCGAAAGTGTACTGACTGTGAGCCTAAGATATTCGATGCGTGGGTCAAGAGTCATCCGAGTGCAAAAGGACTGAGGCATAAGTCATTTCCATTTTATGATGACTTGGCGATTGTATCTAGCAAAGATAGAGCAACAGGGAGTCGTGTCACTACCACTGCTGAGGTCGGATCTGAACCTGTAGTGGAAGAGAAGAATGAGGACATTTTAAATAACCAATCCCCATACTTTGAAATTTTTTATATTCCTGATCCACCGTTCGCCAGCTCGCCCACATCAGAAGACCTTCCAACTACCCCGGTCGGTAGAGGGGCTGGGAGTAGCATGCCAACAAGAAGTAGGAGATCTTGA

Protein sequence

MMQQKISGCSIQVNPNLESRVKMLKRQYSAIAEMLCPGCSRFGWNAERKCTDCEPKIFDAWVKSHPSAKGLRHKSFPFYDDLAIVSSKDRATGSRVTTTAEVGSEPVVEEKNEDILNNQSPYFEIFYIPDPPFASSPTSEDLPTTPVGRGAGSSMPTRSRRS
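
The protein sequence is the translation of the coding sequence. As an illustration, the sketch below translates a DNA string with the standard genetic code and stops at the first stop codon. The `translate` function and the table construction are written for this example only; using the standard code for this nuclear gene is an assumption.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons of the coding sequence listed above.
print(translate("ATGATGCAACAAAAAATTTCAGGG"))  # MMQQKISG
```

Translating the full coding sequence this way gives a quick consistency check against the listed protein sequence.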
BLAST of Cla011607 vs. TrEMBL
Match: A0A162AHN3_DAUCA (Uncharacterized protein OS=Daucus carota subsp. sativus GN=DCAR_009785 PE=4 SV=1)

HSP 1 Score: 119.8 bits (299), Expect = 3.1e-24
Identity = 55/112 (49.11%), Positives = 77/112 (68.75%), Query Frame = 1

Query: 1   MMQQKISGCSIQVNPNLESRVKMLKRQYSAIAEMLCPGCSRFGWNAERKCTDCEPKIFDA 60
           +M + + GC ++  P++ESRV++L++Q+ AI EM  P CS FGWN   K   CE  IF+ 
Sbjct: 62  IMGELLPGCGMKARPHIESRVRLLRKQFFAIEEMRGPNCSGFGWNELEKSITCEKSIFEE 121

Query: 61  WVKSHPSAKGLRHKSFPFYDDLAIVSSKDRATGSRVTTTAEVGSEPVVEEKN 113
           W+KSHP+AKGLR+KSFPFYD+LA V  KDRA G  V + A+   E   +E++
Sbjct: 122 WLKSHPNAKGLRNKSFPFYDELAQVFGKDRANGEGVESPADAVEEIANDEES 173
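
Each HSP summary reports identity and positives as a count over the alignment length, followed by the same value as a percentage. The sketch below re-derives the percentages from the fractions, which is a cheap sanity check when copying these figures elsewhere. The `recompute_percentages` helper and its regular expression are assumptions made for this example and are not part of BLAST itself; the pattern also tolerates the "Postives" spelling that appears in some exports.

```python
import re

# Matches "Identity = 55/112" and "Positives = 77/112" (or "Postives").
STAT_RE = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")


def recompute_percentages(stat_line: str) -> dict:
    """Recompute percent identity and positives from their raw fractions."""
    result = {}
    for label, hits, length in STAT_RE.findall(stat_line):
        key = "identity" if label == "Identity" else "positives"
        result[key] = round(100 * int(hits) / int(length), 2)
    return result


line = "Identity = 55/112 (49.11%), Positives = 77/112 (68.75%), Query Frame = 1"
print(recompute_percentages(line))  # {'identity': 49.11, 'positives': 68.75}
```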

BLAST of Cla011607 vs. TrEMBL
Match: E5GBB2_CUCME (Retrotransposon protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 117.9 bits (294), Expect = 1.2e-23
Identity = 54/104 (51.92%), Positives = 71/104 (68.27%), Query Frame = 1

Query: 1   MMQQKISGCSIQVNPNLESRVKMLKRQYSAIAEMLCPGCSRFGWNAERKCTDCEPKIFDA 60
           MM +K+SGC ++    ++ R+K LKR + AIAEML P CS FGWN E KC   E ++FD 
Sbjct: 368 MMAEKLSGCQVRATTVIDCRIKTLKRTFQAIAEMLGPACSGFGWNDEEKCIVAEKELFDN 427

Query: 61  WVKSHPSAKGLRHKSFPFYDDLAIVSSKDRATGSRVTTTAEVGS 105
           WV+S P+AKGL +  FP+YD+L  V  +DRATG    T A+VGS
Sbjct: 428 WVRSPPAAKGLLNNPFPYYDELTYVFGRDRATGRFAETFADVGS 471

BLAST of Cla011607 vs. TrEMBL
Match: A0A161XV48_DAUCA (Uncharacterized protein OS=Daucus carota subsp. sativus GN=DCAR_018898 PE=4 SV=1)

HSP 1 Score: 116.7 bits (291), Expect = 2.6e-23
Identity = 55/112 (49.11%), Positives = 75/112 (66.96%), Query Frame = 1

Query: 1   MMQQKISGCSIQVNPNLESRVKMLKRQYSAIAEMLCPGCSRFGWNAERKCTDCEPKIFDA 60
           +M+    GC ++  P++ESRV++ ++QY AI EM  P CS FGWN   K   CE  IF+ 
Sbjct: 60  IMEDIQPGCGMKARPHIESRVRLWRKQYFAIEEMRGPNCSGFGWNELDKSITCEKSIFED 119

Query: 61  WVKSHPSAKGLRHKSFPFYDDLAIVSSKDRATGSRVTTTAEVGSEPVVEEKN 113
           W+KSHP+AKGLR+KSFP+YD+L+ V  KDRA G  V + A+   E   EE+N
Sbjct: 120 WLKSHPNAKGLRNKSFPYYDELSQVFGKDRANGECVESPADAVEEIANEEEN 171

BLAST of Cla011607 vs. TrEMBL
Match: A0A0J8BIR4_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_1g018220 PE=4 SV=1)

HSP 1 Score: 114.8 bits (286), Expect = 1.0e-22
Identity = 65/162 (40.12%), Positives = 90/162 (55.56%), Query Frame = 1

Query: 1   MMQQKISGCSIQVNPNLESRVKMLKRQYSAIAEMLCPGCSRFGWNAERKCTDCEPKIFDA 60
           +M QK+ GC  +  P++ESRVK L++QY AI EML P  S FGWN E K   C   ++D 
Sbjct: 58  IMLQKLPGCEKKAKPHIESRVKHLRKQYDAITEMLSPSASGFGWNDEEKFVTCPQAVWDE 117

Query: 61  WVKSHPSAKGLRHKSFPFYDDLAIVSSKDRATGSRVTTTAEVGSEPVVEEKNEDILNNQS 120
           W+KSH +A GLR+K FPFY++L  +  KDRA G+   T  +V  E          + + +
Sbjct: 118 WIKSHKNAAGLRNKPFPFYEELGKIWGKDRAVGNESGTVYDVLQE----------MEHGA 177

Query: 121 PYFEIFYIPD--PPFASSPTSEDLPTTPVGRGAGSSMPTRSR 161
              E   +PD     ++SPT  D PT P      S+ P+ SR
Sbjct: 178 RVEEEHQVPDLNAEESNSPTQCD-PTGPPSSTPQSTTPSSSR 208

BLAST of Cla011607 vs. TrEMBL
Match: A0A0K9RVV5_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_028730 PE=4 SV=1)

HSP 1 Score: 112.8 bits (281), Expect = 3.8e-22
Identity = 62/158 (39.24%), Positives = 88/158 (55.70%), Query Frame = 1

Query: 1   MMQQKISGCSIQVNPNLESRVKMLKRQYSAIAEMLCPGCSRFGWNAERKCTDCEPKIFDA 60
           +M++K+  C  +  P++ESRVK+L++QY AI EML P  S FGWN E K   C   ++D 
Sbjct: 63  LMKEKLPECDKKAKPHIESRVKLLRKQYDAIPEMLSPSASGFGWNDEGKFVTCPQSVWDE 122

Query: 61  WVKSHPSAKGLRHKSFPFYDDLAIVSSKDRATGSRVTTTAEVGSEPVVEEKNEDILNNQS 120
           W+KSH +A GLR+K FPF+DDL  +  KDR  G+  T   +V  E + EE+ E       
Sbjct: 123 WIKSHKNAAGLRNKPFPFFDDLGKIFGKDRDVGNEATNVYDV-LEEMDEEEQE------- 182

Query: 121 PYFEIFYIPDPPFASSPTSEDLPTTPVGRGAG-SSMPT 158
                  +P+       + E +  TP+G   G SS PT
Sbjct: 183 -------VPE-------SIESINLTPIGNPTGHSSSPT 198

BLAST of Cla011607 vs. NCBI nr
Match: gi|659111294|ref|XP_008455678.1| (PREDICTED: uncharacterized protein At2g29880-like [Cucumis melo])

HSP 1 Score: 150.6 bits (379), Expect = 2.4e-33
Identity = 77/164 (46.95%), Positives = 107/164 (65.24%), Query Frame = 1

Query: 1   MMQQKISGCSIQVNPNLESRVKMLKRQYSAIAEMLCPGCSRFGWNAERKCTDCEPKIFDA 60
           +M++KI   +IQV  NLESRVK LK+QY+AIA+M+ P CSRFGWN ERKC + E  +FD 
Sbjct: 84  LMKEKIPRSNIQVTLNLESRVKFLKKQYTAIAKMMGPACSRFGWNEERKCIEAEKSVFDD 143

Query: 61  WVKSHPSAKGLRHKSFPFYDDLAIVSSKDRATGSRVTTTAEVGSEPVVEEKNEDILNNQS 120
           WVK HP+A+GL +K F ++ DL IV  +D+ATG R     E+ S+   + + +D+  N  
Sbjct: 144 WVKGHPNARGLLNKPFAYFYDLEIVFGRDKATGGRCKPFVEMASQTARDTEEDDMDIN-- 203

Query: 121 PYFEIFYIPDPPFASSPTSEDLPTTPVG--RGAGSSMPTRSRRS 163
              E F IP+P     P+ ED+P+T +     AGSS P++ RRS
Sbjct: 204 --LEDFDIPNPHGLEPPSGEDMPSTLISMTHDAGSSRPSKKRRS 243

BLAST of Cla011607 vs. NCBI nr
Match: gi|720026936|ref|XP_010264414.1| (PREDICTED: uncharacterized protein LOC104602427 [Nelumbo nucifera])

HSP 1 Score: 131.7 bits (330), Expect = 1.1e-27
Identity = 65/148 (43.92%), Positives = 89/148 (60.14%), Query Frame = 1

Query: 2   MQQKISGCSIQVNPNLESRVKMLKRQYSAIAEMLCPGCSRFGWNAERKCTDCEPKIFDAW 61
           M+  + GC ++ NP++ESRVK +KRQY+AI EML P CS FGW+  +KC  C+ ++F+ W
Sbjct: 58  METNLPGCGLKANPHIESRVKHMKRQYAAICEMLSPSCSGFGWDDVKKCITCKDEVFNGW 117

Query: 62  VKSHPSAKGLRHKSFPFYDDLAIVSSKDRATGSRVTTTAEVGSEPVVEEKNEDILNNQSP 121
           VKSHP AKGLR+K FPF+D L+I+  +DRATG  V   A+       EE N   +  +S 
Sbjct: 118 VKSHPHAKGLRNKPFPFFDGLSIIFGRDRATGESVEAPADAAENVEREEYNNPPIVGES- 177

Query: 122 YFEIFYIPDPPFASSPTSEDLPTTPVGR 150
                  P P  AS  +  +    PV R
Sbjct: 178 INGASSAPHPHRASKRSRSNTDDDPVAR 204

BLAST of Cla011607 vs. NCBI nr
Match: gi|672198585|ref|XP_008777372.1| (PREDICTED: uncharacterized protein LOC103697309 [Phoenix dactylifera])

HSP 1 Score: 131.7 bits (330), Expect = 1.1e-27
Identity = 68/170 (40.00%), Positives = 97/170 (57.06%), Query Frame = 1

Query: 1   MMQQKISGCSIQVNPNLESRVKMLKRQYSAIAEMLCPGCSRFGWNAERKCTDCEPKIFDA 60
           MM++K+ GC ++ N ++E+ VK+LK+QY+AIAEM  P CS FGWN   KC   +  ++D 
Sbjct: 65  MMEKKLPGCGLRGNSHIENHVKLLKKQYNAIAEMFGPNCSGFGWNDREKCVVADKDVYDL 124

Query: 61  WVKSHPSAKGLRHKSFPFYDDLAIVSSKDRATGSRVTTTAEVGSEPVVEEKNEDILNNQS 120
           WVKSHP A GLR+K FP+Y+DL+IV  KDRA G  V   A+   +    EK E+ L +  
Sbjct: 125 WVKSHPHAAGLRNKPFPYYEDLSIVFGKDRANGEGVEDPADACEQ---IEKEEEALGDTM 184

Query: 121 PYFEIF--------YIPDPPFASSPTSEDLPTTPVGRGAGSSMPTRSRRS 163
              +           I  PPF   PTS    T  +G+  GS +  + + +
Sbjct: 185 SLGDDMDAEGGSPNAIDAPPFICQPTSVGTSTATIGKKKGSLVSKKRKHN 231

BLAST of Cla011607 vs. NCBI nr
Match: gi|659111565|ref|XP_008455793.1| (PREDICTED: uncharacterized protein At2g29880-like [Cucumis melo])

HSP 1 Score: 127.5 bits (319), Expect = 2.1e-26
Identity = 63/123 (51.22%), Positives = 80/123 (65.04%), Query Frame = 1

Query: 2   MQQKISGCSIQVNPNLESRVKMLKRQYSAIAEMLCPGCSRFGWNAERKCTDCEPKIFDAW 61
           M +K+ G +IQ +P ++ RVK LK+ Y AIAEM  P CS FGWN E +C   E  +FD+W
Sbjct: 1   MAEKLPGTNIQASPTIDCRVKSLKKTYHAIAEMRGPSCSGFGWNEEFQCIIVERDLFDSW 60

Query: 62  VKSHPSAKGLRHKSFPFYDDLAIVSSKDRATGSRVTTTAEVGSE---------PVVEEKN 116
           VKSHP+ KGL HKSFP+YDDL  V  KDRATG+R  T  +VGS          P+ +  +
Sbjct: 61  VKSHPATKGLLHKSFPYYDDLTYVFGKDRATGARSETFVDVGSNVPNMFNDTIPLGDSHD 120

BLAST of Cla011607 vs. NCBI nr
Match: gi|672208170|ref|XP_008780089.1| (PREDICTED: uncharacterized protein LOC103699870, partial [Phoenix dactylifera])

HSP 1 Score: 124.8 bits (312), Expect = 1.4e-25
Identity = 56/111 (50.45%), Positives = 76/111 (68.47%), Query Frame = 1

Query: 1   MMQQKISGCSIQVNPNLESRVKMLKRQYSAIAEMLCPGCSRFGWNAERKCTDCEPKIFDA 60
           MM++K+ GC ++ NP +E+ VK+LK+QY+AIAEML P CSRFGWN   KC   +  ++D 
Sbjct: 65  MMEKKLLGCGLRGNPYIENHVKLLKKQYNAIAEMLGPNCSRFGWNDREKCVVVDKDVYDL 124

Query: 61  WVKSHPSAKGLRHKSFPFYDDLAIVSSKDRATGSRVTTTAEVGSEPVVEEK 112
           WVKSHP   GLR+K FP+Y+DL+IV  KDRA G      A+   +   EE+
Sbjct: 125 WVKSHPHVAGLRNKPFPYYEDLSIVFGKDRANGEGAEDRADACEQIEKEEE 175

The following BLAST results are available for this feature:
Match Name        E-value  Identity (%)  Description
A0A162AHN3_DAUCA  3.1e-24  49.11         Uncharacterized protein OS=Daucus carota subsp. sativus GN=DCAR_009785 PE=4 SV=1
E5GBB2_CUCME      1.2e-23  51.92         Retrotransposon protein OS=Cucumis melo subsp. melo PE=4 SV=1
A0A161XV48_DAUCA  2.6e-23  49.11         Uncharacterized protein OS=Daucus carota subsp. sativus GN=DCAR_018898 PE=4 SV=1
A0A0J8BIR4_BETVU  1.0e-22  40.12         Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_1g018220 PE=4 S...
A0A0K9RVV5_SPIOL  3.8e-22  39.24         Uncharacterized protein OS=Spinacia oleracea GN=SOVF_028730 PE=4 SV=1

Match Name                        E-value  Identity (%)  Description
gi|659111294|ref|XP_008455678.1|  2.4e-33  46.95         PREDICTED: uncharacterized protein At2g29880-like [Cucumis melo]
gi|720026936|ref|XP_010264414.1|  1.1e-27  43.92         PREDICTED: uncharacterized protein LOC104602427 [Nelumbo nucifera]
gi|672198585|ref|XP_008777372.1|  1.1e-27  40.00         PREDICTED: uncharacterized protein LOC103697309 [Phoenix dactylifera]
gi|659111565|ref|XP_008455793.1|  2.1e-26  51.22         PREDICTED: uncharacterized protein At2g29880-like [Cucumis melo]
gi|672208170|ref|XP_008780089.1|  1.4e-25  50.45         PREDICTED: uncharacterized protein LOC103699870, partial [Phoenix dactylifera]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla011607     Cla011607.1  mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term  IPR Description   Source   Source Term     Source Description   Alignment
None      No IPR available  PANTHER  PTHR31704       FAMILY NOT NAMED     coord: 1..115, score: 4.9
None      No IPR available  PANTHER  PTHR31704:SF16  SUBFAMILY NOT NAMED  coord: 1..115, score: 4.9