Cla011047 (gene) Watermelon (97103) v1

NameCla011047
Typegene
OrganismCitrullus. lanatus (Watermelon (97103) v1)
DescriptionRetrotransposon protein (AHRD V1 ***- E5GCB5_CUCME)
LocationChr1 : 16402060 .. 16402699 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTGGGCCTAGGCTATAGTGGGTTTGGTTGGAACGCGGAGCGCAAATGTATTGACTGTGAGGCGAAGATATTTGACACATGTGTCAAGGTAATTTCATTACTTGTTTTATTTATTTCTTCTTCTTTTAAATTTGTCTTGGCCATAACATAACACATTTCACCTTTAACAGAGTCATCCGAGTGCAAAAGAACTACGCCATAAGTCATTTCCGTACTATGACGACTTGGCCATCGTATTCGGCAAAGACAGAGCCACAGGGAGTCATGCAACCACCACTGCAAAGGTTGGATTTGAACCTGTTATGGAAGAGGAAAACGAGGACATCCTGAATAACCAATCCCTAGACTTTGAGAACTTCTATATTCCTGATCCACCTTTTGGCAGCTCGCCCATGTCAGACGACATTCCAACTACCCCCAGCGGTAGAGGGTCTGAGAGTAGCTTGCCATCAAGGAGTAGGAGGTCCCGAATTTCCTCGATTGGAGAGTACAACGAGGTGGTTCGTGAGGGATTCCAACTTCTGACGAAGTCCATTGACGACATTGCACAGTGGCTTGTCATGAACAAGGACCTGGCAAGGCGTCGTTGTCGAGAACTATACGCTGAGCTACAATCCATTCCTGGTCTGTCGGTATAG

mRNA sequence

ATGTTGGGCCTAGGCTATAGTGGGTTTGGTTGGAACGCGGAGCGCAAATGTATTGACTGTGAGGCGAAGATATTTGACACATGTGTCAAGAGTCATCCGAGTGCAAAAGAACTACGCCATAAGTCATTTCCGTACTATGACGACTTGGCCATCGTATTCGGCAAAGACAGAGCCACAGGGAGTCATGCAACCACCACTGCAAAGGTTGGATTTGAACCTGTTATGGAAGAGGAAAACGAGGACATCCTGAATAACCAATCCCTAGACTTTGAGAACTTCTATATTCCTGATCCACCTTTTGGCAGCTCGCCCATGTCAGACGACATTCCAACTACCCCCAGCGGTAGAGGGTCTGAGAGTAGCTTGCCATCAAGGAGTAGGAGGTCCCGAATTTCCTCGATTGGAGAGTACAACGAGGTGGTTCGTGAGGGATTCCAACTTCTGACGAAGTCCATTGACGACATTGCACAGTGGCTTGTCATGAACAAGGACCTGGCAAGGCGTCGTTGTCGAGAACTATACGCTGAGCTACAATCCATTCCTGGTCTGTCGGTATAG

Coding sequence (CDS)

ATGTTGGGCCTAGGCTATAGTGGGTTTGGTTGGAACGCGGAGCGCAAATGTATTGACTGTGAGGCGAAGATATTTGACACATGTGTCAAGAGTCATCCGAGTGCAAAAGAACTACGCCATAAGTCATTTCCGTACTATGACGACTTGGCCATCGTATTCGGCAAAGACAGAGCCACAGGGAGTCATGCAACCACCACTGCAAAGGTTGGATTTGAACCTGTTATGGAAGAGGAAAACGAGGACATCCTGAATAACCAATCCCTAGACTTTGAGAACTTCTATATTCCTGATCCACCTTTTGGCAGCTCGCCCATGTCAGACGACATTCCAACTACCCCCAGCGGTAGAGGGTCTGAGAGTAGCTTGCCATCAAGGAGTAGGAGGTCCCGAATTTCCTCGATTGGAGAGTACAACGAGGTGGTTCGTGAGGGATTCCAACTTCTGACGAAGTCCATTGACGACATTGCACAGTGGCTTGTCATGAACAAGGACCTGGCAAGGCGTCGTTGTCGAGAACTATACGCTGAGCTACAATCCATTCCTGGTCTGTCGGTATAG

Protein sequence

MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGSHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFGSSPMSDDIPTTPSGRGSESSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWLVMNKDLARRRCRELYAELQSIPGLSV
BLAST of Cla011047 vs. TrEMBL
Match: E5GCB5_CUCME (Retrotransposon protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 104.4 bits (259), Expect = 1.5e-19
Identity = 67/188 (35.64%), Postives = 102/188 (54.26%), Query Frame = 1

Query: 1   MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATG 60
           M G   SGFGWN E+KCI  E ++FD    SHP+AK L +KSF +YD+L+ VFGKDRATG
Sbjct: 438 MRGPNCSGFGWNDEKKCIVAEKEVFDDW--SHPAAKGLLNKSFVHYDELSYVFGKDRATG 497

Query: 61  SHATTTAKVGFE--PVMEEENEDILNNQSLDFENFYIPDPPFGSSPMSDDIPTTPSGRGS 120
             A + A +G    P  +    D +     DF   Y P    G +   DD+  T + R S
Sbjct: 498 GRAESFADIGSNDPPGYDAGAADAM--PDTDFPPMYSP----GLNMSPDDLMETRTARVS 557

Query: 121 E-SSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWLVMNKDLARRRCRELYAEL 180
           E  ++ S S+R R     +  ++VR   +   + +  IA+W ++ +  A +  +E+   L
Sbjct: 558 ERRNVSSGSKRKRPGHATDSGDIVRTAIEYGNEQLHRIAEWPILQRQDATQTRQEIVRHL 617

Query: 181 QSIPGLSV 186
           ++IP L++
Sbjct: 618 EAIPELTL 617

BLAST of Cla011047 vs. TrEMBL
Match: E5GBB2_CUCME (Retrotransposon protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 94.7 bits (234), Expect = 1.2e-16
Identity = 61/185 (32.97%), Postives = 88/185 (47.57%), Query Frame = 1

Query: 1   MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATG 60
           MLG   SGFGWN E KCI  E ++FD  V+S P+AK L +  FPYYD+L  VFG+DRATG
Sbjct: 401 MLGPACSGFGWNDEEKCIVAEKELFDNWVRSPPAAKGLLNNPFPYYDELTYVFGRDRATG 460

Query: 61  SHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFGSSPMSDDIPTTPSGRGSES 120
             A T A VG        +   + + + DF   Y      G   + DD+  +   R SE 
Sbjct: 461 RFAETFADVGSNEPGGGYDRFDMGDGNEDFPPVY----SRGVDILQDDVRASRPSRASEG 520

Query: 121 SLPSRSRRSRISSIGEYN-EVVREGFQLLTKSIDDIAQWLVMNKDLARRRCRELYAELQS 180
              S   + +  S  +++ E +        + +  IA+W   N         E +  L+ 
Sbjct: 521 KTGSSGSKRKRGSQRDFDVEAIHLALDQTNEQLRQIAEWPARNLANDNHVRTEFFRILRE 580

Query: 181 IPGLS 185
           +P L+
Sbjct: 581 MPELT 581

BLAST of Cla011047 vs. TrEMBL
Match: A0A0A0M0S6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G665930 PE=4 SV=1)

HSP 1 Score: 91.3 bits (225), Expect = 1.4e-15
Identity = 60/185 (32.43%), Postives = 96/185 (51.89%), Query Frame = 1

Query: 1   MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATG 60
           +L   Y+ FGWN ERKCI  E  +FD  VK H +A+ L +KSF Y+ DL IV G+DR  G
Sbjct: 59  LLEATYNRFGWNEERKCIKVEKSMFDDWVKEHHNARGLLNKSFSYFYDLQIVIGRDRTIG 118

Query: 61  SHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFGSSPMSDDIPTTPSGRGSES 120
               T  ++  +   + E +DI     ++ E+F IP P     P   ++ +TP+    ++
Sbjct: 119 DRCKTPVEMDPQTTKDIEKDDI----GINLEDFDIPKPHGLELPSVKNMSSTPTSMILDA 178

Query: 121 SLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWLVMNKDLARRRCRELYAELQSI 180
               +SR+ R  S   +   +RE F+ + K +    + + +   L     ++LY ELQ+I
Sbjct: 179 RSYRQSRKRRSYSC-TFCASMRETFKEIGKIVAMEERKMKIESSLH----KQLYVELQTI 234

Query: 181 PGLSV 186
            G+ V
Sbjct: 239 HGMDV 234

BLAST of Cla011047 vs. TrEMBL
Match: A0A0J8BIR4_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_1g018220 PE=4 SV=1)

HSP 1 Score: 86.3 bits (212), Expect = 4.3e-14
Identity = 64/192 (33.33%), Postives = 93/192 (48.44%), Query Frame = 1

Query: 1   MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATG 60
           ML    SGFGWN E K + C   ++D  +KSH +A  LR+K FP+Y++L  ++GKDRA G
Sbjct: 91  MLSPSASGFGWNDEEKFVTCPQAVWDEWIKSHKNAAGLRNKPFPFYEELGKIWGKDRAVG 150

Query: 61  SHATTTAKVGFE-----PVMEEENEDILNNQSLDFENFYIPDPPFGSSPMSDDIPTTPSG 120
           + + T   V  E      V EE     LN +  +      P  P  S+P S    TTPS 
Sbjct: 151 NESGTVYDVLQEMEHGARVEEEHQVPDLNAEESNSPTQCDPTGPPSSTPQS----TTPS- 210

Query: 121 RGSESSLPSRSRRSRISSIGEYNE---VVREGFQLLTKSIDDIAQWLVMNKDLARRRCRE 180
               SS   R+R   I ++ E++     + +  +  T+ I  +A       D A RR + 
Sbjct: 211 ----SSRTKRARTETIEALKEFSTKIGKISDVMEAATEHIGRLANCFQHESDSADRRMK- 270

Query: 181 LYAELQSIPGLS 185
           L  E+  + GLS
Sbjct: 271 LTTEIMKMEGLS 272

BLAST of Cla011047 vs. TrEMBL
Match: A0A161XV48_DAUCA (Uncharacterized protein OS=Daucus carota subsp. sativus GN=DCAR_018898 PE=4 SV=1)

HSP 1 Score: 85.9 bits (211), Expect = 5.7e-14
Identity = 44/79 (55.70%), Postives = 51/79 (64.56%), Query Frame = 1

Query: 1   MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATG 60
           M G   SGFGWN   K I CE  IF+  +KSHP+AK LR+KSFPYYD+L+ VFGKDRA G
Sbjct: 93  MRGPNCSGFGWNELDKSITCEKSIFEDWLKSHPNAKGLRNKSFPYYDELSQVFGKDRANG 152

Query: 61  SHATTTAKVGFEPVMEEEN 80
               + A    E   EEEN
Sbjct: 153 ECVESPADAVEEIANEEEN 171

BLAST of Cla011047 vs. NCBI nr
Match: gi|659071532|ref|XP_008460440.1| (PREDICTED: uncharacterized protein LOC103499248 [Cucumis melo])

HSP 1 Score: 125.9 bits (315), Expect = 7.1e-26
Identity = 69/186 (37.10%), Postives = 105/186 (56.45%), Query Frame = 1

Query: 2   LGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATGS 61
           +G   S FGWN E+KCI+ +  +FD  VK HP+A+ L +KSFPY+ DL I+FG+DRATG 
Sbjct: 1   MGPACSRFGWNEEQKCIEAQKSVFDDWVKGHPNARGLLNKSFPYFYDLKIMFGRDRATGG 60

Query: 62  HATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFGSSPMSDDIPTTPSGRGSE-- 121
              T  ++G +   + E +D+     ++ E+F IP+P     P  +D+ +TP+    +  
Sbjct: 61  RCKTPIEMGLQIARDTEEDDM----DINLEDFDIPNPHGLEPPSGEDMSSTPTSMAHDVG 120

Query: 122 SSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWLVMNKDLARRRCRELYAELQS 181
           SS PS+ RRS    +    +  R   +  +K I  IA W     ++     + LYAELQ+
Sbjct: 121 SSKPSKKRRSYSEDL---MDTFRASMRETSKEIRKIAAWQREKMEIESSIHKRLYAELQT 179

Query: 182 IPGLSV 186
           IPG+ V
Sbjct: 181 IPGMDV 179

BLAST of Cla011047 vs. NCBI nr
Match: gi|659125959|ref|XP_008462939.1| (PREDICTED: uncharacterized protein At2g29880-like [Cucumis melo])

HSP 1 Score: 113.2 bits (282), Expect = 4.8e-22
Identity = 53/143 (37.06%), Postives = 84/143 (58.74%), Query Frame = 1

Query: 1   MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATG 60
           M+G   SGFGWN ERKCI+ E  +FD  VK HP+A++L +K FPY+ DL IVFG+D ATG
Sbjct: 55  MMGQACSGFGWNEERKCIEAEKSVFDDWVKGHPNARDLLNKPFPYFYDLKIVFGRDMATG 114

Query: 61  SHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFGSSPMSDDIPTTPSGRGSES 120
               T  ++G +   + E +D+     ++ E+F IP+P     P  +D+P+TP+    ++
Sbjct: 115 DRCKTPVEMGSQTARDTEEDDM----DINLEDFDIPNPHGLEPPSGEDMPSTPTSMAHDA 174

Query: 121 SLPSRSRRSRISSIGEYNEVVRE 144
                S++  +   G +  + R+
Sbjct: 175 GSSRPSKKKTVIFRGPHGHISRK 193

BLAST of Cla011047 vs. NCBI nr
Match: gi|659125386|ref|XP_008462659.1| (PREDICTED: uncharacterized protein LOC103500963 [Cucumis melo])

HSP 1 Score: 109.4 bits (272), Expect = 6.9e-21
Identity = 61/161 (37.89%), Postives = 92/161 (57.14%), Query Frame = 1

Query: 1   MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATG 60
           M+   YS FGWN ERKCI+ E  +FD  VK HP+A+ L +K+FPY+ DL +VFG+DRAT 
Sbjct: 1   MMRPAYSRFGWNEERKCIEAEKSVFDDWVKGHPNARGLLNKAFPYFYDLEVVFGRDRATI 60

Query: 61  SHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFGSSPMSDDIPTTPSGRGSE- 120
               T  ++G +   + E +D+     ++ E+F IP+P     P  +D+ +TP+    + 
Sbjct: 61  GRCKTPVQMGSQIAKDTEEDDM----DINLEDFDIPNPHELEPPSREDMTSTPTSMAHDA 120

Query: 121 -SSLPSRSRRSRISS-IGEYNEVVREGFQLLTKSIDDIAQW 159
            SS PS+ RRS     +  ++  +RE     +K I  IA W
Sbjct: 121 GSSRPSKKRRSYSGDLVNTFHASMRE----TSKEIGKIAAW 153

BLAST of Cla011047 vs. NCBI nr
Match: gi|659111294|ref|XP_008455678.1| (PREDICTED: uncharacterized protein At2g29880-like [Cucumis melo])

HSP 1 Score: 105.9 bits (263), Expect = 7.6e-20
Identity = 60/160 (37.50%), Postives = 87/160 (54.37%), Query Frame = 1

Query: 1   MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATG 60
           M+G   S FGWN ERKCI+ E  +FD  VK HP+A+ L +K F Y+ DL IVFG+D+ATG
Sbjct: 117 MMGPACSRFGWNEERKCIEAEKSVFDDWVKGHPNARGLLNKPFAYFYDLEIVFGRDKATG 176

Query: 61  SHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPFGSSPMSDDIPTT--PSGRGS 120
                  ++  +   + E +D+     ++ E+F IP+P     P  +D+P+T       +
Sbjct: 177 GRCKPFVEMASQTARDTEEDDM----DINLEDFDIPNPHGLEPPSGEDMPSTLISMTHDA 236

Query: 121 ESSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQW 159
            SS PS+ RR   S  G+  +  R   Q  +K I  IA W
Sbjct: 237 GSSRPSKKRR---SYPGDLMDTFRASMQETSKEIGKIAAW 269

BLAST of Cla011047 vs. NCBI nr
Match: gi|659082641|ref|XP_008441954.1| (PREDICTED: uncharacterized protein At2g29880-like [Cucumis melo])

HSP 1 Score: 105.1 bits (261), Expect = 1.3e-19
Identity = 69/188 (36.70%), Postives = 97/188 (51.60%), Query Frame = 1

Query: 1   MLGLGYSGFGWNAERKCIDCEAKIFDTCVKSHPSAKELRHKSFPYYDDLAIVFGKDRATG 60
           M G   SGFGWN E +CI  E  +FD+ +KSHP+AK L HKSFPYYDDL+ VFGKDRATG
Sbjct: 84  MRGPSCSGFGWNEEFQCIIAERDLFDSWIKSHPAAKGLLHKSFPYYDDLSYVFGKDRATG 143

Query: 61  SHATTTAKVGFEPVMEEENEDILNNQSLDFENFYIPDPPF----GSSPMSDDIPTTPSGR 120
           + + T   VG        N   + N ++   + +  D P     G     D++    +G+
Sbjct: 144 ARSETFPNVG-------SNVSNMFNDTIPLGDSHDEDIPTMYSQGVHMSPDEMFGIRAGQ 203

Query: 121 GSE-SSLPSRSRRSRISSIGEYNEVVREGFQLLTKSIDDIAQWLVMNKDLARRRCRELYA 180
            SE  +  S S+R R S   E  EV+R   +   + +  IA W    + +      ++  
Sbjct: 204 ASERRNCSSVSKRKRGSERYETVEVIRSVMEFGNEQLKAIADWPKEKRAMEVEMRAQVVK 263

Query: 181 ELQSIPGL 184
           +LQ IP L
Sbjct: 264 QLQDIPKL 264

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
E5GCB5_CUCME1.5e-1935.64Retrotransposon protein OS=Cucumis melo subsp. melo PE=4 SV=1[more]
E5GBB2_CUCME1.2e-1632.97Retrotransposon protein OS=Cucumis melo subsp. melo PE=4 SV=1[more]
A0A0A0M0S6_CUCSA1.4e-1532.43Uncharacterized protein OS=Cucumis sativus GN=Csa_1G665930 PE=4 SV=1[more]
A0A0J8BIR4_BETVU4.3e-1433.33Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_1g018220 PE=4 S... [more]
A0A161XV48_DAUCA5.7e-1455.70Uncharacterized protein OS=Daucus carota subsp. sativus GN=DCAR_018898 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
gi|659071532|ref|XP_008460440.1|7.1e-2637.10PREDICTED: uncharacterized protein LOC103499248 [Cucumis melo][more]
gi|659125959|ref|XP_008462939.1|4.8e-2237.06PREDICTED: uncharacterized protein At2g29880-like [Cucumis melo][more]
gi|659125386|ref|XP_008462659.1|6.9e-2137.89PREDICTED: uncharacterized protein LOC103500963 [Cucumis melo][more]
gi|659111294|ref|XP_008455678.1|7.6e-2037.50PREDICTED: uncharacterized protein At2g29880-like [Cucumis melo][more]
gi|659082641|ref|XP_008441954.1|1.3e-1936.70PREDICTED: uncharacterized protein At2g29880-like [Cucumis melo][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla011047Cla011047.1mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR31704FAMILY NOT NAMEDcoord: 4..184
score: 8.3

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Cla011047Cla97C01G011270Watermelon (97103) v2wmwmbB246
The following gene(s) are paralogous to this gene:

None