Cla003509 (gene) Watermelon (97103) v1

Name: Cla003509
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Gag/pol protein (Fragment) (AHRD V1 ***- E2GK51_BRYDI)
Location: Chr3 : 13098515 .. 13099123 (-)
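
The location string can be turned into a feature length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF-style annotation); the function names `parse_location` and `span_length` are hypothetical and not part of any database API. Under that assumption the span is 609 bp, which is consistent with a 202-residue protein plus one stop codon.

```python
import re


def parse_location(text):
    """Split a string like 'Chr3 : 13098515 .. 13099123 (-)' into its parts."""
    m = re.search(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("Chr3 : 13098515 .. 13099123 (-)")
print(chrom, strand, span_length(start, end))
```

The `(-)` suffix marks the reverse strand, so the sequences listed on this page would be the reverse complement of the reference at those coordinates.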
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAATGGTGATAACTACACTTCGTGGAAAAATACGATTAACACTGTACTCATCGTGGAGGACCTGGAGTTTGCAATCACTGAGGAGTGTCCTCAAGTTCTTGCTCAGAATGCATCAAAAAATATTTGTGATGCATATGAGAAATGGATGAGGGCAAACGAAAAGGCCAGAGCATATATTATTGTCAGCTTATCTGAAGTTTTGGCAAAGAAGCATGAAACCATGGCCACTGCTTGTGAGATCATGGATTCATTACAAGAGATGTTTGGACAATCGTCCTCAAAGATCAGGCATGATGCTCTGAAATTCATTTATAATGTATGTATGAACGAGGGGTCATCAGTGCGAGAACATGTTCTCAATATGATAGTCCACTTCAATGTGACTGAAATGAATGGGGCAATTATCGATGAGGCCAGTCAACTTATTTTTATTATGGCATCTCTACTAGAGAGTTTCCTTCAGTTCAGGACCAATGCTGTTATGAACAAACTGAACTACTCTCTTACAATCCCCTTTAACGAGCTGCAGTCTTATGAGTCCATGCATAAAAGCAAAAACCAAATGGGGAGGCAAATGTTGCCAGTTTCTCTAAAAGGTTCAACATAG

mRNA sequence

ATGAATGGTGATAACTACACTTCGTGGAAAAATACGATTAACACTGTACTCATCGTGGAGGACCTGGAGTTTGCAATCACTGAGGAGTGTCCTCAAGTTCTTGCTCAGAATGCATCAAAAAATATTTGTGATGCATATGAGAAATGGATGAGGGCAAACGAAAAGGCCAGAGCATATATTATTGTCAGCTTATCTGAAGTTTTGGCAAAGAAGCATGAAACCATGGCCACTGCTTGTGAGATCATGGATTCATTACAAGAGATGTTTGGACAATCGTCCTCAAAGATCAGGCATGATGCTCTGAAATTCATTTATAATGTATGTATGAACGAGGGGTCATCAGTGCGAGAACATGTTCTCAATATGATAGTCCACTTCAATGTGACTGAAATGAATGGGGCAATTATCGATGAGGCCAGTCAACTTATTTTTATTATGGCATCTCTACTAGAGAGTTTCCTTCAGTTCAGGACCAATGCTGTTATGAACAAACTGAACTACTCTCTTACAATCCCCTTTAACGAGCTGCAGTCTTATGAGTCCATGCATAAAAGCAAAAACCAAATGGGGAGGCAAATGTTGCCAGTTTCTCTAAAAGGTTCAACATAG

Coding sequence (CDS)

ATGAATGGTGATAACTACACTTCGTGGAAAAATACGATTAACACTGTACTCATCGTGGAGGACCTGGAGTTTGCAATCACTGAGGAGTGTCCTCAAGTTCTTGCTCAGAATGCATCAAAAAATATTTGTGATGCATATGAGAAATGGATGAGGGCAAACGAAAAGGCCAGAGCATATATTATTGTCAGCTTATCTGAAGTTTTGGCAAAGAAGCATGAAACCATGGCCACTGCTTGTGAGATCATGGATTCATTACAAGAGATGTTTGGACAATCGTCCTCAAAGATCAGGCATGATGCTCTGAAATTCATTTATAATGTATGTATGAACGAGGGGTCATCAGTGCGAGAACATGTTCTCAATATGATAGTCCACTTCAATGTGACTGAAATGAATGGGGCAATTATCGATGAGGCCAGTCAACTTATTTTTATTATGGCATCTCTACTAGAGAGTTTCCTTCAGTTCAGGACCAATGCTGTTATGAACAAACTGAACTACTCTCTTACAATCCCCTTTAACGAGCTGCAGTCTTATGAGTCCATGCATAAAAGCAAAAACCAAATGGGGAGGCAAATGTTGCCAGTTTCTCTAAAAGGTTCAACATAG

Protein sequence

MNGDNYTSWKNTINTVLIVEDLEFAITEECPQVLAQNASKNICDAYEKWMRANEKARAYIIVSLSEVLAKKHETMATACEIMDSLQEMFGQSSSKIRHDALKFIYNVCMNEGSSVREHVLNMIVHFNVTEMNGAIIDEASQLIFIMASLLESFLQFRTNAVMNKLNYSLTIPFNELQSYESMHKSKNQMGRQMLPVSLKGST
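
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); `translate` and `CODON_TABLE` are illustrative names, not part of the database. It is applied here only to the first and last few codons of the CDS.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, to_stop=True):
    """Translate a DNA coding sequence; unknown codons become 'X'."""
    cds = cds.upper().replace("U", "T")
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAATGGTGATAACTAC"))  # first six codons of the CDS
print(translate("GGTTCAACATAG"))        # last four codons, including the stop
```

Passing the full CDS string to `translate` should reproduce the protein sequence listed above if the annotation is internally consistent.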
BLAST of Cla003509 vs. TrEMBL
Match: E2GK51_BRYDI (Gag/pol protein (Fragment) OS=Bryonia dioica PE=4 SV=1)

HSP 1 Score: 226.9 bits (577), Expect = 2.3e-56
Identity = 102/188 (54.26%), Positives = 152/188 (80.85%), Query Frame = 1

Query: 1   MNGDNYTSWKNTINTVLIVEDLEFAITEECPQVLAQNASKNICDAYEKWMRANEKARAYI 60
           +NGDNY++WK+ +NT+L+V+DL F +TEECPQ  A NA++ + +AY++W++AN+KAR YI
Sbjct: 14  LNGDNYSAWKSNLNTILVVDDLRFVLTEECPQAPALNANRTVREAYDRWVKANDKARVYI 73

Query: 61  IVSLSEVLAKKHETMATACEIMDSLQEMFGQSSSKIRHDALKFIYNVCMNEGSSVREHVL 120
           + S+++VLAKKH+++ATA  IMDSL+EMFGQ S  +RH+A+K IY   M EG+SVREHVL
Sbjct: 74  LASMTDVLAKKHDSIATAKGIMDSLREMFGQPSWSLRHEAIKHIYTKRMKEGTSVREHVL 133

Query: 121 NMIVHFNVTEMNGAIIDEASQLIFIMASLLESFLQFRTNAVMNKLNYSLTIPFNELQSYE 180
           +M++HFN+ E+NG  IDEA+Q+ FI+ SL +SF+ F+TNA +NK+ ++LT   NELQ ++
Sbjct: 134 DMMMHFNIAEVNGGPIDEANQVSFILQSLPKSFVPFQTNASLNKIEFNLTTLLNELQRFQ 193

Query: 181 SMHKSKNQ 189
           ++  SK +
Sbjct: 194 NLTLSKGK 201

BLAST of Cla003509 vs. TrEMBL
Match: W9SH28_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_004989 PE=4 SV=1)

HSP 1 Score: 182.2 bits (461), Expect = 6.4e-43
Identity = 90/191 (47.12%), Positives = 130/191 (68.06%), Query Frame = 1

Query: 1   MNGDNYTSWKNTINTVLIVEDLEFAITEECPQVLAQNASKNICDAYEKWMRANEKARAYI 60
           ++GDNY  WK+ +N +LI ED +F + +ECP   A NA+K   + Y++W++AN KA+ ++
Sbjct: 14  LDGDNYAKWKSNMNILLICEDYKFVLVDECPPEPAANATKTAREPYDRWIKANNKAKCFM 73

Query: 61  IVSLSEVLAKKHETMATACEIMDSLQEMFGQSSSKIRHDALKFIYNVCMNEGSSVREHVL 120
           + S+S+VL KKHE M TA EIM+SL+ MFG  S K R DA++   N  M +GSSV+ HVL
Sbjct: 74  LASMSDVLCKKHEEMETAYEIMESLEAMFGAPSEKARLDAVRAFMNDKMKKGSSVKAHVL 133

Query: 121 NMIVHFNVTEMNGAIIDEASQLIFIMASLLESFLQFRTNAVMNKLNYSLTIPFNELQSYE 180
           NMI H +  E+NGA IDEA+QL  I+ SL   F +F  N VMNK   +LT   N+LQ++E
Sbjct: 134 NMIDHLHDAELNGARIDEATQLGIILESLSPDFHEFVNNFVMNKKKSNLTELMNDLQNFE 193

Query: 181 SMHKSKNQMGR 192
           S +++K +  +
Sbjct: 194 STNQAKGRRSK 204

BLAST of Cla003509 vs. TrEMBL
Match: W9RXH5_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_001554 PE=4 SV=1)

HSP 1 Score: 177.2 bits (448), Expect = 2.0e-41
Identity = 89/184 (48.37%), Positives = 125/184 (67.93%), Query Frame = 1

Query: 1   MNGDNYTSWKNTINTVLIVEDLEFAITEECPQVLAQNASKNICDAYEKWMRANEKARAYI 60
           ++GDNY  WK+ +N +L+ ED +F + EECP   A NASK   + Y++W++AN KA+ ++
Sbjct: 14  LDGDNYAKWKSNMNILLVCEDYKFLLAEECPLEPADNASKTAREPYDRWIKANNKAKCFM 73

Query: 61  IVSLSEVLAKKHETMATACEIMDSLQEMFGQSSSKIRHDALKFIYNVCMNEGSSVREHVL 120
           + S+S+VL KKH  M TA EIM+SL+ MFG  S K   DA++   N  M +GSSV+ HVL
Sbjct: 74  LASMSDVLRKKHGEMETAYEIMESLEAMFGAPSEKACLDAVRAFMNDKMKKGSSVKAHVL 133

Query: 121 NMIVHFNVTEMNGAIIDEASQLIFIMASLLESFLQFRTNAVMNKLNYSLTIPFNELQSYE 180
           NMI H + TE+NGA IDEA+Q+  I+ SL   F +F  N VMNK   +LT   N+LQ++E
Sbjct: 134 NMIDHLHDTELNGARIDEATQVGIILESLSPDFHEFVNNLVMNKKKSNLTELMNDLQNFE 193

Query: 181 SMHK 185
           S +K
Sbjct: 194 STNK 197

BLAST of Cla003509 vs. TrEMBL
Match: E2GK52_BRYDI (Gag/pol protein (Fragment) OS=Bryonia dioica PE=4 SV=1)

HSP 1 Score: 174.9 bits (442), Expect = 1.0e-40
Identity = 79/149 (53.02%), Positives = 118/149 (79.19%), Query Frame = 1

Query: 1   MNGDNYTSWKNTINTVLIVEDLEFAITEECPQVLAQNASKNICDAYEKWMRANEKARAYI 60
           +N DNY++WK+ +NT+L+VEDL F +TEEC Q  A NA++ + +AY++W +AN+KA  YI
Sbjct: 14  LNSDNYSAWKSNLNTILVVEDLRFILTEECHQAPALNANRTVREAYDRWGKANDKACVYI 73

Query: 61  IVSLSEVLAKKHETMATACEIMDSLQEMFGQSSSKIRHDALKFIYNVCMNEGSSVREHVL 120
           + S+++VLAKK++++AT   IMDS +EMFGQ S  +RH+A+K IY   M EG+SVREHVL
Sbjct: 74  LASMTDVLAKKYDSIATTKGIMDSFREMFGQPSWSLRHEAIKRIYTKRMKEGTSVREHVL 133

Query: 121 NMIVHFNVTEMNGAIIDEASQLIFIMASL 150
           +M++HFN+ +++G  IDEA+Q+ FI+ SL
Sbjct: 134 DMMMHFNIAKVHGGPIDEANQVSFILQSL 162

BLAST of Cla003509 vs. TrEMBL
Match: W9RVT2_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_006473 PE=4 SV=1)

HSP 1 Score: 172.6 bits (436), Expect = 5.0e-40
Identity = 87/185 (47.03%), Positives = 123/185 (66.49%), Query Frame = 1

Query: 2   NGDNYTSWKNTINTVLIVEDLEFAITEECPQVLAQNASKNICDAYEKWMRANEKARAYII 61
           +GDNY  WK+ +N +L+ ED +F + EECPQ  A NASK   + Y++W++AN KA+ +++
Sbjct: 15  DGDNYAKWKSNMNILLVCEDYKFVLVEECPQEPAVNASKTAREPYDRWIKANNKAKCFML 74

Query: 62  VSLSEVLAKKHETMATACEIMDSLQEMFGQSSSKIRHDALKFIYNVCMNEGSSVREHVLN 121
            S+S+VL KKHE M TA EIM+SL+ +FG  S K   DA++   N  M +GSSV+ HVLN
Sbjct: 75  ASMSDVLHKKHEEMETAYEIMESLEAIFGAPSEKAHLDAVRAFMNDKMKKGSSVKAHVLN 134

Query: 122 MIVHFNVTEMNGAIIDEASQLIFIMASLLESFLQFRTNAVMNKLNYSLTIPFNELQSYES 181
           MI H +  E+NGA IDEA+Q+  I+ SL     +F  N VMNK   + T   N+LQ+ ES
Sbjct: 135 MIDHLHDAELNGARIDEATQVGIILESLSPDCHEFVNNFVMNKKKSNFTKLMNDLQNLES 194

Query: 182 MHKSK 187
            +K +
Sbjct: 195 ANKRR 199

BLAST of Cla003509 vs. NCBI nr
Match: gi|659113933|ref|XP_008456826.1| (PREDICTED: uncharacterized protein LOC103496664 [Cucumis melo])

HSP 1 Score: 268.1 bits (684), Expect = 1.3e-68
Identity = 129/198 (65.15%), Positives = 164/198 (82.83%), Query Frame = 1

Query: 2   NGDNYTSWKNTINTVLIVEDLEFAITEECPQVLAQNASKNICDAYEKWMRANEKARAYII 61
           NG+NY SWKNTINTVLI++DL F + E+CPQV A NA++ + +AYE+W +ANEKARAY++
Sbjct: 15  NGNNYASWKNTINTVLIIDDLRFVLVEKCPQVSAANATRTVREAYERWAKANEKARAYLL 74

Query: 62  VSLSEVLAKKHETMATACEIMDSLQEMFGQSSSKIRHDALKFIYNVCMNEGSSVREHVLN 121
            SLSEVLAKK+E+M TA EIMDSLQEMFGQ+S +I+HDALK+IYN  MN+G+ VREHVLN
Sbjct: 75  ASLSEVLAKKNESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNARMNDGALVREHVLN 134

Query: 122 MIVHFNVTEMNGAIIDEASQLIFIMASLLESFLQFRTNAVMNKLNYSLTIPFNELQSYES 181
           M+V+FNV EMNGA+IDEA+Q+ FI+ SLLESFLQFR+N VMNK+ Y+LT   NELQ++ES
Sbjct: 135 MMVYFNVAEMNGAVIDEANQVSFILESLLESFLQFRSNVVMNKIAYTLTTLLNELQTFES 194

Query: 182 MHKSKNQMGRQMLPVSLK 200
           + K K Q G   +  S +
Sbjct: 195 LMKIKGQKGEANVATSTR 212

BLAST of Cla003509 vs. NCBI nr
Match: gi|299474487|gb|ADJ18449.1| (gag/pol protein [Bryonia dioica])

HSP 1 Score: 226.9 bits (577), Expect = 3.2e-56
Identity = 102/188 (54.26%), Positives = 152/188 (80.85%), Query Frame = 1

Query: 1   MNGDNYTSWKNTINTVLIVEDLEFAITEECPQVLAQNASKNICDAYEKWMRANEKARAYI 60
           +NGDNY++WK+ +NT+L+V+DL F +TEECPQ  A NA++ + +AY++W++AN+KAR YI
Sbjct: 14  LNGDNYSAWKSNLNTILVVDDLRFVLTEECPQAPALNANRTVREAYDRWVKANDKARVYI 73

Query: 61  IVSLSEVLAKKHETMATACEIMDSLQEMFGQSSSKIRHDALKFIYNVCMNEGSSVREHVL 120
           + S+++VLAKKH+++ATA  IMDSL+EMFGQ S  +RH+A+K IY   M EG+SVREHVL
Sbjct: 74  LASMTDVLAKKHDSIATAKGIMDSLREMFGQPSWSLRHEAIKHIYTKRMKEGTSVREHVL 133

Query: 121 NMIVHFNVTEMNGAIIDEASQLIFIMASLLESFLQFRTNAVMNKLNYSLTIPFNELQSYE 180
           +M++HFN+ E+NG  IDEA+Q+ FI+ SL +SF+ F+TNA +NK+ ++LT   NELQ ++
Sbjct: 134 DMMMHFNIAEVNGGPIDEANQVSFILQSLPKSFVPFQTNASLNKIEFNLTTLLNELQRFQ 193

Query: 181 SMHKSKNQ 189
           ++  SK +
Sbjct: 194 NLTLSKGK 201

BLAST of Cla003509 vs. NCBI nr
Match: gi|659072276|ref|XP_008464721.1| (PREDICTED: uncharacterized protein LOC103502537 [Cucumis melo])

HSP 1 Score: 216.5 bits (550), Expect = 4.4e-53
Identity = 102/140 (72.86%), Positives = 123/140 (87.86%), Query Frame = 1

Query: 1   MNGDNYTSWKNTINTVLIVEDLEFAITEECPQVLAQNASKNICDAYEKWMRANEKARAYI 60
           +NG+NY SWKNTINTVLI++DL F + EECPQV A NA++ + +AYE+W +ANEKARAYI
Sbjct: 13  LNGNNYASWKNTINTVLIIDDLIFVLVEECPQVPAANATRTVREAYERWAKANEKARAYI 72

Query: 61  IVSLSEVLAKKHETMATACEIMDSLQEMFGQSSSKIRHDALKFIYNVCMNEGSSVREHVL 120
           + SLS+VLAKKHE+M T  EIMDSLQEMFGQ+S +I+HDALK+IYN  MNEG+SVREHVL
Sbjct: 73  LASLSKVLAKKHESMLTTREIMDSLQEMFGQASYQIKHDALKYIYNARMNEGASVREHVL 132

Query: 121 NMIVHFNVTEMNGAIIDEAS 141
           NM+VHFNV EMNGA+IDEAS
Sbjct: 133 NMMVHFNVAEMNGAVIDEAS 152

BLAST of Cla003509 vs. NCBI nr
Match: gi|778697615|ref|XP_011654359.1| (PREDICTED: uncharacterized protein LOC105435361 [Cucumis sativus])

HSP 1 Score: 194.5 bits (493), Expect = 1.8e-46
Identity = 87/156 (55.77%), Positives = 126/156 (80.77%), Query Frame = 1

Query: 1   MNGDNYTSWKNTINTVLIVEDLEFAITEECPQVLAQNASKNICDAYEKWMRANEKARAYI 60
           +N DNY +WK+ +NT+L+V+DL F +TEECPQ  A NA++   +AY++W++ANEKAR YI
Sbjct: 14  INDDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTGREAYDRWIKANEKARVYI 73

Query: 61  IVSLSEVLAKKHETMATACEIMDSLQEMFGQSSSKIRHDALKFIYNVCMNEGSSVREHVL 120
           + S+S+VLAKKHE++ATA EIMDSL+ MFGQ    +RH+A+K+IY   M EG+SVREHVL
Sbjct: 74  LASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVL 133

Query: 121 NMIVHFNVTEMNGAIIDEASQLIFIMASLLESFLQF 157
           +M++HFN+ ++NG +I+E +Q+ FI+ SL +SF+ F
Sbjct: 134 DMMMHFNIAQVNGGLIEEVNQVSFILESLPKSFIPF 169

BLAST of Cla003509 vs. NCBI nr
Match: gi|659118732|ref|XP_008459275.1| (PREDICTED: uncharacterized protein LOC103498451 [Cucumis melo])

HSP 1 Score: 192.6 bits (488), Expect = 6.8e-46
Identity = 86/180 (47.78%), Positives = 130/180 (72.22%), Query Frame = 1

Query: 5   NYTSWKNTINTVLIVEDLEFAITEECPQVLAQNASKNICDAYEKWMRANEKARAYIIVSL 64
           +Y +WK+ +N +L++ DL F + EEC   L Q   K++ DAY++W +AN+KA  YI+ S+
Sbjct: 7   DYATWKSKLNMILVITDLRFILMEECSLFLTQGTFKSVRDAYDRWKKANDKAHVYIMASM 66

Query: 65  SEVLAKKHETMATACEIMDSLQEMFGQSSSKIRHDALKFIYNVCMNEGSSVREHVLNMIV 124
           S++L+ KH+ M T  +I+DSL+EMFGQ S +I+ + +K++YN  M +  SV++HVLNMIV
Sbjct: 67  SDILSNKHKIMVTTRQIVDSLREMFGQLSIQIKQETIKYVYNARMKDSQSVKKHVLNMIV 126

Query: 125 HFNVTEMNGAIIDEASQLIFIMASLLESFLQFRTNAVMNKLNYSLTIPFNELQSYESMHK 184
           HFNV EMN  + DE SQ+ FI+  L +S LQF  NA MNK+ Y++TI  NELQ+++S+ +
Sbjct: 127 HFNVVEMNVVVFDEKSQVSFILKYLPKSSLQFNNNAEMNKIKYNMTIFLNELQTFQSLKR 186
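
The HSP header lines in the alignments above follow a regular pattern, so their score, E-value and identity figures can be pulled out with regular expressions. This is a minimal sketch, assuming the header layout shown on this page; `parse_hsp` is a hypothetical helper and not part of BLAST or any BLAST library. The identity percentage is recomputed from the match counts rather than read from the text.

```python
import re

# Patterns for the two HSP header lines as they appear on this page.
SCORE_RE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = ([\d.eE+-]+)")
IDENT_RE = re.compile(r"Identity = (\d+)/(\d+)")


def parse_hsp(score_line, identity_line):
    """Extract bit score, raw score, E-value and identity from an HSP header."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("lines do not look like an HSP header")
    identical, aligned = int(i.group(1)), int(i.group(2))
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        "identical": identical,
        "aligned": aligned,
        "identity_pct": round(100 * identical / aligned, 2),
    }


hsp = parse_hsp(
    "HSP 1 Score: 226.9 bits (577), Expect = 2.3e-56",
    "Identity = 102/188 (54.26%)",
)
print(hsp)
```

Recomputing the percentage from the counts gives a quick consistency check against the value printed in the report.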

The following BLAST results are available for this feature:
Match Name      E-value    Identity (%)    Description
E2GK51_BRYDI    2.3e-56    54.26           Gag/pol protein (Fragment) OS=Bryonia dioica PE=4 SV=1
W9SH28_9ROSA    6.4e-43    47.12           Uncharacterized protein OS=Morus notabilis GN=L484_004989 PE=4 SV=1
W9RXH5_9ROSA    2.0e-41    48.37           Uncharacterized protein OS=Morus notabilis GN=L484_001554 PE=4 SV=1
E2GK52_BRYDI    1.0e-40    53.02           Gag/pol protein (Fragment) OS=Bryonia dioica PE=4 SV=1
W9RVT2_9ROSA    5.0e-40    47.03           Uncharacterized protein OS=Morus notabilis GN=L484_006473 PE=4 SV=1

Match Name                          E-value    Identity (%)    Description
gi|659113933|ref|XP_008456826.1|    1.3e-68    65.15           PREDICTED: uncharacterized protein LOC103496664 [Cucumis melo]
gi|299474487|gb|ADJ18449.1|         3.2e-56    54.26           gag/pol protein [Bryonia dioica]
gi|659072276|ref|XP_008464721.1|    4.4e-53    72.86           PREDICTED: uncharacterized protein LOC103502537 [Cucumis melo]
gi|778697615|ref|XP_011654359.1|    1.8e-46    55.77           PREDICTED: uncharacterized protein LOC105435361 [Cucumis sativus]
gi|659118732|ref|XP_008459275.1|    6.8e-46    47.78           PREDICTED: uncharacterized protein LOC103498451 [Cucumis melo]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR016024    ARM-type_fold
Vocabulary: Molecular Function
Term          Definition
GO:0005488    binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
cellular_component GO:0005575 cellular_component
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0005488 binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name    Type
Cla003509       Cla003509.1    mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term     IPR Description        Source     Source Term        Source Description                 Alignment
IPR016024    Armadillo-type fold    unknown    SSF48371           ARM repeat                         coord: 32..163, score: 3.1
None         No IPR available       PANTHER    PTHR11439          GAG-POL-RELATED RETROTRANSPOSON    coord: 1..188, score: 6.9
None         No IPR available       PANTHER    PTHR11439:SF185    SUBFAMILY NOT NAMED                coord: 1..188, score: 6.9
None         No IPR available       PFAM       PF14223            Retrotran_gag_2                    coord: 49..185, score: 8.3

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None