Cla012456 (gene) Watermelon (97103) v1

NameCla012456
Typegene
OrganismCitrullus. lanatus (Watermelon (97103) v1)
DescriptionRetrotransposon gag protein (AHRD V1 ***- E5GB67_CUCME); contains Interpro domain(s) IPR005162 Retrotransposon gag protein
LocationChr8 : 1093884 .. 1094342 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGGTGTTAGTCAGACTTCACACTTGTATTCCAAACCGTATACAAAGAGGATTGACAAGCTTAGGATGTCAAACGGGTACCAGCCACCCAAGTTTCAGTAGTTTGATGGAATGGGGCAATTCAAAGCAACATGTTGCTCATTTCATCGAAACATGTGAAAATGCTAGTACGCGAGGGGACTTGGAGTCTGAGTCAATTGACAGCTGGAAACAACTCGAAAGAGAGTTTTTAATTCGCTTTTACAACACAAGACGAATAGCCAGTATGTTCGAGCTCACCAACACTAAGCAACGAAAAGGTGAACCCGTCATCGATTACATAAATCGTTGGAGAGCCGCAAGTCTAGATTGCAAAGATCGTCTCACTGAACTCTCTGTCGTTGAGAGGTGCATTCAAGGCATGCATTGGGGACTCCTTTACATTCTCCAAGGTATAAAGCCTCGCACCTTTGAGTAA

mRNA sequence

ATGGAGGTGTTAGTCAGACTTCACACTTGTATTCCAAACCGTATACAAAGAGGATTGACAAGCTTAGGATGTCAAACGGGTACCAGCCACCCAAGTTTCAGTAGTTTGATGGAATGGGGCAATTCAAAGCAACATGTTGCTCATTTCATCGAAACATGTGAAAATGCTAGTACGCGAGGGGACTTGGAGTCTGAGTCAATTGACAGCTGGAAACAACTCGAAAGAGAGTTTTTAATTCGCTTTTACAACACAAGACGAATAGCCAGTATGTTCGAGCTCACCAACACTAAGCAACGAAAAGGTGAACCCGTCATCGATTACATAAATCGTTGGAGAGCCGCAAGTCTAGATTGCAAAGATCGTCTCACTGAACTCTCTGTCGTTGAGAGGTGCATTCAAGGCATGCATTGGGGACTCCTTTACATTCTCCAAGGTATAAAGCCTCGCACCTTTGAGTAA

Coding sequence (CDS)

ATGGAGGTGTTAGTCAGACTTCACACTTGTATTCCAAACCGTATACAAAGAGGATTGACAAGCTTAGGATGTCAAACGGGTACCAGCCACCCAAGTTTCAGTAGTTTGATGGAATGGGGCAATTCAAAGCAACATGTTGCTCATTTCATCGAAACATGTGAAAATGCTAGTACGCGAGGGGACTTGGAGTCTGAGTCAATTGACAGCTGGAAACAACTCGAAAGAGAGTTTTTAATTCGCTTTTACAACACAAGACGAATAGCCAGTATGTTCGAGCTCACCAACACTAAGCAACGAAAAGGTGAACCCGTCATCGATTACATAAATCGTTGGAGAGCCGCAAGTCTAGATTGCAAAGATCGTCTCACTGAACTCTCTGTCGTTGAGAGGTGCATTCAAGGCATGCATTGGGGACTCCTTTACATTCTCCAAGGTATAAAGCCTCGCACCTTTGAGTAA

Protein sequence

MEVLVRLHTCIPNRIQRGLTSLGCQTGTSHPSFSSLMEWGNSKQHVAHFIETCENASTRGDLESESIDSWKQLEREFLIRFYNTRRIASMFELTNTKQRKGEPVIDYINRWRAASLDCKDRLTELSVVERCIQGMHWGLLYILQGIKPRTFE
BLAST of Cla012456 vs. TrEMBL
Match: E5GCP6_CUCME (Ty3-gypsy retrotransposon protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 189.9 bits (481), Expect = 2.3e-45
Identity = 96/154 (62.34%), Postives = 106/154 (68.83%), Query Frame = 1

Query: 19  LTSLGCQTGTSHPSFSSLMEWGNSKQHVAHFIETCENASTRGD----------------- 78
           + +L    G   P F      GN KQH+AHF+ETCENA +RGD                 
Sbjct: 199 IDNLRMPLGYQPPKFQQFDGRGNPKQHIAHFVETCENAGSRGDQLVKQFVRSLKGNAFEW 258

Query: 79  ---LESESIDSWKQLEREFLIRFYNTRRIASMFELTNTKQRKGEPVIDYINRWRAASLDC 138
              LE E ID+W+QLE EFL RFY+TRR+ SM ELTNTKQRKGEPVIDYINRWRA SLDC
Sbjct: 259 YTDLEPEVIDNWEQLEIEFLNRFYSTRRVVSMMELTNTKQRKGEPVIDYINRWRALSLDC 318

Query: 139 KDRLTELSVVERCIQGMHWGLLYILQGIKPRTFE 153
           KD+LTELS VE C QGMHW LLYILQGIKPRTFE
Sbjct: 319 KDKLTELSAVEMCTQGMHWELLYILQGIKPRTFE 352

BLAST of Cla012456 vs. TrEMBL
Match: M5WXF7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa020726mg PE=4 SV=1)

HSP 1 Score: 186.0 bits (471), Expect = 3.3e-44
Identity = 94/144 (65.28%), Postives = 103/144 (71.53%), Query Frame = 1

Query: 17  RGLTSLGCQTGTSHPSFSSLMEWGNSKQHVAHFIETCENASTRGD--------LESESID 76
           R + +L   TG   P F      GN KQ +AHFIETC NA T GD        LESES++
Sbjct: 249 RRIENLRMPTGYQPPKFQQFDGKGNPKQRIAHFIETCNNAGTEGDHLVKQFVHLESESLN 308

Query: 77  SWKQLEREFLIRFYNTRRIASMFELTNTKQRKGEPVIDYINRWRAASLDCKDRLTELSVV 136
           SW Q+EREFL RFY+TRR  SM ELTNTKQ K EPV+DYINRW + SLDCK RL ELS V
Sbjct: 309 SWDQMEREFLNRFYSTRRTVSMMELTNTKQWKDEPVVDYINRWLSLSLDCKYRLLELSAV 368

Query: 137 ERCIQGMHWGLLYILQGIKPRTFE 153
           E CIQGMHWGLLYILQGIKPRTFE
Sbjct: 369 EMCIQGMHWGLLYILQGIKPRTFE 392

BLAST of Cla012456 vs. TrEMBL
Match: E5GBB6_CUCME (Retrotransposon protein putative ty3-gypsy sub-class (Fragment) OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 185.3 bits (469), Expect = 5.7e-44
Identity = 91/126 (72.22%), Postives = 97/126 (76.98%), Query Frame = 1

Query: 27  GTSHPSFSSLMEWGNSKQHVAHFIETCENASTRGDLESESIDSWKQLEREFLIRFYNTRR 86
           G   P F      GN KQHVAHFIETCE + TRGD        W+QLER+FL RFY+TRR
Sbjct: 36  GYQPPKFQQFDGKGNPKQHVAHFIETCETSGTRGDF-------WEQLERDFLNRFYSTRR 95

Query: 87  IASMFELTNTKQRKGEPVIDYINRWRAASLDCKDRLTELSVVERCIQGMHWGLLYILQGI 146
           I SM ELT TK+RKGEPVIDYINRWRA SLDCKDRL+ELS VE C QGMHWGLLYILQGI
Sbjct: 96  IVSMIELTATKKRKGEPVIDYINRWRALSLDCKDRLSELSAVEMCTQGMHWGLLYILQGI 154

Query: 147 KPRTFE 153
           KPRTFE
Sbjct: 156 KPRTFE 154

BLAST of Cla012456 vs. TrEMBL
Match: M5W7Y6_PRUPE (Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa018422mg PE=4 SV=1)

HSP 1 Score: 179.5 bits (454), Expect = 3.1e-42
Identity = 88/136 (64.71%), Postives = 102/136 (75.00%), Query Frame = 1

Query: 19  LTSLGCQTGTSHPSFSSLMEWGNSKQHVAHFIETCENASTRGD--LESESIDSWKQLERE 78
           L +L   TG   P F      GN KQHVAHFIE C +A T  D  ++  SI +W+Q+ERE
Sbjct: 252 LDNLRMPTGYQPPKFMQFNGKGNPKQHVAHFIEMCNSAGTNDDYLVKQFSIHNWEQMERE 311

Query: 79  FLIRFYNTRRIASMFELTNTKQRKGEPVIDYINRWRAASLDCKDRLTELSVVERCIQGMH 138
           FL RFY+TRR  SM ELT+TKQR+ EPV+DYINRWR+ SLDCKDR++ELS VE CIQGMH
Sbjct: 312 FLNRFYSTRRSVSMLELTSTKQRQDEPVVDYINRWRSLSLDCKDRVSELSAVEMCIQGMH 371

Query: 139 WGLLYILQGIKPRTFE 153
           W LLYILQGIKPRTFE
Sbjct: 372 WSLLYILQGIKPRTFE 387

BLAST of Cla012456 vs. TrEMBL
Match: E5GB67_CUCME (Retrotransposon gag protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 178.3 bits (451), Expect = 6.9e-42
Identity = 90/154 (58.44%), Postives = 102/154 (66.23%), Query Frame = 1

Query: 19  LTSLGCQTGTSHPSFSSLMEWGNSKQHVAHFIETCENASTRGD----------------- 78
           + +L    G   P F      GN KQH+AHF+ETCENA +RGD                 
Sbjct: 136 IDNLRMPLGYQPPKFQQFDGKGNPKQHIAHFVETCENAGSRGDQLVRQFVRSLKGNAFEW 195

Query: 79  ---LESESIDSWKQLEREFLIRFYNTRRIASMFELTNTKQRKGEPVIDYINRWRAASLDC 138
              LE E  DSW+QLE+EFL RFY+T+   SM ELT TKQRKG+ +IDYINRWRA SLDC
Sbjct: 196 YTDLEPEVTDSWEQLEKEFLNRFYSTKCTVSMMELTTTKQRKGDSIIDYINRWRALSLDC 255

Query: 139 KDRLTELSVVERCIQGMHWGLLYILQGIKPRTFE 153
           KD+LTELS VE C QGMHW LLYILQGIKPRTFE
Sbjct: 256 KDKLTELSAVEMCTQGMHWELLYILQGIKPRTFE 289

BLAST of Cla012456 vs. NCBI nr
Match: gi|307136441|gb|ADN34247.1| (ty3-gypsy retrotransposon protein [Cucumis melo subsp. melo])

HSP 1 Score: 189.9 bits (481), Expect = 3.3e-45
Identity = 96/154 (62.34%), Postives = 106/154 (68.83%), Query Frame = 1

Query: 19  LTSLGCQTGTSHPSFSSLMEWGNSKQHVAHFIETCENASTRGD----------------- 78
           + +L    G   P F      GN KQH+AHF+ETCENA +RGD                 
Sbjct: 199 IDNLRMPLGYQPPKFQQFDGRGNPKQHIAHFVETCENAGSRGDQLVKQFVRSLKGNAFEW 258

Query: 79  ---LESESIDSWKQLEREFLIRFYNTRRIASMFELTNTKQRKGEPVIDYINRWRAASLDC 138
              LE E ID+W+QLE EFL RFY+TRR+ SM ELTNTKQRKGEPVIDYINRWRA SLDC
Sbjct: 259 YTDLEPEVIDNWEQLEIEFLNRFYSTRRVVSMMELTNTKQRKGEPVIDYINRWRALSLDC 318

Query: 139 KDRLTELSVVERCIQGMHWGLLYILQGIKPRTFE 153
           KD+LTELS VE C QGMHW LLYILQGIKPRTFE
Sbjct: 319 KDKLTELSAVEMCTQGMHWELLYILQGIKPRTFE 352

BLAST of Cla012456 vs. NCBI nr
Match: gi|595883646|ref|XP_007212829.1| (hypothetical protein PRUPE_ppa020726mg [Prunus persica])

HSP 1 Score: 186.0 bits (471), Expect = 4.8e-44
Identity = 94/144 (65.28%), Postives = 103/144 (71.53%), Query Frame = 1

Query: 17  RGLTSLGCQTGTSHPSFSSLMEWGNSKQHVAHFIETCENASTRGD--------LESESID 76
           R + +L   TG   P F      GN KQ +AHFIETC NA T GD        LESES++
Sbjct: 249 RRIENLRMPTGYQPPKFQQFDGKGNPKQRIAHFIETCNNAGTEGDHLVKQFVHLESESLN 308

Query: 77  SWKQLEREFLIRFYNTRRIASMFELTNTKQRKGEPVIDYINRWRAASLDCKDRLTELSVV 136
           SW Q+EREFL RFY+TRR  SM ELTNTKQ K EPV+DYINRW + SLDCK RL ELS V
Sbjct: 309 SWDQMEREFLNRFYSTRRTVSMMELTNTKQWKDEPVVDYINRWLSLSLDCKYRLLELSAV 368

Query: 137 ERCIQGMHWGLLYILQGIKPRTFE 153
           E CIQGMHWGLLYILQGIKPRTFE
Sbjct: 369 EMCIQGMHWGLLYILQGIKPRTFE 392

BLAST of Cla012456 vs. NCBI nr
Match: gi|307135894|gb|ADN33758.1| (retrotransposon protein putative ty3-gypsy sub-class [Cucumis melo subsp. melo])

HSP 1 Score: 185.3 bits (469), Expect = 8.1e-44
Identity = 91/126 (72.22%), Postives = 97/126 (76.98%), Query Frame = 1

Query: 27  GTSHPSFSSLMEWGNSKQHVAHFIETCENASTRGDLESESIDSWKQLEREFLIRFYNTRR 86
           G   P F      GN KQHVAHFIETCE + TRGD        W+QLER+FL RFY+TRR
Sbjct: 36  GYQPPKFQQFDGKGNPKQHVAHFIETCETSGTRGDF-------WEQLERDFLNRFYSTRR 95

Query: 87  IASMFELTNTKQRKGEPVIDYINRWRAASLDCKDRLTELSVVERCIQGMHWGLLYILQGI 146
           I SM ELT TK+RKGEPVIDYINRWRA SLDCKDRL+ELS VE C QGMHWGLLYILQGI
Sbjct: 96  IVSMIELTATKKRKGEPVIDYINRWRALSLDCKDRLSELSAVEMCTQGMHWGLLYILQGI 154

Query: 147 KPRTFE 153
           KPRTFE
Sbjct: 156 KPRTFE 154

BLAST of Cla012456 vs. NCBI nr
Match: gi|698468031|ref|XP_009783263.1| (PREDICTED: uncharacterized protein LOC104231895 [Nicotiana sylvestris])

HSP 1 Score: 180.6 bits (457), Expect = 2.0e-42
Identity = 87/133 (65.41%), Postives = 98/133 (73.68%), Query Frame = 1

Query: 40  GNSKQHVAHFIETCENASTRG--------------------DLESESIDSWKQLEREFLI 99
           GN +QH+AHF+ETC NA T G                    DLE+ESIDSW+QLE+EFL 
Sbjct: 17  GNPRQHIAHFVETCSNAGTHGDLLVKQFVRSLKGNAFGWYIDLETESIDSWEQLEKEFLN 76

Query: 100 RFYNTRRIASMFELTNTKQRKGEPVIDYINRWRAASLDCKDRLTELSVVERCIQGMHWGL 153
           RFY+TRR  SM ELT TKQRK EPV+DYIN W+A  LDCKDRL+E+SVVE CIQGMHW L
Sbjct: 77  RFYSTRRTVSMIELTGTKQRKDEPVVDYINHWKALGLDCKDRLSEISVVEMCIQGMHWDL 136

BLAST of Cla012456 vs. NCBI nr
Match: gi|595841541|ref|XP_007208227.1| (hypothetical protein PRUPE_ppa018422mg, partial [Prunus persica])

HSP 1 Score: 179.5 bits (454), Expect = 4.5e-42
Identity = 88/136 (64.71%), Postives = 102/136 (75.00%), Query Frame = 1

Query: 19  LTSLGCQTGTSHPSFSSLMEWGNSKQHVAHFIETCENASTRGD--LESESIDSWKQLERE 78
           L +L   TG   P F      GN KQHVAHFIE C +A T  D  ++  SI +W+Q+ERE
Sbjct: 252 LDNLRMPTGYQPPKFMQFNGKGNPKQHVAHFIEMCNSAGTNDDYLVKQFSIHNWEQMERE 311

Query: 79  FLIRFYNTRRIASMFELTNTKQRKGEPVIDYINRWRAASLDCKDRLTELSVVERCIQGMH 138
           FL RFY+TRR  SM ELT+TKQR+ EPV+DYINRWR+ SLDCKDR++ELS VE CIQGMH
Sbjct: 312 FLNRFYSTRRSVSMLELTSTKQRQDEPVVDYINRWRSLSLDCKDRVSELSAVEMCIQGMH 371

Query: 139 WGLLYILQGIKPRTFE 153
           W LLYILQGIKPRTFE
Sbjct: 372 WSLLYILQGIKPRTFE 387

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
E5GCP6_CUCME2.3e-4562.34Ty3-gypsy retrotransposon protein OS=Cucumis melo subsp. melo PE=4 SV=1[more]
M5WXF7_PRUPE3.3e-4465.28Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa020726mg PE=4 SV=1[more]
E5GBB6_CUCME5.7e-4472.22Retrotransposon protein putative ty3-gypsy sub-class (Fragment) OS=Cucumis melo ... [more]
M5W7Y6_PRUPE3.1e-4264.71Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa018422mg PE=4 S... [more]
E5GB67_CUCME6.9e-4258.44Retrotransposon gag protein OS=Cucumis melo subsp. melo PE=4 SV=1[more]
Match NameE-valueIdentityDescription
gi|307136441|gb|ADN34247.1|3.3e-4562.34ty3-gypsy retrotransposon protein [Cucumis melo subsp. melo][more]
gi|595883646|ref|XP_007212829.1|4.8e-4465.28hypothetical protein PRUPE_ppa020726mg [Prunus persica][more]
gi|307135894|gb|ADN33758.1|8.1e-4472.22retrotransposon protein putative ty3-gypsy sub-class [Cucumis melo subsp. melo][more]
gi|698468031|ref|XP_009783263.1|2.0e-4265.41PREDICTED: uncharacterized protein LOC104231895 [Nicotiana sylvestris][more]
gi|595841541|ref|XP_007208227.1|4.5e-4264.71hypothetical protein PRUPE_ppa018422mg, partial [Prunus persica][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR005162Retrotrans_gag_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla012456Cla012456.1mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR005162Retrotransposon gag domainPFAMPF03732Retrotrans_gagcoord: 62..136
score: 2.