ClCG09G014950 (gene) Watermelon (Charleston Gray)

Name: ClCG09G014950
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Ty3-gypsy retrotransposon protein
Location: CG_Chr09 : 27963847 .. 27968728 (+)
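
The location field can be unpacked into a sequence ID, start, end, and strand. The sketch below is a minimal illustration, not part of the database: the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re

def parse_location(text):
    """Split a 'SEQID : START .. END (STRAND)' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return seqid, start, end, strand, end - start + 1

seqid, start, end, strand, length = parse_location(
    "CG_Chr09 : 27963847 .. 27968728 (+)"
)
print(seqid, start, end, strand, length)  # CG_Chr09 27963847 27968728 + 4882
```

Under that assumption the gene spans 4882 bp on the forward strand of CG_Chr09.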
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGAATGGTGCAATTCTTCCAGATGTATCAAAATTGGAACCTCTGGATGAATCCAATTACTGTCGCTGGTCTTAGAAACTCCTATATTTTTCGAGCAACTAAAAATCGATTACATTCTCACCATCGATAGTTCTGATAAAGGTAAGATTACTAACAAAGGTAAGTCTACTATGGTAATTGATCTTGACAGCTCCAAAGTCACAGATATGTTAAAATTCAGTCAGTCAAAATCCGAACCTGTTGTAGATCCAGATAAATTTGAGAAAGATAACAAGACAATCTATGGCCATTTACTCAATCATATGACAAACTCATTATTCAATTTATTCATGGTTTAAAAGTCAGCAAAGGTGATATGAGACACTGTGGAGTCTAGGTATAGAAGAGATGATGTAGGCCATTAGAAGTATGTTGTTTAATAATGGTTGTAATTCCAAATGACAGACGAGAAACCAATTGTGGATCAAGTGGATAAATATGAGAATCTGGTGGTGAACGTTTTGTCTGAAGTTATGTAGATGTGCGAAATTCTCCAAGCAAATGTACTGCATGAAAAATTTCCACTGTCCTAGACTGATTACAGAAATCACCTGAACACAAGAAGAACGATTTGACATTGCAGTAATTGATCACTCACATGCACACGTAATAAGCAAATCGACAGAAAGATAAGCTTATCTCTAAAAATTTAAATTTCGTTAATGCTAACCTAGTTGAGTCTTTTGCTGTTAATAGAAAAAGGTTAAAAAATAATAGCAAGCAATCTGTCAAAAATGATTTTTATAAGAGGAAGGAACATCATAAAGTTACTGGTGGGGAGATGGAAAAGAAAAAATTGATATGCTATGTCTATGGAAAACAAGGACACAAATCCTACCAGTGTAACTAAAGGAAAGGAACGCCAGATCAACGACCGATCCGACAGGCCAATCTTGTTGAGCAAAATGACATCATTACTGCTGTGGTCATGGAATCCAACTTGGTAGAAAACAAATCAAATTGGATTCTCAACGCTAGAGCCTTAGATAGTATCTTCCACGACTTCCAAGATACTATCGATCATTAATGTGTGTTTATGGATAACTATGCTACTGTAGGAGTACTTAGGAAAGGGAACATTCTCTTAAAACTTACTTCTAGTAAAACATTATCATTAAGTGATGTGTTGTATGTACCTTTGTTATGTAGGAACTTGGTGTCTAGAAGTTTGTTGAACAAGGCTGGGCTTAAAATTGTACTAGAGGCTGACAAGGTTATTCTCACCAAAAATGGTAACTTTGTTCCTAAGGGGTACTTAGCTAATGGTCTTTTTGTTTTAAATTGTTGAGACTGGCGTGTCCCTAGGTCTCGTAGTTTTGTAAATTTGTAAATACATTGTATTTATCAATAAAATAAGAAGTTATTTTATTTATCTTTTGCATTAACTCAATTCAATAAACTAAGATTCCATGATTATTTTATGTAAACTTAAACATGTATGTGGTTGACATACGAGGAAGGATCATGTTTAAGTAATAACCAAAAGGTCTATAGTATAGGAATAAGGTTGGGTACCTTAATCTGGTAACACTATGGATATGGCTCACTTTATAATTGTTACAAAGAGTTGTAAATTGTTACAAATGATTTGATTCTAATCGTTCATGTGGAGATATATGAGTGGGGGTATCCTATGTAAAGAGTTTGTATAAGAGTAGAACGCGAAATGTTTAATTTTTCTTTGCAACGCTATTAATTGAAGAAATTAACATTTCATAGGATGACTATAGGTGACTCGACCTTAATCCGGAGTAAGTTGTGAACTCCTATCTATGAAGACAATTTTTTGATCTATATGGATGAGAGGTTCCGATCGCTGACTCAATATGACTACCATTTTAGGGATTTGTCTGATTAGGGAGCTAGGAACATATCTTCACAAGATGGAATTCACTCCTTCCCATGTTTAGAGTAAGTAGATAAATTACTCCCTTATGGGCTAATTCCGGGTCTTGAAC
AATGACGCCTCACCCTCTCATTGGCCCGAGAGGGGTTAGTTTATAGTTGGATTATAAACAGTTTGTTCATTAGAAGAAATAGTGGTACTTATGGAGTTAGATATAACTACAGGGTAAAACGGTAATTTGACCTAGTTGTAGTTATGAGTAATTTGTGAAGAGTCGGCTTATTGTTGATTAGTAAATTTCGTGGACACAGAAATATATCTATAGTGTGAAAAGTGCAACTGTAAGTCTTTGGTAGAATGACCTACAGTTAATAAATATTGATTGATCTAATTAAAGAGTTTGATTGATTAATCTCATATCATTGGAGCTTCTACTATAAATCCATAAGGTCCTCTTGCTAGCTCATAAAAGGATTAGTATTAGAATCAAACATTTTGGTTTACTTTGAAATGTTCAAATTAATAAGGGAATTAATTGTATGTGATAATATAAAGTTTATTTTGAATGAGATTCAAAATTTATACTTTATGTTATAAGAGAGACAAATATTTAAATAAGATTTAAATATTTATTATATGAATGAGATTCATATTAAAACTATAGGTTAAAATTAATATAAATGAGATTGATATTAAAAGTATAGATGATGTGAGAGAAGGTTATTTGAATATGATTCAAATTTAAGTTAAATATAAATAAATTAATAAATTAATTAATTTTTAATTAATTAATTTTATTTAATTCTATTTACTAAATATAATTTAATATATTATAATTAAATTAACATTAATTTTAATTAATAATACTATTATATTATTATTTAAAAATAACTCCCATGTAGTGTAGAGTTATTTTTTGTTCTCTCTCACATAGCAGTTTCACTTAAAAAGAAGAGGAAATTGGAGAAGGAGGTGTGACAAGTTTTTACAAAAGAGTTCTGTATTCCTCTCTCAAGTTCCTCTAAAAAATTCTCTTTTCCTCTTAAGAACAATTCTCTTACCAAAGAAATTGAGAGCCCACAACTCCATAGTTGATTCTTCCTACAAAATAGCAAGGAAGATTGTGTAGTGGTGTTCCAAAGAAGAAGAAGAATATCAAATTCTGGTTTGTGATTTGTAGAAGAAGAAGCGTTTCCCACATCAAAGACGGTCTTCAAAGGTAGGAAACTTCTATTCTTCTTAACCTTCTCAAAAGCATGCTAAAGTTTAGTTGTTTATCCAAGCATTTGTTCTCTGTAATTTAAGTTTCTCAAAACAAATTTAATGGCACATGGCGATCACTCGCTTCTGTTGAGGATTCAATTCCTTCATAAACATCATTAATGTTGCTTCTAAGATGAATGAAATTGCTTCCAGTTCTAATTGTATCTGTTGAATGAGAAATTTCGGCCAACCAAATTTCGCCATGTCATCGTGAGTCCAAATGCATCGTGGAATTTATAAATCAGAGTGGGTCGAGTAATTAAAGTTGCTCGGGTTAAAAGCCTAACCCAATTCAAGTCCATGGTGAATTATTTTTTTGGGCCAAAGAAGATATTGGGCCTAAACCCAATCCCAATTTTGGGGCCCAAAAGCAATTGGGCCCAAGCCCATGAAGGCCCATAGAAACTCTATAAATAGAGGCTTTATCTCTTCATTTAGGATGGATCCTTTTTGGAGAGTGAAGAAAAGTACTGAAGCTCTGGAAATCTGGATTCTGAAGAATTGAAGGCTGAAGCTCAAAATCTAAAGACTGAAGCTCCCAAGCTCACAAGCTCTGAAGATTGAAGCTCCCTGAAGCTCCAAAGAAGCTAAAGATTCAAAGATCCAAAGATTGAAGACTCTAGAGTTGAAGACCTTGAAAGAAAAGCTTCATAGAAGATCTCAAGAATTCAAGTCGCACGCGTTTACTTTTTGAGAGAAAGAATAAAAGGATCAAATATATTAGAGATTGTACTCATAACACTGAAATTAATACAAATATGAAGCTCAAGTTCCACGAATCAAATCTTCGGAAATCTCGTCGAACAAATTGGCACGCCCGGTGGGACAGTCTCTACCTTTCATCTCTTT
CTCCCGTACAAGACATACAACATAACATCCAAAGACAATGCTTCTAACGCTTCAAGTAACACTTCCAGAAGACCGATTACTCGCAGCCGCTCTAAAGAAATACAGTCGGAAGAGCAACCTACCTTTGAGATCGCGAGAAATATATGGGAACGGATCTCAAGAGACCCGAAAGCTGGGGTCGTCATCAAAGAGAATCCTACACTTGACAAGCCTACCTCAGCTTCTGAACGACCAAGCGAGGAGGCATCCCAACCAAATGTAATGTCAGTCATGATGGCCGACGTGGGAACAAGTGAGGAGAGAATGGCTGAACTTGAAAAGAAAGTTAACATACTACTGAAGGCAGTTGAAGAAAGGGATTATGAGATTGCATCCCTCAAGAATCATATTGAAAGTCGTGATGCGGCTGAATCAAGTCATACACCTGCAGTCAAGAATAATGACAAAGGGAAGAAGGTTTTGCAAGATAGTCAACCCCAAGATTCAACTTCAATTGCTTCGTTGTCTGTCCAAAAGTTGCAGAAAATGATTGCGAACTCCATCAAGGCTCAATATGGAGGTCCCAGTCAAACCTCCCTCTTGTATTCCAAGCCGTATACGAAGAGGATTGACAACCTAAGGATGCCAAATAGGTATCAGCCGCCAAAATTCCAGCAGTTCGACGGAAAGGGCAACCCAAAGCAACATGTTGCTCACTTCATTGAAACCTGTGAAAACGCTGGAACTAGAGGAGACCTGCTAGTTAAACAGTTTGTTCGAACTCTGAAGGAAACGCCTTCGACTGGTACACGGACCTGGAGCCCGAAACTATCGATAGCTGGGAGCAGCTTGAGAGAGAATTCCTCAATCGTTTCTACAGTACTAGGCGCATCGTTAGTATGA

mRNA sequence

ATGAAGAATGGTGCAATTCTTCCAGATGTATCAAAATTGGAACCTCTGGATGAATCCAATTACTGTCGCTGTTCTGATAAAGGTAAGATTACTAACAAAGGTAAGTCTACTATGGTAATTGATCTTGACAGCTCCAAAGTCACAGATATGTTAAAATTCAGTCAGTCAAAATCCGAACCTGTTGACACAAATCCTACCAGTGTAACTAAAGGAAAGGAACGCCAGATCAACGACCGATCCGACAGGCCAATCTTGTTGAGCAAAATGACATCATTACTGCTGTGGTCATGGAATCCAACTTGGAACTTGGTGTCTAGAAGTTTGTTGAACAAGGCTGGGCTTAAAATTGTACTAGAGGCTGACAAGGTTATTCTCACCAAAAATGGTAACTTTGTTCCTAAGGGTGGTGTTCCAAAGAAGAAGAAGAATATCAAATTCTGGTTTGTGATTTGTAGAAGAAGAAGCGTTTCCCACATCAAAGACGGTCTTCAAAGCTCCAAAGAAGCTAAAGATTCAAAGATCCAAAGATTGAAGACTCTAGAGTTGAAGACCTTGAAAGAAAAGCTTCATAGAAGATCTCAAGAATTCAACTCAAGTTCCACGAATCAAATCTTCGGAAATCTCGTCGAACAAATTGGCACGCCCGGTGGGACAGTCTCTACCTTTCATCTCTTTCTCCCGTACAAGACATACAACATAACATCCAAAGACAATGCTTCTAACGCTTCAAGTAACACTTCCAGAAGACCGATTACTCGCAGCCGCTCTAAAGAAATACAGTCGGAAGAGCAACCTACCTTTGAGATCGCGAGAAATATATGGGAACGGATCTCAAGAGACCCGAAAGCTGGGGTCGTCATCAAAGAGAATCCTACACTTGACAAGCCTACCTCAGCTTCTGAACGACCAAGCGAGGAGGCATCCCAACCAAATGTAATGTCAGTCATGATGGCCGACGTGGGAACAAGTGAGGAGAGAATGGCTGAACTTGAAAAGAAAGTTAACATACTACTGAAGGCAGTTGAAGAAAGGGATTATGAGATTGCATCCCTCAAGAATCATATTGAAAGTCGTGATGCGGCTGAATCAAGTCATACACCTGCAGTCAAGAATAATGACAAAGGGAAGAAGGTTTTGCAAGATAGTCAACCCCAAGATTCAACTTCAATTGCTTCGTTGTCTGTCCAAAAGTTGCAGAAAATGATTGCGAACTCCATCAAGGCTCAATATGGAGGTCCCAGTCAAACCTCCCTCTTGTATTCCAAGCCGTATACGAAGAGGATTGACAACCTAAGGATGCCAAATAGGTATCAGCCGCCAAAATTCCAGCAGTTCGACGGAAAGGGCAACCCAAAGCAACATGTTGCTCACTTCATTGAAACCTGTGAAAACGCTGGAACTAGAGGAGACCTGCTAGTTAAACAGTTTGTTCGAACTCTGAAGGAAACGCCTTCGACTGGTACACGGACCTGGAGCCCGAAACTATCGATAGCTGGGAGCAGCTTGAGAGAGAATTCCTCAATCGTTTCTACAGTACTAGGCGCATCGTTAGTATGA

Coding sequence (CDS)

ATGAAGAATGGTGCAATTCTTCCAGATGTATCAAAATTGGAACCTCTGGATGAATCCAATTACTGTCGCTGTTCTGATAAAGGTAAGATTACTAACAAAGGTAAGTCTACTATGGTAATTGATCTTGACAGCTCCAAAGTCACAGATATGTTAAAATTCAGTCAGTCAAAATCCGAACCTGTTGACACAAATCCTACCAGTGTAACTAAAGGAAAGGAACGCCAGATCAACGACCGATCCGACAGGCCAATCTTGTTGAGCAAAATGACATCATTACTGCTGTGGTCATGGAATCCAACTTGGAACTTGGTGTCTAGAAGTTTGTTGAACAAGGCTGGGCTTAAAATTGTACTAGAGGCTGACAAGGTTATTCTCACCAAAAATGGTAACTTTGTTCCTAAGGGTGGTGTTCCAAAGAAGAAGAAGAATATCAAATTCTGGTTTGTGATTTGTAGAAGAAGAAGCGTTTCCCACATCAAAGACGGTCTTCAAAGCTCCAAAGAAGCTAAAGATTCAAAGATCCAAAGATTGAAGACTCTAGAGTTGAAGACCTTGAAAGAAAAGCTTCATAGAAGATCTCAAGAATTCAACTCAAGTTCCACGAATCAAATCTTCGGAAATCTCGTCGAACAAATTGGCACGCCCGGTGGGACAGTCTCTACCTTTCATCTCTTTCTCCCGTACAAGACATACAACATAACATCCAAAGACAATGCTTCTAACGCTTCAAGTAACACTTCCAGAAGACCGATTACTCGCAGCCGCTCTAAAGAAATACAGTCGGAAGAGCAACCTACCTTTGAGATCGCGAGAAATATATGGGAACGGATCTCAAGAGACCCGAAAGCTGGGGTCGTCATCAAAGAGAATCCTACACTTGACAAGCCTACCTCAGCTTCTGAACGACCAAGCGAGGAGGCATCCCAACCAAATGTAATGTCAGTCATGATGGCCGACGTGGGAACAAGTGAGGAGAGAATGGCTGAACTTGAAAAGAAAGTTAACATACTACTGAAGGCAGTTGAAGAAAGGGATTATGAGATTGCATCCCTCAAGAATCATATTGAAAGTCGTGATGCGGCTGAATCAAGTCATACACCTGCAGTCAAGAATAATGACAAAGGGAAGAAGGTTTTGCAAGATAGTCAACCCCAAGATTCAACTTCAATTGCTTCGTTGTCTGTCCAAAAGTTGCAGAAAATGATTGCGAACTCCATCAAGGCTCAATATGGAGGTCCCAGTCAAACCTCCCTCTTGTATTCCAAGCCGTATACGAAGAGGATTGACAACCTAAGGATGCCAAATAGGTATCAGCCGCCAAAATTCCAGCAGTTCGACGGAAAGGGCAACCCAAAGCAACATGTTGCTCACTTCATTGAAACCTGTGAAAACGCTGGAACTAGAGGAGACCTGCTAGTTAAACAGTTTGTTCGAACTCTGAAGGAAACGCCTTCGACTGGTACACGGACCTGGAGCCCGAAACTATCGATAGCTGGGAGCAGCTTGAGAGAGAATTCCTCAATCGTTTCTACAGTACTAGGCGCATCGTTAGTATGA

Protein sequence

MKNGAILPDVSKLEPLDESNYCRCSDKGKITNKGKSTMVIDLDSSKVTDMLKFSQSKSEPVDTNPTSVTKGKERQINDRSDRPILLSKMTSLLLWSWNPTWNLVSRSLLNKAGLKIVLEADKVILTKNGNFVPKGGVPKKKKNIKFWFVICRRRSVSHIKDGLQSSKEAKDSKIQRLKTLELKTLKEKLHRRSQEFNSSSTNQIFGNLVEQIGTPGGTVSTFHLFLPYKTYNITSKDNASNASSNTSRRPITRSRSKEIQSEEQPTFEIARNIWERISRDPKAGVVIKENPTLDKPTSASERPSEEASQPNVMSVMMADVGTSEERMAELEKKVNILLKAVEERDYEIASLKNHIESRDAAESSHTPAVKNNDKGKKVLQDSQPQDSTSIASLSVQKLQKMIANSIKAQYGGPSQTSLLYSKPYTKRIDNLRMPNRYQPPKFQQFDGKGNPKQHVAHFIETCENAGTRGDLLVKQFVRTLKETPSTGTRTWSPKLSIAGSSLRENSSIVSTVLGASLV
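
The protein sequence above should correspond to a translation of the CDS. As a rough check, the sketch below translates the first ten codons of the CDS with the standard genetic code. The helper `translate` and the codon-table construction are illustrative assumptions; the record does not say which tool or translation table produced the protein.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (third base varies fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a full codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the CDS listed above.
cds_prefix = "ATGAAGAATGGTGCAATTCTTCCAGATGTA"
print(translate(cds_prefix))  # MKNGAILPDV
```

The output matches the first ten residues of the protein sequence (MKNGAILPDV). Passing the full CDS string to the same function would extend the check to the whole record.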
BLAST of ClCG09G014950 vs. TrEMBL
Match: E5GCP6_CUCME (Ty3-gypsy retrotransposon protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 269.6 bits (688), Expect = 7.8e-69
Identity = 138/245 (56.33%), Positives = 185/245 (75.51%), Query Frame = 1

Query: 240 SNASSNTSRRPITRSRSKEI--QSEEQPTFEIARNIWERISRDPKAGVVIKENPTLDKPT 299
           S+ +S++    +T+S  K    + E++  F + +   E++   PK G++I++NP  +  T
Sbjct: 10  SSVASDSYIGLVTQSHLKRSMQEQEQEQGFVLKKKSLEQLIESPKGGIIIRDNPLFNNST 69

Query: 300 SASERPSEEASQPNVMSVMMADVGTSEERMAELEKKVNILLKAVEERDYEIASLKNHIES 359
            AS   S++AS   V+SVMM DV T+E  + E+E+K+N L+K +EERD+EIA+LK+ +++
Sbjct: 70  PASNL-SDKASHLEVVSVMMVDV-TAEATVTEMERKINFLMKVIEERDHEIAALKDQMKA 129

Query: 360 RDAAESSHTPAVKNNDKGKKVLQDSQPQD-STSIASLSVQKLQKMIANSIKAQYGGPSQT 419
            +  ESS TP VK  DKGK V+Q++QPQ  S S+ASLSVQ+LQ MIANSI+AQYGGP QT
Sbjct: 130 CETGESSQTPVVKATDKGKNVVQENQPQQQSVSVASLSVQQLQDMIANSIRAQYGGPPQT 189

Query: 420 SLLYSKPYTKRIDNLRMPNRYQPPKFQQFDGKGNPKQHVAHFIETCENAGTRGDLLVKQF 479
           S +YSK YTKRIDNLRMP  YQPPKFQQFDG+GNPKQH+AHF+ETCENAG+RGD LVKQF
Sbjct: 190 SFMYSKSYTKRIDNLRMPLGYQPPKFQQFDGRGNPKQHIAHFVETCENAGSRGDQLVKQF 249

Query: 480 VRTLK 482
           VR+LK
Sbjct: 250 VRSLK 252
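
The percentages in the HSP summary line follow from the match counts and the alignment length. The sketch below reproduces the arithmetic for the HSP above; the function name `percent` is hypothetical, and rounding to two decimals is an assumption chosen to match the displayed values.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length (2 decimals)."""
    return round(100.0 * matches / aligned_length, 2)

# Values taken from the HSP summary line: 138 identical and 185 positive
# positions over an alignment of 245 columns.
print(percent(138, 245))  # 56.33
print(percent(185, 245))  # 75.51
```

The alignment length (245) counts alignment columns, including gap positions, so it can exceed the number of query residues covered.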

BLAST of ClCG09G014950 vs. TrEMBL
Match: E5GB67_CUCME (Retrotransposon gag protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 213.8 bits (543), Expect = 5.1e-52
Identity = 113/220 (51.36%), Positives = 145/220 (65.91%), Query Frame = 1

Query: 262 EEQPTFEIARNIWERISRDPKAGVVIKENPTLDKPTSASERPSEEASQPNVMSVMMADVG 321
           E++  F + +   E++    K G++I++NP  +  T  S   S++ S   V+SVMM DV 
Sbjct: 3   EQEQGFVLKKKSLEQLIESLKGGIIIRDNPLFNNSTPTSNL-SDKESHLEVVSVMMIDV- 62

Query: 322 TSEERMAELEKKVNILLKAVEERDYEIASLKNHIESRDAAESSHTPAVKNNDKGKKVLQD 381
           T+E  MAE+EKK+N L+K VEERD+EIA+LK+ +++ ++AESS T  VK  DKGK V   
Sbjct: 63  TAEATMAEMEKKINFLMKVVEERDHEIAALKDQMKACESAESSQTSVVKTTDKGKNV--- 122

Query: 382 SQPQDSTSIASLSVQKLQKMIANSIKAQYGGPSQTSLLYSKPYTKRIDNLRMPNRYQPPK 441
                                       YGGP QTS +YSKPYTKRIDNLRMP  YQPPK
Sbjct: 123 ----------------------------YGGPPQTSFMYSKPYTKRIDNLRMPLGYQPPK 182

Query: 442 FQQFDGKGNPKQHVAHFIETCENAGTRGDLLVKQFVRTLK 482
           FQQFDGKGNPKQH+AHF+ETCENAG+RGD LV+QFVR+LK
Sbjct: 183 FQQFDGKGNPKQHIAHFVETCENAGSRGDQLVRQFVRSLK 189

BLAST of ClCG09G014950 vs. TrEMBL
Match: M5W5Z1_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa022233mg PE=4 SV=1)

HSP 1 Score: 182.6 bits (462), Expect = 1.2e-42
Identity = 114/264 (43.18%), Positives = 153/264 (57.95%), Query Frame = 1

Query: 236 KDNASNASSNTSRRPITRSRSKEIQSEEQ----PTFEIARNIWERISRDPKAGVVIKENP 295
           K  +++ SS  S   +TRS++K I         PT   A+++  R   D   G      P
Sbjct: 15  KSVSASFSSRDSAGVVTRSKAKAIFVTRHVTSIPTQAKAKDV-NRRHADSSTGSYHGSPP 74

Query: 296 TLDKPTSASERPSEEASQPNVMSVMMADVGTSEERMAELEKKVNILLKAVEERDYEIASL 355
              +   AS    E  S    M VM+    + EE+MA + + V  L K VEE+D +IASL
Sbjct: 75  DDQQKKIASASVGESYSM--AMQVMVTGAMSIEEQMAHMSEAVTKLTKMVEEKDVQIASL 134

Query: 356 KNHIE-------SRDAAESSHTPAVKNNDKGKKVLQDSQPQ-----DSTSIASLSVQKLQ 415
            N  E       S+D  +       ++++KG     +S  +     ++ S+ SLS+Q+LQ
Sbjct: 135 INKWETHQDKEPSQDVHKKESHHEAESSEKGLGHETESGDKSHGKGNTASVGSLSIQQLQ 194

Query: 416 KMIANSIKAQYGGPSQTSLLYSKPYTKRIDNLRMPNRYQPPKFQQFDGKGNPKQHVAHFI 475
            MI N+I+AQYGGPSQ + +YSKPYTKR+DNLRMP  YQPPKF QFDGKGNPKQHVAHFI
Sbjct: 195 DMITNTIRAQYGGPSQDTFIYSKPYTKRLDNLRMPMGYQPPKFMQFDGKGNPKQHVAHFI 254

Query: 476 ETCENAGTRGDLLVKQFVRTLKET 484
           + C +AGT  D LVKQFVR+L+ T
Sbjct: 255 DMCNSAGTNDDYLVKQFVRSLRGT 275

BLAST of ClCG09G014950 vs. TrEMBL
Match: M5W7Y6_PRUPE (Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa018422mg PE=4 SV=1)

HSP 1 Score: 173.7 bits (439), Expect = 5.8e-40
Identity = 109/263 (41.44%), Positives = 144/263 (54.75%), Query Frame = 1

Query: 230 TYNITSKDNASNASS-NTSRRPITRSRSKEI-------QSEEQPTFEIARNIWERISRDP 289
           T ++TS    + A   N  R P+     K I        S E+ +F  ++N  ER S  P
Sbjct: 41  TRHVTSIPTQAKAKDVNRRREPVINLAKKTIARADERNSSNEEKSFSCSKNSRER-SLSP 100

Query: 290 KAGVVIKENPTLDKPTSASERPSEEASQPN----VMSVMMADVGTSEERMAELEKKVNIL 349
            +      +     P    ++    AS        M VM+    + EE++A + + V  L
Sbjct: 101 VSDADSSTSSYHGSPPDYQQKKIASASVGESYFMAMQVMVTGAMSIEEQLAHMSEAVTKL 160

Query: 350 LKAVEERDYEIASLKNHIESRDAAESSHTPAVKNN----DKGKKVLQDSQPQDSTSIASL 409
            K VEE+D +IASL N  E+    E S     K +    +  +K L      +S S+ SL
Sbjct: 161 TKMVEEKDVQIASLINKWETHQDKEPSQDVHKKESHHEAESSEKGL--GHETESASVGSL 220

Query: 410 SVQKLQKMIANSIKAQYGGPSQTSLLYSKPYTKRIDNLRMPNRYQPPKFQQFDGKGNPKQ 469
           S+Q+LQ MI N+I+AQYG PSQ + +YSKPYTKR+DNLRMP  YQPPKF QF+GKGNPKQ
Sbjct: 221 SIQQLQDMITNTIRAQYGRPSQDTFIYSKPYTKRLDNLRMPTGYQPPKFMQFNGKGNPKQ 280

Query: 470 HVAHFIETCENAGTRGDLLVKQF 477
           HVAHFIE C +AGT  D LVKQF
Sbjct: 281 HVAHFIEMCNSAGTNDDYLVKQF 300

BLAST of ClCG09G014950 vs. TrEMBL
Match: M5WEC9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025679mg PE=4 SV=1)

HSP 1 Score: 172.2 bits (435), Expect = 1.7e-39
Identity = 93/183 (50.82%), Positives = 121/183 (66.12%), Query Frame = 1

Query: 313 MSVMMADVGTSEERMAELEKKVNILLKAVEERDYEIASLKNHIESRDAAESSHTPAVK-- 372
           M VM+    + EE++A + + V  L+K VEE+D +IASL N  E+    E S     K  
Sbjct: 3   MQVMVTGAMSIEEQLAHMSEAVTKLMKMVEEKDVQIASLINKWETHQDKEPSEDVHKKES 62

Query: 373 -----NNDKGKKVLQDSQPQ-----DSTSIASLSVQKLQKMIANSIKAQYGGPSQTSLLY 432
                +++KG     +S  +     ++ S+ SLS+Q+LQ MI N+I+AQYGGPSQ + +Y
Sbjct: 63  HHEAESSEKGLGHETESGDKSHGKGNTASVGSLSIQQLQDMITNTIRAQYGGPSQDTFIY 122

Query: 433 SKPYTKRIDNLRMPNRYQPPKFQQFDGKGNPKQHVAHFIETCENAGTRGDLLVKQFVRTL 484
           SKPYTKR+DNLRMP  YQP KF QFDGKGNPKQHVAHFIE C + GT  D LVKQFVR+L
Sbjct: 123 SKPYTKRLDNLRMPTGYQPLKFMQFDGKGNPKQHVAHFIEMCNSVGTNDDYLVKQFVRSL 182

BLAST of ClCG09G014950 vs. NCBI nr
Match: gi|307136441|gb|ADN34247.1| (ty3-gypsy retrotransposon protein [Cucumis melo subsp. melo])

HSP 1 Score: 269.6 bits (688), Expect = 1.1e-68
Identity = 138/245 (56.33%), Positives = 185/245 (75.51%), Query Frame = 1

Query: 240 SNASSNTSRRPITRSRSKEI--QSEEQPTFEIARNIWERISRDPKAGVVIKENPTLDKPT 299
           S+ +S++    +T+S  K    + E++  F + +   E++   PK G++I++NP  +  T
Sbjct: 10  SSVASDSYIGLVTQSHLKRSMQEQEQEQGFVLKKKSLEQLIESPKGGIIIRDNPLFNNST 69

Query: 300 SASERPSEEASQPNVMSVMMADVGTSEERMAELEKKVNILLKAVEERDYEIASLKNHIES 359
            AS   S++AS   V+SVMM DV T+E  + E+E+K+N L+K +EERD+EIA+LK+ +++
Sbjct: 70  PASNL-SDKASHLEVVSVMMVDV-TAEATVTEMERKINFLMKVIEERDHEIAALKDQMKA 129

Query: 360 RDAAESSHTPAVKNNDKGKKVLQDSQPQD-STSIASLSVQKLQKMIANSIKAQYGGPSQT 419
            +  ESS TP VK  DKGK V+Q++QPQ  S S+ASLSVQ+LQ MIANSI+AQYGGP QT
Sbjct: 130 CETGESSQTPVVKATDKGKNVVQENQPQQQSVSVASLSVQQLQDMIANSIRAQYGGPPQT 189

Query: 420 SLLYSKPYTKRIDNLRMPNRYQPPKFQQFDGKGNPKQHVAHFIETCENAGTRGDLLVKQF 479
           S +YSK YTKRIDNLRMP  YQPPKFQQFDG+GNPKQH+AHF+ETCENAG+RGD LVKQF
Sbjct: 190 SFMYSKSYTKRIDNLRMPLGYQPPKFQQFDGRGNPKQHIAHFVETCENAGSRGDQLVKQF 249

Query: 480 VRTLK 482
           VR+LK
Sbjct: 250 VRSLK 252

BLAST of ClCG09G014950 vs. NCBI nr
Match: gi|659074577|ref|XP_008437679.1| (PREDICTED: uncharacterized protein LOC103483019 [Cucumis melo])

HSP 1 Score: 249.6 bits (636), Expect = 1.2e-62
Identity = 121/144 (84.03%), Positives = 133/144 (92.36%), Query Frame = 1

Query: 338 LKAVEERDYEIASLKNHIESRDAAESSHTPAVKNNDKGKKVLQDSQPQDSTSIASLSVQK 397
           +KAVEERD+EIA LKNHIESRDAAESSHT  +KN +KGK ++Q+SQPQ+STSIASLSVQ+
Sbjct: 1   MKAVEERDFEIALLKNHIESRDAAESSHTHTIKNANKGKAIMQESQPQNSTSIASLSVQQ 60

Query: 398 LQKMIANSIKAQYGGPSQTSLLYSKPYTKRIDNLRMPNRYQPPKFQQFDGKGNPKQHVAH 457
           LQ+MIANSIK QYGGP QT  LYSKPYTKRIDN+RMP+ YQPPKFQQFDGKGNPKQHVAH
Sbjct: 61  LQEMIANSIKTQYGGPVQTFSLYSKPYTKRIDNMRMPHEYQPPKFQQFDGKGNPKQHVAH 120

Query: 458 FIETCENAGTRGDLLVKQFVRTLK 482
           FIETCE AGTRGDLLVKQFVRTLK
Sbjct: 121 FIETCETAGTRGDLLVKQFVRTLK 144

BLAST of ClCG09G014950 vs. NCBI nr
Match: gi|307135838|gb|ADN33709.1| (retrotransposon gag protein [Cucumis melo subsp. melo])

HSP 1 Score: 213.8 bits (543), Expect = 7.3e-52
Identity = 113/220 (51.36%), Positives = 145/220 (65.91%), Query Frame = 1

Query: 262 EEQPTFEIARNIWERISRDPKAGVVIKENPTLDKPTSASERPSEEASQPNVMSVMMADVG 321
           E++  F + +   E++    K G++I++NP  +  T  S   S++ S   V+SVMM DV 
Sbjct: 3   EQEQGFVLKKKSLEQLIESLKGGIIIRDNPLFNNSTPTSNL-SDKESHLEVVSVMMIDV- 62

Query: 322 TSEERMAELEKKVNILLKAVEERDYEIASLKNHIESRDAAESSHTPAVKNNDKGKKVLQD 381
           T+E  MAE+EKK+N L+K VEERD+EIA+LK+ +++ ++AESS T  VK  DKGK V   
Sbjct: 63  TAEATMAEMEKKINFLMKVVEERDHEIAALKDQMKACESAESSQTSVVKTTDKGKNV--- 122

Query: 382 SQPQDSTSIASLSVQKLQKMIANSIKAQYGGPSQTSLLYSKPYTKRIDNLRMPNRYQPPK 441
                                       YGGP QTS +YSKPYTKRIDNLRMP  YQPPK
Sbjct: 123 ----------------------------YGGPPQTSFMYSKPYTKRIDNLRMPLGYQPPK 182

Query: 442 FQQFDGKGNPKQHVAHFIETCENAGTRGDLLVKQFVRTLK 482
           FQQFDGKGNPKQH+AHF+ETCENAG+RGD LV+QFVR+LK
Sbjct: 183 FQQFDGKGNPKQHIAHFVETCENAGSRGDQLVRQFVRSLK 189

BLAST of ClCG09G014950 vs. NCBI nr
Match: gi|659118742|ref|XP_008459280.1| (PREDICTED: uncharacterized protein LOC103498458 [Cucumis melo])

HSP 1 Score: 194.5 bits (493), Expect = 4.6e-46
Identity = 103/211 (48.82%), Positives = 140/211 (66.35%), Query Frame = 1

Query: 271 RNIWERISRDPKAGVVIKENPTLDKPTSASERPSEEASQPNVMSVMMADVGTSEERMAEL 330
           ++I +++   PKAG+ IKENP  D   SA  +  +EA  P+VMSVMM D+ T+E  M E+
Sbjct: 2   QSILKQLMLSPKAGIFIKENPLYDNFDSALSKLKKEA-HPDVMSVMMVDI-TAEAAMTEM 61

Query: 331 EKKVNILLKAVEERDYEIASLKNHIESRDAAESSHTPAVKNNDKGKKVLQDSQPQDSTSI 390
           E+K+N+++K V+E+D++I +L+  ++  +  +SS TP                       
Sbjct: 62  ERKINLVMKVVKEQDHKITALREQMQIHETTKSSQTP----------------------- 121

Query: 391 ASLSVQKLQKMIANSIKAQYGGPSQTSLLYSKPYTKRIDNLRMPNRYQPPKFQQFDGKGN 450
                 +LQ MI +SI+AQYGGP QTS +YSKPYTKRI++LRMP  YQP KFQQFD KGN
Sbjct: 122 ------QLQDMITSSIRAQYGGPPQTSFMYSKPYTKRINDLRMPVGYQPLKFQQFDRKGN 181

Query: 451 PKQHVAHFIETCENAGTRGDLLVKQFVRTLK 482
             QHVAHF+ETC+N G+RGD LVKQFVR+LK
Sbjct: 182 LNQHVAHFVETCKNTGSRGDQLVKQFVRSLK 181

BLAST of ClCG09G014950 vs. NCBI nr
Match: gi|848894615|ref|XP_012847343.1| (PREDICTED: uncharacterized protein LOC105967287 [Erythranthe guttata])

HSP 1 Score: 193.7 bits (491), Expect = 7.8e-46
Identity = 106/209 (50.72%), Positives = 138/209 (66.03%), Query Frame = 1

Query: 284 GVVIKENPTLDKPTSASERPSEEASQPNVMSVMMADVGTSEERMAELEKKVNILLKAVEE 343
           G +     T    +S SE    ++     + VM+ D  + +E++A L + V  L K VE+
Sbjct: 28  GKIDNTTTTNGSSSSGSEEIDSKSQAFTTLPVMVVDATSVDEQLAHLTQTVANLQKIVED 87

Query: 344 RDYEIASLKNHIESRDAAESS--HTPAVKNNDKGKKVLQDSQPQ---------DSTSIAS 403
           +D +IA L + +E  +  E S  H      ++K K+V +++ P+          S S+A+
Sbjct: 88  KDIKIAQLMDKLEQSEVGEFSPKHESYPLRDEKAKQV-EEAHPKPDFVQGATHSSMSVAT 147

Query: 404 LSVQKLQKMIANSIKAQYGGPSQTSLLYSKPYTKRIDNLRMPNRYQPPKFQQFDGKGNPK 463
           LSVQ+LQ+MIAN+IKAQYGGPSQ S +YSKPYTKRID LRMP  YQPPK QQFDGKGNPK
Sbjct: 148 LSVQQLQEMIANTIKAQYGGPSQVSPMYSKPYTKRIDALRMPAGYQPPKLQQFDGKGNPK 207

Query: 464 QHVAHFIETCENAGTRGDLLVKQFVRTLK 482
           QH+AHFIETC NAGT GDLLVKQFVR+LK
Sbjct: 208 QHIAHFIETCNNAGTNGDLLVKQFVRSLK 235

The following BLAST results are available for this feature:
Match Name      E-value    Identity   Description
E5GCP6_CUCME    7.8e-69    56.33      Ty3-gypsy retrotransposon protein OS=Cucumis melo subsp. melo PE=4 SV=1
E5GB67_CUCME    5.1e-52    51.36      Retrotransposon gag protein OS=Cucumis melo subsp. melo PE=4 SV=1
M5W5Z1_PRUPE    1.2e-42    43.18      Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa022233mg PE=4 SV=1
M5W7Y6_PRUPE    5.8e-40    41.44      Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa018422mg PE=4 S...
M5WEC9_PRUPE    1.7e-39    50.82      Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025679mg PE=4 SV=1

Match Name                          E-value    Identity   Description
gi|307136441|gb|ADN34247.1|         1.1e-68    56.33      ty3-gypsy retrotransposon protein [Cucumis melo subsp. melo]
gi|659074577|ref|XP_008437679.1|    1.2e-62    84.03      PREDICTED: uncharacterized protein LOC103483019 [Cucumis melo]
gi|307135838|gb|ADN33709.1|         7.3e-52    51.36      retrotransposon gag protein [Cucumis melo subsp. melo]
gi|659118742|ref|XP_008459280.1|    4.6e-46    48.82      PREDICTED: uncharacterized protein LOC103498458 [Cucumis melo]
gi|848894615|ref|XP_012847343.1|    7.8e-46    50.72      PREDICTED: uncharacterized protein LOC105967287 [Erythranthe guttata]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term    Definition
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0008150        biological_process
cellular_component    GO:0005575        cellular_component
molecular_function    GO:0003674        molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
ClCG09G014950.1    ClCG09G014950.1    mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term    IPR Description     Source     Source Term    Source Description    Alignment
None        No IPR available    unknown    Coil           Coil                  coord: 320..340
scor

The following gene(s) are orthologous to this gene:
Gene             Orthologue    Organism                 Block
ClCG09G014950    Cla004389     Watermelon (97103) v1    wcgwmB012
The following gene(s) are paralogous to this gene:

None