ClCG05G015920 (gene) Watermelon (Charleston Gray)

Name: ClCG05G015920
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Retrotransposon protein, putative, Ty1-copia subclass
Location: CG_Chr05 : 27901727 .. 27903495 (-)
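The Location field above can be split into its parts programmatically. The following is a minimal sketch, not the database's own parser: the function names `parse_location` and `span_length` are hypothetical, the pattern assumes the `<seqid> : <start> .. <end> (<strand>)` layout shown on this page, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re

# Assumed layout of the Location value: "<seqid> : <start> .. <end> (<strand>)".
LOCATION_RE = re.compile(r"^(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a Location string into (seqid, start, end, strand)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("CG_Chr05 : 27901727 .. 27903495 (-)")
print(seqid, strand, span_length(start, end))
```

The `(-)` strand marker means the feature lies on the reverse strand of CG_Chr05; the sequences listed on this page start with ATG, so they appear to be given in transcript orientation already.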
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTTGACGCCTCGATCTACACTCAAATTGCTGACTCATCTAAGGAGTCGGCTACACTTGAAAGCAAAGGAGGAATAAGCAAAGTGCTCAACACCTACCAAGTCAAATCTAGGTTATCCCCAGTCGGAACTTCATCCTTCGTAAACTTACTAACCAGGTGACTACCATCAAATTAGATCGAACTAACTTCCTTCTTTGGCAGAGTATTGTGATTCCCATATTAAAAAGCTATAAGCGTGAAGGACATATGTCTGGAAAAACCCCTACTCTTGAAATGTCGATCATTCTTTCTCCATCTAACGATGAACTTGAAGGACTATTAATGCCAAATCCAAAGTATGATATCTGGATAGCAGCTGACCAATTATTGGTAGGCAGGCTTTACAGCTCCATGACCCTGAGGTGGTTGTTTAAGTGATGGGGTATGATGAAGCAAAACCATTATGTGATGCAGTCCAAGAGTACTATGGAGTTCAATCAAGATCCAAGAAGATTTAATCGTCTGATGCTGCAACAGACAAGAAAGGGAATAATGAGAATGCACGAGTATCTTGACATTATGAAAAGGTATATCGACAACCTTGCTATTGTTGGTTCTCCCATGGATATGAGGAGTTTTGTCTCTCATGTTACAACCGGTTTGGATGAGGAAGATAATGTTGTGGTGTGTGTTATGAGAAGCAAAGATCTGACTTTGAGTCAAATTCAACTGGAATTGATTGCTTTTAAGCAACGTCATGAACAACTTGAAAAGTTCAAGAATGTTGTGTCTATGAATCAAGTGTCTACAAACCGTGCAAAGCCTGAAAATCCACCAACAAACAACCATCAAAGCATTCCTCATTCAAATCAGAATAATAGAGGCTTTTATCCTTCGAAAAGAGGAGGTCACGGCAGAGGAAGAAGTAACAGGTACTCTCTTTATCCCCTGACAATCGACTCATTTGTCAAGTGTGTGGCAAAATGAGACATACTGCTACGATTTGTTTTCATAGATACGACAAACCAGAAAATACCACCTCAACCAATATGGGAAGAAACTCTATTCTACAAACCCGTTCTACAAATGAATCTCCCTCAGCTTTGATGGCTTTTCCTGAAACCCTACAAGATCCATCTTGGTATTTGGACAGGGAGCAAGCAACCATGTTACTAGTGAACTTGGAAATTTGTCTCTAAAAGGTAATCACTTATCTATAAAAAATATAGTTGTTGTTGATGGTACTAAGGTACCAACTACCACTGTTGGATCTGGTTATATTGAAACTTTGAATGGGTAGTTATTTCTTAAAGATGTGTTAGTAGTTCCTCTATTACAAAAGAAACTTCTTAGCATCTCACAATTAGTTCATAATAATCCAGTGATAGTTGAGTTTGATAACCTTTTTTGCTATGTTAAGGAAAGGAAGTCCAAGGAAGTTTGTCTGGTAGGTTGTCGTGAGCATGGGCTGTATCGGTAGATGGGAGTAAATGAAGTCTCAATTTTAGTGAATAAAGAAGCTGAAGACAGAGTCTGTATGTCAGTAACCCCTACCATGAAGATTTCAAGCAAGAATGGAAATGAAGATTTGGTTTCTTTTGTTAACTATGTATTGTTTACTTGTACTCTTGATGTTTGGCACCAAAGATTGGGTCATCCATCTCCTAGAATCGTAAATCGAATCTTACATAGTGTAATGCTTCTATCGAAAGTAATGAATGATAAGTTCTCTTTTTGTGAGGCCTGTAAATTTAGAAAACTACGCAAGTTACCTTTTACTCTTTCTGA

mRNA sequence

ATGGTTGACGCCTCGATCTACACTCAAATTGCTGACTCATCTAAGGAGTCGGCTACACTTGAAAGCAAAGGAGGAATAAGCAAAGTGCTCAACACCTACCAAGTCAAATCTAGGTTATCCCCAGTCGGAACTTCATCCTTCAGTATTGTGATTCCCATATTAAAAAGCTATAAGCGTGAAGGACATATGTCTGGAAAAACCCCTACTCTTGAAATGTCGATCATTCTTTCTCCATCTAACGATGAACTTGAAGGACTATTAATGCCAAATCCAAAGTATGATATCTGGATAGCAGCTGACCAATTATTGTCCAAGAGTACTATGGAGTTCAATCAAGATCCAAGAAGATTTAATCGTCTGATGCTGCAACAGACAAGAAAGGGAATAATGAGAATGCACGAGTATCTTGACATTATGAAAAGGTATATCGACAACCTTGCTATTGTTGGTTCTCCCATGGATATGAGGAGTTTTGTCTCTCATGTTACAACCGGTTTGGATGAGGAAGATAATGTTGTGGTGTGTGTTATGAGAAGCAAAGATCTGACTTTGAGTCAAATTCAACTGGAATTGATTGCTTTTAAGCAACGTCATGAACAACTTGAAAAGTTCAAGAATGTTGTGTCTATGAATCAAGTGTCTACAAACCGTGCAAAGCCTGAAAATCCACCAACAAACAACCATCAAAGCATTCCTCATTCAAATCAGAATAATAGAGGCTTTTATCCTTCGAAAAGAGGAGGTCACGGCAGAGGAAGAAATACGACAAACCAGAAAATACCACCTCAACCAATATGGGAAGAAACTCTATTCTACAAACCCGGAGCAAGCAACCATGTTACTAGTGAACTTGGAAATTTGTCTCTAAAAGGTAATCACTTATCTATAAAAAATATAGTTGTTGTTGATGGTACTAAGGTACCAACTACCACTGTTGGATCTGGTTATATTGAAACTTTGAATGGCATCTCACAATTAGTTCATAATAATCCAGTGATAGTTGAGTTTGATAACCTTTTTTGCTATGTTAAGGAAAGGAAGTCCAAGGAAGTTTGTCTGATGGGAGTAAATGAAGTCTCAATTTTAGTGAATAAAGAAGCTGAAGACAGAGTCTGTATGTCAGTAACCCCTACCATGAAGATTTCAAGCAAGAATGGAAATGAAGATTTGGTTTCTTTTGTTAACTATGTATTGTTTACTTGTACTCTTGATGTTTGGCACCAAAGATTGGGTCATCCATCTCCTAGAATCAAAACTACGCAAGTTACCTTTTACTCTTTCTGA

Coding sequence (CDS)

ATGGTTGACGCCTCGATCTACACTCAAATTGCTGACTCATCTAAGGAGTCGGCTACACTTGAAAGCAAAGGAGGAATAAGCAAAGTGCTCAACACCTACCAAGTCAAATCTAGGTTATCCCCAGTCGGAACTTCATCCTTCAGTATTGTGATTCCCATATTAAAAAGCTATAAGCGTGAAGGACATATGTCTGGAAAAACCCCTACTCTTGAAATGTCGATCATTCTTTCTCCATCTAACGATGAACTTGAAGGACTATTAATGCCAAATCCAAAGTATGATATCTGGATAGCAGCTGACCAATTATTGTCCAAGAGTACTATGGAGTTCAATCAAGATCCAAGAAGATTTAATCGTCTGATGCTGCAACAGACAAGAAAGGGAATAATGAGAATGCACGAGTATCTTGACATTATGAAAAGGTATATCGACAACCTTGCTATTGTTGGTTCTCCCATGGATATGAGGAGTTTTGTCTCTCATGTTACAACCGGTTTGGATGAGGAAGATAATGTTGTGGTGTGTGTTATGAGAAGCAAAGATCTGACTTTGAGTCAAATTCAACTGGAATTGATTGCTTTTAAGCAACGTCATGAACAACTTGAAAAGTTCAAGAATGTTGTGTCTATGAATCAAGTGTCTACAAACCGTGCAAAGCCTGAAAATCCACCAACAAACAACCATCAAAGCATTCCTCATTCAAATCAGAATAATAGAGGCTTTTATCCTTCGAAAAGAGGAGGTCACGGCAGAGGAAGAAATACGACAAACCAGAAAATACCACCTCAACCAATATGGGAAGAAACTCTATTCTACAAACCCGGAGCAAGCAACCATGTTACTAGTGAACTTGGAAATTTGTCTCTAAAAGGTAATCACTTATCTATAAAAAATATAGTTGTTGTTGATGGTACTAAGGTACCAACTACCACTGTTGGATCTGGTTATATTGAAACTTTGAATGGCATCTCACAATTAGTTCATAATAATCCAGTGATAGTTGAGTTTGATAACCTTTTTTGCTATGTTAAGGAAAGGAAGTCCAAGGAAGTTTGTCTGATGGGAGTAAATGAAGTCTCAATTTTAGTGAATAAAGAAGCTGAAGACAGAGTCTGTATGTCAGTAACCCCTACCATGAAGATTTCAAGCAAGAATGGAAATGAAGATTTGGTTTCTTTTGTTAACTATGTATTGTTTACTTGTACTCTTGATGTTTGGCACCAAAGATTGGGTCATCCATCTCCTAGAATCAAAACTACGCAAGTTACCTTTTACTCTTTCTGA

Protein sequence

MVDASIYTQIADSSKESATLESKGGISKVLNTYQVKSRLSPVGTSSFSIVIPILKSYKREGHMSGKTPTLEMSIILSPSNDELEGLLMPNPKYDIWIAADQLLSKSTMEFNQDPRRFNRLMLQQTRKGIMRMHEYLDIMKRYIDNLAIVGSPMDMRSFVSHVTTGLDEEDNVVVCVMRSKDLTLSQIQLELIAFKQRHEQLEKFKNVVSMNQVSTNRAKPENPPTNNHQSIPHSNQNNRGFYPSKRGGHGRGRNTTNQKIPPQPIWEETLFYKPGASNHVTSELGNLSLKGNHLSIKNIVVVDGTKVPTTTVGSGYIETLNGISQLVHNNPVIVEFDNLFCYVKERKSKEVCLMGVNEVSILVNKEAEDRVCMSVTPTMKISSKNGNEDLVSFVNYVLFTCTLDVWHQRLGHPSPRIKTTQVTFYSF
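The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the helper name `translate` is hypothetical, and only the first 30 and last 15 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))


# First 30 nt and last 15 nt of the CDS listed above.
print(translate("ATGGTTGACGCCTCGATCTACACTCAAATT"))
print(translate("TTTTACTCTTTCTGA"))
```

The two fragments translate to the start (`MVDASIYTQI`) and the end (`FYSF`, followed by the stop codon) of the protein sequence listed above. Passing the full CDS string to `translate` should reproduce the whole protein in the same way.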
BLAST of ClCG05G015920 vs. TrEMBL
Match: A0A151SLL4_CAJCA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=KK1_001901 PE=4 SV=1)

HSP 1 Score: 79.0 bits (193), Expect = 1.6e-11
Identity = 85/340 (25.00%), Positives = 138/340 (40.59%), Query Frame = 1

Query: 122 LQQTRKGIMRMHEYLDIMKRYIDNLAIVGSPMDMRSFVSHVTTGLDEEDNVVVCVMRSKD 181
           L  + KG   +  YL  ++   ++L+++   +D    V H   GLD         +R++D
Sbjct: 111 LNSSTKGSSTVIAYLQSIRSIAEDLSLIIHAVDDIDLVIHTLNGLDPSFREFTTSIRTRD 170

Query: 182 LTLS--QIQLELIAFKQRHEQLEKFKNVVSM--NQVSTNRAKPENPPTNNHQSIPHSN-- 241
             +S  ++ ++L+ ++   ++ E F +  S+  N V+  R    N   NN+ S   S   
Sbjct: 171 TPISFDELSIKLLDYEMYLKRDEHFNHQPSITANYVNQGRFHRSNKGRNNNHSSSSSTSL 230

Query: 242 --QNNRGFYPS------KRGGHGRG---RNTTNQKIPPQPIWEET--------LFYKPGA 301
             +NN             + GHG     + T N K    P+   T          +  GA
Sbjct: 231 GKRNNSSSNSDIVCQLCSKKGHGAQTCYKFTKNNKQSSNPVAYATHATTPPPEWLFNSGA 290

Query: 302 SNHVTSELGNLSLKGNHLSIKNIVVVDGTKVPTTTVGS-------------------GYI 361
           S+H+T++L NLSL   +     + V +G  +P T VGS                   G  
Sbjct: 291 SHHITNDLNNLSLTSTYTGNDKLYVANGMSLPITHVGSTTLHPPTRPLSFTNVLYAPGIT 350

Query: 362 ETLNGISQLVHNNPVIVEFDNLFCYVKERKSKEVCLMGVNEVSILVNKEAEDRVCMSVTP 418
           + L  +SQL + N V +EF   F  VK+        MG N    L+    +D V      
Sbjct: 351 QNLISVSQLCNTNDVSIEFFPSFFEVKDLS------MGAN----LLRGPKDDHVY----- 410
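The percentages in the HSP header of the alignment above are the match counts divided by the alignment length. The sketch below reproduces them from the counts reported on this page; the function name `percent` is hypothetical, and formatting to two decimal places is an assumption chosen to match the figures shown.

```python
def percent(count, aligned_length):
    """Return count / aligned_length as a percentage string with two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return f"{100 * count / aligned_length:.2f}"


# Counts taken from the HSP headers on this page.
print(percent(85, 340), percent(138, 340))   # Cajanus cajan hit
print(percent(86, 379), percent(143, 379))   # Arabidopsis thaliana hit
```

Identity counts exact residue matches, while the second figure also counts conservative substitutions (the `+` symbols in the middle line of the alignment), which is why it is always at least as large as the identity.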

BLAST of ClCG05G015920 vs. TrEMBL
Match: Q94HW7_ARATH (Polyprotein OS=Arabidopsis thaliana GN=T4M14.20 PE=4 SV=1)

HSP 1 Score: 75.9 bits (185), Expect = 1.4e-10
Identity = 86/379 (22.69%), Positives = 143/379 (37.73%), Query Frame = 1

Query: 119 RLMLQQTRKGIMRMHEYLDIMKRYIDNLAIVGSPMDMRSFVSHVTTGLDEEDNVVVCVMR 178
           R  L+Q  KG   + +Y+  +    D LA++G PMD    V  V   L EE   V+  + 
Sbjct: 129 RTQLKQWTKGTKTIDDYMQGLVTRFDQLALLGKPMDHDEQVERVLENLPEEYKPVIDQIA 188

Query: 179 SKDL--TLSQIQLELIAFKQRHEQLEKFKNV-VSMNQVSTNRAKPENPPTNNHQSIPHSN 238
           +KD   TL++I   L+  + +   +     + ++ N VS       N   N +++  + N
Sbjct: 189 AKDTPPTLTEIHERLLNHESKILAVSSATVIPITANAVSHRNTTTTNNNNNGNRNNRYDN 248

Query: 239 QNNRG-----------FYPSKRG--------------GHGRGR--------NTTNQKIPP 298
           +NN             F+P+                 GH   R        ++ N + PP
Sbjct: 249 RNNNNNSKPWQQSSTNFHPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLSSVNSQQPP 308

Query: 299 QPI--WE-------------ETLFYKPGASNHVTSELGNLSLKGNHLSIKNIVVVDGTKV 358
            P   W+                    GA++H+TS+  NLSL   +    +++V DG+ +
Sbjct: 309 SPFTPWQPRANLALGSPYSSNNWLLDSGATHHITSDFNNLSLHQPYTGGDDVMVADGSTI 368

Query: 359 PTTTVGSGYIET-------------------LNGISQLVHNNPVIVEFDNLFCYVKERKS 418
           P +  GS  + T                   L  + +L + N V VEF      VK+  +
Sbjct: 369 PISHTGSTSLSTKSRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNT 428

Query: 419 KEVCLMGVNEVSILVNKEAEDR-VCMSVTPTMKISSKNGNEDLVSFVNYVLFTCTLDVWH 427
               L G  +  +     A  + V +  +P+ K                     T   WH
Sbjct: 429 GVPLLQGKTKDELYEWPIASSQPVSLFASPSSK--------------------ATHSSWH 487

BLAST of ClCG05G015920 vs. TrEMBL
Match: Q94HW2_ARATH (Polyprotein OS=Arabidopsis thaliana GN=T4M14.18 PE=4 SV=1)

HSP 1 Score: 75.9 bits (185), Expect = 1.4e-10
Identity = 86/379 (22.69%), Positives = 143/379 (37.73%), Query Frame = 1

Query: 119 RLMLQQTRKGIMRMHEYLDIMKRYIDNLAIVGSPMDMRSFVSHVTTGLDEEDNVVVCVMR 178
           R  L+Q  KG   + +Y+  +    D LA++G PMD    V  V   L EE   V+  + 
Sbjct: 129 RTQLKQWTKGTKTIDDYMQGLVTRFDQLALLGKPMDHDEQVERVLENLPEEYKPVIDQIA 188

Query: 179 SKDL--TLSQIQLELIAFKQRHEQLEKFKNV-VSMNQVSTNRAKPENPPTNNHQSIPHSN 238
           +KD   TL++I   L+  + +   +     + ++ N VS       N   N +++  + N
Sbjct: 189 AKDTPPTLTEIHERLLNHESKILAVSSATVIPITANAVSHRNTTTTNNNNNGNRNNRYDN 248

Query: 239 QNNRG-----------FYPSKRG--------------GHGRGR--------NTTNQKIPP 298
           +NN             F+P+                 GH   R        ++ N + PP
Sbjct: 249 RNNNNNSKPWQQSSTNFHPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLSSVNSQQPP 308

Query: 299 QPI--WE-------------ETLFYKPGASNHVTSELGNLSLKGNHLSIKNIVVVDGTKV 358
            P   W+                    GA++H+TS+  NLSL   +    +++V DG+ +
Sbjct: 309 SPFTPWQPRANLALGSPYSSNNWLLDSGATHHITSDFNNLSLHQPYTGGDDVMVADGSTI 368

Query: 359 PTTTVGSGYIET-------------------LNGISQLVHNNPVIVEFDNLFCYVKERKS 418
           P +  GS  + T                   L  + +L + N V VEF      VK+  +
Sbjct: 369 PISHTGSTSLSTKSRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNT 428

Query: 419 KEVCLMGVNEVSILVNKEAEDR-VCMSVTPTMKISSKNGNEDLVSFVNYVLFTCTLDVWH 427
               L G  +  +     A  + V +  +P+ K                     T   WH
Sbjct: 429 GVPLLQGKTKDELYEWPIASSQPVSLFASPSSK--------------------ATHSSWH 487

BLAST of ClCG05G015920 vs. TrEMBL
Match: F7J130_ARATH (Polyprotein OS=Arabidopsis thaliana PE=4 SV=1)

HSP 1 Score: 75.1 bits (183), Expect = 2.3e-10
Identity = 90/380 (23.68%), Positives = 145/380 (38.16%), Query Frame = 1

Query: 119 RLMLQQTRKGIMRMHEYLDIMKRYIDNLAIVGSPMDMRSFVSHVTTGLDEEDNVVVCVMR 178
           R  L+Q  KG   + +Y+     + D LA++G PMD    V  V   L EE   V+  + 
Sbjct: 110 RTQLKQWTKGTKTIDDYMQGFVTHFDQLALLGKPMDHDEQVERVLENLPEEYKPVIDQIA 169

Query: 179 SKDL--TLSQIQLELIAFKQRHEQLEKFK------NVVSMNQVST-------NRAKPENP 238
           +KD   TL++I   L+  + +   +          N VS    +T       NR    + 
Sbjct: 170 AKDTPPTLTEIHERLLNQESKILAVSSATVIPITANAVSHRNTTTTTNNNNGNRTNRYDN 229

Query: 239 PTNNHQSIPHSNQNNRGFYPSKRG--------------GHGRGR--------NTTNQKIP 298
             NN+ S P   Q++  F P+                 GH   R        ++ N + P
Sbjct: 230 RNNNNNSKPWQ-QSSSNFRPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLSSVNSQQP 289

Query: 299 PQP--IWE-------------ETLFYKPGASNHVTSELGNLSLKGNHLSIKNIVVVDGTK 358
           P P  +W+              +     GA++H+TS+  NLSL   +    +++VVDG+ 
Sbjct: 290 PSPFTLWQPRANLALGSPYSSNSWLLDSGATHHITSDFNNLSLHQPYTGGDDVMVVDGST 349

Query: 359 VPTTTVGSGYIET-------------------LNGISQLVHNNPVIVEFDNLFCYVKERK 418
           +P +  GS  + T                   L  + +L + N V VEF      VK+  
Sbjct: 350 IPISHTGSTSLSTKSRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFLASFQVKDLN 409

Query: 419 SKEVCLMGVNEVSILVNKEAEDR-VCMSVTPTMKISSKNGNEDLVSFVNYVLFTCTLDVW 427
           +    L G  +  +     A  + V +  +P+ K                     T   W
Sbjct: 410 TGVPLLQGKTKDELYEWPIASSQPVSLFASPSSK--------------------ATHSSW 468

BLAST of ClCG05G015920 vs. TrEMBL
Match: F7J134_ARATH (Polyprotein OS=Arabidopsis thaliana PE=4 SV=1)

HSP 1 Score: 75.1 bits (183), Expect = 2.3e-10
Identity = 86/379 (22.69%), Positives = 142/379 (37.47%), Query Frame = 1

Query: 119 RLMLQQTRKGIMRMHEYLDIMKRYIDNLAIVGSPMDMRSFVSHVTTGLDEEDNVVVCVMR 178
           R  L+Q  KG   + +Y+       D LA++G PMD    V  V   L EE   V+  + 
Sbjct: 129 RTQLKQWTKGTKTIDDYMQGFVTRFDQLALLGKPMDHDEQVERVLENLPEEYKPVIDQIA 188

Query: 179 SKDL--TLSQIQLELIAFKQRHEQLEKFKNV-VSMNQVSTNRAKPENPPTNNHQSIPHSN 238
           +KD   TL++I   L+  + +   +     + ++ N VS       N   N +++  + N
Sbjct: 189 AKDTPPTLTEIHERLLNHESKILAVSSATVIPITANAVSHRNTTTTNNNNNGNRNNRYDN 248

Query: 239 QNNRG-----------FYPSKRG--------------GHGRGR--------NTTNQKIPP 298
           +NN             F+P+                 GH   R        ++ N + PP
Sbjct: 249 RNNNNNSKPWQQSSTNFHPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLSSVNSQQPP 308

Query: 299 QPI--WE-------------ETLFYKPGASNHVTSELGNLSLKGNHLSIKNIVVVDGTKV 358
            P   W+                    GA++H+TS+  NLSL   +    +++V DG+ +
Sbjct: 309 SPFTPWQPRANLALGSPYSSNNWLLDSGATHHITSDFNNLSLHQPYTGGDDVMVADGSTI 368

Query: 359 PTTTVGSGYIET-------------------LNGISQLVHNNPVIVEFDNLFCYVKERKS 418
           P +  GS  + T                   L  + +L + N V VEF      VK+  +
Sbjct: 369 PISHTGSTSLSTKSRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNT 428

Query: 419 KEVCLMGVNEVSILVNKEAEDR-VCMSVTPTMKISSKNGNEDLVSFVNYVLFTCTLDVWH 427
               L G  +  +     A  + V +  +P+ K                     T   WH
Sbjct: 429 GVPLLQGKTKDELYEWPIASSQPVSLFASPSSK--------------------ATHSSWH 487

BLAST of ClCG05G015920 vs. NCBI nr
Match: gi|1012344486|gb|KYP55678.1| (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 79.0 bits (193), Expect = 2.3e-11
Identity = 85/340 (25.00%), Positives = 138/340 (40.59%), Query Frame = 1

Query: 122 LQQTRKGIMRMHEYLDIMKRYIDNLAIVGSPMDMRSFVSHVTTGLDEEDNVVVCVMRSKD 181
           L  + KG   +  YL  ++   ++L+++   +D    V H   GLD         +R++D
Sbjct: 111 LNSSTKGSSTVIAYLQSIRSIAEDLSLIIHAVDDIDLVIHTLNGLDPSFREFTTSIRTRD 170

Query: 182 LTLS--QIQLELIAFKQRHEQLEKFKNVVSM--NQVSTNRAKPENPPTNNHQSIPHSN-- 241
             +S  ++ ++L+ ++   ++ E F +  S+  N V+  R    N   NN+ S   S   
Sbjct: 171 TPISFDELSIKLLDYEMYLKRDEHFNHQPSITANYVNQGRFHRSNKGRNNNHSSSSSTSL 230

Query: 242 --QNNRGFYPS------KRGGHGRG---RNTTNQKIPPQPIWEET--------LFYKPGA 301
             +NN             + GHG     + T N K    P+   T          +  GA
Sbjct: 231 GKRNNSSSNSDIVCQLCSKKGHGAQTCYKFTKNNKQSSNPVAYATHATTPPPEWLFNSGA 290

Query: 302 SNHVTSELGNLSLKGNHLSIKNIVVVDGTKVPTTTVGS-------------------GYI 361
           S+H+T++L NLSL   +     + V +G  +P T VGS                   G  
Sbjct: 291 SHHITNDLNNLSLTSTYTGNDKLYVANGMSLPITHVGSTTLHPPTRPLSFTNVLYAPGIT 350

Query: 362 ETLNGISQLVHNNPVIVEFDNLFCYVKERKSKEVCLMGVNEVSILVNKEAEDRVCMSVTP 418
           + L  +SQL + N V +EF   F  VK+        MG N    L+    +D V      
Sbjct: 351 QNLISVSQLCNTNDVSIEFFPSFFEVKDLS------MGAN----LLRGPKDDHVY----- 410

BLAST of ClCG05G015920 vs. NCBI nr
Match: gi|14475946|gb|AAK62793.1|AC027036_14 (polyprotein, putative [Arabidopsis thaliana])

HSP 1 Score: 75.9 bits (185), Expect = 1.9e-10
Identity = 86/379 (22.69%), Positives = 143/379 (37.73%), Query Frame = 1

Query: 119 RLMLQQTRKGIMRMHEYLDIMKRYIDNLAIVGSPMDMRSFVSHVTTGLDEEDNVVVCVMR 178
           R  L+Q  KG   + +Y+  +    D LA++G PMD    V  V   L EE   V+  + 
Sbjct: 129 RTQLKQWTKGTKTIDDYMQGLVTRFDQLALLGKPMDHDEQVERVLENLPEEYKPVIDQIA 188

Query: 179 SKDL--TLSQIQLELIAFKQRHEQLEKFKNV-VSMNQVSTNRAKPENPPTNNHQSIPHSN 238
           +KD   TL++I   L+  + +   +     + ++ N VS       N   N +++  + N
Sbjct: 189 AKDTPPTLTEIHERLLNHESKILAVSSATVIPITANAVSHRNTTTTNNNNNGNRNNRYDN 248

Query: 239 QNNRG-----------FYPSKRG--------------GHGRGR--------NTTNQKIPP 298
           +NN             F+P+                 GH   R        ++ N + PP
Sbjct: 249 RNNNNNSKPWQQSSTNFHPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLSSVNSQQPP 308

Query: 299 QPI--WE-------------ETLFYKPGASNHVTSELGNLSLKGNHLSIKNIVVVDGTKV 358
            P   W+                    GA++H+TS+  NLSL   +    +++V DG+ +
Sbjct: 309 SPFTPWQPRANLALGSPYSSNNWLLDSGATHHITSDFNNLSLHQPYTGGDDVMVADGSTI 368

Query: 359 PTTTVGSGYIET-------------------LNGISQLVHNNPVIVEFDNLFCYVKERKS 418
           P +  GS  + T                   L  + +L + N V VEF      VK+  +
Sbjct: 369 PISHTGSTSLSTKSRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNT 428

Query: 419 KEVCLMGVNEVSILVNKEAEDR-VCMSVTPTMKISSKNGNEDLVSFVNYVLFTCTLDVWH 427
               L G  +  +     A  + V +  +P+ K                     T   WH
Sbjct: 429 GVPLLQGKTKDELYEWPIASSQPVSLFASPSSK--------------------ATHSSWH 487

BLAST of ClCG05G015920 vs. NCBI nr
Match: gi|14475941|gb|AAK62788.1|AC027036_9 (polyprotein, putative [Arabidopsis thaliana])

HSP 1 Score: 75.9 bits (185), Expect = 1.9e-10
Identity = 86/379 (22.69%), Positives = 143/379 (37.73%), Query Frame = 1

Query: 119 RLMLQQTRKGIMRMHEYLDIMKRYIDNLAIVGSPMDMRSFVSHVTTGLDEEDNVVVCVMR 178
           R  L+Q  KG   + +Y+  +    D LA++G PMD    V  V   L EE   V+  + 
Sbjct: 129 RTQLKQWTKGTKTIDDYMQGLVTRFDQLALLGKPMDHDEQVERVLENLPEEYKPVIDQIA 188

Query: 179 SKDL--TLSQIQLELIAFKQRHEQLEKFKNV-VSMNQVSTNRAKPENPPTNNHQSIPHSN 238
           +KD   TL++I   L+  + +   +     + ++ N VS       N   N +++  + N
Sbjct: 189 AKDTPPTLTEIHERLLNHESKILAVSSATVIPITANAVSHRNTTTTNNNNNGNRNNRYDN 248

Query: 239 QNNRG-----------FYPSKRG--------------GHGRGR--------NTTNQKIPP 298
           +NN             F+P+                 GH   R        ++ N + PP
Sbjct: 249 RNNNNNSKPWQQSSTNFHPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLSSVNSQQPP 308

Query: 299 QPI--WE-------------ETLFYKPGASNHVTSELGNLSLKGNHLSIKNIVVVDGTKV 358
            P   W+                    GA++H+TS+  NLSL   +    +++V DG+ +
Sbjct: 309 SPFTPWQPRANLALGSPYSSNNWLLDSGATHHITSDFNNLSLHQPYTGGDDVMVADGSTI 368

Query: 359 PTTTVGSGYIET-------------------LNGISQLVHNNPVIVEFDNLFCYVKERKS 418
           P +  GS  + T                   L  + +L + N V VEF      VK+  +
Sbjct: 369 PISHTGSTSLSTKSRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNT 428

Query: 419 KEVCLMGVNEVSILVNKEAEDR-VCMSVTPTMKISSKNGNEDLVSFVNYVLFTCTLDVWH 427
               L G  +  +     A  + V +  +P+ K                     T   WH
Sbjct: 429 GVPLLQGKTKDELYEWPIASSQPVSLFASPSSK--------------------ATHSSWH 487

BLAST of ClCG05G015920 vs. NCBI nr
Match: gi|4996365|dbj|BAA78425.1| (polyprotein [Arabidopsis thaliana])

HSP 1 Score: 75.1 bits (183), Expect = 3.3e-10
Identity = 90/380 (23.68%), Positives = 145/380 (38.16%), Query Frame = 1

Query: 119 RLMLQQTRKGIMRMHEYLDIMKRYIDNLAIVGSPMDMRSFVSHVTTGLDEEDNVVVCVMR 178
           R  L+Q  KG   + +Y+     + D LA++G PMD    V  V   L EE   V+  + 
Sbjct: 110 RTQLKQWTKGTKTIDDYMQGFVTHFDQLALLGKPMDHDEQVERVLENLPEEYKPVIDQIA 169

Query: 179 SKDL--TLSQIQLELIAFKQRHEQLEKFK------NVVSMNQVST-------NRAKPENP 238
           +KD   TL++I   L+  + +   +          N VS    +T       NR    + 
Sbjct: 170 AKDTPPTLTEIHERLLNQESKILAVSSATVIPITANAVSHRNTTTTTNNNNGNRTNRYDN 229

Query: 239 PTNNHQSIPHSNQNNRGFYPSKRG--------------GHGRGR--------NTTNQKIP 298
             NN+ S P   Q++  F P+                 GH   R        ++ N + P
Sbjct: 230 RNNNNNSKPWQ-QSSSNFRPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLSSVNSQQP 289

Query: 299 PQP--IWE-------------ETLFYKPGASNHVTSELGNLSLKGNHLSIKNIVVVDGTK 358
           P P  +W+              +     GA++H+TS+  NLSL   +    +++VVDG+ 
Sbjct: 290 PSPFTLWQPRANLALGSPYSSNSWLLDSGATHHITSDFNNLSLHQPYTGGDDVMVVDGST 349

Query: 359 VPTTTVGSGYIET-------------------LNGISQLVHNNPVIVEFDNLFCYVKERK 418
           +P +  GS  + T                   L  + +L + N V VEF      VK+  
Sbjct: 350 IPISHTGSTSLSTKSRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFLASFQVKDLN 409

Query: 419 SKEVCLMGVNEVSILVNKEAEDR-VCMSVTPTMKISSKNGNEDLVSFVNYVLFTCTLDVW 427
           +    L G  +  +     A  + V +  +P+ K                     T   W
Sbjct: 410 TGVPLLQGKTKDELYEWPIASSQPVSLFASPSSK--------------------ATHSSW 468

BLAST of ClCG05G015920 vs. NCBI nr
Match: gi|338746555|dbj|BAK41507.1| (polyprotein [Arabidopsis thaliana])

HSP 1 Score: 75.1 bits (183), Expect = 3.3e-10
Identity = 90/380 (23.68%), Positives = 145/380 (38.16%), Query Frame = 1

Query: 119 RLMLQQTRKGIMRMHEYLDIMKRYIDNLAIVGSPMDMRSFVSHVTTGLDEEDNVVVCVMR 178
           R  L+Q  KG   + +Y+     + D LA++G PMD    V  V   L EE   V+  + 
Sbjct: 110 RTQLKQWTKGTKTIDDYMQGFVTHFDQLALLGKPMDHDEQVERVLENLPEEYKPVIDQIA 169

Query: 179 SKDL--TLSQIQLELIAFKQRHEQLEKFK------NVVSMNQVST-------NRAKPENP 238
           +KD   TL++I   L+  + +   +          N VS    +T       NR    + 
Sbjct: 170 AKDTPPTLTEIHERLLNQESKILAVSSATVIPITANAVSHRNTTTTTNNNNGNRTNRYDN 229

Query: 239 PTNNHQSIPHSNQNNRGFYPSKRG--------------GHGRGR--------NTTNQKIP 298
             NN+ S P   Q++  F P+                 GH   R        ++ N + P
Sbjct: 230 RNNNNNSKPWQ-QSSSNFRPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLSSVNSQQP 289

Query: 299 PQP--IWE-------------ETLFYKPGASNHVTSELGNLSLKGNHLSIKNIVVVDGTK 358
           P P  +W+              +     GA++H+TS+  NLSL   +    +++VVDG+ 
Sbjct: 290 PSPFTLWQPRANLALGSPYSSNSWLLDSGATHHITSDFNNLSLHQPYTGGDDVMVVDGST 349

Query: 359 VPTTTVGSGYIET-------------------LNGISQLVHNNPVIVEFDNLFCYVKERK 418
           +P +  GS  + T                   L  + +L + N V VEF      VK+  
Sbjct: 350 IPISHTGSTSLSTKSRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFLASFQVKDLN 409

Query: 419 SKEVCLMGVNEVSILVNKEAEDR-VCMSVTPTMKISSKNGNEDLVSFVNYVLFTCTLDVW 427
           +    L G  +  +     A  + V +  +P+ K                     T   W
Sbjct: 410 TGVPLLQGKTKDELYEWPIASSQPVSLFASPSSK--------------------ATHSSW 468

The following BLAST results are available for this feature:
Match Name        E-value  Identity (%)  Description
A0A151SLL4_CAJCA  1.6e-11  25.00         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=...
Q94HW7_ARATH      1.4e-10  22.69         Polyprotein OS=Arabidopsis thaliana GN=T4M14.20 PE=4 SV=1
Q94HW2_ARATH      1.4e-10  22.69         Polyprotein OS=Arabidopsis thaliana GN=T4M14.18 PE=4 SV=1
F7J130_ARATH      2.3e-10  23.68         Polyprotein OS=Arabidopsis thaliana PE=4 SV=1
F7J134_ARATH      2.3e-10  22.69         Polyprotein OS=Arabidopsis thaliana PE=4 SV=1

Match Name                             E-value  Identity (%)  Description
gi|1012344486|gb|KYP55678.1|           2.3e-11  25.00         Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]
gi|14475946|gb|AAK62793.1|AC027036_14  1.9e-10  22.69         polyprotein, putative [Arabidopsis thaliana]
gi|14475941|gb|AAK62788.1|AC027036_9   1.9e-10  22.69         polyprotein, putative [Arabidopsis thaliana]
gi|4996365|dbj|BAA78425.1|             3.3e-10  23.68         polyprotein [Arabidopsis thaliana]
gi|338746555|dbj|BAK41507.1|           3.3e-10  23.68         polyprotein [Arabidopsis thaliana]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG05G015920.1  ClCG05G015920.1  mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term  IPR Description   Source   Source Term      Source Description               Alignment
None      No IPR available  PANTHER  PTHR11439        GAG-POL-RELATED RETROTRANSPOSON  coord: 403..417, score: 6.0E-20; coord: 119..369, score: 6.0
None      No IPR available  PANTHER  PTHR11439:SF185  SUBFAMILY NOT NAMED              coord: 119..369, score: 6.0E-20; coord: 403..417, score: 6.0

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None