ClCG03G009350 (gene) Watermelon (Charleston Gray)

NameClCG03G009350
Typegene
OrganismCitrullus lanatus (Watermelon (Charleston Gray))
DescriptionRetrotransposon protein, putative, Ty1-copia subclass
LocationCG_Chr03 : 12410023 .. 12411859 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCAAACACGACGCTCACCGGAAATGCCTCTACCTCCGAACTAGTGAAGTTCAGCAACCCACCCTTGAATTAGCTCTTGAATCAACTCACGCCCTATTCTTCGAAGCTACAAGCTTAAAGGTCATCTTACTGGTAAGAACCTTTGCCCTCCTATGTTTCTTCCATCACCGGGAGATACCTCGGCGAAAAATACCAATGTCGTTGGTGCTTCGAGCTCCCAATTCGTTGCAGATGACGCGATTAGCTCGTCGGTCGAAAGAAGTCTGAATCCCCAATATGAAGCATGGATTGCTGTGGATCAATTGTTGTTGGGCTGGCTGTACAACTCTATGGCACCAGACGTGGTCGTTCAACTTATGGGATTCGAAAACGTGAAGGAGTTGTGGGAAGCAATTGAGGAACTCTTCGGGGTTCAATCGAGGGCAGAAGAAGACTATCTCAGGCAAGTTTTTCAACAAACTCGAAAAGGAAATATGAAGATGAGTGAGTACCTACGGGTGATGAAAACCCGCTCTGATAACCTAGCACAAGCTGGAAGTCCAGTAACAACCAGGGCCCTAGTTTCACAAATTCTACTAGGGCTAAATGAAGAATACAACCCCGTGGTGGTCAGAATACAAGGAAAACCAATAATATCATGGCTAGATATGCAATTGAAACTTCTCTCATATGAAAAGCGTCTCGAGCATCTGAATTCTGTTAAAACTAAGAGTAATTTCACTCAAACACCCTCAATCAATGTTGTAATTAATCGGAATTCAAACGGTTCAAAGCCACATAATAACCAAAAACAACAATAAAGTAATGGTCAAAGAGGTAGTGGCGCCAATCCTTTCTTCAACAACAACAACAATGCTAATCAAGGGCGTGGAAGAGGACGAGGAGAAAGCAATAACAGACCAATGTGTGGCAAATTTGGTCATACTACATTAATCTGTTATCATATATTTGATAAGGAATACAATCAAAACTCCAACCAAGGTAGAGGCAACGGTGGTACACCAGGGAATAACATTGGCAATCCAACAATCTTTGCTACTCAAACATCCAATCCTTTTGTAGCCAACGCTAGAGACAGTCAGGGATCCAAGTTGGTATGCTGACAACGAAGCTACTAACAATGTCACATCGGATTTCAACAATATAACTCATCCCACCGAGTATGGAGGTAGTGAACTAGTAATAGTCGGTAATGGTGAAAAACTTTAAATATCCTACATTGGGAACTCATGTTTATCTAATGAGAAAAATAGCCTAATGCTTAAAAATATCATGTGTGTACCATCATTGCAAAAAATTTAATTAGTGTATCAAAATTGGCACAAGACAATGTTGTGTGTGTTGAATTTGATAGTGATTATTATCTTATAAAGGACAAAGCTACGGGATGCACACTGCTGAAAGGGGAACTCAGTGATGGGCTTTATTGGTTGTGGTTGAATGGAGTAAGAGTCCTGAAGGGAGGAATAAGTGAAGATAGTTCAACTCAACACATAAATAAGGGTTCAACTGCATTCATTCTATCGAGAACCAGAGTTAATGTAGCAGAATCACGAGTGTTATGGCACAAACGGTTGGGTGATCCATCACTGAAAACTTTAGAGTTAATCATTAGAGAATGTAATATTCCGACTAAGATGAATGAACAACTTGAATTTTGTGAATCTTTTCAATTAGGCAAAGCACATACTCTACCTTTTCCCAACTCTGTCGCACAAGCGTCTGAAAATTTTGATCTAGTTCATACTGACGTGTGGGGACTTGCACCGATAACATCTACAAATGGATATAGGTACTATGTCTCATTTTTGGATGATTACAGCAGATATCTTTAG

mRNA sequence

ATGGCAAACACGACGCTCACCGGAAATGCCTCTACCTCCGAACTAGTGAACTACAAGCTTAAAGGTCATCTTACTGGTAAGAACCTTTGCCCTCCTATGTTTCTTCCATCACCGGGAGATACCTCGGCGAAAAATACCAATGTCGTTGGTGCTTCGAGCTCCCAATTCGTTGCAGATGACGCGATTAGCTCGTCGGTCGAAAGAAGTCTGAATCCCCAATATGAAGCATGGATTGCTGTGGATCAATTGTTGTTGGGCTGGCTGTACAACTCTATGGCACCAGACGTGGTCGTTCAACTTATGGGATTCGAAAACGTGAAGGAGTTGTGGGAAGCAATTGAGGAACTCTTCGGGGTTCAATCGAGGGCAGAAGAAGACTATCTCAGGCAAGTTTTTCAACAAACTCGAAAAGGAAATATGAAGATGAGTGAGTACCTACGGGTGATGAAAACCCGCTCTGATAACCTAGCACAAGCTGGAAGTCCAGTAACAACCAGGGCCCTAGTTTCACAAATTCTACTAGGGCTAAATGAAGAATACAACCCCGTGGTGGTCAGAATACAAGGAAAACCAATAATATCATGGCTAGATATGCAATTGAAACTTCTCTCATATGAAAAGCGTCTCGAGCATCTGAATTCTGTTAAAACTAAGAGTAGTGGCGCCAATCCTTTCTTCAACAACAACAACAATGCTAATCAAGGGCGTGGAAGAGGACGAGGAGAAAGCAATAACAGACCAATGTGTGGCAAATTTGGTCATACTACATTAATCTGTTATCATATATTTGATAAGGAATACAATCAAAACTCCAACCAAGAGACAGTCAGGGATCCAAGTTGGTATGCTGACAACGAAGCTACTAACAATGTCACATCGGATTTCAACAATATAACTCATCCCACCGAGTATGGAGGTAGTGAACTAGTAATAGTCGGTAATGACAATGTTGTGTGTGTTGAATTTGATAGTGATTATTATCTTATAAAGGACAAAGCTACGGGATGCACACTGCTGAAAGGGGAACTCAGTGATGGGCTTTATTGGTTGTGGTTGAATGGAGTAAGAGTCCTGAAGGGAGGAATAAGTGAAGATAGTTCAACTCAACACATAAATAAGGGTTCAACTGCATTCATTCTATCGAGAACCAGAGTTAATGTAGCAGAATCACGAGTGTTATGGCACAAACGGTTGGGTGATCCATCACTGAAAACTTTAGAGTTAATCATTAGAGAATGTAATATTCCGACTAAGATGAATGAACAACTTGAATTTTGTGAATCTTTTCAATTAGGCAAAGCACATACTCTACCTTTTCCCAACTCTGTCGCACAAGCGTCTGAAAATTTTGATCTAGTTCATACTGACGTGTGGGGACTTGCACCGATAACATCTACAAATGGATATAGGTACTATGTCTCATTTTTGGATGATTACAGCAGATATCTTTAG

Coding sequence (CDS)

ATGGCAAACACGACGCTCACCGGAAATGCCTCTACCTCCGAACTAGTGAACTACAAGCTTAAAGGTCATCTTACTGGTAAGAACCTTTGCCCTCCTATGTTTCTTCCATCACCGGGAGATACCTCGGCGAAAAATACCAATGTCGTTGGTGCTTCGAGCTCCCAATTCGTTGCAGATGACGCGATTAGCTCGTCGGTCGAAAGAAGTCTGAATCCCCAATATGAAGCATGGATTGCTGTGGATCAATTGTTGTTGGGCTGGCTGTACAACTCTATGGCACCAGACGTGGTCGTTCAACTTATGGGATTCGAAAACGTGAAGGAGTTGTGGGAAGCAATTGAGGAACTCTTCGGGGTTCAATCGAGGGCAGAAGAAGACTATCTCAGGCAAGTTTTTCAACAAACTCGAAAAGGAAATATGAAGATGAGTGAGTACCTACGGGTGATGAAAACCCGCTCTGATAACCTAGCACAAGCTGGAAGTCCAGTAACAACCAGGGCCCTAGTTTCACAAATTCTACTAGGGCTAAATGAAGAATACAACCCCGTGGTGGTCAGAATACAAGGAAAACCAATAATATCATGGCTAGATATGCAATTGAAACTTCTCTCATATGAAAAGCGTCTCGAGCATCTGAATTCTGTTAAAACTAAGAGTAGTGGCGCCAATCCTTTCTTCAACAACAACAACAATGCTAATCAAGGGCGTGGAAGAGGACGAGGAGAAAGCAATAACAGACCAATGTGTGGCAAATTTGGTCATACTACATTAATCTGTTATCATATATTTGATAAGGAATACAATCAAAACTCCAACCAAGAGACAGTCAGGGATCCAAGTTGGTATGCTGACAACGAAGCTACTAACAATGTCACATCGGATTTCAACAATATAACTCATCCCACCGAGTATGGAGGTAGTGAACTAGTAATAGTCGGTAATGACAATGTTGTGTGTGTTGAATTTGATAGTGATTATTATCTTATAAAGGACAAAGCTACGGGATGCACACTGCTGAAAGGGGAACTCAGTGATGGGCTTTATTGGTTGTGGTTGAATGGAGTAAGAGTCCTGAAGGGAGGAATAAGTGAAGATAGTTCAACTCAACACATAAATAAGGGTTCAACTGCATTCATTCTATCGAGAACCAGAGTTAATGTAGCAGAATCACGAGTGTTATGGCACAAACGGTTGGGTGATCCATCACTGAAAACTTTAGAGTTAATCATTAGAGAATGTAATATTCCGACTAAGATGAATGAACAACTTGAATTTTGTGAATCTTTTCAATTAGGCAAAGCACATACTCTACCTTTTCCCAACTCTGTCGCACAAGCGTCTGAAAATTTTGATCTAGTTCATACTGACGTGTGGGGACTTGCACCGATAACATCTACAAATGGATATAGGTACTATGTCTCATTTTTGGATGATTACAGCAGATATCTTTAG

Protein sequence

MANTTLTGNASTSELVNYKLKGHLTGKNLCPPMFLPSPGDTSAKNTNVVGASSSQFVADDAISSSVERSLNPQYEAWIAVDQLLLGWLYNSMAPDVVVQLMGFENVKELWEAIEELFGVQSRAEEDYLRQVFQQTRKGNMKMSEYLRVMKTRSDNLAQAGSPVTTRALVSQILLGLNEEYNPVVVRIQGKPIISWLDMQLKLLSYEKRLEHLNSVKTKSSGANPFFNNNNNANQGRGRGRGESNNRPMCGKFGHTTLICYHIFDKEYNQNSNQETVRDPSWYADNEATNNVTSDFNNITHPTEYGGSELVIVGNDNVVCVEFDSDYYLIKDKATGCTLLKGELSDGLYWLWLNGVRVLKGGISEDSSTQHINKGSTAFILSRTRVNVAESRVLWHKRLGDPSLKTLELIIRECNIPTKMNEQLEFCESFQLGKAHTLPFPNSVAQASENFDLVHTDVWGLAPITSTNGYRYYVSFLDDYSRYL
BLAST of ClCG03G009350 vs. TrEMBL
Match: A0A151S6M8_CAJCA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=KK1_027809 PE=4 SV=1)

HSP 1 Score: 249.2 bits (635), Expect = 1.0e-62
Identity = 162/460 (35.22%), Postives = 234/460 (50.87%), Query Frame = 1

Query: 92  MAPDVVVQLMGFENVKELWEAIEELFGVQSRAEEDYLRQVFQQTRKGNMKMSEYLRVMKT 151
           M  +V  QL+  E  +++WE  + L G  +R+   +L+  F +TRKG +KM EYL  MK 
Sbjct: 1   MTQEVATQLLHCETSQQIWEDAQSLAGAHTRSRITFLKTEFHRTRKGGLKMEEYLTKMKE 60

Query: 152 RSDNLAQAGSPVTTRALVSQILLGLNEEYNPVVVRIQGKPIISWLDMQLKLLSYEKRLEH 211
            +D+LA AGS V+T  LV+Q L GL+ EYNP+VV++  K  ++W++MQ +LL+YE RLE 
Sbjct: 61  IADDLALAGSSVSTMDLVTQTLAGLDNEYNPIVVQLSDKEHLTWVEMQAQLLTYENRLEQ 120

Query: 212 LNSV--------------------KTKSSGANPFFNNNNNANQGRGRGRGESNN--RPMC 271
           +N+                     K+ + G       N  A  GRGRGR   +     +C
Sbjct: 121 INNQSNLTLNPSSNISTILYNRRGKSNAFGGGRGGQINRGARGGRGRGRATKDRIVCQVC 180

Query: 272 GKFGHTTLICYHIFDKEY-NQNSNQE--------------------TVRDPSWYADNEAT 331
            K GH    CYH F+K Y  QNS+++                    TV D  WY D+ A+
Sbjct: 181 CKPGHAASHCYHRFNKNYIGQNSDEQKSEKDKEQNYNFNAYVASPSTVEDLDWYFDSGAS 240

Query: 332 NNVTSDFNNITHPTEYGGSELVIVGNDN----VVCVEFDSD-----------YYLIKDKA 391
           N+VT D N +    E  G   + VGN      + C +   D            Y+ K   
Sbjct: 241 NHVTYDQNKVQEVNENDGKSFLTVGNGANLKIIACGDSSLDTQQKSLNLKDILYVPKITK 300

Query: 392 TGCTLLKGELSDGLYWLW----------LNGVRVLKGGISEDSSTQHINKGSTAFILSRT 451
              ++ K    + +Y  +          L G  +L+G I +      +  GST+   +  
Sbjct: 301 NLLSISKLTFDNDIYVEFHDVACFVKDKLTGRILLEGKIKD--GLYQLPGGSTS---TNK 360

Query: 452 RVNVAES-RVLWHKRLGDPSLKTLELIIRECNIPTKMNEQLEFCESFQLGKAHTLPFPNS 483
           R +V  S +  WH++LG P+ K L  +++ CNI     E  EFCE+ Q GKAH LPF NS
Sbjct: 361 RPHVFFSIKETWHRKLGHPNSKVLNEVMKLCNIEASPCENFEFCEACQFGKAHNLPFQNS 420

BLAST of ClCG03G009350 vs. TrEMBL
Match: A5C001_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_037543 PE=4 SV=1)

HSP 1 Score: 214.2 bits (544), Expect = 3.6e-52
Identity = 136/485 (28.04%), Postives = 220/485 (45.36%), Query Frame = 1

Query: 71  NPQYEAWIAVDQLLLGWLYNSMAPDVVVQLMGFENVKELWEAIEELFGVQSRAEEDYLRQ 130
           NP +  W   D+++L W+Y+S+ P+++ Q++G+++    W A+E  F   SRA    LR 
Sbjct: 143 NPDFVMWRRFDRMILSWIYSSLTPEIMGQIVGYQSSHAXWFALEXXFXASSRARVMQLRL 202

Query: 131 VFQQTRKGNMKMSEYLRVMKTRSDNLAQAGSPVTTRALVSQILLGLNEEYNPVVVRIQGK 190
            FQ TRKG++ M EY+  +K+ +DNLA  G PVT R  + Q+L GL  +YN +V  +  +
Sbjct: 203 EFQTTRKGSLTMMEYILKLKSLADNLAAIGEPVTDRDQILQLLGGLGADYNSIVASLTAR 262

Query: 191 PIISWLDMQLKLLSYE---KRLEHLNSVKTKSSGANPFFNNNNNANQGRGRGRGESNNRP 250
                   +  ++S      + +H N+ ++        FN     N GR +         
Sbjct: 263 EDEDNSVAEDNVISANLATPQYQHFNNKRSSGQNRQSGFNTRRGTNGGRSQSSQHRPQCQ 322

Query: 251 MCGKFGHTTLICYHIFD---KEYNQN-------------------SNQETVRDPSWYADN 310
           +CGKFGHT + CYH FD   + YN N                   ++  T+ D +W+ D 
Sbjct: 323 LCGKFGHTVVRCYHRFDINFQGYNPNMDTVQTNKPNAKNQVQAMMASPSTISDEAWFFDT 382

Query: 311 EATNNVTSDFNNITHPTEYGGSELVIVGNDNVVCV------------------------- 370
            AT++++   + ++    Y G++ VIVGN   + +                         
Sbjct: 383 GATHHLSQSIDPLSDVQPYMGNDKVIVGNGKHLRILHTGTTFFPSSSKTFQLRQVLHVPD 442

Query: 371 -------------------EFDSDYYLIKDKATGCTLLKGELSDGLYWLWLNGVRVLKGG 430
                              EF   ++ +KD+ T   LL+G L  GLY      V      
Sbjct: 443 IATNLISVSQFCADNNTFFEFHPRFFFVKDQVTKKILLQGSLEHGLYRFPARFV------ 502

Query: 431 ISEDSSTQHINKGSTAFILS----RTRVNVAESRVLWHKRLGDPSLKTLELIIRECNIPT 483
                          AF+ S     + +++  +  LWH RLG P+   L+ I+  CNI  
Sbjct: 503 -----------PSPAAFVSSSYDRSSNLSLTTTTTLWHSRLGHPADNILKHILTSCNISH 562

BLAST of ClCG03G009350 vs. TrEMBL
Match: A5BQ73_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_039158 PE=4 SV=1)

HSP 1 Score: 212.6 bits (540), Expect = 1.1e-51
Identity = 146/497 (29.38%), Postives = 230/497 (46.28%), Query Frame = 1

Query: 70  LNPQYEAWIAVDQLLLGWLYNSMAPDVVVQLMGFENVKELWEAIEELFGVQSRAEEDYLR 129
           +NP + A    D+ +L W+Y+S+ P ++ Q++G  +    W A+E++F   SRA    L 
Sbjct: 78  INPAFVAXRRQDRTILSWIYSSLTPGIMAQIIGHNSSHSAWNALEKIFSSCSRARIMQLX 137

Query: 130 QVFQQTRKGNMKMSEYLRVMKTRSDNLAQAGSPVTTRALVSQILLGLNEEYNPVVVRIQ- 189
             FQ T+KG+M M +Y+  +K  +D+LA  G PV+ +  +  +L GL  +YN VV  I  
Sbjct: 138 LEFQSTKKGSMSMIDYIMKVKGAADSLAAIGEPVSEQDQIMNLLGGLGSDYNAVVTAINI 197

Query: 190 GKPIISWLDMQLKLLSYEKRLEHLNSV--------------------KTKSSGANPFFNN 249
            +  IS   +   LL++E+RLE   S+                    +  + G  P F  
Sbjct: 198 REDKISLEAVHSMLLAFEQRLEQQGSIEQLPAMSANYASXSNNRGGGRKYNGGRGPNFMM 257

Query: 250 NNNANQGRGRG---------RGESNNRP---MCGKFGHTTLICYHIFDKEYNQNSNQET- 309
            N+  +GRGRG            S+ RP   +CGKFGHT  +CYH FD  +    N  T 
Sbjct: 258 TNSNFRGRGRGXRYGQSGRQNSSSSERPQCQLCGKFGHTVQVCYHRFDITFQSTQNNTTG 317

Query: 310 -------------------VRDPSWYADNEATNNVTSDFNNITHPTEYGGSELVIVGNDN 369
                                D +WY D+ A++++T +  N+T+ T Y G++ V +GN  
Sbjct: 318 VSNSGNSNXMPAMVAXSNNXADDNWYLDSGASHHLTQNVANLTNATPYTGADKVTIGNGK 377

Query: 370 VVCVE-------FDSDYYLIKDKATGCTLLKGEL-----------------SDGLYWLWL 429
            + +        F + +     K      +   L                 S+G +   L
Sbjct: 378 HLTISNTXFTRLFSNPHSFQLKKVFHVPFISANLISVAKFCSDNNALIEFHSNGFFLKDL 437

Query: 430 NGVRVLKGGISEDSSTQH--INKGSTAFI-----LSRTRVNVAESRVLWHKRLGDPSLKT 483
           +  RVL  G  E+   +   I+   TA++      +    N+   R LWH RLG  +   
Sbjct: 438 HTKRVLAQGKLENGLYKFPVISNKKTAYVGITNDSTFQCSNIENKRELWHHRLGHAATDI 497

BLAST of ClCG03G009350 vs. TrEMBL
Match: Q25AL5_ORYSA (H0102C09.1 protein OS=Oryza sativa GN=H0102C09.1 PE=4 SV=1)

HSP 1 Score: 205.3 bits (521), Expect = 1.7e-49
Identity = 154/543 (28.36%), Postives = 244/543 (44.94%), Query Frame = 1

Query: 19  KLKGHLTGKNLCPPMFLPSPGDTSAKNTNVVGASSSQFVADDAISSSVERSLNPQYEAWI 78
           +L+GH+ GKN  PP          A+ T  V                  ++ NP Y+ W 
Sbjct: 180 RLEGHINGKNPAPP----------AEITKTVDGKEV-------------KTSNPNYDEWF 239

Query: 79  AVDQLLLGWLYNSMAPDVVVQLMGFENVKELWEAIEELFGVQSRAEEDYLRQVFQQTRKG 138
           A DQ +LG+L++S+  + ++Q+   +   E W+ ++++F  ++RA    +R      +KG
Sbjct: 240 AADQQILGFLFSSLTRETLLQVAAVKTAAEAWKTLDDMFTSRTRARSLNVRLALTTLQKG 299

Query: 139 NMKMSEYLRVMKTRSDNLAQAGSPVTTRALVSQILLGLNEEYNPVVVRIQGK-PIISWLD 198
           N  +SEY+  MKT +D +A AG P+    L++ IL GL+E+++ VV  I G+   ++  +
Sbjct: 300 NSSISEYIGKMKTLADEVAAAGKPLDDEELIAYILNGLDEDFDSVVSTIVGRVEPVTVAE 359

Query: 199 MQLKLLSYEKRLEHLNSVKT--------------KSSGANPFFNNNNNANQGR----GRG 258
           +  +LLS+E RL    +  T              +  GANP       A +GR    GRG
Sbjct: 360 VYSQLLSFENRLAMRQAQATANMANRGGRGGGGSRGGGANP--GRGRGATRGRGAAPGRG 419

Query: 259 RGES------NNRPMCG---KFGHTTLICYHIFDKEYNQNSN------QETVRDPSWYAD 318
           RG +      +NRP+C    K GH    C+H FD+++  +            RD +WY D
Sbjct: 420 RGNNQQQRSYDNRPLCQVCYKRGHVAADCWHRFDEDFVPDDKLVAAAIHTHARDSNWYVD 479

Query: 319 NEATNNVTSDFNNIT-----------HPTEYGGSELVIVGN------------------- 378
             AT+++TS+   +T           H     G E+  +G+                   
Sbjct: 480 TGATDHITSELEKLTARDVYKGHDQIHTASGSGMEIKHIGHSIVHTPTRPLHLNNVLHVP 539

Query: 379 --------------DNVVCVEFDSDYYLIKDKATGCTLLKGELSDGLYWLWLNGVRVLKG 438
                         DN V +E  S ++LIKD+AT  T+LKG    GLY +          
Sbjct: 540 QANKNLISAHKLAADNSVFLEIHSKHFLIKDQATRRTVLKGRRQKGLYPV---------P 599

Query: 439 GISEDSSTQHINKGSTAFILSRTRVNVAESRVLWHKRLGDPSLKTLELIIRECNIPTKMN 483
             S  SS + +               V  S   WH RLG  S   +  +I +  +P    
Sbjct: 600 SASPPSSAKVV-------------CAVTPSFERWHSRLGHSSAPIISRVISKNKLPCLDE 659

BLAST of ClCG03G009350 vs. TrEMBL
Match: V9GZT4_MAIZE (Copia-like retrotransposon Hopscotch polyprotein OS=Zea mays GN=gag PE=4 SV=1)

HSP 1 Score: 199.9 bits (507), Expect = 7.0e-48
Identity = 156/547 (28.52%), Postives = 239/547 (43.69%), Query Frame = 1

Query: 24  LTGKNLCPPMFLPSPGDTSAKNTNVVGASSSQFVADDAISSSVERSLNPQYEAWIAVDQL 83
           LTG  +CPP  +    D S +   V                      NP Y  WIA DQ 
Sbjct: 49  LTGVEICPPKTI---SDASDRTVTVA---------------------NPAYGRWIARDQA 108

Query: 84  LLGWLYNSMAPDVVVQLMGFENVKELWEAIEELFGVQSRAEEDYLRQVFQQTRKGNMKMS 143
           +LG+L +S++ +V+  ++       +W  + E++   SRA +   R     T+KG   ++
Sbjct: 109 VLGYLLSSLSREVLSSVVNCSTSASVWTTLSEMYSSHSRARKVNTRIALATTKKGASSVA 168

Query: 144 EYLRVMKTRSDNLAQAGSPVTTRALVSQILLGLNEEYNPVVVRIQGK--PIISWLDMQLK 203
           EY   M+  +D L  AG P+     VS +L GL+E++NP+V  +  +  PI    D+  +
Sbjct: 169 EYFAKMRGFADELGAAGKPLDDEEFVSFLLTGLDEDFNPLVTAVVARSDPITPG-DLYTQ 228

Query: 204 LLSYEKRLEHL---------NSVKTKSSGANPFFNNNNNANQGRGRGRG----------- 263
           LLSYE R+ HL         +S   +S G    +  +      RGRGRG           
Sbjct: 229 LLSYENRM-HLQTGSSSLMQSSANARSPGRGMSWGRSGGRGFSRGRGRGRGPSRGGFQSF 288

Query: 264 ------------ESNNRP---MCGKFGHTTLICYHIFDKEY---NQNSNQETVRDPS--- 323
                       ++++RP   +C + GHT L C++ FD+ Y    +++N    ++ S   
Sbjct: 289 GRGNNYSGATDADTSSRPRCQVCSRVGHTALNCWYRFDENYVPDQRSANSAAHQNGSNVP 348

Query: 324 WYADNEATNNVTSDFNNITHPTEYGGSELVIVGN-------------------------- 383
           WY D  AT+++T D + +T   +Y G++ +I  N                          
Sbjct: 349 WYTDTGATDHITGDLDRLTMHDKYTGTDQIIAANGTGMTISNIGNAIVPTSSRSLHLRSV 408

Query: 384 ------------------DNVVCVEFDSDYYLIKDKATGCTLLKGELSDGLYWLWLNGVR 443
                             DN V +EF S ++LIKD+ T   LL G+  DGLY L  +   
Sbjct: 409 LHVPSTHKNLISVHRLTNDNDVFIEFHSSHFLIKDRQTKAVLLHGKCRDGLYPLPPHPDL 468

Query: 444 VLKGGISEDSSTQHINKGSTAFILSRTRVNVAESRVLWHKRLGDPSLKTLELIIRECNIP 483
            LK   S                   TRV +      WHKRLG PS   +  +I   N+P
Sbjct: 469 RLKHNFSS------------------TRVPLEH----WHKRLGHPSRDIVHRVISNNNLP 528

BLAST of ClCG03G009350 vs. TAIR10
Match: AT5G48050.1 (AT5G48050.1 Retrotransposon gag protein (InterPro:IPR005162))

HSP 1 Score: 71.6 bits (174), Expect = 1.5e-12
Identity = 59/225 (26.22%), Postives = 98/225 (43.56%), Query Frame = 1

Query: 77  WIAVDQLLLGWLYNSMAPDVVVQLMGFE-NVKELWEAIEELFGVQSRAEEDYLRQVFQQT 136
           W   D L+  W+Y ++   ++  ++      ++LW ++E LF     A         + T
Sbjct: 67  WKERDGLVKMWIYGTITDSLLDTIIKVGCTARDLWLSLENLFRDNKEARALQFENELRTT 126

Query: 137 RKGNMKMSEYLRVMKTRSDNLAQAGSPVTTRALVSQILLGLNEEYNPVVVRIQGK-PIIS 196
              ++ + EY + +K+ SD L    SP++ R LV  +L GL E+Y+ ++  I+ K P  S
Sbjct: 127 TIDDLSVHEYCQKLKSLSDLLTNVDSPISDRVLVMHLLNGLTEKYDYILNVIKHKSPFPS 186

Query: 197 WLDMQLKLLSYEKRLEHLNSVKTKSSGAN-PFFNN-----------------NNNANQGR 256
           + + +  LL  E RL   N  K+  S  N P  +N                 NNN+N GR
Sbjct: 187 FTEARSMLLMEESRLS--NKSKSSLSHTNHPSLSNVLFTVPRQQERYPQEYHNNNSNMGR 246

Query: 257 GRGRGESNNRPMCGKFGHTTLICYHIFDKEYNQNSNQETVRDPSW 282
           GR + ++         G ++       D  YN N+N    + P+W
Sbjct: 247 GRSKKKNRG-------GGSS-------DGRYNNNNNWRLNQPPTW 275

BLAST of ClCG03G009350 vs. NCBI nr
Match: gi|828332633|ref|XP_004513130.2| (PREDICTED: uncharacterized protein LOC101488260, partial [Cicer arietinum])

HSP 1 Score: 286.6 bits (732), Expect = 8.2e-74
Identity = 172/495 (34.75%), Postives = 248/495 (50.10%), Query Frame = 1

Query: 19  KLKGHLTGKNLCPPMFLPSPGDTSAKNTNVVGASSSQFVADDAISSSVERSLNPQYEAWI 78
           KL G++ G   CP  F+ S  DT+ KN                         NP YE WI
Sbjct: 45  KLDGYIIGTKECPEQFI-STNDTTKKN-------------------------NPDYEEWI 104

Query: 79  AVDQLLLGWLYNSMAPDVVVQLMGFENVKELWEAIEELFGVQSRAEEDYLRQVFQQTRKG 138
           A DQ LLGWL NS+A D+  QL+  E  KELW   + L G  +++   YL+  F  TRKG
Sbjct: 105 AHDQALLGWLRNSVAIDIATQLLHCETSKELWNEAQSLTGAHTKSRTIYLKSEFHNTRKG 164

Query: 139 NMKMSEYLRVMKTRSDNLAQAGSPVTTRALVSQILLGLNEEYNPVVVRIQGKPIISWLDM 198
            MKM +YL  MK  SD L  AGSP+++  L+ Q L GL+ +YNPVVV++  +  ++W+D+
Sbjct: 165 QMKMDQYLLKMKNLSDKLKLAGSPISSSDLIIQTLNGLDADYNPVVVKLSDQINLNWVDL 224

Query: 199 QLKLLSYEKRLEHLNSVKTKSSGAN------PFFNNNNNANQGRGR------GRGESNNR 258
           Q +LL++E R+E LN+    S  A+        F +N +  +G  R      GRG   ++
Sbjct: 225 QAQLLAFENRMEQLNNFSNLSMNASANLASQTHFRSNKSGTRGNWRGSNFRGGRGRGRSK 284

Query: 259 P---MCGKFGHTTLICYHIFDKEYN------QNSNQET----------VRDPSWYADNEA 318
           P   +C K GHTT+ C++ FDK Y       +N+ QE            +D  WY D+ A
Sbjct: 285 PTCQVCNKIGHTTVQCFYRFDKSYTCSNHYAENNKQENHSAFIASPYHGQDYEWYFDSGA 344

Query: 319 TNNVTSDFNNITHPTEYGGSELVIVGNDNVVCVEFDSDYYLIKDKATGCTLLKGELSDGL 378
            N+VT     +   +E  G+                              LLKG++ DGL
Sbjct: 345 NNHVTHQNEKLQDLSESNGN-----------------------------ALLKGKVKDGL 404

Query: 379 YWLWLNGVRVLKGGISEDSSTQHINKGSTAFILSRTRVNVAESRVLWHKRLGDPSLKTLE 438
           Y L               S+   +NK S  +      ++V E+   WH++LG P+ K LE
Sbjct: 405 YQL--------------SSANSQVNKDSCIY------MSVKEN---WHRKLGHPNNKVLE 461

Query: 439 LIIRECNIPTKMNEQLEFCESFQLGKAHTLPFPNSVAQASENFDLVHTDVWGLAPITSTN 483
            +++ CN+ T  N+    CE+ Q GK H LPF +S + A E  DL+H+ VWG API S +
Sbjct: 465 KVLKNCNVKTSSNDHFLLCEACQFGKLHLLPFTSSYSHAKEPLDLIHSGVWGPAPILSPS 461

BLAST of ClCG03G009350 vs. NCBI nr
Match: gi|828335907|ref|XP_012575528.1| (PREDICTED: uncharacterized protein LOC105853050 [Cicer arietinum])

HSP 1 Score: 251.5 bits (641), Expect = 2.9e-63
Identity = 171/497 (34.41%), Postives = 251/497 (50.50%), Query Frame = 1

Query: 18  YKLKGHLTGKNLCPPMFLPSPGDTSAKNTNVVGASSSQFVADDAISSSVERSLNPQYEAW 77
           Y+L GH+ G   CP  F+ S    S KN N                        P +E W
Sbjct: 40  YRLDGHMLGTKECPEKFIASTD--SIKNPN------------------------PAFEDW 99

Query: 78  IAVDQLLLGWLYNSMAPDVVVQLMGFENVKELWEAIEELFGVQSRAEEDYLRQVFQQTRK 137
            A D  LLGWL NSM  ++  QL+  E  K+LW+  + L G  +R+   YL+  F    K
Sbjct: 100 QAHDSQLLGWLMNSMTTEMATQLLHCETSKQLWDEAQSLAGAHTRSRVTYLKSEFHSIIK 159

Query: 138 GNMKMSEYLRVMKTRSDNLAQAGSPVTTRALVSQILLGLNEEYNPVVVRIQGKPIISWLD 197
           G MKM EY   MK  +D L  AGSP++   L+ Q L GL+ EYNP+VV++  +  ++W+D
Sbjct: 160 GEMKMEEYPIKMKNLADKLKLAGSPISNSDLIIQTLNGLDSEYNPMVVKLSDQTSLTWVD 219

Query: 198 MQLKLLSYEKRLEHLNSVKTKSSGA------------NPFFNNNN----NANQ------- 257
           +Q K L+++ RL+ LNS+   +  A            N F  NNN    N+N        
Sbjct: 220 LQAKFLTFDSRLDQLNSLTNLTLNASANVANKTDYRGNKFNTNNNWRGSNSNWRGSNFRG 279

Query: 258 ---GRGRGR-GESNN-RPMCGKFGHTTLICYHIFDKEYNQNS----NQETVRDPSWYADN 317
              GRGRGR G SN+      KF   T        K + +NS    + E ++  +  +  
Sbjct: 280 WRGGRGRGRIGASNHVTHQTDKFEDLT--------KHHGKNSLIVGSGEKLKIVATSSSK 339

Query: 318 EATNNVTSDFNNITHPTEYGGSELVIVGNDNVVCVEFDSDYYLIKDKATGCTLLKGELSD 377
             + N+  D   + + T+   S   +  ++N++ VEFD++   +KDK TG  +L+G L D
Sbjct: 340 LNSLNL-HDVLYVPNITKNMLSVSKLTAHNNIL-VEFDANCCFVKDKLTGKAILRGTLKD 399

Query: 378 GLYWLWLNGVRVLKGGISEDSSTQHINKGSTAFILSRTRVNVAESRVLWHKRLGDPSLKT 437
           GLY L               S T+   +   A+      V+V ES   WH+RLG P+ K 
Sbjct: 400 GLYQL---------------SRTE---RDPCAY------VSVKES---WHRRLGHPNNKV 459

Query: 438 LELIIRECNIPTKMNEQLEFCESFQLGKAHTLPFPNSVAQASENFDLVHTDVWGLAPITS 483
           L+ +++ CN+    ++   FCE+ Q GK H LPF NS + A +  +LVH DVWG AP+TS
Sbjct: 460 LDRVLKNCNVKLSPSDHFNFCEACQYGKMHFLPFKNSSSHAKKILELVHADVWGPAPVTS 473

BLAST of ClCG03G009350 vs. NCBI nr
Match: gi|1012339207|gb|KYP50444.1| (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 249.2 bits (635), Expect = 1.5e-62
Identity = 162/460 (35.22%), Postives = 234/460 (50.87%), Query Frame = 1

Query: 92  MAPDVVVQLMGFENVKELWEAIEELFGVQSRAEEDYLRQVFQQTRKGNMKMSEYLRVMKT 151
           M  +V  QL+  E  +++WE  + L G  +R+   +L+  F +TRKG +KM EYL  MK 
Sbjct: 1   MTQEVATQLLHCETSQQIWEDAQSLAGAHTRSRITFLKTEFHRTRKGGLKMEEYLTKMKE 60

Query: 152 RSDNLAQAGSPVTTRALVSQILLGLNEEYNPVVVRIQGKPIISWLDMQLKLLSYEKRLEH 211
            +D+LA AGS V+T  LV+Q L GL+ EYNP+VV++  K  ++W++MQ +LL+YE RLE 
Sbjct: 61  IADDLALAGSSVSTMDLVTQTLAGLDNEYNPIVVQLSDKEHLTWVEMQAQLLTYENRLEQ 120

Query: 212 LNSV--------------------KTKSSGANPFFNNNNNANQGRGRGRGESNN--RPMC 271
           +N+                     K+ + G       N  A  GRGRGR   +     +C
Sbjct: 121 INNQSNLTLNPSSNISTILYNRRGKSNAFGGGRGGQINRGARGGRGRGRATKDRIVCQVC 180

Query: 272 GKFGHTTLICYHIFDKEY-NQNSNQE--------------------TVRDPSWYADNEAT 331
            K GH    CYH F+K Y  QNS+++                    TV D  WY D+ A+
Sbjct: 181 CKPGHAASHCYHRFNKNYIGQNSDEQKSEKDKEQNYNFNAYVASPSTVEDLDWYFDSGAS 240

Query: 332 NNVTSDFNNITHPTEYGGSELVIVGNDN----VVCVEFDSD-----------YYLIKDKA 391
           N+VT D N +    E  G   + VGN      + C +   D            Y+ K   
Sbjct: 241 NHVTYDQNKVQEVNENDGKSFLTVGNGANLKIIACGDSSLDTQQKSLNLKDILYVPKITK 300

Query: 392 TGCTLLKGELSDGLYWLW----------LNGVRVLKGGISEDSSTQHINKGSTAFILSRT 451
              ++ K    + +Y  +          L G  +L+G I +      +  GST+   +  
Sbjct: 301 NLLSISKLTFDNDIYVEFHDVACFVKDKLTGRILLEGKIKD--GLYQLPGGSTS---TNK 360

Query: 452 RVNVAES-RVLWHKRLGDPSLKTLELIIRECNIPTKMNEQLEFCESFQLGKAHTLPFPNS 483
           R +V  S +  WH++LG P+ K L  +++ CNI     E  EFCE+ Q GKAH LPF NS
Sbjct: 361 RPHVFFSIKETWHRKLGHPNSKVLNEVMKLCNIEASPCENFEFCEACQFGKAHNLPFQNS 420

BLAST of ClCG03G009350 vs. NCBI nr
Match: gi|147816383|emb|CAN68489.1| (hypothetical protein VITISV_037543 [Vitis vinifera])

HSP 1 Score: 214.2 bits (544), Expect = 5.2e-52
Identity = 136/485 (28.04%), Postives = 220/485 (45.36%), Query Frame = 1

Query: 71  NPQYEAWIAVDQLLLGWLYNSMAPDVVVQLMGFENVKELWEAIEELFGVQSRAEEDYLRQ 130
           NP +  W   D+++L W+Y+S+ P+++ Q++G+++    W A+E  F   SRA    LR 
Sbjct: 143 NPDFVMWRRFDRMILSWIYSSLTPEIMGQIVGYQSSHAXWFALEXXFXASSRARVMQLRL 202

Query: 131 VFQQTRKGNMKMSEYLRVMKTRSDNLAQAGSPVTTRALVSQILLGLNEEYNPVVVRIQGK 190
            FQ TRKG++ M EY+  +K+ +DNLA  G PVT R  + Q+L GL  +YN +V  +  +
Sbjct: 203 EFQTTRKGSLTMMEYILKLKSLADNLAAIGEPVTDRDQILQLLGGLGADYNSIVASLTAR 262

Query: 191 PIISWLDMQLKLLSYE---KRLEHLNSVKTKSSGANPFFNNNNNANQGRGRGRGESNNRP 250
                   +  ++S      + +H N+ ++        FN     N GR +         
Sbjct: 263 EDEDNSVAEDNVISANLATPQYQHFNNKRSSGQNRQSGFNTRRGTNGGRSQSSQHRPQCQ 322

Query: 251 MCGKFGHTTLICYHIFD---KEYNQN-------------------SNQETVRDPSWYADN 310
           +CGKFGHT + CYH FD   + YN N                   ++  T+ D +W+ D 
Sbjct: 323 LCGKFGHTVVRCYHRFDINFQGYNPNMDTVQTNKPNAKNQVQAMMASPSTISDEAWFFDT 382

Query: 311 EATNNVTSDFNNITHPTEYGGSELVIVGNDNVVCV------------------------- 370
            AT++++   + ++    Y G++ VIVGN   + +                         
Sbjct: 383 GATHHLSQSIDPLSDVQPYMGNDKVIVGNGKHLRILHTGTTFFPSSSKTFQLRQVLHVPD 442

Query: 371 -------------------EFDSDYYLIKDKATGCTLLKGELSDGLYWLWLNGVRVLKGG 430
                              EF   ++ +KD+ T   LL+G L  GLY      V      
Sbjct: 443 IATNLISVSQFCADNNTFFEFHPRFFFVKDQVTKKILLQGSLEHGLYRFPARFV------ 502

Query: 431 ISEDSSTQHINKGSTAFILS----RTRVNVAESRVLWHKRLGDPSLKTLELIIRECNIPT 483
                          AF+ S     + +++  +  LWH RLG P+   L+ I+  CNI  
Sbjct: 503 -----------PSPAAFVSSSYDRSSNLSLTTTTTLWHSRLGHPADNILKHILTSCNISH 562

BLAST of ClCG03G009350 vs. NCBI nr
Match: gi|147856699|emb|CAN81355.1| (hypothetical protein VITISV_039158 [Vitis vinifera])

HSP 1 Score: 212.6 bits (540), Expect = 1.5e-51
Identity = 146/497 (29.38%), Postives = 230/497 (46.28%), Query Frame = 1

Query: 70  LNPQYEAWIAVDQLLLGWLYNSMAPDVVVQLMGFENVKELWEAIEELFGVQSRAEEDYLR 129
           +NP + A    D+ +L W+Y+S+ P ++ Q++G  +    W A+E++F   SRA    L 
Sbjct: 78  INPAFVAXRRQDRTILSWIYSSLTPGIMAQIIGHNSSHSAWNALEKIFSSCSRARIMQLX 137

Query: 130 QVFQQTRKGNMKMSEYLRVMKTRSDNLAQAGSPVTTRALVSQILLGLNEEYNPVVVRIQ- 189
             FQ T+KG+M M +Y+  +K  +D+LA  G PV+ +  +  +L GL  +YN VV  I  
Sbjct: 138 LEFQSTKKGSMSMIDYIMKVKGAADSLAAIGEPVSEQDQIMNLLGGLGSDYNAVVTAINI 197

Query: 190 GKPIISWLDMQLKLLSYEKRLEHLNSV--------------------KTKSSGANPFFNN 249
            +  IS   +   LL++E+RLE   S+                    +  + G  P F  
Sbjct: 198 REDKISLEAVHSMLLAFEQRLEQQGSIEQLPAMSANYASXSNNRGGGRKYNGGRGPNFMM 257

Query: 250 NNNANQGRGRG---------RGESNNRP---MCGKFGHTTLICYHIFDKEYNQNSNQET- 309
            N+  +GRGRG            S+ RP   +CGKFGHT  +CYH FD  +    N  T 
Sbjct: 258 TNSNFRGRGRGXRYGQSGRQNSSSSERPQCQLCGKFGHTVQVCYHRFDITFQSTQNNTTG 317

Query: 310 -------------------VRDPSWYADNEATNNVTSDFNNITHPTEYGGSELVIVGNDN 369
                                D +WY D+ A++++T +  N+T+ T Y G++ V +GN  
Sbjct: 318 VSNSGNSNXMPAMVAXSNNXADDNWYLDSGASHHLTQNVANLTNATPYTGADKVTIGNGK 377

Query: 370 VVCVE-------FDSDYYLIKDKATGCTLLKGEL-----------------SDGLYWLWL 429
            + +        F + +     K      +   L                 S+G +   L
Sbjct: 378 HLTISNTXFTRLFSNPHSFQLKKVFHVPFISANLISVAKFCSDNNALIEFHSNGFFLKDL 437

Query: 430 NGVRVLKGGISEDSSTQH--INKGSTAFI-----LSRTRVNVAESRVLWHKRLGDPSLKT 483
           +  RVL  G  E+   +   I+   TA++      +    N+   R LWH RLG  +   
Sbjct: 438 HTKRVLAQGKLENGLYKFPVISNKKTAYVGITNDSTFQCSNIENKRELWHHRLGHAATDI 497

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
A0A151S6M8_CAJCA1.0e-6235.22Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=... [more]
A5C001_VITVI3.6e-5228.04Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_037543 PE=4 SV=1[more]
A5BQ73_VITVI1.1e-5129.38Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_039158 PE=4 SV=1[more]
Q25AL5_ORYSA1.7e-4928.36H0102C09.1 protein OS=Oryza sativa GN=H0102C09.1 PE=4 SV=1[more]
V9GZT4_MAIZE7.0e-4828.52Copia-like retrotransposon Hopscotch polyprotein OS=Zea mays GN=gag PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G48050.11.5e-1226.22 Retrotransposon gag protein (InterPro:IPR005162)[more]
Match NameE-valueIdentityDescription
gi|828332633|ref|XP_004513130.2|8.2e-7434.75PREDICTED: uncharacterized protein LOC101488260, partial [Cicer arietinum][more]
gi|828335907|ref|XP_012575528.1|2.9e-6334.41PREDICTED: uncharacterized protein LOC105853050 [Cicer arietinum][more]
gi|1012339207|gb|KYP50444.1|1.5e-6235.22Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan][more]
gi|147816383|emb|CAN68489.1|5.2e-5228.04hypothetical protein VITISV_037543 [Vitis vinifera][more]
gi|147856699|emb|CAN81355.1|1.5e-5129.38hypothetical protein VITISV_039158 [Vitis vinifera][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR025724GAG-pre-integrase_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
ClCG03G009350.1ClCG03G009350.1mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 386..434
score: 3.
NoneNo IPR availablePANTHERPTHR11439GAG-POL-RELATED RETROTRANSPOSONcoord: 65..367
score: 7.2E-80coord: 18..34
score: 7.2E-80coord: 391..482
score: 7.2
NoneNo IPR availablePANTHERPTHR11439:SF185SUBFAMILY NOT NAMEDcoord: 65..367
score: 7.2E-80coord: 18..34
score: 7.2E-80coord: 391..482
score: 7.2
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 77..209
score: 3.4

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None