ClCG01G016000 (gene) Watermelon (Charleston Gray)

Name: ClCG01G016000
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: WRKY DNA-binding protein 51
Location: CG_Chr01 : 30334925 .. 30338482 (+)
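
The location string can be turned into a genomic span with a few lines of Python. This is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Parse 'CHROM : START .. END (STRAND)' into (chrom, start, end, strand)."""
    m = re.search(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("CG_Chr01 : 30334925 .. 30338482 (+)")

# Assumption: both ends are inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, strand, span)  # CG_Chr01 + 3558
```
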
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAACTCCCCACAAAACCCTAGCTTCTTCTTTGACAACCATCAATTTCAAGACTCATCTTCTTCATTCATGGATTTTCTCAATTTCTCAGGTTACCCGCCACCCGATTTCAGCCTCGAAGCCGAGACCGCAGCGTTATCGTTCTCGGAGGCGGGAACCAGCGACGGGAGCAGATCCATGGAAGCAACATCCATAGACAATAATACCATGCAAGTGATATATTATATATCTCAAAAGCTTTTAATTTATATATTTAGTTAACCAATTAATAAAACCATGCATATATATATTGATATATTCAATGAAATTTTATTTGTTAAAAAATGGAAATTGATAAAACAATTAATGTGTATGTGTGTTTTTGCAGAGATGGGTGGTGTGAGGCGAAGGGTGTGAATAGAAAAAAAGAGAAAGGAGGTGGGTGCAGTAATAGAGTTGCATTTATAACAAAGTCGGAATTGGAAATAATGGATGATGGCTTCAAATGGAGAAAGTATGGCAAAAAATCTGTCAAGAATACCCCTCATCCAAGGTAATTTATTCAAGAACTTCCTTCATTTGTTTCTTCAATTATTATTCTAACAATTCCTTCATTTGTTTCTTCAATTATTATTCTAACAATACCGAGAGAACATCCAACCTCTATCATTAAAGTTAATTATACATCTTTTATATTAGTTGAGCTATGCCCTTTTAACGTCGAAGGTTTTTAATTCAATTGTAACATTATTAACATAAAATAATGTTATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATACTTTTTACTAACTTATTGTTAGGAATTTAAATGAGATACAAATAAATATGGGATAAATTAAATTATTTCTCAAAAATATATATATTAGAGCGAAATGAGTAATCATCATAAAATAACAAATAATAAAGATAAAAGGAAAAGGTGAACAATACCAAAAGAAGAAACATTAATTCGTCCTAAATTTGGGACTATATACTGACGTCATACCAAAATAGTTAGAGAGAAGTACAAACTTTTATTTTCAAGAAGATTTTCACCCCCTCTAAGTTTTCTTTAATTATTATTCATCTCCAAATTACAAACTTTTGCTTACAAATTAATTGAAAAAGGTTTTTATTGAAGTTGATAATATATATTTAAGTTTAGTTGAATTAAGTTGTTTATAGAGGAAGAGGAAAGATAGGCAGCTAGAATATTGAGATTAAGGGCCGTTTGGATTCACTCTAGAAAAAAAAATGTTTTTCAAATTAACTTATTTTTATATAATTTTTTTTTATAAATATTTATTAAAATATAATTGAAAAACTATTTTGAGTGGTTGTCAAACACTTCAATTTCTTTCAAAATGACTTATTTTTTAAATTAAATCATTCTAAACAAATTCTAAGATTGACTTTTAATTGTGGTGGGAAAAAAAAGTTGAATGTCATGAACATGATAATTAGAGAGATCAAAGTGGATAAATTAAACTTCAATATTCATACAAATATTCCATTAATAACCTTTTTCCTTTTGTTAAATCACTACCAAATTGCCAATTTGCCCAAATGGGAATTAAAATTAAAAAGTTCATTTCTCGCTGTCTCATATGCTACAATATATAGAAATCAGATATGAATGAGTAATTAAGGTCCTTACGTAAATGTGATAGATGATAATTTACTTTTTTGATGCGTTTAATGACATTTAATAAGATGCAAATTCGAAACAATATACTTTACCTTTTCAAGTTAATCATAATTTGGATAAGGGCAATGCAAGAAACTGACTACAAATCAATCTTCTATAATTTGTTACTTTTTACGTATGGGATTTTGTCTCTGTTAGGGAATATCTATTATTGCCAAATATTAGCTGAATTAGTATAAAATTATATGCTTTTCACCAAGAATTCAAACATTCAAATCTTTCACCTTCATATTTTTGTTTAAAAAAAAAAAGGAAAAATATATACATATTAATTCATTTTA
TTTGCATAATTCAACCGATTAATAGGTTAAAACATATACTCTCAATCAAAAGAGCAAGGTTTCAAATGTCTACCCTACTTTCTATCCATGCTTGGAGCTTTGGTTTTTCAACTGTTTGGTTTGGAGATGTACATCCCTCCTTGGAGGGTTTATGCTTTCTGTTTTGATTTGTTAGGTCTTTAAATTGCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTACTAAAAAAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTCCTTTTTGTTGTACTAAAAACAATATAGATAAGAGAAATTACTTTAAATTATAAATTTATAAACATATAAATCATGTGTGTGAAATAATTGAAAGGCTCTAATTTAGTGAAAATTGCATTGTACAATATGTTAAAGGATCGATTGTTTACATTTTTAAGATGCATGTAGCCTACCACAAAAAATACATAAATAAATTATTTTAGTTTCGAGTTTAAGGCTTTTCATAGTTACAGTGTACTTCTTATAGACTAAGTTATGCCCTAATTGAAATAAATATATATATTTGTTAGTTTAATTATGTTGATGTCCTAAATTTGTTAGGAATTACTACAAATGCTCAAGTGGAGGGTGTGGAGTGAAAAAAAGAGTAGAAAGAGATAGAGATGATTCAAGCTATGTTATAACAACATATGAAGGAGTTCACAACCATGAGAGCCCTTTTTTGAAGTATTGCAATGATCCAATATTATTTCATCCCATTTGCCCTTCCTCTTCTCCTCCCTATTCTTCTACTACTACCCTTTGA

mRNA sequence

ATGAACTCCCCACAAAACCCTAGCTTCTTCTTTGACAACCATCAATTTCAAGACTCATCTTCTTCATTCATGGATTTTCTCAATTTCTCAGGTTACCCGCCACCCGATTTCAGCCTCGAAGCCGAGACCGCAGCGTTATCGTTCTCGGAGGCGGGAACCAGCGACGGGAGCAGATCCATGGAAGCAACATCCATAGACAATAATACCATGCAAGCGAAGGGTGTGAATAGAAAAAAAGAGAAAGGAGGTGGGTGCAGTAATAGAGTTGCATTTATAACAAAGTCGGAATTGGAAATAATGGATGATGGCTTCAAATGGAGAAAGTATGGCAAAAAATCTGTCAAGAATACCCCTCATCCAAGGAATTACTACAAATGCTCAAGTGGAGGGTGTGGAGTGAAAAAAAGAGTAGAAAGAGATAGAGATGATTCAAGCTATGTTATAACAACATATGAAGGAGTTCACAACCATGAGAGCCCTTTTTTGAAGTATTGCAATGATCCAATATTATTTCATCCCATTTGCCCTTCCTCTTCTCCTCCCTATTCTTCTACTACTACCCTTTGA

Coding sequence (CDS)

ATGAACTCCCCACAAAACCCTAGCTTCTTCTTTGACAACCATCAATTTCAAGACTCATCTTCTTCATTCATGGATTTTCTCAATTTCTCAGGTTACCCGCCACCCGATTTCAGCCTCGAAGCCGAGACCGCAGCGTTATCGTTCTCGGAGGCGGGAACCAGCGACGGGAGCAGATCCATGGAAGCAACATCCATAGACAATAATACCATGCAAGCGAAGGGTGTGAATAGAAAAAAAGAGAAAGGAGGTGGGTGCAGTAATAGAGTTGCATTTATAACAAAGTCGGAATTGGAAATAATGGATGATGGCTTCAAATGGAGAAAGTATGGCAAAAAATCTGTCAAGAATACCCCTCATCCAAGGAATTACTACAAATGCTCAAGTGGAGGGTGTGGAGTGAAAAAAAGAGTAGAAAGAGATAGAGATGATTCAAGCTATGTTATAACAACATATGAAGGAGTTCACAACCATGAGAGCCCTTTTTTGAAGTATTGCAATGATCCAATATTATTTCATCCCATTTGCCCTTCCTCTTCTCCTCCCTATTCTTCTACTACTACCCTTTGA

Protein sequence

MNSPQNPSFFFDNHQFQDSSSSFMDFLNFSGYPPPDFSLEAETAALSFSEAGTSDGSRSMEATSIDNNTMQAKGVNRKKEKGGGCSNRVAFITKSELEIMDDGFKWRKYGKKSVKNTPHPRNYYKCSSGGCGVKKRVERDRDDSSYVITTYEGVHNHESPFLKYCNDPILFHPICPSSSPPYSSTTTL
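
The protein sequence above should follow from the CDS by ordinary codon translation. The sketch below shows one way to check that. It assumes the standard genetic code (NCBI translation table 1) and translation from the first base in frame 1. The names `CODON_TABLE` and `translate` are illustrative and not tied to any particular library.

```python
# Standard genetic code, laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        # Codons containing ambiguous bases (e.g. N) become 'X'.
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons of the CDS listed above.
print(translate("ATGAACTCCCCACAAAACCCT"))  # MNSPQNP
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence is a quick consistency check on the record.
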
BLAST of ClCG01G016000 vs. Swiss-Prot
Match: WRK51_ARATH (Probable WRKY transcription factor 51 OS=Arabidopsis thaliana GN=WRKY51 PE=1 SV=1)

HSP 1 Score: 142.5 bits (358), Expect = 4.7e-33
Identity = 82/180 (45.56%), Positives = 108/180 (60.00%), Query Frame = 1

Query: 1   MNSPQNPSFFFDNHQFQDSSSSFMDFLNFSGYPPPDFS-------LEAETAA---LSFSE 60
           MN  QNPS  F     ++  + FMD  +FS     D         +E E ++   +  SE
Sbjct: 1   MNISQNPSPNFTYFSDENFINPFMDNNDFSNLMFFDIDEGGNNGLIEEEISSPTSIVSSE 60

Query: 61  AGTSDGSRSMEATSIDNNTMQAKGVNRKKEKGGGCSNRVAFITKSELEIMDDGFKWRKYG 120
             T +   S  AT++       +G +++ ++     +RVAF T+S++++MDDGFKWRKYG
Sbjct: 61  TFTGESGGSGSATTLSKKESTNRG-SKESDQTKETGHRVAFRTRSKIDVMDDGFKWRKYG 120

Query: 121 KKSVKNTPHPRNYYKCSSGGCGVKKRVERDRDDSSYVITTYEGVHNHESPFLKYCNDPIL 171
           KKSVKN  + RNYYKCSS GC VKKRVERD DD++YVITTYEGVHNHES    Y N+ +L
Sbjct: 121 KKSVKNNINKRNYYKCSSEGCSVKKRVERDGDDAAYVITTYEGVHNHESLSNVYYNEMVL 179
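
The percentages on each HSP summary line are the listed fractions expressed as percentages, so they can be recomputed directly. The following sketch parses every `label = n/m` pair on such a line. The function name `hsp_fractions` is hypothetical, and the regular expression is an assumption about this page's layout rather than a general BLAST parser.

```python
import re

def hsp_fractions(line):
    """Return {label: percent} for every 'label = n/m' pair on an HSP summary line."""
    out = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        out[label] = round(100 * int(num) / int(den), 2)
    return out

line = "Identity = 82/180 (45.56%), Positives = 108/180 (60.00%), Query Frame = 1"
print(hsp_fractions(line))  # {'Identity': 45.56, 'Positives': 60.0}
```
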

BLAST of ClCG01G016000 vs. Swiss-Prot
Match: WRK50_ARATH (Probable WRKY transcription factor 50 OS=Arabidopsis thaliana GN=WRKY50 PE=2 SV=1)

HSP 1 Score: 135.2 bits (339), Expect = 7.5e-31
Identity = 62/91 (68.13%), Positives = 69/91 (75.82%), Query Frame = 1

Query: 69  TMQAKGVNRKKEKGGGCSNRVAFITKSELEIMDDGFKWRKYGKKSVKNTPHPRNYYKCSS 128
           T  A   N+ K++      RVAF T+SE+E++DDGFKWRKYGKK VKN+PHPRNYYKCS 
Sbjct: 81  TATASADNQNKKEKKKIKGRVAFKTRSEVEVLDDGFKWRKYGKKMVKNSPHPRNYYKCSV 140

Query: 129 GGCGVKKRVERDRDDSSYVITTYEGVHNHES 160
            GC VKKRVERDRDD S+VITTYEG HNH S
Sbjct: 141 DGCPVKKRVERDRDDPSFVITTYEGSHNHSS 171

BLAST of ClCG01G016000 vs. Swiss-Prot
Match: WRK48_ARATH (Probable WRKY transcription factor 48 OS=Arabidopsis thaliana GN=WRKY48 PE=2 SV=1)

HSP 1 Score: 124.0 bits (310), Expect = 1.7e-27
Identity = 81/224 (36.16%), Positives = 109/224 (48.66%), Query Frame = 1

Query: 2   NSPQNPSFFFD----NHQFQDSSSSFMDFLN-------------FSGYPPPDFSLEAETA 61
           +S  NP+ F D    +HQF  SS+S     +             F+  P P         
Sbjct: 86  HSSNNPNSFLDLLRQDHQFASSSNSSSFSFDAFPLPNNNNNTSFFTDLPLPQAESSEVVN 145

Query: 62  ALSFSEAGTSDGSRSMEATSIDNNT---------------MQAKGVN-----RKKEKGGG 121
               S   TS  S S EA + DNN+                + KG       +KK +   
Sbjct: 146 TTPTSPNSTSVSSSSNEAAN-DNNSGKEVTVKDQEEGDQQQEQKGTKPQLKAKKKNQKKA 205

Query: 122 CSNRVAFITKSELEIMDDGFKWRKYGKKSVKNTPHPRNYYKCSSGGCGVKKRVERDRDDS 181
              R AF+TKS+++ +DDG++WRKYG+K+VKN+P+PR+YY+C++ GCGVKKRVER  DD 
Sbjct: 206 REARFAFLTKSDIDNLDDGYRWRKYGQKAVKNSPYPRSYYRCTTVGCGVKKRVERSSDDP 265

Query: 182 SYVITTYEGVHNHESPF-----LKYCNDPILFHPICPSSSPPYS 184
           S V+TTYEG H H  P      +     PIL H    +SS  +S
Sbjct: 266 SIVMTTYEGQHTHPFPMTPRGHIGMLTSPILDHGATTASSSSFS 308

BLAST of ClCG01G016000 vs. Swiss-Prot
Match: WRK23_ARATH (Probable WRKY transcription factor 23 OS=Arabidopsis thaliana GN=WRKY23 PE=2 SV=1)

HSP 1 Score: 113.2 bits (282), Expect = 3.0e-24
Identity = 52/112 (46.43%), Positives = 74/112 (66.07%), Query Frame = 1

Query: 49  SEAGTSDGSRSMEATSIDNNTMQAKGVNRKKEKGGGCSNRVAFITKSELEIMDDGFKWRK 108
           +E    +G    +  S     ++AK  N+K+++      RVAF+TKSE++ ++DG++WRK
Sbjct: 126 TEDNEEEGGEDQQEKSHTKKQLKAKKNNQKRQREA----RVAFMTKSEVDHLEDGYRWRK 185

Query: 109 YGKKSVKNTPHPRNYYKCSSGGCGVKKRVERDRDDSSYVITTYEGVHNHESP 161
           YG+K+VKN+P PR+YY+C++  C VKKRVER   D S V+TTYEG H H SP
Sbjct: 186 YGQKAVKNSPFPRSYYRCTTASCNVKKRVERSFRDPSTVVTTYEGQHTHISP 233

BLAST of ClCG01G016000 vs. Swiss-Prot
Match: WRK71_ARATH (Probable WRKY transcription factor 71 OS=Arabidopsis thaliana GN=WRKY71 PE=2 SV=1)

HSP 1 Score: 112.1 bits (279), Expect = 6.8e-24
Identity = 52/92 (56.52%), Positives = 66/92 (71.74%), Query Frame = 1

Query: 69  TMQAKGVNRKKEKGGGCSNRVAFITKSELEIMDDGFKWRKYGKKSVKNTPHPRNYYKCSS 128
           T Q K    KKE+      RVAF+TKSE++ ++DG++WRKYG+K+VKN+P+PR+YY+C++
Sbjct: 108 TKQGKKKGEKKER----EVRVAFMTKSEIDHLEDGYRWRKYGQKAVKNSPYPRSYYRCTT 167

Query: 129 GGCGVKKRVERDRDDSSYVITTYEGVHNHESP 161
             C VKKRVER   D S VITTYEG HNH  P
Sbjct: 168 QKCNVKKRVERSFQDPSIVITTYEGKHNHPIP 195

BLAST of ClCG01G016000 vs. TrEMBL
Match: A0A0A0KFX2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G486960 PE=4 SV=1)

HSP 1 Score: 301.2 bits (770), Expect = 8.7e-79
Identity = 155/199 (77.89%), Positives = 164/199 (82.41%), Query Frame = 1

Query: 1   MNSPQNPSFFFDNHQFQD---SSSSFMDFLNFSGYPPPDFSLEAETAALSFSEAGTSDGS 60
           MNS QNP+FFFD+HQ  D   SSSS MDFLNFSGYP PDF LEAET   S SEA T DGS
Sbjct: 1   MNSLQNPNFFFDHHQQFDQDHSSSSIMDFLNFSGYPLPDFGLEAETTTFSLSEAETGDGS 60

Query: 61  RSMEATSIDNNTM-----QAKGVNRKKEKGGGCSNRVAFITKSELEIMDDGFKWRKYGKK 120
            SM+ATSIDNNT+     + KGV RKK +  G +NRVAFITKSELEI+DDGFKWRKYGKK
Sbjct: 61  GSMKATSIDNNTIDDGWFEGKGVKRKKPRENGRTNRVAFITKSELEILDDGFKWRKYGKK 120

Query: 121 SVKNTPHPRNYYKCSSGGCGVKKRVERDRDDSSYVITTYEGVHNHESPFLKYCNDPILF- 180
           SVKN+PHPRNYYKCSSG CGVKKRVERDRDDSSYVITTYEGVHNHESPFL YCN   LF 
Sbjct: 121 SVKNSPHPRNYYKCSSGECGVKKRVERDRDDSSYVITTYEGVHNHESPFLMYCNGSKLFH 180

Query: 181 -HPICP-SSSPPYSSTTTL 189
            HPICP SSSPPYSSTTTL
Sbjct: 181 PHPICPNSSSPPYSSTTTL 199

BLAST of ClCG01G016000 vs. TrEMBL
Match: A0A061DIH4_THECC (WRKY DNA-binding protein 51, putative OS=Theobroma cacao GN=TCM_001265 PE=4 SV=1)

HSP 1 Score: 173.7 bits (439), Expect = 2.1e-40
Identity = 94/188 (50.00%), Positives = 120/188 (63.83%), Query Frame = 1

Query: 2   NSPQNPSFFFDNHQFQDSSS-SFMDFLNFSGYPPPDFSLEAETAALSFSEAGTSDGSRSM 61
           NS  NPS+ F    F         D+L     P   F  +  + ++  SE G    +   
Sbjct: 64  NSNPNPSYTFFPESFDPMPEFELADYLMLDDCP---FEEDTSSQSMVSSEKGMGGANGFS 123

Query: 62  EATSIDNNTMQAKGVNRKKEKGGGCSNRVAFITKSELEIMDDGFKWRKYGKKSVKNTPHP 121
            ATS + N +   GV + K + G   NRVAF TKSELE+MDDG+KWRKYGKKSVKN+P+P
Sbjct: 124 GATSRNTNIICKSGVRKNKLELG---NRVAFRTKSELEVMDDGYKWRKYGKKSVKNSPNP 183

Query: 122 RNYYKCSSGGCGVKKRVERDRDDSSYVITTYEGVHNHESPFLKYCNDPILFHPICPS--S 181
           RNYYKCSSGGC VKKR+ERDRDD+SYVITTY+G+HNH+SP++ Y N   +  P   +  +
Sbjct: 184 RNYYKCSSGGCNVKKRIERDRDDTSYVITTYDGIHNHDSPYMVYYNQMPIVAPNAWNLRT 243

Query: 182 SPPYSSTT 187
           SPP SS+T
Sbjct: 244 SPPSSSST 245

BLAST of ClCG01G016000 vs. TrEMBL
Match: A0A059AKZ1_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_I00316 PE=4 SV=1)

HSP 1 Score: 166.4 bits (420), Expect = 3.4e-38
Identity = 96/171 (56.14%), Positives = 116/171 (67.84%), Query Frame = 1

Query: 1   MNSPQNPSFFFDNHQFQDSSS----SFMDFLNFSGYPPPDFSLEAETAALSF-SEAGTSD 60
           MN PQ+P+   +N +F++          D+L   G    DFS  A+T A S  + AG+S+
Sbjct: 1   MNGPQDPNP--NNTRFEEMVDLPEFQLSDYLLLEGDFEGDFS--AKTMANSEENSAGSSN 60

Query: 61  GSRSMEATSIDNNTMQAKGVNRKKEKGGGCSNRVAFITKSELEIMDDGFKWRKYGKKSVK 120
           G    EA+   N+ +  +G N+      G   RVAF TKSELE+MDDGFKWRKYGKKSVK
Sbjct: 61  GFAPPEASERRNSNICKRGKNKS-----GTGMRVAFRTKSELEVMDDGFKWRKYGKKSVK 120

Query: 121 NTPHPRNYYKCSSGGCGVKKRVERDRDDSSYVITTYEGVHNHESPFLKYCN 167
           N+P+PRNYYKCSS GC VKKRVERDRDDSSYV+TTYEGVHNHESP L Y N
Sbjct: 121 NSPNPRNYYKCSSRGCLVKKRVERDRDDSSYVMTTYEGVHNHESPSLVYYN 162

BLAST of ClCG01G016000 vs. TrEMBL
Match: M5W247_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa015480mg PE=4 SV=1)

HSP 1 Score: 166.4 bits (420), Expect = 3.4e-38
Identity = 92/168 (54.76%), Positives = 109/168 (64.88%), Query Frame = 1

Query: 6   NPSFFFDNHQFQDSSSSFMDFLNFSGYPPPDFSLEAETAALSFSE-------AGTSDGSR 65
           NPS     H F +S    MDF  FS Y   D+ ++    + S S        A  S GS 
Sbjct: 8   NPSSAGPYH-FGESIDPSMDFDEFSDYFMLDYDVDDHQDSSSLSTVSPEKFMADRSTGSS 67

Query: 66  SMEATSIDNNTMQAKGVNRKKEKGGGCSNRVAFITKSELEIMDDGFKWRKYGKKSVKNTP 125
               +   NN M+ +   R+ +   G  +RVAF TKSELE+MDDGFKWRKYGKKSVKN+P
Sbjct: 68  GGATSRNSNNNMKCRNEGRRNKIEMG--HRVAFRTKSELEVMDDGFKWRKYGKKSVKNSP 127

Query: 126 HPRNYYKCSSGGCGVKKRVERDRDDSSYVITTYEGVHNHESPFLKYCN 167
           +PRNYYKCSSGGC VKKRVERDR+DSSYVITTY+GVHNHESP + Y N
Sbjct: 128 NPRNYYKCSSGGCNVKKRVERDREDSSYVITTYDGVHNHESPCVVYYN 172

BLAST of ClCG01G016000 vs. TrEMBL
Match: B9H5K7_POPTR (WRKY transcription factor 51 family protein OS=Populus trichocarpa GN=POPTR_0005s08720g PE=4 SV=2)

HSP 1 Score: 166.4 bits (420), Expect = 3.4e-38
Identity = 96/174 (55.17%), Positives = 115/174 (66.09%), Query Frame = 1

Query: 1   MNSPQ---NPSFFFDNHQFQDSSSSFM--DFLNFSGYPPPDFSLEAETAALSFSEAGTSD 60
           MN P+   N  F + N  F   ++ F   D+L        D S     A+     +G+S 
Sbjct: 17  MNFPEINPNHDFSYFNEGFDPPATEFQVSDYLMLDDGFGEDNSSSQSMASSEQVPSGSSS 76

Query: 61  GSRSMEATSIDNNTMQAKGVNRKKEKGGGCSNRVAFITKSELEIMDDGFKWRKYGKKSVK 120
           G     ATS  NN+MQ  GV + K +GG   +RVAF TKSELE+MDDGFKWRKYGKKSVK
Sbjct: 77  GYSG--ATS-RNNSMQ-NGVKKNKIEGG---HRVAFRTKSELEVMDDGFKWRKYGKKSVK 136

Query: 121 NTPHPRNYYKCSSGGCGVKKRVERDRDDSSYVITTYEGVHNHESPFLKYCNDPI 170
           N+PHPRNYYKCSSGGC VKKRVERD +DS+YVITTY+GVHNHESP + Y N+ I
Sbjct: 137 NSPHPRNYYKCSSGGCDVKKRVERDGEDSAYVITTYDGVHNHESPCMVYYNNQI 183

BLAST of ClCG01G016000 vs. TAIR10
Match: AT5G64810.1 (AT5G64810.1 WRKY DNA-binding protein 51)

HSP 1 Score: 142.5 bits (358), Expect = 2.6e-34
Identity = 82/180 (45.56%), Positives = 108/180 (60.00%), Query Frame = 1

Query: 1   MNSPQNPSFFFDNHQFQDSSSSFMDFLNFSGYPPPDFS-------LEAETAA---LSFSE 60
           MN  QNPS  F     ++  + FMD  +FS     D         +E E ++   +  SE
Sbjct: 1   MNISQNPSPNFTYFSDENFINPFMDNNDFSNLMFFDIDEGGNNGLIEEEISSPTSIVSSE 60

Query: 61  AGTSDGSRSMEATSIDNNTMQAKGVNRKKEKGGGCSNRVAFITKSELEIMDDGFKWRKYG 120
             T +   S  AT++       +G +++ ++     +RVAF T+S++++MDDGFKWRKYG
Sbjct: 61  TFTGESGGSGSATTLSKKESTNRG-SKESDQTKETGHRVAFRTRSKIDVMDDGFKWRKYG 120

Query: 121 KKSVKNTPHPRNYYKCSSGGCGVKKRVERDRDDSSYVITTYEGVHNHESPFLKYCNDPIL 171
           KKSVKN  + RNYYKCSS GC VKKRVERD DD++YVITTYEGVHNHES    Y N+ +L
Sbjct: 121 KKSVKNNINKRNYYKCSSEGCSVKKRVERDGDDAAYVITTYEGVHNHESLSNVYYNEMVL 179

BLAST of ClCG01G016000 vs. TAIR10
Match: AT5G26170.1 (AT5G26170.1 WRKY DNA-binding protein 50)

HSP 1 Score: 135.2 bits (339), Expect = 4.2e-32
Identity = 62/91 (68.13%), Positives = 69/91 (75.82%), Query Frame = 1

Query: 69  TMQAKGVNRKKEKGGGCSNRVAFITKSELEIMDDGFKWRKYGKKSVKNTPHPRNYYKCSS 128
           T  A   N+ K++      RVAF T+SE+E++DDGFKWRKYGKK VKN+PHPRNYYKCS 
Sbjct: 81  TATASADNQNKKEKKKIKGRVAFKTRSEVEVLDDGFKWRKYGKKMVKNSPHPRNYYKCSV 140

Query: 129 GGCGVKKRVERDRDDSSYVITTYEGVHNHES 160
            GC VKKRVERDRDD S+VITTYEG HNH S
Sbjct: 141 DGCPVKKRVERDRDDPSFVITTYEGSHNHSS 171

BLAST of ClCG01G016000 vs. TAIR10
Match: AT5G49520.1 (AT5G49520.1 WRKY DNA-binding protein 48)

HSP 1 Score: 124.0 bits (310), Expect = 9.7e-29
Identity = 81/224 (36.16%), Positives = 109/224 (48.66%), Query Frame = 1

Query: 2   NSPQNPSFFFD----NHQFQDSSSSFMDFLN-------------FSGYPPPDFSLEAETA 61
           +S  NP+ F D    +HQF  SS+S     +             F+  P P         
Sbjct: 86  HSSNNPNSFLDLLRQDHQFASSSNSSSFSFDAFPLPNNNNNTSFFTDLPLPQAESSEVVN 145

Query: 62  ALSFSEAGTSDGSRSMEATSIDNNT---------------MQAKGVN-----RKKEKGGG 121
               S   TS  S S EA + DNN+                + KG       +KK +   
Sbjct: 146 TTPTSPNSTSVSSSSNEAAN-DNNSGKEVTVKDQEEGDQQQEQKGTKPQLKAKKKNQKKA 205

Query: 122 CSNRVAFITKSELEIMDDGFKWRKYGKKSVKNTPHPRNYYKCSSGGCGVKKRVERDRDDS 181
              R AF+TKS+++ +DDG++WRKYG+K+VKN+P+PR+YY+C++ GCGVKKRVER  DD 
Sbjct: 206 REARFAFLTKSDIDNLDDGYRWRKYGQKAVKNSPYPRSYYRCTTVGCGVKKRVERSSDDP 265

Query: 182 SYVITTYEGVHNHESPF-----LKYCNDPILFHPICPSSSPPYS 184
           S V+TTYEG H H  P      +     PIL H    +SS  +S
Sbjct: 266 SIVMTTYEGQHTHPFPMTPRGHIGMLTSPILDHGATTASSSSFS 308

BLAST of ClCG01G016000 vs. TAIR10
Match: AT2G47260.1 (AT2G47260.1 WRKY DNA-binding protein 23)

HSP 1 Score: 113.2 bits (282), Expect = 1.7e-25
Identity = 52/112 (46.43%), Positives = 74/112 (66.07%), Query Frame = 1

Query: 49  SEAGTSDGSRSMEATSIDNNTMQAKGVNRKKEKGGGCSNRVAFITKSELEIMDDGFKWRK 108
           +E    +G    +  S     ++AK  N+K+++      RVAF+TKSE++ ++DG++WRK
Sbjct: 126 TEDNEEEGGEDQQEKSHTKKQLKAKKNNQKRQREA----RVAFMTKSEVDHLEDGYRWRK 185

Query: 109 YGKKSVKNTPHPRNYYKCSSGGCGVKKRVERDRDDSSYVITTYEGVHNHESP 161
           YG+K+VKN+P PR+YY+C++  C VKKRVER   D S V+TTYEG H H SP
Sbjct: 186 YGQKAVKNSPFPRSYYRCTTASCNVKKRVERSFRDPSTVVTTYEGQHTHISP 233

BLAST of ClCG01G016000 vs. TAIR10
Match: AT1G29860.1 (AT1G29860.1 WRKY DNA-binding protein 71)

HSP 1 Score: 112.1 bits (279), Expect = 3.8e-25
Identity = 52/92 (56.52%), Positives = 66/92 (71.74%), Query Frame = 1

Query: 69  TMQAKGVNRKKEKGGGCSNRVAFITKSELEIMDDGFKWRKYGKKSVKNTPHPRNYYKCSS 128
           T Q K    KKE+      RVAF+TKSE++ ++DG++WRKYG+K+VKN+P+PR+YY+C++
Sbjct: 108 TKQGKKKGEKKER----EVRVAFMTKSEIDHLEDGYRWRKYGQKAVKNSPYPRSYYRCTT 167

Query: 129 GGCGVKKRVERDRDDSSYVITTYEGVHNHESP 161
             C VKKRVER   D S VITTYEG HNH  P
Sbjct: 168 QKCNVKKRVERSFQDPSIVITTYEGKHNHPIP 195

BLAST of ClCG01G016000 vs. NCBI nr
Match: gi|659079204|ref|XP_008440131.1| (PREDICTED: probable WRKY transcription factor 51 [Cucumis melo])

HSP 1 Score: 308.9 bits (790), Expect = 6.0e-81
Identity = 156/201 (77.61%), Positives = 169/201 (84.08%), Query Frame = 1

Query: 1   MNSPQNPSFFFDNHQF--QDSSSSFMDFLNFSGYPPPDFSLEAETAALSFSEAGTSDGSR 60
           MNS QNP+FFFD+HQ   QDSSSSFMDFLNFSGYP PDF LEAET   S SEAGT DGSR
Sbjct: 1   MNSLQNPTFFFDHHQQLDQDSSSSFMDFLNFSGYPLPDFGLEAETTMFSLSEAGTGDGSR 60

Query: 61  SMEATSIDNNTM-----QAKGVNRKKEKGGGCSNRVAFITKSELEIMDDGFKWRKYGKKS 120
           SM+ATSIDNNT+     + KGV RKKE+G GC+++VAFITKSELEI+DDG+KWRKYGKKS
Sbjct: 61  SMKATSIDNNTIDDGWFEGKGVKRKKERGNGCNHKVAFITKSELEILDDGYKWRKYGKKS 120

Query: 121 VKNTPHPRNYYKCSSGGCGVKKRVERDRDDSSYVITTYEGVHNHESPFLKYCNDPILFH- 180
           VKN+PHPRNYYKCSSGGCGVKKRVERDRDDSSYVITTYEGVHNHESPFL Y N   L H 
Sbjct: 121 VKNSPHPRNYYKCSSGGCGVKKRVERDRDDSSYVITTYEGVHNHESPFLMYSNGSKLCHP 180

Query: 181 ---PICPSSS--PPYSSTTTL 189
              PICP+SS   PYSSTTTL
Sbjct: 181 HPQPICPNSSSPDPYSSTTTL 201

BLAST of ClCG01G016000 vs. NCBI nr
Match: gi|449448420|ref|XP_004141964.1| (PREDICTED: probable WRKY transcription factor 51 [Cucumis sativus])

HSP 1 Score: 301.2 bits (770), Expect = 1.3e-78
Identity = 155/199 (77.89%), Positives = 164/199 (82.41%), Query Frame = 1

Query: 1   MNSPQNPSFFFDNHQFQD---SSSSFMDFLNFSGYPPPDFSLEAETAALSFSEAGTSDGS 60
           MNS QNP+FFFD+HQ  D   SSSS MDFLNFSGYP PDF LEAET   S SEA T DGS
Sbjct: 1   MNSLQNPNFFFDHHQQFDQDHSSSSIMDFLNFSGYPLPDFGLEAETTTFSLSEAETGDGS 60

Query: 61  RSMEATSIDNNTM-----QAKGVNRKKEKGGGCSNRVAFITKSELEIMDDGFKWRKYGKK 120
            SM+ATSIDNNT+     + KGV RKK +  G +NRVAFITKSELEI+DDGFKWRKYGKK
Sbjct: 61  GSMKATSIDNNTIDDGWFEGKGVKRKKPRENGRTNRVAFITKSELEILDDGFKWRKYGKK 120

Query: 121 SVKNTPHPRNYYKCSSGGCGVKKRVERDRDDSSYVITTYEGVHNHESPFLKYCNDPILF- 180
           SVKN+PHPRNYYKCSSG CGVKKRVERDRDDSSYVITTYEGVHNHESPFL YCN   LF 
Sbjct: 121 SVKNSPHPRNYYKCSSGECGVKKRVERDRDDSSYVITTYEGVHNHESPFLMYCNGSKLFH 180

Query: 181 -HPICP-SSSPPYSSTTTL 189
            HPICP SSSPPYSSTTTL
Sbjct: 181 PHPICPNSSSPPYSSTTTL 199

BLAST of ClCG01G016000 vs. NCBI nr
Match: gi|590707958|ref|XP_007048143.1| (WRKY DNA-binding protein 51, putative [Theobroma cacao])

HSP 1 Score: 173.7 bits (439), Expect = 3.0e-40
Identity = 94/188 (50.00%), Positives = 120/188 (63.83%), Query Frame = 1

Query: 2   NSPQNPSFFFDNHQFQDSSS-SFMDFLNFSGYPPPDFSLEAETAALSFSEAGTSDGSRSM 61
           NS  NPS+ F    F         D+L     P   F  +  + ++  SE G    +   
Sbjct: 64  NSNPNPSYTFFPESFDPMPEFELADYLMLDDCP---FEEDTSSQSMVSSEKGMGGANGFS 123

Query: 62  EATSIDNNTMQAKGVNRKKEKGGGCSNRVAFITKSELEIMDDGFKWRKYGKKSVKNTPHP 121
            ATS + N +   GV + K + G   NRVAF TKSELE+MDDG+KWRKYGKKSVKN+P+P
Sbjct: 124 GATSRNTNIICKSGVRKNKLELG---NRVAFRTKSELEVMDDGYKWRKYGKKSVKNSPNP 183

Query: 122 RNYYKCSSGGCGVKKRVERDRDDSSYVITTYEGVHNHESPFLKYCNDPILFHPICPS--S 181
           RNYYKCSSGGC VKKR+ERDRDD+SYVITTY+G+HNH+SP++ Y N   +  P   +  +
Sbjct: 184 RNYYKCSSGGCNVKKRIERDRDDTSYVITTYDGIHNHDSPYMVYYNQMPIVAPNAWNLRT 243

Query: 182 SPPYSSTT 187
           SPP SS+T
Sbjct: 244 SPPSSSST 245

BLAST of ClCG01G016000 vs. NCBI nr
Match: gi|743823385|ref|XP_011021947.1| (PREDICTED: probable WRKY transcription factor 51 [Populus euphratica])

HSP 1 Score: 166.8 bits (421), Expect = 3.7e-38
Identity = 91/173 (52.60%), Positives = 116/173 (67.05%), Query Frame = 1

Query: 1   MNSPQNPS--FFFDNHQFQDSSSSFM--DFLNFSGYPPPDFSLEAETAALSFSEAGTSDG 60
           M+ P+NP+  F + +  F   ++ F   D+L   G    D S     A+     +G+S G
Sbjct: 17  MDFPENPNHDFSYFDEGFDPPATEFQVSDYLMLDGGFGEDNSSSQSMASSEQVPSGSSSG 76

Query: 61  SRSMEATSIDNNTMQAKGVNRKKEKGGGCSNRVAFITKSELEIMDDGFKWRKYGKKSVKN 120
                ATS +N+     GV + K +GG   +RVAF TKSELE+MDDG+KWRKYGKKSVKN
Sbjct: 77  YSG--ATSRNNSIKCKNGVKKNKIEGG---HRVAFRTKSELEVMDDGYKWRKYGKKSVKN 136

Query: 121 TPHPRNYYKCSSGGCGVKKRVERDRDDSSYVITTYEGVHNHESPFLKYCNDPI 170
           +P+PRNYYKCSS GC VKKRVERDR+DS+YVITTY+GVHNHESP + Y N+ I
Sbjct: 137 SPNPRNYYKCSSSGCDVKKRVERDREDSAYVITTYDGVHNHESPCMVYYNNQI 184

BLAST of ClCG01G016000 vs. NCBI nr
Match: gi|568819928|ref|XP_006464492.1| (PREDICTED: probable WRKY transcription factor 50 [Citrus sinensis])

HSP 1 Score: 166.8 bits (421), Expect = 3.7e-38
Identity = 87/162 (53.70%), Positives = 106/162 (65.43%), Query Frame = 1

Query: 7   PSFFFDNHQFQDSSSSFMDFLNFS----GYPPPDFSLEAETAALSFSEAGTSDGSRSMEA 66
           P    D+HQ Q S     D+L F     G+    +S +++ ++      G+S G     A
Sbjct: 18  PDHNIDHHQHQSSEFELSDYLLFDDHHHGFDEDAYSSQSKASSDKIIMGGSSSG-----A 77

Query: 67  TSIDNNTMQAKGVNRKKEKGGGCSNRVAFITKSELEIMDDGFKWRKYGKKSVKNTPHPRN 126
           TS  NN     G+ + K + G    R AF TKSELE+MDDGFKWRKYGKKSVKN+P+PRN
Sbjct: 78  TSEKNNIKSKNGIKKMKMEVG---QRFAFRTKSELEVMDDGFKWRKYGKKSVKNSPNPRN 137

Query: 127 YYKCSSGGCGVKKRVERDRDDSSYVITTYEGVHNHESPFLKY 165
           YYKCS+GGC VKKRVERDR+DSSYVITTYEG HNHESP + Y
Sbjct: 138 YYKCSTGGCQVKKRVERDREDSSYVITTYEGTHNHESPCVVY 171

The following BLAST results are available for this feature:
Match Name                        E-value  Identity (%)  Description
WRK51_ARATH                       4.7e-33  45.56         Probable WRKY transcription factor 51 OS=Arabidopsis thaliana GN=WRKY51 PE=1 SV=...
WRK50_ARATH                       7.5e-31  68.13         Probable WRKY transcription factor 50 OS=Arabidopsis thaliana GN=WRKY50 PE=2 SV=...
WRK48_ARATH                       1.7e-27  36.16         Probable WRKY transcription factor 48 OS=Arabidopsis thaliana GN=WRKY48 PE=2 SV=...
WRK23_ARATH                       3.0e-24  46.43         Probable WRKY transcription factor 23 OS=Arabidopsis thaliana GN=WRKY23 PE=2 SV=...
WRK71_ARATH                       6.8e-24  56.52         Probable WRKY transcription factor 71 OS=Arabidopsis thaliana GN=WRKY71 PE=2 SV=...

Match Name                        E-value  Identity (%)  Description
A0A0A0KFX2_CUCSA                  8.7e-79  77.89         Uncharacterized protein OS=Cucumis sativus GN=Csa_6G486960 PE=4 SV=1
A0A061DIH4_THECC                  2.1e-40  50.00         WRKY DNA-binding protein 51, putative OS=Theobroma cacao GN=TCM_001265 PE=4 SV=1
A0A059AKZ1_EUCGR                  3.4e-38  56.14         Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_I00316 PE=4 SV=1
M5W247_PRUPE                      3.4e-38  54.76         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa015480mg PE=4 SV=1
B9H5K7_POPTR                      3.4e-38  55.17         WRKY transcription factor 51 family protein OS=Populus trichocarpa GN=POPTR_0005...

Match Name                        E-value  Identity (%)  Description
AT5G64810.1                       2.6e-34  45.56         WRKY DNA-binding protein 51
AT5G26170.1                       4.2e-32  68.13         WRKY DNA-binding protein 50
AT5G49520.1                       9.7e-29  36.16         WRKY DNA-binding protein 48
AT2G47260.1                       1.7e-25  46.43         WRKY DNA-binding protein 23
AT1G29860.1                       3.8e-25  56.52         WRKY DNA-binding protein 71

Match Name                        E-value  Identity (%)  Description
gi|659079204|ref|XP_008440131.1|  6.0e-81  77.61         PREDICTED: probable WRKY transcription factor 51 [Cucumis melo]
gi|449448420|ref|XP_004141964.1|  1.3e-78  77.89         PREDICTED: probable WRKY transcription factor 51 [Cucumis sativus]
gi|590707958|ref|XP_007048143.1|  3.0e-40  50.00         WRKY DNA-binding protein 51, putative [Theobroma cacao]
gi|743823385|ref|XP_011021947.1|  3.7e-38  52.60         PREDICTED: probable WRKY transcription factor 51 [Populus euphratica]
gi|568819928|ref|XP_006464492.1|  3.7e-38  53.70         PREDICTED: probable WRKY transcription factor 50 [Citrus sinensis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR003657   WRKY_dom
Vocabulary: Molecular Function
Term        Definition
GO:0003700  transcription factor activity, sequence-specific DNA binding
GO:0043565  sequence-specific DNA binding
Vocabulary: Biological Process
Term        Definition
GO:0006355  regulation of transcription, DNA-templated
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006355      regulation of transcription, DNA-templated
biological_process  GO:0050896      response to stimulus
biological_process  GO:0042742      defense response to bacterium
biological_process  GO:0050832      defense response to fungus
biological_process  GO:0009867      jasmonic acid mediated signaling pathway
cellular_component  GO:0005667      transcription factor complex
cellular_component  GO:0005634      nucleus
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0043565      sequence-specific DNA binding
molecular_function  GO:0003700      transcription factor activity, sequence-specific DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG01G016000.1  ClCG01G016000.1  mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term   IPR Description   Source   Source Term       Source Description                    Alignment
IPR003657  WRKY domain       GENE3D   G3DSA:2.20.25.80  -                                     coord: 87..160; score: 3.0
IPR003657  WRKY domain       PFAM     PF03106           WRKY                                  coord: 101..158; score: 6.5
IPR003657  WRKY domain       SMART    SM00774           WRKY_cls                              coord: 100..159; score: 1.5
IPR003657  WRKY domain       PROFILE  PS50811           WRKY                                  coord: 95..160; score: 30
IPR003657  WRKY domain       unknown  SSF118290         WRKY DNA-binding domain               coord: 93..160; score: 8.89
None       No IPR available  PANTHER  PTHR31221         FAMILY NOT NAMED                      coord: 1..170; score: 5.6
None       No IPR available  PANTHER  PTHR31221:SF36    WRKY TRANSCRIPTION FACTOR 50-RELATED  coord: 1..170; score: 5.6
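
The `coord` ranges in the InterPro table can be used to cut the matched region out of the protein sequence listed earlier. The sketch below does this for the PFAM hit (PF03106, coord 101..158). It assumes the coordinates are 1-based, inclusive residue positions on the protein, which the page does not state. The helper `domain` is a hypothetical name.

```python
# Protein sequence of ClCG01G016000, copied from the "Protein sequence" section.
PROTEIN = (
    "MNSPQNPSFFFDNHQFQDSSSSFMDFLNFSGYPPPDFSLEAETAALSFSEAGTSDGSRSM"
    "EATSIDNNTMQAKGVNRKKEKGGGCSNRVAFITKSELEIMDDGFKWRKYGKKSVKNTPHP"
    "RNYYKCSSGGCGVKKRVERDRDDSSYVITTYEGVHNHESPFLKYCNDPILFHPICPSSSP"
    "PYSSTTTL"
)

def domain(seq, start, end):
    """Return residues start..end, treating both as 1-based and inclusive (assumed)."""
    return seq[start - 1:end]

pfam_wrky = domain(PROTEIN, 101, 158)
print(pfam_wrky)
# The conserved WRKYGKK heptad that gives the family its name should fall inside the hit.
print("WRKYGKK" in pfam_wrky)
```
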

The following gene(s) are paralogous to this gene:
Gene           Paralogue      Organism                      Block
ClCG01G016000  ClCG02G000970  Watermelon (Charleston Gray)  wcgwcgB087