Cp4.1LG15g00430 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG15g00430
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: WRKY transcription factor, putative
Location: Cp4.1LG15 : 343905 .. 346634 (+)
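
The Location field can be turned into a feature length with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive at both ends, which is common for genome-browser records but is not stated on this page; the function name `span_length` is hypothetical.

```python
import re

def span_length(location):
    """Length in bp of a 'start .. end' span, assuming 1-based inclusive coordinates."""
    # Find the first 'digits .. digits' pair; the scaffold name has no double dot.
    match = re.search(r"(\d+)\s*\.\.\s*(\d+)", location)
    if match is None:
        raise ValueError("no 'start .. end' pair found in: " + location)
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1

length = span_length("Cp4.1LG15 : 343905 .. 346634 (+)")
print(length)  # 346634 - 343905 + 1
```

Under that assumption the gene spans 2730 bp on the plus strand of Cp4.1LG15.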
The following sequences are available for this feature:

Gene sequence (with intron)

TAAGGTGGGAAAAGAAGAAAAAAAAGGTAGGAGTTGGGGTGTGGCTTGTTCAAGGAGTTGGGTTCTCTTCTCCAAGCGAGACAAAAGCAAAAGCAAAAGCAAAAACAAAAACAAAAACAAAAACAAAAACAAATCATGGAAGTTTTAGTTTAGGGTTTCAGAAACTTACACGCTCTGTTTTTCTCTCTTCTTATGATTTCTTCTTCCATTTCAACTGTTTTGATGCTTCCTTTCTGGGTTTTTGTTGTATAGTTCTAAGCTTTCTTGGGCTTATTTTGATTCATTTTCTTGTGTTCTAAACCTCTGTGTTTTGTTCTTCAAATGGAAGAGCTTGAAGAAGCTAACAGAGCCGCCATAGATAGCTGTCATGGAGTTTTGAATCTTTTAGCTCACCCTTCTCCTCAAGAACAAGGTGAGCTGAGGAAGAATTTAATGGTGGAAACTGGAGAGGCTGTTTTTAAGCTCAAGAAAGTGGTATCTCTTTTGAATTCTGGGTTTGGTTATGCAAAAGTTAGAAGATTCAAGAACATTCCTCTGCCTTTACCCCAAAAATTTCTCTTAGATCCTCCAAACAATGGTTTGAATGGTAAAATTTCTGTTTTTTTGGGAAACCCAGATTTGGAAATGGATCAAAATCATAAGAATTCCCTCAAAATTCATAAACAAGCTGCTCCATCTTTGAACTTTAGCTTCCCCCAGCAACAGCAGCAGCAGAGGAAGCAAGCAGAGATGATGTTTCTTAGGAGTAACAGTGGAATGAATATGAATTTTGATGCTTCTAAGTGCACATTGACAATGTCATCTGCTAGATCTTTTATTTCTTCGTTGAGTATGGACGGGAGCGTTGCGGACGGGAGCTCGTTCCATTTGATCGGACCGTCGTCGACGATGTCGGGCGATAACAAGAGGAGGTTTTCTGGAAGGGGAGATGAAGGGAGCTTGAAATGTGGTAGCACTGGCAAGTGCCATTGCTCAAAGAAGAGGTTGGATTGATATCCATCTTGTTGGGTGTTGATATCCCTTTGGTTTACTGATAATTTGGGTTGGTTTTTGTAGGAAACATCGAGTGAAGAGATCAATTAAGGTTCCTGCTATAAGTAACAAACTTGCTGATATCCCTTCTGATGATTATTCATGGAGGAAATATGGGCAGAAGCCAATTAAGGGTTCTCCTCATCCCAGGTATGACTAAACCTCGGGTCGGTCGTAACTGATATTGTCCACTTCGTGCATAACTCTAGCCTTTTGACTCTGATAATGACCTCACTGGCGTGTTTACTTGTGAGATCCCACATCGGTTGGAGGGGGAACGAAGGATTCCTTAGAAGGGTGTGAAAATCTCTCCCCAACAAACGCGATAGTGATATGTAATGGGCCAAAACGGACAATATCTGCTAGCAGTGGGCTTGACAAATGGTATCAGAGTCGGACACCGGATGGTGTGCCAGCAAGAACGCTGGGCTCCAAGGGGTGGATTGTCAAATCCCACATCGGTTAGAGAGGGGAACAAAACATTTCTTATAAGGGTGTGGAAACCTCTCACCAACTGACCGTTTTAAAACTATGAGGCTGATGGTGATACGTAACGGGTCACAACGGATAATATCTGCTAGCGGTGGGCTTGGACTGTTACAAATGGTATTAGAGTCAGATATCGGGTGGTGTGCCAGTGAGGGCGCTGGGCACCCAAGGAGTGGATTGTGAGATCCCATATCGGTTGGAGGGGGGAACAAAACATTCATTATAAGGGTGTGGGAACCTCTTCCTAACAGACGCGTTTTATAAATGTAAGATTGACGGCGATACGTAACGGGCCAAAGTGGACAATATTTGCTAACGGTGGGTTTGGGTTGTTACAAAATGGTATCAGAGCTAGATATTGGGAGGTGTGCCAGCGTGGACGCTGGGCCCCAAAGGGGTGGATTGTGAGATCTCATATAGGTTGGAGAAGGGAACAAAGCATTCGTTGTAAGAGTGTGGAAACCTTTCCCTAGCATACG
CGTTTTAAAACTATGAGGGTGACGGCAATACGTAACAGATCAAAACAGATAATATCTGCTAGAGGTGAGCTATAAGAAGCTTATAATGACTTCTACTTCCAATGACCCTTGTTAAAGAGCGTCTGTTTGCTAAATCATTAGAACAAGATGTTCTTATAACAGGATCATTCCCAAGAAAAATGAAAAGAAAAGGGTCCAAGTTTTTGTGATTTTTCGACAACTCGAAGAACGAAAACTTCGGGCGATCATCAAAACGAATGGATGCCTGTTTCTTTTCCATTTTCGTCTCGTTGGATCATTTCGAACTCGAAAAATTGAGGATTTGAGAGAGGGAAAGTGTTGATGATTGAGGAAGTTTGTAAAATGAGTTTGCTTTTGTTGTAAAATGTGTGCAGGGGTTACTACAAATGCAGCAGCATGAGAGGTTGTCCAGCGAGAAAGCACGTCGAGCGATGCTTAGAAGACCCGTCGATGCTTATCGTAACGTACGAAGGAGAGCATAATCACCCGAAAATGTCGACGCAATCTGCACACACTTAGCCTACAACAAGCTCAAATTAAGCAGGATTGGAGCCTCCATTTTTGTTTTGTATTAATCAAAACTTAGTTTGGGATTGGTGTTGGTGCTTCTTTTGTACATACTTTTGGCCTCAACATTTGAACCAAGCTCTTCAATTTGGAGACTTTCCTTTCCAAATCATAAAGGGAACAAATTTAATCCGACATTTGT

mRNA sequence

TAAGGTGGGAAAAGAAGAAAAAAAAGGTAGGAGTTGGGGTGTGGCTTGTTCAAGGAGTTGGGTTCTCTTCTCCAAGCGAGACAAAAGCAAAAGCAAAAGCAAAAACAAAAACAAAAACAAAAACAAAAACAAATCATGGAAGTTTTAGTTTAGGGTTTCAGAAACTTACACGCTCTGTTTTTCTCTCTTCTTATGATTTCTTCTTCCATTTCAACTGTTTTGATGCTTCCTTTCTGGGTTTTTGTTGTATAGTTCTAAGCTTTCTTGGGCTTATTTTGATTCATTTTCTTGTGTTCTAAACCTCTGTGTTTTGTTCTTCAAATGGAAGAGCTTGAAGAAGCTAACAGAGCCGCCATAGATAGCTGTCATGGAGTTTTGAATCTTTTAGCTCACCCTTCTCCTCAAGAACAAGGTGAGCTGAGGAAGAATTTAATGGTGGAAACTGGAGAGGCTGTTTTTAAGCTCAAGAAAGTGGTATCTCTTTTGAATTCTGGGTTTGGTTATGCAAAAGTTAGAAGATTCAAGAACATTCCTCTGCCTTTACCCCAAAAATTTCTCTTAGATCCTCCAAACAATGGTTTGAATGGTAAAATTTCTGTTTTTTTGGGAAACCCAGATTTGGAAATGGATCAAAATCATAAGAATTCCCTCAAAATTCATAAACAAGCTGCTCCATCTTTGAACTTTAGCTTCCCCCAGCAACAGCAGCAGCAGAGGAAGCAAGCAGAGATGATGTTTCTTAGGAGTAACAGTGGAATGAATATGAATTTTGATGCTTCTAAGTGCACATTGACAATGTCATCTGCTAGATCTTTTATTTCTTCGTTGAGTATGGACGGGAGCGTTGCGGACGGGAGCTCGTTCCATTTGATCGGACCGTCGTCGACGATGTCGGGCGATAACAAGAGGAGGTTTTCTGGAAGGGGAGATGAAGGGAGCTTGAAATGTGGTAGCACTGGCAAGTGCCATTGCTCAAAGAAGAGGAAACATCGAGTGAAGAGATCAATTAAGGTTCCTGCTATAAGTAACAAACTTGCTGATATCCCTTCTGATGATTATTCATGGAGGAAATATGGGCAGAAGCCAATTAAGGGTTCTCCTCATCCCAGGGGTTACTACAAATGCAGCAGCATGAGAGGTTGTCCAGCGAGAAAGCACGTCGAGCGATGCTTAGAAGACCCGTCGATGCTTATCGTAACGTACGAAGGAGAGCATAATCACCCGAAAATGTCGACGCAATCTGCACACACTTAGCCTACAACAAGCTCAAATTAAGCAGGATTGGAGCCTCCATTTTTGTTTTGTATTAATCAAAACTTAGTTTGGGATTGGTGTTGGTGCTTCTTTTGTACATACTTTTGGCCTCAACATTTGAACCAAGCTCTTCAATTTGGAGACTTTCCTTTCCAAATCATAAAGGGAACAAATTTAATCCGACATTTGT

Coding sequence (CDS)

ATGGAAGAGCTTGAAGAAGCTAACAGAGCCGCCATAGATAGCTGTCATGGAGTTTTGAATCTTTTAGCTCACCCTTCTCCTCAAGAACAAGGTGAGCTGAGGAAGAATTTAATGGTGGAAACTGGAGAGGCTGTTTTTAAGCTCAAGAAAGTGGTATCTCTTTTGAATTCTGGGTTTGGTTATGCAAAAGTTAGAAGATTCAAGAACATTCCTCTGCCTTTACCCCAAAAATTTCTCTTAGATCCTCCAAACAATGGTTTGAATGGTAAAATTTCTGTTTTTTTGGGAAACCCAGATTTGGAAATGGATCAAAATCATAAGAATTCCCTCAAAATTCATAAACAAGCTGCTCCATCTTTGAACTTTAGCTTCCCCCAGCAACAGCAGCAGCAGAGGAAGCAAGCAGAGATGATGTTTCTTAGGAGTAACAGTGGAATGAATATGAATTTTGATGCTTCTAAGTGCACATTGACAATGTCATCTGCTAGATCTTTTATTTCTTCGTTGAGTATGGACGGGAGCGTTGCGGACGGGAGCTCGTTCCATTTGATCGGACCGTCGTCGACGATGTCGGGCGATAACAAGAGGAGGTTTTCTGGAAGGGGAGATGAAGGGAGCTTGAAATGTGGTAGCACTGGCAAGTGCCATTGCTCAAAGAAGAGGAAACATCGAGTGAAGAGATCAATTAAGGTTCCTGCTATAAGTAACAAACTTGCTGATATCCCTTCTGATGATTATTCATGGAGGAAATATGGGCAGAAGCCAATTAAGGGTTCTCCTCATCCCAGGGGTTACTACAAATGCAGCAGCATGAGAGGTTGTCCAGCGAGAAAGCACGTCGAGCGATGCTTAGAAGACCCGTCGATGCTTATCGTAACGTACGAAGGAGAGCATAATCACCCGAAAATGTCGACGCAATCTGCACACACTTAG

Protein sequence

MEELEEANRAAIDSCHGVLNLLAHPSPQEQGELRKNLMVETGEAVFKLKKVVSLLNSGFGYAKVRRFKNIPLPLPQKFLLDPPNNGLNGKISVFLGNPDLEMDQNHKNSLKIHKQAAPSLNFSFPQQQQQQRKQAEMMFLRSNSGMNMNFDASKCTLTMSSARSFISSLSMDGSVADGSSFHLIGPSSTMSGDNKRRFSGRGDEGSLKCGSTGKCHCSKKRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAHT
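
The protein sequence is the codon-by-codon translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1), the first 27 bases of the CDS can be translated as follows; `translate` and `CODON_TABLE` are hypothetical names used only for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First nine codons of the CDS listed above.
prefix = translate("ATGGAAGAGCTTGAAGAAGCTAACAGA")
print(prefix)
```

The result, MEELEEANR, matches the first nine residues of the protein sequence, and the final CDS codons CAC ACT TAG translate to HT followed by a stop, matching its last two residues.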
BLAST of Cp4.1LG15g00430 vs. Swiss-Prot
Match: WRK74_ARATH (Probable WRKY transcription factor 74 OS=Arabidopsis thaliana GN=WRKY74 PE=2 SV=2)

HSP 1 Score: 285.8 bits (730), Expect = 5.6e-76
Identity = 178/338 (52.66%), Positives = 216/338 (63.91%), Query Frame = 1

Query: 1   MEELEEANRAAIDSCHGVLNLLAHPSPQEQGELRKNLMVETGEAVFKLKKVVSLLNSGFG 60
           MEE+E AN+AA++SCHGVLNLL+     +Q    K++MVET EAV K K+V SLL+ G G
Sbjct: 1   MEEVEAANKAAVESCHGVLNLLS-----QQTNDSKSIMVETREAVCKFKRVSSLLSRGLG 60

Query: 61  YAKVRRFKNIPLP-----LPQKFLLDPP---NNGLNGKISVFL---------GNPDLEM- 120
             K+++  N         LPQ   L+ P   NN ++G I +           G P L + 
Sbjct: 61  QRKIKKLNNNNYKFSSSLLPQHMFLESPVCSNNAISGCIPILAPKPLQIVPAGPPPLMLF 120

Query: 121 DQNHKNSLKIHKQAAPSLNFSFPQQQQQQRKQAEMMFLRSNSGMNMNFDASKCTLTMS-- 180
           +QN        +   PS     P+  Q      + ++ RS SG+N+ FD S      S  
Sbjct: 121 NQNMCLDKSFLELKPPSSRAVDPKPYQFIHTHQQGVYSRSKSGLNLKFDGSIGASCYSPS 180

Query: 181 ---SARSFISSLSMDGSVA--DGSSFHLIG--PSSTMSGDNKRRFSGRGDEGSLKCGSTG 240
               +RSF+SSLSMDGSV   D +SFHLIG    S     + RR S     GSLKCGS  
Sbjct: 181 ISNGSRSFVSSLSMDGSVTDYDRNSFHLIGLPQGSDHISQHSRRTS---CSGSLKCGSKS 240

Query: 241 KCHCSKKRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKCSSMRG 300
           KCHCSKKRK RVKRSIKVPAISNK+ADIP D+YSWRKYGQKPIKGSPHPRGYYKCSS+RG
Sbjct: 241 KCHCSKKRKLRVKRSIKVPAISNKIADIPPDEYSWRKYGQKPIKGSPHPRGYYKCSSVRG 300

Query: 301 CPARKHVERCLEDPSMLIVTYEGEHNHPK-MSTQSAHT 311
           CPARKHVERC+E+ SMLIVTYEGEHNH + +S+QSAHT
Sbjct: 301 CPARKHVERCVEETSMLIVTYEGEHNHSRILSSQSAHT 330
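
The percentages in each HSP summary line are the match counts divided by the alignment length (which includes gap columns). The following sketch recomputes them from the line's own fractions; the helper name `hsp_fractions` is hypothetical, and the regular expression is an assumption about this page's layout rather than a general BLAST parser.

```python
import re

def hsp_fractions(stats_line):
    """Map each 'Label = a/b' field to (matches, alignment_length, percent)."""
    result = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        matches, length = int(num), int(den)
        result[label] = (matches, length, round(100 * matches / length, 2))
    return result

line = "Identity = 178/338 (52.66%), Positives = 216/338 (63.91%), Query Frame = 1"
stats = hsp_fractions(line)
for label, (matches, length, percent) in stats.items():
    print(label, matches, length, percent)
```

For the WRKY74 hit this gives 178/338 = 52.66% identical positions and 216/338 = 63.91% identical or conservatively substituted positions, in agreement with the reported values.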

BLAST of Cp4.1LG15g00430 vs. Swiss-Prot
Match: WRK39_ARATH (Probable WRKY transcription factor 39 OS=Arabidopsis thaliana GN=WRKY39 PE=2 SV=1)

HSP 1 Score: 284.3 bits (726), Expect = 1.6e-75
Identity = 172/339 (50.74%), Positives = 207/339 (61.06%), Query Frame = 1

Query: 1   MEELEEANRAAIDSCHGVLNLLAHPSPQEQGELRKNLMVETGEAVFKLKKVVSLLNSGFG 60
           MEE+E ANR+AI+SCHGVLNLL+  +        K+L VETGE V K K+V SLL  G G
Sbjct: 1   MEEVEAANRSAIESCHGVLNLLSQRTSDP-----KSLTVETGEVVSKFKRVASLLTRGLG 60

Query: 61  YAKVRRFKNIPLPLPQKFLLDPP---NNGLNGKISVFLGNPDLEM---DQNHKNSLKIHK 120
           + K R         PQ   L+ P    N L+G  +  L    L+M      +      H+
Sbjct: 61  HGKFRSTNKFRSSFPQHIFLESPICCGNDLSGDYTQVLAPEPLQMVPASAVYNEMEPKHQ 120

Query: 121 QAAPSLNFS----------------FPQQQQQQRKQAEMMFLRSNSGMNMNFDASKCTL- 180
              PSL  S                F    Q      ++ + RSNSG+N+ FD S  +  
Sbjct: 121 LGHPSLMLSHKMCVDKSFLELKPPPFRAPYQLIHNHQQIAYSRSNSGVNLKFDGSGSSCY 180

Query: 181 ---TMSSARSFISSLSMDGSVA--DGSSFHLIGPSSTMSGDNKRRFSGRGDEGSLKCGST 240
                + +RSF+SSLSMD SV   D +SFHL G S      + R+       GSLKCGS 
Sbjct: 181 TPSVSNGSRSFVSSLSMDASVTDYDRNSFHLTGLSRGSDQQHTRKMC----SGSLKCGSR 240

Query: 241 GKCHCSKKRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKCSSMR 300
            KCHCSKKRK RVKRSIKVPAISNK+ADIP D+YSWRKYGQKPIKGSPHPRGYYKCSS+R
Sbjct: 241 SKCHCSKKRKLRVKRSIKVPAISNKIADIPPDEYSWRKYGQKPIKGSPHPRGYYKCSSVR 300

Query: 301 GCPARKHVERCLEDPSMLIVTYEGEHNHPK-MSTQSAHT 311
           GCPARKHVERC+++ SMLIVTYEGEHNH + +S+QSAHT
Sbjct: 301 GCPARKHVERCIDETSMLIVTYEGEHNHSRILSSQSAHT 330

BLAST of Cp4.1LG15g00430 vs. Swiss-Prot
Match: WRK21_ARATH (Probable WRKY transcription factor 21 OS=Arabidopsis thaliana GN=WRKY21 PE=2 SV=1)

HSP 1 Score: 258.1 bits (658), Expect = 1.3e-67
Identity = 133/191 (69.63%), Positives = 158/191 (82.72%), Query Frame = 1

Query: 126 QQQQQQRKQAEMMFLRSNSGMNMNFDASKCTLTMSSARSFISSLSMDGSVAD---GSSFH 185
           QQQQ Q+ QAE+M  + N G++++FD S CT TMSS RSF+SSLS+DGSVA+    +SFH
Sbjct: 190 QQQQLQKHQAELMLRKCNGGISLSFDNSSCTPTMSSTRSFVSSLSIDGSVANIEGKNSFH 249

Query: 186 LIGPSSTMSGD--NKRRFSGRGDE-GSLKCGSTGKCHCSKKRKHRVKRSIKVPAISNKLA 245
              PSST      +KR+   +GDE GSLKCGS+ +CHC+KKRKHRV+RSI+VPAISNK+A
Sbjct: 250 FGVPSSTDQNSLHSKRKCPLKGDEHGSLKCGSSSRCHCAKKRKHRVRRSIRVPAISNKVA 309

Query: 246 DIPSDDYSWRKYGQKPIKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHN 305
           DIP DDYSWRKYGQKPIKGSP+PRGYYKCSSMRGCPARKHVERCLEDP+MLIVTYE EHN
Sbjct: 310 DIPPDDYSWRKYGQKPIKGSPYPRGYYKCSSMRGCPARKHVERCLEDPAMLIVTYEAEHN 369

Query: 306 HPKMSTQSAHT 311
           HPK+ +Q+  T
Sbjct: 370 HPKLPSQAITT 380

BLAST of Cp4.1LG15g00430 vs. Swiss-Prot
Match: WRKY1_MAIZE (Protein WRKY1 OS=Zea mays PE=1 SV=1)

HSP 1 Score: 221.1 bits (562), Expect = 1.7e-56
Identity = 137/276 (49.64%), Positives = 173/276 (62.68%), Query Frame = 1

Query: 70  IPLPLPQKFLLDPPNNGLNGKISVFLGNPDLEMDQ--NHKNSLKIHKQAAPSLNFSFPQQ 129
           IP   P++ LL+ P  G+ G  S     P ++M Q  +          A P  +  F QQ
Sbjct: 127 IPTQFPKRLLLEKPTAGMEGSTSQ--SPPIVQMVQPVSVAPPAGTPTPALPPAHLHFIQQ 186

Query: 130 QQ--------QQRKQAEMMFLRSN----------------SGMNMNFDASKCTLTMSSAR 189
           QQ        QQ K    M  RSN                 G+N+ FD+S CT   SS+R
Sbjct: 187 QQSYQRFQLMQQMKIQSEMMKRSNLGDQGGSLSGGGGGGRKGVNLKFDSSNCTA--SSSR 246

Query: 190 SFISSLSMDGSVA--DGS----SFHLIGPSSTMSGD-----NKRRFSGRGDEGSLKCGST 249
           SF+SSLSM+GS+A  DGS     F L+  S T S        +RR +GR ++G+ +C + 
Sbjct: 247 SFLSSLSMEGSLASLDGSRTSRPFQLLSGSQTASTPELGLVQRRRCAGR-EDGTGRCATG 306

Query: 250 GKCHCSKKRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKCSSMR 309
            +CHCSKKRK R++RSIKVPAISNK+ADIP+D++SWRKYGQKPIKGSPHPRGYYKCSS+R
Sbjct: 307 SRCHCSKKRKLRIRRSIKVPAISNKVADIPADEFSWRKYGQKPIKGSPHPRGYYKCSSVR 366

BLAST of Cp4.1LG15g00430 vs. Swiss-Prot
Match: WRK17_ARATH (Probable WRKY transcription factor 17 OS=Arabidopsis thaliana GN=WRKY17 PE=2 SV=2)

HSP 1 Score: 181.0 bits (458), Expect = 2.0e-44
Identity = 124/318 (38.99%), Positives = 177/318 (55.66%), Query Frame = 1

Query: 4   LEEANRAAIDSCHGVLNLLAHPSPQEQGELRKNLMVETGEAVFKLKKVVSLLNSGFGYAK 63
           ++EA    + S   ++ +L++  P+E+      +   T   V K KKV+SLLN   G+A 
Sbjct: 17  IQEAASQGLKSMEHLIRVLSN-RPEERNVDCSEI---TDFTVSKFKKVISLLNRS-GHA- 76

Query: 64  VRRFKNIPLPLPQKFLLDPPNNGLNGKISVFLGNPDLEMDQNHKNSLKIHKQAAPSLNFS 123
             RF+  P+  P    + PP       + V    P  ++      S     Q + +L+F+
Sbjct: 77  --RFRRGPVHSPPSSSVPPP-------VKVTTPAPT-QISAPAPVSFVQANQQSVTLDFT 136

Query: 124 FPQQQQQQRKQAEMMFLRSNSGMNMNFDASKCTLTMSSARSFISS-LSMDGSVADGSSFH 183
            P     + K +E++            + +K + ++SS  SF+SS ++ DGSV+ GSS  
Sbjct: 137 RPSVFGAKTKSSEVV------------EFAKESFSVSSNSSFMSSAITGDGSVSKGSSIF 196

Query: 184 LI-GPSSTMSGDNKRRFSG-------------RGDEGSLKCGSTGKCHCSKKRKHRVKRS 243
           L   P+  ++   K   SG              G  G +     GKCHC K RK+R+KR+
Sbjct: 197 LAPAPAVPVTSSGKPPLSGLPYRKRCFEHDHSEGFSGKISGSGNGKCHCKKSRKNRMKRT 256

Query: 244 IKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKCSSMRGCPARKHVERCLEDPS 303
           ++VPA+S K+ADIP D+YSWRKYGQKPIKGSPHPRGYYKCS+ RGCPARKHVER L+D +
Sbjct: 257 VRVPAVSAKIADIPPDEYSWRKYGQKPIKGSPHPRGYYKCSTFRGCPARKHVERALDDST 306

Query: 304 MLIVTYEGEHNHPKMSTQ 307
           MLIVTYEGEH H + + Q
Sbjct: 317 MLIVTYEGEHRHHQSTMQ 306

BLAST of Cp4.1LG15g00430 vs. TrEMBL
Match: E7CEW1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=WRKY9 PE=2 SV=1)

HSP 1 Score: 459.1 bits (1180), Expect = 4.1e-126
Identity = 248/349 (71.06%), Positives = 273/349 (78.22%), Query Frame = 1

Query: 1   MEELEEANRAAIDSCHGVLNLLAHP--SPQEQGELRKNLMVETGEAVFKLKKVVSLLNSG 60
           MEE+EEAN++AI+SCHGVLNLL  P  SP       KNLM+ET EAVFK KKV+SLLNS 
Sbjct: 1   MEEVEEANKSAIESCHGVLNLLLQPPPSPHHHQHHFKNLMLETKEAVFKFKKVISLLNSD 60

Query: 61  FGYAKVRRFKNIPLPLPQKFLLDPPNN--------------GLNGKISVFLGNPDLEMDQ 120
           F + + R F  IPLPLPQ  LLD PN               G N K+S+ LGNPDLE+ Q
Sbjct: 61  FSHPRFRNFNKIPLPLPQNSLLDSPNYTLHPPNKNLFNSPPGFNSKVSILLGNPDLELSQ 120

Query: 121 NHKNSLKIHKQAAPSLNFSFP-----QQQQQQRKQ--------------AEMMFLRSNSG 180
           N KNSL I KQ+ PSL+FSFP     QQQQQQ++Q              AEMMFLR+N+G
Sbjct: 121 NDKNSLHIPKQS-PSLSFSFPHHHHPQQQQQQQQQQQSLLAHQKQMKHQAEMMFLRNNNG 180

Query: 181 MNMNFDASKCTLTMSSARSFISSLSMDGSV-ADGSSFHLIGPSSTM---SGDNKRRFSGR 240
           MN+NFD S CT+TMSSARSFISSLSMDGSV  D SSFHLIGPS+T    SG++KR+FS R
Sbjct: 181 MNLNFDTSNCTMTMSSARSFISSLSMDGSVIGDRSSFHLIGPSTTTTTTSGNSKRKFSAR 240

Query: 241 GDEGSLKCGSTGKCHCSKKRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPH 300
           G+EGSLKCGST KCHCSKKRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPH
Sbjct: 241 GEEGSLKCGSTSKCHCSKKRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPH 300

Query: 301 PRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAHT 311
           PRGYYKCSS+RGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAHT
Sbjct: 301 PRGYYKCSSIRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAHT 348

BLAST of Cp4.1LG15g00430 vs. TrEMBL
Match: W9RBZ3_9ROSA (Putative WRKY transcription factor 21 OS=Morus notabilis GN=L484_005757 PE=4 SV=1)

HSP 1 Score: 405.2 bits (1040), Expect = 7.1e-110
Identity = 216/355 (60.85%), Positives = 262/355 (73.80%), Query Frame = 1

Query: 1   MEELEEANRAAIDSCHGVLNLLAHPSPQEQGELRKNLMVETGEAVFKLKKVVSLLNSGFG 60
           MEE+EEAN+AA++SCH VL+ L+ P  Q Q   R+NL+VETGEAVF+ K+VVSLLN+G G
Sbjct: 1   MEEVEEANKAAVESCHRVLSSLSQPQDQVQ---RRNLVVETGEAVFRFKRVVSLLNTGLG 60

Query: 61  YAKVRRFKNIPLPLPQKFLLDPPN-----------------------------NGLNGKI 120
           +A+VR+ K +P+ LPQ  LLD PN                              G N K 
Sbjct: 61  HARVRKLKKLPISLPQSILLDNPNCSRTGTDQTPSKTPLFLQSSCFPENSLQEMGSNVKS 120

Query: 121 SVFLGNPDLEMDQNHKNSLKIHKQAAPSLNFSFPQQQQQQ------------RKQAEMMF 180
           S+ LGNP LE+  + KN +++ +Q  P+ ++ F  QQQQQ            ++QAEM+F
Sbjct: 121 SLCLGNPSLELSSSGKNPIQLSQQPNPAAHYHFLHQQQQQQQRLLLHQQQQMKQQAEMLF 180

Query: 181 LRSNSGMNMNFDASKCTLTMSSARSFISSLSMDGSVA--DGSSFHLIGP--SSTMSGDNK 240
            +SNSG+N+NFD+S CT TMSS RSFISSLS+DGSVA  DGS+FHLIG   SS  +  +K
Sbjct: 181 RKSNSGINLNFDSSSCTPTMSSTRSFISSLSIDGSVANLDGSAFHLIGAPRSSDQNSQHK 240

Query: 241 RRFSGRGDEGSLKCGSTGKCHCSKKRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKP 300
           R+ S RG++GS+KCGS+G+CHCSKKRKHRVKRSIKVPAISNKLADIP DDYSWRKYGQKP
Sbjct: 241 RKCSVRGEDGSVKCGSSGRCHCSKKRKHRVKRSIKVPAISNKLADIPPDDYSWRKYGQKP 300

Query: 301 IKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAHT 311
           IKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHP++  QSA+T
Sbjct: 301 IKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPRIQPQSANT 352

BLAST of Cp4.1LG15g00430 vs. TrEMBL
Match: A0A141E6C1_MORAL (Transcription factor WRKY21-like protein OS=Morus alba PE=2 SV=1)

HSP 1 Score: 404.4 bits (1038), Expect = 1.2e-109
Identity = 215/355 (60.56%), Positives = 262/355 (73.80%), Query Frame = 1

Query: 1   MEELEEANRAAIDSCHGVLNLLAHPSPQEQGELRKNLMVETGEAVFKLKKVVSLLNSGFG 60
           MEE+EEAN+AA++SCH VL++L+ P  Q Q   R+NL+VETGEAVF+ K+VVSLLN+G G
Sbjct: 1   MEEVEEANKAAVESCHRVLSILSQPQDQVQ---RRNLVVETGEAVFRFKRVVSLLNTGLG 60

Query: 61  YAKVRRFKNIPLPLPQKFLLDPPN-----------------------------NGLNGKI 120
           +A+VR+ K +P+ LPQ  LLD PN                              G N K 
Sbjct: 61  HARVRKLKKLPVSLPQSILLDNPNCSRTGTDQTPSKTPHFLQSSCFPENPLQEMGSNVKS 120

Query: 121 SVFLGNPDLEMDQNHKNSLKIHKQAAPSLNFSFPQQQQQQ------------RKQAEMMF 180
           S+ LGNP LE+  + K  +++ +Q  P+ ++ F  QQQQQ            ++QAEM+F
Sbjct: 121 SLCLGNPSLELSSSGKTQIQLSQQPNPAAHYHFLHQQQQQQQRLLLHQQQQMKQQAEMLF 180

Query: 181 LRSNSGMNMNFDASKCTLTMSSARSFISSLSMDGSVA--DGSSFHLIGP--SSTMSGDNK 240
            +SNSG+N+NFD+S CT TMSS RSFISSLS+DGSVA  DGS+FHLIG   SS  +  +K
Sbjct: 181 RKSNSGINLNFDSSSCTPTMSSTRSFISSLSIDGSVANLDGSAFHLIGAPRSSDQNSQHK 240

Query: 241 RRFSGRGDEGSLKCGSTGKCHCSKKRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKP 300
           R+ S RG++GS+KCGS+G+CHCSKKRKHRVKRSIKVPAISNKLADIP DDYSWRKYGQKP
Sbjct: 241 RKCSVRGEDGSVKCGSSGRCHCSKKRKHRVKRSIKVPAISNKLADIPPDDYSWRKYGQKP 300

Query: 301 IKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAHT 311
           IKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHP++  QSA+T
Sbjct: 301 IKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPRIQPQSANT 352

BLAST of Cp4.1LG15g00430 vs. TrEMBL
Match: A0A059D2M1_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_B01503 PE=4 SV=1)

HSP 1 Score: 399.4 bits (1025), Expect = 3.9e-108
Identity = 218/347 (62.82%), Positives = 263/347 (75.79%), Query Frame = 1

Query: 1   MEELEEANRAAIDSCHGVLNLLAHPSPQEQGELRKNLMVETGEAVFKLKKVVSLLNSGFG 60
           MEE+EEANRAA++S H VL+LL+ P  Q Q    +NLM+ETG+AVF+ K+VVSLLN+  G
Sbjct: 1   MEEVEEANRAAVESSHRVLSLLSQPQDQVQC---RNLMLETGDAVFRFKRVVSLLNTCLG 60

Query: 61  YAKVRRFKNIPLPLPQKFLLD----------------PPNNGLNG--------KISVFLG 120
           +A+VR+ K +P PLPQK LLD                PPN   N         K S+ LG
Sbjct: 61  HARVRKLKKLPTPLPQKALLDNPIVRTDQSSKSLQLLPPNYPENAIQELNTHHKASLSLG 120

Query: 121 NPDLEMDQNHKNSLKIHKQAAPSLNFSFPQQQQQQRK---------QAEMMFLRSNSGMN 180
           NP+LE+  N K+ L + +QA PS ++ F QQQQQQ+K         QAEMM+ RSNSG++
Sbjct: 121 NPNLELSSNGKSPLHLAQQA-PSAHYHFLQQQQQQQKLQFQQQMKHQAEMMYHRSNSGIS 180

Query: 181 MNFDASKCTLTMSSARSFISSLSMDGSVA--DGSSFHLIGP--SSTMSGDNKRRFSGRGD 240
           +NFD+S CT TMSS RSFISSLS+DGSVA  D ++FHLIG   SS  +  +KR+ SGRG+
Sbjct: 181 LNFDSSSCTPTMSSTRSFISSLSIDGSVANLDANAFHLIGAARSSDQNSQHKRKCSGRGE 240

Query: 241 EGSLKCGSTGKCHCSKKRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPR 300
           +GS+KCGS+ +CHCSKKRKHRVKRSIKVPAISNKLADIP DDYSWRKYGQKPIKGSPHPR
Sbjct: 241 DGSMKCGSSSRCHCSKKRKHRVKRSIKVPAISNKLADIPPDDYSWRKYGQKPIKGSPHPR 300

Query: 301 GYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAHT 311
           GYYKCSSMRGCPARKHVERCLEDPSMLIVTYEG+HNHP++ +QSA+T
Sbjct: 301 GYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGDHNHPRVPSQSANT 343

BLAST of Cp4.1LG15g00430 vs. TrEMBL
Match: A0A140H8N2_MANES (WRKY transcription factor 28 OS=Manihot esculenta PE=2 SV=1)

HSP 1 Score: 399.1 bits (1024), Expect = 5.1e-108
Identity = 218/349 (62.46%), Positives = 254/349 (72.78%), Query Frame = 1

Query: 1   MEELEEANRAAIDSCHGVLNLLAHPSPQEQGELRKNLMVETGEAVFKLKKVVSLLNSGFG 60
           MEE+EEANR A++SCH VL LL+ P  Q Q    +NLMVETGEAVF+ K+V+SLLNS  G
Sbjct: 1   MEEVEEANRTAVESCHRVLGLLSQPQDQVQ---YRNLMVETGEAVFRFKRVISLLNSNLG 60

Query: 61  YAKVRRFKNIPLPLPQKFLLDPPNN--------------------------GLNGKISVF 120
           +A+VR+ K +P PL Q  LLD P++                          G N K S+ 
Sbjct: 61  HARVRKLKKLPTPLCQSLLLDNPHHRTDLQSKNFQFMQSGSYLDSRPIQELGSNAKNSLC 120

Query: 121 LGNPDLEMDQNHKNSLKIHKQAAPSLNFSFPQQQQ--------QQRKQAEMMFLRSNSGM 180
            G P LE+  N KN L  H Q  PS +++F QQQQ        Q ++QAEMMF RSNSG+
Sbjct: 121 FGTPSLELSSNGKNPLH-HSQQTPSTHYNFLQQQQRLQLQQQQQMKQQAEMMFRRSNSGI 180

Query: 181 NMNFDASKCTLTMSSARSFISSLSMDGSVA--DGSSFHLIGP---SSTMSGDNKRRFSGR 240
           N+NFD S CT TMSS RSFISSLS+DGSVA  +GS+FHLIG    S   S   KR+ SGR
Sbjct: 181 NLNFDNSSCTPTMSSTRSFISSLSIDGSVANLEGSAFHLIGAPRSSDQNSQQLKRKCSGR 240

Query: 241 GDEGSLKCGSTGKCHCSKKRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPH 300
            ++GS+KCGS+G+CHCSKKRKHRVKRSIKVPAISNKLADIP DDYSWRKYGQKPIKGSPH
Sbjct: 241 VEDGSVKCGSSGRCHCSKKRKHRVKRSIKVPAISNKLADIPPDDYSWRKYGQKPIKGSPH 300

Query: 301 PRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAHT 311
           PRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHP++ +QSA+T
Sbjct: 301 PRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPRIPSQSANT 345

BLAST of Cp4.1LG15g00430 vs. TAIR10
Match: AT5G28650.1 (AT5G28650.1 WRKY DNA-binding protein 74)

HSP 1 Score: 285.8 bits (730), Expect = 3.2e-77
Identity = 178/338 (52.66%), Positives = 216/338 (63.91%), Query Frame = 1

Query: 1   MEELEEANRAAIDSCHGVLNLLAHPSPQEQGELRKNLMVETGEAVFKLKKVVSLLNSGFG 60
           MEE+E AN+AA++SCHGVLNLL+     +Q    K++MVET EAV K K+V SLL+ G G
Sbjct: 1   MEEVEAANKAAVESCHGVLNLLS-----QQTNDSKSIMVETREAVCKFKRVSSLLSRGLG 60

Query: 61  YAKVRRFKNIPLP-----LPQKFLLDPP---NNGLNGKISVFL---------GNPDLEM- 120
             K+++  N         LPQ   L+ P   NN ++G I +           G P L + 
Sbjct: 61  QRKIKKLNNNNYKFSSSLLPQHMFLESPVCSNNAISGCIPILAPKPLQIVPAGPPPLMLF 120

Query: 121 DQNHKNSLKIHKQAAPSLNFSFPQQQQQQRKQAEMMFLRSNSGMNMNFDASKCTLTMS-- 180
           +QN        +   PS     P+  Q      + ++ RS SG+N+ FD S      S  
Sbjct: 121 NQNMCLDKSFLELKPPSSRAVDPKPYQFIHTHQQGVYSRSKSGLNLKFDGSIGASCYSPS 180

Query: 181 ---SARSFISSLSMDGSVA--DGSSFHLIG--PSSTMSGDNKRRFSGRGDEGSLKCGSTG 240
               +RSF+SSLSMDGSV   D +SFHLIG    S     + RR S     GSLKCGS  
Sbjct: 181 ISNGSRSFVSSLSMDGSVTDYDRNSFHLIGLPQGSDHISQHSRRTS---CSGSLKCGSKS 240

Query: 241 KCHCSKKRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKCSSMRG 300
           KCHCSKKRK RVKRSIKVPAISNK+ADIP D+YSWRKYGQKPIKGSPHPRGYYKCSS+RG
Sbjct: 241 KCHCSKKRKLRVKRSIKVPAISNKIADIPPDEYSWRKYGQKPIKGSPHPRGYYKCSSVRG 300

Query: 301 CPARKHVERCLEDPSMLIVTYEGEHNHPK-MSTQSAHT 311
           CPARKHVERC+E+ SMLIVTYEGEHNH + +S+QSAHT
Sbjct: 301 CPARKHVERCVEETSMLIVTYEGEHNHSRILSSQSAHT 330

BLAST of Cp4.1LG15g00430 vs. TAIR10
Match: AT3G04670.1 (AT3G04670.1 WRKY DNA-binding protein 39)

HSP 1 Score: 284.3 bits (726), Expect = 9.2e-77
Identity = 172/339 (50.74%), Positives = 207/339 (61.06%), Query Frame = 1

Query: 1   MEELEEANRAAIDSCHGVLNLLAHPSPQEQGELRKNLMVETGEAVFKLKKVVSLLNSGFG 60
           MEE+E ANR+AI+SCHGVLNLL+  +        K+L VETGE V K K+V SLL  G G
Sbjct: 1   MEEVEAANRSAIESCHGVLNLLSQRTSDP-----KSLTVETGEVVSKFKRVASLLTRGLG 60

Query: 61  YAKVRRFKNIPLPLPQKFLLDPP---NNGLNGKISVFLGNPDLEM---DQNHKNSLKIHK 120
           + K R         PQ   L+ P    N L+G  +  L    L+M      +      H+
Sbjct: 61  HGKFRSTNKFRSSFPQHIFLESPICCGNDLSGDYTQVLAPEPLQMVPASAVYNEMEPKHQ 120

Query: 121 QAAPSLNFS----------------FPQQQQQQRKQAEMMFLRSNSGMNMNFDASKCTL- 180
              PSL  S                F    Q      ++ + RSNSG+N+ FD S  +  
Sbjct: 121 LGHPSLMLSHKMCVDKSFLELKPPPFRAPYQLIHNHQQIAYSRSNSGVNLKFDGSGSSCY 180

Query: 181 ---TMSSARSFISSLSMDGSVA--DGSSFHLIGPSSTMSGDNKRRFSGRGDEGSLKCGST 240
                + +RSF+SSLSMD SV   D +SFHL G S      + R+       GSLKCGS 
Sbjct: 181 TPSVSNGSRSFVSSLSMDASVTDYDRNSFHLTGLSRGSDQQHTRKMC----SGSLKCGSR 240

Query: 241 GKCHCSKKRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKCSSMR 300
            KCHCSKKRK RVKRSIKVPAISNK+ADIP D+YSWRKYGQKPIKGSPHPRGYYKCSS+R
Sbjct: 241 SKCHCSKKRKLRVKRSIKVPAISNKIADIPPDEYSWRKYGQKPIKGSPHPRGYYKCSSVR 300

Query: 301 GCPARKHVERCLEDPSMLIVTYEGEHNHPK-MSTQSAHT 311
           GCPARKHVERC+++ SMLIVTYEGEHNH + +S+QSAHT
Sbjct: 301 GCPARKHVERCIDETSMLIVTYEGEHNHSRILSSQSAHT 330

BLAST of Cp4.1LG15g00430 vs. TAIR10
Match: AT2G30590.1 (AT2G30590.1 WRKY DNA-binding protein 21)

HSP 1 Score: 258.1 bits (658), Expect = 7.1e-69
Identity = 133/191 (69.63%), Positives = 158/191 (82.72%), Query Frame = 1

Query: 126 QQQQQQRKQAEMMFLRSNSGMNMNFDASKCTLTMSSARSFISSLSMDGSVAD---GSSFH 185
           QQQQ Q+ QAE+M  + N G++++FD S CT TMSS RSF+SSLS+DGSVA+    +SFH
Sbjct: 190 QQQQLQKHQAELMLRKCNGGISLSFDNSSCTPTMSSTRSFVSSLSIDGSVANIEGKNSFH 249

Query: 186 LIGPSSTMSGD--NKRRFSGRGDE-GSLKCGSTGKCHCSKKRKHRVKRSIKVPAISNKLA 245
              PSST      +KR+   +GDE GSLKCGS+ +CHC+KKRKHRV+RSI+VPAISNK+A
Sbjct: 250 FGVPSSTDQNSLHSKRKCPLKGDEHGSLKCGSSSRCHCAKKRKHRVRRSIRVPAISNKVA 309

Query: 246 DIPSDDYSWRKYGQKPIKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHN 305
           DIP DDYSWRKYGQKPIKGSP+PRGYYKCSSMRGCPARKHVERCLEDP+MLIVTYE EHN
Sbjct: 310 DIPPDDYSWRKYGQKPIKGSPYPRGYYKCSSMRGCPARKHVERCLEDPAMLIVTYEAEHN 369

Query: 306 HPKMSTQSAHT 311
           HPK+ +Q+  T
Sbjct: 370 HPKLPSQAITT 380

BLAST of Cp4.1LG15g00430 vs. TAIR10
Match: AT2G24570.1 (AT2G24570.1 WRKY DNA-binding protein 17)

HSP 1 Score: 181.0 bits (458), Expect = 1.1e-45
Identity = 124/318 (38.99%), Positives = 177/318 (55.66%), Query Frame = 1

Query: 4   LEEANRAAIDSCHGVLNLLAHPSPQEQGELRKNLMVETGEAVFKLKKVVSLLNSGFGYAK 63
           ++EA    + S   ++ +L++  P+E+      +   T   V K KKV+SLLN   G+A 
Sbjct: 17  IQEAASQGLKSMEHLIRVLSN-RPEERNVDCSEI---TDFTVSKFKKVISLLNRS-GHA- 76

Query: 64  VRRFKNIPLPLPQKFLLDPPNNGLNGKISVFLGNPDLEMDQNHKNSLKIHKQAAPSLNFS 123
             RF+  P+  P    + PP       + V    P  ++      S     Q + +L+F+
Sbjct: 77  --RFRRGPVHSPPSSSVPPP-------VKVTTPAPT-QISAPAPVSFVQANQQSVTLDFT 136

Query: 124 FPQQQQQQRKQAEMMFLRSNSGMNMNFDASKCTLTMSSARSFISS-LSMDGSVADGSSFH 183
            P     + K +E++            + +K + ++SS  SF+SS ++ DGSV+ GSS  
Sbjct: 137 RPSVFGAKTKSSEVV------------EFAKESFSVSSNSSFMSSAITGDGSVSKGSSIF 196

Query: 184 LI-GPSSTMSGDNKRRFSG-------------RGDEGSLKCGSTGKCHCSKKRKHRVKRS 243
           L   P+  ++   K   SG              G  G +     GKCHC K RK+R+KR+
Sbjct: 197 LAPAPAVPVTSSGKPPLSGLPYRKRCFEHDHSEGFSGKISGSGNGKCHCKKSRKNRMKRT 256

Query: 244 IKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKCSSMRGCPARKHVERCLEDPS 303
           ++VPA+S K+ADIP D+YSWRKYGQKPIKGSPHPRGYYKCS+ RGCPARKHVER L+D +
Sbjct: 257 VRVPAVSAKIADIPPDEYSWRKYGQKPIKGSPHPRGYYKCSTFRGCPARKHVERALDDST 306

Query: 304 MLIVTYEGEHNHPKMSTQ 307
           MLIVTYEGEH H + + Q
Sbjct: 317 MLIVTYEGEHRHHQSTMQ 306

BLAST of Cp4.1LG15g00430 vs. TAIR10
Match: AT4G31550.1 (AT4G31550.1 WRKY DNA-binding protein 11)

HSP 1 Score: 178.7 bits (452), Expect = 5.5e-45
Identity = 124/322 (38.51%), Positives = 175/322 (54.35%), Query Frame = 1

Query: 4   LEEANRAAIDSCHGVLNLLAHPSPQEQGELRKNLMVETGEAVFKLKKVVSLLNSGFGYAK 63
           ++EA    + S   ++ +L++  P++Q  +  + +  T   V K K V+SLLN   G+A+
Sbjct: 17  IQEAASQGLQSMEHLIRVLSN-RPEQQHNVDCSEI--TDFTVSKFKTVISLLNRT-GHAR 76

Query: 64  VRRFKNIPLPLPQKFLLDPPNNGLNGKISVFLGNPDLEMDQNHKNSLKIHKQAAP----- 123
            RR        P        +  L  +I V    P+  + +   N    H Q  P     
Sbjct: 77  FRRG-------PVHSTSSAASQKLQSQI-VKNTQPEAPIVRTTTN----HPQIVPPPSSV 136

Query: 124 SLNFSFPQQQQQQRKQAEMMFLRSNSGMNMNFDASKCTLTMSSARSFISS-LSMDGSVAD 183
           +L+FS P     + K AE+ F + N  +++N              SF+SS ++ DGSV++
Sbjct: 137 TLDFSKPSIFGTKAKSAELEFSKENFSVSLN-------------SSFMSSAITGDGSVSN 196

Query: 184 GSSFHLIGPSSTMSGDNKRRFSGR-------------GDEGSLKCGSTGKCHCSKKRKHR 243
           G  F    P   ++   K   +G                 G +   + GKCHC K RK+R
Sbjct: 197 GKIFLASAPLQPVNSSGKPPLAGHPYRKRCLEHEHSESFSGKVSGSAYGKCHCKKSRKNR 256

Query: 244 VKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYYKCSSMRGCPARKHVERCL 303
           +KR+++VPAIS K+ADIP D+YSWRKYGQKPIKGSPHPRGYYKCS+ RGCPARKHVER L
Sbjct: 257 MKRTVRVPAISAKIADIPPDEYSWRKYGQKPIKGSPHPRGYYKCSTFRGCPARKHVERAL 309

Query: 304 EDPSMLIVTYEGEHNHPKMSTQ 307
           +DP+MLIVTYEGEH H + + Q
Sbjct: 317 DDPAMLIVTYEGEHRHNQSAMQ 309

BLAST of Cp4.1LG15g00430 vs. NCBI nr
Match: gi|449474207|ref|XP_004154104.1| (PREDICTED: probable WRKY transcription factor 21 isoform X1 [Cucumis sativus])

HSP 1 Score: 459.1 bits (1180), Expect = 5.9e-126
Identity = 248/349 (71.06%), Positives = 273/349 (78.22%), Query Frame = 1

Query: 1   MEELEEANRAAIDSCHGVLNLLAHP--SPQEQGELRKNLMVETGEAVFKLKKVVSLLNSG 60
           MEE+EEAN++AI+SCHGVLNLL  P  SP       KNLM+ET EAVFK KKV+SLLNS 
Sbjct: 1   MEEVEEANKSAIESCHGVLNLLLQPPPSPHHHQHHFKNLMLETKEAVFKFKKVISLLNSD 60

Query: 61  FGYAKVRRFKNIPLPLPQKFLLDPPNN--------------GLNGKISVFLGNPDLEMDQ 120
           F + + R F  IPLPLPQ  LLD PN               G N K+S+ LGNPDLE+ Q
Sbjct: 61  FSHPRFRNFNKIPLPLPQNSLLDSPNYTLHPPNKNLFNSPPGFNSKVSILLGNPDLELSQ 120

Query: 121 NHKNSLKIHKQAAPSLNFSFP-----QQQQQQRKQ--------------AEMMFLRSNSG 180
           N KNSL I KQ+ PSL+FSFP     QQQQQQ++Q              AEMMFLR+N+G
Sbjct: 121 NDKNSLHIPKQS-PSLSFSFPHHHHPQQQQQQQQQQQSLLAHQKQMKHQAEMMFLRNNNG 180

Query: 181 MNMNFDASKCTLTMSSARSFISSLSMDGSV-ADGSSFHLIGPSSTM---SGDNKRRFSGR 240
           MN+NFD S CT+TMSSARSFISSLSMDGSV  D SSFHLIGPS+T    SG++KR+FS R
Sbjct: 181 MNLNFDTSNCTMTMSSARSFISSLSMDGSVIGDRSSFHLIGPSTTTTTTSGNSKRKFSAR 240

Query: 241 GDEGSLKCGSTGKCHCSKKRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPH 300
           G+EGSLKCGST KCHCSKKRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPH
Sbjct: 241 GEEGSLKCGSTSKCHCSKKRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPH 300

Query: 301 PRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAHT 311
           PRGYYKCSS+RGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAHT
Sbjct: 301 PRGYYKCSSIRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAHT 348

BLAST of Cp4.1LG15g00430 vs. NCBI nr
Match: gi|659096892|ref|XP_008449343.1| (PREDICTED: probable WRKY transcription factor 21 [Cucumis melo])

HSP 1 Score: 456.1 bits (1172), Expect = 5.0e-125
Identity = 246/344 (71.51%), Positives = 271/344 (78.78%), Query Frame = 1

Query: 1   MEELEEANRAAIDSCHGVLNLLAHPSPQEQGELR-KNLMVETGEAVFKLKKVVSLLNSGF 60
           MEE+EEA ++AI+SCHGVLNLL  P P    +   KNLMVET EAVFK KKV+SLLNS F
Sbjct: 1   MEEVEEATKSAIESCHGVLNLLLQPPPSPPHQHHFKNLMVETKEAVFKFKKVISLLNSDF 60

Query: 61  GYAKVRRFKNIPLPLPQKFLLDPPNN--------------GLNGKISVFLGNPDLEMDQN 120
            + + R F  IPLPLPQ  LLD PN               G N K+S+FLGNPDLE+ QN
Sbjct: 61  SHPRFRNFNKIPLPLPQNSLLDSPNYTLHPPNKNLFNFPPGSNSKVSIFLGNPDLELSQN 120

Query: 121 HKNSLKIHKQAAPSLNFSFP-----QQQQQQ----------RKQAEMMFLRSNSGMNMNF 180
            KN+L I KQ+ PSLNFSFP     QQQQQQ          ++QAEM FLR+N+GMN+NF
Sbjct: 121 DKNTLHIPKQS-PSLNFSFPHHHHHQQQQQQQSVLAHQKQMKQQAEMTFLRNNNGMNLNF 180

Query: 181 DASKCTLTMSSARSFISSLSMDGSV-ADGSSFHLIGPSSTM---SGDNKRRFSGRGDEGS 240
           D S CTLTMSSARSFISSLSMDGSV  D SSFHLIGPS+T    SG++KR+FS RG+EGS
Sbjct: 181 DTSNCTLTMSSARSFISSLSMDGSVIGDRSSFHLIGPSTTTTTTSGNSKRKFSARGEEGS 240

Query: 241 LKCGSTGKCHCSKKRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYY 300
           LKCGST KCHCSKKRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYY
Sbjct: 241 LKCGSTSKCHCSKKRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPHPRGYY 300

Query: 301 KCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAHT 311
           KCSS+RGCPARKHVERCLEDPSMLIVTYEGEH+HPKM TQSAHT
Sbjct: 301 KCSSIRGCPARKHVERCLEDPSMLIVTYEGEHSHPKMLTQSAHT 343

BLAST of Cp4.1LG15g00430 vs. NCBI nr
Match: gi|778668993|ref|XP_011649180.1| (PREDICTED: probable WRKY transcription factor 21 isoform X2 [Cucumis sativus])

HSP 1 Score: 447.2 bits (1149), Expect = 2.3e-122
Identity = 245/349 (70.20%), Positives = 270/349 (77.36%), Query Frame = 1

Query: 1   MEELEEANRAAIDSCHGVLNLLAHP--SPQEQGELRKNLMVETGEAVFKLKKVVSLLNSG 60
           MEE+EEAN++AI+SCHGVLNLL  P  SP       KNLM+ET EAVFK KKV+SLLNS 
Sbjct: 1   MEEVEEANKSAIESCHGVLNLLLQPPPSPHHHQHHFKNLMLETKEAVFKFKKVISLLNSD 60

Query: 61  FGYAKVRRFKNIPLPLPQKFLLDPPNN--------------GLNGKISVFLGNPDLEMDQ 120
           F + + R F  IPLPLPQ  LLD PN               G N K+S+ LGNPDLE+ Q
Sbjct: 61  FSHPRFRNFNKIPLPLPQNSLLDSPNYTLHPPNKNLFNSPPGFNSKVSILLGNPDLELSQ 120

Query: 121 NHKNSLKIHKQAAPSLNFSFP-----QQQQQQRKQ--------------AEMMFLRSNSG 180
           N KNSL I KQ+ PSL+FSFP     QQQQQQ++Q              AEMMFLR+N+G
Sbjct: 121 NDKNSLHIPKQS-PSLSFSFPHHHHPQQQQQQQQQQQSLLAHQKQMKHQAEMMFLRNNNG 180

Query: 181 MNMNFDASKCTLTMSSARSFISSLSMDGSV-ADGSSFHLIGPSSTM---SGDNKRRFSGR 240
           MN+NFD S CT+TMSSARSFISSLSMDGSV  D SSFHLIGPS+T    SG++KR+FS R
Sbjct: 181 MNLNFDTSNCTMTMSSARSFISSLSMDGSVIGDRSSFHLIGPSTTTTTTSGNSKRKFSAR 240

Query: 241 GDEGSLKCGSTGKCHCSKKRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPH 300
           G+EGSLKCGST KCHCSKK   RVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPH
Sbjct: 241 GEEGSLKCGSTSKCHCSKK---RVKRSIKVPAISNKLADIPSDDYSWRKYGQKPIKGSPH 300

Query: 301 PRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAHT 311
           PRGYYKCSS+RGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAHT
Sbjct: 301 PRGYYKCSSIRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAHT 345
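
The Identity and Positives percentages in each HSP header are the match counts divided by the alignment length. The following is a minimal Python sketch that parses such a header line and recomputes the percentages. The helper names `parse_hsp_stats` and `percent` are hypothetical and not part of any BLAST tool, and the regular expression assumes the header layout shown on this page (it tolerates the "Postives" spelling that some exports use).

```python
import re

# Assumed header layout: "Identity = 245/349 (70.20%), Positives = 270/349 (77.36%)".
# "Posi?tives" also accepts the misspelling "Postives".
HSP_RE = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return {'identity': (matches, length, pct), 'positives': (...)} for one header line."""
    stats = {}
    for label, matches, length, pct in HSP_RE.findall(line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = (int(matches), int(length), float(pct))
    return stats


def percent(matches, length):
    """Recompute a percentage from the raw counts, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


header = "Identity = 245/349 (70.20%), Positives = 270/349 (77.36%), Query Frame = 1"
stats = parse_hsp_stats(header)
for name, (matches, length, reported) in stats.items():
    print(name, reported, percent(matches, length))
```

Recomputing the value from the counts is a quick consistency check when copying figures from the header into a summary table.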

BLAST of Cp4.1LG15g00430 vs. NCBI nr
Match: gi|703112292|ref|XP_010100083.1| (putative WRKY transcription factor 21 [Morus notabilis])

HSP 1 Score: 405.2 bits (1040), Expect = 1.0e-109
Identity = 216/355 (60.85%), Positives = 262/355 (73.80%), Query Frame = 1

Query: 1   MEELEEANRAAIDSCHGVLNLLAHPSPQEQGELRKNLMVETGEAVFKLKKVVSLLNSGFG 60
           MEE+EEAN+AA++SCH VL+ L+ P  Q Q   R+NL+VETGEAVF+ K+VVSLLN+G G
Sbjct: 1   MEEVEEANKAAVESCHRVLSSLSQPQDQVQ---RRNLVVETGEAVFRFKRVVSLLNTGLG 60

Query: 61  YAKVRRFKNIPLPLPQKFLLDPPN-----------------------------NGLNGKI 120
           +A+VR+ K +P+ LPQ  LLD PN                              G N K 
Sbjct: 61  HARVRKLKKLPISLPQSILLDNPNCSRTGTDQTPSKTPLFLQSSCFPENSLQEMGSNVKS 120

Query: 121 SVFLGNPDLEMDQNHKNSLKIHKQAAPSLNFSFPQQQQQQ------------RKQAEMMF 180
           S+ LGNP LE+  + KN +++ +Q  P+ ++ F  QQQQQ            ++QAEM+F
Sbjct: 121 SLCLGNPSLELSSSGKNPIQLSQQPNPAAHYHFLHQQQQQQQRLLLHQQQQMKQQAEMLF 180

Query: 181 LRSNSGMNMNFDASKCTLTMSSARSFISSLSMDGSVA--DGSSFHLIGP--SSTMSGDNK 240
            +SNSG+N+NFD+S CT TMSS RSFISSLS+DGSVA  DGS+FHLIG   SS  +  +K
Sbjct: 181 RKSNSGINLNFDSSSCTPTMSSTRSFISSLSIDGSVANLDGSAFHLIGAPRSSDQNSQHK 240

Query: 241 RRFSGRGDEGSLKCGSTGKCHCSKKRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKP 300
           R+ S RG++GS+KCGS+G+CHCSKKRKHRVKRSIKVPAISNKLADIP DDYSWRKYGQKP
Sbjct: 241 RKCSVRGEDGSVKCGSSGRCHCSKKRKHRVKRSIKVPAISNKLADIPPDDYSWRKYGQKP 300

Query: 301 IKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAHT 311
           IKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHP++  QSA+T
Sbjct: 301 IKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPRIQPQSANT 352

BLAST of Cp4.1LG15g00430 vs. NCBI nr
Match: gi|923555735|gb|ALB35175.1| (transcription factor WRKY21-like protein [Morus alba])

HSP 1 Score: 404.4 bits (1038), Expect = 1.7e-109
Identity = 215/355 (60.56%), Positives = 262/355 (73.80%), Query Frame = 1

Query: 1   MEELEEANRAAIDSCHGVLNLLAHPSPQEQGELRKNLMVETGEAVFKLKKVVSLLNSGFG 60
           MEE+EEAN+AA++SCH VL++L+ P  Q Q   R+NL+VETGEAVF+ K+VVSLLN+G G
Sbjct: 1   MEEVEEANKAAVESCHRVLSILSQPQDQVQ---RRNLVVETGEAVFRFKRVVSLLNTGLG 60

Query: 61  YAKVRRFKNIPLPLPQKFLLDPPN-----------------------------NGLNGKI 120
           +A+VR+ K +P+ LPQ  LLD PN                              G N K 
Sbjct: 61  HARVRKLKKLPVSLPQSILLDNPNCSRTGTDQTPSKTPHFLQSSCFPENPLQEMGSNVKS 120

Query: 121 SVFLGNPDLEMDQNHKNSLKIHKQAAPSLNFSFPQQQQQQ------------RKQAEMMF 180
           S+ LGNP LE+  + K  +++ +Q  P+ ++ F  QQQQQ            ++QAEM+F
Sbjct: 121 SLCLGNPSLELSSSGKTQIQLSQQPNPAAHYHFLHQQQQQQQRLLLHQQQQMKQQAEMLF 180

Query: 181 LRSNSGMNMNFDASKCTLTMSSARSFISSLSMDGSVA--DGSSFHLIGP--SSTMSGDNK 240
            +SNSG+N+NFD+S CT TMSS RSFISSLS+DGSVA  DGS+FHLIG   SS  +  +K
Sbjct: 181 RKSNSGINLNFDSSSCTPTMSSTRSFISSLSIDGSVANLDGSAFHLIGAPRSSDQNSQHK 240

Query: 241 RRFSGRGDEGSLKCGSTGKCHCSKKRKHRVKRSIKVPAISNKLADIPSDDYSWRKYGQKP 300
           R+ S RG++GS+KCGS+G+CHCSKKRKHRVKRSIKVPAISNKLADIP DDYSWRKYGQKP
Sbjct: 241 RKCSVRGEDGSVKCGSSGRCHCSKKRKHRVKRSIKVPAISNKLADIPPDDYSWRKYGQKP 300

Query: 301 IKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPKMSTQSAHT 311
           IKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHP++  QSA+T
Sbjct: 301 IKGSPHPRGYYKCSSMRGCPARKHVERCLEDPSMLIVTYEGEHNHPRIQPQSANT 352

The following BLAST results are available for this feature:
Match Name   E-value  Identity (%)  Description
WRK74_ARATH  5.6e-76  52.66         Probable WRKY transcription factor 74 OS=Arabidopsis thaliana GN=WRKY74 PE=2 SV=...
WRK39_ARATH  1.6e-75  50.74         Probable WRKY transcription factor 39 OS=Arabidopsis thaliana GN=WRKY39 PE=2 SV=...
WRK21_ARATH  1.3e-67  69.63         Probable WRKY transcription factor 21 OS=Arabidopsis thaliana GN=WRKY21 PE=2 SV=...
WRKY1_MAIZE  1.7e-56  49.64         Protein WRKY1 OS=Zea mays PE=1 SV=1
WRK17_ARATH  2.0e-44  38.99         Probable WRKY transcription factor 17 OS=Arabidopsis thaliana GN=WRKY17 PE=2 SV=...

Match Name        E-value   Identity (%)  Description
E7CEW1_CUCSA      4.1e-126  71.06         Uncharacterized protein OS=Cucumis sativus GN=WRKY9 PE=2 SV=1
W9RBZ3_9ROSA      7.1e-110  60.85         Putative WRKY transcription factor 21 OS=Morus notabilis GN=L484_005757 PE=4 SV=...
A0A141E6C1_MORAL  1.2e-109  60.56         Transcription factor WRKY21-like protein OS=Morus alba PE=2 SV=1
A0A059D2M1_EUCGR  3.9e-108  62.82         Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_B01503 PE=4 SV=1
A0A140H8N2_MANES  5.1e-108  62.46         WRKY transcription factor 28 OS=Manihot esculenta PE=2 SV=1

Match Name   E-value  Identity (%)  Description
AT5G28650.1  3.2e-77  52.66         WRKY DNA-binding protein 74
AT3G04670.1  9.2e-77  50.74         WRKY DNA-binding protein 39
AT2G30590.1  7.1e-69  69.63         WRKY DNA-binding protein 21
AT2G24570.1  1.1e-45  38.99         WRKY DNA-binding protein 17
AT4G31550.1  5.5e-45  38.51         WRKY DNA-binding protein 11

Match Name                        E-value   Identity (%)  Description
gi|449474207|ref|XP_004154104.1|  5.9e-126  71.06         PREDICTED: probable WRKY transcription factor 21 isoform X1 [Cucumis sativus]
gi|659096892|ref|XP_008449343.1|  5.0e-125  71.51         PREDICTED: probable WRKY transcription factor 21 [Cucumis melo]
gi|778668993|ref|XP_011649180.1|  2.3e-122  70.20         PREDICTED: probable WRKY transcription factor 21 isoform X2 [Cucumis sativus]
gi|703112292|ref|XP_010100083.1|  1.0e-109  60.85         putative WRKY transcription factor 21 [Morus notabilis]
gi|923555735|gb|ALB35175.1|       1.7e-109  60.56         transcription factor WRKY21-like protein [Morus alba]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0043565  sequence-specific DNA binding
GO:0003700  transcription factor activity, sequence-specific DNA binding
Vocabulary: Biological Process
Term        Definition
GO:0006355  regulation of transcription, DNA-templated
Vocabulary: INTERPRO
Term        Definition
IPR018872   Zn-cluster-dom
IPR003657   WRKY_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005667 transcription factor complex
molecular_function GO:0043565 sequence-specific DNA binding
molecular_function GO:0003700 transcription factor activity, sequence-specific DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG15g00430.1  Cp4.1LG15g00430.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description    Source   Source Term       Source Description                    Alignment
IPR003657  WRKY domain        GENE3D   G3DSA:2.20.25.80  -                                     coord: 230..302, score: 1.3
IPR003657  WRKY domain        PFAM     PF03106           WRKY                                  coord: 244..301, score: 7.6
IPR003657  WRKY domain        SMART    SM00774           WRKY_cls                              coord: 242..302, score: 9.3
IPR003657  WRKY domain        PROFILE  PS50811           WRKY                                  coord: 237..303, score: 30
IPR003657  WRKY domain        unknown  SSF118290         WRKY DNA-binding domain               coord: 240..302, score: 7.06
IPR018872  Zn-cluster domain  PFAM     PF10533           Plant_zn_clust                        coord: 200..240, score: 1.8
None       No IPR available   PANTHER  PTHR31282         FAMILY NOT NAMED                      coord: 151..300, score: 5.6E-138; coord: 1..110, score: 5.6E
None       No IPR available   PANTHER  PTHR31282:SF12    WRKY TRANSCRIPTION FACTOR 39-RELATED  coord: 151..300, score: 5.6E-138; coord: 1..110, score: 5.6E
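
The five WRKY domain (IPR003657) hits above come from different member databases and report slightly different boundaries on the protein. As a minimal sketch, the Python snippet below takes the coordinates listed in the table and computes the widest span covered by any hit and the core region shared by all hits. The function name `summarize_hits` and the dictionary layout are illustrative assumptions, not part of InterProScan.

```python
# WRKY domain (IPR003657) coordinates as listed in the InterPro table,
# keyed by the reporting source (SSF118290 is listed with source "unknown").
wrky_hits = {
    "GENE3D": (230, 302),
    "PFAM": (244, 301),
    "SMART": (242, 302),
    "PROFILE": (237, 303),
    "SSF118290": (240, 302),
}


def summarize_hits(hits):
    """Return (union_span, core_span) for a dict of {source: (start, end)}.

    union_span runs from the smallest start to the largest end and is only
    contiguous when the hits overlap; core_span is the region common to every
    hit, or None when the hits share no common residues.
    """
    starts = [start for start, _ in hits.values()]
    ends = [end for _, end in hits.values()]
    union_span = (min(starts), max(ends))
    core_start, core_end = max(starts), min(ends)
    core_span = (core_start, core_end) if core_start <= core_end else None
    return union_span, core_span


union_span, core_span = summarize_hits(wrky_hits)
print("widest span:", union_span)
print("shared core:", core_span)
```

With these coordinates the widest span is 230..303 and the region shared by all five hits is 244..301, which coincides with the PFAM boundaries.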

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG15g00430  Cp4.1LG05g01350  Cucurbita pepo (Zucchini)  cpecpeB273