Cp4.1LG10g05720 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG10g05720
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: WRKY protein
Location: Cp4.1LG10 : 689343 .. 691269 (-)
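
As a small illustration of how the location field can be read, the sketch below splits the string into scaffold, start, end and strand and computes the length of the span. It assumes the coordinates are 1-based and inclusive, which is a common genome-browser convention but is not stated in this record. The function names `parse_location` and `span_length` are hypothetical.

```python
import re

def parse_location(text):
    """Split 'scaffold : start .. end (strand)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    scaffold, start, end, strand = m.groups()
    return scaffold, int(start), int(end), strand

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates.
    return end - start + 1

scaffold, start, end, strand = parse_location("Cp4.1LG10 : 689343 .. 691269 (-)")
print(scaffold, strand, span_length(start, end))
```

The "(-)" strand means the gene is encoded on the reverse strand of the scaffold, so the sequences listed for this feature read in the opposite direction to the scaffold coordinates.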
   



The following sequences are available for this feature:

Gene sequence (with intron)

TATAACAAGAAGACGATCTAACCCTGAAGCCTGGACCAAAACCCTCAACCTCATAGCTTCTTCTTCGTCTTCTTCTTCTTCTTATACGTCGTCGTTTTGGGTCTTCTTGTTCTTCTTCTGTCTGATTCCATGTCAGATGAAATGTTTAAAGATCTATTTTTGTCGAGCCCTTTTGGATACGGTGGGTTGGGTGATTCAGAAACAGGCTTGGATGAGTACGAGTCTCTAGCCAGAGCTTTTGAGCTGTCCTCGGATTACTCGAATGAAATTTCGGGGACTCCCATGAATTCCTCCGCTTCCTTCTCCTCCTCTGATGCTGGAGCTGACGAGGATGATTCCCTCAAAGACAAGGATAAGCAGATCAAAGACATGGATGACGGCGGAGAGAGCTCCAAGACCGCGTATGTCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNCACGTCACGAGCAAGAACGGATCTTACCCAATGACCGACTTAGTCTTTTACGAGAGCACACAGGAGCGAAATAGCTCGATCGATAAAGAGAGGAATGCTCACATTATATTTTGCTTTCTCTTTTTTTTATTTTATTGTCAAAAAATACAAAACATTAACGTTTATATATATTAAAAGTTTATAAGTAGTGTGGGCCCTACTTCATCAAGTTTTCATTTTTATTTTATTAAAAATAAATTATAACAAAAAAAAACGCGTAAAAATATTAGAATGGAAAAAAATAAATGCGGAATATGATTGGACGTTGAGATTGTACGGCACCAATGAGAACGAGGCAAAAATGACACGCATGTGAAGTTAGTGTAATCCCTCAGGACATCAATTAGCCTCGTCCTATTTAATTTTTTTAAAAAAAACGTAATAAACCAATCGAATATGCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAATTAAATTTAAAGGGTTTAGCTTTTTTTCATTTTTGGTGGGTGGGTGTTCTTACCAACAGGGCTAAATCAAAGAAGAAAGTAGAGAAGAAAGAAAGAGAGCCAAGAGTTGCTTTCATGACCAAGAGCGAGGTCGATCATCTTGAAGATGGGTATCGATGGAGAAAATATGGACAGAAGGCTGTCAAGAACAGCGCTTATCCTAGGTTTGATTTGATTGTTCTTCAAAACTTAAATTAAATTTTGGGTCTAAGGGTTAATTTTAAAATTGTCTGAAATTCATCATGAATTTATAAGTAAGGAATACATGTACGTTATTTGTAGGAGCTATTACTGATGAATTTCATGAGTAAGGAATACATGTATGTTATTTGTAGGAGCTATTACAGATGCACGACGCAGAAATGCGGAGTGAAGAAACGGGTGGAGAGATCGTATGAAGATCCATCCATAGTAATTACGACGTACGAAGGGCAACACAATCACCCAATTCCCGCGACGTTAAGGGGGAACCTGTCGGCAGTGAGCGGCGCGTTTCCGCCGTCCATGTTGGCACCAATGCCGGTGGTCGGTGGCGTGAGGTATCTTCCACAGCTGATGAACAACACTTCCGTCAACAACAACCAGCCCATCGGAGGTGGTGACACCGTTTATTCACAAAGCAGCAGCTTCAATTATCCTTATAACGGACGGCAACAAGACTACGGACTTCTGCAGGACATTTTTCCGACGGCGCCGCCGTTCTTGAACCGACAACCATGAATGAGGTTTTGATGACAATCTCATGTCGTGTACATATGGTCTGATTATCGTTGTTGGGATAAACATAAAAACTGGCAAGATCGATCTTGCATGCATTTTACATACAACAATGGCGCCGTTTAGATTTGGCGTTGTGATCAGAGATAGAAATTAAAGTTTTGACATTTGGCTCTGATGTAGGGTTTTCAATATTATTATTATTATTATTATTATTATCGTTATTATTTATGGCTCTAGTTGAGTTTCATTTTAT

mRNA sequence

TATAACAAGAAGACGATCTAACCCTGAAGCCTGGACCAAAACCCTCAACCTCATAGCTTCTTCTTCGTCTTCTTCTTCTTCTTATACGTCGTCGTTTTGGGTCTTCTTGTTCTTCTTCTGTCTGATTCCATGTCAGATGAAATGTTTAAAGATCTATTTTTGTCGAGCCCTTTTGGATACGGTGGGTTGGGTGATTCAGAAACAGGCTTGGATGAGTACGAGTCTCTAGCCAGAGCTTTTGAGCTGTCCTCGGATTACTCGAATGAAATTTCGGGGACTCCCATGAATTCCTCCGCTTCCTTCTCCTCCTCTGATGCTGGAGCTGACGAGGATGATTCCCTCAAAGACAAGGATAAGCAGATCAAAGACATGGATGACGGCGGAGAGAGCTCCAAGACCGCGGCTAAATCAAAGAAGAAAGTAGAGAAGAAAGAAAGAGAGCCAAGAGTTGCTTTCATGACCAAGAGCGAGGTCGATCATCTTGAAGATGGGTATCGATGGAGAAAATATGGACAGAAGGCTGTCAAGAACAGCGCTTATCCTAGGAGCTATTACAGATGCACGACGCAGAAATGCGGAGTGAAGAAACGGGTGGAGAGATCGTATGAAGATCCATCCATAGTAATTACGACGTACGAAGGGCAACACAATCACCCAATTCCCGCGACGTTAAGGGGGAACCTGTCGGCAGTGAGCGGCGCGTTTCCGCCGTCCATGTTGGCACCAATGCCGGTGGTCGGTGGCGTGAGGTATCTTCCACAGCTGATGAACAACACTTCCGTCAACAACAACCAGCCCATCGGAGGTGGTGACACCGTTTATTCACAAAGCAGCAGCTTCAATTATCCTTATAACGGACGGCAACAAGACTACGGACTTCTGCAGGACATTTTTCCGACGGCGCCGCCGTTCTTGAACCGACAACCATGAATGAGGTTTTGATGACAATCTCATGTCGTGTACATATGGTCTGATTATCGTTGTTGGGATAAACATAAAAACTGGCAAGATCGATCTTGCATGCATTTTACATACAACAATGGCGCCGTTTAGATTTGGCGTTGTGATCAGAGATAGAAATTAAAGTTTTGACATTTGGCTCTGATGTAGGGTTTTCAATATTATTATTATTATTATTATTATTATCGTTATTATTTATGGCTCTAGTTGAGTTTCATTTTAT

Coding sequence (CDS)

ATGTCAGATGAAATGTTTAAAGATCTATTTTTGTCGAGCCCTTTTGGATACGGTGGGTTGGGTGATTCAGAAACAGGCTTGGATGAGTACGAGTCTCTAGCCAGAGCTTTTGAGCTGTCCTCGGATTACTCGAATGAAATTTCGGGGACTCCCATGAATTCCTCCGCTTCCTTCTCCTCCTCTGATGCTGGAGCTGACGAGGATGATTCCCTCAAAGACAAGGATAAGCAGATCAAAGACATGGATGACGGCGGAGAGAGCTCCAAGACCGCGGCTAAATCAAAGAAGAAAGTAGAGAAGAAAGAAAGAGAGCCAAGAGTTGCTTTCATGACCAAGAGCGAGGTCGATCATCTTGAAGATGGGTATCGATGGAGAAAATATGGACAGAAGGCTGTCAAGAACAGCGCTTATCCTAGGAGCTATTACAGATGCACGACGCAGAAATGCGGAGTGAAGAAACGGGTGGAGAGATCGTATGAAGATCCATCCATAGTAATTACGACGTACGAAGGGCAACACAATCACCCAATTCCCGCGACGTTAAGGGGGAACCTGTCGGCAGTGAGCGGCGCGTTTCCGCCGTCCATGTTGGCACCAATGCCGGTGGTCGGTGGCGTGAGGTATCTTCCACAGCTGATGAACAACACTTCCGTCAACAACAACCAGCCCATCGGAGGTGGTGACACCGTTTATTCACAAAGCAGCAGCTTCAATTATCCTTATAACGGACGGCAACAAGACTACGGACTTCTGCAGGACATTTTTCCGACGGCGCCGCCGTTCTTGAACCGACAACCATGA

Protein sequence

MSDEMFKDLFLSSPFGYGGLGDSETGLDEYESLARAFELSSDYSNEISGTPMNSSASFSSSDAGADEDDSLKDKDKQIKDMDDGGESSKTAAKSKKKVEKKEREPRVAFMTKSEVDHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTYEGQHNHPIPATLRGNLSAVSGAFPPSMLAPMPVVGGVRYLPQLMNNTSVNNNQPIGGGDTVYSQSSSFNYPYNGRQQDYGLLQDIFPTAPPFLNRQP
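
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are hypothetical, and only the first and last 15 nucleotides of the CDS are used as input.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a DNA coding sequence codon by codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First and last 15 nucleotides of the CDS listed above.
print(translate("ATGTCAGATGAAATG"))  # MSDEM
print(translate("AACCGACAACCATGA"))  # NRQP*
```

The two fragments translate to the first and last residues of the listed protein, with the final TGA codon acting as the stop.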
BLAST of Cp4.1LG10g05720 vs. Swiss-Prot
Match: WRK71_ARATH (Probable WRKY transcription factor 71 OS=Arabidopsis thaliana GN=WRKY71 PE=2 SV=1)

HSP 1 Score: 188.3 bits (477), Expect = 1.0e-46
Identity = 126/249 (50.60%), Positives = 154/249 (61.85%), Query Frame = 1

Query: 29  EYESLARAFE----------LSSDYSNEISGTPMNSSASFSSSDAGADEDDSLKDKDKQI 88
           +Y SL + F+          +S   +N       NS    SSS+ G  ++++  DK  Q+
Sbjct: 32  DYNSLEKVFKFSPYSSPFQSVSPSVNNPYLNLTSNSPVVSSSSNEGEPKENT-NDKSDQM 91

Query: 89  KDMDDG----GESSKTAAKS-KKKVEKKEREPRVAFMTKSEVDHLEDGYRWRKYGQKAVK 148
           +D +      GESSK   K  KKK EKKERE RVAFMTKSE+DHLEDGYRWRKYGQKAVK
Sbjct: 92  EDNEGDLHGVGESSKQLTKQGKKKGEKKEREVRVAFMTKSEIDHLEDGYRWRKYGQKAVK 151

Query: 149 NSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTYEGQHNHPIPATLRGNLSA----VS 208
           NS YPRSYYRCTTQKC VKKRVERS++DPSIVITTYEG+HNHPIP+TLRG ++A    V 
Sbjct: 152 NSPYPRSYYRCTTQKCNVKKRVERSFQDPSIVITTYEGKHNHPIPSTLRGTVAAEHLLVH 211

Query: 209 GAFPPSMLAPMPVVGGVRYLPQLMNNTSVNNNQPIGGGDTVYSQ-SSSFNYPYNGRQQDY 258
                S+L   P      +   LM   S  N Q +G     +   +SS+N+  N    DY
Sbjct: 212 RGGGGSLLHSFP----RHHQDFLMMKHSPANYQSVGSLSYEHGHGTSSYNFNNNQPVVDY 271
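
The percentages in an HSP header are the match counts divided by the alignment length, gaps included. The sketch below recomputes them from a header line as a sanity check. The function name `hsp_percentages` is hypothetical, and the pattern also accepts the spelling "Postives", which some exports of these reports use.

```python
import re

STAT_RE = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def hsp_percentages(line):
    """Return {label: (matches, aligned_length, recomputed_percent)}."""
    stats = {}
    for label, matches, length, _reported in STAT_RE.findall(line):
        key = "Positives" if label.startswith("Pos") else label
        percent = round(100 * int(matches) / int(length), 2)
        stats[key] = (int(matches), int(length), percent)
    return stats

line = "Identity = 126/249 (50.60%), Positives = 154/249 (61.85%), Query Frame = 1"
print(hsp_percentages(line))
```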

BLAST of Cp4.1LG10g05720 vs. Swiss-Prot
Match: WRK28_ARATH (Probable WRKY transcription factor 28 OS=Arabidopsis thaliana GN=WRKY28 PE=2 SV=1)

HSP 1 Score: 180.6 bits (457), Expect = 2.2e-44
Identity = 121/276 (43.84%), Positives = 157/276 (56.88%), Query Frame = 1

Query: 6   FKDLFLSSPFGYGGLGDSETGLDEYESLARAFELSSDYSNEISGTPMNSSA--------- 65
           F D   SSP  Y  L     GL    S      +  + + +++   +N  A         
Sbjct: 49  FTDCLQSSPAAYESLLQKTFGLSPSSSEVFNSSIDQEPNRDVTNDVINGGACNETETRVS 108

Query: 66  --SFSSSDAGADEDDSLKDKDKQIKDMDDGGESSKTAAKSKKKVEKKEREPRVAFMTKSE 125
             + SSS+A    +DS K + K+ + + +  + SK   K+KK   KK+REPRV+FMTKSE
Sbjct: 109 PSNSSSSEADHPGEDSGKSRRKR-ELVGEEDQISKKVGKTKKTEVKKQREPRVSFMTKSE 168

Query: 126 VDHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTYEGQHN 185
           VDHLEDGYRWRKYGQKAVKNS YPRSYYRCTTQKC VKKRVERS++DP++VITTYEGQHN
Sbjct: 169 VDHLEDGYRWRKYGQKAVKNSPYPRSYYRCTTQKCNVKKRVERSFQDPTVVITTYEGQHN 228

Query: 186 HPIPATLRGNLSAVSGAFPPSMLAPMPVVGGVRYLPQLMNNTSVNNNQPIGGGDTVY--- 245
           HPIP  LRG+ SA +  F   ++ P      +       N  SV      G G + Y   
Sbjct: 229 HPIPTNLRGS-SAAAAMFSADLMTPRSFAHDMFRTAAYTNGGSVAAALDYGYGQSGYGSV 288

Query: 246 -SQSSSFNYPYNGRQQDYGLLQDIFPTAPPFLNRQP 267
            S  SS    + G   +Y LL++IFP+   F  ++P
Sbjct: 289 NSNPSSHQVYHQG--GEYELLREIFPSI--FFKQEP 318

BLAST of Cp4.1LG10g05720 vs. Swiss-Prot
Match: WRKY8_ARATH (Probable WRKY transcription factor 8 OS=Arabidopsis thaliana GN=WRKY8 PE=1 SV=1)

HSP 1 Score: 162.2 bits (409), Expect = 8.1e-39
Identity = 108/260 (41.54%), Positives = 148/260 (56.92%), Query Frame = 1

Query: 22  DSETGLDE---YESLARAFELSSDYSNEI-SGTPMNSSASFSSSDAGADEDDSLKDKDKQ 81
           D E GL     Y S  ++ E+  D    I S   +++S S S +D    ED     K ++
Sbjct: 83  DQENGLYNAYNYNSSQKSHEVVGDGCATIKSEVRVSASPSSSEADHHPGEDSGKIRKKRE 142

Query: 82  IKDMDDGGESSKTAAKSKKKVEKKEREPRVAFMTKSEVDHLEDGYRWRKYGQKAVKNSAY 141
           ++D  +  + S+   K+KKK E+K++EPRV+FMTK+EVDHLEDGYRWRKYGQKAVKNS Y
Sbjct: 143 VRDGGEDDQRSQKVVKTKKK-EEKKKEPRVSFMTKTEVDHLEDGYRWRKYGQKAVKNSPY 202

Query: 142 PRSYYRCTTQKCGVKKRVERSYEDPSIVITTYEGQHNHPIPATLRGNL--SAVSGAFPPS 201
           PRSYYRCTTQKC VKKRVERSY+DP++VITTYE QHNHPIP   R  +     +  + PS
Sbjct: 203 PRSYYRCTTQKCNVKKRVERSYQDPTVVITTYESQHNHPIPTNRRTAMFSGTTASDYNPS 262

Query: 202 MLAPMPVVGGVRYLPQLMNNTSVNNNQPIGGGDTVYSQSSSFNYPYNGRQQDYG------ 261
                           + ++  +N  +     D      +S N   +  QQ +G      
Sbjct: 263 S-------------SPIFSDLIINTPRSFSNDDLFRVPYASVNVNPSYHQQQHGFHQQES 322

Query: 262 ---LLQDIFPTAPPFLNRQP 267
              LL+++FP+   F  ++P
Sbjct: 323 EFELLKEMFPSV--FFKQEP 326

BLAST of Cp4.1LG10g05720 vs. Swiss-Prot
Match: WRK48_ARATH (Probable WRKY transcription factor 48 OS=Arabidopsis thaliana GN=WRKY48 PE=2 SV=1)

HSP 1 Score: 161.0 bits (406), Expect = 1.8e-38
Identity = 119/290 (41.03%), Positives = 157/290 (54.14%), Query Frame = 1

Query: 3   DEMFKDLFLSSPFGYGG--LGDSETGLDEYESLARAFELSSDYSNEISGTPMNSSASFSS 62
           D  F     SS F +    L ++      +  L      SS+  N    +P ++S S SS
Sbjct: 101 DHQFASSSNSSSFSFDAFPLPNNNNNTSFFTDLPLPQAESSEVVNTTPTSPNSTSVSSSS 160

Query: 63  SDAGADEDDSLKDKDKQIKDMDDGGES-----SKTAAKSKKKVEKKEREPRVAFMTKSEV 122
           ++A  D +     K+  +KD ++G +      +K   K+KKK +KK RE R AF+TKS++
Sbjct: 161 NEAANDNNSG---KEVTVKDQEEGDQQQEQKGTKPQLKAKKKNQKKAREARFAFLTKSDI 220

Query: 123 DHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTYEGQHNH 182
           D+L+DGYRWRKYGQKAVKNS YPRSYYRCTT  CGVKKRVERS +DPSIV+TTYEGQH H
Sbjct: 221 DNLDDGYRWRKYGQKAVKNSPYPRSYYRCTTVGCGVKKRVERSSDDPSIVMTTYEGQHTH 280

Query: 183 PIPATLRGNLSAVSG---------AFPPSMLAPMPVVGGVRYL------PQLM---NNTS 242
           P P T RG++  ++          A   S   P P     RYL      P  M   N+ S
Sbjct: 281 PFPMTPRGHIGMLTSPILDHGATTASSSSFSIPQP-----RYLLTQHHQPYNMYNNNSLS 340

Query: 243 VNNNQPIGGGDTVYSQSSSF-NYPYNGRQ---------QDYGLLQDIFPT 258
           + N +   G       SSSF  + Y+  Q         +D+GLLQDI P+
Sbjct: 341 MINRRSSDGTFVNPGPSSSFPGFGYDMSQASTSTSSSIRDHGLLQDILPS 382

BLAST of Cp4.1LG10g05720 vs. Swiss-Prot
Match: WRK23_ARATH (Probable WRKY transcription factor 23 OS=Arabidopsis thaliana GN=WRKY23 PE=2 SV=1)

HSP 1 Score: 154.8 bits (390), Expect = 1.3e-36
Identity = 102/227 (44.93%), Positives = 128/227 (56.39%), Query Frame = 1

Query: 50  TPMNSSASFSSSDAGADEDDSLKDKDKQIKDMDDGGESSKTAAKSKKKVEKKEREPRVAF 109
           TP +SS S +SS+A  +E    +D +++  +       +K   K+KK  +K++RE RVAF
Sbjct: 105 TPNSSSISSASSEALNEEKPKTEDNEEEGGEDQQEKSHTKKQLKAKKNNQKRQREARVAF 164

Query: 110 MTKSEVDHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTY 169
           MTKSEVDHLEDGYRWRKYGQKAVKNS +PRSYYRCTT  C VKKRVERS+ DPS V+TTY
Sbjct: 165 MTKSEVDHLEDGYRWRKYGQKAVKNSPFPRSYYRCTTASCNVKKRVERSFRDPSTVVTTY 224

Query: 170 EGQHNHPIPATLR----GNLSAVSGAFPP--SMLAPMPVVGGVRYLPQLMNNTSVNNNQ- 229
           EGQH H  P T R    G     SGA     +     P+ G     PQ       ++ Q 
Sbjct: 225 EGQHTHISPLTSRPISTGGFFGSSGAASSLGNGCFGFPIDGSTLISPQFQQLVQYHHQQQ 284

Query: 230 -----PIGGGDTVYSQSSSFNYPYNGR-------QQDYGLLQDIFPT 258
                   GG   Y  S +  Y  + R        +D GLLQD+ P+
Sbjct: 285 QQELMSCFGGVNEYLNSHANEYGDDNRVKKSRVLVKDNGLLQDVVPS 331

BLAST of Cp4.1LG10g05720 vs. TrEMBL
Match: A0A0A0KNS1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G522650 PE=4 SV=1)

HSP 1 Score: 348.6 bits (893), Expect = 6.8e-93
Identity = 202/277 (72.92%), Positives = 221/277 (79.78%), Query Frame = 1

Query: 1   MSDEMFKDLFLSSPFGYGGLGDSETGLDEYESLARAFELSSDYSN---EISGTP-MNSSA 60
           MSDEMFKDLF S             G+DEYES+ RAF ++SDYSN   EISGT  MNSS 
Sbjct: 1   MSDEMFKDLFYS-------------GMDEYESIVRAFGITSDYSNINNEISGTTAMNSSC 60

Query: 61  SFSSSDAGA-DEDDSLKDKDKQI-KDM--DDGGESSKTAA--KSKKKVEKKEREPRVAFM 120
           S SSSDAG  +EDDS+K+K+KQI KD+  D+GGESSK A   KSKKK EKKERE RVAFM
Sbjct: 61  SLSSSDAGGGEEDDSVKEKEKQISKDVVEDNGGESSKAAGSGKSKKKGEKKEREARVAFM 120

Query: 121 TKSEVDHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTYE 180
           TKSEVDHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTYE
Sbjct: 121 TKSEVDHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTYE 180

Query: 181 GQHNHPIPATLRGNLSAVSGAFPPSMLAPMPVVGGVRYLP-QLMNNTSVNNNQPIGGGDT 240
           GQHNH IPATLRGNLSA SG F PSML PMPVVGGV +LP +L++N    NNQ +GGG T
Sbjct: 181 GQHNHLIPATLRGNLSAASGTFSPSMLTPMPVVGGVGFLPAELLSN--AGNNQAVGGGAT 240

Query: 241 VYSQSSSFNYPYNGRQQDYGLLQDIFPTAPPFLNRQP 267
           VYS  ++F+Y YNGRQ +YGLLQDIFP    F NRQP
Sbjct: 241 VYSH-NNFDYTYNGRQPEYGLLQDIFPAPSSFFNRQP 261

BLAST of Cp4.1LG10g05720 vs. TrEMBL
Match: E7CEY1_CUCSA (WRKY protein OS=Cucumis sativus GN=WRKY46 PE=2 SV=1)

HSP 1 Score: 345.9 bits (886), Expect = 4.4e-92
Identity = 199/276 (72.10%), Positives = 219/276 (79.35%), Query Frame = 1

Query: 1   MSDEMFKDLFLSSPFGYGGLGDSETGLDEYESLARAFELSSDYSN---EISGTP-MNSSA 60
           MSDEMFKDLF S             G+DEYES+ RAF ++SDYSN   EISGT  MNSS 
Sbjct: 1   MSDEMFKDLFYS-------------GMDEYESIVRAFGITSDYSNINNEISGTTAMNSSC 60

Query: 61  SFSSSDAGA-DEDDSLKDKDKQI-KDM--DDGGESSKTAA--KSKKKVEKKEREPRVAFM 120
           S SSSDAG  +EDDS+K+K+KQI KD+  D+GGESSK A   KSKKK EKKERE  VAFM
Sbjct: 61  SLSSSDAGGGEEDDSVKEKEKQISKDVVEDNGGESSKAAGSGKSKKKGEKKEREAGVAFM 120

Query: 121 TKSEVDHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTYE 180
           TKSEVDHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTYE
Sbjct: 121 TKSEVDHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTYE 180

Query: 181 GQHNHPIPATLRGNLSAVSGAFPPSMLAPMPVVGGVRYLPQLMNNTSVNNNQPIGGGDTV 240
           GQHNH IPATLRGNLSA SG F PSML PMPVVGGV +LP  + +++  NNQ +GGG TV
Sbjct: 181 GQHNHLIPATLRGNLSAASGTFSPSMLTPMPVVGGVGFLPAEL-SSNAGNNQAVGGGATV 240

Query: 241 YSQSSSFNYPYNGRQQDYGLLQDIFPTAPPFLNRQP 267
           YS  ++F+Y YNGRQ +YGLLQDIFP    F NRQP
Sbjct: 241 YSH-NNFDYTYNGRQPEYGLLQDIFPAPSSFFNRQP 261

BLAST of Cp4.1LG10g05720 vs. TrEMBL
Match: A0A061GF56_THECC (WRKY DNA-binding protein 28 OS=Theobroma cacao GN=TCM_029543 PE=4 SV=1)

HSP 1 Score: 236.5 bits (602), Expect = 3.7e-59
Identity = 130/214 (60.75%), Positives = 157/214 (73.36%), Query Frame = 1

Query: 44  SNEISGTPMNSSASFSSSDAGADEDDSLKDKDKQIKDMDDGGESSKTAAKSKKKVEKKER 103
           ++E+  TP NSS S SSS+AG +ED     KD+Q K  +DGGESSK   K+KKK EKK+R
Sbjct: 89  TSEVMTTP-NSSVSSSSSEAGCEEDSDKSKKDRQPKGSEDGGESSKKGNKAKKKGEKKQR 148

Query: 104 EPRVAFMTKSEVDHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPS 163
           EPR AFMTKSEVDHLEDGYRWRKYGQKAVKNS YPRSYYRCTTQKC VKKRVERS++DPS
Sbjct: 149 EPRFAFMTKSEVDHLEDGYRWRKYGQKAVKNSPYPRSYYRCTTQKCTVKKRVERSFQDPS 208

Query: 164 IVITTYEGQHNHPIPATLRGNLSAVSGAFPPSMLAPMPVVGGVRYLPQLMNNTSVNNNQP 223
           +VITTYEGQHNHP+P TLRG   + +G FPPSML P P + G  +  +L      + N  
Sbjct: 209 VVITTYEGQHNHPLPTTLRG---SAAGLFPPSMLTPSP-LAGPSFPHELFMQMPHHMNNQ 268

Query: 224 IGGGDTVYSQSSSFNYPYNGRQQDYGLLQDIFPT 258
            G   +++++S S    Y+ +  DYGLLQDI P+
Sbjct: 269 AGSAGSMFAESFSPFQQYHHQVPDYGLLQDIVPS 297

BLAST of Cp4.1LG10g05720 vs. TrEMBL
Match: U5GX71_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s34520g PE=4 SV=1)

HSP 1 Score: 228.0 bits (580), Expect = 1.3e-56
Identity = 145/285 (50.88%), Positives = 175/285 (61.40%), Query Frame = 1

Query: 14  PFGYGGLGDSETGLDEYESLARAFELSSDYS------------------------NEISG 73
           P  Y  L +   G  +Y SLA+AF LS   S                        +++  
Sbjct: 42  PSSYMSLTECLHGSVDYNSLAKAFGLSPSSSEVFSSIEESSRPVEARDLDGGNSTDQVPA 101

Query: 74  TPMNSSASFSSSDAGADEDDSLKDKDKQIKDMDDGGESSKTAAKSKKKVEKKEREPRVAF 133
           TP NSS SFSSS+AG DED     K+ Q +  +DGGE+S    K+KKK EK+++EPR AF
Sbjct: 102 TP-NSSVSFSSSEAGGDEDSGKTKKETQPEKPEDGGENSDKKDKAKKKAEKRQKEPRFAF 161

Query: 134 MTKSEVDHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTY 193
           MTKSEVDHLEDGYRWRKYGQKAVKNS YPRSYYRCTTQKC VKKRVERS++DPSIVITTY
Sbjct: 162 MTKSEVDHLEDGYRWRKYGQKAVKNSPYPRSYYRCTTQKCTVKKRVERSFQDPSIVITTY 221

Query: 194 EGQHNHPIPATLRGNLSAVSGAFPPSMLAPMPVVGGVRYLPQLMNNTSVN--------NN 253
           EGQHNHPIP TLRG+ SA+   F  SMLAP P+  G    P   ++   N        NN
Sbjct: 222 EGQHNHPIPTTLRGSASAM---FSHSMLAPAPMASG----PSFPHHQGYNFVQIPDAMNN 281

Query: 254 QPIGGGDTVYSQSSSFNYPYNGRQQDYGLLQDIFPTAPPFLNRQP 267
           Q +G     Y Q+ + +     +  DYGLLQDI P+   FL ++P
Sbjct: 282 QNMG----AYPQNVNQHVHQQYQVPDYGLLQDIVPSI--FLRQEP 312

BLAST of Cp4.1LG10g05720 vs. TrEMBL
Match: A0A0A7EB12_GOSAI (WRKY29 OS=Gossypium aridum PE=2 SV=1)

HSP 1 Score: 224.6 bits (571), Expect = 1.5e-55
Identity = 145/280 (51.79%), Positives = 174/280 (62.14%), Query Frame = 1

Query: 6   FKDLFLSSPFGYGGL----GDSETGLDEYESL-----------ARAFELSSDYSNEISGT 65
           F D   S+   YG L    G S T  + + S+           A A EL  + + E++ T
Sbjct: 48  FTDCLHSTSMDYGSLEKAFGLSPTSSEVFSSVEGGNRTMMKQHAGADELGGN-TGEVTAT 107

Query: 66  PMNSSASFSSSDAGADEDDSLKDKDKQIKDMDDGGESSKTAAKSKKKVEKKEREPRVAFM 125
            +NSS S SSS+AG +ED     KD Q K  +DGGESSK   K+K K EKK+REPR AF+
Sbjct: 108 -LNSSVSSSSSEAGCEEDSDKSKKDGQPKGSEDGGESSKKGNKAKLKGEKKQREPRFAFV 167

Query: 126 TKSEVDHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTYE 185
           TKSEVD LEDGYRWRKYGQKAVKNS YPRSYYRCTTQKC VKKRVERS++DPS VITTYE
Sbjct: 168 TKSEVDQLEDGYRWRKYGQKAVKNSPYPRSYYRCTTQKCTVKKRVERSFQDPSTVITTYE 227

Query: 186 GQHNHPIPATLRGNLSAVSGAFPPSMLAPMPVVGGVRY----LPQLMNNTSVNNNQPIGG 245
           GQHNHP+P TLRG+    +G FPPSML P P +G   +    L Q+ N     NNQ   G
Sbjct: 228 GQHNHPLPTTLRGS---AAGLFPPSMLTPSP-LGRPSFPHELLMQMPNYHHQMNNQAPAG 287

Query: 246 GDTVYSQSSSFNYPYNGRQQDYGLLQDIFPTAPPFLNRQP 267
                + S    Y +  +  DYGLLQD+ P+   FL  +P
Sbjct: 288 SMFAENFSPFHQYVHQHQGPDYGLLQDMVPST--FLKHEP 319

BLAST of Cp4.1LG10g05720 vs. TAIR10
Match: AT1G29860.1 (AT1G29860.1 WRKY DNA-binding protein 71)

HSP 1 Score: 188.3 bits (477), Expect = 5.9e-48
Identity = 126/249 (50.60%), Positives = 154/249 (61.85%), Query Frame = 1

Query: 29  EYESLARAFE----------LSSDYSNEISGTPMNSSASFSSSDAGADEDDSLKDKDKQI 88
           +Y SL + F+          +S   +N       NS    SSS+ G  ++++  DK  Q+
Sbjct: 32  DYNSLEKVFKFSPYSSPFQSVSPSVNNPYLNLTSNSPVVSSSSNEGEPKENT-NDKSDQM 91

Query: 89  KDMDDG----GESSKTAAKS-KKKVEKKEREPRVAFMTKSEVDHLEDGYRWRKYGQKAVK 148
           +D +      GESSK   K  KKK EKKERE RVAFMTKSE+DHLEDGYRWRKYGQKAVK
Sbjct: 92  EDNEGDLHGVGESSKQLTKQGKKKGEKKEREVRVAFMTKSEIDHLEDGYRWRKYGQKAVK 151

Query: 149 NSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTYEGQHNHPIPATLRGNLSA----VS 208
           NS YPRSYYRCTTQKC VKKRVERS++DPSIVITTYEG+HNHPIP+TLRG ++A    V 
Sbjct: 152 NSPYPRSYYRCTTQKCNVKKRVERSFQDPSIVITTYEGKHNHPIPSTLRGTVAAEHLLVH 211

Query: 209 GAFPPSMLAPMPVVGGVRYLPQLMNNTSVNNNQPIGGGDTVYSQ-SSSFNYPYNGRQQDY 258
                S+L   P      +   LM   S  N Q +G     +   +SS+N+  N    DY
Sbjct: 212 RGGGGSLLHSFP----RHHQDFLMMKHSPANYQSVGSLSYEHGHGTSSYNFNNNQPVVDY 271

BLAST of Cp4.1LG10g05720 vs. TAIR10
Match: AT4G18170.1 (AT4G18170.1 WRKY DNA-binding protein 28)

HSP 1 Score: 180.6 bits (457), Expect = 1.2e-45
Identity = 121/276 (43.84%), Positives = 157/276 (56.88%), Query Frame = 1

Query: 6   FKDLFLSSPFGYGGLGDSETGLDEYESLARAFELSSDYSNEISGTPMNSSA--------- 65
           F D   SSP  Y  L     GL    S      +  + + +++   +N  A         
Sbjct: 49  FTDCLQSSPAAYESLLQKTFGLSPSSSEVFNSSIDQEPNRDVTNDVINGGACNETETRVS 108

Query: 66  --SFSSSDAGADEDDSLKDKDKQIKDMDDGGESSKTAAKSKKKVEKKEREPRVAFMTKSE 125
             + SSS+A    +DS K + K+ + + +  + SK   K+KK   KK+REPRV+FMTKSE
Sbjct: 109 PSNSSSSEADHPGEDSGKSRRKR-ELVGEEDQISKKVGKTKKTEVKKQREPRVSFMTKSE 168

Query: 126 VDHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTYEGQHN 185
           VDHLEDGYRWRKYGQKAVKNS YPRSYYRCTTQKC VKKRVERS++DP++VITTYEGQHN
Sbjct: 169 VDHLEDGYRWRKYGQKAVKNSPYPRSYYRCTTQKCNVKKRVERSFQDPTVVITTYEGQHN 228

Query: 186 HPIPATLRGNLSAVSGAFPPSMLAPMPVVGGVRYLPQLMNNTSVNNNQPIGGGDTVY--- 245
           HPIP  LRG+ SA +  F   ++ P      +       N  SV      G G + Y   
Sbjct: 229 HPIPTNLRGS-SAAAAMFSADLMTPRSFAHDMFRTAAYTNGGSVAAALDYGYGQSGYGSV 288

Query: 246 -SQSSSFNYPYNGRQQDYGLLQDIFPTAPPFLNRQP 267
            S  SS    + G   +Y LL++IFP+   F  ++P
Sbjct: 289 NSNPSSHQVYHQG--GEYELLREIFPSI--FFKQEP 318

BLAST of Cp4.1LG10g05720 vs. TAIR10
Match: AT5G46350.1 (AT5G46350.1 WRKY DNA-binding protein 8)

HSP 1 Score: 162.2 bits (409), Expect = 4.5e-40
Identity = 108/260 (41.54%), Positives = 148/260 (56.92%), Query Frame = 1

Query: 22  DSETGLDE---YESLARAFELSSDYSNEI-SGTPMNSSASFSSSDAGADEDDSLKDKDKQ 81
           D E GL     Y S  ++ E+  D    I S   +++S S S +D    ED     K ++
Sbjct: 83  DQENGLYNAYNYNSSQKSHEVVGDGCATIKSEVRVSASPSSSEADHHPGEDSGKIRKKRE 142

Query: 82  IKDMDDGGESSKTAAKSKKKVEKKEREPRVAFMTKSEVDHLEDGYRWRKYGQKAVKNSAY 141
           ++D  +  + S+   K+KKK E+K++EPRV+FMTK+EVDHLEDGYRWRKYGQKAVKNS Y
Sbjct: 143 VRDGGEDDQRSQKVVKTKKK-EEKKKEPRVSFMTKTEVDHLEDGYRWRKYGQKAVKNSPY 202

Query: 142 PRSYYRCTTQKCGVKKRVERSYEDPSIVITTYEGQHNHPIPATLRGNL--SAVSGAFPPS 201
           PRSYYRCTTQKC VKKRVERSY+DP++VITTYE QHNHPIP   R  +     +  + PS
Sbjct: 203 PRSYYRCTTQKCNVKKRVERSYQDPTVVITTYESQHNHPIPTNRRTAMFSGTTASDYNPS 262

Query: 202 MLAPMPVVGGVRYLPQLMNNTSVNNNQPIGGGDTVYSQSSSFNYPYNGRQQDYG------ 261
                           + ++  +N  +     D      +S N   +  QQ +G      
Sbjct: 263 S-------------SPIFSDLIINTPRSFSNDDLFRVPYASVNVNPSYHQQQHGFHQQES 322

Query: 262 ---LLQDIFPTAPPFLNRQP 267
              LL+++FP+   F  ++P
Sbjct: 323 EFELLKEMFPSV--FFKQEP 326

BLAST of Cp4.1LG10g05720 vs. TAIR10
Match: AT5G49520.1 (AT5G49520.1 WRKY DNA-binding protein 48)

HSP 1 Score: 161.0 bits (406), Expect = 1.0e-39
Identity = 119/290 (41.03%), Positives = 157/290 (54.14%), Query Frame = 1

Query: 3   DEMFKDLFLSSPFGYGG--LGDSETGLDEYESLARAFELSSDYSNEISGTPMNSSASFSS 62
           D  F     SS F +    L ++      +  L      SS+  N    +P ++S S SS
Sbjct: 101 DHQFASSSNSSSFSFDAFPLPNNNNNTSFFTDLPLPQAESSEVVNTTPTSPNSTSVSSSS 160

Query: 63  SDAGADEDDSLKDKDKQIKDMDDGGES-----SKTAAKSKKKVEKKEREPRVAFMTKSEV 122
           ++A  D +     K+  +KD ++G +      +K   K+KKK +KK RE R AF+TKS++
Sbjct: 161 NEAANDNNSG---KEVTVKDQEEGDQQQEQKGTKPQLKAKKKNQKKAREARFAFLTKSDI 220

Query: 123 DHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTYEGQHNH 182
           D+L+DGYRWRKYGQKAVKNS YPRSYYRCTT  CGVKKRVERS +DPSIV+TTYEGQH H
Sbjct: 221 DNLDDGYRWRKYGQKAVKNSPYPRSYYRCTTVGCGVKKRVERSSDDPSIVMTTYEGQHTH 280

Query: 183 PIPATLRGNLSAVSG---------AFPPSMLAPMPVVGGVRYL------PQLM---NNTS 242
           P P T RG++  ++          A   S   P P     RYL      P  M   N+ S
Sbjct: 281 PFPMTPRGHIGMLTSPILDHGATTASSSSFSIPQP-----RYLLTQHHQPYNMYNNNSLS 340

Query: 243 VNNNQPIGGGDTVYSQSSSF-NYPYNGRQ---------QDYGLLQDIFPT 258
           + N +   G       SSSF  + Y+  Q         +D+GLLQDI P+
Sbjct: 341 MINRRSSDGTFVNPGPSSSFPGFGYDMSQASTSTSSSIRDHGLLQDILPS 382

BLAST of Cp4.1LG10g05720 vs. TAIR10
Match: AT2G47260.1 (AT2G47260.1 WRKY DNA-binding protein 23)

HSP 1 Score: 154.8 bits (390), Expect = 7.2e-38
Identity = 102/227 (44.93%), Positives = 128/227 (56.39%), Query Frame = 1

Query: 50  TPMNSSASFSSSDAGADEDDSLKDKDKQIKDMDDGGESSKTAAKSKKKVEKKEREPRVAF 109
           TP +SS S +SS+A  +E    +D +++  +       +K   K+KK  +K++RE RVAF
Sbjct: 105 TPNSSSISSASSEALNEEKPKTEDNEEEGGEDQQEKSHTKKQLKAKKNNQKRQREARVAF 164

Query: 110 MTKSEVDHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTY 169
           MTKSEVDHLEDGYRWRKYGQKAVKNS +PRSYYRCTT  C VKKRVERS+ DPS V+TTY
Sbjct: 165 MTKSEVDHLEDGYRWRKYGQKAVKNSPFPRSYYRCTTASCNVKKRVERSFRDPSTVVTTY 224

Query: 170 EGQHNHPIPATLR----GNLSAVSGAFPP--SMLAPMPVVGGVRYLPQLMNNTSVNNNQ- 229
           EGQH H  P T R    G     SGA     +     P+ G     PQ       ++ Q 
Sbjct: 225 EGQHTHISPLTSRPISTGGFFGSSGAASSLGNGCFGFPIDGSTLISPQFQQLVQYHHQQQ 284

Query: 230 -----PIGGGDTVYSQSSSFNYPYNGR-------QQDYGLLQDIFPT 258
                   GG   Y  S +  Y  + R        +D GLLQD+ P+
Sbjct: 285 QQELMSCFGGVNEYLNSHANEYGDDNRVKKSRVLVKDNGLLQDVVPS 331

BLAST of Cp4.1LG10g05720 vs. NCBI nr
Match: gi|659078320|ref|XP_008439664.1| (PREDICTED: probable WRKY transcription factor 71 [Cucumis melo])

HSP 1 Score: 355.1 bits (910), Expect = 1.0e-94
Identity = 202/276 (73.19%), Positives = 222/276 (80.43%), Query Frame = 1

Query: 1   MSDEMFKDLFLSSPFGYGGLGDSETGLDEYESLARAFELSSDYSN---EISGTP-MNSSA 60
           MSDEMFKDLF      YGG+       DEYES+ RAF ++SDYSN   EISGT  MNSS 
Sbjct: 1   MSDEMFKDLF------YGGM-------DEYESIVRAFGITSDYSNNNNEISGTTAMNSSC 60

Query: 61  SFSSSDAGADEDD-SLKDKDKQI-KDM--DDGGESSKTAA--KSKKKVEKKEREPRVAFM 120
           SFSSSDAG  EDD S+K+K+K I KD+  D+GGE+SK A   KSKKK EK+ERE RVAFM
Sbjct: 61  SFSSSDAGGGEDDDSVKEKEKHISKDVVEDNGGENSKAAGSGKSKKKGEKREREARVAFM 120

Query: 121 TKSEVDHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTYE 180
           TKSEVDHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTYE
Sbjct: 121 TKSEVDHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTYE 180

Query: 181 GQHNHPIPATLRGNLSAVSGAFPPSMLAPMPVVGGVRYLPQLMNNTSVNNNQPIGGGDTV 240
           GQHNHPIPATLRGNLSA SG FPPSML PMPVVGGV +LP  + + + +NNQ +GGG TV
Sbjct: 181 GQHNHPIPATLRGNLSAASGTFPPSMLTPMPVVGGVGFLPAELLSNASSNNQAVGGGATV 240

Query: 241 YSQSSSFNYPYNGRQQDYGLLQDIFPTAPPFLNRQP 267
           YS  +SF+Y YNGRQ +YGLLQDIFP    F NRQP
Sbjct: 241 YSH-NSFDYTYNGRQPEYGLLQDIFPAPSSFFNRQP 262

BLAST of Cp4.1LG10g05720 vs. NCBI nr
Match: gi|700194167|gb|KGN49371.1| (hypothetical protein Csa_6G522650 [Cucumis sativus])

HSP 1 Score: 348.6 bits (893), Expect = 9.7e-93
Identity = 202/277 (72.92%), Positives = 221/277 (79.78%), Query Frame = 1

Query: 1   MSDEMFKDLFLSSPFGYGGLGDSETGLDEYESLARAFELSSDYSN---EISGTP-MNSSA 60
           MSDEMFKDLF S             G+DEYES+ RAF ++SDYSN   EISGT  MNSS 
Sbjct: 1   MSDEMFKDLFYS-------------GMDEYESIVRAFGITSDYSNINNEISGTTAMNSSC 60

Query: 61  SFSSSDAGA-DEDDSLKDKDKQI-KDM--DDGGESSKTAA--KSKKKVEKKEREPRVAFM 120
           S SSSDAG  +EDDS+K+K+KQI KD+  D+GGESSK A   KSKKK EKKERE RVAFM
Sbjct: 61  SLSSSDAGGGEEDDSVKEKEKQISKDVVEDNGGESSKAAGSGKSKKKGEKKEREARVAFM 120

Query: 121 TKSEVDHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTYE 180
           TKSEVDHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTYE
Sbjct: 121 TKSEVDHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTYE 180

Query: 181 GQHNHPIPATLRGNLSAVSGAFPPSMLAPMPVVGGVRYLP-QLMNNTSVNNNQPIGGGDT 240
           GQHNH IPATLRGNLSA SG F PSML PMPVVGGV +LP +L++N    NNQ +GGG T
Sbjct: 181 GQHNHLIPATLRGNLSAASGTFSPSMLTPMPVVGGVGFLPAELLSN--AGNNQAVGGGAT 240

Query: 241 VYSQSSSFNYPYNGRQQDYGLLQDIFPTAPPFLNRQP 267
           VYS  ++F+Y YNGRQ +YGLLQDIFP    F NRQP
Sbjct: 241 VYSH-NNFDYTYNGRQPEYGLLQDIFPAPSSFFNRQP 261

BLAST of Cp4.1LG10g05720 vs. NCBI nr
Match: gi|789500073|ref|NP_001292668.1| (probable WRKY transcription factor 71 [Cucumis sativus])

HSP 1 Score: 345.9 bits (886), Expect = 6.3e-92
Identity = 199/276 (72.10%), Positives = 219/276 (79.35%), Query Frame = 1

Query: 1   MSDEMFKDLFLSSPFGYGGLGDSETGLDEYESLARAFELSSDYSN---EISGTP-MNSSA 60
           MSDEMFKDLF S             G+DEYES+ RAF ++SDYSN   EISGT  MNSS 
Sbjct: 1   MSDEMFKDLFYS-------------GMDEYESIVRAFGITSDYSNINNEISGTTAMNSSC 60

Query: 61  SFSSSDAGA-DEDDSLKDKDKQI-KDM--DDGGESSKTAA--KSKKKVEKKEREPRVAFM 120
           S SSSDAG  +EDDS+K+K+KQI KD+  D+GGESSK A   KSKKK EKKERE  VAFM
Sbjct: 61  SLSSSDAGGGEEDDSVKEKEKQISKDVVEDNGGESSKAAGSGKSKKKGEKKEREAGVAFM 120

Query: 121 TKSEVDHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTYE 180
           TKSEVDHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTYE
Sbjct: 121 TKSEVDHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTYE 180

Query: 181 GQHNHPIPATLRGNLSAVSGAFPPSMLAPMPVVGGVRYLPQLMNNTSVNNNQPIGGGDTV 240
           GQHNH IPATLRGNLSA SG F PSML PMPVVGGV +LP  + +++  NNQ +GGG TV
Sbjct: 181 GQHNHLIPATLRGNLSAASGTFSPSMLTPMPVVGGVGFLPAEL-SSNAGNNQAVGGGATV 240

Query: 241 YSQSSSFNYPYNGRQQDYGLLQDIFPTAPPFLNRQP 267
           YS  ++F+Y YNGRQ +YGLLQDIFP    F NRQP
Sbjct: 241 YSH-NNFDYTYNGRQPEYGLLQDIFPAPSSFFNRQP 261

BLAST of Cp4.1LG10g05720 vs. NCBI nr
Match: gi|590622865|ref|XP_007025165.1| (WRKY DNA-binding protein 28 [Theobroma cacao])

HSP 1 Score: 236.5 bits (602), Expect = 5.4e-59
Identity = 130/214 (60.75%), Positives = 157/214 (73.36%), Query Frame = 1

Query: 44  SNEISGTPMNSSASFSSSDAGADEDDSLKDKDKQIKDMDDGGESSKTAAKSKKKVEKKER 103
           ++E+  TP NSS S SSS+AG +ED     KD+Q K  +DGGESSK   K+KKK EKK+R
Sbjct: 89  TSEVMTTP-NSSVSSSSSEAGCEEDSDKSKKDRQPKGSEDGGESSKKGNKAKKKGEKKQR 148

Query: 104 EPRVAFMTKSEVDHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPS 163
           EPR AFMTKSEVDHLEDGYRWRKYGQKAVKNS YPRSYYRCTTQKC VKKRVERS++DPS
Sbjct: 149 EPRFAFMTKSEVDHLEDGYRWRKYGQKAVKNSPYPRSYYRCTTQKCTVKKRVERSFQDPS 208

Query: 164 IVITTYEGQHNHPIPATLRGNLSAVSGAFPPSMLAPMPVVGGVRYLPQLMNNTSVNNNQP 223
           +VITTYEGQHNHP+P TLRG   + +G FPPSML P P + G  +  +L      + N  
Sbjct: 209 VVITTYEGQHNHPLPTTLRG---SAAGLFPPSMLTPSP-LAGPSFPHELFMQMPHHMNNQ 268

Query: 224 IGGGDTVYSQSSSFNYPYNGRQQDYGLLQDIFPT 258
            G   +++++S S    Y+ +  DYGLLQDI P+
Sbjct: 269 AGSAGSMFAESFSPFQQYHHQVPDYGLLQDIVPS 297

BLAST of Cp4.1LG10g05720 vs. NCBI nr
Match: gi|743927645|ref|XP_011008002.1| (PREDICTED: probable WRKY transcription factor 28 [Populus euphratica])

HSP 1 Score: 228.8 bits (582), Expect = 1.1e-56
Identity = 144/281 (51.25%), Positives = 173/281 (61.57%), Query Frame = 1

Query: 14  PFGYGGLGDSETGLDEYESLARAFELSSDYS------------------------NEISG 73
           P  Y  L +   G  +Y SLA+AF LS   S                        +++  
Sbjct: 42  PSSYMSLTECLHGSVDYNSLAKAFGLSPSSSEVFSSIEENSKPVEARDLDGGNSTDQVPA 101

Query: 74  TPMNSSASFSSSDAGADEDDSLKDKDKQIKDMDDGGESSKTAAKSKKKVEKKEREPRVAF 133
           TP NSS SFSSS+AG DED     K+ Q +  +DGGE+S    K+KKK EK+++EPR AF
Sbjct: 102 TP-NSSVSFSSSEAGCDEDSGKTKKETQTEKPEDGGENSDKKDKAKKKAEKRQKEPRFAF 161

Query: 134 MTKSEVDHLEDGYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTY 193
           MTKSEVDHLEDGYRWRKYGQKAVKNS YPRSYYRCTTQKC VKKRVERS++DPSIVITTY
Sbjct: 162 MTKSEVDHLEDGYRWRKYGQKAVKNSPYPRSYYRCTTQKCTVKKRVERSFQDPSIVITTY 221

Query: 194 EGQHNHPIPATLRGNLSAVSGAFPPSMLAPMPVVGGVRYLPQLMNN----TSVNNNQPIG 253
           EGQHNHPIP TLRG+ SA+   F  SMLAP P+  G  +      N     +  NNQ +G
Sbjct: 222 EGQHNHPIPTTLRGSASAM---FSHSMLAPAPMASGPSFPHHQGYNFVQIPAAMNNQNMG 281

Query: 254 GGDTVYSQSSSFNYPYNGRQQDYGLLQDIFPTAPPFLNRQP 267
                Y Q+ + +     +  DYGLLQDI P+   FL  +P
Sbjct: 282 ----AYPQNVNQHVHQQYQVPDYGLLQDIVPSI--FLGHEP 312

The following BLAST results are available for this feature:

vs. Swiss-Prot
Match Name    E-value   Identity (%)  Description
WRK71_ARATH   1.0e-46   50.60         Probable WRKY transcription factor 71 OS=Arabidopsis thaliana GN=WRKY71 PE=2 SV=...
WRK28_ARATH   2.2e-44   43.84         Probable WRKY transcription factor 28 OS=Arabidopsis thaliana GN=WRKY28 PE=2 SV=...
WRKY8_ARATH   8.1e-39   41.54         Probable WRKY transcription factor 8 OS=Arabidopsis thaliana GN=WRKY8 PE=1 SV=1
WRK48_ARATH   1.8e-38   41.03         Probable WRKY transcription factor 48 OS=Arabidopsis thaliana GN=WRKY48 PE=2 SV=...
WRK23_ARATH   1.3e-36   44.93         Probable WRKY transcription factor 23 OS=Arabidopsis thaliana GN=WRKY23 PE=2 SV=...

vs. TrEMBL
Match Name         E-value   Identity (%)  Description
A0A0A0KNS1_CUCSA   6.8e-93   72.92         Uncharacterized protein OS=Cucumis sativus GN=Csa_6G522650 PE=4 SV=1
E7CEY1_CUCSA       4.4e-92   72.10         WRKY protein OS=Cucumis sativus GN=WRKY46 PE=2 SV=1
A0A061GF56_THECC   3.7e-59   60.75         WRKY DNA-binding protein 28 OS=Theobroma cacao GN=TCM_029543 PE=4 SV=1
U5GX71_POPTR       1.3e-56   50.88         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s34520g PE=4 SV=1
A0A0A7EB12_GOSAI   1.5e-55   51.79         WRKY29 OS=Gossypium aridum PE=2 SV=1

vs. TAIR10
Match Name    E-value   Identity (%)  Description
AT1G29860.1   5.9e-48   50.60         WRKY DNA-binding protein 71
AT4G18170.1   1.2e-45   43.84         WRKY DNA-binding protein 28
AT5G46350.1   4.5e-40   41.54         WRKY DNA-binding protein 8
AT5G49520.1   1.0e-39   41.03         WRKY DNA-binding protein 48
AT2G47260.1   7.2e-38   44.93         WRKY DNA-binding protein 23

vs. NCBI nr
Match Name                         E-value   Identity (%)  Description
gi|659078320|ref|XP_008439664.1|   1.0e-94   73.19         PREDICTED: probable WRKY transcription factor 71 [Cucumis melo]
gi|700194167|gb|KGN49371.1|        9.7e-93   72.92         hypothetical protein Csa_6G522650 [Cucumis sativus]
gi|789500073|ref|NP_001292668.1|   6.3e-92   72.10         probable WRKY transcription factor 71 [Cucumis sativus]
gi|590622865|ref|XP_007025165.1|   5.4e-59   60.75         WRKY DNA-binding protein 28 [Theobroma cacao]
gi|743927645|ref|XP_011008002.1|   1.1e-56   51.25         PREDICTED: probable WRKY transcription factor 28 [Populus euphratica]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0043565  sequence-specific DNA binding
GO:0003700  transcription factor activity, sequence-specific DNA binding
Vocabulary: Biological Process
Term        Definition
GO:0006355  regulation of transcription, DNA-templated
Vocabulary: INTERPRO
Term        Definition
IPR003657   WRKY_dom
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006355      regulation of transcription, DNA-templated
cellular_component  GO:0005667      transcription factor complex
cellular_component  GO:0005634      nucleus
molecular_function  GO:0043565      sequence-specific DNA binding
molecular_function  GO:0003700      transcription factor activity, sequence-specific DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG10g05720.1  Cp4.1LG10g05720.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description   Source   Source Term       Source Description                    Alignment
IPR003657  WRKY domain       GENE3D   G3DSA:2.20.25.80  -                                     coord: 103..178, score: 4.4
IPR003657  WRKY domain       PFAM     PF03106           WRKY                                  coord: 119..176, score: 2.2
IPR003657  WRKY domain       SMART    SM00774           WRKY_cls                              coord: 118..177, score: 1.2
IPR003657  WRKY domain       PROFILE  PS50811           WRKY                                  coord: 113..178, score: 31
IPR003657  WRKY domain       unknown  SSF118290         WRKY DNA-binding domain               coord: 110..178, score: 6.8
None       No IPR available  PANTHER  PTHR31221         FAMILY NOT NAMED                      coord: 26..256, score: 2.2
None       No IPR available  PANTHER  PTHR31221:SF37    WRKY TRANSCRIPTION FACTOR 71-RELATED  coord: 26..256, score: 2.2
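
The domain coordinates above can be mapped back onto the protein sequence. The sketch below slices out the PROFILE PS50811 hit (113..178) and checks that it contains WRKYGQK, the conserved heptapeptide that gives the WRKY family its name. It assumes the coordinates are 1-based and inclusive, which this record does not state. The helper name `subsequence` is hypothetical.

```python
# Protein sequence of Cp4.1LG10g05720 as listed in this record.
PROTEIN = (
    "MSDEMFKDLFLSSPFGYGGLGDSETGLDEYESLARAFELSSDYSNEISGTPMNSSASFSS"
    "SDAGADEDDSLKDKDKQIKDMDDGGESSKTAAKSKKKVEKKEREPRVAFMTKSEVDHLED"
    "GYRWRKYGQKAVKNSAYPRSYYRCTTQKCGVKKRVERSYEDPSIVITTYEGQHNHPIPAT"
    "LRGNLSAVSGAFPPSMLAPMPVVGGVRYLPQLMNNTSVNNNQPIGGGDTVYSQSSSFNYP"
    "YNGRQQDYGLLQDIFPTAPPFLNRQP"
)

def subsequence(seq, start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return seq[start - 1:end]

domain = subsequence(PROTEIN, 113, 178)  # PROFILE PS50811 coordinates
print(domain)
print("WRKYGQK" in domain)
```

The same helper can be reused with the other coordinate pairs in the table to compare how each signature database delimits the domain.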

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG10g05720  Cp4.1LG16g07300  Cucurbita pepo (Zucchini)  cpecpeB067
Cp4.1LG10g05720  Cp4.1LG19g06270  Cucurbita pepo (Zucchini)  cpecpeB083