BLAST of Cp4.1LG01g01970 vs. Swiss-Prot
Match:
WRKY9_ARATH (Probable WRKY transcription factor 9 OS=Arabidopsis thaliana GN=WRKY9 PE=2 SV=1)
HSP 1 Score: 218.0 bits (554), Expect = 2.2e-55
Identity = 169/379 (44.59%), Postives = 223/379 (58.84%), Query Frame = 1
Query: 1 MEIDLSLKIDHHKQEPNQEHQEQD-EHEEHEEHEAYRVREKKGVDDTEIHVAASTLKVFL 60
M IDLSLK++ +++ E + E++E EEH+A +++ V + E +S+L +
Sbjct: 27 MGIDLSLKLEAEEKKKEIEGSKHSRENKEDEEHDASGDEDEQMVKEDEDD--SSSLGLRT 86
Query: 61 PQHNVNVGEISELQMEMNRMKEENKMLRKEVEQTMKDYYDLEMKIAIIQQNNLQKKDSHN 120
+ E+ +LQ++M +KEEN LRK VEQT++DY LEMK +I + +K D
Sbjct: 87 REEENEREELLQLQIQMESVKEENTRLRKLVEQTLEDYRHLEMKFPVIDKT--KKMDLEM 146
Query: 121 FLPSHENENKRVEEPNRELELGEMAKKRRV-RSPSKDNEMRESELGLSLGLHTNNDLEED 180
FL + R +++ A+KR RSPS E E+GLSL L
Sbjct: 147 FLGV---------QGKRCVDITSKARKRGAERSPSM-----EREIGLSLSL--------- 206
Query: 181 NDHKDQEEETREK-SKEHVTSNMKAMQQSKPQR-PELQGMAPPHNRKARVSVRARCEAAT 240
+ K ++EE++E H N ++ + P+ QG NRKARVSVRARCE AT
Sbjct: 207 -EKKQKQEESKEAVQSHHQRYNSSSLDMNMPRIISSSQG-----NRKARVSVRARCETAT 266
Query: 241 MNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQVQRCLEDMSILITTYEGTHNHP 300
MNDGCQWRKYGQK AKGNPCPRAYYRCTVAPGCPVRKQVQRCLEDMSILITTYEGTHNHP
Sbjct: 267 MNDGCQWRKYGQKTAKGNPCPRAYYRCTVAPGCPVRKQVQRCLEDMSILITTYEGTHNHP 326
Query: 301 LPVGATAMASTASAAASFTLLDSTNLPLPNPQNPNNILNSS--SYSANPNHPSAGLLLNL 360
LPVGATAMASTAS + L S NL P+ ++SS +Y N ++ + +
Sbjct: 327 LPVGATAMASTASTSPFLLLDSSDNLSHPSYYQTPQAIDSSLITYPQNSSYNNR----TI 368
Query: 361 TANNFYAPMATASTSAAHN 374
+ NF P S++ N
Sbjct: 387 RSLNFDGPSRGDHVSSSQN 368
BLAST of Cp4.1LG01g01970 vs. Swiss-Prot
Match:
WRK31_ARATH (Probable WRKY transcription factor 31 OS=Arabidopsis thaliana GN=WRKY31 PE=2 SV=1)
HSP 1 Score: 187.2 bits (474), Expect = 4.2e-46
Identity = 154/425 (36.24%), Postives = 215/425 (50.59%), Query Frame = 1
Query: 68 EISELQMEMNRMKEENKMLRKEVEQTMKDYYDLEMKIAIIQQNNLQKKDSHNFLPSHEN- 127
E ++LQ E+ +MK EN+ LR + Q ++ L+M++ + + Q+ S + L + E+
Sbjct: 110 ENAQLQEELKKMKIENQRLRDMLSQATTNFNALQMQLVAVMRQQEQRNSSQDHLLAQESK 169
Query: 128 -ENKRVEE-----PNRELELGEMAKKRRVRSPSKDNEMRESELGLSLGLHTNNDLEEDND 187
E ++ +E P + ++LG + + E G L +++ E+
Sbjct: 170 AEGRKRQELQIMVPRQFMDLGPSSGAAEHGAEVSSEERTTVRSGSPPSLLESSNPRENGK 229
Query: 188 HKDQEEETREKSKEHVTSNMKAMQQSKPQRPELQG----------MAPPHNRKARVSVRA 247
EE+ E+S+ + N + + P G A RKARVSVRA
Sbjct: 230 RLLGREESSEESESNAWGNPNKVPKHNPSSSNSNGNRNGNVIDQSAAEATMRKARVSVRA 289
Query: 248 RCEAATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQVQRCLEDMSILITTYE 307
R EAA ++DGCQWRKYGQK+AKGNPCPRAYYRCT+A GCPVRKQVQRC ED SILITTYE
Sbjct: 290 RSEAAMISDGCQWRKYGQKMAKGNPCPRAYYRCTMAGGCPVRKQVQRCAEDRSILITTYE 349
Query: 308 GTHNHPLPVGATAMASTASAAASFTLLDSTNLPLPNPQNPNNIL---------NSSSYSA 367
G HNHPLP ATAMAST +AAAS LL + NP N+L + ++ SA
Sbjct: 350 GNHNHPLPPAATAMASTTTAAASM-LLSGSMSSQDGLMNPTNLLARAILPCSSSMATISA 409
Query: 368 NPNHPSAGLLL---------NLTANNFYAPMA-------TASTSAAHNSYYQNNFQANFF 427
+ P+ L L N+T NN A + Y N Q+ F
Sbjct: 410 SAPFPTITLDLTNSPNGNNPNMTTNNPLMQFAQRPGFNPAVLPQVVGQAMYNNQQQSKFS 469
Query: 428 SRPLDGRTWKSAAEENKQPLVGESVSAIASDPKFRVAVAEAISSLIN---KDGNLTAPNS 448
L + + AA + V + +AIASDP F A+A AI+S++N N T N+
Sbjct: 470 GLQLPAQPLQIAATSSVAESVSAASAAIASDPNFAAALAAAITSIMNGSSHQNNNTNNNN 529
BLAST of Cp4.1LG01g01970 vs. Swiss-Prot
Match:
WRK72_ARATH (Probable WRKY transcription factor 72 OS=Arabidopsis thaliana GN=WRKY72 PE=2 SV=1)
HSP 1 Score: 176.8 bits (447), Expect = 5.7e-43
Identity = 164/484 (33.88%), Postives = 233/484 (48.14%), Query Frame = 1
Query: 62 HNVNVG-----EISELQMEMNRMKEENKMLRKEVEQTMKDYYDLEMK-IAIIQQ--NNLQ 121
H N G E+ + EM+ +KEEN+ L+ +E+ DY L+++ IIQQ +N
Sbjct: 24 HEANKGDGDHQELESAKAEMSEVKEENEKLKGMLERIESDYKSLKLRFFDIIQQEPSNTA 83
Query: 122 KKDSHNFLPSHENENKRVEEPNRELELGEMAKKRRVRSPSKDNEMRE------------- 181
K + N + + + ++E EL ++ RR SPS +E
Sbjct: 84 TK-NQNMVDHPKPTTTDLSSFDQERELVSLSLGRRSSSPSDSVPKKEEKTDAISAEVNAD 143
Query: 182 ---SELGLSLGLHTNNDLEEDNDHKDQEEETREKSKEHVTSNMKAMQQSKPQRPELQGMA 241
++ GL+LG++ N E + E S+E ++S P P G A
Sbjct: 144 EELTKAGLTLGINNGNG-GEPKEGLSMENRANSGSEEAWAPGKVTGKRSSP-APASGGDA 203
Query: 242 ------PPHNRKARVSVRARCEAATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPV 301
H ++ARV VRARC+ TMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPV
Sbjct: 204 DGEAGQQNHVKRARVCVRARCDTPTMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPV 263
Query: 302 RKQVQRCLEDMSILITTYEGTHNHPLPVGATAMASTASAAASFTLLDSTNLPLPNPQNPN 361
RKQVQRC +DMSILITTYEGTH+H LP+ AT MAST SAAAS L S++ P N
Sbjct: 264 RKQVQRCADDMSILITTYEGTHSHSLPLSATTMASTTSAAASMLLSGSSSSPAAE-MIGN 323
Query: 362 NILNSSSYSAN-----------PNHPSAGLLLNLTANNFYAPMATASTSAAHNSYYQNNF 421
N+ ++S ++ N P HP+ + L+LT AP ++S+S++ S N F
Sbjct: 324 NLYDNSRFNNNNKSFYSPTLHSPLHPT--VTLDLT-----APQHSSSSSSSLLSLNFNKF 383
Query: 422 QANFFSRPLDGRTWKSAAEENKQP-------LVGESVSAI-------------------- 458
+F P + S + + P + G S+
Sbjct: 384 SNSFQRFPSTSLNFSSTSSTSSNPSTLNLPAIWGNGYSSYTPYPYNNVQFGTSNLGKTVQ 443
BLAST of Cp4.1LG01g01970 vs. Swiss-Prot
Match:
WRK42_ARATH (WRKY transcription factor 42 OS=Arabidopsis thaliana GN=WRKY42 PE=2 SV=1)
HSP 1 Score: 159.8 bits (403), Expect = 7.2e-38
Identity = 159/465 (34.19%), Postives = 239/465 (51.40%), Query Frame = 1
Query: 11 HHKQEPNQEHQEQDEHEEHEEHEAYRVREKKGVDDTEIHVAASTLKVFLPQHNVNVGEIS 70
H K+E ++ D +H + G D++ + L V + + E +
Sbjct: 59 HVKRENSRVDDHDDRSTDHINIGLNLLTANTGSDESMVD---DGLSVDMEEKRTKC-ENA 118
Query: 71 ELQMEMNRMKEENKMLRKEVEQTMKDYYDLEMKIAIIQQNNLQKKDSHNFLPSHENENKR 130
+L+ E+ + E+N+ L++ + QT ++ L+M++ + + Q++D H+ + N+N +
Sbjct: 119 QLREELKKASEDNQRLKQMLSQTTNNFNSLQMQLVAVMR---QQEDHHHLATTENNDNVK 178
Query: 131 VEE------PNRELELG----EMAKKRR--VRSPSKDNEMRES---ELGLSLGLHTNNDL 190
P + ++LG E++ + R VRS S + + +S + G + + +
Sbjct: 179 NRHEVPEMVPRQFIDLGPHSDEVSSEERTTVRSGSPPSLLEKSSSRQNGKRVLVREESPE 238
Query: 191 EEDNDHKDQEEETREKSKEHVTSNMKAMQQSKPQRPEL--QGMAPPHNRKARVSVRARCE 250
E N ++ + K H +S++ S+ ++ Q A RKARVSVRAR E
Sbjct: 239 TESNGWRNPNKVP----KHHASSSICGGNGSENASSKVIEQAAAEATMRKARVSVRARSE 298
Query: 251 AATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQVQRCLEDMSILITTYEGTH 310
A ++DGCQWRKYGQK+AKGNPCPRAYYRCT+A GCPVRKQVQRC ED +ILITTYEG H
Sbjct: 299 APMLSDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNH 358
Query: 311 NHPLPVGATAMASTASAAASFTLLDSTNLPLPNPQNPNNIL-------NSSSYSANPNHP 370
NHPLP A MAST +AAAS L ST NP N+L +SS + + + P
Sbjct: 359 NHPLPPAAMNMASTTTAAASMLLSGSTMSNQDGLMNPTNLLARTILPCSSSMATISASAP 418
Query: 371 SAGLLLNLT--------ANNFYAPMATASTSAAHNSYYQNNF--QANFFSRPLDGRTWKS 430
+ L+LT NN + S N + QA ++++ ++ S
Sbjct: 419 FPTITLDLTESPNGNNPTNNPLMQFSQRSGLVELNQSVLPHMMGQALYYNQ----QSKFS 478
Query: 431 AAEENKQPL-VGESVS----AIASDPKFRVAVAEAISSLINKDGN 437
QPL GESVS AIAS+P F A+A AI+S+IN N
Sbjct: 479 GLHMPSQPLNAGESVSAATAAIASNPNFAAALAAAITSIINGSNN 508
BLAST of Cp4.1LG01g01970 vs. Swiss-Prot
Match:
WRK47_ARATH (Probable WRKY transcription factor 47 OS=Arabidopsis thaliana GN=WRKY47 PE=2 SV=2)
HSP 1 Score: 159.1 bits (401), Expect = 1.2e-37
Identity = 124/329 (37.69%), Postives = 172/329 (52.28%), Query Frame = 1
Query: 173 NDLEEDNDHKDQEEETREKSKEHVTSNMKAMQQSKPQRPEL--------QGMAPPHN--- 232
N +D +H+ + +S + V + M + P+ P + + PH+
Sbjct: 164 NRRPKDMNHETPATTLKRRSPDDVDG--RDMHRGSPKTPRIDQNKSTNHEEQQNPHDQLP 223
Query: 233 -RKARVSVRARCEAATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQVQRCLE 292
RKARVSVRAR +A T+NDGCQWRKYGQK+AKGNPCPRAYYRCT+A GCPVRKQVQRC E
Sbjct: 224 YRKARVSVRARSDATTVNDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAE 283
Query: 293 DMSILITTYEGTHNHPLPVGATAMASTASAAASFTLLDSTNLPLPNPQNPNNILNSSSYS 352
D +IL TTYEG HNHPLP ATAMA+T SAAA+ L S++ L + + +SSS+
Sbjct: 284 DTTILTTTYEGNHNHPLPPSATAMAATTSAAAAMLLSGSSSSNLHQTLSSPSATSSSSF- 343
Query: 353 ANPNHPSAGLLLNLTANNFYAPMATASTSAAHNSYYQNNFQANFFSRPL--DGRTWKSAA 412
N P + L+A+ + + T+ F + + + +S
Sbjct: 344 -YHNFPYTSTIATLSASAPFPTITLDLTNPPRPLQPPPQFLSQYGPAAFLPNANQIRSMN 403
Query: 413 EENKQPLVG-------------ESV-SAIASDPKFRVAVAEAISSLINKDGNLTAPNSVK 472
N+Q L+ +SV +AIA DP F A+A AIS++I N N+
Sbjct: 404 NNNQQLLIPNLFGPQAPPREMVDSVRAAIAMDPNFTAALAAAISNIIGGGNN---DNNNN 463
Query: 473 RSSFGTEKDGDGGDSGGGNNSWVVQSLST 474
+ D G S G++ + QS +T
Sbjct: 464 TDINDNKVDAKSGGSSNGDSPQLPQSCTT 485
BLAST of Cp4.1LG01g01970 vs. TrEMBL
Match:
A0A0A0LC02_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G212490 PE=4 SV=1)
HSP 1 Score: 478.4 bits (1230), Expect = 1.0e-131
Identity = 311/503 (61.83%), Postives = 357/503 (70.97%), Query Frame = 1
Query: 1 MEIDLSLKIDHHKQEPNQEH----------QEQDEHEEHEEHEAYRVREKKGVDDTEIHV 60
MEIDLSLKIDHHK+E + H Q QD+H+ EE E E++ D + HV
Sbjct: 1 MEIDLSLKIDHHKEEHHHHHLIKHQKNDQQQRQDDHDREEEGEGEGEEEEEEEIDIDHHV 60
Query: 61 AAST---LKVFLPQHNVNVGEISELQMEMNRMKEENKMLRKEVEQTMKDYYDLEMKIAII 120
ST LKVFLP +N NVGEISELQMEM+R+KEENK LRK VEQTMKDYYDLEMKI
Sbjct: 61 VPSTTSGLKVFLPHNNTNVGEISELQMEMDRIKEENKALRKAVEQTMKDYYDLEMKIGFF 120
Query: 121 QQNN-LQKK--DSHNFLPSHENENKRVEEPNR-ELELGEMAKK-RRVRSPSKDNEMRESE 180
QQNN L K HNFL H NENKR EE + +LELGEMAKK RRV S SK++EMRESE
Sbjct: 121 QQNNNLNNKLECDHNFLSFHGNENKRHEELTKHDLELGEMAKKKRRVGSASKEDEMRESE 180
Query: 181 LGLSLGLHT---NNDLEEDNDHKDQ--EEETRE-KSKEH--VTSNMKAMQQSKPQRPELQ 240
LGLSLGLHT N+DLE++++ ++ EEE RE K+KE+ + SN ++Q +KPQRPELQ
Sbjct: 181 LGLSLGLHTKNSNDDLEQEDNDRELLIEEERREIKNKENSIIMSNFNSIQ-NKPQRPELQ 240
Query: 241 GMAPPHNRKARVSVRARCEAATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQ 300
MAPP NRKARVSVRARCE+ATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQ
Sbjct: 241 AMAPPQNRKARVSVRARCESATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQ 300
Query: 301 VQRCLEDMSILITTYEGTHNHPLPVGATAMASTASAA-ASFTLLDSTNLPLPNPQNPNNI 360
VQRCLEDMSILITTYEGTHNHPLPVGATAMASTASAA ASF LLDS+N N N +N
Sbjct: 301 VQRCLEDMSILITTYEGTHNHPLPVGATAMASTASAASASFMLLDSSN---TNNTNLSNS 360
Query: 361 LNSSSYSANPNHPSAGLLLNLTANNFYAPMATASTSAAHNSYYQNNFQANFFSRPLDGRT 420
L+ + N + PS N T + F T+STS +S+Y +NFQ N PLD RT
Sbjct: 361 LHLNPNILNSSSPSFLQTQNPTNHLFTPLFPTSSTSHFPHSFYHSNFQPNHLVGPLDRRT 420
Query: 421 WKSAAEENKQPLVGESVSAIASDPKFRVAVAEAISSLINKDGNLTAPNSVKRSSFGTEKD 477
WK + P ++VSAIASDPKFRVAVA AISSLINK+ N S+ + K
Sbjct: 421 WKPTDDNKPPPFTPDAVSAIASDPKFRVAVAAAISSLINKE-NEHMTTSMTGETVTDGKG 480
BLAST of Cp4.1LG01g01970 vs. TrEMBL
Match:
S5CKA9_JATCU (Uncharacterized protein OS=Jatropha curcas GN=WRKY34 PE=4 SV=1)
HSP 1 Score: 353.6 bits (906), Expect = 3.8e-94
Identity = 267/549 (48.63%), Postives = 335/549 (61.02%), Query Frame = 1
Query: 1 MEIDLSLKIDHHKQEPNQEHQEQDEHEEHEEHEAYRVRE------KKGVDDTEIH----- 60
M+IDLSLKID +E +E +E++E EE E +A V+E +K D EI+
Sbjct: 1 MDIDLSLKIDTEDKEQEREEEEEEEEEEEEAKKAKEVQEMQENSREKRPDVQEINDNEAT 60
Query: 61 --------VAASTLKVFLPQHNVNVGEISELQMEMNRMKEENKMLRKEVEQTMKDYYDLE 120
V S+L++ L Q N E+S LQMEMNRMKEENK+LRK VEQTMKDYYDL+
Sbjct: 61 PTITGGEVVDDSSLELSL-QENTKTEELSALQMEMNRMKEENKVLRKVVEQTMKDYYDLQ 120
Query: 121 MKIAIIQQNNLQKKDSHNFLPSHENENKRVEEPNRELELGEMAKKRRVRSPSKDNEM-RE 180
MK A+IQQN +KD FLP NE K E P + + R + SKD+++ E
Sbjct: 121 MKFAVIQQNT--QKDPPIFLPLRGNE-KAFEVPKSVPKFFDTNDNRNRATLSKDDKIIEE 180
Query: 181 SELGLSLGLHTNNDLEEDNDHKDQEEETREKSKEHVTSNMKAMQQSKPQRPE--LQGM-- 240
ELGLSL L +D+D +++EE+ +E+ + N ++Q +K QR + L G+
Sbjct: 181 RELGLSLRLQN-----DDSDRQEREEDYKEEINKEENGNYASVQNNKLQRTDNNLPGITS 240
Query: 241 --APPHNRKARVSVRARCEAATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQ 300
A NRKARVSVRARC+AATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQ
Sbjct: 241 HGASLPNRKARVSVRARCQAATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQ 300
Query: 301 VQRCLEDMSILITTYEGTHNHPLPVGATAMASTASAAASFTLLDSTNLPLPNPQNPNNIL 360
VQRCLEDMSILITTYEGTHNHPLPVGATAMASTASAAASF LLDS+N P N +N
Sbjct: 301 VQRCLEDMSILITTYEGTHNHPLPVGATAMASTASAAASFMLLDSSN---PLSDNISNFT 360
Query: 361 NSSS------------------YSANPNHPSAGLLLNLTANNFYA-------PMATASTS 420
+S S NPN PS G++L+LT N+ + +AT+S+S
Sbjct: 361 TQASNFPFRGASHMFYPNSMPFRSINPNDPSKGIVLDLTNNSTHQDHPPPQFALATSSSS 420
Query: 421 AAHN---------------SYYQNNFQA-----NFFSRPL---DGRTWKSAAEENKQPLV 476
+H+ S +QNN + +F + P R WKS EE+K L
Sbjct: 421 PSHSLAQPPPPMFSWMQNKSIHQNNGNSTIATNHFLASPRVDDHQRRWKS--EEDKSSL- 480
BLAST of Cp4.1LG01g01970 vs. TrEMBL
Match:
E7CEW8_CUCSA (WRKY protein OS=Cucumis sativus GN=WRKY19 PE=2 SV=1)
HSP 1 Score: 350.5 bits (898), Expect = 3.2e-93
Identity = 215/341 (63.05%), Postives = 249/341 (73.02%), Query Frame = 1
Query: 145 KKRRVRSPSKDNEMRESELGLSLGLHT---NNDLEEDNDHKDQ--EEETRE-KSKEH--V 204
KKRRV S SK++EMRESELGLSLGLHT N+DLE++++ ++ EEE RE K+KE+ +
Sbjct: 4 KKRRVGSASKEDEMRESELGLSLGLHTKNSNDDLEQEDNDRELLIEEERREIKNKENSII 63
Query: 205 TSNMKAMQQSKPQRPELQGMAPPHNRKARVSVRARCEAATMNDGCQWRKYGQKIAKGNPC 264
SN ++Q +KPQRPELQ MAPP NRKARVSVRARCE+ATMNDGCQWRKYGQKIAKGNPC
Sbjct: 64 MSNFNSIQ-NKPQRPELQAMAPPQNRKARVSVRARCESATMNDGCQWRKYGQKIAKGNPC 123
Query: 265 PRAYYRCTVAPGCPVRKQVQRCLEDMSILITTYEGTHNHPLPVGATAMASTASAA-ASFT 324
PRAYYRCTVAPGCPVRKQVQRCLEDMSILITTYEGTHNHPLPVGATAMASTASAA ASF
Sbjct: 124 PRAYYRCTVAPGCPVRKQVQRCLEDMSILITTYEGTHNHPLPVGATAMASTASAASASFM 183
Query: 325 LLDSTNLPLPNPQNPNNILNSSSYSANPNHPSAGLLLNLTANNFYAPMATASTSAAHNSY 384
LLDS+N N N +N L+ + N + PS N T + F T+STS +S+
Sbjct: 184 LLDSSNT---NNTNLSNSLHLNPNILNSSSPSFLQTQNPTNHLFTPLFPTSSTSHFPHSF 243
Query: 385 YQNNFQANFFSRPLDGRTWKSAAEENKQPLVGESVSAIASDPKFRVAVAEAISSLINKDG 444
Y +NFQ N PLD RTWK + P ++VSAIASDPKFRVAVA AISSLINK+
Sbjct: 244 YHSNFQPNHLVGPLDRRTWKPTDDNKPPPFTPDAVSAIASDPKFRVAVAAAISSLINKE- 303
Query: 445 NLTAPNSVKRSSFGTEKDGDGGDSGGGNNSWVVQSLSTNGN 477
N S+ + K G G DS GN WVV+SLS+ N
Sbjct: 304 NEHMTTSMTGETVTDGKGGGGSDSDSGNKKWVVESLSSKSN 339
BLAST of Cp4.1LG01g01970 vs. TrEMBL
Match:
A0A061DXG8_THECC (WRKY DNA-binding protein 9, putative isoform 1 OS=Theobroma cacao GN=TCM_006426 PE=4 SV=1)
HSP 1 Score: 349.7 bits (896), Expect = 5.4e-93
Identity = 258/522 (49.43%), Postives = 330/522 (63.22%), Query Frame = 1
Query: 1 MEIDLSLKIDHHKQEPNQEHQEQDEHEEHEE--HEAYRVREKKGVDDTEIHVA-ASTLKV 60
MEIDLSLKID ++E +E +E++E EE E+ EA E+ D E+ A A+T +V
Sbjct: 8 MEIDLSLKIDAKEEEEEEEEEEEEEVEEEEKDVEEAKETMEEDDNQDREVMTAIAATGEV 67
Query: 61 -------FLPQHNVNVGEISELQMEMNRMKEENKMLRKEVEQTMKDYYDLEMKIAIIQQN 120
F Q N+ E+S LQMEM+RMKEENK+LRK VE+TM+DYYDL+MK A IQQN
Sbjct: 68 EVGAPLEFSLQENMKTEELSVLQMEMSRMKEENKVLRKVVEKTMQDYYDLQMKFAAIQQN 127
Query: 121 NLQKKDSHNFLPSHENENKRVEEPNRELELGEMAKKRRVRSPSKDNEMRESELGLSLGLH 180
N QKKD FL NEN E+ +K+ SPS+D+ E+ELGLSL L
Sbjct: 128 N-QKKDPQIFLSLSGNENSSQEQQANPRTSNVNNQKQG--SPSQDDNDEENELGLSLRLQ 187
Query: 181 TNNDLEEDNDHKDQEEETREKSKEHVTSNMKAMQQSKPQRPELQGM----APPHNRKARV 240
T + E +E++ +E + +TSN+ ++Q +K + L + A P NRKARV
Sbjct: 188 TISSQREIRQGDQKEDQRKELESQEITSNVASVQ-NKLDQSHLSAITSHAASPPNRKARV 247
Query: 241 SVRARCEAATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQVQRCLEDMSILI 300
SVRARC+ ATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQVQRCLEDMSILI
Sbjct: 248 SVRARCQTATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQVQRCLEDMSILI 307
Query: 301 TTYEGTHNHPLPVGATAMASTAS-AAASFTLLDSTN-------------LPLPNPQNPNN 360
TTYEGTHNHPLPVGATAMASTAS AAASF LLDS+N LP NP N+
Sbjct: 308 TTYEGTHNHPLPVGATAMASTASAAAASFMLLDSSNPLSNGIPNITQATLPYQNPHLINS 367
Query: 361 ILNSSSY-SANPNHPSAGLLLNLTANNFY--APMATASTSAAHNSYYQNNF--------- 420
+ S++ + N PS G++L+LT N+ + + ++S++H+S +Q F
Sbjct: 368 VNPSNNVRNMTLNDPSKGIVLDLTNNHHFDHHQLPITASSSSHSSAHQQAFPWMPSRLNY 427
Query: 421 -QAN------FFSRPLDGRTWKSAAEENKQPLVGESVSAIASDPKFRVAVAEAISSLINK 476
AN F + + R WKS +E+K + E+V+AIASDPKFRVAVA AI+SLINK
Sbjct: 428 HNANPLPSNAFATSRTNEREWKS--DEDKS--LAENVTAIASDPKFRVAVAAAITSLINK 487
BLAST of Cp4.1LG01g01970 vs. TrEMBL
Match:
A0A061DZ88_THECC (WRKY DNA-binding protein 9, putative isoform 2 OS=Theobroma cacao GN=TCM_006426 PE=4 SV=1)
HSP 1 Score: 339.3 bits (869), Expect = 7.4e-90
Identity = 255/522 (48.85%), Postives = 324/522 (62.07%), Query Frame = 1
Query: 1 MEIDLSLKIDHHKQEPNQEHQEQDEHEEHEE--HEAYRVREKKGVDDTEIHVA-ASTLKV 60
MEIDLSLKID ++E +E +E++E EE E+ EA E+ D E+ A A+T +V
Sbjct: 8 MEIDLSLKIDAKEEEEEEEEEEEEEVEEEEKDVEEAKETMEEDDNQDREVMTAIAATGEV 67
Query: 61 -------FLPQHNVNVGEISELQMEMNRMKEENKMLRKEVEQTMKDYYDLEMKIAIIQQN 120
F Q N+ E MEM+RMKEENK+LRK VE+TM+DYYDL+MK A IQQN
Sbjct: 68 EVGAPLEFSLQENMKTEE-----MEMSRMKEENKVLRKVVEKTMQDYYDLQMKFAAIQQN 127
Query: 121 NLQKKDSHNFLPSHENENKRVEEPNRELELGEMAKKRRVRSPSKDNEMRESELGLSLGLH 180
N QKKD FL NEN E+ ++ SPS+D+ E+ELGLSL L
Sbjct: 128 N-QKKDPQIFLSLSGNENSSQEQQANPRTSN--VNNQKQGSPSQDDNDEENELGLSLRLQ 187
Query: 181 TNNDLEEDNDHKDQEEETREKSKEHVTSNMKAMQQSKPQRPELQGM----APPHNRKARV 240
T + E +E++ +E + +TSN+ A Q+K + L + A P NRKARV
Sbjct: 188 TISSQREIRQGDQKEDQRKELESQEITSNV-ASVQNKLDQSHLSAITSHAASPPNRKARV 247
Query: 241 SVRARCEAATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQVQRCLEDMSILI 300
SVRARC+ ATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQVQRCLEDMSILI
Sbjct: 248 SVRARCQTATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQVQRCLEDMSILI 307
Query: 301 TTYEGTHNHPLPVGATAMASTAS-AAASFTLLDSTN-------------LPLPNPQNPNN 360
TTYEGTHNHPLPVGATAMASTAS AAASF LLDS+N LP NP N+
Sbjct: 308 TTYEGTHNHPLPVGATAMASTASAAAASFMLLDSSNPLSNGIPNITQATLPYQNPHLINS 367
Query: 361 ILNSSSY-SANPNHPSAGLLLNLTANNFY--APMATASTSAAHNSYYQNNF--------- 420
+ S++ + N PS G++L+LT N+ + + ++S++H+S +Q F
Sbjct: 368 VNPSNNVRNMTLNDPSKGIVLDLTNNHHFDHHQLPITASSSSHSSAHQQAFPWMPSRLNY 427
Query: 421 -QAN------FFSRPLDGRTWKSAAEENKQPLVGESVSAIASDPKFRVAVAEAISSLINK 476
AN F + + R WKS +E+K + E+V+AIASDPKFRVAVA AI+SLINK
Sbjct: 428 HNANPLPSNAFATSRTNEREWKS--DEDKS--LAENVTAIASDPKFRVAVAAAITSLINK 487
BLAST of Cp4.1LG01g01970 vs. TAIR10
Match:
AT1G68150.1 (AT1G68150.1 WRKY DNA-binding protein 9)
HSP 1 Score: 218.0 bits (554), Expect = 1.3e-56
Identity = 169/379 (44.59%), Postives = 223/379 (58.84%), Query Frame = 1
Query: 1 MEIDLSLKIDHHKQEPNQEHQEQD-EHEEHEEHEAYRVREKKGVDDTEIHVAASTLKVFL 60
M IDLSLK++ +++ E + E++E EEH+A +++ V + E +S+L +
Sbjct: 27 MGIDLSLKLEAEEKKKEIEGSKHSRENKEDEEHDASGDEDEQMVKEDEDD--SSSLGLRT 86
Query: 61 PQHNVNVGEISELQMEMNRMKEENKMLRKEVEQTMKDYYDLEMKIAIIQQNNLQKKDSHN 120
+ E+ +LQ++M +KEEN LRK VEQT++DY LEMK +I + +K D
Sbjct: 87 REEENEREELLQLQIQMESVKEENTRLRKLVEQTLEDYRHLEMKFPVIDKT--KKMDLEM 146
Query: 121 FLPSHENENKRVEEPNRELELGEMAKKRRV-RSPSKDNEMRESELGLSLGLHTNNDLEED 180
FL + R +++ A+KR RSPS E E+GLSL L
Sbjct: 147 FLGV---------QGKRCVDITSKARKRGAERSPSM-----EREIGLSLSL--------- 206
Query: 181 NDHKDQEEETREK-SKEHVTSNMKAMQQSKPQR-PELQGMAPPHNRKARVSVRARCEAAT 240
+ K ++EE++E H N ++ + P+ QG NRKARVSVRARCE AT
Sbjct: 207 -EKKQKQEESKEAVQSHHQRYNSSSLDMNMPRIISSSQG-----NRKARVSVRARCETAT 266
Query: 241 MNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQVQRCLEDMSILITTYEGTHNHP 300
MNDGCQWRKYGQK AKGNPCPRAYYRCTVAPGCPVRKQVQRCLEDMSILITTYEGTHNHP
Sbjct: 267 MNDGCQWRKYGQKTAKGNPCPRAYYRCTVAPGCPVRKQVQRCLEDMSILITTYEGTHNHP 326
Query: 301 LPVGATAMASTASAAASFTLLDSTNLPLPNPQNPNNILNSS--SYSANPNHPSAGLLLNL 360
LPVGATAMASTAS + L S NL P+ ++SS +Y N ++ + +
Sbjct: 327 LPVGATAMASTASTSPFLLLDSSDNLSHPSYYQTPQAIDSSLITYPQNSSYNNR----TI 368
Query: 361 TANNFYAPMATASTSAAHN 374
+ NF P S++ N
Sbjct: 387 RSLNFDGPSRGDHVSSSQN 368
BLAST of Cp4.1LG01g01970 vs. TAIR10
Match:
AT4G22070.1 (AT4G22070.1 WRKY DNA-binding protein 31)
HSP 1 Score: 187.2 bits (474), Expect = 2.4e-47
Identity = 154/425 (36.24%), Postives = 215/425 (50.59%), Query Frame = 1
Query: 68 EISELQMEMNRMKEENKMLRKEVEQTMKDYYDLEMKIAIIQQNNLQKKDSHNFLPSHEN- 127
E ++LQ E+ +MK EN+ LR + Q ++ L+M++ + + Q+ S + L + E+
Sbjct: 110 ENAQLQEELKKMKIENQRLRDMLSQATTNFNALQMQLVAVMRQQEQRNSSQDHLLAQESK 169
Query: 128 -ENKRVEE-----PNRELELGEMAKKRRVRSPSKDNEMRESELGLSLGLHTNNDLEEDND 187
E ++ +E P + ++LG + + E G L +++ E+
Sbjct: 170 AEGRKRQELQIMVPRQFMDLGPSSGAAEHGAEVSSEERTTVRSGSPPSLLESSNPRENGK 229
Query: 188 HKDQEEETREKSKEHVTSNMKAMQQSKPQRPELQG----------MAPPHNRKARVSVRA 247
EE+ E+S+ + N + + P G A RKARVSVRA
Sbjct: 230 RLLGREESSEESESNAWGNPNKVPKHNPSSSNSNGNRNGNVIDQSAAEATMRKARVSVRA 289
Query: 248 RCEAATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQVQRCLEDMSILITTYE 307
R EAA ++DGCQWRKYGQK+AKGNPCPRAYYRCT+A GCPVRKQVQRC ED SILITTYE
Sbjct: 290 RSEAAMISDGCQWRKYGQKMAKGNPCPRAYYRCTMAGGCPVRKQVQRCAEDRSILITTYE 349
Query: 308 GTHNHPLPVGATAMASTASAAASFTLLDSTNLPLPNPQNPNNIL---------NSSSYSA 367
G HNHPLP ATAMAST +AAAS LL + NP N+L + ++ SA
Sbjct: 350 GNHNHPLPPAATAMASTTTAAASM-LLSGSMSSQDGLMNPTNLLARAILPCSSSMATISA 409
Query: 368 NPNHPSAGLLL---------NLTANNFYAPMA-------TASTSAAHNSYYQNNFQANFF 427
+ P+ L L N+T NN A + Y N Q+ F
Sbjct: 410 SAPFPTITLDLTNSPNGNNPNMTTNNPLMQFAQRPGFNPAVLPQVVGQAMYNNQQQSKFS 469
Query: 428 SRPLDGRTWKSAAEENKQPLVGESVSAIASDPKFRVAVAEAISSLIN---KDGNLTAPNS 448
L + + AA + V + +AIASDP F A+A AI+S++N N T N+
Sbjct: 470 GLQLPAQPLQIAATSSVAESVSAASAAIASDPNFAAALAAAITSIMNGSSHQNNNTNNNN 529
BLAST of Cp4.1LG01g01970 vs. TAIR10
Match:
AT5G15130.1 (AT5G15130.1 WRKY DNA-binding protein 72)
HSP 1 Score: 176.8 bits (447), Expect = 3.2e-44
Identity = 164/484 (33.88%), Postives = 233/484 (48.14%), Query Frame = 1
Query: 62 HNVNVG-----EISELQMEMNRMKEENKMLRKEVEQTMKDYYDLEMK-IAIIQQ--NNLQ 121
H N G E+ + EM+ +KEEN+ L+ +E+ DY L+++ IIQQ +N
Sbjct: 24 HEANKGDGDHQELESAKAEMSEVKEENEKLKGMLERIESDYKSLKLRFFDIIQQEPSNTA 83
Query: 122 KKDSHNFLPSHENENKRVEEPNRELELGEMAKKRRVRSPSKDNEMRE------------- 181
K + N + + + ++E EL ++ RR SPS +E
Sbjct: 84 TK-NQNMVDHPKPTTTDLSSFDQERELVSLSLGRRSSSPSDSVPKKEEKTDAISAEVNAD 143
Query: 182 ---SELGLSLGLHTNNDLEEDNDHKDQEEETREKSKEHVTSNMKAMQQSKPQRPELQGMA 241
++ GL+LG++ N E + E S+E ++S P P G A
Sbjct: 144 EELTKAGLTLGINNGNG-GEPKEGLSMENRANSGSEEAWAPGKVTGKRSSP-APASGGDA 203
Query: 242 ------PPHNRKARVSVRARCEAATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPV 301
H ++ARV VRARC+ TMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPV
Sbjct: 204 DGEAGQQNHVKRARVCVRARCDTPTMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPV 263
Query: 302 RKQVQRCLEDMSILITTYEGTHNHPLPVGATAMASTASAAASFTLLDSTNLPLPNPQNPN 361
RKQVQRC +DMSILITTYEGTH+H LP+ AT MAST SAAAS L S++ P N
Sbjct: 264 RKQVQRCADDMSILITTYEGTHSHSLPLSATTMASTTSAAASMLLSGSSSSPAAE-MIGN 323
Query: 362 NILNSSSYSAN-----------PNHPSAGLLLNLTANNFYAPMATASTSAAHNSYYQNNF 421
N+ ++S ++ N P HP+ + L+LT AP ++S+S++ S N F
Sbjct: 324 NLYDNSRFNNNNKSFYSPTLHSPLHPT--VTLDLT-----APQHSSSSSSSLLSLNFNKF 383
Query: 422 QANFFSRPLDGRTWKSAAEENKQP-------LVGESVSAI-------------------- 458
+F P + S + + P + G S+
Sbjct: 384 SNSFQRFPSTSLNFSSTSSTSSNPSTLNLPAIWGNGYSSYTPYPYNNVQFGTSNLGKTVQ 443
BLAST of Cp4.1LG01g01970 vs. TAIR10
Match:
AT4G04450.1 (AT4G04450.1 WRKY family transcription factor)
HSP 1 Score: 159.8 bits (403), Expect = 4.0e-39
Identity = 159/465 (34.19%), Postives = 239/465 (51.40%), Query Frame = 1
Query: 11 HHKQEPNQEHQEQDEHEEHEEHEAYRVREKKGVDDTEIHVAASTLKVFLPQHNVNVGEIS 70
H K+E ++ D +H + G D++ + L V + + E +
Sbjct: 59 HVKRENSRVDDHDDRSTDHINIGLNLLTANTGSDESMVD---DGLSVDMEEKRTKC-ENA 118
Query: 71 ELQMEMNRMKEENKMLRKEVEQTMKDYYDLEMKIAIIQQNNLQKKDSHNFLPSHENENKR 130
+L+ E+ + E+N+ L++ + QT ++ L+M++ + + Q++D H+ + N+N +
Sbjct: 119 QLREELKKASEDNQRLKQMLSQTTNNFNSLQMQLVAVMR---QQEDHHHLATTENNDNVK 178
Query: 131 VEE------PNRELELG----EMAKKRR--VRSPSKDNEMRES---ELGLSLGLHTNNDL 190
P + ++LG E++ + R VRS S + + +S + G + + +
Sbjct: 179 NRHEVPEMVPRQFIDLGPHSDEVSSEERTTVRSGSPPSLLEKSSSRQNGKRVLVREESPE 238
Query: 191 EEDNDHKDQEEETREKSKEHVTSNMKAMQQSKPQRPEL--QGMAPPHNRKARVSVRARCE 250
E N ++ + K H +S++ S+ ++ Q A RKARVSVRAR E
Sbjct: 239 TESNGWRNPNKVP----KHHASSSICGGNGSENASSKVIEQAAAEATMRKARVSVRARSE 298
Query: 251 AATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQVQRCLEDMSILITTYEGTH 310
A ++DGCQWRKYGQK+AKGNPCPRAYYRCT+A GCPVRKQVQRC ED +ILITTYEG H
Sbjct: 299 APMLSDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNH 358
Query: 311 NHPLPVGATAMASTASAAASFTLLDSTNLPLPNPQNPNNIL-------NSSSYSANPNHP 370
NHPLP A MAST +AAAS L ST NP N+L +SS + + + P
Sbjct: 359 NHPLPPAAMNMASTTTAAASMLLSGSTMSNQDGLMNPTNLLARTILPCSSSMATISASAP 418
Query: 371 SAGLLLNLT--------ANNFYAPMATASTSAAHNSYYQNNF--QANFFSRPLDGRTWKS 430
+ L+LT NN + S N + QA ++++ ++ S
Sbjct: 419 FPTITLDLTESPNGNNPTNNPLMQFSQRSGLVELNQSVLPHMMGQALYYNQ----QSKFS 478
Query: 431 AAEENKQPL-VGESVS----AIASDPKFRVAVAEAISSLINKDGN 437
QPL GESVS AIAS+P F A+A AI+S+IN N
Sbjct: 479 GLHMPSQPLNAGESVSAATAAIASNPNFAAALAAAITSIINGSNN 508
BLAST of Cp4.1LG01g01970 vs. TAIR10
Match:
AT4G01720.1 (AT4G01720.1 WRKY family transcription factor)
HSP 1 Score: 159.1 bits (401), Expect = 6.9e-39
Identity = 124/329 (37.69%), Postives = 172/329 (52.28%), Query Frame = 1
Query: 173 NDLEEDNDHKDQEEETREKSKEHVTSNMKAMQQSKPQRPEL--------QGMAPPHN--- 232
N +D +H+ + +S + V + M + P+ P + + PH+
Sbjct: 164 NRRPKDMNHETPATTLKRRSPDDVDG--RDMHRGSPKTPRIDQNKSTNHEEQQNPHDQLP 223
Query: 233 -RKARVSVRARCEAATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQVQRCLE 292
RKARVSVRAR +A T+NDGCQWRKYGQK+AKGNPCPRAYYRCT+A GCPVRKQVQRC E
Sbjct: 224 YRKARVSVRARSDATTVNDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAE 283
Query: 293 DMSILITTYEGTHNHPLPVGATAMASTASAAASFTLLDSTNLPLPNPQNPNNILNSSSYS 352
D +IL TTYEG HNHPLP ATAMA+T SAAA+ L S++ L + + +SSS+
Sbjct: 284 DTTILTTTYEGNHNHPLPPSATAMAATTSAAAAMLLSGSSSSNLHQTLSSPSATSSSSF- 343
Query: 353 ANPNHPSAGLLLNLTANNFYAPMATASTSAAHNSYYQNNFQANFFSRPL--DGRTWKSAA 412
N P + L+A+ + + T+ F + + + +S
Sbjct: 344 -YHNFPYTSTIATLSASAPFPTITLDLTNPPRPLQPPPQFLSQYGPAAFLPNANQIRSMN 403
Query: 413 EENKQPLVG-------------ESV-SAIASDPKFRVAVAEAISSLINKDGNLTAPNSVK 472
N+Q L+ +SV +AIA DP F A+A AIS++I N N+
Sbjct: 404 NNNQQLLIPNLFGPQAPPREMVDSVRAAIAMDPNFTAALAAAISNIIGGGNN---DNNNN 463
Query: 473 RSSFGTEKDGDGGDSGGGNNSWVVQSLST 474
+ D G S G++ + QS +T
Sbjct: 464 TDINDNKVDAKSGGSSNGDSPQLPQSCTT 485
BLAST of Cp4.1LG01g01970 vs. NCBI nr
Match:
gi|778674482|ref|XP_011650228.1| (PREDICTED: uncharacterized protein LOC101215114 isoform X1 [Cucumis sativus])
HSP 1 Score: 478.4 bits (1230), Expect = 1.5e-131
Identity = 311/503 (61.83%), Postives = 357/503 (70.97%), Query Frame = 1
Query: 1 MEIDLSLKIDHHKQEPNQEH----------QEQDEHEEHEEHEAYRVREKKGVDDTEIHV 60
MEIDLSLKIDHHK+E + H Q QD+H+ EE E E++ D + HV
Sbjct: 1 MEIDLSLKIDHHKEEHHHHHLIKHQKNDQQQRQDDHDREEEGEGEGEEEEEEEIDIDHHV 60
Query: 61 AAST---LKVFLPQHNVNVGEISELQMEMNRMKEENKMLRKEVEQTMKDYYDLEMKIAII 120
ST LKVFLP +N NVGEISELQMEM+R+KEENK LRK VEQTMKDYYDLEMKI
Sbjct: 61 VPSTTSGLKVFLPHNNTNVGEISELQMEMDRIKEENKALRKAVEQTMKDYYDLEMKIGFF 120
Query: 121 QQNN-LQKK--DSHNFLPSHENENKRVEEPNR-ELELGEMAKK-RRVRSPSKDNEMRESE 180
QQNN L K HNFL H NENKR EE + +LELGEMAKK RRV S SK++EMRESE
Sbjct: 121 QQNNNLNNKLECDHNFLSFHGNENKRHEELTKHDLELGEMAKKKRRVGSASKEDEMRESE 180
Query: 181 LGLSLGLHT---NNDLEEDNDHKDQ--EEETRE-KSKEH--VTSNMKAMQQSKPQRPELQ 240
LGLSLGLHT N+DLE++++ ++ EEE RE K+KE+ + SN ++Q +KPQRPELQ
Sbjct: 181 LGLSLGLHTKNSNDDLEQEDNDRELLIEEERREIKNKENSIIMSNFNSIQ-NKPQRPELQ 240
Query: 241 GMAPPHNRKARVSVRARCEAATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQ 300
MAPP NRKARVSVRARCE+ATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQ
Sbjct: 241 AMAPPQNRKARVSVRARCESATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQ 300
Query: 301 VQRCLEDMSILITTYEGTHNHPLPVGATAMASTASAA-ASFTLLDSTNLPLPNPQNPNNI 360
VQRCLEDMSILITTYEGTHNHPLPVGATAMASTASAA ASF LLDS+N N N +N
Sbjct: 301 VQRCLEDMSILITTYEGTHNHPLPVGATAMASTASAASASFMLLDSSN---TNNTNLSNS 360
Query: 361 LNSSSYSANPNHPSAGLLLNLTANNFYAPMATASTSAAHNSYYQNNFQANFFSRPLDGRT 420
L+ + N + PS N T + F T+STS +S+Y +NFQ N PLD RT
Sbjct: 361 LHLNPNILNSSSPSFLQTQNPTNHLFTPLFPTSSTSHFPHSFYHSNFQPNHLVGPLDRRT 420
Query: 421 WKSAAEENKQPLVGESVSAIASDPKFRVAVAEAISSLINKDGNLTAPNSVKRSSFGTEKD 477
WK + P ++VSAIASDPKFRVAVA AISSLINK+ N S+ + K
Sbjct: 421 WKPTDDNKPPPFTPDAVSAIASDPKFRVAVAAAISSLINKE-NEHMTTSMTGETVTDGKG 480
BLAST of Cp4.1LG01g01970 vs. NCBI nr
Match:
gi|659112178|ref|XP_008456102.1| (PREDICTED: probable WRKY transcription factor 9 [Cucumis melo])
HSP 1 Score: 474.9 bits (1221), Expect = 1.6e-130
Identity = 311/512 (60.74%), Postives = 356/512 (69.53%), Query Frame = 1
Query: 1 MEIDLSLKIDHHKQEPN------------QEHQEQDEHEEHEEHEAYRVREKKGVDDTEI 60
MEIDLSLKIDHHK+E + Q+HQ+ +H++ EE E E+ +D +
Sbjct: 1 MEIDLSLKIDHHKEEHHHHHLIKHQKTDQQQHQDDHDHDKEEEEEDEEEEEEIDIDHHVV 60
Query: 61 HVAASTLKVFLPQHNVNVGEISELQMEMNRMKEENKMLRKEVEQTMKDYYDLEMKIAIIQ 120
S LKV LP +N+NVGEISELQMEM+R+KEENK LRK VEQTMKDYYDLEMKI Q
Sbjct: 61 PSTTSGLKVLLPHNNINVGEISELQMEMDRIKEENKALRKAVEQTMKDYYDLEMKIGFFQ 120
Query: 121 QNN-LQKK--DSHNFLPSHENENKRVEEPNRE-LELGEMAKK-RRVRSPSKDNEMRESEL 180
QNN L K HNFL H NENKR EEP ++ LEL EMAKK RRV S K++EMRESEL
Sbjct: 121 QNNNLNNKLECDHNFLSFHGNENKRHEEPTKQDLELREMAKKKRRVGSALKEDEMRESEL 180
Query: 181 GLSLGLHT---NNDL-EEDNDHK----DQEEETREKSKEHVTSNMKAMQQSKPQRPELQG 240
GLSLGLHT NNDL +EDND + ++ E R K + N ++Q +KPQRPELQ
Sbjct: 181 GLSLGLHTKNNNNDLKQEDNDREILIEEERREVRNKESSIIMENFNSIQ-NKPQRPELQA 240
Query: 241 MAPPHNRKARVSVRARCEAATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQV 300
MAPP NRKARVSVRARCE+ATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQV
Sbjct: 241 MAPPQNRKARVSVRARCESATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQV 300
Query: 301 QRCLEDMSILITTYEGTHNHPLPVGATAMASTASAA-ASFTLLDS-----TNLPLPNPQN 360
QRCLEDMSILITTYEGTHNHPLPVGATAMASTASAA ASF LLDS TNL QN
Sbjct: 301 QRCLEDMSILITTYEGTHNHPLPVGATAMASTASAASASFMLLDSSNNNNTNLSNSLHQN 360
Query: 361 PNNILNSSSYS----ANPNHPSAGLLLNLTANNFYAPM-ATASTSAAHNSYYQNNFQANF 420
P NILNSSS S NPN N+ + P+ T+STS +S+Y +NFQ N
Sbjct: 361 P-NILNSSSPSFLQTQNPN------------NHLFTPLFPTSSTSHFPHSFYHSNFQPNH 420
Query: 421 FSRPLDGRTWKSAAEENKQPLVGESVSAIASDPKFRVAVAEAISSLINKDGNLTAPNSVK 477
PLD RTWK + PL ++VSAIASDPKFRVAVA AISSLINK+ N + +
Sbjct: 421 LVSPLDRRTWKPVDDNKPPPLTPDAVSAIASDPKFRVAVAAAISSLINKE-NEHVTTTGE 480
BLAST of Cp4.1LG01g01970 vs. NCBI nr
Match:
gi|802557761|ref|XP_012065723.1| (PREDICTED: probable WRKY transcription factor 9 [Jatropha curcas])
HSP 1 Score: 353.6 bits (906), Expect = 5.4e-94
Identity = 267/549 (48.63%), Postives = 335/549 (61.02%), Query Frame = 1
Query: 1 MEIDLSLKIDHHKQEPNQEHQEQDEHEEHEEHEAYRVRE------KKGVDDTEIH----- 60
M+IDLSLKID +E +E +E++E EE E +A V+E +K D EI+
Sbjct: 9 MDIDLSLKIDTEDKEQEREEEEEEEEEEEEAKKAKEVQEMQENSREKRPDVQEINDNEAT 68
Query: 61 --------VAASTLKVFLPQHNVNVGEISELQMEMNRMKEENKMLRKEVEQTMKDYYDLE 120
V S+L++ L Q N E+S LQMEMNRMKEENK+LRK VEQTMKDYYDL+
Sbjct: 69 PTITGGEVVDDSSLELSL-QENTKTEELSALQMEMNRMKEENKVLRKVVEQTMKDYYDLQ 128
Query: 121 MKIAIIQQNNLQKKDSHNFLPSHENENKRVEEPNRELELGEMAKKRRVRSPSKDNEM-RE 180
MK A+IQQN +KD FLP NE K E P + + R + SKD+++ E
Sbjct: 129 MKFAVIQQNT--QKDPPIFLPLRGNE-KAFEVPKSVPKFFDTNDNRNRATLSKDDKIIEE 188
Query: 181 SELGLSLGLHTNNDLEEDNDHKDQEEETREKSKEHVTSNMKAMQQSKPQRPE--LQGM-- 240
ELGLSL L +D+D +++EE+ +E+ + N ++Q +K QR + L G+
Sbjct: 189 RELGLSLRLQN-----DDSDRQEREEDYKEEINKEENGNYASVQNNKLQRTDNNLPGITS 248
Query: 241 --APPHNRKARVSVRARCEAATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQ 300
A NRKARVSVRARC+AATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQ
Sbjct: 249 HGASLPNRKARVSVRARCQAATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQ 308
Query: 301 VQRCLEDMSILITTYEGTHNHPLPVGATAMASTASAAASFTLLDSTNLPLPNPQNPNNIL 360
VQRCLEDMSILITTYEGTHNHPLPVGATAMASTASAAASF LLDS+N P N +N
Sbjct: 309 VQRCLEDMSILITTYEGTHNHPLPVGATAMASTASAAASFMLLDSSN---PLSDNISNFT 368
Query: 361 NSSS------------------YSANPNHPSAGLLLNLTANNFYA-------PMATASTS 420
+S S NPN PS G++L+LT N+ + +AT+S+S
Sbjct: 369 TQASNFPFRGASHMFYPNSMPFRSINPNDPSKGIVLDLTNNSTHQDHPPPQFALATSSSS 428
Query: 421 AAHN---------------SYYQNNFQA-----NFFSRPL---DGRTWKSAAEENKQPLV 476
+H+ S +QNN + +F + P R WKS EE+K L
Sbjct: 429 PSHSLAQPPPPMFSWMQNKSIHQNNGNSTIATNHFLASPRVDDHQRRWKS--EEDKSSL- 488
BLAST of Cp4.1LG01g01970 vs. NCBI nr
Match:
gi|522191312|gb|AGQ04223.1| (WRKY transcription factor 34 [Jatropha curcas])
HSP 1 Score: 353.6 bits (906), Expect = 5.4e-94
Identity = 267/549 (48.63%), Postives = 335/549 (61.02%), Query Frame = 1
Query: 1 MEIDLSLKIDHHKQEPNQEHQEQDEHEEHEEHEAYRVRE------KKGVDDTEIH----- 60
M+IDLSLKID +E +E +E++E EE E +A V+E +K D EI+
Sbjct: 1 MDIDLSLKIDTEDKEQEREEEEEEEEEEEEAKKAKEVQEMQENSREKRPDVQEINDNEAT 60
Query: 61 --------VAASTLKVFLPQHNVNVGEISELQMEMNRMKEENKMLRKEVEQTMKDYYDLE 120
V S+L++ L Q N E+S LQMEMNRMKEENK+LRK VEQTMKDYYDL+
Sbjct: 61 PTITGGEVVDDSSLELSL-QENTKTEELSALQMEMNRMKEENKVLRKVVEQTMKDYYDLQ 120
Query: 121 MKIAIIQQNNLQKKDSHNFLPSHENENKRVEEPNRELELGEMAKKRRVRSPSKDNEM-RE 180
MK A+IQQN +KD FLP NE K E P + + R + SKD+++ E
Sbjct: 121 MKFAVIQQNT--QKDPPIFLPLRGNE-KAFEVPKSVPKFFDTNDNRNRATLSKDDKIIEE 180
Query: 181 SELGLSLGLHTNNDLEEDNDHKDQEEETREKSKEHVTSNMKAMQQSKPQRPE--LQGM-- 240
ELGLSL L +D+D +++EE+ +E+ + N ++Q +K QR + L G+
Sbjct: 181 RELGLSLRLQN-----DDSDRQEREEDYKEEINKEENGNYASVQNNKLQRTDNNLPGITS 240
Query: 241 --APPHNRKARVSVRARCEAATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQ 300
A NRKARVSVRARC+AATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQ
Sbjct: 241 HGASLPNRKARVSVRARCQAATMNDGCQWRKYGQKIAKGNPCPRAYYRCTVAPGCPVRKQ 300
Query: 301 VQRCLEDMSILITTYEGTHNHPLPVGATAMASTASAAASFTLLDSTNLPLPNPQNPNNIL 360
VQRCLEDMSILITTYEGTHNHPLPVGATAMASTASAAASF LLDS+N P N +N
Sbjct: 301 VQRCLEDMSILITTYEGTHNHPLPVGATAMASTASAAASFMLLDSSN---PLSDNISNFT 360
Query: 361 NSSS------------------YSANPNHPSAGLLLNLTANNFYA-------PMATASTS 420
+S S NPN PS G++L+LT N+ + +AT+S+S
Sbjct: 361 TQASNFPFRGASHMFYPNSMPFRSINPNDPSKGIVLDLTNNSTHQDHPPPQFALATSSSS 420
Query: 421 AAHN---------------SYYQNNFQA-----NFFSRPL---DGRTWKSAAEENKQPLV 476
+H+ S +QNN + +F + P R WKS EE+K L
Sbjct: 421 PSHSLAQPPPPMFSWMQNKSIHQNNGNSTIATNHFLASPRVDDHQRRWKS--EEDKSSL- 480
BLAST of Cp4.1LG01g01970 vs. NCBI nr
Match:
gi|525507256|ref|NP_001267666.1| (uncharacterized protein LOC101215114 [Cucumis sativus])
HSP 1 Score: 350.5 bits (898), Expect = 4.6e-93
Identity = 215/341 (63.05%), Postives = 249/341 (73.02%), Query Frame = 1
Query: 145 KKRRVRSPSKDNEMRESELGLSLGLHT---NNDLEEDNDHKDQ--EEETRE-KSKEH--V 204
KKRRV S SK++EMRESELGLSLGLHT N+DLE++++ ++ EEE RE K+KE+ +
Sbjct: 4 KKRRVGSASKEDEMRESELGLSLGLHTKNSNDDLEQEDNDRELLIEEERREIKNKENSII 63
Query: 205 TSNMKAMQQSKPQRPELQGMAPPHNRKARVSVRARCEAATMNDGCQWRKYGQKIAKGNPC 264
SN ++Q +KPQRPELQ MAPP NRKARVSVRARCE+ATMNDGCQWRKYGQKIAKGNPC
Sbjct: 64 MSNFNSIQ-NKPQRPELQAMAPPQNRKARVSVRARCESATMNDGCQWRKYGQKIAKGNPC 123
Query: 265 PRAYYRCTVAPGCPVRKQVQRCLEDMSILITTYEGTHNHPLPVGATAMASTASAA-ASFT 324
PRAYYRCTVAPGCPVRKQVQRCLEDMSILITTYEGTHNHPLPVGATAMASTASAA ASF
Sbjct: 124 PRAYYRCTVAPGCPVRKQVQRCLEDMSILITTYEGTHNHPLPVGATAMASTASAASASFM 183
Query: 325 LLDSTNLPLPNPQNPNNILNSSSYSANPNHPSAGLLLNLTANNFYAPMATASTSAAHNSY 384
LLDS+N N N +N L+ + N + PS N T + F T+STS +S+
Sbjct: 184 LLDSSNT---NNTNLSNSLHLNPNILNSSSPSFLQTQNPTNHLFTPLFPTSSTSHFPHSF 243
Query: 385 YQNNFQANFFSRPLDGRTWKSAAEENKQPLVGESVSAIASDPKFRVAVAEAISSLINKDG 444
Y +NFQ N PLD RTWK + P ++VSAIASDPKFRVAVA AISSLINK+
Sbjct: 244 YHSNFQPNHLVGPLDRRTWKPTDDNKPPPFTPDAVSAIASDPKFRVAVAAAISSLINKE- 303
Query: 445 NLTAPNSVKRSSFGTEKDGDGGDSGGGNNSWVVQSLSTNGN 477
N S+ + K G G DS GN WVV+SLS+ N
Sbjct: 304 NEHMTTSMTGETVTDGKGGGGSDSDSGNKKWVVESLSSKSN 339
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
WRKY9_ARATH | 2.2e-55 | 44.59 | Probable WRKY transcription factor 9 OS=Arabidopsis thaliana GN=WRKY9 PE=2 SV=1 | [more] |
WRK31_ARATH | 4.2e-46 | 36.24 | Probable WRKY transcription factor 31 OS=Arabidopsis thaliana GN=WRKY31 PE=2 SV=... | [more] |
WRK72_ARATH | 5.7e-43 | 33.88 | Probable WRKY transcription factor 72 OS=Arabidopsis thaliana GN=WRKY72 PE=2 SV=... | [more] |
WRK42_ARATH | 7.2e-38 | 34.19 | WRKY transcription factor 42 OS=Arabidopsis thaliana GN=WRKY42 PE=2 SV=1 | [more] |
WRK47_ARATH | 1.2e-37 | 37.69 | Probable WRKY transcription factor 47 OS=Arabidopsis thaliana GN=WRKY47 PE=2 SV=... | [more] |
Match Name | E-value | Identity | Description | |
A0A0A0LC02_CUCSA | 1.0e-131 | 61.83 | Uncharacterized protein OS=Cucumis sativus GN=Csa_3G212490 PE=4 SV=1 | [more] |
S5CKA9_JATCU | 3.8e-94 | 48.63 | Uncharacterized protein OS=Jatropha curcas GN=WRKY34 PE=4 SV=1 | [more] |
E7CEW8_CUCSA | 3.2e-93 | 63.05 | WRKY protein OS=Cucumis sativus GN=WRKY19 PE=2 SV=1 | [more] |
A0A061DXG8_THECC | 5.4e-93 | 49.43 | WRKY DNA-binding protein 9, putative isoform 1 OS=Theobroma cacao GN=TCM_006426 ... | [more] |
A0A061DZ88_THECC | 7.4e-90 | 48.85 | WRKY DNA-binding protein 9, putative isoform 2 OS=Theobroma cacao GN=TCM_006426 ... | [more] |