Cla021203.1 (mRNA) Watermelon (97103) v1

Name: Cla021203
Type: mRNA
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: WRKY transcription factor (Fragment) (AHRD V1 *-** F1DK13_MAIZE); contains InterPro domain(s) IPR003657 DNA-binding WRKY
Location: Chr5 : 1196965 .. 1198094 (-)
Sequence length: 372
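
The Location field can be read programmatically. The following is a minimal sketch, not part of the database itself: the function name parse_location is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Matches strings shaped like "Chr5 : 1196965 .. 1198094 (-)".
LOCATION_RE = re.compile(r"\s*([^\s:]+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Return (chromosome, start, end, strand, span) for a Location string.

    Assumption: coordinates are 1-based and inclusive, so
    span = end - start + 1.
    """
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return chrom, start, end, strand, end - start + 1


chrom, start, end, strand, span = parse_location("Chr5 : 1196965 .. 1198094 (-)")
print(chrom, strand, span)
```

Under that assumption the feature spans 1130 bases on the minus strand of Chr5, while the listed sequence length is 372; the difference would be intronic if the feature carries no untranslated regions.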
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGGAATATGGCAAACTACAATACCCACCTCCGCCGCCGTCGCCGCCTCCGCCATGTTCAGATCCTGATGCAACCCCTCTGGAAATCGACTGGATCGCAGTTCTCTCTGACCAAGAAGTCACTGGAGACTTAGCGGCGGCAGCAGCAACTTGCGAGTCATCGGAAACGAGAAGGGACGAAGAGAAGGGGGAGCGGAGGAAGAAAGGCGGTCGCCGGTGGAGAAGGACCGCCGGCTGCCGGAGGTTTGAGTTTCAGACGAGGAGTGCTGAAGATATTCTTGATGATGGATATCGGTGGCGCAAGTACGGACAAAAAGCTGTCAAACACAGCCTCCATCCGAGGTATAAATATATATTTATTATTTATATATTGAGTTTTTTTCTTTTTTAAATGTTAAAAATATTAATATTTTGAGCTTTCTTTTAAAAAAGGCACAAGGGTGATTTCCATCATACTTTCATTATATTTTGAGCTTTCTTTTAAAAGAAACGAATATTTATATTTATACTTTCATTAGTTAGGATGAATAAATTTTAGTTTTTTACTTTAAAATTTGTAATAAGTCATTTAAATTAATATCTAATAATTAAATAGTCCTTACAAATATAATTTATTATTATTTATTATTATTATTTTTTTTTTTTGGTTATTGTGTGGAATGGGTGGTGAAGGAGCTACTATAGGTGTACATACCTCACATGCAATGTGAAGAAACAAGTTCAAAGGCTATCAAAAGACACAAGTATTGTTGTGACTACTTATGAGGGAACTCACAACCATCCTTCTCACTTCCTCATGCAAACCCTAACTCCTCTTCTCAAACAAATTCAAACTACTTTTCCACTTTCTAAATTTTTTATGTATTAGAAACATATATGCTTGGTTTGTTCATTTATGATTTCTAAAAGTCATAATATATAGTACATATACACTATCGATACTAATTAAGAGTTAAAAGAGGTATAAGGTTAATTTATGTAGCTCATAAATTATCGATTTCATAAACTTGATCAGATCAAATAAATTTTGGCATACCTATGCCTCCATTTAATAAATATTTATAGATTAATATATGTGTTTTGTTTACCAACTTATTCAGTATTTTCAACAATCAAGCCAAGTTCTAA

mRNA sequence

ATGGAGGAATATGGCAAACTACAATACCCACCTCCGCCGCCGTCGCCGCCTCCGCCATGTTCAGATCCTGATGCAACCCCTCTGGAAATCGACTGGATCGCAGTTCTCTCTGACCAAGAAGTCACTGGAGACTTAGCGGCGGCAGCAGCAACTTGCGAGTCATCGGAAACGAGAAGGGACGAAGAGAAGGGGGAGCGGAGGAAGAAAGGCGGTCGCCGGTGGAGAAGGACCGCCGGCTGCCGGAGGTTTGAGTTTCAGACGAGGAGTGCTGAAGATATTCTTGATGATGGATATCGGTGGCGCAAGTACGGACAAAAAGCTGTCAAACACAGCCTCCATCCGAGTATTTTCAACAATCAAGCCAAGTTCTAA

Coding sequence (CDS)

ATGGAGGAATATGGCAAACTACAATACCCACCTCCGCCGCCGTCGCCGCCTCCGCCATGTTCAGATCCTGATGCAACCCCTCTGGAAATCGACTGGATCGCAGTTCTCTCTGACCAAGAAGTCACTGGAGACTTAGCGGCGGCAGCAGCAACTTGCGAGTCATCGGAAACGAGAAGGGACGAAGAGAAGGGGGAGCGGAGGAAGAAAGGCGGTCGCCGGTGGAGAAGGACCGCCGGCTGCCGGAGGTTTGAGTTTCAGACGAGGAGTGCTGAAGATATTCTTGATGATGGATATCGGTGGCGCAAGTACGGACAAAAAGCTGTCAAACACAGCCTCCATCCGAGTATTTTCAACAATCAAGCCAAGTTCTAA

Protein sequence

MEEYGKLQYPPPPPSPPPPCSDPDATPLEIDWIAVLSDQEVTGDLAAAAATCESSETRRDEEKGERRKKGGRRWRRTAGCRRFEFQTRSAEDILDDGYRWRKYGQKAVKHSLHPSIFNNQAKF
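
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch under stated assumptions: it uses the standard genetic code (NCBI translation table 1), reads from the first base in frame 1, and stops at the first stop codon; the names CODON_TABLE and translate are illustrative, not taken from the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS listed above.
print(translate("ATGGAGGAATATGGCAAACTACAATACCCA"))  # MEEYGKLQYP
```

A 372-base CDS corresponds to 124 codons, that is, 123 amino acids plus a stop codon, which agrees with the length of the protein sequence shown.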
BLAST of Cla021203 vs. Swiss-Prot
Match: WRK56_ARATH (Probable WRKY transcription factor 56 OS=Arabidopsis thaliana GN=WRKY56 PE=2 SV=1)

HSP 1 Score: 63.2 bits (152), Expect = 2.4e-09
Identity = 29/61 (47.54%), Positives = 37/61 (60.66%), Query Frame = 1

Query: 54  SSETRRDEEKGERRKKGGRRWRRTAGCRRFEFQTRSAEDILDDGYRWRKYGQKAVKHSLH 113
           ++    D  K   + KG    +RT   +R  F TRS +D+LDDGYRWRKYGQK+VK++ H
Sbjct: 77  NNSNNSDHNKNCNKGKG----KRTLAMQRIAFHTRSDDDVLDDGYRWRKYGQKSVKNNAH 133

Query: 114 P 115
           P
Sbjct: 137 P 133

BLAST of Cla021203 vs. Swiss-Prot
Match: WRK24_ARATH (Probable WRKY transcription factor 24 OS=Arabidopsis thaliana GN=WRKY24 PE=2 SV=1)

HSP 1 Score: 62.8 bits (151), Expect = 3.1e-09
Identity = 29/53 (54.72%), Positives = 35/53 (66.04%), Query Frame = 1

Query: 62  EKGERRKKGGRRWRRTAGCRRFEFQTRSAEDILDDGYRWRKYGQKAVKHSLHP 115
           EKG+  K+     +R+    R  F TRS +D+LDDGYRWRKYGQK+VKH+ HP
Sbjct: 70  EKGKELKE-----KRSRKVPRIAFHTRSDDDVLDDGYRWRKYGQKSVKHNAHP 117

BLAST of Cla021203 vs. Swiss-Prot
Match: WRK75_ARATH (Probable WRKY transcription factor 75 OS=Arabidopsis thaliana GN=WRKY75 PE=2 SV=1)

HSP 1 Score: 60.5 bits (145), Expect = 1.5e-08
Identity = 33/64 (51.56%), Positives = 40/64 (62.50%), Query Frame = 1

Query: 53  ESSETRRD--EEKGERRKKGGRRWRRTAGCRRFEFQTRSAEDILDDGYRWRKYGQKAVKH 112
           ESS+ R +   +  E  KK G++       +R+ FQTRS  DILDDGYRWRKYGQKAVK+
Sbjct: 30  ESSKVRSEGCSKSVESSKKKGKK-------QRYAFQTRSQVDILDDGYRWRKYGQKAVKN 86

Query: 113 SLHP 115
           +  P
Sbjct: 90  NKFP 86

BLAST of Cla021203 vs. Swiss-Prot
Match: WRK12_ARATH (Probable WRKY transcription factor 12 OS=Arabidopsis thaliana GN=WRKY12 PE=2 SV=1)

HSP 1 Score: 59.3 bits (142), Expect = 3.4e-08
Identity = 25/33 (75.76%), Positives = 27/33 (81.82%), Query Frame = 1

Query: 82  RFEFQTRSAEDILDDGYRWRKYGQKAVKHSLHP 115
           RF FQT+S  D+LDDGY+WRKYGQK VK+SLHP
Sbjct: 132 RFCFQTKSDVDVLDDGYKWRKYGQKVVKNSLHP 164
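
The Identity figure reported for an HSP can be reproduced from the aligned strings. The sketch below is illustrative only (the helper names alignment_identity and percent are hypothetical): it counts columns where query and subject carry the same residue and divides by the number of alignment columns, gaps included. The Positives figure is not reproduced here because it depends on the substitution matrix used for the search, which this page does not name.

```python
def alignment_identity(query, subject):
    """Return (identical_columns, alignment_length) for two aligned strings.

    Both strings must have the same length; "-" marks a gap column.
    """
    if len(query) != len(subject):
        raise ValueError("aligned strings must have equal length")
    identical = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    return identical, len(query)


def percent(numerator, denominator):
    """Percentage rounded to two decimals, as in the BLAST report lines."""
    return round(100 * numerator / denominator, 2)


# Aligned segments of the WRK12_ARATH HSP shown above.
query = "RFEFQTRSAEDILDDGYRWRKYGQKAVKHSLHP"
subject = "RFCFQTKSDVDVLDDGYKWRKYGQKVVKNSLHP"
identical, length = alignment_identity(query, subject)
print(f"Identity = {identical}/{length} ({percent(identical, length)}%)")
```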

BLAST of Cla021203 vs. Swiss-Prot
Match: WRK43_ARATH (Probable WRKY transcription factor 43 OS=Arabidopsis thaliana GN=WRKY43 PE=1 SV=1)

HSP 1 Score: 59.3 bits (142), Expect = 3.4e-08
Identity = 25/33 (75.76%), Positives = 28/33 (84.85%), Query Frame = 1

Query: 82  RFEFQTRSAEDILDDGYRWRKYGQKAVKHSLHP 115
           RF F+T+S  DILDDGYRWRKYGQK+VK+SL+P
Sbjct: 17  RFSFRTKSDADILDDGYRWRKYGQKSVKNSLYP 49

BLAST of Cla021203 vs. TrEMBL
Match: A0A0A0L635_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G121060 PE=4 SV=1)

HSP 1 Score: 167.5 bits (423), Expect = 9.9e-39
Identity = 82/113 (72.57%), Positives = 89/113 (78.76%), Query Frame = 1

Query: 2   EEYGKLQYPPPPPSPPPPCSDPDATPLEIDWIAVLSDQEVTGDLAAAAATCESSETRRDE 61
           EE  +LQ+PPP   P  PCSDP AT LEIDWIAVLS QE T DL   ++TCES E RRDE
Sbjct: 3   EENHQLQHPPPLLFPSLPCSDPLATSLEIDWIAVLSGQEATRDLPPTSSTCESLERRRDE 62

Query: 62  EKGERRKKGGRRWRRTAGCRRFEFQTRSAEDILDDGYRWRKYGQKAVKHSLHP 115
           EK  +RKKGGR+ R+  G RRFEFQTRS EDILDDGYRWRKYGQKAVKHSLHP
Sbjct: 63  EKSNQRKKGGRQRRKAVGRRRFEFQTRSTEDILDDGYRWRKYGQKAVKHSLHP 115

BLAST of Cla021203 vs. TrEMBL
Match: B9RDY7_RICCO (WRKY transcription factor, putative OS=Ricinus communis GN=RCOM_1616720 PE=4 SV=1)

HSP 1 Score: 93.6 bits (231), Expect = 1.8e-16
Identity = 56/115 (48.70%), Positives = 67/115 (58.26%), Query Frame = 1

Query: 9   YPPPPPSPP-----PPCSDPDATPLEIDWIAVLSDQEVTGDLAAAAATCES--SETRRDE 68
           + PP P  P     PP  DP     +IDW+A+LS Q V G+         S   ET  +E
Sbjct: 25  FTPPHPLLPSSSLHPPVIDPQVVLPDIDWVALLSSQSVVGENRPMMMENASLIGETGAEE 84

Query: 69  EKGERRK--KGGRRWRRTAGCRRFEFQTRSAEDILDDGYRWRKYGQKAVKHSLHP 115
           EKG + K  K GR  +      RF FQTRSA+DILDDGYRWRKYGQKAVK+S +P
Sbjct: 85  EKGNKDKLRKSGRIKKHITP--RFAFQTRSADDILDDGYRWRKYGQKAVKNSSYP 137

BLAST of Cla021203 vs. TrEMBL
Match: A0A061DST4_THECC (WRKY DNA-binding protein 56 OS=Theobroma cacao GN=TCM_005256 PE=4 SV=1)

HSP 1 Score: 85.9 bits (211), Expect = 3.8e-14
Identity = 55/139 (39.57%), Positives = 76/139 (54.68%), Query Frame = 1

Query: 2   EEYGKLQYPPPPP--------------SPPPPCSDPDAT---PLE------IDWIAVLSD 61
           +E  +   PPPP               +P  P S   ++   PLE      IDW+++LS 
Sbjct: 23  QESNQAPVPPPPQPLLSVPFNSPFLFAAPSLPSSSSSSSLHPPLETQILPDIDWVSLLSG 82

Query: 62  QEVTGD---LAAAAATCESSETRRDEEKGERRKKGGRRWRRTAGCRRFEFQTRSAEDILD 115
           Q V G+   +  +A +  +      +EKG + K+ G R ++ A   RF FQTRSA+DILD
Sbjct: 83  QGVLGENKPMIESAVSLMAENGADQDEKGNKDKRKGSRIKK-ASRPRFAFQTRSADDILD 142

BLAST of Cla021203 vs. NCBI nr
Match: gi|659075133|ref|XP_008437983.1| (PREDICTED: probable WRKY transcription factor 75 [Cucumis melo])

HSP 1 Score: 176.8 bits (447), Expect = 2.3e-41
Identity = 86/113 (76.11%), Positives = 92/113 (81.42%), Query Frame = 1

Query: 2   EEYGKLQYPPPPPSPPPPCSDPDATPLEIDWIAVLSDQEVTGDLAAAAATCESSETRRDE 61
           EE+ +LQ P PPP   PPCSDP AT LEIDWIAVL  QE  GDL  A++TCESSE RRDE
Sbjct: 3   EEHHQLQQPSPPP---PPCSDPLATSLEIDWIAVLYGQEAIGDLPPASSTCESSERRRDE 62

Query: 62  EKGERRKKGGRRWRRTAGCRRFEFQTRSAEDILDDGYRWRKYGQKAVKHSLHP 115
           EK  RRK GGRRWR+ AG RRFEFQTRS EDILDDGYRWRKYGQKAVKHSL+P
Sbjct: 63  EKTNRRKNGGRRWRKAAGRRRFEFQTRSTEDILDDGYRWRKYGQKAVKHSLYP 112

BLAST of Cla021203 vs. NCBI nr
Match: gi|449433065|ref|XP_004134318.1| (PREDICTED: probable WRKY transcription factor 43 [Cucumis sativus])

HSP 1 Score: 167.5 bits (423), Expect = 1.4e-38
Identity = 82/113 (72.57%), Positives = 89/113 (78.76%), Query Frame = 1

Query: 2   EEYGKLQYPPPPPSPPPPCSDPDATPLEIDWIAVLSDQEVTGDLAAAAATCESSETRRDE 61
           EE  +LQ+PPP   P  PCSDP AT LEIDWIAVLS QE T DL   ++TCES E RRDE
Sbjct: 3   EENHQLQHPPPLLFPSLPCSDPLATSLEIDWIAVLSGQEATRDLPPTSSTCESLERRRDE 62

Query: 62  EKGERRKKGGRRWRRTAGCRRFEFQTRSAEDILDDGYRWRKYGQKAVKHSLHP 115
           EK  +RKKGGR+ R+  G RRFEFQTRS EDILDDGYRWRKYGQKAVKHSLHP
Sbjct: 63  EKSNQRKKGGRQRRKAVGRRRFEFQTRSTEDILDDGYRWRKYGQKAVKHSLHP 115

BLAST of Cla021203 vs. NCBI nr
Match: gi|1003003046|gb|AMO00393.1| (WRKY transcription factor 25 [Manihot esculenta])

HSP 1 Score: 97.1 bits (240), Expect = 2.4e-17
Identity = 58/126 (46.03%), Positives = 74/126 (58.73%), Query Frame = 1

Query: 10  PPPPPSPP--------------------PPCSDPDATPLEIDWIAVLSDQEVTGDLAAAA 69
           PPPPP PP                    PP  +P   P +IDW+++LS Q   G+    A
Sbjct: 7   PPPPPPPPLPQLPPSLFSPSLLASSSLQPPFIEPQVLP-DIDWVSLLSCQYGEGE--NKA 66

Query: 70  ATCESSETRRDEEKGER-RKKGGRRWRRTAGCRRFEFQTRSAEDILDDGYRWRKYGQKAV 115
            T ES+    +EEKG + R+K GR  ++ +   RF FQTRSA+DILDDGYRWRKYGQKAV
Sbjct: 67  KTLESASVISEEEKGNKDRRKCGRMMKKHSR-PRFAFQTRSADDILDDGYRWRKYGQKAV 126

BLAST of Cla021203 vs. NCBI nr
Match: gi|223549106|gb|EEF50595.1| (WRKY transcription factor, putative [Ricinus communis])

HSP 1 Score: 93.6 bits (231), Expect = 2.6e-16
Identity = 56/115 (48.70%), Positives = 67/115 (58.26%), Query Frame = 1

Query: 9   YPPPPPSPP-----PPCSDPDATPLEIDWIAVLSDQEVTGDLAAAAATCES--SETRRDE 68
           + PP P  P     PP  DP     +IDW+A+LS Q V G+         S   ET  +E
Sbjct: 25  FTPPHPLLPSSSLHPPVIDPQVVLPDIDWVALLSSQSVVGENRPMMMENASLIGETGAEE 84

Query: 69  EKGERRK--KGGRRWRRTAGCRRFEFQTRSAEDILDDGYRWRKYGQKAVKHSLHP 115
           EKG + K  K GR  +      RF FQTRSA+DILDDGYRWRKYGQKAVK+S +P
Sbjct: 85  EKGNKDKLRKSGRIKKHITP--RFAFQTRSADDILDDGYRWRKYGQKAVKNSSYP 137

BLAST of Cla021203 vs. NCBI nr
Match: gi|1000981950|ref|XP_015584103.1| (PREDICTED: probable WRKY transcription factor 43 [Ricinus communis])

HSP 1 Score: 93.6 bits (231), Expect = 2.6e-16
Identity = 56/115 (48.70%), Positives = 67/115 (58.26%), Query Frame = 1

Query: 9   YPPPPPSPP-----PPCSDPDATPLEIDWIAVLSDQEVTGDLAAAAATCES--SETRRDE 68
           + PP P  P     PP  DP     +IDW+A+LS Q V G+         S   ET  +E
Sbjct: 25  FTPPHPLLPSSSLHPPVIDPQVVLPDIDWVALLSSQSVVGENRPMMMENASLIGETGAEE 84

Query: 69  EKGERRK--KGGRRWRRTAGCRRFEFQTRSAEDILDDGYRWRKYGQKAVKHSLHP 115
           EKG + K  K GR  +      RF FQTRSA+DILDDGYRWRKYGQKAVK+S +P
Sbjct: 85  EKGNKDKLRKSGRIKKHITP--RFAFQTRSADDILDDGYRWRKYGQKAVKNSSYP 137

The following BLAST results are available for this feature:

Database: Swiss-Prot
Match Name   E-value  Identity (%)  Description
WRK56_ARATH  2.4e-09  47.54         Probable WRKY transcription factor 56 OS=Arabidopsis thaliana GN=WRKY56 PE=2 SV=...
WRK24_ARATH  3.1e-09  54.72         Probable WRKY transcription factor 24 OS=Arabidopsis thaliana GN=WRKY24 PE=2 SV=...
WRK75_ARATH  1.5e-08  51.56         Probable WRKY transcription factor 75 OS=Arabidopsis thaliana GN=WRKY75 PE=2 SV=...
WRK12_ARATH  3.4e-08  75.76         Probable WRKY transcription factor 12 OS=Arabidopsis thaliana GN=WRKY12 PE=2 SV=...
WRK43_ARATH  3.4e-08  75.76         Probable WRKY transcription factor 43 OS=Arabidopsis thaliana GN=WRKY43 PE=1 SV=...

Database: TrEMBL
Match Name        E-value  Identity (%)  Description
A0A0A0L635_CUCSA  9.9e-39  72.57         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G121060 PE=4 SV=1
B9RDY7_RICCO      1.8e-16  48.70         WRKY transcription factor, putative OS=Ricinus communis GN=RCOM_1616720 PE=4 SV=...
A0A061DST4_THECC  3.8e-14  39.57         WRKY DNA-binding protein 56 OS=Theobroma cacao GN=TCM_005256 PE=4 SV=1

Database: NCBI nr
Match Name                         E-value  Identity (%)  Description
gi|659075133|ref|XP_008437983.1|   2.3e-41  76.11         PREDICTED: probable WRKY transcription factor 75 [Cucumis melo]
gi|449433065|ref|XP_004134318.1|   1.4e-38  72.57         PREDICTED: probable WRKY transcription factor 43 [Cucumis sativus]
gi|1003003046|gb|AMO00393.1|       2.4e-17  46.03         WRKY transcription factor 25 [Manihot esculenta]
gi|223549106|gb|EEF50595.1|        2.6e-16  48.70         WRKY transcription factor, putative [Ricinus communis]
gi|1000981950|ref|XP_015584103.1|  2.6e-16  48.70         PREDICTED: probable WRKY transcription factor 43 [Ricinus communis]

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term       Definition
IPR003657  WRKY_dom
Vocabulary: Molecular Function
Term        Definition
GO:0003700  transcription factor activity, sequence-specific DNA binding
GO:0043565  sequence-specific DNA binding
Vocabulary: Biological Process
Term        Definition
GO:0006355  regulation of transcription, DNA-templated
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006355      regulation of transcription, DNA-templated
cellular_component  GO:0005667      transcription factor complex
molecular_function  GO:0043565      sequence-specific DNA binding
molecular_function  GO:0003700      transcription factor activity, sequence-specific DNA binding

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name  Type
Cla021203     Cla021203    gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name  Unique Name          Type
Cla021203     Cla021203.1-protein  polypeptide


The following CDS feature(s) are a part of this mRNA:

Feature Name      Unique Name       Type
Cla021203.1.cds2  Cla021203.1.cds2  CDS
Cla021203.1.cds1  Cla021203.1.cds1  CDS


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term   IPR Description   Source   Source Term       Source Description                    Alignment
IPR003657  WRKY domain       GENE3D   G3DSA:2.20.25.80                                        coord: 82..114; score: 1.1
IPR003657  WRKY domain       PFAM     PF03106           WRKY                                  coord: 95..114; score: 1.
IPR003657  WRKY domain       SMART    SM00774           WRKY_cls                              coord: 94..122; score: 0.
IPR003657  WRKY domain       PROFILE  PS50811           WRKY                                  coord: 89..114; score: 11
IPR003657  WRKY domain       unknown  SSF118290         WRKY DNA-binding domain               coord: 87..116; score: 1.4
None       No IPR available  PANTHER  PTHR31221         FAMILY NOT NAMED                      coord: 6..114; score: 4.5
None       No IPR available  PANTHER  PTHR31221:SF7     WRKY TRANSCRIPTION FACTOR 24-RELATED  coord: 6..114; score: 4.5