Cla002923 (gene) Watermelon (97103) v1

Name: Cla002923
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Heat stress transcription factor (AHRD V1 ***- D4QAU8_CARPA); contains InterPro domain(s) IPR000232 Heat shock factor (HSF)-type, DNA-binding
Location: Chr10 : 21752846 .. 21755033 (+)
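
The location string above can be split into chromosome, start, end and strand, and the genomic span derived from it. The following is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual GFF-style convention), which the record itself does not state.

```python
import re

# Pattern for strings such as "Chr10 : 21752846 .. 21755033 (+)".
LOCATION_RE = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Return (chromosome, start, end, strand) from a location string."""
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("Chr10 : 21752846 .. 21755033 (+)")
print(chrom, strand, span_length(start, end))  # Chr10 + 2188
```

Under that assumption the gene spans 2188 bp on the forward strand, which covers both exons and the intron of the gene sequence listed below.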
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAGAGGCAATAGTAGGCTTAGGTAATGTGTGTGAGAACTACGTCGGCCCCGCCCGAAAGCCGACTCCGCCGCCGTTCTTGGCCAAAACCTACATGCTTGTTGAGGATCCGACGACAGACGACATCATATCGTGGAACTCTGATGGAACAGCGTTTGTCGTGTGGCGGCCGCAGGAGTTTGCAACAGATCTCCTCCCCACCCTCTTCAAACATAACAACTTCTCTAGCTTTGTTCGCCAACTTAACACATATGTAAGTTGATTTTTGTGTTTTTTTTTTTTTTTTTTTTTCACTTTTTCCATTCTCTCAAACTCTCTTTTTCATTTACTTTCATATGTATCCTATTTACAAATTTTTCAATTGGTTTTCTATTTTGGTTTTGATTTCAGTATGTATCATAGTTTAGAACTCGTTTAATGATGATTTGTTTTAATCTCTTTGCAAAATTATATGCTGGTTTATTTTTCTTAATTTTTTTTTTTTAATCTAGATTATTATTGAAATTATAGAAAATATGAAAAAGTCCTAAGAAAAATCACTCAAAAAATTTTTGCAAGTGAGAAAAGAATATAGGTACAGGCATATAAGGAAAAAAAATGTTCAATTTTAACTCATTAAAATATTTTCTAATGATCAAAAAGTGCTTTGAAAATCTCTATCCTTCTAATGGTAATTGAATATAGAGGAGAAGTCTAAGATATATTTAAAAAGCTTTATTTATAAAATAATAAACAATGATATGTGATTTTAGTGCTATTGACCAACAACTTTTTGTTTTTCTTTTTAGATCTTTATTGGCCAAAATAGTTCAATGTTGATCTCTCATGACTTTTTGTTTCTAATTAATTAAATAAATCTCTCCACACACTTCTTATCACCCACTTTTCTCTCCACATAATTTTCTCATTAACTTCTCTCTACCCACATGCTTTTCACTAATTTCTAAATACACACTTTCATTACTTTTTTTTCTAGTGTGGACTTTTTCTTTCTAGTGTGGATGATTTAATAAAAAGATAACTTTCAAATTAATCCAGATTTTTTAAAATATTTTTTTATTAATTATTTTTAAAATATATATATATTTTTTACAGTATCAATCATTTAATAAACTATATTAAACACAGGTTAAGTTATTTCTCACAAGTTACTCCAAACACACTACTATTATTATCTTATGTGTTATTGACCTCTCTTTTTTTTTTTTCTTTTTTTTTTTTTAATTCAAGTCAACACATTTGTAAAATAATTCCATGTACGCTAGGGAATTTGAATTTTGTTTAATATTTTAAACTCACTAAGTAAATATGCTTTTTACAGACAATATGAAATGCACAGTTTTCAAAATTTGTGATTATCAACATTATTTTAAAAAGAAATAAAATCACATGATTTCAAACAACATTCCTCAAGTTAAAAGTAAATTTTTGAGTTTAGTGTTAAATCTCACCTGGTATGAAAACGAATAAACACTTGTTGTGCCCAGTGCAACAAGCGGTCCAACGATTAATTTTTCAAACTCATAAATGTAGGGAGCATTAAAACATTTTGATATAAAATTAAATTTGTCATCACTTATCAAGTTAAGTTTTTAAATTTTTGAATTCTAGTAGTGATTAAGCACGAAATATAGAAAATCTTAATGCCTATATACTTGTGTACAATGAAACTTTGATAGGGATTTCGCAAGATTGCAACAAGCAGATGGGAGTTTTGCAATGACAAGTTTCAAAAGGGATGTAAAGAAAGATTATGCGAAATACGAAGGAGAAAAGCATGGAGCAACAAACGACAACATAATAATAATGCAAAAGTAATTCAAGCCACACATCAAGTTGATCGCGATGAAGATCAACGGTCGTCGTCAACTACATCGTCATTTGATGATCAATACATGATGCTTGTTGACGAAAACAAGAGGTTGAAGAAGGAGAATGGAGTGTTGAGCTTTGAGCTAACAAACATGAAGAAGAAATGCAGGGAGCTTCTTGATTTGG
TGGCTAAGTACGAATTCGCTGTTATTACTGGCGACGAAAAGAAGGAAGATGAGATGAAGCCAAACTTGAAACTGTTTGGAGTCAAGTTGGAGATTGAGGAAGAAGATGAAATGGTGATCAGGCAAAACAAGAGGAAGAGATCAATTTATCAAAATAAACATTTTCTACTCTTACAATCAAGCAAATGA

mRNA sequence

ATGGAAGAGGCAATAGTAGGCTTAGGTAATGTGTGTGAGAACTACGTCGGCCCCGCCCGAAAGCCGACTCCGCCGCCGTTCTTGGCCAAAACCTACATGCTTGTTGAGGATCCGACGACAGACGACATCATATCGTGGAACTCTGATGGAACAGCGTTTGTCGTGTGGCGGCCGCAGGAGTTTGCAACAGATCTCCTCCCCACCCTCTTCAAACATAACAACTTCTCTAGCTTTGTTCGCCAACTTAACACATATGGATTTCGCAAGATTGCAACAAGCAGATGGGAGTTTTGCAATGACAAGTTTCAAAAGGGATGTAAAGAAAGATTATGCGAAATACGAAGGAGAAAAGCATGGAGCAACAAACGACAACATAATAATAATGCAAAAGTAATTCAAGCCACACATCAAGTTGATCGCGATGAAGATCAACGGTCGTCGTCAACTACATCGTCATTTGATGATCAATACATGATGCTTGTTGACGAAAACAAGAGGTTGAAGAAGGAGAATGGAGTGTTGAGCTTTGAGCTAACAAACATGAAGAAGAAATGCAGGGAGCTTCTTGATTTGGTGGCTAAGTACGAATTCGCTGTTATTACTGGCGACGAAAAGAAGGAAGATGAGATGAAGCCAAACTTGAAACTGTTTGGAGTCAAGTTGGAGATTGAGGAAGAAGATGAAATGGTGATCAGGCAAAACAAGAGGAAGAGATCAATTTATCAAAATAAACATTTTCTACTCTTACAATCAAGCAAATGA

Coding sequence (CDS)

ATGGAAGAGGCAATAGTAGGCTTAGGTAATGTGTGTGAGAACTACGTCGGCCCCGCCCGAAAGCCGACTCCGCCGCCGTTCTTGGCCAAAACCTACATGCTTGTTGAGGATCCGACGACAGACGACATCATATCGTGGAACTCTGATGGAACAGCGTTTGTCGTGTGGCGGCCGCAGGAGTTTGCAACAGATCTCCTCCCCACCCTCTTCAAACATAACAACTTCTCTAGCTTTGTTCGCCAACTTAACACATATGGATTTCGCAAGATTGCAACAAGCAGATGGGAGTTTTGCAATGACAAGTTTCAAAAGGGATGTAAAGAAAGATTATGCGAAATACGAAGGAGAAAAGCATGGAGCAACAAACGACAACATAATAATAATGCAAAAGTAATTCAAGCCACACATCAAGTTGATCGCGATGAAGATCAACGGTCGTCGTCAACTACATCGTCATTTGATGATCAATACATGATGCTTGTTGACGAAAACAAGAGGTTGAAGAAGGAGAATGGAGTGTTGAGCTTTGAGCTAACAAACATGAAGAAGAAATGCAGGGAGCTTCTTGATTTGGTGGCTAAGTACGAATTCGCTGTTATTACTGGCGACGAAAAGAAGGAAGATGAGATGAAGCCAAACTTGAAACTGTTTGGAGTCAAGTTGGAGATTGAGGAAGAAGATGAAATGGTGATCAGGCAAAACAAGAGGAAGAGATCAATTTATCAAAATAAACATTTTCTACTCTTACAATCAAGCAAATGA

Protein sequence

MEEAIVGLGNVCENYVGPARKPTPPPFLAKTYMLVEDPTTDDIISWNSDGTAFVVWRPQEFATDLLPTLFKHNNFSSFVRQLNTYGFRKIATSRWEFCNDKFQKGCKERLCEIRRRKAWSNKRQHNNNAKVIQATHQVDRDEDQRSSSTTSSFDDQYMMLVDENKRLKKENGVLSFELTNMKKKCRELLDLVAKYEFAVITGDEKKEDEMKPNLKLFGVKLEIEEEDEMVIRQNKRKRSIYQNKHFLLLQSSK
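
The relationship between the coding sequence and the protein sequence above can be checked by translating codons with the standard genetic code. This is a small illustrative sketch: the function name `translate` is hypothetical, only the first 33 and last 15 nucleotides of the CDS are used (copied from the record), and the standard nuclear code is assumed.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGAAGAGGCAATAGTAGGCTTAGGTAATGTG"  # first 11 codons of the CDS
cds_end = "CAATCAAGCAAATGA"                      # last 5 codons, ending in TGA

print(translate(cds_start))  # MEEAIVGLGNV
print(translate(cds_end))    # QSSK
```

Both fragments agree with the start (MEEAIVGLGNV...) and end (...QSSK) of the listed protein sequence.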
BLAST of Cla002923 vs. Swiss-Prot
Match: HSFB3_ARATH (Heat stress transcription factor B-3 OS=Arabidopsis thaliana GN=HSFB3 PE=2 SV=1)

HSP 1 Score: 221.5 bits (563), Expect = 1.1e-56
Identity = 119/222 (53.60%), Positives = 147/222 (66.22%), Query Frame = 1

Query: 14  NYVGPARKPTPPPFLAKTYMLVEDPTTDDIISWNSDGTAFVVWRPQEFATDLLPTLFKHN 73
           N    A    PPPFL KTY +VEDPTTD +ISWN  GT FVVW+P EFA DLLPTLFKH 
Sbjct: 28  NSTSTAELQPPPPFLVKTYKVVEDPTTDGVISWNEYGTGFVVWQPAEFARDLLPTLFKHC 87

Query: 74  NFSSFVRQLNTYGFRKIATSRWEFCNDKFQKGCKERLCEIRRRKA--WSNKRQHNNNAKV 133
           NFSSFVRQLNTYGFRK+ T RWEF N+ F+KG +E +  IRRRK+  WS+ +   +N +V
Sbjct: 88  NFSSFVRQLNTYGFRKVTTIRWEFSNEMFRKGQRELMSNIRRRKSQHWSHNK---SNHQV 147

Query: 134 IQATHQVDRD-----------EDQRSSSTTSSFDDQYMMLVDENKRLKKENGVLSFELTN 193
           +  T  V+++           EDQ+SS+T+SSF   Y  L+DENK LK EN +LS EL  
Sbjct: 148 VPTTTMVNQEGHQRIGIDHHHEDQQSSATSSSF--VYTALLDENKCLKNENELLSCELGK 207

Query: 194 MKKKCRELLDLVAKYEFAVITGDEKKEDEMKPNLKLFGVKLE 223
            KKKC++L++LV +Y        ++ +DE    LKLFGVKLE
Sbjct: 208 TKKKCKQLMELVERYRGEDEDATDESDDEEDEGLKLFGVKLE 244

BLAST of Cla002923 vs. Swiss-Prot
Match: HSF24_SOLPE (Heat shock factor protein HSF24 OS=Solanum peruvianum GN=HSF24 PE=2 SV=1)

HSP 1 Score: 175.6 bits (444), Expect = 6.7e-43
Identity = 93/190 (48.95%), Positives = 116/190 (61.05%), Query Frame = 1

Query: 19  ARKPTPPPFLAKTYMLVEDPTTDDIISWNSDGTAFVVWRPQEFATDLLPTLFKHNNFSSF 78
           +++  P PFL KTY LV+D  TDD+ISWN  GT FVVW+  EFA DLLP  FKHNNFSSF
Sbjct: 2   SQRTAPAPFLLKTYQLVDDAATDDVISWNEIGTTFVVWKTAEFAKDLLPKYFKHNNFSSF 61

Query: 79  VRQLNTYGFRKIATSRWEFCNDKFQKGCKERLCEIRRRKAWSNKRQHNNNAKVIQATHQV 138
           VRQLNTYGFRKI   +WEF N+ F++G KE L  IRRRK  ++       +    A+   
Sbjct: 62  VRQLNTYGFRKIVPDKWEFANENFKRGQKELLTAIRRRKTVTS-TPAGGKSVAAGASASP 121

Query: 139 DRDEDQRSSSTTSSFD-------------DQYMMLVDENKRLKKENGVLSFELTNMKKKC 196
           D   D   SS+TSS D              Q+  L DEN++LKK+N +LS EL   KK+C
Sbjct: 122 DNSGDDIGSSSTSSPDSKNPGSVDTPGKLSQFTDLSDENEKLKKDNQMLSSELVQAKKQC 181

BLAST of Cla002923 vs. Swiss-Prot
Match: HFB2B_ORYSJ (Heat stress transcription factor B-2b OS=Oryza sativa subsp. japonica GN=HSFB2B PE=2 SV=1)

HSP 1 Score: 170.6 bits (431), Expect = 2.2e-41
Identity = 94/207 (45.41%), Positives = 114/207 (55.07%), Query Frame = 1

Query: 16  VGPARKPTPPPFLAKTYMLVEDPTTDDIISWNSDGTAFVVWRPQEFATDLLPTLFKHNNF 75
           VG  ++  P PFL KTY LV+DP  DD+ISWN DG+ FVVWRP EFA DLLP  FKHNNF
Sbjct: 38  VGQQQRTVPTPFLTKTYQLVDDPAVDDVISWNDDGSTFVVWRPAEFARDLLPKYFKHNNF 97

Query: 76  SSFVRQLNTYGFRKIATSRWEFCNDKFQKGCKERLCEIRRRKAWSNKRQHNNNAKVIQAT 135
           SSFVRQLNTYGFRKI   RWEF ND F++G +  LCEI RRK  +        A V  A 
Sbjct: 98  SSFVRQLNTYGFRKIVPDRWEFANDCFRRGERRLLCEIHRRKV-TPPAPAATTAAVAAAI 157

Query: 136 HQVDRDEDQRSSSTTSSFDDQYMM---------------------------LVDENKRLK 195
                    R  S   S ++Q +                            + DEN+RL+
Sbjct: 158 PMALPVTTTRDGSPVLSGEEQVISSSSSPEPPLVLPQAPSGSGSGGVASGDVGDENERLR 217

BLAST of Cla002923 vs. Swiss-Prot
Match: HFB2A_ARATH (Heat stress transcription factor B-2a OS=Arabidopsis thaliana GN=HSFB2A PE=2 SV=1)

HSP 1 Score: 164.1 bits (414), Expect = 2.0e-39
Identity = 97/235 (41.28%), Positives = 133/235 (56.60%), Query Frame = 1

Query: 19  ARKPTPPPFLAKTYMLVEDPTTDDIISWNSDGTAFVVWRPQEFATDLLPTLFKHNNFSSF 78
           +++  P PFL KT+ LVED + DD+ISWN DG++F+VW P +FA DLLP  FKHNNFSSF
Sbjct: 16  SQRSIPTPFLTKTFNLVEDSSIDDVISWNEDGSSFIVWNPTDFAKDLLPKHFKHNNFSSF 75

Query: 79  VRQLNTYGFRKIATSRWEFCNDKFQKGCKERLCEIRRRKAWS--------NKRQHNNNAK 138
           VRQLNTYGF+K+   RWEF ND F++G K  L EI+RRK  +        +  Q N    
Sbjct: 76  VRQLNTYGFKKVVPDRWEFSNDFFKRGEKRLLREIQRRKITTTHQTVVAPSSEQRNQTMV 135

Query: 139 VIQATHQVDRDEDQRSSSTTSSF----------DDQYMMLVDENKRLKKENGVLSFELTN 198
           V  +    D + +Q  SS+ SS+              + L++EN++L+ +N  L+ ELT 
Sbjct: 136 VSPSNSGEDNNNNQVMSSSPSSWYCHQTKTTGNGGLSVELLEENEKLRSQNIQLNRELTQ 195

Query: 199 MKKKCRELLDLVAKYEFAVIT-------GDEKKEDEMKPNLKLFGVKLEIEEEDE 229
           MK  C  +  L++ Y  +  T       G   +  E  P  K F  ++EIEEE+E
Sbjct: 196 MKSICDNIYSLMSNYVGSQPTDRSYSPGGSSSQPMEFLP-AKRFS-EMEIEEEEE 248

BLAST of Cla002923 vs. Swiss-Prot
Match: HSFB1_ARATH (Heat stress transcription factor B-1 OS=Arabidopsis thaliana GN=HSFB1 PE=2 SV=2)

HSP 1 Score: 163.7 bits (413), Expect = 2.6e-39
Identity = 99/245 (40.41%), Positives = 132/245 (53.88%), Query Frame = 1

Query: 16  VGPARKPTPPPFLAKTYMLVEDPTTDDIISWNSDGTAFVVWRPQEFATDLLPTLFKHNNF 75
           V  A++  P PFL+KTY LV+D +TDD++SWN +GTAFVVW+  EFA DLLP  FKHNNF
Sbjct: 4   VTAAQRSVPAPFLSKTYQLVDDHSTDDVVSWNEEGTAFVVWKTAEFAKDLLPQYFKHNNF 63

Query: 76  SSFVRQLNTYGFRKIATSRWEFCNDKFQKGCKERLCEIRRRKAWSNKRQHNNNAKVIQAT 135
           SSF+RQLNTYGFRK    +WEF ND F++G ++ L +IRRRK+               + 
Sbjct: 64  SSFIRQLNTYGFRKTVPDKWEFANDYFRRGGEDLLTDIRRRKSVIASTAGKCVVVGSPSE 123

Query: 136 HQVDRDEDQRSSSTTS--------SFDDQYMMLVDENKRLKKENGVLSFELTNMKKKCRE 195
                 +D  SSST+S        S ++    L  EN++LK+EN  LS EL   KK+  E
Sbjct: 124 SNSGGGDDHGSSSTSSPGSSKNPGSVENMVADLSGENEKLKRENNNLSSELAAAKKQRDE 183

Query: 196 LLDLVAKY---------------EFAVITGDEKKEDE-----------MKPNLKLFGVKL 227
           L+  +  +               +F  +  DE+ E E           +   LKLFGV L
Sbjct: 184 LVTFLTGHLKVRPEQIDKMIKGGKFKPVESDEESECEGCDGGGGAEEGVGEGLKLFGVWL 243

BLAST of Cla002923 vs. TrEMBL
Match: A0A0A0LF37_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G822450 PE=3 SV=1)

HSP 1 Score: 389.8 bits (1000), Expect = 2.5e-105
Identity = 193/253 (76.28%), Positives = 216/253 (85.38%), Query Frame = 1

Query: 5   IVGLGNVCENY--VGPARKPTPPPFLAKTYMLVEDPTTDDIISWNSDGTAFVVWRPQEFA 64
           +V LGNVC+    +G  R+  P PFL KTYMLVEDP TDD+ISWNSDGT F+VW+P EFA
Sbjct: 1   MVSLGNVCDQLDSIGAVRELAPSPFLTKTYMLVEDPMTDDVISWNSDGTTFIVWQPPEFA 60

Query: 65  TDLLPTLFKHNNFSSFVRQLNTYGFRKIATSRWEFCNDKFQKGCKERLCEIRRRKAWSNK 124
            DLLPTLFKHNNFSSFVRQLNTYGFRKIATSRWEF N+KF+KGCKERLCEI RRKAW+NK
Sbjct: 61  IDLLPTLFKHNNFSSFVRQLNTYGFRKIATSRWEFYNEKFKKGCKERLCEIHRRKAWTNK 120

Query: 125 RQHNNNAKVIQATHQVDRDEDQRSSSTTSSFDDQYMMLVDENKRLKKENGVLSFELTNMK 184
           R+HN+NAK IQ THQ + DEDQRS ST+SS DDQY ML  ENK+LKKENGVLSFELTNMK
Sbjct: 121 RKHNSNAKAIQVTHQDNHDEDQRSLSTSSS-DDQYTMLAYENKKLKKENGVLSFELTNMK 180

Query: 185 KKCRELLDLVAKYEFAVITGDEKKEDE--MKPNLKLFGVKLEIEEEDEMVIRQNKRKRSI 244
           KKCRELLDLVAKY+F V+ G++KK DE  MKPNLKLFGVKLE+EEEDEM I+QNKRKRS 
Sbjct: 181 KKCRELLDLVAKYKFVVVNGNKKKADEIMMKPNLKLFGVKLEVEEEDEMEIKQNKRKRSN 240

Query: 245 YQNKHFLLLQSSK 254
           Y +K FLL Q+ K
Sbjct: 241 YPDKPFLLSQTCK 252

BLAST of Cla002923 vs. TrEMBL
Match: A0A061F3V0_THECC (Heat shock factor protein, putative OS=Theobroma cacao GN=TCM_024565 PE=3 SV=1)

HSP 1 Score: 287.7 bits (735), Expect = 1.3e-74
Identity = 154/234 (65.81%), Positives = 181/234 (77.35%), Query Frame = 1

Query: 20  RKPTPPPFLAKTYMLVEDPTTDDIISWNSDGTAFVVWRPQEFATDLLPTLFKHNNFSSFV 79
           RK TPPPFL KTYMLVEDP TDD+ISWN+DGT FVVW+P EFA DLLPTLFKH+NFSSFV
Sbjct: 17  RKSTPPPFLMKTYMLVEDPITDDVISWNADGTGFVVWQPAEFARDLLPTLFKHSNFSSFV 76

Query: 80  RQLNTYGFRKIATSRWEFCNDKFQKGCKERLCEIRRRKAWSNKRQHNNNAKVIQATHQVD 139
           RQLNTYGFRK+ATSRWEFCN+ F+KG +E LC IRRRKAW+NK+Q    A  IQ + Q D
Sbjct: 77  RQLNTYGFRKVATSRWEFCNEMFRKGDRELLCNIRRRKAWANKQQ---TAATIQVSPQ-D 136

Query: 140 RDEDQRSSSTTSSFDDQYMMLVDENKRLKKENGVLSFELTNMKKKCRELLDLVAKYEFAV 199
            DEDQ+SSST+SS    +  LVDENKRLKKENGVLS ELT+MK+KC+ELLDLVAKY    
Sbjct: 137 SDEDQKSSSTSSS--SGHNSLVDENKRLKKENGVLSLELTSMKRKCKELLDLVAKY---- 196

Query: 200 ITGDEKKEDEMKPNLKLFGVKLEIEEEDEMVIRQNKRKRSIYQNKHFLLLQSSK 254
              +E+KEDE   + KLFGV+LE+E E E    + +R+  I ++   LL QS K
Sbjct: 197 AQFEEEKEDE---SPKLFGVRLEVEGERE----RKRRRAEISESASILLSQSCK 233

BLAST of Cla002923 vs. TrEMBL
Match: A0A067K1D9_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21103 PE=3 SV=1)

HSP 1 Score: 285.8 bits (730), Expect = 5.1e-74
Identity = 155/235 (65.96%), Positives = 181/235 (77.02%), Query Frame = 1

Query: 20  RKPTPPPFLAKTYMLVEDPTTDDIISWNSDGTAFVVWRPQEFATDLLPTLFKHNNFSSFV 79
           RK TPPPFL KTYMLVEDP TD ++SWN DGT F+VW+P EFA DLLPTLFKH+NFSSFV
Sbjct: 23  RKSTPPPFLLKTYMLVEDPATDHVVSWNDDGTGFIVWQPAEFARDLLPTLFKHSNFSSFV 82

Query: 80  RQLNTYGFRKIATSRWEFCNDKFQKGCKERLCEIRRRKAWSNKRQHNNNAKVIQATHQVD 139
           RQLNTYGFRK+ATSRWEFCND F+KG KE LC+IRRRKAWSNK+Q N     IQ T Q +
Sbjct: 83  RQLNTYGFRKVATSRWEFCNDMFRKGEKELLCQIRRRKAWSNKQQPNAQ---IQTTPQ-E 142

Query: 140 RDEDQRSSSTTSSFDDQYMMLVDENKRLKKENGVLSFELTNMKKKCRELLDLVAKYEFAV 199
            DEDQRSSST+SS   +Y +LVDENKRLKKEN VLS ELT+MK+KC+ELLDLVAKY    
Sbjct: 143 SDEDQRSSSTSSS--SEYGVLVDENKRLKKENKVLSSELTDMKRKCKELLDLVAKY--TR 202

Query: 200 ITGDEKKEDEMKPNLKLFGVKLEIEEEDEMVIRQNKRKRS-IYQNKHFLLLQSSK 254
              +E++E++  P  KLFGV+LE+  E     R+ KRKR+ I +    LL QS K
Sbjct: 203 FEKEEEEEEDASP--KLFGVRLEVAGE-----RERKRKRAEIRECATILLSQSCK 242

BLAST of Cla002923 vs. TrEMBL
Match: A0A059DIB7_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_A02338 PE=3 SV=1)

HSP 1 Score: 282.3 bits (721), Expect = 5.7e-73
Identity = 154/242 (63.64%), Positives = 180/242 (74.38%), Query Frame = 1

Query: 20  RKPTPPPFLAKTYMLVEDPTTDDIISWNSDGTAFVVWRPQEFATDLLPTLFKHNNFSSFV 79
           RK TPPPFL KTYMLVEDP TDD+ISWN+DGTAFVVW+P EFA DLLPTLFKH+NFSSFV
Sbjct: 20  RKSTPPPFLLKTYMLVEDPATDDVISWNADGTAFVVWQPAEFARDLLPTLFKHSNFSSFV 79

Query: 80  RQLNTYGFRKIATSRWEFCNDKFQKGCKERLCEIRRRKAWSNKRQHNNNAKVIQ-ATHQV 139
           RQLNTYGFRKIATSRWEFCND F+KG +E LCEIRRRKAW+NK + N + +  Q   H  
Sbjct: 80  RQLNTYGFRKIATSRWEFCNDMFRKGERELLCEIRRRKAWANKPRSNPSTQAHQDNNHNH 139

Query: 140 DRDEDQRSSSTTSSFDDQYMMLVDENKRLKKENGVLSFELTNMKKKCRELLDLVAKYEFA 199
             DEDQRSSST+SS   +Y  L+ ENKRLKKENG LS ELT MK KC+ELL+LVAKY   
Sbjct: 140 SSDEDQRSSSTSSS--SEYSNLIHENKRLKKENGALSSELTGMKYKCKELLELVAKYT-R 199

Query: 200 VITGDEKKEDEMKPN------LKLFGVKLEIEEEDEMVIRQNKRKRS-IYQNKHFLLLQS 254
              GD ++E+E K N      LKLFGV+LE++ +      + KRK + I ++   LL QS
Sbjct: 200 HEDGDGEEEEEEKGNNNGGEGLKLFGVRLEVKGDQR---GERKRKGARIRESASILLSQS 255

BLAST of Cla002923 vs. TrEMBL
Match: V4SC63_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10005649mg PE=3 SV=1)

HSP 1 Score: 281.2 bits (718), Expect = 1.3e-72
Identity = 148/247 (59.92%), Positives = 176/247 (71.26%), Query Frame = 1

Query: 19  ARKPTPPPFLAKTYMLVEDPTTDDIISWNSDGTAFVVWRPQEFATDLLPTLFKHNNFSSF 78
           ARK TPPPFL KTYML+EDP TD++ISWN DGT FVVW+P EFA DLLPTLFKH+NFSSF
Sbjct: 23  ARKSTPPPFLLKTYMLIEDPATDNVISWNGDGTGFVVWQPAEFARDLLPTLFKHSNFSSF 82

Query: 79  VRQLNTYGFRKIATSRWEFCNDKFQKGCKERLCEIRRRKAWSNKRQHNNNAKVIQATHQV 138
           VRQLNTYGFRK+ATSRWEFCN  F+KG K+ LC+IRRRKAW+N++Q    A    AT Q 
Sbjct: 83  VRQLNTYGFRKVATSRWEFCNQMFRKGEKDLLCKIRRRKAWANRQQ----AATAAATQQE 142

Query: 139 DRDEDQRSSSTTSSFDDQYMMLVDENKRLKKENGVLSFELTNMKKKCRELLDLVAKYEFA 198
             DEDQRSSS+TSS   +Y  L DENKRLKKENG LS EL +MK+KC+ELLDLVAKY   
Sbjct: 143 SHDEDQRSSSSTSSL-SEYNTLRDENKRLKKENGNLSSELVSMKRKCKELLDLVAKYANM 202

Query: 199 VITGDEKKEDEMKPN------------LKLFGVKLEIEEEDEMVIRQNKRKRSIYQNKHF 254
               D+  +DE   +            +KLFGV+ E++E +    R+ KR   I Q+ + 
Sbjct: 203 DNDDDDDDDDEDDDDHEINNNQRGPLIVKLFGVRFEVDEGERERKRKRKRAEIISQSANI 262

BLAST of Cla002923 vs. NCBI nr
Match: gi|659085244|ref|XP_008443318.1| (PREDICTED: heat stress transcription factor B-3-like [Cucumis melo])

HSP 1 Score: 407.9 bits (1047), Expect = 1.3e-110
Identity = 201/255 (78.82%), Positives = 222/255 (87.06%), Query Frame = 1

Query: 1   MEEAIVGLGNVCENYVGPARKPTPPPFLAKTYMLVEDPTTDDIISWNSDGTAFVVWRPQE 60
           M EA+V LGNVCE  +    +PTP PFL KTYMLVEDP TD++ISWNSDGT F+VW+P E
Sbjct: 1   MGEAMVSLGNVCEESIAVVSEPTPSPFLTKTYMLVEDPMTDNVISWNSDGTTFIVWQPPE 60

Query: 61  FATDLLPTLFKHNNFSSFVRQLNTYGFRKIATSRWEFCNDKFQKGCKERLCEIRRRKAWS 120
           FA DLLPTLFKHNNFSSFVRQLNTYGFRKIATSRWEF NDKF+KGCKERLCEIRRRKAW+
Sbjct: 61  FAIDLLPTLFKHNNFSSFVRQLNTYGFRKIATSRWEFYNDKFKKGCKERLCEIRRRKAWT 120

Query: 121 NKRQHNNNAKVIQATHQVDRDEDQRSSSTTSSFDDQYMMLVDENKRLKKENGVLSFELTN 180
           NKR+HN+NAK IQ THQ + DEDQRS ST+SS DDQY ML  ENK+LKKENGVLSFELTN
Sbjct: 121 NKRKHNSNAKAIQVTHQDNHDEDQRSLSTSSS-DDQYTMLAYENKKLKKENGVLSFELTN 180

Query: 181 MKKKCRELLDLVAKYEFAVITGDEKKEDE--MKPNLKLFGVKLEIEEEDEMVIRQNKRKR 240
           MKKKCRELLDLVAKYEF VI G+EKKEDE  +KPNLKLFGV+LE+EEEDEM I+QNKRKR
Sbjct: 181 MKKKCRELLDLVAKYEFMVINGNEKKEDEIMLKPNLKLFGVQLEVEEEDEMEIKQNKRKR 240

Query: 241 SIYQNKHFLLLQSSK 254
           SI+ NK FLL Q+ K
Sbjct: 241 SIHPNKPFLLSQTCK 254

BLAST of Cla002923 vs. NCBI nr
Match: gi|449438018|ref|XP_004136787.1| (PREDICTED: heat stress transcription factor B-3-like [Cucumis sativus])

HSP 1 Score: 389.8 bits (1000), Expect = 3.6e-105
Identity = 193/253 (76.28%), Positives = 216/253 (85.38%), Query Frame = 1

Query: 5   IVGLGNVCENY--VGPARKPTPPPFLAKTYMLVEDPTTDDIISWNSDGTAFVVWRPQEFA 64
           +V LGNVC+    +G  R+  P PFL KTYMLVEDP TDD+ISWNSDGT F+VW+P EFA
Sbjct: 1   MVSLGNVCDQLDSIGAVRELAPSPFLTKTYMLVEDPMTDDVISWNSDGTTFIVWQPPEFA 60

Query: 65  TDLLPTLFKHNNFSSFVRQLNTYGFRKIATSRWEFCNDKFQKGCKERLCEIRRRKAWSNK 124
            DLLPTLFKHNNFSSFVRQLNTYGFRKIATSRWEF N+KF+KGCKERLCEI RRKAW+NK
Sbjct: 61  IDLLPTLFKHNNFSSFVRQLNTYGFRKIATSRWEFYNEKFKKGCKERLCEIHRRKAWTNK 120

Query: 125 RQHNNNAKVIQATHQVDRDEDQRSSSTTSSFDDQYMMLVDENKRLKKENGVLSFELTNMK 184
           R+HN+NAK IQ THQ + DEDQRS ST+SS DDQY ML  ENK+LKKENGVLSFELTNMK
Sbjct: 121 RKHNSNAKAIQVTHQDNHDEDQRSLSTSSS-DDQYTMLAYENKKLKKENGVLSFELTNMK 180

Query: 185 KKCRELLDLVAKYEFAVITGDEKKEDE--MKPNLKLFGVKLEIEEEDEMVIRQNKRKRSI 244
           KKCRELLDLVAKY+F V+ G++KK DE  MKPNLKLFGVKLE+EEEDEM I+QNKRKRS 
Sbjct: 181 KKCRELLDLVAKYKFVVVNGNKKKADEIMMKPNLKLFGVKLEVEEEDEMEIKQNKRKRSN 240

Query: 245 YQNKHFLLLQSSK 254
           Y +K FLL Q+ K
Sbjct: 241 YPDKPFLLSQTCK 252

BLAST of Cla002923 vs. NCBI nr
Match: gi|645278471|ref|XP_008244246.1| (PREDICTED: heat stress transcription factor B-3 [Prunus mume])

HSP 1 Score: 289.3 bits (739), Expect = 6.6e-75
Identity = 156/241 (64.73%), Positives = 187/241 (77.59%), Query Frame = 1

Query: 20  RKPTPPPFLAKTYMLVEDPTTDDIISWNSDGTAFVVWRPQEFATDLLPTLFKHNNFSSFV 79
           RK +PPPFL KTYMLVEDP TDD+ISWN DG+AFVVW+P EFA DLLPTLFKH+NFSSFV
Sbjct: 15  RKSSPPPFLLKTYMLVEDPATDDVISWNDDGSAFVVWQPAEFARDLLPTLFKHSNFSSFV 74

Query: 80  RQLNTYGFRKIATSRWEFCNDKFQKGCKERLCEIRRRKAWSNKRQHNNNAKVIQAT---- 139
           RQLNTYGFRK++TSRWEFCNDKF+KG K++LCEIRRRKAW++K+Q  NN  + QA     
Sbjct: 75  RQLNTYGFRKVSTSRWEFCNDKFRKGEKDQLCEIRRRKAWASKQQPINNIALNQAAQAMP 134

Query: 140 HQVDRDEDQRSSSTTSSFDDQYMMLVDENKRLKKENGVLSFELTNMKKKCRELLDLVAKY 199
           +Q + DEDQRS+S+TSS  D Y  LVDENKRLK+ENGVLS ELT+MK+KC+ELLDLVAK 
Sbjct: 135 NQDEFDEDQRSNSSTSSSSD-YSSLVDENKRLKQENGVLSSELTSMKRKCKELLDLVAK- 194

Query: 200 EFAVITGD--EKKEDEMKPNLKLFGVKLEIEEEDEMVIRQNKRKRS-IYQNKHFLLLQSS 254
                 GD  EK+E+  +   KLFGV+LE+E E E      KRKR+ I ++   LL Q+ 
Sbjct: 195 -----CGDSAEKEEENSERVPKLFGVRLEVEGETE-----RKRKRAEISESASILLSQAC 243

BLAST of Cla002923 vs. NCBI nr
Match: gi|590635576|ref|XP_007028662.1| (Heat shock factor protein, putative [Theobroma cacao])

HSP 1 Score: 287.7 bits (735), Expect = 1.9e-74
Identity = 154/234 (65.81%), Positives = 181/234 (77.35%), Query Frame = 1

Query: 20  RKPTPPPFLAKTYMLVEDPTTDDIISWNSDGTAFVVWRPQEFATDLLPTLFKHNNFSSFV 79
           RK TPPPFL KTYMLVEDP TDD+ISWN+DGT FVVW+P EFA DLLPTLFKH+NFSSFV
Sbjct: 17  RKSTPPPFLMKTYMLVEDPITDDVISWNADGTGFVVWQPAEFARDLLPTLFKHSNFSSFV 76

Query: 80  RQLNTYGFRKIATSRWEFCNDKFQKGCKERLCEIRRRKAWSNKRQHNNNAKVIQATHQVD 139
           RQLNTYGFRK+ATSRWEFCN+ F+KG +E LC IRRRKAW+NK+Q    A  IQ + Q D
Sbjct: 77  RQLNTYGFRKVATSRWEFCNEMFRKGDRELLCNIRRRKAWANKQQ---TAATIQVSPQ-D 136

Query: 140 RDEDQRSSSTTSSFDDQYMMLVDENKRLKKENGVLSFELTNMKKKCRELLDLVAKYEFAV 199
            DEDQ+SSST+SS    +  LVDENKRLKKENGVLS ELT+MK+KC+ELLDLVAKY    
Sbjct: 137 SDEDQKSSSTSSS--SGHNSLVDENKRLKKENGVLSLELTSMKRKCKELLDLVAKY---- 196

Query: 200 ITGDEKKEDEMKPNLKLFGVKLEIEEEDEMVIRQNKRKRSIYQNKHFLLLQSSK 254
              +E+KEDE   + KLFGV+LE+E E E    + +R+  I ++   LL QS K
Sbjct: 197 AQFEEEKEDE---SPKLFGVRLEVEGERE----RKRRRAEISESASILLSQSCK 233

BLAST of Cla002923 vs. NCBI nr
Match: gi|802729186|ref|XP_012086190.1| (PREDICTED: heat stress transcription factor B-3 [Jatropha curcas])

HSP 1 Score: 285.8 bits (730), Expect = 7.3e-74
Identity = 155/235 (65.96%), Positives = 181/235 (77.02%), Query Frame = 1

Query: 20  RKPTPPPFLAKTYMLVEDPTTDDIISWNSDGTAFVVWRPQEFATDLLPTLFKHNNFSSFV 79
           RK TPPPFL KTYMLVEDP TD ++SWN DGT F+VW+P EFA DLLPTLFKH+NFSSFV
Sbjct: 23  RKSTPPPFLLKTYMLVEDPATDHVVSWNDDGTGFIVWQPAEFARDLLPTLFKHSNFSSFV 82

Query: 80  RQLNTYGFRKIATSRWEFCNDKFQKGCKERLCEIRRRKAWSNKRQHNNNAKVIQATHQVD 139
           RQLNTYGFRK+ATSRWEFCND F+KG KE LC+IRRRKAWSNK+Q N     IQ T Q +
Sbjct: 83  RQLNTYGFRKVATSRWEFCNDMFRKGEKELLCQIRRRKAWSNKQQPNAQ---IQTTPQ-E 142

Query: 140 RDEDQRSSSTTSSFDDQYMMLVDENKRLKKENGVLSFELTNMKKKCRELLDLVAKYEFAV 199
            DEDQRSSST+SS   +Y +LVDENKRLKKEN VLS ELT+MK+KC+ELLDLVAKY    
Sbjct: 143 SDEDQRSSSTSSS--SEYGVLVDENKRLKKENKVLSSELTDMKRKCKELLDLVAKY--TR 202

Query: 200 ITGDEKKEDEMKPNLKLFGVKLEIEEEDEMVIRQNKRKRS-IYQNKHFLLLQSSK 254
              +E++E++  P  KLFGV+LE+  E     R+ KRKR+ I +    LL QS K
Sbjct: 203 FEKEEEEEEDASP--KLFGVRLEVAGE-----RERKRKRAEIRECATILLSQSCK 242

The following BLAST results are available for this feature:
Swiss-Prot:
Match Name   E-value  Identity (%)  Description
HSFB3_ARATH  1.1e-56  53.60         Heat stress transcription factor B-3 OS=Arabidopsis thaliana GN=HSFB3 PE=2 SV=1
HSF24_SOLPE  6.7e-43  48.95         Heat shock factor protein HSF24 OS=Solanum peruvianum GN=HSF24 PE=2 SV=1
HFB2B_ORYSJ  2.2e-41  45.41         Heat stress transcription factor B-2b OS=Oryza sativa subsp. japonica GN=HSFB2B ...
HFB2A_ARATH  2.0e-39  41.28         Heat stress transcription factor B-2a OS=Arabidopsis thaliana GN=HSFB2A PE=2 SV=...
HSFB1_ARATH  2.6e-39  40.41         Heat stress transcription factor B-1 OS=Arabidopsis thaliana GN=HSFB1 PE=2 SV=2

TrEMBL:
Match Name        E-value   Identity (%)  Description
A0A0A0LF37_CUCSA  2.5e-105  76.28         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G822450 PE=3 SV=1
A0A061F3V0_THECC  1.3e-74   65.81         Heat shock factor protein, putative OS=Theobroma cacao GN=TCM_024565 PE=3 SV=1
A0A067K1D9_JATCU  5.1e-74   65.96         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21103 PE=3 SV=1
A0A059DIB7_EUCGR  5.7e-73   63.64         Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_A02338 PE=3 SV=1
V4SC63_9ROSI      1.3e-72   59.92         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10005649mg PE=3 SV=1

NCBI nr:
Match Name                        E-value   Identity (%)  Description
gi|659085244|ref|XP_008443318.1|  1.3e-110  78.82         PREDICTED: heat stress transcription factor B-3-like [Cucumis melo]
gi|449438018|ref|XP_004136787.1|  3.6e-105  76.28         PREDICTED: heat stress transcription factor B-3-like [Cucumis sativus]
gi|645278471|ref|XP_008244246.1|  6.6e-75   64.73         PREDICTED: heat stress transcription factor B-3 [Prunus mume]
gi|590635576|ref|XP_007028662.1|  1.9e-74   65.81         Heat shock factor protein, putative [Theobroma cacao]
gi|802729186|ref|XP_012086190.1|  7.3e-74   65.96         PREDICTED: heat stress transcription factor B-3 [Jatropha curcas]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR000232  HSF_DNA-bd
IPR011991  Winged helix-turn-helix DNA-binding domain
IPR027725  HSF_fam

Vocabulary: Molecular Function
Term        Definition
GO:0003700  transcription factor activity, sequence-specific DNA binding
GO:0043565  sequence-specific DNA binding

Vocabulary: Biological Process
Term        Definition
GO:0006355  regulation of transcription, DNA-templated

Vocabulary: Cellular Component
Term        Definition
GO:0005634  nucleus
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006355      regulation of transcription, DNA-templated
biological_process  GO:0009408      response to heat
biological_process  GO:0061408      positive regulation of transcription from RNA polymerase II promoter in response to heat stress
cellular_component  GO:0005634      nucleus
cellular_component  GO:0005667      transcription factor complex
molecular_function  GO:0043565      sequence-specific DNA binding
molecular_function  GO:0003700      transcription factor activity, sequence-specific DNA binding
molecular_function  GO:0000978      RNA polymerase II core promoter proximal region sequence-specific DNA binding
This gene is associated with the following unigenes:
Unigene Name  Analysis Name                          Sequence type in Unigene
WMU78990      watermelon EST collection version 2.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla002923     Cla002923.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
WMU78990      WMU78990     transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term   IPR Description                             Source   Source Term       Source Description                 Alignment
IPR000232  Heat shock factor (HSF)-type, DNA-binding   PRINTS   PR00056           HSFDOMAIN                          coord: 78..90, score: 2.0E-19; coord: 65..77, score: 2.0E-19; coord: 27..50, score: 2.0
IPR000232  Heat shock factor (HSF)-type, DNA-binding   PFAM     PF00447           HSF_DNA-bind                       coord: 27..116, score: 1.5
IPR000232  Heat shock factor (HSF)-type, DNA-binding   SMART    SM00415           hsfneu3                            coord: 23..116, score: 6.7
IPR000232  Heat shock factor (HSF)-type, DNA-binding   PROSITE  PS00434           HSF_DOMAIN                         coord: 66..90, scor
IPR011991  Winged helix-turn-helix DNA-binding domain  GENE3D   G3DSA:1.10.10.10                                     coord: 20..111, score: 2.0
IPR011991  Winged helix-turn-helix DNA-binding domain  unknown  SSF46785          "Winged helix" DNA-binding domain  coord: 23..116, score: 3.45
IPR027725  Heat shock transcription factor family      PANTHER  PTHR10015         HEAT SHOCK TRANSCRIPTION FACTOR    coord: 24..196, score: 9.7
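
The `coord` ranges in the InterPro annotations refer to residue positions in the protein sequence, so the annotated region can be cut out directly. The sketch below is illustrative only: the helper `domain_slice` is hypothetical, the coordinates are assumed to be 1-based and inclusive, and only the first 120 residues of the protein (copied from this record) are included, which is enough for the IPR000232 and IPR011991 hits but not for the PANTHER range.

```python
# First 120 residues of the Cla002923 protein, as listed in this record.
PROTEIN_1_120 = (
    "MEEAIVGLGNVCENYVGPARKPTPPPFLAKTYMLVEDPTTDDIISWNSDGTAFVVWRPQE"
    "FATDLLPTLFKHNNFSSFVRQLNTYGFRKIATSRWEFCNDKFQKGCKERLCEIRRRKAWS"
)


def domain_slice(sequence, start, end):
    """Return residues start..end, ASSUMING 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(f"range {start}..{end} is outside the sequence")
    return sequence[start - 1:end]


pfam_hsf = domain_slice(PROTEIN_1_120, 27, 116)    # PFAM PF00447
prosite_hsf = domain_slice(PROTEIN_1_120, 66, 90)  # PROSITE PS00434

print(len(pfam_hsf))  # 90
print(prosite_hsf)    # LPTLFKHNNFSSFVRQLNTYGFRKI
```

The PROSITE range falls inside the PFAM range, and both lie in the N-terminal region that aligns most strongly in the BLAST hits above.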