Cla97C05G106720 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C05G106720
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Homeobox protein
Location: Cla97Chr05: 33865821 .. 33867128 (+)
RNA-Seq Expression: Cla97C05G106720
Synteny: Cla97C05G106720
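
The Location field packs chromosome, start, end and strand into one string. A minimal sketch of splitting it apart and deriving the span length is given here; the function name parse_location is hypothetical, and the coordinates are assumed (not stated on this page) to be 1-based and inclusive.

```python
import re

# Location string copied from the Overview: chromosome: start .. end (strand)
LOCATION = "Cla97Chr05: 33865821 .. 33867128 (+)"


def parse_location(text):
    """Split a 'chrom: start .. end (strand)' string into its parts.

    Hypothetical helper for illustration only.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }


loc = parse_location(LOCATION)
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the gene spans 1308 bp on the plus strand.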
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATTCTGAAACAACCCGTTTTCTAGCAAAAGCTGAAGCTTCTTCTCTGCCCTGTTTTTGGGTACCTGATTCTTGCTGTTCCATGAAGGGAGAAGGAAACCAAGGTGGAAAGAAAAGAAGGCTCACAGTGGATCAAGTTCGTTTGCTTGAGAAGAATTTTAATGATGAGAACAAGCTTGAACATGACAGGAAAGTTCAAATTGCTGAGGAGATTGGTTTGCGGCCTCGCCAAGTTGCAGTTTGGTTTCAGAATCGAAGGGCTCGATCCAAGACTAAAAGAATTGAGAAAGATTATGAATCCTTGAATGCTGAATATGATAAGCTTAAGAATGATTTTGACAGTCTTCTCAAGCTGAATCAAGAACTCAAAGCAGAGGTATGGTCTCTGTTTATGGCATTAAAAACCTTCAGATCTGTTTACATATTATCTTCATTGTCGTTTAGTTTCAGTCCCATTATCATTTATTATCCACTTTGTGTTGGGTTCCAAAACCGGGTTAAAATCTCTCTTGTCATGTTCTAAATTTTTCCTAAATATGATTGTTAAACGAGATTTTTATACTATCAATATTGATTTTTGATTGATTACAAATATTATTAGGGAGCGATATTGAACATGTAATACCCGAACATGTACGTCTAGAAGTATATTAGGATGTGTTACACTGGAAGACAACTTTTATCAAAAGTCATCTCAAACTCCCCCTAAGACATATTAGGAATTAGGAGGATAACTCTTAGAAAATGTATTTCACTTAAAACCACAAATGTGTTCCACTCAAAACCTGTTTTATCAAAACTCATCTCAAACTCACTTTAAGTCATTTTCTACTGAATGGATCAACTCCAGAGAATGTGTTTCACTTAAAACCACTTCATTAAAGCCATCTCAAACTCACTCTAAGATGGTTTTTACCAGGTTGATCAATTGAGAGAAAAATGGGTTGCTAGCGAGAAGATGAAGAATCCTTTTGAATCAGTTGGAGTTGAAGCCATGGATTCATCAGTTACAGAACTTGGAAAAGCAAATACAAAGACAATGGTCGAAATTTTGTACAAGGTGCAAAAAGAGTCATCCAGACAGGAAGAAGGTAGCCGAAGTTCAAGCAAAAGTGATGGTTTTTATTCTGAGAGCCCCGCCAGGGAAAACCAATCAAAGTCGGGCAATTTCTTGCAAGATGAAGAAGATGAATTAGGCTACTTAGGAAAACTAGAAGATGAACTTTCTGCTAATGAGCTCATGGATTCCTTTAACACTTTCAGTAGTATAGTTGAAAATCAATCTTTCTGCTTCTGGTCTTACTGA

mRNA sequence

ATGGATTCTGAAACAACCCGTTTTCTAGCAAAAGCTGAAGCTTCTTCTCTGCCCTGTTTTTGGGTACCTGATTCTTGCTGTTCCATGAAGGGAGAAGGAAACCAAGGTGGAAAGAAAAGAAGGCTCACAGTGGATCAAGTTCGTTTGCTTGAGAAGAATTTTAATGATGAGAACAAGCTTGAACATGACAGGAAAGTTCAAATTGCTGAGGAGATTGGTTTGCGGCCTCGCCAAGTTGCAGTTTGGTTTCAGAATCGAAGGGCTCGATCCAAGACTAAAAGAATTGAGAAAGATTATGAATCCTTGAATGCTGAATATGATAAGCTTAAGAATGATTTTGACAGTCTTCTCAAGCTGAATCAAGAACTCAAAGCAGAGGTTGATCAATTGAGAGAAAAATGGGTTGCTAGCGAGAAGATGAAGAATCCTTTTGAATCAGTTGGAGTTGAAGCCATGGATTCATCAGTTACAGAACTTGGAAAAGCAAATACAAAGACAATGGTCGAAATTTTGTACAAGGTGCAAAAAGAGTCATCCAGACAGGAAGAAGGTAGCCGAAGTTCAAGCAAAAGTGATGGTTTTTATTCTGAGAGCCCCGCCAGGGAAAACCAATCAAAGTCGGGCAATTTCTTGCAAGATGAAGAAGATGAATTAGGCTACTTAGGAAAACTAGAAGATGAACTTTCTGCTAATGAGCTCATGGATTCCTTTAACACTTTCAGTAGTATAGTTGAAAATCAATCTTTCTGCTTCTGGTCTTACTGA

Coding sequence (CDS)

ATGGATTCTGAAACAACCCGTTTTCTAGCAAAAGCTGAAGCTTCTTCTCTGCCCTGTTTTTGGGTACCTGATTCTTGCTGTTCCATGAAGGGAGAAGGAAACCAAGGTGGAAAGAAAAGAAGGCTCACAGTGGATCAAGTTCGTTTGCTTGAGAAGAATTTTAATGATGAGAACAAGCTTGAACATGACAGGAAAGTTCAAATTGCTGAGGAGATTGGTTTGCGGCCTCGCCAAGTTGCAGTTTGGTTTCAGAATCGAAGGGCTCGATCCAAGACTAAAAGAATTGAGAAAGATTATGAATCCTTGAATGCTGAATATGATAAGCTTAAGAATGATTTTGACAGTCTTCTCAAGCTGAATCAAGAACTCAAAGCAGAGGTTGATCAATTGAGAGAAAAATGGGTTGCTAGCGAGAAGATGAAGAATCCTTTTGAATCAGTTGGAGTTGAAGCCATGGATTCATCAGTTACAGAACTTGGAAAAGCAAATACAAAGACAATGGTCGAAATTTTGTACAAGGTGCAAAAAGAGTCATCCAGACAGGAAGAAGGTAGCCGAAGTTCAAGCAAAAGTGATGGTTTTTATTCTGAGAGCCCCGCCAGGGAAAACCAATCAAAGTCGGGCAATTTCTTGCAAGATGAAGAAGATGAATTAGGCTACTTAGGAAAACTAGAAGATGAACTTTCTGCTAATGAGCTCATGGATTCCTTTAACACTTTCAGTAGTATAGTTGAAAATCAATCTTTCTGCTTCTGGTCTTACTGA

Protein sequence

MDSETTRFLAKAEASSLPCFWVPDSCCSMKGEGNQGGKKRRLTVDQVRLLEKNFNDENKLEHDRKVQIAEEIGLRPRQVAVWFQNRRARSKTKRIEKDYESLNAEYDKLKNDFDSLLKLNQELKAEVDQLREKWVASEKMKNPFESVGVEAMDSSVTELGKANTKTMVEILYKVQKESSRQEEGSRSSSKSDGFYSESPARENQSKSGNFLQDEEDELGYLGKLEDELSANELMDSFNTFSSIVENQSFCFWSY
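
The protein sequence corresponds to a translation of the CDS. The sketch below illustrates that relationship with the standard genetic code, applied only to the first 30 and last 21 nucleotides of the CDS shown above (the last 21 are assumed to be in frame). The translate helper is hypothetical and not part of any tool used to build this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


CDS_START = "ATGGATTCTGAAACAACCCGTTTTCTAGCA"  # first 30 nt of the CDS
CDS_END = "TTCTGCTTCTGGTCTTACTGA"             # last 21 nt, assumed in frame

print(translate(CDS_START))  # MDSETTRFLA
print(translate(CDS_END))    # FCFWSY
```

Both fragments agree with the start and end of the protein sequence listed above.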
Homology
BLAST of Cla97C05G106720 vs. NCBI nr
Match: XP_038892732.1 (homeobox-leucine zipper protein ATHB-54 [Benincasa hispida])

HSP 1 Score: 424.1 bits (1089), Expect = 8.6e-115
Identity = 224/254 (88.19%), Positives = 230/254 (90.55%), Query Frame = 0

Query: 1   MDSETTRFLAKAEASSLPCFWVPDSCCSMKGEGNQGGKKRRLTVDQVRLLEKNFNDENKL 60
           MDSETT FL KAEASSLPCF V D C SMKGEGNQGGKKRRLTVDQVRLLEKNFNDE KL
Sbjct: 1   MDSETTHFLPKAEASSLPCFLVSDFCSSMKGEGNQGGKKRRLTVDQVRLLEKNFNDEIKL 60

Query: 61  EHDRKVQIAEEIGLRPRQVAVWFQNRRARSKTKRIEKDYESLNAEYDKLKNDFDSLLKLN 120
           EH+RKVQIAEEIGLRPRQVAVWFQNRRARSKTKRIE DYESL+ EYDKLKNDFDSLLK+N
Sbjct: 61  EHERKVQIAEEIGLRPRQVAVWFQNRRARSKTKRIESDYESLSVEYDKLKNDFDSLLKVN 120

Query: 121 QELKAEVDQLREKWVASEKMKNPFESVGVEAMDSSVTELGKANTKTMVEILYKVQKESSR 180
           QELKAEVDQLR+KW ASEK KN FE V VEAMDSSVTELGKANTKTM EILYKVQ  S R
Sbjct: 121 QELKAEVDQLRDKWAASEKKKNSFEPVEVEAMDSSVTELGKANTKTMAEILYKVQLGSYR 180

Query: 181 QEEGSRSSSKSDGFYSESPARENQSKSGNFLQDEEDELGYLGKLEDELSANELMDSFNTF 240
           QEEGS SSSKSDGFYSESP RENQSKSGNFLQ+EEDELGYLGKLEDELSANELMDSFN  
Sbjct: 181 QEEGSLSSSKSDGFYSESPTRENQSKSGNFLQEEEDELGYLGKLEDELSANELMDSFNIL 240

Query: 241 SSIVENQSFCFWSY 255
           S+ VENQS CFWSY
Sbjct: 241 SNAVENQSLCFWSY 254
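
Each hit reports a bit score, an E-value, and identity and positives as fractions of the alignment length. A small sketch of reading those two header lines and recomputing the percentages follows; parse_hsp is a hypothetical helper, and the regular expressions assume the exact layout used on this page.

```python
import re

# Header lines of the first hit (Benincasa hispida), as shown above.
HSP_LINES = [
    "HSP 1 Score: 424.1 bits (1089), Expect = 8.6e-115",
    "Identity = 224/254 (88.19%), Positives = 230/254 (90.55%), Query Frame = 0",
]


def parse_hsp(lines):
    """Extract scores and recompute percentages from HSP header lines."""
    text = " ".join(lines)
    score = re.search(r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\)", text)
    expect = re.search(r"Expect\s*=\s*(\S+)", text)
    ident = re.search(r"Identity\s*=\s*(\d+)/(\d+)", text)
    # 'Pos\w*' tolerates spelling variants of the label.
    posit = re.search(r"Pos\w*\s*=\s*(\d+)/(\d+)", text)
    return {
        "bit_score": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(expect.group(1)),
        "identity_pct": round(100 * int(ident.group(1)) / int(ident.group(2)), 2),
        "positives_pct": round(100 * int(posit.group(1)) / int(posit.group(2)), 2),
    }


hsp = parse_hsp(HSP_LINES)
print(hsp["identity_pct"], hsp["positives_pct"])  # 88.19 90.55
```

The recomputed values match the percentages printed in the hit header.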

BLAST of Cla97C05G106720 vs. NCBI nr
Match: XP_004135200.1 (homeobox-leucine zipper protein HAT5 [Cucumis sativus] >KGN51881.1 hypothetical protein Csa_008673 [Cucumis sativus])

HSP 1 Score: 401.0 bits (1029), Expect = 7.8e-108
Identity = 214/254 (84.25%), Positives = 223/254 (87.80%), Query Frame = 0

Query: 1   MDSETTRFLAKAEASSLPCFWVPDSCCSMKGEGNQ-GGKKRRLTVDQVRLLEKNFNDENK 60
           MDSETT FL+K EASSLPCFWV DSC SMKGEG   GGKKRRL+VDQVRLLEKNFNDENK
Sbjct: 1   MDSETTHFLSKPEASSLPCFWVSDSCSSMKGEGTTLGGKKRRLSVDQVRLLEKNFNDENK 60

Query: 61  LEHDRKVQIAEEIGLRPRQVAVWFQNRRARSKTKRIEKDYESLNAEYDKLKNDFDSLLKL 120
           LEH+RKVQIAEEIGLRPRQVAVWFQNRRARSK KRIE DYE L+AEYDKLK+DFDSLL +
Sbjct: 61  LEHERKVQIAEEIGLRPRQVAVWFQNRRARSKMKRIESDYECLSAEYDKLKSDFDSLLNM 120

Query: 121 NQELKAEVDQLREKWVASEKMKNPFESVGVEAMDSSVTELGKANTKTMVEILYKVQKESS 180
           N ELKAEVDQLR  W A EKMKN FE VGVEAMDSSVT+L KAN KTM EILYKVQ  SS
Sbjct: 121 NHELKAEVDQLRTTWAAVEKMKNHFEPVGVEAMDSSVTKLEKANAKTMGEILYKVQMGSS 180

Query: 181 RQEEGSRSSSKSDGFYSESPARENQSKSGNFLQDEEDELGYLGKLEDELSANELMDSFNT 240
           R EEGS SSSKSDGFYSESP R+NQSKS NFLQDEEDELG LGKLEDELSANELM+SFN 
Sbjct: 181 RHEEGSLSSSKSDGFYSESPTRDNQSKSANFLQDEEDELGCLGKLEDELSANELMNSFNI 240

Query: 241 FSSIVENQSFCFWS 254
            SS VENQSFCFWS
Sbjct: 241 LSSAVENQSFCFWS 254

BLAST of Cla97C05G106720 vs. NCBI nr
Match: XP_008446309.1 (PREDICTED: homeobox-leucine zipper protein ATHB-16 [Cucumis melo])

HSP 1 Score: 395.6 bits (1015), Expect = 3.3e-106
Identity = 213/254 (83.86%), Positives = 221/254 (87.01%), Query Frame = 0

Query: 1   MDSETTRFLAKAEASSLPCFWVPDSCCSMKGEGNQGGKKRRLTVDQVRLLEKNFNDENKL 60
           MDSETT FL+K EASSLPCFWV DSC SMKGEG  GGKKRRL+VDQVRLLEKNFNDENKL
Sbjct: 1   MDSETTHFLSKPEASSLPCFWVSDSCSSMKGEGTLGGKKRRLSVDQVRLLEKNFNDENKL 60

Query: 61  EHDRKVQIAEEIGLRPRQVAVWFQNRRARSKTKRIEKDYESLNAEYDKLKNDFDSLLKLN 120
           EH+RKVQIAEEIGLRPRQVAVWFQNRRARSK KRIE DYE LNAEYDKLK+DFDSLL +N
Sbjct: 61  EHERKVQIAEEIGLRPRQVAVWFQNRRARSKMKRIESDYECLNAEYDKLKSDFDSLLNMN 120

Query: 121 QELKAEVDQLREKWVASEKMKNPFESVGVEAMDSSVTELGKANTKTMVEILYKVQKESSR 180
            ELKAEVDQLR KW A EKMKN FE VGVEAM SSVTEL KA  KTM EILY+VQ  SSR
Sbjct: 121 HELKAEVDQLRAKWAAMEKMKNHFEPVGVEAMVSSVTELEKAKAKTMGEILYEVQMGSSR 180

Query: 181 QE-EGSRSSSKSDGFYSESPARENQSKSGNFLQDEEDELGYLGKLEDELSANELMDSFNT 240
            E EGS SSSKSD FYSESP RENQSKS NFLQDEEDELGYL KLEDELSA+ELM+SFN 
Sbjct: 181 HELEGSLSSSKSDCFYSESPTRENQSKSANFLQDEEDELGYLEKLEDELSADELMNSFNI 240

Query: 241 FSSIVENQSFCFWS 254
            SS VENQSFCFWS
Sbjct: 241 LSSAVENQSFCFWS 254

BLAST of Cla97C05G106720 vs. NCBI nr
Match: KAA0034384.1 (homeobox-leucine zipper protein ATHB-16 [Cucumis melo var. makuwa] >TYK15535.1 homeobox-leucine zipper protein ATHB-16 [Cucumis melo var. makuwa])

HSP 1 Score: 346.7 bits (888), Expect = 1.7e-91
Identity = 190/226 (84.07%), Positives = 197/226 (87.17%), Query Frame = 0

Query: 29  MKGEGNQGGKKRRLTVDQVRLLEKNFNDENKLEHDRKVQIAEEIGLRPRQVAVWFQNRRA 88
           MKGEG  GGKKRRL+VDQVRLLEKNFNDENKLEH+RKVQIAEEIGLRPRQVAVWFQNRRA
Sbjct: 1   MKGEGTLGGKKRRLSVDQVRLLEKNFNDENKLEHERKVQIAEEIGLRPRQVAVWFQNRRA 60

Query: 89  RSKTKRIEKDYESLNAEYDKLKNDFDSLLKLNQELKAEVDQLREKWVASEKMKNPFESVG 148
           RSK KRIE DYE LNAEYDKLK+DFDSLL +N ELKAEVDQLR KW A EKMKN FE VG
Sbjct: 61  RSKMKRIESDYECLNAEYDKLKSDFDSLLNMNHELKAEVDQLRAKWAAMEKMKNHFEPVG 120

Query: 149 VEAMDSSVTELGKANTKTMVEILYKVQKESSRQE-EGSRSSSKSDGFYSESPARENQSKS 208
           VEAM SSVTEL KA  KTM EILY+VQ  SSR E EGS SSSKSD FYSESP RENQSKS
Sbjct: 121 VEAMVSSVTELEKAKAKTMGEILYEVQMGSSRHELEGSLSSSKSDCFYSESPTRENQSKS 180

Query: 209 GNFLQDEEDELGYLGKLEDELSANELMDSFNTFSSIVENQSFCFWS 254
            NFLQDEEDELGYL KLEDELSA+ELM+SFN  SS VENQSFCFWS
Sbjct: 181 ANFLQDEEDELGYLEKLEDELSADELMNSFNILSSAVENQSFCFWS 226

BLAST of Cla97C05G106720 vs. NCBI nr
Match: XP_022151198.1 (homeobox-leucine zipper protein ATHB-16 [Momordica charantia])

HSP 1 Score: 335.9 bits (860), Expect = 3.1e-88
Identity = 187/264 (70.83%), Positives = 210/264 (79.55%), Query Frame = 0

Query: 1   MDSETTRFLAKAEASSLPCFWVPDSCCSMKGE---------GNQGGKKRRLTVDQVRLLE 60
           MDSETT F+ +AE +S    W+ +S  SM+G           N  GKKRRLTVDQVRLLE
Sbjct: 1   MDSETTLFVPEAETNS-QSLWISNSGSSMEGRFLQEQADGARNSAGKKRRLTVDQVRLLE 60

Query: 61  KNFNDENKLEHDRKVQIAEEIGLRPRQVAVWFQNRRARSKTKRIEKDYESLNAEYDKLKN 120
           +NFN ENKLEH+RKVQ+AEEIGLRPRQVAVWFQNRRARSKTK+IE DYESLNAEY KLK 
Sbjct: 61  RNFNVENKLEHERKVQLAEEIGLRPRQVAVWFQNRRARSKTKKIEIDYESLNAEYHKLKK 120

Query: 121 DFDSLLKLNQELKAEVDQLREKWVASEKMKNPFE--SVGVEAMDSSVTELGKANTKTMVE 180
           D+ SL+KLN +LKAE D+LREKW A+EKM+NP E   V VEAMDSSVTELGK NT TM E
Sbjct: 121 DYTSLVKLNHDLKAEADELREKWAAAEKMRNPLEPVEVEVEAMDSSVTELGKPNTSTMGE 180

Query: 181 ILYKVQKESSRQEEGSRSSSKSDGFYSESPARENQSKSGNFLQDEEDELGYLGKLEDELS 240
            LY VQ  SSRQEEGSRSSSKSDGFYSESP  ENQS+S NFL+DEEDELG L KLEDE+ 
Sbjct: 181 DLYNVQMGSSRQEEGSRSSSKSDGFYSESPTMENQSQSDNFLRDEEDELGKLVKLEDEIY 240

Query: 241 ANELMDSFNTFSSIVENQSFCFWS 254
           A+E +DSFN  S+ VE+QS CFWS
Sbjct: 241 ADEFIDSFNFISTAVEDQSLCFWS 263

BLAST of Cla97C05G106720 vs. ExPASy Swiss-Prot
Match: P46668 (Homeobox-leucine zipper protein ATHB-6 OS=Arabidopsis thaliana OX=3702 GN=ATHB-6 PE=1 SV=1)

HSP 1 Score: 118.2 bits (295), Expect = 1.3e-25
Identity = 59/104 (56.73%), Positives = 81/104 (77.88%), Query Frame = 0

Query: 30  KGEGNQGGKKRRLTVDQVRLLEKNFNDENKLEHDRKVQIAEEIGLRPRQVAVWFQNRRAR 89
           +G      KKRRL+++QV+ LEKNF  ENKLE +RKV++A+E+GL+PRQVAVWFQNRRAR
Sbjct: 54  RGHVGLSEKKRRLSINQVKALEKNFELENKLEPERKVKLAQELGLQPRQVAVWFQNRRAR 113

Query: 90  SKTKRIEKDYESLNAEYDKLKNDFDSLLKLNQELKAEVDQLREK 134
            KTK++EKDY  L  +YD L+++FDSL + N+ L  E+ +L+ K
Sbjct: 114 WKTKQLEKDYGVLKTQYDSLRHNFDSLRRDNESLLQEISKLKTK 157

BLAST of Cla97C05G106720 vs. ExPASy Swiss-Prot
Match: Q02283 (Homeobox-leucine zipper protein HAT5 OS=Arabidopsis thaliana OX=3702 GN=HAT5 PE=1 SV=1)

HSP 1 Score: 117.5 bits (293), Expect = 2.2e-25
Identity = 58/105 (55.24%), Positives = 82/105 (78.10%), Query Frame = 0

Query: 38  KKRRLTVDQVRLLEKNFNDENKLEHDRKVQIAEEIGLRPRQVAVWFQNRRARSKTKRIEK 97
           KKRRLT +QV LLEK+F  ENKLE +RK Q+A+++GL+PRQVAVWFQNRRAR KTK++E+
Sbjct: 68  KKRRLTTEQVHLLEKSFETENKLEPERKTQLAKKLGLQPRQVAVWFQNRRARWKTKQLER 127

Query: 98  DYESLNAEYDKLKNDFDSLLKLNQELKAEVDQLREKWVASEKMKN 143
           DY+ L + YD+L +++DS++  N +L++EV  L EK    ++  N
Sbjct: 128 DYDLLKSTYDQLLSNYDSIVMDNDKLRSEVTSLTEKLQGKQETAN 172

BLAST of Cla97C05G106720 vs. ExPASy Swiss-Prot
Match: Q940J1 (Homeobox-leucine zipper protein ATHB-16 OS=Arabidopsis thaliana OX=3702 GN=ATHB-16 PE=2 SV=2)

HSP 1 Score: 117.1 bits (292), Expect = 2.9e-25
Identity = 60/105 (57.14%), Positives = 78/105 (74.29%), Query Frame = 0

Query: 38  KKRRLTVDQVRLLEKNFNDENKLEHDRKVQIAEEIGLRPRQVAVWFQNRRARSKTKRIEK 97
           KKRRL VDQV+ LEKNF  ENKLE +RK ++A+E+GL+PRQVAVWFQNRRAR KTK++EK
Sbjct: 59  KKRRLKVDQVKALEKNFELENKLEPERKTKLAQELGLQPRQVAVWFQNRRARWKTKQLEK 118

Query: 98  DYESLNAEYDKLKNDFDSLLKLNQELKAEVDQLREKWVASEKMKN 143
           DY  L  +YD L+++FDSL + N  L  E+ +++ K    E   N
Sbjct: 119 DYGVLKGQYDSLRHNFDSLRRDNDSLLQEISKIKAKVNGEEDNNN 163

BLAST of Cla97C05G106720 vs. ExPASy Swiss-Prot
Match: A2XD08 (Homeobox-leucine zipper protein HOX21 OS=Oryza sativa subsp. indica OX=39946 GN=HOX21 PE=2 SV=2)

HSP 1 Score: 116.3 bits (290), Expect = 5.0e-25
Identity = 61/111 (54.95%), Positives = 85/111 (76.58%), Query Frame = 0

Query: 29  MKGEGNQGG-KKRRLTVDQVRLLEKNFNDENKLEHDRKVQIAEEIGLRPRQVAVWFQNRR 88
           +  +G+Q G KKRRL V+QVR LEKNF   NKLE +RK+Q+A  +GL+PRQVA+WFQNRR
Sbjct: 114 LSDDGSQAGEKKRRLNVEQVRTLEKNFELGNKLEPERKMQLARALGLQPRQVAIWFQNRR 173

Query: 89  ARSKTKRIEKDYESLNAEYDKLKNDFDSLLKLNQELKAEVDQLREKWVASE 139
           AR KTK++EKDY++L  + D +K + D+LL  N++L+AE+  L+ +  ASE
Sbjct: 174 ARWKTKQLEKDYDALKRQLDAVKAENDALLNHNKKLQAEIVALKGREAASE 224

BLAST of Cla97C05G106720 vs. ExPASy Swiss-Prot
Match: Q8S7W9 (Homeobox-leucine zipper protein HOX21 OS=Oryza sativa subsp. japonica OX=39947 GN=HOX21 PE=2 SV=1)

HSP 1 Score: 116.3 bits (290), Expect = 5.0e-25
Identity = 61/111 (54.95%), Positives = 85/111 (76.58%), Query Frame = 0

Query: 29  MKGEGNQGG-KKRRLTVDQVRLLEKNFNDENKLEHDRKVQIAEEIGLRPRQVAVWFQNRR 88
           +  +G+Q G KKRRL V+QVR LEKNF   NKLE +RK+Q+A  +GL+PRQVA+WFQNRR
Sbjct: 120 LSDDGSQAGEKKRRLNVEQVRTLEKNFELGNKLEPERKMQLARALGLQPRQVAIWFQNRR 179

Query: 89  ARSKTKRIEKDYESLNAEYDKLKNDFDSLLKLNQELKAEVDQLREKWVASE 139
           AR KTK++EKDY++L  + D +K + D+LL  N++L+AE+  L+ +  ASE
Sbjct: 180 ARWKTKQLEKDYDALKRQLDAVKAENDALLNHNKKLQAEIVALKGREAASE 230

BLAST of Cla97C05G106720 vs. ExPASy TrEMBL
Match: A0A0A0KVU6 (Homeobox protein OS=Cucumis sativus OX=3659 GN=Csa_5G604260 PE=4 SV=1)

HSP 1 Score: 401.0 bits (1029), Expect = 3.8e-108
Identity = 214/254 (84.25%), Positives = 223/254 (87.80%), Query Frame = 0

Query: 1   MDSETTRFLAKAEASSLPCFWVPDSCCSMKGEGNQ-GGKKRRLTVDQVRLLEKNFNDENK 60
           MDSETT FL+K EASSLPCFWV DSC SMKGEG   GGKKRRL+VDQVRLLEKNFNDENK
Sbjct: 1   MDSETTHFLSKPEASSLPCFWVSDSCSSMKGEGTTLGGKKRRLSVDQVRLLEKNFNDENK 60

Query: 61  LEHDRKVQIAEEIGLRPRQVAVWFQNRRARSKTKRIEKDYESLNAEYDKLKNDFDSLLKL 120
           LEH+RKVQIAEEIGLRPRQVAVWFQNRRARSK KRIE DYE L+AEYDKLK+DFDSLL +
Sbjct: 61  LEHERKVQIAEEIGLRPRQVAVWFQNRRARSKMKRIESDYECLSAEYDKLKSDFDSLLNM 120

Query: 121 NQELKAEVDQLREKWVASEKMKNPFESVGVEAMDSSVTELGKANTKTMVEILYKVQKESS 180
           N ELKAEVDQLR  W A EKMKN FE VGVEAMDSSVT+L KAN KTM EILYKVQ  SS
Sbjct: 121 NHELKAEVDQLRTTWAAVEKMKNHFEPVGVEAMDSSVTKLEKANAKTMGEILYKVQMGSS 180

Query: 181 RQEEGSRSSSKSDGFYSESPARENQSKSGNFLQDEEDELGYLGKLEDELSANELMDSFNT 240
           R EEGS SSSKSDGFYSESP R+NQSKS NFLQDEEDELG LGKLEDELSANELM+SFN 
Sbjct: 181 RHEEGSLSSSKSDGFYSESPTRDNQSKSANFLQDEEDELGCLGKLEDELSANELMNSFNI 240

Query: 241 FSSIVENQSFCFWS 254
            SS VENQSFCFWS
Sbjct: 241 LSSAVENQSFCFWS 254

BLAST of Cla97C05G106720 vs. ExPASy TrEMBL
Match: A0A1S3BER7 (homeobox-leucine zipper protein ATHB-16 OS=Cucumis melo OX=3656 GN=LOC103489083 PE=4 SV=1)

HSP 1 Score: 395.6 bits (1015), Expect = 1.6e-106
Identity = 213/254 (83.86%), Positives = 221/254 (87.01%), Query Frame = 0

Query: 1   MDSETTRFLAKAEASSLPCFWVPDSCCSMKGEGNQGGKKRRLTVDQVRLLEKNFNDENKL 60
           MDSETT FL+K EASSLPCFWV DSC SMKGEG  GGKKRRL+VDQVRLLEKNFNDENKL
Sbjct: 1   MDSETTHFLSKPEASSLPCFWVSDSCSSMKGEGTLGGKKRRLSVDQVRLLEKNFNDENKL 60

Query: 61  EHDRKVQIAEEIGLRPRQVAVWFQNRRARSKTKRIEKDYESLNAEYDKLKNDFDSLLKLN 120
           EH+RKVQIAEEIGLRPRQVAVWFQNRRARSK KRIE DYE LNAEYDKLK+DFDSLL +N
Sbjct: 61  EHERKVQIAEEIGLRPRQVAVWFQNRRARSKMKRIESDYECLNAEYDKLKSDFDSLLNMN 120

Query: 121 QELKAEVDQLREKWVASEKMKNPFESVGVEAMDSSVTELGKANTKTMVEILYKVQKESSR 180
            ELKAEVDQLR KW A EKMKN FE VGVEAM SSVTEL KA  KTM EILY+VQ  SSR
Sbjct: 121 HELKAEVDQLRAKWAAMEKMKNHFEPVGVEAMVSSVTELEKAKAKTMGEILYEVQMGSSR 180

Query: 181 QE-EGSRSSSKSDGFYSESPARENQSKSGNFLQDEEDELGYLGKLEDELSANELMDSFNT 240
            E EGS SSSKSD FYSESP RENQSKS NFLQDEEDELGYL KLEDELSA+ELM+SFN 
Sbjct: 181 HELEGSLSSSKSDCFYSESPTRENQSKSANFLQDEEDELGYLEKLEDELSADELMNSFNI 240

Query: 241 FSSIVENQSFCFWS 254
            SS VENQSFCFWS
Sbjct: 241 LSSAVENQSFCFWS 254

BLAST of Cla97C05G106720 vs. ExPASy TrEMBL
Match: A0A5D3CVI2 (Homeobox-leucine zipper protein ATHB-16 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold35G00270 PE=4 SV=1)

HSP 1 Score: 346.7 bits (888), Expect = 8.4e-92
Identity = 190/226 (84.07%), Positives = 197/226 (87.17%), Query Frame = 0

Query: 29  MKGEGNQGGKKRRLTVDQVRLLEKNFNDENKLEHDRKVQIAEEIGLRPRQVAVWFQNRRA 88
           MKGEG  GGKKRRL+VDQVRLLEKNFNDENKLEH+RKVQIAEEIGLRPRQVAVWFQNRRA
Sbjct: 1   MKGEGTLGGKKRRLSVDQVRLLEKNFNDENKLEHERKVQIAEEIGLRPRQVAVWFQNRRA 60

Query: 89  RSKTKRIEKDYESLNAEYDKLKNDFDSLLKLNQELKAEVDQLREKWVASEKMKNPFESVG 148
           RSK KRIE DYE LNAEYDKLK+DFDSLL +N ELKAEVDQLR KW A EKMKN FE VG
Sbjct: 61  RSKMKRIESDYECLNAEYDKLKSDFDSLLNMNHELKAEVDQLRAKWAAMEKMKNHFEPVG 120

Query: 149 VEAMDSSVTELGKANTKTMVEILYKVQKESSRQE-EGSRSSSKSDGFYSESPARENQSKS 208
           VEAM SSVTEL KA  KTM EILY+VQ  SSR E EGS SSSKSD FYSESP RENQSKS
Sbjct: 121 VEAMVSSVTELEKAKAKTMGEILYEVQMGSSRHELEGSLSSSKSDCFYSESPTRENQSKS 180

Query: 209 GNFLQDEEDELGYLGKLEDELSANELMDSFNTFSSIVENQSFCFWS 254
            NFLQDEEDELGYL KLEDELSA+ELM+SFN  SS VENQSFCFWS
Sbjct: 181 ANFLQDEEDELGYLEKLEDELSADELMNSFNILSSAVENQSFCFWS 226

BLAST of Cla97C05G106720 vs. ExPASy TrEMBL
Match: A0A6J1DBJ2 (homeobox-leucine zipper protein ATHB-16 OS=Momordica charantia OX=3673 GN=LOC111019178 PE=4 SV=1)

HSP 1 Score: 335.9 bits (860), Expect = 1.5e-88
Identity = 187/264 (70.83%), Positives = 210/264 (79.55%), Query Frame = 0

Query: 1   MDSETTRFLAKAEASSLPCFWVPDSCCSMKGE---------GNQGGKKRRLTVDQVRLLE 60
           MDSETT F+ +AE +S    W+ +S  SM+G           N  GKKRRLTVDQVRLLE
Sbjct: 1   MDSETTLFVPEAETNS-QSLWISNSGSSMEGRFLQEQADGARNSAGKKRRLTVDQVRLLE 60

Query: 61  KNFNDENKLEHDRKVQIAEEIGLRPRQVAVWFQNRRARSKTKRIEKDYESLNAEYDKLKN 120
           +NFN ENKLEH+RKVQ+AEEIGLRPRQVAVWFQNRRARSKTK+IE DYESLNAEY KLK 
Sbjct: 61  RNFNVENKLEHERKVQLAEEIGLRPRQVAVWFQNRRARSKTKKIEIDYESLNAEYHKLKK 120

Query: 121 DFDSLLKLNQELKAEVDQLREKWVASEKMKNPFE--SVGVEAMDSSVTELGKANTKTMVE 180
           D+ SL+KLN +LKAE D+LREKW A+EKM+NP E   V VEAMDSSVTELGK NT TM E
Sbjct: 121 DYTSLVKLNHDLKAEADELREKWAAAEKMRNPLEPVEVEVEAMDSSVTELGKPNTSTMGE 180

Query: 181 ILYKVQKESSRQEEGSRSSSKSDGFYSESPARENQSKSGNFLQDEEDELGYLGKLEDELS 240
            LY VQ  SSRQEEGSRSSSKSDGFYSESP  ENQS+S NFL+DEEDELG L KLEDE+ 
Sbjct: 181 DLYNVQMGSSRQEEGSRSSSKSDGFYSESPTMENQSQSDNFLRDEEDELGKLVKLEDEIY 240

Query: 241 ANELMDSFNTFSSIVENQSFCFWS 254
           A+E +DSFN  S+ VE+QS CFWS
Sbjct: 241 ADEFIDSFNFISTAVEDQSLCFWS 263

BLAST of Cla97C05G106720 vs. ExPASy TrEMBL
Match: A0A6J1GYD2 (homeobox-leucine zipper protein ATHB-16-like OS=Cucurbita moschata OX=3662 GN=LOC111458422 PE=4 SV=1)

HSP 1 Score: 327.0 bits (837), Expect = 6.9e-86
Identity = 182/265 (68.68%), Positives = 204/265 (76.98%), Query Frame = 0

Query: 1   MDSETTRFLAKAEASSLPCFWVPDSCCSMKGE---------GNQGGKKRRLTVDQVRLLE 60
           MDS+TT  L   E  SLP  W+ DSC SM+GE          N+GGKKRRLT+DQVR+LE
Sbjct: 1   MDSDTTHCLPLPE--SLPSLWISDSCSSMEGEFSQPQAKADRNRGGKKRRLTLDQVRMLE 60

Query: 61  KNFNDENKLEHDRKVQIAEEIGLRPRQVAVWFQNRRARSKTKRIEKDYESLNAEYDKLKN 120
           + FN ENKLEH+RKVQIAEEIGLRPRQVAVWFQNRRARSKTK+IE DYESLNAEYDKLKN
Sbjct: 61  RTFNAENKLEHERKVQIAEEIGLRPRQVAVWFQNRRARSKTKKIEIDYESLNAEYDKLKN 120

Query: 121 DFDSLLKLNQELKAEVDQLREKWVASEKMKNPFESVGVEAMDSSVTELGKANTKTMVEIL 180
           DFDSLLK+N ELKAEV+QLR+KW A+EKM NP+E VG                      L
Sbjct: 121 DFDSLLKVNHELKAEVNQLRDKWAATEKMNNPYEPVGA---------------------L 180

Query: 181 YKVQKESSRQEEGSRSSSKSDGFYSESPARENQSKSGNFLQDEE--DELGYLGKLEDELS 240
           YKV+  SSRQE+GSRSSSKSD FY+ESP RENQS+SGNFL+DEE  DELGYLG LEDELS
Sbjct: 181 YKVEMGSSRQEQGSRSSSKSDVFYAESPTRENQSRSGNFLRDEEEDDELGYLGILEDELS 240

Query: 241 ANELMDSFNTFSSIVENQSFCFWSY 255
           A+ELMDSFN  SS  E+QSFCFWS+
Sbjct: 241 ADELMDSFNVMSSAAEDQSFCFWSF 242

BLAST of Cla97C05G106720 vs. TAIR 10
Match: AT2G22430.1 (homeobox protein 6)

HSP 1 Score: 118.2 bits (295), Expect = 9.4e-27
Identity = 59/104 (56.73%), Positives = 81/104 (77.88%), Query Frame = 0

Query: 30  KGEGNQGGKKRRLTVDQVRLLEKNFNDENKLEHDRKVQIAEEIGLRPRQVAVWFQNRRAR 89
           +G      KKRRL+++QV+ LEKNF  ENKLE +RKV++A+E+GL+PRQVAVWFQNRRAR
Sbjct: 54  RGHVGLSEKKRRLSINQVKALEKNFELENKLEPERKVKLAQELGLQPRQVAVWFQNRRAR 113

Query: 90  SKTKRIEKDYESLNAEYDKLKNDFDSLLKLNQELKAEVDQLREK 134
            KTK++EKDY  L  +YD L+++FDSL + N+ L  E+ +L+ K
Sbjct: 114 WKTKQLEKDYGVLKTQYDSLRHNFDSLRRDNESLLQEISKLKTK 157

BLAST of Cla97C05G106720 vs. TAIR 10
Match: AT3G01470.1 (homeobox 1)

HSP 1 Score: 117.5 bits (293), Expect = 1.6e-26
Identity = 58/105 (55.24%), Positives = 82/105 (78.10%), Query Frame = 0

Query: 38  KKRRLTVDQVRLLEKNFNDENKLEHDRKVQIAEEIGLRPRQVAVWFQNRRARSKTKRIEK 97
           KKRRLT +QV LLEK+F  ENKLE +RK Q+A+++GL+PRQVAVWFQNRRAR KTK++E+
Sbjct: 68  KKRRLTTEQVHLLEKSFETENKLEPERKTQLAKKLGLQPRQVAVWFQNRRARWKTKQLER 127

Query: 98  DYESLNAEYDKLKNDFDSLLKLNQELKAEVDQLREKWVASEKMKN 143
           DY+ L + YD+L +++DS++  N +L++EV  L EK    ++  N
Sbjct: 128 DYDLLKSTYDQLLSNYDSIVMDNDKLRSEVTSLTEKLQGKQETAN 172

BLAST of Cla97C05G106720 vs. TAIR 10
Match: AT4G40060.1 (homeobox protein 16)

HSP 1 Score: 117.1 bits (292), Expect = 2.1e-26
Identity = 60/105 (57.14%), Positives = 78/105 (74.29%), Query Frame = 0

Query: 38  KKRRLTVDQVRLLEKNFNDENKLEHDRKVQIAEEIGLRPRQVAVWFQNRRARSKTKRIEK 97
           KKRRL VDQV+ LEKNF  ENKLE +RK ++A+E+GL+PRQVAVWFQNRRAR KTK++EK
Sbjct: 59  KKRRLKVDQVKALEKNFELENKLEPERKTKLAQELGLQPRQVAVWFQNRRARWKTKQLEK 118

Query: 98  DYESLNAEYDKLKNDFDSLLKLNQELKAEVDQLREKWVASEKMKN 143
           DY  L  +YD L+++FDSL + N  L  E+ +++ K    E   N
Sbjct: 119 DYGVLKGQYDSLRHNFDSLRRDNDSLLQEISKIKAKVNGEEDNNN 163

BLAST of Cla97C05G106720 vs. TAIR 10
Match: AT1G27045.1 (Homeobox-leucine zipper protein family)

HSP 1 Score: 113.2 bits (282), Expect = 3.0e-25
Identity = 62/113 (54.87%), Positives = 83/113 (73.45%), Query Frame = 0

Query: 38  KKRRLTVDQVRLLEKNFNDENKLEHDRKVQIAEEIGLRPRQVAVWFQNRRARSKTKRIEK 97
           KKR+LT  Q+RLLE++F +E +LE DRK+ +AE++GL+P QVAVWFQNRRAR KTK++E 
Sbjct: 68  KKRKLTPIQLRLLEESFEEEKRLEPDRKLWLAEKLGLQPSQVAVWFQNRRARYKTKQLEH 127

Query: 98  DYESLNAEYDKLKNDFDSLLKLNQELKAEVDQLREKWVASEKMKNPFESVGVE 151
           D +SL A Y KLK D+D L   NQ LK++VD L+EK     KM+   E+  +E
Sbjct: 128 DCDSLKASYAKLKTDWDILFVQNQTLKSKVDLLKEKL----KMQENLETQSIE 176

BLAST of Cla97C05G106720 vs. TAIR 10
Match: AT5G15150.1 (homeobox 3)

HSP 1 Score: 110.9 bits (276), Expect = 1.5e-24
Identity = 55/97 (56.70%), Positives = 79/97 (81.44%), Query Frame = 0

Query: 36  GGKKRRLTVDQVRLLEKNFNDENKLEHDRKVQIAEEIGLRPRQVAVWFQNRRARSKTKRI 95
           G KK+RL ++QVR LEK+F   NKLE +RK+Q+A+ +GL+PRQ+A+WFQNRRAR KTK++
Sbjct: 113 GEKKKRLNLEQVRALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQL 172

Query: 96  EKDYESLNAEYDKLKNDFDSLLKLNQELKAEVDQLRE 133
           E+DY+SL  ++D LK+D DSLL  N++L AE+  L++
Sbjct: 173 ERDYDSLKKQFDVLKSDNDSLLAHNKKLHAELVALKK 209

The following BLAST results are available for this feature:

BLAST vs. NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038892732.1 | 8.6e-115 | 88.19 | homeobox-leucine zipper protein ATHB-54 [Benincasa hispida]
XP_004135200.1 | 7.8e-108 | 84.25 | homeobox-leucine zipper protein HAT5 [Cucumis sativus] >KGN51881.1 hypothetical ...
XP_008446309.1 | 3.3e-106 | 83.86 | PREDICTED: homeobox-leucine zipper protein ATHB-16 [Cucumis melo]
KAA0034384.1 | 1.7e-91 | 84.07 | homeobox-leucine zipper protein ATHB-16 [Cucumis melo var. makuwa] >TYK15535.1 h...
XP_022151198.1 | 3.1e-88 | 70.83 | homeobox-leucine zipper protein ATHB-16 [Momordica charantia]

BLAST vs. ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
P46668 | 1.3e-25 | 56.73 | Homeobox-leucine zipper protein ATHB-6 OS=Arabidopsis thaliana OX=3702 GN=ATHB-6...
Q02283 | 2.2e-25 | 55.24 | Homeobox-leucine zipper protein HAT5 OS=Arabidopsis thaliana OX=3702 GN=HAT5 PE=...
Q940J1 | 2.9e-25 | 57.14 | Homeobox-leucine zipper protein ATHB-16 OS=Arabidopsis thaliana OX=3702 GN=ATHB-...
A2XD08 | 5.0e-25 | 54.95 | Homeobox-leucine zipper protein HOX21 OS=Oryza sativa subsp. indica OX=39946 GN=...
Q8S7W9 | 5.0e-25 | 54.95 | Homeobox-leucine zipper protein HOX21 OS=Oryza sativa subsp. japonica OX=39947 G...

BLAST vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A0A0KVU6 | 3.8e-108 | 84.25 | Homeobox protein OS=Cucumis sativus OX=3659 GN=Csa_5G604260 PE=4 SV=1
A0A1S3BER7 | 1.6e-106 | 83.86 | homeobox-leucine zipper protein ATHB-16 OS=Cucumis melo OX=3656 GN=LOC103489083 ...
A0A5D3CVI2 | 8.4e-92 | 84.07 | Homeobox-leucine zipper protein ATHB-16 OS=Cucumis melo var. makuwa OX=1194695 G...
A0A6J1DBJ2 | 1.5e-88 | 70.83 | homeobox-leucine zipper protein ATHB-16 OS=Momordica charantia OX=3673 GN=LOC111...
A0A6J1GYD2 | 6.9e-86 | 68.68 | homeobox-leucine zipper protein ATHB-16-like OS=Cucurbita moschata OX=3662 GN=LO...

BLAST vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT2G22430.1 | 9.4e-27 | 56.73 | homeobox protein 6
AT3G01470.1 | 1.6e-26 | 55.24 | homeobox 1
AT4G40060.1 | 2.1e-26 | 57.14 | homeobox protein 16
AT1G27045.1 | 3.0e-25 | 54.87 | Homeobox-leucine zipper protein family
AT5G15150.1 | 1.5e-24 | 56.70 | homeobox 3

InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 85..140
None | No IPR available | GENE3D | 1.10.10.60 | - | coord: 31..100; e-value: 1.8E-17; score: 64.6
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 179..212
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 188..207
None | No IPR available | PANTHER | PTHR24326 | HOMEOBOX-LEUCINE ZIPPER PROTEIN | coord: 34..232
None | No IPR available | PANTHER | PTHR24326:SF575 | HOMEOBOX-LEUCINE ZIPPER PROTEIN HAT5-LIKE | coord: 34..232
IPR000047 | Helix-turn-helix motif | PRINTS | PR00031 | HTHREPRESSR | coord: 73..89; score: 58.88
IPR000047 | Helix-turn-helix motif | PRINTS | PR00031 | HTHREPRESSR | coord: 64..73; score: 31.26
IPR001356 | Homeobox domain | SMART | SM00389 | HOX_1 | coord: 36..97; e-value: 2.3E-16; score: 70.3
IPR001356 | Homeobox domain | PFAM | PF00046 | Homeodomain | coord: 38..91; e-value: 1.2E-15; score: 57.0
IPR001356 | Homeobox domain | PROSITE | PS50071 | HOMEOBOX_2 | coord: 33..93; score: 16.535826
IPR001356 | Homeobox domain | CDD | cd00086 | homeodomain | coord: 38..94; e-value: 9.79906E-14; score: 62.2608
IPR003106 | Leucine zipper, homeobox-associated | PFAM | PF02183 | HALZ | coord: 93..133; e-value: 3.2E-12; score: 46.5
IPR017970 | Homeobox, conserved site | PROSITE | PS00027 | HOMEOBOX_1 | coord: 68..91
IPR009057 | Homeobox-like domain superfamily | SUPERFAMILY | 46689 | Homeodomain-like | coord: 18..93
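
The coord values in the InterPro annotations give residue ranges on the protein. As a rough sketch, the two PFAM ranges (Homeodomain 38..91 and HALZ 93..133) can be cut out of the protein sequence as follows. The domain_slice helper is hypothetical, and the coordinates are assumed to be 1-based and inclusive.

```python
# Protein sequence of Cla97C05G106720, copied from the Sequences section.
PROTEIN = (
    "MDSETTRFLAKAEASSLPCFWVPDSCCSMKGEGNQGGKKRRLTVDQVRLLEKNFNDENKL"
    "EHDRKVQIAEEIGLRPRQVAVWFQNRRARSKTKRIEKDYESLNAEYDKLKNDFDSLLKLN"
    "QELKAEVDQLREKWVASEKMKNPFESVGVEAMDSSVTELGKANTKTMVEILYKVQKESSR"
    "QEEGSRSSSKSDGFYSESPARENQSKSGNFLQDEEDELGYLGKLEDELSANELMDSFNTF"
    "SSIVENQSFCFWSY"
)

# PFAM ranges taken from the InterPro annotations.
DOMAINS = {
    "PF00046 Homeodomain": (38, 91),
    "PF02183 HALZ": (93, 133),
}


def domain_slice(seq, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"range {start}..{end} outside sequence of length {len(seq)}")
    return seq[start - 1:end]


segments = {name: domain_slice(PROTEIN, s, e) for name, (s, e) in DOMAINS.items()}
for name, segment in segments.items():
    print(name, len(segment), segment)
```

The homeodomain slice begins at the KKRRL motif that is conserved across the BLAST hits above, and the HALZ slice covers the leucine-zipper stretch that follows it.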

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cla97C05G106720.1 | Cla97C05G106720.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0045893 | positive regulation of transcription, DNA-templated
biological_process | GO:0006357 | regulation of transcription by RNA polymerase II
biological_process | GO:0006355 | regulation of transcription, DNA-templated
cellular_component | GO:0005634 | nucleus
molecular_function | GO:0000981 | DNA-binding transcription factor activity, RNA polymerase II-specific
molecular_function | GO:0043565 | sequence-specific DNA binding
molecular_function | GO:0003677 | DNA binding