Cla013625 (gene) Watermelon (97103) v1

Name: Cla013625
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: AT hook motif DNA-binding family protein (AHRD V1 ***- A1L4X7_ARATH); contains InterPro domain(s) IPR005175 Protein of unknown function DUF296
Location: Chr2 : 27381598 .. 27382772 (-)
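The location string above can be turned into structured coordinates with a few lines of code. The sketch below is illustrative only: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which this record does not state explicitly.

```python
import re

def parse_location(text: str) -> dict:
    """Parse a location such as 'Chr2 : 27381598 .. 27382772 (-)'.

    Assumption: start and end are 1-based and inclusive, so the
    feature length is end - start + 1.
    """
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("Chr2 : 27381598 .. 27382772 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the gene spans 1175 bp on the minus strand of Chr2.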
   



The following sequences are available for this feature:

Gene sequence (with intron)

ATGATGTTTATGCAACAATGTAAGCGGGAAATTTGTATCCTTTCTGCATCTGGTTCAATCTCCAATGCATCTCTCCGTCAGCCAGCCGCATCCGGAGGCAATATTGCATATGAGGTATTTTTGTCTTCCCCATCTGAGATTCCATCTCTTTCTCAGTGTGCAAAAGAAAAGAAAATGAGAATAATGCTTGTAGCAATCCTGGATAAATTTGTTAAATCTTGATATATTTTGTGGGCCGTTGCAGAAGATGGACAAGGGATAACTGGCTACTCATTTTGATGATCATCTACGTTTCTTTTTTCCTCCACAAATTTGCTTCTCATGAATTATGCATTCTAATAACTATTTGCAATGTAATGATACTTAGGGTCGTTTTGAGATTGTTTCGTTATGCGGATCTTATGTACGAACTGACCTTGGAGGAAAGACGGGTGGTCTTAGTGTATGTCTATCAAGTGCTGAAGGCCACATCATAGGAGGGGGAGTTGGTGGACCGTTGAAGGCTGCTGGACCCGTGCAGGTATGTTAATGGCCAGATTGGATCTGTCCTTTCCTGAATAAGTCCAGTTGCGCACTGTTTAGGGAGTGGGAGATTTCAGTCAATGCTTGGATGTTCTATTGCATTCATGTAACTTTGTTAAGTTAGTAGGCTTAAAGAGTCTTCATAGATATATTATTCTTTTGCTTAGTTCATTATTGTACCGATCTGTCCACATCAGGTTATTGTTGGAACCTTCGTAATCGACCCCAAGAAGGAAGTTGGTGGTGGTAAAGGCGATGCATCTGCTGGCAAGTTGTCCTCACCTATTGGTGGGACATCGATGTCAAATCTACGCTATGGCTCCAACATTGACTCGGGAGGTAATCATGTAAGGGGAAATGATGAACACCAAGGTCTTGGAGAGGGTCATTTTTTGCTTCAGCCCCGGGGAGTGAATCTGACATCACCGCGATCAACGGATTGGAGAACGGGTCTGGATGCCACAAACACTGCTTATGATTTGACAGGTATGATTTTACTGTGACAAAGGCACACAACTCATCTTTTACTTTTTCTGAACTAGTGATATTATGAGATTGGGAACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGGTAGGAAGAACAGGCCATCATTCTCCCGAAAACGGAGATTACGATCAGATTCCCGATTGA

mRNA sequence

ATGATGTTTATGCAACAATGTAAGCGGGAAATTTGTATCCTTTCTGCATCTGGTTCAATCTCCAATGCATCTCTCCGTCAGCCAGCCGCATCCGGAGGCAATATTGCATATGAGGGTCGTTTTGAGATTGTTTCGTTATGCGGATCTTATGTACGAACTGACCTTGGAGGAAAGACGGGTGGTCTTAGTGTATGTCTATCAAGTGCTGAAGGCCACATCATAGGAGGGGGAGTTGGTGGACCGTTGAAGGCTGCTGGACCCGTGCAGGTTATTGTTGGAACCTTCGTAATCGACCCCAAGAAGGAAGTTGGTGGTGGTAAAGGCGATGCATCTGCTGGCAAGTTGTCCTCACCTATTGGTGGGACATCGATGTCAAATCTACGCTATGGCTCCAACATTGACTCGGGAGGTAATCATGTAAGGGGAAATGATGAACACCAAGGTCTTGGAGAGGGTCATTTTTTGCTTCAGCCCCGGGGAGTGAATCTGACATCACCGCGATCAACGGATTGGAGAACGGGTCTGGATGCCACAAACACTGCTTATGATTTGACAGGAAGAACAGGCCATCATTCTCCCGAAAACGGAGATTACGATCAGATTCCCGATTGA

Coding sequence (CDS)

ATGATGTTTATGCAACAATGTAAGCGGGAAATTTGTATCCTTTCTGCATCTGGTTCAATCTCCAATGCATCTCTCCGTCAGCCAGCCGCATCCGGAGGCAATATTGCATATGAGGGTCGTTTTGAGATTGTTTCGTTATGCGGATCTTATGTACGAACTGACCTTGGAGGAAAGACGGGTGGTCTTAGTGTATGTCTATCAAGTGCTGAAGGCCACATCATAGGAGGGGGAGTTGGTGGACCGTTGAAGGCTGCTGGACCCGTGCAGGTTATTGTTGGAACCTTCGTAATCGACCCCAAGAAGGAAGTTGGTGGTGGTAAAGGCGATGCATCTGCTGGCAAGTTGTCCTCACCTATTGGTGGGACATCGATGTCAAATCTACGCTATGGCTCCAACATTGACTCGGGAGGTAATCATGTAAGGGGAAATGATGAACACCAAGGTCTTGGAGAGGGTCATTTTTTGCTTCAGCCCCGGGGAGTGAATCTGACATCACCGCGATCAACGGATTGGAGAACGGGTCTGGATGCCACAAACACTGCTTATGATTTGACAGGAAGAACAGGCCATCATTCTCCCGAAAACGGAGATTACGATCAGATTCCCGATTGA

Protein sequence

MMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIGGTSMSNLRYGSNIDSGGNHVRGNDEHQGLGEGHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLTGRTGHHSPENGDYDQIPD
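The relationship between the coding sequence and the protein sequence can be checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not the pipeline used to produce this record: `translate` is a hypothetical helper, the standard (NCBI table 1) code is assumed, and only the first 48 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 48 nt of the Cla013625 CDS, copied from the record above.
CDS_PREFIX = "ATGATGTTTATGCAACAATGTAAGCGGGAAATTTGTATCCTTTCTGCA"
print(translate(CDS_PREFIX))  # MMFMQQCKREICILSA
```

The output matches the first 16 residues of the protein sequence listed above; passing the full CDS to `translate` should yield the complete protein in the same way.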
BLAST of Cla013625 vs. Swiss-Prot
Match: AHL14_ARATH (AT-hook motif nuclear-localized protein 14 OS=Arabidopsis thaliana GN=AHL14 PE=1 SV=1)

HSP 1 Score: 212.6 bits (540), Expect = 4.0e-54
Identity = 119/223 (53.36%), Positives = 156/223 (69.96%), Query Frame = 1

Query: 1   MMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTG 60
           MMF  Q K E+C+LSASG+ISNASLRQPA SGGN+ YEG++EI+SL GSY+RT+ GGK+G
Sbjct: 189 MMFANQSKHELCVLSASGTISNASLRQPAPSGGNLPYEGQYEILSLSGSYIRTEQGGKSG 248

Query: 61  GLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEV--GGGKGDA--SAGKLS 120
           GLSV LS+++G IIGG +G  L AAGPVQVI+GTF +D KK+    GGKGDA  S  +L+
Sbjct: 249 GLSVSLSASDGQIIGGAIGSHLTAAGPVQVILGTFQLDRKKDAAGSGGKGDASNSGSRLT 308

Query: 121 SPIGGTSMSNLRYGSNIDS-GGNHVRGNDE------HQ-GL-GEGHFLLQ-PRGVNLTSP 180
           SP+    +  + +   ++S G N +RGNDE      HQ GL G  HF++Q P+G+++T  
Sbjct: 309 SPVSSGQLLGMGFPPGMESTGRNPMRGNDEQHDHHHHQAGLGGPHHFMMQAPQGIHMTHS 368

Query: 181 RSTDWRTGLDATNT-----AYDLTGRTGHHSPENGDYD-QIPD 204
           R ++WR G ++ +       YDL+GR GH S ENGDY+ QIPD
Sbjct: 369 RPSEWRGGGNSGHDGRGGGGYDLSGRIGHESSENGDYEQQIPD 411
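The summary lines of an HSP like the one above (bit score, E-value, identity and positives counts) can be extracted programmatically and the percentages recomputed as a sanity check. This is a rough sketch for the text layout shown in this record only; `parse_hsp` is a hypothetical helper, not part of any BLAST toolkit, and the regular expression also accepts the spelling "Postives" that appears in some exports of this page.

```python
import re

def parse_hsp(score_line: str, identity_line: str) -> dict:
    """Extract the summary statistics from the two HSP header lines."""
    score = re.search(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)", score_line)
    ident = re.search(r"Identity = (\d+)/(\d+)", identity_line)
    pos = re.search(r"Posi?tives = (\d+)/(\d+)", identity_line)
    if not (score and ident and pos):
        raise ValueError("unrecognised HSP header")
    id_num, aln_len = int(ident.group(1)), int(ident.group(2))
    pos_num = int(pos.group(1))
    return {
        "bit_score": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(score.group(3)),
        "alignment_length": aln_len,
        # Percentages are recomputed from the counts, not read from the text.
        "identity_pct": round(100 * id_num / aln_len, 2),
        "positives_pct": round(100 * pos_num / aln_len, 2),
    }

hsp = parse_hsp(
    "HSP 1 Score: 212.6 bits (540), Expect = 4.0e-54",
    "Identity = 119/223 (53.36%), Positives = 156/223 (69.96%), Query Frame = 1",
)
print(hsp["identity_pct"], hsp["positives_pct"], hsp["evalue"])
```

The recomputed values (53.36 and 69.96) agree with the percentages printed in the AHL14_ARATH header.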

BLAST of Cla013625 vs. Swiss-Prot
Match: AHL3_ARATH (AT-hook motif nuclear-localized protein 3 OS=Arabidopsis thaliana GN=AHL3 PE=1 SV=1)

HSP 1 Score: 116.3 bits (290), Expect = 3.9e-25
Identity = 57/99 (57.58%), Positives = 71/99 (71.72%), Query Frame = 1

Query: 1   MMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGG--- 60
           M F QQ  R ICILSA+G ISN +LRQ   SGG + YEGRFEI+SL GS+++ D GG   
Sbjct: 184 MTFSQQGSRAICILSANGPISNVTLRQSMTSGGTLTYEGRFEILSLTGSFMQNDSGGTRS 243

Query: 61  KTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFV 97
           + GG+SVCL+  +G + GGG+ G   AAGPVQV+VGTF+
Sbjct: 244 RAGGMSVCLAGPDGRVFGGGLAGLFLAAGPVQVMVGTFI 282

BLAST of Cla013625 vs. Swiss-Prot
Match: AHL1_ARATH (AT-hook motif nuclear-localized protein 1 OS=Arabidopsis thaliana GN=AHL1 PE=1 SV=1)

HSP 1 Score: 114.8 bits (286), Expect = 1.1e-24
Identity = 61/127 (48.03%), Positives = 83/127 (65.35%), Query Frame = 1

Query: 3   FMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGG---KT 62
           F QQ  R IC+LSA+G IS+ +LRQP +SGG + YEGRFEI+SL GS++  D GG   +T
Sbjct: 190 FSQQGPRSICVLSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPNDSGGTRSRT 249

Query: 63  GGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPI 122
           GG+SV L+S +G ++GGG+ G L AA PVQV+VG+F+     +    K +     LSSP 
Sbjct: 250 GGMSVSLASPDGRVVGGGLAGLLVAASPVQVVVGSFLAGTDHQDQKPKKNKHDFMLSSPT 309

Query: 123 GGTSMSN 127
               +S+
Sbjct: 310 AAIPISS 316

BLAST of Cla013625 vs. Swiss-Prot
Match: AHL6_ARATH (AT-hook motif nuclear-localized protein 6 OS=Arabidopsis thaliana GN=AHL6 PE=2 SV=1)

HSP 1 Score: 110.9 bits (276), Expect = 1.6e-23
Identity = 54/100 (54.00%), Positives = 73/100 (73.00%), Query Frame = 1

Query: 1   MMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLG---G 60
           M + QQ  R ICILSA+GSISN +L QP  +GG + YEGRFEI+SL GS++ T+ G   G
Sbjct: 178 MPYSQQGSRAICILSATGSISNVTLGQPTNAGGTLTYEGRFEILSLSGSFMPTENGGTKG 237

Query: 61  KTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVI 98
           + GG+S+ L+   G+I GGG+ G L AAGPVQV++G+F++
Sbjct: 238 RAGGMSISLAGPNGNIFGGGLAGMLIAAGPVQVVMGSFIV 277

BLAST of Cla013625 vs. Swiss-Prot
Match: AHL10_ARATH (AT-hook motif nuclear-localized protein 10 OS=Arabidopsis thaliana GN=AHL10 PE=1 SV=2)

HSP 1 Score: 109.4 bits (272), Expect = 4.7e-23
Identity = 59/114 (51.75%), Positives = 78/114 (68.42%), Query Frame = 1

Query: 9   REICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGG---KTGGLSVC 68
           R +C+LSA+G+ISN +LRQ A SGG + YEGRFEI+SL GS+   +  G   +TGGLSV 
Sbjct: 189 RAVCVLSANGAISNVTLRQSATSGGTVTYEGRFEILSLSGSFHLLENNGQRSRTGGLSVS 248

Query: 69  LSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPI 120
           LSS +G+++GG V G L AA PVQ++VG+F+ D +KE     G      LSSP+
Sbjct: 249 LSSPDGNVLGGSVAGLLIAASPVQIVVGSFLPDGEKEPKQHVGQMG---LSSPV 299

BLAST of Cla013625 vs. TrEMBL
Match: A0A0A0LJ73_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G009590 PE=4 SV=1)

HSP 1 Score: 399.4 bits (1025), Expect = 2.5e-108
Identity = 194/203 (95.57%), Positives = 196/203 (96.55%), Query Frame = 1

Query: 1   MMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTG 60
           M FMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTG
Sbjct: 174 MQFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTG 233

Query: 61  GLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIG 120
           GLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE GGGKGD SA KL SPIG
Sbjct: 234 GLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEFGGGKGDGSAVKLPSPIG 293

Query: 121 GTSMSNLRYGSNIDSGGNHVRGNDEHQGLGEGHFLLQPRGVNLTSPRSTDWRTGLDATNT 180
           GTSMSNLRYGSNIDSGGN +RGNDEHQGLGE HFLLQPRGVNLTSPRSTDWRTGLDATNT
Sbjct: 294 GTSMSNLRYGSNIDSGGNQIRGNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNT 353

Query: 181 AYDLTGRTGHHSPENGDYDQIPD 204
           AYDL+GRTGHHSPENGDYDQIPD
Sbjct: 354 AYDLSGRTGHHSPENGDYDQIPD 376

BLAST of Cla013625 vs. TrEMBL
Match: A0A061F8Z8_THECC (AT hook motif DNA-binding family protein OS=Theobroma cacao GN=TCM_026254 PE=4 SV=1)

HSP 1 Score: 287.0 bits (733), Expect = 1.8e-74
Identity = 146/208 (70.19%), Positives = 172/208 (82.69%), Query Frame = 1

Query: 1   MMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTG 60
           MMFMQQ KREICILSASG+ISNASLRQPA SGGNI YEGRFEI+SL GSYVRT+ GG+TG
Sbjct: 144 MMFMQQSKREICILSASGTISNASLRQPATSGGNITYEGRFEIISLSGSYVRTETGGRTG 203

Query: 61  GLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEV-GGGKGDASAGKLSSPI 120
           GLSVCLSSA+G IIGGG+GGPLKAAGPVQVIVGTFVID KK+V  G KGDAS  KL SP+
Sbjct: 204 GLSVCLSSADGQIIGGGIGGPLKAAGPVQVIVGTFVIDNKKDVSAGAKGDASGSKLPSPV 263

Query: 121 GGTSMSNLRYGSNID-SGGNHVRGNDEHQGLGEGHFLLQPRGVNLTSPRSTDWRTGLDAT 180
           GGTS+SN+ + S  + SG N + GND+HQ  G  HF++QPRG+++ +PR ++WR+GLD  
Sbjct: 264 GGTSVSNVGFRSAFETSGRNPIGGNDDHQSFGGSHFMMQPRGMHV-APRPSEWRSGLD-D 323

Query: 181 NTAYDLTGRTG---HHSPENGDYDQIPD 204
            T ++LTG+TG   H SPENGDYDQI D
Sbjct: 324 RTGFELTGKTGHGAHQSPENGDYDQIAD 349

BLAST of Cla013625 vs. TrEMBL
Match: A0A0B0NXI7_GOSAR (Putative DNA-binding ESCAROLA-like protein OS=Gossypium arboreum GN=F383_10547 PE=4 SV=1)

HSP 1 Score: 277.3 bits (708), Expect = 1.5e-71
Identity = 141/204 (69.12%), Positives = 167/204 (81.86%), Query Frame = 1

Query: 1   MMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTG 60
           M+FMQQ KRE+CILSASG+ISNASLRQPA SGGNIAYEGRFEI+SL GSYVRT++GG+TG
Sbjct: 143 MLFMQQSKRELCILSASGTISNASLRQPATSGGNIAYEGRFEIISLSGSYVRTEIGGRTG 202

Query: 61  GLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGG-KGDASAGKLSSPI 120
           GLSVCLSSA+G IIGGGVGGPLKAAGPVQVIVGTF++D KK+     KGDAS  KL SP+
Sbjct: 203 GLSVCLSSADGQIIGGGVGGPLKAAGPVQVIVGTFMVDNKKDGSANVKGDASGSKLPSPV 262

Query: 121 GGTSMSNLRYGSNID-SGGNHVRGNDEHQGLGEGHFLLQPRGVNLTSPRSTDWRTGLDAT 180
            GTS+SN+ +    + SG N + GND+HQ  G  HFL+QP+G++L +PR TDWRTGLD  
Sbjct: 263 AGTSVSNIGFRPAFEASGRNPIDGNDDHQSFGGSHFLMQPQGLHL-APRPTDWRTGLD-D 322

Query: 181 NTAYDLTGRTG---HHSPENGDYD 200
            T ++LTG+TG   H SPENGDYD
Sbjct: 323 RTGFELTGKTGHGAHQSPENGDYD 344

BLAST of Cla013625 vs. TrEMBL
Match: A0A0D2PFJ7_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G280000 PE=4 SV=1)

HSP 1 Score: 276.6 bits (706), Expect = 2.5e-71
Identity = 140/204 (68.63%), Positives = 167/204 (81.86%), Query Frame = 1

Query: 1   MMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTG 60
           M+FMQQ KRE+CILSASG+ISNASLRQPA SGGNIAYEGRFEI+SL GSYVRT++GG+TG
Sbjct: 30  MLFMQQSKRELCILSASGTISNASLRQPATSGGNIAYEGRFEIISLSGSYVRTEIGGRTG 89

Query: 61  GLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGG-KGDASAGKLSSPI 120
           GLSVCLSSA+G IIGGGVGGPLKAAGPVQVIVGTF++D KK+     KGDAS  KL SP+
Sbjct: 90  GLSVCLSSADGQIIGGGVGGPLKAAGPVQVIVGTFMVDNKKDGSANVKGDASGSKLPSPV 149

Query: 121 GGTSMSNLRYGSNID-SGGNHVRGNDEHQGLGEGHFLLQPRGVNLTSPRSTDWRTGLDAT 180
            GTS+SN+ +    + SG N + GND+HQ  G  HF++QP+G++L +PR TDWRTGLD  
Sbjct: 150 AGTSVSNIGFRPAFEASGRNPIDGNDDHQSFGGSHFMMQPQGLHL-APRPTDWRTGLD-D 209

Query: 181 NTAYDLTGRTG---HHSPENGDYD 200
            T ++LTG+TG   H SPENGDYD
Sbjct: 210 RTGFELTGKTGHGAHQSPENGDYD 231

BLAST of Cla013625 vs. TrEMBL
Match: A0A0D2R085_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G280000 PE=4 SV=1)

HSP 1 Score: 276.6 bits (706), Expect = 2.5e-71
Identity = 140/204 (68.63%), Positives = 167/204 (81.86%), Query Frame = 1

Query: 1   MMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTG 60
           M+FMQQ KRE+CILSASG+ISNASLRQPA SGGNIAYEGRFEI+SL GSYVRT++GG+TG
Sbjct: 1   MLFMQQSKRELCILSASGTISNASLRQPATSGGNIAYEGRFEIISLSGSYVRTEIGGRTG 60

Query: 61  GLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGG-KGDASAGKLSSPI 120
           GLSVCLSSA+G IIGGGVGGPLKAAGPVQVIVGTF++D KK+     KGDAS  KL SP+
Sbjct: 61  GLSVCLSSADGQIIGGGVGGPLKAAGPVQVIVGTFMVDNKKDGSANVKGDASGSKLPSPV 120

Query: 121 GGTSMSNLRYGSNID-SGGNHVRGNDEHQGLGEGHFLLQPRGVNLTSPRSTDWRTGLDAT 180
            GTS+SN+ +    + SG N + GND+HQ  G  HF++QP+G++L +PR TDWRTGLD  
Sbjct: 121 AGTSVSNIGFRPAFEASGRNPIDGNDDHQSFGGSHFMMQPQGLHL-APRPTDWRTGLD-D 180

Query: 181 NTAYDLTGRTG---HHSPENGDYD 200
            T ++LTG+TG   H SPENGDYD
Sbjct: 181 RTGFELTGKTGHGAHQSPENGDYD 202

BLAST of Cla013625 vs. NCBI nr
Match: gi|449443249|ref|XP_004139392.1| (PREDICTED: AT-hook motif nuclear-localized protein 14 [Cucumis sativus])

HSP 1 Score: 399.4 bits (1025), Expect = 3.7e-108
Identity = 194/203 (95.57%), Positives = 196/203 (96.55%), Query Frame = 1

Query: 1   MMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTG 60
           M FMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTG
Sbjct: 160 MQFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTG 219

Query: 61  GLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIG 120
           GLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE GGGKGD SA KL SPIG
Sbjct: 220 GLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEFGGGKGDGSAVKLPSPIG 279

Query: 121 GTSMSNLRYGSNIDSGGNHVRGNDEHQGLGEGHFLLQPRGVNLTSPRSTDWRTGLDATNT 180
           GTSMSNLRYGSNIDSGGN +RGNDEHQGLGE HFLLQPRGVNLTSPRSTDWRTGLDATNT
Sbjct: 280 GTSMSNLRYGSNIDSGGNQIRGNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNT 339

Query: 181 AYDLTGRTGHHSPENGDYDQIPD 204
           AYDL+GRTGHHSPENGDYDQIPD
Sbjct: 340 AYDLSGRTGHHSPENGDYDQIPD 362

BLAST of Cla013625 vs. NCBI nr
Match: gi|700205653|gb|KGN60772.1| (hypothetical protein Csa_2G009590 [Cucumis sativus])

HSP 1 Score: 399.4 bits (1025), Expect = 3.7e-108
Identity = 194/203 (95.57%), Positives = 196/203 (96.55%), Query Frame = 1

Query: 1   MMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTG 60
           M FMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTG
Sbjct: 174 MQFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTG 233

Query: 61  GLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIG 120
           GLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE GGGKGD SA KL SPIG
Sbjct: 234 GLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEFGGGKGDGSAVKLPSPIG 293

Query: 121 GTSMSNLRYGSNIDSGGNHVRGNDEHQGLGEGHFLLQPRGVNLTSPRSTDWRTGLDATNT 180
           GTSMSNLRYGSNIDSGGN +RGNDEHQGLGE HFLLQPRGVNLTSPRSTDWRTGLDATNT
Sbjct: 294 GTSMSNLRYGSNIDSGGNQIRGNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNT 353

Query: 181 AYDLTGRTGHHSPENGDYDQIPD 204
           AYDL+GRTGHHSPENGDYDQIPD
Sbjct: 354 AYDLSGRTGHHSPENGDYDQIPD 376

BLAST of Cla013625 vs. NCBI nr
Match: gi|659070735|ref|XP_008456410.1| (PREDICTED: putative DNA-binding protein ESCAROLA isoform X1 [Cucumis melo])

HSP 1 Score: 398.7 bits (1023), Expect = 6.2e-108
Identity = 193/203 (95.07%), Positives = 195/203 (96.06%), Query Frame = 1

Query: 1   MMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTG 60
           M FMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTG
Sbjct: 162 MQFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTG 221

Query: 61  GLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIG 120
           GLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE GGGKGD SAGKL SPIG
Sbjct: 222 GLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEFGGGKGDGSAGKLPSPIG 281

Query: 121 GTSMSNLRYGSNIDSGGNHVRGNDEHQGLGEGHFLLQPRGVNLTSPRSTDWRTGLDATNT 180
           GTSMSNLRYGSNIDSGGN +RGNDEHQGLGE HFLLQPRGVNLTSPRSTDWRTGLDATN 
Sbjct: 282 GTSMSNLRYGSNIDSGGNQIRGNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNA 341

Query: 181 AYDLTGRTGHHSPENGDYDQIPD 204
           AYDL+GRT HHSPENGDYDQIPD
Sbjct: 342 AYDLSGRTSHHSPENGDYDQIPD 364

BLAST of Cla013625 vs. NCBI nr
Match: gi|659070737|ref|XP_008456418.1| (PREDICTED: putative DNA-binding protein ESCAROLA isoform X2 [Cucumis melo])

HSP 1 Score: 398.7 bits (1023), Expect = 6.2e-108
Identity = 193/203 (95.07%), Positives = 195/203 (96.06%), Query Frame = 1

Query: 1   MMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTG 60
           M FMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTG
Sbjct: 160 MQFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTG 219

Query: 61  GLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIG 120
           GLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE GGGKGD SAGKL SPIG
Sbjct: 220 GLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEFGGGKGDGSAGKLPSPIG 279

Query: 121 GTSMSNLRYGSNIDSGGNHVRGNDEHQGLGEGHFLLQPRGVNLTSPRSTDWRTGLDATNT 180
           GTSMSNLRYGSNIDSGGN +RGNDEHQGLGE HFLLQPRGVNLTSPRSTDWRTGLDATN 
Sbjct: 280 GTSMSNLRYGSNIDSGGNQIRGNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNA 339

Query: 181 AYDLTGRTGHHSPENGDYDQIPD 204
           AYDL+GRT HHSPENGDYDQIPD
Sbjct: 340 AYDLSGRTSHHSPENGDYDQIPD 362

BLAST of Cla013625 vs. NCBI nr
Match: gi|590642328|ref|XP_007030487.1| (AT hook motif DNA-binding family protein [Theobroma cacao])

HSP 1 Score: 287.0 bits (733), Expect = 2.6e-74
Identity = 146/208 (70.19%), Positives = 172/208 (82.69%), Query Frame = 1

Query: 1   MMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTG 60
           MMFMQQ KREICILSASG+ISNASLRQPA SGGNI YEGRFEI+SL GSYVRT+ GG+TG
Sbjct: 144 MMFMQQSKREICILSASGTISNASLRQPATSGGNITYEGRFEIISLSGSYVRTETGGRTG 203

Query: 61  GLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEV-GGGKGDASAGKLSSPI 120
           GLSVCLSSA+G IIGGG+GGPLKAAGPVQVIVGTFVID KK+V  G KGDAS  KL SP+
Sbjct: 204 GLSVCLSSADGQIIGGGIGGPLKAAGPVQVIVGTFVIDNKKDVSAGAKGDASGSKLPSPV 263

Query: 121 GGTSMSNLRYGSNID-SGGNHVRGNDEHQGLGEGHFLLQPRGVNLTSPRSTDWRTGLDAT 180
           GGTS+SN+ + S  + SG N + GND+HQ  G  HF++QPRG+++ +PR ++WR+GLD  
Sbjct: 264 GGTSVSNVGFRSAFETSGRNPIGGNDDHQSFGGSHFMMQPRGMHV-APRPSEWRSGLD-D 323

Query: 181 NTAYDLTGRTG---HHSPENGDYDQIPD 204
            T ++LTG+TG   H SPENGDYDQI D
Sbjct: 324 RTGFELTGKTGHGAHQSPENGDYDQIAD 349

The following BLAST results are available for this feature:

Swiss-Prot:
Match Name     E-value   Identity (%)   Description
AHL14_ARATH    4.0e-54   53.36          AT-hook motif nuclear-localized protein 14 OS=Arabidopsis thaliana GN=AHL14 PE=1...
AHL3_ARATH     3.9e-25   57.58          AT-hook motif nuclear-localized protein 3 OS=Arabidopsis thaliana GN=AHL3 PE=1 S...
AHL1_ARATH     1.1e-24   48.03          AT-hook motif nuclear-localized protein 1 OS=Arabidopsis thaliana GN=AHL1 PE=1 S...
AHL6_ARATH     1.6e-23   54.00          AT-hook motif nuclear-localized protein 6 OS=Arabidopsis thaliana GN=AHL6 PE=2 S...
AHL10_ARATH    4.7e-23   51.75          AT-hook motif nuclear-localized protein 10 OS=Arabidopsis thaliana GN=AHL10 PE=1...

TrEMBL:
Match Name          E-value    Identity (%)   Description
A0A0A0LJ73_CUCSA    2.5e-108   95.57          Uncharacterized protein OS=Cucumis sativus GN=Csa_2G009590 PE=4 SV=1
A0A061F8Z8_THECC    1.8e-74    70.19          AT hook motif DNA-binding family protein OS=Theobroma cacao GN=TCM_026254 PE=4 S...
A0A0B0NXI7_GOSAR    1.5e-71    69.12          Putative DNA-binding ESCAROLA-like protein OS=Gossypium arboreum GN=F383_10547 P...
A0A0D2PFJ7_GOSRA    2.5e-71    68.63          Uncharacterized protein OS=Gossypium raimondii GN=B456_007G280000 PE=4 SV=1
A0A0D2R085_GOSRA    2.5e-71    68.63          Uncharacterized protein OS=Gossypium raimondii GN=B456_007G280000 PE=4 SV=1

NCBI nr:
Match Name                          E-value    Identity (%)   Description
gi|449443249|ref|XP_004139392.1|    3.7e-108   95.57          PREDICTED: AT-hook motif nuclear-localized protein 14 [Cucumis sativus]
gi|700205653|gb|KGN60772.1|         3.7e-108   95.57          hypothetical protein Csa_2G009590 [Cucumis sativus]
gi|659070735|ref|XP_008456410.1|    6.2e-108   95.07          PREDICTED: putative DNA-binding protein ESCAROLA isoform X1 [Cucumis melo]
gi|659070737|ref|XP_008456418.1|    6.2e-108   95.07          PREDICTED: putative DNA-binding protein ESCAROLA isoform X2 [Cucumis melo]
gi|590642328|ref|XP_007030487.1|    2.6e-74    70.19          AT hook motif DNA-binding family protein [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR005175   PPC_dom

GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
biological_process   GO:0006355       regulation of transcription, DNA-templated
biological_process   GO:0006351       transcription, DNA-templated
cellular_component   GO:0005575       cellular_component
cellular_component   GO:0005634       nucleus
molecular_function   GO:0003677       DNA binding
molecular_function   GO:0003674       molecular_function
molecular_function   GO:0003680       AT DNA binding

This gene is associated with the following unigenes:
Unigene Name   Analysis Name                           Sequence type in Unigene
WMU40167       watermelon EST collection version 2.0   transcribed_cluster
WMU40569       watermelon EST collection version 2.0   transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name   Type
Cla013625      Cla013625.1   mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name   Unique Name   Type
WMU40569       WMU40569      transcribed_cluster
WMU40167       WMU40167      transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term    IPR Description    Source    Source Term          Source Description                                 Alignment
IPR005175   PPC domain         PFAM      PF03479              DUF296                                             coord: 2..95; score: 3.4
IPR005175   PPC domain         PROFILE   PS51742              PPC                                                coord: 1..120; score: 25
None        No IPR available   GENE3D    G3DSA:3.30.1330.80                                                      coord: 3..95; score: 4.2
None        No IPR available   PANTHER   PTHR31500            FAMILY NOT NAMED                                   coord: 1..203; score: 4.4
None        No IPR available   PANTHER   PTHR31500:SF22       AT HOOK MOTIF DNA-BINDING FAMILY PROTEIN-RELATED   coord: 1..203; score: 4.4
None        No IPR available   unknown   SSF117856            AF0104/ALDC/Ptd012-like                            coord: 5..96; score: 1.62
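
The `coord` values in the InterPro annotations refer to positions in the protein sequence of this record. As a hedged illustration, the sketch below slices out the Pfam PF03479 region (coord: 2..95). The helper name `slice_domain` is hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the record does not state.

```python
import re

# Protein sequence of Cla013625, copied from the "Protein sequence" section.
PROTEIN = (
    "MMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTG"
    "GLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIG"
    "GTSMSNLRYGSNIDSGGNHVRGNDEHQGLGEGHFLLQPRGVNLTSPRSTDWRTGLDATNT"
    "AYDLTGRTGHHSPENGDYDQIPD"
)

def slice_domain(protein: str, coord: str) -> str:
    """Return the residues covered by a 'coord: start..end' annotation.

    Assumption: start and end are 1-based and inclusive.
    """
    m = re.fullmatch(r"coord:\s*(\d+)\.\.(\d+)", coord.strip())
    if m is None:
        raise ValueError(f"unrecognised coordinate string: {coord!r}")
    start, end = int(m.group(1)), int(m.group(2))
    if not 1 <= start <= end <= len(protein):
        raise ValueError("coordinates fall outside the protein")
    return protein[start - 1:end]

ppc = slice_domain(PROTEIN, "coord: 2..95")
print(len(PROTEIN), len(ppc), ppc[:10])
```

With these assumptions the protein is 203 residues long, consistent with the PANTHER alignment coordinates 1..203, and the Pfam match covers 94 residues.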