CmaCh09G000200 (gene) Cucurbita maxima (Rimu)

Name: CmaCh09G000200
Type: gene
Organism: Cucurbita maxima (Rimu)
Description: AT hook motif DNA-binding family protein
Location: Cma_Chr09 : 119736 .. 120659 (+)
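
As a quick sanity check on the Location field, the span length can be derived from the two coordinates. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the usual convention for genome browsers, but not stated on this page); the names `START`, `END` and `span_length` are illustrative only.

```python
# Coordinates copied from the Location field; assumed 1-based and inclusive.
START, END = 119736, 120659


def span_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based inclusive interval."""
    return end - start + 1


gene_len = span_length(START, END)
codons, remainder = divmod(gene_len, 3)
print(gene_len, codons, remainder)  # prints: 924 308 0
```

If the whole span is coding, which the identical gene, mRNA and CDS sequences listed for this feature suggest, 308 codons would correspond to 307 residues plus a stop codon.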
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGAATGGCGGGGTACAGCGAAGAAGGAGGAGCGGGTTCACGCTACGTGCAACACCACCATCATCATCATCAACTTCTGAATCCGGAACTGCATCTGGACACGCCCTCCTCCTCCTCTATTCCATTTCCTCATCCTCATCTATTCAACCATTCCAATCACTCCGATGATGGTGATGATAATAACGATACTAATAATACTACTAATCCCTCTCAAAACCAACACGCCTCCCTTCGCCGCCCTCGTGGCCGTCCTCCTGGATCCAAAAACAAGCCCAAACCTCCCATTATCCTCACTCGCGATAGTCCCAACGTCCTCGGCTCTCATGTCCTCGAGGTCTCCGCCTCCGCTGATATCGTCGATAGTCTCTCCAATTACGCCCGCCGCAGAGGGAGAGGACTCAGCATCCTCTGCGGCACTGGTACTGTCGCCAACGTCACGCTCCGTCAGCCATCTGCCTCTCCTACGGCTAGCGTCATCACTCTTCATGGCAGGTTCGAGATTCTCTCCCTCACGGGCACTGTGCTTCCCCCGCCAGCTCCGCCGCAGGCAGGTGGACTGTCTATATTCCTAGCCGGGGCACAAGGGCAGGTGGTTGGAGGGACGGTGGTGGGGCCCTTGGTGGCTTCGGGGCCGGTGATTTTGATGGCGGCGTCATTCTCCAACGCTGTGTTTGAAAGGTTGCCTCTGGAAGAAGAGGAGGAAGGGGGAGTGCAAGTTCAACCGACGGCGTCGCAGTCGTCGGGAGTGACCGGAGGCGGACAGATGGGGGAGGGCGGTGGGAATGCAAACGAGGGGAGCGGCGGCGGAGTTGGGTTCCTGGGCAACAGTACAATAGCGGGATATCCTCCGTTTGCAGGGGATTTGTTTGGTTGGGGAAGTGGAAGTGGAAGTGGGAATGCCACAAAGCCTCGTCAGTTCTAA

mRNA sequence

ATGAGAATGGCGGGGTACAGCGAAGAAGGAGGAGCGGGTTCACGCTACGTGCAACACCACCATCATCATCATCAACTTCTGAATCCGGAACTGCATCTGGACACGCCCTCCTCCTCCTCTATTCCATTTCCTCATCCTCATCTATTCAACCATTCCAATCACTCCGATGATGGTGATGATAATAACGATACTAATAATACTACTAATCCCTCTCAAAACCAACACGCCTCCCTTCGCCGCCCTCGTGGCCGTCCTCCTGGATCCAAAAACAAGCCCAAACCTCCCATTATCCTCACTCGCGATAGTCCCAACGTCCTCGGCTCTCATGTCCTCGAGGTCTCCGCCTCCGCTGATATCGTCGATAGTCTCTCCAATTACGCCCGCCGCAGAGGGAGAGGACTCAGCATCCTCTGCGGCACTGGTACTGTCGCCAACGTCACGCTCCGTCAGCCATCTGCCTCTCCTACGGCTAGCGTCATCACTCTTCATGGCAGGTTCGAGATTCTCTCCCTCACGGGCACTGTGCTTCCCCCGCCAGCTCCGCCGCAGGCAGGTGGACTGTCTATATTCCTAGCCGGGGCACAAGGGCAGGTGGTTGGAGGGACGGTGGTGGGGCCCTTGGTGGCTTCGGGGCCGGTGATTTTGATGGCGGCGTCATTCTCCAACGCTGTGTTTGAAAGGTTGCCTCTGGAAGAAGAGGAGGAAGGGGGAGTGCAAGTTCAACCGACGGCGTCGCAGTCGTCGGGAGTGACCGGAGGCGGACAGATGGGGGAGGGCGGTGGGAATGCAAACGAGGGGAGCGGCGGCGGAGTTGGGTTCCTGGGCAACAGTACAATAGCGGGATATCCTCCGTTTGCAGGGGATTTGTTTGGTTGGGGAAGTGGAAGTGGAAGTGGGAATGCCACAAAGCCTCGTCAGTTCTAA

Coding sequence (CDS)

ATGAGAATGGCGGGGTACAGCGAAGAAGGAGGAGCGGGTTCACGCTACGTGCAACACCACCATCATCATCATCAACTTCTGAATCCGGAACTGCATCTGGACACGCCCTCCTCCTCCTCTATTCCATTTCCTCATCCTCATCTATTCAACCATTCCAATCACTCCGATGATGGTGATGATAATAACGATACTAATAATACTACTAATCCCTCTCAAAACCAACACGCCTCCCTTCGCCGCCCTCGTGGCCGTCCTCCTGGATCCAAAAACAAGCCCAAACCTCCCATTATCCTCACTCGCGATAGTCCCAACGTCCTCGGCTCTCATGTCCTCGAGGTCTCCGCCTCCGCTGATATCGTCGATAGTCTCTCCAATTACGCCCGCCGCAGAGGGAGAGGACTCAGCATCCTCTGCGGCACTGGTACTGTCGCCAACGTCACGCTCCGTCAGCCATCTGCCTCTCCTACGGCTAGCGTCATCACTCTTCATGGCAGGTTCGAGATTCTCTCCCTCACGGGCACTGTGCTTCCCCCGCCAGCTCCGCCGCAGGCAGGTGGACTGTCTATATTCCTAGCCGGGGCACAAGGGCAGGTGGTTGGAGGGACGGTGGTGGGGCCCTTGGTGGCTTCGGGGCCGGTGATTTTGATGGCGGCGTCATTCTCCAACGCTGTGTTTGAAAGGTTGCCTCTGGAAGAAGAGGAGGAAGGGGGAGTGCAAGTTCAACCGACGGCGTCGCAGTCGTCGGGAGTGACCGGAGGCGGACAGATGGGGGAGGGCGGTGGGAATGCAAACGAGGGGAGCGGCGGCGGAGTTGGGTTCCTGGGCAACAGTACAATAGCGGGATATCCTCCGTTTGCAGGGGATTTGTTTGGTTGGGGAAGTGGAAGTGGAAGTGGGAATGCCACAAAGCCTCGTCAGTTCTAA

Protein sequence

MRMAGYSEEGGAGSRYVQHHHHHHQLLNPELHLDTPSSSSIPFPHPHLFNHSNHSDDGDDNNDTNNTTNPSQNQHASLRRPRGRPPGSKNKPKPPIILTRDSPNVLGSHVLEVSASADIVDSLSNYARRRGRGLSILCGTGTVANVTLRQPSASPTASVITLHGRFEILSLTGTVLPPPAPPQAGGLSIFLAGAQGQVVGGTVVGPLVASGPVILMAASFSNAVFERLPLEEEEEGGVQVQPTASQSSGVTGGGQMGEGGGNANEGSGGGVGFLGNSTIAGYPPFAGDLFGWGSGSGSGNATKPRQF
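
The relationship between the coding sequence and the protein sequence above can be illustrated with a small translation routine. This is a sketch using the standard genetic code, not the pipeline that produced this record; `translate` and `CODON_TABLE` are hypothetical names, and only the first and last few codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base slowest).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAGAATGGCGGGGTAC"))  # first six codons of the CDS
print(translate("AAGCCTCGTCAGTTCTAA"))  # last six codons, including the stop
```

The two fragments give `MRMAGY` and `KPRQF`, matching the start and end of the protein sequence listed above.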
BLAST of CmaCh09G000200 vs. Swiss-Prot
Match: AHL27_ARATH (AT-hook motif nuclear-localized protein 27 OS=Arabidopsis thaliana GN=AHL27 PE=1 SV=1)

HSP 1 Score: 264.6 bits (675), Expect = 1.3e-69
Identity = 174/331 (52.57%), Positives = 206/331 (62.24%), Query Frame = 1

Query: 3   MAGYSEEGGAGSRYVQHHHHHHQLLNPELH---LDTPSSSSIPFPHPHLFN-HSNHSDDG 62
           M G  E+GG  SRY       H L  PE+H   L      ++   H H    H       
Sbjct: 1   MEGGYEQGGGASRYF------HNLFRPEIHHQQLQPQGGINLIDQHHHQHQQHQQQQQPS 60

Query: 63  DDNNDTNNT-----------TNPSQNQHASLRRPRGRPPGSKNKPKPPIILTRDSPNVLG 122
           DD+ +++++           ++P+ +  A  +RPRGRPPGSKNK KPPII+TRDSPN L 
Sbjct: 61  DDSRESDHSNKDHHQQGRPDSDPNTSSSAPGKRPRGRPPGSKNKAKPPIIVTRDSPNALR 120

Query: 123 SHVLEVSASADIVDSLSNYARRRGRGLSILCGTGTVANVTLRQP-------SASPTASVI 182
           SHVLEVS  ADIV+S+S YARRRGRG+S+L G GTV+NVTLRQP         S    V+
Sbjct: 121 SHVLEVSPGADIVESVSTYARRRGRGVSVLGGNGTVSNVTLRQPVTPGNGGGVSGGGGVV 180

Query: 183 TLHGRFEILSLTGTVLPPPAPPQAGGLSIFLAGAQGQVVGGTVVGPLVASGPVILMAASF 242
           TLHGRFEILSLTGTVLPPPAPP AGGLSIFLAG QGQVVGG+VV PL+AS PVILMAASF
Sbjct: 181 TLHGRFEILSLTGTVLPPPAPPGAGGLSIFLAGGQGQVVGGSVVAPLIASAPVILMAASF 240

Query: 243 SNAVFERLPLEEEEEGG-------------VQVQPTASQSSGVTGGGQMGEGGGNANEGS 299
           SNAVFERLP+EEEEE G             +Q  P+AS  SGVTG GQ+G        G+
Sbjct: 241 SNAVFERLPIEEEEEEGGGGGGGGGGGPPQMQQAPSASPPSGVTGQGQLG--------GN 300
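
The percentages in an HSP summary line are the match counts divided by the alignment length, which includes gap columns. The sketch below recomputes them for the AHL27_ARATH hit; the regular expression and the `recompute` helper are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Summary line of the AHL27_ARATH hit.
HSP_LINE = "Identity = 174/331 (52.57%), Positives = 206/331 (62.24%), Query Frame = 1"

# "Pos\w+" tolerates spelling variants of the Positives label.
PATTERN = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)


def recompute(line: str) -> tuple:
    """Return (computed identity %, reported identity %,
    computed positives %, reported positives %)."""
    m = PATTERN.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return (
        round(100 * int(ident) / int(ident_len), 2),
        float(ident_pct),
        round(100 * int(pos) / int(pos_len), 2),
        float(pos_pct),
    )


print(recompute(HSP_LINE))
```

For this hit the computed and reported values agree: 174/331 rounds to 52.57% and 206/331 to 62.24%.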

BLAST of CmaCh09G000200 vs. Swiss-Prot
Match: AHL29_ARATH (AT-hook motif nuclear-localized protein 29 OS=Arabidopsis thaliana GN=AHL29 PE=1 SV=1)

HSP 1 Score: 261.9 bits (668), Expect = 8.6e-69
Identity = 172/316 (54.43%), Positives = 204/316 (64.56%), Query Frame = 1

Query: 5   GYSEEGGAGSRYVQHHHHHHQLLNPELHLDT---PSSSSIPFPHPHLFNHSNHSDDGDDN 64
           GY + GGA SRY       H L  PELH      P    +P P P       +SDD  D+
Sbjct: 4   GYDQSGGA-SRYF------HNLFRPELHHQLQPQPQLHPLPQPQPQPQPQQQNSDDESDS 63

Query: 65  NDTNNTTNPSQNQHASLRRPRGRPPGSKNKPKPPIILTRDSPNVLGSHVLEVSASADIVD 124
           N    +   +       +RPRGRPPGSKNKPKPP+I+TRDSPNVL SHVLEVS+ ADIV+
Sbjct: 64  NKDPGSDPVTSGSTG--KRPRGRPPGSKNKPKPPVIVTRDSPNVLRSHVLEVSSGADIVE 123

Query: 125 SLSNYARRRGRGLSILCGTGTVANVTLRQPSASP-------TASVITLHGRFEILSLTGT 184
           S++ YARRRGRG+SIL G GTVANV+LRQP+ +        T  V+ LHGRFEILSLTGT
Sbjct: 124 SVTTYARRRGRGVSILSGNGTVANVSLRQPATTAAHGANGGTGGVVALHGRFEILSLTGT 183

Query: 185 VLPPPAPPQAGGLSIFLAGAQGQVVGGTVVGPLVASGPVILMAASFSNAVFERLPLEEE- 244
           VLPPPAPP +GGLSIFL+G QGQV+GG VV PLVASGPVILMAASFSNA FERLPLE+E 
Sbjct: 184 VLPPPAPPGSGGLSIFLSGVQGQVIGGNVVAPLVASGPVILMAASFSNATFERLPLEDEG 243

Query: 245 EEGGVQVQPTASQSSGVTGGGQMGEGGG---------NANEGSGGGVGFLGNSTIAGYPP 299
            EGG               GG++GEGGG         +++  SG G G L    ++GY  
Sbjct: 244 GEGG--------------EGGEVGEGGGGEGGPPPATSSSPPSGAGQGQL-RGNMSGYDQ 295

BLAST of CmaCh09G000200 vs. Swiss-Prot
Match: AHL25_ARATH (AT-hook motif nuclear-localized protein 25 OS=Arabidopsis thaliana GN=AHL25 PE=1 SV=1)

HSP 1 Score: 249.2 bits (635), Expect = 5.8e-65
Identity = 154/292 (52.74%), Positives = 191/292 (65.41%), Query Frame = 1

Query: 24  HQLLNPELHLDTPSSSSIPFPHPHLFNHSNHSDDGDDNNDTNN--TTNPSQNQHASLRRP 83
           H LL  ELHL  P  S  P    ++  + + +D+           T++ + +  +S RRP
Sbjct: 6   HPLLGQELHLQRPEDSRTPPDQNNMELNRSEADEAKAETTPTGGATSSATASGSSSGRRP 65

Query: 84  RGRPPGSKNKPKPPIILTRDSPNVLGSHVLEVSASADIVDSLSNYARRRGRGLSILCGTG 143
           RGRP GSKNKPKPP I+TRDSPNVL SHVLEV++ +DI +++S YA RRG G+ I+ GTG
Sbjct: 66  RGRPAGSKNKPKPPTIITRDSPNVLRSHVLEVTSGSDISEAVSTYATRRGCGVCIISGTG 125

Query: 144 TVANVTLRQPSASPTASVITLHGRFEILSLTGTVLPPPAPPQAGGLSIFLAGAQGQVVGG 203
            V NVT+RQP+A     VITLHGRF+ILSLTGT LPPPAPP AGGL+++LAG QGQVVGG
Sbjct: 126 AVTNVTIRQPAAPAGGGVITLHGRFDILSLTGTALPPPAPPGAGGLTVYLAGGQGQVVGG 185

Query: 204 TVVGPLVASGPVILMAASFSNAVFERLPLEEEE------EGGVQVQPTASQSSGVTGGGQ 263
            V G L+ASGPV+LMAASF+NAV++RLP+EEEE       G  Q QP ASQSS VTG G 
Sbjct: 186 NVAGSLIASGPVVLMAASFANAVYDRLPIEEEETPPPRTTGVQQQQPEASQSSEVTGSGA 245

Query: 264 MGEGGGNANEGSGGGVGFLG-NSTIAGYPPFAGDLFGW--GSGSGSGNATKP 305
                       GGGV F      +  +    GD++G   GSG G G AT+P
Sbjct: 246 QACESNLQGGNGGGGVAFYNLGMNMNNFQFSGGDIYGMSGGSGGGGGGATRP 297

BLAST of CmaCh09G000200 vs. Swiss-Prot
Match: AHL19_ARATH (AT-hook motif nuclear-localized protein 19 OS=Arabidopsis thaliana GN=AHL19 PE=2 SV=1)

HSP 1 Score: 218.4 bits (555), Expect = 1.1e-55
Identity = 143/289 (49.48%), Positives = 187/289 (64.71%), Query Frame = 1

Query: 25  QLLNPELHLDTPSSSSIPFPHPHLFNHSNHSDDGDDNNDTNNTTNPSQNQH--------A 84
           QL  P+LH+    S ++     H  N+ +H  + D+NN+ ++  N S + H        A
Sbjct: 24  QLKKPDLHI----SMNMAMDSGH--NNHHHHQEVDNNNNDDDRDNLSGDDHEPREGAVEA 83

Query: 85  SLRRPRGRPPGSKNKPKPPIILTRDSPNVLGSHVLEVSASADIVDSLSNYARRRGRGLSI 144
             RRPRGRP GSKNKPKPPI +TRDSPN L SHV+E+++  D++++L+ +ARRR RG+ I
Sbjct: 84  PTRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEIASGTDVIETLATFARRRQRGICI 143

Query: 145 LCGTGTVANVTLRQPSASPT------ASVITLHGRFEILSLTGTVLPPPAPPQAGGLSIF 204
           L G GTVANVTLRQPS +        A+V+ L GRFEILSLTG+ LP PAPP + GL+I+
Sbjct: 144 LSGNGTVANVTLRQPSTAAVAAAPGGAAVLALQGRFEILSLTGSFLPGPAPPGSTGLTIY 203

Query: 205 LAGAQGQVVGGTVVGPLVASGPVILMAASFSNAVFERLPLEEEEEGGVQVQPTASQSSGV 264
           LAG QGQVVGG+VVGPL+A+GPV+L+AA+FSNA +ERLPLEEEE      +      SG 
Sbjct: 204 LAGGQGQVVGGSVVGPLMAAGPVMLIAATFSNATYERLPLEEEE----AAERGGGGGSGG 263

Query: 265 TGGGQMGEGGGNANEGSGGGVGFLGNSTIAGYPPFAGDLFGWGSGSGSG 300
              GQ+G GG   + G+GGG    GN  +  Y    G+L   G   G G
Sbjct: 264 VVPGQLGGGGSPLSSGAGGGD---GNQGLPVY-NMPGNLVSNGGSGGGG 298

BLAST of CmaCh09G000200 vs. Swiss-Prot
Match: AHL24_ARATH (AT-hook motif nuclear-localized protein 24 OS=Arabidopsis thaliana GN=AHL24 PE=2 SV=1)

HSP 1 Score: 205.3 bits (521), Expect = 9.6e-52
Identity = 117/214 (54.67%), Positives = 150/214 (70.09%), Query Frame = 1

Query: 50  NHSNHSDDGDDNNDTNNTTNPSQNQHASLRRPRGRPPGSKNKPKPPIILTRDSPNVLGSH 109
           N  +   D D +  +      S   H   RRPRGRP GSKNKPKPPII+TRDS N L +H
Sbjct: 76  NSGSEGKDIDIHGGSGEGGGGSGGDHQMTRRPRGRPAGSKNKPKPPIIITRDSANALRTH 135

Query: 110 VLEVSASADIVDSLSNYARRRGRGLSILCGTGTVANVTLRQPSASPT-ASVITLHGRFEI 169
           V+E+    D+V+S++ +ARRR RG+ ++ GTG V NVT+RQP + P+  SV++LHGRFEI
Sbjct: 136 VMEIGDGCDLVESVATFARRRQRGVCVMSGTGNVTNVTIRQPGSHPSPGSVVSLHGRFEI 195

Query: 170 LSLTGTVLPPPAPPQAGGLSIFLAGAQGQVVGGTVVGPLVASGPVILMAASFSNAVFERL 229
           LSL+G+ LPPPAPP A GLS++LAG QGQVVGG+VVGPL+ +GPV++MAASFSNA +ERL
Sbjct: 196 LSLSGSFLPPPAPPTATGLSVYLAGGQGQVVGGSVVGPLLCAGPVVVMAASFSNAAYERL 255

Query: 230 PLEEEEEGGVQVQPTASQSSGVTGGGQMGEGGGN 263
           PLEE+E             + V GGG    GGG+
Sbjct: 256 PLEEDE-----------MQTPVHGGG----GGGS 274

BLAST of CmaCh09G000200 vs. TrEMBL
Match: A0A067KKW8_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_08028 PE=4 SV=1)

HSP 1 Score: 312.4 bits (799), Expect = 6.2e-82
Identity = 193/307 (62.87%), Positives = 219/307 (71.34%), Query Frame = 1

Query: 3   MAGYSEEGGAGSRYVQHHHHHHQLLNPELHLDTPSSSSIPFPHPHLFNHSNHSDDGDDNN 62
           M+GY +   AGSRYV      HQLL PELHL  PS      P     +  N S    D+ 
Sbjct: 1   MSGYEQT--AGSRYV------HQLLRPELHLQRPSLPVQLSPD----SKDNTSPQSKDHK 60

Query: 63  DTNNTTNPSQNQHASLRRPRGRPPGSKNKPKPPIILTRDSPNVLGSHVLEVSASADIVDS 122
            T +T   + +   S RRPRGRPPGSKNKPKPPII+TRDSPN L SHVLEVS  +DIV++
Sbjct: 61  -TTDTDTAATSSGGSYRRPRGRPPGSKNKPKPPIIVTRDSPNALRSHVLEVSTGSDIVET 120

Query: 123 LSNYARRRGRGLSILCGTGTVANVTLRQPSASPTASVITLHGRFEILSLTGTVLPPPAPP 182
           +SNYAR+RGRG+ +L G GTVANVTLRQP ASP  SV+TL GRFEILSL+GTVLPPPAPP
Sbjct: 121 VSNYARKRGRGVCVLSGNGTVANVTLRQP-ASPAGSVVTLQGRFEILSLSGTVLPPPAPP 180

Query: 183 QAGGLSIFLAGAQGQVVGGTVVGPLVASGPVILMAASFSNAVFERLPLEEEEEGGVQVQP 242
            AGGLSIFL+G QGQVVGG+VVGPL+ASGPV+LMAASF+NAVFERLPL +EEEG VQVQ 
Sbjct: 181 GAGGLSIFLSGGQGQVVGGSVVGPLLASGPVVLMAASFANAVFERLPL-DEEEGNVQVQS 240

Query: 243 TASQSSGVTGGGQMGE-----GGGNANEGSGGGVGFLGNSTIAGYPPFAGDLFGWGSGSG 302
           TASQSSGVTG G  G      GGG    G GGG  F       G  PF+GDLFGWG    
Sbjct: 241 TASQSSGVTGSGGAGHLADGGGGGRGGGGGGGGGAFFNVGGNVGNYPFSGDLFGWG---- 287

Query: 303 SGNATKP 305
            G+AT+P
Sbjct: 301 -GSATRP 287

BLAST of CmaCh09G000200 vs. TrEMBL
Match: A0A061FMX7_THECC (AT-hook DNA-binding family protein, putative OS=Theobroma cacao GN=TCM_034810 PE=4 SV=1)

HSP 1 Score: 312.4 bits (799), Expect = 6.2e-82
Identity = 189/296 (63.85%), Positives = 215/296 (72.64%), Query Frame = 1

Query: 3   MAGYSEEGGAGSRYVQHHHHHHQLLNPELHLDTPSSSSIPFPHPHLFNHSNHSDDGDDNN 62
           MAGY E  G GSRY Q      Q   PELHL  PS +          + S  S D D NN
Sbjct: 62  MAGY-EAAGPGSRYGQ------QPFRPELHLQMPSLTPPS-------DDSRDSQDNDPNN 121

Query: 63  DTNNTTNPSQNQHASLRRPRGRPPGSKNKPKPPIILTRDSPNVLGSHVLEVSASADIVDS 122
              +    + +     RRPRGRP GSKNKPKPPII+TRDSPN L SHVLE+S+ ADIVDS
Sbjct: 122 PDLSDAAAATSSGGPTRRPRGRPAGSKNKPKPPIIVTRDSPNALRSHVLEISSGADIVDS 181

Query: 123 LSNYARRRGRGLSILCGTGTVANVTLRQPSASPTASVITLHGRFEILSLTGTVLPPPAPP 182
           LSNYARRRGRG+ +L G+GTVANV+LRQP ASP ASV+TLHGRFEILSL G VLPPPAPP
Sbjct: 182 LSNYARRRGRGICVLSGSGTVANVSLRQP-ASPPASVLTLHGRFEILSLCGKVLPPPAPP 241

Query: 183 QAGGLSIFLAGAQGQVVGGTVVGPLVASGPVILMAASFSNAVFERLPLEEEEEGGVQVQP 242
             GGLSIFL+G QGQVVGG VVGPLVASGPV+LMAASF+NAVFERLP +EEEEG VQVQP
Sbjct: 242 GVGGLSIFLSGGQGQVVGGRVVGPLVASGPVVLMAASFANAVFERLPPDEEEEGTVQVQP 301

Query: 243 TASQSSGVTGGGQMGEGGGNANEGSGGGVG--FLGNSTIAGYPPFAGDLFGWGSGS 297
           T SQSSGVTG GQ+ +GGG ++  +    G  F+   +   Y PF+GDLFGWGSG+
Sbjct: 302 TGSQSSGVTGSGQLPDGGGTSSAAASATAGSLFIMGGSGPNY-PFSGDLFGWGSGT 341

BLAST of CmaCh09G000200 vs. TrEMBL
Match: F6I0P7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_03s0038g04110 PE=4 SV=1)

HSP 1 Score: 309.7 bits (792), Expect = 4.0e-81
Identity = 192/308 (62.34%), Positives = 217/308 (70.45%), Query Frame = 1

Query: 3   MAGYSEEGGAGSRYVQHHHHHHQLLNPELHLD-TPSSSSIPFPHPHLFNHSNHSDDGDDN 62
           MAG   E GAGSRY+      HQL  PEL L+ TP         PH     N S D  +N
Sbjct: 1   MAGM--EQGAGSRYI------HQLFRPELQLERTPQQ-------PHQPPQLNDSGDSPEN 60

Query: 63  NDTNNTTNPSQNQHASLRRPRGRPPGSKNKPKPPIILTRDSPNVLGSHVLEVSASADIVD 122
            D  +          S RRPRGRPPGSKNK KPPII+TRDSPN L SHVLE+SA ADIV+
Sbjct: 61  EDRTDPDGSPGAATTSSRRPRGRPPGSKNKAKPPIIITRDSPNALRSHVLEISAGADIVE 120

Query: 123 SLSNYARRRGRGLSILCGTGTVANVTLRQPSASPTASVITLHGRFEILSLTGTVLPPPAP 182
           S+SNYARRRGRG+ IL G G V +VTLRQP+A P+ SV+TLHGRFEILSLTGT LPPPAP
Sbjct: 121 SVSNYARRRGRGVCILSGGGAVTDVTLRQPAA-PSGSVVTLHGRFEILSLTGTALPPPAP 180

Query: 183 PQAGGLSIFLAGAQGQVVGGTVVGPLVASGPVILMAASFSNAVFERLPLEEEEEGGVQVQ 242
           P AGGL+I+L G QGQVVGG VVGPLVASGPV+LMAASF+NAV++RLPLEEEEE  VQVQ
Sbjct: 181 PGAGGLTIYLGGGQGQVVGGRVVGPLVASGPVLLMAASFANAVYDRLPLEEEEESPVQVQ 240

Query: 243 PTASQSSGVT-GGGQMGEGGG----NANEGSGGGVGFLGNSTIAGYPPFAGDLFGWGSGS 302
           PTASQSSGVT GGGQ+G+GG      A  G+G GV F       G  PF GD+FGW    
Sbjct: 241 PTASQSSGVTGGGGQLGDGGNGSTTTAGGGAGAGVPFYNLGPNMGNYPFPGDVFGW---- 287

Query: 303 GSGNATKP 305
            +G AT+P
Sbjct: 301 -NGGATRP 287

BLAST of CmaCh09G000200 vs. TrEMBL
Match: B9H5W8_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0005s27850g PE=4 SV=2)

HSP 1 Score: 308.9 bits (790), Expect = 6.8e-81
Identity = 192/317 (60.57%), Positives = 228/317 (71.92%), Query Frame = 1

Query: 3   MAGYSEEGGAGSRYVQHHHHHHQLLNPELHL----------DTPSSSSIPFPHPHLFNHS 62
           MAG+    G  SRYV H  +H+ LL PELHL          D+  +++ P P  H    +
Sbjct: 1   MAGFE---GNNSRYV-HGQNHNNLLRPELHLIQRPSSIPSSDSRDNNNTPSPPDHANQTA 60

Query: 63  NHSDDGDDNNDTNNTTNPSQNQHASLRRPRGRPPGSKNKPKPPIILTRDSPNVLGSHVLE 122
           +H  D      +   TNP+       RRPRGRP GSKNKPKPPII+TRDSPN L SHV+E
Sbjct: 61  HHHPDSSATTSSGGGTNPN-------RRPRGRPAGSKNKPKPPIIVTRDSPNALRSHVIE 120

Query: 123 VSASADIVDSLSNYARRRGRGLSILCGTGTVANVTLRQPSASPTASVITLHGRFEILSLT 182
           +S  ADIV+S+S YAR+RGRG+ +L G+GTVANVTLRQP ASP  SV+TLHGRFEILSL+
Sbjct: 121 ISNGADIVESVSTYARKRGRGVCVLSGSGTVANVTLRQP-ASPAGSVLTLHGRFEILSLS 180

Query: 183 GTVLPPPAPPQAGGLSIFLAGAQGQVVGGTVVGPLVASGPVILMAASFSNAVFERLPLEE 242
           GTVLPPPAPP AGGLSIFL+G QGQVVGG VVGPL+A+GPV+LMAASF+NAVFERLPL++
Sbjct: 181 GTVLPPPAPPGAGGLSIFLSGGQGQVVGGNVVGPLMAAGPVVLMAASFANAVFERLPLDD 240

Query: 243 EEE-GGVQVQPTASQSSGVTG-GGQMGEGGGNANEGSGGGVGF--LGNSTIAGYPPFAGD 302
           +EE G VQVQPTASQSSGVTG GGQMG+GGG +  G G G GF  +      G  PF+GD
Sbjct: 241 QEEAGAVQVQPTASQSSGVTGSGGQMGDGGGGSGTG-GAGSGFFNMAGGAHHGNYPFSGD 299

Query: 303 LFG-WGSGSGSGNATKP 305
           LFG WG     G+A +P
Sbjct: 301 LFGPWG-----GSAARP 299

BLAST of CmaCh09G000200 vs. TrEMBL
Match: B9R8U1_RICCO (DNA binding protein, putative OS=Ricinus communis GN=RCOM_1602220 PE=4 SV=1)

HSP 1 Score: 308.5 bits (789), Expect = 8.9e-81
Identity = 194/310 (62.58%), Positives = 223/310 (71.94%), Query Frame = 1

Query: 3   MAGYSEEG---GAGSRYVQHHHHHHQLLNPELHLDTPSSSSIPFPHPHLFNHSNHSDDGD 62
           MAGY+ E    G GSRYV      HQLL PELHL  PS  S P         +N S    
Sbjct: 1   MAGYNNEQSATGTGSRYV------HQLLRPELHLQRPSFPSQPSSDS---KDNNISPQSK 60

Query: 63  DNNDTNNTTNPSQNQHASLRRPRGRPPGSKNKPKPPIILTRDSPNVLGSHVLEVSASADI 122
           D+N  +++   +     S RRPRGRP GSKNKPKPPII+TRDSPN L SHVLEVS  +DI
Sbjct: 61  DHNKFSDSEAAAATSSGSNRRPRGRPAGSKNKPKPPIIVTRDSPNALRSHVLEVSTGSDI 120

Query: 123 VDSLSNYARRRGRGLSILCGTGTVANVTLRQPSASPTASVITLHGRFEILSLTGTVLPPP 182
           ++S+S YAR+RGRG+ +L G GTVANVTLRQP ASP  SV+TLHGRFEILSL+GTVLPPP
Sbjct: 121 MESVSIYARKRGRGVCVLSGNGTVANVTLRQP-ASPAGSVVTLHGRFEILSLSGTVLPPP 180

Query: 183 APPQAGGLSIFLAGAQGQVVGGTVVGPLVASGPVILMAASFSNAVFERLPLEEEEEGGVQ 242
           APP AGGLSIFL+G QGQVVGG+VVGPL+ASGPV+LMAASF+NAVFERLPL +EE+G V 
Sbjct: 181 APPGAGGLSIFLSGGQGQVVGGSVVGPLMASGPVVLMAASFANAVFERLPL-DEEDGTVP 240

Query: 243 VQPTASQSSGVTGG----GQMGEGGGNANEGSGGGVG-FLGNSTIAGYPPFAGDLFGWGS 302
           VQ TASQSSGVTGG    GQ+G+GGG      GGG G F     +A Y PF+GDLFGWG 
Sbjct: 241 VQSTASQSSGVTGGGGGAGQLGDGGG------GGGAGLFNMGGNVANY-PFSGDLFGWGV 287

Query: 303 GSGSGNATKP 305
                NA +P
Sbjct: 301 -----NAARP 287

BLAST of CmaCh09G000200 vs. TAIR10
Match: AT1G20900.1 (AT1G20900.1 Predicted AT-hook DNA-binding family protein)

HSP 1 Score: 264.6 bits (675), Expect = 7.5e-71
Identity = 174/331 (52.57%), Positives = 206/331 (62.24%), Query Frame = 1

Query: 3   MAGYSEEGGAGSRYVQHHHHHHQLLNPELH---LDTPSSSSIPFPHPHLFN-HSNHSDDG 62
           M G  E+GG  SRY       H L  PE+H   L      ++   H H    H       
Sbjct: 1   MEGGYEQGGGASRYF------HNLFRPEIHHQQLQPQGGINLIDQHHHQHQQHQQQQQPS 60

Query: 63  DDNNDTNNT-----------TNPSQNQHASLRRPRGRPPGSKNKPKPPIILTRDSPNVLG 122
           DD+ +++++           ++P+ +  A  +RPRGRPPGSKNK KPPII+TRDSPN L 
Sbjct: 61  DDSRESDHSNKDHHQQGRPDSDPNTSSSAPGKRPRGRPPGSKNKAKPPIIVTRDSPNALR 120

Query: 123 SHVLEVSASADIVDSLSNYARRRGRGLSILCGTGTVANVTLRQP-------SASPTASVI 182
           SHVLEVS  ADIV+S+S YARRRGRG+S+L G GTV+NVTLRQP         S    V+
Sbjct: 121 SHVLEVSPGADIVESVSTYARRRGRGVSVLGGNGTVSNVTLRQPVTPGNGGGVSGGGGVV 180

Query: 183 TLHGRFEILSLTGTVLPPPAPPQAGGLSIFLAGAQGQVVGGTVVGPLVASGPVILMAASF 242
           TLHGRFEILSLTGTVLPPPAPP AGGLSIFLAG QGQVVGG+VV PL+AS PVILMAASF
Sbjct: 181 TLHGRFEILSLTGTVLPPPAPPGAGGLSIFLAGGQGQVVGGSVVAPLIASAPVILMAASF 240

Query: 243 SNAVFERLPLEEEEEGG-------------VQVQPTASQSSGVTGGGQMGEGGGNANEGS 299
           SNAVFERLP+EEEEE G             +Q  P+AS  SGVTG GQ+G        G+
Sbjct: 241 SNAVFERLPIEEEEEEGGGGGGGGGGGPPQMQQAPSASPPSGVTGQGQLG--------GN 300

BLAST of CmaCh09G000200 vs. TAIR10
Match: AT1G76500.1 (AT1G76500.1 Predicted AT-hook DNA-binding family protein)

HSP 1 Score: 261.9 bits (668), Expect = 4.9e-70
Identity = 172/316 (54.43%), Positives = 204/316 (64.56%), Query Frame = 1

Query: 5   GYSEEGGAGSRYVQHHHHHHQLLNPELHLDT---PSSSSIPFPHPHLFNHSNHSDDGDDN 64
           GY + GGA SRY       H L  PELH      P    +P P P       +SDD  D+
Sbjct: 4   GYDQSGGA-SRYF------HNLFRPELHHQLQPQPQLHPLPQPQPQPQPQQQNSDDESDS 63

Query: 65  NDTNNTTNPSQNQHASLRRPRGRPPGSKNKPKPPIILTRDSPNVLGSHVLEVSASADIVD 124
           N    +   +       +RPRGRPPGSKNKPKPP+I+TRDSPNVL SHVLEVS+ ADIV+
Sbjct: 64  NKDPGSDPVTSGSTG--KRPRGRPPGSKNKPKPPVIVTRDSPNVLRSHVLEVSSGADIVE 123

Query: 125 SLSNYARRRGRGLSILCGTGTVANVTLRQPSASP-------TASVITLHGRFEILSLTGT 184
           S++ YARRRGRG+SIL G GTVANV+LRQP+ +        T  V+ LHGRFEILSLTGT
Sbjct: 124 SVTTYARRRGRGVSILSGNGTVANVSLRQPATTAAHGANGGTGGVVALHGRFEILSLTGT 183

Query: 185 VLPPPAPPQAGGLSIFLAGAQGQVVGGTVVGPLVASGPVILMAASFSNAVFERLPLEEE- 244
           VLPPPAPP +GGLSIFL+G QGQV+GG VV PLVASGPVILMAASFSNA FERLPLE+E 
Sbjct: 184 VLPPPAPPGSGGLSIFLSGVQGQVIGGNVVAPLVASGPVILMAASFSNATFERLPLEDEG 243

Query: 245 EEGGVQVQPTASQSSGVTGGGQMGEGGG---------NANEGSGGGVGFLGNSTIAGYPP 299
            EGG               GG++GEGGG         +++  SG G G L    ++GY  
Sbjct: 244 GEGG--------------EGGEVGEGGGGEGGPPPATSSSPPSGAGQGQL-RGNMSGYDQ 295

BLAST of CmaCh09G000200 vs. TAIR10
Match: AT4G35390.1 (AT4G35390.1 AT-hook protein of GA feedback 1)

HSP 1 Score: 249.2 bits (635), Expect = 3.3e-66
Identity = 154/292 (52.74%), Positives = 191/292 (65.41%), Query Frame = 1

Query: 24  HQLLNPELHLDTPSSSSIPFPHPHLFNHSNHSDDGDDNNDTNN--TTNPSQNQHASLRRP 83
           H LL  ELHL  P  S  P    ++  + + +D+           T++ + +  +S RRP
Sbjct: 6   HPLLGQELHLQRPEDSRTPPDQNNMELNRSEADEAKAETTPTGGATSSATASGSSSGRRP 65

Query: 84  RGRPPGSKNKPKPPIILTRDSPNVLGSHVLEVSASADIVDSLSNYARRRGRGLSILCGTG 143
           RGRP GSKNKPKPP I+TRDSPNVL SHVLEV++ +DI +++S YA RRG G+ I+ GTG
Sbjct: 66  RGRPAGSKNKPKPPTIITRDSPNVLRSHVLEVTSGSDISEAVSTYATRRGCGVCIISGTG 125

Query: 144 TVANVTLRQPSASPTASVITLHGRFEILSLTGTVLPPPAPPQAGGLSIFLAGAQGQVVGG 203
            V NVT+RQP+A     VITLHGRF+ILSLTGT LPPPAPP AGGL+++LAG QGQVVGG
Sbjct: 126 AVTNVTIRQPAAPAGGGVITLHGRFDILSLTGTALPPPAPPGAGGLTVYLAGGQGQVVGG 185

Query: 204 TVVGPLVASGPVILMAASFSNAVFERLPLEEEE------EGGVQVQPTASQSSGVTGGGQ 263
            V G L+ASGPV+LMAASF+NAV++RLP+EEEE       G  Q QP ASQSS VTG G 
Sbjct: 186 NVAGSLIASGPVVLMAASFANAVYDRLPIEEEETPPPRTTGVQQQQPEASQSSEVTGSGA 245

Query: 264 MGEGGGNANEGSGGGVGFLG-NSTIAGYPPFAGDLFGW--GSGSGSGNATKP 305
                       GGGV F      +  +    GD++G   GSG G G AT+P
Sbjct: 246 QACESNLQGGNGGGGVAFYNLGMNMNNFQFSGGDIYGMSGGSGGGGGGATRP 297

BLAST of CmaCh09G000200 vs. TAIR10
Match: AT3G04570.1 (AT3G04570.1 AT-hook motif nuclear-localized protein 19)

HSP 1 Score: 218.4 bits (555), Expect = 6.2e-57
Identity = 143/289 (49.48%), Positives = 187/289 (64.71%), Query Frame = 1

Query: 25  QLLNPELHLDTPSSSSIPFPHPHLFNHSNHSDDGDDNNDTNNTTNPSQNQH--------A 84
           QL  P+LH+    S ++     H  N+ +H  + D+NN+ ++  N S + H        A
Sbjct: 24  QLKKPDLHI----SMNMAMDSGH--NNHHHHQEVDNNNNDDDRDNLSGDDHEPREGAVEA 83

Query: 85  SLRRPRGRPPGSKNKPKPPIILTRDSPNVLGSHVLEVSASADIVDSLSNYARRRGRGLSI 144
             RRPRGRP GSKNKPKPPI +TRDSPN L SHV+E+++  D++++L+ +ARRR RG+ I
Sbjct: 84  PTRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEIASGTDVIETLATFARRRQRGICI 143

Query: 145 LCGTGTVANVTLRQPSASPT------ASVITLHGRFEILSLTGTVLPPPAPPQAGGLSIF 204
           L G GTVANVTLRQPS +        A+V+ L GRFEILSLTG+ LP PAPP + GL+I+
Sbjct: 144 LSGNGTVANVTLRQPSTAAVAAAPGGAAVLALQGRFEILSLTGSFLPGPAPPGSTGLTIY 203

Query: 205 LAGAQGQVVGGTVVGPLVASGPVILMAASFSNAVFERLPLEEEEEGGVQVQPTASQSSGV 264
           LAG QGQVVGG+VVGPL+A+GPV+L+AA+FSNA +ERLPLEEEE      +      SG 
Sbjct: 204 LAGGQGQVVGGSVVGPLMAAGPVMLIAATFSNATYERLPLEEEE----AAERGGGGGSGG 263

Query: 265 TGGGQMGEGGGNANEGSGGGVGFLGNSTIAGYPPFAGDLFGWGSGSGSG 300
              GQ+G GG   + G+GGG    GN  +  Y    G+L   G   G G
Sbjct: 264 VVPGQLGGGGSPLSSGAGGGD---GNQGLPVY-NMPGNLVSNGGSGGGG 298

BLAST of CmaCh09G000200 vs. TAIR10
Match: AT4G22810.1 (AT4G22810.1 Predicted AT-hook DNA-binding family protein)

HSP 1 Score: 205.3 bits (521), Expect = 5.4e-53
Identity = 117/214 (54.67%), Positives = 150/214 (70.09%), Query Frame = 1

Query: 50  NHSNHSDDGDDNNDTNNTTNPSQNQHASLRRPRGRPPGSKNKPKPPIILTRDSPNVLGSH 109
           N  +   D D +  +      S   H   RRPRGRP GSKNKPKPPII+TRDS N L +H
Sbjct: 76  NSGSEGKDIDIHGGSGEGGGGSGGDHQMTRRPRGRPAGSKNKPKPPIIITRDSANALRTH 135

Query: 110 VLEVSASADIVDSLSNYARRRGRGLSILCGTGTVANVTLRQPSASPT-ASVITLHGRFEI 169
           V+E+    D+V+S++ +ARRR RG+ ++ GTG V NVT+RQP + P+  SV++LHGRFEI
Sbjct: 136 VMEIGDGCDLVESVATFARRRQRGVCVMSGTGNVTNVTIRQPGSHPSPGSVVSLHGRFEI 195

Query: 170 LSLTGTVLPPPAPPQAGGLSIFLAGAQGQVVGGTVVGPLVASGPVILMAASFSNAVFERL 229
           LSL+G+ LPPPAPP A GLS++LAG QGQVVGG+VVGPL+ +GPV++MAASFSNA +ERL
Sbjct: 196 LSLSGSFLPPPAPPTATGLSVYLAGGQGQVVGGSVVGPLLCAGPVVVMAASFSNAAYERL 255

Query: 230 PLEEEEEGGVQVQPTASQSSGVTGGGQMGEGGGN 263
           PLEE+E             + V GGG    GGG+
Sbjct: 256 PLEEDE-----------MQTPVHGGG----GGGS 274

BLAST of CmaCh09G000200 vs. NCBI nr
Match: gi|802604323|ref|XP_012073562.1| (PREDICTED: AT-hook motif nuclear-localized protein 25-like [Jatropha curcas])

HSP 1 Score: 312.4 bits (799), Expect = 8.9e-82
Identity = 193/307 (62.87%), Positives = 219/307 (71.34%), Query Frame = 1

Query: 3   MAGYSEEGGAGSRYVQHHHHHHQLLNPELHLDTPSSSSIPFPHPHLFNHSNHSDDGDDNN 62
           M+GY +   AGSRYV      HQLL PELHL  PS      P     +  N S    D+ 
Sbjct: 1   MSGYEQT--AGSRYV------HQLLRPELHLQRPSLPVQLSPD----SKDNTSPQSKDHK 60

Query: 63  DTNNTTNPSQNQHASLRRPRGRPPGSKNKPKPPIILTRDSPNVLGSHVLEVSASADIVDS 122
            T +T   + +   S RRPRGRPPGSKNKPKPPII+TRDSPN L SHVLEVS  +DIV++
Sbjct: 61  -TTDTDTAATSSGGSYRRPRGRPPGSKNKPKPPIIVTRDSPNALRSHVLEVSTGSDIVET 120

Query: 123 LSNYARRRGRGLSILCGTGTVANVTLRQPSASPTASVITLHGRFEILSLTGTVLPPPAPP 182
           +SNYAR+RGRG+ +L G GTVANVTLRQP ASP  SV+TL GRFEILSL+GTVLPPPAPP
Sbjct: 121 VSNYARKRGRGVCVLSGNGTVANVTLRQP-ASPAGSVVTLQGRFEILSLSGTVLPPPAPP 180

Query: 183 QAGGLSIFLAGAQGQVVGGTVVGPLVASGPVILMAASFSNAVFERLPLEEEEEGGVQVQP 242
            AGGLSIFL+G QGQVVGG+VVGPL+ASGPV+LMAASF+NAVFERLPL +EEEG VQVQ 
Sbjct: 181 GAGGLSIFLSGGQGQVVGGSVVGPLLASGPVVLMAASFANAVFERLPL-DEEEGNVQVQS 240

Query: 243 TASQSSGVTGGGQMGE-----GGGNANEGSGGGVGFLGNSTIAGYPPFAGDLFGWGSGSG 302
           TASQSSGVTG G  G      GGG    G GGG  F       G  PF+GDLFGWG    
Sbjct: 241 TASQSSGVTGSGGAGHLADGGGGGRGGGGGGGGGAFFNVGGNVGNYPFSGDLFGWG---- 287

Query: 303 SGNATKP 305
            G+AT+P
Sbjct: 301 -GSATRP 287

BLAST of CmaCh09G000200 vs. NCBI nr
Match: gi|590597591|ref|XP_007018654.1| (AT-hook DNA-binding family protein, putative [Theobroma cacao])

HSP 1 Score: 312.4 bits (799), Expect = 8.9e-82
Identity = 189/296 (63.85%), Positives = 215/296 (72.64%), Query Frame = 1

Query: 3   MAGYSEEGGAGSRYVQHHHHHHQLLNPELHLDTPSSSSIPFPHPHLFNHSNHSDDGDDNN 62
           MAGY E  G GSRY Q      Q   PELHL  PS +          + S  S D D NN
Sbjct: 62  MAGY-EAAGPGSRYGQ------QPFRPELHLQMPSLTPPS-------DDSRDSQDNDPNN 121

Query: 63  DTNNTTNPSQNQHASLRRPRGRPPGSKNKPKPPIILTRDSPNVLGSHVLEVSASADIVDS 122
              +    + +     RRPRGRP GSKNKPKPPII+TRDSPN L SHVLE+S+ ADIVDS
Sbjct: 122 PDLSDAAAATSSGGPTRRPRGRPAGSKNKPKPPIIVTRDSPNALRSHVLEISSGADIVDS 181

Query: 123 LSNYARRRGRGLSILCGTGTVANVTLRQPSASPTASVITLHGRFEILSLTGTVLPPPAPP 182
           LSNYARRRGRG+ +L G+GTVANV+LRQP ASP ASV+TLHGRFEILSL G VLPPPAPP
Sbjct: 182 LSNYARRRGRGICVLSGSGTVANVSLRQP-ASPPASVLTLHGRFEILSLCGKVLPPPAPP 241

Query: 183 QAGGLSIFLAGAQGQVVGGTVVGPLVASGPVILMAASFSNAVFERLPLEEEEEGGVQVQP 242
             GGLSIFL+G QGQVVGG VVGPLVASGPV+LMAASF+NAVFERLP +EEEEG VQVQP
Sbjct: 242 GVGGLSIFLSGGQGQVVGGRVVGPLVASGPVVLMAASFANAVFERLPPDEEEEGTVQVQP 301

Query: 243 TASQSSGVTGGGQMGEGGGNANEGSGGGVG--FLGNSTIAGYPPFAGDLFGWGSGS 297
           T SQSSGVTG GQ+ +GGG ++  +    G  F+   +   Y PF+GDLFGWGSG+
Sbjct: 302 TGSQSSGVTGSGQLPDGGGTSSAAASATAGSLFIMGGSGPNY-PFSGDLFGWGSGT 341

BLAST of CmaCh09G000200 vs. NCBI nr
Match: gi|731382545|ref|XP_010645606.1| (PREDICTED: putative DNA-binding protein ESCAROLA [Vitis vinifera])

HSP 1 Score: 309.7 bits (792), Expect = 5.8e-81
Identity = 192/308 (62.34%), Positives = 217/308 (70.45%), Query Frame = 1

Query: 3   MAGYSEEGGAGSRYVQHHHHHHQLLNPELHLD-TPSSSSIPFPHPHLFNHSNHSDDGDDN 62
           MAG   E GAGSRY+      HQL  PEL L+ TP         PH     N S D  +N
Sbjct: 1   MAGM--EQGAGSRYI------HQLFRPELQLERTPQQ-------PHQPPQLNDSGDSPEN 60

Query: 63  NDTNNTTNPSQNQHASLRRPRGRPPGSKNKPKPPIILTRDSPNVLGSHVLEVSASADIVD 122
            D  +          S RRPRGRPPGSKNK KPPII+TRDSPN L SHVLE+SA ADIV+
Sbjct: 61  EDRTDPDGSPGAATTSSRRPRGRPPGSKNKAKPPIIITRDSPNALRSHVLEISAGADIVE 120

Query: 123 SLSNYARRRGRGLSILCGTGTVANVTLRQPSASPTASVITLHGRFEILSLTGTVLPPPAP 182
           S+SNYARRRGRG+ IL G G V +VTLRQP+A P+ SV+TLHGRFEILSLTGT LPPPAP
Sbjct: 121 SVSNYARRRGRGVCILSGGGAVTDVTLRQPAA-PSGSVVTLHGRFEILSLTGTALPPPAP 180

Query: 183 PQAGGLSIFLAGAQGQVVGGTVVGPLVASGPVILMAASFSNAVFERLPLEEEEEGGVQVQ 242
           P AGGL+I+L G QGQVVGG VVGPLVASGPV+LMAASF+NAV++RLPLEEEEE  VQVQ
Sbjct: 181 PGAGGLTIYLGGGQGQVVGGRVVGPLVASGPVLLMAASFANAVYDRLPLEEEEESPVQVQ 240

Query: 243 PTASQSSGVT-GGGQMGEGGG----NANEGSGGGVGFLGNSTIAGYPPFAGDLFGWGSGS 302
           PTASQSSGVT GGGQ+G+GG      A  G+G GV F       G  PF GD+FGW    
Sbjct: 241 PTASQSSGVTGGGGQLGDGGNGSTTTAGGGAGAGVPFYNLGPNMGNYPFPGDVFGW---- 287

Query: 303 GSGNATKP 305
            +G AT+P
Sbjct: 301 -NGGATRP 287

BLAST of CmaCh09G000200 vs. NCBI nr
Match: gi|225457666|ref|XP_002273442.1| (PREDICTED: putative DNA-binding protein ESCAROLA [Vitis vinifera])

HSP 1 Score: 309.3 bits (791), Expect = 7.5e-81
Identity = 194/311 (62.38%), Positives = 223/311 (71.70%), Query Frame = 1

Query: 3   MAGYSEEGGAGSRYVQHHHHHHQLLNPELHLDTPSSSSIPFPHPHLFNHSNHSDDGDDNN 62
           M GY  E G+GSRYV      HQLL PELHL  PSS  +P  H      S+  D+  D+ 
Sbjct: 1   MEGY--EPGSGSRYV------HQLLGPELHLQRPSS--LP-QHQATQQPSDSRDESPDDQ 60

Query: 63  DTNNTTNPSQNQHA------SLRRPRGRPPGSKNKPKPPIILTRDSPNVLGSHVLEVSAS 122
           +    T  +    +      S RRPRGRPPGSKNKPKPPII+TRDSPN L SHVLEV+A 
Sbjct: 61  EQRADTEEAAAASSGGATTSSNRRPRGRPPGSKNKPKPPIIVTRDSPNALRSHVLEVAAG 120

Query: 123 ADIVDSLSNYARRRGRGLSILCGTGTVANVTLRQPSASPTASVITLHGRFEILSLTGTVL 182
           AD+++S+ NYARRRGRG+ +L G GTV NVTLRQP ASP  S++TLHGRFEILSL+GTVL
Sbjct: 121 ADVMESVLNYARRRGRGVCVLSGGGTVMNVTLRQP-ASPAGSIVTLHGRFEILSLSGTVL 180

Query: 183 PPPAPPQAGGLSIFLAGAQGQVVGGTVVGPLVASGPVILMAASFSNAVFERLPLEEEEEG 242
           PPPAPP AGGLSIFL+G QGQVVGG+VVGPL+ASGPV+LMAASF+NAVFERLPL EEEEG
Sbjct: 181 PPPAPPSAGGLSIFLSGGQGQVVGGSVVGPLMASGPVVLMAASFANAVFERLPL-EEEEG 240

Query: 243 GVQVQPTASQSSGVTG---GGQMGEGGGNANEGSGGGVGFLGNSTIAGYPPFAGDLFGWG 302
            VQVQPTASQSSGVTG   GGQ+G+GGG+   G G GV         G  PF GDL  WG
Sbjct: 241 AVQVQPTASQSSGVTGGGAGGQLGDGGGS---GGGAGVPIYNMGASMGNFPFPGDLLRWG 290

Query: 303 SGSGSGNATKP 305
                G+A +P
Sbjct: 301 -----GSAPRP 290

BLAST of CmaCh09G000200 vs. NCBI nr
Match: gi|566173628|ref|XP_002307001.2| (hypothetical protein POPTR_0005s27850g [Populus trichocarpa])

HSP 1 Score: 308.9 bits (790), Expect = 9.8e-81
Identity = 192/317 (60.57%), Positives = 228/317 (71.92%), Query Frame = 1

Query: 3   MAGYSEEGGAGSRYVQHHHHHHQLLNPELHL----------DTPSSSSIPFPHPHLFNHS 62
           MAG+    G  SRYV H  +H+ LL PELHL          D+  +++ P P  H    +
Sbjct: 1   MAGFE---GNNSRYV-HGQNHNNLLRPELHLIQRPSSIPSSDSRDNNNTPSPPDHANQTA 60

Query: 63  NHSDDGDDNNDTNNTTNPSQNQHASLRRPRGRPPGSKNKPKPPIILTRDSPNVLGSHVLE 122
           +H  D      +   TNP+       RRPRGRP GSKNKPKPPII+TRDSPN L SHV+E
Sbjct: 61  HHHPDSSATTSSGGGTNPN-------RRPRGRPAGSKNKPKPPIIVTRDSPNALRSHVIE 120

Query: 123 VSASADIVDSLSNYARRRGRGLSILCGTGTVANVTLRQPSASPTASVITLHGRFEILSLT 182
           +S  ADIV+S+S YAR+RGRG+ +L G+GTVANVTLRQP ASP  SV+TLHGRFEILSL+
Sbjct: 121 ISNGADIVESVSTYARKRGRGVCVLSGSGTVANVTLRQP-ASPAGSVLTLHGRFEILSLS 180

Query: 183 GTVLPPPAPPQAGGLSIFLAGAQGQVVGGTVVGPLVASGPVILMAASFSNAVFERLPLEE 242
           GTVLPPPAPP AGGLSIFL+G QGQVVGG VVGPL+A+GPV+LMAASF+NAVFERLPL++
Sbjct: 181 GTVLPPPAPPGAGGLSIFLSGGQGQVVGGNVVGPLMAAGPVVLMAASFANAVFERLPLDD 240

Query: 243 EEE-GGVQVQPTASQSSGVTG-GGQMGEGGGNANEGSGGGVGF--LGNSTIAGYPPFAGD 302
           +EE G VQVQPTASQSSGVTG GGQMG+GGG +  G G G GF  +      G  PF+GD
Sbjct: 241 QEEAGAVQVQPTASQSSGVTGSGGQMGDGGGGSGTG-GAGSGFFNMAGGAHHGNYPFSGD 299

Query: 303 LFG-WGSGSGSGNATKP 305
           LFG WG     G+A +P
Sbjct: 301 LFGPWG-----GSAARP 299

The following BLAST results are available for this feature:
Match Name    E-value   Identity (%)  Description
AHL27_ARATH   1.3e-69   52.57         AT-hook motif nuclear-localized protein 27 OS=Arabidopsis thaliana GN=AHL27 PE=1...
AHL29_ARATH   8.6e-69   54.43         AT-hook motif nuclear-localized protein 29 OS=Arabidopsis thaliana GN=AHL29 PE=1...
AHL25_ARATH   5.8e-65   52.74         AT-hook motif nuclear-localized protein 25 OS=Arabidopsis thaliana GN=AHL25 PE=1...
AHL19_ARATH   1.1e-55   49.48         AT-hook motif nuclear-localized protein 19 OS=Arabidopsis thaliana GN=AHL19 PE=2...
AHL24_ARATH   9.6e-52   54.67         AT-hook motif nuclear-localized protein 24 OS=Arabidopsis thaliana GN=AHL24 PE=2...

Match Name         E-value   Identity (%)  Description
A0A067KKW8_JATCU   6.2e-82   62.87         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_08028 PE=4 SV=1
A0A061FMX7_THECC   6.2e-82   63.85         AT-hook DNA-binding family protein, putative OS=Theobroma cacao GN=TCM_034810 PE...
F6I0P7_VITVI       4.0e-81   62.34         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_03s0038g04110 PE=4 SV=...
B9H5W8_POPTR       6.8e-81   60.57         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0005s27850g PE=4 SV=2
B9R8U1_RICCO       8.9e-81   62.58         DNA binding protein, putative OS=Ricinus communis GN=RCOM_1602220 PE=4 SV=1

Match Name    E-value   Identity (%)  Description
AT1G20900.1   7.5e-71   52.57         Predicted AT-hook DNA-binding family protein
AT1G76500.1   4.9e-70   54.43         Predicted AT-hook DNA-binding family protein
AT4G35390.1   3.3e-66   52.74         AT-hook protein of GA feedback 1
AT3G04570.1   6.2e-57   49.48         AT-hook motif nuclear-localized protein 19
AT4G22810.1   5.4e-53   54.67         Predicted AT-hook DNA-binding family protein

Match Name                         E-value   Identity (%)  Description
gi|802604323|ref|XP_012073562.1|   8.9e-82   62.87         PREDICTED: AT-hook motif nuclear-localized protein 25-like [Jatropha curcas]
gi|590597591|ref|XP_007018654.1|   8.9e-82   63.85         AT-hook DNA-binding family protein, putative [Theobroma cacao]
gi|731382545|ref|XP_010645606.1|   5.8e-81   62.34         PREDICTED: putative DNA-binding protein ESCAROLA [Vitis vinifera]
gi|225457666|ref|XP_002273442.1|   7.5e-81   62.38         PREDICTED: putative DNA-binding protein ESCAROLA [Vitis vinifera]
gi|566173628|ref|XP_002307001.2|   9.8e-81   60.57         hypothetical protein POPTR_0005s27850g [Populus trichocarpa]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR005175   PPC_dom
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0003677       DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
CmaCh09G000200.1   CmaCh09G000200.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term    IPR Description    Source    Source Term          Source Description                                   Alignment
IPR005175   PPC domain         PFAM      PF03479              DUF296                                               coord: 109..221; score: 5.0
IPR005175   PPC domain         PROFILE   PS51742              PPC                                                  coord: 103..243; score: 33
None        No IPR available   GENE3D    G3DSA:3.30.1330.80   -                                                    coord: 108..245; score: 1.7
None        No IPR available   PANTHER   PTHR31100            FAMILY NOT NAMED                                     coord: 18..295; score: 1.2E
None        No IPR available   PANTHER   PTHR31100:SF5        AT-HOOK MOTIF NUCLEAR LOCALIZED PROTEIN 29-RELATED   coord: 18..295; score: 1.2E
None        No IPR available   unknown   SSF117856            AF0104/ALDC/Ptd012-like                              coord: 105..234; score: 3.14

The following gene(s) are orthologous to this gene:
Gene             Orthologue       Organism                    Block
CmaCh09G000200   CmoCh09G000190   Cucurbita moschata (Rifu)   cmacmoB027
The following gene(s) are paralogous to this gene:

None