CmaCh20G010750 (gene) Cucurbita maxima (Rimu)

Name: CmaCh20G010750
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Trihelix transcription factor GT-2
Location: Cma_Chr20 : 8150436 .. 8153081 (+)
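
As a quick sanity check on the coordinates above, the span length can be computed from the location string. This is a minimal sketch only: the helper name `parse_location` and its regular expression are assumptions about the `chromosome : start .. end (strand)` layout shown here, not part of any database API, and the coordinates are assumed to be 1-based and inclusive.

```python
import re

def parse_location(text):
    """Parse a location such as 'Cma_Chr20 : 8150436 .. 8153081 (+)' (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Cma_Chr20 : 8150436 .. 8153081 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the feature spans 2646 bp on the plus strand.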
The following sequences are available for this feature:

Gene sequence (with intron)

AATGGCGGATGCGAAGCTTGAAGGTAAGAGTGAGCTATGGAATCCCATAGCTTAAAATCCTTCTCTTCCTCGAATTTACGCTTTACCGATACCTTCAGCCCTACAATTTTTATTTACATTTCTTACTTTTCATCACTCCCTTCACACTTTAATTTCTTTTCTCTTCTCCGGAAGCTGTAAAATCCCTATAACTGATGCGCGAAATTTCGCCTTCACCGGAAATTTCTACCGCCGTCGTGAACCGTGCCTCCGAGGATGATGATGTGGCGGCTTCCGCCGGACTTGAGGAGGAGGTTGACCGGAATTGGGCCGGTAATCGGTGGCCGCGAGAGGAGACTATGGCGTTACTGAAGGTGCGGTCTAGAATGGACTCTGTGTTTAGGGATTCGAGCATTAAAGCTCCTCTTTGGGAAGAAGTATCCAGGTTAGTTCGGATTCGAAGATTTTCTATTTGTTGATTACTCTCTATTTGCTTGGAACTGTTTGGAATTTGGATTGGATTCTTACTTCTGTTTGGAATTTGGATTGAATTTGGATGCGAACTGTTCGAATTGTTGATTACTCTCTATTTGCTTGGAATTTGGATTGGATTCTTACTTCTCTTCATTGATTCTCCTGGAAAATTGTTTCGTTCTTCTGATTAACTTGTGATGTTCTATGCTCTTCTGGTTCCCTAAATTGAATGTTCTGAATGGAAGCTATGGAGTTTTGAAACTGAATCTTATGTGATAAGGGCTGCTAGATTTGCAAAGCTAGGCGTACCTCTAGAATCCTTGCAAACTTTTACAAGTAAAGCTAACTTAACGGATAAATAGCTCTTAGATCGAACTTGGAATTTATGATGGTCACTCTTAGGTACCAAGACGACTTGTTACAGAATTAGCTTAATCAAGACTAATCCAATGAATCGGTTTAACATCAAACTTAAAGCAAGGTTTGAAAAAGAATAACGGTTTCAAGAAAACACAAGTTTTTTGTATTTATTGTCATTTCATTCTTTTCTTGATTTGATACTTAAATCTTCTTTGCCTTACTTCTTGTTATTGCTCCTTCGGGTACATGCCATTGATTAGTGGTTGGATTCATATCGACTAGGAAATTAGCTGAGTTTGGGTATAATAGAAGTGCGAAGAAATGCAAAGAGAAGTTTGAGAACATTTATAAGTATCACAAAAGGACTAAAGATGGCAGATCAAGCAAACCGAATGGAAAAAATTATAGGTATTTTGAGCAATTACAAGCTCTAGATAATCATCCATTGCTTCCCTCTCAAGCTGATTCAAAGGAAGAATTCCCAAACAAAATTGTTCACAATGCAATTCCATGTTCCATAATAAACCCGGATTCGAATTTCGTTGAAACTACCACCACTTTGATATCGATGTCAACCACATCTTGCTCGAGTAAGGAGTCGGGTGGGACGAGGAAGAAGAAGAAGAAGAGGAAGTTGGTGGACTTTTTTGGGAGGTTAATGGAGGAGGTGATTGAAAAACAGGAGAAATTGCAAAAGAAGTTTGTGGAGGCTTTGGAGAAATGTGAAGAAGAGAGGTTAGCTAGGGAAGAAGAATGGAAGATGCAAGAATTAGCTCGAATCAAGAACGAGCGAGAGCGTTTTAATCACGAGAGATCCATTGCAGCTGCAAAGGATGCAGCTGTTCTTTCGTTCTTGAAGGCATTTTCGGAACAGGTTGGCACGGTGCAGTTTCCCGAAAGCTTGATTCTGATGGAGAATTTGACTGGCAGGCAAGATCATAGTAATGTTGACAGAAATACAAGTACTCGAGAGAACGGAAACGATGGTAATTCGAATCAGATTAGCTCGTCTCGATGGCCGAAAGACGAGATCAATGCTCTGATTCAGCTCAAGACTAACCTGCAGATGAAGTACCAAGAAAATGGCCCTAAAGGTCCTCTTTGGGAGGAGATATCATTGTCCATGAAGAAACTTGGGTATGATAGAAATGCAAAGAGGTGTAAAGAGAAATGGGAGAACATCAACA
AATACTTCAAGAGAGTAAAGGAAAGCAACAAAAAGCGACCCGAGGACTCGAAGACATGTTCTTATTTCCAGCAGCTCGATGCATTGTACAAACAGAAATCCAAGAAAGTCATCGACAATCATCCGAATTACGAACTGAAACCCGAGGAACTATTGATGCACATGATGGGCAGCCAAGAAGAAAACCACCAACCCGAATCAGCAACAGACTATGGCAAAGCTGAGAATGCGGATAAGAACCAAGAAGACGATGACGATGACGATGACGATAACAACGACGAAGATGAATATTATTATATTGTAGCCAACAACAACAGCAATCAAGTGGAAGTAGGCAGCTGAAAGACAGATGAACAGCAAAATCAAATGGCTCAAATGGTCTCGCGTCTCGCATCGTGTTAAAAAGGTCAGATCCTTCAAATCTCTGTCTACATATATATATATATATATGATTGTATATCTTTCTTTGTTTCTTGCATAGATGATGATACTTGTAAGCTAGAGAGAGATAGAATTGTTTATTGTGTGCAAAAGTTGAAAAGTGTTTGTTTGTCTCTGTATCAATAAACAAAAAACCTGCATTATTTTGCTGGTAAGATGCTGTAGTAGGCTGTTTAATGATAATGCAAAAGCTTCTATTCATCTTC

mRNA sequence

AATGGCGGATGCGAAGCTTGAAGGTAAGAGTGAGCTATGGAATCCCATAGCTTAAAATCCTTCTCTTCCTCGAATTTACGCTTTACCGATACCTTCAGCCCTACAATTTTTATTTACATTTCTTACTTTTCATCACTCCCTTCACACTTTAATTTCTTTTCTCTTCTCCGGAAGCTGTAAAATCCCTATAACTGATGCGCGAAATTTCGCCTTCACCGGAAATTTCTACCGCCGTCGTGAACCGTGCCTCCGAGGATGATGATGTGGCGGCTTCCGCCGGACTTGAGGAGGAGGTTGACCGGAATTGGGCCGGTAATCGGTGGCCGCGAGAGGAGACTATGGCGTTACTGAAGGTGCGGTCTAGAATGGACTCTGTGTTTAGGGATTCGAGCATTAAAGCTCCTCTTTGGGAAGAAGTATCCAGGAAATTAGCTGAGTTTGGGTATAATAGAAGTGCGAAGAAATGCAAAGAGAAGTTTGAGAACATTTATAAGTATCACAAAAGGACTAAAGATGGCAGATCAAGCAAACCGAATGGAAAAAATTATAGGTATTTTGAGCAATTACAAGCTCTAGATAATCATCCATTGCTTCCCTCTCAAGCTGATTCAAAGGAAGAATTCCCAAACAAAATTGTTCACAATGCAATTCCATGTTCCATAATAAACCCGGATTCGAATTTCGTTGAAACTACCACCACTTTGATATCGATGTCAACCACATCTTGCTCGAGTAAGGAGTCGGGTGGGACGAGGAAGAAGAAGAAGAAGAGGAAGTTGGTGGACTTTTTTGGGAGGTTAATGGAGGAGGTGATTGAAAAACAGGAGAAATTGCAAAAGAAGTTTGTGGAGGCTTTGGAGAAATGTGAAGAAGAGAGGTTAGCTAGGGAAGAAGAATGGAAGATGCAAGAATTAGCTCGAATCAAGAACGAGCGAGAGCGTTTTAATCACGAGAGATCCATTGCAGCTGCAAAGGATGCAGCTGTTCTTTCGTTCTTGAAGGCATTTTCGGAACAGGTTGGCACGGTGCAGTTTCCCGAAAGCTTGATTCTGATGGAGAATTTGACTGGCAGGCAAGATCATAGTAATGTTGACAGAAATACAAGTACTCGAGAGAACGGAAACGATGGTAATTCGAATCAGATTAGCTCGTCTCGATGGCCGAAAGACGAGATCAATGCTCTGATTCAGCTCAAGACTAACCTGCAGATGAAGTACCAAGAAAATGGCCCTAAAGGTCCTCTTTGGGAGGAGATATCATTGTCCATGAAGAAACTTGGGTATGATAGAAATGCAAAGAGGTGTAAAGAGAAATGGGAGAACATCAACAAATACTTCAAGAGAGTAAAGGAAAGCAACAAAAAGCGACCCGAGGACTCGAAGACATGTTCTTATTTCCAGCAGCTCGATGCATTGTACAAACAGAAATCCAAGAAAGTCATCGACAATCATCCGAATTACGAACTGAAACCCGAGGAACTATTGATGCACATGATGGGCAGCCAAGAAGAAAACCACCAACCCGAATCAGCAACAGACTATGGCAAAGCTGAGAATGCGGATAAGAACCAAGAAGACGATGACGATGACGATGACGATAACAACGACGAAGATGAATATTATTATATTGTAGCCAACAACAACAGCAATCAAGTGGAAGTAGGCAGCTGAAAGACAGATGAACAGCAAAATCAAATGGCTCAAATGGTCTCGCGTCTCGCATCGTGTTAAAAAGGTCAGATCCTTCAAATCTCTGTCTACATATATATATATATATATGATTGTATATCTTTCTTTGTTTCTTGCATAGATGATGATACTTGTAAGCTAGAGAGAGATAGAATTGTTTATTGTGTGCAAAAGTTGAAAAGTGTTTGTTTGTCTCTGTATCAATAAACAAAAAACCTGCATTATTTTGCTGGTAAGATGCTGTAGTAGGCTGTTTAATGATAATGCAAAAGCTTCTATTCATCTTC

Coding sequence (CDS)

ATGCGCGAAATTTCGCCTTCACCGGAAATTTCTACCGCCGTCGTGAACCGTGCCTCCGAGGATGATGATGTGGCGGCTTCCGCCGGACTTGAGGAGGAGGTTGACCGGAATTGGGCCGGTAATCGGTGGCCGCGAGAGGAGACTATGGCGTTACTGAAGGTGCGGTCTAGAATGGACTCTGTGTTTAGGGATTCGAGCATTAAAGCTCCTCTTTGGGAAGAAGTATCCAGGAAATTAGCTGAGTTTGGGTATAATAGAAGTGCGAAGAAATGCAAAGAGAAGTTTGAGAACATTTATAAGTATCACAAAAGGACTAAAGATGGCAGATCAAGCAAACCGAATGGAAAAAATTATAGGTATTTTGAGCAATTACAAGCTCTAGATAATCATCCATTGCTTCCCTCTCAAGCTGATTCAAAGGAAGAATTCCCAAACAAAATTGTTCACAATGCAATTCCATGTTCCATAATAAACCCGGATTCGAATTTCGTTGAAACTACCACCACTTTGATATCGATGTCAACCACATCTTGCTCGAGTAAGGAGTCGGGTGGGACGAGGAAGAAGAAGAAGAAGAGGAAGTTGGTGGACTTTTTTGGGAGGTTAATGGAGGAGGTGATTGAAAAACAGGAGAAATTGCAAAAGAAGTTTGTGGAGGCTTTGGAGAAATGTGAAGAAGAGAGGTTAGCTAGGGAAGAAGAATGGAAGATGCAAGAATTAGCTCGAATCAAGAACGAGCGAGAGCGTTTTAATCACGAGAGATCCATTGCAGCTGCAAAGGATGCAGCTGTTCTTTCGTTCTTGAAGGCATTTTCGGAACAGGTTGGCACGGTGCAGTTTCCCGAAAGCTTGATTCTGATGGAGAATTTGACTGGCAGGCAAGATCATAGTAATGTTGACAGAAATACAAGTACTCGAGAGAACGGAAACGATGGTAATTCGAATCAGATTAGCTCGTCTCGATGGCCGAAAGACGAGATCAATGCTCTGATTCAGCTCAAGACTAACCTGCAGATGAAGTACCAAGAAAATGGCCCTAAAGGTCCTCTTTGGGAGGAGATATCATTGTCCATGAAGAAACTTGGGTATGATAGAAATGCAAAGAGGTGTAAAGAGAAATGGGAGAACATCAACAAATACTTCAAGAGAGTAAAGGAAAGCAACAAAAAGCGACCCGAGGACTCGAAGACATGTTCTTATTTCCAGCAGCTCGATGCATTGTACAAACAGAAATCCAAGAAAGTCATCGACAATCATCCGAATTACGAACTGAAACCCGAGGAACTATTGATGCACATGATGGGCAGCCAAGAAGAAAACCACCAACCCGAATCAGCAACAGACTATGGCAAAGCTGAGAATGCGGATAAGAACCAAGAAGACGATGACGATGACGATGACGATAACAACGACGAAGATGAATATTATTATATTGTAGCCAACAACAACAGCAATCAAGTGGAAGTAGGCAGCTGA

Protein sequence

MREISPSPEISTAVVNRASEDDDVAASAGLEEEVDRNWAGNRWPREETMALLKVRSRMDSVFRDSSIKAPLWEEVSRKLAEFGYNRSAKKCKEKFENIYKYHKRTKDGRSSKPNGKNYRYFEQLQALDNHPLLPSQADSKEEFPNKIVHNAIPCSIINPDSNFVETTTTLISMSTTSCSSKESGGTRKKKKKRKLVDFFGRLMEEVIEKQEKLQKKFVEALEKCEEERLAREEEWKMQELARIKNERERFNHERSIAAAKDAAVLSFLKAFSEQVGTVQFPESLILMENLTGRQDHSNVDRNTSTRENGNDGNSNQISSSRWPKDEINALIQLKTNLQMKYQENGPKGPLWEEISLSMKKLGYDRNAKRCKEKWENINKYFKRVKESNKKRPEDSKTCSYFQQLDALYKQKSKKVIDNHPNYELKPEELLMHMMGSQEENHQPESATDYGKAENADKNQEDDDDDDDDNNDEDEYYYIVANNNSNQVEVGS
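
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS in frame 1. The following is a minimal sketch, assuming the standard nuclear genetic code; the function name `translate` and the short test fragments (the first 27 and last 12 bases of the CDS listed above) are chosen here for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        # Unknown or ambiguous codons are reported as 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGCGCGAAATTTCGCCTTCACCGGAA"  # first 9 codons of the CDS above
cds_end = "GTAGGCAGCTGA"                   # last 4 codons, including the stop
print(translate(cds_start), translate(cds_end))
```

The two fragments translate to `MREISPSPE` and `VGS`, which match the start and end of the protein sequence listed above.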
BLAST of CmaCh20G010750 vs. Swiss-Prot
Match: TGT2_ARATH (Trihelix transcription factor GT-2 OS=Arabidopsis thaliana GN=GT-2 PE=2 SV=1)

HSP 1 Score: 207.2 bits (526), Expect = 4.0e-52
Identity = 156/386 (40.41%), Positives = 223/386 (57.77%), Query Frame = 1
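
The percentages in an HSP summary line are the identical and positive (conservatively substituted) positions divided by the alignment length. A minimal sketch of recomputing them follows; `parse_hsp_stats` is a hypothetical helper written for this page's line layout, not a function of any BLAST toolkit.

```python
import re

def parse_hsp_stats(line):
    """Extract identity/positive counts from an HSP summary line (hypothetical helper)."""
    ident = re.search(r"Identity\s*=\s*(\d+)/(\d+)", line)
    # Accept both 'Positives' and the misspelling 'Postives' seen in some exports.
    pos = re.search(r"Posi?tives\s*=\s*(\d+)/(\d+)", line)
    if ident is None or pos is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    n_ident, aln_len = int(ident.group(1)), int(ident.group(2))
    n_pos = int(pos.group(1))
    return {
        "alignment_length": aln_len,
        "identity_pct": round(100 * n_ident / aln_len, 2),
        "positives_pct": round(100 * n_pos / aln_len, 2),
    }

stats = parse_hsp_stats(
    "Identity = 156/386 (40.41%), Positives = 223/386 (57.77%), Query Frame = 1"
)
print(stats)
```

For this HSP the recomputed values agree with the reported 40.41% identity and 57.77% positives over 386 aligned positions.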

Query: 129 NHPLLPSQADSKEEFPNKIVHNAIPCS---IINPDSNFVETTTTLISMSTTSCSSKESGG 188
           N   L  Q  S   FP    +N    S   I N   N V +     S +++S +S E   
Sbjct: 188 NPTFLAKQPSSTTPFPFYSSNNTTTVSQPPISNDLMNNVSSLNLFSSSTSSSTASDEEED 247

Query: 189 TRKKKKKRKLVDF----FGRLMEEVIEKQEKLQKKFVEALEKCEEERLAREEEWKMQELA 248
             + K  RK   +    F +L +E++EKQEK+QK+F+E LE  E+ER++REE W++QE+ 
Sbjct: 248 HHQVKSSRKKRKYWKGLFTKLTKELMEKQEKMQKRFLETLEYREKERISREEAWRVQEIG 307

Query: 249 RIKNERERFNHERSIAAAKDAAVLSFLKAFSEQVGTVQFPE--SLILMENLTGRQDHSNV 308
           RI  E E   HERS AAAKDAA++SFL   S   G  Q P+  +    +    + DHS  
Sbjct: 308 RINREHETLIHERSNAAAKDAAIISFLHKISG--GQPQQPQQHNHKPSQRKQYQSDHSIT 367

Query: 309 DRNTSTR--------ENGNDGNSNQIS--SSRWPKDEINALIQLKTNLQMKYQENGPKGP 368
             +   R        + GN  N++ +S  SSRWPK E+ ALI+++ NL+  YQENG KGP
Sbjct: 368 FESKEPRAVLLDTTIKMGNYDNNHSVSPSSSRWPKTEVEALIRIRKNLEANYQENGTKGP 427

Query: 369 LWEEISLSMKKLGYDRNAKRCKEKWENINKYFKRVKESNKKRPEDSKTCSYFQQLDALYK 428
           LWEEIS  M++LGY+R+AKRCKEKWENINKYFK+VKESNKKRP DSKTC YF QL+ALY 
Sbjct: 428 LWEEISAGMRRLGYNRSAKRCKEKWENINKYFKKVKESNKKRPLDSKTCPYFHQLEALYN 487

Query: 429 QKSKKVIDNHP-NYELKPEELLMHMMGSQE--ENHQPESATDYGKAENADKNQED-DDDD 488
           +++K      P    + P+  L+    +Q   E  Q E   D    E  +  +++ D+++
Sbjct: 488 ERNKSGAMPLPLPLMVTPQRQLLLSQETQTEFETDQREKVGDKEDEEEGESEEDEYDEEE 547

Query: 489 DDDNNDEDEYYYIVANNNSNQVEVGS 492
           + + ++E   + IV N  S+ +++ +
Sbjct: 548 EGEGDNETSEFEIVLNKTSSPMDINN 571

BLAST of CmaCh20G010750 vs. Swiss-Prot
Match: PTL_ARATH (Trihelix transcription factor PTL OS=Arabidopsis thaliana GN=PTL PE=2 SV=1)

HSP 1 Score: 185.7 bits (470), Expect = 1.3e-45
Identity = 142/417 (34.05%), Positives = 232/417 (55.64%), Query Frame = 1

Query: 42  RWPREETMALLKVRSRMDSVFRDSSIKAPLWEEVSRKLAE-FGYNRSAKKCKEKFENIYK 101
           RWPR+ET+ LL++RSR+D  F++++ K PLW+EVSR ++E  GY RS KKC+EKFEN+YK
Sbjct: 119 RWPRQETLTLLEIRSRLDHKFKEANQKGPLWDEVSRIMSEEHGYQRSGKKCREKFENLYK 178

Query: 102 YHKRTKDGRSSKPNGKNYRYFEQLQAL--DNHPLLPSQADSKEEFPNKI--VHNAIPCSI 161
           Y+++TK+G++ + +GK+YR+F QL+AL  D++ L+     + +   + +   H   P ++
Sbjct: 179 YYRKTKEGKAGRQDGKHYRFFRQLEALYGDSNNLVSCPNHNTQFMSSALHGFHTQNPMNV 238

Query: 162 INPDSNFVET----------------TTTLISMSTTSCSSKESGGTRKKKK-KRKLVDFF 221
               SN                     ++ + + T+S    +S   RKK+  K K+ +F 
Sbjct: 239 TTTTSNIHNVDSVHGFHQSLSLSNNYNSSELELMTSSSEGNDSSSRRKKRSWKAKIKEFI 298

Query: 222 GRLMEEVIEKQEKLQKKFVEALEKCEEERLAREEEWKMQELARIKNERERFNHERSIAAA 281
              M+ +IE+Q+   +K  + +E  EE+R+ +EEEW+  E ARI  E   +  ER+   A
Sbjct: 299 DTNMKRLIERQDVWLEKLTKVIEDKEEQRMMKEEEWRKIEAARIDKEHLFWAKERARMEA 358

Query: 282 KDAAVLSFLKAFSEQVGTVQFPESLILMENLTGRQDHSNVDRNTS--TRENGNDGN-SNQ 341
           +D AV+  L+  + +      P    L  +   R + +N  RN S    ENG+D   +N 
Sbjct: 359 RDVAVIEALQYLTGK------PLIKPLCSSPEERTNGNNEIRNNSETQNENGSDQTMTNN 418

Query: 342 I----SSSRWPKDEINALIQLKTNLQMKYQE--NGPKGP-LWEEISLSMKKLGYD-RNAK 401
           +    SSS W + EI  L++++T++   +QE   G     LWEEI+  + +LG+D R+A 
Sbjct: 419 VCVKGSSSCWGEQEILKLMEIRTSMDSTFQEILGGCSDEFLWEEIAAKLIQLGFDQRSAL 478

Query: 402 RCKEKWENI-NKYFKRVKESNKKRPEDSKTCSYF---QQLDALYKQKSKKVIDNHPN 422
            CKEKWE I N   K  K+ NKKR ++S +C  +    + + +Y  +     DN P+
Sbjct: 479 LCKEKWEWISNGMRKEKKQINKKRKDNSSSCGVYYPRNEENPIYNNRESGYNDNDPH 529

BLAST of CmaCh20G010750 vs. Swiss-Prot
Match: GTL1_ARATH (Trihelix transcription factor GTL1 OS=Arabidopsis thaliana GN=GTL1 PE=1 SV=2)

HSP 1 Score: 184.5 bits (467), Expect = 2.8e-45
Identity = 119/299 (39.80%), Positives = 172/299 (57.53%), Query Frame = 1

Query: 39  AGNRWPREETMALLKVRSRMDSVFRDSSIKAPLWEEVSRKLAEFGYNRSAKKCKEKFENI 98
           +GNRWPREET+ALL++RS MDS FRD+++KAPLWE VSRKL E GY RS+KKCKEKFEN+
Sbjct: 59  SGNRWPREETLALLRIRSDMDSTFRDATLKAPLWEHVSRKLLELGYKRSSKKCKEKFENV 118

Query: 99  YKYHKRTKDGRSSKPNGKNYRYFEQLQALDNHP----------------LLPSQADSKEE 158
            KY+KRTK+ R  + +GK Y++F QL+AL+  P                L+PS + S   
Sbjct: 119 QKYYKRTKETRGGRHDGKAYKFFSQLEALNTTPPSSSLDVTPLSVANPILMPSSSSSP-- 178

Query: 159 FP----------------NKIVHNAIPCSIINPDSNFVETTTTLISMSTTSCSSKES--- 218
           FP                + +     P  +  P    + T  T  S S+++ S   S   
Sbjct: 179 FPVFSQPQPQTQTQPPQTHNVSFTPTPPPLPLPSMGPIFTGVTFSSHSSSTASGMGSDDD 238

Query: 219 -----------GGTRKKKKKR-------KLVDFFGRLMEEVIEKQEKLQKKFVEALEKCE 278
                       G+  +K+KR       K+++ F  L+ +V++KQ  +Q+ F+EALEK E
Sbjct: 239 DDDMDVDQANIAGSSSRKRKRGNRGGGGKMMELFEGLVRQVMQKQAAMQRSFLEALEKRE 298

Query: 279 EERLAREEEWKMQELARIKNERERFNHERSIAAAKDAAVLSFLKAFSEQVGTVQFPESL 285
           +ERL REE WK QE+AR+  E E  + ER+ +A++DAA++S ++  +    T+Q P SL
Sbjct: 299 QERLDREEAWKRQEMARLAREHEVMSQERAASASRDAAIISLIQKITGH--TIQLPPSL 353

BLAST of CmaCh20G010750 vs. Swiss-Prot
Match: GTL2_ARATH (Trihelix transcription factor GTL2 OS=Arabidopsis thaliana GN=At5g28300 PE=2 SV=1)

HSP 1 Score: 94.4 bits (233), Expect = 3.8e-18
Identity = 43/62 (69.35%), Positives = 50/62 (80.65%), Query Frame = 1

Query: 349 PLWEEISLSMKKLGYDRNAKRCKEKWENINKYFKRVKESNKKRPEDSKTCSYFQQLDALY 408
           PLWE IS  M ++GY R+AKRCKEKWENINKYF++ K+ NKKRP DS+TC YF QL ALY
Sbjct: 497 PLWERISKKMLEIGYKRSAKRCKEKWENINKYFRKTKDVNKKRPLDSRTCPYFHQLTALY 556

Query: 409 KQ 411
            Q
Sbjct: 557 SQ 558

BLAST of CmaCh20G010750 vs. Swiss-Prot
Match: TGT4_ARATH (Trihelix transcription factor GT-4 OS=Arabidopsis thaliana GN=GT-4 PE=2 SV=1)

HSP 1 Score: 63.5 bits (153), Expect = 7.2e-09
Identity = 39/136 (28.68%), Positives = 66/136 (48.53%), Query Frame = 1

Query: 281 PESLILMENLTGRQDHSNVDRNTSTRENGNDGNSNQISSSRWPKDEINALIQLKTNLQMK 340
           P  +IL E+ +G +DH  +       E              W +DE   LI L+  +   
Sbjct: 28  PHQIILGES-SGGEDHEIIKAPKKRAET-------------WAQDETRTLISLRREMDNL 87

Query: 341 YQENGPKGPLWEEISLSMKKLGYDRNAKRCKEKWENINKYFKRVKESNKKRPEDSKT-CS 400
           +  +     LWE+IS  M++ G+DR+   C +KW NI K FK+ K+   K      T  S
Sbjct: 88  FNTSKSNKHLWEQISKKMREKGFDRSPSMCTDKWRNILKEFKKAKQHEDKATSGGSTKMS 147

Query: 401 YFQQLDALYKQKSKKV 416
           Y+ +++ +++++ KKV
Sbjct: 148 YYNEIEDIFRERKKKV 149

BLAST of CmaCh20G010750 vs. TrEMBL
Match: A0A0A0LK12_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G301510 PE=4 SV=1)

HSP 1 Score: 729.6 bits (1882), Expect = 2.6e-207
Identity = 396/494 (80.16%), Positives = 434/494 (87.85%), Query Frame = 1

Query: 1   MREISPSPEISTAVV---NRASEDDDVAASAGLEEEVDRNWAGNRWPREETMALLKVRSR 60
           M EISPSPE S+A V   NR  +++  AASAG+ EE DRNW GNRWPREETMALLKVRS 
Sbjct: 1   MLEISPSPENSSAAVADANRVFKEEAAAASAGVLEEADRNWPGNRWPREETMALLKVRSS 60

Query: 61  MDSVFRDSSIKAPLWEEVSRKLAEFGYNRSAKKCKEKFENIYKYHKRTKDGRSSKPNGKN 120
           MD+ FRD+S+KAPLWEEVSRKL E GYNR+AKKCKEKFENIYKYHKRTKDGRS K NGKN
Sbjct: 61  MDTAFRDASLKAPLWEEVSRKLGELGYNRNAKKCKEKFENIYKYHKRTKDGRSGKSNGKN 120

Query: 121 YRYFEQLQALDNHPLLPSQADSKEEFP----NKIVHNAIPCSIINPDSNFVETTTTLISM 180
           YRYFEQL+ALDNH LLPSQADS EE P    N +VHNAIPCS++NP +NFVETTTT +S 
Sbjct: 121 YRYFEQLEALDNHSLLPSQADSMEEIPRIIPNNVVHNAIPCSVVNPGANFVETTTTSLST 180

Query: 181 STTSCSSKESGGTRKKKKKRKLVDFFGRLMEEVIEKQEKLQKKFVEALEKCEEERLAREE 240
           STTS SSKESGGTRKKK  RK V+FF RLM EVIEKQEKLQKKFVEALEKCE ERLAREE
Sbjct: 181 STTSSSSKESGGTRKKK--RKFVEFFERLMNEVIEKQEKLQKKFVEALEKCEVERLAREE 240

Query: 241 EWKMQELARIKNERERFNHERSIAAAKDAAVLSFLKAFSEQVGTVQFPESLILMENLTGR 300
           EWKMQELARIK ERER N ERSIAAAKDAAVLSFLK FSEQ GTVQFPE+L+LMENLT +
Sbjct: 241 EWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKVFSEQGGTVQFPENLLLMENLTEK 300

Query: 301 QDHSNVDRNTSTRENGNDGNSNQISSSRWPKDEINALIQLKTNLQMKYQENGPKGPLWEE 360
           QD +N +RNTST+EN N+GNSNQISSSRWPK+EI+ALIQL+TNLQMKYQ+NGPKGPLWEE
Sbjct: 301 QDDANGERNTSTQENINNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQDNGPKGPLWEE 360

Query: 361 ISLSMKKLGYDRNAKRCKEKWENINKYFKRVKESNKKRPEDSKTCSYFQQLDALYKQKSK 420
           ISL+MKKLGYDRNAKRCKEKWENINKYFKRVKESNKKRPEDSKTC YFQQLDALYKQKSK
Sbjct: 361 ISLAMKKLGYDRNAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSK 420

Query: 421 KVIDN--HPNYELKPEELLMHMMGSQEENHQPESATDYGKAENADKNQEDDDDDDDDNND 480
           KVI+N  +PNYELKPEELLMHMMGSQEE HQPESATD G+AENAD   ++ +D+ ++  D
Sbjct: 421 KVINNPANPNYELKPEELLMHMMGSQEETHQPESATDDGEAENAD--NQNQEDEGEEGED 480

Query: 481 EDEYYYIVANNNSN 486
           EDE Y IVANNN+N
Sbjct: 481 EDEDYRIVANNNNN 490

BLAST of CmaCh20G010750 vs. TrEMBL
Match: A0A0D2SY59_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G099700 PE=4 SV=1)

HSP 1 Score: 515.0 bits (1325), Expect = 1.0e-142
Identity = 297/493 (60.24%), Positives = 369/493 (74.85%), Query Frame = 1

Query: 1   MREISPSPEISTAVVNRASEDDDVAASAGLEEEVDRNWAGNRWPREETMALLKVRSRMDS 60
           M E S  PE +T   N + E+++ A      EE + N++GNRWPR+ET+ALLK+RS MD 
Sbjct: 1   MMENSSFPESNTVGDNVSLENEEEAKVKN--EESEGNFSGNRWPRQETLALLKIRSEMDV 60

Query: 61  VFRDSSIKAPLWEEVSRKLAEFGYNRSAKKCKEKFENIYKYHKRTKDGRSSKPNGKNYRY 120
            FRDS +KAPLWEEVSRKLAE GYNR AKKCKEKFEN+YKYH+RTK+GRS K NGK+YR+
Sbjct: 61  AFRDSGVKAPLWEEVSRKLAELGYNRGAKKCKEKFENVYKYHRRTKEGRSGKSNGKSYRF 120

Query: 121 FEQLQALDNHPLL--PSQADSKEEF-PNKIVHNAIPCSIINPDSNFVETTTTLISMSTTS 180
           FEQL+ALD+HP L  P+  D      P  ++H+AIP S+ NP SNF ET+T     STTS
Sbjct: 121 FEQLEALDHHPSLVPPASGDINTSVEPLNVIHDAIPFSVRNPASNFNETST-----STTS 180

Query: 181 CSSKESGGTRKKKKKRKLVDFFGRLMEEVIEKQEKLQKKFVEALEKCEEERLAREEEWKM 240
            SSKES GTRKKK  RKL DFF RLM E++EKQE LQKKF+EA+EK E +R+AREE WK+
Sbjct: 181 SSSKESDGTRKKK--RKLTDFFERLMREMMEKQENLQKKFIEAIEKSELDRMAREEAWKV 240

Query: 241 QELARIKNERERFNHERSIAAAKDAAVLSFLKAFSEQVGTVQFPESLILMENLTGRQDHS 300
           QELAR+K ERE    ERSIAAAKDAAVL+FL+ FS+Q  +VQ P+    +E +  RQ++S
Sbjct: 241 QELARLKRERELLVQERSIAAAKDAAVLAFLQKFSDQTTSVQLPDISFPVEKVVDRQENS 300

Query: 301 NVDRNTSTRENGNDGNSNQISSSRWPKDEINALIQLKTNLQMKYQENGPKGPLWEEISLS 360
           N          G++   + +S+SRWPKDE+ ALI+L+TNL M+YQ+ GPKGPLWEEIS +
Sbjct: 301 N----------GSESYMH-LSTSRWPKDEVEALIRLRTNLDMQYQDAGPKGPLWEEISTA 360

Query: 361 MKKLGYDRNAKRCKEKWENINKYFKRVKESNKKRPEDSKTCSYFQQLDALYKQKSKKVID 420
           MKKLGYDR+AKRCKEKWEN+NKYFKRVKESNKKRPEDSKTC YF QLDALYK+K+K++  
Sbjct: 361 MKKLGYDRSAKRCKEKWENMNKYFKRVKESNKKRPEDSKTCPYFHQLDALYKEKTKRI-- 420

Query: 421 NHPNYELKPEELLMHMMGSQEENHQPESATDYGKAENADKNQEDDDDDDDDNNDEDEYYY 480
           +   YELKPEELLMHMMG+QEE    ESAT+  ++EN ++N+E      ++ N E + Y 
Sbjct: 421 DGSGYELKPEELLMHMMGAQEERLHQESATEDVESENVNQNRE------ENRNAEGDAYQ 465

Query: 481 IVANNNSNQVEVG 491
           IVAN+ S    +G
Sbjct: 481 IVANDPSPMPIIG 465

BLAST of CmaCh20G010750 vs. TrEMBL
Match: A0A061DR08_THECC (Duplicated homeodomain-like superfamily protein, putative OS=Theobroma cacao GN=TCM_001348 PE=4 SV=1)

HSP 1 Score: 511.9 bits (1317), Expect = 8.5e-142
Identity = 297/494 (60.12%), Positives = 368/494 (74.49%), Query Frame = 1

Query: 1   MREISPSPEISTAVVNRASEDDDVAASAGLEEEVDRNWAGNRWPREETMALLKVRSRMDS 60
           M E S  PE +T   N + E+++        EE +RN+ GNRWPR+ET+ALLK+RS MD 
Sbjct: 1   MMENSGFPENNTVADNVSLENEEEVTVKN--EESERNFPGNRWPRQETLALLKIRSDMDV 60

Query: 61  VFRDSSIKAPLWEEVSRKLAEFGYNRSAKKCKEKFENIYKYHKRTKDGRSSKPNGKNYRY 120
            FRDS +KAPLWEEVSRKLAE GYNRSAKKCKEKFENIYKYH+RTK+GRS + NGKNYR+
Sbjct: 61  AFRDSGVKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHRRTKEGRSGRSNGKNYRF 120

Query: 121 FEQLQALDNHP-LLPSQAD--SKEEFPNKIVHNAIPCSIINPDSNFVETTTTLISMSTTS 180
           FEQL+ALD+HP LLP      +    P  ++ +AIPCSI NP  +F ET     S STTS
Sbjct: 121 FEQLEALDHHPSLLPPATGHINTSMQPFSVIRDAIPCSIRNPVLSFNET-----SASTTS 180

Query: 181 CSSKESGGTRKKKKKRKLVDFFGRLMEEVIEKQEKLQKKFVEALEKCEEERLAREEEWKM 240
            S KES G RKKK  RKL +FFGRLM EV+EKQE LQKKF+EA+EK E++R+AREE WKM
Sbjct: 181 SSGKESDGMRKKK--RKLTEFFGRLMREVMEKQENLQKKFIEAIEKSEQDRMAREEAWKM 240

Query: 241 QELARIKNERERFNHERSIAAAKDAAVLSFLKAFSEQVGTVQFPESLILMENLTGRQDHS 300
           QEL RIK ERE    ERSIAAAKDAAVL+FL+ FS+Q  +V+ PE+   +E +  RQ++S
Sbjct: 241 QELDRIKRERELLVQERSIAAAKDAAVLAFLQKFSDQATSVRLPETPFPVEKVVERQENS 300

Query: 301 NVDRNTSTRENGNDGNSNQISSSRWPKDEINALIQLKTNLQMKYQENGPKGPLWEEISLS 360
           N          G++   + +SSSRWPKDE+ ALI+L+ NL ++YQ+NGPKGPLWEEIS +
Sbjct: 301 N----------GSESYMH-LSSSRWPKDEVEALIRLRANLDLQYQDNGPKGPLWEEISTA 360

Query: 361 MKKLGYDRNAKRCKEKWENINKYFKRVKESNKKRPEDSKTCSYFQQLDALYKQKSKKVID 420
           MKKLGYDR+AKRCKEKWEN+NKYFKRVKESNKKRPEDSKTC YF QLDALYK+K+K+   
Sbjct: 361 MKKLGYDRSAKRCKEKWENMNKYFKRVKESNKKRPEDSKTCPYFHQLDALYKEKTKRGDG 420

Query: 421 N-HPNYELKPEELLMHMMGSQEENHQPESATDYGKAENADKNQEDDDDDDDDNNDEDEYY 480
           + +  YELKPEELLMHMM + +E    ES T+ G++ENAD+NQE++ + +++  D    Y
Sbjct: 421 SVNSGYELKPEELLMHMMSAPDERPHQESVTEDGESENADQNQEENGNAEEEEGDA---Y 471

Query: 481 YIVANNNSNQVEVG 491
            IVAN+ S    +G
Sbjct: 481 QIVANDPSPMAIIG 471

BLAST of CmaCh20G010750 vs. TrEMBL
Match: F6I0I8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_04s0044g00510 PE=4 SV=1)

HSP 1 Score: 499.2 bits (1284), Expect = 5.7e-138
Identity = 301/530 (56.79%), Positives = 371/530 (70.00%), Query Frame = 1

Query: 1   MREISPSPEIS-TAVVNRASEDDDVAA-SAGLEEEV-------DRNWAGNRWPREETMAL 60
           M  IS  PE S TA   R  ED    A   G EEE        DRN+AGNRWPREET+AL
Sbjct: 1   MLGISDFPESSGTASGGREGEDGGGGAVPTGCEEEERVRGEESDRNFAGNRWPREETLAL 60

Query: 61  LKVRSRMDSVFRDSSIKAPLWEEVSRKLAEFGYNRSAKKCKEKFENIYKYHKRTKDGRSS 120
           LK+RS MD VFRDSS+KAPLWEEVSRKL E GY+R+AKKCKEKFENI+KYHKRTK+GRS+
Sbjct: 61  LKIRSDMDVVFRDSSLKAPLWEEVSRKLGELGYHRNAKKCKEKFENIFKYHKRTKEGRSN 120

Query: 121 KPNGKNYRYFEQLQALDNHPLLPSQADSKEEFPNKIVH-----------------NAIPC 180
           + NGKNYR+FEQL+ALDNHPL+P  +  K E    +                   NA+PC
Sbjct: 121 RQNGKNYRFFEQLEALDNHPLMPPPSPVKYETSTPMAASMPQTNPIDVTNVSQGINAVPC 180

Query: 181 SIINPDSNFVETTTTLISMSTTSCSSKESGGTRKKKKKRKLVDFFGRLMEEVIEKQEKLQ 240
           SI  P  + V  +T     STTS S KES G+RKKK+K  +  FF +LM+EVIEKQE LQ
Sbjct: 181 SIQKPAVDCVAAST-----STTSSSGKESEGSRKKKRKWGV--FFEKLMKEVIEKQENLQ 240

Query: 241 KKFVEALEKCEEERLAREEEWKMQELARIKNERERFNHERSIAAAKDAAVLSFLKAFSEQ 300
           +KF+EA+EKCE++R+AREE WK+QEL RIK E E    ERSIAAAKDAAVL+FL+  +EQ
Sbjct: 241 RKFIEAIEKCEQDRIAREEAWKLQELDRIKREHEILVQERSIAAAKDAAVLAFLQKIAEQ 300

Query: 301 VGTVQFPESLILMENLTGRQDHSNVDRNTSTRENGNDGNSNQISSSRWPKDEINALIQLK 360
            G VQ PE+    E +  +QD+SN +            NS Q+SSSRWPK E+ ALI+L+
Sbjct: 301 AGPVQLPENPS-SEKVFEKQDNSNGE------------NSIQMSSSRWPKAEVEALIRLR 360

Query: 361 TNLQMKYQENGPKGPLWEEISLSMKKLGYDRNAKRCKEKWENINKYFKRVKESNKKRPED 420
           TN  M+YQE+GPKGPLWEEISL+M+K+GY+R+AKRCKEKWENINKYFKRV++SNK+RPED
Sbjct: 361 TNFDMQYQESGPKGPLWEEISLAMRKIGYERSAKRCKEKWENINKYFKRVRDSNKRRPED 420

Query: 421 SKTCSYFQQLDALYKQKSKKV--IDNHPNYELKPEELLMHMMGSQEENHQPESATDYGKA 480
           SKTC YF QLDALYK+K+KKV   DN+  Y LKPE++LM MMG  E+  Q ES T+ G +
Sbjct: 421 SKTCPYFHQLDALYKEKTKKVENPDNNSGYNLKPEDILMQMMGQSEQRPQSESVTEEGGS 480

Query: 481 ENADKNQEDDDDD------------DDDNNDEDEYYYIVANNNSNQVEVG 491
           EN + NQE+++++            D D +DE + Y IVANN S+   +G
Sbjct: 481 ENVNANQEEEEEEEEEEEDGDEEGGDGDEDDEADGYQIVANNTSSMAIMG 510

BLAST of CmaCh20G010750 vs. TrEMBL
Match: W9RGP4_9ROSA (Trihelix transcription factor GT-2 OS=Morus notabilis GN=L484_012188 PE=4 SV=1)

HSP 1 Score: 488.8 bits (1257), Expect = 7.7e-135
Identity = 279/471 (59.24%), Positives = 343/471 (72.82%), Query Frame = 1

Query: 32  EEVDRNWAGNRWPREETMALLKVRSRMDSVFRDSSIKAPLWEEVSRKLAEFGYNRSAKKC 91
           EE DR+W GNRWPR+ET+ALL++RS MDS FRDSS+KAPLWE++SRK+ E GYNRSAKKC
Sbjct: 32  EEGDRSWLGNRWPRQETLALLEIRSDMDSKFRDSSVKAPLWEDISRKMGELGYNRSAKKC 91

Query: 92  KEKFENIYKYHKRTKDGRSSKPNGKNYRYFEQLQALDNHPLLPSQADSKEEF---PNKIV 151
           KEKFENIYKYHKRT+DGRS + NGKNYR+FEQL+ALD+H   P   +        PN +V
Sbjct: 92  KEKFENIYKYHKRTRDGRSGRANGKNYRFFEQLEALDHHSFDPPSMEETRPTTIPPNNVV 151

Query: 152 HNAIPCSIINP-DSNFVETTTTLISMSTTSCSSKESGGTRKKKKKRKLVDFFGRLMEEVI 211
            NAIPCS+  P ++NF E      S S+TS S +ES G RKKK  RKL  FF RLM+EV+
Sbjct: 152 LNAIPCSVHKPVEANFDEN-----SSSSTSSSGEESEGARKKK--RKLTRFFERLMKEVM 211

Query: 212 EKQEKLQKKFVEALEKCEEERLAREEEWKMQELARIKNERERFNHERSIAAAKDAAVLSF 271
           E+QE LQ+KF+E LEKCE++R+AREE WK QEL R+K E E   HER+IAAAKDAAVL+F
Sbjct: 212 ERQESLQRKFIETLEKCEQDRIAREEAWKAQELERLKRESELLVHERAIAAAKDAAVLAF 271

Query: 272 LKAFSEQVGTVQFPESLILMENLTGRQDHSNVDRNT------STRENGNDGNSNQISSSR 331
           LK FSEQ   VQFPE+ I      G +   +   N       S  +  N  N +Q+SSSR
Sbjct: 272 LKKFSEQSDQVQFPENPIASFQKDGDKQEKSQGGNLEQVSLESQEKGSNHRNFSQMSSSR 331

Query: 332 WPKDEINALIQLKTNLQMKYQENGPKGPLWEEISLSMKKLGYDRNAKRCKEKWENINKYF 391
           WPKDE++ALI+L+TNL ++YQ+NGPKGPLWE+IS +M+K+GYDR++KRCKEKWENINKYF
Sbjct: 332 WPKDEVDALIRLRTNLDVQYQDNGPKGPLWEDISAAMRKIGYDRSSKRCKEKWENINKYF 391

Query: 392 KRVKESNKKRPEDSKTCSYFQQLDALYKQKSKKVIDN-HPNYELKPEELLMHMMGSQEEN 451
           KRVK+SNKKR EDSKTC YF QLDALY +K+KK  D+ +  Y+L+PEELLMHMMGSQEE 
Sbjct: 392 KRVKDSNKKRVEDSKTCPYFYQLDALYNKKTKKANDSVNSGYDLRPEELLMHMMGSQEEQ 451

Query: 452 HQP--ESATDYGKAENADKNQEDDDDDDDDNNDEDEYYYIVANNNSNQVEV 490
            Q   ES TD             D ++ +D  D D Y    A+NN +Q+ V
Sbjct: 452 QQRQLESVTD------------QDGEESNDKVDGDGYQTNTADNNPSQMTV 483

BLAST of CmaCh20G010750 vs. TAIR10
Match: AT1G76880.1 (AT1G76880.1 Duplicated homeodomain-like superfamily protein)

HSP 1 Score: 275.0 bits (702), Expect = 8.9e-74
Identity = 216/589 (36.67%), Positives = 304/589 (51.61%), Query Frame = 1

Query: 5   SPSPEISTAVVNRASEDDDVAASAGL----EEEVDRNWAGNRWPREETMALLKVRSRMDS 64
           +P P+ +    N ++  +  AA+ G     EE  DR + GNRWPR+ET+ALLK+RS M  
Sbjct: 23  APPPQSNN---NDSAATEAAAAAVGAFEVSEEMHDRGFGGNRWPRQETLALLKIRSDMGI 82

Query: 65  VFRDSSIKAPLWEEVSRKLAEFGYNRSAKKCKEKFENIYKYHKRTKDGRSSKPNGKNYRY 124
            FRD+S+K PLWEEVSRK+AE GY R+AKKCKEKFEN+YKYHKRTK+GR+ K  GK YR+
Sbjct: 83  AFRDASVKGPLWEEVSRKMAEHGYIRNAKKCKEKFENVYKYHKRTKEGRTGKSEGKTYRF 142

Query: 125 FEQLQALDNH------------PLLPSQADSKEEFPNK------------IVHNAIPCSI 184
           F+QL+AL++             PL P Q ++     N              V   +P S 
Sbjct: 143 FDQLEALESQSTTSLHHHQQQTPLRPQQNNNNNNNNNNNSSIFSTPPPVTTVMPTLPSSS 202

Query: 185 INP-------------DSNFVETTTTLISMSTTSCSSKESGG---TRKKKKKRKLVDFFG 244
           I P               +F+   +T  S S ++ S  E GG   T +KK+KRK   FF 
Sbjct: 203 IPPYTQQINVPSFPNISGDFLSDNSTSSSSSYSTSSDMEMGGGTATTRKKRKRKWKVFFE 262

Query: 245 RLMEEVIEKQEKLQKKFVEALEKCEEERLAREEEWKMQELARIKNERERFNHERSIAAAK 304
           RLM++V++KQE+LQ+KF+EA+EK E ERL REE W++QE+ARI  E E    ERS++AAK
Sbjct: 263 RLMKQVVDKQEELQRKFLEAVEKREHERLVREESWRVQEIARINREHEILAQERSMSAAK 322

Query: 305 DAAVLSFLKAFSEQVGTVQFPESLILMENLTGRQDHSNVDRNTSTRENGNDGNSNQISSS 364
           DAAV++FL+  SE+              N    Q      R +    N N     Q S  
Sbjct: 323 DAAVMAFLQKLSEK------------QPNQPQPQPQPQQVRPSMQLNNNNQQQPPQRSPP 382

Query: 365 RWPKDEINALIQ-LKTNLQMKYQENG-------------PKGPLWEEISLSMKKLGYDR- 424
             P   +   IQ + + L     +NG              + P  E  +L   +   D  
Sbjct: 383 PQPPAPLPQPIQAVVSTLDTTKTDNGGDQNMTPAASASSSRWPKVEIEALIKLRTNLDSK 442

Query: 425 -------------------------NAKRCKEKWENINKYFKRVKESNKKRPEDSKTCSY 484
                                    N+KRCKEKWENINKYFK+VKESNKKRPEDSKTC Y
Sbjct: 443 YQENGPKGPLWEEISAGMRRLGFNRNSKRCKEKWENINKYFKKVKESNKKRPEDSKTCPY 502

Query: 485 FQQLDALYKQKSKKVIDNH------PNYELKPEELLMHMM------------------GS 486
           F QLDALY++++K   +N+       +  +KP+  +  M+                   +
Sbjct: 503 FHQLDALYRERNKFHSNNNIAASSSSSGLVKPDNSVPLMVQPEQQWPPAVTTATTTPAAA 562

BLAST of CmaCh20G010750 vs. TAIR10
Match: AT1G76890.2 (AT1G76890.2 Duplicated homeodomain-like superfamily protein)

HSP 1 Score: 207.2 bits (526), Expect = 2.3e-53
Identity = 156/386 (40.41%), Positives = 223/386 (57.77%), Query Frame = 1

Query: 129 NHPLLPSQADSKEEFPNKIVHNAIPCS---IINPDSNFVETTTTLISMSTTSCSSKESGG 188
           N   L  Q  S   FP    +N    S   I N   N V +     S +++S +S E   
Sbjct: 188 NPTFLAKQPSSTTPFPFYSSNNTTTVSQPPISNDLMNNVSSLNLFSSSTSSSTASDEEED 247

Query: 189 TRKKKKKRKLVDF----FGRLMEEVIEKQEKLQKKFVEALEKCEEERLAREEEWKMQELA 248
             + K  RK   +    F +L +E++EKQEK+QK+F+E LE  E+ER++REE W++QE+ 
Sbjct: 248 HHQVKSSRKKRKYWKGLFTKLTKELMEKQEKMQKRFLETLEYREKERISREEAWRVQEIG 307

Query: 249 RIKNERERFNHERSIAAAKDAAVLSFLKAFSEQVGTVQFPE--SLILMENLTGRQDHSNV 308
           RI  E E   HERS AAAKDAA++SFL   S   G  Q P+  +    +    + DHS  
Sbjct: 308 RINREHETLIHERSNAAAKDAAIISFLHKISG--GQPQQPQQHNHKPSQRKQYQSDHSIT 367

Query: 309 DRNTSTR--------ENGNDGNSNQIS--SSRWPKDEINALIQLKTNLQMKYQENGPKGP 368
             +   R        + GN  N++ +S  SSRWPK E+ ALI+++ NL+  YQENG KGP
Sbjct: 368 FESKEPRAVLLDTTIKMGNYDNNHSVSPSSSRWPKTEVEALIRIRKNLEANYQENGTKGP 427

Query: 369 LWEEISLSMKKLGYDRNAKRCKEKWENINKYFKRVKESNKKRPEDSKTCSYFQQLDALYK 428
           LWEEIS  M++LGY+R+AKRCKEKWENINKYFK+VKESNKKRP DSKTC YF QL+ALY 
Sbjct: 428 LWEEISAGMRRLGYNRSAKRCKEKWENINKYFKKVKESNKKRPLDSKTCPYFHQLEALYN 487

Query: 429 QKSKKVIDNHP-NYELKPEELLMHMMGSQE--ENHQPESATDYGKAENADKNQED-DDDD 488
           +++K      P    + P+  L+    +Q   E  Q E   D    E  +  +++ D+++
Sbjct: 488 ERNKSGAMPLPLPLMVTPQRQLLLSQETQTEFETDQREKVGDKEDEEEGESEEDEYDEEE 547

Query: 489 DDDNNDEDEYYYIVANNNSNQVEVGS 492
           + + ++E   + IV N  S+ +++ +
Sbjct: 548 EGEGDNETSEFEIVLNKTSSPMDINN 571

BLAST of CmaCh20G010750 vs. TAIR10
Match: AT5G03680.1 (AT5G03680.1 Duplicated homeodomain-like superfamily protein)

HSP 1 Score: 185.7 bits (470), Expect = 7.1e-47
Identity = 142/417 (34.05%), Positives = 232/417 (55.64%), Query Frame = 1

Query: 42  RWPREETMALLKVRSRMDSVFRDSSIKAPLWEEVSRKLAE-FGYNRSAKKCKEKFENIYK 101
           RWPR+ET+ LL++RSR+D  F++++ K PLW+EVSR ++E  GY RS KKC+EKFEN+YK
Sbjct: 119 RWPRQETLTLLEIRSRLDHKFKEANQKGPLWDEVSRIMSEEHGYQRSGKKCREKFENLYK 178

Query: 102 YHKRTKDGRSSKPNGKNYRYFEQLQAL--DNHPLLPSQADSKEEFPNKI--VHNAIPCSI 161
           Y+++TK+G++ + +GK+YR+F QL+AL  D++ L+     + +   + +   H   P ++
Sbjct: 179 YYRKTKEGKAGRQDGKHYRFFRQLEALYGDSNNLVSCPNHNTQFMSSALHGFHTQNPMNV 238

Query: 162 INPDSNFVET----------------TTTLISMSTTSCSSKESGGTRKKKK-KRKLVDFF 221
               SN                     ++ + + T+S    +S   RKK+  K K+ +F 
Sbjct: 239 TTTTSNIHNVDSVHGFHQSLSLSNNYNSSELELMTSSSEGNDSSSRRKKRSWKAKIKEFI 298

Query: 222 GRLMEEVIEKQEKLQKKFVEALEKCEEERLAREEEWKMQELARIKNERERFNHERSIAAA 281
              M+ +IE+Q+   +K  + +E  EE+R+ +EEEW+  E ARI  E   +  ER+   A
Sbjct: 299 DTNMKRLIERQDVWLEKLTKVIEDKEEQRMMKEEEWRKIEAARIDKEHLFWAKERARMEA 358

Query: 282 KDAAVLSFLKAFSEQVGTVQFPESLILMENLTGRQDHSNVDRNTS--TRENGNDGN-SNQ 341
           +D AV+  L+  + +      P    L  +   R + +N  RN S    ENG+D   +N 
Sbjct: 359 RDVAVIEALQYLTGK------PLIKPLCSSPEERTNGNNEIRNNSETQNENGSDQTMTNN 418

Query: 342 I----SSSRWPKDEINALIQLKTNLQMKYQE--NGPKGP-LWEEISLSMKKLGYD-RNAK 401
           +    SSS W + EI  L++++T++   +QE   G     LWEEI+  + +LG+D R+A 
Sbjct: 419 VCVKGSSSCWGEQEILKLMEIRTSMDSTFQEILGGCSDEFLWEEIAAKLIQLGFDQRSAL 478

Query: 402 RCKEKWENI-NKYFKRVKESNKKRPEDSKTCSYF---QQLDALYKQKSKKVIDNHPN 422
            CKEKWE I N   K  K+ NKKR ++S +C  +    + + +Y  +     DN P+
Sbjct: 479 LCKEKWEWISNGMRKEKKQINKKRKDNSSSCGVYYPRNEENPIYNNRESGYNDNDPH 529

BLAST of CmaCh20G010750 vs. TAIR10
Match: AT1G33240.1 (AT1G33240.1 GT-2-like 1)

HSP 1 Score: 184.5 bits (467), Expect = 1.6e-46
Identity = 119/299 (39.80%), Positives = 172/299 (57.53%), Query Frame = 1

Query: 39  AGNRWPREETMALLKVRSRMDSVFRDSSIKAPLWEEVSRKLAEFGYNRSAKKCKEKFENI 98
           +GNRWPREET+ALL++RS MDS FRD+++KAPLWE VSRKL E GY RS+KKCKEKFEN+
Sbjct: 59  SGNRWPREETLALLRIRSDMDSTFRDATLKAPLWEHVSRKLLELGYKRSSKKCKEKFENV 118

Query: 99  YKYHKRTKDGRSSKPNGKNYRYFEQLQALDNHP----------------LLPSQADSKEE 158
            KY+KRTK+ R  + +GK Y++F QL+AL+  P                L+PS + S   
Sbjct: 119 QKYYKRTKETRGGRHDGKAYKFFSQLEALNTTPPSSSLDVTPLSVANPILMPSSSSSP-- 178

Query: 159 FP----------------NKIVHNAIPCSIINPDSNFVETTTTLISMSTTSCSSKES--- 218
           FP                + +     P  +  P    + T  T  S S+++ S   S   
Sbjct: 179 FPVFSQPQPQTQTQPPQTHNVSFTPTPPPLPLPSMGPIFTGVTFSSHSSSTASGMGSDDD 238

Query: 219 -----------GGTRKKKKKR-------KLVDFFGRLMEEVIEKQEKLQKKFVEALEKCE 278
                       G+  +K+KR       K+++ F  L+ +V++KQ  +Q+ F+EALEK E
Sbjct: 239 DDDMDVDQANIAGSSSRKRKRGNRGGGGKMMELFEGLVRQVMQKQAAMQRSFLEALEKRE 298

Query: 279 EERLAREEEWKMQELARIKNERERFNHERSIAAAKDAAVLSFLKAFSEQVGTVQFPESL 285
           +ERL REE WK QE+AR+  E E  + ER+ +A++DAA++S ++  +    T+Q P SL
Sbjct: 299 QERLDREEAWKRQEMARLAREHEVMSQERAASASRDAAIISLIQKITGH--TIQLPPSL 353

BLAST of CmaCh20G010750 vs. TAIR10
Match: AT3G10000.1 (AT3G10000.1 Homeodomain-like superfamily protein)

HSP 1 Score: 142.9 bits (359), Expect = 5.3e-34
Identity = 105/306 (34.31%), Positives = 167/306 (54.58%), Query Frame = 1

Query: 42  RWPREETMALLKVRSRMDSVFRDSSIKAPLWEEVSRKLAE-FGYNRSAKKCKEKFENIYK 101
           RWPR+ET+ LL+VRSR+D  F++++ K PLW+EVSR ++E  GY RS KKC+EKFEN+YK
Sbjct: 88  RWPRQETLMLLEVRSRLDHKFKEANQKGPLWDEVSRIMSEEHGYTRSGKKCREKFENLYK 147

Query: 102 YHKRTKDGRSSK-PNGKNYRYFEQLQAL-----------DNHPLLPSQADSKEEFPNKIV 161
           Y+K+TK+G+S +  +GKNYR+F QL+A+           +N   + + A     F    +
Sbjct: 148 YYKKTKEGKSGRRQDGKNYRFFRQLEAIYGESKDSVSCYNNTQFIMTNA-LHSNFRASNI 207

Query: 162 HNAIP------CSIINPDSNFVETTTTLISMSTTSCSSKESGGTRKKKK----KRKLVDF 221
           HN +P       +  N  S  +  +    S S    +S   G    K++    K K+ +F
Sbjct: 208 HNIVPHHQNPLMTNTNTQSQSLSISNNFNSSSDLDLTSSSEGNETTKREGMHWKEKIKEF 267

Query: 222 FGRLMEEVIEKQEKLQKKFVEALEKCEEERLAREEEWKMQELARIKNERERFNHERSIAA 281
            G  ME +IEKQ+   +K ++ +E  E +R+ REEEW+  E  RI  ER  +  ER    
Sbjct: 268 IGVHMERLIEKQDFWLEKLMKIVEDKEHQRMLREEEWRRIEAERIDKERSFWTKERERIE 327

Query: 282 AKDAAVLSFLKAFSEQVGTVQFPESLILMENLTGRQDHSNVDRNTSTRENGNDGNSNQIS 325
           A+D AV++ L+  + +   +  P+S    E + G    +  D+  +  E  ++GN  ++ 
Sbjct: 328 ARDVAVINALQYLTGR--ALIRPDSSSPTERING----NGSDKMMADNEFADEGNKGKMD 386

BLAST of CmaCh20G010750 vs. NCBI nr
Match: gi|659121978|ref|XP_008460913.1| (PREDICTED: trihelix transcription factor GT-2-like [Cucumis melo])

HSP 1 Score: 733.4 bits (1892), Expect = 2.6e-208
Identity = 397/496 (80.04%), Positives = 435/496 (87.70%), Query Frame = 1

Query: 1   MREISPSPEIS-----TAVVNRASEDDDVAASAGLEEEVDRNWAGNRWPREETMALLKVR 60
           M EISPSPE S     TA  NR S++D  AASAG+ EE DRNW GNRWPREETMALLKVR
Sbjct: 1   MLEISPSPENSSAAAATAAANRVSKEDAAAASAGVLEEADRNWPGNRWPREETMALLKVR 60

Query: 61  SRMDSVFRDSSIKAPLWEEVSRKLAEFGYNRSAKKCKEKFENIYKYHKRTKDGRSSKPNG 120
           S MD+ FRD+S+KAPLWEEVSRKL E GYNR+AKKCKEKFENIYKYHKRTKDGRS K NG
Sbjct: 61  SSMDTAFRDASLKAPLWEEVSRKLGELGYNRNAKKCKEKFENIYKYHKRTKDGRSGKSNG 120

Query: 121 KNYRYFEQLQALDNHPLLPSQADSKEEFP----NKIVHNAIPCSIINPDSNFVETTTTLI 180
           KNYRYFEQL+ALDNHPLLPSQADS EE P    N +VHNAIPCS++NP +NFVETTTT +
Sbjct: 121 KNYRYFEQLEALDNHPLLPSQADSMEEIPKIIPNNVVHNAIPCSVVNPGANFVETTTTSL 180

Query: 181 SMSTTSCSSKESGGTRKKKKKRKLVDFFGRLMEEVIEKQEKLQKKFVEALEKCEEERLAR 240
           S STTSCSSKESGGTRKKK  RK V+FF RLM EVIEKQEKLQKKFVEALEKCE ERLAR
Sbjct: 181 STSTTSCSSKESGGTRKKK--RKFVEFFERLMNEVIEKQEKLQKKFVEALEKCEVERLAR 240

Query: 241 EEEWKMQELARIKNERERFNHERSIAAAKDAAVLSFLKAFSEQVGTVQFPESLILMENLT 300
           EEEWKMQELARIK ERER N ERSIAAAKDAAVLSFLK  SEQ GTVQFPE+L+LMENLT
Sbjct: 241 EEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKVISEQGGTVQFPENLLLMENLT 300

Query: 301 GRQDHSNVDRNTSTRENGNDGNSNQISSSRWPKDEINALIQLKTNLQMKYQENGPKGPLW 360
            +QD +N +RNTST+EN N+GNSNQISSSRWPK+EI+ALIQL+TNLQMKYQ++GPKGPLW
Sbjct: 301 EKQDDANGERNTSTQENINNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQDSGPKGPLW 360

Query: 361 EEISLSMKKLGYDRNAKRCKEKWENINKYFKRVKESNKKRPEDSKTCSYFQQLDALYKQK 420
           EEISL+MKKLGYDRNAKRCKEKWENINKYFKRVKESNKKRPEDSKTC YFQQLDALYKQK
Sbjct: 361 EEISLAMKKLGYDRNAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQK 420

Query: 421 SKKVIDN--HPNYELKPEELLMHMMGSQEENHQPESATDYGKAENADKNQEDDDDDDDDN 480
           SKKVI+N  +PNYELKPEELLMHMMGSQEE HQPESATD G+AENAD   ++ +D+ ++ 
Sbjct: 421 SKKVINNPANPNYELKPEELLMHMMGSQEETHQPESATDDGEAENAD--NQNQEDEGEEG 480

Query: 481 NDEDEYYYIVANNNSN 486
            DEDE Y IVAN+N+N
Sbjct: 481 EDEDEDYRIVANSNNN 492

BLAST of CmaCh20G010750 vs. NCBI nr
Match: gi|778670187|ref|XP_004147355.2| (PREDICTED: trihelix transcription factor GT-2-like [Cucumis sativus])

HSP 1 Score: 729.6 bits (1882), Expect = 3.7e-207
Identity = 396/494 (80.16%), Positives = 434/494 (87.85%), Query Frame = 1

Query: 1   MREISPSPEISTAVV---NRASEDDDVAASAGLEEEVDRNWAGNRWPREETMALLKVRSR 60
           M EISPSPE S+A V   NR  +++  AASAG+ EE DRNW GNRWPREETMALLKVRS 
Sbjct: 1   MLEISPSPENSSAAVADANRVFKEEAAAASAGVLEEADRNWPGNRWPREETMALLKVRSS 60

Query: 61  MDSVFRDSSIKAPLWEEVSRKLAEFGYNRSAKKCKEKFENIYKYHKRTKDGRSSKPNGKN 120
           MD+ FRD+S+KAPLWEEVSRKL E GYNR+AKKCKEKFENIYKYHKRTKDGRS K NGKN
Sbjct: 61  MDTAFRDASLKAPLWEEVSRKLGELGYNRNAKKCKEKFENIYKYHKRTKDGRSGKSNGKN 120

Query: 121 YRYFEQLQALDNHPLLPSQADSKEEFP----NKIVHNAIPCSIINPDSNFVETTTTLISM 180
           YRYFEQL+ALDNH LLPSQADS EE P    N +VHNAIPCS++NP +NFVETTTT +S 
Sbjct: 121 YRYFEQLEALDNHSLLPSQADSMEEIPRIIPNNVVHNAIPCSVVNPGANFVETTTTSLST 180

Query: 181 STTSCSSKESGGTRKKKKKRKLVDFFGRLMEEVIEKQEKLQKKFVEALEKCEEERLAREE 240
           STTS SSKESGGTRKKK  RK V+FF RLM EVIEKQEKLQKKFVEALEKCE ERLAREE
Sbjct: 181 STTSSSSKESGGTRKKK--RKFVEFFERLMNEVIEKQEKLQKKFVEALEKCEVERLAREE 240

Query: 241 EWKMQELARIKNERERFNHERSIAAAKDAAVLSFLKAFSEQVGTVQFPESLILMENLTGR 300
           EWKMQELARIK ERER N ERSIAAAKDAAVLSFLK FSEQ GTVQFPE+L+LMENLT +
Sbjct: 241 EWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKVFSEQGGTVQFPENLLLMENLTEK 300

Query: 301 QDHSNVDRNTSTRENGNDGNSNQISSSRWPKDEINALIQLKTNLQMKYQENGPKGPLWEE 360
           QD +N +RNTST+EN N+GNSNQISSSRWPK+EI+ALIQL+TNLQMKYQ+NGPKGPLWEE
Sbjct: 301 QDDANGERNTSTQENINNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQDNGPKGPLWEE 360

Query: 361 ISLSMKKLGYDRNAKRCKEKWENINKYFKRVKESNKKRPEDSKTCSYFQQLDALYKQKSK 420
           ISL+MKKLGYDRNAKRCKEKWENINKYFKRVKESNKKRPEDSKTC YFQQLDALYKQKSK
Sbjct: 361 ISLAMKKLGYDRNAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSK 420

Query: 421 KVIDN--HPNYELKPEELLMHMMGSQEENHQPESATDYGKAENADKNQEDDDDDDDDNND 480
           KVI+N  +PNYELKPEELLMHMMGSQEE HQPESATD G+AENAD   ++ +D+ ++  D
Sbjct: 421 KVINNPANPNYELKPEELLMHMMGSQEETHQPESATDDGEAENAD--NQNQEDEGEEGED 480

Query: 481 EDEYYYIVANNNSN 486
           EDE Y IVANNN+N
Sbjct: 481 EDEDYRIVANNNNN 490

BLAST of CmaCh20G010750 vs. NCBI nr
Match: gi|823207569|ref|XP_012437382.1| (PREDICTED: trihelix transcription factor GT-2-like [Gossypium raimondii])

HSP 1 Score: 515.0 bits (1325), Expect = 1.4e-142
Identity = 297/493 (60.24%), Positives = 369/493 (74.85%), Query Frame = 1

Query: 1   MREISPSPEISTAVVNRASEDDDVAASAGLEEEVDRNWAGNRWPREETMALLKVRSRMDS 60
           M E S  PE +T   N + E+++ A      EE + N++GNRWPR+ET+ALLK+RS MD 
Sbjct: 1   MMENSSFPESNTVGDNVSLENEEEAKVKN--EESEGNFSGNRWPRQETLALLKIRSEMDV 60

Query: 61  VFRDSSIKAPLWEEVSRKLAEFGYNRSAKKCKEKFENIYKYHKRTKDGRSSKPNGKNYRY 120
            FRDS +KAPLWEEVSRKLAE GYNR AKKCKEKFEN+YKYH+RTK+GRS K NGK+YR+
Sbjct: 61  AFRDSGVKAPLWEEVSRKLAELGYNRGAKKCKEKFENVYKYHRRTKEGRSGKSNGKSYRF 120

Query: 121 FEQLQALDNHPLL--PSQADSKEEF-PNKIVHNAIPCSIINPDSNFVETTTTLISMSTTS 180
           FEQL+ALD+HP L  P+  D      P  ++H+AIP S+ NP SNF ET+T     STTS
Sbjct: 121 FEQLEALDHHPSLVPPASGDINTSVEPLNVIHDAIPFSVRNPASNFNETST-----STTS 180

Query: 181 CSSKESGGTRKKKKKRKLVDFFGRLMEEVIEKQEKLQKKFVEALEKCEEERLAREEEWKM 240
            SSKES GTRKKK  RKL DFF RLM E++EKQE LQKKF+EA+EK E +R+AREE WK+
Sbjct: 181 SSSKESDGTRKKK--RKLTDFFERLMREMMEKQENLQKKFIEAIEKSELDRMAREEAWKV 240

Query: 241 QELARIKNERERFNHERSIAAAKDAAVLSFLKAFSEQVGTVQFPESLILMENLTGRQDHS 300
           QELAR+K ERE    ERSIAAAKDAAVL+FL+ FS+Q  +VQ P+    +E +  RQ++S
Sbjct: 241 QELARLKRERELLVQERSIAAAKDAAVLAFLQKFSDQTTSVQLPDISFPVEKVVDRQENS 300

Query: 301 NVDRNTSTRENGNDGNSNQISSSRWPKDEINALIQLKTNLQMKYQENGPKGPLWEEISLS 360
           N          G++   + +S+SRWPKDE+ ALI+L+TNL M+YQ+ GPKGPLWEEIS +
Sbjct: 301 N----------GSESYMH-LSTSRWPKDEVEALIRLRTNLDMQYQDAGPKGPLWEEISTA 360

Query: 361 MKKLGYDRNAKRCKEKWENINKYFKRVKESNKKRPEDSKTCSYFQQLDALYKQKSKKVID 420
           MKKLGYDR+AKRCKEKWEN+NKYFKRVKESNKKRPEDSKTC YF QLDALYK+K+K++  
Sbjct: 361 MKKLGYDRSAKRCKEKWENMNKYFKRVKESNKKRPEDSKTCPYFHQLDALYKEKTKRI-- 420

Query: 421 NHPNYELKPEELLMHMMGSQEENHQPESATDYGKAENADKNQEDDDDDDDDNNDEDEYYY 480
           +   YELKPEELLMHMMG+QEE    ESAT+  ++EN ++N+E      ++ N E + Y 
Sbjct: 421 DGSGYELKPEELLMHMMGAQEERLHQESATEDVESENVNQNRE------ENRNAEGDAYQ 465

Query: 481 IVANNNSNQVEVG 491
           IVAN+ S    +G
Sbjct: 481 IVANDPSPMPIIG 465

BLAST of CmaCh20G010750 vs. NCBI nr
Match: gi|590708292|ref|XP_007048236.1| (Duplicated homeodomain-like superfamily protein, putative [Theobroma cacao])

HSP 1 Score: 511.9 bits (1317), Expect = 1.2e-141
Identity = 297/494 (60.12%), Positives = 368/494 (74.49%), Query Frame = 1

Query: 1   MREISPSPEISTAVVNRASEDDDVAASAGLEEEVDRNWAGNRWPREETMALLKVRSRMDS 60
           M E S  PE +T   N + E+++        EE +RN+ GNRWPR+ET+ALLK+RS MD 
Sbjct: 1   MMENSGFPENNTVADNVSLENEEEVTVKN--EESERNFPGNRWPRQETLALLKIRSDMDV 60

Query: 61  VFRDSSIKAPLWEEVSRKLAEFGYNRSAKKCKEKFENIYKYHKRTKDGRSSKPNGKNYRY 120
            FRDS +KAPLWEEVSRKLAE GYNRSAKKCKEKFENIYKYH+RTK+GRS + NGKNYR+
Sbjct: 61  AFRDSGVKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHRRTKEGRSGRSNGKNYRF 120

Query: 121 FEQLQALDNHP-LLPSQAD--SKEEFPNKIVHNAIPCSIINPDSNFVETTTTLISMSTTS 180
           FEQL+ALD+HP LLP      +    P  ++ +AIPCSI NP  +F ET     S STTS
Sbjct: 121 FEQLEALDHHPSLLPPATGHINTSMQPFSVIRDAIPCSIRNPVLSFNET-----SASTTS 180

Query: 181 CSSKESGGTRKKKKKRKLVDFFGRLMEEVIEKQEKLQKKFVEALEKCEEERLAREEEWKM 240
            S KES G RKKK  RKL +FFGRLM EV+EKQE LQKKF+EA+EK E++R+AREE WKM
Sbjct: 181 SSGKESDGMRKKK--RKLTEFFGRLMREVMEKQENLQKKFIEAIEKSEQDRMAREEAWKM 240

Query: 241 QELARIKNERERFNHERSIAAAKDAAVLSFLKAFSEQVGTVQFPESLILMENLTGRQDHS 300
           QEL RIK ERE    ERSIAAAKDAAVL+FL+ FS+Q  +V+ PE+   +E +  RQ++S
Sbjct: 241 QELDRIKRERELLVQERSIAAAKDAAVLAFLQKFSDQATSVRLPETPFPVEKVVERQENS 300

Query: 301 NVDRNTSTRENGNDGNSNQISSSRWPKDEINALIQLKTNLQMKYQENGPKGPLWEEISLS 360
           N          G++   + +SSSRWPKDE+ ALI+L+ NL ++YQ+NGPKGPLWEEIS +
Sbjct: 301 N----------GSESYMH-LSSSRWPKDEVEALIRLRANLDLQYQDNGPKGPLWEEISTA 360

Query: 361 MKKLGYDRNAKRCKEKWENINKYFKRVKESNKKRPEDSKTCSYFQQLDALYKQKSKKVID 420
           MKKLGYDR+AKRCKEKWEN+NKYFKRVKESNKKRPEDSKTC YF QLDALYK+K+K+   
Sbjct: 361 MKKLGYDRSAKRCKEKWENMNKYFKRVKESNKKRPEDSKTCPYFHQLDALYKEKTKRGDG 420

Query: 421 N-HPNYELKPEELLMHMMGSQEENHQPESATDYGKAENADKNQEDDDDDDDDNNDEDEYY 480
           + +  YELKPEELLMHMM + +E    ES T+ G++ENAD+NQE++ + +++  D    Y
Sbjct: 421 SVNSGYELKPEELLMHMMSAPDERPHQESVTEDGESENADQNQEENGNAEEEEGDA---Y 471

Query: 481 YIVANNNSNQVEVG 491
            IVAN+ S    +G
Sbjct: 481 QIVANDPSPMAIIG 471

BLAST of CmaCh20G010750 vs. NCBI nr
Match: gi|225431601|ref|XP_002276933.1| (PREDICTED: trihelix transcription factor GT-2 [Vitis vinifera])

HSP 1 Score: 499.2 bits (1284), Expect = 8.2e-138
Identity = 301/530 (56.79%), Positives = 371/530 (70.00%), Query Frame = 1

Query: 1   MREISPSPEIS-TAVVNRASEDDDVAA-SAGLEEEV-------DRNWAGNRWPREETMAL 60
           M  IS  PE S TA   R  ED    A   G EEE        DRN+AGNRWPREET+AL
Sbjct: 1   MLGISDFPESSGTASGGREGEDGGGGAVPTGCEEEERVRGEESDRNFAGNRWPREETLAL 60

Query: 61  LKVRSRMDSVFRDSSIKAPLWEEVSRKLAEFGYNRSAKKCKEKFENIYKYHKRTKDGRSS 120
           LK+RS MD VFRDSS+KAPLWEEVSRKL E GY+R+AKKCKEKFENI+KYHKRTK+GRS+
Sbjct: 61  LKIRSDMDVVFRDSSLKAPLWEEVSRKLGELGYHRNAKKCKEKFENIFKYHKRTKEGRSN 120

Query: 121 KPNGKNYRYFEQLQALDNHPLLPSQADSKEEFPNKIVH-----------------NAIPC 180
           + NGKNYR+FEQL+ALDNHPL+P  +  K E    +                   NA+PC
Sbjct: 121 RQNGKNYRFFEQLEALDNHPLMPPPSPVKYETSTPMAASMPQTNPIDVTNVSQGINAVPC 180

Query: 181 SIINPDSNFVETTTTLISMSTTSCSSKESGGTRKKKKKRKLVDFFGRLMEEVIEKQEKLQ 240
           SI  P  + V  +T     STTS S KES G+RKKK+K  +  FF +LM+EVIEKQE LQ
Sbjct: 181 SIQKPAVDCVAAST-----STTSSSGKESEGSRKKKRKWGV--FFEKLMKEVIEKQENLQ 240

Query: 241 KKFVEALEKCEEERLAREEEWKMQELARIKNERERFNHERSIAAAKDAAVLSFLKAFSEQ 300
           +KF+EA+EKCE++R+AREE WK+QEL RIK E E    ERSIAAAKDAAVL+FL+  +EQ
Sbjct: 241 RKFIEAIEKCEQDRIAREEAWKLQELDRIKREHEILVQERSIAAAKDAAVLAFLQKIAEQ 300

Query: 301 VGTVQFPESLILMENLTGRQDHSNVDRNTSTRENGNDGNSNQISSSRWPKDEINALIQLK 360
            G VQ PE+    E +  +QD+SN +            NS Q+SSSRWPK E+ ALI+L+
Sbjct: 301 AGPVQLPENPS-SEKVFEKQDNSNGE------------NSIQMSSSRWPKAEVEALIRLR 360

Query: 361 TNLQMKYQENGPKGPLWEEISLSMKKLGYDRNAKRCKEKWENINKYFKRVKESNKKRPED 420
           TN  M+YQE+GPKGPLWEEISL+M+K+GY+R+AKRCKEKWENINKYFKRV++SNK+RPED
Sbjct: 361 TNFDMQYQESGPKGPLWEEISLAMRKIGYERSAKRCKEKWENINKYFKRVRDSNKRRPED 420

Query: 421 SKTCSYFQQLDALYKQKSKKV--IDNHPNYELKPEELLMHMMGSQEENHQPESATDYGKA 480
           SKTC YF QLDALYK+K+KKV   DN+  Y LKPE++LM MMG  E+  Q ES T+ G +
Sbjct: 421 SKTCPYFHQLDALYKEKTKKVENPDNNSGYNLKPEDILMQMMGQSEQRPQSESVTEEGGS 480

Query: 481 ENADKNQEDDDDD------------DDDNNDEDEYYYIVANNNSNQVEVG 491
           EN + NQE+++++            D D +DE + Y IVANN S+   +G
Sbjct: 481 ENVNANQEEEEEEEEEEEDGDEEGGDGDEDDEADGYQIVANNTSSMAIMG 510

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
TGT2_ARATH   4.0e-52   40.41         Trihelix transcription factor GT-2 OS=Arabidopsis thaliana GN=GT-2 PE=2 SV=1
PTL_ARATH    1.3e-45   34.05         Trihelix transcription factor PTL OS=Arabidopsis thaliana GN=PTL PE=2 SV=1
GTL1_ARATH   2.8e-45   39.80         Trihelix transcription factor GTL1 OS=Arabidopsis thaliana GN=GTL1 PE=1 SV=2
GTL2_ARATH   3.8e-18   69.35         Trihelix transcription factor GTL2 OS=Arabidopsis thaliana GN=At5g28300 PE=2 SV=...
TGT4_ARATH   7.2e-09   28.68         Trihelix transcription factor GT-4 OS=Arabidopsis thaliana GN=GT-4 PE=2 SV=1
Match Name        E-value   Identity (%)  Description
A0A0A0LK12_CUCSA  2.6e-207  80.16         Uncharacterized protein OS=Cucumis sativus GN=Csa_2G301510 PE=4 SV=1
A0A0D2SY59_GOSRA  1.0e-142  60.24         Uncharacterized protein OS=Gossypium raimondii GN=B456_008G099700 PE=4 SV=1
A0A061DR08_THECC  8.5e-142  60.12         Duplicated homeodomain-like superfamily protein, putative OS=Theobroma cacao GN=...
F6I0I8_VITVI      5.7e-138  56.79         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_04s0044g00510 PE=4 SV=...
W9RGP4_9ROSA      7.7e-135  59.24         Trihelix transcription factor GT-2 OS=Morus notabilis GN=L484_012188 PE=4 SV=1
Match Name   E-value   Identity (%)  Description
AT1G76880.1  8.9e-74   36.67         Duplicated homeodomain-like superfamily protein
AT1G76890.2  2.3e-53   40.41         Duplicated homeodomain-like superfamily protein
AT5G03680.1  7.1e-47   34.05         Duplicated homeodomain-like superfamily protein
AT1G33240.1  1.6e-46   39.80         GT-2-like 1
AT3G10000.1  5.3e-34   34.31         Homeodomain-like superfamily protein
Match Name                        E-value   Identity (%)  Description
gi|659121978|ref|XP_008460913.1|  2.6e-208  80.04         PREDICTED: trihelix transcription factor GT-2-like [Cucumis melo]
gi|778670187|ref|XP_004147355.2|  3.7e-207  80.16         PREDICTED: trihelix transcription factor GT-2-like [Cucumis sativus]
gi|823207569|ref|XP_012437382.1|  1.4e-142  60.24         PREDICTED: trihelix transcription factor GT-2-like [Gossypium raimondii]
gi|590708292|ref|XP_007048236.1|  1.2e-141  60.12         Duplicated homeodomain-like superfamily protein, putative [Theobroma cacao]
gi|225431601|ref|XP_002276933.1|  8.2e-138  56.79         PREDICTED: trihelix transcription factor GT-2 [Vitis vinifera]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR001005   SANT/Myb
IPR009057   Homeobox-like_sf
IPR017877   Myb-like_dom
Vocabulary: Molecular Function
Term        Definition
GO:0003677  DNA binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003677 DNA binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh20G010750.1  CmaCh20G010750.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description    Source   Source Term        Source Description   Alignment
IPR001005  SANT/Myb domain    SMART    SM00717            sant                 coord: 318..380, score: 0.025
                                                                               coord: 39..101, score: 0.
IPR009057  Homeodomain-like   GENE3D   G3DSA:1.10.10.60                        coord: 315..377, score: 5.
IPR017877  Myb-like domain    PROFILE  PS50090            MYB_LIKE             coord: 41..99, score: 7.491
                                                                               coord: 314..378, score: 7
None       No IPR available   unknown  Coil               Coil                 coord: 203..227, scor
None       No IPR available   PANTHER  PTHR21654          FAMILY NOT NAMED     coord: 32..481, score: 5.5E
None       No IPR available   PANTHER  PTHR21654:SF14     SUBFAMILY NOT NAMED  coord: 32..481, score: 5.5E
None       No IPR available   PFAM     PF13837            Myb_DNA-bind_4       coord: 41..127, score: 1.9E-18
                                                                               coord: 320..407, score: 1.5

The following gene(s) are paralogous to this gene:
Gene            Paralogue       Organism                 Block
CmaCh20G010750  CmaCh02G006130  Cucurbita maxima (Rimu)  cmacmaB480
The following block(s) are covering this gene:
Gene            Organism                    Block
CmaCh20G010750  Cucurbita maxima (Rimu)     cmacmaB484
CmaCh20G010750  Cucurbita pepo (Zucchini)   cmacpeB565
CmaCh20G010750  Cucumber (Gy14) v2          cgybcmaB794
CmaCh20G010750  Cucumber (Chinese Long) v3  cmacucB0670
CmaCh20G010750  Watermelon (97103) v2       cmawmbB561
CmaCh20G010750  Wax gourd                   cmawgoB0701