CmaCh04G005650 (gene) Cucurbita maxima (Rimu)

Name: CmaCh04G005650
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Bromodomain-containing factor 1
Location: Cma_Chr04 : 2886130 .. 2889124 (+)
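
The location string can be split into its parts and turned into a span length. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual GFF-style convention), which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Split 'seqid : start .. end (strand)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return end - start + 1

seqid, start, end, strand = parse_location("Cma_Chr04 : 2886130 .. 2889124 (+)")
print(seqid, strand, span_length(start, end))  # prints: Cma_Chr04 + 2995
```

Under that assumption the feature spans 2995 bp on the forward strand of Cma_Chr04.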
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exon, five_prime_UTR, CDS, three_prime_UTR
CAAATTCAGAATTACTTTTTAAATCGAGAAAAGAAAATGCATTAGACGAAACATGTTGCAATCCCACTCGCGTTTTACGCGCGTGAAGCTCACCCTCGGCCACCACTTCTGCGTTTCACAGTGAACAACGGAAGTCCGGATTCCGAACTAACCAGAAAAAACGATGAGCTTCATCAATTCATCCGACAACTTTCCGAATCCATCATCAATCCGCTGTACGATTTCTTGGATTTCTTCTCCTTGTGTATCGGAAACCGTTTTATTTTTTTTTCTCGTTCCTGTGATCGTCAAGCTTAAACCCTACGTTTTGTTGTTTTGGATTTCTGTTCTACATCCTTCAGCTTCATCGATCGTGCATTAAGCTTGTAAGAACTCATGATTTTTATGGAATTTTTTCTAGTATCATTTGGTATCCTTCTCTTTTGACAGTATCGGCGTTTTGTTATTGTAGCAGATTACATGTATGGCTTCAGTGCTGCGAGGTGGTGGTGAACCGGGAGGGAATCCCAGGAGGACGGATAATAACAAGTTTAGTACCGGAAACCAACGGAAAGAGAGTAAAATCCCCAAAAGTGCTCGAAATTCCGTACAGATTCCTGCCATCGCGGCCGCCAATGGCGGTGGTGATCCGTCTTCTCCTTCTCATTATCCGATCGATGCTTTAGTCACGTCTAGGGATTCTTCAGGTCAAAATCGCTACATCGAACAGGTTAATGCTGATGAAGTGCCGGGATATACGCGGTTTGAGAACCGGGTTAGGATTTGCTTGAATTCCAGATCAAGATCTGAGATTAAAGAGCTTACGACGAAGCTTAAGGGCGAGCTCGGTCACGTCAGGAGCCTCGTGAAGAAATTTGAATCTCAAGAACTGCAGATCAGTGGTTATGGTGGTGATGTTGGACATAGTCAGTCGCAGTTTTCTGCTAATAATTTGGTAGAGAAGGTTGGTAACACATGGAAGAATGATTCTGTGGTGGGTTCCGCCGATGTGCCCGCTTCTCGACTTGTTCAAAGCGTTTCGGTGGCCGAAAACTTCGGGGAGTTTGCAGAGAAAGAGATGAACAAGCATAAAAATTCAAGATATAATCCTAAGACGGAGTTCCCGGTATCGGATTGCGATTCGAATAGAGGTAAGATTGATCCATTGCTTAAAAGCTGTAACAATTTGCTGGAAAGATTGATGAAACATAAACATGGTTGGGTGTTTAATGTGCCTGTTGATGCCAAGCGTTTGGGGCTTCATGATTATCACAAGATCATAACGAAGCCAATGGACTTAGGTACTGTAAAGATGAGATTAAACAAGAACTGGTACAAGTCACCAAGAGAGTTTGCTGAGGATGTGAGACTCACATTTAGTAATGCCATTACATATAACCCGAAAGGGGAGGACGTTCATATAATGGCTGAGCAGTTATCGAACGTATTCGAAGAGAAATGGAGGATTATTGAAGCCAAACAAGATGTTGGTAAGGATGATGGATCCAGAAAATCTCCAGCTCTTGCCACACCGCCAGTGGAATCAAGAACTTTCAGTAAATCGAAGTCTACGACGAAGCCTCCGCCTGCAAATAGGGAGAGTTTAGGTAAGTCGGATTCGATAACAAAGCCTGCAAACGTTCCTGATAAGAAACCAAATGCTAAGAATCATGGAAACAGAGATATGAATTATGAAGAAAAGCAGAAACTGAGCATTGATCTTCAGGATTTACCATCAGATGAGCTGAATAATGTTGTGAAGATCATTAAAAAGAGGAACCAGGGACTCTTCCAAAACGATGATGAAATTGAGTTGGATATTGGTAGTGTTGACTCCAAAACCCTCTGGGAACTCGAGAGGTTTGTGGCTGATTACAAAAACAGCTTGATCACGAACAAGAGAAAAGTCGACGCCGATCTTCAATCATCGCGCTACTCGACCAAGGATATGGTAAGATTTGAATCAATCTGCTACAGTTTCAAGTTCCTTTCTTAGGGTGAAACAAACTATATTGTAACGA
TGCAGGATCGAGCTGTGGATGATGCAGGAGGAGGACCTGTAGGCGGCAATGCAGGTAAAATTGTTGAAATCTTTGGTGAATGCTTCAGTCTATAACTTTTGGTAACTTAGAAATGGAATGTTATAAAAGCATGTTTCCACCATTGAAGCAACCTCTTGATACCTAGCAGTAGCAGTAACTTGGAGATGTTAATGGTTATATGATCTATGTTTCTACAGACTCTGAAGGTGAGGGCGACAGTTCCTCGACGTGTGGAGATGCGAATCAGTCTCCCTAGGGTTGAATCGACTAAAACAAGGTACGAGTTCAAAGTCTAGTACTGAATGTTCTTGTTGGCTTGGAAGCATCACATATGAAAAAAAGATTTGACTACCATTTGGTGTGATATTGATGAACATGGGAAGTTTGTGTTATCCTCCTCTCGATAGCTTCGTCTGTTGATATTTAAAAACTCGGAGACCGAAATTGAGAAGGCTGGAAAAGTCTAGGAATTCGGCAATTGTTTTCAGGAGCGTCTTCATCTTTCCCAATTTTACGATAAGCAAGTTGAAACTTGGAAGTTGATTGTTCTATTATCCTGAAGTAATCCTATCTGAATAATCATATAATTTGATCAGAATCTCCAAACCAATTTTCATTTATGTTTCCCATTGATGCTCAAGTAACAAGTCGAATTCGTTCCTCTGCAACAGGATCGTATGGTTTGGTTGCATCACGGAGCAATCTCCATCTCCCATGGTGATTTGGAGGGAAGAGATCAAAAGTTGTATTGATAAATTCAGATGATATTCATGATTCCTTGGCATTTTATGAAATCAGAAATTTCCCAATTTCTTATTTTGATAATTGTATATTTTTGTGTAATAGAACTTAGAAGGTAGCATAAGTTAATTTCTTTCTAGGATGGATTGATTGATCTTTCCATTTTCTATAGCAGCTGTATATACTAAAGGATCTACTTTGGAATATGAATTTAATCTTTTCTTTGTCAGCAA

mRNA sequence

CAAATTCAGAATTACTTTTTAAATCGAGAAAAGAAAATGCATTAGACGAAACATGTTGCAATCCCACTCGCGTTTTACGCGCGTGAAGCTCACCCTCGGCCACCACTTCTGCGTTTCACAGTGAACAACGGAAGTCCGGATTCCGAACTAACCAGAAAAAACGATGAGCTTCATCAATTCATCCGACAACTTTCCGAATCCATCATCAATCCGCTGTACGATTTCTTGGATTTCTTCTCCTTGTGTATCGGAAACCGTTTTATTTTTTTTTCTCGTTCCTGTGATCGTCAAGCTTAAACCCTACGTTTTGTTGTTTTGGATTTCTGTTCTACATCCTTCAGCTTCATCGATCGTGCATTAAGCTTATTACATGTATGGCTTCAGTGCTGCGAGGTGGTGGTGAACCGGGAGGGAATCCCAGGAGGACGGATAATAACAAGTTTAGTACCGGAAACCAACGGAAAGAGAGTAAAATCCCCAAAAGTGCTCGAAATTCCGTACAGATTCCTGCCATCGCGGCCGCCAATGGCGGTGGTGATCCGTCTTCTCCTTCTCATTATCCGATCGATGCTTTAGTCACGTCTAGGGATTCTTCAGGTCAAAATCGCTACATCGAACAGGTTAATGCTGATGAAGTGCCGGGATATACGCGGTTTGAGAACCGGGTTAGGATTTGCTTGAATTCCAGATCAAGATCTGAGATTAAAGAGCTTACGACGAAGCTTAAGGGCGAGCTCGGTCACGTCAGGAGCCTCGTGAAGAAATTTGAATCTCAAGAACTGCAGATCAGTGGTTATGGTGGTGATGTTGGACATAGTCAGTCGCAGTTTTCTGCTAATAATTTGGTAGAGAAGGTTGGTAACACATGGAAGAATGATTCTGTGGTGGGTTCCGCCGATGTGCCCGCTTCTCGACTTGTTCAAAGCGTTTCGGTGGCCGAAAACTTCGGGGAGTTTGCAGAGAAAGAGATGAACAAGCATAAAAATTCAAGATATAATCCTAAGACGGAGTTCCCGGTATCGGATTGCGATTCGAATAGAGGTAAGATTGATCCATTGCTTAAAAGCTGTAACAATTTGCTGGAAAGATTGATGAAACATAAACATGGTTGGGTGTTTAATGTGCCTGTTGATGCCAAGCGTTTGGGGCTTCATGATTATCACAAGATCATAACGAAGCCAATGGACTTAGGTACTGTAAAGATGAGATTAAACAAGAACTGGTACAAGTCACCAAGAGAGTTTGCTGAGGATGTGAGACTCACATTTAGTAATGCCATTACATATAACCCGAAAGGGGAGGACGTTCATATAATGGCTGAGCAGTTATCGAACGTATTCGAAGAGAAATGGAGGATTATTGAAGCCAAACAAGATGTTGGTAAGGATGATGGATCCAGAAAATCTCCAGCTCTTGCCACACCGCCAGTGGAATCAAGAACTTTCAGTAAATCGAAGTCTACGACGAAGCCTCCGCCTGCAAATAGGGAGAGTTTAGGTAAGTCGGATTCGATAACAAAGCCTGCAAACGTTCCTGATAAGAAACCAAATGCTAAGAATCATGGAAACAGAGATATGAATTATGAAGAAAAGCAGAAACTGAGCATTGATCTTCAGGATTTACCATCAGATGAGCTGAATAATGTTGTGAAGATCATTAAAAAGAGGAACCAGGGACTCTTCCAAAACGATGATGAAATTGAGTTGGATATTGGTAGTGTTGACTCCAAAACCCTCTGGGAACTCGAGAGGTTTGTGGCTGATTACAAAAACAGCTTGATCACGAACAAGAGAAAAGTCGACGCCGATCTTCAATCATCGCGCTACTCGACCAAGGATATGGATCGAGCTGTGGATGATGCAGGAGGAGGACCTGTAGGCGGCAATGCAGACTCTGAAGGTGAGGGCGACAGTTCCTCGACGTGTGGAGATGCGAATCAGTCTCCCTAGGGTTGAATCGACTAAAACAAGGATCGTATGGTTTGGTTGCATCACGGAGCA
ATCTCCATCTCCCATGGTGATTTGGAGGGAAGAGATCAAAAGTTGTATTGATAAATTCAGATGATATTCATGATTCCTTGGCATTTTATGAAATCAGAAATTTCCCAATTTCTTATTTTGATAATTGTATATTTTTGTGTAATAGAACTTAGAAGGTAGCATAAGTTAATTTCTTTCTAGGATGGATTGATTGATCTTTCCATTTTCTATAGCAGCTGTATATACTAAAGGATCTACTTTGGAATATGAATTTAATCTTTTCTTTGTCAGCAA

Coding sequence (CDS)

ATGGCTTCAGTGCTGCGAGGTGGTGGTGAACCGGGAGGGAATCCCAGGAGGACGGATAATAACAAGTTTAGTACCGGAAACCAACGGAAAGAGAGTAAAATCCCCAAAAGTGCTCGAAATTCCGTACAGATTCCTGCCATCGCGGCCGCCAATGGCGGTGGTGATCCGTCTTCTCCTTCTCATTATCCGATCGATGCTTTAGTCACGTCTAGGGATTCTTCAGGTCAAAATCGCTACATCGAACAGGTTAATGCTGATGAAGTGCCGGGATATACGCGGTTTGAGAACCGGGTTAGGATTTGCTTGAATTCCAGATCAAGATCTGAGATTAAAGAGCTTACGACGAAGCTTAAGGGCGAGCTCGGTCACGTCAGGAGCCTCGTGAAGAAATTTGAATCTCAAGAACTGCAGATCAGTGGTTATGGTGGTGATGTTGGACATAGTCAGTCGCAGTTTTCTGCTAATAATTTGGTAGAGAAGGTTGGTAACACATGGAAGAATGATTCTGTGGTGGGTTCCGCCGATGTGCCCGCTTCTCGACTTGTTCAAAGCGTTTCGGTGGCCGAAAACTTCGGGGAGTTTGCAGAGAAAGAGATGAACAAGCATAAAAATTCAAGATATAATCCTAAGACGGAGTTCCCGGTATCGGATTGCGATTCGAATAGAGGTAAGATTGATCCATTGCTTAAAAGCTGTAACAATTTGCTGGAAAGATTGATGAAACATAAACATGGTTGGGTGTTTAATGTGCCTGTTGATGCCAAGCGTTTGGGGCTTCATGATTATCACAAGATCATAACGAAGCCAATGGACTTAGGTACTGTAAAGATGAGATTAAACAAGAACTGGTACAAGTCACCAAGAGAGTTTGCTGAGGATGTGAGACTCACATTTAGTAATGCCATTACATATAACCCGAAAGGGGAGGACGTTCATATAATGGCTGAGCAGTTATCGAACGTATTCGAAGAGAAATGGAGGATTATTGAAGCCAAACAAGATGTTGGTAAGGATGATGGATCCAGAAAATCTCCAGCTCTTGCCACACCGCCAGTGGAATCAAGAACTTTCAGTAAATCGAAGTCTACGACGAAGCCTCCGCCTGCAAATAGGGAGAGTTTAGGTAAGTCGGATTCGATAACAAAGCCTGCAAACGTTCCTGATAAGAAACCAAATGCTAAGAATCATGGAAACAGAGATATGAATTATGAAGAAAAGCAGAAACTGAGCATTGATCTTCAGGATTTACCATCAGATGAGCTGAATAATGTTGTGAAGATCATTAAAAAGAGGAACCAGGGACTCTTCCAAAACGATGATGAAATTGAGTTGGATATTGGTAGTGTTGACTCCAAAACCCTCTGGGAACTCGAGAGGTTTGTGGCTGATTACAAAAACAGCTTGATCACGAACAAGAGAAAAGTCGACGCCGATCTTCAATCATCGCGCTACTCGACCAAGGATATGGATCGAGCTGTGGATGATGCAGGAGGAGGACCTGTAGGCGGCAATGCAGACTCTGAAGGTGAGGGCGACAGTTCCTCGACGTGTGGAGATGCGAATCAGTCTCCCTAG

Protein sequence

MASVLRGGGEPGGNPRRTDNNKFSTGNQRKESKIPKSARNSVQIPAIAAANGGGDPSSPSHYPIDALVTSRDSSGQNRYIEQVNADEVPGYTRFENRVRICLNSRSRSEIKELTTKLKGELGHVRSLVKKFESQELQISGYGGDVGHSQSQFSANNLVEKVGNTWKNDSVVGSADVPASRLVQSVSVAENFGEFAEKEMNKHKNSRYNPKTEFPVSDCDSNRGKIDPLLKSCNNLLERLMKHKHGWVFNVPVDAKRLGLHDYHKIITKPMDLGTVKMRLNKNWYKSPREFAEDVRLTFSNAITYNPKGEDVHIMAEQLSNVFEEKWRIIEAKQDVGKDDGSRKSPALATPPVESRTFSKSKSTTKPPPANRESLGKSDSITKPANVPDKKPNAKNHGNRDMNYEEKQKLSIDLQDLPSDELNNVVKIIKKRNQGLFQNDDEIELDIGSVDSKTLWELERFVADYKNSLITNKRKVDADLQSSRYSTKDMDRAVDDAGGGPVGGNADSEGEGDSSSTCGDANQSP
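
The relationship between the coding sequence and the protein sequence above can be checked by translating codons. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the `translate` helper is hypothetical and only the first 33 and last 15 nucleotides of the CDS are used as inputs.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds, stop_at_stop=True):
    """Translate a DNA coding sequence read in frame from its first base."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*" and stop_at_stop:
            break  # stop codon: the protein ends here
        protein.append(aa)
    return "".join(protein)

# First 11 codons and last 5 codons of the CDS listed above.
print(translate("ATGGCTTCAGTGCTGCGAGGTGGTGGTGAACCG"))  # MASVLRGGGEP
print(translate("AATCAGTCTCCCTAG"))                    # NQSP
```

Both fragments agree with the start (MASVLRGGGEP) and end (NQSP) of the protein sequence, with the final TAG codon acting as the stop.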
BLAST of CmaCh04G005650 vs. Swiss-Prot
Match: GTE4_ARATH (Transcription factor GTE4 OS=Arabidopsis thaliana GN=GTE4 PE=2 SV=1)

HSP 1 Score: 262.3 bits (669), Expect = 1.1e-68
Identity = 188/480 (39.17%), Positives = 255/480 (53.12%), Query Frame = 1

Query: 85  ADEVPGYTRFENRVRICLNSRSRSEIKELTTKLKGELGHVRSLVKKFESQELQISGYGGD 144
           A  +P     + R+RI + S ++ + +E+  KL+ +L  VR +VKK E +E +I  Y   
Sbjct: 255 AGSMPMEEDADGRIRIHVASTTKQQKEEIRKKLEDQLNVVRGMVKKIEDKEGEIGAY--- 314

Query: 145 VGHSQSQFSANNLVEKVGNTWKNDSVVGSADVP-----ASRLVQ--SVSVAEN---FGEF 204
              + S+   N  +   G   +  S   SA +P     A R V   S+SV EN     E 
Sbjct: 315 ---NDSRVLINTGINNGGG--RILSGFASAGLPREVIRAPRPVNQLSISVLENTQGVNEH 374

Query: 205 AEKEMNKHKNSRYNPKTEFPVSD----CDSNR-----------------GKIDPLLKSCN 264
            EKE    K +++   +EF + D     +SN+                 G    + K+C+
Sbjct: 375 VEKEKRTPKANQFYRNSEFLLGDKLPPAESNKKSKSSSKKQGGDVGHGFGAGTKVFKNCS 434

Query: 265 NLLERLMKHKHGWVFNVPVDAKRLGLHDYHKIITKPMDLGTVKMRLNKNWYKSPREFAED 324
            LLERLMKHKHGWVFN PVD K LGL DY+ II  PMDLGT+K  L KN YKSPREFAED
Sbjct: 435 ALLERLMKHKHGWVFNAPVDVKGLGLLDYYTIIEHPMDLGTIKSALMKNLYKSPREFAED 494

Query: 325 VRLTFSNAITYNPKGEDVHIMAEQLSNVFEEKWRIIEAKQDVGKDDGSRKSPALATPPVE 384
           VRLTF NA+TYNP+G+DVH+MA  L  +FEE+W +IEA  +      +     L TP + 
Sbjct: 495 VRLTFHNAMTYNPEGQDVHLMAVTLLQIFEERWAVIEADYNREMRFVTGYEMNLPTPTMR 554

Query: 385 SRTFSKSKSTTKPPPAN-RESLGKSD-----SITKPANVPD-----------KKPNAKNH 444
           SR       T  PPP N R ++ ++D       T P   P            KKP A   
Sbjct: 555 SRL----GPTMPPPPINVRNTIDRADWSNRQPTTTPGRTPTSATPSGRTPALKKPKANEP 614

Query: 445 GNRDMNYEEKQKLSIDLQDLPSDELNNVVKIIKKRNQGLFQNDDEIELDIGSVDSKTLWE 504
             RDM YEEKQKLS  LQ+LP D+L+ +V+I+ KRN  +   D+EIE+DI SVD +TLWE
Sbjct: 615 NKRDMTYEEKQKLSGHLQNLPPDKLDAIVQIVNKRNTAVKLRDEEIEVDIDSVDPETLWE 674

Query: 505 LERFVADYKNSLITNKRKVDADLQSSRYSTKDMDRAVDDAGGGPVGGNADSEGEGDSSST 517
           L+RFV +YK  L   KRK +  +Q+   + ++  + +  A   P       EG   +  T
Sbjct: 675 LDRFVTNYKKGLSKKKRKAELAIQARAEAERNSQQQMAPA---PAAHEFSREGGNTAKKT 719
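
The percentages in each HSP header are the identical and positive (conservatively substituted) column counts divided by the alignment length. A small sketch of how such a header line could be parsed and re-checked follows; `parse_hsp_stats` and `percentages_consistent` are hypothetical helper names, and the regular expression assumes the `count/length (percent%)` layout used on this page.

```python
import re

def parse_hsp_stats(line):
    """Extract identity and positives counts from a BLAST HSP summary line."""
    pairs = re.findall(r"(\d+)/(\d+) \(([\d.]+)%\)", line)
    if len(pairs) != 2:
        raise ValueError(f"expected two count/length pairs in: {line!r}")
    (ident, length, ident_pct), (pos, _, pos_pct) = pairs
    return {
        "identities": int(ident),
        "positives": int(pos),
        "align_length": int(length),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }

def percentages_consistent(stats, tol=0.01):
    """Recompute both percentages and compare with the reported values."""
    n = stats["align_length"]
    ident = 100.0 * stats["identities"] / n
    pos = 100.0 * stats["positives"] / n
    return (abs(ident - stats["identity_pct"]) < tol
            and abs(pos - stats["positives_pct"]) < tol)

line = "Identity = 188/480 (39.17%), Positives = 255/480 (53.12%), Query Frame = 1"
stats = parse_hsp_stats(line)
print(stats["identities"], stats["align_length"], percentages_consistent(stats))
```

For the GTE4_ARATH hit, 188/480 and 255/480 reproduce the reported 39.17% and 53.12% to within rounding.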

BLAST of CmaCh04G005650 vs. Swiss-Prot
Match: GTE3_ARATH (Transcription factor GTE3, chloroplastic OS=Arabidopsis thaliana GN=GTE3 PE=1 SV=1)

HSP 1 Score: 238.4 bits (607), Expect = 1.7e-61
Identity = 140/273 (51.28%), Positives = 175/273 (64.10%), Query Frame = 1

Query: 228 LLKSCNNLLERLMKHKHGWVFNVPVDAKRLGLHDYHKIITKPMDLGTVKMRLNKNWYKSP 287
           +LKSCNNLL +LMKHK GW+FN PVD   LGLHDYH II +PMDLGTVK RL+K+ YKSP
Sbjct: 119 ILKSCNNLLTKLMKHKSGWIFNTPVDVVTLGLHDYHNIIKEPMDLGTVKTRLSKSLYKSP 178

Query: 288 REFAEDVRLTFSNAITYNPKGEDVHIMAEQLSNVFEEKWRIIEA-------KQDVGKDDG 347
            EFAEDVRLTF+NA+ YNP G DV+ MAE L N+FEEKW  +E        KQ   +D  
Sbjct: 179 LEFAEDVRLTFNNAMLYNPVGHDVYHMAEILLNLFEEKWVPLETQYELLIRKQQPVRDID 238

Query: 348 SRKSPALATPPVESRTFSKSKSTTKPPP----ANRESLGKSDSITKPAN------VPDKK 407
                +  T  VE+        +  PPP        +L +++S+T P        VP+K 
Sbjct: 239 FHAPVSTNTHNVEALPLPAPTPSLSPPPPPKVVENRTLERAESMTNPVKPAVLPVVPEKL 298

Query: 408 PNAKNHGNRDMNYEEKQKLSIDLQDLPSDELNNVVKIIKKRNQGLFQNDDEIELDIGSVD 467
               +  NRD+ ++EK++LS DLQDLP D+L  VV+IIKKR   L Q DDEIELDI S+D
Sbjct: 299 VEEAS-ANRDLTFDEKRQLSEDLQDLPYDKLEAVVQIIKKRTPELSQQDDEIELDIDSLD 358

Query: 468 SKTLWELERFVADYKNSLITNKRKVDADLQSSR 484
            +TLWEL RFV +YK SL  +K+K +  L S R
Sbjct: 359 LETLWELFRFVTEYKESL--SKKKEEQGLDSER 388

BLAST of CmaCh04G005650 vs. Swiss-Prot
Match: GTE5_ARATH (Transcription factor GTE5, chloroplastic OS=Arabidopsis thaliana GN=GTE5 PE=1 SV=1)

HSP 1 Score: 218.8 bits (556), Expect = 1.4e-55
Identity = 131/266 (49.25%), Positives = 165/266 (62.03%), Query Frame = 1

Query: 228 LLKSCNNLLERLMKHKHGWVFNVPVDAKRLGLHDYHKIITKPMDLGTVKMRLNKNWYKSP 287
           + K+CN+LL +LMKHK  WVFNVPVDAK LGLHDYH I+ +PMDLGTVK +L K+ YKSP
Sbjct: 132 IFKNCNSLLTKLMKHKSAWVFNVPVDAKGLGLHDYHNIVKEPMDLGTVKTKLGKSLYKSP 191

Query: 288 REFAEDVRLTFSNAITYNPKGEDVHIMAEQLSNVFEEKWRIIEAKQD--------VGKDD 347
            +FAEDVRLTF+NAI YNP G DV+  AE L N+FE+KW  IE + D            +
Sbjct: 192 LDFAEDVRLTFNNAILYNPIGHDVYRFAELLLNMFEDKWVSIEMQYDNLHRKFKPTRDIE 251

Query: 348 GSRKSPALA--TPPVESRTFSKSKSTTKPPP--------ANRESLGKSDSIT---KPANV 407
               +P++A    P+ +   S S S+  PPP            +  + +S+T   +P  V
Sbjct: 252 FPAPAPSIAPIVEPLPAIVPSPSPSSPPPPPPPPVAAPVLENRTWEREESMTIPVEPEAV 311

Query: 408 PDKKPNAKNH----GNRDMNYEEKQKLSIDLQDLPSDELNNVVKIIKKRNQGLFQNDDEI 467
                 A+       NRD+  EEK++LS +LQDLP D+L  VV+IIKK N  L Q DDEI
Sbjct: 312 ITAPEKAEEEEAPVNNRDLTLEEKRRLSEELQDLPYDKLETVVQIIKKSNPELSQKDDEI 371

Query: 468 ELDIGSVDSKTLWELERFVADYKNSL 469
           ELDI S+D  TLWEL RFV  YK SL
Sbjct: 372 ELDIDSLDINTLWELYRFVTGYKESL 397

BLAST of CmaCh04G005650 vs. Swiss-Prot
Match: GTE10_ARATH (Transcription factor GTE10 OS=Arabidopsis thaliana GN=GTE10 PE=1 SV=2)

HSP 1 Score: 144.8 bits (364), Expect = 2.6e-33
Identity = 123/378 (32.54%), Positives = 169/378 (44.71%), Query Frame = 1

Query: 100 ICLNSRSRSEIKELTTKLKGELGHVRSLVKK---FESQELQISGYGGDVGHSQSQFSAN- 159
           + L+  SRSE K L  KLK EL  VR L KK   F S  + +S Y     HS S      
Sbjct: 61  LSLSKMSRSERKNLVHKLKMELQQVRDLSKKIASFSSDTVLLSPYND---HSCSDGPRRP 120

Query: 160 ---NLVEKVGNTWKNDSVVGSADVPASRLVQSVSVAENFGEFAEKEMNKHKNSRYNPKTE 219
              N    VG+  K    V S                      +K+ NK   SR N  T 
Sbjct: 121 PPENFATFVGSQGKKRPPVRS----------------------DKQRNKKGPSRLNVPTS 180

Query: 220 FPVSDCDSNRGKIDPLLKSCNNLLERLMKHKHGWVFNVPVDAKRLGLHDYHKIITKPMDL 279
           + V+           ++K C  LL RL  HK GW F  PVD   L + DY  +I  PMDL
Sbjct: 181 YTVAS----------VMKECETLLNRLWSHKSGWPFRTPVDPVMLNIPDYFNVIKHPMDL 240

Query: 280 GTVKMRLNKNWYKSPREFAEDVRLTFSNAITYNPKGEDVHIMAEQLSNVFEEKWRIIEAK 339
           GT++ RL K  Y SP +FA DVRLTFSN+I YNP G   H MA+ +S  FE  W+ IE K
Sbjct: 241 GTIRSRLCKGEYSSPLDFAADVRLTFSNSIAYNPPGNQFHTMAQGISKYFESGWKSIEKK 300

Query: 340 QDVGKDDGSRKSPALATPPVESRTFSKSKSTTKP---PPANRESLGKSDS--ITKPANVP 399
             + K            PPV   T S S  +  P    P  ++    +D+    +PA + 
Sbjct: 301 IPMSK------------PPVIPLTSSASLESEIPFEVAPMRKKEAAMNDNKLRVEPAKLV 360

Query: 400 DKKPNAKNHGNRDMNYEEKQKLSIDLQDLPSDELNNVVKIIKKRNQGLFQN-DDEIELDI 459
                        M   EK+KL  DL  L  D    +  ++++++    Q+ + EIE+DI
Sbjct: 361 -------------MTDGEKKKLGQDLMALEEDFPQKIADLLREQSGSDGQSGEGEIEIDI 378

Query: 460 GSVDSKTLWELERFVADY 465
            ++  + L+ + + + DY
Sbjct: 421 EALSDEILFMVRKLLDDY 378

BLAST of CmaCh04G005650 vs. Swiss-Prot
Match: GTE8_ARATH (Transcription factor GTE8 OS=Arabidopsis thaliana GN=GTE8 PE=2 SV=2)

HSP 1 Score: 142.5 bits (358), Expect = 1.3e-32
Identity = 149/500 (29.80%), Positives = 220/500 (44.00%), Query Frame = 1

Query: 39  RNSVQIPAIAAANGGGDPSSPSHYPIDALVT-SRDSSGQNRYIEQVNADEVPGYTRFENR 98
           RN+ + P  +  +G       S   ID  VT S +SS   R    +N+++   Y     R
Sbjct: 13  RNTFEAPEESEGSG-------SSAQIDTEVTASENSSTPARKCIMLNSNDEDPYG--VQR 72

Query: 99  VRICLNSRSRSEIKELTTKLKGELGHVRSLVKKFESQELQ---ISGYGGDVGHSQSQFSA 158
             I L + S+SE K+L  +LK EL   + ++K  E Q +    +S     VG S  Q   
Sbjct: 73  QVISLYNMSQSERKDLIYRLKLELEQTKIVLKNAELQRMNPAAVSSTSDRVGFSTGQ--- 132

Query: 159 NNLVEKVGNTWK-NDSVVGSADVPASRLVQSVSVAENFGEFAEKEMNKHKNSRYNPKTEF 218
             +  +V N+ K +D  VGS             V    G    +  N+  + ++    E 
Sbjct: 133 -KISSRVSNSKKPSDFAVGSGK----------KVRHQNG--TSRGWNRGTSGKFESSKET 192

Query: 219 PVSDCDSNRGKIDPLLKSCNNLLERLMKHKHGWVFNVPVDAKRLGLHDYHKIITKPMDLG 278
             S  +        L+K C+ LL +L  H H WVF  PVD  +L + DY   I  PMDLG
Sbjct: 193 MTSTPNIT------LMKQCDTLLRKLWSHPHSWVFQAPVDVVKLNIPDYLTTIKHPMDLG 252

Query: 279 TVKMRLNKNWYKSPREFAEDVRLTFSNAITYNPKGEDVHIMAEQLSNVFEEKWRIIEAKQ 338
           TVK  L    Y SP EFA DVRLTF+NA+TYNP G DVHIM + LS +FE +W+ I+ K 
Sbjct: 253 TVKKNLASGVYSSPHEFAADVRLTFTNAMTYNPPGHDVHIMGDILSKLFEARWKTIKKKL 312

Query: 339 DVGKDDGSRKSPALATPPVESRTFSKSKSTTKPPPANRESLGK--SDSITKPANVPDKKP 398
                   +  PA+   P + R     K+    PPA +  +     +S+ +P      KP
Sbjct: 313 ---PPCSMQTLPAVTLEPNDER-----KAAISVPPAKKRKMASPVRESVPEPV-----KP 372

Query: 399 NAKNHGNRDMNYEEKQKLSIDLQDLPSDELNNVVKIIKKRN-QGLFQNDDEIELDIGSVD 458
                    M   E+ +L   L+ L  +   +++  +KK N  G    +DEIE+DI  + 
Sbjct: 373 L--------MTEVERHRLGRQLESLLDELPAHIIDFLKKHNSNGGEIAEDEIEIDIDVLS 432

Query: 459 SKTLWELERFVADY---KNSLITNKRKVDADL-QSSRYSTKDMDRAVD------DAGGGP 518
            + L  L   + +Y   K +  TN    + +L   SR S   + R  +      D    P
Sbjct: 433 DEVLVTLRNLLDEYIQNKEAKQTNVEPCEIELINGSRPSNSSLQRGNEMADEYVDGNEPP 460

Query: 519 VGGNADSEGEGDSSSTCGDA 521
           +  ++     G S     DA
Sbjct: 493 ISRSSSDSDSGSSEDQSDDA 460

BLAST of CmaCh04G005650 vs. TrEMBL
Match: A0A0A0KYZ1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G083710 PE=4 SV=1)

HSP 1 Score: 737.6 bits (1903), Expect = 1.0e-209
Identity = 411/544 (75.55%), Positives = 447/544 (82.17%), Query Frame = 1

Query: 1   MASVLRGGGEPGGNPRRTDNNKFSTGNQRKESKIPKS-ARNSVQIPAIAAANGGGDPSSP 60
           MASVL+G G+ GGNPR+ DN+KF+ G Q+K+SKI K  ARNS+Q P +AA NGG +PSSP
Sbjct: 1   MASVLQGDGDAGGNPRKRDNDKFNAGKQQKQSKIAKRVARNSLQTPTVAATNGGANPSSP 60

Query: 61  SHYPIDALVTSRDSSGQNRYIEQVNADEVPGYTRFENRVRICLNSRSRSEIKELTTKLKG 120
           SH PIDALVTSR  SGQN   E VNA+EVP YTRFENRVRI LNSRSR  IKELTTKLKG
Sbjct: 61  SHNPIDALVTSRFYSGQNHCSEPVNAEEVPVYTRFENRVRINLNSRSRFGIKELTTKLKG 120

Query: 121 ELGHVRSLVKKFESQELQISGYGGDVGHSQSQFSANNLVEKVG--NTWKNDSVVGSADVP 180
           EL  VRSLVKKFE+QELQ+SGYGGDVGHSQSQFSANNLVE+VG  +T K +S VGSADVP
Sbjct: 121 ELDQVRSLVKKFETQELQLSGYGGDVGHSQSQFSANNLVERVGTVSTMKVNSEVGSADVP 180

Query: 181 ASRLVQSVSVAENFGEFAEKEMNKHKNSRYNPKTEFPVSDCDSNRGKIDPLLKSCNNLLE 240
           ASRLV+  SVAENFGEFAEKE++KHKNS+Y    E P+SDC+ N GKI P+LKSC+NLLE
Sbjct: 181 ASRLVRCASVAENFGEFAEKEVSKHKNSKYASTKELPMSDCNLNGGKIGPVLKSCSNLLE 240

Query: 241 RLMKHKHGWVFNVPVDAKRLGLHDYHKIITKPMDLGTVKMRLNKNWYKSPREFAEDVRLT 300
           RLMKHK GWVFNVPVDAKRLGLHDYHKIITKPMDLGT+KMRLNKNWYKSPREFAEDVRLT
Sbjct: 241 RLMKHKFGWVFNVPVDAKRLGLHDYHKIITKPMDLGTIKMRLNKNWYKSPREFAEDVRLT 300

Query: 301 FSNAITYNPKGEDVHIMAEQLSNVFEEKWRIIEAKQDVGK----DDG-------SRKSPA 360
           FSNAITYNPKGEDVH+MAEQLSN+FEEKW+ IE KQ+VGK    DDG       SRKSPA
Sbjct: 301 FSNAITYNPKGEDVHMMAEQLSNIFEEKWKTIEGKQNVGKGFQVDDGSVLPTPTSRKSPA 360

Query: 361 LATPPVESRTFSKSKSTTKPPPANRESLGKSDSITKPANV--PDKKPNAKNHGNRDMNYE 420
           LAT PVESRTFS+S STTK           S+    P +V  PDKKP AKNH  RDM YE
Sbjct: 361 LATRPVESRTFSRSDSTTK-------HFLTSNPKQPPTDVAPPDKKPKAKNHEIRDMTYE 420

Query: 421 EKQKLSIDLQDLPSDELNNVVKIIKKRNQGLFQNDDEIELDIGSVDSKTLWELERFVADY 480
           EKQKLSIDLQDLPSD+LNNVVKIIKKRNQGLFQNDDEIELDIGSVDS+TLWELERFVA+Y
Sbjct: 421 EKQKLSIDLQDLPSDKLNNVVKIIKKRNQGLFQNDDEIELDIGSVDSETLWELERFVANY 480

Query: 481 KNSLITNKRKVDADLQS----SRYSTKDMD-RAVDDAGGGPVGGNADSEGEGDSSSTCGD 524
           K SLI NKRK DA+LQS    S YST D D  AV  AGG PVGGNADS  E DSSSTCGD
Sbjct: 481 KKSLIKNKRKADANLQSGEKLSHYSTNDTDLLAVAKAGGKPVGGNADS--ENDSSSTCGD 535

BLAST of CmaCh04G005650 vs. TrEMBL
Match: A0A061DNG1_THECC (Global transcription factor group E4, putative isoform 1 OS=Theobroma cacao GN=TCM_003703 PE=4 SV=1)

HSP 1 Score: 335.9 bits (860), Expect = 8.9e-89
Identity = 217/473 (45.88%), Positives = 288/473 (60.89%), Query Frame = 1

Query: 55  DPSSPSHYPIDALVT--SRDSSGQNRYIEQVNADEVPGYTRFENRVRICLNSRSRSEIKE 114
           D +S    P+  + T  S DSS  N++  QV A      +  ENRV+I L SRS+ E+++
Sbjct: 123 DMNSAHQQPVPYVDTAVSDDSSNLNKH--QVVASNGAVKSSSENRVKINLASRSKQEMRD 182

Query: 115 LTTKLKGELGHVRSLVKKFESQELQISGYGGDVGHSQSQFSANNLVEKVGNTWKNDSVVG 174
           L  KL+ EL  VR+LVK+ E++E QISG+      S S+   N+ V+      +  S V 
Sbjct: 183 LRRKLESELDLVRNLVKRIEAKEGQISGF------SNSRLLLNDSVDY--GLKRVQSEVA 242

Query: 175 SADVPASRLVQS-------VSVAENF--GEFAEKEMNKHKNSRYNPKTEFPVSD-----C 234
           SA +P   + QS       +SV EN    E  EKE    K +++   +EF ++       
Sbjct: 243 SAGIPQEPVRQSRPLNQLSISVLENSQGNENLEKEKRTPKANQFYRNSEFLLAKDKFPPA 302

Query: 235 DSNR------------------GKIDPLLKSCNNLLERLMKHKHGWVFNVPVDAKRLGLH 294
           +SN+                  G  +   KSC++LLERLMKHKHGWVFN PVD K LGLH
Sbjct: 303 ESNKKSKLNGKKAGGGEFTHGFGMGNKFFKSCSSLLERLMKHKHGWVFNAPVDVKGLGLH 362

Query: 295 DYHKIITKPMDLGTVKMRLNKNWYKSPREFAEDVRLTFSNAITYNPKGEDVHIMAEQLSN 354
           DY+ II  PMDLGTVK RLNKNWYKSPREFAEDVRLTF NA+TYNPKG+DVH+MAEQLS 
Sbjct: 363 DYYSIIKHPMDLGTVKSRLNKNWYKSPREFAEDVRLTFRNAMTYNPKGQDVHVMAEQLSK 422

Query: 355 VFEEKWRIIEA----------KQDVGKDDGS-RKSPALATPPVE-SRTFSKSKSTTKPPP 414
           +FE+KW +IE           + +V     + RK+  +  PP++  R   +S+S  +P  
Sbjct: 423 IFEDKWAVIETDYIREMRLAIEYEVSLPTPTPRKAHPMLPPPLDMRRILDRSESMIRPVD 482

Query: 415 ANRESLGKSDSITKPANVPDKKPNAKNHGNRDMNYEEKQKLSIDLQDLPSDELNNVVKII 474
              + +  + S   PA    KKP AK+   RDM YEEKQKLS +LQ LPS++L+N+V+II
Sbjct: 483 MRPKLIATTPSSRTPA---PKKPKAKDPYKRDMTYEEKQKLSTNLQSLPSEKLDNIVQII 542

Query: 475 KKRNQGLFQNDDEIELDIGSVDSKTLWELERFVADYKNSLITNKRKVDADLQS 482
           KKRN  LFQ+DDEIE+DI SVD++TLWEL+RFV +YK SL  NKRK +  +Q+
Sbjct: 543 KKRNSALFQHDDEIEVDIDSVDTETLWELDRFVTNYKKSLSKNKRKAELAIQA 582

BLAST of CmaCh04G005650 vs. TrEMBL
Match: A0A061DW54_THECC (Global transcription factor group E4, putative isoform 2 OS=Theobroma cacao GN=TCM_003703 PE=4 SV=1)

HSP 1 Score: 335.9 bits (860), Expect = 8.9e-89
Identity = 217/473 (45.88%), Positives = 288/473 (60.89%), Query Frame = 1

Query: 55  DPSSPSHYPIDALVT--SRDSSGQNRYIEQVNADEVPGYTRFENRVRICLNSRSRSEIKE 114
           D +S    P+  + T  S DSS  N++  QV A      +  ENRV+I L SRS+ E+++
Sbjct: 123 DMNSAHQQPVPYVDTAVSDDSSNLNKH--QVVASNGAVKSSSENRVKINLASRSKQEMRD 182

Query: 115 LTTKLKGELGHVRSLVKKFESQELQISGYGGDVGHSQSQFSANNLVEKVGNTWKNDSVVG 174
           L  KL+ EL  VR+LVK+ E++E QISG+      S S+   N+ V+      +  S V 
Sbjct: 183 LRRKLESELDLVRNLVKRIEAKEGQISGF------SNSRLLLNDSVDY--GLKRVQSEVA 242

Query: 175 SADVPASRLVQS-------VSVAENF--GEFAEKEMNKHKNSRYNPKTEFPVSD-----C 234
           SA +P   + QS       +SV EN    E  EKE    K +++   +EF ++       
Sbjct: 243 SAGIPQEPVRQSRPLNQLSISVLENSQGNENLEKEKRTPKANQFYRNSEFLLAKDKFPPA 302

Query: 235 DSNR------------------GKIDPLLKSCNNLLERLMKHKHGWVFNVPVDAKRLGLH 294
           +SN+                  G  +   KSC++LLERLMKHKHGWVFN PVD K LGLH
Sbjct: 303 ESNKKSKLNGKKAGGGEFTHGFGMGNKFFKSCSSLLERLMKHKHGWVFNAPVDVKGLGLH 362

Query: 295 DYHKIITKPMDLGTVKMRLNKNWYKSPREFAEDVRLTFSNAITYNPKGEDVHIMAEQLSN 354
           DY+ II  PMDLGTVK RLNKNWYKSPREFAEDVRLTF NA+TYNPKG+DVH+MAEQLS 
Sbjct: 363 DYYSIIKHPMDLGTVKSRLNKNWYKSPREFAEDVRLTFRNAMTYNPKGQDVHVMAEQLSK 422

Query: 355 VFEEKWRIIEA----------KQDVGKDDGS-RKSPALATPPVE-SRTFSKSKSTTKPPP 414
           +FE+KW +IE           + +V     + RK+  +  PP++  R   +S+S  +P  
Sbjct: 423 IFEDKWAVIETDYIREMRLAIEYEVSLPTPTPRKAHPMLPPPLDMRRILDRSESMIRPVD 482

Query: 415 ANRESLGKSDSITKPANVPDKKPNAKNHGNRDMNYEEKQKLSIDLQDLPSDELNNVVKII 474
              + +  + S   PA    KKP AK+   RDM YEEKQKLS +LQ LPS++L+N+V+II
Sbjct: 483 MRPKLIATTPSSRTPA---PKKPKAKDPYKRDMTYEEKQKLSTNLQSLPSEKLDNIVQII 542

Query: 475 KKRNQGLFQNDDEIELDIGSVDSKTLWELERFVADYKNSLITNKRKVDADLQS 482
           KKRN  LFQ+DDEIE+DI SVD++TLWEL+RFV +YK SL  NKRK +  +Q+
Sbjct: 543 KKRNSALFQHDDEIEVDIDSVDTETLWELDRFVTNYKKSLSKNKRKAELAIQA 582

BLAST of CmaCh04G005650 vs. TrEMBL
Match: A0A151SES9_CAJCA (Bromodomain-containing protein 4 OS=Cajanus cajan GN=KK1_024682 PE=4 SV=1)

HSP 1 Score: 334.7 bits (857), Expect = 2.0e-88
Identity = 231/517 (44.68%), Positives = 293/517 (56.67%), Query Frame = 1

Query: 19  DNNKFSTGNQRKESKIPKSARNSVQIPAIAAANGGGDPSSPSHYPIDALVTSRDSSGQNR 78
           DN+    GN R    +  +  N    PA+   +G    S+  H        S DSS  NR
Sbjct: 71  DNSTVDNGNDR----VKDNCNNVSAQPAVVPEDGN---SARLHVNSRLDAVSDDSSSLNR 130

Query: 79  YIEQ-----VNADEVPGYTRF----ENRVRICLNSRSRSEIKELTTKLKGELGHVRSLVK 138
             ++     V  D V G        EN VRI L SRS+ E +EL  KL+ EL  +RSLV 
Sbjct: 131 LQDEPLSHHVRQDSVAGLREQSPVPENCVRISLASRSKQEKRELRRKLESELDRIRSLVN 190

Query: 139 KFESQELQISGYGGDVGHSQSQFSANNLVEKVGNTWKNDSVVGSADV---PASRLVQ-SV 198
           + E ++    G  G       + + + +V  V    +  S V SA V   P   L Q SV
Sbjct: 191 RIEEKQ----GLLGASAVVPRERARSEVVSAVVPRERARSEVASAVVSREPTRPLHQLSV 250

Query: 199 SVAEN---FGEFAEKEMNKHKNSRYNPKTEF-----------------------PVSDCD 258
           SV EN    GE  EKE    K +++   +EF                        +S+  
Sbjct: 251 SVLENSQGVGEIVEKEKRTPKANQFYRNSEFLLGKDKFPPAESNKKSKLNGKKHGMSEMG 310

Query: 259 SNRGKIDPLLKSCNNLLERLMKHKHGWVFNVPVDAKRLGLHDYHKIITKPMDLGTVKMRL 318
              G    LLKSC++LLE+LMKHKHGWVFN PVD + LGLHDY  IIT PMDLGTVK RL
Sbjct: 311 HGHGMGSKLLKSCSSLLEKLMKHKHGWVFNSPVDVEGLGLHDYFSIITHPMDLGTVKSRL 370

Query: 319 NKNWYKSPREFAEDVRLTFSNAITYNPKGEDVHIMAEQLSNVFEEKWRIIEAKQD----V 378
           NKNWYKSP+EFAEDVRLTF NA+TYNPKG+DVH+MAEQLSNVFEE+W IIE+  +     
Sbjct: 371 NKNWYKSPKEFAEDVRLTFRNAMTYNPKGQDVHVMAEQLSNVFEERWAIIESNYNSNMRY 430

Query: 379 GKDDG---------SRKSPALATPPVE-SRTFSKSKSTTKPPPANRESLGKSDSITKPAN 438
           G D G         SRK P+   PP++  R   +S+S T+PP        +  SIT  + 
Sbjct: 431 GLDYGAAIPTPSPLSRKGPSFRPPPIDMRRILDRSESMTQPP--------RIMSITPSSR 490

Query: 439 VP-DKKPNAKNHGNRDMNYEEKQKLSIDLQDLPSDELNNVVKIIKKRNQGLFQNDDEIEL 482
            P  KKP AK+   RDM Y+EKQKLS +LQ LPS++L+ +V+IIKKRN  L Q+DDEIE+
Sbjct: 491 TPAPKKPKAKDPHKRDMTYDEKQKLSTNLQSLPSEKLDAIVQIIKKRNSALSQHDDEIEV 550

BLAST of CmaCh04G005650 vs. TrEMBL
Match: K7M2K1_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_13G292200 PE=4 SV=1)

HSP 1 Score: 333.6 bits (854), Expect = 4.4e-88
Identity = 234/528 (44.32%), Positives = 309/528 (58.52%), Query Frame = 1

Query: 2   ASVLRGGGEPGGNPRRTDNNKFSTGNQRKESKIPKSARNSVQIPAIAAANGGGDPSSPS- 61
           A+   GG          D NK ++     + +   ++ N+  +P        G+ + P  
Sbjct: 51  AATTNGGDNGSATATAVDYNKDNSTVNNGDVRAKDNSNNASVLPVPVPVPEDGNSARPQV 110

Query: 62  HYPIDALVTSRDSSGQNRYIEQVNADEVPGYTRF----ENRVRICLNSRSRSEIKELTTK 121
           +  +D  V S DSS  NR  ++  +  VPG        EN VRI L SRS+ E +EL  +
Sbjct: 111 NSRLD--VISDDSSSLNRPRDEPLS--VPGVRERSPGPENCVRISLASRSKQEKRELRRR 170

Query: 122 LKGELGHVRSLVKKFESQELQISGYGGDVGHSQSQFSANNLVEK-VGN---TWKNDSVVG 181
           L+GEL  VRSLV   E +   + GYG          +++ +V++ +GN     +  S V 
Sbjct: 171 LQGELIRVRSLVNGIEEKLGVLGGYG----------NSDRMVDRGIGNGIGAKRAHSEVA 230

Query: 182 SADV----PASRLVQ-SVSVAEN---FGEFAEKEMNKHKNSRYNPKTEFPVSD-----CD 241
           SA V    P   L Q SVSV EN    GE  EKE    K +++   +EF ++       +
Sbjct: 231 SAVVTLREPTRPLHQLSVSVLENSQGVGEIVEKEKRTPKANQFYRNSEFLLAKDKFPPAE 290

Query: 242 SNR----------------GKIDPLLKSCNNLLERLMKHKHGWVFNVPVDAKRLGLHDYH 301
           SN+                G    LLKSC++LLE+LMKHKHGWVF+ PVD + LGLHDY 
Sbjct: 291 SNKKSKLNGKKHGTGEMGHGMGSKLLKSCSSLLEKLMKHKHGWVFDTPVDVEGLGLHDYF 350

Query: 302 KIITKPMDLGTVKMRLNKNWYKSPREFAEDVRLTFSNAITYNPKGEDVHIMAEQLSNVFE 361
            IIT PMDLGTVK RLNKNWY+SP+EFAEDVRLTF NA+TYNPKG+DVHIMAEQLSN+FE
Sbjct: 351 SIITHPMDLGTVKSRLNKNWYRSPKEFAEDVRLTFHNAMTYNPKGQDVHIMAEQLSNIFE 410

Query: 362 EKWRIIEA----KQDVGKDDG-----SRKSPALATPPVE-SRTFSKSKSTTKPPPANRES 421
           E+W IIE+    +   G D G     SRK+P    PP++  R   +S+S T+PP    + 
Sbjct: 411 ERWAIIESNYNREMTYGLDYGAPSPVSRKAPPFRPPPIDMRRILDRSESMTQPP----KI 470

Query: 422 LGKSDSITKPANVPDKKPNAKNHGNRDMNYEEKQKLSIDLQDLPSDELNNVVKIIKKRNQ 481
           +G + S   PA    KKP AK+   RDM YEEKQKLS  LQ LPS++L+ +V+IIKKRN 
Sbjct: 471 MGITPSSRTPA---PKKPKAKDPHKRDMTYEEKQKLSTHLQSLPSEKLDAIVQIIKKRNS 530

BLAST of CmaCh04G005650 vs. TAIR10
Match: AT1G06230.1 (AT1G06230.1 global transcription factor group E4)

HSP 1 Score: 262.3 bits (669), Expect = 6.3e-70
Identity = 188/480 (39.17%), Positives = 255/480 (53.12%), Query Frame = 1

Query: 85  ADEVPGYTRFENRVRICLNSRSRSEIKELTTKLKGELGHVRSLVKKFESQELQISGYGGD 144
           A  +P     + R+RI + S ++ + +E+  KL+ +L  VR +VKK E +E +I  Y   
Sbjct: 255 AGSMPMEEDADGRIRIHVASTTKQQKEEIRKKLEDQLNVVRGMVKKIEDKEGEIGAY--- 314

Query: 145 VGHSQSQFSANNLVEKVGNTWKNDSVVGSADVP-----ASRLVQ--SVSVAEN---FGEF 204
              + S+   N  +   G   +  S   SA +P     A R V   S+SV EN     E 
Sbjct: 315 ---NDSRVLINTGINNGGG--RILSGFASAGLPREVIRAPRPVNQLSISVLENTQGVNEH 374

Query: 205 AEKEMNKHKNSRYNPKTEFPVSD----CDSNR-----------------GKIDPLLKSCN 264
            EKE    K +++   +EF + D     +SN+                 G    + K+C+
Sbjct: 375 VEKEKRTPKANQFYRNSEFLLGDKLPPAESNKKSKSSSKKQGGDVGHGFGAGTKVFKNCS 434

Query: 265 NLLERLMKHKHGWVFNVPVDAKRLGLHDYHKIITKPMDLGTVKMRLNKNWYKSPREFAED 324
            LLERLMKHKHGWVFN PVD K LGL DY+ II  PMDLGT+K  L KN YKSPREFAED
Sbjct: 435 ALLERLMKHKHGWVFNAPVDVKGLGLLDYYTIIEHPMDLGTIKSALMKNLYKSPREFAED 494

Query: 325 VRLTFSNAITYNPKGEDVHIMAEQLSNVFEEKWRIIEAKQDVGKDDGSRKSPALATPPVE 384
           VRLTF NA+TYNP+G+DVH+MA  L  +FEE+W +IEA  +      +     L TP + 
Sbjct: 495 VRLTFHNAMTYNPEGQDVHLMAVTLLQIFEERWAVIEADYNREMRFVTGYEMNLPTPTMR 554

Query: 385 SRTFSKSKSTTKPPPAN-RESLGKSD-----SITKPANVPD-----------KKPNAKNH 444
           SR       T  PPP N R ++ ++D       T P   P            KKP A   
Sbjct: 555 SRL----GPTMPPPPINVRNTIDRADWSNRQPTTTPGRTPTSATPSGRTPALKKPKANEP 614

Query: 445 GNRDMNYEEKQKLSIDLQDLPSDELNNVVKIIKKRNQGLFQNDDEIELDIGSVDSKTLWE 504
             RDM YEEKQKLS  LQ+LP D+L+ +V+I+ KRN  +   D+EIE+DI SVD +TLWE
Sbjct: 615 NKRDMTYEEKQKLSGHLQNLPPDKLDAIVQIVNKRNTAVKLRDEEIEVDIDSVDPETLWE 674

Query: 505 LERFVADYKNSLITNKRKVDADLQSSRYSTKDMDRAVDDAGGGPVGGNADSEGEGDSSST 517
           L+RFV +YK  L   KRK +  +Q+   + ++  + +  A   P       EG   +  T
Sbjct: 675 LDRFVTNYKKGLSKKKRKAELAIQARAEAERNSQQQMAPA---PAAHEFSREGGNTAKKT 719

BLAST of CmaCh04G005650 vs. TAIR10
Match: AT1G73150.1 (AT1G73150.1 global transcription factor group E3)

HSP 1 Score: 238.4 bits (607), Expect = 9.8e-63
Identity = 140/273 (51.28%), Positives = 175/273 (64.10%), Query Frame = 1

Query: 228 LLKSCNNLLERLMKHKHGWVFNVPVDAKRLGLHDYHKIITKPMDLGTVKMRLNKNWYKSP 287
           +LKSCNNLL +LMKHK GW+FN PVD   LGLHDYH II +PMDLGTVK RL+K+ YKSP
Sbjct: 119 ILKSCNNLLTKLMKHKSGWIFNTPVDVVTLGLHDYHNIIKEPMDLGTVKTRLSKSLYKSP 178

Query: 288 REFAEDVRLTFSNAITYNPKGEDVHIMAEQLSNVFEEKWRIIEA-------KQDVGKDDG 347
            EFAEDVRLTF+NA+ YNP G DV+ MAE L N+FEEKW  +E        KQ   +D  
Sbjct: 179 LEFAEDVRLTFNNAMLYNPVGHDVYHMAEILLNLFEEKWVPLETQYELLIRKQQPVRDID 238

Query: 348 SRKSPALATPPVESRTFSKSKSTTKPPP----ANRESLGKSDSITKPAN------VPDKK 407
                +  T  VE+        +  PPP        +L +++S+T P        VP+K 
Sbjct: 239 FHAPVSTNTHNVEALPLPAPTPSLSPPPPPKVVENRTLERAESMTNPVKPAVLPVVPEKL 298

Query: 408 PNAKNHGNRDMNYEEKQKLSIDLQDLPSDELNNVVKIIKKRNQGLFQNDDEIELDIGSVD 467
               +  NRD+ ++EK++LS DLQDLP D+L  VV+IIKKR   L Q DDEIELDI S+D
Sbjct: 299 VEEAS-ANRDLTFDEKRQLSEDLQDLPYDKLEAVVQIIKKRTPELSQQDDEIELDIDSLD 358

Query: 468 SKTLWELERFVADYKNSLITNKRKVDADLQSSR 484
            +TLWEL RFV +YK SL  +K+K +  L S R
Sbjct: 359 LETLWELFRFVTEYKESL--SKKKEEQGLDSER 388

BLAST of CmaCh04G005650 vs. TAIR10
Match: AT1G17790.1 (AT1G17790.1 DNA-binding bromodomain-containing protein)

HSP 1 Score: 218.8 bits (556), Expect = 8.1e-57
Identity = 131/266 (49.25%), Positives = 165/266 (62.03%), Query Frame = 1

Query: 228 LLKSCNNLLERLMKHKHGWVFNVPVDAKRLGLHDYHKIITKPMDLGTVKMRLNKNWYKSP 287
           + K+CN+LL +LMKHK  WVFNVPVDAK LGLHDYH I+ +PMDLGTVK +L K+ YKSP
Sbjct: 132 IFKNCNSLLTKLMKHKSAWVFNVPVDAKGLGLHDYHNIVKEPMDLGTVKTKLGKSLYKSP 191

Query: 288 REFAEDVRLTFSNAITYNPKGEDVHIMAEQLSNVFEEKWRIIEAKQD--------VGKDD 347
            +FAEDVRLTF+NAI YNP G DV+  AE L N+FE+KW  IE + D            +
Sbjct: 192 LDFAEDVRLTFNNAILYNPIGHDVYRFAELLLNMFEDKWVSIEMQYDNLHRKFKPTRDIE 251

Query: 348 GSRKSPALA--TPPVESRTFSKSKSTTKPPP--------ANRESLGKSDSIT---KPANV 407
               +P++A    P+ +   S S S+  PPP            +  + +S+T   +P  V
Sbjct: 252 FPAPAPSIAPIVEPLPAIVPSPSPSSPPPPPPPPVAAPVLENRTWEREESMTIPVEPEAV 311

Query: 408 PDKKPNAKNH----GNRDMNYEEKQKLSIDLQDLPSDELNNVVKIIKKRNQGLFQNDDEI 467
                 A+       NRD+  EEK++LS +LQDLP D+L  VV+IIKK N  L Q DDEI
Sbjct: 312 ITAPEKAEEEEAPVNNRDLTLEEKRRLSEELQDLPYDKLETVVQIIKKSNPELSQKDDEI 371

Query: 468 ELDIGSVDSKTLWELERFVADYKNSL 469
           ELDI S+D  TLWEL RFV  YK SL
Sbjct: 372 ELDIDSLDINTLWELYRFVTGYKESL 397

BLAST of CmaCh04G005650 vs. TAIR10
Match: AT5G63320.1 (AT5G63320.1 nuclear protein X1)

HSP 1 Score: 144.8 bits (364), Expect = 1.5e-34
Identity = 123/378 (32.54%), Positives = 169/378 (44.71%), Query Frame = 1

Query: 100 ICLNSRSRSEIKELTTKLKGELGHVRSLVKK---FESQELQISGYGGDVGHSQSQFSAN- 159
           + L+  SRSE K L  KLK EL  VR L KK   F S  + +S Y     HS S      
Sbjct: 61  LSLSKMSRSERKNLVHKLKMELQQVRDLSKKIASFSSDTVLLSPYND---HSCSDGPRRP 120

Query: 160 ---NLVEKVGNTWKNDSVVGSADVPASRLVQSVSVAENFGEFAEKEMNKHKNSRYNPKTE 219
              N    VG+  K    V S                      +K+ NK   SR N  T 
Sbjct: 121 PPENFATFVGSQGKKRPPVRS----------------------DKQRNKKGPSRLNVPTS 180

Query: 220 FPVSDCDSNRGKIDPLLKSCNNLLERLMKHKHGWVFNVPVDAKRLGLHDYHKIITKPMDL 279
           + V+           ++K C  LL RL  HK GW F  PVD   L + DY  +I  PMDL
Sbjct: 181 YTVAS----------VMKECETLLNRLWSHKSGWPFRTPVDPVMLNIPDYFNVIKHPMDL 240

Query: 280 GTVKMRLNKNWYKSPREFAEDVRLTFSNAITYNPKGEDVHIMAEQLSNVFEEKWRIIEAK 339
           GT++ RL K  Y SP +FA DVRLTFSN+I YNP G   H MA+ +S  FE  W+ IE K
Sbjct: 241 GTIRSRLCKGEYSSPLDFAADVRLTFSNSIAYNPPGNQFHTMAQGISKYFESGWKSIEKK 300

Query: 340 QDVGKDDGSRKSPALATPPVESRTFSKSKSTTKP---PPANRESLGKSDS--ITKPANVP 399
             + K            PPV   T S S  +  P    P  ++    +D+    +PA + 
Sbjct: 301 IPMSK------------PPVIPLTSSASLESEIPFEVAPMRKKEAAMNDNKLRVEPAKLV 360

Query: 400 DKKPNAKNHGNRDMNYEEKQKLSIDLQDLPSDELNNVVKIIKKRNQGLFQN-DDEIELDI 459
                        M   EK+KL  DL  L  D    +  ++++++    Q+ + EIE+DI
Sbjct: 361 -------------MTDGEKKKLGQDLMALEEDFPQKIADLLREQSGSDGQSGEGEIEIDI 378

Query: 460 GSVDSKTLWELERFVADY 465
            ++  + L+ + + + DY
Sbjct: 421 EALSDEILFMVRKLLDDY 378

BLAST of CmaCh04G005650 vs. TAIR10
Match: AT3G27260.1 (AT3G27260.1 global transcription factor group E8)

HSP 1 Score: 142.5 bits (358), Expect = 7.3e-34
Identity = 149/500 (29.80%), Positives = 220/500 (44.00%), Query Frame = 1

Query: 39  RNSVQIPAIAAANGGGDPSSPSHYPIDALVT-SRDSSGQNRYIEQVNADEVPGYTRFENR 98
           RN+ + P  +  +G       S   ID  VT S +SS   R    +N+++   Y     R
Sbjct: 13  RNTFEAPEESEGSG-------SSAQIDTEVTASENSSTPARKCIMLNSNDEDPYG--VQR 72

Query: 99  VRICLNSRSRSEIKELTTKLKGELGHVRSLVKKFESQELQ---ISGYGGDVGHSQSQFSA 158
             I L + S+SE K+L  +LK EL   + ++K  E Q +    +S     VG S  Q   
Sbjct: 73  QVISLYNMSQSERKDLIYRLKLELEQTKIVLKNAELQRMNPAAVSSTSDRVGFSTGQ--- 132

Query: 159 NNLVEKVGNTWK-NDSVVGSADVPASRLVQSVSVAENFGEFAEKEMNKHKNSRYNPKTEF 218
             +  +V N+ K +D  VGS             V    G    +  N+  + ++    E 
Sbjct: 133 -KISSRVSNSKKPSDFAVGSGK----------KVRHQNG--TSRGWNRGTSGKFESSKET 192

Query: 219 PVSDCDSNRGKIDPLLKSCNNLLERLMKHKHGWVFNVPVDAKRLGLHDYHKIITKPMDLG 278
             S  +        L+K C+ LL +L  H H WVF  PVD  +L + DY   I  PMDLG
Sbjct: 193 MTSTPNIT------LMKQCDTLLRKLWSHPHSWVFQAPVDVVKLNIPDYLTTIKHPMDLG 252

Query: 279 TVKMRLNKNWYKSPREFAEDVRLTFSNAITYNPKGEDVHIMAEQLSNVFEEKWRIIEAKQ 338
           TVK  L    Y SP EFA DVRLTF+NA+TYNP G DVHIM + LS +FE +W+ I+ K 
Sbjct: 253 TVKKNLASGVYSSPHEFAADVRLTFTNAMTYNPPGHDVHIMGDILSKLFEARWKTIKKKL 312

Query: 339 DVGKDDGSRKSPALATPPVESRTFSKSKSTTKPPPANRESLGK--SDSITKPANVPDKKP 398
                   +  PA+   P + R     K+    PPA +  +     +S+ +P      KP
Sbjct: 313 ---PPCSMQTLPAVTLEPNDER-----KAAISVPPAKKRKMASPVRESVPEPV-----KP 372

Query: 399 NAKNHGNRDMNYEEKQKLSIDLQDLPSDELNNVVKIIKKRN-QGLFQNDDEIELDIGSVD 458
                    M   E+ +L   L+ L  +   +++  +KK N  G    +DEIE+DI  + 
Sbjct: 373 L--------MTEVERHRLGRQLESLLDELPAHIIDFLKKHNSNGGEIAEDEIEIDIDVLS 432

Query: 459 SKTLWELERFVADY---KNSLITNKRKVDADL-QSSRYSTKDMDRAVD------DAGGGP 518
            + L  L   + +Y   K +  TN    + +L   SR S   + R  +      D    P
Sbjct: 433 DEVLVTLRNLLDEYIQNKEAKQTNVEPCEIELINGSRPSNSSLQRGNEMADEYVDGNEPP 460

Query: 519 VGGNADSEGEGDSSSTCGDA 521
           +  ++     G S     DA
Sbjct: 493 ISRSSSDSDSGSSEDQSDDA 460

BLAST of CmaCh04G005650 vs. NCBI nr
Match: gi|449439059|ref|XP_004137305.1| (PREDICTED: transcription factor GTE3, chloroplastic [Cucumis sativus])

HSP 1 Score: 737.6 bits (1903), Expect = 1.5e-209
Identity = 411/544 (75.55%), Positives = 447/544 (82.17%), Query Frame = 1

Query: 1   MASVLRGGGEPGGNPRRTDNNKFSTGNQRKESKIPKS-ARNSVQIPAIAAANGGGDPSSP 60
           MASVL+G G+ GGNPR+ DN+KF+ G Q+K+SKI K  ARNS+Q P +AA NGG +PSSP
Sbjct: 1   MASVLQGDGDAGGNPRKRDNDKFNAGKQQKQSKIAKRVARNSLQTPTVAATNGGANPSSP 60

Query: 61  SHYPIDALVTSRDSSGQNRYIEQVNADEVPGYTRFENRVRICLNSRSRSEIKELTTKLKG 120
           SH PIDALVTSR  SGQN   E VNA+EVP YTRFENRVRI LNSRSR  IKELTTKLKG
Sbjct: 61  SHNPIDALVTSRFYSGQNHCSEPVNAEEVPVYTRFENRVRINLNSRSRFGIKELTTKLKG 120

Query: 121 ELGHVRSLVKKFESQELQISGYGGDVGHSQSQFSANNLVEKVG--NTWKNDSVVGSADVP 180
           EL  VRSLVKKFE+QELQ+SGYGGDVGHSQSQFSANNLVE+VG  +T K +S VGSADVP
Sbjct: 121 ELDQVRSLVKKFETQELQLSGYGGDVGHSQSQFSANNLVERVGTVSTMKVNSEVGSADVP 180

Query: 181 ASRLVQSVSVAENFGEFAEKEMNKHKNSRYNPKTEFPVSDCDSNRGKIDPLLKSCNNLLE 240
           ASRLV+  SVAENFGEFAEKE++KHKNS+Y    E P+SDC+ N GKI P+LKSC+NLLE
Sbjct: 181 ASRLVRCASVAENFGEFAEKEVSKHKNSKYASTKELPMSDCNLNGGKIGPVLKSCSNLLE 240

Query: 241 RLMKHKHGWVFNVPVDAKRLGLHDYHKIITKPMDLGTVKMRLNKNWYKSPREFAEDVRLT 300
           RLMKHK GWVFNVPVDAKRLGLHDYHKIITKPMDLGT+KMRLNKNWYKSPREFAEDVRLT
Sbjct: 241 RLMKHKFGWVFNVPVDAKRLGLHDYHKIITKPMDLGTIKMRLNKNWYKSPREFAEDVRLT 300

Query: 301 FSNAITYNPKGEDVHIMAEQLSNVFEEKWRIIEAKQDVGK----DDG-------SRKSPA 360
           FSNAITYNPKGEDVH+MAEQLSN+FEEKW+ IE KQ+VGK    DDG       SRKSPA
Sbjct: 301 FSNAITYNPKGEDVHMMAEQLSNIFEEKWKTIEGKQNVGKGFQVDDGSVLPTPTSRKSPA 360

Query: 361 LATPPVESRTFSKSKSTTKPPPANRESLGKSDSITKPANV--PDKKPNAKNHGNRDMNYE 420
           LAT PVESRTFS+S STTK           S+    P +V  PDKKP AKNH  RDM YE
Sbjct: 361 LATRPVESRTFSRSDSTTK-------HFLTSNPKQPPTDVAPPDKKPKAKNHEIRDMTYE 420

Query: 421 EKQKLSIDLQDLPSDELNNVVKIIKKRNQGLFQNDDEIELDIGSVDSKTLWELERFVADY 480
           EKQKLSIDLQDLPSD+LNNVVKIIKKRNQGLFQNDDEIELDIGSVDS+TLWELERFVA+Y
Sbjct: 421 EKQKLSIDLQDLPSDKLNNVVKIIKKRNQGLFQNDDEIELDIGSVDSETLWELERFVANY 480

Query: 481 KNSLITNKRKVDADLQS----SRYSTKDMD-RAVDDAGGGPVGGNADSEGEGDSSSTCGD 524
           K SLI NKRK DA+LQS    S YST D D  AV  AGG PVGGNADS  E DSSSTCGD
Sbjct: 481 KKSLIKNKRKADANLQSGEKLSHYSTNDTDLLAVAKAGGKPVGGNADS--ENDSSSTCGD 535

BLAST of CmaCh04G005650 vs. NCBI nr
Match: gi|659101628|ref|XP_008451707.1| (PREDICTED: transcription factor GTE4 [Cucumis melo])

HSP 1 Score: 729.6 bits (1882), Expect = 4.0e-207
Identity = 408/542 (75.28%), Positives = 440/542 (81.18%), Query Frame = 1

Query: 1   MASVLRGGGEPGGNPRRTDNNKFSTGNQRKESKIPKS-ARNSVQIPAIAAANGGGDPSSP 60
           MASVL+GGG+ GGNPR+TDN+KF+ G Q+K SKIPK  ARNS+Q P +AA NGG +PSSP
Sbjct: 1   MASVLQGGGDAGGNPRKTDNDKFNAGKQQKLSKIPKHVARNSLQTPTVAATNGGANPSSP 60

Query: 61  SHYPIDALVTSRDSSGQNRYIEQVNADEVPGYTRFENRVRICLNSRSRSEIKELTTKLKG 120
           SH PIDALVTSR  SGQN   E VNA+EVP YTRFENRVRI LNSRSRS IKELTTKLKG
Sbjct: 61  SHNPIDALVTSRFYSGQNHCSEPVNAEEVPVYTRFENRVRINLNSRSRSGIKELTTKLKG 120

Query: 121 ELGHVRSLVKKFESQELQISGYGGDVGHSQSQFSANNLVEKVG--NTWKNDSVVGSADVP 180
           EL  VRSLVKKFE+QELQ+SGYGGDVGHSQSQFSANNLVE+VG  +T K +S VGSADVP
Sbjct: 121 ELDQVRSLVKKFETQELQLSGYGGDVGHSQSQFSANNLVERVGTVSTIKVNSEVGSADVP 180

Query: 181 ASRLVQSVSVAENFGEFAEKEMNKHKNSRYNPKTEFPVSDCDSNRGKIDPLLKSCNNLLE 240
           ASRLV+ VSVAENFGEFAEKE++KHK S+Y    EFP+SDC+ N GKI P+LKSCNNLLE
Sbjct: 181 ASRLVRCVSVAENFGEFAEKEVSKHKTSKYASTEEFPMSDCNLNGGKIGPVLKSCNNLLE 240

Query: 241 RLMKHKHGWVFNVPVDAKRLGLHDYHKIITKPMDLGTVKMRLNKNWYKSPREFAEDVRLT 300
           RLMKHK GWVFNVPVDAKRLGLHDYHKIITKPMDLGT+KMRLNKNWYKS REFAEDVRLT
Sbjct: 241 RLMKHKFGWVFNVPVDAKRLGLHDYHKIITKPMDLGTIKMRLNKNWYKSSREFAEDVRLT 300

Query: 301 FSNAITYNPKGEDVHIMAEQLSNVFEEKWRIIEAKQDVGK----DDGS-------RKSPA 360
           FSNAITYNPKGEDVHIMAEQLS +FEEKW+ IE KQ  GK    DDGS       RKSPA
Sbjct: 301 FSNAITYNPKGEDVHIMAEQLSKIFEEKWKAIEGKQIAGKGFQVDDGSVLPTPTYRKSPA 360

Query: 361 LATPPVESRTFSKSKSTTKPPPANRESLGKSDSITKPANVPDKKPNAKNHGNRDMNYEEK 420
           LAT PVESRTFS+S STTK  P        +D  T     PDKKP AKNH  RDM YEEK
Sbjct: 361 LATRPVESRTFSRSDSTTKHLPTPNPKQTPTDVAT-----PDKKPKAKNHEIRDMTYEEK 420

Query: 421 QKLSIDLQDLPSDELNNVVKIIKKRNQGLFQNDDEIELDIGSVDSKTLWELERFVADYKN 480
           QKLS DLQDLPSD+LNNVV+IIKKRNQGLFQNDDEIELDIGSVDS+TLWELERFVA+YK 
Sbjct: 421 QKLSTDLQDLPSDKLNNVVRIIKKRNQGLFQNDDEIELDIGSVDSETLWELERFVANYKK 480

Query: 481 SLITNKRKVDADLQS----SRYSTKDMD-RAVDDAGGGPVGGNADSEGEGDSSSTCGDAN 524
           SLI NKRK DA+LQS    S YS  D D  AV  AGG  VG NADS  E DS S CGD N
Sbjct: 481 SLIKNKRKADANLQSGEKLSHYSINDTDLLAVAKAGGKHVGRNADS--ENDSFSACGDGN 535

BLAST of CmaCh04G005650 vs. NCBI nr
Match: gi|1009161953|ref|XP_015899173.1| (PREDICTED: transcription factor GTE3, chloroplastic [Ziziphus jujuba])

HSP 1 Score: 354.4 bits (908), Expect = 3.5e-94
Identity = 243/528 (46.02%), Positives = 304/528 (57.58%), Query Frame = 1

Query: 1   MASVLRGGGEPGGNPRRTDNNKFSTGNQRKESKIPKSARNSVQIPAIAAANGGGDPSSPS 60
           MAS    G +     R   +NK     + ++ K P           +A      D S P 
Sbjct: 1   MASGSMLGDDAKEKHRSAQSNKKFHSRKNQKPKNPNLLSRRSSQTLVAPITDNNDSSPPH 60

Query: 61  HY-PIDALVTSRDSSGQN----RYIEQVNADEVPGYTRFENRVRICLNSRSRSEIKELTT 120
           H+  +D    S D S  +    R  EQ N +  PGY  FENRVRI L+SRS+ +I+EL  
Sbjct: 61  HFLRVDDAAGSNDLSYHDHPLPRGSEQANENGFPGYMEFENRVRISLDSRSKMDIRELRR 120

Query: 121 KLKGELGHVRSLVKKFESQELQISGYGGDVGHSQSQFSANNLVEKVGNTWKNDSVVGSAD 180
           KL  EL  VR LVKK ES+E Q+SGY      S SQFSAN  V+ +G+T + +S VG   
Sbjct: 121 KLLSELDQVRCLVKKLESKEFQLSGY------SHSQFSANYAVDNMGSTERLNSGVGLKG 180

Query: 181 VPASRLVQSVS--VAEN-FGEFAEKEMNKHKNSRYNPKTEFPVSDC----DSNRGKIDP- 240
              SRL + +S  VAEN  G   E    K K     P+++  +       D  RG+  P 
Sbjct: 181 PRDSRLFRGLSDSVAENNHGVVGEVGGKKKKKKLPTPESDKKMKTGGGKKDELRGRFLPG 240

Query: 241 -------LLKSCNNLLERLMKHKHGWVFNVPVDAKRLGLHDYHKIITKPMDLGTVKMRLN 300
                  L  SC++LL +LMKHK GW+FNVPVD K LGLHDYH I+  PMDLGTVK RLN
Sbjct: 241 KDKYSSQLFNSCSDLLGKLMKHKFGWIFNVPVDVKGLGLHDYHTIVKHPMDLGTVKTRLN 300

Query: 301 KNWYKSPREFAEDVRLTFSNAITYNPKGEDVHIMAEQLSNVFEEKWRIIEAKQ------D 360
           K WYKSP EFAEDVRLTF NA+ YNPKG+D + MAEQL  +FE KW  +EA+       +
Sbjct: 301 KGWYKSPMEFAEDVRLTFHNAMFYNPKGQDAYFMAEQLLKIFEPKWLALEAEYNLNKTLE 360

Query: 361 VGKDD----GSRK--SPALATPPVESRTFSKSKSTTKPPPANRESLGKSDSITKPAN--- 420
           VGK D     SRK  +PA A P +  R  S       PPP +  +L +S+S TKP +   
Sbjct: 361 VGKADLPTPASRKVQNPATAPPRLPPRPAS-------PPPMS--TLDRSESHTKPVDPKL 420

Query: 421 ------------VPDKKPNAKNHGNRDMNYEEKQKLSIDLQDLPSDELNNVVKIIKKRNQ 480
                       VP KKP AK+   RDM YEEKQKLS +LQ+LPS++L+NVV+IIKKRN 
Sbjct: 421 KPEGFGHVGRTPVP-KKPKAKDPDKRDMTYEEKQKLSANLQNLPSEKLDNVVQIIKKRNP 480

Query: 481 GLFQNDDEIELDIGSVDSKTLWELERFVADYKNSLITNKRKVDADLQS 482
            LFQ +DEIE+DI +VD +TLWEL+RFV +YK SL   KRK +  LQS
Sbjct: 481 RLFQQEDEIEVDIANVDPETLWELDRFVTNYKKSLSKIKRKNELALQS 512

BLAST of CmaCh04G005650 vs. NCBI nr
Match: gi|720074605|ref|XP_010279073.1| (PREDICTED: transcription factor GTE4 [Nelumbo nucifera])

HSP 1 Score: 345.5 bits (885), Expect = 1.6e-91
Identity = 236/567 (41.62%), Positives = 318/567 (56.08%), Query Frame = 1

Query: 3   SVLRGGGEPGGNPRRTDNNKFST--GNQRKESKIPKSAR---------NSVQIPAIAAAN 62
           +++ GGG+      R   +K  T   + +    +P+            NS Q   +   +
Sbjct: 2   ALVGGGGDGSREKHRWAESKVYTRKAHNKGSKNVPQQPSSQTLAPEDGNSSQQQLLTRFD 61

Query: 63  GGGDPSSPSHYPIDALVTSRDSSGQNRYIEQVNADEVPGYTRFENRVRICLNSRSRSEIK 122
              D SS  +    A+  SRD    N  +        P + R ENR+ I L+SRS+ E++
Sbjct: 62  AASDDSSSLNRRQVAVPNSRDPPAGNGSVR-------PAFPRLENRITINLSSRSKQEMR 121

Query: 123 ELTTKLKGELGHVRSLVKKFESQELQISGYGGDVGHSQSQFSANNLVEKVGNTWKNDSVV 182
           EL  KL  EL  VRSLVKK E++ELQ++G+ G  G+S SQ SAN+ ++  G   +  S V
Sbjct: 122 ELRRKLVNELDQVRSLVKKLEAKELQLTGFSGG-GYSHSQLSANDAIDN-GGAKRVHSEV 181

Query: 183 GSADVPASRLVQ--SVSVAEN---FGEFAEKEMNKHKNSRYNP-------KTEFPVSDCD 242
            S     SR +   S+SV EN     +  EKE    K ++Y         K +FP  D +
Sbjct: 182 ASVGPHESRPLHQLSISVVENSQGVSDVVEKEKRTPKANQYYRNSDFVLGKEKFPPPDSN 241

Query: 243 ----SNRGKIDPLL-----------------KSCNNLLERLMKHKHGWVFNVPVDAKRLG 302
               SN  K   ++                 KSC+NLL +LMKHKHGWVFN PVD K LG
Sbjct: 242 KKSKSNSSKKHGVVGDGEYGFVMDKHTAQAFKSCSNLLAKLMKHKHGWVFNTPVDVKGLG 301

Query: 303 LHDYHKIITKPMDLGTVKMRLNKNWYKSPREFAEDVRLTFSNAITYNPKGEDVHIMAEQL 362
           LHDY+ II  PMDLGTVK RLNKNWYKSPREFAEDVRLTF NA+TYNPKG+DVHIMAEQL
Sbjct: 302 LHDYYSIIKHPMDLGTVKSRLNKNWYKSPREFAEDVRLTFRNAMTYNPKGQDVHIMAEQL 361

Query: 363 SNVFEEKWRIIEA------KQDVGKDDG-----SRKSPALATPPVES--RTFSKSKSTTK 422
           + +FEEKW +++A      + ++  D G     SRK P    PP+    RT  +S+STT 
Sbjct: 362 AKIFEEKWAVLQAEHNLDSRYEMDHDMGLPTPTSRKVPPSLPPPLTDMRRTLDRSESTTH 421

Query: 423 PPPANRESLGKSDSITKPANVPDKKPNAKNHGNRDMNYEEKQKLSIDLQDLPSDELNNVV 482
           P     +    + +   PA    KKP AK+   RDM YEEKQ+LS +LQ LPS++L+N+V
Sbjct: 422 PIDPKMKPAAFTPTGRTPA---PKKPKAKDPFKRDMTYEEKQRLSTNLQSLPSEKLDNIV 481

Query: 483 KIIKKRNQGLFQNDDEIELDIGSVDSKTLWELERFVADYKNSLITNKRKVDADLQSSRYS 513
           +IIKKRN  L Q+DDEIE+DI SVD++TLWEL+RFV +YK SL  NKRK +   + +  +
Sbjct: 482 QIIKKRNSSLCQHDDEIEVDIDSVDAETLWELDRFVTNYKKSLSKNKRKAEIAAELAMQA 541

BLAST of CmaCh04G005650 vs. NCBI nr
Match: gi|590714910|ref|XP_007050048.1| (Global transcription factor group E4, putative isoform 1 [Theobroma cacao])

HSP 1 Score: 335.9 bits (860), Expect = 1.3e-88
Identity = 217/473 (45.88%), Positives = 288/473 (60.89%), Query Frame = 1

Query: 55  DPSSPSHYPIDALVT--SRDSSGQNRYIEQVNADEVPGYTRFENRVRICLNSRSRSEIKE 114
           D +S    P+  + T  S DSS  N++  QV A      +  ENRV+I L SRS+ E+++
Sbjct: 123 DMNSAHQQPVPYVDTAVSDDSSNLNKH--QVVASNGAVKSSSENRVKINLASRSKQEMRD 182

Query: 115 LTTKLKGELGHVRSLVKKFESQELQISGYGGDVGHSQSQFSANNLVEKVGNTWKNDSVVG 174
           L  KL+ EL  VR+LVK+ E++E QISG+      S S+   N+ V+      +  S V 
Sbjct: 183 LRRKLESELDLVRNLVKRIEAKEGQISGF------SNSRLLLNDSVDY--GLKRVQSEVA 242

Query: 175 SADVPASRLVQS-------VSVAENF--GEFAEKEMNKHKNSRYNPKTEFPVSD-----C 234
           SA +P   + QS       +SV EN    E  EKE    K +++   +EF ++       
Sbjct: 243 SAGIPQEPVRQSRPLNQLSISVLENSQGNENLEKEKRTPKANQFYRNSEFLLAKDKFPPA 302

Query: 235 DSNR------------------GKIDPLLKSCNNLLERLMKHKHGWVFNVPVDAKRLGLH 294
           +SN+                  G  +   KSC++LLERLMKHKHGWVFN PVD K LGLH
Sbjct: 303 ESNKKSKLNGKKAGGGEFTHGFGMGNKFFKSCSSLLERLMKHKHGWVFNAPVDVKGLGLH 362

Query: 295 DYHKIITKPMDLGTVKMRLNKNWYKSPREFAEDVRLTFSNAITYNPKGEDVHIMAEQLSN 354
           DY+ II  PMDLGTVK RLNKNWYKSPREFAEDVRLTF NA+TYNPKG+DVH+MAEQLS 
Sbjct: 363 DYYSIIKHPMDLGTVKSRLNKNWYKSPREFAEDVRLTFRNAMTYNPKGQDVHVMAEQLSK 422

Query: 355 VFEEKWRIIEA----------KQDVGKDDGS-RKSPALATPPVE-SRTFSKSKSTTKPPP 414
           +FE+KW +IE           + +V     + RK+  +  PP++  R   +S+S  +P  
Sbjct: 423 IFEDKWAVIETDYIREMRLAIEYEVSLPTPTPRKAHPMLPPPLDMRRILDRSESMIRPVD 482

Query: 415 ANRESLGKSDSITKPANVPDKKPNAKNHGNRDMNYEEKQKLSIDLQDLPSDELNNVVKII 474
              + +  + S   PA    KKP AK+   RDM YEEKQKLS +LQ LPS++L+N+V+II
Sbjct: 483 MRPKLIATTPSSRTPA---PKKPKAKDPYKRDMTYEEKQKLSTNLQSLPSEKLDNIVQII 542

Query: 475 KKRNQGLFQNDDEIELDIGSVDSKTLWELERFVADYKNSLITNKRKVDADLQS 482
           KKRN  LFQ+DDEIE+DI SVD++TLWEL+RFV +YK SL  NKRK +  +Q+
Sbjct: 543 KKRNSALFQHDDEIEVDIDSVDTETLWELDRFVTNYKKSLSKNKRKAELAIQA 582
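
The percentages in each HSP summary line above are the ratio of matching columns to alignment length (for example, 131/266 for the AT1G17790.1 hit). As a minimal sketch, not part of the original pipeline, the following Python helper re-derives those percentages from a summary line; the function name `check_hsp_line` and the tolerance value are hypothetical and chosen only for illustration.

```python
import re

# Matches fields such as "Identity = 131/266 (49.25%)" in an HSP summary line.
# The label is captured generically, so any "<label> = n/d (p%)" field is checked.
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def check_hsp_line(line, tol=0.01):
    """Return {label: (matches, length, reported_pct, recomputed_pct, ok)}.

    `tol` is an assumed tolerance (in percentage points) for rounding.
    """
    out = {}
    for label, num, den, pct in FIELD.findall(line):
        n, d, reported = int(num), int(den), float(pct)
        recomputed = round(100.0 * n / d, 2)
        out[label] = (n, d, reported, recomputed, abs(recomputed - reported) <= tol)
    return out

if __name__ == "__main__":
    # Identity fields copied from two of the hits listed above.
    for line in ("Identity = 131/266 (49.25%), Query Frame = 1",
                 "Identity = 411/544 (75.55%), Query Frame = 1"):
        print(check_hsp_line(line))
```

Fields without an `n/d (p%)` pattern, such as `Query Frame = 1`, are ignored by the regular expression, so a full summary line can be passed unchanged.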

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
GTE4_ARATH	1.1e-68	39.17	Transcription factor GTE4 OS=Arabidopsis thaliana GN=GTE4 PE=2 SV=1
GTE3_ARATH	1.7e-61	51.28	Transcription factor GTE3, chloroplastic OS=Arabidopsis thaliana GN=GTE3 PE=1 SV...
GTE5_ARATH	1.4e-55	49.25	Transcription factor GTE5, chloroplastic OS=Arabidopsis thaliana GN=GTE5 PE=1 SV...
GTE10_ARATH	2.6e-33	32.54	Transcription factor GTE10 OS=Arabidopsis thaliana GN=GTE10 PE=1 SV=2
GTE8_ARATH	1.3e-32	29.80	Transcription factor GTE8 OS=Arabidopsis thaliana GN=GTE8 PE=2 SV=2

Match Name	E-value	Identity	Description
A0A0A0KYZ1_CUCSA	1.0e-209	75.55	Uncharacterized protein OS=Cucumis sativus GN=Csa_4G083710 PE=4 SV=1
A0A061DNG1_THECC	8.9e-89	45.88	Global transcription factor group E4, putative isoform 1 OS=Theobroma cacao GN=T...
A0A061DW54_THECC	8.9e-89	45.88	Global transcription factor group E4, putative isoform 2 OS=Theobroma cacao GN=T...
A0A151SES9_CAJCA	2.0e-88	44.68	Bromodomain-containing protein 4 OS=Cajanus cajan GN=KK1_024682 PE=4 SV=1
K7M2K1_SOYBN	4.4e-88	44.32	Uncharacterized protein OS=Glycine max GN=GLYMA_13G292200 PE=4 SV=1

Match Name	E-value	Identity	Description
AT1G06230.1	6.3e-70	39.17	global transcription factor group E4
AT1G73150.1	9.8e-63	51.28	global transcription factor group E3
AT1G17790.1	8.1e-57	49.25	DNA-binding bromodomain-containing protein
AT5G63320.1	1.5e-34	32.54	nuclear protein X1
AT3G27260.1	7.3e-34	29.80	global transcription factor group E8

Match Name	E-value	Identity	Description
gi|449439059|ref|XP_004137305.1|	1.5e-209	75.55	PREDICTED: transcription factor GTE3, chloroplastic [Cucumis sativus]
gi|659101628|ref|XP_008451707.1|	4.0e-207	75.28	PREDICTED: transcription factor GTE4 [Cucumis melo]
gi|1009161953|ref|XP_015899173.1|	3.5e-94	46.02	PREDICTED: transcription factor GTE3, chloroplastic [Ziziphus jujuba]
gi|720074605|ref|XP_010279073.1|	1.6e-91	41.62	PREDICTED: transcription factor GTE4 [Nelumbo nucifera]
gi|590714910|ref|XP_007050048.1|	1.3e-88	45.88	Global transcription factor group E4, putative isoform 1 [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR001487	Bromodomain
IPR027353	NET_dom
Vocabulary: Molecular Function
Term	Definition
GO:0005515	protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0008150	biological_process
cellular_component	GO:0005575	cellular_component
molecular_function	GO:0005515	protein binding
molecular_function	GO:0003674	molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CmaCh04G005650.1	CmaCh04G005650.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001487	Bromodomain	PRINTS	PR00503	BROMODOMAIN	coord: 243..256, score: 5.5E-18; coord: 275..293, score: 5.5E-18; coord: 259..275, score: 5.5E-18; coord: 293..312, score: 5.5
IPR001487	Bromodomain	GENE3D	G3DSA:1.20.920.10		coord: 221..336, score: 9.8
IPR001487	Bromodomain	PFAM	PF00439	Bromodomain	coord: 232..316, score: 2.8
IPR001487	Bromodomain	SMART	SM00297	bromo_6	coord: 221..331, score: 1.1
IPR001487	Bromodomain	PROFILE	PS50014	BROMODOMAIN_2	coord: 240..312, score: 18
IPR001487	Bromodomain	unknown	SSF47370	Bromodomain	coord: 219..332, score: 3.53
IPR027353	NET domain	PFAM	PF17035	BET	coord: 401..462, score: 6.7
IPR027353	NET domain	PROFILE	PS51525	NET	coord: 391..472, score: 18
None	No IPR available	PANTHER	PTHR22880	FALZ-RELATED BROMODOMAIN-CONTAINING PROTEINS	coord: 228..489, score: 8.8E
None	No IPR available	PANTHER	PTHR22880:SF172	TRANSCRIPTION FACTOR GTE3, CHLOROPLASTIC-RELATED	coord: 228..489, score: 8.8E