CmaCh03G008210 (gene) Cucurbita maxima (Rimu)

Name: CmaCh03G008210
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: HD domain class transcription factor
Location: Cma_Chr03 : 6250002 .. 6252188 (-)
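
As a rough illustration of how the Location field can be read, the sketch below computes the span of the feature and shows a reverse-complement helper for working with a minus-strand gene. It assumes the coordinates are 1-based and inclusive, and that a minus-strand feature is displayed as the reverse complement of the chromosome sequence; neither convention is stated on this page. The function names are hypothetical.

```python
# Coordinates copied from the Location field above.
# Assumption: 1-based, inclusive coordinates (not stated on the page).
start, end, strand = 6250002, 6252188, "-"

def span_length(start, end):
    """Length of a 1-based, inclusive genomic interval."""
    return end - start + 1

# Translation table for complementing DNA bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]

gene_length = span_length(start, end)
print(gene_length)                      # length of the genomic span in bp
print(reverse_complement("GAATTTGAG"))  # first 9 nt of the gene sequence
```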
The following sequences are available for this feature:

Gene sequence (with intron)

GAATTTGAGATGAATTTGGTTCTGGCAAACCGCAGTGAACCCAACCCTAGTTAGATTCCAACCCTACTTTCTGACACGAGTTGGATTATTTAAAGACAGAGAAAGTATTAGACTAATCGTAATTGAGCTGCACACTCCCCCTCTCCTCCTCCTCCTCCTCCTCCTCCTATTTCATTTTCTAGAGCTTCTCCCTCAACCCACGATCTCTAATAATGCAGTAGAGAGAGAAACAAGAGAGAGAACGAGAGAAAAATATCATTTTGGGCAATCGATCATCATCATCATCTTCTTCTTCTTCTGCGGGCTGGGTTGGATTTTGATATTTTTGTTTCCATTTTGAGGATCGATTGGAATTGGGTTTTGTTTTCTGGACTTTTCTTTTGATGGGTTCAGAGAAGGAAGGAGAATCGAAGACCAAGAAAGATCCAGGGGAAACAGGGGAAGTAACTGGGAACTTGTACAGAGTCGGCCTTGTTTCACATCTGGAACGTCCTCTGCTTCTTCCTACAACTCTATTCCTTTCTGCTACTTTGAACCACTCTTTCAGTTCCTAACAATCCCTCGCTACGCTTCTAACAAATCACGCCTCCAAGCTTTTAGGGTTCACTTGAATACAAGGATCTCATCATATTTTGGGAGCTGTTTTTTTTAATCTCTTCCGTCATGAAGAGACATGGCAGCTCAGATTCGTTGGGCGCTCTGATGTCTGTGTGTCCTACTTCAGGTTTAAATATAATCAACAATTTTGTTTTATATATCTTCATTTCTTTATTCTTCTTCTGCTGCTTAATTTCCATTTTCCCGACACCCCCATGTGAATTTTTCTTTTCTTTTCTTTTCTTTCTTGTCTCTTGTTAAAACCCAATAACTCGCAGGAATCTTTACGAAACACGCTCTTGCTTGCAGATGAACAGAGTCCGAGGAACAGCCATGTTTATGGCAGGGAATTTCAGTCGATGTTGGACGGGTTGGACGAAGAAGGCTCCATGGAAGAACAGTGTCATGTGGGAGAGAAGAAAAGGAGACTCAGTGTTGATCAAGTTAAGGCCTTGGAGAAAACATTCGAGATTGAAAACAAGCTGGAACCAGATAGGAAAGTGAAGCTTGCTCTAGAACTTGGCCTGCAGCCAAGGCAAGTTGCTGTATGGTTCCAAAACCGTAGAGCCCGATGGAAAACTAAGCAACTGGAGAGAGATTATGGGCTTCTGAAAGCCAATTACGAAACTCTCAAGCGTAGTTTCGATACCCTTCAGAAGGACAATGATGGTCTGCTCAAAGAGGTCGATAATTGTAAAATCCCAGAGAGTTTTTGTTCTAAATGTTGAAGTTTTGTTTTTCCCCCAATTCACCTGTCTCTGTTTTATAGATTAAAGAATTGAAATTGAAGCTCGAGGAAGAGAAGACAGAGAGGCATTTATCAGTGAAGGAAGAGATTTTCGTGCCAGAATCGGGCAATTTACTTATTGAGCAGACGAACAATCATCTTTCGGTGGATCATGTTCCTCTTCCTGTTTCTTCGGATCATTCCGACGACTTCAACTACGAGAGCTTCAGAATAGCAAGCGCCGAGGATGGGGACGAACAAAGAGTTAAAGTATCGTTGTTCCCTGATTTTAAAGATGGGTCATCGGACAGCGACTCAAGCGCCATATTAAACGAAGACAACAGCCCAAACGCCGTCGTATCGTCCACGGCCGCCGGGATGCTCCAAAGCAAACAGCAAATTCTGTCGTCTCCGACGTCGTCTTTGAACTGGTTCCCGTATCAAAAGGCAGCAGCTTATAATAATGCACAACAATACGTGAAAATCGAAGAGTACAATTTCTTCAGCGGAGAAGAGAGCTGTGATTTGTTCTCCGATGAACAAGCGCCGTCTATGCATTGGTACTGCCCCGATGAGTGGAACTAAAAGCAATTCCAATCATGAGAGTGAAATCTAGCGTGGGAATTGGCACGGTGGGATTGGTCGGAGAGGGAACAGGTGGCGGCTCAGAGTGTTG
ATTGGAGTGAAGCTGTAAAGATTGATGGTGGGTGGGATGATTAGAAAGAACAAATGTTGTAATGAACAAATAATGATTCCCTTGGGCTTTTTCCCCCCTTTCATTTCTAAACCATTTTTATGCCCTAGATTCCTATAATATTTGTACTTCACTAAATTAAATCCATCTTTGGTGAATATTTTGAAAA

mRNA sequence

GAATTTGAGATGAATTTGGTTCTGGCAAACCGCAGTGAACCCAACCCTAGTTAGATTCCAACCCTACTTTCTGACACGAGTTGGATTATTTAAAGACAGAGAAAGTATTAGACTAATCGTAATTGAGCTGCACACTCCCCCTCTCCTCCTCCTCCTCCTCCTCCTCCTATTTCATTTTCTAGAGCTTCTCCCTCAACCCACGATCTCTAATAATGCAGTAGAGAGAGAAACAAGAGAGAGAACGAGAGAAAAATATCATTTTGGGCAATCGATCATCATCATCATCTTCTTCTTCTTCTGCGGGCTGGGTTGGATTTTGATATTTTTGTTTCCATTTTGAGGATCGATTGGAATTGGGTTTTGTTTTCTGGACTTTTCTTTTGATGGGTTCAGAGAAGGAAGGAGAATCGAAGACCAAGAAAGATCCAGGGGAAACAGGGGAAGTAACTGGGAACTTGTACAGAGTCGGCCTTGTTTCACATCTGGAACGTCCTCTGCTTCTTCCTACAACTCTATTCCTTTCTGCTACTTTGAACCACTCTTTCAGTTCCTAACAATCCCTCGCTACGCTTCTAACAAATCACGCCTCCAAGCTTTTAGGGTTCACTTGAATACAAGGATCTCATCATATTTTGGGAGCTGTTTTTTTTAATCTCTTCCGTCATGAAGAGACATGGCAGCTCAGATTCGTTGGGCGCTCTGATGTCTGTGTGTCCTACTTCAGATGAACAGAGTCCGAGGAACAGCCATGTTTATGGCAGGGAATTTCAGTCGATGTTGGACGGGTTGGACGAAGAAGGCTCCATGGAAGAACAGTGTCATGTGGGAGAGAAGAAAAGGAGACTCAGTGTTGATCAAGTTAAGGCCTTGGAGAAAACATTCGAGATTGAAAACAAGCTGGAACCAGATAGGAAAGTGAAGCTTGCTCTAGAACTTGGCCTGCAGCCAAGGCAAGTTGCTGTATGGTTCCAAAACCGTAGAGCCCGATGGAAAACTAAGCAACTGGAGAGAGATTATGGGCTTCTGAAAGCCAATTACGAAACTCTCAAGCGTAGTTTCGATACCCTTCAGAAGGACAATGATGGTCTGCTCAAAGAGATTAAAGAATTGAAATTGAAGCTCGAGGAAGAGAAGACAGAGAGGCATTTATCAGTGAAGGAAGAGATTTTCGTGCCAGAATCGGGCAATTTACTTATTGAGCAGACGAACAATCATCTTTCGGTGGATCATGTTCCTCTTCCTGTTTCTTCGGATCATTCCGACGACTTCAACTACGAGAGCTTCAGAATAGCAAGCGCCGAGGATGGGGACGAACAAAGAGTTAAAGTATCGTTGTTCCCTGATTTTAAAGATGGGTCATCGGACAGCGACTCAAGCGCCATATTAAACGAAGACAACAGCCCAAACGCCGTCGTATCGTCCACGGCCGCCGGGATGCTCCAAAGCAAACAGCAAATTCTGTCGTCTCCGACGTCGTCTTTGAACTGGTTCCCGTATCAAAAGGCAGCAGCTTATAATAATGCACAACAATACGTGAAAATCGAAGAGTACAATTTCTTCAGCGGAGAAGAGAGCTGTGATTTGTTCTCCGATGAACAAGCGCCGTCTATGCATTGGTACTGCCCCGATGAGTGGAACTAAAAGCAATTCCAATCATGAGAGTGAAATCTAGCGTGGGAATTGGCACGGTGGGATTGGTCGGAGAGGGAACAGGTGGCGGCTCAGAGTGTTGATTGGAGTGAAGCTGTAAAGATTGATGGTGGGTGGGATGATTAGAAAGAACAAATGTTGTAATGAACAAATAATGATTCCCTTGGGCTTTTTCCCCCCTTTCATTTCTAAACCATTTTTATGCCCTAGATTCCTATAATATTTGTACTTCACTAAATTAAATCCATCTTTGGTGAATATTTTGAAAA

Coding sequence (CDS)

ATGAAGAGACATGGCAGCTCAGATTCGTTGGGCGCTCTGATGTCTGTGTGTCCTACTTCAGATGAACAGAGTCCGAGGAACAGCCATGTTTATGGCAGGGAATTTCAGTCGATGTTGGACGGGTTGGACGAAGAAGGCTCCATGGAAGAACAGTGTCATGTGGGAGAGAAGAAAAGGAGACTCAGTGTTGATCAAGTTAAGGCCTTGGAGAAAACATTCGAGATTGAAAACAAGCTGGAACCAGATAGGAAAGTGAAGCTTGCTCTAGAACTTGGCCTGCAGCCAAGGCAAGTTGCTGTATGGTTCCAAAACCGTAGAGCCCGATGGAAAACTAAGCAACTGGAGAGAGATTATGGGCTTCTGAAAGCCAATTACGAAACTCTCAAGCGTAGTTTCGATACCCTTCAGAAGGACAATGATGGTCTGCTCAAAGAGATTAAAGAATTGAAATTGAAGCTCGAGGAAGAGAAGACAGAGAGGCATTTATCAGTGAAGGAAGAGATTTTCGTGCCAGAATCGGGCAATTTACTTATTGAGCAGACGAACAATCATCTTTCGGTGGATCATGTTCCTCTTCCTGTTTCTTCGGATCATTCCGACGACTTCAACTACGAGAGCTTCAGAATAGCAAGCGCCGAGGATGGGGACGAACAAAGAGTTAAAGTATCGTTGTTCCCTGATTTTAAAGATGGGTCATCGGACAGCGACTCAAGCGCCATATTAAACGAAGACAACAGCCCAAACGCCGTCGTATCGTCCACGGCCGCCGGGATGCTCCAAAGCAAACAGCAAATTCTGTCGTCTCCGACGTCGTCTTTGAACTGGTTCCCGTATCAAAAGGCAGCAGCTTATAATAATGCACAACAATACGTGAAAATCGAAGAGTACAATTTCTTCAGCGGAGAAGAGAGCTGTGATTTGTTCTCCGATGAACAAGCGCCGTCTATGCATTGGTACTGCCCCGATGAGTGGAACTAA

Protein sequence

MKRHGSSDSLGALMSVCPTSDEQSPRNSHVYGREFQSMLDGLDEEGSMEEQCHVGEKKRRLSVDQVKALEKTFEIENKLEPDRKVKLALELGLQPRQVAVWFQNRRARWKTKQLERDYGLLKANYETLKRSFDTLQKDNDGLLKEIKELKLKLEEEKTERHLSVKEEIFVPESGNLLIEQTNNHLSVDHVPLPVSSDHSDDFNYESFRIASAEDGDEQRVKVSLFPDFKDGSSDSDSSAILNEDNSPNAVVSSTAAGMLQSKQQILSSPTSSLNWFPYQKAAAYNNAQQYVKIEEYNFFSGEESCDLFSDEQAPSMHWYCPDEWN
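
The relationship between the coding sequence and the protein sequence above can be checked with a short translation sketch. It uses the standard genetic code and stops at the first stop codon; the function name `translate` and the short test fragments are illustrative choices, with the fragments copied from the start and end of the CDS shown above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
_BASES = "TCAG"
_AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
          "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)

# First 27 nt and last 15 nt of the CDS listed above.
cds_start = "ATGAAGAGACATGGCAGCTCAGATTCG"
cds_end = "GATGAGTGGAACTAA"
print(translate(cds_start))
print(translate(cds_end))
```

Passing the full CDS string to `translate` in the same way should reproduce the protein sequence listed above, provided the CDS is in frame from its first base.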
BLAST of CmaCh03G008210 vs. Swiss-Prot
Match: ATHB6_ARATH (Homeobox-leucine zipper protein ATHB-6 OS=Arabidopsis thaliana GN=ATHB-6 PE=1 SV=1)

HSP 1 Score: 251.9 bits (642), Expect = 9.4e-66
Identity = 165/343 (48.10%), Positives = 209/343 (60.93%), Query Frame = 1

Query: 1   MKRHGSSDSLGALMSVCPTS--DEQSPRNSHVYGREFQSMLDGLDEEGS--MEEQCHVG- 60
           MKR  SSDS+G L+S+CPT+  DEQSPR     GREFQSML+G +EE    +EE+ HVG 
Sbjct: 2   MKRLSSSDSVGGLISLCPTTSTDEQSPRRYG--GREFQSMLEGYEEEEEAIVEERGHVGL 61

Query: 61  -EKKRRLSVDQVKALEKTFEIENKLEPDRKVKLALELGLQPRQVAVWFQNRRARWKTKQL 120
            EKKRRLS++QVKALEK FE+ENKLEP+RKVKLA ELGLQPRQVAVWFQNRRARWKTKQL
Sbjct: 62  SEKKRRLSINQVKALEKNFELENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQL 121

Query: 121 ERDYGLLKANYETLKRSFDTLQKDNDGLLKEIKELKLKL-------EEEKTERHLSVKEE 180
           E+DYG+LK  Y++L+ +FD+L++DN+ LL+EI +LK KL       EEE+    ++ + +
Sbjct: 122 EKDYGVLKTQYDSLRHNFDSLRRDNESLLQEISKLKTKLNGGGGEEEEEENNAAVTTESD 181

Query: 181 IFVPESGNLLIEQTNNHLSVDHVPLPVSSDHSDDFNYESFRIASAEDGDEQRVKVSLFPD 240
           I V E    L E+     S      P   +HSD  NY SF      D    +   S F  
Sbjct: 182 ISVKEEEVSLPEKITEAPSSP----PQFLEHSDGLNYRSF--TDLRDLLPLKAAASSFAA 241

Query: 241 FKDGSSDSDSSAILNEDNSPNAVVSSTAAGMLQSKQQILSSPTSSLNWFPYQKAAAYNNA 300
               S  SDSSA+LNE++S N  V++                                N 
Sbjct: 242 AAGSSDSSDSSALLNEESSSNVTVAAPV-------------------------TVPGGNF 301

Query: 301 QQYVKIEE----YNFFSGEESCDLFSDEQAPSMHWYCP-DEWN 326
            Q+VK+E+     +F SGEE+C+ FSDEQ PS+HWY   D WN
Sbjct: 302 FQFVKMEQTEDHEDFLSGEEACEFFSDEQPPSLHWYSTVDHWN 311
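
The percentages in the HSP header are simple ratios over the alignment length (including gap columns). A minimal sketch that reproduces the figures reported for the ATHB6_ARATH hit; the helper name `percent` is hypothetical.

```python
def percent(matches, alignment_length):
    """Format matches / alignment_length as a percentage with two decimals."""
    return f"{100.0 * matches / alignment_length:.2f}"

# Counts copied from the ATHB6_ARATH HSP header above.
identity = percent(165, 343)
positives = percent(209, 343)
print(identity, positives)
```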

BLAST of CmaCh03G008210 vs. Swiss-Prot
Match: ATB16_ARATH (Homeobox-leucine zipper protein ATHB-16 OS=Arabidopsis thaliana GN=ATHB-16 PE=2 SV=2)

HSP 1 Score: 213.4 bits (542), Expect = 3.7e-54
Identity = 151/338 (44.67%), Positives = 200/338 (59.17%), Query Frame = 1

Query: 1   MKRHGSSDSLGALMSVCPTSDEQSPRNSHVYGREFQSMLDGLDEEGSMEEQC-----HVG 60
           MKR  SSDS+  L+S   ++DEQSPR    YG  +QSML+G DE+ ++ E+      H+G
Sbjct: 1   MKRLSSSDSMCGLIST--STDEQSPRG---YGSNYQSMLEGYDEDATLIEEYSGNHHHMG 60

Query: 61  --EKKRRLSVDQVKALEKTFEIENKLEPDRKVKLALELGLQPRQVAVWFQNRRARWKTKQ 120
             EKKRRL VDQVKALEK FE+ENKLEP+RK KLA ELGLQPRQVAVWFQNRRARWKTKQ
Sbjct: 61  LSEKKRRLKVDQVKALEKNFELENKLEPERKTKLAQELGLQPRQVAVWFQNRRARWKTKQ 120

Query: 121 LERDYGLLKANYETLKRSFDTLQKDNDGLLKEIKELKLKLE-EEKTERHLSVKEEIFVPE 180
           LE+DYG+LK  Y++L+ +FD+L++DND LL+EI ++K K+  EE    + ++ E +   E
Sbjct: 121 LEKDYGVLKGQYDSLRHNFDSLRRDNDSLLQEISKIKAKVNGEEDNNNNKAITEGVKEEE 180

Query: 181 SGNLLIEQTNNHLSVDHVPLPVSSDHSDDFNYESFRIASAEDGDEQRVKVSLFPDFKDGS 240
                + +T+   S+   PL    +HS  FNY            + R  +      + GS
Sbjct: 181 -----VHKTD---SIPSSPLQF-LEHSSGFNYR-------RSFTDLRDLLPNSTVVEAGS 240

Query: 241 SDS-DSSAILNEDNSPNAVVSSTAAGMLQSKQQILSSPTSSLNWFPYQKAAAYNNAQQYV 300
           SDS DSSA+LN++ S              S    L+ P +              +  Q+V
Sbjct: 241 SDSCDSSAVLNDETS--------------SDNGRLTPPVT----------VTGGSFLQFV 293

Query: 301 KIEE----YNFFSGEESCDLFSDEQAPSMHWY-CPDEW 325
           K E+     +F SGEE+C  FSDEQ PS+HWY   D W
Sbjct: 301 KTEQTEDHEDFLSGEEACGFFSDEQPPSLHWYSASDHW 293

BLAST of CmaCh03G008210 vs. Swiss-Prot
Match: ATHB5_ARATH (Homeobox-leucine zipper protein ATHB-5 OS=Arabidopsis thaliana GN=ATHB-5 PE=1 SV=1)

HSP 1 Score: 213.4 bits (542), Expect = 3.7e-54
Identity = 153/345 (44.35%), Positives = 208/345 (60.29%), Query Frame = 1

Query: 1   MKR-HGSSDSLGALMSVC-PTSDEQ-SPRNS-----HVYGREFQSMLDGLDEEGSMEEQC 60
           MKR  GSSDSL   + +   T+D+Q SPR +     +    ++  M D L+++GS+E+  
Sbjct: 1   MKRSRGSSDSLSGFLPIRHSTTDKQISPRPTTTGFLYSGAGDYSQMFDALEDDGSLEDLG 60

Query: 61  HVG-------EKKRRLSVDQVKALEKTFEIENKLEPDRKVKLALELGLQPRQVAVWFQNR 120
            VG       EKKRRL V+QVKALEK FEI+NKLEP+RKVKLA ELGLQPRQVA+WFQNR
Sbjct: 61  GVGHASSTAAEKKRRLGVEQVKALEKNFEIDNKLEPERKVKLAQELGLQPRQVAIWFQNR 120

Query: 121 RARWKTKQLERDYGLLKANYETLKRSFDTLQKDNDGLLKEIKELKLKLEEEKTERHLSVK 180
           RARWKTKQLERDYG+LK+N++ LKR+ D+LQ+DND LL +IKELK KL  E  +      
Sbjct: 121 RARWKTKQLERDYGVLKSNFDALKRNRDSLQRDNDSLLGQIKELKAKLNVEGVKG----- 180

Query: 181 EEIFVPESGNLLIEQTNNHLSVDHVPLPVSSDHSDDFNYESFRIASAEDGDEQRVKVSLF 240
               + E+G L   + N  +  ++  L +S        +      ++E   E     S+F
Sbjct: 181 ----IEENGALKAVEANQSVMANNEVLELSHRSPSPPPHIPTDAPTSELAFEM---FSIF 240

Query: 241 P---DFKDGSSD-SDSSAILNEDNSPNAVVSSTAAGMLQSKQQILSSPTSSLNWFPYQKA 300
           P   +F+D  +D SDSSA+LNE+ SPN V    AAG + +     +   S++  F     
Sbjct: 241 PRTENFRDDPADSSDSSAVLNEEYSPNTV---EAAGAVAA----TTVEMSTMGCF----- 300

Query: 301 AAYNNAQQYVKIEEY-NFFSGEESCDLFSDEQAPSMHWYCPDEWN 326
                  Q+VK+EE+ + FSGEE+C LF+D +     WYC D+WN
Sbjct: 301 ------SQFVKMEEHEDLFSGEEACKLFADNE----QWYCSDQWN 311

BLAST of CmaCh03G008210 vs. Swiss-Prot
Match: HOX4_ORYSI (Homeobox-leucine zipper protein HOX4 OS=Oryza sativa subsp. indica GN=HOX4 PE=1 SV=1)

HSP 1 Score: 154.8 bits (390), Expect = 1.6e-36
Identity = 83/132 (62.88%), Positives = 103/132 (78.03%), Query Frame = 1

Query: 41  GLDEEGSMEEQCHV----GEKKRRLSVDQVKALEKTFEIENKLEPDRKVKLALELGLQPR 100
           G++ EG +EE+       GEKKRRLSV+QV+ALE++FE+ENKLEP+RK +LA +LGLQPR
Sbjct: 31  GMEAEGDVEEEMMACGGGGEKKRRLSVEQVRALERSFEVENKLEPERKARLARDLGLQPR 90

Query: 101 QVAVWFQNRRARWKTKQLERDYGLLKANYETLKRSFDTLQKDNDGLLKEIKELKLKL-EE 160
           QVAVWFQNRRARWKTKQLERDY  L+ +Y++L+   D L++D D LL EIKELK KL +E
Sbjct: 91  QVAVWFQNRRARWKTKQLERDYAALRHSYDSLRLDHDALRRDKDALLAEIKELKAKLGDE 150

Query: 161 EKTERHLSVKEE 168
           E      SVKEE
Sbjct: 151 EAAASFTSVKEE 162

BLAST of CmaCh03G008210 vs. Swiss-Prot
Match: HOX4_ORYSJ (Homeobox-leucine zipper protein HOX4 OS=Oryza sativa subsp. japonica GN=HOX4 PE=1 SV=1)

HSP 1 Score: 154.8 bits (390), Expect = 1.6e-36
Identity = 83/132 (62.88%), Positives = 103/132 (78.03%), Query Frame = 1

Query: 41  GLDEEGSMEEQCHV----GEKKRRLSVDQVKALEKTFEIENKLEPDRKVKLALELGLQPR 100
           G++ EG +EE+       GEKKRRLSV+QV+ALE++FE+ENKLEP+RK +LA +LGLQPR
Sbjct: 31  GMEAEGDVEEEMMACGGGGEKKRRLSVEQVRALERSFEVENKLEPERKARLARDLGLQPR 90

Query: 101 QVAVWFQNRRARWKTKQLERDYGLLKANYETLKRSFDTLQKDNDGLLKEIKELKLKL-EE 160
           QVAVWFQNRRARWKTKQLERDY  L+ +Y++L+   D L++D D LL EIKELK KL +E
Sbjct: 91  QVAVWFQNRRARWKTKQLERDYAALRHSYDSLRLDHDALRRDKDALLAEIKELKAKLGDE 150

Query: 161 EKTERHLSVKEE 168
           E      SVKEE
Sbjct: 151 EAAASFTSVKEE 162
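
The Swiss-Prot match lines above carry UniProt-style annotations after the protein name (`OS` organism, `GN` gene name, `PE` protein existence level, `SV` sequence version). A minimal parsing sketch, assuming the description follows that `KEY=value` layout; the function name is hypothetical and the regular expression only handles the keys listed in it.

```python
import re

def parse_uniprot_description(desc):
    """Split a UniProt-style description into its name and KEY=value fields."""
    # re.split with a capture group keeps the keys in the result:
    # [name, key1, value1, key2, value2, ...]
    parts = re.split(r"\s(OS|OX|GN|PE|SV)=", desc)
    fields = {"name": parts[0]}
    fields.update(zip(parts[1::2], parts[2::2]))
    return fields

# Description copied from the ATHB6_ARATH match line.
hit = parse_uniprot_description(
    "Homeobox-leucine zipper protein ATHB-6 "
    "OS=Arabidopsis thaliana GN=ATHB-6 PE=1 SV=1"
)
print(hit)
```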

BLAST of CmaCh03G008210 vs. TrEMBL
Match: A0A0A0KGQ3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G499720 PE=4 SV=1)

HSP 1 Score: 559.7 bits (1441), Expect = 2.4e-156
Identity = 285/325 (87.69%), Positives = 304/325 (93.54%), Query Frame = 1

Query: 1   MKRHGSSDSLGALMSVCPTSDEQSPRNSHVYGREFQSMLDGLDEEGSMEEQCHVGEKKRR 60
           MKRHGSSDSLGALMSVCPTS+EQSPRNSHVYGREFQSMLDGLDEEGS+EE CHVGEKKRR
Sbjct: 1   MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLDGLDEEGSIEEHCHVGEKKRR 60

Query: 61  LSVDQVKALEKTFEIENKLEPDRKVKLALELGLQPRQVAVWFQNRRARWKTKQLERDYGL 120
           LSVDQVKALEKTFEIENKLEP+RKVKLA ELGLQPRQVAVWFQNRRARWKTKQLERDYGL
Sbjct: 61  LSVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGL 120

Query: 121 LKANYETLKRSFDTLQKDNDGLLKEIKELKLKLEEEKTERHLSVKEEIFVPESGNLLIEQ 180
           LKANYE+LKRSFDTLQ+DND LLKEIKELK KLEEEKTE +LSVKEEIFV ES NLLIEQ
Sbjct: 121 LKANYESLKRSFDTLQQDNDALLKEIKELKSKLEEEKTESNLSVKEEIFVSESDNLLIEQ 180

Query: 181 TNNHLSVDHVPLPVSSDHSDDFNYESFRIASAEDGDEQRVKVSLFPDFKDGSSDSDSSAI 240
           T NHL VDH+ LPV+SDHSDDFNYESFR   A+DGD+QRV+VSLF DFKDGSSDSDSSAI
Sbjct: 181 TTNHLPVDHISLPVASDHSDDFNYESFRTVGADDGDDQRVEVSLFTDFKDGSSDSDSSAI 240

Query: 241 LNEDNSPNAVVSSTAAGMLQSKQQILSSPTSSLNWFPYQKAAAYNNAQQYVKIEEYNFFS 300
           LNEDNSPNAVVSS  AGMLQS  QILSSP +SLN +P+QK AAYNNAQQ+VKIEE+NFFS
Sbjct: 241 LNEDNSPNAVVSSATAGMLQSHHQILSSPATSLNCYPFQK-AAYNNAQQFVKIEEHNFFS 300

Query: 301 GEESCDLFSDEQAPSMHWYCPDEWN 326
           GEE+C+LFSDEQAPSMHWYCPD+WN
Sbjct: 301 GEETCNLFSDEQAPSMHWYCPDQWN 324

BLAST of CmaCh03G008210 vs. TrEMBL
Match: A0A061DJ94_THECC (Alanine--glyoxylate aminotransferase 2 isoform 1 OS=Theobroma cacao GN=TCM_001039 PE=4 SV=1)

HSP 1 Score: 390.2 bits (1001), Expect = 2.5e-105
Identity = 220/347 (63.40%), Positives = 257/347 (74.06%), Query Frame = 1

Query: 1   MKRHGSSDSLGALMSVCPTSDEQSPRNSHVYGREFQSMLDGLDEEGSMEEQCHVGEKKRR 60
           MKR GSSDSLGALMS+CPT+DE SPRN+H+Y REFQSMLDGLDEEG +EE  HV EKKRR
Sbjct: 1   MKRLGSSDSLGALMSICPTTDEHSPRNNHIYSREFQSMLDGLDEEGCVEESGHVAEKKRR 60

Query: 61  LSVDQVKALEKTFEIENKLEPDRKVKLALELGLQPRQVAVWFQNRRARWKTKQLERDYGL 120
           LSVDQVKALEK FE+ENKLEP+RKVKLA ELGLQPRQVAVWFQNRRARWKTKQLERDYGL
Sbjct: 61  LSVDQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGL 120

Query: 121 LKANYETLKRSFDTLQKDNDGLLKEIKELKLKLEEEKTERHLSVKEEIFVPESGNLLIEQ 180
           LK +YETLK ++DTLQ DN+ LLKEI+ELK KL  E TE +LSVKEE+ V E+ N  +EQ
Sbjct: 121 LKTSYETLKVNYDTLQHDNEALLKEIRELKAKLNGESTESNLSVKEEVIVHETDNKTLEQ 180

Query: 181 TNNHLSVDHVPLPVS----SDHSDDFNYESFRIASAEDGDEQRVKVSLFPDFKDGSSDSD 240
           +         P PVS    S    + NYESF  +    G       +LFPD KDGSSDSD
Sbjct: 181 SE--------PPPVSSLVTSSEPAELNYESFNNSIGSVG------ATLFPDLKDGSSDSD 240

Query: 241 SSAILNEDN---SP-NAVVSSTAAGMLQSKQQILSSPT--------------SSLNWFPY 300
           SSAILNEDN   SP NA +SS  +G+LQS+Q +L SPT              SS+N F +
Sbjct: 241 SSAILNEDNNNCSPNNAAISS--SGVLQSQQHLLMSPTTTSSLNFNSSSSSPSSMNCFQF 300

Query: 301 QKAAAYNNAQQYVKIEEYNFFSGEESCDLFSDEQAPSMHWYCPDEWN 326
            K + Y  + QYVK+EE+NFFS +E+C+ FSDEQAPS+HWY P++WN
Sbjct: 301 SK-STYQPSHQYVKMEEHNFFSADEACNFFSDEQAPSLHWYSPEQWN 330

BLAST of CmaCh03G008210 vs. TrEMBL
Match: A0A067KD47_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_18664 PE=4 SV=1)

HSP 1 Score: 382.5 bits (981), Expect = 5.2e-103
Identity = 213/334 (63.77%), Positives = 254/334 (76.05%), Query Frame = 1

Query: 1   MKRHGSSDSLGALMSVCPTSDEQSPRNS-HVYGREFQSMLDGLDEEGSMEEQCHVGEKKR 60
           MKR  SSDSLGAL+S+CPTSDE SPRNS HVYGREFQSMLDGLDEE  +EE  HV EKKR
Sbjct: 1   MKRLSSSDSLGALISICPTSDEHSPRNSNHVYGREFQSMLDGLDEEACVEEAGHVSEKKR 60

Query: 61  RLSVDQVKALEKTFEIENKLEPDRKVKLALELGLQPRQVAVWFQNRRARWKTKQLERDYG 120
           RLSVDQVKALEK FE+ENKLEP+RKVKLA ELGLQPRQVAVWFQNRRARWKTKQLERDYG
Sbjct: 61  RLSVDQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYG 120

Query: 121 LLKANYETLKRSFDTLQKDNDGLLKEIKELKLKLEEEKTERHLSVKEEIFVPESGNLLIE 180
           +LKANYETLK ++D LQ DN+ LLKEI+ELK KL+E+  E ++SVKEEI + E+     E
Sbjct: 121 VLKANYETLKVNYDALQHDNEALLKEIRELKAKLDEDNAESNVSVKEEIIIAETDEKGSE 180

Query: 181 QTNNHLSVDHVPLPVSSDHSDDFNYESFRIASAEDGDEQRVKVSLFPDFKDGSSDSDSSA 240
           +         +   ++   + D NYESF I S+ + +   + VSLFPDFKDGSSDSDSSA
Sbjct: 181 EPP-------ILTSIAGSETKDMNYESFNINSS-NSNNGILAVSLFPDFKDGSSDSDSSA 240

Query: 241 ILNEDN-----SPNAVVSSTAAGMLQSKQQILSSPT---SSLNWFPYQKAAAYNNAQQYV 300
           ILNEDN     SPN  +SS+  G+ QS  Q++ SP+   SS + F + K  +Y    Q+V
Sbjct: 241 ILNEDNNNSNNSPNPAISSS--GVPQSHNQLMMSPSRPSSSSSPFQFIKTGSYQT--QFV 300

Query: 301 KIEEYNFFSGEESCDLFSDEQAPSMHWYCPDEWN 326
           K+EE+NFFS EE+C+ FSDEQAPS+ WYCPD+WN
Sbjct: 301 KMEEHNFFSSEEACNFFSDEQAPSLQWYCPDQWN 322

BLAST of CmaCh03G008210 vs. TrEMBL
Match: A0A061DID8_THECC (Alanine--glyoxylate aminotransferase 2 isoform 3 OS=Theobroma cacao GN=TCM_001039 PE=4 SV=1)

HSP 1 Score: 369.4 bits (947), Expect = 4.5e-99
Identity = 208/334 (62.28%), Positives = 245/334 (73.35%), Query Frame = 1

Query: 14  MSVCPTSDEQSPRNSHVYGREFQSMLDGLDEEGSMEEQCHVGEKKRRLSVDQVKALEKTF 73
           MS+CPT+DE SPRN+H+Y REFQSMLDGLDEEG +EE  HV EKKRRLSVDQVKALEK F
Sbjct: 1   MSICPTTDEHSPRNNHIYSREFQSMLDGLDEEGCVEESGHVAEKKRRLSVDQVKALEKNF 60

Query: 74  EIENKLEPDRKVKLALELGLQPRQVAVWFQNRRARWKTKQLERDYGLLKANYETLKRSFD 133
           E+ENKLEP+RKVKLA ELGLQPRQVAVWFQNRRARWKTKQLERDYGLLK +YETLK ++D
Sbjct: 61  EVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGLLKTSYETLKVNYD 120

Query: 134 TLQKDNDGLLKEIKELKLKLEEEKTERHLSVKEEIFVPESGNLLIEQTNNHLSVDHVPLP 193
           TLQ DN+ LLKEI+ELK KL  E TE +LSVKEE+ V E+ N  +EQ+         P P
Sbjct: 121 TLQHDNEALLKEIRELKAKLNGESTESNLSVKEEVIVHETDNKTLEQSE--------PPP 180

Query: 194 VS----SDHSDDFNYESFRIASAEDGDEQRVKVSLFPDFKDGSSDSDSSAILNEDN---S 253
           VS    S    + NYESF  +    G       +LFPD KDGSSDSDSSAILNEDN   S
Sbjct: 181 VSSLVTSSEPAELNYESFNNSIGSVG------ATLFPDLKDGSSDSDSSAILNEDNNNCS 240

Query: 254 P-NAVVSSTAAGMLQSKQQILSSPT--------------SSLNWFPYQKAAAYNNAQQYV 313
           P NA +SS  +G+LQS+Q +L SPT              SS+N F + K + Y  + QYV
Sbjct: 241 PNNAAISS--SGVLQSQQHLLMSPTTTSSLNFNSSSSSPSSMNCFQFSK-STYQPSHQYV 300

Query: 314 KIEEYNFFSGEESCDLFSDEQAPSMHWYCPDEWN 326
           K+EE+NFFS +E+C+ FSDEQAPS+HWY P++WN
Sbjct: 301 KMEEHNFFSADEACNFFSDEQAPSLHWYSPEQWN 317

BLAST of CmaCh03G008210 vs. TrEMBL
Match: M5WIS1_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008318mg PE=4 SV=1)

HSP 1 Score: 365.9 bits (938), Expect = 5.0e-98
Identity = 210/352 (59.66%), Positives = 259/352 (73.58%), Query Frame = 1

Query: 1   MKRHGSSDSLGALMSVCPTSDEQSPRNSHVYGREFQSMLDGLDEEGSMEEQCHVGEKKRR 60
           MKR GSSDSLGA++S+CP+++EQSPRN+HVY R+FQSMLDGLDEEG +EE  HV EKKRR
Sbjct: 1   MKRLGSSDSLGAMISICPSTEEQSPRNNHVYRRDFQSMLDGLDEEGCVEEGGHVSEKKRR 60

Query: 61  LSVDQVKALEKTFEIENKLEPDRKVKLALELGLQPRQVAVWFQNRRARWKTKQLERDYGL 120
           LSV+QVKALEK FE+ENKLEP+RKVKLA ELGLQPRQVAVWFQNRRARWKTKQLERD+G+
Sbjct: 61  LSVEQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDFGV 120

Query: 121 LKANYETLKRSFDTLQKDNDGLLKEIKELKLKLEEEKTE-RHLSVKEEIFVPESGNLLIE 180
           LKANY++LK ++D LQ +N+ L+KEIK+LK KL+EE TE  +LSVKEE        ++ +
Sbjct: 121 LKANYDSLKLNYDNLQHENEALVKEIKQLKSKLQEENTESNNLSVKEE-------QMVAK 180

Query: 181 QTNNHLSVDH------VPLPVSSD----HSDDFNYESFRIASAEDGDEQRVKVSLFPDFK 240
             +N+  VDH       P P+ S      S + N+ESF   +  +G      VSLFPDFK
Sbjct: 181 DQSNYKVVDHELSKSPPPPPLGSSVPATESKELNFESFN--NTNNGAVGLEAVSLFPDFK 240

Query: 241 DGSSDSDSSAILNEDNSPNAVVSSTAAGMLQSKQQILSSP----------------TSSL 300
           DGSSDSDSSAILNEDNSPN  +SS  +GMLQ+  Q++ SP                +SS+
Sbjct: 241 DGSSDSDSSAILNEDNSPNLTISS--SGMLQN-HQLMKSPASTSLKFNCCSSSSPSSSSM 300

Query: 301 NWFPYQKAAAYNNAQQYVKIEEYNFFSGEESCDLFSDEQAPSMHWYCPDEWN 326
           N F +QK   Y+   Q+VKIEE+NFFS EE+C  FSDEQAP++ W CPD+WN
Sbjct: 301 NCFQFQK--TYH--PQFVKIEEHNFFSSEEACSFFSDEQAPTLQWCCPDQWN 336

BLAST of CmaCh03G008210 vs. TAIR10
Match: AT2G22430.1 (AT2G22430.1 homeobox protein 6)

HSP 1 Score: 251.9 bits (642), Expect = 5.3e-67
Identity = 165/343 (48.10%), Positives = 209/343 (60.93%), Query Frame = 1

Query: 1   MKRHGSSDSLGALMSVCPTS--DEQSPRNSHVYGREFQSMLDGLDEEGS--MEEQCHVG- 60
           MKR  SSDS+G L+S+CPT+  DEQSPR     GREFQSML+G +EE    +EE+ HVG 
Sbjct: 2   MKRLSSSDSVGGLISLCPTTSTDEQSPRRYG--GREFQSMLEGYEEEEEAIVEERGHVGL 61

Query: 61  -EKKRRLSVDQVKALEKTFEIENKLEPDRKVKLALELGLQPRQVAVWFQNRRARWKTKQL 120
            EKKRRLS++QVKALEK FE+ENKLEP+RKVKLA ELGLQPRQVAVWFQNRRARWKTKQL
Sbjct: 62  SEKKRRLSINQVKALEKNFELENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQL 121

Query: 121 ERDYGLLKANYETLKRSFDTLQKDNDGLLKEIKELKLKL-------EEEKTERHLSVKEE 180
           E+DYG+LK  Y++L+ +FD+L++DN+ LL+EI +LK KL       EEE+    ++ + +
Sbjct: 122 EKDYGVLKTQYDSLRHNFDSLRRDNESLLQEISKLKTKLNGGGGEEEEEENNAAVTTESD 181

Query: 181 IFVPESGNLLIEQTNNHLSVDHVPLPVSSDHSDDFNYESFRIASAEDGDEQRVKVSLFPD 240
           I V E    L E+     S      P   +HSD  NY SF      D    +   S F  
Sbjct: 182 ISVKEEEVSLPEKITEAPSSP----PQFLEHSDGLNYRSF--TDLRDLLPLKAAASSFAA 241

Query: 241 FKDGSSDSDSSAILNEDNSPNAVVSSTAAGMLQSKQQILSSPTSSLNWFPYQKAAAYNNA 300
               S  SDSSA+LNE++S N  V++                                N 
Sbjct: 242 AAGSSDSSDSSALLNEESSSNVTVAAPV-------------------------TVPGGNF 301

Query: 301 QQYVKIEE----YNFFSGEESCDLFSDEQAPSMHWYCP-DEWN 326
            Q+VK+E+     +F SGEE+C+ FSDEQ PS+HWY   D WN
Sbjct: 302 FQFVKMEQTEDHEDFLSGEEACEFFSDEQPPSLHWYSTVDHWN 311

BLAST of CmaCh03G008210 vs. TAIR10
Match: AT4G40060.1 (AT4G40060.1 homeobox protein 16)

HSP 1 Score: 213.4 bits (542), Expect = 2.1e-55
Identity = 151/338 (44.67%), Positives = 200/338 (59.17%), Query Frame = 1

Query: 1   MKRHGSSDSLGALMSVCPTSDEQSPRNSHVYGREFQSMLDGLDEEGSMEEQC-----HVG 60
           MKR  SSDS+  L+S   ++DEQSPR    YG  +QSML+G DE+ ++ E+      H+G
Sbjct: 1   MKRLSSSDSMCGLIST--STDEQSPRG---YGSNYQSMLEGYDEDATLIEEYSGNHHHMG 60

Query: 61  --EKKRRLSVDQVKALEKTFEIENKLEPDRKVKLALELGLQPRQVAVWFQNRRARWKTKQ 120
             EKKRRL VDQVKALEK FE+ENKLEP+RK KLA ELGLQPRQVAVWFQNRRARWKTKQ
Sbjct: 61  LSEKKRRLKVDQVKALEKNFELENKLEPERKTKLAQELGLQPRQVAVWFQNRRARWKTKQ 120

Query: 121 LERDYGLLKANYETLKRSFDTLQKDNDGLLKEIKELKLKLE-EEKTERHLSVKEEIFVPE 180
           LE+DYG+LK  Y++L+ +FD+L++DND LL+EI ++K K+  EE    + ++ E +   E
Sbjct: 121 LEKDYGVLKGQYDSLRHNFDSLRRDNDSLLQEISKIKAKVNGEEDNNNNKAITEGVKEEE 180

Query: 181 SGNLLIEQTNNHLSVDHVPLPVSSDHSDDFNYESFRIASAEDGDEQRVKVSLFPDFKDGS 240
                + +T+   S+   PL    +HS  FNY            + R  +      + GS
Sbjct: 181 -----VHKTD---SIPSSPLQF-LEHSSGFNYR-------RSFTDLRDLLPNSTVVEAGS 240

Query: 241 SDS-DSSAILNEDNSPNAVVSSTAAGMLQSKQQILSSPTSSLNWFPYQKAAAYNNAQQYV 300
           SDS DSSA+LN++ S              S    L+ P +              +  Q+V
Sbjct: 241 SDSCDSSAVLNDETS--------------SDNGRLTPPVT----------VTGGSFLQFV 293

Query: 301 KIEE----YNFFSGEESCDLFSDEQAPSMHWY-CPDEW 325
           K E+     +F SGEE+C  FSDEQ PS+HWY   D W
Sbjct: 301 KTEQTEDHEDFLSGEEACGFFSDEQPPSLHWYSASDHW 293

BLAST of CmaCh03G008210 vs. TAIR10
Match: AT5G65310.1 (AT5G65310.1 homeobox protein 5)

HSP 1 Score: 213.4 bits (542), Expect = 2.1e-55
Identity = 153/345 (44.35%), Positives = 208/345 (60.29%), Query Frame = 1

Query: 1   MKR-HGSSDSLGALMSVC-PTSDEQ-SPRNS-----HVYGREFQSMLDGLDEEGSMEEQC 60
           MKR  GSSDSL   + +   T+D+Q SPR +     +    ++  M D L+++GS+E+  
Sbjct: 1   MKRSRGSSDSLSGFLPIRHSTTDKQISPRPTTTGFLYSGAGDYSQMFDALEDDGSLEDLG 60

Query: 61  HVG-------EKKRRLSVDQVKALEKTFEIENKLEPDRKVKLALELGLQPRQVAVWFQNR 120
            VG       EKKRRL V+QVKALEK FEI+NKLEP+RKVKLA ELGLQPRQVA+WFQNR
Sbjct: 61  GVGHASSTAAEKKRRLGVEQVKALEKNFEIDNKLEPERKVKLAQELGLQPRQVAIWFQNR 120

Query: 121 RARWKTKQLERDYGLLKANYETLKRSFDTLQKDNDGLLKEIKELKLKLEEEKTERHLSVK 180
           RARWKTKQLERDYG+LK+N++ LKR+ D+LQ+DND LL +IKELK KL  E  +      
Sbjct: 121 RARWKTKQLERDYGVLKSNFDALKRNRDSLQRDNDSLLGQIKELKAKLNVEGVKG----- 180

Query: 181 EEIFVPESGNLLIEQTNNHLSVDHVPLPVSSDHSDDFNYESFRIASAEDGDEQRVKVSLF 240
               + E+G L   + N  +  ++  L +S        +      ++E   E     S+F
Sbjct: 181 ----IEENGALKAVEANQSVMANNEVLELSHRSPSPPPHIPTDAPTSELAFEM---FSIF 240

Query: 241 P---DFKDGSSD-SDSSAILNEDNSPNAVVSSTAAGMLQSKQQILSSPTSSLNWFPYQKA 300
           P   +F+D  +D SDSSA+LNE+ SPN V    AAG + +     +   S++  F     
Sbjct: 241 PRTENFRDDPADSSDSSAVLNEEYSPNTV---EAAGAVAA----TTVEMSTMGCF----- 300

Query: 301 AAYNNAQQYVKIEEY-NFFSGEESCDLFSDEQAPSMHWYCPDEWN 326
                  Q+VK+EE+ + FSGEE+C LF+D +     WYC D+WN
Sbjct: 301 ------SQFVKMEEHEDLFSGEEACKLFADNE----QWYCSDQWN 311

BLAST of CmaCh03G008210 vs. TAIR10
Match: AT3G01470.1 (AT3G01470.1 homeobox 1)

HSP 1 Score: 129.8 bits (325), Expect = 3.1e-30
Identity = 70/122 (57.38%), Positives = 90/122 (73.77%), Query Frame = 1

Query: 33  REFQSMLDGLDEEGSMEEQCHVGEKKRRLSVDQVKALEKTFEIENKLEPDRKVKLALELG 92
           R F S  + L ++   ++Q  + EKKRRL+ +QV  LEK+FE ENKLEP+RK +LA +LG
Sbjct: 46  RPFFSSPEDLYDDDFYDDQ--LPEKKRRLTTEQVHLLEKSFETENKLEPERKTQLAKKLG 105

Query: 93  LQPRQVAVWFQNRRARWKTKQLERDYGLLKANYETLKRSFDTLQKDNDGLLKEIKELKLK 152
           LQPRQVAVWFQNRRARWKTKQLERDY LLK+ Y+ L  ++D++  DND L  E+  L  K
Sbjct: 106 LQPRQVAVWFQNRRARWKTKQLERDYDLLKSTYDQLLSNYDSIVMDNDKLRSEVTSLTEK 165

Query: 153 LE 155
           L+
Sbjct: 166 LQ 165

BLAST of CmaCh03G008210 vs. TAIR10
Match: AT1G69780.1 (AT1G69780.1 Homeobox-leucine zipper protein family)

HSP 1 Score: 125.9 bits (315), Expect = 4.4e-29
Identity = 75/154 (48.70%), Positives = 100/154 (64.94%), Query Frame = 1

Query: 44  EEGSMEEQCHVGEKKRRLSVDQVKALEKTFEIENKLEPDRKVKLALELGLQPRQVAVWFQ 103
           EE   ++   +GEKKRRL+++QVK LEK FE+ NKLEP+RK++LA  LGLQPRQ+A+WFQ
Sbjct: 72  EEDYSDDGSQMGEKKRRLNMEQVKTLEKNFELGNKLEPERKMQLARALGLQPRQIAIWFQ 131

Query: 104 NRRARWKTKQLERDYGLLKANYETLKRSFDTLQKDNDGLLKEIKELKLKLEEEKTERHLS 163
           NRRARWKTKQLE+DY  LK  ++TLK   D LQ  N  L  EI  LK     E+TE   S
Sbjct: 132 NRRARWKTKQLEKDYDTLKRQFDTLKAENDLLQTHNQKLQAEIMGLK---NREQTE---S 191

Query: 164 VKEEIFVPESGNLLIEQTNNHLSVDHVPLPVSSD 198
           +        S +   + ++++L +D    P S+D
Sbjct: 192 INLNKETEGSCSNRSDNSSDNLRLDISTAPPSND 219

BLAST of CmaCh03G008210 vs. NCBI nr
Match: gi|659080027|ref|XP_008440572.1| (PREDICTED: homeobox-leucine zipper protein ATHB-6-like [Cucumis melo])

HSP 1 Score: 560.5 bits (1443), Expect = 2.0e-156
Identity = 285/325 (87.69%), Positives = 304/325 (93.54%), Query Frame = 1

Query: 1   MKRHGSSDSLGALMSVCPTSDEQSPRNSHVYGREFQSMLDGLDEEGSMEEQCHVGEKKRR 60
           MKRHGSSDSLGALMSVCPTS+EQSPRNSHVYGREFQSMLDGLDEEGS+EE CHVGEKKRR
Sbjct: 1   MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLDGLDEEGSIEEHCHVGEKKRR 60

Query: 61  LSVDQVKALEKTFEIENKLEPDRKVKLALELGLQPRQVAVWFQNRRARWKTKQLERDYGL 120
           LSVDQVKALEKTFEIENKLEP+RKVKLA ELGLQPRQVAVWFQNRRARWKTKQLERDYGL
Sbjct: 61  LSVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGL 120

Query: 121 LKANYETLKRSFDTLQKDNDGLLKEIKELKLKLEEEKTERHLSVKEEIFVPESGNLLIEQ 180
           LKANYE+LKRSFDTLQ+DND LLKEIKELK KLEEEKTE +LSVKEEIFV ES NLLIEQ
Sbjct: 121 LKANYESLKRSFDTLQQDNDALLKEIKELKSKLEEEKTESNLSVKEEIFVSESDNLLIEQ 180

Query: 181 TNNHLSVDHVPLPVSSDHSDDFNYESFRIASAEDGDEQRVKVSLFPDFKDGSSDSDSSAI 240
           T NHL VDH+ LPV+SDHSDDF+YESFR   A+DGD+QRV+VSLFPDFKDGSSDSDSSAI
Sbjct: 181 TTNHLPVDHISLPVASDHSDDFDYESFRTVGADDGDDQRVEVSLFPDFKDGSSDSDSSAI 240

Query: 241 LNEDNSPNAVVSSTAAGMLQSKQQILSSPTSSLNWFPYQKAAAYNNAQQYVKIEEYNFFS 300
           LNEDNSPNAVVSS  AGMLQS  QILSSP +SLN FP+QK A YNNAQQ+VKIEE+NFFS
Sbjct: 241 LNEDNSPNAVVSSATAGMLQSHHQILSSPATSLNCFPFQK-ATYNNAQQFVKIEEHNFFS 300

Query: 301 GEESCDLFSDEQAPSMHWYCPDEWN 326
           GEE+C+LFSDEQAPSMHWYCPD+WN
Sbjct: 301 GEETCNLFSDEQAPSMHWYCPDQWN 324

BLAST of CmaCh03G008210 vs. NCBI nr
Match: gi|449451407|ref|XP_004143453.1| (PREDICTED: homeobox-leucine zipper protein ATHB-6 [Cucumis sativus])

HSP 1 Score: 559.7 bits (1441), Expect = 3.4e-156
Identity = 285/325 (87.69%), Positives = 304/325 (93.54%), Query Frame = 1

Query: 1   MKRHGSSDSLGALMSVCPTSDEQSPRNSHVYGREFQSMLDGLDEEGSMEEQCHVGEKKRR 60
           MKRHGSSDSLGALMSVCPTS+EQSPRNSHVYGREFQSMLDGLDEEGS+EE CHVGEKKRR
Sbjct: 1   MKRHGSSDSLGALMSVCPTSEEQSPRNSHVYGREFQSMLDGLDEEGSIEEHCHVGEKKRR 60

Query: 61  LSVDQVKALEKTFEIENKLEPDRKVKLALELGLQPRQVAVWFQNRRARWKTKQLERDYGL 120
           LSVDQVKALEKTFEIENKLEP+RKVKLA ELGLQPRQVAVWFQNRRARWKTKQLERDYGL
Sbjct: 61  LSVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGL 120

Query: 121 LKANYETLKRSFDTLQKDNDGLLKEIKELKLKLEEEKTERHLSVKEEIFVPESGNLLIEQ 180
           LKANYE+LKRSFDTLQ+DND LLKEIKELK KLEEEKTE +LSVKEEIFV ES NLLIEQ
Sbjct: 121 LKANYESLKRSFDTLQQDNDALLKEIKELKSKLEEEKTESNLSVKEEIFVSESDNLLIEQ 180

Query: 181 TNNHLSVDHVPLPVSSDHSDDFNYESFRIASAEDGDEQRVKVSLFPDFKDGSSDSDSSAI 240
           T NHL VDH+ LPV+SDHSDDFNYESFR   A+DGD+QRV+VSLF DFKDGSSDSDSSAI
Sbjct: 181 TTNHLPVDHISLPVASDHSDDFNYESFRTVGADDGDDQRVEVSLFTDFKDGSSDSDSSAI 240

Query: 241 LNEDNSPNAVVSSTAAGMLQSKQQILSSPTSSLNWFPYQKAAAYNNAQQYVKIEEYNFFS 300
           LNEDNSPNAVVSS  AGMLQS  QILSSP +SLN +P+QK AAYNNAQQ+VKIEE+NFFS
Sbjct: 241 LNEDNSPNAVVSSATAGMLQSHHQILSSPATSLNCYPFQK-AAYNNAQQFVKIEEHNFFS 300

Query: 301 GEESCDLFSDEQAPSMHWYCPDEWN 326
           GEE+C+LFSDEQAPSMHWYCPD+WN
Sbjct: 301 GEETCNLFSDEQAPSMHWYCPDQWN 324

BLAST of CmaCh03G008210 vs. NCBI nr
Match: gi|590706919|ref|XP_007047858.1| (Alanine--glyoxylate aminotransferase 2 isoform 1 [Theobroma cacao])

HSP 1 Score: 390.2 bits (1001), Expect = 3.6e-105
Identity = 220/347 (63.40%), Positives = 257/347 (74.06%), Query Frame = 1

Query: 1   MKRHGSSDSLGALMSVCPTSDEQSPRNSHVYGREFQSMLDGLDEEGSMEEQCHVGEKKRR 60
           MKR GSSDSLGALMS+CPT+DE SPRN+H+Y REFQSMLDGLDEEG +EE  HV EKKRR
Sbjct: 1   MKRLGSSDSLGALMSICPTTDEHSPRNNHIYSREFQSMLDGLDEEGCVEESGHVAEKKRR 60

Query: 61  LSVDQVKALEKTFEIENKLEPDRKVKLALELGLQPRQVAVWFQNRRARWKTKQLERDYGL 120
           LSVDQVKALEK FE+ENKLEP+RKVKLA ELGLQPRQVAVWFQNRRARWKTKQLERDYGL
Sbjct: 61  LSVDQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGL 120

Query: 121 LKANYETLKRSFDTLQKDNDGLLKEIKELKLKLEEEKTERHLSVKEEIFVPESGNLLIEQ 180
           LK +YETLK ++DTLQ DN+ LLKEI+ELK KL  E TE +LSVKEE+ V E+ N  +EQ
Sbjct: 121 LKTSYETLKVNYDTLQHDNEALLKEIRELKAKLNGESTESNLSVKEEVIVHETDNKTLEQ 180

Query: 181 TNNHLSVDHVPLPVS----SDHSDDFNYESFRIASAEDGDEQRVKVSLFPDFKDGSSDSD 240
           +         P PVS    S    + NYESF  +    G       +LFPD KDGSSDSD
Sbjct: 181 SE--------PPPVSSLVTSSEPAELNYESFNNSIGSVG------ATLFPDLKDGSSDSD 240

Query: 241 SSAILNEDN---SP-NAVVSSTAAGMLQSKQQILSSPT--------------SSLNWFPY 300
           SSAILNEDN   SP NA +SS  +G+LQS+Q +L SPT              SS+N F +
Sbjct: 241 SSAILNEDNNNCSPNNAAISS--SGVLQSQQHLLMSPTTTSSLNFNSSSSSPSSMNCFQF 300

Query: 301 QKAAAYNNAQQYVKIEEYNFFSGEESCDLFSDEQAPSMHWYCPDEWN 326
            K + Y  + QYVK+EE+NFFS +E+C+ FSDEQAPS+HWY P++WN
Sbjct: 301 SK-STYQPSHQYVKMEEHNFFSADEACNFFSDEQAPSLHWYSPEQWN 330

BLAST of CmaCh03G008210 vs. NCBI nr
Match: gi|802673601|ref|XP_012081624.1| (PREDICTED: homeobox-leucine zipper protein ATHB-6-like [Jatropha curcas])

HSP 1 Score: 382.5 bits (981), Expect = 7.4e-103
Identity = 213/334 (63.77%), Positives = 254/334 (76.05%), Query Frame = 1

Query: 1   MKRHGSSDSLGALMSVCPTSDEQSPRNS-HVYGREFQSMLDGLDEEGSMEEQCHVGEKKR 60
           MKR  SSDSLGAL+S+CPTSDE SPRNS HVYGREFQSMLDGLDEE  +EE  HV EKKR
Sbjct: 1   MKRLSSSDSLGALISICPTSDEHSPRNSNHVYGREFQSMLDGLDEEACVEEAGHVSEKKR 60

Query: 61  RLSVDQVKALEKTFEIENKLEPDRKVKLALELGLQPRQVAVWFQNRRARWKTKQLERDYG 120
           RLSVDQVKALEK FE+ENKLEP+RKVKLA ELGLQPRQVAVWFQNRRARWKTKQLERDYG
Sbjct: 61  RLSVDQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYG 120

Query: 121 LLKANYETLKRSFDTLQKDNDGLLKEIKELKLKLEEEKTERHLSVKEEIFVPESGNLLIE 180
           +LKANYETLK ++D LQ DN+ LLKEI+ELK KL+E+  E ++SVKEEI + E+     E
Sbjct: 121 VLKANYETLKVNYDALQHDNEALLKEIRELKAKLDEDNAESNVSVKEEIIIAETDEKGSE 180

Query: 181 QTNNHLSVDHVPLPVSSDHSDDFNYESFRIASAEDGDEQRVKVSLFPDFKDGSSDSDSSA 240
           +         +   ++   + D NYESF I S+ + +   + VSLFPDFKDGSSDSDSSA
Sbjct: 181 EPP-------ILTSIAGSETKDMNYESFNINSS-NSNNGILAVSLFPDFKDGSSDSDSSA 240

Query: 241 ILNEDN-----SPNAVVSSTAAGMLQSKQQILSSPT---SSLNWFPYQKAAAYNNAQQYV 300
           ILNEDN     SPN  +SS+  G+ QS  Q++ SP+   SS + F + K  +Y    Q+V
Sbjct: 241 ILNEDNNNSNNSPNPAISSS--GVPQSHNQLMMSPSRPSSSSSPFQFIKTGSYQT--QFV 300

Query: 301 KIEEYNFFSGEESCDLFSDEQAPSMHWYCPDEWN 326
           K+EE+NFFS EE+C+ FSDEQAPS+ WYCPD+WN
Sbjct: 301 KMEEHNFFSSEEACNFFSDEQAPSLQWYCPDQWN 322
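
The percentages in an HSP statistics line follow directly from the raw counts (matches divided by alignment length). The sketch below is one way to parse such a line and recompute the values as a sanity check. The function names `parse_hsp_stats` and `percent` are hypothetical helpers introduced here for illustration, and the regular expression assumes the line layout shown in this record.

```python
import re

# Matches the statistics line printed under each "HSP" heading.
# "Posi?tives" also accepts the spelling "Postives" found in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return identity/positive counts plus the percentages as printed."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


def percent(matches, length):
    """Recompute a percentage from raw counts, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


if __name__ == "__main__":
    line = ("Identity = 213/334 (63.77%), "
            "Positives = 254/334 (76.05%), Query Frame = 1")
    for label, (count, length, printed) in parse_hsp_stats(line).items():
        # Print the value from the record next to the recomputed one.
        print(label, printed, percent(count, length))
```

The alignment length (334 here) includes gap columns, which is why it can exceed the 326-residue query.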

BLAST of CmaCh03G008210 vs. NCBI nr
Match: gi|590706927|ref|XP_007047860.1| (Alanine--glyoxylate aminotransferase 2 isoform 3 [Theobroma cacao])

HSP 1 Score: 369.4 bits (947), Expect = 6.5e-99
Identity = 208/334 (62.28%), Positives = 245/334 (73.35%), Query Frame = 1

Query: 14  MSVCPTSDEQSPRNSHVYGREFQSMLDGLDEEGSMEEQCHVGEKKRRLSVDQVKALEKTF 73
           MS+CPT+DE SPRN+H+Y REFQSMLDGLDEEG +EE  HV EKKRRLSVDQVKALEK F
Sbjct: 1   MSICPTTDEHSPRNNHIYSREFQSMLDGLDEEGCVEESGHVAEKKRRLSVDQVKALEKNF 60

Query: 74  EIENKLEPDRKVKLALELGLQPRQVAVWFQNRRARWKTKQLERDYGLLKANYETLKRSFD 133
           E+ENKLEP+RKVKLA ELGLQPRQVAVWFQNRRARWKTKQLERDYGLLK +YETLK ++D
Sbjct: 61  EVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGLLKTSYETLKVNYD 120

Query: 134 TLQKDNDGLLKEIKELKLKLEEEKTERHLSVKEEIFVPESGNLLIEQTNNHLSVDHVPLP 193
           TLQ DN+ LLKEI+ELK KL  E TE +LSVKEE+ V E+ N  +EQ+         P P
Sbjct: 121 TLQHDNEALLKEIRELKAKLNGESTESNLSVKEEVIVHETDNKTLEQSE--------PPP 180

Query: 194 VS----SDHSDDFNYESFRIASAEDGDEQRVKVSLFPDFKDGSSDSDSSAILNEDN---S 253
           VS    S    + NYESF  +    G       +LFPD KDGSSDSDSSAILNEDN   S
Sbjct: 181 VSSLVTSSEPAELNYESFNNSIGSVG------ATLFPDLKDGSSDSDSSAILNEDNNNCS 240

Query: 254 P-NAVVSSTAAGMLQSKQQILSSPT--------------SSLNWFPYQKAAAYNNAQQYV 313
           P NA +SS  +G+LQS+Q +L SPT              SS+N F + K + Y  + QYV
Sbjct: 241 PNNAAISS--SGVLQSQQHLLMSPTTTSSLNFNSSSSSPSSMNCFQFSK-STYQPSHQYV 300

Query: 314 KIEEYNFFSGEESCDLFSDEQAPSMHWYCPDEWN 326
           K+EE+NFFS +E+C+ FSDEQAPS+HWY P++WN
Sbjct: 301 KMEEHNFFSADEACNFFSDEQAPSLHWYSPEQWN 317

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
ATHB6_ARATH	9.4e-66	48.10	Homeobox-leucine zipper protein ATHB-6 OS=Arabidopsis thaliana GN=ATHB-6 PE=1 SV...
ATB16_ARATH	3.7e-54	44.67	Homeobox-leucine zipper protein ATHB-16 OS=Arabidopsis thaliana GN=ATHB-16 PE=2 ...
ATHB5_ARATH	3.7e-54	44.35	Homeobox-leucine zipper protein ATHB-5 OS=Arabidopsis thaliana GN=ATHB-5 PE=1 SV...
HOX4_ORYSI	1.6e-36	62.88	Homeobox-leucine zipper protein HOX4 OS=Oryza sativa subsp. indica GN=HOX4 PE=1 ...
HOX4_ORYSJ	1.6e-36	62.88	Homeobox-leucine zipper protein HOX4 OS=Oryza sativa subsp. japonica GN=HOX4 PE=...

Match Name	E-value	Identity	Description
A0A0A0KGQ3_CUCSA	2.4e-156	87.69	Uncharacterized protein OS=Cucumis sativus GN=Csa_6G499720 PE=4 SV=1
A0A061DJ94_THECC	2.5e-105	63.40	Alanine--glyoxylate aminotransferase 2 isoform 1 OS=Theobroma cacao GN=TCM_00103...
A0A067KD47_JATCU	5.2e-103	63.77	Uncharacterized protein OS=Jatropha curcas GN=JCGZ_18664 PE=4 SV=1
A0A061DID8_THECC	4.5e-99	62.28	Alanine--glyoxylate aminotransferase 2 isoform 3 OS=Theobroma cacao GN=TCM_00103...
M5WIS1_PRUPE	5.0e-98	59.66	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008318mg PE=4 SV=1

Match Name	E-value	Identity	Description
AT2G22430.1	5.3e-67	48.10	homeobox protein 6
AT4G40060.1	2.1e-55	44.67	homeobox protein 16
AT5G65310.1	2.1e-55	44.35	homeobox protein 5
AT3G01470.1	3.1e-30	57.38	homeobox 1
AT1G69780.1	4.4e-29	48.70	Homeobox-leucine zipper protein family

Match Name	E-value	Identity	Description
gi|659080027|ref|XP_008440572.1|	2.0e-156	87.69	PREDICTED: homeobox-leucine zipper protein ATHB-6-like [Cucumis melo]
gi|449451407|ref|XP_004143453.1|	3.4e-156	87.69	PREDICTED: homeobox-leucine zipper protein ATHB-6 [Cucumis sativus]
gi|590706919|ref|XP_007047858.1|	3.6e-105	63.40	Alanine--glyoxylate aminotransferase 2 isoform 1 [Theobroma cacao]
gi|802673601|ref|XP_012081624.1|	7.4e-103	63.77	PREDICTED: homeobox-leucine zipper protein ATHB-6-like [Jatropha curcas]
gi|590706927|ref|XP_007047860.1|	6.5e-99	62.28	Alanine--glyoxylate aminotransferase 2 isoform 3 [Theobroma cacao]
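
As a rough illustration of how the NCBI nr hits listed above might be screened, the following sketch keeps only hits above an identity cutoff and below an E-value cutoff, then orders them by E-value. The accession, E-value, and identity values are copied from this record; the `Hit` type, the `filter_hits` helper, and both default thresholds are assumptions chosen for the example, not settings used by the database.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    accession: str
    evalue: float
    identity: float  # percent identity as reported in the table
    species: str


# Values transcribed from the NCBI nr summary table of this record.
NR_HITS = [
    Hit("XP_008440572.1", 2.0e-156, 87.69, "Cucumis melo"),
    Hit("XP_004143453.1", 3.4e-156, 87.69, "Cucumis sativus"),
    Hit("XP_007047858.1", 3.6e-105, 63.40, "Theobroma cacao"),
    Hit("XP_012081624.1", 7.4e-103, 63.77, "Jatropha curcas"),
    Hit("XP_007047860.1", 6.5e-99, 62.28, "Theobroma cacao"),
]


def filter_hits(hits, min_identity=80.0, max_evalue=1e-50):
    """Keep hits passing both (hypothetical) cutoffs, best E-value first."""
    kept = [h for h in hits
            if h.identity >= min_identity and h.evalue <= max_evalue]
    return sorted(kept, key=lambda h: h.evalue)


if __name__ == "__main__":
    for hit in filter_hits(NR_HITS):
        print(hit.accession, hit.species, hit.evalue, hit.identity)
```

With the default cutoffs only the two Cucumis hits remain, which matches the expectation that the closest homologs come from other cucurbits.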
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR000047	HTH_motif
IPR001356	Homeobox_dom
IPR003106	Leu_zip_homeo
IPR009057	Homeobox-like_sf
IPR017970	Homeobox_CS
Vocabulary: Molecular Function
Term	Definition
GO:0003677	DNA binding
GO:0003700	transcription factor activity, sequence-specific DNA binding
GO:0043565	sequence-specific DNA binding
Vocabulary: Biological Process
Term	Definition
GO:0006355	regulation of transcription, DNA-templated
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006355	regulation of transcription, DNA-templated
cellular_component	GO:0005634	nucleus
cellular_component	GO:0005667	transcription factor complex
molecular_function	GO:0043565	sequence-specific DNA binding
molecular_function	GO:0003700	transcription factor activity, sequence-specific DNA binding
molecular_function	GO:0016740	transferase activity
molecular_function	GO:0003677	DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CmaCh03G008210.1	CmaCh03G008210.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR000047	Helix-turn-helix motif	PRINTS	PR00031	HTHREPRESSR	coord: 92..108, score: 2.8E-5; coord: 83..92, score: 2.
IPR001356	Homeobox domain	PFAM	PF00046	Homeobox	coord: 57..110, score: 1.3
IPR001356	Homeobox domain	SMART	SM00389	HOX_1	coord: 55..116, score: 2.8
IPR001356	Homeobox domain	PROFILE	PS50071	HOMEOBOX_2	coord: 56..112, score: 16
IPR003106	Leucine zipper, homeobox-associated	PFAM	PF02183	HALZ	coord: 112..153, score: 8.3
IPR009057	Homeodomain-like	GENE3D	G3DSA:1.10.10.60		coord: 59..119, score: 1.1
IPR009057	Homeodomain-like	unknown	SSF46689	Homeodomain-like	coord: 40..114, score: 3.21
IPR017970	Homeobox, conserved site	PROSITE	PS00027	HOMEOBOX_1	coord: 87..110, scor
None	No IPR available	unknown	Coil	Coil	coord: 111..166, scor
None	No IPR available	PANTHER	PTHR24326	FAMILY NOT NAMED	coord: 14..160, score: 3.5
None	No IPR available	PANTHER	PTHR24326:SF196	HOMEOBOX-LEUCINE ZIPPER PROTEIN ATHB-16-RELATED	coord: 14..160, score: 3.5
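
The coordinates in the InterPro table describe a homeobox domain followed almost immediately by a leucine zipper, the layout expected for an HD-Zip protein. The sketch below shows one way to compute feature lengths, the gap between two features, and their overlap from such coordinates. The coordinates are copied from the table; the helper names (`span_length`, `gap_between`, `overlap`) and the tuple layout are hypothetical and only serve the illustration.

```python
# (source term, description, start, end) in 1-based, inclusive protein
# coordinates, transcribed from the InterPro table of this record.
HOMEOBOX = ("PF00046", "Homeobox", 57, 110)
HALZ = ("PF02183", "HALZ", 112, 153)
COIL = ("Coil", "Coil", 111, 166)


def span_length(feature):
    """Number of residues covered by an inclusive start..end range."""
    _, _, start, end = feature
    return end - start + 1


def gap_between(first, second):
    """Residues strictly between two features (negative means overlap)."""
    return second[2] - first[3] - 1


def overlap(a, b):
    """Number of residues shared by two inclusive ranges."""
    start = max(a[2], b[2])
    end = min(a[3], b[3])
    return max(0, end - start + 1)


if __name__ == "__main__":
    print("Homeobox length:", span_length(HOMEOBOX))
    print("HALZ length:", span_length(HALZ))
    print("Gap homeobox -> HALZ:", gap_between(HOMEOBOX, HALZ))
    print("HALZ residues inside predicted coil:", overlap(HALZ, COIL))
```

Different member databases report slightly different boundaries for the same domain (for example 55..116 from SMART versus 57..110 from Pfam), so any such calculation depends on which source row is used.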

The following gene(s) are paralogous to this gene:
Gene	Paralogue	Organism	Block
CmaCh03G008210	CmaCh07G005590	Cucurbita maxima (Rimu)	cmacmaB523