CmoCh16G007860 (gene) Cucurbita moschata (Rifu)

Name: CmoCh16G007860
Type: gene
Organism: Cucurbita moschata (Rifu)
Description: HD domain class transcription factor
Location: Cmo_Chr16 : 3982870 .. 3984071 (+)
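As a quick check on the Location field, the span implied by the coordinates can be computed. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common genome-browser convention, but not stated on this page). The function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("Cmo_Chr16 : 3982870 .. 3984071 (+)")

# Assumption: both endpoints are included in the feature.
span = end - start + 1
print(seqid, strand, span)  # Cmo_Chr16 + 1202
```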
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGAGACGATCAGATTCCATGGCTGCACTCATCTCCATTTCCCCAACATCAGGTTTACATTAAAACCACCAATTTCTTCCCCCTTTTGTTTTTGTACTTGTAATTTACAGACAAACCCATGTGGGTTTCTTTTTCTGTCTCCTCCGACTTATGGTCAATCAATCAATCAACCAATCGGCTTGTTTCCTTTCTGGGTTTTGCAAGATTCGATGGATTTTTGAGTAATTCTTTGCCAATTTTGTGTACAGATCAAGAACAGAGTCCGAGAAATAAGAACAGTAACCATGTTTATGAGATGGAATTCCAGTGTATGCTTGATGGGTTTGATGAGGAAGAATTAGGGCATGTTTCTGAGAAGAAAAGGCGACTTGGTGTGGAGCAAGTTAAGGCGTTAGAGAAGAATTTCGAAGTTGAAAATAAGCTCGAACCAGAGAGGAAATTGAAGCTTGCTCAAGAACTTGGATTACAGCCTCGACAAGTGGCTGTTTGGTTCCAAAATCGTCGAGCTAGATGGAAAACTAAGCAATTAGAAAGAGATTATGGTGTTCTTAAAACCAATTACGACAATCTTAAACTCAGTTTTGAAGCTCTCCAAAATGATAATCAAGCTCTTCTCAAAGAGGTAAAAATTATAGCCCTCCCCTGTTTTTTGAATGTTCAGAAATGTTGTAATCATTATGAACTCTGTAAATTTATGTTAGCAGATTCGGGAATTGAAAGCAAAGATTCAAGAAGATAACTCAGAGATGTTGGTGCCGGCCGATTCTGAAAATGCTTTGATCGAACAAACTAAGCCGGAAATTACCGATGACTTCTCTGTTCCACCGGCGAGAAGCTTCAACAACAATGGCGGAGAAGGGGATGAGCCACCAACAAAAGATGGGTCATCCGACAGCGATTCGAGCGCGATTTTAAACGAAGATTACAGCCCGACGGCCGGTGTTTCTTCACCGGGAGTGTTGCAGAACAACAACCATTTCATGACGGGAGCGATACCTCCATCATCTCCGTCCATCGCCACCGCTGGCGTGAAACTGAACCGCGCGACGACGGCGCTGAATTACTTGCAGTTTCAAAAGGGGTATCAACAAACCCAGATGATGTTTCCGAAAATGGAGGAGCATAATTTCTTCGGCGGAGAGGAGGCTTGTAACTTCTTTTCCGATGAGCAAGCTCCGACTCTGCACTGGTGGAGCTGA

mRNA sequence

ATGAAGAGACGATCAGATTCCATGGCTGCACTCATCTCCATTTCCCCAACATCAGATCAAGAACAGAGTCCGAGAAATAAGAACAGTAACCATGTTTATGAGATGGAATTCCAGTGTATGCTTGATGGGTTTGATGAGGAAGAATTAGGGCATGTTTCTGAGAAGAAAAGGCGACTTGGTGTGGAGCAAGTTAAGGCGTTAGAGAAGAATTTCGAAGTTGAAAATAAGCTCGAACCAGAGAGGAAATTGAAGCTTGCTCAAGAACTTGGATTACAGCCTCGACAAGTGGCTGTTTGGTTCCAAAATCGTCGAGCTAGATGGAAAACTAAGCAATTAGAAAGAGATTATGGTGTTCTTAAAACCAATTACGACAATCTTAAACTCAGTTTTGAAGCTCTCCAAAATGATAATCAAGCTCTTCTCAAAGAGATTCGGGAATTGAAAGCAAAGATTCAAGAAGATAACTCAGAGATGTTGGTGCCGGCCGATTCTGAAAATGCTTTGATCGAACAAACTAAGCCGGAAATTACCGATGACTTCTCTGTTCCACCGGCGAGAAGCTTCAACAACAATGGCGGAGAAGGGGATGAGCCACCAACAAAAGATGGGTCATCCGACAGCGATTCGAGCGCGATTTTAAACGAAGATTACAGCCCGACGGCCGGTGTTTCTTCACCGGGAGTGTTGCAGAACAACAACCATTTCATGACGGGAGCGATACCTCCATCATCTCCGTCCATCGCCACCGCTGGCGTGAAACTGAACCGCGCGACGACGGCGCTGAATTACTTGCAGTTTCAAAAGGGGTATCAACAAACCCAGATGATGTTTCCGAAAATGGAGGAGCATAATTTCTTCGGCGGAGAGGAGGCTTGTAACTTCTTTTCCGATGAGCAAGCTCCGACTCTGCACTGGTGGAGCTGA

Coding sequence (CDS)

ATGAAGAGACGATCAGATTCCATGGCTGCACTCATCTCCATTTCCCCAACATCAGATCAAGAACAGAGTCCGAGAAATAAGAACAGTAACCATGTTTATGAGATGGAATTCCAGTGTATGCTTGATGGGTTTGATGAGGAAGAATTAGGGCATGTTTCTGAGAAGAAAAGGCGACTTGGTGTGGAGCAAGTTAAGGCGTTAGAGAAGAATTTCGAAGTTGAAAATAAGCTCGAACCAGAGAGGAAATTGAAGCTTGCTCAAGAACTTGGATTACAGCCTCGACAAGTGGCTGTTTGGTTCCAAAATCGTCGAGCTAGATGGAAAACTAAGCAATTAGAAAGAGATTATGGTGTTCTTAAAACCAATTACGACAATCTTAAACTCAGTTTTGAAGCTCTCCAAAATGATAATCAAGCTCTTCTCAAAGAGATTCGGGAATTGAAAGCAAAGATTCAAGAAGATAACTCAGAGATGTTGGTGCCGGCCGATTCTGAAAATGCTTTGATCGAACAAACTAAGCCGGAAATTACCGATGACTTCTCTGTTCCACCGGCGAGAAGCTTCAACAACAATGGCGGAGAAGGGGATGAGCCACCAACAAAAGATGGGTCATCCGACAGCGATTCGAGCGCGATTTTAAACGAAGATTACAGCCCGACGGCCGGTGTTTCTTCACCGGGAGTGTTGCAGAACAACAACCATTTCATGACGGGAGCGATACCTCCATCATCTCCGTCCATCGCCACCGCTGGCGTGAAACTGAACCGCGCGACGACGGCGCTGAATTACTTGCAGTTTCAAAAGGGGTATCAACAAACCCAGATGATGTTTCCGAAAATGGAGGAGCATAATTTCTTCGGCGGAGAGGAGGCTTGTAACTTCTTTTCCGATGAGCAAGCTCCGACTCTGCACTGGTGGAGCTGA
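To show how the CDS relates to the protein query used in the BLAST alignments, the sketch below translates the first ten and the last five codons of the CDS above with the standard genetic code. The helper `translate` is hypothetical and written only for illustration. Treating the last 15 nucleotides as in frame assumes the CDS length is a multiple of three.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

head = translate("ATGAAGAGACGATCAGATTCCATGGCTGCA")  # first 30 nt of the CDS
tail = translate("CACTGGTGGAGCTGA")                 # last 15 nt of the CDS
print(head, tail)  # MKRRSDSMAA HWWS*
```

The two fragments agree with the ends of the query in the alignments: it begins `MKR--RSDSMAA` once the gap characters are removed and ends `HWWS` at position 308.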
BLAST of CmoCh16G007860 vs. Swiss-Prot
Match: ATHB6_ARATH (Homeobox-leucine zipper protein ATHB-6 OS=Arabidopsis thaliana GN=ATHB-6 PE=1 SV=1)

HSP 1 Score: 213.8 bits (543), Expect = 2.7e-54
Identity = 160/345 (46.38%), Positives = 205/345 (59.42%), Query Frame = 1

Query: 1   MKR--RSDSMAALISISPT-SDQEQSPRNKNSNHVYEMEFQCMLDGFDEEE------LGH 60
           MKR   SDS+  LIS+ PT S  EQSPR          EFQ ML+G++EEE       GH
Sbjct: 2   MKRLSSSDSVGGLISLCPTTSTDEQSPRRYGGR-----EFQSMLEGYEEEEEAIVEERGH 61

Query: 61  V--SEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQNRRARWKT 120
           V  SEKKRRL + QVKALEKNFE+ENKLEPERK+KLAQELGLQPRQVAVWFQNRRARWKT
Sbjct: 62  VGLSEKKRRLSINQVKALEKNFELENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKT 121

Query: 121 KQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKI-------QEDNSEMLVPA 180
           KQLE+DYGVLKT YD+L+ +F++L+ DN++LL+EI +LK K+       +E+ +   V  
Sbjct: 122 KQLEKDYGVLKTQYDSLRHNFDSLRRDNESLLQEISKLKTKLNGGGGEEEEEENNAAVTT 181

Query: 181 DSENALIEQ--TKPE-ITDDFSVPPARSFNNNGGE-------GDEPPTKD---------G 240
           +S+ ++ E+  + PE IT+  S PP    +++G          D  P K          G
Sbjct: 182 ESDISVKEEEVSLPEKITEAPSSPPQFLEHSDGLNYRSFTDLRDLLPLKAAASSFAAAAG 241

Query: 241 SSDS-DSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNRATTALN 300
           SSDS DSSA+LNE+ S    V++P  +            P                   N
Sbjct: 242 SSDSSDSSALLNEESSSNVTVAAPVTV------------PGG-----------------N 301

Query: 301 YLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQAPTLHWWS 308
           + QF K  +QT+      +  +F  GEEAC FFSDEQ P+LHW+S
Sbjct: 302 FFQFVK-MEQTE------DHEDFLSGEEACEFFSDEQPPSLHWYS 305
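The percentages in each HSP header are the reported counts divided by the alignment length, which includes gap columns. A minimal sketch reproducing the figures for the ATHB6_ARATH hit follows. The helper `percent` is hypothetical.

```python
def percent(count, alignment_length):
    """Return count / alignment_length as a percentage rounded to 2 places."""
    return round(100.0 * count / alignment_length, 2)

# Counts taken from the ATHB6_ARATH HSP header: 160 identities and
# 205 positives over an alignment length of 345 columns.
identity = percent(160, 345)
positives = percent(205, 345)
print(identity, positives)  # 46.38 59.42
```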

BLAST of CmoCh16G007860 vs. Swiss-Prot
Match: ATB16_ARATH (Homeobox-leucine zipper protein ATHB-16 OS=Arabidopsis thaliana GN=ATHB-16 PE=2 SV=2)

HSP 1 Score: 206.5 bits (524), Expect = 4.3e-52
Identity = 155/339 (45.72%), Positives = 193/339 (56.93%), Query Frame = 1

Query: 1   MKR--RSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEEL-------GH 60
           MKR   SDSM  LIS   TS  EQSPR   SN      +Q ML+G+DE+          H
Sbjct: 1   MKRLSSSDSMCGLIS---TSTDEQSPRGYGSN------YQSMLEGYDEDATLIEEYSGNH 60

Query: 61  ----VSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQNRRARW 120
               +SEKKRRL V+QVKALEKNFE+ENKLEPERK KLAQELGLQPRQVAVWFQNRRARW
Sbjct: 61  HHMGLSEKKRRLKVDQVKALEKNFELENKLEPERKTKLAQELGLQPRQVAVWFQNRRARW 120

Query: 121 KTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKI--QEDNSEMLVPADSE 180
           KTKQLE+DYGVLK  YD+L+ +F++L+ DN +LL+EI ++KAK+  +EDN+       + 
Sbjct: 121 KTKQLEKDYGVLKGQYDSLRHNFDSLRRDNDSLLQEISKIKAKVNGEEDNN-------NN 180

Query: 181 NALIEQTKPEI---TDDFSVPPARSFNNNGGEGDEPPTKD-------------GSSDS-D 240
            A+ E  K E    TD     P +   ++ G        D             GSSDS D
Sbjct: 181 KAITEGVKEEEVHKTDSIPSSPLQFLEHSSGFNYRRSFTDLRDLLPNSTVVEAGSSDSCD 240

Query: 241 SSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNRATTALNYLQFQK 300
           SSA+LN++ S   G  +P                  P   T G          ++LQF K
Sbjct: 241 SSAVLNDETSSDNGRLTP------------------PVTVTGG----------SFLQFVK 288

Query: 301 GYQQTQMMFPKMEEHNFFGGEEACNFFSDEQAPTLHWWS 308
             +QT+      +  +F  GEEAC FFSDEQ P+LHW+S
Sbjct: 301 T-EQTE------DHEDFLSGEEACGFFSDEQPPSLHWYS 288

BLAST of CmoCh16G007860 vs. Swiss-Prot
Match: ATHB5_ARATH (Homeobox-leucine zipper protein ATHB-5 OS=Arabidopsis thaliana GN=ATHB-5 PE=1 SV=1)

HSP 1 Score: 183.3 bits (464), Expect = 3.9e-45
Identity = 134/334 (40.12%), Positives = 183/334 (54.79%), Query Frame = 1

Query: 5   SDSMAALISIS-PTSDQEQSPRNKNSNHVYEM--EFQCMLDGFDE----EELGHV----- 64
           SDS++  + I   T+D++ SPR   +  +Y    ++  M D  ++    E+LG V     
Sbjct: 8   SDSLSGFLPIRHSTTDKQISPRPTTTGFLYSGAGDYSQMFDALEDDGSLEDLGGVGHASS 67

Query: 65  --SEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQNRRARWKTK 124
             +EKKRRLGVEQVKALEKNFE++NKLEPERK+KLAQELGLQPRQVA+WFQNRRARWKTK
Sbjct: 68  TAAEKKRRLGVEQVKALEKNFEIDNKLEPERKVKLAQELGLQPRQVAIWFQNRRARWKTK 127

Query: 125 QLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQ-------EDNSEMLVPAD 184
           QLERDYGVLK+N+D LK + ++LQ DN +LL +I+ELKAK+        E+N  +     
Sbjct: 128 QLERDYGVLKSNFDALKRNRDSLQRDNDSLLGQIKELKAKLNVEGVKGIEENGALKAVEA 187

Query: 185 SENALIEQTKPEITDDFSVPPARSFNNNGGEGDEPPTK-----------------DGSSD 244
           +++ +      E++     PP           D P ++                 D +  
Sbjct: 188 NQSVMANNEVLELSHRSPSPPPHI------PTDAPTSELAFEMFSIFPRTENFRDDPADS 247

Query: 245 SDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNRATTALNYLQF 300
           SDSSA+LNE+YSP            N     GA+  ++  ++T G               
Sbjct: 248 SDSSAVLNEEYSP------------NTVEAAGAVAATTVEMSTMGC-------------- 302

BLAST of CmoCh16G007860 vs. Swiss-Prot
Match: HOX4_ORYSJ (Homeobox-leucine zipper protein HOX4 OS=Oryza sativa subsp. japonica GN=HOX4 PE=1 SV=1)

HSP 1 Score: 167.5 bits (423), Expect = 2.2e-40
Identity = 119/271 (43.91%), Positives = 154/271 (56.83%), Query Frame = 1

Query: 42  DGFDEEEL---GHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAV 101
           +G  EEE+   G   EKKRRL VEQV+ALE++FEVENKLEPERK +LA++LGLQPRQVAV
Sbjct: 35  EGDVEEEMMACGGGGEKKRRLSVEQVRALERSFEVENKLEPERKARLARDLGLQPRQVAV 94

Query: 102 WFQNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEM 161
           WFQNRRARWKTKQLERDY  L+ +YD+L+L  +AL+ D  ALL EI+ELKAK+ ++ +  
Sbjct: 95  WFQNRRARWKTKQLERDYAALRHSYDSLRLDHDALRRDKDALLAEIKELKAKLGDEEAAA 154

Query: 162 LVPADSENALIEQTKPEITDDFSVPPARSFNNNGGEGDEPPTKDGSSDSDSSAILNEDYS 221
              +  E       +P  +D    PPA  F              GSSDSDSSA+LN+  +
Sbjct: 155 SFTSVKE-------EPAASDG---PPAAGF--------------GSSDSDSSAVLNDVDA 214

Query: 222 PTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNRATTALNYLQFQKGYQQTQMMFP 281
             A  ++   L        GA P        AG     A  A +   F  G      +  
Sbjct: 215 AGAAPAATDALAPEACTFLGAPP-------AAGAGAGAAAAASHEEVFFHG----NFLKV 270

Query: 282 KMEEHNFFGGEEAC-NFFSDEQAPTL-HWWS 308
           + +E  F   +E C  FF+D+Q P L  WW+
Sbjct: 275 EEDETGFLDDDEPCGGFFADDQPPPLSSWWA 270

BLAST of CmoCh16G007860 vs. Swiss-Prot
Match: HOX4_ORYSI (Homeobox-leucine zipper protein HOX4 OS=Oryza sativa subsp. indica GN=HOX4 PE=1 SV=1)

HSP 1 Score: 167.5 bits (423), Expect = 2.2e-40
Identity = 119/271 (43.91%), Positives = 154/271 (56.83%), Query Frame = 1

Query: 42  DGFDEEEL---GHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAV 101
           +G  EEE+   G   EKKRRL VEQV+ALE++FEVENKLEPERK +LA++LGLQPRQVAV
Sbjct: 35  EGDVEEEMMACGGGGEKKRRLSVEQVRALERSFEVENKLEPERKARLARDLGLQPRQVAV 94

Query: 102 WFQNRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEM 161
           WFQNRRARWKTKQLERDY  L+ +YD+L+L  +AL+ D  ALL EI+ELKAK+ ++ +  
Sbjct: 95  WFQNRRARWKTKQLERDYAALRHSYDSLRLDHDALRRDKDALLAEIKELKAKLGDEEAAA 154

Query: 162 LVPADSENALIEQTKPEITDDFSVPPARSFNNNGGEGDEPPTKDGSSDSDSSAILNEDYS 221
              +  E       +P  +D    PPA  F              GSSDSDSSA+LN+  +
Sbjct: 155 SFTSVKE-------EPAASDG---PPAAGF--------------GSSDSDSSAVLNDVDA 214

Query: 222 PTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNRATTALNYLQFQKGYQQTQMMFP 281
             A  ++   L        GA P        AG     A  A +   F  G      +  
Sbjct: 215 AGAAPAATDALAPEACTFLGAPP-------AAGAGAGAAAAASHEEVFFHG----NFLKV 270

Query: 282 KMEEHNFFGGEEAC-NFFSDEQAPTL-HWWS 308
           + +E  F   +E C  FF+D+Q P L  WW+
Sbjct: 275 EEDETGFLDDDEPCGGFFADDQPPPLSSWWA 270

BLAST of CmoCh16G007860 vs. TrEMBL
Match: A0A061DJ94_THECC (Alanine--glyoxylate aminotransferase 2 isoform 1 OS=Theobroma cacao GN=TCM_001039 PE=4 SV=1)

HSP 1 Score: 309.7 bits (792), Expect = 4.0e-81
Identity = 189/333 (56.76%), Positives = 233/333 (69.97%), Query Frame = 1

Query: 1   MKR--RSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEE----ELGHVSE 60
           MKR   SDS+ AL+SI PT+D E SPRN   NH+Y  EFQ MLDG DEE    E GHV+E
Sbjct: 1   MKRLGSSDSLGALMSICPTTD-EHSPRN---NHIYSREFQSMLDGLDEEGCVEESGHVAE 60

Query: 61  KKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQNRRARWKTKQLER 120
           KKRRL V+QVKALEKNFEVENKLEPERK+KLAQELGLQPRQVAVWFQNRRARWKTKQLER
Sbjct: 61  KKRRLSVDQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLER 120

Query: 121 DYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEM-------LVPADSENA 180
           DYG+LKT+Y+ LK++++ LQ+DN+ALLKEIRELKAK+  +++E        ++  +++N 
Sbjct: 121 DYGLLKTSYETLKVNYDTLQHDNEALLKEIRELKAKLNGESTESNLSVKEEVIVHETDNK 180

Query: 181 LIEQTKPEITDDF--SVPPA----RSFNNNGGEGDE---PPTKDGSSDSDSSAILNED-- 240
            +EQ++P        S  PA     SFNN+ G       P  KDGSSDSDSSAILNED  
Sbjct: 181 TLEQSEPPPVSSLVTSSEPAELNYESFNNSIGSVGATLFPDLKDGSSDSDSSAILNEDNN 240

Query: 241 -YSP-TAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNRATTALNYLQFQKGYQQTQ 300
             SP  A +SS GVLQ+  H +      SS +  ++    + + +++N  QF K   Q  
Sbjct: 241 NCSPNNAAISSSGVLQSQQHLLMSPTTTSSLNFNSS----SSSPSSMNCFQFSKSTYQPS 300

Query: 301 MMFPKMEEHNFFGGEEACNFFSDEQAPTLHWWS 308
             + KMEEHNFF  +EACNFFSDEQAP+LHW+S
Sbjct: 301 HQYVKMEEHNFFSADEACNFFSDEQAPSLHWYS 325

BLAST of CmoCh16G007860 vs. TrEMBL
Match: A9PHT9_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0007s05010g PE=2 SV=1)

HSP 1 Score: 308.9 bits (790), Expect = 6.8e-81
Identity = 185/326 (56.75%), Positives = 224/326 (68.71%), Query Frame = 1

Query: 5   SDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEEL-----GHVSEKKRRL 64
           SDS+ AL+SI P+++ E SPRN    HVY  EFQ MLDG DEE       GHV+EKKRRL
Sbjct: 8   SDSLGALMSICPSAE-EHSPRNHT--HVYSREFQSMLDGLDEEGCVEEAGGHVTEKKRRL 67

Query: 65  GVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVL 124
             +QVKALEKNFEVENKLEPERK+KLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVL
Sbjct: 68  SGDQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVL 127

Query: 125 KTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEM-------LVPADSENALIEQT 184
           K NYD+LK +F+ALQ+DN+ALLKEIRELKAK+ E+N+E        ++ A+SE+ + E+ 
Sbjct: 128 KANYDSLKHNFDALQHDNEALLKEIRELKAKLNEENAESNVSVKEEIILAESEDKMPEED 187

Query: 185 KPEITDDFSVPPAR-----SFNNNG------GEGDEPPTKDGSSDSDSSAILNEDYSPTA 244
            P + D  +    +     +FNN+       G    P  KDGSSDSDSSAILNED SP  
Sbjct: 188 TPALLDSVAASETKELNYETFNNHSSINIGLGASLFPDFKDGSSDSDSSAILNEDNSPNP 247

Query: 245 GVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNRATTALNYLQFQKGYQQTQMMFPKME 304
            +SS G+LQ+         PP S S+       + + +++N  QF K Y   Q  F K+E
Sbjct: 248 AISSSGILQSQLMMS----PPPSSSLRFNCSASSSSPSSMNCFQFSKSY---QTQFVKLE 307

Query: 305 EHNFFGGEEACNFFSDEQAPTLHWWS 308
           EHNFF  EEACNFFSDEQ P+L W+S
Sbjct: 308 EHNFFSSEEACNFFSDEQPPSLPWYS 323

BLAST of CmoCh16G007860 vs. TrEMBL
Match: A0A067KD47_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_18664 PE=4 SV=1)

HSP 1 Score: 308.1 bits (788), Expect = 1.2e-80
Identity = 196/335 (58.51%), Positives = 220/335 (65.67%), Query Frame = 1

Query: 1   MKR--RSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEE----ELGHVSE 60
           MKR   SDS+ ALISI PTSD E SPRN  SNHVY  EFQ MLDG DEE    E GHVSE
Sbjct: 1   MKRLSSSDSLGALISICPTSD-EHSPRN--SNHVYGREFQSMLDGLDEEACVEEAGHVSE 60

Query: 61  KKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQNRRARWKTKQLER 120
           KKRRL V+QVKALEKNFEVENKLEPERK+KLAQELGLQPRQVAVWFQNRRARWKTKQLER
Sbjct: 61  KKRRLSVDQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLER 120

Query: 121 DYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEMLVPADSENALIE---- 180
           DYGVLK NY+ LK++++ALQ+DN+ALLKEIRELKAK+ EDN+E  V    E  + E    
Sbjct: 121 DYGVLKANYETLKVNYDALQHDNEALLKEIRELKAKLDEDNAESNVSVKEEIIIAETDEK 180

Query: 181 --------------QTKPEITDDFSVPPARSFNNNGGEGDEPPTKDGSSDSDSSAILNED 240
                         +TK    + F++  + S N        P  KDGSSDSDSSAILNED
Sbjct: 181 GSEEPPILTSIAGSETKDMNYESFNINSSNSNNGILAVSLFPDFKDGSSDSDSSAILNED 240

Query: 241 Y-----SPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNRATTALNYLQFQKGYQ 300
                 SP   +SS GV Q++N  M     PSS S     +K               G  
Sbjct: 241 NNNSNNSPNPAISSSGVPQSHNQLMMSPSRPSSSSSPFQFIKT--------------GSY 300

Query: 301 QTQMMFPKMEEHNFFGGEEACNFFSDEQAPTLHWW 307
           QTQ  F KMEEHNFF  EEACNFFSDEQAP+L W+
Sbjct: 301 QTQ--FVKMEEHNFFSSEEACNFFSDEQAPSLQWY 316

BLAST of CmoCh16G007860 vs. TrEMBL
Match: B9H4Q5_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0005s07290g PE=4 SV=2)

HSP 1 Score: 307.8 bits (787), Expect = 1.5e-80
Identity = 184/326 (56.44%), Positives = 219/326 (67.18%), Query Frame = 1

Query: 5   SDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEEL-----GHVSEKKRRL 64
           SDS+ AL+SI PT++ E SPRN  S HVY  EFQ ML+G DEE       GHV+EKKRRL
Sbjct: 8   SDSLGALMSICPTTE-EHSPRN--STHVYSREFQSMLNGLDEEGCVEESGGHVTEKKRRL 67

Query: 65  GVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVL 124
             +QVKALEKNFEVENKLEPERK+KLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVL
Sbjct: 68  SGDQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVL 127

Query: 125 KTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEM-------LVPADSENALIEQT 184
           K NYD+LK +F+A+Q DN+ALLKEIRELKAK+ E+N+E        ++ A+SE+ + E+ 
Sbjct: 128 KANYDSLKHNFDAIQQDNEALLKEIRELKAKLNEENTESNVSVKEEIILAESEDKVTEED 187

Query: 185 KPEITDDFSVPP------------ARSFNNNGGEGDEPPTKDGSSDSDSSAILNEDYSPT 244
            P + D  +               + S NN  G    P  KDG SDSDSSAILNED SP 
Sbjct: 188 TPPLLDSLTASAEAKELNYENFNSSSSINNGLGASLFPDFKDGLSDSDSSAILNEDNSPN 247

Query: 245 AGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNRATTALNYLQFQKGYQQTQMMFPKM 304
             +SS G+LQ+         PP S S+       + + + +N  QF K YQ     F K+
Sbjct: 248 PAISSSGILQSQLMMS----PPPSSSLKFNCSTSSSSPSTMNSFQFSKTYQT---QFVKL 307

Query: 305 EEHNFFGGEEACNFFSDEQAPTLHWW 307
           EEHNF   EEACNFFSDEQ PTLHW+
Sbjct: 308 EEHNFLSSEEACNFFSDEQPPTLHWY 323

BLAST of CmoCh16G007860 vs. TrEMBL
Match: M5W009_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008318mg PE=4 SV=1)

HSP 1 Score: 305.4 bits (781), Expect = 7.6e-80
Identity = 191/339 (56.34%), Positives = 230/339 (67.85%), Query Frame = 1

Query: 1   MKR--RSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEE----ELGHVSE 60
           MKR   SDS+ A+ISI P++ +EQSPRN   NHVY  +FQ MLDG DEE    E GHVSE
Sbjct: 1   MKRLGSSDSLGAMISICPSTAEEQSPRN---NHVYRRDFQSMLDGLDEEGCVEEGGHVSE 60

Query: 61  KKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQNRRARWKTKQLER 120
           KKRRL VEQVKALEKNFEVENKLEPERK+KLAQELGLQPRQVAVWFQNRRARWKTKQLER
Sbjct: 61  KKRRLSVEQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLER 120

Query: 121 DYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNS---------EMLVPADSE 180
           D+GVLK NYD+LKL+++ LQ++N+AL+KEI++LK+K+QE+N+         E +V  D  
Sbjct: 121 DFGVLKANYDSLKLNYDNLQHENEALVKEIKQLKSKLQEENTESNNLSVKEEQMVAKDQS 180

Query: 181 NALI-----EQTKPEITDDFSVPPA-------RSFN--NNGGEGDE-----PPTKDGSSD 240
           N  +      ++ P      SVP          SFN  NNG  G E     P  KDGSSD
Sbjct: 181 NYKVVDHELSKSPPPPPLGSSVPATESKELNFESFNNTNNGAVGLEAVSLFPDFKDGSSD 240

Query: 241 SDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNRATTALNYLQF 300
           SDSSAILNED SP   +SS G+LQN+    +   P S+          + +++++N  QF
Sbjct: 241 SDSSAILNEDNSPNLTISSSGMLQNHQLMKS---PASTSLKFNCCSSSSPSSSSMNCFQF 300

Query: 301 QKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQAPTLHW 306
           QK Y      F K+EEHNFF  EEAC+FFSDEQAPTL W
Sbjct: 301 QKTYHP---QFVKIEEHNFFSSEEACSFFSDEQAPTLQW 330

BLAST of CmoCh16G007860 vs. TAIR10
Match: AT2G22430.1 (AT2G22430.1 homeobox protein 6)

HSP 1 Score: 213.8 bits (543), Expect = 1.5e-55
Identity = 160/345 (46.38%), Positives = 205/345 (59.42%), Query Frame = 1

Query: 1   MKR--RSDSMAALISISPT-SDQEQSPRNKNSNHVYEMEFQCMLDGFDEEE------LGH 60
           MKR   SDS+  LIS+ PT S  EQSPR          EFQ ML+G++EEE       GH
Sbjct: 2   MKRLSSSDSVGGLISLCPTTSTDEQSPRRYGGR-----EFQSMLEGYEEEEEAIVEERGH 61

Query: 61  V--SEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQNRRARWKT 120
           V  SEKKRRL + QVKALEKNFE+ENKLEPERK+KLAQELGLQPRQVAVWFQNRRARWKT
Sbjct: 62  VGLSEKKRRLSINQVKALEKNFELENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKT 121

Query: 121 KQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKI-------QEDNSEMLVPA 180
           KQLE+DYGVLKT YD+L+ +F++L+ DN++LL+EI +LK K+       +E+ +   V  
Sbjct: 122 KQLEKDYGVLKTQYDSLRHNFDSLRRDNESLLQEISKLKTKLNGGGGEEEEEENNAAVTT 181

Query: 181 DSENALIEQ--TKPE-ITDDFSVPPARSFNNNGGE-------GDEPPTKD---------G 240
           +S+ ++ E+  + PE IT+  S PP    +++G          D  P K          G
Sbjct: 182 ESDISVKEEEVSLPEKITEAPSSPPQFLEHSDGLNYRSFTDLRDLLPLKAAASSFAAAAG 241

Query: 241 SSDS-DSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNRATTALN 300
           SSDS DSSA+LNE+ S    V++P  +            P                   N
Sbjct: 242 SSDSSDSSALLNEESSSNVTVAAPVTV------------PGG-----------------N 301

Query: 301 YLQFQKGYQQTQMMFPKMEEHNFFGGEEACNFFSDEQAPTLHWWS 308
           + QF K  +QT+      +  +F  GEEAC FFSDEQ P+LHW+S
Sbjct: 302 FFQFVK-MEQTE------DHEDFLSGEEACEFFSDEQPPSLHWYS 305

BLAST of CmoCh16G007860 vs. TAIR10
Match: AT4G40060.1 (AT4G40060.1 homeobox protein 16)

HSP 1 Score: 206.5 bits (524), Expect = 2.4e-53
Identity = 155/339 (45.72%), Positives = 193/339 (56.93%), Query Frame = 1

Query: 1   MKR--RSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEEL-------GH 60
           MKR   SDSM  LIS   TS  EQSPR   SN      +Q ML+G+DE+          H
Sbjct: 1   MKRLSSSDSMCGLIS---TSTDEQSPRGYGSN------YQSMLEGYDEDATLIEEYSGNH 60

Query: 61  ----VSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQNRRARW 120
               +SEKKRRL V+QVKALEKNFE+ENKLEPERK KLAQELGLQPRQVAVWFQNRRARW
Sbjct: 61  HHMGLSEKKRRLKVDQVKALEKNFELENKLEPERKTKLAQELGLQPRQVAVWFQNRRARW 120

Query: 121 KTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKI--QEDNSEMLVPADSE 180
           KTKQLE+DYGVLK  YD+L+ +F++L+ DN +LL+EI ++KAK+  +EDN+       + 
Sbjct: 121 KTKQLEKDYGVLKGQYDSLRHNFDSLRRDNDSLLQEISKIKAKVNGEEDNN-------NN 180

Query: 181 NALIEQTKPEI---TDDFSVPPARSFNNNGGEGDEPPTKD-------------GSSDS-D 240
            A+ E  K E    TD     P +   ++ G        D             GSSDS D
Sbjct: 181 KAITEGVKEEEVHKTDSIPSSPLQFLEHSSGFNYRRSFTDLRDLLPNSTVVEAGSSDSCD 240

Query: 241 SSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNRATTALNYLQFQK 300
           SSA+LN++ S   G  +P                  P   T G          ++LQF K
Sbjct: 241 SSAVLNDETSSDNGRLTP------------------PVTVTGG----------SFLQFVK 288

Query: 301 GYQQTQMMFPKMEEHNFFGGEEACNFFSDEQAPTLHWWS 308
             +QT+      +  +F  GEEAC FFSDEQ P+LHW+S
Sbjct: 301 T-EQTE------DHEDFLSGEEACGFFSDEQPPSLHWYS 288

BLAST of CmoCh16G007860 vs. TAIR10
Match: AT5G65310.1 (AT5G65310.1 homeobox protein 5)

HSP 1 Score: 183.3 bits (464), Expect = 2.2e-46
Identity = 134/334 (40.12%), Positives = 183/334 (54.79%), Query Frame = 1

Query: 5   SDSMAALISIS-PTSDQEQSPRNKNSNHVYEM--EFQCMLDGFDE----EELGHV----- 64
           SDS++  + I   T+D++ SPR   +  +Y    ++  M D  ++    E+LG V     
Sbjct: 8   SDSLSGFLPIRHSTTDKQISPRPTTTGFLYSGAGDYSQMFDALEDDGSLEDLGGVGHASS 67

Query: 65  --SEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQNRRARWKTK 124
             +EKKRRLGVEQVKALEKNFE++NKLEPERK+KLAQELGLQPRQVA+WFQNRRARWKTK
Sbjct: 68  TAAEKKRRLGVEQVKALEKNFEIDNKLEPERKVKLAQELGLQPRQVAIWFQNRRARWKTK 127

Query: 125 QLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQ-------EDNSEMLVPAD 184
           QLERDYGVLK+N+D LK + ++LQ DN +LL +I+ELKAK+        E+N  +     
Sbjct: 128 QLERDYGVLKSNFDALKRNRDSLQRDNDSLLGQIKELKAKLNVEGVKGIEENGALKAVEA 187

Query: 185 SENALIEQTKPEITDDFSVPPARSFNNNGGEGDEPPTK-----------------DGSSD 244
           +++ +      E++     PP           D P ++                 D +  
Sbjct: 188 NQSVMANNEVLELSHRSPSPPPHI------PTDAPTSELAFEMFSIFPRTENFRDDPADS 247

Query: 245 SDSSAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNRATTALNYLQF 300
           SDSSA+LNE+YSP            N     GA+  ++  ++T G               
Sbjct: 248 SDSSAVLNEEYSP------------NTVEAAGAVAATTVEMSTMGC-------------- 302

BLAST of CmoCh16G007860 vs. TAIR10
Match: AT3G01470.1 (AT3G01470.1 homeobox 1)

HSP 1 Score: 128.3 bits (321), Expect = 8.4e-30
Identity = 68/111 (61.26%), Positives = 84/111 (75.68%), Query Frame = 1

Query: 42  DGFDEEELGHVSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQ 101
           D F +++L    EKKRRL  EQV  LEK+FE ENKLEPERK +LA++LGLQPRQVAVWFQ
Sbjct: 58  DDFYDDQL---PEKKRRLTTEQVHLLEKSFETENKLEPERKTQLAKKLGLQPRQVAVWFQ 117

Query: 102 NRRARWKTKQLERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQ 153
           NRRARWKTKQLERDY +LK+ YD L  +++++  DN  L  E+  L  K+Q
Sbjct: 118 NRRARWKTKQLERDYDLLKSTYDQLLSNYDSIVMDNDKLRSEVTSLTEKLQ 165

BLAST of CmoCh16G007860 vs. TAIR10
Match: AT1G69780.1 (AT1G69780.1 Homeobox-leucine zipper protein family)

HSP 1 Score: 127.1 bits (318), Expect = 1.9e-29
Identity = 75/151 (49.67%), Positives = 91/151 (60.26%), Query Frame = 1

Query: 52  VSEKKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQNRRARWKTKQ 111
           + EKKRRL +EQVK LEKNFE+ NKLEPERK++LA+ LGLQPRQ+A+WFQNRRARWKTKQ
Sbjct: 82  MGEKKRRLNMEQVKTLEKNFELGNKLEPERKMQLARALGLQPRQIAIWFQNRRARWKTKQ 141

Query: 112 LERDYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQED--NSEMLVPADSENALI 171
           LE+DY  LK  +D LK   + LQ  NQ L  EI  LK + Q +  N          N   
Sbjct: 142 LEKDYDTLKRQFDTLKAENDLLQTHNQKLQAEIMGLKNREQTESINLNKETEGSCSNRSD 201

Query: 172 EQTKPEITDDFSVPPARSFNNNGGEGDEPPT 201
             +     D  + PP+      GG    P T
Sbjct: 202 NSSDNLRLDISTAPPSNDSTLTGGHPPPPQT 232

BLAST of CmoCh16G007860 vs. NCBI nr
Match: gi|743861233|ref|XP_011031070.1| (PREDICTED: homeobox-leucine zipper protein ATHB-6-like isoform X1 [Populus euphratica])

HSP 1 Score: 311.2 bits (796), Expect = 2.0e-81
Identity = 185/326 (56.75%), Positives = 219/326 (67.18%), Query Frame = 1

Query: 5   SDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEEL-----GHVSEKKRRL 64
           SDS+ AL+SI PT++ E SPRN  S HVY  EFQ MLDG DEE       GHV+EKKRRL
Sbjct: 8   SDSLGALMSICPTTE-EHSPRN--STHVYSREFQSMLDGLDEEGCVEEAGGHVTEKKRRL 67

Query: 65  GVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVL 124
             +QVKALEKNFEVENKLEPERK+KLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVL
Sbjct: 68  SGDQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVL 127

Query: 125 KTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEM-------LVPADSENALIEQT 184
           K NYD+LK +F+A+Q DN+ALLKEIRELKAK+ E+N+E        ++ A+SE  + E+ 
Sbjct: 128 KANYDSLKHNFDAIQQDNEALLKEIRELKAKLNEENTESNVSVKEEIILAESEGKVTEED 187

Query: 185 KPEITDDFSVPP------------ARSFNNNGGEGDEPPTKDGSSDSDSSAILNEDYSPT 244
            P + D  +               + S NN  G    P  KDG SDSDSSAILNED SP 
Sbjct: 188 TPPLLDSLTASAEAKELNYENFNSSSSINNGLGASLFPDLKDGLSDSDSSAILNEDNSPN 247

Query: 245 AGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNRATTALNYLQFQKGYQQTQMMFPKM 304
             +SS G+LQ+         PP S S+       + + + +N  QF K YQ     + K+
Sbjct: 248 PAISSSGILQSQLMMS----PPPSSSLKFNCSNSSSSPSTMNCFQFSKTYQT---QYVKL 307

Query: 305 EEHNFFGGEEACNFFSDEQAPTLHWW 307
           EEHNFF  EEACNFFSDEQ PTLHW+
Sbjct: 308 EEHNFFNSEEACNFFSDEQPPTLHWY 323

BLAST of CmoCh16G007860 vs. NCBI nr
Match: gi|743861237|ref|XP_011031071.1| (PREDICTED: homeobox-leucine zipper protein ATHB-6-like isoform X2 [Populus euphratica])

HSP 1 Score: 310.1 bits (793), Expect = 4.4e-81
Identity = 185/326 (56.75%), Positives = 218/326 (66.87%), Query Frame = 1

Query: 5   SDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEEL-----GHVSEKKRRL 64
           SDS+ AL+SI PT+  E SPRN  S HVY  EFQ MLDG DEE       GHV+EKKRRL
Sbjct: 8   SDSLGALMSICPTT--EHSPRN--STHVYSREFQSMLDGLDEEGCVEEAGGHVTEKKRRL 67

Query: 65  GVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVL 124
             +QVKALEKNFEVENKLEPERK+KLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVL
Sbjct: 68  SGDQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVL 127

Query: 125 KTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEM-------LVPADSENALIEQT 184
           K NYD+LK +F+A+Q DN+ALLKEIRELKAK+ E+N+E        ++ A+SE  + E+ 
Sbjct: 128 KANYDSLKHNFDAIQQDNEALLKEIRELKAKLNEENTESNVSVKEEIILAESEGKVTEED 187

Query: 185 KPEITDDFSVPP------------ARSFNNNGGEGDEPPTKDGSSDSDSSAILNEDYSPT 244
            P + D  +               + S NN  G    P  KDG SDSDSSAILNED SP 
Sbjct: 188 TPPLLDSLTASAEAKELNYENFNSSSSINNGLGASLFPDLKDGLSDSDSSAILNEDNSPN 247

Query: 245 AGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNRATTALNYLQFQKGYQQTQMMFPKM 304
             +SS G+LQ+         PP S S+       + + + +N  QF K YQ     + K+
Sbjct: 248 PAISSSGILQSQLMMS----PPPSSSLKFNCSNSSSSPSTMNCFQFSKTYQT---QYVKL 307

Query: 305 EEHNFFGGEEACNFFSDEQAPTLHWW 307
           EEHNFF  EEACNFFSDEQ PTLHW+
Sbjct: 308 EEHNFFNSEEACNFFSDEQPPTLHWY 322

BLAST of CmoCh16G007860 vs. NCBI nr
Match: gi|590706919|ref|XP_007047858.1| (Alanine--glyoxylate aminotransferase 2 isoform 1 [Theobroma cacao])

HSP 1 Score: 309.7 bits (792), Expect = 5.8e-81
Identity = 189/333 (56.76%), Positives = 233/333 (69.97%), Query Frame = 1

Query: 1   MKR--RSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEE----ELGHVSE 60
           MKR   SDS+ AL+SI PT+D E SPRN   NH+Y  EFQ MLDG DEE    E GHV+E
Sbjct: 1   MKRLGSSDSLGALMSICPTTD-EHSPRN---NHIYSREFQSMLDGLDEEGCVEESGHVAE 60

Query: 61  KKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQNRRARWKTKQLER 120
           KKRRL V+QVKALEKNFEVENKLEPERK+KLAQELGLQPRQVAVWFQNRRARWKTKQLER
Sbjct: 61  KKRRLSVDQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLER 120

Query: 121 DYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEM-------LVPADSENA 180
           DYG+LKT+Y+ LK++++ LQ+DN+ALLKEIRELKAK+  +++E        ++  +++N 
Sbjct: 121 DYGLLKTSYETLKVNYDTLQHDNEALLKEIRELKAKLNGESTESNLSVKEEVIVHETDNK 180

Query: 181 LIEQTKPEITDDF--SVPPA----RSFNNNGGEGDE---PPTKDGSSDSDSSAILNED-- 240
            +EQ++P        S  PA     SFNN+ G       P  KDGSSDSDSSAILNED  
Sbjct: 181 TLEQSEPPPVSSLVTSSEPAELNYESFNNSIGSVGATLFPDLKDGSSDSDSSAILNEDNN 240

Query: 241 -YSP-TAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNRATTALNYLQFQKGYQQTQ 300
             SP  A +SS GVLQ+  H +      SS +  ++    + + +++N  QF K   Q  
Sbjct: 241 NCSPNNAAISSSGVLQSQQHLLMSPTTTSSLNFNSS----SSSPSSMNCFQFSKSTYQPS 300

Query: 301 MMFPKMEEHNFFGGEEACNFFSDEQAPTLHWWS 308
             + KMEEHNFF  +EACNFFSDEQAP+LHW+S
Sbjct: 301 HQYVKMEEHNFFSADEACNFFSDEQAPSLHWYS 325

BLAST of CmoCh16G007860 vs. NCBI nr
Match: gi|470103473|ref|XP_004288161.1| (PREDICTED: homeobox-leucine zipper protein ATHB-6-like isoform X2 [Fragaria vesca subsp. vesca])

HSP 1 Score: 308.9 bits (790), Expect = 9.8e-81
Identity = 196/337 (58.16%), Positives = 228/337 (67.66%), Query Frame = 1

Query: 1   MKR--RSDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEE----ELGHVSE 60
           MKR   SDS+ ALISI PT+  E SPRN   NHVY  +FQ MLDG DEE    E GHV+E
Sbjct: 1   MKRLGSSDSLGALISICPTTTDEHSPRN---NHVYSRDFQSMLDGLDEEGCVEESGHVAE 60

Query: 61  KKRRLGVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQNRRARWKTKQLER 120
           KKRRL VEQVKALEKNFEVENKLEPERK+KLAQELGLQPRQVAVWFQNRRARWKTKQLER
Sbjct: 61  KKRRLSVEQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLER 120

Query: 121 DYGVLKTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSE----------MLVPADS 180
           DYGVLK NYD+LK+SF++LQ+DNQAL KEI+ELKAK QE+N+E           L    S
Sbjct: 121 DYGVLKANYDSLKISFDSLQHDNQALHKEIKELKAKFQEENTESNHSVKEEQMALANESS 180

Query: 181 ENALIEQTKPEITDDFSVPPA--------RSFNN---NGGEGDE----PPTKDGSSDSDS 240
              +IEQ+KP+  +  + PP          SFNN   NG  G E    P  KDGSSDSDS
Sbjct: 181 YKMVIEQSKPQSPE--TSPPVSGSKELNFESFNNTNSNGAVGVEVSLFPDFKDGSSDSDS 240

Query: 241 SAILNEDYSPTAGVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNRATTALNYLQFQKG 300
           SAILNED       +SP    N +H +  A   +S     +    + +++++N  QFQK 
Sbjct: 241 SAILNEDQ------NSPNGTINQHHQLMPA--SNSLKFNCSASSSSPSSSSMNCFQFQKS 300

Query: 301 YQQTQMMFPKMEEHNFFGGEEACNFFSDEQAPTLHWW 307
             Q Q  F K+EEHNFF  EEACNFFSDEQAP+L W+
Sbjct: 301 SYQPQ--FVKIEEHNFFSSEEACNFFSDEQAPSLQWY 322

BLAST of CmoCh16G007860 vs. NCBI nr
Match: gi|566179669|ref|XP_006380403.1| (hypothetical protein POPTR_0007s05010g [Populus trichocarpa])

HSP 1 Score: 308.9 bits (790), Expect = 9.8e-81
Identity = 185/326 (56.75%), Positives = 224/326 (68.71%), Query Frame = 1

Query: 5   SDSMAALISISPTSDQEQSPRNKNSNHVYEMEFQCMLDGFDEEEL-----GHVSEKKRRL 64
           SDS+ AL+SI P+++ E SPRN    HVY  EFQ MLDG DEE       GHV+EKKRRL
Sbjct: 8   SDSLGALMSICPSAE-EHSPRNHT--HVYSREFQSMLDGLDEEGCVEEAGGHVTEKKRRL 67

Query: 65  GVEQVKALEKNFEVENKLEPERKLKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVL 124
             +QVKALEKNFEVENKLEPERK+KLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVL
Sbjct: 68  SGDQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVL 127

Query: 125 KTNYDNLKLSFEALQNDNQALLKEIRELKAKIQEDNSEM-------LVPADSENALIEQT 184
           K NYD+LK +F+ALQ+DN+ALLKEIRELKAK+ E+N+E        ++ A+SE+ + E+ 
Sbjct: 128 KANYDSLKHNFDALQHDNEALLKEIRELKAKLNEENAESNVSVKEEIILAESEDKMPEED 187

Query: 185 KPEITDDFSVPPAR-----SFNNNG------GEGDEPPTKDGSSDSDSSAILNEDYSPTA 244
            P + D  +    +     +FNN+       G    P  KDGSSDSDSSAILNED SP  
Sbjct: 188 TPALLDSVAASETKELNYETFNNHSSINIGLGASLFPDFKDGSSDSDSSAILNEDNSPNP 247

Query: 245 GVSSPGVLQNNNHFMTGAIPPSSPSIATAGVKLNRATTALNYLQFQKGYQQTQMMFPKME 304
            +SS G+LQ+         PP S S+       + + +++N  QF K Y   Q  F K+E
Sbjct: 248 AISSSGILQSQLMMS----PPPSSSLRFNCSASSSSPSSMNCFQFSKSY---QTQFVKLE 307

Query: 305 EHNFFGGEEACNFFSDEQAPTLHWWS 308
           EHNFF  EEACNFFSDEQ P+L W+S
Sbjct: 308 EHNFFSSEEACNFFSDEQPPSLPWYS 323

The following BLAST results are available for this feature:

BLAST of CmoCh16G007860 vs. Swiss-Prot
Match Name    E-value   Identity (%)   Description
ATHB6_ARATH   2.7e-54   46.38          Homeobox-leucine zipper protein ATHB-6 OS=Arabidopsis thaliana GN=ATHB-6 PE=1 SV...
ATB16_ARATH   4.3e-52   45.72          Homeobox-leucine zipper protein ATHB-16 OS=Arabidopsis thaliana GN=ATHB-16 PE=2 ...
ATHB5_ARATH   3.9e-45   40.12          Homeobox-leucine zipper protein ATHB-5 OS=Arabidopsis thaliana GN=ATHB-5 PE=1 SV...
HOX4_ORYSJ    2.2e-40   43.91          Homeobox-leucine zipper protein HOX4 OS=Oryza sativa subsp. japonica GN=HOX4 PE=...
HOX4_ORYSI    2.2e-40   43.91          Homeobox-leucine zipper protein HOX4 OS=Oryza sativa subsp. indica GN=HOX4 PE=1 ...

BLAST of CmoCh16G007860 vs. TrEMBL
Match Name         E-value   Identity (%)   Description
A0A061DJ94_THECC   4.0e-81   56.76          Alanine--glyoxylate aminotransferase 2 isoform 1 OS=Theobroma cacao GN=TCM_00103...
A9PHT9_POPTR       6.8e-81   56.75          Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0007s05010g PE=2 SV=1
A0A067KD47_JATCU   1.2e-80   58.51          Uncharacterized protein OS=Jatropha curcas GN=JCGZ_18664 PE=4 SV=1
B9H4Q5_POPTR       1.5e-80   56.44          Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0005s07290g PE=4 SV=2
M5W009_PRUPE       7.6e-80   56.34          Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008318mg PE=4 SV=1

BLAST of CmoCh16G007860 vs. TAIR10
Match Name    E-value   Identity (%)   Description
AT2G22430.1   1.5e-55   46.38          homeobox protein 6
AT4G40060.1   2.4e-53   45.72          homeobox protein 16
AT5G65310.1   2.2e-46   40.12          homeobox protein 5
AT3G01470.1   8.4e-30   61.26          homeobox 1
AT1G69780.1   1.9e-29   49.67          Homeobox-leucine zipper protein family

BLAST of CmoCh16G007860 vs. NCBI nr
Match Name                         E-value   Identity (%)   Description
gi|743861233|ref|XP_011031070.1|   2.0e-81   56.75          PREDICTED: homeobox-leucine zipper protein ATHB-6-like isoform X1 [Populus euphr...
gi|743861237|ref|XP_011031071.1|   4.4e-81   56.75          PREDICTED: homeobox-leucine zipper protein ATHB-6-like isoform X2 [Populus euphr...
gi|590706919|ref|XP_007047858.1|   5.8e-81   56.76          Alanine--glyoxylate aminotransferase 2 isoform 1 [Theobroma cacao]
gi|470103473|ref|XP_004288161.1|   9.8e-81   58.16          PREDICTED: homeobox-leucine zipper protein ATHB-6-like isoform X2 [Fragaria vesc...
gi|566179669|ref|XP_006380403.1|   9.8e-81   56.75          hypothetical protein POPTR_0007s05010g [Populus trichocarpa]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR000047   HTH_motif
IPR001356   Homeobox_dom
IPR003106   Leu_zip_homeo
IPR009057   Homeobox-like_sf
IPR017970   Homeobox_CS

Vocabulary: Molecular Function
Term        Definition
GO:0003677  DNA binding
GO:0003700  transcription factor activity, sequence-specific DNA binding
GO:0043565  sequence-specific DNA binding

Vocabulary: Biological Process
Term        Definition
GO:0006355  regulation of transcription, DNA-templated

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006355      regulation of transcription, DNA-templated
cellular_component  GO:0005667      transcription factor complex
molecular_function  GO:0043565      sequence-specific DNA binding
molecular_function  GO:0003700      transcription factor activity, sequence-specific DNA binding
molecular_function  GO:0016740      transferase activity
molecular_function  GO:0003677      DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh16G007860.1  CmoCh16G007860.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                      Source   Source Term       Source Description                               Alignment
IPR000047  Helix-turn-helix motif               PRINTS   PR00031           HTHREPRESSR                                      coord: 90..106 score: 1.2E-5; coord: 81..90 score: 1.
IPR001356  Homeobox domain                      PFAM     PF00046           Homeobox                                         coord: 56..108 score: 1.1
IPR001356  Homeobox domain                      SMART    SM00389           HOX_1                                            coord: 53..114 score: 4.5
IPR001356  Homeobox domain                      PROFILE  PS50071           HOMEOBOX_2                                       coord: 50..110 score: 17
IPR003106  Leucine zipper, homeobox-associated  PFAM     PF02183           HALZ                                             coord: 110..152 score: 1.0
IPR009057  Homeodomain-like                     GENE3D   G3DSA:1.10.10.60                                                   coord: 59..117 score: 2.1
IPR009057  Homeodomain-like                     unknown  SSF46689          Homeodomain-like                                 coord: 45..112 score: 6.42
IPR017970  Homeobox, conserved site             PROSITE  PS00027           HOMEOBOX_1                                       coord: 85..108 scor
None       No IPR available                     unknown  Coil              Coil                                             coord: 123..157 scor
None       No IPR available                     PANTHER  PTHR24326         FAMILY NOT NAMED                                 coord: 1..170 score: 7.4
None       No IPR available                     PANTHER  PTHR24326:SF196   HOMEOBOX-LEUCINE ZIPPER PROTEIN ATHB-16-RELATED  coord: 1..170 score: 7.4
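
The `coord:` ranges in the InterPro annotations can be compared directly to see how the predicted domains sit relative to each other. The sketch below is illustrative only and not part of the original record: it assumes the coordinates are 1-based, inclusive residue positions on the protein (the page does not state this), and the helper names `parse_span`, `span_length` and `overlap` are hypothetical.

```python
import re

# Alignment cells copied from the InterPro annotations of this gene.
HITS = {
    "PF00046 Homeobox": "coord: 56..108",
    "PF02183 HALZ": "coord: 110..152",
    "Coil": "coord: 123..157",
}

COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def parse_span(cell):
    """Extract (start, end) from a 'coord: START..END' cell."""
    start, end = COORD.search(cell).groups()
    return int(start), int(end)


def span_length(span):
    # Assumption: both endpoints are included in the hit.
    return span[1] - span[0] + 1


def overlap(a, b):
    """Number of positions shared by two inclusive spans (0 if disjoint)."""
    lo, hi = max(a[0], b[0]), min(a[1], b[1])
    return max(0, hi - lo + 1)


spans = {name: parse_span(cell) for name, cell in HITS.items()}
for name, span in spans.items():
    print(f"{name}: {span[0]}..{span[1]} ({span_length(span)} residues)")

print("HALZ/Coil overlap:", overlap(spans["PF02183 HALZ"], spans["Coil"]))
print("Homeobox/HALZ overlap:", overlap(spans["PF00046 Homeobox"], spans["PF02183 HALZ"]))
```

Under that assumption, the Pfam homeobox hit ends two residues before the HALZ leucine zipper hit begins, and the predicted coiled coil lies largely inside the HALZ region.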

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene            Organism                   Block
CmoCh16G007860  Melon (DHL92) v3.6.1       cmomedB340
CmoCh16G007860  Cucurbita moschata (Rifu)  cmocmoB285
CmoCh16G007860  Melon (DHL92) v3.5.1       cmomeB297