CmoCh14G021960 (gene) Cucurbita moschata (Rifu)

Name: CmoCh14G021960
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: HB transcription factor
Location: Cmo_Chr14 : 15656407 .. 15657713 (-)
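
The Location field can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF3) and that "(-)" denotes the minus strand; the names `Location`, `parse_location`, `span_length` and `reverse_complement` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int   # assumed 1-based, inclusive
    end: int     # assumed 1-based, inclusive
    strand: str  # "+" or "-"


def parse_location(text: str) -> Location:
    """Parse a string such as 'Cmo_Chr14 : 15656407 .. 15657713 (-)'."""
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length of the feature on the chromosome (inclusive coordinates)."""
    return loc.end - loc.start + 1


_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Reverse-complement a DNA string (useful for minus-strand features)."""
    return seq.translate(_COMPLEMENT)[::-1]


loc = parse_location("Cmo_Chr14 : 15656407 .. 15657713 (-)")
print(loc.seqid, loc.strand, span_length(loc))
```

Under these assumptions the feature spans 1307 positions on Cmo_Chr14. For a minus-strand feature, a sequence cut from the reference strand would need `reverse_complement` before it could be compared with the gene sequence listed here.
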
The following sequences are available for this feature:

Gene sequence (with intron)

ATTCTCTCTTTCTATCAAGGACCCCACCAGACTTGGATATAAATGATGCATAAAATAAGTGAGCAAATATATTAGAGATATTGCAAATCATTCACTCTCCTAACACCTGGTTTTGGAATAAAGCCTTGTGGTGTCCAAATTGAATCTTAAGCAGCATGCTAAATGATGATGATGCCGAATATTCTCCTCCAGAATCCATGGCAGAGGCTTTTGGCATGAGAAAAAAAAGCATGAACAGAAGAAGGTTCAGTGAAGAGCAGATCAAATCATTGGAGTCAATTTTCGAGTCTGAGTCAAGGCTTGAGCCCAGAAAGAAGTTGCAGCTAGCGGGAGAGCTGGGGTTGCATCCACGCCAGGTTGCAATATGGTTTCAAAACAAGAGAGCTAGATGGAAGTCAAAGCAGCTTGAACGAGACTACAGCGTTTTACGAGCTAATTACAACACTCTAGTTTCCCGGTTTGAAGCTCTTAAGAAGGAAAAGCAAGCTTTGGCTATACAGGTATTTGTTCTTTGTTTCTATAGAAAAACACCGTCCGTTTCTAGTTTTTGCTAATCATTGTTATTTCTTTACTTCGGCCAGTTGCAGAAGCTAAATGATCTCGTGCAGAGGTCCATGGAGGAAACTGAGAGCTGCAGGGGAGGTCTCTCCTTAGAAACTATTGATGGCAAGTCTGAAAATGGCCACAGAACAAAGTATGAGTCTGAAGTAAAACCTTGTGTGTCAGCTGAGGAGAAATCAGAACATGAACTAGAGGTTCTTTCCAACTATGGTAGTGGCACAAAGGAAGCCTATCTTGGATTAGACGAGCCCCAGTTCAGGGAACCTGCTCAGGGCTCCTTGATATCAACACCAAATTGGAGCAATTTAGAGTCTGAAGGGCTTTTCAGTCAGTCCAATACCAATGGCCAGTGGTGGGACTTCTGGTCGTGAAAAGTAGAAACACAAAAAAAAAAACCATAAAGGATCTGTAAACAAATAAAATTCCAATTTTTTTTTTTAAATAGATCATCCTAGCAATGATTTCGGCTCAAAGGACGAGGAACAGTTCAGAGCATAACGTCTTCTATCAAGGATAGCAATGATTGTAATGTGGTTGTTTCAAGCACTGATGCTCCAAAATTCACCATAACTTCAGATGGAAGTCAGAAAATATTCTGTGTATAACCAAAAAATTCGGTGAGCAACGCTAGGAAATACCACCGGCAAAGAGTGCTGCCTTGGAAAACTGAAACAATATGCTCATATGCAGACAGGGGTGCAGTCTGGAGTGCTTGCAGTGTGTGAGAGTTCCTTTCTTCTTCTGTT

mRNA sequence

ATTCTCTCTTTCTATCAAGGACCCCACCAGACTTGGATATAAATGATGCATAAAATAAGTGAGCAAATATATTAGAGATATTGCAAATCATTCACTCTCCTAACACCTGGTTTTGGAATAAAGCCTTGTGGTGTCCAAATTGAATCTTAAGCAGCATGCTAAATGATGATGATGCCGAATATTCTCCTCCAGAATCCATGGCAGAGGCTTTTGGCATGAGAAAAAAAAGCATGAACAGAAGAAGGTTCAGTGAAGAGCAGATCAAATCATTGGAGTCAATTTTCGAGTCTGAGTCAAGGCTTGAGCCCAGAAAGAAGTTGCAGCTAGCGGGAGAGCTGGGGTTGCATCCACGCCAGGTTGCAATATGGTTTCAAAACAAGAGAGCTAGATGGAAGTCAAAGCAGCTTGAACGAGACTACAGCGTTTTACGAGCTAATTACAACACTCTAGTTTCCCGGTTTGAAGCTCTTAAGAAGGAAAAGCAAGCTTTGGCTATACAGTTGCAGAAGCTAAATGATCTCGTGCAGAGGTCCATGGAGGAAACTGAGAGCTGCAGGGGAGGTCTCTCCTTAGAAACTATTGATGGCAAGTCTGAAAATGGCCACAGAACAAAGTATGAGTCTGAAGTAAAACCTTGTGTGTCAGCTGAGGAGAAATCAGAACATGAACTAGAGGTTCTTTCCAACTATGGTAGTGGCACAAAGGAAGCCTATCTTGGATTAGACGAGCCCCAGTTCAGGGAACCTGCTCAGGGCTCCTTGATATCAACACCAAATTGGAGCAATTTAGAGTCTGAAGGGCTTTTCAGTCAGTCCAATACCAATGGCCAGTGGTGGGACTTCTGGTCGTGAAAAGTAGAAACACAAAAAAAAAAACCATAAAGGATCTGTAAACAAATAAAATTCCAATTTTTTTTTTTAAATAGATCATCCTAGCAATGATTTCGGCTCAAAGGACGAGGAACAGTTCAGAGCATAACGTCTTCTATCAAGGATAGCAATGATTGTAATGTGGTTGTTTCAAGCACTGATGCTCCAAAATTCACCATAACTTCAGATGGAAGTCAGAAAATATTCTGTGTATAACCAAAAAATTCGGTGAGCAACGCTAGGAAATACCACCGGCAAAGAGTGCTGCCTTGGAAAACTGAAACAATATGCTCATATGCAGACAGGGGTGCAGTCTGGAGTGCTTGCAGTGTGTGAGAGTTCCTTTCTTCTTCTGTT
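
The gene and mRNA sequences above differ by one contiguous stretch that is present only in the gene sequence (the intron). A minimal sketch of how that stretch could be located, assuming the transcript differs from the gene by exactly one removed segment; `find_single_intron` is a hypothetical helper, and the two excerpts are short windows copied from the sequences above around the junction.

```python
def find_single_intron(gene: str, mrna: str) -> tuple[int, int]:
    """Return (start, end), 0-based half-open, of the one segment of `gene`
    that is missing from `mrna`. Raises if one segment cannot explain it."""
    if len(mrna) > len(gene):
        raise ValueError("mRNA is longer than the gene sequence")
    # Longest common prefix of the two sequences.
    p = 0
    while p < len(mrna) and gene[p] == mrna[p]:
        p += 1
    # Longest common suffix, capped so it cannot overlap the prefix.
    s = 0
    while s < len(mrna) - p and gene[-1 - s] == mrna[-1 - s]:
        s += 1
    if p + s != len(mrna):
        raise ValueError("sequences differ by more than one removed segment")
    return p, len(gene) - s


# Excerpts around the exon/intron junction, taken from the sequences above.
gene_excerpt = (
    "GCTTTGGCTATACAGGTATTTGTTCTTTGTTTCTATAGAAAAACACCGTCCGTTTCTAGTTTTTG"
    "CTAATCATTGTTATTTCTTTACTTCGGCCAGTTGCAGAAGCTAAATG"
)
mrna_excerpt = "GCTTTGGCTATACAGTTGCAGAAGCTAAATG"

start, end = find_single_intron(gene_excerpt, mrna_excerpt)
intron = gene_excerpt[start:end]
print(start, end, intron[:2], intron[-2:])
```

Because the bases flanking the junction repeat ("CAG" occurs on both sides), the exact boundary is ambiguous from sequence comparison alone. Taking the longest prefix first, as done here, yields a segment that begins with GT and ends with AG.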

Coding sequence (CDS)

ATGCTAAATGATGATGATGCCGAATATTCTCCTCCAGAATCCATGGCAGAGGCTTTTGGCATGAGAAAAAAAAGCATGAACAGAAGAAGGTTCAGTGAAGAGCAGATCAAATCATTGGAGTCAATTTTCGAGTCTGAGTCAAGGCTTGAGCCCAGAAAGAAGTTGCAGCTAGCGGGAGAGCTGGGGTTGCATCCACGCCAGGTTGCAATATGGTTTCAAAACAAGAGAGCTAGATGGAAGTCAAAGCAGCTTGAACGAGACTACAGCGTTTTACGAGCTAATTACAACACTCTAGTTTCCCGGTTTGAAGCTCTTAAGAAGGAAAAGCAAGCTTTGGCTATACAGTTGCAGAAGCTAAATGATCTCGTGCAGAGGTCCATGGAGGAAACTGAGAGCTGCAGGGGAGGTCTCTCCTTAGAAACTATTGATGGCAAGTCTGAAAATGGCCACAGAACAAAGTATGAGTCTGAAGTAAAACCTTGTGTGTCAGCTGAGGAGAAATCAGAACATGAACTAGAGGTTCTTTCCAACTATGGTAGTGGCACAAAGGAAGCCTATCTTGGATTAGACGAGCCCCAGTTCAGGGAACCTGCTCAGGGCTCCTTGATATCAACACCAAATTGGAGCAATTTAGAGTCTGAAGGGCTTTTCAGTCAGTCCAATACCAATGGCCAGTGGTGGGACTTCTGGTCGTGA
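
The CDS can be translated to check it against the Query lines of the BLAST alignments below. This is a sketch using the standard genetic code; `translate` is a hypothetical helper, and only the first 60 and last 21 nucleotides of the CDS above are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGCTAAATGATGATGATGCCGAATATTCTCCTCCAGAATCCATGGCAGAGGCTTTTGGC"
cds_end = "TGGTGGGACTTCTGGTCGTGA"

print(translate(cds_start))  # first 20 residues
print(translate(cds_end))    # last 6 residues followed by the stop
```

The two translated fragments can be compared with the start and end of the query protein shown in the alignments.
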
BLAST of CmoCh14G021960 vs. Swiss-Prot
Match: ATHB7_ARATH (Homeobox-leucine zipper protein ATHB-7 OS=Arabidopsis thaliana GN=ATHB-7 PE=1 SV=2)

HSP 1 Score: 177.6 bits (449), Expect = 1.6e-43
Identity = 118/256 (46.09%), Positives = 153/256 (59.77%), Query Frame = 1

Query: 5   DDAEYSPPESMAEAFGMRKKSM-------NRRRFSEEQIKSLESIFESESRLEPRKKLQL 64
           +  EYSP    AE F   KK         N+RRFS+EQIKSLE +FESE+RLEPRKK+QL
Sbjct: 3   EGGEYSPAMMSAEPFLTMKKMKKSNHNKNNQRRFSDEQIKSLEMMFESETRLEPRKKVQL 62

Query: 65  AGELGLHPRQVAIWFQNKRARWKSKQLERDYSVLRANYNTLVSRFEALKKEKQALAIQLQ 124
           A ELGL PRQVAIWFQNKRARWKSKQLE +Y++LR NY+ L S+FE+LKKEKQAL  +LQ
Sbjct: 63  ARELGLQPRQVAIWFQNKRARWKSKQLETEYNILRQNYDNLASQFESLKKEKQALVSELQ 122

Query: 125 KLNDLVQ-RSMEETESCRGG---LSLETIDGKSEN-GHRTKYESEVKPCVSAEEK----- 184
           +L +  Q ++ EE   C G    ++L +   +SEN  +R +   EV+P +  ++      
Sbjct: 123 RLKEATQKKTQEEERQCSGDQAVVALSSTHHESENEENRRRKPEEVRPEMEMKDDKGHHG 182

Query: 185 ---SEHELEVLSN-YGSGTKEAYLG--LDEP----QFREPAQGSLISTPNWSNLESE--G 232
                H+ E   N Y +  K  Y G   +EP       EPA   L S+ +W   +S+   
Sbjct: 183 VMCDHHDYEDDDNGYSNNIKREYFGGFEEEPDHLMNIVEPADSCLTSSDDWRGFKSDTTT 242
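
The percentages on each HSP summary line follow from the counts: identities (or positives) divided by the alignment length, which includes gap columns. A minimal sketch that parses one summary line and recomputes the percentages; `parse_hsp_stats` is a hypothetical helper, and the pattern accepts both "Positives" and the "Postives" spelling some pages emit.

```python
import re

HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts from an HSP summary line and recompute percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        "identity_pct_reported": float(ident_pct),
        "positives_pct_reported": float(pos_pct),
        "identity_pct": 100 * int(ident) / int(ident_len),
        "positives_pct": 100 * int(pos) / int(pos_len),
    }


stats = parse_hsp_stats(
    "Identity = 118/256 (46.09%), Positives = 153/256 (59.77%), Query Frame = 1"
)
print(round(stats["identity_pct"], 2), round(stats["positives_pct"], 2))
```

For the ATHB7_ARATH hit, 118 of 256 aligned columns are identical and 153 of 256 are scored as positive, which agrees with the reported percentages after rounding.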

BLAST of CmoCh14G021960 vs. Swiss-Prot
Match: ATB12_ARATH (Homeobox-leucine zipper protein ATHB-12 OS=Arabidopsis thaliana GN=ATHB-12 PE=1 SV=1)

HSP 1 Score: 177.6 bits (449), Expect = 1.6e-43
Identity = 111/223 (49.78%), Positives = 138/223 (61.88%), Query Frame = 1

Query: 23  KKSMNRRRFSEEQIKSLESIFESESRLEPRKKLQLAGELGLHPRQVAIWFQNKRARWKSK 82
           KKS N++RFSEEQIKSLE IFESE+RLEPRKK+Q+A ELGL PRQVAIWFQNKRARWK+K
Sbjct: 26  KKSNNQKRFSEEQIKSLELIFESETRLEPRKKVQVARELGLQPRQVAIWFQNKRARWKTK 85

Query: 83  QLERDYSVLRANYNTLVSRFEALKKEKQALAIQLQKLNDLVQRSMEET-ESCRG--GLSL 142
           QLE++Y+ LRANYN L S+FE +KKEKQ+L  +LQ+LN+ +QR  EE    C G  GL+L
Sbjct: 86  QLEKEYNTLRANYNNLASQFEIMKKEKQSLVSELQRLNEEMQRPKEEKHHECCGDQGLAL 145

Query: 143 ETIDGKSENGHRTKYESEVKPCVSAEEKSEHELEVLSN---YGSGTKEAYLGLDEPQFRE 202
            +    S   H  K E E +           +  VL N   Y +  K  Y G +E    E
Sbjct: 146 SS----STESHNGKSEPEGR---------LDQGSVLCNDGDYNNNIKTEYFGFEEETDHE 205

Query: 203 -------PAQGSLISTPNWSNLESEGLFSQSNTN-GQWWDFWS 232
                       L S+ NW    S+ L  QS++N   WW+FWS
Sbjct: 206 LMNIVEKADDSCLTSSENWGGFNSDSLLDQSSSNYPNWWEFWS 235

BLAST of CmoCh14G021960 vs. Swiss-Prot
Match: HOX6_ORYSJ (Homeobox-leucine zipper protein HOX6 OS=Oryza sativa subsp. japonica GN=HOX6 PE=2 SV=1)

HSP 1 Score: 134.0 bits (336), Expect = 2.0e-30
Identity = 67/97 (69.07%), Positives = 85/97 (87.63%), Query Frame = 1

Query: 28  RRRFSEEQIKSLESIFESESRLEPRKKLQLAGELGLHPRQVAIWFQNKRARWKSKQLERD 87
           ++RFSEEQIKSLES+F ++++LEPR+KLQLA ELGL PRQVAIWFQNKRARWKSKQLER+
Sbjct: 31  KKRFSEEQIKSLESMFATQTKLEPRQKLQLARELGLQPRQVAIWFQNKRARWKSKQLERE 90

Query: 88  YSVLRANYNTLVSRFEALKKEKQALAIQLQKLNDLVQ 125
           YS LR +Y+ L+  +E+LKKEK AL  QL+KL +++Q
Sbjct: 91  YSALRDDYDALLCSYESLKKEKLALIKQLEKLAEMLQ 127

BLAST of CmoCh14G021960 vs. Swiss-Prot
Match: HOX6_ORYSI (Homeobox-leucine zipper protein HOX6 OS=Oryza sativa subsp. indica GN=HOX6 PE=2 SV=2)

HSP 1 Score: 134.0 bits (336), Expect = 2.0e-30
Identity = 67/97 (69.07%), Positives = 85/97 (87.63%), Query Frame = 1

Query: 28  RRRFSEEQIKSLESIFESESRLEPRKKLQLAGELGLHPRQVAIWFQNKRARWKSKQLERD 87
           ++RFSEEQIKSLES+F ++++LEPR+KLQLA ELGL PRQVAIWFQNKRARWKSKQLER+
Sbjct: 31  KKRFSEEQIKSLESMFATQTKLEPRQKLQLARELGLQPRQVAIWFQNKRARWKSKQLERE 90

Query: 88  YSVLRANYNTLVSRFEALKKEKQALAIQLQKLNDLVQ 125
           YS LR +Y+ L+  +E+LKKEK AL  QL+KL +++Q
Sbjct: 91  YSALRDDYDALLCSYESLKKEKLALIKQLEKLAEMLQ 127

BLAST of CmoCh14G021960 vs. Swiss-Prot
Match: HOX22_ORYSI (Homeobox-leucine zipper protein HOX22 OS=Oryza sativa subsp. indica GN=HOX22 PE=2 SV=2)

HSP 1 Score: 123.6 bits (309), Expect = 2.8e-27
Identity = 71/135 (52.59%), Positives = 92/135 (68.15%), Query Frame = 1

Query: 16  AEAFGMRKKSMNRRRFSEEQIKSLESIFESE-SRLEPRKKLQLAGELGLHPRQVAIWFQN 75
           A A G       +RRF+EEQI+SLES+F +  ++LEPR+K +LA ELGL PRQVAIWFQN
Sbjct: 62  AAAPGRGGAGERKRRFTEEQIRSLESMFHAHHAKLEPREKAELARELGLQPRQVAIWFQN 121

Query: 76  KRARWKSKQLERDYSVLRANYNTLVSRFEALKKEKQALAIQLQKLNDLVQRSMEETESCR 135
           KRARW+SKQLE DY+ LR+ Y+ L SR E+LK+EK AL +QL +L + ++    E  S  
Sbjct: 122 KRARWRSKQLEHDYAALRSKYDALHSRVESLKQEKLALTVQLHELRERLRE--REERSGN 181

Query: 136 GGLSLETIDGKSENG 150
           GG +       S NG
Sbjct: 182 GGAATTAASSSSCNG 194

BLAST of CmoCh14G021960 vs. TrEMBL
Match: A0A0A0L317_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G119240 PE=4 SV=1)

HSP 1 Score: 411.4 bits (1056), Expect = 7.4e-112
Identity = 210/231 (90.91%), Positives = 220/231 (95.24%), Query Frame = 1

Query: 1   MLNDDDAEYSPPESMAEAFGMRKKSMNRRRFSEEQIKSLESIFESESRLEPRKKLQLAGE 60
           MLN+D AEYSPP SMAEAF MRKKSMNRRRFSEEQIKSLESIFESESRLEPRKKLQLAGE
Sbjct: 1   MLNED-AEYSPPASMAEAFAMRKKSMNRRRFSEEQIKSLESIFESESRLEPRKKLQLAGE 60

Query: 61  LGLHPRQVAIWFQNKRARWKSKQLERDYSVLRANYNTLVSRFEALKKEKQALAIQLQKLN 120
           LGLHPRQVAIWFQNKRARWKSKQLERDYSVLRANYNTL SRFEALKKEKQAL +QLQKLN
Sbjct: 61  LGLHPRQVAIWFQNKRARWKSKQLERDYSVLRANYNTLASRFEALKKEKQALTMQLQKLN 120

Query: 121 DLVQRSMEETESCRGGLSLETIDGKSENGHRTKYESEVKPCVSAEEKSEHELEVLSNYGS 180
           +LVQRSMEETESCRG LS+ETIDGKSE  HRTKYESEVKPC+SAEEKSEHELEVLSNYGS
Sbjct: 121 NLVQRSMEETESCRGVLSIETIDGKSEIDHRTKYESEVKPCLSAEEKSEHELEVLSNYGS 180

Query: 181 GTKEAYLGLDEPQFREPAQGSLISTPNWSNLESEGLFSQSNTNGQWWDFWS 232
           G KEAY+GL++PQ RE +QGSLISTPNWSNL+SEGLFSQSNTNGQWWDFWS
Sbjct: 181 GVKEAYIGLEDPQLRESSQGSLISTPNWSNLDSEGLFSQSNTNGQWWDFWS 230

BLAST of CmoCh14G021960 vs. TrEMBL
Match: B9I924_POPTR (Homeobox leucine zipper family protein OS=Populus trichocarpa GN=POPTR_0014s09860g PE=4 SV=1)

HSP 1 Score: 255.8 bits (652), Expect = 5.2e-65
Identity = 143/239 (59.83%), Positives = 175/239 (73.22%), Query Frame = 1

Query: 5   DDAEYSPPESMAEAFGM--------RKKSMNRRRFSEEQIKSLESIFESESRLEPRKKLQ 64
           D  EYSP  S  E F          +KK+  +RRFS+EQIKSLE++FESE+RLEPRKK+Q
Sbjct: 3   DGGEYSP--SATEPFSCLNSVTTSRKKKNKIKRRFSDEQIKSLETMFESETRLEPRKKMQ 62

Query: 65  LAGELGLHPRQVAIWFQNKRARWKSKQLERDYSVLRANYNTLVSRFEALKKEKQALAIQL 124
           LA ELGL PRQVAIWFQNKRARWKSKQLERDYS+LRANYN+L SRFE LKKEKQALAIQL
Sbjct: 63  LARELGLQPRQVAIWFQNKRARWKSKQLERDYSMLRANYNSLASRFETLKKEKQALAIQL 122

Query: 125 QKLNDLVQRSMEETESCRGGLSLETIDGKSENGHRTKYESEVKPCVSAEEKSEHELEVLS 184
           QKLNDL+++ +EE E C  G ++ + +G+SENG  TK ESE KP +S E+  EH L VLS
Sbjct: 123 QKLNDLMKKPVEEGECCGQGAAVNSSEGESENGDATKGESETKPRLSIEQ-PEHGLGVLS 182

Query: 185 NYGSGTKEAYLGLDEP----QFREPAQGSLISTPNWSNLESEGLFSQSNTNGQWWDFWS 232
           +  S  K  Y  L+E        EPA+GSL S  +W +++S+GLF QS++  QWWDFW+
Sbjct: 183 DEDSSIKVDYFELEEEPNLMSMVEPAEGSLTSQEDWGSIDSDGLFDQSSSGYQWWDFWA 238

BLAST of CmoCh14G021960 vs. TrEMBL
Match: B9GR61_POPTR (Homeobox leucine zipper family protein OS=Populus trichocarpa GN=POPTR_0002s17680g PE=4 SV=1)

HSP 1 Score: 250.4 bits (638), Expect = 2.2e-63
Identity = 141/239 (59.00%), Positives = 171/239 (71.55%), Query Frame = 1

Query: 5   DDAEYSPPESMAEAFGM--------RKKSMNRRRFSEEQIKSLESIFESESRLEPRKKLQ 64
           D  EYSP  S  E F          +KK+ N+RRFS+EQIKSLES+FESE+RLEPRKK+Q
Sbjct: 3   DGGEYSP--SATEPFSCMNGVTTSRKKKNKNKRRFSDEQIKSLESMFESETRLEPRKKMQ 62

Query: 65  LAGELGLHPRQVAIWFQNKRARWKSKQLERDYSVLRANYNTLVSRFEALKKEKQALAIQL 124
           LA ELGL PRQVAIWFQNKRARWKSKQLERD+S+LRANYN+L SRFE LKKEKQAL IQL
Sbjct: 63  LAKELGLQPRQVAIWFQNKRARWKSKQLERDFSILRANYNSLASRFETLKKEKQALVIQL 122

Query: 125 QKLNDLVQRSMEETESCRGGLSLETIDGKSENGHRTKYESEVKPCVSAEEKSEHELEVLS 184
           QK+NDL+++  EE E C  G ++ +I+GKSEN   T  ESE  P +S  E+ EH L VLS
Sbjct: 123 QKINDLMKKPGEEGECCGQGPAVNSIEGKSENADTTMGESETNPRLSI-ERPEHGLGVLS 182

Query: 185 NYGSGTKEAYLGLDEP----QFREPAQGSLISTPNWSNLESEGLFSQSNTNGQWWDFWS 232
           +  S  K  Y  L+E        EPA GSL S  +W +L+S+ LF QS+++ QWWDFW+
Sbjct: 183 DEDSSIKAEYFELEEEPNLISMVEPADGSLTSQEDWGSLDSDRLFDQSSSDYQWWDFWA 238

BLAST of CmoCh14G021960 vs. TrEMBL
Match: A0A096YDU1_ROSHC (Homeobox-leucine zipper protein 1 OS=Rosa hybrid cultivar GN=HB1 PE=2 SV=1)

HSP 1 Score: 243.0 bits (619), Expect = 3.5e-61
Identity = 149/251 (59.36%), Positives = 169/251 (67.33%), Query Frame = 1

Query: 1   MLNDDDAEYSPPESMAEAFG-----------MRKKSM--NRRRFSEEQIKSLESIFESES 60
           ML  D   YSP    AEAF            ++KKS   N++RFS+EQI+SLESIFESES
Sbjct: 1   MLEVDRVGYSPSPEEAEAFSYMNDSLGAGSSLKKKSSKNNKKRFSDEQIRSLESIFESES 60

Query: 61  RLEPRKKLQLAGELGLHPRQVAIWFQNKRARWKSKQLERDYSVLRANYNTLVSRFEALKK 120
           RLEPRKKLQLA ELGL PRQVAIWFQNKRARWKSKQLERDYS+LRANYN L SRFEALKK
Sbjct: 61  RLEPRKKLQLAKELGLQPRQVAIWFQNKRARWKSKQLERDYSILRANYNNLASRFEALKK 120

Query: 121 EKQALAIQLQKLNDLVQRSMEETESCRGGLSLETIDGKSENG-HRTKYESEVKPCVSAE- 180
           EKQAL  QLQKLND +QR  EE  +        +IDG+S+NG   T+ ESE KP      
Sbjct: 121 EKQALVTQLQKLNDKMQRPKEERNN--------SIDGESDNGDDATRSESEGKPNYQMSL 180

Query: 181 EKSEHELEVLSNYGSGTKEAYLGLDEP----QFREPAQGSLISTPNWSNLESE-GLFSQS 232
           EKSEH L V+S+  S  K  Y GL+E      F E A GSL S  +W  L S+ GLF QS
Sbjct: 181 EKSEHRLRVMSDDDSSIKAEYFGLEEEPNLVNFVESADGSLTSPEDWGRLNSDGGLFDQS 240

BLAST of CmoCh14G021960 vs. TrEMBL
Match: A0A067JWT5_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14207 PE=4 SV=1)

HSP 1 Score: 237.3 bits (604), Expect = 1.9e-59
Identity = 140/246 (56.91%), Positives = 168/246 (68.29%), Query Frame = 1

Query: 4   DDDAEYSPPESMAEAF----------GMRKKSMNRRRFSEEQIKSLESIFESESRLEPRK 63
           D++  YSP  S  + F            RKK+ N+RRFS+EQIKSLE++FESESRLEPRK
Sbjct: 56  DEEEVYSPCASAEDLFTAMDTALSTTSRRKKTKNKRRFSDEQIKSLETMFESESRLEPRK 115

Query: 64  KLQLAGELGLHPRQVAIWFQNKRARWKSKQLERDYSVLRANYNTLVSRFEALKKEKQALA 123
           KLQLA ELGL PRQVAIWFQNKRARWKSKQLERDYSVLRANYN L SRFE LKKEKQALA
Sbjct: 116 KLQLAKELGLQPRQVAIWFQNKRARWKSKQLERDYSVLRANYNNLASRFETLKKEKQALA 175

Query: 124 IQLQKLNDLVQRSMEETESCRGGLSLETIDGKSENG-HRTKYESEVKPCVSAEEKSEHEL 183
           IQLQKLN+L+++  EE E C          G+ E G + ++ ESE K  +S+ E+S + L
Sbjct: 176 IQLQKLNELIEKPREEGECC----------GEQETGVNSSEGESEAKGVISSFERSRNGL 235

Query: 184 EVLSNYGSGTKEAYLGLDEP------QFREPAQGSLISTPNWSNLESEGLFSQSNT-NGQ 232
            V S+  S  K  Y GL+E          E A GSL S  +W +LES+GLF QSN+ + Q
Sbjct: 236 GVASDEDSSIKVEYFGLEEEPDNNLISMVEAADGSLTSQEDWRSLESDGLFDQSNSGSDQ 291

BLAST of CmoCh14G021960 vs. TAIR10
Match: AT2G46680.1 (AT2G46680.1 homeobox 7)

HSP 1 Score: 177.6 bits (449), Expect = 9.1e-45
Identity = 118/256 (46.09%), Positives = 153/256 (59.77%), Query Frame = 1

Query: 5   DDAEYSPPESMAEAFGMRKKSM-------NRRRFSEEQIKSLESIFESESRLEPRKKLQL 64
           +  EYSP    AE F   KK         N+RRFS+EQIKSLE +FESE+RLEPRKK+QL
Sbjct: 3   EGGEYSPAMMSAEPFLTMKKMKKSNHNKNNQRRFSDEQIKSLEMMFESETRLEPRKKVQL 62

Query: 65  AGELGLHPRQVAIWFQNKRARWKSKQLERDYSVLRANYNTLVSRFEALKKEKQALAIQLQ 124
           A ELGL PRQVAIWFQNKRARWKSKQLE +Y++LR NY+ L S+FE+LKKEKQAL  +LQ
Sbjct: 63  ARELGLQPRQVAIWFQNKRARWKSKQLETEYNILRQNYDNLASQFESLKKEKQALVSELQ 122

Query: 125 KLNDLVQ-RSMEETESCRGG---LSLETIDGKSEN-GHRTKYESEVKPCVSAEEK----- 184
           +L +  Q ++ EE   C G    ++L +   +SEN  +R +   EV+P +  ++      
Sbjct: 123 RLKEATQKKTQEEERQCSGDQAVVALSSTHHESENEENRRRKPEEVRPEMEMKDDKGHHG 182

Query: 185 ---SEHELEVLSN-YGSGTKEAYLG--LDEP----QFREPAQGSLISTPNWSNLESE--G 232
                H+ E   N Y +  K  Y G   +EP       EPA   L S+ +W   +S+   
Sbjct: 183 VMCDHHDYEDDDNGYSNNIKREYFGGFEEEPDHLMNIVEPADSCLTSSDDWRGFKSDTTT 242

BLAST of CmoCh14G021960 vs. TAIR10
Match: AT3G61890.1 (AT3G61890.1 homeobox 12)

HSP 1 Score: 177.6 bits (449), Expect = 9.1e-45
Identity = 111/223 (49.78%), Positives = 138/223 (61.88%), Query Frame = 1

Query: 23  KKSMNRRRFSEEQIKSLESIFESESRLEPRKKLQLAGELGLHPRQVAIWFQNKRARWKSK 82
           KKS N++RFSEEQIKSLE IFESE+RLEPRKK+Q+A ELGL PRQVAIWFQNKRARWK+K
Sbjct: 26  KKSNNQKRFSEEQIKSLELIFESETRLEPRKKVQVARELGLQPRQVAIWFQNKRARWKTK 85

Query: 83  QLERDYSVLRANYNTLVSRFEALKKEKQALAIQLQKLNDLVQRSMEET-ESCRG--GLSL 142
           QLE++Y+ LRANYN L S+FE +KKEKQ+L  +LQ+LN+ +QR  EE    C G  GL+L
Sbjct: 86  QLEKEYNTLRANYNNLASQFEIMKKEKQSLVSELQRLNEEMQRPKEEKHHECCGDQGLAL 145

Query: 143 ETIDGKSENGHRTKYESEVKPCVSAEEKSEHELEVLSN---YGSGTKEAYLGLDEPQFRE 202
            +    S   H  K E E +           +  VL N   Y +  K  Y G +E    E
Sbjct: 146 SS----STESHNGKSEPEGR---------LDQGSVLCNDGDYNNNIKTEYFGFEEETDHE 205

Query: 203 -------PAQGSLISTPNWSNLESEGLFSQSNTN-GQWWDFWS 232
                       L S+ NW    S+ L  QS++N   WW+FWS
Sbjct: 206 LMNIVEKADDSCLTSSENWGGFNSDSLLDQSSSNYPNWWEFWS 235

BLAST of CmoCh14G021960 vs. TAIR10
Match: AT2G22430.1 (AT2G22430.1 homeobox protein 6)

HSP 1 Score: 108.2 bits (269), Expect = 6.8e-24
Identity = 56/129 (43.41%), Positives = 85/129 (65.89%), Query Frame = 1

Query: 13  ESMAEAFGMRKKSMNRRRFSEEQIKSLESIFESESRLEPRKKLQLAGELGLHPRQVAIWF 72
           E++ E  G    S  +RR S  Q+K+LE  FE E++LEP +K++LA ELGL PRQVA+WF
Sbjct: 48  EAIVEERGHVGLSEKKRRLSINQVKALEKNFELENKLEPERKVKLAQELGLQPRQVAVWF 107

Query: 73  QNKRARWKSKQLERDYSVLRANYNTLVSRFEALKKEKQALAIQLQKLNDLVQRSMEETES 132
           QN+RARWK+KQLE+DY VL+  Y++L   F++L+++ ++L  ++ KL   +     E E 
Sbjct: 108 QNRRARWKTKQLEKDYGVLKTQYDSLRHNFDSLRRDNESLLQEISKLKTKLNGGGGEEEE 167

Query: 133 CRGGLSLET 142
                ++ T
Sbjct: 168 EENNAAVTT 176

BLAST of CmoCh14G021960 vs. TAIR10
Match: AT5G65310.1 (AT5G65310.1 homeobox protein 5)

HSP 1 Score: 105.5 bits (262), Expect = 4.4e-23
Identity = 55/122 (45.08%), Positives = 83/122 (68.03%), Query Frame = 1

Query: 28  RRRFSEEQIKSLESIFESESRLEPRKKLQLAGELGLHPRQVAIWFQNKRARWKSKQLERD 87
           +RR   EQ+K+LE  FE +++LEP +K++LA ELGL PRQVAIWFQN+RARWK+KQLERD
Sbjct: 73  KRRLGVEQVKALEKNFEIDNKLEPERKVKLAQELGLQPRQVAIWFQNRRARWKTKQLERD 132

Query: 88  YSVLRANYNTLVSRFEALKKEKQALAIQLQKLNDLVQRSMEETESCRGGLSLETIDGKSE 147
           Y VL++N++ L    ++L+++  +L  Q+++L              +  L++E + G  E
Sbjct: 133 YGVLKSNFDALKRNRDSLQRDNDSLLGQIKEL--------------KAKLNVEGVKGIEE 180

Query: 148 NG 150
           NG
Sbjct: 193 NG 180

BLAST of CmoCh14G021960 vs. TAIR10
Match: AT4G40060.1 (AT4G40060.1 homeobox protein 16)

HSP 1 Score: 104.8 bits (260), Expect = 7.5e-23
Identity = 48/95 (50.53%), Positives = 72/95 (75.79%), Query Frame = 1

Query: 25  SMNRRRFSEEQIKSLESIFESESRLEPRKKLQLAGELGLHPRQVAIWFQNKRARWKSKQL 84
           S  +RR   +Q+K+LE  FE E++LEP +K +LA ELGL PRQVA+WFQN+RARWK+KQL
Sbjct: 57  SEKKRRLKVDQVKALEKNFELENKLEPERKTKLAQELGLQPRQVAVWFQNRRARWKTKQL 116

Query: 85  ERDYSVLRANYNTLVSRFEALKKEKQALAIQLQKL 120
           E+DY VL+  Y++L   F++L+++  +L  ++ K+
Sbjct: 117 EKDYGVLKGQYDSLRHNFDSLRRDNDSLLQEISKI 151

BLAST of CmoCh14G021960 vs. NCBI nr
Match: gi|449432008|ref|XP_004133792.1| (PREDICTED: homeobox-leucine zipper protein ATHB-12 [Cucumis sativus])

HSP 1 Score: 411.4 bits (1056), Expect = 1.1e-111
Identity = 210/231 (90.91%), Positives = 220/231 (95.24%), Query Frame = 1

Query: 1   MLNDDDAEYSPPESMAEAFGMRKKSMNRRRFSEEQIKSLESIFESESRLEPRKKLQLAGE 60
           MLN+D AEYSPP SMAEAF MRKKSMNRRRFSEEQIKSLESIFESESRLEPRKKLQLAGE
Sbjct: 1   MLNED-AEYSPPASMAEAFAMRKKSMNRRRFSEEQIKSLESIFESESRLEPRKKLQLAGE 60

Query: 61  LGLHPRQVAIWFQNKRARWKSKQLERDYSVLRANYNTLVSRFEALKKEKQALAIQLQKLN 120
           LGLHPRQVAIWFQNKRARWKSKQLERDYSVLRANYNTL SRFEALKKEKQAL +QLQKLN
Sbjct: 61  LGLHPRQVAIWFQNKRARWKSKQLERDYSVLRANYNTLASRFEALKKEKQALTMQLQKLN 120

Query: 121 DLVQRSMEETESCRGGLSLETIDGKSENGHRTKYESEVKPCVSAEEKSEHELEVLSNYGS 180
           +LVQRSMEETESCRG LS+ETIDGKSE  HRTKYESEVKPC+SAEEKSEHELEVLSNYGS
Sbjct: 121 NLVQRSMEETESCRGVLSIETIDGKSEIDHRTKYESEVKPCLSAEEKSEHELEVLSNYGS 180

Query: 181 GTKEAYLGLDEPQFREPAQGSLISTPNWSNLESEGLFSQSNTNGQWWDFWS 232
           G KEAY+GL++PQ RE +QGSLISTPNWSNL+SEGLFSQSNTNGQWWDFWS
Sbjct: 181 GVKEAYIGLEDPQLRESSQGSLISTPNWSNLDSEGLFSQSNTNGQWWDFWS 230

BLAST of CmoCh14G021960 vs. NCBI nr
Match: gi|659074881|ref|XP_008437846.1| (PREDICTED: homeobox-leucine zipper protein ATHB-7 [Cucumis melo])

HSP 1 Score: 384.4 bits (986), Expect = 1.4e-103
Identity = 202/231 (87.45%), Positives = 214/231 (92.64%), Query Frame = 1

Query: 1   MLNDDDAEYSPPESMAEAFGMRKKSMNRRRFSEEQIKSLESIFESESRLEPRKKLQLAGE 60
           MLN+D AEYSP  SMAEAF MRKKSMNRRRFSEEQIKSLESIFESESRLEPRKKLQLAGE
Sbjct: 1   MLNED-AEYSPQASMAEAFAMRKKSMNRRRFSEEQIKSLESIFESESRLEPRKKLQLAGE 60

Query: 61  LGLHPRQVAIWFQNKRARWKSKQLERDYSVLRANYNTLVSRFEALKKEKQALAIQLQKLN 120
           LGLHPRQVAIWFQNKRARWKSKQLERDYSVLRANYNTL SRFEALKKEKQAL++QLQKLN
Sbjct: 61  LGLHPRQVAIWFQNKRARWKSKQLERDYSVLRANYNTLASRFEALKKEKQALSMQLQKLN 120

Query: 121 DLVQRSMEETESCRGGLSLETIDGKSENGHRTKYESEVKPCVSAEEKSEHELEVLSNYGS 180
           +L+QRSMEETESCRG LS+ETIDG SEN +RTKYESE KPCVSAEEKSEHELEVLS   S
Sbjct: 121 NLIQRSMEETESCRGVLSVETIDGTSENDNRTKYESEAKPCVSAEEKSEHELEVLS---S 180

Query: 181 GTKEAYLGLDEPQFREPAQGSLISTPNWSNLESEGLFSQSNTNGQWWDFWS 232
           G KEAY+GL++PQ REP   SLISTPNWSNL+SEGLFSQSNTNGQWW+FWS
Sbjct: 181 GVKEAYIGLEDPQLREP---SLISTPNWSNLDSEGLFSQSNTNGQWWNFWS 224

BLAST of CmoCh14G021960 vs. NCBI nr
Match: gi|224130632|ref|XP_002320889.1| (homeobox leucine zipper family protein [Populus trichocarpa])

HSP 1 Score: 255.8 bits (652), Expect = 7.4e-65
Identity = 143/239 (59.83%), Positives = 175/239 (73.22%), Query Frame = 1

Query: 5   DDAEYSPPESMAEAFGM--------RKKSMNRRRFSEEQIKSLESIFESESRLEPRKKLQ 64
           D  EYSP  S  E F          +KK+  +RRFS+EQIKSLE++FESE+RLEPRKK+Q
Sbjct: 3   DGGEYSP--SATEPFSCLNSVTTSRKKKNKIKRRFSDEQIKSLETMFESETRLEPRKKMQ 62

Query: 65  LAGELGLHPRQVAIWFQNKRARWKSKQLERDYSVLRANYNTLVSRFEALKKEKQALAIQL 124
           LA ELGL PRQVAIWFQNKRARWKSKQLERDYS+LRANYN+L SRFE LKKEKQALAIQL
Sbjct: 63  LARELGLQPRQVAIWFQNKRARWKSKQLERDYSMLRANYNSLASRFETLKKEKQALAIQL 122

Query: 125 QKLNDLVQRSMEETESCRGGLSLETIDGKSENGHRTKYESEVKPCVSAEEKSEHELEVLS 184
           QKLNDL+++ +EE E C  G ++ + +G+SENG  TK ESE KP +S E+  EH L VLS
Sbjct: 123 QKLNDLMKKPVEEGECCGQGAAVNSSEGESENGDATKGESETKPRLSIEQ-PEHGLGVLS 182

Query: 185 NYGSGTKEAYLGLDEP----QFREPAQGSLISTPNWSNLESEGLFSQSNTNGQWWDFWS 232
           +  S  K  Y  L+E        EPA+GSL S  +W +++S+GLF QS++  QWWDFW+
Sbjct: 183 DEDSSIKVDYFELEEEPNLMSMVEPAEGSLTSQEDWGSIDSDGLFDQSSSGYQWWDFWA 238

BLAST of CmoCh14G021960 vs. NCBI nr
Match: gi|743790427|ref|XP_011038888.1| (PREDICTED: homeobox-leucine zipper protein ATHB-7 [Populus euphratica])

HSP 1 Score: 251.1 bits (640), Expect = 1.8e-63
Identity = 134/214 (62.62%), Positives = 168/214 (78.50%), Query Frame = 1

Query: 22  RKKSMNRRRFSEEQIKSLESIFESESRLEPRKKLQLAGELGLHPRQVAIWFQNKRARWKS 81
           +KK+  +RRFS+EQIKSLE++FESE+RLEPRKK+QLA ELGL PRQVAIWFQNKRARWKS
Sbjct: 26  KKKNKIKRRFSDEQIKSLETMFESETRLEPRKKMQLAKELGLQPRQVAIWFQNKRARWKS 85

Query: 82  KQLERDYSVLRANYNTLVSRFEALKKEKQALAIQLQKLNDLVQRSMEETESCRGGLSLET 141
           KQLERDYS+LRA+YN+L SRFE LKKEKQALAIQLQKLNDL+++ +EE E C  G ++ +
Sbjct: 86  KQLERDYSMLRASYNSLASRFETLKKEKQALAIQLQKLNDLMKKPVEEGERCGQGAAVNS 145

Query: 142 IDGKSENGHRTKYESEVKPCVSAEEKSEHELEVLSNYGSGTKEAYLGLDEP----QFREP 201
            +G+SENG  TK ESE +P +S E+  EH L VLS+  S  K  Y  L+E        EP
Sbjct: 146 SEGESENGDTTKGESETRPRLSIEQ-PEHGLGVLSDEDSSIKADYFELEEEPNLMSMVEP 205

Query: 202 AQGSLISTPNWSNLESEGLFSQSNTNGQWWDFWS 232
           A+GSL S  +W +L+S+GLF QS+++ QWWDFW+
Sbjct: 206 AEGSLTSQEDWGSLDSDGLFDQSSSDYQWWDFWA 238

BLAST of CmoCh14G021960 vs. NCBI nr
Match: gi|224068066|ref|XP_002302659.1| (homeobox leucine zipper family protein [Populus trichocarpa])

HSP 1 Score: 250.4 bits (638), Expect = 3.1e-63
Identity = 141/239 (59.00%), Positives = 171/239 (71.55%), Query Frame = 1

Query: 5   DDAEYSPPESMAEAFGM--------RKKSMNRRRFSEEQIKSLESIFESESRLEPRKKLQ 64
           D  EYSP  S  E F          +KK+ N+RRFS+EQIKSLES+FESE+RLEPRKK+Q
Sbjct: 3   DGGEYSP--SATEPFSCMNGVTTSRKKKNKNKRRFSDEQIKSLESMFESETRLEPRKKMQ 62

Query: 65  LAGELGLHPRQVAIWFQNKRARWKSKQLERDYSVLRANYNTLVSRFEALKKEKQALAIQL 124
           LA ELGL PRQVAIWFQNKRARWKSKQLERD+S+LRANYN+L SRFE LKKEKQAL IQL
Sbjct: 63  LAKELGLQPRQVAIWFQNKRARWKSKQLERDFSILRANYNSLASRFETLKKEKQALVIQL 122

Query: 125 QKLNDLVQRSMEETESCRGGLSLETIDGKSENGHRTKYESEVKPCVSAEEKSEHELEVLS 184
           QK+NDL+++  EE E C  G ++ +I+GKSEN   T  ESE  P +S  E+ EH L VLS
Sbjct: 123 QKINDLMKKPGEEGECCGQGPAVNSIEGKSENADTTMGESETNPRLSI-ERPEHGLGVLS 182

Query: 185 NYGSGTKEAYLGLDEP----QFREPAQGSLISTPNWSNLESEGLFSQSNTNGQWWDFWS 232
           +  S  K  Y  L+E        EPA GSL S  +W +L+S+ LF QS+++ QWWDFW+
Sbjct: 183 DEDSSIKAEYFELEEEPNLISMVEPADGSLTSQEDWGSLDSDRLFDQSSSDYQWWDFWA 238

The following BLAST results are available for this feature:
Swiss-Prot
Match Name   E-value  Identity (%)  Description
ATHB7_ARATH  1.6e-43  46.09         Homeobox-leucine zipper protein ATHB-7 OS=Arabidopsis thaliana GN=ATHB-7 PE=1 SV...
ATB12_ARATH  1.6e-43  49.78         Homeobox-leucine zipper protein ATHB-12 OS=Arabidopsis thaliana GN=ATHB-12 PE=1 ...
HOX6_ORYSJ   2.0e-30  69.07         Homeobox-leucine zipper protein HOX6 OS=Oryza sativa subsp. japonica GN=HOX6 PE=...
HOX6_ORYSI   2.0e-30  69.07         Homeobox-leucine zipper protein HOX6 OS=Oryza sativa subsp. indica GN=HOX6 PE=2 ...
HOX22_ORYSI  2.8e-27  52.59         Homeobox-leucine zipper protein HOX22 OS=Oryza sativa subsp. indica GN=HOX22 PE=...

TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0L317_CUCSA  7.4e-112  90.91         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G119240 PE=4 SV=1
B9I924_POPTR      5.2e-65   59.83         Homeobox leucine zipper family protein OS=Populus trichocarpa GN=POPTR_0014s0986...
B9GR61_POPTR      2.2e-63   59.00         Homeobox leucine zipper family protein OS=Populus trichocarpa GN=POPTR_0002s1768...
A0A096YDU1_ROSHC  3.5e-61   59.36         Homeobox-leucine zipper protein 1 OS=Rosa hybrid cultivar GN=HB1 PE=2 SV=1
A0A067JWT5_JATCU  1.9e-59   56.91         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14207 PE=4 SV=1

TAIR10
Match Name   E-value  Identity (%)  Description
AT2G46680.1  9.1e-45  46.09         homeobox 7
AT3G61890.1  9.1e-45  49.78         homeobox 12
AT2G22430.1  6.8e-24  43.41         homeobox protein 6
AT5G65310.1  4.4e-23  45.08         homeobox protein 5
AT4G40060.1  7.5e-23  50.53         homeobox protein 16

NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|449432008|ref|XP_004133792.1|  1.1e-111  90.91         PREDICTED: homeobox-leucine zipper protein ATHB-12 [Cucumis sativus]
gi|659074881|ref|XP_008437846.1|  1.4e-103  87.45         PREDICTED: homeobox-leucine zipper protein ATHB-7 [Cucumis melo]
gi|224130632|ref|XP_002320889.1|  7.4e-65   59.83         homeobox leucine zipper family protein [Populus trichocarpa]
gi|743790427|ref|XP_011038888.1|  1.8e-63   62.62         PREDICTED: homeobox-leucine zipper protein ATHB-7 [Populus euphratica]
gi|224068066|ref|XP_002302659.1|  3.1e-63   59.00         homeobox leucine zipper family protein [Populus trichocarpa]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR000047  HTH_motif
IPR001356  Homeobox_dom
IPR003106  Leu_zip_homeo
IPR009057  Homeobox-like_sf
IPR017970  Homeobox_CS
Vocabulary: Molecular Function
Term        Definition
GO:0003677  DNA binding
GO:0003700  transcription factor activity, sequence-specific DNA binding
GO:0043565  sequence-specific DNA binding
Vocabulary: Biological Process
Term        Definition
GO:0006355  regulation of transcription, DNA-templated
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0006355      regulation of transcription, DNA-templated
cellular_component  GO:0005634      nucleus
cellular_component  GO:0005667      transcription factor complex
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0043565      sequence-specific DNA binding
molecular_function  GO:0003700      transcription factor activity, sequence-specific DNA binding
molecular_function  GO:0003677      DNA binding
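
The GO assignment rows can be grouped by category, and the uninformative ontology root terms (GO:0008150 and GO:0005575 in this list) set aside. A minimal sketch, assuming each row is whitespace-separated with the term name as the free-text remainder; `group_go_terms` and `ROOT_TERMS` are hypothetical names introduced for illustration.

```python
from collections import defaultdict

GO_ROWS = """\
biological_process GO:0008150 biological_process
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005634 nucleus
cellular_component GO:0005667 transcription factor complex
cellular_component GO:0005575 cellular_component
molecular_function GO:0043565 sequence-specific DNA binding
molecular_function GO:0003700 transcription factor activity, sequence-specific DNA binding
molecular_function GO:0003677 DNA binding
"""

# Root terms of the three GO namespaces; they carry no specific annotation.
ROOT_TERMS = {"GO:0008150", "GO:0005575", "GO:0003674"}


def group_go_terms(text: str) -> dict:
    """Map category -> list of (accession, term name) in input order."""
    groups = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue
        category, accession, name = line.split(maxsplit=2)
        groups[category].append((accession, name))
    return dict(groups)


groups = group_go_terms(GO_ROWS)
informative = {
    category: [(acc, name) for acc, name in terms if acc not in ROOT_TERMS]
    for category, terms in groups.items()
}
for category, terms in informative.items():
    print(category, [acc for acc, _ in terms])
```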

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh14G021960.1  CmoCh14G021960.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                      Source   Source Term       Source Description                               Alignment
IPR000047  Helix-turn-helix motif               PRINTS   PR00031           HTHREPRESSR                                      coord: 62..78, score: 3.1E-5; coord: 53..62, score: 3.
IPR001356  Homeobox domain                      PFAM     PF00046           Homeobox                                         coord: 27..80, score: 3.3
IPR001356  Homeobox domain                      SMART    SM00389           HOX_1                                            coord: 24..86, score: 1.7
IPR001356  Homeobox domain                      PROFILE  PS50071           HOMEOBOX_2                                       coord: 22..82, score: 17
IPR003106  Leucine zipper, homeobox-associated  PFAM     PF02183           HALZ                                             coord: 82..123, score: 3.9
IPR009057  Homeodomain-like                     GENE3D   G3DSA:1.10.10.60  -                                                coord: 27..89, score: 1.1
IPR009057  Homeodomain-like                     unknown  SSF46689          Homeodomain-like                                 coord: 11..84, score: 4.18
IPR017970  Homeobox, conserved site             PROSITE  PS00027           HOMEOBOX_1                                       coord: 57..80, score: ...
None       No IPR available                     unknown  Coil              Coil                                             coord: 102..129, score: ...
None       No IPR available                     PANTHER  PTHR24326         FAMILY NOT NAMED                                 coord: 3..131, score: 1.4
None       No IPR available                     PANTHER  PTHR24326:SF122   HOMEOBOX-LEUCINE ZIPPER PROTEIN ATHB-12-RELATED  coord: 3..131, score: 1.4
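
The "coord" values in the Alignment column can be used to cut domain regions out of the protein. This is a sketch only, assuming the coordinates are 1-based, inclusive residue positions; the protein string is assembled from the Query lines of the Cucumis sativus alignment above, and `domain_slice` is a hypothetical helper.

```python
# Query protein, assembled from the BLAST Query lines on this page.
PROTEIN = (
    "MLNDDDAEYSPPESMAEAFGMRKKSMNRRRFSEEQIKSLESIFESESRLEPRKKLQLAGE"
    "LGLHPRQVAIWFQNKRARWKSKQLERDYSVLRANYNTLVSRFEALKKEKQALAIQLQKLN"
    "DLVQRSMEETESCRGGLSLETIDGKSENGHRTKYESEVKPCVSAEEKSEHELEVLSNYGS"
    "GTKEAYLGLDEPQFREPAQGSLISTPNWSNLESEGLFSQSNTNGQWWDFWS"
)

# (source term, description, start, end) taken from the table above.
DOMAINS = [
    ("PF00046", "Homeobox", 27, 80),
    ("PF02183", "HALZ", 82, 123),
    ("Coil", "Coil", 102, 129),
]


def domain_slice(protein: str, start: int, end: int) -> str:
    """Return residues start..end, treating both ends as 1-based inclusive."""
    if not 1 <= start <= end <= len(protein):
        raise ValueError("coordinates fall outside the protein")
    return protein[start - 1:end]


for term, description, start, end in DOMAINS:
    region = domain_slice(PROTEIN, start, end)
    print(f"{term} {description} {start}..{end} ({len(region)} aa): {region}")
```

Under these assumptions the Pfam homeobox match covers 54 residues and the adjacent HALZ match 42 residues, consistent with a homeodomain followed directly by a leucine zipper.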

The following gene(s) are paralogous to this gene:

None