Cp4.1LG13g06910 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG13g06910
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Homeobox associated leucine zipper protein
Location: Cp4.1LG13 : 4160635 .. 4162962 (-)
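The location field can be unpacked programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome annotations, but not stated on this page); the function names `parse_location` and `reverse_complement` are hypothetical helpers introduced for illustration.

```python
import re

# Pattern for a location string such as "Cp4.1LG13 : 4160635 .. 4162962 (-)".
LOCATION_RE = re.compile(r"^(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def reverse_complement(seq):
    """Return the reverse complement, as needed for a minus-strand feature."""
    return seq.upper().translate(str.maketrans("ACGTN", "TGCAN"))[::-1]


seqid, start, end, strand = parse_location("Cp4.1LG13 : 4160635 .. 4162962 (-)")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span_length = end - start + 1
print(seqid, strand, span_length)
```

Under that assumption the feature spans 2328 bases on the minus strand of Cp4.1LG13. A sequence cut from the forward strand at these coordinates would need `reverse_complement` before it reads in the gene's orientation.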
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCCGGCTTTGCTTCTTCTCCGCCTCGTCCTTCAGGTAAGCACTCGACTCGTACACTCGTATGTTTAGATTTGTGATTCTGTTGCATTTTGATTAAAGCGAGAGTGAGACGAGGAATTCATGCAGCGAAAGAATTGGACATGAACCGGGTACCGGAGGTAGGAGATGGCGCGGAGGAAGAACAAGCGGTGGCGGAGAGGACGGAGGAGGAAGAGGAGAGCTGCAGCATTAACAATGGAGGTGGTCAGCCAAAGAAGAAGCTTAGTCTATCAAAACATCAGTCTCGTCTCCTTGAAGATATCTTCAGGCATAACCATTCATTAAACCCTGTGAGCAAAATCATCTCAATATTAATTAATGATTTTCTACTTAATTACTTAATTAATTAATTAATTAATTTTATTATTATTTATTATTTTTTATTTAGAAGCAAAAGGAAGCCTTGGCTATGGCACTCCAGCTGAAGCCAAGGCAAGTTGAAGTCTGGTTTCAGAATCGTAGGGCCAGGTAATTTTTATTCATAATTATTTTTCATTTTATTTTTTAAACATTTAATTAGAACAAAAATAAATAAAAGAAATTATTTATTTTTTTATTTTGTTTTAAGATGCTTATTTAAAATTTTGGTTATTAAAGAAATAAATAAGGTTAGTTATTATTAAAATTGAACAATAATAATACTAAATATTTTTATAATTTAGATATACTAATTATTAGTTAATCAAATTTTTGAAAAGCATTAAAAACTAATAAAAGAGTTCTTAAAAAATAAAATACTGAATTAAACTTAATAAAATAAATAATTTTATAGAGTAAAAATAATAAAAACACAAAAGTTTTTTTTTATAAATAAAATAAAATAAGATAATATAAGTTGAATTATTTTGAATATTTTTAATTAAGAAAATAAAAATAAAAAACAATAACTAATATTCAATTAACTAATTTAATCGGACAACTCATATCCAAAATTTGATTTAATAAACCAAAGTTAGAAGCGTTGGACCAATGAGTTCGATCATTGAAAGTTGAGCGAGCCTTGTCTAACTTAAATTACCTTAATTTGTTCTGCTTTTTGTCGTGACCTGACCACAACGAATATAATGACTTTTATGAATTTCTTTATTAAAAAAAAAAAAAAAAAACTATAAATGAAATTGTTATCCAACCAAACCTAAGCCTTTTATAACGCATCGAACAAGAGTATATCTTAAATTTATATATTTTGTTGACTTTGAATTAATTATCGTAGAGTACAAAATGAGAAAAAAATATACATTTATTTTTTTTATTTCATTCAGAAAAGAACCAAAAAAAAAAAAAAAAANNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTAACATTTTTATTAATAACCTTGAAATTCTAAAAAAATTTAAAAAGATACTCTTTTCTCATTTTTTTCCCCTCATTTTTTCAACACTAAATCCATTTTTTTCATGAATTTTTAAAATTTATATAATCATAAATATAAGAAATTTCAATATCCAAACCATAAATTTATTTGAAAAATAATTATTTTGAGAGATTAAAAAAAAAAAAAAAACAAATAAAATCTATCATTCAAATTTGATGTGCTATGTCATTTTTTCTTTTTGTTCTCTATTCAATTTGTCTTAAATAATATTAACTTGCATAATTATTTGAATTAAACTAACAAATTTTGTAAATTATTAACGATCAACATGCTAATTGATATATATTTTTGTATATCGATTTGGTAATGTGACATTTAGTTGGTAAGGTTTGACATTTTAGCATATTCATATTTCTCTAATTCAATAATATATTTTCTTTTATTTTTTTCTTGGACTTGATTTAGGAAGAGGGTTAAAGTTGAATTTTTGAAAAATTGATAATAGTTCGATTTGTTGGTTAACATTTTTGTATTGGATCAACCTGACTGACAAGTAGTACAATAATATTA
TAGGAGCAAGCTGAAGCAAACTGAGATGGAATGTCAGTATTTAAGGAGGTGGTTTGGGTCATTGACGGAGCAAAACCGCCGGCTGCGGCGGGAACTGGAGGAGCTCAGAGCTACTAAGGTTGCTCTCCCAGCCGTCGTCTCAAGCCACGGACGACAGCCACCCATCCCGGAGTCCACTATAACCATGTGTCCCCAGTGCAAGCGCATAACCGCTGCCACTATAACATCTAGTAGGGCTGCAGCCACCCAGACCGCCATTGGCACCACTGCAACGCCATCGAAGGCAGTTCGGTCCACCCTCAAGTTGCAGCAGCCGTCTCAAGGTTGA

mRNA sequence

ATGCCCGGCTTTGCTTCTTCTCCGCCTCGTCCTTCAGCGAGAGTGAGACGAGGAATTCATGCAGCGAAAGAATTGGACATGAACCGGGTACCGGAGGTAGGAGATGGCGCGGAGGAAGAACAAGCGGTGGCGGAGAGGACGGAGGAGGAAGAGGAGAGCTGCAGCATTAACAATGGAGGTGGTCAGCCAAAGAAGAAGCTTAGTCTATCAAAACATCAGTCTCGTCTCCTTGAAGATATCTTCAGGCATAACCATTCATTAAACCCTAAGCAAAAGGAAGCCTTGGCTATGGCACTCCAGCTGAAGCCAAGGCAAGTTGAAGTCTGGTTTCAGAATCGTAGGGCCAGGAGGTGGTTTGGGTCATTGACGGAGCAAAACCGCCGGCTGCGGCGGGAACTGGAGGAGCTCAGAGCTACTAAGGTTGCTCTCCCAGCCGTCGTCTCAAGCCACGGACGACAGCCACCCATCCCGGAGTCCACTATAACCATGTGTCCCCAGTGCAAGCGCATAACCGCTGCCACTATAACATCTAGTAGGGCTGCAGCCACCCAGACCGCCATTGGCACCACTGCAACGCCATCGAAGGCAGTTCGGTCCACCCTCAAGTTGCAGCAGCCGTCTCAAGGTTGA

Coding sequence (CDS)

ATGCCCGGCTTTGCTTCTTCTCCGCCTCGTCCTTCAGCGAGAGTGAGACGAGGAATTCATGCAGCGAAAGAATTGGACATGAACCGGGTACCGGAGGTAGGAGATGGCGCGGAGGAAGAACAAGCGGTGGCGGAGAGGACGGAGGAGGAAGAGGAGAGCTGCAGCATTAACAATGGAGGTGGTCAGCCAAAGAAGAAGCTTAGTCTATCAAAACATCAGTCTCGTCTCCTTGAAGATATCTTCAGGCATAACCATTCATTAAACCCTAAGCAAAAGGAAGCCTTGGCTATGGCACTCCAGCTGAAGCCAAGGCAAGTTGAAGTCTGGTTTCAGAATCGTAGGGCCAGGAGGTGGTTTGGGTCATTGACGGAGCAAAACCGCCGGCTGCGGCGGGAACTGGAGGAGCTCAGAGCTACTAAGGTTGCTCTCCCAGCCGTCGTCTCAAGCCACGGACGACAGCCACCCATCCCGGAGTCCACTATAACCATGTGTCCCCAGTGCAAGCGCATAACCGCTGCCACTATAACATCTAGTAGGGCTGCAGCCACCCAGACCGCCATTGGCACCACTGCAACGCCATCGAAGGCAGTTCGGTCCACCCTCAAGTTGCAGCAGCCGTCTCAAGGTTGA

Protein sequence

MPGFASSPPRPSARVRRGIHAAKELDMNRVPEVGDGAEEEQAVAERTEEEEESCSINNGGGQPKKKLSLSKHQSRLLEDIFRHNHSLNPKQKEALAMALQLKPRQVEVWFQNRRARRWFGSLTEQNRRLRRELEELRATKVALPAVVSSHGRQPPIPESTITMCPQCKRITAATITSSRAAATQTAIGTTATPSKAVRSTLKLQQPSQG
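The protein sequence above corresponds to a translation of the CDS. A minimal sketch of that step follows, assuming the standard nuclear genetic code; `translate` is a hypothetical helper, and only the first 18 codons of the CDS are used to keep the example short.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        # Codons containing ambiguous bases (for example N) become X.
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 54 bases of the CDS listed above.
cds_prefix = "ATGCCCGGCTTTGCTTCTTCTCCGCCTCGTCCTTCAGCGAGAGTGAGACGAGGA"
print(translate(cds_prefix))  # MPGFASSPPRPSARVRRG
```

The output matches the first 18 residues of the protein sequence on this page.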
BLAST of Cp4.1LG13g06910 vs. Swiss-Prot
Match: HOX3_ORYSJ (Homeobox-leucine zipper protein HOX3 OS=Oryza sativa subsp. japonica GN=HOX3 PE=1 SV=1)

HSP 1 Score: 155.6 bits (392), Expect = 5.9e-37
Identity = 94/166 (56.63%), Positives = 115/166 (69.28%), Query Frame = 1

Query: 23  KELDMNRVPEVGDGAEEEQAVAERTEEEEESCSINNGGGQPKKKLSLSKHQSRLLEDIFR 82
           ++LD+N   +   G EEE+      EE+EE   +  GG    KKL LSK QSRLLE+ FR
Sbjct: 41  RDLDIN---QPASGGEEEEFPMGSVEEDEEERGV--GGPHRPKKLRLSKEQSRLLEESFR 100

Query: 83  HNHSLNPKQKEALAMALQLKPRQVEVWFQNRRAR--------------RWFGSLTEQNRR 142
            NH+L PKQKEALA+ L+L+PRQVEVWFQNRRAR              R FGSLTE+NRR
Sbjct: 101 LNHTLTPKQKEALAIKLKLRPRQVEVWFQNRRARTKLKQTEMECEYLKRCFGSLTEENRR 160

Query: 143 LRRELEELRATKVALPAVVSSHGRQPPIPESTITMCPQCKRITAAT 175
           L+RE+EELRA +VA P V+S H RQ P+P S +TMCP+C+RITAAT
Sbjct: 161 LQREVEELRAMRVAPPTVLSPHTRQ-PLPASALTMCPRCERITAAT 200
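The percentages on each HSP statistics line are the identical and positive column counts divided by the alignment length. A short sketch that recomputes them from such a line is given here; the regular expression and the name `recompute_percentages` are assumptions made for illustration, not part of any BLAST tool.

```python
import re

# Matches e.g. "Identity = 94/166 (56.63%), Positives = 115/166 (69.28%)".
# "Pos\w*" tolerates spelling variants of the second label.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Pos\w* = (\d+)/(\d+) \(([\d.]+)%\)"
)


def recompute_percentages(stats_line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(stats_line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    identical, ident_len, _, positive, pos_len, _ = match.groups()
    identity_pct = round(100 * int(identical) / int(ident_len), 2)
    positives_pct = round(100 * int(positive) / int(pos_len), 2)
    return identity_pct, positives_pct


line = "Identity = 94/166 (56.63%), Positives = 115/166 (69.28%), Query Frame = 1"
print(recompute_percentages(line))  # (56.63, 69.28)
```

For the HOX3_ORYSJ hit, 94 identical columns out of 166 gives 56.63% and 115 positive columns out of 166 gives 69.28%, in agreement with the reported values.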

BLAST of Cp4.1LG13g06910 vs. Swiss-Prot
Match: HOX3_ORYSI (Homeobox-leucine zipper protein HOX3 OS=Oryza sativa subsp. indica GN=HOX3 PE=1 SV=1)

HSP 1 Score: 155.6 bits (392), Expect = 5.9e-37
Identity = 94/166 (56.63%), Positives = 115/166 (69.28%), Query Frame = 1

Query: 23  KELDMNRVPEVGDGAEEEQAVAERTEEEEESCSINNGGGQPKKKLSLSKHQSRLLEDIFR 82
           ++LD+N   +   G EEE+      EE+EE   +  GG    KKL LSK QSRLLE+ FR
Sbjct: 41  RDLDIN---QPASGGEEEEFPMGSVEEDEEERGV--GGPHRPKKLRLSKEQSRLLEESFR 100

Query: 83  HNHSLNPKQKEALAMALQLKPRQVEVWFQNRRAR--------------RWFGSLTEQNRR 142
            NH+L PKQKEALA+ L+L+PRQVEVWFQNRRAR              R FGSLTE+NRR
Sbjct: 101 LNHTLTPKQKEALAIKLKLRPRQVEVWFQNRRARTKLKQTEMECEYLKRCFGSLTEENRR 160

Query: 143 LRRELEELRATKVALPAVVSSHGRQPPIPESTITMCPQCKRITAAT 175
           L+RE+EELRA +VA P V+S H RQ P+P S +TMCP+C+RITAAT
Sbjct: 161 LQREVEELRAMRVAPPTVLSPHTRQ-PLPASALTMCPRCERITAAT 200

BLAST of Cp4.1LG13g06910 vs. Swiss-Prot
Match: ATB17_ARATH (Homeobox-leucine zipper protein ATHB-17 OS=Arabidopsis thaliana GN=ATHB-17 PE=2 SV=1)

HSP 1 Score: 143.3 bits (360), Expect = 3.0e-33
Identity = 93/191 (48.69%), Positives = 113/191 (59.16%), Query Frame = 1

Query: 1   MPGFASSPPRPSARVRRGIHAAKELDMNRVPEVGDGAEEEQAVAERTEEEEESCSINNGG 60
           +PGF+SSP   S     G      LDMNR+P   DG +EE              S ++G 
Sbjct: 90  VPGFSSSPL--SDEGSGGGRDQLRLDMNRLPSSEDGDDEE-------------FSHDDGS 149

Query: 61  GQPKKKLSLSKHQSRLLEDIFRHNHSLNPKQKEALAMALQLKPRQVEVWFQNRRAR---- 120
             P+KKL L++ QSRLLED FR NH+LNPKQKE LA  L L+PRQ+EVWFQNRRAR    
Sbjct: 150 APPRKKLRLTREQSRLLEDSFRQNHTLNPKQKEVLAKHLMLRPRQIEVWFQNRRARSKLK 209

Query: 121 ----------RWFGSLTEQNRRLRRELEELRATKVALPAVVSSHGRQPPIPESTITMCPQ 178
                     RWFGSLTE+N RL RE+EELRA KV  P  V+S         S++TMCP+
Sbjct: 210 QTEMECEYLKRWFGSLTEENHRLHREVEELRAMKVG-PTTVNS--------ASSLTMCPR 256

BLAST of Cp4.1LG13g06910 vs. Swiss-Prot
Match: HOX1_ORYSJ (Homeobox-leucine zipper protein HOX1 OS=Oryza sativa subsp. japonica GN=HOX1 PE=1 SV=1)

HSP 1 Score: 112.1 bits (279), Expect = 7.5e-24
Identity = 78/175 (44.57%), Positives = 103/175 (58.86%), Query Frame = 1

Query: 42  AVAERTEEEEESCSINNGGGQPKKKLSLSKHQSRLLEDIFRHNHSLNPKQKEALAMALQL 101
           A A    ++E+S      GG  +KKL LSK Q+ +LED F+ +++LNPKQK ALA  L L
Sbjct: 140 AAAAAASDDEDS------GGGSRKKLRLSKDQAAVLEDTFKEHNTLNPKQKAALARQLNL 199

Query: 102 KPRQVEVWFQNRRAR--------------RWFGSLTEQNRRLRRELEELRATKVALPAVV 161
           KPRQVEVWFQNRRAR              R   +LT++NRRL REL+ELRA K+A  A  
Sbjct: 200 KPRQVEVWFQNRRARTKLKQTEVDCELLKRCCETLTDENRRLHRELQELRALKLATAAAA 259

Query: 162 SSH--GRQPPIPESTITMCPQCKRITAATIT--SSRAAATQTAIGTTATPSKAVR 199
             H  G + P P +T+TMCP C+R+ +A  T  ++  AA    + T   P  A +
Sbjct: 260 PHHLYGARVP-PPTTLTMCPSCERVASAATTTRNNSGAAPARPVPTRPWPPAAAQ 307

BLAST of Cp4.1LG13g06910 vs. Swiss-Prot
Match: HOX1_ORYSI (Homeobox-leucine zipper protein HOX1 OS=Oryza sativa subsp. indica GN=HOX1 PE=1 SV=2)

HSP 1 Score: 112.1 bits (279), Expect = 7.5e-24
Identity = 78/175 (44.57%), Positives = 103/175 (58.86%), Query Frame = 1

Query: 42  AVAERTEEEEESCSINNGGGQPKKKLSLSKHQSRLLEDIFRHNHSLNPKQKEALAMALQL 101
           A A    ++E+S      GG  +KKL LSK Q+ +LED F+ +++LNPKQK ALA  L L
Sbjct: 140 AAAAAASDDEDS------GGGSRKKLRLSKDQAAVLEDTFKEHNTLNPKQKAALARQLNL 199

Query: 102 KPRQVEVWFQNRRAR--------------RWFGSLTEQNRRLRRELEELRATKVALPAVV 161
           KPRQVEVWFQNRRAR              R   +LT++NRRL REL+ELRA K+A  A  
Sbjct: 200 KPRQVEVWFQNRRARTKLKQTEVDCELLKRCCETLTDENRRLHRELQELRALKLATAAAA 259

Query: 162 SSH--GRQPPIPESTITMCPQCKRITAATIT--SSRAAATQTAIGTTATPSKAVR 199
             H  G + P P +T+TMCP C+R+ +A  T  ++  AA    + T   P  A +
Sbjct: 260 PHHLYGARVP-PPTTLTMCPSCERVASAATTTRNNSGAAPARPVPTRPWPPAAAQ 307

BLAST of Cp4.1LG13g06910 vs. TrEMBL
Match: A0A0A0KPM6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G576600 PE=4 SV=1)

HSP 1 Score: 200.3 bits (508), Expect = 2.3e-48
Identity = 128/211 (60.66%), Positives = 146/211 (69.19%), Query Frame = 1

Query: 14  RVRRGIHAAKELDMNRVPEVGDGAEEEQAVAERTEEEEESCSINNGGG-QPKKKLSLSKH 73
           +V+  +   +ELDMNRVP  G+ AEE+ A     EE EE  SINN GG QP+KKL LSK 
Sbjct: 4   KVKVRVCVVRELDMNRVPAEGE-AEEDWARGPSVEEGEEESSINNNGGTQPRKKLRLSKD 63

Query: 74  QSRLLEDIFRHNHSLNPKQKEALAMALQLKPRQVEVWFQNRRAR--------------RW 133
           QSRLLE+ FR NH+LNPKQKE LAM L+LKPRQVEVWFQNRRAR              R 
Sbjct: 64  QSRLLEESFRLNHTLNPKQKEGLAMELKLKPRQVEVWFQNRRARSKLKQTELECEYMKRC 123

Query: 134 FGSLTEQNRRLRRELEELRATKVALPAVVSSHGRQPP-IPESTITMCPQCKRITAATITS 193
           FGSLTEQNRRL+ ELEELRA KVA PAVVS H R PP +  STIT+CP+C+RI    I+S
Sbjct: 124 FGSLTEQNRRLQWELEELRAIKVAPPAVVSRHNRHPPLLMRSTITICPRCERI----ISS 183

Query: 194 SRAAATQTAIGTTATPSKAVRSTLKLQQPSQ 209
               A QTA   TA PSK V S L+L+QPSQ
Sbjct: 184 KNTVADQTATTATAMPSKVVLSALQLRQPSQ 209

BLAST of Cp4.1LG13g06910 vs. TrEMBL
Match: D7T8T6_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g04870 PE=4 SV=1)

HSP 1 Score: 183.0 bits (463), Expect = 3.9e-43
Identity = 106/189 (56.08%), Positives = 131/189 (69.31%), Query Frame = 1

Query: 1   MPGFASSPPRPSARVRRGIHAAKELDMNRVPEVGDGAEEEQAVAERTEEEEESCSINNGG 60
           +PGF SSP  PS+    G+   ++LD+N+VP    GAEEE       +EEE     +  G
Sbjct: 17  VPGFTSSPSLPSS-AGEGVCGVRDLDINQVPL---GAEEEWTTGSMEDEEE-----SGNG 76

Query: 61  GQPKKKLSLSKHQSRLLEDIFRHNHSLNPKQKEALAMALQLKPRQVEVWFQNRRAR---- 120
           G P+KKL LSK QSRLLE+ FR NH+LNPKQKEALAM L+L+PRQVEVWFQNRRAR    
Sbjct: 77  GPPRKKLRLSKDQSRLLEESFRQNHTLNPKQKEALAMQLKLRPRQVEVWFQNRRARSKLK 136

Query: 121 ----------RWFGSLTEQNRRLRRELEELRATKVALPAVVSSHGRQPPIPESTITMCPQ 176
                     RWFGSLTEQNRRL+RE+EELRA KVA P V+S H  + P+P ST+TMCP+
Sbjct: 137 QTEMECEYLKRWFGSLTEQNRRLQREVEELRAMKVAPPTVISPHSCE-PLPASTLTMCPR 195

BLAST of Cp4.1LG13g06910 vs. TrEMBL
Match: A0A061EF99_THECC (Homeobox-leucine zipper protein HOX3 OS=Theobroma cacao GN=TCM_010868 PE=4 SV=1)

HSP 1 Score: 179.9 bits (455), Expect = 3.3e-42
Identity = 110/221 (49.77%), Positives = 141/221 (63.80%), Query Frame = 1

Query: 1   MPGFASSPPRPSARVRRGIHAAKELDMNRVPEVGDGAEEEQAVAERTEEEEESCSINNGG 60
           +PGF+SSP  PS+  + G    ++LD+N+VP    G  E++ +    E+EEESC+    G
Sbjct: 17  VPGFSSSPSLPSSGDQGGC-TVRDLDINQVPS---GGAEDEWITASMEDEEESCN----G 76

Query: 61  GQPKKKLSLSKHQSRLLEDIFRHNHSLNPKQKEALAMALQLKPRQVEVWFQNRRAR---- 120
             P+KKL L+K QSRLLE+ FR NH+LNPKQKEALAM L+L+PRQVEVWFQNRRAR    
Sbjct: 77  APPRKKLRLTKEQSRLLEESFRQNHTLNPKQKEALAMQLKLRPRQVEVWFQNRRARSKLK 136

Query: 121 ----------RWFGSLTEQNRRLRRELEELRATKVALPAVVSSHGRQPPIPESTITMCPQ 180
                     RWFGSLTEQNRRL+RE+EELRA KV  P V+S H  + P+P ST+TMCP+
Sbjct: 137 QTEMECEYLKRWFGSLTEQNRRLQREVEELRAMKVGPPTVISPHSCE-PLPASTLTMCPR 196

Query: 181 CKRITAATITSSRAAATQTAIGTTATPSKAVRSTLKLQQPS 208
           C+R+T   +       T      T   SK   S L+ +  S
Sbjct: 197 CERVTTTALDKGPTKMTAATATATTLSSKVGTSALQSRPSS 228

BLAST of Cp4.1LG13g06910 vs. TrEMBL
Match: A0A0B0MRC6_GOSAR (Homeobox-leucine zipper HOX3 OS=Gossypium arboreum GN=F383_25892 PE=4 SV=1)

HSP 1 Score: 179.5 bits (454), Expect = 4.3e-42
Identity = 108/201 (53.73%), Positives = 138/201 (68.66%), Query Frame = 1

Query: 1   MPGFASSPPRPSARVRRGIHAAKELDMNRVPEVGDGAEEEQAVAERTEEEEESCSINNGG 60
           +PGFASSP  PS+  + G    ++LD+N+VP       E++ +    E+EEESC   N G
Sbjct: 17  VPGFASSPSFPSSGDQGGC-TVRDLDINQVPA------EDEWITASMEDEEESC---NNG 76

Query: 61  GQPKKKLSLSKHQSRLLEDIFRHNHSLNPKQKEALAMALQLKPRQVEVWFQNRRAR---- 120
             P+KKL L+K QSRLLE+ FR NH+LNPKQKEALA+ L+L+PRQVEVWFQNRRAR    
Sbjct: 77  APPRKKLRLTKEQSRLLEESFRLNHTLNPKQKEALALQLKLRPRQVEVWFQNRRARSKLK 136

Query: 121 ----------RWFGSLTEQNRRLRRELEELRATKVALPAVVSSHGRQPPIPESTITMCPQ 180
                     RWFGSLTEQNRRL+RE+EELRA KVA P V+S H  + P+P ST+TMCP+
Sbjct: 137 QTEMECEYLKRWFGSLTEQNRRLQREVEELRAMKVAPPTVISPHSCE-PLPASTLTMCPR 196

Query: 181 CKRITAATITS-SRAAATQTA 187
           C+R+T  T  +  + +A  TA
Sbjct: 197 CERVTTTTTAAIEKGSAKMTA 206

BLAST of Cp4.1LG13g06910 vs. TrEMBL
Match: B9SJ50_RICCO (Homeobox protein, putative OS=Ricinus communis GN=RCOM_0843040 PE=4 SV=1)

HSP 1 Score: 178.7 bits (452), Expect = 7.3e-42
Identity = 113/215 (52.56%), Positives = 140/215 (65.12%), Query Frame = 1

Query: 1   MPGFASSPPRPSARVRRGIHAAKELDMNRVPEVGDGAEEEQAVAERTEEEEESCSINNGG 60
           MPGFASSP  PS          K+LD+N++P    G  EE+ +    E+EEES   N  G
Sbjct: 17  MPGFASSPSVPSYG-----EGVKDLDINQLPA---GVAEEEWITAGIEDEEES---NING 76

Query: 61  GQPKKKLSLSKHQSRLLEDIFRHNHSLNPKQKEALAMALQLKPRQVEVWFQNRRAR---- 120
           G P+KKL LSK QSRLLE+ FR +H+LNP+QKEALAM L+L+PRQVEVWFQNRRAR    
Sbjct: 77  GPPRKKLRLSKEQSRLLEESFRQHHTLNPRQKEALAMQLKLRPRQVEVWFQNRRARSKLK 136

Query: 121 ----------RWFGSLTEQNRRLRRELEELRATKVALPAVVSSHGRQPPIPESTITMCPQ 180
                     RWFGSLTEQNRRL+RE+EELRA KV  P V+S H  + P+P ST+TMCP+
Sbjct: 137 QTEMECEYLKRWFGSLTEQNRRLQREVEELRAMKVGPPTVLSPHSCE-PLPASTLTMCPR 196

Query: 181 CKRITAATITSSRAAATQTAIGTTATPSKAVRSTL 202
           C+R+T +T T++      T   T AT      +TL
Sbjct: 197 CERVTTSTNTAAAFDKGPTRTATPATTPTTAVATL 219

BLAST of Cp4.1LG13g06910 vs. TAIR10
Match: AT2G01430.1 (AT2G01430.1 homeobox-leucine zipper protein 17)

HSP 1 Score: 143.3 bits (360), Expect = 1.7e-34
Identity = 93/191 (48.69%), Positives = 113/191 (59.16%), Query Frame = 1

Query: 1   MPGFASSPPRPSARVRRGIHAAKELDMNRVPEVGDGAEEEQAVAERTEEEEESCSINNGG 60
           +PGF+SSP   S     G      LDMNR+P   DG +EE              S ++G 
Sbjct: 90  VPGFSSSPL--SDEGSGGGRDQLRLDMNRLPSSEDGDDEE-------------FSHDDGS 149

Query: 61  GQPKKKLSLSKHQSRLLEDIFRHNHSLNPKQKEALAMALQLKPRQVEVWFQNRRAR---- 120
             P+KKL L++ QSRLLED FR NH+LNPKQKE LA  L L+PRQ+EVWFQNRRAR    
Sbjct: 150 APPRKKLRLTREQSRLLEDSFRQNHTLNPKQKEVLAKHLMLRPRQIEVWFQNRRARSKLK 209

Query: 121 ----------RWFGSLTEQNRRLRRELEELRATKVALPAVVSSHGRQPPIPESTITMCPQ 178
                     RWFGSLTE+N RL RE+EELRA KV  P  V+S         S++TMCP+
Sbjct: 210 QTEMECEYLKRWFGSLTEENHRLHREVEELRAMKVG-PTTVNS--------ASSLTMCPR 256

BLAST of Cp4.1LG13g06910 vs. TAIR10
Match: AT1G70920.1 (AT1G70920.1 homeobox-leucine zipper protein 18)

HSP 1 Score: 110.9 bits (276), Expect = 9.4e-25
Identity = 81/215 (37.67%), Positives = 111/215 (51.63%), Query Frame = 1

Query: 1   MPGFASSPPRPSARVRRGIHAAKELDMNRVPEVGDGAEEEQAVAERTEEEEESCSINNGG 60
           +P F+ SP           H  ++ D+N+ P+  +  E          E++     +N G
Sbjct: 16  IPSFSPSPSLGDH------HGMRDFDINQTPKTEEDREWMIGATPHVNEDD-----SNSG 75

Query: 61  GQPKKKLSLSKHQSRLLEDIFRHNHSLNPKQKEALAMALQLKPRQVEVWFQNRRAR---- 120
           G+ +KKL L+K QS LLE+ F  NH+L PKQK+ LA  L+L  RQVEVWFQNRRAR    
Sbjct: 76  GRRRKKLRLTKEQSHLLEESFIQNHTLTPKQKKDLATFLKLSQRQVEVWFQNRRARSKLK 135

Query: 121 ----------RWFGSLTEQNRRLRRELEELRATKVALPAVVSSHGRQPPIPESTITMCPQ 180
                     RWFGSL EQNRRL+ E+EELRA K              P   S +TMCP+
Sbjct: 136 HTEMECEYLKRWFGSLKEQNRRLQIEVEELRALK--------------PSSTSALTMCPR 195

Query: 181 CKRITAATITSSRAAATQTAIGTTATPSKAVRSTL 202
           C+R+T A    S A      + + +  + +  S+L
Sbjct: 196 CERVTDAVDNDSNAVQEGAVLSSRSRMTISSSSSL 205

BLAST of Cp4.1LG13g06910 vs. TAIR10
Match: AT5G06710.1 (AT5G06710.1 homeobox from Arabidopsis thaliana)

HSP 1 Score: 104.0 bits (258), Expect = 1.2e-22
Identity = 76/205 (37.07%), Positives = 106/205 (51.71%), Query Frame = 1

Query: 1   MPGFASSPPRPSARVRRGIHAAKELDMNRVPEVGDGAEEEQAVAERTEEEEESCSINNGG 60
           +P  + SPP       +     K     R     D  +E +  A R   E+     ++  
Sbjct: 130 VPSMSVSPPDSVTSSFQLDFGIKSYGYERRSNKRDIDDEVERSASRASNEDN----DDEN 189

Query: 61  GQPKKKLSLSKHQSRLLEDIFRHNHSLNPKQKEALAMALQLKPRQVEVWFQNRRAR---- 120
           G  +KKL LSK QS  LED F+ + +LNPKQK ALA  L L+PRQVEVWFQNRRAR    
Sbjct: 190 GSTRKKLRLSKDQSAFLEDSFKEHSTLNPKQKIALAKQLNLRPRQVEVWFQNRRARTKLK 249

Query: 121 ----------RWFGSLTEQNRRLRRELEELRATKVALPAVVSSHGRQPPIPESTITMCPQ 180
                     R   SLTE+NRRL++E++ELR  K + P  +        +P +T+TMCP 
Sbjct: 250 QTEVDCEYLKRCCESLTEENRRLQKEVKELRTLKTSTPFYMQ-------LPATTLTMCPS 309

Query: 181 CKRITAATITSSRAAATQTAIGTTA 192
           C+R+  +    S +AA    + T++
Sbjct: 310 CERVATSAAQPSTSAAHNLCLSTSS 323

BLAST of Cp4.1LG13g06910 vs. TAIR10
Match: AT4G16780.1 (AT4G16780.1 homeobox protein 2)

HSP 1 Score: 101.3 bits (251), Expect = 7.5e-22
Identity = 73/200 (36.50%), Positives = 106/200 (53.00%), Query Frame = 1

Query: 4   FASSPPRPSARVRRGIHAAKELDMNRVPEVGDGAEEEQAVA-----------ERTEEEEE 63
           F SS P   +  +      + +D+NR P   +  +E+  V+           +R+E EE+
Sbjct: 50  FTSSVPNSDSSQKETRTFIRGIDVNRPPSTAEYGDEDAGVSSPNSTVSSSTGKRSEREED 109

Query: 64  S-------CSINNGGGQPKKKLSLSKHQSRLLEDIFRHNHSLNPKQKEALAMALQLKPRQ 123
           +        S +  G   +KKL LSK QS +LE+ F+ + +LNPKQK+ALA  L L+ RQ
Sbjct: 110 TDPQGSRGISDDEDGDNSRKKLRLSKDQSAILEETFKDHSTLNPKQKQALAKQLGLRARQ 169

Query: 124 VEVWFQNRRA--------------RRWFGSLTEQNRRLRRELEELRATKVALPAVVSSHG 172
           VEVWFQNRRA              RR   +LTE+NRRL++E+ ELRA K      +S   
Sbjct: 170 VEVWFQNRRARTKLKQTEVDCEFLRRCCENLTEENRRLQKEVTELRALK------LSPQF 229

BLAST of Cp4.1LG13g06910 vs. TAIR10
Match: AT2G44910.1 (AT2G44910.1 homeobox-leucine zipper protein 4)

HSP 1 Score: 100.1 bits (248), Expect = 1.7e-21
Identity = 80/203 (39.41%), Positives = 110/203 (54.19%), Query Frame = 1

Query: 36  GAEEEQAVAERTEEEE-ESCSINNGGGQ-------------PKKKLSLSKHQSRLLEDIF 95
           G + + AVA   +E E E  S + GGG               +KKL LSK Q+ +LE+ F
Sbjct: 120 GNKRDLAVARGGDENEAERASCSRGGGSGGSDDEDGGNGDGSRKKLRLSKDQALVLEETF 179

Query: 96  RHNHSLNPKQKEALAMALQLKPRQVEVWFQNRRAR--------------RWFGSLTEQNR 155
           + + +LNPKQK ALA  L L+ RQVEVWFQNRRAR              R   +LTE+NR
Sbjct: 180 KEHSTLNPKQKLALAKQLNLRARQVEVWFQNRRARTKLKQTEVDCEYLKRCCDNLTEENR 239

Query: 156 RLRRELEELRATKVALPAVVSSHGRQPPIPESTITMCPQCKRI--TAATITSS-RAAATQ 208
           RL++E+ ELRA K      +S H      P +T+TMCP C+R+  +AAT+T++     T 
Sbjct: 240 RLQKEVSELRALK------LSPHLYMHMTPPTTLTMCPSCERVSSSAATVTAAPSTTTTP 299

BLAST of Cp4.1LG13g06910 vs. NCBI nr
Match: gi|778704059|ref|XP_011655468.1| (PREDICTED: homeobox-leucine zipper protein HOX3-like [Cucumis sativus])

HSP 1 Score: 203.8 bits (517), Expect = 3.0e-49
Identity = 136/226 (60.18%), Positives = 152/226 (67.26%), Query Frame = 1

Query: 1   MPGFASSP--PRPSARVRRGIHAAKELDMNRVPEVGDGAEEEQAVAERTEEEEESCSINN 60
           MPGF+SSP   RP           +ELDMNRVP  G+ AEE+ A     EE EE  SINN
Sbjct: 18  MPGFSSSPLTTRPLF--------VRELDMNRVPAEGE-AEEDWARGPSVEEGEEESSINN 77

Query: 61  GGG-QPKKKLSLSKHQSRLLEDIFRHNHSLNPKQKEALAMALQLKPRQVEVWFQNRRAR- 120
            GG QP+KKL LSK QSRLLE+ FR NH+LNPKQKE LAM L+LKPRQVEVWFQNRRAR 
Sbjct: 78  NGGTQPRKKLRLSKDQSRLLEESFRLNHTLNPKQKEGLAMELKLKPRQVEVWFQNRRARS 137

Query: 121 -------------RWFGSLTEQNRRLRRELEELRATKVALPAVVSSHGRQPP-IPESTIT 180
                        R FGSLTEQNRRL+ ELEELRA KVA PAVVS H R PP +  STIT
Sbjct: 138 KLKQTELECEYMKRCFGSLTEQNRRLQWELEELRAIKVAPPAVVSRHNRHPPLLMRSTIT 197

Query: 181 MCPQCKRITAATITSSRAAATQTAIGTTATPSKAVRSTLKLQQPSQ 209
           +CP+C+RI    I+S    A QTA   TA PSK V S L+L+QPSQ
Sbjct: 198 ICPRCERI----ISSKNTVADQTATTATAMPSKVVLSALQLRQPSQ 230

BLAST of Cp4.1LG13g06910 vs. NCBI nr
Match: gi|700196341|gb|KGN51518.1| (hypothetical protein Csa_5G576600 [Cucumis sativus])

HSP 1 Score: 200.3 bits (508), Expect = 3.4e-48
Identity = 128/211 (60.66%), Positives = 146/211 (69.19%), Query Frame = 1

Query: 14  RVRRGIHAAKELDMNRVPEVGDGAEEEQAVAERTEEEEESCSINNGGG-QPKKKLSLSKH 73
           +V+  +   +ELDMNRVP  G+ AEE+ A     EE EE  SINN GG QP+KKL LSK 
Sbjct: 4   KVKVRVCVVRELDMNRVPAEGE-AEEDWARGPSVEEGEEESSINNNGGTQPRKKLRLSKD 63

Query: 74  QSRLLEDIFRHNHSLNPKQKEALAMALQLKPRQVEVWFQNRRAR--------------RW 133
           QSRLLE+ FR NH+LNPKQKE LAM L+LKPRQVEVWFQNRRAR              R 
Sbjct: 64  QSRLLEESFRLNHTLNPKQKEGLAMELKLKPRQVEVWFQNRRARSKLKQTELECEYMKRC 123

Query: 134 FGSLTEQNRRLRRELEELRATKVALPAVVSSHGRQPP-IPESTITMCPQCKRITAATITS 193
           FGSLTEQNRRL+ ELEELRA KVA PAVVS H R PP +  STIT+CP+C+RI    I+S
Sbjct: 124 FGSLTEQNRRLQWELEELRAIKVAPPAVVSRHNRHPPLLMRSTITICPRCERI----ISS 183

Query: 194 SRAAATQTAIGTTATPSKAVRSTLKLQQPSQ 209
               A QTA   TA PSK V S L+L+QPSQ
Sbjct: 184 KNTVADQTATTATAMPSKVVLSALQLRQPSQ 209

BLAST of Cp4.1LG13g06910 vs. NCBI nr
Match: gi|659072906|ref|XP_008467159.1| (PREDICTED: homeobox-leucine zipper protein HOX3-like [Cucumis melo])

HSP 1 Score: 191.8 bits (486), Expect = 1.2e-45
Identity = 132/226 (58.41%), Positives = 151/226 (66.81%), Query Frame = 1

Query: 1   MPGFASSPPRPSARVRRGIHAAKELDMNRVPEVGDGAEEEQAVAERTEEE-EESCSI--N 60
           MPGF+SS P     + R      ELDMNRVP  G+ AE + A  +  EEE EE  SI  N
Sbjct: 18  MPGFSSSLPTTRPLLVR------ELDMNRVPAEGE-AEADWARGQSVEEEGEEESSINNN 77

Query: 61  NGGGQPKKKLSLSKHQSRLLEDIFRHNHSLNPKQKEALAMALQLKPRQVEVWFQNRRAR- 120
           NGG QP+KKL LSK QSRLLE+ FR NH+LNPKQKE LAM L+LKPRQVEVWFQNRRAR 
Sbjct: 78  NGGTQPRKKLRLSKDQSRLLEESFRLNHTLNPKQKEGLAMELKLKPRQVEVWFQNRRARS 137

Query: 121 -------------RWFGSLTEQNRRLRRELEELRATKVALPAVVSSHGR-QPPIPESTIT 180
                        R FGSLTEQNRRL+ ELEELRA KVA PAVVS H R QPP+  STIT
Sbjct: 138 KLKQTELECEYMKRCFGSLTEQNRRLQWELEELRAIKVAPPAVVSRHNRSQPPLSRSTIT 197

Query: 181 MCPQCKRITAATITSSRAAATQTAIGTTATPSKAVRSTLKLQQPSQ 209
           +CP+C+RIT+   T +  A T      TA  S+ V S LKL+QPS+
Sbjct: 198 ICPRCERITSNKNTVAENATTT----ATAMQSEVVLSALKLRQPSR 232

BLAST of Cp4.1LG13g06910 vs. NCBI nr
Match: gi|698515821|ref|XP_009802790.1| (PREDICTED: homeobox-leucine zipper protein HOX3-like [Nicotiana sylvestris])

HSP 1 Score: 185.7 bits (470), Expect = 8.5e-44
Identity = 109/190 (57.37%), Positives = 134/190 (70.53%), Query Frame = 1

Query: 1   MPGFASSPPRPSARVRRGIHAAKELDMNRVPEVGDGAEEEQAVAERTEEEEESCSINN-G 60
           +PG +SS   PS+    G +  +ELD+N+VP  G+  EEE ++ E   EEEESC+IN   
Sbjct: 17  IPGSSSSSSFPSSGEGGGYNLMRELDINQVPSNGNINEEEISIEE---EEEESCNINGVN 76

Query: 61  GGQPKKKLSLSKHQSRLLEDIFRHNHSLNPKQKEALAMALQLKPRQVEVWFQNRRAR--- 120
           GG+P+KKL L+K QS LLE+ FR NH+LNPKQKEALAM L+LKPRQVEVWFQNRRAR   
Sbjct: 77  GGRPRKKLRLTKEQSFLLEESFRQNHTLNPKQKEALAMQLKLKPRQVEVWFQNRRARSKL 136

Query: 121 -----------RWFGSLTEQNRRLRRELEELRATKVALPAVVSSHGRQPPIPESTITMCP 176
                      RWFGSLTEQNRRL++E++ELRA KV  P V+S H  QP  P ST+TMCP
Sbjct: 137 KQTELECEYLKRWFGSLTEQNRRLKKEVQELRAMKVGPPTVLSPHSCQPR-PASTLTMCP 196

BLAST of Cp4.1LG13g06910 vs. NCBI nr
Match: gi|731369662|ref|XP_010657445.1| (PREDICTED: homeobox-leucine zipper protein HOX3 [Vitis vinifera])

HSP 1 Score: 183.0 bits (463), Expect = 5.5e-43
Identity = 106/189 (56.08%), Positives = 131/189 (69.31%), Query Frame = 1

Query: 1   MPGFASSPPRPSARVRRGIHAAKELDMNRVPEVGDGAEEEQAVAERTEEEEESCSINNGG 60
           +PGF SSP  PS+    G+   ++LD+N+VP    GAEEE       +EEE     +  G
Sbjct: 17  VPGFTSSPSLPSS-AGEGVCGVRDLDINQVPL---GAEEEWTTGSMEDEEE-----SGNG 76

Query: 61  GQPKKKLSLSKHQSRLLEDIFRHNHSLNPKQKEALAMALQLKPRQVEVWFQNRRAR---- 120
           G P+KKL LSK QSRLLE+ FR NH+LNPKQKEALAM L+L+PRQVEVWFQNRRAR    
Sbjct: 77  GPPRKKLRLSKDQSRLLEESFRQNHTLNPKQKEALAMQLKLRPRQVEVWFQNRRARSKLK 136

Query: 121 ----------RWFGSLTEQNRRLRRELEELRATKVALPAVVSSHGRQPPIPESTITMCPQ 176
                     RWFGSLTEQNRRL+RE+EELRA KVA P V+S H  + P+P ST+TMCP+
Sbjct: 137 QTEMECEYLKRWFGSLTEQNRRLQREVEELRAMKVAPPTVISPHSCE-PLPASTLTMCPR 195

The following BLAST results are available for this feature:

Swiss-Prot
Match Name   E-value  Identity (%)  Description
HOX3_ORYSJ   5.9e-37  56.63         Homeobox-leucine zipper protein HOX3 OS=Oryza sativa subsp. japonica GN=HOX3 PE=...
HOX3_ORYSI   5.9e-37  56.63         Homeobox-leucine zipper protein HOX3 OS=Oryza sativa subsp. indica GN=HOX3 PE=1 ...
ATB17_ARATH  3.0e-33  48.69         Homeobox-leucine zipper protein ATHB-17 OS=Arabidopsis thaliana GN=ATHB-17 PE=2 ...
HOX1_ORYSJ   7.5e-24  44.57         Homeobox-leucine zipper protein HOX1 OS=Oryza sativa subsp. japonica GN=HOX1 PE=...
HOX1_ORYSI   7.5e-24  44.57         Homeobox-leucine zipper protein HOX1 OS=Oryza sativa subsp. indica GN=HOX1 PE=1 ...

TrEMBL
Match Name        E-value  Identity (%)  Description
A0A0A0KPM6_CUCSA  2.3e-48  60.66         Uncharacterized protein OS=Cucumis sativus GN=Csa_5G576600 PE=4 SV=1
D7T8T6_VITVI      3.9e-43  56.08         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g04870 PE=4 SV=...
A0A061EF99_THECC  3.3e-42  49.77         Homeobox-leucine zipper protein HOX3 OS=Theobroma cacao GN=TCM_010868 PE=4 SV=1
A0A0B0MRC6_GOSAR  4.3e-42  53.73         Homeobox-leucine zipper HOX3 OS=Gossypium arboreum GN=F383_25892 PE=4 SV=1
B9SJ50_RICCO      7.3e-42  52.56         Homeobox protein, putative OS=Ricinus communis GN=RCOM_0843040 PE=4 SV=1

TAIR10
Match Name   E-value  Identity (%)  Description
AT2G01430.1  1.7e-34  48.69         homeobox-leucine zipper protein 17
AT1G70920.1  9.4e-25  37.67         homeobox-leucine zipper protein 18
AT5G06710.1  1.2e-22  37.07         homeobox from Arabidopsis thaliana
AT4G16780.1  7.5e-22  36.50         homeobox protein 2
AT2G44910.1  1.7e-21  39.41         homeobox-leucine zipper protein 4

NCBI nr
Match Name                        E-value  Identity (%)  Description
gi|778704059|ref|XP_011655468.1|  3.0e-49  60.18         PREDICTED: homeobox-leucine zipper protein HOX3-like [Cucumis sativus]
gi|700196341|gb|KGN51518.1|       3.4e-48  60.66         hypothetical protein Csa_5G576600 [Cucumis sativus]
gi|659072906|ref|XP_008467159.1|  1.2e-45  58.41         PREDICTED: homeobox-leucine zipper protein HOX3-like [Cucumis melo]
gi|698515821|ref|XP_009802790.1|  8.5e-44  57.37         PREDICTED: homeobox-leucine zipper protein HOX3-like [Nicotiana sylvestris]
gi|731369662|ref|XP_010657445.1|  5.5e-43  56.08         PREDICTED: homeobox-leucine zipper protein HOX3 [Vitis vinifera]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0043565  sequence-specific DNA binding
GO:0003700  transcription factor activity, sequence-specific DNA binding
GO:0003677  DNA binding
Vocabulary: Biological Process
Term        Definition
GO:0006355  regulation of transcription, DNA-templated
Vocabulary: INTERPRO
Term        Definition
IPR009057   Homeobox-like_sf
IPR003106   Leu_zip_homeo
IPR001356   Homeobox_dom
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006355      regulation of transcription, DNA-templated
cellular_component  GO:0005667      transcription factor complex
cellular_component  GO:0005634      nucleus
molecular_function  GO:0043565      sequence-specific DNA binding
molecular_function  GO:0003700      transcription factor activity, sequence-specific DNA binding
molecular_function  GO:0003677      DNA binding
molecular_function  GO:0044212      transcription regulatory region DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG13g06910.1  Cp4.1LG13g06910.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                      Source   Source Term       Source Description                               Alignment
IPR001356  Homeobox domain                      PFAM     PF00046           Homeobox                                         coord: 64..116; score: 6.9
IPR001356  Homeobox domain                      SMART    SM00389           HOX_1                                            coord: 62..124; score: 2.3
IPR001356  Homeobox domain                      PROFILE  PS50071           HOMEOBOX_2                                       coord: 60..120; score: 15
IPR003106  Leucine zipper, homeobox-associated  SMART    SM00340           halz                                             coord: 104..149; score: 1.
IPR009057  Homeodomain-like                     GENE3D   G3DSA:1.10.10.60                                                   coord: 70..116; score: 3.5
IPR009057  Homeodomain-like                     unknown  SSF46689          Homeodomain-like                                 coord: 53..116; score: 1.07
None       No IPR available                     unknown  Coil              Coil                                             coord: 119..139; scor
None       No IPR available                     PANTHER  PTHR24326         FAMILY NOT NAMED                                 coord: 38..184; score: 4.5
None       No IPR available                     PANTHER  PTHR24326:SF114   HOMEOBOX-LEUCINE ZIPPER PROTEIN ATHB-17-RELATED  coord: 38..184; score: 4.5
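
The `coord` values locate each domain on the protein sequence. As a rough sketch, assuming the coordinates are 1-based and inclusive (not stated on this page), the PFAM homeobox region (64..116) can be cut out of the protein as follows; `extract_region` is a hypothetical helper name.

```python
# Protein sequence of Cp4.1LG13g06910 as listed on this page.
PROTEIN = (
    "MPGFASSPPRPSARVRRGIHAAKELDMNRVPEVGDGAEEEQAVAERTEEEEESCSINNGGGQPKKKLSLSKHQSRLLEDIFR"
    "HNHSLNPKQKEALAMALQLKPRQVEVWFQNRRARRWFGSLTEQNRRLRRELEELRATKVALPAVVSSHGRQPPIPESTITMC"
    "PQCKRITAATITSSRAAATQTAIGTTATPSKAVRSTLKLQQPSQG"
)


def extract_region(protein, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(protein):
        raise ValueError("coordinates fall outside the protein")
    return protein[start - 1:end]


# PFAM PF00046 (Homeobox) is annotated at coord: 64..116.
homeobox = extract_region(PROTEIN, 64, 116)
print(len(homeobox), homeobox)
```

Under this assumption the region is 53 residues long and ends in the WFQNRRAR stretch that is also conserved across the BLAST alignments above.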

The following gene(s) are paralogous to this gene:

None