Cp4.1LG01g09070 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG01g09070
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionHomeobox associated leucine zipper protein
LocationCp4.1LG01 : 4473872 .. 4476141 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GCAATTTAGAGTTCATTTCATTTGGGAATTGGGGAAAGCCAACTCCAGCATGACCTAACCCCAGTCAGATTGTAACCCAAGCTTGATTTATTTAAAGAGAGAGAAAATGTGAAGGCTGCCTGGGTTGTGAAAGCTGCTTCCCTAACCTCCTTCCCATTTCCTTCTCTTTCCCTCTCCTCCTCTGGTTCTACGTTACTTTCTAGATCACCCTTTTTCTTTAATCCAATAATCTTTGCTTTTGTTGCTTGATTCGAGTGCTCTTCATCTGGGTTCTTGTTGTTTTGGAGTGGGAATCTTAAATGGGTTCTGATGGAGGAGAAATGTTGATGAATGGTGATGATTCAGGGAAGGAGAACTGTTTGTACTGTAATTATGAACAGCATTGTACAACTCCTTCTTCTTTTTCTTTCACCGCTTTGATCCACACACTTTAACAAACAAAGAACAACAACAAATCATAATCTGGAACAAAAATCAACCCATCTCATCATCATCTCATCTTCTAAGTTTTCTGTTACTCTACTGTGTGTGTGTTTAATTTTGTGGCGATTCCCAATCATGAAGAGGCCTGCAGATTCCATGGGTGCGCTCATGTCCATTTCCCCAACTACAGGTTCTACTTTGCTGCTCTGCTTCTTTTTCTAAAAGCAAAATTATAACCATTATCTGAATTCCCCCTTCCTTTTCTTCTTCTACTTGTAATTTTTACACATACCCATGTGCCTTTCCTTTTCTCTGTCTCCTCCAACTTTGTTCTTTGTTAATAATCAATCAATCGGTCTCTTTTCCTTTCTGGGTTTTGACGGATTCGATCAATTTTTTACTGATTGTCTTCCAACTTTGTGTACAGATCAAGAACAGAGTCCGAGAAACAACGGTGTGAATGGCACGGAATTCCAGTCGATGCTGGATGGATTTGGTGAAGAAGGTTACGTTGAAGAATTGGGACATGTTTCTGAGAAGAAGAGACGGCTGAGTGTGGAGCAAGTTAAGGCTCTAGAGAAGAATTTCGAAGTTGAAAACAAGCTCGAACCAGAGAGGAAAGTGAAGCTGGCTCGAGAACTTGGGTTACAGTCCCGACAAGTGGCTGTTTGGTTCCAAAATCGTCGAGCCAGATGGAAAACCAAGCAATTGGAAAGAGATTATGGCGTTCTCAAAACCAATTACCAGAATCTCAAACTCACTTATGAAACTCTCCAACACGACAATCAAGCTCTCCTTAAAGAGGTACGATAATTACAATTTAAGAATACAAAACCCATCCTCTGTTTTTTTTTTTTTTTTTTCCAGGATCAGAAATGTGATGACTGTAAATTGGAATTTGCAGATTCAGGAACTGAAAAAGAAGCTTCAAGAAGATAACTCAGAGAGCAATCTTTCGGTGGAGGAAGAGATGACGGTGCCGGCCGATTCTGAGAATGCTCTGATTGAACAAATTAAGCCGGAAATTACCGATCAGTTCTCTGTTCCACTAGCAACGGAGACCCAAGACTTCAATTACGAGAGCCTCCACAGCAATGGCGGAGAAGGGGAAGAAGTCTCATTGTTCCCCGATTTCAAAGATGGGTCATCCGATAGCGATTCAAGCGCAATTTTGAACGAAGATTACGGTCCAACGGTGGCCATTTCTTCACCGGTGGTTCTGCAACACAACCACCAACACTTCATGACAGGGGCAGCATCTCCGTCTCCTTCCCCCGACGTGAAACTGAACTGCGCAACGACGGCGCTGAATTACTTGCAATATCAAAAAGGGTATCAACAACAAACACAGATGTTTCCGAAAATGGAGGAGCATAATTTCTTCAGTGGAGAGGAGACTTGTAACTTCTTTTCCGACGAGCAAGCTCCGACTCTGCACTGGTGGAGCTAATTCCATGGCGGCATAAAACGAAATTTAAGAGAGAAAATAAAGATGCGAAGTAAATTTCAAAATTTGAAAATCGATTGGATGAAGAAGATGACGATCGCGGATGGAGAAATTCAGGTGGGGATTCTCAATTAAAAAATAATTTGGGCTAATTTTTACTTTTTCATCCAGTTCGTGAGGGAAAATGATGATGAGATGGTAATTGAAATCTACGTTGTAAACAAAAAATAAAATTATCCGACAGCTTATCATAACAATATTTAGCTAACTATATAAATGTCTGTGAATTTTAATAATTATAAACAATATAAAATTTTATAGATGTAATTAATTTATTATTAAAATTCTATTTCTAAATTTGTAAATAAAAAAAGGGCAGGAATCATTTATGATATTTAAA

mRNA sequence

GCAATTTAGAGTTCATTTCATTTGGGAATTGGGGAAAGCCAACTCCAGCATGACCTAACCCCAGTCAGATTGTAACCCAAGCTTGATTTATTTAAAGAGAGAGAAAATGTGAAGGCTGCCTGGGTTGTGAAAGCTGCTTCCCTAACCTCCTTCCCATTTCCTTCTCTTTCCCTCTCCTCCTCTGGTTCTACGTTACTTTCTAGATCACCCTTTTTCTTTAATCCAATAATCTTTGCTTTTGTTGCTTGATTCGAGTGCTCTTCATCTGGGTTCTTGTTGTTTTGGAGTGGGAATCTTAAATGGGTTCTGATGGAGGAGAAATGTTGATGAATGGTGATGATTCAGGGAAGGAGAACTGTTTGTACTGTAATTATGAACAGCATTGTACAACTCCTTCTTCTTTTTCTTTCACCGCTTTGATCCACACACTTTAACAAACAAAGAACAACAACAAATCATAATCTGGAACAAAAATCAACCCATCTCATCATCATCTCATCTTCTAAGTTTTCTGTTACTCTACTGTGTGTGTGTTTAATTTTGTGGCGATTCCCAATCATGAAGAGGCCTGCAGATTCCATGGGTGCGCTCATGTCCATTTCCCCAACTACAGATCAAGAACAGAGTCCGAGAAACAACGGTGTGAATGGCACGGAATTCCAGTCGATGCTGGATGGATTTGGTGAAGAAGGTTACGTTGAAGAATTGGGACATGTTTCTGAGAAGAAGAGACGGCTGAGTGTGGAGCAAGTTAAGGCTCTAGAGAAGAATTTCGAAGTTGAAAACAAGCTCGAACCAGAGAGGAAAGTGAAGCTGGCTCGAGAACTTGGGTTACAGTCCCGACAAGTGGCTGTTTGGTTCCAAAATCGTCGAGCCAGATGGAAAACCAAGCAATTGGAAAGAGATTATGGCGTTCTCAAAACCAATTACCAGAATCTCAAACTCACTTATGAAACTCTCCAACACGACAATCAAGCTCTCCTTAAAGAGATTCAGGAACTGAAAAAGAAGCTTCAAGAAGATAACTCAGAGAGCAATCTTTCGGTGGAGGAAGAGATGACGGTGCCGGCCGATTCTGAGAATGCTCTGATTGAACAAATTAAGCCGGAAATTACCGATCAGTTCTCTGTTCCACTAGCAACGGAGACCCAAGACTTCAATTACGAGAGCCTCCACAGCAATGGCGGAGAAGGGGAAGAAGTCTCATTGTTCCCCGATTTCAAAGATGGGTCATCCGATAGCGATTCAAGCGCAATTTTGAACGAAGATTACGGTCCAACGGTGGCCATTTCTTCACCGGTGGTTCTGCAACACAACCACCAACACTTCATGACAGGGGCAGCATCTCCGTCTCCTTCCCCCGACGTGAAACTGAACTGCGCAACGACGGCGCTGAATTACTTGCAATATCAAAAAGGGTATCAACAACAAACACAGATGTTTCCGAAAATGGAGGAGCATAATTTCTTCAGTGGAGAGGAGACTTGTAACTTCTTTTCCGACGAGCAAGCTCCGACTCTGCACTGGTGGAGCTAATTCCATGGCGGCATAAAACGAAATTTAAGAGAGAAAATAAAGATGCGAAGTAAATTTCAAAATTTGAAAATCGATTGGATGAAGAAGATGACGATCGCGGATGGAGAAATTCAGGTGGGGATTCTCAATTAAAAAATAATTTGGGCTAATTTTTACTTTTTCATCCAGTTCGTGAGGGAAAATGATGATGAGATGGTAATTGAAATCTACGTTGTAAACAAAAAATAAAATTATCCGACAGCTTATCATAACAATATTTAGCTAACTATATAAATGTCTGTGAATTTTAATAATTATAAACAATATAAAATTTTATAGATGTAATTAATTTATTATTAAAATTCTATTTCTAAATTTGTAAATAAAAAAAGGGCAGGAATCATTTATGATATTTAAA

Coding sequence (CDS)

ATGAAGAGGCCTGCAGATTCCATGGGTGCGCTCATGTCCATTTCCCCAACTACAGATCAAGAACAGAGTCCGAGAAACAACGGTGTGAATGGCACGGAATTCCAGTCGATGCTGGATGGATTTGGTGAAGAAGGTTACGTTGAAGAATTGGGACATGTTTCTGAGAAGAAGAGACGGCTGAGTGTGGAGCAAGTTAAGGCTCTAGAGAAGAATTTCGAAGTTGAAAACAAGCTCGAACCAGAGAGGAAAGTGAAGCTGGCTCGAGAACTTGGGTTACAGTCCCGACAAGTGGCTGTTTGGTTCCAAAATCGTCGAGCCAGATGGAAAACCAAGCAATTGGAAAGAGATTATGGCGTTCTCAAAACCAATTACCAGAATCTCAAACTCACTTATGAAACTCTCCAACACGACAATCAAGCTCTCCTTAAAGAGATTCAGGAACTGAAAAAGAAGCTTCAAGAAGATAACTCAGAGAGCAATCTTTCGGTGGAGGAAGAGATGACGGTGCCGGCCGATTCTGAGAATGCTCTGATTGAACAAATTAAGCCGGAAATTACCGATCAGTTCTCTGTTCCACTAGCAACGGAGACCCAAGACTTCAATTACGAGAGCCTCCACAGCAATGGCGGAGAAGGGGAAGAAGTCTCATTGTTCCCCGATTTCAAAGATGGGTCATCCGATAGCGATTCAAGCGCAATTTTGAACGAAGATTACGGTCCAACGGTGGCCATTTCTTCACCGGTGGTTCTGCAACACAACCACCAACACTTCATGACAGGGGCAGCATCTCCGTCTCCTTCCCCCGACGTGAAACTGAACTGCGCAACGACGGCGCTGAATTACTTGCAATATCAAAAAGGGTATCAACAACAAACACAGATGTTTCCGAAAATGGAGGAGCATAATTTCTTCAGTGGAGAGGAGACTTGTAACTTCTTTTCCGACGAGCAAGCTCCGACTCTGCACTGGTGGAGCTAA

Protein sequence

MKRPADSMGALMSISPTTDQEQSPRNNGVNGTEFQSMLDGFGEEGYVEELGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQSRQVAVWFQNRRARWKTKQLERDYGVLKTNYQNLKLTYETLQHDNQALLKEIQELKKKLQEDNSESNLSVEEEMTVPADSENALIEQIKPEITDQFSVPLATETQDFNYESLHSNGGEGEEVSLFPDFKDGSSDSDSSAILNEDYGPTVAISSPVVLQHNHQHFMTGAASPSPSPDVKLNCATTALNYLQYQKGYQQQTQMFPKMEEHNFFSGEETCNFFSDEQAPTLHWWS
BLAST of Cp4.1LG01g09070 vs. Swiss-Prot
Match: ATHB6_ARATH (Homeobox-leucine zipper protein ATHB-6 OS=Arabidopsis thaliana GN=ATHB-6 PE=1 SV=1)

HSP 1 Score: 227.3 bits (578), Expect = 2.5e-58
Identity = 159/342 (46.49%), Postives = 203/342 (59.36%), Query Frame = 1

Query: 1   MKR--PADSMGALMSISPTTD-QEQSPRNNGVNGTEFQSMLDGFGEE--GYVEELGHV-- 60
           MKR   +DS+G L+S+ PTT   EQSPR  G  G EFQSML+G+ EE    VEE GHV  
Sbjct: 2   MKRLSSSDSVGGLISLCPTTSTDEQSPRRYG--GREFQSMLEGYEEEEEAIVEERGHVGL 61

Query: 61  SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQSRQVAVWFQNRRARWKTKQL 120
           SEKKRRLS+ QVKALEKNFE+ENKLEPERKVKLA+ELGLQ RQVAVWFQNRRARWKTKQL
Sbjct: 62  SEKKRRLSINQVKALEKNFELENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQL 121

Query: 121 ERDYGVLKTNYQNLKLTYETLQHDNQALLKEIQELKKKL-----QEDNSESNLSVEEEMT 180
           E+DYGVLKT Y +L+  +++L+ DN++LL+EI +LK KL     +E+  E+N +V  E  
Sbjct: 122 EKDYGVLKTQYDSLRHNFDSLRRDNESLLQEISKLKTKLNGGGGEEEEEENNAAVTTESD 181

Query: 181 VPADSENALIEQIKPEITDQFSVP--LATETQDFNYES---LHSNGGEGEEVSLFPDFKD 240
           +    E   + +   +IT+  S P      +   NY S   L          S F     
Sbjct: 182 ISVKEEEVSLPE---KITEAPSSPPQFLEHSDGLNYRSFTDLRDLLPLKAAASSFAAAAG 241

Query: 241 GSSDSDSSAILNEDYGPTVAISSPVVLQHNHQHFMTGAASPSPSPDVKLNCATTALNYLQ 300
            S  SDSSA+LNE+    V +++PV +                             N+ Q
Sbjct: 242 SSDSSDSSALLNEESSSNVTVAAPVTVP--------------------------GGNFFQ 301

Query: 301 YQKGYQQQTQMFPKMEEHNFFSGEETCNFFSDEQAPTLHWWS 326
           + K   +QT+     +  +F SGEE C FFSDEQ P+LHW+S
Sbjct: 302 FVK--MEQTE-----DHEDFLSGEEACEFFSDEQPPSLHWYS 305

BLAST of Cp4.1LG01g09070 vs. Swiss-Prot
Match: ATB16_ARATH (Homeobox-leucine zipper protein ATHB-16 OS=Arabidopsis thaliana GN=ATHB-16 PE=2 SV=2)

HSP 1 Score: 188.3 bits (477), Expect = 1.3e-46
Identity = 147/341 (43.11%), Postives = 195/341 (57.18%), Query Frame = 1

Query: 1   MKRPADSMGALMSISPTTDQEQSPRNNGVNGTEFQSMLDGFGEEG-YVEELGH------V 60
           MKR + S      IS +TD EQSPR  G N   +QSML+G+ E+   +EE         +
Sbjct: 1   MKRLSSSDSMCGLISTSTD-EQSPRGYGSN---YQSMLEGYDEDATLIEEYSGNHHHMGL 60

Query: 61  SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQSRQVAVWFQNRRARWKTKQL 120
           SEKKRRL V+QVKALEKNFE+ENKLEPERK KLA+ELGLQ RQVAVWFQNRRARWKTKQL
Sbjct: 61  SEKKRRLKVDQVKALEKNFELENKLEPERKTKLAQELGLQPRQVAVWFQNRRARWKTKQL 120

Query: 121 ERDYGVLKTNYQNLKLTYETLQHDNQALLKEIQELKKKL--QEDNSESNL---SVEEEMT 180
           E+DYGVLK  Y +L+  +++L+ DN +LL+EI ++K K+  +EDN+ +      V+EE  
Sbjct: 121 EKDYGVLKGQYDSLRHNFDSLRRDNDSLLQEISKIKAKVNGEEDNNNNKAITEGVKEEEV 180

Query: 181 VPADSENALIEQIKPEITDQFSVPLATETQDFNYESLHSNGGEGEEVSLFPD---FKDGS 240
              DS         P    QF       +  FNY    ++  +     L P+    + GS
Sbjct: 181 HKTDSI--------PSSPLQF----LEHSSGFNYRRSFTDLRD-----LLPNSTVVEAGS 240

Query: 241 SDS-DSSAILNEDYGPTVAISSPVVLQHNHQHFMTGAASPSPSPDVKLNCATTALNYLQY 300
           SDS DSSA+LN++                     T + +   +P V +    T  ++LQ+
Sbjct: 241 SDSCDSSAVLNDE---------------------TSSDNGRLTPPVTV----TGGSFLQF 288

Query: 301 QKGYQQQTQMFPKMEEHNFFSGEETCNFFSDEQAPTLHWWS 326
            K  Q +       +  +F SGEE C FFSDEQ P+LHW+S
Sbjct: 301 VKTEQTE-------DHEDFLSGEEACGFFSDEQPPSLHWYS 288

BLAST of Cp4.1LG01g09070 vs. Swiss-Prot
Match: ATHB5_ARATH (Homeobox-leucine zipper protein ATHB-5 OS=Arabidopsis thaliana GN=ATHB-5 PE=1 SV=1)

HSP 1 Score: 183.3 bits (464), Expect = 4.1e-45
Identity = 139/345 (40.29%), Postives = 184/345 (53.33%), Query Frame = 1

Query: 1   MKRPADSMGALMSISP----TTDQEQSPRNNGVN-----GTEFQSMLDGFGEEGYVEELG 60
           MKR   S  +L    P    TTD++ SPR            ++  M D   ++G +E+LG
Sbjct: 1   MKRSRGSSDSLSGFLPIRHSTTDKQISPRPTTTGFLYSGAGDYSQMFDALEDDGSLEDLG 60

Query: 61  HV-------SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQSRQVAVWFQNR 120
            V       +EKKRRL VEQVKALEKNFE++NKLEPERKVKLA+ELGLQ RQVA+WFQNR
Sbjct: 61  GVGHASSTAAEKKRRLGVEQVKALEKNFEIDNKLEPERKVKLAQELGLQPRQVAIWFQNR 120

Query: 121 RARWKTKQLERDYGVLKTNYQNLKLTYETLQHDNQALLKEIQELKKKLQ-------EDNS 180
           RARWKTKQLERDYGVLK+N+  LK   ++LQ DN +LL +I+ELK KL        E+N 
Sbjct: 121 RARWKTKQLERDYGVLKSNFDALKRNRDSLQRDNDSLLGQIKELKAKLNVEGVKGIEENG 180

Query: 181 ESNLSVEEEMTVPADSENALIEQIKPEITDQFSVPLATETQDFNYESLHSNGGEGEEVSL 240
               +VE   +V A++E   +    P       +P    T +  +E            S+
Sbjct: 181 ALK-AVEANQSVMANNEVLELSHRSPSPPPH--IPTDAPTSELAFEMF----------SI 240

Query: 241 FP---DFKDGSSD-SDSSAILNEDYGPTVAISSPVVLQHNHQHFMTGAASPSPSPDVKLN 300
           FP   +F+D  +D SDSSA+LNE+Y P    ++             GA + +      + 
Sbjct: 241 FPRTENFRDDPADSSDSSAVLNEEYSPNTVEAA-------------GAVAATTVEMSTMG 300

Query: 301 CATTALNYLQYQKGYQQQTQMFPKMEEH-NFFSGEETCNFFSDEQ 318
           C +                  F KMEEH + FSGEE C  F+D +
Sbjct: 301 CFS-----------------QFVKMEEHEDLFSGEEACKLFADNE 302

BLAST of Cp4.1LG01g09070 vs. Swiss-Prot
Match: HOX4_ORYSI (Homeobox-leucine zipper protein HOX4 OS=Oryza sativa subsp. indica GN=HOX4 PE=1 SV=1)

HSP 1 Score: 144.4 bits (363), Expect = 2.1e-33
Identity = 99/179 (55.31%), Postives = 119/179 (66.48%), Query Frame = 1

Query: 1   MKRPADSMGALMSISPTTDQEQSPRNNGVNGTEFQSMLDGFGEEGYVEE----LGHVSEK 60
           MKRP  + G   S S  T    S  ++G  G        G   EG VEE     G   EK
Sbjct: 1   MKRPGGAGGGGGSPSLVTMANSS--DDGYGGV-------GMEAEGDVEEEMMACGGGGEK 60

Query: 61  KRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQSRQVAVWFQNRRARWKTKQLERD 120
           KRRLSVEQV+ALE++FEVENKLEPERK +LAR+LGLQ RQVAVWFQNRRARWKTKQLERD
Sbjct: 61  KRRLSVEQVRALERSFEVENKLEPERKARLARDLGLQPRQVAVWFQNRRARWKTKQLERD 120

Query: 121 YGVLKTNYQNLKLTYETLQHDNQALLKEIQELKKKL-QEDNSESNLSVEEEMTVPADSE 175
           Y  L+ +Y +L+L ++ L+ D  ALL EI+ELK KL  E+ + S  SV+EE   PA S+
Sbjct: 121 YAALRHSYDSLRLDHDALRRDKDALLAEIKELKAKLGDEEAAASFTSVKEE---PAASD 167

BLAST of Cp4.1LG01g09070 vs. Swiss-Prot
Match: HOX4_ORYSJ (Homeobox-leucine zipper protein HOX4 OS=Oryza sativa subsp. japonica GN=HOX4 PE=1 SV=1)

HSP 1 Score: 144.4 bits (363), Expect = 2.1e-33
Identity = 99/179 (55.31%), Postives = 119/179 (66.48%), Query Frame = 1

Query: 1   MKRPADSMGALMSISPTTDQEQSPRNNGVNGTEFQSMLDGFGEEGYVEE----LGHVSEK 60
           MKRP  + G   S S  T    S  ++G  G        G   EG VEE     G   EK
Sbjct: 1   MKRPGGAGGGGGSPSLVTMANSS--DDGYGGV-------GMEAEGDVEEEMMACGGGGEK 60

Query: 61  KRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQSRQVAVWFQNRRARWKTKQLERD 120
           KRRLSVEQV+ALE++FEVENKLEPERK +LAR+LGLQ RQVAVWFQNRRARWKTKQLERD
Sbjct: 61  KRRLSVEQVRALERSFEVENKLEPERKARLARDLGLQPRQVAVWFQNRRARWKTKQLERD 120

Query: 121 YGVLKTNYQNLKLTYETLQHDNQALLKEIQELKKKL-QEDNSESNLSVEEEMTVPADSE 175
           Y  L+ +Y +L+L ++ L+ D  ALL EI+ELK KL  E+ + S  SV+EE   PA S+
Sbjct: 121 YAALRHSYDSLRLDHDALRRDKDALLAEIKELKAKLGDEEAAASFTSVKEE---PAASD 167

BLAST of Cp4.1LG01g09070 vs. TrEMBL
Match: M5W009_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008318mg PE=4 SV=1)

HSP 1 Score: 352.8 bits (904), Expect = 4.4e-94
Identity = 215/340 (63.24%), Postives = 253/340 (74.41%), Query Frame = 1

Query: 1   MKR--PADSMGALMSISPTTDQEQSPRNNGVNGTEFQSMLDGFGEEGYVEELGHVSEKKR 60
           MKR   +DS+GA++SI P+T +EQSPRNN V   +FQSMLDG  EEG VEE GHVSEKKR
Sbjct: 1   MKRLGSSDSLGAMISICPSTAEEQSPRNNHVYRRDFQSMLDGLDEEGCVEEGGHVSEKKR 60

Query: 61  RLSVEQVKALEKNFEVENKLEPERKVKLARELGLQSRQVAVWFQNRRARWKTKQLERDYG 120
           RLSVEQVKALEKNFEVENKLEPERKVKLA+ELGLQ RQVAVWFQNRRARWKTKQLERD+G
Sbjct: 61  RLSVEQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDFG 120

Query: 121 VLKTNYQNLKLTYETLQHDNQALLKEIQELKKKLQEDNSES-NLSVEEEMTVPADSENAL 180
           VLK NY +LKL Y+ LQH+N+AL+KEI++LK KLQE+N+ES NLSV+EE  V  D  N  
Sbjct: 121 VLKANYDSLKLNYDNLQHENEALVKEIKQLKSKLQEENTESNNLSVKEEQMVAKDQSNYK 180

Query: 181 I-----EQIKPEITDQFSVPLATETQDFNYESLH--SNGGEG-EEVSLFPDFKDGSSDSD 240
           +      +  P      SVP ATE+++ N+ES +  +NG  G E VSLFPDFKDGSSDSD
Sbjct: 181 VVDHELSKSPPPPPLGSSVP-ATESKELNFESFNNTNNGAVGLEAVSLFPDFKDGSSDSD 240

Query: 241 SSAILNEDYGPTVAISSPVVLQHNHQHFMTGAASPSPSPDVKLNC------ATTALNYLQ 300
           SSAILNED  P + ISS  +LQ NHQ  M   AS S    +K NC      +++++N  Q
Sbjct: 241 SSAILNEDNSPNLTISSSGMLQ-NHQ-LMKSPASTS----LKFNCCSSSSPSSSSMNCFQ 300

Query: 301 YQKGYQQQTQMFPKMEEHNFFSGEETCNFFSDEQAPTLHW 324
           +QK Y  Q   F K+EEHNFFS EE C+FFSDEQAPTL W
Sbjct: 301 FQKTYHPQ---FVKIEEHNFFSSEEACSFFSDEQAPTLQW 330

BLAST of Cp4.1LG01g09070 vs. TrEMBL
Match: A0A0A0KGQ3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G499720 PE=4 SV=1)

HSP 1 Score: 350.1 bits (897), Expect = 2.8e-93
Identity = 204/335 (60.90%), Postives = 247/335 (73.73%), Query Frame = 1

Query: 1   MKR--PADSMGALMSISPTTDQEQSPRNNGVNGTEFQSMLDGFGEEGYVEELGHVSEKKR 60
           MKR   +DS+GALMS+ PT++ EQSPRN+ V G EFQSMLDG  EEG +EE  HV EKKR
Sbjct: 1   MKRHGSSDSLGALMSVCPTSE-EQSPRNSHVYGREFQSMLDGLDEEGSIEEHCHVGEKKR 60

Query: 61  RLSVEQVKALEKNFEVENKLEPERKVKLARELGLQSRQVAVWFQNRRARWKTKQLERDYG 120
           RLSV+QVKALEK FE+ENKLEPERKVKLA+ELGLQ RQVAVWFQNRRARWKTKQLERDYG
Sbjct: 61  RLSVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYG 120

Query: 121 VLKTNYQNLKLTYETLQHDNQALLKEIQELKKKLQEDNSESNLSVEEEMTVPADSENALI 180
           +LK NY++LK +++TLQ DN ALLKEI+ELK KL+E+ +ESNLSV+EE+ V ++S+N LI
Sbjct: 121 LLKANYESLKRSFDTLQQDNDALLKEIKELKSKLEEEKTESNLSVKEEIFV-SESDNLLI 180

Query: 181 EQIKPEI-TDQFSVPLATE-TQDFNYESLHSNGGEG-----EEVSLFPDFKDGSSDSDSS 240
           EQ    +  D  S+P+A++ + DFNYES  + G +       EVSLF DFKDGSSDSDSS
Sbjct: 181 EQTTNHLPVDHISLPVASDHSDDFNYESFRTVGADDGDDQRVEVSLFTDFKDGSSDSDSS 240

Query: 241 AILNEDYGPTVAISSPV--VLQHNHQHFMTGAASPSPSPDVKLNCATTALNYLQYQKGYQ 300
           AILNED  P   +SS    +LQ +HQ                L+   T+LN   +QK   
Sbjct: 241 AILNEDNSPNAVVSSATAGMLQSHHQ---------------ILSSPATSLNCYPFQKAAY 300

Query: 301 QQTQMFPKMEEHNFFSGEETCNFFSDEQAPTLHWW 325
              Q F K+EEHNFFSGEETCN FSDEQAP++HW+
Sbjct: 301 NNAQQFVKIEEHNFFSGEETCNLFSDEQAPSMHWY 318

BLAST of Cp4.1LG01g09070 vs. TrEMBL
Match: M5WIS1_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008318mg PE=4 SV=1)

HSP 1 Score: 349.0 bits (894), Expect = 6.3e-93
Identity = 215/340 (63.24%), Postives = 253/340 (74.41%), Query Frame = 1

Query: 1   MKR--PADSMGALMSISPTTDQEQSPRNNGVNGTEFQSMLDGFGEEGYVEELGHVSEKKR 60
           MKR   +DS+GA++SI P+T+ EQSPRNN V   +FQSMLDG  EEG VEE GHVSEKKR
Sbjct: 1   MKRLGSSDSLGAMISICPSTE-EQSPRNNHVYRRDFQSMLDGLDEEGCVEEGGHVSEKKR 60

Query: 61  RLSVEQVKALEKNFEVENKLEPERKVKLARELGLQSRQVAVWFQNRRARWKTKQLERDYG 120
           RLSVEQVKALEKNFEVENKLEPERKVKLA+ELGLQ RQVAVWFQNRRARWKTKQLERD+G
Sbjct: 61  RLSVEQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDFG 120

Query: 121 VLKTNYQNLKLTYETLQHDNQALLKEIQELKKKLQEDNSES-NLSVEEEMTVPADSENAL 180
           VLK NY +LKL Y+ LQH+N+AL+KEI++LK KLQE+N+ES NLSV+EE  V  D  N  
Sbjct: 121 VLKANYDSLKLNYDNLQHENEALVKEIKQLKSKLQEENTESNNLSVKEEQMVAKDQSNYK 180

Query: 181 I-----EQIKPEITDQFSVPLATETQDFNYESLH--SNGGEG-EEVSLFPDFKDGSSDSD 240
           +      +  P      SVP ATE+++ N+ES +  +NG  G E VSLFPDFKDGSSDSD
Sbjct: 181 VVDHELSKSPPPPPLGSSVP-ATESKELNFESFNNTNNGAVGLEAVSLFPDFKDGSSDSD 240

Query: 241 SSAILNEDYGPTVAISSPVVLQHNHQHFMTGAASPSPSPDVKLNC------ATTALNYLQ 300
           SSAILNED  P + ISS  +LQ NHQ  M   AS S    +K NC      +++++N  Q
Sbjct: 241 SSAILNEDNSPNLTISSSGMLQ-NHQ-LMKSPASTS----LKFNCCSSSSPSSSSMNCFQ 300

Query: 301 YQKGYQQQTQMFPKMEEHNFFSGEETCNFFSDEQAPTLHW 324
           +QK Y  Q   F K+EEHNFFS EE C+FFSDEQAPTL W
Sbjct: 301 FQKTYHPQ---FVKIEEHNFFSSEEACSFFSDEQAPTLQW 329

BLAST of Cp4.1LG01g09070 vs. TrEMBL
Match: A0A061DJ94_THECC (Alanine--glyoxylate aminotransferase 2 isoform 1 OS=Theobroma cacao GN=TCM_001039 PE=4 SV=1)

HSP 1 Score: 345.5 bits (885), Expect = 7.0e-92
Identity = 203/334 (60.78%), Postives = 250/334 (74.85%), Query Frame = 1

Query: 1   MKR--PADSMGALMSISPTTDQEQSPRNNGVNGTEFQSMLDGFGEEGYVEELGHVSEKKR 60
           MKR   +DS+GALMSI PTTD E SPRNN +   EFQSMLDG  EEG VEE GHV+EKKR
Sbjct: 1   MKRLGSSDSLGALMSICPTTD-EHSPRNNHIYSREFQSMLDGLDEEGCVEESGHVAEKKR 60

Query: 61  RLSVEQVKALEKNFEVENKLEPERKVKLARELGLQSRQVAVWFQNRRARWKTKQLERDYG 120
           RLSV+QVKALEKNFEVENKLEPERKVKLA+ELGLQ RQVAVWFQNRRARWKTKQLERDYG
Sbjct: 61  RLSVDQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYG 120

Query: 121 VLKTNYQNLKLTYETLQHDNQALLKEIQELKKKLQEDNSESNLSVEEEMTVPADSENALI 180
           +LKT+Y+ LK+ Y+TLQHDN+ALLKEI+ELK KL  +++ESNLSV+EE+ V  +++N  +
Sbjct: 121 LLKTSYETLKVNYDTLQHDNEALLKEIRELKAKLNGESTESNLSVKEEVIV-HETDNKTL 180

Query: 181 EQIKPEITDQFSVPLATETQDFNYESLHSNGGEGEEVSLFPDFKDGSSDSDSSAILNEDY 240
           EQ +P      S+  ++E  + NYES +++ G     +LFPD KDGSSDSDSSAILNED 
Sbjct: 181 EQSEPPPVS--SLVTSSEPAELNYESFNNSIG-SVGATLFPDLKDGSSDSDSSAILNEDN 240

Query: 241 G----PTVAISSPVVLQHNHQHFMTGAASPSPSPDVKLNCAT---TALNYLQYQKGYQQQ 300
                   AISS  VLQ + QH +    SP+ +  +  N ++   +++N  Q+ K   Q 
Sbjct: 241 NNCSPNNAAISSSGVLQ-SQQHLL---MSPTTTSSLNFNSSSSSPSSMNCFQFSKSTYQP 300

Query: 301 TQMFPKMEEHNFFSGEETCNFFSDEQAPTLHWWS 326
           +  + KMEEHNFFS +E CNFFSDEQAP+LHW+S
Sbjct: 301 SHQYVKMEEHNFFSADEACNFFSDEQAPSLHWYS 325

BLAST of Cp4.1LG01g09070 vs. TrEMBL
Match: A9PHT9_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0007s05010g PE=2 SV=1)

HSP 1 Score: 341.7 bits (875), Expect = 1.0e-90
Identity = 203/330 (61.52%), Postives = 244/330 (73.94%), Query Frame = 1

Query: 5   ADSMGALMSISPTTDQEQSPRNNG-VNGTEFQSMLDGFGEEGYVEEL-GHVSEKKRRLSV 64
           +DS+GALMSI P+ + E SPRN+  V   EFQSMLDG  EEG VEE  GHV+EKKRRLS 
Sbjct: 8   SDSLGALMSICPSAE-EHSPRNHTHVYSREFQSMLDGLDEEGCVEEAGGHVTEKKRRLSG 67

Query: 65  EQVKALEKNFEVENKLEPERKVKLARELGLQSRQVAVWFQNRRARWKTKQLERDYGVLKT 124
           +QVKALEKNFEVENKLEPERKVKLA+ELGLQ RQVAVWFQNRRARWKTKQLERDYGVLK 
Sbjct: 68  DQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKA 127

Query: 125 NYQNLKLTYETLQHDNQALLKEIQELKKKLQEDNSESNLSVEEEMTVPADSENALIEQIK 184
           NY +LK  ++ LQHDN+ALLKEI+ELK KL E+N+ESN+SV+EE+ + A+SE+ + E+  
Sbjct: 128 NYDSLKHNFDALQHDNEALLKEIRELKAKLNEENAESNVSVKEEI-ILAESEDKMPEEDT 187

Query: 185 PEITDQFSVPLATETQDFNYESL--HSNGGEGEEVSLFPDFKDGSSDSDSSAILNEDYGP 244
           P + D  +   A+ET++ NYE+   HS+   G   SLFPDFKDGSSDSDSSAILNED  P
Sbjct: 188 PALLDSVA---ASETKELNYETFNNHSSINIGLGASLFPDFKDGSSDSDSSAILNEDNSP 247

Query: 245 TVAISSPVVLQHNHQHFMTGAASPSPSPDVKLNCAT-----TALNYLQYQKGYQQQTQMF 304
             AISS  +LQ           SP PS  ++ NC+      +++N  Q+ K YQ Q   F
Sbjct: 248 NPAISSSGILQSQLM------MSPPPSSSLRFNCSASSSSPSSMNCFQFSKSYQTQ---F 307

Query: 305 PKMEEHNFFSGEETCNFFSDEQAPTLHWWS 326
            K+EEHNFFS EE CNFFSDEQ P+L W+S
Sbjct: 308 VKLEEHNFFSSEEACNFFSDEQPPSLPWYS 323

BLAST of Cp4.1LG01g09070 vs. TAIR10
Match: AT2G22430.1 (AT2G22430.1 homeobox protein 6)

HSP 1 Score: 227.3 bits (578), Expect = 1.4e-59
Identity = 159/342 (46.49%), Postives = 203/342 (59.36%), Query Frame = 1

Query: 1   MKR--PADSMGALMSISPTTD-QEQSPRNNGVNGTEFQSMLDGFGEE--GYVEELGHV-- 60
           MKR   +DS+G L+S+ PTT   EQSPR  G  G EFQSML+G+ EE    VEE GHV  
Sbjct: 2   MKRLSSSDSVGGLISLCPTTSTDEQSPRRYG--GREFQSMLEGYEEEEEAIVEERGHVGL 61

Query: 61  SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQSRQVAVWFQNRRARWKTKQL 120
           SEKKRRLS+ QVKALEKNFE+ENKLEPERKVKLA+ELGLQ RQVAVWFQNRRARWKTKQL
Sbjct: 62  SEKKRRLSINQVKALEKNFELENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQL 121

Query: 121 ERDYGVLKTNYQNLKLTYETLQHDNQALLKEIQELKKKL-----QEDNSESNLSVEEEMT 180
           E+DYGVLKT Y +L+  +++L+ DN++LL+EI +LK KL     +E+  E+N +V  E  
Sbjct: 122 EKDYGVLKTQYDSLRHNFDSLRRDNESLLQEISKLKTKLNGGGGEEEEEENNAAVTTESD 181

Query: 181 VPADSENALIEQIKPEITDQFSVP--LATETQDFNYES---LHSNGGEGEEVSLFPDFKD 240
           +    E   + +   +IT+  S P      +   NY S   L          S F     
Sbjct: 182 ISVKEEEVSLPE---KITEAPSSPPQFLEHSDGLNYRSFTDLRDLLPLKAAASSFAAAAG 241

Query: 241 GSSDSDSSAILNEDYGPTVAISSPVVLQHNHQHFMTGAASPSPSPDVKLNCATTALNYLQ 300
            S  SDSSA+LNE+    V +++PV +                             N+ Q
Sbjct: 242 SSDSSDSSALLNEESSSNVTVAAPVTVP--------------------------GGNFFQ 301

Query: 301 YQKGYQQQTQMFPKMEEHNFFSGEETCNFFSDEQAPTLHWWS 326
           + K   +QT+     +  +F SGEE C FFSDEQ P+LHW+S
Sbjct: 302 FVK--MEQTE-----DHEDFLSGEEACEFFSDEQPPSLHWYS 305

BLAST of Cp4.1LG01g09070 vs. TAIR10
Match: AT4G40060.1 (AT4G40060.1 homeobox protein 16)

HSP 1 Score: 188.3 bits (477), Expect = 7.2e-48
Identity = 147/341 (43.11%), Postives = 195/341 (57.18%), Query Frame = 1

Query: 1   MKRPADSMGALMSISPTTDQEQSPRNNGVNGTEFQSMLDGFGEEG-YVEELGH------V 60
           MKR + S      IS +TD EQSPR  G N   +QSML+G+ E+   +EE         +
Sbjct: 1   MKRLSSSDSMCGLISTSTD-EQSPRGYGSN---YQSMLEGYDEDATLIEEYSGNHHHMGL 60

Query: 61  SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQSRQVAVWFQNRRARWKTKQL 120
           SEKKRRL V+QVKALEKNFE+ENKLEPERK KLA+ELGLQ RQVAVWFQNRRARWKTKQL
Sbjct: 61  SEKKRRLKVDQVKALEKNFELENKLEPERKTKLAQELGLQPRQVAVWFQNRRARWKTKQL 120

Query: 121 ERDYGVLKTNYQNLKLTYETLQHDNQALLKEIQELKKKL--QEDNSESNL---SVEEEMT 180
           E+DYGVLK  Y +L+  +++L+ DN +LL+EI ++K K+  +EDN+ +      V+EE  
Sbjct: 121 EKDYGVLKGQYDSLRHNFDSLRRDNDSLLQEISKIKAKVNGEEDNNNNKAITEGVKEEEV 180

Query: 181 VPADSENALIEQIKPEITDQFSVPLATETQDFNYESLHSNGGEGEEVSLFPD---FKDGS 240
              DS         P    QF       +  FNY    ++  +     L P+    + GS
Sbjct: 181 HKTDSI--------PSSPLQF----LEHSSGFNYRRSFTDLRD-----LLPNSTVVEAGS 240

Query: 241 SDS-DSSAILNEDYGPTVAISSPVVLQHNHQHFMTGAASPSPSPDVKLNCATTALNYLQY 300
           SDS DSSA+LN++                     T + +   +P V +    T  ++LQ+
Sbjct: 241 SDSCDSSAVLNDE---------------------TSSDNGRLTPPVTV----TGGSFLQF 288

Query: 301 QKGYQQQTQMFPKMEEHNFFSGEETCNFFSDEQAPTLHWWS 326
            K  Q +       +  +F SGEE C FFSDEQ P+LHW+S
Sbjct: 301 VKTEQTE-------DHEDFLSGEEACGFFSDEQPPSLHWYS 288

BLAST of Cp4.1LG01g09070 vs. TAIR10
Match: AT5G65310.1 (AT5G65310.1 homeobox protein 5)

HSP 1 Score: 183.3 bits (464), Expect = 2.3e-46
Identity = 139/345 (40.29%), Postives = 184/345 (53.33%), Query Frame = 1

Query: 1   MKRPADSMGALMSISP----TTDQEQSPRNNGVN-----GTEFQSMLDGFGEEGYVEELG 60
           MKR   S  +L    P    TTD++ SPR            ++  M D   ++G +E+LG
Sbjct: 1   MKRSRGSSDSLSGFLPIRHSTTDKQISPRPTTTGFLYSGAGDYSQMFDALEDDGSLEDLG 60

Query: 61  HV-------SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQSRQVAVWFQNR 120
            V       +EKKRRL VEQVKALEKNFE++NKLEPERKVKLA+ELGLQ RQVA+WFQNR
Sbjct: 61  GVGHASSTAAEKKRRLGVEQVKALEKNFEIDNKLEPERKVKLAQELGLQPRQVAIWFQNR 120

Query: 121 RARWKTKQLERDYGVLKTNYQNLKLTYETLQHDNQALLKEIQELKKKLQ-------EDNS 180
           RARWKTKQLERDYGVLK+N+  LK   ++LQ DN +LL +I+ELK KL        E+N 
Sbjct: 121 RARWKTKQLERDYGVLKSNFDALKRNRDSLQRDNDSLLGQIKELKAKLNVEGVKGIEENG 180

Query: 181 ESNLSVEEEMTVPADSENALIEQIKPEITDQFSVPLATETQDFNYESLHSNGGEGEEVSL 240
               +VE   +V A++E   +    P       +P    T +  +E            S+
Sbjct: 181 ALK-AVEANQSVMANNEVLELSHRSPSPPPH--IPTDAPTSELAFEMF----------SI 240

Query: 241 FP---DFKDGSSD-SDSSAILNEDYGPTVAISSPVVLQHNHQHFMTGAASPSPSPDVKLN 300
           FP   +F+D  +D SDSSA+LNE+Y P    ++             GA + +      + 
Sbjct: 241 FPRTENFRDDPADSSDSSAVLNEEYSPNTVEAA-------------GAVAATTVEMSTMG 300

Query: 301 CATTALNYLQYQKGYQQQTQMFPKMEEH-NFFSGEETCNFFSDEQ 318
           C +                  F KMEEH + FSGEE C  F+D +
Sbjct: 301 CFS-----------------QFVKMEEHEDLFSGEEACKLFADNE 302

BLAST of Cp4.1LG01g09070 vs. TAIR10
Match: AT1G69780.1 (AT1G69780.1 Homeobox-leucine zipper protein family)

HSP 1 Score: 130.2 bits (326), Expect = 2.3e-30
Identity = 75/135 (55.56%), Postives = 93/135 (68.89%), Query Frame = 1

Query: 42  GEEGYVEELGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQSRQVAVWF 101
           GEE Y ++   + EKKRRL++EQVK LEKNFE+ NKLEPERK++LAR LGLQ RQ+A+WF
Sbjct: 71  GEEDYSDDGSQMGEKKRRLNMEQVKTLEKNFELGNKLEPERKMQLARALGLQPRQIAIWF 130

Query: 102 QNRRARWKTKQLERDYGVLKTNYQNLKLTYETLQHDNQALLKEIQELKKKLQEDNSESNL 161
           QNRRARWKTKQLE+DY  LK  +  LK   + LQ  NQ L  EI  LK +  E     NL
Sbjct: 131 QNRRARWKTKQLEKDYDTLKRQFDTLKAENDLLQTHNQKLQAEIMGLKNR--EQTESINL 190

Query: 162 SVEEEMTVPADSENA 177
           + E E +    S+N+
Sbjct: 191 NKETEGSCSNRSDNS 203

BLAST of Cp4.1LG01g09070 vs. TAIR10
Match: AT3G01470.1 (AT3G01470.1 homeobox 1)

HSP 1 Score: 126.3 bits (316), Expect = 3.4e-29
Identity = 71/127 (55.91%), Postives = 89/127 (70.08%), Query Frame = 1

Query: 34  FQSMLDGFGEEGYVEELGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ 93
           F S  D + ++ Y ++L    EKKRRL+ EQV  LEK+FE ENKLEPERK +LA++LGLQ
Sbjct: 49  FSSPEDLYDDDFYDDQL---PEKKRRLTTEQVHLLEKSFETENKLEPERKTQLAKKLGLQ 108

Query: 94  SRQVAVWFQNRRARWKTKQLERDYGVLKTNYQNLKLTYETLQHDNQALLKEIQELKKKLQ 153
            RQVAVWFQNRRARWKTKQLERDY +LK+ Y  L   Y+++  DN  L  E+  L +KLQ
Sbjct: 109 PRQVAVWFQNRRARWKTKQLERDYDLLKSTYDQLLSNYDSIVMDNDKLRSEVTSLTEKLQ 168

Query: 154 EDNSESN 161
                +N
Sbjct: 169 GKQETAN 172

BLAST of Cp4.1LG01g09070 vs. NCBI nr
Match: gi|595826046|ref|XP_007205507.1| (hypothetical protein PRUPE_ppa008318mg [Prunus persica])

HSP 1 Score: 352.8 bits (904), Expect = 6.3e-94
Identity = 215/340 (63.24%), Postives = 253/340 (74.41%), Query Frame = 1

Query: 1   MKR--PADSMGALMSISPTTDQEQSPRNNGVNGTEFQSMLDGFGEEGYVEELGHVSEKKR 60
           MKR   +DS+GA++SI P+T +EQSPRNN V   +FQSMLDG  EEG VEE GHVSEKKR
Sbjct: 1   MKRLGSSDSLGAMISICPSTAEEQSPRNNHVYRRDFQSMLDGLDEEGCVEEGGHVSEKKR 60

Query: 61  RLSVEQVKALEKNFEVENKLEPERKVKLARELGLQSRQVAVWFQNRRARWKTKQLERDYG 120
           RLSVEQVKALEKNFEVENKLEPERKVKLA+ELGLQ RQVAVWFQNRRARWKTKQLERD+G
Sbjct: 61  RLSVEQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDFG 120

Query: 121 VLKTNYQNLKLTYETLQHDNQALLKEIQELKKKLQEDNSES-NLSVEEEMTVPADSENAL 180
           VLK NY +LKL Y+ LQH+N+AL+KEI++LK KLQE+N+ES NLSV+EE  V  D  N  
Sbjct: 121 VLKANYDSLKLNYDNLQHENEALVKEIKQLKSKLQEENTESNNLSVKEEQMVAKDQSNYK 180

Query: 181 I-----EQIKPEITDQFSVPLATETQDFNYESLH--SNGGEG-EEVSLFPDFKDGSSDSD 240
           +      +  P      SVP ATE+++ N+ES +  +NG  G E VSLFPDFKDGSSDSD
Sbjct: 181 VVDHELSKSPPPPPLGSSVP-ATESKELNFESFNNTNNGAVGLEAVSLFPDFKDGSSDSD 240

Query: 241 SSAILNEDYGPTVAISSPVVLQHNHQHFMTGAASPSPSPDVKLNC------ATTALNYLQ 300
           SSAILNED  P + ISS  +LQ NHQ  M   AS S    +K NC      +++++N  Q
Sbjct: 241 SSAILNEDNSPNLTISSSGMLQ-NHQ-LMKSPASTS----LKFNCCSSSSPSSSSMNCFQ 300

Query: 301 YQKGYQQQTQMFPKMEEHNFFSGEETCNFFSDEQAPTLHW 324
           +QK Y  Q   F K+EEHNFFS EE C+FFSDEQAPTL W
Sbjct: 301 FQKTYHPQ---FVKIEEHNFFSSEEACSFFSDEQAPTLQW 330

BLAST of Cp4.1LG01g09070 vs. NCBI nr
Match: gi|659080027|ref|XP_008440572.1| (PREDICTED: homeobox-leucine zipper protein ATHB-6-like [Cucumis melo])

HSP 1 Score: 351.7 bits (901), Expect = 1.4e-93
Identity = 204/335 (60.90%), Postives = 248/335 (74.03%), Query Frame = 1

Query: 1   MKR--PADSMGALMSISPTTDQEQSPRNNGVNGTEFQSMLDGFGEEGYVEELGHVSEKKR 60
           MKR   +DS+GALMS+ PT++ EQSPRN+ V G EFQSMLDG  EEG +EE  HV EKKR
Sbjct: 1   MKRHGSSDSLGALMSVCPTSE-EQSPRNSHVYGREFQSMLDGLDEEGSIEEHCHVGEKKR 60

Query: 61  RLSVEQVKALEKNFEVENKLEPERKVKLARELGLQSRQVAVWFQNRRARWKTKQLERDYG 120
           RLSV+QVKALEK FE+ENKLEPERKVKLA+ELGLQ RQVAVWFQNRRARWKTKQLERDYG
Sbjct: 61  RLSVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYG 120

Query: 121 VLKTNYQNLKLTYETLQHDNQALLKEIQELKKKLQEDNSESNLSVEEEMTVPADSENALI 180
           +LK NY++LK +++TLQ DN ALLKEI+ELK KL+E+ +ESNLSV+EE+ V ++S+N LI
Sbjct: 121 LLKANYESLKRSFDTLQQDNDALLKEIKELKSKLEEEKTESNLSVKEEIFV-SESDNLLI 180

Query: 181 EQIKPEI-TDQFSVPLATE-TQDFNYESLHSNGGEG-----EEVSLFPDFKDGSSDSDSS 240
           EQ    +  D  S+P+A++ + DF+YES  + G +       EVSLFPDFKDGSSDSDSS
Sbjct: 181 EQTTNHLPVDHISLPVASDHSDDFDYESFRTVGADDGDDQRVEVSLFPDFKDGSSDSDSS 240

Query: 241 AILNEDYGPTVAISSPV--VLQHNHQHFMTGAASPSPSPDVKLNCATTALNYLQYQKGYQ 300
           AILNED  P   +SS    +LQ +HQ                L+   T+LN   +QK   
Sbjct: 241 AILNEDNSPNAVVSSATAGMLQSHHQ---------------ILSSPATSLNCFPFQKATY 300

Query: 301 QQTQMFPKMEEHNFFSGEETCNFFSDEQAPTLHWW 325
              Q F K+EEHNFFSGEETCN FSDEQAP++HW+
Sbjct: 301 NNAQQFVKIEEHNFFSGEETCNLFSDEQAPSMHWY 318

BLAST of Cp4.1LG01g09070 vs. NCBI nr
Match: gi|645219318|ref|XP_008235150.1| (PREDICTED: homeobox-leucine zipper protein ATHB-6-like [Prunus mume])

HSP 1 Score: 350.5 bits (898), Expect = 3.1e-93
Identity = 213/340 (62.65%), Postives = 252/340 (74.12%), Query Frame = 1

Query: 1   MKR--PADSMGALMSISPTTDQEQSPRNNGVNGTEFQSMLDGFGEEGYVEELGHVSEKKR 60
           MKR   +DS+GA++SI P+T +E SPRNN V   +F SMLDG  EEG VEE GHVSEKKR
Sbjct: 1   MKRLGSSDSLGAMISICPSTAEEHSPRNNHVYRRDFHSMLDGLDEEGCVEEGGHVSEKKR 60

Query: 61  RLSVEQVKALEKNFEVENKLEPERKVKLARELGLQSRQVAVWFQNRRARWKTKQLERDYG 120
           RLSVEQVKALEKNFEVENKLEPERKVKLA+ELGLQ RQVAVWFQNRRARWKTKQLERD+G
Sbjct: 61  RLSVEQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDFG 120

Query: 121 VLKTNYQNLKLTYETLQHDNQALLKEIQELKKKLQEDNSES-NLSVEEEMTVPADSENAL 180
           VLK NY +LKL Y++LQH+N+AL+KEI++LK KLQE+N+ES NLSV+EE  V  D  N  
Sbjct: 121 VLKANYDSLKLNYDSLQHENEALVKEIKQLKSKLQEENTESNNLSVKEEQMVAKDQSNYK 180

Query: 181 I-----EQIKPEITDQFSVPLATETQDFNYESLH--SNGGEG-EEVSLFPDFKDGSSDSD 240
           +      +  P      SVP ATE+++ N+ES +  +NG  G E VSLFPDFKDGSSDSD
Sbjct: 181 VVDHEKSKSPPPPPPGSSVP-ATESKELNFESFNNTNNGAVGLEAVSLFPDFKDGSSDSD 240

Query: 241 SSAILNEDYGPTVAISSPVVLQHNHQHFMTGAASPSPSPDVKLNC------ATTALNYLQ 300
           SSAILNED  P + ISS  +LQ NHQ  M   AS S    +K NC      +++++N  Q
Sbjct: 241 SSAILNEDNSPNLTISSSGMLQ-NHQ-LMKSPASTS----LKFNCSSSSSPSSSSMNCFQ 300

Query: 301 YQKGYQQQTQMFPKMEEHNFFSGEETCNFFSDEQAPTLHW 324
           +QK Y  Q   F K+EEHNFFS EE C+FFSDEQAPTL W
Sbjct: 301 FQKTYHPQ---FVKIEEHNFFSSEEACSFFSDEQAPTLQW 330

BLAST of Cp4.1LG01g09070 vs. NCBI nr
Match: gi|449451407|ref|XP_004143453.1| (PREDICTED: homeobox-leucine zipper protein ATHB-6 [Cucumis sativus])

HSP 1 Score: 350.1 bits (897), Expect = 4.1e-93
Identity = 204/335 (60.90%), Postives = 247/335 (73.73%), Query Frame = 1

Query: 1   MKR--PADSMGALMSISPTTDQEQSPRNNGVNGTEFQSMLDGFGEEGYVEELGHVSEKKR 60
           MKR   +DS+GALMS+ PT++ EQSPRN+ V G EFQSMLDG  EEG +EE  HV EKKR
Sbjct: 1   MKRHGSSDSLGALMSVCPTSE-EQSPRNSHVYGREFQSMLDGLDEEGSIEEHCHVGEKKR 60

Query: 61  RLSVEQVKALEKNFEVENKLEPERKVKLARELGLQSRQVAVWFQNRRARWKTKQLERDYG 120
           RLSV+QVKALEK FE+ENKLEPERKVKLA+ELGLQ RQVAVWFQNRRARWKTKQLERDYG
Sbjct: 61  RLSVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYG 120

Query: 121 VLKTNYQNLKLTYETLQHDNQALLKEIQELKKKLQEDNSESNLSVEEEMTVPADSENALI 180
           +LK NY++LK +++TLQ DN ALLKEI+ELK KL+E+ +ESNLSV+EE+ V ++S+N LI
Sbjct: 121 LLKANYESLKRSFDTLQQDNDALLKEIKELKSKLEEEKTESNLSVKEEIFV-SESDNLLI 180

Query: 181 EQIKPEI-TDQFSVPLATE-TQDFNYESLHSNGGEG-----EEVSLFPDFKDGSSDSDSS 240
           EQ    +  D  S+P+A++ + DFNYES  + G +       EVSLF DFKDGSSDSDSS
Sbjct: 181 EQTTNHLPVDHISLPVASDHSDDFNYESFRTVGADDGDDQRVEVSLFTDFKDGSSDSDSS 240

Query: 241 AILNEDYGPTVAISSPV--VLQHNHQHFMTGAASPSPSPDVKLNCATTALNYLQYQKGYQ 300
           AILNED  P   +SS    +LQ +HQ                L+   T+LN   +QK   
Sbjct: 241 AILNEDNSPNAVVSSATAGMLQSHHQ---------------ILSSPATSLNCYPFQKAAY 300

Query: 301 QQTQMFPKMEEHNFFSGEETCNFFSDEQAPTLHWW 325
              Q F K+EEHNFFSGEETCN FSDEQAP++HW+
Sbjct: 301 NNAQQFVKIEEHNFFSGEETCNLFSDEQAPSMHWY 318

BLAST of Cp4.1LG01g09070 vs. NCBI nr
Match: gi|595826040|ref|XP_007205506.1| (hypothetical protein PRUPE_ppa008318mg [Prunus persica])

HSP 1 Score: 349.0 bits (894), Expect = 9.1e-93
Identity = 215/340 (63.24%), Postives = 253/340 (74.41%), Query Frame = 1

Query: 1   MKR--PADSMGALMSISPTTDQEQSPRNNGVNGTEFQSMLDGFGEEGYVEELGHVSEKKR 60
           MKR   +DS+GA++SI P+T+ EQSPRNN V   +FQSMLDG  EEG VEE GHVSEKKR
Sbjct: 1   MKRLGSSDSLGAMISICPSTE-EQSPRNNHVYRRDFQSMLDGLDEEGCVEEGGHVSEKKR 60

Query: 61  RLSVEQVKALEKNFEVENKLEPERKVKLARELGLQSRQVAVWFQNRRARWKTKQLERDYG 120
           RLSVEQVKALEKNFEVENKLEPERKVKLA+ELGLQ RQVAVWFQNRRARWKTKQLERD+G
Sbjct: 61  RLSVEQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDFG 120

Query: 121 VLKTNYQNLKLTYETLQHDNQALLKEIQELKKKLQEDNSES-NLSVEEEMTVPADSENAL 180
           VLK NY +LKL Y+ LQH+N+AL+KEI++LK KLQE+N+ES NLSV+EE  V  D  N  
Sbjct: 121 VLKANYDSLKLNYDNLQHENEALVKEIKQLKSKLQEENTESNNLSVKEEQMVAKDQSNYK 180

Query: 181 I-----EQIKPEITDQFSVPLATETQDFNYESLH--SNGGEG-EEVSLFPDFKDGSSDSD 240
           +      +  P      SVP ATE+++ N+ES +  +NG  G E VSLFPDFKDGSSDSD
Sbjct: 181 VVDHELSKSPPPPPLGSSVP-ATESKELNFESFNNTNNGAVGLEAVSLFPDFKDGSSDSD 240

Query: 241 SSAILNEDYGPTVAISSPVVLQHNHQHFMTGAASPSPSPDVKLNC------ATTALNYLQ 300
           SSAILNED  P + ISS  +LQ NHQ  M   AS S    +K NC      +++++N  Q
Sbjct: 241 SSAILNEDNSPNLTISSSGMLQ-NHQ-LMKSPASTS----LKFNCCSSSSPSSSSMNCFQ 300

Query: 301 YQKGYQQQTQMFPKMEEHNFFSGEETCNFFSDEQAPTLHW 324
           +QK Y  Q   F K+EEHNFFS EE C+FFSDEQAPTL W
Sbjct: 301 FQKTYHPQ---FVKIEEHNFFSSEEACSFFSDEQAPTLQW 329

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
ATHB6_ARATH2.5e-5846.49Homeobox-leucine zipper protein ATHB-6 OS=Arabidopsis thaliana GN=ATHB-6 PE=1 SV... [more]
ATB16_ARATH1.3e-4643.11Homeobox-leucine zipper protein ATHB-16 OS=Arabidopsis thaliana GN=ATHB-16 PE=2 ... [more]
ATHB5_ARATH4.1e-4540.29Homeobox-leucine zipper protein ATHB-5 OS=Arabidopsis thaliana GN=ATHB-5 PE=1 SV... [more]
HOX4_ORYSI2.1e-3355.31Homeobox-leucine zipper protein HOX4 OS=Oryza sativa subsp. indica GN=HOX4 PE=1 ... [more]
HOX4_ORYSJ2.1e-3355.31Homeobox-leucine zipper protein HOX4 OS=Oryza sativa subsp. japonica GN=HOX4 PE=... [more]
Match NameE-valueIdentityDescription
M5W009_PRUPE4.4e-9463.24Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008318mg PE=4 SV=1[more]
A0A0A0KGQ3_CUCSA2.8e-9360.90Uncharacterized protein OS=Cucumis sativus GN=Csa_6G499720 PE=4 SV=1[more]
M5WIS1_PRUPE6.3e-9363.24Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008318mg PE=4 SV=1[more]
A0A061DJ94_THECC7.0e-9260.78Alanine--glyoxylate aminotransferase 2 isoform 1 OS=Theobroma cacao GN=TCM_00103... [more]
A9PHT9_POPTR1.0e-9061.52Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0007s05010g PE=2 SV=1[more]
Match NameE-valueIdentityDescription
AT2G22430.11.4e-5946.49 homeobox protein 6[more]
AT4G40060.17.2e-4843.11 homeobox protein 16[more]
AT5G65310.12.3e-4640.29 homeobox protein 5[more]
AT1G69780.12.3e-3055.56 Homeobox-leucine zipper protein family[more]
AT3G01470.13.4e-2955.91 homeobox 1[more]
Match NameE-valueIdentityDescription
gi|595826046|ref|XP_007205507.1|6.3e-9463.24hypothetical protein PRUPE_ppa008318mg [Prunus persica][more]
gi|659080027|ref|XP_008440572.1|1.4e-9360.90PREDICTED: homeobox-leucine zipper protein ATHB-6-like [Cucumis melo][more]
gi|645219318|ref|XP_008235150.1|3.1e-9362.65PREDICTED: homeobox-leucine zipper protein ATHB-6-like [Prunus mume][more]
gi|449451407|ref|XP_004143453.1|4.1e-9360.90PREDICTED: homeobox-leucine zipper protein ATHB-6 [Cucumis sativus][more]
gi|595826040|ref|XP_007205506.1|9.1e-9363.24hypothetical protein PRUPE_ppa008318mg [Prunus persica][more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0043565sequence-specific DNA binding
GO:0003700transcription factor activity, sequence-specific DNA binding
GO:0003677DNA binding
Vocabulary: Biological Process
TermDefinition
GO:0006355regulation of transcription, DNA-templated
Vocabulary: INTERPRO
TermDefinition
IPR017970Homeobox_CS
IPR009057Homeobox-like_sf
IPR003106Leu_zip_homeo
IPR001356Homeobox_dom
IPR000047HTH_motif
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005667 transcription factor complex
molecular_function GO:0043565 sequence-specific DNA binding
molecular_function GO:0003700 transcription factor activity, sequence-specific DNA binding
molecular_function GO:0016740 transferase activity
molecular_function GO:0003677 DNA binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG01g09070.1Cp4.1LG01g09070.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000047Helix-turn-helix motifPRINTSPR00031HTHREPRESSRcoord: 82..91
score: 1.1E-5coord: 91..107
score: 1.
IPR001356Homeobox domainPFAMPF00046Homeoboxcoord: 56..109
score: 7.7
IPR001356Homeobox domainSMARTSM00389HOX_1coord: 54..115
score: 2.3
IPR001356Homeobox domainPROFILEPS50071HOMEOBOX_2coord: 51..111
score: 17
IPR003106Leucine zipper, homeobox-associatedPFAMPF02183HALZcoord: 111..153
score: 1.2
IPR009057Homeodomain-likeGENE3DG3DSA:1.10.10.60coord: 58..118
score: 2.2
IPR009057Homeodomain-likeunknownSSF46689Homeodomain-likecoord: 43..113
score: 4.06
IPR017970Homeobox, conserved sitePROSITEPS00027HOMEOBOX_1coord: 86..109
scor
NoneNo IPR availableunknownCoilCoilcoord: 124..162
scor
NoneNo IPR availablePANTHERPTHR24326FAMILY NOT NAMEDcoord: 33..181
score: 2.1
NoneNo IPR availablePANTHERPTHR24326:SF196HOMEOBOX-LEUCINE ZIPPER PROTEIN ATHB-16-RELATEDcoord: 33..181
score: 2.1

The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
Cp4.1LG01g09070Cp4.1LG14g00900Cucurbita pepo (Zucchini)cpecpeB233