Cp4.1LG03g07630 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG03g07630
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: homeobox-leucine zipper protein HAT5-like
Location: Cp4.1LG03: 3293854 .. 3298165 (-)
RNA-Seq Expression: Cp4.1LG03g07630
Synteny: Cp4.1LG03g07630
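
The Location field can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention, which this record does not state explicitly). The function names `parse_location` and `span_length` are hypothetical helpers introduced for illustration only.

```python
import re

def parse_location(text):
    """Split a string like 'Cp4.1LG03: 3293854 .. 3298165 (-)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the feature, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG03: 3293854 .. 3298165 (-)")
print(seqid, strand, span_length(start, end))  # Cp4.1LG03 - 4312
```

Under that assumption the gene spans 4312 bp on the minus strand of Cp4.1LG03.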
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exon, five_prime_UTR, polypeptide, CDS, three_prime_UTR
CCCCTTTAGGTAGCGGAAGGCACCTGGAAAAAGCCATCGATTGGAAGAACAGAAACAGAGTCCAGCCGTCGCGTCTCCGGCCGCCGCTGTTTCCTCTGTCCGATCAAATCGCTCTTCTTCTTTCTGCGTGTCGCCGGAACTCGTATATCCGACGACCGATTCTTGTTGGGATCTGTGGAGCAGAAATTAGAAGGCATGGTGGGATTTTTTGCGATTCCGTTGGAGATTTCCGCTAGGTTACTCTGGACTACCAGTTTCTTCCGCCACAAGCGATCTTGATCCTGTTTTTCCTTTTCCTTTTCGAGATTACTGTTCGTGTTTTAGGTTAGTAGTAAACCTTCGCCTTAGTCAAATTCGAAATTTCGTACAAATTTAACCTCTACGTCGCTGGAATCGGAGATTTTAATGGAATCTGGGCGGTTCTTGTTTAATCCGCCGCCTTGCGGTGGCAACATGCTTTACCTTGGCGGAGCCGGTAGCGATCCTGTTCTGCGAGGTCAATTTCTCTTCCGATCTCTTTTATTTGTATTGATTCATAGCTATAAAGTTTTTTCAAAGCATTTTAATCAAACTTCTACTTCTATACTTTAATATACACTTATTCGTTAGAACATTTAATGTTTTTATTTTCAATTTTACTTCAAATAGAACTCAATAACAGAGATACTTTTAAAATTTTAATTTCCTTTAATAACCCTTCAATTTAAAATATTTTTTAAAAAGTTTTGAAAACTTCAAAAAAAAAACTTTTCAAAATGTTTTTTTTTTTTTTAGAATTTAGCTATAAGAATTCAATTAATTACTTGAAGAAAATGAAATCATATTAAAAAATTGTGAGAAAAACAAAACATATTAAGAAATATTCACCCAAATTTTAAAAACAANTTTGTGGATAATTTTTTAATAATTTTTTTTTATTTTCGACAAATTTATGAACTTTTCAAAGGAGAGAGAGAGAGAGGAGTGATGACGTAGAGTTTTTGAAAACCCTATGATTTGTTTATGTTTTCTCATTATTTAAAGCGTCATTATAATGTTTATAATTAAGTAGTGGTCTAGAATTTATTTAAATTTGTTAATAATGGTTAGTTAATAAATTACCGCATGCTAACTTTTTGGAATAATCCAAAAATAAGCCTATATTTTTCCTCTTATTTTAATGGAAGTGGGTTTTTCTTTTGTTTTGATCTTCAACTTTTCGATTGAGTGCTCTTGTGTGTTTCTGCATGCATTGTTCCTTGTTTAATCAATTTACATGTTAGTGGAAACCCTTTTGTTTACAATAATAATAATGATGTTTCAAATTTTATATTGATGTATGAACTTTATGAACTTAAATTATGAAATTTTAAATATTTGGTTCTATAAAATTAATTTGTTTACCAAATTTCTAATATCCATAAAATAGTTGAACAAAAAAAGAAACTGTGATTCATGTTTATAAATACTGGATTAGCATTAAATAGTTTTGTATGGATTTTCCCTCGCAATTAAATCAAAACTTATTTTATTTTCCAATATTGTGATGAAAAATGAAGGAGGAAGAACAATGATTAGTATGACTATGCATGAGAGTCCAAAGGGGAGGCCATTTTTTCGATCACCGGACGATCTTTACGATGATGAATATTACGACGAGTTCTACCCTGAGAAGAAGCGTCGTCTCACCCATGATCAGGTTTGTTATATTTGCCTTTACTGCCCCATTTTGCCTGCTTCTGCTGCTGCACTCTTTCTGTGGGACAAAAATGAAGAAGTTGCGGAAAATTGTTGAAGGTTCAAATGTTGGAGAAGAGCTTTGATGAAGAGAACAAACTGGAGCCAGAGAGGAAGTCCCAACTGGCCAAGAAGTTGGGGCTGCAACCAAGGCAGGTGGCTGTGTGGTTCCAGAATCGTCGTGCTCGATGGAAGACAAAGCAGCTCGAAAGGGACTATGATGTTCTTAAAGCTTCATATGATTTGCTTCTGTCTAACTATGACTCAGTTATCAAAGAGAATGC
AGATCTTAAATCTCAGGTGAGTTATATATACATACACATTCTTAAATTATGGTAGACGAGTAAAATTTTCACCTTATTCACTATTGGAGTATTGAATCTAGCTCGGTATATAGCTTAGTGAATTCCCGGTCAGTTGTTATCTAGTGCATTTCGAAAGATCGAACTAGAACAGCATGTCTGAAAGGATAAGAAAGAAAATAGGAAAGTTCATAAAAAACTAACAACAAGAGTTAAAGACTTAAAACAAATTTCATTATATTACTTTCAGGTGACTTCCTTAACTGAGAAATTTCTGGCTAAAGAGCTGGATGGAGGAGGAGAAGCACCAATTCCATGTGTGACATCAGAGCCTCTTCTAGCAGACATTGGCCATGTCTCCGCCCCACACTCCAGCAGAAAGGCTGAAGATCGTCTCAGTTCAGGGAGCGATGGCAGTACGGTGATCGACGATAATTGTCGACAACTCATTGATTGCTGTGATTCTTACTTCCCCAGCAACGAGTATCTGCAATGTGCACCTCTGCCTAATGGGTTGCAAATGGAACATGATGATAGCAATAACAATAGCAACTACTTGTTCTCAGACATGTTTGCAGTAACAGGCCAACAAAATCAGGAGGGAATGGGAGGGCCTCCTGCTTGGTGGACATGGCCTTAGGACAGTCATTCGACTTGTAACAGCCTGTTGCTTATACAAAATGTTAATGAAGTTCTTGTTTGTAATGAATAAAGTTGAGATATTATAGACATTGTTTGAAGTATTGTATGGCTGTGCTTAAAGAATGGTGCAAAATGTAGTGTCCACAAACTTGAACTGTAAACTTATAAGACAATGCAAGGATATAGCATACTTCAGATTTCTTGTTTTTTTTTTTTTTTTTTCAATTGATATGCAGAAAAACCTAATCCACATGATGAAGATTCTCAAATCCGATCAGTGGAAAATCATACTCCTATTTTCAATTAACGACACCGATACTGAACTCCTATATGAAATCGGGTCATCAAGAATAATGAGAAGACAGAAAAAGATGTGCAGCTATCTACCTTTGATTGCTATAAGCACAAATATTCAAGATTGCCTATACAAGGCAAGAGTACTTGCTGGTACCTGTTACAATGATTGAGGAAGGCCCCAACCAAGTCACTCGTCAAGCACCAAGGAAGACGAGACCTACAAAAGTAAGGTCAAATTTACCATTGTTGATACTAATGCCTGCAATGGCAACAGCTGCAGAACATAGATAGGTTTAGAGTATGAGAAACATGCAGGGTAAAATTCATTTGGAGCAAAAGATAACAGGCAGGTTATTCACGAGACGAAATAGAAACCAAAACGAGTTTTCCTGCCCTCCAGTGAATGATGGGCAGGCATCTACAAATACCCAGTTGNAAAAAAAAAAAAAAAAAAAAAAACTGTGGGTGGGGAGAAGGGGGGAAAGAAATGTAAAGAATTAACTATGTTTACAACAGAGTTTCAATTCTTTTTCTAGACTTCCCTTCTCCCTTTTCTATTTATACATAATGCTTCTGTGTGTATGGAACTTGCTGTTTAATGACGACATCCTTCTAAACGAGTCTCGCTTGATAATAGATATCATTGGGACCCTACCTGCTCCAAGCATAAAATGAAAAACAAGAAACAAAAAAAGAAAGCTCCAGAAGTTCTGTCCAAAATTGGGCTAAGGGACCATATTATCGATTTAGAGTTGGAGCCTCAAGCAGTCTTTATGAACAAAGCGCTCAGACATAGATTCTGAACCGCTTATCCCAGTTAGCAAGGTTACCTCTATGTACAGACGGATCTGTCGGCAGCAGGGGCGCAATGGCTGCAGCAGTAAAATCGCAAATAACTCCAAATTTGTCAAGGAGCAATGGAAAAAGGTATAAACCATAAAATGAATTGAAGGTTTTGTCTTCAAACACAAGAACTCAACAACAACCATAACCACTGGAATAAACCCTGTGGATATAGTTTGTCGGGAGTTATATGATGAGAC
TCTCAGGTTAAACCTCTGGTTCCTTTCTTTTCTTGCCAGATTTCTTGCTGCTTATCCTGGGCTTTTCTGTTAAATCCACACCAGCTAGCTTCTGATCTGGTATACTGCTCGGGTCCAAATAGTCATTTGGATCTTGGTTGGGCGATATCTTCTCGTATATAATAGCAGCATCAAGAGAGAATATGCGGAGTGCAAGAGAAGATACCGTAGAAATTTTGGCAGCAGCAGAGAGAGATGACCAGTACCACCATTCATTCTTCAAGTATTCCGTTCTTATCATGTCCTCTAATGCAATTGTTGCCTGGACCATCT

mRNA sequence

CCCCTTTAGGTAGCGGAAGGCACCTGGAAAAAGCCATCGATTGGAAGAACAGAAACAGAGTCCAGCCGTCGCGTCTCCGGCCGCCGCTGTTTCCTCTGTCCGATCAAATCGCTCTTCTTCTTTCTGCGTGTCGCCGGAACTCGTATATCCGACGACCGATTCTTGTTGGGATCTGTGGAGCAGAAATTAGAAGGCATGGTGGGATTTTTTGCGATTCCGTTGGAGATTTCCGCTAGGTTACTCTGGACTACCAGTTTCTTCCGCCACAAGCGATCTTGATCCTGTTTTTCCTTTTCCTTTTCGAGATTACTGTTCGTGTTTTAGGTTAGTAGTAAACCTTCGCCTTAGTCAAATTCGAAATTTCGTACAAATTTAACCTCTACGTCGCTGGAATCGGAGATTTTAATGGAATCTGGGCGGTTCTTGTTTAATCCGCCGCCTTGCGGTGGCAACATGCTTTACCTTGGCGGAGCCGGTAGCGATCCTGTTCTGCGAGGAGGAAGAACAATGATTAGTATGACTATGCATGAGAGTCCAAAGGGGAGGCCATTTTTTCGATCACCGGACGATCTTTACGATGATGAATATTACGACGAGTTCTACCCTGAGAAGAAGCGTCGTCTCACCCATGATCAGGTTCAAATGTTGGAGAAGAGCTTTGATGAAGAGAACAAACTGGAGCCAGAGAGGAAGTCCCAACTGGCCAAGAAGTTGGGGCTGCAACCAAGGCAGGTGGCTGTGTGGTTCCAGAATCGTCGTGCTCGATGGAAGACAAAGCAGCTCGAAAGGGACTATGATGTTCTTAAAGCTTCATATGATTTGCTTCTGTCTAACTATGACTCAGTTATCAAAGAGAATGCAGATCTTAAATCTCAGGTGACTTCCTTAACTGAGAAATTTCTGGCTAAAGAGCTGGATGGAGGAGGAGAAGCACCAATTCCATGTGTGACATCAGAGCCTCTTCTAGCAGACATTGGCCATGTCTCCGCCCCACACTCCAGCAGAAAGGCTGAAGATCGTCTCAGTTCAGGGAGCGATGGCAGTACGGTGATCGACGATAATTGTCGACAACTCATTGATTGCTGTGATTCTTACTTCCCCAGCAACGAGTATCTGCAATGTGCACCTCTGCCTAATGGGTTGCAAATGGAACATGATGATAGCAATAACAATAGCAACTACTTGTTCTCAGACATGTTTGCAGTAACAGGCCAACAAAATCAGGAGGGAATGGGAGGGCCTCCTGCTTGGTGGACATGGCCTTAGGACAGTCATTCGACTTGTAACAGCCTGTTGCTTATACAAAATGTTAATGAAGTTCTTGTTTGTAATGAATAAAGTTGAGATATTATAGACATTGTTTGAAGTATTGTATGGCTGTGCTTAAAGAATGGTGCAAAATGTAGTGTCCACAAACTTGAACTGTAAACTTATAAGACAATGCAAGGATATAGCATACTTCAGATTTCTTGTTTTTTTTTTTTTTTTTTCAATTGATATGCAGAAAAACCTAATCCACATGATGAAGATTCTCAAATCCGATCAGTGGAAAATCATACTCCTATTTTCAATTAACGACACCGATACTGAACTCCTATATGAAATCGGGTCATCAAGAATAATGAGAAGACAGAAAAAGATGTGCAGCTATCTACCTTTGATTGCTATAAGCACAAATATTCAAGATTGCCTATACAAGGCAAGAGTACTTGCTGGTACCTGTTACAATGATTGAGGAAGGCCCCAACCAAGTCACTCGTCAAGCACCAAGGAAGACGAGACCTACAAAAGTAAGGTCAAATTTACCATTGTTGATACTAATGCCTGCAATGGCAACAGCTGCAGAACATAGATAGGTTTAGAGTATGAGAAACATGCAGGGTAAAATTCATTTGGAGCAAAAGATAACAGGCAGGTTATTCACGAGACGAAATAGAAACCAAAACGAGTTTTCCTGCCCTCCAGTGAATGATGGGCAGGCATCTACAAATACCCAGTTG
NAAAAAAAAAAAAAAAAAAAAAAACTGTGGGTGGGGAGAAGGGGGGAAAGAAATGTAAAGAATTAACTATGTTTACAACAGAGTTTCAATTCTTTTTCTAGACTTCCCTTCTCCCTTTTCTATTTATACATAATGCTTCTGTGTGTATGGAACTTGCTGTTTAATGACGACATCCTTCTAAACGAGTCTCGCTTGATAATAGATATCATTGGGACCCTACCTGCTCCAAGCATAAAATGAAAAACAAGAAACAAAAAAAGAAAGCTCCAGAAGTTCTGTCCAAAATTGGGCTAAGGGACCATATTATCGATTTAGAGTTGGAGCCTCAAGCAGTCTTTATGAACAAAGCGCTCAGACATAGATTCTGAACCGCTTATCCCAGTTAGCAAGGTTACCTCTATGTACAGACGGATCTGTCGGCAGCAGGGGCGCAATGGCTGCAGCAGTAAAATCGCAAATAACTCCAAATTTGTCAAGGAGCAATGGAAAAAGGTATAAACCATAAAATGAATTGAAGGTTTTGTCTTCAAACACAAGAACTCAACAACAACCATAACCACTGGAATAAACCCTGTGGATATAGTTTGTCGGGAGTTATATGATGAGACTCTCAGGTTAAACCTCTGGTTCCTTTCTTTTCTTGCCAGATTTCTTGCTGCTTATCCTGGGCTTTTCTGTTAAATCCACACCAGCTAGCTTCTGATCTGGTATACTGCTCGGGTCCAAATAGTCATTTGGATCTTGGTTGGGCGATATCTTCTCGTATATAATAGCAGCATCAAGAGAGAATATGCGGAGTGCAAGAGAAGATACCGTAGAAATTTTGGCAGCAGCAGAGAGAGATGACCAGTACCACCATTCATTCTTCAAGTATTCCGTTCTTATCATGTCCTCTAATGCAATTGTTGCCTGGACCATCT

Coding sequence (CDS)

ATGGAATCTGGGCGGTTCTTGTTTAATCCGCCGCCTTGCGGTGGCAACATGCTTTACCTTGGCGGAGCCGGTAGCGATCCTGTTCTGCGAGGAGGAAGAACAATGATTAGTATGACTATGCATGAGAGTCCAAAGGGGAGGCCATTTTTTCGATCACCGGACGATCTTTACGATGATGAATATTACGACGAGTTCTACCCTGAGAAGAAGCGTCGTCTCACCCATGATCAGGTTCAAATGTTGGAGAAGAGCTTTGATGAAGAGAACAAACTGGAGCCAGAGAGGAAGTCCCAACTGGCCAAGAAGTTGGGGCTGCAACCAAGGCAGGTGGCTGTGTGGTTCCAGAATCGTCGTGCTCGATGGAAGACAAAGCAGCTCGAAAGGGACTATGATGTTCTTAAAGCTTCATATGATTTGCTTCTGTCTAACTATGACTCAGTTATCAAAGAGAATGCAGATCTTAAATCTCAGGTGACTTCCTTAACTGAGAAATTTCTGGCTAAAGAGCTGGATGGAGGAGGAGAAGCACCAATTCCATGTGTGACATCAGAGCCTCTTCTAGCAGACATTGGCCATGTCTCCGCCCCACACTCCAGCAGAAAGGCTGAAGATCGTCTCAGTTCAGGGAGCGATGGCAGTACGGTGATCGACGATAATTGTCGACAACTCATTGATTGCTGTGATTCTTACTTCCCCAGCAACGAGTATCTGCAATGTGCACCTCTGCCTAATGGGTTGCAAATGGAACATGATGATAGCAATAACAATAGCAACTACTTGTTCTCAGACATGTTTGCAGTAACAGGCCAACAAAATCAGGAGGGAATGGGAGGGCCTCCTGCTTGGTGGACATGGCCTTAG

Protein sequence

MESGRFLFNPPPCGGNMLYLGGAGSDPVLRGGRTMISMTMHESPKGRPFFRSPDDLYDDEYYDEFYPEKKRRLTHDQVQMLEKSFDEENKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVTSLTEKFLAKELDGGGEAPIPCVTSEPLLADIGHVSAPHSSRKAEDRLSSGSDGSTVIDDNCRQLIDCCDSYFPSNEYLQCAPLPNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGMGGPPAWWTWP
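
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the `translate` helper is hypothetical and not part of any tool used to build this record. Only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons containing N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 15 nt of the CDS listed above.
print(translate("ATGGAATCTGGGCGGTTCTTGTTTAATCCG"))  # MESGRFLFNP
print(translate("TGGACATGGCCTTAG"))                 # WTWP
```

Both fragments agree with the start (MESGRFLFNP) and end (WTWP) of the listed protein sequence.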
Homology
BLAST of Cp4.1LG03g07630 vs. ExPASy Swiss-Prot
Match: Q02283 (Homeobox-leucine zipper protein HAT5 OS=Arabidopsis thaliana OX=3702 GN=HAT5 PE=1 SV=1)

HSP 1 Score: 274.2 bits (700), Expect = 1.6e-72
Identity = 155/288 (53.82%), Positives = 196/288 (68.06%), Query Frame = 0

Query: 1   MESGRFLFNPPPCGGNMLYLGGAGSDPVLRGGRTMISMTMHESPKGRPFFRSPDDLYDDE 60
           MES  F F+P    GN ++  G   +PV++GG     M M E+ K RPFF SP+DLYDD+
Sbjct: 1   MESNSFFFDPSASHGNSMFFLG-NLNPVVQGGGARSMMNMEETSKRRPFFSSPEDLYDDD 60

Query: 61  YYDEFYPEKKRRLTHDQVQMLEKSFDEENKLEPERKSQLAKKLGLQPRQVAVWFQNRRAR 120
           +YD+  PEKKRRLT +QV +LEKSF+ ENKLEPERK+QLAKKLGLQPRQVAVWFQNRRAR
Sbjct: 61  FYDDQLPEKKRRLTTEQVHLLEKSFETENKLEPERKTQLAKKLGLQPRQVAVWFQNRRAR 120

Query: 121 WKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVTSLTEKFLAKELDGGGEAP--I 180
           WKTKQLERDYD+LK++YD LLSNYDS++ +N  L+S+VTSLTEK   K+ +   E P  +
Sbjct: 121 WKTKQLERDYDLLKSTYDQLLSNYDSIVMDNDKLRSEVTSLTEKLQGKQ-ETANEPPGQV 180

Query: 181 PCVTSEPLLADIGHVSAPHSSRKAEDRLSSGSDGSTVIDDNCRQLIDCCDSYFPSNEYLQ 240
           P    EP   D  +++A  ++ K EDRLSSGS GS V+DD+  QL+D CDSYFPS   +Q
Sbjct: 181 P----EPNQLDPVYINA--AAIKTEDRLSSGSVGSAVLDDDAPQLLDSCDSYFPSIVPIQ 240

Query: 241 CAPLPNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGMGGPPAWWTWP 287
                N    +HD    N    F+D+F  T   + +  G   A+W WP
Sbjct: 241 ----DNSNASDHD----NDRSCFADVFVPTTSPSHDHHGESLAFWGWP 272
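
The percentages in each HSP header are simply the match counts divided by the alignment length. The sketch below recomputes them from a header line; the `hsp_fractions` helper and its regular expression are assumptions made for illustration and match only the layout used on this page.

```python
import re

def hsp_fractions(header):
    """Return (matches, length, recomputed percent) for each 'a/b (p%)' in a header."""
    out = []
    for matches, length, _reported in re.findall(r"(\d+)/(\d+) \((\d+\.\d+)%\)", header):
        m, n = int(matches), int(length)
        out.append((m, n, round(100 * m / n, 2)))
    return out

line = "Identity = 155/288 (53.82%), Positives = 196/288 (68.06%), Query Frame = 0"
print(hsp_fractions(line))  # [(155, 288, 53.82), (196, 288, 68.06)]
```

The recomputed values match the reported 53.82% identity and 68.06% positives for the HAT5 (Q02283) hit.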

BLAST of Cp4.1LG03g07630 vs. ExPASy Swiss-Prot
Match: Q6YWR4 (Homeobox-leucine zipper protein HOX16 OS=Oryza sativa subsp. japonica OX=39947 GN=HOX16 PE=2 SV=1)

HSP 1 Score: 221.9 bits (564), Expect = 9.5e-57
Identity = 149/343 (43.44%), Positives = 193/343 (56.27%), Query Frame = 0

Query: 1   MESGRFLFNPPPCG-GNMLYL------GGAGSDPVL-RGGRTMISMTMHESPKGRPFFRS 60
           MESGR +F+    G G ML+L      GG G   +  RG R ++ M        RPFF +
Sbjct: 1   MESGRLIFSTAGSGAGQMLFLDCGAGGGGVGGGAMFHRGARPVLGMEEGGRGVKRPFFTT 60

Query: 61  PDDLYDDEYYDEFYPEKKRRLTHDQVQMLEKSFDEENKLEPERKSQLAKKLGLQPRQVAV 120
           PD+L ++EYYDE  PEKKRRLT +QV +LE+SF+EENKLEPERK++LA+KLGLQPRQVAV
Sbjct: 61  PDELLEEEYYDEQLPEKKRRLTPEQVHLLERSFEEENKLEPERKTELARKLGLQPRQVAV 120

Query: 121 WFQNRRARWKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVTSLTEKFLAKELDG 180
           WFQNRRARWKTKQLERD+D LKAS+D L +++D+++++N  L SQV SLTEK   KE   
Sbjct: 121 WFQNRRARWKTKQLERDFDRLKASFDALRADHDALLQDNHRLHSQVMSLTEKLQEKETTT 180

Query: 181 GGEAPIPC-VTSEPLLADIGHVSAPHSSR------------------KAEDRLSSGSDGS 240
            G A     V   P  AD+  V+ P +                    KAEDRLS+GS GS
Sbjct: 181 EGSAGAAVDVPGLPAAADV-KVAVPDAEEPALEEAAAAFEEQQEQQVKAEDRLSTGSGGS 240

Query: 241 TVIDDNCRQLIDC-----------CDSYFP-SNEYLQCAPLP-----NGLQMEHDD---S 286
            V+D + + ++ C            +SYFP  +EY  C   P      G+Q E DD   S
Sbjct: 241 AVVDTDAQLVVGCGRQHLAAVDSSVESYFPGGDEYHDCVMGPMDHAAGGIQSEEDDGAGS 300

BLAST of Cp4.1LG03g07630 vs. ExPASy Swiss-Prot
Match: A2X980 (Homeobox-leucine zipper protein HOX16 OS=Oryza sativa subsp. indica OX=39946 GN=HOX16 PE=2 SV=1)

HSP 1 Score: 221.1 bits (562), Expect = 1.6e-56
Identity = 149/345 (43.19%), Positives = 193/345 (55.94%), Query Frame = 0

Query: 1   MESGRFLFNPPPCG-GNMLYL--------GGAGSDPVL-RGGRTMISMTMHESPKGRPFF 60
           MESGR +F+    G G ML+L        GG G   +  RG R ++ M        RPFF
Sbjct: 1   MESGRLIFSTAGSGAGQMLFLDCGAGGGGGGVGGGAMFHRGARPVLGMEEGGRGVKRPFF 60

Query: 61  RSPDDLYDDEYYDEFYPEKKRRLTHDQVQMLEKSFDEENKLEPERKSQLAKKLGLQPRQV 120
            +PD+L ++EYYDE  PEKKRRLT +QV +LE+SF+EENKLEPERK++LA+KLGLQPRQV
Sbjct: 61  TTPDELLEEEYYDEQLPEKKRRLTPEQVHLLERSFEEENKLEPERKTELARKLGLQPRQV 120

Query: 121 AVWFQNRRARWKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVTSLTEKFLAKEL 180
           AVWFQNRRARWKTKQLERD+D LKAS+D L +++D+++++N  L SQV SLTEK   KE 
Sbjct: 121 AVWFQNRRARWKTKQLERDFDRLKASFDALRADHDALLQDNHRLHSQVMSLTEKLQEKET 180

Query: 181 DGGGEAPIPC-VTSEPLLADIGHVSAPHSSR------------------KAEDRLSSGSD 240
              G A     V   P  AD+  V+ P +                    KAEDRLS+GS 
Sbjct: 181 TTEGSAGAAVDVPGLPAAADV-KVAVPDAEEPALEEAAAAFEEQQEQQVKAEDRLSTGSG 240

Query: 241 GSTVIDDNCRQLIDC-----------CDSYFP-SNEYLQCAPLP-----NGLQMEHDD-- 286
           GS V+D + + ++ C            +SYFP  +EY  C   P      G+Q E DD  
Sbjct: 241 GSAVVDTDAQLVVGCGRQHLAAVDSSVESYFPGGDEYHDCVMGPMDHAAGGIQSEEDDGA 300

BLAST of Cp4.1LG03g07630 vs. ExPASy Swiss-Prot
Match: Q9XH36 (Homeobox-leucine zipper protein HOX5 OS=Oryza sativa subsp. indica OX=39946 GN=HOX5 PE=1 SV=1)

HSP 1 Score: 169.9 bits (429), Expect = 4.3e-41
Identity = 95/186 (51.08%), Positives = 125/186 (67.20%), Query Frame = 0

Query: 1   MESGRFLFNPP------PCGGNMLYLGGAGS-----------DPVLRGGRTMISMTMHES 60
           M+ GR +F+        P G  ML  GG GS             VL    +  S +   +
Sbjct: 1   MDPGRVVFDSGVARRACPGGAQMLLFGGGGSANSGGFFRGVPAAVLGMDESRSSSSAAGA 60

Query: 61  PKGRPFFRSPDDLYDDEYYDEFYPEKKRRLTHDQVQMLEKSFDEENKLEPERKSQLAKKL 120
              RPFF + ++L ++EYYDE  PEKKRRLT +QVQMLE+SF+EENKLEPERK++LA++L
Sbjct: 61  GAKRPFFTTHEELLEEEYYDEQAPEKKRRLTAEQVQMLERSFEEENKLEPERKTELARRL 120

Query: 121 GLQPRQVAVWFQNRRARWKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVTSLTE 170
           G+ PRQVAVWFQNRRARWKTKQLE D+D LKA+YD L +++ +++ +N  L++QV SLTE
Sbjct: 121 GMAPRQVAVWFQNRRARWKTKQLEHDFDRLKAAYDALAADHHALLSDNDRLRAQVISLTE 180

BLAST of Cp4.1LG03g07630 vs. ExPASy Swiss-Prot
Match: Q6ZA74 (Homeobox-leucine zipper protein HOX5 OS=Oryza sativa subsp. japonica OX=39947 GN=HOX5 PE=1 SV=1)

HSP 1 Score: 169.9 bits (429), Expect = 4.3e-41
Identity = 95/186 (51.08%), Positives = 125/186 (67.20%), Query Frame = 0

Query: 1   MESGRFLFNPP------PCGGNMLYLGGAGS-----------DPVLRGGRTMISMTMHES 60
           M+ GR +F+        P G  ML  GG GS             VL    +  S +   +
Sbjct: 1   MDPGRVVFDSGVARRACPGGAQMLLFGGGGSANSGGFFRGVPAAVLGMDESRSSSSAAGA 60

Query: 61  PKGRPFFRSPDDLYDDEYYDEFYPEKKRRLTHDQVQMLEKSFDEENKLEPERKSQLAKKL 120
              RPFF + ++L ++EYYDE  PEKKRRLT +QVQMLE+SF+EENKLEPERK++LA++L
Sbjct: 61  GAKRPFFTTHEELLEEEYYDEQAPEKKRRLTAEQVQMLERSFEEENKLEPERKTELARRL 120

Query: 121 GLQPRQVAVWFQNRRARWKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVTSLTE 170
           G+ PRQVAVWFQNRRARWKTKQLE D+D LKA+YD L +++ +++ +N  L++QV SLTE
Sbjct: 121 GMAPRQVAVWFQNRRARWKTKQLEHDFDRLKAAYDALAADHHALLSDNDRLRAQVISLTE 180

BLAST of Cp4.1LG03g07630 vs. NCBI nr
Match: XP_023526286.1 (homeobox-leucine zipper protein HAT5-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 588 bits (1515), Expect = 5.51e-212
Identity = 286/286 (100.00%), Positives = 286/286 (100.00%), Query Frame = 0

Query: 1   MESGRFLFNPPPCGGNMLYLGGAGSDPVLRGGRTMISMTMHESPKGRPFFRSPDDLYDDE 60
           MESGRFLFNPPPCGGNMLYLGGAGSDPVLRGGRTMISMTMHESPKGRPFFRSPDDLYDDE
Sbjct: 1   MESGRFLFNPPPCGGNMLYLGGAGSDPVLRGGRTMISMTMHESPKGRPFFRSPDDLYDDE 60

Query: 61  YYDEFYPEKKRRLTHDQVQMLEKSFDEENKLEPERKSQLAKKLGLQPRQVAVWFQNRRAR 120
           YYDEFYPEKKRRLTHDQVQMLEKSFDEENKLEPERKSQLAKKLGLQPRQVAVWFQNRRAR
Sbjct: 61  YYDEFYPEKKRRLTHDQVQMLEKSFDEENKLEPERKSQLAKKLGLQPRQVAVWFQNRRAR 120

Query: 121 WKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVTSLTEKFLAKELDGGGEAPIPC 180
           WKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVTSLTEKFLAKELDGGGEAPIPC
Sbjct: 121 WKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVTSLTEKFLAKELDGGGEAPIPC 180

Query: 181 VTSEPLLADIGHVSAPHSSRKAEDRLSSGSDGSTVIDDNCRQLIDCCDSYFPSNEYLQCA 240
           VTSEPLLADIGHVSAPHSSRKAEDRLSSGSDGSTVIDDNCRQLIDCCDSYFPSNEYLQCA
Sbjct: 181 VTSEPLLADIGHVSAPHSSRKAEDRLSSGSDGSTVIDDNCRQLIDCCDSYFPSNEYLQCA 240

Query: 241 PLPNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGMGGPPAWWTWP 286
           PLPNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGMGGPPAWWTWP
Sbjct: 241 PLPNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGMGGPPAWWTWP 286

BLAST of Cp4.1LG03g07630 vs. NCBI nr
Match: XP_022934435.1 (homeobox-leucine zipper protein HAT5-like [Cucurbita moschata])

HSP 1 Score: 573 bits (1477), Expect = 3.43e-206
Identity = 278/286 (97.20%), Positives = 281/286 (98.25%), Query Frame = 0

Query: 1   MESGRFLFNPPPCGGNMLYLGGAGSDPVLRGGRTMISMTMHESPKGRPFFRSPDDLYDDE 60
           MESGRFLFNPPPCGGNMLYLGGAG DPVLRGGRTM+SMTMHESPKGRPFFRSPDDLYDDE
Sbjct: 1   MESGRFLFNPPPCGGNMLYLGGAGGDPVLRGGRTMMSMTMHESPKGRPFFRSPDDLYDDE 60

Query: 61  YYDEFYPEKKRRLTHDQVQMLEKSFDEENKLEPERKSQLAKKLGLQPRQVAVWFQNRRAR 120
           YYDEFYPEKKRRLTHDQVQMLEKSFDE+NKLEPERKSQLAKKLGLQPRQVAVWFQNRRAR
Sbjct: 61  YYDEFYPEKKRRLTHDQVQMLEKSFDEDNKLEPERKSQLAKKLGLQPRQVAVWFQNRRAR 120

Query: 121 WKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVTSLTEKFLAKELDGGGEAPIPC 180
           WKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQV SLTEK LAKELDGGGEAPIPC
Sbjct: 121 WKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVASLTEKCLAKELDGGGEAPIPC 180

Query: 181 VTSEPLLADIGHVSAPHSSRKAEDRLSSGSDGSTVIDDNCRQLIDCCDSYFPSNEYLQCA 240
           VTSEPLLADIGHVSAPHSSRKAEDRLSSGSD STVIDDNCRQL DCCDSYFPSNEYLQCA
Sbjct: 181 VTSEPLLADIGHVSAPHSSRKAEDRLSSGSDDSTVIDDNCRQLTDCCDSYFPSNEYLQCA 240

Query: 241 PLPNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGMGGPPAWWTWP 286
           PLPNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEG+GGPPAWWTWP
Sbjct: 241 PLPNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGLGGPPAWWTWP 286

BLAST of Cp4.1LG03g07630 vs. NCBI nr
Match: KAG6581018.1 (Homeobox-leucine zipper protein HAT5, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 571 bits (1472), Expect = 1.98e-205
Identity = 277/286 (96.85%), Positives = 281/286 (98.25%), Query Frame = 0

Query: 1   MESGRFLFNPPPCGGNMLYLGGAGSDPVLRGGRTMISMTMHESPKGRPFFRSPDDLYDDE 60
           MESGRFLFNPPPCGG+MLYLGGAG DPVLRGGRTM+SMTMHESPKGRPFFRSPDDLYDDE
Sbjct: 1   MESGRFLFNPPPCGGSMLYLGGAGGDPVLRGGRTMMSMTMHESPKGRPFFRSPDDLYDDE 60

Query: 61  YYDEFYPEKKRRLTHDQVQMLEKSFDEENKLEPERKSQLAKKLGLQPRQVAVWFQNRRAR 120
           YYDEFYPEKKRRLTHDQVQMLEKSFDE+NKLEPERKSQLAKKLGLQPRQVAVWFQNRRAR
Sbjct: 61  YYDEFYPEKKRRLTHDQVQMLEKSFDEDNKLEPERKSQLAKKLGLQPRQVAVWFQNRRAR 120

Query: 121 WKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVTSLTEKFLAKELDGGGEAPIPC 180
           WKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQV SLTEK LAKELDGGGEAPIPC
Sbjct: 121 WKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVASLTEKCLAKELDGGGEAPIPC 180

Query: 181 VTSEPLLADIGHVSAPHSSRKAEDRLSSGSDGSTVIDDNCRQLIDCCDSYFPSNEYLQCA 240
           VTSEPLLADIGHVSAPHSSRKAEDRLSSGSD STVIDDNCRQL DCCDSYFPSNEYLQCA
Sbjct: 181 VTSEPLLADIGHVSAPHSSRKAEDRLSSGSDDSTVIDDNCRQLTDCCDSYFPSNEYLQCA 240

Query: 241 PLPNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGMGGPPAWWTWP 286
           PLPNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEG+GGPPAWWTWP
Sbjct: 241 PLPNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGLGGPPAWWTWP 286

BLAST of Cp4.1LG03g07630 vs. NCBI nr
Match: XP_022982559.1 (homeobox-leucine zipper protein HAT5-like [Cucurbita maxima])

HSP 1 Score: 562 bits (1448), Expect = 8.71e-202
Identity = 276/286 (96.50%), Positives = 280/286 (97.90%), Query Frame = 0

Query: 1   MESGRFLFNPPPCGGNMLYLGGAGSDPVLRGGRTMISMTMHESPKGRPFFRSPDDLYDDE 60
           MESGRFLFNP PCGGNMLYLGGAG DPVLRGGRTMISMTMHESPKGRPFF+SPDDLYDDE
Sbjct: 1   MESGRFLFNPRPCGGNMLYLGGAGGDPVLRGGRTMISMTMHESPKGRPFFQSPDDLYDDE 60

Query: 61  YYDEFYPEKKRRLTHDQVQMLEKSFDEENKLEPERKSQLAKKLGLQPRQVAVWFQNRRAR 120
           YYDEFYPEKKRRLTHDQVQMLEKSFDEENKLEPERKSQLAKKLGLQPRQVAVWFQNRRAR
Sbjct: 61  YYDEFYPEKKRRLTHDQVQMLEKSFDEENKLEPERKSQLAKKLGLQPRQVAVWFQNRRAR 120

Query: 121 WKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVTSLTEKFLAKELDGGGEAPIPC 180
           WKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQV SLTEK LAKELDGG EAPIPC
Sbjct: 121 WKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVASLTEKCLAKELDGG-EAPIPC 180

Query: 181 VTSEPLLADIGHVSAPHSSRKAEDRLSSGSDGSTVIDDNCRQLIDCCDSYFPSNEYLQCA 240
           VTSEPLLADIG+VS PHSSRKAEDRLSSGSDGSTVIDDNCRQLIDCCDSYFPSNEYLQCA
Sbjct: 181 VTSEPLLADIGNVSTPHSSRKAEDRLSSGSDGSTVIDDNCRQLIDCCDSYFPSNEYLQCA 240

Query: 241 PLPNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGMGGPPAWWTWP 286
           PLPNGLQMEHD+SNNNSNYLFSDMFAVTGQQNQEG+GGPPAWWTWP
Sbjct: 241 PLPNGLQMEHDNSNNNSNYLFSDMFAVTGQQNQEGLGGPPAWWTWP 285

BLAST of Cp4.1LG03g07630 vs. NCBI nr
Match: KAG7017759.1 (Homeobox-leucine zipper protein HAT5 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 556 bits (1434), Expect = 4.01e-199
Identity = 278/318 (87.42%), Positives = 281/318 (88.36%), Query Frame = 0

Query: 1   MESGRFLFNPPPCGGNMLYLGGAGSDPVLRGGRTMISMTMHESPKGRPFFRSPDDLYDDE 60
           MESGRFLFNPPPCGGNMLYLGGAG DPVLRGGRTM+SMTMHESPKGRPFFRSPDDLYDDE
Sbjct: 1   MESGRFLFNPPPCGGNMLYLGGAGGDPVLRGGRTMMSMTMHESPKGRPFFRSPDDLYDDE 60

Query: 61  YYDEFYPEKKRRLTHDQV--------------------------------QMLEKSFDEE 120
           YYDEFYPEKKRRLTHDQV                                QMLEKSFDE+
Sbjct: 61  YYDEFYPEKKRRLTHDQVCYICLYCPILPASAAALSVGQKFRSCVKLLKVQMLEKSFDED 120

Query: 121 NKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWKTKQLERDYDVLKASYDLLLSNYDSVI 180
           NKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWKTKQLERDYDVLKASYDLLLSNYDSVI
Sbjct: 121 NKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWKTKQLERDYDVLKASYDLLLSNYDSVI 180

Query: 181 KENADLKSQVTSLTEKFLAKELDGGGEAPIPCVTSEPLLADIGHVSAPHSSRKAEDRLSS 240
           KENADLKSQV SLTEK LAKELDGGGEAPIPCVTSEPLLADIGHVSAPHSSRKAEDRLSS
Sbjct: 181 KENADLKSQVASLTEKCLAKELDGGGEAPIPCVTSEPLLADIGHVSAPHSSRKAEDRLSS 240

Query: 241 GSDGSTVIDDNCRQLIDCCDSYFPSNEYLQCAPLPNGLQMEHDDSNNNSNYLFSDMFAVT 286
           GSD STVIDDNCRQL DCCDSYFPSNEYLQCAPLPNGLQMEHDDSNNNSNYLFSDMFAVT
Sbjct: 241 GSDDSTVIDDNCRQLTDCCDSYFPSNEYLQCAPLPNGLQMEHDDSNNNSNYLFSDMFAVT 300

BLAST of Cp4.1LG03g07630 vs. ExPASy TrEMBL
Match: A0A6J1F7P1 (homeobox-leucine zipper protein HAT5-like OS=Cucurbita moschata OX=3662 GN=LOC111441616 PE=4 SV=1)

HSP 1 Score: 573 bits (1477), Expect = 1.66e-206
Identity = 278/286 (97.20%), Positives = 281/286 (98.25%), Query Frame = 0

Query: 1   MESGRFLFNPPPCGGNMLYLGGAGSDPVLRGGRTMISMTMHESPKGRPFFRSPDDLYDDE 60
           MESGRFLFNPPPCGGNMLYLGGAG DPVLRGGRTM+SMTMHESPKGRPFFRSPDDLYDDE
Sbjct: 1   MESGRFLFNPPPCGGNMLYLGGAGGDPVLRGGRTMMSMTMHESPKGRPFFRSPDDLYDDE 60

Query: 61  YYDEFYPEKKRRLTHDQVQMLEKSFDEENKLEPERKSQLAKKLGLQPRQVAVWFQNRRAR 120
           YYDEFYPEKKRRLTHDQVQMLEKSFDE+NKLEPERKSQLAKKLGLQPRQVAVWFQNRRAR
Sbjct: 61  YYDEFYPEKKRRLTHDQVQMLEKSFDEDNKLEPERKSQLAKKLGLQPRQVAVWFQNRRAR 120

Query: 121 WKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVTSLTEKFLAKELDGGGEAPIPC 180
           WKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQV SLTEK LAKELDGGGEAPIPC
Sbjct: 121 WKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVASLTEKCLAKELDGGGEAPIPC 180

Query: 181 VTSEPLLADIGHVSAPHSSRKAEDRLSSGSDGSTVIDDNCRQLIDCCDSYFPSNEYLQCA 240
           VTSEPLLADIGHVSAPHSSRKAEDRLSSGSD STVIDDNCRQL DCCDSYFPSNEYLQCA
Sbjct: 181 VTSEPLLADIGHVSAPHSSRKAEDRLSSGSDDSTVIDDNCRQLTDCCDSYFPSNEYLQCA 240

Query: 241 PLPNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGMGGPPAWWTWP 286
           PLPNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEG+GGPPAWWTWP
Sbjct: 241 PLPNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGLGGPPAWWTWP 286

BLAST of Cp4.1LG03g07630 vs. ExPASy TrEMBL
Match: A0A6J1J4X1 (homeobox-leucine zipper protein HAT5-like OS=Cucurbita maxima OX=3661 GN=LOC111481400 PE=4 SV=1)

HSP 1 Score: 562 bits (1448), Expect = 4.22e-202
Identity = 276/286 (96.50%), Positives = 280/286 (97.90%), Query Frame = 0

Query: 1   MESGRFLFNPPPCGGNMLYLGGAGSDPVLRGGRTMISMTMHESPKGRPFFRSPDDLYDDE 60
           MESGRFLFNP PCGGNMLYLGGAG DPVLRGGRTMISMTMHESPKGRPFF+SPDDLYDDE
Sbjct: 1   MESGRFLFNPRPCGGNMLYLGGAGGDPVLRGGRTMISMTMHESPKGRPFFQSPDDLYDDE 60

Query: 61  YYDEFYPEKKRRLTHDQVQMLEKSFDEENKLEPERKSQLAKKLGLQPRQVAVWFQNRRAR 120
           YYDEFYPEKKRRLTHDQVQMLEKSFDEENKLEPERKSQLAKKLGLQPRQVAVWFQNRRAR
Sbjct: 61  YYDEFYPEKKRRLTHDQVQMLEKSFDEENKLEPERKSQLAKKLGLQPRQVAVWFQNRRAR 120

Query: 121 WKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVTSLTEKFLAKELDGGGEAPIPC 180
           WKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQV SLTEK LAKELDGG EAPIPC
Sbjct: 121 WKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVASLTEKCLAKELDGG-EAPIPC 180

Query: 181 VTSEPLLADIGHVSAPHSSRKAEDRLSSGSDGSTVIDDNCRQLIDCCDSYFPSNEYLQCA 240
           VTSEPLLADIG+VS PHSSRKAEDRLSSGSDGSTVIDDNCRQLIDCCDSYFPSNEYLQCA
Sbjct: 181 VTSEPLLADIGNVSTPHSSRKAEDRLSSGSDGSTVIDDNCRQLIDCCDSYFPSNEYLQCA 240

Query: 241 PLPNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGMGGPPAWWTWP 286
           PLPNGLQMEHD+SNNNSNYLFSDMFAVTGQQNQEG+GGPPAWWTWP
Sbjct: 241 PLPNGLQMEHDNSNNNSNYLFSDMFAVTGQQNQEGLGGPPAWWTWP 285

BLAST of Cp4.1LG03g07630 vs. ExPASy TrEMBL
Match: A0A5D3CRJ0 (Homeobox-leucine zipper protein HAT5 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1119G00170 PE=4 SV=1)

HSP 1 Score: 470 bits (1209), Expect = 1.03e-165
Identity = 237/285 (83.16%), Positives = 255/285 (89.47%), Query Frame = 0

Query: 1   MESGRFLFNPPPCGGNMLYLGGAGSDPVLRGGRTMISMTMHESPKGRPFFRSPDDLYDDE 60
           MESGRFLFNPPP GGNMLYLGGAG DP LRGGRTM+SM+M+ESPKGRPFFRSPDDLYDDE
Sbjct: 1   MESGRFLFNPPPYGGNMLYLGGAGGDPCLRGGRTMMSMSMNESPKGRPFFRSPDDLYDDE 60

Query: 61  YYDEFYPEKKRRLTHDQVQMLEKSFDEENKLEPERKSQLAKKLGLQPRQVAVWFQNRRAR 120
           YYDEFYPEKKRRLTHDQVQMLEK+F+EENKLEPERKSQLAKKLGLQPRQVAVWFQNRRAR
Sbjct: 61  YYDEFYPEKKRRLTHDQVQMLEKNFEEENKLEPERKSQLAKKLGLQPRQVAVWFQNRRAR 120

Query: 121 WKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVTSLTEKFLAKELDGGGEAPIPC 180
           WKTKQLERDYDVLKASYD L+SNYD+++KENA LKS+V SLTEK LAKELDGG EA IP 
Sbjct: 121 WKTKQLERDYDVLKASYDSLVSNYDAIVKENAVLKSEVASLTEKCLAKELDGG-EATIPS 180

Query: 181 VTSEPLLADIGHVSAPHSSRKAEDRLSSGSDGSTVIDDNCRQLIDCCDSYFPSNEYLQCA 240
           +TSE LLADI ++S P S RKAEDRLSSGSD S V+DDNC QLID  DSYFPS EY QCA
Sbjct: 181 ITSELLLADITNISIPQSGRKAEDRLSSGSDSSAVVDDNCPQLIDSGDSYFPSIEYPQCA 240

Query: 241 PLPNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGMGGPPAWWTW 285
            LPNGLQME +D+N+NSNYLFSDMFA T QQ+QEG   PPAWW W
Sbjct: 241 HLPNGLQMEDNDTNDNSNYLFSDMFATTNQQSQEGR--PPAWWAW 282

BLAST of Cp4.1LG03g07630 vs. ExPASy TrEMBL
Match: A0A1S3B860 (homeobox-leucine zipper protein HAT5 OS=Cucumis melo OX=3656 GN=LOC103487074 PE=4 SV=1)

HSP 1 Score: 470 bits (1209), Expect = 1.03e-165
Identity = 237/285 (83.16%), Positives = 255/285 (89.47%), Query Frame = 0

Query: 1   MESGRFLFNPPPCGGNMLYLGGAGSDPVLRGGRTMISMTMHESPKGRPFFRSPDDLYDDE 60
           MESGRFLFNPPP GGNMLYLGGAG DP LRGGRTM+SM+M+ESPKGRPFFRSPDDLYDDE
Sbjct: 1   MESGRFLFNPPPYGGNMLYLGGAGGDPCLRGGRTMMSMSMNESPKGRPFFRSPDDLYDDE 60

Query: 61  YYDEFYPEKKRRLTHDQVQMLEKSFDEENKLEPERKSQLAKKLGLQPRQVAVWFQNRRAR 120
           YYDEFYPEKKRRLTHDQVQMLEK+F+EENKLEPERKSQLAKKLGLQPRQVAVWFQNRRAR
Sbjct: 61  YYDEFYPEKKRRLTHDQVQMLEKNFEEENKLEPERKSQLAKKLGLQPRQVAVWFQNRRAR 120

Query: 121 WKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVTSLTEKFLAKELDGGGEAPIPC 180
           WKTKQLERDYDVLKASYD L+SNYD+++KENA LKS+V SLTEK LAKELDGG EA IP 
Sbjct: 121 WKTKQLERDYDVLKASYDSLVSNYDAIVKENAVLKSEVASLTEKCLAKELDGG-EATIPS 180

Query: 181 VTSEPLLADIGHVSAPHSSRKAEDRLSSGSDGSTVIDDNCRQLIDCCDSYFPSNEYLQCA 240
           +TSE LLADI ++S P S RKAEDRLSSGSD S V+DDNC QLID  DSYFPS EY QCA
Sbjct: 181 ITSELLLADITNISIPQSGRKAEDRLSSGSDSSAVVDDNCPQLIDSGDSYFPSIEYPQCA 240

Query: 241 PLPNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGMGGPPAWWTW 285
            LPNGLQME +D+N+NSNYLFSDMFA T QQ+QEG   PPAWW W
Sbjct: 241 HLPNGLQMEDNDTNDNSNYLFSDMFATTNQQSQEGR--PPAWWAW 282

BLAST of Cp4.1LG03g07630 vs. ExPASy TrEMBL
Match: A0A6J1EQW9 (homeobox-leucine zipper protein HAT5 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111435068 PE=4 SV=1)

HSP 1 Score: 459 bits (1180), Expect = 3.61e-161
Identity = 242/294 (82.31%), Positives = 254/294 (86.39%), Query Frame = 0

Query: 1   MESGRFLFNPPPCGGNMLYLGGAG-SDPVLRGGRTMISMTMH----ESPKGRPFFRSPDD 60
           MESGRFLFNPP  GGNML LGGAG SD  LR GRTM+SM+M+    ESPKGRPFFRSPDD
Sbjct: 1   MESGRFLFNPPAYGGNMLCLGGAGGSDRFLREGRTMMSMSMNMSMQESPKGRPFFRSPDD 60

Query: 61  LYDDEYYDEFYPEKKRRLTHDQVQMLEKSFDEENKLEPERKSQLAKKLGLQPRQVAVWFQ 120
           LYDDEYYDE YPEKKRRL ++QVQMLEKSF+EENKLEPERKSQLAKKLGLQPRQVAVWFQ
Sbjct: 61  LYDDEYYDELYPEKKRRLANEQVQMLEKSFEEENKLEPERKSQLAKKLGLQPRQVAVWFQ 120

Query: 121 NRRARWKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVTSLTEKFLAKELDGGGE 180
           NRRARWKTKQLERDYDVLKASYDLL+SNYDS++KENA LKS+V SLTEK +AKELDGG E
Sbjct: 121 NRRARWKTKQLERDYDVLKASYDLLMSNYDSIVKENAVLKSEVASLTEKCVAKELDGG-E 180

Query: 181 APIPCVTSEPLLADIGHVSAPHSS---RKAEDRLSSGSDGSTVIDDNCRQLIDCCDSYFP 240
           APIP  T EPLLAD  HVSAPHS    RKAEDRLSSGSD S VIDDNC QLID  DSYFP
Sbjct: 181 APIPRTTLEPLLADTAHVSAPHSGGSGRKAEDRLSSGSDSSAVIDDNCLQLIDSGDSYFP 240

Query: 241 SNEYLQCAPLPNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGMGGPPAWWTWP 286
           SNEY Q APLP GLQMEHDD N+NSNYLFSDMFA T QQNQE  GGPPAWW WP
Sbjct: 241 SNEYPQRAPLPPGLQMEHDDRNDNSNYLFSDMFAETNQQNQE--GGPPAWWAWP 291

BLAST of Cp4.1LG03g07630 vs. TAIR 10
Match: AT3G01470.1 (homeobox 1)

HSP 1 Score: 274.2 bits (700), Expect = 1.1e-73
Identity = 155/288 (53.82%), Positives = 196/288 (68.06%), Query Frame = 0

Query: 1   MESGRFLFNPPPCGGNMLYLGGAGSDPVLRGGRTMISMTMHESPKGRPFFRSPDDLYDDE 60
           MES  F F+P    GN ++  G   +PV++GG     M M E+ K RPFF SP+DLYDD+
Sbjct: 1   MESNSFFFDPSASHGNSMFFLG-NLNPVVQGGGARSMMNMEETSKRRPFFSSPEDLYDDD 60

Query: 61  YYDEFYPEKKRRLTHDQVQMLEKSFDEENKLEPERKSQLAKKLGLQPRQVAVWFQNRRAR 120
           +YD+  PEKKRRLT +QV +LEKSF+ ENKLEPERK+QLAKKLGLQPRQVAVWFQNRRAR
Sbjct: 61  FYDDQLPEKKRRLTTEQVHLLEKSFETENKLEPERKTQLAKKLGLQPRQVAVWFQNRRAR 120

Query: 121 WKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVTSLTEKFLAKELDGGGEAP--I 180
           WKTKQLERDYD+LK++YD LLSNYDS++ +N  L+S+VTSLTEK   K+ +   E P  +
Sbjct: 121 WKTKQLERDYDLLKSTYDQLLSNYDSIVMDNDKLRSEVTSLTEKLQGKQ-ETANEPPGQV 180

Query: 181 PCVTSEPLLADIGHVSAPHSSRKAEDRLSSGSDGSTVIDDNCRQLIDCCDSYFPSNEYLQ 240
           P    EP   D  +++A  ++ K EDRLSSGS GS V+DD+  QL+D CDSYFPS   +Q
Sbjct: 181 P----EPNQLDPVYINA--AAIKTEDRLSSGSVGSAVLDDDAPQLLDSCDSYFPSIVPIQ 240

Query: 241 CAPLPNGLQMEHDDSNNNSNYLFSDMFAVTGQQNQEGMGGPPAWWTWP 287
                N    +HD    N    F+D+F  T   + +  G   A+W WP
Sbjct: 241 ----DNSNASDHD----NDRSCFADVFVPTTSPSHDHHGESLAFWGWP 272

BLAST of Cp4.1LG03g07630 vs. TAIR 10
Match: AT2G22430.1 (homeobox protein 6)

HSP 1 Score: 128.3 bits (321), Expect = 1.0e-29
Identity = 67/107 (62.62%), Positives = 84/107 (78.50%), Query Frame = 0

Query: 68  EKKRRLTHDQVQMLEKSFDEENKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWKTKQLE 127
           EKKRRL+ +QV+ LEK+F+ ENKLEPERK +LA++LGLQPRQVAVWFQNRRARWKTKQLE
Sbjct: 61  EKKRRLSINQVKALEKNFELENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLE 120

Query: 128 RDYDVLKASYDLLLSNYDSVIKENADLKSQVTSLTEKFLAKELDGGG 175
           +DY VLK  YD L  N+DS+ ++N  L  +++ L  K     L+GGG
Sbjct: 121 KDYGVLKTQYDSLRHNFDSLRRDNESLLQEISKLKTK-----LNGGG 162

BLAST of Cp4.1LG03g07630 vs. TAIR 10
Match: AT4G40060.1 (homeobox protein 16)

HSP 1 Score: 127.1 bits (318), Expect = 2.3e-29
Identity = 82/182 (45.05%), Positives = 109/182 (59.89%), Query Frame = 0

Query: 57  YDDEYYDEFYPEKKRRLTHDQVQMLEKSFDEENKLEPERKSQLAKKLGLQPRQVAVWFQN 116
           Y   ++     EKKRRL  DQV+ LEK+F+ ENKLEPERK++LA++LGLQPRQVAVWFQN
Sbjct: 47  YSGNHHHMGLSEKKRRLKVDQVKALEKNFELENKLEPERKTKLAQELGLQPRQVAVWFQN 106

Query: 117 RRARWKTKQLERDYDVLKASYDLLLSNYDSVIKENADLKSQVTSLTEKFLAKELDGGGEA 176
           RRARWKTKQLE+DY VLK  YD L  N+DS+ ++N  L  +++ +  K   +E +   +A
Sbjct: 107 RRARWKTKQLEKDYGVLKGQYDSLRHNFDSLRRDNDSLLQEISKIKAKVNGEEDNNNNKA 166

Query: 177 PIPCVTSEPLLADIGHVSAP-----HSS----RKAEDRLSSGSDGSTVIDDNCRQLIDCC 230
               V  E +       S+P     HSS    R++   L      STV++       D C
Sbjct: 167 ITEGVKEEEVHKTDSIPSSPLQFLEHSSGFNYRRSFTDLRDLLPNSTVVEAGSS---DSC 225

BLAST of Cp4.1LG03g07630 vs. TAIR 10
Match: AT5G65310.1 (homeobox protein 5)

HSP 1 Score: 122.9 bits (307), Expect = 4.3e-28
Identity = 63/105 (60.00%), Positives = 81/105 (77.14%), Query Frame = 0

Query: 68  EKKRRLTHDQVQMLEKSFDEENKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWKTKQLE 127
           EKKRRL  +QV+ LEK+F+ +NKLEPERK +LA++LGLQPRQVA+WFQNRRARWKTKQLE
Sbjct: 71  EKKRRLGVEQVKALEKNFEIDNKLEPERKVKLAQELGLQPRQVAIWFQNRRARWKTKQLE 130

Query: 128 RDYDVLKASYDLLLSNYDSVIKENADLKSQVTSLTEKFLAKELDG 173
           RDY VLK+++D L  N DS+ ++N  L  Q+  L  K   + + G
Sbjct: 131 RDYGVLKSNFDALKRNRDSLQRDNDSLLGQIKELKAKLNVEGVKG 175

BLAST of Cp4.1LG03g07630 vs. TAIR 10
Match: AT5G65310.2 (homeobox protein 5 )

HSP 1 Score: 122.9 bits (307), Expect = 4.3e-28
Identity = 63/105 (60.00%), Positives = 81/105 (77.14%), Query Frame = 0

Query: 68  EKKRRLTHDQVQMLEKSFDEENKLEPERKSQLAKKLGLQPRQVAVWFQNRRARWKTKQLE 127
           EKKRRL  +QV+ LEK+F+ +NKLEPERK +LA++LGLQPRQVA+WFQNRRARWKTKQLE
Sbjct: 53  EKKRRLGVEQVKALEKNFEIDNKLEPERKVKLAQELGLQPRQVAIWFQNRRARWKTKQLE 112

Query: 128 RDYDVLKASYDLLLSNYDSVIKENADLKSQVTSLTEKFLAKELDG 173
           RDY VLK+++D L  N DS+ ++N  L  Q+  L  K   + + G
Sbjct: 113 RDYGVLKSNFDALKRNRDSLQRDNDSLLGQIKELKAKLNVEGVKG 157
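
The percentages in each HSP header above are the identical (or positive-scoring) column counts divided by the alignment length. The following is a minimal sketch of how those figures could be recomputed from a header line; the function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of the database's tooling.

```python
import re

# Hypothetical helper: matches the "Identity = a/b ..., Positives = c/d ..." part
# of an HSP header line. "Pos\w+" tolerates spelling variants of "Positives".
HSP_STATS = re.compile(r"Identity = (\d+)/(\d+).*?Pos\w+ = (\d+)/(\d+)")


def hsp_percentages(line):
    """Return (identity %, positives %) rounded to two decimals."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2))


if __name__ == "__main__":
    header = "Identity = 67/107 (62.62%), Positives = 84/107 (78.50%), Query Frame = 0"
    print(hsp_percentages(header))
```

For the AT2G22430.1 hit, 67 identical positions over 107 aligned columns gives 62.62%, matching the value reported in the header.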

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q02283 | 1.6e-72 | 53.82 | Homeobox-leucine zipper protein HAT5 OS=Arabidopsis thaliana OX=3702 GN=HAT5 PE=...
Q6YWR4 | 9.5e-57 | 43.44 | Homeobox-leucine zipper protein HOX16 OS=Oryza sativa subsp. japonica OX=39947 G...
A2X980 | 1.6e-56 | 43.19 | Homeobox-leucine zipper protein HOX16 OS=Oryza sativa subsp. indica OX=39946 GN=...
Q9XH36 | 4.3e-41 | 51.08 | Homeobox-leucine zipper protein HOX5 OS=Oryza sativa subsp. indica OX=39946 GN=H...
Q6ZA74 | 4.3e-41 | 51.08 | Homeobox-leucine zipper protein HOX5 OS=Oryza sativa subsp. japonica OX=39947 GN...

Match Name | E-value | Identity (%) | Description
XP_023526286.1 | 5.51e-212 | 100.00 | homeobox-leucine zipper protein HAT5-like [Cucurbita pepo subsp. pepo]
XP_022934435.1 | 3.43e-206 | 97.20 | homeobox-leucine zipper protein HAT5-like [Cucurbita moschata]
KAG6581018.1 | 1.98e-205 | 96.85 | Homeobox-leucine zipper protein HAT5, partial [Cucurbita argyrosperma subsp. sor...
XP_022982559.1 | 8.71e-202 | 96.50 | homeobox-leucine zipper protein HAT5-like [Cucurbita maxima]
KAG7017759.1 | 4.01e-199 | 87.42 | Homeobox-leucine zipper protein HAT5 [Cucurbita argyrosperma subsp. argyrosperma...

Match Name | E-value | Identity (%) | Description
A0A6J1F7P1 | 1.66e-206 | 97.20 | homeobox-leucine zipper protein HAT5-like OS=Cucurbita moschata OX=3662 GN=LOC11...
A0A6J1J4X1 | 4.22e-202 | 96.50 | homeobox-leucine zipper protein HAT5-like OS=Cucurbita maxima OX=3661 GN=LOC1114...
A0A5D3CRJ0 | 1.03e-165 | 83.16 | Homeobox-leucine zipper protein HAT5 OS=Cucumis melo var. makuwa OX=1194695 GN=E...
A0A1S3B860 | 1.03e-165 | 83.16 | homeobox-leucine zipper protein HAT5 OS=Cucumis melo OX=3656 GN=LOC103487074 PE=...
A0A6J1EQW9 | 3.61e-161 | 82.31 | homeobox-leucine zipper protein HAT5 isoform X1 OS=Cucurbita moschata OX=3662 GN...

Match Name | E-value | Identity (%) | Description
AT3G01470.1 | 1.1e-73 | 53.82 | homeobox 1
AT2G22430.1 | 1.0e-29 | 62.62 | homeobox protein 6
AT4G40060.1 | 2.3e-29 | 45.05 | homeobox protein 16
AT5G65310.1 | 4.3e-28 | 60.00 | homeobox protein 5
AT5G65310.2 | 4.3e-28 | 60.00 | homeobox protein 5

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 144..164
None | No IPR available | GENE3D | 1.10.10.60 | | coord: 64..130; e-value: 2.2E-20; score: 73.9
None | No IPR available | PANTHER | PTHR24326 | HOMEOBOX-LEUCINE ZIPPER PROTEIN | coord: 1..252
None | No IPR available | PANTHER | PTHR24326:SF497 | HOMEOBOX-LEUCINE ZIPPER PROTEIN HAT5 | coord: 1..252
IPR000047 | Helix-turn-helix motif | PRINTS | PR00031 | HTHREPRESSR | coord: 95..104, score: 48.54; coord: 104..120, score: 59.22
IPR001356 | Homeobox domain | SMART | SM00389 | HOX_1 | coord: 67..128; e-value: 1.7E-19; score: 80.8
IPR001356 | Homeobox domain | PFAM | PF00046 | Homeodomain | coord: 69..122; e-value: 2.4E-18; score: 65.7
IPR001356 | Homeobox domain | PROSITE | PS50071 | HOMEOBOX_2 | coord: 64..124; score: 17.604715
IPR001356 | Homeobox domain | CDD | cd00086 | homeodomain | coord: 69..125; e-value: 1.78345E-19; score: 78.054
IPR003106 | Leucine zipper, homeobox-associated | PFAM | PF02183 | HALZ | coord: 124..164; e-value: 2.3E-16; score: 59.7
IPR017970 | Homeobox, conserved site | PROSITE | PS00027 | HOMEOBOX_1 | coord: 99..122
IPR009057 | Homeobox-like domain superfamily | SUPERFAMILY | 46689 | Homeodomain-like | coord: 65..126
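
The InterPro coordinates above place the Pfam homeodomain (69..122) immediately upstream of the homeobox-associated leucine zipper (124..164), with the predicted coiled coil (144..164) lying inside the zipper. A minimal sketch of that comparison follows; the `Region` type and `overlap` helper are hypothetical names introduced for illustration, and coordinates are assumed to be 1-based and inclusive, as in the table.

```python
from typing import NamedTuple


class Region(NamedTuple):
    """A named, 1-based inclusive interval on the protein (illustrative type)."""
    name: str
    start: int
    end: int


def overlap(a, b):
    """Number of residues shared by two inclusive intervals (0 if disjoint)."""
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


# Coordinates copied from the InterPro annotation rows for this gene.
homeodomain = Region("PF00046 Homeodomain", 69, 122)
halz = Region("PF02183 HALZ", 124, 164)
coil = Region("COILS Coil", 144, 164)

if __name__ == "__main__":
    print(overlap(homeodomain, halz))  # homeodomain and zipper do not overlap
    print(overlap(halz, coil))         # coiled coil falls within the zipper
```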

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG03g07630.1 | Cp4.1LG03g07630.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006357 | regulation of transcription by RNA polymerase II
biological_process | GO:0006355 | regulation of transcription, DNA-templated
cellular_component | GO:0005634 | nucleus
molecular_function | GO:0000981 | DNA-binding transcription factor activity, RNA polymerase II-specific
molecular_function | GO:0043565 | sequence-specific DNA binding
molecular_function | GO:0003677 | DNA binding