Cp4.1LG13g08820.1 (mRNA) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG13g08820.1
TypemRNA
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Descriptionhomeobox-leucine zipper protein HAT5-like
LocationCp4.1LG13: 7748511 .. 7751543 (+)
Sequence length1504
RNA-Seq ExpressionCp4.1LG13g08820.1
SyntenyCp4.1LG13g08820.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTACCCGCTGATAACAACCAGGGTTGTATAGTGCAGCGTTGAATCCCGTCCGCCGGGGGACTCTTTATTTCCCGGTCGCCGGAGTTTCTGTTTCCGATCAAGCTTGTATTCTTTCTTCTTCTTTTTTTCTTTCTAATTCTCTGTTTTTATTTAGCGTTCATTCTGTTCTTCGCTTTTTCTCAGGTTTGTTTTTGTGTTTTTTCTCAGAGGTTTTCGAGACTGACGAAGCGCTGGGTAATAACAGAGCTGGTCGTGTGATGGGGTTTTGTATTTCTCCATTGGGAACACCGGCTAGGTTACTGTGGAGCACCAGCTTCTTCCGCCACAAGCTCATGATTTTCTAATTTTCAGCCCTATCGCTGAAAATACCAAATTCCATCTCATTCGTTTCTCGTTTTAATAAAAGATTTCCCTCGATTTCGTGTCTCTGTGCCAATCTGCCGCCGCCGTAAATCGCGGGTGATCCCCTGCTCACGTTTCCCGTTTCAAATTGAAGAGAATAATTGATGATGTCGGACGGATTGATCTTCAACCGATCTCCAGGTCACGCCAACATGCTGTTTTTCGGGAGTGGCGATTCCATTCTTCGAGGTTATTTCATTTTAGTTCAGTTATGAGCTTTTAATTTTAGGGATCTTTAACTATATACAATTCCTTTAAACTGTAATTCTGTAAATGTTAGGATCATGTTTCACGAAGGGGATGGAGGAAAATTCGAAGCGACCCTTTTTCAGCTCTCCGGATGACCTGTTCGATGATGACTACTACGATGACCAGCCGCCTGAGAAGAAGCGCCGGCTAACTCAAGAGCAGGTATTTAAGCAACTCAATTCCGAAAATATCATGCAATTATTCCATTTTTTTGTTTCTTCTTTACTTAATGGGTGGGGATGGAATTGTGGAATCTGAAGTCTGAACTGGATGCTTATACTTATGGCATATTCCTTTGATTCAATGTGCTGTTGCTTTAATGCGACTACCTTTGAATCAATGCTGATCTCCTTGTTTAGTTCAGTGGTGAAAAAAAATTAAGCATTATTCTCGGATTCTGAAACTTCAATCTCTCTAGATAAATTGCTCGAGGAAAATTCCCATGGGCAATCGTTTATGTGGCTAATATTTGGCCTGTTTAGAATTACTTTATAAGTGCTTACAAGCTTTTTAAGTGCAAAAAAATTGTGTTCTTCGTTTATGTGTCTAACATAGAGCTTTTATGCTTAGAGTTTTTAGAGGGATTGAGAAAGACTCTTTCGATGTTAGGTCTCTTGTGAGATTTAATGTCTCTCTTTGGATGTCGGTGACGAGTTTCTTTTGTAATTACTCGCTTGATGTTATTTTGCTTGATTGGAGGCCCTTGCTTTAGGGGCCTCCCTTTTGTGGGATTGTTTTTTTTTTTTGTACGTCCGTGTATTCTTTCGTTTTTCTTAATGCAAGTTGGTCTTTTATTAAAAAGATTGCAAAAAACGTGTTTTGAGCGCTTAAAAGGTCATTTCAAACGGACCCATAATTGGTACAATCCATCAAGTTCATCTTGAGATTATGGTTTTCTTCTGGTCTGTTTTCTGTTTCTTTTATTGTACTTATTTTTAATCCTCCATTGTTCTCATTTCTTAGTAACAAATGAGACTGACCATGGCCAATGCTTAGGTGCATCTGCTGGAGATTAGCTTTGAGGCAGAAAACAAACTGGAGCCAGAGCGCAAGACTGAGCTAGCAAAGAAGCTGGGTCTGCAGCCAAGGCAAGTGGCTGTGTGGTTTCAAAACCGCCGAGCTCGATGGAAGACAAAGCAGCTTGAAAGGGACTATGATCTTCTCAAATCTTCATATGATTCCTTTTGTTCTAGCTACGACTTTATGGCCAAGGAGAATGAAAGACTCAAAGCCGAGGTATTTATGCTCCAAATCTATTATTAATATTTTTCATAAACCTTTGATATGGTATGATAATCCTTATAGATGCTGAAAGCTCAACATTTTTTGATATTTGCTTGCTTGGCTGTGTTTGATCTGACATATCTTCTGCACATAAATTTAAGTAGATTCTTTGCAATCCTTTAGGATGGTGGTCCATGGACTGTTTATCTTAGTTTCTTAAGACGACTTGGTAACTTCTAGTTAATATTACAAGTTTCATCATCACATCTGGTCTAATAACAACGTCTTTTAACCTCGTGGTCCTATTATCGAATTGGACTTTTTGGGCTCCTCATCGAGTTAGTCTAGCATAGGTTCTGAACCCAAATTTCTGGGGGTCAAGAGAAGATATGGATTCTTTTCCCAGATATGATGGTAATCTTTCATAGGAAAAGGAATGGTGAAATTAGTTAAAGAGGGACTTGGTATTGTTACTGAGCCAGGGGTCTTATGCATTCTCAATTACTGAACTCAGGTAGCTTCTTTAACTGAGAAACTTCAAGCTAAAGAAGTGGTTGAATCATCGTTTCAAGCTAAGAAATCCGAGCCAATTCTGGAAGATCAACTCCTTGTTTCTGTTGAGCAACACAATATGAAGATTGATGACCATCATAGTTGCAGGAGCAATGGAAGCGCTGTGCTGGATGAGGATGGTCCTCAACTCTTAGACAGTGGGGATTCATACCTCCTGAGCAGTGACTACGATGGCTGTGTTTTGCCAGTTTTTGGAGTGAACTCGGAGGAGGAGGATGTGAGTGATGACGGTCAGGGCTACTTCTCGGATGTTTATACAGCAGCCGATCAGCAGACTCACGAGGCAGAGCCATTGACCTGGTGGGACTGGTCGTTGTAAATGCGCGCTCGAATCATTTCGAAATTTAGTAGGATGGAACTACAATATAATGTTTACTGTATGATACTGATGAATAAATCTAATGCGAACATTACTTAGTGTATGGATAAAATTTAGACCTTGTACCGTTACATGGTTCTTCTGCGTTTATATGAATTCCCTTGGAAAGTTGGCCACCTTTAGGACTGGCAAAGGTGTGAGAACCATTCTAATTTTGAAACAAAATTTGGTTTAAGGTTGTGGAAGGTATTAGTGTAACACTCCAA

mRNA sequence

TTACCCGCTGATAACAACCAGGGTTGTATAGTGCAGCGTTGAATCCCGTCCGCCGGGGGACTCTTTATTTCCCGGTCGCCGGAGTTTCTGTTTCCGATCAAGCTTAGGTTTTCGAGACTGACGAAGCGCTGGGTAATAACAGAGCTGGTCGTGTGATGGGGTTTTGTATTTCTCCATTGGGAACACCGGCTAGGTTACTGTGGAGCACCAGCTTCTTCCGCCACAAGCTCATGATTTTCTAATTTTCAGCCCTATCGCTGAAAATACCAAATTCCATCTCATTCGTTTCTCGTTTTAATAAAAGATTTCCCTCGATTTCGTGTCTCTGTGCCAATCTGCCGCCGCCGTAAATCGCGGGTGATCCCCTGCTCACGTTTCCCGTTTCAAATTGAAGAGAATAATTGATGATGTCGGACGGATTGATCTTCAACCGATCTCCAGGTCACGCCAACATGCTGTTTTTCGGGAGTGGCGATTCCATTCTTCGAGGATCATGTTTCACGAAGGGGATGGAGGAAAATTCGAAGCGACCCTTTTTCAGCTCTCCGGATGACCTGTTCGATGATGACTACTACGATGACCAGCCGCCTGAGAAGAAGCGCCGGCTAACTCAAGAGCAGGTGCATCTGCTGGAGATTAGCTTTGAGGCAGAAAACAAACTGGAGCCAGAGCGCAAGACTGAGCTAGCAAAGAAGCTGGGTCTGCAGCCAAGGCAAGTGGCTGTGTGGTTTCAAAACCGCCGAGCTCGATGGAAGACAAAGCAGCTTGAAAGGGACTATGATCTTCTCAAATCTTCATATGATTCCTTTTGTTCTAGCTACGACTTTATGGCCAAGGAGAATGAAAGACTCAAAGCCGAGGTAGCTTCTTTAACTGAGAAACTTCAAGCTAAAGAAGTGGTTGAATCATCGTTTCAAGCTAAGAAATCCGAGCCAATTCTGGAAGATCAACTCCTTGTTTCTGTTGAGCAACACAATATGAAGATTGATGACCATCATAGTTGCAGGAGCAATGGAAGCGCTGTGCTGGATGAGGATGGTCCTCAACTCTTAGACAGTGGGGATTCATACCTCCTGAGCAGTGACTACGATGGCTGTGTTTTGCCAGTTTTTGGAGTGAACTCGGAGGAGGAGGATGTGAGTGATGACGGTCAGGGCTACTTCTCGGATGTTTATACAGCAGCCGATCAGCAGACTCACGAGGCAGAGCCATTGACCTGGTGGGACTGGTCGTTGTAAATGCGCGCTCGAATCATTTCGAAATTTAGTAGGATGGAACTACAATATAATGTTTACTGTATGATACTGATGAATAAATCTAATGCGAACATTACTTAGTGTATGGATAAAATTTAGACCTTGTACCGTTACATGGTTCTTCTGCGTTTATATGAATTCCCTTGGAAAGTTGGCCACCTTTAGGACTGGCAAAGGTGTGAGAACCATTCTAATTTTGAAACAAAATTTGGTTTAAGGTTGTGGAAGGTATTAGTGTAACACTCCAA

Coding sequence (CDS)

ATGATGTCGGACGGATTGATCTTCAACCGATCTCCAGGTCACGCCAACATGCTGTTTTTCGGGAGTGGCGATTCCATTCTTCGAGGATCATGTTTCACGAAGGGGATGGAGGAAAATTCGAAGCGACCCTTTTTCAGCTCTCCGGATGACCTGTTCGATGATGACTACTACGATGACCAGCCGCCTGAGAAGAAGCGCCGGCTAACTCAAGAGCAGGTGCATCTGCTGGAGATTAGCTTTGAGGCAGAAAACAAACTGGAGCCAGAGCGCAAGACTGAGCTAGCAAAGAAGCTGGGTCTGCAGCCAAGGCAAGTGGCTGTGTGGTTTCAAAACCGCCGAGCTCGATGGAAGACAAAGCAGCTTGAAAGGGACTATGATCTTCTCAAATCTTCATATGATTCCTTTTGTTCTAGCTACGACTTTATGGCCAAGGAGAATGAAAGACTCAAAGCCGAGGTAGCTTCTTTAACTGAGAAACTTCAAGCTAAAGAAGTGGTTGAATCATCGTTTCAAGCTAAGAAATCCGAGCCAATTCTGGAAGATCAACTCCTTGTTTCTGTTGAGCAACACAATATGAAGATTGATGACCATCATAGTTGCAGGAGCAATGGAAGCGCTGTGCTGGATGAGGATGGTCCTCAACTCTTAGACAGTGGGGATTCATACCTCCTGAGCAGTGACTACGATGGCTGTGTTTTGCCAGTTTTTGGAGTGAACTCGGAGGAGGAGGATGTGAGTGATGACGGTCAGGGCTACTTCTCGGATGTTTATACAGCAGCCGATCAGCAGACTCACGAGGCAGAGCCATTGACCTGGTGGGACTGGTCGTTGTAA

Protein sequence

MMSDGLIFNRSPGHANMLFFGSGDSILRGSCFTKGMEENSKRPFFSSPDDLFDDDYYDDQPPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTKQLERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKEVVESSFQAKKSEPILEDQLLVSVEQHNMKIDDHHSCRSNGSAVLDEDGPQLLDSGDSYLLSSDYDGCVLPVFGVNSEEEDVSDDGQGYFSDVYTAADQQTHEAEPLTWWDWSL
Homology
BLAST of Cp4.1LG13g08820.1 vs. ExPASy Swiss-Prot
Match: Q02283 (Homeobox-leucine zipper protein HAT5 OS=Arabidopsis thaliana OX=3702 GN=HAT5 PE=1 SV=1)

HSP 1 Score: 255.8 bits (652), Expect = 5.8e-67
Identity = 151/281 (53.74%), Postives = 190/281 (67.62%), Query Frame = 0

Query: 1   MMSDGLIFNRSPGHANMLFF-GSGDSILR--GSCFTKGMEENSK-RPFFSSPDDLFDDDY 60
           M S+   F+ S  H N +FF G+ + +++  G+     MEE SK RPFFSSP+DL+DDD+
Sbjct: 1   MESNSFFFDPSASHGNSMFFLGNLNPVVQGGGARSMMNMEETSKRRPFFSSPEDLYDDDF 60

Query: 61  YDDQPPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAVWFQNRRARW 120
           YDDQ PEKKRRLT EQVHLLE SFE ENKLEPERKT+LAKKLGLQPRQVAVWFQNRRARW
Sbjct: 61  YDDQLPEKKRRLTTEQVHLLEKSFETENKLEPERKTQLAKKLGLQPRQVAVWFQNRRARW 120

Query: 121 KTKQLERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKEVVESSFQAKKSE 180
           KTKQLERDYDLLKS+YD   S+YD +  +N++L++EV SLTEKLQ K+   +    +  E
Sbjct: 121 KTKQLERDYDLLKSTYDQLLSNYDSIVMDNDKLRSEVTSLTEKLQGKQETANEPPGQVPE 180

Query: 181 PILEDQLLVSVEQHNMKIDDHHSCRSNGSAVLDEDGPQLLDSGDSYLLSSDYDGCVLPVF 240
           P   D + ++     +K +D  S  S GSAVLD+D PQLLDS DSY  S      ++P+ 
Sbjct: 181 PNQLDPVYINAAA--IKTEDRLSSGSVGSAVLDDDAPQLLDSCDSYFPS------IVPIQ 240

Query: 241 GVNSEEEDVSDDGQGYFSDVY--TAADQQTHEAEPLTWWDW 276
             NS   D  D+ +  F+DV+  T +    H  E L +W W
Sbjct: 241 D-NSNASD-HDNDRSCFADVFVPTTSPSHDHHGESLAFWGW 271

BLAST of Cp4.1LG13g08820.1 vs. ExPASy Swiss-Prot
Match: Q6YWR4 (Homeobox-leucine zipper protein HOX16 OS=Oryza sativa subsp. japonica OX=39947 GN=HOX16 PE=2 SV=1)

HSP 1 Score: 211.5 bits (537), Expect = 1.2e-53
Identity = 145/343 (42.27%), Postives = 188/343 (54.81%), Query Frame = 0

Query: 1   MMSDGLIFNRSPGHANMLFF----------GSGDSILRGSCFTKGMEENS---KRPFFSS 60
           M S  LIF+ +   A  + F          G G    RG+    GMEE     KRPFF++
Sbjct: 1   MESGRLIFSTAGSGAGQMLFLDCGAGGGGVGGGAMFHRGARPVLGMEEGGRGVKRPFFTT 60

Query: 61  PDDLFDDDYYDDQPPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAV 120
           PD+L +++YYD+Q PEKKRRLT EQVHLLE SFE ENKLEPERKTELA+KLGLQPRQVAV
Sbjct: 61  PDELLEEEYYDEQLPEKKRRLTPEQVHLLERSFEEENKLEPERKTELARKLGLQPRQVAV 120

Query: 121 WFQNRRARWKTKQLERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKE-VV 180
           WFQNRRARWKTKQLERD+D LK+S+D+  + +D + ++N RL ++V SLTEKLQ KE   
Sbjct: 121 WFQNRRARWKTKQLERDFDRLKASFDALRADHDALLQDNHRLHSQVMSLTEKLQEKETTT 180

Query: 181 ESSFQA------------------KKSEPILEDQLLVSVEQ--HNMKIDDHHSCRSNGSA 240
           E S  A                     EP LE+      EQ    +K +D  S  S GSA
Sbjct: 181 EGSAGAAVDVPGLPAAADVKVAVPDAEEPALEEAAAAFEEQQEQQVKAEDRLSTGSGGSA 240

Query: 241 VLDEDGPQLLDSGDSYLLSSD------------YDGCVL-----PVFGVNSEEED--VSD 277
           V+D D   ++  G  +L + D            Y  CV+        G+ SEE+D   SD
Sbjct: 241 VVDTDAQLVVGCGRQHLAAVDSSVESYFPGGDEYHDCVMGPMDHAAGGIQSEEDDGAGSD 300

BLAST of Cp4.1LG13g08820.1 vs. ExPASy Swiss-Prot
Match: A2X980 (Homeobox-leucine zipper protein HOX16 OS=Oryza sativa subsp. indica OX=39946 GN=HOX16 PE=2 SV=1)

HSP 1 Score: 210.7 bits (535), Expect = 2.1e-53
Identity = 145/345 (42.03%), Postives = 188/345 (54.49%), Query Frame = 0

Query: 1   MMSDGLIFNRSPGHANMLFF------------GSGDSILRGSCFTKGMEENS---KRPFF 60
           M S  LIF+ +   A  + F            G G    RG+    GMEE     KRPFF
Sbjct: 1   MESGRLIFSTAGSGAGQMLFLDCGAGGGGGGVGGGAMFHRGARPVLGMEEGGRGVKRPFF 60

Query: 61  SSPDDLFDDDYYDDQPPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQV 120
           ++PD+L +++YYD+Q PEKKRRLT EQVHLLE SFE ENKLEPERKTELA+KLGLQPRQV
Sbjct: 61  TTPDELLEEEYYDEQLPEKKRRLTPEQVHLLERSFEEENKLEPERKTELARKLGLQPRQV 120

Query: 121 AVWFQNRRARWKTKQLERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKE- 180
           AVWFQNRRARWKTKQLERD+D LK+S+D+  + +D + ++N RL ++V SLTEKLQ KE 
Sbjct: 121 AVWFQNRRARWKTKQLERDFDRLKASFDALRADHDALLQDNHRLHSQVMSLTEKLQEKET 180

Query: 181 VVESSFQA------------------KKSEPILEDQLLVSVEQ--HNMKIDDHHSCRSNG 240
             E S  A                     EP LE+      EQ    +K +D  S  S G
Sbjct: 181 TTEGSAGAAVDVPGLPAAADVKVAVPDAEEPALEEAAAAFEEQQEQQVKAEDRLSTGSGG 240

Query: 241 SAVLDEDGPQLLDSGDSYLLSSD------------YDGCVL-----PVFGVNSEEED--V 277
           SAV+D D   ++  G  +L + D            Y  CV+        G+ SEE+D   
Sbjct: 241 SAVVDTDAQLVVGCGRQHLAAVDSSVESYFPGGDEYHDCVMGPMDHAAGGIQSEEDDGAG 300

BLAST of Cp4.1LG13g08820.1 vs. ExPASy Swiss-Prot
Match: Q6ZA74 (Homeobox-leucine zipper protein HOX5 OS=Oryza sativa subsp. japonica OX=39947 GN=HOX5 PE=1 SV=1)

HSP 1 Score: 182.2 bits (461), Expect = 8.1e-45
Identity = 119/290 (41.03%), Postives = 160/290 (55.17%), Query Frame = 0

Query: 1   MMSDGLIFNRSPGHANMLFFGSGDSILRGSCF------TKGMEEN----------SKRPF 60
           +   G+     PG A ML FG G S   G  F        GM+E+          +KRPF
Sbjct: 7   VFDSGVARRACPGGAQMLLFGGGGSANSGGFFRGVPAAVLGMDESRSSSSAAGAGAKRPF 66

Query: 61  FSSPDDLFDDDYYDDQPPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQ 120
           F++ ++L +++YYD+Q PEKKRRLT EQV +LE SFE ENKLEPERKTELA++LG+ PRQ
Sbjct: 67  FTTHEELLEEEYYDEQAPEKKRRLTAEQVQMLERSFEEENKLEPERKTELARRLGMAPRQ 126

Query: 121 VAVWFQNRRARWKTKQLERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKE 180
           VAVWFQNRRARWKTKQLE D+D LK++YD+  + +  +  +N+RL+A+V SLTEKLQ KE
Sbjct: 127 VAVWFQNRRARWKTKQLEHDFDRLKAAYDALAADHHALLSDNDRLRAQVISLTEKLQDKE 186

Query: 181 VVESSFQAKKSEPILEDQLLVSVEQHNMKIDDH-HSCRSNGSAVLD----------EDGP 240
              SS              + +  Q   + D+H  +  + G A +D          +  P
Sbjct: 187 TSPSS------------ATITTAAQEVDQPDEHTEAASTTGFATVDGALAAPPPGHQQPP 246

Query: 241 QLLDSGDSYLLSSDYD-GCVLPVFGVNSEEEDVSDDGQGYFSDVYTAADQ 263
              D   S   + D D G  + VF V     D       YF+D   A ++
Sbjct: 247 HKDDLVSSGGTNDDGDGGAAVVVFDVTEGANDRLSCESAYFADAAEAYER 284

BLAST of Cp4.1LG13g08820.1 vs. ExPASy Swiss-Prot
Match: Q9XH36 (Homeobox-leucine zipper protein HOX5 OS=Oryza sativa subsp. indica OX=39946 GN=HOX5 PE=1 SV=1)

HSP 1 Score: 181.4 bits (459), Expect = 1.4e-44
Identity = 116/284 (40.85%), Postives = 155/284 (54.58%), Query Frame = 0

Query: 1   MMSDGLIFNRSPGHANMLFFGSGDSILRGSCF------TKGMEEN----------SKRPF 60
           +   G+     PG A ML FG G S   G  F        GM+E+          +KRPF
Sbjct: 7   VFDSGVARRACPGGAQMLLFGGGGSANSGGFFRGVPAAVLGMDESRSSSSAAGAGAKRPF 66

Query: 61  FSSPDDLFDDDYYDDQPPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQ 120
           F++ ++L +++YYD+Q PEKKRRLT EQV +LE SFE ENKLEPERKTELA++LG+ PRQ
Sbjct: 67  FTTHEELLEEEYYDEQAPEKKRRLTAEQVQMLERSFEEENKLEPERKTELARRLGMAPRQ 126

Query: 121 VAVWFQNRRARWKTKQLERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKE 180
           VAVWFQNRRARWKTKQLE D+D LK++YD+  + +  +  +N+RL+A+V SLTEKLQ KE
Sbjct: 127 VAVWFQNRRARWKTKQLEHDFDRLKAAYDALAADHHALLSDNDRLRAQVISLTEKLQDKE 186

Query: 181 VVESSFQAKKSEPILEDQLLVSVEQHNMKIDDHHSCRSNGSAVLDEDGPQLLDSGDSYLL 240
              SS     +      Q +   ++H            +G+      G Q     D  + 
Sbjct: 187 TSPSSATITTAA-----QEVDQPDEHTEAASTTGFATVDGALAAPPPGHQQPPHKDDLVS 246

Query: 241 S------SDYDGCVLPVFGVNSEEEDVSDDGQGYFSDVYTAADQ 263
           S       D  G  + VF V     D       YF+D   A ++
Sbjct: 247 SGGTNDDGDGGGAAVVVFDVTEGANDRLSCESAYFADAAEAYER 285

BLAST of Cp4.1LG13g08820.1 vs. NCBI nr
Match: XP_023551382.1 (homeobox-leucine zipper protein HAT5-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 552 bits (1423), Expect = 2.91e-198
Identity = 277/277 (100.00%), Postives = 277/277 (100.00%), Query Frame = 0

Query: 1   MMSDGLIFNRSPGHANMLFFGSGDSILRGSCFTKGMEENSKRPFFSSPDDLFDDDYYDDQ 60
           MMSDGLIFNRSPGHANMLFFGSGDSILRGSCFTKGMEENSKRPFFSSPDDLFDDDYYDDQ
Sbjct: 1   MMSDGLIFNRSPGHANMLFFGSGDSILRGSCFTKGMEENSKRPFFSSPDDLFDDDYYDDQ 60

Query: 61  PPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTKQ 120
           PPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTKQ
Sbjct: 61  PPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTKQ 120

Query: 121 LERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKEVVESSFQAKKSEPILE 180
           LERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKEVVESSFQAKKSEPILE
Sbjct: 121 LERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKEVVESSFQAKKSEPILE 180

Query: 181 DQLLVSVEQHNMKIDDHHSCRSNGSAVLDEDGPQLLDSGDSYLLSSDYDGCVLPVFGVNS 240
           DQLLVSVEQHNMKIDDHHSCRSNGSAVLDEDGPQLLDSGDSYLLSSDYDGCVLPVFGVNS
Sbjct: 181 DQLLVSVEQHNMKIDDHHSCRSNGSAVLDEDGPQLLDSGDSYLLSSDYDGCVLPVFGVNS 240

Query: 241 EEEDVSDDGQGYFSDVYTAADQQTHEAEPLTWWDWSL 277
           EEEDVSDDGQGYFSDVYTAADQQTHEAEPLTWWDWSL
Sbjct: 241 EEEDVSDDGQGYFSDVYTAADQQTHEAEPLTWWDWSL 277

BLAST of Cp4.1LG13g08820.1 vs. NCBI nr
Match: XP_022938616.1 (homeobox-leucine zipper protein HAT5-like [Cucurbita moschata] >KAG6578642.1 Homeobox-leucine zipper protein HAT5, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 548 bits (1413), Expect = 9.75e-197
Identity = 275/277 (99.28%), Postives = 276/277 (99.64%), Query Frame = 0

Query: 1   MMSDGLIFNRSPGHANMLFFGSGDSILRGSCFTKGMEENSKRPFFSSPDDLFDDDYYDDQ 60
           MMSDGLIFNRSPGHANMLFFGSGDSILRGSCFTKGMEENSKRPFFSSPDDLFDDDYYDDQ
Sbjct: 1   MMSDGLIFNRSPGHANMLFFGSGDSILRGSCFTKGMEENSKRPFFSSPDDLFDDDYYDDQ 60

Query: 61  PPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTKQ 120
           PPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTKQ
Sbjct: 61  PPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTKQ 120

Query: 121 LERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKEVVESSFQAKKSEPILE 180
           LERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKEVVESSFQAKKSEPILE
Sbjct: 121 LERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKEVVESSFQAKKSEPILE 180

Query: 181 DQLLVSVEQHNMKIDDHHSCRSNGSAVLDEDGPQLLDSGDSYLLSSDYDGCVLPVFGVNS 240
           DQLLVSV QHNMKIDDHHSCRSNGSAVLDEDGPQLLDSGDSYLLS+DYDGCVLPVFGVNS
Sbjct: 181 DQLLVSVVQHNMKIDDHHSCRSNGSAVLDEDGPQLLDSGDSYLLSNDYDGCVLPVFGVNS 240

Query: 241 EEEDVSDDGQGYFSDVYTAADQQTHEAEPLTWWDWSL 277
           EEEDVSDDGQGYFSDVYTAADQQTHEAEPLTWWDWSL
Sbjct: 241 EEEDVSDDGQGYFSDVYTAADQQTHEAEPLTWWDWSL 277

BLAST of Cp4.1LG13g08820.1 vs. NCBI nr
Match: KAG7016182.1 (Homeobox-leucine zipper protein HAT5, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 546 bits (1408), Expect = 5.64e-196
Identity = 274/277 (98.92%), Postives = 276/277 (99.64%), Query Frame = 0

Query: 1   MMSDGLIFNRSPGHANMLFFGSGDSILRGSCFTKGMEENSKRPFFSSPDDLFDDDYYDDQ 60
           MMSDGLIFNRSPGHANMLFFGSGDSILRGSCFTKGMEENSKRPFFSSPDDLFDDDYYDDQ
Sbjct: 1   MMSDGLIFNRSPGHANMLFFGSGDSILRGSCFTKGMEENSKRPFFSSPDDLFDDDYYDDQ 60

Query: 61  PPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTKQ 120
           PPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTKQ
Sbjct: 61  PPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTKQ 120

Query: 121 LERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKEVVESSFQAKKSEPILE 180
           LERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKEVVESSFQAKKSEPILE
Sbjct: 121 LERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKEVVESSFQAKKSEPILE 180

Query: 181 DQLLVSVEQHNMKIDDHHSCRSNGSAVLDEDGPQLLDSGDSYLLSSDYDGCVLPVFGVNS 240
           DQLLVSV QHNMKIDDHHSCRSNGSAVLDEDGPQLLDSG+SYLLS+DYDGCVLPVFGVNS
Sbjct: 181 DQLLVSVVQHNMKIDDHHSCRSNGSAVLDEDGPQLLDSGNSYLLSNDYDGCVLPVFGVNS 240

Query: 241 EEEDVSDDGQGYFSDVYTAADQQTHEAEPLTWWDWSL 277
           EEEDVSDDGQGYFSDVYTAADQQTHEAEPLTWWDWSL
Sbjct: 241 EEEDVSDDGQGYFSDVYTAADQQTHEAEPLTWWDWSL 277

BLAST of Cp4.1LG13g08820.1 vs. NCBI nr
Match: XP_022993660.1 (homeobox-leucine zipper protein HAT5-like [Cucurbita maxima])

HSP 1 Score: 546 bits (1406), Expect = 1.14e-195
Identity = 272/277 (98.19%), Postives = 276/277 (99.64%), Query Frame = 0

Query: 1   MMSDGLIFNRSPGHANMLFFGSGDSILRGSCFTKGMEENSKRPFFSSPDDLFDDDYYDDQ 60
           M+SDGLIFNRSPGHANMLFFGSGDSILRGSCFTKGMEENSKRPFFSSPDDLFDDDYYDDQ
Sbjct: 1   MLSDGLIFNRSPGHANMLFFGSGDSILRGSCFTKGMEENSKRPFFSSPDDLFDDDYYDDQ 60

Query: 61  PPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTKQ 120
           PPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVA+WFQNRRARWKTKQ
Sbjct: 61  PPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAIWFQNRRARWKTKQ 120

Query: 121 LERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKEVVESSFQAKKSEPILE 180
           LERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKEVVESSFQAKKSEPILE
Sbjct: 121 LERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKEVVESSFQAKKSEPILE 180

Query: 181 DQLLVSVEQHNMKIDDHHSCRSNGSAVLDEDGPQLLDSGDSYLLSSDYDGCVLPVFGVNS 240
           D+LLVSV QHNMKIDDHHSCRSNGSAVLDEDGPQLLDSGDSYLLS+DYDGCVLPVFGVNS
Sbjct: 181 DELLVSVVQHNMKIDDHHSCRSNGSAVLDEDGPQLLDSGDSYLLSNDYDGCVLPVFGVNS 240

Query: 241 EEEDVSDDGQGYFSDVYTAADQQTHEAEPLTWWDWSL 277
           EEEDVSDDGQGYFSDVYTAADQQTHEAEPLTWWDWSL
Sbjct: 241 EEEDVSDDGQGYFSDVYTAADQQTHEAEPLTWWDWSL 277

BLAST of Cp4.1LG13g08820.1 vs. NCBI nr
Match: XP_038890996.1 (homeobox-leucine zipper protein HAT5-like [Benincasa hispida] >XP_038890997.1 homeobox-leucine zipper protein HAT5-like [Benincasa hispida] >XP_038890998.1 homeobox-leucine zipper protein HAT5-like [Benincasa hispida])

HSP 1 Score: 494 bits (1271), Expect = 4.45e-175
Identity = 251/277 (90.61%), Postives = 259/277 (93.50%), Query Frame = 0

Query: 1   MMSDGLIFNRSPGHANMLFFGSGDSILRGSCFTKGMEENSKR-PFFSSPDDLFDDDYYDD 60
           MMSDG IFNRSPGHANMLFFGS DSILRG  FT GMEE SKR PFFSSPDDLFDDDYYDD
Sbjct: 1   MMSDGWIFNRSPGHANMLFFGSSDSILRGPSFTMGMEETSKRQPFFSSPDDLFDDDYYDD 60

Query: 61  QPPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTK 120
           QPPEKKRRLTQEQVHLLEISFE+ENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTK
Sbjct: 61  QPPEKKRRLTQEQVHLLEISFESENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTK 120

Query: 121 QLERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKEVVESSFQAKKSEPIL 180
           QLERDYDLLKSSYDSF SSYDF+AKENERLKAEVASLTEKLQAKEVVESSF AK  EP L
Sbjct: 121 QLERDYDLLKSSYDSFRSSYDFIAKENERLKAEVASLTEKLQAKEVVESSFHAKNPEPFL 180

Query: 181 EDQLLVSVEQHNMKIDDHHSCRSNGSAVLDEDGPQLLDSGDSYLLSSDYDGCVLPVFGVN 240
           EDQLLV V Q ++KI+DHHSCRSNGSAVLDEDGPQLLDSGDSYLLS+DYDGCVLPVFGVN
Sbjct: 181 EDQLLVPVVQQSIKIEDHHSCRSNGSAVLDEDGPQLLDSGDSYLLSNDYDGCVLPVFGVN 240

Query: 241 SEEEDVSDDGQGYFSDVYTAADQQTHEAEPLTWWDWS 276
           SEEED SDDGQGYFSDVYT ADQQ+HE EPLTWWDW+
Sbjct: 241 SEEEDGSDDGQGYFSDVYTTADQQSHEGEPLTWWDWT 277

BLAST of Cp4.1LG13g08820.1 vs. ExPASy TrEMBL
Match: A0A6J1FDM9 (homeobox-leucine zipper protein HAT5-like OS=Cucurbita moschata OX=3662 GN=LOC111444797 PE=4 SV=1)

HSP 1 Score: 548 bits (1413), Expect = 4.72e-197
Identity = 275/277 (99.28%), Postives = 276/277 (99.64%), Query Frame = 0

Query: 1   MMSDGLIFNRSPGHANMLFFGSGDSILRGSCFTKGMEENSKRPFFSSPDDLFDDDYYDDQ 60
           MMSDGLIFNRSPGHANMLFFGSGDSILRGSCFTKGMEENSKRPFFSSPDDLFDDDYYDDQ
Sbjct: 1   MMSDGLIFNRSPGHANMLFFGSGDSILRGSCFTKGMEENSKRPFFSSPDDLFDDDYYDDQ 60

Query: 61  PPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTKQ 120
           PPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTKQ
Sbjct: 61  PPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTKQ 120

Query: 121 LERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKEVVESSFQAKKSEPILE 180
           LERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKEVVESSFQAKKSEPILE
Sbjct: 121 LERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKEVVESSFQAKKSEPILE 180

Query: 181 DQLLVSVEQHNMKIDDHHSCRSNGSAVLDEDGPQLLDSGDSYLLSSDYDGCVLPVFGVNS 240
           DQLLVSV QHNMKIDDHHSCRSNGSAVLDEDGPQLLDSGDSYLLS+DYDGCVLPVFGVNS
Sbjct: 181 DQLLVSVVQHNMKIDDHHSCRSNGSAVLDEDGPQLLDSGDSYLLSNDYDGCVLPVFGVNS 240

Query: 241 EEEDVSDDGQGYFSDVYTAADQQTHEAEPLTWWDWSL 277
           EEEDVSDDGQGYFSDVYTAADQQTHEAEPLTWWDWSL
Sbjct: 241 EEEDVSDDGQGYFSDVYTAADQQTHEAEPLTWWDWSL 277

BLAST of Cp4.1LG13g08820.1 vs. ExPASy TrEMBL
Match: A0A6J1JZ53 (homeobox-leucine zipper protein HAT5-like OS=Cucurbita maxima OX=3661 GN=LOC111489584 PE=4 SV=1)

HSP 1 Score: 546 bits (1406), Expect = 5.51e-196
Identity = 272/277 (98.19%), Postives = 276/277 (99.64%), Query Frame = 0

Query: 1   MMSDGLIFNRSPGHANMLFFGSGDSILRGSCFTKGMEENSKRPFFSSPDDLFDDDYYDDQ 60
           M+SDGLIFNRSPGHANMLFFGSGDSILRGSCFTKGMEENSKRPFFSSPDDLFDDDYYDDQ
Sbjct: 1   MLSDGLIFNRSPGHANMLFFGSGDSILRGSCFTKGMEENSKRPFFSSPDDLFDDDYYDDQ 60

Query: 61  PPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTKQ 120
           PPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVA+WFQNRRARWKTKQ
Sbjct: 61  PPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAIWFQNRRARWKTKQ 120

Query: 121 LERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKEVVESSFQAKKSEPILE 180
           LERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKEVVESSFQAKKSEPILE
Sbjct: 121 LERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKEVVESSFQAKKSEPILE 180

Query: 181 DQLLVSVEQHNMKIDDHHSCRSNGSAVLDEDGPQLLDSGDSYLLSSDYDGCVLPVFGVNS 240
           D+LLVSV QHNMKIDDHHSCRSNGSAVLDEDGPQLLDSGDSYLLS+DYDGCVLPVFGVNS
Sbjct: 181 DELLVSVVQHNMKIDDHHSCRSNGSAVLDEDGPQLLDSGDSYLLSNDYDGCVLPVFGVNS 240

Query: 241 EEEDVSDDGQGYFSDVYTAADQQTHEAEPLTWWDWSL 277
           EEEDVSDDGQGYFSDVYTAADQQTHEAEPLTWWDWSL
Sbjct: 241 EEEDVSDDGQGYFSDVYTAADQQTHEAEPLTWWDWSL 277

BLAST of Cp4.1LG13g08820.1 vs. ExPASy TrEMBL
Match: A0A5A7T5T2 (Homeobox-leucine zipper protein HAT5-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold808G00880 PE=4 SV=1)

HSP 1 Score: 484 bits (1247), Expect = 9.79e-172
Identity = 246/277 (88.81%), Postives = 254/277 (91.70%), Query Frame = 0

Query: 1   MMSDGLIFNRSPGHANMLFFGSGDSILRGSCFTKGMEENSKR-PFFSSPDDLFDDDYYDD 60
           MMSDGLIFNRSPGHANMLFFGS D I RG  F  GMEE SKR  FFSSPDDLFDDDYYDD
Sbjct: 1   MMSDGLIFNRSPGHANMLFFGSSDPIPRGPSFMMGMEETSKRRSFFSSPDDLFDDDYYDD 60

Query: 61  QPPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTK 120
           QPPEKKRRLTQEQVHLLEISFE+ENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTK
Sbjct: 61  QPPEKKRRLTQEQVHLLEISFESENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTK 120

Query: 121 QLERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKEVVESSFQAKKSEPIL 180
           QLERDYDLLKSSYDSF SSYDF+AKENERLKAEVASLTEKLQAKEVVESSF AK  +P L
Sbjct: 121 QLERDYDLLKSSYDSFRSSYDFIAKENERLKAEVASLTEKLQAKEVVESSFHAKNPDPFL 180

Query: 181 EDQLLVSVEQHNMKIDDHHSCRSNGSAVLDEDGPQLLDSGDSYLLSSDYDGCVLPVFGVN 240
           EDQLLV V Q N+KI+DHHSCRSNGSAVLDEDGPQLLDSGDSY+LS+DYDGCVLP FGVN
Sbjct: 181 EDQLLVPVVQQNIKIEDHHSCRSNGSAVLDEDGPQLLDSGDSYILSNDYDGCVLPAFGVN 240

Query: 241 SEEEDVSDDGQGYFSDVYTAADQQTHEAEPLTWWDWS 276
           SEEED SDDGQGYFSDVYT  DQQTHE EPLTWWDW+
Sbjct: 241 SEEEDGSDDGQGYFSDVYTTVDQQTHEGEPLTWWDWT 277

BLAST of Cp4.1LG13g08820.1 vs. ExPASy TrEMBL
Match: A0A1S3C9T9 (homeobox-leucine zipper protein HAT5-like OS=Cucumis melo OX=3656 GN=LOC103498065 PE=4 SV=1)

HSP 1 Score: 484 bits (1247), Expect = 9.79e-172
Identity = 246/277 (88.81%), Postives = 254/277 (91.70%), Query Frame = 0

Query: 1   MMSDGLIFNRSPGHANMLFFGSGDSILRGSCFTKGMEENSKR-PFFSSPDDLFDDDYYDD 60
           MMSDGLIFNRSPGHANMLFFGS D I RG  F  GMEE SKR  FFSSPDDLFDDDYYDD
Sbjct: 1   MMSDGLIFNRSPGHANMLFFGSSDPIPRGPSFMMGMEETSKRRSFFSSPDDLFDDDYYDD 60

Query: 61  QPPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTK 120
           QPPEKKRRLTQEQVHLLEISFE+ENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTK
Sbjct: 61  QPPEKKRRLTQEQVHLLEISFESENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTK 120

Query: 121 QLERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKEVVESSFQAKKSEPIL 180
           QLERDYDLLKSSYDSF SSYDF+AKENERLKAEVASLTEKLQAKEVVESSF AK  +P L
Sbjct: 121 QLERDYDLLKSSYDSFRSSYDFIAKENERLKAEVASLTEKLQAKEVVESSFHAKNPDPFL 180

Query: 181 EDQLLVSVEQHNMKIDDHHSCRSNGSAVLDEDGPQLLDSGDSYLLSSDYDGCVLPVFGVN 240
           EDQLLV V Q N+KI+DHHSCRSNGSAVLDEDGPQLLDSGDSY+LS+DYDGCVLP FGVN
Sbjct: 181 EDQLLVPVVQQNIKIEDHHSCRSNGSAVLDEDGPQLLDSGDSYILSNDYDGCVLPAFGVN 240

Query: 241 SEEEDVSDDGQGYFSDVYTAADQQTHEAEPLTWWDWS 276
           SEEED SDDGQGYFSDVYT  DQQTHE EPLTWWDW+
Sbjct: 241 SEEEDGSDDGQGYFSDVYTTVDQQTHEGEPLTWWDWT 277

BLAST of Cp4.1LG13g08820.1 vs. ExPASy TrEMBL
Match: A0A6J1H729 (homeobox-leucine zipper protein HAT5-like OS=Cucurbita moschata OX=3662 GN=LOC111460778 PE=4 SV=1)

HSP 1 Score: 478 bits (1231), Expect = 2.59e-169
Identity = 241/276 (87.32%), Postives = 256/276 (92.75%), Query Frame = 0

Query: 2   MSDGLIFNRSPGHANMLFFGSGDSILRGSCFTKGMEENSKR-PFFSSPDDLFDDDYYDDQ 61
           M+DGLIFNRSPGH+NMLFFGSGDS+LRG  F  GMEE SKR PFFSSPDDLFDDDYYDDQ
Sbjct: 1   MTDGLIFNRSPGHSNMLFFGSGDSVLRGPRFAMGMEETSKRRPFFSSPDDLFDDDYYDDQ 60

Query: 62  PPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTKQ 121
           PPEKKRRLTQEQVHLLEISFE+ENKLEPERKTELA KLGLQPRQVAVWFQNRRARWKTKQ
Sbjct: 61  PPEKKRRLTQEQVHLLEISFESENKLEPERKTELANKLGLQPRQVAVWFQNRRARWKTKQ 120

Query: 122 LERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKEVVESSFQAKKSEPILE 181
           LERDYDLLKSSYDSF SSYDF+ KENERLKAEVASL+EKL AKEVVESSFQAKKSEP LE
Sbjct: 121 LERDYDLLKSSYDSFRSSYDFVVKENERLKAEVASLSEKLLAKEVVESSFQAKKSEPFLE 180

Query: 182 DQLLVSVEQHNMKIDDHHSCRSNGSAVLDEDGPQLLDSGDSYLLSSDYDGCVLPVFGVNS 241
            Q+LV V QH+MKI+DHHSCRSNGSAVLDEDGP LLDSGDSYLLS+DY+GCVLPVFG+NS
Sbjct: 181 HQILVPVVQHSMKIEDHHSCRSNGSAVLDEDGPHLLDSGDSYLLSNDYNGCVLPVFGMNS 240

Query: 242 EEEDVSDDGQGYFSDVYTAADQQTHEAEPLTWWDWS 276
           EEED SDDGQGYFS +YT  DQQT+E EPLTWWDW+
Sbjct: 241 EEEDGSDDGQGYFSAIYTTDDQQTNEVEPLTWWDWT 276

BLAST of Cp4.1LG13g08820.1 vs. TAIR 10
Match: AT3G01470.1 (homeobox 1 )

HSP 1 Score: 255.8 bits (652), Expect = 4.1e-68
Identity = 151/281 (53.74%), Postives = 190/281 (67.62%), Query Frame = 0

Query: 1   MMSDGLIFNRSPGHANMLFF-GSGDSILR--GSCFTKGMEENSK-RPFFSSPDDLFDDDY 60
           M S+   F+ S  H N +FF G+ + +++  G+     MEE SK RPFFSSP+DL+DDD+
Sbjct: 1   MESNSFFFDPSASHGNSMFFLGNLNPVVQGGGARSMMNMEETSKRRPFFSSPEDLYDDDF 60

Query: 61  YDDQPPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAVWFQNRRARW 120
           YDDQ PEKKRRLT EQVHLLE SFE ENKLEPERKT+LAKKLGLQPRQVAVWFQNRRARW
Sbjct: 61  YDDQLPEKKRRLTTEQVHLLEKSFETENKLEPERKTQLAKKLGLQPRQVAVWFQNRRARW 120

Query: 121 KTKQLERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKEVVESSFQAKKSE 180
           KTKQLERDYDLLKS+YD   S+YD +  +N++L++EV SLTEKLQ K+   +    +  E
Sbjct: 121 KTKQLERDYDLLKSTYDQLLSNYDSIVMDNDKLRSEVTSLTEKLQGKQETANEPPGQVPE 180

Query: 181 PILEDQLLVSVEQHNMKIDDHHSCRSNGSAVLDEDGPQLLDSGDSYLLSSDYDGCVLPVF 240
           P   D + ++     +K +D  S  S GSAVLD+D PQLLDS DSY  S      ++P+ 
Sbjct: 181 PNQLDPVYINAAA--IKTEDRLSSGSVGSAVLDDDAPQLLDSCDSYFPS------IVPIQ 240

Query: 241 GVNSEEEDVSDDGQGYFSDVY--TAADQQTHEAEPLTWWDW 276
             NS   D  D+ +  F+DV+  T +    H  E L +W W
Sbjct: 241 D-NSNASD-HDNDRSCFADVFVPTTSPSHDHHGESLAFWGW 271

BLAST of Cp4.1LG13g08820.1 vs. TAIR 10
Match: AT4G40060.1 (homeobox protein 16 )

HSP 1 Score: 125.2 bits (313), Expect = 8.4e-29
Identity = 62/102 (60.78%), Postives = 80/102 (78.43%), Query Frame = 0

Query: 63  EKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTKQLE 122
           EKKRRL  +QV  LE +FE ENKLEPERKT+LA++LGLQPRQVAVWFQNRRARWKTKQLE
Sbjct: 58  EKKRRLKVDQVKALEKNFELENKLEPERKTKLAQELGLQPRQVAVWFQNRRARWKTKQLE 117

Query: 123 RDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKE 165
           +DY +LK  YDS   ++D + ++N+ L  E++ +  K+  +E
Sbjct: 118 KDYGVLKGQYDSLRHNFDSLRRDNDSLLQEISKIKAKVNGEE 159

BLAST of Cp4.1LG13g08820.1 vs. TAIR 10
Match: AT1G69780.1 (Homeobox-leucine zipper protein family )

HSP 1 Score: 124.4 bits (311), Expect = 1.4e-28
Identity = 64/115 (55.65%), Postives = 83/115 (72.17%), Query Frame = 0

Query: 53  DDDYYDD--QPPEKKRRLTQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAVWFQ 112
           ++DY DD  Q  EKKRRL  EQV  LE +FE  NKLEPERK +LA+ LGLQPRQ+A+WFQ
Sbjct: 72  EEDYSDDGSQMGEKKRRLNMEQVKTLEKNFELGNKLEPERKMQLARALGLQPRQIAIWFQ 131

Query: 113 NRRARWKTKQLERDYDLLKSSYDSFCSSYDFMAKENERLKAEVASLTEKLQAKEV 166
           NRRARWKTKQLE+DYD LK  +D+  +  D +   N++L+AE+  L  + Q + +
Sbjct: 132 NRRARWKTKQLEKDYDTLKRQFDTLKAENDLLQTHNQKLQAEIMGLKNREQTESI 186

BLAST of Cp4.1LG13g08820.1 vs. TAIR 10
Match: AT2G22430.1 (homeobox protein 6 )

HSP 1 Score: 121.3 bits (303), Expect = 1.2e-27
Identity = 79/187 (42.25%), Postives = 105/187 (56.15%), Query Frame = 0

Query: 22  SGDSI--LRGSCFTKGMEENSKRPFFSSPDDLFDDDYYDDQP-----------PEKKRRL 81
           S DS+  L   C T   +E S R +         + Y +++             EKKRRL
Sbjct: 7   SSDSVGGLISLCPTTSTDEQSPRRYGGREFQSMLEGYEEEEEAIVEERGHVGLSEKKRRL 66

Query: 82  TQEQVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTKQLERDYDLL 141
           +  QV  LE +FE ENKLEPERK +LA++LGLQPRQVAVWFQNRRARWKTKQLE+DY +L
Sbjct: 67  SINQVKALEKNFELENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLEKDYGVL 126

Query: 142 KSSYDSFCSSYDFMAKENERLKAEVASLTEKL------------QAKEVVESSFQAKKSE 184
           K+ YDS   ++D + ++NE L  E++ L  KL             A    ES    K+ E
Sbjct: 127 KTQYDSLRHNFDSLRRDNESLLQEISKLKTKLNGGGGEEEEEENNAAVTTESDISVKEEE 186

BLAST of Cp4.1LG13g08820.1 vs. TAIR 10
Match: AT5G65310.1 (homeobox protein 5 )

HSP 1 Score: 118.6 bits (296), Expect = 7.8e-27
Identity = 71/154 (46.10%), Postives = 96/154 (62.34%), Query Frame = 0

Query: 12  PGHANMLFFGSGDSILRGSCFTKGMEENSKRPFFSSPDDLFDDDYYDDQPPEKKRRLTQE 71
           P     L+ G+GD     S     +E++       S +DL    +      EKKRRL  E
Sbjct: 30  PTTTGFLYSGAGDY----SQMFDALEDD------GSLEDLGGVGHASSTAAEKKRRLGVE 89

Query: 72  QVHLLEISFEAENKLEPERKTELAKKLGLQPRQVAVWFQNRRARWKTKQLERDYDLLKSS 131
           QV  LE +FE +NKLEPERK +LA++LGLQPRQVA+WFQNRRARWKTKQLERDY +LKS+
Sbjct: 90  QVKALEKNFEIDNKLEPERKVKLAQELGLQPRQVAIWFQNRRARWKTKQLERDYGVLKSN 149

Query: 132 YDSFCSSYDFMAKENERLKAEVASLTEKLQAKEV 166
           +D+   + D + ++N+ L  ++  L  KL  + V
Sbjct: 150 FDALKRNRDSLQRDNDSLLGQIKELKAKLNVEGV 173

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q022835.8e-6753.74Homeobox-leucine zipper protein HAT5 OS=Arabidopsis thaliana OX=3702 GN=HAT5 PE=... [more]
Q6YWR41.2e-5342.27Homeobox-leucine zipper protein HOX16 OS=Oryza sativa subsp. japonica OX=39947 G... [more]
A2X9802.1e-5342.03Homeobox-leucine zipper protein HOX16 OS=Oryza sativa subsp. indica OX=39946 GN=... [more]
Q6ZA748.1e-4541.03Homeobox-leucine zipper protein HOX5 OS=Oryza sativa subsp. japonica OX=39947 GN... [more]
Q9XH361.4e-4440.85Homeobox-leucine zipper protein HOX5 OS=Oryza sativa subsp. indica OX=39946 GN=H... [more]
Match NameE-valueIdentityDescription
XP_023551382.12.91e-198100.00homeobox-leucine zipper protein HAT5-like [Cucurbita pepo subsp. pepo][more]
XP_022938616.19.75e-19799.28homeobox-leucine zipper protein HAT5-like [Cucurbita moschata] >KAG6578642.1 Hom... [more]
KAG7016182.15.64e-19698.92Homeobox-leucine zipper protein HAT5, partial [Cucurbita argyrosperma subsp. arg... [more]
XP_022993660.11.14e-19598.19homeobox-leucine zipper protein HAT5-like [Cucurbita maxima][more]
XP_038890996.14.45e-17590.61homeobox-leucine zipper protein HAT5-like [Benincasa hispida] >XP_038890997.1 ho... [more]
Match NameE-valueIdentityDescription
A0A6J1FDM94.72e-19799.28homeobox-leucine zipper protein HAT5-like OS=Cucurbita moschata OX=3662 GN=LOC11... [more]
A0A6J1JZ535.51e-19698.19homeobox-leucine zipper protein HAT5-like OS=Cucurbita maxima OX=3661 GN=LOC1114... [more]
A0A5A7T5T29.79e-17288.81Homeobox-leucine zipper protein HAT5-like OS=Cucumis melo var. makuwa OX=1194695... [more]
A0A1S3C9T99.79e-17288.81homeobox-leucine zipper protein HAT5-like OS=Cucumis melo OX=3656 GN=LOC10349806... [more]
A0A6J1H7292.59e-16987.32homeobox-leucine zipper protein HAT5-like OS=Cucurbita moschata OX=3662 GN=LOC11... [more]
Match NameE-valueIdentityDescription
AT3G01470.14.1e-6853.74homeobox 1 [more]
AT4G40060.18.4e-2960.78homeobox protein 16 [more]
AT1G69780.11.4e-2855.65Homeobox-leucine zipper protein family [more]
AT2G22430.11.2e-2742.25homeobox protein 6 [more]
AT5G65310.17.8e-2746.10homeobox protein 5 [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 139..166
NoneNo IPR availableGENE3D1.10.10.60coord: 57..125
e-value: 2.5E-20
score: 73.7
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 42..62
NoneNo IPR availablePANTHERPTHR24326:SF497HOMEOBOX-LEUCINE ZIPPER PROTEIN HAT5coord: 1..260
NoneNo IPR availablePANTHERPTHR24326HOMEOBOX-LEUCINE ZIPPER PROTEINcoord: 1..260
IPR000047Helix-turn-helix motifPRINTSPR00031HTHREPRESSRcoord: 90..99
score: 53.5
coord: 99..115
score: 59.22
IPR001356Homeobox domainSMARTSM00389HOX_1coord: 62..123
e-value: 3.6E-19
score: 79.7
IPR001356Homeobox domainPFAMPF00046Homeodomaincoord: 64..117
e-value: 4.5E-17
score: 61.6
IPR001356Homeobox domainPROSITEPS50071HOMEOBOX_2coord: 59..119
score: 17.750473
IPR001356Homeobox domainCDDcd00086homeodomaincoord: 64..120
e-value: 2.27764E-18
score: 74.9724
IPR003106Leucine zipper, homeobox-associatedPFAMPF02183HALZcoord: 119..161
e-value: 4.3E-16
score: 58.8
IPR017970Homeobox, conserved sitePROSITEPS00027HOMEOBOX_1coord: 94..117
IPR009057Homeobox-like domain superfamilySUPERFAMILY46689Homeodomain-likecoord: 47..121

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cp4.1LG13g08820Cp4.1LG13g08820gene


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG13g08820.1:exon:001Cp4.1LG13g08820.1:exon:001exon
Cp4.1LG13g08820.1:exon:002Cp4.1LG13g08820.1:exon:002exon
Cp4.1LG13g08820.1:exon:003Cp4.1LG13g08820.1:exon:003exon
Cp4.1LG13g08820.1:exon:004Cp4.1LG13g08820.1:exon:004exon
Cp4.1LG13g08820.1:exon:005Cp4.1LG13g08820.1:exon:005exon


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG13g08820.1:five_prime_utr:002Cp4.1LG13g08820.1:five_prime_utr:002five_prime_UTR
Cp4.1LG13g08820.1:five_prime_utr:001Cp4.1LG13g08820.1:five_prime_utr:001five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG13g08820.1:cds:004Cp4.1LG13g08820.1:cds:004CDS
Cp4.1LG13g08820.1:cds:003Cp4.1LG13g08820.1:cds:003CDS
Cp4.1LG13g08820.1:cds:002Cp4.1LG13g08820.1:cds:002CDS
Cp4.1LG13g08820.1:cds:001Cp4.1LG13g08820.1:cds:001CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG13g08820.1:three_prime_utr:001Cp4.1LG13g08820.1:three_prime_utr:001three_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cp4.1LG13g08820.1Cp4.1LG13g08820.1-proteinpolypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
biological_process GO:0045893 positive regulation of transcription, DNA-templated
biological_process GO:0006357 regulation of transcription by RNA polymerase II
cellular_component GO:0005634 nucleus
molecular_function GO:0003677 DNA binding
molecular_function GO:0000981 DNA-binding transcription factor activity, RNA polymerase II-specific
molecular_function GO:0043565 sequence-specific DNA binding