Cp4.1LG19g05650 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG19g05650
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Homeobox leucine-zipper protein
Location: Cp4.1LG19 : 7886271 .. 7891338 (-)
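
The location field can be split into its parts and used to compute the genomic span. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG19 : 7886271 .. 7891338 (-)")
print(seqid, strand, span_length(start, end))  # Cp4.1LG19 - 5068
```

The `(-)` strand marker indicates that the feature is annotated on the reverse strand of the linkage group.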
The following sequences are available for this feature:

Gene sequence (with intron)

TTGCTTTTCTTTCTGTGGGTTTTGGTTTTGGATTTTCTTTTTTCCTCGAAATTGATGTTTTTGTGTGATAAAGAGCAGAGGACTCTGTTCATGCTCTTCTTCAACCTGTGTGCTTTGAGGAAGAAGCAGAGCACTAACTGATGAAGGAGAGAGGAGGCTCTCTTGGAGGAGAGAGCCTGCCCGGTAAAAACACTAACAACACCAACACCAACACAGCTCAAGAAGAAGACCAAGACGACCCTCAAATCCCATTTTGCTGCTAACTTCCCTTTTTTCTCAGTCAAAAAATTTCAGTCCTTGAGGCTTTCTTTTTGGTGGGTTTTGAATCTTCCTTTGTTCCAGTTCGTTTCAAAAGGTTCATGGCGCTTGTAATGCACAAAGACTCATCAAACCAACAGATGGATACAAGCAAATACGTACGGTATACACCTGAACAAGTTGAGGCATTAGAGAGAGTCTATGCAGAATGTCCAAAGCCCAGTTCTCTTAGAAGGCAGCAGCTCATTAGAGAGTGCCCAATCCTTTCTAATATTGAGCCCAAACAGATCAAAGTTTGGTTTCAAAACCGCAGGTATTATTCTATTTGTTGCTTCTATTCTCTTTTCTTTTCAGAACGATAATCTCTCTTTCAGAAAACATGTACTAAGTGTTTTTGTTTGTTTGTTTTTGCTATCTGCAATACTGTCATTAGTTTTGGTTTTGGGGTATGAGCTTGTGGTACATCTGGGCAAGGAGTTGAATAGCTATGGGAGATGGGTTTGTAAGGAAGTGATGCTTTTGATTGAAAATAACCACTTTTGTCCATAGAACTTCTGTGCTTTGGGGTTAAAGTTCATTCAAGTGCTTGACTACTGTCTTTAATTGTTTGGATTTGAACTAAAACTATATAACCTTCTGGTTAGTTTTCTAATATTGTCTCCAAGCCGCAAGTTTTCCATTATTCACAGTTCTTACTGCACATCATCTCCAATTAAACTTCTTACTGCACCAGTGCCAAAACAATTTCACAGTTCCTTTTTTCCATTATTCTCTGTTAGATGCCGGGAGAAGCAGAGGAAGGAATCTTCTCGCCTCCAGAGCGTAAACCGAAAGTTGTCTGCGATGAACAAGTTATTGATGGAGGAGAATGACCGTTTACAGAAGCAGGTGTCCCATTTAGTGTATGAGAATGGATTTATGCGCCAGCAACTGCATAGTGTAATCCCTTTTAGAACTATGACCTTCATCTTTGTCTTTAACTAATGTGCTATTATGATTGCTATTTCTTAATCTTAGCTCTTCTTTTCTGATGAACAGACATCTGGGACGACTACAGACAATAGCTGTGAGTCTGTAGTCATGAGCGGTCAGCCCCAACAACAGCAAAACCCAAATCCTCAGCATTCTAATAGGGATGCAAACAACCCAGCTGGGTCAGTATGTTTATCATCTTGCTGCATATCGCACTACTTTTTTCTTCAGCATTCTAATATTTAAAGTGCTTTTATGAATTGGGTGCCGTAGTCTTCTTGCAATAGCCGAGGAGACCCTGGCAGAGTTCCTTTCCAAGGCTACAGGAACTGCTGTCGACTGGGTCCAGATGATTGGGATGAAGGTAAGTTCTCTTTGAATGATTGGGGTCAAGGTTTACTTAACGTTCTTAATGACTAAATGTCTAATTTGCACAGCCTGGTCCGGATTCTATTGGAATCGTTGCTGTTTCCCGCAACTGCAGTGGGGTAGCAGCGCGAGCCTGTGGTCTTGTTAGTCTAGAGCCAATGAAGGTACTTGCTGTTTGAAGATGATGGTTTGATCGTGGCAACTTTCTTGTGTGAGCATAATGAACCTGAAATTTGATTGATGCAGGTTGCCGAAATTCTCAAAGATCGCCTTTCATGGTATCGTGACTGTCGCTGTCTTAATGTTTTAAGTGTAATCCCTACTGGAAATGGGGGAACAATAGAGCTCATATATATGCAGGTTACCTAATACTTTTCTTTCTCTCACTTTCATGGTTAATT
TTTCATTCTACCGTTTCTAATTCCATCGTTTTTCCTCGTTAGACTTACGCTCCAACGACATTAGCAGCTGCACGTGACTTCTGGACCATGAGATATACAACAAGTTTAGAAGATGGCAGTCTAGTGGTAGATTTATTCATCATAACTTTGAAACACTTGTTTTTCATTTGGATATGGTGAGCCCTGATTGCAACTGTTGTTTTGGATTAATTTGCAGGTCTGTGAAAGGTCATTAACTACTTCAAGTGGTGGTCCAGCAGGGCCTCCGCCATCTACTTTTGTGAGAGCTGAAATGCTTCCTAGTGGTTATCTAATACGAGCATGCGAGGGTGGTGGATCTATTATTCACATTGTTGACCACATTGATTTGGATGTGAGCTTTCAACTAAAGTATTCAATCAGCATGTTTTTTTTTATTGTTTATACACTTTTTTTTTCATCTTCCTTCTGTTCATCTGTAGGTTTGGAGTGTCCCTGAAGTTCTTAGACCGCTTTATGAGTCATCCAAGATCCTTGCTCAGAAAATGACGATCGCTGTAAGAATTGTCCCATGATGTTTCATTCTCTTGACAAGCAAATCATATCTGTTCTTACTGTGTGGGACAAATTCGTCTTACAGGCATTGCGTCACATTAGACAAATTGCACAAGAGACTAGTGGAGAGATCCAATATAGTGGGGGACGTCAACCAGCTGTATTACGAACGTTCGGTCAGAAACTCTGCAGGTTGTGTTTGCATGAACATAACTGGTTTATTTACTATGGAGAAAATAGTCCTTGGTGATTTAGTTCTAATCCTAAAATGGTCTTTAGGGGTTTCAACGACGCTGTTAATGGGTTTGCGGATGATGGTTGGTCAGCTATGAGCAGCGATGGTTTGGAGGATGTGACAATTGTCATAAACTCATCAGCAAACAAGTTACCTGGGTCACAGTATAAAACGTCTATGTATCCCTCTTTTGGTGGAGTGATGTGTGCAAAAGCATCAATGTTGCTTCAGGTGTGTATTTGGTTGACTTTATGCGTCTACTCTTCTACTCGGAAGTACTTGGTTTCAATCTTAATTCGTCTACTCGTCTACAGAATGTTCCCCCTGCTTTGCTTATTCGTTTCTTGAGGGAGCATCGATCTGAATGGGCTGATTATGGAGTTGATGCCTACTCTGCCGCGAGTTTAAAAGCCAGTCCATATGCTGTTCCATGCGCAAGGCCAGGTGGTTTCCCTAGCAGCCAGGTTATTTTGCCTCTTGCTCACACTGTCGAGCACGAAGAGGTAATGATTCCTACCTATCAGAGAAATTCCTGCAGGTTTTCCTTTTACTATAAATATAGCGTCGTCTTTGTACAGTTTCTGGAGGTGGTTCGGCTGGAGGGCCTTGCATTCTCCCCTGAAGATGTTGCTTTGGCTGGACGAGATATGTACTTATTGCAGGTTAGTTTATTACAATGTGTAAGTTGGATCTTATCGAATGCGATGTTTTTCTCTCGTGATGTGTTTTCTTCTTCATGGCAGCTCTGTAATGGAGTTGATGAGAATGCAGTTGGAGCTTGTGCTCAGCTTGTGTTTGCGCCTATAGATGAATCTTTTGCCGATGATGCTCCATTACTTCCTTCTGGTTTTCGTGTCATACCCCTGGATCCAAAAACGGTCTGTTTCTGTATTTATTGGACACATCCCCTTTTAAAATCATAAGTTAATTTGCTTTCCTTGGCTATGTTTCCCCCATATTGCAGGATGAGCCCACTGCTAATCGAACATTGGATTTGGCTTCTACTCTTGAAGTCGGTGCCAACGGTGCACGCTCTGGTGGTGAAGCTGATTTGAGCACCTACAACCTCCGGTCGGTCTTGACTATTGCCTTCCAGTTCACTTATGAGAATCACTTACAGGAAAATGTGGCTGCTATGGCTCGACAATATGTTCGTAGCGTTGTGGGGTCTGTTCAGAGGGTTGCTATGGCCATTGCCCCATCACAGTTGAGTTCCAATATTGGAATAAAA
CCCCTTCCTGGTTCTCCTGAAGCTCTTACTTTGGCTCGATGGATTTGCCGAAGCTACAGGCATGTATACATTATCCACTCGAGTCTTCGACTTTTATGTTCCTCATAAAAACCAGGCTTATTATTCTTCATATGTATGGTTTATAGTATCAAATCTGATCTATGTATGAAGAACTCCTCCCCTGCCCCAAATAGGTCAACTTACAACAGTTTTGTGTTTCAGGATCCATGTTGGAGCTGAGCTCCTTCAAGCCGATTCCCAGTCTGGGGATGCCATGTTGAAGCAGCTCTGGCACCACTCGGATGCAATCATGTGTTGCTCCGTCAAAGCCAACGTAATCAATGTTTCCACCCTAAAAACCCAACTACATAACAGCCACGGAACTGCCCTGGAAAGCTAACATGTTTGGGAAATTTTCATTGCAGGCATCTGCTGTGTTCACCTTTGCCAACCAGGCCGGTCTGGACATGCTTGAAACCACTCTTGTTGGCCTGCAAGATATAATGCTCGACAAAATTCTCGACGAAGCGGGTCGGAAGATCCTCTGTTCTGAATTCCCCAAAATAATGCAGCAGGTAAGCATGGTTTGATTTTAATCATGGTTTCTTCTATCCTACAGATTAGGAATATCTTACATACATTTTTTTTGCCATTGCAGGGATTCACAAATCTGCCATCTGGCATTTGCGTATCGAGTATGGGTCGACCTATTTCTTACGAGCAAGGCGTTGCCTGGAAGGTTCTAAATGATGATGATTCCAATCACTGTCTGGCTTTCATGTTCATAAACTGGTCTTTTGTGTGATGGCGAAGGACATGACTCTAAGTGGGATTGTAATTTCAACTTAGACATATAATGCCTTTGGTTGACTGCAGGACTTTTTGCTTTTTCTTCCTCTTTTTGTTGGAAAAACAAATGTTTAGGTACTTGGATGGACCCTTTTGTTTCAAACTTAGACACCCATGTCTTAAAGTCTTTCATTTCATGAACTCTATTATATGCTTCTAATATGATAGCTACGTCTTCTGCTGAAGGATCATTTATCTTTATAGCTACATCTGCTTCAT

mRNA sequence

TTGCTTTTCTTTCTGTGGGTTTTGGTTTTGGATTTTCTTTTTTCCTCGAAATTGATGTTTTTGTGTGATAAAGAGCAGAGGACTCTGTTCATGCTCTTCTTCAACCTGTGTGCTTTGAGGAAGAAGCAGAGCACTAACTGATGAAGGAGAGAGGAGGCTCTCTTGGAGGAGAGAGCCTGCCCGGTAAAAACACTAACAACACCAACACCAACACAGCTCAAGAAGAAGACCAAGACGACCCTCAAATCCCATTTTGCTGCTAACTTCCCTTTTTTCTCAGGCTTTCTTTTTGGTGGGTTTTGAATCTTCCTTTGTTCCAGTTCGTTTCAAAAGGTTCATGGCGCTTGTAATGCACAAAGACTCATCAAACCAACAGATGGATACAAGCAAATACGTACGGTATACACCTGAACAAGTTGAGGCATTAGAGAGAGTCTATGCAGAATGTCCAAAGCCCAGTTCTCTTAGAAGGCAGCAGCTCATTAGAGAGTGCCCAATCCTTTCTAATATTGAGCCCAAACAGATCAAAGTTTGGTTTCAAAACCGCAGATGCCGGGAGAAGCAGAGGAAGGAATCTTCTCGCCTCCAGAGCGTAAACCGAAAGTTGTCTGCGATGAACAAGTTATTGATGGAGGAGAATGACCGTTTACAGAAGCAGGTGTCCCATTTAGTGTATGAGAATGGATTTATGCGCCAGCAACTGCATAGTCTCTTCTTTTCTGATGAACAGACATCTGGGACGACTACAGACAATAGCTGTGAGTCTGTAGTCATGAGCGGTCAGCCCCAACAACAGCAAAACCCAAATCCTCAGCATTCTAATAGGGATGCAAACAACCCAGCTGGTCTTCTTGCAATAGCCGAGGAGACCCTGGCAGAGTTCCTTTCCAAGGCTACAGGAACTGCTGTCGACTGGGTCCAGATGATTGGGATGAAGCCTGGTCCGGATTCTATTGGAATCGTTGCTGTTTCCCGCAACTGCAGTGGGGTAGCAGCGCGAGCCTGTGGTCTTGTTAGTCTAGAGCCAATGAAGGTTGCCGAAATTCTCAAAGATCGCCTTTCATGGTATCGTGACTGTCGCTGTCTTAATGTTTTAAGTGTAATCCCTACTGGAAATGGGGGAACAATAGAGCTCATATATATGCAGACTTACGCTCCAACGACATTAGCAGCTGCACGTGACTTCTGGACCATGAGATATACAACAAGTTTAGAAGATGGCAGTCTAGTGGTCTGTGAAAGGTCATTAACTACTTCAAGTGGTGGTCCAGCAGGGCCTCCGCCATCTACTTTTGTGAGAGCTGAAATGCTTCCTAGTGGTTATCTAATACGAGCATGCGAGGGTGGTGGATCTATTATTCACATTGTTGACCACATTGATTTGGATGTTTGGAGTGTCCCTGAAGTTCTTAGACCGCTTTATGAGTCATCCAAGATCCTTGCTCAGAAAATGACGATCGCTGCATTGCGTCACATTAGACAAATTGCACAAGAGACTAGTGGAGAGATCCAATATAGTGGGGGACGTCAACCAGCTGTATTACGAACGTTCGGTCAGAAACTCTGCAGGGGTTTCAACGACGCTGTTAATGGGTTTGCGGATGATGGTTGGTCAGCTATGAGCAGCGATGGTTTGGAGGATGTGACAATTGTCATAAACTCATCAGCAAACAAGTTACCTGGGTCACAGTATAAAACGTCTATGTATCCCTCTTTTGGTGGAGTGATGTGTGCAAAAGCATCAATGTTGCTTCAGAATGTTCCCCCTGCTTTGCTTATTCGTTTCTTGAGGGAGCATCGATCTGAATGGGCTGATTATGGAGTTGATGCCTACTCTGCCGCGAGTTTAAAAGCCAGTCCATATGCTGTTCCATGCGCAAGGCCAGGTGGTTTCCCTAGCAGCCAGGTTATTTTGCCTCTTGCTCACACTGTCGAGCACGAAGAGTTTCTGGAGGTGGTTCGGCTGGAGGGCCTTGCATTCTCCCCTGAAGATGTTGCTT
TGGCTGGACGAGATATGTACTTATTGCAGCTCTGTAATGGAGTTGATGAGAATGCAGTTGGAGCTTGTGCTCAGCTTGTGTTTGCGCCTATAGATGAATCTTTTGCCGATGATGCTCCATTACTTCCTTCTGGTTTTCGTGTCATACCCCTGGATCCAAAAACGGCATCTGCTGTGTTCACCTTTGCCAACCAGGCCGGTCTGGACATGCTTGAAACCACTCTTGTTGGCCTGCAAGATATAATGCTCGACAAAATTCTCGACGAAGCGGGTCGGAAGATCCTCTGTTCTGAATTCCCCAAAATAATGCAGCAGGGATTCACAAATCTGCCATCTGGCATTTGCGTATCGAGTATGGGTCGACCTATTTCTTACGAGCAAGGCGTTGCCTGGAAGGTTCTAAATGATGATGATTCCAATCACTGTCTGGCTTTCATGTTCATAAACTGGTCTTTTGTGTGATGGCGAAGGACATGACTCTAAGTGGGATTGTAATTTCAACTTAGACATATAATGCCTTTGGTTGACTGCAGGACTTTTTGCTTTTTCTTCCTCTTTTTGTTGGAAAAACAAATGTTTAGGTACTTGGATGGACCCTTTTGTTTCAAACTTAGACACCCATGTCTTAAAGTCTTTCATTTCATGAACTCTATTATATGCTTCTAATATGATAGCTACGTCTTCTGCTGAAGGATCATTTATCTTTATAGCTACATCTGCTTCAT

Coding sequence (CDS)

ATGGCGCTTGTAATGCACAAAGACTCATCAAACCAACAGATGGATACAAGCAAATACGTACGGTATACACCTGAACAAGTTGAGGCATTAGAGAGAGTCTATGCAGAATGTCCAAAGCCCAGTTCTCTTAGAAGGCAGCAGCTCATTAGAGAGTGCCCAATCCTTTCTAATATTGAGCCCAAACAGATCAAAGTTTGGTTTCAAAACCGCAGATGCCGGGAGAAGCAGAGGAAGGAATCTTCTCGCCTCCAGAGCGTAAACCGAAAGTTGTCTGCGATGAACAAGTTATTGATGGAGGAGAATGACCGTTTACAGAAGCAGGTGTCCCATTTAGTGTATGAGAATGGATTTATGCGCCAGCAACTGCATAGTCTCTTCTTTTCTGATGAACAGACATCTGGGACGACTACAGACAATAGCTGTGAGTCTGTAGTCATGAGCGGTCAGCCCCAACAACAGCAAAACCCAAATCCTCAGCATTCTAATAGGGATGCAAACAACCCAGCTGGTCTTCTTGCAATAGCCGAGGAGACCCTGGCAGAGTTCCTTTCCAAGGCTACAGGAACTGCTGTCGACTGGGTCCAGATGATTGGGATGAAGCCTGGTCCGGATTCTATTGGAATCGTTGCTGTTTCCCGCAACTGCAGTGGGGTAGCAGCGCGAGCCTGTGGTCTTGTTAGTCTAGAGCCAATGAAGGTTGCCGAAATTCTCAAAGATCGCCTTTCATGGTATCGTGACTGTCGCTGTCTTAATGTTTTAAGTGTAATCCCTACTGGAAATGGGGGAACAATAGAGCTCATATATATGCAGACTTACGCTCCAACGACATTAGCAGCTGCACGTGACTTCTGGACCATGAGATATACAACAAGTTTAGAAGATGGCAGTCTAGTGGTCTGTGAAAGGTCATTAACTACTTCAAGTGGTGGTCCAGCAGGGCCTCCGCCATCTACTTTTGTGAGAGCTGAAATGCTTCCTAGTGGTTATCTAATACGAGCATGCGAGGGTGGTGGATCTATTATTCACATTGTTGACCACATTGATTTGGATGTTTGGAGTGTCCCTGAAGTTCTTAGACCGCTTTATGAGTCATCCAAGATCCTTGCTCAGAAAATGACGATCGCTGCATTGCGTCACATTAGACAAATTGCACAAGAGACTAGTGGAGAGATCCAATATAGTGGGGGACGTCAACCAGCTGTATTACGAACGTTCGGTCAGAAACTCTGCAGGGGTTTCAACGACGCTGTTAATGGGTTTGCGGATGATGGTTGGTCAGCTATGAGCAGCGATGGTTTGGAGGATGTGACAATTGTCATAAACTCATCAGCAAACAAGTTACCTGGGTCACAGTATAAAACGTCTATGTATCCCTCTTTTGGTGGAGTGATGTGTGCAAAAGCATCAATGTTGCTTCAGAATGTTCCCCCTGCTTTGCTTATTCGTTTCTTGAGGGAGCATCGATCTGAATGGGCTGATTATGGAGTTGATGCCTACTCTGCCGCGAGTTTAAAAGCCAGTCCATATGCTGTTCCATGCGCAAGGCCAGGTGGTTTCCCTAGCAGCCAGGTTATTTTGCCTCTTGCTCACACTGTCGAGCACGAAGAGTTTCTGGAGGTGGTTCGGCTGGAGGGCCTTGCATTCTCCCCTGAAGATGTTGCTTTGGCTGGACGAGATATGTACTTATTGCAGCTCTGTAATGGAGTTGATGAGAATGCAGTTGGAGCTTGTGCTCAGCTTGTGTTTGCGCCTATAGATGAATCTTTTGCCGATGATGCTCCATTACTTCCTTCTGGTTTTCGTGTCATACCCCTGGATCCAAAAACGGCATCTGCTGTGTTCACCTTTGCCAACCAGGCCGGTCTGGACATGCTTGAAACCACTCTTGTTGGCCTGCAAGATATAATGCTCGACAAAATTCTCGACGAAGCGGGTCGGAAGATCCTCTGTTCTGAATTCCCCAAAATAATGCAGCAGGGATTCACAAATCTGCCATCTGG
CATTTGCGTATCGAGTATGGGTCGACCTATTTCTTACGAGCAAGGCGTTGCCTGGAAGGTTCTAAATGATGATGATTCCAATCACTGTCTGGCTTTCATGTTCATAAACTGGTCTTTTGTGTGA

Protein sequence

MALVMHKDSSNQQMDTSKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRCREKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYENGFMRQQLHSLFFSDEQTSGTTTDNSCESVVMSGQPQQQQNPNPQHSNRDANNPAGLLAIAEETLAEFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPMKVAEILKDRLSWYRDCRCLNVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDGSLVVCERSLTTSSGGPAGPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVPEVLRPLYESSKILAQKMTIAALRHIRQIAQETSGEIQYSGGRQPAVLRTFGQKLCRGFNDAVNGFADDGWSAMSSDGLEDVTIVINSSANKLPGSQYKTSMYPSFGGVMCAKASMLLQNVPPALLIRFLREHRSEWADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLAHTVEHEEFLEVVRLEGLAFSPEDVALAGRDMYLLQLCNGVDENAVGACAQLVFAPIDESFADDAPLLPSGFRVIPLDPKTASAVFTFANQAGLDMLETTLVGLQDIMLDKILDEAGRKILCSEFPKIMQQGFTNLPSGICVSSMGRPISYEQGVAWKVLNDDDSNHCLAFMFINWSFV
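
The relationship between the coding sequence and the protein sequence above can be checked by translating codons with the standard genetic code. This is a minimal sketch in plain Python, assuming the standard code applies to this nuclear gene; `translate` is a hypothetical helper, and only the first 24 and last 18 bases of the CDS are used to keep the example short.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)

# First eight codons of the CDS listed above.
print(translate("ATGGCGCTTGTAATGCACAAAGAC"))  # MALVMHKD
# Last six codons of the CDS, ending in the TGA stop codon.
print(translate("AACTGGTCTTTTGTGTGA"))  # NWSFV
```

Both fragments agree with the start (`MALVMHKD`) and end (`NWSFV`) of the protein sequence given above.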
BLAST of Cp4.1LG19g05650 vs. Swiss-Prot
Match: ATB14_ARATH (Homeobox-leucine zipper protein ATHB-14 OS=Arabidopsis thaliana GN=ATHB-14 PE=1 SV=1)

HSP 1 Score: 1001.1 bits (2587), Expect = 6.0e-291
Identity = 498/607 (82.04%), Positives = 548/607 (90.28%), Query Frame = 1

Query: 4   VMHKDSSNQQMDTSKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILSNIEPKQI 63
           +M+++S ++ +D+ KYVRYTPEQVEALERVY ECPKPSSLRRQQLIRECPILSNIEPKQI
Sbjct: 11  MMNRESPDKGLDSGKYVRYTPEQVEALERVYTECPKPSSLRRQQLIRECPILSNIEPKQI 70

Query: 64  KVWFQNRRCREKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYENGFMRQQLH 123
           KVWFQNRRCREKQRKE++RLQ+VNRKL+AMNKLLMEENDRLQKQVS+LVYENG M+ QLH
Sbjct: 71  KVWFQNRRCREKQRKEAARLQTVNRKLNAMNKLLMEENDRLQKQVSNLVYENGHMKHQLH 130

Query: 124 SLFFSDEQTSGTTTDNSCESVVMSGQPQQQQNPNPQHSNRDANNPAGLLAIAEETLAEFL 183
           +        SGTTTDNSCESVV+SGQ  QQQNPNPQH  RDANNPAGLL+IAEE LAEFL
Sbjct: 131 T-------ASGTTTDNSCESVVVSGQQHQQQNPNPQHQQRDANNPAGLLSIAEEALAEFL 190

Query: 184 SKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPMKVAEILKDRLSW 243
           SKATGTAVDWVQMIGMKPGPDSIGIVA+SRNCSG+AARACGLVSLEPMKVAEILKDR SW
Sbjct: 191 SKATGTAVDWVQMIGMKPGPDSIGIVAISRNCSGIAARACGLVSLEPMKVAEILKDRPSW 250

Query: 244 YRDCRCLNVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDGSLVVCERS 303
            RDCR ++ LSVIP GNGGTIELIY Q YAPTTLAAARDFWT+RY+T LEDGS VVCERS
Sbjct: 251 LRDCRSVDTLSVIPAGNGGTIELIYTQMYAPTTLAAARDFWTLRYSTCLEDGSYVVCERS 310

Query: 304 LTTSSGGPAGPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVPEVLRPLYE 363
           LT+++GGP GPP S FVRAEM PSG+LIR C+GGGSI+HIVDH+DLD WSVPEV+RPLYE
Sbjct: 311 LTSATGGPTGPPSSNFVRAEMKPSGFLIRPCDGGGSILHIVDHVDLDAWSVPEVMRPLYE 370

Query: 364 SSKILAQKMTIAALRHIRQIAQETSGEIQYSGGRQPAVLRTFGQKLCRGFNDAVNGFADD 423
           SSKILAQKMT+AALRH+RQIAQETSGE+QY GGRQPAVLRTF Q+LCRGFNDAVNGF DD
Sbjct: 371 SSKILAQKMTVAALRHVRQIAQETSGEVQYGGGRQPAVLRTFSQRLCRGFNDAVNGFVDD 430

Query: 424 GWSAMSSDGLEDVTIVINSSANKLPGSQYKTSMYPSFG-GVMCAKASMLLQNVPPALLIR 483
           GWS M SDG EDVT++IN S  K  GSQY  S  PSFG GV+CAKASMLLQNVPPA+L+R
Sbjct: 431 GWSPMGSDGAEDVTVMINLSPGKFGGSQYGNSFLPSFGSGVLCAKASMLLQNVPPAVLVR 490

Query: 484 FLREHRSEWADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLAHTVEHEEFLEVVR 543
           FLREHRSEWADYGVDAY+AASL+ASP+AVPCAR GGFPS+QVILPLA TVEHEE LEVVR
Sbjct: 491 FLREHRSEWADYGVDAYAAASLRASPFAVPCARAGGFPSNQVILPLAQTVEHEESLEVVR 550

Query: 544 LEGLAFSPEDVALAGRDMYLLQLCNGVDENAVGACAQLVFAPIDESFADDAPLLPSGFRV 603
           LEG A+SPED+ LA RDMYLLQLC+GVDEN VG CAQLVFAPIDESFADDAPLLPSGFR+
Sbjct: 551 LEGHAYSPEDMGLA-RDMYLLQLCSGVDENVVGGCAQLVFAPIDESFADDAPLLPSGFRI 609

Query: 604 IPLDPKT 610
           IPL+ K+
Sbjct: 611 IPLEQKS 609
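
The percentages in an HSP summary line are simply the match counts divided by the alignment length. The sketch below recomputes them for the ATB14_ARATH hit above; `parse_ratios` and `percent` are hypothetical helpers written for this illustration, not part of any BLAST tool.

```python
import re

def parse_ratios(line):
    """Return (label, matches, length, reported_percent) for each 'X = a/b (p%)' field."""
    fields = re.findall(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)", line)
    return [(label, int(a), int(b), float(p)) for label, a, b, p in fields]

def percent(matches, length):
    """Percentage of aligned positions, rounded to two decimals as in the report."""
    return round(100 * matches / length, 2)

for label, matches, length, reported in parse_ratios("Identity = 498/607 (82.04%)"):
    print(label, percent(matches, length), reported)  # Identity 82.04 82.04

# Conservative substitutions plus identities for the same HSP: 548 of 607 positions.
print(percent(548, 607))  # 90.28
```

The same check applies to every hit in this report, since each summary line follows the same `count/length (percent)` layout.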

BLAST of Cp4.1LG19g05650 vs. Swiss-Prot
Match: ATBH9_ARATH (Homeobox-leucine zipper protein ATHB-9 OS=Arabidopsis thaliana GN=ATHB-9 PE=1 SV=1)

HSP 1 Score: 989.9 bits (2558), Expect = 1.4e-287
Identity = 498/604 (82.45%), Positives = 547/604 (90.56%), Query Frame = 1

Query: 7   KDSSNQQMDTSKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVW 66
           +DS ++  D+ KYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPIL NIEP+QIKVW
Sbjct: 10  RDSPDKGFDSGKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILCNIEPRQIKVW 69

Query: 67  FQNRRCREKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYENGFMRQQLHSLF 126
           FQNRRCREKQRKES+RLQ+VNRKLSAMNKLLMEENDRLQKQVS+LVYENGFM+ ++H+  
Sbjct: 70  FQNRRCREKQRKESARLQTVNRKLSAMNKLLMEENDRLQKQVSNLVYENGFMKHRIHT-- 129

Query: 127 FSDEQTSGTTTDNSCESVVMSGQPQQQQNPNPQHSNRDANNPAGLLAIAEETLAEFLSKA 186
                 SGTTTDNSCESVV+SGQ +QQQNP  QH  RD NNPA LL+IAEETLAEFL KA
Sbjct: 130 -----ASGTTTDNSCESVVVSGQQRQQQNPTHQHPQRDVNNPANLLSIAEETLAEFLCKA 189

Query: 187 TGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPMKVAEILKDRLSWYRD 246
           TGTAVDWVQMIGMKPGPDSIGIVAVSRNCSG+AARACGLVSLEPMKVAEILKDR SW+RD
Sbjct: 190 TGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGIAARACGLVSLEPMKVAEILKDRPSWFRD 249

Query: 247 CRCLNVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDGSLVVCERSLTT 306
           CRC+  L+VIPTGNGGTIEL+  Q YAPTTLAAARDFWT+RY+TSLEDGS VVCERSLT+
Sbjct: 250 CRCVETLNVIPTGNGGTIELVNTQIYAPTTLAAARDFWTLRYSTSLEDGSYVVCERSLTS 309

Query: 307 SSGGPAGPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVPEVLRPLYESSK 366
           ++GGP GP  S+FVRA+ML SG+LIR C+GGGSIIHIVDH+DLDV SVPEVLRPLYESSK
Sbjct: 310 ATGGPNGPLSSSFVRAKMLSSGFLIRPCDGGGSIIHIVDHVDLDVSSVPEVLRPLYESSK 369

Query: 367 ILAQKMTIAALRHIRQIAQETSGEIQYSGGRQPAVLRTFGQKLCRGFNDAVNGFADDGWS 426
           ILAQKMT+AALRH+RQIAQETSGE+QYSGGRQPAVLRTF Q+LCRGFNDAVNGF DDGWS
Sbjct: 370 ILAQKMTVAALRHVRQIAQETSGEVQYSGGRQPAVLRTFSQRLCRGFNDAVNGFVDDGWS 429

Query: 427 AMSSDGLEDVTIVINSSANKLPGSQYKTSMYPSFG-GVMCAKASMLLQNVPPALLIRFLR 486
            MSSDG ED+TI+INSS+ K  GSQY +S  PSFG GV+CAKASMLLQNVPP +LIRFLR
Sbjct: 430 PMSSDGGEDITIMINSSSAKFAGSQYGSSFLPSFGSGVLCAKASMLLQNVPPLVLIRFLR 489

Query: 487 EHRSEWADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLAHTVEHEEFLEVVRLEG 546
           EHR+EWADYGVDAYSAASL+A+PYAVPC R GGFPS+QVILPLA T+EHEEFLEVVRL G
Sbjct: 490 EHRAEWADYGVDAYSAASLRATPYAVPCVRTGGFPSNQVILPLAQTLEHEEFLEVVRLGG 549

Query: 547 LAFSPEDVALAGRDMYLLQLCNGVDENAVGACAQLVFAPIDESFADDAPLLPSGFRVIPL 606
            A+SPED+ L+ RDMYLLQLC+GVDEN VG CAQLVFAPIDESFADDAPLLPSGFRVIPL
Sbjct: 550 HAYSPEDMGLS-RDMYLLQLCSGVDENVVGGCAQLVFAPIDESFADDAPLLPSGFRVIPL 605

Query: 607 DPKT 610
           D KT
Sbjct: 610 DQKT 605

BLAST of Cp4.1LG19g05650 vs. Swiss-Prot
Match: HOX32_ORYSJ (Homeobox-leucine zipper protein HOX32 OS=Oryza sativa subsp. japonica GN=HOX32 PE=2 SV=1)

HSP 1 Score: 962.6 bits (2487), Expect = 2.4e-279
Identity = 498/621 (80.19%), Positives = 534/621 (85.99%), Query Frame = 1

Query: 13  QMDTSKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRC 72
           Q+DT KYVRYTPEQVEALERVY ECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRC
Sbjct: 27  QVDTGKYVRYTPEQVEALERVYGECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRC 86

Query: 73  REKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYENGFMRQQLHSLFFSDEQT 132
           REKQRKE+SRLQ+VNRKL+AMNKLLMEENDRLQKQVS LVYENG+MRQQLH+        
Sbjct: 87  REKQRKEASRLQTVNRKLTAMNKLLMEENDRLQKQVSRLVYENGYMRQQLHN-------P 146

Query: 133 SGTTTDNSCESVVMSGQPQQQQNPNPQHSNRDANNPAGLLAIAEETLAEFLSKATGTAVD 192
           S  TTD SCESVV SGQ  QQQNP      RDANNPAGLLAIAEETLAEFLSKATGTAVD
Sbjct: 147 SVATTDTSCESVVTSGQHHQQQNPAATRPQRDANNPAGLLAIAEETLAEFLSKATGTAVD 206

Query: 193 WVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPMKVAEILKDRLSWYRDCRCLNV 252
           WVQM+GMKPGPDSIGI+AVS NCSGVAARACGLVSLEP KVAEILKDR SWYRDCRC++V
Sbjct: 207 WVQMVGMKPGPDSIGIIAVSHNCSGVAARACGLVSLEPTKVAEILKDRPSWYRDCRCVDV 266

Query: 253 LSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDGSLVVCERSLTTSSGGPA 312
           L VIPTGNGGTIELIYMQTYAPTTLAA RDFW +RYT+ LEDGSLV+CERSLT S+GGP+
Sbjct: 267 LHVIPTGNGGTIELIYMQTYAPTTLAAPRDFWILRYTSGLEDGSLVICERSLTQSTGGPS 326

Query: 313 GPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVPEVLRPLYESSKILAQKM 372
           GP    FVRAE+LPSGYLIR CEGGGS+IHIVDH+DLD WSVPEVLRPLYES KILAQKM
Sbjct: 327 GPNTPNFVRAEVLPSGYLIRPCEGGGSMIHIVDHVDLDAWSVPEVLRPLYESPKILAQKM 386

Query: 373 TIAALRHIRQIAQETSGEIQYSGGRQPAVLRTFGQKLCRGFNDAVNGFADDGWSAMSSDG 432
           TIAALRHIRQIA E+SGE+ Y GGRQPAVLRTF Q+L RGFNDAVNGF DDGWS MSSDG
Sbjct: 387 TIAALRHIRQIAHESSGEMPYGGGRQPAVLRTFSQRLSRGFNDAVNGFPDDGWSLMSSDG 446

Query: 433 LEDVTIVINSSANKLPGSQYKTSMYPSF--GGVMCAKASMLLQNVPPALLIRFLREHRSE 492
            EDVTI  NSS NKL GS   +S   S   GG++CAKASMLLQNVPPALL+RFLREHRSE
Sbjct: 447 AEDVTIAFNSSPNKLVGSHVNSSQLFSAIGGGILCAKASMLLQNVPPALLVRFLREHRSE 506

Query: 493 WADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLAHTVEHEEFLEVVRLEGLAFSP 552
           WAD GVDAYSAA+L+ASPYAVP  R GGF  SQVILPLAHT+EHEEFLEV+RLEG +   
Sbjct: 507 WADPGVDAYSAAALRASPYAVPGLRAGGFMGSQVILPLAHTLEHEEFLEVIRLEGHSLCH 566

Query: 553 EDVALAGRDMYLLQLCNGVDENAVGACAQLVFAPIDESFADDAPLLPSGFRVIPLDPKTA 612
           ++V L+ RDMYLLQLC+GVDENA GACAQLVFAPIDESFADDAPLLPSGFRVIPLD KT 
Sbjct: 567 DEVVLS-RDMYLLQLCSGVDENAAGACAQLVFAPIDESFADDAPLLPSGFRVIPLDGKTD 626

Query: 613 SAVFTFANQAGLDMLETTLVG 632
           +   T      LD+  T  VG
Sbjct: 627 APSAT----RTLDLASTLEVG 635

BLAST of Cp4.1LG19g05650 vs. Swiss-Prot
Match: HOX32_ORYSI (Homeobox-leucine zipper protein HOX32 OS=Oryza sativa subsp. indica GN=HOX32 PE=2 SV=1)

HSP 1 Score: 962.6 bits (2487), Expect = 2.4e-279
Identity = 498/621 (80.19%), Positives = 534/621 (85.99%), Query Frame = 1

Query: 13  QMDTSKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRC 72
           Q+DT KYVRYTPEQVEALERVY ECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRC
Sbjct: 27  QVDTGKYVRYTPEQVEALERVYGECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRC 86

Query: 73  REKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYENGFMRQQLHSLFFSDEQT 132
           REKQRKE+SRLQ+VNRKL+AMNKLLMEENDRLQKQVS LVYENG+MRQQLH+        
Sbjct: 87  REKQRKEASRLQTVNRKLTAMNKLLMEENDRLQKQVSRLVYENGYMRQQLHN-------P 146

Query: 133 SGTTTDNSCESVVMSGQPQQQQNPNPQHSNRDANNPAGLLAIAEETLAEFLSKATGTAVD 192
           S  TTD SCESVV SGQ  QQQNP      RDANNPAGLLAIAEETLAEFLSKATGTAVD
Sbjct: 147 SVATTDTSCESVVTSGQHHQQQNPAATRPQRDANNPAGLLAIAEETLAEFLSKATGTAVD 206

Query: 193 WVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPMKVAEILKDRLSWYRDCRCLNV 252
           WVQM+GMKPGPDSIGI+AVS NCSGVAARACGLVSLEP KVAEILKDR SWYRDCRC++V
Sbjct: 207 WVQMVGMKPGPDSIGIIAVSHNCSGVAARACGLVSLEPTKVAEILKDRPSWYRDCRCVDV 266

Query: 253 LSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDGSLVVCERSLTTSSGGPA 312
           L VIPTGNGGTIELIYMQTYAPTTLAA RDFW +RYT+ LEDGSLV+CERSLT S+GGP+
Sbjct: 267 LHVIPTGNGGTIELIYMQTYAPTTLAAPRDFWILRYTSGLEDGSLVICERSLTQSTGGPS 326

Query: 313 GPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVPEVLRPLYESSKILAQKM 372
           GP    FVRAE+LPSGYLIR CEGGGS+IHIVDH+DLD WSVPEVLRPLYES KILAQKM
Sbjct: 327 GPNTPNFVRAEVLPSGYLIRPCEGGGSMIHIVDHVDLDAWSVPEVLRPLYESPKILAQKM 386

Query: 373 TIAALRHIRQIAQETSGEIQYSGGRQPAVLRTFGQKLCRGFNDAVNGFADDGWSAMSSDG 432
           TIAALRHIRQIA E+SGE+ Y GGRQPAVLRTF Q+L RGFNDAVNGF DDGWS MSSDG
Sbjct: 387 TIAALRHIRQIAHESSGEMPYGGGRQPAVLRTFSQRLSRGFNDAVNGFPDDGWSLMSSDG 446

Query: 433 LEDVTIVINSSANKLPGSQYKTSMYPSF--GGVMCAKASMLLQNVPPALLIRFLREHRSE 492
            EDVTI  NSS NKL GS   +S   S   GG++CAKASMLLQNVPPALL+RFLREHRSE
Sbjct: 447 AEDVTIAFNSSPNKLVGSHVNSSQLFSAIGGGILCAKASMLLQNVPPALLVRFLREHRSE 506

Query: 493 WADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLAHTVEHEEFLEVVRLEGLAFSP 552
           WAD GVDAYSAA+L+ASPYAVP  R GGF  SQVILPLAHT+EHEEFLEV+RLEG +   
Sbjct: 507 WADPGVDAYSAAALRASPYAVPGLRAGGFMGSQVILPLAHTLEHEEFLEVIRLEGHSLCH 566

Query: 553 EDVALAGRDMYLLQLCNGVDENAVGACAQLVFAPIDESFADDAPLLPSGFRVIPLDPKTA 612
           ++V L+ RDMYLLQLC+GVDENA GACAQLVFAPIDESFADDAPLLPSGFRVIPLD KT 
Sbjct: 567 DEVVLS-RDMYLLQLCSGVDENAAGACAQLVFAPIDESFADDAPLLPSGFRVIPLDGKTD 626

Query: 613 SAVFTFANQAGLDMLETTLVG 632
           +   T      LD+  T  VG
Sbjct: 627 APSAT----RTLDLASTLEVG 635

BLAST of Cp4.1LG19g05650 vs. Swiss-Prot
Match: HOX33_ORYSI (Homeobox-leucine zipper protein HOX33 OS=Oryza sativa subsp. indica GN=HOX33 PE=2 SV=2)

HSP 1 Score: 943.7 bits (2438), Expect = 1.1e-273
Identity = 482/600 (80.33%), Positives = 522/600 (87.00%), Query Frame = 1

Query: 13  QMDTSKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRC 72
           Q+D  KYVRYTPEQVEALERVY ECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRC
Sbjct: 24  QVDAGKYVRYTPEQVEALERVYTECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRC 83

Query: 73  REKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYENGFMRQQLHSLFFSDEQT 132
           REKQRKE+SRLQ+VNRKL+AMNKLLMEENDRLQKQVS LVYENG+MR QLH+        
Sbjct: 84  REKQRKEASRLQTVNRKLNAMNKLLMEENDRLQKQVSRLVYENGYMRTQLHN-------P 143

Query: 133 SGTTTDNSCESVVMSGQPQQQQNPNPQHSNRDANNPAGLLAIAEETLAEFLSKATGTAVD 192
           S  TTD SCESVV SGQ  QQQNP   H  RDANNPAGLLAIAEETLAEF+SKATGTAV+
Sbjct: 144 SAATTDTSCESVVTSGQHHQQQNPAVLHPQRDANNPAGLLAIAEETLAEFMSKATGTAVE 203

Query: 193 WVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPMKVAEILKDRLSWYRDCRCLNV 252
           WVQM+GMKPGPDSIGI+AVS NCSGVAARACGLVSLEP KVAEILKDR SWYRDCRC+++
Sbjct: 204 WVQMVGMKPGPDSIGIIAVSHNCSGVAARACGLVSLEPTKVAEILKDRPSWYRDCRCVDI 263

Query: 253 LSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDGSLVVCERSLTTSSGGPA 312
           + VIPTGNGGTIELIYMQTYAPTTLAA RDFWT+RYT+ LEDGSLV+CERSLT S+GGP+
Sbjct: 264 IHVIPTGNGGTIELIYMQTYAPTTLAAPRDFWTLRYTSGLEDGSLVICERSLTQSTGGPS 323

Query: 313 GPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVPEVLRPLYESSKILAQKM 372
           GP    F+RAE+LPSGYLIR CEGGGS+I+IVDH+DLD WSVPEVLRPLYES KILAQKM
Sbjct: 324 GPNTPNFIRAEVLPSGYLIRPCEGGGSMIYIVDHVDLDAWSVPEVLRPLYESPKILAQKM 383

Query: 373 TIAALRHIRQIAQETSGEIQYSGGRQPAVLRTFGQKLCRGFNDAVNGFADDGWSAMSSDG 432
           TIAALRHIRQIA E+SGEI Y  GRQPAV RTF Q+L RGFNDAV+GF DDGWS +SSDG
Sbjct: 384 TIAALRHIRQIAHESSGEIPYGAGRQPAVFRTFSQRLSRGFNDAVSGFPDDGWSLLSSDG 443

Query: 433 LEDVTIVINSSANKLPGSQYKTSMYPSF----GGVMCAKASMLLQNVPPALLIRFLREHR 492
            ED+TI +NSS NKL GS    S  P F    GG++CAKASMLLQNVPPALL+RFLREHR
Sbjct: 444 SEDITISVNSSPNKLVGSH--VSPNPLFSTVGGGILCAKASMLLQNVPPALLVRFLREHR 503

Query: 493 SEWADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLAHTVEHEEFLEVVRLEGLAF 552
           SEWAD GVDAYSAASL+ASPYAVP  R  GF  SQVILPLAHT+EHEEFLEV+RLEG  F
Sbjct: 504 SEWADPGVDAYSAASLRASPYAVPGLRTSGFMGSQVILPLAHTLEHEEFLEVIRLEGHGF 563

Query: 553 SPEDVALAGRDMYLLQLCNGVDENAVGACAQLVFAPIDESFADDAPLLPSGFRVIPLDPK 609
           S ++V L+ RDMYLLQLC+GVDENA  A AQLVFAPIDESFADDAPLLPSGFRVIPLD K
Sbjct: 564 SHDEVLLS-RDMYLLQLCSGVDENATSASAQLVFAPIDESFADDAPLLPSGFRVIPLDTK 613

BLAST of Cp4.1LG19g05650 vs. TrEMBL
Match: A0A0A0KM58_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G525430 PE=4 SV=1)

HSP 1 Score: 1135.6 bits (2936), Expect = 0.0e+00
Identity = 574/610 (94.10%), Positives = 590/610 (96.72%), Query Frame = 1

Query: 1   MALVMHKDSSNQQMDTSKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILSNIEP 60
           MALV+HKD+SN+QMD+SKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILSNIEP
Sbjct: 1   MALVIHKDTSNKQMDSSKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILSNIEP 60

Query: 61  KQIKVWFQNRRCREKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYENGFMRQ 120
           KQIKVWFQNRRCREKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYENGFMRQ
Sbjct: 61  KQIKVWFQNRRCREKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYENGFMRQ 120

Query: 121 QLHSLFFSDEQTSGTTTDNSCESVVMSGQPQQQQNPNPQHSNRDANNPAGLLAIAEETLA 180
           QLHS        SGTTTDNSCESVVMSGQPQQQQNPNPQH NRD NNPAGLLA+AEETLA
Sbjct: 121 QLHS-------ASGTTTDNSCESVVMSGQPQQQQNPNPQHPNRDVNNPAGLLAVAEETLA 180

Query: 181 EFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPMKVAEILKDR 240
           EFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEP KVAEILKDR
Sbjct: 181 EFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPTKVAEILKDR 240

Query: 241 LSWYRDCRCLNVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDGSLVVC 300
           LSWYRDCRCLNVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDGSLVVC
Sbjct: 241 LSWYRDCRCLNVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDGSLVVC 300

Query: 301 ERSLTTSSGGPAGPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVPEVLRP 360
           ERSL++SSGGPAGPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVPEVLRP
Sbjct: 301 ERSLSSSSGGPAGPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVPEVLRP 360

Query: 361 LYESSKILAQKMTIAALRHIRQIAQETSGEIQYSGGRQPAVLRTFGQKLCRGFNDAVNGF 420
           LYESSKILAQK+TIAALRHIRQIAQET+GEIQ +GGRQPAVLRTF QKLCRGFNDAVNGF
Sbjct: 361 LYESSKILAQKITIAALRHIRQIAQETNGEIQCTGGRQPAVLRTFSQKLCRGFNDAVNGF 420

Query: 421 ADDGWSAMSSDGLEDVTIVINSSANKLPGSQYKTSMYPSF-GGVMCAKASMLLQNVPPAL 480
           ADDGWS M SDG+EDVTI+IN+SANK  GSQY TS+YPSF GGVMCAKASMLLQNVPPAL
Sbjct: 421 ADDGWSPMGSDGVEDVTILINTSANKFSGSQYNTSLYPSFGGGVMCAKASMLLQNVPPAL 480

Query: 481 LIRFLREHRSEWADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLAHTVEHEEFLE 540
           L+RFLREHRSEWADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLA TVEHEEFLE
Sbjct: 481 LVRFLREHRSEWADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLATTVEHEEFLE 540

Query: 541 VVRLEGLAFSPEDVALAGRDMYLLQLCNGVDENAVGACAQLVFAPIDESFADDAPLLPSG 600
           VVRLEGLAFSPEDVALAGRDMYLLQLC+GVDENAVGACAQLVFAPIDESFADDAPLLPSG
Sbjct: 541 VVRLEGLAFSPEDVALAGRDMYLLQLCSGVDENAVGACAQLVFAPIDESFADDAPLLPSG 600

Query: 601 FRVIPLDPKT 610
           FRVIPLDPKT
Sbjct: 601 FRVIPLDPKT 603

BLAST of Cp4.1LG19g05650 vs. TrEMBL
Match: W9RI95_9ROSA (Homeobox-leucine zipper protein HOX32 OS=Morus notabilis GN=L484_026012 PE=4 SV=1)

HSP 1 Score: 1080.1 bits (2792), Expect = 0.0e+00
Identity = 549/634 (86.59%), Positives = 582/634 (91.80%), Query Frame = 1

Query: 1   MALVMHKDSSN---QQMDTSKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILSN 60
           MALV+HKD+SN   +QMD+SKYVRYTPEQVEALERVY+ECPKPSSLRRQQLIRECPILSN
Sbjct: 1   MALVIHKDNSNNINKQMDSSKYVRYTPEQVEALERVYSECPKPSSLRRQQLIRECPILSN 60

Query: 61  IEPKQIKVWFQNRRCREKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYENGF 120
           IEPKQIKVWFQNRRCREKQRKE+SRLQ+VNRKLSAMNKLLMEENDRLQKQVSHLVYENG+
Sbjct: 61  IEPKQIKVWFQNRRCREKQRKEASRLQTVNRKLSAMNKLLMEENDRLQKQVSHLVYENGY 120

Query: 121 MRQQLHSLFFSDEQTSGTTTDNSCESVVMSGQPQQQQNPNPQHSNRDANNPAGLLAIAEE 180
           MRQQLH+        SG TTDNSCESVVMSGQ QQQQNP PQH  RDANNPAGLLAIAEE
Sbjct: 121 MRQQLHT-------ASGATTDNSCESVVMSGQNQQQQNPTPQHPQRDANNPAGLLAIAEE 180

Query: 181 TLAEFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPMKVAEIL 240
           TLAEFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEP KVAEIL
Sbjct: 181 TLAEFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPAKVAEIL 240

Query: 241 KDRLSWYRDCRCLNVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDGSL 300
           KDR SW+RDCRC++VLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWT+RYTTSLED SL
Sbjct: 241 KDRPSWFRDCRCVDVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTLRYTTSLEDNSL 300

Query: 301 VVCERSLTTSSGGPAGPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVPEV 360
           V+CERSLTTS+GGP GPP S FVRAEMLPSGYLIR CEGGGSII+IVDH+DLD WSVPEV
Sbjct: 301 VICERSLTTSTGGPTGPPSSCFVRAEMLPSGYLIRPCEGGGSIINIVDHVDLDAWSVPEV 360

Query: 361 LRPLYESSKILAQKMTIAALRHIRQIAQETSGEIQYSGGRQPAVLRTFGQKLCRGFNDAV 420
           LRPLYESSKILAQKMT+AALRHIRQIAQETSGEIQY GGRQPAVLRTF Q+LCRGFNDAV
Sbjct: 361 LRPLYESSKILAQKMTVAALRHIRQIAQETSGEIQYGGGRQPAVLRTFSQRLCRGFNDAV 420

Query: 421 NGFADDGWSAMSSDGLEDVTIVINSSANKLPGSQYKTSMYPSFGGVMCAKASMLLQNVPP 480
           NGF DDGWS + SDG EDVTIVINSS+NK  GSQY  SM+P+FGGV+CAKASMLLQNVPP
Sbjct: 421 NGFVDDGWSLLGSDGAEDVTIVINSSSNKFLGSQYNASMFPTFGGVLCAKASMLLQNVPP 480

Query: 481 ALLIRFLREHRSEWADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLAHTVEHEEF 540
           ALL+RFLREHRSEWADYGVDAYSA+ LKASPYA+PCARPGGFPSSQVILPLAHTVEHEEF
Sbjct: 481 ALLVRFLREHRSEWADYGVDAYSASCLKASPYAIPCARPGGFPSSQVILPLAHTVEHEEF 540

Query: 541 LEVVRLEGLAFSPEDVALAGRDMYLLQLCNGVDENAVGACAQLVFAPIDESFADDAPLLP 600
           LEVVRLEG AFSPE+VALA RDMYLLQLC+GVDE+AVGACAQLVFAPIDESFAD+APLLP
Sbjct: 541 LEVVRLEGHAFSPEEVALA-RDMYLLQLCSGVDESAVGACAQLVFAPIDESFADEAPLLP 600

Query: 601 SGFRVIPLDPKTASAVFTFANQAGLDMLETTLVG 632
           SGFRVIPLDPK  +   T      LD+  T  VG
Sbjct: 601 SGFRVIPLDPKADTPAAT----RTLDLASTLEVG 622

BLAST of Cp4.1LG19g05650 vs. TrEMBL
Match: A0A061G7E3_THECC (Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein isoform 5 OS=Theobroma cacao GN=TCM_026907 PE=4 SV=1)

HSP 1 Score: 1075.5 bits (2780), Expect = 0.0e+00
Identity = 550/615 (89.43%), Positives = 576/615 (93.66%), Query Frame = 1

Query: 1   MALVMHKDSSN-QQMDTSKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILSNIE 60
           MAL MHKDSSN +QMD+SKYVRYTPEQVEALERVY+ECPKPSSLRRQQLIRECPILSNIE
Sbjct: 1   MALSMHKDSSNNKQMDSSKYVRYTPEQVEALERVYSECPKPSSLRRQQLIRECPILSNIE 60

Query: 61  PKQIKVWFQNRRCREKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYENGFMR 120
           PKQIKVWFQNRRCREKQRKE+SRLQ+VNRKLSAMNKLLMEENDRLQKQVS LVYENG+MR
Sbjct: 61  PKQIKVWFQNRRCREKQRKEASRLQTVNRKLSAMNKLLMEENDRLQKQVSQLVYENGYMR 120

Query: 121 QQLHSLFFSDEQTSGTTTD-NSCESVVMSGQPQQQQNPNPQHSNRDANNPAGLLAIAEET 180
           QQL +        S TTTD NSCESVVMSGQ QQQQNP PQH  RDAN+PAGLLAIAEET
Sbjct: 121 QQLQT-------GSATTTDNNSCESVVMSGQHQQQQNPTPQHPQRDANSPAGLLAIAEET 180

Query: 181 LAEFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPMKVAEILK 240
           LAEFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEP KVAEILK
Sbjct: 181 LAEFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPTKVAEILK 240

Query: 241 DRLSWYRDCRCLNVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDGSLV 300
           DR SW+RDCRCL+VLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWT+RYTTSLEDGSLV
Sbjct: 241 DRPSWFRDCRCLDVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTLRYTTSLEDGSLV 300

Query: 301 VCERSLTTSSGGPAGPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVPEVL 360
           +CERSLT+S+GGP GPP S+FVRAEMLPSG+LIR CEGGGSIIHIVDH+DLDVWSVPEVL
Sbjct: 301 ICERSLTSSTGGPTGPPTSSFVRAEMLPSGFLIRPCEGGGSIIHIVDHVDLDVWSVPEVL 360

Query: 361 RPLYESSKILAQKMTIAALRHIRQIAQETSGEIQYSGGRQPAVLRTFGQKLCRGFNDAVN 420
           RPLYESSKILAQKMTIAALRHIRQIAQETSGEIQY GGRQPAVLRTF Q+LCRGFNDAVN
Sbjct: 361 RPLYESSKILAQKMTIAALRHIRQIAQETSGEIQYGGGRQPAVLRTFSQRLCRGFNDAVN 420

Query: 421 GFADDGWSAMSSDGLEDVTIVINSSANKLPGSQYKTSMYPSF-GGVMCAKASMLLQNVPP 480
           GFADDGWS M SDG+EDVTI+INSS  K  GSQY TSM+PSF GGV+CAKASMLLQNVPP
Sbjct: 421 GFADDGWSLMGSDGVEDVTIMINSSPGKFLGSQYNTSMFPSFGGGVLCAKASMLLQNVPP 480

Query: 481 ALLIRFLREHRSEWADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLAHTVEHEEF 540
           ALL+RFLREHRSEWADYGVD YSAA LKASPYAVPCARPGGFPSSQVILPLAHTVEHEEF
Sbjct: 481 ALLVRFLREHRSEWADYGVDTYSAACLKASPYAVPCARPGGFPSSQVILPLAHTVEHEEF 540

Query: 541 LEVVRLEGLAFSPEDVALAGRDMYLLQLCNGVDENAVGACAQLVFAPIDESFADDAPLLP 600
           LEVVRLEG AF+PEDVALA RDMYLLQLC+G+DENAVGACAQLVFAPIDESFADDAPLLP
Sbjct: 541 LEVVRLEGHAFTPEDVALA-RDMYLLQLCSGIDENAVGACAQLVFAPIDESFADDAPLLP 600

Query: 601 SGFRVIPLDPKTASA 613
           SGFRVIPLDPKT  A
Sbjct: 601 SGFRVIPLDPKTDGA 607

BLAST of Cp4.1LG19g05650 vs. TrEMBL
Match: A0A061GEN1_THECC (Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein isoform 3 OS=Theobroma cacao GN=TCM_026907 PE=4 SV=1)

HSP 1 Score: 1075.1 bits (2779), Expect = 0.0e+00
Identity = 549/614 (89.41%), Positives = 576/614 (93.81%), Query Frame = 1

Query: 1   MALVMHKDSSN-QQMDTSKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILSNIE 60
           MAL MHKDSSN +QMD+SKYVRYTPEQVEALERVY+ECPKPSSLRRQQLIRECPILSNIE
Sbjct: 1   MALSMHKDSSNNKQMDSSKYVRYTPEQVEALERVYSECPKPSSLRRQQLIRECPILSNIE 60

Query: 61  PKQIKVWFQNRRCREKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYENGFMR 120
           PKQIKVWFQNRRCREKQRKE+SRLQ+VNRKLSAMNKLLMEENDRLQKQVS LVYENG+MR
Sbjct: 61  PKQIKVWFQNRRCREKQRKEASRLQTVNRKLSAMNKLLMEENDRLQKQVSQLVYENGYMR 120

Query: 121 QQLHSLFFSDEQTSGTTTD-NSCESVVMSGQPQQQQNPNPQHSNRDANNPAGLLAIAEET 180
           QQL +        S TTTD NSCESVVMSGQ QQQQNP PQH  RDAN+PAGLLAIAEET
Sbjct: 121 QQLQT-------GSATTTDNNSCESVVMSGQHQQQQNPTPQHPQRDANSPAGLLAIAEET 180

Query: 181 LAEFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPMKVAEILK 240
           LAEFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEP KVAEILK
Sbjct: 181 LAEFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPTKVAEILK 240

Query: 241 DRLSWYRDCRCLNVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDGSLV 300
           DR SW+RDCRCL+VLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWT+RYTTSLEDGSLV
Sbjct: 241 DRPSWFRDCRCLDVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTLRYTTSLEDGSLV 300

Query: 301 VCERSLTTSSGGPAGPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVPEVL 360
           +CERSLT+S+GGP GPP S+FVRAEMLPSG+LIR CEGGGSIIHIVDH+DLDVWSVPEVL
Sbjct: 301 ICERSLTSSTGGPTGPPTSSFVRAEMLPSGFLIRPCEGGGSIIHIVDHVDLDVWSVPEVL 360

Query: 361 RPLYESSKILAQKMTIAALRHIRQIAQETSGEIQYSGGRQPAVLRTFGQKLCRGFNDAVN 420
           RPLYESSKILAQKMTIAALRHIRQIAQETSGEIQY GGRQPAVLRTF Q+LCRGFNDAVN
Sbjct: 361 RPLYESSKILAQKMTIAALRHIRQIAQETSGEIQYGGGRQPAVLRTFSQRLCRGFNDAVN 420

Query: 421 GFADDGWSAMSSDGLEDVTIVINSSANKLPGSQYKTSMYPSF-GGVMCAKASMLLQNVPP 480
           GFADDGWS M SDG+EDVTI+INSS  K  GSQY TSM+PSF GGV+CAKASMLLQNVPP
Sbjct: 421 GFADDGWSLMGSDGVEDVTIMINSSPGKFLGSQYNTSMFPSFGGGVLCAKASMLLQNVPP 480

Query: 481 ALLIRFLREHRSEWADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLAHTVEHEEF 540
           ALL+RFLREHRSEWADYGVD YSAA LKASPYAVPCARPGGFPSSQVILPLAHTVEHEEF
Sbjct: 481 ALLVRFLREHRSEWADYGVDTYSAACLKASPYAVPCARPGGFPSSQVILPLAHTVEHEEF 540

Query: 541 LEVVRLEGLAFSPEDVALAGRDMYLLQLCNGVDENAVGACAQLVFAPIDESFADDAPLLP 600
           LEVVRLEG AF+PEDVALA RDMYLLQLC+G+DENAVGACAQLVFAPIDESFADDAPLLP
Sbjct: 541 LEVVRLEGHAFTPEDVALA-RDMYLLQLCSGIDENAVGACAQLVFAPIDESFADDAPLLP 600

Query: 601 SGFRVIPLDPKTAS 612
           SGFRVIPLDPKT +
Sbjct: 601 SGFRVIPLDPKTVT 606

BLAST of Cp4.1LG19g05650 vs. TrEMBL
Match: A0A061G8K4_THECC (Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein isoform 6 OS=Theobroma cacao GN=TCM_026907 PE=4 SV=1)

HSP 1 Score: 1075.1 bits (2779), Expect = 0.0e+00
Identity = 549/614 (89.41%), Positives = 576/614 (93.81%), Query Frame = 1

Query: 1   MALVMHKDSSN-QQMDTSKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILSNIE 60
           MAL MHKDSSN +QMD+SKYVRYTPEQVEALERVY+ECPKPSSLRRQQLIRECPILSNIE
Sbjct: 1   MALSMHKDSSNNKQMDSSKYVRYTPEQVEALERVYSECPKPSSLRRQQLIRECPILSNIE 60

Query: 61  PKQIKVWFQNRRCREKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYENGFMR 120
           PKQIKVWFQNRRCREKQRKE+SRLQ+VNRKLSAMNKLLMEENDRLQKQVS LVYENG+MR
Sbjct: 61  PKQIKVWFQNRRCREKQRKEASRLQTVNRKLSAMNKLLMEENDRLQKQVSQLVYENGYMR 120

Query: 121 QQLHSLFFSDEQTSGTTTD-NSCESVVMSGQPQQQQNPNPQHSNRDANNPAGLLAIAEET 180
           QQL +        S TTTD NSCESVVMSGQ QQQQNP PQH  RDAN+PAGLLAIAEET
Sbjct: 121 QQLQT-------GSATTTDNNSCESVVMSGQHQQQQNPTPQHPQRDANSPAGLLAIAEET 180

Query: 181 LAEFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPMKVAEILK 240
           LAEFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEP KVAEILK
Sbjct: 181 LAEFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPTKVAEILK 240

Query: 241 DRLSWYRDCRCLNVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDGSLV 300
           DR SW+RDCRCL+VLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWT+RYTTSLEDGSLV
Sbjct: 241 DRPSWFRDCRCLDVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTLRYTTSLEDGSLV 300

Query: 301 VCERSLTTSSGGPAGPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVPEVL 360
           +CERSLT+S+GGP GPP S+FVRAEMLPSG+LIR CEGGGSIIHIVDH+DLDVWSVPEVL
Sbjct: 301 ICERSLTSSTGGPTGPPTSSFVRAEMLPSGFLIRPCEGGGSIIHIVDHVDLDVWSVPEVL 360

Query: 361 RPLYESSKILAQKMTIAALRHIRQIAQETSGEIQYSGGRQPAVLRTFGQKLCRGFNDAVN 420
           RPLYESSKILAQKMTIAALRHIRQIAQETSGEIQY GGRQPAVLRTF Q+LCRGFNDAVN
Sbjct: 361 RPLYESSKILAQKMTIAALRHIRQIAQETSGEIQYGGGRQPAVLRTFSQRLCRGFNDAVN 420

Query: 421 GFADDGWSAMSSDGLEDVTIVINSSANKLPGSQYKTSMYPSF-GGVMCAKASMLLQNVPP 480
           GFADDGWS M SDG+EDVTI+INSS  K  GSQY TSM+PSF GGV+CAKASMLLQNVPP
Sbjct: 421 GFADDGWSLMGSDGVEDVTIMINSSPGKFLGSQYNTSMFPSFGGGVLCAKASMLLQNVPP 480

Query: 481 ALLIRFLREHRSEWADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLAHTVEHEEF 540
           ALL+RFLREHRSEWADYGVD YSAA LKASPYAVPCARPGGFPSSQVILPLAHTVEHEEF
Sbjct: 481 ALLVRFLREHRSEWADYGVDTYSAACLKASPYAVPCARPGGFPSSQVILPLAHTVEHEEF 540

Query: 541 LEVVRLEGLAFSPEDVALAGRDMYLLQLCNGVDENAVGACAQLVFAPIDESFADDAPLLP 600
           LEVVRLEG AF+PEDVALA RDMYLLQLC+G+DENAVGACAQLVFAPIDESFADDAPLLP
Sbjct: 541 LEVVRLEGHAFTPEDVALA-RDMYLLQLCSGIDENAVGACAQLVFAPIDESFADDAPLLP 600

Query: 601 SGFRVIPLDPKTAS 612
           SGFRVIPLDPKT +
Sbjct: 601 SGFRVIPLDPKTVT 606

BLAST of Cp4.1LG19g05650 vs. TAIR10
Match: AT2G34710.1 (AT2G34710.1 Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein)

HSP 1 Score: 1001.1 bits (2587), Expect = 3.4e-292
Identity = 498/607 (82.04%), Positives = 548/607 (90.28%), Query Frame = 1

Query: 4   VMHKDSSNQQMDTSKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILSNIEPKQI 63
           +M+++S ++ +D+ KYVRYTPEQVEALERVY ECPKPSSLRRQQLIRECPILSNIEPKQI
Sbjct: 11  MMNRESPDKGLDSGKYVRYTPEQVEALERVYTECPKPSSLRRQQLIRECPILSNIEPKQI 70

Query: 64  KVWFQNRRCREKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYENGFMRQQLH 123
           KVWFQNRRCREKQRKE++RLQ+VNRKL+AMNKLLMEENDRLQKQVS+LVYENG M+ QLH
Sbjct: 71  KVWFQNRRCREKQRKEAARLQTVNRKLNAMNKLLMEENDRLQKQVSNLVYENGHMKHQLH 130

Query: 124 SLFFSDEQTSGTTTDNSCESVVMSGQPQQQQNPNPQHSNRDANNPAGLLAIAEETLAEFL 183
           +        SGTTTDNSCESVV+SGQ  QQQNPNPQH  RDANNPAGLL+IAEE LAEFL
Sbjct: 131 T-------ASGTTTDNSCESVVVSGQQHQQQNPNPQHQQRDANNPAGLLSIAEEALAEFL 190

Query: 184 SKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPMKVAEILKDRLSW 243
           SKATGTAVDWVQMIGMKPGPDSIGIVA+SRNCSG+AARACGLVSLEPMKVAEILKDR SW
Sbjct: 191 SKATGTAVDWVQMIGMKPGPDSIGIVAISRNCSGIAARACGLVSLEPMKVAEILKDRPSW 250

Query: 244 YRDCRCLNVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDGSLVVCERS 303
            RDCR ++ LSVIP GNGGTIELIY Q YAPTTLAAARDFWT+RY+T LEDGS VVCERS
Sbjct: 251 LRDCRSVDTLSVIPAGNGGTIELIYTQMYAPTTLAAARDFWTLRYSTCLEDGSYVVCERS 310

Query: 304 LTTSSGGPAGPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVPEVLRPLYE 363
           LT+++GGP GPP S FVRAEM PSG+LIR C+GGGSI+HIVDH+DLD WSVPEV+RPLYE
Sbjct: 311 LTSATGGPTGPPSSNFVRAEMKPSGFLIRPCDGGGSILHIVDHVDLDAWSVPEVMRPLYE 370

Query: 364 SSKILAQKMTIAALRHIRQIAQETSGEIQYSGGRQPAVLRTFGQKLCRGFNDAVNGFADD 423
           SSKILAQKMT+AALRH+RQIAQETSGE+QY GGRQPAVLRTF Q+LCRGFNDAVNGF DD
Sbjct: 371 SSKILAQKMTVAALRHVRQIAQETSGEVQYGGGRQPAVLRTFSQRLCRGFNDAVNGFVDD 430

Query: 424 GWSAMSSDGLEDVTIVINSSANKLPGSQYKTSMYPSFG-GVMCAKASMLLQNVPPALLIR 483
           GWS M SDG EDVT++IN S  K  GSQY  S  PSFG GV+CAKASMLLQNVPPA+L+R
Sbjct: 431 GWSPMGSDGAEDVTVMINLSPGKFGGSQYGNSFLPSFGSGVLCAKASMLLQNVPPAVLVR 490

Query: 484 FLREHRSEWADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLAHTVEHEEFLEVVR 543
           FLREHRSEWADYGVDAY+AASL+ASP+AVPCAR GGFPS+QVILPLA TVEHEE LEVVR
Sbjct: 491 FLREHRSEWADYGVDAYAAASLRASPFAVPCARAGGFPSNQVILPLAQTVEHEESLEVVR 550

Query: 544 LEGLAFSPEDVALAGRDMYLLQLCNGVDENAVGACAQLVFAPIDESFADDAPLLPSGFRV 603
           LEG A+SPED+ LA RDMYLLQLC+GVDEN VG CAQLVFAPIDESFADDAPLLPSGFR+
Sbjct: 551 LEGHAYSPEDMGLA-RDMYLLQLCSGVDENVVGGCAQLVFAPIDESFADDAPLLPSGFRI 609

Query: 604 IPLDPKT 610
           IPL+ K+
Sbjct: 611 IPLEQKS 609

BLAST of Cp4.1LG19g05650 vs. TAIR10
Match: AT1G30490.1 (AT1G30490.1 Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein)

HSP 1 Score: 989.9 bits (2558), Expect = 7.8e-289
Identity = 498/604 (82.45%), Positives = 547/604 (90.56%), Query Frame = 1

Query: 7   KDSSNQQMDTSKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVW 66
           +DS ++  D+ KYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPIL NIEP+QIKVW
Sbjct: 10  RDSPDKGFDSGKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILCNIEPRQIKVW 69

Query: 67  FQNRRCREKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYENGFMRQQLHSLF 126
           FQNRRCREKQRKES+RLQ+VNRKLSAMNKLLMEENDRLQKQVS+LVYENGFM+ ++H+  
Sbjct: 70  FQNRRCREKQRKESARLQTVNRKLSAMNKLLMEENDRLQKQVSNLVYENGFMKHRIHT-- 129

Query: 127 FSDEQTSGTTTDNSCESVVMSGQPQQQQNPNPQHSNRDANNPAGLLAIAEETLAEFLSKA 186
                 SGTTTDNSCESVV+SGQ +QQQNP  QH  RD NNPA LL+IAEETLAEFL KA
Sbjct: 130 -----ASGTTTDNSCESVVVSGQQRQQQNPTHQHPQRDVNNPANLLSIAEETLAEFLCKA 189

Query: 187 TGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPMKVAEILKDRLSWYRD 246
           TGTAVDWVQMIGMKPGPDSIGIVAVSRNCSG+AARACGLVSLEPMKVAEILKDR SW+RD
Sbjct: 190 TGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGIAARACGLVSLEPMKVAEILKDRPSWFRD 249

Query: 247 CRCLNVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDGSLVVCERSLTT 306
           CRC+  L+VIPTGNGGTIEL+  Q YAPTTLAAARDFWT+RY+TSLEDGS VVCERSLT+
Sbjct: 250 CRCVETLNVIPTGNGGTIELVNTQIYAPTTLAAARDFWTLRYSTSLEDGSYVVCERSLTS 309

Query: 307 SSGGPAGPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVPEVLRPLYESSK 366
           ++GGP GP  S+FVRA+ML SG+LIR C+GGGSIIHIVDH+DLDV SVPEVLRPLYESSK
Sbjct: 310 ATGGPNGPLSSSFVRAKMLSSGFLIRPCDGGGSIIHIVDHVDLDVSSVPEVLRPLYESSK 369

Query: 367 ILAQKMTIAALRHIRQIAQETSGEIQYSGGRQPAVLRTFGQKLCRGFNDAVNGFADDGWS 426
           ILAQKMT+AALRH+RQIAQETSGE+QYSGGRQPAVLRTF Q+LCRGFNDAVNGF DDGWS
Sbjct: 370 ILAQKMTVAALRHVRQIAQETSGEVQYSGGRQPAVLRTFSQRLCRGFNDAVNGFVDDGWS 429

Query: 427 AMSSDGLEDVTIVINSSANKLPGSQYKTSMYPSFG-GVMCAKASMLLQNVPPALLIRFLR 486
            MSSDG ED+TI+INSS+ K  GSQY +S  PSFG GV+CAKASMLLQNVPP +LIRFLR
Sbjct: 430 PMSSDGGEDITIMINSSSAKFAGSQYGSSFLPSFGSGVLCAKASMLLQNVPPLVLIRFLR 489

Query: 487 EHRSEWADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLAHTVEHEEFLEVVRLEG 546
           EHR+EWADYGVDAYSAASL+A+PYAVPC R GGFPS+QVILPLA T+EHEEFLEVVRL G
Sbjct: 490 EHRAEWADYGVDAYSAASLRATPYAVPCVRTGGFPSNQVILPLAQTLEHEEFLEVVRLGG 549

Query: 547 LAFSPEDVALAGRDMYLLQLCNGVDENAVGACAQLVFAPIDESFADDAPLLPSGFRVIPL 606
            A+SPED+ L+ RDMYLLQLC+GVDEN VG CAQLVFAPIDESFADDAPLLPSGFRVIPL
Sbjct: 550 HAYSPEDMGLS-RDMYLLQLCSGVDENVVGGCAQLVFAPIDESFADDAPLLPSGFRVIPL 605

Query: 607 DPKT 610
           D KT
Sbjct: 610 DQKT 605

BLAST of Cp4.1LG19g05650 vs. TAIR10
Match: AT5G60690.1 (AT5G60690.1 Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein)

HSP 1 Score: 850.9 bits (2197), Expect = 5.6e-247
Identity = 443/639 (69.33%), Positives = 511/639 (79.97%), Query Frame = 1

Query: 1   MALVMHK----DSSNQQMDTS-KYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPIL 60
           MA+  H+    DS N+ +D+S KYVRYT EQVEALERVYAECPKPSSLRRQQLIREC IL
Sbjct: 3   MAVANHRERSSDSMNRHLDSSGKYVRYTAEQVEALERVYAECPKPSSLRRQQLIRECSIL 62

Query: 61  SNIEPKQIKVWFQNRRCREKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYEN 120
           +NIEPKQIKVWFQNRRCR+KQRKE+SRLQSVNRKLSAMNKLLMEENDRLQKQVS LV EN
Sbjct: 63  ANIEPKQIKVWFQNRRCRDKQRKEASRLQSVNRKLSAMNKLLMEENDRLQKQVSQLVCEN 122

Query: 121 GFMRQQLHSLFFSDEQTSGTTTDNSCESVVMSGQPQQQQNPNPQHSNRDANNPAGLLAIA 180
           G+M+QQL ++            D SCESVV +          PQHS RDAN+PAGLL+IA
Sbjct: 123 GYMKQQLTTV----------VNDPSCESVVTT----------PQHSLRDANSPAGLLSIA 182

Query: 181 EETLAEFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPMKVAE 240
           EETLAEFLSKATGTAVDWVQM GMKPGPDS+GI A+S+ C+GVAARACGLVSLEPMK+AE
Sbjct: 183 EETLAEFLSKATGTAVDWVQMPGMKPGPDSVGIFAISQRCNGVAARACGLVSLEPMKIAE 242

Query: 241 ILKDRLSWYRDCRCLNVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDG 300
           ILKDR SW+RDCR L V ++ P GNGGTIEL+YMQTYAPTTLA ARDFWT+RYTTSL++G
Sbjct: 243 ILKDRPSWFRDCRSLEVFTMFPAGNGGTIELVYMQTYAPTTLAPARDFWTLRYTTSLDNG 302

Query: 301 SLVVCERSLTTSSGGPAGPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVP 360
           S VVCERSL+ S  GP     S FVRAEML SGYLIR C+GGGSIIHIVDH++L+ WSVP
Sbjct: 303 SFVVCERSLSGSGAGPNAASASQFVRAEMLSSGYLIRPCDGGGSIIHIVDHLNLEAWSVP 362

Query: 361 EVLRPLYESSKILAQKMTIAALRHIRQIAQETSGEIQYSGGRQPAVLRTFGQKLCRGFND 420
           +VLRPLYESSK++AQKMTI+ALR+IRQ+AQE++GE+ Y  GRQPAVLRTF Q+L RGFND
Sbjct: 363 DVLRPLYESSKVVAQKMTISALRYIRQLAQESNGEVVYGLGRQPAVLRTFSQRLSRGFND 422

Query: 421 AVNGFADDGWSAMSSDGLEDVTIVINSS--ANKLPGSQYKTSMYPSFGGVMCAKASMLLQ 480
           AVNGF DDGWS M  DG ED+ + INS+   N +  S          GGV+CAKASMLLQ
Sbjct: 423 AVNGFGDDGWSTMHCDGAEDIIVAINSTKHLNNISNS------LSFLGGVLCAKASMLLQ 482

Query: 481 NVPPALLIRFLREHRSEWADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLAHTVE 540
           NVPPA+LIRFLREHRSEWAD+ VDAYSAA+LKA  +A P  RP  F  SQ+I+PL HT+E
Sbjct: 483 NVPPAVLIRFLREHRSEWADFNVDAYSAATLKAGSFAYPGMRPTRFTGSQIIMPLGHTIE 542

Query: 541 HEEFLEVVRLEGLAFSPEDVALAGRDMYLLQLCNGVDENAVGACAQLVFAPIDESFADDA 600
           HEE LEVVRLEG + + ED A   RD++LLQ+C G+DENAVGAC++L+FAPI+E F DDA
Sbjct: 543 HEEMLEVVRLEGHSLAQED-AFMSRDVHLLQICTGIDENAVGACSELIFAPINEMFPDDA 602

Query: 601 PLLPSGFRVIPLDPKTASAV-FTFANQAGLDMLETTLVG 632
           PL+PSGFRVIP+D KT        AN   LD+  +  VG
Sbjct: 603 PLVPSGFRVIPVDAKTGDVQDLLTANHRTLDLTSSLEVG 614

BLAST of Cp4.1LG19g05650 vs. TAIR10
Match: AT4G32880.1 (AT4G32880.1 homeobox gene 8)

HSP 1 Score: 798.9 bits (2062), Expect = 2.5e-231
Identity = 406/607 (66.89%), Positives = 491/607 (80.89%), Query Frame = 1

Query: 9   SSNQQMDTSKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQ 68
           +++  MD  KYVRYTPEQVEALER+Y +CPKPSS+RRQQLIRECPILSNIEPKQIKVWFQ
Sbjct: 6   NNSHNMDNGKYVRYTPEQVEALERLYNDCPKPSSMRRQQLIRECPILSNIEPKQIKVWFQ 65

Query: 69  NRRCREKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYENGFMRQQLHSLFFS 128
           NRRCREKQRKE+SRLQ+VNRKL+AMNKLLMEENDRLQKQVSHLVYEN + RQ        
Sbjct: 66  NRRCREKQRKEASRLQAVNRKLTAMNKLLMEENDRLQKQVSHLVYENSYFRQH------P 125

Query: 129 DEQTSGTTTDNSCESVVMSGQPQQQQNPNPQHSNRDANNPAGLLAIAEETLAEFLSKATG 188
             Q +  TTD SCESVV SGQ     +  PQH  RDA+ PAGLL+IA+ETL EF+SKATG
Sbjct: 126 QNQGNLATTDTSCESVVTSGQ----HHLTPQHQPRDAS-PAGLLSIADETLTEFISKATG 185

Query: 189 TAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPMKVAEILKDRLSWYRDCR 248
           TAV+WVQM GMKPGPDSIGIVA+S  C+G+AARACGLV L+P +VAEILKD+  W RDCR
Sbjct: 186 TAVEWVQMPGMKPGPDSIGIVAISHGCTGIAARACGLVGLDPTRVAEILKDKPCWLRDCR 245

Query: 249 CLNVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDGSLVVCERSLTTSS 308
            L++++V+ T NGGT+ELIYMQ YAPTTLA ARDFW +RYT+ +EDGSLV+CERSL  + 
Sbjct: 246 SLDIVNVLSTANGGTLELIYMQLYAPTTLAPARDFWMLRYTSVMEDGSLVICERSLNNTQ 305

Query: 309 GGPAGPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVPEVLRPLYESSKIL 368
            GP+ PP   FVRAE+LPSGYLIR CEGGGSI+HIVDH DL+ WSVPEVLR LYESS +L
Sbjct: 306 NGPSMPPSPHFVRAEILPSGYLIRPCEGGGSILHIVDHFDLEPWSVPEVLRSLYESSTLL 365

Query: 369 AQKMTIAALRHIRQIAQETSGEIQYSGGRQPAVLRTFGQKLCRGFNDAVNGFADDGWSAM 428
           AQ+ T+AALR++RQI+QE S       GR+PA LR   Q+L +GFN+AVNGF+D+GWS +
Sbjct: 366 AQRTTMAALRYLRQISQEISQPNVTGWGRRPAALRALSQRLSKGFNEAVNGFSDEGWSIL 425

Query: 429 SSDGLEDVTIVINSSANK------LPGSQYKTSMYPSFGGVMCAKASMLLQNVPPALLIR 488
            SDG++DVT+++NSS  K      LP +   TSM PS   V+CAKASMLLQNVPP++L+R
Sbjct: 426 ESDGIDDVTLLVNSSPTKMMMTSSLPFANGYTSM-PS--AVLCAKASMLLQNVPPSILLR 485

Query: 489 FLREHRSEWADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLAHTVEHEEFLEVVR 548
           FLREHR EWAD  +DAYSAA++KA P ++P  RPG F   QVILPLAHT+E+EEF+EV++
Sbjct: 486 FLREHRQEWADNSIDAYSAAAIKAGPCSLPIPRPGSF-GGQVILPLAHTIENEEFMEVIK 545

Query: 549 LEGLAFSPEDVALAGRDMYLLQLCNGVDENAVGACAQLVFAPIDESFADDAPLLPSGFRV 608
           LE L    ED+ +   D++LLQ+C+GVDENAV +CA+L+FAPID SF+DDAP++PSGFR+
Sbjct: 546 LESLGHYQEDMMMPA-DIFLLQMCSGVDENAVESCAELIFAPIDASFSDDAPIIPSGFRI 596

Query: 609 IPLDPKT 610
           IPLD K+
Sbjct: 606 IPLDSKS 596

BLAST of Cp4.1LG19g05650 vs. TAIR10
Match: AT1G52150.2 (AT1G52150.2 Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein)

HSP 1 Score: 787.7 bits (2033), Expect = 5.9e-228
Identity = 415/604 (68.71%), Positives = 484/604 (80.13%), Query Frame = 1

Query: 7   KDSSNQQMDTSKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVW 66
           KD     +D  KYVRYTPEQVEALER+Y +CPKPSS+RRQQLIRECPILSNIEPKQIKVW
Sbjct: 6   KDGKLGCLDNGKYVRYTPEQVEALERLYHDCPKPSSIRRQQLIRECPILSNIEPKQIKVW 65

Query: 67  FQNRRCREKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYENGFMRQQLHSLF 126
           FQNRRCREKQRKE+SRLQ+VNRKL+AMNKLLMEENDRLQKQVS LV+EN + RQ   +  
Sbjct: 66  FQNRRCREKQRKEASRLQAVNRKLTAMNKLLMEENDRLQKQVSQLVHENSYFRQHTPN-- 125

Query: 127 FSDEQTSGTTTDNSCESVVMSGQPQQQQNPNPQHSNRDANNPAGLLAIAEETLAEFLSKA 186
                 S    D SCESVV SGQ  Q  + NPQ   RDA+ PAGLL+IAEETLAEFLSKA
Sbjct: 126 -----PSLPAKDTSCESVVTSGQ-HQLASQNPQ---RDAS-PAGLLSIAEETLAEFLSKA 185

Query: 187 TGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPMKVAEILKDRLSWYRD 246
           TGTAV+WVQM GMKPGPDSIGI+A+S  C+GVAARACGLV LEP +VAEI+KDR SW+R+
Sbjct: 186 TGTAVEWVQMPGMKPGPDSIGIIAISHGCTGVAARACGLVGLEPTRVAEIVKDRPSWFRE 245

Query: 247 CRCLNVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDGSLVVCERSLTT 306
           CR + V++V+PT NGGT+EL+YMQ YAPTTLA  RDFW +RYT+ LEDGSLVVCERSL +
Sbjct: 246 CRAVEVMNVLPTANGGTVELLYMQLYAPTTLAPPRDFWLLRYTSVLEDGSLVVCERSLKS 305

Query: 307 SSGGPAGPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVPEVLRPLYESSK 366
           +  GP+ P    FVRAEML SGYLIR C+GGGSIIHIVDH+DL+  SVPEVLRPLYES K
Sbjct: 306 TQNGPSMPLVQNFVRAEMLSSGYLIRPCDGGGSIIHIVDHMDLEACSVPEVLRPLYESPK 365

Query: 367 ILAQKMTIAALRHIRQIAQE-TSGEIQYSG-GRQPAVLRTFGQKLCRGFNDAVNGFADDG 426
           +LAQK T+AALR ++QIAQE T      +G GR+PA LR   Q+L RGFN+AVNGF D+G
Sbjct: 366 VLAQKTTMAALRQLKQIAQEVTQTNSSVNGWGRRPAALRALSQRLSRGFNEAVNGFTDEG 425

Query: 427 WSAMSSDGLEDVTIVINSSANKLPGSQ--YKTSMYPSFGGVMCAKASMLLQNVPPALLIR 486
           WS +  D ++DVTI +NSS +KL G    +     P    V+CAKASMLLQNVPPA+L+R
Sbjct: 426 WSVI-GDSMDDVTITVNSSPDKLMGLNLTFANGFAPVSNVVLCAKASMLLQNVPPAILLR 485

Query: 487 FLREHRSEWADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLAHTVEHEEFLEVVR 546
           FLREHRSEWAD  +DAY AA++K  P +   AR GGF   QVILPLAHT+EHEEF+EV++
Sbjct: 486 FLREHRSEWADNNIDAYLAAAVKVGPCS---ARVGGF-GGQVILPLAHTIEHEEFMEVIK 545

Query: 547 LEGLAFSPEDVALAGRDMYLLQLCNGVDENAVGACAQLVFAPIDESFADDAPLLPSGFRV 606
           LEGL  SPED A+  RD++LLQLC+G+DENAVG CA+L+FAPID SFADDAPLLPSGFR+
Sbjct: 546 LEGLGHSPED-AIVPRDIFLLQLCSGMDENAVGTCAELIFAPIDASFADDAPLLPSGFRI 591

BLAST of Cp4.1LG19g05650 vs. NCBI nr
Match: gi|778721560|ref|XP_011658319.1| (PREDICTED: homeobox-leucine zipper protein ATHB-14-like [Cucumis sativus])

HSP 1 Score: 1135.6 bits (2936), Expect = 0.0e+00
Identity = 574/610 (94.10%), Positives = 590/610 (96.72%), Query Frame = 1

Query: 1   MALVMHKDSSNQQMDTSKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILSNIEP 60
           MALV+HKD+SN+QMD+SKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILSNIEP
Sbjct: 1   MALVIHKDTSNKQMDSSKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILSNIEP 60

Query: 61  KQIKVWFQNRRCREKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYENGFMRQ 120
           KQIKVWFQNRRCREKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYENGFMRQ
Sbjct: 61  KQIKVWFQNRRCREKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYENGFMRQ 120

Query: 121 QLHSLFFSDEQTSGTTTDNSCESVVMSGQPQQQQNPNPQHSNRDANNPAGLLAIAEETLA 180
           QLHS        SGTTTDNSCESVVMSGQPQQQQNPNPQH NRD NNPAGLLA+AEETLA
Sbjct: 121 QLHS-------ASGTTTDNSCESVVMSGQPQQQQNPNPQHPNRDVNNPAGLLAVAEETLA 180

Query: 181 EFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPMKVAEILKDR 240
           EFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEP KVAEILKDR
Sbjct: 181 EFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPTKVAEILKDR 240

Query: 241 LSWYRDCRCLNVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDGSLVVC 300
           LSWYRDCRCLNVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDGSLVVC
Sbjct: 241 LSWYRDCRCLNVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDGSLVVC 300

Query: 301 ERSLTTSSGGPAGPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVPEVLRP 360
           ERSL++SSGGPAGPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVPEVLRP
Sbjct: 301 ERSLSSSSGGPAGPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVPEVLRP 360

Query: 361 LYESSKILAQKMTIAALRHIRQIAQETSGEIQYSGGRQPAVLRTFGQKLCRGFNDAVNGF 420
           LYESSKILAQK+TIAALRHIRQIAQET+GEIQ +GGRQPAVLRTF QKLCRGFNDAVNGF
Sbjct: 361 LYESSKILAQKITIAALRHIRQIAQETNGEIQCTGGRQPAVLRTFSQKLCRGFNDAVNGF 420

Query: 421 ADDGWSAMSSDGLEDVTIVINSSANKLPGSQYKTSMYPSF-GGVMCAKASMLLQNVPPAL 480
           ADDGWS M SDG+EDVTI+IN+SANK  GSQY TS+YPSF GGVMCAKASMLLQNVPPAL
Sbjct: 421 ADDGWSPMGSDGVEDVTILINTSANKFSGSQYNTSLYPSFGGGVMCAKASMLLQNVPPAL 480

Query: 481 LIRFLREHRSEWADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLAHTVEHEEFLE 540
           L+RFLREHRSEWADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLA TVEHEEFLE
Sbjct: 481 LVRFLREHRSEWADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLATTVEHEEFLE 540

Query: 541 VVRLEGLAFSPEDVALAGRDMYLLQLCNGVDENAVGACAQLVFAPIDESFADDAPLLPSG 600
           VVRLEGLAFSPEDVALAGRDMYLLQLC+GVDENAVGACAQLVFAPIDESFADDAPLLPSG
Sbjct: 541 VVRLEGLAFSPEDVALAGRDMYLLQLCSGVDENAVGACAQLVFAPIDESFADDAPLLPSG 600

Query: 601 FRVIPLDPKT 610
           FRVIPLDPKT
Sbjct: 601 FRVIPLDPKT 603

BLAST of Cp4.1LG19g05650 vs. NCBI nr
Match: gi|659078112|ref|XP_008439554.1| (PREDICTED: homeobox-leucine zipper protein ATHB-14-like [Cucumis melo])

HSP 1 Score: 1135.6 bits (2936), Expect = 0.0e+00
Identity = 574/610 (94.10%), Positives = 590/610 (96.72%), Query Frame = 1

Query: 1   MALVMHKDSSNQQMDTSKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILSNIEP 60
           MALV+HKD+SN+QMD+SKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILSNIEP
Sbjct: 1   MALVIHKDTSNKQMDSSKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILSNIEP 60

Query: 61  KQIKVWFQNRRCREKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYENGFMRQ 120
           KQIKVWFQNRRCREKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYENGFMRQ
Sbjct: 61  KQIKVWFQNRRCREKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYENGFMRQ 120

Query: 121 QLHSLFFSDEQTSGTTTDNSCESVVMSGQPQQQQNPNPQHSNRDANNPAGLLAIAEETLA 180
           QLHS        SGTTTDNSCESVVMSGQPQQQQNPNPQH NRD NNPAGLLA+AEETLA
Sbjct: 121 QLHS-------ASGTTTDNSCESVVMSGQPQQQQNPNPQHPNRDVNNPAGLLAVAEETLA 180

Query: 181 EFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPMKVAEILKDR 240
           EFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEP KVAEILKDR
Sbjct: 181 EFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPTKVAEILKDR 240

Query: 241 LSWYRDCRCLNVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDGSLVVC 300
           LSWYRDCRCLNVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDGSLVVC
Sbjct: 241 LSWYRDCRCLNVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDGSLVVC 300

Query: 301 ERSLTTSSGGPAGPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVPEVLRP 360
           ERSL++SSGGPAGPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVPEVLRP
Sbjct: 301 ERSLSSSSGGPAGPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVPEVLRP 360

Query: 361 LYESSKILAQKMTIAALRHIRQIAQETSGEIQYSGGRQPAVLRTFGQKLCRGFNDAVNGF 420
           LYESSKILAQK+TIAALRHIRQIAQET+GEIQ +GGRQPAVLRTF QKLCRGFNDAVNGF
Sbjct: 361 LYESSKILAQKITIAALRHIRQIAQETNGEIQCTGGRQPAVLRTFSQKLCRGFNDAVNGF 420

Query: 421 ADDGWSAMSSDGLEDVTIVINSSANKLPGSQYKTSMYPSF-GGVMCAKASMLLQNVPPAL 480
           ADDGWS M SDG+EDVTI+IN+SANK  GSQY TS+YPSF GGVMCAKASMLLQNVPPAL
Sbjct: 421 ADDGWSPMGSDGVEDVTILINTSANKFSGSQYNTSLYPSFGGGVMCAKASMLLQNVPPAL 480

Query: 481 LIRFLREHRSEWADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLAHTVEHEEFLE 540
           L+RFLREHRSEWADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLA TVEHEEFLE
Sbjct: 481 LVRFLREHRSEWADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLATTVEHEEFLE 540

Query: 541 VVRLEGLAFSPEDVALAGRDMYLLQLCNGVDENAVGACAQLVFAPIDESFADDAPLLPSG 600
           VVRLEGLAFSPEDVALAGRDMYLLQLC+GVDENAVGACAQLVFAPIDESFADDAPLLPSG
Sbjct: 541 VVRLEGLAFSPEDVALAGRDMYLLQLCSGVDENAVGACAQLVFAPIDESFADDAPLLPSG 600

Query: 601 FRVIPLDPKT 610
           FRVIPLDPKT
Sbjct: 601 FRVIPLDPKT 603

BLAST of Cp4.1LG19g05650 vs. NCBI nr
Match: gi|703106720|ref|XP_010098570.1| (Homeobox-leucine zipper protein HOX32 [Morus notabilis])

HSP 1 Score: 1080.1 bits (2792), Expect = 0.0e+00
Identity = 549/634 (86.59%), Positives = 582/634 (91.80%), Query Frame = 1

Query: 1   MALVMHKDSSN---QQMDTSKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILSN 60
           MALV+HKD+SN   +QMD+SKYVRYTPEQVEALERVY+ECPKPSSLRRQQLIRECPILSN
Sbjct: 1   MALVIHKDNSNNINKQMDSSKYVRYTPEQVEALERVYSECPKPSSLRRQQLIRECPILSN 60

Query: 61  IEPKQIKVWFQNRRCREKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYENGF 120
           IEPKQIKVWFQNRRCREKQRKE+SRLQ+VNRKLSAMNKLLMEENDRLQKQVSHLVYENG+
Sbjct: 61  IEPKQIKVWFQNRRCREKQRKEASRLQTVNRKLSAMNKLLMEENDRLQKQVSHLVYENGY 120

Query: 121 MRQQLHSLFFSDEQTSGTTTDNSCESVVMSGQPQQQQNPNPQHSNRDANNPAGLLAIAEE 180
           MRQQLH+        SG TTDNSCESVVMSGQ QQQQNP PQH  RDANNPAGLLAIAEE
Sbjct: 121 MRQQLHT-------ASGATTDNSCESVVMSGQNQQQQNPTPQHPQRDANNPAGLLAIAEE 180

Query: 181 TLAEFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPMKVAEIL 240
           TLAEFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEP KVAEIL
Sbjct: 181 TLAEFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPAKVAEIL 240

Query: 241 KDRLSWYRDCRCLNVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDGSL 300
           KDR SW+RDCRC++VLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWT+RYTTSLED SL
Sbjct: 241 KDRPSWFRDCRCVDVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTLRYTTSLEDNSL 300

Query: 301 VVCERSLTTSSGGPAGPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVPEV 360
           V+CERSLTTS+GGP GPP S FVRAEMLPSGYLIR CEGGGSII+IVDH+DLD WSVPEV
Sbjct: 301 VICERSLTTSTGGPTGPPSSCFVRAEMLPSGYLIRPCEGGGSIINIVDHVDLDAWSVPEV 360

Query: 361 LRPLYESSKILAQKMTIAALRHIRQIAQETSGEIQYSGGRQPAVLRTFGQKLCRGFNDAV 420
           LRPLYESSKILAQKMT+AALRHIRQIAQETSGEIQY GGRQPAVLRTF Q+LCRGFNDAV
Sbjct: 361 LRPLYESSKILAQKMTVAALRHIRQIAQETSGEIQYGGGRQPAVLRTFSQRLCRGFNDAV 420

Query: 421 NGFADDGWSAMSSDGLEDVTIVINSSANKLPGSQYKTSMYPSFGGVMCAKASMLLQNVPP 480
           NGF DDGWS + SDG EDVTIVINSS+NK  GSQY  SM+P+FGGV+CAKASMLLQNVPP
Sbjct: 421 NGFVDDGWSLLGSDGAEDVTIVINSSSNKFLGSQYNASMFPTFGGVLCAKASMLLQNVPP 480

Query: 481 ALLIRFLREHRSEWADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLAHTVEHEEF 540
           ALL+RFLREHRSEWADYGVDAYSA+ LKASPYA+PCARPGGFPSSQVILPLAHTVEHEEF
Sbjct: 481 ALLVRFLREHRSEWADYGVDAYSASCLKASPYAIPCARPGGFPSSQVILPLAHTVEHEEF 540

Query: 541 LEVVRLEGLAFSPEDVALAGRDMYLLQLCNGVDENAVGACAQLVFAPIDESFADDAPLLP 600
           LEVVRLEG AFSPE+VALA RDMYLLQLC+GVDE+AVGACAQLVFAPIDESFAD+APLLP
Sbjct: 541 LEVVRLEGHAFSPEEVALA-RDMYLLQLCSGVDESAVGACAQLVFAPIDESFADEAPLLP 600

Query: 601 SGFRVIPLDPKTASAVFTFANQAGLDMLETTLVG 632
           SGFRVIPLDPK  +   T      LD+  T  VG
Sbjct: 601 SGFRVIPLDPKADTPAAT----RTLDLASTLEVG 622

BLAST of Cp4.1LG19g05650 vs. NCBI nr
Match: gi|590614211|ref|XP_007022874.1| (Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein isoform 5 [Theobroma cacao])

HSP 1 Score: 1075.5 bits (2780), Expect = 0.0e+00
Identity = 550/615 (89.43%), Positives = 576/615 (93.66%), Query Frame = 1

Query: 1   MALVMHKDSSN-QQMDTSKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILSNIE 60
           MAL MHKDSSN +QMD+SKYVRYTPEQVEALERVY+ECPKPSSLRRQQLIRECPILSNIE
Sbjct: 1   MALSMHKDSSNNKQMDSSKYVRYTPEQVEALERVYSECPKPSSLRRQQLIRECPILSNIE 60

Query: 61  PKQIKVWFQNRRCREKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYENGFMR 120
           PKQIKVWFQNRRCREKQRKE+SRLQ+VNRKLSAMNKLLMEENDRLQKQVS LVYENG+MR
Sbjct: 61  PKQIKVWFQNRRCREKQRKEASRLQTVNRKLSAMNKLLMEENDRLQKQVSQLVYENGYMR 120

Query: 121 QQLHSLFFSDEQTSGTTTD-NSCESVVMSGQPQQQQNPNPQHSNRDANNPAGLLAIAEET 180
           QQL +        S TTTD NSCESVVMSGQ QQQQNP PQH  RDAN+PAGLLAIAEET
Sbjct: 121 QQLQT-------GSATTTDNNSCESVVMSGQHQQQQNPTPQHPQRDANSPAGLLAIAEET 180

Query: 181 LAEFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPMKVAEILK 240
           LAEFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEP KVAEILK
Sbjct: 181 LAEFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPTKVAEILK 240

Query: 241 DRLSWYRDCRCLNVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDGSLV 300
           DR SW+RDCRCL+VLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWT+RYTTSLEDGSLV
Sbjct: 241 DRPSWFRDCRCLDVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTLRYTTSLEDGSLV 300

Query: 301 VCERSLTTSSGGPAGPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVPEVL 360
           +CERSLT+S+GGP GPP S+FVRAEMLPSG+LIR CEGGGSIIHIVDH+DLDVWSVPEVL
Sbjct: 301 ICERSLTSSTGGPTGPPTSSFVRAEMLPSGFLIRPCEGGGSIIHIVDHVDLDVWSVPEVL 360

Query: 361 RPLYESSKILAQKMTIAALRHIRQIAQETSGEIQYSGGRQPAVLRTFGQKLCRGFNDAVN 420
           RPLYESSKILAQKMTIAALRHIRQIAQETSGEIQY GGRQPAVLRTF Q+LCRGFNDAVN
Sbjct: 361 RPLYESSKILAQKMTIAALRHIRQIAQETSGEIQYGGGRQPAVLRTFSQRLCRGFNDAVN 420

Query: 421 GFADDGWSAMSSDGLEDVTIVINSSANKLPGSQYKTSMYPSF-GGVMCAKASMLLQNVPP 480
           GFADDGWS M SDG+EDVTI+INSS  K  GSQY TSM+PSF GGV+CAKASMLLQNVPP
Sbjct: 421 GFADDGWSLMGSDGVEDVTIMINSSPGKFLGSQYNTSMFPSFGGGVLCAKASMLLQNVPP 480

Query: 481 ALLIRFLREHRSEWADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLAHTVEHEEF 540
           ALL+RFLREHRSEWADYGVD YSAA LKASPYAVPCARPGGFPSSQVILPLAHTVEHEEF
Sbjct: 481 ALLVRFLREHRSEWADYGVDTYSAACLKASPYAVPCARPGGFPSSQVILPLAHTVEHEEF 540

Query: 541 LEVVRLEGLAFSPEDVALAGRDMYLLQLCNGVDENAVGACAQLVFAPIDESFADDAPLLP 600
           LEVVRLEG AF+PEDVALA RDMYLLQLC+G+DENAVGACAQLVFAPIDESFADDAPLLP
Sbjct: 541 LEVVRLEGHAFTPEDVALA-RDMYLLQLCSGIDENAVGACAQLVFAPIDESFADDAPLLP 600

Query: 601 SGFRVIPLDPKTASA 613
           SGFRVIPLDPKT  A
Sbjct: 601 SGFRVIPLDPKTDGA 607

BLAST of Cp4.1LG19g05650 vs. NCBI nr
Match: gi|1009105636|ref|XP_015882797.1| (PREDICTED: homeobox-leucine zipper protein ATHB-14-like isoform X1 [Ziziphus jujuba])

HSP 1 Score: 1075.1 bits (2779), Expect = 0.0e+00
Identity = 549/636 (86.32%), Positives = 579/636 (91.04%), Query Frame = 1

Query: 1   MALVMHKDSSN-----QQMDTSKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPIL 60
           MALVMHKDSS+     QQMD+SKYVRYTPEQVEALERVY+ECPKPSSLRRQQLIRECPIL
Sbjct: 1   MALVMHKDSSSNSNKQQQMDSSKYVRYTPEQVEALERVYSECPKPSSLRRQQLIRECPIL 60

Query: 61  SNIEPKQIKVWFQNRRCREKQRKESSRLQSVNRKLSAMNKLLMEENDRLQKQVSHLVYEN 120
           SNIEPKQIKVWFQNRRCREKQRKE+SRLQ+VNRKLSAMNKLLMEENDRLQKQVSHLVYEN
Sbjct: 61  SNIEPKQIKVWFQNRRCREKQRKEASRLQTVNRKLSAMNKLLMEENDRLQKQVSHLVYEN 120

Query: 121 GFMRQQLHSLFFSDEQTSGTTTDNSCESVVMSGQPQQQQNPNPQHSNRDANNPAGLLAIA 180
           G+MRQQL        Q SGTTTDNSCESVVM+GQ QQQQNP PQ   RDANNPAGLLAIA
Sbjct: 121 GYMRQQL--------QASGTTTDNSCESVVMNGQHQQQQNPTPQQPQRDANNPAGLLAIA 180

Query: 181 EETLAEFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARACGLVSLEPMKVAE 240
           EETLAEFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAAR CGLVSLEP KVAE
Sbjct: 181 EETLAEFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGVAARVCGLVSLEPTKVAE 240

Query: 241 ILKDRLSWYRDCRCLNVLSVIPTGNGGTIELIYMQTYAPTTLAAARDFWTMRYTTSLEDG 300
           ILKDR SWYRDCRC++VLSVIPT N GTIELIYMQTYAPTTLAAARDFWT+RYTTSLEDG
Sbjct: 241 ILKDRPSWYRDCRCIDVLSVIPTANAGTIELIYMQTYAPTTLAAARDFWTLRYTTSLEDG 300

Query: 301 SLVVCERSLTTSSGGPAGPPPSTFVRAEMLPSGYLIRACEGGGSIIHIVDHIDLDVWSVP 360
           SLV+CERSLT+S+GGP GPPPS+FVRAEMLPSG+LIR CEGGGSIIHIVDHIDLD WSVP
Sbjct: 301 SLVICERSLTSSTGGPTGPPPSSFVRAEMLPSGFLIRPCEGGGSIIHIVDHIDLDAWSVP 360

Query: 361 EVLRPLYESSKILAQKMTIAALRHIRQIAQETSGEIQYSGGRQPAVLRTFGQKLCRGFND 420
           EVLRPLYESSKILAQKMTIAALRHIRQIAQETSGEIQ  G RQPAVLRTF Q+LCRGFND
Sbjct: 361 EVLRPLYESSKILAQKMTIAALRHIRQIAQETSGEIQCGGSRQPAVLRTFSQRLCRGFND 420

Query: 421 AVNGFADDGWSAMSSDGLEDVTIVINSSANKLPGSQYKTSMYPSFGGVMCAKASMLLQNV 480
           AVNGFADDGWS M SDG+EDVTIVINSS NK  GSQ+ TS++P+FGGV+CAKASMLLQNV
Sbjct: 421 AVNGFADDGWSLMGSDGVEDVTIVINSSPNKFLGSQHNTSIFPTFGGVLCAKASMLLQNV 480

Query: 481 PPALLIRFLREHRSEWADYGVDAYSAASLKASPYAVPCARPGGFPSSQVILPLAHTVEHE 540
           PPALL+RFLREHRSEWADYGVDAYSAA LKASPYAVPCARPGGFP SQVILPLAHTVEHE
Sbjct: 481 PPALLVRFLREHRSEWADYGVDAYSAACLKASPYAVPCARPGGFPGSQVILPLAHTVEHE 540

Query: 541 EFLEVVRLEGLAFSPEDVALAGRDMYLLQLCNGVDENAVGACAQLVFAPIDESFADDAPL 600
           EF+EVVRLEG  FSPEDVALA RDMY+LQLC+G+DENAVGACAQLVFAPIDESFAD+APL
Sbjct: 541 EFMEVVRLEGHTFSPEDVALA-RDMYILQLCSGIDENAVGACAQLVFAPIDESFADEAPL 600

Query: 601 LPSGFRVIPLDPKTASAVFTFANQAGLDMLETTLVG 632
           LPSGFRVIPLDPK+     T      LD+  T  VG
Sbjct: 601 LPSGFRVIPLDPKSDGPAAT----RTLDLASTLEVG 623
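
Each HSP summary line above reports identities and positives as a count over the alignment length, followed by a percentage. A minimal Python sketch for reading one such line and recomputing the percentages is given below. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, chosen for illustration only; the pattern accepts both the "Positives" and "Postives" spellings.

```python
import re

# Pattern for the summary line of one HSP, e.g.
# "Identity = 549/614 (89.41%), Positives = 576/614 (93.81%), Query Frame = 1"
# "Posi?tives" also matches the misspelling "Postives".
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and percentages from an HSP summary line, or None."""
    match = HSP_STATS.search(line)
    if match is None:
        return None
    identities, length, identity_pct, positives, _, positive_pct = match.groups()
    identities, length, positives = int(identities), int(length), int(positives)
    return {
        "identities": identities,
        "positives": positives,
        "alignment_length": length,
        # Percentages as printed in the record.
        "identity_pct_reported": float(identity_pct),
        "positive_pct_reported": float(positive_pct),
        # Percentages recomputed from the counts (count / alignment length).
        "identity_pct": round(100.0 * identities / length, 2),
        "positive_pct": round(100.0 * positives / length, 2),
    }


if __name__ == "__main__":
    example = "Identity = 549/636 (86.32%), Positives = 579/636 (91.04%), Query Frame = 1"
    print(parse_hsp_stats(example))
```

Comparing the recomputed and reported percentages is a quick consistency check when these records are copied or re-exported.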

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
ATB14_ARATH	6.0e-291	82.04	Homeobox-leucine zipper protein ATHB-14 OS=Arabidopsis thaliana GN=ATHB-14 PE=1 ...
ATBH9_ARATH	1.4e-287	82.45	Homeobox-leucine zipper protein ATHB-9 OS=Arabidopsis thaliana GN=ATHB-9 PE=1 SV...
HOX32_ORYSJ	2.4e-279	80.19	Homeobox-leucine zipper protein HOX32 OS=Oryza sativa subsp. japonica GN=HOX32 P...
HOX32_ORYSI	2.4e-279	80.19	Homeobox-leucine zipper protein HOX32 OS=Oryza sativa subsp. indica GN=HOX32 PE=...
HOX33_ORYSI	1.1e-273	80.33	Homeobox-leucine zipper protein HOX33 OS=Oryza sativa subsp. indica GN=HOX33 PE=...

Match Name        E-value  Identity  Description
A0A0A0KM58_CUCSA  0.0e+00  94.10     Uncharacterized protein OS=Cucumis sativus GN=Csa_6G525430 PE=4 SV=1
W9RI95_9ROSA      0.0e+00  86.59     Homeobox-leucine zipper protein HOX32 OS=Morus notabilis GN=L484_026012 PE=4 SV=...
A0A061G7E3_THECC  0.0e+00  89.43     Homeobox-leucine zipper family protein / lipid-binding START domain-containing p...
A0A061GEN1_THECC  0.0e+00  89.41     Homeobox-leucine zipper family protein / lipid-binding START domain-containing p...
A0A061G8K4_THECC  0.0e+00  89.41     Homeobox-leucine zipper family protein / lipid-binding START domain-containing p...

Match Name   E-value   Identity  Description
AT2G34710.1  3.4e-292  82.04     Homeobox-leucine zipper family protein / lipid-binding START domain-...
AT1G30490.1  7.8e-289  82.45     Homeobox-leucine zipper family protein / lipid-binding START domain-...
AT5G60690.1  5.6e-247  69.33     Homeobox-leucine zipper family protein / lipid-binding START domain-...
AT4G32880.1  2.5e-231  66.89     homeobox gene 8
AT1G52150.2  5.9e-228  68.71     Homeobox-leucine zipper family protein / lipid-binding START domain-...

Match Name                         E-value  Identity  Description
gi|778721560|ref|XP_011658319.1|   0.0e+00  94.10     PREDICTED: homeobox-leucine zipper protein ATHB-14-like [Cucumis sativus]
gi|659078112|ref|XP_008439554.1|   0.0e+00  94.10     PREDICTED: homeobox-leucine zipper protein ATHB-14-like [Cucumis melo]
gi|703106720|ref|XP_010098570.1|   0.0e+00  86.59     Homeobox-leucine zipper protein HOX32 [Morus notabilis]
gi|590614211|ref|XP_007022874.1|   0.0e+00  89.43     Homeobox-leucine zipper family protein / lipid-binding START domain-containing p...
gi|1009105636|ref|XP_015882797.1|  0.0e+00  86.32     PREDICTED: homeobox-leucine zipper protein ATHB-14-like isoform X1 [Ziziphus juj...

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0008289  lipid binding
GO:0003677  DNA binding
Vocabulary: INTERPRO
Term       Definition
IPR023393  START-like_dom_sf
IPR013978  MEKHLA
IPR009057  Homeobox-like_sf
IPR002913  START_lipid-bd_dom
IPR001356  Homeobox_dom
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0006355      regulation of transcription, DNA-templated
cellular_component  GO:0005634      nucleus
cellular_component  GO:0005667      transcription factor complex
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0008289      lipid binding
molecular_function  GO:0043565      sequence-specific DNA binding
molecular_function  GO:0003700      transcription factor activity, sequence-specific DNA binding
molecular_function  GO:0003677      DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG19g05650.1  Cp4.1LG19g05650.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description    Source   Source Term        Source Description                               Alignment
IPR001356  Homeobox domain    PFAM     PF00046            Homeobox                                         coord: 18..76; score: 2.8
IPR001356  Homeobox domain    SMART    SM00389            HOX_1                                            coord: 15..81; score: 7.5
IPR001356  Homeobox domain    PROFILE  PS50071            HOMEOBOX_2                                       coord: 13..77; score: 1
IPR002913  START domain       PFAM     PF01852            START                                            coord: 174..381; score: 2.0
IPR002913  START domain       SMART    SM00234            START_1                                          coord: 173..383; score: 2.2
IPR002913  START domain       PROFILE  PS50848            START                                            coord: 164..392; score: 24
IPR009057  Homeodomain-like   GENE3D   G3DSA:1.10.10.60   -                                                coord: 20..76; score: 1.2
IPR009057  Homeodomain-like   unknown  SSF46689           Homeodomain-like                                 coord: 18..79; score: 1.71
IPR013978  MEKHLA             PFAM     PF08670            MEKHLA                                           coord: 610..706; score: 2.3
IPR023393  START-like domain  GENE3D   G3DSA:3.30.530.20  -                                                coord: 172..378; score: 2.0
None       No IPR available   unknown  Coil               Coil                                             coord: 83..107; scor
None       No IPR available   PANTHER  PTHR24326          FAMILY NOT NAMED                                 coord: 9..107; score: 3.2E-159; coord: 562..706; score: 3.2E
None       No IPR available   PANTHER  PTHR24326:SF191    HOMEOBOX-LEUCINE ZIPPER PROTEIN ATHB-14-RELATED  coord: 9..107; score: 3.2E-159; coord: 562..706; score: 3.2E
None       No IPR available   unknown  SSF55961           Bet v1-like                                      coord: 174..385; score: 6.04
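
Several sources report slightly different coordinates for the same InterPro entry (for example, three hits for IPR001356). One simple way to summarize them is to take the outermost start and end per entry. The sketch below assumes that summary is acceptable for a quick overview; it is not how InterPro itself integrates signatures. The function name `outer_spans` is hypothetical, and the hits are a subset of the `coord` values listed above.

```python
def outer_spans(hits):
    """Map each IPR term to the (min start, max end) over all of its hits."""
    spans = {}
    for term, start, end in hits:
        if term in spans:
            lo, hi = spans[term]
            spans[term] = (min(lo, start), max(hi, end))
        else:
            spans[term] = (start, end)
    return spans


# (IPR term, start, end) copied from the PFAM, SMART and PROFILE rows above.
hits = [
    ("IPR001356", 18, 76),
    ("IPR001356", 15, 81),
    ("IPR001356", 13, 77),
    ("IPR002913", 174, 381),
    ("IPR002913", 173, 383),
    ("IPR002913", 164, 392),
    ("IPR013978", 610, 706),
]

spans = outer_spans(hits)
for term, (start, end) in sorted(spans.items(), key=lambda kv: kv[1]):
    print(f"{term}: {start}..{end}")
```

Sorted by start position, this lists the homeobox domain first, then the START domain, then MEKHLA, matching the N-to-C-terminal order of the coordinates in the table.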

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG19g05650  Cp4.1LG10g06480  Cucurbita pepo (Zucchini)  cpecpeB083
Cp4.1LG19g05650  Cp4.1LG20g05670  Cucurbita pepo (Zucchini)  cpecpeB412
The following block(s) are covering this gene:
Gene             Organism                   Block
Cp4.1LG19g05650  Cucurbita pepo (Zucchini)  cpecpeB043
Cp4.1LG19g05650  Cucurbita maxima (Rimu)    cmacpeB437
Cp4.1LG19g05650  Cucurbita moschata (Rifu)  cmocpeB402
Cp4.1LG19g05650  Silver-seed gourd          carcpeB0593
Cp4.1LG19g05650  Silver-seed gourd          carcpeB1029