Cp4.1LG15g01980 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG15g01980
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionHomeobox-leucine zipper family protein
LocationCp4.1LG15 : 1602997 .. 1610065 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TGTGAGGCCTCAAGATTTACCAAACAAACAAAGCATAAAGCTCTCTCCTTTTTGCAAGATGTGAGGTTTATAGAGAGAGAGAGACAACAAAATTTGTCTAAGTTTTCCATTTGTGGGTCTGAATTTTAACACAACCTTCCTCCTCCTCCTCCTCTTCTTCTTCATCCCAACCTCTTTCTCCGACCACCGCCGCAATCTCCGCCGCCAACCACCGCCAAAACCAAGAACAATCTTCATTTGGTGGAATTTCCTTGGTGGGTTAGAAAAAACACTAAAGAAAACAGCTAAAGTTGGTAGCTTTATTACAATATCTTTGCATGCCAAGCTTCTAAACACAAAACTCCCTCTTTTCCCAACAGATCTTTGAATCAATTATGACCAAAACGTTCTAATAATCTCATTTTCAGGTCAGTGTGGTGTAACTCCAAGAATGTATGGAGATTGCCAAGTGATATCAAGCAATATGGGAACAAACATAAACCCTAATTTCAATTTCATCTCCAATTTCCAACACTTTTCTTCCATATTGCCTGTAAGTTCTTCATTTTTTATCCTTTGTTCTTCATTTTTTTGGTTGAAATTATTATGGGTTTTGAATGTTTGCAGAAGGAGGAAAATGGGGTATTGAGAGGGAAAGAAGATATGGAAAGTGGGTCTGGAAGTGAGCAGCTTGTTGAAGAACATCAAGGAATAATTGAAATGGATAGCAATAATGATAATGTTCTTCACCAAAATTTGAAGAAGAAACGCTATCATAGGCATACAGCTCGTCAGATCCAAGAAATGGAGGCGTAAATTTCTTGTTTTTTTTTGTGTTTTTTTGTTTGAAGATTGATGTTTCAAAAGGGTTTGATTTTGTTTGTTTGGTTTTGGTAGTTTGTTTAAGGAATGTCCACACCCTGATGACAAGCAGAGGCTTAAACTTAGCCAAGAACTCGGACTCAAACCTCGTCAAGTTAAGTTTTGGTTCCAAAATCGAAGAACTCAGATGAAGGTTTGTGTTTTTTTTTTTTTTTTTTTGGTTCAAAAAGTCGTTCTAAACTGACTCTAAACGTCGAGTTTTACAAACCCGTGTTATATTTCATCTTGATTTGGCTTTAAACCTCGCTAATTGAAGCTTTGGCTCCAAAATCGAAGAACTCAGATGAAGTTTCGTGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNGTCGTTCTAAACTTACTCTAAATGTCGAGTTTTACAAACCCGTTTTATATTTCATCTTGATTTGGCTTTAAACCTCGCTAAGTGAAGTTTTGGCTCCAAAATCGAACATCCCAGATGAAGGTTTGTGTTTTTTTTTGTTATGTATACAAAAAGTTGTTCTAAACTGACTCAGAACGTCGAGTTTTACAAACCCGATTTATATTTCACCTTGATTTGGCCTTAAACCTCGTCAAGTCGAGTTTTGGCTCCAAAATCGAAGAACCGAGGTGAAGGTTTGTGTTTTTTTTTTAATTATTCAAAAAGTTGTTCTAAACGGACTCTAAACGTCAAATTTTACAAACCCCTTCTATATTTCACTTTGATTTAGCTTTAAACCTCTTCCAGTCAAGTTTTAGATCCAAAATCGAAGAACCGAGATGAAGGTTTGTAAAAAGTTGTTCTAAACCGACTTCTAAACGTCAAGTTTTACAAACTCGTTTTATATTTCAACTTGATTTCGCTTAAACCTAGTCAATTTTTTGCTCCAAAATAGAAGAATCCAAATGAAAAATTGAATTTTTTTTAAGTATTTAGGAAATCATTTCAAACCGACTTAAACTCATTTTATATTTCACTTTAATTTGGCCTTAAACCTCGTCGAGTCAAGTTTTGATTCCAAAATCGAAGAACTCATATAAACACTCTATTTTTTAAGTATTTAAAAAGTCATTTCAAACAGATTCTGAATGCCAAATTTTACAAACCCATTTTATATTTCACCTTGGTTTGGCCTTAAACTCCACCCTGTCAAGTTTTTGGCTCCAAAATCGAAGAACCCATGAAGATTCATGTTTTTGAAGTATTTAAAAAGTCATTTCAAACAGACTCTGAGTGTCAAATTTTACAAACCCATTTTATATTTCACCTTGGTTTGGCCTCAAACCTCACCACGTTAAGTTTTGGGCTCCAAAATCGAAGAACCCAGATGAATGTTCGTGTTTTTTTAGTATTTAAAAAGTCATTCCAAGCCAATCCGCTAACCCATTTCATATTTCACCTTGATTCTTGAGAAAAAAAAAAAAAGAAAAAAAAAAGAAGAGTATTTTGGTGATTCCTTCCATTTAAGAGCCTTGAAACCCTAAAACTGTGAACCCATAATGATCTTTACAAGCTACCATGGCTATCTTCTTTCTTTTAACAGTCTGTCCGCCATTAAAGCGACCTTGTGGTTGCTGTATTAAAAAGCCCTAACTGCTCAAATGTCTGTGTTTCCCAATGGCATTACAAATTAAATGCTTTCCCTTTAACTTTTACCACTGTTTTAAATCAGCTTAGTAAGTGAATTGGTGACTTAATTAGGGTTTCTTTTTGGTAATCAATTAATTAGAGACTTAATCTTTGGTTTAATTTGTTTGACAGGCACAACACGACAGAGCTGATAATGCCATACTTCGAGCTGAAAATGAAACCTTAAAAAATGAAAATTATAGACTGCAAACTGCCCTAAGAAACATCATATGCCCTAGCTGTGGAGGGCAGGGTATCCTTGGGGAACCTAGCTTGGATGAACAGCAGCTTCGTCTCGAGAATGCTAGGCTTCGAGATCAGGTAAATTAATTAGGTCAAATTATAAGTTGATTCGTCTGTGAATCTTCGAGTTTGTGTCTCGTGATTCGTCCTAGAATTTTAAAAAGGGTCTAATGAGACCTTAAAGGCCAAATCAAGGTGAGATTTGCGAATTTTCGAGTATCTTTCGAACTTTCTAATTAGGTCAAATTATAAGTTTGTTAGTCCGTGAATTTTCGAGTCCGTGTCTCGTGATTCGTCTTAGAAATTTAAAGTGTCTAATAAGCCATCGATCTTGTAGATTTGTGTCTGTTTGATCTTTGAACTTTTAATTTTGTTGTTAACAGTTGGAACAAGTTTGCTCGATGACCACAAGATACACAGGTCGCCCAATTGAAGGGATGCCCATGCAACCATCTTTGGATTTGGACATGAATATATACTCGAGGCAATACACGGAGGCCATGGTTTCGTCCTCCGACATGATGTCGATGCCCTCGATGCTCCCGCCCGAGGCTGCCCACTTCTCGAAGGGCGACCTATTAATTGATGAGGAAAAGACACTGGCAATGGACCTGGCTGTGTCGTCCATAGCTGAACTTGTTAAGATGTGTCGCTCGACTGAGCCTCTTTGGGTTCGAGATAGCGAGAGCGGTAAGGACATTCTAAATGTCGAAGAGCATGCCAGGATGTTTCCATGGCCATTGAACCTCAAGCAACACTTGATCAATGAGTTTAGGACCGAAGCCACTCGTGACAGCGCCGTTGTTATAATGAATAGCATAACTCTTGTCGACGCCTTTCTTGATGCCGTAAGTTCCCATTTCTATGCTTTTGTTTCGAAATTCAATGGCTATGCATCTAGTTCTCCGTTCCTGGTTCTCCTATAGCCGGATTTCGTTATTCGTCGACCCGATCCAACTTGAAAATTTGACTAATCTTTTTTAATTGGAGTTTTTTTAGAACAAATGGATGGAATTATTTCCTTCAATTGTGGCCAAAGCAAGAACTATTCAAGTCATTTCATCAAGTGTTTCAGGCCATGCTAGTGGTTCCCTTCAGCTGGTAATATAATCGAAACCCTAAACTTCCAATTCCATCTCTAATACTAAGAACTTTTTCTTCCTTTCCTATAGATGTATGCAGAACTTCAAGCCCTTTCTCCGCTCGTTCCGACGAGAGAAGCTCACTTTCTCCGGTGCTGCCAACAAAACGCCGACGAAGGAAGCTGGGCCATCGTCGATTTCCCGATCGACACCTTTCACGACAGCCTTCAGCACTCGTTTCCGAGATACAGAAGAAAGCCCTCTGGTTGCATTATTCAAGACATGCCCAATGGATACTCTAGGGTAAACCCTTCAAACCTCTTCTTTTCCGAGTCTAAGCGCAAAATCTCGGGTTCGACAAATTACCACTCAACTCAAAAATTCAAATCATGGGTTACGAAAAATTGATTCAAATTCATTCAGGGTGTGAAAACGTCAGCTAAATGATTTCCACTCTCCTCGAACATAGAATAATTAGAAGAGACTGGAACAGTGATATAGCAATTATATTAAACGGTTGAACGTATGAGAGAAATTACAAAACATGTTTCGAACCGTACATTTAGATTACGATGGATCGACCGTGCCGAGTGATTATTAATATGTTTGTAGGTTACATGGGTGGAGCATGGAGAGATAGAAGAGAAGCCAATCCATCAAATATTCAATCACTTTGTGCATAGTGGAATGGCTTTTGGGGCACACCGTTGGTTGGCTATCTTACAAAGGCAATGCGAGAGAATTGCAAGCCTCATGGCTAGAAACATCTCTGATCTTGGAGGTTCTAACTCTATTTTATCTCTTTCTCTTTGAATTTGACCTAATTTTGTGATAAATTTGTACATTATGGTTTCTTTTGGGATTTGTTTTCAGTAATACCTTCACCGGAAGCTAGACAAAACCTAATGAAACTAGCACAAAGAATGATCAGAACTTTCTCCGTCAACATAAGCACCTCCGGTGGCCAGTCATGGACGGCGTTATCTGAATCTCCCGACGATACTGTTCGTATAACCACACGAAAAATCGTCGAGCCTGGCCAACCTAATGGGGTTATTCTCAGCGCCGTCTCGACCACTTGGCTTCCTTATCCTCATTATCGAGTCTTCGATCTTCTTCGAGACGAACGTCGACGGTCTCAGGTATTTTGACCCAATAATTTGTCGAGTCGCTATAGATTACGTTAGTCTCAGGTATTTTGACCTAAATAATTTGTCGAGTCGCTATAGATTACGGTTGTTTTAGGTATTTTGACCTAAACATTTCGTCGAGTCGCTATAGATTGTGGTAGTCTCAGGTATTTTGACCTAAATATTTTGTCGAGTCACTATAGATTACGGTAGTCTCAGGTACTTTGACCTAAATAATTCGTTGAGTCGCTATAGATTACGGTAGTCTCAGGTACTTTGACCTAAATATTTCGTCGAGTTGCTATAGATTATGGAAGTCTCAGGTATTTTGACCAAAATAATTCATCGAGTCACTCATCGAGTCACTGTAGATTATAGTAGTCTCAGGTATTTTGACCTAAATAATTCGTCGAGTCGCTGTAGATTACGATAGTCTCAGGTATTTTGACCTAAATAATTCGTCGAGTCGCTATAGATTACGGTAGTCTCAGGTATTTTGACCTACATGAGTCGCTATAGATTACGGTAGTCTCGGGTATTTTGACCTAAATATTTCGTCAGTCGCTATAGATTACGGTAGTCTCAAGTATTTTGACCTAAATATTTCGTCGAGTCGCTATAGATTACGGTAGTCTCAAGTATTTTGACCTAAATATTTCGTCGAGTCGCTATAGATTACGGTAGTCTCAAGTATTTTGACCTAAATATTTCGTCGAGTCGCTATAGATTACGGTAGTCTCAAGTATTTTGACCTAAATATTTCGTCGAGTCGCTATAGATTACGGTAGTCTCAAGTATTTTGACCTAAATATTTCGTCGAGTCGCTATAGATTACGGTAGTCTCAAGTATTTTGACCTAAATATTTCGTCGAGTCGCTATAGATTACGGTAGTCTCAAGTATTTTGACCTAAATAATTCATTGAGTCGCTATAGATCACGGTAGATTTAAAAACGTTGATATCGTTTCAGCTCGAGGTTCTTTCGAATGGGAATTCGCTGCACGAAGTTGCTCACATTGCCAATGGCTCTCACCCTGGAAATTGCATCTCTCTTCTTCGTATCAATGTGAGTTTTTCTTATGAAACTTGAAAACCCTAATCTCTGATTTCATCTCTATTGAAACTATTTGAATGTTGCAGGTGGCCAGCAACTCGTCCCAGCACGTCGAGCTAATGCTGCAAGAGAGCTGCACGGATCAGTCGGGCAGCCTCGTCGTCTACGCAACGATCGACGTCGATTCAATTCAATTAGCCATGAGTGGAGAAGACCCATCTTGCATTCCTCTCCTCCCCATAGGATTTTCCATTGTCCCCGTCGTCGGGTCTACCGTTGACGGACACCCATCACCACCACCCGAAGACGGTGTCGCTAACTCCGGCTGCCTCCTTACCGTCGGCTTGCAAGTTCTAGCCAGCACTATTCCATCAGCAAAGCTCAACCTATCAAGCGTTACTGCCATCAACAATCACCTATGTAACACGGTGCACCAAATCAACGCCGCTCTCGGCAGCCCGGCTAGTATCGAGAACGGTAATACCATTGCTGAGCCTAATAATGCACCAGCGCCGGCACCGGCACCGGCGCCAGCACCCAAGCAATAAGCTTTTGCATGGTAGGCAAAGAAAGAAGTGAGCTGCCAATTACCCAAAATCTCCCTCCCTCATGCCACCCATGACGTAGATTCCAAGGGTATTTTCGGTAACATGAAAAAAAAAACTAGGGTTTTTGAGTGGGAGATTGTGGGTAATTGGGAGAGAGAGATGGTAAATTAGTTAAAAAAAAAGTGTAATTAATTAAAATTATAGGAGTAGTTTTTGGTGGGTAAATAAGAGAGAGAAATAGGGTAGATTTGATGTGTGGATGTGGATCAGGGTGTAGGGTTATTGTGGAAGTCAAGAACGCACCAGATAAAACCCTTGGTCCCTGACAAGGCTGTCAGAAAGTGGGGCCTTGTCCAGTCAGCAAGGGTGGTGGTTCGGGCATTGACTTCTTCTGCTTAAATTGTGTGTCAGTCAGACGTCGTCGTTTCAGTTGCTTTTCAATTCTTATTTTTATTTAGATGAATTATTATGACTGAATTTATTGCTTCCCATTAATTGCCTTTTCTCCTTTCTCTTTTCCAAAAAAAAAAAAAAAAAAAAAAATTATT

mRNA sequence

TGTGAGGCCTCAAGATTTACCAAACAAACAAAGCATAAAGCTCTCTCCTTTTTGCAAGATGTGAGGTTTATAGAGAGAGAGAGACAACAAAATTTGTCTAAGTTTTCCATTTGTGGGTCTGAATTTTAACACAACCTTCCTCCTCCTCCTCCTCTTCTTCTTCATCCCAACCTCTTTCTCCGACCACCGCCGCAATCTCCGCCGCCAACCACCGCCAAAACCAAGAACAATCTTCATTTGGTGGAATTTCCTTGGTCAGTGTGGTGTAACTCCAAGAATGTATGGAGATTGCCAAGTGATATCAAGCAATATGGGAACAAACATAAACCCTAATTTCAATTTCATCTCCAATTTCCAACACTTTTCTTCCATATTGCCTAAGGAGGAAAATGGGGTATTGAGAGGGAAAGAAGATATGGAAAGTGGGTCTGGAAGCACAACACGACAGAGCTGATAATGCCATACTTCGAGCTGAAAATGAAACCTTAAAAAATGAAAATTATAGACTGCAAACTGCCCTAAGAAACATCATATGCCCTAGCTGTGGAGGGCAGGGTATCCTTGGGGAACCTAGCTTGGATGAACAGCAGCTTCGTCTCGAGAATGCTAGGCTTCGAGATCAGTTGGAACAAGTTTGCTCGATGACCACAAGATACACAGGTCGCCCAATTGAAGGGATGCCCATGCAACCATCTTTGGATTTGGACATGAATATATACTCGAGGCAATACACGGAGGCCATGGTTTCGTCCTCCGACATGATGTCGATGCCCTCGATGCTCCCGCCCGAGGCTGCCCACTTCTCGAAGGGCGACCTATTAATTGATGAGGAAAAGACACTGGCAATGGACCTGGCTGTGTCGTCCATAGCTGAACTTGTTAAGATGTGTCGCTCGACTGAGCCTCTTTGGGTTCGAGATAGCGAGAGCGGTAAGGACATTCTAAATGTCGAAGAGCATGCCAGGATGTTTCCATGGCCATTGAACCTCAAGCAACACTTGATCAATGAGTTTAGGACCGAAGCCACTCGTGACAGCGCCGTTGTTATAATGAATAGCATAACTCTTGTCGACGCCTTTCTTGATGCCAACAAATGGATGGAATTATTTCCTTCAATTGTGGCCAAAGCAAGAACTATTCAAGTCATTTCATCAAGTGTTTCAGGCCATGCTAGTGGTTCCCTTCAGCTGATGTATGCAGAACTTCAAGCCCTTTCTCCGCTCGTTCCGACGAGAGAAGCTCACTTTCTCCGGTGCTGCCAACAAAACGCCGACGAAGGAAGCTGGGCCATCGTCGATTTCCCGATCGACACCTTTCACGACAGCCTTCAGCACTCGTTTCCGAGATACAGAAGAAAGCCCTCTGGTTGCATTATTCAAGACATGCCCAATGGATACTCTAGGGTTACATGGGTGGAGCATGGAGAGATAGAAGAGAAGCCAATCCATCAAATATTCAATCACTTTGTGCATAGTGGAATGGCTTTTGGGGCACACCGTTGGTTGGCTATCTTACAAAGGCAATGCGAGAGAATTGCAAGCCTCATGGCTAGAAACATCTCTGATCTTGGAGTAATACCTTCACCGGAAGCTAGACAAAACCTAATGAAACTAGCACAAAGAATGATCAGAACTTTCTCCGTCAACATAAGCACCTCCGGTGGCCAGTCATGGACGGCGTTATCTGAATCTCCCGACGATACTGTTCGTATAACCACACGAAAAATCGTCGAGCCTGGCCAACCTAATGGGGTTATTCTCAGCGCCGTCTCGACCACTTGGCTTCCTTATCCTCATTATCGAGTCTTCGATCTTCTTCGAGACGAACGTCGACGGTCTCAGCTCGAGGTTCTTTCGAATGGGAATTCGCTGCACGAAGTTGCTCACATTGCCAATGGCTCTCACCCTGGAAATTGCATCTCTCTTCTTCGTATCAATGTGGCCAGCAACTCGTCCCAGCACGTCGAGCTAATGCTGCAAGAGAGCTGCACGGATCAGTCGGGCAGCCTCGTCGTCTACGCAACGATCGACGTCGATTCAATTCAATTAGCCATGAGTGGAGAAGACCCATCTTGCATTCCTCTCCTCCCCATAGGATTTTCCATTGTCCCCGTCGTCGGGTCTACCGTTGACGGACACCCATCACCACCACCCGAAGACGGTGTCGCTAACTCCGGCTGCCTCCTTACCGTCGGCTTGCAAGTTCTAGCCAGCACTATTCCATCAGCAAAGCTCAACCTATCAAGCGTTACTGCCATCAACAATCACCTATGTAACACGGTGCACCAAATCAACGCCGCTCTCGGCAGCCCGGCTAGTATCGAGAACGGTAATACCATTGCTGAGCCTAATAATGCACCAGCGCCGGCACCGGCACCGGCGCCAGCACCCAAGCAATAAGCTTTTGCATGGTAGGCAAAGAAAGAAGTGAGCTGCCAATTACCCAAAATCTCCCTCCCTCATGCCACCCATGACGTAGATTCCAAGGGTATTTTCGGTAACATGAAAAAAAAAACTAGGGTTTTTGAGTGGGAGATTGTGGGTAATTGGGAGAGAGAGATGGTAAATTAGTTAAAAAAAAAGTGTAATTAATTAAAATTATAGGAGTAGTTTTTGGTGGGTAAATAAGAGAGAGAAATAGGGTAGATTTGATGTGTGGATGTGGATCAGGGTGTAGGGTTATTGTGGAAGTCAAGAACGCACCAGATAAAACCCTTGGTCCCTGACAAGGCTGTCAGAAAGTGGGGCCTTGTCCAGTCAGCAAGGGTGGTGGTTCGGGCATTGACTTCTTCTGCTTAAATTGTGTGTCAGTCAGACGTCGTCGTTTCAGTTGCTTTTCAATTCTTATTTTTATTTAGATGAATTATTATGACTGAATTTATTGCTTCCCATTAATTGCCTTTTCTCCTTTCTCTTTTCCAAAAAAAAAAAAAAAAAAAAAAATTATT

Coding sequence (CDS)

ATGACCACAAGATACACAGGTCGCCCAATTGAAGGGATGCCCATGCAACCATCTTTGGATTTGGACATGAATATATACTCGAGGCAATACACGGAGGCCATGGTTTCGTCCTCCGACATGATGTCGATGCCCTCGATGCTCCCGCCCGAGGCTGCCCACTTCTCGAAGGGCGACCTATTAATTGATGAGGAAAAGACACTGGCAATGGACCTGGCTGTGTCGTCCATAGCTGAACTTGTTAAGATGTGTCGCTCGACTGAGCCTCTTTGGGTTCGAGATAGCGAGAGCGGTAAGGACATTCTAAATGTCGAAGAGCATGCCAGGATGTTTCCATGGCCATTGAACCTCAAGCAACACTTGATCAATGAGTTTAGGACCGAAGCCACTCGTGACAGCGCCGTTGTTATAATGAATAGCATAACTCTTGTCGACGCCTTTCTTGATGCCAACAAATGGATGGAATTATTTCCTTCAATTGTGGCCAAAGCAAGAACTATTCAAGTCATTTCATCAAGTGTTTCAGGCCATGCTAGTGGTTCCCTTCAGCTGATGTATGCAGAACTTCAAGCCCTTTCTCCGCTCGTTCCGACGAGAGAAGCTCACTTTCTCCGGTGCTGCCAACAAAACGCCGACGAAGGAAGCTGGGCCATCGTCGATTTCCCGATCGACACCTTTCACGACAGCCTTCAGCACTCGTTTCCGAGATACAGAAGAAAGCCCTCTGGTTGCATTATTCAAGACATGCCCAATGGATACTCTAGGGTTACATGGGTGGAGCATGGAGAGATAGAAGAGAAGCCAATCCATCAAATATTCAATCACTTTGTGCATAGTGGAATGGCTTTTGGGGCACACCGTTGGTTGGCTATCTTACAAAGGCAATGCGAGAGAATTGCAAGCCTCATGGCTAGAAACATCTCTGATCTTGGAGTAATACCTTCACCGGAAGCTAGACAAAACCTAATGAAACTAGCACAAAGAATGATCAGAACTTTCTCCGTCAACATAAGCACCTCCGGTGGCCAGTCATGGACGGCGTTATCTGAATCTCCCGACGATACTGTTCGTATAACCACACGAAAAATCGTCGAGCCTGGCCAACCTAATGGGGTTATTCTCAGCGCCGTCTCGACCACTTGGCTTCCTTATCCTCATTATCGAGTCTTCGATCTTCTTCGAGACGAACGTCGACGGTCTCAGCTCGAGGTTCTTTCGAATGGGAATTCGCTGCACGAAGTTGCTCACATTGCCAATGGCTCTCACCCTGGAAATTGCATCTCTCTTCTTCGTATCAATGTGGCCAGCAACTCGTCCCAGCACGTCGAGCTAATGCTGCAAGAGAGCTGCACGGATCAGTCGGGCAGCCTCGTCGTCTACGCAACGATCGACGTCGATTCAATTCAATTAGCCATGAGTGGAGAAGACCCATCTTGCATTCCTCTCCTCCCCATAGGATTTTCCATTGTCCCCGTCGTCGGGTCTACCGTTGACGGACACCCATCACCACCACCCGAAGACGGTGTCGCTAACTCCGGCTGCCTCCTTACCGTCGGCTTGCAAGTTCTAGCCAGCACTATTCCATCAGCAAAGCTCAACCTATCAAGCGTTACTGCCATCAACAATCACCTATGTAACACGGTGCACCAAATCAACGCCGCTCTCGGCAGCCCGGCTAGTATCGAGAACGGTAATACCATTGCTGAGCCTAATAATGCACCAGCGCCGGCACCGGCACCGGCGCCAGCACCCAAGCAATAA

Protein sequence

MTTRYTGRPIEGMPMQPSLDLDMNIYSRQYTEAMVSSSDMMSMPSMLPPEAAHFSKGDLLIDEEKTLAMDLAVSSIAELVKMCRSTEPLWVRDSESGKDILNVEEHARMFPWPLNLKQHLINEFRTEATRDSAVVIMNSITLVDAFLDANKWMELFPSIVAKARTIQVISSSVSGHASGSLQLMYAELQALSPLVPTREAHFLRCCQQNADEGSWAIVDFPIDTFHDSLQHSFPRYRRKPSGCIIQDMPNGYSRVTWVEHGEIEEKPIHQIFNHFVHSGMAFGAHRWLAILQRQCERIASLMARNISDLGVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQSWTALSESPDDTVRITTRKIVEPGQPNGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEVLSNGNSLHEVAHIANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQSGSLVVYATIDVDSIQLAMSGEDPSCIPLLPIGFSIVPVVGSTVDGHPSPPPEDGVANSGCLLTVGLQVLASTIPSAKLNLSSVTAINNHLCNTVHQINAALGSPASIENGNTIAEPNNAPAPAPAPAPAPKQ
BLAST of Cp4.1LG15g01980 vs. Swiss-Prot
Match: HDG5_ARATH (Homeobox-leucine zipper protein HDG5 OS=Arabidopsis thaliana GN=HDG5 PE=2 SV=3)

HSP 1 Score: 679.9 bits (1753), Expect = 2.5e-194
Identity = 362/587 (61.67%), Postives = 449/587 (76.49%), Query Frame = 1

Query: 1   MTTRYTGRPIEGMPM--------------QPSLDLDMNIYSRQYTEAMVSSSDMMSMPSM 60
           + +RYTGRP++ MP               QPSL+LDM++Y+  + E   S +DMM +P  
Sbjct: 235 IASRYTGRPMQSMPPSQPLINPSPMLPHHQPSLELDMSVYAGNFPEQ--SCTDMMMLPPQ 294

Query: 61  -----LPPEAAHFSKGD--LLIDEEKTLAMDLAVSSIAELVKMCRSTEPLWVRDSES--G 120
                 P + A+ +  +  LL DEEK +AM+ AVS + EL KMC + EPLW++      G
Sbjct: 295 DTACFFPDQTANNNNNNNMLLADEEKVIAMEFAVSCVQELTKMCDTEEPLWIKKKSDKIG 354

Query: 121 KDIL--NVEEHARMFPWPLNLKQHLINEFRTEATRDSAVVIMNSITLVDAFLDANKWMEL 180
            +IL  N EE+ R+FPWP+   Q+   +F  EA++ +AVVIMNSITLVDAFL+A+KW E+
Sbjct: 355 GEILCLNEEEYMRLFPWPME-NQNNKGDFLREASKANAVVIMNSITLVDAFLNADKWSEM 414

Query: 181 FPSIVAKARTIQVISSSVSGHASGSLQLMYAELQALSPLVPTREAHFLRCCQQNADEGSW 240
           F SIVA+A+T+Q+ISS VSG ASGSL LM+AELQ LSPLVPTREA+FLR  +QNA+ G+W
Sbjct: 415 FCSIVARAKTVQIISSGVSG-ASGSLLLMFAELQVLSPLVPTREAYFLRYVEQNAETGNW 474

Query: 241 AIVDFPIDTFHDSLQHSFP---RYRRKPSGCIIQDMPNGYSRVTWVEHGEIEEKPIHQIF 300
           AIVDFPID+FHD +Q        Y+RKPSGCIIQDMPNGYS+V WVEH E++EK +H+ F
Sbjct: 475 AIVDFPIDSFHDQMQPMNTITHEYKRKPSGCIIQDMPNGYSQVKWVEHVEVDEKHVHETF 534

Query: 301 NHFVHSGMAFGAHRWLAILQRQCERIASLMARNISDLGVIPSPEARQNLMKLAQRMIRTF 360
             +V SGMAFGA+RWL +LQRQCERIASLMARNI+DLGVI S EAR+N+M+L+QR+++TF
Sbjct: 535 AEYVKSGMAFGANRWLDVLQRQCERIASLMARNITDLGVISSAEARRNIMRLSQRLVKTF 594

Query: 361 SVNISTSGGQSWTALSESPDDTVRITTRKIVEPGQPNGVILSAVSTTWLPYPHYRVFDLL 420
            VNIST+ GQSWTALSE+  DTVRITTRK+ EPGQP GV+L AVSTTWLP+ H++VFDL+
Sbjct: 595 CVNISTAYGQSWTALSETTKDTVRITTRKMCEPGQPTGVVLCAVSTTWLPFSHHQVFDLI 654

Query: 421 RDERRRSQLEVLSNGNSLHEVAHIANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQ 480
           RD+  +S LEVL NGNS HEVAHIANGSHPGNCISLLRINVASNS  +VELMLQESC D 
Sbjct: 655 RDQHHQSLLEVLFNGNSPHEVAHIANGSHPGNCISLLRINVASNSWHNVELMLQESCIDN 714

Query: 481 SGSLVVYATIDVDSIQLAMSGEDPSCIPLLPIGFSIVPV---VGSTVDGHPSPPPEDGVA 540
           SGSL+VY+T+DVDSIQ AM+GED S IP+LP+GFSIVPV    G +V+ H SPP      
Sbjct: 715 SGSLIVYSTVDVDSIQQAMNGEDSSNIPILPLGFSIVPVNPPEGISVNSH-SPP------ 774

Query: 541 NSGCLLTVGLQVLASTIPSAKLNLSSVTAINNHLCNTVHQINAALGS 557
              CLLTVG+QVLAS +P+AK NLS+VT INNHLC TV+QI +AL +
Sbjct: 775 --SCLLTVGIQVLASNVPTAKPNLSTVTTINNHLCATVNQITSALSN 808

BLAST of Cp4.1LG15g01980 vs. Swiss-Prot
Match: ROC3_ORYSJ (Homeobox-leucine zipper protein ROC3 OS=Oryza sativa subsp. japonica GN=ROC3 PE=2 SV=1)

HSP 1 Score: 639.8 bits (1649), Expect = 2.9e-182
Identity = 351/595 (58.99%), Postives = 428/595 (71.93%), Query Frame = 1

Query: 15  MQPSLDLDMNIYSRQYTEA--MVSSSDMMSMPSMLPPEAAHFSKGDLLI---DEEKTLAM 74
           + P LDLDMN+YSR + E   ++   D++  P +   + A    G ++    +++K L +
Sbjct: 289 LMPPLDLDMNVYSRHFAEQAPVMGCGDLIPPPVVPQHDGAAAYMGAMMAPVQEQDKQLVV 348

Query: 75  DLAVSSIAELVKMCRSTEPLWVRDSESGKDILNVEEHARMFPWPLN-LKQHLINEF-RTE 134
           DLA ++  +L +MCR+ EPLWVR  + G +++ VEEHARMF WP++  KQ       R E
Sbjct: 349 DLAATAADQLARMCRAGEPLWVR--QRGAEVMAVEEHARMFSWPVDGAKQGDGGAVARAE 408

Query: 135 ATRDSAVVIMNSITLVDAFLDANKWMELFPSIVAKARTIQVIS-SSVSGH-ASGSLQLMY 194
            TRD+AVVIMNSI LVDAFLDANKWMELFPSIV KARTIQ+I+  + SGH  SG+L LM 
Sbjct: 409 GTRDNAVVIMNSINLVDAFLDANKWMELFPSIVCKARTIQIINHGAASGHLGSGTLLLMQ 468

Query: 195 AELQALSPLVPTREAHFLRCCQQNADEGSWAIVDFPIDTFHDSL-QHSFPRYRRKPSGCI 254
           AE+Q LSPLV  RE  F R C  NADEGSWAIVDFP + F + L Q S  R RR+PSGCI
Sbjct: 469 AEVQFLSPLVAAREVVFFRYCVHNADEGSWAIVDFPAEGFEEGLLQASVVRCRRRPSGCI 528

Query: 255 IQDMPNGYSRVTWVEHGEI--EEKPIHQIFNHFVHSGMAFGAHRWLAILQRQCERIASLM 314
           IQDMPNGYSRV WVEH E+  EEKP+  +F  +V SG AFGA RWL+ILQRQCER+AS +
Sbjct: 529 IQDMPNGYSRVVWVEHMEMVGEEKPLQPVFRDYVASGAAFGATRWLSILQRQCERLASEL 588

Query: 315 ARNISDLGVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQSWTALSESPDDTVRITTRKI 374
           ARNI+DLGVI +PEAR N+MKL+QRMI TF  NIS SG QSWTALS+S  DT+R+TTRK 
Sbjct: 589 ARNIADLGVIRTPEARTNMMKLSQRMITTFCANISASGTQSWTALSDSTQDTIRVTTRKN 648

Query: 375 VEPGQPNGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEVLSNGNSLHEVAHIANGSHP 434
            EPGQP+GVIL+AVST+WLP+ H +VF+LL DE++R QLE+LSNG SLHEVAHIANGSHP
Sbjct: 649 TEPGQPSGVILTAVSTSWLPFTHQQVFELLADEQQRCQLEILSNGGSLHEVAHIANGSHP 708

Query: 435 GNCISLLRINVASNSSQHVELMLQESCTD-QSGSLVVYATIDVDSIQLAMSGEDPSCIPL 494
            NCISLLRIN ASNSSQ+VEL+LQES T    GSLVV+AT+DVD+IQ+ MSGEDPS IPL
Sbjct: 709 RNCISLLRINAASNSSQNVELLLQESSTHPDGGSLVVFATVDVDAIQVTMSGEDPSYIPL 768

Query: 495 LPIGFSIVPVVGSTVDGHP------------------SPPPEDGVANS----------GC 554
           LP+GF+I P    +    P                  S PP +  +N+          GC
Sbjct: 769 LPLGFAIFPATSPSPAAAPTISSSTTTTTGNGNGETSSTPPRNSSSNNNNADELLPPNGC 828

Query: 555 LLTVGLQVLASTIPSAKLNLSSVTAINNHLCNTVHQINAALGSPASIENGNTIAE 569
           LLTVG+QVLAS +PSAKLNLSSVTAIN+H+CN +HQI AAL S A    G   ++
Sbjct: 829 LLTVGMQVLASAVPSAKLNLSSVTAINSHVCNAIHQITAALKSSAGGAGGEPASD 881

BLAST of Cp4.1LG15g01980 vs. Swiss-Prot
Match: ROC3_ORYSI (Homeobox-leucine zipper protein ROC3 OS=Oryza sativa subsp. indica GN=ROC3 PE=3 SV=2)

HSP 1 Score: 638.3 bits (1645), Expect = 8.4e-182
Identity = 350/595 (58.82%), Postives = 427/595 (71.76%), Query Frame = 1

Query: 15  MQPSLDLDMNIYSRQYTEA--MVSSSDMMSMPSMLPPEAAHFSKGDLLI---DEEKTLAM 74
           + P LDLDMN+YSR + E   ++   D++  P +   + A    G ++    +++K L +
Sbjct: 289 LMPPLDLDMNVYSRHFAEQAPVMGCGDLIPPPVVPQHDGAAAYMGAMMAPVQEQDKQLVV 348

Query: 75  DLAVSSIAELVKMCRSTEPLWVRDSESGKDILNVEEHARMFPWPLN-LKQHLINEF-RTE 134
           DLA ++  +L +MCR+ EPLWVR  + G +++ VEEHARMF WP++  KQ       R E
Sbjct: 349 DLAATAADQLARMCRAGEPLWVR--QRGAEVMAVEEHARMFSWPVDGAKQGDGGAVARAE 408

Query: 135 ATRDSAVVIMNSITLVDAFLDANKWMELFPSIVAKARTIQVIS-SSVSGH-ASGSLQLMY 194
            TRD+AVVIMNSI LVDAFLDANKWMELFPSIV KARTIQ+I+  + SGH  SG+L LM 
Sbjct: 409 GTRDNAVVIMNSINLVDAFLDANKWMELFPSIVCKARTIQIINHGAASGHLGSGTLLLMQ 468

Query: 195 AELQALSPLVPTREAHFLRCCQQNADEGSWAIVDFPIDTFHDSL-QHSFPRYRRKPSGCI 254
           AE+Q LSPLV  RE  F R C  NADEGSWAIVDFP + F + L Q S  R RR+PSGCI
Sbjct: 469 AEVQFLSPLVAAREVVFFRYCVHNADEGSWAIVDFPAEGFEEGLLQASVVRCRRRPSGCI 528

Query: 255 IQDMPNGYSRVTWVEHGEI--EEKPIHQIFNHFVHSGMAFGAHRWLAILQRQCERIASLM 314
           IQDMPNGYSRV WVEH E+  EEKP+  +F  +V SG AFGA RWL+ILQRQCER+AS +
Sbjct: 529 IQDMPNGYSRVVWVEHMEMVGEEKPLQPVFRDYVASGAAFGATRWLSILQRQCERLASEL 588

Query: 315 ARNISDLGVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQSWTALSESPDDTVRITTRKI 374
           ARNI+DLGVI +PEAR N+MKL+QRMI TF  NIS SG QSWTALS+S  DT+R+TTRK 
Sbjct: 589 ARNIADLGVIRTPEARTNMMKLSQRMITTFCANISASGTQSWTALSDSTQDTIRVTTRKN 648

Query: 375 VEPGQPNGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEVLSNGNSLHEVAHIANGSHP 434
            EPGQP+GVIL+AVST+WLP+ H +VF+LL DE++R QLE+LSNG SLHEVAHIANGSHP
Sbjct: 649 TEPGQPSGVILTAVSTSWLPFTHQQVFELLADEQQRCQLEILSNGGSLHEVAHIANGSHP 708

Query: 435 GNCISLLRINVASNSSQHVELMLQESCTD-QSGSLVVYATIDVDSIQLAMSGEDPSCIPL 494
            NCISLLRIN ASNSSQ+VEL+LQES T    GSLVV+AT+DVD+IQ+ MSGEDPS IPL
Sbjct: 709 RNCISLLRINAASNSSQNVELLLQESSTHPDGGSLVVFATVDVDAIQVTMSGEDPSYIPL 768

Query: 495 LPIGFSIVPVVGSTVDGHP------------------SPPPEDGVANS----------GC 554
           LP+GF+I P    +    P                  S PP +  +N+          GC
Sbjct: 769 LPLGFAIFPATSPSPAAAPTISSSTTTTTGNGNGETSSTPPRNSSSNNNNADELLPPNGC 828

Query: 555 LLTVGLQVLASTIPSAKLNLSSVTAINNHLCNTVHQINAALGSPASIENGNTIAE 569
           LLTVG+QVLAS +PSAKLNLSSVTAIN+H+CN +HQI AAL   A    G   ++
Sbjct: 829 LLTVGMQVLASAVPSAKLNLSSVTAINSHVCNAIHQITAALKGSAGGAGGEPASD 881

BLAST of Cp4.1LG15g01980 vs. Swiss-Prot
Match: HDG4_ARATH (Homeobox-leucine zipper protein HDG4 OS=Arabidopsis thaliana GN=HDG4 PE=1 SV=1)

HSP 1 Score: 525.4 bits (1352), Expect = 8.0e-148
Identity = 287/512 (56.05%), Postives = 371/512 (72.46%), Query Frame = 1

Query: 47  LPPEAAHFSKGDLLI-DEEKTLAMDLAVSSIAELVKMCRSTEPLWVRDSESGKDI-LNVE 106
           + PE    +  ++LI +EEK + M+LAVS   EL KMC   EPLW +     + + LN E
Sbjct: 214 ITPETNKNNNDNMLIAEEEKAIDMELAVSCARELAKMCDINEPLWNKKRLDNESVCLNEE 273

Query: 107 EHARMFPWPLNLKQHLINEFRTEATRDSAVVIMNSITLVDAFLDANKWMELFPSIVAKAR 166
           E+ +MF WPL       + FR EA+R +AV+++N ITLV AFLDA+KW E+F  IV+ A+
Sbjct: 274 EYKKMFLWPLMNDD---DRFRREASRANAVIMLNCITLVKAFLDADKWSEMFFPIVSSAK 333

Query: 167 TIQVISSSVSGHASGSLQLMYAELQALSPLVPTREAHFLRCCQQNADEGSWAIVDFPIDT 226
           T Q+ISS  SG  SG+L LM+AELQ +SPLVPTREA+FLR  +QNA+EG W +VDFPID 
Sbjct: 334 TAQIISSGASG-PSGTLLLMFAELQVVSPLVPTREAYFLRYVEQNAEEGKWMVVDFPIDR 393

Query: 227 FHDSLQHSFPRYRRKPSGCIIQDMPNGYSRVTWVEHGEIEEKPIH-QIFNHFVHSGMAFG 286
              +   +  +YRRKPSGCIIQ M NGYS+VTWVEH E+EEK +  ++   FV SG+AFG
Sbjct: 394 IKPASATTTDQYRRKPSGCIIQAMRNGYSQVTWVEHVEVEEKHVQDEVVREFVESGVAFG 453

Query: 287 AHRWLAILQRQCERIASLMARNISDLGVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQS 346
           A RWL++L+RQCER+ASLMA NI+DLGVIPS EAR+NLMKL+QRM++TF +NI  S GQ+
Sbjct: 454 AERWLSVLKRQCERMASLMATNITDLGVIPSVEARKNLMKLSQRMVKTFCLNIINSHGQA 513

Query: 347 WTALSESPDDTVRITTRKIVEPGQPNGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEV 406
            T       DTV+I +RK+       G++  AVS T LPY H +VFDLLRD +R SQLE+
Sbjct: 514 PTK------DTVKIVSRKVC-----GGLVPCAVSVTLLPYSHQQVFDLLRDNQRLSQLEI 573

Query: 407 LSNGNSLHEVAHIANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQSGSLVVYATID 466
           L  G+S  EVAHIANGSH GN ISLLRINV SNSS +VELMLQE+CTD SGSL+VY+T+D
Sbjct: 574 LFMGSSFQEVAHIANGSHLGNSISLLRINVESNSSHNVELMLQETCTDNSGSLLVYSTVD 633

Query: 467 VDSIQLAMSGEDPSCIPLLPIGFSIVPVVGSTVDGHPSPPPE-DGVANSGCLLTVGLQVL 526
             ++QLAM+GEDPS IPLLP+GFS+VPV       +PS   E   V++  CLLTV +QVL
Sbjct: 634 PVAVQLAMNGEDPSEIPLLPVGFSVVPV-------NPSDGVEGSSVSSPSCLLTVAIQVL 693

Query: 527 ASTIPSAKLNLSSVTAINNHLCNTVHQINAAL 555
            S + + +L+LS+V+ IN+ +C TV++I +AL
Sbjct: 694 GSNVTTERLDLSTVSVINHRICATVNRITSAL 703

BLAST of Cp4.1LG15g01980 vs. Swiss-Prot
Match: ROC2_ORYSJ (Homeobox-leucine zipper protein ROC2 OS=Oryza sativa subsp. japonica GN=ROC2 PE=2 SV=1)

HSP 1 Score: 452.6 bits (1163), Expect = 6.6e-126
Identity = 240/504 (47.62%), Postives = 352/504 (69.84%), Query Frame = 1

Query: 64  EKTLAMDLAVSSIAELVKMCRSTEPLW-----VRDSESGKDILNVEEHARMFPWPLNLKQ 123
           +K + ++LAV+++ ELV+M +  EPLW     +  + +  + L+ EE+ARMFP  L  KQ
Sbjct: 289 DKPMIVELAVAAMEELVRMAQLDEPLWSVAPPLDATAAAMETLSEEEYARMFPRGLGPKQ 348

Query: 124 HLINEFRTEATRDSAVVIMNSITLVDAFLDANKWMELFPSIVAKARTIQVISSSVSGHAS 183
           + +   R+EA+RDSAVVIM    LV+  +DAN++  +F +IV++A T++V+S+ V+G+ +
Sbjct: 349 YGL---RSEASRDSAVVIMTHANLVEILMDANQYAAVFSNIVSRAITLEVLSTGVAGNYN 408

Query: 184 GSLQLMYAELQALSPLVPTREAHFLRCCQQNADEGSWAIVDFPIDTFHDSLQHSFPRYRR 243
           G+LQ+M  E Q  SPLVPTRE++F+R C+QNAD G+WA+VD  +D+   S      + RR
Sbjct: 409 GALQVMSVEFQVPSPLVPTRESYFVRYCKQNAD-GTWAVVDVSLDSLRPS---PVLKCRR 468

Query: 244 KPSGCIIQDMPNGYSRVTWVEHGEIEEKPIHQIFNHFVHSGMAFGAHRWLAILQRQCERI 303
           +PSGC+IQ+MPNGYS+VTWVEH E++++ +H I+   V+SG+AFGA RW+  L RQCER+
Sbjct: 469 RPSGCLIQEMPNGYSKVTWVEHVEVDDRSVHNIYKLLVNSGLAFGARRWVGTLDRQCERL 528

Query: 304 ASLMARNI--SDLGVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQSWTALSESPDDTVR 363
           AS+MA NI  SD+GVI S E R++++KLA+RM+ +F   ++ S    WT LS S  + VR
Sbjct: 529 ASVMASNIPTSDIGVITSSEGRKSMLKLAERMVVSFCGGVTASVAHQWTTLSGSGAEDVR 588

Query: 364 ITTRKIV-EPGQPNGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEVLSNGNSLHEVAH 423
           + TRK V +PG+P G++L+A ++ WLP P  RVFD LRDE  RS+ ++LSNG  + E+AH
Sbjct: 589 VMTRKSVDDPGRPPGIVLNAATSFWLPVPPKRVFDFLRDESSRSEWDILSNGGIVQEMAH 648

Query: 424 IANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQSGSLVVYATIDVDSIQLAMSGED 483
           IANG   GNC+SLLR+N +SNS+Q   L+LQESCTD SGS V+YA +DV ++ + ++G D
Sbjct: 649 IANGRDQGNCVSLLRVN-SSNSNQSNMLILQESCTDASGSYVIYAPVDVVAMNVVLNGGD 708

Query: 484 PSCIPLLPIGFSIVPVVGSTVDGHPSPPPEDGVANSGCLLTVGLQVLASTIPSAKLNLSS 543
           P  + LLP GF+I+P  G   DG        GV + G LLTV  Q+L  ++P+AKL+L S
Sbjct: 709 PDYVALLPSGFAILP-DGPAHDGGDGDGGV-GVGSGGSLLTVAFQILVDSVPTAKLSLGS 768

Query: 544 VTAINNHLCNTVHQINAALGSPAS 560
           V  +N+ +  TV +I AA+   ++
Sbjct: 769 VATVNSLIACTVERIKAAVSGESN 782

BLAST of Cp4.1LG15g01980 vs. TrEMBL
Match: A0A0A0LEZ7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G901030 PE=4 SV=1)

HSP 1 Score: 1073.9 bits (2776), Expect = 6.3e-311
Identity = 534/590 (90.51%), Postives = 558/590 (94.58%), Query Frame = 1

Query: 1   MTTRYTGRPIEGMP------MQPSLDLDMNIYSRQYTEAMVSSSDMMSMPSMLPPEAAHF 60
           MTTRYTGRPI+ M       MQPSLDLDMNIYSRQYTEAMV SSDMM++PSMLPPEAAHF
Sbjct: 221 MTTRYTGRPIQAMASAAPPLMQPSLDLDMNIYSRQYTEAMVPSSDMMALPSMLPPEAAHF 280

Query: 61  SKGDLLIDEEKTLAMDLAVSSIAELVKMCRSTEPLWVRDSESGKDILNVEEHARMFPWPL 120
            +G LLI+EEKTLAMDLAVSSIAELVKMCR TEPLWVRD+ESGK++LNVEEH RMFPWPL
Sbjct: 281 PEGGLLIEEEKTLAMDLAVSSIAELVKMCRLTEPLWVRDNESGKEVLNVEEHGRMFPWPL 340

Query: 121 NLKQHLINEFRTEATRDSAVVIMNSITLVDAFLDANKWMELFPSIVAKARTIQVISSSVS 180
           NLKQHLINEFRTEATRDSAVVIMNSITLVDAFLDANKWMELFPSIVAKA+T+QVISSSVS
Sbjct: 341 NLKQHLINEFRTEATRDSAVVIMNSITLVDAFLDANKWMELFPSIVAKAKTVQVISSSVS 400

Query: 181 GHASGSLQLMYAELQALSPLVPTREAHFLRCCQQNADEGSWAIVDFPIDTFHDSLQHSFP 240
           GHAS SLQ+MYAELQ LSPLVPTREAHFLRCCQQNADEGSW +VDFPID+FHDSLQHSFP
Sbjct: 401 GHASSSLQVMYAELQTLSPLVPTREAHFLRCCQQNADEGSWTVVDFPIDSFHDSLQHSFP 460

Query: 241 RYRRKPSGCIIQDMPNGYSRVTWVEHGEIEEKPIHQIFNHFVHSGMAFGAHRWLAILQRQ 300
           RYRRKPSGCIIQDMPNGYSRVTWVEH EIEEKPIHQIFNHFVHSGMAFGA+RWLAILQRQ
Sbjct: 461 RYRRKPSGCIIQDMPNGYSRVTWVEHAEIEEKPIHQIFNHFVHSGMAFGANRWLAILQRQ 520

Query: 301 CERIASLMARNISDLGVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQSWTALSESPDDT 360
           CERIASLMARNISDLGVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQSWTALS+SP+DT
Sbjct: 521 CERIASLMARNISDLGVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQSWTALSDSPEDT 580

Query: 361 VRITTRKIVEPGQPNGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEVLSNGNSLHEVA 420
           VRITTRK+VEPGQPNGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEVLSNGNSLHEVA
Sbjct: 581 VRITTRKVVEPGQPNGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEVLSNGNSLHEVA 640

Query: 421 HIANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQSGSLVVYATIDVDSIQLAMSGE 480
           HIANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQSGSLVVYATIDVDSIQLAMSGE
Sbjct: 641 HIANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQSGSLVVYATIDVDSIQLAMSGE 700

Query: 481 DPSCIPLLPIGFSIVPVVGSTVDGHPSPPPEDGV--ANSGCLLTVGLQVLASTIPSAKLN 540
           DPSCIPLLPIGFSIVP++GST+DGHP+PPPEDG    NSGCLLTVGLQVLASTIPSAKLN
Sbjct: 701 DPSCIPLLPIGFSIVPIIGSTIDGHPAPPPEDGTPNPNSGCLLTVGLQVLASTIPSAKLN 760

Query: 541 LSSVTAINNHLCNTVHQINAALGSPASIENGNTIAEPNNAPAPAPAPAPA 583
           LSSVTAINNHLCNTVHQIN ALG P  +EN N +AEPNN P P P P P+
Sbjct: 761 LSSVTAINNHLCNTVHQINIALGGPGRLENDNVVAEPNNPPTPPPPPPPS 810

BLAST of Cp4.1LG15g01980 vs. TrEMBL
Match: A5AJ70_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_02s0012g02030 PE=4 SV=1)

HSP 1 Score: 874.0 bits (2257), Expect = 1.0e-250
Identity = 444/590 (75.25%), Postives = 511/590 (86.61%), Query Frame = 1

Query: 1   MTTRYTGRPIE--GMP---MQPSLDLDMNIYSRQYTEAMVSSSDMMSMPSMLPPEAAHFS 60
           + +RY GR I+  G P   + PSLDLDM+IY+R + E M + +DM+ +P M  PE++HF 
Sbjct: 212 LASRYGGRAIQAIGPPPPLLAPSLDLDMSIYARNFPEPMANCTDMIPVPLM--PESSHFP 271

Query: 61  KGDLLIDEEKTLAMDLAVSSIAELVKMCRSTEPLWVRDSESGKDILNVEEHARMFPWPLN 120
           +G L+++EEK+LA++LA+SS+ ELVKMC+  EPLW+R +E+GK+++NVEE+ RMFPWP+N
Sbjct: 272 EGGLVLEEEKSLALELAISSVDELVKMCQLGEPLWIRSNENGKEVINVEEYGRMFPWPMN 331

Query: 121 LKQHLINEFRTEATRDSAVVIMNSITLVDAFLDANKWMELFPSIVAKARTIQVISSSVSG 180
           LKQH   EFRTEATRDSAVVIMNSI LVDAFLDA KWMELFPSI+++A+T+QV+S  VSG
Sbjct: 332 LKQHP-GEFRTEATRDSAVVIMNSINLVDAFLDAMKWMELFPSIISRAKTVQVLSG-VSG 391

Query: 181 HASGSLQLMYAELQALSPLVPTREAHFLRCCQQNADEGSWAIVDFPIDTFHDSLQHSFPR 240
           HA+GSL LMYAELQ LSPLVPTRE HFLR CQQN DEG+WAIVDFPID+F+D+LQ S PR
Sbjct: 392 HANGSLHLMYAELQVLSPLVPTRETHFLRYCQQNVDEGTWAIVDFPIDSFNDNLQPSVPR 451

Query: 241 YRRKPSGCIIQDMPNGYSRVTWVEHGEIEEKPIHQIFNHFVHSGMAFGAHRWLAILQRQC 300
           YRR+PSGCIIQDMPNGYSRVTWVEH ++EEKP+H IF+HFV+SGMAFGA RWLA+LQRQC
Sbjct: 452 YRRRPSGCIIQDMPNGYSRVTWVEHADVEEKPVHHIFHHFVNSGMAFGATRWLAVLQRQC 511

Query: 301 ERIASLMARNISDLGVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQSWTALSESPDDTV 360
           ER+ASLMARNISDLGVIPSPEAR+NLM LAQRMIRTFSVNISTS GQSWTALS+S DDTV
Sbjct: 512 ERVASLMARNISDLGVIPSPEARKNLMNLAQRMIRTFSVNISTSSGQSWTALSDSSDDTV 571

Query: 361 RITTRKIVEPGQPNGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEVLSNGNSLHEVAH 420
           RITTRKI EPGQPNGVILSAVSTTWLP+PHY VFDLLRDERRR+QL+VLSNGNSLHEVAH
Sbjct: 572 RITTRKITEPGQPNGVILSAVSTTWLPHPHYHVFDLLRDERRRAQLDVLSNGNSLHEVAH 631

Query: 421 IANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQSGSLVVYATIDVDSIQLAMSGED 480
           IANGSHPGNCISLLRINVASNSSQ+VELMLQESCTDQSGS VVY TIDVD+IQLAMSGED
Sbjct: 632 IANGSHPGNCISLLRINVASNSSQNVELMLQESCTDQSGSHVVYTTIDVDAIQLAMSGED 691

Query: 481 PSCIPLLPIGFSIVPVVG-------STVDGHPSPPPEDGVA-NSGCLLTVGLQVLASTIP 540
           PSCIPLLP+GF+IVPVV        +T D +P PP  DG   NSGCLLTVGLQVLASTIP
Sbjct: 692 PSCIPLLPMGFAIVPVVPNNDCNIMTTTDDNPMPPSGDGNGHNSGCLLTVGLQVLASTIP 751

Query: 541 SAKLNLSSVTAINNHLCNTVHQINAALGSPASIENGNTIAEPNNAPAPAP 578
           +AKLNLSSVTAINNHLCNTVHQINAAL S    +N +++   +  PA  P
Sbjct: 752 TAKLNLSSVTAINNHLCNTVHQINAALSSICP-DNSSSMVGSSTEPAAPP 796

BLAST of Cp4.1LG15g01980 vs. TrEMBL
Match: M5WW92_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa014792mg PE=4 SV=1)

HSP 1 Score: 845.5 bits (2183), Expect = 3.9e-242
Identity = 434/598 (72.58%), Postives = 502/598 (83.95%), Query Frame = 1

Query: 1   MTTRYT-GRPIEGM-PMQP-----SLDLDMNIYSRQYTEAMVSSSDMMSMPSMLPPEA-A 60
           +++RYT GR I+ M P  P     SLDLDMNIYSR + + M S  DM+ MP +LPPE  +
Sbjct: 218 LSSRYTTGRQIQTMAPGDPLMSASSLDLDMNIYSRHFQDPMTSCGDMIPMP-LLPPEVPS 277

Query: 61  HFSKGDLLIDEEKTLAMDLAVSSIAELVKMCRSTEPLWVRDSESGKDILNVEEHARMFP- 120
           H+++G +L+DEEK+LA++LA SS+ ELVKMC++ EPLW+R+SE GK++LNV+E+ RMFP 
Sbjct: 278 HYNEGGVLLDEEKSLAVELAASSVDELVKMCQAGEPLWIRNSEIGKEVLNVKEYTRMFPP 337

Query: 121 WPLNLKQHLINEFRTEATRDSAVVIMNSITLVDAFLDANKWMELFPSIVAKARTIQVISS 180
           WPLNLK H  ++FRTEATRDSAVVIMNSI LVD FLDANKWMELFPSIV++A+T+QVI +
Sbjct: 338 WPLNLKHHSSDQFRTEATRDSAVVIMNSINLVDCFLDANKWMELFPSIVSRAKTVQVIQA 397

Query: 181 SVSGHASGSLQLMYAELQALSPLVPTREAHFLRCCQQNADEGSWAIVDFPIDTFHDSLQH 240
             SG A+GSLQLMYAELQ LSPLVPTREAHFLR CQQNA+EG WAIVDFPID+FHD+LQ 
Sbjct: 398 DPSGQANGSLQLMYAELQILSPLVPTREAHFLRYCQQNAEEGCWAIVDFPIDSFHDNLQS 457

Query: 241 SFPRYRRKPSGCIIQDMPNGYSRVTWVEHGEIEEKPIHQIFNHFVHSGMAFGAHRWLAIL 300
           SFPRY+R PSGC+IQDMPNGYS++TWVEH EIEEKP+HQI +H+++SGMAFGA RWLAIL
Sbjct: 458 SFPRYKRLPSGCLIQDMPNGYSKITWVEHAEIEEKPVHQILSHYIYSGMAFGAQRWLAIL 517

Query: 301 QRQCERIASLMARNISDLGVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQSWTALSESP 360
           QRQCER+ASLMARNISDLGVIPSPEAR+NLMKL+QRMIRTF VN+STS GQSWTALS+SP
Sbjct: 518 QRQCERVASLMARNISDLGVIPSPEARKNLMKLSQRMIRTFCVNMSTSNGQSWTALSDSP 577

Query: 361 DDTVRITTRKIVEPGQPNGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEVLSNGNSLH 420
           DDTVRITTRK+ +PGQP GVILSAVSTTWLPY HYRVF+LLRDE RR+QL+VLSNGNSLH
Sbjct: 578 DDTVRITTRKVTDPGQPIGVILSAVSTTWLPYSHYRVFELLRDEHRRAQLDVLSNGNSLH 637

Query: 421 EVAHIANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQSGSLVVYATIDVDSIQLAM 480
           EVAHIANGSHPGNCISLLRINVASNS+Q+VELMLQESCTD+SGSLVVY T+DVD IQLAM
Sbjct: 638 EVAHIANGSHPGNCISLLRINVASNSAQNVELMLQESCTDESGSLVVYTTMDVDGIQLAM 697

Query: 481 SGEDPSCIPLLPIGFSIVPV-----VGSTVDGHPSPPPE-------------DGVANSGC 540
           SGEDPSCIPLLP+GF IVP+      G T +   S  P+               V NSGC
Sbjct: 698 SGEDPSCIPLLPLGFVIVPLHPMESTGPTPNHLTSSSPDHRQEDSTATTTTTSNVINSGC 757

Query: 541 LLTVGLQVLASTIPSAKLNLSSVTAINNHLCNTVHQINAALGS-------PASIENGN 565
           LLTVGLQVLAST PSAKLNLSSVTAINNHLCN+V QI +AL S        A+ ENG+
Sbjct: 758 LLTVGLQVLASTSPSAKLNLSSVTAINNHLCNSVQQIISALSSGSDTCIAAATTENGS 814

BLAST of Cp4.1LG15g01980 vs. TrEMBL
Match: A0A0B2S421_GLYSO (Homeobox-leucine zipper protein ROC3 OS=Glycine soja GN=glysoja_014944 PE=4 SV=1)

HSP 1 Score: 841.6 bits (2173), Expect = 5.6e-241
Identity = 418/581 (71.94%), Postives = 493/581 (84.85%), Query Frame = 1

Query: 1   MTTRYTGRPIEGMP-----MQPSLDLDMNIYSRQYTEAMVSSSDMMSMPSMLPPEAAHFS 60
           +TTRYTGRPI+ M      M PSLDLDM+IY R + + +   ++M+ +P MLPPEA+ FS
Sbjct: 212 LTTRYTGRPIQTMATGPTLMAPSLDLDMSIYPRHFADTIAPCTEMIPVP-MLPPEASPFS 271

Query: 61  KGDLLIDEEKTLAMDLAVSSIAELVKMCRSTEPLWVRDSESGKDILNVEEHARMFPWPLN 120
           +G +L++EEK+L ++LA SS+AELVKMC++ EPLW+R +ES +++LN EEHARMF WP N
Sbjct: 272 EGGILMEEEKSLTLELAASSMAELVKMCQTNEPLWIRSTESEREVLNFEEHARMFAWPQN 331

Query: 121 LKQHLINEFRTEATRDSAVVIMNSITLVDAFLDANKWMELFPSIVAKARTIQVISSSVSG 180
           LK    +E RTEA+RD++VVIMNS+TLVDAFLDA KWMELFP+IV++A+T+Q+ISS  SG
Sbjct: 332 LKHR--SELRTEASRDTSVVIMNSVTLVDAFLDAQKWMELFPTIVSRAKTVQIISSGASG 391

Query: 181 HASGSLQLMYAELQALSPLVPTREAHFLRCCQQNADEGSWAIVDFPIDTFHDSLQHSFPR 240
            ASG+LQLMYAE Q LSPLV TRE HFLR CQQNA+EG+WAIVDFP+D+FH +   S+PR
Sbjct: 392 LASGTLQLMYAEFQVLSPLVSTRETHFLRYCQQNAEEGTWAIVDFPVDSFHQNFHPSYPR 451

Query: 241 YRRKPSGCIIQDMPNGYSRVTWVEHGEIEEKPIHQIFNHFVHSGMAFGAHRWLAILQRQC 300
           Y R+ SGC+IQDMPNGYSRVTWVEH ++EEKP+HQIF ++V+SGMAFGA RWL +LQRQC
Sbjct: 452 YCRRSSGCVIQDMPNGYSRVTWVEHAKVEEKPVHQIFCNYVYSGMAFGAQRWLGVLQRQC 511

Query: 301 ERIASLMARNISDLGVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQSWTALSESPDDTV 360
           ER+ASLMARNISDLG IPSPEAR+NLMKLAQRMI+TFS+N+STSGGQSWTA+S+SP+DTV
Sbjct: 512 ERVASLMARNISDLGAIPSPEARKNLMKLAQRMIKTFSLNMSTSGGQSWTAISDSPEDTV 571

Query: 361 RITTRKIVEPGQPNGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEVLSNGNSLHEVAH 420
           RITTRKI EPGQPNGVILSAVSTTWLPY H +VFDLLRDERRRSQ++ LSNGNSL+EVAH
Sbjct: 572 RITTRKITEPGQPNGVILSAVSTTWLPYSHTKVFDLLRDERRRSQMDALSNGNSLNEVAH 631

Query: 421 IANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQSGSLVVYATIDVDSIQLAMSGED 480
           IANGSHPGNCISLLRINVASNSSQ+VELMLQE+CTDQSGS+VVY TIDVDSIQLAMSGED
Sbjct: 632 IANGSHPGNCISLLRINVASNSSQNVELMLQENCTDQSGSIVVYTTIDVDSIQLAMSGED 691

Query: 481 PSCIPLLPIGFSIV-----------PVVGSTVDGHPSPPPEDGVANS-GCLLTVGLQVLA 540
           PSCI LLP GF IV           P++ +  +    PPP     NS GCLLT+GLQVLA
Sbjct: 692 PSCIALLPQGFKIVPMSSPPNNVDTPIIDAATNSSSEPPPSLNNNNSGGCLLTMGLQVLA 751

Query: 541 STIPSAKLNLSSVTAINNHLCNTVHQINAALGSPASIENGN 565
           STIPSAKLNLSSVTAINNHLCNT+HQI AAL S +S ENGN
Sbjct: 752 STIPSAKLNLSSVTAINNHLCNTLHQIEAALSSSSSHENGN 789

BLAST of Cp4.1LG15g01980 vs. TrEMBL
Match: K7LF40_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_09G207500 PE=4 SV=1)

HSP 1 Score: 841.6 bits (2173), Expect = 5.6e-241
Identity = 418/581 (71.94%), Postives = 493/581 (84.85%), Query Frame = 1

Query: 1   MTTRYTGRPIEGMP-----MQPSLDLDMNIYSRQYTEAMVSSSDMMSMPSMLPPEAAHFS 60
           +TTRYTGRPI+ M      M PSLDLDM+IY R + + +   ++M+ +P MLPPEA+ FS
Sbjct: 212 LTTRYTGRPIQTMATGPTLMAPSLDLDMSIYPRHFADTIAPCTEMIPVP-MLPPEASPFS 271

Query: 61  KGDLLIDEEKTLAMDLAVSSIAELVKMCRSTEPLWVRDSESGKDILNVEEHARMFPWPLN 120
           +G +L++EEK+L ++LA SS+AELVKMC++ EPLW+R +ES +++LN EEHARMF WP N
Sbjct: 272 EGGILMEEEKSLTLELAASSMAELVKMCQTNEPLWIRSTESEREVLNFEEHARMFAWPQN 331

Query: 121 LKQHLINEFRTEATRDSAVVIMNSITLVDAFLDANKWMELFPSIVAKARTIQVISSSVSG 180
           LK    +E RTEA+RD++VVIMNS+TLVDAFLDA KWMELFP+IV++A+T+Q+ISS  SG
Sbjct: 332 LKHR--SELRTEASRDTSVVIMNSVTLVDAFLDAQKWMELFPTIVSRAKTVQIISSGASG 391

Query: 181 HASGSLQLMYAELQALSPLVPTREAHFLRCCQQNADEGSWAIVDFPIDTFHDSLQHSFPR 240
            ASG+LQLMYAE Q LSPLV TRE HFLR CQQNA+EG+WAIVDFP+D+FH +   S+PR
Sbjct: 392 LASGTLQLMYAEFQVLSPLVSTRETHFLRYCQQNAEEGTWAIVDFPVDSFHQNFHPSYPR 451

Query: 241 YRRKPSGCIIQDMPNGYSRVTWVEHGEIEEKPIHQIFNHFVHSGMAFGAHRWLAILQRQC 300
           Y R+ SGC+IQDMPNGYSRVTWVEH ++EEKP+HQIF ++V+SGMAFGA RWL +LQRQC
Sbjct: 452 YCRRSSGCVIQDMPNGYSRVTWVEHAKVEEKPVHQIFCNYVYSGMAFGAQRWLGVLQRQC 511

Query: 301 ERIASLMARNISDLGVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQSWTALSESPDDTV 360
           ER+ASLMARNISDLG IPSPEAR+NLMKLAQRMI+TFS+N+STSGGQSWTA+S+SP+DTV
Sbjct: 512 ERVASLMARNISDLGAIPSPEARKNLMKLAQRMIKTFSLNMSTSGGQSWTAISDSPEDTV 571

Query: 361 RITTRKIVEPGQPNGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEVLSNGNSLHEVAH 420
           RITTRKI EPGQPNGVILSAVSTTWLPY H +VFDLLRDERRRSQ++ LSNGNSL+EVAH
Sbjct: 572 RITTRKITEPGQPNGVILSAVSTTWLPYSHTKVFDLLRDERRRSQMDALSNGNSLNEVAH 631

Query: 421 IANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQSGSLVVYATIDVDSIQLAMSGED 480
           IANGSHPGNCISLLRINVASNSSQ+VELMLQE+CTDQSGS+VVY TIDVDSIQLAMSGED
Sbjct: 632 IANGSHPGNCISLLRINVASNSSQNVELMLQENCTDQSGSIVVYTTIDVDSIQLAMSGED 691

Query: 481 PSCIPLLPIGFSIV-----------PVVGSTVDGHPSPPPEDGVANS-GCLLTVGLQVLA 540
           PSCI LLP GF IV           P++ +  +    PPP     NS GCLLT+GLQVLA
Sbjct: 692 PSCIALLPQGFKIVPMSSPPNNVDTPIIDAATNSSSEPPPSLNNNNSGGCLLTMGLQVLA 751

Query: 541 STIPSAKLNLSSVTAINNHLCNTVHQINAALGSPASIENGN 565
           STIPSAKLNLSSVTAINNHLCNT+HQI AAL S +S ENGN
Sbjct: 752 STIPSAKLNLSSVTAINNHLCNTLHQIEAALSSSSSHENGN 789

BLAST of Cp4.1LG15g01980 vs. TAIR10
Match: AT5G46880.1 (AT5G46880.1 homeobox-7)

HSP 1 Score: 679.9 bits (1753), Expect = 1.4e-195
Identity = 362/587 (61.67%), Postives = 449/587 (76.49%), Query Frame = 1

Query: 1   MTTRYTGRPIEGMPM--------------QPSLDLDMNIYSRQYTEAMVSSSDMMSMPSM 60
           + +RYTGRP++ MP               QPSL+LDM++Y+  + E   S +DMM +P  
Sbjct: 235 IASRYTGRPMQSMPPSQPLINPSPMLPHHQPSLELDMSVYAGNFPEQ--SCTDMMMLPPQ 294

Query: 61  -----LPPEAAHFSKGD--LLIDEEKTLAMDLAVSSIAELVKMCRSTEPLWVRDSES--G 120
                 P + A+ +  +  LL DEEK +AM+ AVS + EL KMC + EPLW++      G
Sbjct: 295 DTACFFPDQTANNNNNNNMLLADEEKVIAMEFAVSCVQELTKMCDTEEPLWIKKKSDKIG 354

Query: 121 KDIL--NVEEHARMFPWPLNLKQHLINEFRTEATRDSAVVIMNSITLVDAFLDANKWMEL 180
            +IL  N EE+ R+FPWP+   Q+   +F  EA++ +AVVIMNSITLVDAFL+A+KW E+
Sbjct: 355 GEILCLNEEEYMRLFPWPME-NQNNKGDFLREASKANAVVIMNSITLVDAFLNADKWSEM 414

Query: 181 FPSIVAKARTIQVISSSVSGHASGSLQLMYAELQALSPLVPTREAHFLRCCQQNADEGSW 240
           F SIVA+A+T+Q+ISS VSG ASGSL LM+AELQ LSPLVPTREA+FLR  +QNA+ G+W
Sbjct: 415 FCSIVARAKTVQIISSGVSG-ASGSLLLMFAELQVLSPLVPTREAYFLRYVEQNAETGNW 474

Query: 241 AIVDFPIDTFHDSLQHSFP---RYRRKPSGCIIQDMPNGYSRVTWVEHGEIEEKPIHQIF 300
           AIVDFPID+FHD +Q        Y+RKPSGCIIQDMPNGYS+V WVEH E++EK +H+ F
Sbjct: 475 AIVDFPIDSFHDQMQPMNTITHEYKRKPSGCIIQDMPNGYSQVKWVEHVEVDEKHVHETF 534

Query: 301 NHFVHSGMAFGAHRWLAILQRQCERIASLMARNISDLGVIPSPEARQNLMKLAQRMIRTF 360
             +V SGMAFGA+RWL +LQRQCERIASLMARNI+DLGVI S EAR+N+M+L+QR+++TF
Sbjct: 535 AEYVKSGMAFGANRWLDVLQRQCERIASLMARNITDLGVISSAEARRNIMRLSQRLVKTF 594

Query: 361 SVNISTSGGQSWTALSESPDDTVRITTRKIVEPGQPNGVILSAVSTTWLPYPHYRVFDLL 420
            VNIST+ GQSWTALSE+  DTVRITTRK+ EPGQP GV+L AVSTTWLP+ H++VFDL+
Sbjct: 595 CVNISTAYGQSWTALSETTKDTVRITTRKMCEPGQPTGVVLCAVSTTWLPFSHHQVFDLI 654

Query: 421 RDERRRSQLEVLSNGNSLHEVAHIANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQ 480
           RD+  +S LEVL NGNS HEVAHIANGSHPGNCISLLRINVASNS  +VELMLQESC D 
Sbjct: 655 RDQHHQSLLEVLFNGNSPHEVAHIANGSHPGNCISLLRINVASNSWHNVELMLQESCIDN 714

Query: 481 SGSLVVYATIDVDSIQLAMSGEDPSCIPLLPIGFSIVPV---VGSTVDGHPSPPPEDGVA 540
           SGSL+VY+T+DVDSIQ AM+GED S IP+LP+GFSIVPV    G +V+ H SPP      
Sbjct: 715 SGSLIVYSTVDVDSIQQAMNGEDSSNIPILPLGFSIVPVNPPEGISVNSH-SPP------ 774

Query: 541 NSGCLLTVGLQVLASTIPSAKLNLSSVTAINNHLCNTVHQINAALGS 557
              CLLTVG+QVLAS +P+AK NLS+VT INNHLC TV+QI +AL +
Sbjct: 775 --SCLLTVGIQVLASNVPTAKPNLSTVTTINNHLCATVNQITSALSN 808

BLAST of Cp4.1LG15g01980 vs. TAIR10
Match: AT4G17710.1 (AT4G17710.1 homeodomain GLABROUS 4)

HSP 1 Score: 525.4 bits (1352), Expect = 4.5e-149
Identity = 287/512 (56.05%), Postives = 371/512 (72.46%), Query Frame = 1

Query: 47  LPPEAAHFSKGDLLI-DEEKTLAMDLAVSSIAELVKMCRSTEPLWVRDSESGKDI-LNVE 106
           + PE    +  ++LI +EEK + M+LAVS   EL KMC   EPLW +     + + LN E
Sbjct: 214 ITPETNKNNNDNMLIAEEEKAIDMELAVSCARELAKMCDINEPLWNKKRLDNESVCLNEE 273

Query: 107 EHARMFPWPLNLKQHLINEFRTEATRDSAVVIMNSITLVDAFLDANKWMELFPSIVAKAR 166
           E+ +MF WPL       + FR EA+R +AV+++N ITLV AFLDA+KW E+F  IV+ A+
Sbjct: 274 EYKKMFLWPLMNDD---DRFRREASRANAVIMLNCITLVKAFLDADKWSEMFFPIVSSAK 333

Query: 167 TIQVISSSVSGHASGSLQLMYAELQALSPLVPTREAHFLRCCQQNADEGSWAIVDFPIDT 226
           T Q+ISS  SG  SG+L LM+AELQ +SPLVPTREA+FLR  +QNA+EG W +VDFPID 
Sbjct: 334 TAQIISSGASG-PSGTLLLMFAELQVVSPLVPTREAYFLRYVEQNAEEGKWMVVDFPIDR 393

Query: 227 FHDSLQHSFPRYRRKPSGCIIQDMPNGYSRVTWVEHGEIEEKPIH-QIFNHFVHSGMAFG 286
              +   +  +YRRKPSGCIIQ M NGYS+VTWVEH E+EEK +  ++   FV SG+AFG
Sbjct: 394 IKPASATTTDQYRRKPSGCIIQAMRNGYSQVTWVEHVEVEEKHVQDEVVREFVESGVAFG 453

Query: 287 AHRWLAILQRQCERIASLMARNISDLGVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQS 346
           A RWL++L+RQCER+ASLMA NI+DLGVIPS EAR+NLMKL+QRM++TF +NI  S GQ+
Sbjct: 454 AERWLSVLKRQCERMASLMATNITDLGVIPSVEARKNLMKLSQRMVKTFCLNIINSHGQA 513

Query: 347 WTALSESPDDTVRITTRKIVEPGQPNGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEV 406
            T       DTV+I +RK+       G++  AVS T LPY H +VFDLLRD +R SQLE+
Sbjct: 514 PTK------DTVKIVSRKVC-----GGLVPCAVSVTLLPYSHQQVFDLLRDNQRLSQLEI 573

Query: 407 LSNGNSLHEVAHIANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQSGSLVVYATID 466
           L  G+S  EVAHIANGSH GN ISLLRINV SNSS +VELMLQE+CTD SGSL+VY+T+D
Sbjct: 574 LFMGSSFQEVAHIANGSHLGNSISLLRINVESNSSHNVELMLQETCTDNSGSLLVYSTVD 633

Query: 467 VDSIQLAMSGEDPSCIPLLPIGFSIVPVVGSTVDGHPSPPPE-DGVANSGCLLTVGLQVL 526
             ++QLAM+GEDPS IPLLP+GFS+VPV       +PS   E   V++  CLLTV +QVL
Sbjct: 634 PVAVQLAMNGEDPSEIPLLPVGFSVVPV-------NPSDGVEGSSVSSPSCLLTVAIQVL 693

Query: 527 ASTIPSAKLNLSSVTAINNHLCNTVHQINAAL 555
            S + + +L+LS+V+ IN+ +C TV++I +AL
Sbjct: 694 GSNVTTERLDLSTVSVINHRICATVNRITSAL 703

BLAST of Cp4.1LG15g01980 vs. TAIR10
Match: AT4G04890.1 (AT4G04890.1 protodermal factor 2)

HSP 1 Score: 452.2 bits (1162), Expect = 4.8e-127
Identity = 250/574 (43.55%), Postives = 377/574 (65.68%), Query Frame = 1

Query: 1   MTTRYTGRPIEGMPMQP--------SLDLDMNIYSRQ--YTEAMVSSSDMMSMPSMLPPE 60
           +  +Y G+P+ G    P        SLDL++  +  Q  +   M  + D++   S +P E
Sbjct: 188 IAAKYVGKPL-GSSFAPLAIHAPSRSLDLEVGNFGNQTGFVGEMYGTGDILRSVS-IPSE 247

Query: 61  AAHFSKGDLLIDEEKTLAMDLAVSSIAELVKMCRSTEPLWVRDSESGKDILNVEEHARMF 120
                        +K + ++LAV+++ ELV+M ++ +PLW+  +++  +ILN EE+ R F
Sbjct: 248 T------------DKPIIVELAVAAMEELVRMAQTGDPLWL-STDNSVEILNEEEYFRTF 307

Query: 121 PWPLNLKQHLINEFRTEATRDSAVVIMNSITLVDAFLDANKWMELFPSIVAKARTIQVIS 180
           P  +  K   +   R+EA+R SAVVIMN I LV+  +D N+W  +F  IV++A T++V+S
Sbjct: 308 PRGIGPKPLGL---RSEASRQSAVVIMNHINLVEILMDVNQWSCVFSGIVSRALTLEVLS 367

Query: 181 SSVSGHASGSLQLMYAELQALSPLVPTREAHFLRCCQQNADEGSWAIVDFPIDTFHDSLQ 240
           + V+G+ +G+LQ+M AE Q  SPLVPTRE +F+R C+Q++D GSWA+VD  +D    SL+
Sbjct: 368 TGVAGNYNGALQVMTAEFQVPSPLVPTRENYFVRYCKQHSD-GSWAVVDVSLD----SLR 427

Query: 241 HSFP--RYRRKPSGCIIQDMPNGYSRVTWVEHGEIEEKPIHQIFNHFVHSGMAFGAHRWL 300
            S P  R RR+PSGC+IQ++PNGYS+VTW+EH E++++ +H ++   V SG+AFGA RW+
Sbjct: 428 PSTPILRTRRRPSGCLIQELPNGYSKVTWIEHMEVDDRSVHNMYKPLVQSGLAFGAKRWV 487

Query: 301 AILQRQCERIASLMARNI-SDLGVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQSWTAL 360
           A L+RQCER+AS MA NI  DL VI SPE R++++KLA+RM+ +F   +  S   +WT +
Sbjct: 488 ATLERQCERLASSMASNIPGDLSVITSPEGRKSMLKLAERMVMSFCSGVGASTAHAWTTM 547

Query: 361 SESPDDTVRITTRKIV-EPGQPNGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEVLSN 420
           S +  D VR+ TRK + +PG+P G++LSA ++ W+P    RVFD LRDE  R + ++LSN
Sbjct: 548 STTGSDDVRVMTRKSMDDPGRPPGIVLSAATSFWIPVAPKRVFDFLRDENSRKEWDILSN 607

Query: 421 GNSLHEVAHIANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQSGSLVVYATIDVDS 480
           G  + E+AHIANG  PGNC+SLLR+N + NSSQ   L+LQESCTD SGS V+YA +D+ +
Sbjct: 608 GGMVQEMAHIANGHEPGNCVSLLRVN-SGNSSQSNMLILQESCTDASGSYVIYAPVDIVA 667

Query: 481 IQLAMSGEDPSCIPLLPIGFSIVPVVGSTVDGHPSPPPEDGVANS------GCLLTVGLQ 540
           + + +SG DP  + LLP GF+I+P    +V G      ++ V+ +      G LLTV  Q
Sbjct: 668 MNVVLSGGDPDYVALLPSGFAILP--DGSVGGGDGNQHQEMVSTTSSGSCGGSLLTVAFQ 727

Query: 541 VLASTIPSAKLNLSSVTAINNHLCNTVHQINAAL 555
           +L  ++P+AKL+L SV  +N+ +  TV +I AA+
Sbjct: 728 ILVDSVPTAKLSLGSVATVNSLIKCTVERIKAAV 735

BLAST of Cp4.1LG15g01980 vs. TAIR10
Match: AT1G05230.1 (AT1G05230.1 homeodomain GLABROUS 2)

HSP 1 Score: 441.8 bits (1135), Expect = 6.5e-124
Identity = 242/548 (44.16%), Postives = 360/548 (65.69%), Query Frame = 1

Query: 29  QYTEAMVSSSDMMSMPSMLPPEAAHFSKG------------DLL------IDEEKTLAMD 88
           +Y    VS+  +MS P  LPP     + G            DLL       + +K + +D
Sbjct: 193 KYVGKPVSNYPLMSPPP-LPPRPLELAMGNIGGEAYGNNPNDLLKSITAPTESDKPVIID 252

Query: 89  LAVSSIAELVKMCRSTEPLWVRDSESGKDILNVEEHARMFPWPLNLKQHLINEFRTEATR 148
           L+V+++ EL++M +  EPLW         +L+ EE+AR FP  +  +      +R+EA+R
Sbjct: 253 LSVAAMEELMRMVQVDEPLWK------SLVLDEEEYARTFPRGIGPRPA---GYRSEASR 312

Query: 149 DSAVVIMNSITLVDAFLDANKWMELFPSIVAKARTIQVISSSVSGHASGSLQLMYAELQA 208
           +SAVVIMN + +V+  +D N+W  +F  +V++A T+ V+S+ V+G+ +G+LQ+M AE Q 
Sbjct: 313 ESAVVIMNHVNIVEILMDVNQWSTIFAGMVSRAMTLAVLSTGVAGNYNGALQVMSAEFQV 372

Query: 209 LSPLVPTREAHFLRCCQQNADEGSWAIVDFPIDTFHDSLQHSFP-RYRRKPSGCIIQDMP 268
            SPLVPTRE +F R C+Q  D GSWA+VD  +D    SLQ + P R RR+ SGC+IQ++P
Sbjct: 373 PSPLVPTRETYFARYCKQQGD-GSWAVVDISLD----SLQPNPPARCRRRASGCLIQELP 432

Query: 269 NGYSRVTWVEHGEIEEKPIHQIFNHFVHSGMAFGAHRWLAILQRQCERIASLMARNIS-- 328
           NGYS+VTWVEH E++++ +H ++ H V +G AFGA RW+AIL RQCER+AS+MA NIS  
Sbjct: 433 NGYSKVTWVEHVEVDDRGVHNLYKHMVSTGHAFGAKRWVAILDRQCERLASVMATNISSG 492

Query: 329 DLGVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQSWTALSESPDDTVRITTRKIVE-PG 388
           ++GVI + E R++++KLA+RM+ +F   +S S   +WT LS +  + VR+ TRK V+ PG
Sbjct: 493 EVGVITNQEGRRSMLKLAERMVISFCAGVSASTAHTWTTLSGTGAEDVRVMTRKSVDDPG 552

Query: 389 QPNGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEVLSNGNSLHEVAHIANGSHPGNCI 448
           +P G++LSA ++ W+P P  RVFD LRDE  R++ ++LSNG  + E+AHIANG   GNC+
Sbjct: 553 RPPGIVLSAATSFWIPVPPKRVFDFLRDENSRNEWDILSNGGVVQEMAHIANGRDTGNCV 612

Query: 449 SLLRINVASNSSQHVELMLQESCTDQSGSLVVYATIDVDSIQLAMSGEDPSCIPLLPIGF 508
           SLLR+N A NSSQ   L+LQESCTD + S V+YA +D+ ++ + ++G DP  + LLP GF
Sbjct: 613 SLLRVNSA-NSSQSNMLILQESCTDPTASFVIYAPVDIVAMNIVLNGGDPDYVALLPSGF 672

Query: 509 SIVPVVGSTVDGHPSPPPEDGVANSGCLLTVGLQVLASTIPSAKLNLSSVTAINNHLCNT 555
           +I+P      DG+ +     G  + G LLTV  Q+L  ++P+AKL+L SV  +NN +  T
Sbjct: 673 AILP------DGNANSGAPGG--DGGSLLTVAFQILVDSVPTAKLSLGSVATVNNLIACT 716

BLAST of Cp4.1LG15g01980 vs. TAIR10
Match: AT4G21750.1 (AT4G21750.1 Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein)

HSP 1 Score: 438.3 bits (1126), Expect = 7.2e-123
Identity = 244/563 (43.34%), Postives = 363/563 (64.48%), Query Frame = 1

Query: 18  SLDLDMNIYSRQ------YTEAMVSSSDMMSMPSMLPPEAAHFSKGDLLIDEEKTLAMDL 77
           SLDL++  +         +   M  SSD++   S +P EA            +K + ++L
Sbjct: 217 SLDLEVGNFGNNNNSHTGFVGEMFGSSDILRSVS-IPSEA------------DKPMIVEL 276

Query: 78  AVSSIAELVKMCRSTEPLWVRDSESGKDILNVEEHARMFPWPLNLKQHLINEFRTEATRD 137
           AV+++ ELV+M ++ +PLWV  S++  +ILN EE+ R FP  +  K   +   R+EA+R+
Sbjct: 277 AVAAMEELVRMAQTGDPLWV-SSDNSVEILNEEEYFRTFPRGIGPKPIGL---RSEASRE 336

Query: 138 SAVVIMNSITLVDAFLDANKWMELFPSIVAKARTIQVISSSVSGHASGSLQLMYAELQAL 197
           S VVIMN I L++  +D N+W  +F  IV++A T++V+S+ V+G+ +G+LQ+M AE Q  
Sbjct: 337 STVVIMNHINLIEILMDVNQWSSVFCGIVSRALTLEVLSTGVAGNYNGALQVMTAEFQVP 396

Query: 198 SPLVPTREAHFLRCCQQNADEGSWAIVDFPIDTFHDSLQHSFPRYRRKPSGCIIQDMPNG 257
           SPLVPTRE +F+R C+Q++D G WA+VD  +D+   S      R RR+PSGC+IQ++ NG
Sbjct: 397 SPLVPTRENYFVRYCKQHSD-GIWAVVDVSLDSLRPS---PITRSRRRPSGCLIQELQNG 456

Query: 258 YSRVTWVEHGEIEEKPIHQIFNHFVHSGMAFGAHRWLAILQRQCERIASLMARNI--SDL 317
           YS+VTWVEH E++++ +H ++   V++G+AFGA RW+A L RQCER+AS MA NI   DL
Sbjct: 457 YSKVTWVEHIEVDDRSVHNMYKPLVNTGLAFGAKRWVATLDRQCERLASSMASNIPACDL 516

Query: 318 GVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQSWTALSESPDDTVRITTRKIV-EPGQP 377
            VI SPE R++++KLA+RM+ +F   +  S   +WT LS +  D VR+ TRK + +PG+P
Sbjct: 517 SVITSPEGRKSMLKLAERMVMSFCTGVGASTAHAWTTLSTTGSDDVRVMTRKSMDDPGRP 576

Query: 378 NGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEVLSNGNSLHEVAHIANGSHPGNCISL 437
            G++LSA ++ W+P    RVFD LRDE  RS+ ++LSNG  + E+AHIANG  PGN +SL
Sbjct: 577 PGIVLSAATSFWIPVAPKRVFDFLRDENSRSEWDILSNGGLVQEMAHIANGRDPGNSVSL 636

Query: 438 LRINVASNSSQHVELMLQESCTDQSGSLVVYATIDVDSIQLAMSGEDPSCIPLLPIGFSI 497
           LR+N + NS Q   L+LQESCTD SGS V+YA +D+ ++ + +SG DP  + LLP GF+I
Sbjct: 637 LRVN-SGNSGQSNMLILQESCTDASGSYVIYAPVDIIAMNVVLSGGDPDYVALLPSGFAI 696

Query: 498 VP-------------VVGSTVDGHPSPPPEDGVANS----GCLLTVGLQVLASTIPSAKL 555
           +P               G+ V+G       + V  +    G LLTV  Q+L  ++P+AKL
Sbjct: 697 LPDGSARGGGGSANASAGAGVEGGGEGNNLEVVTTTGSCGGSLLTVAFQILVDSVPTAKL 756

BLAST of Cp4.1LG15g01980 vs. NCBI nr
Match: gi|778687851|ref|XP_011652639.1| (PREDICTED: homeobox-leucine zipper protein HDG5 isoform X1 [Cucumis sativus])

HSP 1 Score: 1073.9 bits (2776), Expect = 9.1e-311
Identity = 534/590 (90.51%), Postives = 558/590 (94.58%), Query Frame = 1

Query: 1   MTTRYTGRPIEGMP------MQPSLDLDMNIYSRQYTEAMVSSSDMMSMPSMLPPEAAHF 60
           MTTRYTGRPI+ M       MQPSLDLDMNIYSRQYTEAMV SSDMM++PSMLPPEAAHF
Sbjct: 221 MTTRYTGRPIQAMASAAPPLMQPSLDLDMNIYSRQYTEAMVPSSDMMALPSMLPPEAAHF 280

Query: 61  SKGDLLIDEEKTLAMDLAVSSIAELVKMCRSTEPLWVRDSESGKDILNVEEHARMFPWPL 120
            +G LLI+EEKTLAMDLAVSSIAELVKMCR TEPLWVRD+ESGK++LNVEEH RMFPWPL
Sbjct: 281 PEGGLLIEEEKTLAMDLAVSSIAELVKMCRLTEPLWVRDNESGKEVLNVEEHGRMFPWPL 340

Query: 121 NLKQHLINEFRTEATRDSAVVIMNSITLVDAFLDANKWMELFPSIVAKARTIQVISSSVS 180
           NLKQHLINEFRTEATRDSAVVIMNSITLVDAFLDANKWMELFPSIVAKA+T+QVISSSVS
Sbjct: 341 NLKQHLINEFRTEATRDSAVVIMNSITLVDAFLDANKWMELFPSIVAKAKTVQVISSSVS 400

Query: 181 GHASGSLQLMYAELQALSPLVPTREAHFLRCCQQNADEGSWAIVDFPIDTFHDSLQHSFP 240
           GHAS SLQ+MYAELQ LSPLVPTREAHFLRCCQQNADEGSW +VDFPID+FHDSLQHSFP
Sbjct: 401 GHASSSLQVMYAELQTLSPLVPTREAHFLRCCQQNADEGSWTVVDFPIDSFHDSLQHSFP 460

Query: 241 RYRRKPSGCIIQDMPNGYSRVTWVEHGEIEEKPIHQIFNHFVHSGMAFGAHRWLAILQRQ 300
           RYRRKPSGCIIQDMPNGYSRVTWVEH EIEEKPIHQIFNHFVHSGMAFGA+RWLAILQRQ
Sbjct: 461 RYRRKPSGCIIQDMPNGYSRVTWVEHAEIEEKPIHQIFNHFVHSGMAFGANRWLAILQRQ 520

Query: 301 CERIASLMARNISDLGVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQSWTALSESPDDT 360
           CERIASLMARNISDLGVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQSWTALS+SP+DT
Sbjct: 521 CERIASLMARNISDLGVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQSWTALSDSPEDT 580

Query: 361 VRITTRKIVEPGQPNGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEVLSNGNSLHEVA 420
           VRITTRK+VEPGQPNGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEVLSNGNSLHEVA
Sbjct: 581 VRITTRKVVEPGQPNGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEVLSNGNSLHEVA 640

Query: 421 HIANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQSGSLVVYATIDVDSIQLAMSGE 480
           HIANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQSGSLVVYATIDVDSIQLAMSGE
Sbjct: 641 HIANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQSGSLVVYATIDVDSIQLAMSGE 700

Query: 481 DPSCIPLLPIGFSIVPVVGSTVDGHPSPPPEDGV--ANSGCLLTVGLQVLASTIPSAKLN 540
           DPSCIPLLPIGFSIVP++GST+DGHP+PPPEDG    NSGCLLTVGLQVLASTIPSAKLN
Sbjct: 701 DPSCIPLLPIGFSIVPIIGSTIDGHPAPPPEDGTPNPNSGCLLTVGLQVLASTIPSAKLN 760

Query: 541 LSSVTAINNHLCNTVHQINAALGSPASIENGNTIAEPNNAPAPAPAPAPA 583
           LSSVTAINNHLCNTVHQIN ALG P  +EN N +AEPNN P P P P P+
Sbjct: 761 LSSVTAINNHLCNTVHQINIALGGPGRLENDNVVAEPNNPPTPPPPPPPS 810

BLAST of Cp4.1LG15g01980 vs. NCBI nr
Match: gi|659132081|ref|XP_008466007.1| (PREDICTED: homeobox-leucine zipper protein HDG5 [Cucumis melo])

HSP 1 Score: 1069.7 bits (2765), Expect = 1.8e-309
Identity = 535/591 (90.52%), Postives = 559/591 (94.59%), Query Frame = 1

Query: 1   MTTRYTGRPIEGMP------MQPSLDLDMNIYSRQYTEAMVSSSDMMSMPSMLPPEAAHF 60
           MTTRYTGRPI+ M       MQPSLDLDMNIYSRQYTEAMV SS+MM++PSMLPPEAAHF
Sbjct: 221 MTTRYTGRPIQAMASTAPPLMQPSLDLDMNIYSRQYTEAMVPSSEMMALPSMLPPEAAHF 280

Query: 61  SKGDLLIDEEKTLAMDLAVSSIAELVKMCRSTEPLWVRDSESGKDILNVEEHARMFPWPL 120
            +G LLI+EEKTLAMDLAVSSIAELVKMCR TEPLWVRD+ESGK+ILNVEEH RMFPWPL
Sbjct: 281 PEGGLLIEEEKTLAMDLAVSSIAELVKMCRLTEPLWVRDNESGKEILNVEEHGRMFPWPL 340

Query: 121 NLKQHLINEFRTEATRDSAVVIMNSITLVDAFLDANKWMELFPSIVAKARTIQVISSSVS 180
           NLKQHLINEFRTEATRDSAVVIMNSITLVDAFLDANKWMELFPSIVAKA+T+QVISSSVS
Sbjct: 341 NLKQHLINEFRTEATRDSAVVIMNSITLVDAFLDANKWMELFPSIVAKAKTVQVISSSVS 400

Query: 181 GHASGSLQLMYAELQALSPLVPTREAHFLRCCQQNADEGSWAIVDFPIDTFHDSLQHSFP 240
           GHA+ SLQLMYAELQ LSPLVPTREAHFLRCCQQNADEGSW +VDFPID+FHDSLQHSFP
Sbjct: 401 GHATSSLQLMYAELQTLSPLVPTREAHFLRCCQQNADEGSWTVVDFPIDSFHDSLQHSFP 460

Query: 241 RYRRKPSGCIIQDMPNGYSRVTWVEHGEIEEKPIHQIFNHFVHSGMAFGAHRWLAILQRQ 300
           RYRRKPSGCIIQDMPNGYSRVTWVEH EIEEKPIHQIF+HFVHSGMAFGAHRWLAILQRQ
Sbjct: 461 RYRRKPSGCIIQDMPNGYSRVTWVEHAEIEEKPIHQIFDHFVHSGMAFGAHRWLAILQRQ 520

Query: 301 CERIASLMARNISDLGVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQSWTALSESPDDT 360
           CERIASLMARNISDLGVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQSWTALS+SP+DT
Sbjct: 521 CERIASLMARNISDLGVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQSWTALSDSPEDT 580

Query: 361 VRITTRKIVEPGQPNGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEVLSNGNSLHEVA 420
           VRITTRK+VEPGQPNGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEVLSNGNSLHEVA
Sbjct: 581 VRITTRKVVEPGQPNGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEVLSNGNSLHEVA 640

Query: 421 HIANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQSGSLVVYATIDVDSIQLAMSGE 480
           HIANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQSGSLVVYATIDVDSIQLAMSGE
Sbjct: 641 HIANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQSGSLVVYATIDVDSIQLAMSGE 700

Query: 481 DPSCIPLLPIGFSIVPVVGSTVDGHPSPPPEDGV--ANSGCLLTVGLQVLASTIPSAKLN 540
           DPSCIPLLPIGFSIVP++GSTVDGHP+PPP+DG   ANSGCLLTVGLQVLASTIPSAKLN
Sbjct: 701 DPSCIPLLPIGFSIVPILGSTVDGHPAPPPDDGTPNANSGCLLTVGLQVLASTIPSAKLN 760

Query: 541 LSSVTAINNHLCNTVHQINAALGSPASIEN-GNTIAEPNNAPAPAPAPAPA 583
           LSSVTAINNHLCNTVHQIN ALG P  +EN  N +AEPNN P P P P P+
Sbjct: 761 LSSVTAINNHLCNTVHQINIALGGPGRLENDNNVVAEPNNPPTPPPPPPPS 811

BLAST of Cp4.1LG15g01980 vs. NCBI nr
Match: gi|778687854|ref|XP_011652640.1| (PREDICTED: homeobox-leucine zipper protein HDG5 isoform X2 [Cucumis sativus])

HSP 1 Score: 1068.9 bits (2763), Expect = 3.1e-309
Identity = 534/590 (90.51%), Postives = 557/590 (94.41%), Query Frame = 1

Query: 1   MTTRYTGRPIEGMP------MQPSLDLDMNIYSRQYTEAMVSSSDMMSMPSMLPPEAAHF 60
           MTTRYTGRPI+ M       MQPSLDLDMNIYSRQYTEAMV SSDMM++PSMLPPEAAHF
Sbjct: 221 MTTRYTGRPIQAMASAAPPLMQPSLDLDMNIYSRQYTEAMVPSSDMMALPSMLPPEAAHF 280

Query: 61  SKGDLLIDEEKTLAMDLAVSSIAELVKMCRSTEPLWVRDSESGKDILNVEEHARMFPWPL 120
            +G LLI+EEKTLAMDLAVSSIAELVKMCR TEPLWVRD+ESGK++LNVEEH RMFPWPL
Sbjct: 281 PEGGLLIEEEKTLAMDLAVSSIAELVKMCRLTEPLWVRDNESGKEVLNVEEHGRMFPWPL 340

Query: 121 NLKQHLINEFRTEATRDSAVVIMNSITLVDAFLDANKWMELFPSIVAKARTIQVISSSVS 180
           NLKQHLINEFRTEATRDSAVVIMNSITLVDAFLDANKWMELFPSIVAKA+T+QVISSSVS
Sbjct: 341 NLKQHLINEFRTEATRDSAVVIMNSITLVDAFLDANKWMELFPSIVAKAKTVQVISSSVS 400

Query: 181 GHASGSLQLMYAELQALSPLVPTREAHFLRCCQQNADEGSWAIVDFPIDTFHDSLQHSFP 240
           GHAS SLQ MYAELQ LSPLVPTREAHFLRCCQQNADEGSW +VDFPID+FHDSLQHSFP
Sbjct: 401 GHASSSLQ-MYAELQTLSPLVPTREAHFLRCCQQNADEGSWTVVDFPIDSFHDSLQHSFP 460

Query: 241 RYRRKPSGCIIQDMPNGYSRVTWVEHGEIEEKPIHQIFNHFVHSGMAFGAHRWLAILQRQ 300
           RYRRKPSGCIIQDMPNGYSRVTWVEH EIEEKPIHQIFNHFVHSGMAFGA+RWLAILQRQ
Sbjct: 461 RYRRKPSGCIIQDMPNGYSRVTWVEHAEIEEKPIHQIFNHFVHSGMAFGANRWLAILQRQ 520

Query: 301 CERIASLMARNISDLGVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQSWTALSESPDDT 360
           CERIASLMARNISDLGVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQSWTALS+SP+DT
Sbjct: 521 CERIASLMARNISDLGVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQSWTALSDSPEDT 580

Query: 361 VRITTRKIVEPGQPNGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEVLSNGNSLHEVA 420
           VRITTRK+VEPGQPNGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEVLSNGNSLHEVA
Sbjct: 581 VRITTRKVVEPGQPNGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEVLSNGNSLHEVA 640

Query: 421 HIANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQSGSLVVYATIDVDSIQLAMSGE 480
           HIANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQSGSLVVYATIDVDSIQLAMSGE
Sbjct: 641 HIANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQSGSLVVYATIDVDSIQLAMSGE 700

Query: 481 DPSCIPLLPIGFSIVPVVGSTVDGHPSPPPEDGV--ANSGCLLTVGLQVLASTIPSAKLN 540
           DPSCIPLLPIGFSIVP++GST+DGHP+PPPEDG    NSGCLLTVGLQVLASTIPSAKLN
Sbjct: 701 DPSCIPLLPIGFSIVPIIGSTIDGHPAPPPEDGTPNPNSGCLLTVGLQVLASTIPSAKLN 760

Query: 541 LSSVTAINNHLCNTVHQINAALGSPASIENGNTIAEPNNAPAPAPAPAPA 583
           LSSVTAINNHLCNTVHQIN ALG P  +EN N +AEPNN P P P P P+
Sbjct: 761 LSSVTAINNHLCNTVHQINIALGGPGRLENDNVVAEPNNPPTPPPPPPPS 809

BLAST of Cp4.1LG15g01980 vs. NCBI nr
Match: gi|1009158962|ref|XP_015897558.1| (PREDICTED: LOW QUALITY PROTEIN: homeobox-leucine zipper protein ROC3-like [Ziziphus jujuba])

HSP 1 Score: 874.4 bits (2258), Expect = 1.1e-250
Identity = 453/608 (74.51%), Postives = 519/608 (85.36%), Query Frame = 1

Query: 1   MTTRYTGRPIEGMP-------MQPSLDLDMNIYSRQYTEAMVSSSDMMSMPSMLPPEA-A 60
           +T+RYTGR I+ M        M PSLDLDM+IYSR +++ M S SDM+ +P MLPPEA +
Sbjct: 212 LTSRYTGRAIQPMTPAAAPPLMPPSLDLDMDIYSRHFSDPMGSCSDMVPVP-MLPPEANS 271

Query: 61  HFSKGD-LLIDEEKTLAMDLAVSSIAELVKMCRSTEPLWVRDSESGKDILNVEEHARMFP 120
           HF +    L++EEK+LA+DLA+SS+ EL+KMC+S++PLW+R++E+G+++LN+EEHARMFP
Sbjct: 272 HFPETTGQLMEEEKSLALDLAMSSMDELIKMCQSSQPLWIRNNENGREVLNLEEHARMFP 331

Query: 121 WPLNLKQHLINEFRTEATRDSAVVIMNSITLVDAFLDANKWMELFPSIVAKARTIQVISS 180
           WPLNLKQH  +EFRTEA+RDSAVVIMNSITLVDAFLDANKWM+LFPSIV +A+T+QVISS
Sbjct: 332 WPLNLKQHS-SEFRTEASRDSAVVIMNSITLVDAFLDANKWMDLFPSIVCRAKTVQVISS 391

Query: 181 SVSGHASGSLQLMYAELQALSPLVPTREAHFLRCCQQNADEGSWAIVDFPIDTFHDS--L 240
             SG A+GSLQLM AELQ +SPLVPTREAHFLR CQQNA+EGSWAIVDFPID+FH+S   
Sbjct: 392 DASGQANGSLQLMCAELQVVSPLVPTREAHFLRYCQQNAEEGSWAIVDFPIDSFHESNIQ 451

Query: 241 QHSFPRYRRKPSGCIIQDMPNGYSRVTWVEHGEIEEKPIHQIFNHFVHSGMAFGAHRWLA 300
             SFPRYRR+PSGC+IQDMPNGYSRVTWVEHGEIEEKPIHQ F+H V+SGMAFGAHRWLA
Sbjct: 452 PSSFPRYRRRPSGCVIQDMPNGYSRVTWVEHGEIEEKPIHQTFSHLVYSGMAFGAHRWLA 511

Query: 301 ILQRQCERIASLMARNISDLGVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQSWTALSE 360
           +LQRQCER+ASLMARNISDLGVIPSPEAR+NLMKLAQRMIRTF VN+STS  QSWTALS+
Sbjct: 512 VLQRQCERVASLMARNISDLGVIPSPEARRNLMKLAQRMIRTFCVNMSTSSNQSWTALSD 571

Query: 361 SPDDTVRITTRKIVEPGQPNGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEVLSNGNS 420
           SP+DTVRITTRK+ EPGQPNGVI SAVSTTWLPY HY+VFDLLRDERRRSQL+VLSNGNS
Sbjct: 572 SPEDTVRITTRKVTEPGQPNGVIPSAVSTTWLPYSHYQVFDLLRDERRRSQLDVLSNGNS 631

Query: 421 LHEVAHIANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQSGSLVVYATIDVDSIQL 480
           LHEVAHIANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQSGSLVVY TIDVD+IQL
Sbjct: 632 LHEVAHIANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQSGSLVVYTTIDVDAIQL 691

Query: 481 AMSGEDPSCIPLLPIGFSIVPVVGSTV------------DGHPSPPPED-----GVANSG 540
           AMSGEDPSCIPLLP+GF IVPV  ++V            DG   P  ED        +S 
Sbjct: 692 AMSGEDPSCIPLLPLGFVIVPVESTSVTTTPTTTATMATDGSSVPSSEDATGSAATVSSS 751

Query: 541 CLLTVGLQVLASTIPSAKLNLSSVTAINNHLCNTVHQINAALGSPAS--IENGN---TIA 576
           CLLTVGLQVLASTIPSAKLNLSSV AINNHLCNTV QI +ALGS ++   +NG+   +  
Sbjct: 752 CLLTVGLQVLASTIPSAKLNLSSVNAINNHLCNTVQQIISALGSTSTTCTDNGSVGGSCT 811

BLAST of Cp4.1LG15g01980 vs. NCBI nr
Match: gi|225427116|ref|XP_002277673.1| (PREDICTED: homeobox-leucine zipper protein ROC3 [Vitis vinifera])

HSP 1 Score: 874.0 bits (2257), Expect = 1.5e-250
Identity = 444/590 (75.25%), Postives = 511/590 (86.61%), Query Frame = 1

Query: 1   MTTRYTGRPIE--GMP---MQPSLDLDMNIYSRQYTEAMVSSSDMMSMPSMLPPEAAHFS 60
           + +RY GR I+  G P   + PSLDLDM+IY+R + E M + +DM+ +P M  PE++HF 
Sbjct: 212 LASRYGGRAIQAIGPPPPLLAPSLDLDMSIYARNFPEPMANCTDMIPVPLM--PESSHFP 271

Query: 61  KGDLLIDEEKTLAMDLAVSSIAELVKMCRSTEPLWVRDSESGKDILNVEEHARMFPWPLN 120
           +G L+++EEK+LA++LA+SS+ ELVKMC+  EPLW+R +E+GK+++NVEE+ RMFPWP+N
Sbjct: 272 EGGLVLEEEKSLALELAISSVDELVKMCQLGEPLWIRSNENGKEVINVEEYGRMFPWPMN 331

Query: 121 LKQHLINEFRTEATRDSAVVIMNSITLVDAFLDANKWMELFPSIVAKARTIQVISSSVSG 180
           LKQH   EFRTEATRDSAVVIMNSI LVDAFLDA KWMELFPSI+++A+T+QV+S  VSG
Sbjct: 332 LKQHP-GEFRTEATRDSAVVIMNSINLVDAFLDAMKWMELFPSIISRAKTVQVLSG-VSG 391

Query: 181 HASGSLQLMYAELQALSPLVPTREAHFLRCCQQNADEGSWAIVDFPIDTFHDSLQHSFPR 240
           HA+GSL LMYAELQ LSPLVPTRE HFLR CQQN DEG+WAIVDFPID+F+D+LQ S PR
Sbjct: 392 HANGSLHLMYAELQVLSPLVPTRETHFLRYCQQNVDEGTWAIVDFPIDSFNDNLQPSVPR 451

Query: 241 YRRKPSGCIIQDMPNGYSRVTWVEHGEIEEKPIHQIFNHFVHSGMAFGAHRWLAILQRQC 300
           YRR+PSGCIIQDMPNGYSRVTWVEH ++EEKP+H IF+HFV+SGMAFGA RWLA+LQRQC
Sbjct: 452 YRRRPSGCIIQDMPNGYSRVTWVEHADVEEKPVHHIFHHFVNSGMAFGATRWLAVLQRQC 511

Query: 301 ERIASLMARNISDLGVIPSPEARQNLMKLAQRMIRTFSVNISTSGGQSWTALSESPDDTV 360
           ER+ASLMARNISDLGVIPSPEAR+NLM LAQRMIRTFSVNISTS GQSWTALS+S DDTV
Sbjct: 512 ERVASLMARNISDLGVIPSPEARKNLMNLAQRMIRTFSVNISTSSGQSWTALSDSSDDTV 571

Query: 361 RITTRKIVEPGQPNGVILSAVSTTWLPYPHYRVFDLLRDERRRSQLEVLSNGNSLHEVAH 420
           RITTRKI EPGQPNGVILSAVSTTWLP+PHY VFDLLRDERRR+QL+VLSNGNSLHEVAH
Sbjct: 572 RITTRKITEPGQPNGVILSAVSTTWLPHPHYHVFDLLRDERRRAQLDVLSNGNSLHEVAH 631

Query: 421 IANGSHPGNCISLLRINVASNSSQHVELMLQESCTDQSGSLVVYATIDVDSIQLAMSGED 480
           IANGSHPGNCISLLRINVASNSSQ+VELMLQESCTDQSGS VVY TIDVD+IQLAMSGED
Sbjct: 632 IANGSHPGNCISLLRINVASNSSQNVELMLQESCTDQSGSHVVYTTIDVDAIQLAMSGED 691

Query: 481 PSCIPLLPIGFSIVPVVG-------STVDGHPSPPPEDGVA-NSGCLLTVGLQVLASTIP 540
           PSCIPLLP+GF+IVPVV        +T D +P PP  DG   NSGCLLTVGLQVLASTIP
Sbjct: 692 PSCIPLLPMGFAIVPVVPNNDCNIMTTTDDNPMPPSGDGNGHNSGCLLTVGLQVLASTIP 751

Query: 541 SAKLNLSSVTAINNHLCNTVHQINAALGSPASIENGNTIAEPNNAPAPAP 578
           +AKLNLSSVTAINNHLCNTVHQINAAL S    +N +++   +  PA  P
Sbjct: 752 TAKLNLSSVTAINNHLCNTVHQINAALSSICP-DNSSSMVGSSTEPAAPP 796

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
HDG5_ARATH2.5e-19461.67Homeobox-leucine zipper protein HDG5 OS=Arabidopsis thaliana GN=HDG5 PE=2 SV=3[more]
ROC3_ORYSJ2.9e-18258.99Homeobox-leucine zipper protein ROC3 OS=Oryza sativa subsp. japonica GN=ROC3 PE=... [more]
ROC3_ORYSI8.4e-18258.82Homeobox-leucine zipper protein ROC3 OS=Oryza sativa subsp. indica GN=ROC3 PE=3 ... [more]
HDG4_ARATH8.0e-14856.05Homeobox-leucine zipper protein HDG4 OS=Arabidopsis thaliana GN=HDG4 PE=1 SV=1[more]
ROC2_ORYSJ6.6e-12647.62Homeobox-leucine zipper protein ROC2 OS=Oryza sativa subsp. japonica GN=ROC2 PE=... [more]
Match NameE-valueIdentityDescription
A0A0A0LEZ7_CUCSA6.3e-31190.51Uncharacterized protein OS=Cucumis sativus GN=Csa_3G901030 PE=4 SV=1[more]
A5AJ70_VITVI1.0e-25075.25Putative uncharacterized protein OS=Vitis vinifera GN=VIT_02s0012g02030 PE=4 SV=... [more]
M5WW92_PRUPE3.9e-24272.58Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa014792mg PE=4 SV=1[more]
A0A0B2S421_GLYSO5.6e-24171.94Homeobox-leucine zipper protein ROC3 OS=Glycine soja GN=glysoja_014944 PE=4 SV=1[more]
K7LF40_SOYBN5.6e-24171.94Uncharacterized protein OS=Glycine max GN=GLYMA_09G207500 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G46880.11.4e-19561.67 homeobox-7[more]
AT4G17710.14.5e-14956.05 homeodomain GLABROUS 4[more]
AT4G04890.14.8e-12743.55 protodermal factor 2[more]
AT1G05230.16.5e-12444.16 homeodomain GLABROUS 2[more]
AT4G21750.17.2e-12343.34 Homeobox-leucine zipper family protein / lipid-binding START domain-... [more]
Match NameE-valueIdentityDescription
gi|778687851|ref|XP_011652639.1|9.1e-31190.51PREDICTED: homeobox-leucine zipper protein HDG5 isoform X1 [Cucumis sativus][more]
gi|659132081|ref|XP_008466007.1|1.8e-30990.52PREDICTED: homeobox-leucine zipper protein HDG5 [Cucumis melo][more]
gi|778687854|ref|XP_011652640.1|3.1e-30990.51PREDICTED: homeobox-leucine zipper protein HDG5 isoform X2 [Cucumis sativus][more]
gi|1009158962|ref|XP_015897558.1|1.1e-25074.51PREDICTED: LOW QUALITY PROTEIN: homeobox-leucine zipper protein ROC3-like [Zizip... [more]
gi|225427116|ref|XP_002277673.1|1.5e-25075.25PREDICTED: homeobox-leucine zipper protein ROC3 [Vitis vinifera][more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0008289lipid binding
Vocabulary: INTERPRO
TermDefinition
IPR023393START-like_dom_sf
IPR002913START_lipid-bd_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005634 nucleus
cellular_component GO:0005575 cellular_component
molecular_function GO:0008289 lipid binding
molecular_function GO:0043565 sequence-specific DNA binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG15g01980.1Cp4.1LG15g01980.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002913START domainPFAMPF01852STARTcoord: 72..297
score: 1.8
IPR002913START domainSMARTSM00234START_1coord: 70..297
score: 1.6
IPR002913START domainPROFILEPS50848STARTcoord: 61..300
score: 44
IPR023393START-like domainGENE3DG3DSA:3.30.530.20coord: 151..280
score: 4.
NoneNo IPR availablePANTHERPTHR24326FAMILY NOT NAMEDcoord: 276..555
score: 9.3E
NoneNo IPR availablePANTHERPTHR24326:SF260SUBFAMILY NOT NAMEDcoord: 276..555
score: 9.3E
NoneNo IPR availableunknownSSF55961Bet v1-likecoord: 334..551
score: 1.04E-16coord: 62..299
score: 2.97

The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
Cp4.1LG15g01980Cp4.1LG05g00220Cucurbita pepo (Zucchini)cpecpeB273