Cp4.1LG14g04420 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG14g04420
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Homeobox protein
Location: Cp4.1LG14 : 1612991 .. 1624485 (+)
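The location field packs the chromosome (linkage group), start, end, and strand into one string. A minimal Python sketch of how it could be parsed and the gene span computed follows. The function name `parse_location` is hypothetical, and the span arithmetic assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Pattern for a location value such as "Cp4.1LG14 : 1612991 .. 1624485 (+)".
LOCATION_RE = re.compile(r"^\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")

def parse_location(text: str) -> tuple[str, int, int, str]:
    """Split a location string into (sequence id, start, end, strand)."""
    match = LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Cp4.1LG14 : 1612991 .. 1624485 (+)")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the gene spans 11495 bp on the plus strand of Cp4.1LG14.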
The following sequences are available for this feature:

Gene sequence (with intron)

CAGACGGTGCGCAGAGAGTAACGGTGGCCCAGATTTGGAGGACTCTGAGACAAAGCCGCTTTATTCTCTCTTCGCGCTGAAAAGAGAAAGAACAAGAAGAACACCCACCACCATCGCCAAAATTCTTACACTTTTCTTCCAAAGCTCCATAAATAAACATCTCCACCACCAACTAGATCCGTAGCCGCCCTCTTCCATGGACAATACTATGCTCTTCGCCAGCACCAGGTACTTCTTTCCAATAGGTTTTTTTTTTTTTTTTTTTTTTTTTTTCTTTTACGTTCTTATGTTTCTGCTAGATTATGTTTGTTTCTTGAGAATTCGTGCGAAACGGAAAATGGCGTTATGGAAGCTAATGCATTGCATTGTTCTTCTGTTCAAATGCTTAGCTTGTTAATTGCATTTGTTTCTTCTCTTTCCTTTTTTCCTACTTTGCCGTTTTGTTTGCTTAAACTTCGGAGAAAGAAGCGTAATGATTCTGGAGTTATTTTTTTCAACATGTTGTCTCTCCTTTTTGAAGTATTAAACCAGGGACTGACCTCTTCTATCTCGAAGAATTTCATTGCTAAAATCTTGATTTTGATGAAGTCGAGCTTAATTTTTTCCCTTGGAAGCGTAGTTCCACGTAATTGAAGTTATTTGTGAGTTTGAACGTGAAAATTTCTGAGGTGAGGATTGAAGTACGTGCCAACCTCAGCTCTGCGTATGGTTGGGCATTTTGTAACTCCCGTAGAGCCTCGCCCTCATTTTCTTTGCAATTTAATATTGTCTCAAATTTTGAATTAATAATGGTCATTAGAATTTGTTTGTACTCTCTCTCACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAANTAAATATCCGTTCAGTTCATAAAGTTTGGTTCTAGGTAAAAATTACTACATAGAAGCAATTTTGAGTAAATAAAACTTAATCTTAGAAATAATCGTTTAGTTAAGTTTTTTTTATTATTTATTTTTAGTTTATGTGCCTTTGTAGTCATATGCTTATAAAGGAAATATTTTGAGATTGTTTTCAAGTAAGATATGTATATATATATATATATATATTTCAGTAATGAATTTAGCTGATACTTTTTCTGATTCTGAAATGAAAGAGTGGATTGAAGAACCCAAAATATCTCTTGAATGTGATAAGATTCAATTTTATTATTTGTCACTACTAGCCTTACATCCCTTAAGATGGTTGATGGTAGTCATCTTGGTGTCTCTCCAGTGCCAGATGGTCAAGATTGATGATTGTCCGTGGAGAAGTTCTGTGAGGACAATGCTCCAGGAGGTCCATATACGTATGTAGAATATATATATATATATATATGTATTTAGCATTCAGGAACAGAAAATTGCCACTCCCTGGCAGTTGTCGAAGTTGAAAAAATGAACTCCGGATTGATTGAGTAGGAATGGGCGCTTTAATCTCCAGAAGTGATAAGCAAGAATCTACGTAGTCTTAGTGGTTGATTCTTTGGCCTTCAAGTCTGGCAACCTGTAGTTGATGTAATCAGGAATAGGACTAGTCAAACTAGAGATTAGTTGATAGTCTTGGGGGATCACATGGAAGAAAGAGATGAATATACAGAATCGAGAAGTAATAATAATGCTGAAGCCGTACAAGAAGCCAAGATCAGTGTTGAAGCTGAAATGCCAACTTGTCTTTCAAATGAGCAAAAGCATTCAGTTCCTGATTATCATGAATTGGAAGCAACTCCAGAATATACCAACAAAACTGGCGGTCCAGATGAAGAAAAGCCAGAGGTCCAGCAGAATATGGAGGAAGAGAATAAGGAACTTGGTTCAGGAGACGTGCTTAGTGAATTATCAGAAAAACACAATCGGACTTTCTCTAACCTTGCTGATAATGATCAAGTTGAAGCTGGTAATTTATTATGCTGTGATAAAGATACCGAAAATTTGATAGTACCTATTGAAGTTGAGACAACGACTCTTCTTGTTGACTGCTCTG
AACTTCCACCTGAAGTTGTCAACAAAAACTATATTGAACAGATGAACCCTCCTATTGAAGAATTAACCCAAAATACTCCTTTTCAAAATTTAGAAACAGTCCCCAGTAATTCAGAACAATCGGATCACAAGGATAAGAGAATTTTGAAATCAATGAAGATAAAGTCTATTTTAAGGTCCCTTGTAAGTAGTGACAGAAATATGCGTTCAAAGACCCAAGAGAAAGATAAAGATCCTGAACCAAGTAATGACTTGAATAATTTTACTGCTGAAGAGGGAAAAGGGAAGAAGAAGGAGAGAAATATACAAGGAAAGGGAGCAAGAGTCGATGAATTCTCATCAATCAGGAATCATTTGAGATATTTACTGAACCGCATCAAATATGAACAGAACTTGATTGAAGCTTATTCTAGTGAAGGCTGGAAAGGGTTCAGGTATGTATTTTCACCTTTAATAGATCTTACATTGACTTTGTCCTGATACTTTCTTGTATTACTTTGTCTTTGGTGATTTTGATAAATGGTCATCGACTCATTGTGTAAACGCCTCCACTGATTTTGTAGTGTTTGTGTACTTTGAAATGCCCCCCACCCCACCGCCCCCCTCCCCCCTCTTAAGTAATTGTTCGATTTTCATACAGTGCGTAGTGAATACTAAGGAAAGGCAGGACGGTAAATCTCCTCCAAGCCATCCAAACACAAAATTTAAACTCTCCTTGCAGGTCTCTTCCCCATGGTTAAGAGTGGATCTAATTAAAAGATTGTTACAAGCTTACAGCCTTACGTAGTTCCTCAGTGTTGGGTTTGGTACCCTTGTTGATCGTTTCATATTTAGCAGCTATTACTTACAAGTGATCACTCTATCTTCTTAGTTGCTTTCAAGATTTATGACTTCGACCAAGTCAGACTTCTCATACTTCTAATGACAATAAACCTAGAAGTTAAATCTGTTGTTATTTCATTCTGAAATGAAGCGGATAGCAATTTCTATACACTTATGTTCGGATCAATATTGAAGTTCTTGTTATAATTTGATCTTTGTTTCAGTCACTAATGTTCTTTATGATTTTTTTAATCTATTCATTTCATCTATCATAATTGATGTAATGAAATGCATGGCATCTGCTGATCATTACTATAATAGTGCAGCTCAGATAAATTGAAGCCTGAAAAGGAACTTCAACGGGCATCAAATGAAATAATGCGACGCAAATTAAAAATAAGAGATGTATTTCAACGTATTGATGCACTTTGTGGCGAAGGAGGCCTTTCTAAATCTTTATTCGATTCTCAAGGACAGATAGACAGTGAAGATGTAGGGGGATCTTAAGTTTAATTTTTACTAGATGTTATCTTTTGATTCATTCTGCATAAATTTTGGCTCTGTTAAAGCATCTGCTTTTTGTGATAAGTGACTTCATTGTGAACTTCTGATCTTGCATCAGATATTCTGTGCAAAATGTGGATCCAAAGAATTGTCCCTTGAGAATGACATCATACTATGTGATGGTATTTGTGATCGTGGGTTCCATCAGTTCTGTTTAGAACCACCTTTGCTAAATACAGACAGTAATATTCTCATTGCCAACTAGATATTAAAATTAAAAAGTGGATTTGTGTTTCTCAAGTTACTTTTCATTCATGCTCTCTCCTTTCCTCTCACAGTTCCGCCGGATGATGAAGGATGGCTATGCCCCGGATGTGATTGCAAAGATGACTGCTTGAATCTGCTTAATGAATTTCAAGGATCAAGACTTTCCATCACTGATGGTTGGGAGGTAATTAAAGTTTTGCAACCTATATTAAAATAGGGTTGTTTTGGGTGTTTGTTATAGTGTCATTTTCCTCTAACTGTGTGCAGAAAGTCTATCCCGAGGCCGCAGCATCAGCTGCTGGACGAAATTTTGATCATGCCTTAGGTCTTCCTTCAGATGATTCTGAAGATGATGATTATGATCCTGATGTTCCAGATACTATTGTCCAGGACGATGAATCAAGTTCT
GAAACATCTGGGTATGCTTCTGCTTCTGAGGAATTAGAGTCTCCATCCAATGTTGACCAGTACTTAGGTCTCCCTTCCGATGACTCGGACGATGATGACTATGATCCCAGTGCTCCAGAACGTGATGAAGATGTTAGACAGGAAAGTTCTAGTTCTGACTTTACATCTGATTCTGAGGATTTAGCTGCACTTGACAGTAGCCCCTCTTCCAAAGCTGATAACCTTGTGTCTTCATCACTGAATAATACTACGTCTACGAAAAACCCTGATGGGCGAAGTTCTGGAGGTGGTCCTAGAAAGAGTGCACTGTATAATGAGCTATCAAGTCTACTAGAGTCCGATCCTGAACCTGTTTTGGGAAGAAGACAGGTGGAACGGTTGGATTACAAGAAGCTCCATGATGTGAGTATTCTTTTATAATCATAATAATATTTATATTCTTATGAAAGAAAAAATTATATTGCTATTTTCCAATGTATTCTTTGAAGTATATGGGCGACTATCTCTCTACATTATGCAAGCATAGTTGATTGACATGTTTTTTTCCTTTTTTCTTTTGCTTTGTTCTAAAGAATACAACTTATAGATGTTAGGTGGGGAATTCAAATATTTGATAGGAAGGTTACTAGTGTCTTAATCAATTAAGTGATGTTTTGGTTAATAGTAGGCTCTTATCTTGAAAATTTTCCATCAATTGTAAATGATTTCTCATTCTTTTATCCTCTTGTAAATTATTTTTTAGCTACTTTTGTTTGTTCTTTTGTCAAAATAATGATAATAATAAATGAAAACAAGGCCTTGTTCCCATCAAGATGGCTTTAGTCCAACATGTTTATGTGTCTTGTGTGAATAAAATGAATTTACCGGGCATTAAATGTTTAATAGACGAAGTCTAGATATTTAACAGGGCCCATGCTTAGTTTTAGACTGAATGAGTTAAAGAATTCCCATACATTTAAATTATTTCTCTCTTTCCTTATTGAAGGAGACATACGGGAATGTTCCTACCGACTCAAGCGATGACACGTACGCGAGTATTTCTACGGATTCAAGTGATGACCAAGGCTGGGATAGTAATACAAGGAAGAGAAGTCCTAAAACCCTGGTTCTTGCACTGCCAAATTATCGAACTAATGATGATATGACTAACGTAAAAACTAAACACAGTTCTAAGAGGGGTACTCGTCAAAAGGCAGCTGCTGTAAATATGAATAAATCTGTGACTAAAACTCCTGAAGACACTGGAAAAGCTAGTTCTTCTGTTAGGAGAACCACACCATCATCGTATAGAAGACTCAGTCAACTTGCATTGGAGGTAATTTTTTTCCCTTTTCCTTTTACCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNTCTTTGCAGCTCTATTCTGATTCCAAAACATTGTCCTTAAAAAGATGTTATTTTTTTGTGCCGTTTTGTTCAAGCTGGGATTTCCTATTTCATTATTTTGTCTTACTTTTTGCTTTTTTTTTTTTTTTTTTTTTTTTNGGGGTTGGGAGACAAGGTTTTATTCAAAGATTTGAATCCCCTAGCAACTTTACTTAAGAATGCATGCTGGTGCTTGGATATATTGCCTCAAAGGAGATGTATATTGGAGTTACATAATTTCAACCATGCGATTTATTAGATTTGTTGTTCCAAAATCTCTTATCATTCTAAAATTTTGACTCTTAATCCAGTCTGAGTGGTCTGAATAATTGCTGAGACACTAAAAAACAGTTTTCTTTCTGTCTGATTCCTGTATTGTGAAGTCTTACTCTCAGTAAGGCAGTTATATATAGGAAATTGAACTGTGATGTGTAAATTTGAGCTGAAATGTCATTCCTTTTTGTGCTTGGAACAAGATTTTTTCTTGATTATAGCGAGTAATGAGCAGCTTAGCTCACTTTATATCCCTTGGATCTTCTCTTCTTTTTTCTTTGCAACTGAAGGTCACAATTTGGAGCTATTTAT
TTGAAGAAATACGTCTTGGTTTCAACAAAGATGCTTTACCACAAGTCTATAAGTTATCGCTTCCATGACATTGTCAGTTAGATTAGTTATCAACACCCTTCATCGTCTTTAGACTATTTATTGCGAAATATCAGTTTCTCTGTGCTTGTTTAGTTCACTTGGTTTTTAAATAAAACATGCCTTGGATGGAAAGGAGGCAATGAGATATTTGTTTCTCTGTTGTGGGAAATAAAATTTAGTATTAACTCCGAAATATCAAGTTTTTTTAATGCCACGTCTTGTTCCTAATATTGTTGCAAATGTTCCAGAGACTTTTAGCATCATTCCAAGAAAACCAGTATCCTGAACGAGCTACAAAGGAAAGTCTGGCACAAGAACTAGGGCTCAGTGTGAAGCAGGTAATGCATTGAGTTCCTTTTGAAGTTTTAGCGAACATTTTATTCTAGCGAAAGGCTCAACTATTTATTCCATTTTCAGGTTAGCAAATGGTTTACGAACACACGTTGGAGCACACGCCATCCCTCAAGCGTTGAGGGTAATAAAGCGAAGAGTTCCTCAAGAATGGGCATTCGTTCATCTCAGGCAAGTGGAGAGCTGCACCAGCCCGAGCAAGAATTTGGTGCCCAACATCAAGAATTACCAACGACAGATAGTGTTGTGGCCCCATGTCAGAGTGGGGATACAGGGGATGTCAAATTGGCAACTCAGGAAACTAAAAGATCAGAATTTTCTGCCGCAAAATCCAGAAAACGGAAGGGCAGGTCAGATCACGCTGCATCATGTTCAAAGGACAGTAAGGAATCACAAAGGCCTCCTGCCAAGTCTCCAAAAGTAAATGAAATCCAAACAGCACATAGCATTAAGACGAGGAGGAGAAATTCCTTATAGGTTCTGTAAGTTTCGATCATTCACGGGAAATTATTTGGCTATTTACTCTCTTCCAAGATACACATTAGAACAAATGCGGGCATCGTCTGAAGGGAGGTAATTCCAAGAACAATCACAATTGTTTGTAGTTCTTTGTAACCACATTAGAATTCCTCTTTCTAATTCCTCACACATAAATGCTTAGTCTTAGTTATGTAAATCTAGAGCTTGTTGTTAGATATTTATTTGTATTTCAAGAGACCCTTCTCCTGATGTGTTAAACAATTAATTAGTTGCAGATATTGCTATGTTAAAGAACAAAAGAGCTACTGCTACCCTCCAGCTTCGGAAGCTTTGCTGCGGATATATCACTTATGGATGCTACATTGCTATCAAGTCTAAATTGCTCTTGGCTTTTTTATCCAGATTTTAATGTCATCATCGTTAGTGAAATATTCTGCAATTAATCCTGTATGAGATGTGAACTTTAGGTTCTGCAGAGTCAACATAATGTAGTTCTCTTCACACATGAAAGCAATCGGATCTTCTGCGCGCATCCTGCGTTTATCCATTTGAATGCATCAAAATCCTGTCAACCACTGAAATTGGTATCAAATCCTCGTCCATGTAGATGTAACTTATTTCCAAGCTTTCTGGATGTTATCATCTTCTAATTCACAGGAAGCTGGAGCTCTGGGTGACCTGTGATTGAGCAGTGAAATGTGTATGGGCCATTTCTCCCAAATGGGAAACCACTGGAAGAGATGACAGAAGCAAGAATCTTGAAGTTTCAAATCATTATTGATCTTTTTTTGTTCCCCCTTACTGCCTACTTAGTTTGATGTAATTTACTCTACAACTTATAGATGAAACAAATCAGGCCTCTTGTGTTTCCATAAGCAGAAGAGATATAACATTAATGCTGCTTCTTCCTGGTTTTACATAGGATTCTAAGCACTTCCCCTAAGGTTAGTCATGTACTTGATATATGACGACACAATCAGAATGAAGAAGATGACTCCAGAGTTCTTGACAGACAGATACAAGAATTAAAGTCGGAGATTCAGGTCGAGCTTGAATGGTTCCATTTGCGGTTCTCGTAGTGGTGGTGACTCTGGGAGAACTTGATGTTG
AGTGAAGCGGAAGCTTCCTGGCCACCTCAAGTCCATGGCATCCCCCAGCATGAAAGGTGCCCATGAACCACTTTTTACATGATTAAATCTTGCTACTATTGCAGTATCTTCCCTGCTTGTTTTGTGCACCAGTGAATGTGGTTGAACACCAAGTGATCTAACCATAGAATTATGTATTGGCAGACCAAGCAGAGTCATCATTCTCTGGGCTTGGTGCCTTCTGGCTGCACTCCTCTCCCTCTTGTGGGCATTTTGATGGCCTCCCAATGCCTGTGAGCTGTAGAATACTCTCTTACAGAAGTTGCATGAAAAGGTCTTGGCTGAACCTGCAGAGCTTTGTGACTCAGTTAAAAGCTGTTTGTTCCTACCCAGGCTCAAGTTCAGCCACTCCAAAGCTCCTACACCTTCAGGTTGGTCTGTTGCTTCTTCTTCTTCTTCTTCTTCTTCAACTGATTCATCTTCTCCTTTTTTCTCCTTCAATGATTCAATTTGTTCTCCTTGAAAGATCATTGCTAACCCAGATTCCTAAATTAGTAGCGATGATACTGAACTATAAAATGGACAGGATTGCTGATTCTTTATCTAGCCCTGAAGCTACTGAAGCTCTGGGAATGGTAACAACTCATTGATCCTTCTTAGATTCCATTTTCCAAGATGAACAAATGCCGAGGAAGTTGTGAGAGGATGAAAGTGTAAGTTTTCGTTTATATGTTTCTGCATTTCTTATCTTCATTCTCCTCCCGACCATTAGTGGTCCAGTGGACAATGCTTATGTCATCTGTACTGTTTCCCTTTCAAACTAAAGGATTCCCCACTTCCATCAATCTTTTCCAATCCCTAATTTCTTAATGCACTCAGATTATACCACTTATTTACGTTGGAGATGGCTTCAGGGAATCAGGTATTCAAGCCCCTTCATAAATCTAGATATATTTTAAGTTAGGTTCACATCATGGTAGCTACTTATTAAGGATATCGAGTATTCTATTAATTTTTTACTCAAAATATCTTACGTTAGATAGTTGCCTTGTGAGAAGTGTACAATCCTTTGGCGTTCACGATTATGAAAAAGCTCACGAGGACAGTCAAAGTTTGAATTCGAAATCACTTCAATTATTTGATTCAGCCCTACAAAGAAAATACTTGCGTAATGAAAGATTGGTCAATATTCAAACTCGAGATAATGTTGATTTTCTTTTGTCTTTGTATTGCTAGCTTCTTAGCCACATCTTTAGGAGCCCTATACCAACCTTACACAAAACTTGTTTGTGGGAAACTTTTGAGAAAATACATGATTGAAAGTTGTACAAAAACCGCAACATCAATATCAGTCACAATCTAACAAGCTGTTTGAATTGTCCCTCCAATGCATCGAAGGAGTACACTTCTTATCCAACACAAGTAAACCGAATCACTTTTGAAACTACTTTTGTGGGATTCTTTTCCAAGGCTGATGTTGGCAACACATATCTCTACAACCAAGCGATTTCTTTCGGGTCCTCATTCTGTTTGGCAAAAGAGAACAGGTAAAAAAGTTGTGAGAAGAAGCATCACACAAAGCAGATAATGGTGGTTTAAGCATCTTTATTAGATATAGATGGCTTTGCATGTTGACCTAATTGTACATGATTAGGATAATGATGACTTAGTTTTATTATACCCTTTTCATAAGCATTCATTCAGCCTAATCACATGCCTTCTACAGGGGAAATCCACTCAAGGGCCACCTTTGTATGCATTGCTAATAAGTTCTTTCTTTCCATTATACTCCCATGGAAAGGGATGGTACCCCCACAAAGAAAAGATGGGTCTTTTGGCTAGTGAAGGGTAATGTTTCTAAATAATATTTGATTGGTACTTCTGTATTTATTTCTTGAGGGGGCGGTGGAAGGATAAGAAAGGGAAGGTACTTATTTCTTCCACTTTTATTTATGCAATGTGGATAACCCTGAATTGGTTAGTTTGTCTTCTATGTTGAGGTTTGCTNGCTATTGAATTTG
GACAGGAAATTGGAAGTACTTACTCATGTGATCTTGTCTGCAAAACTTTAGGCTAGGCTAAGACAGGATCTTTCTCTTCCTGTCTCCTGGTGAGAAGAGAAAGGGGAAGAGGACATTTGGATCTTTACCCTTCATGAACTTAACTAATGTTGTCTAATTCTGTTCGATTTGCTCGTTGCTTTATTTCGTGTCTTAAGAGCGAATTGACTAGAACTTGTGTGAGTGGTTTTCTCTGGATGACAGCATTGCTCATGACAAAAAAGTGAAATATAGTATAGATATTCTTAAGCAAATGTAAATAGTGGGAGAATAGATTATGAATACCTTTACATGTGAAAGTGAATGTGAAATAAAAGGCAATGTAGGGAGACTTGCCTCAGAAGACTGTCAAAAAGAAAAAACCTGTCAAGAAACGTTACAAGTAGAAGGTAAGAATCCATTCAACCAAAATTTAGTAAAGAATTTGTGGCAAAGAGTAATAATATTAATGGATAAACTTGACCTCAAAAGGTTGAATCATGAGGCATCAACTAAGGCATTATAAAGAAAGCCTGACAACCACTCTGATCAAAGAACACAAAATTGCATTATCTTGACAGTGCGATTAAATATGATGGAAAATGTGAGTTACTGTTTGATGACACTATAGTAGAAGCTAAGCTTGAAATAAAGGGTTCGTAAATCATGGGGATGGTGATCATGCAAGCAGTGAAGGAAAGTCAAGTGGATTCTCTCTACTTTTTAAGTGCACTTACTTTCTAACTGGGGTTCAGAACAGTCCTCTTCTGTATATTAGATATACAGTCGTGAGAGAGAAAGAGTTTTACAATTTGAGATTGAAAAATCTCTCCATTCTTCCTGCCACTGACTTAACCCTTCCACTATGAATATCAATTAGCCCAAAAAAAATCAGCTTTTTACGTTGTAGCTAGAAGCTCAGTTACTTCTAAGGTAAAGGAATTGTAGCTAAGAGGAAATAATCAAAAAATTGTTTGAAGAAATACCTGAATCCTTAAGGAGGAATCTTCAAACTCAGCCCGTTTTCCAAGACTTGTTGCTGCAAATGAACATTAAAACCTTAGATGGGGTCATAGATCGAGACTGGAGAGCCCATTTCTGTCCTTACTCTGTTACCATTTCCAATCAAGTTCCTTATGGAGGGTGGGGGGAAGAATAAGTGATGGGGTTTGAAGGTCCCCATTGTTGAATAGCAGGTAGTATTTTGGGTTTCCTGACAAGTTTTTATCAAAGGAAGCAGGAGGGGGCCAGTCCAGATGTAGGAAACAACACCACCAGCTAGTAGGCTGAGGTTATGGGCCGTAGAAATCAATAAAGATCTCAAGTGCACCTGCAAGCAGCCATACATGAATTACATTACATTGTGGACCAAATAAAACCATAAACCATAAACCTGGCTTTATAAACATGATATGTTAGTGTCAATCAAAATCGGCTTCGGCAATCCCATGTGCCTGTTGCTATTGCGCTCCCACCT

mRNA sequence

CAGACGGTGCGCAGAGAGTAACGGTGGCCCAGATTTGGAGGACTCTGAGACAAAGCCGCTTTATTCTCTCTTCGCGCTGAAAAGAGAAAGAACAAGAAGAACACCCACCACCATCGCCAAAATTCTTACACTTTTCTTCCAAAGCTCCATAAATAAACATCTCCACCACCAACTAGATCCGTAGCCGCCCTCTTCCATGGACAATACTATGCTCTTCGCCAGCACCAGATGGTCAAGATTGATGATTGTCCGTGGAGAAGTTCTGTGAGGACAATGCTCCAGGAGGTCCATATACGTATGTAGAATATATATATATATATATATGTATTTAGCATTCAGGAACAGAAAATTGCCACTCCCTGGCAGTTGTCGAAGTTGAAAAAATGAACTCCGGATTGATTGAGTAGGAATGGGCGCTTTAATCTCCAGAAGTGATAAGCAAGAATCTACGTAGTCTTAGTGGTTGATTCTTTGGCCTTCAAGTCTGGCAACCTGTAGTTGATGTAATCAGGAATAGGACTAGTCAAACTAGAGATTAGTTGATAGTCTTGGGGGATCACATGGAAGAAAGAGATGAATATACAGAATCGAGAAGTAATAATAATGCTGAAGCCGTACAAGAAGCCAAGATCAGTGTTGAAGCTGAAATGCCAACTTGTCTTTCAAATGAGCAAAAGCATTCAGTTCCTGATTATCATGAATTGGAAGCAACTCCAGAATATACCAACAAAACTGGCGGTCCAGATGAAGAAAAGCCAGAGGTCCAGCAGAATATGGAGGAAGAGAATAAGGAACTTGGTTCAGGAGACGTGCTTAGTGAATTATCAGAAAAACACAATCGGACTTTCTCTAACCTTGCTGATAATGATCAAGTTGAAGCTGGTAATTTATTATGCTGTGATAAAGATACCGAAAATTTGATAGTACCTATTGAAGTTGAGACAACGACTCTTCTTGTTGACTGCTCTGAACTTCCACCTGAAGTTGTCAACAAAAACTATATTGAACAGATGAACCCTCCTATTGAAGAATTAACCCAAAATACTCCTTTTCAAAATTTAGAAACAGTCCCCAGTAATTCAGAACAATCGGATCACAAGGATAAGAGAATTTTGAAATCAATGAAGATAAAGTCTATTTTAAGGTCCCTTGTAAGTAGTGACAGAAATATGCGTTCAAAGACCCAAGAGAAAGATAAAGATCCTGAACCAAGTAATGACTTGAATAATTTTACTGCTGAAGAGGGAAAAGGGAAGAAGAAGGAGAGAAATATACAAGGAAAGGGAGCAAGAGTCGATGAATTCTCATCAATCAGGAATCATTTGAGATATTTACTGAACCGCATCAAATATGAACAGAACTTGATTGAAGCTTATTCTAGTGAAGGCTGGAAAGGGTTCAGCTCAGATAAATTGAAGCCTGAAAAGGAACTTCAACGGGCATCAAATGAAATAATGCGACGCAAATTAAAAATAAGAGATGTATTTCAACGTATTGATGCACTTTGTGGCGAAGGAGGCCTTTCTAAATCTTTATTCGATTCTCAAGGACAGATAGACAGTGAAGATATATTCTGTGCAAAATGTGGATCCAAAGAATTGTCCCTTGAGAATGACATCATACTATGTGATGGTATTTGTGATCGTGGGTTCCATCAGTTCTGTTTAGAACCACCTTTGCTAAATACAGACATTCCGCCGGATGATGAAGGATGGCTATGCCCCGGATGTGATTGCAAAGATGACTGCTTGAATCTGCTTAATGAATTTCAAGGATCAAGACTTTCCATCACTGATGGTTGGGAGAAAGTCTATCCCGAGGCCGCAGCATCAGCTGCTGGACGAAATTTTGATCATGCCTTAGGTCTTCCTTCAGATGATTCTGAAGATGATGATTATGATCCTGATGTTCCAGATACTATTGTCCAGGACGATGAATCAAGTTCTGAAACATCTGGGTATGCTTCTGCTTCTGAGGAATTAGAGTCTCCATCCAATGTT
GACCAGTACTTAGGTCTCCCTTCCGATGACTCGGACGATGATGACTATGATCCCAGTGCTCCAGAACGTGATGAAGATGTTAGACAGGAAAGTTCTAGTTCTGACTTTACATCTGATTCTGAGGATTTAGCTGCACTTGACAGTAGCCCCTCTTCCAAAGCTGATAACCTTGTGTCTTCATCACTGAATAATACTACGTCTACGAAAAACCCTGATGGGCGAAGTTCTGGAGGTGGTCCTAGAAAGAGTGCACTGTATAATGAGCTATCAAGTCTACTAGAGTCCGATCCTGAACCTGTTTTGGGAAGAAGACAGGTGGAACGGTTGGATTACAAGAAGCTCCATGATGAGACATACGGGAATGTTCCTACCGACTCAAGCGATGACACGTACGCGAGTATTTCTACGGATTCAAGTGATGACCAAGGCTGGGATAGTAATACAAGGAAGAGAAGTCCTAAAACCCTGGTTCTTGCACTGCCAAATTATCGAACTAATGATGATATGACTAACGTAAAAACTAAACACAGTTCTAAGAGGGGTACTCGTCAAAAGGCAGCTGCTGTAAATATGAATAAATCTGTGACTAAAACTCCTGAAGACACTGGAAAAGCTAGTTCTTCTGTTAGGAGAACCACACCATCATCGTATAGAAGACTCAGTCAACTTGCATTGGAGAGACTTTTAGCATCATTCCAAGAAAACCAGTATCCTGAACGAGCTACAAAGGAAAGTCTGGCACAAGAACTAGGGCTCAGTGTGAAGCAGGTTAGCAAATGGTTTACGAACACACGTTGGAGCACACGCCATCCCTCAAGCGTTGAGGGTAATAAAGCGAAGAGTTCCTCAAGAATGGGCATTCGTTCATCTCAGGCAAGTGGAGAGCTGCACCAGCCCGAGCAAGAATTTGGTGCCCAACATCAAGAATTACCAACGACAGATAGTGTTGTGGCCCCATGTCAGAGTGGGGATACAGGGGATGTCAAATTGGCAACTCAGGAAACTAAAAGATCAGAATTTTCTGCCGCAAAATCCAGAAAACGGAAGGGCAGGTCAGATCACGCTGCATCATGTTCAAAGGACAGTAAGGAATCACAAAGGCCTCCTGCCAAGTCTCCAAAAGTAAATGAAATCCAAACAGCACATAGCATTAAGACGAGGAGGAGAAATTCCTTATAGGTTCTGTAAGTTTCGATCATTCACGGGAAATTATTTGGCTATTTACTCTCTTCCAAGATACACATTAGAACAAATGCGGGCATCGTCTGAAGGGAGTTGCAGATATTGCTATGTTAAAGAACAAAAGAGCTACTGCTACCCTCCAGCTTCGGAAGCTTTGCTGCGGATATATCACTTATGGATGCTACATTGCTATCAAGTCTAAATTGCTCTTGGCTTTTTTATCCAGATTTTAATGTCATCATCGTTAGTGAAATATTCTGCAATTAATCCTGTATGAGATGTGAACTTTAGGTTCTGCAGAGTCAACATAATGTAGTTCTCTTCACACATGAAAGCAATCGGATCTTCTGCGCGCATCCTGCGTTTATCCATTTGAATGCATCAAAATCCTGTCAACCACTGAAATTGGAAGCTGGAGCTCTGGGTGACCTGTGATTGAGCAGTGAAATGTGTATGGGCCATTTCTCCCAAATGGGAAACCACTGGAAGAGATGACAGAAGCAAGAATCTTGAAGTTTCAAATCATTATTGATCTTTTTTTGTTCCCCCTTACTGCCTACTTAGTTTGATGATTCTAAGCACTTCCCCTAAGGTTAGTCATGTACTTGATATATGACGACACAATCAGAATGAAGAAGATGACTCCAGAGTTCTTGACAGACAGATACAAGAATTAAAGTCGGAGATTCAGGTCGAGCTTGAATGGTTCCATTTGCGGTTCTCGTAGTGGTGGTGACTCTGGGAGAACTTGATGTTGAGTGAAGCGGAAGCTTCCTGGCCACCTCAAGTCCATGGCATCCCCCAGCATGAAAGACCAA
GCAGAGTCATCATTCTCTGGGCTTGGTGCCTTCTGGCTGCACTCCTCTCCCTCTTGTGGGCATTTTGATGGCCTCCCAATGCCTGTGAGCTGTAGAATACTCTCTTACAGAAGTTGCATGAAAAGGTCTTGGCTGAACCTGCAGAGCTTTGTGACTCAGTTAAAAGCTGTTTGTTCCTACCCAGGCTCAAGTTCAGCCACTCCAAAGCTCCTACACCTTCAGGATTGCTGATTCTTTATCTAGCCCTGAAGCTACTGAAGCTCTGGGAATGGTAACAACTCATTGATCCTTCTTAGATTCCATTTTCCAAGATGAACAAATGCCGAGGAAGTTGTGAGAGGATGAAAGTGTAAGTTTTCGTTTATATGTTTCTGCATTTCTTATCTTCATTCTCCTCCCGACCATTAGTGGTCCAGTGGACAATGCTTATGTCATCTGTACTGTTTCCCTTTCAAACTAAAGGATTCCCCACTTCCATCAATCTTTTCCAATCCCTAATTTCTTAATGCACTCAGATTATACCACTTATTTACGTTGGAGATGGCTTCAGGGAATCAGGTATTCAAGCCCCTTCATAAATCTAGATATATTTTAAGTTAGGTTCACATCATGGTAGCTACTTATTAAGGATATCGAGTATTCTATTAATTTTTTACTCAAAATATCTTACGTTAGATAGTTGCCTTGTGAGAAGTGTACAATCCTTTGGCGTTCACGATTATGAAAAAGCTCACGAGGACAGTCAAAGTTTGAATTCGAAATCACTTCAATTATTTGATTCAGCCCTACAAAGAAAATACTTGCGTAATGAAAGATTGGTCAATATTCAAACTCGAGATAATGTTGATTTTCTTTTGTCTTTGTATTGCTAGCTTCTTAGCCACATCTTTAGGAGCCCTATACCAACCTTACACAAAACTTGTTTGTGGGAAACTTTTGAGAAAATACATGATTGAAAGTTGTACAAAAACCGCAACATCAATATCAGTCACAATCTAACAAGCTGTTTGAATTGTCCCTCCAATGCATCGAAGGAGTACACTTCTTATCCAACACAAGTAAACCGAATCACTTTTGAAACTACTTTTGTGGGATTCTTTTCCAAGGCTGATGTTGGCAACACATATCTCTACAACCAAGCGATTTCTTTCGGGTCCTCATTCTGTTTGGCAAAAGAGAACAGGGGAAATCCACTCAAGGGCCACCTTTGTATGCATTGCTAATAAGTTCTTTCTTTCCATTATACTCCCATGGAAAGGGATGGTACCCCCACAAAGAAAAGATGGGTCTTTTGGCTAGTGAAGGGTAATGTTTCTAAATAATATTTGATTGGTACTTCTGTATTTATTTCTTGAGGGGGCGGTGGAAGGATAAGAAAGGGAAGGTACTTATTTCTTCCACTTTTATTTATGCAATGTGGATAACCCTGAATTGGTTAGTTTGTCTTCTATGTTGAGGTTTGCTNGCTATTGAATTTGGACAGGAAATTGGAAGTACTTACTCATGTGATCTTGTCTGCAAAACTTTAGGCTAGGCTAAGACAGGATCTTTCTCTTCCTGTCTCCTGGTGAGAAGAGAAAGGGGAAGAGGACATTTGGATCTTTACCCTTCATGAACTTAACTAATGTTGTCTAATTCTGTTCGATTTGCTCGTTGCTTTATTTCGTGTCTTAAGAGCGAATTGACTAGAACTTGTGTGAGTGGTTTTCTCTGGATGACAGCATTGCTCATGACAAAAAAGTGAAATATAGTATAGATATTCTTAAGCAAATGTAAATAGTGGGAGAATAGATTATGAATACCTTTACATGTGAAAGTGAATGTGAAATAAAAGGCAATGTAGGGAGACTTGCCTCAGAAGACTGTCAAAAAGAAAAAACCTGTCAAGAAACGTTACAAGTAGAAGGTAAGAATCCATTCAACCAAAATTTAGTAAAGAATTTGTGGCAAAGAGTAATAATATTAATGGATAAACTTGACCTCAAAAGGTTGAATCATGA
GGCATCAACTAAGGCATTATAAAGAAAGCCTGACAACCACTCTGATCAAAGAACACAAAATTGCATTATCTTGACAGTGCGATTAAATATGATGGAAAATGTGAGTTACTGTTTGATGACACTATAGTAGAAGCTAAGCTTGAAATAAAGGGTTCGTAAATCATGGGGATGGTGATCATGCAAGCAGTGAAGGAAAGTCAAGTGGATTCTCTCTACTTTTTAAGTGCACTTACTTTCTAACTGGGGTTCAGAACAGTCCTCTTCTGTATATTAGATATACAGTCGTGAGAGAGAAAGAGTTTTACAATTTGAGATTGAAAAATCTCTCCATTCTTCCTGCCACTGACTTAACCCTTCCACTATGAATATCAATTAGCCCAAAAAAAATCAGCTTTTTACGTTGTAGCTAGAAGCTCAGTTACTTCTAAGGTAAAGGAATTGTAGCTAAGAGGAAATAATCAAAAAATTGTTTGAAGAAATACCTGAATCCTTAAGGAGGAATCTTCAAACTCAGCCCGTTTTCCAAGACTTGTTGCTGCAAATGAACATTAAAACCTTAGATGGGGTCATAGATCGAGACTGGAGAGCCCATTTCTGTCCTTACTCTGTTACCATTTCCAATCAAGTTCCTTATGGAGGGTGGGGGGAAGAATAAGTGATGGGGTTTGAAGGTCCCCATTGTTGAATAGCAGGTAGTATTTTGGGTTTCCTGACAAGTTTTTATCAAAGGAAGCAGGAGGGGGCCAGTCCAGATGTAGGAAACAACACCACCAGCTAGTAGGCTGAGGTTATGGGCCGTAGAAATCAATAAAGATCTCAAGTGCACCTGCAAGCAGCCATACATGAATTACATTACATTGTGGACCAAATAAAACCATAAACCATAAACCTGGCTTTATAAACATGATATGTTAGTGTCAATCAAAATCGGCTTCGGCAATCCCATGTGCCTGTTGCTATTGCGCTCCCACCT

Coding sequence (CDS)

ATGGAAGAAAGAGATGAATATACAGAATCGAGAAGTAATAATAATGCTGAAGCCGTACAAGAAGCCAAGATCAGTGTTGAAGCTGAAATGCCAACTTGTCTTTCAAATGAGCAAAAGCATTCAGTTCCTGATTATCATGAATTGGAAGCAACTCCAGAATATACCAACAAAACTGGCGGTCCAGATGAAGAAAAGCCAGAGGTCCAGCAGAATATGGAGGAAGAGAATAAGGAACTTGGTTCAGGAGACGTGCTTAGTGAATTATCAGAAAAACACAATCGGACTTTCTCTAACCTTGCTGATAATGATCAAGTTGAAGCTGGTAATTTATTATGCTGTGATAAAGATACCGAAAATTTGATAGTACCTATTGAAGTTGAGACAACGACTCTTCTTGTTGACTGCTCTGAACTTCCACCTGAAGTTGTCAACAAAAACTATATTGAACAGATGAACCCTCCTATTGAAGAATTAACCCAAAATACTCCTTTTCAAAATTTAGAAACAGTCCCCAGTAATTCAGAACAATCGGATCACAAGGATAAGAGAATTTTGAAATCAATGAAGATAAAGTCTATTTTAAGGTCCCTTGTAAGTAGTGACAGAAATATGCGTTCAAAGACCCAAGAGAAAGATAAAGATCCTGAACCAAGTAATGACTTGAATAATTTTACTGCTGAAGAGGGAAAAGGGAAGAAGAAGGAGAGAAATATACAAGGAAAGGGAGCAAGAGTCGATGAATTCTCATCAATCAGGAATCATTTGAGATATTTACTGAACCGCATCAAATATGAACAGAACTTGATTGAAGCTTATTCTAGTGAAGGCTGGAAAGGGTTCAGCTCAGATAAATTGAAGCCTGAAAAGGAACTTCAACGGGCATCAAATGAAATAATGCGACGCAAATTAAAAATAAGAGATGTATTTCAACGTATTGATGCACTTTGTGGCGAAGGAGGCCTTTCTAAATCTTTATTCGATTCTCAAGGACAGATAGACAGTGAAGATATATTCTGTGCAAAATGTGGATCCAAAGAATTGTCCCTTGAGAATGACATCATACTATGTGATGGTATTTGTGATCGTGGGTTCCATCAGTTCTGTTTAGAACCACCTTTGCTAAATACAGACATTCCGCCGGATGATGAAGGATGGCTATGCCCCGGATGTGATTGCAAAGATGACTGCTTGAATCTGCTTAATGAATTTCAAGGATCAAGACTTTCCATCACTGATGGTTGGGAGAAAGTCTATCCCGAGGCCGCAGCATCAGCTGCTGGACGAAATTTTGATCATGCCTTAGGTCTTCCTTCAGATGATTCTGAAGATGATGATTATGATCCTGATGTTCCAGATACTATTGTCCAGGACGATGAATCAAGTTCTGAAACATCTGGGTATGCTTCTGCTTCTGAGGAATTAGAGTCTCCATCCAATGTTGACCAGTACTTAGGTCTCCCTTCCGATGACTCGGACGATGATGACTATGATCCCAGTGCTCCAGAACGTGATGAAGATGTTAGACAGGAAAGTTCTAGTTCTGACTTTACATCTGATTCTGAGGATTTAGCTGCACTTGACAGTAGCCCCTCTTCCAAAGCTGATAACCTTGTGTCTTCATCACTGAATAATACTACGTCTACGAAAAACCCTGATGGGCGAAGTTCTGGAGGTGGTCCTAGAAAGAGTGCACTGTATAATGAGCTATCAAGTCTACTAGAGTCCGATCCTGAACCTGTTTTGGGAAGAAGACAGGTGGAACGGTTGGATTACAAGAAGCTCCATGATGAGACATACGGGAATGTTCCTACCGACTCAAGCGATGACACGTACGCGAGTATTTCTACGGATTCAAGTGATGACCAAGGCTGGGATAGTAATACAAGGAAGAGAAGTCCTAAAACCCTGGTTCTTGCACTGCCAAATTATCGAACTAATGATGATATGACTAACGTAAAAACTAAACACAGTTCTAAGAGGGGTACTCGTCAAAAGGCAGC
TGCTGTAAATATGAATAAATCTGTGACTAAAACTCCTGAAGACACTGGAAAAGCTAGTTCTTCTGTTAGGAGAACCACACCATCATCGTATAGAAGACTCAGTCAACTTGCATTGGAGAGACTTTTAGCATCATTCCAAGAAAACCAGTATCCTGAACGAGCTACAAAGGAAAGTCTGGCACAAGAACTAGGGCTCAGTGTGAAGCAGGTTAGCAAATGGTTTACGAACACACGTTGGAGCACACGCCATCCCTCAAGCGTTGAGGGTAATAAAGCGAAGAGTTCCTCAAGAATGGGCATTCGTTCATCTCAGGCAAGTGGAGAGCTGCACCAGCCCGAGCAAGAATTTGGTGCCCAACATCAAGAATTACCAACGACAGATAGTGTTGTGGCCCCATGTCAGAGTGGGGATACAGGGGATGTCAAATTGGCAACTCAGGAAACTAAAAGATCAGAATTTTCTGCCGCAAAATCCAGAAAACGGAAGGGCAGGTCAGATCACGCTGCATCATGTTCAAAGGACAGTAAGGAATCACAAAGGCCTCCTGCCAAGTCTCCAAAAGTAAATGAAATCCAAACAGCACATAGCATTAAGACGAGGAGGAGAAATTCCTTATAG

Protein sequence

MEERDEYTESRSNNNAEAVQEAKISVEAEMPTCLSNEQKHSVPDYHELEATPEYTNKTGGPDEEKPEVQQNMEEENKELGSGDVLSELSEKHNRTFSNLADNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPIEELTQNTPFQNLETVPSNSEQSDHKDKRILKSMKIKSILRSLVSSDRNMRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLLNEFQGSRLSITDGWEKVYPEAAASAAGRNFDHALGLPSDDSEDDDYDPDVPDTIVQDDESSSETSGYASASEELESPSNVDQYLGLPSDDSDDDDYDPSAPERDEDVRQESSSSDFTSDSEDLAALDSSPSSKADNLVSSSLNNTTSTKNPDGRSSGGGPRKSALYNELSSLLESDPEPVLGRRQVERLDYKKLHDETYGNVPTDSSDDTYASISTDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDMTNVKTKHSSKRGTRQKAAAVNMNKSVTKTPEDTGKASSSVRRTTPSSYRRLSQLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRMGIRSSQASGELHQPEQEFGAQHQELPTTDSVVAPCQSGDTGDVKLATQETKRSEFSAAKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRRRNSL
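
The protein sequence above is the conceptual translation of the CDS. As a quick consistency check, the following Python sketch translates the first and last codons of the CDS with the standard genetic code. The names `CODON_TABLE` and `translate` are illustrative and not part of the database record. The sketch assumes the standard nuclear code applies, as is usual for a plant nuclear gene.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Codons containing ambiguous bases (for example N) become "X".
        residue = CODON_TABLE.get(cds[pos:pos + 3].upper(), "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

cds_start = "ATGGAAGAAAGAGATGAATATACAGAATCG"  # first 30 nt of the CDS above
cds_end = "AATTCCTTATAG"                      # last 12 nt of the CDS above
print(translate(cds_start))  # MEERDEYTES
print(translate(cds_end))    # NSL
```

Both fragments agree with the listed protein, which begins MEERDEYTES and ends in NSL before the TAG stop codon.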
BLAST of Cp4.1LG14g04420 vs. Swiss-Prot
Match: PRH_PETCR (Pathogenesis-related homeodomain protein OS=Petroselinum crispum GN=PRH PE=2 SV=1)

HSP 1 Score: 434.9 bits (1117), Expect = 2.1e-120
Identity = 277/600 (46.17%), Positives = 370/600 (61.67%), Query Frame = 1
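
The percentages in each HSP summary are the match counts divided by the alignment length. A small Python sketch, using a hypothetical helper `percent`, reproduces the figures reported for this hit. It assumes the report rounds to two decimal places.

```python
def percent(matches: int, aligned: int) -> float:
    """Return matches as a percentage of aligned columns, rounded to 2 places."""
    return round(100.0 * matches / aligned, 2)

# Counts taken from the HSP summary line above (alignment length 600).
identity = percent(277, 600)
positives = percent(370, 600)
print(identity, positives)  # 46.17 61.67
```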

Query: 198  VSSDRNMRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGA---RVDEFSSIRNH 257
            V+S R++RS++QEK  +P    D+NN  A+EG  ++K R  + K     RVDEF  IR H
Sbjct: 441  VNSSRSLRSRSQEKSIEP----DVNNIVADEGADREKPRKKRKKRMEENRVDEFCRIRTH 500

Query: 258  LRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDVFQRIDA 317
            LRYLL+RIKYE+N ++AYS EGWKG S DK+KPEKEL+RA  EI  RKLKIRD+FQR+D 
Sbjct: 501  LRYLLHRIKYEKNFLDAYSGEGWKGQSLDKIKPEKELKRAKAEIFGRKLKIRDLFQRLDL 560

Query: 318  LCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLL 377
               EG L + LFDS+G+IDSEDIFCAKCGSK+++L NDIILCDG CDRGFHQFCL+PPLL
Sbjct: 561  ARSEGRLPEILFDSRGEIDSEDIFCAKCGSKDVTLSNDIILCDGACDRGFHQFCLDPPLL 620

Query: 378  NTDIPPDDEGWLCPGCDCKDDCLNLLNEFQGSRLSITDGWEKVY-PEAAASAAGRNFDHA 437
               IPPDDEGWLCPGC+CK DC+ LLN+ Q + + + D WEKV+  EAAA+A+G+N D  
Sbjct: 621  KEYIPPDDEGWLCPGCECKIDCIKLLNDSQETNILLGDSWEKVFAEEAAAAASGKNLDDN 680

Query: 438  LGLPSDDSEDDDYDPDVP--DTIVQDDESSSETSGYASASEELESPSNVDQYLGLPSDDS 497
             GLPSDDSEDDDYDP  P  D  VQ D+SS++ S Y S S++++     +   GLPSDDS
Sbjct: 681  SGLPSDDSEDDDYDPGGPDLDEKVQGDDSSTDESDYQSESDDMQVIRQKNS-RGLPSDDS 740

Query: 498  DDDDYDPSAPERDEDVRQESSSSDFTSDSEDLAAL--DSSPSSKADNLVSSSLNNTTSTK 557
            +DD+YDPS    D+ + ++SS SDFTSDSED   +  D   + KA   ++S+ ++  + +
Sbjct: 741  EDDEYDPSGLVTDQ-MYKDSSCSDFTSDSEDFTGVFDDYKDTGKAQGPLASTPDHVRNNE 800

Query: 558  NPDGRSSGGGPRKSALYNELSSLLESDPEPVLGRRQVERLDYKKLHD------------- 617
               G    G                 D  P+  RRQVE LDYKKL+D             
Sbjct: 801  EGCGHPEQG-----------------DTAPLYPRRQVESLDYKKLNDIEFSKMCDILDIL 860

Query: 618  -------------ETYGNVPTDSSDDTYASISTDSSDDQGWDSNTRKRSPKTLVLALPNY 677
                         E YGN  +DSSD+ Y   S+   ++   ++   +R  ++  L L   
Sbjct: 861  SSQLDVIICTGNQEEYGNTSSDSSDEDYMVTSSPDKNNSDKEATAMERGRESGDLEL--- 920

Query: 678  RTNDDMTNVKTKHSSKRGTRQKAAAVNMNKSVTKTPEDTGKASSSVRRTTPSSYRRLSQL 737
                D    ++ H+  R   +K A    +  ++++ ED+    +  + T+ + +    + 
Sbjct: 921  ----DQKARESTHN--RRYIKKFAVEGTDSFLSRSCEDSAAPVAGSKSTSKTLH---GEH 980

Query: 738  ALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSS 764
            A +RLL SF+ENQYP+RA KESLA EL LSV+QVS WF N RWS RH S +  + AK  S
Sbjct: 981  ATQRLLQSFKENQYPQRAVKESLAAELALSVRQVSNWFNNRRWSFRHSSRIGSDVAKFDS 1005

BLAST of Cp4.1LG14g04420 vs. Swiss-Prot
Match: HAT31_ARATH (Homeobox protein HAT3.1 OS=Arabidopsis thaliana GN=HAT3.1 PE=2 SV=3)

HSP 1 Score: 413.3 bits (1061), Expect = 6.6e-114
Identity = 260/564 (46.10%), Positives = 354/564 (62.77%), Query Frame = 1

Query: 207 KTQEKDKDPEPSNDLNNFTAEEGKGKKKERNI-QGKGARVDEFSSIRNHLRYLLNRIKYE 266
           + Q   +D  PS+ + N T   G+ KKK + + +G+    DE++ I+  LRY LNRI YE
Sbjct: 136 RAQRSKEDAGPSSVVANSTPV-GRPKKKNKTMNKGQVREDDEYTRIKKKLRYFLNRINYE 195

Query: 267 QNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDVFQRIDALCGEGGLSKSL 326
           Q+LI+AYS EGWKG S +K++PEKEL+RA+ EI+RRKLKIRD+FQ +D LC EG L +SL
Sbjct: 196 QSLIDAYSLEGWKGSSLEKIRPEKELERATKEILRRKLKIRDLFQHLDTLCAEGSLPESL 255

Query: 327 FDSQGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGW 386
           FD+ G+I SEDIFCAKCGSK+LS++NDIILCDG CDRGFHQ+CLEPPL   DIPPDDEGW
Sbjct: 256 FDTDGEISSEDIFCAKCGSKDLSVDNDIILCDGFCDRGFHQYCLEPPLRKEDIPPDDEGW 315

Query: 387 LCPGCDCKDDCLNLLNEFQGSRLSITDGWEKVYPEAAASAAGRNFDHALGLPSDDSEDDD 446
           LCPGCDCKDD L+LLN+  G++ S++D WEK++PEAAA+  G   +    LPSDDS+D++
Sbjct: 316 LCPGCDCKDDSLDLLNDSLGTKFSVSDSWEKIFPEAAAALVGGGQNLDCDLPSDDSDDEE 375

Query: 447 YDPDVPDTIVQDDESS------------SETSGYASASEEL-----ESPSNVDQYLGLPS 506
           YDPD  +    D++ S            S+ + + SAS+E+     E    +   + LPS
Sbjct: 376 YDPDCLNDNENDEDGSDDNEESENEDGSSDETEFTSASDEMIESFKEGKDIMKDVMALPS 435

Query: 507 DDSDDDDYDPSAPERDEDVRQESSSSDFTSDSEDLAALDSSPSSKADNLVSSSLNNTTST 566
           DDS+DDDYDP AP  D+D  +ESS+SD TSD+EDL       S K D   ++     T  
Sbjct: 436 DDSEDDDYDPDAPTCDDD--KESSNSDCTSDTEDLET-----SFKGDE--TNQQAEDTPL 495

Query: 567 KNPDGRSSGGGPRKSALYNELSSLLESDPEPVLGRRQVERLDYKKLHDETYGNVPTDSSD 626
           ++P GR +      + L +++   L+  P  V  RR VERLDYKKL+DE Y NVPT SSD
Sbjct: 496 EDP-GRQTSQLQGDAILESDVG--LDDGPAGVSRRRNVERLDYKKLYDEEYDNVPTSSSD 555

Query: 627 D-----TYASISTDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDMTNVKTKHSSKRGTR 686
           D     T      DS  +   D+   K+S              +D T+ K    SKR  +
Sbjct: 556 DDDWDKTARMGKEDSESEDEGDTVPLKQSSNA-----------EDHTSKKLIRKSKRADK 615

Query: 687 QKAAAVNMNKSVTKTPEDTGKASSSVRRTTPSSYRRLSQLALERLLASFQENQYPERATK 746
           +    +       + P + G  S  + +++ S+ ++ +    +RL  SFQENQYP++ATK
Sbjct: 616 KDTLEMPQ-----EGPGENG-GSGEIEKSSSSACKQ-TDPKTQRLYISFQENQYPDKATK 668

Query: 747 ESLAQELGLSVKQVSKWFTNTRWS 748
           ESLA+EL ++VKQV+ WF + RWS
Sbjct: 676 ESLAKELQMTVKQVNNWFKHRRWS 668

BLAST of Cp4.1LG14g04420 vs. Swiss-Prot
Match: HOX1A_MAIZE (Homeobox protein HOX1A OS=Zea mays GN=HOX1A PE=2 SV=1)

HSP 1 Score: 359.8 bits (922), Expect = 8.6e-98
Identity = 249/612 (40.69%), Positives = 351/612 (57.35%), Query Frame = 1

Query: 188 MKIKSILRSLVSSDRNMRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDE 247
           M   S +R L S+  +  + T+      +P+             K+++ +     +  DE
Sbjct: 73  MSSNSDVRVLRSTSSSKTTSTEHVQAPVQPA------------AKRRKMSRASNKSSTDE 132

Query: 248 FSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRD 307
           FS IR  +RY+LNR+ YEQ+LIEAY+SEGWK  S DK++PEKEL+RA +EI+R KL+IR+
Sbjct: 133 FSQIRKRVRYILNRMNYEQSLIEAYASEGWKNQSLDKIRPEKELERAKSEILRCKLRIRE 192

Query: 308 VFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQF 367
           VF+ ID+L  +G + ++LFDS+G+I  EDIFC+ CGS + +L NDIILCDG CDRGFHQ 
Sbjct: 193 VFRNIDSLLSKGKIDETLFDSEGEISCEDIFCSTCGSNDATLGNDIILCDGACDRGFHQN 252

Query: 368 CLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLLNEFQGSRLSITDGWEKVYPEAAASAAG 427
           CL PPL   DIP  DEGWLCP CDCK DC++L+NE  GS +SI D WEKV+P+AAA A  
Sbjct: 253 CLNPPLRTEDIPMGDEGWLCPACDCKIDCIDLINELHGSNISIEDSWEKVFPDAAAMAND 312

Query: 428 RNFDHALGLPSDDSEDDDYDPDVPD--TIVQDDESSSETSGYASASEEL-------ESPS 487
              D A  LPSDDS+D+D+DP++P+   + +D+ESS E     S S++        +S  
Sbjct: 313 SKQDDAFDLPSDDSDDNDFDPNMPEEHVVGKDEESSEEDEDGGSDSDDSDFLTCSDDSEP 372

Query: 488 NVDQY---LGLPSDDSDDDDYDPSAPERDEDVRQESSS--SDFTSDSEDLAALDSSPSSK 547
            +D+    L LPS+DS+DDDYDP+ P+ D+DV ++SSS  SDFTSDS+D        S  
Sbjct: 373 LIDKKVDDLRLPSEDSEDDDYDPAGPDSDKDVEKKSSSDESDFTSDSDDFC---KEISKS 432

Query: 548 ADNLVSSSLNNTTSTKNPDGRSSGGGPRKSALYNELSSLLESDPEPVLGRRQVERLDYKK 607
             + VSS L       + +  ++      SA     + + +    P   RRQ ERLDYKK
Sbjct: 433 GHDEVSSPLLPDAKVGDMEKITAQAKTTSSADDPMETEIDQGVVLPDSRRRQAERLDYKK 492

Query: 608 LHDETYGNVPTDSSDD-TYASISTD--SSDDQGWDSNTRKRSPKTLVLALPNYRTNDDMT 667
           L+DE YG   +DSSDD  ++  +T    S+++G  ++   +  + +         ND++T
Sbjct: 493 LYDEAYGEASSDSSDDEEWSGKNTPIIKSNEEGEANSPAGKGSRVV-------HHNDELT 552

Query: 668 NVKTKHSSKRGTRQKAAAVNMNKSVTKTPED-TGKASSSVRRTTPSSYRRLSQLALERLL 727
              TK S            +++ SV + P D T   S+S  R           +  ++L 
Sbjct: 553 TQSTKKSLH----------SIHGSVDEKPGDLTSNGSNSTARK-----GHFGPVINQKLH 612

Query: 728 ASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRMGIRS 781
             F+   YP R+ KESLA+ELGL+ +QV+KWF   R S R  SS +G      S     S
Sbjct: 613 EHFKTQPYPSRSVKESLAEELGLTFRQVNKWFETRRHSARVASSRKGISLDKHSPQNTNS 647

BLAST of Cp4.1LG14g04420 vs. Swiss-Prot
Match: PRH_ARATH (Pathogenesis-related homeodomain protein OS=Arabidopsis thaliana GN=PRH PE=2 SV=1)

HSP 1 Score: 194.9 bits (494), Expect = 3.7e-48
Identity = 105/287 (36.59%), Positives = 166/287 (57.84%), Query Frame = 1

Query: 205 RSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKY 264
           +S+T++  +      ++     ++ + +K +R  +     VD+   ++   RYLL ++K 
Sbjct: 59  KSRTKKYSRGWVRCEEMEEEKVKKTRKRKSKRQQKDNKVEVDDSLRLQRRTRYLLIKMKM 118

Query: 265 EQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDVFQRIDALCGEGGLSKS 324
           +QNLI+AY++EGWKG S +K++P+KEL+RA  EI+  KL +RD  +++D L   G + + 
Sbjct: 119 QQNLIDAYATEGWKGQSREKIRPDKELERARKEILNCKLGLRDAIRQLDLLSSVGSMEEK 178

Query: 325 LFDSQGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEG 384
           +  S G I  + IFCA+C S+E   +NDIILCDG C+R FHQ CL+PPL    IPP D+G
Sbjct: 179 VIASDGSIHHDHIFCAECNSREAFPDNDIILCDGTCNRAFHQKCLDPPLETESIPPGDQG 238

Query: 385 WLCPGCDCKDDCLNLLNEFQGSRLSITDGWEKVYPEAAASAAG--RNFDHALGLPSDDSE 444
           W C  CDCK + ++ +N   G+   +   W+ ++ E A+   G     ++    PSDDS+
Sbjct: 239 WFCKFCDCKIEIIDTMNAQIGTHFPVDSNWQDIFNEEASLPIGSEATVNNEADWPSDDSK 298

Query: 445 DDDYDPDVPDTIVQDDESSSETSGYASASEELESPSNVDQYLGLPSD 490
           DDDYDP++ +       +SS  SG      + ES   +   L L SD
Sbjct: 299 DDDYDPEMRE---NGGGNSSNVSGDGGGDNDEES---ISTSLSLSSD 339

BLAST of Cp4.1LG14g04420 vs. TrEMBL
Match: A0A0A0LA53_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G198510 PE=4 SV=1)

HSP 1 Score: 828.9 bits (2140), Expect = 5.6e-237
Identity = 472/620 (76.13%), Positives = 512/620 (82.58%), Query Frame = 1

Query: 1   MEERDEYT--ESRSNNNAEAVQEAKISVEAEMPTCLSNEQKHSVPDYHELEATPEYTNKT 60
           MEERDE T  ESR N  AEAVQEAK SVE E+ TCLSNE K+S   Y EL  TPE+++K 
Sbjct: 146 MEERDENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNEAKYS--GYQELGTTPEFSSKI 205

Query: 61  GGPDEEKPEVQQNMEEENKELGSGDVLSELSEKHNRTFSNLADNDQVEAGNLLCCDKDTE 120
            GPDEEK  VQQNME     LGSG +LSELSEK N+T SN ADND+VEAGNLL  DKDT+
Sbjct: 206 DGPDEEKAGVQQNME-----LGSGYLLSELSEKDNQTISNHADNDRVEAGNLLSNDKDTK 265

Query: 121 NLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPIEELTQNTPFQNLETVPSNSEQSD 180
           NL + IE E TTLL +CSELP E V KNYIE+MNPPI +LTQ T  Q+LET+PSNS+QS 
Sbjct: 266 NLKLSIEDEATTLLNECSELPLEDVTKNYIEKMNPPIGDLTQITSIQSLETIPSNSQQSA 325

Query: 181 HKDKRILKSMKIKSILRSLVSSDRNMRSKTQEKDKDPEPSNDLNNFTAEEG--KGKKKER 240
            KDK  LKS K    LRS VSSDR +RS+TQEK K PE SNDLNNFTAEE   + KKK+R
Sbjct: 326 RKDKIFLKSKKKNYKLRSHVSSDRVLRSRTQEKAKAPERSNDLNNFTAEEDGKRKKKKKR 385

Query: 241 NIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASN 300
           NIQGKGARVDE+SSIRNHLRYLLNRI+YEQ+LIEAYSSEGWKGFSSDKLKPEKELQRASN
Sbjct: 386 NIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASN 445

Query: 301 EIMRRKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSLENDIILC 360
           EIMRRKLKIRD+FQRIDALC EG LS+SLFDS+GQIDSEDIFCAKCGSKELSLENDIILC
Sbjct: 446 EIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILC 505

Query: 361 DGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLLNEFQGSRLSITDGWEK 420
           DGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCL+LLNEFQGS LSITDGWEK
Sbjct: 506 DGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEK 565

Query: 421 VYPEAAASAAGRNFDHALGLPSDDSEDDDYDPDVPDTIVQDDE---------------SS 480
           VYPEAAA+AAGRN DH LGLPSDDSED DYDPDVPDTI QD+E               S+
Sbjct: 566 VYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNELSSDESSSDQSNSDPSN 625

Query: 481 SETSGYASASEELESPSNVDQYLGLPSDDSDDDDYDPSAPERDEDVRQESSSSDFTSDSE 540
           S+TSGYASASE LE  SN DQYLGLPSDDS+D+DYDPS PE DE VRQESSSSDFTSDSE
Sbjct: 626 SDTSGYASASEGLEVSSNDDQYLGLPSDDSEDNDYDPSVPELDEGVRQESSSSDFTSDSE 685

Query: 541 DLAALDSSPSSKADNLVSSSLNNTTSTKNPDGRSSGGGPRKSALYNELSSLLESDP---- 597
           DLAALD++ SSK  +LV SSLNNT   KN +G+SS  GP KSAL+NELSSLL+S P    
Sbjct: 686 DLAALDNNCSSKDGDLV-SSLNNTLPVKNSNGQSS--GPNKSALHNELSSLLDSGPDKDG 745

BLAST of Cp4.1LG14g04420 vs. TrEMBL
Match: A0A067L7L0_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_16279 PE=4 SV=1)

HSP 1 Score: 583.2 bits (1502), Expect = 5.3e-163
Identity = 386/828 (46.62%), Positives = 508/828 (61.35%), Query Frame = 1

Query: 69  QQNMEEENKELGSGDVLSELSEKHNRTFSNLADNDQVEAGNLLCCDKDTENLIVPIEVET 128
           Q +  E+  E  S D   +  E+     S+L  ++ VE  N L C   T +L   +  ++
Sbjct: 183 QLSTSEQKVEFASDDATCDPLEESKVPASDLLRDELVEINNELSCCTATRHLGTQLTTKS 242

Query: 129 TTLLVDCSELPPEV-VNKNYIEQMNPPIEELTQNTPFQNLET----VPSNSEQSDHKDKR 188
           + L  +   +P +  +N    E++ PP + +  +   Q  +T    V  NS +   + KR
Sbjct: 243 SPL--EHLGMPSDSEINTCATEKLEPPHDNMDNHLNLQQSDTPSKDVSINSSRVGVRVKR 302

Query: 189 ILKSMKIKSILRSLVSSDRNMRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGA 248
             KS + K +LRSL  SDR  +S++QEK K P+P+ D+ N ++   K +KK +  Q K  
Sbjct: 303 TAKSTRKKYVLRSLRRSDRVRQSRSQEKPKGPDPNADMANASSNIEKTRKKRKKRQRKSV 362

Query: 249 RVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKL 308
             DE+S IR HLRYLLNRI YEQ+LI AYS+EGWKG S +KLKPEKELQRA++EI+RRKL
Sbjct: 363 EGDEYSRIRKHLRYLLNRISYEQSLITAYSAEGWKGLSLEKLKPEKELQRATSEILRRKL 422

Query: 309 KIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSLENDIILCDGICDRG 368
           KIRD+FQR+D+LC EG L +SLFDS GQI SEDIFCAKCGSK+++ +NDIILCDG CDRG
Sbjct: 423 KIRDLFQRVDSLCAEGRLPESLFDSDGQISSEDIFCAKCGSKDMTADNDIILCDGACDRG 482

Query: 369 FHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLLNEFQGSRLSITDGWEKVYPEAAA 428
           FHQFCL PPLL  DIPPDDEGWLCPGCDCK DC+ LLN+ QG+ +SI+D WEKV+PEAAA
Sbjct: 483 FHQFCLLPPLLKEDIPPDDEGWLCPGCDCKVDCIELLNDSQGTNISISDRWEKVFPEAAA 542

Query: 429 SAAGRNFDHALGLPSDDSEDDDYDPDVP--DTIVQDDESSSETSGYASASEELESPSNVD 488
             AG+N D   GLPSDDS+D+DYDPD P  D   Q DESS++ S Y SAS+ELE+    +
Sbjct: 543 --AGQNPDPNFGLPSDDSDDNDYDPDGPEIDEKSQGDESSNDESDYTSASDELEASPGDE 602

Query: 489 QYLGLPSDDSDDDDYDPSAPERDEDVRQESSSSDFTSDSEDLAAL--DSSPSSKADNLVS 548
           Q LGL SDDS+DDDYDP A +RDE+V +ESSSSDFTSDSEDL A   D+  S + +N +S
Sbjct: 603 QQLGLSSDDSEDDDYDPDALDRDENV-EESSSSDFTSDSEDLTATLDDNHLSGEDENHMS 662

Query: 549 SSLNNTTSTKNPDGRSSGGGPRKSALYNELSSL-LESDPE---PVLGRRQVERLDYKKLH 608
             L+        D +  G G  K + ++ELS L L S  +   P+ G+R VERLDYKKL+
Sbjct: 663 IGLHG-------DSKHRGNG--KQSTHSELSLLDLNSRKDGSGPISGKRDVERLDYKKLY 722

Query: 609 DETYGNVPTDSSDD---------------TYASISTDSSDDQGW--DSNTRKRSPKTLVL 668
           DETYGN  +DSSDD               TY S S+DSSDD+ +  D   RKR   T V 
Sbjct: 723 DETYGNASSDSSDDEDFTDDVEPRKRRKETYGSTSSDSSDDEDFIDDVEPRKRRRSTEV- 782

Query: 669 ALPNYRTNDDMTNVKTKHSSKRGTRQKAAAVNMNKSVTKTPEDTGKASSSVRRTTPSSYR 728
              +   N  ++    + ++ +  RQK+   N + S TK  E    +SSS +    S YR
Sbjct: 783 GQASVNANAFVSKTAKQDTTPKRHRQKSKFANTSTSSTKGHEGASPSSSSGKPVKSSGYR 842

Query: 729 RLSQLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNK 788
           RL +   + L  SF+ENQYP+RA KESLA+ELG++ +QVSKWF NTRWS  HP S + + 
Sbjct: 843 RLGETVTQGLYKSFKENQYPDRAKKESLAKELGITFQQVSKWFENTRWSFNHPPSTDAST 902

Query: 789 AKSSSRMGIRSSQASGELHQPEQE--------FGAQHQELPTTDSVVAPCQSGDTGDVKL 848
            + +++   +  + + EL  PE E         GAQ +E P  D        GDT D K+
Sbjct: 903 VRKTTKEDSQLPKTNTELCTPEPEKICRNTTSNGAQSEESPKVDDATGGSYIGDTRDTKM 962

Query: 849 ATQETKRSEFSAAKSRKRKGRSD-HAASCSKDSKESQRPPAKSPKVNE 858
            +QE+ + +     SRKRK  SD           E ++ P   PK  E
Sbjct: 963 GSQESCKQKSKTPDSRKRKHISDPRTLDPYSTIGEMEKIPVNLPKSQE 995

BLAST of Cp4.1LG14g04420 vs. TrEMBL
Match: M5VJJ0_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa023106mg PE=4 SV=1)

HSP 1 Score: 582.4 bits (1500), Expect = 9.1e-163
Identity = 417/923 (45.18%), Positives = 558/923 (60.46%), Query Frame = 1

Query: 6    EYTESRSNNNAEAVQEAKISVEAEMPTCLSNEQKHSVPDYHELEATPEYTNKTGGPDEEK 65
            E  E R  + +  VQ   +     +P C  +EQ   + +   + +     ++ G P E+ 
Sbjct: 151  EPAEERHPSGSFCVQNELLQTIMPLPICGGSEQVQPISENVNMASL---NDQAGLPPEDV 210

Query: 66   PEVQQNME---------EENKELGSGDVLSELSEKHNRTFSNLADNDQVEAGNLLCCDKD 125
             +  Q  +          +  E GSG V SE +++ ++  S  A ND+ +    +     
Sbjct: 211  SKTCQTQKISCPHQITSHQINEFGSGSVPSEPAKQKDQLDSVPAQNDEAKTSKAVSSSTV 270

Query: 126  TENLIVPIEVETTTLLVDCSELPPEVVNKNYIE-QMNPPIEELTQNTPFQNLETVPSNSE 185
             E     IE  T    +  SE P E ++K+  + +M P  E++TQN+  Q LET   N+ 
Sbjct: 271  FEQPGPSIEAMTEDSPIGHSEPPLEDLSKSLSDKEMEPLPEDVTQNSSLQQLETASKNAL 330

Query: 186  QSDH----KDKRILKSMKIKSILRSLVSSDRNMRSKTQEKDKDP-----------EPSND 245
            +       KDK+  KS K K + RS V SDR +RSKT EK+K             E SN 
Sbjct: 331  KISSCLGPKDKKNPKSRKRKYMSRSFVRSDRVLRSKTGEKEKPKDLKLSNNVATLESSNS 390

Query: 246  LNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGF 305
            + N +  E K +KK +N +   A  DEFS IR HLRYLLNRI YE++LI+AYS EGWKG 
Sbjct: 391  IANVSNGEEKKRKKRKNRRDNRAIADEFSRIRTHLRYLLNRIGYEKSLIDAYSGEGWKGS 450

Query: 306  SSDKLKPEKELQRASNEIMRRKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCA 365
            S +KLKPEKELQRA++EI+RRKLKIRD+FQR+++LC EG   +SLFDS+GQIDSEDIFC 
Sbjct: 451  SLEKLKPEKELQRATSEILRRKLKIRDLFQRLESLCAEGMFPESLFDSEGQIDSEDIFCG 510

Query: 366  KCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLL 425
            KCGSK++SL+NDIILCDG CDRGFHQFCLEPPLL+ DIPPDDEGWLCPGCDCK DC++LL
Sbjct: 511  KCGSKDVSLDNDIILCDGACDRGFHQFCLEPPLLSEDIPPDDEGWLCPGCDCKVDCIDLL 570

Query: 426  NEFQGSRLSITDGWEKVYPEAAASA-AGRNFDHALGLPSDDSEDDDYDPDVPDT--IVQD 485
            N+ QG+ LS+TD WEKV+PEAAA+A AG N D+  GLPSDDS+D+DYDPD P+T   VQ 
Sbjct: 571  NDSQGTDLSVTDSWEKVFPEAAAAASAGENQDNH-GLPSDDSDDNDYDPDGPETDNKVQG 630

Query: 486  DESSSETSGYASASEELESP-SNVDQYLGLPSDDSDDDDYDPSAPERDEDVRQESSSSDF 545
            +ESSS+ S YASAS+ LE+P SN +QYLGLPS+DS+DDDY+P AP+ +EDV+QESSSSDF
Sbjct: 631  EESSSDESEYASASDGLETPKSNDEQYLGLPSEDSEDDDYNPYAPDVNEDVKQESSSSDF 690

Query: 546  TSDSEDL-AALDSSPSSKAD--NLVSSSLNNTTSTKNPDGRSSGGGPRKSALYNELSSLL 605
            TSDSEDL AALD +  S  D     S+SL+++   +    +SS  G +K +L +EL SLL
Sbjct: 691  TSDSEDLGAALDDNIMSSEDVEGPKSTSLDDSKPHRGSGEQSSISGQKKHSLKDELISLL 750

Query: 606  ESDP-----EPVLGRRQVERLDYKKLHDETYGNVPTDSSDD-TYASISTDSSDDQGWDSN 665
            ES P      P+ G+R +ERLDYK+LHDE YGNVPTDSSDD  +  I+T     +G    
Sbjct: 751  ESGPGQGESAPLSGKRHIERLDYKRLHDEAYGNVPTDSSDDEDWNDIATQRKRKKG-TGQ 810

Query: 666  TRKRSPKTLVLALPN-YRTNDDMTNV-KTKHSSKRGTRQKAAAVNMNKSVTKTPEDTGKA 725
               RSP      + N   T D   +V + +++ +R   +K+   + +    K+P+ + K+
Sbjct: 811  VANRSPNGKTSNIKNGVITKDIKPDVDENENTPRRMPHRKSNVEDTSNLSNKSPKGSTKS 870

Query: 726  SSSVRR--TTPSSYRRLSQLALERLLASFQENQYPERATKESLAQELGLSVKQ------- 785
             S+  R  ++ S+Y RL + A +RL  SF+EN YP+R+ KESLA+ELGL  KQ       
Sbjct: 871  GSTSGRAGSSRSTYSRLGEAATQRLCKSFKENHYPDRSMKESLARELGLMAKQVIPSFIL 930

Query: 786  --VSKWFTNTRWSTRHPSSVEGNKAKSSS-----RMGIRSSQASGELHQPEQEFGAQHQE 845
              VSKWF N     RH   V  +K+ S +     +   R  +    +       GAQ++E
Sbjct: 931  ASVSKWFEN----ARHCLKVGVDKSASENCAPPPQTNRRQLEQGDAIVGDSDHNGAQNKE 990

Query: 846  LPTTDSVVAPCQSGDTGDVKLATQETKRSEFSAAKSRKRKGRSDHAASCSKDSKESQRPP 873
            L  TD  +  C S D  D +LAT  + RS+ S   +RKRK RSD       D K     P
Sbjct: 991  LHGTDDPMIGCCSRDVMDSELATLGSSRSKLSTPNNRKRKRRSD-----DPDPKTETPTP 1050

BLAST of Cp4.1LG14g04420 vs. TrEMBL
Match: W9R947_9ROSA (Homeobox protein OS=Morus notabilis GN=L484_011492 PE=4 SV=1)

HSP 1 Score: 562.8 bits (1449), Expect = 7.5e-157
Identity = 361/740 (48.78%), Positives = 472/740 (63.78%), Query Frame = 1

Query: 118 ENLIVPIEVETTTLLVDCSELPPEVVN--KNYIE-QMNPPIEELTQNTPFQNLET----V 177
           +N++V   +  +  +V    L P V +   +YI+ Q+  P E++++++  + LET    +
Sbjct: 261 QNVLVETRIAASNGIVS-EHLEPPVGDGSDSYIDKQVEQPSEDVSKSSSLEQLETSSKSL 320

Query: 178 PSNSEQSDHKDKRILKSMKIKSILRSLVSSDRNMRSKTQEKDKDPEPSNDLNNFTAEEGK 237
            +   Q   KDK+  KS K + +LRSLV SDR +RS+TQEK K  E SN L+N      K
Sbjct: 321 VNKPSQLGRKDKQTSKSRKKQYMLRSLVHSDRVLRSRTQEKLKSHELSNTLSNIGNGVEK 380

Query: 238 GKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKE 297
             K+ +  +G     DEFS IR  L+Y  NRI YEQNLI+AYSSEGWKG S +KLKPEKE
Sbjct: 381 RMKERKKRRGTRVIADEFSRIRKRLKYFFNRIHYEQNLIDAYSSEGWKGTSLEKLKPEKE 440

Query: 298 LQRASNEIMRRKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSLE 357
           LQRA +EI RRKLKIRD+FQ++D+LC EG   KSLFDS+GQIDSEDIFCAKCGSK++S  
Sbjct: 441 LQRAKSEIFRRKLKIRDLFQQLDSLCAEGRFPKSLFDSEGQIDSEDIFCAKCGSKDMSAN 500

Query: 358 NDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLLNEFQGSRLSI 417
           NDIILCDG CDRGFHQFCLEPPLL+ DIPPDDEGWLCPGCDCK DC +LLN+  G+ LS+
Sbjct: 501 NDIILCDGACDRGFHQFCLEPPLLSEDIPPDDEGWLCPGCDCKVDCFDLLNDSYGTNLSV 560

Query: 418 TDGWEKVYPEAAASA-AGRNFDHALGLPSDDSEDDDYDPDVPDTI--VQDDESSSETSGY 477
           TD WEKV+PEAAA+A  G++ DH L  PSDDSEDDDYDP  P+ +  V+ DESSS+ S Y
Sbjct: 561 TDSWEKVFPEAAAAAREGKDQDHNLEFPSDDSEDDDYDPYGPEIVEKVEGDESSSDESEY 620

Query: 478 ASASEEL--ESPSNVDQYLGLPSDDSDDDDYDPSAPERDEDVRQESSSSDFTSDSEDLA- 537
            SA +EL  E+P   +QY GL SDDS+D+D+DP   + DE+ +QESSSSDFTSDSEDLA 
Sbjct: 621 TSACDELEGEAPPKDEQYFGLSSDDSEDNDFDPDDQDVDENAKQESSSSDFTSDSEDLAF 680

Query: 538 ALDSSPSSKADNLVSSSLNNTTSTKNPDGRSSGGGPRKSALYNELSSLLES-----DPEP 597
            LD    ++ D +  SSL+ T S  N   +SS  G  KS++ +EL  +LES        P
Sbjct: 681 TLDEGQIAEKDEV--SSLDPTRSLGNAVMQSSKRGGNKSSIKDELLDILESGTGQDGSPP 740

Query: 598 VLGRRQVERLDYKKLHDETYGNVPTDSSDDTYASISTDSSDDQGWDSNTRKRSPKTLVLA 657
           + G+R VERLDYK+LHDETYG++P+DSSDD   +        +         SP      
Sbjct: 741 ISGKRHVERLDYKRLHDETYGHLPSDSSDDEDWTDYAAPRKRKRTTGQVSSVSPNENASI 800

Query: 658 LPNYRTNDDMTN--VKTKHSSKRGTRQKAAAVNMNKSVTKTPEDTGKASSSVRRTTPSSY 717
           + N  T D   N     ++  +R +RQ +   + N    K  + + K+ S+ RR   S+ 
Sbjct: 801 IKNQTTTDAANNDLEDNEYVPRRRSRQNSVVTDENNIPNKLLQGSPKSGSTGRRRELSTN 860

Query: 718 RRLSQLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGN 777
           RRL +   +RL  SF+ENQY +RATKESLAQELGL+  QVSKWF N RWS RH SS +  
Sbjct: 861 RRLGEAVTQRLYQSFKENQYLDRATKESLAQELGLTSYQVSKWFENARWSYRHSSSKKPG 920

Query: 778 KAKSSSRMGIRSSQASGELHQPEQEF--------GAQHQELPTTDSVVAPCQSGDTGD-- 828
            ++ +S+    S Q + +L + E           GA + ELP T + +    SGD GD  
Sbjct: 921 ISEHASKESTLSPQTNKKLFETELNTSITNSTCNGALNNELPRTGNAMPESCSGDVGDGK 980

BLAST of Cp4.1LG14g04420 vs. TrEMBL
Match: A0A061E032_THECC (Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain, putative isoform 1 OS=Theobroma cacao GN=TCM_007171 PE=4 SV=1)

HSP 1 Score: 550.1 bits (1416), Expect = 5.0e-153
Identity = 347/741 (46.83%), Positives = 464/741 (62.62%), Query Frame = 1

Query: 64  EKPEVQQNMEEENKELGSGDVLSELSEKHNRTFSNLADNDQVEA---GNLLCCDKDTENL 123
           E PE +Q ++ E+   G  +    +S   +     L   D  ++   G+L    +   N+
Sbjct: 216 ESPEQRQQLDSESLPNGIEESTIAVSSNVSNQALQLKPEDMGKSHCGGHLHSPPEGVTNV 275

Query: 124 IVPIEVETTTLLVDCSELPPEVVNKN-YIEQMNPPIEELTQNTPFQNLETVPSNSEQSD- 183
           I      + + LV+   LP E    N   +Q   P E++ QN+  +  ET P N  ++  
Sbjct: 276 IQ----SSKSPLVEPLGLPQEFAQGNPSTQQSGLPCEDMAQNSGVEQHETKPKNLLENSG 335

Query: 184 -HKDKRILKSMKIKSILRSLVSSDRNMRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERN 243
             ++ +  K++K K +LRSL SSDR +RSK QEK K  E SN+L +  + E + ++K R 
Sbjct: 336 RRRNGKTSKTIKKKYMLRSLRSSDRVLRSKLQEKPKATESSNNLADVGSSEQQKRRKRRR 395

Query: 244 IQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNE 303
            +      DEFS IR HLRYLLNRI YE++LI AYS+EGWKG S +KLKPEKELQRA++E
Sbjct: 396 RKANREVADEFSRIRTHLRYLLNRINYERSLIAAYSTEGWKGLSLEKLKPEKELQRATSE 455

Query: 304 IMRRKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSLENDIILCD 363
           I+RRKLKIRD+FQ ID+LC EG L +SLFDS+GQIDSEDIFCAKCGSK+LS  NDIILCD
Sbjct: 456 ILRRKLKIRDLFQHIDSLCAEGKLPESLFDSEGQIDSEDIFCAKCGSKDLSANNDIILCD 515

Query: 364 GICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLLNEFQGSRLSITDGWEKV 423
           G CDRGFHQ+CL+PPLL  DIPPDDEGWLCPGCDCK DC+ L+NE QG+  SITD WEKV
Sbjct: 516 GACDRGFHQYCLQPPLLKEDIPPDDEGWLCPGCDCKVDCIELVNESQGTSFSITDSWEKV 575

Query: 424 YPEAAASAAGRNFDHALGLPSDDSEDDDYDPDVPDTIVQD--DESSSETSGYASASEELE 483
           +PEAA +AAG+N D   GLPSDDS+D+DY+PD  +T  +D  DESSSE S + S SEELE
Sbjct: 576 FPEAAVAAAGQNQDPNFGLPSDDSDDNDYNPDGSETDEKDHGDESSSEESEFTSTSEELE 635

Query: 484 SPSNVDQYLGLPSDDSDDDDYDPSAPERDEDVRQESSSSDFTSDSEDLAAL--DSSPSSK 543
            P+ VDQYLGLPSDDS+DDDYDP  P  DE V+ ESSSSDF+SDSEDL A+  +   S K
Sbjct: 636 VPAKVDQYLGLPSDDSEDDDYDPDGPNHDEVVKPESSSSDFSSDSEDLDAMLEEDITSQK 695

Query: 544 ADNLVSSSLNNTTSTKNPDGRSSGGGPRKSALYNELSSLL----ESDPEPVLGRRQVERL 603
            +  +++S    +  + P          K ++ +EL S++    E D   +  +R +ERL
Sbjct: 696 DEGPMANSAPRDSKRRKPK------LGEKESMNDELLSIMEPASEQDGSAISKKRSIERL 755

Query: 604 DYKKLHDETYGNVPTDSSDDTYASISTDSSDDQGWDSNT--RKRSPKTLVLALPNYRTND 663
           DYK+L+DETYGNVP            + SSDD+ W   T  RKR+  T  +A      N 
Sbjct: 756 DYKRLYDETYGNVP------------SSSSDDEDWSDITAPRKRNKCTAEVASAPENGNV 815

Query: 664 DMTNV------------KTKHSSKRGTRQKAAAVNMNKSVTKTPEDTGKASSSVRRTTPS 723
            ++              +T+H  +R TRQ +   + + S  +   +T  + SS ++   S
Sbjct: 816 SVSRTVSVSDGLKQNPEETEHKPRRKTRQMSRFKDTDSSPAEIQGNTSVSGSSGKKAGSS 875

Query: 724 SYRRLSQLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVE 777
           +Y+RL +   +RL  SF+ENQYP+RATK+SLA+EL ++ +QVSKWF N RWS  +  S  
Sbjct: 876 TYKRLGEAVKQRLYKSFKENQYPDRATKQSLAKELDMTFQQVSKWFDNARWSFNNSPSSH 934

BLAST of Cp4.1LG14g04420 vs. TAIR10
Match: AT3G19510.1 (AT3G19510.1 Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain)

HSP 1 Score: 413.3 bits (1061), Expect = 3.7e-115
Identity = 260/564 (46.10%), Positives = 354/564 (62.77%), Query Frame = 1

Query: 207 KTQEKDKDPEPSNDLNNFTAEEGKGKKKERNI-QGKGARVDEFSSIRNHLRYLLNRIKYE 266
           + Q   +D  PS+ + N T   G+ KKK + + +G+    DE++ I+  LRY LNRI YE
Sbjct: 136 RAQRSKEDAGPSSVVANSTPV-GRPKKKNKTMNKGQVREDDEYTRIKKKLRYFLNRINYE 195

Query: 267 QNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDVFQRIDALCGEGGLSKSL 326
           Q+LI+AYS EGWKG S +K++PEKEL+RA+ EI+RRKLKIRD+FQ +D LC EG L +SL
Sbjct: 196 QSLIDAYSLEGWKGSSLEKIRPEKELERATKEILRRKLKIRDLFQHLDTLCAEGSLPESL 255

Query: 327 FDSQGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGW 386
           FD+ G+I SEDIFCAKCGSK+LS++NDIILCDG CDRGFHQ+CLEPPL   DIPPDDEGW
Sbjct: 256 FDTDGEISSEDIFCAKCGSKDLSVDNDIILCDGFCDRGFHQYCLEPPLRKEDIPPDDEGW 315

Query: 387 LCPGCDCKDDCLNLLNEFQGSRLSITDGWEKVYPEAAASAAGRNFDHALGLPSDDSEDDD 446
           LCPGCDCKDD L+LLN+  G++ S++D WEK++PEAAA+  G   +    LPSDDS+D++
Sbjct: 316 LCPGCDCKDDSLDLLNDSLGTKFSVSDSWEKIFPEAAAALVGGGQNLDCDLPSDDSDDEE 375

Query: 447 YDPDVPDTIVQDDESS------------SETSGYASASEEL-----ESPSNVDQYLGLPS 506
           YDPD  +    D++ S            S+ + + SAS+E+     E    +   + LPS
Sbjct: 376 YDPDCLNDNENDEDGSDDNEESENEDGSSDETEFTSASDEMIESFKEGKDIMKDVMALPS 435

Query: 507 DDSDDDDYDPSAPERDEDVRQESSSSDFTSDSEDLAALDSSPSSKADNLVSSSLNNTTST 566
           DDS+DDDYDP AP  D+D  +ESS+SD TSD+EDL       S K D   ++     T  
Sbjct: 436 DDSEDDDYDPDAPTCDDD--KESSNSDCTSDTEDLET-----SFKGDE--TNQQAEDTPL 495

Query: 567 KNPDGRSSGGGPRKSALYNELSSLLESDPEPVLGRRQVERLDYKKLHDETYGNVPTDSSD 626
           ++P GR +      + L +++   L+  P  V  RR VERLDYKKL+DE Y NVPT SSD
Sbjct: 496 EDP-GRQTSQLQGDAILESDVG--LDDGPAGVSRRRNVERLDYKKLYDEEYDNVPTSSSD 555

Query: 627 D-----TYASISTDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDMTNVKTKHSSKRGTR 686
           D     T      DS  +   D+   K+S              +D T+ K    SKR  +
Sbjct: 556 DDDWDKTARMGKEDSESEDEGDTVPLKQSSNA-----------EDHTSKKLIRKSKRADK 615

Query: 687 QKAAAVNMNKSVTKTPEDTGKASSSVRRTTPSSYRRLSQLALERLLASFQENQYPERATK 746
           +    +       + P + G  S  + +++ S+ ++ +    +RL  SFQENQYP++ATK
Sbjct: 616 KDTLEMPQ-----EGPGENG-GSGEIEKSSSSACKQ-TDPKTQRLYISFQENQYPDKATK 668

Query: 747 ESLAQELGLSVKQVSKWFTNTRWS 748
           ESLA+EL ++VKQV+ WF + RWS
Sbjct: 676 ESLAKELQMTVKQVNNWFKHRRWS 668

BLAST of Cp4.1LG14g04420 vs. TAIR10
Match: AT4G29940.1 (AT4G29940.1 pathogenesis related homeodomain protein A)

HSP 1 Score: 194.9 bits (494), Expect = 2.1e-49
Identity = 105/287 (36.59%), Positives = 166/287 (57.84%), Query Frame = 1

Query: 205 RSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKY 264
           +S+T++  +      ++     ++ + +K +R  +     VD+   ++   RYLL ++K 
Sbjct: 59  KSRTKKYSRGWVRCEEMEEEKVKKTRKRKSKRQQKDNKVEVDDSLRLQRRTRYLLIKMKM 118

Query: 265 EQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDVFQRIDALCGEGGLSKS 324
           +QNLI+AY++EGWKG S +K++P+KEL+RA  EI+  KL +RD  +++D L   G + + 
Sbjct: 119 QQNLIDAYATEGWKGQSREKIRPDKELERARKEILNCKLGLRDAIRQLDLLSSVGSMEEK 178

Query: 325 LFDSQGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEG 384
           +  S G I  + IFCA+C S+E   +NDIILCDG C+R FHQ CL+PPL    IPP D+G
Sbjct: 179 VIASDGSIHHDHIFCAECNSREAFPDNDIILCDGTCNRAFHQKCLDPPLETESIPPGDQG 238

Query: 385 WLCPGCDCKDDCLNLLNEFQGSRLSITDGWEKVYPEAAASAAG--RNFDHALGLPSDDSE 444
           W C  CDCK + ++ +N   G+   +   W+ ++ E A+   G     ++    PSDDS+
Sbjct: 239 WFCKFCDCKIEIIDTMNAQIGTHFPVDSNWQDIFNEEASLPIGSEATVNNEADWPSDDSK 298

Query: 445 DDDYDPDVPDTIVQDDESSSETSGYASASEELESPSNVDQYLGLPSD 490
           DDDYDP++ +       +SS  SG      + ES   +   L L SD
Sbjct: 299 DDDYDPEMRE---NGGGNSSNVSGDGGGDNDEES---ISTSLSLSSD 339

BLAST of Cp4.1LG14g04420 vs. NCBI nr
Match: gi|778679986|ref|XP_011651230.1| (PREDICTED: homeobox protein HOX1A [Cucumis sativus])

HSP 1 Score: 1169.8 bits (3025), Expect = 0.0e+00
Identity = 673/906 (74.28%), Positives = 742/906 (81.90%), Query Frame = 1

Query: 1    MEERDEYT--ESRSNNNAEAVQEAKISVEAEMPTCLSNEQKHSVPDYHELEATPEYTNKT 60
            MEERDE T  ESR N  AEAVQEAK SVE E+ TCLSNE K+S   Y EL  TPE+++K 
Sbjct: 146  MEERDENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNEAKYS--GYQELGTTPEFSSKI 205

Query: 61   GGPDEEKPEVQQNMEEENKELGSGDVLSELSEKHNRTFSNLADNDQVEAGNLLCCDKDTE 120
             GPDEEK  VQQNME     LGSG +LSELSEK N+T SN ADND+VEAGNLL  DKDT+
Sbjct: 206  DGPDEEKAGVQQNME-----LGSGYLLSELSEKDNQTISNHADNDRVEAGNLLSNDKDTK 265

Query: 121  NLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPIEELTQNTPFQNLETVPSNSEQSD 180
            NL + IE E TTLL +CSELP E V KNYIE+MNPPI +LTQ T  Q+LET+PSNS+QS 
Sbjct: 266  NLKLSIEDEATTLLNECSELPLEDVTKNYIEKMNPPIGDLTQITSIQSLETIPSNSQQSA 325

Query: 181  HKDKRILKSMKIKSILRSLVSSDRNMRSKTQEKDKDPEPSNDLNNFTAEEG--KGKKKER 240
             KDK  LKS K    LRS VSSDR +RS+TQEK K PE SNDLNNFTAEE   + KKK+R
Sbjct: 326  RKDKIFLKSKKKNYKLRSHVSSDRVLRSRTQEKAKAPERSNDLNNFTAEEDGKRKKKKKR 385

Query: 241  NIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASN 300
            NIQGKGARVDE+SSIRNHLRYLLNRI+YEQ+LIEAYSSEGWKGFSSDKLKPEKELQRASN
Sbjct: 386  NIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASN 445

Query: 301  EIMRRKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSLENDIILC 360
            EIMRRKLKIRD+FQRIDALC EG LS+SLFDS+GQIDSEDIFCAKCGSKELSLENDIILC
Sbjct: 446  EIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILC 505

Query: 361  DGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLLNEFQGSRLSITDGWEK 420
            DGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCL+LLNEFQGS LSITDGWEK
Sbjct: 506  DGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEK 565

Query: 421  VYPEAAASAAGRNFDHALGLPSDDSEDDDYDPDVPDTIVQDDE---------------SS 480
            VYPEAAA+AAGRN DH LGLPSDDSED DYDPDVPDTI QD+E               S+
Sbjct: 566  VYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNELSSDESSSDQSNSDPSN 625

Query: 481  SETSGYASASEELESPSNVDQYLGLPSDDSDDDDYDPSAPERDEDVRQESSSSDFTSDSE 540
            S+TSGYASASE LE  SN DQYLGLPSDDS+D+DYDPS PE DE VRQESSSSDFTSDSE
Sbjct: 626  SDTSGYASASEGLEVSSNDDQYLGLPSDDSEDNDYDPSVPELDEGVRQESSSSDFTSDSE 685

Query: 541  DLAALDSSPSSKADNLVSSSLNNTTSTKNPDGRSSGGGPRKSALYNELSSLLESDP---- 600
            DLAALD++ SSK  +LV SSLNNT   KN +G+SS  GP KSAL+NELSSLL+S P    
Sbjct: 686  DLAALDNNCSSKDGDLV-SSLNNTLPVKNSNGQSS--GPNKSALHNELSSLLDSGPDKDG 745

Query: 601  -EPVLGRRQVERLDYKKLHDETYGNVPTDSSDDTYASISTDSSDDQGWDSNTRKRSPKTL 660
             EPV GRRQVERLDYKKLHDETYGNVPTDSSDDTY S + DSSDD+GWDS TRKR PKTL
Sbjct: 746  LEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGS-TLDSSDDRGWDSGTRKRGPKTL 805

Query: 661  VLALPNYRTNDDMTNVKTKHSSKRGTRQKAAAVNMNKSVTKTPEDTGKASSSVRRTTPSS 720
            VLAL N  +NDD+TNVKTK S KR TRQK  A+N+N SVT+TP DT K+SSSV+++T SS
Sbjct: 806  VLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVKKSTSSS 865

Query: 721  YRRLSQLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEG 780
             RRLSQ ALERLLASFQEN+YP+RATK+SLAQELGL +KQVSKWF NTRWSTRHPSS  G
Sbjct: 866  NRRLSQPALERLLASFQENEYPKRATKQSLAQELGLGLKQVSKWFENTRWSTRHPSS-SG 925

Query: 781  NKAKSSSRMGIRSSQASGELHQPEQEF----------GAQHQELPTTDSVVAPCQSGDTG 840
             KAKSSSRM I  SQASGEL + E E           GA+HQ+LP  +SVVA CQSGDTG
Sbjct: 926  KKAKSSSRMSIYLSQASGELSKNEPESATCFRDTDSNGARHQDLPMANSVVASCQSGDTG 985

Query: 841  DVKLATQETKRSEFSAAKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKT 873
            D KL++++TKR++ SA KSRKRKGRSD+ AS SKD + S RPPAKSPKVNE+QTA   KT
Sbjct: 986  DKKLSSRKTKRADSSATKSRKRKGRSDNTASHSKDREGSPRPPAKSPKVNEMQTADRFKT 1039

BLAST of Cp4.1LG14g04420 vs. NCBI nr
Match: gi|659112348|ref|XP_008456177.1| (PREDICTED: pathogenesis-related homeodomain protein isoform X1 [Cucumis melo])

HSP 1 Score: 1168.7 bits (3022), Expect = 0.0e+00
Identity = 673/901 (74.69%), Positives = 736/901 (81.69%), Query Frame = 1

Query: 1    MEERDEYT--ESRSNNNAEAVQEAKISVEAEMPTCLSNEQKHSVPDYHELEATPEYTNKT 60
            MEERDE T  ESR N  AEAVQEAK SVE E+ TCLSNE  +S   Y EL  TPE++ KT
Sbjct: 175  MEERDENTDTESRPNKIAEAVQEAKASVEVEVRTCLSNEPMYS--GYQELGTTPEFSRKT 234

Query: 61   GGPDEEKPEVQQNMEEENKELGSGDVLSELSEKHNRTFSNLADNDQVEAGNLLCCDKDTE 120
             GPDEEK  VQQNME     LGSG +LSELSEK N+T SN ADNDQVEAGN L  DKDT+
Sbjct: 235  DGPDEEKAGVQQNME-----LGSGYLLSELSEKDNQTISNHADNDQVEAGNSLSIDKDTK 294

Query: 121  NLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPIEELTQNTPFQNLETVPSNSEQSD 180
            NL + IE ETTTLL +CSELP E V KNYIE+MNPPIE+LTQ T  Q+LET+PSNS+Q D
Sbjct: 295  NLKLSIEDETTTLLNECSELPLEDVTKNYIEKMNPPIEDLTQITSIQSLETIPSNSQQLD 354

Query: 181  HKDKRILKSMKIKSILRSLVSSDRNMRSKTQEKDKDPEPSNDLNNFTAEEG--KGKKKER 240
            HKD+R  KS K    LRSLVSSDR +RS+TQEK K PEPSNDLNNFTAEE   + KKK+R
Sbjct: 355  HKDERFFKSKKKNYKLRSLVSSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKR 414

Query: 241  NIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASN 300
            NIQGKGARVDE+SSIRNHLRYLLNRI+YEQ+LIEAYSSEGWKGFSSDKLKPEKELQRASN
Sbjct: 415  NIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASN 474

Query: 301  EIMRRKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSLENDIILC 360
            EIMRRKLKIRD+FQRID LC EG LS+SLFDS+GQIDSEDIFCAKCGSKELSLENDIILC
Sbjct: 475  EIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILC 534

Query: 361  DGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLLNEFQGSRLSITDGWEK 420
            DGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCL+LLNEFQGS LSITDGWEK
Sbjct: 535  DGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEK 594

Query: 421  VYPEAAASAAGRNFDHALGLPSDDSEDDDYDPDVPDTIVQD----------DESSSETSG 480
            VYPEAAA AAGRN D  LGLPSDDSED DYDPD+PDTI QD          D+S+S+TSG
Sbjct: 595  VYPEAAA-AAGRNSDDTLGLPSDDSEDGDYDPDIPDTIDQDNELSSDESSSDQSNSDTSG 654

Query: 481  YASASEELESPSNVDQYLGLPSDDSDDDDYDPSAPERDEDVRQESSSSDFTSDSEDLAAL 540
            YASASE LE P N DQYLGLPSDDS+D+DYDPS PE DE  RQESSSSDFTSDSEDLAAL
Sbjct: 655  YASASEGLEVPPNDDQYLGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAAL 714

Query: 541  DSSPSSKADNLVSSSLNNTTSTKNPDGRSSGGGPRKSALYNELSSLLES-----DPEPVL 600
            +++ SSK D+LV SSLNNT   KN +GRSS  GP KS L+NELSSLL+S       EP+ 
Sbjct: 715  ENNCSSKDDDLV-SSLNNTLPVKNTNGRSS--GPSKSTLHNELSSLLDSGLDKDGLEPIS 774

Query: 601  GRRQVERLDYKKLHDETYGNVPTDSSDDTYASISTDSSDDQGWDSNTRKRSPKTLVLALP 660
            GRRQVERLDYKKLHDETYGNVPT+SSDDTY S + DSSDD+G DS TRKR PKTLVLAL 
Sbjct: 775  GRRQVERLDYKKLHDETYGNVPTESSDDTYGS-TLDSSDDRGCDSGTRKRGPKTLVLALS 834

Query: 661  NYRTNDDMTNVKTKHSSKRGTRQKAAAVNMNKSVTKTPEDTGKASSSVRRTTPSSYRRLS 720
            N  +NDD+TNVKTK S KR TRQK  A+N+N SVT+TP DT K+SSSVR+ T SS RRLS
Sbjct: 835  NNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQCTSSSNRRLS 894

Query: 721  QLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKS 780
            Q ALERL ASFQEN+YP+RATKESLAQELGL++KQVSKWF NTRWSTRHPSS  G KAKS
Sbjct: 895  QPALERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRWSTRHPSS-GGKKAKS 954

Query: 781  SSRMGIRSSQASGELHQPEQEF----------GAQHQELPTTDSVVAPCQSGDTGDVKLA 840
            SSRM I  SQASGEL + EQE           GA+HQ+LP  +SVVA CQSGDTGD KL 
Sbjct: 955  SSRMSIHLSQASGELSKNEQESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLT 1014

Query: 841  TQETKRSEFSAAKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRRRNS 873
            T++TKR E SA KSRKRKGRSD+ AS SKD + S RPPAKSPKVNE QTA   KTRRR S
Sbjct: 1015 TRKTKRGESSATKSRKRKGRSDNTASNSKDREGSPRPPAKSPKVNETQTADRFKTRRRRS 1062

BLAST of Cp4.1LG14g04420 vs. NCBI nr
Match: gi|659112354|ref|XP_008456180.1| (PREDICTED: homeobox protein HAT3.1 isoform X2 [Cucumis melo])

HSP 1 Score: 1168.7 bits (3022), Expect = 0.0e+00
Identity = 673/901 (74.69%), Positives = 736/901 (81.69%), Query Frame = 1

Query: 1    MEERDEYT--ESRSNNNAEAVQEAKISVEAEMPTCLSNEQKHSVPDYHELEATPEYTNKT 60
            MEERDE T  ESR N  AEAVQEAK SVE E+ TCLSNE  +S   Y EL  TPE++ KT
Sbjct: 117  MEERDENTDTESRPNKIAEAVQEAKASVEVEVRTCLSNEPMYS--GYQELGTTPEFSRKT 176

Query: 61   GGPDEEKPEVQQNMEEENKELGSGDVLSELSEKHNRTFSNLADNDQVEAGNLLCCDKDTE 120
             GPDEEK  VQQNME     LGSG +LSELSEK N+T SN ADNDQVEAGN L  DKDT+
Sbjct: 177  DGPDEEKAGVQQNME-----LGSGYLLSELSEKDNQTISNHADNDQVEAGNSLSIDKDTK 236

Query: 121  NLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPIEELTQNTPFQNLETVPSNSEQSD 180
            NL + IE ETTTLL +CSELP E V KNYIE+MNPPIE+LTQ T  Q+LET+PSNS+Q D
Sbjct: 237  NLKLSIEDETTTLLNECSELPLEDVTKNYIEKMNPPIEDLTQITSIQSLETIPSNSQQLD 296

Query: 181  HKDKRILKSMKIKSILRSLVSSDRNMRSKTQEKDKDPEPSNDLNNFTAEEG--KGKKKER 240
            HKD+R  KS K    LRSLVSSDR +RS+TQEK K PEPSNDLNNFTAEE   + KKK+R
Sbjct: 297  HKDERFFKSKKKNYKLRSLVSSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKR 356

Query: 241  NIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASN 300
            NIQGKGARVDE+SSIRNHLRYLLNRI+YEQ+LIEAYSSEGWKGFSSDKLKPEKELQRASN
Sbjct: 357  NIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASN 416

Query: 301  EIMRRKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSLENDIILC 360
            EIMRRKLKIRD+FQRID LC EG LS+SLFDS+GQIDSEDIFCAKCGSKELSLENDIILC
Sbjct: 417  EIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILC 476

Query: 361  DGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLLNEFQGSRLSITDGWEK 420
            DGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCL+LLNEFQGS LSITDGWEK
Sbjct: 477  DGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEK 536

Query: 421  VYPEAAASAAGRNFDHALGLPSDDSEDDDYDPDVPDTIVQD----------DESSSETSG 480
            VYPEAAA AAGRN D  LGLPSDDSED DYDPD+PDTI QD          D+S+S+TSG
Sbjct: 537  VYPEAAA-AAGRNSDDTLGLPSDDSEDGDYDPDIPDTIDQDNELSSDESSSDQSNSDTSG 596

Query: 481  YASASEELESPSNVDQYLGLPSDDSDDDDYDPSAPERDEDVRQESSSSDFTSDSEDLAAL 540
            YASASE LE P N DQYLGLPSDDS+D+DYDPS PE DE  RQESSSSDFTSDSEDLAAL
Sbjct: 597  YASASEGLEVPPNDDQYLGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAAL 656

Query: 541  DSSPSSKADNLVSSSLNNTTSTKNPDGRSSGGGPRKSALYNELSSLLES-----DPEPVL 600
            +++ SSK D+LV SSLNNT   KN +GRSS  GP KS L+NELSSLL+S       EP+ 
Sbjct: 657  ENNCSSKDDDLV-SSLNNTLPVKNTNGRSS--GPSKSTLHNELSSLLDSGLDKDGLEPIS 716

Query: 601  GRRQVERLDYKKLHDETYGNVPTDSSDDTYASISTDSSDDQGWDSNTRKRSPKTLVLALP 660
            GRRQVERLDYKKLHDETYGNVPT+SSDDTY S + DSSDD+G DS TRKR PKTLVLAL 
Sbjct: 717  GRRQVERLDYKKLHDETYGNVPTESSDDTYGS-TLDSSDDRGCDSGTRKRGPKTLVLALS 776

Query: 661  NYRTNDDMTNVKTKHSSKRGTRQKAAAVNMNKSVTKTPEDTGKASSSVRRTTPSSYRRLS 720
            N  +NDD+TNVKTK S KR TRQK  A+N+N SVT+TP DT K+SSSVR+ T SS RRLS
Sbjct: 777  NNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQCTSSSNRRLS 836

Query: 721  QLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKS 780
            Q ALERL ASFQEN+YP+RATKESLAQELGL++KQVSKWF NTRWSTRHPSS  G KAKS
Sbjct: 837  QPALERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRWSTRHPSS-GGKKAKS 896

Query: 781  SSRMGIRSSQASGELHQPEQEF----------GAQHQELPTTDSVVAPCQSGDTGDVKLA 840
            SSRM I  SQASGEL + EQE           GA+HQ+LP  +SVVA CQSGDTGD KL 
Sbjct: 897  SSRMSIHLSQASGELSKNEQESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLT 956

Query: 841  TQETKRSEFSAAKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRRRNS 873
            T++TKR E SA KSRKRKGRSD+ AS SKD + S RPPAKSPKVNE QTA   KTRRR S
Sbjct: 957  TRKTKRGESSATKSRKRKGRSDNTASNSKDREGSPRPPAKSPKVNETQTADRFKTRRRRS 1004

BLAST of Cp4.1LG14g04420 vs. NCBI nr
Match: gi|700202354|gb|KGN57487.1| (hypothetical protein Csa_3G198510 [Cucumis sativus])

HSP 1 Score: 828.9 bits (2140), Expect = 8.0e-237
Identity = 472/620 (76.13%), Positives = 512/620 (82.58%), Query Frame = 1

Query: 1   MEERDEYT--ESRSNNNAEAVQEAKISVEAEMPTCLSNEQKHSVPDYHELEATPEYTNKT 60
           MEERDE T  ESR N  AEAVQEAK SVE E+ TCLSNE K+S   Y EL  TPE+++K 
Sbjct: 146 MEERDENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNEAKYS--GYQELGTTPEFSSKI 205

Query: 61  GGPDEEKPEVQQNMEEENKELGSGDVLSELSEKHNRTFSNLADNDQVEAGNLLCCDKDTE 120
            GPDEEK  VQQNME     LGSG +LSELSEK N+T SN ADND+VEAGNLL  DKDT+
Sbjct: 206 DGPDEEKAGVQQNME-----LGSGYLLSELSEKDNQTISNHADNDRVEAGNLLSNDKDTK 265

Query: 121 NLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPIEELTQNTPFQNLETVPSNSEQSD 180
           NL + IE E TTLL +CSELP E V KNYIE+MNPPI +LTQ T  Q+LET+PSNS+QS 
Sbjct: 266 NLKLSIEDEATTLLNECSELPLEDVTKNYIEKMNPPIGDLTQITSIQSLETIPSNSQQSA 325

Query: 181 HKDKRILKSMKIKSILRSLVSSDRNMRSKTQEKDKDPEPSNDLNNFTAEEG--KGKKKER 240
            KDK  LKS K    LRS VSSDR +RS+TQEK K PE SNDLNNFTAEE   + KKK+R
Sbjct: 326 RKDKIFLKSKKKNYKLRSHVSSDRVLRSRTQEKAKAPERSNDLNNFTAEEDGKRKKKKKR 385

Query: 241 NIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASN 300
           NIQGKGARVDE+SSIRNHLRYLLNRI+YEQ+LIEAYSSEGWKGFSSDKLKPEKELQRASN
Sbjct: 386 NIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASN 445

Query: 301 EIMRRKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSLENDIILC 360
           EIMRRKLKIRD+FQRIDALC EG LS+SLFDS+GQIDSEDIFCAKCGSKELSLENDIILC
Sbjct: 446 EIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILC 505

Query: 361 DGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLLNEFQGSRLSITDGWEK 420
           DGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCL+LLNEFQGS LSITDGWEK
Sbjct: 506 DGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEK 565

Query: 421 VYPEAAASAAGRNFDHALGLPSDDSEDDDYDPDVPDTIVQDDE---------------SS 480
           VYPEAAA+AAGRN DH LGLPSDDSED DYDPDVPDTI QD+E               S+
Sbjct: 566 VYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNELSSDESSSDQSNSDPSN 625

Query: 481 SETSGYASASEELESPSNVDQYLGLPSDDSDDDDYDPSAPERDEDVRQESSSSDFTSDSE 540
           S+TSGYASASE LE  SN DQYLGLPSDDS+D+DYDPS PE DE VRQESSSSDFTSDSE
Sbjct: 626 SDTSGYASASEGLEVSSNDDQYLGLPSDDSEDNDYDPSVPELDEGVRQESSSSDFTSDSE 685

Query: 541 DLAALDSSPSSKADNLVSSSLNNTTSTKNPDGRSSGGGPRKSALYNELSSLLESDP---- 597
           DLAALD++ SSK  +LV SSLNNT   KN +G+SS  GP KSAL+NELSSLL+S P    
Sbjct: 686 DLAALDNNCSSKDGDLV-SSLNNTLPVKNSNGQSS--GPNKSALHNELSSLLDSGPDKDG 745

BLAST of Cp4.1LG14g04420 vs. NCBI nr
Match: gi|643738525|gb|KDP44446.1| (hypothetical protein JCGZ_16279 [Jatropha curcas])

HSP 1 Score: 583.2 bits (1502), Expect = 7.7e-163
Identity = 386/828 (46.62%), Positives = 508/828 (61.35%), Query Frame = 1

Query: 69  QQNMEEENKELGSGDVLSELSEKHNRTFSNLADNDQVEAGNLLCCDKDTENLIVPIEVET 128
           Q +  E+  E  S D   +  E+     S+L  ++ VE  N L C   T +L   +  ++
Sbjct: 183 QLSTSEQKVEFASDDATCDPLEESKVPASDLLRDELVEINNELSCCTATRHLGTQLTTKS 242

Query: 129 TTLLVDCSELPPEV-VNKNYIEQMNPPIEELTQNTPFQNLET----VPSNSEQSDHKDKR 188
           + L  +   +P +  +N    E++ PP + +  +   Q  +T    V  NS +   + KR
Sbjct: 243 SPL--EHLGMPSDSEINTCATEKLEPPHDNMDNHLNLQQSDTPSKDVSINSSRVGVRVKR 302

Query: 189 ILKSMKIKSILRSLVSSDRNMRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGA 248
             KS + K +LRSL  SDR  +S++QEK K P+P+ D+ N ++   K +KK +  Q K  
Sbjct: 303 TAKSTRKKYVLRSLRRSDRVRQSRSQEKPKGPDPNADMANASSNIEKTRKKRKKRQRKSV 362

Query: 249 RVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKL 308
             DE+S IR HLRYLLNRI YEQ+LI AYS+EGWKG S +KLKPEKELQRA++EI+RRKL
Sbjct: 363 EGDEYSRIRKHLRYLLNRISYEQSLITAYSAEGWKGLSLEKLKPEKELQRATSEILRRKL 422

Query: 309 KIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSLENDIILCDGICDRG 368
           KIRD+FQR+D+LC EG L +SLFDS GQI SEDIFCAKCGSK+++ +NDIILCDG CDRG
Sbjct: 423 KIRDLFQRVDSLCAEGRLPESLFDSDGQISSEDIFCAKCGSKDMTADNDIILCDGACDRG 482

Query: 369 FHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLLNEFQGSRLSITDGWEKVYPEAAA 428
           FHQFCL PPLL  DIPPDDEGWLCPGCDCK DC+ LLN+ QG+ +SI+D WEKV+PEAAA
Sbjct: 483 FHQFCLLPPLLKEDIPPDDEGWLCPGCDCKVDCIELLNDSQGTNISISDRWEKVFPEAAA 542

Query: 429 SAAGRNFDHALGLPSDDSEDDDYDPDVP--DTIVQDDESSSETSGYASASEELESPSNVD 488
             AG+N D   GLPSDDS+D+DYDPD P  D   Q DESS++ S Y SAS+ELE+    +
Sbjct: 543 --AGQNPDPNFGLPSDDSDDNDYDPDGPEIDEKSQGDESSNDESDYTSASDELEASPGDE 602

Query: 489 QYLGLPSDDSDDDDYDPSAPERDEDVRQESSSSDFTSDSEDLAAL--DSSPSSKADNLVS 548
           Q LGL SDDS+DDDYDP A +RDE+V +ESSSSDFTSDSEDL A   D+  S + +N +S
Sbjct: 603 QQLGLSSDDSEDDDYDPDALDRDENV-EESSSSDFTSDSEDLTATLDDNHLSGEDENHMS 662

Query: 549 SSLNNTTSTKNPDGRSSGGGPRKSALYNELSSL-LESDPE---PVLGRRQVERLDYKKLH 608
             L+        D +  G G  K + ++ELS L L S  +   P+ G+R VERLDYKKL+
Sbjct: 663 IGLHG-------DSKHRGNG--KQSTHSELSLLDLNSRKDGSGPISGKRDVERLDYKKLY 722

Query: 609 DETYGNVPTDSSDD---------------TYASISTDSSDDQGW--DSNTRKRSPKTLVL 668
           DETYGN  +DSSDD               TY S S+DSSDD+ +  D   RKR   T V 
Sbjct: 723 DETYGNASSDSSDDEDFTDDVEPRKRRKETYGSTSSDSSDDEDFIDDVEPRKRRRSTEV- 782

Query: 669 ALPNYRTNDDMTNVKTKHSSKRGTRQKAAAVNMNKSVTKTPEDTGKASSSVRRTTPSSYR 728
              +   N  ++    + ++ +  RQK+   N + S TK  E    +SSS +    S YR
Sbjct: 783 GQASVNANAFVSKTAKQDTTPKRHRQKSKFANTSTSSTKGHEGASPSSSSGKPVKSSGYR 842

Query: 729 RLSQLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNK 788
           RL +   + L  SF+ENQYP+RA KESLA+ELG++ +QVSKWF NTRWS  HP S + + 
Sbjct: 843 RLGETVTQGLYKSFKENQYPDRAKKESLAKELGITFQQVSKWFENTRWSFNHPPSTDAST 902

Query: 789 AKSSSRMGIRSSQASGELHQPEQE--------FGAQHQELPTTDSVVAPCQSGDTGDVKL 848
            + +++   +  + + EL  PE E         GAQ +E P  D        GDT D K+
Sbjct: 903 VRKTTKEDSQLPKTNTELCTPEPEKICRNTTSNGAQSEESPKVDDATGGSYIGDTRDTKM 962

Query: 849 ATQETKRSEFSAAKSRKRKGRSD-HAASCSKDSKESQRPPAKSPKVNE 858
            +QE+ + +     SRKRK  SD           E ++ P   PK  E
Sbjct: 963 GSQESCKQKSKTPDSRKRKHISDPRTLDPYSTIGEMEKIPVNLPKSQE 995
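
The percentages in the HSP summary line for this hit can be reproduced from the raw counts. The sketch below is only an illustration: the helper name `percent` is hypothetical, and it assumes each percentage is the count divided by the aligned length (828 columns, gaps included), rounded to two decimals.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the aligned length, rounded to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts copied from the HSP summary line of the Jatropha curcas hit.
identical, positives, aligned = 386, 508, 828

identity_pct = percent(identical, aligned)
positives_pct = percent(positives, aligned)

print(f"Identity = {identical}/{aligned} ({identity_pct:.2f}%)")
print(f"Positives = {positives}/{aligned} ({positives_pct:.2f}%)")
```

Both values agree with the figures reported for the hit (46.62% and 61.35%).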

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
PRH_PETCR    2.1e-120  46.17     Pathogenesis-related homeodomain protein OS=Petroselinum crispum GN=PRH PE=2 SV=...
HAT31_ARATH  6.6e-114  46.10     Homeobox protein HAT3.1 OS=Arabidopsis thaliana GN=HAT3.1 PE=2 SV=3
HOX1A_MAIZE  8.6e-98   40.69     Homeobox protein HOX1A OS=Zea mays GN=HOX1A PE=2 SV=1
PRH_ARATH    3.7e-48   36.59     Pathogenesis-related homeodomain protein OS=Arabidopsis thaliana GN=PRH PE=2 SV=...

Match Name        E-value   Identity  Description
A0A0A0LA53_CUCSA  5.6e-237  76.13     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G198510 PE=4 SV=1
A0A067L7L0_JATCU  5.3e-163  46.62     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_16279 PE=4 SV=1
M5VJJ0_PRUPE      9.1e-163  45.18     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa023106mg PE=4 SV=1
W9R947_9ROSA      7.5e-157  48.78     Homeobox protein OS=Morus notabilis GN=L484_011492 PE=4 SV=1
A0A061E032_THECC  5.0e-153  46.83     Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain, putative is...

Match Name   E-value   Identity  Description
AT3G19510.1  3.7e-115  46.10     Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain
AT4G29940.1  2.1e-49   36.59     pathogenesis related homeodomain protein A

Match Name                       E-value   Identity  Description
gi|778679986|ref|XP_011651230.1|  0.0e+00   74.28     PREDICTED: homeobox protein HOX1A [Cucumis sativus]
gi|659112348|ref|XP_008456177.1|  0.0e+00   74.69     PREDICTED: pathogenesis-related homeodomain protein isoform X1 [Cucumis melo]
gi|659112354|ref|XP_008456180.1|  0.0e+00   74.69     PREDICTED: homeobox protein HAT3.1 isoform X2 [Cucumis melo]
gi|700202354|gb|KGN57487.1|       8.0e-237  76.13     hypothetical protein Csa_3G198510 [Cucumis sativus]
gi|643738525|gb|KDP44446.1|       7.7e-163  46.62     hypothetical protein JCGZ_16279 [Jatropha curcas]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0043565  sequence-specific DNA binding
GO:0008270  zinc ion binding
GO:0005515  protein binding
GO:0003677  DNA binding
Vocabulary: Biological Process
Term        Definition
GO:0006355  regulation of transcription, DNA-templated
Vocabulary: INTERPRO
Term       Definition
IPR019787  Znf_PHD-finger
IPR019786  Zinc_finger_PHD-type_CS
IPR017970  Homeobox_CS
IPR013083  Znf_RING/FYVE/PHD
IPR011011  Znf_FYVE_PHD
IPR009057  Homeobox-like_sf
IPR001965  Znf_PHD
IPR001356  Homeobox_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0005634 nucleus
molecular_function GO:0005515 protein binding
molecular_function GO:0043565 sequence-specific DNA binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003677 DNA binding
molecular_function GO:0046872 metal ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG14g04420.1  Cp4.1LG14g04420.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                        Source   Source Term       Source Description                  Alignment
IPR001356  Homeobox domain                        PFAM     PF00046           Homeobox                            coord: 698..749; score: 6.7
IPR001356  Homeobox domain                        SMART    SM00389           HOX_1                               coord: 693..755; score: 5.0
IPR001356  Homeobox domain                        PROFILE  PS50071           HOMEOBOX_2                          coord: 691..751; score: 13
IPR001965  Zinc finger, PHD-type                  SMART    SM00249           PHD_3                               coord: 338..391; score: 1.
IPR009057  Homeodomain-like                       GENE3D   G3DSA:1.10.10.60  -                                   coord: 683..745; score: 3.5
IPR009057  Homeodomain-like                       unknown  SSF46689          Homeodomain-like                    coord: 687..751; score: 6.42
IPR011011  Zinc finger, FYVE/PHD-type             unknown  SSF57903          FYVE/PHD zinc finger                coord: 329..396; score: 5.14
IPR013083  Zinc finger, RING/FYVE/PHD-type        GENE3D   G3DSA:3.30.40.10  -                                   coord: 334..392; score: 1.1
IPR017970  Homeobox, conserved site               PROSITE  PS00027           HOMEOBOX_1                          coord: 726..749; scor
IPR019786  Zinc finger, PHD-type, conserved site  PROSITE  PS01359           ZF_PHD_1                            coord: 339..390; scor
IPR019787  Zinc finger, PHD-finger                PFAM     PF00628           PHD                                 coord: 338..393; score: 1.4
IPR019787  Zinc finger, PHD-finger                PROFILE  PS50016           ZF_PHD_2                            coord: 336..393; score: 1
None       No IPR available                       PANTHER  PTHR12628         POLYCOMB-LIKE TRANSCRIPTION FACTOR  coord: 116..796; score: 9.8E
None       No IPR available                       PANTHER  PTHR12628:SF13    HOMEOBOX PROTEIN HAT3.1             coord: 116..796; score: 9.8E
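
The overlapping `coord` ranges above can be collapsed into a small number of domain regions. The following is a minimal sketch, not part of the InterPro analysis itself: the function name `merge_intervals` is hypothetical, ranges that overlap or touch are assumed to belong to one region, and the two PANTHER hits are left out because they span most of the protein (116..796).

```python
from typing import List, Tuple

Interval = Tuple[int, int]


def merge_intervals(intervals: List[Interval]) -> List[Interval]:
    """Merge inclusive ranges that overlap or are directly adjacent."""
    merged: List[Interval] = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps (or touches) the previous region: extend it.
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            merged.append((start, end))
    return merged


# Coordinates copied from the table, excluding the PANTHER rows (assumption).
domain_hits = [
    (698, 749), (693, 755), (691, 751), (683, 745), (687, 751), (726, 749),
    (338, 391), (329, 396), (334, 392), (339, 390), (338, 393), (336, 393),
]

regions = merge_intervals(domain_hits)
print(regions)  # [(329, 396), (683, 755)]
```

Under these assumptions the hits reduce to two regions: 329..396, covered by the PHD-type zinc finger signatures, and 683..755, covered by the homeobox signatures.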

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG14g04420  Cp4.1LG01g02480  Cucurbita pepo (Zucchini)  cpecpeB234