Cp4.1LG02g04400 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG02g04400
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionEnhancer of polycomb-like transcription factor protein
LocationCp4.1LG02 : 2379401 .. 2385829 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATTACCTTTATTTTCTATTTCATTTTTATTAATTTTCGATGTCTAACCCTTCTTCGGGAAATCAAAAAGAAACCCTAGGTCCGTCCGTTTTTCACGGTTAAAATTAAAAAAATTAGGGCAAAAATTCAAAAATTGCCAAAATTGATTGGGGAAATTGTGGAATCTATTAGGGTTCCGAATGAGATGGCCCGGATTCAAGCCCTAATTTACCTGTCGGGTTTATGATCGGCACGGTTCGTAGACCAAGGGATTTTTTTTTTTTGCGGTTCAATCAATCGAGGTAGGTATGCGATTTTTCATTGTCTTTTAAATTTGTTCATAAGTTTTGGTTTATCCGAATCCATCGGAGATTACTATGAATGCATGGATAATTGGATGTGATCGCTTTGCGAGTACGTTTGACGATGATTTTCGTTTTGTCAATTTTTTTGGCGTGGTTTCGGTGACACAGATCTGAGAAATGTTTATATGCATGTACAATGATGGAACATGAAAATTGGGGGTTTCTGGATTGGGAGTTTTCGTTTGGGGAAGTCGATGGAAAATAGTTTAGAATACTCACATGGTACTGATACACCAAAGAAATCGAGATCTTTGGATCTCAAGAGTTTATACGAATCAAAAGTGTCTAAAGAGGTTCAGAATGAGAGCTTAAAGAGAAAGGTCCGGGCGGAGAATGGTGATGAGCAGAGGAACGAGAGGAGAAATAGGAAAAAAGTGTCCCTCAGTAATTTCAGTAGTATTTATAGTAGGAGCAGGAAGAGTTTGCATGAAGTTTATGATGATGAACTAGGTTCAAGTGGGCATGATTCAAAAAAGGCCTTGAAGTCAGAGTCAAAGGAGAAGTTAAATAGCAGTAGTGAGTGTAACAAAGTTTCACTTATTTTGAATGATGATGTTATGCAAATTCCAAAGCGTAAGAGAGGTGGATTTGTTAGGAGGAAAAAAATTCTTGGTGGTCAGATTTTGAAACCATCTGGGCAATTGGATGGTAAAGCTGCTATTGTTGATCAGATTGCTAAATCTAGTGCCAAAGATCCAAGTGATCTGGTGGAGTGCTGTAAGACTAATAGAAAGCCAGGGTTTAAAAGTTTAAAGGAGAAAGAGCAAAGTGAGTTGAGCTCAACTCAGCATCCGAAGAGAGGAGATGGACATGCTGATCCATTAGTTAGAGAAAATCAATCGAGTTCCACTTTGCATTTGAAGGAGGAAGGTGAACATATTGATCATTCGGTTGTAAAACCTGTCAGTCTGTCATTCAAAAAATCACAAAAGAATTTTGGGAGAAGAAAGATTTCTGCATCTGGGAGAAAACGTAATTCAAAAGAGGGTGAGGCATCTATATCACATTCTACTAAGCGACGTGATGGCTGCCTGGAAGATGATGAAGAGAACCTTGAGGAGAATGCTGCTAGAATGTTATCATCAAGATTTGATCCAACTTGTACTGGGTTTTCTTCAAACGTGATGGGTTCTTTGCCACCAGCGAATGGGTTTGTTAGTCATGGTCTTAAGCCTCTAGCAGATTTGGAATCTGCATCAGTTGATTCGGCCGGAAGAGTACTGAGGCCTAGGAAGCAACGAAAAGAAAAGAAAAGCTCACGGAAAAGACGCCATTTTTATGAAATTTTATTAGCAGACTTGGATGCAGTTTGGATATTGAACAGGAGGATCAAAGTCTTTTGGCCTTTGGACCAAATCTGGTACTATGGGCTTGTGAATGACTATGATAAAGAAAGGAAGCTTCATCATGTCAAATATGATGATCGTGATGAAGAATGGATTGATCTTCAAAATGAAAGGTTCAAACTGTTGCTGCTTCCTAGTGAAGTTCCTGGTAGGGAAGACCGTAGGAAGTCGGTGATGGGAAATAATCCTGCTAATGTGAAAAGGGAATCAAGATCCAGGAAGGGGAAAGAAACGAATGCTCCAAAGGATGAATGCAATACTGGTAGCTTTATGGATTCTGAGCCAATCATCTCTTGGTTGGCTCGATCTACTCAACGTAATAAATCATGCCCCTCTCATAGTTCAAAGAGGCAGAAAAACTCCAGCCTATTTTTAAAATCGGGGTCCCAGGCAATTGAAAGGCCTCATTCAGGATTGCCAGAAAGATTAGGAGATATGGATGGGTTAGAGAAGTCTACTTCGGAAATCACCACTTGTTCAAAGACTTGTAAACTTCCTATTGTATATTTTAGGAAAAGGTTCCGCAACATAGGCACTGAAATGTCTCATAAGCATGAGACAAGTTATGCCTCTAGAAGAACACATTCTCTTGCTTCATTTTTTTCTAATGTTGGTGAAATCGACGATGTGGAGGAATCAGATATCTCTCCCAGAAGGTCGGAAGCACTCAGGTTGCTATGGTGTGTTGATGATGATGGGTTATTACAATTGGATATTCCAGCGATGGAAGTCGGGCAGTTGAGGTTTGAGTTAACCATCCCAGAATATTCATTCTTGAATATGACTTCAAGTGCAGAAACATTTTGGTTATTCCATTTGTCAATGCTCATCCAACATGGTGCATTGACGTTAACATGGCCAAAGGTTCAATTAGAGATGTTTTGTGTAGACAATGTGGTTGGGTTGAGATTTCTCCTGTTTGAAGGTTGCTTGATGCAGGCAGTGGCATTCATTTTCCTGGTCATGAAAATGTTTCGGCAACCCAGCAAACAGGTAAGGTATGCTGATTTTCAAGTTCCCATGACCTCTATCAGGTTCAAATTCTCGTGCCCTCCGGATATCGGAAAGCAGCTTGTGTTTGCTTTCTATAACTTTTCAGAAACAAAAAATTCCAAGTGGCTTCACTTAGACTGCAGGCTGAAGAAGTATTGCTTACTCACCAAGCAACTTCCATTGACTGAATGCACCTACGATAATATCAAAAGGTTTCAAAATTGCACAAGTCAGTTCCATACATCTCCATTTTGTGGCGAGTGTTCCTCTATCAAGGTACTTGTTTATTCTACATTGTTTTTTTTTTTTTAGGTTCTTTTATTTTTGGTTATGCTATCAACTATACTGTTGTCCAAGCATGTTACAAGGTTTCTAATATACCTTCACTTCCTCTGTATTTTCGACGCCAGATATTTCATCTCGTAGTAACTTCTTCAGTGTTTCTTCTTTAATCAGGGCACACAGAAGATTGGTAGTCTTGGTATCAACCACAAAGGAGATGCAGGCGAGAACAATGGTCATTCTAATTTGTGTTCTAATGAAACCAACAAAAAATTTCCTGCTTTTGCTCTTTCTTTTACTGCTGCACCTTCCTTTATGCTGAGTTTGCATCTCAAGCTGCTTATGGAACAGTGTGTGGCTCATTTAAGTTCGCTACATCAGGATTCAGGCAAGCGTGCAGAAAATTTTGGTAGGTTGACTGTGGACAACGTGTGTATGAATGACTGTGCCAATAATCTTAGTACCTCGTCAAAGGCATTAGGTAGATGGAACTTGTGTGCTCGGTCAGATTTAGGGACTGGTCTCTCTGACTGTGAGGAGGGAGGTTCCTCCCGGTATAAAAGAAGTCGTCTTGTTGCTGAAACGTGTGCAGGGTCTCACGATTCGGACAAGGCTAGAAATGATGTAAAGAAGCGGATGCGATCTTCGGGAAATGACAAATCAGAGAAAGCAATGGCTTTACCTAATGTGGCAAGATCTGATAATGGTTCTGATTCATTTTTGAATGACCTTAGTGTTGAGATTCCATCATTTCAGCCTGTGGATGGGGAGTTGCACAATGCTCAGCTATCTATGGATGTAGCATGGAATGTGAATTCTGGCATCATTCGCAGCCCTAACCCCACTGCACCACGAAGCACATGGCATCGAAATAAGAATAACTCATCATCATTTGGATTGGCCTCACATGGATGGTCGGACGGAAAGGATTTTCTGAACGGTTTGGGAAATAGAACTAAGAAACCACGAACTCAGGTTTCTTACATGTTACCTTTTGGAGGTTTAGATTATGGCTCAAAGAACAGAAACTCTCATCCTAAAGCAACTCCTTACAAGCGAATCAGAAGGGCTAGTGAGAAGCGGTCGGATGCAGCTAGAGGTTCCCAAAGAAACATAGAATTGCTATCATGTGATGCAAATGTGTTGATTACAACTGGTGATAGAGGATGGAGAGAATGTGGGGCAAGAGTGGTATTAGAGGTCTTTGACCATAATGAGTGGAAACTTGCTGTGAAACTTTCAGGAATTACCAAATATTCTTACAAGGCTCATCAATTTTTGCAACCCGGATCTACGAATCGGTATACACATGCTATGATGTGGAAAGGAGGAAAGGATTGGATCTTGGAGTTTCCAGATCGAAGCCAGTGGACGATATTCAAGGAGTTGCATGAGGAGTGTTACAATCGAAACATTAGATCAGCGTCCGTTAAGAATATTCCAATTCCTGGCGTTCGCTTGATAGAAGAAAATGATGAACATGTAGCAGAAACTGCATTTATGCGAAATCCTTCCAAGTACTTCCGGCAGGTAGAAACAGATGTTGAAATGGCTTTGAATCCAAATCGTGTCTTGTATGACATGGATAGTGATGACGAGCAGTGGATCAAGGATGTTCAAACTTCTTCAGAAGTTGGCTCTAGTAGCAGCTTGGGAGAGGCGTCGAGTGAGGTGTTCGAGAAAACGATGGATGCTTTTGAGAAGGCTGCATACTCTCAGCAACGTGATGAGTTCACAGATGACGAGATTGCAGAGGCAGTGAACGAAACTCTGGTGTCGGGTTTGACGAAAGGGATCTTTGAGTATTGGCAGCTGAAAAGGCGACAGAAGGGAATGCCTTTACTTCGACATCTTCAGGTAGCTTTTAGGTCTCTTCACATTCATGCATATTATGAGTTCTCTGCCTTTTCTGCACCTTCCTTGCTCGTGCTGCTGCGTCGATATAAATAAAAAAATACACGAAAACAACTATTTACAATAATCGTTTCTTTTTTCAGCCGCCTCTTTGGGAAACCTACCGACAGCAACTAAAAGATTGGGAGTCTACAGTCAACAAAAACAACACCAATTCCTGCAACGGATATCACGACTCGGCATCAATCGAGAAACCGCCCATGTTTGCTTTCTGTTTGAAACCCCGAGGCCTGGAAGTTTCCAATAAAGGCTCTAAGCAACGGTCGCATAGGAAGTTTTCAATGGCTGGTCATAGCAATTCCATCACCTATGATCAGGATGGATTGCATGGTTTTGGTAAATTCTACTGTCCACTTTTACCTGATTCATCTAGTTTTCGTTGTTTATCCCGAGCGATTTTACAACGAACTTCTAACGAACTCTCTTGACTTAGGCTTTACAATTTGAATTGTTTTGTTGTTTCAGTTAGAAGATTGAATGGGTCTGCCCTTGGGGATGATAGGATGGTCTATATAGGCCATAACTATGAGTTTTTGGAAGATTCTCCTTTGATTCACACATCATCGAGCTTATTTTCGCCACGACTCGAAGGTGGCATTTTGAGTAACGATGGGTTCGAGAGGAATGTTCTACCGAAACTTCACAAGACCAAGTCCAGAAAATATGGAGCTTCTCCATATGAACCTATGATGGCATCTTCTTTCAATCAGAGAATGGTTGGAAAGAGAGATGGGTTAAATAGGTGGAACAATGGTTATTCCGAGTGGTCAAGCCCGCTACGTTATCGGTTCGATGGATCACAAAGGCAGATTCTCGAACAGTTGGAAGGTTCCGACCTCGACGAGTACAGGTTCCGTGATGTATCTGGTGCTGCTCAGGAAGCACGCAACGTCGCCAAGTTTAAGAGAGAAAAGGCCAGACGATTACTAATCAGAGCAGACCTCGCAATTCATAAGGCAGTGGTTGCTATCATGACTGCTGAAGCAATGAAAGCTGCATCCGAGGACGACCCTAACGGCGATGGATAGAAAAGTGAAGAAAACCCGAATCCCGACTACCCCAACGTCTCCCATTGCTCAAGCTCACCGATATATCCACCCACATGGCAAGCAGATTAGGTTCGACGACTTCGTTAGCAATGGGTACTGCCCTTCCCCCTCGATATTGTGCAGATTCAATATCGAACGTCTGGCACGACAAACTCGAGCGATCAGTGACGAGAATTCGTTGCTACAGCCTCACCGTCCGACAAGGAAGGCTTCGTGTAAGCCAGCATGATCGTTAGTGATCAGAATTTGGGGGTATGACATTATTTGTACAGTTTTTAACCCTCAGCTTCTAAAGTGTTCCATCCATGAAATGAGAATCTTAGATTAAAGAATACATGATGTGTTCAATGCCATTGCATATTTCTGTGTATATAGAAACTCTACTGTTTATGCCACAATTTTTGAGGTCTACACAAACAATTGTTGATTGTAAATTATTGATCATATCTTGAATTGCAACTTGGTTTATAGTTATGTAATATATTGATTTTTTTTAAGTAAT

mRNA sequence

ATTACCTTTATTTTCTATTTCATTTTTATTAATTTTCGATGTCTAACCCTTCTTCGGGAAATCAAAAAGAAACCCTAGGTCCGTCCGTTTTTCACGGTTAAAATTAAAAAAATTAGGGCAAAAATTCAAAAATTGCCAAAATTGATTGGGGAAATTGTGGAATCTATTAGGGTTCCGAATGAGATGGCCCGGATTCAAGCCCTAATTTACCTGTCGGGTTTATGATCGGCACGGTTCGTAGACCAAGGGATTTTTTTTTTTTGCGGTTCAATCAATCGAGGTAGATCTGAGAAATGTTTATATGCATGTACAATGATGGAACATGAAAATTGGGGGTTTCTGGATTGGGAGTTTTCGTTTGGGGAAGTCGATGGAAAATAGTTTAGAATACTCACATGGTACTGATACACCAAAGAAATCGAGATCTTTGGATCTCAAGAGTTTATACGAATCAAAAGTGTCTAAAGAGGTTCAGAATGAGAGCTTAAAGAGAAAGGTCCGGGCGGAGAATGGTGATGAGCAGAGGAACGAGAGGAGAAATAGGAAAAAAGTGTCCCTCAGTAATTTCAGTAGTATTTATAGTAGGAGCAGGAAGAGTTTGCATGAAGTTTATGATGATGAACTAGGTTCAAGTGGGCATGATTCAAAAAAGGCCTTGAAGTCAGAGTCAAAGGAGAAGTTAAATAGCAGTAGTGAGTGTAACAAAGTTTCACTTATTTTGAATGATGATGTTATGCAAATTCCAAAGCGTAAGAGAGGTGGATTTGTTAGGAGGAAAAAAATTCTTGGTGGTCAGATTTTGAAACCATCTGGGCAATTGGATGGTAAAGCTGCTATTGTTGATCAGATTGCTAAATCTAGTGCCAAAGATCCAAGTGATCTGGTGGAGTGCTGTAAGACTAATAGAAAGCCAGGGTTTAAAAGTTTAAAGGAGAAAGAGCAAAGTGAGTTGAGCTCAACTCAGCATCCGAAGAGAGGAGATGGACATGCTGATCCATTAGTTAGAGAAAATCAATCGAGTTCCACTTTGCATTTGAAGGAGGAAGGTGAACATATTGATCATTCGGTTGTAAAACCTGTCAGTCTGTCATTCAAAAAATCACAAAAGAATTTTGGGAGAAGAAAGATTTCTGCATCTGGGAGAAAACGTAATTCAAAAGAGGGTGAGGCATCTATATCACATTCTACTAAGCGACGTGATGGCTGCCTGGAAGATGATGAAGAGAACCTTGAGGAGAATGCTGCTAGAATGTTATCATCAAGATTTGATCCAACTTGTACTGGGTTTTCTTCAAACGTGATGGGTTCTTTGCCACCAGCGAATGGGTTTGTTAGTCATGGTCTTAAGCCTCTAGCAGATTTGGAATCTGCATCAGTTGATTCGGCCGGAAGAGTACTGAGGCCTAGGAAGCAACGAAAAGAAAAGAAAAGCTCACGGAAAAGACGCCATTTTTATGAAATTTTATTAGCAGACTTGGATGCAGTTTGGATATTGAACAGGAGGATCAAAGTCTTTTGGCCTTTGGACCAAATCTGGTACTATGGGCTTGTGAATGACTATGATAAAGAAAGGAAGCTTCATCATGTCAAATATGATGATCGTGATGAAGAATGGATTGATCTTCAAAATGAAAGGTTCAAACTGTTGCTGCTTCCTAGTGAAGTTCCTGGTAGGGAAGACCGTAGGAAGTCGGTGATGGGAAATAATCCTGCTAATGTGAAAAGGGAATCAAGATCCAGGAAGGGGAAAGAAACGAATGCTCCAAAGGATGAATGCAATACTGGTAGCTTTATGGATTCTGAGCCAATCATCTCTTGGTTGGCTCGATCTACTCAACGTAATAAATCATGCCCCTCTCATAGTTCAAAGAGGCAGAAAAACTCCAGCCTATTTTTAAAATCGGGGTCCCAGGCAATTGAAAGGCCTCATTCAGGATTGCCAGAAAGATTAGGAGATATGGATGGGTTAGAGAAGTCTACTTCGGAAATCACCACTTGTTCAAAGACTTGTAAACTTCCTATTGTATATTTTAGGAAAAGGTTCCGCAACATAGGCACTGAAATGTCTCATAAGCATGAGACAAGTTATGCCTCTAGAAGAACACATTCTCTTGCTTCATTTTTTTCTAATGTTGGTGAAATCGACGATGTGGAGGAATCAGATATCTCTCCCAGAAGGTCGGAAGCACTCAGGTTGCTATGGTGTGTTGATGATGATGGGTTATTACAATTGGATATTCCAGCGATGGAAGTCGGGCAGTTGAGGTTTGAGTTAACCATCCCAGAATATTCATTCTTGAATATGACTTCAAGTGCAGAAACATTTTGGTTATTCCATTTGTCAATGCTCATCCAACATGGTGCATTGACGTTAACATGGCCAAAGGTTCAATTAGAGATGTTTTGTGTAGACAATGTGGTTGGGTTGAGATTTCTCCTGTTTGAAGGTTGCTTGATGCAGGCAGTGGCATTCATTTTCCTGGTCATGAAAATGTTTCGGCAACCCAGCAAACAGGTAAGGTATGCTGATTTTCAAGTTCCCATGACCTCTATCAGGTTCAAATTCTCGTGCCCTCCGGATATCGGAAAGCAGCTTGTGTTTGCTTTCTATAACTTTTCAGAAACAAAAAATTCCAAGTGGCTTCACTTAGACTGCAGGCTGAAGAAGTATTGCTTACTCACCAAGCAACTTCCATTGACTGAATGCACCTACGATAATATCAAAAGGTTTCAAAATTGCACAAGTCAGTTCCATACATCTCCATTTTGTGGCGAGTGTTCCTCTATCAAGGGCACACAGAAGATTGGTAGTCTTGGTATCAACCACAAAGGAGATGCAGGCGAGAACAATGGTCATTCTAATTTGTGTTCTAATGAAACCAACAAAAAATTTCCTGCTTTTGCTCTTTCTTTTACTGCTGCACCTTCCTTTATGCTGAGTTTGCATCTCAAGCTGCTTATGGAACAGTGTGTGGCTCATTTAAGTTCGCTACATCAGGATTCAGGCAAGCGTGCAGAAAATTTTGGTAGGTTGACTGTGGACAACGTGTGTATGAATGACTGTGCCAATAATCTTAGTACCTCGTCAAAGGCATTAGGTAGATGGAACTTGTGTGCTCGGTCAGATTTAGGGACTGGTCTCTCTGACTGTGAGGAGGGAGGTTCCTCCCGGTATAAAAGAAGTCGTCTTGTTGCTGAAACGTGTGCAGGGTCTCACGATTCGGACAAGGCTAGAAATGATGTAAAGAAGCGGATGCGATCTTCGGGAAATGACAAATCAGAGAAAGCAATGGCTTTACCTAATGTGGCAAGATCTGATAATGGTTCTGATTCATTTTTGAATGACCTTAGTGTTGAGATTCCATCATTTCAGCCTGTGGATGGGGAGTTGCACAATGCTCAGCTATCTATGGATGTAGCATGGAATGTGAATTCTGGCATCATTCGCAGCCCTAACCCCACTGCACCACGAAGCACATGGCATCGAAATAAGAATAACTCATCATCATTTGGATTGGCCTCACATGGATGGTCGGACGGAAAGGATTTTCTGAACGGTTTGGGAAATAGAACTAAGAAACCACGAACTCAGGTTTCTTACATGTTACCTTTTGGAGGTTTAGATTATGGCTCAAAGAACAGAAACTCTCATCCTAAAGCAACTCCTTACAAGCGAATCAGAAGGGCTAGTGAGAAGCGGTCGGATGCAGCTAGAGGTTCCCAAAGAAACATAGAATTGCTATCATGTGATGCAAATGTGTTGATTACAACTGGTGATAGAGGATGGAGAGAATGTGGGGCAAGAGTGGTATTAGAGGTCTTTGACCATAATGAGTGGAAACTTGCTGTGAAACTTTCAGGAATTACCAAATATTCTTACAAGGCTCATCAATTTTTGCAACCCGGATCTACGAATCGGTATACACATGCTATGATGTGGAAAGGAGGAAAGGATTGGATCTTGGAGTTTCCAGATCGAAGCCAGTGGACGATATTCAAGGAGTTGCATGAGGAGTGTTACAATCGAAACATTAGATCAGCGTCCGTTAAGAATATTCCAATTCCTGGCGTTCGCTTGATAGAAGAAAATGATGAACATGTAGCAGAAACTGCATTTATGCGAAATCCTTCCAAGTACTTCCGGCAGGTAGAAACAGATGTTGAAATGGCTTTGAATCCAAATCGTGTCTTGTATGACATGGATAGTGATGACGAGCAGTGGATCAAGGATGTTCAAACTTCTTCAGAAGTTGGCTCTAGTAGCAGCTTGGGAGAGGCGTCGAGTGAGGTGTTCGAGAAAACGATGGATGCTTTTGAGAAGGCTGCATACTCTCAGCAACGTGATGAGTTCACAGATGACGAGATTGCAGAGGCAGTGAACGAAACTCTGGTGTCGGGTTTGACGAAAGGGATCTTTGAGTATTGGCAGCTGAAAAGGCGACAGAAGGGAATGCCTTTACTTCGACATCTTCAGCCGCCTCTTTGGGAAACCTACCGACAGCAACTAAAAGATTGGGAGTCTACAGTCAACAAAAACAACACCAATTCCTGCAACGGATATCACGACTCGGCATCAATCGAGAAACCGCCCATGTTTGCTTTCTGTTTGAAACCCCGAGGCCTGGAAGTTTCCAATAAAGGCTCTAAGCAACGGTCGCATAGGAAGTTTTCAATGGCTGGTCATAGCAATTCCATCACCTATGATCAGGATGGATTGCATGGTTTTGTTAGAAGATTGAATGGGTCTGCCCTTGGGGATGATAGGATGGTCTATATAGGCCATAACTATGAGTTTTTGGAAGATTCTCCTTTGATTCACACATCATCGAGCTTATTTTCGCCACGACTCGAAGGTGGCATTTTGAGTAACGATGGGTTCGAGAGGAATGTTCTACCGAAACTTCACAAGACCAAGTCCAGAAAATATGGAGCTTCTCCATATGAACCTATGATGGCATCTTCTTTCAATCAGAGAATGGTTGGAAAGAGAGATGGGTTAAATAGGTGGAACAATGGTTATTCCGAGTGGTCAAGCCCGCTACGTTATCGGTTCGATGGATCACAAAGGCAGATTCTCGAACAGTTGGAAGGTTCCGACCTCGACGAGTACAGGTTCCGTGATGTATCTGGTGCTGCTCAGGAAGCACGCAACGTCGCCAAGTTTAAGAGAGAAAAGGCCAGACGATTACTAATCAGAGCAGACCTCGCAATTCATAAGGCAGTGGTTGCTATCATGACTGCTGAAGCAATGAAAGCTGCATCCGAGGACGACCCTAACGGCGATGGATAGAAAAGTGAAGAAAACCCGAATCCCGACTACCCCAACGTCTCCCATTGCTCAAGCTCACCGATATATCCACCCACATGGCAAGCAGATTAGGTTCGACGACTTCGTTAGCAATGGGTACTGCCCTTCCCCCTCGATATTGTGCAGATTCAATATCGAACGTCTGGCACGACAAACTCGAGCGATCAGTGACGAGAATTCGTTGCTACAGCCTCACCGTCCGACAAGGAAGGCTTCGTGTAAGCCAGCATGATCGTTAGTGATCAGAATTTGGGGGTATGACATTATTTGTACAGTTTTTAACCCTCAGCTTCTAAAGTGTTCCATCCATGAAATGAGAATCTTAGATTAAAGAATACATGATGTGTTCAATGCCATTGCATATTTCTGTGTATATAGAAACTCTACTGTTTATGCCACAATTTTTGAGGTCTACACAAACAATTGTTGATTGTAAATTATTGATCATATCTTGAATTGCAACTTGGTTTATAGTTATGTAATATATTGATTTTTTTTAAGTAAT

Coding sequence (CDS)

ATGAAAATTGGGGGTTTCTGGATTGGGAGTTTTCGTTTGGGGAAGTCGATGGAAAATAGTTTAGAATACTCACATGGTACTGATACACCAAAGAAATCGAGATCTTTGGATCTCAAGAGTTTATACGAATCAAAAGTGTCTAAAGAGGTTCAGAATGAGAGCTTAAAGAGAAAGGTCCGGGCGGAGAATGGTGATGAGCAGAGGAACGAGAGGAGAAATAGGAAAAAAGTGTCCCTCAGTAATTTCAGTAGTATTTATAGTAGGAGCAGGAAGAGTTTGCATGAAGTTTATGATGATGAACTAGGTTCAAGTGGGCATGATTCAAAAAAGGCCTTGAAGTCAGAGTCAAAGGAGAAGTTAAATAGCAGTAGTGAGTGTAACAAAGTTTCACTTATTTTGAATGATGATGTTATGCAAATTCCAAAGCGTAAGAGAGGTGGATTTGTTAGGAGGAAAAAAATTCTTGGTGGTCAGATTTTGAAACCATCTGGGCAATTGGATGGTAAAGCTGCTATTGTTGATCAGATTGCTAAATCTAGTGCCAAAGATCCAAGTGATCTGGTGGAGTGCTGTAAGACTAATAGAAAGCCAGGGTTTAAAAGTTTAAAGGAGAAAGAGCAAAGTGAGTTGAGCTCAACTCAGCATCCGAAGAGAGGAGATGGACATGCTGATCCATTAGTTAGAGAAAATCAATCGAGTTCCACTTTGCATTTGAAGGAGGAAGGTGAACATATTGATCATTCGGTTGTAAAACCTGTCAGTCTGTCATTCAAAAAATCACAAAAGAATTTTGGGAGAAGAAAGATTTCTGCATCTGGGAGAAAACGTAATTCAAAAGAGGGTGAGGCATCTATATCACATTCTACTAAGCGACGTGATGGCTGCCTGGAAGATGATGAAGAGAACCTTGAGGAGAATGCTGCTAGAATGTTATCATCAAGATTTGATCCAACTTGTACTGGGTTTTCTTCAAACGTGATGGGTTCTTTGCCACCAGCGAATGGGTTTGTTAGTCATGGTCTTAAGCCTCTAGCAGATTTGGAATCTGCATCAGTTGATTCGGCCGGAAGAGTACTGAGGCCTAGGAAGCAACGAAAAGAAAAGAAAAGCTCACGGAAAAGACGCCATTTTTATGAAATTTTATTAGCAGACTTGGATGCAGTTTGGATATTGAACAGGAGGATCAAAGTCTTTTGGCCTTTGGACCAAATCTGGTACTATGGGCTTGTGAATGACTATGATAAAGAAAGGAAGCTTCATCATGTCAAATATGATGATCGTGATGAAGAATGGATTGATCTTCAAAATGAAAGGTTCAAACTGTTGCTGCTTCCTAGTGAAGTTCCTGGTAGGGAAGACCGTAGGAAGTCGGTGATGGGAAATAATCCTGCTAATGTGAAAAGGGAATCAAGATCCAGGAAGGGGAAAGAAACGAATGCTCCAAAGGATGAATGCAATACTGGTAGCTTTATGGATTCTGAGCCAATCATCTCTTGGTTGGCTCGATCTACTCAACGTAATAAATCATGCCCCTCTCATAGTTCAAAGAGGCAGAAAAACTCCAGCCTATTTTTAAAATCGGGGTCCCAGGCAATTGAAAGGCCTCATTCAGGATTGCCAGAAAGATTAGGAGATATGGATGGGTTAGAGAAGTCTACTTCGGAAATCACCACTTGTTCAAAGACTTGTAAACTTCCTATTGTATATTTTAGGAAAAGGTTCCGCAACATAGGCACTGAAATGTCTCATAAGCATGAGACAAGTTATGCCTCTAGAAGAACACATTCTCTTGCTTCATTTTTTTCTAATGTTGGTGAAATCGACGATGTGGAGGAATCAGATATCTCTCCCAGAAGGTCGGAAGCACTCAGGTTGCTATGGTGTGTTGATGATGATGGGTTATTACAATTGGATATTCCAGCGATGGAAGTCGGGCAGTTGAGGTTTGAGTTAACCATCCCAGAATATTCATTCTTGAATATGACTTCAAGTGCAGAAACATTTTGGTTATTCCATTTGTCAATGCTCATCCAACATGGTGCATTGACGTTAACATGGCCAAAGGTTCAATTAGAGATGTTTTGTGTAGACAATGTGGTTGGGTTGAGATTTCTCCTGTTTGAAGGTTGCTTGATGCAGGCAGTGGCATTCATTTTCCTGGTCATGAAAATGTTTCGGCAACCCAGCAAACAGGTAAGGTATGCTGATTTTCAAGTTCCCATGACCTCTATCAGGTTCAAATTCTCGTGCCCTCCGGATATCGGAAAGCAGCTTGTGTTTGCTTTCTATAACTTTTCAGAAACAAAAAATTCCAAGTGGCTTCACTTAGACTGCAGGCTGAAGAAGTATTGCTTACTCACCAAGCAACTTCCATTGACTGAATGCACCTACGATAATATCAAAAGGTTTCAAAATTGCACAAGTCAGTTCCATACATCTCCATTTTGTGGCGAGTGTTCCTCTATCAAGGGCACACAGAAGATTGGTAGTCTTGGTATCAACCACAAAGGAGATGCAGGCGAGAACAATGGTCATTCTAATTTGTGTTCTAATGAAACCAACAAAAAATTTCCTGCTTTTGCTCTTTCTTTTACTGCTGCACCTTCCTTTATGCTGAGTTTGCATCTCAAGCTGCTTATGGAACAGTGTGTGGCTCATTTAAGTTCGCTACATCAGGATTCAGGCAAGCGTGCAGAAAATTTTGGTAGGTTGACTGTGGACAACGTGTGTATGAATGACTGTGCCAATAATCTTAGTACCTCGTCAAAGGCATTAGGTAGATGGAACTTGTGTGCTCGGTCAGATTTAGGGACTGGTCTCTCTGACTGTGAGGAGGGAGGTTCCTCCCGGTATAAAAGAAGTCGTCTTGTTGCTGAAACGTGTGCAGGGTCTCACGATTCGGACAAGGCTAGAAATGATGTAAAGAAGCGGATGCGATCTTCGGGAAATGACAAATCAGAGAAAGCAATGGCTTTACCTAATGTGGCAAGATCTGATAATGGTTCTGATTCATTTTTGAATGACCTTAGTGTTGAGATTCCATCATTTCAGCCTGTGGATGGGGAGTTGCACAATGCTCAGCTATCTATGGATGTAGCATGGAATGTGAATTCTGGCATCATTCGCAGCCCTAACCCCACTGCACCACGAAGCACATGGCATCGAAATAAGAATAACTCATCATCATTTGGATTGGCCTCACATGGATGGTCGGACGGAAAGGATTTTCTGAACGGTTTGGGAAATAGAACTAAGAAACCACGAACTCAGGTTTCTTACATGTTACCTTTTGGAGGTTTAGATTATGGCTCAAAGAACAGAAACTCTCATCCTAAAGCAACTCCTTACAAGCGAATCAGAAGGGCTAGTGAGAAGCGGTCGGATGCAGCTAGAGGTTCCCAAAGAAACATAGAATTGCTATCATGTGATGCAAATGTGTTGATTACAACTGGTGATAGAGGATGGAGAGAATGTGGGGCAAGAGTGGTATTAGAGGTCTTTGACCATAATGAGTGGAAACTTGCTGTGAAACTTTCAGGAATTACCAAATATTCTTACAAGGCTCATCAATTTTTGCAACCCGGATCTACGAATCGGTATACACATGCTATGATGTGGAAAGGAGGAAAGGATTGGATCTTGGAGTTTCCAGATCGAAGCCAGTGGACGATATTCAAGGAGTTGCATGAGGAGTGTTACAATCGAAACATTAGATCAGCGTCCGTTAAGAATATTCCAATTCCTGGCGTTCGCTTGATAGAAGAAAATGATGAACATGTAGCAGAAACTGCATTTATGCGAAATCCTTCCAAGTACTTCCGGCAGGTAGAAACAGATGTTGAAATGGCTTTGAATCCAAATCGTGTCTTGTATGACATGGATAGTGATGACGAGCAGTGGATCAAGGATGTTCAAACTTCTTCAGAAGTTGGCTCTAGTAGCAGCTTGGGAGAGGCGTCGAGTGAGGTGTTCGAGAAAACGATGGATGCTTTTGAGAAGGCTGCATACTCTCAGCAACGTGATGAGTTCACAGATGACGAGATTGCAGAGGCAGTGAACGAAACTCTGGTGTCGGGTTTGACGAAAGGGATCTTTGAGTATTGGCAGCTGAAAAGGCGACAGAAGGGAATGCCTTTACTTCGACATCTTCAGCCGCCTCTTTGGGAAACCTACCGACAGCAACTAAAAGATTGGGAGTCTACAGTCAACAAAAACAACACCAATTCCTGCAACGGATATCACGACTCGGCATCAATCGAGAAACCGCCCATGTTTGCTTTCTGTTTGAAACCCCGAGGCCTGGAAGTTTCCAATAAAGGCTCTAAGCAACGGTCGCATAGGAAGTTTTCAATGGCTGGTCATAGCAATTCCATCACCTATGATCAGGATGGATTGCATGGTTTTGTTAGAAGATTGAATGGGTCTGCCCTTGGGGATGATAGGATGGTCTATATAGGCCATAACTATGAGTTTTTGGAAGATTCTCCTTTGATTCACACATCATCGAGCTTATTTTCGCCACGACTCGAAGGTGGCATTTTGAGTAACGATGGGTTCGAGAGGAATGTTCTACCGAAACTTCACAAGACCAAGTCCAGAAAATATGGAGCTTCTCCATATGAACCTATGATGGCATCTTCTTTCAATCAGAGAATGGTTGGAAAGAGAGATGGGTTAAATAGGTGGAACAATGGTTATTCCGAGTGGTCAAGCCCGCTACGTTATCGGTTCGATGGATCACAAAGGCAGATTCTCGAACAGTTGGAAGGTTCCGACCTCGACGAGTACAGGTTCCGTGATGTATCTGGTGCTGCTCAGGAAGCACGCAACGTCGCCAAGTTTAAGAGAGAAAAGGCCAGACGATTACTAATCAGAGCAGACCTCGCAATTCATAAGGCAGTGGTTGCTATCATGACTGCTGAAGCAATGAAAGCTGCATCCGAGGACGACCCTAACGGCGATGGATAG

Protein sequence

MKIGGFWIGSFRLGKSMENSLEYSHGTDTPKKSRSLDLKSLYESKVSKEVQNESLKRKVRAENGDEQRNERRNRKKVSLSNFSSIYSRSRKSLHEVYDDELGSSGHDSKKALKSESKEKLNSSSECNKVSLILNDDVMQIPKRKRGGFVRRKKILGGQILKPSGQLDGKAAIVDQIAKSSAKDPSDLVECCKTNRKPGFKSLKEKEQSELSSTQHPKRGDGHADPLVRENQSSSTLHLKEEGEHIDHSVVKPVSLSFKKSQKNFGRRKISASGRKRNSKEGEASISHSTKRRDGCLEDDEENLEENAARMLSSRFDPTCTGFSSNVMGSLPPANGFVSHGLKPLADLESASVDSAGRVLRPRKQRKEKKSSRKRRHFYEILLADLDAVWILNRRIKVFWPLDQIWYYGLVNDYDKERKLHHVKYDDRDEEWIDLQNERFKLLLLPSEVPGREDRRKSVMGNNPANVKRESRSRKGKETNAPKDECNTGSFMDSEPIISWLARSTQRNKSCPSHSSKRQKNSSLFLKSGSQAIERPHSGLPERLGDMDGLEKSTSEITTCSKTCKLPIVYFRKRFRNIGTEMSHKHETSYASRRTHSLASFFSNVGEIDDVEESDISPRRSEALRLLWCVDDDGLLQLDIPAMEVGQLRFELTIPEYSFLNMTSSAETFWLFHLSMLIQHGALTLTWPKVQLEMFCVDNVVGLRFLLFEGCLMQAVAFIFLVMKMFRQPSKQVRYADFQVPMTSIRFKFSCPPDIGKQLVFAFYNFSETKNSKWLHLDCRLKKYCLLTKQLPLTECTYDNIKRFQNCTSQFHTSPFCGECSSIKGTQKIGSLGINHKGDAGENNGHSNLCSNETNKKFPAFALSFTAAPSFMLSLHLKLLMEQCVAHLSSLHQDSGKRAENFGRLTVDNVCMNDCANNLSTSSKALGRWNLCARSDLGTGLSDCEEGGSSRYKRSRLVAETCAGSHDSDKARNDVKKRMRSSGNDKSEKAMALPNVARSDNGSDSFLNDLSVEIPSFQPVDGELHNAQLSMDVAWNVNSGIIRSPNPTAPRSTWHRNKNNSSSFGLASHGWSDGKDFLNGLGNRTKKPRTQVSYMLPFGGLDYGSKNRNSHPKATPYKRIRRASEKRSDAARGSQRNIELLSCDANVLITTGDRGWRECGARVVLEVFDHNEWKLAVKLSGITKYSYKAHQFLQPGSTNRYTHAMMWKGGKDWILEFPDRSQWTIFKELHEECYNRNIRSASVKNIPIPGVRLIEENDEHVAETAFMRNPSKYFRQVETDVEMALNPNRVLYDMDSDDEQWIKDVQTSSEVGSSSSLGEASSEVFEKTMDAFEKAAYSQQRDEFTDDEIAEAVNETLVSGLTKGIFEYWQLKRRQKGMPLLRHLQPPLWETYRQQLKDWESTVNKNNTNSCNGYHDSASIEKPPMFAFCLKPRGLEVSNKGSKQRSHRKFSMAGHSNSITYDQDGLHGFVRRLNGSALGDDRMVYIGHNYEFLEDSPLIHTSSSLFSPRLEGGILSNDGFERNVLPKLHKTKSRKYGASPYEPMMASSFNQRMVGKRDGLNRWNNGYSEWSSPLRYRFDGSQRQILEQLEGSDLDEYRFRDVSGAAQEARNVAKFKREKARRLLIRADLAIHKAVVAIMTAEAMKAASEDDPNGDG
BLAST of Cp4.1LG02g04400 vs. TrEMBL
Match: A0A0A0LJD1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G879490 PE=4 SV=1)

HSP 1 Score: 2591.6 bits (6716), Expect = 0.0e+00
Identity = 1355/1688 (80.27%), Postives = 1454/1688 (86.14%), Query Frame = 1

Query: 1    MKIGGFWIGSFRLGKSMENSLEYSHGTDTPKKSRSLDLKSLYESKVSKEVQNESLKRKVR 60
            MKIGGFWIGSFRLGKSMENSLE SHGTD PKKSRSLDLKSLYESKVSKEVQN+ LKRK R
Sbjct: 1    MKIGGFWIGSFRLGKSMENSLENSHGTDIPKKSRSLDLKSLYESKVSKEVQNKRLKRKGR 60

Query: 61   AENGDEQRNERRNRKKVSLSNFSSIYSRSRKSLHEVYDDELGSSGHDSKKALKSESKEKL 120
            AE+GD Q+NERRNRKKVSLSNFSSIYSRSRKSL EVYD  LGSSGHDSKKALKSESK+KL
Sbjct: 61   AEDGDVQKNERRNRKKVSLSNFSSIYSRSRKSLDEVYDAGLGSSGHDSKKALKSESKDKL 120

Query: 121  NSSSECNKVSLILNDDVMQIPKRKRGGFVRRKKILGGQILKPSGQLDGKAA-------IV 180
            NSSSE N+V LIL+++VM IPKRKRGGFVRRKK   GQILKPSGQLD KA         V
Sbjct: 121  NSSSEFNEVPLILDENVMHIPKRKRGGFVRRKKSHDGQILKPSGQLDAKAGSLDDKAGTV 180

Query: 181  DQIAKSSAKDPSDLVECCKTNRKPGFKSLKEKEQSELSSTQHPKRGDGHADPLVRENQSS 240
            DQIAKSS KD SD VECCKTNRK  FK LKEKE  EL    H K+ DG AD L REN+ +
Sbjct: 181  DQIAKSSVKDSSDQVECCKTNRKLAFKDLKEKEPKELRL--HLKKEDGQADQLTRENELN 240

Query: 241  STLHLKEEGEHIDHSVVKPVSLSFKKSQKNFGRRKISASGRKRNSKEGEASISHSTKRRD 300
                LKEEGEHIDHSVVKPVS S KKS+KN  +RKISASG K NSKEGEASIS STKRRD
Sbjct: 241  PASRLKEEGEHIDHSVVKPVSPSSKKSKKNVRKRKISASGSKSNSKEGEASISQSTKRRD 300

Query: 301  GCLEDDEENLEENAARMLSSRFDPTCTGFSSNVMGSLPPANGF----------VSHGLKP 360
            G  EDDEENLEENAARMLSSRFDP CTGFSSN  GSLPP NG           VS GLKP
Sbjct: 301  GFPEDDEENLEENAARMLSSRFDPNCTGFSSNTKGSLPPTNGLSFLLSSGHDNVSRGLKP 360

Query: 361  LADLESASVDSAGRVLRPRKQRKEKKSSRKRRHFYEILLADLDAVWILNRRIKVFWPLDQ 420
               LESASVD+AGRVLRPRKQRKEKKSSRKRRHFY+IL  D+DA W+LNRRIKVFWPLDQ
Sbjct: 361  --GLESASVDAAGRVLRPRKQRKEKKSSRKRRHFYDILFGDIDAAWVLNRRIKVFWPLDQ 420

Query: 421  IWYYGLVNDYDKERKLHHVKYDDRDEEWIDLQNERFKLLLLPSEVPGREDRRKSVMGNNP 480
            IWYYGLVNDYDKERKLHHVKYDDRDEEWIDLQNERFKLLLLPSEVPGRE+RRKS +GN+P
Sbjct: 421  IWYYGLVNDYDKERKLHHVKYDDRDEEWIDLQNERFKLLLLPSEVPGREERRKSAVGNDP 480

Query: 481  ANVKRESRSRKGKETNAP--KDECNTGSFMDSEPIISWLARSTQRNKSCPSHSSKRQKNS 540
            AN K  S SRKGKET+A   +D+CN GS+MDSEPIISWLARST RNKS PSH+SKRQK S
Sbjct: 481  ANEKGRSGSRKGKETDAVILEDDCNIGSYMDSEPIISWLARSTHRNKSSPSHNSKRQKTS 540

Query: 541  SLFLKSGSQAIERP------HSGLPERLGDMDGLEKSTSEITTCSKTCKLPIVYFRKRFR 600
            SL  KSGSQA E+P       SG+PERL D+DG EKS SE TTCS T KLPIVYFRKRFR
Sbjct: 541  SLSSKSGSQANEKPANLLVKSSGMPERLADVDGPEKSASETTTCSTTRKLPIVYFRKRFR 600

Query: 601  NIGTEMSHKHETSYASRRTHSLASF-FSNVGEIDDVEESDISPRRSEALRLLWCVDDDGL 660
            NIGTEM HK ET +ASRR+H+  SF FSN   IDDVEE DISPRRSEA RLLWCVDD GL
Sbjct: 601  NIGTEMPHKRETDFASRRSHASLSFSFSN---IDDVEEPDISPRRSEAHRLLWCVDDAGL 660

Query: 661  LQLDIPAMEVGQLRFELTIPEYSFLNMTSSAETFWLFHLSMLIQHGALTLTWPKVQLEMF 720
            LQL IP MEVGQ RFEL IP+YSFLN+TSSA+TFWLFHL+MLIQHG LTL WPKVQLEM 
Sbjct: 661  LQLAIPLMEVGQFRFELNIPQYSFLNVTSSADTFWLFHLAMLIQHGTLTLLWPKVQLEML 720

Query: 721  CVDNVVGLRFLLFEGCLMQAVAFIFLVMKMFRQPSKQVRYADFQVPMTSIRFKFSCPPDI 780
             VDNVVGLRFLLFEGCLMQAVAFIFLV+KMF+ P KQ RYADFQ P+TSIRFKFSC  DI
Sbjct: 721  FVDNVVGLRFLLFEGCLMQAVAFIFLVLKMFQSPGKQGRYADFQFPVTSIRFKFSCLQDI 780

Query: 781  GKQLVFAFYNFSETKNSKWLHLDCRLKKYCLLTKQLPLTECTYDNIKRFQNCTSQFHTSP 840
            GKQLVFAF+NFSE K SKW+HLD RLKKYCL++KQLPLTECTYDNIK+ QN  +QF  SP
Sbjct: 781  GKQLVFAFHNFSEIKYSKWVHLD-RLKKYCLISKQLPLTECTYDNIKKLQNSKTQFRASP 840

Query: 841  FCGECSSIKGTQKIGSLGINHKGDAGENNGHSNLCSNETNKKFPAFALSFTAAPSFMLSL 900
            FCG  SS+KGTQKI SLGIN KG A  N+GHSNLCSNET + FPAFALSFTAAP+F LSL
Sbjct: 841  FCGRSSSVKGTQKISSLGINLKGAACVNSGHSNLCSNETKRNFPAFALSFTAAPTFFLSL 900

Query: 901  HLKLLMEQCVAHLSSLHQDSGKRAENFGRLTVDNVCMNDCANNLSTSSKALGRWNLCARS 960
            HLKLLME+CVAHLS  H DS +  EN+GRLTVD+V  +DCAN+LSTSSKA  RWN C +S
Sbjct: 901  HLKLLMERCVAHLSLQHHDSIEHPENYGRLTVDDVLTDDCANSLSTSSKASDRWNSCPQS 960

Query: 961  DLGTGLSDCEEGG---SSRYKRSRLVAETCAGSHDSDKARNDVKKRMRSSGNDKSEKAMA 1020
            DLGTGLSDCE+G    SS+YK S  VA TCAGS D+DKARN +K+R+R  G +KS K  A
Sbjct: 961  DLGTGLSDCEDGDGVQSSQYK-STPVATTCAGSQDTDKARNGIKRRIRPLGKNKSGKTTA 1020

Query: 1021 LPNVARSDNGSDSFLNDLSVEIPSFQPVDGELHNAQLSMDVAWNVNSGIIRSPNPTAPRS 1080
            LPNVARSDN  +SFLNDLSVEIPSFQPVDGELH  Q SMDV WN ++ +I SPNPTAPRS
Sbjct: 1021 LPNVARSDN--NSFLNDLSVEIPSFQPVDGELHGPQQSMDVGWNASAVVIPSPNPTAPRS 1080

Query: 1081 TWHRNKNNSSSFGLASHGWSDGKDFL-NGLGNRTKKPRTQVSYMLPFGGLDYGSKNRNSH 1140
            TWHRNKNNS+S GLASHGWSDG   L NGLGNRTKKPRTQVSY LPFGG DY SK+RNSH
Sbjct: 1081 TWHRNKNNSTSLGLASHGWSDGNSLLINGLGNRTKKPRTQVSYSLPFGGFDYSSKSRNSH 1140

Query: 1141 PKATPYKRIRRASEKRSDAARGSQRNIELLSCDANVLITTGDRGWRECGARVVLEVFDHN 1200
            PKA+PYKRIRRASEKRSD ARGS+RN+ELLSCDANVLIT GDRGWRECGA+VVLEVFDHN
Sbjct: 1141 PKASPYKRIRRASEKRSDVARGSKRNLELLSCDANVLITLGDRGWRECGAKVVLEVFDHN 1200

Query: 1201 EWKLAVKLSGITKYSYKAHQFLQPGSTNRYTHAMMWKGGKDWILEFPDRSQWTIFKELHE 1260
            EWKLAVKLSGITKYSYKAHQFLQPGSTNRYTHAMMWKGGKDWILEFPDRSQW IFKELHE
Sbjct: 1201 EWKLAVKLSGITKYSYKAHQFLQPGSTNRYTHAMMWKGGKDWILEFPDRSQWAIFKELHE 1260

Query: 1261 ECYNRNIRSASVKNIPIPGVRLIEENDEHVAETAFMRNPSKYFRQVETDVEMALNPNRVL 1320
            ECYNRNIR+ASVKNIPIPGV L+EENDE+ AE+AFMRNPSKYFRQVETDVEMALNP R+L
Sbjct: 1261 ECYNRNIRAASVKNIPIPGVCLLEENDEYEAESAFMRNPSKYFRQVETDVEMALNPTRIL 1320

Query: 1321 YDMDSDDEQWIKDVQTSSEVGSSSSLGEASSEVFEKTMDAFEKAAYSQQRDEFTDDEIAE 1380
            YDMDSDDEQWIKD+  SSEVGSSS LGE SSEVFEKT+DAFEKAAYSQQRDEFTDDEIAE
Sbjct: 1321 YDMDSDDEQWIKDILPSSEVGSSSGLGEVSSEVFEKTVDAFEKAAYSQQRDEFTDDEIAE 1380

Query: 1381 AVNETLVSGLTKGIFEYWQLKRRQKGMPLLRHLQPPLWETYRQQLKDWESTVNKNNTNSC 1440
             +NETL S LTK IFEYWQ KRR+KGMPL+RHLQPPLWETY+QQLKDWE T+NK+NT+ C
Sbjct: 1381 VMNETLASDLTKAIFEYWQQKRRRKGMPLIRHLQPPLWETYQQQLKDWECTINKSNTSFC 1440

Query: 1441 NGYHD-SASIEKPPMFAFCLKPRGLEVSNKGSKQRSHRKFSMAGHSNSITYDQDGLHGFV 1500
            NGYH+ +AS+EKPPMFAFCLKPRGLEV NKGSKQRSHRKFS++GHSNSI YD DGLHGF 
Sbjct: 1441 NGYHEKAASVEKPPMFAFCLKPRGLEVFNKGSKQRSHRKFSVSGHSNSIAYDNDGLHGFG 1500

Query: 1501 RRLNGSALGDDRMVYIGHNYEFLEDSPLIHTSSSLFSPRLEGGILSNDGFERNVLPKLHK 1560
            RRLNG +LGDD+M YIGHNYEFLEDSPLIHTSSSLFSPRLEGGILSNDG ERN LPKLHK
Sbjct: 1501 RRLNGFSLGDDKMAYIGHNYEFLEDSPLIHTSSSLFSPRLEGGILSNDGLERNFLPKLHK 1560

Query: 1561 TKSRKYG--ASPYEPMMASSFNQRMVGKRDGLNRWNNGYSEWSSPLRYRFDGSQRQILEQ 1620
            +KSRKYG  AS Y+  MA SFNQRM+GKRDGLNRWNNGYSEWSSP RY FDGSQRQILEQ
Sbjct: 1561 SKSRKYGAWASTYDSGMA-SFNQRMIGKRDGLNRWNNGYSEWSSPRRYPFDGSQRQILEQ 1620

Query: 1621 LEGSDLDEYRFRDVSGAAQEARNVAKFKREKARRLLIRADLAIHKAVVAIMTAEAMKAAS 1656
            LEGSD+DE+R RD SGAAQ ARN+AK KREKARRLL RADLAIHKAVVAIMTAEAMKAAS
Sbjct: 1621 LEGSDVDEFRLRDASGAAQHARNMAKLKREKARRLLYRADLAIHKAVVAIMTAEAMKAAS 1676

BLAST of Cp4.1LG02g04400 vs. TrEMBL
Match: M5XKQ9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa000151mg PE=4 SV=1)

HSP 1 Score: 1588.2 bits (4111), Expect = 0.0e+00
Identity = 921/1686 (54.63%), Postives = 1136/1686 (67.38%), Query Frame = 1

Query: 17   MENSLEYSHGTDTPKKSRSLDLKSLYESKVSKEVQNESLKRKVRAENGDEQRNERR-NRK 76
            MEN +E SHGT+ P+KSRSLDLKSLY+S+ +KEV  +SLKRK  AE+GDE R++++ +RK
Sbjct: 1    MENRIENSHGTEIPRKSRSLDLKSLYKSRTTKEVPTKSLKRKGSAEDGDENRDKKKKSRK 60

Query: 77   KVSLSNFSSIYSRSRKSLHEVYDDELGSSGHDSKKALKSESKEKLNSSSECNKVS-LILN 136
            +VSLS+  ++ + S+KSL EVY   L S  HD + A+K  S + L+S S  N VS L L 
Sbjct: 61   EVSLSSLKNVNTSSKKSLDEVYHSGLNSGSHDPE-AVKCGSSQILDSGSGFNGVSSLSLG 120

Query: 137  DDVMQIPKRKRGGFVRRKKILGGQILKPSGQLDGKAAIVDQ---IAKSSAKDPSDLVECC 196
            ++V+QIP+RKRG FV RKK  GGQ+LK   Q  GK  +VDQ   IAK +  D     E  
Sbjct: 121  NNVIQIPRRKRG-FVGRKKFEGGQVLKLPDQSAGKVGLVDQNHQIAKLNVDDLGTQDELL 180

Query: 197  KTNRKPGFKSLKEKEQSELSSTQHPKRGDGHADPLVRENQSSSTLHLKEEGEHIDHSVVK 256
               RK G    KE   SEL+S  H  +                      EG H  HSVV 
Sbjct: 181  NVKRKKGRDDFKENIDSELNSAPHADK----------------------EGVHTSHSVVS 240

Query: 257  PVSLSFKKSQKNFG---------RRKISASGRKRNSKEGEASISHSTKRRDGCLEDDEEN 316
                S KKS++N           +RK  A G K  +KE +  +  STK      EDDEEN
Sbjct: 241  NGDSSLKKSRRNQDNEENRRSRRKRKDLACGSKSAAKEADPLVDSSTKSCHDLQEDDEEN 300

Query: 317  LEENAARMLSSRFDPTCTGFSSNVMGS-LPPANG----------FVSHGLKPLADLESAS 376
            LEENAARMLSSRFDP+CTGFSSN   S L  ANG          F S   K ++  ES S
Sbjct: 301  LEENAARMLSSRFDPSCTGFSSNNKASALESANGLSFLLSSGQDFDSRRSKSISGSESPS 360

Query: 377  VDSAGRVLRPRKQRKEKKSSRKRRHFYEILLADLDAVWILNRRIKVFWPLDQIWYYGLVN 436
            VD++GRVLRPRKQ KEK  SRKRRHFYE+ L +LDA W+ NRRIKVFWPLDQ WYYGLVN
Sbjct: 361  VDNSGRVLRPRKQHKEKGHSRKRRHFYEVFLGNLDAYWVTNRRIKVFWPLDQTWYYGLVN 420

Query: 437  DYDKERKLHHVKYDDRDEEWIDLQNERFKLLLLPSEVPGREDRRKSVMGNNPANVKRES- 496
            DYDKE+KLHHVKYDDRDEEWIDLQNERFKLLLLPSEVPG+ +R+KS   N  ++V+R+  
Sbjct: 421  DYDKEKKLHHVKYDDRDEEWIDLQNERFKLLLLPSEVPGKIERKKSTQRNR-SSVERKGN 480

Query: 497  ----RSRKGKETNAPKDECNTGSFMDSEPIISWLARSTQRNKSCPSHSSKRQKNSSLFLK 556
                + +K +E  +  D C  GS+MD+EPIISWLARS +R KS PS + K+QK S L LK
Sbjct: 481  LKPRKEKKKRELTSEDDSC-MGSYMDTEPIISWLARSNRRVKS-PSCAVKKQKTSGLSLK 540

Query: 557  SG-------SQAIERPHSGLPERLGDMDGLEKSTSEITTCSKTCKLPIVYFRKRFRNIGT 616
                        I   H+    R  D+   EK TS+ +TC +  K+PIVYFR+R R  G+
Sbjct: 541  PPLSDEDVIRDKIRTSHNS--GRSSDVLRQEKPTSQGSTCPRDSKMPIVYFRRR-RKTGS 600

Query: 617  EMSHKHETSYASRRTHSLASFFSNVGEIDDVEESDISPRRSEALRLLWCVDDDGLLQLDI 676
             +SH  + ++A        + F  V EI D+EE     RR +A   LW +DD GLL+L +
Sbjct: 601  VLSHTSKGNHAYVSELGSITSFVPVKEIGDLEEPYDFVRRLDANGPLWYIDDAGLLKLTL 660

Query: 677  PAMEVGQLRFELTIPEYSFLNMTSSAETFWLFHLSMLIQHGALTLTWPKVQLEMFCVDNV 736
            P  E G++ FEL +P +S +N +   E F LFH +ML ++G + +TWPKV LEM  VDNV
Sbjct: 661  PRTEAGKVTFELGVPMHSTINDSFGVE-FSLFHAAMLHRYGTVVITWPKVYLEMLFVDNV 720

Query: 737  VGLRFLLFEGCLMQAVAFIFLVMKMFRQPSKQVRYADFQVPMTSIRFKFSCPPDIGKQLV 796
            VGLRFLLFEGCL QAVAF+FLV+ +F  P +Q ++ DFQ+P+TSIRFKFSC   + KQLV
Sbjct: 721  VGLRFLLFEGCLEQAVAFVFLVLALFHHPIEQGKFLDFQLPVTSIRFKFSCVQLLRKQLV 780

Query: 797  FAFYNFSETKNSKWLHLDCRLKKYCLLTKQLPLTECTYDNIKRFQNCTSQFHTSPFCGEC 856
            FA YNFS+ K SKW +LD +++ +CLLTK+LPL+ECTYD+I+  QN T+Q      CG  
Sbjct: 781  FAVYNFSQVKKSKWKYLDSKVRSHCLLTKKLPLSECTYDSIQALQNGTNQSPFMSLCGRP 840

Query: 857  SSIKGTQKIGSLGINHKGDAGE----NNGHSNLCSNETNKKFPAFALSFTAAPSFMLSLH 916
            SS+KGT++    GIN  G + E    N  HS   S+E  +K P  ALSFTAAP+F LSLH
Sbjct: 841  SSVKGTRRRSRQGINFMGGSRESTFVNISHSTSHSDELPRKLPPLALSFTAAPTFFLSLH 900

Query: 917  LKLLMEQCVAHLSSLHQDSGKRAENFG-RLTVDNVCMNDCANNLSTSSKALGRWNLCARS 976
            LKLLME CVA++     DS +   N G  L VD   + D  N     SK     NL  ++
Sbjct: 901  LKLLMEHCVANICFRDPDSVELLGNSGSMLAVDCSSVEDFFNR---GSKITHENNL--KA 960

Query: 977  DLGTGLSDCEEGGSSRYKRSRLVAETCAGSHDSDKARNDVKKRMRSSGNDKSEKAMALPN 1036
              G   SD                      H   K   +                +AL N
Sbjct: 961  SPGNATSD----------------------HSFSKPETETA--------------LALCN 1020

Query: 1037 VARSDNGSDSFLNDLSVEIPSF----QPVDGELHNAQLSMDVAWNVNSGIIRSPNPTAPR 1096
              +SD  S SFLN L+VEIPSF    +PVDGE+ +AQ   D +WN++  II SPNPTAPR
Sbjct: 1021 GEKSDTDSQSFLNGLTVEIPSFDRFEKPVDGEVQSAQQPTDCSWNMSGSIIPSPNPTAPR 1080

Query: 1097 STWHRNKNNSSSFGLASHGWSDGKD--FLNGLGNRTKKPRTQVSYMLPFGGLDYGSKNRN 1156
            STWHR++N+SSSFG  SHGWSDGK   F NG GN  KKPRTQVSY LP+GG D+ SK RN
Sbjct: 1081 STWHRSRNSSSSFGSLSHGWSDGKADLFHNGFGNGPKKPRTQVSYTLPYGGFDFSSKQRN 1140

Query: 1157 SHPKATPYKRIRRASEKR-SDAARGSQRNIELLSCDANVLITTGDRGWRECGARVVLEVF 1216
               K  P KRIRRA+EKR SD +RGSQRN+E LSC+ANVLI   DRGWRECGA +VLE+F
Sbjct: 1141 LQ-KGIPPKRIRRANEKRLSDVSRGSQRNLEQLSCEANVLINGSDRGWRECGAHIVLELF 1200

Query: 1217 DHNEWKLAVKLSGITKYSYKAHQFLQPGSTNRYTHAMMWKGGKDWILEFPDRSQWTIFKE 1276
            DHNEWKLAVK+SG TKYSYKAHQFLQPGSTNRYTHAMMWKGGKDWILEFPDRSQW +F+E
Sbjct: 1201 DHNEWKLAVKISGTTKYSYKAHQFLQPGSTNRYTHAMMWKGGKDWILEFPDRSQWALFRE 1260

Query: 1277 LHEECYNRNIRSASVKNIPIPGVRLIEENDEHVAETAFMRNPSKYFRQVETDVEMALNPN 1336
            +HEECYNRNIRSA VKNIPIPGVRLIEE+D++ AE +F+R+ +KYFRQ ETDVEMAL+P+
Sbjct: 1261 MHEECYNRNIRSALVKNIPIPGVRLIEESDDNGAEISFLRSSTKYFRQTETDVEMALDPS 1320

Query: 1337 RVLYDMDSDDEQWIKDVQTSSEVGSSSSLGEASSEVFEKTMDAFEKAAYSQQRDEFTDDE 1396
            RVLYDMDSDDEQWI   Q SSEV +SSS+ E   E+FEKTMD FEKAAY+QQ D+FT +E
Sbjct: 1321 RVLYDMDSDDEQWIMKFQNSSEVDNSSSI-EIDEEMFEKTMDMFEKAAYAQQCDQFTYEE 1380

Query: 1397 IAEAVNETLVSGLTKGIFEYWQLKRRQKGMPLLRHLQPPLWETYRQQLKDWESTVNKNNT 1456
            I E V       + K I+E+W+ KR +KGMPL+RHLQP  WE Y+QQ+++WE  + K NT
Sbjct: 1381 IEEFVAVVGPMDVIKTIYEHWRGKRLRKGMPLIRHLQPSAWERYQQQVREWEQAMIKTNT 1440

Query: 1457 NSCNGYHD-SASIEKPPMFAFCLKPRGLEVSNKGSKQRSHRKFSMAGHSNSITYDQDGLH 1516
               NG H+ +AS+EKPPMFAFCLKPRGLEV NKGSKQRS ++FS++GHS+ +  DQDG H
Sbjct: 1441 ILPNGCHEKAASVEKPPMFAFCLKPRGLEVPNKGSKQRSQKRFSVSGHSSGMLGDQDGFH 1500

Query: 1517 GFVRRLNGSALGDDRMVYIGHNYEFLEDSPLIHTSSSLFSPRLEGGIL-SNDGFERNVLP 1576
               RR NG A GD+++VY GHNY+ L+DSPL  TS  +FSPR    IL SNDGFERN L 
Sbjct: 1501 AIGRRSNGFAFGDEKVVYPGHNYDSLDDSPLSQTSPRVFSPRDATNILISNDGFERNHLH 1560

Query: 1577 KLHKTKSRKYG--ASPYEPMMASSFNQRMVGKRDGLNRWNNGYSEWSSPLRYRFDGSQRQ 1636
            ++H++KS+K+G   SP EP M S ++ R+VG R+G+ RWN G+ +WSS   Y+ DG QR 
Sbjct: 1561 RIHRSKSKKFGRTVSPVEPQMVSPYSHRVVGNRNGVQRWNTGFPDWSSQRYYQTDGPQRH 1612

Query: 1637 ILEQLEGSDLDEYRFRDVSGAAQEARNVAKFKREKARRLLIRADLAIHKAVVAIMTAEAM 1650
             +  L+G DLDE+R RD SGAAQ A NVA+ KREKA++L  RADLAIHKAVV++MTAEA+
Sbjct: 1621 DMGLLDGPDLDEFRLRDASGAAQHAHNVARLKREKAQKLFYRADLAIHKAVVSLMTAEAI 1612

BLAST of Cp4.1LG02g04400 vs. TrEMBL
Match: W9SBV9_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_007381 PE=4 SV=1)

HSP 1 Score: 1519.2 bits (3932), Expect = 0.0e+00
Identity = 890/1720 (51.74%), Postives = 1134/1720 (65.93%), Query Frame = 1

Query: 17   MENSLEYSHGTDTPKKSRSLDLKSLYESKVSKEVQNESLKRKVRAENGDE--QRNERRNR 76
            MEN +E S G + P+KSRSLDLKSLY+ +V+K+VQN+ LKRK  A++GDE  ++ ++++ 
Sbjct: 1    MENRIESSDGAEVPRKSRSLDLKSLYKHRVTKDVQNKKLKRKASADDGDENSEKKKKKSV 60

Query: 77   KKVSLSNFSSIYSRSRKSLHEVYDDELGSSGHDSKKALKSESKEKLNSSSECNKVS-LIL 136
            K+VSLS+  +  S S+K++ +     L S  HDSK  LK E+K+KLN S     +S L L
Sbjct: 61   KEVSLSSLKNTSSSSKKNVDKDCHKGLSSGLHDSKD-LKLEAKQKLNGSIGFKSISSLSL 120

Query: 137  NDDVMQIPKRKRGGFVRRKKILGGQILKPSGQLDGKAAIVDQIAKSSAKDPSDLVECCKT 196
            NDDV+QIP+RKRG FV RKK  GG + +  G   GK  +VDQI+K S  D    VE  K 
Sbjct: 121  NDDVIQIPRRKRG-FVGRKKGEGGHVPRRQGLSCGKLDLVDQISKLSGDDSGSQVESVKV 180

Query: 197  NRKPGFKSLKEKEQSELSSTQHPKRGDGHADPLVRENQSSSTLHLKEEGEHIDHSVVKPV 256
             R  GF   KE   SE                      S+S  H +EE E ++H VV   
Sbjct: 181  KRTKGFDDFKENRISE----------------------SNSARHAEEEHERVNHLVVSNG 240

Query: 257  SLSFKKSQKNFGRRKISASGRKRNSKEGEASISHSTKRRDGCLEDDEENLEENAARMLSS 316
               FKKS++   + K  +   K  +KE E    +ST   +   EDDEENLEENAA MLSS
Sbjct: 241  DSLFKKSRRKRSKTKNLSPDDKVGAKEAEPLADNSTMMCNDSQEDDEENLEENAAMMLSS 300

Query: 317  RFDPTCTGFSSNVMGSLPPANG----------FVSHGLKPLADLESASVDSAGRVLRPRK 376
            RFDP CTGFSSN   +    +G          FVS   + L+  ES SVD+AGRVLRPR 
Sbjct: 301  RFDPNCTGFSSNKASAFATVDGLSFLLSSGRDFVSRRSRSLSGSESPSVDAAGRVLRPRI 360

Query: 377  QRKEKKSSRKRRHFYEILLADLDAVWILNRRIKVFWPLDQIWYYGLVNDYDKERKLHHVK 436
            Q KEK  SRKRRHFYE+   DLDA W+LNRRIKVFWPLDQ WYYGLVNDYD+E+KLHHVK
Sbjct: 361  QHKEKGHSRKRRHFYEVFFGDLDADWVLNRRIKVFWPLDQSWYYGLVNDYDREKKLHHVK 420

Query: 437  YDDRDEEWIDLQNERFKLLLLPSEVPGREDRRKSVMGNNPANVKRESRSRKGKET----- 496
            YDDRDEEWIDLQNERFKLLLLPSEVPG+   R+S + +  ++V+R+S S+  KE      
Sbjct: 421  YDDRDEEWIDLQNERFKLLLLPSEVPGKAACRRSRIRDR-SSVQRKSSSKPKKEKKKGDI 480

Query: 497  NAPKDECNTGSFMDSEPIISWLARSTQRNKSCPSHSSKRQKNSSLFLK------------ 556
            +   D C   ++MDSEPIISWLARS +R KS P H+ K+QK S L +K            
Sbjct: 481  SMQDDSCIGSNYMDSEPIISWLARSRRRVKS-PFHALKKQKPSDLSVKPVLPPFSNNAVN 540

Query: 557  ------SGSQAIERP----HSGLPERLGDMDGLEKSTSEITTCSKTCKLPIVYFRKRFRN 616
                  SG+   ++     +S L  R  +    E+STSE  +C K  K+PIVYFR+RFR 
Sbjct: 541  SNRCFESGTVRRDKRKFSRNSNLSGRFANDAMKEESTSESISCPKDSKMPIVYFRRRFRK 600

Query: 617  IGTEMSHKHETSYASRRT-HSLASFFSNVGEIDDVEESDISPRRSEALRLLWCVDDDGLL 676
             G E+S   E ++A R T   + SF   V +  D  + D+   R +   LLW VDD GLL
Sbjct: 601  TGLELSRGCEDNHACRNTLDPVTSFAPAVDDTRDWVKWDVLLGRLDLGGLLWSVDDAGLL 660

Query: 677  QLDIPAMEVGQLRFELTIPEYSFLNMTSSAETFWLFHLSMLIQHGALTLTWPKVQLEMFC 736
            +L +P +E G+ +F++  P  S L      E  WL H ++L+ +G + + WP+V LEM  
Sbjct: 661  KLMLPGLESGKFKFDVDFPILSGLYDIFGVENLWLSHSAVLLHYGTVMIRWPQVHLEMLF 720

Query: 737  VDNVVGLRFLLFEGCLMQAVAFIFLVMKMFRQPSKQVRYADFQVPMTSIRFKFSCPPDIG 796
            VDNV GLRFLLFEGCL QA+A +FLV++ F QP+++V++ D  +P+TSIRFK +C     
Sbjct: 721  VDNVFGLRFLLFEGCLNQALALVFLVVRTFHQPTERVKFVD--MPVTSIRFKLTCFQHHK 780

Query: 797  KQLVFAFYNFSETKNSKWLHLDCRLKKYCLLTKQLPLTECTYDNIKRFQNCTSQFHTSPF 856
            K L FAF NFS  +NSKW++LD +L+++CL+TKQLPL ECTYDNIK  QN T        
Sbjct: 781  KHLEFAFCNFSTVENSKWIYLDRKLRRHCLVTKQLPLPECTYDNIKMLQNRTVHLPLRSV 840

Query: 857  CGECSSIKGTQKIGSLGINHKGDAGENN----GHSNLCSNETNKKFPAFALSFTAAPSFM 916
            CG+ S IKGT+K    GIN  G + E+     G S+   ++  KK P  ALSFTAAP+F 
Sbjct: 841  CGQPSFIKGTRKRLRQGINFMGISRESAFMDIGRSSHF-DKMYKKLPPLALSFTAAPTFF 900

Query: 917  LSLHLKLLMEQCVAHLSSLHQDSGKRAENFGRLTVDNVC-MNDCAN-----NLSTSSKAL 976
            LSLHLK+LME  +AH+S    DS +  EN   +T D+   M + +N     +L  ++KAL
Sbjct: 901  LSLHLKMLMEHSLAHISLREHDSEEHLENSCSMTADDSSSMEEYSNKGSEMSLEENTKAL 960

Query: 977  G---RWNLC---ARSDLGTGLSDCEEGGSSR-----YKRSRLVAETCAGSHDSDKARNDV 1036
                  + C    R +L  GLS C +    +     +    + A T A S    K R D 
Sbjct: 961  SGEVASDGCFSSGRPELSNGLSVCCDRDQIKASQPCHNGDAIAAGTSADSPVHKKIRTDA 1020

Query: 1037 KKRMRSSGNDKSEK------AMALPNVARSDNGSDSFLNDLSVEIPSF----QPVDGELH 1096
              ++++     SE       + +L +  +S+ GS SF+N LSVEIP F    + VDGELH
Sbjct: 1021 TVQLQAWKGHHSESDQSALLSRSLDDRDKSEKGSQSFVNGLSVEIPPFNQFEKSVDGELH 1080

Query: 1097 NAQLSMDVAWNVNSGIIRSPNPTAPRSTWHRNKNNSSSFGLASHGWSDGK--DFLNGLGN 1156
             AQ + D++WN N  I  SPNPTAPRSTWHRNK NSS FG  SHGWSDGK     NG GN
Sbjct: 1081 GAQQATDLSWNTNGAIFSSPNPTAPRSTWHRNKQNSS-FGHLSHGWSDGKADPVYNGFGN 1140

Query: 1157 RTKKPRTQVSYMLPFGGLDYGSKNRNSHPKATPYKRIRRASEKR-SDAARGSQRNIELLS 1216
              KKPRTQVSY+LPFGG D   K + S  K  P KR+R+ASEKR SD +RGSQRN+ELLS
Sbjct: 1141 GPKKPRTQVSYLLPFGGFDCSPKQK-SIQKGLPSKRLRKASEKRSSDVSRGSQRNLELLS 1200

Query: 1217 CDANVLITTGDRGWRECGARVVLEVFDHNEWKLAVKLSGITKYSYKAHQFLQPGSTNRYT 1276
            CD N+LIT  DRGWRECGA+VVLE+FD +EWKLAVKLSG+TKYSYKAHQFLQPGSTNR+T
Sbjct: 1201 CDVNILITATDRGWRECGAQVVLELFDDHEWKLAVKLSGVTKYSYKAHQFLQPGSTNRFT 1260

Query: 1277 HAMMWKGGKDWILEFPDRSQWTIFKELHEECYNRNIRSASVKNIPIPGVRLIEENDEHVA 1336
            HAMMWKGGKDW LEF DRSQW +FKE+HEECYNRNI++ASVK+IPIPGVRL+EE D++ A
Sbjct: 1261 HAMMWKGGKDWTLEFMDRSQWALFKEMHEECYNRNIQAASVKSIPIPGVRLVEEGDDNGA 1320

Query: 1337 ETAFMRNPSKYFRQVETDVEMALNPNRVLYDMDSDDEQWIKDVQTSSEVGSSSSLGEASS 1396
            E AF+R+ +KYFRQVETD+EMALNP+RVLYD+DSDDEQWI   ++SSE+  S SLG+ S 
Sbjct: 1321 ELAFVRSSAKYFRQVETDIEMALNPSRVLYDLDSDDEQWIMKARSSSEL-DSGSLGKISE 1380

Query: 1397 EVFEKTMDAFEKAAYSQQRDEFTDDEIAEAVNETLVSGLTKGIFEYWQLKRRQKGMPLLR 1456
            E+FEKTMD FEKAAY+ QRD+ T +EI E         + K I+E+W+LKR++ GMPL+R
Sbjct: 1381 EMFEKTMDMFEKAAYAHQRDQLTLEEIEELTVGVGPMDVIKVIYEHWRLKRQKNGMPLIR 1440

Query: 1457 HLQPPLWETYRQQLKDWESTVNKNNTNSCNGYHD-SASIEKPPMFAFCLKPRGLEVSNKG 1516
            HLQPPLWE Y+Q++++WE  + + N N  NG  + +A IEKPPMFAFC+KPRGLEV NKG
Sbjct: 1441 HLQPPLWERYQQEVREWELAMTRINANLPNGCQEKTAQIEKPPMFAFCMKPRGLEVPNKG 1500

Query: 1517 SKQRSHRKFSMAGHSNSITYDQDGLHGFVRRLNGSALGDDRMVYIGHNYEFLEDSPLIHT 1576
            SKQRSHRK S++G SN+   DQDGLH + RRLNG + GD++ VY G+NY+ LEDSPL  T
Sbjct: 1501 SKQRSHRKISVSGKSNTTFGDQDGLHAYGRRLNGFSFGDEKFVYPGYNYDSLEDSPLPQT 1560

Query: 1577 SSSLFSPRLEGGI-LSNDGFERNVLPKLHKTKSRKYG--ASPYEPMMASSFNQRMV--GK 1636
               +F PR  G + ++N G +RN   K  ++KS+KYG   SP  P     +  R+V  G 
Sbjct: 1561 PRRMFLPRDAGSMSMTNYGLDRNHSYKFQRSKSKKYGNTVSPNNPQTMGLYGHRVVGNGS 1620

Query: 1637 RDGLNRWNNGYSEWSSPLRYRFDGSQRQILEQLEGSDLDEYRFRDVSGAAQEARNVAKFK 1656
            R+GL+RWN G+SEWSS   ++ + SQR  +EQL+GSDLDEYR RD S AAQ A N+AK K
Sbjct: 1621 RNGLHRWNMGFSEWSSQQHFQPEPSQRHFIEQLDGSDLDEYRVRDASSAAQRALNIAKLK 1680

BLAST of Cp4.1LG02g04400 vs. TrEMBL
Match: A0A061GQB3_THECC (Enhancer of polycomb-like transcription factor protein, putative isoform 1 OS=Theobroma cacao GN=TCM_038296 PE=4 SV=1)

HSP 1 Score: 1470.3 bits (3805), Expect = 0.0e+00
Identity = 871/1718 (50.70%), Postives = 1105/1718 (64.32%), Query Frame = 1

Query: 17   MENSLEYSHGTDTPKKSRSLDLKSLYESKVSKEV-QNESLKRKVRAENGDEQRNERRN-- 76
            MEN +  SHG + P+KSRSLDLKSLY+S  SKE  +N+SLKRK  ++ GD+++    N  
Sbjct: 1    MENRIGNSHGAEIPRKSRSLDLKSLYKSGDSKESSKNKSLKRKDSSQEGDDEKRSSNNNK 60

Query: 77   ----RKKVSLSNFSSIY-SRSRKSLHEVYDDELGSSGHDSKKALKSESKEKLNSSSECNK 136
                RK + LS+F ++  S S KSL EVY+    S  HDS+        +KL +    N 
Sbjct: 61   RKKSRKALPLSSFRTVDGSNSSKSLTEVYNGGFSSGLHDSESLKNLGLSQKLKNGCGANG 120

Query: 137  VSLILNDDVMQIPKRKRGGFVRRKKILGGQILKPSGQLDGKAAIVDQIAKSSAKDPSDLV 196
            +SL L D   +IP+RKRG FV R K  GGQ LK +G+       V +  K +++D     
Sbjct: 121  ISLSLGDSETRIPRRKRG-FVGRNKFEGGQRLKLAGRSSSTVGDVKEEVKLTSEDSGTQN 180

Query: 197  ECCKTNRKPGFKSLKEKEQSELSSTQHPKRGDGHADPLVRENQSSSTLHLKEEGEHIDHS 256
            E  K  +K      KE   SE S  QH K  DG A  L   N   S L            
Sbjct: 181  ESSKVKQKKFIDDFKENRNSESSLVQHLKEEDGVAAYLA-VNDGDSLL------------ 240

Query: 257  VVKPVSLSFKKSQKNFGRRKISASGRKRNSKEGEASISHSTKRRDGCLEDDEENLEENAA 316
                     KKSQ+N  +RK S  G K  +K+ E  +  S K  D   EDDEENLEENAA
Sbjct: 241  ---------KKSQRNPRKRKDSVKGGKSVAKKAEILVGSSVKTCDDFKEDDEENLEENAA 300

Query: 317  RMLSSRFDPTCTGFSSNVMGSLPPA-NGF---------VSHGLKPLADLESASVDSAGRV 376
            RMLSSRFDP+CTGFSSN   S+ P+ NGF          S G K  +  ESASVD++GRV
Sbjct: 301  RMLSSRFDPSCTGFSSNSKVSVSPSENGFSFLLSSGQNASSGSKTFSGSESASVDASGRV 360

Query: 377  LRPRKQRKEKKSSRKRRHFYEILLADLDAVWILNRRIKVFWPLDQIWYYGLVNDYDKERK 436
            LRPRK  KEK +SRKRRHFYEI   DLDA W+LNRRIKVFWPLD+ WYYGLVN+YDKERK
Sbjct: 361  LRPRKSHKEKSNSRKRRHFYEIYSGDLDASWVLNRRIKVFWPLDKSWYYGLVNEYDKERK 420

Query: 437  LHHVKYDDRDEEWIDLQNERFKLLLLPSEVPGREDRRKSVMGN-NPANVKRESRSRKGKE 496
            LHHVKYDDRDEEWI+LQNERFKLLL PSEVP + +R++S     +   ++    +R+ K 
Sbjct: 421  LHHVKYDDRDEEWINLQNERFKLLLFPSEVPSKSERKRSRRKRCSDDRIRNLKPNREEKR 480

Query: 497  TNAPKDECNTGSFMDSEPIISWLARSTQRNKSCPSHSSKRQKNSSLFLKSGSQ------A 556
                +D+   GS+MDSEPIISWLARS+ R KSCP  + KRQK S+    S  Q      A
Sbjct: 481  NVVTEDDSGNGSYMDSEPIISWLARSSHRVKSCPLRAVKRQKTSASSHSSPGQPLLCDEA 540

Query: 557  IERPH-----------------SGLPERLGDMDGLEKSTSEITTCSKTCKLPIVYFRKRF 616
            ++                    S L +R  D   +E S+   T+C K  K PIVYFR+RF
Sbjct: 541  VDENSCLYRVSLRVDKIELSGASALSDRPVDGIRVEDSSLGSTSCLKDSKHPIVYFRRRF 600

Query: 617  RNIGTEMSHKHETSYASRRTHSLASFFSNVGEIDDVEESDISPRRSEALRLLWCVDDDGL 676
            R     +    E +  +       +  ++V E  D+ E D+   R +    L   D+ G 
Sbjct: 601  RRTEKALCQASEGNCVASSVSESITSLASVDEFQDLGELDVCLGRLDPEGDLLFSDNAGQ 660

Query: 677  LQLDIPAMEVGQLRFELTIPEYSFLNMTSSAETFWLFHLSMLIQHGALTLTWPKVQLEMF 736
            L+L+I  +   Q RF L+ P +S  N     ++F L H  +L+Q G +   WP V LE+ 
Sbjct: 661  LRLNISLLRTKQFRFGLSFPVFSVSNNLFGTKSFSLVHTLLLLQCGTVMTIWPMVHLEIL 720

Query: 737  CVDNVVGLRFLLFEGCLMQAVAFIFLVMKMFRQPSKQVRYADFQVPMTSIRFKFSCPPDI 796
             VDN VGLRFLLFEG L QAVAF+F V+ +F  P++Q ++AD Q+P+TSIRFKFSC  D 
Sbjct: 721  FVDNEVGLRFLLFEGSLKQAVAFVFRVLTVFYLPTEQGKFADLQLPVTSIRFKFSCSQDF 780

Query: 797  GKQLVFAFYNFSETKNSKWLHLDCRLKKYCLLTKQLPLTECTYDNIKRFQNCTSQFHTSP 856
             KQ+VFAFYNF E K+SKW+ LD +LK+ CL+T+QLPL+ECTYDNIK  QN T+Q  +SP
Sbjct: 781  RKQIVFAFYNFHEVKHSKWVFLDSKLKRQCLITRQLPLSECTYDNIKALQNGTNQLLSSP 840

Query: 857  FCGECSSIKGTQKIG-SLGINHKGDAGENN----GHSNLCSNETNKKFPAFALSFTAAPS 916
               + SS++G ++     GI+  G + E++    G     S + ++  P FALSF AAP+
Sbjct: 841  AYKDSSSLEGLRRRRYRQGISLMGVSRESSFLKVGQFTSSSEKKHRNLPLFALSFGAAPT 900

Query: 917  FMLSLHLKLLMEQCVAHLSSLHQDSGKRAENFGRLTVDNVC-MNDCANNLSTSS---KAL 976
            F LSLHLKLLME  VA +S    DS ++  + G L VD+     DC +    SS   K L
Sbjct: 901  FFLSLHLKLLMEHSVARISFQDHDSNEQLGSSGDLMVDDSSNREDCVDKRFDSSSVEKNL 960

Query: 977  GRWNLCARSDLGTGLSDCEEGGSSRYKRS--------RLVAETCAGSHDSDK--ARNDVK 1036
               +  A SD      D    G   +K+S        + +  T A SH+ ++  A   V 
Sbjct: 961  KASSKDAASDTELTTLDLSVCGDEHWKKSSQKYENGDQTIYGTFASSHEPEEVGATAIVP 1020

Query: 1037 KRMRSSGNDKSEK----AMALPNVARSDNGSDSFLNDLSVEIPSFQP----VDGELHNAQ 1096
             + +   + +SE+    + +L +  R++ GS+S LND+ VEIPSF      +DGEL   Q
Sbjct: 1021 LQKQQCAHSESEQLVSSSKSLVDGDRNNAGSNSVLNDIRVEIPSFDQYENHIDGELPGTQ 1080

Query: 1097 LSMDVAWNVNSGIIRSPNPTAPRSTWHRNKNNSSSFGLASHGWSDGKD--FLNGLGNRTK 1156
             S D+ WN+N GII SPNPTAPRSTWHRN+++SSS G  +HGWS+GK   F N  GN  K
Sbjct: 1081 QSSDLTWNMNGGIIPSPNPTAPRSTWHRNRSSSSSIGYNAHGWSEGKADFFHNNFGNGPK 1140

Query: 1157 KPRTQVSYMLPFGGLDYGSKNRNSHPKATPYKRIRRASEKR-SDAARGSQRNIELLSCDA 1216
            KPRTQVSY +PFGGLDY SKN+  H +  P+KRIRRA+EKR SD +RGSQ+N+ELLSCDA
Sbjct: 1141 KPRTQVSYSMPFGGLDYSSKNKGHHQRGPPHKRIRRANEKRSSDVSRGSQKNLELLSCDA 1200

Query: 1217 NVLITTGDRGWRECGARVVLEVFDHNEWKLAVKLSGITKYSYKAHQFLQPGSTNRYTHAM 1276
            N+LIT GDRGWRECGA+V LE+FDHNEWKLAVK+SG T+YS+KAHQFLQPGSTNRYTHAM
Sbjct: 1201 NLLITLGDRGWRECGAQVALELFDHNEWKLAVKVSGSTRYSHKAHQFLQPGSTNRYTHAM 1260

Query: 1277 MWKGGKDWILEFPDRSQWTIFKELHEECYNRNIRSASVKNIPIPGVRLIEENDEHVAETA 1336
            MWKGGKDWILEF DRSQW +FKE+HEECYNRNIR+ASVKNIPIPGVRLIEE DE+ AE  
Sbjct: 1261 MWKGGKDWILEFTDRSQWALFKEMHEECYNRNIRAASVKNIPIPGVRLIEEYDEN-AEVT 1320

Query: 1337 FMRNPSKYFRQVETDVEMALNPNRVLYDMDSDDEQWIKDVQTSSEVGSSSSLGEASSEVF 1396
            F R+ SKY RQVETDVEMAL+P+ VLYDMDSDDEQWI  ++ SSE   SS   E S E+F
Sbjct: 1321 FFRSSSKYLRQVETDVEMALDPSHVLYDMDSDDEQWISRIRRSSESDVSSCSLEFSDELF 1380

Query: 1397 EKTMDAFEKAAYSQQRDEFTDDEIAEAVNETLVSGLTKGIFEYWQLKRRQKGMPLLRHLQ 1456
            EKTMD FEKAAY+QQ D+F  DEI E +       + + I+E+W+ KR++ G+PL+RHLQ
Sbjct: 1381 EKTMDIFEKAAYTQQCDQFNSDEIQELMAGVGSMKVIRPIYEHWRQKRQRVGLPLIRHLQ 1440

Query: 1457 PPLWETYRQQLKDWESTVNKNNTNSCNGYHDSA-SIEKPPMFAFCLKPRGLEVSNKGSKQ 1516
            PPLWE Y++Q+++WE +++K N    NG  D   SIEKPPMFAFCLKPRGLEV NKGSK 
Sbjct: 1441 PPLWEMYQRQVREWELSMSKVNPILPNGCSDKVPSIEKPPMFAFCLKPRGLEVPNKGSKP 1500

Query: 1517 RSHRKFSMAGHSNSITYDQDGLHGFVRRLNGSALGDDRMVYIGHNYEFLEDSPLIHTSSS 1576
            RS RK S++G SN    D +G H F RR NG   GD++++Y  HNYE LEDSPL   S  
Sbjct: 1501 RSQRKISVSGQSNHALGDHEGCHSFGRRSNGFLFGDEKVLYPVHNYESLEDSPLSQASPR 1560

Query: 1577 LFSPRLEGGI----LSNDGFERNVLPKLHKTKSRKYG--ASPYEPMMASSFNQRMVGKRD 1636
            +FSPR  G +    + +DGF +    KL ++KS+K+G   S  +  M +S++QR++GKR+
Sbjct: 1561 VFSPRDVGSMGYFSMGSDGFNKKYHQKLQRSKSKKFGNFLSSNDAQMMASYSQRLMGKRN 1620

Query: 1637 GLNRWNNGYSEWSSPLRYRFDGSQRQILEQLEGSDLDEYRFRDVSGAAQEARNVAKFKRE 1656
            G+ +WN G+SEW S      DG QR   EQL+ SD+DE+R RD S AAQ+A N+AKFKRE
Sbjct: 1621 GIRQWNMGFSEWQSQRHSFSDGFQRHGPEQLDNSDIDEFRLRDASSAAQQALNMAKFKRE 1680

BLAST of Cp4.1LG02g04400 vs. TrEMBL
Match: A0A061GW48_THECC (Enhancer of polycomb-like transcription factor protein, putative isoform 4 OS=Theobroma cacao GN=TCM_038296 PE=4 SV=1)

HSP 1 Score: 1452.6 bits (3759), Expect = 0.0e+00
Identity = 870/1746 (49.83%), Postives = 1105/1746 (63.29%), Query Frame = 1

Query: 17   MENSLEYSHGTDTPKKSRSLDLKSLYESKVSKEV-QNESLKRKVRAENGDEQRNERRN-- 76
            MEN +  SHG + P+KSRSLDLKSLY+S  SKE  +N+SLKRK  ++ GD+++    N  
Sbjct: 1    MENRIGNSHGAEIPRKSRSLDLKSLYKSGDSKESSKNKSLKRKDSSQEGDDEKRSSNNNK 60

Query: 77   ----RKKVSLSNFSSIY-SRSRKSLHEVYDDELGSSGHDSKKALKSESKEKLNSSSECNK 136
                RK + LS+F ++  S S KSL EVY+    S  HDS+        +KL +    N 
Sbjct: 61   RKKSRKALPLSSFRTVDGSNSSKSLTEVYNGGFSSGLHDSESLKNLGLSQKLKNGCGANG 120

Query: 137  VSLILNDDVMQIPKRKRGGFVRRKKILGGQILKPSGQLDGKAAIVDQIAKSSAKDPSDLV 196
            +SL L D   +IP+RKRG FV R K  GGQ LK +G+       V +  K +++D     
Sbjct: 121  ISLSLGDSETRIPRRKRG-FVGRNKFEGGQRLKLAGRSSSTVGDVKEEVKLTSEDSGTQN 180

Query: 197  ECCKTNRKPGFKSLKEKEQSELSSTQHPKRGDGHADPLVRENQSSSTLHLKEEGEHIDHS 256
            E  K  +K      KE   SE S  QH K  DG A  L   N   S L            
Sbjct: 181  ESSKVKQKKFIDDFKENRNSESSLVQHLKEEDGVAAYLA-VNDGDSLL------------ 240

Query: 257  VVKPVSLSFKKSQKNFGRRKISASGRKRNSKEGEASISHSTKRRDGCLEDDEENLEENAA 316
                     KKSQ+N  +RK S  G K  +K+ E  +  S K  D   EDDEENLEENAA
Sbjct: 241  ---------KKSQRNPRKRKDSVKGGKSVAKKAEILVGSSVKTCDDFKEDDEENLEENAA 300

Query: 317  RMLSSRFDPTCTGFSSNVMGSLPPA-NGF---------VSHGLKPLADLESASVDSAGRV 376
            RMLSSRFDP+CTGFSSN   S+ P+ NGF          S G K  +  ESASVD++GRV
Sbjct: 301  RMLSSRFDPSCTGFSSNSKVSVSPSENGFSFLLSSGQNASSGSKTFSGSESASVDASGRV 360

Query: 377  LRPRKQRKEKKSSRKRRHFYEILLADLDAVWILNRRIKVFWPLDQIWYYGLVNDYDKERK 436
            LRPRK  KEK +SRKRRHFYEI   DLDA W+LNRRIKVFWPLD+ WYYGLVN+YDKERK
Sbjct: 361  LRPRKSHKEKSNSRKRRHFYEIYSGDLDASWVLNRRIKVFWPLDKSWYYGLVNEYDKERK 420

Query: 437  LHHVKYDDRDEEWIDLQNERFKLLLLPSEVPGREDRRKSVMGN-NPANVKRESRSRKGKE 496
            LHHVKYDDRDEEWI+LQNERFKLLL PSEVP + +R++S     +   ++    +R+ K 
Sbjct: 421  LHHVKYDDRDEEWINLQNERFKLLLFPSEVPSKSERKRSRRKRCSDDRIRNLKPNREEKR 480

Query: 497  TNAPKDECNTGSFMDSEPIISWLARSTQRNKSCPSHSSKRQKNSS--------------- 556
                +D+   GS+MDSEPIISWLARS+ R KSCP  + KRQK S+               
Sbjct: 481  NVVTEDDSGNGSYMDSEPIISWLARSSHRVKSCPLRAVKRQKTSASSHSSPGQPLLCDEA 540

Query: 557  ----LFLKSGSQAIERPH----SGLPERLGDMDGLEKSTSEITTCSKTCKLPIVYFRKRF 616
                  L   S  +++      S L +R  D   +E S+   T+C K  K PIVYFR+RF
Sbjct: 541  VDENSCLYRVSLRVDKIELSGASALSDRPVDGIRVEDSSLGSTSCLKDSKHPIVYFRRRF 600

Query: 617  RNIGTEMSHKHETSYASRRTHSLASFFSNVGEIDDVEESDISPRRSEALRLLWCVDDDGL 676
            R     +    E +  +       +  ++V E  D+ E D+   R +    L   D+ G 
Sbjct: 601  RRTEKALCQASEGNCVASSVSESITSLASVDEFQDLGELDVCLGRLDPEGDLLFSDNAGQ 660

Query: 677  LQLDIPAMEVGQLRFELTIPEYSFLNMTSSAETFWLFHLSMLIQHGALTLTWPKVQLEMF 736
            L+L+I  +   Q RF L+ P +S  N     ++F L H  +L+Q G +   WP V LE+ 
Sbjct: 661  LRLNISLLRTKQFRFGLSFPVFSVSNNLFGTKSFSLVHTLLLLQCGTVMTIWPMVHLEIL 720

Query: 737  CVDNVVGLRFLLFEGCLMQAVAFIFLVMKMFRQPSKQVRYADFQVPMTSIRFKFSCPPDI 796
             VDN VGLRFLLFEG L QAVAF+F V+ +F  P++Q ++AD Q+P+TSIRFKFSC  D 
Sbjct: 721  FVDNEVGLRFLLFEGSLKQAVAFVFRVLTVFYLPTEQGKFADLQLPVTSIRFKFSCSQDF 780

Query: 797  GKQLVFAFYNFSETKNSKWLHLDCRLKKYCLLTKQLPLTECTYDNIKRFQNCTSQFHTSP 856
             KQ+VFAFYNF E K+SKW+ LD +LK+ CL+T+QLPL+ECTYDNIK  QN T+Q  +SP
Sbjct: 781  RKQIVFAFYNFHEVKHSKWVFLDSKLKRQCLITRQLPLSECTYDNIKALQNGTNQLLSSP 840

Query: 857  FCGECSSIKGTQKIG-SLGINHKGDAGENN----GHSNLCSNETNKKFPAFALSFTAAPS 916
               + SS++G ++     GI+  G + E++    G     S + ++  P FALSF AAP+
Sbjct: 841  AYKDSSSLEGLRRRRYRQGISLMGVSRESSFLKVGQFTSSSEKKHRNLPLFALSFGAAPT 900

Query: 917  FMLSLHLKLLMEQCVAHLSSLHQDSGKRAENFGRLTVDNVC-MNDCANNLSTSS---KAL 976
            F LSLHLKLLME  VA +S    DS ++  + G L VD+     DC +    SS   K L
Sbjct: 901  FFLSLHLKLLMEHSVARISFQDHDSNEQLGSSGDLMVDDSSNREDCVDKRFDSSSVEKNL 960

Query: 977  GRWNLCARSDLGTGLSDCEEGGSSRYKRS--------RLVAETCAGSHDSDK--ARNDVK 1036
               +  A SD      D    G   +K+S        + +  T A SH+ ++  A   V 
Sbjct: 961  KASSKDAASDTELTTLDLSVCGDEHWKKSSQKYENGDQTIYGTFASSHEPEEVGATAIVP 1020

Query: 1037 KRMRSSGNDKSEKAMA----LPNVARSDNGSDSFLNDLSVEIPSFQP----VDGELHNAQ 1096
             + +   + +SE+ ++    L +  R++ GS+S LND+ VEIPSF      +DGEL   Q
Sbjct: 1021 LQKQQCAHSESEQLVSSSKSLVDGDRNNAGSNSVLNDIRVEIPSFDQYENHIDGELPGTQ 1080

Query: 1097 LSMDVAWNVNSGIIRSPNPTAPRSTWHRNKNNSSSFGLASHGWSDGKD--FLNGLGNRTK 1156
             S D+ WN+N GII SPNPTAPRSTWHRN+++SSS G  +HGWS+GK   F N  GN  K
Sbjct: 1081 QSSDLTWNMNGGIIPSPNPTAPRSTWHRNRSSSSSIGYNAHGWSEGKADFFHNNFGNGPK 1140

Query: 1157 KPRTQVSYMLPFGGLDYGSKNRNSHPKATPYKRIRRASEKRS-DAARGSQRNIELLSCDA 1216
            KPRTQVSY +PFGGLDY SKN+  H +  P+KRIRRA+EKRS D +RGSQ+N+ELLSCDA
Sbjct: 1141 KPRTQVSYSMPFGGLDYSSKNKGHHQRGPPHKRIRRANEKRSSDVSRGSQKNLELLSCDA 1200

Query: 1217 NVLITTGDRGWRECGARVVLEVFDHNEWKLAVKLSGITKYSYKAHQFLQPGSTNRYTHAM 1276
            N+LIT GDRGWRECGA+V LE+FDHNEWKLAVK+SG T+YS+KAHQFLQPGSTNRYTHAM
Sbjct: 1201 NLLITLGDRGWRECGAQVALELFDHNEWKLAVKVSGSTRYSHKAHQFLQPGSTNRYTHAM 1260

Query: 1277 MWKGGKDWILEFPDRSQWTIFKELHEECYNRNIRSASVKNIPIPGVRLIEENDEHVAETA 1336
            MWKGGKDWILEF DRSQW +FKE+HEECYNRNIR+ASVKNIPIPGVRLIEE DE+ AE  
Sbjct: 1261 MWKGGKDWILEFTDRSQWALFKEMHEECYNRNIRAASVKNIPIPGVRLIEEYDEN-AEVT 1320

Query: 1337 FMRNPSKYFRQVETDVEMALNPNRVLYDMDSDDEQWIKDVQTSSEVGSSSSLGEASSEVF 1396
            F R+ SKY RQVETDVEMAL+P+ VLYDMDSDDEQWI  ++ SSE   SS   E S E+F
Sbjct: 1321 FFRSSSKYLRQVETDVEMALDPSHVLYDMDSDDEQWISRIRRSSESDVSSCSLEFSDELF 1380

Query: 1397 EKTMDAFEKAAYSQQRDEFTDDEIAEAVNETLVSGLTKGIFEYWQLKRRQKGMPLLRHLQ 1456
            EKTMD FEKAAY+QQ D+F  DEI E +       + + I+E+W+ KR++ G+PL+RHLQ
Sbjct: 1381 EKTMDIFEKAAYTQQCDQFNSDEIQELMAGVGSMKVIRPIYEHWRQKRQRVGLPLIRHLQ 1440

Query: 1457 PPLWETYRQQLKDWESTVNKNNTNSCNGYHDSA-SIEKPPMFAFCLKPRGLEVSNKGSKQ 1516
            PPLWE Y++Q+++WE +++K N    NG  D   SIEKPPMFAFCLKPRGLEV NKGSK 
Sbjct: 1441 PPLWEMYQRQVREWELSMSKVNPILPNGCSDKVPSIEKPPMFAFCLKPRGLEVPNKGSKP 1500

Query: 1517 RSHRKFSMAGHSNSITYDQDGLHGFV----------------------------RRLNGS 1576
            RS RK S++G SN    D +G H F                             RR NG 
Sbjct: 1501 RSQRKISVSGQSNHALGDHEGCHSFGNVLCNFTFIWLFVMFSFASLTLYVVISGRRSNGF 1560

Query: 1577 ALGDDRMVYIGHNYEFLEDSPLIHTSSSLFSPRLEGGI----LSNDGFERNVLPKLHKTK 1636
              GD++++Y  HNYE LEDSPL   S  +FSPR  G +    + +DGF +    KL ++K
Sbjct: 1561 LFGDEKVLYPVHNYESLEDSPLSQASPRVFSPRDVGSMGYFSMGSDGFNKKYHQKLQRSK 1620

Query: 1637 SRKYG--ASPYEPMMASSFNQRMVGKRDGLNRWNNGYSEWSSPLRYRFDGSQRQILEQLE 1656
            S+K+G   S  +  M +S++QR++GKR+G+ +WN G+SEW S      DG QR   EQL+
Sbjct: 1621 SKKFGNFLSSNDAQMMASYSQRLMGKRNGIRQWNMGFSEWQSQRHSFSDGFQRHGPEQLD 1680

BLAST of Cp4.1LG02g04400 vs. TAIR10
Match: AT4G32620.2 (AT4G32620.2 Enhancer of polycomb-like transcription factor protein)

HSP 1 Score: 948.0 bits (2449), Expect = 7.9e-276
Identity = 647/1669 (38.77%), Postives = 938/1669 (56.20%), Query Frame = 1

Query: 17   MENSLEYSHGTDTPKKSRSLDLKSLYESKVSKEVQNESLKRKVRAE-NGDEQRNERRNRK 76
            MEN L  S+G    KKSRSLDLK+LY+S +SK+  N+S KRK R+  +GD+ + ++++RK
Sbjct: 1    MENRLGNSNGVGISKKSRSLDLKTLYKSSISKDSVNKSFKRKHRSGIDGDQLKQDKKSRK 60

Query: 77   KVSLSNFSSIYSRSRKSLHEVYDDELGSSGHDSKKALKSESKEKLNSSSECNKVSLILND 136
             VSLS+F  + S++   L +  +    +  H+ + + +    EKL  S+    +S+ L  
Sbjct: 61   VVSLSSFKKVGSQNESILDKACNGT--TILHNLEDSKEVGLDEKLCDSNGLQVISVGLAS 120

Query: 137  DVMQIPKRKRGGFVRRKKILGGQILKPSGQLDGKAAIVDQIAKSSAKDPSDLVECCKTNR 196
              + +P+R+R  FV R +   G   K +G+ D +  +V  I K +A++ S   +  K   
Sbjct: 121  STIYVPRRRRD-FVGRSRFENGLAQKSAGESDSQEELVVNIPKVTAEESSVQDQPSKVEE 180

Query: 197  KPGFKSLKEKEQSELSSTQHPKRGDGHADPLVRENQSSSTLHLKEEGEHIDHSVVKPVSL 256
            K   K +KE                           S+S   L+ E  H + S VK   L
Sbjct: 181  KDSDKDIKE---------------------------SNSAAPLQLENGHSNQSPVKDDQL 240

Query: 257  SFKKSQKNFGRRKISASGRKRNSKEGEASISHSTKRRDGCLEDDEENLEENAARMLSSRF 316
               K + +  R++ S++  +R  KE ++S   S +      EDDEENLE NAA MLSSRF
Sbjct: 241  VVVKQRNSNSRKRKSSASNRRVGKEAKSSGDASGRISKVSREDDEENLEANAAIMLSSRF 300

Query: 317  DPTCTGFSSNVMGSLPPANGFV------SHGLKPLADLESA---SVDSAGRVLRPRKQRK 376
            DP CT F SN +    P+   +       + + P ++L S+   S D+  R+LRPR+   
Sbjct: 301  DPNCTQFPSNSVTPGSPSASRLHPLPSGKNSVDPRSELLSSKCVSDDTDDRMLRPRRHND 360

Query: 377  EKKSS-RKRRHFYEILLADLDAVWILNRRIKVFWPLDQIWYYGLVNDYDKERKLHHVKYD 436
            + K   RKRRHFYEIL +D+D+ W+LN++IKVFWPLD+ WY+G V+ +D ++ LHHVKYD
Sbjct: 361  DGKGKVRKRRHFYEILFSDVDSHWLLNKKIKVFWPLDERWYHGFVDGFDGDKNLHHVKYD 420

Query: 437  DRDEEWIDLQNERFKLLLLPSEVPGREDRRKSVMGNNPANVKRESRSRKGKETNAPKDEC 496
            DRDEEWI+LQ ERFK+LL PSEVPG+  R++     + +  K +      K+    K++ 
Sbjct: 421  DRDEEWINLQGERFKILLFPSEVPGKNQRKRRC-SESKSTQKVKGNDTSSKDEEKQKEKL 480

Query: 497  NTGSFMDSEPIISWLARSTQRNKSCPSHSSKRQKNSSLF-----LKSGSQAIERPHSGLP 556
               S M+SEPII+WLARS  R+KS    + +++K + +      +K      +R  S L 
Sbjct: 481  EDDSCMESEPIITWLARSRHRDKSSTLKAVQKRKKTDVMTSNESVKMNGDVTDRSASSLA 540

Query: 557  ERLGDMDGLEKSTSEITTCSKTCKLPIVYFRKRFRNIGTEMSHKHETSYASRRTHSLASF 616
                 + G  K+  E +        PIVY R+R      ++  +                
Sbjct: 541  SC--GLPGPSKNELESSGFRNGSIFPIVYCRRRLHTAKKDIYKE---------------- 600

Query: 617  FSNVGEIDDVEESDISPRRSEALRLLWCVDDDGLLQLDIPAMEVGQLRFELTIPEYSFLN 676
             S    ++ +++  +S      +  L  ++D G L+L  P  E  Q    L++   S ++
Sbjct: 601  -SGYNSVEFLKQFLVSKSPDPGVEFL-PIEDSGDLELCCPWNESEQFELSLSLQGVSLMS 660

Query: 677  MTSSAETFWLFHLSMLIQHGALTLTWPKVQLEMFCVDNVVGLRFLLFEGCLMQAVAFIFL 736
                A+  WL   ++L++HG L   WP+V+LEM  ++N  GLR+L+FEGCLM+ V  IF 
Sbjct: 661  YFLMADVDWLSRAALLLRHGTLVTLWPRVRLEMIFLNNQDGLRYLIFEGCLMEVVQLIFR 720

Query: 737  VMKMFRQPSKQVRY---ADFQVPMTSIRFKFSCPPDIGKQLVFAFYNFSETKNSKWLHLD 796
            ++ +    +KQ      AD Q+P+ SI  + SC P   +QL F  Y+F E K+SKW +L+
Sbjct: 721  ILMVVDHSNKQGAQGADADLQLPVFSIGLQVSCIPGFQRQLGFQIYSFHEVKHSKWSYLE 780

Query: 797  CRLKKYCLLTKQLPLTECTYDNIKRFQNCTSQFHTSPFCGECSSIKGTQKIGSLGINHKG 856
              ++++ LL KQ+ + ECT++N+K  Q                  K ++   S G+  +G
Sbjct: 781  QNVRRHSLLVKQVSIAECTHNNMKVLQKVMQ--------------KRSRHGISSGLVSRG 840

Query: 857  DAGENNGHSNLCSNETNKKFPAFALSFTAAP-SFMLSLHLKLLMEQCVAHLSSLHQDSGK 916
             +      +++C  + N     FAL FTA P + +LSL           HL+ + ++ G 
Sbjct: 841  SSSAEAWPTSVCYKKQNTS--PFALLFTARPPTLLLSL-----------HLNMI-RELGH 900

Query: 917  RAENFGRLTVDNVCMNDCANNLSTSSKALGRWNLCARSDLGTGLSDCEEGGSSRYKRSRL 976
             + +F  +  D V    C  +++  +      +L ++S     +       SSR + S+ 
Sbjct: 901  DSADFLGIERDLVTHRGC--DMADFTNEHSELSLKSKSQTDEPIIT-----SSRAQESKD 960

Query: 977  VAETCAGSHDSDKARNDVKKRMRSSGNDKSEKAMALPNVARSDNGSDSFLNDLSVEIP-S 1036
            +         S +  +D +  M  S +    K     NV+         +N +S+++P S
Sbjct: 961  LHTPS----QSQQLGSDSENWMSYSSSVVRHKHETRSNVS---------VNGISIQVPIS 1020

Query: 1037 FQPVDGELHNAQLSMDVAWNVNSGIIRSPNPTAPRSTWHRNKNNSSSFGLASHGWSDGK- 1096
                DG   ++ L++++  + NS    SP  TAPRS W+R+K  SS  G  SHGWSD K 
Sbjct: 1021 DDCEDGTPQSSNLALNIQGSSNS----SPKATAPRSMWNRSK--SSLNGHLSHGWSDSKG 1080

Query: 1097 DFLN-GLGNRTKKPRTQVSYMLPFGGLDYGSKNRNSHPKATPYKRIRRASEKRSDAARGS 1156
            DFLN  L N  KK RTQVSY LP GG D  S+N+ S  K  P KRIRR++   +D  +G 
Sbjct: 1081 DFLNTNLANGPKKRRTQVSYSLPSGGSD--SRNKGSLLKGMPNKRIRRST---ADVTKGI 1140

Query: 1157 QRNIELLSCDANVLITTGDRGWRECGARVVLEVFDHNEWKLAVKLSGITKYSYKAHQFLQ 1216
            Q+++E   CDANVL+T GDRGWRE GA++ LE FD+NEW+LAVK+SG TKYS++AHQFLQ
Sbjct: 1141 QKDLESSLCDANVLVTLGDRGWREYGAQIFLEPFDNNEWRLAVKISGTTKYSHRAHQFLQ 1200

Query: 1217 PGSTNRYTHAMMWKGGKDWILEFPDRSQWTIFKELHEECYNRNIRSASVKNIPIPGVRLI 1276
            PGS NR+THAMMWKGGKDW LEFPDR QW +FKE+HEECYNRN R+A V+NIPIPG+R+I
Sbjct: 1201 PGSVNRFTHAMMWKGGKDWTLEFPDRGQWFLFKEMHEECYNRNTRAALVRNIPIPGIRMI 1260

Query: 1277 EENDEHVAETAFMRNPSKYFRQVETDVEMALNPNRVLYDMDSDDEQWIKDVQTSSEVGSS 1336
            E ++    ET F+R+ SKYFRQ ETDVEMAL+P+RV+YDMDSDDEQ +  ++  S   +S
Sbjct: 1261 ERDNFDGTETEFIRSSSKYFRQTETDVEMALDPSRVMYDMDSDDEQCLLRIRECSSAENS 1320

Query: 1337 SSLGEASSEVFEKTMDAFEKAAYSQQRDEFTDDEIAEAVNETLVSGLTKGIFEYWQLKRR 1396
             S  E + ++FEK MD FEKA++ +QRD FT  EI E           + I+E W+ KR+
Sbjct: 1321 GSC-EITEDMFEKAMDMFEKASFVKQRDNFTLIEIQELTAGVGSLEAMETIYELWRTKRQ 1380

Query: 1397 QKGMPLLRHLQPPLWETYRQQLKDWESTVNKNNT-NSCNGYHDSASIEKPPMFAFCLKPR 1456
            +KGMPL+RHLQPPLWE Y+++LKDWE  ++K NT NSC      +  EKP MFAFC KPR
Sbjct: 1381 RKGMPLIRHLQPPLWEKYQRELKDWELVMSKANTPNSCGSQKKQSPTEKPAMFAFCFKPR 1440

Query: 1457 GLEVSNKGSKQRSHRKFSMAGHSNSITYDQDGLHGFV-RRLNGSALGDDRMVYIGHNYEF 1516
            GLEV ++G+K RS +K S+    +S   D DG +    RR  G   GD+R +Y   +YE 
Sbjct: 1441 GLEVKHRGTKHRSQKKLSVYAQHSSALGDYDGCNSSAGRRPVGFVSGDERFLYSNQSYEH 1500

Query: 1517 LEDSPLIHTSSSLFSPR-LEGGILSN--DGFERNVLPKLHKTKSRKYGASPYEPMMASSF 1576
              +   +H  +  +SPR L  G  S+  +G+ RN     H+ KS                
Sbjct: 1501 SNEFS-VHPGT--YSPRDLGMGYFSSGGNGYHRN-----HQNKS---------------- 1533

Query: 1577 NQRMVGKRDGLNRWNNGYSEW-SSPLRYRFDGSQRQILEQLEGS-DLDEYRFRDVSGAAQ 1636
             QR+ GKR+   RW+ GYSE  SS L    +GSQR  +E +  S D+DEY+ RD +GAA+
Sbjct: 1561 -QRINGKRNTSERWDAGYSECPSSNLVCYSNGSQRPDVEGIRNSTDIDEYKLRDAAGAAR 1533

Query: 1637 EARNVAKFKREKARRLLIRADLAIHKAVVAIMTAEAMKAASEDDPNGDG 1656
             A  +AK KRE+A  L  +ADLAI KA  A+M AEA+KA+SED  N +G
Sbjct: 1621 RACALAKLKRERAESLRYKADLAIQKAAAALMCAEAVKASSEDLGNNNG 1533

BLAST of Cp4.1LG02g04400 vs. TAIR10
Match: AT5G04670.1 (AT5G04670.1 Enhancer of polycomb-like transcription factor protein)

HSP 1 Score: 169.1 bits (427), Expect = 2.3e-41
Identity = 91/255 (35.69%), Postives = 139/255 (54.51%), Query Frame = 1

Query: 1133 SQRNIELLSCDANVLITTGDRGWRECGARVVLEVFDHNEWKLAVKLSGITKYSYKAHQFL 1192
            ++  ++ + C AN+L+   DR  RE G  V+LE     EW L +K  G  +YS+ A + +
Sbjct: 398  TKEELDSICCSANILMIHSDRCTREEGFSVMLEASSSKEWFLVIKKDGAIRYSHMAQRTM 457

Query: 1193 QPGSTNRYTHAMMWKGGKDWILEFPDRSQWTIFKELHEECYNRNIRSASVKNIPIPGVRL 1252
            +P S+NR THA +W GG +W LEF DR  W  FK++++ECY RN+   SVK IPIPGVR 
Sbjct: 458  RPFSSNRITHATVWMGGDNWKLEFCDRQDWLGFKDIYKECYERNLLEQSVKVIPIPGVRE 517

Query: 1253 IEENDEHVAE-TAFMRNPSKYFRQVETDVEMALNPNRVLYDMDSDDEQWIKDVQTSSEVG 1312
            +    E++    +F R P  Y    E +V  A+  +  LYDMDS+DE+W++         
Sbjct: 518  VCGYAEYIDNFPSFSRPPVSYISVNEDEVSRAMARSIALYDMDSEDEEWLERQNQKMLNE 577

Query: 1313 SSSSLGEASSEVFEKTMDAFEKAAYSQQRDEFTDDEIAE--AVNETLVSGLTKGIFEYWQ 1372
                  +   E FE  +D FEK  +    D+  D++ A   +++      + + + +YW 
Sbjct: 578  EDDQYLQLQREAFELMIDGFEKYHFHSPADDLLDEKAATIGSISYLGRQEVVEAVHDYWL 637

Query: 1373 LKRRQKGMPLLRHLQ 1385
             KR+Q+  PLLR  Q
Sbjct: 638  KKRKQRKAPLLRIFQ 652

BLAST of Cp4.1LG02g04400 vs. TAIR10
Match: AT4G31880.1 (AT4G31880.1 LOCATED IN: cytosol, chloroplast)

HSP 1 Score: 57.8 bits (138), Expect = 7.5e-08
Identity = 64/292 (21.92%), Postives = 120/292 (41.10%), Query Frame = 1

Query: 346 DLESASVDSAGRVLRPRKQRKEK-----KSSRKRRHFYEILLADLDAVWILNRRIKVFWP 405
           D E  +V S     + +K+ K+       S+ KR+       A  ++  ++  RIKV+WP
Sbjct: 561 DNEKPAVSSGKLASKSKKEAKQTVEESPNSNTKRKRSLGQGKASGES--LVGSRIKVWWP 620

Query: 406 LDQIWYYGLVNDYDKERKLHHVKYDDRDEEWIDLQNERFKLLLLPSEVPGREDRRKSVMG 465
           +DQ +Y G+V  YD  +K H V YDD D+E + L+N+++  L         E   ++   
Sbjct: 621 MDQAYYKGVVESYDAAKKKHLVIYDDGDQEILYLKNQKWSPLDESELSQDEEAADQTGQE 680

Query: 466 NNPANVKRESRSRKGKETNAPKDECNTGSFMDSEPIISWLARSTQRNKSCPSHSSKRQKN 525
            + + V    +++ GK++         GS   S    +  A  + +       +SK + +
Sbjct: 681 EDASTVPLTKKAKTGKQSKMDNSSAKKGSGAGSSKAKATPASKSSKTSQDDKTASKSKDS 740

Query: 526 SSLFLKSGSQAIERPHSGLPERLGDMDGLEKSTSEITTCSKTCKLPIVYFRKRFRNIGTE 585
                +  + + E      P +     G  +S  +I++ SK+ K      +K   +  T 
Sbjct: 741 KEASREEEASSEEESEEEEPPKTVGKSGSSRSKKDISSVSKSGKSKASSKKKEEPSKATT 800

Query: 586 MSH-----------KHETSYASRRTHSLASFFSNVGEIDDVEESDISPRRSE 622
            S            K +T     ++ S ++  S   E     ES+ +P+  E
Sbjct: 801 SSKSKSGPVKSVPAKSKTGKGKAKSGSASTPASKAKESASESESEETPKEPE 850

BLAST of Cp4.1LG02g04400 vs. TAIR10
Match: AT2G31650.1 (AT2G31650.1 homologue of trithorax)

HSP 1 Score: 52.4 bits (124), Expect = 3.2e-06
Identity = 29/87 (33.33%), Postives = 43/87 (49.43%), Query Frame = 1

Query: 362 RKQRKEKKSSRKRRHFYEILLADLDAVWILNRRIKVFWPLDQIWYYGLVNDYDKERKLHH 421
           + Q K   +SR  + +  +    +D    +  + KVFWPLD +WY G +  Y  ERK + 
Sbjct: 178 KNQDKATLASRSAKKWVRLSYDGVDPTSFIGLQCKVFWPLDALWYEGSIVGYSAERKRYT 237

Query: 422 VKYDDRDEEWIDLQNERFKLLLLPSEV 449
           VKY D  +E I    E  K L+   E+
Sbjct: 238 VKYRDGCDEDIVFDREMIKFLVSREEM 264

BLAST of Cp4.1LG02g04400 vs. TAIR10
Match: AT1G15940.1 (AT1G15940.1 Tudor/PWWP/MBT superfamily protein)

HSP 1 Score: 52.0 bits (123), Expect = 4.1e-06
Identity = 40/146 (27.40%), Postives = 75/146 (51.37%), Query Frame = 1

Query: 390 ILNRRIKVFWPLDQIWYYGLVNDYDKERKLHHVKYDDRDEEWIDLQNERFKLLLLPSEVP 449
           ++ +R+ V+WPLD+ +Y G++  Y + +K+H V Y D D E ++L+ ERFK  ++  +  
Sbjct: 571 LVGKRVNVWWPLDKKFYEGVIKSYCRVKKMHQVTYSDGDVEELNLKKERFK--IIEDKSS 630

Query: 450 GREDRRKSVMGNNP--ANVKRESRSRKGKETNAPKDECNTGSFMDSEPIISWLARSTQRN 509
             ED+   ++ + P  A ++RE +S+K K  +   +  ++     S          T + 
Sbjct: 631 ASEDKEDDLLESTPLSAFIQRE-KSKKRKIVSKNVEPSSSPEVRSS--------MQTMKK 690

Query: 510 KSCPSHSSKRQKNSSLFLKSGSQAIE 534
           K   + S K+ K +   LK+ S   E
Sbjct: 691 KDSVTDSIKQTKRTKGALKAVSNEPE 705

BLAST of Cp4.1LG02g04400 vs. NCBI nr
Match: gi|659132747|ref|XP_008466363.1| (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103503793 [Cucumis melo])

HSP 1 Score: 2609.7 bits (6763), Expect = 0.0e+00
Identity = 1363/1688 (80.75%), Postives = 1460/1688 (86.49%), Query Frame = 1

Query: 1    MKIGGFWIGSFRLGKSMENSLEYSHGTDTPKKSRSLDLKSLYESKVSKEVQNESLKRKVR 60
            MKIGGFWIGSFRLGKSMENSLE SHGTD PKKSRSLDLKSLYESKVSKEVQN+ LKRK R
Sbjct: 1    MKIGGFWIGSFRLGKSMENSLENSHGTDIPKKSRSLDLKSLYESKVSKEVQNKRLKRKAR 60

Query: 61   AENGDEQRNERRNRKKVSLSNFSSIYSRSRKSLHEVYDDELGSSGHDSKKALKSESKEKL 120
            AE+GD Q+NERRNRKKVSLSNFSSIYSRSRKSL EVYD  LGSSGHDSKKALKSES++KL
Sbjct: 61   AEDGDGQKNERRNRKKVSLSNFSSIYSRSRKSLDEVYDAGLGSSGHDSKKALKSESRDKL 120

Query: 121  NSSSECNKVSLILNDDVMQIPKRKRGGFVRRKKILGGQILKPSGQLDGKAA-------IV 180
            NSSSE N+V LIL+++VM IPKRKRGGFVRRKK L GQILKPSGQLD KA        IV
Sbjct: 121  NSSSEFNEVPLILDENVMHIPKRKRGGFVRRKKSLDGQILKPSGQLDAKAGSLDDKAGIV 180

Query: 181  DQIAKSSAKDPSDLVECCKTNRKPGFKSLKEKEQSELSSTQHPKRGDGHADPLVRENQSS 240
            DQIAKSS KD SD VECCKTNRK  FK LKEKEQ ELSS QH K+ DG AD L REN+ +
Sbjct: 181  DQIAKSSVKDSSDQVECCKTNRKLAFKDLKEKEQKELSSAQHLKKEDGQADQLTRENELN 240

Query: 241  STLHLKEEGEHIDHSVVKPVSLSFKKSQKNFGRRKISASGRKRNSKEGEASISHSTKRRD 300
                LKEEGEHIDHSVVKPVS S KKSQKN  +RKIS S  K NSKEGEASIS STKRRD
Sbjct: 241  PASCLKEEGEHIDHSVVKPVSPSSKKSQKNVRKRKISGSRSKSNSKEGEASISPSTKRRD 300

Query: 301  GCLEDDEENLEENAARMLSSRFDPTCTGFSSNVMGSLPPANGF----------VSHGLKP 360
            G  EDDEENLEENAARMLSSRFDP CTGFSSN  GSLPP NG           VS   KP
Sbjct: 301  GFPEDDEENLEENAARMLSSRFDPNCTGFSSNTKGSLPPTNGLSFLLSSGHDNVSRIFKP 360

Query: 361  LADLESASVDSAGRVLRPRKQRKEKKSSRKRRHFYEILLADLDAVWILNRRIKVFWPLDQ 420
               LESASVD+AGRVLRPRKQRKEK SSRKRRHFYEIL  DLDA W+LNRRIKVFWPLDQ
Sbjct: 361  --GLESASVDAAGRVLRPRKQRKEKXSSRKRRHFYEILFGDLDAAWVLNRRIKVFWPLDQ 420

Query: 421  IWYYGLVNDYDKERKLHHVKYDDRDEEWIDLQNERFKLLLLPSEVPGREDRRKSVMGNNP 480
            IWYYGLVNDYDKERKLHHVKYDDRDEEWIDLQNERFKLLLLPSEVPGRE+RRKS +GN+ 
Sbjct: 421  IWYYGLVNDYDKERKLHHVKYDDRDEEWIDLQNERFKLLLLPSEVPGREERRKSAVGNDL 480

Query: 481  ANVKRESRSRKGKETNAP--KDECNTGSFMDSEPIISWLARSTQRNKSCPSHSSKRQKNS 540
            AN K  SRSRKGKET+A   +D+CNT S+MDSEPIISWLARST RNKS PSH+SKRQK S
Sbjct: 481  ANEKGRSRSRKGKETDAVILEDDCNTSSYMDSEPIISWLARSTNRNKSSPSHNSKRQKTS 540

Query: 541  SLFLKSGSQAIERP------HSGLPERLGDMDGLEKSTSEITTCSKTCKLPIVYFRKRFR 600
            SL  KSGSQA E P       SGL ERL D+DG EKS SE TTCS T KLPIVYFRKRFR
Sbjct: 541  SLSSKSGSQANENPANLLVKSSGLAERLADVDGQEKSASETTTCSTTRKLPIVYFRKRFR 600

Query: 601  NIGTEMSHKHETSYASRRTH-SLASFFSNVGEIDDVEESDISPRRSEALRLLWCVDDDGL 660
            NIGTE+ HK ET +ASRRTH SLA  FSNV EIDDVEE DISPRRSEA RLLWCVDD GL
Sbjct: 601  NIGTEIPHKRETDFASRRTHASLAFSFSNV-EIDDVEEPDISPRRSEAHRLLWCVDDAGL 660

Query: 661  LQLDIPAMEVGQLRFELTIPEYSFLNMTSSAETFWLFHLSMLIQHGALTLTWPKVQLEMF 720
            LQL IP MEVGQLRFEL+IPEYSF N+TSSAETFWLFHL+MLIQHG LTL WPKVQLEM 
Sbjct: 661  LQLAIPLMEVGQLRFELSIPEYSFWNVTSSAETFWLFHLAMLIQHGTLTLLWPKVQLEML 720

Query: 721  CVDNVVGLRFLLFEGCLMQAVAFIFLVMKMFRQPSKQVRYADFQVPMTSIRFKFSCPPDI 780
             VDNVVGLRFLLFEGCLMQAVAFIFLV+K+F+ P KQ RYADFQ P+TSIRFKFSC  DI
Sbjct: 721  FVDNVVGLRFLLFEGCLMQAVAFIFLVLKLFQSPGKQGRYADFQFPITSIRFKFSCLQDI 780

Query: 781  GKQLVFAFYNFSETKNSKWLHLDCRLKKYCLLTKQLPLTECTYDNIKRFQNCTSQFHTSP 840
            GKQLVFAFYNFSE KNSKW+HLD RLKKYCL++KQLPLTECTYDNIK+ QN  +QF  SP
Sbjct: 781  GKQLVFAFYNFSELKNSKWVHLD-RLKKYCLISKQLPLTECTYDNIKKLQNSKTQFRASP 840

Query: 841  FCGECSSIKGTQKIGSLGINHKGDAGENNGHSNLCSNETNKKFPAFALSFTAAPSFMLSL 900
            FCG  SS+KGTQKI SLGIN KG A  N+GHSNLCSNE  + FPAFA+SFTAAP+F LSL
Sbjct: 841  FCGRSSSVKGTQKISSLGINLKGAACVNSGHSNLCSNEXKRNFPAFAISFTAAPTFFLSL 900

Query: 901  HLKLLMEQCVAHLSSLHQDSGKRAENFGRLTVDNVCMNDCANNLSTSSKALGRWNLCARS 960
            HLKLLME+CVAHLS  H DS +  EN+GRLTVD++  +DCAN+LSTSSKA  RWN C +S
Sbjct: 901  HLKLLMERCVAHLSLQHHDSIEHQENYGRLTVDDMLTDDCANSLSTSSKASDRWNSCPQS 960

Query: 961  DLGTGLSDCEEGG---SSRYKRSRLVAETCAGSHDSDKARNDVKKRMRSSGNDKSEKAMA 1020
            DLGTG+SDCE+G    SS+YKRS  VA TCAGS D+DKA NDVK+R+R +G + S K M 
Sbjct: 961  DLGTGISDCEDGDGVQSSQYKRSTPVAPTCAGSQDTDKASNDVKRRIRPAGKNISGKTMP 1020

Query: 1021 LPNVARSDNGSDSFLNDLSVEIPSFQPVDGELHNAQLSMDVAWNVNSGIIRSPNPTAPRS 1080
            LP VARSD   DSFLNDLSVEIPSFQP+DGELH  Q SMDV WN N+G+I SPNPTAPRS
Sbjct: 1021 LPKVARSDK--DSFLNDLSVEIPSFQPLDGELHGPQQSMDVGWNGNAGVIPSPNPTAPRS 1080

Query: 1081 TWHRNKNNSSSFGLASHGWSDGKD-FLNGLGNRTKKPRTQVSYMLPFGGLDYGSKNRNSH 1140
            TWHRNKNNS+S GLASHGWSDGK  F+NGLGNRTKKPRTQVSY LPFGG DY SK+RNSH
Sbjct: 1081 TWHRNKNNSTSLGLASHGWSDGKSSFINGLGNRTKKPRTQVSYSLPFGGFDYSSKSRNSH 1140

Query: 1141 PKATPYKRIRRASEKRSDAARGSQRNIELLSCDANVLITTGDRGWRECGARVVLEVFDHN 1200
            PKA P KRIRRASEKRSD ARGS+RN+ELLSCDANVLIT GDRGWRECGARVVLEVFDHN
Sbjct: 1141 PKAIPSKRIRRASEKRSDVARGSKRNLELLSCDANVLITLGDRGWRECGARVVLEVFDHN 1200

Query: 1201 EWKLAVKLSGITKYSYKAHQFLQPGSTNRYTHAMMWKGGKDWILEFPDRSQWTIFKELHE 1260
            EWKLAVKLSGITKYSYKAHQFLQPGSTNRYTHAMMWKGGKDWILEFPDRSQW IFKELHE
Sbjct: 1201 EWKLAVKLSGITKYSYKAHQFLQPGSTNRYTHAMMWKGGKDWILEFPDRSQWAIFKELHE 1260

Query: 1261 ECYNRNIRSASVKNIPIPGVRLIEENDEHVAETAFMRNPSKYFRQVETDVEMALNPNRVL 1320
            ECYNRNIR+ASVKNIPIPGV L+EENDE+VAE A+MRNPSKYFRQVETDVEMALNP RVL
Sbjct: 1261 ECYNRNIRAASVKNIPIPGVCLLEENDEYVAEIAYMRNPSKYFRQVETDVEMALNPARVL 1320

Query: 1321 YDMDSDDEQWIKDVQTSSEVGSSSSLGEASSEVFEKTMDAFEKAAYSQQRDEFTDDEIAE 1380
            YDMDSDDEQWIKD++TSSEVGS+S LGE SSEVFEKT+DAFEKAAYSQQR EFTDDEIAE
Sbjct: 1321 YDMDSDDEQWIKDIRTSSEVGSNSGLGEVSSEVFEKTVDAFEKAAYSQQRVEFTDDEIAE 1380

Query: 1381 AVNETLVSGLTKGIFEYWQLKRRQKGMPLLRHLQPPLWETYRQQLKDWESTVNKNNTNSC 1440
             +NETL+SGLTK IFEYWQ KRR+KGMPL+RHLQPPLWETY+QQLKDWE T+NK+NT+ C
Sbjct: 1381 VMNETLLSGLTKAIFEYWQQKRRRKGMPLIRHLQPPLWETYQQQLKDWECTINKSNTSFC 1440

Query: 1441 NGYHD-SASIEKPPMFAFCLKPRGLEVSNKGSKQRSHRKFSMAGHSNSITYDQDGLHGFV 1500
            NGYH+ +AS+EKPPMFAFCLKPRGLEV NKGSKQRSHRKFS++GHSNSI YD +GLHGF 
Sbjct: 1441 NGYHEKAASVEKPPMFAFCLKPRGLEVFNKGSKQRSHRKFSVSGHSNSIAYDHEGLHGFG 1500

Query: 1501 RRLNGSALGDDRMVYIGHNYEFLEDSPLIHTSSSLFSPRLEGGILSNDGFERNVLPKLHK 1560
            RRLNG +LGDD+M YIGHNYEFLEDSPLIHTSSSLFSPRLEGGILSNDG ERN LPKLHK
Sbjct: 1501 RRLNGFSLGDDKMAYIGHNYEFLEDSPLIHTSSSLFSPRLEGGILSNDGLERNFLPKLHK 1560

Query: 1561 TKSRKYG--ASPYEPMMASSFNQRMVGKRDGLNRWNNGYSEWSSPLRYRFDGSQRQILEQ 1620
            +KSRKYG  ASPY+  MA SFNQRM+GKRDGLNRWNNGYSEWSSP RY FDGSQRQILEQ
Sbjct: 1561 SKSRKYGAWASPYDSGMA-SFNQRMIGKRDGLNRWNNGYSEWSSPRRYPFDGSQRQILEQ 1620

Query: 1621 LEGSDLDEYRFRDVSGAAQEARNVAKFKREKARRLLIRADLAIHKAVVAIMTAEAMKAAS 1656
            LEGSD+DE+R RD SGAAQ ARN+AK KREKARRLL RADLAIHKAVVAIMTAEAMKAAS
Sbjct: 1621 LEGSDVDEFRLRDASGAAQHARNMAKLKREKARRLLYRADLAIHKAVVAIMTAEAMKAAS 1680

BLAST of Cp4.1LG02g04400 vs. NCBI nr
Match: gi|778687072|ref|XP_011652501.1| (PREDICTED: uncharacterized protein LOC101216141 [Cucumis sativus])

HSP 1 Score: 2591.6 bits (6716), Expect = 0.0e+00
Identity = 1355/1688 (80.27%), Postives = 1454/1688 (86.14%), Query Frame = 1

Query: 1    MKIGGFWIGSFRLGKSMENSLEYSHGTDTPKKSRSLDLKSLYESKVSKEVQNESLKRKVR 60
            MKIGGFWIGSFRLGKSMENSLE SHGTD PKKSRSLDLKSLYESKVSKEVQN+ LKRK R
Sbjct: 1    MKIGGFWIGSFRLGKSMENSLENSHGTDIPKKSRSLDLKSLYESKVSKEVQNKRLKRKGR 60

Query: 61   AENGDEQRNERRNRKKVSLSNFSSIYSRSRKSLHEVYDDELGSSGHDSKKALKSESKEKL 120
            AE+GD Q+NERRNRKKVSLSNFSSIYSRSRKSL EVYD  LGSSGHDSKKALKSESK+KL
Sbjct: 61   AEDGDVQKNERRNRKKVSLSNFSSIYSRSRKSLDEVYDAGLGSSGHDSKKALKSESKDKL 120

Query: 121  NSSSECNKVSLILNDDVMQIPKRKRGGFVRRKKILGGQILKPSGQLDGKAA-------IV 180
            NSSSE N+V LIL+++VM IPKRKRGGFVRRKK   GQILKPSGQLD KA         V
Sbjct: 121  NSSSEFNEVPLILDENVMHIPKRKRGGFVRRKKSHDGQILKPSGQLDAKAGSLDDKAGTV 180

Query: 181  DQIAKSSAKDPSDLVECCKTNRKPGFKSLKEKEQSELSSTQHPKRGDGHADPLVRENQSS 240
            DQIAKSS KD SD VECCKTNRK  FK LKEKE  EL    H K+ DG AD L REN+ +
Sbjct: 181  DQIAKSSVKDSSDQVECCKTNRKLAFKDLKEKEPKELRL--HLKKEDGQADQLTRENELN 240

Query: 241  STLHLKEEGEHIDHSVVKPVSLSFKKSQKNFGRRKISASGRKRNSKEGEASISHSTKRRD 300
                LKEEGEHIDHSVVKPVS S KKS+KN  +RKISASG K NSKEGEASIS STKRRD
Sbjct: 241  PASRLKEEGEHIDHSVVKPVSPSSKKSKKNVRKRKISASGSKSNSKEGEASISQSTKRRD 300

Query: 301  GCLEDDEENLEENAARMLSSRFDPTCTGFSSNVMGSLPPANGF----------VSHGLKP 360
            G  EDDEENLEENAARMLSSRFDP CTGFSSN  GSLPP NG           VS GLKP
Sbjct: 301  GFPEDDEENLEENAARMLSSRFDPNCTGFSSNTKGSLPPTNGLSFLLSSGHDNVSRGLKP 360

Query: 361  LADLESASVDSAGRVLRPRKQRKEKKSSRKRRHFYEILLADLDAVWILNRRIKVFWPLDQ 420
               LESASVD+AGRVLRPRKQRKEKKSSRKRRHFY+IL  D+DA W+LNRRIKVFWPLDQ
Sbjct: 361  --GLESASVDAAGRVLRPRKQRKEKKSSRKRRHFYDILFGDIDAAWVLNRRIKVFWPLDQ 420

Query: 421  IWYYGLVNDYDKERKLHHVKYDDRDEEWIDLQNERFKLLLLPSEVPGREDRRKSVMGNNP 480
            IWYYGLVNDYDKERKLHHVKYDDRDEEWIDLQNERFKLLLLPSEVPGRE+RRKS +GN+P
Sbjct: 421  IWYYGLVNDYDKERKLHHVKYDDRDEEWIDLQNERFKLLLLPSEVPGREERRKSAVGNDP 480

Query: 481  ANVKRESRSRKGKETNAP--KDECNTGSFMDSEPIISWLARSTQRNKSCPSHSSKRQKNS 540
            AN K  S SRKGKET+A   +D+CN GS+MDSEPIISWLARST RNKS PSH+SKRQK S
Sbjct: 481  ANEKGRSGSRKGKETDAVILEDDCNIGSYMDSEPIISWLARSTHRNKSSPSHNSKRQKTS 540

Query: 541  SLFLKSGSQAIERP------HSGLPERLGDMDGLEKSTSEITTCSKTCKLPIVYFRKRFR 600
            SL  KSGSQA E+P       SG+PERL D+DG EKS SE TTCS T KLPIVYFRKRFR
Sbjct: 541  SLSSKSGSQANEKPANLLVKSSGMPERLADVDGPEKSASETTTCSTTRKLPIVYFRKRFR 600

Query: 601  NIGTEMSHKHETSYASRRTHSLASF-FSNVGEIDDVEESDISPRRSEALRLLWCVDDDGL 660
            NIGTEM HK ET +ASRR+H+  SF FSN   IDDVEE DISPRRSEA RLLWCVDD GL
Sbjct: 601  NIGTEMPHKRETDFASRRSHASLSFSFSN---IDDVEEPDISPRRSEAHRLLWCVDDAGL 660

Query: 661  LQLDIPAMEVGQLRFELTIPEYSFLNMTSSAETFWLFHLSMLIQHGALTLTWPKVQLEMF 720
            LQL IP MEVGQ RFEL IP+YSFLN+TSSA+TFWLFHL+MLIQHG LTL WPKVQLEM 
Sbjct: 661  LQLAIPLMEVGQFRFELNIPQYSFLNVTSSADTFWLFHLAMLIQHGTLTLLWPKVQLEML 720

Query: 721  CVDNVVGLRFLLFEGCLMQAVAFIFLVMKMFRQPSKQVRYADFQVPMTSIRFKFSCPPDI 780
             VDNVVGLRFLLFEGCLMQAVAFIFLV+KMF+ P KQ RYADFQ P+TSIRFKFSC  DI
Sbjct: 721  FVDNVVGLRFLLFEGCLMQAVAFIFLVLKMFQSPGKQGRYADFQFPVTSIRFKFSCLQDI 780

Query: 781  GKQLVFAFYNFSETKNSKWLHLDCRLKKYCLLTKQLPLTECTYDNIKRFQNCTSQFHTSP 840
            GKQLVFAF+NFSE K SKW+HLD RLKKYCL++KQLPLTECTYDNIK+ QN  +QF  SP
Sbjct: 781  GKQLVFAFHNFSEIKYSKWVHLD-RLKKYCLISKQLPLTECTYDNIKKLQNSKTQFRASP 840

Query: 841  FCGECSSIKGTQKIGSLGINHKGDAGENNGHSNLCSNETNKKFPAFALSFTAAPSFMLSL 900
            FCG  SS+KGTQKI SLGIN KG A  N+GHSNLCSNET + FPAFALSFTAAP+F LSL
Sbjct: 841  FCGRSSSVKGTQKISSLGINLKGAACVNSGHSNLCSNETKRNFPAFALSFTAAPTFFLSL 900

Query: 901  HLKLLMEQCVAHLSSLHQDSGKRAENFGRLTVDNVCMNDCANNLSTSSKALGRWNLCARS 960
            HLKLLME+CVAHLS  H DS +  EN+GRLTVD+V  +DCAN+LSTSSKA  RWN C +S
Sbjct: 901  HLKLLMERCVAHLSLQHHDSIEHPENYGRLTVDDVLTDDCANSLSTSSKASDRWNSCPQS 960

Query: 961  DLGTGLSDCEEGG---SSRYKRSRLVAETCAGSHDSDKARNDVKKRMRSSGNDKSEKAMA 1020
            DLGTGLSDCE+G    SS+YK S  VA TCAGS D+DKARN +K+R+R  G +KS K  A
Sbjct: 961  DLGTGLSDCEDGDGVQSSQYK-STPVATTCAGSQDTDKARNGIKRRIRPLGKNKSGKTTA 1020

Query: 1021 LPNVARSDNGSDSFLNDLSVEIPSFQPVDGELHNAQLSMDVAWNVNSGIIRSPNPTAPRS 1080
            LPNVARSDN  +SFLNDLSVEIPSFQPVDGELH  Q SMDV WN ++ +I SPNPTAPRS
Sbjct: 1021 LPNVARSDN--NSFLNDLSVEIPSFQPVDGELHGPQQSMDVGWNASAVVIPSPNPTAPRS 1080

Query: 1081 TWHRNKNNSSSFGLASHGWSDGKDFL-NGLGNRTKKPRTQVSYMLPFGGLDYGSKNRNSH 1140
            TWHRNKNNS+S GLASHGWSDG   L NGLGNRTKKPRTQVSY LPFGG DY SK+RNSH
Sbjct: 1081 TWHRNKNNSTSLGLASHGWSDGNSLLINGLGNRTKKPRTQVSYSLPFGGFDYSSKSRNSH 1140

Query: 1141 PKATPYKRIRRASEKRSDAARGSQRNIELLSCDANVLITTGDRGWRECGARVVLEVFDHN 1200
            PKA+PYKRIRRASEKRSD ARGS+RN+ELLSCDANVLIT GDRGWRECGA+VVLEVFDHN
Sbjct: 1141 PKASPYKRIRRASEKRSDVARGSKRNLELLSCDANVLITLGDRGWRECGAKVVLEVFDHN 1200

Query: 1201 EWKLAVKLSGITKYSYKAHQFLQPGSTNRYTHAMMWKGGKDWILEFPDRSQWTIFKELHE 1260
            EWKLAVKLSGITKYSYKAHQFLQPGSTNRYTHAMMWKGGKDWILEFPDRSQW IFKELHE
Sbjct: 1201 EWKLAVKLSGITKYSYKAHQFLQPGSTNRYTHAMMWKGGKDWILEFPDRSQWAIFKELHE 1260

Query: 1261 ECYNRNIRSASVKNIPIPGVRLIEENDEHVAETAFMRNPSKYFRQVETDVEMALNPNRVL 1320
            ECYNRNIR+ASVKNIPIPGV L+EENDE+ AE+AFMRNPSKYFRQVETDVEMALNP R+L
Sbjct: 1261 ECYNRNIRAASVKNIPIPGVCLLEENDEYEAESAFMRNPSKYFRQVETDVEMALNPTRIL 1320

Query: 1321 YDMDSDDEQWIKDVQTSSEVGSSSSLGEASSEVFEKTMDAFEKAAYSQQRDEFTDDEIAE 1380
            YDMDSDDEQWIKD+  SSEVGSSS LGE SSEVFEKT+DAFEKAAYSQQRDEFTDDEIAE
Sbjct: 1321 YDMDSDDEQWIKDILPSSEVGSSSGLGEVSSEVFEKTVDAFEKAAYSQQRDEFTDDEIAE 1380

Query: 1381 AVNETLVSGLTKGIFEYWQLKRRQKGMPLLRHLQPPLWETYRQQLKDWESTVNKNNTNSC 1440
             +NETL S LTK IFEYWQ KRR+KGMPL+RHLQPPLWETY+QQLKDWE T+NK+NT+ C
Sbjct: 1381 VMNETLASDLTKAIFEYWQQKRRRKGMPLIRHLQPPLWETYQQQLKDWECTINKSNTSFC 1440

Query: 1441 NGYHD-SASIEKPPMFAFCLKPRGLEVSNKGSKQRSHRKFSMAGHSNSITYDQDGLHGFV 1500
            NGYH+ +AS+EKPPMFAFCLKPRGLEV NKGSKQRSHRKFS++GHSNSI YD DGLHGF 
Sbjct: 1441 NGYHEKAASVEKPPMFAFCLKPRGLEVFNKGSKQRSHRKFSVSGHSNSIAYDNDGLHGFG 1500

Query: 1501 RRLNGSALGDDRMVYIGHNYEFLEDSPLIHTSSSLFSPRLEGGILSNDGFERNVLPKLHK 1560
            RRLNG +LGDD+M YIGHNYEFLEDSPLIHTSSSLFSPRLEGGILSNDG ERN LPKLHK
Sbjct: 1501 RRLNGFSLGDDKMAYIGHNYEFLEDSPLIHTSSSLFSPRLEGGILSNDGLERNFLPKLHK 1560

Query: 1561 TKSRKYG--ASPYEPMMASSFNQRMVGKRDGLNRWNNGYSEWSSPLRYRFDGSQRQILEQ 1620
            +KSRKYG  AS Y+  MA SFNQRM+GKRDGLNRWNNGYSEWSSP RY FDGSQRQILEQ
Sbjct: 1561 SKSRKYGAWASTYDSGMA-SFNQRMIGKRDGLNRWNNGYSEWSSPRRYPFDGSQRQILEQ 1620

Query: 1621 LEGSDLDEYRFRDVSGAAQEARNVAKFKREKARRLLIRADLAIHKAVVAIMTAEAMKAAS 1656
            LEGSD+DE+R RD SGAAQ ARN+AK KREKARRLL RADLAIHKAVVAIMTAEAMKAAS
Sbjct: 1621 LEGSDVDEFRLRDASGAAQHARNMAKLKREKARRLLYRADLAIHKAVVAIMTAEAMKAAS 1676

BLAST of Cp4.1LG02g04400 vs. NCBI nr
Match: gi|596285582|ref|XP_007225478.1| (hypothetical protein PRUPE_ppa000151mg [Prunus persica])

HSP 1 Score: 1588.2 bits (4111), Expect = 0.0e+00
Identity = 921/1686 (54.63%), Postives = 1136/1686 (67.38%), Query Frame = 1

Query: 17   MENSLEYSHGTDTPKKSRSLDLKSLYESKVSKEVQNESLKRKVRAENGDEQRNERR-NRK 76
            MEN +E SHGT+ P+KSRSLDLKSLY+S+ +KEV  +SLKRK  AE+GDE R++++ +RK
Sbjct: 1    MENRIENSHGTEIPRKSRSLDLKSLYKSRTTKEVPTKSLKRKGSAEDGDENRDKKKKSRK 60

Query: 77   KVSLSNFSSIYSRSRKSLHEVYDDELGSSGHDSKKALKSESKEKLNSSSECNKVS-LILN 136
            +VSLS+  ++ + S+KSL EVY   L S  HD + A+K  S + L+S S  N VS L L 
Sbjct: 61   EVSLSSLKNVNTSSKKSLDEVYHSGLNSGSHDPE-AVKCGSSQILDSGSGFNGVSSLSLG 120

Query: 137  DDVMQIPKRKRGGFVRRKKILGGQILKPSGQLDGKAAIVDQ---IAKSSAKDPSDLVECC 196
            ++V+QIP+RKRG FV RKK  GGQ+LK   Q  GK  +VDQ   IAK +  D     E  
Sbjct: 121  NNVIQIPRRKRG-FVGRKKFEGGQVLKLPDQSAGKVGLVDQNHQIAKLNVDDLGTQDELL 180

Query: 197  KTNRKPGFKSLKEKEQSELSSTQHPKRGDGHADPLVRENQSSSTLHLKEEGEHIDHSVVK 256
               RK G    KE   SEL+S  H  +                      EG H  HSVV 
Sbjct: 181  NVKRKKGRDDFKENIDSELNSAPHADK----------------------EGVHTSHSVVS 240

Query: 257  PVSLSFKKSQKNFG---------RRKISASGRKRNSKEGEASISHSTKRRDGCLEDDEEN 316
                S KKS++N           +RK  A G K  +KE +  +  STK      EDDEEN
Sbjct: 241  NGDSSLKKSRRNQDNEENRRSRRKRKDLACGSKSAAKEADPLVDSSTKSCHDLQEDDEEN 300

Query: 317  LEENAARMLSSRFDPTCTGFSSNVMGS-LPPANG----------FVSHGLKPLADLESAS 376
            LEENAARMLSSRFDP+CTGFSSN   S L  ANG          F S   K ++  ES S
Sbjct: 301  LEENAARMLSSRFDPSCTGFSSNNKASALESANGLSFLLSSGQDFDSRRSKSISGSESPS 360

Query: 377  VDSAGRVLRPRKQRKEKKSSRKRRHFYEILLADLDAVWILNRRIKVFWPLDQIWYYGLVN 436
            VD++GRVLRPRKQ KEK  SRKRRHFYE+ L +LDA W+ NRRIKVFWPLDQ WYYGLVN
Sbjct: 361  VDNSGRVLRPRKQHKEKGHSRKRRHFYEVFLGNLDAYWVTNRRIKVFWPLDQTWYYGLVN 420

Query: 437  DYDKERKLHHVKYDDRDEEWIDLQNERFKLLLLPSEVPGREDRRKSVMGNNPANVKRES- 496
            DYDKE+KLHHVKYDDRDEEWIDLQNERFKLLLLPSEVPG+ +R+KS   N  ++V+R+  
Sbjct: 421  DYDKEKKLHHVKYDDRDEEWIDLQNERFKLLLLPSEVPGKIERKKSTQRNR-SSVERKGN 480

Query: 497  ----RSRKGKETNAPKDECNTGSFMDSEPIISWLARSTQRNKSCPSHSSKRQKNSSLFLK 556
                + +K +E  +  D C  GS+MD+EPIISWLARS +R KS PS + K+QK S L LK
Sbjct: 481  LKPRKEKKKRELTSEDDSC-MGSYMDTEPIISWLARSNRRVKS-PSCAVKKQKTSGLSLK 540

Query: 557  SG-------SQAIERPHSGLPERLGDMDGLEKSTSEITTCSKTCKLPIVYFRKRFRNIGT 616
                        I   H+    R  D+   EK TS+ +TC +  K+PIVYFR+R R  G+
Sbjct: 541  PPLSDEDVIRDKIRTSHNS--GRSSDVLRQEKPTSQGSTCPRDSKMPIVYFRRR-RKTGS 600

Query: 617  EMSHKHETSYASRRTHSLASFFSNVGEIDDVEESDISPRRSEALRLLWCVDDDGLLQLDI 676
             +SH  + ++A        + F  V EI D+EE     RR +A   LW +DD GLL+L +
Sbjct: 601  VLSHTSKGNHAYVSELGSITSFVPVKEIGDLEEPYDFVRRLDANGPLWYIDDAGLLKLTL 660

Query: 677  PAMEVGQLRFELTIPEYSFLNMTSSAETFWLFHLSMLIQHGALTLTWPKVQLEMFCVDNV 736
            P  E G++ FEL +P +S +N +   E F LFH +ML ++G + +TWPKV LEM  VDNV
Sbjct: 661  PRTEAGKVTFELGVPMHSTINDSFGVE-FSLFHAAMLHRYGTVVITWPKVYLEMLFVDNV 720

Query: 737  VGLRFLLFEGCLMQAVAFIFLVMKMFRQPSKQVRYADFQVPMTSIRFKFSCPPDIGKQLV 796
            VGLRFLLFEGCL QAVAF+FLV+ +F  P +Q ++ DFQ+P+TSIRFKFSC   + KQLV
Sbjct: 721  VGLRFLLFEGCLEQAVAFVFLVLALFHHPIEQGKFLDFQLPVTSIRFKFSCVQLLRKQLV 780

Query: 797  FAFYNFSETKNSKWLHLDCRLKKYCLLTKQLPLTECTYDNIKRFQNCTSQFHTSPFCGEC 856
            FA YNFS+ K SKW +LD +++ +CLLTK+LPL+ECTYD+I+  QN T+Q      CG  
Sbjct: 781  FAVYNFSQVKKSKWKYLDSKVRSHCLLTKKLPLSECTYDSIQALQNGTNQSPFMSLCGRP 840

Query: 857  SSIKGTQKIGSLGINHKGDAGE----NNGHSNLCSNETNKKFPAFALSFTAAPSFMLSLH 916
            SS+KGT++    GIN  G + E    N  HS   S+E  +K P  ALSFTAAP+F LSLH
Sbjct: 841  SSVKGTRRRSRQGINFMGGSRESTFVNISHSTSHSDELPRKLPPLALSFTAAPTFFLSLH 900

Query: 917  LKLLMEQCVAHLSSLHQDSGKRAENFG-RLTVDNVCMNDCANNLSTSSKALGRWNLCARS 976
            LKLLME CVA++     DS +   N G  L VD   + D  N     SK     NL  ++
Sbjct: 901  LKLLMEHCVANICFRDPDSVELLGNSGSMLAVDCSSVEDFFNR---GSKITHENNL--KA 960

Query: 977  DLGTGLSDCEEGGSSRYKRSRLVAETCAGSHDSDKARNDVKKRMRSSGNDKSEKAMALPN 1036
              G   SD                      H   K   +                +AL N
Sbjct: 961  SPGNATSD----------------------HSFSKPETETA--------------LALCN 1020

Query: 1037 VARSDNGSDSFLNDLSVEIPSF----QPVDGELHNAQLSMDVAWNVNSGIIRSPNPTAPR 1096
              +SD  S SFLN L+VEIPSF    +PVDGE+ +AQ   D +WN++  II SPNPTAPR
Sbjct: 1021 GEKSDTDSQSFLNGLTVEIPSFDRFEKPVDGEVQSAQQPTDCSWNMSGSIIPSPNPTAPR 1080

Query: 1097 STWHRNKNNSSSFGLASHGWSDGKD--FLNGLGNRTKKPRTQVSYMLPFGGLDYGSKNRN 1156
            STWHR++N+SSSFG  SHGWSDGK   F NG GN  KKPRTQVSY LP+GG D+ SK RN
Sbjct: 1081 STWHRSRNSSSSFGSLSHGWSDGKADLFHNGFGNGPKKPRTQVSYTLPYGGFDFSSKQRN 1140

Query: 1157 SHPKATPYKRIRRASEKR-SDAARGSQRNIELLSCDANVLITTGDRGWRECGARVVLEVF 1216
               K  P KRIRRA+EKR SD +RGSQRN+E LSC+ANVLI   DRGWRECGA +VLE+F
Sbjct: 1141 LQ-KGIPPKRIRRANEKRLSDVSRGSQRNLEQLSCEANVLINGSDRGWRECGAHIVLELF 1200

Query: 1217 DHNEWKLAVKLSGITKYSYKAHQFLQPGSTNRYTHAMMWKGGKDWILEFPDRSQWTIFKE 1276
            DHNEWKLAVK+SG TKYSYKAHQFLQPGSTNRYTHAMMWKGGKDWILEFPDRSQW +F+E
Sbjct: 1201 DHNEWKLAVKISGTTKYSYKAHQFLQPGSTNRYTHAMMWKGGKDWILEFPDRSQWALFRE 1260

Query: 1277 LHEECYNRNIRSASVKNIPIPGVRLIEENDEHVAETAFMRNPSKYFRQVETDVEMALNPN 1336
            +HEECYNRNIRSA VKNIPIPGVRLIEE+D++ AE +F+R+ +KYFRQ ETDVEMAL+P+
Sbjct: 1261 MHEECYNRNIRSALVKNIPIPGVRLIEESDDNGAEISFLRSSTKYFRQTETDVEMALDPS 1320

Query: 1337 RVLYDMDSDDEQWIKDVQTSSEVGSSSSLGEASSEVFEKTMDAFEKAAYSQQRDEFTDDE 1396
            RVLYDMDSDDEQWI   Q SSEV +SSS+ E   E+FEKTMD FEKAAY+QQ D+FT +E
Sbjct: 1321 RVLYDMDSDDEQWIMKFQNSSEVDNSSSI-EIDEEMFEKTMDMFEKAAYAQQCDQFTYEE 1380

Query: 1397 IAEAVNETLVSGLTKGIFEYWQLKRRQKGMPLLRHLQPPLWETYRQQLKDWESTVNKNNT 1456
            I E V       + K I+E+W+ KR +KGMPL+RHLQP  WE Y+QQ+++WE  + K NT
Sbjct: 1381 IEEFVAVVGPMDVIKTIYEHWRGKRLRKGMPLIRHLQPSAWERYQQQVREWEQAMIKTNT 1440

Query: 1457 NSCNGYHD-SASIEKPPMFAFCLKPRGLEVSNKGSKQRSHRKFSMAGHSNSITYDQDGLH 1516
               NG H+ +AS+EKPPMFAFCLKPRGLEV NKGSKQRS ++FS++GHS+ +  DQDG H
Sbjct: 1441 ILPNGCHEKAASVEKPPMFAFCLKPRGLEVPNKGSKQRSQKRFSVSGHSSGMLGDQDGFH 1500

Query: 1517 GFVRRLNGSALGDDRMVYIGHNYEFLEDSPLIHTSSSLFSPRLEGGIL-SNDGFERNVLP 1576
               RR NG A GD+++VY GHNY+ L+DSPL  TS  +FSPR    IL SNDGFERN L 
Sbjct: 1501 AIGRRSNGFAFGDEKVVYPGHNYDSLDDSPLSQTSPRVFSPRDATNILISNDGFERNHLH 1560

Query: 1577 KLHKTKSRKYG--ASPYEPMMASSFNQRMVGKRDGLNRWNNGYSEWSSPLRYRFDGSQRQ 1636
            ++H++KS+K+G   SP EP M S ++ R+VG R+G+ RWN G+ +WSS   Y+ DG QR 
Sbjct: 1561 RIHRSKSKKFGRTVSPVEPQMVSPYSHRVVGNRNGVQRWNTGFPDWSSQRYYQTDGPQRH 1612

Query: 1637 ILEQLEGSDLDEYRFRDVSGAAQEARNVAKFKREKARRLLIRADLAIHKAVVAIMTAEAM 1650
             +  L+G DLDE+R RD SGAAQ A NVA+ KREKA++L  RADLAIHKAVV++MTAEA+
Sbjct: 1621 DMGLLDGPDLDEFRLRDASGAAQHAHNVARLKREKAQKLFYRADLAIHKAVVSLMTAEAI 1612

BLAST of Cp4.1LG02g04400 vs. NCBI nr
Match: gi|1009137360|ref|XP_015886011.1| (PREDICTED: uncharacterized protein LOC107421307 [Ziziphus jujuba])

HSP 1 Score: 1580.1 bits (4090), Expect = 0.0e+00
Identity = 919/1721 (53.40%), Postives = 1138/1721 (66.12%), Query Frame = 1

Query: 17   MENSLEYSHGTDTPKKSRSLDLKSLYESKVSKEV-QNESLKRKVRAENGDEQRNERRNR- 76
            MEN +E SHGT+ P+KSRSLDLKSLY+ KV KE  +N++LKRK  A++ ++ R+ ++ + 
Sbjct: 1    MENRIENSHGTEIPRKSRSLDLKSLYKPKVVKEATRNKNLKRKTSADDVNDSRDRKKKKS 60

Query: 77   -KKVSLSNFSSIYSRSRKSLHEVYDDELGSSGHDSKKALKSESKEKLNSSSECNKVS-LI 136
             K+VSLS+  ++ S S+KSL +V+D  L S+ +DSK  LK E  +K  SS   N VS L 
Sbjct: 61   IKEVSLSSLKNVNSNSKKSLDKVHDSTLSSTLYDSKD-LKLEGNQKSKSSIGFNSVSSLS 120

Query: 137  LNDDVMQIPKRKRGGFVRRKKILGGQILKPSGQLDGKAAIVDQIAKSSAKDPSDLVECCK 196
            L+++V+QIPKRKRG  V RKK  G  ++K  GQ D K  +VDQ +K S  +     E   
Sbjct: 121  LDNNVIQIPKRKRG-LVGRKKFKGEDVVKQQGQSDRKINLVDQTSKLSGDESESRTEPRN 180

Query: 197  TNRKPGFKSLKEKEQSELSSTQHPKRGDGHADPLVRENQSSSTLHLKEEGEHIDHSVVKP 256
               K  +   KE                       R N+S+S     EE     H  V  
Sbjct: 181  VECKKDYDDFKEN----------------------RNNESNSARQSAEEDSRTGHLAVSN 240

Query: 257  VSLSFKKSQKNFGRRKISASGRKRNSKEGEASISHSTKRRDGCLEDDEENLEENAARMLS 316
               S KKSQ+   +RK  A   K  +K  E  + +S+K  +   EDDEENLEENAARMLS
Sbjct: 241  CDSSKKKSQRKRSKRKDLAPNSKSAAKVAEPLVDNSSKIGNVSHEDDEENLEENAARMLS 300

Query: 317  SRFDPTCTGFSSNVMGS-----------LPPANGFVSHGLKPLADLESASVDSAGRVLRP 376
            SRFDP+CTGF SN  G+           L     +VSHG K  +  ES SVD+A R+LRP
Sbjct: 301  SRFDPSCTGFPSNAKGTALQSVTGLSFLLSTGEDYVSHGSKSFSGSESPSVDTAARILRP 360

Query: 377  RKQRKEKKSSRKRRHFYEILLADLDAVWILNRRIKVFWPLDQIWYYGLVNDYDKERKLHH 436
            RKQ K K  SRKRRHFYE+   DLDA W+LNRRIKVFWPLDQ WYYGLVNDYDKERKLHH
Sbjct: 361  RKQHKAKGHSRKRRHFYEVFFGDLDAHWVLNRRIKVFWPLDQSWYYGLVNDYDKERKLHH 420

Query: 437  VKYDDRDEEWIDLQNERFKLLLLPSEVPGREDRRK-SVMGNNPANVKRESRSRKGKETN- 496
            VKYDDRDEEWIDLQNERFKLLL PSEVPG+ +RRK  V  ++P  +K   + RK KE   
Sbjct: 421  VKYDDRDEEWIDLQNERFKLLLFPSEVPGKVERRKPKVQKSSPDEIKGSLKPRKEKEKRD 480

Query: 497  -APKDECNTGSFMDSEPIISWLARSTQRNKSCPSHSSKRQKNSSLFLKSGSQ-----AIE 556
               +D+   GS+MDSEPIISWLARST+R KS PS + K+QK SSL  KS  Q     A+ 
Sbjct: 481  LTTEDDGCMGSYMDSEPIISWLARSTRRVKS-PSRAVKKQKTSSLSFKSVPQILPDEAVN 540

Query: 557  RPHSGL-------------PERLGDMDGLEKSTSEITTCSKTCKLPIVYFRKRFRNIGTE 616
              H  L             P++L D+   +KS  E T+CSK  K PIVYFR+RFR    E
Sbjct: 541  VLHGSLRKGRSEVNRNCEFPDKLIDVTKQQKSALESTSCSKDSKSPIVYFRRRFRKTCLE 600

Query: 617  MSHKHETSYASRRTHSLASFFSNVGEID---DVEESDISPRRSEALRLLWCVDDDGLLQL 676
            +SH  E ++ SR  H L S  S   E D   ++E+ D+     E  R LW  DD GLL+L
Sbjct: 601  LSHTSEGNHGSR--HPLGSVISYAPEGDGSGELEKPDVLSEILEPSRTLWYTDDVGLLKL 660

Query: 677  DIPAMEVGQLRFELTIPEYSFLNMTSSAETFWLFHLSMLIQHGALTLTWPKVQLEMFCVD 736
              P +E G+ +  L  P  SFLN +  AE  WLF    L  HGAL +TWP+V LEM  VD
Sbjct: 661  TSPLIESGKFKVNLRYPVRSFLNDSFGAENLWLFRAFWLFHHGALMITWPRVHLEMLFVD 720

Query: 737  NVVGLRFLLFEGCLMQAVAFIFLVMKMFRQPSKQVRYADFQVPMTSIRFKFSCPPDIGKQ 796
            NVVGLRFLLFEGCL QA+AF+FLV+ +F+QP++Q RY + Q+P TSIRFK +C   + KQ
Sbjct: 721  NVVGLRFLLFEGCLEQALAFVFLVLSIFQQPNEQRRYVELQLPATSIRFKLTCSQHLRKQ 780

Query: 797  LVFAFYNFSETKNSKWLHLDCRLKKYCLLTKQLPLTECTYDNIKRFQNCTSQFHTSPFCG 856
            LVFAFYN+S  KNSKW++LD +LK++CLLTKQLPL+ECTYDNI+  QN +     + FCG
Sbjct: 781  LVFAFYNYSGIKNSKWIYLDYKLKRHCLLTKQLPLSECTYDNIQTLQNGSKHSSVTTFCG 840

Query: 857  ECSSIKGTQKIGSLGINHKGDAGE----NNGHSNLCSNETNKKFPAFALSFTAAPSFMLS 916
            + S IKG QK    G+   G + E    N  HS+   +E  +K P  ALSFTAAP+F LS
Sbjct: 841  QLSPIKGLQKRTRQGVKFMGSSKESAFVNISHSSSRFDEMYRKLPPLALSFTAAPTFFLS 900

Query: 917  LHLKLLMEQCVAHLSSLHQDSGKRAENFGRLTVDNV-CMNDCANNLS------------- 976
            LHLKLLME C+A +S    DS +  EN G L VD    +  C+NN S             
Sbjct: 901  LHLKLLMEHCLARISFQDHDSVENPENSGCLMVDGCSSVEKCSNNGSGIILEENLKGSVC 960

Query: 977  -TSSKALGRWNLCARSDLGTGLSDCEEGG----SSRYKRSRL-VAETCAGSHDSDKARND 1036
               + A   W  C++ DL    S     G    S  Y+   L V+E  AGS   +    D
Sbjct: 961  DADAAASDGWFSCSKPDLEADFSVSSNRGCIKSSEHYQNGNLHVSERSAGSIFPETTGTD 1020

Query: 1037 V--------KKRMRSSGNDKSEKAMALPNVARSDNGSDSFLNDLSVEIPSF----QPVDG 1096
                        + S   D S K   L +  +S+  S SFLN LS+EIP      + VDG
Sbjct: 1021 AIVQLQAWQSNHLESDQFDLSRK--PLIDGDKSNTASSSFLNGLSIEIPPCNQFEKSVDG 1080

Query: 1097 ELHNAQLSMDVAWNVNSGIIRSPNPTAPRSTWHRNKNNSSSFGLASHGWSDGK--DFLNG 1156
            ELH+ Q   D + N N GI+ SPNPTAPRSTWHRN++++SSFG  SHGWSDGK   F NG
Sbjct: 1081 ELHSIQQPTDFSCN-NGGIVPSPNPTAPRSTWHRNRSSTSSFGYLSHGWSDGKADTFQNG 1140

Query: 1157 LGNRTKKPRTQVSYMLPFGGLDYGSKNRNSHPKATPYKRIRRASEKR-SDAARGSQRNIE 1216
              N  KKPRTQVSY LPFGG D  SK + S  K    KRIRRA+EKR SD  RG  RN+E
Sbjct: 1141 FSNGPKKPRTQVSYTLPFGGYDLNSKQK-SIQKGIANKRIRRANEKRLSDVCRGPPRNLE 1200

Query: 1217 LLSCDANVLITTGDRGWRECGARVVLEVFDHNEWKLAVKLSGITKYSYKAHQFLQPGSTN 1276
            LLSCDANVLIT GDRGWRECGA+V+LE+F+ NEWKL+VK+SGITKYSYKAHQFLQPGSTN
Sbjct: 1201 LLSCDANVLITAGDRGWRECGAQVLLELFERNEWKLSVKISGITKYSYKAHQFLQPGSTN 1260

Query: 1277 RYTHAMMWKGGKDWILEFPDRSQWTIFKELHEECYNRNIRSASVKNIPIPGVRLIEENDE 1336
            R+THAMMWKGGKDWILEFPDRSQW +FKE+HEECYNRNIR+ASVKNIPIPGVRLIEE+D+
Sbjct: 1261 RFTHAMMWKGGKDWILEFPDRSQWALFKEMHEECYNRNIRAASVKNIPIPGVRLIEESDD 1320

Query: 1337 HVAETAFMRNPSKYFRQVETDVEMALNPNRVLYDMDSDDEQWIKDVQTSSEVGSSSSLGE 1396
            +  E AF+R+ SKYFR V TDVEMAL+P RVLYDMDSDDEQ+I +++ SSE   +  LG+
Sbjct: 1321 NGTEIAFIRSSSKYFRHVGTDVEMALDPCRVLYDMDSDDEQFILNIENSSEC-MNGCLGK 1380

Query: 1397 ASSEVFEKTMDAFEKAAYSQQRDEFTDDEIAEAVNETLVSGLTKGIFEYWQLKRRQKGMP 1456
             S E+FEKTMD FEKAA+++Q D+FT DEI E V       + K I+E+W  KR++ GMP
Sbjct: 1381 ISEEIFEKTMDMFEKAAFARQVDQFTSDEIEELVAGVGPMNVIKAIYEHWLQKRQKNGMP 1440

Query: 1457 LLRHLQPPLWETYRQQLKDWESTVNKNNTNSCNGYHD-SASIEKPPMFAFCLKPRGLEVS 1516
            L+RHLQPP WE Y+QQ+K+WE  ++K N    NG  + +A IEKPPMFAFCLKPRGLEV 
Sbjct: 1441 LIRHLQPPSWERYQQQVKEWELAMSKINATIPNGCQERAAPIEKPPMFAFCLKPRGLEVP 1500

Query: 1517 NKGSKQRSHRKFSMAGHSNSITYDQDGLHGFVRRLNGSALGDDRMVYIGHNYEFLEDSPL 1576
            NKGSKQRS RK S+ GH+N+   +QDG H F RRLNG A GD+++VY GH+YE+L+DSPL
Sbjct: 1501 NKGSKQRSQRKLSVTGHNNASFVEQDGYHSFGRRLNGFAFGDEKVVYPGHSYEYLDDSPL 1560

Query: 1577 IHTSSSLFSPRLEGGILS-NDGFERNVLPKLHKTKSRKYGA--SPYEPMMASSFNQRMVG 1636
              TS  +FSPR  GG+L  +DG +RN L KL ++KS+K GA  S  +  M + ++ RMVG
Sbjct: 1561 AQTSPRVFSPRDAGGMLMISDGLDRNPLHKLQRSKSKKQGAIISSNDSQMMAMYSHRMVG 1620

Query: 1637 KRDGLNRWNNGYSEWSSPLRYRFDGSQRQILEQLEGSDLDEYRFRDVSGAAQEARNVAKF 1656
             R+G++RWN G+SEW     Y+ +GSQR ++EQL+ SD+DE++ RD SGAAQ A N+AK 
Sbjct: 1621 HRNGVHRWNMGFSEWPGRRNYQLEGSQRLLVEQLDSSDIDEFKLRDASGAAQHALNMAKL 1680

BLAST of Cp4.1LG02g04400 vs. NCBI nr
Match: gi|703147422|ref|XP_010109047.1| (hypothetical protein L484_007381 [Morus notabilis])

HSP 1 Score: 1519.2 bits (3932), Expect = 0.0e+00
Identity = 890/1720 (51.74%), Postives = 1134/1720 (65.93%), Query Frame = 1

Query: 17   MENSLEYSHGTDTPKKSRSLDLKSLYESKVSKEVQNESLKRKVRAENGDE--QRNERRNR 76
            MEN +E S G + P+KSRSLDLKSLY+ +V+K+VQN+ LKRK  A++GDE  ++ ++++ 
Sbjct: 1    MENRIESSDGAEVPRKSRSLDLKSLYKHRVTKDVQNKKLKRKASADDGDENSEKKKKKSV 60

Query: 77   KKVSLSNFSSIYSRSRKSLHEVYDDELGSSGHDSKKALKSESKEKLNSSSECNKVS-LIL 136
            K+VSLS+  +  S S+K++ +     L S  HDSK  LK E+K+KLN S     +S L L
Sbjct: 61   KEVSLSSLKNTSSSSKKNVDKDCHKGLSSGLHDSKD-LKLEAKQKLNGSIGFKSISSLSL 120

Query: 137  NDDVMQIPKRKRGGFVRRKKILGGQILKPSGQLDGKAAIVDQIAKSSAKDPSDLVECCKT 196
            NDDV+QIP+RKRG FV RKK  GG + +  G   GK  +VDQI+K S  D    VE  K 
Sbjct: 121  NDDVIQIPRRKRG-FVGRKKGEGGHVPRRQGLSCGKLDLVDQISKLSGDDSGSQVESVKV 180

Query: 197  NRKPGFKSLKEKEQSELSSTQHPKRGDGHADPLVRENQSSSTLHLKEEGEHIDHSVVKPV 256
             R  GF   KE   SE                      S+S  H +EE E ++H VV   
Sbjct: 181  KRTKGFDDFKENRISE----------------------SNSARHAEEEHERVNHLVVSNG 240

Query: 257  SLSFKKSQKNFGRRKISASGRKRNSKEGEASISHSTKRRDGCLEDDEENLEENAARMLSS 316
               FKKS++   + K  +   K  +KE E    +ST   +   EDDEENLEENAA MLSS
Sbjct: 241  DSLFKKSRRKRSKTKNLSPDDKVGAKEAEPLADNSTMMCNDSQEDDEENLEENAAMMLSS 300

Query: 317  RFDPTCTGFSSNVMGSLPPANG----------FVSHGLKPLADLESASVDSAGRVLRPRK 376
            RFDP CTGFSSN   +    +G          FVS   + L+  ES SVD+AGRVLRPR 
Sbjct: 301  RFDPNCTGFSSNKASAFATVDGLSFLLSSGRDFVSRRSRSLSGSESPSVDAAGRVLRPRI 360

Query: 377  QRKEKKSSRKRRHFYEILLADLDAVWILNRRIKVFWPLDQIWYYGLVNDYDKERKLHHVK 436
            Q KEK  SRKRRHFYE+   DLDA W+LNRRIKVFWPLDQ WYYGLVNDYD+E+KLHHVK
Sbjct: 361  QHKEKGHSRKRRHFYEVFFGDLDADWVLNRRIKVFWPLDQSWYYGLVNDYDREKKLHHVK 420

Query: 437  YDDRDEEWIDLQNERFKLLLLPSEVPGREDRRKSVMGNNPANVKRESRSRKGKET----- 496
            YDDRDEEWIDLQNERFKLLLLPSEVPG+   R+S + +  ++V+R+S S+  KE      
Sbjct: 421  YDDRDEEWIDLQNERFKLLLLPSEVPGKAACRRSRIRDR-SSVQRKSSSKPKKEKKKGDI 480

Query: 497  NAPKDECNTGSFMDSEPIISWLARSTQRNKSCPSHSSKRQKNSSLFLK------------ 556
            +   D C   ++MDSEPIISWLARS +R KS P H+ K+QK S L +K            
Sbjct: 481  SMQDDSCIGSNYMDSEPIISWLARSRRRVKS-PFHALKKQKPSDLSVKPVLPPFSNNAVN 540

Query: 557  ------SGSQAIERP----HSGLPERLGDMDGLEKSTSEITTCSKTCKLPIVYFRKRFRN 616
                  SG+   ++     +S L  R  +    E+STSE  +C K  K+PIVYFR+RFR 
Sbjct: 541  SNRCFESGTVRRDKRKFSRNSNLSGRFANDAMKEESTSESISCPKDSKMPIVYFRRRFRK 600

Query: 617  IGTEMSHKHETSYASRRT-HSLASFFSNVGEIDDVEESDISPRRSEALRLLWCVDDDGLL 676
             G E+S   E ++A R T   + SF   V +  D  + D+   R +   LLW VDD GLL
Sbjct: 601  TGLELSRGCEDNHACRNTLDPVTSFAPAVDDTRDWVKWDVLLGRLDLGGLLWSVDDAGLL 660

Query: 677  QLDIPAMEVGQLRFELTIPEYSFLNMTSSAETFWLFHLSMLIQHGALTLTWPKVQLEMFC 736
            +L +P +E G+ +F++  P  S L      E  WL H ++L+ +G + + WP+V LEM  
Sbjct: 661  KLMLPGLESGKFKFDVDFPILSGLYDIFGVENLWLSHSAVLLHYGTVMIRWPQVHLEMLF 720

Query: 737  VDNVVGLRFLLFEGCLMQAVAFIFLVMKMFRQPSKQVRYADFQVPMTSIRFKFSCPPDIG 796
            VDNV GLRFLLFEGCL QA+A +FLV++ F QP+++V++ D  +P+TSIRFK +C     
Sbjct: 721  VDNVFGLRFLLFEGCLNQALALVFLVVRTFHQPTERVKFVD--MPVTSIRFKLTCFQHHK 780

Query: 797  KQLVFAFYNFSETKNSKWLHLDCRLKKYCLLTKQLPLTECTYDNIKRFQNCTSQFHTSPF 856
            K L FAF NFS  +NSKW++LD +L+++CL+TKQLPL ECTYDNIK  QN T        
Sbjct: 781  KHLEFAFCNFSTVENSKWIYLDRKLRRHCLVTKQLPLPECTYDNIKMLQNRTVHLPLRSV 840

Query: 857  CGECSSIKGTQKIGSLGINHKGDAGENN----GHSNLCSNETNKKFPAFALSFTAAPSFM 916
            CG+ S IKGT+K    GIN  G + E+     G S+   ++  KK P  ALSFTAAP+F 
Sbjct: 841  CGQPSFIKGTRKRLRQGINFMGISRESAFMDIGRSSHF-DKMYKKLPPLALSFTAAPTFF 900

Query: 917  LSLHLKLLMEQCVAHLSSLHQDSGKRAENFGRLTVDNVC-MNDCAN-----NLSTSSKAL 976
            LSLHLK+LME  +AH+S    DS +  EN   +T D+   M + +N     +L  ++KAL
Sbjct: 901  LSLHLKMLMEHSLAHISLREHDSEEHLENSCSMTADDSSSMEEYSNKGSEMSLEENTKAL 960

Query: 977  G---RWNLC---ARSDLGTGLSDCEEGGSSR-----YKRSRLVAETCAGSHDSDKARNDV 1036
                  + C    R +L  GLS C +    +     +    + A T A S    K R D 
Sbjct: 961  SGEVASDGCFSSGRPELSNGLSVCCDRDQIKASQPCHNGDAIAAGTSADSPVHKKIRTDA 1020

Query: 1037 KKRMRSSGNDKSEK------AMALPNVARSDNGSDSFLNDLSVEIPSF----QPVDGELH 1096
              ++++     SE       + +L +  +S+ GS SF+N LSVEIP F    + VDGELH
Sbjct: 1021 TVQLQAWKGHHSESDQSALLSRSLDDRDKSEKGSQSFVNGLSVEIPPFNQFEKSVDGELH 1080

Query: 1097 NAQLSMDVAWNVNSGIIRSPNPTAPRSTWHRNKNNSSSFGLASHGWSDGK--DFLNGLGN 1156
             AQ + D++WN N  I  SPNPTAPRSTWHRNK NSS FG  SHGWSDGK     NG GN
Sbjct: 1081 GAQQATDLSWNTNGAIFSSPNPTAPRSTWHRNKQNSS-FGHLSHGWSDGKADPVYNGFGN 1140

Query: 1157 RTKKPRTQVSYMLPFGGLDYGSKNRNSHPKATPYKRIRRASEKR-SDAARGSQRNIELLS 1216
              KKPRTQVSY+LPFGG D   K + S  K  P KR+R+ASEKR SD +RGSQRN+ELLS
Sbjct: 1141 GPKKPRTQVSYLLPFGGFDCSPKQK-SIQKGLPSKRLRKASEKRSSDVSRGSQRNLELLS 1200

Query: 1217 CDANVLITTGDRGWRECGARVVLEVFDHNEWKLAVKLSGITKYSYKAHQFLQPGSTNRYT 1276
            CD N+LIT  DRGWRECGA+VVLE+FD +EWKLAVKLSG+TKYSYKAHQFLQPGSTNR+T
Sbjct: 1201 CDVNILITATDRGWRECGAQVVLELFDDHEWKLAVKLSGVTKYSYKAHQFLQPGSTNRFT 1260

Query: 1277 HAMMWKGGKDWILEFPDRSQWTIFKELHEECYNRNIRSASVKNIPIPGVRLIEENDEHVA 1336
            HAMMWKGGKDW LEF DRSQW +FKE+HEECYNRNI++ASVK+IPIPGVRL+EE D++ A
Sbjct: 1261 HAMMWKGGKDWTLEFMDRSQWALFKEMHEECYNRNIQAASVKSIPIPGVRLVEEGDDNGA 1320

Query: 1337 ETAFMRNPSKYFRQVETDVEMALNPNRVLYDMDSDDEQWIKDVQTSSEVGSSSSLGEASS 1396
            E AF+R+ +KYFRQVETD+EMALNP+RVLYD+DSDDEQWI   ++SSE+  S SLG+ S 
Sbjct: 1321 ELAFVRSSAKYFRQVETDIEMALNPSRVLYDLDSDDEQWIMKARSSSEL-DSGSLGKISE 1380

Query: 1397 EVFEKTMDAFEKAAYSQQRDEFTDDEIAEAVNETLVSGLTKGIFEYWQLKRRQKGMPLLR 1456
            E+FEKTMD FEKAAY+ QRD+ T +EI E         + K I+E+W+LKR++ GMPL+R
Sbjct: 1381 EMFEKTMDMFEKAAYAHQRDQLTLEEIEELTVGVGPMDVIKVIYEHWRLKRQKNGMPLIR 1440

Query: 1457 HLQPPLWETYRQQLKDWESTVNKNNTNSCNGYHD-SASIEKPPMFAFCLKPRGLEVSNKG 1516
            HLQPPLWE Y+Q++++WE  + + N N  NG  + +A IEKPPMFAFC+KPRGLEV NKG
Sbjct: 1441 HLQPPLWERYQQEVREWELAMTRINANLPNGCQEKTAQIEKPPMFAFCMKPRGLEVPNKG 1500

Query: 1517 SKQRSHRKFSMAGHSNSITYDQDGLHGFVRRLNGSALGDDRMVYIGHNYEFLEDSPLIHT 1576
            SKQRSHRK S++G SN+   DQDGLH + RRLNG + GD++ VY G+NY+ LEDSPL  T
Sbjct: 1501 SKQRSHRKISVSGKSNTTFGDQDGLHAYGRRLNGFSFGDEKFVYPGYNYDSLEDSPLPQT 1560

Query: 1577 SSSLFSPRLEGGI-LSNDGFERNVLPKLHKTKSRKYG--ASPYEPMMASSFNQRMV--GK 1636
               +F PR  G + ++N G +RN   K  ++KS+KYG   SP  P     +  R+V  G 
Sbjct: 1561 PRRMFLPRDAGSMSMTNYGLDRNHSYKFQRSKSKKYGNTVSPNNPQTMGLYGHRVVGNGS 1620

Query: 1637 RDGLNRWNNGYSEWSSPLRYRFDGSQRQILEQLEGSDLDEYRFRDVSGAAQEARNVAKFK 1656
            R+GL+RWN G+SEWSS   ++ + SQR  +EQL+GSDLDEYR RD S AAQ A N+AK K
Sbjct: 1621 RNGLHRWNMGFSEWSSQQHFQPEPSQRHFIEQLDGSDLDEYRVRDASSAAQRALNIAKLK 1680

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0LJD1_CUCSA0.0e+0080.27Uncharacterized protein OS=Cucumis sativus GN=Csa_3G879490 PE=4 SV=1[more]
M5XKQ9_PRUPE0.0e+0054.63Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa000151mg PE=4 SV=1[more]
W9SBV9_9ROSA0.0e+0051.74Uncharacterized protein OS=Morus notabilis GN=L484_007381 PE=4 SV=1[more]
A0A061GQB3_THECC0.0e+0050.70Enhancer of polycomb-like transcription factor protein, putative isoform 1 OS=Th... [more]
A0A061GW48_THECC0.0e+0049.83Enhancer of polycomb-like transcription factor protein, putative isoform 4 OS=Th... [more]
Match NameE-valueIdentityDescription
AT4G32620.27.9e-27638.77 Enhancer of polycomb-like transcription factor protein[more]
AT5G04670.12.3e-4135.69 Enhancer of polycomb-like transcription factor protein[more]
AT4G31880.17.5e-0821.92 LOCATED IN: cytosol, chloroplast[more]
AT2G31650.13.2e-0633.33 homologue of trithorax[more]
AT1G15940.14.1e-0627.40 Tudor/PWWP/MBT superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659132747|ref|XP_008466363.1|0.0e+0080.75PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103503793 [Cucumis me... [more]
gi|778687072|ref|XP_011652501.1|0.0e+0080.27PREDICTED: uncharacterized protein LOC101216141 [Cucumis sativus][more]
gi|596285582|ref|XP_007225478.1|0.0e+0054.63hypothetical protein PRUPE_ppa000151mg [Prunus persica][more]
gi|1009137360|ref|XP_015886011.1|0.0e+0053.40PREDICTED: uncharacterized protein LOC107421307 [Ziziphus jujuba][more]
gi|703147422|ref|XP_010109047.1|0.0e+0051.74hypothetical protein L484_007381 [Morus notabilis][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR019542Enhancer_polycomb-like_N
IPR002999Tudor
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG02g04400.1Cp4.1LG02g04400.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002999Tudor domainSMARTSM00333TUDOR_7coord: 387..445
score: 8.
IPR019542Enhancer of polycomb-like, N-terminalPFAMPF10513EPL1coord: 1239..1333
score: 6.2
NoneNo IPR availablePANTHERPTHR13793PHD FINGER PROTEINScoord: 1033..1350
score: 9.3E
NoneNo IPR availablePANTHERPTHR13793:SF94SUBFAMILY NOT NAMEDcoord: 1033..1350
score: 9.3E

The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
Cp4.1LG02g04400Cp4.1LG06g03560Cucurbita pepo (Zucchini)cpecpeB467