Cp4.1LG08g08310 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG08g08310
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Protein DEK
Location: Cp4.1LG08 : 6603183 .. 6613258 (+)
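
As a quick check on the location line, the feature's genomic span can be computed from its coordinates. The sketch below assumes the coordinates are 1-based and inclusive (a common convention for genome-browser displays, but not stated in this record); the `parse_location` helper and its regular expression are illustrative names, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'chrom : start .. end (strand)' into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Cp4.1LG08 : 6603183 .. 6613258 (+)")
span = end - start + 1  # assumption: 1-based, inclusive coordinates
print(chrom, strand, span)
```

Under that assumption the gene spans 10,076 bp on the plus strand of Cp4.1LG08.
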
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
TACTCGGAGAAGGACTTAGTTTATTTTATTACAAATTTAAATATTAAGAAAAAAGTGAAAGGAATACGAAACGAAGCCGAAAAAAAAAAAAAGGAGTTATCTACGGAGCTGAGAGGCGACAGTGAGACAGTAGAACTCCGACGACTTTCTGTTCGACGCTCGAATCGATTACCGAAGCTCTGAAATTCAGTGAGTTTTAGCTCCTTAAACTTGTACTCTTCTCTTTTAGGGCTTTTCCATTTCTTGCAATTCCGCAATTTTGGGGGTTTGTTGTTGTGGAATGCTCTGGATTTGTAGGATTAGGGTTTCGGATGGTCCTGTGCTGATTTTGTTCTTTCTTTTTGTTTTTTTTGGGTGAATTTAGTGTGTTTTGTTTCTTCGGAGCTGTTTTTGTTAATCTCTTTTAGCTCTCTCCGTGCGGGAGTGCTCGTTTTACTACTGATCTATTATGAAATTATGGCCGAATTCTTTGTGTTCATTTTAATTGGTTTTTCTCAGGTTCAAATGCAGAGTTTTTTTTGTGGGTTGAATGTCATGCATGATGCATCTTATATACTTCCACTGTTGATAATTCTTTGTTGAATTGAGGATTTTAAGTAGTGATGCTATTGCCGCTTATGGTGTTCTCTTTTCTAGCGATCCTTCTGAATGATATTGTTTTTATTTATTTGCTTCGAGTTTGTGTTACTGTCTTTTCTTAAATTTTGCTTTTCTGAAGGATTTCAAAGGATTCCGAATGGCAATTGATTTGAACTGATACCTTTAATTTTTATGTTCAGATTTGAAGTGAATTCTTTGTTGTGCTTAAGTTAAAGTTGTATGGAAGTAAAGATGAATCCAGATCTTGTTTTCCTCCACAAAAAAATTTAGGTTAAATTACCAATTTTATTCCTATGGTTCTAAAAGTTTAGGGTGGTGATCCAAGTAGATTTAGAATCTTGAAACATTGGAAGTATGTAATTTCTATATTCTAAATTATCACTTTGTCAGTCACAGAATTCAAATACAGTCTAATATTGTTTTTCATAAATAAACTCACTGCATAGAAATCCATTGAATTCATTTGATTTAGTGAATCTAAAATCAAGAAACTTGAATCCAGATAATATTTCAAACAAACCCTTTAAGGTTTGGTTCTACTACCTCACAAGTCATTCTTTAACCATATTTTATTCATCTATAAGCCCACCATTTTTCCTCTATTTTCTTACTCAGGTTCTAAATCCTAATACATCCTAATACAGTCTAGTAACTCCATTTGACACCAGCTCATTGTCATAAAATAGAAAATAGAGAATGTGTCATCATTTCCATTATGATTACGACTTACAAGGAGAATGATTTATGAGATTTGATCGAATTGTTGTTACCAAATTGAAACCTAAAAGCACTGAAAACTTTAAAACAGTAGGGACCAAATTTAGCAATTTAACCGTAATTTTATTCCAAAAAGGGTTGGTTAGATTCAACTTAAAAAATTCTATTGTTTTTTATAAGAGAAAAAGAAATCTATTGTCGCATCAGCCCAAAAAGAAATCTATTGTCGCATTAACCCTTAGGAAGTTATCATTTACTGTAAGAGTATTGTAGTAAAATGCTTTTATTTCTAGTTCACACCCAAACGAGAAGTTTTCATTTCATTCATAGCACAACTGTCTCATTCCAAAGGAAATTTGAATCTTAAATTATTCATGGTTTATAATTTGTGTACATCATAACTTAGCATCCTCATCTATCGTCAAGCATTCCCCACAGCAAATGTTCATTTTTTATCAGTATGTACAGTTGAGGCATTGAGTACCTTAAGGATTGTGATATTCTCCTGCAGAGCCAATAAATTTGAACACTTTCTTTTGAAGTAGTACCCCTACAAACAAAAATACTGATATGCTATTTATTGTTACTCCAAATCACTAGAGTAGTAGCATTAGTACAGATTTATAATTTAGTTTACTTTCCATATAATGTGTGTATTTAACATTCATATTGAATGATTGTGT
ACTTTTATATTTTGTGTAGTTTTATTCATGTGACTAGACAAAGATGTGCACGTGAGGAGATTGTGGTTTTCTGTAGATTAATTTGAGGTATATCAATCATGGGTCAAGAAGATGCAACAAATAATACCTTTGAGAATACAAAAAATGAAGCCAGTGGTGGAGATTTAAAAACAAATAGCATTGAGACCGTGGAAAATGGAAACAATAAAGAAGATAAAATGAAAAATAACGTTGAGACAGTGGACAATGAAACCACTGAAGAAGATAAAATGACAAATACCACTGAAACTGTGACAAATGGAATCAGTGAACTGGAAAAAATCAACGAGACTGTGCCTAATGGGTTGGAGAATGGAGTGAAAGAACCGGAAATTGAGCAGTGTACTGATGAGAAGGCAGAGGCCACTAAAATGGAAGAGAAACCTAAGATCAAGGAAGATGAGGAAAGCAATGAAGAAACTGTGAAGGAGGAAAAGGAGGAAGATGTGCTTCCAAATGACAAGAAGATTGAAGAAAATGTGGATATCAAAGATGAAGAAAATGTGGATGTCAAAGATGTAGAGGATGCCAAGGATAAAGAAATTGAGGCCAAGGACGAAGAAATTGAGGATGCCAAGGATGAAGAGGTCGAGGTCTTCGAGGATGCCAAGGATGAAGAAATTGAGGATGCCAAGGATGAAGAAGTTGAGGATGCCAAGGACGAAGAAGTTGAGGATGCTAAAGACGAAGAAATTGAGGATGCCAAGGATGAAGAAATTGAGGATGCCAAGGACGAAGAAATTGAGGATGCCAAGGACGAAGAAATTGAGGATGCCAAGGAAGAAGAAATTGAGGATGCCAAGGACAAAGAAATTGAGGATGCCAAAGATGAGGAAAATGAAGAAGAAAAAGAGGATGCTAAAGATAAGGTAGAGAAGGTTGACAGCCATATGGAAGAGGATGATAAGGAATTGAAGGATGAAGATCCCAAGGAAGGGAAAACAAAGAAAGCAAGAAAGAGAAGGGGTGCGGTTAAATCAAAAGGAAAGAATGAAGAGGATGAAAAGGATGAGGTAGGGATAAAGACTCCCATTATTGATCGCCCTGTTCGTGAGCGAAAATCAGTTGAAAGGTTGGTGGCATCTATAGAAAGATGTGTTGTGAAGGAATTCCATATCGAAAAGGTATTATATTTCATTTGGCCTTCCTGGAGAGTTTGGTATATTTTTAGAGCATTGATTCTATTTCTTTTTGTTGAAAATTTCAGGGCCGTGGTACTCCACTTAAAGATATACCCAATGGTATGATATGCATCCTATAATCATCTTACAGTTTGTTGTCGTGCATCCCATTTTTCTTTTATATTATTCCCTCCTACACGTTTATGGTGGATTATTGTATACTATGGTTTTGACATGTCCTTTATTTATTGCTCAGATTCCTTCGAAGAATAATTTGTTAATGTGTTGTCTTTAATATATTCATTATCCTTTAATCTACCATTTTAAAGTATGTGCATAATTTCATTTTTCATGCCTTGGATTGTTTTCTGACATAGCAGCTGAAGTAATGAAGTAGTATAAAATTTCCTGTTTCAAGCCATTTTTTATTTTGTTTCCCACATATACTTCTGAGTTATGCATAGCCTAATAATATATATATGTATATATGATAAATGAACTCTGAAAAAGGAGGGCTTTTGTTTTAGTGGCATTCAAACTCTCAAGGAAGAAGGCTGATGACATCTTCAGGCTGCTTCATTCAATTCTATTTGGAAGGAGAGGAAAGGTACTTATGACTAAGTGGTTTTCTCGTAGGTTAATCATCTTGTCCTTTGATGAATGATTCCTGAATTTACACATCCATATTGCGAAGTGATAACAAGCAGTACCGTTTGGTGTTTCTGATGTTCTGGTTTATGACTTGGAATGTAGTCATGGAGTATTTGCATCTAACATCTGTTTCTTTCTTTTGTCTACCCTTTCCTCCAGGCATTTCAGATTAAGAGCAACATATCCAGG
TTTTCAGGTTTTGTGTGGCATGGAGATGAGGTAAAATATCGTATGTGGTGCCTACCAATATGAAATGTTTCTATCTGGAAGTAATTAGTAATTATCGGGAGGATTACAAATTTTTAGCAGAAGTTTCAATTTCTCAGACTTTGTTGTTTCACATGTAGCCTGTCTATCATATGGTAGTGAAGGCACTATCTTGCTGAGATTTTATCATGCAAGTTCCATTCTTTGCAGGAAAAACAAAAGAATAAAGTCAAAGAAAAATTTGACAAGTGTCATAAAGAGAAATTATTGGAATTGTGTGATGTGCTTGACATTCCTGATGTGAAGGCTACCACAAGAAAGGTCTGTTTGATACTCTATTTTGCTGTACTGGCATTCCTAGGTTTGCATTTGGGACAGATCATTCTTAAAAGTGCATTGATAATGAAAGTGAAAGGTTCACATCTAAATTATATTTTATTTCGCTGAAAAAGTTGATCTTATATTTCCTCTAATTGTAATAATCAATCATTGGGACTTTTATCAAATTTTGCATCCTTATAACGAGGACTGATTTCAAATTGGATCCATTGTTTGCCAATATCATATCCTTATTGTTCATGCACTGAACTTTCTTTCCATAAGTAGCCTTTTAATAATTTATCTGCTATGCTAGGATTTTTTTTCCTGTAAGATTCATGCTGTAAAAGACGATTGTCTTTAAGAAAAAGTGATATCCACCAACGTTGATATTGAATATAATGTTATAGGAGTAATTAGTTGTTTAAGATGGCATTCATAGTTCTTTTTCCCTTTTTGTGTGTGTGTGTGGTGGGGGATGTTAAAAATTAATCTTAAGAGACCTGTGCTTTGTGATCGCATGTCTATACTTAGGCATGGTGGCCTAGGACTTTTGTAATTATCCATTAGGTCTTATTGTCATGTAATGGAAGCCTGTTTTGTATTGTTAGATTATTCATGTTCTGCTGTTTTTTGTTTGCCCTTGTATTCTTTCACTATCAGTGGAAGCTCAATCTCTTATTAAAATAAATAATAGTAACTTGTCCGTAATTTATTTAATAAGAAAGAACAGTGTTGTTTTGAACATACGCTGTCCATGAAAATAAATAAAAGAAAAAGAAAGTGGATGAGGAAGAGGAATGACTTTTAGCTAACCCAACAATGCTTTCATGGATTCTTTTTCCAATGTGTATATTCTAGAATTAATAAAAGTAAGTATTTTCTTGTGTCGTTAGAAGGAAAGATTATGTTAGCATGGTTCTTAGAAAAGGTATTTTGATAAGAAGGGGATGTTGGGGAGATTTTAACATTGAAACTTGAAGGTTTGGTTTTTCTTTTTCCAAAAGAAAACATATATATTGTCTGGGTGTTTGGGGTGGATATGGTGCGTAACAAGGATTTCACCCTCTATGTTGGAGATTTTGATGAACTCCCTTTCGTGATAAAGGAAGAGTCTTGTGGCATGCTTGCTTTTTTTCTATTTTATGGGAAGTGCCTCTCTCGATGAATTGTAGAAAATTTAGGGGGTTGGAGAATCTTCGAGTGTGTGAGGTTGTGAGTTAATGTGTCCTTATGATAGACCATCTATGAATCGTTCTTTTTTGTAATCATCAGTTTAGGATTATTCTTTTGGATTGGAATCCATTCTTGTAATTTGTTGAAGACTCCTTTTGTGAGCCTGTTTTTCTTTTTGTATACCCTTATTTTTCCTTTTATTTTTTTCTCATTGAAAACTTGGGTTTTTATTAAGAACATATATATAGTTTTTCATTGCTCGAGTTTGATGTTATTTCTTTGTCTTGATTCAGGAAGATGTTATTGGCAAGCTCATAGAATTTTTAATGGCTCCTCATGCTACAAGTACTGTTCTCCTTGCAGAAATAGAAAAGGTTATTTTACCCCTCCTGATTGCAAACAAGCTCAATCTTCCCAATCAAATGCTTCCTTTATATTTACCCAAATTCTGGATCTAGTTTCTTAAATTTTGATGTTGTTAATTACAG
TCAAGCAAGGGTAAAAAGCGTAAACGGACTGTAAAAGGAGGAATATCAACTCCTGGAGACGATAGTTCAAAACAGTCGGCAAAGGTGGCATTCTTTTCTCTCTGGGAGTATAATTGGTGAATTACTTCTAGGCCTTGTTTATACCCATCTTAATTCACACCTTGAATGTTTTTTTATCTTTGAATTCTCCTGTTATGTCTTTATGTTCCGTACTAGTTAGGTAGCTCAAGTATAAATGAGATAGTATCTTCTGTTTTCAGCCTTCAGATGCCTGAGGAAGTTGCTGAGTTATGTTGGTCTTCATGATAAGATATTTTCTTATTTCTTTTCTTTTGATTTTTGAAAGAAAATCGTTTGTTTTGAATACTTACCCTACTCGCCCTGTATCTCAAAATTTAGGAAAATTATTACCGTTAATGTAGTATCTGGTCACGAATGCTTGAAATTCTTCCCTTGGTTTGATTCTCTTGGATTTGGGGGAGCCCGTTGTTAATTCCCTTTTGTAATTTCAGTTTTTCATCAAAGAAAAAAAATGAATGTTTGAAATCCTGTATTTTAGGAAGTGTTTGATGCATTTATTGGAAAGCCTTTGAATTGAGCCCTCTTTATCACTTCCTGCCAGTGACGCCCTTGTCTTCATGATTCGGAGAAAAGGATTTTGTGTGTTTGACATGCCTTTTAAAGTGCTTAAGGAAGACAAAAGCGTATTTTTAATGCTTGGTAGCGTTCGACTACTTTTTTCTTTTTGGATAGGGAACAGTTTCGTTGGATAAAAATGGGTAGTGCTGGTAAAATTTTAAAGAATTCATTTAACCAATTAAAAAATTTCAATAATTAAAATAGTTTGAGAAATTCTAGTAACAGAAACAAAAACTTTATCTATAGGGCTTTTAGATGAAGTTTTGAATTAATTTTCTTCTCCCTTATTGGGCCACCTGTTCAAGGACAAAAGGATTCTTTGGACCAACACTGCTCGGGCTTTCTTGTGACCACTTTGAAGGATCAGAATGCTCTAATCTTCAAGAAATAAGATCCAAAATTTTGACATCTTTGAATCCACTACCCTTTAGCTCTTATGTGGAGTTGTTCTCCACCTTGTTGTAACTATAGTTATGTACTTTCATAGTATAAGTTCCAAAGTTAAAGATTTATAGATCATTCAAGCTTGTTTTAGATTCTGGTTCGTAGGTACGGAATTGAAGCAATAACAGGATAGTCAATCAAGTGGTATCAATTCTCTAGCCCGACCTTGTCTGTTGTCATGGTCCTTGTGGCTGGCCTTGATTTGAAGCTATATCCACATATTGGTTTCTCCAACTTTCTACAACTTTTCGAGAGATTCTCTCAACCGAGAAATCGAATCATCAATTTTTAAGATGGCAAGTNTTGGCATCTTTTTAACCTTGCCTATTTGATGTGCTCAAAGAATTTGTAAATAAAGGTATTTTTTTCTTTAGAATACTGTTTCTGTGGCTATACCTTGGGGAGAAACAACAGTTATTTACTTACTGTGGCATCCCTTTAACTTTACCTATCCACGTATTAGTTCCCTCATTTCTCTCTGTGGAGAATTCAATGTTATTACTTTTCTTTGGATGTCACTTCCTCTTGTTATTTATCCTTTTTAAGACTTAATGATAAGGATTCCTTGCCTCCAGTGCGTTGTACACGTGCTAGAAGGAAGTCTGGCAAGTGGATCAAGTCGAGCTAGTTTTATGGGGGTGTGGGTCTCACTTTGTATTAGCATAAAAAGTAGATGGGCATACAATGGGTCTCTCTGAATGAATAAAGAGTGCACCTCTGAATAAAACAGGGGAGAGTACTTGGTCTCTTGTAGTGCTTGGAGTGCTGTATTTTAATCTATTGTTTATATTAACTACTTATTTCATTTTATTTCTGGGTTGACTCTAGAGTCGTAGAAAAAGAGGAAATACTGCAAGATCTGAGATGACAAGAGATACTAGTGATGAAGATGGTGAGTCAGAAGAAGAGAAGGAAGC
AGAAGAAGAAAATGACAAGGAAAATGAGAATGGAACCACAGAAAAATCTGATGATGAAATGTCCGAGCAGCCAGAAAGTGAAGATATCAATGACCCAACTGATGAGTCTGAGGAGGAAAAACCCAGAGCAAGTTCAAAACGTTCATCTAGAAAGAGGGGATCTGTAGGAAAAGCAAGAAGCAAGAAAGTTACAAGTTCCAATAAATCCGATTCAGCAAAATCAACATCGAAGAGGTCATCAGCAAGTCGTGCTAAGATTGATGACAGTGATAGTCCTAAGGTATTCTCTAGGAAGAAGAATAGCGAAAAAGTAAGCAAGGCCTCAACTCCACCAAAATCTGCTGCCAAGGAGAAGCCTGGTACCTGTCATATCTCTTTCAACTTGTTTGTTATGTTTTCTTCGCCTTCTTGTTCTATTCTTTTCTTGCTTGGGACATAAAAATAAAAATTTGTGTGCTTTTCATTATTAATCTTCTGAGCTCCAATTTTCTGGTATTACTTTAATCTGTTTGGGACCCTAAGGGATGACGGACACTGTGTTCTTTTTCTGCGTTATTTACGCCATTGGTGTATGTTTTAGGGAAGAAGATTACGAAAGGGAAGGACAAGACCAAGGAAGAGAAAACAAGGCCAAGCGATGATGTGCTTAGGGATGCAATATGTGAAATTCTTAAAGTAGTCGACTTCACTACGGTAAGTGTCACTGTATTCATACTGGCTACTCAAAGTTTAGTTGGCAAGCTTACTTCAATACTTGATATTTAGCTGTAATTGAGCATGATTTTCTGGCTGTTGGAGATTCATGCGTTTCTGGTGGTTTTTAGAACTGCTATCATGCGTTTACAAGCATTAACTGTCACTTGATCTTCCCTTCTCATGTGGACATTTGTGATTAAATAAATGCATATCTGAGCTATTAGATTCTTTCTTGTGTTAGTGCAAAATATTATGACTTTTAACAGCTGCCTGTTCACTGGTTCAAGTGTAAAGCCAATACAAAAGATGGTTGGTAAGGCAATGGTTGCGGTTCTAGAAATTGTTGGCTTTGAAGGATGAATGATTTAGAGTTACGGAGAAAGACAAATAAACTGAGGATGGATTAACATAAGAGTTTCTGTTTTCGCTAGCCTTAGTTTCTAAAAATCTCCGTGTCTAAGTAATATTATGTTTTGTGCTTGGTTCTGATAGTAAATGTTAGCCACATTTTGTAATCACAAAATAACGCGTAGCTCGTTTTTCGATTATGCAGGCCACCTTCACCGACATTCTAAAGCAACTTGGTACGTTTCTTATCTACTTGGATATTTTGTGGTTTGTTGAACTTATCCTTTATACAAGATGTCTTGAGTATTGTATTTCTTGATCACCACCATTTGAGATATCTTCAAATAAACAGTAGAAATTGTTGTATGTTCCATTGATTCATGAAAAATATAGAGATCATTGACTCAATTGATCTGATTTTATAACTTGTTTGAGATTTACCCTTTTCAACGTAGTACAATTTGGAAGAAAGAAAATAGAACACCTATCAAGTGTTCTTATAGAGTATTCCATATTGAGAACCATTTTCAAACATGTCCCTTCTGATCTGCTTGTGATTTCTCAGCTGGGCAATTCAAGATGGATCTCACCGCACAAAAGTCGTCGATAAAACTTATGATCCAAGAAGAGCTCACGAAACTGGCAGATGAAGCAGAAGACGAAGAGGACGGCGAAGGCGATGCCGATGCCGAGAAGGATGTAAAACAGGCCGCTCAAGAGGTGGAAACTTGAGTTGCCAAGAGTTGGCCCTGGGATTTGTCTAACAACATATATGAACGCCCTCAAAATCAGTGAGTCTGTTGTAAAGGTGGATGTAATTTATTTGTCGGTGTCAACCATGTTTACTTCTAATGAGAGATTAGTGTTTTAATTAGAAACGAGATTAGCTGTTGCTCATAGTTGCAAATATTGCAGTTTCTTTATACTGACTTATAAATACTGTTGGGAATAATTCC
ATGGATATATTCACTGACCATTAAGGAGTTTTTCGACTTCAGGTTTAGACCTTGAGACATTATATATATACTATAT

mRNA sequence

TACTCGGAGAAGGACTTAGTTTATTTTATTACAAATTTAAATATTAAGAAAAAAGTGAAAGGAATACGAAACGAAGCCGAAAAAAAAAAAAAGGAGTTATCTACGGAGCTGAGAGGCGACAGTGAGACAGTAGAACTCCGACGACTTTCTGTTCGACGCTCGAATCGATTACCGAAGCTCTGAAATTCATTTTATTCATGTGACTAGACAAAGATGTGCACGTGAGGAGATTGTGGTTTTCTGTAGATTAATTTGAGGTATATCAATCATGGGTCAAGAAGATGCAACAAATAATACCTTTGAGAATACAAAAAATGAAGCCAGTGGTGGAGATTTAAAAACAAATAGCATTGAGACCGTGGAAAATGGAAACAATAAAGAAGATAAAATGAAAAATAACGTTGAGACAGTGGACAATGAAACCACTGAAGAAGATAAAATGACAAATACCACTGAAACTGTGACAAATGGAATCAGTGAACTGGAAAAAATCAACGAGACTGTGCCTAATGGGTTGGAGAATGGAGTGAAAGAACCGGAAATTGAGCAGTGTACTGATGAGAAGGCAGAGGCCACTAAAATGGAAGAGAAACCTAAGATCAAGGAAGATGAGGAAAGCAATGAAGAAACTGTGAAGGAGGAAAAGGAGGAAGATGTGCTTCCAAATGACAAGAAGATTGAAGAAAATGTGGATATCAAAGATGAAGAAAATGTGGATGTCAAAGATGTAGAGGATGCCAAGGATAAAGAAATTGAGGCCAAGGACGAAGAAATTGAGGATGCCAAGGATGAAGAGGTCGAGGTCTTCGAGGATGCCAAGGATGAAGAAATTGAGGATGCCAAGGATGAAGAAGTTGAGGATGCCAAGGACGAAGAAGTTGAGGATGCTAAAGACGAAGAAATTGAGGATGCCAAGGATGAAGAAATTGAGGATGCCAAGGACGAAGAAATTGAGGATGCCAAGGACGAAGAAATTGAGGATGCCAAGGAAGAAGAAATTGAGGATGCCAAGGACAAAGAAATTGAGGATGCCAAAGATGAGGAAAATGAAGAAGAAAAAGAGGATGCTAAAGATAAGGTAGAGAAGGTTGACAGCCATATGGAAGAGGATGATAAGGAATTGAAGGATGAAGATCCCAAGGAAGGGAAAACAAAGAAAGCAAGAAAGAGAAGGGGTGCGGTTAAATCAAAAGGAAAGAATGAAGAGGATGAAAAGGATGAGGTAGGGATAAAGACTCCCATTATTGATCGCCCTGTTCGTGAGCGAAAATCAGTTGAAAGGTTGGTGGCATCTATAGAAAGATGTGTTGTGAAGGAATTCCATATCGAAAAGGGCCGTGGTACTCCACTTAAAGATATACCCAATGTGGCATTCAAACTCTCAAGGAAGAAGGCTGATGACATCTTCAGGCTGCTTCATTCAATTCTATTTGGAAGGAGAGGAAAGGCATTTCAGATTAAGAGCAACATATCCAGGTTTTCAGGTTTTGTGTGGCATGGAGATGAGGAAAAACAAAAGAATAAAGTCAAAGAAAAATTTGACAAGTGTCATAAAGAGAAATTATTGGAATTGTGTGATGTGCTTGACATTCCTGATGTGAAGGCTACCACAAGAAAGGAAGATGTTATTGGCAAGCTCATAGAATTTTTAATGGCTCCTCATGCTACAAGTACTGTTCTCCTTGCAGAAATAGAAAAGTCAAGCAAGGGTAAAAAGCGTAAACGGACTGTAAAAGGAGGAATATCAACTCCTGGAGACGATAGTTCAAAACAGTCGGCAAAGAGTCGTAGAAAAAGAGGAAATACTGCAAGATCTGAGATGACAAGAGATACTAGTGATGAAGATGGTGAGTCAGAAGAAGAGAAGGAAGCAGAAGAAGAAAATGACAAGGAAAATGAGAATGGAACCACAGAAAAATCTGATGATGAAATGTCCGAGCAGCCAGAAAGTGAAGATATCAATGACCCAACTGATGAGTCTGAGGAGGAAAAACCCAGAG
CAAGTTCAAAACGTTCATCTAGAAAGAGGGGATCTGTAGGAAAAGCAAGAAGCAAGAAAGTTACAAGTTCCAATAAATCCGATTCAGCAAAATCAACATCGAAGAGGTCATCAGCAAGTCGTGCTAAGATTGATGACAGTGATAGTCCTAAGGTATTCTCTAGGAAGAAGAATAGCGAAAAAGTAAGCAAGGCCTCAACTCCACCAAAATCTGCTGCCAAGGAGAAGCCTGGGAAGAAGATTACGAAAGGGAAGGACAAGACCAAGGAAGAGAAAACAAGGCCAAGCGATGATGTGCTTAGGGATGCAATATGTGAAATTCTTAAAGTAGTCGACTTCACTACGGCCACCTTCACCGACATTCTAAAGCAACTTGCTGGGCAATTCAAGATGGATCTCACCGCACAAAAGTCGTCGATAAAACTTATGATCCAAGAAGAGCTCACGAAACTGGCAGATGAAGCAGAAGACGAAGAGGACGGCGAAGGCGATGCCGATGCCGAGAAGGATGTAAAACAGGCCGCTCAAGAGGTGGAAACTTGAGTTGCCAAGAGTTGGCCCTGGGATTTGTCTAACAACATATATGAACGCCCTCAAAATCAGTGAGTCTGTTGTAAAGGTGGATGTAATTTATTTGTCGGTGTCAACCATGTTTACTTCTAATGAGAGATTAGTGTTTTAATTAGAAACGAGATTAGCTGTTGCTCATAGTTGCAAATATTGCAGTTTCTTTATACTGACTTATAAATACTGTTGGGAATAATTCCATGGATATATTCACTGACCATTAAGGAGTTTTTCGACTTCAGGTTTAGACCTTGAGACATTATATATATACTATAT

Coding sequence (CDS)

ATGGGTCAAGAAGATGCAACAAATAATACCTTTGAGAATACAAAAAATGAAGCCAGTGGTGGAGATTTAAAAACAAATAGCATTGAGACCGTGGAAAATGGAAACAATAAAGAAGATAAAATGAAAAATAACGTTGAGACAGTGGACAATGAAACCACTGAAGAAGATAAAATGACAAATACCACTGAAACTGTGACAAATGGAATCAGTGAACTGGAAAAAATCAACGAGACTGTGCCTAATGGGTTGGAGAATGGAGTGAAAGAACCGGAAATTGAGCAGTGTACTGATGAGAAGGCAGAGGCCACTAAAATGGAAGAGAAACCTAAGATCAAGGAAGATGAGGAAAGCAATGAAGAAACTGTGAAGGAGGAAAAGGAGGAAGATGTGCTTCCAAATGACAAGAAGATTGAAGAAAATGTGGATATCAAAGATGAAGAAAATGTGGATGTCAAAGATGTAGAGGATGCCAAGGATAAAGAAATTGAGGCCAAGGACGAAGAAATTGAGGATGCCAAGGATGAAGAGGTCGAGGTCTTCGAGGATGCCAAGGATGAAGAAATTGAGGATGCCAAGGATGAAGAAGTTGAGGATGCCAAGGACGAAGAAGTTGAGGATGCTAAAGACGAAGAAATTGAGGATGCCAAGGATGAAGAAATTGAGGATGCCAAGGACGAAGAAATTGAGGATGCCAAGGACGAAGAAATTGAGGATGCCAAGGAAGAAGAAATTGAGGATGCCAAGGACAAAGAAATTGAGGATGCCAAAGATGAGGAAAATGAAGAAGAAAAAGAGGATGCTAAAGATAAGGTAGAGAAGGTTGACAGCCATATGGAAGAGGATGATAAGGAATTGAAGGATGAAGATCCCAAGGAAGGGAAAACAAAGAAAGCAAGAAAGAGAAGGGGTGCGGTTAAATCAAAAGGAAAGAATGAAGAGGATGAAAAGGATGAGGTAGGGATAAAGACTCCCATTATTGATCGCCCTGTTCGTGAGCGAAAATCAGTTGAAAGGTTGGTGGCATCTATAGAAAGATGTGTTGTGAAGGAATTCCATATCGAAAAGGGCCGTGGTACTCCACTTAAAGATATACCCAATGTGGCATTCAAACTCTCAAGGAAGAAGGCTGATGACATCTTCAGGCTGCTTCATTCAATTCTATTTGGAAGGAGAGGAAAGGCATTTCAGATTAAGAGCAACATATCCAGGTTTTCAGGTTTTGTGTGGCATGGAGATGAGGAAAAACAAAAGAATAAAGTCAAAGAAAAATTTGACAAGTGTCATAAAGAGAAATTATTGGAATTGTGTGATGTGCTTGACATTCCTGATGTGAAGGCTACCACAAGAAAGGAAGATGTTATTGGCAAGCTCATAGAATTTTTAATGGCTCCTCATGCTACAAGTACTGTTCTCCTTGCAGAAATAGAAAAGTCAAGCAAGGGTAAAAAGCGTAAACGGACTGTAAAAGGAGGAATATCAACTCCTGGAGACGATAGTTCAAAACAGTCGGCAAAGAGTCGTAGAAAAAGAGGAAATACTGCAAGATCTGAGATGACAAGAGATACTAGTGATGAAGATGGTGAGTCAGAAGAAGAGAAGGAAGCAGAAGAAGAAAATGACAAGGAAAATGAGAATGGAACCACAGAAAAATCTGATGATGAAATGTCCGAGCAGCCAGAAAGTGAAGATATCAATGACCCAACTGATGAGTCTGAGGAGGAAAAACCCAGAGCAAGTTCAAAACGTTCATCTAGAAAGAGGGGATCTGTAGGAAAAGCAAGAAGCAAGAAAGTTACAAGTTCCAATAAATCCGATTCAGCAAAATCAACATCGAAGAGGTCATCAGCAAGTCGTGCTAAGATTGATGACAGTGATAGTCCTAAGGTATTCTCTAGGAAGAAGAATAGCGAAAAAGTAAGCAAGGCCTCAACTCCACCAAAATCTGCTGCCAAGGAGAAGCCTGGGAAGAAGATTACGAAAGGGAAGGACAAGACCAAGGA
AGAGAAAACAAGGCCAAGCGATGATGTGCTTAGGGATGCAATATGTGAAATTCTTAAAGTAGTCGACTTCACTACGGCCACCTTCACCGACATTCTAAAGCAACTTGCTGGGCAATTCAAGATGGATCTCACCGCACAAAAGTCGTCGATAAAACTTATGATCCAAGAAGAGCTCACGAAACTGGCAGATGAAGCAGAAGACGAAGAGGACGGCGAAGGCGATGCCGATGCCGAGAAGGATGTAAAACAGGCCGCTCAAGAGGTGGAAACTTGA

Protein sequence

MGQEDATNNTFENTKNEASGGDLKTNSIETVENGNNKEDKMKNNVETVDNETTEEDKMTNTTETVTNGISELEKINETVPNGLENGVKEPEIEQCTDEKAEATKMEEKPKIKEDEESNEETVKEEKEEDVLPNDKKIEENVDIKDEENVDVKDVEDAKDKEIEAKDEEIEDAKDEEVEVFEDAKDEEIEDAKDEEVEDAKDEEVEDAKDEEIEDAKDEEIEDAKDEEIEDAKDEEIEDAKEEEIEDAKDKEIEDAKDEENEEEKEDAKDKVEKVDSHMEEDDKELKDEDPKEGKTKKARKRRGAVKSKGKNEEDEKDEVGIKTPIIDRPVRERKSVERLVASIERCVVKEFHIEKGRGTPLKDIPNVAFKLSRKKADDIFRLLHSILFGRRGKAFQIKSNISRFSGFVWHGDEEKQKNKVKEKFDKCHKEKLLELCDVLDIPDVKATTRKEDVIGKLIEFLMAPHATSTVLLAEIEKSSKGKKRKRTVKGGISTPGDDSSKQSAKSRRKRGNTARSEMTRDTSDEDGESEEEKEAEEENDKENENGTTEKSDDEMSEQPESEDINDPTDESEEEKPRASSKRSSRKRGSVGKARSKKVTSSNKSDSAKSTSKRSSASRAKIDDSDSPKVFSRKKNSEKVSKASTPPKSAAKEKPGKKITKGKDKTKEEKTRPSDDVLRDAICEILKVVDFTTATFTDILKQLAGQFKMDLTAQKSSIKLMIQEELTKLADEAEDEEDGEGDADAEKDVKQAAQEVET
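
The protein sequence above is consistent with a conceptual translation of the CDS. A minimal sketch of that translation follows, assuming the standard nuclear genetic code (NCBI translation table 1); the `translate` function and `CODON_TABLE` names are illustrative, and the two input strings are the first six and last five codons copied from the CDS listed in this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGGTCAAGAAGATGCA"))  # first six codons of the CDS
print(translate("GAGGTGGAAACTTGA"))     # last five codons, ending in TGA
```

The two calls print `MGQEDA` and `EVET`, matching the start and end of the protein sequence listed above.
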
BLAST of Cp4.1LG08g08310 vs. TrEMBL
Match: W9R1E2_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_018225 PE=4 SV=1)

HSP 1 Score: 460.3 bits (1183), Expect = 4.5e-126
Identity = 355/610 (58.20%), Positives = 445/610 (72.95%), Query Frame = 1

Query: 170 EDAKDEEVEVFEDAKDEEIEDAKDEEVEDAKDEEVEDAKDEEIEDAKDEEIEDAKDEEIE 229
           ED   E  E   +  D + + A D  V   K  E +  K+ E ++  D++ E  K EE  
Sbjct: 4   EDTVVEATETAANGTDAKEKTAND--VTKKKAAESDGPKEMEEDNKDDDKAEFDKMEEDT 63

Query: 230 DAKDEEIEDAK-EEEIEDAKDKEIEDAKDEENEEEKE---DAKDKV-EKVDSHMEEDD-K 289
           +AK  EI+DAK +EEI + K + +E+      EEE E   + K++V EKVD   EE+  K
Sbjct: 64  EAK--EIKDAKGKEEIGEGKVEVMEEDNGPNKEEEVEGKVETKEEVGEKVDGFKEEEKVK 123

Query: 290 ELKDEDPKEGKTKKAR-KRRGAVKSKGKNEEDE-KDEVGIKTPIIDRPVRERKSVERLVA 349
           + K ED  E K  K R K +   K+K K +E E K E+  +TP IDRP RERKSVERLVA
Sbjct: 124 DEKAEDTGEEKESKKRGKGKSGEKTKEKRKESETKKELEPRTPAIDRPQRERKSVERLVA 183

Query: 350 SIERCVVKEFHIEKGRGTPLKDIPNVAFKLSRKKADDIFRLLHSILFGRRGKAFQIKSNI 409
           ++E+   KEFHIEKGRGTPLKDIPNVAFKLSR+K DD F+LLH+ILFGRRGKAFQIKSNI
Sbjct: 184 TVEKESHKEFHIEKGRGTPLKDIPNVAFKLSRRKTDDTFKLLHTILFGRRGKAFQIKSNI 243

Query: 410 SRFSGFVWHGDEEKQKNKVKEKFDKCHKEKLLELCDVLDIPDVKATTRKEDVIGKLIEFL 469
           SRFSGFVWH +EEKQK KVKEKFDKC+KEKLLE CDVLDIP  KATTRKED++ KLI+FL
Sbjct: 244 SRFSGFVWHENEEKQKIKVKEKFDKCNKEKLLEFCDVLDIPIAKATTRKEDIVSKLIDFL 303

Query: 470 MAPHATSTVLLAEIEKSSKGKKRKRTVKGGISTPGDDSSKQSAKSRRKRGNTARSEMTRD 529
           +AP+AT+ VLLAE EKS+KGKKRKR  KG  S  G  ++K+S K+R K  + +++E  ++
Sbjct: 304 VAPYATTAVLLAEKEKSNKGKKRKRAAKGSSSASG-GTTKRSVKNRIKNEDDSKAEEKKN 363

Query: 530 TSD-------EDGESEEEKEAEEENDKENENGTTEKSDDEMSEQPESEDINDPTDESEE- 589
           T+D       E+ E EE+KE +EEN++ENENG  +KS+DE+ E+ ESE+ +D  DESEE 
Sbjct: 364 TTDTEDESSEEEEEEEEDKEEDEENEEENENGVPDKSEDELPEKSESEEKSDSEDESEEE 423

Query: 590 -EKPRASSKRSSRKRGSVGKARSKKVTSSNKSD-SAKSTSKRSSASRAKIDDSD--SPKV 649
            EK R SSK++S+K+ S GKA++K+ T S KS    K T K+S++ R+K DD    SPKV
Sbjct: 424 VEKRRRSSKKASQKKESAGKAKTKRTTVSPKSSPPPKRTPKKSASKRSKGDDGSDTSPKV 483

Query: 650 FSRKKNSEKVSKASTPPKSAAKEKPGKKITKGKDKTKEEKTRPSDDVLRDAICEILKVVD 709
           FSRKK SEKV+KASTP K+A+KEK GK+  KGK+K+K+EK++P+DD LRDAICEILK VD
Sbjct: 484 FSRKKTSEKVAKASTPAKAASKEKTGKRTAKGKEKSKKEKSKPTDDELRDAICEILKEVD 543

Query: 710 FTTATFTDILKQLAGQFKMDLTAQKSSIKLMIQEELTKLADEAEDEEDGEGD--ADAEKD 757
           F TATFTDILKQLA QF  DLT +KSSIK+MIQEELTKLADEA+++EDG+GD   DAEKD
Sbjct: 544 FNTATFTDILKQLAKQFDTDLTPRKSSIKIMIQEELTKLADEADEDEDGDGDGEGDAEKD 603
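
The percentages in each HSP summary line are plain ratios over the alignment length: identical positions, and positions with a positive substitution score (identities included). A small sketch for recomputing them from a summary line is shown here; the `check_hsp_line` helper and its regular expression are illustrative and not part of any BLAST tool.

```python
import re

# Matches fragments such as "Identity = 355/610 (58.20%)".
HSP_STAT = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+\.\d+)%\)")

def check_hsp_line(line):
    """Return {label: (recomputed_percent, reported_percent)}."""
    results = {}
    for label, num, den, reported in HSP_STAT.findall(line):
        computed = round(100 * int(num) / int(den), 2)
        results[label] = (computed, float(reported))
    return results

line = "Identity = 355/610 (58.20%), Positives = 445/610 (72.95%), Query Frame = 1"
for label, (computed, reported) in check_hsp_line(line).items():
    print(label, computed, reported)
```

For the first hit, both recomputed values agree with the reported ones: 355/610 rounds to 58.20% and 445/610 to 72.95%.
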

BLAST of Cp4.1LG08g08310 vs. TrEMBL
Match: M5WUP2_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003409mg PE=4 SV=1)

HSP 1 Score: 454.5 bits (1168), Expect = 2.5e-124
Identity = 334/588 (56.80%), Positives = 426/588 (72.45%), Query Frame = 1

Query: 184 KDEEIEDAKDEEVEDAKDEEVEDAKDEEIEDAKDEEIEDAKDEEIEDAKDEEIEDAKEEE 243
           +D E + ++DE+++    EE E+ K EE+E+      +   +E+ E+ ++ E+E+ K ++
Sbjct: 3   EDLEADGSRDEKLK----EEKEELKVEEMEEETGPNEKKETEEKTEEKREFEVEEDKVKK 62

Query: 244 IEDAKDKEIEDAKDEENEEEKEDAKDKVEKVDSHMEEDDKELKDEDPKEGKTKKAR---- 303
            E+ K +E+E+ K+EE EEEKE+ K++       MEE+ +E K+ED +E + K+A+    
Sbjct: 63  AEE-KAEELEEEKEEEKEEEKEEEKEE------EMEEEKEEEKEEDKEEARAKEAKVEKG 122

Query: 304 -KRRGAVKSKGKNEED-----EKDEVGIKTPIIDRPVRERKSVERLVASIERCVVKEFHI 363
            ++RG  KS  K +E      EK E   +TP  DRPVRERKSVERLVASIE+  V+EF I
Sbjct: 123 PRKRGKGKSVEKTKEKKKEVAEKKEPEQRTPTTDRPVRERKSVERLVASIEKDAVREFQI 182

Query: 364 EKGRGTPLKDIPNVAFKLSRKKADDIFRLLHSILFGRRGKAFQIKSNISRFSGFVWHGDE 423
           EKGRGTPLKDIPNVAFKLSR+K DD  +LLH+ILFGRRGKA ++KSNISRFSGFVW G+E
Sbjct: 183 EKGRGTPLKDIPNVAFKLSRRKLDDSLKLLHTILFGRRGKAVEVKSNISRFSGFVWRGNE 242

Query: 424 EKQKNKVKEKFDKCHKEKLLELCDVLDIPDVKATTRKEDVIGKLIEFLMAPHATSTVLLA 483
           +KQK KVKEKFDKC+KEKLLE CD+L++P  KATTRKED++ KLI+FL++PHAT+T LLA
Sbjct: 243 DKQKTKVKEKFDKCNKEKLLEFCDLLNLPISKATTRKEDIVAKLIDFLVSPHATTTSLLA 302

Query: 484 EIEKSSKGKKRKRTVKGGISTPGDDSSKQSAKSRRKRGNTARSEMTRDTSDEDGESEEEK 543
           E ++SSKGKKRKR  KG  ST G  +SK+SAK+RRK  + ++ +       ED   E+EK
Sbjct: 303 E-KESSKGKKRKRATKGSSSTSGGTNSKRSAKNRRKNDDDSKLDDKSAADTEDESEEDEK 362

Query: 544 EAEEENDKENENGTTEKSDDEMSEQPESEDINDPTDESEE--EKPRASSKRSSRKRGSVG 603
           E EE  ++ENENG  E S+DE  E  ESE+  D +D+SEE  EK +   K SSRK+GS  
Sbjct: 363 EDEENVEEENENGVRENSEDETPEHSESEEKLDSSDDSEEEVEKQKPRRKSSSRKKGSSA 422

Query: 604 KARSKKVTSSNKSDSAKSTS-KRSSASRAKIDDSD--SPKVFSRKKNSEKVSKASTPPKS 663
           KA++KK T S KS    + S K+SS+ R  +DD    SPK  SRKK +EKVSK  TP KS
Sbjct: 423 KAQTKKATGSAKSTPPPTKSRKKSSSKRTPVDDDSDTSPKASSRKKKNEKVSKVPTPTKS 482

Query: 664 AAKEKPGKKITKGKDKTKEEKTRPSDDVLRDAICEILKVVDFTTATFTDILKQLAGQFKM 723
           A+KEKPGKK+ KGKDKTKEEK RPSDD LRDAIC+ILK VDF TATFTDILKQLA QF  
Sbjct: 483 ASKEKPGKKVAKGKDKTKEEKLRPSDDKLRDAICQILKEVDFNTATFTDILKQLARQFDT 542

Query: 724 DLTAQKSSIKLMIQEELTKLADEAEDEEDGEGDADAEKDVKQAAQEVE 757
           DL+ +KSSIKLMIQEELTKLADEA DEED EG  + + + + A QEVE
Sbjct: 543 DLSPRKSSIKLMIQEELTKLADEA-DEEDEEGGPEKD-ETESAGQEVE 576

BLAST of Cp4.1LG08g08310 vs. TrEMBL
Match: A0A061FC21_THECC (DEK domain-containing chromatin associated protein isoform 2 OS=Theobroma cacao GN=TCM_030649 PE=4 SV=1)

HSP 1 Score: 451.8 bits (1161), Expect = 1.6e-123
Identity = 342/598 (57.19%), Positives = 432/598 (72.24%), Query Frame = 1

Query: 170 EDAKDEEVEVFEDAKD--EEIEDAKDEEVEDAKDEEVEDAKDEEIEDAKDEEIEDAKDEE 229
           E+ K E +E   +     E+  +A  E+ E+  +   E  +D+++E  K +E +  K++E
Sbjct: 4   EETKAEALEPVANGTSLPEKSGEAVAEKTEEENNGVKEMEEDKKVETEKMDEDQQVKEDE 63

Query: 230 IEDAKDEEIEDAKEEEIEDAKDKEIEDAKDEENEEEKEDAKDKVEKVDSHMEEDDKELKD 289
             ++K+E  ++ KEE   +A ++EI D K+ + ++EKE+ KD+VE+ D   EE+++E K 
Sbjct: 64  --ESKEELEKEEKEEPETEAMEEEI-DPKENDKKDEKEENKDEVEEKDGLKEEEEEEQKA 123

Query: 290 EDPKE--GKTKKARKRRGAVKSKGKNEEDEKDEVGIKTPIIDRPVRERKSVERLVASIER 349
           ++ KE  G  K+ + +    K KGK ++ EK E   +TP+ DRPVRERKSVERLVASI++
Sbjct: 124 KESKEKKGSKKRGKNQNAGEKVKGKTKKMEKKEPEQRTPLTDRPVRERKSVERLVASIDK 183

Query: 350 CVVKEFHIEKGRGTPLKDIPNVAFKLSRKKADDIFRLLHSILFGRRGKAFQIKSNISRFS 409
              KEF IEKGRGTPLKDIPNVAFKLSR+K DD FRLLH+ILFGRRGKA QIKSNISRFS
Sbjct: 184 DASKEFQIEKGRGTPLKDIPNVAFKLSRRKTDDTFRLLHTILFGRRGKAVQIKSNISRFS 243

Query: 410 GFVWHGDEEKQKNKVKEKFDKCHKEKLLELCDVLDIPDVKATTRKEDVIGKLIEFLMAPH 469
           GFVWH +EEKQK KVK+KFDKC+KEKLLE CDVLDIP +KATTRKED++ KLI+FL+AP 
Sbjct: 244 GFVWHENEEKQKTKVKDKFDKCNKEKLLEFCDVLDIPIMKATTRKEDIVAKLIDFLVAPQ 303

Query: 470 ATSTVLLAEIEKSSKGKKRKRTVKGGISTPGDDSSKQSAKSRRKRGNTARSEMTRDTSDE 529
           AT+TVLL+E EKSSK KKRKR +K G +      SK+S +SR+K  +T +S        E
Sbjct: 304 ATTTVLLSEKEKSSKSKKRKRVIKSGTT------SKRSTRSRKKSEDTPKSGKKSAPDSE 363

Query: 530 DGESEEEKEAEEENDKENENGTTEKSDDEMSEQPESEDINDPTDESEEE--KPRASSKRS 589
           D   EEEKE EE  ++ENENG TEKS+DEM E  ESE+ N+  DESEEE  K + S+K S
Sbjct: 364 DESEEEEKEEEENEEEENENGITEKSEDEMPEDSESEEKNETEDESEEEVGKKKKSTKVS 423

Query: 590 SRKRGSVGKARSKKVTSSNKSDSAKSTSKRSSASRAKIDDS--DSPKVFSRKKNSEKVSK 649
           S K+ S GKA  KKVT   +S + +  + ++S+  +K+DD    SPKV SRKK +EKV+K
Sbjct: 424 SSKKESAGKATPKKVTVPKRSSTPQKRTPKTSSKSSKVDDDIDKSPKVSSRKK-TEKVTK 483

Query: 650 --ASTPPKSAAKEKPGKKITKGKDKTKEEKTRPSDDVLRDAICEILKVVDFTTATFTDIL 709
             +STP KSA+KEK  KK+ KGKDK KEEK +PSD  LRDAICEILK VDF TATFTDIL
Sbjct: 484 EKSSTPTKSASKEKTSKKVAKGKDKAKEEKLKPSDHELRDAICEILKEVDFNTATFTDIL 543

Query: 710 KQLAGQFKMDLTAQKSSIKLMIQEELTKLADEAEDEEDGEGDADAEKDVKQAA-QEVE 757
           K LA QF  DLT +KSSIKLMIQEELTKLADEA D+EDGEG  DAEKD  Q+A QEVE
Sbjct: 544 KLLARQFDTDLTPRKSSIKLMIQEELTKLADEA-DDEDGEG--DAEKDENQSAGQEVE 588

BLAST of Cp4.1LG08g08310 vs. TrEMBL
Match: A0A067KSV3_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_01033 PE=4 SV=1)

HSP 1 Score: 444.1 bits (1141), Expect = 3.4e-121
Identity = 381/716 (53.21%), Positives = 483/716 (67.46%), Query Frame = 1

Query: 73  EKINETVPNGLENGVKEPEIE-QCTDEKAEATKMEEKPKIKEDEESNEETVKEEKEEDVL 132
           E+     P   ENG   PEI  +   EK E    + K +I ED+  NE++V E+ +ED  
Sbjct: 4   EETTTEAPKVAENGTSAPEISGKVVTEKTEEESNKVK-EIDEDKNDNEKSVTEKMDEDPK 63

Query: 133 PNDKKIEENVDIKDEENVDVKDVEDAKDKEIEAKDEEIEDAKDE-EVEVFEDAKDEEIED 192
            N +K E N D + EE  + K+V   KD   E K E +ED K E + E  E+ K E   +
Sbjct: 64  VNQEK-ESNGDTEREEVEEEKEVRKEKD---EHKTEAMEDEKAEHKAEAMEEEKTEHKAE 123

Query: 193 AKDEEVEDAKDEEVEDAKDEEIEDAKDEEIEDAKDEEIEDAKDEEIEDAKEEEIEDAKDK 252
           AK+EE  + K E +++ K E   +A +EE+E   DEE+E+   EE ++  E+++  +KDK
Sbjct: 124 AKEEEKAEHKAEAMKEEKAEHEGEAMEEELEPKGDEEVEEK--EETKEEVEDKVNGSKDK 183

Query: 253 EIEDAKDEENE------EEKEDAKDKVEKVDSHMEEDDKELKDEDPKEGKTKKAR-KRRG 312
             E+ K+E  +      +++E+A+ K E  D      DKE K E+ ++G  K+A+ K  G
Sbjct: 184 GEEETKEEVEDKVNGSKDKEEEAETKEEVEDKVKGSKDKEEKVEENEKGSKKRAKGKNAG 243

Query: 313 AVKSKGKNEEDEKDEVGIKTPIIDRPVRERKSVERLVASIERCVVKEFHIEKGRGTPLKD 372
               K K   +EK ++  +TP+ DRP RERKSVERLVASIE+  VKEFHIEKGRGTPLKD
Sbjct: 244 EKVQKKKKLGEEKKQLEPRTPVTDRPQRERKSVERLVASIEKDAVKEFHIEKGRGTPLKD 303

Query: 373 IPNVAFKLSRKKADDIFRLLHSILFGRRGKAFQIKSNISRFSGFVWHGDEEKQKNKVKEK 432
           IPNVAFKLSR+K DD F+LLH+ILFGRRGKA QIKSNISRFSGFVWH + EKQK KVKEK
Sbjct: 304 IPNVAFKLSRRKTDDTFKLLHTILFGRRGKAIQIKSNISRFSGFVWHENVEKQKIKVKEK 363

Query: 433 FDKCHKEKLLELCDVLDIPDVKATTRKEDVIGKLIEFLMAPHATSTVLLAEIEKSSKGKK 492
           FDKC+KEKLLE CDVLDI   KATT+KED++ KLIEFL+APHAT+TVLLAE EK+SK KK
Sbjct: 364 FDKCNKEKLLEFCDVLDISVAKATTKKEDIVAKLIEFLLAPHATTTVLLAEKEKASKSKK 423

Query: 493 RKRTVKGGISTPGDDSSKQSAKSRR------KRGNTARSEMTRDTSDEDGESEEEKE--- 552
           RKR  K   S  G  SS +SAKS++      K G     +   ++ +E GE EEE+E   
Sbjct: 424 RKRMTKS--SASGSSSSTRSAKSQKKSEYFSKSGKKGTPDSEEESDEEKGEEEEEEEEEE 483

Query: 553 --AEEENDKENENGTTEKSDDEMSEQPESEDINDPTDESEEE--KPRASSKRSSRKRGSV 612
              EEEN +ENENG  EKSD EM +  ++++ N+  +ESEE+  K + SSK SSRK+ S 
Sbjct: 484 DVEEEENVEENENGLPEKSDAEMPDHSDNDE-NESEEESEEDVGKRKRSSKTSSRKKESA 543

Query: 613 GKARSKKVTSSNKSDSAKSTSKRSSASRAKID-DSD-SPKVFSRKKNSEKVS--KASTPP 672
            KA+ +K+T S KS   K ++K+SS   ++ D DSD SPKVFSRKK SEK +  K+STP 
Sbjct: 544 EKAKPRKITISTKSGPPKGSTKKSSLKHSEADEDSDASPKVFSRKKKSEKAAKEKSSTPA 603

Query: 673 KSAAKEKPG-KKITKGKDK----TKEEKTRPSDDVLRDAICEILKVVDFTTATFTDILKQ 732
           KS +KEK G KK  KGK+K     KE+K +P+D+ LRDAICEILK VDF TATFTDILKQ
Sbjct: 604 KSPSKEKTGKKKAAKGKEKATGTAKEDKLKPTDNELRDAICEILKEVDFNTATFTDILKQ 663

Query: 733 LAGQFKMDLTAQKSSIKLMIQEELTKLADEAEDEEDGEGDADAEKDVKQAA-QEVE 757
           LA QF  DLT +KSSIK+MIQEELTKLADEA D+EDGEG  DAEKD  Q+A QEVE
Sbjct: 664 LARQFDTDLTERKSSIKIMIQEELTKLADEA-DDEDGEG--DAEKDENQSAGQEVE 706

BLAST of Cp4.1LG08g08310 vs. TrEMBL
Match: A0A061F550_THECC (DEK domain-containing chromatin associated protein isoform 1 OS=Theobroma cacao GN=TCM_030649 PE=4 SV=1)

HSP 1 Score: 437.6 bits (1124), Expect = 3.1e-119
Identity = 342/624 (54.81%), Positives = 432/624 (69.23%), Query Frame = 1

Query: 170 EDAKDEEVEVFEDAKD--EEIEDAKDEEVEDAKDEEVEDAKDEEIEDAKDEEIEDAKDEE 229
           E+ K E +E   +     E+  +A  E+ E+  +   E  +D+++E  K +E +  K++E
Sbjct: 4   EETKAEALEPVANGTSLPEKSGEAVAEKTEEENNGVKEMEEDKKVETEKMDEDQQVKEDE 63

Query: 230 IEDAKDEEIEDAKEEEIEDAKDKEIEDAKDEENEEEKEDAKDKVEKVDSHMEEDDKELKD 289
             ++K+E  ++ KEE   +A ++EI D K+ + ++EKE+ KD+VE+ D   EE+++E K 
Sbjct: 64  --ESKEELEKEEKEEPETEAMEEEI-DPKENDKKDEKEENKDEVEEKDGLKEEEEEEQKA 123

Query: 290 EDPKE--GKTKKARKRRGAVKSKGKNEEDEKDEVGIKTPIIDRPVRERKSVERLVASIER 349
           ++ KE  G  K+ + +    K KGK ++ EK E   +TP+ DRPVRERKSVERLVASI++
Sbjct: 124 KESKEKKGSKKRGKNQNAGEKVKGKTKKMEKKEPEQRTPLTDRPVRERKSVERLVASIDK 183

Query: 350 CVVKEFHIEKGRGTPLKDIPNV--------------------------AFKLSRKKADDI 409
              KEF IEKGRGTPLKDIPNV                          AFKLSR+K DD 
Sbjct: 184 DASKEFQIEKGRGTPLKDIPNVVNLNSSFLLSLFVTEFIVFALSSKHVAFKLSRRKTDDT 243

Query: 410 FRLLHSILFGRRGKAFQIKSNISRFSGFVWHGDEEKQKNKVKEKFDKCHKEKLLELCDVL 469
           FRLLH+ILFGRRGKA QIKSNISRFSGFVWH +EEKQK KVK+KFDKC+KEKLLE CDVL
Sbjct: 244 FRLLHTILFGRRGKAVQIKSNISRFSGFVWHENEEKQKTKVKDKFDKCNKEKLLEFCDVL 303

Query: 470 DIPDVKATTRKEDVIGKLIEFLMAPHATSTVLLAEIEKSSKGKKRKRTVKGGISTPGDDS 529
           DIP +KATTRKED++ KLI+FL+AP AT+TVLL+E EKSSK KKRKR +K G +      
Sbjct: 304 DIPIMKATTRKEDIVAKLIDFLVAPQATTTVLLSEKEKSSKSKKRKRVIKSGTT------ 363

Query: 530 SKQSAKSRRKRGNTARSEMTRDTSDEDGESEEEKEAEEENDKENENGTTEKSDDEMSEQP 589
           SK+S +SR+K  +T +S        ED   EEEKE EE  ++ENENG TEKS+DEM E  
Sbjct: 364 SKRSTRSRKKSEDTPKSGKKSAPDSEDESEEEEKEEEENEEEENENGITEKSEDEMPEDS 423

Query: 590 ESEDINDPTDESEEE--KPRASSKRSSRKRGSVGKARSKKVTSSNKSDSAKSTSKRSSAS 649
           ESE+ N+  DESEEE  K + S+K SS K+ S GKA  KKVT   +S + +  + ++S+ 
Sbjct: 424 ESEEKNETEDESEEEVGKKKKSTKVSSSKKESAGKATPKKVTVPKRSSTPQKRTPKTSSK 483

Query: 650 RAKIDDS--DSPKVFSRKKNSEKVSK--ASTPPKSAAKEKPGKKITKGKDKTKEEKTRPS 709
            +K+DD    SPKV SRKK +EKV+K  +STP KSA+KEK  KK+ KGKDK KEEK +PS
Sbjct: 484 SSKVDDDIDKSPKVSSRKK-TEKVTKEKSSTPTKSASKEKTSKKVAKGKDKAKEEKLKPS 543

Query: 710 DDVLRDAICEILKVVDFTTATFTDILKQLAGQFKMDLTAQKSSIKLMIQEELTKLADEAE 757
           D  LRDAICEILK VDF TATFTDILK LA QF  DLT +KSSIKLMIQEELTKLADEA 
Sbjct: 544 DHELRDAICEILKEVDFNTATFTDILKLLARQFDTDLTPRKSSIKLMIQEELTKLADEA- 603

BLAST of Cp4.1LG08g08310 vs. TAIR10
Match: AT4G26630.1 (AT4G26630.1 DEK domain-containing chromatin associated protein)

HSP 1 Score: 331.6 bits (849), Expect = 1.2e-90
Identity = 339/783 (43.30%), Positives = 466/783 (59.51%), Query Frame = 1

Query: 1   MGQEDATNNTFENTKNEASGGDLKTNSIETVENGNNKEDKMKNNVETVDNETTEEDKMTN 60
           MG++  T  T E T N+ +  +  + ++   EN   KE +        D +  E D M  
Sbjct: 1   MGED--TKATIEPTANKTTSLEKPSEAMAGKENAGGKETQELAK----DEDMAEPDNMEI 60

Query: 61  TTETVTNGISELEKINETVPNGLENGVKEPEIEQCTDEKAEATKMEEKPKIKEDEESNEE 120
                    ++++K +E      E   KE E+++  ++ AE  KMEEK ++ +DE   E 
Sbjct: 61  D--------AQIKKDDEKA----ETEDKESEVKK-NEDNAETQKMEEKVEVTKDEGQAEA 120

Query: 121 TVKEEKEEDVLPNDKKIEENVDIKD---EENVDVKDVEDAKDKEIEAKDEEIEDAK---- 180
           T     +ED     ++ ++ V ++D   +ENV+ KD   AKD E E K+ +I +A     
Sbjct: 121 T---NMDEDADGKKEQTDDGVSVEDTVMKENVESKDNNYAKDDEKETKETDITEADHKKA 180

Query: 181 -----DEEVEVFEDAKDEEIEDAKDEE--VEDAKDEEVEDAKDEEIEDAKDEEIEDAKDE 240
                  E +     KD    D K+E   V++ K  ++++  +   E+ + E +E  + E
Sbjct: 181 GKEDIQHEADKANGTKDGNTGDIKEEGTLVDEDKGTDMDEKVENGDENKQVENVEGKEKE 240

Query: 241 EIEDAKDEEIEDAKEEEIEDAKDKEIEDAKDEENEEEKEDAKDKVEKVDSHMEEDDKELK 300
           + E+ K +E+E AK  E++++K ++ ++  ++EN+ EK ++KD  E       +D ++ K
Sbjct: 241 DKEENKTKEVEAAK-AEVDESKVEDEKEGSEDENDNEKVESKDAKEDEKEETNDDKEDEK 300

Query: 301 DEDPKEGKTKKARKRRGAVKSKGKNEEDEKDEVGIKTPIIDRPVRERKSVERLVASIERC 360
           +E     K  K     G V+ K K EE +KD    +TP  DRPVRERKSVERLVA I++ 
Sbjct: 301 EESKGSKKRGKGTSSGGKVREKNKTEEVKKD-AEPRTPFSDRPVRERKSVERLVALIDKD 360

Query: 361 VVKEFHIEKGRGTPLKDIPNVAFKLSRKKADDIFRLLHSILF-GRRGKAFQIKSNISRFS 420
             KEF +EKGRG  LKDIPNVA K+ RK++D+  +LLH ILF GRRGKA QIK+NI  FS
Sbjct: 361 SSKEFRVEKGRGAYLKDIPNVANKVMRKRSDETLKLLHPILFGGRRGKAAQIKTNILGFS 420

Query: 421 GFVWHGDEEKQKNKVKEKFDKCHKEKLLELCDVLDIPDVKATTRKEDVIGKLIEFLMAPH 480
           GFVWHGDE+K K KVKEK +KC KEKL E CDVLDI   KATT+KED+I KL EFL  PH
Sbjct: 421 GFVWHGDEKKAKEKVKEKLEKCTKEKLWEFCDVLDIHITKATTKKEDIITKLFEFLEKPH 480

Query: 481 ATSTV----LLAEIEKSSKGKKRKRTVKGGISTPGDDSSKQSAKSRRKRGNTARSEMTRD 540
            T  V     ++E EKSSKG KRKRT K    T G  SSK+SAKS +K+   A   + + 
Sbjct: 481 VTGDVTGDTTVSEKEKSSKGAKRKRTPKKTSPTAGSSSSKRSAKS-QKKSEEATKVVKKS 540

Query: 541 TSDEDGESEEEK------------EAEEENDKENENGTTEKSDDEMSEQPESEDINDPTD 600
            +  D ESEEEK            E EE+ ++ENENG  +KS+DE  +  ESE+ ++  +
Sbjct: 541 LAHSDDESEEEKEEEEKQEEEKAEEKEEKKEEENENGIPDKSEDEAPQPSESEEKDESEE 600

Query: 601 ESEEE--KPRASSKRSSRKRGSVGKARSKK-VTSSNKSDSAKSTSKRSSASRAKIDDSD- 660
            SEEE  K +  S+ S+ K+ S G+AR+KK V ++  S   K T KRSSA R K DD   
Sbjct: 601 HSEEETTKKKRGSRLSAGKKESAGRARNKKAVVAAKSSPPEKITQKRSSAKRKKTDDDSD 660

Query: 661 -SPKVFSRKKNSEKVSKAS-TPPKSAAKEKPGKKITKGKDKTKEEKTRPSDDVLRDAICE 720
            SPK  S++K SE   KAS  P KSA+KEKP K+  KGKDK       PSD VL++AI E
Sbjct: 661 TSPKASSKRKKSENPIKASPAPSKSASKEKPVKRAGKGKDK-------PSDKVLKNAIVE 720

Query: 721 ILKVVDFTTATFTDILKQLAGQFKMDLTAQKSSIKLMIQEELTKLADEAEDEEDGEGDAD 747
           ILK VDF+TATFTDILK+LA +F  DLT +KSSIK++IQEELTKLADE E+EE  E D++
Sbjct: 721 ILKRVDFSTATFTDILKELAKEFTEDLTPRKSSIKMIIQEELTKLADEEEEEEKKEEDSE 751

BLAST of Cp4.1LG08g08310 vs. TAIR10
Match: AT5G55660.1 (AT5G55660.1 DEK domain-containing chromatin associated protein)

HSP 1 Score: 329.7 bits (844), Expect = 4.7e-90
Identity = 334/757 (44.12%), Positives = 461/757 (60.90%), Query Frame = 1

Query: 15  KNEASGGDLKTNSIETVENGNNKEDKMKNNVETVDNETTEEDKMTNTTETVTNGISELEK 74
           + E+  GD +    E  E  +  EDK +   + +D +T  +DK     + V+   +E + 
Sbjct: 65  EGESETGDKEVEVTEE-EKKDVGEDKEQPEADKMDEDT--DDKNLKADDGVSGVATEEDA 124

Query: 75  I-NETVPNGLENGVKEPEIEQCTDEKAEATKMEE-KPKIKEDEESNEETVKEEKEEDVLP 134
           +  E+V +      + PE EQ  + K E  K+E  K    E+ ++ E+ V  +K +DV  
Sbjct: 125 VMKESVESADNKDAENPEGEQEKESKEE--KLEGGKANGNEEGDTEEKLVGGDKGDDVDE 184

Query: 135 NDKKIEENVDIKD-EENVDVKDVEDAKDKEIEAKDEEIEDA-KDEEVEVFEDAKDEEIED 194
            +K   ENVD  D EE +  K+  +  ++E   K EE+++A K+++VE      + E+ED
Sbjct: 185 AEKV--ENVDEDDKEEALKEKNEAELAEEEETNKGEEVKEANKEDDVEADTKVAEPEVED 244

Query: 195 AKDEEVEDAKDEEVEDAKDEEIEDAKDEEIEDAKDEEIEDAKDEEIEDAKEEEIEDAKDK 254
            K E  ++ +D      K+EE ED K+E ++D K++E E++ D++ ED KEE  +D +DK
Sbjct: 245 KKTESKDENED------KEEEKEDEKEESMDD-KEDEKEESNDDDKEDEKEESNDDKEDK 304

Query: 255 EIEDAKDEENEEEKEDAKDKVEKVDSHMEEDDKELKDEDPKEGKTKKARKRRGAVKSKGK 314
                        KED K                 K     +GKT+K R        K K
Sbjct: 305 -------------KEDIK-----------------KSNKRGKGKTEKTR-------GKTK 364

Query: 315 NEEDEKDEVGIKTPII-DRPVRERKSVERLVASIERCVVKEFHIEKGRGTPLKDIPNVAF 374
           ++E++KD +  KTP   DRPVRERKSVERLVA +++   +EFH+EKG+GTPLKDIPNVA+
Sbjct: 365 SDEEKKD-IEPKTPFFSDRPVRERKSVERLVAVVDKDSSREFHVEKGKGTPLKDIPNVAY 424

Query: 375 KLSRKKADDIFRLLHSILF-GRRGKAFQIKSNISRFSGFVWHGDEEKQKNKVKEKFDKCH 434
           K+SRKK+D++F+ LH+ILF G+R KA Q+K++I RFSG+ W GDEEK K KVKEKF+K +
Sbjct: 425 KVSRKKSDEVFKQLHTILFGGKRVKATQLKAHILRFSGYKWQGDEEKAKLKVKEKFEKIN 484

Query: 435 KEKLLELCDVLDIPDVKATTRKEDVIGKLIEFLMAPHATSTVLLAEIEKSSKGKKRKRTV 494
           KEKLLE CD+ DI   KATT+KED++ KL+EFL  PHAT+ VL+ E E   KG KRKRT 
Sbjct: 485 KEKLLEFCDLFDISVAKATTKKEDIVTKLVEFLEKPHATTDVLVNEKE---KGVKRKRTP 544

Query: 495 KGGISTPGDDSSKQSAKSRRKRGNTARSEMTRDTSDEDGESEEEKEAEEENDK------- 554
           K      G  SSK+SAKS++K     R+   +  +  D ESEEEKE +EE +K       
Sbjct: 545 KKSSPAAGSSSSKRSAKSQKKTEEATRTN-KKSVAHSDDESEEEKEDDEEEEKEQEVEEE 604

Query: 555 --ENENGTTEKSDDEMSEQPESEDINDPTDESEEE--KPRASSKRSSRKRGSVGKARSKK 614
             ENENG  +KS+DE  +  ESE+  +  +ESEEE  K +  S+ SS K+ S GK+RSKK
Sbjct: 605 EEENENGIPDKSEDEAPQLSESEENVESEEESEEETKKKKRGSRTSSDKKESAGKSRSKK 664

Query: 615 VTSSNKSD-SAKSTSKRSSASRAKIDDSD--SPKVFSRKKNSEKVSK--ASTPPKSAAKE 674
                KS    K+T KRS+  R K DD    SPK  S++K +EK +K  A+ P KS +KE
Sbjct: 665 TAVPTKSSPPKKATQKRSAGKRKKSDDDSDTSPKASSKRKKTEKPAKEQAAAPLKSVSKE 724

Query: 675 KP--GKKITKGKDKTKEEKTRPSDDVLRDAICEILKVVDFTTATFTDILKQLAGQFKMDL 734
           KP  GK+  KGKDK KE    PSD+ L+ AI +ILK VDF TATFTDILK+L  +F + L
Sbjct: 725 KPVIGKRGGKGKDKNKE----PSDEELKTAIIDILKGVDFNTATFTDILKRLDAKFNISL 761

Query: 735 TAQKSSIKLMIQEELTKLADEAEDEEDGEGDADAEKD 748
            ++KSSIK MIQ+ELTKLADEAEDEE  E DA+ E++
Sbjct: 785 ASKKSSIKRMIQDELTKLADEAEDEEGEEEDAEHEEE 761

BLAST of Cp4.1LG08g08310 vs. TAIR10
Match: AT5G63550.2 (AT5G63550.2 DEK domain-containing chromatin associated protein)

HSP 1 Score: 198.0 bits (502), Expect = 2.1e-50
Identity = 208/547 (38.03%), Positives = 304/547 (55.58%), Query Frame = 1

Query: 235 EIEDAKEEEIEDAKDKEIEDAKDEENEEEKEDAKDKVEKVDSHMEEDDKELKDEDPKEGK 294
           E  D K  E+     +EI+    EE E EKE       KVDS    + +E K ED +EG+
Sbjct: 4   ETLDEKTPEVNSPAKEEIDVVPKEEKEVEKE-------KVDSPRIGEAEEEKKEDEEEGE 63

Query: 295 TKKARKRRGAVKSKGKNEEDEKDEVG----------IKTPIIDRPVRERKSVER--LVAS 354
            K+        +   ++EE+E++E G            TP  +RP RERK VER  L   
Sbjct: 64  AKEGELGEKDKEDDVESEEEEEEEEGSGSKKSSEKETVTPTSERPTRERKKVERFSLSTP 123

Query: 355 IERCVVKEFHIEKGRGTPLKDIPNVAFKLSRKKADDIFRLLHSILFGRRGKAFQIKSNIS 414
           +     K   IEKGRGTPL++IPNVA KLS++KADD   LLH+ILFG++ KA  +K NI 
Sbjct: 124 MRAPPSKSVSIEKGRGTPLREIPNVAHKLSKRKADDNLMLLHTILFGKKAKAQMVKRNIG 183

Query: 415 RFSGFVW-HGDEEKQKNKVKEKFDKCHKEKLLELCDVLDIPDVKATTRKEDVIGKLIEFL 474
           +FSGF W   +EEKQ+ ++KEK DKC KEKL+  CDVLDIP  ++  +KE++  K++EFL
Sbjct: 184 QFSGFAWSEKEEEKQRARIKEKIDKCVKEKLIVFCDVLDIPISRSNVKKEELAVKVLEFL 243

Query: 475 MAPHATSTVLLAEIEKSSKGKKRKRTVKGGISTPGDDSSKQSAKSRRKRGNTARSEMTRD 534
            +P  T  V++A+ EK  + KKRK T K G S    +SS   AK +R+   T + ++  D
Sbjct: 244 ESPKETRDVIIADQEK--QAKKRKSTPKRGKS---GESSDTPAKRKRQ---TKKRDLPSD 303

Query: 535 TSDEDGESEEEKEAEEENDKENENGTTEKSDDEMSEQPESEDINDPTDESEEEKPRASSK 594
           T  E+G+ E + ++E  ND   E+    + + +     E  D +D  DE E EKP  S K
Sbjct: 304 T--EEGKDEGDADSEGTNDPHEEDDAAPEEESD----HEKTDTDDEKDEVEVEKP--SKK 363

Query: 595 RSSRKR------GSVGKARSKKVTSSNKSDS------AKSTSKRSSASRAKIDDSDSPKV 654
           +SS K+      GS GK +      S +S        AKSTS  S A + K+D  +S K 
Sbjct: 364 KSSSKKTVEESSGSKGKDKQPSAKGSARSGEKSSKQIAKSTS--SPAKKQKVDHVESSKE 423

Query: 655 FSRKKNSEKVSKASTPPKSAAKEKPGKKITKGKDKTKEEKTRPSDDVLRDAICEILKVVD 714
            S+K+ S+  +K S       KEK GK   KGK      K  P+   + + + +ILK VD
Sbjct: 424 KSKKQPSKPQAKGS-------KEK-GKATKKGK-----AKAEPTRKEMLEVVSKILKEVD 483

Query: 715 FTTATFTDILKQLAGQFKMDLTAQKSSIKLMIQEELTKLADEAEDEEDGEGDADAEKDVK 757
           F TAT +DIL++L+  F ++L+ +K  +K +I E +  + D+ E++E+ E +A ++K+ +
Sbjct: 484 FNTATLSDILQKLSDHFGVELSHRKPEVKDVITEAINAMTDDEEEDEEEEAEAGSDKEKE 512

BLAST of Cp4.1LG08g08310 vs. TAIR10
Match: AT3G48710.1 (AT3G48710.1 DEK domain-containing chromatin associated protein)

HSP 1 Score: 182.2 bits (461), Expect = 1.2e-45
Identity = 168/468 (35.90%), Positives = 264/468 (56.41%), Query Frame = 1

Query: 288 EDPKEGKTKKARKRRGAVKSKGKNEEDEKDEVGIKTPIIDRPVRERKSVERLVASI--ER 347
           E   E K K   ++  A++ KG+  + EK +  + TP+ +RP+RERK   R V       
Sbjct: 21  EKDTETKKKDEVEKDEAMEEKGEEIDGEKVKSPV-TPVSERPIRERKRTGRYVIDTPPRS 80

Query: 348 CVVKEFHIEKGRGTPLKDIPNVAFKLSRKKADDIFRLLHSILFGRRGKAFQIKSNISRFS 407
              K   I +GRGT LK+IPNVA+KLS++K DD   LLH+IL+G++ KA  +K NI +FS
Sbjct: 81  SGNKPLSITQGRGTRLKEIPNVAYKLSKRKPDDNLFLLHTILYGKKAKAQMLKKNIGQFS 140

Query: 408 GFVW-HGDEEKQKNKVKEKFDKCHKEKLLELCDVLDIPDVKATTRKEDVIGKLIEFLMAP 467
           GFVW   +EEKQ+ K KEK DKC KEKL++ CDVLDIP  K+T +KE++  +++EFL+ P
Sbjct: 141 GFVWSEQEEEKQRAKAKEKLDKCIKEKLIDFCDVLDIPVNKSTVKKEELAVRVLEFLVCP 200

Query: 468 HATSTVLLAEIEKSSKGKKRKRTVKGGISTPGDDSSKQSAKSRRKRGNTARSEMTRDTSD 527
            AT  +LLA+ EK +K K++K T K   S    +SS   AK RR+      ++     ++
Sbjct: 201 KATRDILLADSEKETK-KRKKSTSKNVTS---GESSHVPAKRRRQ------AKKQEQPTE 260

Query: 528 EDGESEEEKEAEEENDKENENGTTEKSDDEMSEQPESEDINDPTDESEEEKPRAS-SKRS 587
            +G  E +  +E  ND   E+    + ++  SE  E+ED  D   E  +   +   SKR+
Sbjct: 261 TEGNGESDVGSEGTNDSNGEDDVAPEEENNKSEDTETEDEKDKAKEKTKSTDKKRLSKRT 320

Query: 588 SRKRGSVGKARSKK--VTSSNKSDSAKSTSKRSSASRAKIDDSDSPKVFSRKKNSEKVSK 647
            +++ +  + +S K    SS KS      S  SS+ + K+D  DS K        EK   
Sbjct: 321 KKEKPAAEEEKSIKGSAKSSRKSFRQVDKSTTSSSKKQKVDKDDSSK--------EKGKT 380

Query: 648 ASTPPKSAAKEKPGKKITKGKDKTKEEKTRPSDDVLRDAICEILKVVDFTTATFTDILKQ 707
            ++ P++   +  G+   KGK +       P+   L   + +ILK VDF TAT +DIL++
Sbjct: 381 QTSKPQAKGSKDQGQSRKKGKKE-------PTRKELHVVVTKILKEVDFNTATLSDILRK 440

Query: 708 LAGQFKMDLTAQKSSIKLMIQEELTKLADEAEDEEDGEGDADAEKDVK 750
           L   F +DL  +K+ +K +I + + +++D+ +DE++ + + + EK+ K
Sbjct: 441 LGSHFGIDLMHRKAEVKDIITDAINEMSDD-DDEKEEDTEDEGEKEGK 461

BLAST of Cp4.1LG08g08310 vs. NCBI nr
Match: gi|659096319|ref|XP_008449030.1| (PREDICTED: LOW QUALITY PROTEIN: glutamic acid-rich protein [Cucumis melo])

HSP 1 Score: 904.8 bits (2337), Expect = 1.0e-259
Identity = 581/768 (75.65%), Positives = 636/768 (82.81%), Query Frame = 1

Query: 4   EDATNNTFENTKNEASGGDLKTNSIETV--ENGNNKEDKMKNNVETVDNETTEEDKMTNT 63
           +D    T ENTKNEA G DLKTN++ETV  +NGNNKEDKMKN+VETV+N  TE+DKMTNT
Sbjct: 12  KDTAKITVENTKNEAKGEDLKTNTVETVTVQNGNNKEDKMKNSVETVENGKTEDDKMTNT 71

Query: 64  TETVTNGISELEKINETVPNGLENGVKEPEIEQCTDEKAEATKMEEKPKIKEDEESNEET 123
            ETVTNG  ELEK NETVP G ENGVKE EIE+    +AE TKMEE+ K+KED+E N E 
Sbjct: 72  VETVTNGTIELEKTNETVPKGDENGVKETEIEEGVVVEAEVTKMEEERKVKEDKEINAEN 131

Query: 124 VKEEKEE--------DVLPNDKKIEENVDIKDEENVDVKDVEDAKDKEIEAKDEEIEDAK 183
           VK+EKEE        DV+PN K  EEN+DIKD +N+DVKD     DK   AKD E E AK
Sbjct: 132 VKDEKEEAKIQAMEEDVIPNAKNDEENMDIKDADNIDVKD-----DKNESAKDGEFEGAK 191

Query: 184 DEEVEVFEDAKDEEIEDAKDEEVEDAKDEEVEDAKDEEIEDAKDEEIEDAKDEEIEDAKD 243
           DEE+E   DAKDE  EDAKDE  EDAKDE  EDAKDE  ED+KDE  ED+KDE  EDAKD
Sbjct: 192 DEEME---DAKDEGTEDAKDEGTEDAKDEGTEDAKDEGTEDSKDEGTEDSKDEGTEDAKD 251

Query: 244 EEIEDAKEEEIEDAKDKEIEDAKDEENEEEKEDAKDKVEKVDSHMEEDDKELKDEDPKEG 303
                                       E  +DAKD VEKVDSHMEEDDKE+KD+DP E 
Sbjct: 252 ----------------------------EGTKDAKDGVEKVDSHMEEDDKEMKDKDPNEE 311

Query: 304 KTKKARKRRGAVKSKGKNEEDEKDEVGIKTPIIDRPVRERKSVERLVASIERCVVKEFHI 363
           K KK R+R+GAVKSKG NEEDEK+E  I+TPI+DRPVRERKSVERLVASIER  VKEFHI
Sbjct: 312 KXKKGRRRKGAVKSKGNNEEDEKEEAEIRTPIVDRPVRERKSVERLVASIERYAVKEFHI 371

Query: 364 EKGRGTPLKDIPNVAFKLSRKKADDIFRLLHSILFGRRGKAFQIKSNISRFSGFVWHGDE 423
           EKGRGTPLKDIPNVAFKLSRKK DDIFRLLH+ILFGRRGKAFQIKSNISRFSGFVWHGDE
Sbjct: 372 EKGRGTPLKDIPNVAFKLSRKKTDDIFRLLHTILFGRRGKAFQIKSNISRFSGFVWHGDE 431

Query: 424 EKQKNKVKEKFDKCHKEKLLELCDVLDIPDVKATTRKEDVIGKLIEFLMAPHATSTVLLA 483
           EKQKNKVKEKFDKC+KEKLLELCDVLDIP  KATTRKED+IGKL+EFL+APHAT+TVLLA
Sbjct: 432 EKQKNKVKEKFDKCNKEKLLELCDVLDIPVAKATTRKEDIIGKLVEFLIAPHATTTVLLA 491

Query: 484 EIEKSSKGKKRKRTVKGGISTPGDDSSKQSAKSRRKRGNTARSEMTRDTSDEDGESEEEK 543
           E EKSSKGKKRKR VKGGISTPGD  SK SAKSRRKRGN+ARSEMT+D+SDED ESEEEK
Sbjct: 492 EKEKSSKGKKRKRAVKGGISTPGDSGSKSSAKSRRKRGNSARSEMTKDSSDEDDESEEEK 551

Query: 544 EAEEENDKENE-NGTTEKSDDEMSEQPESEDINDPTDESEEEKPRASSKRSSRKRGSVGK 603
           EAEE+NDKENE NGTTEKSD+E+SEQPESEDINDPTDESEEE+PRAS+K SS+K+GSVGK
Sbjct: 552 EAEEDNDKENEKNGTTEKSDEEVSEQPESEDINDPTDESEEERPRASTKTSSKKKGSVGK 611

Query: 604 ARSKKVTSSNKSDSAKSTSKRSSASRAKIDDSD-SPKVFSRKKNSEKVSKASTPPKSAAK 663
           ARSKKVT SNKSDSAKS++K+ +ASRAK+DD D SPKVFSRKKNSEK +KASTP KSA K
Sbjct: 612 ARSKKVTGSNKSDSAKSSAKKLAASRAKVDDIDASPKVFSRKKNSEKENKASTPSKSANK 671

Query: 664 EKPGKKITKGKDKTKEEKTRPSDDVLRDAICEILKVVDFTTATFTDILKQLAGQFKMDLT 723
           EKPGKK+ KGKDKTKEEK+RPSDD LR+AICEILKVVDFTTATFTDILKQLA QFKMDLT
Sbjct: 672 EKPGKKVVKGKDKTKEEKSRPSDDELREAICEILKVVDFTTATFTDILKQLARQFKMDLT 731

Query: 724 AQKSSIKLMIQEELTKLADEAEDEEDGEGDADAEKDVKQAA--QEVET 758
            QKSSIKLMIQEELTKLADEAEDEEDG G+ DAEKD KQAA  +EVET
Sbjct: 732 TQKSSIKLMIQEELTKLADEAEDEEDG-GEGDAEKDGKQAASGREVET 742

BLAST of Cp4.1LG08g08310 vs. NCBI nr
Match: gi|778675796|ref|XP_011650473.1| (PREDICTED: glutamic acid-rich protein isoform X3 [Cucumis sativus])

HSP 1 Score: 731.1 bits (1886), Expect = 2.0e-207
Identity = 509/774 (65.76%), Positives = 576/774 (74.42%), Query Frame = 1

Query: 1   MGQEDATNNTFENTKNEASGGDLKTNSIETV--ENGNNKEDKMKNNVETVDNETTEEDKM 60
           MG ED    T ENTK+EA G DLKTN++ETV  ENGN+KEDKMKN+V             
Sbjct: 1   MGAEDTAKITVENTKDEAKGEDLKTNTVETVTVENGNSKEDKMKNSV------------- 60

Query: 61  TNTTETVTNGISELEKINETVPNGL-ENGVKEPEIEQCTDEKAEATKMEE-KPKIKEDEE 120
                             ETV NG  E+   +  IE  T+   E  K+ E  PK +E+  
Sbjct: 61  ------------------ETVENGTNEDDKMKNTIETVTNGTNELEKINEIVPKGEENGV 120

Query: 121 SNEETVKEEKEEDVLPNDKKIEENVDIKDEENVDVKDVEDAKDK-EIEAKDEEIEDAKDE 180
              E  K   E +V     K+ E   IK+++  + ++V+D K++ +I+A DE+       
Sbjct: 121 KETEIEKGVVEAEVT----KMGEEPKIKEDKESNAENVKDEKEEAKIQAMDEDANP---- 180

Query: 181 EVEVFEDAKDEEIEDAKDEEVEDAKDEEVEDAKDEEIEDAKDEEIEDAKDEEIEDAKDEE 240
                 +AK+       DE+  D KD +  D KD++ E AKD EIE AKDEE+EDAKD  
Sbjct: 181 ------NAKN-------DEQNVDIKDADSVDVKDDKNEIAKDGEIEGAKDEEMEDAKD-- 240

Query: 241 IEDAKEEEIEDAKDKEIEDAKDEENEEEKEDAKDKVEKVDSHMEEDDKELKDEDPKEGKT 300
                                      E +DAKD VEKVDSHMEEDDKE+KD+DP E KT
Sbjct: 241 ---------------------------EVDDAKDGVEKVDSHMEEDDKEMKDKDPNEEKT 300

Query: 301 KKARKRRGAVKSKGKNEEDEKDEVGIKTPIIDRPVRERKSVERLVASIERCVVKEFHIEK 360
           KK R+R+GA+KSKG  EEDEK+E  I+TPI+DRPVRERKSVERLVASIER  VKEFHIEK
Sbjct: 301 KKGRRRKGAIKSKGNKEEDEKEEAEIRTPIVDRPVRERKSVERLVASIERYAVKEFHIEK 360

Query: 361 GRGTPLKDIPNVAFKLSRKKADDIFRLLHSILFGRRGKAFQIKSNISRFSGFVWHGDEEK 420
           GRGTPLKDIPNVAFKLSRKK DDIFRLLH+ILFGRRGKAFQIKSNISRFSGFVWHGDEEK
Sbjct: 361 GRGTPLKDIPNVAFKLSRKKTDDIFRLLHTILFGRRGKAFQIKSNISRFSGFVWHGDEEK 420

Query: 421 QKNKVKEKFDKCHKEKLLELCDVLDIPDVKATTRKEDVIGKLIEFLMAPHATSTVLLAEI 480
           QKNK+KEKFDKC+KEKLLE CDVLDIP VKATTRKED+IGKLIEFL+APH+T+TVLLAE 
Sbjct: 421 QKNKIKEKFDKCNKEKLLEFCDVLDIPVVKATTRKEDIIGKLIEFLIAPHSTTTVLLAEK 480

Query: 481 EKSSKGKKRKRTVKGGISTPGDDSSKQSAKSRRKRGNTARSEMTRDTSDEDGESEEEKEA 540
           EKSSKGKKRKR VKGGISTPGD  SK SAKS RKRGN+ARSEMT+D+SDED ESEEEKEA
Sbjct: 481 EKSSKGKKRKRAVKGGISTPGDSGSKSSAKSCRKRGNSARSEMTKDSSDEDDESEEEKEA 540

Query: 541 EEEND--------KENENGTTEKSDDEMSEQPESEDINDPTDESEEEKPRASSKRSSRKR 600
           EEE D         ENENGTTEKSDDE+SEQPESEDINDPTDESEEE+PR+S+K SS+++
Sbjct: 541 EEEKDAEEDNDKENENENGTTEKSDDEVSEQPESEDINDPTDESEEERPRSSTKSSSKRK 600

Query: 601 GSVGKARSKKVTSSNKSDSAKSTSKRSSASRAKIDDSD-SPKVFSRKKNSEKVSKASTPP 660
            SVGKARSKKV  SNKS+SAKS++K+SSASRAK+DD+D SPKVFSRKKNSEK SKASTP 
Sbjct: 601 RSVGKARSKKVAGSNKSESAKSSAKKSSASRAKVDDNDASPKVFSRKKNSEKESKASTPT 660

Query: 661 KSAAKEKPGKKITKGKD-KTKEEKTRPSDDVLRDAICEILKVVDFTTATFTDILKQLAGQ 720
           KSA KEKPGKK+ KGKD KTKEEKTRPSDD LR+AICEILKVVDFTTATFTDILKQLA Q
Sbjct: 661 KSANKEKPGKKVVKGKDNKTKEEKTRPSDDELREAICEILKVVDFTTATFTDILKQLARQ 690

Query: 721 FKMDLTAQKSSIKLMIQEELTKLADEAEDEEDGEGDADAEKDVKQAA--QEVET 758
           FKMDLT QKSSIKLMIQEELTKLADEAEDEEDG    DAEKD KQ A  +EVET
Sbjct: 721 FKMDLTTQKSSIKLMIQEELTKLADEAEDEEDG---GDAEKDGKQGASGKEVET 690

BLAST of Cp4.1LG08g08310 vs. NCBI nr
Match: gi|778675790|ref|XP_011650471.1| (PREDICTED: glutamic acid-rich protein isoform X1 [Cucumis sativus])

HSP 1 Score: 725.7 bits (1872), Expect = 8.3e-206
Identity = 506/771 (65.63%), Positives = 574/771 (74.45%), Query Frame = 1

Query: 4   EDATNNTFENTKNEASGGDLKTNSIETV--ENGNNKEDKMKNNVETVDNETTEEDKMTNT 63
           +D    T ENTK+EA G DLKTN++ETV  ENGN+KEDKMKN+V                
Sbjct: 12  KDTAKITVENTKDEAKGEDLKTNTVETVTVENGNSKEDKMKNSV---------------- 71

Query: 64  TETVTNGISELEKINETVPNGL-ENGVKEPEIEQCTDEKAEATKMEE-KPKIKEDEESNE 123
                          ETV NG  E+   +  IE  T+   E  K+ E  PK +E+     
Sbjct: 72  ---------------ETVENGTNEDDKMKNTIETVTNGTNELEKINEIVPKGEENGVKET 131

Query: 124 ETVKEEKEEDVLPNDKKIEENVDIKDEENVDVKDVEDAKDK-EIEAKDEEIEDAKDEEVE 183
           E  K   E +V     K+ E   IK+++  + ++V+D K++ +I+A DE+          
Sbjct: 132 EIEKGVVEAEVT----KMGEEPKIKEDKESNAENVKDEKEEAKIQAMDEDANP------- 191

Query: 184 VFEDAKDEEIEDAKDEEVEDAKDEEVEDAKDEEIEDAKDEEIEDAKDEEIEDAKDEEIED 243
              +AK+       DE+  D KD +  D KD++ E AKD EIE AKDEE+EDAKD     
Sbjct: 192 ---NAKN-------DEQNVDIKDADSVDVKDDKNEIAKDGEIEGAKDEEMEDAKD----- 251

Query: 244 AKEEEIEDAKDKEIEDAKDEENEEEKEDAKDKVEKVDSHMEEDDKELKDEDPKEGKTKKA 303
                                   E +DAKD VEKVDSHMEEDDKE+KD+DP E KTKK 
Sbjct: 252 ------------------------EVDDAKDGVEKVDSHMEEDDKEMKDKDPNEEKTKKG 311

Query: 304 RKRRGAVKSKGKNEEDEKDEVGIKTPIIDRPVRERKSVERLVASIERCVVKEFHIEKGRG 363
           R+R+GA+KSKG  EEDEK+E  I+TPI+DRPVRERKSVERLVASIER  VKEFHIEKGRG
Sbjct: 312 RRRKGAIKSKGNKEEDEKEEAEIRTPIVDRPVRERKSVERLVASIERYAVKEFHIEKGRG 371

Query: 364 TPLKDIPNVAFKLSRKKADDIFRLLHSILFGRRGKAFQIKSNISRFSGFVWHGDEEKQKN 423
           TPLKDIPNVAFKLSRKK DDIFRLLH+ILFGRRGKAFQIKSNISRFSGFVWHGDEEKQKN
Sbjct: 372 TPLKDIPNVAFKLSRKKTDDIFRLLHTILFGRRGKAFQIKSNISRFSGFVWHGDEEKQKN 431

Query: 424 KVKEKFDKCHKEKLLELCDVLDIPDVKATTRKEDVIGKLIEFLMAPHATSTVLLAEIEKS 483
           K+KEKFDKC+KEKLLE CDVLDIP VKATTRKED+IGKLIEFL+APH+T+TVLLAE EKS
Sbjct: 432 KIKEKFDKCNKEKLLEFCDVLDIPVVKATTRKEDIIGKLIEFLIAPHSTTTVLLAEKEKS 491

Query: 484 SKGKKRKRTVKGGISTPGDDSSKQSAKSRRKRGNTARSEMTRDTSDEDGESEEEKEAEEE 543
           SKGKKRKR VKGGISTPGD  SK SAKS RKRGN+ARSEMT+D+SDED ESEEEKEAEEE
Sbjct: 492 SKGKKRKRAVKGGISTPGDSGSKSSAKSCRKRGNSARSEMTKDSSDEDDESEEEKEAEEE 551

Query: 544 ND--------KENENGTTEKSDDEMSEQPESEDINDPTDESEEEKPRASSKRSSRKRGSV 603
            D         ENENGTTEKSDDE+SEQPESEDINDPTDESEEE+PR+S+K SS+++ SV
Sbjct: 552 KDAEEDNDKENENENGTTEKSDDEVSEQPESEDINDPTDESEEERPRSSTKSSSKRKRSV 611

Query: 604 GKARSKKVTSSNKSDSAKSTSKRSSASRAKIDDSD-SPKVFSRKKNSEKVSKASTPPKSA 663
           GKARSKKV  SNKS+SAKS++K+SSASRAK+DD+D SPKVFSRKKNSEK SKASTP KSA
Sbjct: 612 GKARSKKVAGSNKSESAKSSAKKSSASRAKVDDNDASPKVFSRKKNSEKESKASTPTKSA 671

Query: 664 AKEKPGKKITKGKD-KTKEEKTRPSDDVLRDAICEILKVVDFTTATFTDILKQLAGQFKM 723
            KEKPGKK+ KGKD KTKEEKTRPSDD LR+AICEILKVVDFTTATFTDILKQLA QFKM
Sbjct: 672 NKEKPGKKVVKGKDNKTKEEKTRPSDDELREAICEILKVVDFTTATFTDILKQLARQFKM 698

Query: 724 DLTAQKSSIKLMIQEELTKLADEAEDEEDGEGDADAEKDVKQAA--QEVET 758
           DLT QKSSIKLMIQEELTKLADEAEDEEDG    DAEKD KQ A  +EVET
Sbjct: 732 DLTTQKSSIKLMIQEELTKLADEAEDEEDG---GDAEKDGKQGASGKEVET 698

BLAST of Cp4.1LG08g08310 vs. NCBI nr
Match: gi|645241633|ref|XP_008227168.1| (PREDICTED: protein DEK [Prunus mume])

HSP 1 Score: 475.7 bits (1223), Expect = 1.5e-130
Identity = 380/717 (53.00%), Positives = 491/717 (68.48%), Query Frame = 1

Query: 54  EEDKMTNTTETVTNGISELEKINETVPN--GLEN-GVKEPEIEQCTDEKAEATKMEEKPK 113
           EED +TN TETV NG S+  K +E V +  G EN GVKE ++++  D+KAE  KM+E  +
Sbjct: 3   EEDTVTNGTETVANGTSQSGKTSEDVTDKKGKENVGVKEMDVDKKGDKKAEVEKMDEDLE 62

Query: 114 IK--EDEESNEETVK---EEKEEDVLPNDKKIEENVDIKDEENVDVKDVEDAKDKEIEAK 173
               +DE+  EET +   EEK+E+  PN+ KI        +EN +V+ +    D+++EA 
Sbjct: 63  ANGCKDEKLKEETKESKVEEKKEENGPNEVKI-------GDENAEVEKM----DEDLEAD 122

Query: 174 DEEIEDAKDEEVEVFEDAKDEEIEDAKDEEVEDAKDEEVEDAKDEEIEDAKDEEIEDAKD 233
               E  K+E+    E+ K EE+E+      +   + + E+ ++  +E+ K ++ E+ K 
Sbjct: 123 GSRDEKLKEEK----EELKVEEMEEETGPNEKKETEGKTEEKREFVVEEDKVKKAEE-KA 182

Query: 234 EEIEDAKDEEIEDAKEEEIEDAKDKEIEDAKDEENEEEKEDAKDKVEKVDSHMEEDDKEL 293
           EE+E+ K+EE E+ KEEE E+ K++E+E+ K+EE EEEKE+A+ K  KV+          
Sbjct: 183 EEMEEEKEEEKEEEKEEEKEEEKEEEMEEEKEEEKEEEKEEARAKEAKVE---------- 242

Query: 294 KDEDPKEGKTKKARKRRGAVKSKGKNEEDEKDEVGIKTPIIDRPVRERKSVERLVASIER 353
                 +G  K+ + +R     + K E  EK E   +TP  DRPVRERKSVERLVASIE+
Sbjct: 243 ------KGSRKRGKGKRVEKTKEKKKEVAEKKEPEHRTPATDRPVRERKSVERLVASIEK 302

Query: 354 CVVKEFHIEKGRGTPLKDIPNVAFKLSRKKADDIFRLLHSILFGRRGKAFQIKSNISRFS 413
             V+EF IEKGRGTPLKDIPNVAFKLSR+K DD  +LLH+ILFGRRGKA ++KSNISRFS
Sbjct: 303 DAVREFQIEKGRGTPLKDIPNVAFKLSRRKMDDSLKLLHTILFGRRGKALEVKSNISRFS 362

Query: 414 GFVWHGDEEKQKNKVKEKFDKCHKEKLLELCDVLDIPDVKATTRKEDVIGKLIEFLMAPH 473
           GFVW G+E+KQK KVKEKFDKC+KEKLLE CD+L++P  KATTRKED++ KLI+FL++PH
Sbjct: 363 GFVWRGNEDKQKTKVKEKFDKCNKEKLLEFCDLLNLPISKATTRKEDIVAKLIDFLVSPH 422

Query: 474 ATSTVLLA--EIEKSSKGKKRKRTVKGGISTPGDDSSKQSAKSRRKRGNTARSEMTRDTS 533
           AT+T LLA  E+    KGKKRKR  KG  ST G  +SK+SAK+RRK  + ++ +      
Sbjct: 423 ATTTSLLAEKEVHNFRKGKKRKRATKGSSSTSGGTNSKRSAKNRRKNDDDSKLDDKSAAD 482

Query: 534 DEDGESEEEKEAEEENDKENENGTTEKSDDEMSEQPESEDINDPTDESEE-EKPRASSKR 593
            ED   E+EKE EE  ++ENENG  E S+DE  E  ESE+  D +D+SEE EK +   K 
Sbjct: 483 TEDESEEDEKEDEENVEEENENGVHENSEDETPEHSESEEKLDSSDDSEEVEKQKPRRKS 542

Query: 594 SSRKRGSVGKARSKKVTSSNKSDSAKSTS-KRSSASRAKIDDSD--SPKVFSRKKNSEKV 653
           SSRK+GS  KA++KK T S KS    + S K+SS+ R  +DD    SPK  SRKK +EKV
Sbjct: 543 SSRKKGSSAKAQTKKATGSAKSTPPPTKSPKKSSSKRTPVDDDSDTSPKASSRKKKNEKV 602

Query: 654 SKASTPPKSAAKEKPGKKITKGKDKTKEEKTRPSDDVLRDAICEILKVVDFTTATFTDIL 713
           SK  TP KSA+KEKPGKK+ KGKDKTKEEK RPSDD LRDAIC+ILK VDF TATFTDIL
Sbjct: 603 SKVPTPTKSASKEKPGKKVAKGKDKTKEEKLRPSDDKLRDAICQILKEVDFNTATFTDIL 662

Query: 714 KQLAGQFKMDLTAQKSSIKLMIQEELTKLADEAEDEEDGEGDADAEKDVKQAAQEVE 757
           KQLA QF  DL+ +KSSIKLMIQEELTKLADEA DEED EG  + + + + A QEVE
Sbjct: 663 KQLARQFDTDLSPRKSSIKLMIQEELTKLADEA-DEEDEEGGPEKD-ETESAGQEVE 685

BLAST of Cp4.1LG08g08310 vs. NCBI nr
Match: gi|703069255|ref|XP_010088453.1| (hypothetical protein L484_018225 [Morus notabilis])

HSP 1 Score: 461.5 bits (1186), Expect = 2.9e-126
Identity = 349/596 (58.56%), Positives = 439/596 (73.66%), Query Frame = 1

Query: 182 DAKDEEIEDAKDEEV--EDAKDEEVEDAKDEEIEDAKDEEIEDAKDEEIEDAK-DEEIED 241
           DAK++   D   ++    D   E  ED KD++  +  D+  ED + +EI+DAK  EEI +
Sbjct: 19  DAKEKTANDVTKKKAAESDGPKEMEEDNKDDDKAEF-DKMEEDTEAKEIKDAKGKEEIGE 78

Query: 242 AKEEEIEDAKDKEIEDAKDEENEEEKEDAKDKVEKVDSHMEEDD-KELKDEDPKEGK-TK 301
            K E +E+         K+EE E + E  ++  EKVD   EE+  K+ K ED  E K +K
Sbjct: 79  GKVEVMEEDNGPN----KEEEVEGKVETKEEVGEKVDGFKEEEKVKDEKAEDTGEEKESK 138

Query: 302 KARKRRGAVKSKGKNEEDE-KDEVGIKTPIIDRPVRERKSVERLVASIERCVVKEFHIEK 361
           K  K +   K+K K +E E K E+  +TP IDRP RERKSVERLVA++E+   KEFHIEK
Sbjct: 139 KRGKGKSGEKTKEKRKESETKKELEPRTPAIDRPQRERKSVERLVATVEKESHKEFHIEK 198

Query: 362 GRGTPLKDIPNVAFKLSRKKADDIFRLLHSILFGRRGKAFQIKSNISRFSGFVWHGDEEK 421
           GRGTPLKDIPNVAFKLSR+K DD F+LLH+ILFGRRGKAFQIKSNISRFSGFVWH +EEK
Sbjct: 199 GRGTPLKDIPNVAFKLSRRKTDDTFKLLHTILFGRRGKAFQIKSNISRFSGFVWHENEEK 258

Query: 422 QKNKVKEKFDKCHKEKLLELCDVLDIPDVKATTRKEDVIGKLIEFLMAPHATSTVLLAEI 481
           QK KVKEKFDKC+KEKLLE CDVLDIP  KATTRKED++ KLI+FL+AP+AT+ VLLAE 
Sbjct: 259 QKIKVKEKFDKCNKEKLLEFCDVLDIPIAKATTRKEDIVSKLIDFLVAPYATTAVLLAEK 318

Query: 482 EKSSKGKKRKRTVKGGISTPGDDSSKQSAKSRRKRGNTARSEMTRDTSD-------EDGE 541
           EKS+KGKKRKR  KG  S  G  ++K+S K+R K  + +++E  ++T+D       E+ E
Sbjct: 319 EKSNKGKKRKRAAKGSSSASG-GTTKRSVKNRIKNEDDSKAEEKKNTTDTEDESSEEEEE 378

Query: 542 SEEEKEAEEENDKENENGTTEKSDDEMSEQPESEDINDPTDESEE--EKPRASSKRSSRK 601
            EE+KE +EEN++ENENG  +KS+DE+ E+ ESE+ +D  DESEE  EK R SSK++S+K
Sbjct: 379 EEEDKEEDEENEEENENGVPDKSEDELPEKSESEEKSDSEDESEEEVEKRRRSSKKASQK 438

Query: 602 RGSVGKARSKKVTSSNKSD-SAKSTSKRSSASRAKIDDSD--SPKVFSRKKNSEKVSKAS 661
           + S GKA++K+ T S KS    K T K+S++ R+K DD    SPKVFSRKK SEKV+KAS
Sbjct: 439 KESAGKAKTKRTTVSPKSSPPPKRTPKKSASKRSKGDDGSDTSPKVFSRKKTSEKVAKAS 498

Query: 662 TPPKSAAKEKPGKKITKGKDKTKEEKTRPSDDVLRDAICEILKVVDFTTATFTDILKQLA 721
           TP K+A+KEK GK+  KGK+K+K+EK++P+DD LRDAICEILK VDF TATFTDILKQLA
Sbjct: 499 TPAKAASKEKTGKRTAKGKEKSKKEKSKPTDDELRDAICEILKEVDFNTATFTDILKQLA 558

Query: 722 GQFKMDLTAQKSSIKLMIQEELTKLADEAEDEEDGEGD--ADAEKDVKQ-AAQEVE 757
            QF  DLT +KSSIK+MIQEELTKLADEA+++EDG+GD   DAEKD  Q A QEVE
Sbjct: 559 KQFDTDLTPRKSSIKIMIQEELTKLADEADEDEDGDGDGEGDAEKDETQPAEQEVE 608
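
Each HSP header above reports identities and positives as a count over the alignment length, followed by a percentage. The following is a minimal Python sketch for reading such a line and recomputing the percentages from the counts. The function name `parse_hsp_stats` and the returned dictionary keys are illustrative assumptions, not part of any BLAST tool or of this database.

```python
import re

# Hypothetical pattern for a legacy BLAST HSP statistics line, e.g.
# "Identity = 334/757 (44.12%), Positives = 461/757 (60.90%), Query Frame = 1".
# "Posi?tives" also accepts the "Postives" spelling found in some BLAST outputs.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return the counts and the recomputed percentages for one HSP line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, length, ident_pct, pos, pos_length, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "align_length": int(length),
        # Percentages recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positive_pct": round(100 * int(pos) / int(pos_length), 2),
        # Percentages as printed in the report, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }
```

Comparing the recomputed and reported percentages is a cheap consistency check when copying these figures into a summary table.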

The following BLAST results are available for this feature:
Match Name      E-value     Identity    Description
W9R1E2_9ROSA    4.5e-126    58.20       Uncharacterized protein OS=Morus notabilis GN=L484_018225 PE=4 SV=1
M5WUP2_PRUPE    2.5e-124    56.80       Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003409mg PE=4 SV=1
A0A061FC21_THECC    1.6e-123    57.19   DEK domain-containing chromatin associated protein isoform 2 OS=Theobroma cacao ...
A0A067KSV3_JATCU    3.4e-121    53.21   Uncharacterized protein OS=Jatropha curcas GN=JCGZ_01033 PE=4 SV=1
A0A061F550_THECC    3.1e-119    54.81   DEK domain-containing chromatin associated protein isoform 1 OS=Theobroma cacao ...

Match Name     E-value    Identity    Description
AT4G26630.1    1.2e-90    43.30       DEK domain-containing chromatin associated protein
AT5G55660.1    4.7e-90    44.12       DEK domain-containing chromatin associated protein
AT5G63550.2    2.1e-50    38.03       DEK domain-containing chromatin associated protein
AT3G48710.1    1.2e-45    35.90       DEK domain-containing chromatin associated protein

Match Name                          E-value     Identity    Description
gi|659096319|ref|XP_008449030.1|    1.0e-259    75.65       PREDICTED: LOW QUALITY PROTEIN: glutamic acid-rich protein [Cucumis melo]
gi|778675796|ref|XP_011650473.1|    2.0e-207    65.76       PREDICTED: glutamic acid-rich protein isoform X3 [Cucumis sativus]
gi|778675790|ref|XP_011650471.1|    8.3e-206    65.63       PREDICTED: glutamic acid-rich protein isoform X1 [Cucumis sativus]
gi|645241633|ref|XP_008227168.1|    1.5e-130    53.00       PREDICTED: protein DEK [Prunus mume]
gi|703069255|ref|XP_010088453.1|    2.9e-126    58.56       hypothetical protein L484_018225 [Morus notabilis]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term          Definition
GO:0003677    DNA binding
Vocabulary: INTERPRO
Term         Definition
IPR014876    DEK_C
IPR009057    Homeobox-like_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003677 DNA binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name         Unique Name          Type
Cp4.1LG08g08310.1    Cp4.1LG08g08310.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term    IPR Description   Source   Source Term       Source Description     Alignment
IPR009057   Homeodomain-like  GENE3D   G3DSA:1.10.10.60                         coord: 663..728; score: 2.1
IPR014876   DEK, C-terminal   PFAM     PF08766           DEK_C                  coord: 672..725; score: 3.1
None        No IPR available  unknown  Coil              Coil                   coord: 525..545; score: -
                                                                                coord: 155..175; score: -
                                                                                coord: 241..284; score: -
                                                                                coord: 31..51; scor
None        No IPR available  PANTHER  PTHR13468         DEK PROTEIN            coord: 117..756; score: 6.5E
None        No IPR available  PANTHER  PTHR13468:SF1     PROTEIN DEK            coord: 117..756; score: 6.5E
None        No IPR available  unknown  SSF109715         DEK C-terminal domain  coord: 665..725; score: 4.71
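
The Alignment column above gives each domain hit as `coord: start..end` in protein residue positions. The following is a small Python sketch for extracting those ranges and computing their lengths. It assumes the coordinates are 1-based and inclusive, which is the usual convention for InterPro match positions but is not stated on this page. The helper names `parse_coords` and `span_length` are illustrative.

```python
import re

# Matches "coord: 663..728" style ranges; several may be fused on one line,
# as in the Coil row of the table.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return every (start, end) pair found in an Alignment cell."""
    return [(int(start), int(end)) for start, end in COORD.findall(text)]


def span_length(start, end):
    """Length of a range, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Example: the PFAM DEK_C hit reported at residues 672..725.
for start, end in parse_coords("coord: 672..725"):
    print(start, end, span_length(start, end))
```

Under that assumption, the PFAM DEK_C hit at 672..725 spans 54 residues and lies inside the GENE3D Homeodomain-like hit at 663..728.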

The following gene(s) are paralogous to this gene:
Gene               Paralogue          Organism                     Block
Cp4.1LG08g08310    Cp4.1LG14g07910    Cucurbita pepo (Zucchini)    cpecpeB251
The following block(s) are covering this gene:
Gene               Organism                        Block
Cp4.1LG08g08310    Melon (DHL92) v3.6.1            cpemedB954
Cp4.1LG08g08310    Cucumber (Chinese Long) v3      cpecucB1064
Cp4.1LG08g08310    Wax gourd                       cpewgoB1094
Cp4.1LG08g08310    Cucumber (Gy14) v1              cgycpeB0029
Cp4.1LG08g08310    Wild cucumber (PI 183967)       cpecpiB870
Cp4.1LG08g08310    Cucumber (Chinese Long) v2      cpecuB868
Cp4.1LG08g08310    Bottle gourd (USVL1VR-Ls)       cpelsiB720
Cp4.1LG08g08310    Watermelon (Charleston Gray)    cpewcgB778
Cp4.1LG08g08310    Watermelon (97103) v1           cpewmB828
Cp4.1LG08g08310    Melon (DHL92) v3.5.1            cpemeB811
Cp4.1LG08g08310    Cucumber (Gy14) v2              cgybcpeB461