Cp4.1LG18g00040 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG18g00040
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Serine/threonine-protein kinase Haspin, putative
Location: Cp4.1LG18 : 42917 .. 54818 (-)
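
The location field packs the scaffold name, start, end, and strand into one string. The sketch below shows one way to split it apart and compute the span it covers. It assumes the coordinates are 1-based and inclusive, which is common for genome annotation but is not stated on this page. The function names `parse_location` and `feature_span` are hypothetical helpers, not part of any database API.

```python
import re

# Pattern for strings such as "Cp4.1LG18 : 42917 .. 54818 (-)".
_LOCATION_RE = re.compile(r"^\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> tuple[str, int, int, str]:
    """Split a location string into (seqid, start, end, strand)."""
    match = _LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def feature_span(start: int, end: int) -> int:
    """Number of bases covered, ASSUMING 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG18 : 42917 .. 54818 (-)")
print(seqid, strand, feature_span(start, end))  # Cp4.1LG18 - 11902
```

The `(-)` strand means the gene is read from the reverse strand of the scaffold, so the sequences listed for the feature are presumably given in the gene's own orientation rather than the scaffold's.
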
The following sequences are available for this feature:

Gene sequence (with intron)

ATTGTTTGGCAGCTTTTTGCCCTTAAAAAACGTCCATTTCCGGCCTTAGTTGGCTCATTGGGCTCCCGCCATTCGGACATTCGAACGCTCCTTCTCTGCATTTCACCATCTGCTCTCCACTCCCTCTACTTTGTCCAATCTTGACAGCATGAATTCCAAACCAGGTACAGGAAACTACTCAACTTTCTAGCATTCATCTTCCTTCCAATCGTTATATACTTTGTAGCGTAGTTTCTTACTGTGTTGGACGGGTCAAATTCTAGGTGGGAAGGGCATTGATTTGTGGTCCGAGTTAATAGCTAGCGAAGGAAGCGAGGGCCAAGAAGAAGCGCGAGTAGAAGAGGTTTATAGACGAAGAAGGCCATCCGAGAAGACCGTTCATGAAGTTCCTTTGTAAACTTTCTTCACTCACTTGTTTAATTCCTCTTTTTGTTTTCTTTTGTGTGATGAAATCCATTGTTTCTTGTTTTCGTTTCGATTTTCTTGCAGGAAGCAAAATTTAGGTTCGAATCTTCGCGATGTGAACAGATTGAGTTTTGCTGCTGTTAACAGTAAGAGAATTAGTTGGAACCGTGCCCTTTCCATCAGGTTTACTTCGTTTTTCTCTTAATGTTCTTCTCAACTTCTTGCTGCCAATTCCAATAAGGGTGCTTATGTAGTTTGATCTGTGCTTTCTTAGTTCTTGTTAACACGGACGATTTCGTTGTAGCTTATCTCTTTAAAGTTGACTATGTAATGACTAAAATGAAACACGAGGAGGCAAATATCCAAACGACAACAAAGAAAGCCTACGTGATAGATTAATACGTTCTGCGGCAGTAGATTGTTAAGTTACCACGAGTTATCATTCCTTATCGGATGCCATGAAGTGATCTTAGAAAGATAGACGAGTTGTATGATCTTCGTTGCCTTACACCAAAAACAGTCGACATTAAATTTTTTTTTTCCTCCAGTCCTCTTAGATATTTCTAAGAAGAGGCTATAGAGTTTTGAGAGCGATATTGTATGACCTTGGCCCTTATAGTACATTCTTCTTTTCTTGTAAAAAAAGAAAACCCTTTCCAATTTGACATCTAGCACGTTGATTCACTTGCTGAGTAGCATCACCTCTATAGTATGTCTAATGAAACTCTTTAGCAAGGTCTGGGTTCTATTGGCTTGATTTTCTGATGCACCCAAGTTTAAGCTATTTTGTTGAAATTGAAAAATCAGCCTAAAAACATACAAAAAGGCTGCATCTATACAAGATGGAGGGCTTGCCTGATCGGTCAACAGCCTGCTTACCAACTAAACAGCCAAAAAATAGCTGTAACAAACATAAAACTATCAGCCTAAAGAATTACAAGAAAGAGAGTAAAATTACCACAACCATGACTGGTTAAGAAAACTAACATAATGAAAGGCCCTGCATCAAATCCTTCCCACCAAAAACAAAACTCAACCCACAAGTTTAAGGTGCTAAGTTTAATTCATAAGGATCAAAGTAGGGGAAGGCGATGGTCATATTGAAGGTGTTGTTGCAAGTCTAAATTATCTTGCTGTTTTTTATATTAAGTTTCAATGAGATACATGTGCCTTTTATAATATATAAAGGCTAACTAAAATCTCTAAATACTAGGTAACAATATAGATAAATTAGGGTCAACTACTTTTCTTATTTAATGAAATTACGTAGGTATTCCTTCAACACTCCCCCTTAAGCTGAGTAATAGATATAACGGTTACGTACTCTATCCTCATTAGTGGGGAAGGCCAAGGGGGAGGATCTAGAGCATCAAGAAGAGACCTTAGACAAGATGTTCTTAGTCTCCTTTCCTCTTCTTGCTTGTTGTGGATACCCTCAGTAAGATTGTTTTCAAATGTGTTGATGGTAACATTATCAACAGCATTTGAGGTCGGGAAAGATAAGTTACCCCATCTTATCTTCATACTATTTTCTTTTGCTCTAGGAAAGAGGAGTCGTTTCTAGACCAATCTTAGCGTTTTTTGAGGCCATGTTG
GGACTTAAAATCAATAGAGGTAAATGTCAGATTCTAGGTTTGACTGTGACCCTAGGAAAGTGAGTAGTTGGTCTGCTCATTTAGGTTATGAGATTGGTTTGTTTCCCTCATCGTACTTAGGGCTCCCCTTTGCCATAATCCAAGGACTCTTCCTTTTTGAAACCTTGTGGTGGACAAGGTGAGAAAGAGAGTTGCCTTTTGGACGTGAATCCTTTCTCAAAAGGTGGTAGAATGACCTTCATTCGCTCAGTGTTGAGTGGCTTCCCTATCTATTTCTTCTTCCTTTTTAGAGCCAGTTATTCTGTGTGTAAAAATATTGAGAAGCTTATGAGGGACTTCTTGTGGTAAGGGATGATGAGGATAGAAGTGTGCATCGTGTGAGTTGGGAGATGGTGGGAAACTTAATAAACCTTTTGGGTTAGAACTAGGGAATTTAAGGACTCATAACAAAGCCTTGTTAGCCAAATGCTCTAGCGGTTTCTCCTTGTCCAAAACCCTTTGATATAGGATTTTTAACAATCAAGCATAAACCTCATCCTTATGAGTGGACGTTGAGTGGGGTCAAGGACACTTATAGGAGTCTATGGAAAGAAACATGAAATGGACTCCTCTCTTTTCCTTACCTTGTTCACTATTTTGTGAGCGATCAGAAGGAAACATATTTCTGGCAGGATAAGTGAGTGGGGGATAAACAAATGTGCTTTATGGGGGATAAACAAATGTGCTTTATGTTTCTCCGTTTATATCATTTGTCCTTCATGAAAATCCGTTTTGTGGCTGATGTTTTGGATCCTTCCAAGAGCTCTCCTTCTCTTTTACTTGGTTTTTGTCGTTCATTGTCTTATAGGGAAGTAGTGGATATCACAACTCTCTTATCATGATTAGAGAGTTTGAGTCTAGGTTGGGGAGAATGGATGTTCGTTTTTGGAGTCCCAATTCGACTAAGGGCCTCTCTTTTAGATCATTCTTCCATTGTTTGTTGGACCCTTCTCCCTCTAGTGATTTCATTTTTATTCTCTATGGAAGGAGAAAGTCCAAAGAATGTTGAATTAGTTATTCACGTAATAGTAAACACTTTTGATCGACTTACATTAAAGACGTCTTCTCTGGTTGGGTCGTTTTGTTGCATTATTTGTTAGAGGGCGAAGGAAGATTTGGATGATATCCTTTGGAGATATTCTTTTGCTTGTTCTGTTTGGAGTTATTTTTTTAATGTGTTTGAGTTCTAGTTATCAAGACACAAGATTACAGGGAGATGATCAAGGAGTTTCTTCCTCATACTTTTTCAAGAGGGGGGATTTCCGTAACAAGCCAAAGTTTGTGCTTTAATATAGAGACATTGGGGGGTGAGGAACAATAGAACCTTTAGAGGAATTGAAAGGGATACAAGTGACTTTTGATCTCTCATTAGATTCTATGTTTTTCTTTGGGTGCCGGTGTCAAATTCTTTTTGAAACTATTGATTAGGTTTTATTCTTCTTGATTGAAGTCCCTTTCTCTAATAGGGCTCCCTTTTGTAGGTTTGGTTCTTTTGGATGCTCTTGTAGTCTTTCGATTTTTTTTTTAATGAGAGCTAGATTTTCATCAAATATATAAATAGATTTTCATCAATGGAAGACCGAGAGGAAAAATTTATGCCACACGGGGCCTGAGACAAGGAGACCCACTATCCCCTTTTCTCTTCATCATGGTAATGGACTATCGCAGTCACCTACTGACAAAGGCAGAATACGAAGGGCAGATCAAAGGTCTTCAGATTGGTAATGAAGGCTTGAGCATAAACCACCTTCAGTTCGTAGATGACACAATCCTCTTTTCCGATTTGGTGAATGCCTCTTCCATAAAAAACATGATTGAGACGGTAAAAACCTTTGAAGGATTCTCTGGACAAAATATCAATCTCCAAAAAACAGAGATCATGGGCATCAATATTAGCACAGAGATCATTGAAGAATTTGCTTGCAGATATGGTTGTAAAAAGGGAGAATGGCCAAATATGT
ACCTAGGATTGCCTCTAAAAGGAAATCACAAATCCTTTTCCTTTTGGAAGATTATTATTGAGAAAATAGAAAGAAGGTTGTCAACATGGTCATCATGTTATACTTCAAAAGGTGGAAGACTTACCCTAATACAAGCCACATTATCCAACCTCCCTACCTACTATATGTCTCTATTTGAAATGCCACAAAAAGTGGCCGCGGATATAGAAAGGCTTTTCAGAAATTACTTGTGGAAAGATAGTGCACACCTTGTTCGTTGGAATATTATAAACCTCCCAACAGAAAAGGAGGTCTCGGTCTTCTCTCGATAAGGAAGAAGAACACAGCTCTCCTCGCCAAATGGATATGGAGATATCATCACGAAGAAAAGGCCTTATGGCGAAATCTTATAAAGGCTAAATATACTCCTACATCAAACAAAGATCACCTCCCTCCATCTTCTACAAAAGGGCCTTGGAAGTACATAAAGAATCATCAAAATCTCATCACCCACCGAACTCGCCATAAGGTGGGGAATGGGGGAAGCACATCATTTTGGACCAACCCATGGATTGAAAACACCACGCTAGCTTTGAAGTATCCACTTTTGTACAGGCTTTCTCACCACAAAAAAGCCACGATTAAAGAAATGTGGAATGTTGTCAACAAATTTTGGGACTTGAAGCTTGGTAGGAATCTAAAGGATAATGAAGCAACAGAGTGGGCCGAATTAAGCCTTGACCTTGCCCCTGTGGTATTGTCAAACAAAGAAGACTCACTAACTTGGCTCCCCAGCGCTGACGGGGTCTTTTCTACAAAATCCTTGATGATGGACATGGGGGAAAAAGTAGAAGCAATAAATCCCACGCTAGCAAAGACAATATGGAAAGGACATCAACCTAAAAAGGTGAAGTTCTTCCTTTGGGAAATAGCGCATAAAGCCATCAGCACAAGTGAAAATCTTCAAAAAAGAATGCCTTACATCACTCTCTCTCCAAATTGGTGTCCACTATGCAAGAAAGCAAACGAATCACAAAGTCACTTGTTTATGCAATGCACATACACTCAAAATTTCTGGACAACGATTCTTAATATATTCGGATGGCATCTCACATTTCCTAGGGAGGTAAAGGATTTTTTGGATATGGCCTTAACGTACCACCCTTTCAAGAACGCAAAAGCCCTATTATGGAAAAACCTCATCATGGCTTTCTTTTGAAATCTGTGGAAAGAAAGAAATCAGAGAATATTTGCAGAAGCGACACAGACCTACACNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATATATATATATATATATATATATATAAATATTGAGAGTCGAGAGAAAAGAGACAACTTGATGACTTTTTGGACAATCATATGATAGTTTTTTTTATCGTTTGTTGGAAATGAAATTCCCCTATGGGTCTTGTCCTTGGAGGATTTAGCTCCAACCAAGGTGGGGAATTCCCATTTAGAATAGAATCGGGAAATAGTTTCCATCAATGTTTTTGGAAACAGGGATGGGAATTACATTTCCCGTGCCACCTTTCCTCTGTCCTTGATTCCAATTTTAAAAAATAACTAAAAACATTTACCATCCTTTTCTATTATAATGGTTGATATGAAATATTTAATATTTTTTTAAAATTAATTATTTAAATTAGTAAATAATGTAAATACAAAAAGTTTCATCTAAGATGCATGTAATTTTAATATATTTGTTTCGATTTTGATATTGATTAC
TTTTTTAATCTAAACAAATTAAATATTGAATGAGAAATTGTTTTTAGTGATCCCGACATGGGGATGGCGATTTCCCAACCTTGACCATCATTGGGTACAAAGATTAAGAAGGTTTCTATATAGCTAGTGACAAGGAGGGCGGTCTCCGACCCTGCTCCATTGCTTTATTGATTGATGTTACTCTCTGACTAAATATTCATGTATTGTCTTCATTTTCTTTCTCATTCTTTTGCCATTTGCTGAGTGTATTGTGTTCACTTTTATCAGAGGTAGGGCGAGTATTGCAGTTGAGGCTTGTTTAGATCATCAGCTTCGGCATAAGGAGGCAAAGAAAAAAGGAAAACCTGCTCTACCCAAAGTAGGCTTCTTTTTCTTTTGGTGACCTTTGTTTCCCCTTTACTTTATGCAATTTTGAAGTGTTCCGTACGTGTAATCTTATTTAAAAAAATTGGACCTTTTTGTATCACACACCAATACCCTTGCATAGAGTATTTGAAGTTTATATGAGCTCCTTCTTCTACAATCATCGCATTATTTTAATATTTGAAATTGACAGGGAAAGTTTGTACAACCTACCAACTTTGACAAGGAGCGTGCATACTTTCAAGAGGTAGATGCTTTTGAGTTGTTGGAGGAGAGCCCTTCACCCAAGAGATTCGGTACATGGGCAAGTAGTCAGTTTGATAGTGCTGCATTACCATCTCTATGCTCTAAAATAGAAAAATGGTTAATTTCCAAGAAATCAAATTACAGTCTTGCACCTTCAAGCACGTTATCAAAGATATTAGAAACTCCACTGGAGTCCAGAGAACCAATTGGTGGCAAACATTTGGACAAATTCAATTTGAGAACTCCTGATAAGTCTGCTAGAGAGATAGATGCGCGGTTGTGCTCTATTCAGAGAAGATTTATTTCCAGTCTAAATGATATCGATGCCCTTGAAATAGATAGCAATCGAGACAACAGAAGTACCGGAGCAGAAGATGTGAGCACAGAGGATCGTGCAGACCTTGAAGTTTCTGTGAAAAAACTTTCTTTAACATCAACATCTACTTCTTTCCATAACTATAATTTGGATCCATTTAATGCATTATTAGCAGTTTGTGGACAGTCAGCTCCTTCCACGCTTAAGGATGCATTCTCTAATTATTGGTTTGTTCCTCAGACTGCAAATACTTATTTTCATATATTTATTCCGCTTATTGGCTTAAAAGAACATAAAAACAATATATCTAGCCTTTTGTCTTCTCTAAGATTTAAAGGTTGGTTTCTGAATCAGTCATCTTCTTTGCAACTCGGACTTCGTTGCTTAAGAAGTGGCAGGAATAGGTTCCACCTCTAAACGGAGATTCCCAAGTAATGTGATTAATGCCTATGAAGTGTTCTTTTTGAGGATTAGCTTTTTAGGGATCCAGTTGCTGCCTTTTAGTTTATTTTCAAGCTTATAGCACAACTTCTACTTGGAACTATTAATTGGATGAGTTGATTCAGGATCAATCCCGACAAGTTACAGCTTTCTTATAACTGTTGAACTCAAGTAATTTATCTGAAATTACAGTCACCATTTGTTATAGAGTAAATAAATACATTCACATATATTTTGCCCTATGCGATTATGCATATGGTTTGTGGTCACTGCTCTTGCAATTTAAGTTGTTCTCCCGTTAATTTCTTGATTTACTTTAGTTCGTATGTTCCTCATTCACAATTTTTAAATTTCGTTGATTAGTGAATTTTTTTACTCAGTGACCCGGAAACTATTGTCAAGGTTGGTGAAGGTACATATGGAGAAGCTTTTAAAGCTGGTAACACCGTTTGCAAGATAGTTCCAATTGACGGAGATTTACAAGTTAATGGAGAAGTGCAAAAGGTACATTTTACCACCACTCATGGCTGTTCCAGCCTGGCAAGTGGCACTTCATGGAAAATTTCAGGGATGGAAGATCATGTTGAAGAGTTCTTTCGTTTTGGAAGCAGATTTTAATGCATTGATTTTTATT
TTTTTTTTAAATTCGTTATTATATATCTTTTTTTGTTCGTTCATTAAAAATTATAATAATTGTTTTTTTCTTTTGTACCATCCTGATGATTCTGATGAATGTAGTTGAAATCAGAAATTTGTTATAATTCTAAGCCAGTCTTACTTACCTTGTATCAGAGATCAGAAGAACTGCTCGAGGAAGTCATACTTTCCAGAACTCTGAATTCTCTAAGAAATCAAGAGGGCGGTGCTGATAATTTTTGCACAACATTCATAAGAACCATAGAGTACGTTTACTGCCTAGTAAGATAAATGGTCATGGTTTTCTACAATGTCAATGCACATCCACTAGCTAGAATTTTCCACCACGCTTTTCCTTTATCCGCAGTAGTTTGTTTCTGCTTTATGTTGGGCTATCAAACATTCTGATGTGATGCAATTTTTAGGTCAATTAAAGAGTAGAAAACTTGTTTCCAGCTCATCTCATACGCTGGGTTTTGTGAGAATAATTGCCGTTGGAAATAATGGAGCAGTTTCTCATGGACTATTAAACTAATACTTTCGTGATCTTTTTATCCAATAGTTTAAGGGTGTGCCAAGGTTCTTATGATGCTCTACTAATCAAGGCTTGGGAAGATTGGGACGAAAAGCATGGCTCGGAAAATGATCACCCTAAGGAGTTTCCAGACAAACAGGTTTCTTTTTGTATAGATCATGAATTATTTCTTTTTCTGAAATATAAAAAGCTTCATGCATCTGAGTTTTTATCGTTTATATGTACCGATGCCATAAAGGGGAACAAAAGCACAGAAACGTATATTTGTTCGCATAATATTGCTTATCTTGTGAAAATAATTTATGTAGCTAGAAGATCACGTTCATTTTGATTTTGTGTTTCTTTGTATAGTTCCAACTTTTCACTTTTTTTTGTAAGACCTCACAATCTATTCCTTGAGTAATGGTTTGAAATTATTTTAAATAAAATGTCATATTATGATATTATGGGCTTGGTTATGCAATTTAGCCTGTGGTTGACTTTTGCTGATGTTCTGTAGTGTTATGTGGTGTTTGTTCTCCAACATGGGGGAAAAGATCTTGAAAGCTTTGTACTCCTGAACTTTGATGAAGCCCAGAGCTTACTAGTACAGGTTCGCAATGGATTTTACTAGCATTATTGACTTTTGAAGCTTTAACCAAATCCAATTTTGACGAACTGCAGGCTACTGCGGCCTTGGCTGTGGCTGAAGCTGCTTATGAATTTGAACACCGAGATCTGCATTGGTCGGGTTCTTCTTCCATCTACAGTTAATTTGGCAGTTTCTTTTCATTGACTACTGATTTGGACCGTGTATTTATTGCCAGGGGAAATGTACTTTTGAGTCGGAATGATTCTGAAGCGCTGCAGTTCACCCTTGAGGGTAAGAACATGACCGTACAAACATTTGGACTTCAGATTTCGATCATTGACTTCACCCTATCGCGAATCAATACTGGTAAGAGGTTTTTAATTTATCATCTATGGAGTTTGCACTGAGACAATTCCCTTAATTAAAATTTGTTTATGAGTTAGTTTTGTTGGTTGCCTGACCAAAAAGGGAGTTAGAAAACTGAATGAATGTTTAACATGCAGGTGAAGACATACTCTTTTTAGATCTCTCATCGGATCCTGATCTCTTTAAGGGTGCAAAAGGAGATAGACAAGTAAGCAAGCATTCAATTTGATTACTTAAGAATTAAGATAAATCTTATCAATCCACTTATTTGAAAATTTTCGTGTTACTTTTTTATTTTTATTTTTGTTCTCATATGCAGTCAGAAACATATAGGAAAATGAAGGAGGTGACTGGAGACTGCTGGGAGGGGAGGTACAAGTGTTTATGCGTTTATCTTTCTATCATATCTGAATCAATTATAGAGGTGATTTTGATCTTTTTCCTCTTTGGCATCCTCAGGATTTAATTGTTTATAAATTTTAGAAAGACTAATATATGCTTAGTGGCTAATATTTGATATTTTAAA
ATCTATTTCTGTTTTATAATTTCCAATTTAGCCTTAAAACAAGATGCCCCAACAGTCATGTTTCCTCTTTACTTCCAAGTATATTTGGTTCCTAATTAGTTTGATCTATCTATGTAAGAAATTTTAAATTTTAACGATACTTTTTATTATATAGACTAAATTATTCCAAATTTTAAAGTACACGGACTAAGTTTGTACACCATGTAGTTTAGAGACTAAATTCTTACAGAAATGAAAATACAAAAACTAAAACATTAAAAAAAAATTTAAGGACTACAATGTTACTTGGATGAAAGTTGAAGGACAAAAGGTAATTTTTGACCGAACATTTTTCAATTCAATCTCTTTTAATGTTGGCGAATATGAAAATTGGTTGATTATAAACTTACAAACTAAGAAAAATCTAGAGGAGAAAAAGTCATCTAGAAGGAAAACAATCGAAATTTCTTATAATTCCTTAGGAATTTAGTTTCTCCTCAAATTTTAACATTTATCAATTGTCATGGACTAAATTGAAAAATGATAATGCATAGAGGATTATGTTAAACTTTTTAAGCGATTGGGATTAGATTAAAAAAGAGTGACTAAAAGATATTTAGCCTTTTAGAAATGTCCAAAAAAGATGTGGACGGTACTGTTTGTGCTGGATTATCTGCTTGGTAAAAGTGTAAAGGGGAAGATCTTATTCTCGTTCTTTTTCCCTCCTTTCAGCAAGAGCGTCCATTGTTATTCTACTTTATATGAATCCACAAACTCCATGCACAAAGTTGATACGTCTAAAATTTTAGAATATTAGCGATGAAATTTAGCAAGATAATTTATGGCTTTCAAGTATAGTACTCAGTAGTAGACATTCTGCCATCCATCATACTAACCTGATACCTTTCACCATTGAAAATTGTCTGTGACTCATAACTGAAACCGTTGATTTTGTATTCAACAGCTTTCCTAGAACGAACGTCCTGTGGTTACTCTACCTTGTGCATATATTACTTCTGAAGAAATCATTCGTAAGTCTTCTATATCTAATTAGGAAACCAATCATAATTGTTCCTTAGAACCCAAAGAATTGCATTTAGCTTTTGTGAACAATCACTTTCATTTGTTGTAAGTTACTAAAAGAAAATGTATTTCTCATTTCTCAAGGACTGAGTAGATTCAAATGGTGCTCAGTACATTTTAGATGCAGAATATCAACAGAATTTGGTTAAGTTATAATTCTTATTCTGGAGTATGATTGTGAATCCCCAGATAAGTAGCTAGATTTTTTTGGTTGTATTATTTTCGAAGGTTCGTACATGTTCAGATAGTTTAGTTCTGAAGTGGATGCATATTCTGTGTCTATATCTAAACAAAACTTGTTAAAGTATGAAGTTGCTTATTCTGAAGATTGACATGAATCCTAAAGATAAATACCTTGGCTGCTCTCCTTTTTGTTTTTTCCTTCTGTGATGTTTACCTTCTAGGGGATTCACAAATCCAAAAGGTGTATATTTACAGTGGTTTTGGAAATGTTTTTGCTGTCTGAAATGTTAACTGTAAAAATTCGATTGAAAGGCTGTCGATATTTTTCCAGGAGCGTAGTTCAAAGCATGAAAGGGAACTGCGGGCATTCAAAAAACGTCTGGACAAATATAACTCAGCAAAAGAAGCAATTTTTGATCCATTTTTCAATGAGTTGATTGTTTGGTCAAGCAGTGTGGAGTAGAGCTGGATGATTTTAGACTCCAACACGAAGTAGAATGAAGTCCTGATATTGCTCCCTGCCGGTGGATGTAAATTAATCTTTGTATTTAATGTTTAAAAATTTATAGATTAAATTAACACAAATGTAAACTATAATGACAACAGGTATATCTTAAGAACCTTTCTTTTACTTTAAATTGTTAGGCTATCTTGTTC

mRNA sequence

ATTGTTTGGCAGCTTTTTGCCCTTAAAAAACGTCCATTTCCGGCCTTAGTTGGCTCATTGGGCTCCCGCCATTCGGACATTCGAACGCTCCTTCTCTGCATTTCACCATCTGCTCTCCACTCCCTCTACTTTGTCCAATCTTGACAGCATGAATTCCAAACCAGGTGGGAAGGGCATTGATTTGTGGTCCGAGTTAATAGCTAGCGAAGGAAGCGAGGGCCAAGAAGAAGCGCGAGTAGAAGAGGTTTATAGACGAAGAAGGCCATCCGAGAAGACCGTTCATGAAGTTCCTTTGAAGCAAAATTTAGGTTCGAATCTTCGCGATGTGAACAGATTGAGTTTTGCTGCTGTTAACAGTAAGAGAATTAGTTGGAACCGTGCCCTTTCCATCAGAGGTAGGGCGAGTATTGCAGTTGAGGCTTGTTTAGATCATCAGCTTCGGCATAAGGAGGCAAAGAAAAAAGGAAAACCTGCTCTACCCAAAGGAAAGTTTGTACAACCTACCAACTTTGACAAGGAGCGTGCATACTTTCAAGAGGTAGATGCTTTTGAGTTGTTGGAGGAGAGCCCTTCACCCAAGAGATTCGGTACATGGGCAAGTAGTCAGTTTGATAGTGCTGCATTACCATCTCTATGCTCTAAAATAGAAAAATGGTTAATTTCCAAGAAATCAAATTACAGTCTTGCACCTTCAAGCACGTTATCAAAGATATTAGAAACTCCACTGGAGTCCAGAGAACCAATTGGTGGCAAACATTTGGACAAATTCAATTTGAGAACTCCTGATAAGTCTGCTAGAGAGATAGATGCGCGGTTGTGCTCTATTCAGAGAAGATTTATTTCCAGTCTAAATGATATCGATGCCCTTGAAATAGATAGCAATCGAGACAACAGAAGTACCGGAGCAGAAGATGTGAGCACAGAGGATCGTGCAGACCTTGAAGTTTCTGTGAAAAAACTTTCTTTAACATCAACATCTACTTCTTTCCATAACTATAATTTGGATCCATTTAATGCATTATTAGCAGTTTGTGGACAGTCAGCTCCTTCCACGCTTAAGGATGCATTCTCTAATTATTGTGACCCGGAAACTATTGTCAAGGTTGGTGAAGGTACATATGGAGAAGCTTTTAAAGCTGGTAACACCGTTTGCAAGATAGTTCCAATTGACGGAGATTTACAAGTTAATGGAGAAGTGCAAAAGAGATCAGAAGAACTGCTCGAGGAAGTCATACTTTCCAGAACTCTGAATTCTCTAAGAAATCAAGAGGGCGGTGCTGATAATTTTTGCACAACATTCATAAGAACCATAGATTTAAGGGTGTGCCAAGGTTCTTATGATGCTCTACTAATCAAGGCTTGGGAAGATTGGGACGAAAAGCATGGCTCGGAAAATGATCACCCTAAGGAGTTTCCAGACAAACAGTGTTATGTGGTGTTTGTTCTCCAACATGGGGGAAAAGATCTTGAAAGCTTTGTACTCCTGAACTTTGATGAAGCCCAGAGCTTACTAGTACAGGCTACTGCGGCCTTGGCTGTGGCTGAAGCTGCTTATGAATTTGAACACCGAGATCTGCATTGGGGAAATGTACTTTTGAGTCGGAATGATTCTGAAGCGCTGCAGTTCACCCTTGAGGGTAAGAACATGACCGTACAAACATTTGGACTTCAGATTTCGATCATTGACTTCACCCTATCGCGAATCAATACTGGTGAAGACATACTCTTTTTAGATCTCTCATCGGATCCTGATCTCTTTAAGGGTGCAAAAGGAGATAGACAATCAGAAACATATAGGAAAATGAAGGAGGTGACTGGAGACTGCTGGGAGGGGAGCTTTCCTAGAACGAACGTCCTGTGGTTACTCTACCTTGTGCATATATTACTTCTGAAGAAATCATTCGAGCGTAGTTCAAAGCATGAAAGGGAACTGCGGGCATTCAAAAAACGTCTGGACAAATATAACTCAGCAAAAGAAGCAATTTTTGATCCATTTTTCA
ATGAGTTGATTGTTTGGTCAAGCAGTGTGGAGTAGAGCTGGATGATTTTAGACTCCAACACGAAGTAGAATGAAGTCCTGATATTGCTCCCTGCCGGTGGATGTAAATTAATCTTTGTATTTAATGTTTAAAAATTTATAGATTAAATTAACACAAATGTAAACTATAATGACAACAGGTATATCTTAAGAACCTTTCTTTTACTTTAAATTGTTAGGCTATCTTGTTC

Coding sequence (CDS)

ATGAATTCCAAACCAGGTGGGAAGGGCATTGATTTGTGGTCCGAGTTAATAGCTAGCGAAGGAAGCGAGGGCCAAGAAGAAGCGCGAGTAGAAGAGGTTTATAGACGAAGAAGGCCATCCGAGAAGACCGTTCATGAAGTTCCTTTGAAGCAAAATTTAGGTTCGAATCTTCGCGATGTGAACAGATTGAGTTTTGCTGCTGTTAACAGTAAGAGAATTAGTTGGAACCGTGCCCTTTCCATCAGAGGTAGGGCGAGTATTGCAGTTGAGGCTTGTTTAGATCATCAGCTTCGGCATAAGGAGGCAAAGAAAAAAGGAAAACCTGCTCTACCCAAAGGAAAGTTTGTACAACCTACCAACTTTGACAAGGAGCGTGCATACTTTCAAGAGGTAGATGCTTTTGAGTTGTTGGAGGAGAGCCCTTCACCCAAGAGATTCGGTACATGGGCAAGTAGTCAGTTTGATAGTGCTGCATTACCATCTCTATGCTCTAAAATAGAAAAATGGTTAATTTCCAAGAAATCAAATTACAGTCTTGCACCTTCAAGCACGTTATCAAAGATATTAGAAACTCCACTGGAGTCCAGAGAACCAATTGGTGGCAAACATTTGGACAAATTCAATTTGAGAACTCCTGATAAGTCTGCTAGAGAGATAGATGCGCGGTTGTGCTCTATTCAGAGAAGATTTATTTCCAGTCTAAATGATATCGATGCCCTTGAAATAGATAGCAATCGAGACAACAGAAGTACCGGAGCAGAAGATGTGAGCACAGAGGATCGTGCAGACCTTGAAGTTTCTGTGAAAAAACTTTCTTTAACATCAACATCTACTTCTTTCCATAACTATAATTTGGATCCATTTAATGCATTATTAGCAGTTTGTGGACAGTCAGCTCCTTCCACGCTTAAGGATGCATTCTCTAATTATTGTGACCCGGAAACTATTGTCAAGGTTGGTGAAGGTACATATGGAGAAGCTTTTAAAGCTGGTAACACCGTTTGCAAGATAGTTCCAATTGACGGAGATTTACAAGTTAATGGAGAAGTGCAAAAGAGATCAGAAGAACTGCTCGAGGAAGTCATACTTTCCAGAACTCTGAATTCTCTAAGAAATCAAGAGGGCGGTGCTGATAATTTTTGCACAACATTCATAAGAACCATAGATTTAAGGGTGTGCCAAGGTTCTTATGATGCTCTACTAATCAAGGCTTGGGAAGATTGGGACGAAAAGCATGGCTCGGAAAATGATCACCCTAAGGAGTTTCCAGACAAACAGTGTTATGTGGTGTTTGTTCTCCAACATGGGGGAAAAGATCTTGAAAGCTTTGTACTCCTGAACTTTGATGAAGCCCAGAGCTTACTAGTACAGGCTACTGCGGCCTTGGCTGTGGCTGAAGCTGCTTATGAATTTGAACACCGAGATCTGCATTGGGGAAATGTACTTTTGAGTCGGAATGATTCTGAAGCGCTGCAGTTCACCCTTGAGGGTAAGAACATGACCGTACAAACATTTGGACTTCAGATTTCGATCATTGACTTCACCCTATCGCGAATCAATACTGGTGAAGACATACTCTTTTTAGATCTCTCATCGGATCCTGATCTCTTTAAGGGTGCAAAAGGAGATAGACAATCAGAAACATATAGGAAAATGAAGGAGGTGACTGGAGACTGCTGGGAGGGGAGCTTTCCTAGAACGAACGTCCTGTGGTTACTCTACCTTGTGCATATATTACTTCTGAAGAAATCATTCGAGCGTAGTTCAAAGCATGAAAGGGAACTGCGGGCATTCAAAAAACGTCTGGACAAATATAACTCAGCAAAAGAAGCAATTTTTGATCCATTTTTCAATGAGTTGATTGTTTGGTCAAGCAGTGTGGAGTAG
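
The coding sequence and the protein sequence listed for this feature are related by translation. As a rough illustration, the sketch below translates a DNA string with the standard genetic code and stops at the first stop codon. It assumes the standard nuclear code applies to this gene. The `translate` function is a hypothetical helper written for this example, and only the first and last few codons of the CDS are used as input.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    A trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First six codons of the CDS, then its last three codons.
print(translate("ATGAATTCCAAACCAGGT"))  # MNSKPG
print(translate("GTGGAGTAG"))           # VE
```

Both results agree with the start (`MNSKPG`) and end (`VE`) of the protein sequence given for this gene.
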

Protein sequence

MNSKPGGKGIDLWSELIASEGSEGQEEARVEEVYRRRRPSEKTVHEVPLKQNLGSNLRDVNRLSFAAVNSKRISWNRALSIRGRASIAVEACLDHQLRHKEAKKKGKPALPKGKFVQPTNFDKERAYFQEVDAFELLEESPSPKRFGTWASSQFDSAALPSLCSKIEKWLISKKSNYSLAPSSTLSKILETPLESREPIGGKHLDKFNLRTPDKSAREIDARLCSIQRRFISSLNDIDALEIDSNRDNRSTGAEDVSTEDRADLEVSVKKLSLTSTSTSFHNYNLDPFNALLAVCGQSAPSTLKDAFSNYCDPETIVKVGEGTYGEAFKAGNTVCKIVPIDGDLQVNGEVQKRSEELLEEVILSRTLNSLRNQEGGADNFCTTFIRTIDLRVCQGSYDALLIKAWEDWDEKHGSENDHPKEFPDKQCYVVFVLQHGGKDLESFVLLNFDEAQSLLVQATAALAVAEAAYEFEHRDLHWGNVLLSRNDSEALQFTLEGKNMTVQTFGLQISIIDFTLSRINTGEDILFLDLSSDPDLFKGAKGDRQSETYRKMKEVTGDCWEGSFPRTNVLWLLYLVHILLLKKSFERSSKHERELRAFKKRLDKYNSAKEAIFDPFFNELIVWSSSVE
BLAST of Cp4.1LG18g00040 vs. Swiss-Prot
Match: HASP_BOVIN (Serine/threonine-protein kinase haspin OS=Bos taurus GN=GSG2 PE=2 SV=1)

HSP 1 Score: 218.4 bits (555), Expect = 2.2e-55
Identity = 131/322 (40.68%), Positives = 181/322 (56.21%), Query Frame = 1

Query: 291 LLAVCGQSAPSTLKDAFSNYCDPETIVKVGEGTYGEAFKAGNT----VCKIVPIDGDLQV 350
           L   C Q  P    D  S     E   K+GEG +GE F+          KI+ I+G   V
Sbjct: 446 LYGECNQVGPIPFSDYLSEE-KLECCEKIGEGVFGEVFQTVTNHTPVALKIIAIEGQNLV 505

Query: 351 NGEVQKRSEELLEEVILSRTLNSLRNQEGGADNFCTTFIRTIDLRVCQGSYDALLIKAWE 410
           NG  QK  EE+L E+I+S+ L+ L ++   A N    FI    +   QGSY  LL++AW+
Sbjct: 506 NGAHQKTFEEILPEIIISKELSLLSDE---ACNRTEGFIGLNSVHCVQGSYPPLLLQAWD 565

Query: 411 DWDEKHGSENDHPKEFPDKQCYVVFVLQHGGKDLESF--VLLNFDEAQSLLVQATAALAV 470
            +    GS ND P  F + Q ++V   + GG DLE     L +   A+S+L Q TA+LAV
Sbjct: 566 HYHSTKGSANDRPDFFREDQLFIVLEFEFGGIDLEQMRKKLSSIATAKSILHQITASLAV 625

Query: 471 AEAAYEFEHRDLHWGNVLLSRNDSEALQFTLEGKNMTVQTFGLQISIIDFTLSRINTGED 530
           AEA+  FEHRDLHWGNVLL +   + L +TL GK  ++ T GLQ++IID+TLSR+     
Sbjct: 626 AEASLHFEHRDLHWGNVLLKKTSLKELHYTLNGKKSSIPTRGLQVNIIDYTLSRLERDGI 685

Query: 531 ILFLDLSSDPDLFKGAKGDRQSETYRKMKEVTGDCWEGSFPRTNVLWLLYLVHILLLKKS 590
           ++F D+S D DLF G +GD Q E YR M++   +CW    P  NVLWL YL   +L + +
Sbjct: 686 VVFCDISRDEDLFMG-QGDYQFEIYRLMRKENNNCWGEYHPYNNVLWLHYLTDKILNQMT 745

Query: 591 FERSSKHER-ELRAFKKRLDKY 606
           F+  SKH    L+  KK++  +
Sbjct: 746 FK--SKHNTPALKRMKKQIQHF 760
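
Each HSP header reports identity and positives as a count over the alignment length, followed by a percentage. The sketch below recomputes those percentages for the HASP_BOVIN hit (131 identical and 181 positive positions out of 322 aligned columns). The helper name `percent_of_alignment` is hypothetical, and rounding to two decimals is an assumption based on how the values are displayed.

```python
def percent_of_alignment(count: int, alignment_length: int) -> float:
    """Return count / alignment_length as a percentage rounded to 2 decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    return round(100.0 * count / alignment_length, 2)


# Values taken from the HASP_BOVIN HSP header.
print(percent_of_alignment(131, 322))  # 40.68
print(percent_of_alignment(181, 322))  # 56.21
```

The denominator is the number of alignment columns, gaps included, so it need not equal the length of either the query or the subject segment.
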

BLAST of Cp4.1LG18g00040 vs. Swiss-Prot
Match: HASP_MOUSE (Serine/threonine-protein kinase haspin OS=Mus musculus GN=Gsg2 PE=1 SV=3)

HSP 1 Score: 213.8 bits (543), Expect = 5.5e-54
Identity = 142/402 (35.32%), Positives = 216/402 (53.73%), Query Frame = 1

Query: 227 QRRFISSLNDIDALEIDSNRDNRSTGAEDVSTEDRADLEVSVKKLS--LTSTSTSFHNYN 286
           +++ I+S+ ++ +    S+  +  +       ++RA L VS +  S  L S   + H  +
Sbjct: 351 KKKIITSVIEVCSSVASSSSRSLLSECSTPPIKNRAHLTVSSRCSSVYLLSPLKTLHVTD 410

Query: 287 LDPFNA--LLAVCGQSAPSTLKDAFSNYCDPETIVKVGEGTYGEAFKAGN----TVCKIV 346
             P  A  +   C Q  P    D  S     E   K+GEG +GE F+  N       KI+
Sbjct: 411 QRPSYAEKVYGECNQEGPIPFSDCLSTE-KLERCEKIGEGVFGEVFQIINDQAPVALKII 470

Query: 347 PIDGDLQVNGEVQKRSEELLEEVILSRTLNSLRNQEGGADNFCTTFIRTIDLRVCQGSYD 406
            I+G   VNG  QK  EE+L E+I+S+ L+ L ++   A N    FI    +   QG Y 
Sbjct: 471 AIEGLDLVNGSHQKTFEEILPEIIISKELSLLSSE---AYNRTEGFIGLNSVHCVQGLYP 530

Query: 407 ALLIKAWEDWDEKHGSENDHPKEFPDKQCYVVFVLQHGGKDLESFV--LLNFDEAQSLLV 466
            LL+KAW+ ++    S ND P  F + Q +++   + GG DLE     L +   A+S+L 
Sbjct: 531 PLLLKAWDHYNTTKRSANDRPDFFQEDQLFIILEFEFGGVDLERMKTKLSSVATAKSILH 590

Query: 467 QATAALAVAEAAYEFEHRDLHWGNVLLSRNDSEALQFTLEGKNMTVQTFGLQISIIDFTL 526
           Q TA+LAVAEA+  FEHRDLHWGNVLL + + + L++TL GK  T+ T GLQ++IID+TL
Sbjct: 591 QITASLAVAEASLHFEHRDLHWGNVLLKKTNLKELRYTLNGKTSTIPTHGLQVNIIDYTL 650

Query: 527 SRINTGEDILFLDLSSDPDLFKGAKGDRQSETYRKMKEVTGDCWEGSFPRTNVLWLLYLV 586
           SR+     ++F D+S++ DLF G +GD Q E YR M++   +CW    P  NVLWL YL 
Sbjct: 651 SRLERDGIVVFCDISAEEDLFTG-EGDYQFEIYRLMRKENKNCWGEYHPYNNVLWLHYLT 710

Query: 587 HILLLKKSFERSSKH------ERELRAFKKRLDKYNSAKEAI 613
             +L K  F+   +        + L+ F + +  ++SA + +
Sbjct: 711 DKILNKMKFKTKCQSAAMKQIRKNLQHFHRTVLSFSSATDLL 747

BLAST of Cp4.1LG18g00040 vs. Swiss-Prot
Match: HASP_HUMAN (Serine/threonine-protein kinase haspin OS=Homo sapiens GN=GSG2 PE=1 SV=3)

HSP 1 Score: 213.4 bits (542), Expect = 7.2e-54
Identity = 126/307 (41.04%), Positives = 181/307 (58.96%), Query Frame = 1

Query: 318 KVGEGTYGEAFK--AGNT--VCKIVPIDGDLQVNGEVQKRSEELLEEVILSRTLNSLRNQ 377
           K+GEG +GE F+  A +T    KI+ I+G   VNG  QK  EE+L E+I+S+ L+ L   
Sbjct: 489 KIGEGVFGEVFQTIADHTPVAIKIIAIEGPDLVNGSHQKTFEEILPEIIISKELSLL--- 548

Query: 378 EGGADNFCTTFIRTIDLRVCQGSYDALLIKAWEDWDEKHGSENDHPKEFPDKQCYVVFVL 437
            G   N    FI    +   QGSY  LL+KAW+ ++   GS ND P  F D Q ++V   
Sbjct: 549 SGEVCNRTEGFIGLNSVHCVQGSYPPLLLKAWDHYNSTKGSANDRPDFFKDDQLFIVLEF 608

Query: 438 QHGGKDLESF--VLLNFDEAQSLLVQATAALAVAEAAYEFEHRDLHWGNVLLSRNDSEAL 497
           + GG DLE     L +   A+S+L Q TA+LAVAEA+  FEHRDLHWGNVLL +   + L
Sbjct: 609 EFGGIDLEQMRTKLSSLATAKSILHQLTASLAVAEASLRFEHRDLHWGNVLLKKTSLKKL 668

Query: 498 QFTLEGKNMTVQTFGLQISIIDFTLSRINTGEDILFLDLSSDPDLFKGAKGDRQSETYRK 557
            +TL GK+ T+ + GLQ+SIID+TLSR+     ++F D+S D DLF G  GD Q + YR 
Sbjct: 669 HYTLNGKSSTIPSCGLQVSIIDYTLSRLERDGIVVFCDVSMDEDLFTG-DGDYQFDIYRL 728

Query: 558 MKEVTGDCWEGSFPRTNVLWLLYLVHILLLKKSFERS------SKHERELRAFKKRLDKY 613
           MK+   + W    P +NVLWL YL   +L + +F+         + +R+++ F + +  +
Sbjct: 729 MKKENNNRWGEYHPYSNVLWLHYLTDKMLKQMTFKTKCNTPAMKQIKRKIQEFHRTMLNF 788

BLAST of Cp4.1LG18g00040 vs. Swiss-Prot
Match: HASP_DROME (Putative serine/threonine-protein kinase haspin homolog OS=Drosophila melanogaster GN=Haspin PE=2 SV=1)

HSP 1 Score: 203.0 bits (515), Expect = 9.7e-51
Identity = 106/297 (35.69%), Positives = 164/297 (55.22%), Query Frame = 1

Query: 291 LLAVCGQSAPSTLKDAFSNYCDPETIVKVGEGTYGEAFKAGNT-----------VCKIVP 350
           +L  C Q  P     A+  +    T  K+GEG YGE F+               V KI+P
Sbjct: 227 VLKYCHQCTPLPFNTAYEQHKLLNT-KKIGEGAYGEVFRCSRNQEVLKDHISDIVLKIIP 286

Query: 351 IDGDLQVNGEVQKRSEELLEEVILSRTLNSLRNQEGGADNFCTTFIRTIDLRVCQGSYDA 410
           ++G   +NGE QK   ++L E+I+++ + SLR  +  + N    F+    + + +G Y  
Sbjct: 287 LEGSTVINGEKQKTFSQILPEIIITKKMCSLRTSKTNSTN---GFVSIQKVSLVKGRYPP 346

Query: 411 LLIKAWEDWDEKHGSENDHPKEFPDKQCYVVFVLQHGGKDLESFVLLNFDEAQSLLVQAT 470
             IK WE +D + GSENDHP+ F D Q + V  L+  G D+ +F  LN +++   L Q  
Sbjct: 347 HFIKLWEKYDNEKGSENDHPELFGDNQLFAVLELKFAGSDMANFKFLNSEQSYYALQQII 406

Query: 471 AALAVAEAAYEFEHRDLHWGNVLLSRNDSEALQFTLEGKNMTVQTFGLQISIIDFTLSRI 530
            ALAV E  Y+FEHRDLH GN+L+   + + +  T +  N+T+ + G+ ++IID+TLSR+
Sbjct: 407 LALAVGEEEYQFEHRDLHLGNILIEYTNKKHIVCTFKSSNLTLLSKGVNVTIIDYTLSRV 466

Query: 531 NTGEDILFLDLSSDPDLFKGAKGDRQSETYRKMKEVTGDCWEGSFPRTNVLWLLYLV 577
              +   F DLS D +LF+ A GD Q + YR M+    + W    P+TN++WL Y++
Sbjct: 467 TINDCCYFNDLSRDEELFQ-ATGDYQYDVYRMMRNELKNNWSSFSPKTNIIWLSYVI 518

BLAST of Cp4.1LG18g00040 vs. Swiss-Prot
Match: HASP_SCHPO (Serine/threonine-protein kinase haspin homolog hrk1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=hrk1 PE=1 SV=1)

HSP 1 Score: 171.0 bits (432), Expect = 4.1e-41
Identity = 125/367 (34.06%), Positives = 180/367 (49.05%), Query Frame = 1

Query: 244 SNRDNRSTGAEDVSTEDRADLEVSVKKLSLTSTSTSFHNYNLDPFNALLAVCGQSAPSTL 303
           S RDN S     VS ++  +L  SV       +    +N  LDP + LL +  Q      
Sbjct: 96  SPRDNASKSV--VSKKEVVNLSSSV-----ALSGKPANNSKLDPLHRLLQIVAQ------ 155

Query: 304 KDA--FSNYCDPET--IVKVGEGTYGEAFKAGNT-----VCKIVPIDGDLQVNGEVQKRS 363
           +DA  FS +   +T  I K+GE +Y E ++A N      V K++P   D Q       + 
Sbjct: 156 EDALPFSQFVKSQTFEIQKIGEASYSEVYQASNADDVPVVWKVIPFGEDGQA------QY 215

Query: 364 EELLEEVILSRTLNSLRNQEGGADNFCTTFIRTIDLRVCQGSYDALLIKAWEDWDEKHGS 423
            ++L EV +S+ +          D F         + V +G+Y +LL++ W+ +  ++GS
Sbjct: 216 ADVLNEVQISQWIK--------VDGFANLH----QVVVVKGTYPSLLLEEWDRYLMQNGS 275

Query: 424 ENDHPKEFPDKQCYVVFVLQHGGKDLESFVLLNFDEAQSLLVQATAALAVAEAAYEFEHR 483
           END P  +   Q Y V  L H G DLE F L ++ E  S+  +    L++ E  YEFEHR
Sbjct: 276 ENDRPDSYSSTQLYCVLCLDHSGTDLEHFELRSWRECWSVFYETLKILSLVETRYEFEHR 335

Query: 484 DLHWGNVLLSRND--SEALQFTLE-------------GKNMTVQTFG--LQISIIDFTLS 543
           DLHWGN+L+ + D   E + F L              G       F   LQ+++IDFTL+
Sbjct: 336 DLHWGNILIRKADRSEEEVSFLLNEISLDDIESVDFPGSQDKADDFDNILQVTLIDFTLA 395

Query: 544 RINTGEDILFLDLSSDPDLFKGAKGDRQSETYRKMKEVTGDCWEGSFPRTNVLWLLYLVH 585
           R +  + I+  +  +DPDLF G   D Q + YR M  VT   W   FP TNVLWL YL+H
Sbjct: 396 RASYSQGIISYNEFNDPDLFNGV-DDYQFDIYRLMSRVTKGRWAQFFPITNVLWLHYLIH 430

BLAST of Cp4.1LG18g00040 vs. TrEMBL
Match: A0A0A0LVR4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G560690 PE=4 SV=1)

HSP 1 Score: 1050.4 bits (2715), Expect = 8.5e-304
Identity = 532/628 (84.71%), Positives = 572/628 (91.08%), Query Frame = 1

Query: 1   MNSKPGGKGIDLWSELIASEGSEGQEEARVEEVYRRRRPSEKTVHEVPLKQNLGSNLRDV 60
           M S  GGK IDLWSELIASEGS+ QEEA VEEVYRRR+P++KTVH    KQNLGSN  +V
Sbjct: 93  MTSNLGGKAIDLWSELIASEGSDLQEEASVEEVYRRRKPTQKTVHPPHPKQNLGSNGCNV 152

Query: 61  NRLSFAAVNSKRISWNRALSIRGRASIAVEACLDHQLRHKEAKKKGKPALPKGKFVQPTN 120
           NR+S AAV+SKRISWNRALSIRGR SIAVEAC+D Q + K+AK+KGKPALPKGK+VQPTN
Sbjct: 153 NRVSLAAVDSKRISWNRALSIRGRVSIAVEACIDRQRQCKQAKRKGKPALPKGKYVQPTN 212

Query: 121 FDKERAYFQEVDAFELLEESPSPKRFGTWASSQFDSAALPSLCSKIEKWLISKKSNYSLA 180
           FDKERAYFQEVDAFELLEESPSPK F TW SSQFDS+ +PSLCS+IEKWLISKKS YSLA
Sbjct: 213 FDKERAYFQEVDAFELLEESPSPKSFSTWTSSQFDSSTIPSLCSRIEKWLISKKSKYSLA 272

Query: 181 PSSTLSKILETPLESREPIGGKHLDKFNLRTPDKSAREIDARLCSIQRRFISSLNDIDAL 240
           PSSTLSKILETPL S EPIGG HLDKF L+TP+ SAR+ DA  CSIQRRFI S+NDIDAL
Sbjct: 273 PSSTLSKILETPLGSIEPIGGIHLDKFKLKTPENSARDRDAHWCSIQRRFIFSINDIDAL 332

Query: 241 EIDSNRDNRSTGAEDVSTEDRADLEVSVKKLSLTSTSTSFHNYNLDPFNALLAVCGQSAP 300
           +IDSN DNRS  AE++ TEDR D+EV+VKKLSLTSTSTSFH Y+LDP +ALLAVCGQS P
Sbjct: 333 KIDSN-DNRSNRAEEMRTEDREDIEVAVKKLSLTSTSTSFHKYDLDPLSALLAVCGQSTP 392

Query: 301 STLKDAFSNYCDPETIVKVGEGTYGEAFKAGNTVCKIVPIDGDLQVNGEVQKRSEELLEE 360
           STLKD FSNYC+ ETIVKVGEGTYGEAFK GNTVCK+VPIDGDL+VNGE+QKRS ELLEE
Sbjct: 393 STLKDVFSNYCELETIVKVGEGTYGEAFKVGNTVCKVVPIDGDLKVNGEIQKRSVELLEE 452

Query: 361 VILSRTLNSLRNQEGGADNFCTTFIRTIDLRVCQGSYDALLIKAWEDWDEKHGSENDHPK 420
           VILSRTLNSLR+ E  ADNFCTTFIRTIDLRVCQGSYDA+L+KAWEDWDEKHGSENDHPK
Sbjct: 453 VILSRTLNSLRSNERCADNFCTTFIRTIDLRVCQGSYDAVLVKAWEDWDEKHGSENDHPK 512

Query: 421 EFPDKQCYVVFVLQHGGKDLESFVLLNFDEAQSLLVQATAALAVAEAAYEFEHRDLHWGN 480
           EFP+KQ YVVFVLQHGGKDLESFVLLN+DEAQSLLVQ TAALAVAEAAY+FEHRDLHWGN
Sbjct: 513 EFPEKQLYVVFVLQHGGKDLESFVLLNYDEAQSLLVQVTAALAVAEAAYQFEHRDLHWGN 572

Query: 481 VLLSRNDSEALQFTLEGKNMTVQTFGLQISIIDFTLSRINTGEDILFLDLSSDPDLFKGA 540
           VLLSRND EALQFTLE KNMTV+TFGLQISIIDFTLSRINTGEDILFLDLSSDP LFKG 
Sbjct: 573 VLLSRNDYEALQFTLESKNMTVKTFGLQISIIDFTLSRINTGEDILFLDLSSDPYLFKGP 632

Query: 541 KGDRQSETYRKMKEVTGDCWEGSFPRTNVLWLLYLVHILLLKKSFERSSKHERELRAFKK 600
           +GDRQSETYRKMKEVTGDCWEGSFPRTNVLWLLYLV ILLLKKSFERSSKHERELRAFKK
Sbjct: 633 RGDRQSETYRKMKEVTGDCWEGSFPRTNVLWLLYLVDILLLKKSFERSSKHERELRAFKK 692

Query: 601 RLDKYNSAKEAIFDPFFNELIVWSSSVE 629
           RLDKY S KEAI+D FF+ELIVWSSSVE
Sbjct: 693 RLDKYTSTKEAIYDQFFSELIVWSSSVE 719

BLAST of Cp4.1LG18g00040 vs. TrEMBL
Match: A0A067JHK6_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23175 PE=4 SV=1)

HSP 1 Score: 797.0 bits (2057), Expect = 1.7e-227
Identity = 413/625 (66.08%), Positives = 505/625 (80.80%), Query Frame = 1

Query: 1   MNSKPGGKGIDLWSELIASEG--SEGQEEARVEEVYRRRRPSEKTVHEVPLKQNLGSNLR 60
           M S+  G G+DLWS++IA E   ++ Q++ ++E +YRRR+P +KT  EV LKQ L S+  
Sbjct: 1   MGSREDGNGVDLWSQIIAEEPGYNDQQQQQKIEVIYRRRKP-QKTPAEVNLKQ-LESD-- 60

Query: 61  DVNRLSFAAVNSKRISWNRALSIRGRASIAVEACLDHQLRHKEAKKKGKPALPKGKFVQP 120
           + NR+S  AV +KR+SWNR+LSIRGR SIAV AC+D++ + K+AK+KGKP +PKGK V+P
Sbjct: 61  EENRVSLVAV-TKRVSWNRSLSIRGRVSIAVAACVDNRPQKKQAKRKGKPPVPKGKAVKP 120

Query: 121 TNFDKERAYFQEVDAFELLEESPSPKRFGTW-ASSQFDSAALPSLCSKIEKWLISKKSNY 180
            NF+KE+ YFQEVDAFELLEESPSPK  G W A +Q D+  +P L S++EKWLISKK NY
Sbjct: 121 PNFEKEKEYFQEVDAFELLEESPSPKNSGKWTADNQTDTIPIPHLSSRLEKWLISKKLNY 180

Query: 181 SLAPSSTLSKILETPLESREPIGGKHLDKFNLRTPDKSAREIDARLCSIQRRFISSLNDI 240
           S  PSSTLSK+LETP+ S EPI G      +L TP+KS+ ++ + L S+Q R  + L + 
Sbjct: 181 SCGPSSTLSKLLETPVMSLEPICGDDFGGIDLVTPEKSSSKVCSSLHSVQSRINAYLINK 240

Query: 241 DALEIDSNRDNRSTGAEDVSTEDRADLEVSVKKLSLTSTSTSFHNYNLDPFNALLAVCGQ 300
                +SN +  S  A  +  +D  D+E S+KKLSL STST+  +  ++PF++LLAVCGQ
Sbjct: 241 YVSGRNSNSEKSSMLAI-LRDDDCKDIEASIKKLSLASTSTTLDHDYVNPFSSLLAVCGQ 300

Query: 301 SAPSTLKDAFSNYCDPETIVKVGEGTYGEAFKAGNTVCKIVPIDGDLQVNGEVQKRSEEL 360
            AP TL D FS YCDP+++ KVGEGTYGEAFKAGNTVCKIVPIDG+L+VNGEVQKRSEEL
Sbjct: 301 MAPLTLLDVFSKYCDPKSVTKVGEGTYGEAFKAGNTVCKIVPIDGELKVNGEVQKRSEEL 360

Query: 361 LEEVILSRTLNSLRNQEGGADNFCTTFIRTIDLRVCQGSYDALLIKAWEDWDEKHGSEND 420
           LEEV+LSRTLN LR  +    N CTTFI T+DL+VCQG Y + LI+AWEDWD+KH SEND
Sbjct: 361 LEEVVLSRTLNHLRRNDVDVQNACTTFIETLDLKVCQGPYASALIRAWEDWDDKHSSEND 420

Query: 421 HPKEFPDKQCYVVFVLQHGGKDLESFVLLNFDEAQSLLVQATAALAVAEAAYEFEHRDLH 480
           HP+EFP+KQ YVVFVL HGGKDLESFVLLNFDEA+SLL+Q TAALAVAEAA+EFEHRDLH
Sbjct: 421 HPREFPEKQSYVVFVLAHGGKDLESFVLLNFDEARSLLIQVTAALAVAEAAFEFEHRDLH 480

Query: 481 WGNVLLSRNDSEALQFTLEGKNMTVQTFGLQISIIDFTLSRINTGEDILFLDLSSDPDLF 540
           WGN+LLSRN+S  +QFTLEGK M ++++GL ISIIDFTLSRINTGEDILFLDLSSDP LF
Sbjct: 481 WGNILLSRNESAMVQFTLEGKQMLLKSYGLLISIIDFTLSRINTGEDILFLDLSSDPYLF 540

Query: 541 KGAKGDRQSETYRKMKEVTGDCWEGSFPRTNVLWLLYLVHILLLKKSFERSSKHERELRA 600
           KG +GD+Q+ETYR+MKEVT D WEGSFPRTNVLWLLYLV ILLLKKSFER+SK+ERELR+
Sbjct: 541 KGPRGDKQAETYRRMKEVTEDFWEGSFPRTNVLWLLYLVDILLLKKSFERTSKNERELRS 600

Query: 601 FKKRLDKYNSAKEAIFDPFFNELIV 623
            KKRL+KYNSAKEAI DPFF++LI+
Sbjct: 601 LKKRLEKYNSAKEAILDPFFSDLII 619

BLAST of Cp4.1LG18g00040 vs. TrEMBL
Match: F6I4H2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_14s0060g00450 PE=4 SV=1)

HSP 1 Score: 794.3 bits (2050), Expect = 1.1e-226
Identity = 407/618 (65.86%), Positives = 480/618 (77.67%), Query Frame = 1

Query: 6   GGKGIDLWSELIASEGSEGQEEARVEEVYRRRRPSEKTVHEVPLKQNLGSNLRDVNRLSF 65
           G   +DLWSE+ A+E  EG  + ++  +YRRRR +E T+ +V   Q   S+L + NRLS 
Sbjct: 18  GQDSVDLWSEIAATEEGEGNRKQQIAVIYRRRRRTENTLKDVDTNQ---SSLNNGNRLSL 77

Query: 66  AAVNSKRISWNRALSIRGRASIAVEACLDHQLRHKEAKKKGKPALPKGKFVQPTNFDKER 125
           AA   KR+SWNR+LSIRGR SIAV AC+++Q + ++ ++K KP LP+GKFVQP +F++ER
Sbjct: 78  AAP-VKRVSWNRSLSIRGRTSIAVFACVEYQPQQRKPRRKAKPPLPRGKFVQPQSFEQER 137

Query: 126 AYFQEVDAFELLEESPSPKRFGTWASS-QFDSAALPSLCSKIEKWLISKKSNYSLAPSST 185
           AYFQEVDA+ELLEESPSPKRFGTWA   Q D   LP L S + KWLI+KK NYS  PS +
Sbjct: 138 AYFQEVDAYELLEESPSPKRFGTWAMGVQSDDIVLPHLSSVLNKWLIAKKLNYSYGPSGS 197

Query: 186 LSKILETPLESREPIGGKHLDKFNLRTPDKSAREIDARLCSIQRRFISSLNDIDALEIDS 245
           LSKILETP    EPI G   D   ++TP+K++ ++   L S+Q RF S+  + D +   +
Sbjct: 198 LSKILETPAMPMEPICGDGFDALTVKTPEKASPQVCLGLHSVQDRFNSNFMNKDVVGRQT 257

Query: 246 NRDNRSTGAEDVSTEDRADLEVSVKKLSLTSTSTSFHNYNLDPFNALLAVCGQSAPSTLK 305
                +  +     E   D++V++KKLSLTS S S    + D F+ALL VC QSAPSTL 
Sbjct: 258 GSQKSNEVSLTTGDEGCEDIDVAIKKLSLTSRSASLGGDHWDSFSALLTVCEQSAPSTLL 317

Query: 306 DAFSNYCDPETIVKVGEGTYGEAFKAGNTVCKIVPIDGDLQVNGEVQKRSEELLEEVILS 365
           D FS YCDPE+IVK+GEGTYGEAF+AG TVCKIVPIDGDL VNGEVQKRS ELLEE ILS
Sbjct: 318 DVFSKYCDPESIVKIGEGTYGEAFRAGKTVCKIVPIDGDLLVNGEVQKRSGELLEEAILS 377

Query: 366 RTLNSLRNQEGGADNFCTTFIRTIDLRVCQGSYDALLIKAWEDWDEKHGSENDHPKEFPD 425
           RTLN LR   G  +N CT+FI T+DLRVCQG YDA LI+AWEDWDEKHGSENDHP+EFP+
Sbjct: 378 RTLNHLRGDGGRVNNSCTSFIETLDLRVCQGPYDAALIRAWEDWDEKHGSENDHPREFPE 437

Query: 426 KQCYVVFVLQHGGKDLESFVLLNFDEAQSLLVQATAALAVAEAAYEFEHRDLHWGNVLLS 485
           KQCYVVFVL+HGGKDLESFVLLNFDE +SLLVQ T ALAVAEAAYEFEHRDLHWGN+LLS
Sbjct: 438 KQCYVVFVLEHGGKDLESFVLLNFDEVRSLLVQVTVALAVAEAAYEFEHRDLHWGNILLS 497

Query: 486 RNDSEALQFTLEGKNMTVQTFGLQISIIDFTLSRINTGEDILFLDLSSDPDLFKGAKGDR 545
           R DSE LQFTLEGKNM V+TFGL ISIIDFTLSRINTGE ILFLDLSSDP+LFKG KGD+
Sbjct: 498 RKDSEMLQFTLEGKNMFVKTFGLSISIIDFTLSRINTGEAILFLDLSSDPELFKGPKGDK 557

Query: 546 QSETYRKMKEVTGDCWEGSFPRTNVLWLLYLVHILLLKKSFERSSKHERELRAFKKRLDK 605
           QS TYRKMKE+T D WEGSFP+TNVLWL YLV ILLLKKSF+R+SK ERELR+ KKR+D 
Sbjct: 558 QSNTYRKMKEITEDFWEGSFPKTNVLWLQYLVDILLLKKSFKRTSKDERELRSLKKRMDN 617

Query: 606 YNSAKEAIFDPFFNELIV 623
           Y SAKEA  DPFF ++ V
Sbjct: 618 YGSAKEATSDPFFTDMFV 631

BLAST of Cp4.1LG18g00040 vs. TrEMBL
Match: B9RFQ3_RICCO (Serine/threonine-protein kinase Haspin, putative OS=Ricinus communis GN=RCOM_1436590 PE=4 SV=1)

HSP 1 Score: 750.0 bits (1935), Expect = 2.4e-213
Identity = 400/639 (62.60%), Positives = 490/639 (76.68%), Query Frame = 1

Query: 10  IDLWSELIASEGSEGQEEARVEEVYRRRRPSEKTVHEVPLKQNLGSNLRDVNRLSF---A 69
           +DLW+E+IA E +  +++  ++  YRRRR S++T+ +  + QN   + ++ NR+S    A
Sbjct: 12  VDLWAEIIAEE-TNYKDQKNIKVFYRRRR-SQETLQD-SISQNQADSDKE-NRVSLGAAA 71

Query: 70  AVNSKRISWNRALSIRGRASIAVEACLDHQLRHKEAKKKGKPALPKGKFVQPTNFDKERA 129
           A   KR+SWNR+LSIRGR SIAV AC+D++ + K+ K+KGKP +PK K  QP NF+KERA
Sbjct: 72  AAAPKRVSWNRSLSIRGRVSIAVTACVDNRPQQKQPKRKGKPPVPKRKADQPPNFEKERA 131

Query: 130 YFQEVDAFELLEESPSPKRFGTWAS-SQFDSAALPSLCSKIEKWLISK--KSNYSLAPSS 189
           YFQEVDAFEL EESPSPK FGTW + +Q D+ A+P L S++E WLI+K  K N S APS+
Sbjct: 132 YFQEVDAFELPEESPSPKNFGTWITGNQSDNVAIPHLSSRLESWLITKQLKLNESSAPST 191

Query: 190 TLSKILETPLESREPIGGKHLDKFNLRTPDKSAREIDARLCSIQRRFISSLNDIDALEID 249
            LSK+LE+P    EPI     +  NL TP KS+ + ++ L S+Q        +  +L+ +
Sbjct: 192 ALSKLLESPRMPLEPICADDFNTINLITPVKSSLKTNSNLHSMQNIINLITPEESSLKTN 251

Query: 250 SN-------------RDNRSTGAEDVSTEDR----ADLEVSVKKLSLTSTSTSFHNYNLD 309
           SN             + + S  +  V +  R     D+E++VKKLSL STSTS  N N+D
Sbjct: 252 SNLHSMQKRINAYLLKKSNSRNSNSVLSRLRDEGCKDIELAVKKLSLASTSTSADNDNVD 311

Query: 310 PFNALLAVCGQSAPSTLKDAFSNYCDPETIVKVGEGTYGEAFKAGNTVCKIVPIDGDLQV 369
           PF++LLA CGQS PSTL D  S YC+P  I+KVGEGTYGEA++ G TVCKIVPIDG+L+V
Sbjct: 312 PFSSLLAYCGQSVPSTLLDVISKYCNPNDIIKVGEGTYGEAYRVGTTVCKIVPIDGELRV 371

Query: 370 NGEVQKRSEELLEEVILSRTLNSLRNQEGGADNFCTTFIRTIDLRVCQGSYDALLIKAWE 429
           NGEVQKRSEELLEEV+LSRTLN LR  +G A N CTTFI T+DLRVCQG YD  LI+AW+
Sbjct: 372 NGEVQKRSEELLEEVVLSRTLNHLRGNDGDACNACTTFIETLDLRVCQGPYDNALIRAWD 431

Query: 430 DWDEKHGSENDHPKEFPDKQCYVVFVLQHGGKDLESFVLLNFDEAQSLLVQATAALAVAE 489
            WD+ HGSENDHP+ FP+KQ YVVFVLQHGGKDLE+FVL NFDEA+SLLVQ T+ALAVAE
Sbjct: 432 RWDDAHGSENDHPRGFPEKQRYVVFVLQHGGKDLENFVLSNFDEARSLLVQVTSALAVAE 491

Query: 490 AAYEFEHRDLHWGNVLLSRNDSEALQFTLEGKNMTVQTFGLQISIIDFTLSRINTGEDIL 549
           AA+EFEHRDLHWGN+LLSRNDS  ++F LEGK M V+T+GL ISIIDFTLSRINTGE+IL
Sbjct: 492 AAFEFEHRDLHWGNILLSRNDSATVKFILEGKEMFVRTYGLAISIIDFTLSRINTGENIL 551

Query: 550 FLDLSSDPDLFKGAKGDRQSETYRKMKEVTGDCWEGSFPRTNVLWLLYLVHILLLKKSFE 609
           FLDLSSDP LFKG KGDRQ+ETYRKMKEVT DCWEGSFPRTNVLWLLYLV IL+ KKSFE
Sbjct: 552 FLDLSSDPYLFKGPKGDRQAETYRKMKEVTEDCWEGSFPRTNVLWLLYLVDILIQKKSFE 611

Query: 610 RSSKHERELRAFKKRLDKYNSAKEAIFDPFFNELIVWSS 626
           RSSK+ERELR+ KKRLDKYNSAKEAIFD FF++L+V +S
Sbjct: 612 RSSKNERELRSLKKRLDKYNSAKEAIFDTFFSDLLVDNS 646

BLAST of Cp4.1LG18g00040 vs. TrEMBL
Match: A0A061F3Z8_THECC (Serine/threonine-protein kinase Haspin, putative isoform 2 OS=Theobroma cacao GN=TCM_026831 PE=4 SV=1)

HSP 1 Score: 743.8 bits (1919), Expect = 1.7e-211
Identity = 397/626 (63.42%), Positives = 461/626 (73.64%), Query Frame = 1

Query: 1   MNSKPGGKGIDLWSELIASEGSEGQEEARVEEVYRRRRPS---EKTVHEVPLKQNLGSNL 60
           M+ K     + +WSE+IASE  E Q   RV+ +Y+RRRPS     T  E  L + +  N 
Sbjct: 1   MDRKAASGEVGIWSEIIASESEEHQSRKRVQVIYQRRRPSAHGHTTPQETLLVEEVAPNN 60

Query: 61  RDVNRLSFAAVNSKRISWNRALSIRGRASIAVEACLDHQLRHKEAKKKGKPALPKGKFVQ 120
           R  NRLS AA N KR+SWNR+LS RGR SIAV  C+ +Q + K+AK++GKP +PKGK  +
Sbjct: 61  R--NRLSLAAAN-KRVSWNRSLSTRGRTSIAVAPCVKNQPQQKQAKRRGKPPVPKGKLAE 120

Query: 121 PTNFDKERAYFQEVDAFELLEESPSPKRFGTWA-SSQFDSAALPSLCSKIEKWLISKKSN 180
           P +F+KER YFQEVDAFELLEESPSPK FGTWA  +Q  +  +P + S++EKWL SKK N
Sbjct: 121 PPSFEKEREYFQEVDAFELLEESPSPKNFGTWAMGNQSVTDLVPLVSSRLEKWLFSKKLN 180

Query: 181 YSLAPSSTLSKILETPLESREPIGGKHLDKFNLRTPDKSAREIDARLCSIQRRFISSLND 240
           +S  PSSTLSKILETP    + I    LD   LRTP+KS +                   
Sbjct: 181 FSCGPSSTLSKILETPAAPLDSIYSDDLDSSRLRTPEKSIQ------------------- 240

Query: 241 IDALEIDSNRDNRSTGAEDVSTEDRADLEVSVKKLSLTSTSTSFHNYNLDPFNALLAVCG 300
           I A  +D        G ED++         ++KKLSL ++S       +DPF+ALL +C 
Sbjct: 241 ISASSVDG-------GCEDINA--------AIKKLSLVTSSDLD---GVDPFSALLEICQ 300

Query: 301 QSAPSTLKDAFSNYCDPETIVKVGEGTYGEAFKAGNTVCKIVPIDGDLQVNGEVQKRSEE 360
           Q AP    + FS YCDPE+I KVGEGTYGEAF+AGNTVCKIVP DGD  VNGEVQK+SEE
Sbjct: 301 QLAPLRFFELFSKYCDPESITKVGEGTYGEAFRAGNTVCKIVPFDGDFPVNGEVQKKSEE 360

Query: 361 LLEEVILSRTLNSLRNQEGGADNFCTTFIRTIDLRVCQGSYDALLIKAWEDWDEKHGSEN 420
           LLEE +LS+TLNSLR  E G  N CTTFI TIDL+VCQGSYDA LI+AWE WDEK+ S+N
Sbjct: 361 LLEEAVLSQTLNSLREFENGVFNACTTFIETIDLKVCQGSYDAALIRAWEKWDEKNDSQN 420

Query: 421 DHPKEFPDKQCYVVFVLQHGGKDLESFVLLNFDEAQSLLVQATAALAVAEAAYEFEHRDL 480
           DHPKEFP+KQCYVVFVLQHGGKDLESFVL NFDEA+SLLVQ TAALAVAEAAYEFEHRDL
Sbjct: 421 DHPKEFPEKQCYVVFVLQHGGKDLESFVLKNFDEARSLLVQVTAALAVAEAAYEFEHRDL 480

Query: 481 HWGNVLLSRNDSEALQFTLEGKNMTVQTFGLQISIIDFTLSRINTGEDILFLDLSSDPDL 540
           HWGN+LLSRNDS   +F LEGK M ++TFGL ISIIDFTLSRINTGE ILFLDLS DP L
Sbjct: 481 HWGNILLSRNDSVTSKFILEGKQMFIRTFGLSISIIDFTLSRINTGESILFLDLSMDPYL 540

Query: 541 FKGAKGDRQSETYRKMKEVTGDCWEGSFPRTNVLWLLYLVHILLLKKSFERSSKHERELR 600
           FKG KGD+QSETYRKMKEVT D WEGSFPRTNVLWLLYLV ILLLKK+F RSS +ERELR
Sbjct: 541 FKGPKGDKQSETYRKMKEVTEDYWEGSFPRTNVLWLLYLVDILLLKKTFARSSTNERELR 586

Query: 601 AFKKRLDKYNSAKEAIFDPFFNELIV 623
           + KKRLDK NSA+EAIFDP F +L+V
Sbjct: 601 SLKKRLDKCNSAREAIFDPLFGDLLV 586

BLAST of Cp4.1LG18g00040 vs. TAIR10
Match: AT1G09450.1 (AT1G09450.1 Protein kinase superfamily protein)

HSP 1 Score: 670.6 bits (1729), Expect = 9.3e-193
Identity = 362/622 (58.20%), Positives = 452/622 (72.67%), Query Frame = 1

Query: 7   GKGIDLWSELIASEGSEGQEEARVEEVYRRRRPSEKTVHEVPLKQNLGSNLRDVNRLSFA 66
           G+ +DLWSE+I SE  +G +  ++E V++RR+  +K+   V    N G  ++     S  
Sbjct: 2   GQRVDLWSEVIKSEEEDG-DIPKIEAVFQRRKKPDKSSEAV----NFGWLVKGARTSS-- 61

Query: 67  AVNS-KRISWNRALSIRGRASIAVEACLDHQLRHKEAKKKGKPALPKGKFVQPTNFDKER 126
            VN  KR SW R+LS RGR SIAV A +++Q + K A +K KP +PKGK V+  +F KE+
Sbjct: 62  -VNGPKRDSWARSLSTRGRESIAVRAYVNNQPQKKAAGRK-KPPIPKGKVVKAPDFQKEK 121

Query: 127 AYFQEVDAFELLEESPSPKRFGTWASSQFDSAALPSLCSKIEKWLISKKSNYSLAPSSTL 186
            YF+++DAFELLEESPSP +  TW   +     +P L +++EKWLISKK N++  PSSTL
Sbjct: 122 EYFRDIDAFELLEESPSPNKSSTWTMGEQVVPEMPHLSTRLEKWLISKKLNHTCGPSSTL 181

Query: 187 SKILETPLESREPI-GGKHLDKFNLRTPDKSAREIDARLCSIQRRFISSLNDIDALEIDS 246
           SKILE     +E +      D  +L+TPDKS+    A   S+ R   S            
Sbjct: 182 SKILENSAIHQESVCDNDAFDSLSLKTPDKSS----AGNTSVFRLIPSC----------- 241

Query: 247 NRDNRSTGAEDVSTE----DRADLEVSVKKLSLTSTSTSFHNYNLDPFNALLAVCGQSAP 306
              + +  AEDV       +  DLE  +K+LSLTS     H     P   LL+ CGQ  P
Sbjct: 242 ---DENLAAEDVPVRKIKMESIDLEDELKRLSLTSDLIPTHQDFDQPILDLLSACGQMRP 301

Query: 307 STLKDAFSNYCDPETIVKVGEGTYGEAFKAGNTVCKIVPIDGDLQVNGEVQKRSEELLEE 366
           S   +AFS +C+PE+IVK+GEGTYGEAF+AG++VCKIVPIDGD +VNGEVQKR++ELLEE
Sbjct: 302 SNFIEAFSKFCEPESIVKIGEGTYGEAFRAGSSVCKIVPIDGDFRVNGEVQKRADELLEE 361

Query: 367 VILSRTLNSLRNQEGGADNFCTTFIRTIDLRVCQGSYDALLIKAWEDWDEKHGSENDHPK 426
           VILS TLN LR  E  A N C T+I+T D+++CQG YD +LIKAWE+WD KHGSENDHP 
Sbjct: 362 VILSWTLNQLRECETTAQNLCPTYIKTQDIKLCQGPYDPILIKAWEEWDAKHGSENDHP- 421

Query: 427 EFPDKQCYVVFVLQHGGKDLESFVLLNFDEAQSLLVQATAALAVAEAAYEFEHRDLHWGN 486
           +FP+KQCYV+FVL+HGGKDLESFVLLNFDEA+SLLVQATA LAVAEAA+EFEHRDLHWGN
Sbjct: 422 DFPEKQCYVMFVLEHGGKDLESFVLLNFDEARSLLVQATAGLAVAEAAFEFEHRDLHWGN 481

Query: 487 VLLSRNDSEALQFTLEGKNMTVQTFGLQISIIDFTLSRINTGEDILFLDLSSDPDLFKGA 546
           +LLSRN+S+ L F LEGK + ++TFG+QISIIDFTLSRINTGE ILFLDL+SDP LFKG 
Sbjct: 482 ILLSRNNSDTLPFILEGKQVCIKTFGVQISIIDFTLSRINTGEKILFLDLTSDPYLFKGP 541

Query: 547 KGDRQSETYRKMKEVTGDCWEGSFPRTNVLWLLYLVHILLLKKSFERSSKHERELRAFKK 606
           KGD+QSETYRKMK VT D WEGSF RTNVLWL+YLV ILL KKSFERSSKHERELR+ KK
Sbjct: 542 KGDKQSETYRKMKAVTEDYWEGSFARTNVLWLIYLVDILLTKKSFERSSKHERELRSLKK 595

Query: 607 RLDKYNSAKEAIFDPFFNELIV 623
           R++KY SAKEA+ DPFF+++++
Sbjct: 602 RMEKYESAKEAVSDPFFSDMLM 595

BLAST of Cp4.1LG18g00040 vs. NCBI nr
Match: gi|659099272|ref|XP_008450516.1| (PREDICTED: serine/threonine-protein kinase haspin [Cucumis melo])

HSP 1 Score: 1081.6 bits (2796), Expect = 0.0e+00
Identity = 544/628 (86.62%), Positives = 582/628 (92.68%), Query Frame = 1

Query: 1   MNSKPGGKGIDLWSELIASEGSEGQEEARVEEVYRRRRPSEKTVHEVPLKQNLGSNLRDV 60
           M+S  GGK IDLWSELIASEGS+ QEEA VEEVYRRR+P++KTVH +  KQNLGSN  +V
Sbjct: 1   MSSNLGGKAIDLWSELIASEGSDLQEEAPVEEVYRRRKPTQKTVHPLHPKQNLGSNGCNV 60

Query: 61  NRLSFAAVNSKRISWNRALSIRGRASIAVEACLDHQLRHKEAKKKGKPALPKGKFVQPTN 120
           NR+S AAV+SKRISWNRALSIRGR SIA+EAC+D Q +HK+AK+KGKPALPKGK+VQPTN
Sbjct: 61  NRVSLAAVDSKRISWNRALSIRGRVSIAIEACIDRQRQHKQAKRKGKPALPKGKYVQPTN 120

Query: 121 FDKERAYFQEVDAFELLEESPSPKRFGTWASSQFDSAALPSLCSKIEKWLISKKSNYSLA 180
           FDKERAYFQEVDAFELLEESPSPK F TW SSQFDS+ +PSLCS+IEKWLISKKSNYSLA
Sbjct: 121 FDKERAYFQEVDAFELLEESPSPKSFSTWTSSQFDSSTIPSLCSRIEKWLISKKSNYSLA 180

Query: 181 PSSTLSKILETPLESREPIGGKHLDKFNLRTPDKSAREIDARLCSIQRRFISSLNDIDAL 240
           PSSTLSKILETPL S EPIGG HLDK  L+TP++SAR+IDA  CSIQRRFI S+NDIDAL
Sbjct: 181 PSSTLSKILETPLGSIEPIGGLHLDKLKLKTPEESARDIDAHWCSIQRRFIFSINDIDAL 240

Query: 241 EIDSNRDNRSTGAEDVSTEDRADLEVSVKKLSLTSTSTSFHNYNLDPFNALLAVCGQSAP 300
            IDSN DNRS  AE++ TEDR D+EV+VKKLSLTSTSTSFH Y+LDP NALLAVCGQSAP
Sbjct: 241 GIDSN-DNRSNRAEEIRTEDREDIEVAVKKLSLTSTSTSFHKYDLDPLNALLAVCGQSAP 300

Query: 301 STLKDAFSNYCDPETIVKVGEGTYGEAFKAGNTVCKIVPIDGDLQVNGEVQKRSEELLEE 360
           STLKDAFSNYCD ETIVKVGEGTYGEAFKAGNTVCK+VPIDGDLQVNGE QKRSEELLEE
Sbjct: 301 STLKDAFSNYCDLETIVKVGEGTYGEAFKAGNTVCKVVPIDGDLQVNGETQKRSEELLEE 360

Query: 361 VILSRTLNSLRNQEGGADNFCTTFIRTIDLRVCQGSYDALLIKAWEDWDEKHGSENDHPK 420
           VILSRTLNSLR+ EG ADNFCTTFIRTIDLRVCQGSYDA+L+KAWEDWDEKHGSENDHPK
Sbjct: 361 VILSRTLNSLRSNEGSADNFCTTFIRTIDLRVCQGSYDAVLVKAWEDWDEKHGSENDHPK 420

Query: 421 EFPDKQCYVVFVLQHGGKDLESFVLLNFDEAQSLLVQATAALAVAEAAYEFEHRDLHWGN 480
           EFP+KQ YVVFVLQHGGKDLESFVLLN+DEAQSLLVQ TA LAVAEAAYEFEHRDLHWGN
Sbjct: 421 EFPEKQLYVVFVLQHGGKDLESFVLLNYDEAQSLLVQVTAGLAVAEAAYEFEHRDLHWGN 480

Query: 481 VLLSRNDSEALQFTLEGKNMTVQTFGLQISIIDFTLSRINTGEDILFLDLSSDPDLFKGA 540
           VLLSRND EALQFTLEGKNMTVQTFGLQISIIDFTLSRINTGED+LFLDLSSDP LFKG 
Sbjct: 481 VLLSRNDYEALQFTLEGKNMTVQTFGLQISIIDFTLSRINTGEDVLFLDLSSDPYLFKGP 540

Query: 541 KGDRQSETYRKMKEVTGDCWEGSFPRTNVLWLLYLVHILLLKKSFERSSKHERELRAFKK 600
           +GDRQSETYRKMKEVTGDCWEGSFPRTNVLWLLYLV ILLLKKSFERSSKHERELRAFKK
Sbjct: 541 RGDRQSETYRKMKEVTGDCWEGSFPRTNVLWLLYLVDILLLKKSFERSSKHERELRAFKK 600

Query: 601 RLDKYNSAKEAIFDPFFNELIVWSSSVE 629
           RLDKY SAKEAI+DPFF+ELIVWSSSVE
Sbjct: 601 RLDKYTSAKEAIYDPFFSELIVWSSSVE 627

BLAST of Cp4.1LG18g00040 vs. NCBI nr
Match: gi|449435954|ref|XP_004135759.1| (PREDICTED: serine/threonine-protein kinase haspin isoform X2 [Cucumis sativus])

HSP 1 Score: 1050.4 bits (2715), Expect = 1.2e-303
Identity = 532/628 (84.71%), Positives = 572/628 (91.08%), Query Frame = 1

Query: 1   MNSKPGGKGIDLWSELIASEGSEGQEEARVEEVYRRRRPSEKTVHEVPLKQNLGSNLRDV 60
           M S  GGK IDLWSELIASEGS+ QEEA VEEVYRRR+P++KTVH    KQNLGSN  +V
Sbjct: 1   MTSNLGGKAIDLWSELIASEGSDLQEEASVEEVYRRRKPTQKTVHPPHPKQNLGSNGCNV 60

Query: 61  NRLSFAAVNSKRISWNRALSIRGRASIAVEACLDHQLRHKEAKKKGKPALPKGKFVQPTN 120
           NR+S AAV+SKRISWNRALSIRGR SIAVEAC+D Q + K+AK+KGKPALPKGK+VQPTN
Sbjct: 61  NRVSLAAVDSKRISWNRALSIRGRVSIAVEACIDRQRQCKQAKRKGKPALPKGKYVQPTN 120

Query: 121 FDKERAYFQEVDAFELLEESPSPKRFGTWASSQFDSAALPSLCSKIEKWLISKKSNYSLA 180
           FDKERAYFQEVDAFELLEESPSPK F TW SSQFDS+ +PSLCS+IEKWLISKKS YSLA
Sbjct: 121 FDKERAYFQEVDAFELLEESPSPKSFSTWTSSQFDSSTIPSLCSRIEKWLISKKSKYSLA 180

Query: 181 PSSTLSKILETPLESREPIGGKHLDKFNLRTPDKSAREIDARLCSIQRRFISSLNDIDAL 240
           PSSTLSKILETPL S EPIGG HLDKF L+TP+ SAR+ DA  CSIQRRFI S+NDIDAL
Sbjct: 181 PSSTLSKILETPLGSIEPIGGIHLDKFKLKTPENSARDRDAHWCSIQRRFIFSINDIDAL 240

Query: 241 EIDSNRDNRSTGAEDVSTEDRADLEVSVKKLSLTSTSTSFHNYNLDPFNALLAVCGQSAP 300
           +IDSN DNRS  AE++ TEDR D+EV+VKKLSLTSTSTSFH Y+LDP +ALLAVCGQS P
Sbjct: 241 KIDSN-DNRSNRAEEMRTEDREDIEVAVKKLSLTSTSTSFHKYDLDPLSALLAVCGQSTP 300

Query: 301 STLKDAFSNYCDPETIVKVGEGTYGEAFKAGNTVCKIVPIDGDLQVNGEVQKRSEELLEE 360
           STLKD FSNYC+ ETIVKVGEGTYGEAFK GNTVCK+VPIDGDL+VNGE+QKRS ELLEE
Sbjct: 301 STLKDVFSNYCELETIVKVGEGTYGEAFKVGNTVCKVVPIDGDLKVNGEIQKRSVELLEE 360

Query: 361 VILSRTLNSLRNQEGGADNFCTTFIRTIDLRVCQGSYDALLIKAWEDWDEKHGSENDHPK 420
           VILSRTLNSLR+ E  ADNFCTTFIRTIDLRVCQGSYDA+L+KAWEDWDEKHGSENDHPK
Sbjct: 361 VILSRTLNSLRSNERCADNFCTTFIRTIDLRVCQGSYDAVLVKAWEDWDEKHGSENDHPK 420

Query: 421 EFPDKQCYVVFVLQHGGKDLESFVLLNFDEAQSLLVQATAALAVAEAAYEFEHRDLHWGN 480
           EFP+KQ YVVFVLQHGGKDLESFVLLN+DEAQSLLVQ TAALAVAEAAY+FEHRDLHWGN
Sbjct: 421 EFPEKQLYVVFVLQHGGKDLESFVLLNYDEAQSLLVQVTAALAVAEAAYQFEHRDLHWGN 480

Query: 481 VLLSRNDSEALQFTLEGKNMTVQTFGLQISIIDFTLSRINTGEDILFLDLSSDPDLFKGA 540
           VLLSRND EALQFTLE KNMTV+TFGLQISIIDFTLSRINTGEDILFLDLSSDP LFKG 
Sbjct: 481 VLLSRNDYEALQFTLESKNMTVKTFGLQISIIDFTLSRINTGEDILFLDLSSDPYLFKGP 540

Query: 541 KGDRQSETYRKMKEVTGDCWEGSFPRTNVLWLLYLVHILLLKKSFERSSKHERELRAFKK 600
           +GDRQSETYRKMKEVTGDCWEGSFPRTNVLWLLYLV ILLLKKSFERSSKHERELRAFKK
Sbjct: 541 RGDRQSETYRKMKEVTGDCWEGSFPRTNVLWLLYLVDILLLKKSFERSSKHERELRAFKK 600

Query: 601 RLDKYNSAKEAIFDPFFNELIVWSSSVE 629
           RLDKY S KEAI+D FF+ELIVWSSSVE
Sbjct: 601 RLDKYTSTKEAIYDQFFSELIVWSSSVE 627

BLAST of Cp4.1LG18g00040 vs. NCBI nr
Match: gi|700210889|gb|KGN65985.1| (hypothetical protein Csa_1G560690 [Cucumis sativus])

HSP 1 Score: 1050.4 bits (2715), Expect = 1.2e-303
Identity = 532/628 (84.71%), Positives = 572/628 (91.08%), Query Frame = 1

Query: 1   MNSKPGGKGIDLWSELIASEGSEGQEEARVEEVYRRRRPSEKTVHEVPLKQNLGSNLRDV 60
           M S  GGK IDLWSELIASEGS+ QEEA VEEVYRRR+P++KTVH    KQNLGSN  +V
Sbjct: 93  MTSNLGGKAIDLWSELIASEGSDLQEEASVEEVYRRRKPTQKTVHPPHPKQNLGSNGCNV 152

Query: 61  NRLSFAAVNSKRISWNRALSIRGRASIAVEACLDHQLRHKEAKKKGKPALPKGKFVQPTN 120
           NR+S AAV+SKRISWNRALSIRGR SIAVEAC+D Q + K+AK+KGKPALPKGK+VQPTN
Sbjct: 153 NRVSLAAVDSKRISWNRALSIRGRVSIAVEACIDRQRQCKQAKRKGKPALPKGKYVQPTN 212

Query: 121 FDKERAYFQEVDAFELLEESPSPKRFGTWASSQFDSAALPSLCSKIEKWLISKKSNYSLA 180
           FDKERAYFQEVDAFELLEESPSPK F TW SSQFDS+ +PSLCS+IEKWLISKKS YSLA
Sbjct: 213 FDKERAYFQEVDAFELLEESPSPKSFSTWTSSQFDSSTIPSLCSRIEKWLISKKSKYSLA 272

Query: 181 PSSTLSKILETPLESREPIGGKHLDKFNLRTPDKSAREIDARLCSIQRRFISSLNDIDAL 240
           PSSTLSKILETPL S EPIGG HLDKF L+TP+ SAR+ DA  CSIQRRFI S+NDIDAL
Sbjct: 273 PSSTLSKILETPLGSIEPIGGIHLDKFKLKTPENSARDRDAHWCSIQRRFIFSINDIDAL 332

Query: 241 EIDSNRDNRSTGAEDVSTEDRADLEVSVKKLSLTSTSTSFHNYNLDPFNALLAVCGQSAP 300
           +IDSN DNRS  AE++ TEDR D+EV+VKKLSLTSTSTSFH Y+LDP +ALLAVCGQS P
Sbjct: 333 KIDSN-DNRSNRAEEMRTEDREDIEVAVKKLSLTSTSTSFHKYDLDPLSALLAVCGQSTP 392

Query: 301 STLKDAFSNYCDPETIVKVGEGTYGEAFKAGNTVCKIVPIDGDLQVNGEVQKRSEELLEE 360
           STLKD FSNYC+ ETIVKVGEGTYGEAFK GNTVCK+VPIDGDL+VNGE+QKRS ELLEE
Sbjct: 393 STLKDVFSNYCELETIVKVGEGTYGEAFKVGNTVCKVVPIDGDLKVNGEIQKRSVELLEE 452

Query: 361 VILSRTLNSLRNQEGGADNFCTTFIRTIDLRVCQGSYDALLIKAWEDWDEKHGSENDHPK 420
           VILSRTLNSLR+ E  ADNFCTTFIRTIDLRVCQGSYDA+L+KAWEDWDEKHGSENDHPK
Sbjct: 453 VILSRTLNSLRSNERCADNFCTTFIRTIDLRVCQGSYDAVLVKAWEDWDEKHGSENDHPK 512

Query: 421 EFPDKQCYVVFVLQHGGKDLESFVLLNFDEAQSLLVQATAALAVAEAAYEFEHRDLHWGN 480
           EFP+KQ YVVFVLQHGGKDLESFVLLN+DEAQSLLVQ TAALAVAEAAY+FEHRDLHWGN
Sbjct: 513 EFPEKQLYVVFVLQHGGKDLESFVLLNYDEAQSLLVQVTAALAVAEAAYQFEHRDLHWGN 572

Query: 481 VLLSRNDSEALQFTLEGKNMTVQTFGLQISIIDFTLSRINTGEDILFLDLSSDPDLFKGA 540
           VLLSRND EALQFTLE KNMTV+TFGLQISIIDFTLSRINTGEDILFLDLSSDP LFKG 
Sbjct: 573 VLLSRNDYEALQFTLESKNMTVKTFGLQISIIDFTLSRINTGEDILFLDLSSDPYLFKGP 632

Query: 541 KGDRQSETYRKMKEVTGDCWEGSFPRTNVLWLLYLVHILLLKKSFERSSKHERELRAFKK 600
           +GDRQSETYRKMKEVTGDCWEGSFPRTNVLWLLYLV ILLLKKSFERSSKHERELRAFKK
Sbjct: 633 RGDRQSETYRKMKEVTGDCWEGSFPRTNVLWLLYLVDILLLKKSFERSSKHERELRAFKK 692

Query: 601 RLDKYNSAKEAIFDPFFNELIVWSSSVE 629
           RLDKY S KEAI+D FF+ELIVWSSSVE
Sbjct: 693 RLDKYTSTKEAIYDQFFSELIVWSSSVE 719

BLAST of Cp4.1LG18g00040 vs. NCBI nr
Match: gi|778662151|ref|XP_011659430.1| (PREDICTED: serine/threonine-protein kinase haspin isoform X1 [Cucumis sativus])

HSP 1 Score: 1035.8 bits (2677), Expect = 3.1e-299
Identity = 532/655 (81.22%), Positives = 572/655 (87.33%), Query Frame = 1

Query: 1   MNSKPGGKGIDLWSELIASEGSEGQEEARVEEVYRRRRPSEKTVHEVPLKQNLGSNLRDV 60
           M S  GGK IDLWSELIASEGS+ QEEA VEEVYRRR+P++KTVH    KQNLGSN  +V
Sbjct: 1   MTSNLGGKAIDLWSELIASEGSDLQEEASVEEVYRRRKPTQKTVHPPHPKQNLGSNGCNV 60

Query: 61  NRLSFAAVNSKRISWNRALSIRGRASIAVEACLDHQLRHKEAKKKGKPALPKGKFVQPTN 120
           NR+S AAV+SKRISWNRALSIRGR SIAVEAC+D Q + K+AK+KGKPALPKGK+VQPTN
Sbjct: 61  NRVSLAAVDSKRISWNRALSIRGRVSIAVEACIDRQRQCKQAKRKGKPALPKGKYVQPTN 120

Query: 121 FDKERAYFQEVDAFELLEESPSPKRFGTWASSQFDSAALPSLCSKIEKWLISKKSNYSLA 180
           FDKERAYFQEVDAFELLEESPSPK F TW SSQFDS+ +PSLCS+IEKWLISKKS YSLA
Sbjct: 121 FDKERAYFQEVDAFELLEESPSPKSFSTWTSSQFDSSTIPSLCSRIEKWLISKKSKYSLA 180

Query: 181 PSSTLSKILETPLESREPIGGKHLDKFNLRTPDKSAREIDARLCSIQRRFISSLNDIDAL 240
           PSSTLSKILETPL S EPIGG HLDKF L+TP+ SAR+ DA  CSIQRRFI S+NDIDAL
Sbjct: 181 PSSTLSKILETPLGSIEPIGGIHLDKFKLKTPENSARDRDAHWCSIQRRFIFSINDIDAL 240

Query: 241 EIDSNRDNRSTGAEDVSTEDRADLEVSVKKLSLTSTSTSFHNYNLDPFNALLAVCGQSAP 300
           +IDSN DNRS  AE++ TEDR D+EV+VKKLSLTSTSTSFH Y+LDP +ALLAVCGQS P
Sbjct: 241 KIDSN-DNRSNRAEEMRTEDREDIEVAVKKLSLTSTSTSFHKYDLDPLSALLAVCGQSTP 300

Query: 301 STLKDAFSNYCDPETIVKVGEGTYGEAFKAGNTVCKIVPIDGDLQVNGEVQKRSEELLEE 360
           STLKD FSNYC+ ETIVKVGEGTYGEAFK GNTVCK+VPIDGDL+VNGE+QKRS ELLEE
Sbjct: 301 STLKDVFSNYCELETIVKVGEGTYGEAFKVGNTVCKVVPIDGDLKVNGEIQKRSVELLEE 360

Query: 361 VILSRTLNSLRNQEGGADNFCTTFIRTIDLRVCQGSYDALLIKAWEDWDEKHGSENDHPK 420
           VILSRTLNSLR+ E  ADNFCTTFIRTIDLRVCQGSYDA+L+KAWEDWDEKHGSENDHPK
Sbjct: 361 VILSRTLNSLRSNERCADNFCTTFIRTIDLRVCQGSYDAVLVKAWEDWDEKHGSENDHPK 420

Query: 421 EFPDKQCYVVFVLQHGGKDLESFVLLNFDEAQSLLVQATAALAVAEAAYEFEHRDLHW-- 480
           EFP+KQ YVVFVLQHGGKDLESFVLLN+DEAQSLLVQ TAALAVAEAAY+FEHRDLHW  
Sbjct: 421 EFPEKQLYVVFVLQHGGKDLESFVLLNYDEAQSLLVQVTAALAVAEAAYQFEHRDLHWSD 480

Query: 481 -------------------------GNVLLSRNDSEALQFTLEGKNMTVQTFGLQISIID 540
                                    GNVLLSRND EALQFTLE KNMTV+TFGLQISIID
Sbjct: 481 YSSIYSYFSRFFSLATDLHHVFIARGNVLLSRNDYEALQFTLESKNMTVKTFGLQISIID 540

Query: 541 FTLSRINTGEDILFLDLSSDPDLFKGAKGDRQSETYRKMKEVTGDCWEGSFPRTNVLWLL 600
           FTLSRINTGEDILFLDLSSDP LFKG +GDRQSETYRKMKEVTGDCWEGSFPRTNVLWLL
Sbjct: 541 FTLSRINTGEDILFLDLSSDPYLFKGPRGDRQSETYRKMKEVTGDCWEGSFPRTNVLWLL 600

Query: 601 YLVHILLLKKSFERSSKHERELRAFKKRLDKYNSAKEAIFDPFFNELIVWSSSVE 629
           YLV ILLLKKSFERSSKHERELRAFKKRLDKY S KEAI+D FF+ELIVWSSSVE
Sbjct: 601 YLVDILLLKKSFERSSKHERELRAFKKRLDKYTSTKEAIYDQFFSELIVWSSSVE 654

BLAST of Cp4.1LG18g00040 vs. NCBI nr
Match: gi|1009116549|ref|XP_015874832.1| (PREDICTED: serine/threonine-protein kinase haspin [Ziziphus jujuba])

HSP 1 Score: 804.3 bits (2076), Expect = 1.5e-229
Identity = 420/623 (67.42%), Positives = 487/623 (78.17%), Query Frame = 1

Query: 1   MNSKPGGKGIDLWSELIASEGSEGQEEARVEEVYRRRRPSEKTVHEVPLKQNLGSNLRDV 60
           M SK GG  IDLWSE++ ++  + ++ +  E VY RR+P++     VP KQ       + 
Sbjct: 1   MGSKTGGNCIDLWSEIVGTDQQQ-EQPSHGEAVYGRRKPNKTPKDVVPPKQLGSYGATNH 60

Query: 61  NRLSFAAVN-SKRISWNRALSIRGRASIAVEACLDHQLRHKEAKKKGKPALPKGKFVQPT 120
           +R+SFAA   +KRISWNR+LS RGR SIAV AC+DHQ + K+AK+KGKPALPKGK VQP+
Sbjct: 61  HRVSFAAAAPNKRISWNRSLSTRGRTSIAVGACIDHQPQLKQAKRKGKPALPKGKIVQPS 120

Query: 121 NFDKERAYFQEVDAFELLEESPSPKRFGTWASSQFDSAALPSLCSKIEKWLISKKSNYSL 180
           NFD ERAYF EVDAFELLEESPSPK++GTWA       A+P L S++EKWLISKK N   
Sbjct: 121 NFDLERAYFDEVDAFELLEESPSPKKYGTWAIGNQTDVAIPHLSSRLEKWLISKKLN-PY 180

Query: 181 APSSTLSKILETPLESREPIGGKHLDKFNLRTPDKSAREIDARLCSIQRRFISSLNDIDA 240
            PSSTLSKIL TP +  E I  +  D  NL T ++S+ ++ ++L S + R  S+L D + 
Sbjct: 181 GPSSTLSKILGTPAKGPETIDEEDFDFLNLNTTERSSFKVSSQLHSFESRLKSNLTDRNV 240

Query: 241 LEIDSNRDNRSTGAEDVSTEDRADLEVSVKKLSLTSTSTSFHNYNLDPFNALLAVCGQSA 300
           LE D  + ++    + V  E   D+E +VKKLSL STSTS    +LDPF ALLAVCGQS 
Sbjct: 241 LERDIIKSHK-IDTKGVGNESCEDIEFAVKKLSLASTSTSSETDHLDPFAALLAVCGQST 300

Query: 301 PSTLKDAFSNYCDPETIVKVGEGTYGEAFKAGNTVCKIVPIDGDLQVNGEVQKRSEELLE 360
           PS  +D FS YCD + IVK+GEGT+GEAFKAG  VCKIVPIDGDL+VNGEVQK+S ELLE
Sbjct: 301 PSKFQDVFSKYCDLQEIVKIGEGTFGEAFKAGTYVCKIVPIDGDLRVNGEVQKKSAELLE 360

Query: 361 EVILSRTLNSLRNQEGGADNFCTTFIRTIDLRVCQGSYDALLIKAWEDWDEKHGSENDHP 420
           EV+LSRTLN LR QE    N CTTFI T+DLRVCQ SYD  LIKAWE+WDEKHGSENDHP
Sbjct: 361 EVVLSRTLNCLRRQEDDVRNACTTFIETVDLRVCQCSYDPSLIKAWENWDEKHGSENDHP 420

Query: 421 KEFPDKQCYVVFVLQHGGKDLESFVLLNFDEAQSLLVQATAALAVAEAAYEFEHRDLHWG 480
           KEFP+KQ YVVFVL+HGGKDLE FVLLNFDE ++LL Q TAALAVAEAAYEFEHRDLHWG
Sbjct: 421 KEFPEKQSYVVFVLEHGGKDLEGFVLLNFDEGRTLLAQVTAALAVAEAAYEFEHRDLHWG 480

Query: 481 NVLLSRNDSEALQFTLEGKNMTVQTFGLQISIIDFTLSRINTGEDILFLDLSSDPDLFKG 540
           N+LLSRNDS  LQFTLEGK   V+TFGLQISIIDFTLSRINTGEDILFLDLSSDP LFKG
Sbjct: 481 NILLSRNDSVTLQFTLEGKKFFVKTFGLQISIIDFTLSRINTGEDILFLDLSSDPYLFKG 540

Query: 541 AKGDRQSETYRKMKEVTGDCWEGSFPRTNVLWLLYLVHILLLKKSFERSSKHERELRAFK 600
            KGD+QSETYRKMKEVT DCWEGSFPRTNVLWLLYLV ILLLKKSFER+SK+ER++R+ K
Sbjct: 541 PKGDKQSETYRKMKEVTEDCWEGSFPRTNVLWLLYLVDILLLKKSFERTSKNERDMRSLK 600

Query: 601 KRLDKYNSAKEAIFDPFFNELIV 623
           KRLDKY SA+EAI DPFF EL V
Sbjct: 601 KRLDKYRSAREAILDPFFAELFV 620
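
The percentages in each HSP header follow directly from the counts beside them: matching positions divided by the aligned length. The sketch below is not part of the database output; it is a minimal illustration, assuming the header layout shown in the alignments above, of how one such line could be parsed and its percentages recomputed. The function name `parse_hsp_stats` and the dictionary keys are hypothetical.

```python
import re

# Matches the statistics line of an HSP header as printed above.
# "Posi?tives" accepts either spelling of the positives label.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP statistics line and recompute its percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(length),
        # Recomputed from the counts, rounded to two decimals as on the page.
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positives_pct": round(100 * int(pos) / int(pos_length), 2),
        # Values as printed on the page, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    # Statistics line of the Ziziphus jujuba hit above.
    sample = "Identity = 420/623 (67.42%), Positives = 487/623 (78.17%), Query Frame = 1"
    print(parse_hsp_stats(sample))
```

Comparing the recomputed and reported values is a quick consistency check when these records are scraped in bulk.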

The following BLAST results are available for this feature:
Match Name                          E-value   Identity  Description
HASP_BOVIN                          2.2e-55   40.68     Serine/threonine-protein kinase haspin OS=Bos taurus GN=GSG2 PE=2 SV=1
HASP_MOUSE                          5.5e-54   35.32     Serine/threonine-protein kinase haspin OS=Mus musculus GN=Gsg2 PE=1 SV=3
HASP_HUMAN                          7.2e-54   41.04     Serine/threonine-protein kinase haspin OS=Homo sapiens GN=GSG2 PE=1 SV=3
HASP_DROME                          9.7e-51   35.69     Putative serine/threonine-protein kinase haspin homolog OS=Drosophila melanogast...
HASP_SCHPO                          4.1e-41   34.06     Serine/threonine-protein kinase haspin homolog hrk1 OS=Schizosaccharomyces pombe...

Match Name                          E-value   Identity  Description
A0A0A0LVR4_CUCSA                    8.5e-304  84.71     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G560690 PE=4 SV=1
A0A067JHK6_JATCU                    1.7e-227  66.08     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23175 PE=4 SV=1
F6I4H2_VITVI                        1.1e-226  65.86     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_14s0060g00450 PE=4 SV=...
B9RFQ3_RICCO                        2.4e-213  62.60     Serine/threonine-protein kinase Haspin, putative OS=Ricinus communis GN=RCOM_143...
A0A061F3Z8_THECC                    1.7e-211  63.42     Serine/threonine-protein kinase Haspin, putative isoform 2 OS=Theobroma cacao GN...

Match Name                          E-value   Identity  Description
AT1G09450.1                         9.3e-193  58.20     Protein kinase superfamily protein

Match Name                          E-value   Identity  Description
gi|659099272|ref|XP_008450516.1|    0.0e+00   86.62     PREDICTED: serine/threonine-protein kinase haspin [Cucumis melo]
gi|449435954|ref|XP_004135759.1|    1.2e-303  84.71     PREDICTED: serine/threonine-protein kinase haspin isoform X2 [Cucumis sativus]
gi|700210889|gb|KGN65985.1|         1.2e-303  84.71     hypothetical protein Csa_1G560690 [Cucumis sativus]
gi|778662151|ref|XP_011659430.1|    3.1e-299  81.22     PREDICTED: serine/threonine-protein kinase haspin isoform X1 [Cucumis sativus]
gi|1009116549|ref|XP_015874832.1|   1.5e-229  67.42     PREDICTED: serine/threonine-protein kinase haspin [Ziziphus jujuba]

The following terms have been associated with this gene:
Vocabulary: Biological Process
Term        Definition
GO:0006468  protein phosphorylation
Vocabulary: Molecular Function
Term        Definition
GO:0005524  ATP binding
GO:0004672  protein kinase activity
Vocabulary: INTERPRO
Term        Definition
IPR024604   Serine/threonine-protein kinase haspin, C-terminal
IPR011009   Kinase-like_dom_sf
IPR000719   Prot_kinase_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0035405 histone-threonine phosphorylation
biological_process GO:0009069 serine family amino acid metabolic process
biological_process GO:0006468 protein phosphorylation
cellular_component GO:0044424 intracellular part
molecular_function GO:0005524 ATP binding
molecular_function GO:0035184 histone threonine kinase activity
molecular_function GO:0004672 protein kinase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG18g00040.1  Cp4.1LG18g00040.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                                     Source   Source Term        Source Description                        Alignment
IPR000719  Protein kinase domain                               PROFILE  PS50011            PROTEIN_KINASE_DOM                        coord: 313..628, score: 9
IPR011009  Protein kinase-like domain                          unknown  SSF56112           Protein kinase-like (PK-like)             coord: 302..399, score: 1.27E-7; coord: 426..618, score: 1.2
IPR024604  Serine/threonine-protein kinase haspin, C-terminal  PFAM     PF12330            DUF3635                                   coord: 536..603, score: 9.9
IPR024604  Serine/threonine-protein kinase haspin, C-terminal  SMART    SM01331            DUF3635_2                                 coord: 534..617, score: 1.3
None       No IPR available                                    unknown  Coil               Coil                                      coord: 588..608, scor
None       No IPR available                                    GENE3D   G3DSA:1.10.510.10                                            coord: 444..522, score: 1.
None       No IPR available                                    PANTHER  PTHR24419          INTERLEUKIN-1 RECEPTOR-ASSOCIATED KINASE  coord: 10..607, score: 4.7E
None       No IPR available                                    PANTHER  PTHR24419:SF18     SERINE/THREONINE-PROTEIN KINASE HASPIN    coord: 10..607, score: 4.7E

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene             Organism                    Block
Cp4.1LG18g00040  Cucumber (Gy14) v1          cgycpeB0050
Cp4.1LG18g00040  Cucurbita moschata (Rifu)   cmocpeB056
Cp4.1LG18g00040  Wild cucumber (PI 183967)   cpecpiB354
Cp4.1LG18g00040  Cucumber (Chinese Long) v2  cpecuB353
Cp4.1LG18g00040  Cucumber (Gy14) v2          cgybcpeB058
Cp4.1LG18g00040  Cucumber (Chinese Long) v3  cpecucB0432