CmaCh09G011940 (gene) Cucurbita maxima (Rimu)

Name: CmaCh09G011940
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Choline/ethanolamine kinase family protein
Location: Cma_Chr09 : 7684006 .. 7696183 (+)
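
As a quick check on the Location field, the genomic span of the feature can be computed from its two coordinates. The sketch below assumes the coordinates are 1-based and inclusive (the GFF3 convention); the record itself does not state this, and the helper name `span_length` is only illustrative.

```python
# Coordinates copied from the Location line of this record.
start, end, strand = 7684006, 7696183, "+"


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


print(span_length(start, end))  # 12178
```

Under that assumption the gene, introns included, spans 12,178 bp on the plus strand of Cma_Chr09.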
The following sequences are available for this feature:

Gene sequence (with intron)

AAAAAAAAAAAAAAAAAAAAAAAAGATTGAGAAATCATGGGGGCTGAGAAGATCTACAATGGCTCCGTGGATGTGGAAGAATCTGTAGGAAATGGAGATGGAAATACCGAGTCGTATCAGTTATCGAATCTTTCCATCGACCATTGTCTTCCTCTTCCCGCCATGATCCCTTGCATAATGTTCGTATTGGTTCCTCTTTCATTTGCTTTCTTGTTTCCACATACGGATTGTTTATGGCGTCTATTTCTGTTGTGTTTTGGATGAACTGGAAGCAGGACGAACAGTGGTTCTTGTTTTTGTTTTTTTTTTTGAATTTGATGATGTCATCTCCCAGCTTCTCGGAGTTGATTCTGTAAGTTTGTTTGAATGTTTTTGTTCTGAATCGGTTTCTTAATTGCAGCGAGCTGTGTAAGGATCTGTTTAAGCAGTGGTCGGAGCTGGATGATTCTCGGTTCTCTGTTGAAACGGTCTCCGGTGGGATCACGAATCAGCGTGAGTGTTCGCTTACACAATGTATTTCTTCGTTCAATTAGATTACGGCAAGACTGTGTTCGAGGTTGTGGCTACTGTTTCTTCGTTGCTTGTGGATTTCTGTTATGTGAAAGGATTTTTTGTGGATTTCTGCAGTGCTTAAGGTTACAGTGGAGGAGGAAAATGGTAGTAACGTTTCCACCACCGTCAGACTATATGGACCTAACACGGATTATGTTATCAATCGTGATCGGGAACTTCAGGTAATTTGCGATGCTTCTTGATTGATTATTGCGTGTGAAAATATCCGTTTATTGTGTATGCTTCGTGTAGTGTTTGGTTTTAGTCGTTGTTCTTCACGTCGGCAACACCTTTTCCCTCTGTCTGTCATGATGTATCATCTTTTTGCATAGTCAACCCTAAAGCAATCTAGCCACGCACTCTGTGAATCGAAAGCTAACTGTTCTTAAGGTGATATATAGGTTTTGTTCTTAATGTGATATGGAAACTCTAGCTAGTTGAATCTAAGAACCCTCGTAAGCGCCTGAATTTGCAACTCCTAGTTCACAAATGCTAAAACTCCCCCGTTAATTTGATTTCACCCCAAGGAAAAAGTAAAACTCGAATTATCTTGTTGATAAAATATTTCAAGGCAAGAACACTTTTTGAGATTCAAATCACTCCACAAGCAAGATTGATCATATCAAGCTTGAATGATTCTAAACATGCAATCTAAACTACTTCTTTGAGATTCAAATCACTTCACAAGCAAGATTGATCATGTCAAGTTTGAATGATTCTAAACATGTAATCTAAACTACATAGAATTGCATAGAAAATTAGTCCCTGACTAAAGGAAAGTACAAATCCTCTTTTTACTGCATTTTCCAAGTCTCCTTTGGAAATATAACATACATGGTTTTATATAGCCTCAAAATGAAAACACTTGACCTTCCACGAGACATTTCAAGAGTTATAACCTTCATACTTTATGACCATAATTAGCCACTATTTAATCAACAGTAAACTAAAATTCAGCTATGTTTGAAAATATTCTCATGCTGTACATTTGGTGTTCGTTCATGTGTACATGACAATCAAAAACTGTACCTTTGACAGAAACCTATTGTGCTCAAACCTGATCTATATTGGTTCAGAATTCAAATATATATATCTAATTTTTTTTAACCATTAGCAATTGAGTACTGAATCAACATTTACGAAGTGGTTATTATTATTATTGTTTTGTGAATGATCTGATCGGCAAAATTGATTTTGGTGTTCTGGGTGGTCTATATACAACTCTTAAGGTAGGTAGTATTCTGTTTGTTTTTGTTTTATGCGTGCATACAAGTATTGAAGACTTCTTCTCTGTAATCAATTCATATTTGCTAAAGAGAGTGCCAGTTGTTTTTCTCTCTGTAATTGAGTAAGTTATCTGCTTTGTGTGGGAAATTCTGTTCCTGACAGAGAGGCGCGAATTGCCTGGGGTTGGATCTTGTGACTTTCAAGATTTTGACACTTGATTG
GACTTGTTGCTCAGAGGGTGTAGTGTAGGCCTGAATCTCTTAAATCATGATTTTTTTTTCCTCATCCACAATTTCCTTGTCTTTTGTTCCATTTTACTTTCTTAATGATTTTCCTACGACTATTATGGTCTTACATCCCTTACTTGCATGGTCTTGAACCAAGAGGATTCAAGAAAGTTCGTGTGGATTTCATGAAATCTAGTGTGCAATCTATATAATACAACCATGTATTTTTTATTCTTATCTTGAATTTGATTGTCCAAATAATGCAACTATTTTTTTTATTTTTCACTTGGCTTCACTCTGTAATCTCATGAAATCTTGTATTAGGCAATCAAATATCTTTCAGCTGCAGGATTTGGTGCCAAGCTTCTTGGAGTTTTTAAAAATGGCATGGTGCAGTCATTTATTCATGCACGTACGCTAGAACCATCAGGTAAGAAGGCATTGAGAATCTGTCTGGTGTAGACCTTATTTACACACAATCCAGTTCAAACATTCAATATAGAAGATTGTTGATCATTATGTAGCATTTCATCTAAGAGTGGCCACGTTGAAATCTTTGTAGTTAAGATCTATTAGAGACAGGAAATGAAATATTAATTCAATACTAATATGTACATTTTAATAGTTAGATCTTGGTCTTAAATTTGTTTTTCACGTCCAGCAATATGTCGGCTGGGTTTTGAACATTTCACCTGAGAATCTTAATTTTTTCAGTAGTAGTAATTCAAATAGCCCTTGATTGGTTTAGAACCAGTTTATTCTTTAAGTACTTCATGCTCACAGCACCCTGCGCCTTCTCCCCCCCACCCCCACCTCTCCTTCTTCCCTTCAACCCCGGTTTTTCTTCATCAGTTTCTGCATTTAAGTTAGGTGTTCATCCTTTGATCTTGTATTTTCTTAGCTACTGGATCTTCAACCGATTAATCAGTTTATGCATCCAAGTTTAAATAGTTTTATCTGACTTCGGCTTCCAAGTTGAGCATCAGGTCTTACTTTTTTTTTTAAGTTCTGTTTCTGGTTTACCCTTGTTTTGAGTGAAAGATTCAAGGAGCTATTGTGTAAACAATAAGTATTTATGCTGGTTAGAGAATGACAGTTGAAGACATGGTCAACAAGCTGGTAATCCCCGGGGATGAATTTGGTACAAATCGCTTATGAATAAGAAGATCTTCTCTAGGGTGGAGCACTGAGTGTGCAGTACAGCGATCATCAGGAGCTTGGAAGAAAGCTTAGTCAAGTCTTTGTGGAAATTTCTAAAAAAGGTTGGCTCATTTTATGGGGTATGTTGAAAGACTTCCTAAAGAAGTATGAGAATAATTTCAGAAATCACAGAATCTGCCTAAGCTGCCCATGGTTAAGGAGTCTGTCAACTCCAGTCTCCAGAAGTTATGTGGAGATGGTGAACGGAAATCTGAGGTCCAATGTCCAATTGAAAAATCTTAATAGTTTTAAAATCTTCTCAACCAATTTGGCTAAGAAATGATCATGAAGTGGTGAACATGAGTTTTACTGAATTATTTGTTATTTCTTGAGGTCACTGTTTTGATCAAGCCTTTTATGATCGATAAAGCATTGCTCAACATGGATGAAGGAAGATTTCTAAAAGATTAAGGATCAAATGGAAAAAGAAACTCTTAGGAAATTTCATTTGAAGATTGAAAAGTGTTCAAGAGAAGAACAATTTCACTCAGATCTTGTTGAAGAGAATGGAGGTTGGAAATCAATTAAAATTTACCTTTATCAATTGGAAGAAAGGCACTTTTGAAGCAGTACGATAACACGTTGGAGGTCTTCTTATATCTCTGAAGCTTATATGAAAGTAGGAATTTTGATCCTTCCTTTGATCCTCTATTTTTTTATAGGAAGTTGACTGAAAGTGACTTTTCGAATTCCTTGGATTTTTTAAGGGTTAAGTCAGCAAGTAAAGGTGAAAAGGCCCTACATATGAAAGAATTGATGAGTTTCCGGCCCTATTAAATTCGGAAGGTTAGAGA
TTATGCATCAACGACTTATGATTGGTTGATGTGTTTAAATTACCAAGGGCTGCTTGATGACCAGTCATTACTTATAAGGAGATTCCTTCATCTACAGTACGTCTACGATTAATGATCTGCCAATTGAAGAATGTGGTATTAATGTAACGTCTGTTGATGTAAATGCAATAATTAAAGAAAAGTCTCTTCACGATCCTTCAGTGGCTTTCAACCCCTTATTCCTGATTCTTGAGGAAGTAAGAAACAGTTTTAGTGAGGCATATCTACTCTATTATACACGGTAAAAGGTAATTGACCCCTCAGTCCCTACTTCAATAATGAATGATTTTAATGCAGATTTTTACATTTTTACATTTGCCTTATACCAAAAAGGATGGATGCATGAGAAGTCAGGGATTATATACCTATTAGCCTAATCTCTTACCTTTACAAGATCATTGCTAGAGTCCTTTTGGAACAACTCAAAAAAGTCCTCTCCACCATTACTAATTACCGATCATCTTTTGTGGGGGGTAGACAAATTGTTGATGCCTGATTGATTGCCAATGAAGTGGCGGAGATATGTTTCAAGAAGAAGAATAAGGGCATTGTGGTTTAAATTGATATTGAAAATGCTTTTGACAAAGTGGATTGAAATCTTCTTGATTTAATCCTCAATACCAAGGGTTTTGGTACAACAGGGAGGAAGTGGATCAAAGGGTGTATATCATCTCCAAACTTCTCCATCATTATCAATGGTTACCCAAGAGGAAAGATATATGCCTCAAGAACCTTCAACATGGTGACCCTCTCTCACGCTTCCTCTTTATTATTGTCATGGATTGTTTTCGTAGAAGGTTGATCGAAACATGCTTTATTAGTGACATTAAATTGGTAAAGATTCTCTTCAGATGAATCACCTCCTATTTGCCAATGACACTATTATTTTTTCATCTAGAATATGGCAATTGACAACCTTCTTGGCCTTGTTGACTATTTTGAAAAGGCATCCAGACTGAATATAGATCATAAAAATCTGAGATTACTGGGGTAAATACAGAGCAAGATGAGGTTGAATTAATAGCCAACAAATTTGATTGTAGACAAGGACATTGGCTGAATTTATACCTAGGCCTCCCTTTATATAGAAAGCCAAGAACTAGGTCCTCTTGGACCCCAGTTGTTGAGAATATTGAAAGAAGGCTCTACTCATGAAATCACTCACACATATCCAAGGGAGTCATCTTACACTTTACAAGCCGCACTCTCGAATCTTCCACTTATTACTTGTCTCTCTCCGAGACCCCATTGAAAGTTTCTTCTATGGTTGAAAAACTCCATAGGTTCCTATGTAAAGGAAGCACATCCCATTCCGGAGTTCATCTTGTTAGATGAGATAAAGTCCCTAGTGGAGACCTCGAGCTTCAAAATTTGGCAGAAAAGAACAGACCTTTATTAGCCAAGTGGATATGAAGATACCTTTACGAAGAAGATGGTTTAGGGGACCATGGAGATCTATATAATTTCCATTAAACCTCGTCTGGGAAAGATTGAAACAAAAAGTGGGGAATGGAGATAAGCTAACACTTCATTTTGGCATGATCGTCAGTTCCCATGTCTTTTCTCACTGGCTTTTCTTAAAAAGATATATCCAGTCAAGACACATGGAATGAGAAACATGGTGGGTGGAATATGAATTTGAGGAGAAATCTCTCTGATGACGAAACTGAGGAATGGATTAATTTTATGGACCTATTGAATAGGGTTCATTTAACACACTACAGGGATTAATGCGCATGAATTTGGAATAACAAGGGGTTTTACTCCACTAAATCTCTACTGATCGATCTTGACACCTGTAGGACAGCACTACACCCTCTTCAAGAGGCGATATAGACAGATTTTTATCCAAAGAAGGTTAATTTTCCCTTTGGGAACTTGGCCACGATGGCATCAAGACATTTGAGAGAATGCAGCTCCATTTGCCACACATGGTTTTATCTCCACATTGCTGCACATCT
GTAAAACTACTCATGAAACTCAGCCATTTATTCATTACTTGTCATTGCACTCCCAGATTTTGGTTGGAATTGATAATTACTCTCAGATAATATACAACCTCATACCATTGTGAAGATAGATGGAGGTCTTTTGTCTTTAATTGGAATCAGAGTCATGGTCTCAACCTAGCTGTGTTGGTGAGATCCTCAATGATTTTGAAGAGGAGATTAAGAGTGCGACAGTGATACACACCTAGTATTATCAAACAATGTGATCTGTAGTCATGCTCTGTCTAATTTTGCTCTGGACCTTGATAAGAGCAAACTCTTGTAGGCAATTGAGAGTGGATGGATGAAAGGAGGCACCTTGTTTGAAGGGAGGATTGTTAGGTGCAAACGAAGTCCCACATTGACTAGGAGGTATGAGGTCTTTCGGTGGTTTCAAAAACAAAACCATGAAGGTTTATGTCCAAAGTTGACAATATCATACGAAAACCATGAAGGTTTGTGGACAATATCATACCAGTGGAGGTCATTTTTTTTTTCTTTTATCACCTTCCATGCTGTCCTTCAAGCTGTTCCCTAATAAGAAGCATCCAAAAAACTCTTCCATCATAAAACGGGTGGTCATTTCCACTTTGGACATAAATCCTCATGACTTTACGTTTGGAACTACCCAAAAAGTTTCATACTAATAGAGATGATTGTGGTCAGTCACATCATAGATTTGCCCCTTCCCTAGCCAATGTTGGAGTTTGTTTGCACCTAACACAAGGAATGTAGCAACCATCTCTTTCAAGGTAAGAAAAACACTTTTTTTTTTCTTTTTTTTTTTTTTTTTTTTTTTTTTTAATGAGAGACTTGGAAAATACTCTTTGTTAAGTCACCGTTAAACTCACACAATTACAATCTTAAACTAATAGGATGCATTATATTTAATTATATCAATACTTCAACACTGCTCTTTACTTGTGGGCATGAAAATTTGTAAAAAGCCAAACAAATGGAATAAATATTAATTCGAGAGAAAATGATATTACAAGGGTTACTGGAGTTTAAACACAAGGCCTCTTACTTTTCTGCCAATTGATAAGGACAAAAGACCCCTCTCTACATATCTATACAATGGTATGATATTGTTCATTTTGAGTATAAACTCTAGTGACTTTGTTTTTAGAGCTATCCAAAAGACCTCATACCAATGGAGATAATAATCTTCTTTCGTGTTTCCTTCCTTCATCCAAAATTTTCTTTCCTTAAGCCTACTATTCTATCTAGTGTTCGATTAATCTAGTTCTTTTTAGATTTCGTTCGTGGTTATTTTTTGTTTTTACATTATTAAGTTGATCCCAAATTATGGAGGTTTGAAGCCGTTGTATATGTGATAATTGTTTTTGCATTTGGTATGAAATTGATATGTTCTATATATTGAAATGTGAAGTACAACTAGGTAATTCCCCTATCCCTCAGATTTGCGATGGGGTTGAAGACGCTTTAGTTGAATTATAGCAGCTTCCAAAGAAAGCTTATTTCTTTACACAAGATATGATGAGGTTGACACCTCCCGTCTCTCCAAATTCAACTCTGTCAATGGTTAGCAAGTTGTCTATTTTGTGTGGCATCATTTTGGAGGACGAAAGTTGATCAATGTGCCTGCCATTATTTGGACAATTTCCCTGGATGGAATTGTTTGCGGGAAATACTAAAGAATTTCTTTAGTCTTTTGAGGCTTGCCATTATTTGGACAATTTCCCTTTTAAGATGTCTCCTAGAATAGATCCAATTTCTCTTTATGAGATAAATGAACCGTCCAACAGGTGAAGTTATGTGGATGTTGCTACATCTACACAATAGTATTATACTGTCTGCTTTGGGCTTAAACCCTCGTGACGTTACTTTTGGAACCACCCAAAGTGCTTCATATTAATGGAGATAATTATCCTTACTTATGTACCATAGAGCTCTCCTTTCCTTGGTCAATATGAGAATTTGTTTGCTCTAAACACTCCTTCCCTCAAACA
AAGTTTCACACTTGAGCATCCCTTCAATGTCTACTACTAACAACTAGAGCTTACTCTTCACCTCGTGGCAGTAACTAGATCATCCCAAGGCAACATCCAAATAGCACAAGCACCGAGTCAGTTTGTTTTGACGTCAATTTCTAGTCTCTATCTCATTCAAGGCTTCAAACCTTCTAAGCTGTGAGTTTGGCTCTCTATTGAAAGTCTTTTTGAATCCTCAATCAGCACTAAGTTTGTTTTAACGTTAGTTTCAAGTCTCCGCATTCAAGGCTCCAAAGCTTCTAAGCTGTAAGATTCTTCTTTTTCGGAAGGTTTAAGGAAGAAAAGAAGAAGAAACCGAATGCTCCTATTTATTCTTTCATTATTGATATGTTCTCTAAAAGCCCCCTTGAATCCATTTGATTTCTTTTTTACCGAGGCCCTTTGTTGGAAGTTTTCTCCTTCTCTTAGCAGTTCCTTTTGCATCATTTATGAAGTTTATCTGTTGGAGGGATCCTGTTGTCGGCTGCTTTGCATTGCCCCAGCTGCTCTCATAATTTCCTACGGTGTTCTTTATTTTAGCTTGTATTTGTTTTCTCTTTACCAGACATACTTCCTATTACCTTTTAGAGCTCCTATATTTGAGCATTAGTCTCTTTTCATGTGTTCAATGAAAAGTTTCGTTTCTTGTTTTTTAAAAAATAGCTCTTGATTGGGCTGAATTGGGCTGAAGGACATTGTTTGGAGAATGATAGCCTGTCTTAGGAGATGTGTGGTTTCATCAGACTTGAAGGTAATGTTGTGTAGGATACTTTAAGATGCATTTTTTTCGTTTACAATGAATATTGGTTTGTTTTGGAGGTTGATTAGAAAAGACCAATTGGCTTAGCCATTATTACATGGACTAAACCGAGCTATATCCTGATTCAGTGTGGACATCAATTTGGGATCTTTAACCCTTAATTTGTCATCCATCTTTTTTACTTTCAATAGAGTTGCTTGTGTGTTTATCTTGGAACTGCTTCCCCCTTCAGTCTCTTACGTTTTCATGTGTCATGTCCTATTGTTTTTTTTTTTTTCTCTATGGGAGAATCTTTTGAATCCTATGATTAATAACAAATATTTTGTTAATTTTGCTTGGTTTTGCAGATATAAGAAAGCCAGAGCTAGCTGCAGAAATTGCTAAACAGCTTAATAAATTCCACAAAGTGTATATTCCAGGTTCTACAGAACCTCAGTTATGGATCGAAATTTTTAAATGTTACGGGAAAGGTTATTTTTCTTCAACCAATAACAAGCTTCTTCTGGTCAGAGGACCCCTCATGCAGTTGAATATTCTATTTTCTTTTGAACTGGTTTTCACGTTTCTCATTGTTGACTTTCAGCCTCTGCACTGCAATTTGATGATACCGAGAAGCAATGTATATACGATTCAATTTCTTTTACGGAAATTCACAATGAAGTACTTGAGATTAAGGTAGTCTCTTGCATAGAGCCATTTTATTGATTCTTTGTAATAGCTTTTGACTATTTTTTGAGTGACCATCTGCTTTGGAAAAGAGTCTGTTTATTGATGCAACTTGATTGAACAGGAACTAACAAGCCTCCTTGATGCACCCGTTGTGTTTTCGCACAATGACCTGCTTTCTGGGAACCTAATGCTAAATGAGGAAGAAGGTAACCACCTATAATACTACATTCCACTTCTTCCCTAGATTCCTTCAAATTCTCCATTACCATATATTACTTTCTTCTCTCATATAACAATATATATGTACTCCCCCCGCTCTCTCATGTACTCTCGTCTATATATCCTTCTCCATTTTCCATATTATTATTATTCCCTTATGATATGAATTATGAAGATGTGCTTTCATATTGCAAATATTAAGGATATTAAATAAGATCTTTTATACTTTCCAAATAGTTGTGCAAGGGACATTAGCATGTCTTTGACTGTTGAGAATGAACAACTTATGAGATGCCGGAGTAAGATATAATGGTAGTAATGTGGACGTTAG
GACTTCGAATGTGGATCTTCTTCTGGTTAGCAATTTGATTCCCTCACTAGGCTGAGAGGGAACTTGACGACCAAGGATTGATTGGCTGTGAACTTTAGTTGGACTTCTCTGCTTTCCCCCAATAAGATTCTCATCATATTGACCTCATATTGCTCATCTTACTAACATGCAGAACGACTCTACTTCATTGATTTCGAGTATGGATCATACAATTACAGAGGCTTCGACATTGGCAATCACTTCAATGAATATGCAGGCTACGACTGTGACTACAGCTGGTAGGTTCTTAATCTTCTTTTCATCTGTAAATTTTCCGTCTAAAATATAGGAAAATTGAAGTTTGACTATAGTTTCATCAATTGCTGAGCAGTTATCCGTCCAAGGAGGAACAGTATCATTTCTTCAGGCATTATTTACAACCTGAAGAACCAGACGAGGTGCGTTGACAGAAATACTCATTTCATTTTAGCTCAAAGATTCATGCATTCTGTTGATGTATGTTTAAAAAGTCTAGAATCTGGTTGACTTTTTCATCAACCCATGATGACCTCTGCCATGAACGCTTTCCTTATCTTACTTTGAGCCTCAACACCATACGCTGTAGGATTGAGGTTTTATGCTCATTCTCACGAATCTTCTTCTCTTAATTCTTTTTCATGCATTTTATGTTGAGGATGGTTGGGAGAGAAGTCCCACATTGACTAATTAAGAGTAAGGAATACATCTCCATTGGTATGAGGCCTTTTCGGGAAGCCAAAAGCAAAGCCATGAGAGCTTATGTTCAAAGTGGACAATACCATACTATTGTGGAGGTCCATGGTTTCTAACATGGTATTGGAGTCATGCTCTTAACTAACCATGCCAATAGAATCCTCAAATGCCGAACAAAGAAGTTGTAAGCCTCAAAGGTGTAGTCAAAAGTGACTTAAGTGTCGAACAAAGGGTGTATTTTGTTTGAGGGCTCCAGAGAAGGAGTCAAGGCCTCAGGGGAGACTCTATAACTGTACTTTGTATGAGGGGAGGATTGTTGAGGATTGTTGGGAGAAAAGTCCCACCTCGACTAATTAAGAGCAAGGAATACATCTCCATTGGTATGAGGCTTTTTGGGGAAACCAAAAGCGAACCCATGAGACCTTATACTCAAAGTAGACAATATCATACCATTGTGGAGAGTCGTGGTTTCTAATGTTATGGGCTTCCAATTCTTTGCTTAAACTGAATTGGTACTCTACCCTAGGCAGTAATCTTTTTTCCTTTTCGTTCTCTTTTTTTTTTTATCACTCCCTTTTTTGAGAACATATGTTATATAACTTATTAATCTCAAATTTATGATTACGATAATTCGAACATGGATGGAATGTAAATGTAGAGTACCAGAATATATTAGTTCTCTGACGCTTCGAGGATCTGCAGAGACATTGTACTTTTTAGCAATCATTTTCTATACTAATTTGAACTACTGTATTTTCGTGGAAGGTTTCCCAAGAAGATCTTGAAGCTCTCTACGTGGAGTCAAACATATTCATGCTGGCTTCACACTTGTATTGGGCTTTATGGGCGCTTATACAGGTATCTTACTTCATCCATTGACCTTCCATACTATTTGACCCAAATGTTTTATGTTCTATGTTCATTCACTTAATTATACTTTTTAGGCAAGGATGTCTCCAATTGATTTTGATTACCTTCGTTATTTCTTCCTGCGTTACAACGAGTATAAAAAGAATAAAGAAAAATATTGCTCTTTGGCGAGATCTTTCCTTGCTCGATGAGGATCGGGTCGTGGACCTGCATAGATAATACAATGGGGGGGGTGTGGTAGTATCAGCAATCAGTTTGTATCAACCCCATAAACATAGAGGCAGGTTTGATTTCTCATTCTTTACTTGATCTGTTGTAAAGTGGTATTTTTACTCTGATAATTCGTCATAAACTTTAAATAATCAATTACGTTTTCATGGCCAGTTCTTAGAAATTTTTAGAAGGATTTGTTAGGAATC
ACGGATCTCTACAATGGTATGATATTGTCCACTTTGAGCATAAGTTCTCGTGACTAGTTCTCATAGCTTTGTTTTGGGCTTCCCCAAAATACCTTATACCAATGGAGATGTATTCCTTACTTATAAACTCATGTCAAACAAAGTACATTATAGAGCCTCCCTTGAGGCCTATGGGGCC

mRNA sequence

AAAAAAAAAAAAAAAAAAAAAAAAGATTGAGAAATCATGGGGGCTGAGAAGATCTACAATGGCTCCGTGGATGTGGAAGAATCTGTAGGAAATGGAGATGGAAATACCGAGTCGTATCAGTTATCGAATCTTTCCATCGACCATTGTCTTCCTCTTCCCGCCATGATCCCTTGCATAATCGAGCTGTGTAAGGATCTGTTTAAGCAGTGGTCGGAGCTGGATGATTCTCGGTTCTCTGTTGAAACGGTCTCCGGTGGGATCACGAATCAGCTGCTTAAGGTTACAGTGGAGGAGGAAAATGGTAGTAACGTTTCCACCACCGTCAGACTATATGGACCTAACACGGATTATGTTATCAATCGTGATCGGGAACTTCAGGCAATCAAATATCTTTCAGCTGCAGGATTTGGTGCCAAGCTTCTTGGAGTTTTTAAAAATGGCATGGTGCAGTCATTTATTCATGCACGTACGCTAGAACCATCAGATATAAGAAAGCCAGAGCTAGCTGCAGAAATTGCTAAACAGCTTAATAAATTCCACAAAGTGTATATTCCAGGTTCTACAGAACCTCAGTTATGGATCGAAATTTTTAAATGTTACGGGAAAGCCTCTGCACTGCAATTTGATGATACCGAGAAGCAATGTATATACGATTCAATTTCTTTTACGGAAATTCACAATGAAGTACTTGAGATTAAGGAACTAACAAGCCTCCTTGATGCACCCGTTGTGTTTTCGCACAATGACCTGCTTTCTGGGAACCTAATGCTAAATGAGGAAGAAGAACGACTCTACTTCATTGATTTCGAGTATGGATCATACAATTACAGAGGCTTCGACATTGGCAATCACTTCAATGAATATGCAGGCTACGACTGTGACTACAGCTGTTATCCGTCCAAGGAGGAACAGTATCATTTCTTCAGGCATTATTTACAACCTGAAGAACCAGACGAGGTTTCCCAAGAAGATCTTGAAGCTCTCTACGTGGAGTCAAACATATTCATGCTGGCTTCACACTTGTATTGGGCTTTATGGGCGCTTATACAGGCAAGGATGTCTCCAATTGATTTTGATTACCTTCGTTATTTCTTCCTGCGTTACAACGAGTATAAAAAGAATAAAGAAAAATATTGCTCTTTGGCGAGATCTTTCCTTGCTCGATGAGGATCGGGTCGTGGACCTGCATAGATAATACAATGGGGGGGGTGTGGTAGTATCAGCAATCAGTTTGTATCAACCCCATAAACATAGAGGCAGGTTTGATTTCTCATTCTTTACTTGATCTGTTGTAAAGTGGTATTTTTACTCTGATAATTCGTCATAAACTTTAAATAATCAATTACGTTTTCATGGCCAGTTCTTAGAAATTTTTAGAAGGATTTGTTAGGAATCACGGATCTCTACAATGGTATGATATTGTCCACTTTGAGCATAAGTTCTCGTGACTAGTTCTCATAGCTTTGTTTTGGGCTTCCCCAAAATACCTTATACCAATGGAGATGTATTCCTTACTTATAAACTCATGTCAAACAAAGTACATTATAGAGCCTCCCTTGAGGCCTATGGGGCC

Coding sequence (CDS)

ATGGGGGCTGAGAAGATCTACAATGGCTCCGTGGATGTGGAAGAATCTGTAGGAAATGGAGATGGAAATACCGAGTCGTATCAGTTATCGAATCTTTCCATCGACCATTGTCTTCCTCTTCCCGCCATGATCCCTTGCATAATCGAGCTGTGTAAGGATCTGTTTAAGCAGTGGTCGGAGCTGGATGATTCTCGGTTCTCTGTTGAAACGGTCTCCGGTGGGATCACGAATCAGCTGCTTAAGGTTACAGTGGAGGAGGAAAATGGTAGTAACGTTTCCACCACCGTCAGACTATATGGACCTAACACGGATTATGTTATCAATCGTGATCGGGAACTTCAGGCAATCAAATATCTTTCAGCTGCAGGATTTGGTGCCAAGCTTCTTGGAGTTTTTAAAAATGGCATGGTGCAGTCATTTATTCATGCACGTACGCTAGAACCATCAGATATAAGAAAGCCAGAGCTAGCTGCAGAAATTGCTAAACAGCTTAATAAATTCCACAAAGTGTATATTCCAGGTTCTACAGAACCTCAGTTATGGATCGAAATTTTTAAATGTTACGGGAAAGCCTCTGCACTGCAATTTGATGATACCGAGAAGCAATGTATATACGATTCAATTTCTTTTACGGAAATTCACAATGAAGTACTTGAGATTAAGGAACTAACAAGCCTCCTTGATGCACCCGTTGTGTTTTCGCACAATGACCTGCTTTCTGGGAACCTAATGCTAAATGAGGAAGAAGAACGACTCTACTTCATTGATTTCGAGTATGGATCATACAATTACAGAGGCTTCGACATTGGCAATCACTTCAATGAATATGCAGGCTACGACTGTGACTACAGCTGTTATCCGTCCAAGGAGGAACAGTATCATTTCTTCAGGCATTATTTACAACCTGAAGAACCAGACGAGGTTTCCCAAGAAGATCTTGAAGCTCTCTACGTGGAGTCAAACATATTCATGCTGGCTTCACACTTGTATTGGGCTTTATGGGCGCTTATACAGGCAAGGATGTCTCCAATTGATTTTGATTACCTTCGTTATTTCTTCCTGCGTTACAACGAGTATAAAAAGAATAAAGAAAAATATTGCTCTTTGGCGAGATCTTTCCTTGCTCGATGA

Protein sequence

MGAEKIYNGSVDVEESVGNGDGNTESYQLSNLSIDHCLPLPAMIPCIIELCKDLFKQWSELDDSRFSVETVSGGITNQLLKVTVEEENGSNVSTTVRLYGPNTDYVINRDRELQAIKYLSAAGFGAKLLGVFKNGMVQSFIHARTLEPSDIRKPELAAEIAKQLNKFHKVYIPGSTEPQLWIEIFKCYGKASALQFDDTEKQCIYDSISFTEIHNEVLEIKELTSLLDAPVVFSHNDLLSGNLMLNEEEERLYFIDFEYGSYNYRGFDIGNHFNEYAGYDCDYSCYPSKEEQYHFFRHYLQPEEPDEVSQEDLEALYVESNIFMLASHLYWALWALIQARMSPIDFDYLRYFFLRYNEYKKNKEKYCSLARSFLAR
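
The protein sequence above corresponds to the CDS read codon by codon. A minimal sketch of that translation, assuming the standard nuclear genetic code and using only the first six and last four codons of the CDS as input (the names `CODON_TABLE` and `translate` are illustrative, not part of the database):

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


print(translate("ATGGGGGCTGAGAAGATC"))  # first six codons of the CDS: MGAEKI
print(translate("CTTGCTCGATGA"))        # last four codons of the CDS: LAR*
```

The two outputs agree with the start (`MGAEKI`) and end (`LAR`, followed by the stop codon) of the protein sequence listed above.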
BLAST of CmaCh09G011940 vs. Swiss-Prot
Match: EKI_ARATH (Probable ethanolamine kinase OS=Arabidopsis thaliana GN=EMB1187 PE=2 SV=1)

HSP 1 Score: 518.5 bits (1334), Expect = 6.3e-146
Identity = 252/375 (67.20%), Positives = 306/375 (81.60%), Query Frame = 1

Query: 1   MGAEKIYNGSVDVEESVGNGDGNTESYQLSNLSIDHCLPLPAMIPCIIELCKDLFKQWSE 60
           MGA K      + E++  +     E    S+  +D  LPLP MIP IIELCKDLFK W E
Sbjct: 1   MGAAKNIWALANAEDAANDA----EQIPYSSFVVDTSLPLPLMIPRIIELCKDLFKNWGE 60

Query: 61  LDDSRFSVETVSGGITNQLLKVTVEEENGSNVSTTVRLYGPNTDYVINRDRELQAIKYLS 120
           LDDS FSVE VSGGITN LLKV+V+E+    VS TVRLYGPNT+YVINR+RE+ AIKYLS
Sbjct: 61  LDDSLFSVERVSGGITNLLLKVSVKEDTNKEVSVTVRLYGPNTEYVINREREILAIKYLS 120

Query: 121 AAGFGAKLLGVFKNGMVQSFIHARTLEPSDIRKPELAAEIAKQLNKFHKVYIPGSTEPQL 180
           AAGFGAKLLG F NGMVQSFI+ARTLEPSD+R+P++AA+IA++L KFHKV IPGS EPQL
Sbjct: 121 AAGFGAKLLGGFGNGMVQSFINARTLEPSDMREPKIAAQIARELGKFHKVDIPGSKEPQL 180

Query: 181 WIEIFKCYGKASALQFDDTEKQCIYDSISFTEIHNEVLEIKELTSLLDAPVVFSHNDLLS 240
           W++I K Y KAS L F++ +KQ ++++ISF E+H E++E++E T LL+APVVF+HNDLLS
Sbjct: 181 WVDILKFYEKASTLTFEEPDKQKLFETISFEELHKEIIELREFTGLLNAPVVFAHNDLLS 240

Query: 241 GNLMLNEEEERLYFIDFEYGSYNYRGFDIGNHFNEYAGYDCDYSCYPSKEEQYHFFRHYL 300
           GN MLN+EEE+LY IDFEYGSYNYRGFDIGNHFNEYAGYDCDYS YPSKEEQYHF +HYL
Sbjct: 241 GNFMLNDEEEKLYLIDFEYGSYNYRGFDIGNHFNEYAGYDCDYSLYPSKEEQYHFIKHYL 300

Query: 301 QPEEPDEVSQEDLEALYVESNIFMLASHLYWALWALIQARMSPIDFDYLRYFFLRYNEYK 360
           QP++PDEVS  ++E+++VE++ + LASHLYWA+WA+IQARMSPI+F+YL YFFLRYNEYK
Sbjct: 301 QPDKPDEVSIAEVESVFVETDAYKLASHLYWAIWAIIQARMSPIEFEYLGYFFLRYNEYK 360

Query: 361 KNKEKYCSLARSFLA 376
           K K    SL  S L+
Sbjct: 361 KQKPLTFSLVTSHLS 371

BLAST of CmaCh09G011940 vs. Swiss-Prot
Match: EKIA_DICDI (Probable ethanolamine kinase A OS=Dictyostelium discoideum GN=etnkA PE=3 SV=1)

HSP 1 Score: 231.1 bits (588), Expect = 2.0e-59
Identity = 123/325 (37.85%), Positives = 188/325 (57.85%), Query Frame = 1

Query: 47  IIELCKDLFKQWSELDDSRFSVETVSGGITNQLLKVTVE--EENGSNVSTTVRLYGPNTD 106
           + ++ +    ++    D   +++ ++GGITN L  V  +  E+    +   +RLYG  ++
Sbjct: 25  LCDIARYFVPEYRNSKDEDLTIQKLNGGITNVLYLVEDKNIEQKYRYLPVVIRLYGYKSE 84

Query: 107 YVINRDRELQAIKYLSAAGFGAKLLGVFKNGMVQSFIHARTLEPSDIRKPELAAEIAKQL 166
            +I+R  EL         G GAK  G+F NG +  FI    L   DI KP +   IAK++
Sbjct: 85  EIIDRKNELIIQTEADQNGLGAKFYGLFDNGCIYGFIKGEPLAYEDISKPTMQTCIAKEI 144

Query: 167 NKFHKVYIPGSTEPQLWIEIFKCYGKASALQFDDTEKQCIYDSISFTEIHNEVLEIKELT 226
            ++H + +P    P LW  I K    A  + +   EK   Y SI+  ++  E   +++  
Sbjct: 145 AQWHSIEMPTRKNPSLWPTIKKWAALAPDV-YPVPEKNEYYQSINVKKMIEEGKMLEQRL 204

Query: 227 SLLDAPVVFSHNDLLSGNLMLNEEEERLYFIDFEYGSYNYRGFDIGNHFNEYAGYDCDYS 286
           + L++P+VF HNDLLSGN++ +  +    FIDFEY +YN+RG ++GNHFNEYAG+  DYS
Sbjct: 205 AQLNSPIVFCHNDLLSGNIIYDPSQNCASFIDFEYANYNFRGLELGNHFNEYAGFGPDYS 264

Query: 287 CYPSKEEQYHFFRHYLQPEEPDEVSQEDLEALYVESNIFMLASHLYWALWALIQARMSPI 346
            YP+KE Q HF   Y +     E +Q++LE LY+ESN F LASHLYW  WA++QA  S I
Sbjct: 265 LYPNKESQIHFLTDYHRSLFKTEPTQDELEKLYIESNQFSLASHLYWGFWAIVQAMNSQI 324

Query: 347 DFDYLRYFFLRYNEYKKNKEKYCSL 370
           DFDYL Y   R++ Y + ++++ +L
Sbjct: 325 DFDYLEYGKARFDRYYETRDQFLNL 348

BLAST of CmaCh09G011940 vs. Swiss-Prot
Match: EKI2_MOUSE (Ethanolamine kinase 2 OS=Mus musculus GN=Etnk2 PE=1 SV=1)

HSP 1 Score: 218.8 bits (556), Expect = 1.0e-55
Identity = 124/329 (37.69%), Positives = 190/329 (57.75%), Query Frame = 1

Query: 43  MIPCIIELCKDLFKQWSELDDSRFSVETVSGGITNQLLKVTVEEENGSNVSTTVRLYGPN 102
           ++P  + L ++L   W      +   +    GITN+LL   VEE+    V   VR+YG  
Sbjct: 62  ILPGALRLIRELRPHWKP---EQVRTKRFKDGITNKLLACYVEEDMRDCV--LVRVYGER 121

Query: 103 TDYVINRDRELQAIKYLSAAGFGAKLLGVFKNGMVQSFIHARTLEPSDIRKPELAAEIAK 162
           T+ +++R+ E++  + L A G   KL   F+NG+   ++    L P  IR+P+L   IA 
Sbjct: 122 TELLVDRENEVRNFQLLRAHGCAPKLYCTFQNGLCYEYVQGVALGPEHIREPQLFRLIAL 181

Query: 163 QLNKFHKVYIPGST-EPQLWIEIFKCYGKASALQFDDTEKQCIYDSISFTEIHNEVLEIK 222
           ++ K H ++  GS  +P LW ++ + +     L  D+       D      +  E+  +K
Sbjct: 182 EMAKIHTIHANGSLPKPTLWHKMHRYF----TLVKDEISPSLSADVPKVEVLEQELAWLK 241

Query: 223 ELTSLLDAPVVFSHNDLLSGNLMLNEEEERLYFIDFEYGSYNYRGFDIGNHFNEYAGYDC 282
           E  S LD+PVVF HNDLL  N++ + ++ R+ FID+EY  YNY+ FDIGNHFNE+AG + 
Sbjct: 242 EHLSQLDSPVVFCHNDLLCKNIIYDSDKGRVCFIDYEYAGYNYQAFDIGNHFNEFAGVNV 301

Query: 283 -DYSCYPSKEEQYHFFRHYLQPEEPDEVSQEDLEALYVESNIFMLASHLYWALWALIQAR 342
            DYS YP++E Q  + R+YL+ ++    S  ++E LY + N F LASH +WALWALIQ +
Sbjct: 302 VDYSRYPARETQVQWLRYYLEAQKGTAASPREVERLYAQVNKFALASHFFWALWALIQNQ 361

Query: 343 MSPIDFDYLRYFFLRYNEYKKNKEKYCSL 370
            S I FD+LRY  +R+N+Y K K +  +L
Sbjct: 362 YSTISFDFLRYAVIRFNQYFKVKPQVSAL 381

BLAST of CmaCh09G011940 vs. Swiss-Prot
Match: EKI2_RAT (Ethanolamine kinase 2 OS=Rattus norvegicus GN=Etnk2 PE=3 SV=1)

HSP 1 Score: 213.0 bits (541), Expect = 5.6e-54
Identity = 122/329 (37.08%), Positives = 189/329 (57.45%), Query Frame = 1

Query: 43  MIPCIIELCKDLFKQWSELDDSRFSVETVSGGITNQLLKVTVEEENGSNVSTTVRLYGPN 102
           ++P  + L ++L   W      +   +    GITN+LL   VEE+    V   VR+YG  
Sbjct: 62  ILPGALRLIRELRPHWKP---EQVRTKRFKDGITNKLLACYVEEDMRDCV--LVRVYGEW 121

Query: 103 TDYVINRDRELQAIKYLSAAGFGAKLLGVFKNGMVQSFIHARTLEPSDIRKPELAAEIAK 162
           T+ +++R+ E++  + L A G   KL   F+NG+   ++    L P  IR+P+L   IA 
Sbjct: 122 TELLVDRENEIRNFQLLRAHGCAPKLYCTFQNGLCYEYMQGVALGPEHIREPQLFRLIAL 181

Query: 163 QLNKFHKVYIPGST-EPQLWIEIFKCYGKASALQFDDTEKQCIYDSISFTEIHNEVLEIK 222
           ++ K H ++  GS  +P LW ++ + +     L  D+       D      +  E+  +K
Sbjct: 182 EMAKIHTIHANGSLPKPTLWHKMHRYF----TLVKDEISPSLSADVPKVEVLEQELAWLK 241

Query: 223 ELTSLLDAPVVFSHNDLLSGNLMLNEEEERLYFIDFEYGSYNYRGFDIGNHFNEYAGY-D 282
           E  S LD+PVVF HNDLL  N++ + ++  + FID+EY  YNY+ FDIGNHFNE+AG  +
Sbjct: 242 EHLSQLDSPVVFCHNDLLCKNIIYDSDKGHVRFIDYEYAGYNYQAFDIGNHFNEFAGVNE 301

Query: 283 CDYSCYPSKEEQYHFFRHYLQPEEPDEVSQEDLEALYVESNIFMLASHLYWALWALIQAR 342
            DY  YP++E Q  + R+YL+ ++    S  ++E LY + N F LASH +WALWALIQ +
Sbjct: 302 VDYCRYPAREIQLQWLRYYLEAQKGTAASPREVERLYAQVNKFALASHFFWALWALIQNQ 361

Query: 343 MSPIDFDYLRYFFLRYNEYKKNKEKYCSL 370
            S I+FD+LRY  +R+N+Y K K +  +L
Sbjct: 362 YSTINFDFLRYAVIRFNQYFKVKPQVSAL 381

BLAST of CmaCh09G011940 vs. Swiss-Prot
Match: EKI1_HUMAN (Ethanolamine kinase 1 OS=Homo sapiens GN=ETNK1 PE=1 SV=1)

HSP 1 Score: 208.4 bits (529), Expect = 1.4e-52
Identity = 120/331 (36.25%), Positives = 188/331 (56.80%), Query Frame = 1

Query: 48  IELCKDLFKQWSELDDSRFSVETVSGGITNQLLKVTVEEENGSNVSTTVRLYGPNTDYVI 107
           + L + L   W   D    +++  + GITN+L+   V   N       VR+YG  T+ ++
Sbjct: 123 LSLLQHLRPHW---DPQEVTLQLFTDGITNKLIGCYVG--NTMEDVVLVRIYGNKTELLV 182

Query: 108 NRDRELQAIKYLSAAGFGAKLLGVFKNGMVQSFIHARTLEPSDIRKPELAAEIAKQLNKF 167
           +RD E+++ + L A G   +L   F NG+   FI    L+P  +  P +   IA+QL K 
Sbjct: 183 DRDEEVKSFRVLQAHGCAPQLYCTFNNGLCYEFIQGEALDPKHVCNPAIFRLIARQLAKI 242

Query: 168 HKVYIPGSTEPQ--LWIEIFKCYGKA-SALQFDDTEKQCIYDSISFTEIHNEVLEIKELT 227
           H ++      P+  LW+++ K +    +    +D  K+ + D  S   +  E+  +KE+ 
Sbjct: 243 HAIHAHNGWIPKSNLWLKMGKYFSLIPTGFADEDINKRFLSDIPSSQILQEEMTWMKEIL 302

Query: 228 SLLDAPVVFSHNDLLSGNLMLNEEEERLYFIDFEYGSYNYRGFDIGNHFNEYAGY-DCDY 287
           S L +PVV  HNDLL  N++ NE++  + FID+EY  YNY  +DIGNHFNE+AG  D DY
Sbjct: 303 SNLGSPVVLCHNDLLCKNIIYNEKQGDVQFIDYEYSGYNYLAYDIGNHFNEFAGVSDVDY 362

Query: 288 SCYPSKEEQYHFFRHYLQPEEP-----DEVSQEDLEALYVESNIFMLASHLYWALWALIQ 347
           S YP +E Q  + R YL+  +       EV+++++E L+++ N F LASH +W LWALIQ
Sbjct: 363 SLYPDRELQSQWLRAYLEAYKEFKGFGTEVTEKEVEILFIQVNQFALASHFFWGLWALIQ 422

Query: 348 ARMSPIDFDYLRYFFLRYNEYKKNKEKYCSL 370
           A+ S I+FD+L Y  +R+N+Y K K +  +L
Sbjct: 423 AKYSTIEFDFLGYAIVRFNQYFKMKPEVTAL 448

BLAST of CmaCh09G011940 vs. TrEMBL
Match: A0A0A0KDN9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G018610 PE=4 SV=1)

HSP 1 Score: 672.5 bits (1734), Expect = 2.9e-190
Identity = 331/376 (88.03%), Positives = 350/376 (93.09%), Query Frame = 1

Query: 1   MGAEKIYNGSVDVEESVGNGDGNTESYQLSNLSIDHCLPLPAMIPCIIELCKDLFKQWSE 60
           MGA+KIYNG VDV E+V +GD N ESYQLS LS+DH LPLPA+ P IIELCKDLFK+WSE
Sbjct: 1   MGAKKIYNGDVDVVEAVEDGDSNAESYQLSYLSVDHSLPLPAITPRIIELCKDLFKEWSE 60

Query: 61  LDDSRFSVETVSGGITNQLLKVTVEEENGSNVSTTVRLYGPNTDYVINRDRELQAIKYLS 120
           LD SRFSVETVSGGITNQLLKVTV+EE+G++VS TVRLYGPNTDYVINRDRELQAIKYLS
Sbjct: 61  LDASRFSVETVSGGITNQLLKVTVKEESGTSVSVTVRLYGPNTDYVINRDRELQAIKYLS 120

Query: 121 AAGFGAKLLGVFKNGMVQSFIHARTLEPSDIRKPELAAEIAKQLNKFHKVYIPGSTEPQL 180
           AAGFGAKLLGVFKNGMVQSFIHARTLEPSD+RKPELAAEIAKQLNKFHKVYIPGS EPQL
Sbjct: 121 AAGFGAKLLGVFKNGMVQSFIHARTLEPSDLRKPELAAEIAKQLNKFHKVYIPGSNEPQL 180

Query: 181 WIEIFKCYGKASALQFDDTEKQCIYDSISFTEIHNEVLEIKELTSLLDAPVVFSHNDLLS 240
           W EI   Y KAS LQFDDT KQ IYD+ISF EIHNE+LEIKELTSLL+AP+VF+HNDLLS
Sbjct: 181 WNEILNFYDKASTLQFDDTGKQSIYDTISFQEIHNEILEIKELTSLLNAPIVFAHNDLLS 240

Query: 241 GNLMLNEEEERLYFIDFEYGSYNYRGFDIGNHFNEYAGYDCDYSCYPSKEEQYHFFRHYL 300
           GNLMLNEEE RLYFIDFEYGSY+YRGFDIGNHFNEYAGYDCDYSCYPSKEEQYHFFRHYL
Sbjct: 241 GNLMLNEEEGRLYFIDFEYGSYSYRGFDIGNHFNEYAGYDCDYSCYPSKEEQYHFFRHYL 300

Query: 301 QPEEPDEVSQEDLEALYVESNIFMLASHLYWALWALIQARMSPIDFDYLRYFFLRYNEYK 360
           QPE+PDEVSQ+DLEALYVESN FMLASHLYWALWALIQARMSPIDFDYL YFFLRY EYK
Sbjct: 301 QPEKPDEVSQKDLEALYVESNTFMLASHLYWALWALIQARMSPIDFDYLSYFFLRYGEYK 360

Query: 361 KNKEKYCSLARSFLAR 377
           K KEKYCSLARSFLAR
Sbjct: 361 KQKEKYCSLARSFLAR 376

BLAST of CmaCh09G011940 vs. TrEMBL
Match: A0A0B0NMB3_GOSAR (Putative ethanolamine kinase A OS=Gossypium arboreum GN=F383_18311 PE=4 SV=1)

HSP 1 Score: 552.4 bits (1422), Expect = 4.4e-154
Identity = 267/358 (74.58%), Positives = 313/358 (87.43%), Query Frame = 1

Query: 18  GNGDGNTESYQLSNLSIDHCLPLPAMIPCIIELCKDLFKQWSELDDSRFSVETVSGGITN 77
           GN   +  S   SNLS+D  LP P M+P +I LCKDLF++W++L+DS FSV+TVSGGITN
Sbjct: 22  GNSTFDDHSILHSNLSVDTALPFPLMVPRVIALCKDLFRKWAKLNDSCFSVDTVSGGITN 81

Query: 78  QLLKVTVEEENGSNVSTTVRLYGPNTDYVINRDRELQAIKYLSAAGFGAKLLGVFKNGMV 137
            LLKV+V+EENG  VS TVRLYGPNT+YVINR+RELQAIKYLSAAGFGAKLLGVF NGMV
Sbjct: 82  LLLKVSVKEENGEYVSVTVRLYGPNTEYVINRERELQAIKYLSAAGFGAKLLGVFGNGMV 141

Query: 138 QSFIHARTLEPSDIRKPELAAEIAKQLNKFHKVYIPGSTEPQLWIEIFKCYGKASALQFD 197
           QSFI+ARTL P+D+RKP+L +EIAKQL +FH+V IPGS EPQLW++IFK + KAS LQF+
Sbjct: 142 QSFINARTLTPADMRKPKLVSEIAKQLRRFHQVEIPGSKEPQLWVDIFKFFEKASTLQFE 201

Query: 198 DTEKQCIYDSISFTEIHNEVLEIKELTSLLDAPVVFSHNDLLSGNLMLNEEEERLYFIDF 257
           DT+KQ  Y++ISF E+H EV E+KELT+LL++PVVF+HNDLLSGNLM N+E+E+LY IDF
Sbjct: 202 DTDKQRTYETISFKEVHKEVKELKELTTLLNSPVVFAHNDLLSGNLMHNDEQEKLYLIDF 261

Query: 258 EYGSYNYRGFDIGNHFNEYAGYDCDYSCYPSKEEQYHFFRHYLQPEEPDEVSQEDLEALY 317
           EYGSYNYRGFDIGNHFNEYAGYDCDYS YPSK+EQYHFFRHYL+PE+P EVS++DLEALY
Sbjct: 262 EYGSYNYRGFDIGNHFNEYAGYDCDYSLYPSKDEQYHFFRHYLEPEKPCEVSEKDLEALY 321

Query: 318 VESNIFMLASHLYWALWALIQARMSPIDFDYLRYFFLRYNEYKKNKEKYCSLARSFLA 376
           VE+N FMLASHLYWALWALIQARMSPIDFDYL YFFLRYNEYKK K    SLA+S ++
Sbjct: 322 VETNTFMLASHLYWALWALIQARMSPIDFDYLGYFFLRYNEYKKQKRMCFSLAKSHIS 379

BLAST of CmaCh09G011940 vs. TrEMBL
Match: B9MTI9_POPTR (Choline/ethanolamine kinase family protein OS=Populus trichocarpa GN=POPTR_0006s12230g PE=4 SV=1)

HSP 1 Score: 550.4 bits (1417), Expect = 1.7e-153
Identity = 267/366 (72.95%), Positives = 318/366 (86.89%), Query Frame = 1

Query: 10  SVDVEESVGNGDGNTESYQLSNLSIDHCLPLPAMIPCIIELCKDLFKQWSELDDSRFSVE 69
           +++V E     + ++   Q ++L++D  L LP + P +IELCKDLFK+WS LDDS FSVE
Sbjct: 16  AMEVAEGARGDNSSSHVLQSASLTLDTSLSLPDLTPPLIELCKDLFKKWSRLDDSSFSVE 75

Query: 70  TVSGGITNQLLKVTVEEENGSNVSTTVRLYGPNTDYVINRDRELQAIKYLSAAGFGAKLL 129
           TVSGGITN LLKV+V+EE+G+ V  TVRLYGPNTDYVINR+RELQAIKYLSAAGFGAKLL
Sbjct: 76  TVSGGITNLLLKVSVKEEDGNEVPVTVRLYGPNTDYVINRERELQAIKYLSAAGFGAKLL 135

Query: 130 GVFKNGMVQSFIHARTLEPSDIRKPELAAEIAKQLNKFHKVYIPGSTEPQLWIEIFKCYG 189
           GVF+NGMVQSFI+ARTL P D+R+P+LAAEIAKQL+KFH+V IPGS EPQLW +IFK Y 
Sbjct: 136 GVFQNGMVQSFINARTLIPQDMREPKLAAEIAKQLHKFHRVDIPGSKEPQLWNDIFKFYE 195

Query: 190 KASALQFDDTEKQCIYDSISFTEIHNEVLEIKELTSLLDAPVVFSHNDLLSGNLMLNEEE 249
            AS L FDD EK+  Y++I F E++NEV+EIKELT LL+APVVF+HNDLLSGNLMLN++E
Sbjct: 196 NASTLHFDDIEKRKKYETILFKEVYNEVVEIKELTDLLNAPVVFAHNDLLSGNLMLNDDE 255

Query: 250 ERLYFIDFEYGSYNYRGFDIGNHFNEYAGYDCDYSCYPSKEEQYHFFRHYLQPEEPDEVS 309
           E+LY IDFEYGSY+YRG+DIGNHFNEYAGYDCDYS YPSK+EQYHFFRHYLQP++P EVS
Sbjct: 256 EKLYIIDFEYGSYSYRGYDIGNHFNEYAGYDCDYSLYPSKDEQYHFFRHYLQPDKPHEVS 315

Query: 310 QEDLEALYVESNIFMLASHLYWALWALIQARMSPIDFDYLRYFFLRYNEYKKNKEKYCSL 369
            +DLEALYVESN +ML SHL+WALWALIQA+MSPIDFDYL YFFLRY+E+K+ KEK CSL
Sbjct: 316 DKDLEALYVESNTYMLVSHLFWALWALIQAKMSPIDFDYLGYFFLRYDEFKRRKEKACSL 375

Query: 370 ARSFLA 376
           ARS+L+
Sbjct: 376 ARSYLS 381

BLAST of CmaCh09G011940 vs. TrEMBL
Match: A0A067KBK1_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_13308 PE=4 SV=1)

HSP 1 Score: 549.7 bits (1415), Expect = 2.8e-153
Identity = 274/378 (72.49%), Positives = 321/378 (84.92%), Query Frame = 1

Query: 1   MGA--EKIYNGSVDVEESVGNGDGNTESYQLSNLSIDHCLPLPAMIPCIIELCKDLFKQW 60
           MGA  +K ++G    EE+ GN   +  S   SNL++D  L LP M P ++ELCKDLFK+W
Sbjct: 1   MGAAEQKSWDGMEVAEEARGNCTSDILS---SNLTVDCSLSLPQMAPRVVELCKDLFKKW 60

Query: 61  SELDDSRFSVETVSGGITNQLLKVTVEEENGSNVSTTVRLYGPNTDYVINRDRELQAIKY 120
           S+LDDS FSVE VSGGITN LLKVTV+EE+G+ VS TVR+YGPNTDYVI+R+RELQAIKY
Sbjct: 61  SKLDDSLFSVERVSGGITNLLLKVTVKEEDGNEVSVTVRIYGPNTDYVIHRERELQAIKY 120

Query: 121 LSAAGFGAKLLGVFKNGMVQSFIHARTLEPSDIRKPELAAEIAKQLNKFHKVYIPGSTEP 180
           LSAAGFGAKLLG + NGMVQSFI+ARTL P+D+RKP+LAAEIA+QL++FHKV IPGS EP
Sbjct: 121 LSAAGFGAKLLGTYGNGMVQSFINARTLTPADMRKPKLAAEIARQLHEFHKVEIPGSKEP 180

Query: 181 QLWIEIFKCYGKASALQFDDTEKQCIYDSISFTEIHNEVLEIKELTSLLDAPVVFSHNDL 240
           QLW EIFK Y  AS LQFDD EKQ IY++ISF E++NE++EIK LT  L+APVVF+HNDL
Sbjct: 181 QLWNEIFKFYENASILQFDDIEKQRIYETISFKEVYNEIVEIKGLTDRLNAPVVFAHNDL 240

Query: 241 LSGNLMLNEEEERLYFIDFEYGSYNYRGFDIGNHFNEYAGYDCDYSCYPSKEEQYHFFRH 300
           LSGNLMLN++E +LYFIDFEYGSY+YRG+DI NHFNEYAGYDCDYS YPSK+EQYHFFRH
Sbjct: 241 LSGNLMLNDDENKLYFIDFEYGSYSYRGYDIANHFNEYAGYDCDYSLYPSKDEQYHFFRH 300

Query: 301 YLQPEEPDEVSQEDLEALYVESNIFMLASHLYWALWALIQARMSPIDFDYLRYFFLRYNE 360
           YLQP++P EVS  DLEALY+E+N FMLASH +WALWALIQA+MSPIDFDYL YFFLRYNE
Sbjct: 301 YLQPDKPHEVSDRDLEALYIEANTFMLASHFFWALWALIQAKMSPIDFDYLGYFFLRYNE 360

Query: 361 YKKNKEKYCSLARSFLAR 377
           YKK KE   SLA S+L+R
Sbjct: 361 YKKQKEMSSSLALSYLSR 375

BLAST of CmaCh09G011940 vs. TrEMBL
Match: A0A061GV59_THECC (Kinase superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_038050 PE=4 SV=1)

HSP 1 Score: 549.3 bits (1414), Expect = 3.7e-153
Identity = 273/379 (72.03%), Positives = 322/379 (84.96%), Query Frame = 1

Query: 1   MGAEKIYNGSVDVEESVGN-GDGNT---ESYQLSNLSIDHCLPLPAMIPCIIELCKDLFK 60
           MGA +    ++D+E +     +GN+   +S   S LS+D  L  P M+ C+IELCKDLF 
Sbjct: 1   MGATRKIWTAMDIEANQAKQNNGNSTVDDSIPHSALSVDTALSFPLMVSCVIELCKDLFG 60

Query: 61  QWSELDDSRFSVETVSGGITNQLLKVTVEEENGSNVSTTVRLYGPNTDYVINRDRELQAI 120
           +W++LDDS FSVETVSGGITN LLKV+V+EENG +V  TVRLYGPNT+YVINR+RELQAI
Sbjct: 61  KWAKLDDSCFSVETVSGGITNLLLKVSVKEENGDDVYVTVRLYGPNTEYVINRERELQAI 120

Query: 121 KYLSAAGFGAKLLGVFKNGMVQSFIHARTLEPSDIRKPELAAEIAKQLNKFHKVYIPGST 180
           KYLSAAGFGAKLLGVF+NGMVQSFI+ARTL  SD+RKP+L AEIAKQL +FH+V IPGS 
Sbjct: 121 KYLSAAGFGAKLLGVFENGMVQSFINARTLTSSDMRKPKLVAEIAKQLRRFHQVEIPGSK 180

Query: 181 EPQLWIEIFKCYGKASALQFDDTEKQCIYDSISFTEIHNEVLEIKELTSLLDAPVVFSHN 240
           EPQLW++I K + KASALQF+D +KQ IY++I F E+H EV ++KELT LL+APVVF+HN
Sbjct: 181 EPQLWVDILKFFEKASALQFEDIDKQMIYETILFEEVHKEVTQLKELTGLLNAPVVFAHN 240

Query: 241 DLLSGNLMLNEEEERLYFIDFEYGSYNYRGFDIGNHFNEYAGYDCDYSCYPSKEEQYHFF 300
           DLLSGNLMLN+E ++LYFIDFEYGSYNYRGFDIGNHFNEYAGYDCDYS YPSK+EQY FF
Sbjct: 241 DLLSGNLMLNDEHDKLYFIDFEYGSYNYRGFDIGNHFNEYAGYDCDYSLYPSKDEQYLFF 300

Query: 301 RHYLQPEEPDEVSQEDLEALYVESNIFMLASHLYWALWALIQARMSPIDFDYLRYFFLRY 360
           RHYLQPE+P EVS++DLEALYVE+N FMLASHLYWALWA+IQARMSPIDFDYL YFFLRY
Sbjct: 301 RHYLQPEKPYEVSEKDLEALYVETNTFMLASHLYWALWAIIQARMSPIDFDYLGYFFLRY 360

Query: 361 NEYKKNKEKYCSLARSFLA 376
           NEYK+ KE   SLA+S L+
Sbjct: 361 NEYKRQKEMCFSLAQSHLS 379

BLAST of CmaCh09G011940 vs. TAIR10
Match: AT2G26830.1 (AT2G26830.1 Protein kinase superfamily protein)

HSP 1 Score: 518.5 bits (1334), Expect = 3.5e-147
Identity = 252/375 (67.20%), Positives = 306/375 (81.60%), Query Frame = 1

Query: 1   MGAEKIYNGSVDVEESVGNGDGNTESYQLSNLSIDHCLPLPAMIPCIIELCKDLFKQWSE 60
           MGA K      + E++  +     E    S+  +D  LPLP MIP IIELCKDLFK W E
Sbjct: 1   MGAAKNIWALANAEDAANDA----EQIPYSSFVVDTSLPLPLMIPRIIELCKDLFKNWGE 60

Query: 61  LDDSRFSVETVSGGITNQLLKVTVEEENGSNVSTTVRLYGPNTDYVINRDRELQAIKYLS 120
           LDDS FSVE VSGGITN LLKV+V+E+    VS TVRLYGPNT+YVINR+RE+ AIKYLS
Sbjct: 61  LDDSLFSVERVSGGITNLLLKVSVKEDTNKEVSVTVRLYGPNTEYVINREREILAIKYLS 120

Query: 121 AAGFGAKLLGVFKNGMVQSFIHARTLEPSDIRKPELAAEIAKQLNKFHKVYIPGSTEPQL 180
           AAGFGAKLLG F NGMVQSFI+ARTLEPSD+R+P++AA+IA++L KFHKV IPGS EPQL
Sbjct: 121 AAGFGAKLLGGFGNGMVQSFINARTLEPSDMREPKIAAQIARELGKFHKVDIPGSKEPQL 180

Query: 181 WIEIFKCYGKASALQFDDTEKQCIYDSISFTEIHNEVLEIKELTSLLDAPVVFSHNDLLS 240
           W++I K Y KAS L F++ +KQ ++++ISF E+H E++E++E T LL+APVVF+HNDLLS
Sbjct: 181 WVDILKFYEKASTLTFEEPDKQKLFETISFEELHKEIIELREFTGLLNAPVVFAHNDLLS 240

Query: 241 GNLMLNEEEERLYFIDFEYGSYNYRGFDIGNHFNEYAGYDCDYSCYPSKEEQYHFFRHYL 300
           GN MLN+EEE+LY IDFEYGSYNYRGFDIGNHFNEYAGYDCDYS YPSKEEQYHF +HYL
Sbjct: 241 GNFMLNDEEEKLYLIDFEYGSYNYRGFDIGNHFNEYAGYDCDYSLYPSKEEQYHFIKHYL 300

Query: 301 QPEEPDEVSQEDLEALYVESNIFMLASHLYWALWALIQARMSPIDFDYLRYFFLRYNEYK 360
           QP++PDEVS  ++E+++VE++ + LASHLYWA+WA+IQARMSPI+F+YL YFFLRYNEYK
Sbjct: 301 QPDKPDEVSIAEVESVFVETDAYKLASHLYWAIWAIIQARMSPIEFEYLGYFFLRYNEYK 360

Query: 361 KNKEKYCSLARSFLA 376
           K K    SL  S L+
Sbjct: 361 KQKPLTFSLVTSHLS 371

BLAST of CmaCh09G011940 vs. TAIR10
Match: AT4G09760.2 (AT4G09760.2 Protein kinase superfamily protein)

HSP 1 Score: 184.5 bits (467), Expect = 1.2e-46
Identity = 109/331 (32.93%), Positives = 174/331 (52.57%), Query Frame = 1

Query: 49  ELCKDLFKQWSEL--DDSRFSVETVSGGITNQLLKVTV--EEENGSNVSTTVRLYGPNTD 108
           ++ + L  +W ++  D     V+ + G +TN++  V+   +E N       VR+YG   +
Sbjct: 19  KILQALSTKWGDVVEDFESLEVKPMKGAMTNEVFMVSWPRKETNLRCRKLLVRVYGEGVE 78

Query: 109 YVINRDRELQAIKYLSAAGFGAKLLGVFKNGMVQSFIHARTLEPSDIRKPELAAEIAKQL 168
              NRD E++  +Y++  G G  LLG F  G V+ FIHARTL  +D+R P ++A +A +L
Sbjct: 79  LFFNRDDEIRTFEYVARHGHGPTLLGRFAGGRVEEFIHARTLSATDLRDPNISALVASKL 138

Query: 169 NKFHKVYIPGSTEPQLWIEIFKCYGKASALQFDDTEKQCIYDSISFTEIHNEVLEIKELT 228
            +FH ++IPG     +W  +    G+A  L  ++   +   D I       + + + E  
Sbjct: 139 RRFHSIHIPGDRIMLIWDRMRTWVGQAKNLCSNEHSTEFGLDDI------EDEINLLEQE 198

Query: 229 SLLDAPVVFSHNDLLSGNLMLNEEEERLYFIDFEYGSYNYRGFDIGNHFNEYAG------ 288
              +  + F HNDL  GN+M++EE   +  ID+EY SYN   +DI NHF E A       
Sbjct: 199 VNNEQEIGFCHNDLQYGNIMIDEETNAITIIDYEYASYNPIAYDIANHFCEMAADYHSNT 258

Query: 289 -YDCDYSCYPSKEEQYHFFRHYLQPEEPDEVSQEDLEALYVESNIFMLASHLYWALWALI 348
            +  DY+ YP +EE+  F  +YL     +E  +ED+E L  +   + LASHL+W LW +I
Sbjct: 259 PHILDYTLYPGEEERRRFICNYL-TSSGEEAREEDIEQLLDDIEKYTLASHLFWGLWGII 318

Query: 349 QARMSPIDFDYLRYFFLRYNEYKKNKEKYCS 369
              ++ I+FDY+ Y   R+ +Y   K K  S
Sbjct: 319 SGYVNKIEFDYIEYSRQRFKQYWLRKPKLLS 342

BLAST of CmaCh09G011940 vs. TAIR10
Match: AT1G74320.1 (AT1G74320.1 Protein kinase superfamily protein)

HSP 1 Score: 172.2 bits (435), Expect = 6.2e-43
Identity = 103/329 (31.31%), Positives = 168/329 (51.06%), Query Frame = 1

Query: 49  ELCKDLFKQWSELDDSR-FSVETVSGGITNQLLKVT-VEEENGSNVSTTVRLYGPNTDYV 108
           E  + +  +W ++ DS+   V  + G +TN++ ++     E G +    VR+YG   +  
Sbjct: 23  EALQAIASEWEDVIDSKALQVIPLKGAMTNEVFQIKWPTREKGPSRKVLVRIYGEGVEIF 82

Query: 109 INRDRELQAIKYLSAAGFGAKLLGVFKNGMVQSFIHARTLEPSDIRKPELAAEIAKQLNK 168
            +R+ E++  +++S  G G  LLG F NG ++ F+HARTL   D+R PE++  IA ++ +
Sbjct: 83  FDREDEIRTFEFMSKHGHGPLLLGRFGNGRIEEFLHARTLSACDLRDPEISGRIATRMKE 142

Query: 169 FHKVYIPGSTEPQLWIEIFKCYGKASALQFDDTEKQCIYDSISFTEIHNEVLEIKELTSL 228
           FH + +PG+ +  LW  +         L   +  K    D +         +EI  L   
Sbjct: 143 FHGLEMPGAKKALLWDRLRNWLTACKRLASPEEAKSFRLDVME--------MEINMLEKS 202

Query: 229 L---DAPVVFSHNDLLSGNLMLNEEEERLYFIDFEYGSYNYRGFDIGNHFNEYAG----- 288
           L   D  + F HNDL  GN+M++EE + +  ID+EY  YN   +DI NHF E A      
Sbjct: 203 LFDNDENIGFCHNDLQYGNIMMDEETKAITIIDYEYSCYNPVAYDIANHFCEMAADYHTE 262

Query: 289 --YDCDYSCYPSKEEQYHFFRHYLQPEEPDEVSQEDLEALYVESNIFMLASHLYWALWAL 348
             +  DYS YP  EE+  F + Y+   + ++ S   ++ L  +   + LASHL W LW +
Sbjct: 263 TPHIMDYSKYPGVEERQRFLKTYMSYSD-EKPSDTMVKKLLEDVEKYTLASHLIWGLWGI 322

Query: 349 IQARMSPIDFDYLRYFFLRYNEYKKNKEK 366
           I   ++ IDFDY+ Y   R+ +Y   K +
Sbjct: 323 ISEHVNEIDFDYMEYARQRFEQYWLTKPR 342

BLAST of CmaCh09G011940 vs. TAIR10
Match: AT1G71697.1 (AT1G71697.1 choline kinase 1)

HSP 1 Score: 170.6 bits (431), Expect = 1.8e-42
Identity = 102/317 (32.18%), Positives = 162/317 (51.10%), Query Frame = 1

Query: 54  LFKQWSEL--DDSRFSVETVSGGITNQLLKVTVEEENGSNV--STTVRLYGPNTDYVINR 113
           L   W ++  D  R  V  + G +TN++ ++     NG +V     VR+YG   D   NR
Sbjct: 26  LGSSWGDVVEDLERLEVVPLKGAMTNEVYQINWPTLNGEDVHRKVLVRIYGDGVDLFFNR 85

Query: 114 DRELQAIKYLSAAGFGAKLLGVFKNGMVQSFIHARTLEPSDIRKPELAAEIAKQLNKFHK 173
             E++  + +S  G+G KLLG F +G ++ FIHARTL   D+R  E +  IA +L +FHK
Sbjct: 86  GDEIKTFECMSHHGYGPKLLGRFSDGRLEEFIHARTLSADDLRVAETSDFIAAKLREFHK 145

Query: 174 VYIPGSTEPQLWIEIFKCYGKASALQFDDTEKQCIYDSISFTEIHNEVLEIKELTSLLDA 233
           + +PG     LW  +     +A  L           D      + NE+  ++E  +  D 
Sbjct: 146 LDMPGPKNVLLWERLRTWLKEAKNL-----ASPIEMDKYRLEGLENEINLLEERLTRDDQ 205

Query: 234 PVVFSHNDLLSGNLMLNEEEERLYFIDFEYGSYNYRGFDIGNHFNEYAG-------YDCD 293
            + F HNDL  GN+M++E    +  ID+EY S+N   +DI NHF E A        +  D
Sbjct: 206 EIGFCHNDLQYGNVMIDEVTNAITIIDYEYSSFNPIAYDIANHFCEMAANYHSDTPHVLD 265

Query: 294 YSCYPSKEEQYHFFRHYLQPEEPDEVSQEDLEALYVESNIFMLASHLYWALWALIQARMS 353
           Y+ YP + E+  F   YL     +  S +++E L  ++  + LA+H++W LW +I   ++
Sbjct: 266 YTLYPGEGERRRFISTYL-GSTGNATSDKEVERLLKDAESYTLANHIFWGLWGIISGHVN 325

Query: 354 PIDFDYLRYFFLRYNEY 360
            I+FDY+ Y   R+ +Y
Sbjct: 326 KIEFDYMEYARQRFEQY 336

BLAST of CmaCh09G011940 vs. NCBI nr
Match: gi|449441185|ref|XP_004138364.1| (PREDICTED: probable ethanolamine kinase [Cucumis sativus])

HSP 1 Score: 672.5 bits (1734), Expect = 4.1e-190
Identity = 331/376 (88.03%), Positives = 350/376 (93.09%), Query Frame = 1

Query: 1   MGAEKIYNGSVDVEESVGNGDGNTESYQLSNLSIDHCLPLPAMIPCIIELCKDLFKQWSE 60
           MGA+KIYNG VDV E+V +GD N ESYQLS LS+DH LPLPA+ P IIELCKDLFK+WSE
Sbjct: 1   MGAKKIYNGDVDVVEAVEDGDSNAESYQLSYLSVDHSLPLPAITPRIIELCKDLFKEWSE 60

Query: 61  LDDSRFSVETVSGGITNQLLKVTVEEENGSNVSTTVRLYGPNTDYVINRDRELQAIKYLS 120
           LD SRFSVETVSGGITNQLLKVTV+EE+G++VS TVRLYGPNTDYVINRDRELQAIKYLS
Sbjct: 61  LDASRFSVETVSGGITNQLLKVTVKEESGTSVSVTVRLYGPNTDYVINRDRELQAIKYLS 120

Query: 121 AAGFGAKLLGVFKNGMVQSFIHARTLEPSDIRKPELAAEIAKQLNKFHKVYIPGSTEPQL 180
           AAGFGAKLLGVFKNGMVQSFIHARTLEPSD+RKPELAAEIAKQLNKFHKVYIPGS EPQL
Sbjct: 121 AAGFGAKLLGVFKNGMVQSFIHARTLEPSDLRKPELAAEIAKQLNKFHKVYIPGSNEPQL 180

Query: 181 WIEIFKCYGKASALQFDDTEKQCIYDSISFTEIHNEVLEIKELTSLLDAPVVFSHNDLLS 240
           W EI   Y KAS LQFDDT KQ IYD+ISF EIHNE+LEIKELTSLL+AP+VF+HNDLLS
Sbjct: 181 WNEILNFYDKASTLQFDDTGKQSIYDTISFQEIHNEILEIKELTSLLNAPIVFAHNDLLS 240

Query: 241 GNLMLNEEEERLYFIDFEYGSYNYRGFDIGNHFNEYAGYDCDYSCYPSKEEQYHFFRHYL 300
           GNLMLNEEE RLYFIDFEYGSY+YRGFDIGNHFNEYAGYDCDYSCYPSKEEQYHFFRHYL
Sbjct: 241 GNLMLNEEEGRLYFIDFEYGSYSYRGFDIGNHFNEYAGYDCDYSCYPSKEEQYHFFRHYL 300

Query: 301 QPEEPDEVSQEDLEALYVESNIFMLASHLYWALWALIQARMSPIDFDYLRYFFLRYNEYK 360
           QPE+PDEVSQ+DLEALYVESN FMLASHLYWALWALIQARMSPIDFDYL YFFLRY EYK
Sbjct: 301 QPEKPDEVSQKDLEALYVESNTFMLASHLYWALWALIQARMSPIDFDYLSYFFLRYGEYK 360

Query: 361 KNKEKYCSLARSFLAR 377
           K KEKYCSLARSFLAR
Sbjct: 361 KQKEKYCSLARSFLAR 376

BLAST of CmaCh09G011940 vs. NCBI nr
Match: gi|659126174|ref|XP_008463050.1| (PREDICTED: probable ethanolamine kinase isoform X2 [Cucumis melo])

HSP 1 Score: 665.2 bits (1715), Expect = 6.6e-188
Identity = 329/375 (87.73%), Positives = 347/375 (92.53%), Query Frame = 1

Query: 1   MGAEKIYNGSVDVEESVGNGDGNTESYQLSNLSIDHCLPLPAMIPCIIELCKDLFKQWSE 60
           MGA+KIYNG VDV E+V +G  N E YQLS LS+D  LPLPAM P IIELCKDLFK+WSE
Sbjct: 1   MGAKKIYNGFVDVVEAVEDGHSNAEPYQLSTLSVDLSLPLPAMTPRIIELCKDLFKEWSE 60

Query: 61  LDDSRFSVETVSGGITNQLLKVTVEEENGSNVSTTVRLYGPNTDYVINRDRELQAIKYLS 120
           LD SRFSVETVSGGITNQLLKVTV+EE+G++VS TVRLYGPNTDYVINRDRELQAIKYLS
Sbjct: 61  LDASRFSVETVSGGITNQLLKVTVKEESGTSVSVTVRLYGPNTDYVINRDRELQAIKYLS 120

Query: 121 AAGFGAKLLGVFKNGMVQSFIHARTLEPSDIRKPELAAEIAKQLNKFHKVYIPGSTEPQL 180
           AAGFGAKLLGVFKNGMVQSFIHARTLEPSD+RKPELAAEIAKQLNKFHKVYIPGS EPQL
Sbjct: 121 AAGFGAKLLGVFKNGMVQSFIHARTLEPSDLRKPELAAEIAKQLNKFHKVYIPGSNEPQL 180

Query: 181 WIEIFKCYGKASALQFDDTEKQCIYDSISFTEIHNEVLEIKELTSLLDAPVVFSHNDLLS 240
           W E+ K Y KAS LQFDDT KQ IYD+ISF EIHNE+LEIKELTSLL+APVVF+HNDLLS
Sbjct: 181 WNEVLKFYEKASTLQFDDTGKQSIYDTISFQEIHNEILEIKELTSLLNAPVVFAHNDLLS 240

Query: 241 GNLMLNEEEERLYFIDFEYGSYNYRGFDIGNHFNEYAGYDCDYSCYPSKEEQYHFFRHYL 300
           GNLMLNEEE RLYFIDFEYGSY+YRGFDIGNHFNEYAGYDCDYSCYPSKEEQYHFFRHYL
Sbjct: 241 GNLMLNEEEGRLYFIDFEYGSYSYRGFDIGNHFNEYAGYDCDYSCYPSKEEQYHFFRHYL 300

Query: 301 QPEEPDEVSQEDLEALYVESNIFMLASHLYWALWALIQARMSPIDFDYLRYFFLRYNEYK 360
           QPE+PDEVSQ+DLEALYVESN FMLASHLYWALWALIQARMSPIDFDYL YFFLRY EYK
Sbjct: 301 QPEKPDEVSQKDLEALYVESNTFMLASHLYWALWALIQARMSPIDFDYLSYFFLRYGEYK 360

Query: 361 KNKEKYCSLARSFLA 376
           K KEKYCSLARSFLA
Sbjct: 361 KQKEKYCSLARSFLA 375

BLAST of CmaCh09G011940 vs. NCBI nr
Match: gi|659126172|ref|XP_008463048.1| (PREDICTED: probable ethanolamine kinase isoform X1 [Cucumis melo])

HSP 1 Score: 660.2 bits (1702), Expect = 2.1e-186
Identity = 329/377 (87.27%), Positives = 347/377 (92.04%), Query Frame = 1

Query: 1   MGAEKIYNGSVDVEESVGNGDGNTESYQLSNLSIDHCLPLPAMIPCII--ELCKDLFKQW 60
           MGA+KIYNG VDV E+V +G  N E YQLS LS+D  LPLPAM P II  ELCKDLFK+W
Sbjct: 1   MGAKKIYNGFVDVVEAVEDGHSNAEPYQLSTLSVDLSLPLPAMTPRIISRELCKDLFKEW 60

Query: 61  SELDDSRFSVETVSGGITNQLLKVTVEEENGSNVSTTVRLYGPNTDYVINRDRELQAIKY 120
           SELD SRFSVETVSGGITNQLLKVTV+EE+G++VS TVRLYGPNTDYVINRDRELQAIKY
Sbjct: 61  SELDASRFSVETVSGGITNQLLKVTVKEESGTSVSVTVRLYGPNTDYVINRDRELQAIKY 120

Query: 121 LSAAGFGAKLLGVFKNGMVQSFIHARTLEPSDIRKPELAAEIAKQLNKFHKVYIPGSTEP 180
           LSAAGFGAKLLGVFKNGMVQSFIHARTLEPSD+RKPELAAEIAKQLNKFHKVYIPGS EP
Sbjct: 121 LSAAGFGAKLLGVFKNGMVQSFIHARTLEPSDLRKPELAAEIAKQLNKFHKVYIPGSNEP 180

Query: 181 QLWIEIFKCYGKASALQFDDTEKQCIYDSISFTEIHNEVLEIKELTSLLDAPVVFSHNDL 240
           QLW E+ K Y KAS LQFDDT KQ IYD+ISF EIHNE+LEIKELTSLL+APVVF+HNDL
Sbjct: 181 QLWNEVLKFYEKASTLQFDDTGKQSIYDTISFQEIHNEILEIKELTSLLNAPVVFAHNDL 240

Query: 241 LSGNLMLNEEEERLYFIDFEYGSYNYRGFDIGNHFNEYAGYDCDYSCYPSKEEQYHFFRH 300
           LSGNLMLNEEE RLYFIDFEYGSY+YRGFDIGNHFNEYAGYDCDYSCYPSKEEQYHFFRH
Sbjct: 241 LSGNLMLNEEEGRLYFIDFEYGSYSYRGFDIGNHFNEYAGYDCDYSCYPSKEEQYHFFRH 300

Query: 301 YLQPEEPDEVSQEDLEALYVESNIFMLASHLYWALWALIQARMSPIDFDYLRYFFLRYNE 360
           YLQPE+PDEVSQ+DLEALYVESN FMLASHLYWALWALIQARMSPIDFDYL YFFLRY E
Sbjct: 301 YLQPEKPDEVSQKDLEALYVESNTFMLASHLYWALWALIQARMSPIDFDYLSYFFLRYGE 360

Query: 361 YKKNKEKYCSLARSFLA 376
           YKK KEKYCSLARSFLA
Sbjct: 361 YKKQKEKYCSLARSFLA 377

BLAST of CmaCh09G011940 vs. NCBI nr
Match: gi|728834334|gb|KHG13777.1| (putative ethanolamine kinase A [Gossypium arboreum])

HSP 1 Score: 552.4 bits (1422), Expect = 6.2e-154
Identity = 267/358 (74.58%), Positives = 313/358 (87.43%), Query Frame = 1

Query: 18  GNGDGNTESYQLSNLSIDHCLPLPAMIPCIIELCKDLFKQWSELDDSRFSVETVSGGITN 77
           GN   +  S   SNLS+D  LP P M+P +I LCKDLF++W++L+DS FSV+TVSGGITN
Sbjct: 22  GNSTFDDHSILHSNLSVDTALPFPLMVPRVIALCKDLFRKWAKLNDSCFSVDTVSGGITN 81

Query: 78  QLLKVTVEEENGSNVSTTVRLYGPNTDYVINRDRELQAIKYLSAAGFGAKLLGVFKNGMV 137
            LLKV+V+EENG  VS TVRLYGPNT+YVINR+RELQAIKYLSAAGFGAKLLGVF NGMV
Sbjct: 82  LLLKVSVKEENGEYVSVTVRLYGPNTEYVINRERELQAIKYLSAAGFGAKLLGVFGNGMV 141

Query: 138 QSFIHARTLEPSDIRKPELAAEIAKQLNKFHKVYIPGSTEPQLWIEIFKCYGKASALQFD 197
           QSFI+ARTL P+D+RKP+L +EIAKQL +FH+V IPGS EPQLW++IFK + KAS LQF+
Sbjct: 142 QSFINARTLTPADMRKPKLVSEIAKQLRRFHQVEIPGSKEPQLWVDIFKFFEKASTLQFE 201

Query: 198 DTEKQCIYDSISFTEIHNEVLEIKELTSLLDAPVVFSHNDLLSGNLMLNEEEERLYFIDF 257
           DT+KQ  Y++ISF E+H EV E+KELT+LL++PVVF+HNDLLSGNLM N+E+E+LY IDF
Sbjct: 202 DTDKQRTYETISFKEVHKEVKELKELTTLLNSPVVFAHNDLLSGNLMHNDEQEKLYLIDF 261

Query: 258 EYGSYNYRGFDIGNHFNEYAGYDCDYSCYPSKEEQYHFFRHYLQPEEPDEVSQEDLEALY 317
           EYGSYNYRGFDIGNHFNEYAGYDCDYS YPSK+EQYHFFRHYL+PE+P EVS++DLEALY
Sbjct: 262 EYGSYNYRGFDIGNHFNEYAGYDCDYSLYPSKDEQYHFFRHYLEPEKPCEVSEKDLEALY 321

Query: 318 VESNIFMLASHLYWALWALIQARMSPIDFDYLRYFFLRYNEYKKNKEKYCSLARSFLA 376
           VE+N FMLASHLYWALWALIQARMSPIDFDYL YFFLRYNEYKK K    SLA+S ++
Sbjct: 322 VETNTFMLASHLYWALWALIQARMSPIDFDYLGYFFLRYNEYKKQKRMCFSLAKSHIS 379

BLAST of CmaCh09G011940 vs. NCBI nr
Match: gi|470101773|ref|XP_004287343.1| (PREDICTED: probable ethanolamine kinase [Fragaria vesca subsp. vesca])

HSP 1 Score: 551.2 bits (1419), Expect = 1.4e-153
Identity = 264/356 (74.16%), Positives = 309/356 (86.80%), Query Frame = 1

Query: 21  DGNTESYQLSNLSIDHCLPLPAMIPCIIELCKDLFKQWSELDDSRFSVETVSGGITNQLL 80
           D +    + S+ S+D  LPLP + P + +LCKDLFK+WS+LDDSRFSVETVSGGITN LL
Sbjct: 18  DNSAIQIRFSSQSVDPSLPLPQITPLVTKLCKDLFKEWSDLDDSRFSVETVSGGITNLLL 77

Query: 81  KVTVEEENGSNVSTTVRLYGPNTDYVINRDRELQAIKYLSAAGFGAKLLGVFKNGMVQSF 140
           K TV+E++G+ VS TVRLYGPNTDYVINR+RELQAIKYLSAAGFGA LL VF NGMVQSF
Sbjct: 78  KATVKEDDGNEVSVTVRLYGPNTDYVINRERELQAIKYLSAAGFGASLLAVFGNGMVQSF 137

Query: 141 IHARTLEPSDIRKPELAAEIAKQLNKFHKVYIPGSTEPQLWIEIFKCYGKASALQFDDTE 200
           I+ARTL P D+R P+LAA+IAK+L +FH+V IPGS EPQLW +IFK Y KASAL+FDD E
Sbjct: 138 INARTLVPLDMRDPKLAADIAKELRRFHQVEIPGSKEPQLWTDIFKFYEKASALEFDDNE 197

Query: 201 KQCIYDSISFTEIHNEVLEIKELTSLLDAPVVFSHNDLLSGNLMLNEEEERLYFIDFEYG 260
           KQ IY++ISF+E+HNE++E+KELTS  +APVVF HNDLLSGN+M+N+EEE+LYFIDFEYG
Sbjct: 198 KQKIYETISFSEVHNELVEVKELTSHFNAPVVFCHNDLLSGNIMVNDEEEKLYFIDFEYG 257

Query: 261 SYNYRGFDIGNHFNEYAGYDCDYSCYPSKEEQYHFFRHYLQPEEPDEVSQEDLEALYVES 320
           SYNYRGFDIGNHFNEYAGY+CDYS YP+KEEQYHFFRHYL PE+P  VS +DLEALY+E+
Sbjct: 258 SYNYRGFDIGNHFNEYAGYECDYSLYPTKEEQYHFFRHYLGPEKPQAVSDKDLEALYIEA 317

Query: 321 NIFMLASHLYWALWALIQARMSPIDFDYLRYFFLRYNEYKKNKEKYCSLARSFLAR 377
           N +MLASHLYWALW LIQA+ SPI+FDYL YFFLRYNEYKK KEK   LARSFL+R
Sbjct: 318 NTYMLASHLYWALWGLIQAKFSPIEFDYLSYFFLRYNEYKKQKEKCLLLARSFLSR 373

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
EKI_ARATH	6.3e-146	67.20	Probable ethanolamine kinase OS=Arabidopsis thaliana GN=EMB1187 PE=2 SV=1
EKIA_DICDI	2.0e-59	37.85	Probable ethanolamine kinase A OS=Dictyostelium discoideum GN=etnkA PE=3 SV=1
EKI2_MOUSE	1.0e-55	37.69	Ethanolamine kinase 2 OS=Mus musculus GN=Etnk2 PE=1 SV=1
EKI2_RAT	5.6e-54	37.08	Ethanolamine kinase 2 OS=Rattus norvegicus GN=Etnk2 PE=3 SV=1
EKI1_HUMAN	1.4e-52	36.25	Ethanolamine kinase 1 OS=Homo sapiens GN=ETNK1 PE=1 SV=1
Match Name	E-value	Identity (%)	Description
A0A0A0KDN9_CUCSA	2.9e-190	88.03	Uncharacterized protein OS=Cucumis sativus GN=Csa_6G018610 PE=4 SV=1
A0A0B0NMB3_GOSAR	4.4e-154	74.58	Putative ethanolamine kinase A OS=Gossypium arboreum GN=F383_18311 PE=4 SV=1
B9MTI9_POPTR	1.7e-153	72.95	Choline/ethanolamine kinase family protein OS=Populus trichocarpa GN=POPTR_0006s...
A0A067KBK1_JATCU	2.8e-153	72.49	Uncharacterized protein OS=Jatropha curcas GN=JCGZ_13308 PE=4 SV=1
A0A061GV59_THECC	3.7e-153	72.03	Kinase superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_038050 PE=4 SV=1
Match Name	E-value	Identity (%)	Description
AT2G26830.1	3.5e-147	67.20	Protein kinase superfamily protein
AT4G09760.2	1.2e-46	32.93	Protein kinase superfamily protein
AT1G74320.1	6.2e-43	31.31	Protein kinase superfamily protein
AT1G71697.1	1.8e-42	32.18	choline kinase 1
Match Name	E-value	Identity (%)	Description
gi|449441185|ref|XP_004138364.1|	4.1e-190	88.03	PREDICTED: probable ethanolamine kinase [Cucumis sativus]
gi|659126174|ref|XP_008463050.1|	6.6e-188	87.73	PREDICTED: probable ethanolamine kinase isoform X2 [Cucumis melo]
gi|659126172|ref|XP_008463048.1|	2.1e-186	87.27	PREDICTED: probable ethanolamine kinase isoform X1 [Cucumis melo]
gi|728834334|gb|KHG13777.1|	6.2e-154	74.58	putative ethanolamine kinase A [Gossypium arboreum]
gi|470101773|ref|XP_004287343.1|	1.4e-153	74.16	PREDICTED: probable ethanolamine kinase [Fragaria vesca subsp. vesca]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR011009	Kinase-like_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0016310	phosphorylation
biological_process	GO:0008150	biological_process
cellular_component	GO:0005575	cellular_component
molecular_function	GO:0016301	kinase activity
molecular_function	GO:0003674	molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CmaCh09G011940.1	CmaCh09G011940.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR011009	Protein kinase-like domain	unknown	SSF56112	Protein kinase-like (PK-like)	coord: 46..365; score: 2.62
None	No IPR available	GENE3D	G3DSA:3.10.450.110		coord: 42..129; score: 4.6
None	No IPR available	GENE3D	G3DSA:3.90.1200.10		coord: 130..367; score: 3.1
None	No IPR available	PANTHER	PTHR22603	CHOLINE/ETHANOALAMINE KINASE	coord: 27..376; score: 8.7E
None	No IPR available	PANTHER	PTHR22603:SF24	PROTEIN CKA-1, ISOFORM A	coord: 27..376; score: 8.7E
None	No IPR available	PFAM	PF01633	Choline_kinase	coord: 93..291; score: 6.5
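
The alignment coordinates above are inclusive, 1-based residue positions on the protein. As a hedged illustration of how they can be read, the Python sketch below parses a `coord: start..end` string and computes the number of residues covered; the helper names `parse_coord` and `span_length` are hypothetical and not part of InterProScan or this database.

```python
import re

# Hypothetical pattern for the "coord: start..end" strings shown in the table.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Return (start, end) as integers from a 'coord: start..end' string."""
    m = COORD_RE.search(text)
    if m is None:
        raise ValueError("no coordinates found in %r" % text)
    return int(m.group(1)), int(m.group(2))


def span_length(start, end):
    """Number of residues covered, assuming inclusive 1-based coordinates."""
    return end - start + 1


# Coordinates copied from the table above, keyed by source term.
HITS = {
    "SSF56112": "coord: 46..365",
    "G3DSA:3.10.450.110": "coord: 42..129",
    "G3DSA:3.90.1200.10": "coord: 130..367",
    "PTHR22603": "coord: 27..376",
    "PF01633": "coord: 93..291",
}

if __name__ == "__main__":
    for term, text in HITS.items():
        start, end = parse_coord(text)
        print(term, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the Pfam Choline_kinase match (93..291) covers 199 residues, and the two Gene3D matches (42..129 and 130..367) are adjacent without overlapping.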

The following gene(s) are paralogous to this gene:
Gene	Paralogue	Organism	Block
CmaCh09G011940	CmaCh01G009570	Cucurbita maxima (Rimu)	cmacmaB050
The following block(s) are covering this gene:
Gene	Organism	Block
CmaCh09G011940	Cucurbita pepo (Zucchini)	cmacpeB046
CmaCh09G011940	Watermelon (97103) v2	cmawmbB046
CmaCh09G011940	Cucurbita maxima (Rimu)	cmacmaB056
CmaCh09G011940	Cucurbita moschata (Rifu)	cmacmoB027
CmaCh09G011940	Cucurbita moschata (Rifu)	cmacmoB056
CmaCh09G011940	Watermelon (Charleston Gray)	cmawcgB039
CmaCh09G011940	Watermelon (97103) v1	cmawmB016