CSPI01G05030 (gene) Wild cucumber (PI 183967)

Name: CSPI01G05030
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Kinase, putative
Location: Chr1: 3251714..3275665 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

ATCCAACATTATGACTAACATTACAGGATGCTATGGATCATCATGGGGATTTTCTGAAGGATTAAGAAATTTAGATTTCTGTACGCCAGAGGTAATGTAATGTTAATTTCATCTTTCCCTCTAATTGCAAGATTGAAATTCACATATTTTATTTGAGTTTAATATATCTGTTAGTGATATATAATCAAATTTGCCTTTACCCACAATTTTAAGCTTTTAGGTCAAGTGGTGATTCAAGGTGGTATTAGAGTTTTCATCTCTTCCATCACTGCCTAATTTCCAAATAGGCTATGGCCTTCTTCTTCTTCCCCCTCCTCTTCTCTCTTCTCTCTTTCCTCTTTTTTTTTTTTAAATTTAAAAAGAATGTTCCTTTTCTCACATATAAACGAACACACCTTTTTGCTTATCTTGTGAGGTACTTGAGATTGTTATTAATCCTCCTCTTTCCAGCCCTCTTATGATTGTGAAAATCCAAAGGAGTCTGAGTCTCCTCGCTTTCAAGCTATACTCCGAGTCACAAGTGCTCCTCGAAGAAAGTATCCTGCTGACATCAAAAGTTTTTCCCACGAACTAAATTCGAAAGGCGTACGACCTTTCCCACTTTGGAAGTCACGTCGTTTAAATAATTTGGAGGTACTGTCTTTTTTTTGAGTATTATATTTTTCTAGCTCCACAAAAAATTATGCCTCCTGAAGCTTATGCTGTTTGTAATTACATTTTTTACAGGAGATCTTAGTTATGATACGAGCAAAATTTGACAAGGCTAAGGAAGAAGTAAATTCTGACTTGGCTATTTTTGCTGCTGATCTGGTTGGAGTTCTTGAGAAAAATGTTGATACTCATCCAGAGTGGCAAGAGACAATCGAAGACTTGTTGGTTTTAGCTCGAAGCTGTGCAATGTCATCCCCAGGTGAATTTTGGCTTCAGTGTGAAAGCATTGTTCAGGAATTGGATGATAGACGCCAAGAACTTCCTCCTGGCATGCTGAAACAACTTCATACACGAATACTGTTCATTCTCACTAGATGTACCAGGTTGCTGCAGTTCCATAAAGAAAGTGGGTTAGCTGAAGATGAAAATGTTTTTCAACTTCGTCAGTCAAGAAATCTGCATTCTGCTGATAAACGCACGGCTCCGGCTATGGGAAGGGAAATGAAAAGTTCTAGCGCAGCCAAGGCTTCGAAGGGAACATCTTCCAGGAAATCTTACAGCCAGGAGCAGCGTGGTTTGGATTGGAGCAGAGAACATGATATACTGCCTGGGAATAATGTTTCAACATCTCCTGATGATACTTCAAAGAACTTGGTATCTCCTACTGGCAGAGATCGGATGGCATCTTGGAAAAGATTGCCTTCTCCAGCAAAAAGCAGCGTGAAAGAATCTACTTTGAAGGTTCAGGGTGATAATAAAAAATTTAAATCTCTGAACATGTCAAAAATCAGGATGAGTGTTTCTGAAACTGATTTTGCTGCTGCTAAGGTTTCTGAGCTTCCTCTGCCCAGAGATTCGCATGACCAACCTACAAAGCACCAACATAAACCTTCTTGGGGTTGGGGTGATCAACCAACTGTATCTGATGAGAGTTCAATAATATGTCGAATTTGTGAAGAAGAAATTCCTACTGCAAATGTGGAAGATCATTCAAGAATTTGTGCAGTTGCTGATAGATGTGATCAAAAGGGTATAAGTGTTAATGAACGTCTACTCAGAATTGCTGAAACTTTAGAGAAGATGATCGAGTCTTTTACGCAGAAGGATTCTCAGCATGTAGGAAGTCCGGATGTTGCAAAAGTTTCAAATTCTAGCATGACCGAAGAATCTGATATTTCCCCAAAACATAGTGATTGGTCCCGCAGGGGCTCAGAAGACATGCTAGATTGCTTCCATGAAACTGATAGTTCTGTATTCATGGATGACTTAAAAGGTTTACCTTCCATGTCTTGTAAAACTCGTTTTGGTCCAAAGTCTGATCAAGGTATGACAACATCATCGGCTGGTAG
CATGACTCCTAGATCTCCTTTATTAACACCAAGAACCAGTCAGTTTGACTTGTTTTTGGCAGGAAAAGGTGCTTACTATGAACATGACGATCTTACGCAGGTATCCCTCATGTCAATGCTAATTTCTTTTGGACATTTTTTAATTGTAAGATATGTTGTGGCATGTATGACATCTTTGAAGAGCCAAAATATAGCAAAGGAACTGAATCGTAGTGTAGATATTTTCTCCAATAAGCTATAAGTTGTAAAAAGTTTACTGATTGATGTTAACATGATTTACTTCCACTAAGCGATTAGTTAATTAATGTATTTTTCTCAAATTATATTAATGATGAGAATCGAAACAACCATAGTGGAAAATACTATGTATCACAGCAGTAAAAAATTATCAATAAAATATTACAATATACAGAGTACAAACTCCAAAAAAATAGTGAAAAGAACTTTATGTAAAAATAAGAAGAAAACTTCCTAATTCTACTACTCAAGTTGATTATACTTGTTGAATAATCTCTAAACTTAAAACTGCTGCTCCATAGAAAGTGTTTACACTTGCTTTGGGCTAATGCTGTCAAGCCACTTCTTATAGGAAGTTGTTTTGAAAGAAAAGAAACCAACGGTTTTTTTAATGACAATCCTCGTCCGTGGGTGGATCGTATTGAAGCGGCATGTTTAAATGCATCTTCATGGCATTTTCCCTCTAACCTATTTGTTAATCTATTCAAGATATTTGTTTGAACTCGGGAGCATTTATTTTGTCTAAACAGTCCTAGTTGTAATGTTGGTTTAGTCCTGATTTCATATGTTTGAAGGGTGGTAAACTTTTAGTCCTTCATAAACCAAGGAGAATTTCATGTAGTTCTTTCACATTAAGATTTTTATGAGCAACAGAAGTTTCATGTAAGAATAAAAATACCAATTTTCTGATGATTAAAACATATTATCATTAAATCTTGGTTCGTGATTACTAGTTAATGTCCAATAATGAGCAAAAACTTCTTATCTTTGCAGATCAGTGATTTGGCAGATATTGCACGTTGTTCAGCTAATACACCTCTGGGGGATGATTGCTCGATGCAGTATCTGCTAACATGCCTTGAAGACTTGAGAGTTGTCATTAACCGCAGGAAGTTTAATGCACTGACAGTTGACACTTTTGGCACTCGCATTGAAAAGTTAATCCGGTTTGTAGTTTTTGCAGCATAAATGTGATCTTCAAAATATATGCACTTTACCACGTGTTGCATTCTATTGTCAACTTCTCTAATGACTCCCGTCATGACAAATTTCTTTATAATGTCTCTCAAATGTGTTGACTGCTATATTTCTTGACTTTTGATGGTTTGTGATTTCCTAGGGAGAAGTATTTGCAACTTTGTGAGTTGGTGGATGATGAAAAGATCGACATAGCAAGCACTGTCATTGATGAAGATACTCCTCTTGAAGATGATATTGTTCGTAGCTTAAGAACTAGTCCCATCCATTCTTCAAAGGATCGTACATCAATTGATGACTTTGAGATTATAAAACCAATCAGCCGTGGGGCATTTGGTCGGGTTTTCTTGGCTAAAAAGAGGACTACGGGGGATCTTTTTGCAATAAAGGTTAATTTTCACATTGATGCTTAAATAATGTTCCCAGCATGTTCTACCTGAGGTTTTGGCCTCTTCTTTTCTGTGCTTGTGATGAGTTCATCATTTAGTGGTATTTATGTTTCTTCTTCTTCTTTAGGAATTTCTATGACATCTATAATCTTTTTTTTGGAAACGGATACAAACCTCTTTATTCATGTATAAATAAAAGAGATTAATGTTCAAAGTACTAGAGAATTATACTAAGAGCGAAAAGAGCGATTGAAACAAAAGTACAAATGAACAATACAGACAACACAAACGAAACCCTGAGCAAAACAGTAGAAAAATATGGCTCTCTTTGTATAACTCTCCTGTACTTTGTGTTCTTATTAATAAAGAAGTTCTTGTCTCCATTTAAAAAAAACA
GTAGAAAAATATGAACAAGGCAAAATTAAAATCTTTCAACAAAAAGCCCAAAAACAACTGATGATTGCAATCTAAAAACGTCAAAATGGATGGGAGGAAACACCAAGGGACAGCAAACATCCATGCAGCTTCAATTATGCCTTCTAAGTAGTCCTTAAATCTTGAGTGAGTGACTAGAACCAGAAAATTTCAGGAGAATTGATATAACTCAAGTGAGAATTTCTATTAATGTTTTTGAGTTTTGAATACAAGAAGACAACTACTCTATTTATAGCAAAAGCATAATCTAATCCTAATTAATAAAAGAAACTTAATCCTAATCCTAAATAATAAAAGAAACTATATATTTTTTTTTAAATGGAAACAAGCCTCTTTATTAATGATTATGAGACAAAGCTCAAAGTACAAGAGCATTATACTAAAAGCAAAAAGAATTAAGGGAAAATACAAGCTAAAACTTAAATAATCGGGTAACATAAATATGATGGAATTCTAAAAACGAACCCGAGCAGAAAATAACGAGCTAACCAAAGCCACGCAAAGACAATACATCAGTAGAAAGCTAAATACATGAGGGTGTCTAACTCAGAAGGGAAAAAAGAACGTAAACCTCTTCATCTTTCAAGGCGTCCCTAACTTTCAGTTGATCAATGGGGTTTTTAAAGTCTTTAGCAATAAGAGTTCCTCCGGTGAAGCTAGGGAGATCCAAAATCTTGATATCTCCAAAATTCAAGAAAATAATTACCTCTATTAAAATATGTAATTTCAATAGTTGAAGGCATAGAAGATATTACCTCTATTAAAATATGTAATTTATTAGAGGCTGGGGCTTTCAGTTGGGGGTTTTCCCCTTTTCATTTTCTTTAATAGTTGGTTATTTGACAAACGGTGTTGCAGACTCATTGAGAACTCCTTAGATTCTGAAAGTCACGAAGGATGGCCAGGCTTTGTTATATCATCCAAACTTCAATCCTTAAAAATTTAGCTGAAAAATTTGAACAGTGACAGTGGGAAAGCCAGAAAATATAATGAAGAAAGCTTGCTGAAAGCTTTAGCAAATGAAGAAGGTAAGGATGAGCAACTGGCTTTGGCATCAGAAAGTTGTAACCTAAAATGGATGCTGAAAGCTGACCTTCTGGAGCTATGTAGAAGTGAGGAAAGAGACCTTATTCAGAAAAGTAAATTAAATTGGTTGAAATTAGGGGATGAGAACACCAAGTTTTTCCATCGGTTTCTAGCTGCCAAAAAAGGAAGAAATCTGATTTCCAAATTAGTCAACGACCAAGAGGTTCCAACTTCCTCATATGTGGAGATTAAAGCTCCACAAGCTGGCCACTTTCCTTCACCCCTAAACTTTGCAGTAGTTACCAATGATCAGAATAAAAGCTTGACCTCAGAATTTTCAGTAGAAGCAATAAAATTAGCACTGAAATTGTTGGGAAGAAGTAAAGCGCTGGGGCCAGAGAATCTATCGCAGAATTCTTAATCAAATTTTGGAAAAAATTGAAGAGAGATTTCCATTCCCTTTTTAAAGAATTCTACGCTAATGGGAAGCTCAATCTTGTGTAAGGGAAAACTTCATCTGTTTAATCAAGAAAAAGAGGATGCTTCAAAGGTGAAAAATTTCAGGCCAATAAGTCTCACTACTTTGACTTACAAGCTGGTGGCAAAAGTATTGGCTGAAAAACTTGAGAAGATGATGCCAAGCATTATTGACACTCCTCAGAGTGCCTTTTTAGAAGGAAGGCAAATCATCGAATACCGAGCGAAAAAGAAGAAAAAGTAATGGTGGTTGTTGTTTAAGTTGGACCTTGAGAAAGTCTTTGATAGGGTTAATTGGGACTTTTTAGAAAAAGTAATGGTGCAGAAAAATTCTGAAAGGAAGAAAGAGATGAAAAAGGCCAAACATGAAATCTCTCACTTTACAAAAAAACCACTATCTTGCTTCAACAATCCTTCCCAAACACCTTGAGGAACACTGTATTCCCATTCAAATATTA
AGATTAAATGTCTGAGGGACCAGAAAACATGTCTACTTTCAGTGCTCAAGCCAAAACTTACCACATTTCTCAACTAAGGAGTTCAAATGATCTGGAATACCACATTGATGAGCTCAAGCCAAGACATGCTGGAGTTAATTTCGAAAAAGGCAGTCTCGGATAATCTAGACAAAAGATTATGGCTGTTGGAACCTTCAGGCAAATTTTCAGTAAAGTCGTCAACCTCTCACCTTTTTCCTTCACCACTAAATGGCAGATTATACAATTACTCTGAAAATCACCCAGGGAGTTTAAATAGGGTAAGACAACAGATAAGACAGAAAGGCCTAAGAAAGCCTTAGTTCTAAATCTGATGGCTTTCCTTTCTCTAATAAGGCAATCTTTGTTTCAGAAAAGGCCAAGTAATTCATAAAGCTAACTTAAGGATCCTCTTGTATTACCTCCCACATATGACATGAATAGCGGCAAACAACTACAAAGAGAACTGCTGTACTCCTATAATTGCTGCTAACCTACCTAACTAAGAATATACGACAATCTCGAAAATGTGCCATTGAGATCATCAGATCCAGAGATCAAGTTTGCCAAGAACTACTCATTGTAGAATTTTCAAATATGTGGTGCAAACAGAGGCAAGAATAGACCACTCCAACCTTTTTGGAATAGACGTGGTGATCAACCTTTTGACTCTCTTCTTCTTTTCTTTCCTGACTTGGATTCACTGCCCTGATCATTGGATGAATTTAGTGACTTTCCCATTGAGTTCTCCATTGTTTTCCCAGAAGTAGATGAATTCTTCACATCTCTGCTTTCTTTTTCTTTTGCTTTTGAAGATGTGAACTTAACCTTGGTGGAGAGATTGGTATTATTAGACTTATCGCCTATTTTGGTAGACTTGCCTTTTGAAACAGGAGTGTTTGGGGTTTCTTGTTTGGACTTCCCCGTCTTCGATACATCTTGCTCGTTGGACTTTGCAACAACGGCAGGCGTGGATGTCTCATCATCCTAAATGGCAGATTATACAAAGCATTATGGAAGACTAAAAGCTCAAGTGTAGTCAACATACTGAGCTGGATAATGACTTTTGGGGCTTTAAACTGCTCAGTTATGCAGAGGAAGCTTATGGATATCTGCCTTTCTCCATCCGTGTGTCCACTATGTAAAAATAAAGGGGAGGAGTTACAACACTTCTCCTTTGATTGCTCTTATGCTGCAAACTGTTGGAGGAAATTCTTACTCGGGATATGATGCTTGGCGCTATGGGGCGTCTTCCTAGCTGAGATGTTCGAATGCGCCTCTTGATCCTTCTAGGACTCCCTTGATTATAACTTCTTTATTACTCTCTTTGTATAAATCTCTTGTACTTCGAGTTCACATTAATAAAGAGGTTTTGTCTCCGTTTCAAAAAAAACTGTTGGAGGAAATTCTTTTCAATTTTTCGAACTGCTTGGGTTTTTGGAAATAACTTTAGTCATAATGTACAACAGCTACTATAAGGGCATAGTTTGAAGAAGAAATGACGACTTTTATGGAAAAACATGGTTAAAGATTTGGTTTGCAGATTTTTGTTTTAAAAGAAATCAAAGGACATTTAATGACAAGGAGATGAGATGGTTAGACCGTTTTTGTTCGGCAGCTCATAACGCTTTAGCTTGGTGCTCCCCAAATAAAGAATTCAAAGCTTTTAGCATTCAGGAGATTAGCTTAGATTGGAGGGCTTTTATTTTTTCAGATTGATAATTCTTCTTGTAGTTGGATTTTTGTTTGTTTCTCTGCATTGTATGGGGTGGGGGCAGATTTGGCCTGTATATTGATCGTTTGTTTTGATGCTAGTTATTAGGAGGAGATGATGAGGGCACTAAGGGGGTGTCAACCTAGTTGAGATGTCTGGGTGTGTCTACTTATCTCCATGATCTCTTTGTGTGCTCTTTGTATAATTCTCTTGTACTTTGAGCATTTGTCTTGATTTTATTTTGTCTTGATTTTATTTAAAAAATTTG
TGACATATCACCCCCCCCCTTTTTACAGTATCATAAAATTATTGAACTTGCATTTATTGAATATTTATGTGAGCAACACCAGGTTTTGAAGAAAGCGGATATGATTCGGAAGAACGCGGTCGAAAGTATTTTGGCAGAACGTAATATTTTAATTTCTGTGCGCAATCCTTTTGTGGTGAGCTTGTCTATTTGGTCATGTAAGTTATGCCTTATTTTCCTTTTGGTTTTGTTTTAACTGTAAAGTTAATTGTATCAGGTTCGCTTCTTCTACTCTTTTACTTGTCGTGACAATTTGTATCTTGTGATGGAGTATTTGAATGGAGGAGACCTTTACTCTTTGTTGAGAAATTTGGGCTGCTTAGACGAAGAGGTTGCTCGTGTATATATCGCAGAAGTTGTATGTATATGTTTTTCTTTTCATCTTTCATTAGTCGGCGAAGTTAAAACAGTTGTTAGTATTGCAAGCTCAATATTTGCTTGTTTCTTTAACTTTTCAGGTTCTTGCATTGGAGTATTTGCATTCACTTGGGGTTGTTCATCGGGACTTAAAGCCTGATAATTTATTAATAGCACATGATGGCCATATTAAGGTAAATTAAGACCGATGAATATGTTTGATCTAATAAGTTATTTAGATATTTATAATAGAGAATTTTTCACAGATAAAAATATCCCCAAAATTAAGAAAAACATTTCCAAAATTAGGAACTTCTAAAATTTCCAAAAATTTCAAAGATAGAGAAATCAATAGGATCGGAAAACATATTTTTAATTTTGAGCATCCCGTATTTTGATTAAAAATATGTTTTTTTTAAACGGAGACAAACTTCTTTATTAATAAAGAAAAATGAGACTAATGCTCAAAGTACAAGATAATTATACAAAGAGCAAAAAGAAAAGACAAAAACAACCAAACAAACTGCCCAAAGAGGATCTCGGCCAATACAAGGGAAAACAACCAAAACAAACGCAAAGAACAAGCTAAAAAGGCAAAAACCAACTGAAAGAATGAAGTTAAAACAAAACAAAAACGAAGAGATCTAAATACAAAGCATAACAAACTAATTTTCTCAAAAGGAAACCATTCGGATCTAGAGACTAGCAAAAAGATGCAACTGGTGAACGCAAAGCTATCTTGCCAAAGTAGTCCAAACCCAGAAGAGGAACAAAAAAATCTTTCAAAATATCGGCATCAGCAAAAGGTTGTACTTCCTATACAACACCTCAAGACACCGTGTTGAATCTTCAATTGATGTTCCAATCGAAAACTCATCCCCGATGCATGTAGAAGTACTAAAGATATACACCAGAATATTTCTCCCATAAGCCACTTTAATAGATTCCAGCACCTTTTTCAACCATAACTTGATTCACGATGAACAAGATAATCTCATAGAATAAGCTTGAGAAATATTCTGGCAATGAAGACTTCCAAAGATGAGTTGCAGGAAACATTAACAAATTCGCCTTATGAACTAAATACTTCTTGGCAGTTACCATGAAAAGCTAGCGATACTTTACCATAGGAACCACGAGCAAACCTTTTCTTTAGTGGCTGCACTTTTAAAATAGTCCAGAATAATTCCACAGATGATTGACATTAACTCTTTAGGCAAATCTGGTGGAAAATTATTTGGGAGAAAACCATCCTCATCAAACAAAACATTTCAGTCCCTCCTTTCAATTTCTGGATCCAATTTAGGGTCAACTTCACTAAGATTAACTGATTCTTCACTGCTAACACTGATAGGAGAGAAAATTTCGGGCTGGTTTTTAGGACTTAAGTCAGCTTTGCATGAGGAGCCTTGCTGATGAGATGTCTTTGCAATGGAGGTGGAGGATTTTGAAGTGTTGAGACGAAGGAGAAGTGCTATTTACTGACTTGGAAAAATGCTCAGGGGCTTGGATTTTAGAGCTTAAACTTCCAAGAGATCTGGGTTATTGACAGAAACTGAATGCTTGATGGTGTTATTCCTAGAAGAGTCCAAAGGGGGAATTTT
AAGGTCACCAAACCCCATTTAATTCAGATTCTAGCAGGGTAGTCCACACGTTGTTGAAGGAGGTTTGCTTGCGGATATAGTGTTTAAGGATTTTTGGAATAAGAGAAGATTTACTACCTTTGGAATATTAAAAACAGGATCAATAGAAGTGGACCCCACTGCCTTTTTGCAACAGTTGTCTGACTTGGGTTTAGTAAAATTGAAAAAAGGGGCCTGTTCAGTTGCCCGGTCATTCAATGCTTCTTCAACTTGATTGCCGGCCACTGTTTTCATCTCTGACACCGGTGAAGTGGATGGAAAATTTTTCAGAATTTCAAATGGATTTCTGCTTCTAGGGAGAGATGCAAACACTGATCTCAGGATTTTCTGGACTAAAGGAAAGAAGGAGGGATCAAAATCTTCATCAATCATAACTTCTCTAATCCTGAAACAATCAAGAGAGTTATTAAAATCATCTCGTAAGTTAGGGGCTTTAAGGGAAATATGAGGATTTAAGAATTCAAAATCTCCAAAATGAAGATATATATTTCCCCTCTTTAGATCAGTGATTTCAATCGTGGAGGGCACAAAACCACATAGATTCTTCTTAACTTGAATTCGGGCTTCACTAACATTAATTAAATTCAAGAGTTTCGGAGGCTATATCTAAGAGACCTCCAAAATGATCCCCATTGACCTCAAAGGTACTTCTACACCAATAATCCAAGGGAAGGTTTTTAATTTTGAGCCAACCGCCATAGCCTTTTTGAACAAGTGGCCGACTGTGTTTAAAGTTATCCCATCTTTCGAATTTGAGATGAAAATTCCCCCAGGCTTGCCATTTTCCTTCCTCATGGATGAGATCTAAAATAATCCTTGGTCAATGCTAATGAGGGCATTCTCATCATATAATGGATTTATAACAATTTTTTATTGGAAAAGAGCTTCCAACCTTTTACGGATCAATTTCCAGTCATATGAGGCAAATAACTTTGTTTTTTTTTTTGAAAAGGAGACGAACTTCTTTATTATTAATAAACTCAATGTACAATAGAGCTTTACAAAGAGAATAATAGGGAAGCCAAGAAATGAGGGAGAGAGAGAGAAAGGGAGAGATGCAAACACCGATTTTGGGATTTTCTGGACTAAAGGAAAGAAGGAGGGATCAAGATCTTCATCAATCATAACTTCTCTAATCCTGAAGCAAATAACTTTGTTATAATCCAAAATTTTGAAATTGACTTTTCCAACTTCATGGTTCTCAATCAGCCAATGCTTTTGTTTTGAAGGGGGAAGATCGACTTATTTTTTAAACTTCTTGTTTGGACTAGGAGGCTGGTCAGTACCCTTAAAATTAGGTCTTAGAACAGGGGAATTGGAAAGAACTGATTTTGCCATTTCAAACATAACTTAGTTTTCTATTCAAAATCGGAGGAGGAACATTGAGGGAATTTTCAGCAAACCATCTAGCATAATCCAGTTTTTCGACAAACTTATCCAACATCGAGCAAAACAATTTCCAACCTTGCTTCTCCCTACGTAAGCACACTCTGATATTTAAGAGACCCCCTGAATATGGCCAGAGATCACAACTCAGTATCCAACCCCATGTTACTTGAAACTTTGAAATTCTCAATCTGCCTCTGTCAATATCTCCATGTTTCCGAAAATGGCTCTCTACCGGAAAACTGATAAGATCAGTTAACATTGACCTGATCCATCTAAGTTGGCGAACGGATAGATAATTCTGTCGGCTTCTACATCTTCAATAAAGCAACAATCTCCTTCTTGCCAAATGCAGTGGTGTGACTGGTCAATTTTACAACAAACCACCTCCATAGTGAACTTAATGGAGAGGGCTGCTGAAAAACATGGCTGCTGGAGAGGGGAGGAGAGGGGAGATCGAAGGGAGTGGGGTGGAGAGAGGAGAGAGGAGAGAGGAGAGAGGGGAGAGAGAAAACTTACCTTCTAATCTCTTGATTAAAAATATGTTTACATGCACAACGTATTTTTATTTTTAT
AAAAATAAAAAATAAAAATAACGTGTCCCTGCTGTGTCGTGTACTACTTTTTCAGAAATTGGCGTGTCGCCATGTCAGTGGTGTGTCATATTCGTATCACGTGTCAGTGTCAGTGCTTGCTAGCTAGTTTCTATTACTTATACACATTAATAGACTTCTATTAGCCTCTATTAACGATAGACCTGATAGACTTCTATCAGCATCTATTGATGGCAGACTTGTACTAGTTTCTATTTATGACTTTTATTAGTGTCTATCTGGTTTGTCAATAATGTAGAAATTTTTGTTATAATTTGTAAATAGTTTGGTCCATTTTTCTAGATTTAAAAGGTTTCCTTTATAATAATTGTGAGGCATTAATGTAATAAATAGCATAAAACCTAATAATATGTTTGCTCTTGTCAGCCTCAAAATTGTGTTCTTCATTCTGTATCTTTGTGATTATTCAAGTTATTTTTTTTTTTTGAAAGGGAAATACACTTTTCATTGATTATTGAAGTTATTGGTAATAATTGGAAGACCATTATGTGAGATTAAGGCTTTAGTTTTTCTTATTGCTTTATGTTTCTTTTTACTGTTCGGTGTTACTTTTGGTAGTATTGTTTTTTTTTTGTTGTTTTCTTTTTTGTATCTCGTACTTGGATTTTCTTTTGGTCTTTGGTTTTTCCTTGGTCCCCCTTCCCCTTTTTTTGGCTTCTTTTTTTTTCTCATTGTATAATTCTTTTGTACTTTGAGTTTTATATTAATAGAAGCTTGTCTCCGTTTCAAAAAAAAAAATCTTCTCAACTACTAGTAAAAGATTCCTTTATTCTTAAAAAAATGCATTTAATTACTTTCACTGAAATTTCAGCTTCTCATAATAATGGAAATAAATATAATTCTCTGTTACTTACAGAATTATTTTGCTTCAAGCAGCTGACAGATTTTGGGCTTTCCAAGGTTGGTCTAATCAACAGTACTGATGACTTGTCTGGACCAGCTGTTAGTGGGACCACTCTTCTTGGCTATGATGAGCCTACCATGTCTGCTTCCGAGCATCAACAAGAAAGACGGAAAAAACGTTCTGCTGTTGGCACACCAGACTATTTGGCACCAGAGATACTATTGGGAACTGGACACGGTTAGCATTTATCACTTCCGCCTCCCACCTCTCTATACATTTTCAAATTTCATCATAAGAGTAGCTTACGTAAATTTTTATTAAGGAAACATGCCGCTTCATTATGATAATGAAATGTTATGATGATGAAATGAGACAGCTCCATAATGATAAAATGAAGAGACAATAGCTAAAACCAATGTATCAGTTTGTGCATCCGGACATATCTGATAAAACACCAAGTAATGGTTCTATTTGTTCATTAAGCAACCAAACTATTACATAATACAATACTCAGAACAAACAAGTACTATTACAGACAATAACCAAGTTCAAAATAAAACATTAAAACATCAAAAGTAATTTACGATAATGGAAAACACTTTCTAAGATTAAAGTAACGGAACAACTTCTGTAATTTTGCTCTCCAGAAAATACATCCCCTCCACTTCCAGAAACTGATTAACCAACCCCGATCTTTCCTTCCATCTCTCCCTTTTAACCTCCCCCCCATGTAACTAACTGTGGTCCCCACTAAGGTGGTTTCTACTTCCTCCACTTCCATCACCTGTGCACGTGCAGTTCCACTTGTCTTTCTTCTCCTATATTGTATTACAATTGGTGGTCTATCAATATCGTCATGTTCAATAAATAAGACAAGCTATCCAAGATCAAGTACAAACATTAACAAAAGATAAAACATAAAACAACTAAGCATTACTAAAAGATCAAAATGAAGATAGTAATGAAAATCATCATTGGCCCCTCTTAAAATGCCTTCATAGAAGGAAGACGTATTCTGGATCCGATCTTGATTGTCAACGAAGTTGTAGAAGACTACTGTGCCAAAAAAAAGGAAAGGGTGGGTCTTAATGGTGGATTTAGAAAAGGCCATTGATCG
TGTAGATTGGGAGTTCCTCAAAAAGGTGCTTCTTTGTAAAAACTTTGATCGACGATGGATTTCTTGGATCATGGGTTGTATAAAAGATCCAAGGTTTTCTGTGTTTATTAATGGTAGACCTTGCTGAAGAATTTTTGCTTCTAGAGGAATTAGGCAAGGGGGTCTCCTTTCTCTCTTCCTTCATGAGAATGGGTATTTGAAGGCTTTGTTGTGGGTAAGGAGAGGTTCATCTGCAATGCCTTCATTTTGCTGGACTAGATGGGAGAGAAAGCGAAGGCCTTCCAGTTTAGGTTGATATCTTGGATGGAGAAGTATTCAAAAGGCTTGGGAAGAGAACACCACGAGTCTTCATCATGGGACCTCCCAAGAGTTGAATGATATTCTCCTCGAGTGATCCTGAAGAGGCTGTAATTGTAAACCATTTTACTTATAATAACAATATCTAATCTGGATTTTATTCTTTGTTCTGTCTGTAGGCGCCACAGCTGACTGGTGGTCTGTTGGGATCATTTTATTTGAACTTATTGTTGGCATTCCACCCTTCAATGCAGAGCATCCCCAGGTTTACTTCTTGTTGTAAAATAGGTTTTCGTACTATCTTTTCGTTTTTCAGTTGTCAACAAGACGTGGGTGGGTTGTTCTATTGATTGCTTTTTTTTTTTGGGCGGGAACAAGTTCGTGTTATATATTTTGATTATATAAATCTCTTTATTATCTCTCTCATTTTTTCCTCTTTTCTTTATGTTCACTTCACTCCTCGGCTTTTTTCTTTATTATTTTGGAATGAATTATATATGTATATTTGAAATGAAAACAATACTTTTCATTGAAACAAGTTGAGTCTAATGCTCAAATTACAAAATATAAGATACTGTAAGGCCTTGCATTGTGCATACATATAGAAGAAGAGGGAAGAAAGGGCAGGAGGGCATTAAACGTCACTTTGGAAGCTCCTTAAAAAGGGGGGAATCACGAAGATTTTTGCTGGCTGGCACGGACCTTGAATGATGATGGGTTTAAGAGAGGCTTTGTGAGTTGTGGAAGGCAAGGATATTTTGGTAGTGAGCACAGAGGCTTTAAGGAGAAGATTTTCAACCCTCTAGAAAATGCTTGTTTGCTAGTCTTCTCTATCTTTTTGCTTGCTATTCATCTTGATATATGTTTTTTTTAAAGGAAACAAGTCTCTTTATTAATATAATGAGACAAATGCTCAAAGTACAAAAGAATTACACAAAAAGCAGAAAAAATTGAACCTAGAGATCAGGAGATGCACCCGAGCATCTTAACTAGGTTGATATCCCTTAGTGCCCTTAACATATCCAATAACAAACAAACAAAAATGATAAAAACATAATATATTATCTGTTCCTTTCAAAGATTTTTTTTTGTAAAGGAGACTAGTTTCTTTATTCATTAAAACTCAAAATACAAGAGAAGTATACAATGGAAAAAGGAAAAGGTAATGATGTAATTCAAAGCTGGGATTAATTTCTCAAAAGAATTCTCTGCCATTCTCAATAGGAGAATGGTTATTTATAGATGTGAATGATGGTTACAACTAATTGTTTGAGTGTAACTAATCAGTTATTGTAACTGATTAGTTTTGAAGAAAACAGAAAAAACAGCTAATAACAGAAAATAACAGAAAACAGCTAATAACAGGAAATAACAAAAAACAGCTAATAACAGGAAATAACAAAAAACAGCTAATAACAGGAAATAACAAAAAACAGCTAATAACAGGAAATAACAAAAAACAGCTAATAACAGGAAAATGGCTAATATCTATTTGTGCTGTTTATTATGACTGTGTGATTGCATCAGGTAAGTTCTCTCTCCCTTTCTCTACCCTTTCTCCTATTTTCTCTCTCCTCCTTCACCCTCCCTTCATTATCATTGTCAGCCTATTACCGTCTTTGGTAGACATTGCTATATATCGACTTGTACTTTGGCAATACCCTCACAACCATTCTTATTTTACAAAGATGGAAGTAGAAAGA
TGCAAAATCGACAACCTTTTTTTTTCTGCAACTGGCATGAAGATGAAGCTTTTCACATAGAAGATGTTGAAACTAATATTATTATGTCAATATCAATCCTTCAACTTAGGTGGTTCATTGAAATGGTGAAAGAATCGAATAAGAATTTTGAAGATTATTTCTATAAGAAAGAAAAAGTTGAGAGTGGAAGGGTCAAAACTTCCAAATTCAGAACCAAAAGTGGATGGGTTTTGAGATGGAATGCATGGCCTCGTACTGGAGGGGGATGATTTATTCATATCCCCATTGGTGAAGATAAAAGGGGTGGAAAAGTTTCAGAACAATGTTGGGTGACTTCAAGGACAACCATGAATACAGAATCTGGTTCTCTACCCAAACCATGTCCAACACCATCGTGAAGTAGCTGAACAAGAGTATGAAGGGAAGCTATGGGGAGATAGAAAAATCAAAAGAGACATCCAAGAACACTTTCTTCGATACCATAAAACAGAGTTAAAGCAAATATTGGGTGATTAAGAATCTGGTAGTTTTCAAAGCTGATTTTTATAATCTTTGGGTTGTTTCAAAACTTTTTGAATTTGATAGTTGGGTAAGAATTAATGAAACATTAGAACTCTTTTTCGACTCAAAAGTTATCATCAATCCATGTTTGCAGACAATGCTTTGATTGAATTGGAACGTGGTCCTTTGAAGGAGTTTATTGAAAATCCAGGAAAATGGCAAAACTTTGGTGCCTCTCATTTAAAGTTTGAAAAGTGGAACAATCATATCTATGGAAGACCATTATATTCGAAAGGTTATGGAGGGTGGATTACAATAAAATCTACCACTTGTTATTGCTGTAAGAAATTTTTTGAAGTGATTGGAGCTCATTTTGGAGGCTTAGAAAAGATTGCGGCTGAAACTATCAACTTCATTCAAGTGAAGGAGAATATCTGTGGTTTCATTCCAGCAACAATTGAAATATCTGATGAAAGAAGAGGAAACATTTTTTTTTTAAATTTTGGGAATATTTCTGTATTCGATACCCAGCAAAGTATAAGGTGTTCTATTTACTTACGTTTCTCTAATCCTTTAGATATTGTCCGAATAAACCAAGTCATGAAGCACAAAGGTATTGAGGAAGATTCATATAGCTCCAATTTTGACCTTTTCCTTCCTTCGAACTCCATTCCAAGCATGCAAAGGAATCCGTTTAAGTTGTTTGAAAGATTTTTTAAACATCATCATCAGAGGCCATCAACAACTCATCGGAATTGCAAAGCGACGTTGGATTTTTGGGAATCTTCTCTTCAACCTCTGACAAATGGCAGTTTCATTAGAAAACGGATGGCATAAGACAAACGGAAGTTAGAAATCCAAGAGCCATTCAAAATCCGAACACCAATCAGGAGAGGCTTAGTTTTCCTGCCAAAGGAGACTTCTTTTAATTTTATTCCATTTGAAAAGTCACTTGAGAAGTCCAGCAGTCACGTATTTGTAGCATTAAAGACAATAACAATCATCATAGGAAGGTTTGAGGTTTCCTCCAGTTATTTCATAGTTTCCTGTAGCAGGTAATAAAATATGAATTTTTGATCTGTAATTCCTACAATGCTCCACTTTAAGAGAAAAAGTAAAACTGTTTTTCTCCTCCATGATAATTTTCTACTGTTGTCCTTCCCTTTTTTCATGATGTCACTTGCTTGTGTTTTACAAGTATTTATGTGGTATGTTATCTGGAACTTGAGAATTTTTTTTAATACTTATATGGTCTAATTTTACTCATCTTTATTTCTTTTGTACAGACTATATTTGATAACATTCTCAATCGTAAGATCCCTTGGCCTCAAATACCTGAAGAGATGAGCCATGATGCTCAAGATCTAATTGACCGGTAAGAGAAACACACACAGTTGCGCTCGCACGCATACACATAATTGATATTTTTTGTAGACAATAAACAAAAATTTTCATTGATGCATTATTTTCAGTTATATTCTTTTGGTTTCACTTTTGAT
TTGATTACTTGGTTGTTGAAAATCTGTCACTACTGAAGCTTGCACATCATTCTTCGTACTTTTTTTAGTAGAGGATATATCTTAGGAAGGAAGGGGAATGAGATAAGAGAAAAGAAAGCAGGAGGGGGTGTACTTGGACCAATAAATATATATGTATTATGCACGCTCAGTCAACTCAAAAGCATTAGGCGGTGTACCAATGAAAGAATAAAAATTATCAAAGTCGTACGCAGAAGTAATCATAAAAATTCCTTTGTTCAGACTCCTCAATTAATATTCATTTCCACTTGTTGGGTCTTCTTCATATCGAGTTCCCAAGTAAAGGAGAGTGTTAGTTATTATATTAAATTTGTCTTCACCCATCAACTTAGTGAAAAGGTGATTTAGCATTTATAAACTTTAAGAAGAGGTTAGAAGGGTTAATATGGTTTTGAAATTATTAGGTTTAAGGGTATTTTTGGGTATTGTAAGAACTTTTGATTTTTGGACATTTAAGGGTATTTTTGACACAATGGACAATAATTAGGGGTATTTTGTATAATCTAGTCTTAACTTTACTATAAATTATTAGTTTAAGCATTTTAGTTGAATCTTGTTTAACGTTACCCCTTTTTCAATCTAATAAAGATCTCTTGGTGAAATGTTGATACCTCTATTTAAAGGATAGATAGAGGTATGAGCATTATTTCATGATATTGAGAATTTGCATGGGTTTGTGTAAAGATACTACAGTTGTATCTTTATATATATTTAATTTCCTCAACAGATTACAATGTAGAAATCAGTATTCTCACAGAAATTCGAGAATACAGATGATCAAACAACTTAAAACAACAGTAAATACAAACAAATCCACACATAAATGCAACAAAGAACAATTAACAAAAAACACTAAGATCGAACAATAGAACCTGGACCCTTCTGCCTTTCTTCCCGCGCAAGGAGGAATGCAGCCTCTGCTTCTCCAGAAAATATCTTCTATGCCCTCCCTCTCTCATTCTTCTTTTTATTCTAACTAACTACTAGTCCTCACTGAATGGTGGTCCCCCTCCTCCTTTCTCTCTCCCAAACTCCCACGTGTGATTCTATTTTTATCTTTATTCTTTCTGCTATATGTAAAGTGTATAGGAAGTCTAGCAGTTTCTACCAGGTTTCTGGATGTTTATTAATTGTATGTTGTTACTTTTTGTTGACGATTTTTCTCATCTAACTTGGTTATATTTAATGAAAAATCGTTCCGAGTTACTTTCTCACTTTTGTGCTTTTCATGCTGAATTTTTTTAGAAGATGAACCCCTCCATGGTTACCATAATAACAATGATCATGTTTTTATGTAATATACATAATTGCAAAGACAGCTCCAAGGACATCGAATATGTTTTACGCAAACACCTTACAAGTTAAAACTACTTTCCATTAGCGTCAAAGAGAAACAGAGGATGCCTCTATCAAAACTTACTGATAATGCTGGTGTATATTTACTTGTGCTTGACTCTAACTTGTGTGAACGTGGCATCGTTCATCAATCTTCTTATGCCAACACTCCATCCTAAAATGGAGTAACTGAACAAAAGAATAGACACTTACTTAAAACTCCATGCGCTTTCTTTCAAATATATGTTCCTAAGCATTTTTGGGCTGATCCTGTTTCCACTGCTTGCTTTTTGATTAATCGATTGCCTTTCTTTGTTCTAATGGTGGGATCCATATTGTGTTCTTTCCCTACCAAATTTTTGTTTCCTATTCCTTCGACAATCTTTGGTTGTGTTTGTTTTGCTCGGGATGTTCATCCTCATCATACTAAGTTACATCCAAAATCCTTGAAATGCATCTTTTAGGTTATTCACGAGTTCAAAAGGGGTTTCATTGTTACTGTCTTACTCATAAACAAGTATATTGTCTCTTCTGATGTTGATGTTACATTTTTCAAGGGTATACCCTTTAGTTTGTCACCATTGAGTGTGTGTTAGGGGGAGGATAACAATTTTTTTATCTATGAG
ATTATTTCTCCCACATCGTCATCTACTCAATCTTCCTAGTCTCTTCCTTCCCGTCCATTGATTACTCGAGTATACCCCTGGTGATTTCCACAACCTTTAAACACATGTCCTCCACGAATGCCTTTTTCAACATGTGATCTAAGACCAAGTGATGATCTTCTCATTGCCCTTCGGAAAGGTAAATGCCAGTGCACGCACCTTGTTTCATATTATTAGTTTTCCTCACCTATATATTCTTTTCTTATGCCCTCGATTCCACATCTATCCCTAACACTGTTCATGAAGCTTTATCTCATTTTTTCTAGTGTAACACAATGATTGAGGAGATGACTACTTTAGATGTAAATTGTACTTGGGATTTACTCTGCTCCTGCAGAAAAGAAGACCATACGGAGTAAATGGGTGTTTGCTATCAAGGTTGTCTTGTTGCCAAAAGTTATGCCCAAATTTATAGAATTGATTATTCAGATACATTTCCTTCACTTGCCAAATTAACTTCCATCAGACTTTTTCTTTCCATGGTTGCTACCCATAAATGGCTTTTGCATGAGCTTGACATTAAGAATGCTTTTCTGAGTAGTGATCTTCAGGAGGAAGTTTATGTGGGAGCGATCACTAAGATTAAGTATGTCACCTTCAAAAATCTTTATATGGTTTGAAACAAAGTCTATGTGCATGGTTGACAAGTTTAGTCAAGCTCTTCTACATTTTGGTATGAAGAAATATACGTCTGATCATTCTATTTTCTATTGGCAATCTATCTGATAAGGGTATTATCTTGCTTGTTGTGTATGTTGATGTTATTGTTATTATTGGAAATGATGCATTTGATATCTTGTCTCTCAAGACTTTCCTTTAGGTCAATTTCATAAGAAAGGTTTTGGACAATTGAAATTCTTTTTTGGGCATTGAAGTAATGAGATGCAAGAAAGGTATTTATTTGTCTCAATGAAAATATGTACTTGGTTTGTTGTTGAAGATAGAAAAACTAGGAGTCAGTCCAAGCAGTATTTCGATGATTCTTTCAACAACTTGGTATAGAAGGGGAGAATATTTAAAGATCCTGGAGATATAGCAGATTAGTTGGTAGGTTGAACTATTTAACAGAGGACGAGACCACACATTGTTGCTTCAAGTGTTATAAGTTTACGTCTTCTCATACAGTGAATCATTGGGCTGCATTAGAGCAAATTTATGTTATCTGAAAGGTGCATTTGAACGTGGGGTCTTGTATAAAGATCATTACCATTCAAGATTTGAATGTTTTTTCTATGCCAATTGTGTTGGATCTTGAGAAGATCGAAGATTGACTTCTGGATATTGTGTCTTTGTAGGAGGAAACTTAGTATCGTGGAAGATTAAGAAACAACACATAGTTTCTCGTTCAAGTGCGTAGTCAAAATGTAGGGCTATGACACAATCTGTTTGTGAAACAGTGTAGATAAATCTACTGTTATCTGAGATGGGCTTCATTTTTACTGTGCCAACTAAATTATGTCCTTTATTGTAATTGTACATACCTAGTCTAGGGTTTCTATGTGTCCTATATATATAACTCTACTCTGAATTATAATGAGATAATTTTCCTCCAAAAATATTGAGTACAGTATATAATATTTATTCGTGCAACTATCAGATTATTGACGGAAGATCCCCACCAGCGACTTGGAGCTATAGGTGCATCAGAGGTAATTAACTTTTTAATGTGGCTCATCTACTTGTCATGATACTGGTTTTCTTAGTGCCCTCAGATTTTTCATGGATGCTTACATCATAGAGCCAACTTATATCACTCAGGTGAAACAACATATGTTCTTCAAAGATATTAACTGGGACACGCTTGCTCGACAAAAGGTTAGTAAATCAGTTAAATTTATGTTGTAAGGGGTGCTTTCATTTTATTGATATGTACGATTTTTGAACTTTTATAGGCTGCATTTGTTCCAACATCTGAAAGTGCTCTTGATACTAGTTATTTTACAAGTCGCTATTCATGGAAT
CATTCAGATGATCATGTCTATCCACACAGTGAACTTGAGGATTCGAGTGATGCCGATAGCTTGAGTGGTGATAGTTGTTTGAGCAATCGTCAAGATGAAGTGGTAATTTGGTTGCATGATTTTTATAAGTATTGTTTATTTAAATCTCCTTGAACTAATGCTAATTGTATTTTTTTTATTCTGGTTCCTTAATCCTTGGCATCTCACGAATAGGGAGATGAATGTGGGGGTCTTACAGATTTTGAGCCCGGTGCTTCTGTCAATTACTCATTCAGTAATTTCTCTTTCAAGGTATCTACTAAACAATTTCTAGGCGTGCAGGTTATGATGAATTCCTAGCTTTTCATGCTTATTTGTTCAAATACCTTATAACATGTTTACCTGGGAGTAGTACATTCTTCTCGTGCATCCTCTTAATATCTTACTATTTGACGTTTCTATTTTAAACTATAATTGATTTAGCTTTCAAATTTTCTTTTATAAAAAAAAAGAAGAGGAATTTTGAACTTTTGTAAAGGATGGAAATGGGGGAAGTATCCCTCAACCCTCTCAGAAAGTTTGCCTCAAGGTTTTTATTTATTTTTTTGGTTATAGGAGGAAGTATGCTGTGCGGCAAGGCATGAGACACGTACACATCTAAATATATATATATTTAAACTCATATCTAAAATGGCCCTATTTTTTCTTTTTACTTTGTTATTTCTAATTTTTGTTCTCCTTTTTTCAAAACCAGAAAGGTAGTTTTCTTTATTAATCCTTTTACCATCATCCATGGCAACGAAGTTGGGGGAAGTCGTTGTGCAAGAGCTATTTAAATTGATCTTTACCAAATGCTAGATGCTTAATCCTTCTGATATGCACACTTTTGTTTAGGCAAGATGTTTATGACATACCTTTAATTACCACTTAAGCTAGTGGGTGAAGGAAAATTTAATATTATATTATTCAACAACATATATTAGACACTTGGGTGGTTAATTTTTTTTTATTTATCTGGAAATATGGGACCTAAAAAGTGATTAACTTTACAGTTTCGTGAAGTGTGCATATTTTTCACTTCAATCTGTCCTTCAAATCAATCTAAGCAATTAAAGATGGAGATGATTGCAGTGGCTGAATTAATTCAAAAGCAATTAAAAGTAGTGCATATTTCTAAATGTTCTCAAACTTCTCTTGTACTGTCGCGTTGCCATGCCCGTCTTCTTTGTTTAGAAAATTTGTGGCTTGCTGCTATTTATGTAGTTGGGAATGTCATTTTCCTCTTGATCACCCATGCTGGAGAAAATATGGAACAACTTGACCGGTCTGTTACGATTTATCTTTGTTTATGTTTTCTTGTAATCAGAATCTCTCCCAGCTCGCATCCATCAACTATGACCTGCTTTCCAAGGGTTTGAAGGATGATCCTCCCAATCATGATGCATAAATGTGTTAATCCTCTCATAATTGCTGAGTACTTTTAGGCGATTGGTTCGGTATTCGTCCTTTTATTTGTAATGTTCTGTACATGTTCGTGTAAGTCTTTCTTAGCTCCTCTACATGCCTTAACAACTTTATTTGTGATTGGTGTAGTGATTTGATCACTCTTGTAACTGCAGAGACAGACATATTTTAGAGGCTATATCTTGTTCGAGATGCTCTCTTCCAAATTATAGGCGCCATCTCCAGAACCATGTGGCCTTCAGTTTTCATTTTCCCTTTGTTTCACTTCTCTTTCTTCAAACTCACCTCACTGTGAGTTCAGAACTTCAATTTTCCATTGTTTAAATTGTCCAACTGTTTTACCTTTTTTCTGACAATTCGTATATTTATGTATTTGTAATATATTAATTATATAACAAAGAGAAGCACCATTGTGTACAGCATTTCCTTTGGCATACACCCTGACCATATTAGTATATGACGAAAGGAAGTTGAAAATAAATTCACAGGCGTTTGTTTTTGTTGTGACT

mRNA sequence

ATGACTAACATTACAGGATGCTATGGATCATCATGGGGATTTTCTGAAGGATTAAGAAATTTAGATTTCTGTACGCCAGAGCCCTCTTATGATTGTGAAAATCCAAAGGAGTCTGAGTCTCCTCGCTTTCAAGCTATACTCCGAGTCACAAGTGCTCCTCGAAGAAAGTATCCTGCTGACATCAAAAGTTTTTCCCACGAACTAAATTCGAAAGGCGTACGACCTTTCCCACTTTGGAAGTCACGTCGTTTAAATAATTTGGAGGAGATCTTAGTTATGATACGAGCAAAATTTGACAAGGCTAAGGAAGAAGTAAATTCTGACTTGGCTATTTTTGCTGCTGATCTGGTTGGAGTTCTTGAGAAAAATGTTGATACTCATCCAGAGTGGCAAGAGACAATCGAAGACTTGTTGGTTTTAGCTCGAAGCTGTGCAATGTCATCCCCAGGTGAATTTTGGCTTCAGTGTGAAAGCATTGTTCAGGAATTGGATGATAGACGCCAAGAACTTCCTCCTGGCATGCTGAAACAACTTCATACACGAATACTGTTCATTCTCACTAGATGTACCAGGTTGCTGCAGTTCCATAAAGAAAGTGGGTTAGCTGAAGATGAAAATGTTTTTCAACTTCGTCAGTCAAGAAATCTGCATTCTGCTGATAAACGCACGGCTCCGGCTATGGGAAGGGAAATGAAAAGTTCTAGCGCAGCCAAGGCTTCGAAGGGAACATCTTCCAGGAAATCTTACAGCCAGGAGCAGCGTGGTTTGGATTGGAGCAGAGAACATGATATACTGCCTGGGAATAATGTTTCAACATCTCCTGATGATACTTCAAAGAACTTGGTATCTCCTACTGGCAGAGATCGGATGGCATCTTGGAAAAGATTGCCTTCTCCAGCAAAAAGCAGCGTGAAAGAATCTACTTTGAAGGTTCAGGGTGATAATAAAAAATTTAAATCTCTGAACATGTCAAAAATCAGGATGAGTGTTTCTGAAACTGATTTTGCTGCTGCTAAGGTTTCTGAGCTTCCTCTGCCCAGAGATTCGCATGACCAACCTACAAAGCACCAACATAAACCTTCTTGGGGTTGGGGTGATCAACCAACTGTATCTGATGAGAGTTCAATAATATGTCGAATTTGTGAAGAAGAAATTCCTACTGCAAATGTGGAAGATCATTCAAGAATTTGTGCAGTTGCTGATAGATGTGATCAAAAGGGTATAAGTGTTAATGAACGTCTACTCAGAATTGCTGAAACTTTAGAGAAGATGATCGAGTCTTTTACGCAGAAGGATTCTCAGCATGTAGGAAGTCCGGATGTTGCAAAAGTTTCAAATTCTAGCATGACCGAAGAATCTGATATTTCCCCAAAACATAGTGATTGGTCCCGCAGGGGCTCAGAAGACATGCTAGATTGCTTCCATGAAACTGATAGTTCTGTATTCATGGATGACTTAAAAGGTTTACCTTCCATGTCTTGTAAAACTCGTTTTGGTCCAAAGTCTGATCAAGGTATGACAACATCATCGGCTGGTAGCATGACTCCTAGATCTCCTTTATTAACACCAAGAACCAGTCAGTTTGACTTGTTTTTGGCAGGAAAAGGTGCTTACTATGAACATGACGATCTTACGCAGATCAGTGATTTGGCAGATATTGCACGTTGTTCAGCTAATACACCTCTGGGGGATGATTGCTCGATGCAGTATCTGCTAACATGCCTTGAAGACTTGAGAGTTGTCATTAACCGCAGGAAGTTTAATGCACTGACAGTTGACACTTTTGGCACTCGCATTGAAAAGTTAATCCGGGAGAAGTATTTGCAACTTTGTGAGTTGGTGGATGATGAAAAGATCGACATAGCAAGCACTGTCATTGATGAAGATACTCCTCTTGAAGATGATATTGTTCGTAGCTTAAGAACTAGTCCCATCCATTCTTCAAAGGATCGTACATCAATTGATGACTTTGAGATTATAAAACCAATCAGCCGTGGGGC
ATTTGGTCGGGTTTTCTTGGCTAAAAAGAGGACTACGGGGGATCTTTTTGCAATAAAGGTTTTGAAGAAAGCGGATATGATTCGGAAGAACGCGGTCGAAAGTATTTTGGCAGAACGTAATATTTTAATTTCTGTGCGCAATCCTTTTGTGGTTCGCTTCTTCTACTCTTTTACTTGTCGTGACAATTTGTATCTTGTGATGGAGTATTTGAATGGAGGAGACCTTTACTCTTTGTTGAGAAATTTGGGCTGCTTAGACGAAGAGGTTGCTCGTGTATATATCGCAGAAGTTGTTCTTGCATTGGAGTATTTGCATTCACTTGGGGTTGTTCATCGGGACTTAAAGCCTGATAATTTATTAATAGCACATGATGGCCATATTAAGCTGACAGATTTTGGGCTTTCCAAGGTTGGTCTAATCAACAGTACTGATGACTTGTCTGGACCAGCTGTTAGTGGGACCACTCTTCTTGGCTATGATGAGCCTACCATGTCTGCTTCCGAGCATCAACAAGAAAGACGGAAAAAACGTTCTGCTGTTGGCACACCAGACTATTTGGCACCAGAGATACTATTGGGAACTGGACACGGCGCCACAGCTGACTGGTGGTCTGTTGGGATCATTTTATTTGAACTTATTGTTGGCATTCCACCCTTCAATGCAGAGCATCCCCAGACTATATTTGATAACATTCTCAATCGTAAGATCCCTTGGCCTCAAATACCTGAAGAGATGAGCCATGATGCTCAAGATCTAATTGACCGATTATTGACGGAAGATCCCCACCAGCGACTTGGAGCTATAGGTGCATCAGAGGTGAAACAACATATGTTCTTCAAAGATATTAACTGGGACACGCTTGCTCGACAAAAGGCTGCATTTGTTCCAACATCTGAAAGTGCTCTTGATACTAGTTATTTTACAAGTCGCTATTCATGGAATCATTCAGATGATCATGTCTATCCACACAGTGAACTTGAGGATTCGAGTGATGCCGATAGCTTGAGTGGTGATAGTTGTTTGAGCAATCGTCAAGATGAAGTGGGAGATGAATGTGGGGGTCTTACAGATTTTGAGCCCGGTGCTTCTGTCAATTACTCATTCAGTAATTTCTCTTTCAAGAATCTCTCCCAGCTCGCATCCATCAACTATGACCTGCTTTCCAAGGGTTTGAAGGATGATCCTCCCAATCATGATGCATAA

Coding sequence (CDS)

ATGACTAACATTACAGGATGCTATGGATCATCATGGGGATTTTCTGAAGGATTAAGAAATTTAGATTTCTGTACGCCAGAGCCCTCTTATGATTGTGAAAATCCAAAGGAGTCTGAGTCTCCTCGCTTTCAAGCTATACTCCGAGTCACAAGTGCTCCTCGAAGAAAGTATCCTGCTGACATCAAAAGTTTTTCCCACGAACTAAATTCGAAAGGCGTACGACCTTTCCCACTTTGGAAGTCACGTCGTTTAAATAATTTGGAGGAGATCTTAGTTATGATACGAGCAAAATTTGACAAGGCTAAGGAAGAAGTAAATTCTGACTTGGCTATTTTTGCTGCTGATCTGGTTGGAGTTCTTGAGAAAAATGTTGATACTCATCCAGAGTGGCAAGAGACAATCGAAGACTTGTTGGTTTTAGCTCGAAGCTGTGCAATGTCATCCCCAGGTGAATTTTGGCTTCAGTGTGAAAGCATTGTTCAGGAATTGGATGATAGACGCCAAGAACTTCCTCCTGGCATGCTGAAACAACTTCATACACGAATACTGTTCATTCTCACTAGATGTACCAGGTTGCTGCAGTTCCATAAAGAAAGTGGGTTAGCTGAAGATGAAAATGTTTTTCAACTTCGTCAGTCAAGAAATCTGCATTCTGCTGATAAACGCACGGCTCCGGCTATGGGAAGGGAAATGAAAAGTTCTAGCGCAGCCAAGGCTTCGAAGGGAACATCTTCCAGGAAATCTTACAGCCAGGAGCAGCGTGGTTTGGATTGGAGCAGAGAACATGATATACTGCCTGGGAATAATGTTTCAACATCTCCTGATGATACTTCAAAGAACTTGGTATCTCCTACTGGCAGAGATCGGATGGCATCTTGGAAAAGATTGCCTTCTCCAGCAAAAAGCAGCGTGAAAGAATCTACTTTGAAGGTTCAGGGTGATAATAAAAAATTTAAATCTCTGAACATGTCAAAAATCAGGATGAGTGTTTCTGAAACTGATTTTGCTGCTGCTAAGGTTTCTGAGCTTCCTCTGCCCAGAGATTCGCATGACCAACCTACAAAGCACCAACATAAACCTTCTTGGGGTTGGGGTGATCAACCAACTGTATCTGATGAGAGTTCAATAATATGTCGAATTTGTGAAGAAGAAATTCCTACTGCAAATGTGGAAGATCATTCAAGAATTTGTGCAGTTGCTGATAGATGTGATCAAAAGGGTATAAGTGTTAATGAACGTCTACTCAGAATTGCTGAAACTTTAGAGAAGATGATCGAGTCTTTTACGCAGAAGGATTCTCAGCATGTAGGAAGTCCGGATGTTGCAAAAGTTTCAAATTCTAGCATGACCGAAGAATCTGATATTTCCCCAAAACATAGTGATTGGTCCCGCAGGGGCTCAGAAGACATGCTAGATTGCTTCCATGAAACTGATAGTTCTGTATTCATGGATGACTTAAAAGGTTTACCTTCCATGTCTTGTAAAACTCGTTTTGGTCCAAAGTCTGATCAAGGTATGACAACATCATCGGCTGGTAGCATGACTCCTAGATCTCCTTTATTAACACCAAGAACCAGTCAGTTTGACTTGTTTTTGGCAGGAAAAGGTGCTTACTATGAACATGACGATCTTACGCAGATCAGTGATTTGGCAGATATTGCACGTTGTTCAGCTAATACACCTCTGGGGGATGATTGCTCGATGCAGTATCTGCTAACATGCCTTGAAGACTTGAGAGTTGTCATTAACCGCAGGAAGTTTAATGCACTGACAGTTGACACTTTTGGCACTCGCATTGAAAAGTTAATCCGGGAGAAGTATTTGCAACTTTGTGAGTTGGTGGATGATGAAAAGATCGACATAGCAAGCACTGTCATTGATGAAGATACTCCTCTTGAAGATGATATTGTTCGTAGCTTAAGAACTAGTCCCATCCATTCTTCAAAGGATCGTACATCAATTGATGACTTTGAGATTATAAAACCAATCAGCCGTGGGGC
ATTTGGTCGGGTTTTCTTGGCTAAAAAGAGGACTACGGGGGATCTTTTTGCAATAAAGGTTTTGAAGAAAGCGGATATGATTCGGAAGAACGCGGTCGAAAGTATTTTGGCAGAACGTAATATTTTAATTTCTGTGCGCAATCCTTTTGTGGTTCGCTTCTTCTACTCTTTTACTTGTCGTGACAATTTGTATCTTGTGATGGAGTATTTGAATGGAGGAGACCTTTACTCTTTGTTGAGAAATTTGGGCTGCTTAGACGAAGAGGTTGCTCGTGTATATATCGCAGAAGTTGTTCTTGCATTGGAGTATTTGCATTCACTTGGGGTTGTTCATCGGGACTTAAAGCCTGATAATTTATTAATAGCACATGATGGCCATATTAAGCTGACAGATTTTGGGCTTTCCAAGGTTGGTCTAATCAACAGTACTGATGACTTGTCTGGACCAGCTGTTAGTGGGACCACTCTTCTTGGCTATGATGAGCCTACCATGTCTGCTTCCGAGCATCAACAAGAAAGACGGAAAAAACGTTCTGCTGTTGGCACACCAGACTATTTGGCACCAGAGATACTATTGGGAACTGGACACGGCGCCACAGCTGACTGGTGGTCTGTTGGGATCATTTTATTTGAACTTATTGTTGGCATTCCACCCTTCAATGCAGAGCATCCCCAGACTATATTTGATAACATTCTCAATCGTAAGATCCCTTGGCCTCAAATACCTGAAGAGATGAGCCATGATGCTCAAGATCTAATTGACCGATTATTGACGGAAGATCCCCACCAGCGACTTGGAGCTATAGGTGCATCAGAGGTGAAACAACATATGTTCTTCAAAGATATTAACTGGGACACGCTTGCTCGACAAAAGGCTGCATTTGTTCCAACATCTGAAAGTGCTCTTGATACTAGTTATTTTACAAGTCGCTATTCATGGAATCATTCAGATGATCATGTCTATCCACACAGTGAACTTGAGGATTCGAGTGATGCCGATAGCTTGAGTGGTGATAGTTGTTTGAGCAATCGTCAAGATGAAGTGGGAGATGAATGTGGGGGTCTTACAGATTTTGAGCCCGGTGCTTCTGTCAATTACTCATTCAGTAATTTCTCTTTCAAGAATCTCTCCCAGCTCGCATCCATCAACTATGACCTGCTTTCCAAGGGTTTGAAGGATGATCCTCCCAATCATGATGCATAA
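
The coding sequence above can be related to the protein-level BLAST query below by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the helper name `translate` and the variable `cds_start` are illustrative and not part of the database record. Only the first 57 nucleotides of the CDS are used here.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# the standard nuclear code is the right table for this gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a nucleotide string in frame 1; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 57 nucleotides of the CDS listed above (hypothetical variable name).
cds_start = "ATGACTAACATTACAGGATGCTATGGATCATCATGGGGATTTTCTGAAGGATTAAGA"
print(translate(cds_start))  # MTNITGCYGSSWGFSEGLR
```

Residues 5 onward of this translation (TGCYGSSWGFSEGLR) agree with the start of the first Query line in the alignment below, which begins at query position 5.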
BLAST of CSPI01G05030 vs. Swiss-Prot
Match: IREH1_ARATH (Probable serine/threonine protein kinase IREH1 OS=Arabidopsis thaliana GN=IREH1 PE=1 SV=1)

HSP 1 Score: 1492.2 bits (3862), Expect = 0.0e+00
Identity = 766/1069 (71.66%), Positives = 888/1069 (83.07%), Query Frame = 1

Query: 5    TGCYGSSWGFSEGLRNLDFCTPEPSYDCENPKESESPRFQAILRVTSAPRRKYPADIKSF 64
            TG    S G S  LRN DFCTPE SY+ ENPKESESPR+QA+LR+TSAPR+++P DIKSF
Sbjct: 238  TGRSEMSSGRSGPLRNSDFCTPENSYEWENPKESESPRYQALLRMTSAPRKRFPGDIKSF 297

Query: 65   SHELNSKGVRPFPLWKSRRLNNLEEILVMIRAKFDKAKEEVNSDLAIFAADLVGVLEKNV 124
            SHELNSKGVRPFPLWK RR NN+EE+L +IRAKF+KAKEEVNSDLA+FAADLVGVLEKN 
Sbjct: 298  SHELNSKGVRPFPLWKPRRSNNVEEVLNLIRAKFEKAKEEVNSDLAVFAADLVGVLEKNA 357

Query: 125  DTHPEWQETIEDLLVLARSCAMSSPGEFWLQCESIVQELDDRRQELPPGMLKQLHTRILF 184
            ++HPEW+ET EDLL+LARSCAM++PG+FWLQCE IVQ+LDDRRQELPPG+LKQLHTR+LF
Sbjct: 358  ESHPEWEETFEDLLILARSCAMTTPGDFWLQCEGIVQDLDDRRQELPPGVLKQLHTRMLF 417

Query: 185  ILTRCTRLLQFHKESGLAEDENVFQLRQSRNLHSADKRTAPAMGREMKSSSAAKASKGTS 244
            ILTRCTRLLQFHKES   E+E V QLRQSR LHS +K      GR   S SAAK     S
Sbjct: 418  ILTRCTRLLQFHKESW-GEEEQVVQLRQSRVLHSIEKIPPSGAGR---SYSAAKVP---S 477

Query: 245  SRKSYSQEQRGLDWSREHDILPGNNVSTSPDDTSKNLVSPTGRDRMASWKRLPSPAKSSV 304
            ++K+YSQEQ GLDW  +  +     ++   +   K   SP   DRM+SWK+LPSPA  +V
Sbjct: 478  TKKAYSQEQHGLDWKEDAVVRSVPPLAPPENYAIKESESPANIDRMSSWKKLPSPALKTV 537

Query: 305  KESTLKVQGDNKKFKSLNMSKIRMSVSETDFAAAKVSELPLPRDSHDQPTKHQHKPSWG- 364
            KE+    + ++ K +  N+   R      D AA  +   P  +DSH+  +KH+H  SWG 
Sbjct: 538  KEAPASEEQNDSKVEPPNIVGSRQG---RDDAAVAILNFPPAKDSHEHSSKHRHNISWGY 597

Query: 365  WGDQPTVSDESSIICRICEEEIPTANVEDHSRICAVADRCDQKGISVNERLLRIAETLEK 424
            WG+QP +S+ESSI+CRICEEE+PT +VEDHSR+C +AD+ DQKG+SV+ERL+ +A TL+K
Sbjct: 598  WGEQPLISEESSIMCRICEEEVPTTHVEDHSRVCTLADKYDQKGLSVDERLMAVAGTLDK 657

Query: 425  MIESFTQKDSQHVG-SPDVAKVSNSSMTEESDI-SPKHSDWSRRGSEDMLDCFHETDSSV 484
            + E+F  KDS     SPD  KVSNS +TEESD+ SP+ SDWSR+GSEDMLDCF E D+S+
Sbjct: 658  IAETFRHKDSLAAAESPDGMKVSNSHLTEESDVLSPRLSDWSRKGSEDMLDCFPEADNSI 717

Query: 485  FMDDLKGLPSMSCKTRFGPKSDQGMTTSSAGSMTPRSPLLTPRTSQFDLFLAGKGAYYEH 544
            FMDDL+GLP MSC+TRFGPKSDQGMTTSSA SMTPRSP+ TPR    +  L GKG +++ 
Sbjct: 718  FMDDLRGLPLMSCRTRFGPKSDQGMTTSSASSMTPRSPIPTPRPDPIEQILGGKGTFHDQ 777

Query: 545  DDLTQISDLADIARCSANTPLGDDCSMQYLLTCLEDLRVVINRRKFNALTVDTFGTRIEK 604
            DD+ Q+S+LADIA+C+A+   GDD S+ +LL+CLEDLRVVI+RRKF+ALTV+TFGTRIEK
Sbjct: 778  DDIPQMSELADIAKCAADAIPGDDQSIPFLLSCLEDLRVVIDRRKFDALTVETFGTRIEK 837

Query: 605  LIREKYLQLCELVDDEKIDIASTVIDEDTPLEDDIVRSLRTSPIHSSKDRTSIDDFEIIK 664
            LIREKY+ +CEL+DDEK+D+ STVIDED PLEDD+VRSLRTSP+H  +DRTSIDDFEIIK
Sbjct: 838  LIREKYVHMCELMDDEKVDLLSTVIDEDAPLEDDVVRSLRTSPVHP-RDRTSIDDFEIIK 897

Query: 665  PISRGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAERNILISVRNPFVVRFF 724
            PISRGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAER+ILI+VRNPFVVRFF
Sbjct: 898  PISRGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAERDILINVRNPFVVRFF 957

Query: 725  YSFTCRDNLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLGVVHRDL 784
            YSFTCRDNLYLVMEYLNGGDLYSLLRNLGCL+E++ RVYIAEVVLALEYLHS GVVHRDL
Sbjct: 958  YSFTCRDNLYLVMEYLNGGDLYSLLRNLGCLEEDIVRVYIAEVVLALEYLHSEGVVHRDL 1017

Query: 785  KPDNLLIAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGTTLLGYDEPTMSASEHQQERR 844
            KPDNLLIAHDGHIKLTDFGLSKVGLINSTDDL+GPAVSGT+LL  +E  ++ASE Q ERR
Sbjct: 1018 KPDNLLIAHDGHIKLTDFGLSKVGLINSTDDLAGPAVSGTSLLDEEESRLAASEEQLERR 1077

Query: 845  KKRSAVGTPDYLAPEILLGTGHGATADWWSVGIILFELIVGIPPFNAEHPQTIFDNILNR 904
            KKRSAVGTPDYLAPEILLGTGHGATADWWSVGIILFELIVGIPPFNAEHPQ IFDNILNR
Sbjct: 1078 KKRSAVGTPDYLAPEILLGTGHGATADWWSVGIILFELIVGIPPFNAEHPQQIFDNILNR 1137

Query: 905  KIPWPQIPEEMSHDAQDLIDRLLTEDPHQRLGAIGASEVKQHMFFKDINWDTLARQKAAF 964
            KIPWP +PEEMS +A D+IDR LTEDPHQRLGA GA+EVKQH+FFKDINWDTLARQKAAF
Sbjct: 1138 KIPWPHVPEEMSAEAHDIIDRFLTEDPHQRLGARGAAEVKQHIFFKDINWDTLARQKAAF 1197

Query: 965  VPTSESALDTSYFTSRYSWNHSDDHVYPHSELEDSSDADSLSGDS-CLSNRQDE-VGDEC 1024
            VP SESA+DTSYF SRYSWN SD+  +P  E+ D SDADS++  S C SN  +E   +EC
Sbjct: 1198 VPASESAIDTSYFRSRYSWNTSDEQFFPSGEVPDYSDADSMTNSSGCSSNHHEEGEAEEC 1257

Query: 1025 GGLTDFEPGASVNYSFSNFSFKNLSQLASINYDLLSKGLKDDP---PNH 1066
             G  +FE G  V+YSFSNFSFKNLSQLASINYDLLSKG KD+P   P+H
Sbjct: 1258 EGHAEFESGIPVDYSFSNFSFKNLSQLASINYDLLSKGWKDEPQQIPHH 1295
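
The percentages in each HSP summary line are the match counts divided by the alignment length (for the hit above, 766 of 1069 aligned positions are identical). A minimal sketch for re-checking those figures follows; the function name and the regular expression are illustrative assumptions and are not part of the database or of any BLAST tool. The pattern deliberately tolerates spelling variants of the "Positives" label.

```python
import re

# Hypothetical helper: pattern for summary lines such as
# "Identity = 766/1069 (71.66%), Positives = 888/1069 (83.07%), Query Frame = 1"
HSP_SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_summary(line):
    """Recompute identity/positives percentages and compare with the reported ones."""
    m = HSP_SUMMARY.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    # Percentage = matching positions / alignment length * 100
    ident_calc = 100 * int(ident) / int(ident_len)
    pos_calc = 100 * int(pos) / int(pos_len)
    return {
        "identity_pct": round(ident_calc, 2),
        "positives_pct": round(pos_calc, 2),
        # Allow for rounding to two decimals in the report
        "identity_ok": abs(ident_calc - float(ident_pct)) < 0.01,
        "positives_ok": abs(pos_calc - float(pos_pct)) < 0.01,
    }


summary = "Identity = 766/1069 (71.66%), Positives = 888/1069 (83.07%), Query Frame = 1"
result = check_hsp_summary(summary)
print(result)
```

The same check can be run on any of the other summary lines on this page by passing the line as a string.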

BLAST of CSPI01G05030 vs. Swiss-Prot
Match: IRE3_ARATH (Probable serine/threonine protein kinase IRE3 OS=Arabidopsis thaliana GN=IRE3 PE=2 SV=1)

HSP 1 Score: 1366.7 bits (3536), Expect = 0.0e+00
Identity = 709/1044 (67.91%), Positives = 845/1044 (80.94%), Query Frame = 1

Query: 22   DFCTPEPSYDCENPKESESPRFQAILRVTSAPRRKYPADIKSFSHELNSKGVRPFPLWKS 81
            + CTPE SYD ++PKES+SPR+QA+LR+TSAPR+++P DIKSFSHELNSKGVRPFPLWK 
Sbjct: 198  EVCTPENSYDLDDPKESDSPRYQALLRMTSAPRKRFPGDIKSFSHELNSKGVRPFPLWKP 257

Query: 82   RRLNNLEEILVMIRAKFDKAKEEVNSDLAIFAADLVGVLEKNVDTHPEWQETIEDLLVLA 141
            RRLNNLE+IL +IR KFDKAKEEVNSDL  F  DL+ + +KN ++HPE   TIEDLLVLA
Sbjct: 258  RRLNNLEDILNLIRTKFDKAKEEVNSDLFAFGGDLLDIYDKNKESHPELLVTIEDLLVLA 317

Query: 142  RSCAMSSPGEFWLQCESIVQELDDRRQELPPGMLKQLHTRILFILTRCTRLLQFHKESGL 201
            ++CA ++  EFWLQCE IVQ+LDDRRQELPPG+LKQLHTR+LFILTRCTRLLQFHKES  
Sbjct: 318  KTCAKTTSKEFWLQCEGIVQDLDDRRQELPPGVLKQLHTRMLFILTRCTRLLQFHKESW- 377

Query: 202  AEDENVFQLRQSRNLHSADKRTAPAMGREMKSSSAAKASKGTSSRKSYSQEQRGLDWSRE 261
             ++E+  QLRQS  LHSADKR      R+ K SS A A K  S++K+YSQEQRGL+W   
Sbjct: 378  GQEEDAVQLRQSGVLHSADKRDPTGEVRDGKGSSTANALKVPSTKKAYSQEQRGLNWIEG 437

Query: 262  HDILPGNNVSTSPDDTSKNLVSPTGRDRMASWKRLPSPAKSSVKESTLKVQGDNKKFKSL 321
              + P   +S+  ++TSK+  SP   D+M+SWKRLPSPA   V+E+ +  + +++K +  
Sbjct: 438  FFVRPAP-LSSPYNETSKDSESPANIDKMSSWKRLPSPASKGVQEAAVSKEQNDRKVEPP 497

Query: 322  NMSKIRMSVSETDFAAAKVSELPLPRDSHDQPTKHQHKPSWG-WGDQPTVSDESSIICRI 381
             + K  +++S+ D A AK+ E+   + S +  +K++H  SWG WG Q  +S+ESSIICRI
Sbjct: 498  QVVKKLVAISD-DMAVAKLPEVSSAKASQEHMSKNRHNISWGYWGHQSCISEESSIICRI 557

Query: 382  CEEEIPTANVEDHSRICAVADRCDQKGISVNERLLRIAETLEKMIESFTQKDS-QHVGSP 441
            CEEEIPT +VEDHSRICA+AD+ DQKG+ V+ERL+ +A TLEK+ ++  QKDS   V SP
Sbjct: 558  CEEEIPTTHVEDHSRICALADKYDQKGVGVDERLMAVAVTLEKITDNVIQKDSLAAVESP 617

Query: 442  DVAKVSNSSMTEESDI-SPKHSDWSRRGSEDMLDCFHETDSSVFMDDLKGLPSMSCKTRF 501
            +  K+SN+S+TEE D+ SPK SDWSRRGSEDMLDCF ETD+SVFMDD+  LPSMSC+TRF
Sbjct: 618  EGMKISNASLTEELDVLSPKLSDWSRRGSEDMLDCFPETDNSVFMDDMGCLPSMSCRTRF 677

Query: 502  GPKSDQGMTTSSAGSMTPRSPLLTPRTSQFDLFLAGKGAYYEHDDLTQISDLADIARCSA 561
            GPKSDQGM TSSAGSMTPRSP+ TPR    +L L GKG +++ DD  Q+S+LADIARC+A
Sbjct: 678  GPKSDQGMATSSAGSMTPRSPIPTPRPDPIELLLEGKGTFHDQDDFPQMSELADIARCAA 737

Query: 562  NTPLGDDCSMQYLLTCLEDLRVVINRRKFNALTVDTFGTRIEKLIREKYLQLCELVDDEK 621
            N    DD S+Q LL+CLEDLRVVI+RRKF+AL V+TFGTRIEKLI+EKYLQLCEL+DDEK
Sbjct: 738  NAIPVDDQSIQLLLSCLEDLRVVIDRRKFDALIVETFGTRIEKLIQEKYLQLCELMDDEK 797

Query: 622  IDIASTVIDEDTPLEDDIVRSLRTSPIHSSKDRTSIDDFEIIKPISRGAFGRVFLAKKRT 681
                 T+IDED PLEDD+VRSLRTSP+H  +DR SIDDFE++K ISRGAFG V LA+K T
Sbjct: 798  ----GTIIDEDAPLEDDVVRSLRTSPVHL-RDRISIDDFEVMKSISRGAFGHVILARKNT 857

Query: 682  TGDLFAIKVLKKADMIRKNAVESILAERNILISVRNPFVVRFFYSFTCRDNLYLVMEYLN 741
            TGDLFAIKVL+KADMIRKNAVESILAER+ILI+ RNPFVVRFFYSFTC +NLYLVMEYLN
Sbjct: 858  TGDLFAIKVLRKADMIRKNAVESILAERDILINARNPFVVRFFYSFTCSENLYLVMEYLN 917

Query: 742  GGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLGVVHRDLKPDNLLIAHDGHIKLTD 801
            GGD YS+LR +GCLDE  ARVYIAEVVLALEYLHS GVVHRDLKPDNLLIAHDGH+KLTD
Sbjct: 918  GGDFYSMLRKIGCLDEANARVYIAEVVLALEYLHSEGVVHRDLKPDNLLIAHDGHVKLTD 977

Query: 802  FGLSKVGLINSTDDLSGPAVSGTTLLGYDEPTMSASEHQQERRKKRSAVGTPDYLAPEIL 861
            FGLSKVGLIN+TDDLSGP  S T+LL  ++P +   +H      KRSAVGTPDYLAPEIL
Sbjct: 978  FGLSKVGLINNTDDLSGPVSSATSLLVEEKPKLPTLDH------KRSAVGTPDYLAPEIL 1037

Query: 862  LGTGHGATADWWSVGIILFELIVGIPPFNAEHPQTIFDNILNRKIPWPQIPEEMSHDAQD 921
            LGTGHGATADWWSVGIIL+E +VGIPPFNA+HPQ IFDNILNR I WP +PE+MSH+A+D
Sbjct: 1038 LGTGHGATADWWSVGIILYEFLVGIPPFNADHPQQIFDNILNRNIQWPPVPEDMSHEARD 1097

Query: 922  LIDRLLTEDPHQRLGAIGASEVKQHMFFKDINWDTLARQKAAFVPTSESALDTSYFTSRY 981
            LIDRLLTEDPHQRLGA GA+EVKQH FFKDI+W+TLA+QKAAFVP SE+A DTSYF SRY
Sbjct: 1098 LIDRLLTEDPHQRLGARGAAEVKQHSFFKDIDWNTLAQQKAAFVPDSENAFDTSYFQSRY 1157

Query: 982  SWNHSDDHVYPHSELEDSSDADSLSGDS-CLSNRQDEVGDECGGLTDFEPGASVNYSFSN 1041
            SWN+S +  +P +E EDSS+ DSL G S  LSN  DE  D   G  +FE   S NY F N
Sbjct: 1158 SWNYSGERCFPTNENEDSSEGDSLCGSSGRLSNHHDEGVDIPCGPAEFETSVSENYPFDN 1217

Query: 1042 FSFKNLSQLASINYDLLSKGLKDD 1062
            FSFKNLSQLA INY+L+SKG KD+
Sbjct: 1218 FSFKNLSQLAYINYNLMSKGHKDE 1227

BLAST of CSPI01G05030 vs. Swiss-Prot
Match: IRE_ARATH (Probable serine/threonine protein kinase IRE OS=Arabidopsis thaliana GN=IRE PE=2 SV=1)

HSP 1 Score: 1025.8 bits (2651), Expect = 3.4e-298
Identity = 581/1037 (56.03%), Positives = 723/1037 (69.72%), Query Frame = 1

Query: 36   KESESPRFQAILRVTSAPRRKYPADIKSFSHELNSKGVRPFPLWKSRRLNNLEEILVMIR 95
            KE++SPRFQAILRVTS  R+K   DIKSFSHELNSKGVRPFP+W+SR + ++EEI+  IR
Sbjct: 172  KETQSPRFQAILRVTSG-RKKKAHDIKSFSHELNSKGVRPFPVWRSRAVGHMEEIMAAIR 231

Query: 96   AKFDKAKEEVNSDLAIFAADLVGVLEKNVDTHPEWQETIEDLLVLARSCAMSSPGEFWLQ 155
             KFDK KE+V++DL +FA  LV  LE   +++ E +  +EDLLV AR CA     EFWL+
Sbjct: 232  TKFDKQKEDVDADLGVFAGYLVTTLESTPESNKELRVGLEDLLVEARQCATMPASEFWLK 291

Query: 156  CESIVQELDDRRQELPPGMLKQLHTRILFILTRCTRLLQFHKESGLAEDENVFQLRQSRN 215
            CE IVQ+LDD+RQELP G LKQ H R+LFILTRC RL+QF KESG  E E++  + Q  +
Sbjct: 292  CEGIVQKLDDKRQELPMGGLKQAHNRLLFILTRCNRLVQFRKESGYVE-EHILGMHQLSD 351

Query: 216  LHSADKRTAPAMGRE----MKSSSAAKASKGTSSRKSYSQEQRGLDWSREHDILPGNNVS 275
            L    ++      ++     K        +  + ++       G D           N +
Sbjct: 352  LGVYPEQMVEISRQQDLLREKEIQKINEKQNLAGKQDDQNSNSGADGVEV-------NTA 411

Query: 276  TSPDDTSKNLVSPTGRDRMASWKRLPSPA-KSSVKESTLKVQGDNKKFKSLNMSKIRMSV 335
             S D TS N        RM+SWK+LPS A K+    +T K +G+         SKI+  V
Sbjct: 412  RSTDSTSSNF-------RMSSWKKLPSAAEKNRSLNNTPKAKGE---------SKIQPKV 471

Query: 336  SETDFAAAKVSELPLPRDSHDQPTKHQHKPSWG-WGDQPTVSDESSIICRICEEEIPTAN 395
                +       L  P     QP        WG W D   V+ ++S+ICRICE EIP  +
Sbjct: 472  ----YGDENAENLHSPSG---QPASADRSALWGFWADHQCVTYDNSMICRICEVEIPVVH 531

Query: 396  VEDHSRICAVADRCDQKGISVNERLLRIAETLEKMIESFTQKDSQHVGS-PDVAKVSNSS 455
            VE+HSRIC +ADRCD KGI+VN RL R+AE+LEK++ES+T K S    +  D A++SNSS
Sbjct: 532  VEEHSRICTIADRCDLKGINVNLRLERVAESLEKILESWTPKSSVTPRAVADSARLSNSS 591

Query: 456  MTEESDISPKHSDWSRRGSEDMLDCFHETDSSVFMDDLKGLPSMSCKTRFGPKSDQGMTT 515
              E+ D      + S+R S+DMLDC   + ++  +D+L  L  MS           G   
Sbjct: 592  RQEDLD------EISQRCSDDMLDCVPRSQNTFSLDELNILNEMSMTN--------GTKD 651

Query: 516  SSAGSMTPRSPLLTPRTSQFDLFLAGKGAYYEHDDLTQISDLADIARCSANTPLGDDCSM 575
            SSAGS+TP SP  TPR SQ DL L+G+    E ++  QI+ L DIAR  AN  +    S+
Sbjct: 652  SSAGSLTPPSPA-TPRNSQVDLLLSGRKTISELENYQQINKLLDIARSVANVNVCGYSSL 711

Query: 576  QYLLTCLEDLRVVINRRKFNALTVDTFGTRIEKLIREKYLQLCELVDDEKIDIASTVIDE 635
             +++  L++L+ VI  RK +AL V+TFG RIEKL++EKY++LC L+DDEK+D ++ + DE
Sbjct: 712  DFMIEQLDELKYVIQDRKADALVVETFGRRIEKLLQEKYIELCGLIDDEKVDSSNAMPDE 771

Query: 636  DTPLEDDIVRSLRTSPIHS-SKDRTSIDDFEIIKPISRGAFGRVFLAKKRTTGDLFAIKV 695
            ++  ++D VRSLR SP++  +KDRTSI+DFEIIKPISRGAFGRVFLAKKR TGDLFAIKV
Sbjct: 772  ESSADEDTVRSLRASPLNPRAKDRTSIEDFEIIKPISRGAFGRVFLAKKRATGDLFAIKV 831

Query: 696  LKKADMIRKNAVESILAERNILISVRNPFVVRFFYSFTCRDNLYLVMEYLNGGDLYSLLR 755
            LKKADMIRKNAVESILAERNILISVRNPFVVRFFYSFTCR+NLYLVMEYLNGGDL+SLLR
Sbjct: 832  LKKADMIRKNAVESILAERNILISVRNPFVVRFFYSFTCRENLYLVMEYLNGGDLFSLLR 891

Query: 756  NLGCLDEEVARVYIAEVVLALEYLHSLGVVHRDLKPDNLLIAHDGHIKLTDFGLSKVGLI 815
            NLGCLDE++AR+YIAEVVLALEYLHS+ ++HRDLKPDNLLI  DGHIKLTDFGLSKVGLI
Sbjct: 892  NLGCLDEDMARIYIAEVVLALEYLHSVNIIHRDLKPDNLLINQDGHIKLTDFGLSKVGLI 951

Query: 816  NSTDDLSGPAVSGTTLLGYDEPTMSASEHQQ--ERRKKRSAVGTPDYLAPEILLGTGHGA 875
            NSTDDLSG +  G +  G+     S ++H Q  + RKK + VGTPDYLAPEILLG GHG 
Sbjct: 952  NSTDDLSGESSLGNS--GFFAEDGSKAQHSQGKDSRKKHAVVGTPDYLAPEILLGMGHGK 1011

Query: 876  TADWWSVGIILFELIVGIPPFNAEHPQTIFDNILNRKIPWPQIPEEMSHDAQDLIDRLLT 935
            TADWWSVG+ILFE++VGIPPFNAE PQ IF+NI+NR IPWP +PEE+S++A DLI++LLT
Sbjct: 1012 TADWWSVGVILFEVLVGIPPFNAETPQQIFENIINRDIPWPNVPEEISYEAHDLINKLLT 1071

Query: 936  EDPHQRLGAIGASEVKQHMFFKDINWDTLARQKAAFVPTSESALDTSYFTSRYSWNHSDD 995
            E+P QRLGA GA EVKQH FFKDINWDTLARQKA FVP++E   DTSYF SRY WN  D+
Sbjct: 1072 ENPVQRLGATGAGEVKQHHFFKDINWDTLARQKAMFVPSAEPQ-DTSYFMSRYIWNPEDE 1131

Query: 996  HVYPHSELEDSSDADSLSGDSCLSNRQDEVGDECGGLTDF--EPGASVNYSFSNFSFKNL 1055
            +V+  S+ +D +D  S S      N Q+E GDECG L +F   P  +V YSFSNFSFKNL
Sbjct: 1132 NVHGGSDFDDLTDTCSSSS----FNTQEEDGDECGSLAEFGNGPNLAVKYSFSNFSFKNL 1154

Query: 1056 SQLASINYDLLSKGLKD 1061
            SQLASINYDL+ K  K+
Sbjct: 1192 SQLASINYDLVLKNAKE 1154

BLAST of CSPI01G05030 vs. Swiss-Prot
Match: IRE4_ARATH (Probable serine/threonine protein kinase IRE4 OS=Arabidopsis thaliana GN=IRE4 PE=2 SV=1)

HSP 1 Score: 639.4 bits (1648), Expect = 6.9e-182
Identity = 352/647 (54.40%), Positives = 449/647 (69.40%), Query Frame = 1

Query: 376  IICRICEEEIPTANVEDHSRICAVADRCDQKGISVNERLLRIAETLEKMIESFTQKDSQH 435
            +ICRICEEE+P  ++E HS ICA AD+C+   + V+ERLL++ E LE++I+S +      
Sbjct: 400  VICRICEEEVPLFHLEPHSYICAYADKCEINCVDVDERLLKLEEILEQIIDSRSLNSFTQ 459

Query: 436  VGSPDVAKVSNSSMTEESDISPKHSDWSRRGSEDMLDCFHETDSSVFMDDLKGLPSMSCK 495
             G  + + +  S +  E   SPK ++W  +G E M +  HE D++ F+D+    P +  K
Sbjct: 460  AGGLENSVLRKSGVASEG-CSPKINEWRNKGLEGMFEDLHEMDTA-FIDESYTYP-IHLK 519

Query: 496  TRFGPKSDQGMTTSSAGSMTPRSPLLTPRTSQFDLFLAGKGAYYEHDDLTQISDLADIAR 555
            +  G K     T+SS GS+T  S   TPRTS FD +   +    E +DL  + DL+DIAR
Sbjct: 520  SHVGAKFCHHATSSSTGSITSVSSTNTPRTSHFDSYWLERHCP-EQEDLRLMMDLSDIAR 579

Query: 556  CSANTPLGDDCSMQYLLTCLEDLRVVINRRKFNALTVDTFGTRIEKLIREKYLQLCELVD 615
            C A+T    + S  Y++ C++D++ V+ + K  AL +DTFG RIEKL+ EKYL   EL  
Sbjct: 580  CGASTDFSKEGSCDYIMACMQDIQAVLKQGKLKALVIDTFGGRIEKLLCEKYLHARELTA 639

Query: 616  DEKIDIASTVIDEDTPLEDDIVRSLRTSPIHSSKDRTSIDDFEIIKPISRGAFGRVFLAK 675
            D+     S+V   +    +D++     +P    KDR SIDDFEIIKPISRGAFG+VFLA+
Sbjct: 640  DK-----SSV--GNIKESEDVLEHASATPQLLLKDRISIDDFEIIKPISRGAFGKVFLAR 699

Query: 676  KRTTGDLFAIKVLKKADMIRKNAVESILAERNILISVRNPFVVRFFYSFTCRDNLYLVME 735
            KRTTGD FAIKVLKK DMIRKN +E IL ERNILI+VR PF+VRFFYSFTCRDNLYLVME
Sbjct: 700  KRTTGDFFAIKVLKKLDMIRKNDIERILQERNILITVRYPFLVRFFYSFTCRDNLYLVME 759

Query: 736  YLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLGVVHRDLKPDNLLIAHDGHIK 795
            YLNGGDLYSLL+ +GCLDEE+AR+YIAE+VLALEYLHSL +VHRDLKPDNLLIA++GHIK
Sbjct: 760  YLNGGDLYSLLQKVGCLDEEIARIYIAELVLALEYLHSLKIVHRDLKPDNLLIAYNGHIK 819

Query: 796  LTDFGLSKVGLINSTDDLSG--PAVSGTTLLGYDEPTMSASEHQQERRKKRSAVGTPDYL 855
            LTDFGLSK+GLIN+T DLSG    VS  T       +    ++Q+E R + SAVGTPDYL
Sbjct: 820  LTDFGLSKIGLINNTIDLSGHESDVSPRT------NSHHFQKNQEEERIRHSAVGTPDYL 879

Query: 856  APEILLGTGHGATADWWSVGIILFELIVGIPPFNAEHPQTIFDNILNRKIPWPQIPEEMS 915
            APEILLGT HG  ADWWS GI+LFEL+ GIPPF A  P+ IFDNILN K+PWP +P EMS
Sbjct: 880  APEILLGTEHGYAADWWSAGIVLFELLTGIPPFTASRPEKIFDNILNGKMPWPDVPGEMS 939

Query: 916  HDAQDLIDRLLTEDPHQRLGAIGASEVKQHMFFKDINWDTLARQKAAFVPTSESALDTSY 975
            ++AQDLI+RLL  +P +RLGA GA+EVK H FF+ ++W+ LA QKAAFVP  ES  DTSY
Sbjct: 940  YEAQDLINRLLVHEPEKRLGANGAAEVKSHPFFQGVDWENLALQKAAFVPQPESINDTSY 999

Query: 976  FTSRYSWNHSDDHVYPHSELEDSSDADSLSGDSCLSNRQDEVGDECG 1021
            F SR+S               +SS +D+ +G++  SN   + GDE G
Sbjct: 1000 FVSRFS---------------ESSCSDTETGNNSGSN--PDSGDEVG 1012

BLAST of CSPI01G05030 vs. Swiss-Prot
Match: Y0701_DICDI (Probable serine/threonine-protein kinase DDB_G0272282 OS=Dictyostelium discoideum GN=DDB_G0272282 PE=3 SV=1)

HSP 1 Score: 349.4 bits (895), Expect = 1.4e-94
Identity = 211/456 (46.27%), Positives = 266/456 (58.33%), Query Frame = 1

Query: 639  SLRTSPIHSSKDRTSIDDFEIIKPISRGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNA 698
            SL TS   S+    SI DFEIIKPISRGAFGRV+LA+K+ TGDL+AIKVLKK D IRKN 
Sbjct: 1512 SLTTS---SNTTSISIADFEIIKPISRGAFGRVYLAQKKKTGDLYAIKVLKKLDTIRKNM 1571

Query: 699  VESILAERNILISVRNPFVVRFFYSFTCRDNLYLVMEYLNGGDLYSLLRNLGCLDEEVAR 758
            V  ++ ERNIL  V+N FVV+ FY+F   D LYLVMEYL GGD  SLLR LGC +E +A+
Sbjct: 1572 VNHVIVERNILAMVQNEFVVKLFYAFQSTDKLYLVMEYLIGGDCASLLRALGCFEEHMAK 1631

Query: 759  VYIAEVVLALEYLHSLGVVHRDLKPDNLLIAHDGHIKLTDFGLSKVGLINST-------- 818
             YIAE VL LEYLH   +VHRDLKPDN+LI   GHIKLTDFGLSK+G+I+          
Sbjct: 1632 HYIAETVLCLEYLHKSAIVHRDLKPDNMLIDGLGHIKLTDFGLSKIGIIDDKKMEDSGNT 1691

Query: 819  ------------------DDLS---GPAVSGTTLLGYDEPTMSASEHQQERRKK------ 878
                              DD S    P  +G T L   +  + +   Q++   K      
Sbjct: 1692 NTNTHFNFSTSPTNTSMMDDSSTTGNPNGNGNTSLNSSQTNILSPYPQRKNTLKTPLKKP 1751

Query: 879  -RSAVGTPDYLAPEILLGTGHGATADWWSVGIILFELIVGIPPFNAEHPQTIFDNIL--N 938
             +  VGTPDYL+PEILLGTGHG T DWW++GIIL+E + G PPFN + P+ IF +IL  +
Sbjct: 1752 VKKVVGTPDYLSPEILLGTGHGQTVDWWALGIILYEFLTGSPPFNDDTPELIFQHILHRD 1811

Query: 939  RKIPWPQIPEEMSHDAQDLIDRLLTEDPHQRLGAIGASEVKQHMFFKDINWDTLARQKA- 998
            R++ W   PEE+S +A+DLI +LL  DP++RLGA GA EVK H FF ++NWDTL  Q+  
Sbjct: 1812 REMEW---PEEISSEAKDLILKLLNPDPYKRLGANGAYEVKTHPFFANVNWDTLIDQEMD 1871

Query: 999  -AFVPTSESALDTSYFTSRYSW------------NHSDDHVYPHSELEDSSDADSLSGDS 1043
              F+P  E+  DT YF  R S             N S        + +  S     S + 
Sbjct: 1872 NIFLPKPENNYDTDYFWDRQSMYDDEAEDDFLTINQSQPQHQSQHQSQPQSQPQPQSQNL 1931

BLAST of CSPI01G05030 vs. TrEMBL
Match: A0A061FX05_THECC (Kinase superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_012919 PE=4 SV=1)

HSP 1 Score: 1710.3 bits (4428), Expect = 0.0e+00
Identity = 870/1060 (82.08%), Positives = 951/1060 (89.72%), Query Frame = 1

Query: 10   SSWGFSEGLRNLDFCTPEPSYDCENPKESESPRFQAILRVTSAPRRKYPADIKSFSHELN 69
            SSWG S GL++ DFCTPE SYDCENPKESESPRFQAILRVTS PR+++PADIKSFSHELN
Sbjct: 234  SSWGHSGGLKSSDFCTPETSYDCENPKESESPRFQAILRVTSGPRKRFPADIKSFSHELN 293

Query: 70   SKGVRPFPLWKSRRLNNLEEILVMIRAKFDKAKEEVNSDLAIFAADLVGVLEKNVDTHPE 129
            SKGVRPFPLWK RRLNNLEEIL+ IRAKFDKAKEEVN+DLAIFAADLVG+LEKN ++HPE
Sbjct: 294  SKGVRPFPLWKPRRLNNLEEILIAIRAKFDKAKEEVNADLAIFAADLVGILEKNAESHPE 353

Query: 130  WQETIEDLLVLARSCAMSSPGEFWLQCESIVQELDDRRQELPPGMLKQLHTRILFILTRC 189
            WQETIEDLLVLARSCAM+ PGEFWLQCE IVQELDD+RQELPPG LKQL+T++LFILTRC
Sbjct: 354  WQETIEDLLVLARSCAMTPPGEFWLQCEGIVQELDDKRQELPPGTLKQLYTKMLFILTRC 413

Query: 190  TRLLQFHKESGLAEDENVFQLRQSRNLHSADKRTAPAMGREMKSSSAAKASKGT---SSR 249
            TRLLQFHKESGLAEDE V QLRQSR LH  DKRT+  + RE KS SA+KASK +   SS+
Sbjct: 414  TRLLQFHKESGLAEDEPVIQLRQSRILHPVDKRTSSGVLREAKSLSASKASKSSKAASSK 473

Query: 250  KSYSQEQRGLDWSREHDILPGNNVSTSPDDTSKNLVSPTGRDRMASWKRLPSPAKSSVKE 309
            K+YSQEQ  LDW R+H +LPG  ++ + DDT KNL SP  RDR+ASWK+LPSPAK   KE
Sbjct: 474  KAYSQEQHALDWKRDHVVLPGGLIAPT-DDTPKNLESPASRDRIASWKKLPSPAKKGPKE 533

Query: 310  STLKVQGDNKKFKSLNMSKIRMSVSETDFAAAKVSELPLPRDSHDQPTKHQHKPSWG-WG 369
                 + ++ K ++L     R   S+ D AA K+ ELP  ++S +  +KHQHK SWG WG
Sbjct: 534  VIASKEQNDNKIETLK----RRGASDVDLAAMKLQELPPAKESQEHSSKHQHKVSWGYWG 593

Query: 370  DQPTVSDESSIICRICEEEIPTANVEDHSRICAVADRCDQKGISVNERLLRIAETLEKMI 429
            DQP VS+ESSIICRICEEE+ T+NVEDHSRICAVADRCDQKG+SV+ERL+RIAETLEKM 
Sbjct: 594  DQPNVSEESSIICRICEEEVATSNVEDHSRICAVADRCDQKGLSVDERLVRIAETLEKMT 653

Query: 430  ESFTQKDSQHVGSPDVAKVSNSSMTEESDI-SPKHSDWSRRGSEDMLDCFHETDSSVFMD 489
            +SF  KD QHVGSPD AKVSNSS+TEESD+ SPK SDWSRRGSEDMLDCF E D+SVFMD
Sbjct: 654  DSFANKDIQHVGSPDGAKVSNSSVTEESDVLSPKLSDWSRRGSEDMLDCFPEADNSVFMD 713

Query: 490  DLKGLPSMSCKTRFGPKSDQGMTTSSAGSMTPRSPLLTPRTSQFDLFLAGKGAYYEHDDL 549
            DLKGLPSMSCKTRFGPKSDQGMTTSSAGSMTPRSPLLTPRTSQ DL L+GKGA+ E +DL
Sbjct: 714  DLKGLPSMSCKTRFGPKSDQGMTTSSAGSMTPRSPLLTPRTSQIDLLLSGKGAFSEQEDL 773

Query: 550  TQISDLADIARCSANTPLGDDCSMQYLLTCLEDLRVVINRRKFNALTVDTFGTRIEKLIR 609
             Q+++LADIARC ANTPL DD SM +LL+ LE+LR+VI+RRKF+ALTV+TFG RIEKLIR
Sbjct: 774  PQMNELADIARCVANTPLVDDHSMPFLLSFLEELRLVIDRRKFDALTVETFGARIEKLIR 833

Query: 610  EKYLQLCELVDDEKIDIASTVIDEDTPLEDDIVRSLRTSPIHSSKDRTSIDDFEIIKPIS 669
            EKYLQLCELVDDEK+DI STVIDED PLEDD+VRSLRTSP HSS+DRT+IDDFEIIKPIS
Sbjct: 834  EKYLQLCELVDDEKVDITSTVIDEDAPLEDDVVRSLRTSPNHSSRDRTTIDDFEIIKPIS 893

Query: 670  RGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAERNILISVRNPFVVRFFYSF 729
            RGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAER+ILISVRNPFVVRFFYSF
Sbjct: 894  RGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAERDILISVRNPFVVRFFYSF 953

Query: 730  TCRDNLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLGVVHRDLKPD 789
            TCR+NLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSL VVHRDLKPD
Sbjct: 954  TCRENLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLHVVHRDLKPD 1013

Query: 790  NLLIAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGTTLLGYDEPTMSASEHQQERRKKR 849
            NLLIAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGT+LL  ++P +SASEHQQERRKKR
Sbjct: 1014 NLLIAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGTSLLDDEQPQLSASEHQQERRKKR 1073

Query: 850  SAVGTPDYLAPEILLGTGHGATADWWSVGIILFELIVGIPPFNAEHPQTIFDNILNRKIP 909
            SAVGTPDYLAPEILLGTGHGATADWWSVG+ILFELIVGIPPFNAEHPQTIFDNILNRKIP
Sbjct: 1074 SAVGTPDYLAPEILLGTGHGATADWWSVGVILFELIVGIPPFNAEHPQTIFDNILNRKIP 1133

Query: 910  WPQIPEEMSHDAQDLIDRLLTEDPHQRLGAIGASEVKQHMFFKDINWDTLARQKAAFVPT 969
            WP++ EEMS +A+DLIDRLLTEDPHQRLGA GASEVKQH+FFKDINWDTLARQKAAFVPT
Sbjct: 1134 WPRVSEEMSLEAKDLIDRLLTEDPHQRLGARGASEVKQHVFFKDINWDTLARQKAAFVPT 1193

Query: 970  SESALDTSYFTSRYSWNHSDDHVYPHSELEDSSDADSLSG-DSCLSNRQDEVGDECGGLT 1029
            SESALDTSYFTSRYSWN SDDH YP SE +DSSDADSLSG  SCLSNRQDEVGDECGGL 
Sbjct: 1194 SESALDTSYFTSRYSWNTSDDHAYPGSEFDDSSDADSLSGSSSCLSNRQDEVGDECGGLA 1253

Query: 1030 DFEPGASVNYSFSNFSFKNLSQLASINYDLLSKGLKDDPP 1064
            +FE G+SVNYSFSNFSFKNLSQLASINYDLLSKG KDD P
Sbjct: 1254 EFESGSSVNYSFSNFSFKNLSQLASINYDLLSKGWKDDHP 1288

BLAST of CSPI01G05030 vs. TrEMBL
Match: B9T5A7_RICCO (Kinase, putative OS=Ricinus communis GN=RCOM_0054220 PE=4 SV=1)

HSP 1 Score: 1704.5 bits (4413), Expect = 0.0e+00
Identity = 876/1056 (82.95%), Positives = 948/1056 (89.77%), Query Frame = 1

Query: 11   SWGFSEGLRNLDFCTPEPSYDCENPKESESPRFQAILRVTSAPRRKYPADIKSFSHELNS 70
            SWG S GLR+ D  TPE +YDCENPKESESPRFQAILRVTSAPR+++PADIKSFSHELNS
Sbjct: 231  SWGHSGGLRSSDVLTPE-TYDCENPKESESPRFQAILRVTSAPRKRFPADIKSFSHELNS 290

Query: 71   KGVRPFPLWKSRRLNNLEEILVMIRAKFDKAKEEVNSDLAIFAADLVGVLEKNVDTHPEW 130
            KGVRPFP WK R LNNLEEILV+IRAKFDKAKEEVNSDLAIFAADLVGVLEKN ++HPEW
Sbjct: 291  KGVRPFPFWKPRGLNNLEEILVVIRAKFDKAKEEVNSDLAIFAADLVGVLEKNAESHPEW 350

Query: 131  QETIEDLLVLARSCAMSSPGEFWLQCESIVQELDDRRQELPPGMLKQLHTRILFILTRCT 190
            QETIEDLLVLARSCAMSSP EFWLQCESIVQELDDRRQELPPGMLKQLHTR+LFILTRCT
Sbjct: 351  QETIEDLLVLARSCAMSSPSEFWLQCESIVQELDDRRQELPPGMLKQLHTRMLFILTRCT 410

Query: 191  RLLQFHKESGLAEDENVFQLRQSRNLHSADKRTAPAMGREMKSSSAAKASKGTSSRKSYS 250
            RLLQFHKESGLAEDENVFQLRQSR LHSA+KR  P++ R+ KSSSAAKASK  S++KSYS
Sbjct: 411  RLLQFHKESGLAEDENVFQLRQSRLLHSAEKRIPPSIVRDGKSSSAAKASKAASAKKSYS 470

Query: 251  QEQRGLDWSREHDILPGNNVSTSPDDTSKNLVSPTGRDRMASWKRLPSPAKSSVKESTLK 310
            QEQ GLDW R+     G+++ T+ DD SKN+ SP    RMASWKRLPSPA  SVKE    
Sbjct: 471  QEQHGLDWKRDQVAQLGSSLPTA-DDASKNMDSPGSGARMASWKRLPSPAGKSVKEVAPS 530

Query: 311  VQGDNKKFKSLNMSKIRMSVSETDFAAAKVSELPLPRDSHDQPTKHQHKPSWG-WGDQPT 370
             + ++ K + L +   R  VS+ D  A K+SELP+ +DSH+   KHQHK SWG WGDQ  
Sbjct: 531  KENNDCKIEPLKILNNRKGVSDADLTATKLSELPVAKDSHEHSMKHQHKISWGYWGDQQN 590

Query: 371  VSDESSIICRICEEEIPTANVEDHSRICAVADRCDQKGISVNERLLRIAETLEKMIESFT 430
            VSD++SIICRICEEE+PT +VEDHSRICA+ADR DQKG+SVNERL RI+ETL+KMIES  
Sbjct: 591  VSDDTSIICRICEEEVPTLHVEDHSRICAIADRSDQKGLSVNERLARISETLDKMIESIA 650

Query: 431  QKDSQH-VGSPDVAKVSNSSMTEESDI-SPKHSDWSRRGSEDMLDCFHETDSSVFMDDLK 490
            QKD+Q  VGSPDVAKVSNSS+TEESD+ SPK SDWSRRGSEDMLDCF E D+SVFMDDLK
Sbjct: 651  QKDTQPAVGSPDVAKVSNSSVTEESDVLSPKLSDWSRRGSEDMLDCFPEADNSVFMDDLK 710

Query: 491  GLPSMSCKTRFGPKSDQGMTTSSAGSMTPRSPLLTPRTSQFDLFLAGKGAYYEHDDLTQI 550
            GLPSMSCKTRFGPKSDQGM TSSAGSMTPRSPLLTPRTS  DL L GKGA+ EHDDL Q+
Sbjct: 711  GLPSMSCKTRFGPKSDQGMATSSAGSMTPRSPLLTPRTSPIDLLLTGKGAFSEHDDLPQM 770

Query: 551  SDLADIARCSANTPLGDDCSMQYLLTCLEDLRVVINRRKFNALTVDTFGTRIEKLIREKY 610
            ++LADIARC   TPL DD S+ YLL+CLEDLRVVI+RRKF+ALTV+TFGTRIEKLIREKY
Sbjct: 771  TELADIARCVVTTPLDDDRSIPYLLSCLEDLRVVIDRRKFDALTVETFGTRIEKLIREKY 830

Query: 611  LQLCELVDDEKIDIASTVIDEDTPLEDDIVRSLRTSPIHSSKDRTSIDDFEIIKPISRGA 670
            LQLCELV+DE++DI ST+IDED PLEDD+VRSLRTSPIHSSKDRTSIDDFEIIKPISRGA
Sbjct: 831  LQLCELVEDERVDITSTIIDEDAPLEDDVVRSLRTSPIHSSKDRTSIDDFEIIKPISRGA 890

Query: 671  FGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAERNILISVRNPFVVRFFYSFTCR 730
            FGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAER+ILISVRNPFVVRFFYSFTCR
Sbjct: 891  FGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAERDILISVRNPFVVRFFYSFTCR 950

Query: 731  DNLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLGVVHRDLKPDNLL 790
            +NLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSL VVHRDLKPDNLL
Sbjct: 951  ENLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLRVVHRDLKPDNLL 1010

Query: 791  IAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGTTLLGYDEPTMSASEHQQERRKKRSAV 850
            IAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGT++L  DEP +SASEHQ+ERRKKRSAV
Sbjct: 1011 IAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGTSMLEDDEPQLSASEHQRERRKKRSAV 1070

Query: 851  GTPDYLAPEILLGTGHGATADWWSVGIILFELIVGIPPFNAEHPQTIFDNILNRKIPWPQ 910
            GTPDYLAPEILLGTGHG TADWWSVG+ILFELIVGIPPFNAEHPQ IFDNILNRKIPWP+
Sbjct: 1071 GTPDYLAPEILLGTGHGTTADWWSVGVILFELIVGIPPFNAEHPQIIFDNILNRKIPWPR 1130

Query: 911  IPEEMSHDAQDLIDRLLTEDPHQRLGAIGASEVKQHMFFKDINWDTLARQKAAFVPTSES 970
            +PEEMS +AQDLIDRLLTEDP  RLGA GASEVKQH+FFKDINWDTLARQKAAFVP+SES
Sbjct: 1131 VPEEMSPEAQDLIDRLLTEDPEVRLGAGGASEVKQHVFFKDINWDTLARQKAAFVPSSES 1190

Query: 971  ALDTSYFTSRYSWNHSDDHVYPHSELEDSSDADSLSG-DSCLSNRQDEVGDECGGLTDFE 1030
            ALDTSYFTSRYSWN S D VYP S+ EDSSDADSLSG  SCLSNRQDEVGDECGGL +FE
Sbjct: 1191 ALDTSYFTSRYSWNTS-DQVYPTSDFEDSSDADSLSGSSSCLSNRQDEVGDECGGLAEFE 1250

Query: 1031 PGASVNYSFSNFSFKNLSQLASINYDLLSKGLKDDP 1063
             G+SVNYSFSNFSFKNLSQLASINYDLLSKG KDDP
Sbjct: 1251 SGSSVNYSFSNFSFKNLSQLASINYDLLSKGWKDDP 1283

BLAST of CSPI01G05030 vs. TrEMBL
Match: A0A067L7C1_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_22916 PE=4 SV=1)

HSP 1 Score: 1703.7 bits (4411), Expect = 0.0e+00
Identity = 878/1066 (82.36%), Positives = 952/1066 (89.31%), Query Frame = 1

Query: 6    GCYGSSWGFSEGLRNLDFCTPEPS--YDCENPKESESPRFQAILRVTSAPRRKYPADIKS 65
            G + SSW  S  LR+ D  TPE S  YDCENPKESESPRFQAILRVTSAPR+++PADIKS
Sbjct: 236  GRHESSWSRSGVLRSSDVFTPEVSETYDCENPKESESPRFQAILRVTSAPRKRFPADIKS 295

Query: 66   FSHELNSKGVRPFPLWKSRRLNNLEEILVMIRAKFDKAKEEVNSDLAIFAADLVGVLEKN 125
            FSHELNSKGVRPFP WK R LNNLEEILV+IRAKFDKAKEEVNSDLAIFAADLVG+LEKN
Sbjct: 296  FSHELNSKGVRPFPFWKPRGLNNLEEILVVIRAKFDKAKEEVNSDLAIFAADLVGILEKN 355

Query: 126  VDTHPEWQETIEDLLVLARSCAMSSPGEFWLQCESIVQELDDRRQELPPGMLKQLHTRIL 185
             ++HPEWQETIEDLLVLARSCAM+SP EFWLQCE IVQELDDRRQELPPGMLKQLHTR+L
Sbjct: 356  AESHPEWQETIEDLLVLARSCAMTSPSEFWLQCEGIVQELDDRRQELPPGMLKQLHTRML 415

Query: 186  FILTRCTRLLQFHKESGLAEDENVFQLRQSRNLHSADKRTAPAMGREMKSSSAAKASKGT 245
            FILTRCTRLLQFHKESGLAEDENVF LRQSR LHS DKR     GRE KSSSAAKASK  
Sbjct: 416  FILTRCTRLLQFHKESGLAEDENVFHLRQSRLLHSDDKRIPLGPGREGKSSSAAKASKTA 475

Query: 246  SSRKSYSQEQR-GLDWSREHDILPGNNVSTSPDDTSKNLVSPTGRDRMASWKRLPSPAKS 305
            S+RKSYSQEQ  GLDW+R+    PGN++ T+ D TSK++ SP  RDRMASWK+LPSP   
Sbjct: 476  STRKSYSQEQHHGLDWNRDQIAQPGNSLPTT-DGTSKSMDSPGSRDRMASWKKLPSPVAK 535

Query: 306  SVKESTLKVQGDNKKFKSLNMSKIRMSVSETDFAAAKVSELPLPRDSHDQPTKHQHKPSW 365
            ++K++ LK  G   K + L     R+ +S+ D  A K+SE+P  +DSH+  TKHQHK SW
Sbjct: 536  NMKDAPLKELGS--KVEPLKTLNSRIGISDADLVATKLSEIPTAKDSHEHSTKHQHKVSW 595

Query: 366  G-WGDQPTVSDESSIICRICEEEIPTANVEDHSRICAVADRCDQKGISVNERLLRIAETL 425
            G WGDQ  + DESSIICRICEEE+PT++VEDHSRICA+ADRCDQKG+SVNERL RI+ETL
Sbjct: 596  GYWGDQQNIFDESSIICRICEEEVPTSHVEDHSRICAIADRCDQKGLSVNERLARISETL 655

Query: 426  EKMIESFTQKDSQHV-GSPDVAKVSNSSMTEESDI-SPKHSDWSRRGSEDMLDCFHETDS 485
            EKMIE+F QKD QH  GSPDVAKVSNSS+TEESD+ SPK SDWSRRGSEDMLDCF E D+
Sbjct: 656  EKMIETFAQKDIQHAAGSPDVAKVSNSSVTEESDVLSPKLSDWSRRGSEDMLDCFPEADN 715

Query: 486  SVFMDDLKGLPSMSCKTRFGPKSDQGMTTSSAGSMTPRSPL--LTPRTSQFDLFLAGKGA 545
             +FMDDLKGLPSMSCKTRFGPKSDQGM TSSAGSMTPRSP   LTPRTSQ DL LAGKGA
Sbjct: 716  YIFMDDLKGLPSMSCKTRFGPKSDQGMATSSAGSMTPRSPSSSLTPRTSQIDLLLAGKGA 775

Query: 546  YYEHDDLTQISDLADIARCSANTPLGDDCSMQYLLTCLEDLRVVINRRKFNALTVDTFGT 605
            + E+DD+ Q+++LADIARC ANTPL DD SM YLLTCLEDLRVVI+RRKF+A TV+TFGT
Sbjct: 776  FSENDDIPQMNELADIARCVANTPLDDDRSMPYLLTCLEDLRVVIDRRKFDAHTVETFGT 835

Query: 606  RIEKLIREKYLQLCELVDDEKIDIASTVIDEDTPLEDDIVRSLRTSPIHSSKDRTSIDDF 665
            RIEKLIREKYLQLCELV+D+K+DI STVIDEDTPLEDD+VRSLRTSPIHS KDRTSIDDF
Sbjct: 836  RIEKLIREKYLQLCELVEDDKVDITSTVIDEDTPLEDDVVRSLRTSPIHS-KDRTSIDDF 895

Query: 666  EIIKPISRGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAERNILISVRNPFV 725
            EIIKPISRGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAER+ILISVRNPFV
Sbjct: 896  EIIKPISRGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAERDILISVRNPFV 955

Query: 726  VRFFYSFTCRDNLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLGVV 785
            VRFFYSFTCR+NLYLVMEYLNGGDLYSLLRNLGCLDE+VAR+YIAEVVLALEYLHSL VV
Sbjct: 956  VRFFYSFTCRENLYLVMEYLNGGDLYSLLRNLGCLDEDVARIYIAEVVLALEYLHSLRVV 1015

Query: 786  HRDLKPDNLLIAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGTTLLGYDEPTMSASEHQ 845
            HRDLKPDNLLIAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGT++L  DEP +S SE Q
Sbjct: 1016 HRDLKPDNLLIAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGTSMLVDDEPQVSTSEDQ 1075

Query: 846  QERRKKRSAVGTPDYLAPEILLGTGHGATADWWSVGIILFELIVGIPPFNAEHPQTIFDN 905
            Q+RRKKRSAVGTPDYLAPEILLGTGHG TADWWSVG+ILFELIVGIPPFNAEHPQ IFDN
Sbjct: 1076 QDRRKKRSAVGTPDYLAPEILLGTGHGTTADWWSVGVILFELIVGIPPFNAEHPQKIFDN 1135

Query: 906  ILNRKIPWPQIPEEMSHDAQDLIDRLLTEDPHQRLGAIGASEVKQHMFFKDINWDTLARQ 965
            ILNRKIPWP++PEEMS +A DLIDRLLTEDPHQRLGA GASEVKQH+FFKDINWDTLARQ
Sbjct: 1136 ILNRKIPWPRVPEEMSPEAWDLIDRLLTEDPHQRLGAGGASEVKQHVFFKDINWDTLARQ 1195

Query: 966  KAAFVPTSESALDTSYFTSRYSWNHSDDHVYPHSELEDSSDADSLSG-DSCLSNRQDEVG 1025
            KAAFVP+SESALDTSYFTSRYSWNHSDDHVYP S+ EDSSDADSLSG  SCLSNRQDEVG
Sbjct: 1196 KAAFVPSSESALDTSYFTSRYSWNHSDDHVYPASDFEDSSDADSLSGSSSCLSNRQDEVG 1255

Query: 1026 DECGGLTDFEPGASVNYSFSNFSFKNLSQLASINYDLLSKGLKDDP 1063
            DECGGL +FE G+SVNYSFSNFSFKNLSQLASINYDLLSKG KDDP
Sbjct: 1256 DECGGLAEFESGSSVNYSFSNFSFKNLSQLASINYDLLSKGWKDDP 1297

BLAST of CSPI01G05030 vs. TrEMBL
Match: A0A061FWF8_THECC (Kinase superfamily protein isoform 3 OS=Theobroma cacao GN=TCM_012919 PE=4 SV=1)

HSP 1 Score: 1703.7 bits (4411), Expect = 0.0e+00
Identity = 869/1060 (81.98%), Positives = 950/1060 (89.62%), Query Frame = 1

Query: 10   SSWGFSEGLRNLDFCTPEPSYDCENPKESESPRFQAILRVTSAPRRKYPADIKSFSHELN 69
            SSWG S GL++ DFCTPE SYDCENPKESESPRFQAILRVTS PR+++PADIKSFSHELN
Sbjct: 234  SSWGHSGGLKSSDFCTPETSYDCENPKESESPRFQAILRVTSGPRKRFPADIKSFSHELN 293

Query: 70   SKGVRPFPLWKSRRLNNLEEILVMIRAKFDKAKEEVNSDLAIFAADLVGVLEKNVDTHPE 129
            SKGVRPFPLWK RRLNNLEEIL+ IRAKFDKAKEEVN+DLAIFAADLVG+LEKN ++HPE
Sbjct: 294  SKGVRPFPLWKPRRLNNLEEILIAIRAKFDKAKEEVNADLAIFAADLVGILEKNAESHPE 353

Query: 130  WQETIEDLLVLARSCAMSSPGEFWLQCESIVQELDDRRQELPPGMLKQLHTRILFILTRC 189
            WQETIEDLLVLARSCAM+ PGEFWLQCE IVQELDD+RQELPPG LKQL+T++LFILTRC
Sbjct: 354  WQETIEDLLVLARSCAMTPPGEFWLQCEGIVQELDDKRQELPPGTLKQLYTKMLFILTRC 413

Query: 190  TRLLQFHKESGLAEDENVFQLRQSRNLHSADKRTAPAMGREMKSSSAAKASKGT---SSR 249
            TRLLQFHKESGLAEDE V QLRQSR LH  DKRT+  + RE KS SA+KASK +   SS+
Sbjct: 414  TRLLQFHKESGLAEDEPVIQLRQSRILHPVDKRTSSGVLREAKSLSASKASKSSKAASSK 473

Query: 250  KSYSQEQRGLDWSREHDILPGNNVSTSPDDTSKNLVSPTGRDRMASWKRLPSPAKSSVKE 309
            K+YSQEQ  LDW R+H +LPG  ++ + DDT KNL SP  RDR+ASWK+LPSPAK   KE
Sbjct: 474  KAYSQEQHALDWKRDHVVLPGGLIAPT-DDTPKNLESPASRDRIASWKKLPSPAKKGPKE 533

Query: 310  STLKVQGDNKKFKSLNMSKIRMSVSETDFAAAKVSELPLPRDSHDQPTKHQHKPSWG-WG 369
                 + ++ K ++L     R   S+ D AA K+ ELP  ++S +  +KHQHK SWG WG
Sbjct: 534  VIASKEQNDNKIETLK----RRGASDVDLAAMKLQELPPAKESQEHSSKHQHKVSWGYWG 593

Query: 370  DQPTVSDESSIICRICEEEIPTANVEDHSRICAVADRCDQKGISVNERLLRIAETLEKMI 429
            DQP VS+ESSIICRICEEE+ T+NVEDHSRICAVADRCDQKG+SV+ERL+RIAETLEKM 
Sbjct: 594  DQPNVSEESSIICRICEEEVATSNVEDHSRICAVADRCDQKGLSVDERLVRIAETLEKMT 653

Query: 430  ESFTQKDSQHVGSPDVAKVSNSSMTEESDI-SPKHSDWSRRGSEDMLDCFHETDSSVFMD 489
            +SF  KD QHVGSPD AKVSNSS+TEESD+ SPK SDWSRRGSEDMLDCF E D+SVFMD
Sbjct: 654  DSFANKDIQHVGSPDGAKVSNSSVTEESDVLSPKLSDWSRRGSEDMLDCFPEADNSVFMD 713

Query: 490  DLKGLPSMSCKTRFGPKSDQGMTTSSAGSMTPRSPLLTPRTSQFDLFLAGKGAYYEHDDL 549
            DLKGLPSMSCKTRFGPKSDQGMTTSSAGSMTPRSPLLTPRTSQ DL L+GKGA+ E +DL
Sbjct: 714  DLKGLPSMSCKTRFGPKSDQGMTTSSAGSMTPRSPLLTPRTSQIDLLLSGKGAFSEQEDL 773

Query: 550  TQISDLADIARCSANTPLGDDCSMQYLLTCLEDLRVVINRRKFNALTVDTFGTRIEKLIR 609
             Q+++LADIARC ANTPL DD SM +LL+ LE+LR+VI+RRKF+ALTV+TFG RIEKLIR
Sbjct: 774  PQMNELADIARCVANTPLVDDHSMPFLLSFLEELRLVIDRRKFDALTVETFGARIEKLIR 833

Query: 610  EKYLQLCELVDDEKIDIASTVIDEDTPLEDDIVRSLRTSPIHSSKDRTSIDDFEIIKPIS 669
            EKYLQLCELVDDEK+DI STVIDED PLEDD+VRSLRTSP HSS+DRT+IDDFEIIKPIS
Sbjct: 834  EKYLQLCELVDDEKVDITSTVIDEDAPLEDDVVRSLRTSPNHSSRDRTTIDDFEIIKPIS 893

Query: 670  RGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAERNILISVRNPFVVRFFYSF 729
            RGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAER+ILISVRNPFVVRFFYSF
Sbjct: 894  RGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAERDILISVRNPFVVRFFYSF 953

Query: 730  TCRDNLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLGVVHRDLKPD 789
            TCR+NLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSL VVHRDLKPD
Sbjct: 954  TCRENLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLHVVHRDLKPD 1013

Query: 790  NLLIAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGTTLLGYDEPTMSASEHQQERRKKR 849
            NLLIAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGT+LL  ++P +SASEHQQERRKKR
Sbjct: 1014 NLLIAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGTSLLDDEQPQLSASEHQQERRKKR 1073

Query: 850  SAVGTPDYLAPEILLGTGHGATADWWSVGIILFELIVGIPPFNAEHPQTIFDNILNRKIP 909
            SAVGTPDYLAPEILLGTGHGATADWWSVG+ILFELIVGIPPFNAEHPQTIFDNILNRKIP
Sbjct: 1074 SAVGTPDYLAPEILLGTGHGATADWWSVGVILFELIVGIPPFNAEHPQTIFDNILNRKIP 1133

Query: 910  WPQIPEEMSHDAQDLIDRLLTEDPHQRLGAIGASEVKQHMFFKDINWDTLARQKAAFVPT 969
            WP++ EEMS +A+DLIDRLLTEDPHQRLGA GASEVKQH+FFKDINWDTLARQKAAFVPT
Sbjct: 1134 WPRVSEEMSLEAKDLIDRLLTEDPHQRLGARGASEVKQHVFFKDINWDTLARQKAAFVPT 1193

Query: 970  SESALDTSYFTSRYSWNHSDDHVYPHSELEDSSDADSLSG-DSCLSNRQDEVGDECGGLT 1029
            SESALDTSYFTSRYSWN SDDH YP SE +DSSDADSLSG  SCLSNRQDE GDECGGL 
Sbjct: 1194 SESALDTSYFTSRYSWNTSDDHAYPGSEFDDSSDADSLSGSSSCLSNRQDE-GDECGGLA 1253

Query: 1030 DFEPGASVNYSFSNFSFKNLSQLASINYDLLSKGLKDDPP 1064
            +FE G+SVNYSFSNFSFKNLSQLASINYDLLSKG KDD P
Sbjct: 1254 EFESGSSVNYSFSNFSFKNLSQLASINYDLLSKGWKDDHP 1287

BLAST of CSPI01G05030 vs. TrEMBL
Match: A0A061FV35_THECC (Kinase superfamily protein isoform 5 OS=Theobroma cacao GN=TCM_012919 PE=4 SV=1)

HSP 1 Score: 1699.1 bits (4399), Expect = 0.0e+00
Identity = 869/1061 (81.90%), Positives = 950/1061 (89.54%), Query Frame = 1

Query: 10   SSWGFSEGLRNLDFCTPEPSYDCENPKESESPRFQAILRVTSAPRRKYPADIKSFSHELN 69
            SSWG S GL++ DFCTPE SYDCENPKESESPRFQAILRVTS PR+++PADIKSFSHELN
Sbjct: 234  SSWGHSGGLKSSDFCTPETSYDCENPKESESPRFQAILRVTSGPRKRFPADIKSFSHELN 293

Query: 70   SKGVRPFPLWKSRRLNNLEEILVMIRAKFDKAKEEVNSDLAIFAADLVGVLEKNVDTHPE 129
            SKGVRPFPLWK RRLNNLEEIL+ IRAKFDKAKEEVN+DLAIFAADLVG+LEKN ++HPE
Sbjct: 294  SKGVRPFPLWKPRRLNNLEEILIAIRAKFDKAKEEVNADLAIFAADLVGILEKNAESHPE 353

Query: 130  WQETIEDLLVLARSCAMSSPGEFWLQCESIVQELDDRRQELPPGMLKQLHTRILFILTRC 189
            WQETIEDLLVLARSCAM+ PGEFWLQCE IVQELDD+RQELPPG LKQL+T++LFILTRC
Sbjct: 354  WQETIEDLLVLARSCAMTPPGEFWLQCEGIVQELDDKRQELPPGTLKQLYTKMLFILTRC 413

Query: 190  TRLLQFHKESGLAEDENVFQLRQSRNLHSADKRTAPAMGREMKSSSAAKASKGT---SSR 249
            TRLLQFHKESGLAEDE V QLRQSR LH  DKRT+  + RE KS SA+KASK +   SS+
Sbjct: 414  TRLLQFHKESGLAEDEPVIQLRQSRILHPVDKRTSSGVLREAKSLSASKASKSSKAASSK 473

Query: 250  KSYSQEQRGLDWSREHDILPGNNVSTSPDDTSKNLVSPTGRDRMASWKRLPSPAKSSVKE 309
            K+YSQEQ  LDW R+H +LPG  ++ + DDT KNL SP  RDR+ASWK+LPSPAK   KE
Sbjct: 474  KAYSQEQHALDWKRDHVVLPGGLIAPT-DDTPKNLESPASRDRIASWKKLPSPAKKGPKE 533

Query: 310  STLKVQGDNKKFKSLNMSKIRMSVSETDFAAAKVSELPLPRDSHDQPTKHQHKPSWG-WG 369
                 + ++ K ++L     R   S+ D AA K+ ELP  ++S +  +KHQHK SWG WG
Sbjct: 534  VIASKEQNDNKIETLK----RRGASDVDLAAMKLQELPPAKESQEHSSKHQHKVSWGYWG 593

Query: 370  DQPTVSDESSIICRICEEEIPTANVEDHSRICAVADRCDQKGISVNERLLRIAETLEKMI 429
            DQP VS+ESSIICRICEEE+ T+NVEDHSRICAVADRCDQKG+SV+ERL+RIAETLEKM 
Sbjct: 594  DQPNVSEESSIICRICEEEVATSNVEDHSRICAVADRCDQKGLSVDERLVRIAETLEKMT 653

Query: 430  ESFTQKDSQHVGSPDVAKVSNSSMTEESDI-SPKHSDWSRRGSEDMLDCFHETDSSVFMD 489
            +SF  KD QHVGSPD AKVSNSS+TEESD+ SPK SDWSRRGSEDMLDCF E D+SVFMD
Sbjct: 654  DSFANKDIQHVGSPDGAKVSNSSVTEESDVLSPKLSDWSRRGSEDMLDCFPEADNSVFMD 713

Query: 490  DLKGLPSMSCKTRFGPKSDQGMTTSSAGSMTPRSPLLTPRTSQFDLFLAGKGAYYEHDDL 549
            DLKGLPSMSCKTRFGPKSDQGMTTSSAGSMTPRSPLLTPRTSQ DL L+GKGA+ E +DL
Sbjct: 714  DLKGLPSMSCKTRFGPKSDQGMTTSSAGSMTPRSPLLTPRTSQIDLLLSGKGAFSEQEDL 773

Query: 550  TQISDLADIARCSANTPLGDDCSMQYLLTCLEDLRVVINRRKFNALTVDTFGTRIEKLIR 609
             Q+++LADIARC ANTPL DD SM +LL+ LE+LR+VI+RRKF+ALTV+TFG RIEKLIR
Sbjct: 774  PQMNELADIARCVANTPLVDDHSMPFLLSFLEELRLVIDRRKFDALTVETFGARIEKLIR 833

Query: 610  EKYLQLCELVDDEKIDIASTVIDEDTPLEDDIVRSLRTSPIHSSKDRTSIDDFEIIKPIS 669
            EKYLQLCELVDDEK+DI STVIDED PLEDD+VRSLRTSP HSS+DRT+IDDFEIIKPIS
Sbjct: 834  EKYLQLCELVDDEKVDITSTVIDEDAPLEDDVVRSLRTSPNHSSRDRTTIDDFEIIKPIS 893

Query: 670  RGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAERNILISVRNPFVVRFFYSF 729
            RGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAER+ILISVRNPFVVRFFYSF
Sbjct: 894  RGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAERDILISVRNPFVVRFFYSF 953

Query: 730  TCRDNLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLGVVHRDLKPD 789
            TCR+NLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSL VVHRDLKPD
Sbjct: 954  TCRENLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLHVVHRDLKPD 1013

Query: 790  NLLIAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGTTLLGYDEPTMSASEHQQERRKKR 849
            NLLIAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGT+LL  ++P +SASEHQQERRKKR
Sbjct: 1014 NLLIAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGTSLLDDEQPQLSASEHQQERRKKR 1073

Query: 850  SAVGTPDYLAPEILLGTGHGATADWWSVGIILFELIVGIPPFNAEHPQTIFDNILNRKIP 909
            SAVGTPDYLAPEILLGTGHGATADWWSVG+ILFELIVGIPPFNAEHPQTIFDNILNRKIP
Sbjct: 1074 SAVGTPDYLAPEILLGTGHGATADWWSVGVILFELIVGIPPFNAEHPQTIFDNILNRKIP 1133

Query: 910  WPQIPEEMSHDAQDLIDRLLTEDPHQRLGAIGASE-VKQHMFFKDINWDTLARQKAAFVP 969
            WP++ EEMS +A+DLIDRLLTEDPHQRLGA GASE VKQH+FFKDINWDTLARQKAAFVP
Sbjct: 1134 WPRVSEEMSLEAKDLIDRLLTEDPHQRLGARGASEVVKQHVFFKDINWDTLARQKAAFVP 1193

Query: 970  TSESALDTSYFTSRYSWNHSDDHVYPHSELEDSSDADSLSG-DSCLSNRQDEVGDECGGL 1029
            TSESALDTSYFTSRYSWN SDDH YP SE +DSSDADSLSG  SCLSNRQDE GDECGGL
Sbjct: 1194 TSESALDTSYFTSRYSWNTSDDHAYPGSEFDDSSDADSLSGSSSCLSNRQDE-GDECGGL 1253

Query: 1030 TDFEPGASVNYSFSNFSFKNLSQLASINYDLLSKGLKDDPP 1064
             +FE G+SVNYSFSNFSFKNLSQLASINYDLLSKG KDD P
Sbjct: 1254 AEFESGSSVNYSFSNFSFKNLSQLASINYDLLSKGWKDDHP 1288

BLAST of CSPI01G05030 vs. TAIR10
Match: AT3G17850.1 (AT3G17850.1 Protein kinase superfamily protein)

HSP 1 Score: 1492.2 bits (3862), Expect = 0.0e+00
Identity = 766/1069 (71.66%), Positives = 888/1069 (83.07%), Query Frame = 1

Query: 5    TGCYGSSWGFSEGLRNLDFCTPEPSYDCENPKESESPRFQAILRVTSAPRRKYPADIKSF 64
            TG    S G S  LRN DFCTPE SY+ ENPKESESPR+QA+LR+TSAPR+++P DIKSF
Sbjct: 238  TGRSEMSSGRSGPLRNSDFCTPENSYEWENPKESESPRYQALLRMTSAPRKRFPGDIKSF 297

Query: 65   SHELNSKGVRPFPLWKSRRLNNLEEILVMIRAKFDKAKEEVNSDLAIFAADLVGVLEKNV 124
            SHELNSKGVRPFPLWK RR NN+EE+L +IRAKF+KAKEEVNSDLA+FAADLVGVLEKN 
Sbjct: 298  SHELNSKGVRPFPLWKPRRSNNVEEVLNLIRAKFEKAKEEVNSDLAVFAADLVGVLEKNA 357

Query: 125  DTHPEWQETIEDLLVLARSCAMSSPGEFWLQCESIVQELDDRRQELPPGMLKQLHTRILF 184
            ++HPEW+ET EDLL+LARSCAM++PG+FWLQCE IVQ+LDDRRQELPPG+LKQLHTR+LF
Sbjct: 358  ESHPEWEETFEDLLILARSCAMTTPGDFWLQCEGIVQDLDDRRQELPPGVLKQLHTRMLF 417

Query: 185  ILTRCTRLLQFHKESGLAEDENVFQLRQSRNLHSADKRTAPAMGREMKSSSAAKASKGTS 244
            ILTRCTRLLQFHKES   E+E V QLRQSR LHS +K      GR   S SAAK     S
Sbjct: 418  ILTRCTRLLQFHKESW-GEEEQVVQLRQSRVLHSIEKIPPSGAGR---SYSAAKVP---S 477

Query: 245  SRKSYSQEQRGLDWSREHDILPGNNVSTSPDDTSKNLVSPTGRDRMASWKRLPSPAKSSV 304
            ++K+YSQEQ GLDW  +  +     ++   +   K   SP   DRM+SWK+LPSPA  +V
Sbjct: 478  TKKAYSQEQHGLDWKEDAVVRSVPPLAPPENYAIKESESPANIDRMSSWKKLPSPALKTV 537

Query: 305  KESTLKVQGDNKKFKSLNMSKIRMSVSETDFAAAKVSELPLPRDSHDQPTKHQHKPSWG- 364
            KE+    + ++ K +  N+   R      D AA  +   P  +DSH+  +KH+H  SWG 
Sbjct: 538  KEAPASEEQNDSKVEPPNIVGSRQG---RDDAAVAILNFPPAKDSHEHSSKHRHNISWGY 597

Query: 365  WGDQPTVSDESSIICRICEEEIPTANVEDHSRICAVADRCDQKGISVNERLLRIAETLEK 424
            WG+QP +S+ESSI+CRICEEE+PT +VEDHSR+C +AD+ DQKG+SV+ERL+ +A TL+K
Sbjct: 598  WGEQPLISEESSIMCRICEEEVPTTHVEDHSRVCTLADKYDQKGLSVDERLMAVAGTLDK 657

Query: 425  MIESFTQKDSQHVG-SPDVAKVSNSSMTEESDI-SPKHSDWSRRGSEDMLDCFHETDSSV 484
            + E+F  KDS     SPD  KVSNS +TEESD+ SP+ SDWSR+GSEDMLDCF E D+S+
Sbjct: 658  IAETFRHKDSLAAAESPDGMKVSNSHLTEESDVLSPRLSDWSRKGSEDMLDCFPEADNSI 717

Query: 485  FMDDLKGLPSMSCKTRFGPKSDQGMTTSSAGSMTPRSPLLTPRTSQFDLFLAGKGAYYEH 544
            FMDDL+GLP MSC+TRFGPKSDQGMTTSSA SMTPRSP+ TPR    +  L GKG +++ 
Sbjct: 718  FMDDLRGLPLMSCRTRFGPKSDQGMTTSSASSMTPRSPIPTPRPDPIEQILGGKGTFHDQ 777

Query: 545  DDLTQISDLADIARCSANTPLGDDCSMQYLLTCLEDLRVVINRRKFNALTVDTFGTRIEK 604
            DD+ Q+S+LADIA+C+A+   GDD S+ +LL+CLEDLRVVI+RRKF+ALTV+TFGTRIEK
Sbjct: 778  DDIPQMSELADIAKCAADAIPGDDQSIPFLLSCLEDLRVVIDRRKFDALTVETFGTRIEK 837

Query: 605  LIREKYLQLCELVDDEKIDIASTVIDEDTPLEDDIVRSLRTSPIHSSKDRTSIDDFEIIK 664
            LIREKY+ +CEL+DDEK+D+ STVIDED PLEDD+VRSLRTSP+H  +DRTSIDDFEIIK
Sbjct: 838  LIREKYVHMCELMDDEKVDLLSTVIDEDAPLEDDVVRSLRTSPVHP-RDRTSIDDFEIIK 897

Query: 665  PISRGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAERNILISVRNPFVVRFF 724
            PISRGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAER+ILI+VRNPFVVRFF
Sbjct: 898  PISRGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAERDILINVRNPFVVRFF 957

Query: 725  YSFTCRDNLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLGVVHRDL 784
            YSFTCRDNLYLVMEYLNGGDLYSLLRNLGCL+E++ RVYIAEVVLALEYLHS GVVHRDL
Sbjct: 958  YSFTCRDNLYLVMEYLNGGDLYSLLRNLGCLEEDIVRVYIAEVVLALEYLHSEGVVHRDL 1017

Query: 785  KPDNLLIAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGTTLLGYDEPTMSASEHQQERR 844
            KPDNLLIAHDGHIKLTDFGLSKVGLINSTDDL+GPAVSGT+LL  +E  ++ASE Q ERR
Sbjct: 1018 KPDNLLIAHDGHIKLTDFGLSKVGLINSTDDLAGPAVSGTSLLDEEESRLAASEEQLERR 1077

Query: 845  KKRSAVGTPDYLAPEILLGTGHGATADWWSVGIILFELIVGIPPFNAEHPQTIFDNILNR 904
            KKRSAVGTPDYLAPEILLGTGHGATADWWSVGIILFELIVGIPPFNAEHPQ IFDNILNR
Sbjct: 1078 KKRSAVGTPDYLAPEILLGTGHGATADWWSVGIILFELIVGIPPFNAEHPQQIFDNILNR 1137

Query: 905  KIPWPQIPEEMSHDAQDLIDRLLTEDPHQRLGAIGASEVKQHMFFKDINWDTLARQKAAF 964
            KIPWP +PEEMS +A D+IDR LTEDPHQRLGA GA+EVKQH+FFKDINWDTLARQKAAF
Sbjct: 1138 KIPWPHVPEEMSAEAHDIIDRFLTEDPHQRLGARGAAEVKQHIFFKDINWDTLARQKAAF 1197

Query: 965  VPTSESALDTSYFTSRYSWNHSDDHVYPHSELEDSSDADSLSGDS-CLSNRQDE-VGDEC 1024
            VP SESA+DTSYF SRYSWN SD+  +P  E+ D SDADS++  S C SN  +E   +EC
Sbjct: 1198 VPASESAIDTSYFRSRYSWNTSDEQFFPSGEVPDYSDADSMTNSSGCSSNHHEEGEAEEC 1257

Query: 1025 GGLTDFEPGASVNYSFSNFSFKNLSQLASINYDLLSKGLKDDP---PNH 1066
             G  +FE G  V+YSFSNFSFKNLSQLASINYDLLSKG KD+P   P+H
Sbjct: 1258 EGHAEFESGIPVDYSFSNFSFKNLSQLASINYDLLSKGWKDEPQQIPHH 1295

BLAST of CSPI01G05030 vs. TAIR10
Match: AT1G48490.1 (AT1G48490.1 Protein kinase superfamily protein)

HSP 1 Score: 1366.7 bits (3536), Expect = 0.0e+00
Identity = 709/1044 (67.91%), Positives = 845/1044 (80.94%), Query Frame = 1

Query: 22   DFCTPEPSYDCENPKESESPRFQAILRVTSAPRRKYPADIKSFSHELNSKGVRPFPLWKS 81
            + CTPE SYD ++PKES+SPR+QA+LR+TSAPR+++P DIKSFSHELNSKGVRPFPLWK 
Sbjct: 198  EVCTPENSYDLDDPKESDSPRYQALLRMTSAPRKRFPGDIKSFSHELNSKGVRPFPLWKP 257

Query: 82   RRLNNLEEILVMIRAKFDKAKEEVNSDLAIFAADLVGVLEKNVDTHPEWQETIEDLLVLA 141
            RRLNNLE+IL +IR KFDKAKEEVNSDL  F  DL+ + +KN ++HPE   TIEDLLVLA
Sbjct: 258  RRLNNLEDILNLIRTKFDKAKEEVNSDLFAFGGDLLDIYDKNKESHPELLVTIEDLLVLA 317

Query: 142  RSCAMSSPGEFWLQCESIVQELDDRRQELPPGMLKQLHTRILFILTRCTRLLQFHKESGL 201
            ++CA ++  EFWLQCE IVQ+LDDRRQELPPG+LKQLHTR+LFILTRCTRLLQFHKES  
Sbjct: 318  KTCAKTTSKEFWLQCEGIVQDLDDRRQELPPGVLKQLHTRMLFILTRCTRLLQFHKESW- 377

Query: 202  AEDENVFQLRQSRNLHSADKRTAPAMGREMKSSSAAKASKGTSSRKSYSQEQRGLDWSRE 261
             ++E+  QLRQS  LHSADKR      R+ K SS A A K  S++K+YSQEQRGL+W   
Sbjct: 378  GQEEDAVQLRQSGVLHSADKRDPTGEVRDGKGSSTANALKVPSTKKAYSQEQRGLNWIEG 437

Query: 262  HDILPGNNVSTSPDDTSKNLVSPTGRDRMASWKRLPSPAKSSVKESTLKVQGDNKKFKSL 321
              + P   +S+  ++TSK+  SP   D+M+SWKRLPSPA   V+E+ +  + +++K +  
Sbjct: 438  FFVRPAP-LSSPYNETSKDSESPANIDKMSSWKRLPSPASKGVQEAAVSKEQNDRKVEPP 497

Query: 322  NMSKIRMSVSETDFAAAKVSELPLPRDSHDQPTKHQHKPSWG-WGDQPTVSDESSIICRI 381
             + K  +++S+ D A AK+ E+   + S +  +K++H  SWG WG Q  +S+ESSIICRI
Sbjct: 498  QVVKKLVAISD-DMAVAKLPEVSSAKASQEHMSKNRHNISWGYWGHQSCISEESSIICRI 557

Query: 382  CEEEIPTANVEDHSRICAVADRCDQKGISVNERLLRIAETLEKMIESFTQKDS-QHVGSP 441
            CEEEIPT +VEDHSRICA+AD+ DQKG+ V+ERL+ +A TLEK+ ++  QKDS   V SP
Sbjct: 558  CEEEIPTTHVEDHSRICALADKYDQKGVGVDERLMAVAVTLEKITDNVIQKDSLAAVESP 617

Query: 442  DVAKVSNSSMTEESDI-SPKHSDWSRRGSEDMLDCFHETDSSVFMDDLKGLPSMSCKTRF 501
            +  K+SN+S+TEE D+ SPK SDWSRRGSEDMLDCF ETD+SVFMDD+  LPSMSC+TRF
Sbjct: 618  EGMKISNASLTEELDVLSPKLSDWSRRGSEDMLDCFPETDNSVFMDDMGCLPSMSCRTRF 677

Query: 502  GPKSDQGMTTSSAGSMTPRSPLLTPRTSQFDLFLAGKGAYYEHDDLTQISDLADIARCSA 561
            GPKSDQGM TSSAGSMTPRSP+ TPR    +L L GKG +++ DD  Q+S+LADIARC+A
Sbjct: 678  GPKSDQGMATSSAGSMTPRSPIPTPRPDPIELLLEGKGTFHDQDDFPQMSELADIARCAA 737

Query: 562  NTPLGDDCSMQYLLTCLEDLRVVINRRKFNALTVDTFGTRIEKLIREKYLQLCELVDDEK 621
            N    DD S+Q LL+CLEDLRVVI+RRKF+AL V+TFGTRIEKLI+EKYLQLCEL+DDEK
Sbjct: 738  NAIPVDDQSIQLLLSCLEDLRVVIDRRKFDALIVETFGTRIEKLIQEKYLQLCELMDDEK 797

Query: 622  IDIASTVIDEDTPLEDDIVRSLRTSPIHSSKDRTSIDDFEIIKPISRGAFGRVFLAKKRT 681
                 T+IDED PLEDD+VRSLRTSP+H  +DR SIDDFE++K ISRGAFG V LA+K T
Sbjct: 798  ----GTIIDEDAPLEDDVVRSLRTSPVHL-RDRISIDDFEVMKSISRGAFGHVILARKNT 857

Query: 682  TGDLFAIKVLKKADMIRKNAVESILAERNILISVRNPFVVRFFYSFTCRDNLYLVMEYLN 741
            TGDLFAIKVL+KADMIRKNAVESILAER+ILI+ RNPFVVRFFYSFTC +NLYLVMEYLN
Sbjct: 858  TGDLFAIKVLRKADMIRKNAVESILAERDILINARNPFVVRFFYSFTCSENLYLVMEYLN 917

Query: 742  GGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLGVVHRDLKPDNLLIAHDGHIKLTD 801
            GGD YS+LR +GCLDE  ARVYIAEVVLALEYLHS GVVHRDLKPDNLLIAHDGH+KLTD
Sbjct: 918  GGDFYSMLRKIGCLDEANARVYIAEVVLALEYLHSEGVVHRDLKPDNLLIAHDGHVKLTD 977

Query: 802  FGLSKVGLINSTDDLSGPAVSGTTLLGYDEPTMSASEHQQERRKKRSAVGTPDYLAPEIL 861
            FGLSKVGLIN+TDDLSGP  S T+LL  ++P +   +H      KRSAVGTPDYLAPEIL
Sbjct: 978  FGLSKVGLINNTDDLSGPVSSATSLLVEEKPKLPTLDH------KRSAVGTPDYLAPEIL 1037

Query: 862  LGTGHGATADWWSVGIILFELIVGIPPFNAEHPQTIFDNILNRKIPWPQIPEEMSHDAQD 921
            LGTGHGATADWWSVGIIL+E +VGIPPFNA+HPQ IFDNILNR I WP +PE+MSH+A+D
Sbjct: 1038 LGTGHGATADWWSVGIILYEFLVGIPPFNADHPQQIFDNILNRNIQWPPVPEDMSHEARD 1097

Query: 922  LIDRLLTEDPHQRLGAIGASEVKQHMFFKDINWDTLARQKAAFVPTSESALDTSYFTSRY 981
            LIDRLLTEDPHQRLGA GA+EVKQH FFKDI+W+TLA+QKAAFVP SE+A DTSYF SRY
Sbjct: 1098 LIDRLLTEDPHQRLGARGAAEVKQHSFFKDIDWNTLAQQKAAFVPDSENAFDTSYFQSRY 1157

Query: 982  SWNHSDDHVYPHSELEDSSDADSLSGDS-CLSNRQDEVGDECGGLTDFEPGASVNYSFSN 1041
            SWN+S +  +P +E EDSS+ DSL G S  LSN  DE  D   G  +FE   S NY F N
Sbjct: 1158 SWNYSGERCFPTNENEDSSEGDSLCGSSGRLSNHHDEGVDIPCGPAEFETSVSENYPFDN 1217

Query: 1042 FSFKNLSQLASINYDLLSKGLKDD 1062
            FSFKNLSQLA INY+L+SKG KD+
Sbjct: 1218 FSFKNLSQLAYINYNLMSKGHKDE 1227

BLAST of CSPI01G05030 vs. TAIR10
Match: AT5G62310.1 (AT5G62310.1 AGC (cAMP-dependent, cGMP-dependent and protein kinase C) kinase family protein)

HSP 1 Score: 1025.8 bits (2651), Expect = 1.9e-299
Identity = 581/1037 (56.03%), Positives = 723/1037 (69.72%), Query Frame = 1

Query: 36   KESESPRFQAILRVTSAPRRKYPADIKSFSHELNSKGVRPFPLWKSRRLNNLEEILVMIR 95
            KE++SPRFQAILRVTS  R+K   DIKSFSHELNSKGVRPFP+W+SR + ++EEI+  IR
Sbjct: 172  KETQSPRFQAILRVTSG-RKKKAHDIKSFSHELNSKGVRPFPVWRSRAVGHMEEIMAAIR 231

Query: 96   AKFDKAKEEVNSDLAIFAADLVGVLEKNVDTHPEWQETIEDLLVLARSCAMSSPGEFWLQ 155
             KFDK KE+V++DL +FA  LV  LE   +++ E +  +EDLLV AR CA     EFWL+
Sbjct: 232  TKFDKQKEDVDADLGVFAGYLVTTLESTPESNKELRVGLEDLLVEARQCATMPASEFWLK 291

Query: 156  CESIVQELDDRRQELPPGMLKQLHTRILFILTRCTRLLQFHKESGLAEDENVFQLRQSRN 215
            CE IVQ+LDD+RQELP G LKQ H R+LFILTRC RL+QF KESG  E E++  + Q  +
Sbjct: 292  CEGIVQKLDDKRQELPMGGLKQAHNRLLFILTRCNRLVQFRKESGYVE-EHILGMHQLSD 351

Query: 216  LHSADKRTAPAMGRE----MKSSSAAKASKGTSSRKSYSQEQRGLDWSREHDILPGNNVS 275
            L    ++      ++     K        +  + ++       G D           N +
Sbjct: 352  LGVYPEQMVEISRQQDLLREKEIQKINEKQNLAGKQDDQNSNSGADGVEV-------NTA 411

Query: 276  TSPDDTSKNLVSPTGRDRMASWKRLPSPA-KSSVKESTLKVQGDNKKFKSLNMSKIRMSV 335
             S D TS N        RM+SWK+LPS A K+    +T K +G+         SKI+  V
Sbjct: 412  RSTDSTSSNF-------RMSSWKKLPSAAEKNRSLNNTPKAKGE---------SKIQPKV 471

Query: 336  SETDFAAAKVSELPLPRDSHDQPTKHQHKPSWG-WGDQPTVSDESSIICRICEEEIPTAN 395
                +       L  P     QP        WG W D   V+ ++S+ICRICE EIP  +
Sbjct: 472  ----YGDENAENLHSPSG---QPASADRSALWGFWADHQCVTYDNSMICRICEVEIPVVH 531

Query: 396  VEDHSRICAVADRCDQKGISVNERLLRIAETLEKMIESFTQKDSQHVGS-PDVAKVSNSS 455
            VE+HSRIC +ADRCD KGI+VN RL R+AE+LEK++ES+T K S    +  D A++SNSS
Sbjct: 532  VEEHSRICTIADRCDLKGINVNLRLERVAESLEKILESWTPKSSVTPRAVADSARLSNSS 591

Query: 456  MTEESDISPKHSDWSRRGSEDMLDCFHETDSSVFMDDLKGLPSMSCKTRFGPKSDQGMTT 515
              E+ D      + S+R S+DMLDC   + ++  +D+L  L  MS           G   
Sbjct: 592  RQEDLD------EISQRCSDDMLDCVPRSQNTFSLDELNILNEMSMTN--------GTKD 651

Query: 516  SSAGSMTPRSPLLTPRTSQFDLFLAGKGAYYEHDDLTQISDLADIARCSANTPLGDDCSM 575
            SSAGS+TP SP  TPR SQ DL L+G+    E ++  QI+ L DIAR  AN  +    S+
Sbjct: 652  SSAGSLTPPSPA-TPRNSQVDLLLSGRKTISELENYQQINKLLDIARSVANVNVCGYSSL 711

Query: 576  QYLLTCLEDLRVVINRRKFNALTVDTFGTRIEKLIREKYLQLCELVDDEKIDIASTVIDE 635
             +++  L++L+ VI  RK +AL V+TFG RIEKL++EKY++LC L+DDEK+D ++ + DE
Sbjct: 712  DFMIEQLDELKYVIQDRKADALVVETFGRRIEKLLQEKYIELCGLIDDEKVDSSNAMPDE 771

Query: 636  DTPLEDDIVRSLRTSPIHS-SKDRTSIDDFEIIKPISRGAFGRVFLAKKRTTGDLFAIKV 695
            ++  ++D VRSLR SP++  +KDRTSI+DFEIIKPISRGAFGRVFLAKKR TGDLFAIKV
Sbjct: 772  ESSADEDTVRSLRASPLNPRAKDRTSIEDFEIIKPISRGAFGRVFLAKKRATGDLFAIKV 831

Query: 696  LKKADMIRKNAVESILAERNILISVRNPFVVRFFYSFTCRDNLYLVMEYLNGGDLYSLLR 755
            LKKADMIRKNAVESILAERNILISVRNPFVVRFFYSFTCR+NLYLVMEYLNGGDL+SLLR
Sbjct: 832  LKKADMIRKNAVESILAERNILISVRNPFVVRFFYSFTCRENLYLVMEYLNGGDLFSLLR 891

Query: 756  NLGCLDEEVARVYIAEVVLALEYLHSLGVVHRDLKPDNLLIAHDGHIKLTDFGLSKVGLI 815
            NLGCLDE++AR+YIAEVVLALEYLHS+ ++HRDLKPDNLLI  DGHIKLTDFGLSKVGLI
Sbjct: 892  NLGCLDEDMARIYIAEVVLALEYLHSVNIIHRDLKPDNLLINQDGHIKLTDFGLSKVGLI 951

Query: 816  NSTDDLSGPAVSGTTLLGYDEPTMSASEHQQ--ERRKKRSAVGTPDYLAPEILLGTGHGA 875
            NSTDDLSG +  G +  G+     S ++H Q  + RKK + VGTPDYLAPEILLG GHG 
Sbjct: 952  NSTDDLSGESSLGNS--GFFAEDGSKAQHSQGKDSRKKHAVVGTPDYLAPEILLGMGHGK 1011

Query: 876  TADWWSVGIILFELIVGIPPFNAEHPQTIFDNILNRKIPWPQIPEEMSHDAQDLIDRLLT 935
            TADWWSVG+ILFE++VGIPPFNAE PQ IF+NI+NR IPWP +PEE+S++A DLI++LLT
Sbjct: 1012 TADWWSVGVILFEVLVGIPPFNAETPQQIFENIINRDIPWPNVPEEISYEAHDLINKLLT 1071

Query: 936  EDPHQRLGAIGASEVKQHMFFKDINWDTLARQKAAFVPTSESALDTSYFTSRYSWNHSDD 995
            E+P QRLGA GA EVKQH FFKDINWDTLARQKA FVP++E   DTSYF SRY WN  D+
Sbjct: 1072 ENPVQRLGATGAGEVKQHHFFKDINWDTLARQKAMFVPSAEPQ-DTSYFMSRYIWNPEDE 1131

Query: 996  HVYPHSELEDSSDADSLSGDSCLSNRQDEVGDECGGLTDF--EPGASVNYSFSNFSFKNL 1055
            +V+  S+ +D +D  S S      N Q+E GDECG L +F   P  +V YSFSNFSFKNL
Sbjct: 1132 NVHGGSDFDDLTDTCSSSS----FNTQEEDGDECGSLAEFGNGPNLAVKYSFSNFSFKNL 1154

Query: 1056 SQLASINYDLLSKGLKD 1061
            SQLASINYDL+ K  K+
Sbjct: 1192 SQLASINYDLVLKNAKE 1154

BLAST of CSPI01G05030 vs. TAIR10
Match: AT1G45160.2 (AT1G45160.2 Protein kinase superfamily protein)

HSP 1 Score: 667.5 bits (1721), Expect = 1.3e-191
Identity = 375/691 (54.27%), Positives = 476/691 (68.89%), Query Frame = 1

Query: 376  IICRICEEEIPTANVEDHSRICAVADRCDQKGISVNERLLRIAETLEKMIESFTQKDSQH 435
            +ICRICEEE+P  ++E HS ICA AD+C+   + V+ERLL++ E LE++I+S +      
Sbjct: 400  VICRICEEEVPLFHLEPHSYICAYADKCEINCVDVDERLLKLEEILEQIIDSRSLNSFTQ 459

Query: 436  VGSPDVAKVSNSSMTEESDISPKHSDWSRRGSEDMLDCFHETDSSVFMDDLKGLPSMSCK 495
             G  + + +  S +  E   SPK ++W  +G E M +  HE D++ F+D+    P +  K
Sbjct: 460  AGGLENSVLRKSGVASEG-CSPKINEWRNKGLEGMFEDLHEMDTA-FIDESYTYP-IHLK 519

Query: 496  TRFGPKSDQGMTTSSAGSMTPRSPLLTPRTSQFDLFLAGKGAYYEHDDLTQISDLADIAR 555
            +  G K     T+SS GS+T  S   TPRTS FD +   +    E +DL  + DL+DIAR
Sbjct: 520  SHVGAKFCHHATSSSTGSITSVSSTNTPRTSHFDSYWLERHCP-EQEDLRLMMDLSDIAR 579

Query: 556  CSANTPLGDDCSMQYLLTCLEDLRVVINRRKFNALTVDTFGTRIEKLIREKYLQLCELVD 615
            C A+T    + S  Y++ C++D++ V+ + K  AL +DTFG RIEKL+ EKYL   EL  
Sbjct: 580  CGASTDFSKEGSCDYIMACMQDIQAVLKQGKLKALVIDTFGGRIEKLLCEKYLHARELTA 639

Query: 616  DEKIDIASTVIDEDTPLEDDIVRSLRTSPIHSSKDRTSIDDFEIIKPISRGAFGRVFLAK 675
            D+     S+V   +    +D++     +P    KDR SIDDFEIIKPISRGAFG+VFLA+
Sbjct: 640  DK-----SSV--GNIKESEDVLEHASATPQLLLKDRISIDDFEIIKPISRGAFGKVFLAR 699

Query: 676  KRTTGDLFAIKVLKKADMIRKNAVESILAERNILISVRNPFVVRFFYSFTCRDNLYLVME 735
            KRTTGD FAIKVLKK DMIRKN +E IL ERNILI+VR PF+VRFFYSFTCRDNLYLVME
Sbjct: 700  KRTTGDFFAIKVLKKLDMIRKNDIERILQERNILITVRYPFLVRFFYSFTCRDNLYLVME 759

Query: 736  YLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLGVVHRDLKPDNLLIAHDGHIK 795
            YLNGGDLYSLL+ +GCLDEE+AR+YIAE+VLALEYLHSL +VHRDLKPDNLLIA++GHIK
Sbjct: 760  YLNGGDLYSLLQKVGCLDEEIARIYIAELVLALEYLHSLKIVHRDLKPDNLLIAYNGHIK 819

Query: 796  LTDFGLSKVGLINSTDDLSG--PAVSGTTLLGYDEPTMSASEHQQERRKKRSAVGTPDYL 855
            LTDFGLSK+GLIN+T DLSG    VS  T       +    ++Q+E R + SAVGTPDYL
Sbjct: 820  LTDFGLSKIGLINNTIDLSGHESDVSPRT------NSHHFQKNQEEERIRHSAVGTPDYL 879

Query: 856  APEILLGTGHGATADWWSVGIILFELIVGIPPFNAEHPQTIFDNILNRKIPWPQIPEEMS 915
            APEILLGT HG  ADWWS GI+LFEL+ GIPPF A  P+ IFDNILN K+PWP +P EMS
Sbjct: 880  APEILLGTEHGYAADWWSAGIVLFELLTGIPPFTASRPEKIFDNILNGKMPWPDVPGEMS 939

Query: 916  HDAQDLIDRLLTEDPHQRLGAIGASEVKQHMFFKDINWDTLARQKAAFVPTSESALDTSY 975
            ++AQDLI+RLL  +P +RLGA GA+EVK H FF+ ++W+ LA QKAAFVP  ES  DTSY
Sbjct: 940  YEAQDLINRLLVHEPEKRLGANGAAEVKSHPFFQGVDWENLALQKAAFVPQPESINDTSY 999

Query: 976  FTSRYSWNHSDDHVYPHSELEDSSDADSLSGDSCLSNRQDEVGDECGGLTDFEPGASVNY 1035
            F SR+S               +SS +D+ +G++  SN   + GDE    T+ E   S  Y
Sbjct: 1000 FVSRFS---------------ESSCSDTETGNNSGSN--PDSGDELDECTNLEKFDSPPY 1053

Query: 1036 --SFSNFSFKNLSQLASINYDLLSKGLKDDP 1063
              S  NFSFKNLSQLASIN+D+L   L+ DP
Sbjct: 1060 YLSLINFSFKNLSQLASINHDVL---LQKDP 1053

BLAST of CSPI01G05030 vs. TAIR10
Match: AT1G30640.1 (AT1G30640.1 Protein kinase family protein)

HSP 1 Score: 271.2 bits (692), Expect = 2.8e-72
Identity = 149/375 (39.73%), Positives = 220/375 (58.67%), Query Frame = 1

Query: 621 IASTVIDEDTPLED--DIVRSLRTSPIHS---SKDRTSIDDFEIIKPISRGAFGRVFLAK 680
           +   + D D  +ED  DI+++     +      + +  +DDFE++  I RGAFG V + K
Sbjct: 79  LEQNLADADVTVEDKMDILKNFEKKEMEYMRLQRQKMGVDDFELLSIIGRGAFGEVRICK 138

Query: 681 KRTTGDLFAIKVLKKADMIRKNAVESILAERNILISVRNPFVVRFFYSFTCRDNLYLVME 740
           +++TG ++A+K LKK++M+R+  VE + AERN+L  V +PF+V+  YSF   ++LYL+ME
Sbjct: 139 EKSTGSVYAMKKLKKSEMLRRGQVEHVKAERNVLAEVDSPFIVKLCYSFQDDEHLYLIME 198

Query: 741 YLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLGVVHRDLKPDNLLIAHDGHIK 800
           YL GGD+ +LL     L E+  R Y+A+ +LA+E +H    VHRD+KPDNLLI  +GHIK
Sbjct: 199 YLPGGDMMTLLMRKDTLREDETRFYVAQTILAIESIHKHNYVHRDIKPDNLLITRNGHIK 258

Query: 801 LTDFGLSKVGLINSTDDLSGPAVSGTTLLGYDEPTMSASE------------HQQERRKK 860
           L+DFGLSK     +  D     V  +T    +   +S               H Q+ R+ 
Sbjct: 259 LSDFGLSKSLESKNFPDFKAELVDRSTKPAAEHDRLSKPPSAPRRTQQEQLLHWQQNRRT 318

Query: 861 R--SAVGTPDYLAPEILLGTGHGATADWWSVGIILFELIVGIPPFNAEHPQTIFDNILNR 920
              S VGTPDY+APE+LL  G+G   DWWS+G I+FE++VG PPF +E P      I+N 
Sbjct: 319 LAFSTVGTPDYIAPEVLLKKGYGMECDWWSLGAIMFEMLVGFPPFYSEEPLATCRKIVNW 378

Query: 921 KIPWPQIPEE--MSHDAQDLIDRLLTEDPHQRLGAIGASEVKQHMFFKDINWDTLARQKA 975
           K    + P+E  +S + +DLI RLL  +  QRLG  G  E+K H +F+ + W+ L    A
Sbjct: 379 K-TCLKFPDEAKLSIEVKDLIRRLLC-NVEQRLGTKGVHEIKAHPWFRGVEWERLYESNA 438

BLAST of CSPI01G05030 vs. NCBI nr
Match: gi|778656691|ref|XP_011649519.1| (PREDICTED: probable serine/threonine protein kinase IREH1 [Cucumis sativus])

HSP 1 Score: 2117.0 bits (5484), Expect = 0.0e+00
Identity = 1063/1066 (99.72%), Positives = 1064/1066 (99.81%), Query Frame = 1

Query: 2    TNITGCYGSSWGFSEGLRNLDFCTPEPSYDCENPKESESPRFQAILRVTSAPRRKYPADI 61
            +N  GCYGSSWGFSEGLRNLDFCTPEPSYDCENPKESESPRFQAILRVTSAPRRKYPADI
Sbjct: 233  SNEAGCYGSSWGFSEGLRNLDFCTPEPSYDCENPKESESPRFQAILRVTSAPRRKYPADI 292

Query: 62   KSFSHELNSKGVRPFPLWKSRRLNNLEEILVMIRAKFDKAKEEVNSDLAIFAADLVGVLE 121
            KSFSHELNSKGVRPFPLWKSRRLNNLEEILVMIRAKFDKAKEEVNSDLAIFAADLVGVLE
Sbjct: 293  KSFSHELNSKGVRPFPLWKSRRLNNLEEILVMIRAKFDKAKEEVNSDLAIFAADLVGVLE 352

Query: 122  KNVDTHPEWQETIEDLLVLARSCAMSSPGEFWLQCESIVQELDDRRQELPPGMLKQLHTR 181
            KNVDTHPEWQETIEDLLVLARSCAMSSPGEFWLQCESIVQELDDRRQELPPGMLKQLHTR
Sbjct: 353  KNVDTHPEWQETIEDLLVLARSCAMSSPGEFWLQCESIVQELDDRRQELPPGMLKQLHTR 412

Query: 182  ILFILTRCTRLLQFHKESGLAEDENVFQLRQSRNLHSADKRTAPAMGREMKSSSAAKASK 241
            ILFILTRCTRLLQFHKESGLAEDENVFQLRQSRNLHSADKRTAPAMGREMKSSSAAKASK
Sbjct: 413  ILFILTRCTRLLQFHKESGLAEDENVFQLRQSRNLHSADKRTAPAMGREMKSSSAAKASK 472

Query: 242  GTSSRKSYSQEQRGLDWSREHDILPGNNVSTSPDDTSKNLVSPTGRDRMASWKRLPSPAK 301
            GTSSRKSYSQEQRGLDWSREHDILPGNNVSTSPDDTSKNLVSPTGRDRMASWKRLPSPAK
Sbjct: 473  GTSSRKSYSQEQRGLDWSREHDILPGNNVSTSPDDTSKNLVSPTGRDRMASWKRLPSPAK 532

Query: 302  SSVKESTLKVQGDNKKFKSLNMSKIRMSVSETDFAAAKVSELPLPRDSHDQPTKHQHKPS 361
            SSVKESTLKVQGDNKKFKSLNMSKIRMSVSETDFAAAKVSELPLPRDSHDQPTKHQHKPS
Sbjct: 533  SSVKESTLKVQGDNKKFKSLNMSKIRMSVSETDFAAAKVSELPLPRDSHDQPTKHQHKPS 592

Query: 362  WGWGDQPTVSDESSIICRICEEEIPTANVEDHSRICAVADRCDQKGISVNERLLRIAETL 421
            WGWGDQPTVSDESSIICRICEEEIPTANVEDHSRICAVADRCDQKGISVNERLLRIAETL
Sbjct: 593  WGWGDQPTVSDESSIICRICEEEIPTANVEDHSRICAVADRCDQKGISVNERLLRIAETL 652

Query: 422  EKMIESFTQKDSQHVGSPDVAKVSNSSMTEESDISPKHSDWSRRGSEDMLDCFHETDSSV 481
            EKMIESFTQKDSQHVGSPDVAKVSNSSMTEESDISPKHSDWSRRGSEDMLDCFHETDSSV
Sbjct: 653  EKMIESFTQKDSQHVGSPDVAKVSNSSMTEESDISPKHSDWSRRGSEDMLDCFHETDSSV 712

Query: 482  FMDDLKGLPSMSCKTRFGPKSDQGMTTSSAGSMTPRSPLLTPRTSQFDLFLAGKGAYYEH 541
            FMDDLKGLPSMSCKTRFGPKSDQGMTTSSAGSMTPRSPLLTPRTSQFDLFLAGKGAYYEH
Sbjct: 713  FMDDLKGLPSMSCKTRFGPKSDQGMTTSSAGSMTPRSPLLTPRTSQFDLFLAGKGAYYEH 772

Query: 542  DDLTQISDLADIARCSANTPLGDDCSMQYLLTCLEDLRVVINRRKFNALTVDTFGTRIEK 601
            DDLTQISDLADIARCSANTPLGDDCSMQYLLTCLEDLRVVINRRKFNALTVDTFGTRIEK
Sbjct: 773  DDLTQISDLADIARCSANTPLGDDCSMQYLLTCLEDLRVVINRRKFNALTVDTFGTRIEK 832

Query: 602  LIREKYLQLCELVDDEKIDIASTVIDEDTPLEDDIVRSLRTSPIHSSKDRTSIDDFEIIK 661
            LIREKYLQLCELVDDEKIDIASTVIDEDTPLEDDIVRSLRTSPIHSSKDRTSIDDFEIIK
Sbjct: 833  LIREKYLQLCELVDDEKIDIASTVIDEDTPLEDDIVRSLRTSPIHSSKDRTSIDDFEIIK 892

Query: 662  PISRGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAERNILISVRNPFVVRFF 721
            PISRGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAERNILISVRNPFVVRFF
Sbjct: 893  PISRGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAERNILISVRNPFVVRFF 952

Query: 722  YSFTCRDNLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLGVVHRDL 781
            YSFTCRDNLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLGVVHRDL
Sbjct: 953  YSFTCRDNLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLGVVHRDL 1012

Query: 782  KPDNLLIAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGTTLLGYDEPTMSASEHQQERR 841
            KPDNLLIAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGTTLLGYDEPTMSASEHQQERR
Sbjct: 1013 KPDNLLIAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGTTLLGYDEPTMSASEHQQERR 1072

Query: 842  KKRSAVGTPDYLAPEILLGTGHGATADWWSVGIILFELIVGIPPFNAEHPQTIFDNILNR 901
            KKRSAVGTPDYLAPEILLGTGHGATADWWSVGIILFELIVGIPPFNAEHPQTIFDNILNR
Sbjct: 1073 KKRSAVGTPDYLAPEILLGTGHGATADWWSVGIILFELIVGIPPFNAEHPQTIFDNILNR 1132

Query: 902  KIPWPQIPEEMSHDAQDLIDRLLTEDPHQRLGAIGASEVKQHMFFKDINWDTLARQKAAF 961
            KIPWPQIPEEMSHDAQDLIDRLLTEDPHQRLGAIGASEVKQHMFFKDINWDTLARQKAAF
Sbjct: 1133 KIPWPQIPEEMSHDAQDLIDRLLTEDPHQRLGAIGASEVKQHMFFKDINWDTLARQKAAF 1192

Query: 962  VPTSESALDTSYFTSRYSWNHSDDHVYPHSELEDSSDADSLSGDSCLSNRQDEVGDECGG 1021
            VPTSESALDTSYFTSRYSWNHSDDHVYPHSELEDSSDADSLSGDSCLSNRQDEVGDECGG
Sbjct: 1193 VPTSESALDTSYFTSRYSWNHSDDHVYPHSELEDSSDADSLSGDSCLSNRQDEVGDECGG 1252

Query: 1022 LTDFEPGASVNYSFSNFSFKNLSQLASINYDLLSKGLKDDPPNHDA 1068
            LTDFEPGASVNYSFSNFSFKNLSQLASINYDLLSKGLKDDPPNHDA
Sbjct: 1253 LTDFEPGASVNYSFSNFSFKNLSQLASINYDLLSKGLKDDPPNHDA 1298

BLAST of CSPI01G05030 vs. NCBI nr
Match: gi|659066229|ref|XP_008443041.1| (PREDICTED: uncharacterized protein LOC103486458 [Cucumis melo])

HSP 1 Score: 2087.0 bits (5406), Expect = 0.0e+00
Identity = 1049/1066 (98.41%), Positives = 1054/1066 (98.87%), Query Frame = 1

Query: 2    TNITGCYGSSWGFSEGLRNLDFCTPEPSYDCENPKESESPRFQAILRVTSAPRRKYPADI 61
            +N  GCYGSSWGFSEGLRNLDFCTPEPSYDCENPKESESPRFQAILRVTSAPRRKYPADI
Sbjct: 233  SNEAGCYGSSWGFSEGLRNLDFCTPEPSYDCENPKESESPRFQAILRVTSAPRRKYPADI 292

Query: 62   KSFSHELNSKGVRPFPLWKSRRLNNLEEILVMIRAKFDKAKEEVNSDLAIFAADLVGVLE 121
            KSFSHELNSKGVRPFPLWKSRRLNNLEEILVMIRAKFDKAKEEV+SDLAIFAADLVGVLE
Sbjct: 293  KSFSHELNSKGVRPFPLWKSRRLNNLEEILVMIRAKFDKAKEEVDSDLAIFAADLVGVLE 352

Query: 122  KNVDTHPEWQETIEDLLVLARSCAMSSPGEFWLQCESIVQELDDRRQELPPGMLKQLHTR 181
            KNVDTHPEWQETIEDLLVLARSCAMSSPGEFWLQCESIVQELDDRRQELPPGMLKQLHTR
Sbjct: 353  KNVDTHPEWQETIEDLLVLARSCAMSSPGEFWLQCESIVQELDDRRQELPPGMLKQLHTR 412

Query: 182  ILFILTRCTRLLQFHKESGLAEDENVFQLRQSRNLHSADKRTAPAMGREMKSSSAAKASK 241
            ILFILTRCTRLLQFHKESGLAEDENVFQLRQSRNLHSADKR APAMGRE KSSSAAKASK
Sbjct: 413  ILFILTRCTRLLQFHKESGLAEDENVFQLRQSRNLHSADKRMAPAMGRETKSSSAAKASK 472

Query: 242  GTSSRKSYSQEQRGLDWSREHDILPGNNVSTSPDDTSKNLVSPTGRDRMASWKRLPSPAK 301
              SSRKSYSQEQRGLDWSREHDILPGNNVSTSPDDTSKNLVSPTGRDR+ASWKRLPSPA 
Sbjct: 473  TPSSRKSYSQEQRGLDWSREHDILPGNNVSTSPDDTSKNLVSPTGRDRIASWKRLPSPAP 532

Query: 302  SSVKESTLKVQGDNKKFKSLNMSKIRMSVSETDFAAAKVSELPLPRDSHDQPTKHQHKPS 361
             +VKESTLK QGDNKKFKSLNMSKI MSVSETDF AAKVSELPLPRDSHDQPTKHQHKPS
Sbjct: 533  KTVKESTLKDQGDNKKFKSLNMSKIGMSVSETDFTAAKVSELPLPRDSHDQPTKHQHKPS 592

Query: 362  WGWGDQPTVSDESSIICRICEEEIPTANVEDHSRICAVADRCDQKGISVNERLLRIAETL 421
            WGWGDQPTVSDESSIICRICEEEIPTANVEDHSRICAVADRCDQKGISVNERLLRIAETL
Sbjct: 593  WGWGDQPTVSDESSIICRICEEEIPTANVEDHSRICAVADRCDQKGISVNERLLRIAETL 652

Query: 422  EKMIESFTQKDSQHVGSPDVAKVSNSSMTEESDISPKHSDWSRRGSEDMLDCFHETDSSV 481
            EKMIESFTQKDSQHVGSPDVAKVSNSSMTEESDISPKHSDWSRRGSEDMLDCFHETDSSV
Sbjct: 653  EKMIESFTQKDSQHVGSPDVAKVSNSSMTEESDISPKHSDWSRRGSEDMLDCFHETDSSV 712

Query: 482  FMDDLKGLPSMSCKTRFGPKSDQGMTTSSAGSMTPRSPLLTPRTSQFDLFLAGKGAYYEH 541
            FMDDLKGLPSMSCKTRFGPKSDQGMTTSSAGSMTPRSPLLTPRTSQFDLFLAGKGAYYEH
Sbjct: 713  FMDDLKGLPSMSCKTRFGPKSDQGMTTSSAGSMTPRSPLLTPRTSQFDLFLAGKGAYYEH 772

Query: 542  DDLTQISDLADIARCSANTPLGDDCSMQYLLTCLEDLRVVINRRKFNALTVDTFGTRIEK 601
            DDLTQISDLADIARCSANTPLGDDCSMQYLLTCLEDLRVVINRRKFNALTVDTFGTRIEK
Sbjct: 773  DDLTQISDLADIARCSANTPLGDDCSMQYLLTCLEDLRVVINRRKFNALTVDTFGTRIEK 832

Query: 602  LIREKYLQLCELVDDEKIDIASTVIDEDTPLEDDIVRSLRTSPIHSSKDRTSIDDFEIIK 661
            LIREKYLQLCELVDDEKID+ASTVIDEDTPLEDDIVRSLRTSPIHSSKDRTSIDDFEIIK
Sbjct: 833  LIREKYLQLCELVDDEKIDLASTVIDEDTPLEDDIVRSLRTSPIHSSKDRTSIDDFEIIK 892

Query: 662  PISRGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAERNILISVRNPFVVRFF 721
            PISRGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAERNILISVRNPFVVRFF
Sbjct: 893  PISRGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAERNILISVRNPFVVRFF 952

Query: 722  YSFTCRDNLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLGVVHRDL 781
            YSFTCRDNLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLGVVHRDL
Sbjct: 953  YSFTCRDNLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLGVVHRDL 1012

Query: 782  KPDNLLIAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGTTLLGYDEPTMSASEHQQERR 841
            KPDNLLIAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGTTLLGYDEPTMSASEHQQERR
Sbjct: 1013 KPDNLLIAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGTTLLGYDEPTMSASEHQQERR 1072

Query: 842  KKRSAVGTPDYLAPEILLGTGHGATADWWSVGIILFELIVGIPPFNAEHPQTIFDNILNR 901
            KKRSAVGTPDYLAPEILLGTGHGATADWWSVGIILFELIVGIPPFNAEHPQTIFDNILNR
Sbjct: 1073 KKRSAVGTPDYLAPEILLGTGHGATADWWSVGIILFELIVGIPPFNAEHPQTIFDNILNR 1132

Query: 902  KIPWPQIPEEMSHDAQDLIDRLLTEDPHQRLGAIGASEVKQHMFFKDINWDTLARQKAAF 961
            KIPWPQIPEEMSHDAQDLIDRLLTEDPHQRLGAIGASEVKQHMFFKDINWDTLARQKAAF
Sbjct: 1133 KIPWPQIPEEMSHDAQDLIDRLLTEDPHQRLGAIGASEVKQHMFFKDINWDTLARQKAAF 1192

Query: 962  VPTSESALDTSYFTSRYSWNHSDDHVYPHSELEDSSDADSLSGDSCLSNRQDEVGDECGG 1021
            VPTSESALDTSYFTSRYSWNHSDDHVYPHSELEDSSDADSLSGDS LSNRQDEVGDECGG
Sbjct: 1193 VPTSESALDTSYFTSRYSWNHSDDHVYPHSELEDSSDADSLSGDSSLSNRQDEVGDECGG 1252

Query: 1022 LTDFEPGASVNYSFSNFSFKNLSQLASINYDLLSKGLKDDPPNHDA 1068
            LTDFEPGASVNYSFSNFSFKNLSQLASINYDLLSKGLKDDPPNHDA
Sbjct: 1253 LTDFEPGASVNYSFSNFSFKNLSQLASINYDLLSKGLKDDPPNHDA 1298

BLAST of CSPI01G05030 vs. NCBI nr
Match: gi|590666113|ref|XP_007036900.1| (Kinase superfamily protein isoform 1 [Theobroma cacao])

HSP 1 Score: 1710.3 bits (4428), Expect = 0.0e+00
Identity = 870/1060 (82.08%), Positives = 951/1060 (89.72%), Query Frame = 1

Query: 10   SSWGFSEGLRNLDFCTPEPSYDCENPKESESPRFQAILRVTSAPRRKYPADIKSFSHELN 69
            SSWG S GL++ DFCTPE SYDCENPKESESPRFQAILRVTS PR+++PADIKSFSHELN
Sbjct: 234  SSWGHSGGLKSSDFCTPETSYDCENPKESESPRFQAILRVTSGPRKRFPADIKSFSHELN 293

Query: 70   SKGVRPFPLWKSRRLNNLEEILVMIRAKFDKAKEEVNSDLAIFAADLVGVLEKNVDTHPE 129
            SKGVRPFPLWK RRLNNLEEIL+ IRAKFDKAKEEVN+DLAIFAADLVG+LEKN ++HPE
Sbjct: 294  SKGVRPFPLWKPRRLNNLEEILIAIRAKFDKAKEEVNADLAIFAADLVGILEKNAESHPE 353

Query: 130  WQETIEDLLVLARSCAMSSPGEFWLQCESIVQELDDRRQELPPGMLKQLHTRILFILTRC 189
            WQETIEDLLVLARSCAM+ PGEFWLQCE IVQELDD+RQELPPG LKQL+T++LFILTRC
Sbjct: 354  WQETIEDLLVLARSCAMTPPGEFWLQCEGIVQELDDKRQELPPGTLKQLYTKMLFILTRC 413

Query: 190  TRLLQFHKESGLAEDENVFQLRQSRNLHSADKRTAPAMGREMKSSSAAKASKGT---SSR 249
            TRLLQFHKESGLAEDE V QLRQSR LH  DKRT+  + RE KS SA+KASK +   SS+
Sbjct: 414  TRLLQFHKESGLAEDEPVIQLRQSRILHPVDKRTSSGVLREAKSLSASKASKSSKAASSK 473

Query: 250  KSYSQEQRGLDWSREHDILPGNNVSTSPDDTSKNLVSPTGRDRMASWKRLPSPAKSSVKE 309
            K+YSQEQ  LDW R+H +LPG  ++ + DDT KNL SP  RDR+ASWK+LPSPAK   KE
Sbjct: 474  KAYSQEQHALDWKRDHVVLPGGLIAPT-DDTPKNLESPASRDRIASWKKLPSPAKKGPKE 533

Query: 310  STLKVQGDNKKFKSLNMSKIRMSVSETDFAAAKVSELPLPRDSHDQPTKHQHKPSWG-WG 369
                 + ++ K ++L     R   S+ D AA K+ ELP  ++S +  +KHQHK SWG WG
Sbjct: 534  VIASKEQNDNKIETLK----RRGASDVDLAAMKLQELPPAKESQEHSSKHQHKVSWGYWG 593

Query: 370  DQPTVSDESSIICRICEEEIPTANVEDHSRICAVADRCDQKGISVNERLLRIAETLEKMI 429
            DQP VS+ESSIICRICEEE+ T+NVEDHSRICAVADRCDQKG+SV+ERL+RIAETLEKM 
Sbjct: 594  DQPNVSEESSIICRICEEEVATSNVEDHSRICAVADRCDQKGLSVDERLVRIAETLEKMT 653

Query: 430  ESFTQKDSQHVGSPDVAKVSNSSMTEESDI-SPKHSDWSRRGSEDMLDCFHETDSSVFMD 489
            +SF  KD QHVGSPD AKVSNSS+TEESD+ SPK SDWSRRGSEDMLDCF E D+SVFMD
Sbjct: 654  DSFANKDIQHVGSPDGAKVSNSSVTEESDVLSPKLSDWSRRGSEDMLDCFPEADNSVFMD 713

Query: 490  DLKGLPSMSCKTRFGPKSDQGMTTSSAGSMTPRSPLLTPRTSQFDLFLAGKGAYYEHDDL 549
            DLKGLPSMSCKTRFGPKSDQGMTTSSAGSMTPRSPLLTPRTSQ DL L+GKGA+ E +DL
Sbjct: 714  DLKGLPSMSCKTRFGPKSDQGMTTSSAGSMTPRSPLLTPRTSQIDLLLSGKGAFSEQEDL 773

Query: 550  TQISDLADIARCSANTPLGDDCSMQYLLTCLEDLRVVINRRKFNALTVDTFGTRIEKLIR 609
             Q+++LADIARC ANTPL DD SM +LL+ LE+LR+VI+RRKF+ALTV+TFG RIEKLIR
Sbjct: 774  PQMNELADIARCVANTPLVDDHSMPFLLSFLEELRLVIDRRKFDALTVETFGARIEKLIR 833

Query: 610  EKYLQLCELVDDEKIDIASTVIDEDTPLEDDIVRSLRTSPIHSSKDRTSIDDFEIIKPIS 669
            EKYLQLCELVDDEK+DI STVIDED PLEDD+VRSLRTSP HSS+DRT+IDDFEIIKPIS
Sbjct: 834  EKYLQLCELVDDEKVDITSTVIDEDAPLEDDVVRSLRTSPNHSSRDRTTIDDFEIIKPIS 893

Query: 670  RGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAERNILISVRNPFVVRFFYSF 729
            RGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAER+ILISVRNPFVVRFFYSF
Sbjct: 894  RGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAERDILISVRNPFVVRFFYSF 953

Query: 730  TCRDNLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLGVVHRDLKPD 789
            TCR+NLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSL VVHRDLKPD
Sbjct: 954  TCRENLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLHVVHRDLKPD 1013

Query: 790  NLLIAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGTTLLGYDEPTMSASEHQQERRKKR 849
            NLLIAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGT+LL  ++P +SASEHQQERRKKR
Sbjct: 1014 NLLIAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGTSLLDDEQPQLSASEHQQERRKKR 1073

Query: 850  SAVGTPDYLAPEILLGTGHGATADWWSVGIILFELIVGIPPFNAEHPQTIFDNILNRKIP 909
            SAVGTPDYLAPEILLGTGHGATADWWSVG+ILFELIVGIPPFNAEHPQTIFDNILNRKIP
Sbjct: 1074 SAVGTPDYLAPEILLGTGHGATADWWSVGVILFELIVGIPPFNAEHPQTIFDNILNRKIP 1133

Query: 910  WPQIPEEMSHDAQDLIDRLLTEDPHQRLGAIGASEVKQHMFFKDINWDTLARQKAAFVPT 969
            WP++ EEMS +A+DLIDRLLTEDPHQRLGA GASEVKQH+FFKDINWDTLARQKAAFVPT
Sbjct: 1134 WPRVSEEMSLEAKDLIDRLLTEDPHQRLGARGASEVKQHVFFKDINWDTLARQKAAFVPT 1193

Query: 970  SESALDTSYFTSRYSWNHSDDHVYPHSELEDSSDADSLSG-DSCLSNRQDEVGDECGGLT 1029
            SESALDTSYFTSRYSWN SDDH YP SE +DSSDADSLSG  SCLSNRQDEVGDECGGL 
Sbjct: 1194 SESALDTSYFTSRYSWNTSDDHAYPGSEFDDSSDADSLSGSSSCLSNRQDEVGDECGGLA 1253

Query: 1030 DFEPGASVNYSFSNFSFKNLSQLASINYDLLSKGLKDDPP 1064
            +FE G+SVNYSFSNFSFKNLSQLASINYDLLSKG KDD P
Sbjct: 1254 EFESGSSVNYSFSNFSFKNLSQLASINYDLLSKGWKDDHP 1288
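
The percentages in each HSP header are the identity and positive counts divided by the alignment length (for this hit, 870/1060 and 951/1060). The following is a minimal Python sketch, assuming the header line layout shown above; `recompute_percentages` is a hypothetical helper name, not part of any BLAST tool.

```python
import re

# Matches the HSP statistics line; "Posi?tives" also tolerates the
# "Postives" spelling that some renderings of this output use.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+)"
)


def recompute_percentages(line: str) -> tuple:
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return (round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2))


line = "Identity = 870/1060 (82.08%), Positives = 951/1060 (89.72%), Query Frame = 1"
print(recompute_percentages(line))
```

Recomputing from the counts is a quick consistency check on a copied record, since the fractions are less likely to be clipped than the formatted percentages.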

BLAST of CSPI01G05030 vs. NCBI nr
Match: gi|255585466|ref|XP_002533426.1| (PREDICTED: probable serine/threonine protein kinase IREH1 isoform X1 [Ricinus communis])

HSP 1 Score: 1704.5 bits (4413), Expect = 0.0e+00
Identity = 876/1056 (82.95%), Positives = 948/1056 (89.77%), Query Frame = 1

Query: 11   SWGFSEGLRNLDFCTPEPSYDCENPKESESPRFQAILRVTSAPRRKYPADIKSFSHELNS 70
            SWG S GLR+ D  TPE +YDCENPKESESPRFQAILRVTSAPR+++PADIKSFSHELNS
Sbjct: 231  SWGHSGGLRSSDVLTPE-TYDCENPKESESPRFQAILRVTSAPRKRFPADIKSFSHELNS 290

Query: 71   KGVRPFPLWKSRRLNNLEEILVMIRAKFDKAKEEVNSDLAIFAADLVGVLEKNVDTHPEW 130
            KGVRPFP WK R LNNLEEILV+IRAKFDKAKEEVNSDLAIFAADLVGVLEKN ++HPEW
Sbjct: 291  KGVRPFPFWKPRGLNNLEEILVVIRAKFDKAKEEVNSDLAIFAADLVGVLEKNAESHPEW 350

Query: 131  QETIEDLLVLARSCAMSSPGEFWLQCESIVQELDDRRQELPPGMLKQLHTRILFILTRCT 190
            QETIEDLLVLARSCAMSSP EFWLQCESIVQELDDRRQELPPGMLKQLHTR+LFILTRCT
Sbjct: 351  QETIEDLLVLARSCAMSSPSEFWLQCESIVQELDDRRQELPPGMLKQLHTRMLFILTRCT 410

Query: 191  RLLQFHKESGLAEDENVFQLRQSRNLHSADKRTAPAMGREMKSSSAAKASKGTSSRKSYS 250
            RLLQFHKESGLAEDENVFQLRQSR LHSA+KR  P++ R+ KSSSAAKASK  S++KSYS
Sbjct: 411  RLLQFHKESGLAEDENVFQLRQSRLLHSAEKRIPPSIVRDGKSSSAAKASKAASAKKSYS 470

Query: 251  QEQRGLDWSREHDILPGNNVSTSPDDTSKNLVSPTGRDRMASWKRLPSPAKSSVKESTLK 310
            QEQ GLDW R+     G+++ T+ DD SKN+ SP    RMASWKRLPSPA  SVKE    
Sbjct: 471  QEQHGLDWKRDQVAQLGSSLPTA-DDASKNMDSPGSGARMASWKRLPSPAGKSVKEVAPS 530

Query: 311  VQGDNKKFKSLNMSKIRMSVSETDFAAAKVSELPLPRDSHDQPTKHQHKPSWG-WGDQPT 370
             + ++ K + L +   R  VS+ D  A K+SELP+ +DSH+   KHQHK SWG WGDQ  
Sbjct: 531  KENNDCKIEPLKILNNRKGVSDADLTATKLSELPVAKDSHEHSMKHQHKISWGYWGDQQN 590

Query: 371  VSDESSIICRICEEEIPTANVEDHSRICAVADRCDQKGISVNERLLRIAETLEKMIESFT 430
            VSD++SIICRICEEE+PT +VEDHSRICA+ADR DQKG+SVNERL RI+ETL+KMIES  
Sbjct: 591  VSDDTSIICRICEEEVPTLHVEDHSRICAIADRSDQKGLSVNERLARISETLDKMIESIA 650

Query: 431  QKDSQH-VGSPDVAKVSNSSMTEESDI-SPKHSDWSRRGSEDMLDCFHETDSSVFMDDLK 490
            QKD+Q  VGSPDVAKVSNSS+TEESD+ SPK SDWSRRGSEDMLDCF E D+SVFMDDLK
Sbjct: 651  QKDTQPAVGSPDVAKVSNSSVTEESDVLSPKLSDWSRRGSEDMLDCFPEADNSVFMDDLK 710

Query: 491  GLPSMSCKTRFGPKSDQGMTTSSAGSMTPRSPLLTPRTSQFDLFLAGKGAYYEHDDLTQI 550
            GLPSMSCKTRFGPKSDQGM TSSAGSMTPRSPLLTPRTS  DL L GKGA+ EHDDL Q+
Sbjct: 711  GLPSMSCKTRFGPKSDQGMATSSAGSMTPRSPLLTPRTSPIDLLLTGKGAFSEHDDLPQM 770

Query: 551  SDLADIARCSANTPLGDDCSMQYLLTCLEDLRVVINRRKFNALTVDTFGTRIEKLIREKY 610
            ++LADIARC   TPL DD S+ YLL+CLEDLRVVI+RRKF+ALTV+TFGTRIEKLIREKY
Sbjct: 771  TELADIARCVVTTPLDDDRSIPYLLSCLEDLRVVIDRRKFDALTVETFGTRIEKLIREKY 830

Query: 611  LQLCELVDDEKIDIASTVIDEDTPLEDDIVRSLRTSPIHSSKDRTSIDDFEIIKPISRGA 670
            LQLCELV+DE++DI ST+IDED PLEDD+VRSLRTSPIHSSKDRTSIDDFEIIKPISRGA
Sbjct: 831  LQLCELVEDERVDITSTIIDEDAPLEDDVVRSLRTSPIHSSKDRTSIDDFEIIKPISRGA 890

Query: 671  FGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAERNILISVRNPFVVRFFYSFTCR 730
            FGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAER+ILISVRNPFVVRFFYSFTCR
Sbjct: 891  FGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAERDILISVRNPFVVRFFYSFTCR 950

Query: 731  DNLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLGVVHRDLKPDNLL 790
            +NLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSL VVHRDLKPDNLL
Sbjct: 951  ENLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLRVVHRDLKPDNLL 1010

Query: 791  IAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGTTLLGYDEPTMSASEHQQERRKKRSAV 850
            IAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGT++L  DEP +SASEHQ+ERRKKRSAV
Sbjct: 1011 IAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGTSMLEDDEPQLSASEHQRERRKKRSAV 1070

Query: 851  GTPDYLAPEILLGTGHGATADWWSVGIILFELIVGIPPFNAEHPQTIFDNILNRKIPWPQ 910
            GTPDYLAPEILLGTGHG TADWWSVG+ILFELIVGIPPFNAEHPQ IFDNILNRKIPWP+
Sbjct: 1071 GTPDYLAPEILLGTGHGTTADWWSVGVILFELIVGIPPFNAEHPQIIFDNILNRKIPWPR 1130

Query: 911  IPEEMSHDAQDLIDRLLTEDPHQRLGAIGASEVKQHMFFKDINWDTLARQKAAFVPTSES 970
            +PEEMS +AQDLIDRLLTEDP  RLGA GASEVKQH+FFKDINWDTLARQKAAFVP+SES
Sbjct: 1131 VPEEMSPEAQDLIDRLLTEDPEVRLGAGGASEVKQHVFFKDINWDTLARQKAAFVPSSES 1190

Query: 971  ALDTSYFTSRYSWNHSDDHVYPHSELEDSSDADSLSG-DSCLSNRQDEVGDECGGLTDFE 1030
            ALDTSYFTSRYSWN S D VYP S+ EDSSDADSLSG  SCLSNRQDEVGDECGGL +FE
Sbjct: 1191 ALDTSYFTSRYSWNTS-DQVYPTSDFEDSSDADSLSGSSSCLSNRQDEVGDECGGLAEFE 1250

Query: 1031 PGASVNYSFSNFSFKNLSQLASINYDLLSKGLKDDP 1063
             G+SVNYSFSNFSFKNLSQLASINYDLLSKG KDDP
Sbjct: 1251 SGSSVNYSFSNFSFKNLSQLASINYDLLSKGWKDDP 1283

BLAST of CSPI01G05030 vs. NCBI nr
Match: gi|802551124|ref|XP_012064629.1| (PREDICTED: probable serine/threonine protein kinase IREH1 isoform X2 [Jatropha curcas])

HSP 1 Score: 1703.7 bits (4411), Expect = 0.0e+00
Identity = 878/1066 (82.36%), Positives = 952/1066 (89.31%), Query Frame = 1

Query: 6    GCYGSSWGFSEGLRNLDFCTPEPS--YDCENPKESESPRFQAILRVTSAPRRKYPADIKS 65
            G + SSW  S  LR+ D  TPE S  YDCENPKESESPRFQAILRVTSAPR+++PADIKS
Sbjct: 236  GRHESSWSRSGVLRSSDVFTPEVSETYDCENPKESESPRFQAILRVTSAPRKRFPADIKS 295

Query: 66   FSHELNSKGVRPFPLWKSRRLNNLEEILVMIRAKFDKAKEEVNSDLAIFAADLVGVLEKN 125
            FSHELNSKGVRPFP WK R LNNLEEILV+IRAKFDKAKEEVNSDLAIFAADLVG+LEKN
Sbjct: 296  FSHELNSKGVRPFPFWKPRGLNNLEEILVVIRAKFDKAKEEVNSDLAIFAADLVGILEKN 355

Query: 126  VDTHPEWQETIEDLLVLARSCAMSSPGEFWLQCESIVQELDDRRQELPPGMLKQLHTRIL 185
             ++HPEWQETIEDLLVLARSCAM+SP EFWLQCE IVQELDDRRQELPPGMLKQLHTR+L
Sbjct: 356  AESHPEWQETIEDLLVLARSCAMTSPSEFWLQCEGIVQELDDRRQELPPGMLKQLHTRML 415

Query: 186  FILTRCTRLLQFHKESGLAEDENVFQLRQSRNLHSADKRTAPAMGREMKSSSAAKASKGT 245
            FILTRCTRLLQFHKESGLAEDENVF LRQSR LHS DKR     GRE KSSSAAKASK  
Sbjct: 416  FILTRCTRLLQFHKESGLAEDENVFHLRQSRLLHSDDKRIPLGPGREGKSSSAAKASKTA 475

Query: 246  SSRKSYSQEQR-GLDWSREHDILPGNNVSTSPDDTSKNLVSPTGRDRMASWKRLPSPAKS 305
            S+RKSYSQEQ  GLDW+R+    PGN++ T+ D TSK++ SP  RDRMASWK+LPSP   
Sbjct: 476  STRKSYSQEQHHGLDWNRDQIAQPGNSLPTT-DGTSKSMDSPGSRDRMASWKKLPSPVAK 535

Query: 306  SVKESTLKVQGDNKKFKSLNMSKIRMSVSETDFAAAKVSELPLPRDSHDQPTKHQHKPSW 365
            ++K++ LK  G   K + L     R+ +S+ D  A K+SE+P  +DSH+  TKHQHK SW
Sbjct: 536  NMKDAPLKELGS--KVEPLKTLNSRIGISDADLVATKLSEIPTAKDSHEHSTKHQHKVSW 595

Query: 366  G-WGDQPTVSDESSIICRICEEEIPTANVEDHSRICAVADRCDQKGISVNERLLRIAETL 425
            G WGDQ  + DESSIICRICEEE+PT++VEDHSRICA+ADRCDQKG+SVNERL RI+ETL
Sbjct: 596  GYWGDQQNIFDESSIICRICEEEVPTSHVEDHSRICAIADRCDQKGLSVNERLARISETL 655

Query: 426  EKMIESFTQKDSQHV-GSPDVAKVSNSSMTEESDI-SPKHSDWSRRGSEDMLDCFHETDS 485
            EKMIE+F QKD QH  GSPDVAKVSNSS+TEESD+ SPK SDWSRRGSEDMLDCF E D+
Sbjct: 656  EKMIETFAQKDIQHAAGSPDVAKVSNSSVTEESDVLSPKLSDWSRRGSEDMLDCFPEADN 715

Query: 486  SVFMDDLKGLPSMSCKTRFGPKSDQGMTTSSAGSMTPRSPL--LTPRTSQFDLFLAGKGA 545
             +FMDDLKGLPSMSCKTRFGPKSDQGM TSSAGSMTPRSP   LTPRTSQ DL LAGKGA
Sbjct: 716  YIFMDDLKGLPSMSCKTRFGPKSDQGMATSSAGSMTPRSPSSSLTPRTSQIDLLLAGKGA 775

Query: 546  YYEHDDLTQISDLADIARCSANTPLGDDCSMQYLLTCLEDLRVVINRRKFNALTVDTFGT 605
            + E+DD+ Q+++LADIARC ANTPL DD SM YLLTCLEDLRVVI+RRKF+A TV+TFGT
Sbjct: 776  FSENDDIPQMNELADIARCVANTPLDDDRSMPYLLTCLEDLRVVIDRRKFDAHTVETFGT 835

Query: 606  RIEKLIREKYLQLCELVDDEKIDIASTVIDEDTPLEDDIVRSLRTSPIHSSKDRTSIDDF 665
            RIEKLIREKYLQLCELV+D+K+DI STVIDEDTPLEDD+VRSLRTSPIHS KDRTSIDDF
Sbjct: 836  RIEKLIREKYLQLCELVEDDKVDITSTVIDEDTPLEDDVVRSLRTSPIHS-KDRTSIDDF 895

Query: 666  EIIKPISRGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAERNILISVRNPFV 725
            EIIKPISRGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAER+ILISVRNPFV
Sbjct: 896  EIIKPISRGAFGRVFLAKKRTTGDLFAIKVLKKADMIRKNAVESILAERDILISVRNPFV 955

Query: 726  VRFFYSFTCRDNLYLVMEYLNGGDLYSLLRNLGCLDEEVARVYIAEVVLALEYLHSLGVV 785
            VRFFYSFTCR+NLYLVMEYLNGGDLYSLLRNLGCLDE+VAR+YIAEVVLALEYLHSL VV
Sbjct: 956  VRFFYSFTCRENLYLVMEYLNGGDLYSLLRNLGCLDEDVARIYIAEVVLALEYLHSLRVV 1015

Query: 786  HRDLKPDNLLIAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGTTLLGYDEPTMSASEHQ 845
            HRDLKPDNLLIAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGT++L  DEP +S SE Q
Sbjct: 1016 HRDLKPDNLLIAHDGHIKLTDFGLSKVGLINSTDDLSGPAVSGTSMLVDDEPQVSTSEDQ 1075

Query: 846  QERRKKRSAVGTPDYLAPEILLGTGHGATADWWSVGIILFELIVGIPPFNAEHPQTIFDN 905
            Q+RRKKRSAVGTPDYLAPEILLGTGHG TADWWSVG+ILFELIVGIPPFNAEHPQ IFDN
Sbjct: 1076 QDRRKKRSAVGTPDYLAPEILLGTGHGTTADWWSVGVILFELIVGIPPFNAEHPQKIFDN 1135

Query: 906  ILNRKIPWPQIPEEMSHDAQDLIDRLLTEDPHQRLGAIGASEVKQHMFFKDINWDTLARQ 965
            ILNRKIPWP++PEEMS +A DLIDRLLTEDPHQRLGA GASEVKQH+FFKDINWDTLARQ
Sbjct: 1136 ILNRKIPWPRVPEEMSPEAWDLIDRLLTEDPHQRLGAGGASEVKQHVFFKDINWDTLARQ 1195

Query: 966  KAAFVPTSESALDTSYFTSRYSWNHSDDHVYPHSELEDSSDADSLSG-DSCLSNRQDEVG 1025
            KAAFVP+SESALDTSYFTSRYSWNHSDDHVYP S+ EDSSDADSLSG  SCLSNRQDEVG
Sbjct: 1196 KAAFVPSSESALDTSYFTSRYSWNHSDDHVYPASDFEDSSDADSLSGSSSCLSNRQDEVG 1255

Query: 1026 DECGGLTDFEPGASVNYSFSNFSFKNLSQLASINYDLLSKGLKDDP 1063
            DECGGL +FE G+SVNYSFSNFSFKNLSQLASINYDLLSKG KDDP
Sbjct: 1256 DECGGLAEFESGSSVNYSFSNFSFKNLSQLASINYDLLSKGWKDDP 1297

The following BLAST results are available for this feature:
Match Name    E-value   Identity (%)  Description
IREH1_ARATH   0.0e+00   71.66         Probable serine/threonine protein kinase IREH1 OS=Arabidopsis thaliana GN=IREH1 ...
IRE3_ARATH    0.0e+00   67.91         Probable serine/threonine protein kinase IRE3 OS=Arabidopsis thaliana GN=IRE3 PE...
IRE_ARATH     3.4e-298  56.03         Probable serine/threonine protein kinase IRE OS=Arabidopsis thaliana GN=IRE PE=2...
IRE4_ARATH    6.9e-182  54.40         Probable serine/threonine protein kinase IRE4 OS=Arabidopsis thaliana GN=IRE4 PE...
Y0701_DICDI   1.4e-94   46.27         Probable serine/threonine-protein kinase DDB_G0272282 OS=Dictyostelium discoideu...

Match Name        E-value   Identity (%)  Description
A0A061FX05_THECC  0.0e+00   82.08         Kinase superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_012919 PE=4 SV=1
B9T5A7_RICCO      0.0e+00   82.95         Kinase, putative OS=Ricinus communis GN=RCOM_0054220 PE=4 SV=1
A0A067L7C1_JATCU  0.0e+00   82.36         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_22916 PE=4 SV=1
A0A061FWF8_THECC  0.0e+00   81.98         Kinase superfamily protein isoform 3 OS=Theobroma cacao GN=TCM_012919 PE=4 SV=1
A0A061FV35_THECC  0.0e+00   81.90         Kinase superfamily protein isoform 5 OS=Theobroma cacao GN=TCM_012919 PE=4 SV=1

Match Name   E-value   Identity (%)  Description
AT3G17850.1  0.0e+00   71.66         Protein kinase superfamily protein
AT1G48490.1  0.0e+00   67.91         Protein kinase superfamily protein
AT5G62310.1  1.9e-299  56.03         AGC (cAMP-dependent, cGMP-dependent and protein kinase C) kinase fam...
AT1G45160.2  1.3e-191  54.27         Protein kinase superfamily protein
AT1G30640.1  2.8e-72   39.73         Protein kinase family protein

Match Name                        E-value  Identity (%)  Description
gi|778656691|ref|XP_011649519.1|  0.0e+00  99.72         PREDICTED: probable serine/threonine protein kinase IREH1 [Cucumis sativus]
gi|659066229|ref|XP_008443041.1|  0.0e+00  98.41         PREDICTED: uncharacterized protein LOC103486458 [Cucumis melo]
gi|590666113|ref|XP_007036900.1|  0.0e+00  82.08         Kinase superfamily protein isoform 1 [Theobroma cacao]
gi|255585466|ref|XP_002533426.1|  0.0e+00  82.95         PREDICTED: probable serine/threonine protein kinase IREH1 isoform X1 [Ricinus co...
gi|802551124|ref|XP_012064629.1|  0.0e+00  82.36         PREDICTED: probable serine/threonine protein kinase IREH1 isoform X2 [Jatropha c...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR000719  Prot_kinase_dom
IPR000961  AGC-kinase_C
IPR008271  Ser/Thr_kinase_AS
IPR011009  Kinase-like_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0004672  protein kinase activity
GO:0005524  ATP binding
GO:0004674  protein serine/threonine kinase activity
Vocabulary: Biological Process
Term        Definition
GO:0006468  protein phosphorylation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006468      protein phosphorylation
biological_process  GO:0009069      serine family amino acid metabolic process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0005524      ATP binding
molecular_function  GO:0004674      protein serine/threonine kinase activity
molecular_function  GO:0004672      protein kinase activity

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI01G05030.1  CSPI01G05030.1  mRNA
CSPI01G05030.2  CSPI01G05030.2  mRNA
CSPI01G05030.3  CSPI01G05030.3  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                               Source   Source Term        Source Description              Alignment
IPR000719  Protein kinase domain                         PFAM     PF00069            Pkinase                         coord: 566..854 score: 5.5
IPR000719  Protein kinase domain                         SMART    SM00220            serkin_6                        coord: 565..854 score: 1.7E
IPR000719  Protein kinase domain                         PROFILE  PS50011            PROTEIN_KINASE_DOM              coord: 565..854 score: 52
IPR000961  AGC-kinase, C-terminal                        PROFILE  PS51285            AGC_KINASE_CTER                 coord: 855..958 score: 1
IPR008271  Serine/threonine-protein kinase, active site  PROSITE  PS00108            PROTEIN_KINASE_ST               coord: 684..696 score:
IPR011009  Protein kinase-like domain                    unknown  SSF56112           Protein kinase-like (PK-like)   coord: 744..893 score: 2.83E-92; coord: 562..712 score: 2.83
None       No IPR available                              unknown  Coil               Coil                            coord: 319..339 score:
None       No IPR available                              GENE3D   G3DSA:1.10.510.10  -                               coord: 674..861 score: 1.3
None       No IPR available                              GENE3D   G3DSA:3.30.200.20  -                               coord: 541..673 score: 3.8
None       No IPR available                              PANTHER  PTHR24356          SERINE/THREONINE-PROTEIN KINASE coord: 479..720 score: 0.0; coord: 747..972 score: 0.0; coord: 1..65 score:
None       No IPR available                              PANTHER  PTHR24356:SF144    SUBFAMILY NOT NAMED             coord: 1..65 score: 0.0; coord: 479..720 score: 0.0; coord: 747..972 score:
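
Each `coord: start..end` entry in the Alignment column gives the span of a match on the protein. A minimal Python sketch of extracting those spans and their lengths follows, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention). The names `extract_spans` and `span_length` are hypothetical.

```python
import re

# A "coord:" entry is two integers joined by "..", as in "coord: 566..854".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def extract_spans(alignment: str) -> list:
    """Return every (start, end) pair found in an Alignment cell."""
    return [(int(a), int(b)) for a, b in COORD.findall(alignment)]


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


# The Pfam Pkinase match and a multi-segment PANTHER cell from the table above.
for cell in ("coord: 566..854 score: 5.5",
             "coord: 479..720 score: 0.0; coord: 747..972 score: 0.0; coord: 1..65 score:"):
    for start, end in extract_spans(cell):
        print(start, end, span_length(start, end))
```

The pattern keys only on the `coord:` prefix, so it works whether or not the score that follows is complete.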