CmaCh13G003170 (gene) Cucurbita maxima (Rimu)

Name: CmaCh13G003170
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: 3-ketoacyl-CoA thiolase
Location: Cma_Chr13 : 3661887 .. 3682674 (+)
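
The location field packs chromosome, start, end, and strand into one string. A minimal Python sketch of pulling it apart is shown here; the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(text):
    """Split a 'chrom : start .. end (strand)' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


loc = parse_location("Cma_Chr13 : 3661887 .. 3682674 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, this feature spans 20788 bp on the plus strand of Cma_Chr13.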
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exon, CDS, three_prime_UTR
ATGGATAACGACGCCTTCAACGTCTGTATATCTTTTCTACCTAATTCCTTGATCACTCTGAACATCATTTTTGGGGTGCCCATTTTCTTCTTCAATTCTGCACCTCTCTGCAAGCTACCTACTGCCGCTCTCTTCTGTTCATTTTTTCTTTTCTTTGTACAGCTTCCTTCTCTCCCATGGCTCCTCTTTCCTCTGATTCCATCAACCCCCGAGGTTCTTCAATTCTCCCCTATTTATTTAAACATATGTTTTGATTCAATTTTCGCTCCTTTTTCATGAATGAAATCATGGGCTGGGATCGCCTCTTTGTTATGTTGGCACAGATGTTTGTATTGTGGGTGTTGCTCGTACGCCAATTGGTGCCTTTCTTGGTTCACTTTCATCTTTCTCTGCTACCCAACTCGGTTCTATAGCGATTGAATGTAAGTTTTCTCCGGATTTGAGTACCCTGTTTTAAGATGATATGAATTGAAGAGCGAGGATGCTTACAGGTGCCCTTAAGAGGGCAAATGTTGATCCTTCTCTTGTGCAAGAGGTATTCTTCGGAAATGTTCTCAGTGCAAATTTAGGGCAAGCTCCTGCGAGGCAGGCTGCCTTAGGTGCTGGTATACCAAACTCTGTTATTTGCACCACTATTAATAAAGTCTGTGCATCCGGCATGAAAGGTGTTTCTTCATCTATCAAAATTTTTATTGTATTAGGATAACAGCATTGGTTCTGCAAAAGAACTAACACTTGTTTTTTCTTTCTTTTTTGTTGGCACAGCAACAATGATTGCAGCGCACACGATTCAGTTAGGTATAAATGATGTTGTTGTTTCTGGTGGTATGGAAAGTATGTCTAATGTGCCCAAATATCTCCAACTAAGGTTGGCCCTTTATCCTTCACAACCCCCCAATTATTTTATTTTATTTTAATCACTACTTCTTCTTCTTGTTGTCCTCATAACCTTTAATCCCTGATCATAAAGTATAGTTTAAGAAAACATTATACTTCAATGTAAGAATAAATTCATATATCTGATTGACTATATATTGTTATCAACTAATGCAAATTTCTCTTGCCATAGCTGGAACAGGGTGAGTTTTCCTTAAATGAGAATTATGTCTTTTGTTCATGGAGTATAAATTTTGATTATGTTATGTTTTGTTTATTACTCGCTACTTGTTCTTCTAGGTAATATGTTTATCGTTGTTGAAATATTTACTTTAATTTTCGTTCATCCTTAAAATATATGGTTAGATCACATGTTTAGTCCTTAAACTTTCAAGTTTGTGTCTATTTGGTCTCTAAACCTTTAAAAGTGTCTAATTGGTCCTTGAACTTCACGTTTTGTCTAATAGGTCTTTGAAATTTCAATTTTGTAAATAGGTTCCGGAATTTTTTTTTTATTTTTTTATTTAATTGATCTTATAGATATAAAACTGAATTTTATATCTAATGGGACTTTATACTTTCAATTTTGTACCTAGCACGTTCTTGGATTTTTAAAAAAATGTTGAATAGGTCATTGACCTATTAGACACAAAATTGAAAATTAAGAGGCCTATGGGACACAATACTGAAAGTTTAAGGATCTATTAAATGCTTTTTAAAGAAATTTCTGACTAAATCGACCCGAAGCTGAATTTGTAATTCAACCTAAAAATAATGACCTTATTTTCCACCTCCATTGGAGCCAAAAACCTTGTTTAGTGCATTGCATTGAAATGCACTTTCTCACTCACGGACACATTTATAGAAGAGAAACTAAGCTTTCATTTAGAAGAATGAAATACTTTCTCGCACTAATTTTGTTAGGGGAATAACTACTTCTCATCCTGACTCAAAGTTATGAACAGAAAAGGGTCTCGATTTGGAAATGATACTGTTGTAGATGGTATGCTTAAAGATGGTCTTTGGGATGTTTACAACGACTTTGCGATGGGTGTTTGTGGTGAGATGTGTGCCAGCCAGCACTCCATTACAAGAGAAGATCAGGTAAAGTTTACCAGATAGGT
AGTTTCACTCAGTACTATTATTTTTGTTCAAAATCCAATTTGTGAAACCTCGAACTTTTTCAAGCCATCAAGAGCAACTCTTTTGGGATAGGACAAGCCAGCTAGAAACAGTTAGCAATTGAAGTACTCTGATTCTCGTTGAAATGTTTTTTCTTATCTGCTCCTTCAGAAATGATTTATCTATATAACATGTTCAAGTATAAATAAATGATCATATTTATCATACGCATAATCCATTGGGCTAAACTTTTTAGTTTACTGGCTGTTTCAAAGTCTTGTAAATGTCACTTCCTCCTCAATTATTAACATAATCACTTGTTGGGCCTTGTACTAATTTCCGACTGTGAAGGAAGCTAAAATATGAACACAAATATTCAAATTTTCATAACCTATAAGCTTTAAGCTTTTAGCTTCAGGGTGATTTAGCACAATTTAATGACCTTTTTTTAAATTACTCTCTCTTATGTTTATTAATCCCCCTCAATAGGATGCATATGCTATTAAGAGCTTTGAGCGAGGACTTGCAGCACAAAATAATGGTCAATTATCCTGGGAAATAGCTCCGGTAAGATTGCTATTCCGCTCTAAGAAAGTCTATTCTATTATAAATCAGACCTAACTAATTGTTTGAATTTTTTAGGTCAAGGTTCCTTCCGGAAGAGGGAAGCCATCTTCAATTTTTGACAAGGACGAAAGCTTGAGACAGGTAAATTGTCACTAAAAACAGTAAAAAGCACTGAATATTTGTTTTCTTAAAGTAATTAATAAACACCCAGTATTATATGCATACTTGATACTTAGGAAGATGAAGCAAGCTTATCATAGGGAACTAAGGGGATTGACATCAAAATTTTTGCTAGTTGATTGGGAGACATGTTTGAATTTAGAATGAAGCATTTATAAGCAAAAATTGGAGTTTCTTGTTTAGAAATAGGAACTGACTTCAGGTTCCTGCTAAGGTTAGTAAGTTCCGGAAAAAGTTCTTAATGATTTTCAAGGTAGGGGATAGTTTTGTCAGTGGAACTGCAAGTTGTATATGGGGCCATTCATTTTGTGTGAAGGGATTCCAAGAAAGTTGATTGGGTAAGTTGGGGGTCATCTCTCTACTTTTGTCTATATATATATATATATATATATTTTTTTTTTTTTTGAAATTCTGGAAAGGTTACTAGTTTTCTGAAATATTGAGAACTTCCATTTTTCTGAATATGGACCTTTTTGATGCTACATTCTACTTCATTGTCAATCAATTTTCTTGTTCTTGCTAGTTTGATGCTGCAAAACTAAGGAAGCTCAGGCCAAGTTTTAAGGAGGATGGCGGTACTGTTACTGCTGGCAATGCTTCTATCATAAGGTTCTCTTTCTTGTAATTGTATTTTCATATTTTCTGCCAATTAATTTTATATGCTGGTCTTGAAGAGATTTGATACAAAATAATGTGTCTTAATTATGCCACAAAAACTTCATGGTGGATGACAAGATTTTACCATACATGTTTGGATTTAGGGAGGTTTCATTGTATTTTTCAATTTTAGAGTATTATTTTTAAGGTTTTAAGTTTTATTTTCAAGGCTATAAATATCCGATCATGTTGTTGCTTTAGTATGAAGATTTTGGATATTAAAATGAAAGTTGTATATTATGCTTCTCAAGCCATGAGCTTGTTCCTTTACCTTTCTTGTGTTTAGATTTGCACCTAGAGTTATTCAAGTAGTCTTGATCAAGTTATCTTGTAAATTAATTCAAATCACGAGTTTAGAAACAAGTATTTTTTACTCTTGATCTTGATCATTAAGGTAATCCGTATCAAAACATTCTCTTAGGTTGATTCCTTATCATTTTTGAGGTTCTTGGATAATTTATTTCGAAAGATCTTGTTCTTCAACAAGGTTGTTAGATCAAATCCCAAGAGTTTTATAACAAGATCACTAATGTTTCTAAGGTGTTTCTTGAGCTGAACTGAATGCAGAAATGCATGACCATCAATGTTATTTGTTTAT
TTATTTGCTATTGAGTGAATTTTTTTAGAGCACATGATTTGGTAAGTTATGGAAAGACATGGTAGTATCAGTGGGTTTGTTTATTCCAGCGACTGATTTGTGAAGTACCTAGGCTAAGAAATTTACTTTCACTTTCCGTTGATGCCAAAGAGAGAACATTTTTAGGCAGCTGCATTATTTTTTATTATTTGATCTTGAACAGTGATGGTGCTGCTGCTTTAGTGCTGGTGAGTGGGAAGAAGGCGGTCGAACTCGGTTTAGAAATAATTGCCGTAATTAAAGGATACGCTGATGCAGCTCAGGTAATTAAGTTCTTGATGTTATTATCCTTCATACTGGAGAAAGATCATGTGGCTTACAGAAAAAATGACTACCAGGCGCCTGAATTATTTACAACGGCTCCAGCCCTTTCAATTCCAAAAGCTATCTCAAATGCTGGTTTACATGCTTCTCAAATTGATTATTATGAAATAAATGAAGCCTTTTCCGTAAGAACTTGTGCTTCATTTACTGCTATTGCATTTTCTTGTTCATCATGCATGATCTTGTTGTTAAGTTGCTCTAATTTTTCTCTTCAGGTTGTGGCTCTTGCTAATCAGAAGCTGCTTGGTCTTGATCCTGTGAGTCTTTTTTGTCATTTATTAAGTTCATCACCCATTTGGTATTCTTCAGAGTTCAACTAATTGTGTGCTGTTGTCCGGTAATGGTTAGTTTAGCAATCTGGTCGTTGATACATAAAGTAGGTTGTTGGACATGTCTTTTGTTACATGGTCGCCGAGCTCACTCCGCTGTCCGTTTTTGCTAGTGTTCGCTTGCATTTTTCTGTCCACTTCAGGAATTCTCATGAAAAAAGAAATTGAATTGGAACGTTTTCTTTCCGACATATGTCTTCTTTTTTTAATCTTTGGAGTTCATGGCAGTAGGGTATGTGAATTTTATTCAATTTTGGGTGGACTGTTGGCGCAAAAGGTCCTCTTCTTTCTTTTTTTCCCTTGTCCCTGTGGTTTTTCCTTCTTATGTCCATCATTTCAAATTTCCGCTCCTGTCTGGATTTCAAGCTTTCAAGTATCGTTAGAATCTTGAGTGTAGAGAGTTTTCTCTTGTAATTTTTTTCTGAAATGCTTTGTTTCCTCCCAGAAGAAATCAAAGCTCTTTGCTCAGCTTGTTGCTCATGGAGGATTGATATTATGAACGTTATTTAGGTTTCAAGTTATGAGCCGTTGTCTTGTTTTATACTGACACATCAATTTGTTTGTGATTTCTTATTTCGTGCGTGTGGTTGAATGCAGGATAGAGTTAATGTTCACGGGGGAGCTGTATCTTTGGGACACCCATTGGGATGTAGCGGAGCTCGGATTTTGGTCACATTGTTAGGGGTAATTTCTGTTACTCTCCGCAGACTGGATAGTAGTAGAGAGCTCAAATCTTGGTTACATTGTTAGAGATAAAATTCTTGATGTCTCCTTTTCTTTCTCTCTTCAATATTTCTCCTCATGTTTCCTTTAGTACCCATAATCTTGAGAGACTGAGACAGGGTGTGAGATCCCCACATCAGTTGGGGAGGAGAACGAAACACCCTTTATAAGGGTGTGGAAACCTCTCCCTAGCAAACGTGTTTTAAAAACCTTGAGGGGAAGCCCTAAAGGGAAAGCCCAAAGAGACAATATCGATATGCTAGCAATGGGCTTAGGCCGTTACAAACGGTATTAGAGCTAGACATTGAACGATGTGCCAGCGAAGAAGCTGAGCCCCGAAGGAGGCGGACACGAGACAATGTGCCAGGAAGGATGCTGGGCCCCAAAAGGGGTGGATTTGGTGGGGTCCCACATTGATTGGATAAAGGAACGAGTGTTAGCGAGAACGTTGGGCACCGAAGGGGGGTGGATTATGAGATCCCACATCAGTTGGAGAGGAGAACACCTTTATAAGGGTGTTGAAATCTCTCCCTAGCAGACATGTTTTAAAAACCTCGAGGGGAAGCCTGAAAGGGAAAGCCCAAA
GAGAACGATATCTGCTAGCGGTTGGTTTGGATCGTTACACTCGGAGTACCATATCTTATAGGGGTTGCAATCTTATAGAATAAGTATGTTTGAAACAGGTACTAAGACAGAGAAATGGGAAGTATGGGGTTGGTGCTGTGTGCAATGGTGGTGGAGGAGCATCCGCTCTTGTGGTTGAACTCATGTAAGCTCTTTTAAAAAGCGGCTTACAACAATAGTGCCTTTGGTTCTGTATTACAATAGCCATATGTTTGAACATTCTGCATTTTGTTCCTTATGACTGTCCTGCAGGCCTGGCGCTAGAGTTCGACACTCCAAGCTGTGATCCAAATTGGCCCTCTCAAAAGACATATGGCAGTTCACTATTTCTCCCCTTCTACACTTGAAGAAAGAGGTCGAGTTATTCACTGTTATCATATTTAATACACCAAAATGCTCTCTTTTTATTACATTGCCTCTATTGAGTGATAGAAAGAATTTGGGTCTGTTATGTTTGTCAATTTCTCGGCCATCTCTACACAATAAAAAAGATGGGTCAGCCATAGATATGTAGGTTTAGTTCATGGGATAATGGACTTAGTATGTAGGTTTATTAGGGGTCAGACACAATAATCAAACCAAAGTATATCCATGCCCTCCTTCAAGCAATGTCGGGATACTTAGTATTAGAATTCTTAAGGTGTTCTCACTAAAAAAAATAATGGGTGTGACAGTGTACCGCCCCTAATATTCTTTAATCTAAGACTTCCATGGCCTTTGCTGCGTTCACTGTCAACCATGCATGGGAGCCTTGCCCTTACTTGAAAATATAGGTAGCACGTGGCTTGGGTATTAATACCCAGTAAGTTGCCCCACTGTTGAGGTTAAGTATACAATATACAAGCAGTTCACACACAGCACAATCACAAGTAGAGAAAAGCAAACATGTGAGGGGTTGTCTTCCATTCCCTTTTTGCATGGCCTTAGGCCCTGGTTTTATCCGGGTAGAGTGGATACGTGGTATTTCTTCACGCACGACCCACGTATGTCAACGTGGGCCCACCAGTCGGGTTCGTACACTAACCTCTTGGGCCTCGCTTGGTCAGGTACGCATGACATACGCTATCGGACGACATGAAAAAGGAGGAGGCCTGCATGATATCATAGCGAGCATAATATCGAGTGTACGCGTCCGTACCATCATTACATAACATGGCGGTAACCCTATGCATAAGACTCAACATGCATGGGGTCACTAACACCTACCCCCATGGCATTACACATACATTGTTTCATTTCCCTCGTTGCGTTACATAGCGTGACTTAATGGAACATTATATGAGTGTATTGTAGTACACCAGTCAACACATCCATGACACGTAAAATTCCCAACACATTCATGACATGCCAATCAATTTCATGCATACAGGGTAAAACAATTAAGGGCAACAACATCACATATATGAGCATGACATACATAACATTTCACCTAGGCGGTGCGTCCATTGCTTTGTCTTTCTTGCTCGAGAGTCTCGACGAACCCTAATCGCGGGACCTGACTACTGTCGCTGGTGAGATCGTCGTTGTTCGTTTGTCCCAGGCGGTGCGTCTGTCCTCCTTGTCTTGACGTACCTCGACGTTGGTTCACTCCCACCTAAGGCTCACAAACGTCCGAAGTCTTGCGGCGTTATCGATCCTCCACCTCAATCTTATTGCTTCTCTCTCACGGTACAACAATGACTCAACAGAAGGGCAGCTACTTAGATACCCTAGAGTTTAATGTGGGGGCTCAAGGAAATAGTACGTTCTCTCAGCAACCCTCACTAAAAATCGCGAGATTTTTGGCGTTTCATTTTTTTTTTTACTGGAACACGTATCATTACATCACCCCCTTGAGAAGGACTTTCGTCCTTGAAAGGTTTTCATCCAAGAACTCTGGGTACTTCTCCCTCATCTCATCCTCTCTCATCTCATCCTCTCGTTCCCATGTGGCCTCCTCGGCTGACTAGTTCCCCCACCAC
ACCTTGACAAATGCAATTTCCCTAGTGCGCAAAGTCTTTATCTTGCGAGCCAGGATCCCCACCGATTTCTCCTCATAGATTAAGTCTTCGGTTATGTCTAGAGCCGAATAATCTATCACATGCGTGGGGTTCGCTAGGTACTTGCGAAGTGCCGATACATGGAACACGTGTATGGTTGAGAGGGACGGGGTAGGGGTAACCTATACGCAACGGGTCCAATTCGCTCCAAAATTTCAAAGGGTCTTATGAAGCGCAGACCTAGTTTGCCCTTGCGTCCGAACCTTAGAATGCCCTTCATGGGGGTCACCTTCAGGAATACCATGTCGCCTACTTCGAACTCCAAGTCTCTTCGTCTCAAGTCCGCGTAACTCTGAGCGGTGCGCATTCTCCCCCTGATCTGTTGCATAGCCTCATTCTTGACTTGGACCAGCTCCAGACCCACTAGTTCCCATTCTCCTACTTCGCCCCAACATAATGGTGTCCTACACCGCTTTCCATACAATGCCTCATACGATGCCATGTCAATTGTCACTTGATAGCTGTTACTATAAGTGAATTCCATAAGGTGCAACTTCGAGTCCCAACTCCCGGTAAAATAAAGTACACAGGCCCCTAGCATGTCCTTTAGAATTTGGTTCAAGAAGTCTGTTCATCGGTCTGCGGGTGAAATGTTGTGCTGAAGTCCAGTTGAGTGCCCAACGCTTTCTAGAGTCCACGCCAAAATGCTGAAGTAAAACATGGGTCTCGGTCTGTTACAATCGACACGGGAACCTCATGTAATCTGACAATTTCCTTCATGTACAGTTGTGCCCAATTGTCCACTTAGGTGGCCTTGCCGGGGAGGAAATGTGTTGACTTTGTCAATCTGTCCACGATCACCCATATGATCGTGTAACCCTTCAACGTCTTAGGTAGGCCCACTATGAAATCTAGGACAATGTTCTCCCACTTCCATTCTGGTATATTGAGAGGTTGTAGTAACCCCACTAGTCTCTGTCTAGGTGCCTTGACTTGTTGGCAAACCAGGCATCTGTTGACGAACTTTGCAATATCCTTTTTCATACCACATCACTAGAAGTGGGGTTTGATATCCTGATATATTTTTGTACCGCCAGGGTGCATGCCGAATGAGGAGTTATATGTCTCCTTCAAATATCTCACCTCTCAATCCCTCTATTGCCGGTACACACAAGCAACCTTGGTACAATAACCTACCATCTGCTGACTTGGAGTACCCACCAACTGGTTCTGTCAACATTTCATTTAGTACCCTCGCTAAACCCGAGTCTCTTGCCTGCGCCTCAACGATTCTCTGTCGAAGAGTGGGTTATACAGTTAGTCGAGCCAGTTGTGCCTTAATACCTTCCAGTGCTACTGCTATTTCAGCTCGCTCAAAATCTTCCTGCACCTTGGGCTCTTTCGTGATTAGTGCCGATGAATGCGTTGTCTTCCTATTTTGGGCGTTTGCAACCACATTGGCCTTCTGATCGCTTGTATCGTCTATGATAAAATGATGTAAAATTCACAATAATAAGTGCATGTATACACTTTCATAAGTAACAAGTAATAAGTAAATTATCGATTCCATAGGAAACTTATCTTTTTTGTTTAATTTTATAGCTGGAAATTAAGTGAATTTTTTGGTAACATTCAAATGGGTGTTGTAATTTTAACTACTTAAACAAAACCAAAATAGAGTGACATCGAAAAATAGAAAATGATGAGAAAATCAACTATAATGAAGAAGGTTCAGCTAAGAGAGATTAAATAAATAAGTTGTTGAATTTTATGGTAGATTTTGGTTTCTTTTAATAGATTCCTAAAGCCCACTTCGGAGGGTCATGAATGGTGATTATTTTCATAATGGTTGTAACTTCTATTAATAAGATTTCTAACCCTTCTATTAGCCTTCAAGTCTTCTTGAATAATTATTTCTAACCCTTCTATTAGCCTTCAAGTCTCCTTGCTGGAACTAGAGTATTAAAGAGGATTTCATTGCT
TCCCCTCTTCTACCGAATAAGAGAACAACTATGTATGTAAAGTAACTAATTAAACACACGTAAATCATCAACAAAAACTCTTTAATAAACTAAAATCATACCGTCAACATACTTAAACTATATAAAACTCTCTCGCCTCCCCTATAAAGTTTAACTCTCCATACAAACTGAATTAAACATAGAAATTCACATCTCAATCACGTAGAATTGATGGAATAAGAAAAGAGAATAAGGAAATAGAGAGTTATCCCGGTTTGTTGCCATTCGTCAATGAAAGTTTCCTTCTCTTGAGCTCTCACGTCCACCTCTTTGCCTTAATCAATCCCTGTTGGTCTCCTTTTCCTTCTGCTGTCAACAGCTGTTGGATTAGGGTTTTTTTTTTCTTCTCTAGTTTCACAGTAGTTCATTTCAGCAGAGACTTCTACTATGTAATTCGATTTTTCTCCAAAGTGTGTCTCCCTCTCTTGGTCTTCTTTTTCCTTTTATATTGAGCATAGGTTCCTCTTTAGGGCTGAAAAGGAATAAGTTGTCCACATAGAATGTTCCTTAGTGGATAGTCAATGACCCACTTGGACGGCTGATGTGGATTTAAAATTTCTATGTCATCCTCCATTTATTTACAGATTTGCTACTCCCTCATAGTTTAGCACAAATGTTTGAAATTTTACCACATAACTCAATTAAATGTACAATTAAGATCAATTTATGTCAAATTATATTATTTTTCATGTTGGACCGCAATAAATGATTTCATGTTCACTAAGTAGAATAAATGAGGTAATAAATGAGAAAATGGTGAATAAGAGTGCAATATTAAATACAAAAAAACTTGATATGACTTTACTTTTAGAGAGTTATCAGGGTGGTACTGGATATCTATGTCATAATCCTTGACTAGTTCTAGCCATCGTCGTTGTCTCATATTCAACTCTTTTTGGGTGAAGAAATATTTCAGGCTCTTATGGTCGGTATAAATCTGAGTCCTCTCCCCATACAGATAATGTCGTCATATCTTTAATGCAAAGACTACTGCTGCTAGGTCCAAGTCATGCATGGGATAATTCTTTTCATAGTCCTTCAGTTGTCTGGATGCGTAGGAAATAACCTTTCCGCATTGCATTAACACGCATCCCAGGCCCTTCTTGGAAGCATCACTGAAAATGACGTAGTTCCCTGTACCATCAGGTATTGTGAGCACGAGTGCAGTCACCAGTCGATCCTTCAGTTCTCGAAAGCTTGTCTCACACTCCTCGGTCCAGATGAACGTCGTAGCCTTTCGAGTCAATTGCGAGAAGGGTGCCACTATTTTGGAAACTAGTTCGTGAGGAAAGAGGAATTTGAACTACAAAAGGTTAGTATTCAAACTCATCGAGTCTTAATGCTTTTGAACATGCTAGCTAATTTATTATAATTGATGTACTTAGAGTGCCTAAGATTCAAATTGCTTTCGCATGTGTTATATTAACACCATCACTTGTTTTTTATCGTGATCTGATGCTAACCCGACCGAAGGTCAATCTTTGAAAACACTGTGGCCTCTCTAAGCTGGTCAAACAAGTCCCGATGCGTGGGAGGGGATACCTATTCTTCATCGTCCTCTTATTCAGTTCCATGTAATCGATACACAAGCGTATGGAACCATCCTTTTTCTTGACGAATAACACCGGCGCACCCCAAGGCAACACACTAGGTCTAATAAACCCCTTGTTTAGCCACTCTTATAGTTGGATCTTGAGTTCTTTCAGCTCCGCTGGTGCCATGCAGTACGGGGCCATGGATATAGGGTCGGTTCCTAGTTCCAGTTCTATAGCATAATCGACATCTTAGGATGGAGGGATCCCTAGCAAGTTTGGGAACACATTTGGGAACTCATTCACTACAGGCAACGTGGTTAGGGAGACTCCTCTTTCTTTGATCTTTGCCACGCTAGCTAGTATCACCTAGGCTCCGTGCTGGATCAGCTTCTTCGCCTTCATCATTGATATCACCTTGGACATTGT
GCCCAGACTTGTGCCCTTGAATTTGAATCTGGTCCTGAACAGTGGCATGAAAATAACCTCCTTCTTATGACAATCTATGCTAGTATGGTTTTCCGCTAACTAGTCCATGACCAAGATTACATCAAACACGGTCATATCGACCACCATTAAGTTTACGCCCAAGCTAATCCCTAATACTACTACGTGACCATTCCTTACCCTATAGGCGGCTACCATATCCACCCATGCTGGGGTGGCTACTAAAAAATCATGTAACAAGGGTTCTAATACGATTCTTGCTTAACTAACAAAATCAGTGGAAATAAAGGAGTGTATAGACCCCGAGTCAAACAACGTCAGAGCAAAGTGACCAAGGACGGATAGTGTACCTATCACCGCCATATCTGGGTTTTTAGCGTCCCTGTTGGTGGAGGCATAGGCTCTCACTGGAGGACAGTTCTGAGTGGGGGCATTCACTATGCGTAGTGGTCTCTACTGACCCCCGACGTTATTGGCGCAACCTCCTGGACAATCCCTGGCCAAGTGACCCTCCTTTCCGTTTCGGAAGCATGCTCCAGAGCAAGCCAAGCACTGACCCCAATGGTTCTTCCGACACTCGTTGCACTTGGGTCTATCGTCTACTTTGCTTTCTCTTCCCTGCTGCCCTCGCCTGGAATCGCAACTCCGTCTATTTGGTCATCTCGGTGGCGGCGATGAGGATGAATTACGACAGTCATCACAATCACGGCATCGTCTGTGTCTATCAGTCGATGAACAGGATTCGTGATGATCATCTCGGTCACTGCATCATTTTTGTCTTTGGGTCAATGAGCGGGATTCGCTCGGTCCATCGGTCCCCTCCATGGCCCGTGCTACTCTTAGAGAGTTAGCTTAAGTCGTAAGCGCTATCACCTCAACCACGTTACGAATCTTCCTGTCCAACCCCAGCACAAACCGGCAGGCTGTCATGTAGTCAGTGTCTACCAAATCCGGGGCAAAGCATTTCAACTTGGTGAACTCTTTGGCGTAGTCAGTAACCGAATGCTCTCTCTGCTTCAGGTGGGTGAATTCTTTATGTTTGCGGATCTGGACGTCCTTTAGGTAGTACGCCTCTACAAATGCCTCCTTAAATTCTGTCTATGAGATTTAACCCTCAGGTCTGATAAGCTCTTGGGTGGATCCCCACCATACTTGAGCATCGGCACGCAGCATGAATGTTGTGCATTCCACCTTTTGGTCTTCGGGATAGTTTGTCAGCCTGAATACGGTCCCTATGGAACTCAGCCACAACTGTGCCTTTGTCGGGTCATCGGAAGTTCCCTTGAAGGTTTGTGGGTCGCCACTCTTAAAATCTTGCAAGCACCTGGCCTCCCTGGTAGAGGCTGAGTCGTTAGTTGACTGATCACCGAACAGGTTTCGTACTATGATCTGCAATACGTCTACCAATTTCGCCGCCAAGTCTCAGGCGGTCGCTGGTGGCAGGAACGTCATCGCTTCGGGCTCTTCCACGGAGTTCTCACGGTCTGAACCTCGTCCTCTTCCACCTCTCCCTCTACCCTTAAGTGGGGGGTCATCCCCTAGTACTTCCTCCTCGCGTCCGGGGTGGCATATCTAAACATAGTAATAACTATGGGGTTACATACTAATCTATCAGCTTCTATAAACAATCACAATGACATAAGCCACGTACACATACACCCATATATTTGTCACATTATGGAACATACTAAAATCGAGACATGAAAGCGTACAAACCATAATTCTTCCTTATAATATACACATATATCTGACGGTGACGGTGTGGTTGCTAGAGGACCAGGGAAGTACCTTAGGCCTACCTCGTAGACCAGTCTACAAGATCTATAACCTAGTGCTCTGATACAGACTGTAACGTCCCTAATTTTCTTTAACCTAATTAGCGACGTTGCCTTGCATTCATATGACAAAAGTACGAAAATTCATATAAGACAACTTCAATAGACATGTTCATTATTATAAGTAGTTCGTAACTGGAAAATCAA
TAGCGACGACCTAACTGGGAATATCAATCAACAACTAACTAGGAGAAACAATTGCAAATAGCTATTATTTGAAAGAGTCTTAATAAGATGCCTACGGCACTACGAGCCCTCTCTAAGATTTCCATGGCCTTCACAGCGTTCACCGTCAGCCATACATGGGAGCCTTGTCCTTACCTGAAAATATAGGTAGCAAGTGACTTGAGTATTATATAAAATACATAGTAAGTTGTCCCACTATTGGGGTTAAGCATACAATACACATTCAGGACACAAGCAGTTCACACACAGCACGCATGCACAATCACAAGCAGAGAAAAGCAAACATGAGGGGCTAATGTTGTTACTATAACACGTGCATGAGCAATTTGGATCTTAGGTACTCTAAGTACATCAATTATAGTAAATTTGATGGTGTTAATATAACACATGCGGAAGCAATTTGGATCTTAGGCACTTTTAGAACATCAATTATAGTAAATAACTAGCATGTTCCTAAGTATTAAAACTTAATGAGTTTGAATACTAACCTTTTGTAGTTCAAATTTCTTTTTCCTCACGAACCGGTTTCGAACCACCACTAGTGTCTTCTCCACTATCCTCTAGCCTTAGAACGGGATTGTGGAATCCGGTGAGTGACTAAACTTGGGAGGGATTTTGTATATATGAAGAGAGTGTTTGTGAGAGGTTTTTCACCATAGAAGAAACCTTTCATGTTTTATGATCACTCTATTTATATAGTCATGTTTGCATGTTCATGCAAATTTGAGATCACCAAAATTTGACACTCAAAATTTCACTCAAATTTGTGGGGTTTATTTAGAAAGCATGCCAGTTTGGTTAAACCAAATTTTGACACCTCAAAATTTTGCTAATATAGATTTTTACATTAAATGAAGTCAACATTTTGACTTTCTTCATTTTGATTCAACAATCAATTTGAATTATTAATTTCAAATTAAAGTAATATTAAATGATTAATTAAACTATTTAATTAATATTTAATATTAATTCAAATGTCAATCTCAATTAATTTCGACACATATTTGATTAAGATATTTCAATCTTATTTAAATATTTCAGATTTTCTCTTTTTGTTTAATTCGTAAATAAAACAAAAATGCGGTTAAATATATCGTATATATGTAGCGCATTCTCCCTAATTTGAATTCGAACATTTCGAACTCACTCGTCACATTGTTCTATGGTTTAGTTCGATATGAGCTAGCAAAGGGACCTAATGGACCTATAAATCATGGGCTCCAACGATCCAAGATTAACTGGTTAAACTCATTAACCTTGTTAACCAACTTTCGTTAAGTACTGTCCACTATAGCCTAGTAGTTGCACTCCCCTCACTATAGATATATTTCTGTCCATTTGATATAACCATGATTAGTAAGTCGATCATTCACAGATTGTTCGTAACTAGAGCTGGGTCAATTTACTGTTTTACCCCTAAAGTTACTTCTTGTTTCTTATGTCTCACCGATCCTCTAATGAACAATTGGTTTGTGGTCCAACCAGTAAACCGAATCCCTCTTAGGCCAATGAGAGGGTGGGGCCCCTTGTTCAAGACTTGGAGTCAGTAATTAAGAAAACTACCTCTCTTCTATCCCTAAAAGTGGGTAGGAGTGAATTCCATTTTGCACCCCACGTCCCCAGCCATTTACCCAGTCTTACCCCTGAAATGGGAGGTCTATTGAATCAGCGAACTTGAACCACTTGGGTAGGCATTTGCACCCCACGTTCCCAGCCATTTACCTAGTCTTACCCCTGAAATGGGAGGCCTATTGAGTCGGCAAACTTGAGTCACTCTCACCCATGCTAATCTAAGGATAATTTCGAATAAACAGGAGTTCATNNNNNNNNNNCCAGTCTACAAGATCTATAACCTAGTGCTCTGATACAGACTGTAACGTCCCTAATTTTCTTTAACCTAATTAGCGACGTTGCCTTGCATTCATATGACAAAAGTACGAAAATTCATATAAGACAACTTC
AATAGACATGTTCATTATTATAAGTAGTTCGTAACTGGAAAATCAATAGCGACGACCTAACTGGGAATATCAATCAACAACTAACTAGGAGAAACAATTGCAAATAGCTATTATTTGAAAGAGTCTTAATAAGATGCCTACGGCACTACGAGCCCTCTCTAAGATTTCCATGGCCTTCACAGCGTTCACCGTCAGCCATACATGGGAGCCTTGTCCTTACCTGAAAATATAGGTAGCAAGTGACTTGAGTATTATATAAAATACATAGTAAGTTGTCCCACTATTGGGGTTAAGCATACAATACACATTCAGGACACAAGCAGTTCACACACAGCACGCATGCACAATCACAAGCAGAGAAAAGCAAACATGAGGGGCTAATGTTGTTACTATAACACGTGCATGAGCAATTTGGATCTTAGGTACTCTAAGTACATCAATTATAGTAAATTTGATGGTGTTAATATAACACATGCGGAAGCAATTTGGATCTTAGGCACTTTTAGAACATCAATTATAGTAAATAACTAGCATGTTCCTAAGTATTAAAACTTAATGAGTTTGAATACTAACCTTTTGTAGTTCAAATTTCTTTTTCCTCACGAACCGGTTTCGAACCACCACTAGTGTCTTCTCCACTATCCTCTAGCCTTAGAACGGGATTGTGGAATCCGGTGAGTGACTAAACTTGGGAGGGATTTTGTATATATGAAGAGAGTGTTTGTGAGAGGTTTTTCACCATAGAAGAAACCTTTCATGTTTTATGATCACTCTATTTATATAGTCATGTTTGCATGTTCATGCAAATTTGAGATCACCAAAATTTGACACTCAAAATTTCACTCAAATTTGTGGGGTTTATTTAGAAAGCATGCCAGTTTGGTTAAACCAAATTTTGACACCTCAAAATTTTGCTAATATAGATTTTTACATTAAATGAAGTCAACATTTTGACTTTCTTCATTTTGATTCAACAATCAATTTGAATTATTAATTTCAAATTAAAGTAATATTAAATGATTAATTAAACTATTTAATTAATATTTAATATTAATTCAAATGTCAATCTCAATTAATTTCGACACATATTTGATTAAGATATTTCAATCTTATTTAAATATTTCAGATTTTCTCTTTTTGTTTAATTCGTAAATAAAACAAAAATGCGGTTAAATATATCGTATATATGTAGCGCATTCTCCCTAATTTGAATTCGAACATTTCGAACTCACTCGTCACATTGTTCTATGGTTTAGTAGCGCATTCTCCCTAATTTGAATTCGAACATTTCGAACTCACTCGTCACATTGTTCTATGGTTTAGTTCGATATGAGCTAGCAAAGGGACCTAATGGACCTATAAATCATGGGCTCCAACGATCCAAGATTAACTGGTTAAACTCATTAACCTTGTTAACCAACTTTCGTTAAGTACTGTCCACTATAGCCTAGTAGTTGCACTCCCCTCACTATAGATATATTTCTGTCCATTTGATATAACCATGATTAGTAAGTCGATCATTCACAGATTGTTCGTAACTAGAGCTGGGTCAATTTACTGTTTTACCCCTAAAGTTACTTCTTGTTTCTTATGTCTCACCGATCCTCTAATGAACAATTGGTTTGTGGTCCAACCAGTAAACCGAATCCCTCTTAGGCCAATGAGAGGGTGGGGCCCCTTGTTCAAGACTTGGAGTCAGTAATTAAGAAAACTACCTCTCTTCTATCCCTAAAAGTGGGTAGGAGTGAATTCCATTTTGCACCCCACGTCCCCAGCCATTTACCCAGTCTTACCCCTGAAATGGGAGGTCTATTGAATCAGCGAACTTGAACCACTTGGGTAGGCATTTGCACCCCACGTTCCCAGCCATTTACCTAGTCTTACCCCTGAAATGGGAGGCCTATTGAGTCGGCAAACTTGAGTCACTCTCACCCATGCTAATCTAAGGATAATTTCGAATAAACAGGAGTTCATGGTTAGCTCAGGATTAAGATCAAGTT
ACCTAGGTCATCGAATGAAAAAAAAATCAGTCTCAACAGTAAACGACATTATAAAGTGAAAATGATTTTCTTCATGGTCCGTTCTTATGCAATACTCATTGCATAGGACGCCCCCACTCACATGTTTCCACATGTACAATTTTAGTGATCACATTGTTCATATCATATACAAAAGTGGGCCGCATCCATAGTGTCCCCAGAATAAGGTACTCAGCCTTATTCTTATACTATAGATCATTTTGACTATATACTTGAACTTGATCAACTCTTATGTCTCTGCATATGGTTCAAGTAATCATATTATAGCCAGAGTGTTCTTAGTTTATTGGATTTAGATTAATGATCGTAAAGTTCACTTTATTCAATAACAATCTTTACTGAATAAACAACAATAATAACTTTATTGAAAAATAGAATATGTTTTTATTTACAAACTCTGAGTTTTAGGACATAAAACCCAACAAAATTAACTAGCATGTTCATAAGTATTAAAACTCAAGGAGTTTGAGCACTAACCTTTAGTAGTTTAAATTCTTTTTTTTCATGAATCGGTTTTGAACCACCACTAGTGTCTTCCCTACTATTCTTCGGCCTTAAAACGGGATTGTGGAATCTGACTAAACTTGGAAGGGATCTAGTATATGTGAAGAGAGAATTTGTGAGGGACTTTCTCAAGCTTTTTTCAGGAAAAATTTTCATGTACCTTGATCACTCTATTTATAGAGTCATATTTGCATGTTCATACAAAATTGAGATTACTATATTTTGACACTCAAAATTCCACTCAAATTTGTGGGGTTATGTGACAAACATGCCAGTTTGGTTAACCATATATTGACACTCAATATTCCACTAACTCAAACATGAATTTTTTCATTAGGTGAGGTCAACATTTTGACTTTCTTTATTTTAATTCAACCATTAATTTCAAATCAAAATAATATTAAAATGCCAATTTGAATGCACATATTGATTATGAATTCCTATTCATAATTAAAATGTTTAAATCTTATTTAAATATCTCAAATTCTCTCTGTATGTTTAATTCATAACTAAACAAAAATGCGGTTCGTATATATTTAGCGAATATTTCCTAATTTGAATTCGAACACTTCGAACTCACTTGTCACACTATTCTAAGGTTTAGTCCAATATGAGCTAGCAGGGGGACCTAATGGACCTATAGATCATGGTCTTCAACAATTCGAGATTAACCGGCTAAACTCATTAACCTTGTTAACCAACATTCGTTAACTACTAGGACACTCCACTATAGCTTAGTAGTTGCACTCCCCTCACTATTTTTGTCCATTTGATATAACCATGATTAGTAAATCGTTCCTTCACAGGTTGTTCGTAACTAGAGCTGGGTCAATTTACCATTTTACCACTAAAGTTACTTCTTGTTTCTTGAGTCCCACTGATCCTCTAATGAACAATTGGTTTGTGGTCTAACCACCAAACTTGGAGTCAGTACTTAAGGGAACAACCTCGCTACTATCCCTAAAAGCGGGTAGAAGTGAATTCCGTCATACACTCTATGTCCCCAGCTATTCACTCAGTCTTACCCCTGAAATGGGAGGCTTATTGAGTCGGCAAACTCGGGCCACTCTCACCCATGCAAATCTAAAGATAATTCCGAATAAACATGAGTTCATAGTTAGCTTAGGATTAAGATCGAGTTACCTAGGTCATCGAATGAAAAAATCAGTCTCAATAGTAAACGACATTATAAAGTGAGAGTGGCTTTCTTCATGGTCCGTTCTTATGCAATACTCATTGCATAGGACGCCCCCACTTACATGTCTCCACATGTACAATTTAGTGATCACATTGTTTATATCACATACAAAAGTGGGCCGCATCCATAGTGTCTCTAGAATAAGGTACTCAACCTTATCCCTATTCTATAGATCATTTTGACTATATACTTGAAATTGATCTACTCTTATGTCTCTACATATAGTTCAAGTAATCATACTATAGCTAGAGTGTTCTTA
GTTTATTGGATTTAGATTAATGAACGTAACATTCACTTTATTCAATAACAATCTTTACTAAATAAACAACAATAATAACTTTATTGAAAACAGAATATGCTTTTGTTTACCAACTATGAGTTTTAGAACATAAAACTCAACAGGGCTGTCTTCCATTCCCTTTTTACATGGCCTTAAGCCATGGCTTTATCCGAGCAGAGTGGATGCGAGGTCCTTCTTCATGCACGACCCACGTACGTCAACAGGGCCCACCAGGTGCGTTCGTACACAAATCTCCTGAGCCTGGCTTGGTCGGGTACACATGACATACGTTACTGGACGTCAAGGAAAAGGAGGGGGGCTACATGATATCATAGCGAGCATAATATCGAGTGTACGCATTCGTACCATTATTACATAGCATGGCGGTAACCCTATGCATAAGAATCAACATGCATGGGGTCCCTAACGCCTACCCCCATGGCATTACACATACATCGTTTCATTTCCCTCGTCGCGTTACATAGCATGACTTAATGGAACATTATATGAGTGTATTGTAGTACACCAGTCAACACATCCATGGCACGTAAAACTCCCAACACATTCATGACGTGTCAATCAATTTCATGCATACAGGGTAAAACAATTAAGGGCAACAACATCACATATATGAGCATGACATACATAACATTTCACCTAACAACAACATCTCATACATGAGCATGACATACATAACATTTCACCACAAGAAGATACTTAGCATAACATACATGGTATACATTGACATTTCACACCAAGATAAACAC

mRNA sequence

ATGGATAACGACGCCTTCAACGTCTCTTCCTTCTCTCCCATGGCTCCTCTTTCCTCTGATTCCATCAACCCCCGAGATGTTTGTATTGTGGGTGTTGCTCGTACGCCAATTGGTGCCTTTCTTGGTTCACTTTCATCTTTCTCTGCTACCCAACTCGGTGCCCTTAAGAGGGCAAATGTTGATCCTTCTCTTGTGCAAGAGGTATTCTTCGGAAATGTTCTCAGTGCAAATTTAGGGCAAGCTCCTGCGAGGCAGGCTGCCTTAGGTGCTGGTATACCAAACTCTGTTATTTGCACCACTATTAATAAAGTCTGTGCATCCGGCATGAAAGCAACAATGATTGCAGCGCACACGATTCAGTTAGGTATAAATGATGTTGTTGTTTCTGGTGGTATGGAAAGTATGTCTAATGTGCCCAAATATCTCCAACTAAGAAAAGGGTCTCGATTTGGAAATGATACTGTTGTAGATGGTATGCTTAAAGATGGTCTTTGGGATGTTTACAACGACTTTGCGATGGGTGTTTGTGGTGAGATGTGTGCCAGCCAGCACTCCATTACAAGAGAAGATCAGAGCTTTGAGCGAGGACTTGCAGCACAAAATAATGGTCAATTATCCTGGGAAATAGCTCCGGTCAAGGTTCCTTCCGGAAGAGGGAAGCCATCTTCAATTTTTGACAAGGACGAAAGCTTGAGACAGTTTGATGCTGCAAAACTAAGGAAGCTCAGGCCAAGTTTTAAGGAGGATGGCGGTACTGTTACTGCTGGCAATGCTTCTATCATAAGTGATGGTGCTGCTGCTTTAGTGCTGGTGAGTGGGAAGAAGGCGGTCGAACTCGGTTTAGAAATAATTGCCGTAATTAAAGGATACGCTGATGCAGCTCAGGTAATTAAGTTCTTGATGTTATTATCCTTCATACTGGAGAAAGATCATGTGGCTTACAGAAAAAATGACTACCAGGCGCCTGAATTATTTACAACGGCTCCAGCCCTTTCAATTCCAAAAGCTATCTCAAATGCTGGTTTACATGCTTCTCAAATTGATTATTATGAAATAAATGAAGCCTTTTCCGTTGTGGCTCTTGCTAATCAGAAGCTGCTTGGTCTTGATCCTGATAGAGTTAATGTTCACGGGGGAGCTGTATCTTTGGGACACCCATTGGGATGTAGCGGAGCTCGGATTTTGGTCACATTGTTAGGGAGAAATGGGAAGTATGGGGTTGGTGCTGTGTGCAATGGTGGTGGAGGAGCATCCGCTCTTGTGGTTGAACTCATGCCTGGCGCTAGAGTTCGACACTCCAAGCTGTGATCCAAATTGGCCCTCTCAAAAGACATATGGCAGTTCACTATTTCTCCCCTTCTACACTTGAAGAAAGAGGTACGCATGACATACGCTATCGGACGACATGAAAAAGGAGGAGGCCTGCATGATATCATAGCGAGCATAATATCGAGTGTACGCGTCCGTACCATCATTACATAACATGGCGGTAACCCTATGCATAAGACTCAACATGCATGGGGTCACTAACACCTACCCCCATGGCATTACACATACATTGTTTCATTTCCCTCGTTGCGTTACATAGCGTGACTTAATGGAACATTATATGAGTGTATTGTAGTACACCAGTCAACACATCCATGACACGTAAAATTCCCAACACATTCATGACATGCCAATCAATTTCATGCATACAGGGTAAAACAATTAAGGGCAACAACATCACATATATGAGCATGACATACATAACATTTCACCACAAGAAGATACTTAGCATAACATACATGGTATACATTGACATTTCACACCAAGATAAACAC

Coding sequence (CDS)

ATGGATAACGACGCCTTCAACGTCTCTTCCTTCTCTCCCATGGCTCCTCTTTCCTCTGATTCCATCAACCCCCGAGATGTTTGTATTGTGGGTGTTGCTCGTACGCCAATTGGTGCCTTTCTTGGTTCACTTTCATCTTTCTCTGCTACCCAACTCGGTGCCCTTAAGAGGGCAAATGTTGATCCTTCTCTTGTGCAAGAGGTATTCTTCGGAAATGTTCTCAGTGCAAATTTAGGGCAAGCTCCTGCGAGGCAGGCTGCCTTAGGTGCTGGTATACCAAACTCTGTTATTTGCACCACTATTAATAAAGTCTGTGCATCCGGCATGAAAGCAACAATGATTGCAGCGCACACGATTCAGTTAGGTATAAATGATGTTGTTGTTTCTGGTGGTATGGAAAGTATGTCTAATGTGCCCAAATATCTCCAACTAAGAAAAGGGTCTCGATTTGGAAATGATACTGTTGTAGATGGTATGCTTAAAGATGGTCTTTGGGATGTTTACAACGACTTTGCGATGGGTGTTTGTGGTGAGATGTGTGCCAGCCAGCACTCCATTACAAGAGAAGATCAGAGCTTTGAGCGAGGACTTGCAGCACAAAATAATGGTCAATTATCCTGGGAAATAGCTCCGGTCAAGGTTCCTTCCGGAAGAGGGAAGCCATCTTCAATTTTTGACAAGGACGAAAGCTTGAGACAGTTTGATGCTGCAAAACTAAGGAAGCTCAGGCCAAGTTTTAAGGAGGATGGCGGTACTGTTACTGCTGGCAATGCTTCTATCATAAGTGATGGTGCTGCTGCTTTAGTGCTGGTGAGTGGGAAGAAGGCGGTCGAACTCGGTTTAGAAATAATTGCCGTAATTAAAGGATACGCTGATGCAGCTCAGGTAATTAAGTTCTTGATGTTATTATCCTTCATACTGGAGAAAGATCATGTGGCTTACAGAAAAAATGACTACCAGGCGCCTGAATTATTTACAACGGCTCCAGCCCTTTCAATTCCAAAAGCTATCTCAAATGCTGGTTTACATGCTTCTCAAATTGATTATTATGAAATAAATGAAGCCTTTTCCGTTGTGGCTCTTGCTAATCAGAAGCTGCTTGGTCTTGATCCTGATAGAGTTAATGTTCACGGGGGAGCTGTATCTTTGGGACACCCATTGGGATGTAGCGGAGCTCGGATTTTGGTCACATTGTTAGGGAGAAATGGGAAGTATGGGGTTGGTGCTGTGTGCAATGGTGGTGGAGGAGCATCCGCTCTTGTGGTTGAACTCATGCCTGGCGCTAGAGTTCGACACTCCAAGCTGTGA

Protein sequence

MDNDAFNVSSFSPMAPLSSDSINPRDVCIVGVARTPIGAFLGSLSSFSATQLGALKRANVDPSLVQEVFFGNVLSANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGINDVVVSGGMESMSNVPKYLQLRKGSRFGNDTVVDGMLKDGLWDVYNDFAMGVCGEMCASQHSITREDQSFERGLAAQNNGQLSWEIAPVKVPSGRGKPSSIFDKDESLRQFDAAKLRKLRPSFKEDGGTVTAGNASIISDGAAALVLVSGKKAVELGLEIIAVIKGYADAAQVIKFLMLLSFILEKDHVAYRKNDYQAPELFTTAPALSIPKAISNAGLHASQIDYYEINEAFSVVALANQKLLGLDPDRVNVHGGAVSLGHPLGCSGARILVTLLGRNGKYGVGAVCNGGGGASALVVELMPGARVRHSKL
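
The protein sequence appears to be the conceptual translation of the CDS listed above it. The sketch below shows how such a translation can be reproduced, assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in
# T, C, A, G order at each position and the first base varying slowest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, to_stop=True):
    """Translate a DNA coding sequence in frame 1.

    Codons containing characters other than A, C, G, T become 'X'.
    With to_stop=True, translation ends at the first stop codon and
    the stop itself is not emitted.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS shown in this record.
print(translate("ATGGATAACGACGCCTTCAACGTC"))  # MDNDAFNV
```

The printed peptide matches the first eight residues of the protein sequence in this record. Passing the full CDS string to `translate` allows the same comparison over the whole length.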
BLAST of CmaCh13G003170 vs. Swiss-Prot
Match: THIC2_ARATH (Probable acetyl-CoA acetyltransferase, cytosolic 2 OS=Arabidopsis thaliana GN=At5g47720 PE=2 SV=1)

HSP 1 Score: 590.5 bits (1521), Expect = 1.5e-167
Identity = 309/437 (70.71%), Positives = 355/437 (81.24%), Query Frame = 1

Query: 16  PLSSDSINPRDVCIVGVARTPIGAFLGSLSSFSATQLG------ALKRANVDPSLVQEVF 75
           P+S DS+ PRDVC+VGVARTPIG FLGSLSS +AT+LG      ALKRA+VDP+LV+EVF
Sbjct: 4   PVSDDSLQPRDVCVVGVARTPIGDFLGSLSSLTATRLGSIAIQAALKRAHVDPALVEEVF 63

Query: 76  FGNVLSANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGINDVVVS 135
           FGNVL+ANLGQAPARQAALGAGIP SVICTTINKVCA+GMK+ M+A+ +IQLG+ND+VV+
Sbjct: 64  FGNVLTANLGQAPARQAALGAGIPYSVICTTINKVCAAGMKSVMLASQSIQLGLNDIVVA 123

Query: 136 GGMESMSNVPKYL-QLRKGSRFGNDTVVDGMLKDGLWDVYNDFAMGVCGEMCASQHSITR 195
           GGMESMSNVPKYL   R+GSR G+DTVVDGM+KDGLWDVYNDF MGVCGE+CA Q+ ITR
Sbjct: 124 GGMESMSNVPKYLPDARRGSRLGHDTVVDGMMKDGLWDVYNDFGMGVCGEICADQYRITR 183

Query: 196 EDQ------SFERGLAAQNNGQLSWEIAPVKVPSGRGKPSSIFDKDESLRQFDAAKLRKL 255
           E+Q      SFERG+AAQN    +WEI PV+V +GRG+PS + DKDE L +FDAAKL+KL
Sbjct: 184 EEQDAYAIQSFERGIAAQNTQLFAWEIVPVEVSTGRGRPSVVIDKDEGLGKFDAAKLKKL 243

Query: 256 RPSFKEDGGTVTAGNASIISDGAAALVLVSGKKAVELGLEIIAVIKGYADAAQVIKFLML 315
           RPSFKEDGG+VTAGNAS ISDGAAALVLVSG+KA+ELGL +IA I+GYADAA        
Sbjct: 244 RPSFKEDGGSVTAGNASSISDGAAALVLVSGEKALELGLHVIAKIRGYADAA-------- 303

Query: 316 LSFILEKDHVAYRKNDYQAPELFTTAPALSIPKAISNAGLHASQIDYYEINEAFSVVALA 375
                            QAPELFTT PAL+IPKAI  AGL ASQ+DYYEINEAFSVVALA
Sbjct: 304 -----------------QAPELFTTTPALAIPKAIKRAGLDASQVDYYEINEAFSVVALA 363

Query: 376 NQKLLGLDPDRVNVHGGAVSLGHPLGCSGARILVTLLG----RNGKYGVGAVCNGGGGAS 435
           NQKLLGLDP+R+N HGGAVSLGHPLGCSGARILVTLLG    + GKYGV ++CNGGGGAS
Sbjct: 364 NQKLLGLDPERLNAHGGAVSLGHPLGCSGARILVTLLGVLRAKKGKYGVASICNGGGGAS 415
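
The percentages in each HSP summary line are the ratio of matching columns to aligned columns. The following Python sketch parses a summary line and recomputes them; the function name `parse_hsp_summary` is hypothetical, and the pattern deliberately accepts both "Positives" and the misspelling "Postives" that some report generators emit.

```python
import re

# Matches "Identity = a/b (...), Positives = c/d (...)", tolerating
# the "Postives" misspelling seen in some BLAST report layouts.
HSP_PATTERN = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def parse_hsp_summary(line):
    """Return recomputed identity and positive percentages for one HSP."""
    match = HSP_PATTERN.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = (int(g) for g in match.groups())
    return {
        "identity_pct": round(100 * ident / ident_len, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
        "aligned_columns": ident_len,
    }


summary = "Identity = 309/437 (70.71%), Positives = 355/437 (81.24%), Query Frame = 1"
print(parse_hsp_summary(summary))
```

For the THIC2_ARATH hit this reproduces the reported 70.71% identity and 81.24% positives over 437 aligned columns.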

BLAST of CmaCh13G003170 vs. Swiss-Prot
Match: THIC1_ARATH (Acetyl-CoA acetyltransferase, cytosolic 1 OS=Arabidopsis thaliana GN=AAT1 PE=2 SV=1)

HSP 1 Score: 582.4 bits (1500), Expect = 4.1e-165
Identity = 308/425 (72.47%), Positives = 348/425 (81.88%), Query Frame = 1

Query: 18  SSDSINPRDVCIVGVARTPIGAFLGSLSSFSATQLG------ALKRANVDPSLVQEVFFG 77
           +S+S+NPRDVCIVGVARTP+G FLGSLSS  AT+LG      ALKRANVDP+LVQEV FG
Sbjct: 4   TSESVNPRDVCIVGVARTPMGGFLGSLSSLPATKLGSLAIAAALKRANVDPALVQEVVFG 63

Query: 78  NVLSANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGINDVVVSGG 137
           NVLSANLGQAPARQAALGAGIPNSVICTT+NKVCASGMKA MIAA +IQLGINDVVV+GG
Sbjct: 64  NVLSANLGQAPARQAALGAGIPNSVICTTVNKVCASGMKAVMIAAQSIQLGINDVVVAGG 123

Query: 138 MESMSNVPKYL-QLRKGSRFGNDTVVDGMLKDGLWDVYNDFAMGVCGEMCASQHSITRED 197
           MESMSN PKYL + RKGSRFG+D++VDGMLKDGLWDVYND  MG C E+CA +  ITRE 
Sbjct: 124 MESMSNTPKYLAEARKGSRFGHDSLVDGMLKDGLWDVYNDCGMGSCAELCAEKFQITREQ 183

Query: 198 ------QSFERGLAAQNNGQLSWEIAPVKVPSGRGKPSSIFDKDESLRQFDAAKLRKLRP 257
                 QSFERG+AAQ  G  +WEI PV+V  GRG+PS+I DKDE L +FDAAKLRKLRP
Sbjct: 184 QDDYAVQSFERGIAAQEAGAFTWEIVPVEVSGGRGRPSTIVDKDEGLGKFDAAKLRKLRP 243

Query: 258 SFKEDGGTVTAGNASIISDGAAALVLVSGKKAVELGLEIIAVIKGYADAAQVIKFLMLLS 317
           SFKE+GGTVTAGNAS ISDGAAALVLVSG+KA++LGL ++A IKGY DAA          
Sbjct: 244 SFKENGGTVTAGNASSISDGAAALVLVSGEKALQLGLLVLAKIKGYGDAA---------- 303

Query: 318 FILEKDHVAYRKNDYQAPELFTTAPALSIPKAISNAGLHASQIDYYEINEAFSVVALANQ 377
                          Q PE FTTAPAL+IPKAI++AGL +SQ+DYYEINEAF+VVALANQ
Sbjct: 304 ---------------QEPEFFTTAPALAIPKAIAHAGLESSQVDYYEINEAFAVVALANQ 363

Query: 378 KLLGLDPDRVNVHGGAVSLGHPLGCSGARILVTLLG----RNGKYGVGAVCNGGGGASAL 426
           KLLG+ P++VNV+GGAVSLGHPLGCSGARIL+TLLG    RNGKYGVG VCNGGGGASAL
Sbjct: 364 KLLGIAPEKVNVNGGAVSLGHPLGCSGARILITLLGILKKRNGKYGVGGVCNGGGGASAL 403

BLAST of CmaCh13G003170 vs. Swiss-Prot
Match: THIL_SCHPO (Acetyl-CoA acetyltransferase OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=erg10 PE=2 SV=1)

HSP 1 Score: 391.3 bits (1004), Expect = 1.3e-107
Identity = 215/415 (51.81%), Positives = 282/415 (67.95%), Query Frame = 1

Query: 26  DVCIVGVARTPIGAFLGSLSSFSATQLG------ALKRANVDPSLVQEVFFGNVLSANLG 85
           +V IV   RTP+G+F GS +S  AT+LG      AL+R N+ PS V EVF GNV+SANLG
Sbjct: 5   EVYIVSAVRTPMGSFGGSFASLPATKLGSIAIKGALERVNIKPSDVDEVFMGNVVSANLG 64

Query: 86  QAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGINDVVVSGGMESMSNVP 145
           Q PARQ ALGAG+P S++CTT+NKVCASGMKAT++ A TI  G  ++VV+GG ESMSN P
Sbjct: 65  QNPARQCALGAGLPRSIVCTTVNKVCASGMKATILGAQTIMTGNAEIVVAGGTESMSNAP 124

Query: 146 KYL-QLRKGSRFGNDTVVDGMLKDGLWDVYNDFAMGVCGEMCASQHSITREDQ------S 205
            Y  + R G+++GN  +VDG+L+DGL D Y+   MG   E+CA +HSI R  Q      S
Sbjct: 125 YYAPKNRFGAKYGNVELVDGLLRDGLSDAYDGLPMGNAAELCAEEHSIDRASQDAFAISS 184

Query: 206 FERGLAAQNNGQLSWEIAPVKVPSGRGKPSSIFDKDESLRQFDAAKLRKLRPSFKEDGGT 265
           ++R   AQ       EI PV+VP GRGKP+ +  +DE  +  +  KL+ +R  FK + GT
Sbjct: 185 YKRAQNAQATKAFEQEIVPVEVPVGRGKPNKLVTEDEEPKNLNEDKLKSVRAVFKSN-GT 244

Query: 266 VTAGNASIISDGAAALVLVSGKKAVELGLEIIAVIKGYADAAQVIKFLMLLSFILEKDHV 325
           VTA NAS ++DGA+ALVL+S  K  ELGL+ +A I G+ +AA                  
Sbjct: 245 VTAANASTLNDGASALVLMSAAKVKELGLKPLAKIIGWGEAA------------------ 304

Query: 326 AYRKNDYQAPELFTTAPALSIPKAISNAGLHASQIDYYEINEAFSVVALANQKLLGLDPD 385
                  Q PE FTT+P+L+IPKA+ +AG+ ASQ+DYYEINEAFSVVA+AN K+LGLDP+
Sbjct: 305 -------QDPERFTTSPSLAIPKALKHAGIEASQVDYYEINEAFSVVAVANTKILGLDPE 364

Query: 386 RVNVHGGAVSLGHPLGCSGARILVT----LLGRNGKYGVGAVCNGGGGASALVVE 424
           RVN++GG V++GHPLG SG+RI+ T    L  ++ K GV AVCNGGGGAS++V+E
Sbjct: 365 RVNINGGGVAMGHPLGSSGSRIICTLAYILAQKDAKIGVAAVCNGGGGASSIVIE 393

BLAST of CmaCh13G003170 vs. Swiss-Prot
Match: THIA_CANTR (Acetyl-CoA acetyltransferase IA OS=Candida tropicalis GN=PACTA PE=1 SV=3)

HSP 1 Score: 364.8 bits (935), Expect = 1.3e-99
Identity = 201/415 (48.43%), Positives = 270/415 (65.06%), Query Frame = 1

Query: 27  VCIVGVARTPIGAFLGSLSSFSATQLGA-------LKRANVDPSLVQEVFFGNVLSANLG 86
           V IV  ARTPIG+F GSLSS + + LGA        K   + P  V E+ FG VL AN+G
Sbjct: 6   VYIVSTARTPIGSFQGSLSSLTYSDLGAHAVKAALAKVPQIKPQDVDEIVFGGVLQANVG 65

Query: 87  QAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGINDVVVSGGMESMSNVP 146
           QAPARQ AL AG+P+S++ +TINKVCASGMKA +I A  I  G +D+VV GG ESMSN P
Sbjct: 66  QAPARQVALKAGLPDSIVASTINKVCASGMKAVIIGAQNIICGTSDIVVVGGAESMSNTP 125

Query: 147 KYL-QLRKGSRFGNDTVVDGMLKDGLWDVYNDFAMGVCGEMCASQHSITREDQ------S 206
            YL   R G+R+G+  +VDG+ KDGL DVY +  MGV  E CA  H  +REDQ      S
Sbjct: 126 YYLPSARSGARYGDAIMVDGVQKDGLLDVYEEKLMGVAAEKCAKDHGFSREDQDNFAINS 185

Query: 207 FERGLAAQNNGQLSWEIAPVKVPSGRGKPSSIFDKDESLRQFDAAKLRKLRPSFKEDGGT 266
           +++   A + G+   EIAPV +   RGKP ++ + DE + +F+  +L+  R  F+++ GT
Sbjct: 186 YKKAGKALSEGKFKSEIAPVTIKGFRGKPDTVIENDEEIGKFNEERLKSARTVFQKENGT 245

Query: 267 VTAGNASIISDGAAALVLVSGKKAVELGLEIIAVIKGYADAAQVIKFLMLLSFILEKDHV 326
           VTA NAS ++DG AALVLVS  K  +LGL+ +A I G+ +AA                  
Sbjct: 246 VTAPNASKLNDGGAALVLVSEAKLKQLGLKPLAKISGWGEAA------------------ 305

Query: 327 AYRKNDYQAPELFTTAPALSIPKAISNAGLHASQIDYYEINEAFSVVALANQKLLGLDPD 386
                  + P  FT APAL++PKA+ +AGL   ++D++E+NEAFSVV LAN +L+ +  +
Sbjct: 306 -------RTPFDFTIAPALAVPKAVKHAGLTVDRVDFFELNEAFSVVGLANAELVNIPLE 365

Query: 387 RVNVHGGAVSLGHPLGCSGARILVTLLG----RNGKYGVGAVCNGGGGASALVVE 424
           ++NV+GGAV++GHPLGCSGARI+VTLL       G++GV  VCNGGGGASA+V+E
Sbjct: 366 KLNVYGGAVAMGHPLGCSGARIIVTLLSVLTQEGGRFGVAGVCNGGGGASAVVIE 395

BLAST of CmaCh13G003170 vs. Swiss-Prot
Match: THIB_CANTR (Acetyl-CoA acetyltransferase IB OS=Candida tropicalis GN=PACTB PE=1 SV=3)

HSP 1 Score: 363.2 bits (931), Expect = 3.9e-99
Identity = 201/415 (48.43%), Positives = 269/415 (64.82%), Query Frame = 1

Query: 27  VCIVGVARTPIGAFLGSLSSFSATQLGA-------LKRANVDPSLVQEVFFGNVLSANLG 86
           V IV  ARTPIG+F GSLSS + + LGA        K   + P  V E+ FG VL AN+G
Sbjct: 6   VYIVSTARTPIGSFQGSLSSLTYSDLGAHAVKAALAKVPQIKPQDVDEIVFGGVLQANVG 65

Query: 87  QAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGINDVVVSGGMESMSNVP 146
           QAPARQ AL AG+P+S+I +TINKVCASGMKA +I A  I  G +D+VV GG ESMSN P
Sbjct: 66  QAPARQVALKAGLPDSIIASTINKVCASGMKAVIIGAQNIICGTSDIVVVGGAESMSNTP 125

Query: 147 KYL-QLRKGSRFGNDTVVDGMLKDGLWDVYNDFAMGVCGEMCASQHSITREDQ------S 206
            YL   R G+R+G+  +VDG+ KDGL DVY +  MGV  E CA  H  +REDQ      S
Sbjct: 126 YYLPSARSGARYGDAVMVDGVQKDGLLDVYEEKLMGVAAEKCAKDHGFSREDQDNFAINS 185

Query: 207 FERGLAAQNNGQLSWEIAPVKVPSGRGKPSSIFDKDESLRQFDAAKLRKLRPSFKEDGGT 266
           +++   A + G+   EIAPV +   RGKP ++ + DE + +F+  +L+  R  F+++ GT
Sbjct: 186 YKKAGKALSEGKFKSEIAPVTIKGFRGKPDTVIENDEEIGKFNEDRLKSARTVFQKENGT 245

Query: 267 VTAGNASIISDGAAALVLVSGKKAVELGLEIIAVIKGYADAAQVIKFLMLLSFILEKDHV 326
           VTA NAS ++DG AALVLVS  K  +LGL+ +A I G+ +AA                  
Sbjct: 246 VTAPNASKLNDGGAALVLVSEAKLKQLGLKPLAKISGWGEAA------------------ 305

Query: 327 AYRKNDYQAPELFTTAPALSIPKAISNAGLHASQIDYYEINEAFSVVALANQKLLGLDPD 386
                  + P  FT APAL++PKA+ +AGL   ++D++E+NEAFSVV LAN +L+ +  +
Sbjct: 306 -------RTPFDFTIAPALAVPKAVKHAGLTVDRVDFFELNEAFSVVGLANAELVKIPLE 365

Query: 387 RVNVHGGAVSLGHPLGCSGARILVTLLG----RNGKYGVGAVCNGGGGASALVVE 424
           ++NV+GGAV++GHPLGCSGARI+VTLL       G++G   VCNGGGGASA+V+E
Sbjct: 366 KLNVYGGAVAMGHPLGCSGARIIVTLLSVLTQEGGRFGAAGVCNGGGGASAIVIE 395

BLAST of CmaCh13G003170 vs. TrEMBL
Match: A0A0A0LTB9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G173130 PE=3 SV=1)

HSP 1 Score: 690.6 bits (1781), Expect = 1.2e-195
Identity = 370/439 (84.28%), Positives = 383/439 (87.24%), Query Frame = 1

Query: 14  MAPLSSDSINPRDVCIVGVARTPIGAFLGSLSSFSATQLG------ALKRANVDPSLVQE 73
           MAPLSSDSINPRDVCIVGVARTP+G FLGSLSSFSATQLG      ALKRANVDPSLVQE
Sbjct: 1   MAPLSSDSINPRDVCIVGVARTPMGGFLGSLSSFSATQLGSIAIECALKRANVDPSLVQE 60

Query: 74  VFFGNVLSANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGINDVV 133
           VFFGNVLSANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGINDVV
Sbjct: 61  VFFGNVLSANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGINDVV 120

Query: 134 VSGGMESMSNVPKYLQ-LRKGSRFGNDTVVDGMLKDGLWDVYNDFAMGVCGEMCASQHSI 193
           VSGGMESMSN PKYLQ +RKGSRFGND VVDGMLKDGLWD YNDF MG C E+CASQ+SI
Sbjct: 121 VSGGMESMSNTPKYLQEVRKGSRFGNDAVVDGMLKDGLWDAYNDFPMGACAEICASQYSI 180

Query: 194 TREDQ------SFERGLAAQNNGQLSWEIAPVKVPSGRGKPSSIFDKDESLRQFDAAKLR 253
           TRE+Q      SFERGLAAQNNG LSWEIAPVKVPS RGKPSS FDKDESLRQFDAAKL+
Sbjct: 181 TREEQDAYAIKSFERGLAAQNNGSLSWEIAPVKVPSVRGKPSSTFDKDESLRQFDAAKLK 240

Query: 254 KLRPSFKEDGGTVTAGNASIISDGAAALVLVSGKKAVELGLEIIAVIKGYADAAQVIKFL 313
           KLRPSFK+DGGTVTAGNASIISDGAAALVLVSGKKA+ELGLE+IAVIKGYADAAQ     
Sbjct: 241 KLRPSFKKDGGTVTAGNASIISDGAAALVLVSGKKALELGLEVIAVIKGYADAAQ----- 300

Query: 314 MLLSFILEKDHVAYRKNDYQAPELFTTAPALSIPKAISNAGLHASQIDYYEINEAFSVVA 373
                               APELFTT PAL+IPKAISNA LH SQIDYYEINEAFSVVA
Sbjct: 301 --------------------APELFTTTPALAIPKAISNACLHHSQIDYYEINEAFSVVA 360

Query: 374 LANQKLLGLDPDRVNVHGGAVSLGHPLGCSGARILVTLLG----RNGKYGVGAVCNGGGG 433
           LANQK+LGLDPDRVN HGGAVSLGHPLGCSGARILVTLLG    +NGKYGVGAVCNGGGG
Sbjct: 361 LANQKILGLDPDRVNAHGGAVSLGHPLGCSGARILVTLLGVLRQKNGKYGVGAVCNGGGG 414

Query: 434 ASALVVELMPGARVRHSKL 436
           ASALVVELMPGARVR+SKL
Sbjct: 421 ASALVVELMPGARVRNSKL 414

BLAST of CmaCh13G003170 vs. TrEMBL
Match: B9HC23_POPTR (Truncated acetyl Co-A acetyltransferase-like family protein OS=Populus trichocarpa GN=POPTR_0006s00630g PE=3 SV=1)

HSP 1 Score: 622.5 bits (1604), Expect = 4.0e-175
Identity = 329/435 (75.63%), Positives = 364/435 (83.68%), Query Frame = 1

Query: 18  SSDSINPRDVCIVGVARTPIGAFLGSLSSFSATQLG------ALKRANVDPSLVQEVFFG 77
           SSDSI PRDVCIVGVARTP+G FLGSLSSFSAT+LG      AL+RAN+DPSLVQEVFFG
Sbjct: 3   SSDSIKPRDVCIVGVARTPMGGFLGSLSSFSATKLGSIAIQCALQRANIDPSLVQEVFFG 62

Query: 78  NVLSANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGINDVVVSGG 137
           NVLSANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATM+AA TIQLGINDVVV+GG
Sbjct: 63  NVLSANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMLAAQTIQLGINDVVVAGG 122

Query: 138 MESMSNVPKYL-QLRKGSRFGNDTVVDGMLKDGLWDVYNDFAMGVCGEMCASQHSITRED 197
           MESMSN PKYL   RKGSR G+DT+VDGM+KDGLWD+YNDF MGVC E+CA QHSITR+D
Sbjct: 123 MESMSNAPKYLADARKGSRLGHDTIVDGMMKDGLWDIYNDFGMGVCAEICADQHSITRDD 182

Query: 198 Q------SFERGLAAQNNGQLSWEIAPVKVPSGRGKPSSIFDKDESLRQFDAAKLRKLRP 257
           Q      SFERG+AAQN+G LSWE+ PV+V  GRGKP +I DKD+ L +FDAAKLRKLRP
Sbjct: 183 QDSYAIQSFERGIAAQNSGHLSWEVVPVEVSGGRGKPFTIVDKDDGLGKFDAAKLRKLRP 242

Query: 258 SFKEDGGTVTAGNASIISDGAAALVLVSGKKAVELGLEIIAVIKGYADAAQVIKFLMLLS 317
           SFKE+GG+VTAGNAS ISDGAAALVL+SG+KA++LGL++IA I+GYADAAQ         
Sbjct: 243 SFKENGGSVTAGNASSISDGAAALVLMSGEKALKLGLQVIAKIRGYADAAQ--------- 302

Query: 318 FILEKDHVAYRKNDYQAPELFTTAPALSIPKAISNAGLHASQIDYYEINEAFSVVALANQ 377
                           APELFTTAPAL+IPKAISNAGL ASQID+YEINEAFSVVALANQ
Sbjct: 303 ----------------APELFTTAPALAIPKAISNAGLEASQIDFYEINEAFSVVALANQ 362

Query: 378 KLLGLDPDRVNVHGGAVSLGHPLGCSGARILVTLLG----RNGKYGVGAVCNGGGGASAL 436
           KLLGL+P +VN HGGAVSLGHPLGCSGARILVTLLG    +NGKYGVG +CNGGGGASAL
Sbjct: 363 KLLGLNPQKVNAHGGAVSLGHPLGCSGARILVTLLGVLKHKNGKYGVGGICNGGGGASAL 412

BLAST of CmaCh13G003170 vs. TrEMBL
Match: A0A058ZX35_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_L00148 PE=3 SV=1)

HSP 1 Score: 617.1 bits (1590), Expect = 1.7e-173
Identity = 324/435 (74.48%), Positives = 361/435 (82.99%), Query Frame = 1

Query: 18  SSDSINPRDVCIVGVARTPIGAFLGSLSSFSATQLG------ALKRANVDPSLVQEVFFG 77
           +S+SI P+DVCIVGVARTP+G  LGSLSSFSATQLG      ALKRANVDP+LVQEVFFG
Sbjct: 4   ASESIRPQDVCIVGVARTPMGGLLGSLSSFSATQLGSIAIQCALKRANVDPALVQEVFFG 63

Query: 78  NVLSANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGINDVVVSGG 137
           NVLSANLGQAPARQAALGAGIPNSVICTT+NKVCA+GMKATM+AAHTIQLGIND VV+GG
Sbjct: 64  NVLSANLGQAPARQAALGAGIPNSVICTTVNKVCAAGMKATMLAAHTIQLGINDFVVAGG 123

Query: 138 MESMSNVPKYL-QLRKGSRFGNDTVVDGMLKDGLWDVYNDFAMGVCGEMCASQHSITRED 197
           MESMSN PKYL + RKGSR G+D +VDGMLKDGLWDVYNDF MGVC E+CA  HSIT+E+
Sbjct: 124 MESMSNAPKYLAEARKGSRLGHDIIVDGMLKDGLWDVYNDFGMGVCAEICAENHSITKEE 183

Query: 198 Q------SFERGLAAQNNGQLSWEIAPVKVPSGRGKPSSIFDKDESLRQFDAAKLRKLRP 257
           Q      S+ERG+AAQN+G  +WEI PV+V  GRGKPS+I DKDE L +FD+AKL+KLRP
Sbjct: 184 QDSYAIQSYERGIAAQNSGLFAWEIVPVEVSGGRGKPSTIVDKDEGLGKFDSAKLKKLRP 243

Query: 258 SFKEDGGTVTAGNASIISDGAAALVLVSGKKAVELGLEIIAVIKGYADAAQVIKFLMLLS 317
           SFKE+GG+VTAGNAS+ISDGAAALVLVSGKK +ELGL +IA IKGYADAAQ         
Sbjct: 244 SFKENGGSVTAGNASLISDGAAALVLVSGKKVLELGLRVIAKIKGYADAAQ--------- 303

Query: 318 FILEKDHVAYRKNDYQAPELFTTAPALSIPKAISNAGLHASQIDYYEINEAFSVVALANQ 377
                           APELFTTAPAL+IPKAISNAGL ASQIDYYEINEAFSVV LANQ
Sbjct: 304 ----------------APELFTTAPALAIPKAISNAGLDASQIDYYEINEAFSVVVLANQ 363

Query: 378 KLLGLDPDRVNVHGGAVSLGHPLGCSGARILVTLLG----RNGKYGVGAVCNGGGGASAL 436
           KLL L P+RVNVHGGAVSLGHPLGCSGARILVTLLG    +NG+YGVG +CNGGGGASAL
Sbjct: 364 KLLALKPERVNVHGGAVSLGHPLGCSGARILVTLLGVLRQKNGRYGVGGICNGGGGASAL 413

BLAST of CmaCh13G003170 vs. TrEMBL
Match: A0A061GPF0_THECC (Thiolase family protein isoform 1 OS=Theobroma cacao GN=TCM_030451 PE=3 SV=1)

HSP 1 Score: 613.2 bits (1580), Expect = 2.4e-172
Identity = 325/436 (74.54%), Positives = 363/436 (83.26%), Query Frame = 1

Query: 18  SSDS-INPRDVCIVGVARTPIGAFLGSLSSFSATQLG------ALKRANVDPSLVQEVFF 77
           SSDS I PRDVCIVGVARTP+GAFLGSLSSFSATQLG      ALKRAN+DPSLVQEVFF
Sbjct: 12  SSDSVIRPRDVCIVGVARTPMGAFLGSLSSFSATQLGSIAIHSALKRANLDPSLVQEVFF 71

Query: 78  GNVLSANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGINDVVVSG 137
           GNVLSANLGQAPARQAALGAGIPNS+ICTT+NKVCASGMKA M+A+ TIQLGINDVV++G
Sbjct: 72  GNVLSANLGQAPARQAALGAGIPNSIICTTVNKVCASGMKAVMLASQTIQLGINDVVIAG 131

Query: 138 GMESMSNVPKYL-QLRKGSRFGNDTVVDGMLKDGLWDVYNDFAMGVCGEMCASQHSITRE 197
           GMESMSN PKYL + RKGSR G+DT++DGMLKDGLWDVYNDF MGVC E+CA QH+ITRE
Sbjct: 132 GMESMSNAPKYLAEARKGSRLGHDTIIDGMLKDGLWDVYNDFGMGVCAEICADQHNITRE 191

Query: 198 DQ------SFERGLAAQNNGQLSWEIAPVKVPSGRGKPSSIFDKDESLRQFDAAKLRKLR 257
           +Q      SFERG+AAQNNG L+WEI PV+V   RGKP +I D+DE L +FDAAKLRKLR
Sbjct: 192 EQDSYAIQSFERGIAAQNNGLLAWEIVPVEVSGRRGKPFTIIDRDEGLGKFDAAKLRKLR 251

Query: 258 PSFKEDGGTVTAGNASIISDGAAALVLVSGKKAVELGLEIIAVIKGYADAAQVIKFLMLL 317
           PSFKE+GG+VTAGNAS ISDGAAA+VLVSG+KA +LGL+++A I+GYADAAQ        
Sbjct: 252 PSFKEEGGSVTAGNASSISDGAAAIVLVSGEKATKLGLQVVAKIRGYADAAQ-------- 311

Query: 318 SFILEKDHVAYRKNDYQAPELFTTAPALSIPKAISNAGLHASQIDYYEINEAFSVVALAN 377
                            APELFTTAPAL+IPKAIS AGL ASQIDYYEINEAFSVVALAN
Sbjct: 312 -----------------APELFTTAPALAIPKAISAAGLEASQIDYYEINEAFSVVALAN 371

Query: 378 QKLLGLDPDRVNVHGGAVSLGHPLGCSGARILVTLLG----RNGKYGVGAVCNGGGGASA 436
           QKLLGL+P++VNVHGGAVSLGHPLGCSGARILVTLLG    +NGK+GVG +CNGGGGASA
Sbjct: 372 QKLLGLNPEKVNVHGGAVSLGHPLGCSGARILVTLLGVMRQKNGKFGVGGICNGGGGASA 422

BLAST of CmaCh13G003170 vs. TrEMBL
Match: A0A0B0PUC9_GOSAR (Acetyl-CoA acetyltransferase, cytosolic 1-like protein OS=Gossypium arboreum GN=F383_12929 PE=3 SV=1)

HSP 1 Score: 609.4 bits (1570), Expect = 3.5e-171
Identity = 323/431 (74.94%), Positives = 358/431 (83.06%), Query Frame = 1

Query: 12  SPMAPLSSDSINPRDVCIVGVARTPIGAFLGSLSSFSATQLG------ALKRANVDPSLV 71
           +P+A  SSDSI PRDVC+VGVARTP+G FLGSLSS SAT+LG      ALKRANVDPSLV
Sbjct: 2   APVAATSSDSIKPRDVCVVGVARTPMGGFLGSLSSLSATKLGSIAIEAALKRANVDPSLV 61

Query: 72  QEVFFGNVLSANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGIND 131
           QEVFFGNVLSANLGQAPARQAALGAGIPNSVICTT+NKVCASGMKATM+AA +IQLGIND
Sbjct: 62  QEVFFGNVLSANLGQAPARQAALGAGIPNSVICTTVNKVCASGMKATMLAAQSIQLGIND 121

Query: 132 VVVSGGMESMSNVPKYL-QLRKGSRFGNDTVVDGMLKDGLWDVYNDFAMGVCGEMCASQH 191
           VVV+GGMESMSNVPKYL + RKGSR G+DT+VDGMLKDGLWDVY D  MG C E+CA +H
Sbjct: 122 VVVAGGMESMSNVPKYLGEARKGSRLGHDTLVDGMLKDGLWDVYGDCGMGSCAELCAEKH 181

Query: 192 SITREDQ------SFERGLAAQNNGQLSWEIAPVKVPSGRGKPSSIFDKDESLRQFDAAK 251
            ITRE+Q      SFERG+AAQ  G  +WEI PV+VP GRGKPS I DKDE L +FDAAK
Sbjct: 182 VITREEQDNFAVQSFERGIAAQQGGAFAWEIVPVEVPGGRGKPSIIVDKDEGLGKFDAAK 241

Query: 252 LRKLRPSFKEDGGTVTAGNASIISDGAAALVLVSGKKAVELGLEIIAVIKGYADAAQVIK 311
           LRKLRPSFK++GGTVTAGNAS ISDGAAAL+LVSG+KA+ELGL++IA I GYADAAQ   
Sbjct: 242 LRKLRPSFKDNGGTVTAGNASSISDGAAALILVSGEKALELGLQVIAKIAGYADAAQ--- 301

Query: 312 FLMLLSFILEKDHVAYRKNDYQAPELFTTAPALSIPKAISNAGLHASQIDYYEINEAFSV 371
                                 APE FTTAPAL+IPKAISNAGL ASQ+DYYEINEAF+V
Sbjct: 302 ----------------------APEFFTTAPALAIPKAISNAGLDASQVDYYEINEAFAV 361

Query: 372 VALANQKLLGLDPDRVNVHGGAVSLGHPLGCSGARILVTLLG----RNGKYGVGAVCNGG 426
           VALANQKLLGL+P++VNV+GGAVSLGHPLGCSGARILVTLLG    +NGKYGVG VCNGG
Sbjct: 362 VALANQKLLGLNPEKVNVNGGAVSLGHPLGCSGARILVTLLGVLKQKNGKYGVGGVCNGG 407

BLAST of CmaCh13G003170 vs. TAIR10
Match: AT5G47720.2 (AT5G47720.2 Thiolase family protein)

HSP 1 Score: 590.5 bits (1521), Expect = 8.4e-169
Identity = 309/437 (70.71%), Positives = 355/437 (81.24%), Query Frame = 1

Query: 16  PLSSDSINPRDVCIVGVARTPIGAFLGSLSSFSATQLG------ALKRANVDPSLVQEVF 75
           P+S DS+ PRDVC+VGVARTPIG FLGSLSS +AT+LG      ALKRA+VDP+LV+EVF
Sbjct: 4   PVSDDSLQPRDVCVVGVARTPIGDFLGSLSSLTATRLGSIAIQAALKRAHVDPALVEEVF 63

Query: 76  FGNVLSANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGINDVVVS 135
           FGNVL+ANLGQAPARQAALGAGIP SVICTTINKVCA+GMK+ M+A+ +IQLG+ND+VV+
Sbjct: 64  FGNVLTANLGQAPARQAALGAGIPYSVICTTINKVCAAGMKSVMLASQSIQLGLNDIVVA 123

Query: 136 GGMESMSNVPKYL-QLRKGSRFGNDTVVDGMLKDGLWDVYNDFAMGVCGEMCASQHSITR 195
           GGMESMSNVPKYL   R+GSR G+DTVVDGM+KDGLWDVYNDF MGVCGE+CA Q+ ITR
Sbjct: 124 GGMESMSNVPKYLPDARRGSRLGHDTVVDGMMKDGLWDVYNDFGMGVCGEICADQYRITR 183

Query: 196 EDQ------SFERGLAAQNNGQLSWEIAPVKVPSGRGKPSSIFDKDESLRQFDAAKLRKL 255
           E+Q      SFERG+AAQN    +WEI PV+V +GRG+PS + DKDE L +FDAAKL+KL
Sbjct: 184 EEQDAYAIQSFERGIAAQNTQLFAWEIVPVEVSTGRGRPSVVIDKDEGLGKFDAAKLKKL 243

Query: 256 RPSFKEDGGTVTAGNASIISDGAAALVLVSGKKAVELGLEIIAVIKGYADAAQVIKFLML 315
           RPSFKEDGG+VTAGNAS ISDGAAALVLVSG+KA+ELGL +IA I+GYADAA        
Sbjct: 244 RPSFKEDGGSVTAGNASSISDGAAALVLVSGEKALELGLHVIAKIRGYADAA-------- 303

Query: 316 LSFILEKDHVAYRKNDYQAPELFTTAPALSIPKAISNAGLHASQIDYYEINEAFSVVALA 375
                            QAPELFTT PAL+IPKAI  AGL ASQ+DYYEINEAFSVVALA
Sbjct: 304 -----------------QAPELFTTTPALAIPKAIKRAGLDASQVDYYEINEAFSVVALA 363

Query: 376 NQKLLGLDPDRVNVHGGAVSLGHPLGCSGARILVTLLG----RNGKYGVGAVCNGGGGAS 435
           NQKLLGLDP+R+N HGGAVSLGHPLGCSGARILVTLLG    + GKYGV ++CNGGGGAS
Sbjct: 364 NQKLLGLDPERLNAHGGAVSLGHPLGCSGARILVTLLGVLRAKKGKYGVASICNGGGGAS 415

BLAST of CmaCh13G003170 vs. TAIR10
Match: AT5G48230.2 (AT5G48230.2 acetoacetyl-CoA thiolase 2)

HSP 1 Score: 582.4 bits (1500), Expect = 2.3e-166
Identity = 308/425 (72.47%), Positives = 348/425 (81.88%), Query Frame = 1

Query: 18  SSDSINPRDVCIVGVARTPIGAFLGSLSSFSATQLG------ALKRANVDPSLVQEVFFG 77
           +S+S+NPRDVCIVGVARTP+G FLGSLSS  AT+LG      ALKRANVDP+LVQEV FG
Sbjct: 4   TSESVNPRDVCIVGVARTPMGGFLGSLSSLPATKLGSLAIAAALKRANVDPALVQEVVFG 63

Query: 78  NVLSANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGINDVVVSGG 137
           NVLSANLGQAPARQAALGAGIPNSVICTT+NKVCASGMKA MIAA +IQLGINDVVV+GG
Sbjct: 64  NVLSANLGQAPARQAALGAGIPNSVICTTVNKVCASGMKAVMIAAQSIQLGINDVVVAGG 123

Query: 138 MESMSNVPKYL-QLRKGSRFGNDTVVDGMLKDGLWDVYNDFAMGVCGEMCASQHSITRED 197
           MESMSN PKYL + RKGSRFG+D++VDGMLKDGLWDVYND  MG C E+CA +  ITRE 
Sbjct: 124 MESMSNTPKYLAEARKGSRFGHDSLVDGMLKDGLWDVYNDCGMGSCAELCAEKFQITREQ 183

Query: 198 ------QSFERGLAAQNNGQLSWEIAPVKVPSGRGKPSSIFDKDESLRQFDAAKLRKLRP 257
                 QSFERG+AAQ  G  +WEI PV+V  GRG+PS+I DKDE L +FDAAKLRKLRP
Sbjct: 184 QDDYAVQSFERGIAAQEAGAFTWEIVPVEVSGGRGRPSTIVDKDEGLGKFDAAKLRKLRP 243

Query: 258 SFKEDGGTVTAGNASIISDGAAALVLVSGKKAVELGLEIIAVIKGYADAAQVIKFLMLLS 317
           SFKE+GGTVTAGNAS ISDGAAALVLVSG+KA++LGL ++A IKGY DAA          
Sbjct: 244 SFKENGGTVTAGNASSISDGAAALVLVSGEKALQLGLLVLAKIKGYGDAA---------- 303

Query: 318 FILEKDHVAYRKNDYQAPELFTTAPALSIPKAISNAGLHASQIDYYEINEAFSVVALANQ 377
                          Q PE FTTAPAL+IPKAI++AGL +SQ+DYYEINEAF+VVALANQ
Sbjct: 304 ---------------QEPEFFTTAPALAIPKAIAHAGLESSQVDYYEINEAFAVVALANQ 363

Query: 378 KLLGLDPDRVNVHGGAVSLGHPLGCSGARILVTLLG----RNGKYGVGAVCNGGGGASAL 426
           KLLG+ P++VNV+GGAVSLGHPLGCSGARIL+TLLG    RNGKYGVG VCNGGGGASAL
Sbjct: 364 KLLGIAPEKVNVNGGAVSLGHPLGCSGARILITLLGILKKRNGKYGVGGVCNGGGGASAL 403

BLAST of CmaCh13G003170 vs. TAIR10
Match: AT5G48880.2 (AT5G48880.2 peroxisomal 3-keto-acyl-CoA thiolase 2)

HSP 1 Score: 180.6 bits (457), Expect = 2.0e-45
Identity = 147/443 (33.18%), Positives = 222/443 (50.11%), Query Frame = 1

Query: 8   VSSFSPMAPLSSDSINPRDVCIVGVARTPI-----GAFLGSLSS--FSATQLGALKRANV 67
           VS  SPMA    D      + IV   RT I     G F  +L     ++     ++R ++
Sbjct: 38  VSEVSPMAAFGDD------IVIVAAYRTAICKARRGGFKDTLPDDLLASVLKAVVERTSL 97

Query: 68  DPSLVQEVFFGNVLSANLGQA-PARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTI 127
           DPS V ++  G V++    +A   R AA  AG P+SV   T+N+ C+SG++A    A +I
Sbjct: 98  DPSEVGDIVVGTVIAPGSQRAMECRVAAYFAGFPDSVPVRTVNRQCSSGLQAVADVAASI 157

Query: 128 QLGINDVVVSGGMESMSNVPKYLQLRKGSRFGNDTVVDGMLK--DGLWDVYNDFAMGVCG 187
           + G  D+ +  G+ESMS       +  G   G++       K  D L        MG+  
Sbjct: 158 RAGYYDIGIGAGVESMSTD----HIPGGGFHGSNPRAQDFPKARDCL------LPMGITS 217

Query: 188 EMCASQHSITREDQ------SFERGLAAQNNGQLSWEIAPVKV----PSGRGKPSSIFDK 247
           E  A +  +TRE+Q      S +R  AA  +G+L  EI PV      P  + + + +   
Sbjct: 218 ENVAERFGVTREEQDMAAVESHKRAAAAIASGKLKDEIIPVATKIVDPETKAEKAIVVSV 277

Query: 248 DESLR-QFDAAKLRKLRPSFKEDGGTVTAGNASIISDGAAALVLVSGKKAVELGLEIIAV 307
           D+ +R   + A L KL+  FK++G T TAGNAS ISDGA A++L+    A++ GL I+ V
Sbjct: 278 DDGVRPNSNMADLAKLKTVFKQNGST-TAGNASQISDGAGAVLLMKRSLAMKKGLPILGV 337

Query: 308 IKGYADAAQVIKFLMLLSFILEKDHVAYRKNDYQAPELFTTAPALSIPKAISNAGLHASQ 367
            + +A                              P +    PA++IP A   AGL+ S 
Sbjct: 338 FRSFAVTGV-------------------------EPSVMGIGPAVAIPAATKLAGLNVSD 397

Query: 368 IDYYEINEAFSVVALANQKLLGLDPDRVNVHGGAVSLGHPLGCSGARILVTLL------G 424
           ID +EINEAF+   + + K L LD ++VNV+GGA+++GHPLG +GAR + TLL      G
Sbjct: 398 IDLFEINEAFASQYVYSCKKLELDMEKVNVNGGAIAIGHPLGATGARCVATLLHEMKRRG 438

BLAST of CmaCh13G003170 vs. TAIR10
Match: AT1G04710.1 (AT1G04710.1 peroxisomal 3-ketoacyl-CoA thiolase 4)

HSP 1 Score: 180.3 bits (456), Expect = 2.6e-45
Identity = 145/444 (32.66%), Positives = 223/444 (50.23%), Query Frame = 1

Query: 9   SSFSPMAPLSSDSINPR---DVCIVGVARTPI-----GAFLGSL-SSFSATQLGAL-KRA 68
           +S S  A LS DS   +   DV IV   RT +     G+F  +      A+ L AL ++ 
Sbjct: 23  ASLSASACLSKDSAAYQYGDDVVIVAAQRTALCKAKRGSFKDTFPDELLASVLRALIEKT 82

Query: 69  NVDPSLVQEVFFGNVLSANLGQAP-ARQAALGAGIPNSVICTTINKVCASGMKATMIAAH 128
           NV+PS V ++  G VL     +A   R AA  AG P +V   T+N+ C+SG++A    A 
Sbjct: 83  NVNPSEVGDIVVGTVLGPGSQRASECRMAAFYAGFPETVPIRTVNRQCSSGLQAVADVAA 142

Query: 129 TIQLGINDVVVSGGMESMSNVPKYLQLRKGSRFGNDTVVDGMLKDGLWDVYNDFA-MGVC 188
            I+ G  D+ +  G+ESM+  P+     KGS   N    +          +N    MG+ 
Sbjct: 143 AIKAGFYDIGIGAGLESMTTNPRGW---KGSVNPNVKKFE--------QAHNCLLPMGIT 202

Query: 189 GEMCASQHSITREDQ------SFERGLAAQNNGQLSWEIAPVKVP-----SGRGKPSSIF 248
            E  A + +++RE+Q      S  +  +A  +G+   EI PVK       +G  KP ++ 
Sbjct: 203 SENVAHRFNVSREEQDQAAVDSHRKAASATASGKFKDEITPVKTKIVDPKTGDEKPITVS 262

Query: 249 DKDESLRQFDAAKLRKLRPSFKEDGGTVTAGNASIISDGAAALVLVSGKKAVELGLEIIA 308
             D        + L KL+P FKEDG T TAGN+S +SDGA A++L+    A++ GL I+ 
Sbjct: 263 VDDGIRPNTTLSGLAKLKPVFKEDG-TTTAGNSSQLSDGAGAVLLMRRNVAMQKGLPILG 322

Query: 309 VIKGYADAAQVIKFLMLLSFILEKDHVAYRKNDYQAPELFTTAPALSIPKAISNAGLHAS 368
           V + ++                              P +    PA++IP A+  AGL  +
Sbjct: 323 VFRTFSAVGV-------------------------DPAIMGVGPAVAIPAAVKAAGLELN 382

Query: 369 QIDYYEINEAFSVVALANQKLLGLDPDRVNVHGGAVSLGHPLGCSGARILVTLL------ 424
            +D +EINEAF+   +  +  LGLD +++NV+GGA+++GHPLG +GAR + TLL      
Sbjct: 383 DVDLFEINEAFASQFVYCRNKLGLDAEKINVNGGAIAIGHPLGATGARCVATLLHEMKRR 429

BLAST of CmaCh13G003170 vs. TAIR10
Match: AT2G33150.1 (AT2G33150.1 peroxisomal 3-ketoacyl-CoA thiolase 3)

HSP 1 Score: 178.7 bits (452), Expect = 7.7e-45
Identity = 146/447 (32.66%), Positives = 220/447 (49.22%), Query Frame = 1

Query: 9   SSFSPMAPLSSDS-------INPRDVCIVGVARTPI-----GAFLGSL-SSFSATQLGAL 68
           +S S  A L+ DS       +   DV IV   RTP+     G F  +      A  L AL
Sbjct: 27  ASLSASACLAGDSAAYQRTSLYGDDVVIVAAHRTPLCKSKRGNFKDTYPDDLLAPVLRAL 86

Query: 69  -KRANVDPSLVQEVFFGNVLSANLGQAP-ARQAALGAGIPNSVICTTINKVCASGMKATM 128
            ++ N++PS V ++  G VL+    +A   R AA  AG P +V   T+N+ C+SG++A  
Sbjct: 87  IEKTNLNPSEVGDIVVGTVLAPGSQRASECRMAAFYAGFPETVAVRTVNRQCSSGLQAVA 146

Query: 129 IAAHTIQLGINDVVVSGGMESMSNVPKYLQLRKGSRFGNDTVVDGMLKDGLWDVYNDFAM 188
             A  I+ G  D+ +  G+ESM+  P   +   GS       V+  +K           M
Sbjct: 147 DVAAAIKAGFYDIGIGAGLESMTTNPMAWE---GS-------VNPAVKKFAQAQNCLLPM 206

Query: 189 GVCGEMCASQHSITREDQ------SFERGLAAQNNGQLSWEIAPVKVP-----SGRGKPS 248
           GV  E  A +  ++R++Q      S  +  AA   G+   EI PVK       +G  KP 
Sbjct: 207 GVTSENVAQRFGVSRQEQDQAAVDSHRKAAAATAAGKFKDEIIPVKTKLVDPKTGDEKPI 266

Query: 249 SIFDKDESLRQFDAAKLRKLRPSFKEDGGTVTAGNASIISDGAAALVLVSGKKAVELGLE 308
           ++   D        A L KL+P FK+DG T TAGN+S +SDGA A++L+    A++ GL 
Sbjct: 267 TVSVDDGIRPTTTLASLGKLKPVFKKDG-TTTAGNSSQVSDGAGAVLLMKRSVAMQKGLP 326

Query: 309 IIAVIKGYADAAQVIKFLMLLSFILEKDHVAYRKNDYQAPELFTTAPALSIPKAISNAGL 368
           ++ V + +A                              P +    PA++IP A+  AGL
Sbjct: 327 VLGVFRTFAAVGV-------------------------DPAIMGIGPAVAIPAAVKAAGL 386

Query: 369 HASQIDYYEINEAFSVVALANQKLLGLDPDRVNVHGGAVSLGHPLGCSGARILVTLL--- 424
               ID +EINEAF+   +  +  LGLDP+++NV+GGA+++GHPLG +GAR + TLL   
Sbjct: 387 ELDDIDLFEINEAFASQFVYCRNKLGLDPEKINVNGGAMAIGHPLGATGARCVATLLHEM 437

BLAST of CmaCh13G003170 vs. NCBI nr
Match: gi|659128578|ref|XP_008464272.1| (PREDICTED: acetyl-CoA acetyltransferase, cytosolic 1-like isoform X1 [Cucumis melo])

HSP 1 Score: 691.0 bits (1782), Expect = 1.3e-195
Identity = 370/441 (83.90%), Positives = 383/441 (86.85%), Query Frame = 1

Query: 12  SPMAPLSSDSINPRDVCIVGVARTPIGAFLGSLSSFSATQLG------ALKRANVDPSLV 71
           SPMAP+SSDS+N RDVCIVGVARTP+G FLGSLSSFSATQLG      ALKRANVDPS+V
Sbjct: 46  SPMAPISSDSVNLRDVCIVGVARTPMGGFLGSLSSFSATQLGSIAIECALKRANVDPSIV 105

Query: 72  QEVFFGNVLSANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGIND 131
           QEVFFGNVL ANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGIND
Sbjct: 106 QEVFFGNVLGANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGIND 165

Query: 132 VVVSGGMESMSNVPKYLQ-LRKGSRFGNDTVVDGMLKDGLWDVYNDFAMGVCGEMCASQH 191
           VVVSGGMESMSN PKYLQ +RKGSRFGNDTVVDGMLKDGLWD YNDF MG C E+CASQ+
Sbjct: 166 VVVSGGMESMSNAPKYLQEVRKGSRFGNDTVVDGMLKDGLWDAYNDFPMGACAEICASQY 225

Query: 192 SITREDQ------SFERGLAAQNNGQLSWEIAPVKVPSGRGKPSSIFDKDESLRQFDAAK 251
           SITRE+Q      SFERGLAAQNNG LSWEI PVKVPS RGKPSS FDKDESLR FDAAK
Sbjct: 226 SITREEQDAYAIKSFERGLAAQNNGLLSWEIVPVKVPSVRGKPSSTFDKDESLRHFDAAK 285

Query: 252 LRKLRPSFKEDGGTVTAGNASIISDGAAALVLVSGKKAVELGLEIIAVIKGYADAAQVIK 311
           L+KLRPSFK+DGGTVTAGNASIISDGAAALVLVSGKKA+ELGLE IAVIKGYADAA    
Sbjct: 286 LKKLRPSFKKDGGTVTAGNASIISDGAAALVLVSGKKALELGLEAIAVIKGYADAA---- 345

Query: 312 FLMLLSFILEKDHVAYRKNDYQAPELFTTAPALSIPKAISNAGLHASQIDYYEINEAFSV 371
                                QAPELFTTAPAL+IPKAISNA LHASQIDYYEINEAFSV
Sbjct: 346 ---------------------QAPELFTTAPALAIPKAISNACLHASQIDYYEINEAFSV 405

Query: 372 VALANQKLLGLDPDRVNVHGGAVSLGHPLGCSGARILVTLLG----RNGKYGVGAVCNGG 431
           VALANQK+LGLDPDRVN HGGAVSLGHPLGCSGARILVTLLG    RNGKYGVGAVCNGG
Sbjct: 406 VALANQKILGLDPDRVNAHGGAVSLGHPLGCSGARILVTLLGVLRQRNGKYGVGAVCNGG 461

Query: 432 GGASALVVELMPGARVRHSKL 436
           GGASALVVELMPGARVRHSKL
Sbjct: 466 GGASALVVELMPGARVRHSKL 461

BLAST of CmaCh13G003170 vs. NCBI nr
Match: gi|778659629|ref|XP_011654773.1| (PREDICTED: acetyl-CoA acetyltransferase, cytosolic 1 isoform X1 [Cucumis sativus])

HSP 1 Score: 690.6 bits (1781), Expect = 1.7e-195
Identity = 370/439 (84.28%), Positives = 383/439 (87.24%), Query Frame = 1

Query: 14  MAPLSSDSINPRDVCIVGVARTPIGAFLGSLSSFSATQLG------ALKRANVDPSLVQE 73
           MAPLSSDSINPRDVCIVGVARTP+G FLGSLSSFSATQLG      ALKRANVDPSLVQE
Sbjct: 1   MAPLSSDSINPRDVCIVGVARTPMGGFLGSLSSFSATQLGSIAIECALKRANVDPSLVQE 60

Query: 74  VFFGNVLSANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGINDVV 133
           VFFGNVLSANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGINDVV
Sbjct: 61  VFFGNVLSANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGINDVV 120

Query: 134 VSGGMESMSNVPKYLQ-LRKGSRFGNDTVVDGMLKDGLWDVYNDFAMGVCGEMCASQHSI 193
           VSGGMESMSN PKYLQ +RKGSRFGND VVDGMLKDGLWD YNDF MG C E+CASQ+SI
Sbjct: 121 VSGGMESMSNTPKYLQEVRKGSRFGNDAVVDGMLKDGLWDAYNDFPMGACAEICASQYSI 180

Query: 194 TREDQ------SFERGLAAQNNGQLSWEIAPVKVPSGRGKPSSIFDKDESLRQFDAAKLR 253
           TRE+Q      SFERGLAAQNNG LSWEIAPVKVPS RGKPSS FDKDESLRQFDAAKL+
Sbjct: 181 TREEQDAYAIKSFERGLAAQNNGSLSWEIAPVKVPSVRGKPSSTFDKDESLRQFDAAKLK 240

Query: 254 KLRPSFKEDGGTVTAGNASIISDGAAALVLVSGKKAVELGLEIIAVIKGYADAAQVIKFL 313
           KLRPSFK+DGGTVTAGNASIISDGAAALVLVSGKKA+ELGLE+IAVIKGYADAAQ     
Sbjct: 241 KLRPSFKKDGGTVTAGNASIISDGAAALVLVSGKKALELGLEVIAVIKGYADAAQ----- 300

Query: 314 MLLSFILEKDHVAYRKNDYQAPELFTTAPALSIPKAISNAGLHASQIDYYEINEAFSVVA 373
                               APELFTT PAL+IPKAISNA LH SQIDYYEINEAFSVVA
Sbjct: 301 --------------------APELFTTTPALAIPKAISNACLHHSQIDYYEINEAFSVVA 360

Query: 374 LANQKLLGLDPDRVNVHGGAVSLGHPLGCSGARILVTLLG----RNGKYGVGAVCNGGGG 433
           LANQK+LGLDPDRVN HGGAVSLGHPLGCSGARILVTLLG    +NGKYGVGAVCNGGGG
Sbjct: 361 LANQKILGLDPDRVNAHGGAVSLGHPLGCSGARILVTLLGVLRQKNGKYGVGAVCNGGGG 414

Query: 434 ASALVVELMPGARVRHSKL 436
           ASALVVELMPGARVR+SKL
Sbjct: 421 ASALVVELMPGARVRNSKL 414

BLAST of CmaCh13G003170 vs. NCBI nr
Match: gi|449443498|ref|XP_004139514.1| (PREDICTED: probable acetyl-CoA acetyltransferase, cytosolic 2 isoform X2 [Cucumis sativus])

HSP 1 Score: 672.9 bits (1735), Expect = 3.7e-190
Identity = 361/429 (84.15%), Positives = 373/429 (86.95%), Query Frame = 1

Query: 14  MAPLSSDSINPRDVCIVGVARTPIGAFLGSLSSFSATQLG------ALKRANVDPSLVQE 73
           MAPLSSDSINPRDVCIVGVARTP+G FLGSLSSFSATQLG      ALKRANVDPSLVQE
Sbjct: 1   MAPLSSDSINPRDVCIVGVARTPMGGFLGSLSSFSATQLGSIAIECALKRANVDPSLVQE 60

Query: 74  VFFGNVLSANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGINDVV 133
           VFFGNVLSANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGINDVV
Sbjct: 61  VFFGNVLSANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGINDVV 120

Query: 134 VSGGMESMSNVPKYLQ-LRKGSRFGNDTVVDGMLKDGLWDVYNDFAMGVCGEMCASQHSI 193
           VSGGMESMSN PKYLQ +RKGSRFGND VVDGMLKDGLWD YNDF MG C E+CASQ+SI
Sbjct: 121 VSGGMESMSNTPKYLQEVRKGSRFGNDAVVDGMLKDGLWDAYNDFPMGACAEICASQYSI 180

Query: 194 TREDQ------SFERGLAAQNNGQLSWEIAPVKVPSGRGKPSSIFDKDESLRQFDAAKLR 253
           TRE+Q      SFERGLAAQNNG LSWEIAPVKVPS RGKPSS FDKDESLRQFDAAKL+
Sbjct: 181 TREEQDAYAIKSFERGLAAQNNGSLSWEIAPVKVPSVRGKPSSTFDKDESLRQFDAAKLK 240

Query: 254 KLRPSFKEDGGTVTAGNASIISDGAAALVLVSGKKAVELGLEIIAVIKGYADAAQVIKFL 313
           KLRPSFK+DGGTVTAGNASIISDGAAALVLVSGKKA+ELGLE+IAVIKGYADAA      
Sbjct: 241 KLRPSFKKDGGTVTAGNASIISDGAAALVLVSGKKALELGLEVIAVIKGYADAA------ 300

Query: 314 MLLSFILEKDHVAYRKNDYQAPELFTTAPALSIPKAISNAGLHASQIDYYEINEAFSVVA 373
                              QAPELFTT PAL+IPKAISNA LH SQIDYYEINEAFSVVA
Sbjct: 301 -------------------QAPELFTTTPALAIPKAISNACLHHSQIDYYEINEAFSVVA 360

Query: 374 LANQKLLGLDPDRVNVHGGAVSLGHPLGCSGARILVTLLG----RNGKYGVGAVCNGGGG 426
           LANQK+LGLDPDRVN HGGAVSLGHPLGCSGARILVTLLG    +NGKYGVGAVCNGGGG
Sbjct: 361 LANQKILGLDPDRVNAHGGAVSLGHPLGCSGARILVTLLGVLRQKNGKYGVGAVCNGGGG 404

BLAST of CmaCh13G003170 vs. NCBI nr
Match: gi|659128580|ref|XP_008464273.1| (PREDICTED: acetyl-CoA acetyltransferase, cytosolic 1-like isoform X2 [Cucumis melo])

HSP 1 Score: 671.0 bits (1730), Expect = 1.4e-189
Identity = 360/431 (83.53%), Positives = 373/431 (86.54%), Query Frame = 1

Query: 12  SPMAPLSSDSINPRDVCIVGVARTPIGAFLGSLSSFSATQLG------ALKRANVDPSLV 71
           SPMAP+SSDS+N RDVCIVGVARTP+G FLGSLSSFSATQLG      ALKRANVDPS+V
Sbjct: 46  SPMAPISSDSVNLRDVCIVGVARTPMGGFLGSLSSFSATQLGSIAIECALKRANVDPSIV 105

Query: 72  QEVFFGNVLSANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGIND 131
           QEVFFGNVL ANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGIND
Sbjct: 106 QEVFFGNVLGANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGIND 165

Query: 132 VVVSGGMESMSNVPKYLQ-LRKGSRFGNDTVVDGMLKDGLWDVYNDFAMGVCGEMCASQH 191
           VVVSGGMESMSN PKYLQ +RKGSRFGNDTVVDGMLKDGLWD YNDF MG C E+CASQ+
Sbjct: 166 VVVSGGMESMSNAPKYLQEVRKGSRFGNDTVVDGMLKDGLWDAYNDFPMGACAEICASQY 225

Query: 192 SITREDQ------SFERGLAAQNNGQLSWEIAPVKVPSGRGKPSSIFDKDESLRQFDAAK 251
           SITRE+Q      SFERGLAAQNNG LSWEI PVKVPS RGKPSS FDKDESLR FDAAK
Sbjct: 226 SITREEQDAYAIKSFERGLAAQNNGLLSWEIVPVKVPSVRGKPSSTFDKDESLRHFDAAK 285

Query: 252 LRKLRPSFKEDGGTVTAGNASIISDGAAALVLVSGKKAVELGLEIIAVIKGYADAAQVIK 311
           L+KLRPSFK+DGGTVTAGNASIISDGAAALVLVSGKKA+ELGLE IAVIKGYADAAQ   
Sbjct: 286 LKKLRPSFKKDGGTVTAGNASIISDGAAALVLVSGKKALELGLEAIAVIKGYADAAQ--- 345

Query: 312 FLMLLSFILEKDHVAYRKNDYQAPELFTTAPALSIPKAISNAGLHASQIDYYEINEAFSV 371
                                 APELFTTAPAL+IPKAISNA LHASQIDYYEINEAFSV
Sbjct: 346 ----------------------APELFTTAPALAIPKAISNACLHASQIDYYEINEAFSV 405

Query: 372 VALANQKLLGLDPDRVNVHGGAVSLGHPLGCSGARILVTLLG----RNGKYGVGAVCNGG 426
           VALANQK+LGLDPDRVN HGGAVSLGHPLGCSGARILVTLLG    RNGKYGVGAVCNGG
Sbjct: 406 VALANQKILGLDPDRVNAHGGAVSLGHPLGCSGARILVTLLGVLRQRNGKYGVGAVCNGG 451

BLAST of CmaCh13G003170 vs. NCBI nr
Match: gi|659128582|ref|XP_008464274.1| (PREDICTED: acetyl-CoA acetyltransferase, cytosolic 1-like isoform X3 [Cucumis melo])

HSP 1 Score: 642.9 bits (1657), Expect = 4.1e-181
Identity = 351/441 (79.59%), Positives = 364/441 (82.54%), Query Frame = 1

Query: 12  SPMAPLSSDSINPRDVCIVGVARTPIGAFLGSLSSFSATQLG------ALKRANVDPSLV 71
           SPMAP+SSDS+N RDVCIVGVARTP+G FLGSLSSFSATQLG      ALKRANVDPS+V
Sbjct: 46  SPMAPISSDSVNLRDVCIVGVARTPMGGFLGSLSSFSATQLGSIAIECALKRANVDPSIV 105

Query: 72  QEVFFGNVLSANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGIND 131
           QEVFFGNVL ANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGIND
Sbjct: 106 QEVFFGNVLGANLGQAPARQAALGAGIPNSVICTTINKVCASGMKATMIAAHTIQLGIND 165

Query: 132 VVVSGGMESMSNVPKYLQ-LRKGSRFGNDTVVDGMLKDGLWDVYNDFAMGVCGEMCASQH 191
           VVVSGGMESMSN PKYLQ +RKGSRFGNDTVVDGMLKDGLWD YNDF MG C E+CASQ+
Sbjct: 166 VVVSGGMESMSNAPKYLQEVRKGSRFGNDTVVDGMLKDGLWDAYNDFPMGACAEICASQY 225

Query: 192 SITREDQ------SFERGLAAQNNGQLSWEIAPVKVPSGRGKPSSIFDKDESLRQFDAAK 251
           SITRE+Q      SFERGLAAQNNG LSWEI P                      FDAAK
Sbjct: 226 SITREEQDAYAIKSFERGLAAQNNGLLSWEIVP----------------------FDAAK 285

Query: 252 LRKLRPSFKEDGGTVTAGNASIISDGAAALVLVSGKKAVELGLEIIAVIKGYADAAQVIK 311
           L+KLRPSFK+DGGTVTAGNASIISDGAAALVLVSGKKA+ELGLE IAVIKGYADAA    
Sbjct: 286 LKKLRPSFKKDGGTVTAGNASIISDGAAALVLVSGKKALELGLEAIAVIKGYADAA---- 345

Query: 312 FLMLLSFILEKDHVAYRKNDYQAPELFTTAPALSIPKAISNAGLHASQIDYYEINEAFSV 371
                                QAPELFTTAPAL+IPKAISNA LHASQIDYYEINEAFSV
Sbjct: 346 ---------------------QAPELFTTAPALAIPKAISNACLHASQIDYYEINEAFSV 405

Query: 372 VALANQKLLGLDPDRVNVHGGAVSLGHPLGCSGARILVTLLG----RNGKYGVGAVCNGG 431
           VALANQK+LGLDPDRVN HGGAVSLGHPLGCSGARILVTLLG    RNGKYGVGAVCNGG
Sbjct: 406 VALANQKILGLDPDRVNAHGGAVSLGHPLGCSGARILVTLLGVLRQRNGKYGVGAVCNGG 439

Query: 432 GGASALVVELMPGARVRHSKL 436
           GGASALVVELMPGARVRHSKL
Sbjct: 466 GGASALVVELMPGARVRHSKL 439

The following BLAST results are available for this feature:
Match Name    E-value    Identity (%)  Description
THIC2_ARATH   1.5e-167   70.71         Probable acetyl-CoA acetyltransferase, cytosolic 2 OS=Arabidopsis thaliana GN=At...
THIC1_ARATH   4.1e-165   72.47         Acetyl-CoA acetyltransferase, cytosolic 1 OS=Arabidopsis thaliana GN=AAT1 PE=2 S...
THIL_SCHPO    1.3e-107   51.81         Acetyl-CoA acetyltransferase OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
THIA_CANTR    1.3e-99    48.43         Acetyl-CoA acetyltransferase IA OS=Candida tropicalis GN=PACTA PE=1 SV=3
THIB_CANTR    3.9e-99    48.43         Acetyl-CoA acetyltransferase IB OS=Candida tropicalis GN=PACTB PE=1 SV=3
Match Name         E-value    Identity (%)  Description
A0A0A0LTB9_CUCSA   1.2e-195   84.28         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G173130 PE=3 SV=1
B9HC23_POPTR       4.0e-175   75.63         Truncated acetyl Co-A acetyltransferase-like family protein OS=Populus trichocar...
A0A058ZX35_EUCGR   1.7e-173   74.48         Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_L00148 PE=3 SV=1
A0A061GPF0_THECC   2.4e-172   74.54         Thiolase family protein isoform 1 OS=Theobroma cacao GN=TCM_030451 PE=3 SV=1
A0A0B0PUC9_GOSAR   3.5e-171   74.94         Acetyl-CoA acetyltransferase, cytosolic 1-like protein OS=Gossypium arboreum GN=...
Match Name    E-value    Identity (%)  Description
AT5G47720.2   8.4e-169   70.71         Thiolase family protein
AT5G48230.2   2.3e-166   72.47         acetoacetyl-CoA thiolase 2
AT5G48880.2   2.0e-45    33.18         peroxisomal 3-keto-acyl-CoA thiolase 2
AT1G04710.1   2.6e-45    32.66         peroxisomal 3-ketoacyl-CoA thiolase 4
AT2G33150.1   7.7e-45    32.66         peroxisomal 3-ketoacyl-CoA thiolase 3
Match Name                         E-value    Identity (%)  Description
gi|659128578|ref|XP_008464272.1|   1.3e-195   83.90         PREDICTED: acetyl-CoA acetyltransferase, cytosolic 1-like isoform X1 [Cucumis me...
gi|778659629|ref|XP_011654773.1|   1.7e-195   84.28         PREDICTED: acetyl-CoA acetyltransferase, cytosolic 1 isoform X1 [Cucumis sativus...
gi|449443498|ref|XP_004139514.1|   3.7e-190   84.15         PREDICTED: probable acetyl-CoA acetyltransferase, cytosolic 2 isoform X2 [Cucumi...
gi|659128580|ref|XP_008464273.1|   1.4e-189   83.53         PREDICTED: acetyl-CoA acetyltransferase, cytosolic 1-like isoform X2 [Cucumis me...
gi|659128582|ref|XP_008464274.1|   4.1e-181   79.59         PREDICTED: acetyl-CoA acetyltransferase, cytosolic 1-like isoform X3 [Cucumis me...
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR002155	Thiolase
IPR016039	Thiolase-like
IPR020610	Thiolase_AS
IPR020613	Thiolase_CS
IPR020616	Thiolase_N
IPR020617	Thiolase_C
Vocabulary: Molecular Function
Term	Definition
GO:0016747	transferase activity, transferring acyl groups other than amino-acyl groups
GO:0003824	catalytic activity
Vocabulary: Biological Process
Term	Definition
GO:0008152	metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0042967 acyl-carrier-protein biosynthetic process
biological_process GO:0006090 pyruvate metabolic process
biological_process GO:0006635 fatty acid beta-oxidation
biological_process GO:0008152 metabolic process
biological_process GO:0006574 valine catabolic process
biological_process GO:0006568 tryptophan metabolic process
biological_process GO:0018874 benzoate metabolic process
biological_process GO:0016126 sterol biosynthetic process
biological_process GO:0019745 pentacyclic triterpenoid biosynthetic process
biological_process GO:0006554 lysine catabolic process
biological_process GO:0006552 leucine catabolic process
biological_process GO:0006550 isoleucine catabolic process
biological_process GO:0006633 fatty acid biosynthetic process
biological_process GO:0046950 cellular ketone body metabolic process
cellular_component GO:0005575 cellular_component
cellular_component GO:0005829 cytosol
molecular_function GO:0003985 acetyl-CoA C-acetyltransferase activity
molecular_function GO:0016747 transferase activity, transferring acyl groups other than amino-acyl groups
molecular_function GO:0003824 catalytic activity

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CmaCh13G003170.1	CmaCh13G003170.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR002155	Thiolase	PIR	PIRSF000429	Ac-CoA_Ac_transf	coord: 21..426 score: 1.2
IPR016039	Thiolase-like	GENE3D	G3DSA:3.40.47.10	-	coord: 312..424 score: 7.4E-46; coord: 159..203 score: 7.4E-46; coord: 204..287 score: 1.2E-58; coord: 26..140 score: 1.2
IPR016039	Thiolase-like	unknown	SSF53901	Thiolase-like	coord: 279..424 score: 1.1E-31; coord: 26..281 score: 3.26
IPR020610	Thiolase, active site	PROSITE	PS00099	THIOLASE_3	coord: 406..419 score: ...
IPR020613	Thiolase, conserved site	PROSITE	PS00737	THIOLASE_2	coord: 375..391 score: ...
IPR020616	Thiolase, N-terminal	PFAM	PF00108	Thiolase_N	coord: 27..272 score: 3.4
IPR020617	Thiolase, C-terminal	PFAM	PF02803	Thiolase_C	coord: 320..423 score: 1.5
None	No IPR available	PANTHER	PTHR18919	ACETYL-COA C-ACYLTRANSFERASE	coord: 320..426 score: 1.8E-267; coord: 22..294 score: 1.8E
None	No IPR available	PANTHER	PTHR18919:SF105	SUBFAMILY NOT NAMED	coord: 320..426 score: 1.8E-267; coord: 22..294 score: 1.8E
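
The alignment entries above give each domain match as a `coord: M..N` range on the protein. As a minimal sketch of how such ranges might be read programmatically, the Python below extracts the ranges from an alignment string and computes span lengths. The helper names `parse_coords` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

# Hypothetical helpers for illustration only; not part of the database page.
# Matches "coord: M..N" and captures the two integers.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return a list of (start, end) integer pairs for every 'coord: M..N' in text."""
    return [(int(start), int(end)) for start, end in COORD_RE.findall(text)]


def span_length(start, end):
    """Number of residues in a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    # Pfam Thiolase_N and Thiolase_C ranges as listed in the table above.
    alignment = "coord: 27..272 coord: 320..423"
    for start, end in parse_coords(alignment):
        print(start, end, span_length(start, end))
```

The regular expression ignores the `score:` fields, so it also works on entries where the score text is cut off.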