ClCG06G009580 (gene) Watermelon (Charleston Gray)

NameClCG06G009580
Typegene
OrganismCitrullus lanatus (Watermelon (Charleston Gray))
DescriptionL-allo-threonine aldolase
LocationCG_Chr06 : 16522440 .. 16544617 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTTTTTTTTTTTTTCCACATTGCTTTAATTTTCTGAGAAAATAAATGAATAAAAGCATATGATAATGACTCTTCTCTTGTCTTATCTTCCTATAAATTTCCTTTATGTTTATGCAACTCAATCCAATCCAATGGTGGGTTCATCGATATCTAAGGTGAGTGAAATTTAATTTCATGTTTTTGTTTTGAGGAATTTTTGTTGTATTTTTCTTTTCTATTGAAAGGAACGGAAAGTTTCATTTATTTATGTGTTTATGATCTCTACTTGTTTATATGTTTGCAGTGTGTTTGAAGTACTTATAGAAGCTATCAACTGATTTTGTTTTTTGGGGTTTATCCTCCATTTCAAGAGTAAGGAATCGCTATTTTTTCTTGTAATTTTCAATTTAGATATTGATAATTAATTCTTATTTCCATTTATGAAGAAATTGTTTTATTTTTTCATTTTTAGATTCAAAAGGCAAGAGATTTGAACGGACTCTTTAAAAAAAAAAATTAGTATGGACAATTAGTTACACTCTTTTTGAGGGGTTCAATATTGAGTAAAGATTCTTGTTTTTGTTTTTGTATAGGTAAGAAATCATAAACGGCGTGAATTATATTTTAGTTTATTCTTTATCATTTTCTATTTTTATTATATATTTTCAGAAAAAAAAAAAGAATATATTTGGTAATTATTTTCTTTCCATACACACCTTTATGAAGATCTTAACTAGCAAACAAAAATTATTGGTTTCTGTAATTATGTTCCTCTTAGATCCTTATTTTATACAATCATTTTGTGACCTCCGGTACAGTACAAGATCAGGTATACTCCTCATACAAAAAGGCAACATGTTTGATATATAAATGACCGATCTGTGTATAGGAATCAACAAAAATGGTGAGCAGAAAGGTGGATTTGCGGTCGGACACAGTGACGAAACCAACCGAATCGATGCGAGCTGCGATGGCGATTGCTGAGGTGGATGATGATGTGTTAGGGCATGACCCCATAGGGTTAGAGTTGGAAGAAGAGATGGCAAAGATAATGGGGAAAGAAGCAGGGTTATTTGTTCCCTCAGGCACAATGGGAAATCTCATAAGTGTTCTTGTACACTGTGACATTAGAGGGAGTGAAGTGATTGTTGGGGACAATTCCCATATTCACATTTTGGAAAATGGAGGCATTGCAACCATTGGAGGAGTTCATCCAAGGACAGTCAAAAACAATGCTGATGGAACAATGGATATTCATTTGATTCAAGCTGCCATTAGAAACCCAAAGGCCCAACTCTTCTTCCCAACAACAAGGCTCATTTGTTTAGAAAATACACATGCAAAGTAAGTATATTTATAATTAATTTCCCAATCATTTCCCTCTTTTGCTTCCTAATTAACCATTTATTTTATTTTATTTTATTTTATTTTATTTTTTTAAATTCTATCATGTCCTCTCAATTAAAAGAATATTCTTAACCAAATTTCAACTCCTAATTGATCTGCTTTCTTTTAATATTTTAAAACTATTTTTTTTTTTTAATTTTCAAAATTTAACTTTGTTTTTAGTTTCTAAAAAATCGATGTATTTGATATCTAATTGAATTTCCTATTTTTTCAATTTTTAAAACATTTTAAAACATGTTACTAGATTTCAAAAAATGAAAGAAACAAAAATTATTTTCAAAAACAGGTTTTCGTTTTCTAAAATGCGTGTGAATTCTATATATGATAAAACGATGACTAATGACGTATTTGAAACAAAAAATAAACACTTAAAAAAGTTAATCAAACAAGTTTAAATTCAATAAAAACTCCATATGTGGTTTTCCAGTTTGTAATTATTTTACAAATATTTCTTTTTTCTTTTTTTTTTTCTTAAAAGAGAATATTTGTCTTGGAATATTTCTTTTCCATTTTATAATCTCGACTTCCAACATGTTGAAAAAGTGTTCTTTTTTTATTTTCTTTTATATATATACACTAAAGCTCCAAATAAATTCTATTTTTTCTTTTATTTATTTTCCATTTTCTAATAACAATTTCAAATTTTTACATGAAATTTAATGGTGATAATATCGATTTTTTCCCACTTTAGTTCCACATCAATTATTGTCTTTTTTTTTTAACAAGAATATCTTTAAGTTTTTTTAAACAAAATTCAAAATGAGGGGAACAAGATAAATCGAATTAATTGTTTTGATTGTTTACTTATTTACTTATAACCTGCTCCTAAAAAATATCTGCTTTTATTTTATTTTGGAGATTTTTATATTCTTATCATTTTTCTTTATTTGATTCGTGATTAATCAAATACATTTAAAATATTTAGATTATTTTATTAAAAATTATTATAAATAGAAAAAAATATCAAACTATTAACAAATATATTAAAATTTTACTTTCTATCACTGATAGACCGCGATAGACCTATATGGTAAAGTGATGGCTGTCTATCTGGATCTATCACAATCTATCACATATATAAAGTAAAATTTTGCTATATTTGTAATTATTTTCAACAATTTTTCTTTTTTAAAAAATATCCCTTATTTTATTAACATGATATGTTTAAAAAAGTGAAAAATAAAGATATAATTTTACAATTATTATAGGACTCTTTTCAAATATAGCAAAAGGAACCAAAACATTTACAAATATAGCAAATTTCACTATTCATCTGTTATAGACCGCTATAGATCGCAAATATACTTTGTATTTTGCTATTAATTTGTAAATATTTTCAACAGTTTTGTCATTTAAGATAATTTTCTATTATTTATATTAGATTGAAAAACATTGTAAATATAGTTTGATTTTATTTTTGTATAAAAAATATAAAACTAAAAAAAAGTAGTCAAAACTTATTTTAGTTCCCTATAAACAAAAATGTGTTTTTAGTTTTTGTTCTTAAAAAAAAAACAAAAAATAATTATAAAAATAAGGTATTTGTTTTTTCAAAATTCAAAACCAAAAACAAAATTTAAAAAATTTAAAGGCATGAGTGTCTATAGACATTATCTATTTTCGAAAACTAAAAAAAAAAAAAAAATGTTACTAATTGAGTCCTAAGCTTTTTACTTTTGATTGCGTTTTATCTAATTTTGTGTCCGAATAATTCCTTGTCGTCTTTGTAGTTTAATGCGTAAGTAGAATATATGCTGATTGTTTATTTTAAGATAACATGACATGTTGACTTTAGACAAATAAAATATTAATTAAAGTTAACAATTTAGTTCTTAAACTCTAATTTATAACAAAGTTCGTTAACTTTCAATTTTGTAATAATCTAATATTTAAATTTTAGTATGTAACAATTTTGGTCCATGTATTTTAAAATTTGTAACATTTAGTTTCTATTATGAAAATTAGTGTTAAGATTTATTACATAAGTAGATTAATTAATTAATTAATTAGGGACCTAATACATTTATATGCTACAAAGTCTACCTGTAATGCTAAATTATATATATATATATATACATTGACATTTTCTGGTAAAAATTAAATTGTTATAAATTTGAAACTACAAGAACTAAATTTCTACATAATTGAAAGTAAAGTGACTAAATCATTCTAAATAAAAGTTAAAAGACTAAATTATTATAAATTTAACTTTAACGACTAAATTGTTTTTTAACTTACAATAATCTTGTAATGCAATCACTAAACTAATTACAACATTAACAATGAAAGATGTAGTGGATAGAAGAGACTAAATTTATAATTTAATTAAAACTATATTTTAGGCTTTTTAGTTACTGCTTCATGTTTAAAACTAGTTAACAAGACCTTGAAATCGACATGTTTTTTCCTTATCCTTTTTTTTTTTTATCTTTTTTTTTCTTTTTTTTTGAATTACTTAAAATTAAGTCAAATTGTTATATACACGAGTAAAAAATTATGAATGAAATGCAGCTCTGGTGGAAAATGTCTTTCAGTAGAATATACTGACGAAGTTGGAGAATTAGCTAAGAAGCATCACCTCAAACTTCACATTGACGGAGCTCGTATTTTCAATGCTTCAATTGTAAGTATAATCTATAAATAATATCTTTTCCTTGTTCAATCATATATATATATATATAGTTTTAGTTTTAGAAGTTGTTTGTTTCACAGGCATTTTAAATTTTCATGGAATTAACATTGCCTTTTGCTTAATTTTTCGTGGACTAGATTTGTTTTGTCGTATTGTAAAGATGTCATGATATCTTAAGATTCTGAACATTCTAGGATGCTTTATTGGGATTTGGGATCTAGAGAGTTTTGGATGCCTTCTTGAAACATTAATTTTGCAGATTCCCAGGTTGAGGGCATCCATATTTTATTGTTTTGCTCTTTCTTTTCTTTTTTCTTTTATTTATATATATAATTAAAATAATTAAAATAAATGTCACTCTTCTTTATTTGCATTATTATAATTAAAATAAATATTATCCGTCATATTTATCACTTTTCTCGGATATATAGTAGTGTTTTTTCTTTTAGCAATATATTCGCATTCAAATTTAAATTTATTTTATGAAAAAAAATCTTAAATTATTGTATAATAATTTTGTTAAAAAAATAGCATATAAAAATAATATAAATTGTATTGATGAAAAAAATTCAAGTTGATGAAAACAAATATGTTATATAAGTTTAAATCATATTAAATTAATTATAAATTAATAAATAATCGGAGCATCCTTGGAACATTAGGTAAATTAGAAATCCTAGTAAACAAACATAGTTATCAAAGCATTTTCCAGATATCTACAAAAATATTGATATGAAATAAATAAGGTTATCTAAAGATGTTACACCTTTGTAGTTCTAGGCATCTATAATTCTAAAGATCTAAAATTTCTGGTTATCTATATTTCCGCGGGTTAAGTAATATTTTACCAACAACATTTTCGTATACAATTAAAAAATATTATATTTTCAGGCACTTGGTGTTCCAGTGGATCGATTGGTACAAGCGGCTGACTCAGTATCTGTATGTCTTTATTTCTTTCTGCTAATAATTTGTTTTTATTAAATATGATTTGGAATTTATGGAGGAAATTCAACCTTTAACAAAATACAACATTATATCAAGGTTTGATATTTGGCTCATCAGGTATGTCTATCAAAAGGTTTGGGCGCACCTGTTGGATCAGTTATTGTGGGTTCCAAAGACTTTATTGCCAAGGTATAGTGGTTGGATGTTTTTTGGGACATATTAGTATTATGTTTGAGAATGTGTTTGAATTGGTTTCTTTAAATAAAAACTTCAATAACCCATTTTTGCAATTTCAAACTCTCTTGAGACATAGTATGGCAACCTAGGTTATTATATATTTGGAACTTCCTTCCATTGACAAATCTAATCCAATTTAAAAGGGGGTGAAATATGTTTAGAAATTAGTTAGAATGGGTTTGGATGAATTTTGAAAGCAATGCTTTTAAATGAAGAACTTTTTCCTATAAGTCTTTATGGAAAGGGACAAAAGAAGTTAGCACTTGCTATTAAGTGTTTTTTAAAAGCATTTCTTTTACCATGTTACCAAACATCATGCAAACAGAATATGTGATTTATCTTGCTAAACACATTTTGTTTTACATTTGTGTTAAAGTAGTCACCAAATTTCCAAATATTAGAATGATGGATCAAGTCCCAACAATTAGAAAATGCTCGATTTTGATTACTCAATTAACAGAAATTGTTAACGATTAACGAATGTATTGACATGGAATGAGAAAAAGAAAGAAAATCTCTTTGTCCCCTTGTTTCTCAAGACTTGTTCTTTTTCATCTAAAACTTGTATTGATTCTTCTTCATGAGACAAATCTTCCTTCAACTATATCGGTTGCACTTCAAGCGCATGAGTAGGATAGGGAATATACTTCGTTAACACCGATACATGGAAAACATTATGGATCTTGGACATCTCCATAGGTAAAGCTAACTTATAAGTTGCTGGTCCAACTCTTTCCAAGATCTCATAAGGTCCAATATATCTCGGACTTAGCTTCCCTTTTTTCCTGAACCATAACACTCCTTTCCCGGGAGACAACTTAAGGAATACTCAGTGTCCTACATCAAGCTCAAGATCTTTTCTTCTATTATCTACATAACCCTTCTATCTATCTCGAGCTATTTTCAACTTTTCCTTAATTATCTCAACTTTCTCTGTCATAGCTTGGACTATTTCTAGTCCGATTAACTTCCTTTGTCCAACTTCACCTCGACATACATGTATCCTGCACGGTCTACCATACAAAGCTTCATATGGGGTCATTCCAATACTCGCCTAATAACTGTTATTATACACAAACTTCATCAGCAACAGACGTGCATCCCAACTACCTTTAAACTGCAGTGCACACGCTCGAAGCATATCCTCGAAAGTCTGGATAGTCCTCTCTAATTGTCCATCAGTTTGAGGATGAAATGTTGTACTAAACTAAAACTTAGTATCCAGAGCTTGCTATAAACTAGTCTAGACCTTAGAAGTAAACCTCGGGTCACAATCGACATTGGTAACCCATATTGGTTCACTATTTTATCCATATACATCTGTGCCAAATTATCCAAGGTATAGGTTGCCTTTACCGGTAAGAAACTTGTCGTCTTGGTAAGTCTATCTACTATCACTCGTATACCATCAACACCACTCGAAGTCCTTGGCAACCCAAATAGAAAATCCATCATCACATGCTTTCAATTCCTCTCAAGAATAGGGAGTGGATTCAGTAATCCTGCTGGTCTTTGACGCTCGGGTTTAACTTGCGAGCACATCAAACATTTGACTACATATTCAGCTATTTCTCGTTCCATACCTAGCCACCAATAATAATTCCGTAAAGTATGATACATCTTGGTACTACCAAGATGCATCACATAAGCTGAATTGTGAGCTTCTTCTAAGATAGCTTGTTTAACTTCTAGATCCTTTGGCACACAAACTCTGTCGTGATTCAACAATGCTCTATTAGCTCTCAACTCATAATCTGGCCTCTTTTGAGCCTTTACCTCTTCTATTAATTTCCTGATTACAGGATTGTTTGATTACTTCTTAGTTACCCCATCTATTAGCTTAGGTCTTAAATGAAAATGCGCTAACAAACCTCCATTCTCATTTACTGATAAGACAGTAGCACCACTTCTGAACTCCTTAAGTAGAGTAGTCCTTATAGCATTAAGAGAACTCTAACTACTTTTTGACTTCCAACTCAAAGCATCAGCTACTAAATTCACTTTACCTAGGTAGTACTCAATCGTGCAATCATAATCTTTAATCAACCCCAACCACCCTCTTTGTCTCATGTTCATCTCCTTTTGATCAGAAATATATCTTAGACTCTTATGATTCTTGTAGATATGACATATCTTTCCAAATAAATAATGCCGTCATATCTTTAGAGCTAACATTAGTGCAACTAATTCTAAATCATGAGTAGGATAGTTAATCTTGTGAGGTCTCAACTGTCTAAAAGCATAGCCTATCACATTCCCACCTTGCATAAGAACACAACCTAATCCCTCACGAGAAGCATCACAATATACCTCAAACTCTTTTCCTGACTTTAGAAGTGACAACATAGGTGTCGTCACTAATCTTCTCTCTTTAGCTCCTGGAAACTTTGTGCGCACTTCTCATCCCACTCAAACTTAACACTTTTCTTCGTCAAACTCTTGAGTGACAAAGGTATTCTCAAGAACCCCTCAACAAACCATCTGTACTATCCAACTAACCCTAGGAAACTACAGGCTTCTATCACCATTGTGGGTCGTTCCCACTTCATTATAGCTTCAGTCTTCTGTGGATCAACATTTACTTGTACTACTGAAACCACATGTCCTAGGAACACCACCTATTCCAACTAGAACTCACACTTGTTGAACTTGGCATACAATTTCCGCTCTCTCAAAGTCTGCAACACCATCTTGAGATGTTCAACATGCTTTTCTTTATCACCAGAATACACCAGAATATCATCAATGAATACTATGACAAACTGATCAAGGTAAGGATGAAATATCCTATTCATAAGGTCCATAAACGCTGCCGGTTCGCTTGTCAGTTCGAACGACATCACTAAGAACTCGTAATGCCCACATCTAATCCTTTCTAAAAGCTGTCTTAGGAACATCTGACGCCCTAACCTTCAACGAGAACACTGGAGCTCCTCTAAGCTTAATAAACAAATCATATATCTGAGGCAACAAATACTTAGTACGTATTGTTACCTTATTCAATTGATAGTAATCAATACATAACCTTAGAGTATTATCTTTCTTCTTGACAAATAACACCGGTGCACCCCACAAGGAAACATTGGGTCTAATATAAACCTTGTCCACCAATTCCTGTAACTGTACCTTCAGTTCCTTTAACTCTGTTGGTGCCATTTGATAAGGTGCTTGAGAGATAGAAGTTGTACTTGGAAGCAGGTCTATAGTAAACTCTATCTCTTGATCAGGTGGTAATCCTGACAACTCTTCTGGAAATACATCTAAATAGTCACATATCACTGGCACGTCTTAAGGCTTCAACTTACCTGATTTAACTTTAGTCACATAGGTTAGGTACACTTCACGGCCTTTACTCATCATTTTCCTAGCCTTTACTGCAGACATCAAGCAAGTAGGAAGAATTTTCCTCATTCCTTTGAACACAACTACTACTTCTCCTAAATTTTCAAACTTCACTTCCTTTTTAAAACAATCAACATTAGCATGATACTTTGACAGAAAGTTCATGCCTAAAATAACATCAAACTCTGCTAACTCTAAAGGCAACAAATCTATCGACATAACTACACTATCAACCACTATCTCACAATTTCGATAAACATGCTCAATAACTATAGCATTAACTGCAGGTGTATGTACTAATAGTGTGTCAATCAATGGCTCTAGCCTCCTATTCATATGCATAACAGAGGTACTAGAAACAAATGAATGCGTAGTTCCTGTATCAATCAACACATAAGCATTCATATTACAAATAAATAATATACCTGTCATAATATCTGATGTCTCCTGAGCTTCTTATTGAGTCATAACATACACCTTTCCTTATTGCCGTGGTCTGCGCACCTATCCTTTCTGTCTTACACTTCCACCGCCTTCACCAACCACTATCTCGGGTCTTGGTTGATTAACAGTTTGGGAGATTGCACGTTGCTCAGCTTCATTTCCTGTCTTCAGCTTGGGGCAATCTCTCTTGTAATGCCCTGTCTGACCACAATTATAACAGACGTTGGTTCCACTCAAACACTAACCCTAGTGGTATCTTCCACAAGTACCACACATGGGCTTCCTAACTATACTAGCAATCGACTCACTGGGTTGCCTCGACTGGGAACCACTGGCTAATTCAGACATCTAAGTGAACCTCCTTTGGATAACGGTTTAAAGCTCCCCTTTCCAGACACACCAGGAACAAACTATCTACCTTCACCTTGACCAAATAGCTCTGGAAACATACCTGACGGTTGCCCTGATCTTCCTCCTCTATGAACCTCTTTCTCAGCTTTCTCTTTAGCTATACTTTTTTCAACTCTCATGGCAGCTTCTACCAACTTTGCAAAATCAGACCATTCTACACTCGCAATCACTGGAGTCCGAATCTCCTTCTGCAGGCCTTCTTCAAATCGTTTGCATCTATCGGTCTCATCAGCTATGACTACCATAGCATACTTTGCTAACTCTGTATACTTCTTCTCGTATTCAGCAACAGACATAGATCTTTAAACCAATCTCAGAAACTCATTTCTCTTAGCGTCACAGAAGGATCGAGGATAATACTTATCTTGGAAAACCTTCTTAAAATCACTCCTCGTCAATGCCTTAGCATCTCTTTTCCTGCTCTTTATCAATTTCCACCAATGCTTGACTCCCTTTTGGAGCAAGAACGTAGCTAACTTCAACTTTCTGTCTTCAGGACAACACATAACCCTAAAACACTTCTCTACCTGATTTAACCATACCTTTACATCAGCAGGATCTATGGTACCCTCGAACACTGCGACACCTAATGCTTTAAGTCTCTTGATCCCATACTTTTTCTCATGATCAGCCTGAACTGTCTCAACACTAGTAGCTAACCTCTAAGCTACCTTAGTGAATATCTATTCCTTCATGTTCACACATGCCTGAGGGTTACTAGAATCTCCCACAAAAGCCTCTCTAGAACCTCCAGTAGCTTACAAAGTTTCGGCTTCTGCTTGCCAACCTCGCCTGTTCCGTCGGGGCATGTTTTCTATAATACATCATAACATTAATATATTATAGACTCAACATGCTTCTAGCTACGATGCATGACCTAACTTCCCAAAATCTTATGCTCTAATACCAACTTGTCACACCCTCTCCCAAGTACTCTTTTAACCTAGCAAAGAATGTGAGGATAGCAAGTATCGACCCTTTTACGACACAAACTGTCAAATCTATACTCTTTTTTATACTCACAACATACAACACCTGATAAACAGATAACAACTGATAACATAAAGCATTCTCATAACCATAAGTAATGCACAAGACTAGTTTGATAACATTCTCCAACACATCCTTGTACAACACAGTTACATAACTTAAATCTCCAACCCTTATGTATAATGCATAATATGTCTTAAGTACAAGTACGTCCAAAGTAATCTAAAAACTAAAGTGATGTAGTAAAATCAAATGAGGGTGGGTAGGCTAAGCACCGCGAACTTTCTGCTACCTAGAAGAAAGAGATTTTAAAGAAAAACATGAGTTGGGTTGCCCAGTGAGTAACATAATACTAAAAATATAATCTTAACATAAAACTCATGCTCATGCTTTAAAGAAATCAAGCTCATAAATCACAGAAGAAAGTCTTTAACTAAAACTCATAAATCATGATCTTTAAATCAAATAACCTTAGTTGAGGTGGAGAAAGTCAAAACTCTCAATGTCGAAAACATCGTTAGCTGATGTGGAGTATTCTCAACACAACAACATCAACTCTCTCTATGTGCACATAGGAATAACTAAATCTTGAGCTGCCCAAGTATTTTCTTAATGTCCTTCAGTATTGGGTCCCAAGGATACTCAAATGTCTTCTCAATGCCCTTCAGCATTAAGTTCCTCTCATAATAATCTTAAGGTTGCCTATATGTCATCTCAATGTCCTTCAGCATTAAGTCCCTTGAAAACTTTTATGAAAACAAGCATGCTTTGGAAAGTAAGCATTTCAATATACAACTGTTCATGGAAAGCTTTAGAATTCTCAAAATTAGCAATAAGTAAACATGATTTTTCAACCTTTAAGTAAACAAGTAAAACATGCTTCTCAAACTCATCAAAATCAAGTACGAATCATACTTTCCAAACTCAAACTTCACAAGGAAAACATTTAGGAAACAATGTTAATTAATAAAAATTGTCACTCACTACTTTGCTCATGCTCTAAATGGATTATATATCTCCCACAAGTCCTCTTGGCTTGAAAATAATTTAACTTACTTAGTTAGGCTTCTTTATAACTTCCTTAAATCATAACTCAAGCCATTTAGGCTTTGAAGATCAAGTCTTAACTCAAATTAACTCACCAACGCCCAAATGATTAACTCACCAACGCCCAAATGATTGACTAATAGCTTAAGTGTTGTAGGATGGCCCTAGGCATGCACCTCAGCCACATACACACAGAGCATGCCCATTGCCCGCACGCATAAGCACAACACACTGTCACACACACACAAACACCATGTTGGGCAACAACTCTCGCACAACCTACTGCCCACAGGCGCGCACACACGGTTGGCCTATGCGTTGAAGTGGCTCATGTGTCTAACGCCTGCCCTTCCTTGGCCACAAGCGCACTCAGTCACACACTGCACACCTCCTCTACCTAGCACGTCTGCCCGCACACCCACTAAGCATCCAACTTGCCCAACTAACCTCTAAACAACTTTTTCAGCCCCTAGGCCACCGACAGTCCACTGCACGTCAACTTCCTTCAACCTTTGGCCAGCTCACCTGATAGCCTCAACTGCATAACCTACATTTTAACCCACCAAAATCAACATCTGTAGTCAATTTTAATTCCGATTTCTTTCACAAACCTCAAGATTAATCTCTTAACTTGCTTACCTTTGCGTTTGAGCTTCTAATTCTTTTTCTATAGTTTGAAATCCATCAATCTTCCAATGTTTCTCTAGATTGCCTTAGAGCATGAACAACCTTCCATCTTCCACAAATTTCAAGCCTTTGCTGTTGCTATCTTTCTCCGACAGTCTTCACACATCAATTAGGCTCAATTTGGAGGATTTTGGTAGAATAACTTCTTAATTGGCACGAGTAGATTTTGCAAACCTTTCTCCTTCGTCCAAGCTTCTATTTATAGCCCTTAAGTTTTCAGCATGCAACACATGAATTTGTCACCTCCTACTTAAATTTTGCCACACTAAATATAAAATTGCACCTAATTAATCTTATTTATCAAATAAATATCATAATCATCCAAAAGTTAAGAAGACTTGGACACCTACCTCAATTTCATCACCTAAACGCATGGCTCACCAAGTCTTGGAGGTTTCTGGCTTGTCCTCACTCAACACCCCTAGGCTCCCTTGCTTGTCAACAAACACTTGGCCCTTGCCTCTCCATGCCTACTTGGCTTAGGTCATTCTCCATGCATGCTTTTCCCCAACACCTACCAACATATGCCTTTTCCACGCATGGTTCTTTCGTTCAATGCCTAACTCCTAATGGTTTGAGCTTCACTTGCCGCCTACCCTTAAGTACTTTCTTTTACCATTCGGCTAACACAAATTACTCACAAACCCGAGACTGACTTGATCGTCAAGTGTGGTGCGCTTGACACCATATCGATCCCTAATTTCTATCGTCTAGTAGGAACTTGGATCATTTTCCCTTGTGATACAAATTTGCAGTTGTTCCATTTTTTTACATCAACAATTTCTATTAGTTAAAAGGCCAAAATCGAACATTTTCCAAATTTTGGAGGCTTGATTATTTGATTTTGAAATCTATAACATAAATATCAACCCTCACACCTCAAAATACAACCCACCCAACCATTTTTCACTACTTCAAAAGTGATATTGATTCAAGTCACCTTACATTATTAAGTCTTTCCCACTTTTTTAAATGCCTTTTTATGCAAATTTGTTCTTGAAATATTGTGCTTAATAATAGGCCACAAGGGTTAGAAAAGCATTGGGTGGTGGAATGAGGCAAATTGGCATCCTTTGTGCAGCTGGACTTATTGCAATAAAAGAGAATGTTCAAAAACTTCAAGCCGATCATGAGAAAGCCAAGCAACTAGCTAGTAATTAAGATCACTCTTTAATTCTTACCAACATTTTCAATTATTTTCACCTCTTTTTTCATATATATATATATATATTTTTTTTTTTTGGATTAATATAGGTGGGCTATACCAAATCAAAGGATTAAAGGTAGATCCAAAATCAGTTGAGACAAACATTGTGAGTATCTTTAAACATGATTAGAATTTTTAGTAATTTATTATGGTGTGTTTGAGTGTATATTAGTTTGTTTTTTTTCTTTAAACCATGCAAAAGTGTTTTAAAATAGTTTAAAAAGTTTGTTTTTATGCTTAGCATAAGTTTTGCTATCATAATTGTATTTGGTAAAAATTAATTTGAACATAATTTAGGTAGATAAAAATATTAATTATCATTTCAAGGTTGATAATGTTTTAATCTCCACTTGTGCAATCATTGAACTAATATATCTTCCTCTCTAATATAGATATTCACACACTTTCTTGTTTCTTTCTATACCTATAACTATCTTGATACTCTCTCTTTACACATCTTCTCTTAAAAATTCCATCATCTAGGATATATAACTACTGTTAAAAATTGGCAAAAAAAAAGATGAGATACAATCATTTAGAAGAAAAAAAACTTTTAGTAAGAAGTAAAAATATGTACACTTTTAAAAGAAGACTATTTCCACTCTTTCAATATTCACTCCTAAGTTTGTCTACCTTTTTCTCTGTTTGCTTTCTATCTTTCTTTGCCTTATTTACAAAGGGAGACATAAATCCTATTTATAGACGAGAGATATCCTTGGAGTTCAAGCAAAAGGATCTTTTTAGGTTCTTGGATTCTATCGTTGGAGCCTAAAGAAATGTTAGAATGTTTTTAAAGCATTTCTCTTGAAATACTTATCTACAATCTTCGAGATAAATCTAGATATATTTTATATCTACACTTACGTTATTCAAGTAGATACTTTAAGAAGCTTCTCCAGAATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTATTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAATCATTCTCTAGAATAGTTAATGTTGCCCTCTACTGTAATACTCCCCCTAAATGATAACTATTCTAGCTCTTTCCATTTTTTAGAGTTGTTTCCAAAATTGTACATTAGGGGTGTACATGGTTTGGGTTGGCAACCCAAAAGACTGAAATAAAGTCTCCAACCCAACCCAACAAAATATGTAAATTTATCTACAACACATGACTTATAACTAAAATCTCATAAAGTTCAAATGTTATATAATAAATATCTATGATATTTATTACAAACTTTAAACAAAGACAAATATCATAACAATTTTAAAAGAAAAATATTAAAAATTTACAAATTAATCCATACAAAGTAATCATAAACTTTAAACGATATATATATATATATATAAAGAATTTGGGTTGGGTTTGGTCAACCCAATTTTTTTTAGCCAACCTGCGACCCAACCCAACCCAAGAACAAAGCTAACCCAACCCAACCTTTACATTTTGAGTTGGGTAGCCCGAGTTGTTTGGGTTGTCGAGTCTTTTGAACACCCCTACTATACATGCTTGGGTTCTTTTAGCCTTCTGGCATTAAGTCCCAACTTCGATTTCTTCTTAATGCATCAACTTCTGTGGCTATGATTATGTTCTTAGCCTCTGTATGTTCTTCTTCATCTTCGTCGTTCTCTTCTTCTCCTTCAATTGGAGTCTATTTCATTTGACTCATCTCTTCTTTTGATCCTTCGTTGAAAAAACCTCTCATCCTTAGGTGTCTGTTATCATTCGAGTGCCTGCCATGATGATACGTCATCAAATATAACATTTCTTAATGATAACACTGTTCACTGACATAATCAATATACTCTATCCTTAGGTGTCCTCACTAATATGACACATAGAGTCCATTCCTTGTCCTTCCATTAGCAATGTATCACTAATTTTGAGATCTTAGTACACCTTAACATTGTTAGGTTCAAACACAATGAAGTTGTCTTCTAATGTTAATTGAGATATTGATATCAAATTCTTCTTCATTCTAGGCATAGATAAAATTTGTTCTCTAACTCCACTTGAATGGAAATGGAATGAGGCATTATCATAGTTTTACTAACTAGAGCTATTTGAAACTTCTACTTATTTGCAACTTCGACAACTTGACCTTCCCTATGCTCTAATGTGGTTAGTAGTTTCCTTTTATCACCATGTCCTGTGATTAGAGCAACTTGAATAAACAATCCAATCATTTTCATAATTTACCTGATCTTGATGAGAGGTAACTCTTTCTTCTCCACCACATTGGATGTCGACTAGGAGTTGGGTTTTGCTATTGCAAAACATGTCACTGGATCCCATTATTCTTCACTAATGCTCTCAACCTAGGAGGTGACAACGTTGCTTTCTATTGATTTTTTTCTTTTGACAGCAATCTTTGGCATAATGGCCTACCTTTGCACAATTATAACATTCACCATATTTTTTCTATACTCACGTTTTCTATTATTTTTTATTAGTTTCCCTTACTTGAGAGCATTTTGGGGAATTTTTTTCATCTTCTTTTGATCCTACTTTCTTTTGATATTTTAGATTACTTTTCTTTGGCCACTAAAGAGGACTCCTTCATTATTACTCTGTATTGTGACATCCAATATTTGTTAGTCAATAATTCTTGACTAGTAAGCATATGTTCTAAGTCAATAAAAGAAGGTTGGACTGCCAAACCTTGAATAATAGCAATAAAGCTTCTATATTCAAGTTTAAGTTCATGAACACTAATTTTTCTCACTGTCGATTCTAAAATAGTAGTTGTAGGATCCAATTCAGAGATTTCATGACGTAGAGTTTTTACATTGGTGAAGTATTGATTGATAGTCAGATACCTTTGAGGAACTATTAGGAGCTTGTTCTCCGGAAACTAAAATCTCGCATCATTTTTCTTTGAGAAAAGTGAGGCAAACATGCCCATGCTATCTTTGGCATCTACACTATATTGATGTGCTCCAACATTTCTTCATCAATTATAATTCTAATTGCAAACATGACCTTACCTTCCTTGATATTCCATTTCTTAAAAGCAATAACATCTTCAGGTGGCTTGACCTTAATGCCTCCCGCAACACCCCACAAGTTTTGGCCTTGGAGATACGACTTCATGCATGTGAACCATGTTTTGTAGTTTTGAGTATTGAGCTTCTTGATTTCACCAACAATTTGAAGGTTTCCCATCATCTTGTAGTGATGTGAATGCACTCCACAAGTAGTTATGCCAAGATAACATGGTTCATAGTTCAGACTATAGACAATCTTCAGAAGCTCCACCACAGCAAGTTATGCTAGTTTGGCTATGATACCACTTGTTGAAGAAATTGGCACCAAATGGACAAGAGGCAGTCACTTGGAAAAAAGAAACTTTTATTAAGGAAGCACAAGTATGGACACTTTGCTTGGTTTACAGATGGAGACATAACTCCTATTTATAGGCATGAGATCATTGGAATACAAGCAAAATGATATTTCTAGGTTCTTAGATTCTAAAATTATTCTAGACATTTCTATGAATTATTCTTATCTAAAGAAATTCTCTAGAATTCATTGGAGCCTAAAGAGATGTTAGGATGTTTCTAAAACATTTTATTTTGATACTTATTTGCAATCTTCTAGATAAATCTAGAGATATTTAATCTATTTACATTTAAGTAATTTATGTTGATACTTAAGGAAACTTCTATAATAGATAGTGGTGTCTTCTACTTTAACATTTATTGAGTATCAAATTGAATGATGGTGAATGATTAAATTGAATAAAATGTTCTAATGTAAGAGATTTAAGAGAAACATTTTAGACCATGTAAACTAAAAAGGACGTCAACTCCGAACCTTAAGAAGTGTGACATACTTTTTTTAGACCATTTACAAAACATTAACTATTGTAACATATATTTGACCAATGTTAATAAAGAAGCAATTGGAAAACTTTTGTCTTAGAGTAGGGATAGAAATGATGTTTTAAACTCTAAGAGATTAGAAGATGAATTAACTTACCCATTTTGTCATAACTTTGAGTTTTTTATTGTGTATCTAATATGACATGAACAATTTGAAGTTTGTGTTTATTTTATTTCAAAATTTTAAAAAATGTTTAATAGATACTAGATTTTTTATCGTGTATCTAATATGACATGAACAATCAATTTTTTTTGTATCTATTAGCTAGCCAAAAAATAAAAAAAGTCTACAAATTTAATTGATTTATTAGATATAAATTTAAATTGTGTATAAACTTTCGTGTTGGTGTTGAACAAATATTTTGATTTAAAGAAAAAAATCTAAAAGACTAAATTTGTAATTTTTAGAAACAAAATAAAGCACCAAACTTCAAAGATCATTTTAATTGCAAACTAAACCAAAAAAAAAAAAAAAAATTACATGTTTAATTGCAATTTGGCTCGACATATCATATGGTGGGATCAATCTAGAACATCATTTTCATTGAGTATTTATATATTGAAAACTTTTCATCTTTTGTGGATTTTATTTTACTGTTTATTAAAAAATCTAAAACTTTTATTTATTTTATTTATTTATTATATGATCAGATATTCTTTGAAATAGAAGAGGATTACGGAATCTCACTGGAAACACTATGTAAAAGCTTGGAAGAACGTGGCATTTTTATGATGCTAGAAAGCCAAATAAGGTCTATCCTACAATTCCTTAAAAGAAAAAGAAAAAAAAAAAAATCTAGTTTCCATTATATTCGTAGTGAGTTTGTGATAACTTTGTTAGCTTTTGGGTAAAATATTTAAAATGTTTAACAAAAATATAGATTTTTGTAGAATGCTTTTTGGTCAAGTGTTTCATTCGGAAACATTTAAAATAATTCTTGAAAGCATAAAAGTTAGAACAAAATGTCAAAGTTATTTACTATGCACACTTTTCATTAAAGTGTATCTATTCAATATTTATATGCTAAAATGTCTGTTTTTCTAAAGATATTTTATGCTTTGAAATTTTGAAAGAAGATTTTGTTTTACTTTTCTTGTTTCCTAATTACAATTTTTATTCAATATATATATATATAGAGAGATTTTTTATTTTATTTTATATATTTGCTAAAACGATCAAGAAAATTTCTTTTTGTTCTTAATTTAAAACTTGAGAGAACAAACAAACAAAAATTACCAAAAAAACATATATTTTCCTTTTGTTCTTTTTGAAACTAAGTGGAAATAAAAAGTTAAAATTTTTAGCTAAGAAAAGTCAGATGTTTATCCAAAATAAAAAAATAAAATTATTGGTCATCAACTGCGAGTCAATTTGACATAATTTTTTAAATAGTGAAACATTTTAATTAATTACAAAATTAAATTAAATAAAAATGCTTTTAAAAGTGGGTCCTATTACATCACATCTGTTCTGAACAATAGTAACTTTCTACTCGATCAGAAAGGTGATAGATTGGAATCGCCAACAAAAATATTTTAAAATAAAATAGGTTAATTTGGTTATTGTATTTATTTATATATTTAGCAAACAAGTTTATTAATTGCAGGGCTAGAATTGTTCTTCATCATCAGATTTCAACAAGTGACGTGCACTACACTTTATCTTGCTTTCAGGTGATCTATTTCTAATGACTGATTCTTGGTTTTCTTTTCTTTTCTTTTTTTCTAAAATAAAAAATATTATATTTAAAAATATAATAAATCTATTAATATTTAAATAAATCACAATTTTTTCTTGAATTGATTAATTTAGTAAAATAGAAAAATAATGTAAAATCATATATCTATTTTGTATTTTCTTCAGAATCTGGGAAAAAAAAAGAGAGTCTAAAATAACTAGACAATTCAAAACAATAATTACATTTTCTTTGTTATCAAGAAATTAACATTTGTTGAAGAAGCAACGTCCCTGTTTTTTAAAATAATTTTTAAGTAACATATATTTATTTTAATAGAAATAATTAAATTTCAAGCTTTTTTCAGTTACTTTTACTAGGTTTTGTTTCAAAAATGTCAAAGACATTCATCACCTTTCAATGATGCATTTAATAAATCTATGACATATTACATACATTTTATAATTAAAAAATCTAATTGAATTATATGATTACATTTAAGATATGTCGATTAGATCAATAACTTTATTAAAATAAAATTAAAAGTTCAATAATTTATTAGACATATAGTCGAAATTCATATAAGCTATAGAATTGAAATGTCAAAAATTTATTAAATTTTTGAAGTATATATAAGAACTAACAATAAGTTTTCACATATCCTTCCCCAATTCTTCTTTAAACTCACAACTGATATAAACCTTTTCACTTTGGCTTCTGCAGCAAACTCTGAGTGGAATTCAAGTTGTAAATGGCAATTAATCACAACTAACTCTTTGTCTCTCTCTCTAATAAACAAGAGTGGAATTTGCCCACTTTGGCTTGTGCTTAATATGCCCTTAACTTGTTGCATAACACATTTCTAATCTCTATATTGGGATGCAATAATAATATGTGTTTTCTTTTTTCTTTTTTCTT

mRNA sequence

TTTTTTTTTTTTTTCCACATTGCTTTAATTTTCTGAGAAAATAAATGAATAAAAGCATATGATAATGACTCTTCTCTTGTCTTATCTTCCTATAAATTTCCTTTATGTTTATGCAACTCAATCCAATCCAATGGTGGGTTCATCGATATCTAAGTGTGTTTGAAGTACTTATAGAAGCTATCAACTGATTTTGTTTTTTGGGGTTTATCCTCCATTTCAAGAGAATCAACAAAAATGGTGAGCAGAAAGGTGGATTTGCGGTCGGACACAGTGACGAAACCAACCGAATCGATGCGAGCTGCGATGGCGATTGCTGAGGTGGATGATGATGTGTTAGGGCATGACCCCATAGGGTTAGAGTTGGAAGAAGAGATGGCAAAGATAATGGGGAAAGAAGCAGGGTTATTTGTTCCCTCAGGCACAATGGGAAATCTCATAAGTGTTCTTGTACACTGTGACATTAGAGGGAGTGAAGTGATTGTTGGGGACAATTCCCATATTCACATTTTGGAAAATGGAGGCATTGCAACCATTGGAGGAGTTCATCCAAGGACAGTCAAAAACAATGCTGATGGAACAATGGATATTCATTTGATTCAAGCTGCCATTAGAAACCCAAAGGCCCAACTCTTCTTCCCAACAACAAGGCTCATTTGTTTAGAAAATACACATGCAAACTCTGGTGGAAAATGTCTTTCAGTAGAATATACTGACGAAGTTGGAGAATTAGCTAAGAAGCATCACCTCAAACTTCACATTGACGGAGCTCGTATTTTCAATGCTTCAATTGCACTTGGTGTTCCAGTGGATCGATTGGTACAAGCGGCTGACTCAGTATCTGTATGTCTATCAAAAGGTTTGGGCGCACCTGTTGGATCAGTTATTGTGGGTTCCAAAGACTTTATTGCCAAGGCCACAAGGGTTAGAAAAGCATTGGGTGGTGGAATGAGGCAAATTGGCATCCTTTGTGCAGCTGGACTTATTGCAATAAAAGAGAATGTTCAAAAACTTCAAGCCGATCATGAGAAAGCCAAGCAACTAGCTAGTGGGCTATACCAAATCAAAGGATTAAAGGTAGATCCAAAATCAGTTGAGACAAACATTATATTCTTTGAAATAGAAGAGGATTACGGAATCTCACTGGAAACACTATGTAAAAGCTTGGAAGAACGTGGCATTTTTATGATGCTAGAAAGCCAAATAAGGGCTAGAATTGTTCTTCATCATCAGATTTCAACAAGTGACGTGCACTACACTTTATCTTGCTTTCAGCAAACTCTGAGTGGAATTCAAGTTGTAAATGGCAATTAATCACAACTAACTCTTTGTCTCTCTCTCTAATAAACAAGAGTGGAATTTGCCCACTTTGGCTTGTGCTTAATATGCCCTTAACTTGTTGCATAACACATTTCTAATCTCTATATTGGGATGCAATAATAATATGTGTTTTCTTTTTTCTTTTTTCTT

Coding sequence (CDS)

ATGGTGAGCAGAAAGGTGGATTTGCGGTCGGACACAGTGACGAAACCAACCGAATCGATGCGAGCTGCGATGGCGATTGCTGAGGTGGATGATGATGTGTTAGGGCATGACCCCATAGGGTTAGAGTTGGAAGAAGAGATGGCAAAGATAATGGGGAAAGAAGCAGGGTTATTTGTTCCCTCAGGCACAATGGGAAATCTCATAAGTGTTCTTGTACACTGTGACATTAGAGGGAGTGAAGTGATTGTTGGGGACAATTCCCATATTCACATTTTGGAAAATGGAGGCATTGCAACCATTGGAGGAGTTCATCCAAGGACAGTCAAAAACAATGCTGATGGAACAATGGATATTCATTTGATTCAAGCTGCCATTAGAAACCCAAAGGCCCAACTCTTCTTCCCAACAACAAGGCTCATTTGTTTAGAAAATACACATGCAAACTCTGGTGGAAAATGTCTTTCAGTAGAATATACTGACGAAGTTGGAGAATTAGCTAAGAAGCATCACCTCAAACTTCACATTGACGGAGCTCGTATTTTCAATGCTTCAATTGCACTTGGTGTTCCAGTGGATCGATTGGTACAAGCGGCTGACTCAGTATCTGTATGTCTATCAAAAGGTTTGGGCGCACCTGTTGGATCAGTTATTGTGGGTTCCAAAGACTTTATTGCCAAGGCCACAAGGGTTAGAAAAGCATTGGGTGGTGGAATGAGGCAAATTGGCATCCTTTGTGCAGCTGGACTTATTGCAATAAAAGAGAATGTTCAAAAACTTCAAGCCGATCATGAGAAAGCCAAGCAACTAGCTAGTGGGCTATACCAAATCAAAGGATTAAAGGTAGATCCAAAATCAGTTGAGACAAACATTATATTCTTTGAAATAGAAGAGGATTACGGAATCTCACTGGAAACACTATGTAAAAGCTTGGAAGAACGTGGCATTTTTATGATGCTAGAAAGCCAAATAAGGGCTAGAATTGTTCTTCATCATCAGATTTCAACAAGTGACGTGCACTACACTTTATCTTGCTTTCAGCAAACTCTGAGTGGAATTCAAGTTGTAAATGGCAATTAA

Protein sequence

MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVPSGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHLIQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARIFNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQIGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYGISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQVVNGN
BLAST of ClCG06G009580 vs. Swiss-Prot
Match: THA1_ARATH (Probable low-specificity L-threonine aldolase 1 OS=Arabidopsis thaliana GN=THA1 PE=1 SV=1)

HSP 1 Score: 512.3 bits (1318), Expect = 4.3e-144
Identity = 250/353 (70.82%), Postives = 299/353 (84.70%), Query Frame = 1

Query: 1   MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 60
           MV R VDLRSDTVT+PT++MR AM  AEVDDDVLG+DP    LEEEMAK+MGKEA LFVP
Sbjct: 1   MVMRSVDLRSDTVTRPTDAMREAMCNAEVDDDVLGYDPTARRLEEEMAKMMGKEAALFVP 60

Query: 61  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 120
           SGTMGNLISV+VHCD+RGSEVI+GDN HIH+ ENGGI+TIGGVHP+TVKN  DGTMD+  
Sbjct: 61  SGTMGNLISVMVHCDVRGSEVILGDNCHIHVYENGGISTIGGVHPKTVKNEEDGTMDLEA 120

Query: 121 IQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 180
           I+AAIR+PK   F+P+TRLICLENTHANSGG+CLSVEYT++VGE+AK+H +KLHIDGAR+
Sbjct: 121 IEAAIRDPKGSTFYPSTRLICLENTHANSGGRCLSVEYTEKVGEIAKRHGVKLHIDGARL 180

Query: 181 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQ 240
           FNASIALGVPV +LV+AADSV VCLSKGLGAPVGSVIVGS+ FI KA  VRK LGGGMRQ
Sbjct: 181 FNASIALGVPVHKLVKAADSVQVCLSKGLGAPVGSVIVGSQSFIEKAKTVRKTLGGGMRQ 240

Query: 241 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 300
           IG+LCAA L+A++EN+ KLQ DH+KAK LA GL Q+KG++V+  +VETN+IF ++E+   
Sbjct: 241 IGVLCAAALVALQENLPKLQHDHKKAKLLAEGLNQMKGIRVNVAAVETNMIFMDMEDGSR 300

Query: 301 ISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQ 354
           ++ E L K+LEE GI ++  +  R RIV+HHQI+TSDVHYTLSCFQQ +  +Q
Sbjct: 301 LTAEKLRKNLEENGILLIRGNSSRIRIVIHHQITTSDVHYTLSCFQQAMLTMQ 353

BLAST of ClCG06G009580 vs. Swiss-Prot
Match: THA2_ARATH (Probable low-specificity L-threonine aldolase 2 OS=Arabidopsis thaliana GN=THA2 PE=1 SV=1)

HSP 1 Score: 483.8 bits (1244), Expect = 1.6e-135
Identity = 240/344 (69.77%), Postives = 284/344 (82.56%), Query Frame = 1

Query: 4   RKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVPSGT 63
           R VDLRSDTVTKPTESMR+AMA AEVDDDVLG+DP  L LE+E+A+I GKEA +FVPSGT
Sbjct: 8   RTVDLRSDTVTKPTESMRSAMANAEVDDDVLGNDPTALRLEKEVAEIAGKEAAMFVPSGT 67

Query: 64  MGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHLIQA 123
           MGNLISVLVHCD RGSEVI+GD+SHIHI ENGG++++GGVHPRTVKN  DGTM+I  I+A
Sbjct: 68  MGNLISVLVHCDERGSEVILGDDSHIHIYENGGVSSLGGVHPRTVKNEEDGTMEIGAIEA 127

Query: 124 AIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARIFNA 183
           A+R+PK  L  P T+LICLENT AN GG+CL +EY D+VGELAKKH LKLHIDGARIFNA
Sbjct: 128 AVRSPKGDLHHPVTKLICLENTQANCGGRCLPIEYIDKVGELAKKHGLKLHIDGARIFNA 187

Query: 184 SIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQIGI 243
           S+ALGVPV R+VQAADSVS+CLSKG+GAPVGSVIVGSK FI KA  +RK LGGGMRQIG+
Sbjct: 188 SVALGVPVKRIVQAADSVSICLSKGIGAPVGSVIVGSKKFITKARWLRKTLGGGMRQIGV 247

Query: 244 LCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYGISL 303
           LCAA L+A+ ENV KL+ DH+KA+ LA GL +I+ L+V+  +VETNII+ +I ED     
Sbjct: 248 LCAAALVALHENVAKLEDDHKKARVLAEGLNRIERLRVNVAAVETNIIYVDIPEDPKFGA 307

Query: 304 ETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQ 348
           E  CKSLE+ G+ ++ ++  R RIVLHHQIS  DV Y LSCF++
Sbjct: 308 EEACKSLEDVGVLVIPQATFRIRIVLHHQISDVDVEYVLSCFEK 351

BLAST of ClCG06G009580 vs. Swiss-Prot
Match: LTAA_AERJA (L-allo-threonine aldolase OS=Aeromonas jandaei GN=ltaA PE=1 SV=1)

HSP 1 Score: 283.5 bits (724), Expect = 3.2e-75
Identity = 161/346 (46.53%), Postives = 217/346 (62.72%), Query Frame = 1

Query: 4   RKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVPSGT 63
           R +DLRSDTVT+PT++MR  M  AEV DDV G DP    LE   A ++GKEA LFVPSGT
Sbjct: 2   RYIDLRSDTVTQPTDAMRQCMLHAEVGDDVYGEDPGVNALEAYGADLLGKEAALFVPSGT 61

Query: 64  MGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHLIQA 123
           M NL++V+ HC  RG   ++G  +HI+  E  G A +G V  + V   ADG++ +  ++A
Sbjct: 62  MSNLLAVMSHCQ-RGEGAVLGSAAHIYRYEAQGSAVLGSVALQPVPMQADGSLALADVRA 121

Query: 124 AIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARIFNA 183
           AI      + F  TRL+CLENTH    GK L + Y  E+ EL  +H L+LH+DGAR+FNA
Sbjct: 122 AIAPD--DVHFTPTRLVCLENTH---NGKVLPLPYLREMRELVDEHGLQLHLDGARLFNA 181

Query: 184 SIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQIGI 243
            +A G  V  LV   DSVS+CLSKGLGAPVGS++VGS  FIA+A R+RK +GGGMRQ GI
Sbjct: 182 VVASGHTVRELVAPFDSVSICLSKGLGAPVGSLLVGSHAFIARARRLRKMVGGGMRQAGI 241

Query: 244 LCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYGISL 303
           L  AGL A++++V +L  DH +A+QLA GL  + G+++D   V+TN++F ++       L
Sbjct: 242 LAQAGLFALQQHVVRLADDHRRARQLAEGLAALPGIRLDLAQVQTNMVFLQLTSGESAPL 301

Query: 304 ETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTL 350
               K+   RGI      ++  R+V H QI   D+   +  F + L
Sbjct: 302 LAFMKA---RGILFSGYGEL--RLVTHLQIHDDDIEEVIDAFTEYL 336

BLAST of ClCG06G009580 vs. Swiss-Prot
Match: YF64_CAEEL (Uncharacterized protein R102.4 OS=Caenorhabditis elegans GN=R102.4 PE=3 SV=3)

HSP 1 Score: 268.9 bits (686), Expect = 8.2e-71
Identity = 152/354 (42.94%), Postives = 213/354 (60.17%), Query Frame = 1

Query: 6   VDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVPSGTMG 65
           +DLRSDTVT P+  MR AMA A V DDV G D     LE+  A++ GKEAGLFV SGTMG
Sbjct: 67  IDLRSDTVTVPSVEMRRAMAEAIVGDDVYGEDTTTNRLEQRCAELFGKEAGLFVTSGTMG 126

Query: 66  NLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHLIQAAI 125
           NL++++ HC  RG E+IVG  +HIH  E G  A   G+   T++   DGTMD++ I+ AI
Sbjct: 127 NLLAIMAHCQ-RGEEIIVGRYNHIHRWEQGNYAQFAGISATTLEVKPDGTMDLNDIEQAI 186

Query: 126 RNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARIFNASI 185
           R     +  P ++LIC+ENTH  +GGK L +E+   V +LA++  LK+H+DGARI+NA++
Sbjct: 187 RVKDCHM--PASKLICIENTHNYTGGKALPIEWMRSVKQLAERRDLKVHMDGARIYNAAV 246

Query: 186 ALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQIGILC 245
           A    V ++   AD+V +C SKGLGAPVGS++VG KDFI +A   RKALGGG RQ GIL 
Sbjct: 247 ASNCSVSKIASFADTVQMCFSKGLGAPVGSIVVGPKDFIDRARHSRKALGGGWRQSGILA 306

Query: 246 AAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDY------ 305
           AA  IA+      ++ADHE+AK LA  +         P+   T +  F  E+D       
Sbjct: 307 AAAHIALDHADATIRADHERAKTLARMIND-----ATPEEFRTKV--FAAEKDITNMVLV 366

Query: 306 ----GISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTL 350
               G++++ L    ++  I  M     R R+VL+  +S  ++   +  +++ L
Sbjct: 367 HCQNGVTVQQLTDFFQKHDILAMTFDARRIRMVLNWNVSDENLETIVEVYKKFL 410

BLAST of ClCG06G009580 vs. Swiss-Prot
Match: LTAE_ECOLI (Low specificity L-threonine aldolase OS=Escherichia coli (strain K12) GN=ltaE PE=1 SV=1)

HSP 1 Score: 253.8 bits (647), Expect = 2.7e-66
Identity = 137/329 (41.64%), Postives = 199/329 (60.49%), Query Frame = 1

Query: 6   VDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVPSGTMG 65
           +DLRSDTVT+P+ +M  AM  A V DDV G DP    L++  A++ GKEA +F+P+GT  
Sbjct: 2   IDLRSDTVTRPSRAMLEAMMAAPVGDDVYGDDPTVNALQDYAAELSGKEAAIFLPTGTQA 61

Query: 66  NLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHLIQAAI 125
           NL+++L HC+ RG E IVG  +H ++ E GG A +G + P+ +   ADGT+ +  +   I
Sbjct: 62  NLVALLSHCE-RGEEYIVGQAAHNYLFEAGGAAVLGSIQPQPIDAAADGTLPLDKVAMKI 121

Query: 126 RNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARIFNASI 185
           +     + F  T+L+ LENTH    GK L  EY  E  E  ++ +L LH+DGARIFNA +
Sbjct: 122 KPD--DIHFARTKLLSLENTH---NGKVLPREYLKEAWEFTRERNLALHVDGARIFNAVV 181

Query: 186 ALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQIGILC 245
           A G  +  + Q  DS ++CLSKGLG PVGS++VG++D+I +A R RK  GGGMRQ GIL 
Sbjct: 182 AYGCELKEITQYCDSFTICLSKGLGTPVGSLLVGNRDYIKRAIRWRKMTGGGMRQSGILA 241

Query: 246 AAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYGISLET 305
           AAG+ A+K NV +LQ DH+ A  +A    Q++    D    +TN++F  + E+   +L  
Sbjct: 242 AAGIYALKNNVARLQEDHDNAAWMAE---QLREAGADVMRQDTNMLFVRVGEENAAALGE 301

Query: 306 LCKSLEERGIFMMLESQIRARIVLHHQIS 335
             K+       +++ +    R+V H  +S
Sbjct: 302 YMKARN-----VLINASPIVRLVTHLDVS 316

BLAST of ClCG06G009580 vs. TrEMBL
Match: A0A0D2U3I3_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_013G187300 PE=4 SV=1)

HSP 1 Score: 545.4 bits (1404), Expect = 5.1e-152
Identity = 274/358 (76.54%), Postives = 309/358 (86.31%), Query Frame = 1

Query: 1   MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 60
           MV++ VDLRSDTVTKPTE+MRAAMA AEVDDDVLG DP    LE E AKIMGKEAGLFV 
Sbjct: 1   MVTKSVDLRSDTVTKPTEAMRAAMATAEVDDDVLGADPTAARLESEAAKIMGKEAGLFVA 60

Query: 61  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 120
           SGTMGNLISVLVHCDIRGSEVI+GDN HIHI ENGGI+TIGGVHPR VKNN DGTMDI L
Sbjct: 61  SGTMGNLISVLVHCDIRGSEVILGDNCHIHIYENGGISTIGGVHPRPVKNNDDGTMDIGL 120

Query: 121 IQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 180
           I+AAIR+P+ ++ +PTTRLICLEN+HAN+GG+CLS EYTD+VGELAKKH LKLHIDGARI
Sbjct: 121 IEAAIRDPRGEIVYPTTRLICLENSHANTGGRCLSAEYTDKVGELAKKHGLKLHIDGARI 180

Query: 181 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQ 240
           FNAS+ALGVPV+RLVQAADSVSVCLSKGLGAPVGSVIVGSK FI+KA R+RK LGGGMRQ
Sbjct: 181 FNASVALGVPVNRLVQAADSVSVCLSKGLGAPVGSVIVGSKSFISKARRLRKTLGGGMRQ 240

Query: 241 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 300
           +G +CAA  +A+KENV KL+ DH+KAK LA GL QIKGL+V+  +VETNIIFF+I E   
Sbjct: 241 LGFICAAAFVALKENVAKLEGDHKKAKVLAEGLNQIKGLRVNVAAVETNIIFFDIVEGSK 300

Query: 301 ISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQVVNGN 359
           I+ E L K LEE G+ +MLE  +R RIVLHHQIS+SDV YTLSCFQQ L G+QV NGN
Sbjct: 301 ITAEKLYKKLEEHGVLVMLEGPLRMRIVLHHQISSSDVLYTLSCFQQALIGVQVENGN 358

BLAST of ClCG06G009580 vs. TrEMBL
Match: A0A061F2K0_THECC (Threonine aldolase 1 isoform 1 OS=Theobroma cacao GN=TCM_026193 PE=4 SV=1)

HSP 1 Score: 543.9 bits (1400), Expect = 1.5e-151
Identity = 271/358 (75.70%), Postives = 308/358 (86.03%), Query Frame = 1

Query: 1   MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 60
           MV+R VDLRSDTVTKPTE+MRAAM  AEVDDDVLG DP   +LE E+AKIMGKEAGLFVP
Sbjct: 1   MVTRMVDLRSDTVTKPTEAMRAAMVTAEVDDDVLGADPTAFQLESEVAKIMGKEAGLFVP 60

Query: 61  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 120
           SGTMGNLISVLVHCDIRGSEVI+GDNSHIHI ENGGI+TIGGVHPR VKNN DGTMDI+L
Sbjct: 61  SGTMGNLISVLVHCDIRGSEVILGDNSHIHIYENGGISTIGGVHPRPVKNNEDGTMDINL 120

Query: 121 IQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 180
           I+AAIR+P+ +L +PTTRLICLEN+HANSGG+CLSV YTD VGELA KH LKLHIDGARI
Sbjct: 121 IEAAIRDPRGELVYPTTRLICLENSHANSGGRCLSVAYTDRVGELATKHGLKLHIDGARI 180

Query: 181 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQ 240
           FNAS+ALGVPV RLVQAADS+SVCLSKGLGAPVGSVIVGSK FI KA R+RK LGGGMRQ
Sbjct: 181 FNASVALGVPVHRLVQAADSISVCLSKGLGAPVGSVIVGSKSFITKARRLRKTLGGGMRQ 240

Query: 241 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 300
           +G +CAA  +A++ENV KL+ DH+KAK LA GL QIKGL+VD  +V+TNII+F+I E   
Sbjct: 241 VGFICAAAFVALQENVGKLEGDHKKAKVLAEGLNQIKGLRVDVAAVQTNIIYFDIVEGSK 300

Query: 301 ISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQVVNGN 359
           ++ E L K+LEE G+ +M E   R RIVLHHQIS+SDV YTLSCFQQ L+G+Q  NGN
Sbjct: 301 LTAEKLYKNLEEHGVLVMPEGPARMRIVLHHQISSSDVQYTLSCFQQALTGVQEENGN 358

BLAST of ClCG06G009580 vs. TrEMBL
Match: A0A0R0HZC9_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_10G166000 PE=4 SV=1)

HSP 1 Score: 542.3 bits (1396), Expect = 4.3e-151
Identity = 272/360 (75.56%), Postives = 310/360 (86.11%), Query Frame = 1

Query: 1   MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 60
           MV+R VDLRSDTVTKPTE+MRAAMA AEVDDDVLG+DP    LE EMAK MGKEA LFVP
Sbjct: 1   MVTRIVDLRSDTVTKPTEAMRAAMASAEVDDDVLGYDPTAFRLETEMAKTMGKEAALFVP 60

Query: 61  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 120
           SGTMGNL+SVLVHCD+RGSEVI+GDN HI+I ENGGIATIGGVHPR VKNN DGT+DI L
Sbjct: 61  SGTMGNLVSVLVHCDVRGSEVILGDNCHINIFENGGIATIGGVHPRQVKNNDDGTIDIDL 120

Query: 121 IQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 180
           I+AAIR+P  +LF+PTT+LICLENTHANSGG+CLSVEYTD VGELAKKH LKLHIDGARI
Sbjct: 121 IEAAIRDPMGELFYPTTKLICLENTHANSGGRCLSVEYTDRVGELAKKHGLKLHIDGARI 180

Query: 181 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQ 240
           FNAS+ALGVPVDRLVQAADSVSVCLSKG+GAPVGSVIVGSK+FIAKA R+RK LGGGMRQ
Sbjct: 181 FNASVALGVPVDRLVQAADSVSVCLSKGIGAPVGSVIVGSKNFIAKARRLRKTLGGGMRQ 240

Query: 241 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 300
           IG+LCAA L+A++ENV KL++DH+KA+ LA GL ++KGL+VD  SVETN++F +IEE   
Sbjct: 241 IGLLCAAALVALQENVGKLESDHKKARLLADGLKEVKGLRVDAGSVETNMVFIDIEEGTK 300

Query: 301 ISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLS--GIQVVNGN 359
              E +CK +EERGI +M ES  R R+VLHHQIS SDV Y LSCFQQ L+  G+Q   GN
Sbjct: 301 TRAEKICKYMEERGILVMQESSSRMRVVLHHQISASDVQYALSCFQQALAVKGVQNEMGN 360

BLAST of ClCG06G009580 vs. TrEMBL
Match: A0A0R0HUG2_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_10G166000 PE=4 SV=1)

HSP 1 Score: 542.3 bits (1396), Expect = 4.3e-151
Identity = 272/360 (75.56%), Postives = 310/360 (86.11%), Query Frame = 1

Query: 1   MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 60
           MV+R VDLRSDTVTKPTE+MRAAMA AEVDDDVLG+DP    LE EMAK MGKEA LFVP
Sbjct: 12  MVTRIVDLRSDTVTKPTEAMRAAMASAEVDDDVLGYDPTAFRLETEMAKTMGKEAALFVP 71

Query: 61  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 120
           SGTMGNL+SVLVHCD+RGSEVI+GDN HI+I ENGGIATIGGVHPR VKNN DGT+DI L
Sbjct: 72  SGTMGNLVSVLVHCDVRGSEVILGDNCHINIFENGGIATIGGVHPRQVKNNDDGTIDIDL 131

Query: 121 IQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 180
           I+AAIR+P  +LF+PTT+LICLENTHANSGG+CLSVEYTD VGELAKKH LKLHIDGARI
Sbjct: 132 IEAAIRDPMGELFYPTTKLICLENTHANSGGRCLSVEYTDRVGELAKKHGLKLHIDGARI 191

Query: 181 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQ 240
           FNAS+ALGVPVDRLVQAADSVSVCLSKG+GAPVGSVIVGSK+FIAKA R+RK LGGGMRQ
Sbjct: 192 FNASVALGVPVDRLVQAADSVSVCLSKGIGAPVGSVIVGSKNFIAKARRLRKTLGGGMRQ 251

Query: 241 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 300
           IG+LCAA L+A++ENV KL++DH+KA+ LA GL ++KGL+VD  SVETN++F +IEE   
Sbjct: 252 IGLLCAAALVALQENVGKLESDHKKARLLADGLKEVKGLRVDAGSVETNMVFIDIEEGTK 311

Query: 301 ISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLS--GIQVVNGN 359
              E +CK +EERGI +M ES  R R+VLHHQIS SDV Y LSCFQQ L+  G+Q   GN
Sbjct: 312 TRAEKICKYMEERGILVMQESSSRMRVVLHHQISASDVQYALSCFQQALAVKGVQNEMGN 371

BLAST of ClCG06G009580 vs. TrEMBL
Match: A0A0B2Q4B1_GLYSO (L-allo-threonine aldolase OS=Glycine soja GN=glysoja_020991 PE=4 SV=1)

HSP 1 Score: 542.3 bits (1396), Expect = 4.3e-151
Identity = 272/360 (75.56%), Postives = 310/360 (86.11%), Query Frame = 1

Query: 1   MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 60
           MV+R VDLRSDTVTKPTE+MRAAMA AEVDDDVLG+DP    LE EMAK MGKEA LFVP
Sbjct: 1   MVTRIVDLRSDTVTKPTEAMRAAMASAEVDDDVLGYDPTAFRLETEMAKTMGKEAALFVP 60

Query: 61  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 120
           SGTMGNL+SVLVHCD+RGSEVI+GDN HI+I ENGGIATIGGVHPR VKNN DGT+DI L
Sbjct: 61  SGTMGNLVSVLVHCDVRGSEVILGDNCHINIFENGGIATIGGVHPRQVKNNDDGTIDIDL 120

Query: 121 IQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 180
           I+AAIR+P  +LF+PTT+LICLENTHANSGG+CLSVEYTD VGELAKKH LKLHIDGARI
Sbjct: 121 IEAAIRDPMGELFYPTTKLICLENTHANSGGRCLSVEYTDRVGELAKKHGLKLHIDGARI 180

Query: 181 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQ 240
           FNAS+ALGVPVDRLVQAADSVSVCLSKG+GAPVGSVIVGSK+FIAKA R+RK LGGGMRQ
Sbjct: 181 FNASVALGVPVDRLVQAADSVSVCLSKGIGAPVGSVIVGSKNFIAKARRLRKTLGGGMRQ 240

Query: 241 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 300
           IG+LCAA L+A++ENV KL++DH+KA+ LA GL ++KGL+VD  SVETN++F +IEE   
Sbjct: 241 IGLLCAAALVALQENVGKLESDHKKARLLADGLKEVKGLRVDAGSVETNMVFIDIEEGTK 300

Query: 301 ISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLS--GIQVVNGN 359
              E +CK +EERGI +M ES  R R+VLHHQIS SDV Y LSCFQQ L+  G+Q   GN
Sbjct: 301 TRAEKICKYMEERGILVMQESSSRMRVVLHHQISASDVQYALSCFQQALAVKGVQNEMGN 360

BLAST of ClCG06G009580 vs. TAIR10
Match: AT1G08630.1 (AT1G08630.1 threonine aldolase 1)

HSP 1 Score: 512.3 bits (1318), Expect = 2.4e-145
Identity = 250/353 (70.82%), Postives = 299/353 (84.70%), Query Frame = 1

Query: 1   MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 60
           MV R VDLRSDTVT+PT++MR AM  AEVDDDVLG+DP    LEEEMAK+MGKEA LFVP
Sbjct: 1   MVMRSVDLRSDTVTRPTDAMREAMCNAEVDDDVLGYDPTARRLEEEMAKMMGKEAALFVP 60

Query: 61  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 120
           SGTMGNLISV+VHCD+RGSEVI+GDN HIH+ ENGGI+TIGGVHP+TVKN  DGTMD+  
Sbjct: 61  SGTMGNLISVMVHCDVRGSEVILGDNCHIHVYENGGISTIGGVHPKTVKNEEDGTMDLEA 120

Query: 121 IQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 180
           I+AAIR+PK   F+P+TRLICLENTHANSGG+CLSVEYT++VGE+AK+H +KLHIDGAR+
Sbjct: 121 IEAAIRDPKGSTFYPSTRLICLENTHANSGGRCLSVEYTEKVGEIAKRHGVKLHIDGARL 180

Query: 181 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQ 240
           FNASIALGVPV +LV+AADSV VCLSKGLGAPVGSVIVGS+ FI KA  VRK LGGGMRQ
Sbjct: 181 FNASIALGVPVHKLVKAADSVQVCLSKGLGAPVGSVIVGSQSFIEKAKTVRKTLGGGMRQ 240

Query: 241 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 300
           IG+LCAA L+A++EN+ KLQ DH+KAK LA GL Q+KG++V+  +VETN+IF ++E+   
Sbjct: 241 IGVLCAAALVALQENLPKLQHDHKKAKLLAEGLNQMKGIRVNVAAVETNMIFMDMEDGSR 300

Query: 301 ISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQ 354
           ++ E L K+LEE GI ++  +  R RIV+HHQI+TSDVHYTLSCFQQ +  +Q
Sbjct: 301 LTAEKLRKNLEENGILLIRGNSSRIRIVIHHQITTSDVHYTLSCFQQAMLTMQ 353

BLAST of ClCG06G009580 vs. TAIR10
Match: AT3G04520.1 (AT3G04520.1 threonine aldolase 2)

HSP 1 Score: 483.8 bits (1244), Expect = 9.2e-137
Identity = 240/344 (69.77%), Postives = 284/344 (82.56%), Query Frame = 1

Query: 4   RKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVPSGT 63
           R VDLRSDTVTKPTESMR+AMA AEVDDDVLG+DP  L LE+E+A+I GKEA +FVPSGT
Sbjct: 8   RTVDLRSDTVTKPTESMRSAMANAEVDDDVLGNDPTALRLEKEVAEIAGKEAAMFVPSGT 67

Query: 64  MGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHLIQA 123
           MGNLISVLVHCD RGSEVI+GD+SHIHI ENGG++++GGVHPRTVKN  DGTM+I  I+A
Sbjct: 68  MGNLISVLVHCDERGSEVILGDDSHIHIYENGGVSSLGGVHPRTVKNEEDGTMEIGAIEA 127

Query: 124 AIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARIFNA 183
           A+R+PK  L  P T+LICLENT AN GG+CL +EY D+VGELAKKH LKLHIDGARIFNA
Sbjct: 128 AVRSPKGDLHHPVTKLICLENTQANCGGRCLPIEYIDKVGELAKKHGLKLHIDGARIFNA 187

Query: 184 SIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQIGI 243
           S+ALGVPV R+VQAADSVS+CLSKG+GAPVGSVIVGSK FI KA  +RK LGGGMRQIG+
Sbjct: 188 SVALGVPVKRIVQAADSVSICLSKGIGAPVGSVIVGSKKFITKARWLRKTLGGGMRQIGV 247

Query: 244 LCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYGISL 303
           LCAA L+A+ ENV KL+ DH+KA+ LA GL +I+ L+V+  +VETNII+ +I ED     
Sbjct: 248 LCAAALVALHENVAKLEDDHKKARVLAEGLNRIERLRVNVAAVETNIIYVDIPEDPKFGA 307

Query: 304 ETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQ 348
           E  CKSLE+ G+ ++ ++  R RIVLHHQIS  DV Y LSCF++
Sbjct: 308 EEACKSLEDVGVLVIPQATFRIRIVLHHQISDVDVEYVLSCFEK 351

BLAST of ClCG06G009580 vs. NCBI nr
Match: gi|659107426|ref|XP_008453666.1| (PREDICTED: probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis melo])

HSP 1 Score: 646.4 bits (1666), Expect = 3.0e-182
Identity = 323/358 (90.22%), Postives = 343/358 (95.81%), Query Frame = 1

Query: 1   MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 60
           MVSRKVDLRSDTVTKPTESMRAAMA+AEVDDDVLG+DP  LELEE+MAKIMGKE GLFVP
Sbjct: 43  MVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVP 102

Query: 61  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 120
           SGTMGNLIS+LVHC+ RGSEVIVGDNSHIHILENGGIATIGGVHPRTVKN  DGTMDI L
Sbjct: 103 SGTMGNLISILVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDL 162

Query: 121 IQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 180
           I+AAIRNPK QLFFPTTRLICLENTHANSGGKCLS+EYTDEVGELAKKH LKLHIDGARI
Sbjct: 163 IEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARI 222

Query: 181 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQ 240
           FNASIAL +PVDRLV+AADSVSVCLSKGLGAPVGS+I+GSKDFI KA R+RK LGGGMRQ
Sbjct: 223 FNASIALAIPVDRLVRAADSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQ 282

Query: 241 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 300
           IGILCAAGL+AIKENVQKL+ADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIE+DYG
Sbjct: 283 IGILCAAGLVAIKENVQKLEADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEDDYG 342

Query: 301 ISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQVVNGN 359
           IS+ETLCK+LEERGIFMMLESQ RARIVLHHQISTSDV YTLSCF+QTL+GI+VVNGN
Sbjct: 343 ISMETLCKTLEERGIFMMLESQTRARIVLHHQISTSDVQYTLSCFKQTLNGIKVVNGN 400

BLAST of ClCG06G009580 vs. NCBI nr
Match: gi|778664314|ref|XP_011660269.1| (PREDICTED: probable low-specificity L-threonine aldolase 1 [Cucumis sativus])

HSP 1 Score: 638.3 bits (1645), Expect = 8.2e-180
Identity = 321/358 (89.66%), Postives = 340/358 (94.97%), Query Frame = 1

Query: 1   MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 60
           MVSRKVDLRSDTVTKPTESMRAAMA+AEVDDDVLG+DP  LELEEEMAKIMGKE GLFVP
Sbjct: 1   MVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEEMAKIMGKEEGLFVP 60

Query: 61  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 120
           SGTMGNLISVLVHC+ RGSEVIVGDNSHIHILENGGIATIGGVHPRTVKN  DGTMDI L
Sbjct: 61  SGTMGNLISVLVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDL 120

Query: 121 IQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 180
           I+AAIRNPK QLFFPTTRLICLENTHANSGGKCLSVEY DEVGELAKK+ LKLHIDGARI
Sbjct: 121 IEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSVEYIDEVGELAKKYDLKLHIDGARI 180

Query: 181 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQ 240
           FNASIALGVPVDRLVQAADS+ VCLSKGLGAPVGS+IVGSKDFIAKA RVRK LGGGMRQ
Sbjct: 181 FNASIALGVPVDRLVQAADSILVCLSKGLGAPVGSIIVGSKDFIAKARRVRKTLGGGMRQ 240

Query: 241 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 300
           IGILCAAGL+AIKENVQKL+ADH+KAKQLASGL+QIKGLK+DPKSVETNII FEIE+DYG
Sbjct: 241 IGILCAAGLVAIKENVQKLEADHKKAKQLASGLFQIKGLKIDPKSVETNIILFEIEDDYG 300

Query: 301 ISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQVVNGN 359
           IS+ETLCKSLEERGIF+ML++Q RARIV HHQISTSDV Y LSCFQQTL+GI+VVNGN
Sbjct: 301 ISMETLCKSLEERGIFVMLQTQTRARIVFHHQISTSDVQYILSCFQQTLNGIKVVNGN 358

BLAST of ClCG06G009580 vs. NCBI nr
Match: gi|659107428|ref|XP_008453667.1| (PREDICTED: probable low-specificity L-threonine aldolase 1 isoform X2 [Cucumis melo])

HSP 1 Score: 627.1 bits (1616), Expect = 1.9e-176
Identity = 316/358 (88.27%), Postives = 335/358 (93.58%), Query Frame = 1

Query: 1   MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 60
           MVSRKVDLRSDTVTKPTESMRAAMA+AEVDDDVLG+DP  LELEE+MAKIMGKE GLFVP
Sbjct: 43  MVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVP 102

Query: 61  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 120
           SGTMGNLIS+LVHC+ RGSEVIVGDNSHIHILENGGIATIGGVHPRTVKN  DGTMDI L
Sbjct: 103 SGTMGNLISILVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDL 162

Query: 121 IQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 180
           I+AAIRNPK QLFFPTTRLICLENTHANSGGKCLS+EYTDEVGELAKKH LKLHIDGARI
Sbjct: 163 IEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARI 222

Query: 181 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQ 240
           FNASIAL +PVDRL        VCLSKGLGAPVGS+I+GSKDFI KA R+RK LGGGMRQ
Sbjct: 223 FNASIALAIPVDRL--------VCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQ 282

Query: 241 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 300
           IGILCAAGL+AIKENVQKL+ADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIE+DYG
Sbjct: 283 IGILCAAGLVAIKENVQKLEADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEDDYG 342

Query: 301 ISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQVVNGN 359
           IS+ETLCK+LEERGIFMMLESQ RARIVLHHQISTSDV YTLSCF+QTL+GI+VVNGN
Sbjct: 343 ISMETLCKTLEERGIFMMLESQTRARIVLHHQISTSDVQYTLSCFKQTLNGIKVVNGN 392

BLAST of ClCG06G009580 vs. NCBI nr
Match: gi|659107430|ref|XP_008453668.1| (PREDICTED: probable low-specificity L-threonine aldolase 1 isoform X3 [Cucumis melo])

HSP 1 Score: 586.6 bits (1511), Expect = 2.8e-164
Identity = 294/329 (89.36%), Postives = 313/329 (95.14%), Query Frame = 1

Query: 1   MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 60
           MVSRKVDLRSDTVTKPTESMRAAMA+AEVDDDVLG+DP  LELEE+MAKIMGKE GLFVP
Sbjct: 43  MVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVP 102

Query: 61  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 120
           SGTMGNLIS+LVHC+ RGSEVIVGDNSHIHILENGGIATIGGVHPRTVKN  DGTMDI L
Sbjct: 103 SGTMGNLISILVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDL 162

Query: 121 IQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 180
           I+AAIRNPK QLFFPTTRLICLENTHANSGGKCLS+EYTDEVGELAKKH LKLHIDGARI
Sbjct: 163 IEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARI 222

Query: 181 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQ 240
           FNASIAL +PVDRLV+AADSVSVCLSKGLGAPVGS+I+GSKDFI KA R+RK LGGGMRQ
Sbjct: 223 FNASIALAIPVDRLVRAADSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQ 282

Query: 241 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 300
           IGILCAAGL+AIKENVQKL+ADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIE+DYG
Sbjct: 283 IGILCAAGLVAIKENVQKLEADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEDDYG 342

Query: 301 ISLETLCKSLEERGIFMMLESQIRARIVL 330
           IS+ETLCK+LEERGIFMMLESQ R + V+
Sbjct: 343 ISMETLCKTLEERGIFMMLESQTRFQQVM 371

BLAST of ClCG06G009580 vs. NCBI nr
Match: gi|659086169|ref|XP_008443794.1| (PREDICTED: probable low-specificity L-threonine aldolase 1 [Cucumis melo])

HSP 1 Score: 579.7 bits (1493), Expect = 3.5e-162
Identity = 297/357 (83.19%), Postives = 318/357 (89.08%), Query Frame = 1

Query: 3   SRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVPSG 62
           +RK+DLRSDTVTKPTE+MRAAMA+AEVDDDVLG+DPI LELEEEMAKIMGKE GLFVPSG
Sbjct: 4   NRKIDLRSDTVTKPTETMRAAMAMAEVDDDVLGNDPIALELEEEMAKIMGKEEGLFVPSG 63

Query: 63  TMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHLIQ 122
           TMGNLISVLVHC+IRGSEVIVGDNSHIHI ENGGIATIGGVH RTVKN  DGTMDI LI+
Sbjct: 64  TMGNLISVLVHCEIRGSEVIVGDNSHIHIYENGGIATIGGVHSRTVKNKDDGTMDIDLIE 123

Query: 123 AAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARIFN 182
           AAIRNPK QLFFPTTRLICLENTHANSGG+CL VEY D+VGELAKKH LKLHIDGARIFN
Sbjct: 124 AAIRNPKGQLFFPTTRLICLENTHANSGGRCLCVEYIDKVGELAKKHDLKLHIDGARIFN 183

Query: 183 ASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQIG 242
           AS+A  V VDRLVQ ADSVSVCLSKGLGAPVGSVIVGSK FI+KA RVRK LGGGMRQIG
Sbjct: 184 ASVATDVSVDRLVQVADSVSVCLSKGLGAPVGSVIVGSKSFISKARRVRKTLGGGMRQIG 243

Query: 243 ILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYGIS 302
           ILCAAGL+A+KENVQKL+ADH KAKQLASGL QIKG+KVD KSVETNIIFFEIEED  IS
Sbjct: 244 ILCAAGLVALKENVQKLEADHHKAKQLASGLCQIKGIKVDLKSVETNIIFFEIEEDSQIS 303

Query: 303 LETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQVV-NGN 359
            + LCKS+EE GI +M ES  R RIVLHHQISTSDV YTL C +Q L G+  + NGN
Sbjct: 304 AKLLCKSMEEHGILLMQESLSRIRIVLHHQISTSDVEYTLKCMKQFLCGVPALQNGN 360

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
THA1_ARATH4.3e-14470.82Probable low-specificity L-threonine aldolase 1 OS=Arabidopsis thaliana GN=THA1 ... [more]
THA2_ARATH1.6e-13569.77Probable low-specificity L-threonine aldolase 2 OS=Arabidopsis thaliana GN=THA2 ... [more]
LTAA_AERJA3.2e-7546.53L-allo-threonine aldolase OS=Aeromonas jandaei GN=ltaA PE=1 SV=1[more]
YF64_CAEEL8.2e-7142.94Uncharacterized protein R102.4 OS=Caenorhabditis elegans GN=R102.4 PE=3 SV=3[more]
LTAE_ECOLI2.7e-6641.64Low specificity L-threonine aldolase OS=Escherichia coli (strain K12) GN=ltaE PE... [more]
Match NameE-valueIdentityDescription
A0A0D2U3I3_GOSRA5.1e-15276.54Uncharacterized protein OS=Gossypium raimondii GN=B456_013G187300 PE=4 SV=1[more]
A0A061F2K0_THECC1.5e-15175.70Threonine aldolase 1 isoform 1 OS=Theobroma cacao GN=TCM_026193 PE=4 SV=1[more]
A0A0R0HZC9_SOYBN4.3e-15175.56Uncharacterized protein OS=Glycine max GN=GLYMA_10G166000 PE=4 SV=1[more]
A0A0R0HUG2_SOYBN4.3e-15175.56Uncharacterized protein OS=Glycine max GN=GLYMA_10G166000 PE=4 SV=1[more]
A0A0B2Q4B1_GLYSO4.3e-15175.56L-allo-threonine aldolase OS=Glycine soja GN=glysoja_020991 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G08630.12.4e-14570.82 threonine aldolase 1[more]
AT3G04520.19.2e-13769.77 threonine aldolase 2[more]
Match NameE-valueIdentityDescription
gi|659107426|ref|XP_008453666.1|3.0e-18290.22PREDICTED: probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis m... [more]
gi|778664314|ref|XP_011660269.1|8.2e-18089.66PREDICTED: probable low-specificity L-threonine aldolase 1 [Cucumis sativus][more]
gi|659107428|ref|XP_008453667.1|1.9e-17688.27PREDICTED: probable low-specificity L-threonine aldolase 1 isoform X2 [Cucumis m... [more]
gi|659107430|ref|XP_008453668.1|2.8e-16489.36PREDICTED: probable low-specificity L-threonine aldolase 1 isoform X3 [Cucumis m... [more]
gi|659086169|ref|XP_008443794.1|3.5e-16283.19PREDICTED: probable low-specificity L-threonine aldolase 1 [Cucumis melo][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001597ArAA_b-elim_lyase/Thr_aldolase
IPR015421PyrdxlP-dep_Trfase_major
IPR015422PyrdxlP-dep_Trfase_dom1
IPR015424PyrdxlP-dep_Trfase
IPR023603Threonine_aldolase
Vocabulary: Biological Process
TermDefinition
GO:0006520cellular amino acid metabolic process
Vocabulary: Molecular Function
TermDefinition
GO:0016829lyase activity
GO:0003824catalytic activity
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006544 glycine metabolic process
biological_process GO:0006563 L-serine metabolic process
biological_process GO:0006566 threonine metabolic process
biological_process GO:0006520 cellular amino acid metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0004793 threonine aldolase activity
molecular_function GO:0003824 catalytic activity
molecular_function GO:0016829 lyase activity
molecular_function GO:0030170 pyridoxal phosphate binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
ClCG06G009580.1ClCG06G009580.1mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001597Aromatic amino acid beta-eliminating lyase/threonine aldolasePFAMPF01212Beta_elim_lyasecoord: 7..293
score: 1.3
IPR015421Pyridoxal phosphate-dependent transferase, major region, subdomain 1GENE3DG3DSA:3.40.640.10coord: 4..255
score: 9.4
IPR015422Pyridoxal phosphate-dependent transferase, major region, subdomain 2GENE3DG3DSA:3.90.1150.10coord: 256..349
score: 4.5
IPR015424Pyridoxal phosphate-dependent transferaseunknownSSF53383PLP-dependent transferasescoord: 6..351
score: 1.88E
IPR023603Threonine aldolasePIRPIRSF017617Thr_aldolasecoord: 1..356
score: 2.8E
NoneNo IPR availableunknownCoilCoilcoord: 249..269
scor
NoneNo IPR availablePANTHERPTHR10289THREONINE ALDOLASEcoord: 6..353
score: 4.4E
NoneNo IPR availablePANTHERPTHR10289:SF5LOW SPECIFICITY L-THREONINE ALDOLASEcoord: 6..353
score: 4.4E

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
ClCG06G009580Cucumber (Chinese Long) v3cucwcgB091
ClCG06G009580Wild cucumber (PI 183967)cpiwcgB081
ClCG06G009580Cucumber (Chinese Long) v2cuwcgB076
ClCG06G009580Melon (DHL92) v3.5.1mewcgB259
ClCG06G009580Watermelon (97103) v1wcgwmB344
ClCG06G009580Bottle gourd (USVL1VR-Ls)lsiwcgB030
ClCG06G009580Cucumber (Gy14) v2cgybwcgB081
ClCG06G009580Melon (DHL92) v3.6.1medwcgB250