ClCG06G009580 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG06G009580
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Beta_elim_lyase domain-containing protein
Location: CG_Chr06: 16522596 .. 16544559 (-)
RNA-Seq Expression: ClCG06G009580
Synteny: ClCG06G009580
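
The location field above packs the chromosome, start, end, and strand into one string. The following is a minimal sketch of how it could be parsed, assuming the coordinates are 1-based and inclusive (a common convention for genome annotation, but not stated on this page). The names `Location` and `parse_location` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Inclusive coordinates: both end points count toward the span.
        return self.end - self.start + 1


# Matches strings such as "CG_Chr06: 16522596 .. 16544559 (-)".
_LOCATION_RE = re.compile(r"^\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> Location:
    """Parse a 'seqid: start .. end (strand)' string into a Location."""
    match = _LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("CG_Chr06: 16522596 .. 16544559 (-)")
print(loc.seqid, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, the feature spans 21,964 bp on the minus strand of CG_Chr06.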
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATAATGACTCTTCTCTTGTCTTATCTTCCTATAAATTTCCTTTATGTTTATGCAACTCAATCCAATCCAATGGTGGGTTCATCGATATCTAAGGTGAGTGAAATTTAATTTCATGTTTTTGTTTTGAGGAATTTTTGTTGTATTTTTCTTTTCTATTGAAAGGAACGGAAAGTTTCATTTATTTATGTGTTTATGATCTCTACTTGTTTATATGTTTGCAGTGTGTTTGAAGTACTTATAGAAGCTATCAACTGATTTTGTTTTTTGGGGTTTATCCTCCATTTCAAGAGTAAGGAATCGCTATTTTTTCTTGTAATTTTCAATTTAGATATTGATAATTAATTCTTATTTCCATTTATGAAGAAATTGTTTTATTTTTTCATTTTTAGATTCAAAAGGCAAGAGATTTGAACGGACTCTTTAAAAAAAAAAATTAGTATGGACAATTAGTTACACTCTTTTTGAGGGGTTCAATATTGAGTAAAGATTCTTGTTTTTGTTTTTGTATAGGTAAGAAATCATAAACGGCGTGAATTATATTTTAGTTTATTCTTTATCATTTTCTATTTTTATTATATATTTTCAGAAAAAAAAAAAGAATATATTTGGTAATTATTTTCTTTCCATACACACCTTTATGAAGATCTTAACTAGCAAACAAAAATTATTGGTTTCTGTAATTATGTTCCTCTTAGATCCTTATTTTATACAATCATTTTGTGACCTCCGGTACAGTACAAGATCAGGTATACTCCTCATACAAAAAGGCAACATGTTTGATATATAAATGACCGATCTGTGTATAGGAATCAACAAAAATGGTGAGCAGAAAGGTGGATTTGCGGTCGGACACAGTGACGAAACCAACCGAATCGATGCGAGCTGCGATGGCGATTGCTGAGGTGGATGATGATGTGTTAGGGCATGACCCCATAGGGTTAGAGTTGGAAGAAGAGATGGCAAAGATAATGGGGAAAGAAGCAGGGTTATTTGTTCCCTCAGGCACAATGGGAAATCTCATAAGTGTTCTTGTACACTGTGACATTAGAGGGAGTGAAGTGATTGTTGGGGACAATTCCCATATTCACATTTTGGAAAATGGAGGCATTGCAACCATTGGAGGAGTTCATCCAAGGACAGTCAAAAACAATGCTGATGGAACAATGGATATTCATTTGATTCAAGCTGCCATTAGAAACCCAAAGGCCCAACTCTTCTTCCCAACAACAAGGCTCATTTGTTTAGAAAATACACATGCAAAGTAAGTATATTTATAATTAATTTCCCAATCATTTCCCTCTTTTGCTTCCTAATTAACCATTTATTTTATTTTATTTTATTTTATTTTATTTTTTTAAATTCTATCATGTCCTCTCAATTAAAAGAATATTCTTAACCAAATTTCAACTCCTAATTGATCTGCTTTCTTTTAATATTTTAAAACTATTTTTTTTTTTTAATTTTCAAAATTTAACTTTGTTTTTAGTTTCTAAAAAATCGATGTATTTGATATCTAATTGAATTTCCTATTTTTTCAATTTTTAAAACATTTTAAAACATGTTACTAGATTTCAAAAAATGAAAGAAACAAAAATTATTTTCAAAAACAGGTTTTCGTTTTCTAAAATGCGTGTGAATTCTATATATGATAAAACGATGACTAATGACGTATTTGAAACAAAAAATAAACACTTAAAAAAGTTAATCAAACAAGTTTAAATTCAATAAAAACTCCATATGTGGTTTTCCAGTTTGTAATTATTTTACAAATATTTCTTTTTTCTTTTTTTTTTTCTTAAAAGAGAATATTTGTCTTGGAATATTTCTTTTCCATTTTATAATCTCGACTTCCAACATGTTGAAAAAGTGTTCTTTTTTTATTTTCTTTTATATATATACACTAAAGCTCCAAATAAATTCTATTTTTTCTTTTATTTATTTTCCATTTTCTAATAACAATTTCAAATTTTTACATGAAATTTAATGGTGATAATATC
GATTTTTTCCCACTTTAGTTCCACATCAATTATTGTCTTTTTTTTTTAACAAGAATATCTTTAAGTTTTTTTAAACAAAATTCAAAATGAGGGGAACAAGATAAATCGAATTAATTGTTTTGATTGTTTACTTATTTACTTATAACCTGCTCCTAAAAAATATCTGCTTTTATTTTATTTTGGAGATTTTTATATTCTTATCATTTTTCTTTATTTGATTCGTGATTAATCAAATACATTTAAAATATTTAGATTATTTTATTAAAAATTATTATAAATAGAAAAAAATATCAAACTATTAACAAATATATTAAAATTTTACTTTCTATCACTGATAGACCGCGATAGACCTATATGGTAAAGTGATGGCTGTCTATCTGGATCTATCACAATCTATCACATATATAAAGTAAAATTTTGCTATATTTGTAATTATTTTCAACAATTTTTCTTTTTTAAAAAATATCCCTTATTTTATTAACATGATATGTTTAAAAAAGTGAAAAATAAAGATATAATTTTACAATTATTATAGGACTCTTTTCAAATATAGCAAAAGGAACCAAAACATTTACAAATATAGCAAATTTCACTATTCATCTGTTATAGACCGCTATAGATCGCAAATATACTTTGTATTTTGCTATTAATTTGTAAATATTTTCAACAGTTTTGTCATTTAAGATAATTTTCTATTATTTATATTAGATTGAAAAACATTGTAAATATAGTTTGATTTTATTTTTGTATAAAAAATATAAAACTAAAAAAAAGTAGTCAAAACTTATTTTAGTTCCCTATAAACAAAAATGTGTTTTTAGTTTTTGTTCTTAAAAAAAAAACAAAAAATAATTATAAAAATAAGGTATTTGTTTTTTCAAAATTCAAAACCAAAAACAAAATTTAAAAAATTTAAAGGCATGAGTGTCTATAGACATTATCTATTTTCGAAAACTAAAAAAAAAAAAAAAATGTTACTAATTGAGTCCTAAGCTTTTTACTTTTGATTGCGTTTTATCTAATTTTGTGTCCGAATAATTCCTTGTCGTCTTTGTAGTTTAATGCGTAAGTAGAATATATGCTGATTGTTTATTTTAAGATAACATGACATGTTGACTTTAGACAAATAAAATATTAATTAAAGTTAACAATTTAGTTCTTAAACTCTAATTTATAACAAAGTTCGTTAACTTTCAATTTTGTAATAATCTAATATTTAAATTTTAGTATGTAACAATTTTGGTCCATGTATTTTAAAATTTGTAACATTTAGTTTCTATTATGAAAATTAGTGTTAAGATTTATTACATAAGTAGATTAATTAATTAATTAATTAGGGACCTAATACATTTATATGCTACAAAGTCTACCTGTAATGCTAAATTATATATATATATATATACATTGACATTTTCTGGTAAAAATTAAATTGTTATAAATTTGAAACTACAAGAACTAAATTTCTACATAATTGAAAGTAAAGTGACTAAATCATTCTAAATAAAAGTTAAAAGACTAAATTATTATAAATTTAACTTTAACGACTAAATTGTTTTTTAACTTACAATAATCTTGTAATGCAATCACTAAACTAATTACAACATTAACAATGAAAGATGTAGTGGATAGAAGAGACTAAATTTATAATTTAATTAAAACTATATTTTAGGCTTTTTAGTTACTGCTTCATGTTTAAAACTAGTTAACAAGACCTTGAAATCGACATGTTTTTTCCTTATCCTTTTTTTTTTTTATCTTTTTTTTTCTTTTTTTTTGAATTACTTAAAATTAAGTCAAATTGTTATATACACGAGTAAAAAATTATGAATGAAATGCAGCTCTGGTGGAAAATGTCTTTCAGTAGAATATACTGACGAAGTTGGAGAATTAGCTAAGAAGCATCACCTCAAACTTCACATTGACGGAGCTCGTATTTTCAATGCTTCAATTGTAAGTATAATCTATAAATAATATCTTTTCCTTGTTCAATCATATATATA
TATATATAGTTTTAGTTTTAGAAGTTGTTTGTTTCACAGGCATTTTAAATTTTCATGGAATTAACATTGCCTTTTGCTTAATTTTTCGTGGACTAGATTTGTTTTGTCGTATTGTAAAGATGTCATGATATCTTAAGATTCTGAACATTCTAGGATGCTTTATTGGGATTTGGGATCTAGAGAGTTTTGGATGCCTTCTTGAAACATTAATTTTGCAGATTCCCAGGTTGAGGGCATCCATATTTTATTGTTTTGCTCTTTCTTTTCTTTTTTCTTTTATTTATATATATAATTAAAATAATTAAAATAAATGTCACTCTTCTTTATTTGCATTATTATAATTAAAATAAATATTATCCGTCATATTTATCACTTTTCTCGGATATATAGTAGTGTTTTTTCTTTTAGCAATATATTCGCATTCAAATTTAAATTTATTTTATGAAAAAAAATCTTAAATTATTGTATAATAATTTTGTTAAAAAAATAGCATATAAAAATAATATAAATTGTATTGATGAAAAAAATTCAAGTTGATGAAAACAAATATGTTATATAAGTTTAAATCATATTAAATTAATTATAAATTAATAAATAATCGGAGCATCCTTGGAACATTAGGTAAATTAGAAATCCTAGTAAACAAACATAGTTATCAAAGCATTTTCCAGATATCTACAAAAATATTGATATGAAATAAATAAGGTTATCTAAAGATGTTACACCTTTGTAGTTCTAGGCATCTATAATTCTAAAGATCTAAAATTTCTGGTTATCTATATTTCCGCGGGTTAAGTAATATTTTACCAACAACATTTTCGTATACAATTAAAAAATATTATATTTTCAGGCACTTGGTGTTCCAGTGGATCGATTGGTACAAGCGGCTGACTCAGTATCTGTATGTCTTTATTTCTTTCTGCTAATAATTTGTTTTTATTAAATATGATTTGGAATTTATGGAGGAAATTCAACCTTTAACAAAATACAACATTATATCAAGGTTTGATATTTGGCTCATCAGGTATGTCTATCAAAAGGTTTGGGCGCACCTGTTGGATCAGTTATTGTGGGTTCCAAAGACTTTATTGCCAAGGTATAGTGGTTGGATGTTTTTTGGGACATATTAGTATTATGTTTGAGAATGTGTTTGAATTGGTTTCTTTAAATAAAAACTTCAATAACCCATTTTTGCAATTTCAAACTCTCTTGAGACATAGTATGGCAACCTAGGTTATTATATATTTGGAACTTCCTTCCATTGACAAATCTAATCCAATTTAAAAGGGGGTGAAATATGTTTAGAAATTAGTTAGAATGGGTTTGGATGAATTTTGAAAGCAATGCTTTTAAATGAAGAACTTTTTCCTATAAGTCTTTATGGAAAGGGACAAAAGAAGTTAGCACTTGCTATTAAGTGTTTTTTAAAAGCATTTCTTTTACCATGTTACCAAACATCATGCAAACAGAATATGTGATTTATCTTGCTAAACACATTTTGTTTTACATTTGTGTTAAAGTAGTCACCAAATTTCCAAATATTAGAATGATGGATCAAGTCCCAACAATTAGAAAATGCTCGATTTTGATTACTCAATTAACAGAAATTGTTAACGATTAACGAATGTATTGACATGGAATGAGAAAAAGAAAGAAAATCTCTTTGTCCCCTTGTTTCTCAAGACTTGTTCTTTTTCATCTAAAACTTGTATTGATTCTTCTTCATGAGACAAATCTTCCTTCAACTATATCGGTTGCACTTCAAGCGCATGAGTAGGATAGGGAATATACTTCGTTAACACCGATACATGGAAAACATTATGGATCTTGGACATCTCCATAGGTAAAGCTAACTTATAAGTTGCTGGTCCAACTCTTTCCAAGATCTCATAAGGTCCAATATATCTCGGACTTAGCTTCCCTTTTTTCCTGAACCATAACACTCCTTTCCCGGGAGACAACTTAAGGAATACTCAGTGTCCTACATCAAGCTCAAGATC
TTTTCTTCTATTATCTACATAACCCTTCTATCTATCTCGAGCTATTTTCAACTTTTCCTTAATTATCTCAACTTTCTCTGTCATAGCTTGGACTATTTCTAGTCCGATTAACTTCCTTTGTCCAACTTCACCTCGACATACATGTATCCTGCACGGTCTACCATACAAAGCTTCATATGGGGTCATTCCAATACTCGCCTAATAACTGTTATTATACACAAACTTCATCAGCAACAGACGTGCATCCCAACTACCTTTAAACTGCAGTGCACACGCTCGAAGCATATCCTCGAAAGTCTGGATAGTCCTCTCTAATTGTCCATCAGTTTGAGGATGAAATGTTGTACTAAACTAAAACTTAGTATCCAGAGCTTGCTATAAACTAGTCTAGACCTTAGAAGTAAACCTCGGGTCACAATCGACATTGGTAACCCATATTGGTTCACTATTTTATCCATATACATCTGTGCCAAATTATCCAAGGTATAGGTTGCCTTTACCGGTAAGAAACTTGTCGTCTTGGTAAGTCTATCTACTATCACTCGTATACCATCAACACCACTCGAAGTCCTTGGCAACCCAAATAGAAAATCCATCATCACATGCTTTCAATTCCTCTCAAGAATAGGGAGTGGATTCAGTAATCCTGCTGGTCTTTGACGCTCGGGTTTAACTTGCGAGCACATCAAACATTTGACTACATATTCAGCTATTTCTCGTTCCATACCTAGCCACCAATAATAATTCCGTAAAGTATGATACATCTTGGTACTACCAAGATGCATCACATAAGCTGAATTGTGAGCTTCTTCTAAGATAGCTTGTTTAACTTCTAGATCCTTTGGCACACAAACTCTGTCGTGATTCAACAATGCTCTATTAGCTCTCAACTCATAATCTGGCCTCTTTTGAGCCTTTACCTCTTCTATTAATTTCCTGATTACAGGATTGTTTGATTACTTCTTAGTTACCCCATCTATTAGCTTAGGTCTTAAATGAAAATGCGCTAACAAACCTCCATTCTCATTTACTGATAAGACAGTAGCACCACTTCTGAACTCCTTAAGTAGAGTAGTCCTTATAGCATTAAGAGAACTCTAACTACTTTTTGACTTCCAACTCAAAGCATCAGCTACTAAATTCACTTTACCTAGGTAGTACTCAATCGTGCAATCATAATCTTTAATCAACCCCAACCACCCTCTTTGTCTCATGTTCATCTCCTTTTGATCAGAAATATATCTTAGACTCTTATGATTCTTGTAGATATGACATATCTTTCCAAATAAATAATGCCGTCATATCTTTAGAGCTAACATTAGTGCAACTAATTCTAAATCATGAGTAGGATAGTTAATCTTGTGAGGTCTCAACTGTCTAAAAGCATAGCCTATCACATTCCCACCTTGCATAAGAACACAACCTAATCCCTCACGAGAAGCATCACAATATACCTCAAACTCTTTTCCTGACTTTAGAAGTGACAACATAGGTGTCGTCACTAATCTTCTCTCTTTAGCTCCTGGAAACTTTGTGCGCACTTCTCATCCCACTCAAACTTAACACTTTTCTTCGTCAAACTCTTGAGTGACAAAGGTATTCTCAAGAACCCCTCAACAAACCATCTGTACTATCCAACTAACCCTAGGAAACTACAGGCTTCTATCACCATTGTGGGTCGTTCCCACTTCATTATAGCTTCAGTCTTCTGTGGATCAACATTTACTTGTACTACTGAAACCACATGTCCTAGGAACACCACCTATTCCAACTAGAACTCACACTTGTTGAACTTGGCATACAATTTCCGCTCTCTCAAAGTCTGCAACACCATCTTGAGATGTTCAACATGCTTTTCTTTATCACCAGAATACACCAGAATATCATCAATGAATACTATGACAAACTGATCAAGGTAAGGATGAAATATCCTATTCATAAGGTCCATAAACGCTGCCGGTTCGCTTGTCAGTTCGAACGACATCACTAAGAACTCGTAA
TGCCCACATCTAATCCTTTCTAAAAGCTGTCTTAGGAACATCTGACGCCCTAACCTTCAACGAGAACACTGGAGCTCCTCTAAGCTTAATAAACAAATCATATATCTGAGGCAACAAATACTTAGTACGTATTGTTACCTTATTCAATTGATAGTAATCAATACATAACCTTAGAGTATTATCTTTCTTCTTGACAAATAACACCGGTGCACCCCACAAGGAAACATTGGGTCTAATATAAACCTTGTCCACCAATTCCTGTAACTGTACCTTCAGTTCCTTTAACTCTGTTGGTGCCATTTGATAAGGTGCTTGAGAGATAGAAGTTGTACTTGGAAGCAGGTCTATAGTAAACTCTATCTCTTGATCAGGTGGTAATCCTGACAACTCTTCTGGAAATACATCTAAATAGTCACATATCACTGGCACGTCTTAAGGCTTCAACTTACCTGATTTAACTTTAGTCACATAGGTTAGGTACACTTCACGGCCTTTACTCATCATTTTCCTAGCCTTTACTGCAGACATCAAGCAAGTAGGAAGAATTTTCCTCATTCCTTTGAACACAACTACTACTTCTCCTAAATTTTCAAACTTCACTTCCTTTTTAAAACAATCAACATTAGCATGATACTTTGACAGAAAGTTCATGCCTAAAATAACATCAAACTCTGCTAACTCTAAAGGCAACAAATCTATCGACATAACTACACTATCAACCACTATCTCACAATTTCGATAAACATGCTCAATAACTATAGCATTAACTGCAGGTGTATGTACTAATAGTGTGTCAATCAATGGCTCTAGCCTCCTATTCATATGCATAACAGAGGTACTAGAAACAAATGAATGCGTAGTTCCTGTATCAATCAACACATAAGCATTCATATTACAAATAAATAATATACCTGTCATAATATCTGATGTCTCCTGAGCTTCTTATTGAGTCATAACATACACCTTTCCTTATTGCCGTGGTCTGCGCACCTATCCTTTCTGTCTTACACTTCCACCGCCTTCACCAACCACTATCTCGGGTCTTGGTTGATTAACAGTTTGGGAGATTGCACGTTGCTCAGCTTCATTTCCTGTCTTCAGCTTGGGGCAATCTCTCTTGTAATGCCCTGTCTGACCACAATTATAACAGACGTTGGTTCCACTCAAACACTAACCCTAGTGGTATCTTCCACAAGTACCACACATGGGCTTCCTAACTATACTAGCAATCGACTCACTGGGTTGCCTCGACTGGGAACCACTGGCTAATTCAGACATCTAAGTGAACCTCCTTTGGATAACGGTTTAAAGCTCCCCTTTCCAGACACACCAGGAACAAACTATCTACCTTCACCTTGACCAAATAGCTCTGGAAACATACCTGACGGTTGCCCTGATCTTCCTCCTCTATGAACCTCTTTCTCAGCTTTCTCTTTAGCTATACTTTTTTCAACTCTCATGGCAGCTTCTACCAACTTTGCAAAATCAGACCATTCTACACTCGCAATCACTGGAGTCCGAATCTCCTTCTGCAGGCCTTCTTCAAATCGTTTGCATCTATCGGTCTCATCAGCTATGACTACCATAGCATACTTTGCTAACTCTGTATACTTCTTCTCGTATTCAGCAACAGACATAGATCTTTAAACCAATCTCAGAAACTCATTTCTCTTAGCGTCACAGAAGGATCGAGGATAATACTTATCTTGGAAAACCTTCTTAAAATCACTCCTCGTCAATGCCTTAGCATCTCTTTTCCTGCTCTTTATCAATTTCCACCAATGCTTGACTCCCTTTTGGAGCAAGAACGTAGCTAACTTCAACTTTCTGTCTTCAGGACAACACATAACCCTAAAACACTTCTCTACCTGATTTAACCATACCTTTACATCAGCAGGATCTATGGTACCCTCGAACACTGCGACACCTAATGCTTTAAGTCTCTTGATCCCATACTTTTTCTCATGATCAGCCTGAACTGTCTCAACACTAGTAGCTAA
CCTCTAAGCTACCTTAGTGAATATCTATTCCTTCATGTTCACACATGCCTGAGGGTTACTAGAATCTCCCACAAAAGCCTCTCTAGAACCTCCAGTAGCTTACAAAGTTTCGGCTTCTGCTTGCCAACCTCGCCTGTTCCGTCGGGGCATGTTTTCTATAATACATCATAACATTAATATATTATAGACTCAACATGCTTCTAGCTACGATGCATGACCTAACTTCCCAAAATCTTATGCTCTAATACCAACTTGTCACACCCTCTCCCAAGTACTCTTTTAACCTAGCAAAGAATGTGAGGATAGCAAGTATCGACCCTTTTACGACACAAACTGTCAAATCTATACTCTTTTTTATACTCACAACATACAACACCTGATAAACAGATAACAACTGATAACATAAAGCATTCTCATAACCATAAGTAATGCACAAGACTAGTTTGATAACATTCTCCAACACATCCTTGTACAACACAGTTACATAACTTAAATCTCCAACCCTTATGTATAATGCATAATATGTCTTAAGTACAAGTACGTCCAAAGTAATCTAAAAACTAAAGTGATGTAGTAAAATCAAATGAGGGTGGGTAGGCTAAGCACCGCGAACTTTCTGCTACCTAGAAGAAAGAGATTTTAAAGAAAAACATGAGTTGGGTTGCCCAGTGAGTAACATAATACTAAAAATATAATCTTAACATAAAACTCATGCTCATGCTTTAAAGAAATCAAGCTCATAAATCACAGAAGAAAGTCTTTAACTAAAACTCATAAATCATGATCTTTAAATCAAATAACCTTAGTTGAGGTGGAGAAAGTCAAAACTCTCAATGTCGAAAACATCGTTAGCTGATGTGGAGTATTCTCAACACAACAACATCAACTCTCTCTATGTGCACATAGGAATAACTAAATCTTGAGCTGCCCAAGTATTTTCTTAATGTCCTTCAGTATTGGGTCCCAAGGATACTCAAATGTCTTCTCAATGCCCTTCAGCATTAAGTTCCTCTCATAATAATCTTAAGGTTGCCTATATGTCATCTCAATGTCCTTCAGCATTAAGTCCCTTGAAAACTTTTATGAAAACAAGCATGCTTTGGAAAGTAAGCATTTCAATATACAACTGTTCATGGAAAGCTTTAGAATTCTCAAAATTAGCAATAAGTAAACATGATTTTTCAACCTTTAAGTAAACAAGTAAAACATGCTTCTCAAACTCATCAAAATCAAGTACGAATCATACTTTCCAAACTCAAACTTCACAAGGAAAACATTTAGGAAACAATGTTAATTAATAAAAATTGTCACTCACTACTTTGCTCATGCTCTAAATGGATTATATATCTCCCACAAGTCCTCTTGGCTTGAAAATAATTTAACTTACTTAGTTAGGCTTCTTTATAACTTCCTTAAATCATAACTCAAGCCATTTAGGCTTTGAAGATCAAGTCTTAACTCAAATTAACTCACCAACGCCCAAATGATTAACTCACCAACGCCCAAATGATTGACTAATAGCTTAAGTGTTGTAGGATGGCCCTAGGCATGCACCTCAGCCACATACACACAGAGCATGCCCATTGCCCGCACGCATAAGCACAACACACTGTCACACACACACAAACACCATGTTGGGCAACAACTCTCGCACAACCTACTGCCCACAGGCGCGCACACACGGTTGGCCTATGCGTTGAAGTGGCTCATGTGTCTAACGCCTGCCCTTCCTTGGCCACAAGCGCACTCAGTCACACACTGCACACCTCCTCTACCTAGCACGTCTGCCCGCACACCCACTAAGCATCCAACTTGCCCAACTAACCTCTAAACAACTTTTTCAGCCCCTAGGCCACCGACAGTCCACTGCACGTCAACTTCCTTCAACCTTTGGCCAGCTCACCTGATAGCCTCAACTGCATAACCTACATTTTAACCCACCAAAATCAACATCTGTAGTCAATTTTAATTCCGATTTCTTTCACAAACCTCAAGATTAA
TCTCTTAACTTGCTTACCTTTGCGTTTGAGCTTCTAATTCTTTTTCTATAGTTTGAAATCCATCAATCTTCCAATGTTTCTCTAGATTGCCTTAGAGCATGAACAACCTTCCATCTTCCACAAATTTCAAGCCTTTGCTGTTGCTATCTTTCTCCGACAGTCTTCACACATCAATTAGGCTCAATTTGGAGGATTTTGGTAGAATAACTTCTTAATTGGCACGAGTAGATTTTGCAAACCTTTCTCCTTCGTCCAAGCTTCTATTTATAGCCCTTAAGTTTTCAGCATGCAACACATGAATTTGTCACCTCCTACTTAAATTTTGCCACACTAAATATAAAATTGCACCTAATTAATCTTATTTATCAAATAAATATCATAATCATCCAAAAGTTAAGAAGACTTGGACACCTACCTCAATTTCATCACCTAAACGCATGGCTCACCAAGTCTTGGAGGTTTCTGGCTTGTCCTCACTCAACACCCCTAGGCTCCCTTGCTTGTCAACAAACACTTGGCCCTTGCCTCTCCATGCCTACTTGGCTTAGGTCATTCTCCATGCATGCTTTTCCCCAACACCTACCAACATATGCCTTTTCCACGCATGGTTCTTTCGTTCAATGCCTAACTCCTAATGGTTTGAGCTTCACTTGCCGCCTACCCTTAAGTACTTTCTTTTACCATTCGGCTAACACAAATTACTCACAAACCCGAGACTGACTTGATCGTCAAGTGTGGTGCGCTTGACACCATATCGATCCCTAATTTCTATCGTCTAGTAGGAACTTGGATCATTTTCCCTTGTGATACAAATTTGCAGTTGTTCCATTTTTTTACATCAACAATTTCTATTAGTTAAAAGGCCAAAATCGAACATTTTCCAAATTTTGGAGGCTTGATTATTTGATTTTGAAATCTATAACATAAATATCAACCCTCACACCTCAAAATACAACCCACCCAACCATTTTTCACTACTTCAAAAGTGATATTGATTCAAGTCACCTTACATTATTAAGTCTTTCCCACTTTTTTAAATGCCTTTTTATGCAAATTTGTTCTTGAAATATTGTGCTTAATAATAGGCCACAAGGGTTAGAAAAGCATTGGGTGGTGGAATGAGGCAAATTGGCATCCTTTGTGCAGCTGGACTTATTGCAATAAAAGAGAATGTTCAAAAACTTCAAGCCGATCATGAGAAAGCCAAGCAACTAGCTAGTAATTAAGATCACTCTTTAATTCTTACCAACATTTTCAATTATTTTCACCTCTTTTTTCATATATATATATATATATTTTTTTTTTTTGGATTAATATAGGTGGGCTATACCAAATCAAAGGATTAAAGGTAGATCCAAAATCAGTTGAGACAAACATTGTGAGTATCTTTAAACATGATTAGAATTTTTAGTAATTTATTATGGTGTGTTTGAGTGTATATTAGTTTGTTTTTTTTCTTTAAACCATGCAAAAGTGTTTTAAAATAGTTTAAAAAGTTTGTTTTTATGCTTAGCATAAGTTTTGCTATCATAATTGTATTTGGTAAAAATTAATTTGAACATAATTTAGGTAGATAAAAATATTAATTATCATTTCAAGGTTGATAATGTTTTAATCTCCACTTGTGCAATCATTGAACTAATATATCTTCCTCTCTAATATAGATATTCACACACTTTCTTGTTTCTTTCTATACCTATAACTATCTTGATACTCTCTCTTTACACATCTTCTCTTAAAAATTCCATCATCTAGGATATATAACTACTGTTAAAAATTGGCAAAAAAAAAGATGAGATACAATCATTTAGAAGAAAAAAAACTTTTAGTAAGAAGTAAAAATATGTACACTTTTAAAAGAAGACTATTTCCACTCTTTCAATATTCACTCCTAAGTTTGTCTACCTTTTTCTCTGTTTGCTTTCTATCTTTCTTTGCCTTATTTACAAAGGGAGACATAAATCCTATTTATAGACGAGAGATATCCTTGGAGTTCAAGC
AAAAGGATCTTTTTAGGTTCTTGGATTCTATCGTTGGAGCCTAAAGAAATGTTAGAATGTTTTTAAAGCATTTCTCTTGAAATACTTATCTACAATCTTCGAGATAAATCTAGATATATTTTATATCTACACTTACGTTATTCAAGTAGATACTTTAAGAAGCTTCTCCAGAATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTATTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAATCATTCTCTAGAATAGTTAATGTTGCCCTCTACTGTAATACTCCCCCTAAATGATAACTATTCTAGCTCTTTCCATTTTTTAGAGTTGTTTCCAAAATTGTACATTAGGGGTGTACATGGTTTGGGTTGGCAACCCAAAAGACTGAAATAAAGTCTCCAACCCAACCCAACAAAATATGTAAATTTATCTACAACACATGACTTATAACTAAAATCTCATAAAGTTCAAATGTTATATAATAAATATCTATGATATTTATTACAAACTTTAAACAAAGACAAATATCATAACAATTTTAAAAGAAAAATATTAAAAATTTACAAATTAATCCATACAAAGTAATCATAAACTTTAAACGATATATATATATATATATAAAGAATTTGGGTTGGGTTTGGTCAACCCAATTTTTTTTAGCCAACCTGCGACCCAACCCAACCCAAGAACAAAGCTAACCCAACCCAACCTTTACATTTTGAGTTGGGTAGCCCGAGTTGTTTGGGTTGTCGAGTCTTTTGAACACCCCTACTATACATGCTTGGGTTCTTTTAGCCTTCTGGCATTAAGTCCCAACTTCGATTTCTTCTTAATGCATCAACTTCTGTGGCTATGATTATGTTCTTAGCCTCTGTATGTTCTTCTTCATCTTCGTCGTTCTCTTCTTCTCCTTCAATTGGAGTCTATTTCATTTGACTCATCTCTTCTTTTGATCCTTCGTTGAAAAAACCTCTCATCCTTAGGTGTCTGTTATCATTCGAGTGCCTGCCATGATGATACGTCATCAAATATAACATTTCTTAATGATAACACTGTTCACTGACATAATCAATATACTCTATCCTTAGGTGTCCTCACTAATATGACACATAGAGTCCATTCCTTGTCCTTCCATTAGCAATGTATCACTAATTTTGAGATCTTAGTACACCTTAACATTGTTAGGTTCAAACACAATGAAGTTGTCTTCTAATGTTAATTGAGATATTGATATCAAATTCTTCTTCATTCTAGGCATAGATAAAATTTGTTCTCTAACTCCACTTGAATGGAAATGGAATGAGGCATTATCATAGTTTTACTAACTAGAGCTATTTGAAACTTCTACTTATTTGCAACTTCGACAACTTGACCTTCCCTATGCTCTAATGTGGTTAGTAGTTTCCTTTTATCACCATGTCCTGTGATTAGAGCAACTTGAATAAACAATCCAATCATTTTCATAATTTACCTGATCTTGATGAGAGGTAACTCTTTCTTCTCCACCACATTGGATGTCGACTAGGAGTTGGGTTTTGCTATTGCAAAACATGTCACTGGATCCCATTATTCTTCACTAATGCTCTCAACCTAGGAGGTGACAACGTTGCTTTCTATTGATTTTTTTCTT
TTGACAGCAATCTTTGGCATAATGGCCTACCTTTGCACAATTATAACATTCACCATATTTTTTCTATACTCACGTTTTCTATTATTTTTTATTAGTTTCCCTTACTTGAGAGCATTTTGGGGAATTTTTTTCATCTTCTTTTGATCCTACTTTCTTTTGATATTTTAGATTACTTTTCTTTGGCCACTAAAGAGGACTCCTTCATTATTACTCTGTATTGTGACATCCAATATTTGTTAGTCAATAATTCTTGACTAGTAAGCATATGTTCTAAGTCAATAAAAGAAGGTTGGACTGCCAAACCTTGAATAATAGCAATAAAGCTTCTATATTCAAGTTTAAGTTCATGAACACTAATTTTTCTCACTGTCGATTCTAAAATAGTAGTTGTAGGATCCAATTCAGAGATTTCATGACGTAGAGTTTTTACATTGGTGAAGTATTGATTGATAGTCAGATACCTTTGAGGAACTATTAGGAGCTTGTTCTCCGGAAACTAAAATCTCGCATCATTTTTCTTTGAGAAAAGTGAGGCAAACATGCCCATGCTATCTTTGGCATCTACACTATATTGATGTGCTCCAACATTTCTTCATCAATTATAATTCTAATTGCAAACATGACCTTACCTTCCTTGATATTCCATTTCTTAAAAGCAATAACATCTTCAGGTGGCTTGACCTTAATGCCTCCCGCAACACCCCACAAGTTTTGGCCTTGGAGATACGACTTCATGCATGTGAACCATGTTTTGTAGTTTTGAGTATTGAGCTTCTTGATTTCACCAACAATTTGAAGGTTTCCCATCATCTTGTAGTGATGTGAATGCACTCCACAAGTAGTTATGCCAAGATAACATGGTTCATAGTTCAGACTATAGACAATCTTCAGAAGCTCCACCACAGCAAGTTATGCTAGTTTGGCTATGATACCACTTGTTGAAGAAATTGGCACCAAATGGACAAGAGGCAGTCACTTGGAAAAAAGAAACTTTTATTAAGGAAGCACAAGTATGGACACTTTGCTTGGTTTACAGATGGAGACATAACTCCTATTTATAGGCATGAGATCATTGGAATACAAGCAAAATGATATTTCTAGGTTCTTAGATTCTAAAATTATTCTAGACATTTCTATGAATTATTCTTATCTAAAGAAATTCTCTAGAATTCATTGGAGCCTAAAGAGATGTTAGGATGTTTCTAAAACATTTTATTTTGATACTTATTTGCAATCTTCTAGATAAATCTAGAGATATTTAATCTATTTACATTTAAGTAATTTATGTTGATACTTAAGGAAACTTCTATAATAGATAGTGGTGTCTTCTACTTTAACATTTATTGAGTATCAAATTGAATGATGGTGAATGATTAAATTGAATAAAATGTTCTAATGTAAGAGATTTAAGAGAAACATTTTAGACCATGTAAACTAAAAAGGACGTCAACTCCGAACCTTAAGAAGTGTGACATACTTTTTTTAGACCATTTACAAAACATTAACTATTGTAACATATATTTGACCAATGTTAATAAAGAAGCAATTGGAAAACTTTTGTCTTAGAGTAGGGATAGAAATGATGTTTTAAACTCTAAGAGATTAGAAGATGAATTAACTTACCCATTTTGTCATAACTTTGAGTTTTTTATTGTGTATCTAATATGACATGAACAATTTGAAGTTTGTGTTTATTTTATTTCAAAATTTTAAAAAATGTTTAATAGATACTAGATTTTTTATCGTGTATCTAATATGACATGAACAATCAATTTTTTTTGTATCTATTAGCTAGCCAAAAAATAAAAAAAGTCTACAAATTTAATTGATTTATTAGATATAAATTTAAATTGTGTATAAACTTTCGTGTTGGTGTTGAACAAATATTTTGATTTAAAGAAAAAAATCTAAAAGACTAAATTTGTAATTTTTAGAAACAAAATAAAGCACCAAACTTCAAAGATCATTTTAATTGCAAACTAAACCAAAAAAAAAAAAAA
AAATTACATGTTTAATTGCAATTTGGCTCGACATATCATATGGTGGGATCAATCTAGAACATCATTTTCATTGAGTATTTATATATTGAAAACTTTTCATCTTTTGTGGATTTTATTTTACTGTTTATTAAAAAATCTAAAACTTTTATTTATTTTATTTATTTATTATATGATCAGATATTCTTTGAAATAGAAGAGGATTACGGAATCTCACTGGAAACACTATGTAAAAGCTTGGAAGAACGTGGCATTTTTATGATGCTAGAAAGCCAAATAAGGTCTATCCTACAATTCCTTAAAAGAAAAAGAAAAAAAAAAAAATCTAGTTTCCATTATATTCGTAGTGAGTTTGTGATAACTTTGTTAGCTTTTGGGTAAAATATTTAAAATGTTTAACAAAAATATAGATTTTTGTAGAATGCTTTTTGGTCAAGTGTTTCATTCGGAAACATTTAAAATAATTCTTGAAAGCATAAAAGTTAGAACAAAATGTCAAAGTTATTTACTATGCACACTTTTCATTAAAGTGTATCTATTCAATATTTATATGCTAAAATGTCTGTTTTTCTAAAGATATTTTATGCTTTGAAATTTTGAAAGAAGATTTTGTTTTACTTTTCTTGTTTCCTAATTACAATTTTTATTCAATATATATATATATAGAGAGATTTTTTATTTTATTTTATATATTTGCTAAAACGATCAAGAAAATTTCTTTTTGTTCTTAATTTAAAACTTGAGAGAACAAACAAACAAAAATTACCAAAAAAACATATATTTTCCTTTTGTTCTTTTTGAAACTAAGTGGAAATAAAAAGTTAAAATTTTTAGCTAAGAAAAGTCAGATGTTTATCCAAAATAAAAAAATAAAATTATTGGTCATCAACTGCGAGTCAATTTGACATAATTTTTTAAATAGTGAAACATTTTAATTAATTACAAAATTAAATTAAATAAAAATGCTTTTAAAAGTGGGTCCTATTACATCACATCTGTTCTGAACAATAGTAACTTTCTACTCGATCAGAAAGGTGATAGATTGGAATCGCCAACAAAAATATTTTAAAATAAAATAGGTTAATTTGGTTATTGTATTTATTTATATATTTAGCAAACAAGTTTATTAATTGCAGGGCTAGAATTGTTCTTCATCATCAGATTTCAACAAGTGACGTGCACTACACTTTATCTTGCTTTCAGGTGATCTATTTCTAATGACTGATTCTTGGTTTTCTTTTCTTTTCTTTTTTTCTAAAATAAAAAATATTATATTTAAAAATATAATAAATCTATTAATATTTAAATAAATCACAATTTTTTCTTGAATTGATTAATTTAGTAAAATAGAAAAATAATGTAAAATCATATATCTATTTTGTATTTTCTTCAGAATCTGGGAAAAAAAAAGAGAGTCTAAAATAACTAGACAATTCAAAACAATAATTACATTTTCTTTGTTATCAAGAAATTAACATTTGTTGAAGAAGCAACGTCCCTGTTTTTTAAAATAATTTTTAAGTAACATATATTTATTTTAATAGAAATAATTAAATTTCAAGCTTTTTTCAGTTACTTTTACTAGGTTTTGTTTCAAAAATGTCAAAGACATTCATCACCTTTCAATGATGCATTTAATAAATCTATGACATATTACATACATTTTATAATTAAAAAATCTAATTGAATTATATGATTACATTTAAGATATGTCGATTAGATCAATAACTTTATTAAAATAAAATTAAAAGTTCAATAATTTATTAGACATATAGTCGAAATTCATATAAGCTATAGAATTGAAATGTCAAAAATTTATTAAATTTTTGAAGTATATATAAGAACTAACAATAAGTTTTCACATATCCTTCCCCAATTCTTCTTTAAACTCACAACTGATATAAACCTTTTCACTTTGGCTTCTGCAGCAAACTCTGAGTGGAATTCAAGTTGTAAATGGCAATTAA
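
The gene sequence above contains long runs of N, which usually mark an unresolved gap in the assembly. A small sketch for locating such runs in a sequence string is given below. The function name `gap_runs` is a hypothetical helper, and the demo string is a toy example rather than data from this record.

```python
import re


def gap_runs(seq: str, min_len: int = 1) -> list[tuple[int, int]]:
    """Return (start, end) pairs, 0-based and half-open, for each run of N."""
    runs = []
    for match in re.finditer(r"[Nn]+", seq):
        if match.end() - match.start() >= min_len:
            runs.append((match.start(), match.end()))
    return runs


# Toy input, not taken from the record above.
demo = "ACGTNNNNACGTNNA"
print(gap_runs(demo))
print(gap_runs(demo, min_len=3))
```

To use it on the real record, join the sequence lines into a single string first, since a line break can fall inside a run.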

mRNA sequence

ATGATAATGACTCTTCTCTTGTCTTATCTTCCTATAAATTTCCTTTATGTTTATGCAACTCAATCCAATCCAATGGTGGGTTCATCGATATCTAAGGAATCAACAAAAATGGTGAGCAGAAAGGTGGATTTGCGGTCGGACACAGTGACGAAACCAACCGAATCGATGCGAGCTGCGATGGCGATTGCTGAGGTGGATGATGATGTGTTAGGGCATGACCCCATAGGGTTAGAGTTGGAAGAAGAGATGGCAAAGATAATGGGGAAAGAAGCAGGGTTATTTGTTCCCTCAGGCACAATGGGAAATCTCATAAGTGTTCTTGTACACTGTGACATTAGAGGGAGTGAAGTGATTGTTGGGGACAATTCCCATATTCACATTTTGGAAAATGGAGGCATTGCAACCATTGGAGGAGTTCATCCAAGGACAGTCAAAAACAATGCTGATGGAACAATGGATATTCATTTGATTCAAGCTGCCATTAGAAACCCAAAGGCCCAACTCTTCTTCCCAACAACAAGGCTCATTTGTTTAGAAAATACACATGCAAACTCTGGTGGAAAATGTCTTTCAGTAGAATATACTGACGAAGTTGGAGAATTAGCTAAGAAGCATCACCTCAAACTTCACATTGACGGAGCTCGTATTTTCAATGCTTCAATTGCACTTGGTGTTCCAGTGGATCGATTGGTACAAGCGGCTGACTCAGTATCTGTATGTCTATCAAAAGGTTTGGGCGCACCTGTTGGATCAGTTATTGTGGGTTCCAAAGACTTTATTGCCAAGGCCACAAGGGTTAGAAAAGCATTGGGTGGTGGAATGAGGCAAATTGGCATCCTTTGTGCAGCTGGACTTATTGCAATAAAAGAGAATGTTCAAAAACTTCAAGCCGATCATGAGAAAGCCAAGCAACTAGCTAGTGGGCTATACCAAATCAAAGGATTAAAGGTAGATCCAAAATCAGTTGAGACAAACATTATATTCTTTGAAATAGAAGAGGATTACGGAATCTCACTGGAAACACTATGTAAAAGCTTGGAAGAACGTGGCATTTTTATGATGCTAGAAAGCCAAATAAGGGCTAGAATTGTTCTTCATCATCAGATTTCAACAAGTGACGTGCACTACACTTTATCTTGCTTTCAGCAAACTCTGAGTGGAATTCAAGTTGTAAATGGCAATTAA

Coding sequence (CDS)

ATGATAATGACTCTTCTCTTGTCTTATCTTCCTATAAATTTCCTTTATGTTTATGCAACTCAATCCAATCCAATGGTGGGTTCATCGATATCTAAGGAATCAACAAAAATGGTGAGCAGAAAGGTGGATTTGCGGTCGGACACAGTGACGAAACCAACCGAATCGATGCGAGCTGCGATGGCGATTGCTGAGGTGGATGATGATGTGTTAGGGCATGACCCCATAGGGTTAGAGTTGGAAGAAGAGATGGCAAAGATAATGGGGAAAGAAGCAGGGTTATTTGTTCCCTCAGGCACAATGGGAAATCTCATAAGTGTTCTTGTACACTGTGACATTAGAGGGAGTGAAGTGATTGTTGGGGACAATTCCCATATTCACATTTTGGAAAATGGAGGCATTGCAACCATTGGAGGAGTTCATCCAAGGACAGTCAAAAACAATGCTGATGGAACAATGGATATTCATTTGATTCAAGCTGCCATTAGAAACCCAAAGGCCCAACTCTTCTTCCCAACAACAAGGCTCATTTGTTTAGAAAATACACATGCAAACTCTGGTGGAAAATGTCTTTCAGTAGAATATACTGACGAAGTTGGAGAATTAGCTAAGAAGCATCACCTCAAACTTCACATTGACGGAGCTCGTATTTTCAATGCTTCAATTGCACTTGGTGTTCCAGTGGATCGATTGGTACAAGCGGCTGACTCAGTATCTGTATGTCTATCAAAAGGTTTGGGCGCACCTGTTGGATCAGTTATTGTGGGTTCCAAAGACTTTATTGCCAAGGCCACAAGGGTTAGAAAAGCATTGGGTGGTGGAATGAGGCAAATTGGCATCCTTTGTGCAGCTGGACTTATTGCAATAAAAGAGAATGTTCAAAAACTTCAAGCCGATCATGAGAAAGCCAAGCAACTAGCTAGTGGGCTATACCAAATCAAAGGATTAAAGGTAGATCCAAAATCAGTTGAGACAAACATTATATTCTTTGAAATAGAAGAGGATTACGGAATCTCACTGGAAACACTATGTAAAAGCTTGGAAGAACGTGGCATTTTTATGATGCTAGAAAGCCAAATAAGGGCTAGAATTGTTCTTCATCATCAGATTTCAACAAGTGACGTGCACTACACTTTATCTTGCTTTCAGCAAACTCTGAGTGGAATTCAAGTTGTAAATGGCAATTAA

Protein sequence

MIMTLLLSYLPINFLYVYATQSNPMVGSSISKESTKMVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVPSGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHLIQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARIFNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQIGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYGISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQVVNGN
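
The protein sequence is the conceptual translation of the CDS. As a consistency check, the translation can be sketched with the standard genetic code (NCBI translation table 1), which is assumed here to be the table that applies to this nuclear gene. `translate` is a hypothetical helper written for illustration, and only the first and last codons of the CDS are used in the demo.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        # Codons with ambiguous bases (e.g. N) become 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last five codons of the CDS shown above.
print(translate("ATGATAATGACTCTTCTCTTG"))
print(translate("GTAAATGGCAATTAA"))
```

The two fragments translate to the start (MIMTLLL) and the end (VNGN) of the protein sequence listed above.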
Homology
BLAST of ClCG06G009580 vs. NCBI nr
Match: XP_008453666.1 (PREDICTED: probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis melo])

HSP 1 Score: 652.1 bits (1681), Expect = 3.0e-183
Identity = 328/370 (88.65%), Positives = 351/370 (94.86%), Query Frame = 0

Query: 25  MVGSSISKESTKMVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMA 84
           ++ SSIS+   KMVSRKVDLRSDTVTKPTESMRAAMA+AEVDDDVLG+DP  LELEE+MA
Sbjct: 31  LITSSISRFQCKMVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEKMA 90

Query: 85  KIMGKEAGLFVPSGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTV 144
           KIMGKE GLFVPSGTMGNLIS+LVHC+ RGSEVIVGDNSHIHILENGGIATIGGVHPRTV
Sbjct: 91  KIMGKEGGLFVPSGTMGNLISILVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTV 150

Query: 145 KNNADGTMDIHLIQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKK 204
           KN  DGTMDI LI+AAIRNPK QLFFPTTRLICLENTHANSGGKCLS+EYTDEVGELAKK
Sbjct: 151 KNKDDGTMDIDLIEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSMEYTDEVGELAKK 210

Query: 205 HHLKLHIDGARIFNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKAT 264
           H LKLHIDGARIFNASIAL +PVDRLV+AADSVSVCLSKGLGAPVGS+I+GSKDFI KA 
Sbjct: 211 HDLKLHIDGARIFNASIALAIPVDRLVRAADSVSVCLSKGLGAPVGSIILGSKDFITKAI 270

Query: 265 RVRKALGGGMRQIGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVET 324
           R+RK LGGGMRQIGILCAAGL+AIKENVQKL+ADHEKAKQLASGLYQIKGLKVDPKSVET
Sbjct: 271 RIRKTLGGGMRQIGILCAAGLVAIKENVQKLEADHEKAKQLASGLYQIKGLKVDPKSVET 330

Query: 325 NIIFFEIEEDYGISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQT 384
           NIIFFEIE+DYGIS+ETLCK+LEERGIFMMLESQ RARIVLHHQISTSDV YTLSCF+QT
Sbjct: 331 NIIFFEIEDDYGISMETLCKTLEERGIFMMLESQTRARIVLHHQISTSDVQYTLSCFKQT 390

Query: 385 LSGIQVVNGN 395
           L+GI+VVNGN
Sbjct: 391 LNGIKVVNGN 400
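
Each HSP summary line reports identities and positives as a count over the alignment length, followed by a percentage. A minimal sketch for extracting those figures and recomputing the percentages is shown below. `parse_hsp_stats` and `percent` are hypothetical helpers, and the pattern accepts both the spelling "Positives" and the variant "Postives".

```python
import re

# Captures e.g. "Identity = 328/370 (88.65%)".
_STAT_RE = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def parse_hsp_stats(line: str) -> dict[str, tuple[int, int, float]]:
    """Return {'identity': (matches, length, pct), 'positives': (...)}."""
    stats = {}
    for label, matches, length, pct in _STAT_RE.findall(line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = (int(matches), int(length), float(pct))
    return stats


def percent(matches: int, length: int) -> float:
    """Percentage of alignment columns, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


line = "Identity = 328/370 (88.65%), Positives = 351/370 (94.86%), Query Frame = 0"
stats = parse_hsp_stats(line)
for name, (matches, length, reported) in stats.items():
    print(name, matches, length, reported, percent(matches, length))
```

For the first hit, the recomputed values agree with the reported 88.65% identity and 94.86% positives over 370 aligned columns.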

BLAST of ClCG06G009580 vs. NCBI nr
Match: XP_038877077.1 (probable low-specificity L-threonine aldolase 1 [Benincasa hispida])

HSP 1 Score: 646.0 bits (1665), Expect = 2.1e-181
Identity = 325/358 (90.78%), Positives = 345/358 (96.37%), Query Frame = 0

Query: 37  MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 96
           MV RKVDLRSDTVTKPTE+M+AAMAIAEVDDDVLG+DPI L+LEEEMAKIMGKE GLFVP
Sbjct: 1   MVDRKVDLRSDTVTKPTETMQAAMAIAEVDDDVLGNDPIALKLEEEMAKIMGKEGGLFVP 60

Query: 97  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 156
           SGTMGNLISVLVHCDIRGSEVI+GDNSHIHILENGGIATIGGVH RTVKN  DGTMDI L
Sbjct: 61  SGTMGNLISVLVHCDIRGSEVILGDNSHIHILENGGIATIGGVHSRTVKNKDDGTMDIDL 120

Query: 157 IQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 216
           ++AAIRNPK +LFFPTTRLICLENTHAN+GGKCLSVEYTDEVGELAKKH+LKLHIDGARI
Sbjct: 121 VEAAIRNPKGELFFPTTRLICLENTHANTGGKCLSVEYTDEVGELAKKHNLKLHIDGARI 180

Query: 217 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQ 276
           FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKA RVRK LGGGMRQ
Sbjct: 181 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKAKRVRKTLGGGMRQ 240

Query: 277 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 336
           IGILCAAGL+A+KENV KL+ADH+KAKQLASGL+QIKGLK+DPKSVETNIIFFEIE+DYG
Sbjct: 241 IGILCAAGLVALKENVHKLEADHDKAKQLASGLHQIKGLKIDPKSVETNIIFFEIEDDYG 300

Query: 337 ISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQVVNGN 395
           IS+ETLCKSLEERGIFMMLESQIRARIVLHHQISTSDV YTLSCFQQTL+GI+V NGN
Sbjct: 301 ISMETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVQYTLSCFQQTLNGIKVENGN 358

BLAST of ClCG06G009580 vs. NCBI nr
Match: XP_011660269.1 (probable low-specificity L-threonine aldolase 1 isoform X2 [Cucumis sativus] >XP_031745916.1 probable low-specificity L-threonine aldolase 1 isoform X2 [Cucumis sativus] >KAE8637487.1 hypothetical protein CSA_016969 [Cucumis sativus] >KAE8653610.1 hypothetical protein Csa_006916 [Cucumis sativus])

HSP 1 Score: 636.7 bits (1641), Expect = 1.3e-178
Identity = 321/358 (89.66%), Positives = 340/358 (94.97%), Query Frame = 0

Query: 37  MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 96
           MVSRKVDLRSDTVTKPTESMRAAMA+AEVDDDVLG+DP  LELEEEMAKIMGKE GLFVP
Sbjct: 1   MVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEEMAKIMGKEEGLFVP 60

Query: 97  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 156
           SGTMGNLISVLVHC+ RGSEVIVGDNSHIHILENGGIATIGGVHPRTVKN  DGTMDI L
Sbjct: 61  SGTMGNLISVLVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDL 120

Query: 157 IQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 216
           I+AAIRNPK QLFFPTTRLICLENTHANSGGKCLSVEY DEVGELAKK+ LKLHIDGARI
Sbjct: 121 IEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSVEYIDEVGELAKKYDLKLHIDGARI 180

Query: 217 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQ 276
           FNASIALGVPVDRLVQAADS+ VCLSKGLGAPVGS+IVGSKDFIAKA RVRK LGGGMRQ
Sbjct: 181 FNASIALGVPVDRLVQAADSILVCLSKGLGAPVGSIIVGSKDFIAKARRVRKTLGGGMRQ 240

Query: 277 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 336
           IGILCAAGL+AIKENVQKL+ADH+KAKQLASGL+QIKGLK+DPKSVETNII FEIE+DYG
Sbjct: 241 IGILCAAGLVAIKENVQKLEADHKKAKQLASGLFQIKGLKIDPKSVETNIILFEIEDDYG 300

Query: 337 ISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQVVNGN 395
           IS+ETLCKSLEERGIF+ML++Q RARIV HHQISTSDV Y LSCFQQTL+GI+VVNGN
Sbjct: 301 ISMETLCKSLEERGIFVMLQTQTRARIVFHHQISTSDVQYILSCFQQTLNGIKVVNGN 358

BLAST of ClCG06G009580 vs. NCBI nr
Match: XP_008453667.1 (PREDICTED: probable low-specificity L-threonine aldolase 1 isoform X2 [Cucumis melo])

HSP 1 Score: 633.3 bits (1632), Expect = 1.4e-177
Identity = 321/370 (86.76%), Positives = 343/370 (92.70%), Query Frame = 0

Query: 25  MVGSSISKESTKMVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMA 84
           ++ SSIS+   KMVSRKVDLRSDTVTKPTESMRAAMA+AEVDDDVLG+DP  LELEE+MA
Sbjct: 31  LITSSISRFQCKMVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEKMA 90

Query: 85  KIMGKEAGLFVPSGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTV 144
           KIMGKE GLFVPSGTMGNLIS+LVHC+ RGSEVIVGDNSHIHILENGGIATIGGVHPRTV
Sbjct: 91  KIMGKEGGLFVPSGTMGNLISILVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTV 150

Query: 145 KNNADGTMDIHLIQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKK 204
           KN  DGTMDI LI+AAIRNPK QLFFPTTRLICLENTHANSGGKCLS+EYTDEVGELAKK
Sbjct: 151 KNKDDGTMDIDLIEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSMEYTDEVGELAKK 210

Query: 205 HHLKLHIDGARIFNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKAT 264
           H LKLHIDGARIFNASIAL +PVDRL        VCLSKGLGAPVGS+I+GSKDFI KA 
Sbjct: 211 HDLKLHIDGARIFNASIALAIPVDRL--------VCLSKGLGAPVGSIILGSKDFITKAI 270

Query: 265 RVRKALGGGMRQIGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVET 324
           R+RK LGGGMRQIGILCAAGL+AIKENVQKL+ADHEKAKQLASGLYQIKGLKVDPKSVET
Sbjct: 271 RIRKTLGGGMRQIGILCAAGLVAIKENVQKLEADHEKAKQLASGLYQIKGLKVDPKSVET 330

Query: 325 NIIFFEIEEDYGISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQT 384
           NIIFFEIE+DYGIS+ETLCK+LEERGIFMMLESQ RARIVLHHQISTSDV YTLSCF+QT
Sbjct: 331 NIIFFEIEDDYGISMETLCKTLEERGIFMMLESQTRARIVLHHQISTSDVQYTLSCFKQT 390

Query: 385 LSGIQVVNGN 395
           L+GI+VVNGN
Sbjct: 391 LNGIKVVNGN 392
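
The percentages in each HSP header follow directly from the counts it reports (identical or positive columns divided by alignment length). A minimal Python sketch of that arithmetic is shown below; the regular expression and the function name `parse_hsp_header` are illustrative assumptions rather than part of any BLAST tool, and the pattern is written only for the two-line header layout used in this record.

```python
import re

# Pattern for the two header lines of an HSP as printed in this record.
# "Posi?tives" accepts the label spelled "Positives" or "Postives".
HSP_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+) bits \((?P<raw>\d+)\), Expect = (?P<evalue>\S+)\s+"
    r"Identity = (?P<ident>\d+)/(?P<length>\d+) \([\d.]+%\), "
    r"Posi?tives = (?P<pos>\d+)/\d+ \([\d.]+%\)"
)

def parse_hsp_header(text):
    """Hypothetical helper: extract scores and recompute the percentages."""
    m = HSP_RE.search(text)
    if m is None:
        raise ValueError("no HSP header found")
    ident, length, pos = int(m["ident"]), int(m["length"]), int(m["pos"])
    return {
        "bits": float(m["bits"]),
        "raw_score": int(m["raw"]),
        "evalue": float(m["evalue"]),
        "length": length,
        # Percent identity = identical columns / alignment length.
        "identity_pct": round(100 * ident / length, 2),
        # Percent positives = (identical + conservative) columns / length.
        "positives_pct": round(100 * pos / length, 2),
    }

header = """HSP 1 Score: 633.3 bits (1632), Expect = 1.4e-177
Identity = 321/370 (86.76%), Postives = 343/370 (92.70%), Query Frame = 0"""

hsp = parse_hsp_header(header)
print(hsp["identity_pct"], hsp["positives_pct"])  # 86.76 92.7
```

The recomputed values agree with the percentages printed in the header for this hit (XP_008453667.1).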

BLAST of ClCG06G009580 vs. NCBI nr
Match: XP_031736719.1 (probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis sativus] >XP_031736720.1 probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis sativus] >XP_031736728.1 probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis sativus] >XP_031745913.1 probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis sativus] >XP_031745914.1 probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis sativus] >XP_031745915.1 probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis sativus])

HSP 1 Score: 630.2 bits (1624), Expect = 1.2e-176
Identity = 321/364 (88.19%), Positives = 340/364 (93.41%), Query Frame = 0

Query: 37  MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 96
           MVSRKVDLRSDTVTKPTESMRAAMA+AEVDDDVLG+DP  LELEEEMAKIMGKE GLFVP
Sbjct: 1   MVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEEMAKIMGKEEGLFVP 60

Query: 97  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 156
           SGTMGNLISVLVHC+ RGSEVIVGDNSHIHILENGGIATIGGVHPRTVKN  DGTMDI L
Sbjct: 61  SGTMGNLISVLVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDL 120

Query: 157 IQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 216
           I+AAIRNPK QLFFPTTRLICLENTHANSGGKCLSVEY DEVGELAKK+ LKLHIDGARI
Sbjct: 121 IEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSVEYIDEVGELAKKYDLKLHIDGARI 180

Query: 217 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQ 276
           FNASIALGVPVDRLVQAADS+ VCLSKGLGAPVGS+IVGSKDFIAKA RVRK LGGGMRQ
Sbjct: 181 FNASIALGVPVDRLVQAADSILVCLSKGLGAPVGSIIVGSKDFIAKARRVRKTLGGGMRQ 240

Query: 277 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNI------IFFE 336
           IGILCAAGL+AIKENVQKL+ADH+KAKQLASGL+QIKGLK+DPKSVETNI      I FE
Sbjct: 241 IGILCAAGLVAIKENVQKLEADHKKAKQLASGLFQIKGLKIDPKSVETNIFVIYYQILFE 300

Query: 337 IEEDYGISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQV 395
           IE+DYGIS+ETLCKSLEERGIF+ML++Q RARIV HHQISTSDV Y LSCFQQTL+GI+V
Sbjct: 301 IEDDYGISMETLCKSLEERGIFVMLQTQTRARIVFHHQISTSDVQYILSCFQQTLNGIKV 360

BLAST of ClCG06G009580 vs. ExPASy Swiss-Prot
Match: Q8RXU4 (Probable low-specificity L-threonine aldolase 1 OS=Arabidopsis thaliana OX=3702 GN=THA1 PE=1 SV=1)

HSP 1 Score: 510.4 bits (1313), Expect = 1.8e-143
Identity = 250/353 (70.82%), Positives = 301/353 (85.27%), Query Frame = 0

Query: 37  MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 96
           MV R VDLRSDTVT+PT++MR AM  AEVDDDVLG+DP    LEEEMAK+MGKEA LFVP
Sbjct: 1   MVMRSVDLRSDTVTRPTDAMREAMCNAEVDDDVLGYDPTARRLEEEMAKMMGKEAALFVP 60

Query: 97  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 156
           SGTMGNLISV+VHCD+RGSEVI+GDN HIH+ ENGGI+TIGGVHP+TVKN  DGTMD+  
Sbjct: 61  SGTMGNLISVMVHCDVRGSEVILGDNCHIHVYENGGISTIGGVHPKTVKNEEDGTMDLEA 120

Query: 157 IQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 216
           I+AAIR+PK   F+P+TRLICLENTHANSGG+CLSVEYT++VGE+AK+H +KLHIDGAR+
Sbjct: 121 IEAAIRDPKGSTFYPSTRLICLENTHANSGGRCLSVEYTEKVGEIAKRHGVKLHIDGARL 180

Query: 217 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQ 276
           FNASIALGVPV +LV+AADSV VCLSKGLGAPVGSVIVGS+ FI KA  VRK LGGGMRQ
Sbjct: 181 FNASIALGVPVHKLVKAADSVQVCLSKGLGAPVGSVIVGSQSFIEKAKTVRKTLGGGMRQ 240

Query: 277 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 336
           IG+LCAA L+A++EN+ KLQ DH+KAK LA GL Q+KG++V+  +VETN+IF ++E+   
Sbjct: 241 IGVLCAAALVALQENLPKLQHDHKKAKLLAEGLNQMKGIRVNVAAVETNMIFMDMEDGSR 300

Query: 337 ISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQ 390
           ++ E L K+LEE GI ++  +  R RIV+HHQI+TSDVHYTLSCFQQ +  +Q
Sbjct: 301 LTAEKLRKNLEENGILLIRGNSSRIRIVIHHQITTSDVHYTLSCFQQAMLTMQ 353

BLAST of ClCG06G009580 vs. ExPASy Swiss-Prot
Match: Q9FPH3 (Probable low-specificity L-threonine aldolase 2 OS=Arabidopsis thaliana OX=3702 GN=THA2 PE=1 SV=1)

HSP 1 Score: 481.9 bits (1239), Expect = 7.0e-135
Identity = 241/349 (69.05%), Positives = 287/349 (82.23%), Query Frame = 0

Query: 35  TKMVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLF 94
           T    R VDLRSDTVTKPTESMR+AMA AEVDDDVLG+DP  L LE+E+A+I GKEA +F
Sbjct: 3   TPTTIRTVDLRSDTVTKPTESMRSAMANAEVDDDVLGNDPTALRLEKEVAEIAGKEAAMF 62

Query: 95  VPSGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDI 154
           VPSGTMGNLISVLVHCD RGSEVI+GD+SHIHI ENGG++++GGVHPRTVKN  DGTM+I
Sbjct: 63  VPSGTMGNLISVLVHCDERGSEVILGDDSHIHIYENGGVSSLGGVHPRTVKNEEDGTMEI 122

Query: 155 HLIQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGA 214
             I+AA+R+PK  L  P T+LICLENT AN GG+CL +EY D+VGELAKKH LKLHIDGA
Sbjct: 123 GAIEAAVRSPKGDLHHPVTKLICLENTQANCGGRCLPIEYIDKVGELAKKHGLKLHIDGA 182

Query: 215 RIFNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGM 274
           RIFNAS+ALGVPV R+VQAADSVS+CLSKG+GAPVGSVIVGSK FI KA  +RK LGGGM
Sbjct: 183 RIFNASVALGVPVKRIVQAADSVSICLSKGIGAPVGSVIVGSKKFITKARWLRKTLGGGM 242

Query: 275 RQIGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEED 334
           RQIG+LCAA L+A+ ENV KL+ DH+KA+ LA GL +I+ L+V+  +VETNII+ +I ED
Sbjct: 243 RQIGVLCAAALVALHENVAKLEDDHKKARVLAEGLNRIERLRVNVAAVETNIIYVDIPED 302

Query: 335 YGISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQ 384
                E  CKSLE+ G+ ++ ++  R RIVLHHQIS  DV Y LSCF++
Sbjct: 303 PKFGAEEACKSLEDVGVLVIPQATFRIRIVLHHQISDVDVEYVLSCFEK 351

BLAST of ClCG06G009580 vs. ExPASy Swiss-Prot
Match: O07051 (L-allo-threonine aldolase OS=Aeromonas jandaei OX=650 GN=ltaA PE=1 SV=1)

HSP 1 Score: 282.0 bits (720), Expect = 1.1e-74
Identity = 161/346 (46.53%), Positives = 219/346 (63.29%), Query Frame = 0

Query: 40  RKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVPSGT 99
           R +DLRSDTVT+PT++MR  M  AEV DDV G DP    LE   A ++GKEA LFVPSGT
Sbjct: 2   RYIDLRSDTVTQPTDAMRQCMLHAEVGDDVYGEDPGVNALEAYGADLLGKEAALFVPSGT 61

Query: 100 MGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHLIQA 159
           M NL++V+ HC  RG   ++G  +HI+  E  G A +G V  + V   ADG++ +  ++A
Sbjct: 62  MSNLLAVMSHCQ-RGEGAVLGSAAHIYRYEAQGSAVLGSVALQPVPMQADGSLALADVRA 121

Query: 160 AIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARIFNA 219
           AI      + F  TRL+CLENTH    GK L + Y  E+ EL  +H L+LH+DGAR+FNA
Sbjct: 122 AI--APDDVHFTPTRLVCLENTH---NGKVLPLPYLREMRELVDEHGLQLHLDGARLFNA 181

Query: 220 SIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQIGI 279
            +A G  V  LV   DSVS+CLSKGLGAPVGS++VGS  FIA+A R+RK +GGGMRQ GI
Sbjct: 182 VVASGHTVRELVAPFDSVSICLSKGLGAPVGSLLVGSHAFIARARRLRKMVGGGMRQAGI 241

Query: 280 LCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYGISL 339
           L  AGL A++++V +L  DH +A+QLA GL  + G+++D   V+TN++F ++       L
Sbjct: 242 LAQAGLFALQQHVVRLADDHRRARQLAEGLAALPGIRLDLAQVQTNMVFLQLTSGESAPL 301

Query: 340 ETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTL 386
               K+   RGI      ++  R+V H QI   D+   +  F + L
Sbjct: 302 LAFMKA---RGILFSGYGEL--RLVTHLQIHDDDIEEVIDAFTEYL 336

BLAST of ClCG06G009580 vs. ExPASy Swiss-Prot
Match: Q21890 (Uncharacterized protein R102.4 OS=Caenorhabditis elegans OX=6239 GN=R102.4 PE=3 SV=3)

HSP 1 Score: 266.9 bits (681), Expect = 3.6e-70
Identity = 155/368 (42.12%), Positives = 221/368 (60.05%), Query Frame = 0

Query: 28  SSISKESTKMVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIM 87
           S+ +++S K     +DLRSDTVT P+  MR AMA A V DDV G D     LE+  A++ 
Sbjct: 54  SAYTQKSNK-THTSIDLRSDTVTVPSVEMRRAMAEAIVGDDVYGEDTTTNRLEQRCAELF 113

Query: 88  GKEAGLFVPSGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNN 147
           GKEAGLFV SGTMGNL++++ HC  RG E+IVG  +HIH  E G  A   G+   T++  
Sbjct: 114 GKEAGLFVTSGTMGNLLAIMAHCQ-RGEEIIVGRYNHIHRWEQGNYAQFAGISATTLEVK 173

Query: 148 ADGTMDIHLIQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHL 207
            DGTMD++ I+ AIR     +  P ++LIC+ENTH  +GGK L +E+   V +LA++  L
Sbjct: 174 PDGTMDLNDIEQAIRVKDCHM--PASKLICIENTHNYTGGKALPIEWMRSVKQLAERRDL 233

Query: 208 KLHIDGARIFNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVR 267
           K+H+DGARI+NA++A    V ++   AD+V +C SKGLGAPVGS++VG KDFI +A   R
Sbjct: 234 KVHMDGARIYNAAVASNCSVSKIASFADTVQMCFSKGLGAPVGSIVVGPKDFIDRARHSR 293

Query: 268 KALGGGMRQIGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNII 327
           KALGGG RQ GIL AA  IA+      ++ADHE+AK LA  +         P+   T + 
Sbjct: 294 KALGGGWRQSGILAAAAHIALDHADATIRADHERAKTLARMIND-----ATPEEFRTKV- 353

Query: 328 FFEIEEDY----------GISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYT 386
            F  E+D           G++++ L    ++  I  M     R R+VL+  +S  ++   
Sbjct: 354 -FAAEKDITNMVLVHCQNGVTVQQLTDFFQKHDILAMTFDARRIRMVLNWNVSDENLETI 410

BLAST of ClCG06G009580 vs. ExPASy Swiss-Prot
Match: P75823 (Low specificity L-threonine aldolase OS=Escherichia coli (strain K12) OX=83333 GN=ltaE PE=1 SV=1)

HSP 1 Score: 251.9 bits (642), Expect = 1.2e-65
Identity = 138/329 (41.95%), Positives = 202/329 (61.40%), Query Frame = 0

Query: 42  VDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVPSGTMG 101
           +DLRSDTVT+P+ +M  AM  A V DDV G DP    L++  A++ GKEA +F+P+GT  
Sbjct: 2   IDLRSDTVTRPSRAMLEAMMAAPVGDDVYGDDPTVNALQDYAAELSGKEAAIFLPTGTQA 61

Query: 102 NLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHLIQAAI 161
           NL+++L HC+ RG E IVG  +H ++ E GG A +G + P+ +   ADGT+   L + A+
Sbjct: 62  NLVALLSHCE-RGEEYIVGQAAHNYLFEAGGAAVLGSIQPQPIDAAADGTLP--LDKVAM 121

Query: 162 RNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARIFNASI 221
           +     + F  T+L+ LENTH    GK L  EY  E  E  ++ +L LH+DGARIFNA +
Sbjct: 122 KIKPDDIHFARTKLLSLENTH---NGKVLPREYLKEAWEFTRERNLALHVDGARIFNAVV 181

Query: 222 ALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQIGILC 281
           A G  +  + Q  DS ++CLSKGLG PVGS++VG++D+I +A R RK  GGGMRQ GIL 
Sbjct: 182 AYGCELKEITQYCDSFTICLSKGLGTPVGSLLVGNRDYIKRAIRWRKMTGGGMRQSGILA 241

Query: 282 AAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYGISLET 341
           AAG+ A+K NV +LQ DH+ A  +A    Q++    D    +TN++F  + E+   +L  
Sbjct: 242 AAGIYALKNNVARLQEDHDNAAWMAE---QLREAGADVMRQDTNMLFVRVGEENAAALGE 301

Query: 342 LCKSLEERGIFMMLESQIRARIVLHHQIS 371
             K+       +++ +    R+V H  +S
Sbjct: 302 YMKARN-----VLINASPIVRLVTHLDVS 316

BLAST of ClCG06G009580 vs. ExPASy TrEMBL
Match: A0A1S3BXZ8 (probable low-specificity L-threonine aldolase 1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103494316 PE=3 SV=1)

HSP 1 Score: 652.1 bits (1681), Expect = 1.5e-183
Identity = 328/370 (88.65%), Positives = 351/370 (94.86%), Query Frame = 0

Query: 25  MVGSSISKESTKMVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMA 84
           ++ SSIS+   KMVSRKVDLRSDTVTKPTESMRAAMA+AEVDDDVLG+DP  LELEE+MA
Sbjct: 31  LITSSISRFQCKMVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEKMA 90

Query: 85  KIMGKEAGLFVPSGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTV 144
           KIMGKE GLFVPSGTMGNLIS+LVHC+ RGSEVIVGDNSHIHILENGGIATIGGVHPRTV
Sbjct: 91  KIMGKEGGLFVPSGTMGNLISILVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTV 150

Query: 145 KNNADGTMDIHLIQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKK 204
           KN  DGTMDI LI+AAIRNPK QLFFPTTRLICLENTHANSGGKCLS+EYTDEVGELAKK
Sbjct: 151 KNKDDGTMDIDLIEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSMEYTDEVGELAKK 210

Query: 205 HHLKLHIDGARIFNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKAT 264
           H LKLHIDGARIFNASIAL +PVDRLV+AADSVSVCLSKGLGAPVGS+I+GSKDFI KA 
Sbjct: 211 HDLKLHIDGARIFNASIALAIPVDRLVRAADSVSVCLSKGLGAPVGSIILGSKDFITKAI 270

Query: 265 RVRKALGGGMRQIGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVET 324
           R+RK LGGGMRQIGILCAAGL+AIKENVQKL+ADHEKAKQLASGLYQIKGLKVDPKSVET
Sbjct: 271 RIRKTLGGGMRQIGILCAAGLVAIKENVQKLEADHEKAKQLASGLYQIKGLKVDPKSVET 330

Query: 325 NIIFFEIEEDYGISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQT 384
           NIIFFEIE+DYGIS+ETLCK+LEERGIFMMLESQ RARIVLHHQISTSDV YTLSCF+QT
Sbjct: 331 NIIFFEIEDDYGISMETLCKTLEERGIFMMLESQTRARIVLHHQISTSDVQYTLSCFKQT 390

Query: 385 LSGIQVVNGN 395
           L+GI+VVNGN
Sbjct: 391 LNGIKVVNGN 400
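
In these alignments the middle row repeats the residue where query and subject are identical and shows `+` where the substitution scores as conservative. The sketch below, in Python, counts both kinds of column for one alignment row; the function name `hsp_segment_counts` is a hypothetical helper, and padding the middle row is an assumption made because trailing spaces may have been lost in this text export.

```python
def hsp_segment_counts(query, midline, sbjct):
    """Return (identities, positives, columns) for one aligned row triplet."""
    if len(query) != len(sbjct):
        raise ValueError("query and subject rows must have equal length")
    # Pad the middle row in case trailing spaces were stripped on export.
    midline = midline.ljust(len(query))
    identities = positives = 0
    for q, m, s in zip(query, midline, sbjct):
        if q == s and q != "-":
            identities += 1   # identical residue
            positives += 1    # identities also count as positives
        elif m == "+":
            positives += 1    # conservative substitution
    return identities, positives, len(query)

# Last row of the A0A1S3BXZ8 alignment (query positions 385 to 395 region).
query   = "LSGIQVVNGN"
midline = "L+GI+VVNGN"
sbjct   = "LNGIKVVNGN"

print(hsp_segment_counts(query, midline, sbjct))  # (8, 10, 10)
```

Summing these counts over every row of an HSP should reproduce the Identity and Positives numerators in its header, provided the rows were exported intact.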

BLAST of ClCG06G009580 vs. ExPASy TrEMBL
Match: A0A1S3BWA2 (probable low-specificity L-threonine aldolase 1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103494316 PE=3 SV=1)

HSP 1 Score: 633.3 bits (1632), Expect = 7.0e-178
Identity = 321/370 (86.76%), Positives = 343/370 (92.70%), Query Frame = 0

Query: 25  MVGSSISKESTKMVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMA 84
           ++ SSIS+   KMVSRKVDLRSDTVTKPTESMRAAMA+AEVDDDVLG+DP  LELEE+MA
Sbjct: 31  LITSSISRFQCKMVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEKMA 90

Query: 85  KIMGKEAGLFVPSGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTV 144
           KIMGKE GLFVPSGTMGNLIS+LVHC+ RGSEVIVGDNSHIHILENGGIATIGGVHPRTV
Sbjct: 91  KIMGKEGGLFVPSGTMGNLISILVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTV 150

Query: 145 KNNADGTMDIHLIQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKK 204
           KN  DGTMDI LI+AAIRNPK QLFFPTTRLICLENTHANSGGKCLS+EYTDEVGELAKK
Sbjct: 151 KNKDDGTMDIDLIEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSMEYTDEVGELAKK 210

Query: 205 HHLKLHIDGARIFNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKAT 264
           H LKLHIDGARIFNASIAL +PVDRL        VCLSKGLGAPVGS+I+GSKDFI KA 
Sbjct: 211 HDLKLHIDGARIFNASIALAIPVDRL--------VCLSKGLGAPVGSIILGSKDFITKAI 270

Query: 265 RVRKALGGGMRQIGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVET 324
           R+RK LGGGMRQIGILCAAGL+AIKENVQKL+ADHEKAKQLASGLYQIKGLKVDPKSVET
Sbjct: 271 RIRKTLGGGMRQIGILCAAGLVAIKENVQKLEADHEKAKQLASGLYQIKGLKVDPKSVET 330

Query: 325 NIIFFEIEEDYGISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQT 384
           NIIFFEIE+DYGIS+ETLCK+LEERGIFMMLESQ RARIVLHHQISTSDV YTLSCF+QT
Sbjct: 331 NIIFFEIEDDYGISMETLCKTLEERGIFMMLESQTRARIVLHHQISTSDVQYTLSCFKQT 390

Query: 385 LSGIQVVNGN 395
           L+GI+VVNGN
Sbjct: 391 LNGIKVVNGN 392

BLAST of ClCG06G009580 vs. ExPASy TrEMBL
Match: A0A6J1DWI0 (probable low-specificity L-threonine aldolase 1 OS=Momordica charantia OX=3673 GN=LOC111024142 PE=3 SV=1)

HSP 1 Score: 614.0 bits (1582), Expect = 4.4e-172
Identity = 314/359 (87.47%), Positives = 334/359 (93.04%), Query Frame = 0

Query: 37  MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 96
           MVSRKVDLRSDTVTKPTE+MRAAMA+AEVDDDVLG+DPI L+LEEEMAK+ GKEA LFVP
Sbjct: 1   MVSRKVDLRSDTVTKPTEAMRAAMAMAEVDDDVLGYDPIALQLEEEMAKMTGKEAALFVP 60

Query: 97  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 156
           SGTMGNLISVLVHC+IRGSEVI+G NSHIHILENGGIATIGGVHPRTVKNNADGTMDI L
Sbjct: 61  SGTMGNLISVLVHCEIRGSEVILGHNSHIHILENGGIATIGGVHPRTVKNNADGTMDIDL 120

Query: 157 IQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 216
           I+AAIRNPK +LFFPTTRL+CLEN+HANSGGKCLSVEYTDEVGELAKKH LKLHIDGARI
Sbjct: 121 IEAAIRNPKGELFFPTTRLVCLENSHANSGGKCLSVEYTDEVGELAKKHGLKLHIDGARI 180

Query: 217 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQ 276
           FNASIALGV VDRLVQAADSVSVCLSKGLGAPVGSVIVGSK FIAKA RVRK LGGGMRQ
Sbjct: 181 FNASIALGVSVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKSFIAKAKRVRKTLGGGMRQ 240

Query: 277 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEE-DY 336
           IGILC+A L+AIKEN+ KL+ DH KAK LASGL +I GLKVDPKSVETNIIFFE+E+ DY
Sbjct: 241 IGILCSAALVAIKENLPKLEDDHHKAKLLASGLSEINGLKVDPKSVETNIIFFELEDVDY 300

Query: 337 GISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQVVNGN 395
            IS+ETLCKSLEERGIFMM ES  RARIV+HHQIS SDVHYTLSCFQQTLSGIQV NGN
Sbjct: 301 KISVETLCKSLEERGIFMMQESSTRARIVIHHQISISDVHYTLSCFQQTLSGIQVGNGN 359

BLAST of ClCG06G009580 vs. ExPASy TrEMBL
Match: A0A6J1EZ04 (probable low-specificity L-threonine aldolase 1 OS=Cucurbita moschata OX=3662 GN=LOC111440844 PE=3 SV=1)

HSP 1 Score: 608.6 bits (1568), Expect = 1.8e-170
Identity = 312/358 (87.15%), Positives = 327/358 (91.34%), Query Frame = 0

Query: 37  MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 96
           MVSRKVDLRSDTVTKPT+SMRAAMAIAEVDDDVLG+DPI L+LEEEMAK+ GKEA LFVP
Sbjct: 1   MVSRKVDLRSDTVTKPTDSMRAAMAIAEVDDDVLGYDPIALQLEEEMAKLTGKEAALFVP 60

Query: 97  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 156
           SGTMGNLISVLVHCDIRGSEVI+GDNSHIHILENGGIATIGGVHPRTVKNN DGT+DI L
Sbjct: 61  SGTMGNLISVLVHCDIRGSEVILGDNSHIHILENGGIATIGGVHPRTVKNNDDGTIDIDL 120

Query: 157 IQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 216
           I+AAIRNPK +LFFPTTRLICLENTHANSGGKCL VEY DEVGELAKKH LKLHIDGARI
Sbjct: 121 IEAAIRNPKGELFFPTTRLICLENTHANSGGKCLPVEYIDEVGELAKKHGLKLHIDGARI 180

Query: 217 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQ 276
           FNASIALGV VDRLVQ ADSVSVCLSKGLGAPVGSVIVGSK FIAKA RVRK LGGGMRQ
Sbjct: 181 FNASIALGVSVDRLVQTADSVSVCLSKGLGAPVGSVIVGSKSFIAKAKRVRKTLGGGMRQ 240

Query: 277 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 336
           IGILCAA LIAIKENV KL  DH  AK LASGL QI G+KVDPKSVETNIIFFE+EED  
Sbjct: 241 IGILCAAALIAIKENVPKLATDHHNAKLLASGLNQINGVKVDPKSVETNIIFFEMEEDSK 300

Query: 337 ISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQVVNGN 395
           IS+ETLCKSLEERGIFMML+ + RAR+VLHHQISTSDV YTLSCFQQTLSGI   +GN
Sbjct: 301 ISVETLCKSLEERGIFMMLDGKTRARMVLHHQISTSDVEYTLSCFQQTLSGIAAADGN 358

BLAST of ClCG06G009580 vs. ExPASy TrEMBL
Match: A0A6J1HN30 (probable low-specificity L-threonine aldolase 1 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111466119 PE=3 SV=1)

HSP 1 Score: 607.1 bits (1564), Expect = 5.4e-170
Identity = 311/359 (86.63%), Positives = 326/359 (90.81%), Query Frame = 0

Query: 36  KMVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFV 95
           KMVSRKVDLRSDTVTKPT+SMRAAMAIAEVDDDVLG+DPI L+LEEEMAK+ GKEA LFV
Sbjct: 2   KMVSRKVDLRSDTVTKPTDSMRAAMAIAEVDDDVLGYDPIALQLEEEMAKLTGKEAALFV 61

Query: 96  PSGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIH 155
           PSGTMGNLISVLVHCDIRGSEVI+GDNSHIHILENGGIATIGGVHPRTVKNN DGT+DI 
Sbjct: 62  PSGTMGNLISVLVHCDIRGSEVILGDNSHIHILENGGIATIGGVHPRTVKNNDDGTIDID 121

Query: 156 LIQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGAR 215
           LI+AAIRNPK +LFFPTTRLICLENTHANSGGKCL VEY DEVGEL KKH LKLHIDGAR
Sbjct: 122 LIEAAIRNPKGELFFPTTRLICLENTHANSGGKCLPVEYIDEVGELTKKHGLKLHIDGAR 181

Query: 216 IFNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMR 275
           IFNASIALGV VDRLVQ ADSVSVCLSKGLGAPVGSVIVGSK FIAKA RVRK LGGGMR
Sbjct: 182 IFNASIALGVSVDRLVQTADSVSVCLSKGLGAPVGSVIVGSKSFIAKAKRVRKTLGGGMR 241

Query: 276 QIGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDY 335
           QIGILCAA LIAIKENV KL  DH  AK LASGL QI G+KVDPKSVETNIIFFE+EED 
Sbjct: 242 QIGILCAAALIAIKENVPKLATDHHNAKLLASGLNQINGVKVDPKSVETNIIFFEMEEDS 301

Query: 336 GISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQVVNGN 395
            IS+ET+CKSLEERGIFMML+ + RAR+VLHHQISTSDV YTLSCFQQTLSGI    GN
Sbjct: 302 KISVETVCKSLEERGIFMMLDGKTRARMVLHHQISTSDVEYTLSCFQQTLSGIAAAGGN 360

BLAST of ClCG06G009580 vs. TAIR 10
Match: AT1G08630.1 (threonine aldolase 1)

HSP 1 Score: 510.4 bits (1313), Expect = 1.3e-144
Identity = 250/353 (70.82%), Positives = 301/353 (85.27%), Query Frame = 0

Query: 37  MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 96
           MV R VDLRSDTVT+PT++MR AM  AEVDDDVLG+DP    LEEEMAK+MGKEA LFVP
Sbjct: 1   MVMRSVDLRSDTVTRPTDAMREAMCNAEVDDDVLGYDPTARRLEEEMAKMMGKEAALFVP 60

Query: 97  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 156
           SGTMGNLISV+VHCD+RGSEVI+GDN HIH+ ENGGI+TIGGVHP+TVKN  DGTMD+  
Sbjct: 61  SGTMGNLISVMVHCDVRGSEVILGDNCHIHVYENGGISTIGGVHPKTVKNEEDGTMDLEA 120

Query: 157 IQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 216
           I+AAIR+PK   F+P+TRLICLENTHANSGG+CLSVEYT++VGE+AK+H +KLHIDGAR+
Sbjct: 121 IEAAIRDPKGSTFYPSTRLICLENTHANSGGRCLSVEYTEKVGEIAKRHGVKLHIDGARL 180

Query: 217 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQ 276
           FNASIALGVPV +LV+AADSV VCLSKGLGAPVGSVIVGS+ FI KA  VRK LGGGMRQ
Sbjct: 181 FNASIALGVPVHKLVKAADSVQVCLSKGLGAPVGSVIVGSQSFIEKAKTVRKTLGGGMRQ 240

Query: 277 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 336
           IG+LCAA L+A++EN+ KLQ DH+KAK LA GL Q+KG++V+  +VETN+IF ++E+   
Sbjct: 241 IGVLCAAALVALQENLPKLQHDHKKAKLLAEGLNQMKGIRVNVAAVETNMIFMDMEDGSR 300

Query: 337 ISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQ 390
           ++ E L K+LEE GI ++  +  R RIV+HHQI+TSDVHYTLSCFQQ +  +Q
Sbjct: 301 LTAEKLRKNLEENGILLIRGNSSRIRIVIHHQITTSDVHYTLSCFQQAMLTMQ 353

BLAST of ClCG06G009580 vs. TAIR 10
Match: AT1G08630.3 (threonine aldolase 1)

HSP 1 Score: 510.4 bits (1313), Expect = 1.3e-144
Identity = 250/353 (70.82%), Positives = 301/353 (85.27%), Query Frame = 0

Query: 37  MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 96
           MV R VDLRSDTVT+PT++MR AM  AEVDDDVLG+DP    LEEEMAK+MGKEA LFVP
Sbjct: 1   MVMRSVDLRSDTVTRPTDAMREAMCNAEVDDDVLGYDPTARRLEEEMAKMMGKEAALFVP 60

Query: 97  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 156
           SGTMGNLISV+VHCD+RGSEVI+GDN HIH+ ENGGI+TIGGVHP+TVKN  DGTMD+  
Sbjct: 61  SGTMGNLISVMVHCDVRGSEVILGDNCHIHVYENGGISTIGGVHPKTVKNEEDGTMDLEA 120

Query: 157 IQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 216
           I+AAIR+PK   F+P+TRLICLENTHANSGG+CLSVEYT++VGE+AK+H +KLHIDGAR+
Sbjct: 121 IEAAIRDPKGSTFYPSTRLICLENTHANSGGRCLSVEYTEKVGEIAKRHGVKLHIDGARL 180

Query: 217 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQ 276
           FNASIALGVPV +LV+AADSV VCLSKGLGAPVGSVIVGS+ FI KA  VRK LGGGMRQ
Sbjct: 181 FNASIALGVPVHKLVKAADSVQVCLSKGLGAPVGSVIVGSQSFIEKAKTVRKTLGGGMRQ 240

Query: 277 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 336
           IG+LCAA L+A++EN+ KLQ DH+KAK LA GL Q+KG++V+  +VETN+IF ++E+   
Sbjct: 241 IGVLCAAALVALQENLPKLQHDHKKAKLLAEGLNQMKGIRVNVAAVETNMIFMDMEDGSR 300

Query: 337 ISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQ 390
           ++ E L K+LEE GI ++  +  R RIV+HHQI+TSDVHYTLSCFQQ +  +Q
Sbjct: 301 LTAEKLRKNLEENGILLIRGNSSRIRIVIHHQITTSDVHYTLSCFQQAMLTMQ 353

BLAST of ClCG06G009580 vs. TAIR 10
Match: AT1G08630.2 (threonine aldolase 1)

HSP 1 Score: 510.4 bits (1313), Expect = 1.3e-144
Identity = 250/353 (70.82%), Positives = 301/353 (85.27%), Query Frame = 0

Query: 37  MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 96
           MV R VDLRSDTVT+PT++MR AM  AEVDDDVLG+DP    LEEEMAK+MGKEA LFVP
Sbjct: 1   MVMRSVDLRSDTVTRPTDAMREAMCNAEVDDDVLGYDPTARRLEEEMAKMMGKEAALFVP 60

Query: 97  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 156
           SGTMGNLISV+VHCD+RGSEVI+GDN HIH+ ENGGI+TIGGVHP+TVKN  DGTMD+  
Sbjct: 61  SGTMGNLISVMVHCDVRGSEVILGDNCHIHVYENGGISTIGGVHPKTVKNEEDGTMDLEA 120

Query: 157 IQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 216
           I+AAIR+PK   F+P+TRLICLENTHANSGG+CLSVEYT++VGE+AK+H +KLHIDGAR+
Sbjct: 121 IEAAIRDPKGSTFYPSTRLICLENTHANSGGRCLSVEYTEKVGEIAKRHGVKLHIDGARL 180

Query: 217 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQ 276
           FNASIALGVPV +LV+AADSV VCLSKGLGAPVGSVIVGS+ FI KA  VRK LGGGMRQ
Sbjct: 181 FNASIALGVPVHKLVKAADSVQVCLSKGLGAPVGSVIVGSQSFIEKAKTVRKTLGGGMRQ 240

Query: 277 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 336
           IG+LCAA L+A++EN+ KLQ DH+KAK LA GL Q+KG++V+  +VETN+IF ++E+   
Sbjct: 241 IGVLCAAALVALQENLPKLQHDHKKAKLLAEGLNQMKGIRVNVAAVETNMIFMDMEDGSR 300

Query: 337 ISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQ 390
           ++ E L K+LEE GI ++  +  R RIV+HHQI+TSDVHYTLSCFQQ +  +Q
Sbjct: 301 LTAEKLRKNLEENGILLIRGNSSRIRIVIHHQITTSDVHYTLSCFQQAMLTMQ 353

BLAST of ClCG06G009580 vs. TAIR 10
Match: AT1G08630.4 (threonine aldolase 1)

HSP 1 Score: 510.4 bits (1313), Expect = 1.3e-144
Identity = 250/353 (70.82%), Positives = 301/353 (85.27%), Query Frame = 0

Query: 37  MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 96
           MV R VDLRSDTVT+PT++MR AM  AEVDDDVLG+DP    LEEEMAK+MGKEA LFVP
Sbjct: 1   MVMRSVDLRSDTVTRPTDAMREAMCNAEVDDDVLGYDPTARRLEEEMAKMMGKEAALFVP 60

Query: 97  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 156
           SGTMGNLISV+VHCD+RGSEVI+GDN HIH+ ENGGI+TIGGVHP+TVKN  DGTMD+  
Sbjct: 61  SGTMGNLISVMVHCDVRGSEVILGDNCHIHVYENGGISTIGGVHPKTVKNEEDGTMDLEA 120

Query: 157 IQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 216
           I+AAIR+PK   F+P+TRLICLENTHANSGG+CLSVEYT++VGE+AK+H +KLHIDGAR+
Sbjct: 121 IEAAIRDPKGSTFYPSTRLICLENTHANSGGRCLSVEYTEKVGEIAKRHGVKLHIDGARL 180

Query: 217 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQ 276
           FNASIALGVPV +LV+AADSV VCLSKGLGAPVGSVIVGS+ FI KA  VRK LGGGMRQ
Sbjct: 181 FNASIALGVPVHKLVKAADSVQVCLSKGLGAPVGSVIVGSQSFIEKAKTVRKTLGGGMRQ 240

Query: 277 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 336
           IG+LCAA L+A++EN+ KLQ DH+KAK LA GL Q+KG++V+  +VETN+IF ++E+   
Sbjct: 241 IGVLCAAALVALQENLPKLQHDHKKAKLLAEGLNQMKGIRVNVAAVETNMIFMDMEDGSR 300

Query: 337 ISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQ 390
           ++ E L K+LEE GI ++  +  R RIV+HHQI+TSDVHYTLSCFQQ +  +Q
Sbjct: 301 LTAEKLRKNLEENGILLIRGNSSRIRIVIHHQITTSDVHYTLSCFQQAMLTMQ 353

BLAST of ClCG06G009580 vs. TAIR 10
Match: AT1G08630.5 (threonine aldolase 1)

HSP 1 Score: 488.8 bits (1257), Expect = 4.1e-138
Identity = 244/353 (69.12%), Positives = 291/353 (82.44%), Query Frame = 0

Query: 37  MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 96
           MV R VDLRSDTVT+PT++MR AM  AEVDDDVLG+DP    LEEEMAK+MGKEA LFVP
Sbjct: 1   MVMRSVDLRSDTVTRPTDAMREAMCNAEVDDDVLGYDPTARRLEEEMAKMMGKEAALFVP 60

Query: 97  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 156
           SGTMGNLISV+VHCD+RGSEVI+GDN HIH+ ENGGI+TIGGVHP+TVKN  DGTMD+  
Sbjct: 61  SGTMGNLISVMVHCDVRGSEVILGDNCHIHVYENGGISTIGGVHPKTVKNEEDGTMDLEA 120

Query: 157 IQAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 216
           I+AAIR+PK   F+P+TRLICLENTHANSGG+CLSVEYT++VGE+AK+H +KLHIDGAR+
Sbjct: 121 IEAAIRDPKGSTFYPSTRLICLENTHANSGGRCLSVEYTEKVGEIAKRHGVKLHIDGARL 180

Query: 217 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKATRVRKALGGGMRQ 276
           FNASIALGVPV +LV+AADSV VCLSKGLGAPVGSVIVGS+ FI KA  VRK LGGGMRQ
Sbjct: 181 FNASIALGVPVHKLVKAADSVQVCLSKGLGAPVGSVIVGSQSFIEKAKTVRKTLGGGMRQ 240

Query: 277 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 336
           IG+LCAA L+A++EN+ KLQ DH+KAK LA              +VETN+IF ++E+   
Sbjct: 241 IGVLCAAALVALQENLPKLQHDHKKAKLLA--------------AVETNMIFMDMEDGSR 300

Query: 337 ISLETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQ 390
           ++ E L K+LEE GI ++  +  R RIV+HHQI+TSDVHYTLSCFQQ +  +Q
Sbjct: 301 LTAEKLRKNLEENGILLIRGNSSRIRIVIHHQITTSDVHYTLSCFQQAMLTMQ 339

The following BLAST results are available for this feature:

BLAST of ClCG06G009580 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
XP_008453666.1 | 3.0e-183 | 88.65 | PREDICTED: probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis m...
XP_038877077.1 | 2.1e-181 | 90.78 | probable low-specificity L-threonine aldolase 1 [Benincasa hispida]
XP_011660269.1 | 1.3e-178 | 89.66 | probable low-specificity L-threonine aldolase 1 isoform X2 [Cucumis sativus] >XP...
XP_008453667.1 | 1.4e-177 | 86.76 | PREDICTED: probable low-specificity L-threonine aldolase 1 isoform X2 [Cucumis m...
XP_031736719.1 | 1.2e-176 | 88.19 | probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis sativus] >XP...

BLAST of ClCG06G009580 vs. ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q8RXU4 | 1.8e-143 | 70.82 | Probable low-specificity L-threonine aldolase 1 OS=Arabidopsis thaliana OX=3702 ...
Q9FPH3 | 7.0e-135 | 69.05 | Probable low-specificity L-threonine aldolase 2 OS=Arabidopsis thaliana OX=3702 ...
O07051 | 1.1e-74 | 46.53 | L-allo-threonine aldolase OS=Aeromonas jandaei OX=650 GN=ltaA PE=1 SV=1
Q21890 | 3.6e-70 | 42.12 | Uncharacterized protein R102.4 OS=Caenorhabditis elegans OX=6239 GN=R102.4 PE=3 ...
P75823 | 1.2e-65 | 41.95 | Low specificity L-threonine aldolase OS=Escherichia coli (strain K12) OX=83333 G...

BLAST of ClCG06G009580 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A1S3BXZ8 | 1.5e-183 | 88.65 | probable low-specificity L-threonine aldolase 1 isoform X1 OS=Cucumis melo OX=36...
A0A1S3BWA2 | 7.0e-178 | 86.76 | probable low-specificity L-threonine aldolase 1 isoform X2 OS=Cucumis melo OX=36...
A0A6J1DWI0 | 4.4e-172 | 87.47 | probable low-specificity L-threonine aldolase 1 OS=Momordica charantia OX=3673 G...
A0A6J1EZ04 | 1.8e-170 | 87.15 | probable low-specificity L-threonine aldolase 1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1HN30 | 5.4e-170 | 86.63 | probable low-specificity L-threonine aldolase 1 isoform X1 OS=Cucurbita maxima O...

BLAST of ClCG06G009580 vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G08630.1 | 1.3e-144 | 70.82 | threonine aldolase 1
AT1G08630.3 | 1.3e-144 | 70.82 | threonine aldolase 1
AT1G08630.2 | 1.3e-144 | 70.82 | threonine aldolase 1
AT1G08630.4 | 1.3e-144 | 70.82 | threonine aldolase 1
AT1G08630.5 | 4.1e-138 | 69.12 | threonine aldolase 1

InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 285..305
None | No IPR available | PIRSR | PIRSR017617-1 | PIRSR017617-1 | coord: 67..335; e-value: 3.9E-108; score: 358.8
None | No IPR available | PANTHER | PTHR48097 | L-THREONINE ALDOLASE-RELATED | coord: 37..394
None | No IPR available | PANTHER | PTHR48097:SF4 | L-ALLO-THREONINE ALDOLASE-LIKE PROTEIN | coord: 37..394
IPR001597 | Aromatic amino acid beta-eliminating lyase/threonine aldolase | PFAM | PF01212 | Beta_elim_lyase | coord: 43..329; e-value: 1.1E-95; score: 320.3
IPR023603 | Threonine aldolase | PIRSF | PIRSF017617 | Thr_aldolase | coord: 33..392; e-value: 2.1E-135; score: 449.3
IPR015422 | Pyridoxal phosphate-dependent transferase, small domain | GENE3D | 3.90.1150.10 | Aspartate Aminotransferase, domain 1 | coord: 291..386; e-value: 6.9E-24; score: 85.9
IPR015421 | Pyridoxal phosphate-dependent transferase, major domain | GENE3D | 3.40.640.10 |  | coord: 41..290; e-value: 2.0E-88; score: 297.8
IPR015424 | Pyridoxal phosphate-dependent transferase | SUPERFAMILY | 53383 | PLP-dependent transferases | coord: 42..387
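
The `coord: start..end` entries above give each domain's span on the protein. The Python sketch below turns those entries into span lengths and checks whether two domains abut. It assumes the coordinates are 1-based and inclusive; the helper names `parse_coord` and `span_length` are hypothetical and chosen only for this illustration.

```python
import re

# Matches entries such as "coord: 43..329" anywhere in a line.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def parse_coord(text):
    """Return (start, end) from a 'coord: start..end' entry."""
    m = COORD_RE.search(text)
    if m is None:
        raise ValueError("no coord entry found")
    return int(m.group(1)), int(m.group(2))

def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1

# Three entries copied from the InterPro annotation of this gene.
entries = {
    "PF01212 Beta_elim_lyase": "coord: 43..329",
    "GENE3D 3.40.640.10 (major domain)": "coord: 41..290",
    "GENE3D 3.90.1150.10 (small domain)": "coord: 291..386",
}

spans = {name: parse_coord(text) for name, text in entries.items()}
for name, (start, end) in spans.items():
    print(name, start, end, span_length(start, end))

# The GENE3D major domain ends at 290 and the small domain starts at 291.
major_end = spans["GENE3D 3.40.640.10 (major domain)"][1]
small_start = spans["GENE3D 3.90.1150.10 (small domain)"][0]
print(small_start == major_end + 1)  # True
```

Under the stated assumption the Pfam domain spans 287 residues, and the two GENE3D domains are directly adjacent without overlap.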

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
ClCG06G009580.2 | ClCG06G009580.2 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006520 cellular amino acid metabolic process
molecular_function GO:0016829 lyase activity
molecular_function GO:0003824 catalytic activity
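
For downstream use, the GO rows above can be grouped by category. The following Python sketch is one way to do so, assuming each row has the layout category, accession, term name separated by whitespace; `group_go_terms` is a hypothetical helper name.

```python
import re
from collections import defaultdict

# Row layout assumed here: <category> <GO accession> <term name>
GO_RE = re.compile(r"^(\w+)\s+(GO:\d{7})\s+(.+)$")

def group_go_terms(lines):
    """Group (accession, name) pairs by GO category, skipping non-matching lines."""
    grouped = defaultdict(list)
    for line in lines:
        m = GO_RE.match(line.strip())
        if m:
            category, accession, name = m.groups()
            grouped[category].append((accession, name))
    return dict(grouped)

rows = [
    "Category Term Accession Term Name",  # header row, ignored by the pattern
    "biological_process GO:0006520 cellular amino acid metabolic process",
    "molecular_function GO:0016829 lyase activity",
    "molecular_function GO:0003824 catalytic activity",
]

go_terms = group_go_terms(rows)
print(sorted(go_terms))                     # ['biological_process', 'molecular_function']
print(len(go_terms["molecular_function"]))  # 2
```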