CcUC06G119550 (gene) Watermelon (PI 537277) v1

Overview
Name: CcUC06G119550
Type: gene
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: Beta_elim_lyase domain-containing protein
Location: CicolChr06: 16371211 .. 16399319 (+)
RNA-Seq Expression: CcUC06G119550
Synteny: CcUC06G119550
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TTTCAAAAGTTTAAAAAATACTCCTAAATTAAAAAAAAAATATATATCATCGTGAGTTAACAGAAACTATTAGTATCTTGTTGAAAAAATAACTCAAATTTTAGAAAAAAAAAAAGTTAAAAAATGCACGTATCATTAGTATGGTGAGCAATTTGTTCGAAGTTTTCTTTAGTTTGAAAAGAATCAATTTTAAAACAAATTATATGTACATCTTTAATTTTTTTTTAAAAAAAATAGCTTATTTTTCTCCTATTTTTTCAATTCTTATACCAATTTTTTTTTCAAGAACCAAAATTTTAAGTTAAAACATAAAATCGCACAAAATTTCACAAAACTCCAAAATTAAAATCTTCCTTAAGAACATACCAAAATGATAAATGGTCAAATAAAAAATATATATTTAACTCTTAACACACACTTAACGCACAACTTTGTGAAAATATTTAAGTCCCAAAATCCCTTGGTGTTTCCATATTCTTTTTCTTCTTATTAGTGCATATACCTTAATTGAATTCTGCAATTTTATTTTTTTATTTATTATTTTTAGATAAAAAATCTAAAAAATTAACAAAAGGCGAGAGTATCAAGGTACTGAAGAAGAAAATGAGAAAAAAAGAGGAAAAAGAGGGAAAATAGAGAAGTAATATGACATTAATGGTCTCTATTCCTATAGAGAAGTAAGATGACATTAATGGTTGCTATTCATATAACTGTAAATGAGAAAAAAAGAGGGAAAATAGAGAAAAAAAGAGGGAAAATAGAGAAGTAAGATGACATTAATAGTTACTATTCATATAACTGTAAAGGTATTTAATTTTTTTAAAAAAAATTTAACGGTAGTTTTTTAAACTTTCAAAATTTATGAGTATTTTTGGCACAAAACACTAACGACTTCTCTCGAACCTAATTTTAAGGGTAGTTTTAAACTCATTTTGGAAGTTTAAAGATATTTTGAAATTTTTAAAATTTCAAAAGTATTTTTGAGACAAAGTACAAAGTTTATAGATATTTTTGTATAATTTAGCCTTAAAACTTTAAAAAGGAAGAATTTAATACATGAGGTTTAGAAAATATTTATAAATGATAAATGAGTGATATTGTCTAACAAAAAAAAATTTGACGTTGTATAATTGAAGTGTGTGAATTGTTGAAAAAATTCTAGTGTCTTTGAGATTGAGTGCCCATGTTCACACTTTCAAAAAGATTTTTTAACGGTGACGATCAATTAGTGCGAGCAGTAAAGTTTTTCCGCCCATTGTTAAGTGACATGGATATTTTATTAATTTTATGTTGATTTTACCTTACACGGGTTCTCTATTTTAAAATTTTAATTTTATTTTCTCTTCCTTTCTTTTCCACTCTTCTTTCCCCACACATTTTGAATCTGAATCTCGAAATGGAGAATTGATTCAAGAGAAAACAAAAAGAAATTGGAGTGGGTGAAAAGAGGGAGAGGAAGTGAGAAGAACAAGAACAAAAATAAATAAATAAATCGTCATTTTTTCGTTAGTCAACAATGATCAATACCACTCAAATGACTATTTTAAAATATTTTCTAAACTTTTGGGTAGAGTGGACTATTATAGTCCATGAAATAAACCACTCAATGTTTGTGACTCAAATTATAATAGTTTGTATTTCCTTTTTTATTATGGTAGATTATCTTAAATGACAAAACTACTTAAAATAATTACAACTAATAACAAAATATACAGTCTATCTGCGATAGACCGCGATAGATCACGATAGACTACTATCTATGTCTATTGTGACACAGACAATAGTCTATCACAATTTATCGCGGTCTATCACAGATAGATAGTTAAATTTTGCTATATTCGTAATTATTAGTTATTTTTGCTATATTTGAAAATAGCCCTTTTATTATAGTGTTTATTATTCCTTACATAGACTAAAATAATTTAACACCCCAAACATAAATTATTATAATATACGAACTATAATAACTATTGACTATAATATCTAATTGATTAGCATCC
CCAACCACCCTTTCCATACTCAAGTTTTAGAGACAAAAAGTGTAATTTATTCTTATATTTGATAAAAAAGTCATTGTTCTATGTCTAATGACCAAAACTTATCAAGTGATTAATCAATTCTCTAAATATTTGATAATTCACATCATTTAATAGACTAAAATTCCATTTTAATATCAAACAAATTAAAGGATGGTGTGAATTTAAGTATGTGTATTAACGTGCTAACGTAAAAATACATTAAAGCCACCTAAGCTAGTTGAAGTGGACATTGAATGATAAGACAAAATGAAATTCATCACAAAATAAGGGAAAGACTTGTTATAAAATAGGTGAAAGTTAAAAAAAAATAAGAATATGAGAGAGTACAAAAAGAAGATGAAAGGAAGAGAAAATAGAGAGAATATTTAATTAACATTTAGCTTTTGGATGAGTGTGTTTTATATAAATGATCTCTACAATGGTAGAAGTGTAAACATAATGTATAAGATCTACCATAAATAAGACTAAAACTGACAAAGTGAGCTTTCTTTAAGGGTAATTGACGTGACATTCTATTTTAGAGGTGGGGTGATTTCATTTTTAGTTTTCTATCTCTTGAATTTTGGAAAATTATTTTAAATGACAAAATTGTTGAAAATATTTACAAATAATAACAAAATATCACCATGCATAACAGTGAAATTTTGCTATATTTGTAAATATTTTGGTTCATTTTCCTATATTTGAAAACAATGTTAAATTTATGCTTATGCCTTACAAATTTCACCATTATGTTTTGACTTATCTCACAAACGAAACTTGGATTTGTAGTCAAAATTTAAAACTAAAAAAAAGTTTTGAAAACTACATATTATTTTTAGTATTCAAAACTTATCTTAGTAAACATAAATAGAAAGTATATAACAAAACCAAGAACTCGTGGTAAAAGTAGTGTATAGTAGGGGTGGAAACGGTTCGATTTGGTGGTCAATCGAACCAAACCGAAAGTTTAAAAATTTTTATATGTGAATCGAACCGAACCAAACCAATCTTTCGTTTCAAACTAAATCAAACCAAATCGAATACATGCAATTCGGTTCGGTTTTAAATTCGATTATGTTTTTTGTTAAAAAAGAATATGTATATATACATATTTAATTTAAAATGGTACGGTTCGGTTTGGTTCGATTTTAAATTCAATTTTGCTCTTTAAATTAAAAATGGTTTAGTTTTGAATTCGATTTTGCTCTTTTTCTTAAAATATACAAATATATATATATATATATATAATATTAGTGGTTCAGTTTGGTTACAAATTCAATTTTTGAAATTGAACCCAAATCAAACCGAATTGAATAATTGGTTTGAAGAAATATATATATATATATATATATATATATATATATATATATATATTAGTGGTTCAGTTTGGTTACAAATTCAATTTTTGAAATTGAACCCAAATCAAACCGAATTGAATAATTGGTTTGAAGAAATATATATATATATATATATATATATATATATATATATATATATTAGTGGTTCAGTTTGGTTACAAATTCAATTTTTGAAATTGAACCCAAATCAAACCGAATTGAATAATTCGGTTTCAAGATTTCTTCAAACCAGACTAAACCATATTTTTTGGTTCATTAAGTTTTCGGTTCGGTTCGGTTCGATTTTGAAAAATGATTCGATTTTTGCAGTTTTAATGACCATCCCTAGTGTGTATTAGTTTCATTTTAAAAACAACAAATAATAGTTCATTTTTTTATTGATTAGGTAAAACTTTTTTTTATTAGTTTACAATAACATCTAACTGTGGAGAGATTCAAACTTCTAATTTATTGATCAGGGATATGTACTTTAAGCAGACGATGATGCTCATAATAAGAGCTACAACAAACTTTAAAACTTTAAAAATATGTTAGATTACCAATTGGGTCTCTATACTTTGAAAAAGTTAGACTTTAGTTCCTATAAGATTTTAAAAGTTAGAATTTAGTAATCTTTAGAGT
GTGATAAAATTTCATAAATTGTTCATATCGTTTGAAGGGACTAAATTTTAATTATTATAAACTATAGGGACTAAATTCTAAATTTCTCACTACTATAGGACCAAACTACAATTAAGCTTTTAAAAATAGAATTAATTCACTAATTAATGCAATAAACTTGGAAGCTGGTTCTTTTGTGATCTTAATATATATAAGATGATTCGAAAGAATCAAATAATAATTTAAAAAGAAAGTTGAAACTACACCACCAAGGGTATATGAATTGAAACTTCAGATATATATTCTTTATTCTTTGATAGCATGCCCAAAACCACAATTATTAATTAATTAATTAATTTAAATATAGCAAAAAGCACTTTAAATGAAGGAATACCCAATATCCAATTTGTTGTGAGAAACTTTAGGCCTTGGATTAGGGGAACCTTCTGACCAAAGAAATTCATTTTCACCCTGCTTTTAAAAGTTTTTTTTTTTCTTTTTCCACATTGCTTTAATTTTCTGAGAAAATAAATAAATAAATAAGTAAAAGCATATGATAATGACTCTTGTCTTATCTTCCTATAAATTTCCTTTATGTTTATGCAACTCAATCCAATGGTGGGTTCATCGATATCTAAGGTGAGTGAAATTTAATTTCATGTTTTTGTTTTGAGGAGTTTTTGTTGTATTTTTCTCTTCTATTGAAAGGAATGGAAAGTTTCAATTATTTATGTGTTTATGATCTCTACTTGTTTATATGTTTGCAGTGTGTTTGAAGTACTTATAGAAGCTATCAACTGATTTTTTTTTTTTGGGGGGGGGGGGGGGGGGGGGTTATCCTCCATTTCAAGAGTAAGGAATTGCTATTTTTTCTTGTAATTTTCAATTTAGATATTGATAATTAATTCTTATTTCCATTTATGAAGAAATTGTTTTATTTTTTCATTTTTAGAAATTAAAAAAGTAAGAGATTTGAACCGACTCTTTTAAAAAAAATTAGTATGGATAATTAGATACACTCTTTTGAGAGGGTTCAATATTGAGTAAAGATTCTTGTTTTTGTTTTTGTATAGGTAAGAAATCATAAATGGCATGAATTATATTTTTGAAAACTAGAAACGTAAAATTAAAGTTGATTTGGTTGCATTCTTGTCTCACAATTTTAGTTTATTCTTTATCATTTTCTATTTTTATTATATATTTTCAGAAAAAAATAATAATATATTTGGTAATTATTTTGTTTTCATACGCACCTTTATGAAGATCTTAACTAGCAAACAAAAATTATTCGTTTCTGTAATTATCTTCCTCTTAGATCGTTATTTCATACAACCATTTTGTGACCTCCGCTACACTACAAGATCAGGTATACTCCTCATACAAAAAGCCAACATGTTTGATATATATATATATATATATATATATATATGGGCTGAAAATTAAAGTAATCAAATAATTATAGTGACCACTTAGGTTTCTTTTAAAAAACGATCTCTGTATAGGCATCAACAAAAAATGGTGAGCAGAAAGGTGGATTTGCGGTCGGACACAGTGACGAAACCGACCGAATCGATGCGAGCTGCGATGGCGATTGCTGAGGTGGACGATGATGTGTTGGGGCATGACCCCATAGGGTTAGAGTTGGAAGAAGAGATGGCGAAGATAATGGGGAAAGAAGCAGGGTTATTTGTTCCCTCAGGCACAATGGGAAATCTCATAAGTGTTCTTGTACACTGTGACATTAGAGGGAGTGAAGTGATTGTTGGGGACAATTCCCATATTCACATTTTGGAAAATGGAGGCATTGCAACCATTGGAGGAGTTCATCCAAGGACAGTCAAAAACAATGCTGATGGAACAATGGATATTCATTTGATTGAAGCTGCCATTAGAAACCCAAAGGCCCAACTCTTCTTCCCAACAACAAGGCTCATTTGTTTAGAAAATACACATGCAAAGTAAGTATAGTTATAATTAATTTCACAATCATTTCCTTCTTTTGCTTCCTAATTAACCATTTA
ATTTTTTTATTTTTTTATTATTTTTTATAAATTCTATCATGTCCTCTCAATTAAAAGAATATTCTTAACCAAATTTCAACTCCTAATTGATCTTGCTTTCTTTTAATATTTTAAAACTTTTTCTTTTTTTTTTTTTATTTTCAAAAGTTAACTTTGTTTTTAGTTTCTAAAAAATCGATATATTTGATATCTAATTGAATTTTCTATTTTTTCAATTTTTAAAACATTTTAAAAAATGTTACTAGATTTCAAAAAATTAAAAAAAAAATTATTTTCAAAAACAGATTTTTGTTTTCTAGAATGCATGTGAATTCTATATATGATAGAACGATGACTACTGACATATTTGAAAGAAAAAATAAACACTTAAAAAGTTAATCAAACCAATTTAAATTCAATAAAAACTCTCATATGTGATTTTCCGGTTTGTAATTATTTTACAAATATTAAAAGAGAATATTTGTCTTGGAATATTTCTTTTCCATTTTATAATATCGACTCTCAAATAAATTCTATTTTTTCTTTTATTTATTTTCCATTTTCTAATAACAATTTCAAATTTTTACATGAAATTTAATGGTGATAATATCGATTTTTTCCCACTTTAGTTCCACATCAATTATTGTTTTTTTTAACAAGAATATCGTAAAGTCTTTTTAAACAAAATTTAAATTCAAAATTAGGGAAACAAGATAAATCGAATTAATTGTTTTGATTGTTTACTTATTTACTTATAACCTGCTCCTAAAAAATATCTGCTTTTATTTTATTTTGGAGATTTTTATATTCTTATCATTTTTCTTTATTTGATTCGTGATTAATCAATACATTTAAAATATTTAGATTATTTTATTGAAAATTATTATAAATAGAAAAAAATATCAAATTATTTACAAATATACTAAAATTTTACTTTCTACTAGTGACATACCGCGATAGACCTATATGGTAAAGTGATAGCTGTCTATCTGGATCTATCGCGAACTATCACATATAGAATGTAAAATTTTGCTATACTTGTAATTATTTCAATAATTTTTCTATTTTAAAAAATATCCCTTATTTTATTAACATGATATTTTTAAAAAAGTGAAAAATAAAGATATAATTTTACAATTATTATAAGCCTCTTTTCAAATATAGCAAAAGGAACAAAAATATTTACAAATATAGCAAATTTCACTGTTCATCCCTTATAGACCGTTATAGGTTGCAAATATATGTGTATTTTGCTATTAGTTGTAAATATTTTCAGCAGTTTTGTCATTTAAGATAATTTTCTATTATTTATATTAGATTGAAAAATATTGTAAATATAGTTTGATTTTATTTTTGTATAAAAAATATAAAACTAAAAAAAGTAGTCAAAACTTATTTTAGTTTCCTATAAACAAAAATGTGTTTTTAGTTTTTGTTCTTAAAAAAAACAAAAAATAATTATAAACATAAGGTATCTGTTTTTTCAAAATTCAAAACCAAAAACAAAATTTTAAAAATTTAAAGGCATGAGTGTTTATAGACTATCTATTTTCGAAAACTAGAAAAAAAAAAAAAATGGTTACTAATTAAGTCCTAAACTCTTTACTTTTGATTGCGTTTTATCTAATTTTGTGTCCGAATAATTCCTTGTCGTCTTTGTAGCTTAATGCGTAAGTAGAATATATGCTGATTGTTTATTTTAAGATAACATGACATGTTGACTTTAGACAAATAAAATATTAATTAAAATTTATTTTCGTAGACTAACAATTTAGTTCCTAAACTTTAATTTATAATAAAGTTCGTCAATTTTCATTTTTGTAATAATTAATATTTAAATTTTAGTATGTAACAATTTTAGTCCATATATTTTAAAATTTGTAACATTTAATTCCTATTATGAAAATTAGTGTTAAGATTTATTGCATAAGTAGATTAATTAATTAATTAATTAATTAGGGACCTAATACGTTTATATGCTACAAAGTTTACCTATATTTATATATATAAATTGACATTTC
CTGGTAAAGATTAAATTGTTATAAATTTGAAAATACAAAAACTAAATTTTTACATAATTGAAAGTAAAGTGACTAAATCATTCTAAATAAAAATTAAAGGACAAAATTATTATAAATTTAAGTTTAAAGACTAAATTGTCACATTGATAAATGTTTAAGGACCAAAAGTATTTTTTAACCTACAATAATTTTGTAATGCAATCACTAAACTAATTACAACATTAACAATCAAAGATGTAGTGGATAGAAGAGACTAAATTTATAATTTAATTAAAACTATATTTTAGGCTTTTAGTTACTACTTGATGTTTAAAACTACTTAACAAGATCTTGAAATCGACATGTTTTTCCCTTTTTTCTTTTTTTTTTCTTTTTTTTTTGAATTACTTAAAATTAAGTCAAATTGTTATATACATGAGTAAAAAATTATGAATGAAATGCAGCTCTGGTGGAAAATGTCTTTCAGTAGAATATACTGACGAAGTTGGAGAATTAGCTAAGAAGCATCACCTCAAACTTCACATTGATGGAGCTCGTATTTTCAATGCTTCAATTGTAAGTATAATCTATAAATATTATCTTTTCCTTATTCAATCATTTTCCTTTTTTTAGTTTTAGTTTCAGTTTTAGAAGTTGTTTGTTTCACGGGCATCTTAAATTTTCATGGAATGAACATTGCCTTTTGCTCAATTTCTATCTTAAGATTCTGAACATTCTAAGATGCTTTACTGGGATTTAGAGAGTTTTAGGTGTCCTCTTGAAACATGAATTTTGCAGATTCCCAGGTTGAGGGCATCCATATTTTAATGTTTTGCTCTTTTTTTTCTTTTTTCTTTTATTTATATATATAATTAAAATAATTAAAATAAATGTCACTCTTCTTTATTTGCATTATTATAATTAAAATAAATATTATCCGTCATATTTATCACTTTCCTTGGATATATAGTAGTGTTTTTTCTTTTAGCAAGATATTCTCATTCAAATTTAAATTTATTTTATGAAAAAAATCTTAAATTATTGTATATAATTTTGTTAAAAAAATAGCATATAAAAATAATATAAATTGTATTGATAAAAAAAATTCAAGTTGATGAAAACAAATATGTTATATAAGTTTAAATTATATTAAATTAATTATAAATTAATAAATAATCGGAGCATTCTTGAAATATTAGGTAAATTAGAAATCCTAGTAAACAAACATAGTTATCAAAGCATTTTTTAGATATCTGCAAAAATATTGATACAAACAAATAAGGTTATCTAAAGATGTTAACCGTTGTAGTTCTGGACATCTATAATTTTAAGCATCTAAAATTTCTGCTTATCTATATTTCTAGAAAACAAACAGCCGCTTAAGTAATATTTTACCAACAACATTTTCGTATACAATAAAAAATATATATTTTCAGGCACTTGGTGTTCCAGTGGATCGATTGGTACAAGCGGCTGACTCAGTATCTGTATGTTTCTGTATGCTCTGAATGTTTCTCTATCTTTATTTCTTTCTGCTAATAATTTGTTTTATTAAATATGATTTGGAATTTATGGAGGAAATTCAACCTTTAACAAAATACAACATTATATCAAGGTTTGATATTGGCTCATCAGGTATGTCTATCAAAAGGTTTGGGCGCACCTGTTGGATCAGTTATTGTGGGTTCCAAAGACTTTATTACCGAGGTATAGTGGTTGGATGTTTTTTAGACATATTAGTATTATGTTTGAGAATGTGTTTGAATTGATTTCTTTAAATAAAAACTTCAATAACCCATTTTTGCAATTTCAAACTCTCTTGAGACATAGTATGGCAATCTAGGTTATTATATATTTGGAACTTCCTTCCATTGACAAAATTAATCCAATTTAAAAGGGGGTGAAATATGTTTAGAAATTAGTTAGAATGGGTTTGGATGAATTTTGAAAGCAGTGCTTTTAAATGAAGAACTTTTTCCTATAAGTCTTTATGGAAAGGGACAAAAGAAGTTAGCACTTGC
TATTAAGTGTTTTTTAAAAGCATTTCTTTTACTATGTTACCAAACATCATGCAAACAGAATATGTGATTTATCTTGCTAAACACATTTTGTTTTACATTTGTGTTAAAGTAGTCACCAAATTTCCAAATTTTCGAATGATCAATCAAGTCCCAACAATTAGAAAATGCTCGATTTTGATTAGTCAATTAACAGAAATTGTTAATGATTAACGAATGTATTGACATGGAATGAGAAAAAGAAAGAAAATCTCTTTGTCCCCTTGTTTCTCAAGACTTGTTCTTTTTCATCTAAAATTTGTATTGGTTCTTCTTCATGAGACAAATCTTCCTTCAACTATATCGGTTGCACTTCAAGCACATGAGTAGGATTGGGAATATACTTCCTTAACACCGATACATGGAAAGCATTATGGATCTTGGACATCTCCATAGGTAAAGCTAACCTATAAGTTGCTGGTCCAACTCTTTCCAAGATTTCATAAGGTCCAATATATCTCGGACTTAGTTTCCTTTTTCTCTTGAACCATAACACTCCTTTCCCGGGAGACAACTTAAGGAATACTCAGTCTCCTACATCAAACTCAAGATCTCTTATTCTGTTATCTACATAACCCTTCTATCTATCTCGAGCTATTTTCAACTTTTCCTTAATTATCTCAACTTTCTCTGTCGTAGCTTGGACTATTTCTGGTCCAATTAACTTCCTTTGTCCAACTTCACCCCAAGATACATGTATCCTGCACGATCTACCATACAAAGCTTCATATGGTCATTCCAATACTCGCCTAATAACTGTTATTATACGCAAACTTCATTAGCAACAGACGTGCATCCCAACTACCTTTAAACTGCAGTGCACACGCTCGAAGCATATCCTCGAAAGTCTGGATAGTCCTCTTTAATTGTCCATCAGTTTGAGAATGAAATGTTGTACTAAACTGAAACTTAGTATCCAAAGCTTGCTATAAACTAGTCTAGAACTTAGAAGTAAACCTCGGGTCTCTATACGATACAATTGACACTGGCAACCCATACTGGCTCACTATTTTATCCACATACATCTGTGCCAAATTATCCAAGGTATAGGTTGCCTTTACCGGTAAGAAACTTGTTGTCTTGGTAAGTCTATCTACTATCACTCATATACCATCAACACCACTCGAAGTCCTTGGCAACACAAATAGAAAATCCATTGTCACATGCTTTCAATTCCTCTCAGGAATACGGAGTGGATTCAGTAATCCTACTGGTCTCTAATGCTCGGGTTTAACTTGCAAGCACATCAAACATTTGACTACATATTCAACTATTTCTCATTTCATACCTAGCCACCAATAATAATCCCGTAAAGTATGATACATCTTGGTAGTACCAGGATGCATCACATAAGCTGAATTGTGAGCTTCTTCTAAGATAGCCTGTTTAACTTCTGGATCCTTTGGCACACAAATTCTGTCGTGCTTCAACAATGCTCTATCAGCTTTCAACTCATAATCTGGCCTCTGCTGAGCCTTTACCTCTTCTATTAACTTCCTGATTACAGGATTGTTTGACTGCTTCTTAGTTACCCCATCTATTAGCTTAGGTCTTAAATGAAAATGCTCTAACAAACCTCCATTCTCATTTACTGACAAGATAGTAGCACCACTTCTAAACTCCTTAAGTAGAGTAGTCCTTGTAGCATTAAGAGAATTCTAACTACTTTTTGACTTTCGACTCAAAGCATTAGCTACTACATTCACTTTACCTAGGTAGTACTCAATCGTGCAATCATAATCTTTAATCAACCCCAACCACCTTCTTTGTCTCATGTTCAGCTCCTTTTGATCAGAAATATATCTTAGACTCTTATGATTCTTGTAGATATGGCATATCTTTCCAAATAGATAATGCCATCATATCTTTAGAGCTAACACTAGTGTAACTAATTCTAAATCACGAGTAGGATAGTTAATCTTGTGAGGGCTTAACTGTCTAGAAGCATAGGCTATCACATTCCC
ACCTTGCATAAGAACACAACCTAATCCCTGGCGAAAAGCATCACAATATACCTCAAACTCTTTTCCTGATCTAGAAGTGACAACATAGGTGTCGTCACTAATCTTCTCTCTTTAGCTCCTGAAAATTTTGTGCGCACTTCTCATTCCACTCAAACTTAACACTTTTCTTCGTCAAACTCTTGAGTGACAAAGGTATTCTCAAGAACCCTTCAACAAACCATCTGTAGTATCCAACTAACCCTAGGAAACTACATGCTTCTATCACCGTTGTGGGTCATTCCCACTTCACTATAGCTTTAGTCTTCTGTGGATCAACATTTACTTGTACTACTGAAACCACATGTCCTAGGAACACTACCTATTCCAACTAGAACTCACACTTGTTGAACTTGGCATACAGTTTCCGTTGTCTCAGAGTTTGTAACACCATCTTGACATGTTTAACATACTTTTCTTTATCACCAAAATACACCAAAATATCATCAATGAATACTATGACAAACTGATCAAGGTAAGGATGAAATATCCTATTCATAAGGTCCATAAACGCTGCCGGTGCGCTTGTTAGTCCGAACGACATCACTAAGAACTCGTAATGCCCGCATCTAGTCCTTCCTGAAAGCTGTCTTAGGAACATCTGACGCCCTAACCTTCAACGAGAACATCGGAGCTCCTCTAAGCTGAATAAACAAATCATCTATTTGAGGCAACATATACTTATTACGTATTGTTACATTATTCAATTGATGGTAATCAATACATAACCTCAGAGTATTATCTTTCTTCTTGACAAATAACACCAGTGCACCCCATAAGGAAACATTGGGTCTAATATAAACCTTGTCCACCAATTCCTGTAACTGTACCTTCAGTTCCTTTAACTCCGTTGGTGCCATCTGATAAGGTGCTTGAGAGAAGGAAGTTGTACTTGGAAGCAGGTCTATAGTAAACTCTATCTCTTGATCAGGTGGTAATCCTGACAACTCTTCTGGAAATACATCTAAATACTCACGTATCACTAGCACGTCTTAAGGCTTCAACTTACCTGATTTAACTTCAGTCACATAGGCTAGGTACACTTCACGGCCTTTGCTCATCATTTTCCTAGCCTTTACTGCAGACGTCAAGCAAGTAGGAAGAATTTTCCTCATTCCTTTGAACATAACTACTACTTCTAACTTTTCAAACTTCACTTTCTTTTTAAAACAATCAACATTAGTATGATACTTTGACAGAAAGTACATGCCTAAAATAACATCAAACTTTGCTAACTCTAAAGGTAGCAAATCTATCGACATAACTACACTATCAACCACTATCTCATAATTTCGGTAAACATGCTCTATAACTATAGCATTAACAGCAGGTGTATGTACTAACAGTGTGTCAATCAGTGGCTCTAGCTTCCTATTCACATGCATAACAAAGGTACTAGAAACAAATGAATGCGTGGCTCCTGTATCAATCAACACATAAGTATTCATATTACAACTAAATAATATACCTGTTACAACATCTGGTGTCTCCTGAGCTTCTTGTTGAGTCATAACGTACACCTTTCCTTATTGTCGTGGTCTGCCCACCTATCCTTTTTGTCTCACACTTCCACTGCCTTCACCAACCACTATCTCGGGTCTTGGCTGATTAACAGTCTGGGAGATCGCACGTTTCTCAGCTTCATTTCCTGTCTTCAGCTTGGGGCAATCTCTCCTATAATGCCCTGTCTGACCACAATTATAACAGACGTTGGCTACACTCAAACACAAACCCTAGTGGCATCTTCCACAAGTACCACACATGAGCTTCCTAACTGTACTAGCAACCGACTCACTGGGTTGCCTCGACTGGGAACCACCGGCTAATTCAGACATCTAAGTGAACATCCTTTGGATAACGGTTTAAAGCTCCCCTTTCCAGACACGCCAGGAACAAACCATCTACCTTCACCTCAACCAAACAGCTCTGGAGACATACCTGACGGTTGCCCTGATCTTCCTCCT
CTATGAACCTCTTCCTAAGCTTTCTCTTCAGCTAAACTTTTTTCAACTCTCACGGCAGCTTCTACCAACTTTGCAAAATCAGACCATTCGACACTCGCGATCACTGGAGTCTGAATCTCCTTTTGCAGGCCTTCTTCGAATCGTTTGCATCTATCGATCTCATTAGCTATGACTGCCATAACATACTTTGCTAACTCTATATACTTCTTCTCGTATTCAGCAATAGACATAGATCTTTAAACCAATATCAGAAACTCATTTCTCTTAGCGTCACAGAAGGATCGAGGATAATACTTATCCTGGAAAGCCTTCTTAAAATCACTCCCCGTCAATGCCTTAGCATCTCTTTTCCTGCTCTTTATCAATTTCCACCAATGCTTGACTCCCTTATGAAGCAAGAACGTAGCTAACTTCAACTTTCTGTCTTTAGGACAGCACATAACCCTGAAACACTTCTCTACCTGATTTAACCATACCTTTACATCAGCAGGATCTGTGGTACCCTCGAACCCTGCGACACCTAATGCTTTAAGTCTCTTGATCCCATACTTCTTCTCATGATCAGCTTGAACTATCTCAACACTAGTAGCTAACCTCTGAGCTATCTTAGTGAATATCTATTCCTCCATGTTTACACATGCCTGAGGGTTACTAGAATCTCCCACGAAAGCCTCTCTAGAACCTCTAGTAGCTTACAAAGTTTCGGCTTCTGCTTGCCAACCTCGCCTGTTCCGTCGGGGATGTTTCCTATAATACATCATAATATGAATATATTATAGACTAACGTGCTCTTAGCTACGATGCATGACCTAACTTCCCAAAATCTTATGCTCTAATACCAACTTGTCATACCCTCTCCCAAGTACCCTTTTAACCTAGCAAAGAATGTGAGGACAGCAAGTATCGATCCTTTTACAACACTTACTATCAAACCTGTATTCTTTCTTATACTCACAACATACAACACCTGATAAACACATAACAACTGATAACAGAAAGCATTCTCATAACGATAAGTAATTCACAAGACTAGTTTGATAACATTCTCCAACACATCCTTGTACAACACATTTACATAATTTAAATTCCAACCTTTATGTACAATGCATGGTATGTCTTAAGTACAAGTACGTCCAAAGTAATCTACAAACTAAAGTGATGTAGTAAAATCAAATGAGGGTGGGTAGGCTAAGCACCGCGAACTTTCTGCTACCTAGAAGAAAGAGATTTTAAAGAAAAATGTGAATTGGGTTGCCCAGTGAGTAACATAATACTAGAAATATAATCTTAACATAAAACTCATGCTCATGCTTTAAAGAAATCAAGCTCATAAATCACGGAAGAAAGTCTTTAACTAAAACTCATAAATCATGATCTTTAAATCAAATAACCTTAGTTGAGGTGGAGAAAGTCAACACTCTCAATGTCGAAAACATCCTTAGCTGATGTGGAGTATTCTCAACACAACAACATCAACTCTCTCTATGTGCACATAGGAATAACTAAATCTTGAGCTGTCCAAGTATCTTCTTAATGTCCTTTAGTATTGGGTCCCAAGGATACTCAAATGTCTTCTCAATGCCCTTCAGCATTAAGTTCCTTTCATAATAATCTCAAGGTCACCCATATGCCTTCTCAATGTCCTTCAACATTAGGTCCCTTGAAAACTTTTATGAAAACAAGCATGCTTTGGAAAGTAAATATTTCAATATACAACTGTTCATGGAAAGCTTTAGAATTCTCAAAACTAGCAATAAGTAAACATGATTTTTCAACCTTTAAGTAAACAAGTAAAACATGCTTCTCAAACTCATCAAAATCAAGTACGAATCATACTTTCCAAACTCAAACTTCACAAGGTAAACATTTAGGAAACAATGCTAATTAGTAAAAATTGCCACTCACTACTTTGCTCATGCTCTAAATGGATTATATATCTCCCACAACTCCTCTTGGCTTGAAAATAATTTAACTTACTTAGTTAGGCTTCTTTATAACAT
CGTTAAATCATAACTCAAGCCATTTAGGCTTTGAAGATCAAGTCTTAACTCAAATTAACTCACCAACACCCAAATGATTAACTCACCAACGCCCAAATGATTGACTAATAGCTTAAGTGTTGTAGGATGGCCCTAGGCATGCACCTCAGCCACATACACATAGAGCATGCCCATTGCCTGCACGCACAAGCACAACACACTGTCACACACATACGAGCACCATGTTGGGCAACAACTCTCGCACAACCTACTGCCCACAGGCGCGCACACAGTTGGCCTATGTGTTGAAATGGCCTATGTGTCTAACGCTTGCCCTTCCTTGGCCACAAGCACACTCAGTCACAAACTGCACACCTTCTCTACCTAGCATGTCTACCCGCACACCCACTACGCATCCAACTTGCCCAACTAACCTCTAAACAACTTTTTCAGCCCCTAAGCCATCGACAGTCCACTGCACGTCAACTTCCTTCAACCTTTGGCCAGCTCACCCGATAGCCTCAACTGCATAACCTACATTTTAACCCACCAAAATCAACATCTCTAGCCAATTTTAATTCCAATTTCTTTCACAAACCTCAAGATGAATCTCTTAACTTGCTTACCTTTGCGTTTGAGCTTCTAATTCCTTTTCTATAGTTTGAAATCCATCAATCTTCCAATGTTTCTCTAGATTACCTTAGAGCATGATCAACCTTCCATCTTCCACAAATTTCAAGCCTTTGTCGTTGCTATCTTTCTCCGATAGTCCTCACACATTAGGCTCAATTTGGAGGATTTTGGTAGAATAACTTCTTAATTGGCACGAGTAGATTTTGCAAACCTTTCTCCTTCATCCAAGCTTCTATTTATAGCCCTCAAGTTTTCAGCATGCAACACATGAGTTTGTCACCTCCTACTTTAATTTTGCCACACTAAATATAAAATTGCACCTAATTAATCTTATTTATCAAATAAATATCATAATCATCAAAAAGTTAAGAAGACTTGGACACCTACCTCAATTTCATCACCTAAACGCATGGCTCACCAAGTCTTGGAGGTTTCTGGCTTGTCCTCACTCAACACCCGTAGGCTTCCTTGCTTGTCAACAACACTTGGCCCTTGCCTCTCCATGCCTACTTGGCTTAGGTCATTCTCCATGCGTGCTTTTCCCCAACACCTACCAACATATGCCTTTTCCACGCACGGTTCTTTCTTTCAACGCCCAACTCCTAATGGTTTGAGCTTCACTTGCCGTCTACCCTTAAGTACTTCCTTTTACACTCGGCTAACACAAATTACTCACAAACCCGAGACTGACTTGATCGTCGAGTGTGTGTGCGCTTGACACCACATCGATGCCTAATTTCTATCGTCTAGTAGGAACTTGGATCATTTTCCCTGGAATACAAATTTGTGATTGTTCCATTTTTTACATCAATAATTTCTATTAGTTAAAAGGCCAAAATCGAACATTTTCCAAATTTTGGAGACTTGATTATTTGATTTTGAAATCTATAACATAAATATCAACCCTCAGACCTCAAAATACAACCCACCCAAACTTTTTTCACTACTTCAAAAGTGATATTGATTCAAGTCACCTTACAACATTATTAAGTCTTTCCCACTTTTTTAAACGCCTTTTTATGCAAATTTGTTCTTGAAATATTGTGCTTAATAATAGGCCACAAGGGTTAGAAAAGCATTGGGTGGTGGAATGAGGCAAATTGGCATCCTTTGTGCAGCTGGACTTATTGCAATAAAAGAGAATGTTCAAAAACTTCAAGCCGATCACGAGAAAGCCAAGCAACTAGCTAGTAATTAAGATCACTCTTTAATACTTACCAACATTTTCAATTATTTTCACCTCTTTTTTCATATATATATATATTTTTTCTTTTCTTTTTTTGATTAATATAGGTGGGTTATACCAAATCAAAGGATTAAAGGTAGATCCAAAATCAGTTGAGACAAACATTGTGAGTATCTTTAAACATGATTAGAATTTTTAGTAA
TTTATTATGGTGTGTTTGAGTGTATATTAGTTTTTTTTTTCTTTAAACCACGCAAAAGTGTTTTAAAATAGTTTTAAAAAGTTTGTTTTTAGGCTTAGCTTATGTTTTGCTATAATAATTGTATTTGGTAAAAATTAATCTGAACATAACTTAGGTAGATAAAAATATTAATTATCATTTCAAGGTTGATAATTTTTTAATCTCCACTTGTGCAATCATTGAACTAATATATCTTCCTCTCTAATATATCTATTTACACACTTTCTTGTTTCTTTCTATACCTATAACTATCTTGTTACTCTCTTTTTACACATCTTCTCTTAAAAATTCCATTATCTAGGATATATATCTACTGTTAAAAAATTGGCAAAAAAAAAAGATGAGAGACAATCATTTAGAAGAAAAAAAACTTTTAGTAAGAAGTAAAATTATGTACACTTTTAAAAGAATACTTTTTCCACTCTTTCAATACTCCTAAGTTTGTCTACCTTTCTCTCGTAAGTTTGTCTACCTTTCTCTGGGTTTGCTTTTTATCTTTCTTTGCCTTATTTACAAAGGGAGACATAAATCCTATTTATAGACGAGAGATGTCCTTGGAGTCCAAGCAAAATGATCTTTCTAGGTTCTTGGATTCTATGGTGGAGCCTAAAGAAATGTTAGAGTGTTTTTAAAGCATTTCTCTTGAAATACTTATCTACAGTCTTCGAGATAAATCTAGATATTTTTTATATCTACATTTAAGTTATTTAAGTAGATAATTTAAGAAGCTTCTCTAGAATTTATTTTTTTTTAATCATTCTCTAGAATAGTTAATGTTGTCCTTTACTGGAATACTCCCCTAAACGATAACTATTCTAGCTCTTTCCATTTTTTAGAATTGTTTCCAAAATTGTACATTAGGGGTATACATGGTTTGGGTTGGCAACCCGAAAGACTGAAATAAAGTCTCCAACCCAATACTACAAAATATGTAAATTTATCTACAACACATGACTAATAACTAAAATCTCATAAAGTTCAAATGTTATATAATAAATATCTATGATATTTATTACAAACTTTAAAAAAAGACAAATATCATAACAGTTTTAAAAGAAAAATATTAAAAATTCACAAATTAATCCATAGAAAGTAATCATAAACTTTAAACGATATATATAAAGAGAAATAGCTTCGTTCCTGGACTTCTTTGCTCAGTTAAGCTTCCTTGTTCCTTAGTGTAGTCTCGATCCGTAGTTTAAGACTCCGTTCCAACTAGCCTACCTAGTACATATCCACTGTTGACGTGACTCTGTATCGACGCCTTTCCCTATATATAAAGAATTTGGGTTGGGTTGGGTCAACCCAATTTTTTTTAGCCAACCAGCGACCCAACCCAATCCAAGAACAAAACTAACCCAACCCAACCTTTACATTTTGAGTTGGGTAGTCCAAGTTGTTTGGGTTGTCGAGTCATTTGAATACCCGTACTGTACATGCTTGGGTTCTTTTAGCCTTCTGGCATTAAGTCCCAACTTCGATTTCTTCTTAATGCATCAACTTCTGTGGCTATGATTATGTTCTTAGCCTCTGTATCCTTAATTGTTCTTCTTCGTCTTCGTCGTTCTCTTCTTCTCCTTCGATTGGAGTCTATTTCATTTGACTCATCTCTTCTTTTGATCCTTCGTTGAAAAAACCTCTCATCCTTAAGTGTCTGTTATCATTCGGGTGCTTGCCATGATGATACGTCATCAAATATAACATTTCTTAATGATAACATTGTTGATTAACATAATCAATATACTCTATCCTTAGGTGTCCTTGCTAACATGACACATAGAGTCCATTCCTTGTCCTTCCATTAACAATATATCACTAACTTTGAGATCTTAGTACACCTTAACATTGTTAGGTTCAAACACAAGGAAGTTGTCTTCTAATGTTAATTGAGATATTGATATCATATTCTTCTTCATTCTAGGCATAGATAAAATTTGTTCTCTAACTCCACTTGAATGGAA
ATGGAATGAGGCGTTATCATAGTTTTACTAACTAGAGCTATTTGAAACTTCTACTTATTTGCAACTTCGACAACTTGACCTTCCTTATGCTCTAATGTGTTTAGTAGTTTCCTTTTATCACCATGTCCTGTGATTCGAGCAACTTGAATCAACAATCCAATCATTTTCATAATTTACCTGATCTTGATGAGAGGTAACTCTTTCTTCTCCACCACATTGGATGTCGACTAGGGGTTGGGTTTTGCTATTGCAAAACATGTCACTGCATCACACTATTCTTCACTAATGCTGTCAACCTAGGAGGTAATGACGTTACTTTCTGTTGATTTTTTTCTTTTGACAACAATCTTTGGCATAATGGCCTACCTTTGCACAATTATAACATTCACCACATTTTTTCTATACTCACGTTTTCTATTATTTTTTATTAGTTTCCCTTACTTGAGAGCATTTTGAGGAATTTTTTTCATTTTCTTTTGATCTTACTTTCTTTTGATATTTTAGATTACTTTTCTTTGGCCACTAAAGAGGGCTCCTTCATTATTACTCTGTAATGTGACATCCAATATTTGCTTAGTCAATACTTCTTGACTAGTAAGCATATTTTTTAAGTCAATAAAAGAAGGTTGGACTGCCGAACCTTGAATGGTAGCAATAAAGCTTCTATATCAAGTTTAAGTTCATGAACAATAATTTTTCTCATTGTCAATTCTAAAATAGTAGTTGTAGGATCCAATTCAGAGATTTCATGACATAGAGTTTTTACATTGGTGAAGTATTGATTGATAGTCAGATACCTTTGAGAAACTATTAGGAGCTTGTTCTCCGGAAACTAAAATCTCGCATCATTTTTCTTTGAGAAAAGTGAGGCAAACATGTCCATACTATCTTTGGCATCTGCACTATATTGATGTGCTCCAACATTTCTTCATCAACTATAGTTCTAATTGCAAACATGACCTTATCTTCCTTGATATTCCATTTCTTAAAAGCAATAACATCATCAGGTGGCTTGACCTTAATGCCTCCCACAACACCCCACAAGTTTTGACCTTGGAGATACGACTTCATGCGTGAACCATGTTTTGTAGTTTTGAGTAATGAGCTTCTTGATTTCTCCAACAATTTGAAGGTTTCCCATCATCTTGTAGTGATGTGAATCCACTCCACAATTAGTTATGCCAAGATAACAAGGTTCATAGTTCCAGACTTTAGACAATCTTCAGAAGCTCCACCACAACAAGTTATGCTAGTTTGGCTATGATACCACTTGTTGAAGAAATTGGCACCAAAGGGACAAGAGACAGTCACTTGGAAAAAAGAAACTTTTATTAAGGAAGCACAAGTATGGACACTTTGCTTGGTTTAGAGATGGAGACGTAACTCCTATTTATAGGCTTGAGATCATTGGAATACAAGCAAAAGGATATTTCTAGGTTATTGGATTCTAAAATTATTCTAGACATTTCTATGAATTATTCTTATCTAAAGAAATTCCCTAGAATTCATTGAAGCCTAAAGAGATGTTAGGATGTTTCTAAAACATTTTCTTTTGATACTTATTTGCAATCTTCTAGATAAATCTAGAGATATTTAATCTATTTACATTGAAGTAATTTATGTTGATATTTAAGGAAACTTCTACAATAGATAGTGGTGTCTTCTACTTTAACATTTATTGAGTATCAAATTGAATGATGGTGAATGATTAAATTGAATAAAATGTTCTAATGTAAGAGATTTAAGAGAAACTTTTTAGACCATGTAGACTAAAATGGACGTCAACTCCGAACCTTAAGAAGTGTGACATACTTTTTTTAGACCATTTACAAAACATTAACCATTGTAACATATATTTGACCAATTAAAATAAAGAAGCAATTGGAAAACTTTTGTCTTAGAGAAGGGATAGAAATGATGTTTTAAACTCTAAGAGATTACAAGATGAATTAACTTACCCATTTAGTCATAAATTTTGAAGTTTGTGTTTATTTTATTTC
AGAATTTTAAAAAATGTTTAATAGATGCTAGACTTTTTGTTGAGTGTCTAATATGACATGAACAATCAATTTTTTTTATCTATTAACTCGCCGAAGAATAAAAAAAGTCTACAAATTTAATTGATTTATTAGATATAAATTTAAATTGTGTGTTTAATGAACCATGAACTTTCTTGTTGGCGTTCAACAAATCTTTTGATTTAAAGAAAAAAATTTAAAAGACTAAATTTGTAATTTTTAGAAACAAAATAAAGCACCAAACTTCGAAGATCATTTTAATTGCAAACTAAACCAAAAAAATGATATGTTTAATTGCAATTTGGCTCGACATATCATATGGTGGGATCAATCTAGAACACCATTTTCATTGAGTATTTATATATTGAAAACTTTTCATCTTTTGTGGATTTTATTTTACTGTTTATTAAAAAATCTAAAACTTTTATTTATTTTATTTATTTATTATATGATCAGATATTCTTTGAAATAGAAGAGGATTACGGAATCTCACTGGAAACACTATGTAAAAGCTTGGAAGAACGTGGCGTTTTTATGATGCTAGAAAGCCAAATAAGGTCTATCCTACAATTCCTTAAAAGAAAAAGAAAAAAAATAAAATAAAATCTAGCTTCCATATTCGTAGTGAGTTTGTGATAACTTTGTTAGCTTTTGGGTAAAATGTTTAAAATGTTGAACAAAAATATATATTTTTGTAGAATGCTTTTTGGTCAAGTGTTTCATTCGGAAACATTTAAAATAATTCTTGAAAGCATAAAAGTTAGAACAAAATGTCAAAGTTATTTACTATGCACACTTTTCATTAAAGTGTATCTATTCAATATTTATATACTAAAATGTCTGTTTTTCCTAAAGATATTTTATGCTTTGAAATTTTGAAAGAAGATTTTGTTTTACTTTTCTTGTTTCCTAATTACAATTTTTATTATATATATATATATATATATATATATATATATTATTTTATATATTTGCTAAAAAGATCAAGAAAATTTCTTTTTGTTCTTAATTTAAAACTTGAGAGAACAAACAAACGAAAATTACCAAAAAAAAACATATATTTTCTTTTTGTTCTTTTTGAAACTAAGTGTAAATAAAAAGTTAAAATTTTTAGCTAAGAAAAGTCAGATGTTTATCCAAAATAAAAAAATAAAATTATTGGTCATCAACTACGAGTCAATTTGACATAATTTTTTAAATACTGAAACATTTTAATTAATTACAAAATAAAATTAAATAAAAATGCTTTTAAAAGTGGGTCCTATTACATCACATCTGTTCTGAACAATAGTAACTTTCTACTCGATCAGAAAGGTGATAGATTGGAATCGCCAACAAAAATATTTTTAAATAAAATAGGTTAATTTGGTTATTGTATTTATTTATATATTTAGCAAACAAGTTTATTAATTGCAGGGCTAGAATTGTTCTTCATCATCAGATTTCAACAAGTGACGTGCACTACACTTTATCTTGCTTTCAGGTGATCTATTTCTAATGACTGATTCTTGGTTTTCTTTTCTTTTCTTTTTTTCTAAAATAAAAAATATAATATTTAAAAATATAATAAATCTATTAATATTTAAATAAATCACAATTTTTTTCTCGAATTGGTTACTTTAGTAAAATAGAAAAATAATGTAAAATCATATATCTATTTTGTATTTTCTTCAGAATCTGGGAAAAAAAAAAGTCTAAAATAACTAGACAATTCAAAACAATAATTACATTTTCTTTGTTATCAAGAAATTAACATTTGTAGAAGAAGCAACGTCCCTGTTTTTTAAAATAACTTTTAAGTAACATATATTTATTTTAATAGAAATAATTAAATTACAAGCTTTTTTCAGTTACTTTTACTAGGTTTTGTTTCAAAAATGTCAAAGACATTCATCACCTTTCAATGATGCATTTAATAAATCTATGACATTTACATACATTTTATAATTAAAAAATCTAATTGAATTATATGATTGCATTTA
AGATATGTCGATTAGATCAATAACTTTATTAGAAATAAAATTAAAAATTTAATAATTTTTTAGACATATAGTCGAAAGTCATATAATGTATAGAATTGGAATGTCAAAAGTTTATTAAATTTTTGAAGTATATATAAGAACTAACAATAAGTTTTCACATATCCTTCCCCAATTCTTCTTTAAACTCGCAAATGATATAAACCTTTTCACTTTGGCTTTTGCAGCAAACTCTGAGTGGAATTCAAGTTGTAAATGGCAATTAATCACAACTAACTCTTTGTCTCTCTCTCTAATAAACAAGAGTGGAATTTACCCACGTTGGCTTGTGCTTAATATGCCCTTAACTTGTTGCATAACACATTTCTAATCTCTATATTGGGTTGCAATAATATTCTGTGTTTTCTTTTTGCTTTTTTCTTCTTTTAAAAAATAATTATTTCGTCTCTAAAGACTAAGTAACACTCCTAAAAGCTTTTTGCTCTGTTCTGATGGCTTCAATTTTCTGTGGTGTTGGGAATTTCATAACTTGATAGTAAGTCGACGGAGCTACTCCTAATTCATGGAAAGTTGGTCTTCCTAATATCACATTATAAAATATGAAGAGCAGTTCACCGCAAGAAAGTTTACCATCTTTACAATTGATCTTGAATATTCTCCAAAAGTTACTAGTGTGCACTTCAATTTTTTCTTTGGTGTCCACTTTTTCTCCTCCAAAGCCCACATCGAGTTCATCAACTTTTCAATTGTTCTACCCTTAATTTCATCTTGCTAAAAGTGAAATTTTAAAGGGATCTAAAATGGAATACATATAAGTCAAACACGATGTTATACCAAGATAACTGACAAATCAAATAATTTAATAATAAACAATGAGAATAAAAATAGATAGACACAAAGATTGTTAATTTAGTTTGGTGCAACGTCGCTGATATTGCACAAGTGTGTGCGTTGAACTATAATTAAGTAAAACGGTTTGTAAGATCAAGTATCATATTCAAGGAATTAGTAATTCGGTCCTTTTAGCTATTAGAATGTGTCAATTTTATTTCAAGACTCACAACGTGAAGCTGATATTAACTTCAAACCATGAAAGCAAGAAAACAAATTGAATTGGTGTGGACAATCAATTAAGAACATCTCTAGGGTTTAGATTTCATTCGACAAGTATATGGAATCAAGTATTTTTCATTCAGTCAATTTAATCATTAATTTAGGTTTCGAAGCCTATTATCCAAGAACTTATCTCGAGCTCAATCCTTTCTAACACCTAAACATTGATCCTATTTTCATACAATCAATCAATTTAGATGCAACATTAAGACTCTTCATTAATGACCTCTCATGTTTTCCTCTCATTCTTATGCACAAATATCCAACACTAGTCTAACATGCTAGTTCTGGCCTAAAACTCTCTCTTTCGAGCAAGATGTAAGACCCCTATGCCAATTTGATAAGAAAATAAATGGCAATGATCGGATTTGAATCATTAAAAATGTTTGCTTCCAAGCTAAGGTTCTTAAACATGGTGAAGAGTCGTTGCAATTTTTTAAGCTCCAAAATACATTGGGCCATCCAAGAGATATATCTGAAAATCCAGATTCATCCACATTCTTGAATGCATTATGTGCTAATTTAATTTCATGTATTTACATTTATGAAAGAAGTTGAAGGGGTGTTAAATGAATCTAGAATATGTAGGTGAAAGGTTGTATCCTTTCATGGTGCCCTTTCATGTAGTTTAAATAGTGTTGTATTTTTCATATTTGAGTAAAAGTATAATTAAATTTGAATGAATTTTTGATAGAGTTGAATCACTCAAGCTTTTAATGCTTTCTTTGTATTCTTGTGTTTAGAATGCATGTTTTAAAACATTCAATTTATTATGATCGGTTAAATTGTGAAAAGTTAGAAATCTTGAACATAGGAGCAAGTCCTGAAATCTGAACATAGGAGCAAGTCCTCCTTTCTTCGTGATCTTTGATCACCTTACCAACCCAAAT
TTGTTTAACTTTCTTGGGGTTGTTTATCAATTAGTTATTAGGGGTTATTATTATTATTGTTGTGAGGGTTCACCTAGTATAAGGGGAGTTATGAACTATAATTGAGGCAGTTTCCTTTATCACAATCTAACGATGATCAAGTATCTAATTGAGTAAATGCAAGCATAATAGAAAGCCTACGAAATTAAATTAATCAAACTAGTTTACATCAATCATAAACTTGAAGAAGTTCATCCAATCCACAAAACTAGAAATAAAATCAGTCACTAATGTTCAAAATATCCATGAATAGCATGTTGTGCAAAATACTAAGGGAAAAACAAGATTTAGCCGTCAATCTGGATCCATCGCAACCCTCACATCAATGTCTTCCTCTTGATTTATCTCCAGTCAAATCCAATGTCAAAGCTCTAAGATCCCCTTTGAACGACAACCTATTAGCCTCAATGAAAAACCATAAGCATAACTCCAATTGAACTCCCTACCAAATGACGAAATGACAAATTAAACAAGTGAAAATTTTGTCCAAGCGCCTTGACACTGGTATGTTTTCCTCCCAGCGTTGTGAAGTGCGCCACGACGCTATGAGTGGCAATTCTCTTATAGTGCTGCTATGCTACAAGACAACACCATGAGCTACCCAATTTTAGGTTTCGATTTGGAGCGTTGAGCTAGCATCGTAGGAGCCTTTTTTGACAATTTTTCATCCCTCGTAAAGCCCTAGGTCAAACTTTGACCTTCCTTAGGATATTTTCGTCATTTTTTACTTCATTAGCCCATTTCAATCCAAAGTTTTGGTTCCTACAAATTCAACAATGAAAAGCATTAAAGAACATAAATTCACAACAAAACCAACAAGACTCTAAGCCCGAGAAAATACTTCTTGAGTGCTTTTCAGTCACCTACTTTAGGGGTTGCATGCGTGATGGACAAAAACTTCACTGGATAAAGAGTTAAGCGATGATAAACCCAAGATTTGTCAAAGGTCCTAGAATCAATCAAATAATACTTATATTACTGAATCTATACTATTCATACTAAAAGAGTAGTACAAGTGGCAAGGTTCGGTCGATCCGTAGGGAAGTGCGAATATTAGTTTCATAAACCCCATTTCTTAGTTAAAGCGTTAAAAGGAGGAGGGGGGAGGGGAGGGGTTGTGTGTGATTTTGAGAATCGAATGATAAACGAGCTAAGAAAAGTAGTTGTTCCCTTAAAGTGTAAAATGACAAGCTATGTTTAGTTGGCACATGACCTACTTAAGCGGAAAAGCCAACGTTAAGTGTCCTTAACTATTAATTAGGAATTTCTTCCTAGTTAAACTATGTGCCGATTAATAAATAATGAAAAGCATTACAAGACTAATCACACACATTGATAACTTGTAGAAATACAAGTTATTATAGTCACTATTTGAATTAATTAGAGTAACTTTAGAGCTAAACGCATGATTTTGGTAGTTGAATTATGTAAAATGTATCACCATGCGTTGGTACTATTTACAAAAGAAAATACTTGAAAAATTATGAGTTTTTTTGTTCTTAAAGCAAGATTAAGAAGAAAAGAAGCACAAATAACCCAAGTAATAGAGACGAAGTTGCTGCGAGAACAAATGGAAACCCATGCATCGAGACTATGCGTTGAGACCATGCGCTGAGAAAAAGTGGATACGGACGCCTGGGCAAAAGCTGTCATGTCATTATGCAAAAGAATAACACGCTATCAGATTATGGCAGAACCAAAGGGCCGACAATGTGAAAAGATTACGCAGCGGGGCAGGAAATTTAATGCAAAGGTGTCAGGTCATCATGAGTTTCAACTTTTTAACGTCTATAAATAGCCCAAAGCTTCTCCAGTCGGAAAAGGAGAGGAGAAAAGAGAGAGAGAGAAACGAGTGCAAGAGGCGAAAGCTCTGCACAAATCCACCCTCACCATAACCAGCCTCCATTTCGAGCTCTTCTGAGAGAAGGATTCTGAGAACTTTCAGAATCGTCTCCCAGTGA
GTGAAAACCTTCACCCTATGAACTTTATGTATCAGATAGCAACAATCTCAAGCCTCCATGAAGCAAGACGTTGCTAGTGGCGGCTTGACGTCTTCCATTTCTTTTTTAT

mRNA sequence

TTTCAAAAGTTTAAAAAATACTCCTAAATTAAAAAAAAAATATATATCATCGTGAGTTAACAGAAACTATTAGTATCTTGTTGAAAAAATAACTCAAATTTTAGAAAAAAAAAAAGTTAAAAAATGCACGTATCATTAGTATGGTGAGCAATTTGTTCGAAGTTTTCTTTAGTTTGAAAAGAATCAATTTTAAAACAAATTATATGTACATCTTTAATTTTTTTTTAAAAAAAATAGCTTATTTTTCTCCTATTTTTTCAATTCTTATACCAATTTTTTTTTCAAGAACCAAAATTTTAAGTTAAAACATAAAATCGCACAAAATTTCACAAAACTCCAAAATTAAAATCTTCCTTAAGAACATACCAAAATGATAAATGGTCAAATAAAAAATATATATTTAACTCTTAACACACACTTAACGCACAACTTTGTGAAAATATTTAAGTCCCAAAATCCCTTGGTGTTTCCATATTCTTTTTCTTCTTATTAGTGCATATACCTTAATTGAATTCTGCAATTTTATTTTTTTATTTATTATTTTTAGATAAAAAATCTAAAAAATTAACAAAAGGCGAGAGTATCAAGGTACTGAAGAAGAAAATGAGAAAAAAAGAGGAAAAAGAGGGAAAATAGAGAAGTAATATGACATTAATGGTCTCTATTCCTATAGAGAAGTAAGATGACATTAATGGTTGCTATTCATATAACTGTAAATGAGAAAAAAAGAGGGAAAATAGAGAAAAAAAGAGGGAAAATAGAGAAGTAAGATGACATTAATAGTTACTATTCATATAACTGTAAAGGTATTTAATTTTTTTAAAAAAAATTTAACGGTAGTTTTTTAAACTTTCAAAATTTATGAGTATTTTTGGCACAAAACACTAACGACTTCTCTCGAACCTAATTTTAAGGGTAGTTTTAAACTCATTTTGGAAGTTTAAAGATATTTTGAAATTTTTAAAATTTCAAAAGTATTTTTGAGACAAAGTACAAAGTTTATAGATATTTTTGTATAATTTAGCCTTAAAACTTTAAAAAGGAAGAATTTAATACATGAGGTTTAGAAAATATTTATAAATGATAAATGAGTGATATTGTCTAACAAAAAAAAATTTGACGTTGTATAATTGAAGTGTGTGAATTGTTGAAAAAATTCTAGTGTCTTTGAGATTGAGTGCCCATGTTCACACTTTCAAAAAGATTTTTTAACGGTGACGATCAATTAGTGCGAGCAGTAAAGTTTTTCCGCCCATTGTTAAGTGACATGGATATTTTATTAATTTTATGTTGATTTTACCTTACACGGGTTCTCTATTTTAAAATTTTAATTTTATTTTCTCTTCCTTTCTTTTCCACTCTTCTTTCCCCACACATTTTGAATCTGAATCTCGAAATGGAGAATTGATTCAAGAGAAAACAAAAAGAAATTGGAGTGGGTGAAAAGAGGGAGAGGAAGTGAGAAGAACAAGAACAAAAATAAATAAATAAATCGTCATTTTTTCGTTAGTCAACAATGATCAATACCACTCAAATGACTATTTTAAAATATTTTCTAAACTTTTGGGTAGAGTGGACTATTATAGTCCATGAAATAAACCACTCAATGTTTGTGACTCAAATTATAATAGTTTGTATTTCCTTTTTTATTATGGTAGATTATCTTAAATGACAAAACTACTTAAAATAATTACAACTAATAACAAAATATACAGTCTATCTGCGATAGACCGCGATAGATCACGATAGACTACTATCTATGTCTATTGTGACACAGACAATAGTCTATCACAATTTATCGCGGTCTATCACAGATAGATAGTTAAATTTTGCTATATTCGTAATTATTAGTTATTTTTGCTATATTTGAAAATAGCCCTTTTATTATAGTGTTTATTATTCCTTACATAGACTAAAATAATTTAACACCCCAAACATAAATTATTATAATATACGAACTATAATAACTATTGACTATAATATCTAATTGATTAGCATCC
CCAACCACCCTTTCCATACTCAAGTTTTAGAGACAAAAAGTGTAATTTATTCTTATATTTGATAAAAAAGTCATTGTTCTATGTCTAATGACCAAAACTTATCAAGTGATTAATCAATTCTCTAAATATTTGATAATTCACATCATTTAATAGACTAAAATTCCATTTTAATATCAAACAAATTAAAGGATGGTGTGAATTTAAGTATGTGTATTAACGTGCTAACGTAAAAATACATTAAAGCCACCTAAGCTAGTTGAAGTGGACATTGAATGATAAGACAAAATGAAATTCATCACAAAATAAGGGAAAGACTTGTTATAAAATAGGTGAAAGTTAAAAAAAAATAAGAATATGAGAGAGTACAAAAAGAAGATGAAAGGAAGAGAAAATAGAGAGAATATTTAATTAACATTTAGCTTTTGGATGAGTGTGTTTTATATAAATGATCTCTACAATGGTAGAAGTGTAAACATAATGTATAAGATCTACCATAAATAAGACTAAAACTGACAAAGTGAGCTTTCTTTAAGGGTAATTGACGTGACATTCTATTTTAGAGGTGGGGTGATTTCATTTTTAGTTTTCTATCTCTTGAATTTTGGAAAATTATTTTAAATGACAAAATTGTTGAAAATATTTACAAATAATAACAAAATATCACCATGCATAACAGTGAAATTTTGCTATATTTGTAAATATTTTGGTTCATTTTCCTATATTTGAAAACAATGTTAAATTTATGCTTATGCCTTACAAATTTCACCATTATGTTTTGACTTATCTCACAAACGAAACTTGGATTTGTAGTCAAAATTTAAAACTAAAAAAAAGTTTTGAAAACTACATATTATTTTTAGTATTCAAAACTTATCTTAGTAAACATAAATAGAAAGTATATAACAAAACCAAGAACTCGTGGTAAAAGTAGTGTATAGTAGGGGTGGAAACGGTTCGATTTGGTGGTCAATCGAACCAAACCGAAAGTTTAAAAATTTTTATATGTGAATCGAACCGAACCAAACCAATCTTTCGTTTCAAACTAAATCAAACCAAATCGAATACATGCAATTCGGTTCGGTTTTAAATTCGATTATGTTTTTTGTTAAAAAAGAATATGTATATATACATATTTAATTTAAAATGGTACGGTTCGGTTTGGTTCGATTTTAAATTCAATTTTGCTCTTTAAATTAAAAATGGTTTAGTTTTGAATTCGATTTTGCTCTTTTTCTTAAAATATACAAATATATATATATATATATATAATATTAGTGGTTCAGTTTGGTTACAAATTCAATTTTTGAAATTGAACCCAAATCAAACCGAATTGAATAATTGGTTTGAAGAAATATATATATATATATATATATATATATATATATATATATATATTAGTGGTTCAGTTTGGTTACAAATTCAATTTTTGAAATTGAACCCAAATCAAACCGAATTGAATAATTGGTTTGAAGAAATATATATATATATATATATATATATATATATATATATATATATTAGTGGTTCAGTTTGGTTACAAATTCAATTTTTGAAATTGAACCCAAATCAAACCGAATTGAATAATTCGGTTTCAAGATTTCTTCAAACCAGACTAAACCATATTTTTTGGTTCATTAAGTTTTCGGTTCGGTTCGGTTCGATTTTGAAAAATGATTCGATTTTTGCAGTTTTAATGACCATCCCTAGTGTGTATTAGTTTCATTTTAAAAACAACAAATAATAGTTCATTTTTTTATTGATTAGGTAAAACTTTTTTTTATTAGTTTACAATAACATCTAACTGTGGAGAGATTCAAACTTCTAATTTATTGATCAGGGATATGTACTTTAAGCAGACGATGATGCTCATAATAAGAGCTACAACAAACTTTAAAACTTTAAAAATATGTTAGATTACCAATTGGGTCTCTATACTTTGAAAAAGTTAGACTTTAGTTCCTATAAGATTTTAAAAGTTAGAATTTAGTAATCTTTAGAGT
GTGATAAAATTTCATAAATTGTTCATATCGTTTGAAGGGACTAAATTTTAATTATTATAAACTATAGGGACTAAATTCTAAATTTCTCACTACTATAGGACCAAACTACAATTAAGCTTTTAAAAATAGAATTAATTCACTAATTAATGCAATAAACTTGGAAGCTGGTTCTTTTGTGATCTTAATATATATAAGATGATTCGAAAGAATCAAATAATAATTTAAAAAGAAAGTTGAAACTACACCACCAAGGGTATATGAATTGAAACTTCAGATATATATTCTTTATTCTTTGATAGCATGCCCAAAACCACAATTATTAATTAATTAATTAATTTAAATATAGCAAAAAGCACTTTAAATGAAGGAATACCCAATATCCAATTTGTTGTGAGAAACTTTAGGCCTTGGATTAGGGGAACCTTCTGACCAAAGAAATTCATTTTCACCCTGCTTTTAAAAGTTTTTTTTTTTCTTTTTCCACATTGCTTTAATTTTCTGAGAAAATAAATAAATAAATAAGTAAAAGCATATGATAATGACTCTTGTCTTATCTTCCTATAAATTTCCTTTATGTTTATGCAACTCAATCCAATGGTGGGTTCATCGATATCTAAGGCATCAACAAAAAATGGTGAGCAGAAAGGTGGATTTGCGGTCGGACACAGTGACGAAACCGACCGAATCGATGCGAGCTGCGATGGCGATTGCTGAGGTGGACGATGATGTGTTGGGGCATGACCCCATAGGGTTAGAGTTGGAAGAAGAGATGGCGAAGATAATGGGGAAAGAAGCAGGGTTATTTGTTCCCTCAGGCACAATGGGAAATCTCATAAGTGTTCTTGTACACTGTGACATTAGAGGGAGTGAAGTGATTGTTGGGGACAATTCCCATATTCACATTTTGGAAAATGGAGGCATTGCAACCATTGGAGGAGTTCATCCAAGGACAGTCAAAAACAATGCTGATGGAACAATGGATATTCATTTGATTGAAGCTGCCATTAGAAACCCAAAGGCCCAACTCTTCTTCCCAACAACAAGGCTCATTTGTTTAGAAAATACACATGCAAACTCTGGTGGAAAATGTCTTTCAGTAGAATATACTGACGAAGTTGGAGAATTAGCTAAGAAGCATCACCTCAAACTTCACATTGATGGAGCTCGTATTTTCAATGCTTCAATTGCACTTGGTGTTCCAGTGGATCGATTGGTACAAGCGGCTGACTCAGTATCTGTATGTCTATCAAAAGGTTTGGGCGCACCTGTTGGATCAGTTATTGTGGGTTCCAAAGACTTTATTACCGAGGCCACAAGGGTTAGAAAAGCATTGGGTGGTGGAATGAGGCAAATTGGCATCCTTTGTGCAGCTGGACTTATTGCAATAAAAGAGAATGTTCAAAAACTTCAAGCCGATCACGAGAAAGCCAAGCAACTAGCTAGTGGGTTATACCAAATCAAAGGATTAAAGGTAGATCCAAAATCAGTTGAGACAAACATTATATTCTTTGAAATAGAAGAGGATTACGGAATCTCACTGGAAACACTATGTAAAAGCTTGGAAGAACGTGGCGTTTTTATGATGCTAGAAAGCCAAATAAGGGCTAGAATTGTTCTTCATCATCAGATTTCAACAAGTGACGTGCACTACACTTTATCTTGCTTTCAGCAAACTCTGAGTGGAATTCAAGTTGTAAATGGCAATTAATCACAACTAACTCTTTGTCTCTCTCTCTAATAAACAAGAGTGGAATTTACCCACGTTGGCTTGTGCTTAATATGCCCTTAACTTGTTGCATAACACATTTCTAATCTCTATATTGGGTTGCAATAATATTCTGTGTTTTCTTTTTGCTTTTTTCTTCTTTTAAAAAATAATTATTTCGTCTCTAAAGACTAAGTAACACTCCTAAAAGCTTTTTGCTCTGTTCTGATGGCTTCAATTTTCTGTGGTGTTGGGAATTTCATAACTTGATAGTAAGTCGACGGAGCTACTCCTA
ATTCATGGAAAGTTGGTCTTCCTAATATCACATTATAAAATATGAAGAGCAGTTCACCGCAAGAAAGTTTACCATCTTTACAATTGATCTTGAATATTCTCCAAAAGTTACTAGTGTGCACTTCAATTTTTTCTTTGGTGTCCACTTTTTCTCCTCCAAAGCCCACATCGAGTTCATCAACTTTTCAATTGTTCTACCCTTAATTTCATCTTGCTAAAAGTGAAATTTTAAAGGGATCTAAAATGGAATACATATAAGTCAAACACGATGTTATACCAAGATAACTGACAAATCAAATAATTTAATAATAAACAATGAGAATAAAAATAGATAGACACAAAGATTGTTAATTTAGTTTGGTGCAACGTCGCTGATATTGCACAAGTGTGTGCGTTGAACTATAATTAAGTAAAACGGTTTGTAAGATCAAGTATCATATTCAAGGAATTAGTAATTCGGTCCTTTTAGCTATTAGAATGTGTCAATTTTATTTCAAGACTCACAACGTGAAGCTGATATTAACTTCAAACCATGAAAGCAAGAAAACAAATTGAATTGGTGTGGACAATCAATTAAGAACATCTCTAGGGTTTAGATTTCATTCGACAAGTATATGGAATCAAGTATTTTTCATTCAGTCAATTTAATCATTAATTTAGGTTTCGAAGCCTATTATCCAAGAACTTATCTCGAGCTCAATCCTTTCTAACACCTAAACATTGATCCTATTTTCATACAATCAATCAATTTAGATGCAACATTAAGACTCTTCATTAATGACCTCTCATGTTTTCCTCTCATTCTTATGCACAAATATCCAACACTAGTCTAACATGCTAGTTCTGGCCTAAAACTCTCTCTTTCGAGCAAGATGTAAGACCCCTATGCCAATTTGATAAGAAAATAAATGGCAATGATCGGATTTGAATCATTAAAAATGTTTGCTTCCAAGCTAAGGTTCTTAAACATGGTGAAGAGTCGTTGCAATTTTTTAAGCTCCAAAATACATTGGGCCATCCAAGAGATATATCTGAAAATCCAGATTCATCCACATTCTTGAATGCATTATGTGCTAATTTAATTTCATGTATTTACATTTATGAAAGAAGTTGAAGGGGTGTTAAATGAATCTAGAATATGTAGGTGAAAGGTTGTATCCTTTCATGGTGCCCTTTCATGTAGTTTAAATAGTGTTGTATTTTTCATATTTGAGTAAAAGTATAATTAAATTTGAATGAATTTTTGATAGAGTTGAATCACTCAAGCTTTTAATGCTTTCTTTGTATTCTTGTGTTTAGAATGCATGTTTTAAAACATTCAATTTATTATGATCGGTTAAATTGTGAAAAGTTAGAAATCTTGAACATAGGAGCAAGTCCTGAAATCTGAACATAGGAGCAAGTCCTCCTTTCTTCGTGATCTTTGATCACCTTACCAACCCAAATTTGTTTAACTTTCTTGGGGTTGTTTATCAATTAGTTATTAGGGGTTATTATTATTATTGTTGTGAGGGTTCACCTAGTATAAGGGGAGTTATGAACTATAATTGAGGCAGTTTCCTTTATCACAATCTAACGATGATCAAGTATCTAATTGAGTAAATGCAAGCATAATAGAAAGCCTACGAAATTAAATTAATCAAACTAGTTTACATCAATCATAAACTTGAAGAAGTTCATCCAATCCACAAAACTAGAAATAAAATCAGTCACTAATGTTCAAAATATCCATGAATAGCATGTTGTGCAAAATACTAAGGGAAAAACAAGATTTAGCCGTCAATCTGGATCCATCGCAACCCTCACATCAATGTCTTCCTCTTGATTTATCTCCAGTCAAATCCAATGTCAAAGCTCTAAGATCCCCTTTGAACGACAACCTATTAGCCTCAATGAAAAACCATAAGCATAACTCCAATTGAACTCCCTACCAAATGACGAAATGACAAATTAAACAAGTGAAAATTTTGTCCAAGCGCCTTGACACTGGTATGTTTTCCT
CCCAGCGTTGTGAAGTGCGCCACGACGCTATGAGTGGCAATTCTCTTATAGTGCTGCTATGCTACAAGACAACACCATGAGCTACCCAATTTTAGGTTTCGATTTGGAGCGTTGAGCTAGCATCGTAGGAGCCTTTTTTGACAATTTTTCATCCCTCGTAAAGCCCTAGGTCAAACTTTGACCTTCCTTAGGATATTTTCGTCATTTTTTACTTCATTAGCCCATTTCAATCCAAAGTTTTGGTTCCTACAAATTCAACAATGAAAAGCATTAAAGAACATAAATTCACAACAAAACCAACAAGACTCTAAGCCCGAGAAAATACTTCTTGAGTGCTTTTCAGTCACCTACTTTAGGGGTTGCATGCGTGATGGACAAAAACTTCACTGGATAAAGAGTTAAGCGATGATAAACCCAAGATTTGTCAAAGGTCCTAGAATCAATCAAATAATACTTATATTACTGAATCTATACTATTCATACTAAAAGAGTAGTACAAGTGGCAAGGTTCGGTCGATCCGTAGGGAAGTGCGAATATTAGTTTCATAAACCCCATTTCTTAGTTAAAGCGTTAAAAGGAGGAGGGGGGAGGGGAGGGGTTGTGTGTGATTTTGAGAATCGAATGATAAACGAGCTAAGAAAAGTAGTTGTTCCCTTAAAGTGTAAAATGACAAGCTATGTTTAGTTGGCACATGACCTACTTAAGCGGAAAAGCCAACGTTAAGTGTCCTTAACTATTAATTAGGAATTTCTTCCTAGTTAAACTATGTGCCGATTAATAAATAATGAAAAGCATTACAAGACTAATCACACACATTGATAACTTGTAGAAATACAAGTTATTATAGTCACTATTTGAATTAATTAGAGTAACTTTAGAGCTAAACGCATGATTTTGGTAGTTGAATTATGTAAAATGTATCACCATGCGTTGGTACTATTTACAAAAGAAAATACTTGAAAAATTATGAGTTTTTTTGTTCTTAAAGCAAGATTAAGAAGAAAAGAAGCACAAATAACCCAAGTAATAGAGACGAAGTTGCTGCGAGAACAAATGGAAACCCATGCATCGAGACTATGCGTTGAGACCATGCGCTGAGAAAAAGTGGATACGGACGCCTGGGCAAAAGCTGTCATGTCATTATGCAAAAGAATAACACGCTATCAGATTATGGCAGAACCAAAGGGCCGACAATGTGAAAAGATTACGCAGCGGGGCAGGAAATTTAATGCAAAGGTGTCAGGTCATCATGAGTTTCAACTTTTTAACGTCTATAAATAGCCCAAAGCTTCTCCAGTCGGAAAAGGAGAGGAGAAAAGAGAGAGAGAGAAACGAGTGCAAGAGGCGAAAGCTCTGCACAAATCCACCCTCACCATAACCAGCCTCCATTTCGAGCTCTTCTGAGAGAAGGATTCTGAGAACTTTCAGAATCGTCTCCCAGTGAGTGAAAACCTTCACCCTATGAACTTTATGTATCAGATAGCAACAATCTCAAGCCTCCATGAAGCAAGACGTTGCTAGTGGCGGCTTGACGTCTTCCATTTCTTTTTTAT

Coding sequence (CDS)

ATGATAATGACTCTTGTCTTATCTTCCTATAAATTTCCTTTATGTTTATGCAACTCAATCCAATGGTGGGTTCATCGATATCTAAGGCATCAACAAAAAATGGTGAGCAGAAAGGTGGATTTGCGGTCGGACACAGTGACGAAACCGACCGAATCGATGCGAGCTGCGATGGCGATTGCTGAGGTGGACGATGATGTGTTGGGGCATGACCCCATAGGGTTAGAGTTGGAAGAAGAGATGGCGAAGATAATGGGGAAAGAAGCAGGGTTATTTGTTCCCTCAGGCACAATGGGAAATCTCATAAGTGTTCTTGTACACTGTGACATTAGAGGGAGTGAAGTGATTGTTGGGGACAATTCCCATATTCACATTTTGGAAAATGGAGGCATTGCAACCATTGGAGGAGTTCATCCAAGGACAGTCAAAAACAATGCTGATGGAACAATGGATATTCATTTGATTGAAGCTGCCATTAGAAACCCAAAGGCCCAACTCTTCTTCCCAACAACAAGGCTCATTTGTTTAGAAAATACACATGCAAACTCTGGTGGAAAATGTCTTTCAGTAGAATATACTGACGAAGTTGGAGAATTAGCTAAGAAGCATCACCTCAAACTTCACATTGATGGAGCTCGTATTTTCAATGCTTCAATTGCACTTGGTGTTCCAGTGGATCGATTGGTACAAGCGGCTGACTCAGTATCTGTATGTCTATCAAAAGGTTTGGGCGCACCTGTTGGATCAGTTATTGTGGGTTCCAAAGACTTTATTACCGAGGCCACAAGGGTTAGAAAAGCATTGGGTGGTGGAATGAGGCAAATTGGCATCCTTTGTGCAGCTGGACTTATTGCAATAAAAGAGAATGTTCAAAAACTTCAAGCCGATCACGAGAAAGCCAAGCAACTAGCTAGTGGGTTATACCAAATCAAAGGATTAAAGGTAGATCCAAAATCAGTTGAGACAAACATTATATTCTTTGAAATAGAAGAGGATTACGGAATCTCACTGGAAACACTATGTAAAAGCTTGGAAGAACGTGGCGTTTTTATGATGCTAGAAAGCCAAATAAGGGCTAGAATTGTTCTTCATCATCAGATTTCAACAAGTGACGTGCACTACACTTTATCTTGCTTTCAGCAAACTCTGAGTGGAATTCAAGTTGTAAATGGCAATTAA

Protein sequence

MIMTLVLSSYKFPLCLCNSIQWWVHRYLRHQQKMVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVPSGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHLIEAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARIFNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFITEATRVRKALGGGMRQIGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYGISLETLCKSLEERGVFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQVVNGN
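
The protein sequence above corresponds to a codon-by-codon translation of the CDS. A minimal sketch of that translation follows, assuming the standard genetic code; the `translate` helper and the `CODON_TABLE` name are illustrative and are not part of this database record. Only the first and last few codons of the CDS are used as input here.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# the record uses the standard nuclear code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3].upper()]
        if residue == "*":  # stop codon ends the polypeptide
            break
        protein.append(residue)
    return "".join(protein)


# First six codons and last four codons of the CDS listed above.
print(translate("ATGATAATGACTCTTGTC"))
print(translate("AATGGCAATTAA"))
```

Passing the full CDS string to the same function should reproduce the full protein sequence, provided the standard-code assumption holds.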
Homology
BLAST of CcUC06G119550 vs. NCBI nr
Match: XP_008453666.1 (PREDICTED: probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis melo])

HSP 1 Score: 651.7 bits (1680), Expect = 3.9e-183
Identity = 326/363 (89.81%), Positives = 347/363 (95.59%), Query Frame = 0

Query: 29  RHQQKMVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEA 88
           R Q KMVSRKVDLRSDTVTKPTESMRAAMA+AEVDDDVLG+DP  LELEE+MAKIMGKE 
Sbjct: 38  RFQCKMVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEG 97

Query: 89  GLFVPSGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGT 148
           GLFVPSGTMGNLIS+LVHC+ RGSEVIVGDNSHIHILENGGIATIGGVHPRTVKN  DGT
Sbjct: 98  GLFVPSGTMGNLISILVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGT 157

Query: 149 MDIHLIEAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHI 208
           MDI LIEAAIRNPK QLFFPTTRLICLENTHANSGGKCLS+EYTDEVGELAKKH LKLHI
Sbjct: 158 MDIDLIEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHI 217

Query: 209 DGARIFNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFITEATRVRKALG 268
           DGARIFNASIAL +PVDRLV+AADSVSVCLSKGLGAPVGS+I+GSKDFIT+A R+RK LG
Sbjct: 218 DGARIFNASIALAIPVDRLVRAADSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLG 277

Query: 269 GGMRQIGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEI 328
           GGMRQIGILCAAGL+AIKENVQKL+ADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEI
Sbjct: 278 GGMRQIGILCAAGLVAIKENVQKLEADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEI 337

Query: 329 EEDYGISLETLCKSLEERGVFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQVV 388
           E+DYGIS+ETLCK+LEERG+FMMLESQ RARIVLHHQISTSDV YTLSCF+QTL+GI+VV
Sbjct: 338 EDDYGISMETLCKTLEERGIFMMLESQTRARIVLHHQISTSDVQYTLSCFKQTLNGIKVV 397

Query: 389 NGN 392
           NGN
Sbjct: 398 NGN 400
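
The percentages on each hit's statistics line are the identical and positive-scoring positions divided by the alignment length. The sketch below parses such a line and recomputes both values; the `parse_hsp_stats` function and its returned keys are hypothetical names chosen for illustration, and the pattern tolerates a variant spelling of the positives label.

```python
import re

# Matches e.g. "Identity = 326/363 (89.81%), Positives = 347/363 (95.59%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts from an HSP statistics line and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: " + line)
    identities, length, _, positives, _, _ = match.groups()
    identities, length, positives = int(identities), int(length), int(positives)
    return {
        "identities": identities,
        "positives": positives,
        "alignment_length": length,
        "pct_identity": round(100 * identities / length, 2),
        "pct_positives": round(100 * positives / length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 326/363 (89.81%), Positives = 347/363 (95.59%), Query Frame = 0"
)
print(stats["pct_identity"], stats["pct_positives"])
```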

BLAST of CcUC06G119550 vs. NCBI nr
Match: XP_038877077.1 (probable low-specificity L-threonine aldolase 1 [Benincasa hispida])

HSP 1 Score: 645.2 bits (1663), Expect = 3.6e-181
Identity = 323/358 (90.22%), Positives = 344/358 (96.09%), Query Frame = 0

Query: 34  MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 93
           MV RKVDLRSDTVTKPTE+M+AAMAIAEVDDDVLG+DPI L+LEEEMAKIMGKE GLFVP
Sbjct: 1   MVDRKVDLRSDTVTKPTETMQAAMAIAEVDDDVLGNDPIALKLEEEMAKIMGKEGGLFVP 60

Query: 94  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 153
           SGTMGNLISVLVHCDIRGSEVI+GDNSHIHILENGGIATIGGVH RTVKN  DGTMDI L
Sbjct: 61  SGTMGNLISVLVHCDIRGSEVILGDNSHIHILENGGIATIGGVHSRTVKNKDDGTMDIDL 120

Query: 154 IEAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 213
           +EAAIRNPK +LFFPTTRLICLENTHAN+GGKCLSVEYTDEVGELAKKH+LKLHIDGARI
Sbjct: 121 VEAAIRNPKGELFFPTTRLICLENTHANTGGKCLSVEYTDEVGELAKKHNLKLHIDGARI 180

Query: 214 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFITEATRVRKALGGGMRQ 273
           FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFI +A RVRK LGGGMRQ
Sbjct: 181 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFIAKAKRVRKTLGGGMRQ 240

Query: 274 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 333
           IGILCAAGL+A+KENV KL+ADH+KAKQLASGL+QIKGLK+DPKSVETNIIFFEIE+DYG
Sbjct: 241 IGILCAAGLVALKENVHKLEADHDKAKQLASGLHQIKGLKIDPKSVETNIIFFEIEDDYG 300

Query: 334 ISLETLCKSLEERGVFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQVVNGN 392
           IS+ETLCKSLEERG+FMMLESQIRARIVLHHQISTSDV YTLSCFQQTL+GI+V NGN
Sbjct: 301 ISMETLCKSLEERGIFMMLESQIRARIVLHHQISTSDVQYTLSCFQQTLNGIKVENGN 358

BLAST of CcUC06G119550 vs. NCBI nr
Match: XP_011660269.1 (probable low-specificity L-threonine aldolase 1 isoform X2 [Cucumis sativus] >XP_031745916.1 probable low-specificity L-threonine aldolase 1 isoform X2 [Cucumis sativus] >KAE8637487.1 hypothetical protein CSA_016969 [Cucumis sativus] >KAE8653610.1 hypothetical protein Csa_006916 [Cucumis sativus])

HSP 1 Score: 636.0 bits (1639), Expect = 2.2e-178
Identity = 319/358 (89.11%), Positives = 339/358 (94.69%), Query Frame = 0

Query: 34  MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 93
           MVSRKVDLRSDTVTKPTESMRAAMA+AEVDDDVLG+DP  LELEEEMAKIMGKE GLFVP
Sbjct: 1   MVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEEMAKIMGKEEGLFVP 60

Query: 94  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 153
           SGTMGNLISVLVHC+ RGSEVIVGDNSHIHILENGGIATIGGVHPRTVKN  DGTMDI L
Sbjct: 61  SGTMGNLISVLVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDL 120

Query: 154 IEAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 213
           IEAAIRNPK QLFFPTTRLICLENTHANSGGKCLSVEY DEVGELAKK+ LKLHIDGARI
Sbjct: 121 IEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSVEYIDEVGELAKKYDLKLHIDGARI 180

Query: 214 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFITEATRVRKALGGGMRQ 273
           FNASIALGVPVDRLVQAADS+ VCLSKGLGAPVGS+IVGSKDFI +A RVRK LGGGMRQ
Sbjct: 181 FNASIALGVPVDRLVQAADSILVCLSKGLGAPVGSIIVGSKDFIAKARRVRKTLGGGMRQ 240

Query: 274 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 333
           IGILCAAGL+AIKENVQKL+ADH+KAKQLASGL+QIKGLK+DPKSVETNII FEIE+DYG
Sbjct: 241 IGILCAAGLVAIKENVQKLEADHKKAKQLASGLFQIKGLKIDPKSVETNIILFEIEDDYG 300

Query: 334 ISLETLCKSLEERGVFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQVVNGN 392
           IS+ETLCKSLEERG+F+ML++Q RARIV HHQISTSDV Y LSCFQQTL+GI+VVNGN
Sbjct: 301 ISMETLCKSLEERGIFVMLQTQTRARIVFHHQISTSDVQYILSCFQQTLNGIKVVNGN 358

BLAST of CcUC06G119550 vs. NCBI nr
Match: XP_008453667.1 (PREDICTED: probable low-specificity L-threonine aldolase 1 isoform X2 [Cucumis melo])

HSP 1 Score: 632.9 bits (1631), Expect = 1.9e-177
Identity = 319/363 (87.88%), Positives = 339/363 (93.39%), Query Frame = 0

Query: 29  RHQQKMVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEA 88
           R Q KMVSRKVDLRSDTVTKPTESMRAAMA+AEVDDDVLG+DP  LELEE+MAKIMGKE 
Sbjct: 38  RFQCKMVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEG 97

Query: 89  GLFVPSGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGT 148
           GLFVPSGTMGNLIS+LVHC+ RGSEVIVGDNSHIHILENGGIATIGGVHPRTVKN  DGT
Sbjct: 98  GLFVPSGTMGNLISILVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGT 157

Query: 149 MDIHLIEAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHI 208
           MDI LIEAAIRNPK QLFFPTTRLICLENTHANSGGKCLS+EYTDEVGELAKKH LKLHI
Sbjct: 158 MDIDLIEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHI 217

Query: 209 DGARIFNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFITEATRVRKALG 268
           DGARIFNASIAL +PVDRL        VCLSKGLGAPVGS+I+GSKDFIT+A R+RK LG
Sbjct: 218 DGARIFNASIALAIPVDRL--------VCLSKGLGAPVGSIILGSKDFITKAIRIRKTLG 277

Query: 269 GGMRQIGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEI 328
           GGMRQIGILCAAGL+AIKENVQKL+ADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEI
Sbjct: 278 GGMRQIGILCAAGLVAIKENVQKLEADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEI 337

Query: 329 EEDYGISLETLCKSLEERGVFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQVV 388
           E+DYGIS+ETLCK+LEERG+FMMLESQ RARIVLHHQISTSDV YTLSCF+QTL+GI+VV
Sbjct: 338 EDDYGISMETLCKTLEERGIFMMLESQTRARIVLHHQISTSDVQYTLSCFKQTLNGIKVV 392

Query: 389 NGN 392
           NGN
Sbjct: 398 NGN 392

BLAST of CcUC06G119550 vs. NCBI nr
Match: XP_031736719.1 (probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis sativus] >XP_031736720.1 probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis sativus] >XP_031736728.1 probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis sativus] >XP_031745913.1 probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis sativus] >XP_031745914.1 probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis sativus] >XP_031745915.1 probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis sativus])

HSP 1 Score: 629.4 bits (1622), Expect = 2.1e-176
Identity = 319/364 (87.64%), Positives = 339/364 (93.13%), Query Frame = 0

Query: 34  MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 93
           MVSRKVDLRSDTVTKPTESMRAAMA+AEVDDDVLG+DP  LELEEEMAKIMGKE GLFVP
Sbjct: 1   MVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEEMAKIMGKEEGLFVP 60

Query: 94  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 153
           SGTMGNLISVLVHC+ RGSEVIVGDNSHIHILENGGIATIGGVHPRTVKN  DGTMDI L
Sbjct: 61  SGTMGNLISVLVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDL 120

Query: 154 IEAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 213
           IEAAIRNPK QLFFPTTRLICLENTHANSGGKCLSVEY DEVGELAKK+ LKLHIDGARI
Sbjct: 121 IEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSVEYIDEVGELAKKYDLKLHIDGARI 180

Query: 214 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFITEATRVRKALGGGMRQ 273
           FNASIALGVPVDRLVQAADS+ VCLSKGLGAPVGS+IVGSKDFI +A RVRK LGGGMRQ
Sbjct: 181 FNASIALGVPVDRLVQAADSILVCLSKGLGAPVGSIIVGSKDFIAKARRVRKTLGGGMRQ 240

Query: 274 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNI------IFFE 333
           IGILCAAGL+AIKENVQKL+ADH+KAKQLASGL+QIKGLK+DPKSVETNI      I FE
Sbjct: 241 IGILCAAGLVAIKENVQKLEADHKKAKQLASGLFQIKGLKIDPKSVETNIFVIYYQILFE 300

Query: 334 IEEDYGISLETLCKSLEERGVFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQV 392
           IE+DYGIS+ETLCKSLEERG+F+ML++Q RARIV HHQISTSDV Y LSCFQQTL+GI+V
Sbjct: 301 IEDDYGISMETLCKSLEERGIFVMLQTQTRARIVFHHQISTSDVQYILSCFQQTLNGIKV 360

BLAST of CcUC06G119550 vs. ExPASy Swiss-Prot
Match: Q8RXU4 (Probable low-specificity L-threonine aldolase 1 OS=Arabidopsis thaliana OX=3702 GN=THA1 PE=1 SV=1)

HSP 1 Score: 511.5 bits (1316), Expect = 8.2e-144
Identity = 249/353 (70.54%), Positives = 301/353 (85.27%), Query Frame = 0

Query: 34  MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 93
           MV R VDLRSDTVT+PT++MR AM  AEVDDDVLG+DP    LEEEMAK+MGKEA LFVP
Sbjct: 1   MVMRSVDLRSDTVTRPTDAMREAMCNAEVDDDVLGYDPTARRLEEEMAKMMGKEAALFVP 60

Query: 94  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 153
           SGTMGNLISV+VHCD+RGSEVI+GDN HIH+ ENGGI+TIGGVHP+TVKN  DGTMD+  
Sbjct: 61  SGTMGNLISVMVHCDVRGSEVILGDNCHIHVYENGGISTIGGVHPKTVKNEEDGTMDLEA 120

Query: 154 IEAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 213
           IEAAIR+PK   F+P+TRLICLENTHANSGG+CLSVEYT++VGE+AK+H +KLHIDGAR+
Sbjct: 121 IEAAIRDPKGSTFYPSTRLICLENTHANSGGRCLSVEYTEKVGEIAKRHGVKLHIDGARL 180

Query: 214 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFITEATRVRKALGGGMRQ 273
           FNASIALGVPV +LV+AADSV VCLSKGLGAPVGSVIVGS+ FI +A  VRK LGGGMRQ
Sbjct: 181 FNASIALGVPVHKLVKAADSVQVCLSKGLGAPVGSVIVGSQSFIEKAKTVRKTLGGGMRQ 240

Query: 274 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 333
           IG+LCAA L+A++EN+ KLQ DH+KAK LA GL Q+KG++V+  +VETN+IF ++E+   
Sbjct: 241 IGVLCAAALVALQENLPKLQHDHKKAKLLAEGLNQMKGIRVNVAAVETNMIFMDMEDGSR 300

Query: 334 ISLETLCKSLEERGVFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQ 387
           ++ E L K+LEE G+ ++  +  R RIV+HHQI+TSDVHYTLSCFQQ +  +Q
Sbjct: 301 LTAEKLRKNLEENGILLIRGNSSRIRIVIHHQITTSDVHYTLSCFQQAMLTMQ 353

BLAST of CcUC06G119550 vs. ExPASy Swiss-Prot
Match: Q9FPH3 (Probable low-specificity L-threonine aldolase 2 OS=Arabidopsis thaliana OX=3702 GN=THA2 PE=1 SV=1)

HSP 1 Score: 485.7 bits (1249), Expect = 4.8e-136
Identity = 242/344 (70.35%), Positives = 287/344 (83.43%), Query Frame = 0

Query: 37  RKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVPSGT 96
           R VDLRSDTVTKPTESMR+AMA AEVDDDVLG+DP  L LE+E+A+I GKEA +FVPSGT
Sbjct: 8   RTVDLRSDTVTKPTESMRSAMANAEVDDDVLGNDPTALRLEKEVAEIAGKEAAMFVPSGT 67

Query: 97  MGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHLIEA 156
           MGNLISVLVHCD RGSEVI+GD+SHIHI ENGG++++GGVHPRTVKN  DGTM+I  IEA
Sbjct: 68  MGNLISVLVHCDERGSEVILGDDSHIHIYENGGVSSLGGVHPRTVKNEEDGTMEIGAIEA 127

Query: 157 AIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARIFNA 216
           A+R+PK  L  P T+LICLENT AN GG+CL +EY D+VGELAKKH LKLHIDGARIFNA
Sbjct: 128 AVRSPKGDLHHPVTKLICLENTQANCGGRCLPIEYIDKVGELAKKHGLKLHIDGARIFNA 187

Query: 217 SIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFITEATRVRKALGGGMRQIGI 276
           S+ALGVPV R+VQAADSVS+CLSKG+GAPVGSVIVGSK FIT+A  +RK LGGGMRQIG+
Sbjct: 188 SVALGVPVKRIVQAADSVSICLSKGIGAPVGSVIVGSKKFITKARWLRKTLGGGMRQIGV 247

Query: 277 LCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYGISL 336
           LCAA L+A+ ENV KL+ DH+KA+ LA GL +I+ L+V+  +VETNII+ +I ED     
Sbjct: 248 LCAAALVALHENVAKLEDDHKKARVLAEGLNRIERLRVNVAAVETNIIYVDIPEDPKFGA 307

Query: 337 ETLCKSLEERGVFMMLESQIRARIVLHHQISTSDVHYTLSCFQQ 381
           E  CKSLE+ GV ++ ++  R RIVLHHQIS  DV Y LSCF++
Sbjct: 308 EEACKSLEDVGVLVIPQATFRIRIVLHHQISDVDVEYVLSCFEK 351

BLAST of CcUC06G119550 vs. ExPASy Swiss-Prot
Match: O07051 (L-allo-threonine aldolase OS=Aeromonas jandaei OX=650 GN=ltaA PE=1 SV=1)

HSP 1 Score: 280.4 bits (716), Expect = 3.1e-74
Identity = 159/346 (45.95%), Positives = 216/346 (62.43%), Query Frame = 0

Query: 37  RKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVPSGT 96
           R +DLRSDTVT+PT++MR  M  AEV DDV G DP    LE   A ++GKEA LFVPSGT
Sbjct: 2   RYIDLRSDTVTQPTDAMRQCMLHAEVGDDVYGEDPGVNALEAYGADLLGKEAALFVPSGT 61

Query: 97  MGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHLIEA 156
           M NL++V+ HC  RG   ++G  +HI+  E  G A +G V  + V   ADG++ +  + A
Sbjct: 62  MSNLLAVMSHCQ-RGEGAVLGSAAHIYRYEAQGSAVLGSVALQPVPMQADGSLALADVRA 121

Query: 157 AIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARIFNA 216
           AI      + F  TRL+CLENTH    GK L + Y  E+ EL  +H L+LH+DGAR+FNA
Sbjct: 122 AI--APDDVHFTPTRLVCLENTH---NGKVLPLPYLREMRELVDEHGLQLHLDGARLFNA 181

Query: 217 SIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFITEATRVRKALGGGMRQIGI 276
            +A G  V  LV   DSVS+CLSKGLGAPVGS++VGS  FI  A R+RK +GGGMRQ GI
Sbjct: 182 VVASGHTVRELVAPFDSVSICLSKGLGAPVGSLLVGSHAFIARARRLRKMVGGGMRQAGI 241

Query: 277 LCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYGISL 336
           L  AGL A++++V +L  DH +A+QLA GL  + G+++D   V+TN++F ++       L
Sbjct: 242 LAQAGLFALQQHVVRLADDHRRARQLAEGLAALPGIRLDLAQVQTNMVFLQLTSGESAPL 301

Query: 337 ETLCKSLEERGVFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTL 383
               K+   RG+      ++  R+V H QI   D+   +  F + L
Sbjct: 302 LAFMKA---RGILFSGYGEL--RLVTHLQIHDDDIEEVIDAFTEYL 336

BLAST of CcUC06G119550 vs. ExPASy Swiss-Prot
Match: Q21890 (Uncharacterized protein R102.4 OS=Caenorhabditis elegans OX=6239 GN=R102.4 PE=3 SV=3)

HSP 1 Score: 269.6 bits (688), Expect = 5.4e-71
Identity = 152/354 (42.94%), Positives = 213/354 (60.17%), Query Frame = 0

Query: 39  VDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVPSGTMG 98
           +DLRSDTVT P+  MR AMA A V DDV G D     LE+  A++ GKEAGLFV SGTMG
Sbjct: 67  IDLRSDTVTVPSVEMRRAMAEAIVGDDVYGEDTTTNRLEQRCAELFGKEAGLFVTSGTMG 126

Query: 99  NLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHLIEAAI 158
           NL++++ HC  RG E+IVG  +HIH  E G  A   G+   T++   DGTMD++ IE AI
Sbjct: 127 NLLAIMAHCQ-RGEEIIVGRYNHIHRWEQGNYAQFAGISATTLEVKPDGTMDLNDIEQAI 186

Query: 159 RNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARIFNASI 218
           R     +  P ++LIC+ENTH  +GGK L +E+   V +LA++  LK+H+DGARI+NA++
Sbjct: 187 RVKDCHM--PASKLICIENTHNYTGGKALPIEWMRSVKQLAERRDLKVHMDGARIYNAAV 246

Query: 219 ALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFITEATRVRKALGGGMRQIGILC 278
           A    V ++   AD+V +C SKGLGAPVGS++VG KDFI  A   RKALGGG RQ GIL 
Sbjct: 247 ASNCSVSKIASFADTVQMCFSKGLGAPVGSIVVGPKDFIDRARHSRKALGGGWRQSGILA 306

Query: 279 AAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDY------ 338
           AA  IA+      ++ADHE+AK LA  +         P+   T +  F  E+D       
Sbjct: 307 AAAHIALDHADATIRADHERAKTLARMIND-----ATPEEFRTKV--FAAEKDITNMVLV 366

Query: 339 ----GISLETLCKSLEERGVFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTL 383
               G++++ L    ++  +  M     R R+VL+  +S  ++   +  +++ L
Sbjct: 367 HCQNGVTVQQLTDFFQKHDILAMTFDARRIRMVLNWNVSDENLETIVEVYKKFL 410

BLAST of CcUC06G119550 vs. ExPASy Swiss-Prot
Match: P75823 (Low specificity L-threonine aldolase OS=Escherichia coli (strain K12) OX=83333 GN=ltaE PE=1 SV=1)

HSP 1 Score: 253.1 bits (645), Expect = 5.3e-66
Identity = 138/329 (41.95%), Positives = 201/329 (61.09%), Query Frame = 0

Query: 39  VDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVPSGTMG 98
           +DLRSDTVT+P+ +M  AM  A V DDV G DP    L++  A++ GKEA +F+P+GT  
Sbjct: 2   IDLRSDTVTRPSRAMLEAMMAAPVGDDVYGDDPTVNALQDYAAELSGKEAAIFLPTGTQA 61

Query: 99  NLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHLIEAAI 158
           NL+++L HC+ RG E IVG  +H ++ E GG A +G + P+ +   ADGT+   L + A+
Sbjct: 62  NLVALLSHCE-RGEEYIVGQAAHNYLFEAGGAAVLGSIQPQPIDAAADGTLP--LDKVAM 121

Query: 159 RNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARIFNASI 218
           +     + F  T+L+ LENTH    GK L  EY  E  E  ++ +L LH+DGARIFNA +
Sbjct: 122 KIKPDDIHFARTKLLSLENTH---NGKVLPREYLKEAWEFTRERNLALHVDGARIFNAVV 181

Query: 219 ALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFITEATRVRKALGGGMRQIGILC 278
           A G  +  + Q  DS ++CLSKGLG PVGS++VG++D+I  A R RK  GGGMRQ GIL 
Sbjct: 182 AYGCELKEITQYCDSFTICLSKGLGTPVGSLLVGNRDYIKRAIRWRKMTGGGMRQSGILA 241

Query: 279 AAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYGISLET 338
           AAG+ A+K NV +LQ DH+ A  +A    Q++    D    +TN++F  + E+   +L  
Sbjct: 242 AAGIYALKNNVARLQEDHDNAAWMAE---QLREAGADVMRQDTNMLFVRVGEENAAALGE 301

Query: 339 LCKSLEERGVFMMLESQIRARIVLHHQIS 368
             K+       +++ +    R+V H  +S
Sbjct: 302 YMKARN-----VLINASPIVRLVTHLDVS 316

BLAST of CcUC06G119550 vs. ExPASy TrEMBL
Match: A0A1S3BXZ8 (probable low-specificity L-threonine aldolase 1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103494316 PE=3 SV=1)

HSP 1 Score: 651.7 bits (1680), Expect = 1.9e-183
Identity = 326/363 (89.81%), Positives = 347/363 (95.59%), Query Frame = 0

Query: 29  RHQQKMVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEA 88
           R Q KMVSRKVDLRSDTVTKPTESMRAAMA+AEVDDDVLG+DP  LELEE+MAKIMGKE 
Sbjct: 38  RFQCKMVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEG 97

Query: 89  GLFVPSGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGT 148
           GLFVPSGTMGNLIS+LVHC+ RGSEVIVGDNSHIHILENGGIATIGGVHPRTVKN  DGT
Sbjct: 98  GLFVPSGTMGNLISILVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGT 157

Query: 149 MDIHLIEAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHI 208
           MDI LIEAAIRNPK QLFFPTTRLICLENTHANSGGKCLS+EYTDEVGELAKKH LKLHI
Sbjct: 158 MDIDLIEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHI 217

Query: 209 DGARIFNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFITEATRVRKALG 268
           DGARIFNASIAL +PVDRLV+AADSVSVCLSKGLGAPVGS+I+GSKDFIT+A R+RK LG
Sbjct: 218 DGARIFNASIALAIPVDRLVRAADSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLG 277

Query: 269 GGMRQIGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEI 328
           GGMRQIGILCAAGL+AIKENVQKL+ADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEI
Sbjct: 278 GGMRQIGILCAAGLVAIKENVQKLEADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEI 337

Query: 329 EEDYGISLETLCKSLEERGVFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQVV 388
           E+DYGIS+ETLCK+LEERG+FMMLESQ RARIVLHHQISTSDV YTLSCF+QTL+GI+VV
Sbjct: 338 EDDYGISMETLCKTLEERGIFMMLESQTRARIVLHHQISTSDVQYTLSCFKQTLNGIKVV 397

Query: 389 NGN 392
           NGN
Sbjct: 398 NGN 400

BLAST of CcUC06G119550 vs. ExPASy TrEMBL
Match: A0A1S3BWA2 (probable low-specificity L-threonine aldolase 1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103494316 PE=3 SV=1)

HSP 1 Score: 632.9 bits (1631), Expect = 9.0e-178
Identity = 319/363 (87.88%), Positives = 339/363 (93.39%), Query Frame = 0

Query: 29  RHQQKMVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEA 88
           R Q KMVSRKVDLRSDTVTKPTESMRAAMA+AEVDDDVLG+DP  LELEE+MAKIMGKE 
Sbjct: 38  RFQCKMVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEG 97

Query: 89  GLFVPSGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGT 148
           GLFVPSGTMGNLIS+LVHC+ RGSEVIVGDNSHIHILENGGIATIGGVHPRTVKN  DGT
Sbjct: 98  GLFVPSGTMGNLISILVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGT 157

Query: 149 MDIHLIEAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHI 208
           MDI LIEAAIRNPK QLFFPTTRLICLENTHANSGGKCLS+EYTDEVGELAKKH LKLHI
Sbjct: 158 MDIDLIEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHI 217

Query: 209 DGARIFNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFITEATRVRKALG 268
           DGARIFNASIAL +PVDRL        VCLSKGLGAPVGS+I+GSKDFIT+A R+RK LG
Sbjct: 218 DGARIFNASIALAIPVDRL--------VCLSKGLGAPVGSIILGSKDFITKAIRIRKTLG 277

Query: 269 GGMRQIGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEI 328
           GGMRQIGILCAAGL+AIKENVQKL+ADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEI
Sbjct: 278 GGMRQIGILCAAGLVAIKENVQKLEADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEI 337

Query: 329 EEDYGISLETLCKSLEERGVFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQVV 388
           E+DYGIS+ETLCK+LEERG+FMMLESQ RARIVLHHQISTSDV YTLSCF+QTL+GI+VV
Sbjct: 338 EDDYGISMETLCKTLEERGIFMMLESQTRARIVLHHQISTSDVQYTLSCFKQTLNGIKVV 392

Query: 389 NGN 392
           NGN
Sbjct: 398 NGN 392

BLAST of CcUC06G119550 vs. ExPASy TrEMBL
Match: A0A6J1DWI0 (probable low-specificity L-threonine aldolase 1 OS=Momordica charantia OX=3673 GN=LOC111024142 PE=3 SV=1)

HSP 1 Score: 613.2 bits (1580), Expect = 7.4e-172
Identity = 312/359 (86.91%), Positives = 333/359 (92.76%), Query Frame = 0

Query: 34  MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 93
           MVSRKVDLRSDTVTKPTE+MRAAMA+AEVDDDVLG+DPI L+LEEEMAK+ GKEA LFVP
Sbjct: 1   MVSRKVDLRSDTVTKPTEAMRAAMAMAEVDDDVLGYDPIALQLEEEMAKMTGKEAALFVP 60

Query: 94  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 153
           SGTMGNLISVLVHC+IRGSEVI+G NSHIHILENGGIATIGGVHPRTVKNNADGTMDI L
Sbjct: 61  SGTMGNLISVLVHCEIRGSEVILGHNSHIHILENGGIATIGGVHPRTVKNNADGTMDIDL 120

Query: 154 IEAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 213
           IEAAIRNPK +LFFPTTRL+CLEN+HANSGGKCLSVEYTDEVGELAKKH LKLHIDGARI
Sbjct: 121 IEAAIRNPKGELFFPTTRLVCLENSHANSGGKCLSVEYTDEVGELAKKHGLKLHIDGARI 180

Query: 214 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFITEATRVRKALGGGMRQ 273
           FNASIALGV VDRLVQAADSVSVCLSKGLGAPVGSVIVGSK FI +A RVRK LGGGMRQ
Sbjct: 181 FNASIALGVSVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKSFIAKAKRVRKTLGGGMRQ 240

Query: 274 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEE-DY 333
           IGILC+A L+AIKEN+ KL+ DH KAK LASGL +I GLKVDPKSVETNIIFFE+E+ DY
Sbjct: 241 IGILCSAALVAIKENLPKLEDDHHKAKLLASGLSEINGLKVDPKSVETNIIFFELEDVDY 300

Query: 334 GISLETLCKSLEERGVFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQVVNGN 392
            IS+ETLCKSLEERG+FMM ES  RARIV+HHQIS SDVHYTLSCFQQTLSGIQV NGN
Sbjct: 301 KISVETLCKSLEERGIFMMQESSTRARIVIHHQISISDVHYTLSCFQQTLSGIQVGNGN 359

BLAST of CcUC06G119550 vs. ExPASy TrEMBL
Match: A0A6J1EZ04 (probable low-specificity L-threonine aldolase 1 OS=Cucurbita moschata OX=3662 GN=LOC111440844 PE=3 SV=1)

HSP 1 Score: 607.8 bits (1566), Expect = 3.1e-170
Identity = 310/358 (86.59%), Positives = 326/358 (91.06%), Query Frame = 0

Query: 34  MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 93
           MVSRKVDLRSDTVTKPT+SMRAAMAIAEVDDDVLG+DPI L+LEEEMAK+ GKEA LFVP
Sbjct: 1   MVSRKVDLRSDTVTKPTDSMRAAMAIAEVDDDVLGYDPIALQLEEEMAKLTGKEAALFVP 60

Query: 94  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 153
           SGTMGNLISVLVHCDIRGSEVI+GDNSHIHILENGGIATIGGVHPRTVKNN DGT+DI L
Sbjct: 61  SGTMGNLISVLVHCDIRGSEVILGDNSHIHILENGGIATIGGVHPRTVKNNDDGTIDIDL 120

Query: 154 IEAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 213
           IEAAIRNPK +LFFPTTRLICLENTHANSGGKCL VEY DEVGELAKKH LKLHIDGARI
Sbjct: 121 IEAAIRNPKGELFFPTTRLICLENTHANSGGKCLPVEYIDEVGELAKKHGLKLHIDGARI 180

Query: 214 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFITEATRVRKALGGGMRQ 273
           FNASIALGV VDRLVQ ADSVSVCLSKGLGAPVGSVIVGSK FI +A RVRK LGGGMRQ
Sbjct: 181 FNASIALGVSVDRLVQTADSVSVCLSKGLGAPVGSVIVGSKSFIAKAKRVRKTLGGGMRQ 240

Query: 274 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 333
           IGILCAA LIAIKENV KL  DH  AK LASGL QI G+KVDPKSVETNIIFFE+EED  
Sbjct: 241 IGILCAAALIAIKENVPKLATDHHNAKLLASGLNQINGVKVDPKSVETNIIFFEMEEDSK 300

Query: 334 ISLETLCKSLEERGVFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQVVNGN 392
           IS+ETLCKSLEERG+FMML+ + RAR+VLHHQISTSDV YTLSCFQQTLSGI   +GN
Sbjct: 301 ISVETLCKSLEERGIFMMLDGKTRARMVLHHQISTSDVEYTLSCFQQTLSGIAAADGN 358

BLAST of CcUC06G119550 vs. ExPASy TrEMBL
Match: A0A6J1HN30 (probable low-specificity L-threonine aldolase 1 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111466119 PE=3 SV=1)

HSP 1 Score: 606.7 bits (1563), Expect = 6.9e-170
Identity = 309/359 (86.07%), Positives = 325/359 (90.53%), Query Frame = 0

Query: 33  KMVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFV 92
           KMVSRKVDLRSDTVTKPT+SMRAAMAIAEVDDDVLG+DPI L+LEEEMAK+ GKEA LFV
Sbjct: 2   KMVSRKVDLRSDTVTKPTDSMRAAMAIAEVDDDVLGYDPIALQLEEEMAKLTGKEAALFV 61

Query: 93  PSGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIH 152
           PSGTMGNLISVLVHCDIRGSEVI+GDNSHIHILENGGIATIGGVHPRTVKNN DGT+DI 
Sbjct: 62  PSGTMGNLISVLVHCDIRGSEVILGDNSHIHILENGGIATIGGVHPRTVKNNDDGTIDID 121

Query: 153 LIEAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGAR 212
           LIEAAIRNPK +LFFPTTRLICLENTHANSGGKCL VEY DEVGEL KKH LKLHIDGAR
Sbjct: 122 LIEAAIRNPKGELFFPTTRLICLENTHANSGGKCLPVEYIDEVGELTKKHGLKLHIDGAR 181

Query: 213 IFNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFITEATRVRKALGGGMR 272
           IFNASIALGV VDRLVQ ADSVSVCLSKGLGAPVGSVIVGSK FI +A RVRK LGGGMR
Sbjct: 182 IFNASIALGVSVDRLVQTADSVSVCLSKGLGAPVGSVIVGSKSFIAKAKRVRKTLGGGMR 241

Query: 273 QIGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDY 332
           QIGILCAA LIAIKENV KL  DH  AK LASGL QI G+KVDPKSVETNIIFFE+EED 
Sbjct: 242 QIGILCAAALIAIKENVPKLATDHHNAKLLASGLNQINGVKVDPKSVETNIIFFEMEEDS 301

Query: 333 GISLETLCKSLEERGVFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQVVNGN 392
            IS+ET+CKSLEERG+FMML+ + RAR+VLHHQISTSDV YTLSCFQQTLSGI    GN
Sbjct: 302 KISVETVCKSLEERGIFMMLDGKTRARMVLHHQISTSDVEYTLSCFQQTLSGIAAAGGN 360
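
The percentages in the HSP header are the match counts divided by the alignment length. A minimal Python sketch of that arithmetic follows, using the counts reported for the A0A6J1HN30 hit and the final alignment row shown above. The function names `percent` and `count_identities` are hypothetical helpers introduced for illustration; they are not part of BLAST or of this database.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 places."""
    return round(100.0 * matches / aligned_length, 2)


def count_identities(query: str, sbjct: str) -> int:
    """Count aligned columns where both rows carry the same residue (gap columns excluded)."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have equal length")
    return sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")


# Counts copied from the HSP header of the A0A6J1HN30 hit.
identity_pct = percent(309, 359)
positives_pct = percent(325, 359)
print(identity_pct, positives_pct)

# Final alignment row of the same HSP (query row, then subject row).
query_row = "GISLETLCKSLEERGVFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQVVNGN"
sbjct_row = "KISVETVCKSLEERGIFMMLDGKTRARMVLHHQISTSDVEYTLSCFQQTLSGIAAAGGN"
print(count_identities(query_row, sbjct_row), "of", len(query_row), "columns identical")
```

Summing `count_identities` over every row of an HSP should reproduce the numerator of the Identity field, assuming the rows are copied without the coordinate columns.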

BLAST of CcUC06G119550 vs. TAIR 10
Match: AT1G08630.1 (threonine aldolase 1)

HSP 1 Score: 511.5 bits (1316), Expect = 5.8e-145
Identity = 249/353 (70.54%), Positives = 301/353 (85.27%), Query Frame = 0

Query: 34  MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 93
           MV R VDLRSDTVT+PT++MR AM  AEVDDDVLG+DP    LEEEMAK+MGKEA LFVP
Sbjct: 1   MVMRSVDLRSDTVTRPTDAMREAMCNAEVDDDVLGYDPTARRLEEEMAKMMGKEAALFVP 60

Query: 94  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 153
           SGTMGNLISV+VHCD+RGSEVI+GDN HIH+ ENGGI+TIGGVHP+TVKN  DGTMD+  
Sbjct: 61  SGTMGNLISVMVHCDVRGSEVILGDNCHIHVYENGGISTIGGVHPKTVKNEEDGTMDLEA 120

Query: 154 IEAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 213
           IEAAIR+PK   F+P+TRLICLENTHANSGG+CLSVEYT++VGE+AK+H +KLHIDGAR+
Sbjct: 121 IEAAIRDPKGSTFYPSTRLICLENTHANSGGRCLSVEYTEKVGEIAKRHGVKLHIDGARL 180

Query: 214 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFITEATRVRKALGGGMRQ 273
           FNASIALGVPV +LV+AADSV VCLSKGLGAPVGSVIVGS+ FI +A  VRK LGGGMRQ
Sbjct: 181 FNASIALGVPVHKLVKAADSVQVCLSKGLGAPVGSVIVGSQSFIEKAKTVRKTLGGGMRQ 240

Query: 274 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 333
           IG+LCAA L+A++EN+ KLQ DH+KAK LA GL Q+KG++V+  +VETN+IF ++E+   
Sbjct: 241 IGVLCAAALVALQENLPKLQHDHKKAKLLAEGLNQMKGIRVNVAAVETNMIFMDMEDGSR 300

Query: 334 ISLETLCKSLEERGVFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQ 387
           ++ E L K+LEE G+ ++  +  R RIV+HHQI+TSDVHYTLSCFQQ +  +Q
Sbjct: 301 LTAEKLRKNLEENGILLIRGNSSRIRIVIHHQITTSDVHYTLSCFQQAMLTMQ 353

BLAST of CcUC06G119550 vs. TAIR 10
Match: AT1G08630.3 (threonine aldolase 1)

HSP 1 Score: 511.5 bits (1316), Expect = 5.8e-145
Identity = 249/353 (70.54%), Positives = 301/353 (85.27%), Query Frame = 0

Query: 34  MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 93
           MV R VDLRSDTVT+PT++MR AM  AEVDDDVLG+DP    LEEEMAK+MGKEA LFVP
Sbjct: 1   MVMRSVDLRSDTVTRPTDAMREAMCNAEVDDDVLGYDPTARRLEEEMAKMMGKEAALFVP 60

Query: 94  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 153
           SGTMGNLISV+VHCD+RGSEVI+GDN HIH+ ENGGI+TIGGVHP+TVKN  DGTMD+  
Sbjct: 61  SGTMGNLISVMVHCDVRGSEVILGDNCHIHVYENGGISTIGGVHPKTVKNEEDGTMDLEA 120

Query: 154 IEAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 213
           IEAAIR+PK   F+P+TRLICLENTHANSGG+CLSVEYT++VGE+AK+H +KLHIDGAR+
Sbjct: 121 IEAAIRDPKGSTFYPSTRLICLENTHANSGGRCLSVEYTEKVGEIAKRHGVKLHIDGARL 180

Query: 214 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFITEATRVRKALGGGMRQ 273
           FNASIALGVPV +LV+AADSV VCLSKGLGAPVGSVIVGS+ FI +A  VRK LGGGMRQ
Sbjct: 181 FNASIALGVPVHKLVKAADSVQVCLSKGLGAPVGSVIVGSQSFIEKAKTVRKTLGGGMRQ 240

Query: 274 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 333
           IG+LCAA L+A++EN+ KLQ DH+KAK LA GL Q+KG++V+  +VETN+IF ++E+   
Sbjct: 241 IGVLCAAALVALQENLPKLQHDHKKAKLLAEGLNQMKGIRVNVAAVETNMIFMDMEDGSR 300

Query: 334 ISLETLCKSLEERGVFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQ 387
           ++ E L K+LEE G+ ++  +  R RIV+HHQI+TSDVHYTLSCFQQ +  +Q
Sbjct: 301 LTAEKLRKNLEENGILLIRGNSSRIRIVIHHQITTSDVHYTLSCFQQAMLTMQ 353

BLAST of CcUC06G119550 vs. TAIR 10
Match: AT1G08630.2 (threonine aldolase 1)

HSP 1 Score: 511.5 bits (1316), Expect = 5.8e-145
Identity = 249/353 (70.54%), Positives = 301/353 (85.27%), Query Frame = 0

Query: 34  MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 93
           MV R VDLRSDTVT+PT++MR AM  AEVDDDVLG+DP    LEEEMAK+MGKEA LFVP
Sbjct: 1   MVMRSVDLRSDTVTRPTDAMREAMCNAEVDDDVLGYDPTARRLEEEMAKMMGKEAALFVP 60

Query: 94  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 153
           SGTMGNLISV+VHCD+RGSEVI+GDN HIH+ ENGGI+TIGGVHP+TVKN  DGTMD+  
Sbjct: 61  SGTMGNLISVMVHCDVRGSEVILGDNCHIHVYENGGISTIGGVHPKTVKNEEDGTMDLEA 120

Query: 154 IEAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 213
           IEAAIR+PK   F+P+TRLICLENTHANSGG+CLSVEYT++VGE+AK+H +KLHIDGAR+
Sbjct: 121 IEAAIRDPKGSTFYPSTRLICLENTHANSGGRCLSVEYTEKVGEIAKRHGVKLHIDGARL 180

Query: 214 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFITEATRVRKALGGGMRQ 273
           FNASIALGVPV +LV+AADSV VCLSKGLGAPVGSVIVGS+ FI +A  VRK LGGGMRQ
Sbjct: 181 FNASIALGVPVHKLVKAADSVQVCLSKGLGAPVGSVIVGSQSFIEKAKTVRKTLGGGMRQ 240

Query: 274 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 333
           IG+LCAA L+A++EN+ KLQ DH+KAK LA GL Q+KG++V+  +VETN+IF ++E+   
Sbjct: 241 IGVLCAAALVALQENLPKLQHDHKKAKLLAEGLNQMKGIRVNVAAVETNMIFMDMEDGSR 300

Query: 334 ISLETLCKSLEERGVFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQ 387
           ++ E L K+LEE G+ ++  +  R RIV+HHQI+TSDVHYTLSCFQQ +  +Q
Sbjct: 301 LTAEKLRKNLEENGILLIRGNSSRIRIVIHHQITTSDVHYTLSCFQQAMLTMQ 353

BLAST of CcUC06G119550 vs. TAIR 10
Match: AT1G08630.4 (threonine aldolase 1)

HSP 1 Score: 511.5 bits (1316), Expect = 5.8e-145
Identity = 249/353 (70.54%), Positives = 301/353 (85.27%), Query Frame = 0

Query: 34  MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 93
           MV R VDLRSDTVT+PT++MR AM  AEVDDDVLG+DP    LEEEMAK+MGKEA LFVP
Sbjct: 1   MVMRSVDLRSDTVTRPTDAMREAMCNAEVDDDVLGYDPTARRLEEEMAKMMGKEAALFVP 60

Query: 94  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 153
           SGTMGNLISV+VHCD+RGSEVI+GDN HIH+ ENGGI+TIGGVHP+TVKN  DGTMD+  
Sbjct: 61  SGTMGNLISVMVHCDVRGSEVILGDNCHIHVYENGGISTIGGVHPKTVKNEEDGTMDLEA 120

Query: 154 IEAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 213
           IEAAIR+PK   F+P+TRLICLENTHANSGG+CLSVEYT++VGE+AK+H +KLHIDGAR+
Sbjct: 121 IEAAIRDPKGSTFYPSTRLICLENTHANSGGRCLSVEYTEKVGEIAKRHGVKLHIDGARL 180

Query: 214 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFITEATRVRKALGGGMRQ 273
           FNASIALGVPV +LV+AADSV VCLSKGLGAPVGSVIVGS+ FI +A  VRK LGGGMRQ
Sbjct: 181 FNASIALGVPVHKLVKAADSVQVCLSKGLGAPVGSVIVGSQSFIEKAKTVRKTLGGGMRQ 240

Query: 274 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 333
           IG+LCAA L+A++EN+ KLQ DH+KAK LA GL Q+KG++V+  +VETN+IF ++E+   
Sbjct: 241 IGVLCAAALVALQENLPKLQHDHKKAKLLAEGLNQMKGIRVNVAAVETNMIFMDMEDGSR 300

Query: 334 ISLETLCKSLEERGVFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQ 387
           ++ E L K+LEE G+ ++  +  R RIV+HHQI+TSDVHYTLSCFQQ +  +Q
Sbjct: 301 LTAEKLRKNLEENGILLIRGNSSRIRIVIHHQITTSDVHYTLSCFQQAMLTMQ 353

BLAST of CcUC06G119550 vs. TAIR 10
Match: AT1G08630.5 (threonine aldolase 1)

HSP 1 Score: 490.0 bits (1260), Expect = 1.8e-138
Identity = 243/353 (68.84%), Positives = 291/353 (82.44%), Query Frame = 0

Query: 34  MVSRKVDLRSDTVTKPTESMRAAMAIAEVDDDVLGHDPIGLELEEEMAKIMGKEAGLFVP 93
           MV R VDLRSDTVT+PT++MR AM  AEVDDDVLG+DP    LEEEMAK+MGKEA LFVP
Sbjct: 1   MVMRSVDLRSDTVTRPTDAMREAMCNAEVDDDVLGYDPTARRLEEEMAKMMGKEAALFVP 60

Query: 94  SGTMGNLISVLVHCDIRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNNADGTMDIHL 153
           SGTMGNLISV+VHCD+RGSEVI+GDN HIH+ ENGGI+TIGGVHP+TVKN  DGTMD+  
Sbjct: 61  SGTMGNLISVMVHCDVRGSEVILGDNCHIHVYENGGISTIGGVHPKTVKNEEDGTMDLEA 120

Query: 154 IEAAIRNPKAQLFFPTTRLICLENTHANSGGKCLSVEYTDEVGELAKKHHLKLHIDGARI 213
           IEAAIR+PK   F+P+TRLICLENTHANSGG+CLSVEYT++VGE+AK+H +KLHIDGAR+
Sbjct: 121 IEAAIRDPKGSTFYPSTRLICLENTHANSGGRCLSVEYTEKVGEIAKRHGVKLHIDGARL 180

Query: 214 FNASIALGVPVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKDFITEATRVRKALGGGMRQ 273
           FNASIALGVPV +LV+AADSV VCLSKGLGAPVGSVIVGS+ FI +A  VRK LGGGMRQ
Sbjct: 181 FNASIALGVPVHKLVKAADSVQVCLSKGLGAPVGSVIVGSQSFIEKAKTVRKTLGGGMRQ 240

Query: 274 IGILCAAGLIAIKENVQKLQADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEEDYG 333
           IG+LCAA L+A++EN+ KLQ DH+KAK LA              +VETN+IF ++E+   
Sbjct: 241 IGVLCAAALVALQENLPKLQHDHKKAKLLA--------------AVETNMIFMDMEDGSR 300

Query: 334 ISLETLCKSLEERGVFMMLESQIRARIVLHHQISTSDVHYTLSCFQQTLSGIQ 387
           ++ E L K+LEE G+ ++  +  R RIV+HHQI+TSDVHYTLSCFQQ +  +Q
Sbjct: 301 LTAEKLRKNLEENGILLIRGNSSRIRIVIHHQITTSDVHYTLSCFQQAMLTMQ 339

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_008453666.1 | 3.9e-183 | 89.81 | PREDICTED: probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis m...
XP_038877077.1 | 3.6e-181 | 90.22 | probable low-specificity L-threonine aldolase 1 [Benincasa hispida]
XP_011660269.1 | 2.2e-178 | 89.11 | probable low-specificity L-threonine aldolase 1 isoform X2 [Cucumis sativus] >XP...
XP_008453667.1 | 1.9e-177 | 87.88 | PREDICTED: probable low-specificity L-threonine aldolase 1 isoform X2 [Cucumis m...
XP_031736719.1 | 2.1e-176 | 87.64 | probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis sativus] >XP...

Match Name | E-value | Identity (%) | Description
Q8RXU4 | 8.2e-144 | 70.54 | Probable low-specificity L-threonine aldolase 1 OS=Arabidopsis thaliana OX=3702 ...
Q9FPH3 | 4.8e-136 | 70.35 | Probable low-specificity L-threonine aldolase 2 OS=Arabidopsis thaliana OX=3702 ...
O07051 | 3.1e-74 | 45.95 | L-allo-threonine aldolase OS=Aeromonas jandaei OX=650 GN=ltaA PE=1 SV=1
Q21890 | 5.4e-71 | 42.94 | Uncharacterized protein R102.4 OS=Caenorhabditis elegans OX=6239 GN=R102.4 PE=3 ...
P75823 | 5.3e-66 | 41.95 | Low specificity L-threonine aldolase OS=Escherichia coli (strain K12) OX=83333 G...

Match Name | E-value | Identity (%) | Description
A0A1S3BXZ8 | 1.9e-183 | 89.81 | probable low-specificity L-threonine aldolase 1 isoform X1 OS=Cucumis melo OX=36...
A0A1S3BWA2 | 9.0e-178 | 87.88 | probable low-specificity L-threonine aldolase 1 isoform X2 OS=Cucumis melo OX=36...
A0A6J1DWI0 | 7.4e-172 | 86.91 | probable low-specificity L-threonine aldolase 1 OS=Momordica charantia OX=3673 G...
A0A6J1EZ04 | 3.1e-170 | 86.59 | probable low-specificity L-threonine aldolase 1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1HN30 | 6.9e-170 | 86.07 | probable low-specificity L-threonine aldolase 1 isoform X1 OS=Cucurbita maxima O...

Match Name | E-value | Identity (%) | Description
AT1G08630.1 | 5.8e-145 | 70.54 | threonine aldolase 1
AT1G08630.3 | 5.8e-145 | 70.54 | threonine aldolase 1
AT1G08630.2 | 5.8e-145 | 70.54 | threonine aldolase 1
AT1G08630.4 | 5.8e-145 | 70.54 | threonine aldolase 1
AT1G08630.5 | 1.8e-138 | 68.84 | threonine aldolase 1

InterPro
Analysis Name: InterPro Annotations of Watermelon (PI 537277) v1
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 282..302
None | No IPR available | PIRSR | PIRSR017617-1 | PIRSR017617-1 | coord: 64..332; e-value: 4.2E-108; score: 358.1
None | No IPR available | PANTHER | PTHR48097 | L-THREONINE ALDOLASE-RELATED | coord: 34..391
None | No IPR available | PANTHER | PTHR48097:SF4 | L-ALLO-THREONINE ALDOLASE-LIKE PROTEIN | coord: 34..391
IPR001597 | Aromatic amino acid beta-eliminating lyase/threonine aldolase | PFAM | PF01212 | Beta_elim_lyase | coord: 40..326; e-value: 1.8E-95; score: 319.6
IPR015421 | Pyridoxal phosphate-dependent transferase, major domain | GENE3D | 3.40.640.10 |  | coord: 38..287; e-value: 1.4E-88; score: 298.2
IPR015422 | Pyridoxal phosphate-dependent transferase, small domain | GENE3D | 3.90.1150.10 | Aspartate Aminotransferase, domain 1 | coord: 288..383; e-value: 6.4E-24; score: 86.1
IPR023603 | Threonine aldolase | PIRSF | PIRSF017617 | Thr_aldolase | coord: 30..389; e-value: 2.8E-135; score: 448.3
IPR015424 | Pyridoxal phosphate-dependent transferase | SUPERFAMILY | 53383 | PLP-dependent transferases | coord: 39..384

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CcUC06G119550.1 | CcUC06G119550.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006520 | cellular amino acid metabolic process
molecular_function | GO:0016829 | lyase activity
molecular_function | GO:0003824 | catalytic activity