IVF0022074 (gene) Melon (IVF77) v1

Overview
Name: IVF0022074
Type: gene
Organism: Cucumis melo L. ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: Beta_elim_lyase domain-containing protein
Location: chr02: 18407117 .. 18428804 (+)
RNA-Seq Expression: IVF0022074
Synteny: IVF0022074
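
As a rough illustration, the location given in the overview can be split into chromosome, start, end, and strand, and the genomic span computed from it. This is a minimal sketch: the helper name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption, not something stated on this page.

```python
import re

def parse_location(text):
    """Parse a location string such as 'chr02: 18407117 .. 18428804 (+)'.

    Hypothetical helper for illustration only.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("chr02: 18407117 .. 18428804 (+)")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, strand, span)
```

Under that assumption the gene spans 21688 bp on the plus strand of chr02.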
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTTATATTTTGCTATAAATTTCGTTTATGTTTATGCAAATTAACCCAATCGTGGGTTCATCCGTATCTAAGGTGAGTGAAATTAATTTCATGTTTTTGTTTCGAGTAATTTCAACCATTTATGTGTTCATGATCTATAATTATTTATATATTTATGCAGTGTTTGAAGTACTTAAAGAAGCTATATCAACTGCTTTTTCTTGGGCTTATAACATCTTCCATTTCAAGAGTAAGAAATCACTATTTTTTCTTATAATTTTAAAATTTGATGTTGATATTTAACTCCTAATTCCATTTAAATATAAATAAAAATGATTTTATTTTTTCATTTACTCGGAGTTCAAAAGGTTGAAGATTTGGACCCAATTTTTCTCGAAGAAAATTAGTCATGAACTTTCTTTCTTACACCCTTTTCAAGATCTTAACTACCAAACAAAATATTCTCCGTTTCTGGATTCATGATTAGCTCATAGATCAATATTTTACGTGTTCATTTTGCGATCAAAATCCTCGTACAAAAATCCAATAGGTTTGATACAATAAATATTTGGGCCGAAAATTAATGTTTGGGTTAAATAATGACTTTTTAAGTCACTAATTCATTTTCTTTATATATATATATATATAGTTTCAGTGCAAAATGGTGAGTAGAAAGGTGGATTTGCGGTCAGACACTGTGACGAAACCGACGGAATCAATGCGAGCTGCGATGGCTATGGCGGAGGTGGACGACGATGTGTTGGGGTACGACCCCACAGCCTTGGAGTTGGAAGAAAAGATGGCAAAGATAATGGGAAAAGAAGGAGGGTTATTCGTTCCATCAGGGACAATGGGAAATCTAATAAGTATTCTTGTGCATTGTGAAACTAGAGGGAGTGAAGTGATTGTTGGAGACAATTCTCATATTCATATATTGGAGAATGGAGGTATTGCAACCATTGGAGGAGTTCATCCAAGAACGGTGAAGAACAAAGATGATGGAACAATGGATATTGATTTGATTGAAGCTGCCATTAGAAATCCAAAGGGACAGCTCTTCTTCCCGACAACAAGGCTCATTTGTTTAGAAAATACACATGCAAAGTAAGTTTGTTCATTTACCCTATTGTTTTCTTACTTTGGTTTTGAAGGCTGTTTGGTAACAACCTTAGTAACGTTTTCTTTTCTTCTCTTTTTTTCTTTTCCTTTTAAAAAATCGACAAGATTTAGTAACATGTTTGATTCTTAACAAATTCCGAACTCCAAAAACTACATTTTTTTTTCTTAATTTCCAAATTTTGGCTTTGGTTTTCTAAAGCATCGGTAGAAAATAAGTAATTGAGAAAAAAAAAAAGGGCAAAATAGGTGTTCTTAGGTGTTAAATTTACTTTTTTTTTTTTATATATATATAAAAAAACTGTATGGTTACCCTAAATATTTTGTTTGTTTAGTTTCTTTTATCTTTTGTCTTGTTTTTTACTTTTAGTCTTTTTTTTTTTTTTTTTTTTTTTTTTAATTTTTTGTGTGTTTGCTCAAATTTGTTGTCCTAATAAATCCGTGTGTCTTTGTATTACGTTTAGTGCTTATAGTTGATTGTCTATTTCTTTTATTTCTTTAATTATTTTTTTATATTGAAAATGCCCCTTTTTTATATTCACGTTACTTTAATTTTAAATTTGAATTTCCAATATTTATATTTACATATTATTTGATTTCCAAATTTCAAATTTTTAATAAATTTTTTTAAATTTAAATTCTACAATATGCATATATATTATTTTATTTTTAAATTTTTAATATTTATTTTTATATATTTTTTCATATTTCAAATTTGTATATTGGAATATCTTTTAAATTTTAAATTTTATAACATGCAAATTTTTTTTTGTTTTATTTTACATTTTATTCCTTTCTTTTCTACTTCCTTTTTCTCTATATTTTCTTCGTATTTTTATAATTATTTATTTCTTTTATTTATATTTATTTTCTTTATTATTTTTATATTTATAAGTTAATCTAATT
TTCAAATTTGAATTTTCAAATAATTGTATTTATAATTGATTTTGACAAATTTCAAAATTTTCATAGAATTGTTTTTAAATATAGAAAAATAAACCAAAGTATTTACAACTATATAAAAAATTAATTTTCATATATTTATATTTTGTATATTATTTTGATTTTCAAATTTTAATTTTTTAATAGGTTGTTTTCTAATATCGAAAAACGAACCAAAACATTTACAAATATAAAGTATTGTGGTATTTTTTTAAAACTAATATAAGGAACTACAAAACTATGAAATATAAGAAACTATAAAAAATGGTATAAAACTTTTTTTTTCTATGAAATAGGAATGACGAGTACTTATCAAACATACTATTGTTTCCTTGTTCTGAAAAAAAGAAGGAACAAAGAATAGAAAACGAGATGAGAATTTTGTACAACTAGATCTAAACTTAAAATTTAACAAACAATTATATTTTAGACCTTTCTTCATTTGTTCTTTTGAATTGGTTTACAAGATCTTGATATCGATATGGTTTTCGTTATTGCTGTTATAATTATTATAAATACTAAAATATTTATAAAATATAATAAAATTTCAGATTCCATTAGTTATAAACATAAATAGACGATGATAATCAATGATTATTAACACATAAACATTACTAGAAGCCTAACAATGTCTATTATTGATTTAGTTTGAAATTTTGTTATATTTTTGTAAATATAACAAAATTTACTGAAGTGTGAAAGAATATTACCTAATAACTTAAACACTACTGATATACAATTTAATTTACTTTGACATGAAACCATATCTTTTTTTTTTCTTAAGCACCGTGGAGTTCCATAGATTTGAGAACCATATATTTATTGATTTATTTATTTCATAGAATCTCAGGAAAATAAAAAAGTATAAGAAAAAAAAAGGCATTGTTATTGCACTATTGGGAAGGATTTGTATTTATTATATTTAGGCTTTTAAATTTCAAGTTGCTAACTTTTTATGAATGACAAAAAAAAAGAAAAAAAAAAAGAAAAAGAAACATAGAGGATCTATAAGAAAAAAGAAAGAAAGGTAGAAGAAGACCCAAAAAGTACAAAAGGATATGTGTTGACCAATAAGAATAATTGGAATGTCTAATCACTAATGGCAAAAAAGGGAGTGTTGTGAAATTAAATTTTGATTCAATTCTATCTTCTCAGATTCTTTAGTTGAACAAATATGTATTTAGCGGGTAATTTGAATTCTGTTTCCAATAATTATTTTTAAATTTGGTAACATAATCTATGTAAATTTTAAAAACAACTTTTTTTTTTTTTTTTTAGTTTTCAATCAATGAAGATTGTTTTTAAAGTCTCTTGAAGTACAACTTATGGTGGAAGATTTAAATCCGACACCTTTTGTTTTCATGAATATATAGATACAAGGTAAGCTAAGTTTGTGTTGCAATCAATCAAGATTTTGAACTTCAATTTTTAACTAAAAGTATTGTAGATAAAAAATATCAAATTATTTTTAGAAATAGAAAAAAACTGATACAACTAATAGATTTCTATCACTTTTTTTGAATGATAGATTTGTATCAATTTCTAGGGTTGATACATATTAGTGGAAGTCCATTATTGTCTATATTGACGGTAAGAAATGTAAAACTTTTTTCAATAGTTGATCCATATTTTTATATTAAAAAAGTTATTTTTAAAATGTAGAAGTTATATTAGGAAAGACAAAAACAAAAACACATGTATTAATTTATTTTATAAACGAGATCATTTTTGTTAAATTTTAAAAATATATATTATATGATAAGATAGACCAAGTTCGTTTGTTTGTATTAAAACTAATATAATTATATGGTATATAAATAGATTATGAAACAGATTTTGAAAAATTATATTTAAAAACATGAAATACAAGTTTAGTTTGATTAAATTTTGTTTTCAAATGGCTTAATAAAAAATTTCTTGGTATATATATAATAAGATATAGTTTATGAGTTGAGTAATTAGTG
TACGGTGAAAAGCAGTTCGGGTGGAAAATGTCTTTCGATGGAATATACCGATGAAGTTGGAGAGTTAGCTAAAAAGCATGACCTCAAACTTCACATTGATGGAGCTCGAATTTTCAATGCTTCAATTGTGAGTACTATCTACTTGTATTTCTACTCAATAGTAAATTGAAATTTGGCTTTCGATTTAAAGAGGTATCTTCCTTAAAAAAAGAAAATATATTTTTTCAGGCACTAGCTATTCCGGTCGATCGATTGGTACGAGCCGCTGATTCAGTATCAGTATGTTCATACATTTTACTCTAAAATGTTTCTCGATCTTTATTTGTTTTATTATTGATTTGGAATTTAGGTGGGAAATATGTTTAAAGTTAAAAAGCGTTCGTAGTCACATATTCTTATCTGACCTTATCTAAAATTAATTCAATTGTTAGACTAGTTTTTACCTTAATTTGGCTTATTTTTATAAAAAAAGATATGTTATTTAACTACTTTTAAAGTTTCGTTGATAGTTTGGTTAAAGTAACTAAATCAATTATTTAGACCAACCAAGGTAGGGGGAGAAATATACTAAGAGTGGTAACTTTTTTAAATAGGTTTCAGTTGCCAAAAGGCTCCATGGACTTTTGAAGAGTAGACCAAAGTTAGTATTATTGCCAACATGGGCTAAGCTTATCTGATATATATATGTTCTCGAGGATAAGAGGTCCCGAGTTCAAATCTCCTTACTTCTAAGTTGTTCTTAATAAGACATTTTTAAAAAAGTAGGTATTATTTCAAATTCCAAAAAACTAGCAATTTGAATTTGTGGTGAGGAATTTTTTTTTTCCGAAAAGGTCTTTTTAAGTACAGCTTGGAGGTGAGGGGGTTTGAACCTGTAGCCTCTTGTCCTTGAAAACATAGAGATGTGCCGGTTAAGCTAAGTTCATATTGGCGGTGCAATTTGAATTTTTGGCCATTTGTGAAAGAAGCTTTTTTTTGGCTATAGATATTCATCTTCTTCTTAGGATAAGGGAACACAAGGAAAAAGCCATACAAAATAATAATCCTTTTGCACAACCTCCACATACCCTCCTGAGGTTGATTCCAAGAAAAAATTATTTTAGTTAATATTTTAAAATAAATCCTTTTATTAGAAGATTATTCCCATGACAGATTTTGAGAAACGGTACTAAGTTTTTACTTTATTGAGTTGATTTTTTTTTTCAATACACTTTAATTGGTGTAATACTCTTAACTTATTTTATGAAATTGTTGCTTCAAATATTAGATTGATAAACAATGAAATAAGACATTTTTTTTAAAAAAATATTATTTTCTAAATGAATTTCATTTGAATAATGACTTCAATTCAAGGTATAAAATTTGTCAACTATTTTCTAAATGAGTGTTGTAAGGTGATAACACCTTCCTCATACATAAATAACTCTTGAACTCAACTTTCACAAACCATTTTTTAAAAAAATTACTTAAACATTGTTTACTTTATTTTCCAATCACACCATAAAAAGGATTGGTGGTGACAACCTTTTTATTGTTCTTTAAAATTAACCCATTTTGAGAATGTTAGCCGCTCCACGTCGTCTTGAGCATATGACCACCACATGTTTTAATCCAAATAAGTTACACACAAACATTAAAACATTTGAAAGCTCGGAAGTTATATATTAATTGATGAATAGTTAACGAAGCTTCTTCAAAATTAGATACGTCGACGAAATTAAATAGAAATTTAATTATTACAAGTATTTATTTTTAGTGTGAAGTCTAGACAATTCTATAAGTAGGATTGATATTTTGCATTGAAAAGTAACCAAATGTGAAGAAGAATACACCCAAAGAAAATTATTTTGAAGAGTCTTCCAAATAAGTGTTTTGTTTTTTGTTTTTATAAATATTTTATTTGGTGATTATTTATTTAGGAGTGTCCATCACAAACTTCCAACCTTAATAAATTGCTAACATTATATAAAGCTTTAACTTTTTACTCATTAGGTATGCCT
ATCAAAAGGTTTGGGTGCACCTGTTGGATCAATCATTCTGGGTTCAAAAGACTTTATTACCAAGGTAGTAATTTGAGCTTTATTTATTTATTTTTGTGATATTGGAATTGATATTTCTAAAATAGAAATTTGAAAACACATCATAGGAATTTGAAGCTCTGTTGAGATAGTTTGGAAACCTAGGTTTATTATATTTGAAATTATTTCCAGCGAGAAATCTAGGACACCTTTATAAAAAAAATACAATGTTAATGTGGAGAGTTATGTCAGTACCAAAGCTATCGACAAAATAAGTAGCATAAGCAACAACATTCGGCGCAATTACACCTATATTTGGAGAATCTTAAGCCCGATTGAAATATCATTTATTGATATTCAAAAGGTTCACAAGTACATAATTTATCAAGAATTAACTATCTTGTATTATACACAACTTTTACAAGATAAATGTTTAGGCTCCCTTAAATTACTCTTGAAGTGTGAACAAGGGATTTGATCTAAGCTTCCCTTAGATCATATGCTTCTTTCTTAAAGCCCTTGTATCTCAACATCCAGGCTTCCTTTGAGAGAGAATCCTCAGAATTTTTTTTTAACATTAGGGGTGTATTCAAATCTTTTCAATTTATCGAATCATGAAGTCAATGATAAGGTGTCCAACACCACACACCACACAAAAAAGATATTTCTCACAAGATACTCACCGTACACACAAGTAATGCTTGAAGGAGAAATCTTCTCTAAAAAAATTGGATGATGCTGTATTCAAAGTCAAAGACATAAGGTAATAAAAATCTTCCAATCAACAGAATAGGAAAGAATTTTTACTTTATTTAAAAATAAGACCATAGTAAAACCACAATTCCCATTTTCTACCAACAACCAGAAAAGGAAAGATCTCCAAAAATATATAAGACAAAAATAAAAATAAATTTGCCAGCCAAAAAACTTAACAAACTCCCTCTTTTGGCAAACAACATATCTCTTTTTATTTTTTTTTTCTTAGCAAGCAAGCCAAACGATCAACAAACAATAGGGTAACAGACAACCACAACAACAACCATAAAACAAAGAATACATCAGCAAGCAACACTAACAACCAACAAAACAACCAATCATAGCCTACAACCTCAACAAAGCTTACAGTAACAAAGCGTGCAATCATCCCATAGTAGAGATCTTCCCATTTGCTTGCCATAGAAAATATAAGTCAATGGACGGAGGAGGATTTTGAGACGAGGAGGCAGCAAAACACTAAGAATGTGCATCAAGGCATCAAGAGTAGAATATCTATCCGAAAGACTGTGAAAACACCGCTCAAGGTTCGAGACTTTATAGATTAGATACGAAGGATATACGATTGGCCAAGGCAGGAGGCAACATTAACTCACCATTAGGAGTGCACAAGTCATTAGAAAAAAGGAAACTTTTTATCACGCGGAAGAAGAATGATGTGTGCAATATCCGAAACATGAGACCCCTGAAACAACCTATGGCTCAGATGAAAAGTCTTAGGGTAGAGTTAGGAGCATCCTGATCGGTCAAGATGGATGGGTGCTGAGAAAGCATAATAATGCAAAGTAAATGAGGAAAAGAAATAGAAATCTTTAAACTAAATGTCTCAATATGTAGAAGCAGCTGGTTAAACACAAACTGGGCAACATCAGCCACTGATCCAATCCCAATGAGGTAAATCAGTTTCGCAAAAGCCAAAGAGACACTTGAGGCATAGGTGGAGGGGTACCAATTCACAATGTCAATCTTGTCAAGAATGACATATTTCACACTAAGTGAGATAGTAGTAAATTGGTCATTAATGGGCCATTCAATAACGGTTCCTTTAGTGAATTCAGATGCTAAAAGCTCAGAGGATGGCTGCACAACAGAGAAACCAGTTGGAATCTCAACATCAAGAAATTAATTTATAATGGCAGATGCCATAATGAAGTATAAGCCACAAACATGAATCTTCCTATACTTAAGGCTACTCGAATCATCAAACT
CTAGCAGATTGACAATTAATTCATGAATCAAGTGCATATAAAACTGGCCAACTTAAGACACAGTCTTACTCATACCAGCTCGGTTAATTAACTTCATAATAGCGAGACAAGAGTGATGCTTATAAGAAATGTTGACCTCATCAGCAATTCTACGCTGGACCGCATACTTCCATTTTCAAACACTTTCTTTACAATGAAATGGTACCCTATCAATGGGGACAGATGGGACACTTGAAAGAATTTTGCAACGACCAACTTTAGTGGTTATGATTTTGGGACCTTTGGAAGGACCCACAAGAGTATGTTTGCCTTTTGTAGTCTTCTCAATAGGACTAGGATTCGACCACCAAAAGAAATAGTAAGGTCCTCAGGAGCTCGCTTGGATCGACTAACCAGGTGAGCCAAAACGACATTATCTTCAGAGGAATAACCAAATGAAACATTATTCATAGACTTTAAAGGATCAGTAGACGTTGTAGTTATAAAGGCAGATGTACCCATATTAGCAGAAGGATTAATCTCACTTGTAAAATCAATAACAGTTTCAACAACGGGATCAGAAACATTCACAGACTTAATGCAATGAGCAATGTCAGCACCAATGTCATTAACATGAACATTATCCTCAGAAACAATATCTTCAATAATAGAATTGACTTTGTCTATAACATTGGAAAAAATAGGACAATTAGGAGGGATGATGGAGGTATTATTCACAATAGTCATAGAGGCGGCAAGAAGCATGATATTTTCATCATTCTCATTATCACTAGATACATGATTAGGGCTAGCAGAGGGTGCATGCAAGATATCTTCATCAACAGTAGGATTGGGGTGAGAAATACCTAAATCAGGAATAATAGGACTATGCAATCAAGAGGAACCCACTCTTCGCCACTTGGAGGAAGATTAAGAAGGCCCTCGAATTCAAACCCCACAAGAGTTAGTTCTTGCGATAGAGGCACAAGAACTTTGGATCCAAATTGATCCTTTAGTTGACGGGGCTGGTAAGTTCCTCTTTTGGTTGCTACCATGATGAATTTGGAAGGAACAGGAAGATAGAGATAATCATGCAACCATACCAATAACTTATAGTGAAGATGGCATTTGAGAAAATATAGGACAAAGTAAATTGAGTGAGAGTTGACTAGAAGAAAGAACATCTGCTTTATATTATAAATTGAAATATAGGAAAATATGGTAAATAAATGAGCTTTTCTCATGGACTTAGAATCACATGGCCCACAAGTAAACTTTGCCATAAAAGATCAAGTATCAATTGCAATAGATTTACCCTGCAAATTCCTAAATCTACCCAAGAAACTCGAAGGTAGTAGCATCCAAAGGTTTCGTTTAAATGTCTATGAGTTGAGTTGTAGTTATACATGTTCCAAAGAGATGATTTTATTCTTCAAAAGGTCTCGTATAAAATGATGTCGTATGTCAATGTGCTTTATTCTATTGTGTTGAATTAGATTCTTGGAGATGTTAATGATACTCGTGTTGTCACAATATAAGTTCACCATATCTTGGTTAATTCTATATTTTTTGAGCATTTGTTTCATCTAGAGCAACTGAGCACAACTACTTCTAGCTACTATGTACTCAACTTCAACAATGGACAAGGACACATTTTTCTGTTTCTTACTAAACCAAGACATTAGCTTATTCCCTAAGAAGAAGTATCCTTCTAAAGTGTTTTTCCTATCTTCCGAGCATCCAGCCCAGTTGGCATCCAATATCTCACTAAAGATGAATGAGTATCAAAGAAATAACATAGACCATACTCAGTAGTACCATGCACATATTTCAAAATTTGCTTGATTAAAGCAAGACATCACGCTCCCACCCATATCGCATCCTTCTCACGATCAGGCGTGGTCTCTACTTGGGAAGCTCCAACGCCTTTAAATAAAACATTAAAATGTCAAACTTACAACTTAAACTTTATTTATTAAGAACATGCTATGTGATAAGAACTCAACACTACAATGCCTTGC
TTGCTTGGAAAGAAAAACATAAAATATAAACCTAATGCTTAGCGAGTGACATTTTAGGAAAACCTTTTGGGGGAATACAATAGCATATAATAAAACTTTCTTATTTTCAAAAACATCACAATGTAGCATTACACATTTAAATAATACAAATCACATAAGTCTCTACTTGTTCCTCTACTTATTCCTTTCTGATGGACATTGAACTTCCTCAACGCAAAGACTACTATGATGACTCATCTCACCATCAGTATCTCTGTGGACCCTCGTTCAATTCCAGTTGCTTTGGTGATAGATTAGAACATAAGCCACTCAATGCATTTAGCCACCCTTCTACCAAGCGGACCTCTCAATGCAAGGTAGTCTCTACTATTATGTGCACATAATAGATAGCTTCTTGATAGGAATAACGCTAATGATTTACTTGCATACAAGCATCATATAAGTTATATTTAAACTTATAAATATGCTTTCAAACCGTCGTGTTTTTCATAAAAATCATATAATAGATTACACATGTACTACACCACACAAGACTTTGGAAAAAAAAAATTAAGGAAACTCTGTTCTTTCTTCAGATAAATCACTCACAAAGCCTAATGTTATCTTTCTTTATAATCCTTTCTTATCGCCTAACTTCAAAGTCTTCTTTTCTTGCCTATTTTTTTTTTGACAAAGCTTAAGCTTAACGCTTCTTTCAAACGTTCCTTTGAGCTTACAAAACTTTTTATACTAATCATAAAATAGATTAGTAATCCTTAAACTCACCTTAGCTTGCTAGCTCCTTCTTTGTAACTTACATGTGTAGGTTTGATATATCCTTTAACCATGCTTAACTTTACCTCAACATGTAACTTGCTAACTAATTCAAATTAACGGGTCCTCTTAGCCTACGTTCCTTTCCTCGCCTAGCTAACAATTCATTTTTTTCCTCGCGTTACTTAGCCATAATGCATAACTACTCCATCCTTACGCCGCGTAACTAGCACACAACCTTCCTTCAAATGCCTAATAACTATACTTAGTACTTTCAACCTCCTTCACAACCTTTCTATTCTAGAAAGAGGTTTCACACAAGAGTGCATTTAGTAAACAAAGTTGTCAACCATGACTAAGGAAATCCTCTAAAAATGATATCATACACATCTATGCTACCATGAATTTTTTACCTGATTTCTTAATGAAAAGGGTTTTGTCAGCGTCACCACGAGATTGTTCTTTAAGAGCTAGAAACTCGAAAAGACGATCATACCAACCTTTTAGGAGTTTGCTTGATACCATAAAGAGCCTTCTAGAGTTTATACACATGTTAAGGATATGCTAGGTCAATGAAGCACTTGTGTTGAGCAACAAAGACTTCCTAATTCAAGTACCCATTAAAAAGGCACCTTTTAAATCCATTTGGAACAACTTGAACTTCAACAAACAAGAAAGGCTAAGAAGTAAATAAATGACTTCCAAGCAAGCTATAGGAGCAAAGGTCTCATGAAAGTTCATTCCTTCAATTTGTGCATACCATTGAGCAACAAATCTAGCTTTATTTTTTGTAATAGCACCACTTTCATCTGTTTTGTTTTTAAATTACCATTTAGTGCCAATAACATTAGCATTTTTCAGGACGAGGAACCAGCTTCCAAACCAGATTTCTTTCAAATTGAACCAACTTTTCTTGCATTGCATTCACCCAAAATTCATCTTTTAGGGCTTCAAAAATATTAGTGGACTCAAAAGAGGACGTGAAGCAGATGTTTCTATTCATTTTATTATAAATCTATCTGTTCTTCTTTCGAGTTGTAACCCTTGCATTTTGATTGCCAATGATACGACTAGAAGGGTGATTCTTTTGAATAGAGTGAAGGGAGATGGAGCACTGATATCAAACTCAATGATCACAACTTGATTGTAATTTGCATTTTCACAGTTCCAAGACTCATCACTAATGCTAGTGACAACAAAAGTAACAAGCCCCTTTATCTAACATTTTGAGCTTTAGGAATTTTGTT
GTTAGGAAACATATCATTATCTATGATTAGTTGTGAAGTCGCTGAAGACTGATCATTAATAACAACATTAATAGACTCCATTATATATGAGTACATTTGTTAAAGATTTTAAAAGCTCTATTATTTGTAGAGTAGGCTAAGAAAATGCATTCATCTAATTTGGAGTCCCACTTTTTGTGAGGTTCATGATATGTAAAGGATATAACAAACACTTCTTAAGAAATGAAAGTATTTCACATAGGTTTTTTACCTTTCTAGAGCTGATAGTTGGTACTGGTTGTGCTAGGGTTCAAGACGACTCTATTATGGACTTGACAAGTTGTGTTTAAGGCTTCTGCCTAGAAGTAAATTGGAACCTTCTTAGCATGGAGCATGGCTCTAGCCATTTCTTATAACATTCTATTTTTTCGTTCAACAACCTCATTCTACTAAGGGGTAATAGGAGTAGAGAACTCATGAACAATCCCTTTTGAGGAACAAAACTAGTCAAAAGCTGCATTTTCAAATTTCTTTCCATAATCACTTTATATGTAGATATTGTTTTCTCCTTATTCACGTTGGAGCTGCAGGCATAAAGCTCGACAAACTTTGGAAGTATCTGCCTTTTCCCTAATGTGAACCATGAGTAGTCATCCACACACACAAAAACATAATTCTTTCCTCCTAAACTTTCAATCTGCATAGGACCATCAAGTTCATATGGAGTAGTTCATGAACTTTATCGGTATAACATTGACTGAATCTCTTGTGAGATGCCTTAATTTGTTTCTCTGCTTGACACTGTCCACATAAGGTATTTGACTCAACATGTAAGGTTGGAGTTCCAAGAATAGCTTCTTCATTTAAGGCTCTTTTTATAATTTTAAGATTTATGTGACCAAGCCATTTGTGTCAAATTTCAGCTTCCTCTCCTTTATATATATTACATACAAAATTTTAGCACTAGGATCTTAGTGATAGAACTTATCAAATGAGTAAAACTCATGATAGTCGTGTCATTTTTATCTTTCACTGTGCATTTATCATTTGAGAGAACTTACTTGATATCCCGGGTCACACAATTGACATATGCTGATCAAGTTGGCTCAGTCTGACATCTTGCAACATGGGTAGATTAGGCTTTGCAATGTCTCCTTTACCTATAACCTTTTCTTTAGCTCCATCACCAAAGGTTACATGGTTAGATGTACATTCCTTTAGTTTTGAAAGAAAACCTATTTTCCAATCATTTAGTGAGAACATCCGCTATCCAAGTACCAATATCTTCATTGGATGATAAGTGTATTGAGGTAAAGTTAACATTGCAGCAACCACTGTTTTGATGTAGACTATTCTTTACTTCCATTCCAGCTTTGGCAAACCTTGTTTCTTTCCCTACATTTTGGTTCATCCATTAACTATCCTCTTGAGAGGATTCGTAGGCACTTCTATCATGGAGATGGAAGCAAAATGGACGAATATGTCCAATTTCATCATAGTTGTGACTCACCCATCTTCTATTTGACTTATTTATAGCTAGAAACAAGAGAGGTCTTCAGGTTCCTAGACCTTTCAACCAAACTTCATCTAAGCCTTTATCTTAAGCAAGGACAGATTGTCTGAGGTATTGGAACAAGAGCATGTTTCCTTTTTCTTTGTCAAAGCCAACACCTTTTCTATTGGTACTGGATTTGCCAACTTTCAGAATTGAGTTTAAGTCATCTGTTTCAGAATTGAGCATTTGTACTGACTTGGATATAGTTTCAAATTCAATTCGAGTAGCCTTTAGATTCTCTTTTAACTAAAAAATTATGGACAACAAGCAAGGATTATTTTCTATTGAAAACTTCTGAATTATGTTCTTCTGTTGCTCAAGAACACGCAGGTCTTCTTCCCATTTTTCGTATATCTCATTAAAACTTAAGCCTTTAGCTTGATGATTTTAGCAGGAGGAGCATCGGTCTTTGATTCAATCACTTGCAAAACAAGATCAATATTTTTTTCATCTTCATCGCTTGTAA
GGCATCCAAGCAAGGCATAAATATCATCTTTACGATCACTGCTTGTAGTTGATTCATCATCGAAAGAGTAACAACAAGACATTTGTTTTGTCTTTTCAGAAAATTTGGACATTCTGCTTCATAGTGACCAATCCCGCACATTCTCTGCATCTGAAACTTTTGTCACTCTTAGATTGATTATTGCTGATGCTTTTCTCATTTTCTCTTCTTTTGCCCGAAGGATTCATAGATAAATTTGCACTACTGTAGTTTCCATGATCTTTGTTATACTGACCTTCATAATTGTCAGGTTATTTGTTGAATTGCTTCATCACTTTGGAGAATTGTTTTTTCAGCAAGACAATTGAATGAGCTAAGTTTTCATCCACTACCTTGCTTTTACTATGATAAGACTCATCATCCCCACAGATTGTAAGGTCAACCCTTTTCCCTTCTTATCATTCTTTTCACCAAAGGTTAGTTCAAATGTGCGTAAAAACTGATCTATCTCCATGGTGACAATATCATTCGATTCTTCAATAGTTGTTATCTTCAACGTTAAAGCGTTTTGACAAAGAACGAAACACCTTTCTTACTAGTTTCTCTTCAGATAATTCTCATGAAGCGCGAACGACTCGTTTGCAATGTCCAGGACACAAACATTATAGTCAGCAATAGTTTCATCATCCATCATCTTAAAAGAAAGACATCAACATTTGAAGATGAGACATTTTACTTTGGACGTACCTTCATAAACAATCTCTAGAATTCCCCATGCTTCTTTAGCAAACGTACATGTATTTATTAGTTTGGACGAATTCCCAAAGATGCCTCATCTTTAGCTTCGAACCAGTCTTTCTCTGGTTTAGGGGACTCTTTTCCTTCAGCATTAGTGATAAAAGGATGAGTCCAACCTGAGATGACTACTTTCCAGGTTTTGTTGTCAATCGATTCGATTAAGGCAATCATTCTGACCTTCCAATATGCATAGCTTGAACCATCTAGCACACAAGAGGTAGTTGTTGATCCACCTTCTCTAGTCAAAGTCATAGCCAAACACTTAAACCTAGTTGTAATGCCAATTTGGAAAACACAAAGTTAACCTAGAGAATTATGTTAGCACTAAAGCTATTGACAAAGTAAATAGCATAAGCAATAGTAAGTAAAGATAGAACACATTCAAGTTGGTAATGCAGTTTGGTGTATTAATCACACCAACATCTGAGAGACCTTAAGCTCGACTAAAAAGATCATTTGCTAAGATTCAAAAGGTTCACAAGTACATAATTTATCAAGAATTAACTATCTCGTATTACACAACTTTCACAAGATAAATGTTTAGGCTCCGCCTTATAAACTACTTGAAATGTGAACAAGGGATTCGATCTAAGCTCCCTCTAGATCATACGTTTCATGCTTAACCCTTGCGTCTCCATATCTATGCTCCCTCTGAGAGAGACTCCTCATAATAACTTTTAATGTTAGGCTCTTCCTAACGTATTCAAATCTTTTCAATCTAGTGAATCATGAAATCAATGACAAGGTATTCAACATTACACACCACATAAAGGAATTTCTCGCAAGATAGTCACCATACACATGAATAATGCTTGAAGGAGAAATTTTCTATCAAAAACGGACATGGTTATATCCAAGGTCAAAGACAAAAGGTAATAAAAATATTTTCATCAAGAATAGGAAAGGATTTTTATTTTGTTTAAAACAAAGACCGATTTTTATTTTGTTTAAAACGAAGATCATAATAAAATCAAAATTCCCATAAATTACCAACAAAAAGAAAGTAAACATCTCCAATAATATAAGACAAAAATAAAATGAAATTTGCAAGCCAAAACACATAATTTACGTTGATGAAATATGCATTTCCGTTTTTCTTTCGAACGTGAGAATGAATGATATTACATTTTTTATTTTTTGTGCTTTGACATTTGTGTTAAAGTATAAAATTGATTATCTATCAAAGTATTCAACGTTGAAGAAATATTGCTCGATTTTGATCC
CTCAATTGATAGAAACCATGAATGATTAACGAATGGGAGTAGGAGAAATGTATTGTCATGGACTTTTAAAATTGAAATAATAATTATTAAAATAACAAATAATTTAAGAATAAGAATAGTAGGTATTAAAAAATGTGTCCCTGCCCTTTTCCCTTCCACGTTCCAACCTGTGCCTCATCCCATCTTCTTTTCCTCAGCCTCAACGTTGGCGACAACACTCATGCCGAGCCCCACCTCCTATACCTCCACCTTTTCCCTGCATCCTTCTCCTTATCGTCCTTCTTTACCCACCAACACCTCAAGGATGTATTTGTCCTCCAACGTTTGATGTAATACCTAGATTCTAAATTTTGATTCAACACTTCGATCATTATTTTGATACTCCTATGCAACCTGATTTTATCATCCTTATTTGTTTCAAATGCTTCCTAAAGAATTTTGTCGTCCTAATTTATATGACGAACAACCGTTTCTCATTCCCTGATGTGCATTTCATTTTGGAAAAAGTTTTACATGTATACAAGGAGATAGAGAAGACAATGAAGAAGAGAACGACCTCTTGATATTAGAATGGGGACTGAGACTTTTTAATGTTGGGGATTACCTGCTTAAACACCCTCCATAAGTAAATTAATGATTTCTATTAATTAAGAGGTGAAAATTGAGTATTTTCCAAATGTTGCTCAGTTTTGAAAACTAAGTAGAAATTAATATGACACAAAAATCAAATCTTGATCATTTTCACTGCATCAAAATTGATGATGATTTAACTGAACTTGCATTGTGAGGTCTTTTCCACTTTTTAAAACAACCGTTTATTATGCAAATTTTCTGTTCAAATGGTGTGTTTAATATATAATAGGCAATAAGGATTAGAAAAACATTGGGTGGAGGAATGAGGCAAATTGGCATCCTTTGTGCAGCTGGACTTGTTGCTATCAAAGAGAATGTTCAAAAACTTGAAGCTGATCATGAGAAGGCTAAGCAACTAGCGAGTAATTAATATCACTCTTTCACATAATTTATCAACATTCTCGATTACTATTTTCCTCTTTCTTTTTTTAATATTTATTTTTGGATTATATAGGTGGGTTATACCAAATCAAAGGATTAAAGGTAGATACGAAATCAGTTGAGACAAACATTGTGAGTATCTCTAAACATAATTAAAATTTTAGTGATTTTTATCATGTTGTTGTGTTTGATCAATTGATTTTTTTTCCTTTTGTTTTTTTAAAACTAGAATAAAAAGGTTTAAAAGTAGTTTTTAAAACTTTATAAACTTCTGTGGGTAGCTTATGTTTGATTTTAATGACTATATTCTATACAGTCAATATGGACCAAACTCATATTAAATAAAAAATTATTTGCTCTCTCAATATTGAAGGAAGGCACCCCAAAACTTTTGAGATTGCATCCAATATGGAAATCATATAATGAGTGAGAATCTCTAGATCCTATATTATCTTACATTACCCTCAATACAAAAACGGTAATGATCCAGATACTTACTCAATAAGAATCTTGTGATGTTCATCAAACAATCCTTCCCCTAATATAGATTACAAAATGAATTATTAAACTTTTTTGTCTTGTCCCAAAACTTCAAAATTTATCGATGTTGAAATGAATGTCGAAATTTTTTATTTTGGGTTTTATAGAGCATGAATCATCAATATTGTATCTAATAACTTTTTTAAAAAATTTAAAATTCTTTTAAATGAATTTAATCTCTATTAGATAAAAAAATTTAAATCATGTGTTTAATGTTGGTGAACTTTTAATTTTGTGTTAAGAAATGTTTTGATTGAAAATTTCTAAGGAATACTCGCAAATATAGCGATTAAATTCAAAATATTAACATATACGATAATATTTAATAAAATTGCAAATATAGCAAAATTTGTCAAAGTTGATCGATGATTGAAATCTATCTTTGCTATATTTGTTGACAAACCGACTTAGACTCATATTCCTTGTTACGAAAGATTTACGCCTCT
TTTATGTGTAGTACCAACAATTCTGAAACAGAAAAAACATTATAAATTTGAGGTATTTTAAAACCATATCTAATCCTTTCTTGTATCGTTAGTTTTAATCTTTTCTAAGTATTTTCTCAATTTTGTTTGTTTTGGGGTATTTCTATATGGTTTGTGTTGGTTGTTGAGTGGAGAATGAGGTTTTTGAGAAGTGGGTGTTGGGTGCATTAATTTCACTCCGGATCCTGTTTTTCTTGATTCTCCAATTTTTTTTCCTTTCACATTTCTTCACCTACTACTCTTGAATGGAAGAAAGACACGTGTGGACCGGTGGTAGTTTTCAGTATTCTTGAAAATACCAAACTATTAGGGTCTGTCTAACCAAGTTAATCACTTTTTAAATATTTCAAGCTTTCCATTTTATTTTTCTTCTCTTTCTCGCCTTGTGATGAAATCGTTATTTCTTTTATCTTGCTCCTACAATTTTTTTATCTTGTTCTTGCGATTTCTTTTTTCCTCTTGATTTTTTTTGTGATTTTTTTCCATTATCTTTTTATTTTTCAATTCTTTATTTACATTGTTTAATCTGTCTTCCTTATTTTTTTTTTTTTTTCGATTTATTTCCATCTTCTTTCTACTTTTCATTCTTTTTTTTCGCTACGATTTTTTTTCATCGTCTTTCATTTTTTCATTTTTTTTCACGTCATTTACATTTGATTGATTATGTACAAAAAAAATATATAGCAAAATCTAAAAGATTGTGTAGAAATAATTTGAAAAAAAATCATTTAGATTGTAGTAGCCAATGTCGTGTAAAAAATAAACAAAAAAATTTAAACGATCGTGTAAAGAAAATAAAAGAAAAAAAATCATTTGATTGAAATAGCCAAATGTAAACTATGTGTATGAAATTTTGGGGAAAAAAATTAGATTTGAGTCCCAAATCTAAACGTGTAACCAAATTGAACGAAACATGATTATAAAAAATCTTGAGAAAAAATAATATTGGACGAAAGATGGTTATAAAGAATATTGAAAAAAAATAGCCCAATATAAACGATAATGTAAAAAATGAATCCCAAAAGAATTAAAAAAAAAATAGTGAACCAAATTCTTTTTTTTAAAAAAATCATTTACGACCAAATCTATAACCAAATTAATAGATCGTGTAACAAAATTAAAAGATGGAATTGAAAATATGAATTGTAGTGATATCTAAACAATGGTGTACCAAACAATAACCTTTGACGATGTTGTTGAACGAATATTTTTGGTATTTTACACGGTGGACCTCTGTACATTTTTTGTTTTTGGAATTATTCTATACATTATAAATAAGGGATAGTTGCAAATGTAGCAATTATATTCGAAATAATTAAGTACATAGCAATATTTTAAAAAAATTGCAAATATAGCAAAATCTGTCAAAATCTATCAATGATAGGAGTCTATCACTGATAGACCATGTAGCAAATGTTGGTCTATCACCGATAAACCATAAGAGTCTATCAACGATAGAAGCTATCACTGATAGATTTTGCTATATTTACAATTTTTTTAAAATATTGCTACATACTTAATAATTATTCTAAAAATTGCTACCAATTATAATTACCCTATAAATATTGGTCGTTTTTTTATATTTTTGAAAAGGCCCGTTTTTCTAATAATTTTTTTTAATATTGTTATATTTAAAGTTGCTGCACCAATTAAAATTTCCATAAACAAATTTGCAATTAGTTTCAGTACCAAATCACAACGTCTAGAAAAAAATTGTAAACTACAGCGAATAAAATGAGATCAGTTAGATCAACGTAGGACATAATTTCCATTGACTAATTATCTGTTGAAAGCTTTTCATTTCTTCGTGGGTTTTATTTTAATACTTAAATTACAAAATCTAAAAATTTATTTCTCTATAATAATTATAATTATTTTGAAATTATTTTTACTTATTTACTTTGTTATATATTTATCAGATATTCTTTGAAATAGAAGACGATTACGGAATCTCGATGGAA
ACATTATGTAAAACCTTGGAAGAACGTGGCATTTTTATGATGCTAGAAAGCCAAACAAGGTCTATTCTAAAATTCTTTTCAAAACAGTATATATAAAAAAATTCTATTCTCAAATTAGAAAAGAAAAAAAATTCTACCTTAGTTTCCATTATATTCTAGTTAATTACTTTGTGATAACTTTATTTAGCTTTTAGGTAAATATTTTGACTTGAAATAGTTTAAATATAATATTTATAAAATATAATAAAATTTTAAATTTTAAACGTATTTACTATTAATAATTTATCATGGATAAGAACTAAATTTTATAAATATCTTAAAAAGTTTTATTATTTACGATAATTTTTCTTTATTTTTTTTACATATAGTTAATGTCTTTTCGGAAAAATATTATACTAGAAAACATTTGAAATAACTTTTGAAAGCATAAAGATAGTACAAAATGTCAAAGTTATTTGATATGACAAGTTTTTCACTATCTAATCGATGTTTATATGCTAAAATTTCATTGTTTTTAAATGTGTTATGCTTTGAAAAGTATGAAAGAATATTTTATTTTATTTGCATTTTTTCTAATTACAATTACAATTTATATTAAAAGATATGAACAAATGTATACAATATATATAAATTCTTTTAATATTTCATCAGAAAAAATCAACTAACTTCTTAAGTTGATTTTAAATATAATAAGATAAACTAAGATATCTATAAATATAGCAAAATACTATAGCTTTCTTGCAAATTGTGATATTTTTTATACTTATAATATTTTCTTCAAATATTTTTTTTGTTTTTTTGTCAATTTAAAATTTTAGAGAGAAAACACCAAACAAAATTGTCAAAATAACACATTTTATTTTTCTTCGTTTTGAAATGAAGTGTGCGTGAAAATAAAAATTTAGGATAAAAAAGTCCGATGTTTATCCAAAAAAGTTCAATTCGGAATCATGAGTTTTTTATAACTTCTGAAATAGTGAAATTAAATATTTTAATTAATTGGAAAAATATTTTTAAAAGTCTGTTAAAAATGGTTGTAACATCCTTAACAGAATAAAAAAAGAAAAAAAATAATTTTTTTATGATGAACAACATACATAAACTGCTATATATGTTTATTTGTATCACATCTTCTCATTTAGAATATCTTTCCAAAAAATGGTAATATATAACTCTTTTTTGGTCTAAGGTTGTAGATTTTGTTTTTAATCCTATATAAGTTATTTGAATCTTATTTAAGAAGGAATAATTTAAAATTAGTAAAGTTCAATTAATTGTAGTATTTATTTATTTATTAAACAAAATTCAATTACAGAGCTAGAATTGTTCTTCATCATCAGATTTCAACAAGTGATGTTCAGTACACTCTATCTTGCTTTCAGGTGATCTTTCGATTATTGAGTTCTTTATCATAAATGACAAAACTACTCAAATAAGTTAGAATTGAGAGTTTATTCATTATCTTTATAAGTTTAATTATATAAGAACTAACAATGAGTTTTGCATTCTCCCCCAAATTGTTGATTAAACTAACATAAACCTTTTTCACCTTGGCTTTTGCAGCAAACTCTAAAAGGAATTAAAGTTGTAAATGGCAACTAATTGATCTCATAAGTAACTCTTTCTCTCTCTCTCTCCCTAATAAATAAACTGGAAGTTACCCAATTTGGCTTATGCTTAATATGGCC

mRNA sequence

CTTATATTTTGCTATAAATTTCGTTTATGTTTATGCAAATTAACCCAATCGTGGGTTCATCCGTATCTAAGTGTTTGAAGTACTTAAAGAAGCTATATCAACTGCTTTTTCTTGGGCTTATAACATCTTCCATTTCAAGATTTCAGTGCAAAATGGTGAGTAGAAAGGTGGATTTGCGGTCAGACACTGTGACGAAACCGACGGAATCAATGCGAGCTGCGATGGCTATGGCGGAGGTGGACGACGATGTGTTGGGGTACGACCCCACAGCCTTGGAGTTGGAAGAAAAGATGGCAAAGATAATGGGAAAAGAAGGAGGGTTATTCGTTCCATCAGGGACAATGGGAAATCTAATAAGTATTCTTGTGCATTGTGAAACTAGAGGGAGTGAAGTGATTGTTGGAGACAATTCTCATATTCATATATTGGAGAATGGAGGTATTGCAACCATTGGAGGAGTTCATCCAAGAACGGTGAAGAACAAAGATGATGGAACAATGGATATTGATTTGATTGAAGCTGCCATTAGAAATCCAAAGGGACAGCTCTTCTTCCCGACAACAAGGCTCATTTGTTTAGAAAATACACATGCAAATTCGGGTGGAAAATGTCTTTCGATGGAATATACCGATGAAGTTGGAGAGTTAGCTAAAAAGCATGACCTCAAACTTCACATTGATGGAGCTCGAATTTTCAATGCTTCAATTGCACTAGCTATTCCGGTCGATCGATTGGTACGAGCCGCTGATTCAGTATCAGTATGCCTATCAAAAGGTTTGGGTGCACCTGTTGGATCAATCATTCTGGGTTCAAAAGACTTTATTACCAAGGCAATAAGGATTAGAAAAACATTGGGTGGAGGAATGAGGCAAATTGGCATCCTTTGTGCAGCTGGACTTGTTGCTATCAAAGAGAATGTTCAAAAACTTGAAGCTGATCATGAGAAGGCTAAGCAACTAGCGAGTGGGTTATACCAAATCAAAGGATTAAAGGTAGATACGAAATCAGTTGAGACAAACATTATATTCTTTGAAATAGAAGACGATTACGGAATCTCGATGGAAACATTATGTAAAACCTTGGAAGAACGTGGCATTTTTATGATGCTAGAAAGCCAAACAAGAGCTAGAATTGTTCTTCATCATCAGATTTCAACAAGTGATGTTCAGTACACTCTATCTTGCTTTCAGCAAACTCTAAAAGGAATTAAAGTTGTAAATGGCAACTAATTGATCTCATAAGTAACTCTTTCTCTCTCTCTCTCCCTAATAAATAAACTGGAAGTTACCCAATTTGGCTTATGCTTAATATGGCC

Coding sequence (CDS)

ATGTTTATGCAAATTAACCCAATCGTGGGTTCATCCGTATCTAAGTGTTTGAAGTACTTAAAGAAGCTATATCAACTGCTTTTTCTTGGGCTTATAACATCTTCCATTTCAAGATTTCAGTGCAAAATGGTGAGTAGAAAGGTGGATTTGCGGTCAGACACTGTGACGAAACCGACGGAATCAATGCGAGCTGCGATGGCTATGGCGGAGGTGGACGACGATGTGTTGGGGTACGACCCCACAGCCTTGGAGTTGGAAGAAAAGATGGCAAAGATAATGGGAAAAGAAGGAGGGTTATTCGTTCCATCAGGGACAATGGGAAATCTAATAAGTATTCTTGTGCATTGTGAAACTAGAGGGAGTGAAGTGATTGTTGGAGACAATTCTCATATTCATATATTGGAGAATGGAGGTATTGCAACCATTGGAGGAGTTCATCCAAGAACGGTGAAGAACAAAGATGATGGAACAATGGATATTGATTTGATTGAAGCTGCCATTAGAAATCCAAAGGGACAGCTCTTCTTCCCGACAACAAGGCTCATTTGTTTAGAAAATACACATGCAAATTCGGGTGGAAAATGTCTTTCGATGGAATATACCGATGAAGTTGGAGAGTTAGCTAAAAAGCATGACCTCAAACTTCACATTGATGGAGCTCGAATTTTCAATGCTTCAATTGCACTAGCTATTCCGGTCGATCGATTGGTACGAGCCGCTGATTCAGTATCAGTATGCCTATCAAAAGGTTTGGGTGCACCTGTTGGATCAATCATTCTGGGTTCAAAAGACTTTATTACCAAGGCAATAAGGATTAGAAAAACATTGGGTGGAGGAATGAGGCAAATTGGCATCCTTTGTGCAGCTGGACTTGTTGCTATCAAAGAGAATGTTCAAAAACTTGAAGCTGATCATGAGAAGGCTAAGCAACTAGCGAGTGGGTTATACCAAATCAAAGGATTAAAGGTAGATACGAAATCAGTTGAGACAAACATTATATTCTTTGAAATAGAAGACGATTACGGAATCTCGATGGAAACATTATGTAAAACCTTGGAAGAACGTGGCATTTTTATGATGCTAGAAAGCCAAACAAGAGCTAGAATTGTTCTTCATCATCAGATTTCAACAAGTGATGTTCAGTACACTCTATCTTGCTTTCAGCAAACTCTAAAAGGAATTAAAGTTGTAAATGGCAACTAA

Protein sequence

MFMQINPIVGSSVSKCLKYLKKLYQLLFLGLITSSISRFQCKMVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVPSGTMGNLISILVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDLIEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARIFNASIALAIPVDRLVRAADSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQIGILCAAGLVAIKENVQKLEADHEKAKQLASGLYQIKGLKVDTKSVETNIIFFEIEDDYGISMETLCKTLEERGIFMMLESQTRARIVLHHQISTSDVQYTLSCFQQTLKGIKVVNGN
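
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. The sketch below assumes the standard genetic code (the page does not name a translation table) and uses a hypothetical helper `translate`; it is applied only to the first and last codons of the CDS shown above, not to the full sequence.

```python
# Standard genetic code, laid out in T, C, A, G order for each codon position.
# Assumption: this nuclear gene is translated with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 30 and last 15 bases of the CDS listed above.
cds_start = "ATGTTTATGCAAATTAACCCAATCGTGGGT"
cds_end = "GTAAATGGCAACTAA"
print(translate(cds_start))  # MFMQINPIVG
print(translate(cds_end))    # VNGN
```

Both fragments agree with the start (MFMQINPIVG) and end (VNGN) of the listed protein sequence.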
Homology
BLAST of IVF0022074 vs. ExPASy Swiss-Prot
Match: Q8RXU4 (Probable low-specificity L-threonine aldolase 1 OS=Arabidopsis thaliana OX=3702 GN=THA1 PE=1 SV=1)

HSP 1 Score: 513.1 bits (1320), Expect = 2.9e-144
Identity = 245/349 (70.20%), Positives = 302/349 (86.53%), Query Frame = 0

Query: 43  MVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVP 102
           MV R VDLRSDTVT+PT++MR AM  AEVDDDVLGYDPTA  LEE+MAK+MGKE  LFVP
Sbjct: 1   MVMRSVDLRSDTVTRPTDAMREAMCNAEVDDDVLGYDPTARRLEEEMAKMMGKEAALFVP 60

Query: 103 SGTMGNLISILVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDL 162
           SGTMGNLIS++VHC+ RGSEVI+GDN HIH+ ENGGI+TIGGVHP+TVKN++DGTMD++ 
Sbjct: 61  SGTMGNLISVMVHCDVRGSEVILGDNCHIHVYENGGISTIGGVHPKTVKNEEDGTMDLEA 120

Query: 163 IEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARI 222
           IEAAIR+PKG  F+P+TRLICLENTHANSGG+CLS+EYT++VGE+AK+H +KLHIDGAR+
Sbjct: 121 IEAAIRDPKGSTFYPSTRLICLENTHANSGGRCLSVEYTEKVGEIAKRHGVKLHIDGARL 180

Query: 223 FNASIALAIPVDRLVRAADSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQ 282
           FNASIAL +PV +LV+AADSV VCLSKGLGAPVGS+I+GS+ FI KA  +RKTLGGGMRQ
Sbjct: 181 FNASIALGVPVHKLVKAADSVQVCLSKGLGAPVGSVIVGSQSFIEKAKTVRKTLGGGMRQ 240

Query: 283 IGILCAAGLVAIKENVQKLEADHEKAKQLASGLYQIKGLKVDTKSVETNIIFFEIEDDYG 342
           IG+LCAA LVA++EN+ KL+ DH+KAK LA GL Q+KG++V+  +VETN+IF ++ED   
Sbjct: 241 IGVLCAAALVALQENLPKLQHDHKKAKLLAEGLNQMKGIRVNVAAVETNMIFMDMEDGSR 300

Query: 343 ISMETLCKTLEERGIFMMLESQTRARIVLHHQISTSDVQYTLSCFQQTL 392
           ++ E L K LEE GI ++  + +R RIV+HHQI+TSDV YTLSCFQQ +
Sbjct: 301 LTAEKLRKNLEENGILLIRGNSSRIRIVIHHQITTSDVHYTLSCFQQAM 349

BLAST of IVF0022074 vs. ExPASy Swiss-Prot
Match: Q9FPH3 (Probable low-specificity L-threonine aldolase 2 OS=Arabidopsis thaliana OX=3702 GN=THA2 PE=1 SV=1)

HSP 1 Score: 485.7 bits (1249), Expect = 4.9e-136
Identity = 237/344 (68.90%), Positives = 291/344 (84.59%), Query Frame = 0

Query: 46  RKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVPSGT 105
           R VDLRSDTVTKPTESMR+AMA AEVDDDVLG DPTAL LE+++A+I GKE  +FVPSGT
Sbjct: 8   RTVDLRSDTVTKPTESMRSAMANAEVDDDVLGNDPTALRLEKEVAEIAGKEAAMFVPSGT 67

Query: 106 MGNLISILVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDLIEA 165
           MGNLIS+LVHC+ RGSEVI+GD+SHIHI ENGG++++GGVHPRTVKN++DGTM+I  IEA
Sbjct: 68  MGNLISVLVHCDERGSEVILGDDSHIHIYENGGVSSLGGVHPRTVKNEEDGTMEIGAIEA 127

Query: 166 AIRNPKGQLFFPTTRLICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARIFNA 225
           A+R+PKG L  P T+LICLENT AN GG+CL +EY D+VGELAKKH LKLHIDGARIFNA
Sbjct: 128 AVRSPKGDLHHPVTKLICLENTQANCGGRCLPIEYIDKVGELAKKHGLKLHIDGARIFNA 187

Query: 226 SIALAIPVDRLVRAADSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQIGI 285
           S+AL +PV R+V+AADSVS+CLSKG+GAPVGS+I+GSK FITKA  +RKTLGGGMRQIG+
Sbjct: 188 SVALGVPVKRIVQAADSVSICLSKGIGAPVGSVIVGSKKFITKARWLRKTLGGGMRQIGV 247

Query: 286 LCAAGLVAIKENVQKLEADHEKAKQLASGLYQIKGLKVDTKSVETNIIFFEIEDDYGISM 345
           LCAA LVA+ ENV KLE DH+KA+ LA GL +I+ L+V+  +VETNII+ +I +D     
Sbjct: 248 LCAAALVALHENVAKLEDDHKKARVLAEGLNRIERLRVNVAAVETNIIYVDIPEDPKFGA 307

Query: 346 ETLCKTLEERGIFMMLESQTRARIVLHHQISTSDVQYTLSCFQQ 390
           E  CK+LE+ G+ ++ ++  R RIVLHHQIS  DV+Y LSCF++
Sbjct: 308 EEACKSLEDVGVLVIPQATFRIRIVLHHQISDVDVEYVLSCFEK 351

BLAST of IVF0022074 vs. ExPASy Swiss-Prot
Match: O07051 (L-allo-threonine aldolase OS=Aeromonas jandaei OX=650 GN=ltaA PE=1 SV=1)

HSP 1 Score: 278.1 bits (710), Expect = 1.6e-73
Identity = 154/346 (44.51%), Positives = 216/346 (62.43%), Query Frame = 0

Query: 46  RKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVPSGT 105
           R +DLRSDTVT+PT++MR  M  AEV DDV G DP    LE   A ++GKE  LFVPSGT
Sbjct: 2   RYIDLRSDTVTQPTDAMRQCMLHAEVGDDVYGEDPGVNALEAYGADLLGKEAALFVPSGT 61

Query: 106 MGNLISILVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDLIEA 165
           M NL++++ HC+ RG   ++G  +HI+  E  G A +G V  + V  + DG++ +  + A
Sbjct: 62  MSNLLAVMSHCQ-RGEGAVLGSAAHIYRYEAQGSAVLGSVALQPVPMQADGSLALADVRA 121

Query: 166 AIRNPKGQLFFPTTRLICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARIFNA 225
           AI      + F  TRL+CLENTH    GK L + Y  E+ EL  +H L+LH+DGAR+FNA
Sbjct: 122 AI--APDDVHFTPTRLVCLENTH---NGKVLPLPYLREMRELVDEHGLQLHLDGARLFNA 181

Query: 226 SIALAIPVDRLVRAADSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQIGI 285
            +A    V  LV   DSVS+CLSKGLGAPVGS+++GS  FI +A R+RK +GGGMRQ GI
Sbjct: 182 VVASGHTVRELVAPFDSVSICLSKGLGAPVGSLLVGSHAFIARARRLRKMVGGGMRQAGI 241

Query: 286 LCAAGLVAIKENVQKLEADHEKAKQLASGLYQIKGLKVDTKSVETNIIFFEIEDDYGISM 345
           L  AGL A++++V +L  DH +A+QLA GL  + G+++D   V+TN++F ++        
Sbjct: 242 LAQAGLFALQQHVVRLADDHRRARQLAEGLAALPGIRLDLAQVQTNMVFLQLTSG---ES 301

Query: 346 ETLCKTLEERGIFMMLESQTRARIVLHHQISTSDVQYTLSCFQQTL 392
             L   ++ RGI  +       R+V H QI   D++  +  F + L
Sbjct: 302 APLLAFMKARGI--LFSGYGELRLVTHLQIHDDDIEEVIDAFTEYL 336

BLAST of IVF0022074 vs. ExPASy Swiss-Prot
Match: Q21890 (Uncharacterized protein R102.4 OS=Caenorhabditis elegans OX=6239 GN=R102.4 PE=3 SV=3)

HSP 1 Score: 276.9 bits (707), Expect = 3.5e-73
Identity = 154/350 (44.00%), Positives = 218/350 (62.29%), Query Frame = 0

Query: 48  VDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVPSGTMG 107
           +DLRSDTVT P+  MR AMA A V DDV G D T   LE++ A++ GKE GLFV SGTMG
Sbjct: 67  IDLRSDTVTVPSVEMRRAMAEAIVGDDVYGEDTTTNRLEQRCAELFGKEAGLFVTSGTMG 126

Query: 108 NLISILVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDLIEAAI 167
           NL++I+ HC+ RG E+IVG  +HIH  E G  A   G+   T++ K DGTMD++ IE AI
Sbjct: 127 NLLAIMAHCQ-RGEEIIVGRYNHIHRWEQGNYAQFAGISATTLEVKPDGTMDLNDIEQAI 186

Query: 168 RNPKGQLFFPTTRLICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARIFNASI 227
           R        P ++LIC+ENTH  +GGK L +E+   V +LA++ DLK+H+DGARI+NA++
Sbjct: 187 R--VKDCHMPASKLICIENTHNYTGGKALPIEWMRSVKQLAERRDLKVHMDGARIYNAAV 246

Query: 228 ALAIPVDRLVRAADSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQIGILC 287
           A    V ++   AD+V +C SKGLGAPVGSI++G KDFI +A   RK LGGG RQ GIL 
Sbjct: 247 ASNCSVSKIASFADTVQMCFSKGLGAPVGSIVVGPKDFIDRARHSRKALGGGWRQSGILA 306

Query: 288 AAGLVAIKENVQKLEADHEKAKQLASGLYQIKGLKVDTKSVE-----TNIIFFEIEDDYG 347
           AA  +A+      + ADHE+AK LA  +      +  TK        TN++    ++  G
Sbjct: 307 AAAHIALDHADATIRADHERAKTLARMINDATPEEFRTKVFAAEKDITNMVLVHCQN--G 366

Query: 348 ISMETLCKTLEERGIFMMLESQTRARIVLHHQISTSDVQYTLSCFQQTLK 393
           ++++ L    ++  I  M     R R+VL+  +S  +++  +  +++ LK
Sbjct: 367 VTVQQLTDFFQKHDILAMTFDARRIRMVLNWNVSDENLETIVEVYKKFLK 411

BLAST of IVF0022074 vs. ExPASy Swiss-Prot
Match: P75823 (Low specificity L-threonine aldolase OS=Escherichia coli (strain K12) OX=83333 GN=ltaE PE=1 SV=1)

HSP 1 Score: 253.8 bits (647), Expect = 3.2e-66
Identity = 133/329 (40.43%), Positives = 201/329 (61.09%), Query Frame = 0

Query: 48  VDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVPSGTMG 107
           +DLRSDTVT+P+ +M  AM  A V DDV G DPT   L++  A++ GKE  +F+P+GT  
Sbjct: 2   IDLRSDTVTRPSRAMLEAMMAAPVGDDVYGDDPTVNALQDYAAELSGKEAAIFLPTGTQA 61

Query: 108 NLISILVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDLIEAAI 167
           NL+++L HCE RG E IVG  +H ++ E GG A +G + P+ +    DGT+ +D  + A+
Sbjct: 62  NLVALLSHCE-RGEEYIVGQAAHNYLFEAGGAAVLGSIQPQPIDAAADGTLPLD--KVAM 121

Query: 168 RNPKGQLFFPTTRLICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARIFNASI 227
           +     + F  T+L+ LENTH    GK L  EY  E  E  ++ +L LH+DGARIFNA +
Sbjct: 122 KIKPDDIHFARTKLLSLENTH---NGKVLPREYLKEAWEFTRERNLALHVDGARIFNAVV 181

Query: 228 ALAIPVDRLVRAADSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQIGILC 287
           A    +  + +  DS ++CLSKGLG PVGS+++G++D+I +AIR RK  GGGMRQ GIL 
Sbjct: 182 AYGCELKEITQYCDSFTICLSKGLGTPVGSLLVGNRDYIKRAIRWRKMTGGGMRQSGILA 241

Query: 288 AAGLVAIKENVQKLEADHEKAKQLASGLYQIKGLKVDTKSVETNIIFFEIEDDYGISMET 347
           AAG+ A+K NV +L+ DH+ A  +A    Q++    D    +TN++F  + ++   ++  
Sbjct: 242 AAGIYALKNNVARLQEDHDNAAWMAE---QLREAGADVMRQDTNMLFVRVGEENAAALGE 301

Query: 348 LCKTLEERGIFMMLESQTRARIVLHHQIS 377
             K        +++ +    R+V H  +S
Sbjct: 302 YMKARN-----VLINASPIVRLVTHLDVS 316

BLAST of IVF0022074 vs. ExPASy TrEMBL
Match: A0A1S3BXZ8 (probable low-specificity L-threonine aldolase 1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103494316 PE=3 SV=1)

HSP 1 Score: 776.9 bits (2005), Expect = 4.0e-221
Identity = 397/400 (99.25%), Positives = 398/400 (99.50%), Query Frame = 0

Query: 1   MFMQINPIVGSSVSKCLKYLKKLYQLLFLGLITSSISRFQCKMVSRKVDLRSDTVTKPTE 60
           MFMQINPIVGSSVSKCLKYLKKLYQLLFLGLITSSISRFQCKMVSRKVDLRSDTVTKPTE
Sbjct: 1   MFMQINPIVGSSVSKCLKYLKKLYQLLFLGLITSSISRFQCKMVSRKVDLRSDTVTKPTE 60

Query: 61  SMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVPSGTMGNLISILVHCETRG 120
           SMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVPSGTMGNLISILVHCETRG
Sbjct: 61  SMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVPSGTMGNLISILVHCETRG 120

Query: 121 SEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDLIEAAIRNPKGQLFFPTTR 180
           SEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDLIEAAIRNPKGQLFFPTTR
Sbjct: 121 SEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDLIEAAIRNPKGQLFFPTTR 180

Query: 181 LICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARIFNASIALAIPVDRLVRAA 240
           LICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARIFNASIALAIPVDRLVRAA
Sbjct: 181 LICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARIFNASIALAIPVDRLVRAA 240

Query: 241 DSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQIGILCAAGLVAIKENVQK 300
           DSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQIGILCAAGLVAIKENVQK
Sbjct: 241 DSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQIGILCAAGLVAIKENVQK 300

Query: 301 LEADHEKAKQLASGLYQIKGLKVDTKSVETNIIFFEIEDDYGISMETLCKTLEERGIFMM 360
           LEADHEKAKQLASGLYQIKGLKVD KSVETNIIFFEIEDDYGISMETLCKTLEERGIFMM
Sbjct: 301 LEADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEDDYGISMETLCKTLEERGIFMM 360

Query: 361 LESQTRARIVLHHQISTSDVQYTLSCFQQTLKGIKVVNGN 401
           LESQTRARIVLHHQISTSDVQYTLSCF+QTL GIKVVNGN
Sbjct: 361 LESQTRARIVLHHQISTSDVQYTLSCFKQTLNGIKVVNGN 400

BLAST of IVF0022074 vs. ExPASy TrEMBL
Match: A0A1S3BWA2 (probable low-specificity L-threonine aldolase 1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103494316 PE=3 SV=1)

HSP 1 Score: 756.5 bits (1952), Expect = 5.5e-215
Identity = 389/400 (97.25%), Positives = 390/400 (97.50%), Query Frame = 0

Query: 1   MFMQINPIVGSSVSKCLKYLKKLYQLLFLGLITSSISRFQCKMVSRKVDLRSDTVTKPTE 60
           MFMQINPIVGSSVSKCLKYLKKLYQLLFLGLITSSISRFQCKMVSRKVDLRSDTVTKPTE
Sbjct: 1   MFMQINPIVGSSVSKCLKYLKKLYQLLFLGLITSSISRFQCKMVSRKVDLRSDTVTKPTE 60

Query: 61  SMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVPSGTMGNLISILVHCETRG 120
           SMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVPSGTMGNLISILVHCETRG
Sbjct: 61  SMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVPSGTMGNLISILVHCETRG 120

Query: 121 SEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDLIEAAIRNPKGQLFFPTTR 180
           SEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDLIEAAIRNPKGQLFFPTTR
Sbjct: 121 SEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDLIEAAIRNPKGQLFFPTTR 180

Query: 181 LICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARIFNASIALAIPVDRLVRAA 240
           LICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARIFNASIALAIPVDRL    
Sbjct: 181 LICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARIFNASIALAIPVDRL---- 240

Query: 241 DSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQIGILCAAGLVAIKENVQK 300
               VCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQIGILCAAGLVAIKENVQK
Sbjct: 241 ----VCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQIGILCAAGLVAIKENVQK 300

Query: 301 LEADHEKAKQLASGLYQIKGLKVDTKSVETNIIFFEIEDDYGISMETLCKTLEERGIFMM 360
           LEADHEKAKQLASGLYQIKGLKVD KSVETNIIFFEIEDDYGISMETLCKTLEERGIFMM
Sbjct: 301 LEADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEDDYGISMETLCKTLEERGIFMM 360

Query: 361 LESQTRARIVLHHQISTSDVQYTLSCFQQTLKGIKVVNGN 401
           LESQTRARIVLHHQISTSDVQYTLSCF+QTL GIKVVNGN
Sbjct: 361 LESQTRARIVLHHQISTSDVQYTLSCFKQTLNGIKVVNGN 392

BLAST of IVF0022074 vs. ExPASy TrEMBL
Match: A0A1S3BWX2 (probable low-specificity L-threonine aldolase 1 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103494316 PE=3 SV=1)

HSP 1 Score: 713.8 bits (1841), Expect = 4.1e-202
Identity = 366/371 (98.65%), Positives = 368/371 (99.19%), Query Frame = 0

Query: 1   MFMQINPIVGSSVSKCLKYLKKLYQLLFLGLITSSISRFQCKMVSRKVDLRSDTVTKPTE 60
           MFMQINPIVGSSVSKCLKYLKKLYQLLFLGLITSSISRFQCKMVSRKVDLRSDTVTKPTE
Sbjct: 1   MFMQINPIVGSSVSKCLKYLKKLYQLLFLGLITSSISRFQCKMVSRKVDLRSDTVTKPTE 60

Query: 61  SMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVPSGTMGNLISILVHCETRG 120
           SMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVPSGTMGNLISILVHCETRG
Sbjct: 61  SMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVPSGTMGNLISILVHCETRG 120

Query: 121 SEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDLIEAAIRNPKGQLFFPTTR 180
           SEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDLIEAAIRNPKGQLFFPTTR
Sbjct: 121 SEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDLIEAAIRNPKGQLFFPTTR 180

Query: 181 LICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARIFNASIALAIPVDRLVRAA 240
           LICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARIFNASIALAIPVDRLVRAA
Sbjct: 181 LICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARIFNASIALAIPVDRLVRAA 240

Query: 241 DSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQIGILCAAGLVAIKENVQK 300
           DSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQIGILCAAGLVAIKENVQK
Sbjct: 241 DSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQIGILCAAGLVAIKENVQK 300

Query: 301 LEADHEKAKQLASGLYQIKGLKVDTKSVETNIIFFEIEDDYGISMETLCKTLEERGIFMM 360
           LEADHEKAKQLASGLYQIKGLKVD KSVETNIIFFEIEDDYGISMETLCKTLEERGIFMM
Sbjct: 301 LEADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEDDYGISMETLCKTLEERGIFMM 360

Query: 361 LESQTRARIVL 372
           LESQTR + V+
Sbjct: 361 LESQTRFQQVM 371

BLAST of IVF0022074 vs. ExPASy TrEMBL
Match: A0A6J1DWI0 (probable low-specificity L-threonine aldolase 1 OS=Momordica charantia OX=3673 GN=LOC111024142 PE=3 SV=1)

HSP 1 Score: 604.4 bits (1557), Expect = 3.5e-169
Identity = 306/359 (85.24%), Positives = 329/359 (91.64%), Query Frame = 0

Query: 43  MVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVP 102
           MVSRKVDLRSDTVTKPTE+MRAAMAMAEVDDDVLGYDP AL+LEE+MAK+ GKE  LFVP
Sbjct: 1   MVSRKVDLRSDTVTKPTEAMRAAMAMAEVDDDVLGYDPIALQLEEEMAKMTGKEAALFVP 60

Query: 103 SGTMGNLISILVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDL 162
           SGTMGNLIS+LVHCE RGSEVI+G NSHIHILENGGIATIGGVHPRTVKN  DGTMDIDL
Sbjct: 61  SGTMGNLISVLVHCEIRGSEVILGHNSHIHILENGGIATIGGVHPRTVKNNADGTMDIDL 120

Query: 163 IEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARI 222
           IEAAIRNPKG+LFFPTTRL+CLEN+HANSGGKCLS+EYTDEVGELAKKH LKLHIDGARI
Sbjct: 121 IEAAIRNPKGELFFPTTRLVCLENSHANSGGKCLSVEYTDEVGELAKKHGLKLHIDGARI 180

Query: 223 FNASIALAIPVDRLVRAADSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQ 282
           FNASIAL + VDRLV+AADSVSVCLSKGLGAPVGS+I+GSK FI KA R+RKTLGGGMRQ
Sbjct: 181 FNASIALGVSVDRLVQAADSVSVCLSKGLGAPVGSVIVGSKSFIAKAKRVRKTLGGGMRQ 240

Query: 283 IGILCAAGLVAIKENVQKLEADHEKAKQLASGLYQIKGLKVDTKSVETNIIFFEIED-DY 342
           IGILC+A LVAIKEN+ KLE DH KAK LASGL +I GLKVD KSVETNIIFFE+ED DY
Sbjct: 241 IGILCSAALVAIKENLPKLEDDHHKAKLLASGLSEINGLKVDPKSVETNIIFFELEDVDY 300

Query: 343 GISMETLCKTLEERGIFMMLESQTRARIVLHHQISTSDVQYTLSCFQQTLKGIKVVNGN 401
            IS+ETLCK+LEERGIFMM ES TRARIV+HHQIS SDV YTLSCFQQTL GI+V NGN
Sbjct: 301 KISVETLCKSLEERGIFMMQESSTRARIVIHHQISISDVHYTLSCFQQTLSGIQVGNGN 359

BLAST of IVF0022074 vs. ExPASy TrEMBL
Match: A0A6J1EZ04 (probable low-specificity L-threonine aldolase 1 OS=Cucurbita moschata OX=3662 GN=LOC111440844 PE=3 SV=1)

HSP 1 Score: 599.7 bits (1545), Expect = 8.7e-168
Identity = 299/358 (83.52%), Positives = 326/358 (91.06%), Query Frame = 0

Query: 43  MVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVP 102
           MVSRKVDLRSDTVTKPT+SMRAAMA+AEVDDDVLGYDP AL+LEE+MAK+ GKE  LFVP
Sbjct: 1   MVSRKVDLRSDTVTKPTDSMRAAMAIAEVDDDVLGYDPIALQLEEEMAKLTGKEAALFVP 60

Query: 103 SGTMGNLISILVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDL 162
           SGTMGNLIS+LVHC+ RGSEVI+GDNSHIHILENGGIATIGGVHPRTVKN DDGT+DIDL
Sbjct: 61  SGTMGNLISVLVHCDIRGSEVILGDNSHIHILENGGIATIGGVHPRTVKNNDDGTIDIDL 120

Query: 163 IEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARI 222
           IEAAIRNPKG+LFFPTTRLICLENTHANSGGKCL +EY DEVGELAKKH LKLHIDGARI
Sbjct: 121 IEAAIRNPKGELFFPTTRLICLENTHANSGGKCLPVEYIDEVGELAKKHGLKLHIDGARI 180

Query: 223 FNASIALAIPVDRLVRAADSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQ 282
           FNASIAL + VDRLV+ ADSVSVCLSKGLGAPVGS+I+GSK FI KA R+RKTLGGGMRQ
Sbjct: 181 FNASIALGVSVDRLVQTADSVSVCLSKGLGAPVGSVIVGSKSFIAKAKRVRKTLGGGMRQ 240

Query: 283 IGILCAAGLVAIKENVQKLEADHEKAKQLASGLYQIKGLKVDTKSVETNIIFFEIEDDYG 342
           IGILCAA L+AIKENV KL  DH  AK LASGL QI G+KVD KSVETNIIFFE+E+D  
Sbjct: 241 IGILCAAALIAIKENVPKLATDHHNAKLLASGLNQINGVKVDPKSVETNIIFFEMEEDSK 300

Query: 343 ISMETLCKTLEERGIFMMLESQTRARIVLHHQISTSDVQYTLSCFQQTLKGIKVVNGN 401
           IS+ETLCK+LEERGIFMML+ +TRAR+VLHHQISTSDV+YTLSCFQQTL GI   +GN
Sbjct: 301 ISVETLCKSLEERGIFMMLDGKTRARMVLHHQISTSDVEYTLSCFQQTLSGIAAADGN 358

BLAST of IVF0022074 vs. NCBI nr
Match: XP_008453666.1 (PREDICTED: probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis melo])

HSP 1 Score: 776 bits (2005), Expect = 8.07e-283
Identity = 397/400 (99.25%), Positives = 398/400 (99.50%), Query Frame = 0

Query: 1   MFMQINPIVGSSVSKCLKYLKKLYQLLFLGLITSSISRFQCKMVSRKVDLRSDTVTKPTE 60
           MFMQINPIVGSSVSKCLKYLKKLYQLLFLGLITSSISRFQCKMVSRKVDLRSDTVTKPTE
Sbjct: 1   MFMQINPIVGSSVSKCLKYLKKLYQLLFLGLITSSISRFQCKMVSRKVDLRSDTVTKPTE 60

Query: 61  SMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVPSGTMGNLISILVHCETRG 120
           SMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVPSGTMGNLISILVHCETRG
Sbjct: 61  SMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVPSGTMGNLISILVHCETRG 120

Query: 121 SEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDLIEAAIRNPKGQLFFPTTR 180
           SEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDLIEAAIRNPKGQLFFPTTR
Sbjct: 121 SEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDLIEAAIRNPKGQLFFPTTR 180

Query: 181 LICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARIFNASIALAIPVDRLVRAA 240
           LICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARIFNASIALAIPVDRLVRAA
Sbjct: 181 LICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARIFNASIALAIPVDRLVRAA 240

Query: 241 DSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQIGILCAAGLVAIKENVQK 300
           DSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQIGILCAAGLVAIKENVQK
Sbjct: 241 DSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQIGILCAAGLVAIKENVQK 300

Query: 301 LEADHEKAKQLASGLYQIKGLKVDTKSVETNIIFFEIEDDYGISMETLCKTLEERGIFMM 360
           LEADHEKAKQLASGLYQIKGLKVD KSVETNIIFFEIEDDYGISMETLCKTLEERGIFMM
Sbjct: 301 LEADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEDDYGISMETLCKTLEERGIFMM 360

Query: 361 LESQTRARIVLHHQISTSDVQYTLSCFQQTLKGIKVVNGN 400
           LESQTRARIVLHHQISTSDVQYTLSCF+QTL GIKVVNGN
Sbjct: 361 LESQTRARIVLHHQISTSDVQYTLSCFKQTLNGIKVVNGN 400

BLAST of IVF0022074 vs. NCBI nr
Match: XP_008453667.1 (PREDICTED: probable low-specificity L-threonine aldolase 1 isoform X2 [Cucumis melo])

HSP 1 Score: 756 bits (1951), Expect = 1.01e-274
Identity = 389/400 (97.25%), Positives = 390/400 (97.50%), Query Frame = 0

Query: 1   MFMQINPIVGSSVSKCLKYLKKLYQLLFLGLITSSISRFQCKMVSRKVDLRSDTVTKPTE 60
           MFMQINPIVGSSVSKCLKYLKKLYQLLFLGLITSSISRFQCKMVSRKVDLRSDTVTKPTE
Sbjct: 1   MFMQINPIVGSSVSKCLKYLKKLYQLLFLGLITSSISRFQCKMVSRKVDLRSDTVTKPTE 60

Query: 61  SMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVPSGTMGNLISILVHCETRG 120
           SMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVPSGTMGNLISILVHCETRG
Sbjct: 61  SMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVPSGTMGNLISILVHCETRG 120

Query: 121 SEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDLIEAAIRNPKGQLFFPTTR 180
           SEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDLIEAAIRNPKGQLFFPTTR
Sbjct: 121 SEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDLIEAAIRNPKGQLFFPTTR 180

Query: 181 LICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARIFNASIALAIPVDRLVRAA 240
           LICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARIFNASIALAIPVDRLV   
Sbjct: 181 LICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARIFNASIALAIPVDRLV--- 240

Query: 241 DSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQIGILCAAGLVAIKENVQK 300
                CLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQIGILCAAGLVAIKENVQK
Sbjct: 241 -----CLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQIGILCAAGLVAIKENVQK 300

Query: 301 LEADHEKAKQLASGLYQIKGLKVDTKSVETNIIFFEIEDDYGISMETLCKTLEERGIFMM 360
           LEADHEKAKQLASGLYQIKGLKVD KSVETNIIFFEIEDDYGISMETLCKTLEERGIFMM
Sbjct: 301 LEADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEDDYGISMETLCKTLEERGIFMM 360

Query: 361 LESQTRARIVLHHQISTSDVQYTLSCFQQTLKGIKVVNGN 400
           LESQTRARIVLHHQISTSDVQYTLSCF+QTL GIKVVNGN
Sbjct: 361 LESQTRARIVLHHQISTSDVQYTLSCFKQTLNGIKVVNGN 392

BLAST of IVF0022074 vs. NCBI nr
Match: XP_008453668.1 (PREDICTED: probable low-specificity L-threonine aldolase 1 isoform X3 [Cucumis melo])

HSP 1 Score: 714 bits (1842), Expect = 2.84e-258
Identity = 366/371 (98.65%), Positives = 368/371 (99.19%), Query Frame = 0

Query: 1   MFMQINPIVGSSVSKCLKYLKKLYQLLFLGLITSSISRFQCKMVSRKVDLRSDTVTKPTE 60
           MFMQINPIVGSSVSKCLKYLKKLYQLLFLGLITSSISRFQCKMVSRKVDLRSDTVTKPTE
Sbjct: 1   MFMQINPIVGSSVSKCLKYLKKLYQLLFLGLITSSISRFQCKMVSRKVDLRSDTVTKPTE 60

Query: 61  SMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVPSGTMGNLISILVHCETRG 120
           SMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVPSGTMGNLISILVHCETRG
Sbjct: 61  SMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVPSGTMGNLISILVHCETRG 120

Query: 121 SEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDLIEAAIRNPKGQLFFPTTR 180
           SEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDLIEAAIRNPKGQLFFPTTR
Sbjct: 121 SEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDLIEAAIRNPKGQLFFPTTR 180

Query: 181 LICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARIFNASIALAIPVDRLVRAA 240
           LICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARIFNASIALAIPVDRLVRAA
Sbjct: 181 LICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARIFNASIALAIPVDRLVRAA 240

Query: 241 DSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQIGILCAAGLVAIKENVQK 300
           DSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQIGILCAAGLVAIKENVQK
Sbjct: 241 DSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQIGILCAAGLVAIKENVQK 300

Query: 301 LEADHEKAKQLASGLYQIKGLKVDTKSVETNIIFFEIEDDYGISMETLCKTLEERGIFMM 360
           LEADHEKAKQLASGLYQIKGLKVD KSVETNIIFFEIEDDYGISMETLCKTLEERGIFMM
Sbjct: 301 LEADHEKAKQLASGLYQIKGLKVDPKSVETNIIFFEIEDDYGISMETLCKTLEERGIFMM 360

Query: 361 LESQTRARIVL 371
           LESQTR + V+
Sbjct: 361 LESQTRFQQVM 371

BLAST of IVF0022074 vs. NCBI nr
Match: XP_011660269.1 (probable low-specificity L-threonine aldolase 1 isoform X2 [Cucumis sativus] >XP_031745916.1 probable low-specificity L-threonine aldolase 1 isoform X2 [Cucumis sativus] >KAE8637487.1 hypothetical protein CSA_016969 [Cucumis sativus] >KAE8653610.1 hypothetical protein Csa_006916 [Cucumis sativus])

HSP 1 Score: 660 bits (1703), Expect = 1.74e-237
Identity = 331/358 (92.46%), Positives = 347/358 (96.93%), Query Frame = 0

Query: 43  MVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVP 102
           MVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEE+MAKIMGKE GLFVP
Sbjct: 1   MVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEEMAKIMGKEEGLFVP 60

Query: 103 SGTMGNLISILVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDL 162
           SGTMGNLIS+LVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDL
Sbjct: 61  SGTMGNLISVLVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDL 120

Query: 163 IEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARI 222
           IEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLS+EY DEVGELAKK+DLKLHIDGARI
Sbjct: 121 IEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSVEYIDEVGELAKKYDLKLHIDGARI 180

Query: 223 FNASIALAIPVDRLVRAADSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQ 282
           FNASIAL +PVDRLV+AADS+ VCLSKGLGAPVGSII+GSKDFI KA R+RKTLGGGMRQ
Sbjct: 181 FNASIALGVPVDRLVQAADSILVCLSKGLGAPVGSIIVGSKDFIAKARRVRKTLGGGMRQ 240

Query: 283 IGILCAAGLVAIKENVQKLEADHEKAKQLASGLYQIKGLKVDTKSVETNIIFFEIEDDYG 342
           IGILCAAGLVAIKENVQKLEADH+KAKQLASGL+QIKGLK+D KSVETNII FEIEDDYG
Sbjct: 241 IGILCAAGLVAIKENVQKLEADHKKAKQLASGLFQIKGLKIDPKSVETNIILFEIEDDYG 300

Query: 343 ISMETLCKTLEERGIFMMLESQTRARIVLHHQISTSDVQYTLSCFQQTLKGIKVVNGN 400
           ISMETLCK+LEERGIF+ML++QTRARIV HHQISTSDVQY LSCFQQTL GIKVVNGN
Sbjct: 301 ISMETLCKSLEERGIFVMLQTQTRARIVFHHQISTSDVQYILSCFQQTLNGIKVVNGN 358

BLAST of IVF0022074 vs. NCBI nr
Match: XP_031736719.1 (probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis sativus] >XP_031736720.1 probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis sativus] >XP_031736728.1 probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis sativus] >XP_031745913.1 probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis sativus] >XP_031745914.1 probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis sativus] >XP_031745915.1 probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis sativus])

HSP 1 Score: 654 bits (1686), Expect = 8.46e-235
Identity = 331/364 (90.93%), Positives = 347/364 (95.33%), Query Frame = 0

Query: 43  MVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVP 102
           MVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEE+MAKIMGKE GLFVP
Sbjct: 1   MVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEEMAKIMGKEEGLFVP 60

Query: 103 SGTMGNLISILVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDL 162
           SGTMGNLIS+LVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDL
Sbjct: 61  SGTMGNLISVLVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDL 120

Query: 163 IEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARI 222
           IEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLS+EY DEVGELAKK+DLKLHIDGARI
Sbjct: 121 IEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSVEYIDEVGELAKKYDLKLHIDGARI 180

Query: 223 FNASIALAIPVDRLVRAADSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQ 282
           FNASIAL +PVDRLV+AADS+ VCLSKGLGAPVGSII+GSKDFI KA R+RKTLGGGMRQ
Sbjct: 181 FNASIALGVPVDRLVQAADSILVCLSKGLGAPVGSIIVGSKDFIAKARRVRKTLGGGMRQ 240

Query: 283 IGILCAAGLVAIKENVQKLEADHEKAKQLASGLYQIKGLKVDTKSVETNI------IFFE 342
           IGILCAAGLVAIKENVQKLEADH+KAKQLASGL+QIKGLK+D KSVETNI      I FE
Sbjct: 241 IGILCAAGLVAIKENVQKLEADHKKAKQLASGLFQIKGLKIDPKSVETNIFVIYYQILFE 300

Query: 343 IEDDYGISMETLCKTLEERGIFMMLESQTRARIVLHHQISTSDVQYTLSCFQQTLKGIKV 400
           IEDDYGISMETLCK+LEERGIF+ML++QTRARIV HHQISTSDVQY LSCFQQTL GIKV
Sbjct: 301 IEDDYGISMETLCKSLEERGIFVMLQTQTRARIVFHHQISTSDVQYILSCFQQTLNGIKV 360

BLAST of IVF0022074 vs. TAIR 10
Match: AT1G08630.1 (threonine aldolase 1 )

HSP 1 Score: 513.1 bits (1320), Expect = 2.1e-145
Identity = 245/349 (70.20%), Positives = 302/349 (86.53%), Query Frame = 0

Query: 43  MVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVP 102
           MV R VDLRSDTVT+PT++MR AM  AEVDDDVLGYDPTA  LEE+MAK+MGKE  LFVP
Sbjct: 1   MVMRSVDLRSDTVTRPTDAMREAMCNAEVDDDVLGYDPTARRLEEEMAKMMGKEAALFVP 60

Query: 103 SGTMGNLISILVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDL 162
           SGTMGNLIS++VHC+ RGSEVI+GDN HIH+ ENGGI+TIGGVHP+TVKN++DGTMD++ 
Sbjct: 61  SGTMGNLISVMVHCDVRGSEVILGDNCHIHVYENGGISTIGGVHPKTVKNEEDGTMDLEA 120

Query: 163 IEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARI 222
           IEAAIR+PKG  F+P+TRLICLENTHANSGG+CLS+EYT++VGE+AK+H +KLHIDGAR+
Sbjct: 121 IEAAIRDPKGSTFYPSTRLICLENTHANSGGRCLSVEYTEKVGEIAKRHGVKLHIDGARL 180

Query: 223 FNASIALAIPVDRLVRAADSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQ 282
           FNASIAL +PV +LV+AADSV VCLSKGLGAPVGS+I+GS+ FI KA  +RKTLGGGMRQ
Sbjct: 181 FNASIALGVPVHKLVKAADSVQVCLSKGLGAPVGSVIVGSQSFIEKAKTVRKTLGGGMRQ 240

Query: 283 IGILCAAGLVAIKENVQKLEADHEKAKQLASGLYQIKGLKVDTKSVETNIIFFEIEDDYG 342
           IG+LCAA LVA++EN+ KL+ DH+KAK LA GL Q+KG++V+  +VETN+IF ++ED   
Sbjct: 241 IGVLCAAALVALQENLPKLQHDHKKAKLLAEGLNQMKGIRVNVAAVETNMIFMDMEDGSR 300

Query: 343 ISMETLCKTLEERGIFMMLESQTRARIVLHHQISTSDVQYTLSCFQQTL 392
           ++ E L K LEE GI ++  + +R RIV+HHQI+TSDV YTLSCFQQ +
Sbjct: 301 LTAEKLRKNLEENGILLIRGNSSRIRIVIHHQITTSDVHYTLSCFQQAM 349

BLAST of IVF0022074 vs. TAIR 10
Match: AT1G08630.3 (threonine aldolase 1 )

HSP 1 Score: 513.1 bits (1320), Expect = 2.1e-145
Identity = 245/349 (70.20%), Positives = 302/349 (86.53%), Query Frame = 0

Query: 43  MVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVP 102
           MV R VDLRSDTVT+PT++MR AM  AEVDDDVLGYDPTA  LEE+MAK+MGKE  LFVP
Sbjct: 1   MVMRSVDLRSDTVTRPTDAMREAMCNAEVDDDVLGYDPTARRLEEEMAKMMGKEAALFVP 60

Query: 103 SGTMGNLISILVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDL 162
           SGTMGNLIS++VHC+ RGSEVI+GDN HIH+ ENGGI+TIGGVHP+TVKN++DGTMD++ 
Sbjct: 61  SGTMGNLISVMVHCDVRGSEVILGDNCHIHVYENGGISTIGGVHPKTVKNEEDGTMDLEA 120

Query: 163 IEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARI 222
           IEAAIR+PKG  F+P+TRLICLENTHANSGG+CLS+EYT++VGE+AK+H +KLHIDGAR+
Sbjct: 121 IEAAIRDPKGSTFYPSTRLICLENTHANSGGRCLSVEYTEKVGEIAKRHGVKLHIDGARL 180

Query: 223 FNASIALAIPVDRLVRAADSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQ 282
           FNASIAL +PV +LV+AADSV VCLSKGLGAPVGS+I+GS+ FI KA  +RKTLGGGMRQ
Sbjct: 181 FNASIALGVPVHKLVKAADSVQVCLSKGLGAPVGSVIVGSQSFIEKAKTVRKTLGGGMRQ 240

Query: 283 IGILCAAGLVAIKENVQKLEADHEKAKQLASGLYQIKGLKVDTKSVETNIIFFEIEDDYG 342
           IG+LCAA LVA++EN+ KL+ DH+KAK LA GL Q+KG++V+  +VETN+IF ++ED   
Sbjct: 241 IGVLCAAALVALQENLPKLQHDHKKAKLLAEGLNQMKGIRVNVAAVETNMIFMDMEDGSR 300

Query: 343 ISMETLCKTLEERGIFMMLESQTRARIVLHHQISTSDVQYTLSCFQQTL 392
           ++ E L K LEE GI ++  + +R RIV+HHQI+TSDV YTLSCFQQ +
Sbjct: 301 LTAEKLRKNLEENGILLIRGNSSRIRIVIHHQITTSDVHYTLSCFQQAM 349

BLAST of IVF0022074 vs. TAIR 10
Match: AT1G08630.2 (threonine aldolase 1 )

HSP 1 Score: 513.1 bits (1320), Expect = 2.1e-145
Identity = 245/349 (70.20%), Positives = 302/349 (86.53%), Query Frame = 0

Query: 43  MVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVP 102
           MV R VDLRSDTVT+PT++MR AM  AEVDDDVLGYDPTA  LEE+MAK+MGKE  LFVP
Sbjct: 1   MVMRSVDLRSDTVTRPTDAMREAMCNAEVDDDVLGYDPTARRLEEEMAKMMGKEAALFVP 60

Query: 103 SGTMGNLISILVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDL 162
           SGTMGNLIS++VHC+ RGSEVI+GDN HIH+ ENGGI+TIGGVHP+TVKN++DGTMD++ 
Sbjct: 61  SGTMGNLISVMVHCDVRGSEVILGDNCHIHVYENGGISTIGGVHPKTVKNEEDGTMDLEA 120

Query: 163 IEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARI 222
           IEAAIR+PKG  F+P+TRLICLENTHANSGG+CLS+EYT++VGE+AK+H +KLHIDGAR+
Sbjct: 121 IEAAIRDPKGSTFYPSTRLICLENTHANSGGRCLSVEYTEKVGEIAKRHGVKLHIDGARL 180

Query: 223 FNASIALAIPVDRLVRAADSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQ 282
           FNASIAL +PV +LV+AADSV VCLSKGLGAPVGS+I+GS+ FI KA  +RKTLGGGMRQ
Sbjct: 181 FNASIALGVPVHKLVKAADSVQVCLSKGLGAPVGSVIVGSQSFIEKAKTVRKTLGGGMRQ 240

Query: 283 IGILCAAGLVAIKENVQKLEADHEKAKQLASGLYQIKGLKVDTKSVETNIIFFEIEDDYG 342
           IG+LCAA LVA++EN+ KL+ DH+KAK LA GL Q+KG++V+  +VETN+IF ++ED   
Sbjct: 241 IGVLCAAALVALQENLPKLQHDHKKAKLLAEGLNQMKGIRVNVAAVETNMIFMDMEDGSR 300

Query: 343 ISMETLCKTLEERGIFMMLESQTRARIVLHHQISTSDVQYTLSCFQQTL 392
           ++ E L K LEE GI ++  + +R RIV+HHQI+TSDV YTLSCFQQ +
Sbjct: 301 LTAEKLRKNLEENGILLIRGNSSRIRIVIHHQITTSDVHYTLSCFQQAM 349

BLAST of IVF0022074 vs. TAIR 10
Match: AT1G08630.4 (threonine aldolase 1 )

HSP 1 Score: 513.1 bits (1320), Expect = 2.1e-145
Identity = 245/349 (70.20%), Positives = 302/349 (86.53%), Query Frame = 0

Query: 43  MVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVP 102
           MV R VDLRSDTVT+PT++MR AM  AEVDDDVLGYDPTA  LEE+MAK+MGKE  LFVP
Sbjct: 1   MVMRSVDLRSDTVTRPTDAMREAMCNAEVDDDVLGYDPTARRLEEEMAKMMGKEAALFVP 60

Query: 103 SGTMGNLISILVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDL 162
           SGTMGNLIS++VHC+ RGSEVI+GDN HIH+ ENGGI+TIGGVHP+TVKN++DGTMD++ 
Sbjct: 61  SGTMGNLISVMVHCDVRGSEVILGDNCHIHVYENGGISTIGGVHPKTVKNEEDGTMDLEA 120

Query: 163 IEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARI 222
           IEAAIR+PKG  F+P+TRLICLENTHANSGG+CLS+EYT++VGE+AK+H +KLHIDGAR+
Sbjct: 121 IEAAIRDPKGSTFYPSTRLICLENTHANSGGRCLSVEYTEKVGEIAKRHGVKLHIDGARL 180

Query: 223 FNASIALAIPVDRLVRAADSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQ 282
           FNASIAL +PV +LV+AADSV VCLSKGLGAPVGS+I+GS+ FI KA  +RKTLGGGMRQ
Sbjct: 181 FNASIALGVPVHKLVKAADSVQVCLSKGLGAPVGSVIVGSQSFIEKAKTVRKTLGGGMRQ 240

Query: 283 IGILCAAGLVAIKENVQKLEADHEKAKQLASGLYQIKGLKVDTKSVETNIIFFEIEDDYG 342
           IG+LCAA LVA++EN+ KL+ DH+KAK LA GL Q+KG++V+  +VETN+IF ++ED   
Sbjct: 241 IGVLCAAALVALQENLPKLQHDHKKAKLLAEGLNQMKGIRVNVAAVETNMIFMDMEDGSR 300

Query: 343 ISMETLCKTLEERGIFMMLESQTRARIVLHHQISTSDVQYTLSCFQQTL 392
           ++ E L K LEE GI ++  + +R RIV+HHQI+TSDV YTLSCFQQ +
Sbjct: 301 LTAEKLRKNLEENGILLIRGNSSRIRIVIHHQITTSDVHYTLSCFQQAM 349

BLAST of IVF0022074 vs. TAIR 10
Match: AT1G08630.5 (threonine aldolase 1 )

HSP 1 Score: 490.3 bits (1261), Expect = 1.4e-138
Identity = 239/349 (68.48%), Positives = 292/349 (83.67%), Query Frame = 0

Query: 43  MVSRKVDLRSDTVTKPTESMRAAMAMAEVDDDVLGYDPTALELEEKMAKIMGKEGGLFVP 102
           MV R VDLRSDTVT+PT++MR AM  AEVDDDVLGYDPTA  LEE+MAK+MGKE  LFVP
Sbjct: 1   MVMRSVDLRSDTVTRPTDAMREAMCNAEVDDDVLGYDPTARRLEEEMAKMMGKEAALFVP 60

Query: 103 SGTMGNLISILVHCETRGSEVIVGDNSHIHILENGGIATIGGVHPRTVKNKDDGTMDIDL 162
           SGTMGNLIS++VHC+ RGSEVI+GDN HIH+ ENGGI+TIGGVHP+TVKN++DGTMD++ 
Sbjct: 61  SGTMGNLISVMVHCDVRGSEVILGDNCHIHVYENGGISTIGGVHPKTVKNEEDGTMDLEA 120

Query: 163 IEAAIRNPKGQLFFPTTRLICLENTHANSGGKCLSMEYTDEVGELAKKHDLKLHIDGARI 222
           IEAAIR+PKG  F+P+TRLICLENTHANSGG+CLS+EYT++VGE+AK+H +KLHIDGAR+
Sbjct: 121 IEAAIRDPKGSTFYPSTRLICLENTHANSGGRCLSVEYTEKVGEIAKRHGVKLHIDGARL 180

Query: 223 FNASIALAIPVDRLVRAADSVSVCLSKGLGAPVGSIILGSKDFITKAIRIRKTLGGGMRQ 282
           FNASIAL +PV +LV+AADSV VCLSKGLGAPVGS+I+GS+ FI KA  +RKTLGGGMRQ
Sbjct: 181 FNASIALGVPVHKLVKAADSVQVCLSKGLGAPVGSVIVGSQSFIEKAKTVRKTLGGGMRQ 240

Query: 283 IGILCAAGLVAIKENVQKLEADHEKAKQLASGLYQIKGLKVDTKSVETNIIFFEIEDDYG 342
           IG+LCAA LVA++EN+ KL+ DH+KAK LA              +VETN+IF ++ED   
Sbjct: 241 IGVLCAAALVALQENLPKLQHDHKKAKLLA--------------AVETNMIFMDMEDGSR 300

Query: 343 ISMETLCKTLEERGIFMMLESQTRARIVLHHQISTSDVQYTLSCFQQTL 392
           ++ E L K LEE GI ++  + +R RIV+HHQI+TSDV YTLSCFQQ +
Sbjct: 301 LTAEKLRKNLEENGILLIRGNSSRIRIVIHHQITTSDVHYTLSCFQQAM 335

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q8RXU4 | 2.9e-144 | 70.20 | Probable low-specificity L-threonine aldolase 1 OS=Arabidopsis thaliana OX=3702 ...
Q9FPH3 | 4.9e-136 | 68.90 | Probable low-specificity L-threonine aldolase 2 OS=Arabidopsis thaliana OX=3702 ...
O07051 | 1.6e-73 | 44.51 | L-allo-threonine aldolase OS=Aeromonas jandaei OX=650 GN=ltaA PE=1 SV=1
Q21890 | 3.5e-73 | 44.00 | Uncharacterized protein R102.4 OS=Caenorhabditis elegans OX=6239 GN=R102.4 PE=3 ...
P75823 | 3.2e-66 | 40.43 | Low specificity L-threonine aldolase OS=Escherichia coli (strain K12) OX=83333 G...

Match Name | E-value | Identity (%) | Description
A0A1S3BXZ8 | 4.0e-221 | 99.25 | probable low-specificity L-threonine aldolase 1 isoform X1 OS=Cucumis melo OX=36...
A0A1S3BWA2 | 5.5e-215 | 97.25 | probable low-specificity L-threonine aldolase 1 isoform X2 OS=Cucumis melo OX=36...
A0A1S3BWX2 | 4.1e-202 | 98.65 | probable low-specificity L-threonine aldolase 1 isoform X3 OS=Cucumis melo OX=36...
A0A6J1DWI0 | 3.5e-169 | 85.24 | probable low-specificity L-threonine aldolase 1 OS=Momordica charantia OX=3673 G...
A0A6J1EZ04 | 8.7e-168 | 83.52 | probable low-specificity L-threonine aldolase 1 OS=Cucurbita moschata OX=3662 GN...

Match Name | E-value | Identity (%) | Description
XP_008453666.1 | 8.07e-283 | 99.25 | PREDICTED: probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis m...
XP_008453667.1 | 1.01e-274 | 97.25 | PREDICTED: probable low-specificity L-threonine aldolase 1 isoform X2 [Cucumis m...
XP_008453668.1 | 2.84e-258 | 98.65 | PREDICTED: probable low-specificity L-threonine aldolase 1 isoform X3 [Cucumis m...
XP_011660269.1 | 1.74e-237 | 92.46 | probable low-specificity L-threonine aldolase 1 isoform X2 [Cucumis sativus] >XP...
XP_031736719.1 | 8.46e-235 | 90.93 | probable low-specificity L-threonine aldolase 1 isoform X1 [Cucumis sativus] >XP...

Match Name | E-value | Identity (%) | Description
AT1G08630.1 | 2.1e-145 | 70.20 | threonine aldolase 1
AT1G08630.3 | 2.1e-145 | 70.20 | threonine aldolase 1
AT1G08630.2 | 2.1e-145 | 70.20 | threonine aldolase 1
AT1G08630.4 | 2.1e-145 | 70.20 | threonine aldolase 1
AT1G08630.5 | 1.4e-138 | 68.48 | threonine aldolase 1

InterPro
Analysis Name: InterPro Annotations of Melon (IVF77) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 291..311
None | No IPR available | PIRSR | PIRSR017617-1 | PIRSR017617-1 | coord: 73..349; e-value: 7.6E-109; score: 361.1
None | No IPR available | PANTHER | PTHR48097:SF4 | L-ALLO-THREONINE ALDOLASE-LIKE PROTEIN | coord: 43..400
None | No IPR available | PANTHER | PTHR48097 | L-THREONINE ALDOLASE-RELATED | coord: 43..400
IPR001597 | Aromatic amino acid beta-eliminating lyase/threonine aldolase | PFAM | PF01212 | Beta_elim_lyase | coord: 49..335; e-value: 5.5E-98; score: 327.8
IPR023603 | Threonine aldolase | PIRSF | PIRSF017617 | Thr_aldolase | coord: 39..398; e-value: 8.0E-137; score: 453.9
IPR015421 | Pyridoxal phosphate-dependent transferase, major domain | GENE3D | 3.40.640.10 | | coord: 39..296; e-value: 1.7E-89; score: 301.2
IPR015422 | Pyridoxal phosphate-dependent transferase, small domain | GENE3D | 3.90.1150.10 | Aspartate Aminotransferase, domain 1 | coord: 297..392; e-value: 1.7E-24; score: 87.9
IPR015424 | Pyridoxal phosphate-dependent transferase | SUPERFAMILY | 53383 | PLP-dependent transferases | coord: 48..393
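
Each Alignment entry gives the matched region of the protein as `coord: start..end`. As a rough illustration, the sketch below turns such an entry into a span length. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the helper name `span_length` is hypothetical.

```python
import re

# Captures the start and end residue positions from "coord: 49..335".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(alignment_field):
    """Length in residues of one match, assuming 1-based inclusive coords."""
    m = COORD.search(alignment_field)
    if m is None:
        raise ValueError("no coord range in: %r" % alignment_field)
    start, end = int(m.group(1)), int(m.group(2))
    return end - start + 1


# Pfam PF01212 (Beta_elim_lyase) match taken from the table above.
print(span_length("coord: 49..335"))
```

Under that assumption the Pfam Beta_elim_lyase match covers 287 residues, and the two GENE3D domains (39..296 and 297..392) are adjacent and do not overlap.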

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
IVF0022074.3 | IVF0022074.3 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006520 | cellular amino acid metabolic process
molecular_function | GO:0016829 | lyase activity
molecular_function | GO:0003824 | catalytic activity