Tan0006277 (gene) Snake gourd v1

Overview
Name: Tan0006277
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Diphthamide synthase
Location: LG08: 16485856 .. 16520862 (+)
RNA-Seq Expression: Tan0006277
Synteny: Tan0006277
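
The Location field can be read programmatically. The sketch below parses the location string and computes the length of the gene span. It assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which this record does not state; the function names `parse_location` and `span_length` are hypothetical and chosen only for illustration.

```python
import re


def parse_location(text):
    """Split a location string like 'LG08: 16485856 .. 16520862 (+)'
    into (sequence id, start, end, strand)."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("LG08: 16485856 .. 16520862 (+)")
length = span_length(start, end)
print(seqid, start, end, strand, length)
```

Under the inclusive-coordinate assumption, the span on LG08 is 35,007 bp on the forward strand. If the database instead uses half-open coordinates, the length would be one less.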
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
GTTATTGTTAATCTCTAGCAGAGCCAGAACCTCCCCAAAACCCCAACGGACAAGCAGAGCAATTTCTTCTAGTTGGCGATCGGAGATGAAGGTGGTGGCTTTGGTAAGCGGCGGCAAAGACAGTTGCTTCGCCATGATGAAGTCTATTCAATACGGCCACGAGGTTTCGTTTTCCTTTTCGATTTTCCCTTTTAATTCTTCTTCGTCTTCTTCTTTTGTCTGGCTTAGGCGAACAATGTTTACTGGCTCTTACATTTTGCTTCTTCTCTCAATTTTCGTTTGTTTGATTTTCTCTTTAATGTAAACTTATGCCCTGCTTGGTTGTGTTGTTCTATGTTTGGTCTAGATTGTAGCGTTAGCAAATTTGTTGCCGGCCGATGATTCCGTGGATGAACTTGATAGTTACATGTATCAAACTGTAAGTCTTCCGGCCCTCCATATGCAAAATATGCATGCGTTCACTTATAACTTTGTAGTAAATGGTTGATTTGGTTTTTGACAGTCAACTATCGGTCTCCTTAATCATGGAAGTTTTGTTTTTGTTTTAGGTTTTCAACTCTTCAACTACAGCCCTGTCTTACATGTTGAAAAACCCCTTGTGAAAAAACAGGCACAGTTTTTTCGTTGAAATTGGGAATAAATAAATTATGTTTCCTTCTCCATTTTCAAAACTGCAATTAAAATTAAATACCTGCATTGGTTGATGGGGTTGGCTTGGTTGATCTTCCTTTCTTTTCCCCTTCTTCTCCACCAAGAATAGTTAGTATGACTCCGAGAGTTAAGTTTCTTATAAAACCCTTGACTTGACCCTTGAGCTACCTCACTCCTTTGGTTCGCTGTTGATATTTTTGATCAACCCAACGTGTTGGGTTATAAGTTTATAACCTTCTTTGCTTTGAGATGGAATCGACTCAAATTATTATTAATTAAGAAGACATCTTTTTGTTATATTAATTTTTTCATATTTATTTTTATTTAACCCCTTATAGGAGTTTGTATCTTTCGAATGTACTTGTACGTTGTAGCACTTTACTCATCAAATGAAAAGTTTGTTTCTTATTAAAAAAATTGCTTATATATACACACACATTAATTTCATTTCATACATTTTTTTTTCTATAAATTTAGATAGGAAACAGAGAAAACCATTTCATTGATGAGTGAAATAGATACAAAAGAGGTCCTATATGTGATAGTTACAAAAAATAGATAAAAGGGTTGTTAAGCCATAATTACAAAAAAGAGGAGATAAAATACTCCACGATAAAGCATGATATGAAGTTGAGTCTATGAAGGTTTGAATGTTTATGTTGCTTATCTGTGAAAACCCATGTGAAAACCCAGACATTTCTCTCAAAGCATAAACTCCATAGAAAAGCATAAATGAAATTGATCCATAGTGTCTTATTTTCTTTTTTGAACGGGTGACCATTGATTAACACATCTAGCAATGTATAGATATCCCTAGGCCACGCTGTGGACCACCCAAACTCTTGTTGAAGGAAGGACCAAAAAGCATTGGCAAGGTTTTTTTTTTTTTTAGTATATCAACAAATTGGGGGTGGGGGGATTCGAACCCATGACCTCTTGTTCATGAGTCATACTTGATGCCAGTTGAGCTACGCTCTTGTTGGCAAAGCATTGGCAAAGTTACAGTTACTGAAAAGGTGGATTTGAGACTCATTGTTTATGTAACAAAGATGACCAGGAGATTTTTTTTTTTTTTTAATATGTCAAAAGTGTTGATACATGCATGACTAAGCTCCCAAATAAAGAACGTAACTTTTTTTGGATGGTGACCCTTCCACAATTGCTTATAAAATGATGTTTTATCGAAGTTTCCAGTAGATGCAAGCCTTTTAGTTTGTCAACTCGTGGAGAAATGCCCATCCTTTTCCAATGCCCAAACCCATCTATCCTCTTGGTCTTTGAAAATAAACGAGGACAATAACTGAGATAAGGAACCCCATTCTTCAATTTCTATTTCGTTGAGGTTTCTT
CTTGAAAAGAGATTTCATGATCCAGCCAGTTGATTCCAATGATTAGCAACATGCAAGTCTTTCTTTAGAGATAAAAGGTAAAGAGATGGGAATTTTGACAGGAAAGATTGAGTTCCCAACCAAACGTCTTTCCAGAAATGGGTTTTGTCTCCTTTCCCAACAATGAATTCAATGTTGCTGGAAACCAAATTTTGCAAGTCCGGAATGGATTTCCAAGGTCCCTTAGCAATTTTCAGAGACATACTGCTGGCTTGCAATTGTAGTGGCTAGTACCATATTTTGCTGCAATCACCCTTCTCCAAAGAGCTTCTTTTTCGATGTAAAACCTCCAATTCCATTTCGCCAAAAGGGAGATGTTATGTATGTGTATATCACCTTCCTCAATAGGTCCTTGGAATGTGGACCACTTTAGGAGATGCCTTCCCTTTTTATCGATACCACCCCTCTATAGAAAATCCCTATACAACTTCTCAATAGATTTTGTGACTTTGGATGGAGCAAGAAAAATGGAGAAATAACATGTGGGTAAATTAGAGAGTGTAGCTTGGATGAGTGTGTGTCTTCCTCCTTTTGAAATATGAGAATTTCCCCAATTATGCAACCGATTCTCAATTTTCTCAATGACTGGACTCCAATAACTAGTCGTACGTGGGTTGCCTTTCAACGGTAGACATAAGTATGTAGAGGGCAGTTGCCCACCCTACATACAAAGATAGAGGCACAACCAACTTTCTGGGCAGCGATATCGATCCCAAAAATTATGATTTCTGACCATTTATGTTTAGCCCAGATGCTTCCTCAAAATCATGAATGACTTGAAAAAGATTCTGAAGCATTGCTTCATTGCGTGATGAGAAAAGGATGCCCTGCAAAATCATTAACGACCTGCAGATTGGAGGTGAGTGATTGAAACTGTGTATGGAGCTTTACCAACATTAAACCCCTCAATCAATCCTTTTTGTTCTGCCAGTAATAGCATCCTACTAAGTTAGTTGACCACAAGAATGAATACGAAAGGAGAAAGGGGATCTCCTTGACGAAGGTCTCTAGTTGCTTTAAATTTACCCCTACACATTAATAATAATGGAGAAGTTTGTTAAAGAGATGCATCCTCTAATCCATATGCGCCATGTTCGTCCGAAGCCTTTGGCTCGAAGAATATTCTTGAGAAAATCCCAATCCACCATATTTTTTAGACAAAGTGACAATGGAATTTCATTGAGATAATGAAAGAGACTAATGCTCCAGTTACAACTTCAACCATTAATCAAAAGAACCTATAAAAGACATTTGCGCCAAAAGCAAAAACTGAAAAATAAAGGTCCCAACCCAACATATTCGGTCCCATAGAATGCCCAAGAGAACCACCCTCAAAGCACCAAACTAAGGCGTAAAATAGCTATCCGTGGCCAAGGAACTGTGGCAAAACAGAAAACAACAACCCTATTCAAACCTCTGCCCTAGTCCAAACAAACGAAACTACATATTAAGTAAGAAGGGCTGCCAAGAAAAAGTCAAGTCTGAAGAAAAAGTCAAGAAAAAGTCAAGAAGGGCTCTCTAAATCCCAATCCACCATATCAAACGCCTTTTCAATGTCAAGTTTAATGACCACTCCACTATTCTTCTTTCGGTTCCATTCATCGATGATCTCATTGGCTTTAAGGGAGGCATCAAGGATTTGGCGATTGGCTATGAAAGCAAATTGATGGTCTGTAATTGAGTATGGGAGGACACATTTCAATCTTTCTGAGAGCACTCTAGCAATAATCTAGTGGAGACAAGTTGTGAGACTAATTGGCCTGTAATCACTCACTTTTTTGGCACCAATCTTTTTTGGGATGAGGCAAACGTATGTTTCATTTATGTTTGCATTGATGATCCCATTCTTAAAAAAATCATGGAACACCCCAATAATATCTTCCTTAAGGATGTTCCATGATTTTTTAAAGAATTCTGCCGTGAATCCATCTGATCCCTAGAGTCTTGTTACTGCCCATATC
AGTGATTGCTTTGTAAATCTCAGCTTTTGTAAAAGGTATTTCCCGTGGGTCCTTTTGCTCAGAAGAAATGGGACTCCAATCATCAATCTCAAGTAGTGTTTTAATATACCTCTTCTTTTAGTAAACGGATTAGGGTAGAAATTCAAGAATTCCCCCTCAATATCTTCATCATTCACCAAACGGTCCCCATCTGCAGGAAGGACTTCCATAGTTGTACTTTTCCTTCTGTTTGCTGCCACAATCCTATGGAAGAAAGCAGTATTTCCTCGACAAGCCATTTAGTTTTGCAGCTTTGTCCCCACATGGCTTCTTCATTAGCTGAGATAGTAAGCAATTCCCATTTTATGGTGAGTCTTCGCTCTAAGTCATCATTAGATAAAAGGGTGGATTCCTCTTTTTCATCAATGACCATAAGCCCATGGTGGAGTTTGGCTTTAAGCTCCTTCTGATGGCCAAAAATATTATGATTCCACAACCTCACCTCCTTCTTTAAGCCTTTGAGTTTTTGGATGAAATCATGTCCTAGCCAGCCTTGTATAGGATTTTTCAACCACCAAGAATCAATCATTCTATCAAAGGTGTGATGGTTTAGCCAAGCATTAAAAAACCTAAAGGGAGTAGGACCCCAGGATTCCTTTCCTAGAGTGAGACAAATGGGGTAATGGTCAGAAGTAATTCTCTCAAGTCTACGAGCAGAGACACTAACAAATTTTTTAATGATCCCATCCCGTAGCAAAAATCTGTCTATAGGACTCATCGTAGCTTCAATGCGCATATTGGACCTAGTGAATTTTCCATTTGTGAGGGGGATGTCTGTAAGTTCAAAAGAGCTAACGAAATGGTTGAATTTCTTCATTCCTCGTGTAGTGACAACACTAGAAGATTTCTCCCAAGACCACCTAGTAATATTAAATTCTCCACCCAAAATCCAATTAGGTTGACAAAGGCTCGATAAATCTTTTAATTCAGTCCATAAGAATTTCCGCTCAGTAGATGAATTTGGCCCATATACCCCTGTAATCCAAAAAAAGTAATTGTCAGCAAGAGAGAAATGGATAGATTAGGAATAGATACCTTCCACCACTTCCAAGATATTGAAGGACTCCTTCCATATAATAACAATTCCTCTCGAGGTCCCCACTACATCAATCGATGCCCAAGCAATGTTGCGTGAGCTCTAAAGTGATTTGATAAATTTTTGGTCCATGGCTTGAGATTTGGTTTCTTGTAACATGATGAGGGTTGAATTATGAAGTCGAATGAAGTCCTTGATAAAAATTCTCTTTTTCCAAGAGCCAAAACCCCTAACATTCCAAGAGATAAAAATCATCTAAGGGAAATCGTCCCCTCTGCCTTGTTTTCAATTCTATTTTTTATCTCTCTGGCCCATGTTTTTCCTCTTTGCAACTGTGGAATTCATTTTCTAGGCTGGTAAAGGCATATTACATAAACCATTTTTAATCAACCAAGGAAAAATAATAGATAGATATATGGTTGGGTCTTCATTTTGTGATTCCTCTTCCTTGTCCATGTCACTGACATCGATGATAGTGATGTCCCTTGGGCAATCAATAATGCCAAGATGAGGGTTCATCTCACTGGGCTCAAGAGTGGAATCATCAATTAATAAAGGGGAAGGATTAAGTATGGGGTTTAGTTGGTCCGGGGGAGGTTGAAGAGGTTGGATAAAAACATTAGTGAGTTTGTGGGAGATTTTGGCAGAGGTCAGGTCAATGGAGGAGAGTCTTCAATAGATTGGGAGCATGGTCCCAGTGATGAGCATAGTTTGCTTGTTATTATTGAGGATGGGTTTTTTGGATGATGAAGGTAGGGGATAAGTAAGATCTGATGGGGGGGGGGGTGTTGAATCAGGGTTAGAAAAAATAATTTTCTTCTGTTTTGGACAGTGGTCAACGGTTGATAATGGTTGAGAGAGGGGAATTTCGGGATTGGTGTGAGATAAATGATGAGAGTGGGTCCCATTAAGGTTGGCATTTTGT
ATGTATTTAGGGGAAGTCATAAACTTTTTGGATTGAGATAAAGGATATTGGTGGATCCCACTATGTTTGGTCTTTTGAGGATTAAGAGAAACAATGGGTTGCGTCTGGGCTTGGGTAAGGTGTGGGCCCTGTTCTGATAAAGACAAAGAAGGAGGCTTATTGATTGCAACTGACTTTTGTACCTGATAAGCTGAAAAGGGGTTGCTCGATGGTACAGTGCATTGAAACCTGCCAAACTCTCATTGTTCTCATCAATTTGTTCAAGGGAGGCGCTGTTTTCACCTTCTTCACTATGCGGGGTGGGTAGCTTCCCATGGATTCTGGCCACAAAACCAATGTAATTTTCTTCCATAAAAAAGGGGTCGATTTTAACCGTCAAGGGCAAGGAAGAGGATGATGGCAAGTGTACCTCACCAGGGGTGAAACAATTGTGATTTTCTTCACTTTAATACTAACCTCCATCATATCCATTCTAGACAAAGTCTTATTTGCTGTCTCCCGATAACTCCCACATTTGTCCCCAATTTTCTTGAAAGTCTCTATGGACCCTTTGTCCAATGGAAGGTTCCTAAGCTTTATCTATCCCCCATAAGGCACTTGGTTCACCAAGCATATTTTCTGCACTCCAAGGCCGAAAATGAACATTGAACTTCCCTACCTTGTACCACCCTGATATATTAGCCGTAATTCGAGCTTGTTCTGCATCTACACATAAGAGAAGGGCCCTATCCGACTGTAATGGACTAACCGAGGAAAAGTCAGAAAAGTTTTGCTGAAGAGCTCGCATTATATCATGCCAGTTGTCGTGGAAATGCTCTCGCTGGAGGATGACCATAGAAGAAAGCTTTAAGGTTGTGTAAGGTGAGGGAGCGGGATCTCTGATCGGCTTGTTCTATGGGACCTGTTCTCTGTTGTTGTTATCACCTGTTATCTTCAGCTTTAAAGCTTTCTTGTATGCTAAAACTTCCTTTCTATTTCCACCCCCGGGAGGATAAGTTTATTTAAACCCCCATGTGCTCCTAGCTTAGCTACTTCAATGTAGTGACCTTTTTTGTTTGATATCTTCTCAACCAATAGACAATCTCTCCATCTCGTGTTTCATTGAAGAATTTTTGTGTCAATGGGAGCTCACAAAGTCGGGTGAAACACTCATGTATCCCGCTCAAAATCTCTCCGCTCAAGGTGATTGAAAAACTTCTCTCATTTGTGGCTTCCGTGATGCAAAAAAGGTTTGGGTTTCTGGGGTCGTTTGCAATGGAGAAGGTTTTCCTCTCTATCTTGAAGGAGGAGGGCGAAGCCATGGTGGTAGCCGATCAGTTGGCCGGAGAGTCATAACTTAAGGAAAGGGTGGAGAAGAGACATGGTTCTGATACTTTACTGTTTTCTGTTCCAAATATGATCCTTGAGCTTATACATACACAAGTCAGACATTTAAGAAACATAAAATTGTTGATACGATCCCATGTCTCAATCACGAACTTGCTCAACAGTAAAATTGGAGAAATTCTTAGGCCGACGACGAATATTTGAGATGCCGGCTGTAACAGACAGAGAGAGAGAGTGAGGTACTGGCACATTTCTCTTTCTAATAGCTTCTGATTCCTTCTCTTTTACATCTCTCTCTCAATTTTCTCTCTTTCCTTTCCTTCTCTCTAAAATTTCTCCCATTTCATACATTCTCTTTTAGATATTTTACTCTGCAATTTCAAAAAATATAAATTCCTAAATCTTTGAAATTTTGTTTTTCATACATGACTTTGTAACTACCAATGTCTGAAAATTAGAAGATAAATGCTGTTAAAAGAAAAAAAAGAAAAGAAAATTAGAAGATAAATATTGCAGTATTAATACTTTTTGTATTTTTTACATAATCGTGAATAACTAGTTTGGGGACCATCCTGACCAACCTAACCAAAATATTTTGTTGTTGCATTGAGTATATCATTTAATAAGAAGTCTTTAAGTTAGAAAAAGTTGCAACTTGAATAATGGGTTGGGCT
CAGAGTTTATCTTCTGGTTGTTGGAGGTTTGGATGGTGCGGCTTGAGTCTGCTCCTTTGAGGGGCTTCTCTTGGAGTTTAAGTGTTATTGGTTTCTCTTTCTCAGTTATTTGTTGATTTGGGCCATTGTTCTTTCTTCGGTTCTTGAAAGTCTGTATGGAAAGTCTTCTTCTGTTATTGAGTTTTTCTTGTTTGTTTTTGGAGAGATGTTGTTTTTCTTTGGTTCTACTTCTACCTTTAAGTTTGGTCTCATGTTGTTTTGGATATTGTTTGTTCTTTTCTTTTTGTTGTATTTTGAGCATCAGTCTCATTTTATTTTATCAATGAAATTTCGTTTGTTTCCTTGTCAAAAAATATATATATTATCATTTTAAAAGTAATAAAATATTATCATTTGAAAAGTAATCGAGGACTTGTAATCTATGATTGCTAATTTGAGTTATTGGCTATAAATCTTCAGGTTGGACATCAAATTATTGTTAGCTACGCAGAATGCATGGGGATTCCATTGTTTAGAAGGCGAATTCAAGGATCCACAAGGCAAGCCTATTAAAAAAAGAAATTCTTTGGTAAAAGACATCAGCTTGGTGTGTGTGTGTTCATTTCATTAAAAAGTCAGTCACAAGCATTGACAATTATTTTTTGTGCAAATGGCTAGCAGGAGGATCTTGTCGGTGTGTAGTGGTCACCCTCTTTTTCTGATTAAGGGAATCTATTTAGCAAGTTTTTTTTTTCTTTTTTTCTATCTTCTCTTTTCTGTTTTTTTTTTGGCAATTTTATGGACCATTTGGTCAAGAGAATTAGTAGTGTTCTTAGAGGTGTGGAGAACTTTTGGGGGTGGTTTGGGCCTTTTAATGCCTTCATTTGGGTTTTGGCAACTTATGAGTTAGGAGTTTAGTTCTTTTCCTTTGTCTGCAGCTCTTCTTGTAATAGCTATTTGGTTGTTGTTTTTTTGCCTTTGTATGTCATCATGTCCTGGACGTCCCTTTTCTGTACTTCAATTCCTTCCTTCTATTTTTATTGTATGAAACCTGTTTTTTCATTGAAAAAAGTTGTTTGTGTGAAAAGATCACATCATCCATTCATGTAGTTGTGTGATTCACTGCATCAGGTTATTTGCGCAAATTGGGAGGCTTTTATTCCTCCCTCATAAGAAGTTCTGCATCCCTAGTTTTTTGTTTTCATTTTTTCACTCTTTTGGGAGTTTGTATCATTCATTGAACATAATTGTTCCTTTTCATTAATCGAAAAGTTTGTATCTTGTTAAAAATAGATACAATGCATTAGGTCCCCCCCTCTTTCTAACTGTTATTATTTTCTTGAATTATTTTAGGCATCAGAAGCTTAACTATAGAATAACTCCAGGTGATGAAGTGGAAGATATGTATATTTTGTTAAAGGAAGTGAAGAGGAAATTACCTAGTGTCACCGCAGTATGCTCTGGTGCCATTGCATCTGATTATCAAAGATTGCGGGTAGAGAGTGTTTGTTCAAGGCTAGGACTTGTTTCCCTAGCATATTTGTGGAAACAAGATCAATCATTGCTTCTACATGAAATGGTAAATAACATTTATATAATTTATAAAATGGACATGCAACATTTGAATTCTACTCGTTGAAGTCTTTCCCCTCTTCTTTTGAACTAGATAAACAATGGAATTTTGGCCATCACGGTGAAGGTATAACTCTCTCTCTCTCTCTGTTGTGTGTGTGTGTGGGGGGTGGAGAGGGTGATGTTTTGACAAGACGAAACTCTTGCATTAAATTATACATGCATGCTCTGTCTAATCATTCGTAAAGATATTTTTTATCATTAACCTATTTTTGTGCTATGACCAGATATGATTGAATAATTAAATTCATTTTTATAATGATAATGCATGCATTAGTATGTTTATCTTCTTTTAACATGGACGGCGTTCGACCAAAATAAATGTTTGGAGCATTATTTTATTCTGGAAGTTTGAATTATTCTTTTTTGTCTCTAAAAAATCTAAAGTG
AGTAGGGATATGCATCCATTTGCAAATGCACAAAATTGAACTTTTGAACCCGTAGTTAAACGTATTTTCTCCATTGCCTACCTGACTATCTAAGTTGTAATTACGGGCAATTAATCTACTTTGTAATTCTTTTTTGAGGTGGTGTGGGAACGGTCGATATTTGTATTGAGGCACTGAAAAGATTACAAGATGAAGGAGTGGGAGCTCTCCTATTGATGTACTAGGGTTAGTGATTAATTAGATGTCTTGGTCAATTCCTTAGTTAATTATGATTAAAAGTTTGTTTAATTCCTTGATTAATTAGGATTAGGATTAGTTAGTTTAATTCCTTATTTAATTAGGAATAGGATTAGTTTGTTTTCAGTTCTCTATAAATAGAGAGTTATTCTCTTGTATTCAATTACTTTTTAATTCATAATCAACATTTGATTGATTTTGGGAGAGAATTCTCCTTTTATCCTCTTAGGCTACACCGCCTACCTTCCCAAGATCAAACTATTAGAACATTAATCCAATGTTTGCCAACCACCCGGCACCTCCTAACTAGCGCTATTTAAACCTAGAGCCATCAGTACTAAAGAAATTCTGCAGAGGCGTATACCTTTTTTGGCCGTGTCCCCAAGCTGGTGTTCTTTATGCAAGCAAAATGAGGAAACATTGGACCATTTATTCATACACTGCCTTGGAGCACAATCGTTACATCTTTTGGTTGGCATCTGGCCATTCCTTGGGATGTTAAGGATCTTCTTGCCACTGTCTTGTTGGGTCATTTTTTTAAGGATGCTAAAGCTAGGTCGTGGAAACACATCATCAGGGCCTTTCTTTGGGTTTTTTGGAGGGAATGTAACGATAGAGGTTTTTTTGGACATTAGTGAATTCTGTCTCAGTGGCATTCAAGATAGAGGTTTTTTTGGACAAGGAGCAACCCTTTAACGGGCTCTTTGATCTTGATACTTATTATGCTATTTCTTGGTTTAAATTATCTAATCACTTTGCTTCCTATGGTTATGCTTCCCTCTAACCCATTGGGAGAGGCGGAGAGCCTTTTGTCACCTAGGGTTCTCTATCCCTTTTGTCAATTTCATTACATCAATGAAATTGTTTTTTTTAATAAAAAACAAACTATTTAAACCTCTTTCCGTACCGTTACAAAGTTTACTTCTGCCCTACTCACTGTTTCGAACTCTTAACTATCCCGTATTATTATACACAATAAAATGAAATGTATAAAGGGGGAGAACCTACCATAAGGAAGTTACTAAACAGTCTTTGATAGGATTTCGATCGTGAGAATGGAAGTAAGGAGATTGCATTGCAATATAAAAGAGAATATATTGGGATGGTCTTTTGAGAGGGCTCTTTCCCTCTCATAATTAACCAATTCCCAAAATTATACTTTTGCCCCCAACTCCCTGATGGTTTTACGCTTTTCAAAATGCTTGAGCTGTGATGAAAACGCAGTGCATGATTATCTGAGGGTGTCCTTAGGCGTGCGCCTGTCTAATGCTGTGCACAACAGAGATGCTATTTGAGCAAAAATAAGAGAGATAATCTTTTCTCAATATTTCATTAAGTGACATTAGGGTTTCTTAAATAGAAGAAATACCCAATGTACACATAAGGAAATAAATTTTCAGCAATATATTACATTAAGTACAAATATACTATTGAGCTAAAGAAGGAAAAAAATCCAAAATAACCAACACTTCCCCTCAAACTGGCTTGAAGATATCTTCCATGGCTACCTTGTTGATCATCTTGTAAAACTGCAACTTAGGAAGCCCTTTTGTCAATACATTAGCAATTTGCTCTGTAGTAGGAAGGTAGAGAATGCAGATTACTCCTGAATCGATTTTCTCCTTTATAAAATGTTTATCAACCTCAATATGTTTTGTCCTATCGTGAAGTACTGGATTGTGAGTTATGGAAATTGCTGCCTTGTTGTCGCAATAGATACATATAGGTGTACTCTGAGAAGACTTCAATTCTTCCAACAATCT
CTTTATCCATATGCCTTCACGAATGCTATGGGCTAAGGCCCTAAATTCAGCTTCAACACTACTTCTAGCTACTATACTCTGTTTCTTCTTCGTCAGGTAACAAGATTTTTCCTTACAAAGGAACAGTAACTAGAAGTAGATCTTATATCTGTTGTACTGCCTGCCCAGTCAACATCTATATATACTTCAACATGCAGATGACCATGTTTCTTAAACAATATTCGTTTCCTAGGAGTACCTTTTAGGTATCTCAAAATTCTATAGATAGTTTCAAAATGAGTAGGTCCTGGGACATGCACAAACTGACTTACCATACTAACTGCAAATGCAATGTCAGGACGTGTGTGGACAGGTAAATAAGTCTTCCCACAAGTCCCTGATATTTTTCATTATCTTTTACTTCTTTTTCTATTGCAACTTGCAATTTTAAATTAGGCTCAATTGGAGTTTCTGATGTTCTGCAACCCAACAATCCTGTCTCTTCAAGTAAGTCAATAATATACTTAATTTGATTGATCAGAATACCATCCTTCGATCTAACAAATTCCATTCCTAGGAAATATTTTAGGGCTCCTAGATCTTTGATTTGAAATTCATTGGCAAGTTTTTCTTTCAAGGTAGTTAGGCTTGCCTCGTCATCTCCTGTAAGAATAATGCCATCAACATAAACTATTAGAATGACAATCTTGTCATTTTCAGTATGTTTATAGAAGATAGTTCGGTCTGCTTGACTTTGGAGGAAACCATAGCTCGTGACTGCTTTGGCAAACCATTCAAACCAAGCTCTAGGAGATTGTTTTAGGCCATATAAGGATTTCTTCAATCTGTATATTTTGTTAATCCCAAAATCTTCCTCAAAACCAGGGGGCAAATTCATAAATACCGCTTCTTCAAGTTCACCATTGAGAAAAGCATTCTTGACATTAAGTTGATAAAGAGGTCAATCTGCATTAACAACAATAGATAACAGAACTCTAATGGAGTTAATTTTAGCCACAGGAGCAAATGTTTCTTGATAATCAACAATAGATAACAGAACTCTAATGGAGTTAATTTTAGCCACAGGAGCAAATGTTTCTTGATAATCAATTCCATAAGTCTGAGTGAAGTCTTTAGCAACACGCCTGACTTTATATCTTTCAACCCTCCCGTCAGCTTTGCACTTTATATTGAACACCCATTTGCACCCGACTGTTTTCTTATCCTTTGGTAATTCTACTATGTCCCATGTCTGATTTTGTTCAAGAGTATTCATCTCTTCCATCACTGCTAATTTCCAATTCAAGTCATTTTGTGCTTCCTGTATATTCCTGGGAACAACCAAATTTGTCATCCTAGAGACAAAAGTTCTATGACTATGGTAAGACAGGTAATTGACAATAGGGTATTTGGTGCAGTTTCGAGTACCTTTTCTATGGGCTATTGGAATGTCAAGATCAGATGTATTAGATAAACAGTTATGGATAGATGAAGAAATAGGAATCGAAGGTATGGCACCTGGATTTTCAAAATCATTCATCGGAGCATTGGATTGGTCTCGTGCTAAGTCAACTATCTGGTCTTGGTTCCTTTGATGTATTGCCCTTCTAGTATAAACCTGAAGTTCAGGATTTCGATCACTTTGTAGTGTTTCTCCCCCTGAAGGAGTATTTTCTGTGCTTGGCATGGAACTAGAAGTAGTAACCTCAGGGTAAATGATATTAGGAAGAAGAGAAGTATCCCAAAAATTATCTTCTAAGTTAGAGATCTCCCCCAGTAGAGAAGTTTTGGTGAAAAAAGATTGATTTTCCAAAAAGGACACATCCATACTTTCAAAATATTTTTTAGTCACTGGATCAAAACATTTATAGGCCTTCTTTTGTGATACATAACCTAAGAAGATGCATTTGACAACCCTAGGATCAAGTTTAGACCTAGAAAGATTAGAAGTGTTAACATATGCAGTACAACCAAACACATTACGACAATTAGCATCTTTAGAAATTTTGCTAACTGATAAT
AAGTTACAAGTAAGTTTTGGAACGTGAAGTACAAAATGTAAAGTAATGTGTGAAGTTAAGGTAATAGTTCCTTTTCTAGCAATAGAAACTACCCTCGACAATTCTAATTTTTCATTACAATACATAGGAGAGTATGATTAAAAAAGGTTGGAGAAACTAGTCATATGATTGGAATCTCCAGAATTTATAATCCACGGAGATGAATTGAGAGAAGAAAGAGCTTGAGGATAGTTGCCTGTTTGTGCTAAGAAAACACTAGGATTATCGGATGATGAATTGTTTTTTAGCAGTTTCAGGAGTTGATCGACTTGCTCCTTACTGAATGGACTGGATTCAACAACATTAGCACTCGAGGAATGTTGATTTATGGCCTTCTCACCTTGCCTAGTACTTTTTCAATTTGCAGGTTTTCCATGTAGCTTCCAACACGTCTCACGAGTATGTCGGGGTTTGTTGCAGTATTCACACCATACATGCTAGGGGTGTGCACGGTTCGGTTCGACCCGATTTTGGATGAAACCAATGACCGAACCGAACATCTCGGTTCTAAAAGACCTCAAACCGAAAAAAACCGAACCGGTCCATTTCGGTTTCACCGGTTCGGTTTGGTTTTTAACTGACTTTTATCATATCAGCAAACTAACCACACAACCTGCGAAAAACAAAGAAAAAAATCAAGAAAATACACACCAAAACAACCTTGAATGAAAAACCAAAACAATCCTTCACCACTACTTTGCAAAACCAAAACAATCCTTAAAAAAAGAATTTCAAAAGAAATCAAACCAAACATTTATTTTTAACTTTCATCTTGGAGATGATGGAGATGAAGATGGAGATGAAGAATGGTAGAGGGCCGACCATGAAGAAACGAGAAACGGTGCAGTGAGAGAGAAAGAGAGAGGAGAGGGAAGGAAGAGAAACGGGGAGAGAGAAGAGGGAAGGAAGGAGAGGGAAGGAAGAGAAACGGGAAGGAGAAAAAAGGTGTAGTCTGACTCTGTGAGAGAGAGAATGAGAGAGGAGAGGGAAAGTGGTGATAAAATTAGGTTAGGATGCTTAAATACCTAGGGTTAGAGATAGGTTTGGGCTTTTGTGTATTGGGCCTTCATTAAGAAAAGGGTTAAAATGGATTGAGTTTTTTTTTAAATATATATTTTTTATTTTATTTAAAATGGTCGGTTCGGTTCGGTTTTTTCGGTTTTCAAAAGAGTGAAACCGTAAACCGAACCAAAAATATCGGTTTTTTAGAAGAATAAACCGAAGGACTCAAAAAAAAATATAAAACCGGACCGAACCGGTCGGTTCGGTTTGCCGGTTCGACTCGGTTTTTGCACACCCCTAATACATGCGGCCTTTCATGGATCTTTTTTGAATGATCAGAGGCCTTTTGAGCAGTAGCCTCAACCAACAAACTAGAATTTTCAACTAAATCAATTGGTTCTTACCAATCATAACATTCCGACGACTCTCTTCCCTGCGTACTTAAAAAAAACATCATTAATAGTCGGGAGGTTAGTTTACCAAGTATTCTTCCTCTAACCTCATCAAATTCAATATATAGGCCAACTAGGAATTTATAAATACGACCATCTTCAACTATTTTTCGGTAGTGCTTTTGATCCTCGATAGACTTCCATTCATATGTATCAAATAAGTCCAGATCTTGCCAAATCCTTTTGAGAGAGTGAAAGTACTGAGTAACAAAATTACCTCCTTGTCGTATGTCACTCAATTTCAGATTTAGCTCAAACACCTGTGATTTGTTGCCCAAATCTGAATACATTTCAATCACGCTATCCTAAAGCTCCTTTGCCGTAGTATAACACATATAGTTACAGCTTATGTCTTCGACCATAGAATTGACTAACCAAGTCATCACCATGGAATTTTCAGCATTCCATCCGGCAAATGAAGGGTCGTCTTTGGTTGGTGCCCTTTTATCTCCAGTAAGGTGCCGATCTTTCCTTGTCCTCGAATGTACATTCGTACACTTTGGGACC
AACAAAGAAAATTTTCGCCATTCAGTCGAATGGTGGTGATTTGGACAATATTGGAAAGAACTCGCTTATCTCGATACTCTCCCAATCGAAATTTTAAAACTTGATGCTTCCTTAAAAAAGGTTAGTACCTTTCAAAAGACCCCCAAAAAAAAAAAAAAATAGGACACACACAAGGACTCGAGGGCTTCGACTAAAACAACGACGCTTCTCTCTGGCGATGCAGACAACGACGACGGTCAGCGGTGTAGACAACGGCGACGTTCGGTGCTGCTAAACGGCGGCAACGGGTTCAAAAGCAGTTTAATCGGCGGCGGAATACACGCGAAAGAAGGGAGCCCAGTGGTCTTCGATCCTTTATGCAGTCGATAAGGTCCCCCTCGAGTTCACAGAAACTAGGAGAAAATGCACTGGGAAGTCGATAAGGTTAAAGGCCTGCAACTAAGGGTAAGACAGCTATGGTTTTTTTTTTTTAAATTTTTTTTTGAAAGAATTAGGGTTTATTGCTCTGATACGATGAGCAAAAATAAGAGAGAAATAATCTTTTCTCAATATTTCATTAAGTGAGATTAGGGTTTCTTAAATAGAAGAAATACCCATGTACACATAAGGAAATAAATTTCCAACAATATATTATATTAAGTACAAATATACTATTGATCTAAAGAAGGAAAAAATCCAAAATAACCAACACTATTGTATCAAGATATTGGAGTTTATTCTCCATTTTGATGATAAAATTGATGGCCTTTCTGATGGTTATTTTTATATATATCATCAAGTAAAATAAATCCCTTTTATTTATTTTGATGGCTTGAAATTACTCTGGTCTACGTAGGTTGCAGCAATGGGATTGGATCCTGTGAAGCACTTGGGCAAAGAATTAACATTTTTGGATTCAGATCTACATAAGTTGAACAAGTAATTAATATGTTATGTTAGCTTTTGAGTGGTAGTTTGGCAGGCACCATTAGCACAACTGTTATCTTTTTCGTTCGTATCGGCAGATTATATGGGATCAATGTATGTGGTGAAGGTGGGGAATATGAAACGTTGACCCTTGATTGCCCTCTTTTTAAGGTATGTTCTGAGTTTTTATCCTATGCAATGTTGGTTAACAGGTGGTTGTATGTTTCCATTTTTTCTTAAGCTATTTTAATTTTTTCTTTATTAGGAAATTGCTGTAATACCAAGGAAAGACAATTTTGTTGATAATAATGCATCATATAGCCTAATCGCAAGTCCTTATAGGGGATTTGAAGTCCAAAACAGCCTTCTAACCAAGGTTGTTAACTCGTGAAACGGAACGAGTATCGAAATTCAATTTTAATGTATCGTGTATCGTAACGTATCGTGTATCGTAAGTTTCAGTTTAGTAAAGAATATTGAAATATAAATCTAAATAACAAAGAATGAAATAAAATTCCAAATATTAAATAATTCCAAAACAAGATCAAAACATAAACTAACTTTAAAGTTCAAAATACTCAAGAAAATTAAACAACAATAGTGGTTGTTATTCTTTCTTTCTTTTTTAATGGAAAGTTGGTTGTGCTTTATTTATAATAAAAGAAAAAACTTTTCATATTCTTTATAATCTATAATGAAGACAAAAAATAAAAATAAACAAATAATAATTTAAAATAAAAAATAACCTATTTAGAATATTTTTCTTTTAATATAAACGTGTAGAGGAAAAAATATGTTTAATAATATATTTTAAAATAATATTTATTTATTAATGGTCATCATATCACTTCCTTTTTTCAATGTGAAATGACTTGCCATCATTTCATTTATATTTTCAATGCCATTATTTCATCTCTCTCTCTCTCTTCACATTTAGAACCTTTAGGAATTCCCAACAACACAACATTACTTTCTCATTCTTTTTCTCTTTCTTCTTTAACACTTTTTTCGAAAACTTCTTCTCTAACCTTCAAACTTCCCTCTTTCTAAAACTTCTTCTTCAACCTTCAAACTTCCTTCTTCAACCTTAGATT
TCCAGAATTCCAGTCTTCTTCTTCTCATGCGGCTGCTACAAGACTACAAGAGCACGTCTTCTTCTCCTACTATCGCGAATCACGGTATTTTTTTTTTTGAATCGGTTTGTTACGATTTTAAGCGATACATACGATTTTATTTTCCGAATCGTACGATACGTCCCATTTACGATTCAAAATCACGAGTTGAATCGATTTTGGTCCGTTTCGAGTTGAATCGCATTTATTCAACTCGTGAAACGCATTTATTTTAACAACATTGCTTCTAACCACTTTCTAATGGCTGGAAAAGTTAACAGAAAGTTAAATAGCCTTCTCACTGTGATCAATATTGAGAAAAATACAGCAAGAGAGTAACTTCGATTCAAGGATTGTTGATTTAATCCTCATACGAGGGGGAATTTTCCATTAGATAATGTGTTGTGTATCAAACAATAAGCCACTACCTATCAAAAAAAATGTATAAAAAAAGAGCAGCGAATGAGTGATTTTTGGTGTACTTTCTATCAACATTGTTGTGATTCGGTCAATAGATTTTATACACTAAAGAGTGTACTGACTTGGTGAGTACTTTTTTCTATCAACATAGTTATGAATCTGTCAATAGATTTGATACACTTCCCAATTTTATACACTAAAGAGTGTACTGACTTGGTGAGTACTTTTTTTAAATTCTGCGAACTGATAATGTTGGTGAATATTTTTCTTTGGGATGGCAAAAATACCCACGGGGTTGGGTCCCCGCGGGTCCCCACCCCAAATGGGGTGGAGAATCCCCAGCTTGACCGGGTATGGGGTCAAACTGGGGATTTTTGTCCGGTCTGCGATAGGAACGGTATCCTTGCCCCGTCCCGCATAATTTTCTTTCAGTTGCTCGTTTGAAAGCCTGCCTTGTTGCTAAAGTATGCTCAGACTTATGGGATAGATTATTCTAATACATTTTCTCCAGTTGCTAAATTAACTTCTATTCGATTGTTTATTTTCATGGCTATTTACTCACTGTTGGCCTTTGCATCAACTTGATGTTAAGAATTATTTTTTCCATGATGTTCAAGAAAAAGTATATATGGAGCAACCACCTAGATTTGTTGTTTTTAGGGGAAGTCGTCTTAGAAAATCTTTGTATGGATTGAAACAGAGCTCACAGGCATGGTTTGGTAAATTTATTCAAGCACTTAGTAGTTTGGAATGAAAAAAAGTATGTCTGATCATTTTATCTTCTATTGACGATCTGATCGTGTACGTCGATAATATAATTATCACTGGAAATGATACATTAGGTATATCTTTTTTGAAAACTTTCCTTCAAGATTAGATTCATACTTGGGAACACTAAAGTATTTTTTGGACATTGAAGTGATGAGAAACAAGAAAGATATCTGTTTGTCGCAACGGAAATATGTACTTGATTTGTTATCTGAGATAGGAAAACTAGGATTAAACCATGTAGCACTCCGATGATACCTAATTTGCAACTCACAAAAGAAGGAGAATCATTTAAAGATCTTGAGAGATATAGAAGATTGGTTGGGAAGTTGAATTATCTGACAATGACACGACCAGACGTCACTTATTCAGAAAGTGTTGTGAGCTAGTATATATCTTCACCTATTGTACATCATTAGGTTGCAACAGAACAAATTCTATGTTACTTAAAAGCTGCACCTGGGCGTGGGATCTTATATATAAATCACAGTCATACATGAGTTGAATGTTTCTCAGATGCTAATTGGGCAGGATCTAAGAAGATAGAAGATCAACATTTGAATATTGTGTTTTTGTAGGAGACGATTTGGTGTTGTGGAAGAGTAAGAAGCAGAATGTGGTGTCACGTTCAAGTGTTGAGTCACAACCTATGTGAAATGTTAGGATGCCAGTCTCTTCTGATGTAGATTGGACAGGTGATGTTGAAAAACAATCCTTTTTCCAATATCGAAGGAAGTATGTTTTAGACATTCTTAAAGGAATTGACATGTTAGGATGTAAACAAACAGACTCTT
TTATGGATCCTCAAAAGAAACTTGGACTCAACCATGAGAGCACACCTGTGGAAATCTATTTCTGGATCTTATAGGAACAAGACAAAGACCCAAACATCAGAATGAACTAATTCAAACGGAACAATAACTCGTTTATGGACTCTAGGACTCGAACTAAGACGATGAAATTTAGCAAACTGATAAGATTCACAATTCAACGAAGACAAAGAATGAAATTCTAGATGAATTTTCTTTAACACGGACAAAGATGGATGACCTAAATGACAATGGACTTCAAACAAAGATGCATTACCAAAGCAGTCCACAACTTTTAGTATCTGTTGATAAAAAATGTAAATGCCTCCAGATTCATATCCTTTACTAATAATCTCCTTCGTCATACGATCCTGAAACAAACAATAGCCAGAAAAAAGAAATAAAATAACTAAGGTCACGAGTGAGCTTATTAACCAAAATCAAATTAAATGCGAGTCGAGACAAATGCAAAATAGAAGACAAAGGAAGCGAGGCTGTGAGATTAATGGTGCCAGAACGGTGCCAGAACCAAGAACAGAGGATGTTGATCCATTTACCAAAGTGGCAATTGGAAAAGGTGCAAGTGACAAAAAGGTAGAAAATAAGCTAGAATTACTTATCATATGAGCTGTGGCACCAGAGTTGATGACAATGGAAGATGTAAGAAGACAATGATTCTTATTACCTTATTCAACAAGAGTTGTAATAAGATTCGATGAAGAAGGTGCTTGCAATGAGTCTTGGTACAACCTGGAATTTAGAAAACTCATATGGAGAAATAATAACCGATCCCTCAGCTGTATCACTGGAAGTAACTTGAGGTTTGTGACCCTTGTTCAATTCAACAATTTTCGACAGACACATTTCATGTGATTAGGCTTACGACAATAGTTACATACAATCTCCTTCAAATCTTCTCGACGACAAAACTAAGTCTCCGAATGTTGATGCTCATCCCTTTTGTAACCTTAGGATCATGTCGACTAATGAGAGTATTGTTGGATTGAGGAGCATACAAACTAGATGAAAGGTTTTCTGTACGGAGGACTCTACTGAAGGCTTCGTTTAAAGAAGGGGTCTCAGAGTCAGATAGAATTTGTGGTTTAGCTATCCCGAATTCTGGCGAAAGTCCATTTAGAAAGCTCATTACACCCATCTTCTCTCACTGAGCTTGTTGAACCTTTATATCTGTACTAAATGGGAGTAATGTATTATGTTCACACGTCTTCTTAACTTGCATAAAATAATTCGTAACGGATTGTTTTTTCTGTTTCGCATGATAAAGGGTTGTGCAAGCGGCGAGGCGTGTGTGGCTCACGTGCGACTGCTTCTGGCGAATGGTGACGACGGTCTTGCTCTACTGACGGTGGTGCTCCCCCTGAGGTAGGTGGTTTTGACAAACGATCACTCTAACGTGATGACCTAAACTTCAAATCTTAACCTCACTTGGAGGAGAAACTCCTAATTGCCAAATAAAATTTTAAAACCCTAGGGGTTGTCTCTGATTCCATATAAACAATATTGGGAAAAATACTAATTGTATTATACATTAGAGAGTCACTAACATATATATAGGTCCACAAGGAAACCCTAAACTAAGACATGTACAATTATTATAAAGGACAAATGTACATAATAATATATATTATAGTGGATGGTGAGAGGACGGTTGTTTGCTCCGCCTCTTGGCCGGTGGAAGAAGCTTCCGCCATGACTGCAAGTCTGCAACTTTCTTGCCTGCCATTAGTGCCTGAATCTTGGAGCTCTATACCAATTTAGTGTATTCCAAATGAAGGAATTTCTAATTCAATATGAATTATGAACTCTTTGATACAATGGAGGGGGTATTTATACAAAGTAACCAATAACCAACTAATTACCTAAAGGACCGGTTAAAAGAGTAAAGTAACCATAACAACCGGTAAACATAAAAATGGGAAATTAAAGTAAAAGCCCCCTGTTGCTAATACATGATGTGTAGTGAG
ATTAGTCGAGGTTTGCACAAGTTGGCCCAGACACTCATTGATATAAAAAAGAAAAAAAAATCTAATTTTCCTTATTTTCTATTCACAAGAATGTCCTAACTTTGACCCTCAGTTTGTAATAACTTGGAAATAAATTCTCGGAACAAAAAATGTTTAGGTGAATTGAGTTTGCTATCTACATCATGGCAAATCATTCTCGAACTTTGGTCAAGTTAATGTGCATATAACGTTTCTGGGCTGCAAAATAACATATCAATTTTTCAGAGTGCTAGGATTGTGCTCGACGAATTCAAAACTGTGATGCACTCATCGGATTCCATAGCACCAGTGGGGATCCTCCATCCTGTTTCCTTCCATTTGGAGTATAAGACCTCTTCTCTTGGTATCTGTGACAACAACAATTCAGTTGACCATGAGAAAGTGGGTCTTCTTTTCGAAATTCAAGGAGATTGCTTTCACAGCAGCGACACATTACAGTCCATAGCTGACGCCTCTAGTGCCAGTCATATACTTGATGATGCTCCAGTTGATAGAATTCATATTTCATGTTCAAGGATGCGGAATACATTTGCAATTTGTGGCTGGTTGCAAGATTCATGCTATACATCTCCAGGTTGATTAAAGGAGAGTTGCCCTATTTACATTTTCAACTTTCAGCCTATATTTTTTGTGAGCACACTCAAGTCTTTGATTGGCAGGTCTGCAGGATGATCTAAAGACTATTCTCAGGAAAATAGAATCAGAACTTCTAGGTCATGGTTGTGGATGGAAAAATGTGCTCTATATTCATCTTTACCTTGCTGATATGGATGAGTTTGCTTTGGCTAATGAAACATATGTCAGTTTTATAACTCAGGAGAAGTGCCCTTTTGGTGTCCCATCACGCAGTACAATTGAATTGCCTCTACTCCATGTGAGATTAGGAAATGCATATATTGAAGTTCTGGTGGCAAATGACCAAACGAAAAGAGTTCTTCATGTTCAGAGTATATCGTCTTGGGCGCCTAGTTGCATTGGACCATATAGCCAGGTAAAAACACCTTTTTTGCATATCTTAATGATCTTAACTTGGCAAAAAAGGATTTCACTCTTTAAGCTATCTATTCAGACCCTGTCAAATTACGTTTTTAGGGAAACACATTTGTTATTGATTAAATTCTGGTGTCATGAAGTTGAATTACCATGTACATTTCATCATTTTCGTCCAAAATGACTGGAATCTTGAATTGGAATGCTTTTATTTGTTCTCATTAATGTTTGATGGTAGACGGTTTGGTTTTGTGTTTTCTGTTTCTCTCCATGTGGAGTTGTATTTTGGAGCATTGGTTTCTCTTTTCATTTTTTCAATGAAAGGTGCAATTTGTTACTTGTCAAAATAAAAAAAAAGTAATTTCTCATCTGTTGACTAAACCTAAACCTTTTATAGAGGTCCAAGGAAATTACTTGATGTATTAAAAGATATCAAAGTACATATGTAACTACCTTTGTGCATTCTTATTACATGGATTAGAAATCTTGATGGTGGACAAGATCTCATAAATAGTGGTCAAATTTGTTCTATAGCTTCCGTGCAAAAATATTAACATTTTGAAATGCATTTTCGGTAGGGTTTTGGTATTTTGTTATATTTGTTGATGATTTTTCTCGTCTATCATGGTTATATCTAATGAAAAATTGTTCTGAGTTGCTTTCCCATTTTTGTAACTTTCATGGTACTATTCGAACTCAATTTAACGGTTCTCTTGAAGTTTTATGGAGTGGTAATGCTAAAGAATACTTCTCACTTGATCTTAATTCTTATTTGAGTGAACATGTAATTCTCCATCAATCCTAGTGTGTTGATGCTCCATCTTAAAATGGGATTGCAGAACGAAAGAATCACCATCTCCTTGAAACAACCAGAGCCTTAATCTTTCAGATGAATGTCACAAAATAATTTTGGGTTGATGTTGTATCGATGACTTGCTTCTTAATAAATCACATGCCTTCTTCTATTCTTA
AGGGTGAGATTCCTTATCATATTTTGCGCCCCAAACTCCATTTCCTCTAACACCCAAAATCTTTGGTTGTACTTGCTTTTGATGTAAAAACAATCTTGGAGGTTTCTTGAATTAAAACTTTCTTGAATAATACCTCAATTACAACCTAGAGATCTCTATTTAAAGGCTACAAACCAACCTTTGTTACAACTTAAGTCATTGGCAGTAGAAACTAAACCAAAGACTGCAAAGACTGCAGTAAAAACTAAACTTAAAGTCATTGGCAGCCGTAACTAAACCAAAGATGGCAATAAAAACTAACTAAAGCCACGATAACAACTTAACAAAAGTAAAAACAAAAAATAATTACACTTCACAACAGTAAATAGAAAATAAAGTCATGATAAATGAAGATATGCTGCATCAATTCCCTCCCCCTAAAGATAACTCGTCCTCGAGTTTATGGAGCCAACCGAAAATCATCTGGAGCATGGTAGGCAGTAAGATCCGAGACATTGAAGGTTGGAGTGATCTTGAATTGAGGAGGGAGATCGACCTTGTAGGCATTAGGTCCATAACGTTCAGGAATAGGAAAAGGACCAATTTTCTTAGGATGTAGTTTTCCATGTGTACCCGATGGCAACCTAGATTTCCTTAAATGAACCATCACAAGGTCCCCAACCTCAAATGTTTGTGAACGGGGGTGCTTATCAACCTGTAGTTTATATGTTGCATTTGCCGCCTCCAAGTGATCATGTACTTCTTTGTGTAACTTGGAAATCATGTACTTCTTTGTGTAACTTGGAAATCATGTACTTCTTCGTGTAACTTGGAAATATGCTCTGCCATTTCCTCTGCTTCCTGATGTACATCCAAAGAAGATGGTAATGTAGTAAGGTCCATAGTTAGACATGGAATTTTAGTATAAACAATTTCAAATGGTGACTTCCCTGTAGACCTATTTCTCATGTGATTGTAGGCAGATTCTGCTTGTGGTAAAGCTAAGTCTAAGTCCCATTGTCGTGTTTGTCTCCACAAAGGCGACGAATCAGATTTCCAAGGGTACGATTGGTCACTTTGGTTTGTCCATCAGTTTGTGGATGACTAGTTGTACTAAACTTTAATCCTGTATCAAACTTCTTCCAAAGAGTTCTCCAAAAATGGCTTAAAAATTTAGCATCACGGTTAGAAAAGATCGACTTAGGAATACCATGTAAACGAACCATCTCATTAAAAAAAAAATTAGCAATAGCAATAGCATCATTAGTTTTCTTACAAGTCGAAAGTGAGTCATTTTGCTATATTGATCCACTACTACCAAAACAGAATCAAACCCCCTTTGTGTTGGAGGCAATCCTAAGACAAAATCTATTGAAAGATCTTCCCATATAGAATGTGGAATAGGTAAAAGAGAATAGAGACGTGTTTTGTGTTTGTCGTTTAGAGGTTTGACAAACAAAACATTTTTGAACAAAGTTGGTAACATCTTTTCGAAGTTGTGGCCAATAATACCTGCATGACACTAACTCAAAATTTTTATCTCTTCCTAAATGTCCAGCTAAACCCCTAGAATGCATTTCTTTCAAGAGTTGTTCACGCAAAGAGGTGTGAGTTGCATAATACATTACCTTTAAATAAATATCCATCAACAATGTGAAAATCATTAGGACTTACATTTTGACAACATTTTCTCCACATATCACAAAAATCAACATCAAACTCGTATAAGGGTGGTAATTCCTCAAAAGCAATGATTCCCCCTTTTAATAAAGCTAGAAGAGAATGTTTTCTACTAAGGGCATCAACTACTTTATTAGTTACTCTAGCTTTGTGTTTAATAAGAAAAGCAAATTTTTGTATAAAAGTTATCCACCTTGCATGCATTCCGTTTATATGTTTTTGAGTCTGTAAAATTTTCAAAGAGTAATGATCCATGAATAAAACAAATTCTTTACCTAGAAGGTATTGCTCCCATTGTTTAAGTGCCCTAACAAAAGAATACAACTCTGGTTCATAAGTAC
TTCATTTTTTTCTAGCTTCACTGCGTTTTTCACTAAAATACTCAATTGGGTGATTTTCTTGAGACAAAACAGCACCTACTCCTATTCCCGAGGCATCCACAACCACTTCAAAAACCTTACTAAAATCAGGTAAAGCCAAATAGGGGCTGCACATAGCTTGTTTTTTAAAGTGGCAAAACTATTGTCTTGTTTATTAGTCCAAGAAAACTTTCTTTTTTTTCTTAAACATTCAGTCAAAGGTGCAGCAATAAGGCTGAAATCTTTTATAAATTTACGGTAAAAAGAGGCTAATCCCAAAAAACATTGGATGTCTTTCTCATTTTTTAGTTGTGGCCTTTGTTTAATAGCTTCTATCTTTTTTGGATCTACAGAAACTGTGTTACTCCCTATAATAAAGCCTAGAAAATGTATCTTAGATTCTAAGAAGTAACATTTCTTTAGATTGATGTATAGTTGGTTTCTTTCCAAAGTTTCAAAAAGTGCTTTTAAGGTATTCTAGATGCTCTTCTTTGGATCGACGATAAATCAAGATATCATCAAAATACACCATAACAAATTTATTAAGGAAAGGGAGTAGAACCTGATTCATTAACCTCATGAACGTGCTAGGGCGTTAGATAATCCGAATGGCATGACCAACCATTCAAAGAGTCCAACATTAGTTTTGAAAGTAGTCTTCCATTCATCTCCCTTCCTTATTCGAATTTGATGGTATCCACTACGGAGGTCCACTTTAGAAAATATTCTCGCACCACTCAATTGATCAACTAAATCCGATATTCTTGGGATAGGAAATCTATACTTTACCATGATTCTATTGATGGCTCTGCTGTCCACACACATTCTCCATGTTCCATCTTTTTTTTGGAGTCAGTCATACGGGGATAGCGCATGGACTAACACTTGGTTTGTATGACCCTTGTCTAGAAGTTCTTGAACTTGTTCTTGTAAGACCTCATATTCCTTAGGGCTCATCCTGTAATTAGGCAAATTGGCAAGGTAGAACTTGGTATAAAATCAATTTGGTGTTGAATATCTCTCAGTGATGGTAAAGTAGTTGGATTTTTCACCAAATCTGGAAAGGTTTGCAAAATTTCAATAACAGAAGGGTCAATTTGTCCCTCAGCCTCAAGTTGTTCTTCTCTTGTTTGTACAACAACCCATATTTCCTCGGCAATTAGTTGGCCAAATTCTTTAGCCTCAAAACAGTGAACAAATTTGATTGATTCTCAAACTTTGTTTTAGTCACCTTCTTTGTATTGGCGAGTGGCAATAGGACTATCTTTTTGCCCATCCAAGAAAACTCATAAGTATTTTCTCGCCCCTTGTGGAGGGTTGCACATCATATTGCCACGACCGCCCCAAAAGAATATGATATGCATCCATGTCGAGTATGTCACAAATAATTTGATCTTTGTAATTATTGCCTATAGACAATGAAATAGTAGAAATCTCTTTTACTTGTCCTCCCCACCTTTTCGGATCCAACTCACTTTATTGGGATTTGGGTGAGGATCGATTTTTAAGTGGAGAGCTTGAACGACCTTGTGTGAAATAATGTTTTCACTACTTCCACTATCAATGATCACATTACAAATCTTACTGTTAATAGTACATCGTGTTCTGAATAGAGAATGTTGTTGTGGATGGGCATCGGTCGTTGGTGTAAGGAGTATTCGTTGAACAACACAATTTAGGTGTTCACCATCATCAACCCCCACATAGGATACTTCATCTTCAAGTTCATCTGCAAATTCTTGTTCTTCTTCATGTGGACCATCCATAATAGTTAAAGTTTTCCATTGAGGGCACTCATTTGATAAATGCCCTTGTTGCATCAATCCTAGGGTTGGCCGGTTGTAGATGTTGGATTGGTTACCTTTTTTAGCATTCAACCCATCATTGGCCTTGCCTCCATTGGTTCGCGGTGCCATTACTTCTTTACCTTTGTCAGCATTGGATGTAGGGATGTCCATCTTCCCCGCCCCTAACGGGGCGGA
GAATCACCGTTTAGGCGGGGGATGGGAGAGGAGTGGGGAAGAATTTTCGTTGGCCAAAGGTCCCCACCTCGATTAATTTATTTATTTTTTAAATATATATTTGTATATATATAATAAACCATACTTCTAATTTTTTATTAAATGTAATTTTATTTTTCTAAGAAAATTGAAAAAAATGTCTCTTAATAGAAATCTCTAATTAAGAATTAACCATTTAAATGAAAATTAAAAAAAAAATATTTAATAAAATTTAGAAAAATAGAAATGATAAACGGGGAAGTATTTCTCGGGGGAACCTGATCCCTGCGAATTCCCCGCAGGGAATCCCTACCCTGTGAAATTAAATGGGGTATTTTGCGGGTATGGGGAATGAAATAGGGGGGCGGGGAAGCCATTCCTCGGCCTCGCCCCTCCCCGCCCCGTGGGCATCTCTAATTGGATGGGGATGGACCACCAACTTGCCAATTCTTTTGCACCTCATTGTTGTTTCTTTTAAAAGTTTGGCTGCCTTTGTCCTATTGTGATCTTTTTGTGTCCACACCCTTTTGATTCGAGTTTCTATTGTCTTCAACCTTCTTTGCCAACGCCACAGTAAGATAATATAAAGGCTGTAAATTCACCTTTTTCTTGATATCCTCTCTTAATCCGTCAACGAAGTGAGCAATTAGGTGTTGTTTAGTTTCAGCCAAGTTGTTGCGTGCACACAATCTATGGAATTCCTCTGAGTAATCGGCCACCGAGTGCACTCCTTGAACACAATATTGATATTTCTTATAAATTAGTTGCTCGTAATTAATAGGGAGGAATTGCCATTTCATTAATTTCATCATCTTTGGCCAATATCTAATAGACCTTTTTCCATATCTCAGGATTTGTAGTTGATCCCACCATGCGGAGGCTCCACTTTTCAACTTGTAGGTAACCAATCTCACTTTCTTATCTTCGGGAGTGTTGGTGTAGTCAAAGAAAGATTCTACTTCTTTGACCCAATCCAAAAAGGTTTCAACATCAAACTTACTACTAAATGTAGGTAGATCCACTTTCATCTTATAATCATAACCTTGTTGAGGAAAATTTGGTGCTTCTCTTCTCCGTTCTTGACAATTCAAGTTTCCTCTATAGATTGGATCTTCATCATCGCTTGAAGTATCAAGTCCTAAGAATCTTGGATCATCTAACCTTGGATCTTGAATTTGAGGTGGTGGAAATCTTGTCTAATGTAACCTTTCTGGTTGAAAAAATTGTGATTCTTCTTCTCTAAAAAGTTATGTGGTTGATTCTGATTAGTAGGTGGTAAATTCTTGGGGAATTCGTCTTTTTCTTGCTTCATAGGGCTGATGTCTCTCTGGTTGGGAGAAAAATTCAGGTTCTGTCTTTTGACAGCCTTGTTGGTGGGAGTTATTTGGCTAGTTGACATAGAATCCATTCTTGTGGTCATGGCCTTCATCATTTGGTGTAATTCTTGCATGTTGTCTCTTATTTCATCCATGGTTTCTTCCATCTTGAGGATGTGTTGAGACATGATTCTTGGAGAGAGGATTTCTTCAGCCTCTTGTTCGGATCGAACTGCGTCGCTCTCGGAGGTGGTGGGAACTTTTCTCCCAACCATTTAGATTCCTTTGTTGAGGTGTTATGATTCCAAACTGATGTAAAACCAATCTTGGAGGTTTCTTGAATTAAAGCTTTATTGAATAATAATACCTCAATTACAACCTAGGACCCCTATTTAAAGGCTACAAACCAGCCTTTGTTACAACTTAAGTCATTGGCAGCAGTAACTAAACCAAAGACGACAGTAAAAACTAAACTTAAGTCATTGGCAGTAGTAACTAAACCAAAGACGACAGTAAAAACTAATTACAACCACGATAACAACTTAATAATAGTAAAAACCAAAAATAATTACACTTAACAGCAGTAAATATAAAATAAAGCCATGGTAAATGAAGTCTGAAGATATGCTGCATCAGCTTTGTTCGGGGTGTTCGCCCTCAACTCAC
CAAACTAGATCCCAAGTCCTTGGTTATTCTCGTGTCCATAAGGGGTATTGTTTTTATTGTCCTAGTCTCGATTGTTACTTTGCCTCTCTTGATGTCATATTCTTCGAAGATTCTTCTTTCTTTCATCTCCATAGGTTCCTATTTCTACTCACCTACCTACCACTCAGGTTTATTCTTGACGACCACCCTCTTTAGTTCCATGCCCTATAATAGAGGCTTCTTTGTCATTGAATTCAGGAACAAAGGATGATCTTCCTATTACTCTCCGTAAAGGTAAGCACTAGTGCACATATCCTATTTTCTCTTTTGTTTCATATAACCATTTGTCATCTTCTACCTGTTCTTTCATTGCATCCCTACAATCTATATCTTTTTTCTAAGACCGTTCATGAAACTTTATCTCATCTTGGTTGGTGTGCTGCAATGGTAGAAGAGTGACTGCCTTAGATGACAATTGTACTTGGGATTTAGTTTCTCTTCATATAGGAAAGAAGCCTATTGGTCGTAAAAGGGTCTTTATAGTTAAAGTCAATCCAAATGAGTCTGTACCTCGGTTGAAACCCACCTTGTTGATAAAGGCTACGTACAAACTTATGGGGTTGACTATTCTAATACTTTTTCTCTTGTTGCTAATATGACTTATGTCGGGTTATTCATTGTTGGTTGTTGCATCAATTTGATATTAAAAACACCTTTCTTCATGATGATCTTGGGGATGTGTTGTATATGGAGCAACCTCTCGAGTTTGTTGCTTGGGGGAGAATGGAAAGGTATATCGTCTTCGTAAATCCTTGTATGGGTTAAAGCAGAGTCCACGAGCATGGTTCAAAAAATTCACTCAGGTAATTGAAAGCTTTGGAATGAGAAAGAGCAAGTCAGATCATTCTGTCTTTTATTAGCAGTATGAGAATGGTGTCATTATGTTGGTTTTGTATGTTGTTGATATTGTTATTACATGCGATGACAGATTGGGTATCCAATCACTAAGACCTTTTTCCATAGTCAATTCCACACAAAAGATTTGAGAATATTGAAATATTTTCTAGGAATTAAGGTAATGAGAAGCAAGGGAATCTTATCACAAAGAAAATATCTAATTGACTTGTTGACTGAAACAAGAACATTAGAGGCTAAGCCATTTAGTATCCCGATGACGCCTAATATACAACTCACAAAAGAGGGAGAATTACTTAAAGATCCTGAACGATATAGGAGGTTAGTTGGAAAAGTCAATTATCTTATGGTAACTCGACCAAACATAGCTTATTCAGTGAGCATTGTGAGCCAATATATGTCTTCTCCGACCACATTGTTTTGATTGTGTGGGCCGACAATATTATCCTCATTAGTAATGTTATAATCATGTTCATTCTTAGTGAGGAAATAATAAATATGTCTAGGTTTGGGGATTACCAGGAGTCTAATGCATATATCAGTTGCAATTACATTTTTTATTTTGTCGTGGCTATTGTAAATAGCAGAATCAAATTGATTTCAAACATCCTAACTAGTTCTTCAAGTTTATTTCCAGGCAACTCTGCACAACGAAATCCTTTACATGGCCGGACAATTGGGGCTTAACCCTCCCACTATGACACTTTGTAGTGGAGATGCTACAGATGAGCTGGAACAAGCATTAAAAAATTGTGAAGCAGTGTCAGAATGTTTTAACTCTTCAATATCTACTACTTCTGTTATATTTGTCACGTACTGTTCTACTCGTATTCAACCAGAGGAGAGAAGGAGAATAGAGGACAGGTTGCATGGTGTACTTGAGGAAATGAGGCATTCCGATAAAGATAGTTTGTCAAAAGTTCTTGATACCATTTTTCTGTATATTCATGTACCCAATCTTCCTAAAGGGTACGCTTTTATGTATCAATGTAATGAGATCTTTTATCATCACACTTGTGCTTTTGTTGGAACTTTTTATTTTTACAATCAATGCTCATTTTAAAATAACAAATTAGGAAAGATGGTTTCTTCTAAATAATTATTTT
CGGTTCTTGTAGAAGTTGAAAATCTTGATGATGCACCATAGGTTTGTGACTATTTTATTGGTTTTCAAAATTGAGTCAAATGTGGCTTATAATATTTTTAAGTGTTTCTTTCTTTCTTTTTCGTTTTTGATTTCCAGTATTTTCTTTGCAGAGCACTTGTTGAAGTAAAGCCTATTCTTTATGTTCAAGAGAATTTAGATACAGTGGTAGAAAGTGTTCAAGACTCACCGAAGTTACATACCCCTACATATTGGGGTTTCCATCATGAACACTGGCACAAGTCTTGCATTCAGAAATGCGTTGTTAATGGAAAGATATGTGCGGTAGTGTTGTCTGTGACAAATGAACTTGCTCGGAATATTTGTTCTTGCTTACTTGGAAATCAGATTACCGAGGAGCACTTCGGATTGGTTTCTAAATTTTGCATTTATGTTCTCAATGAAGTTCTCTTGGATAGCGCTTTCTGTTGGGAAGATATAAAGGTTAGTTAATCTGGAAAATATTATTGCAAAAAGATTACTGTACAAACTTTTGTTCCTCGTTTAGTGTTTTTCAGTTCTTTGTTTGTTTAGGATTTTCTTTGATTCAGTGGTTGACCTAAATGATTTTAATTTTTATGCTTTTCCTTATTCTTTCACACCCCCAAGTCGACTTGCTTTGCCCCTGTTCTAGTCCTTGGGGTGAGGGGAAAATTGAGATGGAGAAAAATGTATCAGGAAAGTGATACAGGAGAGTTTTGGTCTTGGCTTGTGGATTTACATTATTGGTTTGTTTGGCAGTCAATTCTGTAAGTTGGGTTATCTGAATTTAGGCTTGGACAGAAAAATGTTACATTTTTTGTTGAGATCAACGGTTAAAGTTAGGGTATTCTTGTAAATGAAAAATGAGGAAATTATTACTGTCTTTTTAACTTGTATGATTTGATTAGACTATATTATTCTTACCTTTCTTGTAGGCTTTTTTTTCTCATGTTTTTAACTTAGTGTGTTATTAGTTCTAATTTCTAGATCAGAAGAAATTCCAGAAATCTTCATGGTATCTTCTTTGTTGGACAAAGTCAAATCTGAATTCAGTAATTGTGGAATTACTATCTTTTCAGAGAGGCTTACACTTATTCAAGCCATTTTATCAAGTATGCCTCCGTACTTCTCTCTTTTAAAGTCCTCAAGAATATAGATTTGGATTTCGAGAAAATTTCATATATTTTCTTGTGGGTGGCTCAAAGTCCCAAAGGTCTTGGAGGATTAGGCATAGAGGCATTGTTAAATTGGCAGGTAAACATGCTAAGTAGGTTGAGAAATTTCCCAACAGAAGTGGGTAGCACAATGCTATTAGAATAAATAAGTTGAAGATTCCTATGTATCGTTTATGAAGGGACCCAAAAGATGTATAGAAGCTCTTGGAAATCGATGTCATTTTACACTGGAAGCATTCGGATTTTTCATCCTTGGCATTAAGAGCAGAATCAAATTTATAGAGGATGTCTTGATTGGAGACAGAACTCCCTCAGTGATTCCAGGTACTTGGACCTCCCAAGTTTGAGTGTTCTTACTCAAGCCTAAGCTCACAAACCAATTGCTGCACCTTCATCCTTTCTTACTCACTCTATTTATATTTTCTTTTGAGTTCAACAATATGTGGAGGATTTGAACCTCTAGCCTCTCGATCGAGCTTTATGCCAGTTGAGCTATGTTCATGTTGACTCACTCACTCTATTTATAACAAATCTCTCCCAACAAGCTCTATGACTACTAATACACCCTTCATACCCTAATAATCTTTTTAATTCTAACCCAACAACATCATAAGTCTATACCACTTCACAAGAACAGATATTTTTTCCTAAGAAATTTTCAGTCATTGTTTTGCTAATTTGTTTTCCAGGCGTTGTCAGGCAGTGTTGCATGACGCTCTGTGTTTACATGTTTGGGTTGGTAACCATTGCAAAAAAATTAGGAAAGAGAAAGTCTCAAGAGCTCCTTTTCTGCTTTAAAATTGAT
TGAGATGTTAAAACTAAACTCTTCTATAAAATGGAAAGGTAGGAAAGGAAACAAATTTCTTACTGCTAAGCCACACTTGTTTTTGAATGGCGATAACACAAAATAAAGTAGTTGTCAAATAGACCATTTGTTATGAATGTTATATTCTACATAGTTGTTCGGTTGCTCTATTTTTGCTTTTCACAACAGATATCCCTCTTAGGAATATTATTTAAATTTCTTACAGTTTCTAAACATATAGTATCGTTTTTGTAGAATTTAAGGTTCTACTTTCCAACTAGCCTTAATATTACTCTGGAGGCTGCGTCAATCGTATTCTCTCGTGCTTTCGATGAACTTGCGGAGTCGAATCCAACAGCTCATGTTGATCGGTTCTTCAACCTCATCCCAGTCTTAGGTGCTGGACGGACACCAACATCCATGGATGATATATTGACTTGCGAATTATTTGCTCAAAAATCCTGAGACCAAGTTGCTCAGCTGCTGCTATTCTCTGGCTTGCGATTTGTATATCTCCTTACCATGTTCACACGATTGAAGTGAAGAAAGAAATAGGCAGATTCTTGGTTATGGAGGTACGCCCACTCTCTTGAAGGGAAAACTCTTATCTATCTAGTTATCGAGAAAATTTTTAGTACTACGGTAGTTCCTCTCATTTGACACATGATTGGATAATATATTAATTTATTCTATGGGCCCATCCTATTCTTAAATGCTATTGTTATGGCAAACAATGGATAGAAGTAGTATAAAATATTGGTGGCAAGGAGTGAAGGTAGTATGAAAATTTTTTCATCCTATTGTAGTACCAGAATTTTCCTCGCAGGTTTATTTCCCTAAAAGATATAGATATATATAACTTAGATTGATTTGTATCATTTTTTTTTAATGTTATTGTAATTTGTATCAATTCTCTTCAGAGGAAAAAGCAAAACAATGCAAGGAAAATATATTGGGATTGATTTTGTACCATCAAACCTTTTTATTAAGAAGTACATATTTCAATT

mRNA sequence

GTTATTGTTAATCTCTAGCAGAGCCAGAACCTCCCCAAAACCCCAACGGACAAGCAGAGCAATTTCTTCTAGTTGGCGATCGGAGATGAAGGTGGTGGCTTTGGTAAGCGGCGGCAAAGACAGTTGCTTCGCCATGATGAAGTCTATTCAATACGGCCACGAGATTGTAGCGTTAGCAAATTTGTTGCCGGCCGATGATTCCGTGGATGAACTTGATAGTTACATGTATCAAACTGTTGGACATCAAATTATTGTTAGCTACGCAGAATGCATGGGGATTCCATTGTTTAGAAGGCGAATTCAAGGATCCACAAGGCATCAGAAGCTTAACTATAGAATAACTCCAGGTGATGAAGTGGAAGATATGTATATTTTGTTAAAGGAAGTGAAGAGGAAATTACCTAGTGTCACCGCAGTATGCTCTGGTGCCATTGCATCTGATTATCAAAGATTGCGGGTAGAGAGTGTTTGTTCAAGGCTAGGACTTGTTTCCCTAGCATATTTGTGGAAACAAGATCAATCATTGCTTCTACATGAAATGATAAACAATGGAATTTTGGCCATCACGGTGAAGGTTGCAGCAATGGGATTGGATCCTGTGAAGCACTTGGGCAAAGAATTAACATTTTTGGATTCAGATCTACATAAGTTGAACAAATTATATGGGATCAATGTATGTGGTGAAGGTGGGGAATATGAAACGTTGACCCTTGATTGCCCTCTTTTTAAGAGTGCTAGGATTGTGCTCGACGAATTCAAAACTGTGATGCACTCATCGGATTCCATAGCACCAGTGGGGATCCTCCATCCTGTTTCCTTCCATTTGGAGTATAAGACCTCTTCTCTTGGTATCTGTGACAACAACAATTCAGTTGACCATGAGAAAGTGGGTCTTCTTTTCGAAATTCAAGGAGATTGCTTTCACAGCAGCGACACATTACAGTCCATAGCTGACGCCTCTAGTGCCAGTCATATACTTGATGATGCTCCAGTTGATAGAATTCATATTTCATGTTCAAGGATGCGGAATACATTTGCAATTTGTGGCTGGTTGCAAGATTCATGCTATACATCTCCAGGTCTGCAGGATGATCTAAAGACTATTCTCAGGAAAATAGAATCAGAACTTCTAGGTCATGGTTGTGGATGGAAAAATGTGCTCTATATTCATCTTTACCTTGCTGATATGGATGAGTTTGCTTTGGCTAATGAAACATATGTCAGTTTTATAACTCAGGAGAAGTGCCCTTTTGGTGTCCCATCACGCAGTACAATTGAATTGCCTCTACTCCATGTGAGATTAGGAAATGCATATATTGAAGTTCTGGTGGCAAATGACCAAACGAAAAGAGTTCTTCATGTTCAGAGTATATCGTCTTGGGCGCCTAGTTGCATTGGACCATATAGCCAGGCAACTCTGCACAACGAAATCCTTTACATGGCCGGACAATTGGGGCTTAACCCTCCCACTATGACACTTTGTAGTGGAGATGCTACAGATGAGCTGGAACAAGCATTAAAAAATTGTGAAGCAGTGTCAGAATGTTTTAACTCTTCAATATCTACTACTTCTGTTATATTTGTCACGTACTGTTCTACTCGTATTCAACCAGAGGAGAGAAGGAGAATAGAGGACAGGTTGCATGGTGTACTTGAGGAAATGAGGCATTCCGATAAAGATAGTTTGTCAAAAGTTCTTGATACCATTTTTCTGTATATTCATGTACCCAATCTTCCTAAAGGAGCACTTGTTGAAGTAAAGCCTATTCTTTATGTTCAAGAGAATTTAGATACAGTGGTAGAAAGTGTTCAAGACTCACCGAAGTTACATACCCCTACATATTGGGGTTTCCATCATGAACACTGGCACAAGTCTTGCATTCAGAAATGCGTTGTTAATGGAAAGATATGTGCGGTAGTGTTGTCTGTGACAAATGAACTTGCTCGGAATATTTGTTCTTGCTTACTTGGAAATCAGATTACCGAGGAGCACTTCGGAT
TGGTTTCTAAATTTTGCATTTATGTTCTCAATGAAGTTCTCTTGGATAGCGCTTTCTGTTGGGAAGATATAAAGAATTTAAGGTTCTACTTTCCAACTAGCCTTAATATTACTCTGGAGGCTGCGTCAATCGTATTCTCTCGTGCTTTCGATGAACTTGCGGAGTCGAATCCAACAGCTCATGTTGATCGGTTCTTCAACCTCATCCCAGTCTTAGGTGCTGGACGGACACCAACATCCATGGATGATATATTGACTTGCGAATTATTTGCTCAAAAATCCTGAGACCAAGTTGCTCAGCTGCTGCTATTCTCTGGCTTGCGATTTGTATATCTCCTTACCATGTTCACACGATTGAAGTGAAGAAAGAAATAGGCAGATTCTTGGTTATGGAGGTACGCCCACTCTCTTGAAGGGAAAACTCTTATCTATCTAGTTATCGAGAAAATTTTTAGTACTACGGTAGTTCCTCTCATTTGACACATGATTGGATAATATATTAATTTATTCTATGGGCCCATCCTATTCTTAAATGCTATTGTTATGGCAAACAATGGATAGAAGTAGTATAAAATATTGGTGGCAAGGAGTGAAGGTAGTATGAAAATTTTTTCATCCTATTGTAGTACCAGAATTTTCCTCGCAGGTTTATTTCCCTAAAAGATATAGATATATATAACTTAGATTGATTTGTATCATTTTTTTTTAATGTTATTGTAATTTGTATCAATTCTCTTCAGAGGAAAAAGCAAAACAATGCAAGGAAAATATATTGGGATTGATTTTGTACCATCAAACCTTTTTATTAAGAAGTACATATTTCAATT

Coding sequence (CDS)

ATGAAGGTGGTGGCTTTGGTAAGCGGCGGCAAAGACAGTTGCTTCGCCATGATGAAGTCTATTCAATACGGCCACGAGATTGTAGCGTTAGCAAATTTGTTGCCGGCCGATGATTCCGTGGATGAACTTGATAGTTACATGTATCAAACTGTTGGACATCAAATTATTGTTAGCTACGCAGAATGCATGGGGATTCCATTGTTTAGAAGGCGAATTCAAGGATCCACAAGGCATCAGAAGCTTAACTATAGAATAACTCCAGGTGATGAAGTGGAAGATATGTATATTTTGTTAAAGGAAGTGAAGAGGAAATTACCTAGTGTCACCGCAGTATGCTCTGGTGCCATTGCATCTGATTATCAAAGATTGCGGGTAGAGAGTGTTTGTTCAAGGCTAGGACTTGTTTCCCTAGCATATTTGTGGAAACAAGATCAATCATTGCTTCTACATGAAATGATAAACAATGGAATTTTGGCCATCACGGTGAAGGTTGCAGCAATGGGATTGGATCCTGTGAAGCACTTGGGCAAAGAATTAACATTTTTGGATTCAGATCTACATAAGTTGAACAAATTATATGGGATCAATGTATGTGGTGAAGGTGGGGAATATGAAACGTTGACCCTTGATTGCCCTCTTTTTAAGAGTGCTAGGATTGTGCTCGACGAATTCAAAACTGTGATGCACTCATCGGATTCCATAGCACCAGTGGGGATCCTCCATCCTGTTTCCTTCCATTTGGAGTATAAGACCTCTTCTCTTGGTATCTGTGACAACAACAATTCAGTTGACCATGAGAAAGTGGGTCTTCTTTTCGAAATTCAAGGAGATTGCTTTCACAGCAGCGACACATTACAGTCCATAGCTGACGCCTCTAGTGCCAGTCATATACTTGATGATGCTCCAGTTGATAGAATTCATATTTCATGTTCAAGGATGCGGAATACATTTGCAATTTGTGGCTGGTTGCAAGATTCATGCTATACATCTCCAGGTCTGCAGGATGATCTAAAGACTATTCTCAGGAAAATAGAATCAGAACTTCTAGGTCATGGTTGTGGATGGAAAAATGTGCTCTATATTCATCTTTACCTTGCTGATATGGATGAGTTTGCTTTGGCTAATGAAACATATGTCAGTTTTATAACTCAGGAGAAGTGCCCTTTTGGTGTCCCATCACGCAGTACAATTGAATTGCCTCTACTCCATGTGAGATTAGGAAATGCATATATTGAAGTTCTGGTGGCAAATGACCAAACGAAAAGAGTTCTTCATGTTCAGAGTATATCGTCTTGGGCGCCTAGTTGCATTGGACCATATAGCCAGGCAACTCTGCACAACGAAATCCTTTACATGGCCGGACAATTGGGGCTTAACCCTCCCACTATGACACTTTGTAGTGGAGATGCTACAGATGAGCTGGAACAAGCATTAAAAAATTGTGAAGCAGTGTCAGAATGTTTTAACTCTTCAATATCTACTACTTCTGTTATATTTGTCACGTACTGTTCTACTCGTATTCAACCAGAGGAGAGAAGGAGAATAGAGGACAGGTTGCATGGTGTACTTGAGGAAATGAGGCATTCCGATAAAGATAGTTTGTCAAAAGTTCTTGATACCATTTTTCTGTATATTCATGTACCCAATCTTCCTAAAGGAGCACTTGTTGAAGTAAAGCCTATTCTTTATGTTCAAGAGAATTTAGATACAGTGGTAGAAAGTGTTCAAGACTCACCGAAGTTACATACCCCTACATATTGGGGTTTCCATCATGAACACTGGCACAAGTCTTGCATTCAGAAATGCGTTGTTAATGGAAAGATATGTGCGGTAGTGTTGTCTGTGACAAATGAACTTGCTCGGAATATTTGTTCTTGCTTACTTGGAAATCAGATTACCGAGGAGCACTTCGGATTGGTTTCTAAATTTTGCATTTATGTTCTCAATGAAGTTCTCTTGGATAGCGCTTTCTGTTGGGAAGATATAAAGAATTTAAGGTT
CTACTTTCCAACTAGCCTTAATATTACTCTGGAGGCTGCGTCAATCGTATTCTCTCGTGCTTTCGATGAACTTGCGGAGTCGAATCCAACAGCTCATGTTGATCGGTTCTTCAACCTCATCCCAGTCTTAGGTGCTGGACGGACACCAACATCCATGGATGATATATTGACTTGCGAATTATTTGCTCAAAAATCCTGA

Protein sequence

MKVVALVSGGKDSCFAMMKSIQYGHEIVALANLLPADDSVDELDSYMYQTVGHQIIVSYAECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLKEVKRKLPSVTAVCSGAIASDYQRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKELTFLDSDLHKLNKLYGINVCGEGGEYETLTLDCPLFKSARIVLDEFKTVMHSSDSIAPVGILHPVSFHLEYKTSSLGICDNNNSVDHEKVGLLFEIQGDCFHSSDTLQSIADASSASHILDDAPVDRIHISCSRMRNTFAICGWLQDSCYTSPGLQDDLKTILRKIESELLGHGCGWKNVLYIHLYLADMDEFALANETYVSFITQEKCPFGVPSRSTIELPLLHVRLGNAYIEVLVANDQTKRVLHVQSISSWAPSCIGPYSQATLHNEILYMAGQLGLNPPTMTLCSGDATDELEQALKNCEAVSECFNSSISTTSVIFVTYCSTRIQPEERRRIEDRLHGVLEEMRHSDKDSLSKVLDTIFLYIHVPNLPKGALVEVKPILYVQENLDTVVESVQDSPKLHTPTYWGFHHEHWHKSCIQKCVVNGKICAVVLSVTNELARNICSCLLGNQITEEHFGLVSKFCIYVLNEVLLDSAFCWEDIKNLRFYFPTSLNITLEAASIVFSRAFDELAESNPTAHVDRFFNLIPVLGAGRTPTSMDDILTCELFAQKS
Homology
BLAST of Tan0006277 vs. ExPASy Swiss-Prot
Match: Q9USQ7 (Diphthine--ammonia ligase OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=mug71 PE=1 SV=1)

HSP 1 Score: 319.7 bits (818), Expect = 8.6e-86
Identity = 198/508 (38.98%), Positives = 290/508 (57.09%), Query Frame = 0

Query: 1   MKVVALVSGGKDSCFAMMKSIQYGHEIVALANLLPADDSVDELDSYMYQTVGHQIIVSYA 60
           MKV+ L+SGGKDSCF +M  +  GHE+VALANL P +D  DE+DS+MYQ+VGH +I  YA
Sbjct: 1   MKVLGLISGGKDSCFNLMHCVSLGHEVVALANLHP-EDGKDEIDSFMYQSVGHDVIPLYA 60

Query: 61  ECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLKEVKRKLPSVTAVCSGAIASDY 120
           EC  +PL+R +I G + +Q L+Y+ T  DE ED+Y L+K V    P + AV +GAI S Y
Sbjct: 61  ECFDLPLYREKIGGQSINQNLDYQFTEKDETEDLYRLIKRVLTNHPDLEAVSTGAILSTY 120

Query: 121 QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKELT 180
           QR RVE+VC RLGL SL++LW++DQ  LL++M+ +G+ AI +KVAA+GL   K LGK L 
Sbjct: 121 QRTRVENVCKRLGLKSLSFLWQKDQEKLLNDMVVSGLNAILIKVAAIGLTR-KDLGKSLA 180

Query: 181 FLDSDLHKLNKLYGINVCGEGGEYETLTLDCPLFKSARIVLDEFKTVMHSSDSIAPVGIL 240
            +   L  LNK + ++ CGEGGEYETL LDCPLFK  RIVL + + V HSS  +  + + 
Sbjct: 181 EMQDKLLTLNKKFELHPCGEGGEYETLVLDCPLFKK-RIVLTDKEVVEHSSGEVCYLKVK 240

Query: 241 HPVSFHLEYKTSSLGICDNNNSVDHEKVGLLFEIQGDCFHS-SDTLQSIADASSASHILD 300
             V    E++  SL     +  V +E+  LL E     +H+ S   + I D       L 
Sbjct: 241 ACVKDKPEWQPISL----KSELVPNEE--LLGEEYSHIYHTISKKYELIDDQEETPTSL- 300

Query: 301 DAPVDRIHISCSRMRNTFAICGWLQDSCYTSPGLQDDLKTILRKIESELLG-HGCGWKNV 360
             P+     +  + + +F + G +  +  +    Q + ++ +  + +ELLG +G   KNV
Sbjct: 301 -IPIPLRESAFQQKKGSFLVLGNVVATKGSYNTFQGEAESAINNL-NELLGTYGYSNKNV 360

Query: 361 LYIHLYLADMDEFALANETYVSFITQEKCPFGVPSRSTIELPLLHVRLGNAYIEVLVAND 420
            ++ + L+ M +FA  N  Y  +          PSRS +  PL         +  +V + 
Sbjct: 361 YFVTVILSSMSKFAEFNSVYNKYFDFT----NPPSRSCVAAPLASEY--RIVMSCIVGDV 420

Query: 421 QTKRVLHVQSISSWAPSCIGPYSQATLHNEILYMAGQLGLNPPTMTLCSGDATDELEQAL 480
             KR LHVQ  S WAP+ IGPYSQ+   N +++++GQ+GL P  M L   D   E+  AL
Sbjct: 421 TEKRALHVQGQSYWAPANIGPYSQSICANGVVFISGQIGLIPSVMELKLHDKIFEMVLAL 480

Query: 481 KNCEAVSECFNSSISTTSVIFVTYCSTR 507
           ++   V++          + +V  C +R
Sbjct: 481 QHANRVAKAMRVGSLIACLAYV--CDSR 488

BLAST of Tan0006277 vs. ExPASy Swiss-Prot
Match: A2RV01 (Diphthine--ammonia ligase OS=Danio rerio OX=7955 GN=dph6 PE=2 SV=1)

HSP 1 Score: 295.8 bits (756), Expect = 1.3e-78
Identity = 149/252 (59.13%), Positives = 187/252 (74.21%), Query Frame = 0

Query: 1   MKVVALVSGGKDSCFAMMKSIQYGHEIVALANLLPADDSV-DELDSYMYQTVGHQIIVSY 60
           M+VV L+SGGKDSCF M++ +  GH IVALANL PAD +  DELDSYMYQTVGHQ +   
Sbjct: 1   MRVVGLISGGKDSCFNMLQCVSAGHSIVALANLRPADHAASDELDSYMYQTVGHQAVDLI 60

Query: 61  AECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLKEVKRKLPSVTAVCSGAIASD 120
           AE MG+PL+RR I+GS+ H    Y  T GDEVED+Y LLK VK ++  V  V  GAI SD
Sbjct: 61  AEAMGLPLYRRTIEGSSVHIDREYSPTDGDEVEDLYQLLKHVKEEM-HVDGVSVGAILSD 120

Query: 121 YQRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKEL 180
           YQR+RVE+VC+RL L  LAYLW++DQ+ LL EMI++G+ AI +KVAA GL P KHLGK L
Sbjct: 121 YQRVRVENVCARLQLQPLAYLWRRDQAALLSEMISSGLHAILIKVAAFGLHPDKHLGKSL 180

Query: 181 TFLDSDLHKLNKLYGINVCGEGGEYETLTLDCPLFKSARIVLDEFKTVMHSSDSIAPVGI 240
             ++  LH+L++ YG+++CGEGGEYET TLDCPLFK  +I++D  +TV+HS D+ APVG 
Sbjct: 181 AEMELYLHELSEKYGVHICGEGGEYETFTLDCPLFKK-KIIIDATETVIHSDDAFAPVGF 240

Query: 241 LHPVSFHLEYKT 252
           L     H E KT
Sbjct: 241 LRFTKMHTEDKT 250

BLAST of Tan0006277 vs. ExPASy Swiss-Prot
Match: Q7L8W6 (Diphthine--ammonia ligase OS=Homo sapiens OX=9606 GN=DPH6 PE=1 SV=3)

HSP 1 Score: 283.5 bits (724), Expect = 6.8e-75
Identity = 147/256 (57.42%), Positives = 186/256 (72.66%), Query Frame = 0

Query: 1   MKVVALVSGGKDSCFAMMKSIQYGHEIVALANLLPADDSV--DELDSYMYQTVGHQIIVS 60
           M+V AL+SGGKDSC+ MM+ I  GH+IVALANL PA++ V  DELDSYMYQTVGH  I  
Sbjct: 1   MRVAALISGGKDSCYNMMQCIAAGHQIVALANLRPAENQVGSDELDSYMYQTVGHHAIDL 60

Query: 61  YAECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLKEVKRKLPSVTAVCSGAIAS 120
           YAE M +PL+RR I+G +   +  Y    GDEVED+Y LLK VK K   V  +  GAI S
Sbjct: 61  YAEAMALPLYRRTIRGRSLDTRQVYTKCEGDEVEDLYELLKLVKEK-EEVEGISVGAILS 120

Query: 121 DYQRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKE 180
           DYQR+RVE+VC RL L  LAYLW+++Q  LL EMI++ I A+ +KVAA+GLDP KHLGK 
Sbjct: 121 DYQRIRVENVCKRLNLQPLAYLWQRNQEDLLREMISSNIQAMIIKVAALGLDPDKHLGKT 180

Query: 181 LTFLDSDLHKLNKLYGINVCGEGGEYETLTLDCPLFKSARIVLDEFKTVMHSSDSIAPVG 240
           L  ++  L +L+K YG++VCGEGGEYET TLDCPLFK  +I++D  + V+HS+D+ APV 
Sbjct: 181 LDQMEPYLIELSKKYGVHVCGEGGEYETFTLDCPLFKK-KIIVDSSEVVIHSADAFAPVA 240

Query: 241 ILHPVSFHLEYKTSSL 255
            L  +  HLE K SS+
Sbjct: 241 YLRFLELHLEDKVSSV 254

BLAST of Tan0006277 vs. ExPASy Swiss-Prot
Match: Q5M9F5 (Diphthine--ammonia ligase OS=Rattus norvegicus OX=10116 GN=Dph6 PE=2 SV=1)

HSP 1 Score: 282.3 bits (721), Expect = 1.5e-74
Identity = 152/267 (56.93%), Positives = 185/267 (69.29%), Query Frame = 0

Query: 1   MKVVALVSGGKDSCFAMMKSIQYGHEIVALANLLPADDSV--DELDSYMYQTVGHQIIVS 60
           M+V AL+SGGKDSC+ MM+ I  GH+IVALANL P D+ V  DELDSYMYQTVGH  I  
Sbjct: 1   MRVAALISGGKDSCYNMMRCIAEGHQIVALANLRPDDNQVESDELDSYMYQTVGHHAIDL 60

Query: 61  YAECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLKEVKRKLPSVTAVCSGAIAS 120
           YAE M +PL+RR I+G +      Y    GDEVED+Y LLK VK K   +  V  GAI S
Sbjct: 61  YAEAMALPLYRRTIRGRSLETGRVYTRCEGDEVEDLYELLKLVKEK-EEIEGVSVGAILS 120

Query: 121 DYQRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKE 180
           DYQR+RVE+VC RL L  LAYLW+++Q  LL EMI + I AI +KVAA+GLDP KHLGK 
Sbjct: 121 DYQRVRVENVCKRLNLQPLAYLWQRNQEDLLREMIASNIEAIIIKVAALGLDPDKHLGKT 180

Query: 181 LTFLDSDLHKLNKLYGINVCGEGGEYETLTLDCPLFKSARIVLDEFKTVMHSSDSIAPVG 240
           L  ++  L +L+K YG++VCGEGGEYET TLDCPLFK  +IV+D  + V+HS+D+ APV 
Sbjct: 181 LGEMEPYLLELSKKYGVHVCGEGGEYETFTLDCPLFKK-KIVVDTSEAVIHSADAFAPVA 240

Query: 241 ILHPVSFHLEYKTSSLGICDNNNSVDH 266
            L     HLE K SS+   D   S  H
Sbjct: 241 YLRLSGLHLEEKVSSVPGDDETTSYIH 265

BLAST of Tan0006277 vs. ExPASy Swiss-Prot
Match: Q9CQ28 (Diphthine--ammonia ligase OS=Mus musculus OX=10090 GN=Dph6 PE=1 SV=1)

HSP 1 Score: 280.0 bits (715), Expect = 7.5e-74
Identity = 151/267 (56.55%), Positives = 184/267 (68.91%), Query Frame = 0

Query: 1   MKVVALVSGGKDSCFAMMKSIQYGHEIVALANLLPADDSV--DELDSYMYQTVGHQIIVS 60
           M+V AL+SGGKDSC+ MM+ I  GH+IVALANL P ++ V  DELDSYMYQTVGH  I  
Sbjct: 1   MRVAALISGGKDSCYNMMQCIAEGHQIVALANLRPDENQVESDELDSYMYQTVGHHAIDL 60

Query: 61  YAECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLKEVKRKLPSVTAVCSGAIAS 120
           YAE M +PL+RR I+G +      Y    GDEVED+Y LLK VK K   +  V  GAI S
Sbjct: 61  YAEAMALPLYRRAIRGRSLETGRVYTQCEGDEVEDLYELLKLVKEK-EEIEGVSVGAILS 120

Query: 121 DYQRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKE 180
           DYQR RVE+VC RL L  LAYLW+++Q  LL EMI + I AI +KVAA+GLDP KHLGK 
Sbjct: 121 DYQRGRVENVCKRLNLQPLAYLWQRNQEDLLREMIASNIKAIIIKVAALGLDPDKHLGKT 180

Query: 181 LTFLDSDLHKLNKLYGINVCGEGGEYETLTLDCPLFKSARIVLDEFKTVMHSSDSIAPVG 240
           L  ++  L +L+K YG++VCGEGGEYET TLDCPLFK  +IV+D  + VMHS+D+ APV 
Sbjct: 181 LVEMEPYLLELSKKYGVHVCGEGGEYETFTLDCPLFKK-KIVVDSSEAVMHSADAFAPVA 240

Query: 241 ILHPVSFHLEYKTSSLGICDNNNSVDH 266
            L     HLE K SS+   D   +  H
Sbjct: 241 YLRLSRLHLEEKVSSVPADDETANSIH 265

BLAST of Tan0006277 vs. NCBI nr
Match: XP_038889528.1 (diphthine--ammonia ligase isoform X1 [Benincasa hispida])

HSP 1 Score: 1345.5 bits (3481), Expect = 0.0e+00
Identity = 657/734 (89.51%), Positives = 694/734 (94.55%), Query Frame = 0

Query: 1   MKVVALVSGGKDSCFAMMKSIQYGHEIVALANLLPADDSVDELDSYMYQTVGHQIIVSYA 60
           MKVVALVSGGKDSCFAMMKSIQYGHEIVALANL+PADDSVDELDSYMYQTVGHQIIVSYA
Sbjct: 1   MKVVALVSGGKDSCFAMMKSIQYGHEIVALANLMPADDSVDELDSYMYQTVGHQIIVSYA 60

Query: 61  ECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLKEVKRKLPSVTAVCSGAIASDY 120
           ECMGIPLFRRRIQGSTRH+KLNYRITPGDEVEDMYILL EVKR+LPSVTAVCSGAIASDY
Sbjct: 61  ECMGIPLFRRRIQGSTRHEKLNYRITPGDEVEDMYILLNEVKRQLPSVTAVCSGAIASDY 120

Query: 121 QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKELT 180
           QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKEL+
Sbjct: 121 QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKELS 180

Query: 181 FLDSDLHKLNKLYGINVCGEGGEYETLTLDCPLFKSARIVLDEFKTVMHSSDSIAPVGIL 240
            L SDLHKLN+LYGINVCGEGGEYETLTLDCPLFK+ARIVL+EFK VMHSSDSIAPVGIL
Sbjct: 181 SLGSDLHKLNRLYGINVCGEGGEYETLTLDCPLFKTARIVLEEFKVVMHSSDSIAPVGIL 240

Query: 241 HPVSFHLEYK--TSSLGICDNNNSVDHEKVGLLFEIQGDCFHSSDTLQSIADASSASHIL 300
           HPVSFHL+YK  TSSLG+CDN N VDHEKVGLLFEIQGDCF +  TLQS+ADASS +HIL
Sbjct: 241 HPVSFHLKYKEETSSLGVCDNTNLVDHEKVGLLFEIQGDCFQNFSTLQSVADASSVNHIL 300

Query: 301 DDAPVDRIHISCSRMRNTFAICGWLQDSCYTSPGLQDDLKTILRKIESELLGHGCGWKNV 360
           +D P DR+ I CSRM+NTF+IC WLQDSC TSPGLQD+LKT+LRK+ESELL  GCGWKNV
Sbjct: 301 NDVPDDRLQILCSRMQNTFSICCWLQDSCDTSPGLQDNLKTVLRKVESELLACGCGWKNV 360

Query: 361 LYIHLYLADMDEFALANETYVSFITQEKCPFGVPSRSTIELPLLHVRLGNAYIEVLVAND 420
           LYIHLYLADM+EFALANETYVSFITQEKCPFGVPSRSTIELPL  VRLGNAYIEVLVAND
Sbjct: 361 LYIHLYLADMNEFALANETYVSFITQEKCPFGVPSRSTIELPLQQVRLGNAYIEVLVAND 420

Query: 421 QTKRVLHVQSISSWAPSCIGPYSQATLHNEILYMAGQLGLNPPTMTLCSGDATDELEQAL 480
           QTKRVLHVQSISSWAPSCIGPYSQATLH EILYMAGQLGL+PPTMTLCSG AT+ELEQAL
Sbjct: 421 QTKRVLHVQSISSWAPSCIGPYSQATLHKEILYMAGQLGLHPPTMTLCSGGATNELEQAL 480

Query: 481 KNCEAVSECFNSSISTTSVIFVTYCSTRIQPEERRRIEDRLHGVLEEMRHSDKDSLSKVL 540
           +NCEAVSECF +S+ST+SVIFVTYCSTRIQPEER+RIED+LHGVLEEMRHSDKDSLSK+L
Sbjct: 481 RNCEAVSECFRASVSTSSVIFVTYCSTRIQPEERKRIEDKLHGVLEEMRHSDKDSLSKLL 540

Query: 541 DTIFLYIHVPNLPKGALVEVKPILYVQENLDTVVESVQDSPKLHTPTYWGFHHEHWHKSC 600
           DTIFLYIHVPNLPKGALVEVKPILYVQENLDT   ++ DSPKL  PT WGF +EHWHKSC
Sbjct: 541 DTIFLYIHVPNLPKGALVEVKPILYVQENLDTAEGTLHDSPKLRIPTSWGFQYEHWHKSC 600

Query: 601 IQKCVVNGKICAVVLSVTNELARNICSCLLGNQITEEHFGLVSKFCIYVLNEVLLDSAFC 660
           IQKC+VNGKICA VL +TNELARNICSCLLGNQI EEH  LVSKFCIY+LNEVLLDSAFC
Sbjct: 601 IQKCIVNGKICAAVLYMTNELARNICSCLLGNQIMEEHLELVSKFCIYLLNEVLLDSAFC 660

Query: 661 WEDIKNLRFYFPTSLNITLEAASIVFSRAFDELAESNPTAHVDRFFNLIPVLGAGRTPTS 720
           WEDIKNLRFYFPTSLNITLEAASI+ SRAFDELAESNPT HVDRFFNLIPVLGAGRTPTS
Sbjct: 661 WEDIKNLRFYFPTSLNITLEAASIILSRAFDELAESNPTIHVDRFFNLIPVLGAGRTPTS 720

Query: 721 MDDILTCELFAQKS 733
           M+DILTCELFAQKS
Sbjct: 721 MNDILTCELFAQKS 734

BLAST of Tan0006277 vs. NCBI nr
Match: XP_022149926.1 (diphthine--ammonia ligase isoform X1 [Momordica charantia])

HSP 1 Score: 1328.9 bits (3438), Expect = 0.0e+00
Identity = 652/734 (88.83%), Positives = 687/734 (93.60%), Query Frame = 0

Query: 1   MKVVALVSGGKDSCFAMMKSIQYGHEIVALANLLPADDSVDELDSYMYQTVGHQIIVSYA 60
           MKVVALVSGGKDSCFAMMKSIQYGHEIVALANL+PADDSVDELDSYMYQTVGHQIIVSYA
Sbjct: 1   MKVVALVSGGKDSCFAMMKSIQYGHEIVALANLMPADDSVDELDSYMYQTVGHQIIVSYA 60

Query: 61  ECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLKEVKRKLPSVTAVCSGAIASDY 120
           ECMG+PLFRRRIQGSTRHQKLNYRITPGDEVEDMYILL EVKR+LP VTA+CSGAIASDY
Sbjct: 61  ECMGVPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLNEVKRQLPCVTAICSGAIASDY 120

Query: 121 QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKELT 180
           QRLRVESVC+RLGLVSLAYLWK+DQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKEL 
Sbjct: 121 QRLRVESVCARLGLVSLAYLWKRDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKELA 180

Query: 181 FLDSDLHKLNKLYGINVCGEGGEYETLTLDCPLFKSARIVLDEFKTVMHSSDSIAPVGIL 240
            LDSDLHKLNKLYGINVCGEGGEYETLTLDCPLFK+ARIVLDE+K +MHSSDSIAPVGIL
Sbjct: 181 SLDSDLHKLNKLYGINVCGEGGEYETLTLDCPLFKNARIVLDEYKVMMHSSDSIAPVGIL 240

Query: 241 HPVSFHLEY--KTSSLGICDNNNSVDHEKVGLLFEIQGDCFHSSDTLQSIADASSASHIL 300
           HPVSFHLEY  KTSS+GICDN+ SVD EK+ LLFEIQGDCF+S DTLQSIADA+ AS+IL
Sbjct: 241 HPVSFHLEYKAKTSSVGICDNSKSVDDEKMDLLFEIQGDCFNSCDTLQSIADATGASNIL 300

Query: 301 DDAPVDRIHISCSRMRNTFAICGWLQDSCYTSPGLQDDLKTILRKIESELLGHGCGWKNV 360
           DD P DR+ ISCSRM NTF+IC WLQDSC TS GLQDDLKT+LRKIESELLG G GWKNV
Sbjct: 301 DDVPDDRLQISCSRMHNTFSICCWLQDSCGTSQGLQDDLKTVLRKIESELLGRGFGWKNV 360

Query: 361 LYIHLYLADMDEFALANETYVSFITQEKCPFGVPSRSTIELPLLHVRLGNAYIEVLVAND 420
           LYIHLYLADMD FALANE YVSFIT EKCPFGVPSRSTIELPLL V+LG+AYIEVLVAND
Sbjct: 361 LYIHLYLADMDAFALANEAYVSFITLEKCPFGVPSRSTIELPLLQVKLGHAYIEVLVAND 420

Query: 421 QTKRVLHVQSISSWAPSCIGPYSQATLHNEILYMAGQLGLNPPTMTLCSGDATDELEQAL 480
           QTKRVLHVQSISSWAPSCIGPYSQATLH EILYMAGQLGLNPPTMTLCSG AT+ELEQAL
Sbjct: 421 QTKRVLHVQSISSWAPSCIGPYSQATLHKEILYMAGQLGLNPPTMTLCSGGATNELEQAL 480

Query: 481 KNCEAVSECFNSSISTTSVIFVTYCSTRIQPEERRRIEDRLHGVLEEMRHSDKDSLSKVL 540
           +NCEAV+ECFNSSI T+SVIFVTYCST IQPEERR+I+D+LHG LEEMRHSDKDSLSK L
Sbjct: 481 ENCEAVAECFNSSICTSSVIFVTYCSTHIQPEERRKIQDKLHGALEEMRHSDKDSLSKAL 540

Query: 541 DTIFLYIHVPNLPKGALVEVKPILYVQENLDTVVESVQDSPKLHTPTYWGFHHEHWHKSC 600
           DTIFLYI+VPNLPKGALVEVKPILYVQEN+DTV E V D PKLHTP YWGF HEHWH SC
Sbjct: 541 DTIFLYINVPNLPKGALVEVKPILYVQENVDTVTEIVHDLPKLHTPRYWGFQHEHWHNSC 600

Query: 601 IQKCVVNGKICAVVLSVTNELARNICSCLLGNQITEEHFGLVSKFCIYVLNEVLLDSAFC 660
           IQKCVVNGKICAVVLSVTNELARNICSC LGN ITEEH  LVSKFCIY+LNEVLLDSAF 
Sbjct: 601 IQKCVVNGKICAVVLSVTNELARNICSCSLGNLITEEHLELVSKFCIYLLNEVLLDSAFF 660

Query: 661 WEDIKNLRFYFPTSLNITLEAASIVFSRAFDELAESNPTAHVDRFFNLIPVLGAGRTPTS 720
           WEDIKNLRFYFPT+LNITLE AS++FSRAF+ELAESNPT  V RFFNLIPVLGAGRTPTS
Sbjct: 661 WEDIKNLRFYFPTNLNITLEVASLIFSRAFNELAESNPTVDVGRFFNLIPVLGAGRTPTS 720

Query: 721 MDDILTCELFAQKS 733
           MD+ILTCELFAQKS
Sbjct: 721 MDNILTCELFAQKS 734

BLAST of Tan0006277 vs. NCBI nr
Match: XP_004152819.2 (diphthine--ammonia ligase isoform X1 [Cucumis sativus])

HSP 1 Score: 1320.8 bits (3417), Expect = 0.0e+00
Identity = 645/735 (87.76%), Positives = 688/735 (93.61%), Query Frame = 0

Query: 1   MKVVALVSGGKDSCFAMMKSIQYGHEIVALANLLPADDSVDELDSYMYQTVGHQIIVSYA 60
           MKVVALVSGGKDSCFAMMKSIQYGHEIVALANL+PADDSVDELDSYMYQTVGHQIIVSYA
Sbjct: 1   MKVVALVSGGKDSCFAMMKSIQYGHEIVALANLMPADDSVDELDSYMYQTVGHQIIVSYA 60

Query: 61  ECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLKEVKRKLPSVTAVCSGAIASDY 120
           ECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILL EVK++LPSV AV SGAIASDY
Sbjct: 61  ECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLNEVKKQLPSVMAVSSGAIASDY 120

Query: 121 QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKELT 180
           QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKELT
Sbjct: 121 QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKELT 180

Query: 181 FLDSDLHKLNKLYGINVCGEGGEYETLTLDCPLFKSARIVLDEFKTVMHSSDSIAPVGIL 240
            LDSDLHKLN+LYGINVCGEGGEYETLTLDCPLFK+ARIVLD+F+ VMHSSDSIAPVGIL
Sbjct: 181 SLDSDLHKLNRLYGINVCGEGGEYETLTLDCPLFKNARIVLDKFEVVMHSSDSIAPVGIL 240

Query: 241 HPVSFHLEY--KTSSLG-ICDNNNSVDHEKVGLLFEIQGDCFHSSDTLQSIADASSASHI 300
           HPVSFHL+Y  KTSSLG ICDN N VDHEK GLLFEIQGDCF + D LQS+AD SS +HI
Sbjct: 241 HPVSFHLKYKAKTSSLGSICDNTNLVDHEKGGLLFEIQGDCFQNCDILQSVADVSSDNHI 300

Query: 301 LDDAPVDRIHISCSRMRNTFAICGWLQDSCYTSPGLQDDLKTILRKIESELLGHGCGWKN 360
           LD+ P DR+ ISCSRM+NTF IC WLQ+SC TSPGLQDDLKT+LRKIESELL  GCGWKN
Sbjct: 301 LDEVPDDRLQISCSRMQNTFLICCWLQNSCGTSPGLQDDLKTVLRKIESELLARGCGWKN 360

Query: 361 VLYIHLYLADMDEFALANETYVSFITQEKCPFGVPSRSTIELPLLHVRLGNAYIEVLVAN 420
           VLYIHLYLADM+ F LANETYVSFITQEKCPFGVPSRST+ELPL  V+LGNAYIEVLVAN
Sbjct: 361 VLYIHLYLADMNGFGLANETYVSFITQEKCPFGVPSRSTVELPLQQVQLGNAYIEVLVAN 420

Query: 421 DQTKRVLHVQSISSWAPSCIGPYSQATLHNEILYMAGQLGLNPPTMTLCSGDATDELEQA 480
           DQTKRVLHVQSISSWAPSCIGPYSQATLH EILYMAGQLGL+PPTMTLCSG AT ELEQA
Sbjct: 421 DQTKRVLHVQSISSWAPSCIGPYSQATLHKEILYMAGQLGLDPPTMTLCSGGATQELEQA 480

Query: 481 LKNCEAVSECFNSSISTTSVIFVTYCSTRIQPEERRRIEDRLHGVLEEMRHSDKDSLSKV 540
           LKNCEAV+ECF SS+ST+SVIFVTYCSTRIQPEERRRIE++ HGVLEEMRHSDK SLSK+
Sbjct: 481 LKNCEAVAECFRSSVSTSSVIFVTYCSTRIQPEERRRIEEKFHGVLEEMRHSDKASLSKL 540

Query: 541 LDTIFLYIHVPNLPKGALVEVKPILYVQENLDTVVESVQDSPKLHTPTYWGFHHEHWHKS 600
           LD+IFLY++VPNLPKGALVEVKPILYVQE LDTV ++  DSP+L+ PT WGF HEHWHKS
Sbjct: 541 LDSIFLYVNVPNLPKGALVEVKPILYVQETLDTVEQTPHDSPRLYIPTDWGFQHEHWHKS 600

Query: 601 CIQKCVVNGKICAVVLSVTNELARNICSCLLGNQITEEHFGLVSKFCIYVLNEVLLDSAF 660
           CIQKC+VNGK+C  VLS+TNELARNI SCLLGNQITEE+  LVSKFCIY+LNE+LLDSAF
Sbjct: 601 CIQKCIVNGKVCVTVLSITNELARNISSCLLGNQITEENLELVSKFCIYLLNEILLDSAF 660

Query: 661 CWEDIKNLRFYFPTSLNITLEAASIVFSRAFDELAESNPTAHVDRFFNLIPVLGAGRTPT 720
           CWEDIKNLRFYFPTSLNITLEAASI+FSRAF+ELAESNPT HVDRFFNLIPV+GAGRTPT
Sbjct: 661 CWEDIKNLRFYFPTSLNITLEAASIIFSRAFNELAESNPTVHVDRFFNLIPVIGAGRTPT 720

Query: 721 SMDDILTCELFAQKS 733
           SMDD+LTCELFAQKS
Sbjct: 721 SMDDVLTCELFAQKS 735

BLAST of Tan0006277 vs. NCBI nr
Match: XP_022965506.1 (diphthine--ammonia ligase isoform X1 [Cucurbita maxima])

HSP 1 Score: 1318.5 bits (3411), Expect = 0.0e+00
Identity = 651/732 (88.93%), Positives = 683/732 (93.31%), Query Frame = 0

Query: 1   MKVVALVSGGKDSCFAMMKSIQYGHEIVALANLLPADDSVDELDSYMYQTVGHQIIVSYA 60
           MKVVALVSGGKDSCFAMMKSIQYGHEIVALANL+PADDSVDELDSYMYQTVGHQIIVSYA
Sbjct: 1   MKVVALVSGGKDSCFAMMKSIQYGHEIVALANLMPADDSVDELDSYMYQTVGHQIIVSYA 60

Query: 61  ECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLKEVKRKLPSVTAVCSGAIASDY 120
           ECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLKEVKR+LPSV+AVCSGAIASDY
Sbjct: 61  ECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLKEVKRQLPSVSAVCSGAIASDY 120

Query: 121 QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKELT 180
           QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDP+KHLGKEL+
Sbjct: 121 QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPMKHLGKELS 180

Query: 181 FLDSDLHKLNKLYGINVCGEGGEYETLTLDCPLFKSARIVLDEFKTVMHSSDSIAPVGIL 240
            LDS LHKLNKLYGINVCGEGGEYETLTLDCPLFK+ARIVLDE K VMHSSDSIAPVG L
Sbjct: 181 SLDSVLHKLNKLYGINVCGEGGEYETLTLDCPLFKNARIVLDESKVVMHSSDSIAPVGFL 240

Query: 241 HPVSFHLEYKTSSLGICDNNNSVDHEKVGLLFEIQGDCFHSSDTLQSIADASSASHILDD 300
           HP+SFHLEYK  +  ICD NNSVDHEKVGLLFEI+GDCFHSSDTLQS+ADASSA+H+LDD
Sbjct: 241 HPISFHLEYKAKNSSICD-NNSVDHEKVGLLFEIEGDCFHSSDTLQSVADASSANHLLDD 300

Query: 301 APVDRIHISCSRMRNTFAICGWLQDSCYTSPGLQDDLKTILRKIESELLGHGCGWKNVLY 360
            P DR+ ISCSRM+ TFAI  WLQDS  TS GLQDDLKT+LRKIESELLG G GWKNVLY
Sbjct: 301 VPDDRLQISCSRMQYTFAISCWLQDSRDTSSGLQDDLKTVLRKIESELLGRGYGWKNVLY 360

Query: 361 IHLYLADMDEFALANETYVSFITQEKCPFGVPSRSTIELPLLHVRLGNAYIEVLVANDQT 420
           IHLYLADMD+FALANETYVSFITQEKCPFGVPSRSTIELPLL VR GNAYIEVLVANDQ+
Sbjct: 361 IHLYLADMDDFALANETYVSFITQEKCPFGVPSRSTIELPLLQVRSGNAYIEVLVANDQS 420

Query: 421 KRVLHVQSISSWAPSCIGPYSQATLHNEILYMAGQLGLNPPTMTLCSGDATDELEQALKN 480
           KRVLHVQSISSWAPSCIGPYSQATLH EILYMAGQLGLNPPTMTLCSG AT ELEQALKN
Sbjct: 421 KRVLHVQSISSWAPSCIGPYSQATLHKEILYMAGQLGLNPPTMTLCSGGATHELEQALKN 480

Query: 481 CEAVSECFNSSISTTSVIFVTYCSTRIQPEERRRIEDRLHGVLEEMRHSDKDSLSKVLDT 540
           CEAVSE FNSSIST+SV+F+TYCSTRIQ EER++IED+LHGVLEEMRHS KDS SKVLDT
Sbjct: 481 CEAVSESFNSSISTSSVLFITYCSTRIQLEERKKIEDKLHGVLEEMRHSKKDSSSKVLDT 540

Query: 541 IFLYIHVPNLPKGALVEVKPILYVQENLDTVVESVQDSPKLHTPTYWGFHHEHWHKSCIQ 600
           I+LYIHVPNLPKGALVEVKP+LYVQEN DT  E + DS KLHTPTYWGF HE+WHKSCIQ
Sbjct: 541 IYLYIHVPNLPKGALVEVKPVLYVQENFDTEAEHLHDSSKLHTPTYWGFQHENWHKSCIQ 600

Query: 601 KCVVNGKICAVVLSVTNELARNICSCLLGNQITEEHFGLVSKFCIYVLNEVLLDSAFCWE 660
           KCVVNG ICAVVLSVTNE ARNICS L GNQI+EEH  LVSKFCI +LNE LLDSA CWE
Sbjct: 601 KCVVNGNICAVVLSVTNEPARNICSSLRGNQISEEHLELVSKFCICLLNEALLDSACCWE 660

Query: 661 DIKNLRFYFPTSLNITLEAASIVFSRAFDELAESNPTAHVDRFFNLIPVLGAGRTPTSMD 720
           DIK+LRFY PTSLNITLEAASI+FSRAF+ELAESNPT HVDRFFNLIPVLGAGR PTSMD
Sbjct: 661 DIKSLRFYLPTSLNITLEAASIIFSRAFNELAESNPTVHVDRFFNLIPVLGAGRAPTSMD 720

Query: 721 DILTCELFAQKS 733
           DILTCELFAQKS
Sbjct: 721 DILTCELFAQKS 731

BLAST of Tan0006277 vs. NCBI nr
Match: XP_023538215.1 (diphthine--ammonia ligase isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1317.4 bits (3408), Expect = 0.0e+00
Identity = 650/732 (88.80%), Positives = 683/732 (93.31%), Query Frame = 0

Query: 1   MKVVALVSGGKDSCFAMMKSIQYGHEIVALANLLPADDSVDELDSYMYQTVGHQIIVSYA 60
           MKVVALVSGGKDSCFAMMKSIQ+GHEIVALANL+PADDSVDELDSYMYQTVGHQIIVSYA
Sbjct: 1   MKVVALVSGGKDSCFAMMKSIQFGHEIVALANLMPADDSVDELDSYMYQTVGHQIIVSYA 60

Query: 61  ECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLKEVKRKLPSVTAVCSGAIASDY 120
           ECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLKEVKR+LPSV+AVCSGAIASDY
Sbjct: 61  ECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLKEVKRQLPSVSAVCSGAIASDY 120

Query: 121 QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKELT 180
           QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDP+ HLGKEL+
Sbjct: 121 QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPMTHLGKELS 180

Query: 181 FLDSDLHKLNKLYGINVCGEGGEYETLTLDCPLFKSARIVLDEFKTVMHSSDSIAPVGIL 240
            LDS LHKLNKLYGINVCGEGGEYETLTLDCPLFK+ARIVLDE K VMHSSDSIAPVG L
Sbjct: 181 SLDSVLHKLNKLYGINVCGEGGEYETLTLDCPLFKNARIVLDESKVVMHSSDSIAPVGFL 240

Query: 241 HPVSFHLEYKTSSLGICDNNNSVDHEKVGLLFEIQGDCFHSSDTLQSIADASSASHILDD 300
           HP+SFHLEYK  +  ICD NNSVDHE+VGLLFEI+GDCFHSSDTLQS+ADASSASH+LDD
Sbjct: 241 HPISFHLEYKAKTSSICD-NNSVDHERVGLLFEIEGDCFHSSDTLQSVADASSASHLLDD 300

Query: 301 APVDRIHISCSRMRNTFAICGWLQDSCYTSPGLQDDLKTILRKIESELLGHGCGWKNVLY 360
            P DR+ ISCSRM+NTFAI  WLQDS  TS GLQDDLKT+LRKIESELLG G GWKNVLY
Sbjct: 301 VPDDRLQISCSRMQNTFAISCWLQDSRDTSSGLQDDLKTVLRKIESELLGRGYGWKNVLY 360

Query: 361 IHLYLADMDEFALANETYVSFITQEKCPFGVPSRSTIELPLLHVRLGNAYIEVLVANDQT 420
           IHLYLADMD+FALANETYVSFITQEKCPFGVPSRSTIELPLL VR GNAYIEVLVANDQ+
Sbjct: 361 IHLYLADMDDFALANETYVSFITQEKCPFGVPSRSTIELPLLQVRSGNAYIEVLVANDQS 420

Query: 421 KRVLHVQSISSWAPSCIGPYSQATLHNEILYMAGQLGLNPPTMTLCSGDATDELEQALKN 480
           KRVLHVQSISSWAPSCIGPYSQATLH EILYMAGQLGLNPPTMTLCSG AT ELEQALKN
Sbjct: 421 KRVLHVQSISSWAPSCIGPYSQATLHKEILYMAGQLGLNPPTMTLCSGGATHELEQALKN 480

Query: 481 CEAVSECFNSSISTTSVIFVTYCSTRIQPEERRRIEDRLHGVLEEMRHSDKDSLSKVLDT 540
           CEAVSE FNSSIST+SV+ VTYCSTRIQ EER++IED+LHG+LEEMRHS KDS SKVLDT
Sbjct: 481 CEAVSESFNSSISTSSVLLVTYCSTRIQLEERKKIEDKLHGMLEEMRHSKKDSSSKVLDT 540

Query: 541 IFLYIHVPNLPKGALVEVKPILYVQENLDTVVESVQDSPKLHTPTYWGFHHEHWHKSCIQ 600
           I+LYIHVPNLPKGALVEVKP+LYVQEN DT  E++ DS KLHTPTYWGF HE WHKSCIQ
Sbjct: 541 IYLYIHVPNLPKGALVEVKPVLYVQENFDTEAENLHDSSKLHTPTYWGFQHEDWHKSCIQ 600

Query: 601 KCVVNGKICAVVLSVTNELARNICSCLLGNQITEEHFGLVSKFCIYVLNEVLLDSAFCWE 660
           KCVVNG ICAVVLSVTNELARNICS L GNQI+EEH  LVSKFCI +LNE LLDSA CWE
Sbjct: 601 KCVVNGNICAVVLSVTNELARNICSSLRGNQISEEHLELVSKFCICLLNEALLDSACCWE 660

Query: 661 DIKNLRFYFPTSLNITLEAASIVFSRAFDELAESNPTAHVDRFFNLIPVLGAGRTPTSMD 720
           DIK+LRFY PTSLNITLEAASI+FSRAF+ELAESNPT HVDRFFNLIPVLGAGR PTSMD
Sbjct: 661 DIKSLRFYLPTSLNITLEAASIIFSRAFNELAESNPTVHVDRFFNLIPVLGAGRAPTSMD 720

Query: 721 DILTCELFAQKS 733
           DILTCELFAQKS
Sbjct: 721 DILTCELFAQKS 731

BLAST of Tan0006277 vs. ExPASy TrEMBL
Match: A0A6J1D9W6 (Diphthamide synthase OS=Momordica charantia OX=3673 GN=LOC111018223 PE=4 SV=1)

HSP 1 Score: 1328.9 bits (3438), Expect = 0.0e+00
Identity = 652/734 (88.83%), Positives = 687/734 (93.60%), Query Frame = 0

Query: 1   MKVVALVSGGKDSCFAMMKSIQYGHEIVALANLLPADDSVDELDSYMYQTVGHQIIVSYA 60
           MKVVALVSGGKDSCFAMMKSIQYGHEIVALANL+PADDSVDELDSYMYQTVGHQIIVSYA
Sbjct: 1   MKVVALVSGGKDSCFAMMKSIQYGHEIVALANLMPADDSVDELDSYMYQTVGHQIIVSYA 60

Query: 61  ECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLKEVKRKLPSVTAVCSGAIASDY 120
           ECMG+PLFRRRIQGSTRHQKLNYRITPGDEVEDMYILL EVKR+LP VTA+CSGAIASDY
Sbjct: 61  ECMGVPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLNEVKRQLPCVTAICSGAIASDY 120

Query: 121 QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKELT 180
           QRLRVESVC+RLGLVSLAYLWK+DQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKEL 
Sbjct: 121 QRLRVESVCARLGLVSLAYLWKRDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKELA 180

Query: 181 FLDSDLHKLNKLYGINVCGEGGEYETLTLDCPLFKSARIVLDEFKTVMHSSDSIAPVGIL 240
            LDSDLHKLNKLYGINVCGEGGEYETLTLDCPLFK+ARIVLDE+K +MHSSDSIAPVGIL
Sbjct: 181 SLDSDLHKLNKLYGINVCGEGGEYETLTLDCPLFKNARIVLDEYKVMMHSSDSIAPVGIL 240

Query: 241 HPVSFHLEY--KTSSLGICDNNNSVDHEKVGLLFEIQGDCFHSSDTLQSIADASSASHIL 300
           HPVSFHLEY  KTSS+GICDN+ SVD EK+ LLFEIQGDCF+S DTLQSIADA+ AS+IL
Sbjct: 241 HPVSFHLEYKAKTSSVGICDNSKSVDDEKMDLLFEIQGDCFNSCDTLQSIADATGASNIL 300

Query: 301 DDAPVDRIHISCSRMRNTFAICGWLQDSCYTSPGLQDDLKTILRKIESELLGHGCGWKNV 360
           DD P DR+ ISCSRM NTF+IC WLQDSC TS GLQDDLKT+LRKIESELLG G GWKNV
Sbjct: 301 DDVPDDRLQISCSRMHNTFSICCWLQDSCGTSQGLQDDLKTVLRKIESELLGRGFGWKNV 360

Query: 361 LYIHLYLADMDEFALANETYVSFITQEKCPFGVPSRSTIELPLLHVRLGNAYIEVLVAND 420
           LYIHLYLADMD FALANE YVSFIT EKCPFGVPSRSTIELPLL V+LG+AYIEVLVAND
Sbjct: 361 LYIHLYLADMDAFALANEAYVSFITLEKCPFGVPSRSTIELPLLQVKLGHAYIEVLVAND 420

Query: 421 QTKRVLHVQSISSWAPSCIGPYSQATLHNEILYMAGQLGLNPPTMTLCSGDATDELEQAL 480
           QTKRVLHVQSISSWAPSCIGPYSQATLH EILYMAGQLGLNPPTMTLCSG AT+ELEQAL
Sbjct: 421 QTKRVLHVQSISSWAPSCIGPYSQATLHKEILYMAGQLGLNPPTMTLCSGGATNELEQAL 480

Query: 481 KNCEAVSECFNSSISTTSVIFVTYCSTRIQPEERRRIEDRLHGVLEEMRHSDKDSLSKVL 540
           +NCEAV+ECFNSSI T+SVIFVTYCST IQPEERR+I+D+LHG LEEMRHSDKDSLSK L
Sbjct: 481 ENCEAVAECFNSSICTSSVIFVTYCSTHIQPEERRKIQDKLHGALEEMRHSDKDSLSKAL 540

Query: 541 DTIFLYIHVPNLPKGALVEVKPILYVQENLDTVVESVQDSPKLHTPTYWGFHHEHWHKSC 600
           DTIFLYI+VPNLPKGALVEVKPILYVQEN+DTV E V D PKLHTP YWGF HEHWH SC
Sbjct: 541 DTIFLYINVPNLPKGALVEVKPILYVQENVDTVTEIVHDLPKLHTPRYWGFQHEHWHNSC 600

Query: 601 IQKCVVNGKICAVVLSVTNELARNICSCLLGNQITEEHFGLVSKFCIYVLNEVLLDSAFC 660
           IQKCVVNGKICAVVLSVTNELARNICSC LGN ITEEH  LVSKFCIY+LNEVLLDSAF 
Sbjct: 601 IQKCVVNGKICAVVLSVTNELARNICSCSLGNLITEEHLELVSKFCIYLLNEVLLDSAFF 660

Query: 661 WEDIKNLRFYFPTSLNITLEAASIVFSRAFDELAESNPTAHVDRFFNLIPVLGAGRTPTS 720
           WEDIKNLRFYFPT+LNITLE AS++FSRAF+ELAESNPT  V RFFNLIPVLGAGRTPTS
Sbjct: 661 WEDIKNLRFYFPTNLNITLEVASLIFSRAFNELAESNPTVDVGRFFNLIPVLGAGRTPTS 720

Query: 721 MDDILTCELFAQKS 733
           MD+ILTCELFAQKS
Sbjct: 721 MDNILTCELFAQKS 734

BLAST of Tan0006277 vs. ExPASy TrEMBL
Match: A0A0A0LKJ7 (Diphthamide synthase OS=Cucumis sativus OX=3659 GN=Csa_2G075360 PE=4 SV=1)

HSP 1 Score: 1320.8 bits (3417), Expect = 0.0e+00
Identity = 645/735 (87.76%), Positives = 688/735 (93.61%), Query Frame = 0

Query: 1   MKVVALVSGGKDSCFAMMKSIQYGHEIVALANLLPADDSVDELDSYMYQTVGHQIIVSYA 60
           MKVVALVSGGKDSCFAMMKSIQYGHEIVALANL+PADDSVDELDSYMYQTVGHQIIVSYA
Sbjct: 1   MKVVALVSGGKDSCFAMMKSIQYGHEIVALANLMPADDSVDELDSYMYQTVGHQIIVSYA 60

Query: 61  ECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLKEVKRKLPSVTAVCSGAIASDY 120
           ECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILL EVK++LPSV AV SGAIASDY
Sbjct: 61  ECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLNEVKKQLPSVMAVSSGAIASDY 120

Query: 121 QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKELT 180
           QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKELT
Sbjct: 121 QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKELT 180

Query: 181 FLDSDLHKLNKLYGINVCGEGGEYETLTLDCPLFKSARIVLDEFKTVMHSSDSIAPVGIL 240
            LDSDLHKLN+LYGINVCGEGGEYETLTLDCPLFK+ARIVLD+F+ VMHSSDSIAPVGIL
Sbjct: 181 SLDSDLHKLNRLYGINVCGEGGEYETLTLDCPLFKNARIVLDKFEVVMHSSDSIAPVGIL 240

Query: 241 HPVSFHLEY--KTSSLG-ICDNNNSVDHEKVGLLFEIQGDCFHSSDTLQSIADASSASHI 300
           HPVSFHL+Y  KTSSLG ICDN N VDHEK GLLFEIQGDCF + D LQS+AD SS +HI
Sbjct: 241 HPVSFHLKYKAKTSSLGSICDNTNLVDHEKGGLLFEIQGDCFQNCDILQSVADVSSDNHI 300

Query: 301 LDDAPVDRIHISCSRMRNTFAICGWLQDSCYTSPGLQDDLKTILRKIESELLGHGCGWKN 360
           LD+ P DR+ ISCSRM+NTF IC WLQ+SC TSPGLQDDLKT+LRKIESELL  GCGWKN
Sbjct: 301 LDEVPDDRLQISCSRMQNTFLICCWLQNSCGTSPGLQDDLKTVLRKIESELLARGCGWKN 360

Query: 361 VLYIHLYLADMDEFALANETYVSFITQEKCPFGVPSRSTIELPLLHVRLGNAYIEVLVAN 420
           VLYIHLYLADM+ F LANETYVSFITQEKCPFGVPSRST+ELPL  V+LGNAYIEVLVAN
Sbjct: 361 VLYIHLYLADMNGFGLANETYVSFITQEKCPFGVPSRSTVELPLQQVQLGNAYIEVLVAN 420

Query: 421 DQTKRVLHVQSISSWAPSCIGPYSQATLHNEILYMAGQLGLNPPTMTLCSGDATDELEQA 480
           DQTKRVLHVQSISSWAPSCIGPYSQATLH EILYMAGQLGL+PPTMTLCSG AT ELEQA
Sbjct: 421 DQTKRVLHVQSISSWAPSCIGPYSQATLHKEILYMAGQLGLDPPTMTLCSGGATQELEQA 480

Query: 481 LKNCEAVSECFNSSISTTSVIFVTYCSTRIQPEERRRIEDRLHGVLEEMRHSDKDSLSKV 540
           LKNCEAV+ECF SS+ST+SVIFVTYCSTRIQPEERRRIE++ HGVLEEMRHSDK SLSK+
Sbjct: 481 LKNCEAVAECFRSSVSTSSVIFVTYCSTRIQPEERRRIEEKFHGVLEEMRHSDKASLSKL 540

Query: 541 LDTIFLYIHVPNLPKGALVEVKPILYVQENLDTVVESVQDSPKLHTPTYWGFHHEHWHKS 600
           LD+IFLY++VPNLPKGALVEVKPILYVQE LDTV ++  DSP+L+ PT WGF HEHWHKS
Sbjct: 541 LDSIFLYVNVPNLPKGALVEVKPILYVQETLDTVEQTPHDSPRLYIPTDWGFQHEHWHKS 600

Query: 601 CIQKCVVNGKICAVVLSVTNELARNICSCLLGNQITEEHFGLVSKFCIYVLNEVLLDSAF 660
           CIQKC+VNGK+C  VLS+TNELARNI SCLLGNQITEE+  LVSKFCIY+LNE+LLDSAF
Sbjct: 601 CIQKCIVNGKVCVTVLSITNELARNISSCLLGNQITEENLELVSKFCIYLLNEILLDSAF 660

Query: 661 CWEDIKNLRFYFPTSLNITLEAASIVFSRAFDELAESNPTAHVDRFFNLIPVLGAGRTPT 720
           CWEDIKNLRFYFPTSLNITLEAASI+FSRAF+ELAESNPT HVDRFFNLIPV+GAGRTPT
Sbjct: 661 CWEDIKNLRFYFPTSLNITLEAASIIFSRAFNELAESNPTVHVDRFFNLIPVIGAGRTPT 720

Query: 721 SMDDILTCELFAQKS 733
           SMDD+LTCELFAQKS
Sbjct: 721 SMDDVLTCELFAQKS 735

BLAST of Tan0006277 vs. ExPASy TrEMBL
Match: A0A6J1HLV6 (Diphthamide synthase OS=Cucurbita maxima OX=3661 GN=LOC111465391 PE=4 SV=1)

HSP 1 Score: 1318.5 bits (3411), Expect = 0.0e+00
Identity = 651/732 (88.93%), Positives = 683/732 (93.31%), Query Frame = 0

Query: 1   MKVVALVSGGKDSCFAMMKSIQYGHEIVALANLLPADDSVDELDSYMYQTVGHQIIVSYA 60
           MKVVALVSGGKDSCFAMMKSIQYGHEIVALANL+PADDSVDELDSYMYQTVGHQIIVSYA
Sbjct: 1   MKVVALVSGGKDSCFAMMKSIQYGHEIVALANLMPADDSVDELDSYMYQTVGHQIIVSYA 60

Query: 61  ECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLKEVKRKLPSVTAVCSGAIASDY 120
           ECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLKEVKR+LPSV+AVCSGAIASDY
Sbjct: 61  ECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLKEVKRQLPSVSAVCSGAIASDY 120

Query: 121 QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKELT 180
           QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDP+KHLGKEL+
Sbjct: 121 QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPMKHLGKELS 180

Query: 181 FLDSDLHKLNKLYGINVCGEGGEYETLTLDCPLFKSARIVLDEFKTVMHSSDSIAPVGIL 240
            LDS LHKLNKLYGINVCGEGGEYETLTLDCPLFK+ARIVLDE K VMHSSDSIAPVG L
Sbjct: 181 SLDSVLHKLNKLYGINVCGEGGEYETLTLDCPLFKNARIVLDESKVVMHSSDSIAPVGFL 240

Query: 241 HPVSFHLEYKTSSLGICDNNNSVDHEKVGLLFEIQGDCFHSSDTLQSIADASSASHILDD 300
           HP+SFHLEYK  +  ICD NNSVDHEKVGLLFEI+GDCFHSSDTLQS+ADASSA+H+LDD
Sbjct: 241 HPISFHLEYKAKNSSICD-NNSVDHEKVGLLFEIEGDCFHSSDTLQSVADASSANHLLDD 300

Query: 301 APVDRIHISCSRMRNTFAICGWLQDSCYTSPGLQDDLKTILRKIESELLGHGCGWKNVLY 360
            P DR+ ISCSRM+ TFAI  WLQDS  TS GLQDDLKT+LRKIESELLG G GWKNVLY
Sbjct: 301 VPDDRLQISCSRMQYTFAISCWLQDSRDTSSGLQDDLKTVLRKIESELLGRGYGWKNVLY 360

Query: 361 IHLYLADMDEFALANETYVSFITQEKCPFGVPSRSTIELPLLHVRLGNAYIEVLVANDQT 420
           IHLYLADMD+FALANETYVSFITQEKCPFGVPSRSTIELPLL VR GNAYIEVLVANDQ+
Sbjct: 361 IHLYLADMDDFALANETYVSFITQEKCPFGVPSRSTIELPLLQVRSGNAYIEVLVANDQS 420

Query: 421 KRVLHVQSISSWAPSCIGPYSQATLHNEILYMAGQLGLNPPTMTLCSGDATDELEQALKN 480
           KRVLHVQSISSWAPSCIGPYSQATLH EILYMAGQLGLNPPTMTLCSG AT ELEQALKN
Sbjct: 421 KRVLHVQSISSWAPSCIGPYSQATLHKEILYMAGQLGLNPPTMTLCSGGATHELEQALKN 480

Query: 481 CEAVSECFNSSISTTSVIFVTYCSTRIQPEERRRIEDRLHGVLEEMRHSDKDSLSKVLDT 540
           CEAVSE FNSSIST+SV+F+TYCSTRIQ EER++IED+LHGVLEEMRHS KDS SKVLDT
Sbjct: 481 CEAVSESFNSSISTSSVLFITYCSTRIQLEERKKIEDKLHGVLEEMRHSKKDSSSKVLDT 540

Query: 541 IFLYIHVPNLPKGALVEVKPILYVQENLDTVVESVQDSPKLHTPTYWGFHHEHWHKSCIQ 600
           I+LYIHVPNLPKGALVEVKP+LYVQEN DT  E + DS KLHTPTYWGF HE+WHKSCIQ
Sbjct: 541 IYLYIHVPNLPKGALVEVKPVLYVQENFDTEAEHLHDSSKLHTPTYWGFQHENWHKSCIQ 600

Query: 601 KCVVNGKICAVVLSVTNELARNICSCLLGNQITEEHFGLVSKFCIYVLNEVLLDSAFCWE 660
           KCVVNG ICAVVLSVTNE ARNICS L GNQI+EEH  LVSKFCI +LNE LLDSA CWE
Sbjct: 601 KCVVNGNICAVVLSVTNEPARNICSSLRGNQISEEHLELVSKFCICLLNEALLDSACCWE 660

Query: 661 DIKNLRFYFPTSLNITLEAASIVFSRAFDELAESNPTAHVDRFFNLIPVLGAGRTPTSMD 720
           DIK+LRFY PTSLNITLEAASI+FSRAF+ELAESNPT HVDRFFNLIPVLGAGR PTSMD
Sbjct: 661 DIKSLRFYLPTSLNITLEAASIIFSRAFNELAESNPTVHVDRFFNLIPVLGAGRAPTSMD 720

Query: 721 DILTCELFAQKS 733
           DILTCELFAQKS
Sbjct: 721 DILTCELFAQKS 731

BLAST of Tan0006277 vs. ExPASy TrEMBL
Match: A0A6J1FIH5 (Diphthamide synthase OS=Cucurbita moschata OX=3662 GN=LOC111444237 PE=4 SV=1)

HSP 1 Score: 1317.4 bits (3408), Expect = 0.0e+00
Identity = 649/732 (88.66%), Positives = 681/732 (93.03%), Query Frame = 0

Query: 1   MKVVALVSGGKDSCFAMMKSIQYGHEIVALANLLPADDSVDELDSYMYQTVGHQIIVSYA 60
           MKVVALVSGGKDSCFAMMKSIQ+GHEIVALANL+PADDSVDELDSYMYQTVGHQIIVSYA
Sbjct: 1   MKVVALVSGGKDSCFAMMKSIQFGHEIVALANLMPADDSVDELDSYMYQTVGHQIIVSYA 60

Query: 61  ECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLKEVKRKLPSVTAVCSGAIASDY 120
           ECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLKEVKR+LPSV+AVCSGAIASDY
Sbjct: 61  ECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLKEVKRQLPSVSAVCSGAIASDY 120

Query: 121 QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKELT 180
           QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDP+KHLGKEL+
Sbjct: 121 QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPMKHLGKELS 180

Query: 181 FLDSDLHKLNKLYGINVCGEGGEYETLTLDCPLFKSARIVLDEFKTVMHSSDSIAPVGIL 240
            LDS LHKLNKLYGINVCGEGGEYETLTLDCPLFK+ARIVLDE K VMHSSDSIAPVG L
Sbjct: 181 SLDSVLHKLNKLYGINVCGEGGEYETLTLDCPLFKNARIVLDESKVVMHSSDSIAPVGFL 240

Query: 241 HPVSFHLEYKTSSLGICDNNNSVDHEKVGLLFEIQGDCFHSSDTLQSIADASSASHILDD 300
           HP+SFHLEYK  +  ICD NNSVDHEKVGLLFEI+GDCFH SDTLQS+ADASS SH+LDD
Sbjct: 241 HPISFHLEYKAKNSSICD-NNSVDHEKVGLLFEIEGDCFHGSDTLQSVADASSVSHLLDD 300

Query: 301 APVDRIHISCSRMRNTFAICGWLQDSCYTSPGLQDDLKTILRKIESELLGHGCGWKNVLY 360
            P DR+ ISCSRM+NTFAI  WLQDS  TS GLQDDLKT+LRKIESELLG G GWKNVLY
Sbjct: 301 VPDDRLQISCSRMQNTFAISCWLQDSRDTSSGLQDDLKTVLRKIESELLGRGYGWKNVLY 360

Query: 361 IHLYLADMDEFALANETYVSFITQEKCPFGVPSRSTIELPLLHVRLGNAYIEVLVANDQT 420
           IHLYLADMD+FALANETYVSFITQEKCPFGVPSRSTIELPLL VR GN YIEVLVANDQ+
Sbjct: 361 IHLYLADMDDFALANETYVSFITQEKCPFGVPSRSTIELPLLQVRSGNLYIEVLVANDQS 420

Query: 421 KRVLHVQSISSWAPSCIGPYSQATLHNEILYMAGQLGLNPPTMTLCSGDATDELEQALKN 480
           KRVLHVQSISSWAPSCIGPYSQATLH EILYMAGQLGLNPPTMTLCSG AT ELEQALKN
Sbjct: 421 KRVLHVQSISSWAPSCIGPYSQATLHKEILYMAGQLGLNPPTMTLCSGGATHELEQALKN 480

Query: 481 CEAVSECFNSSISTTSVIFVTYCSTRIQPEERRRIEDRLHGVLEEMRHSDKDSLSKVLDT 540
           CEAVSE FNSSIST+SV+F+TYCSTRIQ EER++IED+LHGVLEEMRHS KDS SKVLDT
Sbjct: 481 CEAVSESFNSSISTSSVLFITYCSTRIQLEERKKIEDKLHGVLEEMRHSKKDSSSKVLDT 540

Query: 541 IFLYIHVPNLPKGALVEVKPILYVQENLDTVVESVQDSPKLHTPTYWGFHHEHWHKSCIQ 600
           I+LYIHVPNLPKGALVEVKP+LYVQEN DT  E++ DS K HTPTYWGF HE WHKSCIQ
Sbjct: 541 IYLYIHVPNLPKGALVEVKPVLYVQENFDTEAENLHDSSKFHTPTYWGFQHEDWHKSCIQ 600

Query: 601 KCVVNGKICAVVLSVTNELARNICSCLLGNQITEEHFGLVSKFCIYVLNEVLLDSAFCWE 660
           KCVVNG ICAVVLSVTNELARNICS L GNQI+EEH  LVSKFCI +LNE LLDSA CWE
Sbjct: 601 KCVVNGNICAVVLSVTNELARNICSSLRGNQISEEHLELVSKFCICLLNEALLDSACCWE 660

Query: 661 DIKNLRFYFPTSLNITLEAASIVFSRAFDELAESNPTAHVDRFFNLIPVLGAGRTPTSMD 720
           DIK+LRFY PTSLNITLEAASI+FSRAF+ELAESNPT HVDRFFNLIPVLGAGR PTSMD
Sbjct: 661 DIKSLRFYLPTSLNITLEAASIIFSRAFNELAESNPTVHVDRFFNLIPVLGAGRAPTSMD 720

Query: 721 DILTCELFAQKS 733
           DILTCELFAQKS
Sbjct: 721 DILTCELFAQKS 731

BLAST of Tan0006277 vs. ExPASy TrEMBL
Match: A0A1S3B4Z6 (Diphthamide synthase OS=Cucumis melo OX=3656 GN=LOC103485878 PE=4 SV=1)

HSP 1 Score: 1316.2 bits (3405), Expect = 0.0e+00
Identity = 645/734 (87.87%), Positives = 683/734 (93.05%), Query Frame = 0

Query: 1   MKVVALVSGGKDSCFAMMKSIQYGHEIVALANLLPADDSVDELDSYMYQTVGHQIIVSYA 60
           MKVVALVSGGKDSCFAMMKSIQYGHEIVALANL+PADDSVDELDSYMYQTVGHQIIVSYA
Sbjct: 1   MKVVALVSGGKDSCFAMMKSIQYGHEIVALANLMPADDSVDELDSYMYQTVGHQIIVSYA 60

Query: 61  ECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLKEVKRKLPSVTAVCSGAIASDY 120
           ECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILL EVKR+LPSV+AV SGAIASDY
Sbjct: 61  ECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLNEVKRQLPSVSAVSSGAIASDY 120

Query: 121 QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKELT 180
           QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKELT
Sbjct: 121 QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKELT 180

Query: 181 FLDSDLHKLNKLYGINVCGEGGEYETLTLDCPLFKSARIVLDEFKTVMHSSDSIAPVGIL 240
            LDSDLHKLN+LYGINVCGEGGEYETLTLDCPLFK+ARIVLDEFK VMHSSDSIAPVGIL
Sbjct: 181 SLDSDLHKLNRLYGINVCGEGGEYETLTLDCPLFKNARIVLDEFKVVMHSSDSIAPVGIL 240

Query: 241 HPVSFHLEY--KTSSLG-ICDNNNSVDHEKVGLLFEIQGDCFHSSDTLQSIADASSASHI 300
           HPVSFHL+Y  KTSSLG ICDN N VD EK GLLFEIQGDCF + D LQS+AD SS  HI
Sbjct: 241 HPVSFHLQYKAKTSSLGSICDNKNLVDQEKGGLLFEIQGDCFQNCDILQSVADVSSDDHI 300

Query: 301 LDDAPVDRIHISCSRMRNTFAICGWLQDSCYTSPGLQDDLKTILRKIESELLGHGCGWKN 360
           LDD P DR+ ISCSRM+ TF IC WLQ+SC TSPGLQDDLK++LRKIESELL  GCGWKN
Sbjct: 301 LDDVPDDRLQISCSRMQTTFLICCWLQNSCGTSPGLQDDLKSVLRKIESELLARGCGWKN 360

Query: 361 VLYIHLYLADMDEFALANETYVSFITQEKCPFGVPSRSTIELPLLHVRLGNAYIEVLVAN 420
           VLYIHLYLADM  F LANETYVSFITQEKCPFGVPSRST+ELPL  V+LGNAYIEVLVAN
Sbjct: 361 VLYIHLYLADMKGFGLANETYVSFITQEKCPFGVPSRSTVELPLQQVQLGNAYIEVLVAN 420

Query: 421 DQTKRVLHVQSISSWAPSCIGPYSQATLHNEILYMAGQLGLNPPTMTLCSGDATDELEQA 480
           DQTKRVLHVQSISSWAPSCIGPYSQATLH EILYMAGQLGL+PPTMTLCSG A  ELEQA
Sbjct: 421 DQTKRVLHVQSISSWAPSCIGPYSQATLHKEILYMAGQLGLDPPTMTLCSGGAAHELEQA 480

Query: 481 LKNCEAVSECFNSSISTTSVIFVTYCSTRIQPEERRRIEDRLHGVLEEMRHSDKDSLSKV 540
           LKNCEAV+ECF SS+ST+SVIFVTYCSTR+QPEERRRIED+ HGVLEEMRHSDK SLSK+
Sbjct: 481 LKNCEAVAECFRSSVSTSSVIFVTYCSTRMQPEERRRIEDKFHGVLEEMRHSDKASLSKL 540

Query: 541 LDTIFLYIHVPNLPKGALVEVKPILYVQENLDTVVESVQDSPKLHTPTYWGFHHEHWHKS 600
           LD+IFLY++VPNLPKGALVEVKPILYVQENLDTV ++  DSP+L+ PT WGF HEHWH S
Sbjct: 541 LDSIFLYVNVPNLPKGALVEVKPILYVQENLDTVEQTPHDSPRLYIPTDWGFQHEHWHNS 600

Query: 601 CIQKCVVNGKICAVVLSVTNELARNICSCLLGNQITEEHFGLVSKFCIYVLNEVLLDSAF 660
           CIQKC+VNGK+C  VLS+TNELARNI SCLLGN+ITEEH  LVSKFCIY+LNEVLLDSAF
Sbjct: 601 CIQKCIVNGKVCVTVLSMTNELARNISSCLLGNEITEEHLELVSKFCIYLLNEVLLDSAF 660

Query: 661 CWEDIKNLRFYFPTSLNITLEAASIVFSRAFDELAESNPTAHVDRFFNLIPVLGAGRTPT 720
           CWEDIKNLRFYFPTSLNITLEAASI+FSRAF+ELAESNPT HVDRFFNLIPV+GAGRTPT
Sbjct: 661 CWEDIKNLRFYFPTSLNITLEAASIIFSRAFNELAESNPTIHVDRFFNLIPVIGAGRTPT 720

Query: 721 SMDDILTCELFAQK 732
           SMDDILTCELFA+K
Sbjct: 721 SMDDILTCELFAKK 734

BLAST of Tan0006277 vs. TAIR 10
Match: AT3G04480.1 (endoribonucleases)

HSP 1 Score: 875.2 bits (2260), Expect = 3.8e-254
Identity = 442/736 (60.05%), Positives = 560/736 (76.09%), Query Frame = 0

Query: 1   MKVVALVSGGKDSCFAMMKSIQYGHEIVALANLLPADDSVDELDSYMYQTVGHQIIVSYA 60
           MKVVALVSGGKDSC+AMMK IQYGHEIVALANLLP DDSVDELDSYMYQTVGHQI+V YA
Sbjct: 1   MKVVALVSGGKDSCYAMMKCIQYGHEIVALANLLPVDDSVDELDSYMYQTVGHQILVGYA 60

Query: 61  ECMGIPLFRRRIQGSTRHQKLNYRITPGDEVEDMYILLKEVKRKLPSVTAVCSGAIASDY 120
           ECM +PLFRRRI+GS+RHQKL+Y++TP DEVEDM++LL EVKR++PS+TAV SGAIASDY
Sbjct: 61  ECMNVPLFRRRIRGSSRHQKLSYQMTPDDEVEDMFVLLSEVKRQIPSITAVSSGAIASDY 120

Query: 121 QRLRVESVCSRLGLVSLAYLWKQDQSLLLHEMINNGILAITVKVAAMGLDPVKHLGKELT 180
           QRLRVES+CSRLGLVSLA+LWKQDQ+LLL +MI NGI AI VKVAA+GLDP KHLGK+L 
Sbjct: 121 QRLRVESICSRLGLVSLAFLWKQDQTLLLQDMIANGIKAILVKVAAIGLDPSKHLGKDLA 180

Query: 181 FLDSDLHKLNKLYGINVCGEGGEYETLTLDCPLFKSARIVLDEFKTVMHSSDSIAPVGIL 240
           F++  L KL + YG NVCGEGGEYETLTLDCPLF +A IVLDE++ V+HS DSIAPVG+L
Sbjct: 181 FMEPYLLKLKEKYGSNVCGEGGEYETLTLDCPLFTNASIVLDEYQVVLHSPDSIAPVGVL 240

Query: 241 HPVSFHLEYKTSSLGICDNNNSVDHEKVGLLFEIQGDCFHSSDTLQSIADASSASHILDD 300
           HP +FHLE K    G  D+++    E+  L+ E+ GD  ++SD   S     +    L +
Sbjct: 241 HPSTFHLEKK----GNPDSHS--PEEESSLVSEVLGDGPNTSD---STRQRDNGIVDLVE 300

Query: 301 APVDRIHISCSRMRNTFAICGWLQDSCYTSPGLQDDLKTILRKIESELLGHGCGWKNVLY 360
              +R+HIS +   NTF+IC WL+DS  +S GL++DL+T+L ++ES+LL HG  W++VLY
Sbjct: 301 HTSNRLHISRAEKHNTFSICCWLEDSSESSKGLKEDLETVLTELESQLLKHGYNWQHVLY 360

Query: 361 IHLYLADMDEFALANETYVSFITQEKCPFGVPSRSTIELPLLHVRLGNAYIEVLVANDQT 420
           IHLY++DM EFA+ANETYV FITQEKCPFGVPSRSTIELPL+   LG AYIEVLVAND++
Sbjct: 361 IHLYISDMSEFAVANETYVKFITQEKCPFGVPSRSTIELPLVQAGLGKAYIEVLVANDES 420

Query: 421 KRVLHVQSISSWAPSCIGPYSQATLHNEILYMAGQLGLNPPTMTLCSGDATDELEQALKN 480
           KRVLHVQSIS WAPSCIGPYSQATLH  +L+MAGQLGL+PPTM L +  A  EL QAL N
Sbjct: 421 KRVLHVQSISCWAPSCIGPYSQATLHQSVLHMAGQLGLDPPTMNLQTEGAIAELNQALTN 480

Query: 481 CEAVSECFNSSISTTSVIFVTYCSTRIQPEERRRIEDRLHGVLEEMRHSDKDSLSKVLDT 540
            EA++E FN SIS+++++FV +CS R +  ER ++ ++    L   + S +  +  VLD 
Sbjct: 481 SEAIAESFNCSISSSAILFVVFCSARTKQSERNQLHEKFVTFLGLAKSSRR--VQNVLDP 540

Query: 541 IFLYIHVPNLPKGALVEVKPILYVQENLDTVVESVQDSPKLHTPTYWGFHHEHWHKSCIQ 600
           +FLYI VP+LPK ALVEVKPILYV+E+ DT  E+ +D       + WG+  E WH+ C+Q
Sbjct: 541 MFLYILVPDLPKRALVEVKPILYVEEDTDTEDETSRDQSGEGHYSIWGYKPEKWHQDCVQ 600

Query: 601 KCVVNGKICAVVLSVTNELARNICSCLLGNQITEEHFGLVSKFCIYVLNEVLLDSAFCWE 660
           K VV+GK+C  VLS++ EL R +       Q  EE   +VS+FC+Y+LN+ L +++F W+
Sbjct: 601 KRVVDGKVCVAVLSISAELMRKL-------QGEEEELEIVSRFCVYLLNKTLSENSFSWQ 660

Query: 661 DIKNLRFYFPTSLNITLEAASIVFSRAFDELAESNPTAHVDR----FFNLIPVLGAGRTP 720
           D  +LR +F TS+ +++E  S +F  AF EL E +    +D      FNL+PVLGAG + 
Sbjct: 661 DTTSLRIHFSTSIGVSVERLSAIFVSAFRELNEMSDGVKMDSLKEPIFNLVPVLGAGNSS 718

Query: 721 TSMDDILTCELFAQKS 733
            S+D+I+TCELFA +S
Sbjct: 721 ASLDNIITCELFALRS 718

BLAST of Tan0006277 vs. TAIR 10
Match: AT3G20390.1 (endoribonuclease L-PSP family protein)

HSP 1 Score: 48.5 bits (114), Expect = 2.6e-05
Identity = 26/70 (37.14%), Positives = 40/70 (57.14%), Query Frame = 0

Query: 433 APSCIGPYSQATLHNEILYMAGQLGLNPPTMTLCSGDATDELEQALKN----CEAVSECF 492
           AP+ +GPYSQA   N +++++G LGL P T    S    D+ EQ LKN     +A    +
Sbjct: 72  APAALGPYSQAIKANNLVFLSGVLGLIPETGKFVSESVEDQTEQVLKNMGEILKASGADY 131

Query: 493 NSSISTTSVI 499
           +S + TT ++
Sbjct: 132 SSVVKTTIML 141
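
The identity and positives percentages in each HSP header above are plain ratios over the alignment length (for example, 652/734 = 88.83%). As a minimal sketch, assuming the header lines keep the layout shown in this report, they could be parsed and re-derived in Python as follows. The function name `parse_hsp_stats` and the dictionary keys are illustrative choices, not part of any BLAST tool.

```python
import re

# Matches HSP summary lines such as
# "Identity = 652/734 (88.83%), Positives = 687/734 (93.60%), Query Frame = 0".
# "Posi?tives" also tolerates the misspelling "Postives" that some report pages emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and percentages from one HSP summary line, or None."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positive": int(pos),
        "alignment_length": int(ident_len),
        # Percentages as printed in the report.
        "identity_pct_reported": float(ident_pct),
        "positives_pct_reported": float(pos_pct),
        # Percentages recomputed from the raw counts, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
    }
```

Applied to the header of the Momordica charantia hit, the recomputed `identity_pct` is 88.83, which agrees with the reported value. Lines that are not HSP summaries (alignment rows, match titles) return `None`.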

The following BLAST results are available for this feature:
Match Name  E-value  Identity (%)  Description
Q9USQ7      8.6e-86  38.98         Diphthine--ammonia ligase OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843)...
A2RV01      1.3e-78  59.13         Diphthine--ammonia ligase OS=Danio rerio OX=7955 GN=dph6 PE=2 SV=1
Q7L8W6      6.8e-75  57.42         Diphthine--ammonia ligase OS=Homo sapiens OX=9606 GN=DPH6 PE=1 SV=3
Q5M9F5      1.5e-74  56.93         Diphthine--ammonia ligase OS=Rattus norvegicus OX=10116 GN=Dph6 PE=2 SV=1
Q9CQ28      7.5e-74  56.55         Diphthine--ammonia ligase OS=Mus musculus OX=10090 GN=Dph6 PE=1 SV=1

Match Name      E-value  Identity (%)  Description
XP_038889528.1  0.0e+00  89.51         diphthine--ammonia ligase isoform X1 [Benincasa hispida]
XP_022149926.1  0.0e+00  88.83         diphthine--ammonia ligase isoform X1 [Momordica charantia]
XP_004152819.2  0.0e+00  87.76         diphthine--ammonia ligase isoform X1 [Cucumis sativus]
XP_022965506.1  0.0e+00  88.93         diphthine--ammonia ligase isoform X1 [Cucurbita maxima]
XP_023538215.1  0.0e+00  88.80         diphthine--ammonia ligase isoform X1 [Cucurbita pepo subsp. pepo]

Match Name  E-value  Identity (%)  Description
A0A6J1D9W6  0.0e+00  88.83         Diphthamide synthase OS=Momordica charantia OX=3673 GN=LOC111018223 PE=4 SV=1
A0A0A0LKJ7  0.0e+00  87.76         Diphthamide synthase OS=Cucumis sativus OX=3659 GN=Csa_2G075360 PE=4 SV=1
A0A6J1HLV6  0.0e+00  88.93         Diphthamide synthase OS=Cucurbita maxima OX=3661 GN=LOC111465391 PE=4 SV=1
A0A6J1FIH5  0.0e+00  88.66         Diphthamide synthase OS=Cucurbita moschata OX=3662 GN=LOC111444237 PE=4 SV=1
A0A1S3B4Z6  0.0e+00  87.87         Diphthamide synthase OS=Cucumis melo OX=3656 GN=LOC103485878 PE=4 SV=1

Match Name   E-value   Identity (%)  Description
AT3G04480.1  3.8e-254  60.05         endoribonucleases
AT3G20390.1  2.6e-05   37.14         endoribonuclease L-PSP family protein
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                                          Source       Source Term   Source Description                                       Alignment
IPR035959  RutC-like superfamily                                    GENE3D       3.30.1330.40  -                                                        coord: 370..516; e-value: 7.4E-23; score: 82.8
IPR035959  RutC-like superfamily                                    GENE3D       3.30.1330.40  -                                                        coord: 262..369; e-value: 5.2E-8; score: 34.8
IPR035959  RutC-like superfamily                                    SUPERFAMILY  55298         YjgF-like                                                coord: 375..519
IPR035959  RutC-like superfamily                                    SUPERFAMILY  55298         YjgF-like                                                coord: 272..366
IPR002761  Diphthamide synthase domain                              TIGRFAM      TIGR00290     TIGR00290                                                coord: 17..185; e-value: 1.3E-40; score: 137.4
IPR002761  Diphthamide synthase domain                              PFAM         PF01902       Diphthami_syn_2                                          coord: 16..183; e-value: 9.5E-29; score: 100.5
IPR002761  Diphthamide synthase domain                              CDD          cd01994       Alpha_ANH_like_IV                                        coord: 28..167; e-value: 1.41531E-58; score: 193.977
None       No IPR available                                         GENE3D       3.90.1490.10  -                                                        coord: 113..207; e-value: 1.1E-28; score: 101.0
None       No IPR available                                         CDD          cd06156       eu_AANH_C_2                                              coord: 393..515; e-value: 1.37408E-27; score: 105.874
None       No IPR available                                         SUPERFAMILY  52402         Adenine nucleotide alpha hydrolases-like                 coord: 13..196
IPR006175  YjgF/YER057c/UK114 family                                PFAM         PF01042       Ribonuc_L-PSP                                            coord: 386..513; e-value: 9.2E-11; score: 41.8
IPR014729  Rossmann-like alpha/beta/alpha sandwich fold             GENE3D       3.40.50.620   HUPs                                                     coord: 9..110; e-value: 1.9E-24; score: 88.1
IPR030662  Diphthine--ammonia ligase/Uncharacterised protein MJ0570  PANTHER      PTHR12196     DOMAIN OF UNKNOWN FUNCTION 71 DUF71 -CONTAINING PROTEIN  coord: 39..684
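
The `coord` values in the table above are residue ranges on the predicted protein, so they can be compared directly with the query ranges of the BLAST alignments. The sketch below assumes 1-based, inclusive coordinates (the page does not state the convention) and uses a handful of rows copied from the table. The `Domain` type and the two helper functions are hypothetical names introduced only for this illustration.

```python
from typing import NamedTuple


class Domain(NamedTuple):
    source: str
    name: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive


# A subset of rows copied from the InterPro table for Tan0006277.
DOMAINS = [
    Domain("TIGRFAM", "TIGR00290", 17, 185),
    Domain("PFAM", "Diphthami_syn_2", 16, 183),
    Domain("GENE3D", "HUPs", 9, 110),
    Domain("PFAM", "Ribonuc_L-PSP", 386, 513),
    Domain("CDD", "eu_AANH_C_2", 393, 515),
]


def domains_covering(start, end, domains=DOMAINS):
    """Return the domains whose range fully contains [start, end]."""
    return [d for d in domains if d.start <= start and end <= d.end]


def domains_overlapping(start, end, domains=DOMAINS):
    """Return the domains that share at least one residue with [start, end]."""
    return [d for d in domains if d.start <= end and start <= d.end]
```

For instance, the AT3G20390.1 alignment spans query residues 433 to 499, and `domains_covering(433, 499)` returns the `Ribonuc_L-PSP` and `eu_AANH_C_2` entries. That is consistent with the weak hit to an L-PSP family protein falling in the C-terminal YjgF-like region rather than in the diphthamide synthase domain.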

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0006277.1  Tan0006277.1  mRNA
Tan0006277.3  Tan0006277.3  mRNA
Tan0006277.2  Tan0006277.2  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0017183      peptidyl-diphthamide biosynthetic process from peptidyl-histidine
molecular_function  GO:0017178      diphthine-ammonia ligase activity