Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, polypeptide, CDS, three_prime_UTR
GGTTCACTGTTATATTTGATAATTTAGGCTTAAAAAAAAGTATTGAGTACAGTGTATGATATCGCAGCCCGGAATCGGAATTACAATCGGCGGCGATAGAGGGGATTGGTTCCGGTTAGGTTTCCGGCCAAAAAGAAACAGTAGCGCTATTCCATTTTGCATATATCCAATTTTCCTGCTGGTAAATTTGCAATCTCTCTTTTACGCTGTTTCTTCAATTGGGTTTCCTGTTTTTTTTTTTTTTTGGATCAATTGTGCTTGATTGTTTTGATGAGATTAATCGCAGTTCTCAGGATCATGCGTCGGTGGTGTGTGTAAGCTTAATTGTTTTGGCACACCATCGTTACGGATTTCGTTGCCTGTTTAAATCCCCTTGAATTCAATCAAGTGGCCAGCCGTTTGAGTTATTGTGTTTCAAGCTTCCATCTTGACTAGTTTTTTACAATTAAATGTTTAGGTATTTGAAACAAAACTTTAACGATTCAGGGTGTTTTTGAAGTTTTTCAAAATTTAGGGATATGTTTGATGGAAGAGTATTTTTTATAGACCAAATCAACCGATCGAGAGGAGAACCTTTGTCATGTTTTCCTTGCAAAGAACTCCTGCAGAAGTTATTTGGTTGAAGTTTTTTGCTCGGATATTAACCTAAGTTGTATGGTACAATGGGAGAATACAGATTTGACTTAGACTGTTCAATTCCTTTGCAGCGTAGCTCGAGTTGCTCTCCATTTTGTGTGAATGTCGGTATCGCACAAGAGGGTTCTCTGCGCTATTACTCTTCCTTTGTCTTCCTCTTTGACTTTCAGTTCCAGGACCCGTTTCTACATTCCCCACTCCTTATTACCTTTCAAAGCGTCTTCCCCATTTCCCCTTTCCTGCCATTCTCCATTCATCATGCCTTACAATCAGGTACCCATGTGTTGGTATTTTCAAATTGTCGTTTGATGATGATTTACTTTGACAATTCATGTTATTAAGATGCTGCCATCCAAAAGGATTCAGCTTTTTTTTTTTTATATACGCTTCACGTATGTGAATTACTTCAATTATATTACTTTTGTCTCAACACATTCACAACCCATGATAATAATAATAATTTCTTTAATTGTAAGATGGTTTGAATGCTCAGCTTGTTTAGTTTGATCTTCATTTGTAGCCAAGGGGTGGTTGTAAAGACCAGAAGTGGAAAGCGAAGACGAAGGCCGACAATACTTCGATGGATGCTGCTGAAGTTGTTACACATGCACTCAATAAATTGAGTGTCACTGAAAGTGGTCAACCTCATGTTCCTATTTCAAGTACGCAGTTTGGAAATGTGCAACTCACAAACCAGGTCCCTCCTGGGGATGGTCATAGAACAATTTGGAAACCAAAAGCATACGGAACAACCAGTGGGGCTGCAGTTGTCGAAGCTGAAAATGCATCCGCCGGTCGAACATCTATTCAAAACAAGGAGAACTCTGCTGGACTGGCTGCACAGAATGACATATTTAAGGGAAATCAAATAGAGAAGTTTACTGTGGATAACTACACTTACACACATGCCCAAATTAGAGCTACCTTCTACCCAAAATTTGAGAATGAGAAGTCGGATCAGGAGGTATTTCTATTTTCAATGTGTTTTCTTAAAGTTAAACTTACACCCTTTTTGAATAGTTAGCTGTGTGGCTCTTCTTTGGGACGACTTGCTGATCCTTGCATAGTTCCCTGTTTGCCCTTCTGTGTTATTTCATTTCTTTTATCAATAAAAGCATGGTTTCTCATTATTTATAGAAATGATCATGGGTATATGGACGGGGAAGTTCTCTTGTGAACCTTCTACTTTTCTTTTGGCTTCTATTGGACATTTGTCTTTGTGAAATGTGGATCATGTCATCATATTTGATATATATATATATATATTGAAAAGGAAACATGATATACAGTATTCGAAATGATTGGGGTTAACTCCAGCCTCCAGGCACCAAAAGGTGAATTACACGAATAGATTCCAATTGGAA
ACCCTTAGACCCCTAAGGGATTTAGGATTTTAATATTACATTAGTAAGGGTATAGTAGTAATTAGTTATTAGTGTGCTTAAATAGGGGAGTGGAGTTTTTAGAAAGGTATGTAATTTTGGTATTTTTGTCTTGTTCAACAAGTGGGGAGAGTCTAGCCCTCTCGAATGGTTATGGTATTGTAACCCTATCTATTATCATCTCAATATAGTGTTCTTGGCTATTTTTTTATTGTCGGTGTTAGTTTTCTTGTCATTGTTAAGATTAGTGGTATTTTAACAGAAACAAAATAGGGATACTATATGAGGAAGGAAATACTTATCTTTACACCAAAAGTATACCGTTGTGACAATGGCCTCAAGAAGTTACCAAAGTATACAATTGTGACAATGGCCTCAATGTTGATGAACTTTGAACGTCCTTGGAATGTTTTGCAATCATTGACTTAAGCACTTTGTTTCCTATAAAAAAACCATCGGCTTAAGCACATTTCAGAAGGAGGATCGCTGTTTACTTATTTCAACCTATCTTTTTCTTCATAAACTCGAGAACAAGTTTTTGTAGGGATAAAATGGTGTAAAGAAGTTAAGGGCATGCTGGATAGTTCTTGATTAGTTAGTTACTTCTCTTAAATCCTTATAAATAGTAAGACTCACTCGTGTATTCTTAAACATTTTATTCATTATAAGTAGTAAGCCTCACTCATGTATTCTTTAACATTTGATTCATTAATAAAGATGATAAGGTTGGGCCTAGGGCTCTATTTAACCATTACTTGGAGAATAAGTTAGTATCTTGTTATAAATAGAGTGAGTTAGGTTAGAGTTTAGGCATCCAATTATTAGTGATTTATTGTAGGGCTCAAGAGATTCTCTTGAGAGGGGAGGTTACAAGACCTATCAAAACTTGGCTTATCTTTATTCTATTTTTTTTTGGTATAATCTAGATCTTGGATCATATCAATTGGTATCAGAGCCGTTTCGATTCTTGGTTGTAGAGATCGATCTAGAACATGTTCGCTTATTTTAGAGAGATAAGAGAGGAAAGTGCAAAAGAGCAGGAACGAAGAAGATTAGAAAGGGAGGAGCAGTCCAAGGCAAGGGAACAATGGTACTTAGAGATGGAAGCTAGGATTCTAGGAAGAAGGGAGGTTGTGGAAGAAACAGTGACGGTTAGGGTTTTGACACTGCACCTCCGCCACCACCATCGCTGCCACCACCTCCGACATCGCTGCCGCCACCTCCGCCGCCACCGCTGCCGCCGCCGTCACCACCTCCACCACCACCATCGCCGCCGCAACCTCCGACATCGCCGCTGCCACCTCTGCCGCCACCAACTCTGACATCGCCGCTGCCACCTCTGCCGCCACCACCTCTAACATCGCCGCCGCCACCTCCGCTGCCACCACTTCCACCACCGCCGCCACCTTCGCCACCACCTCCGCCACCAACGCCGCCACCAACGCTGCCATTTCCGCCACCACAGAAGCACCGCTGCCGCAATCGCCGCTGCGGCCGCGGAAATTTTTTTTTTTTTTTCATTTTTTTTTCTTTCTATTTTTGTGTATGTCCACCTTGAGGACAATGTGCTTTAAAGGGGCGGGTATTGATAAGGTTGGGCCTAGGGCTCAGTTCGGCCTAGGGCTCTATTTAGCCATTACTTGGAGAATAAGTTAGTATCTTGTTATAAATAGAGTGGGTTAGGTTAGGGTTTAGGCATCCAATTATTGAATGAGTTATTGAGTGATTTATTGTAGGGCTCAAGAGATTTTCTTGAGAGGGGAGGTTCCAAGACCTATCAAAACTTGGCTTATCTTTATTCTTTTATTTTTTGATATAATCTAGATCCTGGATCATATCAAAAGATTGGTGTTTTATACTTCTTGGAGAGTTATTTTCTATTGGTTATTATAGGCTACATCAAAATGGAAACTATCAATTTGATAAGGCGATTATGTTTAATTATTTGTTTTTATAAGATAGATTTGTATTTCGTTGGCAAT
GATGGTATGTTTAATGTGCGATGTGTTTTTGTCAGATTCGATCAAGGATGATAGAGATGGTATCGAAAGGCTTAGCAACATTGGAGGTATAAAAATTTTCCCATGATATTTTGTGAAGATTGTCATAATGTTTATGGTGATATTTAAGTTATTAATAATAAAATGCCATCCAGAGTCTCAATTCCATTTATGTTTAATATTGTTTGCTATTTATGTTTAATATTGTTTGCTTTAGCTTTGGAAGGATCAAAATCTCCATGATATATTGTTGGTTTATGTTAATTTTTTGTGAAATAGTATTTTTATCTTTTTAATTATGTTTTCTTTTTTAGGATTAGATTAAATATTATTTTATATCTTTACCTTTTATGATTACGTCATTTTTCCTTTTATGATTAGATATTATATTTTCCTTTATAATTAGGTCATATCTTGTACTCTATTTAAACTCATTGTTGGGTTCATTATTAAATGAACAATTTGAATAATTAGAATTCCATCTTCAACATATATTTATATCTTTCTGTGAAAGTTTTCATTGTTTGGAACAAAACAGAAGGAAAAACAGTTTCTATAGCATAGCATTTACTGTTTATTCTTCTTCTTTGACTTGCTTCTTATTTGCCGATAGTGGCAGGTTAATGTCCCAAAGGTTATAGTATATTGATTGTGCTTTGAATTTTTCAAGCTGTCAGTAGGTTTATTAGGTTATTCAACTTTAAAATTTCTGCAGGTTTCACTAAAACACTCAGGGTCTTTGTTTATGTATGCTGGCCATGAAGGTGGGGCATATGCCAAAAACAGCTTTGGAAATATGTAAGTCGATAATGTCTTGGCCCTTTTCTTTTTATTTTTTTTTGGATGGTCTGTTCTACGCACATTTCCTGCAATATCATTTTTCACGATTGATTAGTATTTTTATTACTGTAGATTGTTATTATTTTATGGAGCTTTAGTATTAGGTGAGGAGAGTACTATATATTCAAACTTTATAGTTTCGGAACTTGTCACTGTTTCTATTCTATTCCACCACGTTTTCAAAGCTTTTCTCAATAAATTTTTTCCATTTTTAGCTACACTGCTGTTGGTGTGTTTGTTCTCGGAAGGATGTTTCGAGAGGCTTGGGGATCCGGAGCGGCAAAAAAGCAAGTAGAATTCAATGATTTCCTTGAGGTATTGTAACATGGGAAAAACATTGCTTTTGCTATCCAAAGTTTTTAGCCTTGATCCTCACATCTTCATGGGGACCAGGGCATTAGTTTACTGTGTTGAGTCTTCATTCAAAACTTTTCCATGGGTGAATTTTTTATAATTAACTTGTAGGCCTTTTTCTTGTTGAATGGTGGGTTTTTTGTATATATAGTTTAATGGTCGAATGGTCCTTTTCTTTTATTGAAAATAGAAGCATCGCTTTCCATAAAAAATAAAAAGCTATGAGAAGCTTGAACGTGTTTATTTTCAGTTAGACATCTCACATATGCGTGTTGCACTCAAACCATCATCTTTCTTCTTTTAGAGTAACCGCATGTGCATATCAATGGAGCTAGTAACTGCTGTCTTGGGTGATCATGGCCAACGACCACGAGAGGATTACGGTAATATAATATCCGTTTAAAACTTTTTATTTGTTGGTCTTTAGTCTTAGAGACCCAAAATCTTACTATATTCAACTTGTATTTTTTTGTTTGGTAAATAAAATATTTTTATTGATTAAGTGAAAGTGTCTCAAAGAAACCTATTGAAACAATGATAGGGGTCAATCAAAGAGCAAAATACATGAGATATATCGATAAATACAATCTTGACTGTGTGAGATACAATGAATGGGTGTTGGATGGGCTTGTAGGTTGATGTGGGCGTATCCAAGAGATACCTTAATATTTTGCTAACAAATCAGAAAAATGATGTTTGTGTGTTGAGAAGCGAAAAGCTGTTCAGTTAATAAGAATTTCCTAAATATGGTAGCCTTCAAAAAGCTTGGACAATGAGCACCTAAGATG
TAACTTTTATTATTTTTATTTATTTATTTTTTATTGCTCTTCTTTCTGTCTGGGTATTGCATTAGGTTTTGAGTAACATGCTCTTTCATTTTGTGCAGTGGTAGTTACAGCAGTTACTGAACTAGGCAATGGAAAACCGAAGTTCTACTCTACTGCAGAAATAATAGCTTTTTGTAGAAAATGGCGCTTACCAACTAATCACGTCTGGTTATTCTCAAGCAGGTGATCCATAATTTCATTTAATTTACTGGTAGGAACTACCACAATATTATATTAAACTACGATAAATCTCTCGAGAGAATCTCTCAAAGAGCCAAGGTACATAGACGACAACTGAATGAATCCCTCATAGTGGCCTATCATATCCTATTTATAGTAAACCTAGGATTTAACCTAATTTAGTAAATAGCTCTATTAAGGCCCAAAAGCCCAATACAACTTAAATAACTAAAACAGAATTATATTTAAGAATAATAATAAAATAGATCCGTCACTTCTTGGTCCGCCTTGAGTAAACTGATCTCGGGATCGTATCAATACTCCCCCCCCAAAGAACCACCTTGTCCTCAAGGTGAAAAGTCCGGGAATTATAGCTCTAGATCGGCAGCGGACTCCCAGGTAGCATCGTCTGGAGTGGACCCCTCCCACTGTACCAAAATCTGGCGTGACCCTCCATCCTGTGGAGATTCCCATATGCCCAAAACAGCCTTGGGACAAACCAACACGCCTAAATCATCTCCCACCAAAGCCGGTGTAGGGAAGACTAGACCTGGGGAACCAACCGCCTTGCGCAACACGGACACATGAAAAACCGGGTGAATCCTCACAGTCTGTGGTAACTCCAATCGGTACGCAACTGGCCCCACCCGAGCTAACACTCGGTACGGCCCAATAAATCGTGGTGCTAACTTGGGGTGTTTAAATTTCGCCAAGGAGGACTGGCGGTAGGGTCGGAGCTTAACATACACTAAATCGTCCACATCAAACTGAATTACTGAACATCCCGACGTTTTGCATTAGCACGATCCGTCATCAACTGCTGAGATCGCAACAAATTTGTCTTAAGCTTTTCCAACATTCTATCGCGTTCCATCATCAATGAATCCACGGTCGCAACTGGGCTAGCCCCACAATCGTAACCCAAAATTGTCGGCGGGGAACACCCATAAACAATCTCAAAAGGTGTCATACCCGTGGACGAATGATACGACGTATTAAAACTGAACTCAGCCCATGCAAGCCACCGGTACCATGCCTTCGGTTGTGTCAACACAAAGCATCTTAAGTACGATTCCAAACAGCGGTTCACGACCTCGGTTTGACCATCAGTCTGTGGATGATAAGTGGTACTGCGACGTAGCTGCGTCCTAGAAGTCTTGAAGATTTTCTCCCAGAGCAGGCTAGTGAATATCTTGTCGCGGTCCGACACGATACTCTTAGGTACCCCATGTAAACGGACCACCTCTCTGATAAAGACCCAAGACACTGTGAGAGAAGTCAATAGGTGTCGAAGGGGTATAAAATGAGCATACTTAGAAAGCCGATCGACCACCACAAAAATAGTGTCATAACCCTCGGAATGGGGAAGTCCTTCCACGAAATCCATCGAGATGTCTTCTCAAATTCTTTCAGGAATCGGTAACGGTTGTAATAGGCCAGCTGGAGACAACGATAAATGTTTTGCTTGCACACAAATCAAACACTCGGCCACAAATGCCTGTACGCGGGCCTTCATTCCTTGCCAATACACTTCTTTAGCCAGACGTTGATACGTCTTGAGGACCCCAAAATGCCCCCCGATGGCGCCCCCGCGGAACTCAACCAATAACAATGGAATCATAGGGGATGTCGGAGGTAGCACCAGTCGGCCCTGGTATAATAGCACATCACCCATGACGGAATAACCTGACGGAATAACCTGACGGGCTCACCTCACCTGCAACCAACGCGGTATAAATAGCAAACAACTTTTCGTCTTCCCGCACTTGCTGAGTAAACAC
TGATGTGTTAACTCCAGCCACACAACTCAACATGCTCAATTCACCTGTCGACGGCATTCGAGAAAGAGCATCTGACGCTCTGTTTTCTAGGTCTTTCTCATATTCAATATGGAAATCATAACTCATAAGCTTCGCAATCCACCGCTGATACTTTCCATCAACTACACGTTGTTCTAGCAGAAACTTAAGGCTCTTCTGGTCTATACGGACGAAAAAATGGTGTCCCAGTAAATACGCTCGCCAACGTTGTACTGCAAACACGATGGCCATCAGCTCACGCTCATAAACAAACTTCACGCGGTGGGTTATAGGTAAGGCCTTACTAAAATAAGCCAATGGTTGTCCCTGTTGCATCATGACTTCGCCCACACCAATACCTGAAGCATCAGTTTCCACAACAACAACTGATCAAAATCTGGCAATCGTAATACGGGCACACTACTCATCGCATGCTTAAGTCTCTGAAAGCAATCCTCCACTGTCGAGCCCCATTCAAACTTTCCTTTTTTTAGCAGCTGAGTCAGTGGAAAAGCCATCGACCCATAATTTGACACAAAACGATGGTAGTAACCTGGCAACCCAAGAAAACCCCCGTAGATCTTTAATATTCTTAGGGCTTGGCCAACGTACCATGGCCGCAATTTTCTCTGGATCGGTAGCCACTCCTTCAGCAGATATAAAGTGCCCCAAATGCTCAATCCGGCGCAGCTCAAACTGACATTTCTTAGCATTGGCTACAAGGGTATGGGTCTGTAAGGCTTCGAAAACTCGAACTAGGTGCTCCTTATGATCTTAGATGGACAGACTGTAGATCAAGATATCATAAAAAAATACTAGCACAAACTGGCGTGGAAACGGTCATAAAATCTCATTCATAACCGACTAAAACGTTGCAGGAGCATTGCGTAATCTGAAAGGCATGACTACAAATTCATAGTGGCCCTCATGAGTTCTAAAAGCCGTTTTATGTACATCAGCTGGCTTAACACGGATCTGATGGTAACCCGCCTTAAGATCAATTTTCGAAAACACTGTCGCCCCATACAGTTCATCCAACAACTCGTCAACCAGTGGTATCAGATATTTATCGGGCACTGTTGCTTGATTTAAAGTGCAGTAGTCGACGCAAAAATGCCAACTCCCATCTTTCTTTTTAACCAACAAAACTGGGCTTGAAAACGGGCTCGTGCTAGGACGAATAATTCCTGCCAACAACATTTCTCCCACTAATTTTTCAATCTCATTCTTTTGATACTGTGGGTACCGGTACAGACGTACATTTACTGATTCCTTGCTGACCAATAGTTCAATAGCATGGTCACAATTACGTTGCGGGGGTAGACCTGTCAATGATTAAAAAACCGGTGAGAAAAAATGAAGTACAGAGTGCAGTTCTGTGGGTACCTGAGACAGATCTGGTAGAGCTCTGGTCTGTTCTCAAGTCGGTATCGACGTCTCAAGCATGTTTAGCTCGACCAGGAGCCCCTGATTCTCTGGTCGTAAGGACTTCATCATGGATTTTAATGAGACCTGGGTCTTGACCAGCCGTGGCTCGCCTTGCAACTCAACCTTTCCTGACCCCAGTTTGAATTGCATTTTTAGTGCGTGGTAATCAAATTCAATTTTTCCCAACGTCGCTAGCCACACGACGCCCAAAATCACATCTACGCTTCCCAGTGGTAAGGGCAAGAAATCGTTCACTATTCTCAATTTTGCCAACGGGAGTTCCACATTCTTACAGATGCCAGCAGTTCTCATAGAATCCCCAGTTCCCAACATAATGCCATAATCATCGGACGGCTCCATCGGTAATTTCAATTTAGCAACAATGTCATCGGATATGAAGTTATGCGTAGCCCCACTATCGATCAATACTACTACTTCTTGTCCTTGTATCGAGCCTGTGACCTTTAGTGTTTTTGGTGAACTCAACCCTACCATTGAGTGCAAGGATAGTGCTGCCATGTTCGGTGTATCCACAGTATCATCAATAAGCTCAT
CTTCGAACGCATAATCTTGGGGATCATCGGTTTCTTCGGTTTCTAGTTTTTCTATTGCATCATGTACTACAAGTATCTCCAGAGCCTGCAGCTCCTTCTTTTTACACCGGTGACCCGGAACCAACTTCTCGTCACACCTAAAACACAGCCCCTTGTCTTTACGTATGCAGATCTCACTATCTGTTAAACGCTTATAAGGGATGGTGGGGGTGAGGGTGTTCGATTGTGTTGATCGGTTGGAATAGTTGGAGGTGGGGGTGAGAAGTTTTAGGTTTGTACTTCCAGCCACTCTTGCATTTACAGCCCCTGATCCCTGACTCGATTTAACTGTTGACGACGTGCTCGTTCCTCTTGCTTGATTTCGAAAAGCCAGGTCATTTTCAATTACCTGTGACATAAATTTCTTGTCCTGGATTCCCACTGGCCGTAGTTTTCTCATCTCACTCCGTATTTCTGCCTTTAGTCCACTCTCCCATTTTCCTTCCAAAGCACTCGCACTGATGTCGCGCATACCCTTTGCATATTTTTCAAACAACCGGCGATACTCTTTGACTATTTCCACCTGTTGCAGACTCATCAAATTGGCATATTTATTGTCATTGATCGTTGGTTGGAAACGATGCAACAACAATTCTCGAAATTCCTCCCACGAAGCAATCGGAGCTCGGTCCTCTTCATACTGCAACCACACCAATGCCTCCCCTTCCATACACAGGGCTGCGGCATCGACCCTTTCATGTCCCGTCAACCGATTAACCCAGAAGTACCTCTCAACTTGGCATAGCCACCCGTCTGGGTCCTCATCTGCAAGTCCTTTAAACACTGGCATTTCCAACTTCTGTAGCCTTCTATCAATCATTGGTCCATCCCTTCCACCTGCTCTCGGCCCCCCTCGGTCACAACGATCCTGCCTATCTCTTCAATCCTCCCGCTCGTTCCAACCGTCATGCGCTAACGACCTTTCATAGCCGGTGTGCACCCTCCTCTCCTCCGAAAACCCATTCCCCTTTCCATCGAACCAGACATCTCCTCGTCGATCGAAATCCGAACCCATATAATTTTCCCCATCGGGCAAGCCATGAAAATTTTCCCTCCGGTCCAACCCAAACCGACTCTGTCCGATCGGCCCACCCCGACCCACGCCATGGCCCGCGCCCCACTCTCTCGCGCCCCTGCCTAGGCCCACGCCCCACTCTCTCGCGCCCATGCCCAGGCCCGCACCCCACTCGCCTGCGCCCAGGCCCAGGCCCCCCGACCAGCCCCGCTCACCTGCGTTGCCCCACTCCCGCTCGCCCACGCCCACGCCTAGACCACCCCAGCCCCCTCGGCCGTCATCGCACGTCCGCTCGCCGCCCACGCTCGGACCGCCCCAGCCCCCTTGGCCGTTCCCGCACGCCCGTTCACCCTCGCACTGGACCACTCCACCCAGGCCTGCACCCCCCGAGCCGCAACCCCAGCCCGCGCGCCCCCTTGTGCCGCTCGCGCTCCCCGTGCGACCGCCAGTGCTCCCCGCATTGACCCCTAGCCCCGCCGGCCCGACCACCACGCCCGAGTCCTTCGCCGACGTTGCCTTAGCCCCACTCTCCTCTTCTGATTCCTCCGCTACCCCCTTCCCCTTGTCCACGACAATTAATTCGGGTTTCGACGGTTCCGGTTTGCGCTCTAATAGCTTATGTATGTTGAGATTCATTTGTCCCAACTCTTGGTCCAATCGACTCCCTATAGCCGCCAACTGAGCCTCAAACTCTTTCTGTATCTTCAACATCTCTGCCCCGTTTTCCCTGCAATCCTGCATCCTGGTCTCCATCTTAGTACTCACCATTTCCGGATCGACAAATGACTCTGATACCAAAATGATAGGAACTACCACAATATTATATTAAACTACGATAAATCTCCCGAGAGAATCTCTCAAAGAGCCAAGGTACATAGACGACAACTGAATGAATCCCTCATAGTGACCTATCATATCCTATTTATGGTAAACCTAGGATTTAACC
TAATTTAGTAAATAGCTCTATTAAGGTCCAAAAGCCCAATACAACTTAAATAACTAAAACAGAATTATATTTAAGAATAATAATAAAATAGATCCGTCACTTCTTGGTCCGCTTTGAGTAAACTTTCAATGATCTCGGGACGTATCATTTACATGGTATAAACAATTTTGTTTGAGAGTCAGTATCGACATCTCAAACAACTTTTTTAGAATTGAGGGATGGTCTATTTTTTATTAGATCTAATTTCTAATGATTGCATGATATGGAAAATCAGATAAGGTATTTTGTCCCAGACTCCCAAAAATTAATGATATCTTCATAGTTTCTGAACTTGTCAATGTTTCTCCGTCTCAGAGTTTAACATTGAAATGCTACATCAACTTGTGGTTTATCATTTATCCTTTCGTTTGTTTTATCACAACTAAATGGTTTATTATTGAAGATTTTTTAAAGATTGTTAGTCTTATTTGGCAGCTACAAAGTTTCTAGATGATGAATTATCTGAGACTGAGTTATTCATCATTAACATTTAGTGCTTCATTTCTAAGACCGGAGATTGCTTATTTGATCTCTAGTAATATATATCTCTATATTGTATTTCATGAATAAATGAAATGATTTCTTATCCAAAAAAACATTTAGTGCTTTATTAGATGCGTAAGAGTTCCAAGTGTTTCTTGCATGCAGCTACTTTCTATTTACTTTCATGTAGCTATGATTTTGATTATATTTCTTGAGGGCTCAAAAGTGATTTTTCTTTTAAACTCGAACACTCTACTTTGAAACTAATGTAATTTACTTTTTAATTTTTTTTTGTAGTGTCTACTCAAACGCTCTACTCTTAAGTTTTTGTAATCATGGATAGGATTCCTTTTGTAACTTTTCATTCATTTGTTTCCTAAAAAAATGCAGTGTGTACTTGTTACCTAATAACTATCTTCTGTCTCCTTATCCTTTTCTTTTCCGATTTCTGAAGGAAATCAGTCACATCTTTTTTTGCTTCCTTTGATGCCCTTTGTGAAGAAGGAACTGCAACAACAGTATGCAAAGCTCTTGATGAAGTTGCAGAAATATCTGTACCAGGTACAATTTTGTCACTTTAATCTGGTGTACTGTTTACTGTACAGTTGTCAGCTGCAATATACCCCTTCTGTGTTGTTTGCATTTGAGACATTATTTCAAAGATAATATGCTATCCATTACCACCTTACTTGAACAGGATCCATTTACAGGCTGTACCATTGTGTCCTTGAGTTCTATAAATATGCTATCCATTGAAATCAAGCTTTTCTAGTAATTATTTCACTACTTGAAGATAGGTTTCTTTGCTTTGCATCTAGTAATGACTTCACTACTGGAAGATAGGTTTATTTGCTTTACATCTGATCTATTCTTCCATAACTTGTAGCCATGAACATTAATAAGTTTTTCCCACTTTCTTGTATGTTGTTTATAATCTTCTCACTTTGATAGACCTTGCATCATTTACCCACTTTGTTTTATTAGTGGAGAAGTTGATAGTGGTTGTCTACTAAAACTTGGCAGGCTCAAAAGATCACATAAAAGTGCAGGGTGAAATTCTTGAGGGTCTAGTGGCCCGTATGGTGAGCCACGAGAGTTCAAAACACATGGAGAAAGTATTGGAAGAATTTCCTGCTCTGCCCGACAATGAAGGAGGTCTGTTGTTGCTGCCATTATTTTCTCCTTGTAATCCTTCTCGATTGGGACCCTTCATTTTAGTTTTATACCCTGTTTGCCTTAGGTGAATTTGATTTGGGACCAAGCCTGAGGGAAATTTGTGCTGCCAATAGATCGGATGAAAAACAGGTTACAAAAAAATTCCTTGGCATGAAATAGGATGAAATTTTCACTTCCCGTTTTATTAGATCAAAAGGGAATGATTTTCCCGTCAAGTGTAGCCTTTCTTTCAATCTCTGCTTCTGTAAAATATAACTGCAGCAAATAAAAGCACTTCTTCAAAACGTTGGTAGTGCCTTTTG
CCCCGTCCATTCAGACTGGTATGGCGATTCTCACTCAAGAAATGCAGACAGATCTGTTGTATCAAAATTCTTACAAGCCAACCCAGCTGATTTTTCAACTTCCAAATTACAGGTTAGTTAATTGTTTGGTCATGATAGGACAATGAATTTATTTTGTCAGGTATTTTTGTTTTCTTTCAATTTAAATGTTGCATTATCTGCTCAAGATTTGAGAAGTATTTTCCCTATGGCAGCTTGAAATGAAATTTCATTTCCTCTCCCTCTTCTTTTATAAATTATTGATTTTTTCAGACATACTAGTTACCTTCTGTATTAGCATCTTGAAGCTTAATGTTCACTCTTCTCGATAAGCTCATGAAGTACTTCAAATTCCTCCAGGAAATGGTTCGACTAATGAGAGAAAATCGTTTTCCAGCTGCCTTCAAATGCTATTATAATTTCCACAAAGTTGGTTCCATATCAAACGACAACCTTTTCTATAAAATGGTCGTTCATGTTCAAAGTGACTCTGCTTTTCGGAGATATCAAAAGGAAATGAGGTCATGCTTGGGAATTGCACTCTATTTATTTGCAATATTCAAATAATTATTTTTGCTGGTATTCTTGAGCGATTGTTATGTTTATATTAATAGTGCCAAACGTGTTGAACCTGTTTCAGGAACAAGCCGGGTTTGTGGCCATTATATCGAGGTAATAATAGCCAACCTTAAATAATGTGAATTCTTTTTTAGATTTCAACTACTTCAATTGGAAGTATTGTAGGCAGCCTCTTCACAGTTTGACCATGATGATCTTGATTCATAAATTTGATGTTCAACTTTGGTTTGGTTTTAGTATTATTACTAGCAGTCGATATTGTGTTTTAGCAGAAAAAAATGAACCCAAACAAACTTTTCATGGTTGAATTTTTTTTAGAAAGATCTCAGCCCTTGGATCCCATCAACAATAGAACTTTGAAGATTGTTAGAGGCCCTTGGAAAGCTATTGCTCAACACAGTCACTTGATTTATAACAATATTTTCTACAGAATTGGGAACGGGATGCACATGAATTTTTGGAAGGATCATTGGATTTTAGATGCCCCCCTCATTGTTCTTTTCCGAAGACTTTTTGAATTGTCTAACAAGAAAGGGGCCTCGGTGAATGAGTTGTGGGATGAATCCAAAACTTTCTGGAACTTTGGTTTTGGGAGGAATCTAAAGGACGAAGAAGTGGCTGATTTATGCAATCTATTAGATGCTCTTGATCATGTTCATCTGTCCCAAACGAATAACAAGGTTATTTGGAAGCTTGAATCATCAGGATTCTCTTGTGAGCAATATGGAATCCAGGCCTTTGGAAAACATTCCAGCCATATTTAACTACATATGGAAAGGCCCTTCACCTAGAAAAATCAAATTTCTGCTTCAGGGCTATTAACACAAACGAGAAACTTCAAAGAAGGCTTCCTCATTGGAATTTTTCACCAAACTGGTGCACGCTTTGTCGTCTGGACACAAAAACTCAAGGCCATTTATTCTCAACATGTTCTTTCTCCAGGCGTTTTTTGGACCGGATGCTATTCTCCTTCAATTGGTCTTGTCCTCTACCGATGGACCTCTATTAGACACTTGTTTACTGAGTCATCCATTCAAGAATAGGGCCAAAATTTAATGGGAGTCTTTCACTAGAGCCTTTCTTTGGGGAATTTGGTGTGAAAGAAACAGGAGGATCTTCCAAGATGTTGATGCCCCTTTTGATCGCTTTTTTGACAATATTGTTCTTACGGTTGTATCTTGGATCAAACGATATCATTGTTTTAGAAATTACTCTTTCTCATCCCTCCTCTCTAATTGGAAAAACTTTATGTTTTTTTGTATGGTTCATACGGGACCTCATATTTCATTCTTAATGAAATCGTTTCTTTATCCAAAAATAGAAAGATCTAGAAGAAAAAGACTAACTATGATAGACAAACGAGAAGTGCAGAAAAAACTGAGTAGTATAAAAAAACAAACTTT
TAGGATTGTGCACAATAGGTCCTTAAACTTTAAAATATGTTTAATATCTTCTTGAACTTTCATTTTCATCTAAAAGATGAAACACGAAACAAACTTTTTATTAAGAAAATCCATAGAAGCAAAGCTTGAGATTCCAAAGGAGGGAAAAACAAAAGGTAAGCCTCAAACAAGAGGGATAGCAACAAACCGAGATAAATACTACAAGTTAAGATATCTTAAAAAGCAAGAATGAATGTTATAGAGAAAGATAGATGATCAAAGTGCTAGTAGTCTTGACTCTTCCTACAATCATCTAGAGGAAGATAGATGCAATCTGTGTGATTGATGAAACCATTTTCCCATCAATTTTGTTCGAATTACAAAATGAGGACAAACAACATGCTTCACCACTTAAAGTACTAGATAAATTTGCTAATATTGTAAATATCTGTGGTAATCAATTGAAGAATGTTGAGGTAGTACTTGCTAAGTTTGGTTTTGAGGAGTGATTTTGCACCAGATGGGGTTTTAAGGTTTTTTCGCAAAGCTTTGGCGCACCACAAAATTATAGAAGCTCGTTGTTTTGTCTTGGAAACTGGCTTTGGCTTGGGCTCATTATTTTTCTGATTGAAGTTTCAAGCCTCAAGCCTCTGGCTATGGCTCGAGTCCCATTGTTTGGTCTCTTAGTTGTAGGAGCTTTCCAACTTTGTTTTAGGCCTGTCGATGGTCTTTAGCTCCGTGTCTCTCTCTCTCTTGTCTTTTCTTTGTATTTTGTTATTTCATTTGTAGTCTCCTCTCTATTTTCACAGGTTTTTTTATGTATTTTTTCCCTTGGTAGTTGTATCTGAAACTTTCGTCTCTTTTCATCATCTTATTCGATAGTTTGTTTCTTGTTTAAACAAAGTTGTTGTATTCCCAGGCTTTTTTATGGACATTAATTTATTCAAAGCAAACAAAGACAAGGCAGCTGAAATTGTGAAAGGTAAAAACAATTTGATGGAGATTGAAGGAAATGGCACCTTGGGAAGAGATGGATTTGCTGATGAAGATGCAAATCTGATGATCAAACTGAAATTCCTTACATACAAGGTATTTTCTTTTGTTAATTGTTCCTTGCAGGTCCATTTCATATTGGTGGGCTAGTTTTCTTGTTCACAATATGGTTCAACTGATAGTTGCCAATGGTGTGCCGTCTCTCTCTCTTTTGGTCATGGTTTGAGATGCATTTCTTTTGTTAAACATCACAGACCTTTCAGTTTTTATTCTTAATCCCTATTTGGTTAGTGATTGAGATGCATTTCTTTTCTTTGTAGTAATGCTGATTTTAGTTAGAGTATAGCGGTCACAAGGCTTTTGATTCAATCAAGTACCTTGAGGTCTGAAATATGAGATATTAAAATAACAAGTTTCAGAGTATCACAATTATTAGCTTAGTTTGGAATTAGGTGAATGTGGAGAGGTCCTAGATTAAAAAAGATTTATTTACGAGTATTTTTGTTGTGTTTGATACTACCAAATTGGGCTTAACCAATTAGTCTGAACTTTTCTTATATCTCCTAGTTACGGACTTTTTTGATTCGTAATGGCTTGTCAATTGTCTTCAAAGAAGGTCCAGCTGCATACAAGGCTTATTATTTGAGGTATGATTAATTGATTTGTTGCCTTAAAAACAAGAATGACATCTCATACTTTTTTTAATCAAGGCATATCATACTTTTTTCGCATAAATTTAGGAATGAATCAATAGTTGAGAGGGAAATTGTAGACTAGTTATGTTATGCATGATGGTATAGTTTTCAATATTGGTAGTCCGAACGAATGATTGAACTAAGTTCTTATAACTTAGGATGGTTGGAATTCTCTCTAACCTTAGCCCTCGCCATTGGGTTGTTCTTTGGTGGACTTGAAGAAATCAGTATGGATACTCTTGATCTAACTAATGTTTCAAAGGCTCATATTAAGGTTAGACACAATATTTGTGGGTTTTTACCCGCTACCATCTTAGTTTCCCACTCTATTAA
GGGAAGCATCTTCATAAATTTTGGTGACATTGTATCAATGGTCCCCCCAATTTGATAAAGGAAATTTCATCAATGAATTTAAAAATCCTCTGGATTTCGCTAGAACGAGGGATGTAACAGTTGATTGGGGCTTAGACCTTGATAGTCTACTTAGAACCGTTGTTTCTCTTGGCGTTCCTGTTAACAAGATGAACTTTAATCAAAATCCATGGCCTGTGAGTGCTACAGACCATCCTCCTGATCATTCTGTTCAGACAAAAAGAATCCACGTTGAGTCGCATGCTAATTACTCATCTGATCCGTTGATCCCCAAGCCCACTTCCTTAAATCCTTTCTTAAACCCAACAAACCACAGCTTAAACCCTTCGTTGGTTGCATTAAATTTATCCTTGTCGTTTGAGCCACTTTTATTTCCAGGCCCTTTGCATGAAATTCCCCATTAATCCATTTCTAAAATTTATTGCTACCGATTTAACCGCAGACGGTTGAACTTCAACCCCCTTCTCCTTCTATTTCATCCAACCAAAATGTTGATTCAAACTCCCCTTCATTAATTCAGCCCATACCCGACCAACCTTGCTCTGCATTAAATGCTCCCAAGAAAAAAACATTTTCTCATTGAGCCACTTCCTAGGCATTATTCACATGAACTCAAACAAACCAATAACCTCAACCTCTCCATAGCCAATTACCAATCCTTGAATCGTGACTCCTTAGACTCTCTTTGTTCCAGTACCAATCCTTGAATCGTGACTCTTTAGACTCTCATTTTTCCAGTCTATTGATATCTCTACCTTATTAAGGTGAGCAAGAACCTAAGCATGTGCCACAATCTTTCTCCTCTGAATGCAACAAAACAAAACTTCAATCTTTGAGCTCTGGAATTTCTATGGTTACAGGGTTTGTCATTCAACTAAATAAATTAGACGCTTCCCCAATTTTGAAACAGTTTGTCATAAGAGAACTTGAATTTTGAGTCAGACCCAAGCATTAGTAGTGTTGAATCATTGCAGGAAGATCTTGATATTTGCAATGTGTACCAGAAGGACAATTCAACATGTCTTTCAGATCTCTTTAAATTATCCACTCTTCAATTGGAAACTAAGTATTAAATAATATGAGGTTCAACCTCAAAATCAATTGGAAGTAAAAGGGAATAGCTCATATCTCTTATAAAGATGTTGAAGACTGCACACACTTCCAATGTGGGAAAAAGGCCACAAGGTGAGTATCTCCAACACGCCCCTTAAGATGGTGCCATATAGGGAAGGCCATCTTGGATTAAATCCAAATTTGTGGACCGAAATACCCGCTTTGGCTTACTCGTTTAGGTTGGATCCAAATCTTTGGGATCAATACCCACCAAGCTTATGTTCGCTTGGGTGTGGTCCAAACCACTCGCTTTGGTATCGAAATTTTGGGACCATTACCCACGTGGGCTCGGATACCAACTAGATTTTGGGAATAAAAAGTCCATTCATTATATTAGTATTGTACAATTGAGGGTTGAGTTCTTTTATAGTGAAAACTCCAAGAGCATAATTGAAAAATAACATGAGAATCCCTAAAATTAAGGAATATAAAATAAAATATGAGATACTATGAGATGTTAAAAAGAAATAACCAACAGTTAGGCTGTCACTCTACCATCTGACATTATCATACTAGACAAATTCTCCTCAATTATTGAGGAGTGTGGATTGCAATTGAAGGAGGTTTTAGTCAAAGTAAGTTTTCATCCCTATTCTCAATCCATTTTGTGTAACTAATGGAGATTGTGTCTTGACTCTTGGAACACCAGGTGTTTGGGGGATGCATTTAAAAGAGTAGCTATTAAAAACTATATTCTGCAATACCATCCAAACGTTGTACTTATTCAAGAGACCAAATTACCTTCACTTGATGGAATGATGATCAAATCCATTTGAAAGAGTTATTGATAGGGTGTCAGTCCAATCGTGGGGTAGATCTGGAGGCTTGTTGTCCACGTGGGGTCAAAA
TCGCATTAACATCATTGATTATATTCAAGGAGGCTATACCTTATCCATCAAGATCTCTTTTGCATCAAACAAAGTATGTTGGATCACTAATGTTTATGGGCCTATAGATTAGGTATTTATGGACTTGCCACTGGGTTTTGAAAGAAATGATGGTATGGCCTCAAGAAATCATTGTATGGCCTCAAGCAGTCACCTAGAGCATGGTTTGAGCGATTTGGTAAAGCATTAACGAGCTATGGATTTCTTCAAAGTTAAGCAGATCACACTATATTTTACCCACACTAAAAATGAAAAGATGGTTATTCTGAAAGTATATGTTGATGATATAATAGTGACTGACAATGATGATGTGAGAATAACTAGTTTGAAGAAGAAGGTAGCAAGTGAGTTTCAATTCAAGGACTTGGGACTGGGAGCCCTGAAATATTTTATGGGGATCAAATATGCCAGATCAAAAAATGGGATTCTAGTAAATCAAAGGAAATACGTCATTGATTTGCTGGAAGAAACTGGATTACTTGATTGCAGACCAACAAAAACCCCTATTGAACAAAATGTGAAATTACAAATTGCAATTGAAGAAGAAATAAAGGATAAGGAAAAAATATCATGGGCTTATGGGAAAACTTATCTATCTATCTATCTCATACATATCCTGGTATAACTTTTGCTATGAACACTGTGTATGGTTAGTCAATTTATGCATGCCCCCTGGTCTTGCTCACTTTGAAGCAGTGTATAGAATCCTAAGATACTTGGAAGGTACTTCTAGAAAACGTATGCTATTTAAAAAGCAAGGTCATCATCATGTTGAAATTTACACAGATGCTGACTGAGCAGGTAGAACTACAAATAGGAGATCAACCTCGGGTTATTGTTCCTTTGCTGGAGGAAACTTAGTCAAATGGCAAAGTAAAAAGCTAATTGTTGTGGTGAGAAGTCGGGCTGAAGCCAAATTCAGGGCATTAGTTCAAGGAATTTGCGAAGGTATATGGATAAAGAAATTGTTAGAAGAACTTTAATTTGTCCAAACAACACCTATGCATGTGTAGTGTGATAACAAAACAACTATTTCCATTGCCCACAACCTAGTCCTTCATGATAGAACCAAGTATATGGAAGTAGACAAACACTTCATTAAATAGAAGATTGAAGTTGGAACCATATGTATCCCTTATATGCCTATTTCAAAATAAGTTGTTGATGTCTTTACAAAAGGTCTTTTGAAGCGACAGTTTGATAAGATGATAGATAAGCCAACAATGGAAGACATCCTCAGTCTGGCTTGATGGGGGAGTGTTGGATATTGCCATTATTCTCGTATCATTTGTTTCCATATTTACTTGATTAGTATCTTGTAATTAATAATATTGTTATGTAGAGAACAATATTATATTGTATGATTATGTTTTCCTTATTTGTATAGTGAGGGTTTCTCTATTTAAGAAATCCCCAATCATTAATGAAGAATATAGTAATTATTCATCTCTTTGTATTTTGCACATTATTCGTAGTTAATTTGCTTCTTATGTTCATATCTTTATTTTGAAAATGTAGGCAAATGAAATTGTGGGGTACTTCAGATGGAAAACAAAGAGAGCTCAGCAAGATGCTCGACGAATGGTAAGCATACTATTGTTGATTCTTGGGCTACTTTCTTGTTCCACTAGAATTATGGGTACTTCACGTGAGAAAATAAATAAGTTGATGATATACTCTTAACATGCAGGGCTGTATTCTTGAGGAGGAAGTATGGAAATAAACAACTGTCATCAACTACTTATCTTAGTGAAGCTGAACCTTTTCTTGAGCAGTATGCCAAACGCAGTCCTCAGAATCAGGCTCTTATTGGATCTGCTGGCAATTTAGTTAGAGCAGAAGATTTCTTGGCCATTGTGGAGGAAGGAATGGATGAAGAGGGTGATCTACAGAAGGAGCAGGAGGCAACACCGTCAAGTTCACTGCACTCTGGGAAGGATGTTGTGCCTAAAGCAGATGGT
TTAATTGTGTTTTTTCCAGGTTATTATATTGACCATATGATTTGCCTTTGTTCATTTCCAATGAAGTAAAGCCTTTTGTCTTGAGTTTTTCCATGTCCAAATTTTATTAGCTAGATTAGGATCATGGATTTCTTGGAGGAATTTCATCTTTTGTTGTATGTGCGTGTTTGTCTATATTTATTTATTTAATTATTTCGACATATTCTAAAGTTCTTTAATTGCTCTTTTGTTAAGTGGACACAAACATAGTTCTTGTAGTTGTAGGCATGCCCATTTGCATCTGAATTTGTGTTGGCCTTTTCTGAAGCCTATATGAAATGTGCAAGTATGCAGTTTCGTACTTTTTTTTTTTTTTTTTTGACAAGACATTTTTCTTAATTATTTTTTTTGACCCGCCGCCATGCCCCCGCTCAAAATACAAAACATTAAGTAGGAGAAAAGAAAAAATAATCAGAAAATATGGTACTTTTAATATGCGATAAAAGAATTAAGAAAAATTTTCACATTTCACATTTCAACGATATTTAATCTCTTGAAGTCAATTCATCAAAGTATAGGTTGAATAAAGTTTGAGCTAAACTAGTACGTTTGTTACACCTTATTACATTAGGTTTACAAATATATTTATTGTCTTAATGTTTAAATTCTTTGGCTGCCATTGTTCTTGCATTTTGACTTGCTAGATGTTGTTATTTTAAATCTTGAGGTGGTCCTAAGATGCATTACATTTTTTATGCGTTTTGATAATTTTAGTTTGGTACAGTTTATTAAATACATAGTTCTCTAATGTATTATGTAACATTTTTATGTACACTAACCATTTTCTCTCGTTTGTAGACTTTTTATAGCTCTTATATATTGCTTTGAACTTCCATTTGCAGGAATCCCTGGCTGTGCGAAGTCTGCTATTTGTAGAGAAATACTTAATGCCCCAGGAGGACTTGGAGATGATCGACCACTTAATAGTCTAATGGGTGACTTGATAAAAGGTAATAATGCACGTATTCATTTGACTTTACAGATATTGAGCTAGTGGTTTTTGATGTTGTTGTTGCCTCTAAAAAAAGAGTTTACATTCAACTCAGCCTATGATCGTCAAAATTCATTTGGCATGCATCCCTCGATATATGATTGAACCTCCGACCCATAGGAAAAAGAGAAATTTCAGCCTTCGGCACACCCTTGCCACCCGACACTATTCTCATATCTCCTCCAAAAAAAGTTGCCTGAGACATTATGTTGCAATGCTTGCCCACACCACGACACATTCGTGTGCCATGGGGTAGCTTATTTAGGCCACATGGTCAAAGTGTGGTGGAGAGCTGGCATTCTGTTAAAGTACATTTTGGAGTTTTTTCAATATGACATCTTTACATGTCATTTTGGCAAGGGATCATATGCCAACATGGAGAGTTCATCAGGTGTTAGATTGACACCAAACTTGGCGAGCTTACATCTTAGACTTGTACGAACTTAACTTTAGAGTTGCTCGTCAAAAATCCATCTTGAGCTTTGACATCATTGGTATGGTTCAGGACCCATCGTGCCCACTGAGCCTTAACACTGACTTTGTTGACAACGTAGATGCTTGGTTCTTCATGCCTCCTCTCGACACTCTTAAACGCTCATCTTGGTGTGTTGGATGATACATCGTCCTCCTTGGTCGCATCATCGCGCTTCATGAATGTGGGTCGTTACAACATACTCTATGATTGTGACACATTGCTCTCGACCACTTCAATTCCATCTTTGTTAGTGCATTTTTGGCATCCCTCTTATGCCATCTTCCATTTTAGACGTGCGTATACTGGTTATGATGGACCTACTAGCCATCGTCGACACTCACCTTTGTGTTTCTTCCTTCTTTTCTGACAATTTTCTAGCATATTTGACCTTGTAGCTCTCCATCGTGCACTTACATGTGCTAGAGACTCTCTTTTATATTTGGCAGGTGTGACATCTCCTCGAACGATTGTCGGCTAGCAACAACCATTTTCGGG
GCATCTTCCTTCTTCATCGGCCTTCTCCAACAAAATTGAATACACAACAACTATTCTTCTTCTTCGACCATCTTTTGACGATCATGACAAATAAGCCTCATGTTCGATATTTAAGGCACTAATTCAAGAAATATTCCGACGACATTTTTTCCTTCTTCGGCAGCATTGACATGCAAGCATTTGCTGACACCCACAATTGTCTCTCTGTTTTCTTTGGTCTAGGGGCTTGTATAAATTGGTTTATTCTCTTGTTTTTGCCGTTTATTTTGGCCTATGTAATCAGCCTCTTCTCACCCCTATCTCTTTTTCTTCATTTTTATTGCTTGTTTCAATAGGTTTCCTTGTAACCTTTTCACTTATCAATGAAATTATTTTGTTACCTATCCACCCCCTCTCCCGTAAGGCCTAACTTAGTCAATCTCATGTTCAATAGGCTTATTCTTCGAATTGAAGATAATCCAAAAGCTAGATGAAGAGTTCGTACAGTGTTCGATTTTTTATATTTTGATTTTTGGGTATTTGGGAAGAAATTCAAATGAAATTTCGGTAAGAGAACGTATTAGCCTTTGGAAGTTTGAGGCATTTTTTTGTCAAATGAAGAATGCATAGCCCATAAGTTCTTGGATTGGGTCTTTTGGCCCCCTTAATGTGTTTATTTCATAGTTCACTTCATTCTAGCTTACTAATTGCTTTGACCATTCTAAGCATGCTTTTCCACTCAACCACTTCCTTCGAAGGATAGGATTTTTTTTTTTTTTTTAATTTTTTAATTTTTTAATTTTTTAATTTTTTTTTTTGACTTTTTGATTTTTTTTTTCTTTCTTTCTTTCTCAAGAAAGTGGATGTATTGCATTGAAATGATGGATTTGCACTTTTGGCTTTGGAAGAATGCAATTGTCGCTGGAGATTTGCATCTTTGCAGATGAAAATGCTTGATTTAGAGTGTTATACTTGTAGTTTTAAATGATCAACTTGTTTCTTATAGAGTGTTATGCATGTAGTTTTAAACAATCTACTTAATACTTGATGTATACAGGAAGGTATTGGCAGAAGGTTGCTGATGAACTTAGGAGAAAACCGTACTCCATAATGCTTGCAGACAAAAATGCACCGAATGAAGAAGTGTGGAGACAAGTAAACCTCTCGTTCTTTACTTTGAATAGTACACAGTGGAATCGAAAAAAATGACTCTCTCTACCCCCATCCTCTTCCCTCTCTGTATCCCACTTGTCATGGACAAGTTTCTTATTTTCTATTATATTTTTGTCTCTTTTAAAATAAATGACTATTTTTTAGAAGTACGCTAGTACGTTACTAACTTACTATTGCACCTGGTTATGAAAAAGCCCAATCTATTTTGTGATGGGAGAAGCCAGAACGTTGTCTTTAACTTCTGAGGATGTTCTGCTTTTCAATGTGTGTTCTTATCAAACTTGTTGTGAGGAATTGCTAAATTTGTGCAGATTGAGGATATGTGCCATAGCACTAGAGCATCTGCTGTTCCAGTTGTACCTGATTCTGAAGGTTATTTTCTGATAGAATCTACCACTCTATTTTATTAATCTATGATAAAAACCTCTCGAGAGAATCTCTCAAAGAGCCACAATACAATGATGATAATTCAATGAATGAAAAGAGTAACCTATCCCTTCCTATTTATAGTAAACCTAGTATTTAACCTAATTTGGTAATAAACTCTATTAAGGCCCATAAGCCCAATACAACCTAAATTCGCATAATAAAAATATATTAAAATAGTAATAACTTCCTTCTTCACTTCTTGCTCCGCCTTGAATAGACTTTTAATGATTTCAGGATCGTATCAATACTTCCCCCCCACAGAGCCACCTTGTCCTCAAGGTGAAAGTCCGGAAATTGCAATTCTAAATCAGCCGTGGATTCCCAAGTAGCATCATCCGGCGAGGATCCTTCCCACTGAACCAAAACCTGGCGTGAGCCGGCTGCCTGCAAATCCTCACGCACGCCCAAAATAGCCTTAG
GGCTAATCACAATGCACAAATCATCTCCCACCAAGGATGGTGTAGACATGACGAGTACCGAGGAATCTACCGTTTTACGCAGAACCGACACGTGAAAAATCGGGTGAATCTTAACTGTCGGCGGCAATTCCAGACGGTATGCAACAGGTCCCACTCGTGCCAGCACACGGTAGGGACCGATAAACCTCGGCGCCAACTTAGGGTGTTTAAATTTAGCCAAAGAAGACTGGCGGTATGGTCTAAGCTTAATGTACACCAGGTCATCAACACTGAACTGTATATCACGGCGTTTGGCGTTTGCGCGATCCGTCATAGACTGATGCGCACGCGACAAACTAGCCTTTAAGGTTTCTAAGACCCAGTCGCGCTCCATCATCATCGAGTCCACAGTCGCCACCGGGCTAGCTCCATGATCATATCCCAAAATGGAAGGAGGGGGTCTGCCATAAACAATCTCGAAAGGTGTCATATTCATGGATGAATGAAACGAAGTGTTAAAACTGAACTCGGCCCAGGACAACCATTGGTACCACGCCTTAGGTTGATGCTTCACAAAACAGCGCAAATACGATTCTAAGCAGCGGTTCACAACCTCGGTTTTCCCATCAGTCTGGGGATGGTAAGTAGTGCTACGAGAAAGTTTAGTTCCCGTCGCTTTAAACATCCCAAAGTAAACTCGTAAAAACCTTATCACGATCGGACACAATACTTTTTGGAATTCCATGCAGACGGACGACCTCCTTTATAAACACCTTTGACACTGATAATGAAGAAAATGGGTGCCGAAGAGGGATAAAGTGAACATATTTTGAGAGACGATCAACCACCACCAGTATAGTATCATAACCCTCTGATCGCGGTAACCCTTCCACAAAGTCCATCGAAATGTCTTCCCAAACCCGTGCAGGGATCGGTAAGGGTTGTAGCAAACCAGCAGGAGATAACGACAGGTGTTTAGCTTGAACACAGACAAAACATTCGGCTACAAAAGAACGCACACGGGCCTTCATCCCTTGCCAATACACTTCCTTAGCCATACGCTGATAAGTTTTTAGGACTCCAAAATGTCCCCCAATAGCTCCACTATGGAATTCAAGCAATAATAAAGGAATCGTTGGAGATGTCGGGGGTAACACCAGTTTGCCCTGATATAGTAGTACAACACCAACTACTGAGTATCCAGGAGGGCCCATATCACCAGCTGTCAAGGTTTTATAAATCGCCTTAACTTTTCATCTTCCTTAACCTGCTGTGTAAACACTGCTGTGTTAATTCCAGCCACCACACTCAACATGCCCAACTCGCATGTCAAAGGCATTCGAGAGAGAGCATCCGCGGCTCGATTTTCAAGCCTCTTCTTGTATTCGATACTGAAGTCATATCCCATCAGTTTGGCAATCCAACGCTGGTAGTCGCCATCCACTACATGCTGTTCGAGTAAAAACTTAAGACTCTTCTGATCAGTGCGTACAACAAAATGGTGCCCCAAAAGATAGGCTCACCAGCGCTGAACAACGAACACAATTGCCATCAATTCGCGCTCATACACAGGTTTCACGCGATGAGTAATTGGCAAAGCCTTACTAAAGTATGCTATCGGTTGGCTCTGTTGCATCAGAACTGCTCCCACTCCTATTCCCGAAGCATCGGTCTCTACCACAAATATCTGGTCAAAATTCGGCAATCGCAGAACTGGAACACTACTCATGGCATGTTTCATTCTCTGAAAACTGTCTTCTGCAGCCGGTCCCCATTCAAACTTCCCTTTCTTCAAGGTGAAAGCCATAGAGCCATAGTTCGCCACAAACCAACGATAGTATCCCGTCAACCCAAGGAATCCCCGTAGGTCCCTAATGTTTCGAGGACTTGGCCAGTCCATCATTGCTTCAATTTTTGCTGGGTCGGCTAAAACGCCATCCGCGGATATGAAGTGCCTTAAATACTCGATGCGTCGCAACCCAAATTGACATTTCTTGGCATTAGCAACAAAAGCATGTGTC
TGTAGCACCTCAAAGACTCGAGCTAGGTGCTCTCTGTGTTCCTGAATAGTCATGCTGTAGATAAAAATATCATAAAAAAATACCAGCACAAACTTACGCAGATAAGATCGCAAAATCTCATTCATTACGGACTGGAAAGTTGCTGGAGCATTTCGTAGCCCAAAAGGCATGACTACAAACTCATAATGTCCCTCATGGTCTGGAAAGCTGTCTTGTGTATGTCGGTCAGTTTGACACGGATCTGATGATAACCAGCCTTCAAATCAATTTTCGAGAATATGGATGCGCCATGAAGTTCATCTAACAGTTCATCCGCTAACGGTATCAGGTATTTATCTGTCACTGTAGCTTGGTTTAAAGCCCGGTAGTCGACACAAAAACGCCAACTTCCATCCTTCTTTTTAACTAACAACACAGGACTTGAAAACGAGCTCGTGCTAGGGCGAATTACCCCTGCTAACAACATTTCCCTCACCAATTTTTCTTTCTCATTTTTTTGGTTCTGTGGGTACCGGTACAATCGAACGTTCACTGAGCTAGTTCCCGTAATCAATTCTATCGCATGATCACGATTTCTTTCTGGAGCTAAATCTGCCAAGGATTGAAAAACAGGTGAAAAAGATTCAATTAAAGAGTGTAGTTCTCGTGGTACCTGGGATAAATCGGGAAGCATTTTAGTCGGTCCACATTCAGGTGATGTCGCCTCAATCATGTTTAACTCGACCAGTAAGCCCTGGTCCTCGGGTCGTAAAGATTTCATCATTGATTTCAGTGAGACTTGTGCGTTGACCAAACGGGGGTCCCCTTGCAACTCTGTCTGCCATGATCCCAACACAAATTGCATTTTCAGGGAGCTGAAATTAAATTCAATCTTTCCCAAAGTCTCTAGCCACGCCACCCCCAAAATCACATCTGCGCTACCTCGAGGTAAGGGCAAGAAATCATTGACCACTTTCAGTTCAGCCAGATGCAGTACGACATTCTTACAGATCCCCGCTGTTCTCACAGACTCGCCGGTACCCAACATGATGCCATAATCATTTGAAGGCTCTACTGGTAAATTTAGCTTTACTTCTATTACGTCGGAAATAAAGTTATGTGTGGCTCCACAATCTATAAGAACTACTACCTCCAAACCCTGGATCAATCCGGTGACTTTCAAAGTCTTTGGTGAACTCAATCCCGCCAACGAATTTAAGGACAATGCCGCTAAATCCCTTGCATCTCCCGTATCATCAATAGTTTCATCCTGGAGTGCCTCGTCATTTTTGTATGATTCCTCGTGGTTAATTGTATATCGTACTACTAATATCTCCAATGACTGAAGTTCTTTCTTTTTGCACCGATGCCCCGAAACAAACTTCTCATCGCAACGAAAACACAATCCTTTATCTTTTCGGACACGGATTTCACTATCCGTCAAACGCTTATAAGGCATAGTGGATGCTATACTATTTGTAGCTGGTCTAGATGTAGAGAAAGTTGTAGTTCTCAAAGTTGAACCACCTGCCCCCTTTGCGCTAACTACCCCAGATCCCGTACTTGATTTACTTACTGGTACCGTGCTAACTCCCCTTCCTTGGGCTCGGAAAGCCAGATTATCCTCAATTACCTGTGCCATAAATTTTTTATCTTGAATTCCAACAGGTCGCAATTTCCTCATTTCACTTCGGATTTCTTCTTTTAACCCGCTCTCCCATTTTCCCTCTAGCGCACTGGCACTAATATCCCGCATTCCCTTGGCGTATTTCTCAAACAATCGCCGATACTCTTTAACAGTGCCTACCTGTTGCAAGCTCATCAGCTTAGCGTACCTGTTGTCGTTCATGCTCGGTTGGAAACGGTGTAACAAAAGTTCTCGAAATTCTTCCCAAGATCCGATCGGAGGTTGATCTTCTTCATACTGCAGCCATTCCAGGGCCTCACCCTCCATACACAAAGCTGCGGTGTCGACTCGTTCCTGTCCATTTAGCCGGTTGACCCAAAAATACCGTTCCACT
CGACACAACCACCCTTCCGAATCCTCATCGGTAAGCCCCTTGAAAACATGCATCTCGAGCTTTCGTAACCTCCTGTCAAACACTGTACCCTCTTTGTCTCCTGATCTCGGTCCTCCACGTTCATAACGATCCTCCCACCCTCTCCAGTCCTCTCGGTTCCCCTCTCGATTGTAGCCATCCCCCCCTCGGTCGTTCCAACCGTCGTGCGCTATCGACCTCTCATAATCGATGTGCACCTTCCTATCTTCTGGTGACTCGTGGACTCGGCCGCCGACCTTTCGCTCGCATCCCGGTCCCCCACTGTTACCCTCATCGCCAAACCCCGGATACCTCTCTCTTCGGTCCAACCCGACTGGCCCAGACCGAACCTATCCCGACCCGGTCCCCCCTAGCCGCCGCCGTGCCCCGCACGCCCCCGCATTCGAGCCCCAGCCCACGCCCTCTGCGCCGCCAACGCCCGACCCACAGCCCCCACGCCCCCGCTCGCCCCCGACTCCGCACGCCCACGCCCTCCGTTCTGCCAGCGCCCGATCCCCAGCCCGGCCACCAGCCCGCGCGTCCCCCCGCGCCGCCCGCGCTCGCACCCCAACCCGAGCCCCATACCGCGCGCCCGCCCACGCCCGTACCCAGCCCGTCTCCCTCTGTTCGCACCGAGCCATCGTCGCCCTTCGCCACGCCCGACTCCGCTGCCGGCCCAACCCCGACAGCCCCCTGCGTCTCCCCCCGCACCAGCCCTTGTTGCCGAGCCCACCCACAACCTTCCCGATTCGGTTGACACTCCCTTACCCTTGTCCAAACCGCTTGTCTCGGGATTTGGTTGTTCCGGTGTTCGCTCTAGCAACTTTTGTATATTTAAGTTAATCTGACCTATCTCTTGGTCCAGTCGACTATTCATCGCTGCCAGTTGAGCCTCGAATTCCCGTTGCTTCTTCAACATCTCGGCTCCACTCTCTTCGCACTCTTGAACCCTCGTTTCCATCTTGGTAGTCACCATTCCCGGATCGGTAAATGGCTCTGATACCAAAATGATAGAATCTACCACTCTATTTTATTAATCTATGATAAAAACCTCTCGAGAGAATCTCTCAAAGAGCCACAATACAATGATGATAATTCAATGAATGAAAAGAGTAACCTATCCCTTCCTATTTATAGTAAACCTAGTATTTAACCTAATTTGGTAATAAACTCTATTAAGGCCCATAAGCCCAATACAACCTAAATTCGCATAATAAAAATATATTAAAATAGTAATAACTTCCTTCTTCACTTCTTGCTCCACCTTGAATAGACTTTTAATGATTTCGGGATCGTATCATTTTCATTATGACGCTCAAAGTTTGGCACTCAGTTCCTTGTCTCATGATATGTGGGGAGAAGTTTTTCTGTCAATTGCTTGATACTTGCTTGTATCAACAGCACATCGACTAAACCTAATATTGGGATGGCTTTTGAGAAAAAGTTGCTTTTAAACCTGGATTCAATTCTGGCCATTGAACTTGAACTAGTGATAACTGAGAAATGATTTGGATGAATCATAAATAATTTGGTGGTTCTAGCTAGATAAGTTTCCAGCATCTTAGGCAGATCTACATACTTCAGGCATCTAGAACTGTAAACTATATAAAAAAGATGGCGTGGTATTATGATGAATATGAACTGAGAAGCAGTTTGGAAAGGGTTCGTTGTGTAATACACAGCAGAAAGAAGTAAATAAAACAAGGTTGGAGAATTGAAACCAAGGAGCCAACAAGAGCATAGCTCAATTGGTATGAAATAATGCTTATGACCAAGAGGTCATGGGTTCGAATTTCCCACTCCCTAATGTTATGTACTAAAAAAAAGAATTGAAACGAATATAAAGTAAGATAAATGAAATTATCTCTCTCTTACATAATTGGTCATGGCTACTATATTAATTATGCAAAAGTAAGAAAAATGAAATTATGGAGAGCATTACTGATGTGAATATAAAGAAAATCAGGATTAAATCTGGAAGA
AAGCACAAAAACTGCCTCTAGCAAAGACTCAGCTTCCAGCTCATTCTATCATGGAGTTCTGTTGTTTCATGGCATTTGGGAGGTTATATACAAAATTTTGTTGTATGTAGGTTCTGTTAAATGGGAATTGTGTCTATTGATATTGTATATGGTTTTGCTGGAAACAGTTATTTTGGTTACTTGCCATCCACATTCCACAGAATAATTCTTATTTATGTCCTGCTGAATGTGCAGGAACTGATTCTAACCCCTTCTCTTTTGATGCTCTGGCCGTCTTCATGTTTCGTGTGCTGCAAAGAGTTAATCATCCGGTATGTGCTCTATGATGATACTTGAAGTGTTGAAGTACCTCGTTTATCATCAACAATTAGGTTTCTCATTTTCTAATTGTTCTACTAATGCTCTGTAATTTTGAAGGGAAATCTCGACAAGGCATCTCCCAACGCAGGCTATGTTCTTCTAATGTTTTATCACCTTTATGAGGGCAAGGTGTGATGTTAACACTCCTCATTGCCTCATTTTATACTGAGTTTCACTTTTAGTAACTTGATTATTTTTCTTATTGCTTTCAGAGTCGCAGAGAGTTTGAGGGTGAGCTTATTGATCGTTTTGGATTTCTGATTAAGATACCATTGCTGAAATCTGATAGGTATAAAACTCATCTTCCGCATTGGTCTTTTTTTCATTATTTCTCCATCCTAGAATCTCTTTGCAGTTTGGTCCTTAGTTGTCAAGATTTTTTTATAGGATAGAAAAACTTCCATTGAAATATTAAATGTGTACAAAAAGTCCTATCATTAGTTACAATAAAACTTCTCCAATGAAAAAGAAGAGTAGAAAGACTACAATCTTTAAAAGGATGAACAGTCTAAGTTCATGAGAGAGTTAGAAGCTAATTGATTCAAGGAGACTTCTTAATGGTTGTTAATCTGAGAATATGCTATCATTACAAGCCTTCCACAATTTCTACCAACAAGCATACGCGAACATACACCAAAGAACTTTCTTTTAAACGGGTGCTCTTTAATGGAAATGTTCAGAAGAGATTGTATGTTTCCTGGCCTTGTGAAATGGCAGCAACTATATAGGTGGATTTGGGTCTCCATATTGTTGAAACACAAAGTACAACATTTTGGAGAAAGTGCTCTACACTCCAAGGGGACTTTATTTAAAGGATATCATAAGTGTTTATCACCCCATGGCTGATCTCTCATTTGAAGAATTTCACTTTCGTAGGTTGTCCTTCCCAAAGTTTGGTATATTATATACCATATATAATACTTATTATGATTATTTCTCTTTTCCAATTTTTTAAAATCCTTTTGCATTTTTCTTTCTTTGTTTTTATCTTCTCGATTTCAAGTATTCATTTTTAAATAAATTAGAAAGAGGTACCATAAAAAATCCTTCCATTTTCTAAAAAGGTCATAAGAGAGAAAATTTACTCATCTTTTTGACATGTGGTTTTTGTGTGACAAGGTTTCTACCTTAGCCATTTTACTTTAGTGATCTCTCTTAAATCATGTAGCTTCTTATTTTAATTTGAAAATTGACGGTTATAATTACAAATTGAACTGTTTGTTGTTTTTTTCTTCTTTTGCTCCTCTTCAAAACATTCTGATTGATTTTGCAGTTGTACAATGTAAATATGTAATATTTTTCTGCTCTTGAACTACAGGAGTCCCTTACCTGATAATTTGAAGACTGTCTTGGAGGAAGGATTAAGTCTGTATAAGCTCCATACTAGGAGACATGGAAGGTTGGTGGCTTGTTCGAGTATTTATAGATGTTTTTAATGTTAAACTTTGCTGTCCCCATTTCCCAAGGGATGACGTGTTGATGACTTGAGCTTTGAGGGTATGCTCCACTCAAGGTTCCAGGTTTGAAACTCACTTGTCACATTACTCTTTCGATGTCTGTGGTGCCTGGCCTAGGGACGGGCGTGGTTACCTTATTTCATAAAAAAAAAAGGTCCAACTTTGCTGTCTGAAAGTTTCTGT
CATAGAATATGAATAAATAATCTACAATGTGCTTTCTTGTCAGCTACCTTACGCATATTTCTTGTACTTTTGCAGGGCGGACTCTACCAAAGGTTCCTTTGCAAAAGAATGGGCCAAATGGGAGAAGCAATTGCGAGAAACTTTGTTTGGTAACACCGAGTATCTCAATTCTATTCAGGTGCATTTGTCTTCCCTTTAAAAGATCCAAATTTTTGTCCATGAATCAAACTCAACTTACTTTGTATCAAACTTCTGGGGCTTTAATAAACTTTTTGGTTTATTTTATGTTTCTGTATTAAAACATTGTTTGGTAGCCTTATTCACGAATACAAGTGTTTTGGAGATTTCAGCTTAATTTCTAACACCACAAAATGTAACTTCGGAAGTTTTTTGTTTTACCTTTGGCCTATAGGTTGTTATTTCCTGTGTTTTTTAGGATTGTTTGGATTGAAAGCTTCTTGTTTCCTAACCCAAAAAAAGATACTTATGGTTAAAATTGGCTAAGTTTGCTTCAACAATAGAAAAAATAATAACAATGCTGATCTTTTAGAAGACAGAAAAACTGAAGAAGAAAGGGTATGCTGGATCCTAAAGTGCAATTCTTGGCGCTAGTTCATGTATTAAATTTTTTCTTTTTTTTAACAAGAACAAAAATTTTTGTTGGAAAAACCTTAACACCTATACAACCTTTTTTCAATTAAAATTTTAATAAATAATCCATTTAATTATTCAGGTTACGTTTGAGTTTGCTGTTCAAGATGTGTTGGAACAACTGAAGAAGATATTAAAGGGTGACTATAAAAACCTTACTACAGATAGGAGGAAGTCAGCGACCATAGTGTTTGCTGCTGTCAGTCTTCCCATTCTGGAGATTCAAAATCTTCTTGACACTGTAAGTGAATATATTTCGGAACTATAACTCCCATACCAAGCCTCCTTTTCAACGTGAAAACTAACCATTTAATTTTTTTTAACTCTCCGAGTCCAAAAGCCTAAACTAGCTAGTATGTTTTTGTATTTTTAATATTTTTAATATTTTCTTATACATTCTTATCACTCCTCTCTCTTTGGATTTTGAAATTATTTCAACAGCCAACAAAGAAAACATTAAATTTTTCTTACCTAGTTTGGTAATAGCTTCCAAAGCTTCCTTTCGTTTTCTAACAGACTAGCATGTTATTAAAGTATCTTGTATAGTTGCTTAATCTATTACTGAATCGAAGTTGCCAAGATAATGCGGGAGATAGGGAAGCCCTCATAGAAAAGCCCACAGTTATTTGCTTTACAAATTTGTTGGTTAAGCCTGTCATATATTATGACTTTTCTACAAAAACAATTGTTTTCTTGCCCACGGGTCATCTCATGGATTATGGTGCAAGTTGCAACTAGCCAATATCTAGATCATAATCACAACTTCCCTTCAGTTTTATTTAGTTATGTGCAACAATCTTTCTACTTGTGGAGCAAAGCTTCTTCATCGACGTGGGAGTGTATCACACCATTACAGTGTGGAGGTCTCTTGATTGTGGGGATTCCCTTCACTTCACCTCTCTCAACAACAAGCTTTGTGTCATCTATCCTCTAAAAGTAACAGAAAAACGCTGATGAGTGACAAGATTCTCTAGATAAGTGATTAGGCAGTGTTTGGCTTTTCAAATCCTGTCATTCAAATCCGTTGTCTTTTAAAATTTCTACTTGCATGAATAGGGTGTATAGTTTTGGAGCTTGATAAGGAGATGACATTTGATATGGTTGATTGGAACTTTCTCGATGTCATTCTTGAAGCAAAGGGCCTGAGGAATGGATTACAACTTCTCTGTTATTATCAATAGAAAGCCAAGAGGGAAGATTGTTGCATTAAGAAGATTAAGACCGGTAGACCTTCTATCTCTCTTTCTTTTCATCCTTATTGTGGATTGCCTTAGTAGACAGCTGATTTGTAGCCGAGAATGTAGGTCTCATCGAAGATTTTAGAGTGAAATCCTCCACTGGCTCCATGT
CTATTACACACTTGTAATTTGCAGACGATATTATTGTTTTCTTCAGCCTTTAAAAGAATTCTTCGGCTAATTTGATCAACATTGTACAAAGTTTTGAAGCATCATCCGGGCTAAACATAAATGGTCAAAGACTGCTTTTTGTACCCTTGATATGGACCAAATCAAAGCTACTGGACTTGCTGCACATATGTGTTCAAACAAGAATCTTGGCAATCAACTTACCAGGGACTTTCCCTTTATGGGAATCCTCGATCTATAGCATTTTTCTATCTGATGATAGAAAAGATTGATAGGAGATTTAAGATTACATAATTGGCGAACCACTCATATTTTCAAAGGAGGTAGATTAATGCTTATCAATGCTACCATCACTAAACTACCCATTTACTATCTCGCCCTTTACCTTGCTCCTAAAAAGGTCACAAATGTGATTGAAAGGATGTATCGAAACTTTGTATGGAGAGGATCCAGTGGCTGCAAAGCTAGTCACTTGTAGAATAGGGAGGTAGTCGAAATTCCCATAGAAGATGGGGGACATGGGATGTGAAGAATATCTCCTTGCTTGCAAAATAGCATTGAAGATTTTATTTAGAGCCACTAAATATGGATCCAATTATCTAGATTTCAAATCCGGTGTCTGTAGAGAGACAGATAAATGAAAGCACCTCATAATATACAATTCTATCACCCACAAGGTGGCAGCGGGCAAGGCAACTAGTTTTTTGGAAGAAATTTGGATGGGTACTGTACTGCTTCCCTCAAAGCAACCTTTTCCTTGCTTTATAATTTGTCTTCATGCAAATTGGCTAAGGTTGTAGACCTTTGGAATGCTGAAAATGGTATATGGAATCTCAATCCTAGAAGAAATTTAAAACAGTGAGATTGAGGAGTGAGCAGAACTTTCTCACTTGCTTGGTCCTCCTGTATTGATTGGACATTAGAGAAAAATGGAAACCTTACTGCTAGATCCCTGTTTCACAGATTGTCATCAACACAGAACCCTCCCCATTGTGATTATTACAATGAAATATGGAGGGGCCCTCAACCAAAAAACATGAAGTTTTTCAATTGGGAGTTAAGTTCTAAATACCTTCGACAAGTTGCAGCATAGAACACCGGGTATGGCTCTCTCCCCCAACTGTTGTCATATGTGCTATAAAGGGATGGAGACACATATCCATCTTTCTAGTTCTTGCTCTATTGCAAACAACATTTGGAATTATCTCTATCAAGTCTTCGATTGGTCGACTTCGAGACCAGATAAACATTCAGAACCTGCTTTTATACTCCCTACACGCACACATTAAAAAATGTGAAGAAAGTTTTATGGTGCTTATTGATTATTGCTTTTTGGTTGAAGATTTGGAAGACTCGCAAGGACGTTTTTTTTACAGACGATCCACAACGAAGGGAAAATTTGATGGATTCTGTTACTTTCCTTTCTCTATTTAAAGAAAAAAAAAATGTGCTGCAATCTTATATATAGCAATAAAGAATTGGGTTATCTCCCACCCTCATTTGTTGAACTAAAAATAGAGCCATGTGGTTAAAGGTTTTCGATGAAAGTTTGATATTTTCATTTAAAATAATAAATAAAAATATAGCAATAAAGCATACATGCGATTAGGAATAATCTATTTGTTAGGCTTCATAGTTATAAATAGAGAGAATGGAAAATGAAGAAGATAGATAATATTTTGGTAACTTCCGTCCTAGGATTGAGAAAATTCTCAAGAGGGGACTGTGACAGTCCAAATACCTCAAACACTTGTTTAAATGTAATTCTATTATCTTTTATCTTTCAATATATTTGGATTATATCCCATATGTTAAATTGTCTTGTAGTTTCATCATCTGTATAATTACCACTATGGAATTCTTACGTTTTCCACCACAATTGTACAGCTAGGCAAGATAAATCCCCATGTTGGAGGCTTCTTTAAAAAAAACTTGAAGGACTTCACGCTTAGAGAGGTCCATGTGACACTCGCACACAAGAG
AGGCCATGGCGTGAAAGCCGTAGCTGACTACGGGATCTTCGAAAACAAAGAAGTTCCAGTGGAGTTGACGGGCCTACTTTTCTCAAACAAAATGGCTGCCTTTGAAGCCCGCCTCGGCAGCATTGAGGATGAAAGAGTGATCTCCAAAAATGAGTGGCCTCATGCGACAATATGGACAACAGAAGGGGTTGCAGCAAAAGAGGCTAACACATTACCGCTGTTGGTTCCAGAGGGAAAAGCAACTCTCGTTGAAATCAACCCTCCCATAGTCATATCAGGAAAGGTGCAATTCTTTTAGCCTTTTTCCTCCCTCCATTTCTCTAGATGGATAGGAAATGTCCAATGCATACAAATTTTCTCGCTGAGGTTTAGGTTTAGATGCAGTTCATAAGTTTTGTAGATTGTATGACAACATTTTTGTATGTATAGACCAACTTCATTTACAGGAAAATAAGTGGAATTTTGTTTCAGGTACCCCTATACCATTGAGCTATTTCATTTACAGGAAAA
mRNA sequence
GGTTCACTGTTATATTTGATAATTTAGGCTTAAAAAAAAGTATTGAGTACAGTGTATGATATCGCAGCCCGGAATCGGAATTACAATCGGCGGCGATAGAGGGGATTGGTTCCGGTTAGGTTTCCGGCCAAAAAGAAACAGTAGCGCTATTCCATTTTGCATATATCCAATTTTCCTGCTGCTCGAGTTGCTCTCCATTTTGTGTGAATGTCGGTATCGCACAAGAGGGTTCTCTGCGCTATTACTCTTCCTTTGTCTTCCTCTTTGACTTTCAGTTCCAGGACCCGTTTCTACATTCCCCACTCCTTATTACCTTTCAAAGCGTCTTCCCCATTTCCCCTTTCCTGCCATTCTCCATTCATCATGCCTTACAATCAGCCAAGGGGTGGTTGTAAAGACCAGAAGTGGAAAGCGAAGACGAAGGCCGACAATACTTCGATGGATGCTGCTGAAGTTGTTACACATGCACTCAATAAATTGAGTGTCACTGAAAGTGGTCAACCTCATGTTCCTATTTCAAGTACGCAGTTTGGAAATGTGCAACTCACAAACCAGGTCCCTCCTGGGGATGGTCATAGAACAATTTGGAAACCAAAAGCATACGGAACAACCAGTGGGGCTGCAGTTGTCGAAGCTGAAAATGCATCCGCCGGTCGAACATCTATTCAAAACAAGGAGAACTCTGCTGGACTGGCTGCACAGAATGACATATTTAAGGGAAATCAAATAGAGAAGTTTACTGTGGATAACTACACTTACACACATGCCCAAATTAGAGCTACCTTCTACCCAAAATTTGAGAATGAGAAGTCGGATCAGGAGATTCGATCAAGGATGATAGAGATGGTATCGAAAGGCTTAGCAACATTGGAGGTTTCACTAAAACACTCAGGGTCTTTGTTTATGTATGCTGGCCATGAAGGTGGGGCATATGCCAAAAACAGCTTTGGAAATATCTACACTGCTGTTGGTGTGTTTGTTCTCGGAAGGATGTTTCGAGAGGCTTGGGGATCCGGAGCGGCAAAAAAGCAAGTAGAATTCAATGATTTCCTTGAGAGTAACCGCATGTGCATATCAATGGAGCTAGTAACTGCTGTCTTGGGTGATCATGGCCAACGACCACGAGAGGATTACGTGGTAGTTACAGCAGTTACTGAACTAGGCAATGGAAAACCGAAGTTCTACTCTACTGCAGAAATAATAGCTTTTTGTAGAAAATGGCGCTTACCAACTAATCACGTCTGGTTATTCTCAAGCAGGAAATCAGTCACATCTTTTTTTGCTTCCTTTGATGCCCTTTGTGAAGAAGGAACTGCAACAACAGTATGCAAAGCTCTTGATGAAGTTGCAGAAATATCTGTACCAGGCTCAAAAGATCACATAAAAGTGCAGGGTGAAATTCTTGAGGGTCTAGTGGCCCGTATGGTGAGCCACGAGAGTTCAAAACACATGGAGAAAGTATTGGAAGAATTTCCTGCTCTGCCCGACAATGAAGGAGGTGAATTTGATTTGGGACCAAGCCTGAGGGAAATTTGTGCTGCCAATAGATCGGATGAAAAACAGCAAATAAAAGCACTTCTTCAAAACGTTGGTAGTGCCTTTTGCCCCGTCCATTCAGACTGGTATGGCGATTCTCACTCAAGAAATGCAGACAGATCTGTTGTATCAAAATTCTTACAAGCCAACCCAGCTGATTTTTCAACTTCCAAATTACAGGAAATGGTTCGACTAATGAGAGAAAATCGTTTTCCAGCTGCCTTCAAATGCTATTATAATTTCCACAAAGTTGGTTCCATATCAAACGACAACCTTTTCTATAAAATGGTCGTTCATGTTCAAAGTGACTCTGCTTTTCGGAGATATCAAAAGGAAATGAGGAACAAGCCGGGTTTGTGGCCATTATATCGAGGCTTTTTTATGGACATTAATTTATTCAAAGCAAACAAAGACAAGGCAGCTGAAATTGTGAAAGGTAAAAACAATTTGATGGAGATTGAAGG
AAATGGCACCTTGGGAAGAGATGGATTTGCTGATGAAGATGCAAATCTGATGATCAAACTGAAATTCCTTACATACAAGTTACGGACTTTTTTGATTCGTAATGGCTTGTCAATTGTCTTCAAAGAAGGTCCAGCTGCATACAAGGCTTATTATTTGAGGCAAATGAAATTGTGGGGTACTTCAGATGGAAAACAAAGAGAGCTCAGCAAGATGCTCGACGAATGGGCTGTATTCTTGAGGAGGAAGTATGGAAATAAACAACTGTCATCAACTACTTATCTTAGTGAAGCTGAACCTTTTCTTGAGCAGTATGCCAAACGCAGTCCTCAGAATCAGGCTCTTATTGGATCTGCTGGCAATTTAGTTAGAGCAGAAGATTTCTTGGCCATTGTGGAGGAAGGAATGGATGAAGAGGGTGATCTACAGAAGGAGCAGGAGGCAACACCGTCAAGTTCACTGCACTCTGGGAAGGATGTTGTGCCTAAAGCAGATGGTTTAATTGTGTTTTTTCCAGGAATCCCTGGCTGTGCGAAGTCTGCTATTTGTAGAGAAATACTTAATGCCCCAGGAGGACTTGGAGATGATCGACCACTTAATAGTCTAATGGGTGACTTGATAAAAGGAAGGTATTGGCAGAAGGTTGCTGATGAACTTAGGAGAAAACCGTACTCCATAATGCTTGCAGACAAAAATGCACCGAATGAAGAAGTGTGGAGACAAATTGAGGATATGTGCCATAGCACTAGAGCATCTGCTGTTCCAGTTGTACCTGATTCTGAAGGAACTGATTCTAACCCCTTCTCTTTTGATGCTCTGGCCGTCTTCATGTTTCGTGTGCTGCAAAGAGTTAATCATCCGGGAAATCTCGACAAGGCATCTCCCAACGCAGGCTATGTTCTTCTAATGTTTTATCACCTTTATGAGGGCAAGAGTCGCAGAGAGTTTGAGGGTGAGCTTATTGATCGTTTTGGATTTCTGATTAAGATACCATTGCTGAAATCTGATAGGAGTCCCTTACCTGATAATTTGAAGACTGTCTTGGAGGAAGGATTAAGTCTGTATAAGCTCCATACTAGGAGACATGGAAGGGCGGACTCTACCAAAGGTTCCTTTGCAAAAGAATGGGCCAAATGGGAGAAGCAATTGCGAGAAACTTTGTTTGGTAACACCGAGTATCTCAATTCTATTCAGGTTACGTTTGAGTTTGCTGTTCAAGATGTGTTGGAACAACTGAAGAAGATATTAAAGGGTGACTATAAAAACCTTACTACAGATAGGAGGAAGTCAGCGACCATAGTGTTTGCTGCTGTCAGTCTTCCCATTCTGGAGATTCAAAATCTTCTTGACACTCTAGGCAAGATAAATCCCCATGTTGGAGGCTTCTTTAAAAAAAACTTGAAGGACTTCACGCTTAGAGAGGTCCATGTGACACTCGCACACAAGAGAGGCCATGGCGTGAAAGCCGTAGCTGACTACGGGATCTTCGAAAACAAAGAAGTTCCAGTGGAGTTGACGGGCCTACTTTTCTCAAACAAAATGGCTGCCTTTGAAGCCCGCCTCGGCAGCATTGAGGATGAAAGAGTGATCTCCAAAAATGAGTGGCCTCATGCGACAATATGGACAACAGAAGGGGTTGCAGCAAAAGAGGCTAACACATTACCGCTGTTGGTTCCAGAGGGAAAAGCAACTCTCGTTGAAATCAACCCTCCCATAGTCATATCAGGAAAGGTGCAATTCTTTTAGCCTTTTTCCTCCCTCCATTTCTCTAGATGGATAGGAAATGTCCAATGCATACAAATTTTCTCGCTGAGGTTTAGGTTTAGATGCAGTTCATAAGTTTTGTAGATTGTATGACAACATTTTTGTATGTATAGACCAACTTCATTTACAGGAAAATAAGTGGAATTTTGTTTCAGGTACCCCTATACCATTGAGCTATTTCATTTACAGGAAAA
Coding sequence (CDS)
ATGTCGGTATCGCACAAGAGGGTTCTCTGCGCTATTACTCTTCCTTTGTCTTCCTCTTTGACTTTCAGTTCCAGGACCCGTTTCTACATTCCCCACTCCTTATTACCTTTCAAAGCGTCTTCCCCATTTCCCCTTTCCTGCCATTCTCCATTCATCATGCCTTACAATCAGCCAAGGGGTGGTTGTAAAGACCAGAAGTGGAAAGCGAAGACGAAGGCCGACAATACTTCGATGGATGCTGCTGAAGTTGTTACACATGCACTCAATAAATTGAGTGTCACTGAAAGTGGTCAACCTCATGTTCCTATTTCAAGTACGCAGTTTGGAAATGTGCAACTCACAAACCAGGTCCCTCCTGGGGATGGTCATAGAACAATTTGGAAACCAAAAGCATACGGAACAACCAGTGGGGCTGCAGTTGTCGAAGCTGAAAATGCATCCGCCGGTCGAACATCTATTCAAAACAAGGAGAACTCTGCTGGACTGGCTGCACAGAATGACATATTTAAGGGAAATCAAATAGAGAAGTTTACTGTGGATAACTACACTTACACACATGCCCAAATTAGAGCTACCTTCTACCCAAAATTTGAGAATGAGAAGTCGGATCAGGAGATTCGATCAAGGATGATAGAGATGGTATCGAAAGGCTTAGCAACATTGGAGGTTTCACTAAAACACTCAGGGTCTTTGTTTATGTATGCTGGCCATGAAGGTGGGGCATATGCCAAAAACAGCTTTGGAAATATCTACACTGCTGTTGGTGTGTTTGTTCTCGGAAGGATGTTTCGAGAGGCTTGGGGATCCGGAGCGGCAAAAAAGCAAGTAGAATTCAATGATTTCCTTGAGAGTAACCGCATGTGCATATCAATGGAGCTAGTAACTGCTGTCTTGGGTGATCATGGCCAACGACCACGAGAGGATTACGTGGTAGTTACAGCAGTTACTGAACTAGGCAATGGAAAACCGAAGTTCTACTCTACTGCAGAAATAATAGCTTTTTGTAGAAAATGGCGCTTACCAACTAATCACGTCTGGTTATTCTCAAGCAGGAAATCAGTCACATCTTTTTTTGCTTCCTTTGATGCCCTTTGTGAAGAAGGAACTGCAACAACAGTATGCAAAGCTCTTGATGAAGTTGCAGAAATATCTGTACCAGGCTCAAAAGATCACATAAAAGTGCAGGGTGAAATTCTTGAGGGTCTAGTGGCCCGTATGGTGAGCCACGAGAGTTCAAAACACATGGAGAAAGTATTGGAAGAATTTCCTGCTCTGCCCGACAATGAAGGAGGTGAATTTGATTTGGGACCAAGCCTGAGGGAAATTTGTGCTGCCAATAGATCGGATGAAAAACAGCAAATAAAAGCACTTCTTCAAAACGTTGGTAGTGCCTTTTGCCCCGTCCATTCAGACTGGTATGGCGATTCTCACTCAAGAAATGCAGACAGATCTGTTGTATCAAAATTCTTACAAGCCAACCCAGCTGATTTTTCAACTTCCAAATTACAGGAAATGGTTCGACTAATGAGAGAAAATCGTTTTCCAGCTGCCTTCAAATGCTATTATAATTTCCACAAAGTTGGTTCCATATCAAACGACAACCTTTTCTATAAAATGGTCGTTCATGTTCAAAGTGACTCTGCTTTTCGGAGATATCAAAAGGAAATGAGGAACAAGCCGGGTTTGTGGCCATTATATCGAGGCTTTTTTATGGACATTAATTTATTCAAAGCAAACAAAGACAAGGCAGCTGAAATTGTGAAAGGTAAAAACAATTTGATGGAGATTGAAGGAAATGGCACCTTGGGAAGAGATGGATTTGCTGATGAAGATGCAAATCTGATGATCAAACTGAAATTCCTTACATACAAGTTACGGACTTTTTTGATTCGTAATGGCTTGTCAATTGTCTTCAAAGAAGGTCCAGCTGCATACAAGGCTTATTATTTGAGGCAAATGAAATTGTGGGGTACTTCAGATGGAAAACAAAGAGAGCTCAG
CAAGATGCTCGACGAATGGGCTGTATTCTTGAGGAGGAAGTATGGAAATAAACAACTGTCATCAACTACTTATCTTAGTGAAGCTGAACCTTTTCTTGAGCAGTATGCCAAACGCAGTCCTCAGAATCAGGCTCTTATTGGATCTGCTGGCAATTTAGTTAGAGCAGAAGATTTCTTGGCCATTGTGGAGGAAGGAATGGATGAAGAGGGTGATCTACAGAAGGAGCAGGAGGCAACACCGTCAAGTTCACTGCACTCTGGGAAGGATGTTGTGCCTAAAGCAGATGGTTTAATTGTGTTTTTTCCAGGAATCCCTGGCTGTGCGAAGTCTGCTATTTGTAGAGAAATACTTAATGCCCCAGGAGGACTTGGAGATGATCGACCACTTAATAGTCTAATGGGTGACTTGATAAAAGGAAGGTATTGGCAGAAGGTTGCTGATGAACTTAGGAGAAAACCGTACTCCATAATGCTTGCAGACAAAAATGCACCGAATGAAGAAGTGTGGAGACAAATTGAGGATATGTGCCATAGCACTAGAGCATCTGCTGTTCCAGTTGTACCTGATTCTGAAGGAACTGATTCTAACCCCTTCTCTTTTGATGCTCTGGCCGTCTTCATGTTTCGTGTGCTGCAAAGAGTTAATCATCCGGGAAATCTCGACAAGGCATCTCCCAACGCAGGCTATGTTCTTCTAATGTTTTATCACCTTTATGAGGGCAAGAGTCGCAGAGAGTTTGAGGGTGAGCTTATTGATCGTTTTGGATTTCTGATTAAGATACCATTGCTGAAATCTGATAGGAGTCCCTTACCTGATAATTTGAAGACTGTCTTGGAGGAAGGATTAAGTCTGTATAAGCTCCATACTAGGAGACATGGAAGGGCGGACTCTACCAAAGGTTCCTTTGCAAAAGAATGGGCCAAATGGGAGAAGCAATTGCGAGAAACTTTGTTTGGTAACACCGAGTATCTCAATTCTATTCAGGTTACGTTTGAGTTTGCTGTTCAAGATGTGTTGGAACAACTGAAGAAGATATTAAAGGGTGACTATAAAAACCTTACTACAGATAGGAGGAAGTCAGCGACCATAGTGTTTGCTGCTGTCAGTCTTCCCATTCTGGAGATTCAAAATCTTCTTGACACTCTAGGCAAGATAAATCCCCATGTTGGAGGCTTCTTTAAAAAAAACTTGAAGGACTTCACGCTTAGAGAGGTCCATGTGACACTCGCACACAAGAGAGGCCATGGCGTGAAAGCCGTAGCTGACTACGGGATCTTCGAAAACAAAGAAGTTCCAGTGGAGTTGACGGGCCTACTTTTCTCAAACAAAATGGCTGCCTTTGAAGCCCGCCTCGGCAGCATTGAGGATGAAAGAGTGATCTCCAAAAATGAGTGGCCTCATGCGACAATATGGACAACAGAAGGGGTTGCAGCAAAAGAGGCTAACACATTACCGCTGTTGGTTCCAGAGGGAAAAGCAACTCTCGTTGAAATCAACCCTCCCATAGTCATATCAGGAAAGGTGCAATTCTTTTAG
Protein sequence
MSVSHKRVLCAITLPLSSSLTFSSRTRFYIPHSLLPFKASSPFPLSCHSPFIMPYNQPRGGCKDQKWKAKTKADNTSMDAAEVVTHALNKLSVTESGQPHVPISSTQFGNVQLTNQVPPGDGHRTIWKPKAYGTTSGAAVVEAENASAGRTSIQNKENSAGLAAQNDIFKGNQIEKFTVDNYTYTHAQIRATFYPKFENEKSDQEIRSRMIEMVSKGLATLEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGSGAAKKQVEFNDFLESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGNGKPKFYSTAEIIAFCRKWRLPTNHVWLFSSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVSHESSKHMEKVLEEFPALPDNEGGEFDLGPSLREICAANRSDEKQQIKALLQNVGSAFCPVHSDWYGDSHSRNADRSVVSKFLQANPADFSTSKLQEMVRLMRENRFPAAFKCYYNFHKVGSISNDNLFYKMVVHVQSDSAFRRYQKEMRNKPGLWPLYRGFFMDINLFKANKDKAAEIVKGKNNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSIVFKEGPAAYKAYYLRQMKLWGTSDGKQRELSKMLDEWAVFLRRKYGNKQLSSTTYLSEAEPFLEQYAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEATPSSSLHSGKDVVPKADGLIVFFPGIPGCAKSAICREILNAPGGLGDDRPLNSLMGDLIKGRYWQKVADELRRKPYSIMLADKNAPNEEVWRQIEDMCHSTRASAVPVVPDSEGTDSNPFSFDALAVFMFRVLQRVNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGFLIKIPLLKSDRSPLPDNLKTVLEEGLSLYKLHTRRHGRADSTKGSFAKEWAKWEKQLRETLFGNTEYLNSIQVTFEFAVQDVLEQLKKILKGDYKNLTTDRRKSATIVFAAVSLPILEIQNLLDTLGKINPHVGGFFKKNLKDFTLREVHVTLAHKRGHGVKAVADYGIFENKEVPVELTGLLFSNKMAAFEARLGSIEDERVISKNEWPHATIWTTEGVAAKEANTLPLLVPEGKATLVEINPPIVISGKVQFF
Homology
BLAST of Sed0015984 vs. NCBI nr
Match:
XP_022157928.1 (tRNA ligase 1 [Momordica charantia])
HSP 1 Score: 2011.5 bits (5210), Expect = 0.0e+00
Identity = 1016/1198 (84.81%), Positives = 1075/1198 (89.73%), Query Frame = 0
Query: 1 MSVSHKRVLCAITLP---------LSSSLTFSSRTRFYIPHSL-LPFKASSPFPLSCHSP 60
MS SH R+ CAITLP L +S F S + F P SL LP SSPF LS HS
Sbjct: 1 MSASH-RIFCAITLPHPPRFSPSSLFNSRAFLSTSHFIFPRSLALPPLISSPFHLSPHSR 60
Query: 61 FIMPYNQPRGGCKDQKWKAKTKADNTSMD---AAEVVTHALNKLSVTESGQPHVPISSTQ 120
IMPYNQ G ++QKWK K K D TS + AAEVVT+AL KL V+ESGQPHVPISS +
Sbjct: 61 SIMPYNQRSDGRREQKWKEKAKLDRTSTESEAAAEVVTNALGKLRVSESGQPHVPISSRE 120
Query: 121 FGNVQLTNQVPPGDGHRTIWKPKAYGTTS-GAAVVEAENASAGRTSIQNKENSAGLAAQN 180
FGN QLTNQVP G G+R IWKPKAYGTTS GAAVVEAE A A TSI+NK N+AGLAAQN
Sbjct: 121 FGNAQLTNQVPSGLGNRGIWKPKAYGTTSGGAAVVEAEKAPAVGTSIENKGNTAGLAAQN 180
Query: 181 ------DIFKGNQIEKFTVDNYTYTHAQIRATFYPKFENEKSDQEIRSRMIEMVSKGLAT 240
+FKGNQIE FTVDN TYT AQIRATFYPKFENEKSDQEIR+RMIEMVSKGLAT
Sbjct: 181 GTVGLSQLFKGNQIENFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLAT 240
Query: 241 LEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGSGAAKKQVEFND 300
LEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGS AAKKQ EFN+
Sbjct: 241 LEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGSKAAKKQAEFNN 300
Query: 301 FLESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGNGKPKFYSTAEIIAFCRKWRL 360
FLESNRMCISMELVTAVLGDHGQRPREDYVVVTAVT+LGNGKPKFYSTAEII FCR+WRL
Sbjct: 301 FLESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTDLGNGKPKFYSTAEIIVFCREWRL 360
Query: 361 PTNHVWLFSSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGEILE 420
PTNHVWLFSSRKSVTSFFA+FDALCEEGTAT+VCKALDEVAEISVPGSKDHIKVQGEILE
Sbjct: 361 PTNHVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILE 420
Query: 421 GLVARMVSHESSKHMEKVLEEFPALPDNEGGEFDLGPSLREICAANRSDEKQQIKALLQN 480
GLVAR+VSHESSKHMEKVLEEFP+LPD EGG DLG SLREICAANRSDEKQQIKALLQN
Sbjct: 421 GLVARIVSHESSKHMEKVLEEFPSLPDEEGGGLDLGRSLREICAANRSDEKQQIKALLQN 480
Query: 481 VGSAFCPVHSDWYGDSHSRNADRSVVSKFLQANPADFSTSKLQEMVRLMRENRFPAAFKC 540
VGS+FCP HSDW GDSHSR ADRSV+SKFLQ +P DFSTSKLQEM+RLMRE R PAAFKC
Sbjct: 481 VGSSFCPDHSDWSGDSHSRTADRSVLSKFLQTSPTDFSTSKLQEMIRLMREKRLPAAFKC 540
Query: 541 YYNFHKVGSISNDNLFYKMVVHVQSDSAFRRYQKEMRNKPGLWPLYRGFFMDINLFKANK 600
Y+NFHKVGSISND+LFYKMV+HV SDSAFRRYQKEMR+KPGLWPLYRGFF+DINLFKANK
Sbjct: 541 YHNFHKVGSISNDDLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKANK 600
Query: 601 DKAAEIVKGKNNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSIVFKE 660
DKAAEI+K K+NLME+EGNG LGRDG ADEDANLMIKLKFLTYKLRTFLIRNGLSI+FKE
Sbjct: 601 DKAAEIMKSKSNLMEVEGNGILGRDGLADEDANLMIKLKFLTYKLRTFLIRNGLSILFKE 660
Query: 661 GPAAYKAYYLRQMKLWGTSDGKQRELSKMLDEWAVFLRRKYGNKQLSSTTYLSEAEPFLE 720
GPAAYKAYYLRQMKLWGTS GKQRELSKMLDEWAV+LRRKYGN+QLSS TYLSEAEPFLE
Sbjct: 661 GPAAYKAYYLRQMKLWGTSVGKQRELSKMLDEWAVYLRRKYGNRQLSSATYLSEAEPFLE 720
Query: 721 QYAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEATPSSSLHSGKDVVPK 780
QYAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQE PSS + GKD V K
Sbjct: 721 QYAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEVAPSSPMLPGKDTVSK 780
Query: 781 ADGLIVFFPGIPGCAKSAICREILNAPGGLGDDRPLNSLMGDLIKGRYWQKVADELRRKP 840
A+GLIVFFPGIPGCAKSA+CREILNAPGGLGDDRP+ SLMGDLIKGRYWQKV DE RRKP
Sbjct: 781 AEGLIVFFPGIPGCAKSALCREILNAPGGLGDDRPVKSLMGDLIKGRYWQKVVDERRRKP 840
Query: 841 YSIMLADKNAPNEEVWRQIEDMCHSTRASAVPVVPDSEGTDSNPFSFDALAVFMFRVLQR 900
YSIMLADKNAPNEEVWRQIEDMCHSTRASAVPVVPDSEGTD NPFS DALAVFMFRVLQR
Sbjct: 841 YSIMLADKNAPNEEVWRQIEDMCHSTRASAVPVVPDSEGTDGNPFSLDALAVFMFRVLQR 900
Query: 901 VNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGFLIKIPLLKSDRSPLPDN 960
VNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFE ELIDRFG L+K+PLLK DRSPLPDN
Sbjct: 901 VNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEDELIDRFGSLVKMPLLKCDRSPLPDN 960
Query: 961 LKTVLEEGLSLYKLHTRRHGRADSTKGSFAKEWAKWEKQLRETLFGNTEYLNSIQVTFEF 1020
LKT+LEEGLSLYKLHT RHGRADSTKGS+AKEWAKWEKQLRETLFGNTEYLNSIQV FE
Sbjct: 961 LKTILEEGLSLYKLHTSRHGRADSTKGSYAKEWAKWEKQLRETLFGNTEYLNSIQVPFEV 1020
Query: 1021 AVQDVLEQLKKILKGDYKNLTTDRRKSATIVFAAVSLPILEIQNLLDTLGKINPHVGGFF 1080
AVQDVLEQLKKI KGDYK ++RRKSATIVFAAVSLP+ EIQNLLDTLGK NPHV F
Sbjct: 1021 AVQDVLEQLKKIAKGDYKTPISERRKSATIVFAAVSLPVQEIQNLLDTLGKKNPHVESFL 1080
Query: 1081 KKNLKDFTLREVHVTLAHKRGHGVKAVADYGIFENKEVPVELTGLLFSNKMAAFEARLGS 1140
K++ KD+TL+ HVTLAHKR HGVKAVADYGIF+NKEVPVELT LLFS+KMAAFEA LGS
Sbjct: 1081 KQDYKDYTLKAAHVTLAHKRSHGVKAVADYGIFQNKEVPVELTALLFSDKMAAFEAHLGS 1140
Query: 1141 IEDERVISKNEWPHATIWTTEGVAAKEANTLPLLVPEGKATLVEINPPIVISGKVQFF 1179
+EDERV+SKNEWPH T+WT EGVAAKEANTLP LV EGKATLVE+NPP +ISG V+FF
Sbjct: 1141 VEDERVVSKNEWPHVTLWTREGVAAKEANTLPQLVSEGKATLVELNPPTIISGTVKFF 1197
BLAST of Sed0015984 vs. NCBI nr
Match:
XP_022964893.1 (tRNA ligase 1 isoform X1 [Cucurbita moschata])
HSP 1 Score: 2006.9 bits (5198), Expect = 0.0e+00
Identity = 1009/1190 (84.79%), Positives = 1070/1190 (89.92%), Query Frame = 0
Query: 7 RVLCAITLPLSSSLTFSSRT--------RFYIPHSLLPFKAS-SPFPLSCHSPFIMPYNQ 66
R+ CAITLPLSSS SR +I H L AS PF + S F MPYNQ
Sbjct: 6 RIFCAITLPLSSSPALHSRAFPFVSCSLSHFILHPSLTLPASVFPFTVCRDSRFTMPYNQ 65
Query: 67 PRGGCKDQKWKAKTKADNTSMD---AAEVVTHALNKLSVTESGQPHVPISSTQFGNVQLT 126
RGG ++QKWK K K + S + A+EVVT+AL+ L VTES QPH+PI+S QFGN Q T
Sbjct: 66 RRGGRREQKWKEKAKVEGISTESETASEVVTNALSNLRVTESNQPHIPITSVQFGNAQPT 125
Query: 127 NQVPPGDGHRTIWKPKAYGTTSGAAVVEAENASAGRTSIQNKENSAGLAAQ------NDI 186
N PG GHR IWKPKAYGTTSGAAVVE E A A TSI+NK ++A +AA + +
Sbjct: 126 NLATPGLGHRAIWKPKAYGTTSGAAVVEGEKAPAVGTSIENKGSNAEIAANSSAIALSQL 185
Query: 187 FKGNQIEKFTVDNYTYTHAQIRATFYPKFENEKSDQEIRSRMIEMVSKGLATLEVSLKHS 246
KGNQIE+FTVDN YT AQIRATFYPKFENEKSDQEIR+RMIEMVSKGLATLEVSLKHS
Sbjct: 186 LKGNQIEQFTVDNSAYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEVSLKHS 245
Query: 247 GSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGSGAAKKQVEFNDFLESNRMC 306
GSLFMYAGH+GGAYAKNSFGNIYTAVGVFVLGRMFREAWGS A KKQ EFNDFLESNRMC
Sbjct: 246 GSLFMYAGHQGGAYAKNSFGNIYTAVGVFVLGRMFREAWGSVAPKKQAEFNDFLESNRMC 305
Query: 307 ISMELVTAVLGDHGQRPREDYVVVTAVTELGNGKPKFYSTAEIIAFCRKWRLPTNHVWLF 366
ISMELVTAVLGDHGQRPREDYVVVTAVTELGNGKPKFYST+EIIAFCRKWRLPTNHVWLF
Sbjct: 306 ISMELVTAVLGDHGQRPREDYVVVTAVTELGNGKPKFYSTSEIIAFCRKWRLPTNHVWLF 365
Query: 367 SSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVS 426
SSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVS
Sbjct: 366 SSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVS 425
Query: 427 HESSKHMEKVLEEFPALPDNEGGEFDLGPSLREICAANRSDEKQQIKALLQNVGSAFCPV 486
HESSKHMEKVLEEFPALP NEGG DLGPSLREICAANRSDEKQQIKALLQNVGSAFCP
Sbjct: 426 HESSKHMEKVLEEFPALPYNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGSAFCPD 485
Query: 487 HSDWYGDSHSRNADRSVVSKFLQANPADFSTSKLQEMVRLMRENRFPAAFKCYYNFHKVG 546
HSDWYGDSHSRNADRSVVSKFLQA PADFSTSKLQEMVRLMRE R PAAFKCY+NFHK+G
Sbjct: 486 HSDWYGDSHSRNADRSVVSKFLQAKPADFSTSKLQEMVRLMRERRLPAAFKCYHNFHKIG 545
Query: 547 SISNDNLFYKMVVHVQSDSAFRRYQKEMRNKPGLWPLYRGFFMDINLFKANKDKAAEIVK 606
SISNDNLFYKMV+HVQSDSAFRRYQKEMR+KPGLWPLYRGFF+DINLFK NK+K AEIVK
Sbjct: 546 SISNDNLFYKMVIHVQSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKEKTAEIVK 605
Query: 607 GKNNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSIVFKEGPAAYKAY 666
KNNLME EGNGT+GRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSI+FKEG AAYKAY
Sbjct: 606 SKNNLMETEGNGTVGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEGSAAYKAY 665
Query: 667 YLRQMKLWGTSDGKQRELSKMLDEWAVFLRRKYGNKQLSSTTYLSEAEPFLEQYAKRSPQ 726
YLRQMKLWGTS GKQRELSKMLDEWAV+LRRKYGNKQLSS+ YLSEAEPFLEQYAKRSPQ
Sbjct: 666 YLRQMKLWGTSFGKQRELSKMLDEWAVYLRRKYGNKQLSSSIYLSEAEPFLEQYAKRSPQ 725
Query: 727 NQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEATPSSSLHSGKDVVPKADGLIVFF 786
NQALIGSAGNLVRAEDFLA+VEEGMDEEGDLQKE +A PSS + S KDVVPKA+GLIVFF
Sbjct: 726 NQALIGSAGNLVRAEDFLAVVEEGMDEEGDLQKE-DAAPSSPMLSRKDVVPKAEGLIVFF 785
Query: 787 PGIPGCAKSAICREILNAPGGLGDDRPLNSLMGDLIKGRYWQKVADELRRKPYSIMLADK 846
PGIPGCAKSA+CREILNAPGGLGDDRP+N+LMGDLIKGRYWQKVADE RRKPYSIMLADK
Sbjct: 786 PGIPGCAKSALCREILNAPGGLGDDRPVNTLMGDLIKGRYWQKVADERRRKPYSIMLADK 845
Query: 847 NAPNEEVWRQIEDMCHSTRASAVPVVPDSEGTDSNPFSFDALAVFMFRVLQRVNHPGNLD 906
NAPNEEVWRQIEDMCHSTRASAVPV+PDSEGTDSNPFS DALAVFMFRVLQRVNHPGNLD
Sbjct: 846 NAPNEEVWRQIEDMCHSTRASAVPVIPDSEGTDSNPFSLDALAVFMFRVLQRVNHPGNLD 905
Query: 907 KASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGFLIKIPLLKSDRSPLPDNLKTVLEEG 966
KASPNAGYVLLMFYHLYEGKSRREFEGELIDRFG L+KIPLLKSDRSPLPDNLKT+LEEG
Sbjct: 906 KASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLVKIPLLKSDRSPLPDNLKTILEEG 965
Query: 967 LSLYKLHTRRHGRADSTKGSFAKEWAKWEKQLRETLFGNTEYLNSIQVTFEFAVQDVLEQ 1026
LSLYKLHT RHGRADSTKGS+AKEWAKWEKQLRETLFGN EYLN+IQV FEFAVQ+VLEQ
Sbjct: 966 LSLYKLHTSRHGRADSTKGSYAKEWAKWEKQLRETLFGNAEYLNAIQVPFEFAVQNVLEQ 1025
Query: 1027 LKKILKGDYKNLTTDRRKSATIVFAAVSLPILEIQNLLDTLGKINPHVGGFFKKNLKDFT 1086
LKKI KGDYK+ T+RRKSATIV+AAVSLP+ +IQ+ LDTLG NP V F K+ KD+T
Sbjct: 1026 LKKISKGDYKSPITERRKSATIVYAAVSLPVQDIQDALDTLGNKNPQVEAFIKEGYKDYT 1085
Query: 1087 LREVHVTLAHKRGHGVKAVADYGIFENKEVPVELTGLLFSNKMAAFEARLGSIEDERVIS 1146
L+ HVTLAHKR HG+KAVADYGIFENKEVPVELT LLFS+KMAAFEAR+GSIEDERVIS
Sbjct: 1086 LKSAHVTLAHKRSHGIKAVADYGIFENKEVPVELTALLFSDKMAAFEARVGSIEDERVIS 1145
Query: 1147 KNEWPHATIWTTEGVAAKEANTLPLLVPEGKATLVEINPPIVISGKVQFF 1179
KNEWPH T+WT EG+AAKEANTLP LV EGKATLVE+NPPI+ISGKVQFF
Sbjct: 1146 KNEWPHVTLWTREGIAAKEANTLPQLVSEGKATLVELNPPIIISGKVQFF 1194
BLAST of Sed0015984 vs. NCBI nr
Match:
KAG6583500.1 (tRNA ligase 1, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 2004.2 bits (5191), Expect = 0.0e+00
Identity = 1008/1190 (84.71%), Positives = 1069/1190 (89.83%), Query Frame = 0
Query: 7 RVLCAITLPLSSSLTFSSRT--------RFYIPHSLLPFKAS-SPFPLSCHSPFIMPYNQ 66
R+ CAITLPLSSS R +I H L AS PF +S S F MPYNQ
Sbjct: 6 RIFCAITLPLSSSPALHYRAFPFVSCSLSHFILHPSLTLPASVFPFTVSRDSRFTMPYNQ 65
Query: 67 PRGGCKDQKWKAKTKADNTSMD---AAEVVTHALNKLSVTESGQPHVPISSTQFGNVQLT 126
RGG ++QKWK K K + S + A+EVVT+AL+ L VTES QPH+PI+S QFGN Q T
Sbjct: 66 RRGGRREQKWKEKAKVEGISTESETASEVVTNALSNLRVTESNQPHIPITSVQFGNAQPT 125
Query: 127 NQVPPGDGHRTIWKPKAYGTTSGAAVVEAENASAGRTSIQNKENSAGLAAQ------NDI 186
N PG GHR IWKPKAYGTTSGAAVVE E A A TSI+NK ++A +AA + +
Sbjct: 126 NLATPGLGHRAIWKPKAYGTTSGAAVVEGEKAPAVGTSIENKGSNAEIAANSSAIALSQL 185
Query: 187 FKGNQIEKFTVDNYTYTHAQIRATFYPKFENEKSDQEIRSRMIEMVSKGLATLEVSLKHS 246
KGNQIE+FTVDN YT AQIRATFYPKFENEKSDQEIR+RMIEMVSKGLATLEVSLKHS
Sbjct: 186 LKGNQIEQFTVDNSAYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEVSLKHS 245
Query: 247 GSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGSGAAKKQVEFNDFLESNRMC 306
GSLFMYAGH+GGAYAKNSFGNIYTAVGVFVLGRMFREAWGS A KKQ EFNDFLESNRMC
Sbjct: 246 GSLFMYAGHQGGAYAKNSFGNIYTAVGVFVLGRMFREAWGSVAPKKQAEFNDFLESNRMC 305
Query: 307 ISMELVTAVLGDHGQRPREDYVVVTAVTELGNGKPKFYSTAEIIAFCRKWRLPTNHVWLF 366
ISMELVTAVLGDHGQRPREDYVVVTAVTELGNGKPKFYST+EIIAFCRKWRLPTNHVWLF
Sbjct: 306 ISMELVTAVLGDHGQRPREDYVVVTAVTELGNGKPKFYSTSEIIAFCRKWRLPTNHVWLF 365
Query: 367 SSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVS 426
SSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVS
Sbjct: 366 SSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVS 425
Query: 427 HESSKHMEKVLEEFPALPDNEGGEFDLGPSLREICAANRSDEKQQIKALLQNVGSAFCPV 486
HESSKHMEKVLEEFPALP NEGG DLGPSLREICAANRSDEKQQIKALLQNVGSAFCP
Sbjct: 426 HESSKHMEKVLEEFPALPYNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGSAFCPD 485
Query: 487 HSDWYGDSHSRNADRSVVSKFLQANPADFSTSKLQEMVRLMRENRFPAAFKCYYNFHKVG 546
HSDWYGDSHSRNADRSVVSKFLQA PADFSTSKLQEMVRLMRE R PAAFKCY+NFHK+G
Sbjct: 486 HSDWYGDSHSRNADRSVVSKFLQAKPADFSTSKLQEMVRLMRERRLPAAFKCYHNFHKIG 545
Query: 547 SISNDNLFYKMVVHVQSDSAFRRYQKEMRNKPGLWPLYRGFFMDINLFKANKDKAAEIVK 606
SISNDNLFYKMV+HVQSDSAFRRYQKEMR+KPGLWPLYRGFF+DINLFK NK+K AEIVK
Sbjct: 546 SISNDNLFYKMVIHVQSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKEKTAEIVK 605
Query: 607 GKNNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSIVFKEGPAAYKAY 666
KNNLME EGNGT+GRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSI+FKEG AAYKAY
Sbjct: 606 SKNNLMETEGNGTVGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEGSAAYKAY 665
Query: 667 YLRQMKLWGTSDGKQRELSKMLDEWAVFLRRKYGNKQLSSTTYLSEAEPFLEQYAKRSPQ 726
YLRQMKLWGTS GKQRELSKMLDEWAV+LRRKYGNKQLSS+ YLSEAEPFLEQYAKRSPQ
Sbjct: 666 YLRQMKLWGTSFGKQRELSKMLDEWAVYLRRKYGNKQLSSSIYLSEAEPFLEQYAKRSPQ 725
Query: 727 NQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEATPSSSLHSGKDVVPKADGLIVFF 786
NQALIGSAGNLVRAEDFLA+VEEGMDEEGDLQKE +A PSS + S KDVVPKA+GLIVFF
Sbjct: 726 NQALIGSAGNLVRAEDFLAVVEEGMDEEGDLQKE-DAAPSSPMLSRKDVVPKAEGLIVFF 785
Query: 787 PGIPGCAKSAICREILNAPGGLGDDRPLNSLMGDLIKGRYWQKVADELRRKPYSIMLADK 846
PGIPGCAKSA+CREILNAPGGLGDDRP+N+LMGDLIKGRYWQKVADE RRKPYSIMLADK
Sbjct: 786 PGIPGCAKSALCREILNAPGGLGDDRPVNTLMGDLIKGRYWQKVADERRRKPYSIMLADK 845
Query: 847 NAPNEEVWRQIEDMCHSTRASAVPVVPDSEGTDSNPFSFDALAVFMFRVLQRVNHPGNLD 906
NAPNEEVWRQIEDMCHSTRASAVPV+PDSEGTDSNPFS DALAVFMFRVL RVNHPGNLD
Sbjct: 846 NAPNEEVWRQIEDMCHSTRASAVPVIPDSEGTDSNPFSLDALAVFMFRVLHRVNHPGNLD 905
Query: 907 KASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGFLIKIPLLKSDRSPLPDNLKTVLEEG 966
KASPNAGYVLLMFYHLYEGKSRREFEGELIDRFG L+KIPLLKSDRSPLPDNLKT+LEEG
Sbjct: 906 KASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLVKIPLLKSDRSPLPDNLKTILEEG 965
Query: 967 LSLYKLHTRRHGRADSTKGSFAKEWAKWEKQLRETLFGNTEYLNSIQVTFEFAVQDVLEQ 1026
LSLYKLHT RHGRADSTKGS+AKEWAKWEKQLRETLFGN EYLN+IQV FEFAVQ+VLEQ
Sbjct: 966 LSLYKLHTGRHGRADSTKGSYAKEWAKWEKQLRETLFGNAEYLNAIQVPFEFAVQNVLEQ 1025
Query: 1027 LKKILKGDYKNLTTDRRKSATIVFAAVSLPILEIQNLLDTLGKINPHVGGFFKKNLKDFT 1086
LKKI KGDYK+ T+RRKSATIV+AAVSLP+ +IQ+ LDTLG NP V F K+ KD+T
Sbjct: 1026 LKKISKGDYKSPITERRKSATIVYAAVSLPVQDIQDALDTLGNKNPQVEAFIKEGYKDYT 1085
Query: 1087 LREVHVTLAHKRGHGVKAVADYGIFENKEVPVELTGLLFSNKMAAFEARLGSIEDERVIS 1146
L+ HVTLAHKR HG+KAVADYGIFENKEVPVELT LLFS+KMAAFEAR+GSIEDERVIS
Sbjct: 1086 LKSAHVTLAHKRSHGIKAVADYGIFENKEVPVELTALLFSDKMAAFEARVGSIEDERVIS 1145
Query: 1147 KNEWPHATIWTTEGVAAKEANTLPLLVPEGKATLVEINPPIVISGKVQFF 1179
KNEWPH T+WT EG+AAKEANTLP LV EGKATLVE+NPPI+ISGKVQFF
Sbjct: 1146 KNEWPHVTLWTREGIAAKEANTLPQLVSEGKATLVELNPPIIISGKVQFF 1194
BLAST of Sed0015984 vs. NCBI nr
Match:
KAG7019255.1 (tRNA ligase 1 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 2003.0 bits (5188), Expect = 0.0e+00
Identity = 1007/1190 (84.62%), Positives = 1068/1190 (89.75%), Query Frame = 0
Query: 7 RVLCAITLPLSSSLTFSSRT--------RFYIPHSLLPFKAS-SPFPLSCHSPFIMPYNQ 66
R+ CAITLPLSSS SR +I H L AS PF + S F MPYNQ
Sbjct: 6 RIFCAITLPLSSSPALHSRAFPFVSCSLSHFILHPSLTLPASVFPFTVCRDSRFTMPYNQ 65
Query: 67 PRGGCKDQKWKAKTKADNTSMD---AAEVVTHALNKLSVTESGQPHVPISSTQFGNVQLT 126
RGG ++QKWK K K + S + A+EVVT+AL+ L VTES QPH+PI+S QFGN Q T
Sbjct: 66 RRGGRREQKWKEKAKVEGISTESETASEVVTNALSNLRVTESNQPHIPITSVQFGNAQPT 125
Query: 127 NQVPPGDGHRTIWKPKAYGTTSGAAVVEAENASAGRTSIQNKENSAGLAAQ------NDI 186
N PG GHR IWKPKAYGTTSGAAVVE E A A TS +NK ++A +AA + +
Sbjct: 126 NLATPGLGHRAIWKPKAYGTTSGAAVVEGEKAPAVGTSTENKGSNAEIAANSSAIALSQL 185
Query: 187 FKGNQIEKFTVDNYTYTHAQIRATFYPKFENEKSDQEIRSRMIEMVSKGLATLEVSLKHS 246
KGNQIE+FTVDN YT AQIRATFYPKFENEKSDQEIR+RMIEMVSKGLATLEVSLKHS
Sbjct: 186 LKGNQIEQFTVDNSAYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEVSLKHS 245
Query: 247 GSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGSGAAKKQVEFNDFLESNRMC 306
GSLFMYAGH+GGAYAKNSFGNIYTAVGVFVLGRMFREAWGS A KKQ EFNDFLESNRMC
Sbjct: 246 GSLFMYAGHQGGAYAKNSFGNIYTAVGVFVLGRMFREAWGSVAPKKQAEFNDFLESNRMC 305
Query: 307 ISMELVTAVLGDHGQRPREDYVVVTAVTELGNGKPKFYSTAEIIAFCRKWRLPTNHVWLF 366
ISMELVTAVLGDHGQRPREDYVVVTAVTELGNGKPKFYST+EIIAFCRKWRLPTNHVWLF
Sbjct: 306 ISMELVTAVLGDHGQRPREDYVVVTAVTELGNGKPKFYSTSEIIAFCRKWRLPTNHVWLF 365
Query: 367 SSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVS 426
SSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVS
Sbjct: 366 SSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVS 425
Query: 427 HESSKHMEKVLEEFPALPDNEGGEFDLGPSLREICAANRSDEKQQIKALLQNVGSAFCPV 486
HESSKHMEKVLEEFPALP NEGG DLGPSLREICAANRSDEKQQIKALLQNVGSAFCP
Sbjct: 426 HESSKHMEKVLEEFPALPYNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGSAFCPD 485
Query: 487 HSDWYGDSHSRNADRSVVSKFLQANPADFSTSKLQEMVRLMRENRFPAAFKCYYNFHKVG 546
HSDWYGDSHSRNADRSVVSKFLQA PADFSTSKLQEMVRLMRE R PAAFKCY+NFHK+G
Sbjct: 486 HSDWYGDSHSRNADRSVVSKFLQAKPADFSTSKLQEMVRLMRERRLPAAFKCYHNFHKIG 545
Query: 547 SISNDNLFYKMVVHVQSDSAFRRYQKEMRNKPGLWPLYRGFFMDINLFKANKDKAAEIVK 606
SISNDNLFYKMV+HVQSDSAFRRYQKEMR+KPGLWPLYRGFF+DINLFK NK+K AEIVK
Sbjct: 546 SISNDNLFYKMVIHVQSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKEKTAEIVK 605
Query: 607 GKNNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSIVFKEGPAAYKAY 666
KNNLME EGNGT+GRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSI+FKEG AAYKAY
Sbjct: 606 SKNNLMETEGNGTVGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEGSAAYKAY 665
Query: 667 YLRQMKLWGTSDGKQRELSKMLDEWAVFLRRKYGNKQLSSTTYLSEAEPFLEQYAKRSPQ 726
YLRQMKLWGTS GKQRELSKMLDEWAV+LRRKYGNKQLSS+ YLSEAEPFLEQYAKRSPQ
Sbjct: 666 YLRQMKLWGTSFGKQRELSKMLDEWAVYLRRKYGNKQLSSSIYLSEAEPFLEQYAKRSPQ 725
Query: 727 NQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEATPSSSLHSGKDVVPKADGLIVFF 786
NQALIGSAGNLVRAEDFLA+VEEGMDEEGDLQKE +A PSS + S KDVVPKA+GLIVFF
Sbjct: 726 NQALIGSAGNLVRAEDFLAVVEEGMDEEGDLQKE-DAAPSSPMLSRKDVVPKAEGLIVFF 785
Query: 787 PGIPGCAKSAICREILNAPGGLGDDRPLNSLMGDLIKGRYWQKVADELRRKPYSIMLADK 846
PGIPGCAKSA+CREILNAPGGLGDDRP+N+LMGDLIKGRYWQKVADE RRKPYSIMLADK
Sbjct: 786 PGIPGCAKSALCREILNAPGGLGDDRPVNTLMGDLIKGRYWQKVADERRRKPYSIMLADK 845
Query: 847 NAPNEEVWRQIEDMCHSTRASAVPVVPDSEGTDSNPFSFDALAVFMFRVLQRVNHPGNLD 906
NAPNEEVWRQIEDMCHSTRASAVPV+PDSEGTDSNPFS DALAVFMFRVL RVNHPGNLD
Sbjct: 846 NAPNEEVWRQIEDMCHSTRASAVPVIPDSEGTDSNPFSLDALAVFMFRVLHRVNHPGNLD 905
Query: 907 KASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGFLIKIPLLKSDRSPLPDNLKTVLEEG 966
KASPNAGYVLLMFYHLYEGKSRREFEGELIDRFG L+KIPLLKSDRSPLPDNLKT+LEEG
Sbjct: 906 KASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLVKIPLLKSDRSPLPDNLKTILEEG 965
Query: 967 LSLYKLHTRRHGRADSTKGSFAKEWAKWEKQLRETLFGNTEYLNSIQVTFEFAVQDVLEQ 1026
LSLYKLHT RHGRADSTKGS+AKEWAKWEKQLRETLFGN EYLN+IQV FEFAVQ+VLEQ
Sbjct: 966 LSLYKLHTSRHGRADSTKGSYAKEWAKWEKQLRETLFGNAEYLNAIQVPFEFAVQNVLEQ 1025
Query: 1027 LKKILKGDYKNLTTDRRKSATIVFAAVSLPILEIQNLLDTLGKINPHVGGFFKKNLKDFT 1086
LKKI KGDYK+ T+RRKSATIV+AAVSLP+ +IQ+ LDTLG NP V F K+ KD+T
Sbjct: 1026 LKKISKGDYKSPITERRKSATIVYAAVSLPVQDIQDALDTLGNKNPQVEAFIKEGYKDYT 1085
Query: 1087 LREVHVTLAHKRGHGVKAVADYGIFENKEVPVELTGLLFSNKMAAFEARLGSIEDERVIS 1146
L+ HVTLAHKR HG+KAVADYGIFENKEVPVELT LLFS+KMAAFEAR+GSIEDERVIS
Sbjct: 1086 LKSAHVTLAHKRSHGIKAVADYGIFENKEVPVELTALLFSDKMAAFEARVGSIEDERVIS 1145
Query: 1147 KNEWPHATIWTTEGVAAKEANTLPLLVPEGKATLVEINPPIVISGKVQFF 1179
KNEWPH T+WT EG+AAKEANTLP LV EGKATLVE+NPPI+ISGKVQFF
Sbjct: 1146 KNEWPHVTLWTREGIAAKEANTLPQLVSEGKATLVELNPPIIISGKVQFF 1194
BLAST of Sed0015984 vs. NCBI nr
Match:
XP_038894223.1 (tRNA ligase 1 isoform X1 [Benincasa hispida])
HSP 1 Score: 2001.1 bits (5183), Expect = 0.0e+00
Identity = 1014/1200 (84.50%), Positives = 1073/1200 (89.42%), Query Frame = 0
Query: 1 MSVSHKRVLCAITLP---LSSSLTFSSR---------TRFYIPHSL-LPFKASSPFPLSC 60
MS S +R+ CAITLP L + F+ R + F +P SL L SSPFPLS
Sbjct: 1 MSAS-QRIFCAITLPHPRLYAPSAFNYRAFPFICHPLSHFILPRSLTLAPLTSSPFPLSR 60
Query: 61 HSPFIMPYNQPRGGCKDQKWKAKTKADNTSMD---AAEVVTHALNKLSVTESGQPHVPIS 120
S FIMPYNQ +GG ++QKWK K K D S + AAEVVT+AL KL VTE+ QPHV S
Sbjct: 61 DSRFIMPYNQRKGGRREQKWKEKAKVDRNSTESEAAAEVVTNALGKLRVTENDQPHVLTS 120
Query: 121 STQFGNVQLTNQVPPGDGHRTIWKPKAYGTTSGAAVVEAENASAGRTSIQNKENSAGLAA 180
S QFGN QLTNQV PG HR +WKPKAYGTTSGAA VE E A TS +NK ++A LAA
Sbjct: 121 SAQFGNAQLTNQVTPGLAHRAVWKPKAYGTTSGAAEVEGEKAPTNGTSTENKGSNAELAA 180
Query: 181 QN------DIFKGNQIEKFTVDNYTYTHAQIRATFYPKFENEKSDQEIRSRMIEMVSKGL 240
QN +FKGNQIEKFTVDN TYT AQIRATFYPKFENEKSDQEIR+RMIEMVSKGL
Sbjct: 181 QNGAVGLSQLFKGNQIEKFTVDNSTYTRAQIRATFYPKFENEKSDQEIRTRMIEMVSKGL 240
Query: 241 ATLEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGSGAAKKQVEF 300
ATLEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMF+EAWG+ AAKKQ EF
Sbjct: 241 ATLEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFQEAWGAAAAKKQAEF 300
Query: 301 NDFLESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGNGKPKFYSTAEIIAFCRKW 360
NDFLESNRM ISMELVTAVLGDHGQRPREDYVVVTAVTELG GKPKFYSTAEIIAFCRKW
Sbjct: 301 NDFLESNRMSISMELVTAVLGDHGQRPREDYVVVTAVTELGKGKPKFYSTAEIIAFCRKW 360
Query: 361 RLPTNHVWLFSSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGEI 420
RLPTNHVWLFSSRKS TSFFA+FDALCEEGTAT+VCKALDEVAEISVPG+KDHIKVQGEI
Sbjct: 361 RLPTNHVWLFSSRKSATSFFAAFDALCEEGTATSVCKALDEVAEISVPGTKDHIKVQGEI 420
Query: 421 LEGLVARMVSHESSKHMEKVLEEFPALPDNEGGEFDLGPSLREICAANRSDEKQQIKALL 480
LEGLVAR+VSHESSKHMEKVLE+FPALPDNE G DLGPSLREICAANRSDEKQQIKALL
Sbjct: 421 LEGLVARIVSHESSKHMEKVLEDFPALPDNEVGGLDLGPSLREICAANRSDEKQQIKALL 480
Query: 481 QNVGSAFCPVHSDWYGDSHSRNADRSVVSKFLQANPADFSTSKLQEMVRLMRENRFPAAF 540
QNVGSAFCP HSDWYGDSHSRNADRSV+SKFLQANPADFSTSKLQEM+RLMRE R PAAF
Sbjct: 481 QNVGSAFCPDHSDWYGDSHSRNADRSVLSKFLQANPADFSTSKLQEMIRLMREKRLPAAF 540
Query: 541 KCYYNFHKVGSISNDNLFYKMVVHVQSDSAFRRYQKEMRNKPGLWPLYRGFFMDINLFKA 600
KCY+NFHKVGSISNDNLFYKMV+HV SDSAFRRYQKEMR+KPGLWPLYRGFF+DINLFK
Sbjct: 541 KCYHNFHKVGSISNDNLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKE 600
Query: 601 NKDKAAEIVKGKNNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSIVF 660
NKDK AE+VK KNNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSI+F
Sbjct: 601 NKDK-AELVKSKNNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILF 660
Query: 661 KEGPAAYKAYYLRQMKLWGTSDGKQRELSKMLDEWAVFLRRKYGNKQLSSTTYLSEAEPF 720
KEGPAAYKAYYLRQMKLWGTS GKQRELSKMLDEWAV+LRRKYGNKQLSS TYLSEAEPF
Sbjct: 661 KEGPAAYKAYYLRQMKLWGTSAGKQRELSKMLDEWAVYLRRKYGNKQLSSATYLSEAEPF 720
Query: 721 LEQYAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEATPSSSLHSGKDVV 780
LEQYAKRSPQNQALIGSAGNLV+AEDFLAIVEEGMDEEGDLQKEQEA PSS + SGKD V
Sbjct: 721 LEQYAKRSPQNQALIGSAGNLVKAEDFLAIVEEGMDEEGDLQKEQEAAPSSPMLSGKDAV 780
Query: 781 PKADGLIVFFPGIPGCAKSAICREILNAPGGLGDDRPLNSLMGDLIKGRYWQKVADELRR 840
PKA+GLIVFFPGIPGCAKSA+CREILNAPG LGDDRP+N+LMGDLIKGRYWQKVADE RR
Sbjct: 781 PKAEGLIVFFPGIPGCAKSALCREILNAPGALGDDRPVNTLMGDLIKGRYWQKVADERRR 840
Query: 841 KPYSIMLADKNAPNEEVWRQIEDMCHSTRASAVPVVPDSEGTDSNPFSFDALAVFMFRVL 900
KPYSIMLADKNAPNEEVWRQIEDMC STRASAVPVVPDSEGTDSNPFS DALAVFMFRVL
Sbjct: 841 KPYSIMLADKNAPNEEVWRQIEDMCRSTRASAVPVVPDSEGTDSNPFSLDALAVFMFRVL 900
Query: 901 QRVNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGFLIKIPLLKSDRSPLP 960
QRVNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFG L+KIPLLKSDR+PLP
Sbjct: 901 QRVNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLVKIPLLKSDRNPLP 960
Query: 961 DNLKTVLEEGLSLYKLHTRRHGRADSTKGSFAKEWAKWEKQLRETLFGNTEYLNSIQVTF 1020
+NLKT+LEEGLSLYKLHT RHGR DSTKGS+AKEW KWEKQLRETLFGNTEYLN+IQV F
Sbjct: 961 NNLKTILEEGLSLYKLHTSRHGRVDSTKGSYAKEWTKWEKQLRETLFGNTEYLNAIQVPF 1020
Query: 1021 EFAVQDVLEQLKKILKGDYKNLTTDRRKSATIVFAAVSLPILEIQNLLDTLGKINPHVGG 1080
EFAVQDVLEQLKKI KGD+K+ T+RRKS IVFAAV+LP+ EIQNLL TLGK NP V
Sbjct: 1021 EFAVQDVLEQLKKISKGDFKSPITERRKSGAIVFAAVNLPVQEIQNLLGTLGKKNPRVEA 1080
Query: 1081 FFKKNLKDFTLREVHVTLAHKRGHGVKAVADYGIFENKEVPVELTGLLFSNKMAAFEARL 1140
F K++ KD+TL+ HVTLAHKR HGVK VADYGIFENKEVPVELT LLFS+KMAAFEARL
Sbjct: 1081 FLKEHYKDYTLKGAHVTLAHKRSHGVKGVADYGIFENKEVPVELTALLFSDKMAAFEARL 1140
Query: 1141 GSIEDERVISKNEWPHATIWTTEGVAAKEANTLPLLVPEGKATLVEINPPIVISGKVQFF 1179
GSIE+ERVISKNEWPH T+WT EGVAAKEAN LP LV EGKATLVEINPPI ISG V+FF
Sbjct: 1141 GSIENERVISKNEWPHVTLWTREGVAAKEANALPQLVSEGKATLVEINPPINISGTVKFF 1198
BLAST of Sed0015984 vs. ExPASy Swiss-Prot
Match:
Q0WL81 (tRNA ligase 1 OS=Arabidopsis thaliana OX=3702 GN=RNL PE=1 SV=1)
HSP 1 Score: 1510.7 bits (3910), Expect = 0.0e+00
Identity = 755/1115 (67.71%), Positives = 900/1115 (80.72%), Query Frame = 0
Query: 74 DNTSMDAAEVVTHALNKLSVTES--GQPHVPISSTQFGNVQLTNQVPPGDGHRTIWKPKA 133
D+++ AE V + LS+ ES P +P +T VQ +WKPK+
Sbjct: 9 DSSATVVAEAVNNQFGGLSLKESNTNAPVLPSQTTSNHRVQ-----------NLVWKPKS 68
Query: 134 YGTTSGAAVVEAENASAGRTSIQNKENSAGLAA----QNDIFKGNQIEKFTVDNYTYTHA 193
YGT SG+ + G+TS ++ S+G + IF GN +EKF+VD TY HA
Sbjct: 69 YGTVSGS----SSATEVGKTSAVSQIGSSGDTKVGLNLSKIFGGNLLEKFSVDKSTYCHA 128
Query: 194 QIRATFYPKFENEKSDQEIRSRMIEMVSKGLATLEVSLKHSGSLFMYAGHEGGAYAKNSF 253
QIRATFYPKFENEK+DQEIR+RMIEMVSKGLATLEVSLKHSGSLFMYAGH+GGAYAKNSF
Sbjct: 129 QIRATFYPKFENEKTDQEIRTRMIEMVSKGLATLEVSLKHSGSLFMYAGHKGGAYAKNSF 188
Query: 254 GNIYTAVGVFVLGRMFREAWGSGAAKKQVEFNDFLESNRMCISMELVTAVLGDHGQRPRE 313
GNIYTAVGVFVL RMFREAWG+ A KK+ EFNDFLE NRMCISMELVTAVLGDHGQRP +
Sbjct: 189 GNIYTAVGVFVLSRMFREAWGTKAPKKEAEFNDFLEKNRMCISMELVTAVLGDHGQRPLD 248
Query: 314 DYVVVTAVTELGNGKPKFYSTAEIIAFCRKWRLPTNHVWLFSSRKSVTSFFASFDALCEE 373
DYVVVTAVTELGNGKP+FYST+EII+FCRKWRLPTNHVWLFS+RKSVTSFFA+FDALCEE
Sbjct: 249 DYVVVTAVTELGNGKPQFYSTSEIISFCRKWRLPTNHVWLFSTRKSVTSFFAAFDALCEE 308
Query: 374 GTATTVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVSHESSKHMEKVLEEFPALPD 433
G AT+VC+ALDEVA+ISVP SKDH+KVQGEILEGLVAR+VS +SS+ ME VL + P P
Sbjct: 309 GIATSVCRALDEVADISVPASKDHVKVQGEILEGLVARIVSSQSSRDMENVLRDHPP-PP 368
Query: 434 NEGGEFDLGPSLREICAANRSDEKQQIKALLQNVGSAFCPVHSDWYGD-SHSRNADRSVV 493
+G DLG SLREICAA+RS+EKQQ++ALL++VG +FCP +W+GD SH ++AD+SV+
Sbjct: 369 CDGANLDLGLSLREICAAHRSNEKQQMRALLRSVGPSFCPSDVEWFGDESHPKSADKSVI 428
Query: 494 SKFLQANPADFSTSKLQEMVRLMRENRFPAAFKCYYNFHKVGSISNDNLFYKMVVHVQSD 553
+KFLQ+ PAD+STSKLQEMVRLM+E R PAAFKCY+NFH+ IS DNLFYK+VVHV SD
Sbjct: 429 TKFLQSQPADYSTSKLQEMVRLMKEKRLPAAFKCYHNFHRAEDISPDNLFYKLVVHVHSD 488
Query: 554 SAFRRYQKEMRNKPGLWPLYRGFFMDINLFKANKDKAAEIVKGKNNLMEIEGNGTLGRDG 613
S FRRY KEMR+ P LWPLYRGFF+DINLFK+NK + +K +N E +G G +DG
Sbjct: 489 SGFRRYHKEMRHMPSLWPLYRGFFVDINLFKSNKGRDLMALKSIDNASENDGRGE--KDG 548
Query: 614 FADEDANLMIKLKFLTYKLRTFLIRNGLSIVFKEGPAAYKAYYLRQMKLWGTSDGKQREL 673
AD+DANLMIK+KFLTYKLRTFLIRNGLSI+FK+G AAYK YYLRQMK+WGTSDGKQ+EL
Sbjct: 549 LADDDANLMIKMKFLTYKLRTFLIRNGLSILFKDGAAAYKTYYLRQMKIWGTSDGKQKEL 608
Query: 674 SKMLDEWAVFLRRKYGNKQLSSTTYLSEAEPFLEQYAKRSPQNQALIGSAGNLVRAEDFL 733
KMLDEWA ++RRK GN QLSS+TYLSEAEPFLEQYAKRSP+N LIGSAGNLVR EDFL
Sbjct: 609 CKMLDEWAAYIRRKCGNDQLSSSTYLSEAEPFLEQYAKRSPKNHILIGSAGNLVRTEDFL 668
Query: 734 AIVEEGMDEEGDLQKEQEATPSSSLHSGKDVVPKADGLIVFFPGIPGCAKSAICREILNA 793
AIV+ +DEEGDL K+Q TP++ + K+ V K +GLIVFFPGIPG AKSA+C+E+LNA
Sbjct: 669 AIVDGDLDEEGDLVKKQGVTPATPEPAVKEAVQKDEGLIVFFPGIPGSAKSALCKELLNA 728
Query: 794 PGGLGDDRPLNSLMGDLIKGRYWQKVADELRRKPYSIMLADKNAPNEEVWRQIEDMCHST 853
PGG GDDRP+++LMGDL+KG+YW KVADE R+KP SIMLADKNAPNE+VWRQIEDMC T
Sbjct: 729 PGGFGDDRPVHTLMGDLVKGKYWPKVADERRKKPQSIMLADKNAPNEDVWRQIEDMCRRT 788
Query: 854 RASAVPVVPDSEGTDSNPFSFDALAVFMFRVLQRVNHPGNLDKASPNAGYVLLMFYHLYE 913
RASAVP+V DSEGTD+NP+S DALAVFMFRVLQRVNHPG LDK S NAGYVLLMFYHLYE
Sbjct: 789 RASAVPIVADSEGTDTNPYSLDALAVFMFRVLQRVNHPGKLDKESSNAGYVLLMFYHLYE 848
Query: 914 GKSRREFEGELIDRFGFLIKIPLLKSDRSPLPDNLKTVLEEGLSLYKLHTRRHGRADSTK 973
GK+R EFE ELI+RFG LIK+PLLKSDR+PLPD +K+VLEEG+ L+ LH+RRHGR +STK
Sbjct: 849 GKNRNEFESELIERFGSLIKMPLLKSDRTPLPDPVKSVLEEGIDLFNLHSRRHGRLESTK 908
Query: 974 GSFAKEWAKWEKQLRETLFGNTEYLNSIQVTFEFAVQDVLEQLKKILKGDYKNLTTDRRK 1033
G++A EW KWEKQLR+TL N+EYL+SIQV FE V V E+LK I KGDYK ++++RK
Sbjct: 909 GTYAAEWTKWEKQLRDTLVANSEYLSSIQVPFESMVHQVREELKTIAKGDYKPPSSEKRK 968
Query: 1034 SATIVFAAVSLPILEIQNLLDTLGKINPHVGGFF---KKNLKDFTLREVHVTLAHKRGHG 1093
+IVFAA++LP ++ +LL+ L NP + F KK++++ L HVTLAHKR HG
Sbjct: 969 HGSIVFAAINLPATQVHSLLEKLAAANPTMRSFLEGKKKSIQE-KLERSHVTLAHKRSHG 1028
Query: 1094 VKAVADYGIFENKEVPVELTGLLFSNKMAAFEARLGSIEDERVISKNEWPHATIWTTEGV 1153
V VA Y N+EVPVELT L++++KMAA A +GS++ E V+SKNEWPH T+WT EGV
Sbjct: 1029 VATVASYSQHLNREVPVELTELIYNDKMAALTAHVGSVDGETVVSKNEWPHVTLWTAEGV 1088
Query: 1154 AAKEANTLPLLVPEGKATLVEINPPIVISGKVQFF 1179
AKEANTLP L EGKA+ + I+PP+ ISG ++FF
Sbjct: 1089 TAKEANTLPQLYLEGKASRLVIDPPVSISGPLEFF 1104
BLAST of Sed0015984 vs. ExPASy TrEMBL
Match:
A0A6J1DUP6 (tRNA ligase 1 OS=Momordica charantia OX=3673 GN=LOC111024537 PE=4 SV=1)
HSP 1 Score: 2011.5 bits (5210), Expect = 0.0e+00
Identity = 1016/1198 (84.81%), Positives = 1075/1198 (89.73%), Query Frame = 0
Query: 1 MSVSHKRVLCAITLP---------LSSSLTFSSRTRFYIPHSL-LPFKASSPFPLSCHSP 60
MS SH R+ CAITLP L +S F S + F P SL LP SSPF LS HS
Sbjct: 1 MSASH-RIFCAITLPHPPRFSPSSLFNSRAFLSTSHFIFPRSLALPPLISSPFHLSPHSR 60
Query: 61 FIMPYNQPRGGCKDQKWKAKTKADNTSMD---AAEVVTHALNKLSVTESGQPHVPISSTQ 120
IMPYNQ G ++QKWK K K D TS + AAEVVT+AL KL V+ESGQPHVPISS +
Sbjct: 61 SIMPYNQRSDGRREQKWKEKAKLDRTSTESEAAAEVVTNALGKLRVSESGQPHVPISSRE 120
Query: 121 FGNVQLTNQVPPGDGHRTIWKPKAYGTTS-GAAVVEAENASAGRTSIQNKENSAGLAAQN 180
FGN QLTNQVP G G+R IWKPKAYGTTS GAAVVEAE A A TSI+NK N+AGLAAQN
Sbjct: 121 FGNAQLTNQVPSGLGNRGIWKPKAYGTTSGGAAVVEAEKAPAVGTSIENKGNTAGLAAQN 180
Query: 181 ------DIFKGNQIEKFTVDNYTYTHAQIRATFYPKFENEKSDQEIRSRMIEMVSKGLAT 240
+FKGNQIE FTVDN TYT AQIRATFYPKFENEKSDQEIR+RMIEMVSKGLAT
Sbjct: 181 GTVGLSQLFKGNQIENFTVDNSTYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLAT 240
Query: 241 LEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGSGAAKKQVEFND 300
LEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGS AAKKQ EFN+
Sbjct: 241 LEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGSKAAKKQAEFNN 300
Query: 301 FLESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGNGKPKFYSTAEIIAFCRKWRL 360
FLESNRMCISMELVTAVLGDHGQRPREDYVVVTAVT+LGNGKPKFYSTAEII FCR+WRL
Sbjct: 301 FLESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTDLGNGKPKFYSTAEIIVFCREWRL 360
Query: 361 PTNHVWLFSSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGEILE 420
PTNHVWLFSSRKSVTSFFA+FDALCEEGTAT+VCKALDEVAEISVPGSKDHIKVQGEILE
Sbjct: 361 PTNHVWLFSSRKSVTSFFAAFDALCEEGTATSVCKALDEVAEISVPGSKDHIKVQGEILE 420
Query: 421 GLVARMVSHESSKHMEKVLEEFPALPDNEGGEFDLGPSLREICAANRSDEKQQIKALLQN 480
GLVAR+VSHESSKHMEKVLEEFP+LPD EGG DLG SLREICAANRSDEKQQIKALLQN
Sbjct: 421 GLVARIVSHESSKHMEKVLEEFPSLPDEEGGGLDLGRSLREICAANRSDEKQQIKALLQN 480
Query: 481 VGSAFCPVHSDWYGDSHSRNADRSVVSKFLQANPADFSTSKLQEMVRLMRENRFPAAFKC 540
VGS+FCP HSDW GDSHSR ADRSV+SKFLQ +P DFSTSKLQEM+RLMRE R PAAFKC
Sbjct: 481 VGSSFCPDHSDWSGDSHSRTADRSVLSKFLQTSPTDFSTSKLQEMIRLMREKRLPAAFKC 540
Query: 541 YYNFHKVGSISNDNLFYKMVVHVQSDSAFRRYQKEMRNKPGLWPLYRGFFMDINLFKANK 600
Y+NFHKVGSISND+LFYKMV+HV SDSAFRRYQKEMR+KPGLWPLYRGFF+DINLFKANK
Sbjct: 541 YHNFHKVGSISNDDLFYKMVIHVHSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKANK 600
Query: 601 DKAAEIVKGKNNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSIVFKE 660
DKAAEI+K K+NLME+EGNG LGRDG ADEDANLMIKLKFLTYKLRTFLIRNGLSI+FKE
Sbjct: 601 DKAAEIMKSKSNLMEVEGNGILGRDGLADEDANLMIKLKFLTYKLRTFLIRNGLSILFKE 660
Query: 661 GPAAYKAYYLRQMKLWGTSDGKQRELSKMLDEWAVFLRRKYGNKQLSSTTYLSEAEPFLE 720
GPAAYKAYYLRQMKLWGTS GKQRELSKMLDEWAV+LRRKYGN+QLSS TYLSEAEPFLE
Sbjct: 661 GPAAYKAYYLRQMKLWGTSVGKQRELSKMLDEWAVYLRRKYGNRQLSSATYLSEAEPFLE 720
Query: 721 QYAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEATPSSSLHSGKDVVPK 780
QYAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQE PSS + GKD V K
Sbjct: 721 QYAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEVAPSSPMLPGKDTVSK 780
Query: 781 ADGLIVFFPGIPGCAKSAICREILNAPGGLGDDRPLNSLMGDLIKGRYWQKVADELRRKP 840
A+GLIVFFPGIPGCAKSA+CREILNAPGGLGDDRP+ SLMGDLIKGRYWQKV DE RRKP
Sbjct: 781 AEGLIVFFPGIPGCAKSALCREILNAPGGLGDDRPVKSLMGDLIKGRYWQKVVDERRRKP 840
Query: 841 YSIMLADKNAPNEEVWRQIEDMCHSTRASAVPVVPDSEGTDSNPFSFDALAVFMFRVLQR 900
YSIMLADKNAPNEEVWRQIEDMCHSTRASAVPVVPDSEGTD NPFS DALAVFMFRVLQR
Sbjct: 841 YSIMLADKNAPNEEVWRQIEDMCHSTRASAVPVVPDSEGTDGNPFSLDALAVFMFRVLQR 900
Query: 901 VNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGFLIKIPLLKSDRSPLPDN 960
VNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFE ELIDRFG L+K+PLLK DRSPLPDN
Sbjct: 901 VNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEDELIDRFGSLVKMPLLKCDRSPLPDN 960
Query: 961 LKTVLEEGLSLYKLHTRRHGRADSTKGSFAKEWAKWEKQLRETLFGNTEYLNSIQVTFEF 1020
LKT+LEEGLSLYKLHT RHGRADSTKGS+AKEWAKWEKQLRETLFGNTEYLNSIQV FE
Sbjct: 961 LKTILEEGLSLYKLHTSRHGRADSTKGSYAKEWAKWEKQLRETLFGNTEYLNSIQVPFEV 1020
Query: 1021 AVQDVLEQLKKILKGDYKNLTTDRRKSATIVFAAVSLPILEIQNLLDTLGKINPHVGGFF 1080
AVQDVLEQLKKI KGDYK ++RRKSATIVFAAVSLP+ EIQNLLDTLGK NPHV F
Sbjct: 1021 AVQDVLEQLKKIAKGDYKTPISERRKSATIVFAAVSLPVQEIQNLLDTLGKKNPHVESFL 1080
Query: 1081 KKNLKDFTLREVHVTLAHKRGHGVKAVADYGIFENKEVPVELTGLLFSNKMAAFEARLGS 1140
K++ KD+TL+ HVTLAHKR HGVKAVADYGIF+NKEVPVELT LLFS+KMAAFEA LGS
Sbjct: 1081 KQDYKDYTLKAAHVTLAHKRSHGVKAVADYGIFQNKEVPVELTALLFSDKMAAFEAHLGS 1140
Query: 1141 IEDERVISKNEWPHATIWTTEGVAAKEANTLPLLVPEGKATLVEINPPIVISGKVQFF 1179
+EDERV+SKNEWPH T+WT EGVAAKEANTLP LV EGKATLVE+NPP +ISG V+FF
Sbjct: 1141 VEDERVVSKNEWPHVTLWTREGVAAKEANTLPQLVSEGKATLVELNPPTIISGTVKFF 1197
BLAST of Sed0015984 vs. ExPASy TrEMBL
Match:
A0A6J1HM92 (tRNA ligase 1 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111464863 PE=4 SV=1)
HSP 1 Score: 2006.9 bits (5198), Expect = 0.0e+00
Identity = 1009/1190 (84.79%), Positives = 1070/1190 (89.92%), Query Frame = 0
Query: 7 RVLCAITLPLSSSLTFSSRT--------RFYIPHSLLPFKAS-SPFPLSCHSPFIMPYNQ 66
R+ CAITLPLSSS SR +I H L AS PF + S F MPYNQ
Sbjct: 6 RIFCAITLPLSSSPALHSRAFPFVSCSLSHFILHPSLTLPASVFPFTVCRDSRFTMPYNQ 65
Query: 67 PRGGCKDQKWKAKTKADNTSMD---AAEVVTHALNKLSVTESGQPHVPISSTQFGNVQLT 126
RGG ++QKWK K K + S + A+EVVT+AL+ L VTES QPH+PI+S QFGN Q T
Sbjct: 66 RRGGRREQKWKEKAKVEGISTESETASEVVTNALSNLRVTESNQPHIPITSVQFGNAQPT 125
Query: 127 NQVPPGDGHRTIWKPKAYGTTSGAAVVEAENASAGRTSIQNKENSAGLAAQ------NDI 186
N PG GHR IWKPKAYGTTSGAAVVE E A A TSI+NK ++A +AA + +
Sbjct: 126 NLATPGLGHRAIWKPKAYGTTSGAAVVEGEKAPAVGTSIENKGSNAEIAANSSAIALSQL 185
Query: 187 FKGNQIEKFTVDNYTYTHAQIRATFYPKFENEKSDQEIRSRMIEMVSKGLATLEVSLKHS 246
KGNQIE+FTVDN YT AQIRATFYPKFENEKSDQEIR+RMIEMVSKGLATLEVSLKHS
Sbjct: 186 LKGNQIEQFTVDNSAYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEVSLKHS 245
Query: 247 GSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGSGAAKKQVEFNDFLESNRMC 306
GSLFMYAGH+GGAYAKNSFGNIYTAVGVFVLGRMFREAWGS A KKQ EFNDFLESNRMC
Sbjct: 246 GSLFMYAGHQGGAYAKNSFGNIYTAVGVFVLGRMFREAWGSVAPKKQAEFNDFLESNRMC 305
Query: 307 ISMELVTAVLGDHGQRPREDYVVVTAVTELGNGKPKFYSTAEIIAFCRKWRLPTNHVWLF 366
ISMELVTAVLGDHGQRPREDYVVVTAVTELGNGKPKFYST+EIIAFCRKWRLPTNHVWLF
Sbjct: 306 ISMELVTAVLGDHGQRPREDYVVVTAVTELGNGKPKFYSTSEIIAFCRKWRLPTNHVWLF 365
Query: 367 SSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVS 426
SSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVS
Sbjct: 366 SSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVS 425
Query: 427 HESSKHMEKVLEEFPALPDNEGGEFDLGPSLREICAANRSDEKQQIKALLQNVGSAFCPV 486
HESSKHMEKVLEEFPALP NEGG DLGPSLREICAANRSDEKQQIKALLQNVGSAFCP
Sbjct: 426 HESSKHMEKVLEEFPALPYNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGSAFCPD 485
Query: 487 HSDWYGDSHSRNADRSVVSKFLQANPADFSTSKLQEMVRLMRENRFPAAFKCYYNFHKVG 546
HSDWYGDSHSRNADRSVVSKFLQA PADFSTSKLQEMVRLMRE R PAAFKCY+NFHK+G
Sbjct: 486 HSDWYGDSHSRNADRSVVSKFLQAKPADFSTSKLQEMVRLMRERRLPAAFKCYHNFHKIG 545
Query: 547 SISNDNLFYKMVVHVQSDSAFRRYQKEMRNKPGLWPLYRGFFMDINLFKANKDKAAEIVK 606
SISNDNLFYKMV+HVQSDSAFRRYQKEMR+KPGLWPLYRGFF+DINLFK NK+K AEIVK
Sbjct: 546 SISNDNLFYKMVIHVQSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKEKTAEIVK 605
Query: 607 GKNNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSIVFKEGPAAYKAY 666
KNNLME EGNGT+GRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSI+FKEG AAYKAY
Sbjct: 606 SKNNLMETEGNGTVGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEGSAAYKAY 665
Query: 667 YLRQMKLWGTSDGKQRELSKMLDEWAVFLRRKYGNKQLSSTTYLSEAEPFLEQYAKRSPQ 726
YLRQMKLWGTS GKQRELSKMLDEWAV+LRRKYGNKQLSS+ YLSEAEPFLEQYAKRSPQ
Sbjct: 666 YLRQMKLWGTSFGKQRELSKMLDEWAVYLRRKYGNKQLSSSIYLSEAEPFLEQYAKRSPQ 725
Query: 727 NQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEATPSSSLHSGKDVVPKADGLIVFF 786
NQALIGSAGNLVRAEDFLA+VEEGMDEEGDLQKE +A PSS + S KDVVPKA+GLIVFF
Sbjct: 726 NQALIGSAGNLVRAEDFLAVVEEGMDEEGDLQKE-DAAPSSPMLSRKDVVPKAEGLIVFF 785
Query: 787 PGIPGCAKSAICREILNAPGGLGDDRPLNSLMGDLIKGRYWQKVADELRRKPYSIMLADK 846
PGIPGCAKSA+CREILNAPGGLGDDRP+N+LMGDLIKGRYWQKVADE RRKPYSIMLADK
Sbjct: 786 PGIPGCAKSALCREILNAPGGLGDDRPVNTLMGDLIKGRYWQKVADERRRKPYSIMLADK 845
Query: 847 NAPNEEVWRQIEDMCHSTRASAVPVVPDSEGTDSNPFSFDALAVFMFRVLQRVNHPGNLD 906
NAPNEEVWRQIEDMCHSTRASAVPV+PDSEGTDSNPFS DALAVFMFRVLQRVNHPGNLD
Sbjct: 846 NAPNEEVWRQIEDMCHSTRASAVPVIPDSEGTDSNPFSLDALAVFMFRVLQRVNHPGNLD 905
Query: 907 KASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGFLIKIPLLKSDRSPLPDNLKTVLEEG 966
KASPNAGYVLLMFYHLYEGKSRREFEGELIDRFG L+KIPLLKSDRSPLPDNLKT+LEEG
Sbjct: 906 KASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLVKIPLLKSDRSPLPDNLKTILEEG 965
Query: 967 LSLYKLHTRRHGRADSTKGSFAKEWAKWEKQLRETLFGNTEYLNSIQVTFEFAVQDVLEQ 1026
LSLYKLHT RHGRADSTKGS+AKEWAKWEKQLRETLFGN EYLN+IQV FEFAVQ+VLEQ
Sbjct: 966 LSLYKLHTSRHGRADSTKGSYAKEWAKWEKQLRETLFGNAEYLNAIQVPFEFAVQNVLEQ 1025
Query: 1027 LKKILKGDYKNLTTDRRKSATIVFAAVSLPILEIQNLLDTLGKINPHVGGFFKKNLKDFT 1086
LKKI KGDYK+ T+RRKSATIV+AAVSLP+ +IQ+ LDTLG NP V F K+ KD+T
Sbjct: 1026 LKKISKGDYKSPITERRKSATIVYAAVSLPVQDIQDALDTLGNKNPQVEAFIKEGYKDYT 1085
Query: 1087 LREVHVTLAHKRGHGVKAVADYGIFENKEVPVELTGLLFSNKMAAFEARLGSIEDERVIS 1146
L+ HVTLAHKR HG+KAVADYGIFENKEVPVELT LLFS+KMAAFEAR+GSIEDERVIS
Sbjct: 1086 LKSAHVTLAHKRSHGIKAVADYGIFENKEVPVELTALLFSDKMAAFEARVGSIEDERVIS 1145
Query: 1147 KNEWPHATIWTTEGVAAKEANTLPLLVPEGKATLVEINPPIVISGKVQFF 1179
KNEWPH T+WT EG+AAKEANTLP LV EGKATLVE+NPPI+ISGKVQFF
Sbjct: 1146 KNEWPHVTLWTREGIAAKEANTLPQLVSEGKATLVELNPPIIISGKVQFF 1194
BLAST of Sed0015984 vs. ExPASy TrEMBL
Match:
A0A6J1HM43 (tRNA ligase 1 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111464863 PE=4 SV=1)
HSP 1 Score: 1985.7 bits (5143), Expect = 0.0e+00
Identity = 987/1135 (86.96%), Positives = 1045/1135 (92.07%), Query Frame = 0
Query: 53 MPYNQPRGGCKDQKWKAKTKADNTSMD---AAEVVTHALNKLSVTESGQPHVPISSTQFG 112
MPYNQ RGG ++QKWK K K + S + A+EVVT+AL+ L VTES QPH+PI+S QFG
Sbjct: 1 MPYNQRRGGRREQKWKEKAKVEGISTESETASEVVTNALSNLRVTESNQPHIPITSVQFG 60
Query: 113 NVQLTNQVPPGDGHRTIWKPKAYGTTSGAAVVEAENASAGRTSIQNKENSAGLAAQ---- 172
N Q TN PG GHR IWKPKAYGTTSGAAVVE E A A TSI+NK ++A +AA
Sbjct: 61 NAQPTNLATPGLGHRAIWKPKAYGTTSGAAVVEGEKAPAVGTSIENKGSNAEIAANSSAI 120
Query: 173 --NDIFKGNQIEKFTVDNYTYTHAQIRATFYPKFENEKSDQEIRSRMIEMVSKGLATLEV 232
+ + KGNQIE+FTVDN YT AQIRATFYPKFENEKSDQEIR+RMIEMVSKGLATLEV
Sbjct: 121 ALSQLLKGNQIEQFTVDNSAYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKGLATLEV 180
Query: 233 SLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGSGAAKKQVEFNDFLE 292
SLKHSGSLFMYAGH+GGAYAKNSFGNIYTAVGVFVLGRMFREAWGS A KKQ EFNDFLE
Sbjct: 181 SLKHSGSLFMYAGHQGGAYAKNSFGNIYTAVGVFVLGRMFREAWGSVAPKKQAEFNDFLE 240
Query: 293 SNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGNGKPKFYSTAEIIAFCRKWRLPTN 352
SNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGNGKPKFYST+EIIAFCRKWRLPTN
Sbjct: 241 SNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGNGKPKFYSTSEIIAFCRKWRLPTN 300
Query: 353 HVWLFSSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGEILEGLV 412
HVWLFSSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGEILEGLV
Sbjct: 301 HVWLFSSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGEILEGLV 360
Query: 413 ARMVSHESSKHMEKVLEEFPALPDNEGGEFDLGPSLREICAANRSDEKQQIKALLQNVGS 472
ARMVSHESSKHMEKVLEEFPALP NEGG DLGPSLREICAANRSDEKQQIKALLQNVGS
Sbjct: 361 ARMVSHESSKHMEKVLEEFPALPYNEGGGLDLGPSLREICAANRSDEKQQIKALLQNVGS 420
Query: 473 AFCPVHSDWYGDSHSRNADRSVVSKFLQANPADFSTSKLQEMVRLMRENRFPAAFKCYYN 532
AFCP HSDWYGDSHSRNADRSVVSKFLQA PADFSTSKLQEMVRLMRE R PAAFKCY+N
Sbjct: 421 AFCPDHSDWYGDSHSRNADRSVVSKFLQAKPADFSTSKLQEMVRLMRERRLPAAFKCYHN 480
Query: 533 FHKVGSISNDNLFYKMVVHVQSDSAFRRYQKEMRNKPGLWPLYRGFFMDINLFKANKDKA 592
FHK+GSISNDNLFYKMV+HVQSDSAFRRYQKEMR+KPGLWPLYRGFF+DINLFK NK+K
Sbjct: 481 FHKIGSISNDNLFYKMVIHVQSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFKENKEKT 540
Query: 593 AEIVKGKNNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSIVFKEGPA 652
AEIVK KNNLME EGNGT+GRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSI+FKEG A
Sbjct: 541 AEIVKSKNNLMETEGNGTVGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSILFKEGSA 600
Query: 653 AYKAYYLRQMKLWGTSDGKQRELSKMLDEWAVFLRRKYGNKQLSSTTYLSEAEPFLEQYA 712
AYKAYYLRQMKLWGTS GKQRELSKMLDEWAV+LRRKYGNKQLSS+ YLSEAEPFLEQYA
Sbjct: 601 AYKAYYLRQMKLWGTSFGKQRELSKMLDEWAVYLRRKYGNKQLSSSIYLSEAEPFLEQYA 660
Query: 713 KRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEATPSSSLHSGKDVVPKADG 772
KRSPQNQALIGSAGNLVRAEDFLA+VEEGMDEEGDLQKE +A PSS + S KDVVPKA+G
Sbjct: 661 KRSPQNQALIGSAGNLVRAEDFLAVVEEGMDEEGDLQKE-DAAPSSPMLSRKDVVPKAEG 720
Query: 773 LIVFFPGIPGCAKSAICREILNAPGGLGDDRPLNSLMGDLIKGRYWQKVADELRRKPYSI 832
LIVFFPGIPGCAKSA+CREILNAPGGLGDDRP+N+LMGDLIKGRYWQKVADE RRKPYSI
Sbjct: 721 LIVFFPGIPGCAKSALCREILNAPGGLGDDRPVNTLMGDLIKGRYWQKVADERRRKPYSI 780
Query: 833 MLADKNAPNEEVWRQIEDMCHSTRASAVPVVPDSEGTDSNPFSFDALAVFMFRVLQRVNH 892
MLADKNAPNEEVWRQIEDMCHSTRASAVPV+PDSEGTDSNPFS DALAVFMFRVLQRVNH
Sbjct: 781 MLADKNAPNEEVWRQIEDMCHSTRASAVPVIPDSEGTDSNPFSLDALAVFMFRVLQRVNH 840
Query: 893 PGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGFLIKIPLLKSDRSPLPDNLKT 952
PGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFG L+KIPLLKSDRSPLPDNLKT
Sbjct: 841 PGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGSLVKIPLLKSDRSPLPDNLKT 900
Query: 953 VLEEGLSLYKLHTRRHGRADSTKGSFAKEWAKWEKQLRETLFGNTEYLNSIQVTFEFAVQ 1012
+LEEGLSLYKLHT RHGRADSTKGS+AKEWAKWEKQLRETLFGN EYLN+IQV FEFAVQ
Sbjct: 901 ILEEGLSLYKLHTSRHGRADSTKGSYAKEWAKWEKQLRETLFGNAEYLNAIQVPFEFAVQ 960
Query: 1013 DVLEQLKKILKGDYKNLTTDRRKSATIVFAAVSLPILEIQNLLDTLGKINPHVGGFFKKN 1072
+VLEQLKKI KGDYK+ T+RRKSATIV+AAVSLP+ +IQ+ LDTLG NP V F K+
Sbjct: 961 NVLEQLKKISKGDYKSPITERRKSATIVYAAVSLPVQDIQDALDTLGNKNPQVEAFIKEG 1020
Query: 1073 LKDFTLREVHVTLAHKRGHGVKAVADYGIFENKEVPVELTGLLFSNKMAAFEARLGSIED 1132
KD+TL+ HVTLAHKR HG+KAVADYGIFENKEVPVELT LLFS+KMAAFEAR+GSIED
Sbjct: 1021 YKDYTLKSAHVTLAHKRSHGIKAVADYGIFENKEVPVELTALLFSDKMAAFEARVGSIED 1080
Query: 1133 ERVISKNEWPHATIWTTEGVAAKEANTLPLLVPEGKATLVEINPPIVISGKVQFF 1179
ERVISKNEWPH T+WT EG+AAKEANTLP LV EGKATLVE+NPPI+ISGKVQFF
Sbjct: 1081 ERVISKNEWPHVTLWTREGIAAKEANTLPQLVSEGKATLVELNPPIIISGKVQFF 1134
BLAST of Sed0015984 vs. ExPASy TrEMBL
Match:
A0A1S3CK49 (uncharacterized protein LOC103501711 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103501711 PE=4 SV=1)
HSP 1 Score: 1974.9 bits (5115), Expect = 0.0e+00
Identity = 983/1163 (84.52%), Positives = 1048/1163 (90.11%), Query Frame = 0
Query: 26 TRFYIPHSL-LPFKASSPFPLSCHSPFIMPYNQPRGGCKDQKWKAKTKADNT---SMDAA 85
+ F +P SL L SSPFPLSC S F+MPYNQ RGG +QKWK K K D + S A
Sbjct: 33 SHFILPRSLTLAPLTSSPFPLSCDSRFVMPYNQRRGGRGEQKWKEKAKVDKSPTESEAAV 92
Query: 86 EVVTHALNKLSVTESGQPHVPISSTQFGNVQLTNQVPPGDGHRTIWKPKAYGTTSGAAVV 145
EVVT+AL KL VTES Q HV SS QFGN QLTNQ PG HR IWKPKAYGTTSGAAV+
Sbjct: 93 EVVTNALGKLRVTESDQSHVLTSSAQFGNAQLTNQAIPGLAHRAIWKPKAYGTTSGAAVI 152
Query: 146 EAENASAGRTSIQNKENSAGLAAQ------NDIFKGNQIEKFTVDNYTYTHAQIRATFYP 205
E E AS TS +NK ++AGLA Q + +FK NQIEKF VDN TYT AQIRATFYP
Sbjct: 153 EGEKASTNGTSTENKGSNAGLAVQGGAVGLSQLFKSNQIEKFIVDNSTYTQAQIRATFYP 212
Query: 206 KFENEKSDQEIRSRMIEMVSKGLATLEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVG 265
KFENEKSDQEIR+RMIEMVSKGLATLEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVG
Sbjct: 213 KFENEKSDQEIRTRMIEMVSKGLATLEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVG 272
Query: 266 VFVLGRMFREAWGSGAAKKQVEFNDFLESNRMCISMELVTAVLGDHGQRPREDYVVVTAV 325
VFVLGRMFREAWG+ AAKKQ EFNDFL+SNRMCISMELVTAVLGDHGQRPREDYVVVTAV
Sbjct: 273 VFVLGRMFREAWGAEAAKKQAEFNDFLQSNRMCISMELVTAVLGDHGQRPREDYVVVTAV 332
Query: 326 TELGNGKPKFYSTAEIIAFCRKWRLPTNHVWLFSSRKSVTSFFASFDALCEEGTATTVCK 385
TELG GKPKFYSTAEIIAFCR WRLPTNHVWLFSSRKSVTSFFA+FDALCEEGTAT+VCK
Sbjct: 333 TELGKGKPKFYSTAEIIAFCRNWRLPTNHVWLFSSRKSVTSFFAAFDALCEEGTATSVCK 392
Query: 386 ALDEVAEISVPGSKDHIKVQGEILEGLVARMVSHESSKHMEKVLEEFPALPDNEGGEFDL 445
ALDEVAEISVPGSKDHIKVQGEILEGLVARMVSHESSKHM+KVLEEFPA+PDNEGG DL
Sbjct: 393 ALDEVAEISVPGSKDHIKVQGEILEGLVARMVSHESSKHMQKVLEEFPAVPDNEGGGLDL 452
Query: 446 GPSLREICAANRSDEKQQIKALLQNVGSAFCPVHSDWYGDSHSRNADRSVVSKFLQANPA 505
GPSLREICAANRSDEKQQIKALLQNVG+AFCP HSDWYGDSHSRNADRSV+SKFLQANPA
Sbjct: 453 GPSLREICAANRSDEKQQIKALLQNVGTAFCPDHSDWYGDSHSRNADRSVLSKFLQANPA 512
Query: 506 DFSTSKLQEMVRLMRENRFPAAFKCYYNFHKVGSISNDNLFYKMVVHVQSDSAFRRYQKE 565
DFSTSKLQEM+RLMRE R PAAFKCY+NFHKV SISNDNLFYKMV+HV SDSAFRRYQKE
Sbjct: 513 DFSTSKLQEMIRLMRERRLPAAFKCYHNFHKVASISNDNLFYKMVIHVHSDSAFRRYQKE 572
Query: 566 MRNKPGLWPLYRGFFMDINLFKANKDKAAEIVKGKNNLMEIEGNGTLGRDGFADEDANLM 625
MR+KPGLWPLYRGFF+DINLFK NKDKAA +VK K+NLM+ EGNGTLGRDGFADED+NLM
Sbjct: 573 MRHKPGLWPLYRGFFVDINLFKENKDKAAGLVKSKSNLMDTEGNGTLGRDGFADEDSNLM 632
Query: 626 IKLKFLTYKLRTFLIRNGLSIVFKEGPAAYKAYYLRQMKLWGTSDGKQRELSKMLDEWAV 685
IKLKFLTYKLRTFLIRNGLSI+FKEGP AYKAYYLRQMKLWGTS GKQRELSKMLDEWAV
Sbjct: 633 IKLKFLTYKLRTFLIRNGLSILFKEGPVAYKAYYLRQMKLWGTSAGKQRELSKMLDEWAV 692
Query: 686 FLRRKYGNKQLSSTTYLSEAEPFLEQYAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDE 745
++RRKYGNKQLSS TYLSEAEPFLEQYAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDE
Sbjct: 693 YIRRKYGNKQLSSATYLSEAEPFLEQYAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDE 752
Query: 746 EGDLQKEQEATPSSSLHSGKDVVPKADGLIVFFPGIPGCAKSAICREILNAPGGLGDDRP 805
EGDLQKEQEA PSS + SGKD VPKA+GLIVFFPGIPGCAKSA+CREILNAPG LGDDRP
Sbjct: 753 EGDLQKEQEAAPSSPMLSGKDAVPKAEGLIVFFPGIPGCAKSALCREILNAPGALGDDRP 812
Query: 806 LNSLMGDLIKGRYWQKVADELRRKPYSIMLADKNAPNEEVWRQIEDMCHSTRASAVPVVP 865
+N+LMGDLIKGRYWQKVADE R+KPYSIMLADKNAPNEEVWRQIEDMC STRASAVPV+P
Sbjct: 813 VNTLMGDLIKGRYWQKVADERRKKPYSIMLADKNAPNEEVWRQIEDMCRSTRASAVPVIP 872
Query: 866 DSEGTDSNPFSFDALAVFMFRVLQRVNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEG 925
DSEGTDSNPFS DALAVFMFRVLQRVNHPGNLDKASPNAGYVLLMFYHLY+GKSRREFEG
Sbjct: 873 DSEGTDSNPFSLDALAVFMFRVLQRVNHPGNLDKASPNAGYVLLMFYHLYDGKSRREFEG 932
Query: 926 ELIDRFGFLIKIPLLKSDRSPLPDNLKTVLEEGLSLYKLHTRRHGRADSTKGSFAKEWAK 985
ELIDRFG L+K+PLLK DR+PLPD+LK++LEEG+SLYKLHT RHGR DSTKGS+AKEWAK
Sbjct: 933 ELIDRFGSLVKMPLLKPDRNPLPDDLKSILEEGISLYKLHTSRHGRVDSTKGSYAKEWAK 992
Query: 986 WEKQLRETLFGNTEYLNSIQVTFEFAVQDVLEQLKKILKGDYKNLTTDRRKSATIVFAAV 1045
WEKQLRETLF NTEYLN+IQV FE AVQDVLEQLKKI +GDYK+ T+RRKS IVFAAV
Sbjct: 993 WEKQLRETLFSNTEYLNAIQVPFESAVQDVLEQLKKISEGDYKSPITERRKSGAIVFAAV 1052
Query: 1046 SLPILEIQNLLDTLGKINPHVGGFFKKNLKDFTLREVHVTLAHKRGHGVKAVADYGIFEN 1105
SLP+ EIQN+L TLGK N + F K++ KD+ L+ HVTLAHKR HGVK VADYGIFEN
Sbjct: 1053 SLPVQEIQNVLGTLGKKNSRIEAFLKEHYKDYKLKGAHVTLAHKRSHGVKGVADYGIFEN 1112
Query: 1106 KEVPVELTGLLFSNKMAAFEARLGSIEDERVISKNEWPHATIWTTEGVAAKEANTLPLLV 1165
KEVPVELT LLFS+KMA FEARLGSIE+ERVISKNEWPH T+WT EGVAAKEAN LP LV
Sbjct: 1113 KEVPVELTALLFSDKMAGFEARLGSIENERVISKNEWPHVTLWTREGVAAKEANALPQLV 1172
Query: 1166 PEGKATLVEINPPIVISGKVQFF 1179
EGKATLVEINPPI+ISG V+FF
Sbjct: 1173 SEGKATLVEINPPIIISGIVKFF 1195
BLAST of Sed0015984 vs. ExPASy TrEMBL
Match:
A0A6J1I3R5 (tRNA ligase 1 OS=Cucurbita maxima OX=3661 GN=LOC111469400 PE=4 SV=1)
HSP 1 Score: 1974.1 bits (5113), Expect = 0.0e+00
Identity = 1001/1201 (83.35%), Positives = 1067/1201 (88.84%), Query Frame = 0
Query: 1 MSVSHKRVLCAITLP---LSSSLTFSSRTRF-YIPHSL--------LPFKASSPFPLSCH 60
MS H R+ CAITLP LS S F+ R F +IP+S L S FP +
Sbjct: 1 MSAPH-RIFCAITLPRHRLSYSSAFNYRVFFPFIPYSFSHRILSPSLTITDSISFPSTVS 60
Query: 61 SP--FIMPYNQPRGGCKDQKWKAKTKADNTSMD---AAEVVTHALNKLSVTESGQPHVPI 120
S F+MPYNQ RGG ++QKWK K K + S + A++VVT+AL+ L VTES QPH+PI
Sbjct: 61 SDFRFMMPYNQRRGGRREQKWKEKAKVEGISTESEAASQVVTNALSNLRVTESNQPHIPI 120
Query: 121 SSTQFGNVQLTNQVPPGDGHRTIWKPKAYGTTSGAAVVEAENASAGRTSIQNKENSAGLA 180
+S QFGN Q TN PG GHR IWKPKAYGTT GAAVVE E ASA TSI+NK ++A +A
Sbjct: 121 TSVQFGNAQPTNLATPGLGHRAIWKPKAYGTTIGAAVVEGEKASAVGTSIENKGSNAEIA 180
Query: 181 AQ------NDIFKGNQIEKFTVDNYTYTHAQIRATFYPKFENEKSDQEIRSRMIEMVSKG 240
A N + KGNQIEKFTVDN YT AQIRATFYPKFENEKSDQEIR+RMIEMVSKG
Sbjct: 181 ANSSAIALNQLLKGNQIEKFTVDNSAYTQAQIRATFYPKFENEKSDQEIRTRMIEMVSKG 240
Query: 241 LATLEVSLKHSGSLFMYAGHEGGAYAKNSFGNIYTAVGVFVLGRMFREAWGSGAAKKQVE 300
LATLEVSLKHSGSLFMYAGH+GGAYAKNSFGNIYTAVGVFVLGRMF+EAWGS A KKQ E
Sbjct: 241 LATLEVSLKHSGSLFMYAGHQGGAYAKNSFGNIYTAVGVFVLGRMFQEAWGSVAPKKQAE 300
Query: 301 FNDFLESNRMCISMELVTAVLGDHGQRPREDYVVVTAVTELGNGKPKFYSTAEIIAFCRK 360
FNDFLESNRMCISMELVTAVLGDHGQRP+EDYVVVTAVTELGNGKPKFYST+EIIAFCRK
Sbjct: 301 FNDFLESNRMCISMELVTAVLGDHGQRPQEDYVVVTAVTELGNGKPKFYSTSEIIAFCRK 360
Query: 361 WRLPTNHVWLFSSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGE 420
WRLPTNHVWLFSSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGE
Sbjct: 361 WRLPTNHVWLFSSRKSVTSFFASFDALCEEGTATTVCKALDEVAEISVPGSKDHIKVQGE 420
Query: 421 ILEGLVARMVSHESSKHMEKVLEEFPALPDNEGGEFDLGPSLREICAANRSDEKQQIKAL 480
ILEGLVARMVSHESSKHMEKVLEEFPALP NEGG DL PSLREICAANRSDEKQQIKAL
Sbjct: 421 ILEGLVARMVSHESSKHMEKVLEEFPALPYNEGGGLDLEPSLREICAANRSDEKQQIKAL 480
Query: 481 LQNVGSAFCPVHSDWYGDSHSRNADRSVVSKFLQANPADFSTSKLQEMVRLMRENRFPAA 540
LQNVGSAFCP HSDWYGDSHSRNADRSVVSKFLQA PADFST KLQEMVRLMRE R PAA
Sbjct: 481 LQNVGSAFCPDHSDWYGDSHSRNADRSVVSKFLQAKPADFSTFKLQEMVRLMRERRLPAA 540
Query: 541 FKCYYNFHKVGSISNDNLFYKMVVHVQSDSAFRRYQKEMRNKPGLWPLYRGFFMDINLFK 600
FKCY+NFHKVGSISNDNLFYKMV+HVQSDSAFRRYQKEMR+KPGLWPLYRGFF+DINLFK
Sbjct: 541 FKCYHNFHKVGSISNDNLFYKMVIHVQSDSAFRRYQKEMRHKPGLWPLYRGFFVDINLFK 600
Query: 601 ANKDKAAEIVKGKNNLMEIEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSIV 660
NK+KAAEIVK KNNLME EGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSI+
Sbjct: 601 ENKEKAAEIVKSKNNLMETEGNGTLGRDGFADEDANLMIKLKFLTYKLRTFLIRNGLSIL 660
Query: 661 FKEGPAAYKAYYLRQMKLWGTSDGKQRELSKMLDEWAVFLRRKYGNKQLSSTTYLSEAEP 720
FKEGPAAYKAYYLRQMKLWGTS GKQRELSKMLDEWAV+LRRKYGNKQLSS+ YLSEAEP
Sbjct: 661 FKEGPAAYKAYYLRQMKLWGTSFGKQRELSKMLDEWAVYLRRKYGNKQLSSSIYLSEAEP 720
Query: 721 FLEQYAKRSPQNQALIGSAGNLVRAEDFLAIVEEGMDEEGDLQKEQEATPSSSLHSGKDV 780
FLEQYAKRSPQNQ LIGSAGNLVRAEDFLA+V+EGMDEEGDLQKE A PSS + S KDV
Sbjct: 721 FLEQYAKRSPQNQTLIGSAGNLVRAEDFLAVVDEGMDEEGDLQKEDTA-PSSPMLSRKDV 780
Query: 781 VPKADGLIVFFPGIPGCAKSAICREILNAPGGLGDDRPLNSLMGDLIKGRYWQKVADELR 840
VPKA+GLIVFFPGIPGCAKS++CREILNAPG LGDDRP+N+L GDLIKGRYWQKVADE R
Sbjct: 781 VPKAEGLIVFFPGIPGCAKSSLCREILNAPGALGDDRPVNTLTGDLIKGRYWQKVADERR 840
Query: 841 RKPYSIMLADKNAPNEEVWRQIEDMCHSTRASAVPVVPDSEGTDSNPFSFDALAVFMFRV 900
RKPYSIMLADKNAPNEEVWRQIEDMCHST ASAVPV+PDSEGTDSNPFS DALAVFMFRV
Sbjct: 841 RKPYSIMLADKNAPNEEVWRQIEDMCHSTGASAVPVIPDSEGTDSNPFSLDALAVFMFRV 900
Query: 901 LQRVNHPGNLDKASPNAGYVLLMFYHLYEGKSRREFEGELIDRFGFLIKIPLLKSDRSPL 960
LQRVNHPGNLDKASPNAGYVLLMFYH YEGKSRREFEGELIDRFG L+KIPLLKSDRSPL
Sbjct: 901 LQRVNHPGNLDKASPNAGYVLLMFYHFYEGKSRREFEGELIDRFGSLVKIPLLKSDRSPL 960
Query: 961 PDNLKTVLEEGLSLYKLHTRRHGRADSTKGSFAKEWAKWEKQLRETLFGNTEYLNSIQVT 1020
PDNLKT+LEEGLSLYKLHT RHG DSTKGS+AKEWA+WEKQLRETLFGN EYLN+IQV
Sbjct: 961 PDNLKTILEEGLSLYKLHTSRHGWTDSTKGSYAKEWAEWEKQLRETLFGNAEYLNAIQVP 1020
Query: 1021 FEFAVQDVLEQLKKILKGDYKNLTTDRRKSATIVFAAVSLPILEIQNLLDTLGKINPHVG 1080
FEF+VQ+VLEQLKKI KGDYK+ T+ RKSATIV+AAVSLP+ EIQN LDTLG NP V
Sbjct: 1021 FEFSVQNVLEQLKKISKGDYKSPITE-RKSATIVYAAVSLPVQEIQNALDTLGNKNPQVE 1080
Query: 1081 GFFKKNLKDFTLREVHVTLAHKRGHGVKAVADYGIFENKEVPVELTGLLFSNKMAAFEAR 1140
F K+ KD+TL+ HVTLAHKR HG+KAVADYGIFENKEVPVELT LLFS+KMAAFEAR
Sbjct: 1081 AFIKEGYKDYTLKSAHVTLAHKRSHGIKAVADYGIFENKEVPVELTALLFSDKMAAFEAR 1140
Query: 1141 LGSIEDERVISKNEWPHATIWTTEGVAAKEANTLPLLVPEGKATLVEINPPIVISGKVQF 1179
+GSIEDERVISKNEWPH T+WT EG+AAKEAN+LP LV EGKATL+E+NPPI+ISGKVQF
Sbjct: 1141 VGSIEDERVISKNEWPHVTLWTREGIAAKEANSLPQLVSEGKATLLELNPPIIISGKVQF 1198
BLAST of Sed0015984 vs. TAIR 10
Match:
AT1G07910.1 (RNAligase)
HSP 1 Score: 1510.7 bits (3910), Expect = 0.0e+00
Identity = 755/1115 (67.71%), Positives = 900/1115 (80.72%), Query Frame = 0
Query: 74 DNTSMDAAEVVTHALNKLSVTES--GQPHVPISSTQFGNVQLTNQVPPGDGHRTIWKPKA 133
D+++ AE V + LS+ ES P +P +T VQ +WKPK+
Sbjct: 9 DSSATVVAEAVNNQFGGLSLKESNTNAPVLPSQTTSNHRVQ-----------NLVWKPKS 68
Query: 134 YGTTSGAAVVEAENASAGRTSIQNKENSAGLAA----QNDIFKGNQIEKFTVDNYTYTHA 193
YGT SG+ + G+TS ++ S+G + IF GN +EKF+VD TY HA
Sbjct: 69 YGTVSGS----SSATEVGKTSAVSQIGSSGDTKVGLNLSKIFGGNLLEKFSVDKSTYCHA 128
Query: 194 QIRATFYPKFENEKSDQEIRSRMIEMVSKGLATLEVSLKHSGSLFMYAGHEGGAYAKNSF 253
QIRATFYPKFENEK+DQEIR+RMIEMVSKGLATLEVSLKHSGSLFMYAGH+GGAYAKNSF
Sbjct: 129 QIRATFYPKFENEKTDQEIRTRMIEMVSKGLATLEVSLKHSGSLFMYAGHKGGAYAKNSF 188
Query: 254 GNIYTAVGVFVLGRMFREAWGSGAAKKQVEFNDFLESNRMCISMELVTAVLGDHGQRPRE 313
GNIYTAVGVFVL RMFREAWG+ A KK+ EFNDFLE NRMCISMELVTAVLGDHGQRP +
Sbjct: 189 GNIYTAVGVFVLSRMFREAWGTKAPKKEAEFNDFLEKNRMCISMELVTAVLGDHGQRPLD 248
Query: 314 DYVVVTAVTELGNGKPKFYSTAEIIAFCRKWRLPTNHVWLFSSRKSVTSFFASFDALCEE 373
DYVVVTAVTELGNGKP+FYST+EII+FCRKWRLPTNHVWLFS+RKSVTSFFA+FDALCEE
Sbjct: 249 DYVVVTAVTELGNGKPQFYSTSEIISFCRKWRLPTNHVWLFSTRKSVTSFFAAFDALCEE 308
Query: 374 GTATTVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVSHESSKHMEKVLEEFPALPD 433
G AT+VC+ALDEVA+ISVP SKDH+KVQGEILEGLVAR+VS +SS+ ME VL + P P
Sbjct: 309 GIATSVCRALDEVADISVPASKDHVKVQGEILEGLVARIVSSQSSRDMENVLRDHPP-PP 368
Query: 434 NEGGEFDLGPSLREICAANRSDEKQQIKALLQNVGSAFCPVHSDWYGD-SHSRNADRSVV 493
+G DLG SLREICAA+RS+EKQQ++ALL++VG +FCP +W+GD SH ++AD+SV+
Sbjct: 369 CDGANLDLGLSLREICAAHRSNEKQQMRALLRSVGPSFCPSDVEWFGDESHPKSADKSVI 428
Query: 494 SKFLQANPADFSTSKLQEMVRLMRENRFPAAFKCYYNFHKVGSISNDNLFYKMVVHVQSD 553
+KFLQ+ PAD+STSKLQEMVRLM+E R PAAFKCY+NFH+ IS DNLFYK+VVHV SD
Sbjct: 429 TKFLQSQPADYSTSKLQEMVRLMKEKRLPAAFKCYHNFHRAEDISPDNLFYKLVVHVHSD 488
Query: 554 SAFRRYQKEMRNKPGLWPLYRGFFMDINLFKANKDKAAEIVKGKNNLMEIEGNGTLGRDG 613
S FRRY KEMR+ P LWPLYRGFF+DINLFK+NK + +K +N E +G G +DG
Sbjct: 489 SGFRRYHKEMRHMPSLWPLYRGFFVDINLFKSNKGRDLMALKSIDNASENDGRGE--KDG 548
Query: 614 FADEDANLMIKLKFLTYKLRTFLIRNGLSIVFKEGPAAYKAYYLRQMKLWGTSDGKQREL 673
AD+DANLMIK+KFLTYKLRTFLIRNGLSI+FK+G AAYK YYLRQMK+WGTSDGKQ+EL
Sbjct: 549 LADDDANLMIKMKFLTYKLRTFLIRNGLSILFKDGAAAYKTYYLRQMKIWGTSDGKQKEL 608
Query: 674 SKMLDEWAVFLRRKYGNKQLSSTTYLSEAEPFLEQYAKRSPQNQALIGSAGNLVRAEDFL 733
KMLDEWA ++RRK GN QLSS+TYLSEAEPFLEQYAKRSP+N LIGSAGNLVR EDFL
Sbjct: 609 CKMLDEWAAYIRRKCGNDQLSSSTYLSEAEPFLEQYAKRSPKNHILIGSAGNLVRTEDFL 668
Query: 734 AIVEEGMDEEGDLQKEQEATPSSSLHSGKDVVPKADGLIVFFPGIPGCAKSAICREILNA 793
AIV+ +DEEGDL K+Q TP++ + K+ V K +GLIVFFPGIPG AKSA+C+E+LNA
Sbjct: 669 AIVDGDLDEEGDLVKKQGVTPATPEPAVKEAVQKDEGLIVFFPGIPGSAKSALCKELLNA 728
Query: 794 PGGLGDDRPLNSLMGDLIKGRYWQKVADELRRKPYSIMLADKNAPNEEVWRQIEDMCHST 853
PGG GDDRP+++LMGDL+KG+YW KVADE R+KP SIMLADKNAPNE+VWRQIEDMC T
Sbjct: 729 PGGFGDDRPVHTLMGDLVKGKYWPKVADERRKKPQSIMLADKNAPNEDVWRQIEDMCRRT 788
Query: 854 RASAVPVVPDSEGTDSNPFSFDALAVFMFRVLQRVNHPGNLDKASPNAGYVLLMFYHLYE 913
RASAVP+V DSEGTD+NP+S DALAVFMFRVLQRVNHPG LDK S NAGYVLLMFYHLYE
Sbjct: 789 RASAVPIVADSEGTDTNPYSLDALAVFMFRVLQRVNHPGKLDKESSNAGYVLLMFYHLYE 848
Query: 914 GKSRREFEGELIDRFGFLIKIPLLKSDRSPLPDNLKTVLEEGLSLYKLHTRRHGRADSTK 973
GK+R EFE ELI+RFG LIK+PLLKSDR+PLPD +K+VLEEG+ L+ LH+RRHGR +STK
Sbjct: 849 GKNRNEFESELIERFGSLIKMPLLKSDRTPLPDPVKSVLEEGIDLFNLHSRRHGRLESTK 908
Query: 974 GSFAKEWAKWEKQLRETLFGNTEYLNSIQVTFEFAVQDVLEQLKKILKGDYKNLTTDRRK 1033
G++A EW KWEKQLR+TL N+EYL+SIQV FE V V E+LK I KGDYK ++++RK
Sbjct: 909 GTYAAEWTKWEKQLRDTLVANSEYLSSIQVPFESMVHQVREELKTIAKGDYKPPSSEKRK 968
Query: 1034 SATIVFAAVSLPILEIQNLLDTLGKINPHVGGFF---KKNLKDFTLREVHVTLAHKRGHG 1093
+IVFAA++LP ++ +LL+ L NP + F KK++++ L HVTLAHKR HG
Sbjct: 969 HGSIVFAAINLPATQVHSLLEKLAAANPTMRSFLEGKKKSIQE-KLERSHVTLAHKRSHG 1028
Query: 1094 VKAVADYGIFENKEVPVELTGLLFSNKMAAFEARLGSIEDERVISKNEWPHATIWTTEGV 1153
V VA Y N+EVPVELT L++++KMAA A +GS++ E V+SKNEWPH T+WT EGV
Sbjct: 1029 VATVASYSQHLNREVPVELTELIYNDKMAALTAHVGSVDGETVVSKNEWPHVTLWTAEGV 1088
Query: 1154 AAKEANTLPLLVPEGKATLVEINPPIVISGKVQFF 1179
AKEANTLP L EGKA+ + I+PP+ ISG ++FF
Sbjct: 1089 TAKEANTLPQLYLEGKASRLVIDPPVSISGPLEFF 1104
BLAST of Sed0015984 vs. TAIR 10
Match:
AT1G07910.2 (RNAligase)
HSP 1 Score: 1510.7 bits (3910), Expect = 0.0e+00
Identity = 755/1115 (67.71%), Positives = 900/1115 (80.72%), Query Frame = 0
Query: 74 DNTSMDAAEVVTHALNKLSVTES--GQPHVPISSTQFGNVQLTNQVPPGDGHRTIWKPKA 133
D+++ AE V + LS+ ES P +P +T VQ +WKPK+
Sbjct: 9 DSSATVVAEAVNNQFGGLSLKESNTNAPVLPSQTTSNHRVQ-----------NLVWKPKS 68
Query: 134 YGTTSGAAVVEAENASAGRTSIQNKENSAGLAA----QNDIFKGNQIEKFTVDNYTYTHA 193
YGT SG+ + G+TS ++ S+G + IF GN +EKF+VD TY HA
Sbjct: 69 YGTVSGS----SSATEVGKTSAVSQIGSSGDTKVGLNLSKIFGGNLLEKFSVDKSTYCHA 128
Query: 194 QIRATFYPKFENEKSDQEIRSRMIEMVSKGLATLEVSLKHSGSLFMYAGHEGGAYAKNSF 253
QIRATFYPKFENEK+DQEIR+RMIEMVSKGLATLEVSLKHSGSLFMYAGH+GGAYAKNSF
Sbjct: 129 QIRATFYPKFENEKTDQEIRTRMIEMVSKGLATLEVSLKHSGSLFMYAGHKGGAYAKNSF 188
Query: 254 GNIYTAVGVFVLGRMFREAWGSGAAKKQVEFNDFLESNRMCISMELVTAVLGDHGQRPRE 313
GNIYTAVGVFVL RMFREAWG+ A KK+ EFNDFLE NRMCISMELVTAVLGDHGQRP +
Sbjct: 189 GNIYTAVGVFVLSRMFREAWGTKAPKKEAEFNDFLEKNRMCISMELVTAVLGDHGQRPLD 248
Query: 314 DYVVVTAVTELGNGKPKFYSTAEIIAFCRKWRLPTNHVWLFSSRKSVTSFFASFDALCEE 373
DYVVVTAVTELGNGKP+FYST+EII+FCRKWRLPTNHVWLFS+RKSVTSFFA+FDALCEE
Sbjct: 249 DYVVVTAVTELGNGKPQFYSTSEIISFCRKWRLPTNHVWLFSTRKSVTSFFAAFDALCEE 308
Query: 374 GTATTVCKALDEVAEISVPGSKDHIKVQGEILEGLVARMVSHESSKHMEKVLEEFPALPD 433
G AT+VC+ALDEVA+ISVP SKDH+KVQGEILEGLVAR+VS +SS+ ME VL + P P
Sbjct: 309 GIATSVCRALDEVADISVPASKDHVKVQGEILEGLVARIVSSQSSRDMENVLRDHPP-PP 368
Query: 434 NEGGEFDLGPSLREICAANRSDEKQQIKALLQNVGSAFCPVHSDWYGD-SHSRNADRSVV 493
+G DLG SLREICAA+RS+EKQQ++ALL++VG +FCP +W+GD SH ++AD+SV+
Sbjct: 369 CDGANLDLGLSLREICAAHRSNEKQQMRALLRSVGPSFCPSDVEWFGDESHPKSADKSVI 428
Query: 494 SKFLQANPADFSTSKLQEMVRLMRENRFPAAFKCYYNFHKVGSISNDNLFYKMVVHVQSD 553
+KFLQ+ PAD+STSKLQEMVRLM+E R PAAFKCY+NFH+ IS DNLFYK+VVHV SD
Sbjct: 429 TKFLQSQPADYSTSKLQEMVRLMKEKRLPAAFKCYHNFHRAEDISPDNLFYKLVVHVHSD 488
Query: 554 SAFRRYQKEMRNKPGLWPLYRGFFMDINLFKANKDKAAEIVKGKNNLMEIEGNGTLGRDG 613
S FRRY KEMR+ P LWPLYRGFF+DINLFK+NK + +K +N E +G G +DG
Sbjct: 489 SGFRRYHKEMRHMPSLWPLYRGFFVDINLFKSNKGRDLMALKSIDNASENDGRGE--KDG 548
Query: 614 FADEDANLMIKLKFLTYKLRTFLIRNGLSIVFKEGPAAYKAYYLRQMKLWGTSDGKQREL 673
AD+DANLMIK+KFLTYKLRTFLIRNGLSI+FK+G AAYK YYLRQMK+WGTSDGKQ+EL
Sbjct: 549 LADDDANLMIKMKFLTYKLRTFLIRNGLSILFKDGAAAYKTYYLRQMKIWGTSDGKQKEL 608
Query: 674 SKMLDEWAVFLRRKYGNKQLSSTTYLSEAEPFLEQYAKRSPQNQALIGSAGNLVRAEDFL 733
KMLDEWA ++RRK GN QLSS+TYLSEAEPFLEQYAKRSP+N LIGSAGNLVR EDFL
Sbjct: 609 CKMLDEWAAYIRRKCGNDQLSSSTYLSEAEPFLEQYAKRSPKNHILIGSAGNLVRTEDFL 668
Query: 734 AIVEEGMDEEGDLQKEQEATPSSSLHSGKDVVPKADGLIVFFPGIPGCAKSAICREILNA 793
AIV+ +DEEGDL K+Q TP++ + K+ V K +GLIVFFPGIPG AKSA+C+E+LNA
Sbjct: 669 AIVDGDLDEEGDLVKKQGVTPATPEPAVKEAVQKDEGLIVFFPGIPGSAKSALCKELLNA 728
Query: 794 PGGLGDDRPLNSLMGDLIKGRYWQKVADELRRKPYSIMLADKNAPNEEVWRQIEDMCHST 853
PGG GDDRP+++LMGDL+KG+YW KVADE R+KP SIMLADKNAPNE+VWRQIEDMC T
Sbjct: 729 PGGFGDDRPVHTLMGDLVKGKYWPKVADERRKKPQSIMLADKNAPNEDVWRQIEDMCRRT 788
Query: 854 RASAVPVVPDSEGTDSNPFSFDALAVFMFRVLQRVNHPGNLDKASPNAGYVLLMFYHLYE 913
RASAVP+V DSEGTD+NP+S DALAVFMFRVLQRVNHPG LDK S NAGYVLLMFYHLYE
Sbjct: 789 RASAVPIVADSEGTDTNPYSLDALAVFMFRVLQRVNHPGKLDKESSNAGYVLLMFYHLYE 848
Query: 914 GKSRREFEGELIDRFGFLIKIPLLKSDRSPLPDNLKTVLEEGLSLYKLHTRRHGRADSTK 973
GK+R EFE ELI+RFG LIK+PLLKSDR+PLPD +K+VLEEG+ L+ LH+RRHGR +STK
Sbjct: 849 GKNRNEFESELIERFGSLIKMPLLKSDRTPLPDPVKSVLEEGIDLFNLHSRRHGRLESTK 908
Query: 974 GSFAKEWAKWEKQLRETLFGNTEYLNSIQVTFEFAVQDVLEQLKKILKGDYKNLTTDRRK 1033
G++A EW KWEKQLR+TL N+EYL+SIQV FE V V E+LK I KGDYK ++++RK
Sbjct: 909 GTYAAEWTKWEKQLRDTLVANSEYLSSIQVPFESMVHQVREELKTIAKGDYKPPSSEKRK 968
Query: 1034 SATIVFAAVSLPILEIQNLLDTLGKINPHVGGFF---KKNLKDFTLREVHVTLAHKRGHG 1093
+IVFAA++LP ++ +LL+ L NP + F KK++++ L HVTLAHKR HG
Sbjct: 969 HGSIVFAAINLPATQVHSLLEKLAAANPTMRSFLEGKKKSIQE-KLERSHVTLAHKRSHG 1028
Query: 1094 VKAVADYGIFENKEVPVELTGLLFSNKMAAFEARLGSIEDERVISKNEWPHATIWTTEGV 1153
V VA Y N+EVPVELT L++++KMAA A +GS++ E V+SKNEWPH T+WT EGV
Sbjct: 1029 VATVASYSQHLNREVPVELTELIYNDKMAALTAHVGSVDGETVVSKNEWPHVTLWTAEGV 1088
Query: 1154 AAKEANTLPLLVPEGKATLVEINPPIVISGKVQFF 1179
AKEANTLP L EGKA+ + I+PP+ ISG ++FF
Sbjct: 1089 TAKEANTLPQLYLEGKASRLVIDPPVSISGPLEFF 1104
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q0WL81 | 0.0e+00 | 67.71 | tRNA ligase 1 OS=Arabidopsis thaliana OX=3702 GN=RNL PE=1 SV=1
Match Name | E-value | Identity | Description
A0A6J1DUP6 | 0.0e+00 | 84.81 | tRNA ligase 1 OS=Momordica charantia OX=3673 GN=LOC111024537 PE=4 SV=1
A0A6J1HM92 | 0.0e+00 | 84.79 | tRNA ligase 1 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111464863 PE=4 SV=1
A0A6J1HM43 | 0.0e+00 | 86.96 | tRNA ligase 1 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111464863 PE=4 SV=1
A0A1S3CK49 | 0.0e+00 | 84.52 | uncharacterized protein LOC103501711 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1I3R5 | 0.0e+00 | 83.35 | tRNA ligase 1 OS=Cucurbita maxima OX=3661 GN=LOC111469400 PE=4 SV=1
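Rows in the hit tables above can be read programmatically by splitting on the pipe character and pulling the organism out of the UniProt-style `OS=... OX=...` description. This is a minimal sketch that assumes the four-column layout shown here (match name, E-value, identity, description); `parse_hit` and its dictionary keys are hypothetical names chosen for illustration, and only two rows are copied in as sample input.

```python
import re

# Two rows copied from the tables above as sample input.
ROWS = [
    "Q0WL81 | 0.0e+00 | 67.71 | tRNA ligase 1 OS=Arabidopsis thaliana OX=3702 GN=RNL PE=1 SV=1",
    "A0A6J1HM43 | 0.0e+00 | 86.96 | tRNA ligase 1 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111464863 PE=4 SV=1",
]


def parse_hit(row):
    """Split one pipe-delimited row into typed fields.

    Any columns after the fourth (for example a trailing link column)
    are ignored.
    """
    fields = [f.strip() for f in row.split("|")]
    accession, evalue, identity, description = fields[:4]
    # Organism name sits between 'OS=' and ' OX=' in the description.
    organism = re.search(r"OS=(.+?) OX=", description)
    return {
        "accession": accession,
        "evalue": float(evalue),
        "identity": float(identity),
        "organism": organism.group(1) if organism else None,
    }


hits = [parse_hit(r) for r in ROWS]
# With E-values tied at 0.0, rank the hits by percent identity instead.
best = max(hits, key=lambda h: h["identity"])
print(best["accession"], best["organism"], best["identity"])
```

Because every hit listed here has an E-value of 0.0e+00, percent identity is the only column in the table that discriminates between them.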