Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTCCTCTTCCTGTTCCATTTTTGTTTCTCTCTTGTAATGCCAGGCCAAACCCTAACCCTAATTTTATTTCCATTTCTTCCAAACCTTGAACTCCAACCCCCTAAATCCTGATTTTTCATCTTCTATTTCTAATTGGGATTGATATCATTGTTATTGATCTTCGATTGCCATGGATTCGCAGCAGAACAATTTGTTTGAAACGGCATCTCAACCTGATACGGGGAACGACGCCTACACATTTCTTGAGTTCAACACACAGGGAGAAGACTTCGATTACCCTGAATTTCGCGACCCTATTAGGTCCCCTGTAGCGTGGCCGACTCCTTCCGATTCGTTAGCGGATCACACGGACCGTGGTGGTGGGTCGGATCATCAGTCTGATGCATCTCCGGTTTCGGCTGCACCGGGAAGTGCCACGAAGGGCCGGACTGGAGGAGGTTCAGGGAATAGCGGCGGTAATAATCAAATGGTTGACGTATTGGCTGCCGGGATGAGTGGGTTGACGTTTGAGGATACGGGGGATGATGATAATTATGAGTTTGGGAAGGGAAATTTCACGGAGCATGCTTGTAGGTATTGTGGGATTTCGAATCCGGCTTGCGTTGTGAGGTGTAACGTGCCGTCATGCCGTAAGTGGTTCTGCAATTCGCGAGGGAACACGTCTGGGTCCCATATTGTGAATCACCTGGTACGCTACTCTGTCTTTGTTATTTGTTTCCGCGTGTTCTCCCCGAGATTTTTCAGTTTGTGGAAATTTTAAAATTTTATTTTTCCTTCGTATGGCGGAATTCGATGATGATCAATTGCCGGTGAATGTTGAATAACATGTGACAGTGTTGCGCAATAGGCGTTTATTATGATACTCTCTTATCAGGATTTGTTTTGCCTTATATGGAAGTTGTGGACGTACTATCCCGTTTAGCCTAGGTGAAATTGACATGAGAAAGGGCTGTTATATTTCTTTGTTTGTCGGTTGATTTTTCAGTTGCACGAGTGGGGAATGCTGTTTGTACCTCGTTGTATGTTTACGTTATTATGCTGTTCTGTATTTTTCTCATCCAAAATTCTTTGACGTGTAATGTCTTTTTGGATAGAATCACAGTTTTATTGTGCAATGTCCATATTAGTTTTCTTGCTGGTCAGGTTTTAATGGGACTTTAAAAGTTGATTAAATGTAGCCTTATCCTTATACAACTCTAGTAGTTAAATATTGGAAGGAACATGTAACGACCCGATTTTTGATACCTCGAATTGCAGGTCGTCACGTACACTGAAGGGTGAAAAGGGTGGTAAACTTTTCTTTTGTAAAACAGGAGACTCAAGAAATTTAAAACTTAAATATGCATTGAAAATCGACATAAAAGTAGTTTAAATGTAATATCTGTAGAGTCAAATCAAAACGGTTTACAAAAATACACAAGTTTCCAGAATGACTTTAAAATAATAGTAACATGACAAGAAGGACGAAGACTCGATCTAAAAGGCACCCCTGTAGCTGCATGTCTCCGCATGCTCCTTTAAACAGAAACTTAGGCTATGTACTCATAACTGTTTTTAACGATCGCCCAAACTATGGCTAAATGGCTGTACAAAACTTGGTAGGTCACTAAACAACTACCTTCAACATACCTCCTTTACGGAGTACTCAATCCTTCTCAGGTTCACAACCTAGGGGTTTTCTGTTATCTTGGCTCTAGGTTTCACGTATGTCTAGGTCCTTCATCAGTCTTGCGTGAGGAGCGCCTCAAGTCACTATGACCCTTGAAAAAAGGCCTCAAGGTAATACTCGTCGACATGGAGGCCTTAAGGTAATCATATCTACTTACTGAGCTCTAATTAATGGATTCTCCTAATGGTGTCGAGCAACATCTACCTGTTCCTAAGATCAGTAAACTATTTGTCACTACGCCCATTCCAGGCTAGAATGGTAACTACCTGCTCCTGTACTCTCAGCGGGTTAGAATAGTAAAACTACTTGTCACTATGCCCTTTTCAGACTGGGAATGTAAACTACTTGCTCATGTACCCTCATTGGGTTAGAATAGTAAAACTACCTTTTTAGGAATAGGAACTACATGTTACTTTGCCTTTTTCAGGCTAGAACAGTAAAACTACCTGTCACTACACCACTTGGGAAAAGGTCATGGCTCTCCCCATTAGGCACCACCAACATAAACGGTAGTGGTATACTCTCTATGGCTACTAATATCTTCTATGGGTGACAACTAAGTCATGTGTAACAAGTAGACCCTAATCTCTGTTGGTTACTTCACGAATAGGGGTTGTGCCCTAGCTGTCCCTACAAGGTATCTGGTCAACCCAAGGAGCCTATGGAACTCATGACTTGGTGCTACGACCAAATTACGTGAAAGAGTACCTTATAGTCTATGGAAACCATAATTAACACGTTATGACACATAATATTTAGTCCAACTCATAACTAGACAATAACGGGCTATCAAAACCCTAAATGAATGCATAACAATGACTAAAGACATACAACATGCTTATCATGCTTATTAAGTGATAATAATCCAACCTCAGCACACTAAGAAGCATACAAGTCCAATGTAAATTTTTCACATCATCTAAGCGCAATTCTCATGAGACTGAGTTACCTACTTGATTATAGACTTCTCAAGCTTGTTTTGGGTCTTTGAGGATATTCCAAATTTCTCCGAAATTCCTTAATATGCCCTACATTTAGTCAAAAGAATAAAATAGGTGAAAATGGCTAGCTTCAAATTTGTTAAAGAAAAAGTTGAGAAACTAACCATGGACTATACGGTCATCTTCTCCAACTTGCGTGCTTGCCATTCTTCTTTCTTTTCCTTTTTTTTTTTGAAAAAATATCTTCTATTTATTTTCTTAAAATTCTAGACAGACTAGAATTGGTTTTTTATGCATGTTGAAATAAAGCATATTTACTTTTGATGGGCTATTACCATATTAATTTTCTTACTAGAGAACTGTGATTTGTACATAATGAGATCAAATTGTTTCTTATATATATATATAAAAATATCTTATTGGAGAACTTACATTAGTTATTTTATTGAATGATATGACGATTTTAGGTCCGGGCAAAACATAAAGAAGTTTGTCTTCACAAAGACAGTCCTCTGGGAGAAACAATTCTCGAGTGTTACAATTGTGGGTGCCGAAATGTATTCCTCCTTGGATTTATTTCTGCAAAGACAGAAAGTGTGGTTGTTCTACTCTGTAGGGAACCTTGCTTGAGTGTTAATGCTCTTAAGGATATGAATTGGGACTTGAGTCAGTGGTGCCCTCTAATTGATGATAGGTGCTTCTTGCAGTGGCTGGTTAAGGTTTGTGTATCTTTCTGTAGGATCAACATCTCTGATATGTGGTGATATGTAATGCATTGTCACTGTCCTATGAACATTTTGCTTTGTAGATTCCTTCTGAGCAAGAGCAGTTGAGAGCACGGCAAATTAGTGCCCAACAAATAAATAAAATTGAGGAACTTTGGAAGACAAACCCAGATGCATCTCTTGAAGATCTTGAAAAACCTGGTGTGGATGATGAACCACAGCCCGTAGCATTAAAATATGAAGATGCTTATCAGGTGTGCTTGTTGTATCTAACCTTTATCTAGTAGCAGTCCTAGTGAAGTTGCATTTTGCTTTGATGCATGTATATTGACAAGGTGGCATTTTGTCTTTCCTATTACAGTATCAAAATGTATTTGCACCTCTTATCAAGCTTGAAGCCGACTACGATAAAGTATTGTACCAAGACTCCTTATCTCATTTACCTTTCTCTTTTTCTTCTTTTTTAAAGGAAAATACTTCAAGTAGGTCTGGCAATATCTAACTTCATGACCCCATAATGGTAGCTTTTCCTGTTTTTTCCCCATAAATGAATGATGTAATATTAATTTTTTTTAAAGACCTTTATAATACGATTCTCCATTTTTGTATTAGATGATGAAAGAATCTCAAAGCAAGGATAATGTGACTGTACGTTGGGACATTGGTCTTAACAAGAAGAGAATAGCTTATTTTGTCTTTCCAAAGGTACTGGTGGTATTTGTTATTTGTTAAATGGATGCTTTATTTTGCTTTTTTTTTTCCCCTTGATTTTTAAGCTGCTTTATTTGTTGGTATAGGAGGACAATGAGTTGCGCCTTGTACCTGGAGATGAGTTGAGGTTACGTTATTCTGGTGATGCAGCTCATCCAGCTTGGCATTCTGTTGGACATGTGGTATGTCTCTTTATTTATTTTTAATTTTTATGAGAAAATGAAAGAATTAATAATGGTGTACAAAAATTCAGCCCAACGAAAAGATTCCCTCAAGTAGGTGTGGGTTTGAAATCTTTGATTTGGGGGGGGGGGGGGGGGGGGGTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGGGGGGGGGGGGGGGGGGTGATTGTTTTTTTTAATGCAAAATAGTGAACAGTGAGGCGCTGATTGTTTTTTTTAATGCAATTTAGTCAGGGGTAGAATATACTGTATATGGTTAATTTGCCGATAAATTCACCGCCAGTGTGCTTGATGTTTTTGCAACTGAAGTTATTTCGGTTCTTTCTATCGGACTTTCTTTTTTCTTCTTTTTTTACATGACATGTCTGACATTTTAGTGATTATTGTACTTTAAAAACTGAATTGTCTCAATTGAGTTGGTGATTAATTGTATGACATAATCTTGTTCTGCTGTGTTAGATCAAGCTAACTGCACAGGAAGAGGTTGCACTCGAGCTACGTGCTAGCCAGGTAGCACTCAGTTTTGATGGTTATATTTATTTACTGGGTTAAGTCGTTTGTTAGTGACTATTTTATATTATGCTACATAGGGGGTTCCAGTTGATGTGGTCCATGGTTTTAGTGTTGATTTTGTGTGGAAGAGTACTAGTTTCGATCGAATGCAGGGAGCTATGAAGACATTTGCTGTTGATGAGACCAGTGTTAGTGGGTAAGAACTGTCTGCACTGTATGGATCAAGAAGTTAATTTCTATCAGGATCAATGAGCCTTCTTTATGATCCAACATTCTTGAATAACCCGTCCCCACTTTAATAAATGTGGTACAGAGATTGTATTTATAATAAGAAATGCTGCTTGATATATGTATTTCAACTAGCTATGACAATGCATGTTCTTATGTGCAGTTATATCTACCATCACTTATTAGGACACGAAGTGGAGGTTCAGATGGTTCGCAATACACTTCCACGTCGTTTTGGTGCGCCTGGTCTCCCAGAACTCAATGCATCTCAAGTATGCAACTAGTCTTAGAAATAAAATGCATCATGTTTATTTGTTTGGATTTGAAATTTCAATGTCATATTGAGTTGTTGAAGTATACGTGTATTGATTATCAAGGAATAATATAATTTGGTTGTTGTTTTAATGCTGTTGGAGTTTACCTTGGGGGTGGTCTTGTTGTCGTATGGAATGATTTGTAATGAGTTCTGGTTTTCTCTTCTTCAATATGTTAATATCCTATTGATGATGGAATGTAATCTATTTTTTGTTTATAGAAAAAAGCGATTCCTTTTGAGATCTGCACACTTCTTAAGAGCAGCGAATGAGCTTGTCAATTTGATTAGTTTTATGACTATTTTTAAATTTTGTGCATTCTGATTTTTAACTTCAAGCCTTATGCTCTTTTGTCGAACTAAGATTTTATTAAAGAGAACAATGTCTTAAGATCTCGTCCTTTTACAACATTTCATAGTTATAGTGACTATTATTGTTTCCGCTAATGAAAGTTGGATCTATTCCAGGTTTCGAGTCATGTTAATGATATAGCGGGAACTCTTACAGCCATAGTGTTTGCATGTGTTTTTATGATTATATTTGGTATTTGTTTTTATCCTTTAGGCAAGCTTAGAACAATGTCTCAAGTTCACAATTTTGGCCACATTTTTTTGATTGTAAGTTTTCTCTTTATTGGTAGGTTTTTGCTGTAAAAAGTGTTCTTCAGAAGCCAATAAGCTTGATTCAGGGCCCTCCGGGTACTGGGAAAACTGTAACTTCTGCTGCCATAGTGTATCATATGGCCAAACAAGGCCAAGGGCAGGTCTACATTTTGAATACTCTTCTAGTCGCCTTGTTTATTTTAATATTCAGTTATCAGATTATGAATGTAAGGTGTTCTCGCTCGGCAGGTACTGGTCTGTGCCCCTAGTAATGTGGCTGTGGACCAACTGGCAGAGAAGATAAGTGCAACTGGGTTAAAGGTAAAATTAGCAGTTGCTCCTCCTTGGAAACCACTTTTCCTATTATGTTTGTCAGATGTAATAGTTTATCGTCTGATATGTCTTGTGTGACCTTGTGTTGTGACTGCTGGGCTTATTTGAGGCTATTGCAATAAAAATTGAATACCACTCACAATTTCTCACCTGCTTTTTGAATCAAGGTTGTTAGGCTATGTGCAAAATCAAGAGAAGCTGTAAGTTCTCCTGTGGAACACTTAACCCTTCACTATCAGGTATTTGATCATTGTGGATTCGAGCCTCAGCTGCAATGCGACTAAGTTCTCATTTTCATGTTTTACCACAAAGTTTGATAGATTAAAACTCTTAATAGGTTCGACATCTTGACACATCGGAAAGAAGTGAACTTCATAAGTTGCAACAATTGAAAGATGAACAAGGTATGGGTTTCTTTTTATTTTAAATGAGCAATTTTTTTTGTAACTAATATTCTTATTAGGAGATTTTGCATCCTCCGATTGCTCTGAAATGCTTTTGTCCGTGAAATATCAAATAGTTGCATCTCTTTTTAAGCTTCAGTTCCTTTGCTGCTTTTCCATCTCATGAAATAGAGTGCGTAGTTTTGAATCCTTTTGTGATAAGTTGATATTCTTGGGACTGGGCTTGTAAGCGCGCAGTTTGATGTATACAACACCATGTCCTTTTGGAAAGTGCTGCTTTTCCCTCTTTTTATTCTCTTTTTAACTAATGGTCCAAACAGTGTGAAGGCTACTAAAAGTGTGCTCTTTAGTCCAATTACATTTCATTTGAGGCGGGTGATTATTATTTTTGTTGGAGTACATATCTACCAAAATTATCTTTCAGCGAGTTGTTAAAAAATACTTGGTTTTTTAATTGTTTATTCCCAAATACCCAGACCTACGAACTGGATATATAAAATAGTGACAGTTAAACATCCTAATTGGTGTCTGATGGACCCAACTGTTATTTCATAACATCGATGAAAAATCTGTTTCTTATGGAAAAAATGATGGGCCAAAATGTTTGCACTGTTCTCTATCTAACTTGTCAGTTAACGAAGGGAGTCTGAAAATTGCATGTCGTTGAAAGCCTGAAAGAGGCTTCAAAAGGAATTGTACATTAAATACCATCGTTGCCTGTCTAGTTATTGCAATGGCTATATTTTATTCCTTAGCGTATTTCTTGGTGAGTAGTTGTATTAGCTAGTGTTAGCACTATTTCTCTTGCTCCATTTAACATTTATGTGCTAGTTTGTTGTTTAATAGCAATCTCATTGTTTAATCATCACAGGGGAGCTGTCTAGCAGTGATGAGAAAAAATATAAAGCCCTTAAGAGAGCAACAGAGAGGGAAATTTCCCAGAGTGCAGATGTTATTTGTTGCACATGTGTCGGTGCTGGAGATCCTCGCCTGTCAAATTTTAGATTTCGCCAGGTTAGAGCAGTCGTCTCTGTTGTTATACTATGTGAAATTGAAAATAACTTAAAAACTTAAAAGATTTTACCTTGGTTTAGGTTCTTATCGATGAGTCTACTCAGGCAACCGAACCTGAATGTCTTATTCCTCTTGTTCTTGGAGCAAAGCAGGCATGTCGAGTTTTCAATTCACGTATGTCATCTATTTAAATTGTGCTTCCTTTCTCCTACTGACTTTTTCTTGTTTTAGGCTGTTCTTGTTGGTGACCATTGCCAACTGGGACCTGTCATTATGTGCAAAAAAGCAGCACGTGCAGGGCTGGCTCAATCCCTTTTTGAGCGTCTTGTTCTTCTTGGTGTGAAGCCAATTAGATTGCAGGTATTTGTTTTTTTTTTACTTGCATATGATTCAAGGGACGTATGAAATAATACCACTTAAATTTTATTTCTATTTTGACATCAATCGAGAAAAGTTTCCAATCTTAGAATGTTACTACTCACTCTTCTGTTTCAGTGCTGGATAATTAGAATAGTCTTTTCTTATATTATATATAGCATATTTCCCCCCTGTTGGAAACAATTTTTGTTGGTTGAATAACACTTCAGCGCTTGCTAAATTTAGAAAAAGTGACTACTGTGCAGGCTCCAATTGGAACTTCCTTTTAGCTAAATTTATTTTCCAGATAGATGTGTATTCTGAGGTGATTGCCTTTTTGCGATTCTTTGGAATATATAAGTACCGAGAGTCCTTATGATAAAGTTTGGGATGTGGATCACTTTAATCTTCATTATGGACCATGGTCCGCACGACAATATTTCTGTAATCATTTTTTTTTAATTCTAGTTCTTGGAGCCTTTTGTTCTTAGTGCCCCTAGGTAGTGGTTTTGGTTTGCATTTTTGTAGGACCTCGTTTGGTTGCTCTTGCATTGTTCTTAATGAAAGCCGTTTACTAGGCTAAAAGAACGGATTCCACGATTCAGGCCTTGTGGATGATGGAGTTGGTTTAGTAAAAGTAAAAAAGAAATAATTTGCGGTGGAAGGAACTTTGTCTTTCAATTTTCTAGATTTGTTTTTTTTTTTTTTGGGTTAATGACCACCCTCCAAAATGTTCAGCAAACGTGCTGATCCTTCCCTCCAAAACCCCCCTTCAAAGAAAAGTTCCAAGAATATTACTTCTCAATCTTGCTATCTTATGCACTTTGAATCTTACTAACCATTGTTTACAACCAAACACCATGCGTTGGCCTGAAGACACAAAAAGGACCACTTTAAGAAGGGGATTCCTATGGCTACTATTATTGTGCACCATGATATCCGTCTGACAGGTTCTAAGCTAGAATATGAAGAAAGAGAAGAAGTACGTGGTGGTTTTAGAAGGAATTAAATGAAGCCAGATCATTCCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGATGAGTTTGTTTTGATCCATTTTGATAACTTCTTAAAAATTCTCTCATTTTGGGGTTCCATAATTTGAAGGAATAGGGAGTCTCCTGATGAATTCTTTGAGTTGCTTTATTTTATTTCTAGGCCTGTAATTTACGTACGCACTAGAATCCAAGAAGTTTTCATTTTTTAACTCTGCTTCTGGTATTTTACCTTTATTTTATATCTAGTCTTAATTTTACTATTAAAATAGTAAAGCACATCTAATTGTATCAGAATGTTGATGATATGTTCTCCTTTCTCTTTTATTGTAGGTCCAATATCGTATGCACCCATCCCTTTCTGAATTTCCTTCCAACAGCTTCTACGAGGGCACACTACAAAATGGAGTTACAATCAATGAAAGGCAATCGACAGGCATCGACTTTCCATGGCCAGTTCCCAACCGTCCAATGTTCTTCTATGTTCAGGTATAATGGATTCAGGTATCTTTTAGAATTTGAATAGCAGAATAAAGAAAAAGGATCCTCCAGGGGTGGTGGTGTTGTCATAGTTTTGAAGCTACAAACCAATCTCCAATTATGAAGAACATAAAAAAGGAGTAATTATTAAGGAGATTAGTCTTTAGCACTCTTGAAAGGAGGAGCCCATGCTCTGAGGTTTGAGATGTACGCGTTCAAAGAAAACTTGAATTGTATGCAACGTATATTCAGTTTCAAAGCTTTTAAATGTATCTTCTTCGGAATAATCATCAATTGGACATTTTGAATGTATATTCTGTCGTGAAAAAAAAGAAAGAATGAATTTCAGTTTTGACATAGCTTATGTTTTGCTAACTTGCCCCATGTGTATGGTGTTTCTAAGTAGTCTTCAGTTTTGCTTGCATTCATGTTCTCAATTTTAACATCGAACATTTTTTAGATGGGACAAGAGGAGATAAGTGCCAGTGGAACTTCCTACCTAAATAGGACCGAGGCAGCAAATGTAGAAAAAATTGTGACCACTTTCTTAAGAAGTGGTGTAGTCCCTAGTCAGGTACAATATAGTTGACATCTTCGGTTGTCAGGATGATATTTTCCTTCTTTTTTTTTTAATATATGTTATTATCCTGTTGCTACAGATTGGAGTTATTACACCATATGAGGGCCAACGAGCTTACATTGTGAACTATATGTCAAGAAATGGTGCTCTCAGACAGCAACTTTACAAGGAAATTGAGGTAATGCTTATAAAGAGTTTGTGGATGCCCTCGTTACTAGCTGCTGCAATTATGCACTCTTTCTGTTGACAGGTTGCAAGTGTGGATTCATTTCAAGGAAGGGAAAAAGATTACATCATATTGTCATGCGTAAGGAGTAACGAACATCAGGTGGGTTACATTTTCAGGTGTAGTTACTGTCTTTTTCAAGCTTTTTTTAACTATCTATCACCGATTCACTTGAAAATTTAAAAGTGTACCCTTTCTGATGCTGTACATATCTCCTAGAAAGCAAAGGCGTTGGACATTTTAATTGCCTTGTGCATTAACTATTTTGTTTGTTAACCTTTTCATTTCCTTTTCAAATAAGGTTATCATTTTGGAGTAACTGATAACACCTGTTCCTTTTAGGGCATTGGATTCCTGAACGATCCTCGCAGGCTCAATGTGGCTTTAACACGAGCTCGATATGGTATTGTCATTCTTGGAAACCCAAAGGTTTTGAGTAAACAGCCTCTGTGGAATAGCTTGTTGACACATTACAAGGTATGTTGTGCTCAAGTAGAATATTTATATGGATTTTCAGTGAGTTGTGACCTTCTGATACAATATATTTGAAGCTGAGATCCCTGATTTATTATGATATATTTTTATTTCCCGATGAAAGTTCATTCTTATAAAATAAAATTGTGATATTTGTTCATCTTTCATCTAAGCTAAGTATTTGCTGTATTTCTGGCTTACTTCACTATCAATGTCCGGCAGGAACATGAATGCTTGGTTGAGGGTCCATTGAATAACTTGAAACAAAGCATGATTCAGTTTCAGAAGCCTAAAAAGGTTGACCACTGGATGAACAGGGGGGTGGGGGGACTTTTAAATTGATTGTTCTTCGCTGATTATTTTTATGCTTCAGATATACAATGATCGGCGTCTTTTCTTTGCTGGTGGTCCTGGAGTTGTCCCCAATGATAATTTAGGGCCTGTTGCCCCATCTGGTCCTAATGCTGATAGAAGAAGTAGTCGTGGTAGAGGTATGTTTTTTCCCCGGATATTCTACAATTTTCATAAGTTCCCAACTTCGTTTTGCGGAAGTTGTTCTTAAGGATTCATATGCTGCTTATGGCTCATTTCATGACCACGTAGATTGATCATGCCTTTCCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGACAACTCAAATCTCCTAATTCAGGTTCGTACTTTCCTCCTCACCTGCCGAATGGAGCGCAAAAGCCTGGAGTGCATGCATCTGGTTATCCCATGCCACGAGTTCCTCTTCCCTCGTTTCATGGTGGTCCCCCGCATCCATATGGAATTCCAACTCGTGGAGCTGTACATGGACCAGTTGGTGCTGTTCCTCATGTTCCTCAACCAGGAAGTAGAGGCTTTGGGGCAGGGCGGGGAAATGTTGGTGCTCCTATTGGTAGTCAGCTTCCAAATCAGCAAGGCTCTCAACAAAATATTGGAAATCTTGGCTCCACTTTTAACTTTCCTGGTTTGGAGAGTCCCAATAGCCAACCATCAGTGGGTGGTCCGTTGTCTCAACTTGGGTTTGTTAACAATGTGTGTATTCTGTCTCCCTTCTCAGTATCCGTCTTGAGGGTTACCTTATGTACTGAAAGTATTTTTGTTTGTTCAACCAATAGATGCCTGTTCAGCCGCCTACTCAAACATTTCGTGATGGATACTCCATGGGAGGAATTTCGCAGGTACTTAATTGGTGTATAATATGCTTAGTTTTGCTATATTTGTCCTTAATCCACTTTTCTAAAGTAATTTTGATGATTTATCAGGACTTCTTGGGCGATGACTTTAAAAGCCAGGGATCACATGTTCCATACAATGTTACGGATTTCTCCACTCAGGTACTTTTCTGCAAGAATGTAATTCGTGATTGCTAGCACTAGTGCCCAAGTTTAATTTTAAGGATATAAGTCAGATGCTTAAGTAGAAGATATAGAGCAATCGATACTAGAAATTATTTGCACTATTTTTCTGAACAAGTTTGAACATTCTTATGTTTCAAGGCACGTTTAGGCATATCTTATTTTTTTGTGTCGTACTCTAGTACATTGGAACCTTTTGTTTCTGTTTTCACCAGATGGTGGACGCATCCAATTAACTTTTAATTCTCTCTTTTGGTTTATTTGTAGGCCTCTCAAACTGGATATCCCATTGATTATGTTAGTCAAGGAGCACAGGGTGGCTTTCCAGGGAGCTTCCTGAACCAGAATTCTCAATCTGGATATAACCGTTTCGGAACAGGAAATGATTTCATGTCGCAGGTTTTCTACTATTTTGCTTTCCTTGACTTGATTTCTGAAGTATCCATTGGCATTTAACCTCCATCTGTCTCACAGGACTACATGAACCATGGTTCACAAGGTTTATTTACTCAAGTTGGTTTCAGCGATCCCTCACTAGATGAAGCCTTCCAAAGTCACTATAACGTGACAAATACAAACTCACTACAGTCTCAGGTTTCTCTTAATACTTTAGTAGATTTGATAATTTATTGTCAAGTTTTACTGAATGATAGGGTGTGGGAACAATTGGCTTGTTATATCTTCTCACATATATTACCATCAAGTGGCTTCTCACATATATTACCATCAAGTGGGGGGTGGGGGGTGGGGGGTCAATTCGATACCGAATAACGCCAACTGTTGCATTTTAATCCTTCGGTTATATTTTCTCTGTTTTTTCTTTGTTGCTGAATAAAGAATAATAATTTTACATCAGGGCATGATGAATTCTCTCTACTCCCAGCCCTTTGCACATTACAATTCACAGCCTTCGACCCAGCAGGCCCCACCACAGCAGCAGCCTCAGCAGGGCCAGAGCTCTCAAAATCAGAAAATTCACTTTAGTGGTTGAAAGAGTGTAGGATTCGCGGCGGTGATGGGTATGCAACTGTCTGCCTCTTGATTGCAAGCTTGCCATTTTTTGGTCGATATGATTGTTCTATCCTCAACCATTGTTGGTAACATGGGATTTGGCAATCTACTAATACTCAAGAGTGGACAGAAGACGGAAGTGCTGTCTTTCAGCAAAACAAGGGCGGCACATCCAGGGCTGAGAAAGAAAAGATAACCCATATTCCTGCTTGAGGTTTTGTGTTAATGAATGAAACTACAAATATTATTTTGTGCAATGGAGCACTGCATGGCACGGTTTCGGCAAAAGGGTTCCATAATACGTAGGATCGGCTACATTTTGTGTGATATAACCCCTGGCAACTGATTCATTTGAAAAAAAAAAAAAAAAAAATACATAAGCAGCAGTCAAATGGACATTTTTGTACAGCAGCAGCTGGTGAAAGTTGCCTTATCTCATCCCTTCACCTCCCCGTGACATGTACCTGTACTCTGCTTTATCATTTCGTCAAGTAGAGGATGGCTGTAGACTATAGGAAGTATTATATAATTTTCTTTTGCGTAGTGAGTGCTGCTACTACTCCATACAGGAATGAATGTATTTTGATATGGTATATATCTTTTGGGCTTGTTAGGATATATTATTTCAGACTCAGCTCTATTGTGAAGTTGCCATATTTAATGCCATTTTGTTCTTCATTGAGGGATGGTAATATGGAAAGAATTATTAAAGGCAATTGAACAGTTCATCTCATATCAAGCATGCCAACTAACATTCTTTTTTGATTAGGAGTTCCTTTCTTCATCAGGTGGTTGGAGATGGAAAATAGAAAACAAGTCTAAAATGGCGTGGTCGTCGTCTTCCT
mRNA sequence
TTCCTCTTCCTGTTCCATTTTTGTTTCTCTCTTGTAATGCCAGGCCAAACCCTAACCCTAATTTTATTTCCATTTCTTCCAAACCTTGAACTCCAACCCCCTAAATCCTGATTTTTCATCTTCTATTTCTAATTGGGATTGATATCATTGTTATTGATCTTCGATTGCCATGGATTCGCAGCAGAACAATTTGTTTGAAACGGCATCTCAACCTGATACGGGGAACGACGCCTACACATTTCTTGAGTTCAACACACAGGGAGAAGACTTCGATTACCCTGAATTTCGCGACCCTATTAGGTCCCCTGTAGCGTGGCCGACTCCTTCCGATTCGTTAGCGGATCACACGGACCGTGGTGGTGGGTCGGATCATCAGTCTGATGCATCTCCGGTTTCGGCTGCACCGGGAAGTGCCACGAAGGGCCGGACTGGAGGAGGTTCAGGGAATAGCGGCGGTAATAATCAAATGGTTGACGTATTGGCTGCCGGGATGAGTGGGTTGACGTTTGAGGATACGGGGGATGATGATAATTATGAGTTTGGGAAGGGAAATTTCACGGAGCATGCTTGTAGGTATTGTGGGATTTCGAATCCGGCTTGCGTTGTGAGGTGTAACGTGCCGTCATGCCGTAAGTGGTTCTGCAATTCGCGAGGGAACACGTCTGGGTCCCATATTGTGAATCACCTGGTCCGGGCAAAACATAAAGAAGTTTGTCTTCACAAAGACAGTCCTCTGGGAGAAACAATTCTCGAGTGTTACAATTGTGGGTGCCGAAATGTATTCCTCCTTGGATTTATTTCTGCAAAGACAGAAAGTGTGGTTGTTCTACTCTGTAGGGAACCTTGCTTGAGTGTTAATGCTCTTAAGGATATGAATTGGGACTTGAGTCAGTGGTGCCCTCTAATTGATGATAGGTGCTTCTTGCAGTGGCTGGTTAAGATTCCTTCTGAGCAAGAGCAGTTGAGAGCACGGCAAATTAGTGCCCAACAAATAAATAAAATTGAGGAACTTTGGAAGACAAACCCAGATGCATCTCTTGAAGATCTTGAAAAACCTGGTGTGGATGATGAACCACAGCCCGTAGCATTAAAATATGAAGATGCTTATCAGTATCAAAATGTATTTGCACCTCTTATCAAGCTTGAAGCCGACTACGATAAAATGATGAAAGAATCTCAAAGCAAGGATAATGTGACTGTACGTTGGGACATTGGTCTTAACAAGAAGAGAATAGCTTATTTTGTCTTTCCAAAGGAGGACAATGAGTTGCGCCTTGTACCTGGAGATGAGTTGAGGTTACGTTATTCTGGTGATGCAGCTCATCCAGCTTGGCATTCTGTTGGACATGTGATCAAGCTAACTGCACAGGAAGAGGTTGCACTCGAGCTACGTGCTAGCCAGGGGGTTCCAGTTGATGTGGTCCATGGTTTTAGTGTTGATTTTGTGTGGAAGAGTACTAGTTTCGATCGAATGCAGGGAGCTATGAAGACATTTGCTGTTGATGAGACCAGTGTTAGTGGTTATATCTACCATCACTTATTAGGACACGAAGTGGAGGTTCAGATGGTTCGCAATACACTTCCACGTCGTTTTGGTGCGCCTGGTCTCCCAGAACTCAATGCATCTCAAGTTTTTGCTGTAAAAAGTGTTCTTCAGAAGCCAATAAGCTTGATTCAGGGCCCTCCGGGTACTGGGAAAACTGTAACTTCTGCTGCCATAGTGTATCATATGGCCAAACAAGGCCAAGGGCAGGTACTGGTCTGTGCCCCTAGTAATGTGGCTGTGGACCAACTGGCAGAGAAGATAAGTGCAACTGGGTTAAAGGTTGTTAGGCTATGTGCAAAATCAAGAGAAGCTGTAAGTTCTCCTGTGGAACACTTAACCCTTCACTATCAGGTTCGACATCTTGACACATCGGAAAGAAGTGAACTTCATAAGTTGCAACAATTGAAAGATGAACAAGGGGAGCTGTCTAGCAGTGATGAGAAAAAATATAAAGCCCTTAAGAGAGCAACAGAGAGGGAAATTTCCCAGAGTGCAGATGTTATTTGTTGCACATGTGTCGGTGCTGGAGATCCTCGCCTGTCAAATTTTAGATTTCGCCAGGTTCTTATCGATGAGTCTACTCAGGCAACCGAACCTGAATGTCTTATTCCTCTTGTTCTTGGAGCAAAGCAGGCATGTCGAGCTGTTCTTGTTGGTGACCATTGCCAACTGGGACCTGTCATTATGTGCAAAAAAGCAGCACGTGCAGGGCTGGCTCAATCCCTTTTTGAGCGTCTTGTTCTTCTTGGTGTGAAGCCAATTAGATTGCAGGTCCAATATCGTATGCACCCATCCCTTTCTGAATTTCCTTCCAACAGCTTCTACGAGGGCACACTACAAAATGGAGTTACAATCAATGAAAGGCAATCGACAGGCATCGACTTTCCATGGCCAGTTCCCAACCGTCCAATGTTCTTCTATGTTCAGATGGGACAAGAGGAGATAAGTGCCAGTGGAACTTCCTACCTAAATAGGACCGAGGCAGCAAATGTAGAAAAAATTGTGACCACTTTCTTAAGAAGTGGTGTAGTCCCTAGTCAGATTGGAGTTATTACACCATATGAGGGCCAACGAGCTTACATTGTGAACTATATGTCAAGAAATGGTGCTCTCAGACAGCAACTTTACAAGGAAATTGAGGTTGCAAGTGTGGATTCATTTCAAGGAAGGGAAAAAGATTACATCATATTGTCATGCGTAAGGAGTAACGAACATCAGGGCATTGGATTCCTGAACGATCCTCGCAGGCTCAATGTGGCTTTAACACGAGCTCGATATGGTATTGTCATTCTTGGAAACCCAAAGGTTTTGAGTAAACAGCCTCTGTGGAATAGCTTGTTGACACATTACAAGGAACATGAATGCTTGGTTGAGGGTCCATTGAATAACTTGAAACAAAGCATGATTCAGTTTCAGAAGCCTAAAAAGATATACAATGATCGGCGTCTTTTCTTTGCTGGTGGTCCTGGAGTTGTCCCCAATGATAATTTAGGGCCTGTTGCCCCATCTGGTCCTAATGCTGATAGAAGAAGTAGTCGTGGTAGAGGTTCGTACTTTCCTCCTCACCTGCCGAATGGAGCGCAAAAGCCTGGAGTGCATGCATCTGGTTATCCCATGCCACGAGTTCCTCTTCCCTCGTTTCATGGTGGTCCCCCGCATCCATATGGAATTCCAACTCGTGGAGCTGTACATGGACCAGTTGGTGCTGTTCCTCATGTTCCTCAACCAGGAAGTAGAGGCTTTGGGGCAGGGCGGGGAAATGTTGGTGCTCCTATTGGTAGTCAGCTTCCAAATCAGCAAGGCTCTCAACAAAATATTGGAAATCTTGGCTCCACTTTTAACTTTCCTGGTTTGGAGAGTCCCAATAGCCAACCATCAGTGGGTGGTCCGTTGTCTCAACTTGGGTTTGTTAACAATATGCCTGTTCAGCCGCCTACTCAAACATTTCGTGATGGATACTCCATGGGAGGAATTTCGCAGGACTTCTTGGGCGATGACTTTAAAAGCCAGGGATCACATGTTCCATACAATGTTACGGATTTCTCCACTCAGGCCTCTCAAACTGGATATCCCATTGATTATGTTAGTCAAGGAGCACAGGGTGGCTTTCCAGGGAGCTTCCTGAACCAGAATTCTCAATCTGGATATAACCGTTTCGGAACAGGAAATGATTTCATGTCGCAGGACTACATGAACCATGGTTCACAAGGTTTATTTACTCAAGTTGGTTTCAGCGATCCCTCACTAGATGAAGCCTTCCAAAGTCACTATAACGTGACAAATACAAACTCACTACAGTCTCAGGGCATGATGAATTCTCTCTACTCCCAGCCCTTTGCACATTACAATTCACAGCCTTCGACCCAGCAGGCCCCACCACAGCAGCAGCCTCAGCAGGGCCAGAGCTCTCAAAATCAGAAAATTCACTTTAGTGGTTGAAAGAGTGTAGGATTCGCGGCGGTGATGGGTATGCAACTGTCTGCCTCTTGATTGCAAGCTTGCCATTTTTTGGTCGATATGATTGTTCTATCCTCAACCATTGTTGGTAACATGGGATTTGGCAATCTACTAATACTCAAGAGTGGACAGAAGACGGAAGTGCTGTCTTTCAGCAAAACAAGGGCGGCACATCCAGGGCTGAGAAAGAAAAGATAACCCATATTCCTGCTTGAGGTTTTGTGTTAATGAATGAAACTACAAATATTATTTTGTGCAATGGAGCACTGCATGGCACGGTTTCGGCAAAAGGGTTCCATAATACGTAGGATCGGCTACATTTTGTGTGATATAACCCCTGGCAACTGATTCATTTGAAAAAAAAAAAAAAAAAAATACATAAGCAGCAGTCAAATGGACATTTTTGTACAGCAGCAGCTGGTGAAAGTTGCCTTATCTCATCCCTTCACCTCCCCGTGACATGTACCTGTACTCTGCTTTATCATTTCGTCAAGTAGAGGATGGCTGTAGACTATAGGAAGTATTATATAATTTTCTTTTGCGTAGTGAGTGCTGCTACTACTCCATACAGGAATGAATGTATTTTGATATGGTATATATCTTTTGGGCTTGTTAGGATATATTATTTCAGACTCAGCTCTATTGTGAAGTTGCCATATTTAATGCCATTTTGTTCTTCATTGAGGGATGGTAATATGGAAAGAATTATTAAAGGCAATTGAACAGTTCATCTCATATCAAGCATGCCAACTAACATTCTTTTTTGATTAGGAGTTCCTTTCTTCATCAGGTGGTTGGAGATGGAAAATAGAAAACAAGTCTAAAATGGCGTGGTCGTCGTCTTCCT
Coding sequence (CDS)
ATGGATTCGCAGCAGAACAATTTGTTTGAAACGGCATCTCAACCTGATACGGGGAACGACGCCTACACATTTCTTGAGTTCAACACACAGGGAGAAGACTTCGATTACCCTGAATTTCGCGACCCTATTAGGTCCCCTGTAGCGTGGCCGACTCCTTCCGATTCGTTAGCGGATCACACGGACCGTGGTGGTGGGTCGGATCATCAGTCTGATGCATCTCCGGTTTCGGCTGCACCGGGAAGTGCCACGAAGGGCCGGACTGGAGGAGGTTCAGGGAATAGCGGCGGTAATAATCAAATGGTTGACGTATTGGCTGCCGGGATGAGTGGGTTGACGTTTGAGGATACGGGGGATGATGATAATTATGAGTTTGGGAAGGGAAATTTCACGGAGCATGCTTGTAGGTATTGTGGGATTTCGAATCCGGCTTGCGTTGTGAGGTGTAACGTGCCGTCATGCCGTAAGTGGTTCTGCAATTCGCGAGGGAACACGTCTGGGTCCCATATTGTGAATCACCTGGTCCGGGCAAAACATAAAGAAGTTTGTCTTCACAAAGACAGTCCTCTGGGAGAAACAATTCTCGAGTGTTACAATTGTGGGTGCCGAAATGTATTCCTCCTTGGATTTATTTCTGCAAAGACAGAAAGTGTGGTTGTTCTACTCTGTAGGGAACCTTGCTTGAGTGTTAATGCTCTTAAGGATATGAATTGGGACTTGAGTCAGTGGTGCCCTCTAATTGATGATAGGTGCTTCTTGCAGTGGCTGGTTAAGATTCCTTCTGAGCAAGAGCAGTTGAGAGCACGGCAAATTAGTGCCCAACAAATAAATAAAATTGAGGAACTTTGGAAGACAAACCCAGATGCATCTCTTGAAGATCTTGAAAAACCTGGTGTGGATGATGAACCACAGCCCGTAGCATTAAAATATGAAGATGCTTATCAGTATCAAAATGTATTTGCACCTCTTATCAAGCTTGAAGCCGACTACGATAAAATGATGAAAGAATCTCAAAGCAAGGATAATGTGACTGTACGTTGGGACATTGGTCTTAACAAGAAGAGAATAGCTTATTTTGTCTTTCCAAAGGAGGACAATGAGTTGCGCCTTGTACCTGGAGATGAGTTGAGGTTACGTTATTCTGGTGATGCAGCTCATCCAGCTTGGCATTCTGTTGGACATGTGATCAAGCTAACTGCACAGGAAGAGGTTGCACTCGAGCTACGTGCTAGCCAGGGGGTTCCAGTTGATGTGGTCCATGGTTTTAGTGTTGATTTTGTGTGGAAGAGTACTAGTTTCGATCGAATGCAGGGAGCTATGAAGACATTTGCTGTTGATGAGACCAGTGTTAGTGGTTATATCTACCATCACTTATTAGGACACGAAGTGGAGGTTCAGATGGTTCGCAATACACTTCCACGTCGTTTTGGTGCGCCTGGTCTCCCAGAACTCAATGCATCTCAAGTTTTTGCTGTAAAAAGTGTTCTTCAGAAGCCAATAAGCTTGATTCAGGGCCCTCCGGGTACTGGGAAAACTGTAACTTCTGCTGCCATAGTGTATCATATGGCCAAACAAGGCCAAGGGCAGGTACTGGTCTGTGCCCCTAGTAATGTGGCTGTGGACCAACTGGCAGAGAAGATAAGTGCAACTGGGTTAAAGGTTGTTAGGCTATGTGCAAAATCAAGAGAAGCTGTAAGTTCTCCTGTGGAACACTTAACCCTTCACTATCAGGTTCGACATCTTGACACATCGGAAAGAAGTGAACTTCATAAGTTGCAACAATTGAAAGATGAACAAGGGGAGCTGTCTAGCAGTGATGAGAAAAAATATAAAGCCCTTAAGAGAGCAACAGAGAGGGAAATTTCCCAGAGTGCAGATGTTATTTGTTGCACATGTGTCGGTGCTGGAGATCCTCGCCTGTCAAATTTTAGATTTCGCCAGGTTCTTATCGATGAGTCTACTCAGGCAACCGAACCTGAATGTCTTATTCCTCTTGTTCTTGGAGCAAAGCAGGCATGTCGAGCTGTTCTTGTTGGTGACCATTGCCAACTGGGACCTGTCATTATGTGCAAAAAAGCAGCACGTGCAGGGCTGGCTCAATCCCTTTTTGAGCGTCTTGTTCTTCTTGGTGTGAAGCCAATTAGATTGCAGGTCCAATATCGTATGCACCCATCCCTTTCTGAATTTCCTTCCAACAGCTTCTACGAGGGCACACTACAAAATGGAGTTACAATCAATGAAAGGCAATCGACAGGCATCGACTTTCCATGGCCAGTTCCCAACCGTCCAATGTTCTTCTATGTTCAGATGGGACAAGAGGAGATAAGTGCCAGTGGAACTTCCTACCTAAATAGGACCGAGGCAGCAAATGTAGAAAAAATTGTGACCACTTTCTTAAGAAGTGGTGTAGTCCCTAGTCAGATTGGAGTTATTACACCATATGAGGGCCAACGAGCTTACATTGTGAACTATATGTCAAGAAATGGTGCTCTCAGACAGCAACTTTACAAGGAAATTGAGGTTGCAAGTGTGGATTCATTTCAAGGAAGGGAAAAAGATTACATCATATTGTCATGCGTAAGGAGTAACGAACATCAGGGCATTGGATTCCTGAACGATCCTCGCAGGCTCAATGTGGCTTTAACACGAGCTCGATATGGTATTGTCATTCTTGGAAACCCAAAGGTTTTGAGTAAACAGCCTCTGTGGAATAGCTTGTTGACACATTACAAGGAACATGAATGCTTGGTTGAGGGTCCATTGAATAACTTGAAACAAAGCATGATTCAGTTTCAGAAGCCTAAAAAGATATACAATGATCGGCGTCTTTTCTTTGCTGGTGGTCCTGGAGTTGTCCCCAATGATAATTTAGGGCCTGTTGCCCCATCTGGTCCTAATGCTGATAGAAGAAGTAGTCGTGGTAGAGGTTCGTACTTTCCTCCTCACCTGCCGAATGGAGCGCAAAAGCCTGGAGTGCATGCATCTGGTTATCCCATGCCACGAGTTCCTCTTCCCTCGTTTCATGGTGGTCCCCCGCATCCATATGGAATTCCAACTCGTGGAGCTGTACATGGACCAGTTGGTGCTGTTCCTCATGTTCCTCAACCAGGAAGTAGAGGCTTTGGGGCAGGGCGGGGAAATGTTGGTGCTCCTATTGGTAGTCAGCTTCCAAATCAGCAAGGCTCTCAACAAAATATTGGAAATCTTGGCTCCACTTTTAACTTTCCTGGTTTGGAGAGTCCCAATAGCCAACCATCAGTGGGTGGTCCGTTGTCTCAACTTGGGTTTGTTAACAATATGCCTGTTCAGCCGCCTACTCAAACATTTCGTGATGGATACTCCATGGGAGGAATTTCGCAGGACTTCTTGGGCGATGACTTTAAAAGCCAGGGATCACATGTTCCATACAATGTTACGGATTTCTCCACTCAGGCCTCTCAAACTGGATATCCCATTGATTATGTTAGTCAAGGAGCACAGGGTGGCTTTCCAGGGAGCTTCCTGAACCAGAATTCTCAATCTGGATATAACCGTTTCGGAACAGGAAATGATTTCATGTCGCAGGACTACATGAACCATGGTTCACAAGGTTTATTTACTCAAGTTGGTTTCAGCGATCCCTCACTAGATGAAGCCTTCCAAAGTCACTATAACGTGACAAATACAAACTCACTACAGTCTCAGGGCATGATGAATTCTCTCTACTCCCAGCCCTTTGCACATTACAATTCACAGCCTTCGACCCAGCAGGCCCCACCACAGCAGCAGCCTCAGCAGGGCCAGAGCTCTCAAAATCAGAAAATTCACTTTAGTGGTTGA
Protein sequence
MDSQQNNLFETASQPDTGNDAYTFLEFNTQGEDFDYPEFRDPIRSPVAWPTPSDSLADHTDRGGGSDHQSDASPVSAAPGSATKGRTGGGSGNSGGNNQMVDVLAAGMSGLTFEDTGDDDNYEFGKGNFTEHACRYCGISNPACVVRCNVPSCRKWFCNSRGNTSGSHIVNHLVRAKHKEVCLHKDSPLGETILECYNCGCRNVFLLGFISAKTESVVVLLCREPCLSVNALKDMNWDLSQWCPLIDDRCFLQWLVKIPSEQEQLRARQISAQQINKIEELWKTNPDASLEDLEKPGVDDEPQPVALKYEDAYQYQNVFAPLIKLEADYDKMMKESQSKDNVTVRWDIGLNKKRIAYFVFPKEDNELRLVPGDELRLRYSGDAAHPAWHSVGHVIKLTAQEEVALELRASQGVPVDVVHGFSVDFVWKSTSFDRMQGAMKTFAVDETSVSGYIYHHLLGHEVEVQMVRNTLPRRFGAPGLPELNASQVFAVKSVLQKPISLIQGPPGTGKTVTSAAIVYHMAKQGQGQVLVCAPSNVAVDQLAEKISATGLKVVRLCAKSREAVSSPVEHLTLHYQVRHLDTSERSELHKLQQLKDEQGELSSSDEKKYKALKRATEREISQSADVICCTCVGAGDPRLSNFRFRQVLIDESTQATEPECLIPLVLGAKQACRAVLVGDHCQLGPVIMCKKAARAGLAQSLFERLVLLGVKPIRLQVQYRMHPSLSEFPSNSFYEGTLQNGVTINERQSTGIDFPWPVPNRPMFFYVQMGQEEISASGTSYLNRTEAANVEKIVTTFLRSGVVPSQIGVITPYEGQRAYIVNYMSRNGALRQQLYKEIEVASVDSFQGREKDYIILSCVRSNEHQGIGFLNDPRRLNVALTRARYGIVILGNPKVLSKQPLWNSLLTHYKEHECLVEGPLNNLKQSMIQFQKPKKIYNDRRLFFAGGPGVVPNDNLGPVAPSGPNADRRSSRGRGSYFPPHLPNGAQKPGVHASGYPMPRVPLPSFHGGPPHPYGIPTRGAVHGPVGAVPHVPQPGSRGFGAGRGNVGAPIGSQLPNQQGSQQNIGNLGSTFNFPGLESPNSQPSVGGPLSQLGFVNNMPVQPPTQTFRDGYSMGGISQDFLGDDFKSQGSHVPYNVTDFSTQASQTGYPIDYVSQGAQGGFPGSFLNQNSQSGYNRFGTGNDFMSQDYMNHGSQGLFTQVGFSDPSLDEAFQSHYNVTNTNSLQSQGMMNSLYSQPFAHYNSQPSTQQAPPQQQPQQGQSSQNQKIHFSG
Homology
BLAST of CmaCh16G010870 vs. ExPASy Swiss-Prot
Match:
Q9FJR0 (Regulator of nonsense transcripts 1 homolog OS=Arabidopsis thaliana OX=3702 GN=UPF1 PE=1 SV=2)
HSP 1 Score: 2031.1 bits (5261), Expect = 0.0e+00
Identity = 1041/1284 (81.07%), Postives = 1121/1284 (87.31%), Query Frame = 0
Query: 1 MDSQQNNLFETASQPDTGNDAYTFLEFNTQGE-DFDYPEFRDPIRSPVAWPTPSD--SLA 60
MDSQQ++LF+TASQPDT D YTFLEFNTQG+ +FDY +F SP AWPTPSD S+A
Sbjct: 1 MDSQQSDLFDTASQPDTVADEYTFLEFNTQGDSEFDYQDF----GSPTAWPTPSDSISIA 60
Query: 61 DHTDRGGG---SDHQSDA-SPVSAAPGSATKGRTG-GGSGNSGG--NNQMVDVLAAGMSG 120
D DRG G +DH S+A SP S + G+ + G GG G SGG ++ VD LAAG+
Sbjct: 61 DVADRGEGGAAADHHSEASSPSSLSAGAGNGAKVGRGGVGGSGGVSSSSQVDALAAGVGN 120
Query: 121 LTFEDTGDDDNYEFGKGNFTEHACRYCGISNPACVVRCNVPSCRKWFCNSRGNTSGSHIV 180
L FE+TGDDD +++GK +FTEHAC+YCGISNPACVVRCNV SCRKWFCNSRGNTSGSHIV
Sbjct: 121 LNFEETGDDDGFDYGKNDFTEHACKYCGISNPACVVRCNVASCRKWFCNSRGNTSGSHIV 180
Query: 181 NHLVRAKHKEVCLHKDSPLGETILECYNCGCRNVFLLGFISAKTESVVVLLCREPCLSVN 240
NHLVRAKHKEVCLH+DSPLGETILECYNCGCRNVFLLGFISAKT+SVVVLLCR+PCL+VN
Sbjct: 181 NHLVRAKHKEVCLHRDSPLGETILECYNCGCRNVFLLGFISAKTDSVVVLLCRDPCLNVN 240
Query: 241 ALKDMNWDLSQWCPLIDDRCFLQWLVKIPSEQEQLRARQISAQQINKIEELWKTNPDASL 300
ALKDMNWDLSQWCPLIDDRCFL WLVK+PSEQEQLRARQISAQQINKIEELWKTNPDA+L
Sbjct: 241 ALKDMNWDLSQWCPLIDDRCFLPWLVKVPSEQEQLRARQISAQQINKIEELWKTNPDATL 300
Query: 301 EDLEKPGVDDEPQPVALKYEDAYQYQNVFAPLIKLEADYDKMMKESQSKDNVTVRWDIGL 360
EDLEKPGVDDEPQPV KYEDAYQYQNVFAPLIKLEADYDKMMKESQSK+N+TVRWDIGL
Sbjct: 301 EDLEKPGVDDEPQPVQPKYEDAYQYQNVFAPLIKLEADYDKMMKESQSKENLTVRWDIGL 360
Query: 361 NKKRIAYFVFPKEDNELRLVPGDELRLRYSGDAAHPAWHSVGHVIKLTAQEEVALELRAS 420
NKKR+AYFVFPKE+NELRLVPGDELRLRYSGDA HP+W SVGHVIKLTAQEEVALELRA+
Sbjct: 361 NKKRVAYFVFPKEENELRLVPGDELRLRYSGDAVHPSWQSVGHVIKLTAQEEVALELRAN 420
Query: 421 QGVPVDVVHGFSVDFVWKSTSFDRMQGAMKTFAVDETSVSGYIYHHLLGHEVEVQMVRNT 480
QGVP+DV HGFSVDFVWKSTSFDRMQGAMK FAVDETSVSGYIYH LLGHEVE QMVRNT
Sbjct: 421 QGVPIDVNHGFSVDFVWKSTSFDRMQGAMKNFAVDETSVSGYIYHQLLGHEVEAQMVRNT 480
Query: 481 LPRRFGAPGLPELNASQVFAVKSVLQKPISLIQGPPGTGKTVTSAAIVYHMAKQGQGQVL 540
LPRRFG PGLPELNASQV AVKSVLQKPISLIQGPPGTGKTVTSAAIVYHMAKQGQGQVL
Sbjct: 481 LPRRFGVPGLPELNASQVNAVKSVLQKPISLIQGPPGTGKTVTSAAIVYHMAKQGQGQVL 540
Query: 541 VCAPSNVAVDQLAEKISATGLKVVRLCAKSREAVSSPVEHLTLHYQVRHLDTSERSELHK 600
VCAPSNVAVDQLAEKISATGLKVVRLCAKSREAVSSPVE+LTLHYQVRHLDTSE+SELHK
Sbjct: 541 VCAPSNVAVDQLAEKISATGLKVVRLCAKSREAVSSPVEYLTLHYQVRHLDTSEKSELHK 600
Query: 601 LQQLKDEQGELSSSDEKKYKALKRATEREISQSADVICCTCVGAGDPRLSNFRFRQVLID 660
LQQLKDEQGELSSSDEKKYK LKRATEREI+QSADVICCTCVGA D RLSNFRFRQVLID
Sbjct: 601 LQQLKDEQGELSSSDEKKYKNLKRATEREITQSADVICCTCVGAADLRLSNFRFRQVLID 660
Query: 661 ESTQATEPECLIPLVLGAKQACRAVLVGDHCQLGPVIMCKKAARAGLAQSLFERLVLLGV 720
ESTQATEPECLIPLVLG KQ VLVGDHCQLGPVIMCKKAARAGLAQSLFERLV LG+
Sbjct: 661 ESTQATEPECLIPLVLGVKQ---VVLVGDHCQLGPVIMCKKAARAGLAQSLFERLVTLGI 720
Query: 721 KPIRLQVQYRMHPSLSEFPSNSFYEGTLQNGVTINERQSTGIDFPWPVPNRPMFFYVQMG 780
KPIRLQVQYRMHP+LSEFPSNSFYEGTLQNGVTI ERQ+TGIDFPWPVPNRPMFFYVQ+G
Sbjct: 721 KPIRLQVQYRMHPALSEFPSNSFYEGTLQNGVTIIERQTTGIDFPWPVPNRPMFFYVQLG 780
Query: 781 QEEISASGTSYLNRTEAANVEKIVTTFLRSGVVPSQIGVITPYEGQRAYIVNYMSRNGAL 840
QEEISASGTSYLNRTEAANVEK+VT FL+SGVVPSQIGVITPYEGQRAYIVNYM+RNG+L
Sbjct: 781 QEEISASGTSYLNRTEAANVEKLVTAFLKSGVVPSQIGVITPYEGQRAYIVNYMARNGSL 840
Query: 841 RQQLYKEIEVASVDSFQGREKDYIILSCVRSNEHQGIGFLNDPRRLNVALTRARYGIVIL 900
RQQLYKEIEVASVDSFQGREKDYIILSCVRSNEHQGIGFLNDPRRLNVALTRARYGIVIL
Sbjct: 841 RQQLYKEIEVASVDSFQGREKDYIILSCVRSNEHQGIGFLNDPRRLNVALTRARYGIVIL 900
Query: 901 GNPKVLSKQPLWNSLLTHYKEHECLVEGPLNNLKQSMIQFQKPKKIYNDRRLFFAGGPGV 960
GNPKVLSKQPLWN LLTHYKEHECLVEGPLNNLKQSM+QFQKP+KIYNDRRLF+ GG G+
Sbjct: 901 GNPKVLSKQPLWNGLLTHYKEHECLVEGPLNNLKQSMVQFQKPRKIYNDRRLFYGGGAGM 960
Query: 961 VPNDNLGPVAPSGPNADRRSSRGR--GSYFPPHLPNGAQKPGVHASGYPMPRVPLPSFHG 1020
+ NDN G PNADRR SRGR GSY P PNGA +PG+H +GYP+PRVPL F G
Sbjct: 961 IGNDNFG---SGNPNADRRGSRGRAGGSYLPSGPPNGA-RPGLHPAGYPIPRVPLSPFPG 1020
Query: 1021 GPP-HPYGIPTRGAVHGPVGAVPHVPQPGSRGFGAGRGNVGAPIGSQLPNQQGSQQNIGN 1080
GPP PY IPTR GPVGAVPH PQPG+ GFGAGR G +G LP+QQ +Q N+G
Sbjct: 1021 GPPSQPYAIPTR----GPVGAVPHAPQPGNHGFGAGR---GTSVGGHLPHQQATQHNVGT 1080
Query: 1081 LGSTFNFPGLESPNSQPSVGGPLSQLGFVNNMPVQPPTQTFRDGYSMGGISQDFLGDDFK 1140
+G + NFP L+SPNSQPS GGPLSQ G+ +Q FRDG+SMGGISQDFL DD K
Sbjct: 1081 IGPSLNFP-LDSPNSQPSPGGPLSQPGY--------GSQAFRDGFSMGGISQDFLADDIK 1140
Query: 1141 SQGSHVPYNVTDFSTQASQTGYPIDYVSQGAQGGFPGSFLNQNSQSGYNRFGTGNDFMSQ 1200
SQGSH PYN+ DF+TQAS G+ +DY +QGA G FPG+F+NQNSQ GY+RF NDFMSQ
Sbjct: 1141 SQGSHDPYNMADFATQASPGGFAVDYATQGAHGAFPGNFMNQNSQGGYSRFSGINDFMSQ 1200
Query: 1201 DYMNHGSQGLFTQVGFSDPSLDEAFQSHYNVTNTNSLQSQGMMNSLYSQPFAHYNSQPST 1260
+YM HG QGLFTQ GF D S D+ Q+ Y V N N LQSQG+ NSLYSQPFAHYN+QP
Sbjct: 1201 EYMAHGGQGLFTQAGFIDSSQDDGQQNPYGVNNPN-LQSQGLPNSLYSQPFAHYNTQPLN 1254
Query: 1261 QQAPPQQQPQQGQSSQNQKIHFSG 1272
P Q QP QSSQN K ++G
Sbjct: 1261 LSGPQQSQP--NQSSQNPKHPYNG 1254
BLAST of CmaCh16G010870 vs. ExPASy Swiss-Prot
Match:
Q92900 (Regulator of nonsense transcripts 1 OS=Homo sapiens OX=9606 GN=UPF1 PE=1 SV=2)
HSP 1 Score: 1253.8 bits (3243), Expect = 0.0e+00
Identity = 679/1134 (59.88%), Postives = 812/1134 (71.60%), Query Frame = 0
Query: 28 NTQGEDFDYPEFRDPIRSPVAWPTPSDSLADHTDRGGGSDHQSDASPVSAAPGSATKGRT 87
+TQG +F++ +F P ++ P GGG + AA G
Sbjct: 27 DTQGSEFEFTDFTLPSQTQTPPGGPGGP-------GGGGAGGPGGAGAGAAAGQLDAQVG 86
Query: 88 GGGSGNSGGNNQMVDVLAAGMSGLTFEDTGDDDNYEFGKGNFTEHACRYCGISNPACVVR 147
G +G + V + ++ L FE+ +D Y + HAC YCGI +PACVV
Sbjct: 87 PEGILQNGAVDDSVAKTSQLLAELNFEEDEEDTYY---TKDLPIHACSYCGIHDPACVVY 146
Query: 148 CNVPSCRKWFCNSRGNTSGSHIVNHLVRAKHKEVCLHKDSPLGETILECYNCGCRNVFLL 207
CN + +KWFCN RGNTSGSHIVNHLVRAK KEV LHKD PLGET+LECYNCGCRNVFLL
Sbjct: 147 CN--TSKKWFCNGRGNTSGSHIVNHLVRAKCKEVTLHKDGPLGETVLECYNCGCRNVFLL 206
Query: 208 GFISAKTESVVVLLCREPCLSVNALKDMNWDLSQWCPLIDDRCFLQWLVKIPSEQEQLRA 267
GFI AK +SVVVLLCR+PC S ++LKD+NWD SQW PLI DRCFL WLVKIPSEQEQLRA
Sbjct: 207 GFIPAKADSVVVLLCRQPCASQSSLKDINWDSSQWQPLIQDRCFLSWLVKIPSEQEQLRA 266
Query: 268 RQISAQQINKIEELWKTNPDASLEDLEKPGVDDEPQPVALKYEDAYQYQNVFAPLIKLEA 327
RQI+AQQINK+EELWK NP A+LEDLEKPGVD+EPQ V L+YEDAYQYQN+F PL+KLEA
Sbjct: 267 RQITAQQINKLEELWKENPSATLEDLEKPGVDEEPQHVLLRYEDAYQYQNIFGPLVKLEA 326
Query: 328 DYDKMMKESQSKDNVTVRWDIGLNKKRIAYFVFPKEDN-----------ELRLVPGDELR 387
DYDK +KESQ++DN+TVRWD+GLNKKRIAYF PK D+ ++RL+ GDE+
Sbjct: 327 DYDKKLKESQTQDNITVRWDLGLNKKRIAYFTLPKTDSGNEDLVIIWLRDMRLMQGDEIC 386
Query: 388 LRYSGDAAHPAWHSVGHVIKLTAQ--EEVALELRASQGVPVDVVHGFSVDFVWKSTSFDR 447
LRY GD A P W +GHVIK+ +E+A+ELR+S G PV+V H F VDFVWKSTSFDR
Sbjct: 387 LRYKGDLA-PLWKGIGHVIKVPDNYGDEIAIELRSSVGAPVEVTHNFQVDFVWKSTSFDR 446
Query: 448 MQGAMKTFAVDETSVSGYIYHHLLGHEVEVQMVRNTLPRRFGAPGLPELNASQVFAVKSV 507
MQ A+KTFAVDETSVSGYIYH LLGHEVE +++ LP+RF A GLP+LN SQV+AVK+V
Sbjct: 447 MQSALKTFAVDETSVSGYIYHKLLGHEVEDVIIKCQLPKRFTAQGLPDLNHSQVYAVKTV 506
Query: 508 LQKPISLIQGPPGTGKTVTSAAIVYHMAKQGQGQVLVCAPSNVAVDQLAEKISATGLKVV 567
LQ+P+SLIQGPPGTGKTVTSA IVYH+A+QG G VLVCAPSN+AVDQL EKI TGLKVV
Sbjct: 507 LQRPLSLIQGPPGTGKTVTSATIVYHLARQGNGPVLVCAPSNIAVDQLTEKIHQTGLKVV 566
Query: 568 RLCAKSREAVSSPVEHLTLHYQVRHLDTSERSELHKLQQLKDEQGELSSSDEKKYKALKR 627
RLCAKSREA+ SPV L LH Q+R++D+ EL KLQQLKDE GELSS+DEK+Y+ALKR
Sbjct: 567 RLCAKSREAIDSPVSFLALHNQIRNMDS--MPELQKLQQLKDETGELSSADEKRYRALKR 626
Query: 628 ATEREISQSADVICCTCVGAGDPRLSNFRFRQVLIDESTQATEPECLIPLVLGAKQACRA 687
ERE+ +ADVICCTCVGAGDPRL+ +FR +LIDESTQATEPEC++P+VLGAKQ
Sbjct: 627 TAERELLMNADVICCTCVGAGDPRLAKMQFRSILIDESTQATEPECMVPVVLGAKQ---L 686
Query: 688 VLVGDHCQLGPVIMCKKAARAGLAQSLFERLVLLGVKPIRLQVQYRMHPSLSEFPSNSFY 747
+LVGDHCQLGPV+MCKKAA+AGL+QSLFERLV+LG++PIRLQVQYRMHP+LS FPSN FY
Sbjct: 687 ILVGDHCQLGPVVMCKKAAKAGLSQSLFERLVVLGIRPIRLQVQYRMHPALSAFPSNIFY 746
Query: 748 EGTLQNGVTINERQSTGIDFPWPVPNRPMFFYVQMGQEEISASGTSYLNRTEAANVEKIV 807
EG+LQNGVT +R G DF WP P++PMFFYV GQEEI++SGTSYLNRTEAANVEKI
Sbjct: 747 EGSLQNGVTAADRVKKGFDFQWPQPDKPMFFYVTQGQEEIASSGTSYLNRTEAANVEKIT 806
Query: 808 TTFLRSGVVPSQIGVITPYEGQRAYIVNYMSRNGALRQQLYKEIEVASVDSFQGREKDYI 867
T L++G P QIG+ITPYEGQR+Y+V YM +G+L +LY+E+E+ASVD+FQGREKD+I
Sbjct: 807 TKLLKAGAKPDQIGIITPYEGQRSYLVQYMQFSGSLHTKLYQEVEIASVDAFQGREKDFI 866
Query: 868 ILSCVRSNEHQGIGFLNDPRRLNVALTRARYGIVILGNPKVLSKQPLWNSLLTHYKEHEC 927
ILSCVR+NEHQGIGFLNDPRRLNVALTRARYG++I+GNPK LSKQPLWN LL +YKE +
Sbjct: 867 ILSCVRANEHQGIGFLNDPRRLNVALTRARYGVIIVGNPKALSKQPLWNHLLNYYKEQKV 926
Query: 928 LVEGPLNNLKQSMIQFQKPKKIYNDRRLFFAGGPG-----VVPNDNLGPVAPSGPNADRR 987
LVEGPLNNL++S++QF KP+K+ N PG D + P + R
Sbjct: 927 LVEGPLNNLRESLMQFSKPRKLVN------TINPGARFMTTAMYDAREAIIPG--SVYDR 986
Query: 988 SSRGRGS--YFPPHLPNGAQKPG---VHASGYP------MPRVPLPSFHGGPPHPYGIPT 1047
SS+GR S YF H G G V A P MP +P P + G P
Sbjct: 987 SSQGRPSSMYFQTHDQIGMISAGPSHVAAMNIPIPFNLVMPPMPPPGYFGQANGP--AAG 1046
Query: 1048 RGAVHGPVGAVPHVPQPGSRGFGAGRGNVGAPIGSQ--LPNQQGSQQNIGNLGSTFNFPG 1107
RG G G RG G + G P SQ LPN Q SQ
Sbjct: 1047 RGTPKGKTG----------RG-GRQKNRFGLPGPSQTNLPNSQASQ-------------- 1101
Query: 1108 LESPNSQPSVGGPLSQLGFVNNMPVQPPTQTFRDGYSMGGISQD-FLGDDFKSQ 1130
SQP G L+Q G+++ + P+Q + G S +SQD +LGD+FKSQ
Sbjct: 1107 --DVASQPFSQGALTQ-GYIS---MSQPSQMSQPGLSQPELSQDSYLGDEFKSQ 1101
BLAST of CmaCh16G010870 vs. ExPASy Swiss-Prot
Match:
Q9EPU0 (Regulator of nonsense transcripts 1 OS=Mus musculus OX=10090 GN=Upf1 PE=1 SV=2)
HSP 1 Score: 1251.1 bits (3236), Expect = 0.0e+00
Identity = 674/1128 (59.75%), Postives = 813/1128 (72.07%), Query Frame = 0
Query: 28 NTQGEDFDYPEFRDPIRSPVAWPTPSDSLADHTDRGGGSDHQSDASPVSAAPGSATKGRT 87
+TQG +F++ +F P ++ P + GG+ Q DA P
Sbjct: 27 DTQGSEFEFTDFTLPSQTQTPPGGPGGAGGPGGAGAGGAAGQLDA---QVGP-------- 86
Query: 88 GGGSGNSGGNNQMVDVLAAGMSGLTFEDTGDDDNYEFGKGNFTEHACRYCGISNPACVVR 147
G +G + V + ++ L FE+ +D Y + HAC YCGI +PACVV
Sbjct: 87 -EGILQNGAVDDSVAKTSQLLAELNFEEDEEDTYY---TKDLPVHACSYCGIHDPACVVY 146
Query: 148 CNVPSCRKWFCNSRGNTSGSHIVNHLVRAKHKEVCLHKDSPLGETILECYNCGCRNVFLL 207
CN + +KWFCN RGNTSGSHIVNHLVRAK KEV LHKD PLGET+LECYNCGCRNVFLL
Sbjct: 147 CN--TSKKWFCNGRGNTSGSHIVNHLVRAKCKEVTLHKDGPLGETVLECYNCGCRNVFLL 206
Query: 208 GFISAKTESVVVLLCREPCLSVNALKDMNWDLSQWCPLIDDRCFLQWLVKIPSEQEQLRA 267
GFI AK +SVVVLLCR+PC S ++LKD+NWD SQW PLI DRCFL WLVKIPSEQEQLRA
Sbjct: 207 GFIPAKADSVVVLLCRQPCASQSSLKDINWDSSQWQPLIQDRCFLSWLVKIPSEQEQLRA 266
Query: 268 RQISAQQINKIEELWKTNPDASLEDLEKPGVDDEPQPVALKYEDAYQYQNVFAPLIKLEA 327
RQI+AQQINK+EELWK NP A+LEDLEKPGVD+EPQ V L+YEDAYQYQN+F PL+KLEA
Sbjct: 267 RQITAQQINKLEELWKENPSATLEDLEKPGVDEEPQHVLLRYEDAYQYQNIFGPLVKLEA 326
Query: 328 DYDKMMKESQSKDNVTVRWDIGLNKKRIAYFVFPKEDN-----------ELRLVPGDELR 387
DYDK +KESQ++DN+TVRWD+GLNKKRIA+F PK D+ ++RL+ GDE+
Sbjct: 327 DYDKKLKESQTQDNITVRWDLGLNKKRIAFFTLPKTDSGNEDLVIIWLRDMRLMQGDEIC 386
Query: 388 LRYSGDAAHPAWHSVGHVIKLTAQ--EEVALELRASQGVPVDVVHGFSVDFVWKSTSFDR 447
LRY GD A P W +GHVIK+ +E+A+ELR+S G PV+V H F VDFVWKSTSFDR
Sbjct: 387 LRYKGDLA-PLWKGIGHVIKVPDNYGDEIAIELRSSVGAPVEVTHNFQVDFVWKSTSFDR 446
Query: 448 MQGAMKTFAVDETSVSGYIYHHLLGHEVEVQMVRNTLPRRFGAPGLPELNASQVFAVKSV 507
MQ A+KTFAVDETSVSGYIYH LLGHEVE +++ LP+RF A GLP+LN SQV+AVK+V
Sbjct: 447 MQSALKTFAVDETSVSGYIYHKLLGHEVEDVVIKCQLPKRFTAQGLPDLNHSQVYAVKTV 506
Query: 508 LQKPISLIQGPPGTGKTVTSAAIVYHMAKQGQGQVLVCAPSNVAVDQLAEKISATGLKVV 567
LQ+P+SLIQGPPGTGKTVTSA IVYH+A+QG G VLVCAPSN+AVDQL EKI TGLKVV
Sbjct: 507 LQRPLSLIQGPPGTGKTVTSATIVYHLARQGNGPVLVCAPSNIAVDQLTEKIHQTGLKVV 566
Query: 568 RLCAKSREAVSSPVEHLTLHYQVRHLDTSERSELHKLQQLKDEQGELSSSDEKKYKALKR 627
RLCAKSREA+ SPV L LH Q+R++D+ EL KLQQLKDE GELSS+DEK+Y+ALKR
Sbjct: 567 RLCAKSREAIDSPVSFLALHNQIRNMDS--MPELQKLQQLKDETGELSSADEKRYRALKR 626
Query: 628 ATEREISQSADVICCTCVGAGDPRLSNFRFRQVLIDESTQATEPECLIPLVLGAKQACRA 687
ERE+ +ADVICCTCVGAGDPRL+ +FR +LIDESTQATEPEC++P+VLGAKQ
Sbjct: 627 TAERELLMNADVICCTCVGAGDPRLAKMQFRSILIDESTQATEPECMVPVVLGAKQ---L 686
Query: 688 VLVGDHCQLGPVIMCKKAARAGLAQSLFERLVLLGVKPIRLQVQYRMHPSLSEFPSNSFY 747
+LVGDHCQLGPV+MCKKAA+AGL+QSLFERLV+LG++PIRLQVQYRMHP+LS FPSN FY
Sbjct: 687 ILVGDHCQLGPVVMCKKAAKAGLSQSLFERLVVLGIRPIRLQVQYRMHPALSAFPSNIFY 746
Query: 748 EGTLQNGVTINERQSTGIDFPWPVPNRPMFFYVQMGQEEISASGTSYLNRTEAANVEKIV 807
EG+LQNGVT +R G DF WP P++PMFFYV GQEEI++SGTSYLNRTEAANVEKI
Sbjct: 747 EGSLQNGVTAADRVKKGFDFQWPQPDKPMFFYVTQGQEEIASSGTSYLNRTEAANVEKIT 806
Query: 808 TTFLRSGVVPSQIGVITPYEGQRAYIVNYMSRNGALRQQLYKEIEVASVDSFQGREKDYI 867
T L++G P QIG+ITPYEGQR+Y+V YM +G+L +LY+E+E+ASVD+FQGREKD+I
Sbjct: 807 TKLLKAGAKPDQIGIITPYEGQRSYLVQYMQFSGSLHTKLYQEVEIASVDAFQGREKDFI 866
Query: 868 ILSCVRSNEHQGIGFLNDPRRLNVALTRARYGIVILGNPKVLSKQPLWNSLLTHYKEHEC 927
ILSCVR+NEHQGIGFLNDPRRLNVALTRARYG++I+GNPK LSKQPLWN LL++YKE +
Sbjct: 867 ILSCVRANEHQGIGFLNDPRRLNVALTRARYGVIIVGNPKALSKQPLWNHLLSYYKEQKA 926
Query: 928 LVEGPLNNLKQSMIQFQKPKKIYNDRRLFFAGGPG-----VVPNDNLGPVAPSGPNADRR 987
LVEGPLNNL++S++QF KP+K+ N PG D + P + R
Sbjct: 927 LVEGPLNNLRESLMQFSKPRKLVN------TVNPGARFMTTAMYDAREAIIPG--SVYDR 986
Query: 988 SSRGRGS--YFPPHLPNGAQKPG---VHASGYPMP-RVPLPSFHGGPPHPYGIPTRGAVH 1047
SS+GR S YF H G V A P+P + +P PP Y G +
Sbjct: 987 SSQGRPSNMYFQTHDQISMISAGPSHVAAMNIPIPFNLVMPPM---PPPGY----FGQAN 1046
Query: 1048 GP-VGAVPHVPQPGSRGFGAGRGNVGAPIGSQLPNQQGSQQNIGNLGSTFNFPGLESPNS 1107
GP G + G G R + P + LPN Q SQ S
Sbjct: 1047 GPAAGRGTPKTKTGRGGRQKNRFGLPGPSQTTLPNSQASQ----------------DVAS 1096
Query: 1108 QPSVGGPLSQLGFVNNMPVQPPTQTFRDGYSMGGISQD-FLGDDFKSQ 1130
QP G L+Q G+V+ + P+Q + G S +SQD +LGD+FKSQ
Sbjct: 1107 QPFSQGALTQ-GYVS---MSQPSQMSQPGLSQPELSQDSYLGDEFKSQ 1096
BLAST of CmaCh16G010870 vs. ExPASy Swiss-Prot
Match:
Q98TR3 (Putative regulator of nonsense transcripts 1 OS=Takifugu rubripes OX=31033 GN=rent1 PE=3 SV=1)
HSP 1 Score: 1190.6 bits (3079), Expect = 0.0e+00
Identity = 625/1014 (61.64%), Postives = 756/1014 (74.56%), Query Frame = 0
Query: 28 NTQGEDFDYPEFRDPIRSPVAWPTPSDSLADHTDRGGGSDHQSDASPVSAAPGSATKGRT 87
+TQG ++D+ +F +L T G + Q D + G
Sbjct: 27 DTQGSEYDFTDF---------------TLPSQTQTQGHTQSQLD---------NQLNGPD 86
Query: 88 GGGSGNSGGNNQMVDVLAAGMSGLTFEDTGDDDNYEFGKGNFTEHACR-YCGISNPACVV 147
G ++GG + V + ++ L FE+ +D Y + HACR YCGI +PACVV
Sbjct: 87 DG--LHNGGMDDSVAKASQLLAELNFEEDEEDTYY---TKDLPVHACRSYCGIHDPACVV 146
Query: 148 RCNVPSCRKWFCNSRGNTSGSHIVNHLVRAKHKEVCLHKDSPLGETILECYNCGCRNVFL 207
CN + +KWFCN RGNTSGSHIVNHLVRAK KEV LHKD PLGET+LECYNCGCRNVFL
Sbjct: 147 YCN--TSKKWFCNGRGNTSGSHIVNHLVRAKCKEVTLHKDGPLGETVLECYNCGCRNVFL 206
Query: 208 LGFISAKTESVVVLLCREPCLSVNALKDMNWDLSQWCPLIDDRCFLQWLVKIPSEQEQLR 267
LGFI AK +SVVVLLCR+PC S ++LKD+NWD SQW PLI DRCFL WLVKIPSEQEQLR
Sbjct: 207 LGFIPAKADSVVVLLCRQPCASQSSLKDINWDSSQWQPLIQDRCFLSWLVKIPSEQEQLR 266
Query: 268 ARQISAQQINKIEELWKTNPDASLEDLEKPGVDDEPQPVALKYEDAYQYQNVFAPLIKLE 327
ARQI+AQQINK+EELWK NP A+LEDLEKPGVD+EPQ V L+YEDAYQYQN+F PL+KLE
Sbjct: 267 ARQITAQQINKLEELWKDNPCATLEDLEKPGVDEEPQHVLLRYEDAYQYQNIFGPLVKLE 326
Query: 328 ADYDKMMKESQSKDNVTVRWDIGLNKKRIAYFVFPKEDNELRLVPGDELRLRYSGDAAHP 387
ADYDK +KESQ++DN+TVRWD+GLNKKRIAYF PK D+++RL+ GDE+ LRY GD A P
Sbjct: 327 ADYDKKLKESQTQDNITVRWDLGLNKKRIAYFTLPKTDSDMRLMQGDEICLRYKGDLA-P 386
Query: 388 AWHSVGHVIKL--TAQEEVALELRASQGVPVDVVHGFSVDFVWKSTSFDRMQGAMKTFAV 447
W +GHVIK+ + +E+A+ELR S G PV++ H + VDFVWKSTSFDRMQ A+KTFAV
Sbjct: 387 LWKGIGHVIKVPDSYGDEIAIELRTSVGAPVEIPHNYQVDFVWKSTSFDRMQSALKTFAV 446
Query: 448 DETSVSGYIYHHLLGHEVEVQMVRNTLPRRFGAPGLPELNASQVFAVKSVLQKPISLIQG 507
DETSVSGYIYH LLGHEVE ++ LP+RF A GLP+LN SQV+AVK+VLQ+P+SLIQG
Sbjct: 447 DETSVSGYIYHKLLGHEVEDVTIKCQLPKRFTANGLPDLNHSQVYAVKTVLQRPLSLIQG 506
Query: 508 PPGTGKTVTSAAIVYHMAKQGQGQVLVCAPSNVAVDQLAEKISATGLKVVRLCAKSREAV 567
PPGTGKTVTSA IVYH+++QG G VLVCAPSN+AVDQL EKI TGLKVVRLCAKSREA+
Sbjct: 507 PPGTGKTVTSATIVYHLSRQGNGPVLVCAPSNIAVDQLTEKIDKTGLKVVRLCAKSREAI 566
Query: 568 SSPVEHLTLHYQVRHLDTSERSELHKLQQLKDEQGELSSSDEKKYKALKRATEREISQSA 627
SPV L LH Q+ ++D+ EL KLQQLKDE GELSS+DEK+Y+ALKR ERE+ +A
Sbjct: 567 ESPVSFLALHNQISNMDS--MPELQKLQQLKDETGELSSADEKRYRALKRTAERELLMNA 626
Query: 628 DVICCTCVGAGDPRLSNFRFRQVLIDESTQATEPECLIPLVLGAKQACRAVLVGDHCQLG 687
DVI CTCV AGDPRL+ +FR +LIDESTQATEP+C+ P+ LGAKQ +++G+
Sbjct: 627 DVIWCTCVRAGDPRLAKMQFRSILIDESTQATEPKCIGPVELGAKQ----LILGEITASW 686
Query: 688 PVIMCKKAARAGLAQSLFERLVLLGVKPIRLQVQYRMHPSLSEFPSNSFYEGTLQNGVTI 747
+MCKKAA+AGL+QSLFERLV+LG++PIRLQVQYRMHP+LS FPSN FYEG+LQNGVT
Sbjct: 687 SCVMCKKAAKAGLSQSLFERLVVLGIRPIRLQVQYRMHPALSAFPSNIFYEGSLQNGVTA 746
Query: 748 NERQSTGIDFPWPVPNRPMFFYVQMGQEEISASGTSYLNRTEAANVEKIVTTFLRSGVVP 807
+R G DF WP P +PMFFYV GQEEI++SGTSYLNRTEAANVEKI T L++G P
Sbjct: 747 GDRIKKGFDFQWPQPEKPMFFYVTQGQEEIASSGTSYLNRTEAANVEKITTRLLKAGAKP 806
Query: 808 SQIGVITPYEGQRAYIVNYMSRNGALRQQLYKEIEVASVDSFQGREKDYIILSCVRSNEH 867
QIG+ITPYEGQR+Y+V YM +G+L +LY ++E+ASVD+FQGREKD+IILSCVR+NEH
Sbjct: 807 DQIGIITPYEGQRSYLVQYMQFSGSLHTKLY-QVEIASVDAFQGREKDFIILSCVRANEH 866
Query: 868 QGIGFLNDPRRLNVALTRARYGIVILGNPKVLSKQPLWNSLLTHYKEHECLVEGPLNNLK 927
QGIGFLNDPRRLNVALTRA+YG++I+GNPK LSKQPLWN+LL +YKE + LVEGPLNNL+
Sbjct: 867 QGIGFLNDPRRLNVALTRAKYGVIIVGNPKALSKQPLWNNLLNNYKEQKVLVEGPLNNLR 926
Query: 928 QSMIQFQKPKKIYNDRRLFFAGGPGVVPNDNLGPVAPSGPNADRRSSRGRGS--YFPPHL 987
+S++QF KP+K+ N F + L P G DR ++ GR S YF H
Sbjct: 927 ESLMQFSKPRKLVNTINPRFMSTAMYDAREALIP----GSAYDRSNTGGRPSNMYFQTHD 986
Query: 988 P---NGAQKPGVHASGYP------MPRVPLPSFHGGPPHPYGIPTRGAVHGPVG 1028
GA + A P MP +P PS+ G P RGA+ G G
Sbjct: 987 QIGMIGAAASHLAALNIPIPFNLVMPPMPPPSYQGQTNGP--AAGRGAMKGKSG 995
BLAST of CmaCh16G010870 vs. ExPASy Swiss-Prot
Match:
Q9VYS3 (Regulator of nonsense transcripts 1 homolog OS=Drosophila melanogaster OX=7227 GN=Upf1 PE=1 SV=2)
HSP 1 Score: 1115.5 bits (2884), Expect = 0.0e+00
Identity = 574/986 (58.22%), Postives = 708/986 (71.81%), Query Frame = 0
Query: 93 NSGGNNQMVDVLAAGMSGLTFEDTGDDDNYEFGKGNFTEHACRYCGISNPACVVRCNVPS 152
++G ++ + + ++ L FE+ D+ + K HAC+YCGI +PA VV CN +
Sbjct: 60 SAGDSHPRLASITNDLADLQFEEEDDEPGSSYVK-ELPPHACKYCGIHDPATVVMCN--N 119
Query: 153 CRKWFCNSRGNTSGSHIVNHLVRAKHKEVCLHKDSPLGETILECYNCGCRNVFLLGFISA 212
CRKWFCN RG+TSGSHI+NHLVRAKH+EV LH + PLGETILECY+CG RNVF+LGFI A
Sbjct: 120 CRKWFCNGRGSTSGSHIINHLVRAKHREVTLHGEGPLGETILECYSCGVRNVFVLGFIPA 179
Query: 213 KTESVVVLLCREPCLSVNALKDMNWDLSQWCPLIDDRCFLQWLVKIPSEQEQLRARQISA 272
K +SVVVLLCR+PC + N+LKDMNWD QW PLI DRCFL WLVK PSEQ QLRARQISA
Sbjct: 180 KADSVVVLLCRQPCAAQNSLKDMNWDQEQWKPLIADRCFLAWLVKQPSEQGQLRARQISA 239
Query: 273 QQINKIEELWKTNPDASLEDLEKPGVDDEPQPVALKYEDAYQYQNVFAPLIKLEADYDKM 332
QINK+EELWK N +A+ +DLEKPG+D EP V L+YED YQY+ F PL++LEA+YD+
Sbjct: 240 AQINKLEELWKENIEATFQDLEKPGIDSEPAHVLLRYEDGYQYEKTFGPLVRLEAEYDQK 299
Query: 333 MKESQSKDNVTVRWDIGLNKKRIAYFVFPKEDNELRLVPGDELRLRYSGDAAHPAWHSVG 392
+KES +++N+ VRWD+GLNKK IAYF K D++++L+ GDELRL Y G+ +P W +G
Sbjct: 300 LKESATQENIEVRWDVGLNKKTIAYFTLAKTDSDMKLMHGDELRLHYVGELYNP-WSEIG 359
Query: 393 HVIKLTAQ--EEVALELRASQGVPVDVVHGFSVDFVWKSTSFDRMQGAMKTFAVDETSVS 452
HVIK+ ++V LEL++S PV F+VDF+WK TSFDRM A+ FA+D SVS
Sbjct: 360 HVIKVPDNFGDDVGLELKSSTNAPVKCTSNFTVDFIWKCTSFDRMTRALCKFAIDRNSVS 419
Query: 453 GYIYHHLLGH----EVEVQMVRNTLPRRFGAPGLPELNASQVFAVKSVLQKPISLIQGPP 512
+IY LLGH + + R P+ F AP LP+LN SQV+AVK LQ+P+SLIQGPP
Sbjct: 420 NFIYSRLLGHGRADSNDEVLFRGPQPKLFSAPHLPDLNRSQVYAVKHALQRPLSLIQGPP 479
Query: 513 GTGKTVTSAAIVYHMAKQGQGQVLVCAPSNVAVDQLAEKISATGLKVVRLCAKSREAVSS 572
GTGKTVTSA IVY + K G VLVCAPSN AVDQL EKI T LKVVR+CAKSREA+ S
Sbjct: 480 GTGKTVTSATIVYQLVKLHGGTVLVCAPSNTAVDQLTEKIHRTNLKVVRVCAKSREAIDS 539
Query: 573 PVEHLTLHYQVRHLDTSERSELHKLQQLKDEQGELSSSDEKKYKALKRATEREISQSADV 632
PV L LH Q+R+++T+ SEL KLQQLKDE GELSS+DEK+Y+ LKRA E ++ ++ADV
Sbjct: 540 PVSFLALHNQIRNMETN--SELKKLQQLKDETGELSSADEKRYRNLKRAAENQLLEAADV 599
Query: 633 ICCTCVGAGDPRLSNFRFRQVLIDESTQATEPECLIPLVLGAKQACRAVLVGDHCQLGPV 692
ICCTCVGAGD RLS +F +LIDES Q+TEPEC++P+VLGAKQ +LVGDHCQLGPV
Sbjct: 600 ICCTCVGAGDGRLSRVKFTSILIDESMQSTEPECMVPVVLGAKQ---LILVGDHCQLGPV 659
Query: 693 IMCKKAARAGLAQSLFERLVLLGVKPIRLQVQYRMHPSLSEFPSNSFYEGTLQNGVTINE 752
+MCKKAARAGL+QSLFERLV+LG++P RL+VQYRMHP LS+FPSN FYEG+LQNGV +
Sbjct: 660 VMCKKAARAGLSQSLFERLVVLGIRPFRLEVQYRMHPELSQFPSNFFYEGSLQNGVCAED 719
Query: 753 RQSTGIDFPWPVPNRPMFFYVQMGQEEISASGTSYLNRTEAANVEKIVTTFLRSGVVPSQ 812
R+ +DFPWP P RPMFF V GQEEI+ SGTS+LNRTEAANVEKI T FL++G+ P Q
Sbjct: 720 RR-LKLDFPWPQPERPMFFLVTQGQEEIAGSGTSFLNRTEAANVEKITTRFLKAGIKPEQ 779
Query: 813 IGVITPYEGQRAYIVNYMSRNGALRQQLYKEIEVASVDSFQGREKDYIILSCVRSNEHQG 872
IG+ITPYEGQRAY+V YM G+L +LY+EIE+ASVD+FQGREKD II+SCVRSNE QG
Sbjct: 780 IGIITPYEGQRAYLVQYMQYQGSLHSRLYQEIEIASVDAFQGREKDIIIMSCVRSNERQG 839
Query: 873 IGFLNDPRRLNVALTRARYGIVILGNPKVLSKQPLWNSLLTHYKEHECLVEGPLNNLKQS 932
IGFLNDPRRLNVALTRA++GI+I+GNPKVL+KQ LWN LL YK+ + LVEG LNNLK+S
Sbjct: 840 IGFLNDPRRLNVALTRAKFGIIIVGNPKVLAKQQLWNHLLNFYKDRKVLVEGSLNNLKES 899
Query: 933 MIQFQKPKKIYNDRRLFFAGGPGVVPNDNLGPVAPSGPNADRRSSRGRGSYFPPHLPNGA 992
+I FQKPKK+ N + ++ + V G DR G+G NG
Sbjct: 900 LIHFQKPKKLVNSMNIGAHFMSTIIADAK--EVMVPGSIYDRSGGYGQGRQMVGQSMNGG 959
Query: 993 QKPGVHASGYPMPRVPLPSFHGGPPHPYGIPTRGAVHGPVGAVPHVPQPGSRGFGAGRGN 1052
Q G + G P +G P YG P+ ++ GFG G G
Sbjct: 960 QYGG--SGGGP---------YGNSPLGYGTPSSNSM---------------VGFGLGNGG 998
Query: 1053 VGAPIGSQLPNQQGSQQNIGNLGSTF 1073
GA G N G G ++
Sbjct: 1020 NGA---------AGGNNNFGGAGPSW 998
BLAST of CmaCh16G010870 vs. TAIR 10
Match:
AT5G47010.1 (RNA helicase, putative )
HSP 1 Score: 2031.1 bits (5261), Expect = 0.0e+00
Identity = 1041/1284 (81.07%), Postives = 1121/1284 (87.31%), Query Frame = 0
Query: 1 MDSQQNNLFETASQPDTGNDAYTFLEFNTQGE-DFDYPEFRDPIRSPVAWPTPSD--SLA 60
MDSQQ++LF+TASQPDT D YTFLEFNTQG+ +FDY +F SP AWPTPSD S+A
Sbjct: 1 MDSQQSDLFDTASQPDTVADEYTFLEFNTQGDSEFDYQDF----GSPTAWPTPSDSISIA 60
Query: 61 DHTDRGGG---SDHQSDA-SPVSAAPGSATKGRTG-GGSGNSGG--NNQMVDVLAAGMSG 120
D DRG G +DH S+A SP S + G+ + G GG G SGG ++ VD LAAG+
Sbjct: 61 DVADRGEGGAAADHHSEASSPSSLSAGAGNGAKVGRGGVGGSGGVSSSSQVDALAAGVGN 120
Query: 121 LTFEDTGDDDNYEFGKGNFTEHACRYCGISNPACVVRCNVPSCRKWFCNSRGNTSGSHIV 180
L FE+TGDDD +++GK +FTEHAC+YCGISNPACVVRCNV SCRKWFCNSRGNTSGSHIV
Sbjct: 121 LNFEETGDDDGFDYGKNDFTEHACKYCGISNPACVVRCNVASCRKWFCNSRGNTSGSHIV 180
Query: 181 NHLVRAKHKEVCLHKDSPLGETILECYNCGCRNVFLLGFISAKTESVVVLLCREPCLSVN 240
NHLVRAKHKEVCLH+DSPLGETILECYNCGCRNVFLLGFISAKT+SVVVLLCR+PCL+VN
Sbjct: 181 NHLVRAKHKEVCLHRDSPLGETILECYNCGCRNVFLLGFISAKTDSVVVLLCRDPCLNVN 240
Query: 241 ALKDMNWDLSQWCPLIDDRCFLQWLVKIPSEQEQLRARQISAQQINKIEELWKTNPDASL 300
ALKDMNWDLSQWCPLIDDRCFL WLVK+PSEQEQLRARQISAQQINKIEELWKTNPDA+L
Sbjct: 241 ALKDMNWDLSQWCPLIDDRCFLPWLVKVPSEQEQLRARQISAQQINKIEELWKTNPDATL 300
Query: 301 EDLEKPGVDDEPQPVALKYEDAYQYQNVFAPLIKLEADYDKMMKESQSKDNVTVRWDIGL 360
EDLEKPGVDDEPQPV KYEDAYQYQNVFAPLIKLEADYDKMMKESQSK+N+TVRWDIGL
Sbjct: 301 EDLEKPGVDDEPQPVQPKYEDAYQYQNVFAPLIKLEADYDKMMKESQSKENLTVRWDIGL 360
Query: 361 NKKRIAYFVFPKEDNELRLVPGDELRLRYSGDAAHPAWHSVGHVIKLTAQEEVALELRAS 420
NKKR+AYFVFPKE+NELRLVPGDELRLRYSGDA HP+W SVGHVIKLTAQEEVALELRA+
Sbjct: 361 NKKRVAYFVFPKEENELRLVPGDELRLRYSGDAVHPSWQSVGHVIKLTAQEEVALELRAN 420
Query: 421 QGVPVDVVHGFSVDFVWKSTSFDRMQGAMKTFAVDETSVSGYIYHHLLGHEVEVQMVRNT 480
QGVP+DV HGFSVDFVWKSTSFDRMQGAMK FAVDETSVSGYIYH LLGHEVE QMVRNT
Sbjct: 421 QGVPIDVNHGFSVDFVWKSTSFDRMQGAMKNFAVDETSVSGYIYHQLLGHEVEAQMVRNT 480
Query: 481 LPRRFGAPGLPELNASQVFAVKSVLQKPISLIQGPPGTGKTVTSAAIVYHMAKQGQGQVL 540
LPRRFG PGLPELNASQV AVKSVLQKPISLIQGPPGTGKTVTSAAIVYHMAKQGQGQVL
Sbjct: 481 LPRRFGVPGLPELNASQVNAVKSVLQKPISLIQGPPGTGKTVTSAAIVYHMAKQGQGQVL 540
Query: 541 VCAPSNVAVDQLAEKISATGLKVVRLCAKSREAVSSPVEHLTLHYQVRHLDTSERSELHK 600
VCAPSNVAVDQLAEKISATGLKVVRLCAKSREAVSSPVE+LTLHYQVRHLDTSE+SELHK
Sbjct: 541 VCAPSNVAVDQLAEKISATGLKVVRLCAKSREAVSSPVEYLTLHYQVRHLDTSEKSELHK 600
Query: 601 LQQLKDEQGELSSSDEKKYKALKRATEREISQSADVICCTCVGAGDPRLSNFRFRQVLID 660
LQQLKDEQGELSSSDEKKYK LKRATEREI+QSADVICCTCVGA D RLSNFRFRQVLID
Sbjct: 601 LQQLKDEQGELSSSDEKKYKNLKRATEREITQSADVICCTCVGAADLRLSNFRFRQVLID 660
Query: 661 ESTQATEPECLIPLVLGAKQACRAVLVGDHCQLGPVIMCKKAARAGLAQSLFERLVLLGV 720
ESTQATEPECLIPLVLG KQ VLVGDHCQLGPVIMCKKAARAGLAQSLFERLV LG+
Sbjct: 661 ESTQATEPECLIPLVLGVKQ---VVLVGDHCQLGPVIMCKKAARAGLAQSLFERLVTLGI 720
Query: 721 KPIRLQVQYRMHPSLSEFPSNSFYEGTLQNGVTINERQSTGIDFPWPVPNRPMFFYVQMG 780
KPIRLQVQYRMHP+LSEFPSNSFYEGTLQNGVTI ERQ+TGIDFPWPVPNRPMFFYVQ+G
Sbjct: 721 KPIRLQVQYRMHPALSEFPSNSFYEGTLQNGVTIIERQTTGIDFPWPVPNRPMFFYVQLG 780
Query: 781 QEEISASGTSYLNRTEAANVEKIVTTFLRSGVVPSQIGVITPYEGQRAYIVNYMSRNGAL 840
QEEISASGTSYLNRTEAANVEK+VT FL+SGVVPSQIGVITPYEGQRAYIVNYM+RNG+L
Sbjct: 781 QEEISASGTSYLNRTEAANVEKLVTAFLKSGVVPSQIGVITPYEGQRAYIVNYMARNGSL 840
Query: 841 RQQLYKEIEVASVDSFQGREKDYIILSCVRSNEHQGIGFLNDPRRLNVALTRARYGIVIL 900
RQQLYKEIEVASVDSFQGREKDYIILSCVRSNEHQGIGFLNDPRRLNVALTRARYGIVIL
Sbjct: 841 RQQLYKEIEVASVDSFQGREKDYIILSCVRSNEHQGIGFLNDPRRLNVALTRARYGIVIL 900
Query: 901 GNPKVLSKQPLWNSLLTHYKEHECLVEGPLNNLKQSMIQFQKPKKIYNDRRLFFAGGPGV 960
GNPKVLSKQPLWN LLTHYKEHECLVEGPLNNLKQSM+QFQKP+KIYNDRRLF+ GG G+
Sbjct: 901 GNPKVLSKQPLWNGLLTHYKEHECLVEGPLNNLKQSMVQFQKPRKIYNDRRLFYGGGAGM 960
Query: 961 VPNDNLGPVAPSGPNADRRSSRGR--GSYFPPHLPNGAQKPGVHASGYPMPRVPLPSFHG 1020
+ NDN G PNADRR SRGR GSY P PNGA +PG+H +GYP+PRVPL F G
Sbjct: 961 IGNDNFG---SGNPNADRRGSRGRAGGSYLPSGPPNGA-RPGLHPAGYPIPRVPLSPFPG 1020
Query: 1021 GPP-HPYGIPTRGAVHGPVGAVPHVPQPGSRGFGAGRGNVGAPIGSQLPNQQGSQQNIGN 1080
GPP PY IPTR GPVGAVPH PQPG+ GFGAGR G +G LP+QQ +Q N+G
Sbjct: 1021 GPPSQPYAIPTR----GPVGAVPHAPQPGNHGFGAGR---GTSVGGHLPHQQATQHNVGT 1080
Query: 1081 LGSTFNFPGLESPNSQPSVGGPLSQLGFVNNMPVQPPTQTFRDGYSMGGISQDFLGDDFK 1140
+G + NFP L+SPNSQPS GGPLSQ G+ +Q FRDG+SMGGISQDFL DD K
Sbjct: 1081 IGPSLNFP-LDSPNSQPSPGGPLSQPGY--------GSQAFRDGFSMGGISQDFLADDIK 1140
Query: 1141 SQGSHVPYNVTDFSTQASQTGYPIDYVSQGAQGGFPGSFLNQNSQSGYNRFGTGNDFMSQ 1200
SQGSH PYN+ DF+TQAS G+ +DY +QGA G FPG+F+NQNSQ GY+RF NDFMSQ
Sbjct: 1141 SQGSHDPYNMADFATQASPGGFAVDYATQGAHGAFPGNFMNQNSQGGYSRFSGINDFMSQ 1200
Query: 1201 DYMNHGSQGLFTQVGFSDPSLDEAFQSHYNVTNTNSLQSQGMMNSLYSQPFAHYNSQPST 1260
+YM HG QGLFTQ GF D S D+ Q+ Y V N N LQSQG+ NSLYSQPFAHYN+QP
Sbjct: 1201 EYMAHGGQGLFTQAGFIDSSQDDGQQNPYGVNNPN-LQSQGLPNSLYSQPFAHYNTQPLN 1254
Query: 1261 QQAPPQQQPQQGQSSQNQKIHFSG 1272
P Q QP QSSQN K ++G
Sbjct: 1261 LSGPQQSQP--NQSSQNPKHPYNG 1254
BLAST of CmaCh16G010870 vs. TAIR 10
Match:
AT2G03270.1 (DNA-binding protein, putative )
HSP 1 Score: 197.6 bits (501), Expect = 6.1e-50
Identity = 152/454 (33.48%), Postives = 231/454 (50.88%), Query Frame = 0
Query: 483 LNASQVFAV-KSVLQKPISLIQGPPGTGKTVTSAAIVYHMAKQGQGQVLVCAPSNVAVDQ 542
L+ SQ A+ K++ K + L+ GPPGTGKT T IV K+G ++L CA SN+AVD
Sbjct: 190 LDQSQKDAITKALSSKDVFLLHGPPGTGKTTTVVEIVLQEVKRG-SKILACAASNIAVDN 249
Query: 543 LAEKISATGLKVVRLCAKSR---EAVSSPVEHLTLHYQVRHLDTSERSELH----KLQQL 602
+ E++ +K+VR+ +R + + S ++ L L R E+ KL +
Sbjct: 250 IVERLVPHKVKLVRVGHPARLLPQVLDSALDAQVLKGDNSGLANDIRKEMKALNGKLLKA 309
Query: 603 KDE------QGELSSSDEKKYKALKRATEREISQSADVICCTCVGAGDPRLSNFRFRQVL 662
KD+ Q EL + +++ K + A ++ ++ADVI T GA +L N F V+
Sbjct: 310 KDKNTRRLIQKELRTLGKEERKRQQLAVS-DVIKNADVILTTLTGALTRKLDNRTFDLVI 369
Query: 663 IDESTQATEPECLIPLVLGAKQACRAVLVGDHCQLGPVIMCKKAARAGLAQSLFERLVLL 722
IDE QA E C I L+ G+ R +L GDH QL P I +A R GL ++LFERL L
Sbjct: 370 IDEGAQALEVACWIALLKGS----RCILAGDHLQLPPTIQSAEAERKGLGRTLFERLADL 429
Query: 723 GVKPIR--LQVQYRMHPSLSEFPSNSFYEGTLQNGVTINERQSTGIDFPWPVPNRPMFFY 782
I+ L VQYRMH + + S Y+ N +T + ++ + F +
Sbjct: 430 YGDEIKSMLTVQYRMHELIMNWSSKELYD----NKITAHSSVASHMLFDLENVTKSSSTE 489
Query: 783 VQM--------GQEEISASGTSYLNRTEAANVEKIVTTFLRSGVVPSQIGVITPYEGQRA 842
+ EE S N EA + SGV PS IG+ITPY Q
Sbjct: 490 ATLLLVDTAGCDMEEKKDEEESTYNEGEAEVAMAHAKRLMESGVQPSDIGIITPYAAQVM 549
Query: 843 YIVNYMSRNGALRQQLYKEIEVASVDSFQGREKDYIILSCVRSNEHQGIGFLNDPRRLNV 902
+ R +++ K++E+++VD FQGREK+ II+S VRSN + +GFL D RR+NV
Sbjct: 550 LL-----RILRGKEEKLKDMEISTVDGFQGREKEAIIISMVRSNSKKEVGFLKDQRRMNV 609
Query: 903 ALTRARYGIVILGNPKVLSKQPLWNSLLTHYKEH 913
A+TR+R I+ + + +S ++ +++EH
Sbjct: 610 AVTRSRRQCCIVCDTETVSSDAFLKRMIEYFEEH 628
BLAST of CmaCh16G010870 vs. TAIR 10
Match:
AT5G35970.1 (P-loop containing nucleoside triphosphate hydrolases superfamily protein )
HSP 1 Score: 196.4 bits (498), Expect = 1.4e-49
Identity = 150/448 (33.48%), Postives = 228/448 (50.89%), Query Frame = 0
Query: 496 QKPISLIQGPPGTGKTVTSAAIVYHMAKQGQGQVLVCAPSNVAVDQLAEKISATGLKVVR 555
++P+ ++QGPPGTGKT ++ +QG+ +VLV AP+N AVD + EK+ GL +VR
Sbjct: 502 KRPVMIVQGPPGTGKTGMLKEVITLAVQQGE-RVLVTAPTNAAVDNMVEKLLHLGLNIVR 561
Query: 556 LCAKSREAVSSPVEHLTLHYQVRHLDTSERSELHK---------LQQLKDEQ-----GEL 615
+ +R +SS V +L V S R+EL + Q L+D+ +L
Sbjct: 562 VGNPAR--ISSAVASKSLGEIVNSKLASFRAELERKKSDLRKDLRQCLRDDVLAAGIRQL 621
Query: 616 SSSDEKKYKALKRATEREISQSADVICCTCVGAGDPRLSNFR-FRQVLIDESTQATEPEC 675
K K ++ T +EI +A V+ T +GA DP + F V+IDE+ Q+ EP C
Sbjct: 622 LKQLGKTLKKKEKETVKEILSNAQVVFATNIGAADPLIRRLETFDLVVIDEAGQSIEPSC 681
Query: 676 LIPLVLGAKQACRAVLVGDHCQLGPVIMCKKAARAGLAQSLFERLVLL--GVKPIRLQVQ 735
IP++ G R +L GD CQL PV++ +KA GL SL ER L GV +L Q
Sbjct: 682 WIPILQGK----RCILSGDPCQLAPVVLSRKALEGGLGVSLLERAASLHDGVLATKLTTQ 741
Query: 736 YRMHPSLSEFPSNSFYEGTLQNGVTINER---QSTGIDFPW-------PVPNRPMFFYVQ 795
YRM+ ++ + S Y G L++ ++ S + W + R + +
Sbjct: 742 YRMNDVIAGWASKEMYGGWLKSAPSVASHLLIDSPFVKATWITQCPLVLLDTRMPYGSLS 801
Query: 796 MG-QEEISASGT-SYLNRTEAANVEKIVTTFLRSGVVPSQIGVITPYEGQ----RAYIVN 855
+G +E + +GT S N EA V V + + +GV P I V +PY Q R + +
Sbjct: 802 VGCEERLDPAGTGSLYNEGEADIVVNHVISLIYAGVSPMAIAVQSPYVAQVQLLRERLDD 861
Query: 856 YMSRNGALRQQLYKEIEVASVDSFQGREKDYIILSCVRSNEHQGIGFLNDPRRLNVALTR 911
+ +G +EVA++DSFQGRE D +I+S VRSN +GFL D RR+NVA+TR
Sbjct: 862 FPVADG---------VEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDSRRMNVAITR 921
BLAST of CmaCh16G010870 vs. TAIR 10
Match:
AT1G05460.1 (P-loop containing nucleoside triphosphate hydrolases superfamily protein )
HSP 1 Score: 181.4 bits (459), Expect = 4.5e-45
Identity = 152/452 (33.63%), Postives = 215/452 (47.57%), Query Frame = 0
Query: 478 PGLPELNASQVFAVKSVL---QKPISLIQGPPGTGKTVTSA-AIVYHMAKQGQGQVLVCA 537
P P LNA Q+ +++ VL P +I GPPGTGKT+T AIV Q +VLVCA
Sbjct: 392 PISPALNAEQICSIEMVLGCKGAPPYVIHGPPGTGKTMTLVEAIVQLYTTQRNARVLVCA 451
Query: 538 PSNVAVDQLAEKISATGLKVVRLCAKSREAVSSPVEHLTLHYQVRHLDTSERSELHKLQQ 597
PSN A D + EK+ LC + + ++ L+ + RS ++
Sbjct: 452 PSNSAADHILEKL---------LCLEGVRIKDN---------EIFRLNAATRS----YEE 511
Query: 598 LKDEQGELSSSDEKKYKA--LKRATEREISQSADVICCTCVGAGDPRLSNFRFRQVLIDE 657
+K E DE +K LK T ++ S + G ++ F +L+DE
Sbjct: 512 IKPEIIRFCFFDELIFKCPPLKALTRYKLVVSTYMSASLLNAEG---VNRGHFTHILLDE 571
Query: 658 STQATEPECLIPLVLGAKQACRAVLVGDHCQLGPVIMCKKAARAGLAQSLFERLVLLGV- 717
+ QA+EPE +I + VL GD QLGPVI + A GL +S ERL
Sbjct: 572 AGQASEPENMIAVSNLCLTETVVVLAGDPRQLGPVIYSRDAESLGLGKSYLERLFECDYY 631
Query: 718 ------KPIRLQVQYRMHPSLSEFPSNSFYEGTLQNGVTINERQSTGIDFPWPVPNR--P 777
+L YR HP + + PS FY+G L + ++F +PN+ P
Sbjct: 632 CEGDENYVTKLVKNYRCHPEILDLPSKLFYDGELVASKEDTDSVLASLNF---LPNKEFP 691
Query: 778 MFFYVQMGQEEISASGTSYLNRTEAANVEKIVTTFLRSGVVPSQ-IGVITPYEGQRAYIV 837
M FY G +E + S+ NR E + V + + + V + IGVITPY Q I
Sbjct: 692 MVFYGIQGCDEREGNNPSWFNRIEISKVIETIKRLTANDCVQEEDIGVITPYRQQVMKIK 751
Query: 838 NYMSRNGALRQQLYKEIEVASVDSFQGREKDYIILSCVRS----NEHQG---IGFLNDPR 897
+ R E++V SV+ FQG+EK II+S VRS NE +GFL++PR
Sbjct: 752 EVLDRLD------MTEVKVGSVEQFQGQEKQVIIISTVRSTIKHNEFDRAYCLGFLSNPR 809
Query: 898 RLNVALTRARYGIVILGNPKVLSKQPLWNSLL 907
R NVA+TRA +VI+GNP ++ K WN LL
Sbjct: 812 RFNVAITRAISLLVIIGNPHIICKDMNWNKLL 809
BLAST of CmaCh16G010870 vs. TAIR 10
Match:
AT2G19120.1 (P-loop containing nucleoside triphosphate hydrolases superfamily protein )
HSP 1 Score: 179.1 bits (453), Expect = 2.2e-44
Identity = 120/336 (35.71%), Postives = 180/336 (53.57%), Query Frame = 0
Query: 588 LHKLQQLKDEQGELSSSDEKKYKALKRATEREISQSADVICCTCVGAGDPRLSNFR--FR 647
L ++ +L +G+ + + + + + E + A+++ T +G S F
Sbjct: 717 LVEISRLLIVEGKFRAGNNFNLEEARASLEASFANEAEIVFTTVSSSGRKLFSRLTHGFD 776
Query: 648 QVLIDESTQATEPECLIPLVLGAKQACRAVLVGDHCQLGPVIMCKKAARAGLAQSLFERL 707
V+IDE+ QA+E L PL LG A R VLVGD QL ++ K A ++SLFER
Sbjct: 777 MVVIDEAAQASEVGVLPPLALG---AARCVLVGDPQQLPATVISKAAGTLLYSRSLFERF 836
Query: 708 VLLGVKPIRLQVQYRMHPSLSEFPSNSFYEGTLQNGVTINERQSTGIDFPWPVPNRPMFF 767
L G + L VQYRMHP + +FPS FY+G L++ +I+ I + PV +FF
Sbjct: 837 QLAGCPTLLLTVQYRMHPQIRDFPSRYFYQGRLKDSESISSAPDE-IYYKDPVLRPYLFF 896
Query: 768 YVQMGQEEISASGTSYLNRTEA---ANVEKIVTTFLRS-GVVPSQIGVITPYEGQRAYIV 827
+ G+E SY N EA V + L+S G +GVITPY+ Q +
Sbjct: 897 NISHGRESHRGGSVSYENVDEARFCVGVYMHLQKTLKSLGAGKVSVGVITPYKLQLKCLK 956
Query: 828 NYMSRNGALRQQLYKEIEVASVDSFQGREKDYIILSCVRSNEHQGIGFLNDPRRLNVALT 887
+ AL Q KEI + +VD+FQG+E+D II+SCVR++ H G+GF++D RR+NVALT
Sbjct: 957 HEF--GNALGQDELKEIYINTVDAFQGQERDVIIMSCVRASGH-GVGFVSDIRRMNVALT 1016
Query: 888 RARYGIVILGNPKVLSKQPLWNSLLTHYKEHECLVE 918
RAR + ++GN L K W +L++ + C +E
Sbjct: 1017 RARRALWVMGNASALMKSEDWAALISDARGRNCFME 1045
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q9FJR0 | 0.0e+00 | 81.07 | Regulator of nonsense transcripts 1 homolog OS=Arabidopsis thaliana OX=3702 GN=U... | [more] |
Q92900 | 0.0e+00 | 59.88 | Regulator of nonsense transcripts 1 OS=Homo sapiens OX=9606 GN=UPF1 PE=1 SV=2 | [more] |
Q9EPU0 | 0.0e+00 | 59.75 | Regulator of nonsense transcripts 1 OS=Mus musculus OX=10090 GN=Upf1 PE=1 SV=2 | [more] |
Q98TR3 | 0.0e+00 | 61.64 | Putative regulator of nonsense transcripts 1 OS=Takifugu rubripes OX=31033 GN=re... | [more] |
Q9VYS3 | 0.0e+00 | 58.22 | Regulator of nonsense transcripts 1 homolog OS=Drosophila melanogaster OX=7227 G... | [more] |
Match Name | E-value | Identity | Description | |
AT5G47010.1 | 0.0e+00 | 81.07 | RNA helicase, putative | [more] |
AT2G03270.1 | 6.1e-50 | 33.48 | DNA-binding protein, putative | [more] |
AT5G35970.1 | 1.4e-49 | 33.48 | P-loop containing nucleoside triphosphate hydrolases superfamily protein | [more] |
AT1G05460.1 | 4.5e-45 | 33.63 | P-loop containing nucleoside triphosphate hydrolases superfamily protein | [more] |
AT2G19120.1 | 2.2e-44 | 35.71 | P-loop containing nucleoside triphosphate hydrolases superfamily protein | [more] |