Tan0011780 (gene) Snake gourd v1

Overview
NameTan0011780
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionZf-CCHC domain-containing protein/UBN2 domain-containing protein
LocationLG11: 44172064 .. 44205527 (+)
RNA-Seq ExpressionTan0011780
SyntenyTan0011780
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTAACTTTCTCTTCTCCCTTCCAGACTCCCTCTTTTATTACGCAGATTCCCTCGTTTTTCGACTCTCTCTCTCATTCTTCTCCTTTGTTTCTCTGATTTTCTTCCTCAAATTCCATCGTTCCTCTAATTTCTTCCTCAAACTCCCTCGTTTCTCTATTTTCTTCGATTTGGTTCCTTAGAGACCCTCGTTTTCGATGGAGATCGAGTTTGATCTAGGGTCAGAAGAGCGATTTTCGGGTTCGATTTTGGGGTCGGAGGAGTGATTGGTTCAATTTCTCGTTTTCTACCCAGATTTGTAAGTCTGCCTCTCCATCCTTCATGAAATCATTCGATTCATTATTTTTTTACATACCACAATCATTGTTCTTGCAAGATCCATAACCTCTTCTCCTTCTCTGTTGAATCGACCATTTTTTTCAATTTCTTCCCACAGTCGTAAATCCTAACAATGGATGTCGCATGTCGATTTCCTTGCTCTCTCCATCTCACTCCCCCTGCCGTCTATTCCCTAAGACTCGTCCCTTGTCAAAGTCCTTGCTTTTCCTCTATATGACAATTCCCTCATCCACTACTAACCTATCAATAGATGGGTTAGTTCGCCTGAACTCATGTTCTATCTGAGGGATGAGTGAGGAATTTCATGGTAGGTGAGTTGATTTGGTATGTAGAGCTTGATGGAGGGTTGAACGACAGGTAAGTTTTCTACCTTTCCAGCCGATGATGAGAATTTATTTTTTTACATAACATTTTTCTTCTATGATTATGCATACTGATTTGTGGTTAAAATTTTTTCCTTCTATGATTATGCATATTGATATGTTGTTAAGATTTTTGTTCTTCTATGAGTATGCATACTGATTTGCAGTTGATTTTTTTTTTAGGTTTAGCAAGTTGTCCATGATTTTTGCGTGCTAATTTGTTATTGGACTTTTAGCATTTTCCTTCATCATATTGTTTTTATTTATCATACTGATTTGTTCAATTTTAGTGCATACTGATATGTTCTAGTTAATATTTGTCCATGTCTATGTGTTGTATTACATTGATGAAATTTATTTTACTATGAGTCAAAGTTTTCCTGAGGCGGATTTTTTAAGTTTTACATGCTTTTTTCTACAATCTTCTCAGACATTTATATTTTATATTGAATGAATTTCATCTACCTCATTTATTATAGATTTTTGGATTTTTAAATACCACCTACCTTCTCTTGGATACTTGCAATAATTTATTACCATGAATGAACTTCATTATTTCATTAGAAGTAGGTGGGAATACCCCACTTCCTCCTTCTATATATTGTACATGACCTACATGTTTTCCATTTGAATATTGTACATGACCTACATGTTTTCCATTTGAGAATATTCCACTGTCTTCCATATTTTTCTAATCCATTAAGTACCTCCTAAACGTGATTTCCTTTCAAAGTCATTTTATCCGCTTCAAGTACAATGTTGGACGATAACAACATGTTGCCTTATTCACGACCACATCACTCGAGAGATAGGGTCGACTACAATGTTGGATGAGGTCGAGGAAGCTGACTTTGCATCTAGTGTTGTTCGTGGGAACAACATACAATTTATTGAAACCTCGAACGAATGGATCGTATTTAGAGATAGCTTGGCTAAGCGAATGTTCACGCATGGGAATAGTCGTGATGAATTTTCTACTTCTGCATGTATTTTTTGTTTTCAATCGAACACCAACTCGGGTATAATTTATGTATGTGTGCTTGTGATGTAGCGGATGAAAGGAATCGAATCCCAAAGCGGAAGCGATTGTGAATCGCGCGTTCCTAATTTCGTTACATCGATTACATTCAAGAACTAAAACAACTAGATTAAGAATAAAAACAGAATATGGACAGCAAGAAATACACCTTTGAAGAACTTTTCTTCAAGTGCTCCTTTCTCCCACGATCACGATCTCAGCAACTGGAACACGAACAGATGAAGACACGACCACTGGATCTTCTCCGTTGTTCTCTAGATGAGGGAAGAATGTGGGTTCCCTTTAGGTGAGATGAGATGTTGTGAGAGAGAGGTTTGTGTTTTGGAACTTTTCCAACAACATAACAAATGCCTCAAATTACTTGACCTAGTTGCATCTCATGCAACCATATATATAGGGAAATAGGATGGAAATAACCATCCTACCCTTCCTTTATTCTCTCATTAGTATATGGTTTATATGTGAATCTAATTCACATTAAAACTATATATTATTATCAATTATATCATATATAATTGATATAACACCTCAGGTATGAAATTGAACACTTCAATTTCATTCCTAACTTTATACCCTCAATTTTACCATGTTGAGCCAACGGAGGGACCTAATGGACCTACAGTTCATGAGCTCCAATGATGTGAAATTAACTGACCAAACTCTTTGATCCAGTTAATCAATATTCATTAACTACAAGTCACTACACTATAAACTTGTAGCTGCACTCTCCTCACTGTAGATTATATTGTGTCCACTGATATAGTCATTACCCATGAGTCGACCCTTCACAGGTTGTTCGTGATCACAGCTGGGTCAAATTACCATTTTCTCCCTGTGATTACCTCTTTGTTCCTTAAGCTCCACTGATCCTCTAATGGACAACAAGTCTAAGGTCCTACCATAGACTCATACCCTCATAGAGGACGAGGCCCGTTGTTCAAGACCTGGAGTCAGTTCTTAAGGGAACAACCTATCTAATTCCTTAGTGTAGGAATGAGTGAATTCCATCTTGTACAGTTAAGTTCCCAGCTACCTACGAGGTTTTGTCCCTGAGAAGGTAGGCTTGTTGAGTCCATGCACCACTCTCACCCATACTAGTCTAAGGACAAACCCCTATAAGCAGGAGTTCTTAACTCGCTCGAGATTCAGTCGAGTCACTAATGACCATCTATAAATTACGAGTCTTATCTCATTAGCGATTGTTACTGCTGGTTAGACTAATAATTTGTGGTCAGGTCTTGTATGAACTCTTCATGTGATGCCCCCACTCGCATGTCAACTACATGAATGAGTTGGATTACCTCATTTTGTACTAATTACAAAGTGGGTCGCATCCATGGTGTTGTTATCGGAATGAGGTTTCCAACCTTATTCTTATACCATAGACCATTTAGGTTATATTCTTAAACACAATCCAAATAGTGTCCACTACACTGATGTTCAAGATATATATATTATCAAGATATATATTATAACCTTGAAGCTTAGTTTATTGGATTTATAAAGCAAGACTGTAATTTTTTCAAAACAAATAACTTTTTATTTCTTTGATCAAAATTACACAAGTTGTAGGGCATACATCCCAACAATCTCCCACATGACCCAAAGCTTGTGGAGTACTATTACAGTGATGTAAATGATGGAGATCGGATCTCCTTGAGCAGCGGAAACGAATTTGGAATCGTAATATGCAAGATTGTTTTGTTTAAAAAACAAAAATCCATACATTGACAAAAACATACAGAACAAGCATGCTTTTAATCTAATTGAAACATGGAGATATATATGCAAACGTGTTTAGTTTAACAATTAAACTAAAAAGAATAAAAATGATATTATTACCTTTGTAGACTTTCCTCAGCAACTTGTTGCTCTAAAGGTTGTACAACAAATCAACTTCTCTGTTTTCCAGCAGGACACCACCAACAGAGTATACCTTCTATATCTCTGAGTGCTTAGATCTTCCTTGGTGGGAAGGATCCAACAATTGGAAGAATTGGGAATTAGAGAGTGAGAGAGAGAGTTTTTTCTTCACAGGAATTTTCCAATTGCAAAAACAGTAAGCACTCTTCAACATCACCCCTCTTCTCCCTGTTAAACTGTCGATAGGGGGAAGAAGGAGGGTAGTGGGAACGTGTCTGGATGACACGTTTCCTTATTTTATTATTAAAATAATAATTATTAAATCAAATTTAATTAATAATATAAATCATATTTAAATAATACAAATTAAATATATGATATTTAATTTGTATTAAATCTCATTTAATCATATTACCCTTTTAACCTATAGTTTTAATGTACATCCAGTGCACATTAAATTTTAATCAATAGTTTTCAAATGCGAATCACATTCACATTTAATATAATATTTGAACTCTTTCAAATATTCAATTCTCTCCTATAATTTAATATGAATCATATTCACATTAAATTTATAATATATAGTTCCAAAACTATATATTATATCGTATCCATATACATTAAATATATTCCCTAATGAATTTGAACATTTCAAATTCAAATGATCTAAGAACCCTTTACGAGCTAAAAGGTGGACCTAATGGACCTCTGGATCGTAAGCTCCAACGATACGAGATTGCTTTTGTTAATCTCATTAACCTCCCAATCAACATTCGTTAAGCTGCGGAACACTCCACTAAAGTCCCACAGCTGCACTCTTCTCACTGCAGATATATTTCTGTGTCCATGGATATTGACCAATAACAGCAAGTCAATCCTTCACGAATGTTCGTAACACCTGCTGGGTCAAATTACTTTTTTACCCCTGGGTTACATCTTGTACCTTAAGTACTAGTGCTCCTCTAATGAACAATTTGTTTGTGGTCCTACCAACAAACAGAGTCCCTCTCGGGCCAATGAGAGGGTCCGACCCTTTGTTCAAGTCCTGGAGACACCACTTGAGGGAACATTCCTCTACATACCCTAGTAGGTGGGAAGGAGTGAATTCCGTCTTGCTAAGTTAAGTTCCAGCCGCTCACTCGGTCTTGTCCCCAAGAAGGTAGGCATATTGAGTCGGTGAATCTGGCCACTCTCACCCATACTAGTCAAAGGACAATTCCTCTAATGAACAATTTGTTTGTGGTCCTACCAACAAACAGAGTCCCTCTCGGGCCAATGAGAGGATTGGGCCTTTTGTTCAAGTCCCGGAGACACCACTTAAGGGAACATTCCTCTACTTACCCTAGTAGGCGGGAAGGAGTGAATTCCATCTTGCTAAGTTAAGTTCCCAGCCGCTCACTCGGTCTTGTCCCCAAGAAAGTAGGCATATTGAGTCGATTTAATCAACCGCCACTCTCACCCATACTAGTCAAAGGACAATCCTCGCAATGACAGGAGTTAAGTAACTTCTTTCGAGATTGAGATCAAGTTGCCTAGGTCATCCTAGTGAAATAGAAACTTAACTAGTCAACGGAGTTACAACTAGAGATTACTATTTCATGGTCCGGTCTTATGTAATCTCATTACATAGGATATCCCCACTCACATGTCATCTACATGAACACGTTAGGATCACAGTGTTTGTATCAAATACAAAGTGGGCCACATCCATAGTTTTACCAGGGTAAGGTACCCAAACCTTATCCCCTAACTATAGACCCTTTAGGTTGTATCTCAAACTGAGATCCTTTATATGTACACTACATTCAGTTAAAGATTCATTTAACAACCTTGGATGTTTAGTTTATTGGATTTAGGGTTTGATTGAAGGCAAACTCGTAGATAACTAACAATAACACTTTATTGAAATAATAATACTTTATTACATTAATTAGAACATAATTACAAACTACGAGTTTTAGGGCACAAATCCCAACAGTAAATACAATAAACTAGGGCATTTAAACCCTTTGTACAAAATACTCCCACTTGCCCTAGTTGGGGAAGACATGTAAACATAGACCTTCTAGGTGACTCTCAAATACTTTAGCCGTGAGAGGCTTTGTAAAAGGATCAGCAACGTTGTGCTCTGAGGCTATCTTCGTGACAATAACGTCTCCTCTATGCACGATCTCCCTAATGAGATGATACTTCCGCTCAATGTGCTTTCCCCTCTTGTAACTTCGGGATTCCCTCGAATTTGCCACTGCACCACTGTTATCGCAATACAGTGTGATGGGCAAAGTCATATTTGGAACAACTTCCAAATTTAGCATGAATTTCTTTAATCAAACGACCTCTTTAGTTGTTTCACAAGCTGCTACATATTCGACCTCCATGGTGGAATCTGCGATGCATCCTTGCTTGATGTTTCGCCATACTACAACTCCTCCGTTAAGAGTACATACTGACCCTGATGTAGATTTTCGAGAATCTTTATTAGTCTGAAAGTCAGAATTCGTGTATCCTGTAAGGATCAAATGCTTAGCCCCATACACAAGCATGTAGTTCCTTGTTCTCCGAAAATACTTAAGGATTGTTTTAACCGTTGTCCAGTGTTCAAGTCCTGGATTGGATTGATACCTACTGACCATTCCAACTGCAAAATAGATGTCAAGCCTAGTACACAACATGGCGTACATCAGACTCCCAACAGCTGATGCATAAGGAATCCGTCTCATATCCTCAACTCCTAAAGGTGTCTTAGGACACTGTTCCTTAGACAAATGGACTCTATGTCTAAAAGTCAACAAGCCCTTTTTGGAGTCTTGCATCTTAAACCTTGAAAACACTTTGTCGTTGTAAGACGTCTGAGACAGGGCTAGTGTTTTATCCCAAGAACAAACTGTGGATCACCCAAATCTTTCATTTGGAATTGCGTAGCTAGCCAATCCTTAATGTCAGTAAGATAACCTACATCATTCCCAATGAGTAGGATATCATCCACATAGAGAATTAGGAATGCGACAGAACTATTGATGATTTTCTTGTAGACACAAGGTTCATCATCATTCTGGTCAAAACCATAAGATTTGATTGCCTCATTAAATCTTATATTCCAGGACCTAGAGGCTTGTTTCAGTCCATAAATTGACCTTTGAAGCCTGCAAACCTTTTGCTCTTGTCCTGATTCAATGAACCCTTTGGGTTGGTTTATGTAGATGTTTTCCTCAAGATTGCCATTCAAAAAGGTTGTCTTGACACCCATTTGCCATACCTCATAGTCATAATATGCGGCAATGGCAAGTAGGATACGGATAGACTTTACCATAGCAACAGGTGAAAAGGTTTTCTCATAGTCAACCCATTCAACCTGGGTATAACCCTTTGCTACTAGTTTCGCTTTGAAAGTTTACACCTTCCCATCTACACCACGTTTTCTCTTGTAGATCCACTTACAACCTATAGGTTTTACCCCATCAGGCTTATCTACAAGATCTCAAACAGAATTGAAGTACATAGACTCCATTTCTTGGTCCATGGCTATGATCCATTTGTCTTTGTCTTTGTCAACCATTGCTTGATTATATAAGTCAATGGATCCTCGCAGTTGTCATCTGGTATAACAACTTGGGTTTCAGATAAACCCATGTAACGGTCAGGTTGTGTTATAACTCTCCCATTACGTCGAGGCATACTAGACTCTTGAGATAGACTCTCTTGACTAGACAAGCTAGTATCAACAACTCTTGTTGACGTACTAGTGTTATTAGCAACTTTTGCTATTGTTCCGTCTAACTCACTCAACACAATTTTACTTCTTGGTTTGTGGTCCTTTATGTGGTCCTCCTCTAAGAAAGTGACATTTGTCGACACAAAAACCTTATCTTCCTTAGGATCATAGAATAAACCACCTCTTGTTTCTTTTGGGTAGCCTACGAATAGGCACAATTTTGAACGTGAATCAAACTTTTTGGATTTGTCACAAGCACATGTCTTTGGACAACCTAAGTCTCAAATGACATAAACTACCTTTACGTCATTTCCAGAGTTCAAAATTTGTTTCAGAAACACTTTTTGAGGGCACATTGTTTAAAATGTATACTGCAGTCTCCACTGCGTAACCCCAAAAGGAACTAAGGAGATGAGCATAACTCATCATTGATCGAACCATGTCCAACAAGGTTCTATTTTTCCCTTCCGATACAACCGCTTTCTTTGTGGGTATGCAGGGAGCCGAGAGTTGGGATGTAATTCCATGTTCTATTATATAGTTCTGGAATTTTAAATCCATGTACTCTCGACCTCGATTCGATCGAAGTGTTTTAATAGTTTTACCTAATTGATTTTCAACCTCAGTCATGAATTCCTTGAACTTTTCACGTGCTTAAGACTTGTGGTGCATTAGGTAAATATGCCCAAATCTAGAATAATCATCTATAAAAGAGATGAAATATTCATATCCTCCACTGGCTTTAACATTCATAGGACCACAAAGGTCTGAATGTATAAGTTCAAGAGGGTCTTTAGTTTTATAACCTTTTCCACTAAAAGGTCTTTTAGTCATTTTGCCTTCTAGGCAGGATTCACATACAGGTAAAGTATTTTCTTCTAACTCATTTAGAAGTCCATTCTTTACCAATCTCTCAATCCTATTGAGATTAATGTGACCCAATCTTAGATGCCAAAGATGAGCATTTTCTTTAGGAGAATTTTTACGTTTCTTAGTTTGGGTGTTTGCAGTTTTAAATAATTCAGTGTTCAAAACCATTTTGACTTAATTAGGCCTTAAAACATACAATCCATTTTCCAATGAAGCTGAAGGTTTAGAATGTATGACACCGAAGGGTAATAAAAAACACTCCCCCTCTAGATCAAGACACTCTTGACCAAAACCTAATGCAAGAATAACTCTTATTCCTATAGGCTTTAGTTACCATTTAGTAACCCTCTGTCTTTAGATCCCAGAGTCTCCATCCAAAAAAAATCTGTCGAAGGGAGGACTAGATTGAAAAACAGACTAAGGAAATCTTTATCCGTTTTGGGTGTCTACTGTGTTCGATCCCTTATGATCAAACCCTCCAAAGGGATTGTCTGGTCCAGGCCAAACTAGCAGGCTGACCATAGTGATCTTACGGTGTGAACCGACGGAAGAGACCATAGGTTTTGTTTTTACTAAACCTACTCCCACTCACTATAAAGTACCCTCTCCATTCACCTTGATATTGACTCATACAAACACCATTCGAAGGGGGACGCTCACGGTGTAAATTTCCCGGAGATAACGTAGGTGGAAAAATATGTATCATATACACTTTTCCTCCCACTGGATACTTTAAAGTTATAGGAATCTTACTTAGGCTAAAACTCGGTTAATGGTTTTACCTAACTCTATCAATATAACTCTTATATCGATAATCAATATAACTCTTATATTGATTTTATCCTAAGTTTGTTGATATAACTCTTTATATTAAACCTAGGATTTCTAGTTTATTAACCCCTTCATAAACCCGCTGACGCTATTGTAATTTATAAACTCTCTTATTAAATTTCACTCCCAAGTCGAGTTCAGGTGTAGGGTGTTCATTTCCGTCACTTAAATACCCACACTTTAGATGTAACCTTCCTTAGACAGAGTTCCTCTTTTAATCATAGCTTATAAATCACTTGGTTCCTAATTCTAATCTAATTAGAACTAGATAACCTAAGGTCTATTATTTCCAAGTTTTAAACATTTTAAAACTAATAATTAACCTAAGGCGACATGGTGCTTGATCATTCATCACATGAATAATAAGACATTCATAGCATAATCATCTCATGATATATAATGCATGTCAATGTTGTCAATAAATTAATATAACAACTTTATATTAATTAATCATGCAAACTATATAATATAACTCTTTATATTATAAATGATGCATGAAAATGCTTAACCTAAGGTGAGATTTCTAAAATAAAAGCATACATTATGCATGACAATAATTACACAACAATTGCATAATTTAAATAGCAAAATTAGACCAATTTTGGCACCCTAAATTCAATTTTAATTAATATAATCTAATTATATTAATTAAAATTAAATAAATTAAATATAAATAAATATAAAACGTTTTTATATTTATTATATATAATAAAAGCGAAATCGCGCTCGCTGTTCTTCGCGTGAAATTGTGGCGGCGCTGGTAACGACTGATACCAGCTTTGCAACGCTGTAGTTTGTCCTTTTCTGTCGGTTGAAACGCGCCTCAGTAGCAGCATTTGCAACGCTGTCGTCTCCTGCGCGCTCCTTCTGCAAACGTAGCGTTGCAACCCCAATGAACAGCGTTGCAACGCTGCACACAGACAACAACATGAATTCTCCAATTCTTGCTCGTTTTCGCTCCGATTTTCTCAGCAACCACTTTGCCTCGATCGACACGGACGTACAGACTCGATTTACAATTAAAATTGCCCAAACCACCGGGCCCTTACATTGAACGATTCAATGGAAAAATTGTAAATATAAGTGCCAAAATCGAAAACATCTAATATGCAAACCGTTCATACATCTCATGCAAAATAGTAAATCCCACCTCGTTCTTGAAAAATCCAGAACGTAAATCAAGAACGCAATTGGTCACTGATACCAATTTAAGGGAATCGAATCCCGCAACAGAAGCGATTGTGAATCGCGCGTTCTTAATTTTGTTACATCGATTACATTCAAGAACTAAAACAACTAGATTAAGCATAAAAATAGAATATGGACAACAAGAAATATACCTTTGAAGAACTTTTCTTCAAGTGCTCCTTTCTCCCACGATCACGATCTCAGCAATTGGAACACGCCACTGGATCTTCTTCGTTGTTCTCTAGGTGAGGGAGGAATGTGGGTTCCCTTTAGGTGAGATGAGATGTTGTGTGAGAGAGGTTTGTGTTTTGGAACTTTTCCAACAACACAACAAAGGCCTCAAATTACTTGACCTAGTTGCATCTCATGCAATGGAAATGACCATCCTACCCTTCCTTTATTCTCTCATTAGTATATGGCTTATATGTGAATCTAATTCACATTAAAACTATATATTATATCAATTATATCATATATAATTGATATAACACCTCAGGTATGAAATTGAACACTTCAATTTCATTCCTAACTTTATACCCTCAATTTTACAATGTTGAGCCAACAGAGGGACCTAATAGAACTACAGTTCATGAGCTCCAATGATGTGAAATTAACTGATCAAACTCTTTGATCCAGTTAATCAATATTCAATAACTACAAGTCACTCCACTATAGACTTTTAGTTGCACTCTCCTCACTGTAGATTATATTGTGTCCACTGATATCGTCATTACCCATGAGTCGATAATTCACAGGTTGTTCGTGATCACAGCTGGGTCAAATTACCATTTTACCCCTGTGATTACCTCTTTGTTCTTTAAGCACCACTGATCCTCTAATGAATAATAAGTCTAAAAGGTCCTACCATAGACTCATACCATCATAGAGGGCGGGGCCCGTTGTTCAAGACCTGGAGTCAGTTCTTAAGGAAACAACCTATCTAATTCCTTAGTGTAGGAATGAGTGAATTCCATCTTGCATGCTTAAGTTCCAACTACTCACGAGGTTCCGTCCCCGAGAAGGTAGACTTTTTGAGTCGGCTCACAGTCACCCTCACCCATACTAGTCCAAGGACAAACCCCTATGTTATGAGTTCTTAACTGCTTGCTCAGATTCAGTCGAGTCACGAATGACCATCTATAAATTACTAGTCTTATCCATTAATCTGTTACACCTGGTTAGACTAATAATTTATGATCCGATCTTGTATGAACTCTTCATACAGGATGCCCCCACTCGCATGTCAACTACATGAATGAGTTTGGATTACCTCATTTGTACTACATAGTGGGTCGCATCCATGGTGTTACCAGAATGAGGTTTCCAACCTTATTCTTATACCATTGACCATTTAGAGTATATTCTTGAATACAATCCAAATAGTGTCCACTACGCTGATGTTCAAGATATATATATTATCAAGATATATATTATAACCTTGAAGCTTAGTTTATTGGATTTATAAAGCAAGACTGTAATTTTATCAAAACAAATAACTCTTTATTTCTTTGATCAAAATTACACAAGTTTTAGGGCATACATCCCAACAGCGGATAACTTTCTCATTTCTTCCTATTTTTGTATCTTACTTATGTGTATAATTATTTTGTTTACTACAATATGTATTCCCCAAACATAAAATTCTTGAATGTAAGCATTGTAACATTACATGGGTATACACATGCATTCAACAACATGATAGGTACATTAAAGGCTCCAAATCACAGTTGGGCTAAGGCAGATGATTCAAAGTTGGCGGAGTCACTAGCTGAATTAGTTGGGTGCTTGGTGGTATGATAGCAAGACCTTTAGACTTGGCTATCTACAACATCTACATCGCATGCTAGCAGAGAAATTGCCAAATTCTTCTCTTAGCAAAATACAATTAAGTGCAAGATTAGGTCTCTGAAAAAACGGTACAGGGTGATTGTAGAGATGAAGAGTTGAAGTGTGTTGATGTGGAGAAAGATATTTTTTATACTTTTGTCTGTTAGCTTTAAGGTTTCAATCAAACTAAATATTTTGATGATAACAAACTCAAATTTTTGTACTAATTTTTTTAGTATTTCAAAGCGTCTATCTTGGAGAGTTCGAAGAGACGAGTAGAAGACAATTTGAATCGCTCAAATTGGAGTTCAAATGAAGGAGATATGAGCGAAAGAAGTTTGAAAAAGGGAAACTAATAAGGGAGTGACACGTGGCACTGTGCTGACGTGGTGATGACTGATGTGGCGATGATGTGGCAGATGATGTGGTACTCTAACGGTCGAATAACATGGCATGGCATTGGCGTGGCACTCAAACCGTCAAATTTGCTGACATGGCAGATGGCGTGTCAGTTGATGTGACACCTCATAATTATATTTTGATGACATGGCACATAACGACTGAATTCTGATGACGTGGCATACGTGGCCAAATAAAAAGCGTACATGTAGTGAGAAGCTGTCAAAAATTCAAAATTTTGAAATTCAAATGAATTAAACAAAAGAAAAAAAGGAAAGAATGTGGGATATTTGTACGGAAGACCCCTTTTTATTATCATTAATGTTGAATGACCTGGCGCTCGAAAATAATGAAAACTGACATATTCTGACGTGGCAACACGCGCAAAATTACGCCGAAGGGTAGCTTCTCGCCTCGCAGGAACGCGCACGCCTTCTCGAAAGGAACGCGCACGCCTTACTCGTAGGAACGCTCACGGTTACTCGTGAACGCGCTCAGAACGCTCCCGCTTACTCGTAGGAGCGATCATCCTTACTCAAACGGTGACTCGCCATGCATACATACTGAGAACTCATGCATGCTTACTCGTAGGAACCCTCACGCTTAAACTCGCTCTCGCTTTGGAAATAATGAAAACAGAAGATGCAAATATCTAAACAACCATAATATATTCATTCATTCACATGTCACAAAAAAAGTATATTAAGAAAAATAAAAAACACATTATAAGGAAGGGAAAGGAAGCTCGCATCTCCTAATCGACTCACTGGAGGAGTCGACACACCGAGGGGCCGACCATCCGCCCTCTTCTTGGGGCGCTCTTCAGAGGAATGTTAACGGAACATTCAGAAGGCGCGTCGGTTGCGCGGGGAGGGATGTCAGAATCAGCAACCGTGGCGTCGGTTGCCTCAATGGGGTCCTCGGATGACTCCTCAGATGACTCCTCTGAGGACTCATCGGATTCCTCCCCAGAGGAGTCATCCGAGGACTTGCTCGGTGGCTGTCCAGGATAGTTGAAAAAACTATCCTCTGATGATTGTGGTGTGGAGGCAGAAAATGACTTTTCTACCTCTTCATTGTCAGAAATCGTTTTGTCTGAACTGTACTTAATTGTGTTTGAATTCATTTTGAAGGGAAGAATATGAAAGAAAGGGTGAAGAAGATGAATAGAGGTTAATTGTCGTTTAGGATGAAATTTGTATAAGCCTCACCCTATATTAATACTAAAAAACTCTACGGTCGAACTTAAAGACGCCTGATCACTAATTAATGCGCCTATTTGTTTCTCCAATTAGAAATTACTTTACTGTTTTTGTAATCATGAAAATACTCGATCGGCGTGAGTGTATCCACGCCAGCGAGTAACCGAGTACGAGAGCGAGTTATCGAGAGCGAGAGCGAGCAATCAAGAGCAATTCAACGGGAGCAAGAACGCGGGAGAATATAACGGTAGCAAGAAAGCGAGAGCCTATCTACGAGAGCGATCCGACGAGAGCGAGTCTGCGAGAGTGAGCAATCAAGAGCACTTCGGGAGCAATAACGCGAGAGCATAAAAACGATTGCAAGAAAGCGAGAGCCTATCTACGAGAGCGATCCAACGAGAGAGATCCTGCGTGAGCGAGCCAACGTTTTTTTTTCATTTCATATAAAAAGGGTTTTTCATTTCTACATCTCACTAACAAGTTCTTACTCTCTCTCTAAGTTCTCGACACTATCACTCTCTCTAGTCTCTTGGCCTCAAGCGTTCGGCCGCACTCCGCTCCATCGCCGCTTGTCGTCGATACGATCCACAGGGGTTTTACGTCGACCTTCTACCGTTGTTAGCCGTCAATATTCCACCGTCAGCGATGTGTTTCATTGTATTGTTTTTCTTTTTATATTCAGAATCGCGATATTGATTGTTCTGATCGAGCGGTCGAAAAAGGGTGCATGTACTATATCCGCTCTAGTTTTATTTGGAAATCGACGAAAGGTTGCTAGGGTTTAAGTTATATCCGTCTAGTTTGGGGTTGTTACGTTCATCGAACGCAAGGACGGAGATGACGAGAGTACGTGAGCGATTGAGCGTGTGCGAGTGAACGGAGCGAGTCGGAGGGAGCGAGTTGATGCGCGCGCGATTGATCGATTTCATTTTGTATATATCGATCAGATATACACTAATCGCTTTTAATATATCAGTGTTCGTGCATTTTGTTAAAATGATGTTCGACTATTATGCACAGGTTTAGAATGACAACTGTTCTGAAGATAGCACAGAAAGACTGATTTCCAGGTCAAGCGTGGGTGTGTGGGTGGTGGATGTGGTGGACGTCGTGGGTGGAATGAGAGTATGTGATGGCTCGGGTGGAACGGAAGGGTCTGACGACCAGAGAGAGTGGTAGATGACTCGGGTGGGTCATTAGGAATGTGAGTGGGAGTGGGGGTGTCGGTGGTGGATGGATCGGGAATGAGAGTGGGGTGGGAGATGACACATGTGAAGTGGGAGATGGCTCGGGTGCATGATGGCTTGTTGTGCAATCATCGTCCTGCAATGTATGGAATCCATGTAAGACATACTATTTCCAAATAAGGATGACAGAAAGCAAGATACTACAGTATGTAATGTACCTTCATCATCCTAGACATGAACTCGCGTAGCCATAGCAGATGGGTTTTAACCTCCCCCAGTTTCGTTCTCATTTCTGTCCTCATGTCCAGTATGTCTGTTTTCAGACTGTTAACAGACATCTCGACGCCTTGAACGCGCATTTCTATACCCCGAATGTAGTCAACACTGGTTGATGCGGTGGGCATATCAATGGGTTGATCGTGTGAAGGTTGGACAATAGGGGTCGTGGATCGGATCGGTGATCGAATGGGGATCGGTCAGAGACACGATATGCGAACGAATCGGGAGCGGCCAGTGTTCCGAGTCTCTCACTAATGTTCTCCCGATCCTCGACCGCGGCATCCTCAACATCGACATCCATATGCTGGACCTCCTCAAACAAATCGTCATCTTCCACAGATCCCAACTCATCTTCTTCATCATCGGACGCCGTTAAGAGTCTATCGTGTACAGGTCTGTCACGATACTGCATTTTCGATTCTGATGGGATAAGGTTCGCGACGACGACCAACTAATGACAAAGAATGCACAAACACATAGAGTGGGTAAGTAAGATGAGTCATGCATACACATTAACGTGAATCATATCATATTAGAGGTTATAAACCTTTTTTGACGCGAGGACATCTCGTTCCAAGGCCTTACGCGTAGCAGTGTGTGTGCACGACCATCTACGAATCACGTGGCACCGCTGTCGTCGCCATCGACGCGAGTAGCTATCCGATCCGCCATCGGTAGTAGGGTCTCGTATGCCCACACCTATTAGATATTCATTAGCTTCGTAGCAATACTAGAATATGATAATAAGTACATTTTACCACATGTATTACATGAAAAGCAGGAGAGAATCCAGGAAGGTTATACTTAAAAGTATAACGTTTGTTAGTCTTCGACTTCATTTTATACGTTTCGAGCTTCCCCTGCAAGGCGGCCTTCAACCCACGTATGGTCCTTCCCCATATTCTAGAACCCCAATCAATACTGTTGAAGTATTCGAGGTCCTCTACGTCTCCAAACAAAGAACTGTCGACGACGGTTTTTGACTTGTTCTTCCCCATCATTACTACCTCACAGTAATATATAAGCGACACCTTCACTGCATCTTCGTCGGTGTCAAATTTTATCGTCTTGTAGGCAGACTCAAAAGCATCTACATGGATGCCCGTCCCCATGCCATCGAAATACTTCTCCCGAAGAGCAATCGGGCCGGTTCGGGCCGTTCACGTGGATCGTGGGGATTGCCACAATCTGTCATTAGAATGAACTCATCCTTACCAAACTTGGCTACTGTTCCATTGATAGAGAACAACATCGATTCATTGCCTTCGCCTCCATCCCCTCCACTCACCTCCTTCAGAAGAATGTGATGGACTAGTGGACTATTGAATACGAGGTCCACATCAACGAACGAACCGAATCTTTGTCGCTGAATAGTCCAAGTGTGTAGATGTCACCTGTTTCAATACTTTGTTAGCACTGGTGATGTGTGCTAAACTAGATGCTTGACCTGGAAACCACTTTTTCTGTGCTATCTTTGAAGCATTGCCATTCTAAACTCGTGCATAATAGTTGAACATCATTTTAACAAAATGCACGAACAACGATATATTAAAAATAAAGTGTGTAATCTGATCGATATATACAAAATGAAATCGATCAATCGCGCCCGTCAACTCTCTCCCGCTGACTCGCTCCCGTTCACTCGCACACGCCCAATCGCTCACGTACTCTCGTCATCTCCGTCCTTGCGTTCAGTAACGTAACAACCCCAAACTAGACAGATATAACTTAAACCCTAGCAACCTATCGTCGATTTCCAAATAAAACTAGACAGATATAGTACATGCACCCTTTCGACCGCTCGACTGTAACAATCAATATCGCAGTTACGAATATATAAAAAAAAAACAATACAGTGAAACCTGTCGACGGTGGAATATCGACGGCTAACAGCGGTAGAAGGTCGACGTAAAACCCCGGTGGACTGTCGACGACAAGCGGCGGTGGAGCGGAGCGTGACCAAACGCTGAGGCTGAAGAGACTAGAGAGAGTGATAGTGTCGAGAACTTAGAGAGAGAGTAAGAATTTGTTAGTGAGATGTAGAAATGAAAAACCCTTTTTATATGAAATGAAAAAAAAACGATGGCTCGCTCACGCAGGATCGCTCTCGTAGATAGGATCTCGCTTTCTTGCAATCGTTTATATGCTCTCGCGTTATTGCTCCCGAAGTGCTCTTGATTGCTCGCTCTCGCAGACTCGCTCTCGCCGGATCGCTCTCGTAGATAGGCTCTCACTTTCTTGCTACCGTTATATTCTCCCGCGTTCTTGCTCCCGTTGAATTGCTCTTGATTGCTCGCTCTCGCTTATTAGCTCTCGCTCTCGATAACTCGCTCTCGTACTCGGTTACTTGCTGGCGTGGATACACTCACGCCGATCGAGTATTTTCATGATTACAAAAACAGTGAAGTAATTTCTAATTGGAGAAACAAATAGGCGCATTAATTAGTGATCAGGCGTCTTTAAGTTCGACCGTTGAGTTTTTTAGTATTAATATAGGGTGTGGCTTATACAAATTTCATCCTAAACGACAATTAACTTCTATTCATCTTCTTCACCCTTTCTTTCATATTCTTCCCTTCAAAATGAATTCAAACACAACTGAGTACAGTTCAGACAAAACGATTTCTGACAATGAAGAGGTAGAAAAGCCGTTTTCTGCCTCCACACCACAATCATCAGAGGATAGTTTTTTCAACTATCCTGGACAGCCACCGAGCAAGTCCTCGGATGACTCCTCTGGGGACGAATCCGATGAGTCCTTAGAGGAGTCATCTGAGGAGTCCTCCGAGGACCCCATTGAGGCAACCGACGCCATGGTTGCTGATTCTGACATCCCTCCCCGCGCAACCGACGCGCCTTCTGAATGTTCCGTTAACATTCCTACAGAAGAGCGCCCCAAGAAGAGGGCCAGACGGTCGGCCCCCTCGGTCGTAATCAACTCATCCAGTGCGTCAGATTAGGAGATGCAGGCTTCCTTTCCCCTTCCTTATAATGTGTTTTTTATTTTTCTTAATGTACTTTTTTTGTGACATGTGAATGAATGAATATATTATGGTTGTTTAGATATTTGCATCTTCTGTTTTCATTATTTCCAAAGCGAGAGCGAGTTTAAGCGTGAGGGTTCCTACGAGTAAGCATGCATGAGCGTTACTGTATGTATGCATGAGCGAGTCACCGTTTCGAGTAAGGAAGAGCGTTCCTACGAGTAAGCGGGAGCGTTCCTACGAGTAACCGTGAGCGTTCCTACGAGTAAGGCGTGCGCGTTCCTTTCGAGAAGGCGTGCGCGTTCCTGCGAGGCGAGAAGCTATCCTTCGGCGTAATTTTGCGCGTGTTGCCACGTCAGAATAAGTCAGTTTTCATTATTTTCGAGCGTCAGGTCATTCAACATTAATGATAATAAAAAGGGGTCTTCCGTACAAATATCCCAAAGAATGTCACACATAGATATTTTGTAGAAGAATCTTTTCTCCTCTCTCCTCTCCAATTATGGACTTTAACCTTTCCAAAATTTTCAACCATTCAATCTCAACCATCCATCCTTTCTTACCTAAATCTAGTGGTCTATCTTCAATTTGTTCAAAATCCTTTTTAACCCTAAAAACATAATCTTTCTCATACTTTTTGAAAATCTTCCAGCAACCAAAAAGCCATCAAAAAGTCTCCAAAAAGGCTATAATTTTGAAGAATTCTCACCATCCTCTAAACTAGCAAATCCTGCAATTATTGTGAGAATTTTGGAATTGAAGCGTGATCCGAAACTTGTGTAATTTTTTTTTTTTGAAGCAATCTTTTTTTTGTTCAATTTTCTTTTTTTTTTTTGGAATATTGTATTAAGGTGTGGCACCTTGGGTTGTATGAAAATTCAATTGGTTTAATCCTGACCTAGAAAAAAGATTTGGGTTTCTCTTTAGCCTAGGAAAAAGTGGGGTGTGACGGTTTGGTTCTCACCCGCAAGAGGATCGAATTTAGTGAAATACCCAAGAGGAGCTTGGGGGGTGGGATAGGCAATTTGCCGAACCACCATAAAATTTTCGGTGTCCTATTTTCCCTTAACTTTTTAATTTTTGCACTTTACTTTATGCTCTAGCATTTTTTTTTAGATTGCATAATTTAAGATTTGCTTCTTTGAATTGTTTTGTCTACAATCTTTCTATCATTGAATTTTTTTTTTTTTTTAAGTTTGTTGTATTAAAATTGATTGAGTGCAAATTTCTTAAATTTCTCATTGTGTGAAAGAGTGCCTAAAAAGTGTTAAAATTAATGTGTTAATAAATAAAAAAAGTTGGTACACAACCCTATTCAACCCCTCCCCCCCTCCCCCTCTAGTGTTGCCATACATACGATCCTACAATTGGTATCAGAGCGGGTAGCTCGTTTTTTTTATCTTGAAGGGTATTTTAATTTTTTTTGTTTTTAAAAAAAACTATTTTTGAATTTTTTTAAATCTGATTTTTTTTTAAATTAAAGAAAAAGTTGAATTTTGAGTTTTTAAAAGGTTCTTTCACTAGAAAATTGGCTCTTAAAATTCAATTTTTTTATTGTTTTAAAACTCTAAATAAGTTTTGAAATTTTATTGAAAAATTGTCTTCACAAAAATTCATCTTAATTTAAAAAAATAAATATTTTTTTAAATTTGAATAATTTTTTTTGTTTTAATTTTTTTTAAAAAAACACTCTTAATTGTTCGAGCTTATGGCAAATACTGGAATAAATGGGGTTGTAGAGGGTCAATCTACCACTAGGTCACCTATCTTTGATGGAAATAATTATGCATAGTGGAAGACTAGGATGAGAATTTGCCTACTATCTATTGATTATAATTTGTGGTAAATTGTTTCCAAGGGTCCTTTTGTCCCAACTAAATCTATTGCAAATCAGGAAATGCCCAAAAATGAAAATGTGTTAAATGAGTATGATAAAAAGAAAGTGTATATTAATTTTAGAGCTATGAACTGTTTGTTTTGTGCTTTATGTCCTGATGAGTTCAATCGAATCTCTGCATGTAACTCTGCTAAAGAAATTTGGGATATGCTCGAAGTAACTTATGAAGGAACGAATCAAGTAAAAGAATAAAAAATTAGCATATTAGTCCATAATTACGAATTATTTAGAATGGATGTAAATGAGTCTATTTCTGAAATGTTTACTCGATTTACTAATATTGTAAATGCCTTGAAAAGATTAGGAAAAAACTTACTCTACCTCTAAGAATGTCAGGAAGATCTTCAGATCATTGCCCAAAAGTTGAGAAGCTAAAGTCACGACGATTCAAGAAGCCAAAGATCTCTCAACACTTCCTTCAGAGGAGCTCATTGGGTCTCTCATGAAACATGAGATAATCATGAAGGGCAACATGGAGAATGACGTAAAGAAGAAAAAGAACCTAGCATTAAAGTCTACTCAAGCTCAAGACGGTTCTGAAAGTGAGAATGAATTAAATGATGATGAATATTCGTATCTGGCAAAAAAGTTCAAGAGGTTTCTTAAAAAGAAAATTTTCTCCAAGAAAAGTAACAGTCAAGAGGGAAAGGGAGAAAAGAGCAATAGGGACGCAATTATATGTTTTGAATGTAAGAAGCCTGGACACGTGAAGTGTGATTGTCCACCAAGGAAGTCATCTTTGAAGAAGAGCAAGAAGGCCATGAAAGCTACTTGGGATGAAAGCGACGAGAGTGAATCTGAAAGTGAAGGAGAAGAAGCCAACCTCTGTTTTATGGCATTCGGAGACGATGACGATGATGATGAGGTAATTCCTAAAAATCTCTCATTTGGTGAACTTGTAGAAGCCTTTAATGAAATTAAAATGAATTTGAAAAATTAGGTTGCAAATACATGACTCTTAGAAAGAAAAATACTAAAATAGCTCACGAAAATAATAACCTATCAATTGAGCTAGAAAATATAAAAGAAGAAAATGAGATTTTAAGAAAAAACCTTGATATGTTTAAATCAAATTTGCATGTGTTAAAAGATGAATATAAAATACTAGAAAATATAAAAGAAGAAAATGAGAAATTAAGAAAAGATCTTGATTTTCCTGAATCAAAATTGCATACTTTAAAAGAAGAATATGAAATATTAGAAATTAAAAATGAAGATTTATCATCTACCTTAGTAAAATTTACTAATGGGAAGAAAAATCTAGACATTATGATGGGTAAACAATGGATTTTTTTATAAAAGCGGTATAGGTTACAAACCTCATCAAAAGCAAAAGAAATTTACAAATTATTTTGTGAAAGAAAGTATCATGTCATCTATTGTTTGTCATGCATGTGGAAAACATGGTTACAAAACCTATGCATGTAATTTAAGGAAATCCAACGTTATAAATGAAAATAAAAAGAGCTAGATTCCTAAGTTTTTATTGACTAACCCCCAAGGACACAAACAAATTTGGGTACCTAAGATTACATCTTGAGAATAAATCTCTTTCAGGTATGTTTGAGATCAATAAAAAACAAGAGGTACTTGGACATGTTGGAGTTTTTGCCCTAAAACTCGTAGTTGTATTTTTTGTTATAATTATTTATAAAGTGTTATTATTTCAATAAAGAGTTATTGTTCGTTATTTACGATATTGTCCCTGTTTTAACCCTTGGGAACGGACTTGACTAGACGAGCTAGGATCAACAACACTTGTTGACGTACTAGCACCATCAACAACTCTTGCTGATGAACTGTCCATTTCATTCAACACAATCTTACTCCTTGGTAAGTGATCCCTGATGTGGTCTTCCTCAAGGAAAGTGACGTTTGTCGACACAAGCACCCTATTTTCCTTAGGATCGTAAAAGAAACCACCTCTAATCTCTTTTGGGTAACCTACAAAGAGACACAACTTTGAACGGGGCTCCAACTTCTTCGGATTTGACACCAACACATGGGTTGGACATCCCCAAATTCTGAAATGACGTAAACTAGTTTTACGACCATGCCAGAGTTCGAAAGGTGTTTCACAAACACTTTTCGACAGAATTGTGTTCAAAATGTAAATCGCAGTTTCCACTGCGTAACCCCAAAAAGAATCAAGGAGACGAGCATAGCTCATCATCGATCGAACCATGTCCAATAGGGTTTTGTTTCTCCTCTCCGATACACCATTTTATTGTGGCATACCAGGTGCTGAGAGTTGGGACGTAATTCCATGTTCTATCATATAGTCCTGGAATTCAGTGTCCATATACTCTCCACCTCGATCCGATCAAAGTGTCTTTATAGTTTTACCTAACAGGTTCTCAACCTCACTCTTGAACTCCTTGAACTTTTCAAGTGTTTCAGACTTTCTATGCATTAGGTAAATGTTCCCATACCTCGAATAGTCATCTATGAAAGATACGAAGTATTCGTAACCACCTCGTCCTTTAACATTCATCGGACCACAGAGGTAAGAATGTATAAGCTCAAGGGGCTCCTTGGCTCTATATCCTTTTCCACTAAAAGAACGATTGGTCATTTTGCTTTCGAAGCATGACTCACAAATCGGTAAAGAGTTTTCTTCCAACTCGCTGAAGTCCACTCTTCACTCGTCTCTCAATCCTATTGAGATTTATGTGGCCTAACCTTAGATGCTAAAGATGGACGCTTTCTTTTGGAGAAACTTTTACTCTCTTAGTTCGTGTTTCAGCTGTTTTGAACAATTCAGTGTTCAAAACACTTTTGACCGAATTAGGGTTCAAAACATATAAATTGCTCTCAAGTGAAGCAGAACAAATCAAATTTCCATTTCGAGAAATAAACACTTTATTACTCTTGAATTAAAATGAAATTTTGTTCTAATAGGCAAGATATAGAAACTAGGTTTCTAGTAAACCCAGGAACTATGTACATATTATCCAATAATAGGTACTTTTCGCCAAATAACAGCTTCACCGTGCCAATTGCTGCAGCTTAGAAAACTTCTCCGGATCCAACCCTAAGTGTTATCTCACCCTCTCTCAGCCGCTGCCAAGAAATTAATCCTTGAAAAGAAGAGTAAACGTGGTTAGTGGCGCCTGAATCCAATATTCAGACTGAGTCACTACTCTCCACTAAACAAGTTTCTGTGACAAGTAAATCACATTTATCCGTCATCTTCTTTTCATCAAGGAACTTGGGACAACTCATTTTCAGGTGTTTGTACTCGTTGCAGTAGAAACACTTTTTAGAAACTTCCTTGATCTCTTTGCCCTTTTGGGCGACAGTGCGGTCAACCCTATCCTCTTTCAACTTCTCCTTCTTCCCTTTAGGGCCTGAAGAGTCTACAAACTTTGTCCCAGAGGTCATACCCCTGTGAAAAGGTCTTAAGGCAACACTTGCCTCATCTTCCTGTCTCCTGGTTTGCACCAAGGATTGGAAGGTTTGTAGATCGTTGAATAGGACTATTGGACCGAGCACCTGAGGACCCTCCTCAGTGAGCGCGAACTCTTGGTCGTCATCTACCAAAATCGTATCAATATTATTTTTCCACGAAACATTATCGCCGACTAACTTAACGGAAGCTAAAGGAGCGTTAAAAGTGTTTCATTTTGTTGCTGAAAAATGAAGAACTTTGTATTAGTCTCGCAAAAAGCTTTTGAAAAATCCAAACAAGTTTTAGTAATTTTTTTATCCTTTGATCCATAGGTTATTAAAAATGCAATGATACCCAAAAGGTTTAGAATGACCGCCACCGTAGGGTAGACAAACCCACTCCTCCTTTGGGACGAGACTTTCTCGACCAGACCCTAATACCAGAATAACTCTTATTCCATTAGGCATCCAGCCATACAATTTTTGATGCAATTCATTAACTCTTAACGAATCCTGTGAGTGTCACCCTCAATCTTTAAATTTCCAAGTATCCGTCCAAACAATAGCCGCCGTAGGGAAGACTAAGAAGGAAAACCAACAAGGAAATCGTATCCCTTACGAGAATTCACGGTGTTTCGACTCCCATGATCAAGGCAATTGTCTGGTGTAGGCCAAACTAGCAGGCGGATCATAGGCGTCTCACGGTGTGATTCGAAAGAAGAGACCGTGGGTACCGTTTTCACAGGTCCCGCTCCCACTCACTATAAGTATTCTCTCCATTAACGTTGAATATTGACCCATACAAATGCCACTCGTAGGGGGACGCTCCCATTTGCGCCACGAGGCCGAGTATGGATCTCACGGTGCGAACTTAATGGAGATTTGTGGATGGAAAATTAAATGTATCTCAAACACTTTGTTTCCTCCCACTCAGCACTTTAGATTAAAATGAGATTGACCTAGATAAACCGACTAAAATTTACCTAAGTCTACCAATATAACTCTTATATTGGATTAACCTAGGTCTATAATCACATTATTAACTTAGGTCTATGTGAGTAAATTAATTAACTCTTAATTAATGGCACTCACGAGAATATAACCTAAGAATCTCCGGACCGAATTCACTCCCAGGTCGGTTCCCAGGTAGGGGTGTTCCGCAAACCGTCAGCTTAAATACCTCAGCCTTAGACAGAACCTTCCTTAGACAGAGTTACTAATCAGTCTTCATTCAAAGGTCAAATTTGATTTCCTAATTCTAATCTTATTAGAACTAGAAAACCTAAGTCTAATTATTAAAGTTCTAAACAATTTAAAACTAATAATTAACTTAGGTTACATGCATTTATATGTTATAGTGTTTGATCATTCATCACATGAATAATACTATACTATAACATACATTCATCACATGAATTAAATGCATACTATATATAACACTTATATTAAATAAATTAAAACATGCAAACCTTATAATATAATCCTATCATGCATACTATATATTATAACTCTTATAATATAAATATGATGCATGAAATGTTTAACCTAAGGTGAGATTTTAAATCTATATGACATACATTATGCACATAGAATAATTATTTCATATATCTCATGCATAAAATAAATAAATAGCATTATTGGACCGATTTTTGGCACTCTAGGGACATTAATTTAATTAATTAATATTAATATAAAAAATATATTAATATTAACTAAAATTAAATTAATTAAAATTCATTTTACAAAAGAAATAAAATAAAAATAAAAAACGGTTGTCAGCAGGCTTCGAACCGCCGACCTTGAAGTCCCCACGAAAACCACCTTGCCACTACGCTGAAAATTCTCGCGAACTGCTGCTCAGTATGCGATGGCGTTGCAACGCTGTAACCTCCAGCGTTAAGCGTAGCGTTCTCAATTTTGCAGACCGCAGCAGACCTCTCCAAGAACATGGATGGCGCCGCAACGCTAAAAAGCGTTGCATGACGACGATGAATTTCAAAGAAACAACTTTTTTGTTTTTCACCAAAATCATCCGTTTTCGCTTCGTTTTCTTCACAGCAACTCTTGCCACGAACGACACTGACTTACAGACTCGATTTACATTAAAAAATACCCAAAACACTAGGCTTTACATTGATCTCATAAACTTTAATGCAAATCTAAACTAAGTGCCAAAAACCGAAAACATCCAAAATTGCAAACCATTAACATTCATCTCATAAAAATTAATGATACCCACCTAATTGTAAGAATTCACAAATACACAACGATTAGCTCTGATACCAATTGAAGGGATTTGGAATTCCCGCAGTAGAAGACGTTGTGAAACGTTGTTGATCCGTTGTGTAATTCTATCTTAAAATTGAATTAAACAACTTTAATTCTAGGTCAAACATGCAAAAGAAGAGAAAAAACAGTAAGGTAATCGTTACTTACTTAATGAAGAACACTTCTTAAATGCTTTCCTCCTCCGGTCACGATCTCCCTCGAACGGACGAACTCCTCAATCTCGTTCCTCAAACCAAACAGTGAGACACCACCACAGGGTCTTCTCGATTGTTCTCGGGAGGTTGAGGGAGAGTATGTGAGACTCCCTTTGGATTTTGGGTGAGGAGATTTTTAGAGAGAATTGAGAGAGAATGAAAGAAAGGTGAATGCTCTCTTTGGATGAAAATCCAAAAAATGGCAATTGCTTTTTTTAAAAGCTTCTTGCAAAGTCTTTATATACTATTGTGTGATGGAGTTAATATCTCCACCAAATCACAATATTACCACAACTATATGATATACAAATCATATTTGTATATCACATAATATAAGGTATCTCATACCTTATTATACAAATCTCATTTGTATAAATGTGAAATTTTTAAATTCAATCACATTCAATTTAATTTTTCTTAATTCAATTTTTCCAATTAGCACTAATTAATCAATTAGGTTATACAATAGTTTATTATGAATCTCATTCACATTAAACTATATATTATGTCATCTATATCCCATATAAATGACATAAATACTCAAATAATGATTTTGAACACTTCAAACTCACACCAAACTGTAAACCCTCAATTTGTCAAATTTGAGCCAACCGAGGGACCTAATGGACCTACACGTGATGAGCTCGAATGATCTGAGACTAACCTGTCAAACTTTTTGACCCGGTTATTTAACATTCATTAGCTACGGTAACACTCCACTAAAACCCGTAGCTGCACTCTCATCACTGTAGGACAAGTTGTGTCCATCGATATAACCAATACCCATGAGTCGACCATTCACAGGTTGTTCGTAGACTCTGCTGGGTCAAATTATTATTTTACCCCTGTGTCTACCTCTTGCTCCTTAAGTCTCACTGCTCCTTTAATGAACAACTACGTTGTATGGTCCAACCATAAACAACAATCCTCTTAGGCCAGTGAAAGAGTGGGGCCCCGTTGTTCAAGCCCTGGAGACAACACTTAAGAGAACGACCCCTCTACTTTCCCTAAGTCGGGAACGAGTGAATTTCGTTTCGCATTGTTAAGTTCCCAGCTCTCCACGTGGTACTGTCCCTGAGAAGATAGGCATATTGGGCGAGCAACAGTGGCAACCCTCACCCGTACTAGTCTAAGGACAGACTTTCGCAGACAGGAGTTCGTAACACGCTCAGGATTAAGGTCGAGTCACTAACGGTCATCTACGAAATTATTAGCCCTATGCTGTTAACGGTGTTACATCAATAAGTCTAATAATTCATGGTCCGGTCTTGTACAATCTCATTGCACAGGATGCCTCCACTCGCATGTCAACAACATGAACGAGTCAGATCACCTCATTTGTATCTTAATACAAAGCGGGTCGCATCTATAGCGTATCCAGGATTAGGTCTCCAACCCTATCCATATACTGTAGACCGTTCGGGTCATTAACTCGAACATGATCCTCCCTGTGTGTCCACTACACACTGTTCAAGTTCTAGTTCTCTCATTAATACAATGACCCTAGAGCTTAGTTAATTGGATTAAGTTTATGAAATATGCGAGACACAAAGTTCAAATAAATAACTCTTATCTATATCACAATGCTCAAATAAATAACTCTTATTTATTTACAACTTGGAATATTACAAACTACACGAGATGAGGACATACATCCCAACAATACCATGGGCTGAAGCCCTAAATTCTGCTTCAGCACTGCTTCTGGCTACAACATTCTGTTTTTTGCTTCTCCAAGTGACCAAATTACCTCCAACAAACGAGTAATAACCAGATGTAGACCTTCTATCCATTGCACTTCCTGCCCTATCTGCATCAGTATAGACTTCAACCTGTAGATGGTTGTGTTTTTTTAAACAATATACCTTTTCCTGGAGTTCCCTTTAAGTATCTCAAGATTCTATAGGCAGCTTCAAAGTGAACTGAACATGGTGAATGTATAAATTAGCTTACCATGCTTACTACAAAAGCAATATCTGGACGTGTATTGGACAAATAAATGAGTTTTCCCACGAGGCTTTGGTATCGTTCCTTATCCTTTACTTCATCATCTTTTGCAACTTCCAATTTCAAATTCGGTTCAATGGGAGTTTCAGCTACTTTGCATCCAAGTAAACCTGTTTCTCCAAGTAAATCAATGATATAATTTCTCTGATTCACAAAAATACCTTTTTTGGATCTCGCAAACTCCATTCCTAGGAAGTACTTGAGCATTCCTAAGTCCTTGATTTGAAATGCACTTGCAAGCTTTTTCTTTAGAGTGGTCAATTTTATTTCGTCATTACCTGTAAGAATGATTTCATCTGCAAAAACAATCAAAATCGCAATTTTGTTCTTCGTAGAATGCTTATAAAAAATGGTGTGATCAGCTTGACTTTGAAAAAATCCAAAGCTCATAACTACTTTCCCAAATCGTTCAAACCATGCTCTTGGAGATTATTTTAGCCCATATAGAGATTTCTTTAGTAGACATACTTTGTCCTTTCCGAGATCAATTTCAAAACCTGGTGGTAAGCTCATAAACACTTCTTCTTCAAATCACCGTTGAGAAAAGCATTTTTACATCAAGCTGGTGAAAAGGCCAATCTAAGTTTGCTGCAAGAGATAAAAGTACTCTGATGGAGTTAATTTTAGCAACAGGAGCAAAAGTTTCTTGATAATCAATGCCATAGGTTTGAGTGAATCCTTTTGCTACTAGTCTAGCTTTATATCTTTCAACACTACCATCAACATTACATTTTATTGTGAAAACCCATTTACACGCCACTGTTTTCTTATCTCTTGGCAAGTCTACTATTTCTCATGTGCAATTTGTCCTTAGAGCATTCATCTCCTCCATAACTGCTAGCTTCCAATTTGGATCATTTAAAGCTTCGGTTCCTTGGAATAAACAAGTTGGTTATCCTAGATGTAAACGCTCTATGACCATTTGACAATTTTTTATATGAGATATAATTTGAAATGGGGTATTTGGTGCAAGCTCGAGTACCTTTTTGCAAAGCTATAGGGACATCAAGATCAGAAACAACAGGTAGAACATGTTCAGTAGTTAAAGAGACGAGAATGGAAGGTTGGTTACCTGATGCTTCAGAGTCATTCCTCAGAGTGTCAGATTGGATCCATGACGAGTCAGCTGTCTAGTCCTTCTTTATTTGATTGAGTTTTCTTCTAGTATAAAATTTAAACTCAGGATTAAGATCAGTTGAACCTTTATGTAGTATTTCTCCCTCTGAAGTAGATTTTTCTATACTTGGTATTGAAGTACTAGTAGAATCAGGACAAGTAGAAAAAGGACCAACAACATTTAGAATAGACATAGATCCATTCCAAAAATGATCTTCTTCAAACTTTGCTTTCTCCCCCTAAAGATACTCTTGGGTAAAAAAAAGGTTGATTTCCCAAAAACGATACATCTATTCTCACAGAGTCTGAGGATCAAAACATTTATATCCCTTTGTGTTAGGAGCATAGCCCAGAAATATGCATTTAATAGCTCTATGATCTAATTTTGACCGAAAAGGATTAGGTATATGCACATGATTAGTACATCTAAAGACTTTTATGGGTAAATCATAGTGCAAATGAGTTGTTGGAAAGAATTCTTTCAAAATATCAAGCGGTGTTTTAAACTTTCAGGCTTTACTAGGCATTCGATTAATTAAGTAGATGGCAGTTAAGACTGCTTCCCCCCATAAATACTTCGGAACATGCATAGAAAACATTATGGCTCGAGCTACATCCAATAAATGTCTATTTTTTTGTTCAGCAATACCATTTTGTTGAGGGGTGTCTCAACAAGTAGATTGATGAACAATACCTTTATCTTTAAAAAATACACTAAAATGTTCATTGAAATATTCAGTTTCATTATCAGAGTGCAGAATGCTGATTTTGGTTTGAAATTGAGTTTCAATCATATTGTAAAAACCTTTAAAAATATCATGCACTTCAGATTTTTTTTTTTCATTAAATAGACCCAAGACAAATGTGTGTGATCATCAATAAAGGAAACAAACTATCATTTTCCACTATGAGTTATAACCTTAAAAGGCCCCCAAACATCACGATGAATCAAGTAGAATGGTTTGAAGGACTTGTAAGGTTTTGGTAAGAAAGTGGTTCGATGATTTTTAGCAAAAGTGCAACTTTCACAGTGAAAATGAGAACACTCAATTCCTTTAAATAATTTTGGGAATAATTTTTAAGATAGAAAAAATTAGGATGACCTAGCCTACGATGCCACAGTAGAAGTTCTTTAATTAACAGAGAGAGAATTGATACTACTAAAGCTCTGAGTTATTTTACTACCATAGTACTCATCAAAGTAGTAGAGGCCATCAAGCATTCTTGCATGTCCAAACATCTTCCTCGAGGTTTGATCCTGAAAAATACAATGAGATTCAAAGAAAACTACACGACAATTAGAATCTTTAGAAAGTTTACTAACAGATAAGAGATTGCAAGCCAATGTGGGAACATGAAGGACAAACTTTAAAGTAATATCTTTACCTAGTTTAATAGTTCCTTTTCCTACAACAGATGAAAAACTGTCGTCTGCAATTCTGATTTTTTTTTTTATTGCAATATAAAGGAGAATATGAGTCAAAGAAAGAAGAGCAACTAGTCATATGATCAGAGGCTCCGGAATCAATAATCCACGGAGAGAGGTTACTACAAGAGAACGCCTGAGGATAATTACCTGGTTGTTCCAAGGAAACACAAGAATTATTAGATGATACATTGATCGGTAGCAAGTTCAAGATTGAATGAATTTGCTCCATAATAAAAAAAAATTAACCAGACCAGTAATTGAACCAAACCAAACTGGATTTTGGAATCAAACCAAACCTGAGCTGGAATCGACCCTGAACCAAATCGGATCTTGTTTATAAACTTAATAGAGGTCTGAAATCAACGAGATTAAATAAACAATTGATGGAACTATCGATCACCAGATGAAGGCCAAACATGAGACGACAATTGCAACCATAAGGTGAATATCCTCTGGCGAATCAGAAGCCTTGTAAATCCGAGATCTCTCCTTGAACTTTTCTTAAAAAAAAAAAAGAAAAAAAAGGAAAGGTATGTTTGATATGTATGGTTTTGTTTTAAAAAAAAAAGAAAAGAAAAGAAAAGAAAAGGAAGGAAAAGGAAAAGAAACAAAGAAAGGATTATAAAAAAAATGTAAATGGTAAAAGACAAAGAACCCACCGCCAAAGCTTTTCTTAGTTTATTTAATCTACCTTCTTTCCATTTCCTTCTTATTTTATGAAAGAAAATCAGAGAGCCTCGTGAGATGAAGTTGAAGGAAGAAGAAGAAAGGGGTTTTTGGTTTAATGCTAAAGAAGAAAGGGAGAAAGTTCAATCCAACCTCCCATGGCAGCAAGGTGAACATTTCTGCCCAGTTCTGATATTTTGATCATTTCGAGCTCCACAAGAAGAGGTAAGAGTTGAGGTTTGATTTGCTTGAAAGTTAATTCATATTTGAACAAAATTCGTGAAGGAAGCTCGAGAAAATTCTTAAACTAAGTTACTGATTTGTTAAGAGGAAGGCAGAGGAAAGATTTTGCTGTCAAATCTGAGAACCCTCTGGTTTTTGGGTCGTGCCGATCCATAAAATTCGAATTAGAAGTTGTAAGAAGTATGGTTGGAAAGCTTATGAAATGTCCTATAACTTTCATGAAGAACTCGTGAGTATAAAACGACTAGAAGATGTTTTAAATTTCAGGTCACCTTTTGTGTGCAGCAGGGGAACGAATCTGGAAATAGCAGAAGTTTGTTTTTAGGGAAGAGACTTTCGAGGTACTTTCGAACAAATACGTAAAGAATTTCAATTTTATTCCTTCATATTAGTTTGAGGGTTCAAAAAAGAAGAGTTTGGTATAAAGTTCATCCTTTTTGGGTAAGTAATGAAGAAGTTATGAGAGTATGAAGTCAAGGCATGAAATGTGTAAAATGCTGGAAAAAAAAGATGAAGTGTTTTTTTTGTAATTTTAAGGTTAAAATTCATGAGCTCGAGGTTCTAATACTTGTTTTTTGTTTCTTTGACAGACCAAGAGGAAATAGGTCGAGTGTACAGACCAAAGTTCAAGTTCTAACGTTGTGAGTGACAAAACGAACCTCCAAAACGATCTTTATACTTGAATTATGCTTAAGGAAGTACAAAGTTTTCCTAGAATGAAATTACTTAGATGCTTTATGTATTTTTTTTTAACACATACTTATGAGCATACATGTTTTCTTAAAAATGATTTCAAGTGCTCGTTGTGAGCCTAGATTTCCCGATGAAATTTATATGTGAGCATATGTTGTTGGTTACCGAAATTTTGTGAAAATGTTATGATTTCAAGCAAGCATGAAGCAAGTATTATTTAAAGATTTCCAGTTTTTTATGAACAAGCTGAGTAAGCTTGAGTTTTACGAAAGAAAGATAAGCAGGCATGTTCAGGTATTTATATGTGTTTTTCTGAGACTCGAATGAGTAAATGACTGAGATATTGAGCCCGAGGTTGTATAGTATCGTGTGCACACAGGTCATTTTCTGTTGTCGACGTTGAGTGTACTCCGTGACAACAATGCTGTCGTGAGTGTTGGGTGGGCCCCACTACGACAAAGACGATGGGAGTGTTGGGCAGACCCCACTACATTGTAGAGTAAACGTTGGTTGTACTGGGTGTGTCCTACACAACGTAGATTGTCATACGTTTTAAAATTGTTAGTAGTATCGATATGCCTTGCGAGATTTTAATGACGTACTTGCTGGATTTCTGAAAAGCTATTATGATTACACGTGTATTTTTAACGCTTGATTACAGATGCATATGCTCATAGGTTTTCAGGTGATATAAATTATAAGTTATTAGATCTTAAACTCAATCACTCACTTAGCTTCATAGCTCATTCTTTCAGTGTTTTCAAACTTTTCAGGTAGAGATCGAGCTCTCGGTGCCTGATAACCTGCCATAGTCTACTGGAAGCTCCACGAGTTTGATGTTTTGTACGTGAATGGAGTTGTATAGCAAGTCTATATGTATTGTAAGAGGACCATGAAGTTATGTTTGTGTGTACATTCTAGGTTGTGGTTGTGTAAACCATGCTTGGGTATGTTTTTGGTTGTGATGCTCATACCTGTTCCAGTTTTCAAGAGAGTCTTAAGTAGGGTTACTCCTTACGTAGGTTTCGTAATTTTAATATTCCGTTGTGTTATGTTTAAAGTGTTTCTTATACAAGTATAGTATGCTTATCAGGTAAAATAGGGTCAACAGGTAACGTTAGAGATGTGAACGATGTCTGTTGGCTTCACGCCATCTGCCGGGCTAAGTTAACAGGTAGTTCGGGAAGGGGTGTGACAACCTGAAGTTGGTGATCGCGAAACTGCAACTCTTGTGACAACAGAGTTTTCAACGATACTCAAGAATAACTACTGCACAAAAGTAAATACCACAAATAAATGTTGCTCACATGCAAAAATATACCCATGTAAATATACCCACGCACAAGCACGAGCATGGAATTTTCTTCCACGAATAAGAAAGCTCACGAGCAAATATACCCACTACAAACGGCAGTGATCAAACAAGCTATAGGCAGAAATAAAAAACCAGAGGTTGCGCAC

mRNA sequence

GTAACTTTCTCTTCTCCCTTCCAGACTCCCTCTTTTATTACGCAGATTCCCTCGTTTTTCGACTCTCTCTCTCATTCTTCTCCTTTGTTTCTCTGATTTTCTTCCTCAAATTCCATCGTTCCTCTAATTTCTTCCTCAAACTCCCTCGTTTCTCTATTTTCTTCGATTTGGTTCCTTAGAGACCCTCGTTTTCGATGGAGATCGAGTTTGATCTAGGGTCAGAAGAGCGATTTTCGGGTTCGATTTTGGGGTCGGAGGAGTGATTGGTTCAATTTCTCGTTTTCTACCCAGATTTGTAAGTCTGCCTCTCCATCCTTCATGAAATCATTCGATTCATTATTTTTTTACATACCACAATCATTGTTCTTGCAAGATCCATAACCTCTTCTCCTTCTCTGTTGAATCGACCATTTTTTTCAATTTCTTCCCACAGTCGTAAATCCTAACAATGGATGTCGCATGTCGATTTCCTTGCTCTCTCCATCTCACTCCCCCTGCCGTCTATTCCCTAAGACTCGTCCCTTGTCAAAGTCCTTGCTTTTCCTCTATATGACAATTCCCTCATCCACTACTAACCTATCAATAGATGGGTTAGTTCGCCTGAACTCATGTTCTATCTGAGGGATGAGTGAGGAATTTCATGGTAGGTGAGTTGATTTGGTATGTAGAGCTTGATGGAGGGTTGAACGACAGGAAGATCTTCAGATCATTGCCCAAAAGTTGAGAAGCTAAAGTCACGACGATTCAAGAAGCCAAAGATCTCTCAACACTTCCTTCAGAGGAGCTCATTGGGTCTCTCATGAAACATGAGATAATCATGAAGGGCAACATGGAGAATGACGTAAAGAAGAAAAAGAACCTAGCATTAAAGTCTACTCAAGCTCAAGACGGTTCTGAAAGTGAGAATGAATTAAATGATGATGAATATTCGTATCTGGCAAAAAAGTTCAAGAGGTTTCTTAAAAAGAAAATTTTCTCCAAGAAAAGTAACAGTCAAGAGGGAAAGGGAGAAAAGAGCAATAGGGACGCAATTATATGTTTTGAATGTAAGAAGCCTGGACACGTGAAGTGTGATTGTCCACCAAGGAAGTCATCTTTGAAGAAGAGCAAGAAGGCCATGAAAGCTACTTGGGATGAAAGCGACGAGAGTGAATCTGAAAGTGAAGGAGAAGAAGCCAACCTCTGTTTTATGGCATTCGGAGACGATGACGATGATGATGAGAAAATCAGAGAGCCTCGTGAGATGAAGTTGAAGGAAGAAGAAGAAAGGGGTTTTTGGTTTAATGCTAAAGAAGAAAGGGAGAAAGTTCAATCCAACCTCCCATGGCAGCAAGGTGAACATTTCTGCCCAGTTCTGATATTTTGATCATTTCGAGCTCCACAAGAAGAGGTCACCTTTTGTGTGCAGCAGGGGAACGAATCTGGAAATAGCAGAAGTTTGTTTTTAGGGAAGAGACTTTCGAGACCAAGAGGAAATAGGTCGAGTGTACAGACCAAAGTTCAAGTTCTAACGTTGTAGAGATCGAGCTCTCGGTGCCTGATAACCTGCCATAGTCTACTGGAAGCTCCACGAGTTTGATGTTTTGTACGTGAATGGAGTTGTATAGCAAGTCTATATGTATTGTAAGAGGACCATGAAGTTATGTTTGTGTGTACATTCTAGGTTGTGGTTGTGTAAACCATGCTTGGGTATGTTTTTGGTTGTGATGCTCATACCTGTTCCAGTTTTCAAGAGAGTCTTAAGTAGGGTTACTCCTTACGTAGGTTTCGTAATTTTAATATTCCGTTGTGTTATGTTTAAAGTGTTTCTTATACAAGTATAGTATGCTTATCAGGTAAAATAGGGTCAACAGGTAACGTTAGAGATGTGAACGATGTCTGTTGGCTTCACGCCATCTGCCGGGCTAAGTTAACAGGTAGTTCGGGAAGGGGTGTGACAACCTGAAGTTGGTGATCGCGAAACTGCAACTCTTGTGACAACAGAGTTTTCAACGATACTCAAGAATAACTACTGCACAAAAGTAAATACCACAAATAAATGTTGCTCACATGCAAAAATATACCCATGTAAATATACCCACGCACAAGCACGAGCATGGAATTTTCTTCCACGAATAAGAAAGCTCACGAGCAAATATACCCACTACAAACGGCAGTGATCAAACAAGCTATAGGCAGAAATAAAAAACCAGAGGTTGCGCAC

Coding sequence (CDS)

ATGAAACATGAGATAATCATGAAGGGCAACATGGAGAATGACGTAAAGAAGAAAAAGAACCTAGCATTAAAGTCTACTCAAGCTCAAGACGGTTCTGAAAGTGAGAATGAATTAAATGATGATGAATATTCGTATCTGGCAAAAAAGTTCAAGAGGTTTCTTAAAAAGAAAATTTTCTCCAAGAAAAGTAACAGTCAAGAGGGAAAGGGAGAAAAGAGCAATAGGGACGCAATTATATGTTTTGAATGTAAGAAGCCTGGACACGTGAAGTGTGATTGTCCACCAAGGAAGTCATCTTTGAAGAAGAGCAAGAAGGCCATGAAAGCTACTTGGGATGAAAGCGACGAGAGTGAATCTGAAAGTGAAGGAGAAGAAGCCAACCTCTGTTTTATGGCATTCGGAGACGATGACGATGATGATGAGAAAATCAGAGAGCCTCGTGAGATGAAGTTGAAGGAAGAAGAAGAAAGGGGTTTTTGGTTTAATGCTAAAGAAGAAAGGGAGAAAGTTCAATCCAACCTCCCATGGCAGCAAGGTGAACATTTCTGCCCAGTTCTGATATTTTGA

Protein sequence

MKHEIIMKGNMENDVKKKKNLALKSTQAQDGSESENELNDDEYSYLAKKFKRFLKKKIFSKKSNSQEGKGEKSNRDAIICFECKKPGHVKCDCPPRKSSLKKSKKAMKATWDESDESESESEGEEANLCFMAFGDDDDDDEKIREPREMKLKEEEERGFWFNAKEEREKVQSNLPWQQGEHFCPVLIF
Homology
BLAST of Tan0011780 vs. NCBI nr
Match: XP_038895919.1 (uncharacterized protein LOC120084093 [Benincasa hispida])

HSP 1 Score: 189.5 bits (480), Expect = 2.6e-44
Identity = 101/143 (70.63%), Postives = 118/143 (82.52%), Query Frame = 0

Query: 1   MKHEIIMKGNMENDVKKKKNLALKSTQAQDGSESENELNDDEYSYLAKKFKRFLKKKIFS 60
           M+HEIIMK N+E DVKKKKNL LKSTQ Q+ SE+E ELND+E++YL KKFK+  +K+ FS
Sbjct: 211 MRHEIIMKANVEEDVKKKKNLELKSTQVQEDSETEAELNDEEFAYLNKKFKKHFRKRNFS 270

Query: 61  KKSNSQEGKGEKSNRDAIICFECKKPGHVKCDCPPRKSSLKKSKKAMKATWDESDESESE 120
           KK N+QEGKGEKSNRD IIC+ECKKPGHV  D P RK+  KKS+KAMKAT DESDESE E
Sbjct: 271 KKVNNQEGKGEKSNRDTIICYECKKPGHVNWDYPQRKAISKKSRKAMKATLDESDESEFE 330

Query: 121 SEGEEANLCFMAFGDDDDDDEKI 144
           SE   ANLC MAF DDDDDD+++
Sbjct: 331 SEEGVANLCVMAFRDDDDDDDEV 353

BLAST of Tan0011780 vs. NCBI nr
Match: XP_038885875.1 (uncharacterized protein LOC120076181 [Benincasa hispida])

HSP 1 Score: 182.6 bits (462), Expect = 3.2e-42
Identity = 102/149 (68.46%), Postives = 118/149 (79.19%), Query Frame = 0

Query: 7   MKGNMENDVKKKKNLALKSTQAQDGSESENELNDDEYSYLAKKFKRFLKKKIFSKKSNSQ 66
           MK N+E DVKKKK+LALKSTQ Q+ SE+E ELND+E++YL KKFK   +K+ FSKK N+Q
Sbjct: 1   MKANVEEDVKKKKSLALKSTQVQEDSETEAELNDEEFAYLTKKFKMHFRKRNFSKKVNNQ 60

Query: 67  EGKGEKSNRDAIICFECKKPGHVKCDCP--------PRKSSLKKSKKAMKATWDESDESE 126
           EGKGEKSNRD II +ECKKPGHVK DCP         RK+  KK+KKAMKATWDESDESE
Sbjct: 61  EGKGEKSNRDTIIYYECKKPGHVKWDCPQRKAISKKKRKAISKKNKKAMKATWDESDESE 120

Query: 127 SESEGEEANLCFMAFGDDDDD-DEKIREP 147
           S+SE E ANLC MAFGDDDDD DE  ++P
Sbjct: 121 SKSEEEVANLCVMAFGDDDDDNDEPCQDP 149

BLAST of Tan0011780 vs. NCBI nr
Match: XP_022143648.1 (uncharacterized protein LOC111013509 [Momordica charantia])

HSP 1 Score: 179.9 bits (455), Expect = 2.1e-41
Identity = 98/143 (68.53%), Postives = 118/143 (82.52%), Query Frame = 0

Query: 1   MKHEIIMKGNMENDVKKKKNLALKSTQAQDGSESENELNDDEYSYLAKKFKRFLKKKIFS 60
           M HEI+MKGNME DVKKKK+LALKST  Q  SESE ELN++E +YL+KKFK+  KK+ F 
Sbjct: 57  MTHEIVMKGNMEEDVKKKKSLALKSTSFQRASESEEELNEEELAYLSKKFKKHFKKRHFP 116

Query: 61  KKSNSQEGKGEKSNRDAIICFECKKPGHVKCDCP-PRKSSLKKSKKAMKATWDESD-ESE 120
           KK+NSQ+ KGEK+ RD IIC+ECKK GHV+ +CP  RKSS +++KKAMKATWDESD  S+
Sbjct: 117 KKTNSQDAKGEKNTRDIIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDGGSD 176

Query: 121 SESEGEEANLCFMAFGDDDDDDE 142
           SES  E ANLCFMAFGD+DDD+E
Sbjct: 177 SESGEEVANLCFMAFGDEDDDEE 199

BLAST of Tan0011780 vs. NCBI nr
Match: XP_038882256.1 (uncharacterized protein LOC120073483 [Benincasa hispida])

HSP 1 Score: 177.9 bits (450), Expect = 7.9e-41
Identity = 94/127 (74.02%), Postives = 105/127 (82.68%), Query Frame = 0

Query: 7   MKGNMENDVKKKKNLALKSTQAQDGSESENELNDDEYSYLAKKFKRFLKKKIFSKKSNSQ 66
           MK N+E DVKKKKNLALKSTQ Q+ SE+E ELND+E++YL KKFK   +K+ FSKK N+Q
Sbjct: 1   MKANVEEDVKKKKNLALKSTQVQEDSETEAELNDEEFAYLTKKFKNHFRKRNFSKKVNNQ 60

Query: 67  EGKGEKSNRDAIICFECKKPGHVKCDCPPRKSSLKKSKKAMKATWDESDESESESEGEEA 126
           EGKGEKSN+D IIC+E KKPGHVK DCP RKS  KKSKKAMKATWDESDESESESE E A
Sbjct: 61  EGKGEKSNKDTIICYEYKKPGHVKWDCPQRKSISKKSKKAMKATWDESDESESESEEEVA 120

Query: 127 NLCFMAF 134
           NLC   F
Sbjct: 121 NLCVWLF 127

BLAST of Tan0011780 vs. NCBI nr
Match: XP_031741720.1 (uncharacterized protein LOC116403915 [Cucumis sativus])

HSP 1 Score: 134.4 bits (337), Expect = 1.0e-27
Identity = 78/150 (52.00%), Postives = 106/150 (70.67%), Query Frame = 0

Query: 1   MKHEIIMKGNMENDVKKKKNLALKSTQAQDGSESENELNDDEYSYLAKKFKRFLKKKIFS 60
           M HEIIMK ++E++ KKKK++ALK+   +   E E+ L++D+ +Y ++K+K F+K+K + 
Sbjct: 211 MTHEIIMKEHLEDESKKKKSIALKTISLEVDPEDEDGLDEDDIAYFSRKYKNFIKRKKYF 270

Query: 61  KK--SNSQEGKGEKSNRDAIICFECKKPGHVKCDCPPRKSSLKKSKKAMKATWDESDESE 120
           KK  S  +E KGEKS +D +IC+ECK+ GH++ DCP  KSS K  KKAMKATWD+S ESE
Sbjct: 271 KKHLSTQKESKGEKSKKDEVICYECKRSGHIRTDCPLLKSSKKSKKKAMKATWDDSSESE 330

Query: 121 SESEGEEANLCFMAFGDDDD--DDEKIREP 147
           SE E E ANL  MA  D DD  DD+   EP
Sbjct: 331 SEVE-EMANLGLMAHSDKDDEHDDKVTLEP 359

BLAST of Tan0011780 vs. ExPASy TrEMBL
Match: A0A6J1CR79 (uncharacterized protein LOC111013509 OS=Momordica charantia OX=3673 GN=LOC111013509 PE=4 SV=1)

HSP 1 Score: 179.9 bits (455), Expect = 1.0e-41
Identity = 98/143 (68.53%), Postives = 118/143 (82.52%), Query Frame = 0

Query: 1   MKHEIIMKGNMENDVKKKKNLALKSTQAQDGSESENELNDDEYSYLAKKFKRFLKKKIFS 60
           M HEI+MKGNME DVKKKK+LALKST  Q  SESE ELN++E +YL+KKFK+  KK+ F 
Sbjct: 57  MTHEIVMKGNMEEDVKKKKSLALKSTSFQRASESEEELNEEELAYLSKKFKKHFKKRHFP 116

Query: 61  KKSNSQEGKGEKSNRDAIICFECKKPGHVKCDCP-PRKSSLKKSKKAMKATWDESD-ESE 120
           KK+NSQ+ KGEK+ RD IIC+ECKK GHV+ +CP  RKSS +++KKAMKATWDESD  S+
Sbjct: 117 KKTNSQDAKGEKNTRDIIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDGGSD 176

Query: 121 SESEGEEANLCFMAFGDDDDDDE 142
           SES  E ANLCFMAFGD+DDD+E
Sbjct: 177 SESGEEVANLCFMAFGDEDDDEE 199

BLAST of Tan0011780 vs. ExPASy TrEMBL
Match: A0A6J1DUJ5 (uncharacterized protein LOC111024159 OS=Momordica charantia OX=3673 GN=LOC111024159 PE=4 SV=1)

HSP 1 Score: 131.3 bits (329), Expect = 4.1e-27
Identity = 76/137 (55.47%), Postives = 101/137 (73.72%), Query Frame = 0

Query: 11  MENDVKKK-KNLALKSTQAQDGSESENELNDDEYSYLAKKFKRFLKKKIFSKK--SNSQE 70
           ME++ KKK K++ALK+   +  S+ EN L++D+ +YL++K+K F+K+K   KK  SN +E
Sbjct: 1   MEDEKKKKEKSIALKAITLEVDSKGENALDEDDVAYLSRKYKNFIKRKKQFKKNFSNQKE 60

Query: 71  GKGEKSNRDAIICFECKKPGHVKCDCPPRKSSLKKSKKAMKATWDESDESESESEGEE-A 130
            KGEKS +D +IC++CKKPGH++ DCP  KSS K  KKAMKATWD+SDES SESE EE  
Sbjct: 61  SKGEKSKKDEVICYKCKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSDESGSESENEEVV 120

Query: 131 NLCFMAFGD--DDDDDE 142
           N CFMA  D  D+ DDE
Sbjct: 121 NFCFMAHSDKEDEQDDE 137

BLAST of Tan0011780 vs. ExPASy TrEMBL
Match: A0A5A7TZF4 (Zf-CCHC domain-containing protein/UBN2 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold216G00480 PE=4 SV=1)

HSP 1 Score: 125.6 bits (314), Expect = 2.3e-25
Identity = 73/145 (50.34%), Postives = 101/145 (69.66%), Query Frame = 0

Query: 1   MKHEIIMKGNMENDVKKKKNLALKSTQAQDGSESENELNDDEYSYLAKKFKRFLKKKIFS 60
           M HEIIM  ++E++ KKKK++ALK+   +   E E++L+ D+ +YL++K+K F+K+K + 
Sbjct: 22  MTHEIIMTEHLEDESKKKKSIALKTISLEVDPEDEDDLDQDDITYLSRKYKNFIKRKKYF 81

Query: 61  KK--SNSQEGKGEKSNRDAIICFECKKPGHVKCDCPPRKSSLKKSKKAMKATWDESDESE 120
           KK  S  +E KGEKS +D +IC+ECKK  H++ DCP  KSS K  KKAMKATWD+S ESE
Sbjct: 82  KKHLSTQKESKGEKSKKDEVICYECKKSNHIRTDCPLLKSSKKSKKKAMKATWDDSSESE 141

Query: 121 SESEGEEANLCFMAFGD--DDDDDE 142
           SE E +   L  MA  D  D+ DDE
Sbjct: 142 SEVE-KMTRLGLMAHSDKEDEQDDE 165

BLAST of Tan0011780 vs. ExPASy TrEMBL
Match: A0A5D3E1G2 (Zf-CCHC domain-containing protein/UBN2 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold64G00350 PE=4 SV=1)

HSP 1 Score: 125.2 bits (313), Expect = 2.9e-25
Identity = 73/145 (50.34%), Postives = 100/145 (68.97%), Query Frame = 0

Query: 1   MKHEIIMKGNMENDVKKKKNLALKSTQAQDGSESENELNDDEYSYLAKKFKRFLKKKIFS 60
           M HEIIM  ++E++ KKKK++ALK+   +   E E++L+ D+ +YL++K+K F+KKK + 
Sbjct: 22  MTHEIIMTEHLEDESKKKKSIALKTISLEVDPEDEDDLDQDDITYLSRKYKNFIKKKKYF 81

Query: 61  KK--SNSQEGKGEKSNRDAIICFECKKPGHVKCDCPPRKSSLKKSKKAMKATWDESDESE 120
           KK  S  +E KGEKS +D +IC+ECKK  H++ DCP  KSS K  KK MKATWD+S ESE
Sbjct: 82  KKHLSTQKESKGEKSKKDEVICYECKKSNHIRTDCPLLKSSKKSKKKTMKATWDDSSESE 141

Query: 121 SESEGEEANLCFMAFGD--DDDDDE 142
           SE E +   L  MA  D  D+ DDE
Sbjct: 142 SEVE-KMTRLGLMAHSDKEDEQDDE 165

BLAST of Tan0011780 vs. ExPASy TrEMBL
Match: A0A6J1DY46 (uncharacterized protein LOC111025259 OS=Momordica charantia OX=3673 GN=LOC111025259 PE=4 SV=1)

HSP 1 Score: 117.5 bits (293), Expect = 6.1e-23
Identity = 68/126 (53.97%), Postives = 88/126 (69.84%), Query Frame = 0

Query: 26  TQAQDGSESENELNDDEYSYLAKKFKRFLKKKIFSKK--SNSQEGKGEKSNRDAIICFEC 85
           T + D    EN L++D+ +YL++K+K F+K+K   KK  SN +E K E S +D +IC+EC
Sbjct: 63  TLSMDELIGENALDEDDVAYLSRKYKNFIKRKKQFKKNFSNXKEXKSEXSKKDEVICYEC 122

Query: 86  KKPGHVKCDCPPRKSSLKKSKKAMKATWDESDESESESEGEE-ANLCFMAFGD--DDDDD 145
           KKPGH++ DCP  KSS K  KKAMKATWD+SDES +ESE EE AN CFMA  D  D+ DD
Sbjct: 123 KKPGHIRTDCPFLKSSKKSKKKAMKATWDDSDESGNESENEEVANFCFMAHSDKEDEKDD 182

Query: 146 EKIREP 147
           E   +P
Sbjct: 183 EITLDP 188

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_038895919.12.6e-4470.63uncharacterized protein LOC120084093 [Benincasa hispida][more]
XP_038885875.13.2e-4268.46uncharacterized protein LOC120076181 [Benincasa hispida][more]
XP_022143648.12.1e-4168.53uncharacterized protein LOC111013509 [Momordica charantia][more]
XP_038882256.17.9e-4174.02uncharacterized protein LOC120073483 [Benincasa hispida][more]
XP_031741720.11.0e-2752.00uncharacterized protein LOC116403915 [Cucumis sativus][more]
Match NameE-valueIdentityDescription
A0A6J1CR791.0e-4168.53uncharacterized protein LOC111013509 OS=Momordica charantia OX=3673 GN=LOC111013... [more]
A0A6J1DUJ54.1e-2755.47uncharacterized protein LOC111024159 OS=Momordica charantia OX=3673 GN=LOC111024... [more]
A0A5A7TZF42.3e-2550.34Zf-CCHC domain-containing protein/UBN2 domain-containing protein OS=Cucumis melo... [more]
A0A5D3E1G22.9e-2550.34Zf-CCHC domain-containing protein/UBN2 domain-containing protein OS=Cucumis melo... [more]
A0A6J1DY466.1e-2353.97uncharacterized protein LOC111025259 OS=Momordica charantia OX=3673 GN=LOC111025... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3D4.10.60.10coord: 59..108
e-value: 7.8E-6
score: 27.7
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 95..114
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 95..126
IPR001878Zinc finger, CCHC-typePROSITEPS50158ZF_CCHCcoord: 80..94
score: 9.42196
IPR036875Zinc finger, CCHC-type superfamilySUPERFAMILY57756Retrovirus zinc finger-like domainscoord: 60..101

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0011780.1Tan0011780.1mRNA
Tan0011780.2Tan0011780.2mRNA
Tan0011780.4Tan0011780.4mRNA
Tan0011780.5Tan0011780.5mRNA
Tan0011780.6Tan0011780.6mRNA
Tan0011780.8Tan0011780.8mRNA
Tan0011780.9Tan0011780.9mRNA
Tan0011780.11Tan0011780.11mRNA
Tan0011780.3Tan0011780.3mRNA
Tan0011780.7Tan0011780.7mRNA
Tan0011780.10Tan0011780.10mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding