Tan0020838 (gene) Snake gourd v1

Overview
NameTan0020838
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionendonuclease MutS2 isoform X1
LocationLG01: 360238 .. 413172 (-)
RNA-Seq ExpressionTan0020838
SyntenyTan0020838
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAAGGAGGTAAGCTCCTAAATGCTCCACACCTGAGAAGAATGTACTGTTGTAGTGCTGCAGGGGTTGGTACCATCCTAGAGCCACTCTCTGCCGTTCCTTTAAACGATGAGTTGCAACAAGCAAAGGCATCAGTGGCAAAAGCTGAGAAAGATGTTCTCTTTATGCTAACTGAAAAAGTATGGGTGAGATGGTTCAAAAATAATAGGTGATCATAAATAATGTCAAAAGTTCATCAGAGTTAATTTTGTAAATGACGTGGTTTTTTATACACCATTATACTTCTGGTGATTTTAAAGTTTAAACTTTTCATTTTGAAGTCAAGATAAATTATAATATTTATATTTATGTTAAATCACACATGAACCCAAAGGCTTAAGCTAATGGGTTACGGTAAAATTTAATTATATCTAAACTTTAACAATCCCTCTCACTTGTAGGCTTGAAAATTTGTAGAAGGTCCAACAAGTAGAAATTAATATTAATGCAGAGGAAATGATATTATTAGGAGTTCAAACATAGGATTTCCTACACTAGTACCATGTTAAATCACTATTCAACTCAAAAACTTAAACTAATAGGTTATGGTAAATCAAATTATATCAAACTTCAATAATTTATATTTAAAGTATTTTGGTTGATGATATGATATCGTATATAATCCTAAAACTTTAAGCTTTTAGGTAAGAATTAATATAACGACCTTACTTGCCCAAGATCTATTCTATAGGTTGTGCATTTGTGTCCTTGAGATTTATTTAATATCAACTAATACTATCTTATTCATGTTTTGTGGCAGGTGAAAATGGATCTTGAAGACGTTAAAACGCTCATCGACTGTATAATTGAATTAGATGTGGTGAGTAATTATTGCTACACCCTCAGTCCCCCTCCTTCTCTCCCTAACTTAACTATTCTGGTGAATTATGTTTAATCCGCTTCCTTCCCCATCCGTTCTTTGGCCAATCTAGGATGTTCCCAATTTTATGTACTCTCCTATTTGTGGAAGACTATTATCAAAAACAAATATGGCCAGAAAGTTTTTGGATCCCCAATTCGCAGAAAAAGAAAAGAACTCAATGTGTTTGCTTTGTTTTCAGAATGAGGAGTCCTTGAAGAAAGATGAACGGGAAGAAACCCTCCCCCGTCTCTCCTCCTCCCCTTGCCTTGCCCCTAAGTGACGCCTCCGATACCCCTTCTCTTCCTCCGATTTTCTCCGGTGACCGGAATTCGTGGCGTCTCCGATACCTTTTCCCTTCCTTTGATCTTCTCCGATGATTGGATTATCGATCCTTCGTCGGTGACTGGTACTTTCGTCCTAAACCCCCGAATCGAAGCGTCTTCTCGATCTGATCCTAGACTCCTGAGTCGAATCGTCTCCTTGATTCGAAGAGGTCCCCTCTCAATTTAGATGGGTAACAACAGTCGATCATGCTTTTTTCAGTCTCGTTTCAGCCTCTAGTCTTTTCGAACCATCATTGGCGAACGGATTTGCCGCTTCCGAATCGTCACTCCTAGACGTCGTCCGTCTCTTCCGTAGCCGTCTGTATCAACTGTCGTCTGCTATAGTATCCCTTCGTTGTAGTCTGCCTCTGTGAGAAGTCCTCTTGACTTGACGATCGAACTTCCCCCTTGTTGGTTTATCTATTTCCTTATCTAGATAATTACCATTTATTGTATTATATTATTTTATATTGTATTATATTATTTTATTTGATTTGTTCCCTTATTTGTATATTCCATATATGTAAATGGGTATCTCTTTATAAGAACCCTAATTCTGATCCAATCAATAAGAAATATTAGATACACTATTTCTTCATGTATTGTAGAGTGCAACAGAAAATGATCTTGAAACTCAATTTCAGACTAAAATTAGCATTTCTGTTGCACTATGCTAATTTTAGTCTGCAATACCTTTCCTATGTCAAGAATGCTTTTCTCAATGGGGAACTTGAAGAAGAAGTATTTATGGATTTACCACCAGGTTTTGAGAAAAATCTTGGGGTTGACAAAGTATGCAAGCTAAAGAAAAAGCTTGCATAAGAACCCTAATTTCTATACGGCCTTAAAGTATGCAAGCTAAAGAAATCTCTATACGACCTTAAACAATATACTTCCTTTGATTAACAAGGATACCATCTTTTGACAAAGTATGCAAGCTTAAGGATCTCGGAGTCTTGAAATATTTCCTGGGAATGGAATTTGCTAGATCAAAAGATGGTATCCTTGTTAATCAAAGGAAGTATATTCTTGACTTACTTAAAGAGACAGGATTACTTGGTTGTAGAGCAACGCAGAAACTCCCATTGAACCAAATTTAAAATTGCAGTGCAAACAAAAAATGAAGTAAAAGACAAAGCAAGATATCAAAGACTTGTGGGAGATTGATCTACTTGTCTCACACACGCCCGGACATCTCCTTTGCGATCGGTATGGTAAGCCAATTTATGCACGCTCCAGACTAACTCATTTTGAAGCGATCTATAGAATTCTAAGGTACTTGAAAGGTACTCCGAGCAAAGGTGTATTGTTTAAAAGGTATGGTAACCTACAAAGTTGAAGTTTACACCGATGCCATCATTGGGCGAAGACGATAGATAGATAGACGATCCACCTCTGGTTCTTTGTTCCTTTGTTGGAGGAAATTTAGTTACATGGCGGAGTAAAAACAAAGTGTGGTGGCAAGAAGCGATCTGAGGAGTGAGTATAGGGCCCTAGCTCGGGGTATTTGTGAGGGTATATGGATAAAGAGACTATTGGAAGAGTTGAAATTTTGTCCGAACAACCCATACATATTTTTTGTGATAACAAGGCGCGATCTCAATTGCTCACAATCCGATTCTCCATGACAGACAAAGCATATTGAAATTGATAAACATTTTATAAAAGAAAAGATTGATGCAGTGTTATATGCATTCCATATCCACCAACAGACAGAGCAAATTGCGGATGTATTAACGAAAGGGCTTCCAAAGACACGGTTGACAAGATGATCGAAGTGGCAATAGAAGACATCTTCACACCAAATTGAGGGGAGTGTTGGTTTATCTATTTCCTTATCTAGATAATTACCATTTATTGTATTATATTATTTTATATTGTATTATATTATTTTATTTGATTTGTTCCCTTATTTGTATATTCCATATATGTAAATGGGTATCTCTTTATAAGAACCCTAATTCTGATCCAATCAATAAGAAATATTAGATACATTATTTCTTCACCCCTGAATTGAGTTGTCTGTCTATTTCGGGACACTGAGTTATGCGAATGGAGAAGTTTAGTTGCAGTGTCGACTCTCGGAATTATCAGATTTGGCAAGATGAGGCAAAAATTCTCCTATGGGATGGATCAGTACATGGCTCGATTCGACTTTCCCTTGGTCAAGTGGTATGGTTTCGTGATTCTTTTGCTGATATGACGATGAAGGTAACTTCCTATTTCAGGCATAGGAAATTTAGAGATTTGGAATCTACATTAGGGATCTTTAAGACTAAAAAGGAAGAAGGTTGGTTTGTGGAAGGAATGATATGGCCTTCTTCGGGAGGTAGAAAACTGTTTACAGTTCCCTTGGGGAGATATGAGATTGGATGAGCTGTGTTTAGAGATCTCTTAAGTGATTTCATCAATTCTGTGAACCCTCCTCGAAGAATTTTGGAATCGGAGGTGTCTTCAGATGTTCTGGTTCCATTTGAAGGTAATAATGAGCGACCTTTGGAAACACCAAAGACAGTGGCAGTGGACTTCTCAAAGGTTTGGGTGGTCACTCGGTTATTTGCACATGTTCAATGGTTTTCCATTTTGAGCGTATTGGAATCCTATTTTGAATGCAAGGTGCAGATCAATCCGTTTATGAGTGATAAAGCCTTGTTTCTTTTTGACAATGAATTGAATCAGTGCCCTAAGGAAGATGTAGATTTCAGTAGCCGGAAATGGAAAGTCATTGGAAATCTTCATCTTAAAGTCGAAAGATGGGAGGATAAACTACATGGACGACCGACGGTAATGGAAGGATATGGGGGATGGATAGCGGTTAAAAATCTACCTCTTAGGTTATGGAAGAAAGAGGTATTTGAGACCATTGGATTGTATTTTGGGGGGGCTGTTGGAGATTGCTTCAGATACACTAAACCTCATTGACTTATCAATGGCCAAAATCAAAGTTGCACCTAATCTTTGCGGATTCTGCCTAGCTTTTATACCAGTTTAAGATCCAATATCAGGGAGTTTTTGGATCAATTTTGAGAATATGGAGGAGATTAAGGTGCCTACCGAACTCAATGATCCTTTGACTTTGAAAGACGTCAAGAACTTGGTTGACCTGGAACGAATTAAACAAGTGTTGATGGATGAAATTGGGGGTGAAGAGCAACAAGGATCACTGATTGAAGCAGATGAGGTCGATGTGCAAAATAGAAGACCAGTGACAGAGAATTTTGCCCTTAGTAGCAATGATTTAGAAGGATTGGTGGTTGAGGTGCAACAAGGAAGACCCGTGACTGTGAATATTGCTCCTAGTAATGGTTATTTACAAGGAATGACACCTGTAACGATTGATAATCAGCAAGTAATGGATGATCCTAACATTTTTGCATTAATGAAGCAGAGAGAAGATTTAGTTGAAATAACTGCTGTCCCTCTGAATTCCAAAATAGATTTAATTGATTCTTTGATTGCAGATGAATTAAACACTCTGAACGGTCATGTGAAGAAGAAATCGAAAAGGCAAGGTAATCAAGAGGCATCATTTGCTGTGGGAACTCTTAATGAGCAACCGATTGAAGGGGAGCAATTAATAGTGAAGCCCATTTAGCCAAGGAAGCCGAGACTGACTTCGACATTTTTTCAATTTTAGAGCCTGAAAGTGTAAAGGTTGTGAAAGACACTTTTTGTGAGAAGTATTATGTCAGAAGAGGTGGAAAGATGCAAGTATAGAGGCTAGAAACCAAGAGTCAAACGCTTTCCCTGATTTCTTGGAGCGATCCGAAAGGTTAGTACCATTTTCTACCATTCCCTTGATTGATGTGCAGATTAGTCCTGGTACCTCCTAAAAAATACCGAAGTTTGTGGTTGAGCAATGCCATTTGGGGCCTGTTAGAGTCCTTATATTAGTATAATTGTATTTTGTTAATATAACTCTAAGGGTGTCCTTTATTGTCTTTTTACGCTCTTCTATTTAGGTTAACTCTGTATACTTGTATATATATATATCTTGAGTGAATGGAATAGAACATACTGATTCTCTCATACTTTAGCATTAACATGGTATCGTGAGCCTGGGTTTTCTAGGGCAATTAGGGTTCTTTGTTTTTGTTGTCCTTAGGATTTGGAACTAATTAGGGTTTGAGATTTGAAGGTTTAAGTCCTTTGTGTGTGAGGTTTGCTTGCTGCCACTACTATCGTGACTCTCTTCACCCTAGAAATCCATTCTTTGTTGTCATTTGAAAAACATGCCGCCACCGCACCGGAACTAGCACTGCCGCTGGAAAAACCGCCGCTGAAATCTTTTTTTTTTTTTTTTTTTTTTTTTGGGCTTCGCTCGGTTTTGCATACTCTTTCTAGTGGTGTGCTTGTTGGGGTCATATCCTCTCGGTTTAGGGTCGGTTTGAGACACCCTCTTTGGCAATCTAGGATTTTTGGTACAATTCTGCTTTTGGGCCTTTCAATCTGTTTTTTGTGGTCCTTTTTTCTTTTCTTTAGTTATGGTGACAAGAAAGGTATGGTGATGACGGAGATGATCCTATGGTGTCAAAGATTACGAACATAAGCTTAATGGGTCTAATTTTTATGCATGGCGGACAAACATTCGTCACTACATTCGAGTCTCGATATGGATGATCATGTCACCGAAATTCACCACCGAATCGATGATACTAGAAAAAATTGGTTACGCTGATGATTCTAGGCTTATTCTACAAATAAAGAACTCTATTGAGAGCAACATCGAGGGGTTGGTTAATCATTGTGAATCTGTTCACGAGCTCTTGGAATATTTGGAATTCCTTTACTCGAAAAGGGAATATTAGTAGAGTGTTTGATGTATGCAAAACTCTCTTTCAACTCGATCAAGGTGAGAAATCTCGACTAGTTACTTTATGGAGTGCAAAACATCATATGCACGAGTTTAATGCCTTGTTGCCTATTAGTAATGATGTGAAAGTTCAAATCGCACAACGTGAACAACTGCTTCATCATGAGTTTTTAGTTGGTCTCGCTTCTAAATATGATATGGCAAAAGATCAAGTTCTATCTAGCTCGACATCTCCTCATTGGAAGATGCTTATACTCGCATACTTCATCGAGAAGTCGTGGTAACCCTTGCCACTGAGTCAAACAAAGGCTCTTGCCTTGGTTGGGCGGACATGTGTACAAAGGAAACAGAGGGGTTGGACCCAACGCATATAAAGGTAATTCAACACTCGAGAGAGCAAATTCGGGTGATATTGTTTGTCACTATTGCCATAAGCTTTGTCATATGAAGCGTGATTGCGCACGATTGTTGAATAGAGGCCGAGGGTTTCCATCTCGCGCATGTTGCCTCTACACTGATAATCCCGATAAATCTATCTCGATTTCTGCAGAGGAGTTTGCTCAGTTCCGCCAGTATCAAGAGTCGCTAAAGGCGTCATCCTCCACTCCTATTACTGTCATTGCTGACACAGGTAATACATCTACGTGTCTTCTTTCCTCCTCTTCCAAATGGGTCATAGACTCGCGCGCTCGATCATATGATAGGTAATCCAAATTTATTCTCTCATATCTGTCCGTATAAATCTTCACCTAATGTTACTATAGCTGATGGGACCACCTCTCATGTTTTAGGGTCGGGCACTGTCAATCTTACCAAGTCAATTTCCTTGTCTTCTGTTTTAAGTTTACCGCAGTTCTCCTTTAACTTGATTTCAGTTAGTAAGCTCACTCGAGATCTAAATTGTTGTGTCTTATTCTTTCCCAGTTTATTGTTTATTTCGTGATCTTTTGTGAAGAAGACTATTGGTAAAGGGCACGAATCGGGAGGTCTCTATATTTTTGAACCACACATATCCACAGTAGTTCCATGCTCTCTGTTGACCTCTCCTTTTGAAGAACATTGTCGTTTGGGTCATCCATCTTTTCTCGTCTTGAAGAGTGTTCGTCCTCAATTTAATCACTCGCCTTCGTTAGATTGCGAGTCATGTCGGTTTGCTAAATTTCATCGTCTAAGTATGTATCCTAGAGTCAATAAACGAGCTAATGCTCCATTTGAGTTAGTTCATTCGATGTTTGGGGTTCTTGTCCTGTTGAGTCTAAGCGAGGGTTTCGGTACTTTGTTACATTTGTCGATGATTTTTCTCGTGTCACGTGGTTATATTTAATGAAGAATCGCTCGAGTTACTTTCTCATTTTCGCAATTTTCATGCTGAAATTGCATACTCAATATCATGGTTCTCTTAAAGTTCTGCAAAGTGATAATGCTAAGGAATACTTTTCTCATGCTCTTAGTTCTTATCTAAGTGAACATGACATTCTACATCAATCCTCATGTGTTGATACTCCGTCTCAAAATGGAGTTGCGAACGAAAGAATCGCCATCTCCTTGAAACAACAAGAGCCTTAATGTTTCGTATGAATGTCCCCAAACAATTTTGGGGTGATGCTATATTGACGGCTTGTTTCTTAATCAATTGTATGCCCTCGCTTCGGTTCTTAAGGTGAGATACCTTATCGTATTTTGTGCCCCACCCTCCTCCGTTCCCTCTCCCAATCGAAAATCTTTAGTTGTACTTGCTTTGTTCGAGATGTTCGCCTCTCTCTCACCAAACTCGATCCAAAATCTTTAAAATGTGTCTTCCTTGGTTATTCTCGTGTCCAAAAAGGGTATCGGTGCTATTCTCCTAGTCTTGATCGTTACTTTGTCTCTCCCGATGTCACGTTCTTTGAAGATTCTTCTTTCTTTACTTCATCTTCGAATACGTGTCAGGGGGAGCATTGAGAGGACGAAGATGATTTTCTTGTTTATACAATTTTCTCCTCTTGTGAGTCTTCGGGTGCTTCTTCTCCATCTCCATCCGTTTGTATTAACCCACCTATCACTCAGTTTATTCTCGACGACGACCTCCTTCGGATGCATGCCTGCACTGCAGGCTTCTTCATTGGATTCGAGGAACAAACGATGATCTTCCTATTGCACTTCGTAAAGGTAAACGTCAATGCACATATCCTATTTCCTTTTTTGTTTCATATAACCATTTGTCATCTTCTACTTGTTCATTCATTGCATCCCTCCAATCTATATCTGTTCCTAAGACTGTTCCTGAAGCTTTGTCTCATCCTCGTTTGGCGTGTTGCAATGGTGGAGGAGATGATCGCCTTAGATGACAATTGTACTTGGGATTTAGTGTCTCTTCCTGCGCAGAAAGAAATCTATTGGTTGTAAGTGGGTGTTTGCGGTTAAAGTCAATCCGGATGGATCGGTTGCTCGATTGAAAGCTCGCCTTGTTGCTAAAGGCTACGCACAAACTTATGGGATTGACTATTCGATACATTTTCTCCTCGTTGCTAAAATGACTTCACAAAGTTATTCATTTCATTAGCATCGATTCATCACCGCCATTGTATCAACTTGATATTAAAAATGCCTTTCTTCATGGTGATCTTGGGGAAGAGGTGTATATGGAGCAACCTCCGTGGTTTGTTGCTCGGGGAGAAGGGAAAGGTATGTCGCCTTCGTAAATCCTTGTATGGATTAAAGCGAGTCCACGAGCATGGTTTGGGAAATTCACTCGGTGATTGAAAGCTTTGGAATGAGAAAGAGCAAATCGATCATTCATGTTTTACAAGCGGTCCGAGAGTGGTGTCATCCTGTTGGTTGTGTATGTTGACGATATTGTGATTACAGGAGATGATACATTGGGCATCCAATCTCTCAAGACCTTTCTCCATAGTCAATTCCACACGAAAGATTTGGGAATGTTGAAATACTTTCTAGGAATTGAGGTAATGAGAAGCAAGAAGGGAATCATGCTATCACAGAGAAAGTATATTATTGACTTGTTGATCAAACAGGAAGTTAGGGGCTAAGCCATGTAGTACCCCGATGATGCCTAATGTACAACTCACAAAAGAGGAGAATTATTGAAAGATCCCGAACGATATAGGAGGTTGGTGGGAAAACTCAATTATCTTACAGTAACTCGACCAGACAAAGCTTATTCAGTGAGTATTGTGAGCCAGTACATGTCTTCTCCTACAATTGATCATTGGGCTGCAGTGGAACAGATTTTGTGTTATTTGAAAGCAGCTCACGGCGTGGTTTATTATATAAGAGCTATGAGCATATGAACATTGAATGTTTCTCGATCTTTGATTGGGCGGGATCTAAGGAAGATAGAAGGTCAACGTCGAGGATATTGTGTATTCGTTGGTGGTAATTTGGTTTCTTGGAAGAGTAAGAAACAAAATGTGGTTTCACGATCAAGTGCTGAGTCAAATATAGAGCAATGGCACAATCTGTCTGTGAATTAGTTTGGATACGTCAACTTCTTATTGAATTGGGATTTGATATCACAACACCGACCAAATTGTGGTGTGATAATCAAGCAGCTCTACATATAGCATCTAATCCAGTATTTCATGAGAGAACTAAACACATTGAAGTTGATTGTCATTTTGTACGTGAGAAGATACAACAAGGTTTGATATCGACGGGATATGTGAAGACCGAGAGCAATTAGGAGATATCTTCACCAAAGCATTGAACGGAGCACGAATTGATTACCTCTCTAACAAGTTGTGCATGATGGATATATATGCTCCAACTTGAGGGGGAGTGTTAGAGTCCTTATATTAGTATAATTGTATTTTGTTAATATAACTCTAAGGATGTCCTTTATTGTCTTTTTACGCTCTTCTATTTAGGTTAACTCTGTATACTTGTATATATATATCTTGAGTGAATGGAATAGAACATACTGATTCTCTCATACTTTAGCATTAACAGGGCCGAAGGAACTCTCTTTTATTAAAGCGTCTTTCTTAATTGACGAAAACCAGAATAAGGAGCTCTCTCATTTTCCCTTGTCTTTCGGTCAAAATTTAATTGTTGAATATGATATTTGTCTTAGCAGCCTGGATCTGAGTTCGATCAAGAATGTAAGTAGAAAGTCATTAACAAAATTTTCTCCAATTATTAATCTGTCAAGTTTGTTTGATTCTCTTGGATCAGTGTCTTTTTCTAGAAGCAAAGTGTCCACGGAAGCTTTGACTTGCCCAACCGGTTTGATACCTTTTGAAGCATTAATAAAGGCAAGTGGGTTGCAATTTAAAGAGATCTCAACAAAGGGATTTCCAGGAGTGTCCAAATGAAAATCTTGTCGTGGAACACGTGCGGATTGGGGGATGGGTCGAAGAGGATAAGTTTGAAAAAGGTACTGCAAAAATTAAATCCTGATATTGTTCTTATTCAAGAGACTAAGTTACTGTGTGTGGATGAGAAAATAATCAAATCCATGTGGAGTTCAAGGGATATTGGATGGGCCCACGTTGATGTCTTTGGTAAATCAGGGGTATTGCTAATCATGTGGGATGAAACAAAAGTTCAGGTAAAGGAAATTTTGAAAGGAGGGTACTCCATCTCCATAAATGCACACTCTATAATTGGGAATGCTTTTTGGGTATCCAACTTTTATGGACCAAACCATTACCGTGAGAGAAGATTTCTATGGGATGAATGACGTTCGCTTGCTGGATATTGTGAGGGGCCTTGGTGCTTGGGAGGAGATTTTAATATCACAAGATGGATTCATGAACGTTGGCCTCAAAGAAGGGTGACTAAAGGAATGAAGAATTTTAATAAAATCATAACAGAAATAGGGTTGCTGGAGATACCTCTTTCTAATGGGGTGTTTACTTGGTCAAAACCAAGGAATGAGAATAAGCGATCTCTTATTGACAGATTTTTTTGTTAATGTTGAGTGGGATGTTGTTTTCGAAAATACTAGGGTTGGAAGACAAGCCAAAACTTTTTCAGACCATTTTCCTTTGTTGTTGGAAGCTGGCTCATTCCTTTGGGGACCCTCCCCTTTTCGGTTTTACAATTCGTGGTTGGATGCGGGAGACTATGTTAGACTAGTGGAAGACTCCCTTAGAAAAGACTTCACCTTCACCTTCGGCTGGGAATGTTAAAAGCATTGTCAAAAAATGGTCTGGGGATTTTGAAAAACATAAAAGAGAAAAGGAGACGGACTTGTTGGATGAGATTAACTGGTTGGACAACAAAGCAGATTCTTCTGGCCTTTCCCCTCTAGAGAGTTCAGCTAGGATAGCGGCTTTGGGGAACCTAGCTGATTTGCACTTTAAGGAGGAGAGGGATTTGATACAGAAATGCAAGCTTAATTGGTTAATGGAAGAGGATGAGAATACTTCTTTCTTTCACAGATTCCTGGCAGCAAAGAAGAGGAAAAGATTTATTGGGGAAATTTTGAGTTTGAATGGGGATGCCCTTGTTTTGGTAAGAGATATTGAGAAAGAGATTGTCAGTTTCTTCAGCTCATTGTACTCGCGTATTGATGGAAGTCGTTTTATGCCAAATTCCTTGATATGGGAAATGGTCACTTTCGATCAGAATGCTTCTTTAATTAAAGCCTTTTCACAAGAAGAAATTTTGGGAGCTATTAAAAGTTTGGGAAGAAACAAGGCGCCGGGTCCAGATAGATTCACTACGGAGTTCTATCTTAAGTTTTAGGAGTTTCTTAAAAATGATTTTGATGAGATTCTTTGAAGAGTTTGCAGTCAATGAGCATTTAAATGCAACACAAAAGGAGACCTTTGTTTGTTTAATTCAGAGGAAAGAATTTTCCAGATCTGTCAAAGATTCCCGACCTATTAGTCTGGTTACCTCCTCTTATAAAATTTTAGCAAAGGTGTTGTCGGAAAAGTTGAAGTTGATGATGCCCTATATCATATCGGACACTCAAAGTGCCTTCATTCATGGAAGGCAGATTGTGGATCCCATTTTGATTGCTAATGAAGTAGTAGAAGAGTATCGGGCTAAGAAGTTAAAAGGGTGGGTGTTAAAGATAGATCTTGAGAAGGCCTTCGATTGTGTGGATTGGGAATTCCTTGAGGAAGTAATGACCCGTAAGGGATTTTGCAACAAGTGGATTCAATAGATTTTGGGTTGTATTAAAAACCCTATGTTTTCAGTGTTTATTAGTGGGAGTCCGAAAGGCAGAATAAGGGTTTCTAGAGGTTTAAGGTAAGGGGACCCTCTTTCCCCTTTTCTCTTTTTGCTGATTTCGGAGGTATTATCCTTCCTCATTAAGAGAATGGTTGAGAAAGGGTTATTAGAAGGTTTCGTGGTTGGGAGGGATCGGAATCAGATCTCTATATTATAATTTGCTGATGATACTTTGTTGTTTTGCAAGTTTGATAATGGCATGATGGAGGTTTTAGTTAAAACCATTGCGCTTTTTGAATCCTGCTCGGGGCTCCGAACCAATTGGTCTGAATCAGCTTTATCAGGTGTTAATATTGAAGATTCAGTAATTGATGAGACTGAGAAACTTGGTTGTTATGTGGGTTGTTTTCCGATTCTTTACTTGGGGTTGCCGCTTGGAGGGCACCCAAAAAATTGTGGGTTCTGGCCATTAGTGGGTAATTTTAAGAAGAAAGTGGATAGGCGACGTAAGTTTAATCTTTCTAGGGGTGGTTGTATGACTTTGTGCATGGCGATTCTGTCAAGCTTGTCGTTATATTACTTTTCCTTGTTTGAATGTCCAACAACTGTGAACTTTGAGTTGGAGAAGATATCAGAGATTTCTTCTGGGAAGGTAGGAGTGGCTCAAAGATAAATCATTTAGTTTGTTGGGATCTTGTGTTGCAAGATCGATCGGGGTTTGGCTTTGGTGATTTACGCTGAAGGAATAAGGTTTTTCTAGCCAAATGGGGGTGGAGGTTCGGGATGGAGACCTCGACTTTATGGAGGAGAGTTGTGGCAAGTATTCATGATGTTTCACAGAATGGTTGGAATTCCTTGGTGGCTAGAGGAAGGAGTTTGAGAAGTCCTTGGCTGAACATTGCAAAACAGTGGGGCGAGGTGAATTCTTTGACTCGGCTTCAATTTGGGAATGGTCTTTGCATTCTTTTCTGGGAAGATGTTTGGGTTGGTGATGCTCCTCTTAAAGTCCAATTCCCTTCGTTGTGATAATGCCAAGGGACGGGTGGTGGATTTTTGGGACGAGATGACATTGTCTTGGAGTCCCTCTTTTCGCAGAAATTTGAAAGATGAGGAGGTTGTGGAGTTGGTAATCTTGTTGGGAAGGTTAGAGGGGTTGCGACCATCCCAAAAAGAAGATTGGAGAAAATGGGTTATAGAGCCTTCGGGGCAATTTTTTATGAACTCATTAGTTGCATATTCTTGTAAGGGACAAACTATGGACAATGGGCTATATCAGGCCATATGGTGCTCATCGTGTCCAAGGAGGATCAACATATTAATGTGGACTTTGATTGTACGAAAGGTGAACACTTCGTAAGTTCTTCAGAGGTAACAACCCTCCTTCATGCTTCAATCTTCGATTTGTGGCCTATGTTTGAAGTCTGAGGAAACATCCATTCATTTATTTTTCCAGTGTGATTACGCAATGAATTGTTGGGCAAAGTTGTTTGACATCTTCAACATCAGCTGGGTCTTTTCTAATTCTGTGTCGGGAAATATTTTTCAACTCCTTGTGGGTCTTGTTGGGAGCCCAAAGGCAAAGTCTCTGTGGTTGAATGGAGTTAAAGCTCTTATTTCGGAATTATGGCTGGAAAGAAATCAGAGGTTGTTTGAAGATAAAGAGATTGAATGGTCGAGAAGATTTGACTCGATCACTCTAAAGGCTTCTACTTGGTGCACCCTTTCAAGGGAGTTTTTTGGCTACTTGATCTCCGATTTGTGCTTTTCCTGGCTTCCCTTTTTGAATAGTGTTTAATGTGGCTTCATTTATTTGTTTTATTTGTTTTGTACTTTCTGTTTGCTGGTGGGGGCTCTTGTAGCCTGCCTAGCCTTTGATTTTTACTTTGTTTTGTTTTATCTTGTACTGTTTTCCTTTACATTTTCATTACATCAATGAAATCATTTTGTTGCCTTCTCAAAAAAAAGAATGAGAAGTCCTTGAATCAAATTTTCTTTCATTGTAGTTTTGCAAAGGAATTTTGGGTTAAGTTGTTCACTCTGTTCTCCGTCAGTTGGGCATTCTCTAGTAGTTATAGAGATAATGTAGTGCAACTTTTGATCGGGCCCTCCTTAAACAACCACGTTTCTCTCCTTTGGCGCAATGCGGTTAAGGCGATATTTTATGAGATTTGGTTTGAAAGGAATCAGAGAATTTTTGAAGACAAATATTCAAGCCCCAACGTTAGATTCGACTCTATTCAGCTTAAAGTGTCTTCTTGGTGCCCTCTTTCCAAGTCTTTCTTTGGTTATTCTTGCTCTGATATTAATCTTAATTGGTAAAATTTCATTTATCGTGTGTAACTGCTTTTGTGCTTTGTAATACCCTTAGTGCTTTGTTTTTAGTTTGGTTGATATTTTGTTGCGTGTTCTCCTTTGTGGAGATGTATTTTGAGCATTAATCTCTTTTCATTATTTTTTTTTGAACAAGAGAGAAATGTTTCATTAAGATATATGAAAAGAGACTACTGCTCAAACGATACTGAAGACAGAATGAAATCTCTTTTCTGTATTGATCTCATTAGAATTAGGGTTCTTTAAAGAGACACCCATTTACACATAAGGAATTTACAAGTAAGGAAATGAATCCAGAAATATAATGGAATGCAATAAATGATAATTCTTTACATAAATAAATAAATAAGCCAACACTCCCCCTCAAGCTGGTGTGAAGATGTCTTTCATTTCCAGCTTGCTGACCATCTTGTCAAACTGTGTTTTCGGGAGCCCTTTTGTTAACACATCTGCAATTTGCTCAAATCGTGGCAAATATGGAATGCATATAACACCGCATCAATCTTTTCTTTTATAAAATGTTTATCAACCTCAATGTGTTTTGTCCTCATGAAGGATCGGATTATGAGCAATCGAGATCATTTGCCTTATTATCACAATAAATACGTATGGGTTGTTCTCACATGCATTTCAATTCTTCCAATAGTCTCTTTATCCATATGCCCTCACAAATACCATGAGCTAGGGCCCTATACTCGCCTTTCAAAGATCGCTTCAGCCACTACACTTTGTTTTTACTCTGCCATGTAAATTTCCTCCAACAAAGGAATGTATCCAGAGGTAGATCGCCTATCTGTCATCGTACTACCTCCCAATCTGCATCGATGTAAACTTCAACCTCAGTAGGTTCCCATGCTTCTTAAACAATACACCTTTGCAGAGTGCCTTTCAAGTACACCGAGGATTCTATAGACAGCCCTCAAAATGAGTTGATCTTGGAGCATGCATAAATTGACTTACCATATCGATCAGCAAAGGAGATGTCCTGGACGTGTGTGAGACAGTAGATCAATCTTCCCACAAGTCTTTGATATCTTTCTTTGTCTTTTACTTCCTTTTCTCGCTACACTCTGCAATTTCAAATTTGGTTCAATGGGAGTTTCACGCTTTCTACAACCAAGTAATCCTGTCTCTTCAAGTAAGTCAATAATATACTTAGTTGATTAACAAGAATACCGTCTCTTGATCTAGCAAATTCCATTCCCGGAAATACTTCAAGACTCAGAGATCTTTAATTTGGAATTCCTTAGCTAGGTTTTTCTTAAGGTCAGTTAACTCTGACTCATCATTACCTGTGAGAATAATGTCATCAACATATACTATTAAAACAACAATTTTGTTATTTGCAGTATGTTTATAGAAGATGGTATGATCTGCTTGGCTCTGAAGAAATCCATGACTAGTGACTGCTTTTCCAAACCTTTCAAACCACGCTCTCGGCGATTGTTTAAGACCGTATAGAGATTTCTTTAGCCTGCATACTTTGTCAATCCCAAGATTTTTTTCAAAGCCAGGTGGTAAATCCATAAATACTTCTTCTTCAAGTTCCCCATTAAGAAAAGCATTCTTGACATCAAGCTGATAAAGTGGCCAATCTGCATTCACAGCAATGGAAAGTAGGACTCTAATGGAATTAATTTTAGCAACTGGGGCAAAAGTTTCTTGATAATCAATCCCATAGGTTTGAGTGAAGCCTTTCGCAACAAGTCTGGCTTTATATCTTTCAACGCTCCCATCAGATTTACACTTTATAGTAAATACCCATTTGCACCCTACTGACTTCTTATTTTTTGGTAATTTTACAATGTCCCATGTGTGATTTTGTTTAAGGGCATTCATTTCCTCCATAACTGCTAATTTCCAGTTCGAGTCATCTAGGGCCTCCTGTATATTCCTTGGGACAAACAGATTGGTTATCCTAGAGGTAAAAACTCTATGGCTATCAGACAATCTATGATAAGAAAGGTAGTTTGCAATAGGATATTTAGGATTAGGGCAGTTTCGAGTACCTTTCCTATGAGCTATTGGAATGTCAAGATCAGATATTACAGGTAAGTGATTGTAGGTTGATGAAGAAATGGGATTAGAGGGTATGTTACCTGAACCTTCAGAATCATTCAACGGAGTATCAGATTGGTTTCGTGATAAACCCACTGTCTGGTCTCTATTCTTTTGATTCATCATCCTTCTAGTATAAACCTGAAGTTCAGTATTTCGATCAATTTGATTATTTTGTAGTGTTTCTCCCCCTGAACAAGAATTTTCCATAATTGGCAGCGAAGAACTAGAGCTGGTAATCTCTGGACTAATGATGTTAGGGAGAGGAGAGGTATCCCAAAAATTATCTTCTAAATTAGGTATTATCTCCCCCTGAAGAGAATTTTTAAAAAATGATTGATTTTCCCAAAAAGAAACATTCAGGGTTTCAAAATATCTTTTGGTCACAGGATCAAAACATTTATAGGCCTTTTTGTGGGAGACATACCCTAAAAAGACGCATTTAACAGCTCGAGGATCTAATTTAGATCTAGAGAAGCTAGGTATGTGAACATAAGCAGTACACCCAAACACCTTAATCGGTAAATCAGAAAACAAGCGAACATTAGGAAAGAAGGTTTTTAAGTGATCTAGAGGGGTTTTAAAATCTAAAACTTTTGTAGGCATTCGATTTATAAGGTATGCAGACGTTAGAATAGATTCACCCCACAAATAAGAAGGTACAGTCATAGAGAACATAAGTGCACGAGCAACATCAAGTAAATGATTTACTTTTGCGCTCTGCTATACCGCTTTTGTTGGGGAGTATCTCGACACGTGGATTGATGAAAGATTCCTTTAGCTTGCAGAAAATCAGTTAAGTGTTCATTAAAATACTCAGTACCATTATCAGAGTGGAGAATGCTAATTTTAGTCTGAAATTGGGTTTCAATCATGGTGTAAAATCGAATAAAAATCTCCTTCACTTCTGATTTTTTTGTTAACAAATAAATCCAAGTTAAACGAGTGTGATCATCAATAAATGTAACAAACCATCGCTTTCCACTATGAGTTAAAACTTTAGAGGGACCCCATACATCACTATGAATTAAGTAAAAAGGTGTAGAAGCCTTGTAAGGTTTTGGCAAATAAGTTGATCTATGATTTTTTGCAAAAATACAACTTTCGCAGTGAAAAACAGAGCAATCAATTCCCTTAAATAAATTTGGAAACAAATACTTTAAGTAGAAAAAACTAGGATGTCCTAATCGATGATGCCAAAGCATGATAGTTTCTTGAGCAGAAGGAGAACTAACGCTACTCAAGCCCTAAACTTTTTTATGGCTAGTTGGAGATTCATCGAAATAGTAGAGATCATCGAGCATCCTAGCACGCCCAATCGTCTCCCCCGAATCCTGATCCTGAAAGATGCAGTGAGAATCAAAAAAGGTAACACGACAATTAGCATCCTTAGAGATTTTACTAACAGATAACAAGTTACATGCTAATTTAGGGACATGAAGAACAGAGTGCAAAGTAATCTGTGGGGTTAATTTAATAGTTCCTTTTCCTGCAATAGAATTGAAACTACCATCTGCAATTCGAATTTTTTCATTGCAATACATGGGAGAATATGATTCAAATAATTTGGAAGAACTAGTCATATGGTCGAGGCTCAGGATCTATAATCCATGGAGAGGAGTGGAGACAAGATAGGGCTTTAGAATAATTACTGCTTGTGCCAAGGAAACACTAGGATTACTGGGATGATGAAGTGACCTTTAGCATCTTCGTGGAGTTGATCAACTTGCTCTTTGTTAAATGGGTTGGACTCTACAACATTAGCACTTGAAGGTCCCATATTAGAAGATTGTTGATTGTTGCTCTTTTCACCTTGCCTTGTACTCTTCCAATTTGCAGGTTTTCCGTGAAGTTTCCAACATGTCTCACGAGTATGTCGGGGTTTGTTACAATAGTCACACCAGACACGAGGTTTTTCAGGATTCTTTGATTTATCAGAAACCTTGTGTGCAGAAGCTTCAACCACCAAGGCTGAACTCTCAACTGGCTCAACCGATTTCTTTCCAATCATAACATTCCGACGACTCTCTTCCCTGCGCACTTCAGAAAAAACTTCACTAATAGTTGGGAGAATAGTCTTACCAAGTATTCTACCTCTGACCTCATCGAATTCAACATTGAGGCCTGCCAAGAATTTGTAAATCCGACCATCTTCCACTAGTTTCCTGTAATGTTTTTGGTCATCTGTAGACTTCCATTCATATGAATCAAAGAGATCGAGATCTTGCCAAATCCGTTTTAAGGAGTGAAAATATTGTGTAATTGAGTTACCTTCCTGTCGTATATCACCTAATTTGAGATTCAACTCAAATACTTGTGATTGATTGCCCAGATCAGAGTACATCTCAGTCACACTGTCCCATAGCTCCTTTGCAGTAGAGTAACACATGTAGTTACAACTGATGTCTTCCACCATGGAATTGACTAGCCAAGTCATAACCATGGAGTTTTCAGCATCCCATACAACAAACAACGGATCATCCTTGACAGGTGCAATTTTATCACCAGTGAGATATCCAATCTTCCCTTGTCCGCGAATATACATTCTGACACTTTGGGACCAACGTAGAAAGTTGTCTCCATTAAGCCGGATTGTAGTAATTTGGACAATAGAGCTATTGGAATGACCACGATTGTCTGAGGTTTTAACCTTATTTTCCGACATTGTTTGATCAACAAAAACAACCCAAAAACTCAACACTCAGTAGTAATCCCAAAGAAAACAAAGGAGCCCGATAAAAAAATAACACGAAATCGGAAAAAAATACGATGAGGTCACGGTACACACCACATCGTGTGTCACAATCGAATTCTTTGGGGCGGGACAACGATTGTGCGGCACTTGTTCTAACCCAAATGAACAAGTCAGCCGAACGAAAACTAAGTCGCGGACAACTTCGATCTGGTGGTCTAGCCTCCGCTCACGTTTTGAGAGAAAATGGTTTTAGAAAACGTGAGAGAGAAATCATTTGAAAACATCTTTTTTGAAAAGTTTGTTATACACCAGCAAGCAATATAAAGGGACAACAACAGCTTGAAAGCATATAACCCGGGTAGGAAGAATTGACGACTAGTCGCATCCCCCTATAGTATATTATATGAAATTATAGCTTTGGCAGGCATGTACATGAAAATGGCGAGGAGTCATACGCCAACAAAAATACAAGGAAAGTAAGCAACAAGCAACAACAAAGTAAATATATGAAAGCATAGTAAAAAGGGGGTGGAATGGGCATGCGGCCATGGGCAACACGCCCTGGACAAGCATGCCCGTGACATTCTCCCCCACTTAAACGGTTGACGTCCCTGTCAACTGACGAAGCTTGAATTCTTCGATCTTCTGGGCCCGGTGGCGCGTCTTCACATCGTTCCCCGCTTGTTTTTCCTCCGTAGGGAGGTCCTTCCACTTGACAAGGAATTCTTGGACCACCCGTACGGGTCTTCCAGTTTTTCTGTTTCTTGCGGCCAAGATTTCTTCCACGACTTTGACATCTTTTCGCTTCAAATCGATGTTTGGCCGATTCGAGATATTGCGTCGATCATCATCTGGATCGGGGTGATAGGGTTTCAAATTACTCACATGAATTACGGGATGAATTTTCATCCATGCGGGCAACGCCACCCTATATGAGGTCACCCCGATTTTCTTCAAGACTTCCATGGGTCCTCGTGACTTTCTTACAAGTCACGATCTTTGCGGCTTCGGAAACGGATCTGTTCAGGTCTCAGTTTGACAAGGACTTGGTCTCCTGCTCGAAACTCGAGAGGACGACGTTTCTTGTCGGCCCATTTCTTCATGTGTTTAGAGGCTTTCTCAAGATATGCTCTGCTATTTCTCATTGTCTGCTTCCACTCTTTCGTGAAGTGATGGGCTTGAGGATTCTTCCCCGCATATGGGTGATCGACAATATGGGCGAGGATGGTTGTCTTCCGCATACGATCTCGAAGGACTTTTCCCAGTTGAAGAACTTGTTTGGCGGTTGAAGCAAAATTGAGCCACATCCAAAGAATCGGGCCCAATTTTTCTCATCGGCGTCAACGGTGGCGTAAGTATTCCTCCAATAGACAATTAAATCTTTCGGTCCGCCCATCGGTTTGTGGGTGGTAACTTGAGGATATATTCGGGCTTGTCCCCAAGAATGCGAATAATTTACATCCGTAATGTACCGATGAATCGACCGCCCCTCTCATCGACAATACTCGATGGGATACCCCACGCGGCTTCACAACATGCTTAAAGAACAACCGAGCCACAACTCAAATTGAACATTGTTTCTGCAAGGCTATGAAGGTGGCATATTTTGAGAATCGGTCGATGATCACCAAGATTGCTTGAGTTCTCCTACCTTGGGAAGGTGCGTGATGAAATCCATGGACACACTCTCCCAAGGTCCTGCTTTGGTATAGGTAAAGGCTCGAGTAAACCGACACCTTCGCCTTTTCTATTTTGTCTTGTTGGCAAATAAGGCAAGTCTTGGTGTATTGCATTACATCGTCCCGCATCTTTGGCCGTAAGTATCCCTTCTTTAAGAGCGCGTAAGTGCGCCGCCATCCGTGATGTCCAAATTTATGAGAGTGTCGTGGCATTCGTGCGATAATTTCTTCCTCAAGTCTCCCGCTCTTGGTACGTCTGAGCGGTTACCTCTTTGTTAACAACGGATCATCCTCGACCTGCACTGTCTTGTCTTCCTTCTTTGGCCAAATCGATTACAGCTTTGGCCGAGGAGTCTTTCTGTAGGTGTTCCTTGATGATGTCGCGTATTGAGCCATCAATCTTACTTGCGTGGATATGGGCTAACATGCACAGGGCCGCATGTTCGCCCTTTCGACTCAAGGCATCTGCCGCTTGGTTACTCTTTCTGTTTTATGTTCAAACTTGAAATCAAACTCGGCTAAGGATTCCTGCCATCGAGCTTGTTTCGAAGTTAATTTAGGCTGGCTGAAGAAGTGGCAGATGGCGCTGTTATCTGTCTTTACCACAAATCGAGATCCCAGTAAATATTGTCTCCATACTCTGAGACACAATGAACAACGGCTAACATTTCTTTTTGGAGACAAGATACCTCCGTTCAAAGATCATTTAGTTTTCGGCTTTCGTAGGCGATCGGGTGGCCCTCCTGTATGAGGACACCCCCAAGGGCAAAGTCTGAAGCATCTGTCTCAACTTCAAATGGCTTTGTCACGTCGACCAATCCGAGGACAGGACCCCTCATCATGGTTTCCTTCAAGTCTTCAAAAGCCCCTTGGCATTCGGCGATCCATCGCTGTGGCTTGTCTTTCTTTAGTAACTCTAGTAAGTGGGGCGCTCGTTTGAGAATCCTCGACGAACCGTCGATAGTAATTTGCCAATCCGAGAAAGGAACGTAGTTGGGTCACCGAGGAAGGAACTTTCCATTCTCGAATTGACTCTACCTTGTCACCGCCCATACCGATTAAATCACACTTTTCGATCACATGTCCGAGAAAGTTGATGCGCTGTTGGGCAAAAGCACACTTCTCTTTCTTTACATATAGTTGGTTCTGTTTCAACTTTTCGAAGACTAGTCTCAGGTGCACCTTGTGTTCTTCTAGGGTTGCACTATAGACCACGATGTCGTCGAGGTAAACTACGACAAACTGGTCGAGACACTCATGGAACACTTGATTCATCAAGGTGCAAAATGTGGCAGGAGCATTTGTCAGGCCAAAAGGCATGACTAAGAATTCGAAGGCTCCGTATCTTGTCACACAGGTGGTCTTTTGCTCGTCTCCTCAAAGAATATGAACTTGATAATATCCCGATCGTAGGTCCGCAAACAAAGTATTTAGCCCCGTGTAATTGGGTCGAATAGATCGGATATTATTGGTAGTGGATATCGGTTGCGCATCGTTACCTTGTTCAATGCCCTATAGTCGATGCACAACCGTAGAGACCCATCCTTCTTTTTCCGAAATAGCACGGGGCCCCATAAGGTGCTTTTATGGGCGGATGAACCTCGCGCTCAATAGCTCGTCTAATTGCTTTCGAGTTCGCGAGTTCGTGGGTGCCATCCGTATGCATTTTTCATTGGAGGTTTAACTCCCGGTAGGAGTTCAATTTCGTGATCAATCCCTCGACGAGGGGTAGAGTCTTCGGCAAGTTGTCGGCATTATCTCGATATATTCTTGCGTAACTTCTTGGATTTCTCGCCGGGGCGGGGTCGGTGGTGGTTGTATCCACGACCATAGGAATGGCCATGAATGTTGGTTCTTCCCGACCCAATCCTTTCTTCACCGCATGGTCAAATCATTCTCATGCCACCGGCCGTTTAATTAATGCTTGCAGAGATGACTCGTGGGATTACTACTGTGACGACGATGCACACTTTGCTAGGGGCATTGGGATGACTTTGTGTTCTAGAAGGAAGTCCATCCCAATTACCACGTCAAAGTCATCCATGCGGACCACGACGAGGTCTATTTCTCCCACCAAGACCCCAACTTCAGAGAGACTCTTTTGGAGACTCCCACAATAGGCGGGCCTCTGAATGATGCTTTCATCCTTCCTGTCTCTCTCTTTTTCAACTCTAGTCTCTTGGCTTCTTGATCGGATATGAAGTTGTGAGTCGCGCCAGTCGACTAAAGTGCTCTTAGCTCGGTGGGAGTTGATTATGGCATCAACGAACATCAACCCCTTCTCGCACGTCTCTCTCTGCCTCGACTTTCTTTTGTAGTGCGAGATAGGAATTTTAGCGCACCCATTCTAGGTGTTTCGTTGCCCTCTTCCTTTTCAATTTCAGTTTCGGTCTCCCCCTCATTTTTACTTTGAACTGAGGCTTGGAGAGCCGTGAGGCGCAACTCTGTGTGGGCGATCGTCTGATGCAGTGGCGCCTTTGCATAGGAGGCATTGTAGGGCCCCGGTGATACCTCGATTGATAAGGTCCTCGCGGAGGACCTGTGTTTGTGTTTTGAGGCCGCCTCTCTTCTCCCCCATTCGGTTTATCTGCTCCCCCACTCGGTTTATGTGGCCCATCATATCGCTGATTCGTCGGTCGGGAGGGGTTGTTTCCACCTTGGTGGGGTGGAGTGTATTTCCTCTGTGAACTTGATTCGCCACCATAGTCTAAGAGTCTTTCGGCTGCCGCGTAGGCTGAGGTAAGGTCTTGGACTCTCTGTTCATACAATTTTGTTCTTGCCCACGGTTGTAATCCCTCGATAAAGCAAAACACTTTATCCTTTTCAGACATATCTCGAATGTCGAGCATCACTGCCGCGAATCGCTTGACGTAATCTCTCTTATGTTTCGGCTTGCCTCAATTCGCGAAGTTTTCGCCTTGCAATGAACTCTACATTCTCTGGGAAAAATTGAGTTTTCAGCTCTTTCTTCAGGTCTTCCCAGGTTGTAATGGTGCATAGGCCATTTTGCATGTCATTGACCTTTGTGCGCCACCACAACTTAGCATCGTCTATGAGATGCATCGAGGCCACGGTGACTTTCAGTTCTTCAGATGTAGTACCTGTAGCCTTGAAGTACTGCTCAACGTCGAAGATGAAATTCTCAAGATCTTTTGCATCCCGATTCCCTTTGAATGCTTTGGGATCCGGGATCTTTAGTTTGTTTGGCCCAACACTCGCTTGGTTCGGGGCCTGATTCTCCACAGCTCGCATTGTGCAGCTTATGCGAGTGCTCAATTCAGCCATCTCAGTCCTCATTGCCTCTATAGCCATTCGAACGTCCTCAGTCAGATCGTTGAACAGCTTCATCATAGTCTTTTGGGTATTGTCGAGCTCTTCCACACGCTCCTCTATGTGTGCGGCAGAGCCAGTGGAGCTGTCTCCACGTTCGAAGCTACTTGGACGGGTAGCTTTGTCTTCAAGGGAGTCGACTCTCAGCGTCAGCTCTCTTATCGGTAACCCATCAAATCGGGCATTCAGGGCCTTTATTTCACCCACCGCCTCTTGGACTTCCTTTATTTGATCTTCGAGGAATCGCACCGTGTCAGGTGTTTCCCTCAAGTACAGCAACTGTTCTTCGATTTCCACTAGTCGGTCGACGTGCGACTTGCTCAATTGTTTTGTCGCCGACATGATTGCAATCTTCTTTTACCGAGGAGCCAACTAGGCTCTGATACCACTTGTCACAATCGAATTCTTTGGGACGGGACAACGATTGTGCGGCACTTGTTCTAACCCAGCGAACAAGTCAGCCGAACGAAAACGGAAGTCGCGGACAACTTCGATACAGGGTCTAGCCTCCGCTCACGTTTTGAGAGAAAATGGTTTTAGAAAACGTGAGAGAGAAATCATTTGAAAACATCTTTTTGAAAAGTTTGTTATACACCAGCAAGCAATATAAAGGGACAACAACAGCTTGAAAGCATATAACCCGGGTAGGAAGAATTGACGACTAGTCGCATCCCCCTATAGTATATTATATGAAATTATAGCTTTGGCAGGCATGTACATGAAAATGGCGAGGAGTCATACGCCAACAAAAATACAAGGAAAGTAAGCAACAAGCAACAACAAAGTAAATATATGAAAGCATAGTAAAAAGGGGGTGGAATGGGCATGCGGCCATGGGCAACACGCCCTGGACAAGCATGCCCGTGACACTGTGCACTTAGACACGGGTCTCGGGTATCGGGTCGCGGATGTTCAGGTGTACGCGGGTCTTCAGCCGTTCGCGGGTCGCGCAGAAACCGGGTCAGTAGTTCGCGGGTCAGCTGTTCGTGGGTCGCGCGGGTCGTCTGCAGGTCGCGGGTGCAGCAGCCGGGTTGTTCGCGGGTCGCGCAGGTCTCTCGTTCGAGGGGGTTGTTCGCGGGTCGCGCAACCGCCAGTTCAATCTCTCGCGGGTCGTCAGCAGGAAGGGGTCGCGCGGATCTCGCGTTCGAGGGTGCCGTTCGTGGGTCGCGCGTTCGTGGGTCGCCGGTTAATGCAACGGACGACGAGTTCTTCAGTTGCACGACGAGTTCTGGCGAGTTGGCGGCTGTTCTTCAGTCGGTTAGGTTTTTTTTTTTTTTTTTTTTTTCTTTTCTGCTCTGATACCATGAAGACAGAATGAAATCTCTTTTCTGTATTGATCTCATTAGAATTAGGGTTCTTTAAAGAGACACCCATTTACACATAAGGAATTTACAAGTAAGGAAATGAATCCAGAAATATAATGGAATGCAATAAATGATAATTCTTTACATAAATAAATAAATAAGCCAACAGATACAAACTCCACAAGGAGTGAAAGTAAGCCATAAAAACAAAAACCAAACCCGGTCCACACCCAACATACCACCAAGTTATAAAGACAACTAACAACTAACATTATGATTGAAAATAAGAGCATCCCAATTTAAAAAAATATCCTGAGACGAATAACCAGCAAAACAATTCAGCAATGAACACCATGAAGAGGCATGCAAACGTGTTGAATCCAACTGAGCCACCCAATACACTTTCCTATTTTGAAAAATTCTTTGATTCCTTTCGAACCATATATCAGCAAGAATAGCTTTTGAGCCCATTAATCCAAAGGAATTTTGCCTTTGAATGCAACCTAGGCCCAAACAACAAAGCCCTCACATTGTCTTTAAAGTTATTAGAAGACACTCATGCGAAACCAAAGAAGGAGAATGAAAATTTTAGTATTTGTCATTGATTTTCAACTATGCAAGATCCAATATAAATACAAGGTATAAAATCCTTTTTCAATAAGGAAAAGAAAAAACTTAAAATGTCTAAAAAAAAAAGATAAATCATAAAACTAAATTACAAGAATGCCCTAACTTTAATCATAGTTTTCAACACTCCCCCACAAGGTGAGTTGTACATATTATACAATCCTAACTTGTTATTCAAATTGTTAGTGTTGATATAATTAAATTTGCCATAACCTATCAACTTAAGATTTTGGGTTAATTGGCGATTTACATGGTATCAGAACAGGAGGTCTTGTGTTCGAACCCTGAGAGTGTGGTGTTGGCAAAAGTTTGTTAAGGATATGTGCTAACTGTTGTTTACTAGGCAAATAATCGAGTCAAACCACTTCATTATTGACTTTCTTTGGGATGAAGTGAGCAAGTGCTCTATATTCAGCTTCAACACTACTCCTTGCAACTATGGAGTGCTTCTTGCTTCTCCATGTAACAAGATTACCCCATACAAAGGAACAATAACTTGAGGTAGATCTTCTATCTTTAATATCACTTGGCTAATCTGCATCAGAGTATACCTCAATGTTTCTCTTGTCAGTTTTTTTCAAAAAACAGTCATTTTTAGATATGTAATGATTCGATATACTGCGTCATATGTTCTTCATTGGGACTATGCATAAATTGGCTAACCACGCTAACAACAAATCCAATATCTGGATGTGTATGGGAAAGATATATAAGGCGTCCCACAAGTCGTTGATATCTCCCCCTATCCACAGGTGTGCTCTCATGGTTGAGTCCAAGCTTCTTTTGAGGATCCATAGGAGAATATGTTGGTTTACATCCTAACATCTCGGTCTCTTTGAGAATGTCTAAAACATACTTCCTTTGGGATACAACTATACCTTTTTTAGACCGAGCGATCTCCATGCCTAGAAAATACTTTAGATTTCCTAGATCTTTGACTTCAAATTCTTGAGCCAAGTGATTTTTCAACATTTGCAGTTTCAATTCATCATTTGCAGAATGAATAATATCATCAACATGCACTATTAAAAGAGCAAGTAAATTTTCTGACATCTTTTTAACAAATAAAGTATGGTCTGATTGGCATTGTGAATATCCAAGACTAAAAACAAAATTTGCAAGTTTACCAAACCATGCATGTGGAGATTGCTTAAGTCCATAGAGAGATTTTTTTAAGTTTGCAAACTTTTGGTAACTGAATTTCTCTTCACATCCAGGAGGAATATTCATATATACTTCTTCTTCTAGATCTCCATTCAAAAAAGTACTTTTAATATCCAACTGTTCAAGACTCCAATCAAGATTAACTGCTAGAGATAAGATAATTCTTATGGTGGTAAGTTTAGCTACAGGTGCAAAGGTTTCCTGGTAGTCAATTCCATAAGATTGAGTGAATCCTCTAGCAACAAGTCTGGTCTTCAATCTCTCAATGCTTCCATCTGCCTTATGTTTTACTGTAAAGATCCACTTACATCCAACCGTTCTTTTTCCATTAGGTAAATCGGTCACTGTCCACGTGTGATTTTTTCTCAAGAACTGCAATCTCATCTTAGTACAACTTTTCACCAACCAGGATGTTTTAATGCTTCATGAATATTGGTTGGTATTTGAACATTATCAATAATTGAAAGAAATGCTCGATATGATTCAGATAATCCATCATATGACGCATAGTTTCCTATAGGATGAATGATGCAACTCCTGACTCCATTTCTACTGCAATTGGTAAGTCTAAATCATCAACTTTTGGAATTACATCCTCTCCCGTCTCACCTTTATCCTCACTTATGGATTGTTCATTGGATGTTAAAACCAGTTCAGACATTGGAGAGTGTTTATTTGTTCTAGAATCTAATTTCTCAACTTCTTTGTTTCTCCTTTGGTAGGTGAGGAGATTTTCACTAGCATTTGGAGGAACAACATTTGGTGGACTTGTTTCAGGAGAAATAGGAAGAGTCACAGGATCTTGAGGAGGAGAATGAAGTAGGGTAATAGATTCGGGTTCAGTCTCAAGTAACGAGATCCATAACTGATTGTCACTCATCTTTTCCCCCCTGACGGTCATTTTTGGTAGATGCTCGAAGAACGTAACATCCATAGTGACATATACCCGTTTGGTGATAGGATAGTAACATTTGTATCCTCGTTGATTTGATGAGTATCCTAAGAAAATGCATTTGTGGGCTCGAGGATCCAACTTACTACGTTGTTGGGAATGAATTGGAACAAAGGCACTACACCAAAAAACTTTAGGGACCAAATCAAGAGATTGAGCACGAAGATGGGAAAAAGATTGAAGAAAAACTTGACGAGGAGTAGAGAATTTAAGAACACGAGAGGGCATACGATTAATCAAGTATGTGGTAGTGAGAATGGCTTCTCCCCAATAGAATTTAGGAACATGAGTAGAGAACATCAAAGATCGAGTAACTTCAAGAAGATGACGATTTTTGCGTTCAGCTACCCCATTTTGTTGAGGAGTGTCAACACAAGTGCTAAGATGGAGAATCTCTTGAGTTGATAGATAATCTCCCAAACATGAATTGAAGAATTCTTTTTCATTATCTGTTTTTAAAACACGAATTTTGGTTTGATACTGAGTTTGAATCATATTATTGAAATGCATGAAATGTTGATAAACTTCAGATTTTTCTTTCATAAGAAAGATCCATGTAATCTTTGTATGATCATCAACAAAGAAAACAAACCAACGAGCTCCAGAGATATTTTTAATTTTTGAAGGACCCTAGACATCGCTATGAATCAATGAGAAAGGACTTGATGGTTTATAATCAATGCGAGGATAAGGGTTACAAGTATGTTTGAAAATCTGACAAATCTCACAATGGAACAATGCAGGATTTTTGTTGCAAAAAAAAATTCGAAAATAATTTGGAAAGATACACAAAATTTGGATGGCCTAATCGATAGTGCAATAACATAATTTCACTATCTTGGTTGACAAAATTAGACAACTTACTATTACTCGAAAACTGACTATTATTACTAGGAACAAAGGAAGATAAAAAACATGACTTATCTCCAAATGCTAGTGAGCCTTGATCACAATTGAGAAGATAGAGCCCAGAACAAGGTTAGCACTCCCAATTGTTTTCCCCCATTTCAAATCCTGAAAAACACATAAGTTTGGATAAAATTTAGTCACACATTTAAAATCACGTGTCAATTTACTTCTTGAAAGTAAATTACAATCCAATCCTGACACATATAATACAAGGTTAAGATATAAATTTTGAGTAACTTGTATAAATCACCAATTAACCCAAAAGCTTAAGCTGATAGGTTATGGTAAATTTAATTATATCAATACTAACACTCCTCTGCACTTGTGGGTTTGGAAATTTGGACAAAGGCCCAACAAGTAGAAATCAACACACTCCCACACACTTGTGTAGGCTTAGAAATTTGAACAAATGCCCAAAAAGTGCAAATCAATAATTGGGAGAATACGACTTCACCAGGGTTCGAACACATGATTTCCTGCTCTGATACCATGTAAATCACCAATTAACCCAAAAACTTAAGCTGATAGTGAAGGGAATGGAAATCCCGCAGCGGAAACGACTTGAAACGTCGTGGTGTTCGTTGTGAAAAACGTTTATTAAAATCAAACCTATCTCATAGATTCTAGGCTAAACATGCATTTCTAAGAAGGAAAAACAGTAAGGAAAGAGGTACTTGTTGAAGAACGCTTCTTCAAACTTTCCTTCTCCGGTCACGAACCTCTTCGCAACTACGATCTCTACAACACGATCTCTCGAACCAAGAACTTGGACACCTCCAAATTGATCTTCTTAGTTGTTCTTGATCGGTTGAGGGAGAGTGGTGGAACTCCCTTTGATTTAGGGTAGAGAGAATTTGAGAGAGATTGAGAGAATGAGAGAGAGCTTTTGGATGAAATTCCAAAAAAATGAGAGATCACCTACCAAATCCTTCTTGCAAAGGTTCTTTTATAATGGTGAGAGGTGGAGTTGAAATCTCCACCTATTTACCATTTTACCACAACCTTATGTTATGCAAATCATATTTACATAACACATATAATAAGATATCTCATACCTTATTATGCAAATCACATTTGTATATTGTGGATTTTAAATTGAATCACATTCAATTTAATTTCTCTCAATCCAATTTCTCTAAATTAGCACTAATTAATCAATTAGGCTAACCTATAGTTTAATATGAATCTCATTCACATTAAAACTATATATTATGTCTATCATATCTCATATAAATGACATAAATACCCTTTTAATGAATTTGAACACTTCAAATCCACACCAAACTGTAAACCCTCAGTTTATCCAGTTTGAGCCAACCGAAGGACCTAATGAACCTACAGATGACGAGCTCCAATGATCCAAGACTAACCTGTCAAACTCTTTGACCCGGTTATTCAACATTCATTAGCTACGGTAACACTCCACTAAAGCCCGTAGCTGCATTCCTATCACTGTATGACAAGTTGTGTCCATTGATATAACCAATGCCCGTGAGTCGACCCTTCACAGGTCGTTCGTAGACACTGCTCGGTCAAATTACCGTTTTACCCCTGTGTCTACCTCTTGCTCCTTAAGTCCCACTACTCCTCTAATGAACAACACGTTGCATGGTCCAACCATAAACAACATCCCTCTCGGGCCAGTGAGAGGGTGGGCGCCCGTTGTCCAAGCCCTGGAGACAACACTTAAGGGAACAACCCCTCTAGTTTTCCTGAGTCGGGAACAAGTGAATTCCATCTTGCGTAGTCAAGTTCCCAACTTCTCACATGGTCCTGTCCCTGAGAAGATAGACATATTGGGTGGGTAACTGTGACCACCCTCACCCGTACTAGTCAAAGCGGATGGACCCCGCGCATGCGAGTCCAGTAACACGTTCAGGATTAAGGTCGAGTCACTATTGGTCATCTACGAAGTTATTAGCCCTATACTGTTAACGGTGTTACATCAGTAAGTCTAATAATTCACGGTCCGGTCTTGTATAATCTCATTGCACATGATGCCCCCACTCGCATGTCAACCACATGAACGAGTTGGATCACCTCGTTTGTATCTAATACAAAGCGGGCCGCATCCACAACGTATCTAGGATTAGGTCTCCAACCCTATCCGTATACTGTATACCGTTCGGGTCATTTACTCAAACGTGATCCTCCCTGTGTGTCCACTACACACCATTCAAGTTCTAGTTATCTCATAATTCAATGACCCTGGAGCTTAGTTTATTGGATAAAGTTTGTACTTATGCGAGACACAAAGTGATGAAAAATAACTATTATTTATTTCAATAAGGAATATTACAAACATATACGAGATTAGGACATACATCCCAACAGATAGGTTATGGTAAATTTAATTATATCAACACTAACAACTTGAATGGAATCTGTCCCTGTCACTTTGGAGTATGACCTTGTTAGTCCTAATGTTACTAGTTCTTTGATGCCAAGTGTAGAAACTTCTCAATCAGAGGGAGAAATAATACAGAATGATCCTACTGATCAAGTTCCCGAGTTTAAGTTCTATACTAGAAGAAAATTCAATCAAAGGAACGATGACCAGAGAGTTAATCCTTCACAAAGCCAATATGAGACTCCGACGAATGAAACTGAAAATCTTGGTAATCCTTCTTCTATTCCTACTCCTTCAACTACTCAAAATACTTTATCAATTGTGTCTGATCATGATGTTCCTATAGCCATTAGAAAAGGTACTTGAAGTTGTACCAAATATCCTATTGCAAACTACATGTCATATTATAATTTGTCAAATAGTCATAAAGCTTTCACATCTAGGATAAATGACTTGTTTGTTCCGAGGAATATACATGAGGCCTTAGAGGATCCAAATTGGAAAGTAGCAGTTATGGAGGAGATGAATGCTCTAAACCAAAATGGGACTTGGGAAATAGTACAGTTTCCAACAGATAAGAAGCCAGTAGGTTGTAAATGGGTATTTACCATAAAATGAAGGTAGATGGTAGTATTGAGAGATATAAGACCAGATTGGTTGCTAAAGGTTTCACTCAAACATATGAAGTTGATTATCAAGAAATTTGCTCTTGTTGCTAAAATTAATTCCATTAGAGTTCTTTTGTCACTTGTAGTCAATTCAGAATGACCACTTTACCAACTAGACGTGGAAAATGCATTTTTGAATGGCGACCTTGAGGAAGTGGTTTTTATGAGCTTACCACCAGGTTTTGGAAAAGAACTCGGATATAGTAAAATTTGCAAATTAAAGAAATCCCTCTATGGTCTTAAACAGTCTCCAAGAGCTTGGTTTGAGTGTTTTGGAAAAGTTGTGTCTAACTATGGATTTCTTCAGAGTCAAGTAGATCATACTATATTTTATAGACACTCTAAGAATAATAAAATCTCAATTTTGATTGTCTATGTAGATGATATTATTATCATAGGTGATGATGAAGTTGGTCTAGCTGATCTGAAGAAAAATCTTGCATGTGAATTTCAAATCAAGGACTTGAGATCGTTGAAATATTTCTTAGGGATGGAGTTTGCAAGATCAAAGAAGGGTATCTTTGCTAATCAAAGAAAGTACGCCCTTGACTTACTTGAAGAAACATGATTACTTGGTTGCAAGGTCGCAGAAACTCCCATTGAACCAAATCTAAAGTTACAAGTAGCAAAAGCAGAAGAAATAAAGGATAGAGAACGATACCAAAGACTAGTGGGAAGATTAATTTATCTATCTCATATGCGTCCTGATATTGCATATGCCGTAAGCATGGTAAGTTCTTTATGCATGCACCTAAAGCGACTCATTTTGAAGCTGTTTATAGAATTTTGAGGTATCTAAAGGAACTCCTGGAAAAGGAATTTTATTCAAAAGGCATAATTACTCCCAAATAGAAGTTTATACTGATGCAGATTGGGCAGGAAGTACTACAGATAGAAGATCCACTTCTGGTTATTGTTCCTTTGTTGGAGGGAACTTGGTTACTTGGAGGAGTAGAAAACAAAATGTAGTGGCAAGGAGTAGTGTTGAAGCAGAGTTTAGAGCTCTAGCTCATGGAATTTGTGAGGGGATATGGATCAAAAGGATGCTTGAAGAATTGAAGTTCCTTTGAAGACACCTATACGAGTTTATTGTGACAATAAAGCTGCCATCTCCATAGCTCATAATCCAGTTCTACATGATAAAACAAAACATATCGAGGTTGATAAGCATTTTATTAAGGAGAAGATTGATGCAGGTACAACATGCATTCCTTATCTTCCTACCTCAAAGCAAATTTCTGATTTTCTAACTAAAGTACTTCCAAAGAAGCAATTCGATAGATTGATTGGCAAGCTGACTATGGAAGATATATATAAACCAGCTTGAGGGGGAGTGTTGGAATTTCTATTATTTAGGAATGTTTATGTCTAAATATTAATTGTAAAATATTTTGTCTTTATTGTATTCATTTATTTCCATATTTGTAATAGGTCTTTCTTATATATAAGAGGACCTAAATTGCACATATTTGATATGAAATACAGATTTATATCCTCAAAATTGACATTTAGCTCATATTAAATATACAAAATTCAAAACATTGCCTCAAAGGAACATACATGTTGAAATTAGAATTGCAAGGCAAGGTTAGTGCTAAAGAGAAAGCAAGGAAAATTGCAAATGGCGGCTAAGTACAGTTGAAATTAAGGCAAAGTAAAAATTAAAAAAAAAAAAAAAACGATTGTTAAAATAGGATCTTAAATAATAAAAACACCACTGTAGTAAAGCAACAAATTTTTTTTTAATTCATTTTCTTCTTCTTCTTTCCAGGAACAGAGGGCGTGGGAATAAAAACCTCACTTTCAACATAATACCCATAACTCCTGCCTTTTGCTTTTCTACGTTTTTTTCCTTTTTCTACTGCATCAGAATTGCGAGTTCGTCTTGCAATGTGGTTTTGCAATTTTTGAAATTCCTGCACAGAAGAAACTCTTTTTTAGTTTGTCTTTCTCTTTTCCCCCATTCTTTCTTTTACATGTACTTCTCACAGGTTATTGGTGTTTTTTTGAGCATGTTATTGTTACCAATTTTCTCCTATGATTTCTAAAGAAAAGCAAGTAATTTTTTAGTTTATGAGCATTAAATAAGCCTTAATTCTAACGCTTATCAATTCCTATATCACGGGTCACAAACTGAATCCAGACTCATCTGACAAGTTCAAGCTTTGCCACTAAAAGGTTGTATGAAAATACCACCCACCTAGATCTCTCTCTCTCTGCAAGTTAATTTGAGAGAGAAACAACCCGAAGAAGGTAAAATCTTTTATATGTTCACTAGCACGTGGCAGCTTTTACCCACGAACATGTGCCTTACGCGTAAAAAAAATTAACATACCTCCAGTCATCTCTTTCATCATGCCTGATTGCAAGAAAGGGTTGGAATCTCATCTTTAATTATTTTTATTTTTTCTGGTGTCCACCTAAAGAAGTTGGGAGGGTTTGTCAGAATTGCTCTCTGGTGGGTGGCATAAAGAGAAGGCAAGGATTATTTGGTCAAAAAATATGTATTCTCCTATTTGACCTTGTGCAAATTGCAAGAAATGAAGGCACTCCTTTAGTAGATTTATTATTTGACTTGCAATGTCATCAGATGCCTTATCTCAGAAAATATTGTTTACTTAGCCGCCTTTCCCCTTTCCTTCTTGAATTGATTACAATACTCATTAACCTTGGCAATTAGCATTGATATATCTATTTCTGAATTTCTCTGGAAACATCACAATATCCCAGTGCTCTACTCTTGTTTCTTAGTTGGCAAATTCATGTTTTTCATTAATGGACAGGTCAATGCTCGAGCATCTTATAGTCTTTCATTTGGAGGGACATGTCCCAATTTAGTTCTACCAGAAGGGCGCAACTCTTCTATTGCTAATGTCTGCTCATCAGGAGACCAAACATCTGAGGCATCGCACCCAAATAAGAATGAACGGGTTCTCTATTTACCAAATGCCCATCACCCTTTACTACTTCAGCAATACAGAGAAAATTTGGAGAATGCCAAGCGAGATGTCAGAAATGCTTTTACTGTAAGCTATACCGAGTCAAATATTGTGGACAGCAGTTATTTAGTAAATTGAGGCTATTCTGAGTCATCATTTTTCTCCTAGGAGATAGGGAGAAAACTTCCTGGGGGGACTATGCCATGGAAAGAAAAAAATGTTGTTGATATTTCATTCTTAAGAATGAAGGTAATTGTCAATGTAATTTAGTGTCAGATTCTAAAATCTACTTGCATTATTATTTAGTGTATTCAATTTTTTTTTAAAAAAGAAGTGCTTATTAGGTTGAAGAATTGGAGAAAGCTTGTCCGATTTCGGTTGATTTTTCAATATCTCAAAGAGTTCGAGTTTTAGTTATAACTGGCCCTAATACTGGGGGTAAGACAGTTTGTTTGAAGACCATTGGATTGGCAGCCATGATGGCGAAATCAGGTATGTTTTAAGAGTGGTCTGCTTATTATTACTACGGCAAATTGCAAAAAACCACCCTTGAAGTATGGTGATTGCTATAACTACACTCCGAATTCCTAAACTTTCAATCATAAAAATTTAACTCCCAAACTTTCATAGGCGCAAACTTTTTTTTGTAAACTTTTATAAGTGAAAAACTTGCACTCTTAAAATTTTAATGGTAAAAGTTTAAAGAATTTGATCTTTCACATTAATTTTCAACTATGCAAGAACCAATATATATACAAGGTACAAAATCCTTTTTCAATAAGGAGGAGGATATAAAATCCTTTTTCAATAAGGAAAGGGATATTCTATCCTATTTAAAACTTAAAATATCTAAAAAGATAAACCATAAAACTAAATTACAAGAACGCCCTAACATTAACGTTAATTTTTAACACTCCCCTCAAGATGGGTTGTATATATTATACAATCCTAACTTGTTATTCGAATCTTCAAAGTTAGGTCTTGGTAAAGGTTTGGTAAGGATATCTGCTAACTGTTGTTTAGTAGGCACATAATTGAGTTGAACCACTTCATTATTGACTTTCTCTGAGATGAAGTGTCAGTCAATTTCAATATGTTTTGCCCTATCATGATGAATAGGATTCTTGGCAATACTTATTGCAGCTTGATTATCACACATCATCTTTATCGGTAACAAACAGTCTTTTCCTAATTCTCATAGAACTCTTTTGATCTATATGCCTTCACACATACCATGAGCAAGTGCTCCGTATTCAGCTTCAACACTGCTTCTTGCAACTACAGATTGCTTCTTGCTTCTCCATGTCACAAGATTACCCCATACAAAGGAACAATAACTTGAAGTAGATCTTCTATCTGTAATATCACCTGCCCTGATTTGGAGTTTGCAAGGAGCTTAAAGCACAAAGAATTGGAAGATTTTTCCCCAACAATTAACCTTCAAGAATTTTTTGATTTAGAAGCTTCTCAGCCTTATTTGGGAAAGGAAAATAATAAGGAGGCCCCTTATTGTCCAATTGAATTAGTTCGTTTTAACGATTTAATAAAAGAAAGCGGTCTACAATTCAAAGAAATTGTACCTAGACAAGCAGCCTTGACCTTGCAATGAAAATTCTATCATGGAATACAAGAGGGCTGGGAGATAGTTCAAAGAGAATCAACCTTAAAAAATTTCTCCAAAGAATATGTCCTGATTTGGTTTTCATCCAAGAAACTAAGCTTGTGATAGTAGATGACAATGTGATTAAATCAATTTGGAGTTCGAGAGACATTGGGTGGGTTCATGTTGAGGCTTATGGTAGGTCTGGAGGTATGCTGGTGATGTGGGATGAACTTAAAGTATCGGTTAAAGAAGTGTTGAAAGGAGGCTACTCTCTCTCTGTTAATGCAAGCTCTCTTCGAGGGAAGTCCTTTTGGGTCTCCAACTTTTATGGGCCAGATCATTACAAGAGAGAAAATTCTTGTGGGAGTCTGTCGCTCTCTCTCTTGTTATTGTGAAGGACTGTGGTGTATGGGAGGTGACTTTAATATTATTAGATGGGCTCATGAAAGGTTGCCTTTGAGAAGGGTAACCAAGGGTATGAAGAAATTTAACAAGCTCATTGCTGAACTTGGATTGCTGGAAATTCCCCTCTCGATATGGGACGTTCACTTGGTCCAAACCAAGAGATGAGAGTACGAGGTCTCTTATTGACAGATTCTTCATCAATCTTGAGTGGGATGCCATTTTTGAGAACACAAGGACTTCTAGGCAAGCTCGAGTTTTCTCAGATCATTTTCCGTTGATGCTAGAGGCCGACTCATTTCAATGGGGGCCAACTCCTTTTCGATTCTATAATTCATGGTTAGACAATGTTGGTTGTGTTAAAATTATTGAAAACTCTTTGAAGAAGGATAAGGCTTATGGGTGGGCGGGTTTTGTAATTGCATTAAAACTAAGTAATGTGAAGGGTGTTGTCAAGAAATGGTATGGGGAGTTCGTAAAACTGAGAGAGCAGAAGGAGAATGATTTGTTAGAGGAAATTAAATGGTTAGACTCAAAAGTTGGTCGTTCGACTCTTTCCTCTTTGGAGTCCTCGGCTAAAGCAGCAACAATGGGGGAATTGACCAATTTATTTATGCGGGAGGAAAGAAACTTGATTCAAAAATGTAAATTGACTTGGTTAAGGGAGGGAGATGAAAATACAGGGTTCTTCCACAGATTTCTTGCGACAAAGAAGAGGAAAAGATTGATTGTAGAGCTTCTGAATCCCGAGGGGGATGTTCTTATGTTGGCCAATGATATTGAGAATGAAATAGTTAACTTTTTCAGAAAGTTATTGGTAGGAGATTTGTTCCAATTTTAGTTGAATGGGAGGTGGCCACTTTAGGGCAGAATGGCAAGTTAGTTGCCCCTTTTTCGGTTGATGAAGTCTTGGAAGCTATAAAAGGCCTAGGTAGTAACAAAGCCCTGGGGCCTGATGAATTCACTACTGAGTTCTATCTCAGATTTGGGAAGCTACTAAAAGCAAATTTGGTTAGATTTTACGATGAGTTTTTCCTCAACAGTCAAATTAATGTAGCTCAGAAAGAGAATTTTATATGTTTAATTCAGAAAAAAGATTTGGCAAATACAATTAAGGACTTTCGACCTATTAGTTTGGTTTCTTCATCCTATAAGATCTTGGCCAAAGTACTTGCGAAAAGGTTGAGATTAATCATGCCTGTTATTATTTTGGTGTTCGGTAGGGGTGTTATTTTGGTGTTTGCCCTCAGTAGTTCGTTTGTTTTGCAGGTGGCCAGTGCTTGGTTGTTTTGTGTTATGCACTTGTTGGCGGGTTGTTGAGATAGTTCTTTGGTCGGTTTCTTTGAGCGGGTTTGTCTTGGTGGGTTTCGGCTGGTTTTTGAGTTAGTTCTTTTGTTGTTTGGGTTTGTCATGGTTAGGTGTGGATTTTTTTGGTTCCTTTTTGTTTCTTTCGCTTCGGCTGCCATTTGTTGATGTTGTTTTGTTGGTATTTTTGAGGTTTCCCTTTGTTTTCTTGGGTCTTTGCTCTGTTTTGGTCTTTTTGGAAGTTATATTGTATTCTCTTTGTTACTTTTGTTTCCTCTGGTTGTTCTTTTTTCGTTAAAAGTTTTGTACTTGGAGCGTTAGTCTCCTTTCATTATATCAATGAAATCTTTGTTACCTTATCAAAAAAAAAATCATGCCTGCTATTATTTCAGATACTCAAAGTGCCTTCATCATGGGAAGACAAATTTTAGATCCAATTTTGATTGCAAATGAAGTAATAGAAGAATACAGAGCAAAGAAGAAGGGATGGGTGTTAAAGATTGATATTGAAAAAGCCTTTGACGGTGTTGACTGGGAATTTCTTGAGGAAGTAATGAAATTGAAAGGTTTTGCACCACTTGGATCCAATGGATCATGGGCTGTATCAAGGACCCGGTGTTTTCAATTTTTATCAATGGAAGACCACGGGACAGAATTAAGGCTTCAAGGGGCTCCAGGCAAGGAGATCCTCTACCTCCTTTTCTTTATCTATTGGTATCAAAGGTCTTATCCTCCTTGGTGAATAAAAATGCATGATCAAGGGCGCTATTTGAAGGTTTTGTTGTGGGGAAAGATCAAATTCATATCTCAGTTTTACAATTTGTCGATGATACTTTATTATTTTGTAAACATGATGATGAAATGATGGGAATTTTTGTTAGAACCATTGCTTTATTTGAAGCTTGCTCGAGGTTGCGTATTAATTGGCTTAAGTCGGCATTGGCTGGAGCGAACATTCAGGCCTCGGAAGTCTTGAAAATGGCTAATAAATTGGGATGCAGGGTTGAATCTTTTCCAATCCAGTATTTGGGGTTACCTCTTGGAGGGCGTCCTAAGAACGTTGGATTCTGGCAATAGGTGGTTGACTCCATTGGGGAGAAGATAGACAGATGGAAGAAATTTAACCTCTCAAGGGGAGGGCGTATGACCCTTTGTTCCTCGATTCTTGCCAATTTGCCATTATACTATTTCTCAGTCTTTGAAATGCCAGCTGGAGTGGCTCATCAAGTGGATGGTGCTATAAGGAATTTTTTCTGGGAAGGGAGAAATGGTTATAAACTTAGTCACCTGGTTCGATTGGGTTTAGCCTCGCAAGACCGATCTTTGGGGGGGCTTGGCCTTGGGAATCTCAAAAGGAAGAATAAGGCCCTCTTGGCTAAGTGGGGATGGAGATTTGAAGTTGAAAAATCAGCCTTGTGGAGAAAAGTGATAGCAAGCATACATGGGGTATCAAAAGATGGTTGAAACTCTCTTGTGGCTAGGGGTCGAAGCTTGAGAAGTCCTTGGCTTAATATTGCTAAGCATTGCGAAGAGGTGTATGGTTTGGTGCGTATGCAGCTCGGAAATGGTCTTAGAATACAGTTCTGGGATGATATATATGGATAGGAGAGACGACACTGAAATCCTCTTTCCCCTTACTCTACTCAATAGTCAGTAATAAAGAGAGCAATGTTGCTGAATGTTGGGATCCTAAAATCTTGGCATGAGACTTATCATTCCGAAGAAACCTGAAGGAGGAGGAGGAGGAACTGATGTCTTTATTGGGAACTCTCAAAGATCAGAAACCGTCTATGAAGGAAGATTGGAGGAGATGGTCCATTGATCCTTCTGGAAATTTCTCAGTTCGTTCGTTAGTCTCTCACTATGGGCTCAGTTTGGCTATGGCTAAGCCATTATTTCAAGCTCTTTGGTGCCCTTATTGCCCTAGAAGAGTTAATGTGTTGATTTGGATTTTGATCGCTGGAAAGCTTAATACATCGAAAGTTCGGCAGAAAAAACAAGTTTCAATGATGTTGCATCCCTCGATAATGTTTGTGGGCTCCTATCGAGAGTCTCTTTACCTTCTAAAGCAAGGATTTTATGGAGAATCGGAGTTAAAGCCTTGCTATCAGATATCTGGTATGAAAGAAATCAAAGAATATTTGAAGATAAAGCATTGGAATGGAATTGAAGATTCGACTCAATCCATTTGAAGGCTTCTTCCTAGTGTGTTCTTTCCAAGAACTTTGCTGGATATTCAGTTTCAGATTTATGTTTTTCATGGTATCCTTTCCTTGCTAATATTTAGTTTCCTGACTTTATTGTTATCTTTATTATGTTTATGTTGGGACATCTTTGTTATCTTCCTGTTGTCTATTTTTCTCTTAGTTTTTATCTCCTGTACTTTTGTATCATTTCATTATATCAATGAAATTACTTTGTTTCCTTGTCAAAAAAAAAATATATATATCACCTGTCTAGTTTGCATCAGAATATACTTCAATATTTCTCGTCAGTTTTTCCAAAAAACAGTCCTTTTCCAAGAGTCATTTTTAGATATTTGAGGATTTCATATACTGCTTCATATATTCTTCATTAGGATTATGCATAAACTGGCTAACCACGCTAACAGCAAATCCAATATCTGGACGTGTATGGAAAAGATATATAAGGCGTCCTACAAGTCGTTGATATCTCCCCCTATCCACAGGTGTGCTCTCATGGTTGAGTCCAAGTTTCTTTGGAGGATCCATAGGTGAATTTGTTGGTTTACATCCTAACATGTCGGTCTCTTTGAGAAGGTCTAAAACATACTTCCTTTGGGATACAACTATACCTTTTTTAGACCAAGTAATCTCCATGCCTAGAAAATACTTTAGATTTCCTAGATCTTTGACTTCAAATTCTTGAGCCAAGTGCTTTTTCAACATTTGCAGTTCCAATTCATCTTTCCAGAATGAATAATATCATCTTCATGCACTATTAAAAGAGCAAGTAAATTTTCAGACCTCTTCTTAACAAATAAAGTATGGTCTGACTGGCATTGGGAATATCCAAGACTGGCATTGGGAATATCCAAGACTGAGAACAAAATTTGCAAATTTACTAAACCATGCACATGGAGATTGCTTAAGTCCATAGAGAGACTTTTTAAGTTTGCAAACCTTTGGTGAACTAAATTTCTCTTCATATCCTGGAGGAATATTCATATATACTTCTTCTTCTAGATCTCCATTAAAAAAAAACATTTTTAATATCCAACTGTTGAAGACCCCAATCAAGATTAACTGCTTGAGAAAAAATAATTCTTATAGTGTTGAGTTTAGCTACAGGTGCAAAGGTTTCCTGGTAGTTAATTCCATAAGACTGAGTGAATCCTCTAGCAACAAGTCTGACCTTCAATCTCTCAATGCTTCCATCCACCTTGTATTTTACTGCAAAGATCCACTTACATCCAATAGTTCTTCTTCCACTAGGTAAATCGGACATTGTCCATGTGCCATTTTTCTCAAGAGCTTCACTTTCATCTTGTATAAATAGAATGCAAAGGAAAGGATAGAATAATCAGAGTAGGGTTTTTCTCTCTTCTCTCTCCTCTGACCGCCTCCTGCTCTCTTCGTCTTCTGCCGTTTTCGACCTGATTTTCCGACCATCGTCTTCGTCTGTTGTCTTCGCCTCCGTGAGCCTTTCTCCTCTCTCCGATATAGCCGCTCGTCCCTCTCTGTTGTCTTCGTTTCCCTCTCCGACGGTTCACCGACTTACCCGTAGGTCAACTTTTCAGATTCCGCCCGTCCTCCCTTCTGGTTTTTCCGTCTGTCTTCTTTGGGGTTGGCTGATGGTGTTAGATAGACCACGATGTATTTTTCCCTTGCCGACGTGTGGGTTTGTTTTGTTTGTTGGTTTCGGCTTCGTCTGGGGTGTTTGGCTGCTCTTTTTTCATCGACGTAGTGTGTTCCCTTTTAGAGGTCTTTTGTTCTTCGCTCATCGATGGTCTTTGCCCTTTGTAGCAAAGATTGGGTGGTTTTTGTGGTTTTTTTCTTTTTGTTGGGTTTGACATGGACTCTCTCAGTTTTAGTATCAGATTGAAGTGTTTCAGTTGTTGCATGGGGGAAAGTTTTTTCTGGATTTGGTGTGAAGGTTTGAATGTCGTTATAAAGGATTGTGCAAGAGATGCTTCGGTTTCCCTATCGGTGGAACATGCCAGATGATTTCTACATTCTTTTGGTGAGTTGTGTTTAGAGACTAGCTCTTCTTTTGTAAGGAAAAATTTTAAGGATGTGGAGGCTACATTGGGACTTTTCAAGATTAAAAAAGCAGATGGTTGGGTGGTCAAAGGAATGGTTTGGCCCCCTTCTGCAGGGAGACAATTCTTCAGAGTCCCGCTAGGGTCTTATTCGATTGGATGGATGGTTTTTTAGGGTTTTGTTGGCTGATTTTATGGACTTTGTTGAACGTTCAGTCCCTGTGGATTCTTTGGTTGATTATTTGGAAGGCAGAAAAGAAGGTGTTGGCCTTCTTGACCACGCTAAAGATGATTACTCTTGCCATTTTGGGATGTCTGTTTCTGCAAGACATTGGGTTCGAAAGGCTGATGAGGTGGTGGATGTGAACTTTTCACATCTTTGGGTGGTGACTCGCCTTTTTGTGCATATTCATTGGAGTCTTATTAAGAAGGAAGTAGAAGCTCATTTTAAGATAAGAGTGCAGATTAACCCTTTTCATATTGATAAAGCCATTCTATTAATCGATGATGGCTCTATAGATGAATTGGGGTTGAATCATGGAAAATGGCAAGTTGTTGGTGGCCTTCATTTGAAATTGGGGAAATGGAATGATAGTTTGCATAGTCGGCCTGAGCATATTGGTGGTTACGGAGGTTGGATTGCTATAAAAAATTTGCCATTAAGGTTATGGAGAAGAAGTGTTTTTGAAGCCATTGGAAGACGTTTGGGGGGTTTGGAGGAAATCGCGACAGATACTTTGAATTTAATTGATATTTTGGAGGCTAAGATTAGAGTGTCTCCAAATCTATGTGGTTTTCTCCCGGCGCATATAGTGATTGATGACCCGATTGGAGGGGTCTTTTCTTTAAATTTTGGGGATGTGGCAACCTTGGATCCTCCCCCTAGGCTGTTGGATTCTTTAGTATTGAAAGACCTTTCAAATTCGATTGATTTGGTAAGAGTAAGTCAAGTATTGATGGATGAGCTTGTTTCGGAGTCTAATCAATCCTTGGGTGATTCTTTAGGTGATTAGTTCGATCTTGGAAATTCGGTAGACCCATTGCTGGTGATTGGGGGTGGAATTGTTGAAACAGGGGATGACCTCTTGGTGGAGCGTCTCAATAAGGAGAATTGTAGTTCTGCGAATTTGAATGGGTGTTTGGAGTTGTTCTCGAGCAGTTGCCTTAATATGGAGCAAGATATCAGTTCGGAAAGACAATTGGAAGCTTTAAACTGTTTGATATGCCCTCCAGGTCGTGATATTGAGTTAGAGGAAGTTAAGGAAGACGAATTCCTTGCTACAGAGTTGATTACCGAGGATCTGCTATTCTCCTCCCCTGCCCCATTGAGTGTAAAGGAGATTAAAAATGCCTTTTCTGGAAAATACTACTCTAGAAGGAAGGAAAAGGAATTGATTGAGGGGTCGGTGGTTCAGGAGGAAGGAGTTTTCTAGGATTTCTTGGAACAGACGGAAGGTTTGCCTGCAAAAAGGCTTTCCTGGGTGAATTGCTCTAAGGTTAACTCTCCTTCCTTCATTTTTGAGAAATGCTAGGTTGGGCCGAAAGAGATCCCCTTTTTTAAGGCGGTCCTCCATCCGAAAGACTACAAAGCGAGTGGGTCTGTATTGCCTTCCCCTGTTTTTGATCAAAGCCCGGTAGTGGGCTCTGAGATTAGTTTAAGTAGCCCTGACTCGTTTTCAGATAGAAGGGTGAGGTCGGTCTTGGTGAAGGAAGTCTCTCCCGTGGTTAATTTGGAAGCTTTGTTTGCTTCTCCTGTTGGTAAATCCCCTTTGGTAGGTGGGCAGGATGAGGTTGGGTTAGATTGTCCCCGTGAGCTCTCGTTTTGATTCCCTAATAAAGGTGAGTGGGCTGCAGTTTAAAGAAGTTGTTGTTGGTGGTAGGGTGTCTTCTATATGTTGAAGATTTTTCTTGAATTTTGAGAGGATACCAAATTGACGGCTTGGTGTTGATGAGGTATTCTTATTCTTATGGAGTTTGAGTTTAGTTGGGTGGGGGCTTGTGTTAGCCCTTGGCAGAGCAGTTGGCATGTTAGTTTTGTGAGTTGAATCACAGTTTTCAGCCTTTGATGGATTCGTTGTTGTCTCCTTAAGCTTGCTTGCTGTGAAGATTTGCTTTGGCAAAGCCTTGTTTTCAAATCTCTGGGTTATGTGATAACAAAGTCTGTTAGAGAAAAGGCATGGTATTGGGTTTGGTATCTTTGCCCAGTTCGCTTGAAAGCTTTTTCCTGGTGTGGTTTTTCCAAGGATTGGCTGTTTGTTCTGTTTCTGATCTATGTCTTTCTTGTTGCAAGTTCTTTTCTTATGATTGTTGGGTTGTAGTTGTTTTGGTCTGAAGGGATGGGTGTTTGGTTTTGATTCTCTTTTGGCCTGGGGCTTCTTTCAGCTTTTTCTCTGGTTATATTGACTTTTTTGTTTCTCTCTTTCTCTTTGTATTTTGAGCGTTAGTCTCATTTCATTTTTTCAATGAAATCTTTTTTGTTGCCTTGTCAAAAAAAAAAAAACCTTAGATCAACCTTTTAGAAAATTCATCTCTATATCCCCACCTAGTTTAGTTATTTTTAAAAAAATATTAAATGGTTTTTGTTGGTTGTTTCCTTATTTGTTAATAAGAATATTCTGTATTTACTATATGTAGAAAATAATAAGAGTAGAAATATTTTCTATATCTTACAATGAAGGTACAAGGTTTCTTATATAGGAGAAACCTATATTTACATACAAGGAAATAAAATCACAGCATCTATATTATGGTAAATACAAGGAAATATTTATTGACAAATAAGGGAATAGCTAATATTTCCTTATTAATAAATAAAGAGAATAACCAACACTCCCCTCAAGTCGTTTGAATATATCTTCCATCGCCACTTGGAGATCATCTTGTCAAACTCAGAATTTGGGAAGTCCTTTGTTAACACATCAACTACTGTGCTGTTGTAGGAGATAAGGAGTACATATTACACCAAAGATCAATCTTCTCTTTTTATGAAGTGTTTATCTACTTCAATATGCTTTGTTCTATCATGAAGAACCGGATTGTGGGCAATGGAAATAGCTGCCTTGTTGTCACAGTAGATATGCATTGGTCTATCCTGACTAAATTTTAATTCATCCAATAATCGTTTTATCCATATGCCCTCGCAAATTCCATGAGCCAAGGCTCTAAATTCAGCTTCAACACTACTTCTAGCCACCACACTCTGTTTTTTACTTCTCCATGTGACTAGGTTGCCTCCAACAAACGAGCAATACCCAGAAGTAGATCTTCTATCTGTTATACTCCCTGCCCAATCAGCATTAGTGTATACTTCGACATGTAAATTATCATGTTTCTTGAATAGTATACCTTTTCCAGGAGTGCCTTTTAAATACCTTAGGATTCTGCAAACTGCTTCAAAATGAGATTGTCCTGGGGCATGCATAAATTGACTCACCACGCTTACAAAGAAAGGCAATGTCGGGCGTGTATGAGACAAATAAATGAGTCTCCCTACCACTCGATATTTTTCTTTATCTTTTACTTCTCCTCCTGTTGCAACTTGCAATTTTAAATTTGGCTCAATAGGAGTTTCTGATATCTTGCACCCAAGTAAGCATGTCTCTTCGAGTAGATCAACGATATACTTCCGCTGATTTACAAGAATCCCTTCTTTTGATTTGGCAAATTCCATTCCTAGAAAGTACTTCAAACTTCCCAAGTCTTTAATTTGGAATTCATTTGCAAGTTGTTTCTTCAAGGTCGCTACTCCTCGCTTGTCATTTCTGTGATAATAATATCATCAACATACACTATTAGAACAACAACTTTGTTATTTTCAGTATGTCTATAGAAAATAGTATGATCTGCTTGGCTTTGATAAAATCCATAGCTCATAACTACTTTCCCGAATCGTTCAAACCATGCCCTTGGAGATTGTTTAAGTCCATATAGATCTTTCTTTAATCTGCACACTTTGTTAATCCCAACTTCTTTCTCAAAACCAGGTGGAGGTTCCATAAATACCTCTTCCTCAAGATCCCCATTGAGGAATGCATTCTTGACATCAAGTTGATAAAGGGCCAATCTGCATTGACAAAGAATGTATAATAATACTCTTATAGAGTTAATTTTAGCAACAGGGCAAATGTCTCACGATAGTCAATTCCATAGGTTTGAGTGAAACCTTTAGCAACGAGTCCGGCTTTGTACCTCTCAATACCCCATCCGCTTTACACTTTATAGTAAACACCCATTTACATCCTACATCTTTCTTACCTTTTGCGGTTCAACTATGTTCCAGTGCCATTTTGTTTAAGAGCATTCATTTCTTCCATAACCGCTTTCCTCCAATTCGTATCATTTTAGGGCATCTTGTATATTCCTTGGCACAAATAAGTCTCATTATTCTAGAGGTAAAGGCCCTATGATTATCGGATAGTCTATGGTAGGAAAGGTAATTTGATATAGGATATTTGGTGCAATTACGAGTGCCTTTCCTATGGGCTATTAGGATATCAAGATCGGATATGTCGGATAAGTGATTTGAAGCACAATGGGAAATAGAAGTACCTGGATCTTCAAGATCAGTTGTCGGAGTTTTAGATTGGTTCTGTGACAAATCTGCTCTCTGGTCTTGGTTCTTTTGATTTACACCCTTCTAGTATAAACACAGTAGCTCAAGTTTTCGATCAATAAGACCACTTTGTAGTGTTTCTCCCCCTGAAGAAGAGTTTTCCACACTTGGCATTGGAGAATTAGAGTTTGTAACCTCAGGGCTAATGAGTGTGGGTAGGGGAGACGTATCCCAGAAATTATCTTCTAAGTTAGATACCATCTCCCCCTGAAGATAATTTGGCTGAAAAAAGGATTGATCTTCCAAGAATGAAACATCCATGCTTTCAACATATTTTTTCTGTATGGGGTCAAAACATTTGTATGCTTTCTTGTTTGAAACATATCCAACGAAAATACATTTAGAAGCCCTAGGCTCAAGTTTAGACCGACACATGTTAGGAAGGTGAACATATGTTGTACACCCAAAAACCTTGATAGGCAAATCAGAAAATAAGCGAACGGAGGGAAAAAAAGGTTTTAAAATGATTCAGGAGTGTCTTAAAATCCAAAGTTTTTGAAGGCATTCGATTAATAAGATACGCTGCCGTGAGAATAGCTTCTCCCCACAAATAATTAGGAACATGCATAGAGAACATAAGTGCACGAGCAACATCAAGTAAGTGTCTATTTTTTCTTTCAGCAATGCCATTTTGTTGGGGAGTGTCACGACACGTCGATTGGTGAAAAATACCTTTGATTTTAAAGAATTCATTTAAATGTTCATTGAAATACTCTTTTCCATTATCAGAATGCAAAATGCTAATTTTAGTTTGGAATTGAGTTTCAACCATGTTATAAAACTGGATAAAAATATCTTTCACATCTGACTTGTTTTTTAGCAAATAAATCCAGGTTAACCTGGTATGGTCATCAATAAAGGTAACAAACCACCGTTTTCCACTATGAGTTAAAATTTTAGAAGGTCCCCAAACATCATTATGAATTAGGTAAAAAGGAGTTATGCTTTGTAAGGGTTGGGTGAATAAGTTGATCTGTGATTTTTTGCAAAAATAAAGCTTTCACACTGAAAAATGGAGCAATCAAGACCTTTAAATAAATTTGGAAACAGATACTTTAAATAGAAAAAACTGGGATGCCCTAATCTACGATGCCAAAGCAGTATAGTTTCTTTAACAGGGGAATAACTAACACTACTCAAGCCTTGAACTGTTTTATGACTAGTCGAGGGTTCCTCAAAATAGTAGAGACCTTCAAGCATTCTAGCACGCCCAATCATCTCCCCCGACCCCTGATCCTGAAAAACACAATGGGAATCACAGAAGATGACACGACAATTAGCATCCTTAGATATTTTACTTACAGATAACAAATTGCATTCTAGTTTTGGAACATGCAATACAAAATATAGATTAATTTGTTAGGATAGTCTAATAGTTCCTTTCCCTGCAATGGAATTAAAACTACCGTCTGCAATTCGAATTTTTTCATTACAGTACATCGGAGAGTATGATTCAAATAATTTTGAGGAACTAGTCATGTGATCAGAAGTCCTTAATCTATGATCCATGGGGAGGAAGTAAGACAAGAAAGGGCTTGAGGACGAAAACTGCTTGTGCCAAAGAAACACTGGGATTACCCGATGATGAATTGACCTTTAGCGACTTCAGAGTTGATCGACTTGTTCCTTGCTAAATGGATTAGAATCAAAGAATATTAGCGAGAGAATGATGATTGATTACGGATGGGTGTTGATTTACACTCGTCATCGATGAAATGGGGCCATGCTTATTACTCTTCCAGTTAGCAGGTTTCCCGTGAAGTTTCCAACATGTTTCACGAGTATGTCGAGGCTTGTTGCAGTAATCACACCAGACTCGAGGCTTTTCATTATTTCTGGGTACCTGATTGGATGCCTTATGTGTTGTAGCTTCAACTGCTAGAGCCGAACATTCAACCAATCAATGACTTTCTTGCCGATCATAACACTTCTACGACTTTCTTCCCCGCGAACTTGAAAAACACATCATTAATATTTGGGAGTACAGTTTTTCCCAATATTCGTCCTCTAACCTCATCGAATTCAACGTTGAGCCCTGCGAGGAATTTATAAATCCGCTCGTCTTCCACAACTTTCCTGTAGTGCCTTTGATCATCAGTTGACTTCCACTCATACGTATCAAATAAATCCATGTCCTGCCAAATTCGTTTTAACAAATGAAAGTATTGTGTAATGCAATTATTTCCACATCGTATATCACCCAGTTTAAGATTGAGTTCAAACACTTGTGATTGATTTCCCAGGTAAGAATACATCTGAGTTACATTTTCCCACAATTCCTTAGCAGTAGAGTAGCACATATAGTTAGAACTTATTTCTTCTACCATGGAATTTACTAACCAGGTCATCACCATGGAGTTTTCAAAGATCCCATACAAATGACGAGGGGATCGTCTTCTGCAGCAGTATGGATTTGTCTCCGATGAAGGTATCCAATCTTTCCTTGTCCCCTGATATACATTCAACACTTTGCGACCAACGAAGGAAGTTATCGCCATTAAGTCGAATGGTTGTAATTTGGACAACAGGATCATTGGAGTGGATCCGACTATCTGAGGTTTTTACAATGGGCAACAACTTATTGTCTGACATTCTGTTGGAGAGCACCACACAGTAACAACAAGCAAAGCAAAAAATAGAAACTTCCTTTGATCCAAAAGAAAAACTATACGTAAAGGTGTAGAGAAACAACAAACGTATGAGTCCAACACGGAGGTCACAAATTCCTTAGCACTCGCGCGGCGAGACCAAACGGCGAGAAAAATCCGACCAACGGCGGTGGGTTCGACGGCAGTGTGTCCAACGGCAGTGGGAAGTCCAGCGGCGACCTTTGAAGGAAGAATCCGACCAACGGCAGTGAGGTCTTCGTCCACCAACGGCGGTGGGTCTTCGCGCTTTCGTGGAGTGCTTTACAGGGGGTTTTTTCCAAAACCGAAGGAAGGGGGTCCGACGGCCGACTTTAGGGTTGCAACCGATTTAGGGTTGCAAAAGAGAAAGGAGGGGGTCCGACGCCCGATTAGGGTTTGCTTTGGTCGTGAGGGATCGTTTGCGACCAATTTAGGGTTGCAACCGATTTAAGGTTTTGAAAAAAGGAATTAGGGTTTTTTCTTTGAACAGATTTAGGGTTTCTGCTCTGATACCATGTAGAAAATAATAAGAGTAGAAATATTTTTTATATCTTACAATGAAGGTACAAGGTTTCTTATATAGGAGAAACCTATATTTACATACAAGGAAATAAAATCACAGCATCTATATTATGGTAAATACAAGGAAATATTTATTAACAAATAAGGGAATAGCTAATATTTCCTTATTAATAAATAAAGAGAATAACCAACACTATATTATCTTGTATTATATGTGCTGGAATATTTTTTCCTTTTTTGTAAATGAGGGTTTCTTAAATCACCAATCATACCAAAAGGTTAAGCTGATAGTTTGGGATAAATTTAATTATATCAATATAGTCTAACACTCCCCCTCACCTGTGGGCTTGGAAATTTGATGAAAAGCCAAACAAGTGGAATCAATATTAAATGGAGAGGAAACTGACTTTGCAGGGGTTTGAATACAGGATCTCCTAGACCACCTGCACTGATATCATCTTAAATCACCAATCAAACTAAAAGCTTAAGCTGATAGTTTGAGATAAATTTAATTATATCAACATGGTCTAACAAGGTTTCTCTATATATGAAATCATTTATTCTCATAATGAATACACGGAAAATATATTCTACTCTTGATATTTTGTACAGTTTTGTAATATTTTTATGCTTTTAGAATAGAGTATATATTATTATTATTTTTTATGATGTGTCCCCTACGTGCCATGTCCTTTTTTTTTTGTAATTGACATGTCGTTGTGTCCTCATCGTGCCGTGTCCATGTTTGTGCTTCTTAGCTTGTGACCCAAGTAGGGTGTTAATGTGATGGAACGAGTTCTAACTTATTTAACCAATTTAAATCTAGTTTGTAATGAGCTATATTAATTATTGTTATCCTAAAAGATGAAGCGCAATATGACCTACATAGTGAATGGAAAATTTGTGATATTTCTCATCCCCTTTTTGTATAATCTCCTTCATATCAATACATATTCTTTGTTTCTTATCAAAAAAAAGAGTGTGATATTGAAATACCTAATCCATTATGCATATCTCTATATTTACATCACTTTTGTCGGTTGTAATATTGGTTTTTTTACAATTGAATATTTAATTGACCTTGTTGTCCTTTTTGTTCACTTCATAGGTCTTCATGTTTTAGCTTCAGAGTCTGTACAAATCCCTTGGTTTGATTCTGTTTTTGCTGATATCGGTGATGAACAGTCCCTAACCCAATCTTTGTCTACTTTTTCTGGACATTTGAGAAAAATAAGCGTAAGTAAATTTGGATCCTTCATCTCGTTACAATTTGGGACAGTCTTCAGTTTTCTGTGATTCCCCACTATGCATGCAGTCTCTGTACGTGGCATAATCATCAGAGACATTCATATTTCTTAATTACGATGAACTTTTTGTGAGTTGATTAAGTTGTGAAAGTTCTTAGCTACTCATGTCATGTGGGGGTTAGTACCTTTTTGTGAGTTGATAAGGGGCTTTGGAAGTAATTCTATTTTTGTAATTATCTATTTTACTTGGAGCCTCTTTGTGTTTTAGTTTGGCTCCTTTTATTTTGGAGCTTTATTTTTTTGTTTATCCCTTTTATTCTTTCATTTTTTTCTCAATAAAAGGCCAGGTTTTCATAAATAAATAAATAAATAATTCTGTTCTTTTTTACTTTAAGGCCTGGGATAAGAGAAAAGAAGTTGTATGGATTTTTTTAATATATATAAAATATTTCTTGGGATATTGGTGAGAGAAGCTTCTTCATATTTCCTATGGAAAACCAATTTTTTTGGAAGAAAACTTCCCGAAGGGCATGTTTCTTATGGCAAGAAATGAAAATTTGCATTTGGATGCTTTCTATTTCCCTCATTAGCGTTTGGCATTTTCTTACTCTGCTTTTTATTAGAACTCAAGGACACAGTTGTACAACCTATGGAACAAATCATGTTCAAGCAAGGTTGTTATCTTCTTTACTTTACTAGGAGCTTGTTTCTTTTGAACATTTTTGTATCTTTTCATTATCTCAATGAGAAGTTTACATCTAGTTCAAACAAAAAGAAACAAAAATTGCCTAAAGAAATAAATGTCAGAGACAAACTCCCGATGGGGGTGAAAAGAAATTAAAAAAAAAAAACACAACTAGGAGGGAGTTATGAAGTCCTCCCAATTCATACTTATCTTGCAAAGAAGAATGTGGAAAAGGTTTAGAAAGTGAATACCATAAAAAAGCTTTCCATCTAGCTGCCTCGATCTGATTACTGATTTTCCTTGTTCTCTAAAACTCTTTGATTGCAGTATAGTAACATTAAAAAAAAAAACCTCTTTGATTGCTATACAACCAGATCTCTGAAACAATTGCTTTAACTGCATTTACCCACAACAAATGTGGTCTTGGAGGTAGAGAAGGCCCAAAAAGTAGATGATGTTACGGTGAAACAAGATATGAGAATTCTTATTCCATTCTATACAATAGAGTCGAATATATATAGGCATACATGGAAGCTCTAA

mRNA sequence

ATGAAAGGAGGGGTTGGTACCATCCTAGAGCCACTCTCTGCCGTTCCTTTAAACGATGAGTTGCAACAAGCAAAGGCATCAGTGGCAAAAGCTGAGAAAGATGTTCTCTTTATGCTAACTGAAAAAGTCAATGCTCGAGCATCTTATAGTCTTTCATTTGGAGGGACATGTCCCAATTTAGTTCTACCAGAAGGGCGCAACTCTTCTATTGCTAATGTCTGCTCATCAGGAGACCAAACATCTGAGGCATCGCACCCAAATAAGAATGAACGGGTTCTCTATTTACCAAATGCCCATCACCCTTTACTACTTCAGCAATACAGAGAAAATTTGGAGAATGCCAAGCGAGATGTCAGAAATGCTTTTACTGAGATAGGGAGAAAACTTCCTGGGGGGACTATGCCATGGAAAGAAAAAAATGTTGTTGATATTTCATTCTTAAGAATGAAGGTTGAAGAATTGGAGAAAGCTTGTCCGATTTCGGTTGATTTTTCAATATCTCAAAGAGTTCGAGTTTTAGTTATAACTGGCCCTAATACTGGGGGTAAGACAGTTTGTTTGAAGACCATTGGATTGGCAGCCATGATGGCGAAATCAGGTCTTCATGTTTTAGCTTCAGAGTCTGTACAAATCCCTTGGTTTGATTCTGTTTTTGCTGATATCGGTGATGAACAGTCCCTAACCCAATCTTTGTCTACTTTTTCTGGACATTTGAGAAAAATAAGCTTCTTAGCTACTCATGTCATGTGGGGGTTAGTACCTTTTTGTGAGTTGATAAGGGGCTTTGGAAGTAATTCTATTTTTGTAATTATCTATTTTACTTGGAGCCTCTTTGTCTTTCCATCTAGCTGCCTCGATCTGATTACTGATTTTCCTTGTTCTCTAAAACTCTTTGATTGCAGTATAGTAGAGAAGGCCCAAAAAGTAGATGATGTTACGGTGAAACAAGATATGAGAATTCTTATTCCATTCTATACAATAGAGTCGAATATATATAGGCATACATGGAAGCTCTAA

Coding sequence (CDS)

ATGAAAGGAGGGGTTGGTACCATCCTAGAGCCACTCTCTGCCGTTCCTTTAAACGATGAGTTGCAACAAGCAAAGGCATCAGTGGCAAAAGCTGAGAAAGATGTTCTCTTTATGCTAACTGAAAAAGTCAATGCTCGAGCATCTTATAGTCTTTCATTTGGAGGGACATGTCCCAATTTAGTTCTACCAGAAGGGCGCAACTCTTCTATTGCTAATGTCTGCTCATCAGGAGACCAAACATCTGAGGCATCGCACCCAAATAAGAATGAACGGGTTCTCTATTTACCAAATGCCCATCACCCTTTACTACTTCAGCAATACAGAGAAAATTTGGAGAATGCCAAGCGAGATGTCAGAAATGCTTTTACTGAGATAGGGAGAAAACTTCCTGGGGGGACTATGCCATGGAAAGAAAAAAATGTTGTTGATATTTCATTCTTAAGAATGAAGGTTGAAGAATTGGAGAAAGCTTGTCCGATTTCGGTTGATTTTTCAATATCTCAAAGAGTTCGAGTTTTAGTTATAACTGGCCCTAATACTGGGGGTAAGACAGTTTGTTTGAAGACCATTGGATTGGCAGCCATGATGGCGAAATCAGGTCTTCATGTTTTAGCTTCAGAGTCTGTACAAATCCCTTGGTTTGATTCTGTTTTTGCTGATATCGGTGATGAACAGTCCCTAACCCAATCTTTGTCTACTTTTTCTGGACATTTGAGAAAAATAAGCTTCTTAGCTACTCATGTCATGTGGGGGTTAGTACCTTTTTGTGAGTTGATAAGGGGCTTTGGAAGTAATTCTATTTTTGTAATTATCTATTTTACTTGGAGCCTCTTTGTCTTTCCATCTAGCTGCCTCGATCTGATTACTGATTTTCCTTGTTCTCTAAAACTCTTTGATTGCAGTATAGTAGAGAAGGCCCAAAAAGTAGATGATGTTACGGTGAAACAAGATATGAGAATTCTTATTCCATTCTATACAATAGAGTCGAATATATATAGGCATACATGGAAGCTCTAA

Protein sequence

MKGGVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEKVNARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTSEASHPNKNERVLYLPNAHHPLLLQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKISFLATHVMWGLVPFCELIRGFGSNSIFVIIYFTWSLFVFPSSCLDLITDFPCSLKLFDCSIVEKAQKVDDVTVKQDMRILIPFYTIESNIYRHTWKL
Homology
BLAST of Tan0020838 vs. ExPASy Swiss-Prot
Match: P73625 (Endonuclease MutS2 OS=Synechocystis sp. (strain PCC 6803 / Kazusa) OX=1111708 GN=mutS2 PE=3 SV=1)

HSP 1 Score: 110.2 bits (274), Expect = 4.8e-23
Identity = 61/135 (45.19%), Postives = 85/135 (62.96%), Query Frame = 0

Query: 111 LENAKRDVRNAFTEIGRKLPGGTMPWKEKNV----VDISFLRMKVEELEKACPISVDFSI 170
           L+ A   VR +F  +G   P    P  EK +    +    L  + E+      + +  +I
Sbjct: 279 LDLATARVRYSFW-LGAHPPQWLTPGDEKPITLRQLRHPLLHWQAEKEGGPAVVPITLTI 338

Query: 171 SQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQS 230
             ++RV+ ITGPNTGGKTV LKT+GL A+MAK GL++ A E+V++PWF  + ADIGDEQS
Sbjct: 339 DSQIRVIAITGPNTGGKTVTLKTLGLVALMAKVGLYIPAKETVEMPWFAQILADIGDEQS 398

Query: 231 LTQSLSTFSGHLRKI 242
           L Q+LSTFSGH+ +I
Sbjct: 399 LQQNLSTFSGHICRI 412

BLAST of Tan0020838 vs. ExPASy Swiss-Prot
Match: Q5WEK0 (Endonuclease MutS2 OS=Bacillus clausii (strain KSM-K16) OX=66692 GN=mutS2 PE=3 SV=1)

HSP 1 Score: 102.4 bits (254), Expect = 1.0e-20
Identity = 54/104 (51.92%), Postives = 72/104 (69.23%), Query Frame = 0

Query: 138 EKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMA 197
           ++  +D+   R  +   +K  P   D +I  +VR LVITGPNTGGKTV LKTIGL  +MA
Sbjct: 298 DRGYLDLRQARHPLLPPDKVVP--SDMAIGDQVRSLVITGPNTGGKTVTLKTIGLLTLMA 357

Query: 198 KSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKI 242
           +SGL V A+E  ++  F+ +FADIGDEQS+ QSLSTFS H++ I
Sbjct: 358 QSGLFVPAAEETELAVFEHIFADIGDEQSIEQSLSTFSSHMKNI 399

BLAST of Tan0020838 vs. ExPASy Swiss-Prot
Match: C0Z9F1 (Endonuclease MutS2 OS=Brevibacillus brevis (strain 47 / JCM 6285 / NBRC 100599) OX=358681 GN=mutS2 PE=3 SV=1)

HSP 1 Score: 102.1 bits (253), Expect = 1.3e-20
Identity = 67/180 (37.22%), Postives = 93/180 (51.67%), Query Frame = 0

Query: 90  ERVLYLPNAHHPLLLQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEK----NVVDIS 149
           ER+LY+        ++   EN E        A TE+        + W  K     + D  
Sbjct: 248 ERILYVLTEQVSFAVEALVENTE--------ALTELDFMFAKAQLAWSMKAICPRINDRG 307

Query: 150 FLRMKVEE---LEKACPISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLH 209
           ++ M+      + +   + VD  +    + +V+TGPNTGGKTV LKTIGL ++M  +GLH
Sbjct: 308 YVNMRKARHPLIPREVVVPVDVELGGEYQAIVVTGPNTGGKTVSLKTIGLLSLMTMAGLH 367

Query: 210 VLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRK-ISFLATHVMWGLVPFCELIRG 262
           + A E  ++  F S+FADIGDEQS+ QSLSTFS H+   I  LA      LV F EL  G
Sbjct: 368 IPAEEESEMTVFSSIFADIGDEQSIEQSLSTFSSHMTNIIQILAKMDDKSLVLFDELGAG 419

BLAST of Tan0020838 vs. ExPASy Swiss-Prot
Match: B8D298 (Endonuclease MutS2 OS=Halothermothrix orenii (strain H 168 / OCM 544 / DSM 9562) OX=373903 GN=mutS2 PE=3 SV=1)

HSP 1 Score: 101.7 bits (252), Expect = 1.7e-20
Identity = 50/106 (47.17%), Postives = 70/106 (66.04%), Query Frame = 0

Query: 138 EKNVVDISFLRMK--VEELEKACPISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAM 197
           E  + D  F+ ++     L K  P+ +D ++    + LVITGPNTGGKTV LKT+GL  +
Sbjct: 298 EPGINDKGFINIRGGRHPLLKVKPVPIDITVGNEFKTLVITGPNTGGKTVALKTVGLFVL 357

Query: 198 MAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKI 242
           M ++GLH+ A E   I  F+ V+ADIGDEQS+ Q+LSTFS H+ +I
Sbjct: 358 MVQAGLHIPAEEETVISIFNGVYADIGDEQSIEQNLSTFSSHINRI 403

BLAST of Tan0020838 vs. ExPASy Swiss-Prot
Match: B9KYW4 (Endonuclease MutS2 OS=Thermomicrobium roseum (strain ATCC 27502 / DSM 5159 / P-2) OX=309801 GN=mutS2 PE=3 SV=1)

HSP 1 Score: 101.3 bits (251), Expect = 2.2e-20
Identity = 49/101 (48.51%), Postives = 72/101 (71.29%), Query Frame = 0

Query: 146 FLRMKVEE-----LEKACPISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSG 205
           FLR+++       L++   + +D  + +R R+LVITGPNTGGKTV LKT+GL A+MA++G
Sbjct: 308 FLRVRLRAARHPLLDRRTAVPIDVELGERFRILVITGPNTGGKTVALKTVGLLALMAQAG 367

Query: 206 LHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKI 242
           L + A+    +  F ++F DIGDEQS+ Q+LSTFS H+R+I
Sbjct: 368 LFIPAAPGSGLSVFPAIFVDIGDEQSIEQNLSTFSSHMRRI 408

BLAST of Tan0020838 vs. NCBI nr
Match: XP_038894074.1 (endonuclease MutS2 isoform X11 [Benincasa hispida])

HSP 1 Score: 420.2 bits (1079), Expect = 1.6e-113
Identity = 212/239 (88.70%), Postives = 224/239 (93.72%), Query Frame = 0

Query: 4   GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEKVNARASYSLSFGGTCPNLVLP 63
           G G ILEPLSAVPLNDELQQA+ASVAKAE+DVLFMLTEKVNARASY LSFGGTCPNL+LP
Sbjct: 284 GTGFILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVNARASYGLSFGGTCPNLILP 343

Query: 64  EGRNSSIANVCSSGDQTSEASHPNKNERVLYLPNAHHPLLLQQYRENLENAKRDVRNAFT 123
           EG NSSIANVC SGDQTSEASH  KNE VLYL NAHHPLLLQQYRENLENAKRDV+NAFT
Sbjct: 344 EGCNSSIANVCLSGDQTSEASHSKKNEWVLYLQNAHHPLLLQQYRENLENAKRDVQNAFT 403

Query: 124 EIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGK 183
           E+GRKLPGG M WKEK VVDIS L+MKVE+LEKA P+SVDFSIS+R++VLVITGPNTGGK
Sbjct: 404 EMGRKLPGGNMSWKEKEVVDISLLKMKVEQLEKARPVSVDFSISRRIQVLVITGPNTGGK 463

Query: 184 TVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS 243
           TVCLKTIGLAAMMAKSGLHVLASES QIPWFDS+FADIGDEQSLTQSLSTFSGHLRKIS
Sbjct: 464 TVCLKTIGLAAMMAKSGLHVLASESAQIPWFDSIFADIGDEQSLTQSLSTFSGHLRKIS 522

BLAST of Tan0020838 vs. NCBI nr
Match: XP_038894019.1 (endonuclease MutS2 isoform X8 [Benincasa hispida])

HSP 1 Score: 408.3 bits (1048), Expect = 6.5e-110
Identity = 212/259 (81.85%), Postives = 224/259 (86.49%), Query Frame = 0

Query: 4   GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEK--------------------V 63
           G G ILEPLSAVPLNDELQQA+ASVAKAE+DVLFMLTEK                    V
Sbjct: 284 GTGFILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDINKLIGCIIELDVV 343

Query: 64  NARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTSEASHPNKNERVLYLPNAHHPLL 123
           NARASY LSFGGTCPNL+LPEG NSSIANVC SGDQTSEASH  KNE VLYL NAHHPLL
Sbjct: 344 NARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHSKKNEWVLYLQNAHHPLL 403

Query: 124 LQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVD 183
           LQQYRENLENAKRDV+NAFTE+GRKLPGG M WKEK VVDIS L+MKVE+LEKA P+SVD
Sbjct: 404 LQQYRENLENAKRDVQNAFTEMGRKLPGGNMSWKEKEVVDISLLKMKVEQLEKARPVSVD 463

Query: 184 FSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGD 243
           FSIS+R++VLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASES QIPWFDS+FADIGD
Sbjct: 464 FSISRRIQVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESAQIPWFDSIFADIGD 523

BLAST of Tan0020838 vs. NCBI nr
Match: XP_022985054.1 (uncharacterized protein LOC111483140 isoform X5 [Cucurbita maxima])

HSP 1 Score: 407.5 bits (1046), Expect = 1.1e-109
Identity = 206/239 (86.19%), Postives = 220/239 (92.05%), Query Frame = 0

Query: 4   GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEKVNARASYSLSFGGTCPNLVLP 63
           G+GTILEPLSAVPLNDELQQA+A+VAKAE+DVLFMLTEKVNARASY LSFGG CPNL+LP
Sbjct: 284 GIGTILEPLSAVPLNDELQQARAAVAKAEEDVLFMLTEKVNARASYGLSFGGACPNLILP 343

Query: 64  EGRNSSIANVCSSGDQTSEASHPNKNERVLYLPNAHHPLLLQQYRENLENAKRDVRNAFT 123
            G NSSIANV  SGDQ SEASHP +N+ VLYLPNAHHPLL QQYRE+LENAKRDVRNA T
Sbjct: 344 GGCNSSIANVYLSGDQISEASHPKENKWVLYLPNAHHPLLTQQYRESLENAKRDVRNAVT 403

Query: 124 EIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGK 183
           EIGRKLPGG M WKEK V DIS L+MKVE+LE+A P+SVDF+IS R+RVLVITGPNTGGK
Sbjct: 404 EIGRKLPGGNMSWKEKEVADISLLKMKVEQLEQARPVSVDFAISHRIRVLVITGPNTGGK 463

Query: 184 TVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS 243
           TVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSV ADIGDEQSLTQSLSTFSGHLRKIS
Sbjct: 464 TVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVLADIGDEQSLTQSLSTFSGHLRKIS 522

BLAST of Tan0020838 vs. NCBI nr
Match: XP_038894013.1 (endonuclease MutS2 isoform X7 [Benincasa hispida])

HSP 1 Score: 407.5 bits (1046), Expect = 1.1e-109
Identity = 212/261 (81.23%), Postives = 224/261 (85.82%), Query Frame = 0

Query: 4   GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEK--------------------- 63
           G G ILEPLSAVPLNDELQQA+ASVAKAE+DVLFMLTEK                     
Sbjct: 284 GTGFILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVWVKMDFEDINKLIGCIIELD 343

Query: 64  -VNARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTSEASHPNKNERVLYLPNAHHP 123
            VNARASY LSFGGTCPNL+LPEG NSSIANVC SGDQTSEASH  KNE VLYL NAHHP
Sbjct: 344 VVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHSKKNEWVLYLQNAHHP 403

Query: 124 LLLQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPIS 183
           LLLQQYRENLENAKRDV+NAFTE+GRKLPGG M WKEK VVDIS L+MKVE+LEKA P+S
Sbjct: 404 LLLQQYRENLENAKRDVQNAFTEMGRKLPGGNMSWKEKEVVDISLLKMKVEQLEKARPVS 463

Query: 184 VDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADI 243
           VDFSIS+R++VLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASES QIPWFDS+FADI
Sbjct: 464 VDFSISRRIQVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESAQIPWFDSIFADI 523

BLAST of Tan0020838 vs. NCBI nr
Match: XP_022922845.1 (uncharacterized protein LOC111430703 isoform X5 [Cucurbita moschata])

HSP 1 Score: 406.0 bits (1042), Expect = 3.2e-109
Identity = 204/239 (85.36%), Postives = 220/239 (92.05%), Query Frame = 0

Query: 4   GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEKVNARASYSLSFGGTCPNLVLP 63
           G+GT+LEPLSAVPLNDELQQA+A+VAKAE+DVLFMLTEKVNARASY LSFGG CPNL+LP
Sbjct: 284 GIGTVLEPLSAVPLNDELQQARAAVAKAEEDVLFMLTEKVNARASYGLSFGGACPNLILP 343

Query: 64  EGRNSSIANVCSSGDQTSEASHPNKNERVLYLPNAHHPLLLQQYRENLENAKRDVRNAFT 123
            G NSSIANV  SGDQ S+ASHP +N+ VLYLPNAHHPLL QQYRE+LENAKRDVRNA T
Sbjct: 344 GGCNSSIANVYLSGDQISQASHPKENKWVLYLPNAHHPLLTQQYRESLENAKRDVRNAVT 403

Query: 124 EIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGK 183
           EIGRKLPGG M WKEK V DIS L+MKVE+LE+A P+SVDF+IS R+RVLVITGPNTGGK
Sbjct: 404 EIGRKLPGGNMSWKEKGVADISLLKMKVEQLEQARPVSVDFAISHRIRVLVITGPNTGGK 463

Query: 184 TVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS 243
           TVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSV ADIGDEQSLTQSLSTFSGHLRKIS
Sbjct: 464 TVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVLADIGDEQSLTQSLSTFSGHLRKIS 522

BLAST of Tan0020838 vs. ExPASy TrEMBL
Match: A0A6J1JC76 (uncharacterized protein LOC111483140 isoform X5 OS=Cucurbita maxima OX=3661 GN=LOC111483140 PE=4 SV=1)

HSP 1 Score: 407.5 bits (1046), Expect = 5.3e-110
Identity = 206/239 (86.19%), Postives = 220/239 (92.05%), Query Frame = 0

Query: 4   GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEKVNARASYSLSFGGTCPNLVLP 63
           G+GTILEPLSAVPLNDELQQA+A+VAKAE+DVLFMLTEKVNARASY LSFGG CPNL+LP
Sbjct: 284 GIGTILEPLSAVPLNDELQQARAAVAKAEEDVLFMLTEKVNARASYGLSFGGACPNLILP 343

Query: 64  EGRNSSIANVCSSGDQTSEASHPNKNERVLYLPNAHHPLLLQQYRENLENAKRDVRNAFT 123
            G NSSIANV  SGDQ SEASHP +N+ VLYLPNAHHPLL QQYRE+LENAKRDVRNA T
Sbjct: 344 GGCNSSIANVYLSGDQISEASHPKENKWVLYLPNAHHPLLTQQYRESLENAKRDVRNAVT 403

Query: 124 EIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGK 183
           EIGRKLPGG M WKEK V DIS L+MKVE+LE+A P+SVDF+IS R+RVLVITGPNTGGK
Sbjct: 404 EIGRKLPGGNMSWKEKEVADISLLKMKVEQLEQARPVSVDFAISHRIRVLVITGPNTGGK 463

Query: 184 TVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS 243
           TVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSV ADIGDEQSLTQSLSTFSGHLRKIS
Sbjct: 464 TVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVLADIGDEQSLTQSLSTFSGHLRKIS 522

BLAST of Tan0020838 vs. ExPASy TrEMBL
Match: A0A6J1E9Z0 (uncharacterized protein LOC111430703 isoform X5 OS=Cucurbita moschata OX=3662 GN=LOC111430703 PE=4 SV=1)

HSP 1 Score: 406.0 bits (1042), Expect = 1.6e-109
Identity = 204/239 (85.36%), Postives = 220/239 (92.05%), Query Frame = 0

Query: 4   GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEKVNARASYSLSFGGTCPNLVLP 63
           G+GT+LEPLSAVPLNDELQQA+A+VAKAE+DVLFMLTEKVNARASY LSFGG CPNL+LP
Sbjct: 284 GIGTVLEPLSAVPLNDELQQARAAVAKAEEDVLFMLTEKVNARASYGLSFGGACPNLILP 343

Query: 64  EGRNSSIANVCSSGDQTSEASHPNKNERVLYLPNAHHPLLLQQYRENLENAKRDVRNAFT 123
            G NSSIANV  SGDQ S+ASHP +N+ VLYLPNAHHPLL QQYRE+LENAKRDVRNA T
Sbjct: 344 GGCNSSIANVYLSGDQISQASHPKENKWVLYLPNAHHPLLTQQYRESLENAKRDVRNAVT 403

Query: 124 EIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGK 183
           EIGRKLPGG M WKEK V DIS L+MKVE+LE+A P+SVDF+IS R+RVLVITGPNTGGK
Sbjct: 404 EIGRKLPGGNMSWKEKGVADISLLKMKVEQLEQARPVSVDFAISHRIRVLVITGPNTGGK 463

Query: 184 TVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS 243
           TVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSV ADIGDEQSLTQSLSTFSGHLRKIS
Sbjct: 464 TVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVLADIGDEQSLTQSLSTFSGHLRKIS 522

BLAST of Tan0020838 vs. ExPASy TrEMBL
Match: A0A6J1JCF6 (uncharacterized protein LOC111483140 isoform X4 OS=Cucurbita maxima OX=3661 GN=LOC111483140 PE=4 SV=1)

HSP 1 Score: 395.6 bits (1015), Expect = 2.1e-106
Identity = 206/259 (79.54%), Postives = 220/259 (84.94%), Query Frame = 0

Query: 4   GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEK--------------------V 63
           G+GTILEPLSAVPLNDELQQA+A+VAKAE+DVLFMLTEK                    V
Sbjct: 284 GIGTILEPLSAVPLNDELQQARAAVAKAEEDVLFMLTEKVKMDFEDINKLIGCIIELDVV 343

Query: 64  NARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTSEASHPNKNERVLYLPNAHHPLL 123
           NARASY LSFGG CPNL+LP G NSSIANV  SGDQ SEASHP +N+ VLYLPNAHHPLL
Sbjct: 344 NARASYGLSFGGACPNLILPGGCNSSIANVYLSGDQISEASHPKENKWVLYLPNAHHPLL 403

Query: 124 LQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVD 183
            QQYRE+LENAKRDVRNA TEIGRKLPGG M WKEK V DIS L+MKVE+LE+A P+SVD
Sbjct: 404 TQQYRESLENAKRDVRNAVTEIGRKLPGGNMSWKEKEVADISLLKMKVEQLEQARPVSVD 463

Query: 184 FSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGD 243
           F+IS R+RVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSV ADIGD
Sbjct: 464 FAISHRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVLADIGD 523

BLAST of Tan0020838 vs. ExPASy TrEMBL
Match: A0A6J1J709 (uncharacterized protein LOC111483140 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111483140 PE=4 SV=1)

HSP 1 Score: 395.6 bits (1015), Expect = 2.1e-106
Identity = 206/259 (79.54%), Postives = 220/259 (84.94%), Query Frame = 0

Query: 4   GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEK--------------------V 63
           G+GTILEPLSAVPLNDELQQA+A+VAKAE+DVLFMLTEK                    V
Sbjct: 284 GIGTILEPLSAVPLNDELQQARAAVAKAEEDVLFMLTEKVKMDFEDINKLIGCIIELDVV 343

Query: 64  NARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTSEASHPNKNERVLYLPNAHHPLL 123
           NARASY LSFGG CPNL+LP G NSSIANV  SGDQ SEASHP +N+ VLYLPNAHHPLL
Sbjct: 344 NARASYGLSFGGACPNLILPGGCNSSIANVYLSGDQISEASHPKENKWVLYLPNAHHPLL 403

Query: 124 LQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVD 183
            QQYRE+LENAKRDVRNA TEIGRKLPGG M WKEK V DIS L+MKVE+LE+A P+SVD
Sbjct: 404 TQQYRESLENAKRDVRNAVTEIGRKLPGGNMSWKEKEVADISLLKMKVEQLEQARPVSVD 463

Query: 184 FSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGD 243
           F+IS R+RVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSV ADIGD
Sbjct: 464 FAISHRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVLADIGD 523

BLAST of Tan0020838 vs. ExPASy TrEMBL
Match: A0A6J1J3U0 (uncharacterized protein LOC111483140 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111483140 PE=4 SV=1)

HSP 1 Score: 394.8 bits (1013), Expect = 3.6e-106
Identity = 206/261 (78.93%), Postives = 220/261 (84.29%), Query Frame = 0

Query: 4   GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEK--------------------- 63
           G+GTILEPLSAVPLNDELQQA+A+VAKAE+DVLFMLTEK                     
Sbjct: 284 GIGTILEPLSAVPLNDELQQARAAVAKAEEDVLFMLTEKVWVKMDFEDINKLIGCIIELD 343

Query: 64  -VNARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTSEASHPNKNERVLYLPNAHHP 123
            VNARASY LSFGG CPNL+LP G NSSIANV  SGDQ SEASHP +N+ VLYLPNAHHP
Sbjct: 344 VVNARASYGLSFGGACPNLILPGGCNSSIANVYLSGDQISEASHPKENKWVLYLPNAHHP 403

Query: 124 LLLQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPIS 183
           LL QQYRE+LENAKRDVRNA TEIGRKLPGG M WKEK V DIS L+MKVE+LE+A P+S
Sbjct: 404 LLTQQYRESLENAKRDVRNAVTEIGRKLPGGNMSWKEKEVADISLLKMKVEQLEQARPVS 463

Query: 184 VDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADI 243
           VDF+IS R+RVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSV ADI
Sbjct: 464 VDFAISHRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVLADI 523

BLAST of Tan0020838 vs. TAIR 10
Match: AT5G54090.1 (DNA mismatch repair protein MutS, type 2 )

HSP 1 Score: 209.5 bits (532), Expect = 4.1e-54
Identity = 123/266 (46.24%), Postives = 164/266 (61.65%), Query Frame = 0

Query: 4   GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEK--------------------V 63
           G GT  EP++AV +ND+LQ A+ASVAKAE ++L MLTEK                    +
Sbjct: 272 GGGTAAEPIAAVSMNDDLQSARASVAKAEAEILSMLTEKMQDGLCQIEVVLSYSIQLDVI 331

Query: 64  NARASYSLSFGGTCPNLVL-PEGRNSSIANVCSSGDQTSEASHP-NKNERVLYLPNAHHP 123
           NARA+YS ++GG  P++ L PE    S++   +S D    +  P +K E +LYLP  +HP
Sbjct: 332 NARATYSRAYGGAHPDIYLPPEDEVESLSAGENSPDINLPSEKPLSKKEWLLYLPRCYHP 391

Query: 124 LLLQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPIS 183
           LLL Q+++ +   +  V+                          F +     L  A PI 
Sbjct: 392 LLLYQHKKGIRKTRETVK--------------------------FHKTADTVLSGAPPIP 451

Query: 184 VDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADI 243
            DF IS+  RVLVITGPNTGGKT+CLK++GLAAMMAKSGL+VLA+ES +IPWFD+++ADI
Sbjct: 452 ADFQISKGTRVLVITGPNTGGKTICLKSVGLAAMMAKSGLYVLATESARIPWFDNIYADI 511

Query: 244 GDEQSLTQSLSTFSGHLRKISFLATH 248
           GDEQSL QSLSTFSGHL++IS + +H
Sbjct: 512 GDEQSLLQSLSTFSGHLKQISEILSH 511

BLAST of Tan0020838 vs. TAIR 10
Match: AT1G65070.1 (DNA mismatch repair protein MutS, type 2 )

HSP 1 Score: 103.6 bits (257), Expect = 3.2e-22
Identity = 46/83 (55.42%), Postives = 63/83 (75.90%), Query Frame = 0

Query: 159 PISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVF 218
           P+ VD  +    +V+VI+GPNTGGKT  LKT+GL ++M+KSG+++ A    ++PWFD + 
Sbjct: 384 PVPVDIKVESSAKVVVISGPNTGGKTALLKTLGLLSLMSKSGMYLPAKNCPRLPWFDLIL 443

Query: 219 ADIGDEQSLTQSLSTFSGHLRKI 242
           ADIGD QSL QSLSTFSGH+ +I
Sbjct: 444 ADIGDPQSLEQSLSTFSGHISRI 466

BLAST of Tan0020838 vs. TAIR 10
Match: AT1G65070.2 (DNA mismatch repair protein MutS, type 2 )

HSP 1 Score: 103.6 bits (257), Expect = 3.2e-22
Identity = 46/83 (55.42%), Postives = 63/83 (75.90%), Query Frame = 0

Query: 159 PISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVF 218
           P+ VD  +    +V+VI+GPNTGGKT  LKT+GL ++M+KSG+++ A    ++PWFD + 
Sbjct: 384 PVPVDIKVESSAKVVVISGPNTGGKTALLKTLGLLSLMSKSGMYLPAKNCPRLPWFDLIL 443

Query: 219 ADIGDEQSLTQSLSTFSGHLRKI 242
           ADIGD QSL QSLSTFSGH+ +I
Sbjct: 444 ADIGDPQSLEQSLSTFSGHISRI 466

BLAST of Tan0020838 vs. TAIR 10
Match: AT4G25540.1 (homolog of DNA mismatch repair protein MSH3 )

HSP 1 Score: 48.5 bits (114), Expect = 1.2e-05
Identity = 40/121 (33.06%), Postives = 59/121 (48.76%), Query Frame = 0

Query: 174 VITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLST 233
           +ITGPN GGK+  ++ + L ++MA+ G  V AS   ++   D VF  +G   S+    ST
Sbjct: 814 IITGPNMGGKSCYIRQVALISIMAQVGSFVPAS-FAKLHVLDGVFTRMGASDSIQHGRST 873

Query: 234 FSGHLRKIS-FLATHVMWGLVPFCELIRGFGSNSIFVIIYFTWSLFVFPSSCLDL-ITDF 293
           F   L + S  + T     LV   EL RG  ++    I Y T    +    CL L +T +
Sbjct: 874 FLEELSEASHIIRTCSSRSLVILDELGRGTSTHDGVAIAYATLQHLLAEKRCLVLFVTHY 933

BLAST of Tan0020838 vs. TAIR 10
Match: AT3G24320.1 (MUTL protein homolog 1 )

HSP 1 Score: 46.6 bits (109), Expect = 4.6e-05
Identity = 33/91 (36.26%), Postives = 49/91 (53.85%), Query Frame = 0

Query: 172 VLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSL 231
           + ++TGPN GGK+  L++I  AA++  SGL V A ES  IP FDS+   +    S     
Sbjct: 763 LFLLTGPNGGGKSSLLRSICAAALLGISGLMVPA-ESACIPHFDSIMLHMKSYDSPVDGK 822

Query: 232 STFSGHLRKI-SFLATHVMWGLVPFCELIRG 262
           S+F   + +I S ++      LV   E+ RG
Sbjct: 823 SSFQVEMSEIRSIVSQATSRSLVLIDEICRG 852

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P736254.8e-2345.19Endonuclease MutS2 OS=Synechocystis sp. (strain PCC 6803 / Kazusa) OX=1111708 GN... [more]
Q5WEK01.0e-2051.92Endonuclease MutS2 OS=Bacillus clausii (strain KSM-K16) OX=66692 GN=mutS2 PE=3 S... [more]
C0Z9F11.3e-2037.22Endonuclease MutS2 OS=Brevibacillus brevis (strain 47 / JCM 6285 / NBRC 100599) ... [more]
B8D2981.7e-2047.17Endonuclease MutS2 OS=Halothermothrix orenii (strain H 168 / OCM 544 / DSM 9562)... [more]
B9KYW42.2e-2048.51Endonuclease MutS2 OS=Thermomicrobium roseum (strain ATCC 27502 / DSM 5159 / P-2... [more]
Match NameE-valueIdentityDescription
XP_038894074.11.6e-11388.70endonuclease MutS2 isoform X11 [Benincasa hispida][more]
XP_038894019.16.5e-11081.85endonuclease MutS2 isoform X8 [Benincasa hispida][more]
XP_022985054.11.1e-10986.19uncharacterized protein LOC111483140 isoform X5 [Cucurbita maxima][more]
XP_038894013.11.1e-10981.23endonuclease MutS2 isoform X7 [Benincasa hispida][more]
XP_022922845.13.2e-10985.36uncharacterized protein LOC111430703 isoform X5 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
A0A6J1JC765.3e-11086.19uncharacterized protein LOC111483140 isoform X5 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1E9Z01.6e-10985.36uncharacterized protein LOC111430703 isoform X5 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1JCF62.1e-10679.54uncharacterized protein LOC111483140 isoform X4 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1J7092.1e-10679.54uncharacterized protein LOC111483140 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1J3U03.6e-10678.93uncharacterized protein LOC111483140 isoform X3 OS=Cucurbita maxima OX=3661 GN=L... [more]
Match NameE-valueIdentityDescription
AT5G54090.14.1e-5446.24DNA mismatch repair protein MutS, type 2 [more]
AT1G65070.13.2e-2255.42DNA mismatch repair protein MutS, type 2 [more]
AT1G65070.23.2e-2255.42DNA mismatch repair protein MutS, type 2 [more]
AT4G25540.11.2e-0533.06homolog of DNA mismatch repair protein MSH3 [more]
AT3G24320.14.6e-0536.26MUTL protein homolog 1 [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 104..124
NoneNo IPR availablePANTHERPTHR11361:SF132DNA MISMATCH REPAIR PROTEIN MUTS, TYPE 2coord: 4..243
IPR000432DNA mismatch repair protein MutS, C-terminalSMARTSM00534mutATP5coord: 170..311
e-value: 5.5E-8
score: -10.6
IPR000432DNA mismatch repair protein MutS, C-terminalPFAMPF00488MutS_Vcoord: 173..261
e-value: 5.9E-12
score: 46.0
IPR027417P-loop containing nucleoside triphosphate hydrolaseGENE3D3.40.50.300coord: 133..275
e-value: 5.0E-31
score: 109.8
IPR027417P-loop containing nucleoside triphosphate hydrolaseSUPERFAMILY52540P-loop containing nucleoside triphosphate hydrolasescoord: 161..247
IPR045076DNA mismatch repair MutS familyPANTHERPTHR11361DNA MISMATCH REPAIR PROTEIN MUTS FAMILY MEMBERcoord: 4..243

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0020838.1Tan0020838.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006298 mismatch repair
biological_process GO:0045910 negative regulation of DNA recombination
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
molecular_function GO:0005524 ATP binding
molecular_function GO:0016887 ATP hydrolysis activity
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0030983 mismatched DNA binding