Tan0001692 (gene) Snake gourd v1

Overview
Name: Tan0001692
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: endonuclease MutS2 isoform X1
Location: LG01: 308927 .. 344180 (-)
RNA-Seq Expression: Tan0001692
Synteny: Tan0001692
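
The Location field packs the linkage group, the start and end coordinates, and the strand into one string. A minimal sketch of how it could be unpacked is given below. It assumes the coordinates are 1-based and inclusive, which this record does not state; the names `Location`, `parse_location`, `span_length`, and `reverse_complement` are hypothetical helpers written for illustration, not part of any database API. Whether the sequence shown on this page is already oriented to the minus strand is likewise not stated, so the reverse-complement helper is included only as a general utility.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Parsed form of a location string such as 'LG01: 308927 .. 344180 (-)'."""
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for '<seqid>: <start> .. <end> (<strand>)' as written in the overview.
_LOCATION_RE = re.compile(
    r"^\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$"
)

# Translation table for complementing unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def parse_location(text: str) -> Location:
    """Split a location string into sequence ID, start, end, and strand."""
    match = _LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


def reverse_complement(seq: str) -> str:
    """Reverse-complement a DNA string (A, C, G, T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


loc = parse_location("LG01: 308927 .. 344180 (-)")
print(loc.seqid, loc.start, loc.end, loc.strand, span_length(loc))
```

Under the inclusive-coordinate assumption, the span for this gene works out to 344180 - 308927 + 1 = 35254 bases.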
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACAGGAGATACAGTCAGTTTCAACTAGTCAGTCGTTAGTGCTACTGGATGAAGTGAGTATTTTGCTCGGTTCAAGAACAGAGTCAAAACAATCATCAATTTAAAGGAAAAGAAAATAAAAGAACAAATAATAATGATTGATTCTAGTCTAGTTGTTTTGCGTGGTAGGTTGGTGCAGGAACCAATCCTCTGGAGGGAGCCGCACTTGGGATGTCACTCCTGGAATCTTTGGCTAAATCTGGTGCTTCATTGACAATCGCGACTACACATCATGGAGAACTTAAAACCCTAAAATATAGGTACAGTAAGCAAATTAAAGATAAAATTTGAATGCTAATGATGACAGGTCTTTTCTTTTCTTTTCTTGAGGTGGATAATGCACGTTTTAGCTGCTCGACTTTTTGAACTAATATAAACCTTGTTGTTTAAAAAATAATCAAAGCATTCTTTTGAATATATATACTTTTTAGTCTTTAATTTTATTTAAATGTTAATCATCGAACAAAATAAAGAGCAAGAAAGTGTATTCTTTTTCCTGCTCAAAGATTTTGTTTTTTAAACGGTTGAGGTTTTTCTTTACAGCAAGGAGGTCTTTGAAAATGCGTGTATGGAATTTGATGAGGTGAACTTAAAGCCAACTTACAAGCTTCTCTGGGGAGTACCAGGTTTTCATTATCTTTTTCTATTTTTTTTTTTTTTTTGTTAACGAGAAACGAAAATTTTCCATTCAATAAATGAAAAGAGACTAATGCTCAAACGATACAAACTCCATAAAAGAGTGAAACAAGAAAAAAAAAGACAAAGGAAAAATAAAAGCATCCCAAGGCAAACATATATTCTCCTTTTGCAATGGGGAAGAGGCTGTCAAATGCCAAGAGAAAGATCTCCCCGAGAAAACACGACTGAAGAGCACCAAAGCTCATCCAAGACCATGAAAAACTTCCAACAAGCCAAAACAACTACACCCATCAAGCTGCACCAAACTTCGAACCAGAAGAAAGTACATCTAAAAATGAGGAAGTGCACTAATTCCTTCTGCACAAGACTCCTCGAATAGCCCTTCAGCCCCATTAAAGGAAGCCTCGGAAGAAACACCGCCACCTAACGAGTATCATTTTCTCGCAACTTCGTATACCTTGCAAATCCAGACACTGGTTGCAAAATGTGGCCATGTTTTTTAGCAAGACCTTCAGCTGGTTTCCCATTCTTAGAACGAAAATAAAATACTTTGTCCCACTCCGATGGCCTCTCACGGTTTATCCACATACCAAACGAAAATATCACAATCAAGATAAATTGCAAAACTGATTGAGTTCAACAAACTTCCTCGAGGCTCTTGAATTATCAAAAAGAACATCCCATTCCTTAGTAATGAAGAAGTGATTTATTAAGGATCGAGAGCGATTATCACCATTTCTTGACCACACAATTCTGCCATGTGATAGAAGAATTTCCATTAAACCGAGCTCTTCAATAACCTTGATGAGTCAAAAACTCCTATCTAAAGAAGACAATTTGGATTCTTGGATCAGTACTAGAGAATTTTTTAAAAAAAAAAATTAACTCTTTGGAGCAATCTCCCAATCCTCTTGTATTTCAGAGATGATCTTCATTGAAATACAAGTAATATCAACCAATGCTCAAGCTTTCTGACTAGCCCCTTCTGAAACTAAAGGGATAATCTCTCTGACCTGCAAACTGCAAACTACTAGCCTTGATTTAAGAGGCAAACTTGTCTAGATTAGGAATCGGCAAAGCTTCAAAATTTGAAAGCTGATTTTATGTCTTAAGTTTTGAGAAATAGAGCAATCTGACTTGTTAGGTGTCTGAAAAAGAGCATTTAAATCAGCAACATTTGTATCCTCGATCATTTCATCTTCTAATTTTTTTTTTTCTTTTTTTGATAAGGAAACAACAAAATTTCATTTTTTTTTGACAAGGTAACAATAAATTTCATTGATGTTATGAAAAGAGCAGAAGCTCGAAGTACAAATCAAA
ATGAGTAACAGCTCAAAGAATAGGACAAAATAAAGCACCCAAACAGAAACCCCTTTGAAGAAATTAAAAACGAAGGTACAGTAAACCCCTCCAAATAACCAACCAAAACAAATCCCATAACAACAAAAGTCAAATGAAAGCCATGAAACAACTCGAAGCCTCAACATCCAAACCCTCTGAAACGAACAGCGAAACAAGAAACAACCTTCAGAACTTCTTCCATCACAAAACTAGCTCCAAACCGCCAGCTTAACCACCCTGAAAACCGAAGACTGTTATCCCCTCCGAACAACCAAAGACCAAATAAAAATAAAACCACCCCTCAAGCACCAAAGTCCTTCACAGAAACCCGCAAGCGAAAACCGCCCAAACCACAACTACTTGCATACTTCTAGCAACCCTTCCCTGATGACAATCCCCCCCCCCCACCCCCAAAGGCCTCGGCCCCTCACAAGCAAACCATCACCAAGCCACCCCCAAAACCCAACACACAAAACACAACTGCCAACTTAGATTCGAAATTCGCTTAAGGCTCACCCTTATAGCTCCTTCACCCAAACCTCTCGCAAACCAGTAAACAACTTTCAAAATGCGATGAAAATCCCCAACTTTCACCCCTTGAATTGCAACCCACTTGCCTTAATGAGGGAGGCAAACCGAGACAATTCTGTAGGAATAATATAGTCACCCTTTCTACCTCTGCCTTTCCTTGTTGCCCTTAGGGTGACTCAAATAAACTTGCTAGATTGACAACCGGGGAATCTGCCCTCCACACCTCCTCCACTCTCCTTCTCCCCGACTCACAATTAACACTGCTCAGACTCTGAATTGACCACCCATCTGACTCATGGCACCCATTGACCGGGCTAGAGAACCCCTTGGCACACCCCTGAACCGTTTGAAAAGAGGCTCTAACAAAAGATACCTCCCTTTTACCTAAGATGCACTCCTCCACTTCAAAACTAACCGAATGTTCGGCTTCCCTTGCGAACTTCCTTGATCTTTGCATCTCAGGGGTAAGGCCAGCCGGCTGCTCAAGAAAATCCGGCCAAACATCTTCAACCAGAGAAATTTGATCTACAACCGCCCCCTTCTCATGCCTCCTTACATAAACCTTCCCCCATAATTTATCGGTCTCCTCCCTTTCCTTCCCCAAAGCAGTTGAGAGAATCAAGCAATCGACAACCACCACTTCTTCTCCTTGAACAACATCACCCCCTATACCTCTATCCTCTTCATTTCCCACCCGCAAGCTAATCTTGCCCTCTTCTTCCACCATTGCCTCCTCAAAATTGGAAATCAAGCAGCAAGAATTACAAAAGACCCATTCCTCGAGAAGGCGCCTCTTCCTCACAACCCCTGCCACTTCCCCCGAACAACCCCCTTCAAATGCACCCCCTCCAAGGCAACTTCATTAATCATCCCAAAAGGTGGCTTCAACTCCCTTGAACCACCGCAACCCTTCTCATAGCTCGCCACTGGCGACATTTCACTGCCCATAATCACACCCCCCACACTCCTTACTTCCGTTAAGAAGCAAGAAGATCCCCTAACATCCTCCTGAGGCTGAGACAAACCAGCCCCCCTCTCATCCTCAAAGACTTGAAGAAATCGATCCAAATCTAAGGAATTGGAAGCGTCCTTCATCACCCATGGCATCTCTAACCCCACCAGAGGCACCACCCTATCAAGAAGTTCAAAACTCAACGAAAATACTTTTAAATCCAACCCTGAAACCGGAATAGATAAAGGCATAAATCCACAAATATTAGGAGCCACCTTGATCCTTGCCTTTGAAACGTCCAACATATTCAACATTTCTTGAGCAATTTCCTCCAATCCTCCAAAGTGATCTCCAATTGCCTCGAAAACGCTCATTTTCCACAGTTTAAGAGGCAAATTCGAAATGGCCAACCACCCCCCCCATAGCCCCCCATGAACTCCGGCAAACTATGAGAATTTTCATCCCATTTTTTGATCTTCAAATGTAACCCTCCAAC
CACCTGTCATTTCACCTCGATCGGCCTCCAGTTATCTTGTTCACTATCAATAATGCAAATGGCCTTATCAACAAACAGAGGATTAATTTGGATGATCACTCCAAAAAAGTCCTCCAACTCCTTCTTAATCAACCCCCACCGTACATGGGCAAACAACCGGTTCACCACCCTAACCTTGGAGAAGTCCATCCCCACCACATCAAAATCTTTTTTAACCCAATGGTACTCAGAATTAGGATGAAGCAGGGGCTCAGTAGGACCAAACACTGGCAAACCCAACTCTCCAACAACTTCGAAGTCACCACCATTAGAAACAATGCCAACCGACTCCTCTACCACCCTGTTACTCAACCCAAAACCATCGAAGAAGACTAGAAAATCCTCCAACATTTCCTTGAAAACCAACCATCCTTTCAAGAAACTACCATTCGGAACCCTGAAAATCGAACGTCCCCCATGAGAAGGCCAAATCGCTCCTTCGCCACCCAGCCTTCCCTCCTTCAGAATTTGAACAACCCCAACCTAGCATCGCCCACCTTACCCTTCAGCCTACTGAACGAGGATACTCCTCGTGCACAGACCCATTCGAGTGCCTCCCAGAACCAACGGACTTGTTCCACAGAGAGCTGAAAGTGTTTTAAAAAGCCCTCCGAGGCGCAGGGCGCAACTTCAGCGCCTCGCCTCGAAGAGGCGAGGCGCATCTAAGAAGGCGCGCGCGAGGTGTCACGGTCATGCTTGTCCAAGGCGTGTTGCCCATGGCCGCATGGCCGTCCCCATCCTTTTTCATTATGTTTTCATGTATGCTTTTGCTTTAGCATTGTTATTTCATTTCCATGTATTTTCGTGGGCGTATGACTCCTCGCCAAAATGACGACGTGCAAGGTGTCATAGCCAAAATTCCTAGAATGTACTATAGGGGGATGCGAACATTCATCAATTCTCCCTACCAAGAGTTATATGCTAGATAAGTGTGGCTTTCTTATTATTGCTTGTCAGCTGTATAACTAACTTTCTAAACTCTCACAAACGTTTTCCAAATTGTTTTCACAAATTCTCTCTGACCCCGTTTTCTAAAACCTTTTCTCAAGAACGGGGGTGGAGGTTGCGACATAGGGCCGTTGTGCCGCCGGCAATCCGAATTTGAGTTGGCTTACTTGCTCGTGAGCGAGAGCAAGTGCCGCACGGTCGTGTCGAGACCCTTTCAAAGGTGCGACCGTGACAGTTGGTATCAGAGCCTAGTTGGCTTCGGGAAGGCAAGACACAGTAATCATGCCTGCAAACAAATTAAACAAGTCGGACAAAGAGCGGTTGGCCGAGATCGAGGAGCAATTGCTTCTCTTGACGGAAGTCCCTGATAATATTCGGTCCTTGGGATCTCGACTGGAAGAGATTGCGGCCAAGGCCGACGCGATTGATGCGATTGCTGGACGCATCGAAGGAATGCCCGTCTATGAATTGATGGAGAGAGTTGAGACCCTCGAGGACAGGGTAAATAGATGTGAAAGCTTCGGGCGTGGTGCTAGCTCGTCGGGCGGCGTCGCCCAAATTGAGGAACGTGTCGATGGTCTGGACAGTTCCCACAAAGAGTTGGGTCGATGTTTTCTCGACTTTTACGTTGATGTGAGAGATGCCCTTAAGATCTGTAGGGGCGAGATAGGTGAGATGGCCGCAAAGCTAAATCTCACCATGATTGCGATGGGAAGTCAAGCCTCACCTGGGGGCGCGACCAAGTCGGGGAAGAAGAAGATCCCCGAGCCGAAACCCTTCTGTGGGGTTCGAGATGCCAAAGTGTTGGAAAACTTCATCTTCGATATGGAGCAATACTTTGAGCAAATGAACACGGTGGTCGATGACGAAGAAAGAGTCACCTTAGCAACAATGCATCCGGAAGACGATGCCAAACTATGGTGGAGGACCCGTTATATGGACATCCAAGAAGGCAGATGTACCATTAACACATGGGAGACGCTAAAACAAGAATTGCGCTCGCAATTCTTCCC
TGAGAATGTCGAGATAATTGCAAGGCGCAAGCTTCGAGAGTTGAGACATAAAGGTGCCATCCGCGAATATGTCAAACAGTTTGTCGGGTTGATGTTGGACATCCGAGATATGAGCGAGAAGGACAAGATCTTCTCCTTTGTCGGCGGGCTACAACCATGGGCGAGGGCTAAGTTGTGCGAAAGCGAATGAGTACAACACCGGCCAAGCATATGCAAAGGCAATGAACGATCGCTTTGACTCCGAGTGGTGATGTATTAATGCACGCTTCCAAGAAGCGGGGACTGAGCAAAACCTCTCCCACGGGGAGAAGATCGATCTCATCACCCTAACTCTCCCAAGAGTGGGGAGAGATAGGCCTAGAAGTTCGGGAAGGGACAAAGAGGAAATCCCCACCAAGGAATGGAAATTGCGTGGCGAGACCACATAACAAGAACAACAACTACCGAAGTGCGTTCTTTGTGACGGCCTCACCGAGTGAGTGATCGCCCAGAGTAAAATAGCCCTCAAGGCCCTTCAAGCGTCCTATGAGGCAAAGAAAGGAAAGGAAGTCGACGATACGGAAATTGAGTCCGTACAAGACGAAGACCTAGAGAATCCGCGCATTGGCGCCATAAGATTCCCAGCCCTTCAGAAGAGTGGGGGAGAGACAAAGGGCCCATCCAGTAAGGGCCTCATGTACGTTGACGCACATGTCAACAACAAAACAACCAAGAGTACTATGGTTGATTCGGGAGCCACCCATAACCGTCTCCTCGCGAGGCGAGGCGATTGAACTTGTCCCGGAGCCGAGATCCGGAAGGATGAAAGCCAGAATTCAAGACCACCCGCCTATTTCGGGAATTGCGGGCGTCGCTCGGTCAAGATGGGGGCCGGGGTGGTCGCGTAGACCTTGTCGTCGTCAGGATGGACGACTTCGATGTTGTGTTGGGAATGGAGTTCCTTCTTCAACACAAAGTCATACCTGTGCCCCTAGCGAGTTGTCTAGTGATCACAGGAGGAAATCCCGCCGTAGTGCACGCACGCGGGTTGAAGCGACCCTCGGGCGCGCGAGTGTTATCGGCCTTGCGGTGAGCGGGCCCAAGTGAATGCCGTGGCGAGATGCCACGAGTAATGCACCATCGCGACGAGAGAAGCGGTCGGCTACACCTCCCCGAAAGAGGGGATGAAAGAATGCTAAAGTGCAAGAGTGCACACAAAAGAGCGCCGAGGCGCGGTCGAGGGACACACGCCGAGATGCCCTTCCAAGAATTGAAGCCAAAGCGGCGAAGCCCGTGAGTGCGAAGTCGGACTACCACCGGGCTTGGAAGACGCAATCACAAAGCGTGTGTGCGAAGGCGCGGCGACGTGACAAGCCCGCCGATCAAATCAAGAACTCCTTTGAGGTCCTAGGCGGCGGACGGAGTCGAATCTTCGCCCCATATGAGAAGGTGCGGCAGCTTCTTGCGAGATGTGGAAGGAAGCCGTGGGAGCGTAAGTCCGCCGGGACCGAGTTGAAGAATTGAAGACCGCCACCCGTGACCATCCTGCTTCGAAGAGTGTCGGTGACGAGGCGTCAACCAATTAAGTGGGGAGAATGTCACGGTCATGCTTGTCCAAGGCGTGTTGCCCATGGCGCATGGCCGTCCCCATCCTTTTTCATTATGTTTTCATGTATGCTTTTGCTTTAGCATTGTTATTTCATTTCCATGTATTTTTTGTGGGCGTATGACTCCTCGCCAAAATGACGTGCAAGGTGTCATAGCCAAAATTCCTAGAATGTACTATAGGGGATGCGAACATTCATCAATTCTCCTACCAAGAGTTATATGCTAGATAAGTGTGGCTTTCTTATTATTGCTTGTCACCGTGTATAACTAACTTTCTAAACTCTCACAAACGTTTTCCAAATTGTTTTCACAAATTCTCTCCGACCGCTTTTCTAAAACCTTTTCTCAAGAACGGGGTGGAGGTTGCGACATAGGGCCGTTGTGCCGCCGGCGATCCGAATTTGAGTTGGCTT
ACTTGCTCGTGAGCGAGAGCAAGTGCCGCACGGTCGTGTCGAGACCCTTTCAAAGGTGCGACCGTGACACGAGGCGCGCGCCTTCCATGAAGCCCTCGAGGCGCGCAAGGCGCGCGCCTCGGGGGCTTCTTCAACTTTTTTTTTTTTTTTTCCGTTTTAGAGCCTTGAGACGTTTAAAACATAATAGAGCCTGAATTATTTGGTGTTCATAAAATATAATAGAGCCTTGAGACGTCGATATAGCATTCGAGATGTCATTGACCCCATTTCTTTAAAAGATATTGATGATAGTAATGAGTGGTTGATTGGAAGAATGGATGATGATTCTGAAGAGGATAATGATCTTGTCTTCGATGATGATTCTTTAACATGGGGTGATGTTTCAAGAGTTGCCGGAGCAAAAGAATCTACACACCCTACTATTCTAGAGCTAAATCAAGAACCAAGAATAATGGTTCATGTTCATCCTCCACGCCCACACAACCATTGCCACGAGATCCACAACCCATGCCCCCACCTCATTAACTTGGATGACTATGAGAATGAGACTGATGAAGAAGATCTGAGATGGCTATATGTCCAATGATGAAGAGAATGCGGTTAAAGAAGAAGATCTATTTAGTGAAGAAGATATTGATATTTAAGGCTTTTGATCTTGTAGTGTTTATAATTTATTAGGTTGTGTATTGTAATTGTTTCATGTTTTTAATAGAGTATCTATTGTCTCTATGTTCAAACATTAATACTTGAAAGTTGAACCTTATTTATTTTACTTATTTTCCTTGGTTAAAACTTAATTTCATCCCATGAAGAGTTTTTTCTTATTGTGTATGTATACATATATATTTTATATTTTTTTTTATATATAGTGTGCCTTAGATAAAAAAGCCCGCGCCTTTTGGTGCGCCTTGCGCCTAGGCTCCAGAGGGCTATTGCGCCTTAGTGCGCCTTGAGCCTTTAAAAACACTGGGCTGAATCACCTTCTCGTCCAACTGCCCTGAATAACGACCGTCTCACCTAACATTTCCAACCGGAAATCACACCCTCCTGCACTGCAGCTGAAAATTCTCATCCTCAGAGCCAGCACAAGCACCAACACCCAACTAAACGTGCAAAGATTCGGACAAAACCCGTTGCCTTACCACTTGACCCAGCCAAAGAGAATGAAAGAAAAAGAAGTGAGAGGCCGAAGAAGAAAGAACCAGATGACCGGAGAAGCAAAAGGATGGTCGAATCAAACGAAAAAAAACAATAAGACAGCCGATGGAAACAAGAGAAGAACCCACCAGCGACACCAGGGAGGAGCGACTCCAGCAGCAATGAACGACTGAGAGACTGAGAAAGGATAGGAGACGACCGGCCAACCGGAAGGGAAAACAACCGATAGAAACCTAAAGAAGACGAAAGGTTAAAAAAACCACTGCGGCCGGAAGATTAATCGCCGGACACGAACTAGCGTAAAGTAACCCAGGACACCGAAGCCGAGAAAAGCAAAAGGAATCGGCCGGAAACTTAAAGGACAGAAGCGGGGGGACTAGTTTTTAAACAACAATATTTCATTGAAATAAAGAAATGAGACTAATGCTCAAATCATTTCATCTTCTAATAAATCATCTTCTAATTGATTTCAATGGAGATTATGATCAATTTCAACACTGCTTACGTTGACCTCTTATCCATTATCTGAATCATCGAGAATGGGGAAGAATGATTAGCATTTGGTGAGGTTGCTTTAGAAGAGAATGGAGAAACAGTGACAAACCTAATACCTTTAGGCCCAATATTTCAATGCTCCGCCAACAAATTTTGTGAAGATGTCTTTAAACTCGATCCACTCATTAACTTCTCTGTCAAGATGTCTACTAGGGAAAAAGAATACAAAGCGTTCATTTCACGATCAAAGTGTCTGTTCATGCTCGTGAAAAATGATTTGCTAGTTTCAGTTGAGGTGTTTGCTTTTTAAGTATACCAATTGCATGTCGTGTCGTAGATTCTATAAT
GTTTTTTCTCTCAACAAACCTAACACCATAAACGGGTGAAATTATGTTTATAGGGCGTTCAAATGCTATTAATATAGCTGAAAGGTTGGGGTTGCCTAGTTCTATTGTAGATGATGCTCGTGAACATTATGGTGCGGCAAGTGCACAGATAGATGAGGTAGTATTCACTGTTATAATATATATTGTTGTCCTTTATCGTAATTATGCAATCTTTAGTATCAGGATTTCTTTGTATACCTTTATATATGTCATAGAACTCTAATGTAAATAGAACATATATTTTTCTCAATTTTGTTTTATATGGTATCAGAGCCAGCTCCTAGGGTTTAGGGTTTTCATTTGGCAATTAGTGTTTTTTCCTCCTAAGTAAGGTTAGGGTTTGAAGTGTCGGTCATCATCTTAGGGTGGTCATTTGTCGACACCACCTACCTTAGGAGGAGCGTCGCCGTCAGAAAAGCAAGTTCGTCGGAAGCCGCTGCGCGTGAGGCTGACGTGCCGCCCGAAGTTACTGCCTTCACCCCCACGCGCCGGTGCGTGGGATCGCGTGACCGTGTCGATCAGTTCGCCGCCGCTTGGGTTTGTCTTCCTTTGGTATTTCTAGCCTGTTTGTCACCTTGGATTCGGCATTAGCCCCTTGGTTTGTCTCGGTTTTTTTGCTTTTAGATGAGATTAAGCCACTTTTTGGTACTGTTCAGATTTGTTTTTTGTTTTACATGACGTTCACCTACTGTTTCTTTAATGATGGTTGAGCTGAAACCTATGGTTGTACCCAATTTGGTTCCCCTCTCGTCAAAGATCACGGAGCATAAGTTGAATGGTTAAAACTATTATGATTGGAGTCGAACAATTAGGTTCTATTTACGAAGCACTGAGATGGATGATCATATCGATCAGGATCAACCAGAGGATGACACAAGGAGGACTTGGTTGCGCGATGACACTCGATTGTATATTCAAATCAAGAATTCAATTGAGAGTGAAGTGGTTGGATTGGTAAATCACTGTGATTCTGTTAAAGAACTACTTGGATTTTTAGAATTCTTATATTCAGGAAAAGAGCAAGTTAACCGAATATATGAAGTCTGCACCGCTCTTTTTCATGCGAAACAGGGAGACCAATCTGTCACGAATTATTTTCTGCAAGTTAAGAAGACATGTGCAGAGTTTAATACTTTACTCCCATTTAGTGCAGACATAAAGGTTCAACAAGCTCAAAGAGAAAAGATGGGTGTCATGAGCTTTCTTAATGGCCTCTCCTCTGAATTTGATATGGCCAAATCACAAATTCTCTTAGGTTTTGAGATCCCCTCTTTGGACGAAGCTTTTAGTAGAGTTCTCCGTACATAAAAGTTTTCGTCTGGTTCATCTGCTTCTCAGTCCAACAGTGCTTTAGTTAGTCGCAATGATTCTAAGGTCACTAAAGGAATGAATGGTAAGTCGAGTCATTAGAGACTTGGTTCTATTCATCAGAAACTTGGTTTCGATCATCGACGACAAGAGTCACAAGAGATTATATGTAACTATTGTCGTAAGCTGGGTCACATGAAACGTGACTGTTGGAAGTTGTTGAATAGAGGGCAAAGGCCTAAAGTAGCTCAGCTTGCATCTACAAATGATATACCTGAGAGGACGGTAATTATTTGTGTAGACGATTTTGCTAAATTCAAGTTGCAGGACTCATTACAAGCATCTTCTTCATCGAATCTTATTGCAACTATTGTTGAATCAGGTAAGAATCATTGTCTCCTTATATCTTCCACCAAATGGGTCATCGACTCTGGTGTCACGGCTCACATGACAGGTAATTCTAGCCTATTTTGTACCACTTTGTCCTCTGCACCATTTCTAAATGTCACTTTGGCAGATGGTTCGACATCTTCTGTTCTTGGATCCGGCACCATTAACCTCACACCCTCGCTTTCTTTATCTTCTGTCTTACATTTGTCTCGACTTTCATTTAATTTGCTTTCTGTTAGTAAACTCACTCATGACCTTAATTG
TTTTATCTCATTTTTTCCTGGTTATTGCTTGTTTGAGGATCGTATGACGAAGAAGATTATTGGTAAAGGATATGAGTCTAAAGGTCTTTACATTTTTTATCAGCAGATACCGAAAGTTGTGGCATGCTCTGGTGTTGCTTCTCCGTTTGAGGTCCATTGTCGTTTAGGTCATCCATCCTTGCTCGTATTAAAGAAAATGCATCCCGAGTTTCATTCTTTATCTTCATTGAATTGTGAGTCATGTCAGTTTGCAAAGTTCCATCGTCTTAGTTCAAGTCCTAGGGTCCATAAACGAGCTAGTGCTTCGTTTGAATTAGTTCATTCTGATGTATGGGTCCGTGTCCTGTAATGTCCAAAACTGGATTTCGATATTTTGTTACTTTTTTCGACGATTATTCTCGTCTGACTTGGTTATATTTAATGAAAAATCATACTGAGTTACTCATTTTTGCAATTTCCATGCTGAGATTTAGAATCAGTTTAATGCCTTTGTTAAAATTCTGCGAACTGATAATGTTGGTGAATATTTTTCTAACATGTTGGAATCATACTTGTGTACAAATGGCATTCTTCATCAATCTTCTTGCGTTGATACTCCATCCCAAAATAGGGTCGTTGAATGGAAAAATCGCCACTTACTTGAAACAGCTAGAGCTTTATACTTTCAAATGAATGTTCCAAAGCATTTTTGGGCCGAGGCAGTTTCCACAGCTTGTTTTCTAATAAATAGAATGTCTTCCTCAGTTCTTCATGGTGAGATCCCTTATCGTGTTATCTATCCAACTAAATCTTTGTTTCCTATTTCACCCAAAATATTTGGTTTCACTTGTTTTACTCGTCTGAAAGCACGCCTTGTTGCTAAAGGCTATGCTCAGACTAACGGGATAGATTATTTTGACACTTTTTTTCCGATTTCTAAATTAACTTCTATCTGATTGTTTATTTCCATGATTGCTACTCAATGTTGGCTTTTACTTCAACTTGACATCAAGAATGTTTTTCTCCTTGGTGATCTTCAAGAAGAAGTATATATGGAGCAACCACCTGGATTTGTTGCTCAAGGGGAGAGCGATGAAGTTTGTCATCTAAGAAAATCATTGTCAGAGTCAACGAGCATGGTTTGGTAAATTTAGTCAAGCACATGAGCAGTTTAGAATGAAGAAAAGTACGTCTGATCATTCTGTTTTTTATCGACGATCTTAGAATGGTATTACTTTGCTTTTCGTGTACGTTGATGATATCTTTTTTTTTTTTGACAAGGTAACAAAACGATTTCATTGATATAATGAAAAAACCCGAAGGAAAATAATACAAGGTTAAAAAAGAAAAGTATAAGAGTAAAGCAGACAACCTACAAATCATTATCACTTTGCTTTTCGTGTACGTTGATGGTATCATTATCACTGGAATTGATACATCAGGTATATTAGAGCAGGAGGTCCTGTGTTCGAACCCTGGTGAAGTCATTTTCTCCCAATTAATATTGATTTATACTTGTTGGGCTTTTGTTCAAATTTCTAAGCCCACAAGTTAGGGGGAATGTTTGATTTCCACTTTTGGGTCTTTGTTTAAATTTGTAAGCCCACAAGTGAGGGGAGTGTTGGTATTGATATATTTAAATTTGTCATAACCTATTAGCTTAAGCTTTTGGGTTAATTGGCGATTTACAATATCTTCTTTGAAAACTTTCCTTCATGGGTCAGTTTCATACCAAGGATCTAGGAACACTAAAGTATTTTTTGGGTATTGAAATGATACAAAGCAAGAAAGGTATCTTTTTGTCGCAACGAAAACATGTACTTGATTTGTTATCTGAGACAGGAAAACTAGGAGTTAAACCATGTAGTACTCCAATGATACCTAATTTGCAACTCACCAAAGGAAAATCATTTAAAGATTTTGAGAGAGATTGGTTGGGAAGTTGAATGATCTGATAGTGACACGATCGTACATCGCTCATTCAATAAGTGTTGTGAGTCAGTTTGTGTTATCACCTACCA
TGGATCATTGGGCTGCAATAGAACAAATGTTTTATTACCTAAAAGCTACATCCAGGCATGGAATCTTGTCTAAAGATCATGGTCATACAAAAGTTGAATGTTTTTCAGATGCTGATTGGGCAGGATCTAAAGAAGATACAAGATTAACGTCTGATATTGTGTATTTGTAGGAGGCAATTTGATATCGTGGAAGAGTAAGAAATAGAACGTGGTTTCAAATTTGAGTGTCGAGTCGGAATATAGAGCTATGGCACAGTCTGTGTATGAAATTGTGTGGATACACCAATTTTGTACTGAGATAGGCTTCAATATTACTTTGCTATCTAAATTGTAGTGTGATAATCAAGTTGCACTTCACATTCCGTCTAATTCAGTGTTTCATGAAAGGACTAAACATATAGAATTGAATTATCATTTCATTCGTGAGAAAATACAAGGGTTTGTGTCTAGAGGATATGTAAAGACTAGAGAACAATTGGGAGATATCCTTACCAAAGCCTTTTTTCCTTTTAAATGCTTTAGTTTAGGGTTTCCTTGTGTACCTATATCTATGTTAATGACTCTCTAATGTAAATTACATTGAGTATCTTCCCAATACTAAGTACACTATCATTGTTAATTATTTTGTTATAATTCAATATATTCTTACGTATAGATTAAATTTTTATTCTTTTTTTAATTTTCAAAAAACGGAAAGATAGTGATCGCTTCTCCAGTTAGCATAGGCAGATATTGAACATTAAATACTTATGCACTTGGTATTGCTTCTCAAGTGTATATTAAGCTACATTCTTTATAATTTTATGTTACATTTTAATGCAGGTCATAATCGATATGGAATGTATAAAGAAAAAGTATGGGGAGCTCTTACAAGAAGCTCAAAATCACTTGATGTGAGTCTCTTAGTAAAAGTAATTTTATGGAAATATTGTCTTTTAACCTAGTTATGTCGTAATAAATATTATATGCCTGATTCGCATCATTTTTATTGATTAAGTTGTAATAACGTTTAATTTGACTTGAATTTAGAAGAGAAATAGATATATTTGTATTTCTTCTTACTTAATCAAGATACAAGGTCTTCTTATATAGGAGAGAGACCATCTACTAAAATCTACTAATAAGGAAAATATCTATCAATAAGGAAAAAGAATATTACAGGTATAAATAATAAAGATACAAATAATACACATTATTATCAAAGATACACAGAAAATCCAACACTCCCCCTCAAGCTGGTTTGTAGATATCTTCCATAGCCAGCTTGTTGATAAGTTTATTGAATTGTCCTTTTGAAAGTCCTTTGGTTAGCATATCAGCCGTTTGTTCTGTCGTTGGAAGGTACGATATGCATATCACACCTACATCAATCTTCTCGTTTATAAAATGCTTGTCAACTTTTATATGCTTTGTTTTGTCATGTAGAACTGGATTACGAGATATGGCAATGGCAGCTTTGTTATCACAATATACTCTTATAGGTGTCTTTTGAGACAATTTCAGCTCGTCGAGTATCCTGTCAATCCATATACCTTCACAAATTCCATGTGCTAGGGCTCTAAATTCCGCTTCAGCACTACTTCTGGCTACTACATTTTGTTTTTTACTACGCCATGTAACTAAATTTCCTCCAACAAATGAACAGTAACCTGATGTAGATCTTCTATCGATAGGACTTCCAGCCCAATCGGCATCAGTATACACTTCAACTTGAAGATGGTCATGTTTTTGAAACAATATACCTTTACCTGGAGTTCCTTTCAAATATCTCAAGATTCTATACACAACTTCTGAAAATGAGTAGGTCCAAAGAGTGCATAAATTGACTTACCATACTTACTGCAAATGCTATATCCGGACGTGTGTGAGATAAATAGATCAGTCTACCCACAAGTCTTTGATATCTGTCTTTGTCCTTTACCTCTTCACTTTTTTGCGACTTGTAACTTTACATTTGGCTCGATAGGAGTTTCTCGCTGTTCGACAACCAAGTAAACTCG
TCTCTTGAAGTAAGTCAATAACATATTTCCTTTGATTAACAAAGATACCTTTCTTGGATCTTGCGAATTCCATCCCCAAAAAAACTTTTTAGTGGCTCCCAAATCATTGATCTAAAACTCCCTTGCAAGTTTTTCCTTAAGATTAGTCAAGCCTTCTTCATCATTTCCTGTGAGAATAATATCATCAACATAAACAATTAAAATAGCAATTTTACTATTTTCAGCGTGTTTGTAGAAAATGGTATGGTCTGCTTGACTTTGAAGGTATCCATAACTAGTGATTGCCTTTCCAAAACGTTCAAACCAAGCTCTTGGAGACTGTTTGAGACCATATAGAGATTTTTTCACTCACAGACTTTATTATTTCCAAATTCTATTTCAAAACCTGGTGATAGACTCATGAATACCTCCTCCTCTAAATCACCATTGAGAAAAGCATTCTTAACATCTAGTTGATGAAGTGACCAATCTAAATTAGCTGCAAGAGACAAAAGAATTCTGATAGAGTTGATTTTTGCAACTGGAGCAAAAGTCTCCTGATAATCAATTCCATAGGTTTGAGTGAAACCTTTTGCTACTAATCTAGCCTTGTATCTCTCAACAGAACCATCAGTATTACACTTTACAGTAAACACCCATTTGCATCCCACTGTCTTCTTATCTTTTGGCAAATCTGTTATTTCCCAAGTACCACTTTGTTTGAGCATTCATCTCTTCCATGATCGCTAATTTCCAATTTGGATCATCTAGGGCTTCTTGTATATTCTTTGGAACAAATAAGTTGTCTATCCTAGAGGTAAAGACTTTATGACTATTTGACAATTTCTCATAAGAGATATAGTTTGCAATGGGATATTTAGTACAAGTTCGAGTACCTTTCCTAATAGCAATAGGAACATCAAGATCAGAAATGATAGGTAAGGTATTTTGAACATTTGACAAATTAGGGATAGGAGCGGGATTACTAGACCTTTCATCATTGACAATAGGAGATGAATTTTCAGCATCATTTTCCAGAACTTCAAATTGGATTGGTGTCGGGTCAACTCTTTGGTCTTGGTTCTTTTGATTGATCGCCCTTCTAGTATAAAACTTTAACTCAGGAATTTGACGTGTAGTCATTTTGTGGTATTTCTCCCCATGAATGAGAAGTTTCTATGTTTGACATCGAAGGGCTAGTGGCACACAAGTCACGATGGATGACACAAGGAAGGGATTCAGATGTTTCCCAAAAATTATCTTCTAACTCTGATTTCTCCCCCTGAAGAGAATTTTGGATAAAATATGGTTGTTTTTCTAAAAAAGAAACATCCATACTCACATAAAACTTTTTTTTTTGAGGATCAAAACATTTATACCCCTTTGGTTTGAAGCATAGCCCAAAAAATGCATTTAATAGCCCTATGATCAAGTTTTGATCGAAAAACATTGGGTATGTGAACGTATGCAATGCAACCAAATATTTTTATGGGTAAATCAAAATGTATACGATTAGTTGGAAAAAAAGTTTTAAGGCAATCAAGGGGGGTTCTAAAAGTCAAGACTTTACTAGGCATCTTATTTATCAAATAGGTAGCCGTAAGTACAGCTTCCCCCCATAAATACTTAGGGACATGCATAGAAAACATTATGGCACGTGCAACTTCAAGTAAATGTCTATTTTTCCGTTCAAAGCAATTCCATTTTGTTGAGGAGTATCTCGACACGTAGATTGGTGTAAAATGCCTTTATCCTTTAAAAATCATTCGTATTCATTGAAATATTCATCTCATTGTCGAGAGTGTAAAATATCGATTTTAGTTTGAAATTGAGTTTCAATCATATTATAAAAGTGTATGAAAAGATCTTTCACCTCAGATTTTTTTGCATTAAATAGACCCAAGATAGACGTGTGTGTGTCATCAATGAAAGTGACAAACCATTTCTTACCACTATGAGTCATAATCTTAGAAGGACCCCAAACATCGCTATGTATTACGTAAAACGGTTTAGAGGCTTTG
TAAGATTTTGGTAAATATGTGGATTGATGGTTTTTTGCGAAGATACAACTTTCACAATGAAAAGAAGAGCAATCAACATTTTTGAATAAATTTGGAAATAAATGTTTCAGATAGAAGAAATTGGGGTGTCCTAACCTACGATGCCAAAGTAAGACTTGCTCATGAATAGAGAGAGAACTAATACTACTAAGACCACGAAGTTACTTTATTACTAAAAGGACCTTCATCAAAGTAGTAGAGACCATCAAGCATCCTTGCACACCCAATTTTCTTTCCTGAGTCTTGATGCTGAAAAATACAATGAGATTCAAAGAAAGTGACATGACAGTTAGAATCTCTACAAAGTTTACTAACAGATAAGAGGTTACATGCTAGTTTAGGAACATGGAGGACAGACTGTAGGGACAGGTTATCATTTAATTTAATGGTTCCTTTTCCTGCAATAGAAGAGTAACTATCATCTGCAATTCTAATTTTTTCATTTTTATAGGAAAGAGAATAAGAATCAAACAATTGAGAAGAACTAGTCATGTGATCAGATGCTCCATAATCAATAATCCATGGAGATGAATTACTACAAGAAAAGGCTTGAGGACAGTTACCTATTTGTGCCAAAGAAACACAAAGATTACCGGATGATAAATTGGTGTGCAGCAGTTTCAAAGATTGATCAACTTGCTCTTTTCCAAATGGCCTTGAGGTAATGGTTGTATTATCGAACAACCCATACTCACTTGTGCTATATTTCGGAGTCATTTCAACGATCAATTTAGACATTAGTAAAAAACAACTCAATTAAGAAGCTGAAAATAAACCAATACCAAACCAATTTTCACTGTTGTACTGCCCAAATAAATGCCAAAATAAACCGGACCAAACTGAGGAAGACGGCGGAGTCGTGAACTTTAGCGTGAACGGTAGAAGACCCACGCTGTAGAACCCACGAATCGGCAGATCCTACCAGCAACCACGAATCGGCAGAATCTGAGGAGTGTGGAGGCTTCTTGTAGCTTCTTGCGTTTTTTTCGTCTTAGGGTTTTAGTGAGAACGGAGGCGCCAAGTCCGAGCAAAGCAACAGCACCAAGAGGCAATGGCTGAGTAATAATTCCAACAATCGTAGCAAGGAAAATCGCGAGCAATTGCCAAAGCGTTTGAGAAACACCGCACGGCACCGGAACAAAGCACGAATGATACCAGTAGCAATGGAAGCAACAAGTGGCTTAATCGCCGCACCTTGCCACGGCTGAGAAGCCGGATCAACTCAGACGCATTTAAGACGAACAGAGAGAGACGAAGACGCGCGCGCGATCGGTTGATTTAAGACGAAGACCCACGGCCTGCTGCGCAAAAACCCACGGCCTACTTCTTTTACAGCTGACCGTAACCTTGATACTCCGACCGGAAGTAAGGTGCCTCGTTTGAAGATGGTGGCTAAGGTTCCGACGGTTTAGCGGCTAGGGTTACTATAGCGGCGGCGGCTAGGGTTTTTTTTTTTCTTTTTTTCTTTTTTTTTTCTGCTCTGATACCAACTTGAATTTAGAAGAGAAATAGATATATTTGTATTTCTTCTTACTTAATCAAGATACAAGGTCTTCTTATATAGGAGAGAGACCATCTACTAATATCTACTAATAAGGAAAATATCTATCAATAAGGAAAAAGAATATTACAGGTATAAATAATAAAGATACAAATAATACACATTATTATCAAAGATACACAGAAAATCCAACAATTTTTGTAGTTTCAGAGATTTTGATCTTAACTTCCATTAGCTTTTTCTTTTCTTTTCTTTTATTTTATTTTTAATAACCGGGTGTCTCCCCTATACTACCCGCATCCATCCCAAAGCCTGGACTGGGAGACATCAAGGGTTTTTATCTATTAAGCTCATCCAATGGCTCAAACTTGAGACCTCTAAGCCAATATGACTGAGAGACCCCAAGCTCTTACCAATGGGGTTGCCCTTAGGGCCTAAACTTTCATTAGCTATCATTAAC
CGTATAAAGCAATCTCAAAAAAAAAAAGTGTAATACTGACCCTTTTTAATGTGAAGGAGAAGATCTTTGGAGGGATTGAATTTTTTTTTAATGTGATTTCCGTATCCAGATAGATCAAAGCTCTCTCTAAAAAACTAGTTTAGATTAGATACAAGTTTGTGTCTCGCCTATATAGACAACAGTTCTGTTTGTAATAAATATTGATCTTCCCTCAAAAGACTAGGTTTGTTTTATCTTTTCCTCATTTTCAATAGCCCTTCTCGAACTCTTGCATTCGTGATCTTTTAATCCATCCACTTTTGTTATCATAGGGACTCAAAGAATCTTTATGAAAAGATGTTGCTTGTTAGGAGAAACATTATTGAACATGGCAGAGAACAGACGCTTAGAAAAGCGCAAGAGGTATCCAAGGCTGCAGCAACAGCTCGTTCTAACATTCACAAAAGAGTACGAGAACTGCATGTGTCTGCTATTGAATCCTCCTCACCCTCTGCCATTGGTAGTAGTCAGCGTGCAGGAGTGAACTCTAATAAGCTAGCTAGAGCAGGCGAAAAGAATCCGATGGCATTGGATACTCGCATCTCTTCAACCGGTAACATCAGTCAACCACGATCAGGTTTGCCAATGAGAAAATCTCCGGTTAATTTTATTTGTCTGGAAAACCATTGAGATTGGAGGTTTTTCTTATTTAAATTTAAATTTTATGTTTTTTAAGTTTCTTACTTTATTTTAGATTTAATTGTTCAGTTTTATGTCTTAAGTCTCCACTTTAAACATTTACCATTAATGTTGTCTTTAAGTTTTAAGTTGTTAGATCTCAAATAGGTTTTGTTGAGGAAAAAATGAAGGATGAAAAAATATTGCAGAGGCAATGATCAAATCAAAGTCACAATATTTGTTGAACAAAAAGAAAAAATTGAAAATGAGAGAATGGAACTAATGTGGGCTTAGATTTAGTTTACAACCCAGTTCTTTAATATCTTATGGTTGTTTAGGTCACTTTTGTTTTACGTTAGTCTTTTTACCTTACCAACAGATTAGTTGGTTTTGAGTTAGTTAAGGGTTAGCTTATATAAATAGGATTGGCTCATTCTTTCCTTTGATCATTAATAAAATCATTTGAGGTTTCTTGGAAGTTGATACATCAAATTGGTATCAGAGCATTCGGTCCAAGGCTCGTTAAATTTCTTTTGGACCTATTGAATATCTTCATCATGGCAGCGGAAGCTTCCAGTTCTTTACAGAAAGACACTGAGAGGAGAATCTATGTAGTTGAGACAGCAGTTGGAGAAATCTAGAAAGAAGTCATTGAATTAAGACAAATGATGGGACAATTCTTACATCAATTCAACCTTTTGTCTATAAACAATCAGACACAACAAATCATTGGAAATCTTGGAACTAACCAGAATGTTCAAGAAAATCAAGAAAAACCTCAAATTCCAGCAAGAAGACCTGAAAATCAGATGAATTGAAGGATTCCAGAAATTCAAATGTCTGGTTTTATGACCCAAGATTCAGAATCATCAGGAGAAGATGAAGTTTATAACCAAGGGAATCAAAAACGAGAGATAAGGAATGCAAGAAGACAGCAATCGGATCAATATGAAGCTCAAGATTTCATAATGAAAATGGATCTACCTAGTTTCAACGGTAGAATGGATGTTGAAACCTTTTTGGATTGGGTTAAAAAGGTGGAAAGCTTCTTTGAATACACAAATACCCCTTTGGACAAGAAAGTCAAATTAGTGGCTTTTAAACTTCAAAGTGGAGCTTCGGCTTAGTGGGATCAACTTCAAAATAACCAAAGATGCTATGGCAAACAAGGCATCAAAAAATGGCCCAAGATGCTTCGTTTGATGAAGAAACGTTTCCTACCTTTAAATTACCAACAAATTCTTTATAACCAATACCAACATTGTAAACAAGGAGGGCGTTCGATACTTGATTACACCGAAGAATTTCATCGTTTAGGAGCAAGAAATAACCTCCCAGAGACT
GAACAACAACAATTCTGAAGATTTGTAAATGGCCTTCGAGATGACATTAAAAGGTGGTAAATTTACATCCTCTCGCCTACCTTGCAGATGCAATTACTTTGGCCACTAAAATTGAGGAAACTATGAAGTTAAACCAATCAGAACATACCAAAGGAGGAATACATGGGATAAACAACAACAAATTCATAGAAAATCACTTCCTGATAGTTCAAAAGGTTTTCTTCTAGGGAACTCTTCATCCTCCAATCAGATGGTCACAAAAGATGACTCTCAGACAAGAAACACGCTGGTCAATAAAGGTTTGAATAACTCTATTTCTAAAAGAAATACTAATTCATATAACCATCCTTCCTTGGGAAAATGTTTTAGGATGTAGCAGCCTTTTCCAAGAAAACAATTTACTGTTATTTTTATTATATCCAAAACATAGGATGTAGCAGCCTTTTATACAAGTTATGAAATAATAAAGGTAAAACTAACTAAAACAGTAACTCTAACAGTAACTAGTTACTGCATAACAGAAATAAGGTTCCAGTACAACAACCCCCACTCTAATAACAGTATGCCTACATCAAATCCACCTCCCCAGAATTAAACTCGTCCTTGAGGTTCAGTTGAGAGTGAAAATTCATCTGGGGCAAAATACTTGGAGATATCGGCTACGTTAAAGGTGTTACTGATCCGCAAACTAGGTGGCAATTCCACAGTGTAGGCATTATCACCAATTTGGGACAGAATAGGAAATGGTCCCATTTTTTTATTAGTCAGCTTCGAATGACACCCAGAAGGAGGGCGTGATTTCCTTAAATGGACCATAACAAGGTCCCCAACCTTAAAAGTGGTCGTCCGACGGTGGATATCTGCTTTGGTTTTATAAGATTCATTTGCTCGTTGAAGGTGCTGGCGCACTTCTTCATGTAGCTCCTTAATCCGGTCAGCCATTACTTCTGCTTCGCTACTAAGATCAACAAAAGTGGGTAGATTAGCAAGGTCTAAAGTTAGACGGGGAAGGCGAGTATATACTACTTCAAATGGTGACTTCCCTGTAGAACGATTGACCATATGATTGAAAGCAAATTCTACTTGAGCAAGTGCAAGGTCCCACTGTTTAGGTTTGTCCCCACTTAAACAACGTAAAAGGTTTCCAAGAGTGCGATTTGTGACTTTGGTTTGGCCATCCGTTTGCGGATGACTAGTTGTGCTAAAAAGAAGAGTTGTTCCAAATTTTTTCCAAAGAGATCGCCAAAAATGACTCATGAACTTTACATCTCTATCAGAAACAATAGTCTTGGGAATACCATGTAACCTAACAACTTCTCGAAAAAACAAGTTAGCAACATGTAAAGCATCATTAGTTTTCTTACAAGCAATAAAGTGGGCCATCTTGCTAAACCGATCAACAACTACTAGGACCGAATCAAAACCTCTTTGTGTTCTAGGTAATCCGAGTACAAAATCCATGGAAAGGTCCTCTCATATAGAATGTGGAATGGGTAAGGGTGTGTACAAACCTTGATTAGTAGTTCCCCTTTAGATGTTTGGCATATGAAGCATCGTTTTACAAAATTGGAAACATCCTTTTTTAAGTTGTGGCCAAAAAAATTGAGAACAAACTGTGATTTGAGTTTTATCCCTACCAAAGTGTCCTGCAAGTCCCCCAGCATGTGCTTCTTTTATAACTGATTCCCTAAGAGAAGTCTGTGGGAGGCATAAGACGTCATTTTTAAAAAGAATCCATTAACAATATGAAAATCTTTACACGGTAAATGATTGGTGCATTTAGACCAAATTTCTTTAAAATCTGGATCATCCTCATATAAATCAGATATATGTTCAAATGCAATAATTTCTCCATGCAGTAAGGTTAATAGAGAATTTTTTCTGCTTAGTGCATCAGCCACTTTATTTTTATTTCCTGCCGTATGTTTAATGACAAAATCGAATCGTTGAAGAAATTGAATCCATCTTGCATGCATGCGACTTATTGTTTTTTGAGAATG
AAGGAATTTCAAGGAAAAGTGGTCGGTAAGTAACACAAATTCTCTACCCAATAGATAATGTTCCCATTGTTTCAAGGCTCTCACAAGGGAATAAAATTCTTGTTCATAAGTGCTCCATTTTTGGCGGGCTTCACTCAATTTTTCACTAAAGTATTCAATGGGATGGTTCTCTTGGCTTAGCACACTGCCAATCCCTACTCCTGAGGCGTCTACAGAGACTTCAAACGGTTTAGAAAAATTAGGAAAGCTAAGGATAGGAGTAGAACTAAGCATTGATTGAAGGGTAGAAAAGCTTTCTTGTTGACTAGATCCCCAATTGTACTGACCTCTTTTTAAGCATTGAGTTAATGGTGCAGCAATTGTACTGAAATTTTTAATGAACTTCCGGTAAAATGATGCTAGGCCTAAAAAACTCTGAATTTCCCAAACTGTGGTAGGGGTTGGCCAATTAGATATAGCTTCAACTTTTTTAGGGTCAACTTCTATTCCTTGGTTGCTGATTATGAAACCTAAGAAATGAATTTTAGAGGAGAAAAAACTACATTTCTTTAAATTTATGTGAAGATTGTTATCCTTTAATGAGGTGAAAACAGTAGTTAAGTGGGATAAATGTTCTTCATGAGATTTACTATAAACTAATATGTCATCAAAGTATACAACCACAAATTTATTAAGGAATGGTAGAAGTACTTGATTCATGAGTCTCATAAAGGTGCTAGGCGCATTTGATAGTCCAAAAGGCATTACTAACCATTCAAAAAGTCCTTCATTGGTCTTAAAGGCAGTTTTCCATTCATCTCCCGGACGAATTCTAATTTGGTGGTAACCGCTTCGAGGATCTATTTTAGAAAATATCATAGCACCTCCTAATTGATCTAGGATGTCATTTAAACGAGGGATAGGGAATCTATATTTTATTGTGATCTTATTTATGGCTCTACTGTCAATGCAAAGTCGCCAAGATCCATCTTTTTTAGGAGCTAAGAGGGCTGGAACAGCACATGGACTTAAACTAGGTTGGATGTGTCCTTTAGTTAGTAATTCTTGTATTTGATCATGGAGTAATTGGTATTCTTTGGGGCTCATTCGATAATGCGGAAAATTTGGTAAGGAGCTTCCTGGAATTAGGTCTATATAGTGTTGAATATTTCTAAGAGGTGGCAAAGAATTTGGGATTTCAGTCAAGGTGTGGAAATTCTTAAGTAATTGGTTTATTTGATCATCTAAAGGCATTCTTAATATCAAGCTGATGCAAGGGCCAACCATGAGTAGCAGCCATGGAAATAAATAATCGAATAGAAGTCAATTTAGCAACTGGAGAGAATGTGTCAGAATAATCTATCCCATAAGTCTGAGCATAACCTTTAGCAACTAAGCGGGCTTTCAAGCGAGCAACAGATCCGTCGGGATTGACTTTAATGGCGAAAACCCATTTACATCCAATAGCCTTCTTTCCTGCAGGACGAGATACCAAATCCCAAGTACCATTATCATCTAAAGCATGCATCTCTTCAATCATTGCATTACGCCAGCCAGGATGAGATAAAGCTTCATGAACAGAGTTAGGAAGACAGGTAGAATCAAGGGATGTAACAAAAGAATATGTAGGCTGTGACAATTGGTGATAAGAGACACAAGAAGCAATAGGGTAAGTGCAAGTGCGTTTACCTTTACGAAGGGCAATGGGAAGCTCATCACAAGTTCCTGGATCAGATGTCGAAGAATCTTCTGGCATAGGTGTAAACTTGGGTAATAGGTGGACGCAGTGGGATAGGTGCTGAAGGAGGCGAAGTAGAGAGAGATGAAGGAGAGACGATCTCATAGATAAGTGTGTCGTCCTCCTCCTGACATGAACTCGACGATGATGGGGTAAAGGGTAAATCTTCAAAGAACGTAACATCAGGAGAGACAAGATACTTGTTAAGACTAGGACAATAACAACGATACCCTTTTTGAACCCAAGAATAACCCAAAAATATGCATTTTAAGGACTTAGGATC
TAACTTCGTACGATGAGGACTGATATCTCGAACAAAACAAGTGCAACCAAATATTTTGGGTGCAATTGGGAACAAAGATTTTGTTGGACAAAGAACACGATAAGGGATCTCACCATGAAGAACTGAAGAAGGCATTCTATTTATCAAAAAGCAAGTCAGGAAATCGCATCTGCCCAAAAATGTTTTGGTACATTCATTTGAAAGGACAAAGCTCTAGTTGTTTCAAGTAAGTGTCGATTTTTTTTCGTTCAAAGAACCCCATTTTGGGATGGAGTATCAGCGCAAGAGGATTGATGAAGAATGCCATGTGTACATAAGTAGGAACCCAACATGTTCGAAAAATATTCACCCGCATTATCGGTTCGCAAAATTTTAATAGAGGCATTAAACTGATTCTGAATCTCAGCATGAAAATTGCAAAAATGAGTAAGTAACTCAGATCGACTTTTCATTAAATATAACCAAGTCGACGTGAAAAATCGTCGACAAAAGTAACAAAATATCGAAATCCAGTTTTGGACATTACAGGGCACGGACCCCAGACATCAGAATGGACTAATTCAAATGGAGCACTAGCTCGTTTATGGATCCTAGGACTCGAACTAAGACGATGAAATTTTGCAAACTGACACGACTCACAATCCAATGAAAACAAAGAAAGAAACTCTGGATACATTTTCTTCAACACGGGTAAGGATGGATGACCTAAACGACAATGGACCTCAAATGGAGATGCAACACCAGAGCATGCCACAACTTTTGGGATTTGTTGATCAAAAATGTAAAGGCCTTCAGACTCATATCCTTTACCAATAATCTTCTTCGTCATACGATCCTGAAACAAGCAATAACCAGGAAAGAATGAGATAAGGCAATTAAGGTCATGAGTGAGTTTACTAACAGAAATCAAATTAAATGAAAGTCGAGGCAAGTGTAAGACAGTAGATAAAGAGAGTGAAGGCGTGAGGTTAATGGTTCCGCATCCAAGAACAGAGGGCGTCGAACCATCTGCTAAAGTAATGTGATGGAAAAGGTGCAGAGGACAGAGAGGTACAAAATAGGCTAGAATTACCTGATCATATGAGCTGTGGCACCAGAGTCGATGACCCATTTGGTGGAAGATGTAAGGAGACAATGATTTTTACCTGAGTCAGCAATAGTTGCAATAGGATTCGATGAAGAAGATGCTTGTAGTGAGTCCTGGTACAACTGGAATCTAGCAAAGTCATCTGCAGAAATAGTTACCGTTCTCTCAGGTGTATCATTTGTAGACGCAACTGTCCCAGCCACCAAAGTCTCTTTGCCCCTATTCAACAACTTCCAACATCCACGCTTTCAAGTGACCCGCTTACGACAATAGTTACATATAATCTCTTGTGGCTCTTGTCGCCGATCTTTGATCAAAACCAGTCTCTGATGGTTCCAGCAAGAATGACCCATGTCTCTCGATAATCGTAACTGTGTCTTTGATGACCTGGCTTACCATTCATTCCTTTAGTGACCTTAGGATCATTGCGACTAACTAAAGCACTGTTGGACTGAGAAACAGAAGAGCCAGACGAAGAATTTTCTGTACGGAGAATTCTACTAAAAACTTCATCCAAAGAGGGATCTCGAACTTGAAAGAATTTGTGATTTGGCCATATCAAATTCGGAAGAAAGCCCATTAAGAAAGCTCATGACACCCATCTTTTCTCTTTGAGCTTGTTGAACCTTTATGTCCTGCATACTAAATGGGATTTAAGTGTTCAATTCAAAGACATGTCTTCTTAACTCGCATAAAATAATTCGTGATGAGATCGGTCTCCCTGTTTTCACTTTGATAAAGAGAGTCAGTGCGCGACTTCATACATTCGATTAACTTGCTCTTTCCTTGAGTATAAGAATTCTAAAAATTCGAGCAGTTCTTTTCAGAGTCACAGTGATTTACCAATCCAACTACCTCACCCTCAATTGAATTCTTGATTTGAATATACAAACGAGCGTCATCACGCAACCAA
GTCCTCCTTGTATCATCCTCTGGTGGATCCTGATCGATGTGATCATCCATCTCAGTGCTTCGTAAATAGAGCCTAATTGTTCGACTCCAATCATAATAGTTTGAACCATTCAACTTATGCTCCGTGATCTTTGACGAGAGGGAAACCAAATTGGGTACAACTATAGGCTTCATCTCAAATGCATTGTTAAGACAAAACAAAGAGGTTAATCTTCAAGGAAACAAAAATCAAATCAATAGCGATGCAAAAAAATGCCCTAATCTCACCCAAATGAAGAAACCGAGACAAACCCGAGGTTTGGTGTCAAATCCAGAGGTGACAGACAGTAGGAAATCATGAATCGAGACAAACCCGGTGGCGGCTCGGCCGATCGGAGCTCCCACGCGCTCGCGCGCTCCCACGCGCCGGCGGCGTGGCGAGAGCGAGGCAGAATGGTTGGGCGGCGCGTGAGCCTCACTCGCGGCTGCTTCGGCGAACGGCGACGGCGGTTCTCCTTTCTCAGCGGCGCGCTCTCCTCCTCAAGTAGGTGGTGTCGACAAACGGCCACTCTAAGATGATGACCCAAACTTCAAACCCTAACCTTACTTGGGAGGAAGAAATCCTAATTTTCAAATGCAAACCCTAAACCCTAGGGTGCTCTGGTACCATGTAAAACACAGAATTGAGAAAGAATAATATTCGTTCTATTCACATTTGAGCTCTATGACAAATATATATAGGTGTACAAAGAAACCCTAATACTAAAGGATGTACAATTACGATAAAGGACAACAATATATATATTATAACACTCATCATACCAAGTTAGATCCTAAGTCCTTACAATGCATCTTTCTGGGTTATTCTTGTGTTCAAAAGGGGTATCGTCGTTATTGTCCTAATCTTAACAAGTATCGTGTTTCTCCTGATGTTACGTTTTTTTAGGATTTACCCTTTAGCCCATCATTGTCGAGTTCATGTCAGGGAGAGGATGACACACTTCTCTATGAGATTGTCTCTCCTTCATCTCTCTCTACTCCATCTCCTTCAGCATCTATCCCTCAACGTCCGCCTATTACTCATGTTTACACTAGACGGCTACCACAGCAACCTTCAGACACATGTCCTCTACCAGAAGATTCTTCGACTTCTGATCTAGGAACATGTGATGAGCTCCTCATTGCCTTCGTAAAGGTAAACGCACCTGCACTTACCCTATTGCATCTTTTGTTTCTTATCAGCAGTTTCACAACCTACATATTCTTTCATTACATCCCTTGATTCTACCTCTATTTCTAACTCTGTTCATGAAGTTTTATCTCATTTTGGCTGGCGGAATGCAATGATCGAAGAGACGAATGCTCTAGATGATAATTGTACTTGGGATTTGGTATCTCGTCCTGCAGGAAAGAAGGCTATTGGATGTAAATGGGTATTTTCCATTGAAGTTAATCCCGACTAATCTGTTGCTCGCTTGAAAGCTATGGCTTAGACTTATGGGATAGATTATTCTGACACATTTTCTCCAGTTGCTAAATTAACTTCTATTCGATTGTTTATTTTCGTGGTTGCTACTCACCATTGGCCCTTGCATCAACTTGATATTAAGAATGTCTTTCTCCATGGTGATCTTTAGGAAGAAGTATATATGGAGCAACCACCTGGTTTTGTTGCTCAGGGGGAGAGTGATAAAGTTTGTCGTCTTAGAAAATCATTGCTTCTAACCCAGTGTTTCACAAGAGAACAAAACATATTGAAATGGATTGTCATTTTATTCGCGAGAAAATACGAGGATTGGTGTCCACAGGATATGTGAAGACTGGAGAGCAATTGGGAGATATTCTTACTAAAGCCTTAAATGGAGCAAGAATAAATTATCTATGTAACAAGTTAGGCATGTTTGACATATTTGCTCAAGCTTGAGGGGGAGTGTTATAATATATATTATTATGTATAGTTGTTCTTTATCGTAATTGTATATTCTTTAGTATGAGGATTTCTTTGTATACCTATATATATGT
CATAGCGCTCTAATGTGAATAGAATGAATAATTCTTTCTCAATTCTGTGTTACAAATAGTAGGGTCATAAATAAGATCACAATTAAATATAGGTTCCCAATACCACGTTTGAATGATCTTTTAGACAAATTAGGGGGTGCCTTAATTTTCTCAAAGATAGACCTTAGGAGTGGATATCATGAAATAAGAACTAATCCGGGAGATGAGTGGAAAACACTTTTAAAACTAATGAAGGGCTTTTTGAATGGTTAGTAATGCCTTTTGGCCTATCTAATGCTCCTAGCACGTTTATGAGGCTAATGAATGACGTCTTCCTCCCTTATCTAAATAAATTTGTAGTCGTTTACTTTGATAGCAAATCTCATGTGGAACACCTAAATCACTTGACCACTGTTTTTACTACACTTAGAAATAATGAACTTTATATAAACTTGAAAAAATGCAGCTTTTTCTCTACTGAAATTTATTTTCTAGGGTTTTTAATTAGTAGCAAAGGAATAGTTGTTGATCCTAGAAAAGTTCAAGCCATAAAAAAATTGCCTACACCTAAGACAGTACGTGAAATTCAAAGCTTTCTAGGACTTGCTTCTTTTTATTTTTATAGAAAGTTTATCAAACATTTCAACACAATTGCAGCCCCACTAACACAATTGCAGCCCCACTTACACTGCTTAAAAAAGGGACAGTTTGATTGGGGAAATGACCAAAACTTAAGTTTTTCTACACTTAAACTTTAATCACTTCCACACCTATTTTAGGGTTTCCTGATTTTTCAAAACCTTTTCAGGTATCTGTGGATGCTTCTGGAGATGGAATTGGTGCTGTTTTAAGCCAAGTAAACCACCCAATTGAATTTTTTAATGAAAAGTTAAGTGAAGCTCGAAAAAAATGGAGTACATATAAGCAAGAATTTTACTTTTTAGTGTGTGCGTTAAAACAGTGGGAACATTATTTACTTGGTAAAGAATTCATTTTACTAACTGACCACTTTTCTTTAAAATTTCTACAATCACAAAAGACAATTAGTCGTATGCATGCGCAATGGATTCAATTTCTCCAATGGTTTGACTTTGTTATAAAACACACATCAAGTAAGACTAATATGGTGGTTGATGCCTTAAGCAGAAAAGTTACTCTTCTAACCTTATTACATGGAGAAATTATTGTTTTTGCACATTTACCTGAACTTTATAGCACTAATCCTGACTTTAAGGATATTGGGTATAAATGGCACTGATCATTTACCTTGTAAAGACTTCCATATTGTGGATAATTTCCTATTTAAAAAGGATGCAGTATGTCTTCCTCAAACCTCTTTGCAAGAATCCCTTTTAAAAGAAGTCCATTCGGGTGGTTTGGTTGGACACTTTGGCCGTGAGAAAACTCTTGTTGCACTTTCCTCTAGGTTTTATTGGCCTCAAATAAAAAGAAATGTTACCAATTTTGTTAAGCATTGCTTTATTTGTCAAACTTCCAAAGGTTCAAGCCAAAATTTAGGACTTTATACTCTTTTACCAATTCCTATTCTATTTGGGAGGACCTTTCCATGGACTTTGTCCTTGGTTTGCATAGGACTCAACGCGGCTTTGATTCAGTTATAGTGGTTGTTGATCGTTTTAGTAAGATGGCTCATTCGATTGCTTGTAAAAAATCTAATGATACATTGAATATAGCTAACTTGTTTGTTTCGTGAGATTGTTAGGCTACATGGTATTCCTAAAAGCATTGTTTCGGATAGATTTTTTTTTTTTTTTTGACAAAGTGACAAAGAATTTCATTAAGGAAATGAAAGAGACTAATGCTCCAGTTACAGTTTCAAACGAGGTACAAAAAACCTCATAAATGACAGAAAAGCCAAAGCACCAACTGAAAAGTAACACTCCAGCGTATTTTCCTACTGAATTTCCAAAAGAACCACCCTCTACCCCCCCGAACTAAGGCTACAGATAGCCATCCAGGTCATGGAACAGCAACCAACAGCAAAACAAAACGCCACACTA
TTCAAAACTAGGCCCTAGTTCTAGCAAACAAAACTACATAACAAACAAAAAAGGCTGCCAAGAGAAAGTCAAGACCGCCGTAGAATAACCCTCGAACTCCTTGCTCTGAGAGCTCCACAGTGAAGCTTTGACAACAGCGGCAACAAACCTTCGAGCCCATTCCCAGCCTCGCCCCTCGAACACCCTTTGATTCCTTTCAAACCAAAGTTCTGAAAGGAGAGCTTTGGTCCCCGCCAACCACAACACCTCAGCACTGGAACGAAGAAAAGACCCTGCCAACAAGTGTCGTACGTTTGCTACCACCGAATTGCCAAACACCCAGCTAACATTGAAAAGCCCCAAGAATTTCTCCCAACATCTTCTTGCATATGGGCACTGAAAAAAGATGTGCAAAGAGGTCTCCTCAAACAGATGGACAAACAGAAAAAAGAATTTCTCCCAACATCTTCTTGTTTCGGATAGAGATGTTAAGTTTATGAGTTATTTTAGGCATTCCCTTTGGAAAAAATTTGGCACAGATCTTTTCAGTACTACTAGTCATCCACAAACAGATGGACAAACCGAAGTCACTAATCGTACCCTTAGTAATCTTTTACGGTTTCTAAGTGGAGATAAACCTAAGCAATGGGACCTTTCTCTTGCTCAAGTTGAATTTTACAGTCAACCATATGGTCAATCGGTCAATAGGGAAGTCTCCATTTGAGATAGTATATACTCGTCTCCCTCGTTTAATATTTGACCTTGCTAATTTACCTACTTCTGTTGATCTTAGTATCGAAGTTGGGTTGATGACTGATCAGACTAAGGAATTGCACCAAGAAGTTCGCAGTCATTTTTAGAAAGCAAATGATTCCTATAAAGCTTAGGCTGATGTTCATCGTAGAAATGAAAATTTTCAAGTTGGCGACCTTGTCATGGTGCACTTAAAAAAATCCCACCTTCCTTCTGGCCTTCATTCCCAGCTAACCAACAAGAAGATTGGACTTTTTCCTATTCTGTCCAAGATTGGATCCAATGCCTATGCTATTGATCTCCCACTCACAATGCAAATTAGTAACACCTTCAATGTATCAAATATATATCCTTATTTTGCGCCTGATGAGTTTCTTGTATCTACTTGAACTCGGGGACAAGTTTTGTTTTTGGTGGGGTGGATTTGATGTAGGCTTAGTTTTTGTTTACAACCCAGTTTTTTTTTGACAAAGCAACAAATGAAATTTCATTGATAAAATAAAATAAGACTAATGCTCAAAATACAACAAAAAGAAAAGAACATACAATGTCCAAAACAACCTGAGACAAAACTTAAACAAGAGAGAAAACATCACCTCTTCCAAGAACAAACCAAAAGAAAGAGAAGAATAACCATAAAGATATGAGAACCAACTAAAAAACAAAGACCAAAAGACTAAAGAGGTTGAAGAAAAGAAGCTAATAACGCCAGCAAGGGAGAACCCTCCAGATCGAAGCCCTCAAAACGACCGCCCCTAAACGCACCACCCTAACCTCCAACAACCATTAGCTAATCTTGGAACCGATCATAAACCCGAACAATGCTACCAACACCCAAAATTTCCATAACTAAGACAAACAGTAGGGGGAAACCTTCAAAAAGCCCCCAACCCTGCAACTACTTCAAATACCAACACCACAAAATAGGCAACAAAAATTCGTAGAGATCTCACCCAACTGAGTTAAACTCTCAGCTGAATTCTTTAATATCTTATGGTTGTTTAGGTCACTTTTGTTATTTTGGTTGTTTACTGTAATTTTTATACGTTAGTCTTTTTACCTTACCAACAGATTAGTTGGTTTTGAGTTAGTTAACGGTTAGCTTGTATAAATAGACTTGGCTCATTCTTTCCTTTTGATCATTAATAAAATCATTTGAGGTTTCTTGGAGAGTTGATACATCAGGAACTCAATAGATGAATAGACTTCTATTGAAGTAGAAAATATTGGTGATACAATCTTCATATAGAAGTTCATCCTTAAAACTA
AACCTAATTAATATCACTAATTAACTTACAATATTAAAAAATATAAACTCGCAAAACTTAAAAACTTAACAGACAAAAAAATTGAAATATGTTAGATCTAACTTTAAAAAAAATTAAAACATTAAATTGAAAAACGTGATGTGATTAAAAAGGAAAACTTCATCACTCAACACTCTTTGAGGTTTTTCTTTGTGATACCCAGCTTTTGGCGTAGAAACGCAACTTTCCTATGTAGTAGAGCATTTGTGAGAATGTCGGCAAGTTGTTCTTCAGAACTACAAAAAATGAGTTTGATTTCACCTTCTTTCTATGCTTCTCTCAATAAATGAAACTTAACTTTGACGTGTTTCATTCTACCATGTTGTTCTGGATTGGAGGCAATAGCAATAACGGACTTGTTATCAGAAAACAGTTTTGTAGGTTCTTCAAGTTCAAATCCCAAATCACTTAGCAATTTATAAAGCAATATTGCTTGATTTACAGCTGATGCAGCTGAAATATATTCAACTTCAGCCGTAGATTGGGCAACCACTTCATGTTTTCTTGAATTCCAAGAGAAAATCCTAGTGTCGAGAGAAAAAACATAACTAGAAGTGCTTTTAGAATTATCGACACAACCGACCCAATGACTAATTGAGCAAAATAAGAGGAATAACTCTCTGTATATTATTTGATTTGATTATTGCTTCTGTAATATTTTTTATTTTTCATTTCATTTTTTCTAACAATGTTATCATAACCTTAGATTTAAAGGCCTATGATTTTATTTGGAGTGAGAAATATTTAACTTTATATTTGAAAAAATGTTTTCTAATTTTTTATGTAACCTCTCCTCTACGTTTTAATGGTGAAAACTATCAATTGTGGGTTGTTAAGATGCAGTCCTATTTCAAGACACTTGTCAAGAATGTTTCCACGGATGCTGGTCCTAAACCATTAGCAGAAAATCTCAACAAAGACTCAAAACTTTCTTCCTAATAAACGCTCCTTTTTTATAATTATGTTGTCAATTTTTATACGATGAAAGTAATCATTGATCTTCCATTACGTGAGGCTTTTTGCTACCTTATATTTGTGAGTTTTGTCCTTAACACTACAGAGAAGCCTGAGTTTCCCACTGTTGGCGATACTGTGTACATTTCTTCGTTTGGAAAAGAAGCGACTGTTTTAAGTATAGAGCCATTAAAAGGCGAAATAGTTGTTCAAGTTGGTAGCATAAAGTTGAAGCTGAAGTTCACCGACATAATGAGATAA

mRNA sequence

ATGACAGGAGATACAGTTGGTGCAGGAACCAATCCTCTGGAGGGAGCCGCACTTGGGATGTCACTCCTGGAATCTTTGGCTAAATCTGGTGCTTCATTGACAATCGCGACTACACATCATGGAGAACTTAAAACCCTAAAATATAGCAAGGAGGTCTTTGAAAATGCGTGTATGGAATTTGATGAGGTGAACTTAAAGCCAACTTACAAGCTTCTCTGGGGAGTACCAGGGCGTTCAAATGCTATTAATATAGCTGAAAGGTTGGGGTTGCCTAGTTCTATTGTAGATGATGCTCGTGAACATTATGGTGCGGCAAGTGCACAGATAGATGAGGTCATAATCGATATGGAATGTATAAAGAAAAAGTATGGGGAGCTCTTACAAGAAGCTCAAAATCACTTGATGGACTCAAAGAATCTTTATGAAAAGATGTTGCTTGTTAGGAGAAACATTATTGAACATGGCAGAGAACAGACGCTTAGAAAAGCGCAAGAGGTATCCAAGGCTGCAGCAACAGCTCGTTCTAACATTCACAAAAGAGTACGAGAACTGCATGTGTCTGCTATTGAATCCTCCTCACCCTCTGCCATTGGTAGTAGTCAGCGTGCAGGAGTGAACTCTAATAAGCTAGCTAGAGCAGGCGAAAAGAATCCGATGGCATTGGATACTCGCATCTCTTCAACCGGTAACATCAGTCAACCACGATCAGAGAAGCCTGAGTTTCCCACTGTTGGCGATACTGTGTACATTTCTTCGTTTGGAAAAGAAGCGACTGTTTTAAGTATAGAGCCATTAAAAGGCGAAATAGTTGTTCAAGTTGGTAGCATAAAGTTGAAGCTGAAGTTCACCGACATAATGAGATAA

Coding sequence (CDS)

ATGACAGGAGATACAGTTGGTGCAGGAACCAATCCTCTGGAGGGAGCCGCACTTGGGATGTCACTCCTGGAATCTTTGGCTAAATCTGGTGCTTCATTGACAATCGCGACTACACATCATGGAGAACTTAAAACCCTAAAATATAGCAAGGAGGTCTTTGAAAATGCGTGTATGGAATTTGATGAGGTGAACTTAAAGCCAACTTACAAGCTTCTCTGGGGAGTACCAGGGCGTTCAAATGCTATTAATATAGCTGAAAGGTTGGGGTTGCCTAGTTCTATTGTAGATGATGCTCGTGAACATTATGGTGCGGCAAGTGCACAGATAGATGAGGTCATAATCGATATGGAATGTATAAAGAAAAAGTATGGGGAGCTCTTACAAGAAGCTCAAAATCACTTGATGGACTCAAAGAATCTTTATGAAAAGATGTTGCTTGTTAGGAGAAACATTATTGAACATGGCAGAGAACAGACGCTTAGAAAAGCGCAAGAGGTATCCAAGGCTGCAGCAACAGCTCGTTCTAACATTCACAAAAGAGTACGAGAACTGCATGTGTCTGCTATTGAATCCTCCTCACCCTCTGCCATTGGTAGTAGTCAGCGTGCAGGAGTGAACTCTAATAAGCTAGCTAGAGCAGGCGAAAAGAATCCGATGGCATTGGATACTCGCATCTCTTCAACCGGTAACATCAGTCAACCACGATCAGAGAAGCCTGAGTTTCCCACTGTTGGCGATACTGTGTACATTTCTTCGTTTGGAAAAGAAGCGACTGTTTTAAGTATAGAGCCATTAAAAGGCGAAATAGTTGTTCAAGTTGGTAGCATAAAGTTGAAGCTGAAGTTCACCGACATAATGAGATAA

Protein sequence

MTGDTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYGAASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSSQRAGVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR
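
The protein sequence above should follow from the coding sequence under the standard genetic code. The Python sketch below is one way to check that; it is not the database's own method. It assumes NCBI translation table 1, and the names `translate`, `cds_start` and `cds_end` are illustrative. Only the first 30 and last 21 nucleotides of the CDS are used, so the block stays short.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
# Assumption: the record itself does not say which table was used.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 21 nucleotides of the CDS listed above (both in frame).
cds_start = "ATGACAGGAGATACAGTTGGTGCAGGAACC"
cds_end = "TTCACCGACATAATGAGATAA"

print(translate(cds_start))
print(translate(cds_end))
```

Passing the full CDS string to `translate` and comparing the result with the listed protein gives the same check over the whole sequence.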
Homology
BLAST of Tan0001692 vs. ExPASy Swiss-Prot
Match: P73625 (Endonuclease MutS2 OS=Synechocystis sp. (strain PCC 6803 / Kazusa) OX=1111708 GN=mutS2 PE=3 SV=1)

HSP 1 Score: 156.0 bits (393), Expect = 6.4e-37
Identity = 104/282 (36.88%), Positives = 158/282 (56.03%), Query Frame = 0

Query: 4   DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEV 63
           D VGAGT+P EG+AL ++LL  LA     LT+ATTH+GELK LKY    FENA +EFD+ 
Sbjct: 446 DEVGAGTDPTEGSALAIALLRHLADQ-PCLTVATTHYGELKALKYQDARFENASVEFDDQ 505

Query: 64  NLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYGAASAQIDEVIIDMECIKKKY 123
           +L PTY+LLWG+PGRSNA+ IA+RLGLP +IV+ A++  G  S  I++VI  +E  +++ 
Sbjct: 506 SLSPTYRLLWGIPGRSNALAIAQRLGLPLAIVEQAKDKLGGFSEDINQVIAGLESQRREQ 565

Query: 124 GELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRE 183
            +    AQ  L +++  Y+++     ++    RE    + QEV +A A A+  I K +R+
Sbjct: 566 EQKAANAQKLLQETEIFYQQVSQKAASLQARERELKSYQDQEVQQAIAAAKEEIAKVIRQ 625

Query: 184 LHVSAIESSSPSAIGSSQRAGVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPT 243
           L     +   PSA  + Q   +      +A +K  +A            +P   +   PT
Sbjct: 626 L-----QRGKPSAQKAQQATEILGQ--IQAEQKAKVA-----------PKPIGYQ---PT 685

Query: 244 VGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDI 286
           VG+ + I SFG+ A V  +      + V +G +K+ +   DI
Sbjct: 686 VGERIRIPSFGQTAEVTQVNATAQTVNVTLGLMKMTVPMADI 705

BLAST of Tan0001692 vs. ExPASy Swiss-Prot
Match: A5D0W6 (Endonuclease MutS2 OS=Pelotomaculum thermopropionicum (strain DSM 13744 / JCM 10971 / SI) OX=370438 GN=mutS2 PE=3 SV=1)

HSP 1 Score: 135.2 bits (339), Expect = 1.2e-30
Identity = 99/285 (34.74%), Positives = 145/285 (50.88%), Query Frame = 0

Query: 4   DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEV 63
           D +GAGT+P EGAAL  S+LE L  +GA  T+ATTH+GELK    ++E  ENA +EFD +
Sbjct: 411 DELGAGTDPAEGAALAQSILEKLHSAGAK-TVATTHYGELKDFALTRERVENASVEFDAI 470

Query: 64  NLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYGAASAQIDEVIIDMECIKKKY 123
            L+PTY+LL G PGRSNA  IA RLGLP  +V  AR    A   Q +E++  +E  +++ 
Sbjct: 471 TLRPTYRLLIGKPGRSNAFEIAARLGLPEEVVKRARSFLTAEHIQAEELMRSLEKTQQEA 530

Query: 124 GELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKA-AATARSNIHKRVR 183
               + A     +++ L E+   +  ++    RE  L KA E ++A    AR      VR
Sbjct: 531 EAERRRAAELASEARALKERYEKIEADLASK-RESILSKAAEEAQALVRAARLEAEAAVR 590

Query: 184 ELHVSAIESSSPSAIGSSQRAGVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFP 243
           EL     E          + A    N +  A EK    L  R+           E P   
Sbjct: 591 ELREKMAE----------EAARERENAIREAREK-LRKLQQRVGRAVPEKTVPGEAPAGL 650

Query: 244 TVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR 288
             G+ V+++ + ++  VL      GE++VQVG IK+ +   ++ R
Sbjct: 651 RPGEEVFLTRYNQKGYVLEPPGAGGEVLVQVGVIKMNVPLRELRR 682

BLAST of Tan0001692 vs. ExPASy Swiss-Prot
Match: A6LS00 (Endonuclease MutS2 OS=Clostridium beijerinckii (strain ATCC 51743 / NCIMB 8052) OX=290402 GN=mutS2 PE=3 SV=1)

HSP 1 Score: 134.8 bits (338), Expect = 1.5e-30
Identity = 93/287 (32.40%), Positives = 146/287 (50.87%), Query Frame = 0

Query: 4   DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEV 63
           D +GAGT+P EGAAL +S+LE+L   GA L IATTH+ ELK      +  ENA +EFD  
Sbjct: 412 DELGAGTDPAEGAALAVSILETLRNRGAKL-IATTHYSELKAYALKTDGVENASVEFDIE 471

Query: 64  NLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYGAASAQIDEVIIDME---CIK 123
            L+PTY+LL GVPG+SNA  I++RLGL   ++  A+E+    + Q + +I +++    I 
Sbjct: 472 TLRPTYRLLIGVPGKSNAFEISKRLGLVEGVIKRAKEYMSEENLQFENLIRELQEKSIIA 531

Query: 124 KKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKR 183
           KK     +  ++   D K  YE+ L    N  E       R+A+E+   A     +I K 
Sbjct: 532 KKEAREAKMLRDQAEDLKKKYEEKLEKLENTREKAYMDARREAKEIIANAKDEADDILKA 591

Query: 184 VRELHVSAIESSSPSAIGSSQRAGVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPE 243
           +REL    I             AG    +L    +K   +L+ R      + +   E   
Sbjct: 592 MRELEKLGI-------------AGGGRQRLEEERKKLKDSLEEREKGIHKMKENEGESIT 651

Query: 244 FPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR 288
             T+G   Y+ S  ++  ++S+   +GE+ V+ G +K+ +K  D+ +
Sbjct: 652 NVTLGMEAYLPSLNQKVIIVSMPDNRGEVQVEAGIMKVNVKLKDLRK 684

BLAST of Tan0001692 vs. ExPASy Swiss-Prot
Match: A0PZP4 (Endonuclease MutS2 OS=Clostridium novyi (strain NT) OX=386415 GN=mutS2 PE=3 SV=1)

HSP 1 Score: 134.0 bits (336), Expect = 2.6e-30
Identity = 95/290 (32.76%), Positives = 153/290 (52.76%), Query Frame = 0

Query: 4   DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEV 63
           D +GAGT+P EGAAL +S+LE+L K    + IATTH+ ELK      +  ENA +EFD  
Sbjct: 412 DELGAGTDPTEGAALAVSILENLRKRKTKV-IATTHYSELKAYALKVDNVENASVEFDVE 471

Query: 64  NLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYGAASAQIDEVIIDME---CIK 123
            L+PTY+LL GVPG+SNA  I++RLGLP  I++DARE     + + +++I  ++      
Sbjct: 472 TLRPTYRLLIGVPGKSNAFEISKRLGLPDYIIEDAREGISEETLKFEDLIQSLQHKNIKA 531

Query: 124 KKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKR 183
           +++    + A+   +  K  YE  L   ++I E       ++A+E+ K A      I K 
Sbjct: 532 QEHARKAESAKEEAVKLKEKYESKLDKFQDIREKAILNAQKEAKEIIKEAKEEADKILKD 591

Query: 184 VRELHVSAIESSSPSAIGSSQRAGVNSNKLARAGEKNPMALDTRISST-GNISQPRSEKP 243
           +REL                +R G +S+ + +  E+N   L  ++  T   ++QP+    
Sbjct: 592 IREL----------------ERMGYSSD-VRKLLEENRKKLKDKLEKTESKLNQPKEVGE 651

Query: 244 EFPTV--GDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR 288
               V  GD +Y+  F  +  VL+    KG++ VQ G +K+K+   D+ +
Sbjct: 652 AVTNVSEGDELYLPKFETKVMVLTNPDNKGDVQVQAGIMKIKVNIKDLRK 683

BLAST of Tan0001692 vs. ExPASy Swiss-Prot
Match: C3KTI4 (Endonuclease MutS2 OS=Clostridium botulinum (strain 657 / Type Ba4) OX=515621 GN=mutS2 PE=3 SV=1)

HSP 1 Score: 132.9 bits (333), Expect = 5.8e-30
Identity = 99/288 (34.38%), Positives = 154/288 (53.47%), Query Frame = 0

Query: 4   DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEV 63
           D +GAGT+P EGAAL +S+LE+L K GA + IATTH+ ELK     KE  ENA +EFD  
Sbjct: 412 DELGAGTDPTEGAALAISILENLRKRGAKI-IATTHYSELKAYALRKEGVENASVEFDVE 471

Query: 64  NLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYGAASAQIDEVIIDMECIKKKY 123
            L+PTY+LL G+PG+SNA  I++RLGLP  I+D ARE+    + + +E+I +++    K 
Sbjct: 472 TLRPTYRLLIGIPGKSNAFEISKRLGLPDYIIDFARENISNENIRFEELIQNLQEKSIKA 531

Query: 124 GELLQEAQNHLMD---SKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKR 183
            E  + A+N  ++    K  YE+ L   + + ++      R+A+ + K A      I K 
Sbjct: 532 QEDARLAENLKLERDKEKKKYEEKLEGLQKVRDNALIDARREAKNIIKEAKEEADKILKD 591

Query: 184 VRELHVSAIESSSPSAIGSSQRAGVNSN---KLARAGEKNPMALDTRISSTGNISQPRSE 243
           +R+L                +R G +S+   KL    +K    LD+ I      +  + E
Sbjct: 592 IRQL----------------ERMGYSSDARRKLEEERKKLKDKLDS-IEEKEIKTVHKGE 651

Query: 244 KPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDI 286
             +    GD V ++S  ++  VLS    KG+++VQ G +K+     D+
Sbjct: 652 ALKNVKEGDEVLLASINQKVIVLSKPDNKGDVLVQAGIMKITANIKDL 681

BLAST of Tan0001692 vs. NCBI nr
Match: XP_023552491.1 (uncharacterized protein LOC111810138 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 455.3 bits (1170), Expect = 3.9e-124
Identity = 233/284 (82.04%), Positives = 256/284 (90.14%), Query Frame = 0

Query: 4   DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEV 63
           D VGAGTNPLEGAALGMSLLES AK GA+LTIATTHHGELKTLKYS EVFENACMEFDEV
Sbjct: 557 DEVGAGTNPLEGAALGMSLLESFAKCGAALTIATTHHGELKTLKYSNEVFENACMEFDEV 616

Query: 64  NLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYGAASAQIDEVIIDMECIKKKY 123
           NLKPTYK+LWGVPGRSNAINIAERLG+PSS+VDDAREHYGAASAQIDEVI+DMEC KKKY
Sbjct: 617 NLKPTYKILWGVPGRSNAINIAERLGVPSSVVDDAREHYGAASAQIDEVILDMECTKKKY 676

Query: 124 GELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRE 183
           G+LLQEAQN+L DSKNLYEK+LL RRNIIEHGR+Q LRK QEVS+AAATARSN+H++VRE
Sbjct: 677 GDLLQEAQNNLTDSKNLYEKLLLARRNIIEHGRQQRLRKVQEVSRAAATARSNLHRKVRE 736

Query: 184 LHVSAIESSSPSAIGSSQRAGVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPT 243
           L  SAIE S PSAI S QRA  N N L   G+KN MALDT +SSTG+I+QPRSE+PEFPT
Sbjct: 737 LRASAIEFSPPSAIDSRQRAVKNPNTLDTTGQKNSMALDTHLSSTGDINQPRSEEPEFPT 796

Query: 244 VGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR 288
           VGDTVY+SSFGK+ATVL +EP K E+ V+VGSIKLKLKFTDIMR
Sbjct: 797 VGDTVYVSSFGKKATVLGVEPSKDEVTVRVGSIKLKLKFTDIMR 840

BLAST of Tan0001692 vs. NCBI nr
Match: XP_022922843.1 (uncharacterized protein LOC111430703 isoform X3 [Cucurbita moschata])

HSP 1 Score: 454.1 bits (1167), Expect = 8.7e-124
Identity = 233/284 (82.04%), Positives = 255/284 (89.79%), Query Frame = 0

Query: 4   DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEV 63
           D VGAGTNPLEGAALGMSLLES AK GA+LTIATTHHGELKTLKYS EVFENACMEFDEV
Sbjct: 559 DEVGAGTNPLEGAALGMSLLESFAKCGAALTIATTHHGELKTLKYSNEVFENACMEFDEV 618

Query: 64  NLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYGAASAQIDEVIIDMECIKKKY 123
           NLKPTYK+LWGVPGRSNAINIAERLG+PSS+VDDAREHYGAASAQIDEVI+DMEC KKKY
Sbjct: 619 NLKPTYKILWGVPGRSNAINIAERLGVPSSVVDDAREHYGAASAQIDEVILDMECTKKKY 678

Query: 124 GELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRE 183
           G+LLQEAQN+L DSKNLYEK+LL RRNIIEHGR+Q LRK QEVS+AAATARSN+H++VRE
Sbjct: 679 GDLLQEAQNNLTDSKNLYEKLLLARRNIIEHGRQQRLRKVQEVSRAAATARSNLHRKVRE 738

Query: 184 LHVSAIESSSPSAIGSSQRAGVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPT 243
           L  SAIE S PSA  S QRA  N N L   G+KN MALDT ISSTG+I+QPRSE+PEFPT
Sbjct: 739 LRASAIEFSPPSATDSRQRAVKNPNTLDTTGQKNSMALDTHISSTGDINQPRSEEPEFPT 798

Query: 244 VGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR 288
           VGDTVY+SSFGK+ATVL +EP K E+ V+VGSIKLKLKFTDIMR
Sbjct: 799 VGDTVYVSSFGKKATVLGVEPSKDEVTVRVGSIKLKLKFTDIMR 842

BLAST of Tan0001692 vs. NCBI nr
Match: XP_022922844.1 (uncharacterized protein LOC111430703 isoform X4 [Cucurbita moschata])

HSP 1 Score: 454.1 bits (1167), Expect = 8.7e-124
Identity = 233/284 (82.04%), Positives = 255/284 (89.79%), Query Frame = 0

Query: 4   DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEV 63
           D VGAGTNPLEGAALGMSLLES AK GA+LTIATTHHGELKTLKYS EVFENACMEFDEV
Sbjct: 557 DEVGAGTNPLEGAALGMSLLESFAKCGAALTIATTHHGELKTLKYSNEVFENACMEFDEV 616

Query: 64  NLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYGAASAQIDEVIIDMECIKKKY 123
           NLKPTYK+LWGVPGRSNAINIAERLG+PSS+VDDAREHYGAASAQIDEVI+DMEC KKKY
Sbjct: 617 NLKPTYKILWGVPGRSNAINIAERLGVPSSVVDDAREHYGAASAQIDEVILDMECTKKKY 676

Query: 124 GELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRE 183
           G+LLQEAQN+L DSKNLYEK+LL RRNIIEHGR+Q LRK QEVS+AAATARSN+H++VRE
Sbjct: 677 GDLLQEAQNNLTDSKNLYEKLLLARRNIIEHGRQQRLRKVQEVSRAAATARSNLHRKVRE 736

Query: 184 LHVSAIESSSPSAIGSSQRAGVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPT 243
           L  SAIE S PSA  S QRA  N N L   G+KN MALDT ISSTG+I+QPRSE+PEFPT
Sbjct: 737 LRASAIEFSPPSATDSRQRAVKNPNTLDTTGQKNSMALDTHISSTGDINQPRSEEPEFPT 796

Query: 244 VGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR 288
           VGDTVY+SSFGK+ATVL +EP K E+ V+VGSIKLKLKFTDIMR
Sbjct: 797 VGDTVYVSSFGKKATVLGVEPSKDEVTVRVGSIKLKLKFTDIMR 840

BLAST of Tan0001692 vs. NCBI nr
Match: XP_022985052.1 (uncharacterized protein LOC111483140 isoform X3 [Cucurbita maxima])

HSP 1 Score: 448.0 bits (1151), Expect = 6.3e-122
Identity = 230/284 (80.99%), Positives = 253/284 (89.08%), Query Frame = 0

Query: 4   DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEV 63
           D VGAGTNPLEGAALGMSLLES AK GA+LTIATTHHGELKTLKYS EVFENACMEFDEV
Sbjct: 559 DEVGAGTNPLEGAALGMSLLESFAKCGAALTIATTHHGELKTLKYSNEVFENACMEFDEV 618

Query: 64  NLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYGAASAQIDEVIIDMECIKKKY 123
           NLKPTYK+LWGVPGRSNAINIAERLG+PSS+VDDAREHYGAASAQIDEVI+DMEC KKK+
Sbjct: 619 NLKPTYKILWGVPGRSNAINIAERLGVPSSVVDDAREHYGAASAQIDEVILDMECTKKKF 678

Query: 124 GELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRE 183
           G+LLQEAQN+L DSKNLYEK+LL RRNIIEHGR Q LRK QEVS+AAATARSN+H++VRE
Sbjct: 679 GDLLQEAQNNLTDSKNLYEKLLLARRNIIEHGRHQRLRKVQEVSRAAATARSNLHRKVRE 738

Query: 184 LHVSAIESSSPSAIGSSQRAGVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPT 243
           L  SAIE S PSAI S QRA  N N L   G+KN MALDT +SSTG+I+QPRSE+P FPT
Sbjct: 739 LRASAIEFSPPSAIDSRQRAVKNPNTLDTTGQKNSMALDTHLSSTGDINQPRSEEPGFPT 798

Query: 244 VGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR 288
           VGD VY+SSFGK+ATVL +EP K E+ V+VGSIKLKLKFTDIMR
Sbjct: 799 VGDIVYVSSFGKKATVLGVEPSKDEVTVRVGSIKLKLKFTDIMR 842

BLAST of Tan0001692 vs. NCBI nr
Match: XP_022985053.1 (uncharacterized protein LOC111483140 isoform X4 [Cucurbita maxima])

HSP 1 Score: 448.0 bits (1151), Expect = 6.3e-122
Identity = 230/284 (80.99%), Positives = 253/284 (89.08%), Query Frame = 0

Query: 4   DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEV 63
           D VGAGTNPLEGAALGMSLLES AK GA+LTIATTHHGELKTLKYS EVFENACMEFDEV
Sbjct: 557 DEVGAGTNPLEGAALGMSLLESFAKCGAALTIATTHHGELKTLKYSNEVFENACMEFDEV 616

Query: 64  NLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYGAASAQIDEVIIDMECIKKKY 123
           NLKPTYK+LWGVPGRSNAINIAERLG+PSS+VDDAREHYGAASAQIDEVI+DMEC KKK+
Sbjct: 617 NLKPTYKILWGVPGRSNAINIAERLGVPSSVVDDAREHYGAASAQIDEVILDMECTKKKF 676

Query: 124 GELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRE 183
           G+LLQEAQN+L DSKNLYEK+LL RRNIIEHGR Q LRK QEVS+AAATARSN+H++VRE
Sbjct: 677 GDLLQEAQNNLTDSKNLYEKLLLARRNIIEHGRHQRLRKVQEVSRAAATARSNLHRKVRE 736

Query: 184 LHVSAIESSSPSAIGSSQRAGVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPT 243
           L  SAIE S PSAI S QRA  N N L   G+KN MALDT +SSTG+I+QPRSE+P FPT
Sbjct: 737 LRASAIEFSPPSAIDSRQRAVKNPNTLDTTGQKNSMALDTHLSSTGDINQPRSEEPGFPT 796

Query: 244 VGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR 288
           VGD VY+SSFGK+ATVL +EP K E+ V+VGSIKLKLKFTDIMR
Sbjct: 797 VGDIVYVSSFGKKATVLGVEPSKDEVTVRVGSIKLKLKFTDIMR 840

BLAST of Tan0001692 vs. ExPASy TrEMBL
Match: A0A6J1E4M8 (uncharacterized protein LOC111430703 isoform X4 OS=Cucurbita moschata OX=3662 GN=LOC111430703 PE=4 SV=1)

HSP 1 Score: 454.1 bits (1167), Expect = 4.2e-124
Identity = 233/284 (82.04%), Positives = 255/284 (89.79%), Query Frame = 0

Query: 4   DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEV 63
           D VGAGTNPLEGAALGMSLLES AK GA+LTIATTHHGELKTLKYS EVFENACMEFDEV
Sbjct: 557 DEVGAGTNPLEGAALGMSLLESFAKCGAALTIATTHHGELKTLKYSNEVFENACMEFDEV 616

Query: 64  NLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYGAASAQIDEVIIDMECIKKKY 123
           NLKPTYK+LWGVPGRSNAINIAERLG+PSS+VDDAREHYGAASAQIDEVI+DMEC KKKY
Sbjct: 617 NLKPTYKILWGVPGRSNAINIAERLGVPSSVVDDAREHYGAASAQIDEVILDMECTKKKY 676

Query: 124 GELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRE 183
           G+LLQEAQN+L DSKNLYEK+LL RRNIIEHGR+Q LRK QEVS+AAATARSN+H++VRE
Sbjct: 677 GDLLQEAQNNLTDSKNLYEKLLLARRNIIEHGRQQRLRKVQEVSRAAATARSNLHRKVRE 736

Query: 184 LHVSAIESSSPSAIGSSQRAGVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPT 243
           L  SAIE S PSA  S QRA  N N L   G+KN MALDT ISSTG+I+QPRSE+PEFPT
Sbjct: 737 LRASAIEFSPPSATDSRQRAVKNPNTLDTTGQKNSMALDTHISSTGDINQPRSEEPEFPT 796

Query: 244 VGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR 288
           VGDTVY+SSFGK+ATVL +EP K E+ V+VGSIKLKLKFTDIMR
Sbjct: 797 VGDTVYVSSFGKKATVLGVEPSKDEVTVRVGSIKLKLKFTDIMR 840

BLAST of Tan0001692 vs. ExPASy TrEMBL
Match: A0A6J1E7X9 (uncharacterized protein LOC111430703 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111430703 PE=4 SV=1)

HSP 1 Score: 454.1 bits (1167), Expect = 4.2e-124
Identity = 233/284 (82.04%), Positives = 255/284 (89.79%), Query Frame = 0

Query: 4   DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEV 63
           D VGAGTNPLEGAALGMSLLES AK GA+LTIATTHHGELKTLKYS EVFENACMEFDEV
Sbjct: 559 DEVGAGTNPLEGAALGMSLLESFAKCGAALTIATTHHGELKTLKYSNEVFENACMEFDEV 618

Query: 64  NLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYGAASAQIDEVIIDMECIKKKY 123
           NLKPTYK+LWGVPGRSNAINIAERLG+PSS+VDDAREHYGAASAQIDEVI+DMEC KKKY
Sbjct: 619 NLKPTYKILWGVPGRSNAINIAERLGVPSSVVDDAREHYGAASAQIDEVILDMECTKKKY 678

Query: 124 GELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRE 183
           G+LLQEAQN+L DSKNLYEK+LL RRNIIEHGR+Q LRK QEVS+AAATARSN+H++VRE
Sbjct: 679 GDLLQEAQNNLTDSKNLYEKLLLARRNIIEHGRQQRLRKVQEVSRAAATARSNLHRKVRE 738

Query: 184 LHVSAIESSSPSAIGSSQRAGVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPT 243
           L  SAIE S PSA  S QRA  N N L   G+KN MALDT ISSTG+I+QPRSE+PEFPT
Sbjct: 739 LRASAIEFSPPSATDSRQRAVKNPNTLDTTGQKNSMALDTHISSTGDINQPRSEEPEFPT 798

Query: 244 VGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR 288
           VGDTVY+SSFGK+ATVL +EP K E+ V+VGSIKLKLKFTDIMR
Sbjct: 799 VGDTVYVSSFGKKATVLGVEPSKDEVTVRVGSIKLKLKFTDIMR 842

BLAST of Tan0001692 vs. ExPASy TrEMBL
Match: A0A6J1JCF6 (uncharacterized protein LOC111483140 isoform X4 OS=Cucurbita maxima OX=3661 GN=LOC111483140 PE=4 SV=1)

HSP 1 Score: 448.0 bits (1151), Expect = 3.0e-122
Identity = 230/284 (80.99%), Positives = 253/284 (89.08%), Query Frame = 0

Query: 4   DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEV 63
           D VGAGTNPLEGAALGMSLLES AK GA+LTIATTHHGELKTLKYS EVFENACMEFDEV
Sbjct: 557 DEVGAGTNPLEGAALGMSLLESFAKCGAALTIATTHHGELKTLKYSNEVFENACMEFDEV 616

Query: 64  NLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYGAASAQIDEVIIDMECIKKKY 123
           NLKPTYK+LWGVPGRSNAINIAERLG+PSS+VDDAREHYGAASAQIDEVI+DMEC KKK+
Sbjct: 617 NLKPTYKILWGVPGRSNAINIAERLGVPSSVVDDAREHYGAASAQIDEVILDMECTKKKF 676

Query: 124 GELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRE 183
           G+LLQEAQN+L DSKNLYEK+LL RRNIIEHGR Q LRK QEVS+AAATARSN+H++VRE
Sbjct: 677 GDLLQEAQNNLTDSKNLYEKLLLARRNIIEHGRHQRLRKVQEVSRAAATARSNLHRKVRE 736

Query: 184 LHVSAIESSSPSAIGSSQRAGVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPT 243
           L  SAIE S PSAI S QRA  N N L   G+KN MALDT +SSTG+I+QPRSE+P FPT
Sbjct: 737 LRASAIEFSPPSAIDSRQRAVKNPNTLDTTGQKNSMALDTHLSSTGDINQPRSEEPGFPT 796

Query: 244 VGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR 288
           VGD VY+SSFGK+ATVL +EP K E+ V+VGSIKLKLKFTDIMR
Sbjct: 797 VGDIVYVSSFGKKATVLGVEPSKDEVTVRVGSIKLKLKFTDIMR 840

BLAST of Tan0001692 vs. ExPASy TrEMBL
Match: A0A6J1J3U0 (uncharacterized protein LOC111483140 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111483140 PE=4 SV=1)

HSP 1 Score: 448.0 bits (1151), Expect = 3.0e-122
Identity = 230/284 (80.99%), Positives = 253/284 (89.08%), Query Frame = 0

Query: 4   DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEV 63
           D VGAGTNPLEGAALGMSLLES AK GA+LTIATTHHGELKTLKYS EVFENACMEFDEV
Sbjct: 559 DEVGAGTNPLEGAALGMSLLESFAKCGAALTIATTHHGELKTLKYSNEVFENACMEFDEV 618

Query: 64  NLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYGAASAQIDEVIIDMECIKKKY 123
           NLKPTYK+LWGVPGRSNAINIAERLG+PSS+VDDAREHYGAASAQIDEVI+DMEC KKK+
Sbjct: 619 NLKPTYKILWGVPGRSNAINIAERLGVPSSVVDDAREHYGAASAQIDEVILDMECTKKKF 678

Query: 124 GELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRE 183
           G+LLQEAQN+L DSKNLYEK+LL RRNIIEHGR Q LRK QEVS+AAATARSN+H++VRE
Sbjct: 679 GDLLQEAQNNLTDSKNLYEKLLLARRNIIEHGRHQRLRKVQEVSRAAATARSNLHRKVRE 738

Query: 184 LHVSAIESSSPSAIGSSQRAGVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPT 243
           L  SAIE S PSAI S QRA  N N L   G+KN MALDT +SSTG+I+QPRSE+P FPT
Sbjct: 739 LRASAIEFSPPSAIDSRQRAVKNPNTLDTTGQKNSMALDTHLSSTGDINQPRSEEPGFPT 798

Query: 244 VGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR 288
           VGD VY+SSFGK+ATVL +EP K E+ V+VGSIKLKLKFTDIMR
Sbjct: 799 VGDIVYVSSFGKKATVLGVEPSKDEVTVRVGSIKLKLKFTDIMR 842

BLAST of Tan0001692 vs. ExPASy TrEMBL
Match: A0A6J1E9Y4 (uncharacterized protein LOC111430703 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111430703 PE=4 SV=1)

HSP 1 Score: 445.3 bits (1144), Expect = 2.0e-121
Identity = 233/296 (78.72%), Positives = 255/296 (86.15%), Query Frame = 0

Query: 4   DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLK------------YSKE 63
           D VGAGTNPLEGAALGMSLLES AK GA+LTIATTHHGELKTLK            YS E
Sbjct: 559 DEVGAGTNPLEGAALGMSLLESFAKCGAALTIATTHHGELKTLKYRSLIHSYEAFLYSNE 618

Query: 64  VFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYGAASAQIDE 123
           VFENACMEFDEVNLKPTYK+LWGVPGRSNAINIAERLG+PSS+VDDAREHYGAASAQIDE
Sbjct: 619 VFENACMEFDEVNLKPTYKILWGVPGRSNAINIAERLGVPSSVVDDAREHYGAASAQIDE 678

Query: 124 VIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAA 183
           VI+DMEC KKKYG+LLQEAQN+L DSKNLYEK+LL RRNIIEHGR+Q LRK QEVS+AAA
Sbjct: 679 VILDMECTKKKYGDLLQEAQNNLTDSKNLYEKLLLARRNIIEHGRQQRLRKVQEVSRAAA 738

Query: 184 TARSNIHKRVRELHVSAIESSSPSAIGSSQRAGVNSNKLARAGEKNPMALDTRISSTGNI 243
           TARSN+H++VREL  SAIE S PSA  S QRA  N N L   G+KN MALDT ISSTG+I
Sbjct: 739 TARSNLHRKVRELRASAIEFSPPSATDSRQRAVKNPNTLDTTGQKNSMALDTHISSTGDI 798

Query: 244 SQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR 288
           +QPRSE+PEFPTVGDTVY+SSFGK+ATVL +EP K E+ V+VGSIKLKLKFTDIMR
Sbjct: 799 NQPRSEEPEFPTVGDTVYVSSFGKKATVLGVEPSKDEVTVRVGSIKLKLKFTDIMR 854

BLAST of Tan0001692 vs. TAIR 10
Match: AT5G54090.1 (DNA mismatch repair protein MutS, type 2 )

HSP 1 Score: 265.4 bits (677), Expect = 5.4e-71
Identity = 141/283 (49.82%), Positives = 201/283 (71.02%), Query Frame = 0

Query: 4   DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEV 63
           D VGAGTNPLEGAALGM++LES A+SG+ LT+ATTHHGELKTLKYS   FENACMEFD++
Sbjct: 521 DEVGAGTNPLEGAALGMAILESFAESGSLLTMATTHHGELKTLKYSNSAFENACMEFDDL 580

Query: 64  NLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYGAASAQIDEVIIDMECIKKKY 123
           NLKPTYK+LWGVPGRSNAINIA+RLGLP  I++ ARE YG+ASA+I+EVI+DME  K++Y
Sbjct: 581 NLKPTYKILWGVPGRSNAINIADRLGLPCDIIESARELYGSASAEINEVILDMERYKQEY 640

Query: 124 GELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRE 183
             LL E++ ++  S+ L+E +L  ++NI +H  ++  +  QE+++A +  RS + + +++
Sbjct: 641 QRLLNESRVYIRLSRELHENLLTAQKNINDHSTKERRKMRQELTQAGSMTRSTLRRTLQQ 700

Query: 184 LHVSAIESSSPSAIGSSQRAGVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPT 243
              SA   SS S + +  +  V + K    G ++   ++ R         P +   + P 
Sbjct: 701 FRSSA-GKSSQSKVATQLQTKVKTTKDEDNGIRSSSVVERR-------PLPEAAAQKVPE 760

Query: 244 VGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIM 287
           VG +V++SS GK+ATVL +E  K EI+VQVG +K+K+K TD++
Sbjct: 761 VGSSVFVSSLGKKATVLKVEHSKKEILVQVGIMKMKVKLTDVV 795

BLAST of Tan0001692 vs. TAIR 10
Match: AT1G65070.1 (DNA mismatch repair protein MutS, type 2 )

HSP 1 Score: 84.7 bits (208), Expect = 1.3e-16
Identity = 77/307 (25.08%), Positives = 145/307 (47.23%), Query Frame = 0

Query: 4   DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEV 63
           D + +GT+P EG AL  S+L+ + K+  ++ + +TH+G+L  LK ++  F+NA MEF   
Sbjct: 482 DEICSGTDPSEGVALATSILQYI-KNRVNVAVVSTHYGDLSRLKDNEPRFQNAAMEFSME 541

Query: 64  NLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYGAASAQIDEVIIDMECIKKKY 123
            L+PT+++LWG  G SNA+ +A+ +G    I+++A +     + + D        +++K 
Sbjct: 542 TLQPTFRVLWGSTGLSNALRVAKSIGFNKRILENAHKWTEKLNPEQD--------VERK- 601

Query: 124 GELLQ---EAQNHL-----------MDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKA 183
           G L Q   E +N L            D  NLY ++     ++ +  R    ++ Q+V + 
Sbjct: 602 GSLFQSLMEERNKLKLQATKTAAFHRDLMNLYHELEHESHDLDKRERALLKKETQKVQED 661

Query: 184 AATARSNIHKRVRELHVSAIESSSPSAIGSSQRAGVNSNKLARAGEKNPMALDTRISSTG 243
             +A+S + + V E                      +  ++ +A + N + L T  +   
Sbjct: 662 LNSAKSKMERLVAEFE--------------------SQLEITQADQYNSLILKTEEAVAE 721

Query: 244 NIS-----QPRSEKPEF----PTVGDTVYISSFGKEATVLSIEPLKGE--IVVQVGSIKL 286
            I       P S + E+    P  G+ V ++  G +   +  EP   +  ++VQ G I++
Sbjct: 722 IIEACCPMDPDSLEEEYSDYSPQAGEKVLVTGLGDKLGTVVEEPGDDDDTVLVQHGKIRV 758

BLAST of Tan0001692 vs. TAIR 10
Match: AT1G65070.2 (DNA mismatch repair protein MutS, type 2 )

HSP 1 Score: 84.7 bits (208), Expect = 1.3e-16
Identity = 77/307 (25.08%), Positives = 145/307 (47.23%), Query Frame = 0

Query: 4   DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEV 63
           D + +GT+P EG AL  S+L+ + K+  ++ + +TH+G+L  LK ++  F+NA MEF   
Sbjct: 482 DEICSGTDPSEGVALATSILQYI-KNRVNVAVVSTHYGDLSRLKDNEPRFQNAAMEFSME 541

Query: 64  NLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYGAASAQIDEVIIDMECIKKKY 123
            L+PT+++LWG  G SNA+ +A+ +G    I+++A +     + + D        +++K 
Sbjct: 542 TLQPTFRVLWGSTGLSNALRVAKSIGFNKRILENAHKWTEKLNPEQD--------VERK- 601

Query: 124 GELLQ---EAQNHL-----------MDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKA 183
           G L Q   E +N L            D  NLY ++     ++ +  R    ++ Q+V + 
Sbjct: 602 GSLFQSLMEERNKLKLQATKTAAFHRDLMNLYHELEHESHDLDKRERALLKKETQKVQED 661

Query: 184 AATARSNIHKRVRELHVSAIESSSPSAIGSSQRAGVNSNKLARAGEKNPMALDTRISSTG 243
             +A+S + + V E                      +  ++ +A + N + L T  +   
Sbjct: 662 LNSAKSKMERLVAEFE--------------------SQLEITQADQYNSLILKTEEAVAE 721

Query: 244 NIS-----QPRSEKPEF----PTVGDTVYISSFGKEATVLSIEPLKGE--IVVQVGSIKL 286
            I       P S + E+    P  G+ V ++  G +   +  EP   +  ++VQ G I++
Sbjct: 722 IIEACCPMDPDSLEEEYSDYSPQAGEKVLVTGLGDKLGTVVEEPGDDDDTVLVQHGKIRV 758

BLAST of Tan0001692 vs. TAIR 10
Match: AT3G24320.1 (MUTL protein homolog 1 )

HSP 1 Score: 45.8 bits (107), Expect = 6.7e-05
Identity = 33/100 (33.00%), Positives = 47/100 (47.00%), Query Frame = 0

Query: 4   DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENAC-MEFDE 63
           D +  GT   +G  +  S++ESL  SG    ++T  HG       +K +   A   E  E
Sbjct: 847 DEICRGTETAKGTCIAGSVVESLDTSGCLGIVSTHLHGIFSLPLTAKNITYKAMGAENVE 906

Query: 64  VNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHY 103
              KPT+KL  GV   S A   A+R G+P S++  A   Y
Sbjct: 907 GQTKPTWKLTDGVCRESLAFETAKREGVPESVIQRAEALY 946

The following BLAST results are available for this feature:
ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
P73625 | 6.4e-37 | 36.88 | Endonuclease MutS2 OS=Synechocystis sp. (strain PCC 6803 / Kazusa) OX=1111708 GN...
A5D0W6 | 1.2e-30 | 34.74 | Endonuclease MutS2 OS=Pelotomaculum thermopropionicum (strain DSM 13744 / JCM 10...
A6LS00 | 1.5e-30 | 32.40 | Endonuclease MutS2 OS=Clostridium beijerinckii (strain ATCC 51743 / NCIMB 8052) ...
A0PZP4 | 2.6e-30 | 32.76 | Endonuclease MutS2 OS=Clostridium novyi (strain NT) OX=386415 GN=mutS2 PE=3 SV=1
C3KTI4 | 5.8e-30 | 34.38 | Endonuclease MutS2 OS=Clostridium botulinum (strain 657 / Type Ba4) OX=515621 GN...

NCBI nr
Match Name | E-value | Identity (%) | Description
XP_023552491.1 | 3.9e-124 | 82.04 | uncharacterized protein LOC111810138 [Cucurbita pepo subsp. pepo]
XP_022922843.1 | 8.7e-124 | 82.04 | uncharacterized protein LOC111430703 isoform X3 [Cucurbita moschata]
XP_022922844.1 | 8.7e-124 | 82.04 | uncharacterized protein LOC111430703 isoform X4 [Cucurbita moschata]
XP_022985052.1 | 6.3e-122 | 80.99 | uncharacterized protein LOC111483140 isoform X3 [Cucurbita maxima]
XP_022985053.1 | 6.3e-122 | 80.99 | uncharacterized protein LOC111483140 isoform X4 [Cucurbita maxima]

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1E4M8 | 4.2e-124 | 82.04 | uncharacterized protein LOC111430703 isoform X4 OS=Cucurbita moschata OX=3662 GN...
A0A6J1E7X9 | 4.2e-124 | 82.04 | uncharacterized protein LOC111430703 isoform X3 OS=Cucurbita moschata OX=3662 GN...
A0A6J1JCF6 | 3.0e-122 | 80.99 | uncharacterized protein LOC111483140 isoform X4 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1J3U0 | 3.0e-122 | 80.99 | uncharacterized protein LOC111483140 isoform X3 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1E9Y4 | 2.0e-121 | 78.72 | uncharacterized protein LOC111430703 isoform X1 OS=Cucurbita moschata OX=3662 GN...

TAIR 10
Match Name | E-value | Identity (%) | Description
AT5G54090.1 | 5.4e-71 | 49.82 | DNA mismatch repair protein MutS, type 2
AT1G65070.1 | 1.3e-16 | 25.08 | DNA mismatch repair protein MutS, type 2
AT1G65070.2 | 1.3e-16 | 25.08 | DNA mismatch repair protein MutS, type 2
AT3G24320.1 | 6.7e-05 | 33.00 | MUTL protein homolog 1

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR000432 | DNA mismatch repair protein MutS, C-terminal | SMART | SM00534 | mutATP5 | coord: 1..103; e-value: 1.1E-7; score: -14.5
IPR000432 | DNA mismatch repair protein MutS, C-terminal | PFAM | PF00488 | MutS_V | coord: 4..101; e-value: 2.0E-10; score: 41.0
IPR027417 | P-loop containing nucleoside triphosphate hydrolase | GENE3D | 3.40.50.300 |  | coord: 2..188; e-value: 6.0E-32; score: 112.8
IPR027417 | P-loop containing nucleoside triphosphate hydrolase | SUPERFAMILY | 52540 | P-loop containing nucleoside triphosphate hydrolases | coord: 6..103
IPR045076 | DNA mismatch repair MutS family | PANTHER | PTHR11361 | DNA MISMATCH REPAIR PROTEIN MUTS FAMILY MEMBER | coord: 4..284
None | No IPR available | PANTHER | PTHR11361:SF132 | DNA MISMATCH REPAIR PROTEIN MUTS, TYPE 2 | coord: 4..284

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Tan0001692.1 | Tan0001692.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006298 | mismatch repair
biological_process | GO:0045910 | negative regulation of DNA recombination
biological_process | GO:0090305 | nucleic acid phosphodiester bond hydrolysis
molecular_function | GO:0005524 | ATP binding
molecular_function | GO:0016887 | ATP hydrolysis activity
molecular_function | GO:0004519 | endonuclease activity
molecular_function | GO:0030983 | mismatched DNA binding