Sed0021872 (gene) Chayote v1

Overview
Name: Sed0021872
Type: gene
Organism: Sechium edule (Chayote v1)
Description: MMS19 nucleotide excision repair protein
Location: LG07: 10887015 .. 10943787 (+)
RNA-Seq Expression: Sed0021872
Synteny: Sed0021872
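
The Location field above packs a linkage group, a start, an end, and a strand into one string. As a minimal sketch, the snippet below splits that string into its parts and computes the genomic span. It assumes the coordinates are 1-based and inclusive (the usual GFF-style convention, which this record does not state explicitly), and the names `Location`, `parse_location`, and `span_length` are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Parsed pieces of a location string such as 'LG07: 10887015 .. 10943787 (+)'."""
    seqid: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Split a 'SEQID: START .. END (STRAND)' string into typed fields."""
    pattern = r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("LG07: 10887015 .. 10943787 (+)")
print(loc.seqid, loc.strand, span_length(loc))
```

Under that inclusive-coordinate assumption the gene spans 56,773 bp on the plus strand of LG07, introns included.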
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
CTTCATCTCTTCCAAACAACCCTTTAAGAGTTCTTAAAGAAAAAAAGAAAAAGAAAACCCAAAAGGGTCGCGTGTATATAAATTGATTCTACAAGAGGAATCTATTCTCAGTTCTTCGAAGGCAGAAAAACCTTCTTCCGCTAAGTTCTTCACGCGGCTAAAACCCATTCCATAAACCCCAAAATGGGAGAGCTCAGTTCGCTTACACAGTACATCGAATCGTTCGTCGACGCATCTCGTACTGCATCTCAACAGGTTCGTAATCCATTACCGTTTTGTTCTTGAAAACTACGCAATCATCAACGCAACCACTGTAAATTAATCTAGTTTGCAATTCTGATCATCACTTAGTTTGCTGCATATAATTGGTTGGAAACGCAGTAGAAAGTTTTGAAATACAGGAGCTGAACAGTTTTTTTTGAAACAAGGGTAATCACTCCCGTTCTTAGTCCAGCACCGAAGACATACCCGTTTTTGAAACAAGAGTCTCGAACTCAGGACTTTGAGGGGAGCATACCCTCAAAGCCCAAGTTTTCAACCACTGCGCCACCCCTTGGGACATAGCCAGCTGAACAATCGTGTGTGTGTGTGTGCGTGTTTGAGAAAACTGTGCAGCAGGGTGGGGTTTGGGGAGTTACTTTTTGTCAAAGATGAAAAATGCAGTGTACAGAGTGAGACTAAGAAGTGTTGGCTGAATAGTTAGTTTCAACTATGATATTTTACTGTAAAAATTAGGTTTTAGGTGATGATGATCATTGATCTAGGTTCATTTTAAGGACTACCTAGAAAAGAGGAAAGTACAATGTTCAAGTGCTGGAACAAAAAGAGAAGACTTCAAGAAATATGGACCTATAGAAATCTAAAAAGAACAAGCGCTAATTCAAGTCAATCCAACCATCATGGAGTGATCTAGTGGTCAAGAGGGTCTTGGTGAATTAAGAGGTGAAGGGTTGCGTCCATGGTGACCACCTACCTAGGAATTAATTTTCTACGGGTTTCCTTGATATCCAAATTGTGTTAGGCAATATGTCTCCCGACAATAGTCGTGGTGTGCACAAGCTGGCTTGGACAATCACGATATAAAAAAGAACAATTCAAGCCAATCCAAAACTGCAACCTTCGGCAAAATACAAGCCAAAACAACTTGAAAAAACTTTAGAAGCAACAACTAAGAGACCAATAACAACAGCAAGATATAGCAGGAGGATTTTTGACAGCAACTTGAAGACGTATATTTAGGGAAAAATTGTCTACCAAACCAATGTGATCTCCTACAAATATGAGCTTTGTCCAGAACAGTTACCCAATGAAGAACCACCAAAAGTAAGGCCAAAACTGATGGGACTATTACTCCTCTGAGATTTTAGCATTCCTGCTTGGGGGGATAGCTTTAAATTGAATACCACAAATGTGAGAAAATTTGTTTGGAGCTTTAACTGGTGGATCAAACTGCTGATCAGGAGGTTGAGGCACAAAAGAGAAATTGGACGATTCAAATAAGTTTTATCGTGATATGATTTCATCACAAGGATGTGAACTGCCTTCCTCAAGATGATTGGATGATGAATCAAACCTTCTAACACTTGCGTCATCTTCCAAATCAGAATCTTCAATTAGGGGGCGATCTAAATTATTCGGGGGATTTGAAGATTGTTGAAGAACAAGCTTTTTGTTCTTTTTACTATGTTTAATTTTAAGCTTGTTCATTAACTATTACAATCTATACAACAACATAGTTTGACGATGTTTACATATTGATATGAAGGTTGAGAAACTAATGATGACGATGAATAATTTTCAACGCTTTTTAAGACTTAAGTTGACACTGGTTCGAATCAAAGGATTTTTTGACCAACTCTATTCTCCTCCCAACCTCTTTCCAGGACATACATAATTTCTTACCAAGAGGAACTGGTAATTAGTGATTGCTGTGTTGGAGTGAGGAAAATTGAAATTGGGATTTGGGCATTTTGAGAAACCTTTCTTCCAATATGTGCCTT
GGAATTTGGGGGTAGTTTAAGTTTCCGTTGATTTTATGGCTTCTTTTTGGTGTTCTTTATGTGGATTCTTTTGTAACTATTCCCCCACCTGAATTTATTGATAGTGATGTTTCATCTGTGTAACAATGTACTGATAATTGGAAAACCATGATGAACAATGGACTAATGAGAATATATGTGACACTACAGGCCACAAGCTTGGAAGCAATCATCTCCCTTACGAAGAACAATGCACTAACAATAGAAACATTGGTATTTTCTGCATTGCTTGTTCTTCCCTTCCTCTTTGTTCATGTTTCCCTTCTAATCTTGGCAGTTTTAATGCATGCCTTAATTCTTATTTACATGATTTAGGTTAGAGAGATGGGAATGTATTTGACAATTACTGATAACATTATTCGAGGCAGAGGTACTGGCTACACAAACTTCTAGTTTTGTTTTATTATTCTTGCCATACAAATTGATCTTATCTGTACTGTCTTCACAGTTTTCTTGCTTTAACTTTTGAACAAGTATCTAAGAAATGTGCTCATTCAAAACTTATTTCACCTTGTTAACTGAAATAGATAAGTTCAATATCTAGAATTTCATACAAATAATACAATTGTTTTTTTTTGGAGATGATCAGGATTTGATGTTCTTCCTAATAGAATTCTGCAAGATAACAGAAATTCTAGGTTTAGGTATTATTATTATTATTTAATGGCAGTAAGATATTTCATTGTTAAGATAAAGTATACAAAAGAATGGGAAATGCCTGGAAACTAAGTTACATAAAAATTTCCTAATTAGCCAAATAGAAGCAGCCATAAACGGGAGAATCTTGGCCAATTTGAAAGATCTTTTGATGTAGTTAAAATAAGGGAAGTTTAGGTTGTTCGATCAAGTAAGGGAAACTCGATGGAAAAACTAAGACCATTCTAAACAATGAGGAGGGGAGAAAAAGGCAAACAAAAGCCTCAGCATAAGAAAGAGGGATAAAAAAGAACATAAACTAAGATAAAAAAAAACAAATATGTCCTTTATATCTTTGAAGTGGTAGGAGATAGTGCCCAATAAACACAACAACTATCTAAAATAGAGATCCATAGGAATCAAATCCAAAGGGATTTGTATGGGAACATGATACGATCTATAACTATATAAGTATATTGCTACCCATTATGAATAGAACTTACAAATTGATATTGAATTAACTCTAGTGATTGAGGGGGGGGGGGGGGGCTTATGATAGAGCAAGACTTTATCGTTTTGTATTAGAGCTGTTCGATCCATAGAGATTGATGTAGGTATTTTTTTTTTGAAACAAGGGTGTCCACGCCCGTCCTTAGGCCAGGCACCGGAGACATCGAAGGAGTAATGTCACAAATGAGTCTCGAACCTAGGACCTTGAGGGGAGCATACCCTCAAAGTCCAAGTCTTCAACTACTGCGCCACCCTTTGAGGACTATTGATGCAGGTATTATTGAAGCTAGTGCTTATATAGGCGTTTAGCCCAACTAACTCTACTAACCACTAAGTAACTATGATTAACTAACTCACTAAAGGGGTAAAGGGTCTTAAAGCATGTTTTGGGATGCACCTTTTCAAATCCCATAGAAATATGATAAGTAGCTTGAAAAAATCCCCTTTGAAAGAAAAAGAGATCTCACTTTGTTGATCATAATCTTCTTGAAGAGTTTGATTTTGGTTTTGTATTGGTCAAATGTCTTGGAGGTTTAGGAGCTCACTAGTCTGTTAGTATTTTTCCTAGATGCGCTCGTAGACTTCTTGCTCGTAAGCACTGGTATGCAATGATACATAAGCCATTTTTTAGTTATCCAAGGAAAAATAAGAGGTAAAAGTAAAGCATGGTCAATATCCACTTTTTCCACTTCTATATCCACCAAAAATTAAATTCCATTGATTTCTTTGTATTCAAGAAACCGAGGCTCAACCAATCTAGTATTTGGTTCTTAAATTCTCAAAAGAATGAGATATTGATTTAGCGGTAAGCTTTT
AGATGCTAGTTCAAAAAATCCCCAAGATTAGGTATATAATCTTATGATAGAGCAAGACTTTATCGTTTTGTATTAGAGCTGTTCGATCCATAGAGATTGATGTAGGTATTTTTTTTTTGAAACAAGGGTGTCCACGCCCGTCCTTAGGCCAGGCACCGGAGACATCGAAGGAGTAATGTCACAAATGAGTCTCGAACCTAGGACCTTGAGGGGAGCATACCCTCAAAGTCCAAGTCTTCAACTACTGCGCCACCCTTTGAGGACTATTGATGCAGGTATTATTGAAGCTAGTGCTTATATAGGCGTTTAGCCCAACTAACTCTACTAACCACTAAGTAACTATGATTAACTAACTCACTAAAGGGGTAAAGGGTCTTAAAGCATGTTTTGGGATGCACCTTTTCAAATCCCATAGAAATATGATAAGTAGCTTGAAAAAATCCCCTTTGAAAGAAAAAGAGATCTCACTTTGTTGATCATAATCTTCTTGAAGAGTTTGATTTTGGTTTTGTATTGGTCAAATGTCTTGGAGGTTTAGGAGCTCACTAGTCTGTTAGTATTTTTCCTAGATGCGCTCGTAGACTTCTTGCTCGTAAGCACTGGTATGCAATGATACATAAGCCATTTTTTAGTTATCCAAGGAAAAATAAGAGGTAAAAGTAAAGCATGGTCAATATCCACTTTTTCCACTTCTATATCCACCAAAAATTAAATTCCATTGATTTCTTTGTATTCAAGAAACCGAGGCTCAACCAATCTAGTATTTGGTTCTTAAATTCTCAAAAGAATGAGATATTGATTTAGCGGTAAGCTTTTAGATGCTAGTTCAAAAAATCCCCAAGATTAGGTATATAATCTCATGGATGAAGAGTTAATGCCCTGTTTGACCTTGATGAATGAAAAATGGAGCTATTGCAAAAAGAGGAGCAATCACCTACTACAGAGTTTGTGCTGCCTGTGATAATAGGCTCTCAGTGTCCAAAAATAGATGGAGAGGGAAGTATCTTCGGGTTAGGAAGAAAAATATGTTGAAACAAAATATGAATTTCCCTATCTTATATGAGTGTGTGATATTTTACATTGCGCAAGTTTGTTCAGTGTCTCTACAAACAAGAAAAGCCTTGTTTGGTTGGAAAGGTTTCACCAAAGAAGAAGAGAATACTCTTTCTTGAGGGCTAGTTTAATGTCAAACTATGAATCATGGAGATGTTCCTTGTGGATGATGACGACAAAAGAAAGATTGGCAAAATCAATCAAGGAGGGGAGAGGGAGGGTTCTTTTAATACCATTCTAGACTAAACTCAAGCTTGGTGGATTTGCAACCTTTTTCCAGGCCTCTCTGTATGATGCAGTGGCTTAATTATAAGAAAGAACTCTAGTATTGGTGGATAAACCTTGTTTTTGCTTTTTATGGGAGAAGACTTTGTGATGAGTCCTATTGTACAATGTATTAGCCCATTCTGCCCTTGTTTTCCACACCAATGGTAAAATAATCTTATTTAATCCATCGTTGGGATCCAATTTTGCAATCTTTGCAAAATGCCTTTTTTATTGATTAACTTTTCCATTAATACAATCGTCTCTTCTAGCTTTTAGTTGAAAGGCAGATCACAGTGTCAAGCGAAGACAAAAAATGTTTGTCTAAGCTTCTCTTCGACAAGGATTTAGTGAGAGGAAAGATGTTCGGGAGTTAGTAAATATTTTATTCTCCTGTCAACCTTAAAAGAGACATGAGAGAGGACAATCTTGTTGTATGTCTAAAAGAAAAAGAGATGTCAAAGCAGTAATCTAGGGGGATGGGGTGAGGTGAGATCTAATGTTGATGGGATGTAAAGGGAAGAGGAGAGTGGAGAGTGGATAGCAGGAGGGAGGAGAGAAGAGATAGGAGATAGGAGATAGGAGATAGGAGAGAGGAGAGAGAGTGGAGTTGGTGGGAGTCCTTCATGCACATAACACAGCTTGGCGAGAGCAAGTGGGGGAATTCATTTCACTATGTGATC
ATTAGTGTTGCTTCTATCATGGTTAAGTCCCAATGTAATAAAATAATCATCTTAGGACTTTTATCCATCCAAATTTTGTTCTACAACATGTGTTTTTTGAGATTATGTGATATGAGAGTTCCTTGAACATGAACCTTATTGAATGTTGGTGGTTGGTGGGATCTAAAGGCCAAATCAAATCATTATCCAAATTTGATGAAACATGGTTGGTTGCTTCTGTTATCAATATGATTCAAAACAGTGTATTCTATTATTGAGAGGCAACCTCTTTATATAAGAGTTACAATGGGCCTAGCGGGCCTAACGGGCCTAGATCTATTACATAGTATTACACAGTACATTATACACAATCACACATTTAGACTCTAACACTCTCCCTCAAGCTGGAGCAAATATATCTATCATGCCCAGCTTGTTACAAAGATAGTCTATCCGAGCTCCATTCAATGCTTTGGTGAAGATATCACCCAACTGTTCTCCAGTCTTCACATAACTCGTCGAAACCAACCTTTGTTGTATCTTCTCTCGGATAAAGTGGCAGTCTATTTCAATATGTTTAGTTCTCTCGTGGAATACCGGATTAGAGGCAATATGAATTGCTGCTTGATTGTCACACCATAGTTTAGCGGTGTGTTGCGGTTTTGATTCCAACTTCATTCAAATAGATTGAACCGGCCATACTATCTCACACGTTCCTTGGGACATAGCTCGTATTCGACTCAAAGACTAGAACGCGACACCACATTTTGCTTCTTACTCTTTCATGAAACCAAATTACCTCCCATAAACACATAGTATCCCGAGGTTGATCTCCTGTCTTCTTTAGAACCTGCCCAGTCAGCATCTGAAAAACATTCAATATCGGAATGTCCATGGTTTCGGTACAAGACGCCCCTCTCGAGAGCAAACTTAAGATAACTCGAGGATCTGTTCCACACTCTCCAATGTTCAACTGTAGGAGATGACATAAATCGACTCACGATACTCACGAGAATACAAAATATCCGCCTCGCGTGATATCAAGTAGTTGACTTCCCAACTAACTTCCTGTACCTTTCTGGATCAGAAAAAGCTTCACCGTCCTTGACAAACTGTTGGTTTGGCACCATCGGGGTACTACACGGTTTCACACCTAACTTTCCTACTTAAGTCAGTAGGTCGATAACATACTTTTTTGGGACAAAAGATCCCTTTCTTACTTCTCATGAACTCAATACCCAAGAAATATTTCAAATTTCCAAGATCTTTAGTCTTTGAAATGGCTATGAAGGAAAACCTTTAGTGAGGCCATCTCCGAACTATCACTCCTCCATGTAATCACAATATCGCCGACATATACCACAAGTAACACAATCCCATGGTCGACCGGCGGAAGAAGAGAATGATCTCGAAGAGCATTTTCGCATTCCAAATCAAGCAAGAACTTGGTGAACCTACCAAACCACGCTCGCGGGCTTTGTTTCAAGCCATACGTAGATTTCTGAAGACGACACACTTTGGCATTCTCCCGAGCAACAAACCGGTGGTTGCTCCATATAGACCTCCTCAAGAAGATCACCATGCAAAAATGCATTCTTAATATCCAACTGAAATAATGGCCAATCATTCATTGCTGCGAGAGAAATAAACAGGCGAACGGACGTCATCTTGGCAACAGGAGAGAAGGTCTCACTGTAATCAATCCCATATGTCTGAGCATACCCCTTAGCAACCAAGCGTGCTTTGAGATGAGCCACGGACCCGTCGTAATTCATTTTGACAAAGAAACACCCACTTACAACCAATCGTCTGCTTTCCTTACGGACGAGCTACCGGTTCCCAAGTACCATTCGCATCCAGCGACCACCCTCTCTACCATCGCTGGTGCCAACTCGAATGGGACAAGGCTTCACGAGTACTTTTAGGGACAGAGACAGAATCAAGGGAAGTCAGAAAGGAAAACGAAGTAGGGGATAATTGATGGTATGACACATATGTGGGGATAGGATACGTGCAAGACC
GTTTACCTTTGCGAAGAGCAATGGGCAAGTCTAGATCATTACTAGCTTGCAAGTCAGATGACGAAGAATCTGCCAGCATGGGACAGGCTGCCGAGGGTCGCGGTCGAGGACATGTACTCGGGCAAAAACAACGAGGACACGACCTCATAAACCATCGGACGAGGACACGACCTCATAAACCAACAAATCATCCTCCACAGCTGGGGGACTAGACACAGGATCAAACGGAGTATCCTAAAAAAAGGTAACATCAGAAGAGACGAAATAACGACCCGTACTCGGGCAAAAACAACGATAACCCTTCTGGACGCGAGAATACCCGAGGAAGATACATTTCAAAGACTTAGGATCCAACTTGGTCCTTTGAGGACGAACATCGCGAACAAAGGCCGTACACTCGAAAACCTTCGGGGGAAGGTGGAACATCTTCGTGGAGGGATACAAAACCTGGAATGGAATTTGATTACCCAAAATCGACGACGACATTCGATTAATGAGAAAACAGGCTGTAGACACAGCGTCAACCCAAAAATGTTTTGGAACATTCATCGAAATGACGTGGCACGAAAATGTTTTCCAAAGAAATGCTCGTTTTCTACGCTCGCCACACCGCTCGGGATGGTGTGTCCGCACACGACGATTGATGAATAATACCCTTCTCACTAAGATAGGAACGAAGGGTGCCCGAAAAATATTTGCCCCCATTATCAGTGCGTAAAATTTGGATGGAAGCTTTAAATTGATTACGAATCTCGGCAAGATTACAGTACAAAAGTGAGAAAACAATTCAAAGACGATTTTTCATTAAATAGACCCAAGTAAATCGGGAAAAGTCATCAACAAAGCGTGACAAAATAGCGAAATCCCGTTTTAGACATAATCTGGACTCGGACCCCAAACATCGAGTGGACCAATTCAAAAGGAGCAATTTGCTCGTTTATTAACCCTAGGACTAGAACTCAATCGGTGAAACTTTGCAAACCGACATGACTCGCAATCGAGGAAGACATCAGATCGAATTCGGGAAATAACTTCTTCAAAATAGACAAACAGGATGACCCATCCGACAATGGACTTCAAAAAGAGACGCAGAACCCGCGACATGCCGGCTGGGTGGTCTGCTGATCCAGATATAGAGACCCCATTTGCATGCCCCCTACCAATAATCTTCTTCGTCAAATGATCCTGAAACAAGCAATAGTCAGGAAAGAAGGAAACAAAGCAATGTAAGTCACGAGCCAATTGACTAACCGAAATCAAAGTTATATGAAAACGAGGCGGACACAACACGGAAGATAGCAACAAGGTAGGGTAAGAGTAATAGACCCAGCTCCCTTTACGAGAGACAAGGATCCATCATTTCTAAGGTAACCAGTAGGAAAAGGAGCAGATGACAATGATGTAGAGAACAACTGTTCGTTACATCATATGAAAGCGATCGCCGAGTCAATGACCCACTTTGTGGACAACGAAAGAAGACAATGGTTGTTATTTGTTTCACCGGGGTTGCAAAGAGCGGAAGTGCCGAGGATGCTTTGGCCGTGATGGAGCTGTGGGAACTCTGGAGCGCAAACACACTCGTCAACGTGTACTTTCTCACTTTGGACACAAGGTGGGCCTTCAAGCCTCCGATTGTCATATTGCAACTTTCGCAATCTCTCTTTAAATGACCGGGTTTACGGCAAGAATGGCGAGTCATTCGAGTATCCGGACGTCGGTTGTCACTAGAATATATCCGACGTCGGTCATTCCCGCACTTCGATGCGAGAAAAACTAACGTTCCTCCCACCACGAACCACACTCCTCCCAAGCAAGGCGCTATTCGATTGTTCAATTAAATGAAACATTGCGTTGCAAAGCGGTGAAACACATCATCCAAAGATGGTAGTTCGATTCTGACGAGGATTTGAGACCTCACATTCTCATACTCCGAACCAAGCCCACGCAAGAAAGCGATAACCCTCATCTTTTCCCCTCGGGTCTGGCTGAATCGCGATATA
CGCCTTTGACCGAACGGCATGAGAGAATTTAACTCGGCACTATTACGCTTGTGCCTCGTGTAATAGGTCAACAATGGTTCACCATTCGCCGGTGGCGTGGAAATAGGACGAGCAAACATCATACGTCCTTGTTACACCGCTCCTTCTCCGAGTACAGAATTCCACATACGTCATGAGTTCTTTCACCGTAGTGCAATGACTAATCGGATCGACAATCTCATCATTCATTGAGTTAGTAATCGATTATACAAGCGAGCATCAACGCGATCCCACTTCCTTTTTCAACTTCGTCTTTAGGCGGATCACTATCCATGTGTTCTTCTAGCTCATTACTCGCAGGTGGTTTTGAATCGTTCGATGCCGATCGAGATAATTTGTCCCATTGGGTTTGCGTTTTGTGATCTCCATCATATGAGAAATCACACTTGTATGCTTGGTATCACTCATATTTCCCGATTAGTGTCAGCGGACTGATGAGCAGATAACTCAATTATTTCACAAATCAAAGCGAAATAGAATCTGGAAATTGGAAAAACAACACTGTGAGAACGGAAAATGGCACACCCGAGCACCCCCGGTGCGTAAAATGCGAAACGAACCTCACACATGTGTAGAACACGCCGCGCGCACAAAACCCGAGAAGACACGACTTGGTGGAGCCGCCACGCGCGCCCGCGCGCGACGGAAGCCGGGCGGTTGGCGGAAGCCGCTGCGCGGGGTGCGGCTCGTCGTGCTTTCGGACGGAGTCGCGACTTTAACGATCGGATAGATCGGCGACGGGCTTTCGGATGGTACCGGCGGTGACTCGCGGCAGCGGGCGACGCGCGGAGCTGTGAGGGCGGGCGGGAATTTTCTCCTCCTTTTTTCCCAAACCCTATTTCTGTCCTGATGGCTCTGATACCATGTTATCAATATGATTCAAAACAGTGTATTCTATTATTGAGAGACAACCTCTTTATATAAGAGTTACAATGGGCCTAGTGGGCCTAGATCTATTACACAGTATTACACAGTACATTATACACAATCACACATTTAGACTCTAACAGCTTCCATTAGAGAGAGCACATTTAGTTATTTATTTCCTCATCCTTAAGAAGTTTACAAAATCTATAGTCATTGACCAATTTATGTAGCATTGTGTACTCGAGGTTGTATTGGAGGACGAAAGAAGTGCAAAGTCTAAGCTCTATGATAGTTGAGAGTTTGCCACCCATTTTCATGCCAAAGCTGGTGCTGGCTTTACCTTTTTGATAGTAAACAAAAAAAAATTCATTGATGAGACTGTATTAAAAAAGGGGAGCTAATCTGATTCTAATATACCGATGTACATACGATAGAATCAAGAAATGAGTTGATGGTTTGTTGTTTGTCCAAGTAAATTCCATTGTTGCGTTCCAACCATAGTTTCTAAAAAACGTGTATGTGAAGTTTCGCCCCAAGATCTCATTCTCTCATTTAAAAGGGTGACTGGTGTGTGTAGTTGAGGAGAGTGTGTATTTCTTTCGGTCTATCCATGCTCCAGTCGAAATCTTCATATATAAAATTTTAAAAAATACTAGGAAATGAGCAAATGTTGAAAAGTTTAGAACGTGTTTCTTGAGCTTGATAGCTTAAGTGACAACAATTTGGAGATAGAGAAATCCATGGTGAATTCCTTTTGTAAATGTTGTTAGAATTATGGATATATGTATTTGTATTTATGCTTCCATAAGTTTAGACCCAGCAGGCCCGGTAGGCCCATTAGGGTTACCTTGTACTTTATAAATAGGCATCGCCTCATCTAATGATATGTACAACTATTTTTCGGTCCAAACTCTCGAATAACATATGTCATGTTGATCTCATAATTTAACTTTTCTTGGTTGTGTACCTTTCCATAAATTTGTGTAGAACTCACCATCATTTTGGATAGAAGTGGAGGTCAATTTTCTGGTCAATGAACCTATTGCCAAATATCCTCACAATCGGCTCCAATTAATCGTGGATAATTGATTAGA
GAGAGAAGCCCATTTTTCAATTTCCACTTCTTTTAGGTTTGGTCTCAAGAAGAGGTTTCATGCTTGAGTGTGTACATGCCAATAATGAACAATGAGTACTTTCTCCTTCAATATGAGAAGGAATAGTCAACAATGAGACCCGTCTTGTTCAATGTGAGATAGTATAATCAAGAAAGGGTCCTATCTAGGTATTCGTTATTCTATATTTTATTGCCTTGACTCTCCTCTAGGCCGTAAAACATTTTAGTTAGCTAAACCTAAATGCAAAAGAGCAATTACAAAATTTATACTTTGACACATAACCTATTGGCCTTACCCACCCTCAATTTTTTGGTCCATTAGAAATCCTCTTATTTCTTTCTTTCCCCAACTTTGATAAATGTACTTAGTAACAACAAATCAAATGCATTTTTTCCACAACTTCGGTAAAATAACCAGTATAATTATTATCTTGATAGTCCTAGCTTTATTCCTAAGATTTATATCAAAGACTGGTTGTGGTAAATTTAATTATATCAATCAACATTCTCCATTGAACAACTTGACAAGAGTTTGAACGTATGACCTCTTGGTTTGATACCATTAAGCAGTTCTTACTTGCCTTGTTGATCTATTCTTCCACAACTTACAACCATATATACAATCTAATTCTAGGATCAACAACTTACGGGAATTTCTAGCTAATATTTTTCAACTAATTATGCTAAAAATAAACCTAAACTAAATTGGGTAAAATCAGACTAATTACACATTGACTCTTGACAATTTTTATTTTGAAAACTGGAAAATTGGAGCTTTGCTCCACTATTCTAGAGTAACCACGCCCGTCCCTAGGCCATGCATTGACTCTAACAATTACAAAGTGACTTTCAACATCTGGCTTAAAACTTAAGTTATTTTCAGAGGTATTGTTTTCTTGATGTTGATGGCTTAAAAAGTCATCTTCAAATTTCCCTTGACAGCAATGAGCGAAGAAGTTCTGGATATTGTCATTGGTGCTTGACACTAGTTGTTGATTGATAGGAATGTTTAAGGGGTATATTAGTAATTAGTATAATAAACCAATTTTATTAAGAGTAGGATAATGATATCCGATTATAAATAAGAGATGGTGGGGAAAGCAGAAAGCACATCGAAGACATAGATTGAGTGATTGACTGGACTTGAGTGATCTCAAGATAGAAAGGGTCCAAGAATCTTTAACACTTGAGAGTTACTTTGTTATTTTTTTTTTCATATCAACACATATTTTGGTTTCTATCAATTTGGCATAGAGTGGTTCCTATCATTGATTAAGTAACATGTCCCTACCTTTGGTTTGTTCGAAGGATGTTTTTTGACCATGTCACTATGCATTGCTGTTCGCCTTAGCCCTTCAAGATATGTGTTGTGTACATGCCATATTTTGTGGTTCATATGCACGGTAGATAGTTGGAAATTTGACTTGTTGCTTTGCATTACAAATTTATTCTCTGTATTGTTGAAAAGAAACAAAAAGCTTTCTATTCTTTTCAGGTATACTTCTTCTTGGGGAACTACTTGCATGTCTTGCATCGAAGCCTCTAGATGATGCAACAATACACAGTCTAATGACATTCTTCACTGAGAGACTGGTGAGTGGGCAGTTTTATTTTCTTTCTCTTGATAACAATCGCCTATGCAAAGAGTAATTAAATTGTTTGATTGTTTGATTTAGTACTTCCTTATGAGCGATGGTTACTAGGCTTCTTTGTAATTATAACCTTGGTCTTGTTCTTTTGCGTTGGAGTCTATTTTTGTTAGTTTCAGGACTCTCACTTCTTTCATTTTTTTTGTATGCCATTGTTTTATTTCATTTGTTCAATAAATGTTTGATCTTACCAAAGAAACCAAAGATGTATAGATAGAATTAGTGAGAAAACGTGTAGAGAGAGAATTTAATGAGGCGAAGTGTGTGCAAAGGGTAATTAAAAAAGAAGGAAAAAGGTGTGTGCCATGAAAGGTTACAATTGAGGTTGTTA
AGGCGAACAAAGATTTCTTTACAAGCATTTATATGTTGTTTACTTGTTAGATAAGTGATTCTTTTTTAGGGCAATTGGTGCGGGGTTGGTAAGAAAAATGCTTCTAAATCGTACCTAAACACATGCCTGAGGGAGGAAGTAATTTTAATCATTTGAAAACACTTCTGCTATGACTAAAAACATTCCAAAATAGTGGTATCGTAGATTCTTATTTACATACCCGCCATCATGAAAGTTTCTCTATTCAATGTGATAGATTACTTGATACTTATTATCAGGCAGATTGGAAAGCTTTACGAGGCGCCCTTGTTGGCTGCTTGGCACTGATGAGGAGGAAAACAAACGTTGGTACAGTTTCTCAGAATGATGCAAAGTCTGTTGCCCAGTCATATTTTCAAAATCTTCAAGTTCAGTCTTTGGGACAACATGATCGGAAGGTGAGTTAACAACTATAGGAAAGTTATGCAGATACCCAATGGCAAATGCTTTCAAATTTTAGTAACATTTTTGTTTGACTTTGATTAATTTATTATGACAGCTCAGTTTTGAACTTTTGGCGTGTTTGTTGGAACATTACCCTAATGCAGTTGTTTCACTGGTACTATCTTTTGCTTGTTTTCTTCCAGCCTTTTGATCATTGTCTTTTTTCTGTTAGGACTCCTGAAAACCACAAGTGATTGACCCATTTGGTCACCCACTTCTTTTCTTTTGGGCTACAACACATCATAACGTCTAAATGACATTTCTCATGCTACACTGCTATGCATAGATTTTGTGCGTAACTTGACCTTCCGGTTGGAGTTAGTATCAATGAATTTCTCACTATTATCCAGGGTGATGATCTTGTATATGGAATCTGTGAAGCCATTGATGGTGAAAAAGATCCACACTGCTTAATGCTTACTTTTCACATCGTTGAGCTTGTGGCAAAGCTATTCCCAGATCCAACTGGAACACTTGCAAATAGTTCTAGCGATCTTTTTGAATTCCTGGGTTGCTATTTTCCTATCCACTTCACACATGTAAGCTTAATGCATGCCTCTCTTATTTTTATTTTTTCATTTTCTTTCTTGGTTGCTTGACTTTTCTCCACTTGCATATGATTAGTTATTAGTAATAATTGTAGACTTCTTTCTCATTTTGGGAAGGGAAAATGATGAACCATTGTATAATACATAAGAAATTGTTTGTTTTCGATGTGGAATACTATAGAATACATAAGAAATTGAGGTGGGTAGAAGGGAGCCATTGGTTTATGAACCTTATTGAAACCAAAATAGCTAAATTGGTTGAGGATGGAATAAGTGATTTGAACATATTGCATATTTTTGTATTTTTCTTTCCTTTTTTCCTCCCTCGGGAGTTGTATTTGAAACTTCTAAGTTTTGAAAGAGTTTTTTTTAACCTGCTACAAAATTTTTCATTAAAGAAATGAAAAGAGCCTAATGCTCTAAAGATACAAACTCCATGGGTGAAACAATAAACAACAAAATTTAAACAGGAAAAATAAAAGTATCCTAATTAAAATATATCACATATAAAAGATCAGTGAAAGGCTTTCAAAGAAAATGCCTAGTGGGACCTTTAGTTAGTAGACTCAAAGTGATCAATAATTTCACCCTCAAATACTTTATGGAAGATTTCCTGAAACAAGAAACATACCTTTCATTAAGAAGATGAAAAAGTCTAATGTTCAATGTCAATGATACTAACTCCCATAAGAGTGGGAGAGAAAACAAAAGAATTCATAAAGGAAATAAAAACGTAGAACTAGAAAATGGAATTGAGATTGAGATTGAATAGCGTCGTAATTTTATTCATTATGACTATGTCTATAATTATATGTATACAAATGTAAAATTGTAAAACCTAATCTCTAAAGTGGGCCTAAAAGTCCCATTATAATAACTATATGAATTAACATAAATACATAAATATAAATAACTCTAACACTCTCCCTCAAGTTGGAGCGAATATGTCTATCATGCCCAACTTGTTGA
AGAGATCGTTTATCCTTGCTCTATTTAGTGCCTTGGTAACAATATCTCCCAATTATTCTTCAATGTTTACGTAGCTCATAGAAATCAATATTTATTGTGTATCCTCTCACGAATAAAGTGACAGTCACTTTTATATGCTTGGTACTCTCATGAAACAATGCTATGTAAATAACTGCTTTATTATCACATCGTAGTGTTGCCGGTACAGTAGTTTCCACTCCAATTTCATTCAGCAACTGAAGTAACCATACTATCTCACATGTTGATTAAGCCATTCTGACTTTGCACTCGAATGGAAAACTTTGTTATGTTTCTTGCTCTTCCACGACACTAGGTTGCCTCCTATGAAGATACAATATCTAGAAGTAGACCTTCTGTCTTCCTTAGGCCCTGCCCAGTCAATATCTGAGTAACAATCATTATTCATGTGATTGTGACTCTCATACAATATTCCTTGACCAGGTGTAGATTTCAAATAACTCAAGATCTGTTCCACGACCACCCAATGATCCGCTGTAGGAGATAACATAGATTGACTCACCACACTTGGTGGGTATCTATATCAGTTCTCGTAACTTTCAAGTAATTTAATTTCCAAACCATCCTTCTAAATCTCTTAGGATTTTAAAATGATTCTGCATCTTTGACAAGATGTAGATGGGTACCATCGGGGTACCACATGGTTTCACTCCTAGCTTCCCTGCTTCGATAGCAAGTTTAGAACATACTTGCGTTGGGACATGAAGATGCCTTTCTTGCTTTTCATCAGTTACCTCAATGCCTAAGAAATATTTGAGAATTTCCAGATCTTTAGTATGAAACTAGTTGTGGAGAAATTCTTTCAACGATGTCATGCATGTTGTGTCATCTCTAGTTATTATGATATCGTCAACATACACTACCAACAAAGTAATGCCATTCTTTGACCATTTATTAAACACAGAATGGTCGGACGAGCTCTTCATTCCAAACTGCTCAAGGGCTTGATTGAATTTTCCAAACTATGCTGATGGACTCTATTTCAAACCATACAATGATTTGCAAAACTTACATAATATGCAGTTATTGACTGCCCCTAGAGCTATGAAAATGAAAAAAAAAAAAAATTTCCAAAAAAATGAAAAATGTTTTTGTTTCTTATAAATAACAAAGAAAAGAAATGAGTTTGGTAGCGTATCTTATGTGTGTGTATATATATTTATGTCCATAGTTTGAAAAATGGGCTGTTTTCTCTCTTTCTTTTTGTACTCAACAACAATTGGGGATGAGAGATTTGAACCTGTAACCATTGGTTACTAGGTCATTATGCCACTAGAGCTATGATTTTGTTTGTGTTTTCTCTCTTTCTTGCTCATAAGTTAATAATGTTTGGCCTCGTGATATTTTCCAGGGAAAAGAGGAGGATGTAGATGTAAGAAGGAATGATCTTTCGCAGGCACTTATGGTATGGTAATGTCTATTGTGCAAGGACACCATTTTACTACCTCACAATGCAATTTGAGTTCATTGTTTCCAAAGATTTTGTACAGGAACAATTGAATTATAGTAATACTAATAGCATTCTAGAGGTTTGGGAATGCTGTTTTAGTGAATAAATGGGTGTTGGTAGATTTTAGTGACGCCAACATAAAACTTCAGAAAATTCCTTACATTTATAGTTATGTGGAATTAATTAGCTGCATATACCTGTTAATGTTTTGAATAAAATCTGAAGCAGCATTTAACATAGTCTGGGTAATGTGAAAAGCCTGGTTGATATAAAAAAAAAGATATCAGAAAGTCAGTCCTGATATCTTTGTTGTTAGTAACCATGGGTATTACTATTACTACCATTATTTTGAATTAAACATAGGCCTTGAAGTTTGTTCTTATACGTTAAGTCAATTGCATGAAACATATCCACAAAAAATACATGACAAATTTGTTATATGTTAAAAGTCTGTTGGAAGATCATGAATCTCGAGATTTTTATTTTTTATCATCCATCTCGACTATTCTCA
TAGAAGATGATGAATCTCTCTAGTTTTTTGATATTTATGAATGAAACTCTAATTTTGAAACGTATTATGAACGAAAAGTAGTTTAAAAGCCACAGTAAAGCCTCGTGGGCTCTGTGTGGGTTGGGTAGATTACATCTTCATAGACATGCCTATTTAACCTGACTTGTTGAGATAATGCCCACTATCGAGATTGTTATGCCTTCGTGCAGTTAGAACTTACCAACAAAAAGTTTTTCACAGTAAAGCATTGTAGGTTTTACTGTATGGGTTGGGTAAATCAAATCTTGATAGACATGTCTATTTAACCAAATTTCTCTTTTTGAGAAGTGTTCACCATCCAGATCATTATGTCTTTCATTTTGAATTCAACAAAACTTGTGGGTGGAGTTTCAAACCTGTAACCTCTTGGCTACTTGGTCGTACATGATACCATTGAAGCTCTGCTCCTTTTGGCCAGATCATTATGACTTTCATTTGGTTAGAAGTTAGAACTTACAGACAAAAAGTAAAGCTAAGTAAGTTGCAAAATGGAGGGAAATGATGAGTCCAAAATAAAGCTAAGTTGCAGAATCTATATACACCTCAATAGATTGAGTCTTGTGTCATGTCCTTGAAGATGTGGTTGTTCCTTTCCATCCATATTTTTCAAACAAAGCCATGTAGAAACTTTTCCCAAAAACCTTCTTCTCTTTTTGTAAGGGATGGCCGGCAAGTTGCTTTCTGGATAATTTTGGAAATTCAAATTTTTTCCTGCATCTCCCGAACAAAAAAGAAGAGTATTGTTCCATTTTTCTCTAGTAGTATACCAATATATTGATATTTGTACTAACAGTTTTGACCTCCTAATTCTTGGTTTTGTTGCTTCCTGCTTAAGAATTATGATGTTCATCTATTGGCTATTGCGTTGCGATCATAACTCTCCTTACAGCCATGATGATGCTGAATGTGTTTATGTTTTTCTCATTTTGCACAAGGCTTCAGATTTAAAAAAAAAAGATTTTAGGTCATTTATTGGTGCTGTGGACTGGTATTAATTACATTAGCCTCATCTATAATCCATCATTAATTATTTTCTACATTGCATGGTGAATTTTACACCACATTAATAATCTTTAAGTGCACATTATTTGGCTTAATTGCACTTTTTTTTAACAAGAAACAAATTGATTGCACTTGTAATATTACTGTGCAGATTGCCTTCTCTTCTACCCCCCTCTTTGAGCCATTTGCAGTTCCTTTGCTTCTCGAGAAACTTTCTTCTTCATTGCCACTAGCAAAGGTCTGTTTAATGTTCTCTATTTACTTGATTTAGTAATTACCTATTTTGCAGGACACTTCTTTTGATTGCCCATAGCTGTTGAAAAATAACTTAAATGATTGCAGATTGATTCTTTGAAGTACCTAAGTGATTGCACCGTAAAATATGGGGCAGATAGAATGGAAAAGCATAGTGCATCTATCTGGTCTTCAGTAAAGGAGATCTTATTTACATCAATAGGACAGCCTTCTTTGTCCATTAACTTAGAATCATTAAGTAGTCCTAGCTTTGAAGGGAATGAAATTACAACTGAAGCTCTACGACTCCTGCAGAAGATGGTCGTGGAGAGTAATGGATTATTTTTAAGATTAATTATCAACGATGAAGATATAAAGGATATTTTCAGCATCCTAAATATTTATACATGTTACAACGACTTCCCTTTGCATAGCAGGCAGAGACTAAATGCAGTTGGCCATATCCTTTACAAGTCAGCGAATGCATCTCTTGCTTCCTGTGGTCACGTGTTTGAAAGTTTCTTCCCTCGTTTGCTGGATATTGTCGGGATTTCTGCGGATCAGCCTCATAATAACAAAATTTCTCCAAAGAATTTTAATTTTGGGGCCCTCTATCTCTGTATTGAACTTCTTGTAGCTTGCAGAGATCTGATTGCAAGCTCTGATGAACACATATGCTTTGTTAAAGAAAAATTATACGGCATGCTTCAAACCTTTTCATGTTC
AATGGTTCATCTCCTCAATTCTATCTTTCCAGTAATTGTTAAGAAGGATCTGCATGATGCTGAGTTCTACTGTGCAGGTATCTTCTTTTGAGGTGTTCTTCTTGAATGATGACGTCTGCCTGAAAATTTGCTATTCTAAGCTTAGCTAATATGAAAGTCATTGTCTTTTCAATTTTCCTTCAATTTCCATTTATATCTCTCTCAAACTTTTTTTTTATTGATTTTTGTTTTTTGTCTCACTCAATATTGCTCATTGTCTGAGTGGTTTAATTTAAGGTTGATTAATTATCTTTCCAAATCTAAGATTAATTAATTAGAATGCTGTGACTGAAGTTTGACATCATGTGCAGTAAAGGGCTTGCAGAATCTGGCCACATTCCCTGTAGGCTCTTCACCAGTATCAAGAGTCATATTTGAGGATATTTTGCTGGGACTCATGTCATTTATAACAGCGGACTTCAAATTTGCATCGTTGTGGAATCATGCCTTGAATTCATTACAGCATATTGGTTCATTTGTTGACAAATATCCAGAGTCTCTGGAATTGCAAAGTTTCATGCATGTTGTTGTTGAAAAGATTGCATCAATGTTCTCTCTTCATGAAGAGACCTTGCCATTGTCGATTAAACTGGAACTGGCATTGAACATTGGTAGAACTGGACGGAGTTATATGCTGAAAATTGTTCAGGGGATTGAAGAGGCAACATTCTTCCATTTATCTGAGGTTTATGTACGTGTTAAAAGATCAATAATCTTTATTTTACGCTAGACGTTCCTCATTTGCATCATTCCATGTTCCCAGTTGAACAGATGTTTTTCTTACATTTATAGTTTTCCTTTAATACATGACCGATCCATCTTTTCTTGCCTTATTAATTTTTTATAGCTTTTGAGCCATTTCTTCCTAATAGGATATGTTTCACTTACAGAAAAAACACCTGACAGATGATAATTTCTTAATACTTTTTTGGCTGGAACACAATGCGGTCATACTCCTTTTCGGTTCCATAATATATGGCTGGAATACAAGGATTTCTTTCCAATGATGGAATGTTGGTGGCTCAAGACTCCCATGAGAGGATGGTTGGGTCATGGCATCATGCAGAAACTTTAAATTGATCGTTAAATCATGGAACAAGAGCACTTTTGGAGAATCATTTGACTTTGGAATTAGCCAGCACTTCGCAATCTGTACAGTCAGTACAGATTGCACAGAGAGCCTGTATTAAGGCCAAATGACTTGCTATAGCAGTTAAAGAAGAGTCCCTTTAGAGGCAAAGTTGTAAAAAAAGTGGTTGGAGTTTGGTGATGTAAACTCTAGTATCTTCCATTCCGTGGTATTATTGCTGCTAGAAGAAGAAATTTGTCCTCATTAATGACAAAGACATAGAAGTCGAGTTTATCCATTATTATAGCAAGATCTTTGCCAAACAAGGAAGCTTCCTGGACTGATCCTCACCTTCATGATTGGGATCCTAATTCCTATCAACAATGTGATAGAATCTACCAACTTTTATTGAAATAAATCTCTAGAGAGAATCTCCTTTAGAGCCAAAGAGTTCAAATGACTACAAGATAATTGAATACCAATAAATGGCCTAACTTATCCTATATATAGACTTATCCTATTTCCTAATCCTTAAAGGAACCCTTAATTAATAATAACTTATTTCCTAATCCTAAGAGAAACCCTAATTGATAATAAAGCCTAATAATAATAAATAGCCAACTATAATTAAAATAACAACTAATTAAATACTAATTATAATAAACTAAATCCGTCACTTCTTGCTCCGCCTCGAGTAGACCTTCAGCGGGCCCGGGATAGTATCAATACTTCCCCCCCAAAGAGCCACCTTGTCCTCAAGGTGGAAATTCGAAAACTGCAACTCCAAGTCCGTAGCAGATTCCCACGTAGCATCGTCCGGAGAAGAGCCCTCCCACTGGACCAAAACCTGTCGTGTGCCTCCATCATGCAATTTCTCACGTACTCCCAAAAC
CGCATGAGGTCGAACCACTACGCACAAATCGGTCCCCACCATGAACAGTGAGGACATGACGAGCACCGAAGATCCCACTGCTTTTCTCAGAACAGAAACATGGAACACCGGGTGAATCTTCACAGACGGAAGAAGCTCTAATCGATATGCCACTGGCCCTACCCGTGCTAGGCCACAATACGGCCCAATAAATCTAGGTGCTAGCTTTGGGTGTTTAAATTTTGCCAATGACAACTGGCAGTATGGTCTGAGCTTAATATACACCAAGTCATCCACAGAAAACTGAACATCTCGGCGTTTTGCGTTCGCTCGATCGGACATCGATTGTTGCGCAGGTGACAAATTAGCCTTTAGGGTCTCCAAAATTCGATCCCGTTCTAACATCAAGGAATCTACAGCAGCCACAGGGCTAGCCCCATAATCATATCCCAGAATCGATGGTGGAGCTCGTTCGTACACAATCTCGAAGGGTGTCATGCCCGTGGAGGAGTGAAATGACGTGTTAAAATTGAATTCGGCCCATGGCAACCACTGATACCACACCTTGGGTTGATGCATCACAAAACATCAGAGATACGATTCCAAACAGCGGTTCACCACCTCGGTTTGGCCATCCGTTTGGGGGTGATAAGTAGTGCTACGACAGAGCTTAGTCTCAGACGCCTTAAATATCTCTTCCCACAGCAAGCTAGTAAACACTGTCTAATGCTCTTTGGGATCCCATGCAAGCGAACCACCTCTTTTATAAAGACCTTTGACACCGACAAGGACGAGAAAGGGTGTCGAAGCGGAATGAAATTAGCATATTTCGAGAGCCGATCAACCACCACTAAGATGGTATCATAACCTTCCGAACTTGGTAGTCCCTCCACAAAATCCATCGAGATGTCTTCCCAAACTCGCCTCGGGATTGGTAACGTCTGTAATAGACCTGCTAGGGATAATGATAGATGCTTAGCCTGAACATAAACCGAACACTCGGCCACAAATGCTCGTACTCGAGCCTTCATTCCTGTCCAGTACACTTCCTTGGCAAGTCGTTGGTAAGTCTTGAGGACTCCAAAATGCCCCCCAATAGCTCCCCCATGGAACTCAAGTAGCAACATTGGGATCGTTGGGGATGTCGGGGTAGCACAAGTCTGTCCCGGTAGAGTAACACATCACCCACTATTGAATAACCAGGAGGACCCACATTTTCAGCTGTCAGTGTCGCATATACAGCCGATAATTTCTCATCTTTCTTAACCTGTTGAGTGAAAACTATTGTGTTTATCCCGACCACGAAGCTTAATATGCTGCACTCACATTACTGAGGCATTCGGGACAAAGCATCAACCGCTCGATTCTCGAGTCCTTTTTGTATTCAATACTAAAATCATAGCCCATCAGTTTAGCGATCCAACGCTGATACTTGCCGTCTACTACACGCTGTTCAAGTAAAAACTTGAGACTTTTCTGGTGTGTCCGCACAATAAAATGATGACCCAATAAGTAAGCCCGCCAACACTGGACAACGAACACAATTGCCATCAATTCTCGTTCATAGACAGGTTTCACACAGTGAGTTACCGGCAATGCCTTACTAAAATATGCCAAAGGTTGCCCATGTTGCATGAGCACAGTGCCCACACCGATTTTTGAGGCGTCAATTTCTACCACAAATGACTGGTCGAAATCTGGCAAGCGCAGAACTGGGACAGTGCTCATAGCATGTTTCATTCTCTGGAAGCAGTCTTCCGCATTCGGTCCCCAATCAAACTTGCCTTTCTTCAACAGTTGAGTTAGTGGGAACGCCATAGATCCATAGTTAGCTACAAAGCGCCGATAGCAACCTGTCAGGCCAAGGAAACCCGCAAGTCTTTAATAGTTCTAGGGCTGGGCCAACCAATCATTGCCTCTATCTTTGCTGGCTCAGCTGACACTCCCTCGCAGATATGAAATGGCCCAAGTATTCAATGCGGTGCAACCCAAACTGACATTTCTTAGCATTGGCCACAAAA
ACATGAGTTTGTAATATCTCGAAGACCCGAGCCAGATGCTCTTTATATTCTTGAATAGTCAGACTATAGATGAGAATATCATAAAAAAATACGAGTACAAACTTACGGAGGAACGGTCGCAGAATCTCATTCATCACAGATTGGATAGTGGCAGGGGCATTCCGCAGCCCGAACGGCATGACCACAAATTCATAGTGACCCTCGTGGGTCCTGAAAGCTGTTTTGTGCACGTCGGTCGGCTTCACACGGATCTGGTGGTAACCTGCCTTCAAATCAATTTTAGAGAACACCGTCGCTCTGTGCAATTCATCTAGGAGTTCATCCACAAGCGGTATGGGGAACTTGTCCGGGATTGTGGCTTGATTCAAAGCCCGATAGTCAACACAAAAGCGCTAGCTTCCATCCTTTTTCTTTACTAACAACACAAGGCTTGAAAAAAAACTCGTGCTAGGGCGAATAACCCCGGCCAACAACATTTCCTATACCAATCTCTCAATCTCGTTCTTTTGGTACTGAGGGTACTGATACGACCATACATTTATTGAATCGTTACCTGTCTTCAACTCAATTGAGTGGTCACAATTCCTTTGAGGAGGCAATTCTGTCAAGGGTTCAACTTAAAAAACGGTTGAATTAGATTCAATTAAAAAATGTAATTCACGAGGTACCTGAGTTAGATCCGGAAGAACATCCGTCTGGACCCTGTTGGTCACTGTGGCGTCGATCATGTTAAATTCTACTAAAAGCCCTTGGTCCTCTGGTCGTAGCCACTTCATCGTGGATTTCAGGGAAACCTACGCCTTAACCAGACTTGGATCTCCTTGCAGCTCAGCTTGCCAAGAACCCAACAAAAATTTCATCTATAACGACCGAAAATTCAACTCAATTTTCCCTAAAGTCTCTAGCCAAGCCACCCTCAAAATCACATATGCACTCCCTAACGGTAAAGGCAGGAAATCGTTAACAATTTTCAGGTTAGCTAAATGCAGTTCCACATTCTTACAAATCCCAGCTGTTCTTACTGAATCGCCAGTCCCCAACATAATGCCATAATCATTGGAAGGATGCACTGGTAGCTTCAACTTAGACACAATCACATCCGATATGAAATTATGTGTGGCGCCGTAGTCAATTAGTATGACTACCCCCAATCCTTGAATCTGTCCTGTGACTTTTAATGTCTTTGGTGAGCTCAACCCGACCATCGAATTTAATGACAGTGCTGCCAAATTCCGTGTACTGTCCGCGTCATCAATATTCTCGTTTGTGTGTACTGTCGTTTTCATAGGTGTCCCCAAGATCATTCACATCTCGTACTACTAGTATCTCTAGAGCCTGTAGCTCTTTCCTTTTGCACCTATGACCCAGAACAAATTTTTCATCACAACGAAAACACAATCCCTTATCTTTTCGGACATGGATTTCACTATCCGTCAGACGTTTGTACGGAGTTGTGGTAGCTGTACTTGTAGTAGTCGACCTATTTGGAGTGAAGACTATCGTTCTCAGATTTGAACTCCCTGTTCCCCTCGTACTTGTGAAACTGGTTCCAGTACTTGTTTTTACCCCTGATGATGACTACCTCCTTTCGCTTGACCTCGAACAGCAAGGTCATCTTCAATTACCTTGTGGCATCCATTTCTTTTCCTGAATCTCGACTGGTCGTAGTTTTCTCATCTCGCTTCAGATTTCTTCCTTCAAACCACTCTCCCATTTCCCTTCTAAAGCGCTTTCACTGATGTCGTGCATACCTTTTGCATATTTCTCGAACTGTTGTCTATACTCTTTAATAGTCCCGACCTGTTGCAAACTCTTAAGATTGGCATATTTGTTGTCATGGTTAGTCGGTTGGAAACGGTGCAGTAATAATTCTCGGAAATCGTCCCAAGACGCGATCGGTGCTTGATCCTCTTCGTACTGTAGCCATTCTAAAGCCTCCCTTCCATACACAATGTCGCGGCCTCAACCCGTTCTTGACCTACTAGTCGGTTTACCCAAAA
ATAGTGTTCTACTCGGCACAACCAACCATCCGGATCCTCATCTAAAAGCCCTCTGAAAACCCGTATCTCTAGTTTTCGGAGTCTTCTATTGAACCCCGACCCTTCTCTATTTCCTGTTTTCGGTCCCCATTGCACATAACGATCCTCTCCATCTCCCAATTCTTCATAATCCCTGTCTTGGTTGTTGTCGCCCGTTCCCCAACCTCGATCGTTCCAACCGTCGCGCGTTGTCGACCTGCCTTTGTCATTGCGCATCCCCTTTCGTTATCCCATTCACGACCCCGATTGTTCCAAACGGTCGTTGGTACACGTCGCGCACCCTTCTTTCGTCCGGTAAATTCTCACTCCTCTGGTCCATTCTCCGATTATCCTCGAAACCACCTCACTGCGACTAAAAATCCCTACTTGGTCCTTGATCATGCGGTCGGTTCACCCTCTCTGGCCGATTTGACCCAGCCTCCGTGAATCGGGTTGGACCGTAACCCTCGGGCCCTGCCGACCACACCCGTCCCCGCCGCAGTCCCTCGCACTCGAGCGCAGCCCGCCCGTGCCCCACGACCGCTCGTCCCGATCGCTCTGCCTGCCTTCGACCTCGGTCGGCCTAGTCCGCTCCTGCCCGAGCTCGCCCTTGCCTTTATCGCCGTCGCTCTCCCCGTGCGCGCGCGCCAGGCCTCGAGATCGGTTTGCGCCGATGTCGCTCGGTTGTAGCCCCTCTAGTTCGTGGGTTTCGTCGATCTCACCTAGTCCCCCCTTCTCCTGTCCCTTGCGTGCGATTCCTTTGTTGCCGTGCTTGTTCCTTGCGACAACCAAGATCGAATTTCATCTTTTAGCAAATCAAACTTCGAATCGAAGTCTGAGCGAATTGTTTTAATAATTCTCTCCTCTGCAGCCCCCAATTTAACTTCGAGCTCTTGTTGCTTCTTTTGAATTCGACATTGACGCCCTCCAAATCATCCATTCTTGATTCCGTCCTAGTAGCAACCAGTTCCGGATCGGTGAAGTCTTTGATTCCAAAATTGATAGAATCTACCAACTTTTATTGAAATAAATCTCTAGAGAGAATCTCCCTTAGAGCCAAAGAGTTCAAATGACTACAAAATAATTGAATACCAATAAATGATCTAACTTATCCTATTTCCTAATCCTTAAAGGAACCCTTAATTAATAATAACCTATTTCCTAATCCTAAGAGAAACCCTAATTGATAATAAGGCCTAATAATAATAAATAGCCAACTATAATTAAAATAACAACTAATTAAATACTAATTATAATAAACTAAATCTGTCACTTCTTGCTCCGCCTCGAGTAGACCTTCAGTGGGCCCGGGATTGTATCACAATGATCAAGCCCTTTCTTTAGAACTGCCCTTCATTGAGAATGAACTTTGTGAAGCAGTGAAAGACCTCAGATCTAACAAAATGCCTGGACTAGATGGAATCTCTGATGAATTTACAAAAAGTCCTAGAACATTCTTAAACCTAATATTTTGAAAGTGTTCCACAATTTTTTCAAGAGTGGTATTTTGAGCGCGAGCATGAAAGAAACTTACATATGCTTTATCCCTATGAAGGTCAAGGCCAAGAGTGTCAATGATTACAGACCGATAATTTGATCTCTTGTATGTACAAGATTCTTGAGAAGGTACGGTCTCAAAGGCTCGAAAAGGTGCTCCCACACACTATAGAGAGCTTCAATAAGCATTTGTGGTGAACAGACAAATTATCGATGCCTTCCTTATAGCAAATGAAGTCATCGAGAAATGGAATAGAAAGAAGAGGAAGGGAGATGTGATTAAGTTAGACATCGAGAAAACATTTTATACTATTGGTTGAGAATTCTGAGAAGATGTATTACGTGTAAAAGGTTTTGGCCATAAATGGAGAAGATGGATTTGCAGCTGCATTTCTTCCCCAAACTTCTCTATGATCATCAACAGGAAGCCTAGAGACAAGATACAAGCCACTAGGGGTCTAAGGCAAGAAGACCCTCTTTCCCTT
TCCTCTTTATCTTAATTGATGATGTATTGAGCGGAACATTAAATGAACTGGCTAAAGAAGAAAGGATTCAAGGCTTCCTCATTGGAAAAGACCAAGTACAAATCAACCTCTTTCAATTTGCGGATGACACTTTACTTTTCTCCAATCATGATCCATTGACTATTGATTAGCTCTTCAAGATCATTAGAGAATTTGAGCATTCCTTGGGCCTCAGTATCAATTTGCAAAAATTTGAGATTCTGGGCATAAATCTCGATGAACAATTGTTATCGTCTATAGTTAGCAAGTATGGGAGCAAACTGTGGACTTGGCCCACACCCTATCTTGGCTTCCCTTTCAATGGAAGTTTAAAGTCCTTATCTTTCTGGGAGTCAATGATTGAGAATGTTGAAGAGAAAGGAAATCCTTTCTCTGAAGAGTTCTTATATCTCTAAGGGTGGATGCCTTCCTAACTCTTATTCAAGCTACCTTAACTAACCATCCTACCTGGGTAGCAAAATGGCTCTTTCTCCTCTTCTCTTCTCTCCCGCCTCACACCGACCAAACCACACCCTCCCCACCCCTGCCCCCTCCGATGAAGCTCTTTAACTTCTCCCATGGCGTCACAAGCACGTATCTTTGGATAGACAAAAAGCTTTTCTCTATCACTAAAAATCTCCCAGGATCATTCACCATCACTGAAGTCAATCGTAACAACAGCCTTGTCGTTTCCTTTGATTGGGTTTTCCTCCCATGGCTCATAGACAACCTTGAAAGGTTGTGCAATATCCAGTGTTTTCAAAGCGCAAGGCGCAACACAAGGCAACACTCTCTTTGATCGCCTCAAGGCGAGAGGCGACAAAAAGGCGAGGCCCGAGCGAAGCAAGGCACAACTTATAATTATATATCAAAATAATTCATCCGTAGTGAAACTATATACCAAAATAATATAATAAAGATGAATTGATATAAAAAAAATAAATTCTATTTACATAAAATATAAAATCCTTCCTATTAATATTAGATGATAAAAACATTCAAATTATAAATCTAAATACCAAATGTAAAAAATATCAAAGTTATGTACTTTATAGATTATCAAAAAGTTAATAAATAGTAAACAACAATTAAAATTGTAAAAATTTAAGGTATTATAAATTGCTTAATTTAAACAACTCATGCATATTATTGATAATTAAGTAACAGGCTTAGAAATAAAAGAAGAATTTTTTTTTGATAAGCCGTTCTTAAAAAAATTGCCAAGATATGATCTTTTTATAATAAAGAAGCTAGAAGCTATAATGATGACTTGTATTTTCAAAATAAGAAAAAAGGAAAAGGATTAAGGATGACTTTTATTTTCAAACTTCAAAAAAAAAGAAAAAGAAAAAAGAAGAAGAAAGATGTGTAGCTTAATATTATACATATTAAGTTAAATATAATTTGACAAGACAAAAGAAGAAAGATGTGTGGCTTAATAGTAAATTAAATATAATTTTACTCTTAGAAGTTTTAAATTTGACAAGACAAGTTGATAAATTGAAAAGACTAGTCAGTATTTTAAAGAGAAGTAAAAAAATCAAAAAGTAAAAATGAAACAATATTTTCAACTTAAACTTCGCCCCAAATCATCTTTTCCTCTCATCTTCTTCCCATCTCCTCTTCTTTTTGTATTTTTTTCGTTTCTTCACCGTAACTTCAATGGCTTCTTTAAGAAAAAAAAAACTTCAATAACATCAGCGACGCCTTCATGAAAGACGAAGGTGCCCAAGGCGGTTAGAGAGCACGGCAAGGCGAGCGCTTCTTTGAAGTCGCCGCTGAGTACTGCTGCAAGGCGAAGGTACCCTGTGTCGCCTCGCCCGCGCCATAGGCGGCGCACCGGGTCGCGCTTTGAAAACACTGGCAATATCTCCTTAAATCATAGATTCTTCAAGGAACACAGATTGGGTGAATCGGTTGTTTGGCTGGAAAAGATGTCAAACAAACGTGGGCATTGTGCAGATATCACCAAGCTAAGCCCA
AATGGAAGGATACAGAGGATAATGATCCCTATTGGCATTGAAAGAAAAGGATGGAAGGATTTGCTATCTTGCTTAAATAGCTTACCCTCGCCGAGTGCAAACAAAGTTGATTTTGGTGACCACACCATTAATAACCCACCAACCTCTTCAGCCCTCATTAAGAAGAAGATTGAAGCTAGTGTGATTCTGCACAACAAGCCTTATGTGGAATTTGTGTCGACCAAAGAAGATGAAAAAGGAAAAAAATAGATTGGCTTAATTAACTCTGAGGGGGACATTTTCTCTGCACATTTTGCAACATTATTCTTATCCAAAGCAGTCATTGTACAGAGAGAGAACTTCCACGACAAATGGACTGATCTGGGCAAAGTTATAAGAGAAGAACTTTTAGGCGGTGGTATCAGTCCCCTAACTCCCGACAAGGCTTTATTGTCCTGCTTAGATGCTGAACATGCTCGCTCCCTTTGTGAAAACAGAGCCTGGACCAAAGTGGGATCTTTTAATATCAAGTTCCATCCATGGAGTGCCGAAACCATTCTCAAGGAGCCCAAAATCCCCAGCTACGGCAGATGGATACGAATAAGAAATTTCCCTATTGATCAATGGAACATCAGTGCACTTAGATCGATTGGTGACACCTGTGGTGGTTATGTGGAAACGACCAAACAAACCCTCTCTCGTTTAAACATGATGGAAGCTTGTGTGAAAGTTGTTCAAAACTCACATGGTTTCATCCTAAGAGAAATACACGTTAAATCCCCAGCATCCTCTGTTTTTCGAGTTGAGATCAACACTTTCTCCATGGATGATTTCGCCATCGGTTATGCTCCTGAGATTCACGGAAATATTCATAAGTCTTCATCGACTGATAAAACAGTTGGGTTCGTATCGCCGATCTCTCACCCGCTGGAAAAGCTTAATCATCAGGCCATGAAAGGTCAGAAGGAAAAAAAGAGCACTATTCATCCGTTAACATCAGCTGCACTGCCTCGATCCTATGCCTCATTGGTCGGCGCAAAAGGTTGGAAGCCAATCACAAAAGCAGTACAAAGCCATCCAATCACTGCTTAAAACTCTCTTAAGAGCGGGACCTAATCCTGTACCCCATCCAATCCCAACCAATCTCAACCATTAACTGATGGACCCAACCTCGATCGGACCAGTACAAACCTTGACAACCAACAGACCTTTCACCCATCCTCTCGAGCCTCACAAACCTCTCGAGCCTCGAAAACCACCGACCCTCCATTAGGATCCCCTACCTCTAGGGCTATTGTTTCGGTTCAAACTAAACAAGTGAGTTTAACCAGATCATTATCCCTCCATTCACCTCTGACTATCCAACAAACCAAAAAAATAACCATCAACAAAACTCCCTGTCTTTAGCCATAGGAACCAAGCAATCTACCAACCATAGCTACTTCTTCTCTGAAACAGAAGGTGATTGTTCTTCTCCTGGCAACACCTCTCTTTCAGAATCCAACCTTGAGGAAAACCCCTTGGCTATCCAATCTTTCCCTTTGGACCTTGACCTTCCCCTGCTTTACCACCTTCTCGAGGATGGGGGCCCATCATCATCCTTTCCAGTTCCTCTTCGCATTGAAGCTCCATGTAAGAATAGCTCAAAGGATTTTCTCTTCGAAACAAAGCATGGAGCTGGTGATACTCTCAATGTCGATGAGCTTTCCCCTGAAGATGAACAAGAGCTGTTGAATAAATTGATTAAAAGGAAGGCACTAAAAGATCCATCAGCTATTCTCCCTCTCTTATTCCCATGGATGGCAGAAGAAGGAATGTGTATCATGCCCATGCCTTCCTTTAAAAATACATCGGCTGCTTTAAAAAAGAAGTCGAAGAGCATGGGGGAATTACAATGCTTAACTACTGATCTCAAATTCTCACCAAAGACTGCTCCGAGGCAGTATATCGGCTTGTCGAACCTTAAATGATTATTCTATCTTGGAATACCAGGGGCTTGGGGTGCTGGAAGAAGGGAGCAT
TAATTAAAGATTTCATTTGTCAGCAAAACCCCGTTATTCTTATATTGCAAGAAACAAAATAGAGTTTCATTGACTCTGCATTCATCAAGAGCTTATGGGGATCTAAAGATATTGCATGGACATGTCTAGAATCTACTGGCACCAGGGGTGGAATAGCAATCCTATGGAAGGACTCTCTGTTTACCACTACAAACATCATTAGAGGTACTTACACCCTCTCTATTCATTTCTCTTTACATGATGATTTCTCATTTTGGGTGACATGGGTATATGGGCCTAACAATCCAAGAGAAAGAACATTACTATGGGCATAATTCAACAACCTTAGTAATAGTTGCAATCACAGCTGGATTATGGGAGGGGACTTTAACACCATCCGATGGACCAGTGAAAAAATTTCTCCACACAGAGTTTCCAGGACCACGCGATCCATGAGATTGTTTAACAACTTAATTCAGAGATCGAACCTTAGTGACATCCCATTGTCAAACGGTGGTTATACTTGGTCCAACCTTAGGAGTGAACCAACTCTCTCCCTTTTGGATCGGTTCCTCATTTCTGCCTCCATTCAAGTCAAATTCAACATGGTCAATGCTAGAAGACTCGAAAGGGTGTGCTATGATCATTTTCCAGTTAGTCTTTCTCTTGGAAAAGCTTCTTGGGGACCTGCTCCTTTTAGATTCTTAAACGCTTGGCTGAATCACAAATCTTTGATTCCTTTGATTGATCAATGGTGGGTGAATAATCAATTCCCAGGATGGCCTGGTCATAGCTTCATTAAAAAATAAAGGGGCTATTCTCAAAGAATGGAACATTCACACATTTGGCCATCACAAAGAAAAATCCCTTCAATTACAGAAGGATATTGACAAGCTTTGACTGTAAGGAAATTAATGGCTCTATTGAGGAAGCTGTCAGATTGAATAGAATTTGCTTAAGAACAGAGTTATTGATGAACTCTGCTAGAGAAGAGATTCATTGGAGACAGATGTGTAAAGCACTTTGGCTTAAAGAAGGGGACGCAAATACATATTTTTTCCATAAGTATGCTGGTGCTAGCAAGAAAAGAAACTTCATACATGAAATCATGTCCTCTTCTCAACTTTGCCTAGTTAATGACATTGACATTGAGACTGAATTTGTTGGATTTTTCAAGAAGCTTTACTGTTAGAAGGACTCTTGTGGATCATTACCGAATATTGAAAATTGGAGCCCCATCAATGAATCTCAGAGAATTGCTATGGATGGTCCTTTCACTGAGAGGAAATTTTGTTTGCAGTGAACTCTCTTGGCACTAATAAATCTCCAGGGCCAGATGGATATACCATAAACTTCTTTAAAAAATCTTGGAACATTCTTAAAGGAGACATCATGGGAGTGTTCAAGGATTGTTTTGAGAATGGTATCATTAATCTTAGTGTCAATGAAACTTACATTTGCTTGATTCCTAAGAAAGTTGATGCGCGCACTATGGGAGACTATCGACCTATTGGCCTCACTACATGCCTTTACAAGATTAACGCTAGGGTGCTTTCTGAGAGGTTGAAGAAAGTATTGCCTTTTACCATCACTAAGTATCAATCTGCCTTTGTGGCCAACAGACAGATTTTTGATGCATCTCTAATTGCTAATGAGCTTATTGATGAGTGGCATAGAAAGAAGAAAAGAGGTATTATCCTGAAACTGGATATTGAAAAAGCTTTCGATATGGTTGACTGGGACTTTTTGATTGATATTTTGGAGAAAAAAGGTTTTAGCTCGACATGGAGAAGGTGGATTTACGGTTGTATATCATCAACAAACTTTTCCATTATTATTAATGGTAAGCCCAGGGGGAAAATCAAAGCATCTAGGGGTCTTCGGCAAGGTGATCCGCTATCTCCCTTTCTTTTCACTCTTGTTGTTGACTGTTTAAGTAGGATTCTTATCAAAGAGGAGGATGCCGGAAATATCGATGGTTTCTTTGTGGGTGACCAACAGAATAAGCTTTCGATGACACA
TCTTCAATTTGCAGATGACACCATTTTGTTTTCCTCAGCTGACAATACAAAGTTCGCAAATCTGATGAATACAGTAAAAAATTTTGAGTATATTTCGGGGCTAAATATCAACATCAACAAGACCGAATTTCTATCAATTGGTTTGAACGAAATTGAAGTTAACCTTTTTGCTCAGCCATACGACTTTTCTATTAAGCATTGGCCAATGTCATACCTTGGCCTCCCCTTATTTGGCAAGCCAAAATCTATGGAATTTTGGAATCCGGTTATTGTAAAAATTGGAAAGCGGCTCCATTCTTGGAGTAGCAACAACCTCTCTAAAGGAGGAGGGCTCACTTTAATTAATGCCTCCTTATCCAACATCCCTGTCTATTATGTCTCTCTGTATGAAGCTCCATCTAAGGTCACAATTGCTATTGGAAAGCTCTTTCGGGATTACCTCTGGAGATGCAATAGTGGTAATTCTATTAGTCATTTGGTTCGCTGGAATGAGGTCAAAAAACTGATTGCTGATGGTGGTTTAGGCATTCTCGACATTAAACTTAAAAATCGAGCATTGCTTGCAAAGTTGCATTGGCGCTATGGTGTAGAATCTGAAGCTTTATGGAGGTTGGTCTTAATTTACAAATATAGGGGTAAGGTGGAATAAAAAAGGCCTGAAGGTTTCTCTCTCAATAGGGCTAATGGTCCTTAGAAAATCATATCCAAACATGGCAGCCTTATCTATAACAATGTTGCATTCAAACTTGGTAAAGGTGATCAGATTTTGTTTTGGGAGGACACTTGGAAAGATGGAAGTGCTTTACGGGACCAATACCCTCTCATCTATTGATTGGCTGTTGACAAAAATATTGAAATCTCCAAGATCTTCATCAACAATTTATCGCACTGGAATCTGCCCCTCAGAAGAAGCTTGAATGATGCTGAGACAAAGGAATGGGCCCCTCTTTGCTTATTGATGGCCAAATTGAATTGAATGATGATGAGGACCCTTGGAGATGGCCTTTGGAAGAATCGGGGAACTTCACCACAAGCTCTCTTTTTAACAAGATCTCCTCTAATGGGCTAAATGGGCCTAATCAGGATTTTTACACTAGATTATGGAAAGGTCCTCAGCCTAAAAAGGTGAAATTCTTTATATGGGAGCTTAGCAAAGAGGCCATCAATACTAATGATAAGATCCAGAGGAAGCATGCTAATACGACTCTCCAACCATCTGCTTGTATCTTATGCCTTTTTGAATTCGAAACTCAACACCATCTGTTCATCAGTTGCGATGTTTCTGCCCAAATATGGGCCCATATTCTCACGGCTTTTAATTGGTCCACCCCCTTCCCCAGTCGAGCATCCGATCTTATTGAATTTCTAATTATGGGACATCCATTCAACGGGGAAGTTGAAGTTCTATGGCTTTGCATTATTTATGCATTTTTGTGGGCCACTTGGAGGGAACGAAACAATAGACTCTTCAATGATGAATACACACATAAGGAATCGAGTCTTGAAACTATTATATACATGGCGATGTACTGGTGTCATTCTACCCCTCCCTTCTGTAATTATGACTTATCCACATTAATCTCTCAGTGGTATAGTTTTTTGTAATGTATCGATAGGATGTATCTTTTCAACCATCAATGAAAGTTTTGTTTCCTATAAAAAAAAACTAACCATCCTACCTACTTCATGTCAATTTTCAAGATGCCAGCTAAGATCACAAACATCCTGGAGAGCCTCATTAGAAATTTTCTATGGAAGGGTGTTGAATTATGTTAAACAACCAATCAACTCAAAAGCTTAAGCTAATGGATTGGGGTAAATTTAATTATATCAACCAACACTCCCCCCTCACTTGTGGGCTTTGATATTTGAGAAATGCCCTACAAGTAGAATTTAATTTTAATTGGGAAGGAAACGACTTGGCAGGGATTCGAACTCATATCCTCCTGCTCTGATACCATATGTTGAATTATGTTAAACAACCAATCTACCCAAAAGCT
TAAGTTAATGGATTGAGGTAAATTTAATTATATCAACCATCAAAGGGTAATAAGGAAGAAAATGGATTTCACCTTGTCAAATGGAAGGAAACTTCTCTTCCCATTGATCAAGAGGGACTAGGCATCATTGATCTTAAAATGATGAATATGGCCCTTCATGGGACTTTGCTCTGTGGAGGAAGACTAGCGATGCTAAATATGGGTACAAACATCATAATAATAAATCAGGTAAAGTGAAATTTGGAATTACTAAAACTCCATGGAAAGATATTTCTACATTGGCTCCCTTAATAAACGTTAAAATGTGTTTTAAAGTTAAGGATGGCTATGCTAATAGCCTATGATTAGATAATTGGCTGAATGATGCCCTTTTATGCAACCAAATTCTAAACTTATACGACTTATCTACCAAAAAAGAGCAAAAAAGAGATGGTTGCCAAGGAAGCTGGGGAAAATGATCTTAATTGATGGAACTTATACTTGAGAAGAAATTTAAGGGAGAAAGAAATCTCTAAATTTTGTGAGCTATCTAGCATGTTGATTGGTGTCCATTTGAGGAACGGTAGGGACTTATGCGAGAGCTATCAGGTACAATTGAGTCATCTCTAATAATATCTATACTGTGATTTGGAAAGATAAATTCTCCAAAAAAATAAAATTCTTCCGGTAGGAACGACTAAGAAAGGCTATCAACACTAAGACATGATGCAATGAAGGCTACCAGTCTACCATAGATGCTCTTTTGTCCAAACTCGTGTGTTATGTGCAAAGTTTTGGCTGAGACACAACACCACCTTCTAATAGCCTGCTCCTTCGCTACATTTTTAGGGAAGCATTTTTGTGGAGCTTTACTCTTATAGAAGACAAGTACTCTTTTCCAAATCACTCTTGTGGGTCACCAATTCAAAAAGAAGAAAAAGATTATTTGGGAAAACGTTTTTAAAGCATTTTGTTGGAGCACATGGTTGGAGATCCCCTATTTCATGATTATATTTATGTAAACCCTTATAAATAGCTGGAGATCGCTTTTGTAATCTGGACAAAGCGTTCTCCTTTTGTAATTTCATACTTATAATGAAATTGTTTCTTATTCAAAATTTTATTTATAAATACTCTTTAGTATTGTACTACACTTCATATTTTCATTTCTCTCGAGGAAACAATAGTGTTGGTTGATATAATTAAGTTTAACCCAAACTATTAGCTTAAGTTTCTTGGGTTTGATTGATGGTTTAACATGATTCAACATGGTATTAGGGCAGAAGGTTTTGAGTTCGAATCTTTGCCAAGTTGTTTCCTCCCCAATTAAATTCTACTTGTAGGGAATTTTCAAATTTTTAACCCCACAAGTGGTTGATTGATACAATCAAATTTACCTGAACCCATTAACTTAAACTTTTGGGTTAATGGGTGGTTTAATATAATTCAACAAAAAGCAACATATCAAATAAAAAAGCTCCAACATAGATAATTTCTGACTATTCTTTAATAGTATTACTTATTTTGAGGTTATTCTTCTCTGTTTTTCTGTATCTATTTGAATGAAAAATGATGATAATGCACCTGAGGAATAACTTCATTGGGTTTTCAAGGAATGTCAGAGCTCTTTTTTGTTGTTTTTCATTACATAGTTTTCCCATTGCAACCTCAATTATGACTATGAGTTAATTACCCCAATTTTCTTTTATCAAATGAACTTTTGCACATTCTCTTTATTTAGGTCAATGGCAACTCAAAGTCGGTGGAGATTCTATTGTCCCTGTTGGATTGTTACTCGACCAAAATTCTTCCATGGTATATTGTTCTATCATATTCTTGTACATCTTTTGTGTGGAAACATTACGATTATCTTAATTCTCGTATTTTCTTTCTTCTGTTTCTCTTCTATCAAGGCACTTAACAAATTTTTCTTTATGCTGAAAAAGTATATTAAAATTTTAGGAACGGCAATTTGACTGCAGCCTTGATATATCAATTTGAATGGTTTTCATTTTTCCACT
CCCGTAAGTTCATTATAAGTCTAGGAATTTTGATTTGACACATGCTTTATTATGTTATGAACTCGTGAGTATTGAATATTCTTCGAGTGAAAAGTTTGAAGATCAGGTTTCTAAAATGCTACCTTGTTCATTTGAGTCTGAGCATGGCTATGGTTGGGTGTTGCAAGGTGTGTTCTATAATCTAGTTCTCTCAAGTTTTGTTGCTTCACCCACTTAGGTGGGTTACTTGCAATTTTTCTTGATGTAAATTGTTCGGTTTCTTCTTTTGAGTTTCTTTTTTTGAGTTTGGTTTGTGGTTTTTTTTTTTTTTTTTTGTTTTAGTTGCTTTCTTTTGGTCCCTTGACAATATCTTCTTGAAAAGTGTTGTTTCTTGTTTAAACCAAATAAAACGCTACCAGATACTTCTTTGGAGAATTAATCTTTAGATATTGAAGCAATTGACATTCAGATATTTCCAAGAGATCAAGATTATTGTGATGATTCTGACTTGTTTGAGCAGCCCTGAATGCACTACATTCGTTGAAAAGGCCTTAACAGATATCTCACAAGCTCTCCTCCAGTTATTAACCTTAATTCGATATTCAACTCACCTGAATGCACTACATTCTTTGAAAAGGCCTTAACTGATAAATCTTTTGTTTTCCGTTTAATCCTTCTTTTTTTCTTTTTCTATATTTTGTTATCAATGTGATGCTTACACTTTTTTTTATGATGGTGTAGGCTTGATGAAGTTGGTGGTTTTGAGGAAGTCATATTGCGAATTGCATTAAACATTTGGGATCAGATTGAAAAATGTTCAGTTTTTAGCGCTTTGCTGGATAAAGTGAGTAAAACTTCTATTCTTTTTTTTAGGAAATATGTCTTTTCATTGATGAATGAAGTAGTTACAAAGTGATCCTAGCAAAACTAATTACAAAAAACATCGAGATTGTAGTCAGACAACTTAATTTCATTCGTTAGCTTTTTACTGAAGTAGTAGCCTTTTTTCTTGTTTTTAATGAATTCAGGTGCTTCTAGATGCTACCATGTTGGCTATGAAGCTCTCTGTTCGAAGTTGCTCAAAGGAAAGCCAGAATGTTGTAATCCAAAAGGCATTTGATGTATTATTAACCAGCAATTTTACTCCTTTGAAATTACCATCATCTACTACAGTACCACTTCAGATGGAGGGCTTACAACTTCTGAAGCAGAAAGATAGTCCACTTTGTAGAGATGAATGGATTCTTTTATTATTTGCATCAGTCACTATAGGACTTCGTCCACAAACACAGATTCCAGATGTGAGATCAGTAATACATTTGCTTATGTTATCCATCACCAGGGGCTGCATACCAGCTGCACAAGCACTAGGTTCTATAATCAATAAATTGAGTCTGAAATCAGATAAAGTAGAAGTTTCAAGTTACGTTTCATTGGAAGAAGCAATTGATATTATTTTCAAAACCAAATTTAGGTGCTTCCATAACGGAAGTACTCTTGCAGGCAGTGAGATGTTTCTCACTGATTTATGCTCTAGCATTGAAAAAAGTTCTTTACTTCAAGTTCATGTTGTGGTTGGATTATCATGGATTGGAAAAGGTCTGCTTCTTTGTGGTCATGAAAAGGTCCGTGATATAACTATGGTTTTATTGGAGTGCTTACTATCAAAAAGCAGAACAGATGCCTCATCCTTGCAGCAGGTTATACTGGAAAAAGATTATGAGCCGAACTTCGACTTTGCAATAGTGAAGGGTGCAGCAGATGCATTTCACATTCTCATGAGTGATTCTGAAGCTTGTTTGAACCGTAAATTTCATGCAATAGTACGGCCACTTTATAAGCAGCGTTTTTACTCTACCATGATGCCTATTTTCCAGTCTCTAGTAAGCAAATCAGATGCATCACTTTCTCGGTAGGCATTCATGTTTTATCCTAGTGGAAAGTTTGTTTTTTTTTTTTCTAAGGACAATATTGCATGAGTTTTTGATAGCCATGTTCTGATATGCCGCTTTATATTTCAG
ATATATGTTGTACAAGGCATTTGCACATGTTATAACCGATACTCCACTCACTGCCATATTGAGTGATGCGAAGAAGGTTAGCTTTAATGCTAACAAAAAACAGATTCTGGGTTTATTTCTGCTATTGTTACAACCCAGCTTTCTTTCTCACAATTAACTTATAAATCTAACTCCAGTATCATGTCAATTTTTGCAGCTTATACCTATGCTTCTGGATGGCTTGCTAACATTAAGTGTGAACGTCATCGACAAGGATGTGGTTTATGGCCTTCTTCTTGTTTTATCAGGGATCTTAACTGATAAAAATGGTAAGGTTCTTATTTATTTCCATGCTTAAATCTTTTTTTCTCTCTTTTTTGATAGGAAACAAAACATTTTCTTTGATAGTTCATTATACCATCGAAAAAGAGTAACAATGACCAAATTAATAGAGGGTATTGAAAAGTTGGGGTTAGTTTAGTAGTTTGTTAATTCTTTATTTATATCTTTCATTTGATTCTTTCAAAAGATTGTTAGGTAGGTGAGATTTTGTAGGAAAGATCTTGTAGTTTGTTGGTGTTAGGCTTGTATTTATACATTTGATCAAAATTGAATAAAAAGAAGAAAACAAGCGTTTCTTCATGGTATCAAAGCCAAAAAGGGAAAAAAAACACAAACTCTTCTCTCCTCGTGAAACTCAGTCACCATCTCCGGTGATCTCTCTTCGAGGAATTTTTCGCCGATCACCTTATCGACCGTCATCTCTCCAGTCACCATCTCCGGTGATCTCTCTTCGAGATTCTTCGCCAATCACCTTGGTCGACACCTCCTATTCAATTTCAGGAATAGTCGATCAACCTCAGAAATTGCCGATCAATCTCCAGCAATTGCCGATCAACCACAAGAAATCTCCAGCTTCGTGACCCGTCCAATTGTTGTCGGCGAACAATCTCCAGCTTTGTGACCCATCCAATTGATGTCGCTCTTGTCCTTCATGGAAATTTTCGATTCCAATTTCTCACGGCAATTTCCGATTCCAATTTTCTCACGGAATTTCATCCTGATCGGCCTCTTCCTGATTTCAATTGACAATTGCCGATCTCAATTGCTTCCTCCGATTACACAATCACCGACATCTTCCAAGTGTAATTGATCCCATATGGATTCCATTACAATTATCAATCGATTGTAATAGAAATTATTGGGAATCAATCGATAATCAGCTATCGGCAGCTGGAGTTGAAGTCAACCCTTCAATCAACTTATATGTTCATCATCAATCAGCAAGTTTGAGGTACTGCTACTCATCCCTGTTCTATTTTGAAAGCAACTGCAAAGTACAAAAATATTAACATCCAAGAGGTGCTGTGATTTTGTTTATCTCAAATTATTTCACATATTTAGTCTTGGCTCATATTAGACTATTACTGTGTTATGGCTTCCTCGAAATCTGTTGATCACACATTCGAGTTATCATCATCGCCCTATATCACGGGTCATAAATTAAATGGTCAGAATTATGTCCAATGGTCTCAAGCAGTAAACATGTTCATATGTGGTAAGGGCAAAGATGATTACCTGTCAAAAGATAAGACACGACCTAAGACAGATGATCCCGAGTTCAAACAATGGAAAATAGAGAATAACATGGTAATGCCATGGTTGATTAACTCTATGACTACAGAAATAAGAGAAAACTTCTTACTTTACAGCACAACTTATCAGATTTGGGAAGCTGCAAGAGACACATACTCCAATAAAGAAAATACTTCAGAACTGTTTAGAATTGAGTCTATTTTGCATGACCTTCTTCAAGGAGAACAGTCAGTTACCGAGTACTTCAACAATCTCTCTCATAATTGGCAACAGTTGGACCTGTTTGAACTTCATACATGGAAGTGTGCTGACGATGCTACACAATTTACGAAACTTGTAGAGCAAAAGAGGACATTTAAGTTCCTCTTTGGCTTGAACAACAATCTTGATGAAGTTCGGGAGAGAATTATGGGAGTCAAGCCGAT
GCCATCCCTCCGTGAGCCTTTTCGGAAGTACGGCAAGAAGAAAGCAGAAAGAAGATTATGATGGGTAAGCTAGAGTCTGGTCCAACGGCATCTGCACTTGTCGCAAGAAACTATAATCCAACAGGAGAACATAAGCTGAAACGAGATAGATCCTGGTGTAACCACTGCAAAAAGCCAGGCCATCTAAAGGACACATGCTGGAAGCCCACTGATTGGAAACCAAGAAATCGTAATGATCGTGATAGTAGGGCTTGTGTAACTACAGCTACCTCCGAGAGTAATATTGGAATCGAATCCTCTCCATTTAGCAAAGAACAGATCGAGACATTACAAAAGATCATCAGTAATACAATTCAAGGAACAGGTCTAGTTGCTCAAAAAGTTGCATCGACTGCTCTCTTCGCTAGTTAGAAGAGGAACCCAGCCCTGGATTATTGACACTGGAGCATCGGACCATATGTCAGGGGATAGGTCACTATTCCATCAATATCAGACAGACACATCCAGCTCTTTAGTAAAAATCGCAGATGGGTCACATTCGAAAGTAGCAGGGACAGGTTCCATTCGTATCACTCAAAACTTAACTCTCAATTTTGTTCTTCATGTACCAGACTTGGATTATAATCTACTTTCGATTAGTAAATTGACTCAAGACTTGAAGTGTGTAAGTAAAGTCTATCCAAACACTTGTGTTTTTCAGGATTTGGGATCGGAGAGGACGATTGGCAGTGCTGAAGTTTGTTCGGGGCTCTACCTCCTCAACTTTGACCAAAACTCGCCGACAATCGGAGACAAGTCGTCATCTTTATCTTTCATTTCCAGTAGTCTAGTTTCTCGTAGTGTCAATAACGATAATGAAGTCATGTTACATTATCGCCTGGGTCATCCCAATTTTTCATATTTATCCAAAGTGTTTCCTGAGTTGTTTTATAATAAAAATCCTACTTCATTCTCGTGAAATCTGTTAGATTGCAAAGCATACACGATCTTCCTATCCTGCCCTTGATTATAAATCTTCTAGCCCATTTTCTTTAATTCATAGTGACGTATGTGGCCTGTCAAAAGTAAAAACCCTGTCTGGGGCTCGTTGGTTTGTTTCATTTATCGATGATCACACGCGTAGTACGTGGGTCTTTCTGATGAAAGAAAAATCCGAAGTATACTCCTGCTTCAAACACTTCAATACTATGATTCAAACTCAGTACCAGACCAAAATTCGTGTTGTTAAAACAGACAATGCGAAAGAATTCTTTAGTTCCTCTTTGGGGGTACTTATCAACAGAAGGGATCATTCATATCAGCTCTTGTGTTGACACCCCTCAACAAAATGGAGTGGTCGAACGGAAGAATCGTCATCTTCTGGAAGTCATTCGCTGCCTCATGTTCTCTGCTCATGTTCCTAAATCCTATTAGGGGGAAGCCTTTCTCATTGCTACCTATCTTATCAACCGTATGCCCTCCCGTGTCCTCAAGTTGTCTAGTCCTCGGGACACCTTTCTTTAGGCTTTCCCTCACCTTCGGATACCATCTCTCGACTTAGCTCCTAAAATATTTGGATGCAGTGCCTTTGTCCATATCCATTTCCAACATCGTAGTAAATTGGAACCTTGAGCCCATAAATGCATCTTCTTGGGATACTCTCCAACTCAAAAAGGCTACTAGTGCTACTCACTTGAGACAAAGCGAGTATATGTCACAATGGATGTAACGTTCTTCGAGCAACTTCCCTTCTACTCCAAGACTGCTTGTTTGGGGGATAATTTAAGTGACAATCAGTTGTGGTCATCCATCCTTGAAACACTCTAGAGATGGCTCCCGATCCTCCTACCTCCATACATGCCCCCTCCTCGATCCTCCATCCTCCATACCTGTCTCTACTCTTCTCGATCCTCCATCATCTCCGATTGTCAGCAATCAACTCCTCACCTATACGAGGGGAAATAACCGAGTTGAGCAGGCAGACACCTGAACAGACCAAGTACTTCAAGAGTCGGAACCAA
CTCCAAACTCTGAACAACCTGCCAATGAGATGGATGTTGTTAATGATCTGGACCTGCCGATTGCAATTAGGAAAGGTGTTAGAAGTTGCACATTGCATCCTATAGGGAATTATGTTTTGTATGATAATTTTTCTGAACCCTATCGTGCCTTTGTGTCCACTCTTGATAATGCATAGGTTCCAGCTAGTATTGCCGAAGCATTGAAACATCCCGGTTGGAGGAAAGTGGTACATGATGAAATTGAGGCTTTTGAGAAAAATGATACATGGACCACGAACAAACTACCAAGTGGGAAAAAACCTGTTGGGTGCAAGTGGATATTTACAGTAAAACATAATGCATATGGAAGCATTGAGAGGTTGAAGGCGAGGCTAGTAGCCAAAGGGTTCACCCAATCATATGAGATTGATTATCAAGAGACCTTTGCCCCTGTTGCGAAACTCAATACCATAAGAATTATCTTGTCCCTTGCAGTTAACTTTGAGTGTGGTCTACAACAGCTGGATATAAAAACTGCTTTTTTGAACGGGGATCTAGAGGAAGAAGTTTACATGAAAATTCCTCCCGGATACAAAGAAGGTCTCAATCCAACCCACGTATGCCGACTCAAAAAGTCTCTCTATGGGCTTAATAAATCTCACCTCGGGCTTGGTTCGGCAGATTTGCAAACGTTGTTCTTAGTTTGGGATATTCACAATGTCAATCTGATCATACATTATTTGTCAAACGGAGGTCTGAAACATCTCTTGCTCTGGTAATAGTGTATGTGGATGATATCATCCTTTCTTGGAATGATAAGTCTGAGTTACAGGCCTGAAAAAACACCTAGCTCGAGAATTTGAAGTTAAAGATCTCGGGAAACTCAGGTATTTCCTGGGTATGGAGGTTGCACGATCCAAGACAGGTATTGTTATGTCTCAAAGGAAGTATGTGTTAGATCTTCTTACAGAGACAGGCATGCTAGGATGTAAACCAGCAGCCTCTCCCATGGACCCTTAGTCAAAATTTAGGCATGACTCGGAAAGCAACCCGGTGGATAGGGGGAGATACCAAAGGTTGGTGGGACGCCTTATATATCTCTCTCATACTCGATCAGACATTGGGTTTGCAGTTAGTGTTGTCAGCCAGTTTATGCATAGTCCCAATAACAAACATATGGAAGCAGTATACATAATCCTCATATATCTAAAGATGACTCCCGGGAAAGGATTACTGTTCAAACGAACTGATAAGAGAGATGTTGAAATCTACTCAGATGCGGATTGGGCAGGTGATACTACAAACAGAACATCGACCTCTGGGTATTGCTCATTTGTGTGGGGCAATCTTGTTACCTGGAGAAGTAAAAGACAACCCGTTGTTGCTAGTAGTGCTGAAGCTGAATATCGAGCTCTGTCTAAAGGCATATGTGAAGGCATCTAGGTAAAGAAAGTCCTTTGTGAGTTAGGACAAGAGTGTCCAACATCGATTCAAGTATTGTGCGATAATCAAGCTGCGATAAGTATTGCCAAGAATTCTGTTCATCATGATAGGAAGAAACATGTTGAGATTGATCGACACTTCATCTCTGAAAAGGTTAACAATAAATTGATTCAACTAGTCTACATTCCAACCAAGCAACAGGTGGTAGACATCCTCACTAAACCATTACCGAGGCCTAACTTTGAAGACCTAAATAGCAAGCTAGGTTTGTATAATATATACAATCCAGCTTGAGGGGGAGTATTGAAAAGTTAGGGTTAGTTTAGTAGTTTGTTAATTTTTTATTTATATCTTTCATTTGATTCTTTCAGAAGATTGTTAGGTAGGTGAGATTTTGTAGGAAAGATCTTGTAGTTTGTTGGTGTTAGGCTTGTATTTATACATTTGATCAGAATTGAATAAAAATAAGAAAATAAGCGTTTCTTCAGAGGGTAAGCACAAAATATTACTAGCCAATACAACTAATTGTAAACATGGAAAACTCTCGAAAGAGGAGAAGAAAACGGTAACATTGGTGA
AGCTTGCATGCTTGGATGAGAAAGAACCCTCTAATTTCAGGAGATGGCCGATGAAAAGGTAGAAGCCAAAGCTGAACATGATAGGATTATTCAAGTCTGTTGAATTATGTTAAACTACCAATCAATCCAAAAGTCTAAGCTGATGGATTGGGGTAAATTTAATTATATCAACCAACACTCCCCTCAAGAAATACCCTACAAGTGGAATTTAATTTTAATTGGGGAGGAAACGACTTGGCAGGGATTCGAACTCATGACTTCCTTCTCTGATACCATATTGAATTATGTTAAATCACCAATCAATCCAAAGGCCTAAGCTGATGGATTGGGATAAATTTAATTATATCAACCAACATCTACTATCTCGCACGGATCTCCCACACGCAATTCATCACATTCAAACTTGTTGGCCATTATCGATGTTGAGGAAAACTTCAAACATTGAATCTTTCTATTAGTAGAAGTTGCTGGTGTCACGTGCCACATGGTGAAGAAGACCAAAGAGATGCCGATTCTAATGACACACTTCCTTTGATTCAGGCATAAAGGTGGGTCAGGAAAATCGAACTGGTCAGTTACTTGAATTGGTCAAAATGGACAATATGGCTCTATTTATAGGAGAGGATCGATTGGTCCAAATCGATTAAAATTGGTCACAAATGAACCCTAGTGTACTGAACTAATTTATGTAAAACAATATGATCCAAAACAGTATCTGTTATTATTGAGAGGCAACTTCTTTATATAAGAGTTACAATGGGCCTAGCGGGCCTAACGGGCCTAGATCTATTACACAGTACATTATATACAATCACACATTTAGACTCTAACACTCCCCTCAAGCGGAGCAAATATATCTATCATGCCCGGCTTGTTACAAAGATAGTCTATCCGAGCTCCATTCAATGCTTTGGTGAAGATATCACCCAACTGTTCTCCAGTCTTCACATAACTCGTCGAAACCAACCCTTGTTGTATCTTCTCTCGGATAAAGTGGCAATCTATTTCAATATGTTTAGTTCTCTCGTGGAATACCGGATTAGAGGCAATATGAATCGCTGCTTGATTGTCACACCATAGTTTAGCAGGTGTTGCAGTTTTGATTCCAACTTCATTCAGCAGTTGAACCAGCCATACTATCTCACACGTTCCTTGGGACATAGCTCAGTATTTCGACTCAAAGACTAGAACGCGACACCACATTTTGCTTCTTACTCTTCCATGAAACCAAATTACCTCCCATAAACACACAAGATATCCGAGGTTGATCTCTGTCTTCTTTAGAACCCGCCCGATCAAGATCGAAAAACATTCAATATCGGAATGTCCGTGGTTTTTGGTACAAGACGCCCCTCCCGAGGAGCAAACTTAAGATAACTCGGGATCTCATTCCACGGCTCTCCAATGTTCAATCTGTAGGAGATGACATAAAGCGACTCACGATACTCACGAGAATCACGAAATATCGGCCTCGTGGATGTCAAGTAGTTCAACTTCCCAACTAGCTTCCTGTACCTTTCGGATCGGAAAAGCTTCACCGTCCTTGATTGAATCGTTGGTTTGGCACCATCGGGGCCCTACACGGTTTACACCTAACTTTCCTACTTGATCCAAGAGGTCGAGAACATCTTGATAGAATCTACCACTATTTTATTAAGAACTAAGATAATCTCTCGAGAGGATCTCTCAAAGAGCCCAAATACAAAGACAACAAACTGTTTGAATGAATTGCTAGAGGCCTAACATATCCTATATATAGTAAACCTAGAAGATGACCTAATTTAGTAATAACTCTAATAAGGCCCATCGCCCAATACAATGAATTACAATTAAAGACTATTAAATATAATTAATAAATAACTAATTTCATAATGTCTTGTTCCGCTCGAGTAAACCTTCACGGGCCGGGTTCAGTATCAATACTTCCCCCCCAAAGAGCCACCTTGTCCTCAAGGTGAAAGTCAGGAAACTGTAGCTCTAGATCCCTGGCGGCCTCCCAC
GTAGCCTCGTCTGGAGTGGACCCCTCCCACTGGATCAAAACTTGGCGCGAACCCCCTTCGAGCGGGTCCTCCCGTATGCCCAACACAGCCTTAGGGCAAACCACGACACACAAATCTTCCGTCACCATGGACGGCGTAGCCAGAACTAACACTGAAGAACCCACCGCCTTCCGCAGGACGGACACATGAAAAACCGGATGAATCTTCACTGATGGAGGCAACTCGAGCCGATATGCCACTGGCCCAACCCGTGCCAAAACACGATACGGACCAATAAATCGGGGTGCTAACTTGGGATGTTTAAACCGTGCCAGAGACGATTGTCGGTAAGGTCGGAGTTTAATATAAACCAAGTCATCAACAGCAAACTGAACATCGCGGCGATTGACATTCGCGCGATCCGACATGGACTGCTGTGCACGCGATAAGTTAGCCTTCAAAGAATCCAACATGCGATCTCTCTCTAACATCAACGAGTCAATCGCCGCGCACGGGACTAGCTCGAAATCATATCCCGTAATCGAGGGATGAGGCGCACCGTAAACGATCGCAAAGGCGTCATGCACGTGGGACGAATGAAACGATGTGTTAAAACCGAATTCACCCAAGACAACCATTTATACCATGCCTTCGGTTGAGTCATCACAAAACATCGTAGGTAGGACTCCAAACAGCGGTTTACAACCTCGGTTTGGCCATCCGTTTGAGGATGGTATGTAGTACTGCGGCGAAGTTTCGTTCCCGAAGCCTTAAACATTTCCTCCCACAGTAAGCTAGTGAAAATCTTATCCCGATCAGACACGATGCTTTTCGGTATCCCATGTAAGCGCACCACCTCCTTTATAAAAACCTGAGACACGGATAAGGAAGTGAAGGGGTGGCGTAGGGGGATGAAATGCGCATATTTAGACAATCTGTCAACCACGACTAAAATAGTGTCAAACCCCTCCGAACGCGGCAAACCCTCAACAAAGTCCATGGAGATGTCCTCCCAAATTCGTTCGGGAATCGGTAATGGCTGTAACAATCCTGCTGGAGATAATGACAGGTGTTTGGCCTGCACACAGATCGAACATTCCGCCACAAAGGCGCGGACACGAGCCTTCATCCCTTGCCAATAAACTTCCTTAGCAAGACGCTGATATGTTTTTAAGACTCCGAAATGTCCACCGACAGCCCCCCCATGGAATTCTAGCAATAATAGAGGGATCGTTGGTGACGTCGGCGGTAACACTAACCTGCCCTGGTAAAGCAAAACATCGCCCACAACTGAATACCCTGGTGGACCCTCCTCACCTGCTTTCAGAGCCGTAAAGATAGGCAGTAATTTTGCATCCTCTTTAATCTGCTGCGTAAACACAGCTGTGTTAACCCCGGCCACGACACTGAGCATTCCCAACTCACCAGATAGAGGCATCCGGGAAAGAGCATCAGCTGCTCGATTCTCAAGACCCTTCTTGTACTCAATGCTGAAATCATACCCCATAAGCTTAGCGATCCATCGATAATCACCATCCACCACCCACGCTCTAGTAAAAATTTAAGACTTTTTCTCGTCCGTGCGTACAATAAAATGGTGTCCCAATAAAGTACGCCCTCCATCATTTGAGCGGCGAACACAATAGCCATTAACTCCCGCTCATAGACAGGCTTAACGCGATGAGTGATCGGCAAAGCCCTACTAAAATATGCCACTGGTTGGCCTTGCTGCATTAACACTGCACCCACTCCAATTCCGGAAGCGTCGGTTTCGACCACAAATCCCTGGCTGAAATCCGGTAATCGGAGGACTGGAACACTGCTCATGGCATGTTTCATTCTCTGGAAGCAGTCCTCAGCAGCTAGCCCCCACTCAAATTTCCCCTTCTTGAATGAATGCGTCAAGGAAAAGCCATCGATCCATAGTTAGCTACGAATCGCCGGTAGTATCCCGTGAGACCTAGGAATCCCCTTAATTCTTTAATATTCCTGGGGCTCGGCCATCCCACCATTGCTTCGAT
CTTTGCCGGGTCCGCAGACACGCCTTCAGCAGATATGAAATGCCCCAAGTACTCGATACGGCGCAACCCAAACTGGCATTTCTTGGCGTTGGCCACAAACGCATGTTCAATCAAAACCTCCAAAACCTGAGCCAGGTGCTCCCTATGCTCCCCAATGGTCATACTGTAAATGAGAATGTCATCAAAGAAAACCAACACACACTCTCACGCAGATACGGGCGCAAAATGTCGTTCATAATAGACTGGAAGGTGGTGGAGCATTTCTCAATCCGAAAGGCATGACAACGAACTCATAATGGCCTCATGGGTTTCGAACATTTGTTTTTTTGTGCACATCCGTAGGTTTAACGCGAATCACGGTGGTAACCAGGCCTTCAAATCAATTTTGGAAAAAATCGTCGCTCCATGAAGTTCATCCAGAAGCTCATCCACCAAGGGAATGGGATACTTATCTGGTACGGTGACATGATTCAGAGCCCTGTAATCAACACAAAATCGCCAACTACCATCCTTCTTCCTCACTAGCAGCACTGGACTCGAAAATGCGCTCGTGCTAGGGCGGATTATCCCTGCCAGTAACATTTCTCGCACCAATTTTTCAATCTCATTTTTCTGGTATTGCGGATATCGATACGGACGCACATTCACGGAATCCGTTCCCGCCATCAATTCGATGGAATGATCCCTGTTCCTCGGTGGTGGTAATCCTGTCAATGACTCAAACACGGGTGAGCACGAATTAATTAAGGAGTGCAGTTCACTGGGTACCTGAGTTAAGTCGGGAAGATTTTTCGTCTGAACAGCCTCAGCACCTGTGGTCTCAACCATATTCAACTCCACTAACAGCCCCTGATCCTCGGGTCGTAAGGATTTCATCATAGATTTCAACGACACCTGCGCCTTAACCAGGCGTGGATCCCCCTGTAATTCGGCTTCCCATGAGCCCAACACAAATCGTGATCTCGCAAGGAACGGAAATTGAATTCAATCTTCCCCAAGGTCTCCAACCAAGCTACCCCTAGGATCACATCAAGACTTCCGAGGGGTAAAGGAAGGAAATCGTTAACTACTTTAAGCTCGCTAAGTGTAGTTCCACATTTTGCAAATTCCTGCTTTGCCCTTTATCGACTTGTTCTGTTCTCAAGCATAATACCATAATCATGCGATGGTTCCACCGGGAGTTTCAGCTTAGCCACGATCACATCAGATATAAAGTTATGGGTGGCTCCACTGTCGATAAGAACCACTACGAAGACCTTGAATAGATCTCGTGACTTTCAACGTTTTTGGTGAACTCAACCCCGCCATCGAATTTAATGCACAGTATTTGCTAAGTCCCTCACATCCTCTGTGTCACCGATATCCCATCATTGTCATTTGCGTAAAACCGTCCCGGTAATTCCATCCTCAACAACACGTATCTCTAGAGCACAAGCTCCTTCTTCTTACAACGGTGCCCTGGCTCAAATTTCTCGTCACAACGAAAACACAATCCTTTATCTTTACGGATCCGAATCTCACTATCTGACAAACGTTTATAGGGTAATGTGGGACGGTGTACTTGTGGTCATCGGCCTGCAGGAAGTTAGGGCTATAGTTCTCGATTGGTATTTCTGTCGTCCCTCGTGCTCGTAGCCCCGCTTCCTATCTTTGCTAGTAGATGGCTTAAGCTGGCTTTGAAGAAGATCCTACCCTTTTACTTGAGCTCGAAAAACAAGGTCATCCTCAATCACCGAGCCATGAACTTCTTATCCACGAATACCCACGGTCGCAGACTTCCTCATCTCACTTCGATTTCTTCCTTCAATCCACTTTCCCATTTTCCCTCCAACGCACTTTCGCACTAATATCGCGTATGCCCTTAGCATACTTCTCAAACAGGCGCCTATACTCTTTCACAGTTCCCACCTGTTGTAAACTCATAAGATTTGCATATTTGTTGTCATTGATCGTAGTTTGGAAGCGGTGCAACAACAATTCTCGAAACTCCTCCCACGAAG
CAATCGGAGCACGGTCCTCCTCATCATCAGTAACCATTCTAGTGCCTCTCCTTCCATACACAAAGCTGCGGCATCCACTCTTTCGTGCCCCGTCACCGATTCACCAGAAATACCTTTCGACGCGCAAGCATAACCATCCGTCCGGATCCTCATCCGTTAGACCCTTGAACACCGGCATTTCCAACTTTTCGTAACCTCCGATCAAACATTGGCCCATCCCTCACCGCCTATCCCCGGCCCGCCCCTTTCAAATTGATCCCGCCTATCTCCCAAATCCGCGCCCTCACGGCTCTCGGGCCGCCATGGATTTCGCCCCGATTGTTCCAACCGTCATGCGCATTCGGCCTTCAGAGCCGTTGCGCACCCTCCGTGCCTCGGAACCCCGTTTCTCGGTCAAAACCGTCGATCCGGCTGACCCGTTCCTGCCCGGCTCGTTCGGCCAGCCCGACCTCCCGACCAGACTCGGTCCCCCCACGTCCGGACCGGCCCGGCCTTCGGCCGACCTGCGACCTCAGCCCGCGCGCGGCCGCGCCCGCGCGCCGCTTCGCCCGACTCCACCCACCTCGATATCGCCCGCTGCTGCGCGCGCGCGCCCTCGCGCCCGCTCGTGCGGCCGACACATCGCCCGGCCAGCGCTGCGCGCGCGCGCGCCGCCTCGCCGGTCCGACTCCACCCGGCCGATCCGCATCGCCCGCACCGCGCACCGCGCGCGCGCGCGCGCCGGCCCGCTCATCGCCTGCTCGCTGCGCGTACCACGCGCGCCGCCCTCGTCCGGCTCGGTCTCGCCCAGCCCGACACCGCCCCACGCGTCCGCATGCCCTACCGCTCCGCCGCGCGCCCGGCCTGTTCCCTGCCCGACCGCGCCGCTCCTGCCCGATGTCGCCCCTGTTTCCTCCAGATTCTCCTCCGCCGTAGCAGAAGCCGCAGCTGCCCAGACCTCCCTCCGTCCTCTCGGTCATTAATCCCTTCCCTTTGTCAGCCCGAATCGATTCCATTATTAGTTCTAAATTCCGGGTTATCGACTCAAACTTTAGATCCTGGGAATCTATCTTTTTTCCCAACCGCTCCTCCATCGCTTCCATCTTCCCATTAGTTTCTTGCAACTTCCTCGCCAAGTCGGCCACTCCTTCTTCACATTCTTGTATCCTGGATTCCATCTTCGTAGTAACCATTCCCGGATCGGTAAATGGCTCTGATACCAAAATGATAGAATCTACCACTATTTTATTAAGAACTAAGATAATCTCTCGAGAGGATCTCTCAAAGAGCCCAAATACAAAGACAACAACTGTTTGAATGAATTGCTAGAGGCCTAACATATCCTATATATAGTAAACCTAGAAGATGACCTAATTTAGTAATAACTCTAATAAGGCCCATCGGCCCAATACAACTGAATTACAATTAAAGACTATTAAATATAATTAATAAATAACTAATTTCATAATGTCTTGTTCCGCCTCGAGTAAACCTTCAGCGGGCCCGGGTTCGTATCACATACTTTCGTTGAGACAAAAAGATCCCTTTCTTACTTCTCATGACCTCAATACCCAAGAAATATTTCAAATTTCCAAGATCTTTAGTGCTGAAATGGCTATGAAGGAAAACCTTTAGTGAGGCCATCCCTGAACTATCACTTCCAGTAATCACAATATCGCCGACATATACCACAAGTAACACAATCCCATGGTCGGACCGGCGGAAGAAGACAGTAATGATCTCAAGAGCATTTCGCATTCCAAAGCTAGCATAAGAACTTTTCTTGAACCTACCAAACCACGCTCGCGGGCTTTGTTTCAAGCCATACAGAGATTTACTGAAGACGGCACACTTTGGCATTCTCCCCGAGCAACAAACCCGGTGGTTGCTCCATATAGACCTCCTCAAGAAGATCACCATGCAAAAAACGCATTCTTAATATCCAAATGAAATAAAGGCCAATCATTCATTCTCGCAAGAGAAATAAGCAGATTTAACGGACGACATCTTGGCAGCAGGAG
AGAAGGTCTCACCGTAATCAACCCCATATGTCTGAGCATACCCCTTAGCAACCAAGCGTGCTTTGAGGCGAGCAACGGACCCGTCAGAATTCACTTTGACAGCAAACACCCACTTACAACCAATCGTCTGTTTTCCTTCTGGACGAGCTACCAGTTCCCAAGTACCATTCGCATCCAACGCAGCCACCTCCTCTACCATCGCCTGGTGCCAACTGAATGGGACAAGGCTTCACGAGTACTTTTAGGGACAGAGACAGAATCAAGGGAAGTCAGAAAGGAAAACGAAGTAGGGGATAATTGATGGTATGACACATATGTGGAGATAGGATACGTGCAAGAACGTTTACCTTTGCGAAGAGCAATGGGGTTGATTACAGTGAGACCTTCTCTCCTGTTGCCAAGATGTCGTCCGTTCGTCTGTTTATTTCTCTTGCAGCAATGAATGATTGGCCTTTATTTCAGTTGGATATTAAGAATGCGTTTTTGCATGGTGATCTTCTTGAGGAGGTCTATATGGAGCAACCACCTGGGTTTGTTGCTCAGGGGGAGAATGCCAAAGTGTGCCGTCTTCATTTTGGACCGAACGAAGTCGCGACTTCCACGACTGTGTAGATCGGTGGCTGGGCTGCGATTTCCCGGCGGCGGCGGCGGTCAACGGCGGTCAACGGCGGCAGCGGCGGCGGTCAACCCGAAAAAAAACCTAAAAAATTCTCAACCTAACTGTGTGGCTCTGATACCATGTAAAACAATATGATCCAAAACAGTATCTGTTATTATTGAGAGGCAACTTCTTTATATAAGAGTTACAATGGGCCTAGCGGGCCTAACGGGCCTAGATCTATTACACAGTACATTATATACAATCACACATTTAGACTCTAACAATTTAAACCAGGTTGGACCGAACCCATCCTCCTATTAGGGATTTTATTAGATAATATGAGATTCTACCTCAAAACCAATTGATAATGAGAGGAGTTATCTATATCTCTTATAAAGATGTCGAAGAATTCTCACATTTCCAATGTGGGACAATTGCCACAAAATGAGTGTCCCCAATATGTCTCCTCAAGATGGGTGTCATATAGGGAAGACCATTTTAGATATGATCCAAGTTTGTAATGAGAGAACTTGGATATGTACTTTATGTTCAATAAATATCCGCAAGTTTCATTAGTTGCTTGCCTTTTTTTCTTCCCTAATAATGCCAAATAATGCCAGTGGCAATGCAAGAGGAAAGTTTGTTAAAAGAAAAGGGGAGATGAGAAGTTAGGGAAAAAAATGGTGACATACCCGTGTCAATCTGTTCCGTTCTCATGAGTGGGTATCCCCAAGGCCTTTGAGGGTATGTTTCCCTCAAGGTCTTGGGTTCGAGACTCACATGTGATGTTACTTCTTCGATGTCTCCGGTGCCTGGCCTAGGGATGGACGTGGTTACCCTTGTTTCAAACAAAAAAAAGTTTTCATGAGTGGGTCTTACATTTTTGGGAACACATGGACCTAACCTAAATGAACAATTTCTATTCTGTCAATGCTATGGTGTCAAACTCAACAGATTCCTTGATCCAGTCTAATGGCATCTGCTAATCTGCTGCAGGACAAGAAGCTGTCACAGAGAATGCGCATAAAATCGTTGATTGCCTTGCCAGACTTACTTCTTTTTCTCACATGATGGTATGAAGTCATTTTCTTGATCGTCTTTTATTGTGTGTATTGCAAATTGCAATGCAATCGTTATGACTGGACTGCTGATATTCTCCAGATATATAAATGCATTTTGTTTCCTCTATCTGCTCTTTGCTTTCACATTTACCGTCCTCTTCCCAGTTGTATGATATCTCATACAATTCATACTCTTCCGCGCGGCTACTTAACTTAGATATGGTTTTAGTATTTTGTATCTTCTAGATTTCCTTGCATGTTGTGTTACCATTTGATAAACAACAATTATGCTACTATTATACTAAAGCCCGAAGGATATGCATTATCCAGCTTGTACG
AGAAACTGCAATTCAGTGCCTTGTTGCTGTGTCCGAGCTGCCCCATGCAAGGATATACCCCATGAGAAAACAGGTAGCCCAAGGATTATTCCTTGCAATCAGTACTTCATATTCTTCTTATAATGCTGAATAATATTCTGCCGTTTTCAGGTACTACATGCAATATCAAAAGCTCTTGATGATCCAAAGAGAGCTGTTCGACTGGAAGCTGTCCGAGCTCGACAAGCATGGTAAGTGCTGCGTGATGCGTATGTGATTTCATGAACATACTACTGCTACTACTGACCGAAAATATAATAAATTGTGCTTAATTCCTTTTATAATTTCACCACCAGGGCATCAATTGCATCAAGAAGTCTTCATTTCTGATGGTCCATTGCAAGCAATTTTCCTAGTTTGGCAGCTGATCACATTTGAAGTTGTCGATCTAGAGTGTATTTTGCATTTATCATATAATCAATGTACATATACCCGTTGGGGCAAGCTTTCTTTTTTAGATAAGAGGTAGTTGATTCATTTTATTGTGACGAAGAGAGAGCTGTATGAACATGATTTTAGAAAAAAAAAATGTCAGTATAGATTGCATATTGACTTTGGACATGAAGGCAAGTACAATAGATGTCTTGAATTTAAAAAAATGAGTGTTTTTGTATGGACTTTTTATTATTATTAGCATTTGTGTTGGACTTGGACTGTATAAATTCTTAAAACCTACATCCTTTATTCCAACTTTGTAATCTTATCTTCCAATAAAATGGTATCTATTTAGGATT

mRNA sequence

CTTCATCTCTTCCAAACAACCCTTTAAGAGTTCTTAAAGAAAAAAAGAAAAAGAAAACCCAAAAGGGTCGCGTGTATATAAATTGATTCTACAAGAGGAATCTATTCTCAGTTCTTCGAAGGCAGAAAAACCTTCTTCCGCTAAGTTCTTCACGCGGCTAAAACCCATTCCATAAACCCCAAAATGGGAGAGCTCAGTTCGCTTACACAGTACATCGAATCGTTCGTCGACGCATCTCGTACTGCATCTCAACAGGCCACAAGCTTGGAAGCAATCATCTCCCTTACGAAGAACAATGCACTAACAATAGAAACATTGGTTAGAGAGATGGGAATGTATTTGACAATTACTGATAACATTATTCGAGGCAGAGGTATACTTCTTCTTGGGGAACTACTTGCATGTCTTGCATCGAAGCCTCTAGATGATGCAACAATACACAGTCTAATGACATTCTTCACTGAGAGACTGGCAGATTGGAAAGCTTTACGAGGCGCCCTTGTTGGCTGCTTGGCACTGATGAGGAGGAAAACAAACGTTGGTACAGTTTCTCAGAATGATGCAAAGTCTGTTGCCCAGTCATATTTTCAAAATCTTCAAGTTCAGTCTTTGGGACAACATGATCGGAAGCTCAGTTTTGAACTTTTGGCGTGTTTGTTGGAACATTACCCTAATGCAGTTGTTTCACTGGGTGATGATCTTGTATATGGAATCTGTGAAGCCATTGATGGTGAAAAAGATCCACACTGCTTAATGCTTACTTTTCACATCGTTGAGCTTGTGGCAAAGCTATTCCCAGATCCAACTGGAACACTTGCAAATAGTTCTAGCGATCTTTTTGAATTCCTGGGTTGCTATTTTCCTATCCACTTCACACATGGAAAAGAGGAGGATGTAGATGTAAGAAGGAATGATCTTTCGCAGGCACTTATGATTGCCTTCTCTTCTACCCCCCTCTTTGAGCCATTTGCAGTTCCTTTGCTTCTCGAGAAACTTTCTTCTTCATTGCCACTAGCAAAGATTGATTCTTTGAAGTACCTAAGTGATTGCACCGTAAAATATGGGGCAGATAGAATGGAAAAGCATAGTGCATCTATCTGGTCTTCAGTAAAGGAGATCTTATTTACATCAATAGGACAGCCTTCTTTGTCCATTAACTTAGAATCATTAAGTAGTCCTAGCTTTGAAGGGAATGAAATTACAACTGAAGCTCTACGACTCCTGCAGAAGATGGTCGTGGAGAGTAATGGATTATTTTTAAGATTAATTATCAACGATGAAGATATAAAGGATATTTTCAGCATCCTAAATATTTATACATGTTACAACGACTTCCCTTTGCATAGCAGGCAGAGACTAAATGCAGTTGGCCATATCCTTTACAAGTCAGCGAATGCATCTCTTGCTTCCTGTGGTCACGTGTTTGAAAGTTTCTTCCCTCGTTTGCTGGATATTGTCGGGATTTCTGCGGATCAGCCTCATAATAACAAAATTTCTCCAAAGAATTTTAATTTTGGGGCCCTCTATCTCTGTATTGAACTTCTTGTAGCTTGCAGAGATCTGATTGCAAGCTCTGATGAACACATATGCTTTGTTAAAGAAAAATTATACGGCATGCTTCAAACCTTTTCATGTTCAATGGTTCATCTCCTCAATTCTATCTTTCCAGTAATTGTTAAGAAGGATCTGCATGATGCTGAGTTCTACTGTGCAGTAAAGGGCTTGCAGAATCTGGCCACATTCCCTGTAGGCTCTTCACCAGTATCAAGAGTCATATTTGAGGATATTTTGCTGGGACTCATGTCATTTATAACAGCGGACTTCAAATTTGCATCGTTGTGGAATCATGCCTTGAATTCATTACAGCATATTGGTTCATTTGTTGACAAATATCCAGAGTCTCTGGAATTGCAAAGTTTCATGCATGTTGTTGTTGAAAAGATTGCATCAATGTTCTCTCTTCATGAAGAGACCTTGCCATTGTCGATTAAACTGGAACT
GGCATTGAACATTGGTAGAACTGGACGGAGTTATATGCTGAAAATTGTTCAGGGGATTGAAGAGGCAACATTCTTCCATTTATCTGAGGTTTATGTCAATGGCAACTCAAAGTCGGTGGAGATTCTATTGTCCCTGTTGGATTGTTACTCGACCAAAATTCTTCCATGGCTTGATGAAGTTGGTGGTTTTGAGGAAGTCATATTGCGAATTGCATTAAACATTTGGGATCAGATTGAAAAATGTTCAGTTTTTAGCGCTTTGCTGGATAAAGTGCTTCTAGATGCTACCATGTTGGCTATGAAGCTCTCTGTTCGAAGTTGCTCAAAGGAAAGCCAGAATGTTGTAATCCAAAAGGCATTTGATGTATTATTAACCAGCAATTTTACTCCTTTGAAATTACCATCATCTACTACAGTACCACTTCAGATGGAGGGCTTACAACTTCTGAAGCAGAAAGATAGTCCACTTTGTAGAGATGAATGGATTCTTTTATTATTTGCATCAGTCACTATAGGACTTCGTCCACAAACACAGATTCCAGATGTGAGATCAGTAATACATTTGCTTATGTTATCCATCACCAGGGGCTGCATACCAGCTGCACAAGCACTAGGTTCTATAATCAATAAATTGAGTCTGAAATCAGATAAAGTAGAAGTTTCAAGTTACGTTTCATTGGAAGAAGCAATTGATATTATTTTCAAAACCAAATTTAGGTGCTTCCATAACGGAAGTACTCTTGCAGGCAGTGAGATGTTTCTCACTGATTTATGCTCTAGCATTGAAAAAAGTTCTTTACTTCAAGTTCATGTTGTGGTTGGATTATCATGGATTGGAAAAGGTCTGCTTCTTTGTGGTCATGAAAAGGTCCGTGATATAACTATGGTTTTATTGGAGTGCTTACTATCAAAAAGCAGAACAGATGCCTCATCCTTGCAGCAGGTTATACTGGAAAAAGATTATGAGCCGAACTTCGACTTTGCAATAGTGAAGGGTGCAGCAGATGCATTTCACATTCTCATGAGTGATTCTGAAGCTTGTTTGAACCGTAAATTTCATGCAATAGTACGGCCACTTTATAAGCAGCGTTTTTACTCTACCATGATGCCTATTTTCCAGTCTCTAGTAAGCAAATCAGATGCATCACTTTCTCGATATATGTTGTACAAGGCATTTGCACATGTTATAACCGATACTCCACTCACTGCCATATTGAGTGATGCGAAGAAGCTTATACCTATGCTTCTGGATGGCTTGCTAACATTAAGTGTGAACGTCATCGACAAGGATGTGGTTTATGGCCTTCTTCTTGTTTTATCAGGGATCTTAACTGATAAAAATGGACAAGAAGCTGTCACAGAGAATGCGCATAAAATCGTTGATTGCCTTGCCAGACTTACTTCTTTTTCTCACATGATGCTTGTACGAGAAACTGCAATTCAGTGCCTTGTTGCTGTGTCCGAGCTGCCCCATGCAAGGATATACCCCATGAGAAAACAGGTACTACATGCAATATCAAAAGCTCTTGATGATCCAAAGAGAGCTGTTCGACTGGAAGCTGTCCGAGCTCGACAAGCATGGGCATCAATTGCATCAAGAAGTCTTCATTTCTGATGGTCCATTGCAAGCAATTTTCCTAGTTTGGCAGCTGATCACATTTGAAGTTGTCGATCTAGAGTGTATTTTGCATTTATCATATAATCAATGTACATATACCCGTTGGGGCAAGCTTTCTTTTTTAGATAAGAGGTAGTTGATTCATTTTATTGTGACGAAGAGAGAGCTGTATGAACATGATTTTAGAAAAAAAAAATGTCAGTATAGATTGCATATTGACTTTGGACATGAAGGCAAGTACAATAGATGTCTTGAATTTAAAAAAATGAGTGTTTTTGTATGGACTTTTTATTATTATTAGCATTTGTGTTGGACTTGGACTGTATAAATTCTTAAAACCTACATCCTTTATTCCAACTTTGTAATCTTATCTTCCAATAAA
ATGGTATCTATTTAGGATT

Coding sequence (CDS)

ATGGGAGAGCTCAGTTCGCTTACACAGTACATCGAATCGTTCGTCGACGCATCTCGTACTGCATCTCAACAGGCCACAAGCTTGGAAGCAATCATCTCCCTTACGAAGAACAATGCACTAACAATAGAAACATTGGTTAGAGAGATGGGAATGTATTTGACAATTACTGATAACATTATTCGAGGCAGAGGTATACTTCTTCTTGGGGAACTACTTGCATGTCTTGCATCGAAGCCTCTAGATGATGCAACAATACACAGTCTAATGACATTCTTCACTGAGAGACTGGCAGATTGGAAAGCTTTACGAGGCGCCCTTGTTGGCTGCTTGGCACTGATGAGGAGGAAAACAAACGTTGGTACAGTTTCTCAGAATGATGCAAAGTCTGTTGCCCAGTCATATTTTCAAAATCTTCAAGTTCAGTCTTTGGGACAACATGATCGGAAGCTCAGTTTTGAACTTTTGGCGTGTTTGTTGGAACATTACCCTAATGCAGTTGTTTCACTGGGTGATGATCTTGTATATGGAATCTGTGAAGCCATTGATGGTGAAAAAGATCCACACTGCTTAATGCTTACTTTTCACATCGTTGAGCTTGTGGCAAAGCTATTCCCAGATCCAACTGGAACACTTGCAAATAGTTCTAGCGATCTTTTTGAATTCCTGGGTTGCTATTTTCCTATCCACTTCACACATGGAAAAGAGGAGGATGTAGATGTAAGAAGGAATGATCTTTCGCAGGCACTTATGATTGCCTTCTCTTCTACCCCCCTCTTTGAGCCATTTGCAGTTCCTTTGCTTCTCGAGAAACTTTCTTCTTCATTGCCACTAGCAAAGATTGATTCTTTGAAGTACCTAAGTGATTGCACCGTAAAATATGGGGCAGATAGAATGGAAAAGCATAGTGCATCTATCTGGTCTTCAGTAAAGGAGATCTTATTTACATCAATAGGACAGCCTTCTTTGTCCATTAACTTAGAATCATTAAGTAGTCCTAGCTTTGAAGGGAATGAAATTACAACTGAAGCTCTACGACTCCTGCAGAAGATGGTCGTGGAGAGTAATGGATTATTTTTAAGATTAATTATCAACGATGAAGATATAAAGGATATTTTCAGCATCCTAAATATTTATACATGTTACAACGACTTCCCTTTGCATAGCAGGCAGAGACTAAATGCAGTTGGCCATATCCTTTACAAGTCAGCGAATGCATCTCTTGCTTCCTGTGGTCACGTGTTTGAAAGTTTCTTCCCTCGTTTGCTGGATATTGTCGGGATTTCTGCGGATCAGCCTCATAATAACAAAATTTCTCCAAAGAATTTTAATTTTGGGGCCCTCTATCTCTGTATTGAACTTCTTGTAGCTTGCAGAGATCTGATTGCAAGCTCTGATGAACACATATGCTTTGTTAAAGAAAAATTATACGGCATGCTTCAAACCTTTTCATGTTCAATGGTTCATCTCCTCAATTCTATCTTTCCAGTAATTGTTAAGAAGGATCTGCATGATGCTGAGTTCTACTGTGCAGTAAAGGGCTTGCAGAATCTGGCCACATTCCCTGTAGGCTCTTCACCAGTATCAAGAGTCATATTTGAGGATATTTTGCTGGGACTCATGTCATTTATAACAGCGGACTTCAAATTTGCATCGTTGTGGAATCATGCCTTGAATTCATTACAGCATATTGGTTCATTTGTTGACAAATATCCAGAGTCTCTGGAATTGCAAAGTTTCATGCATGTTGTTGTTGAAAAGATTGCATCAATGTTCTCTCTTCATGAAGAGACCTTGCCATTGTCGATTAAACTGGAACTGGCATTGAACATTGGTAGAACTGGACGGAGTTATATGCTGAAAATTGTTCAGGGGATTGAAGAGGCAACATTCTTCCATTTATCTGAGGTTTATGTCAATGGCAACTCAAAGTCGGTGGAGATTCTATTGTCCCTGTTGGATTGTTACTCGACCAAAATTCTTCCATGGCTTGATGAAGTTGG
TGGTTTTGAGGAAGTCATATTGCGAATTGCATTAAACATTTGGGATCAGATTGAAAAATGTTCAGTTTTTAGCGCTTTGCTGGATAAAGTGCTTCTAGATGCTACCATGTTGGCTATGAAGCTCTCTGTTCGAAGTTGCTCAAAGGAAAGCCAGAATGTTGTAATCCAAAAGGCATTTGATGTATTATTAACCAGCAATTTTACTCCTTTGAAATTACCATCATCTACTACAGTACCACTTCAGATGGAGGGCTTACAACTTCTGAAGCAGAAAGATAGTCCACTTTGTAGAGATGAATGGATTCTTTTATTATTTGCATCAGTCACTATAGGACTTCGTCCACAAACACAGATTCCAGATGTGAGATCAGTAATACATTTGCTTATGTTATCCATCACCAGGGGCTGCATACCAGCTGCACAAGCACTAGGTTCTATAATCAATAAATTGAGTCTGAAATCAGATAAAGTAGAAGTTTCAAGTTACGTTTCATTGGAAGAAGCAATTGATATTATTTTCAAAACCAAATTTAGGTGCTTCCATAACGGAAGTACTCTTGCAGGCAGTGAGATGTTTCTCACTGATTTATGCTCTAGCATTGAAAAAAGTTCTTTACTTCAAGTTCATGTTGTGGTTGGATTATCATGGATTGGAAAAGGTCTGCTTCTTTGTGGTCATGAAAAGGTCCGTGATATAACTATGGTTTTATTGGAGTGCTTACTATCAAAAAGCAGAACAGATGCCTCATCCTTGCAGCAGGTTATACTGGAAAAAGATTATGAGCCGAACTTCGACTTTGCAATAGTGAAGGGTGCAGCAGATGCATTTCACATTCTCATGAGTGATTCTGAAGCTTGTTTGAACCGTAAATTTCATGCAATAGTACGGCCACTTTATAAGCAGCGTTTTTACTCTACCATGATGCCTATTTTCCAGTCTCTAGTAAGCAAATCAGATGCATCACTTTCTCGATATATGTTGTACAAGGCATTTGCACATGTTATAACCGATACTCCACTCACTGCCATATTGAGTGATGCGAAGAAGCTTATACCTATGCTTCTGGATGGCTTGCTAACATTAAGTGTGAACGTCATCGACAAGGATGTGGTTTATGGCCTTCTTCTTGTTTTATCAGGGATCTTAACTGATAAAAATGGACAAGAAGCTGTCACAGAGAATGCGCATAAAATCGTTGATTGCCTTGCCAGACTTACTTCTTTTTCTCACATGATGCTTGTACGAGAAACTGCAATTCAGTGCCTTGTTGCTGTGTCCGAGCTGCCCCATGCAAGGATATACCCCATGAGAAAACAGGTACTACATGCAATATCAAAAGCTCTTGATGATCCAAAGAGAGCTGTTCGACTGGAAGCTGTCCGAGCTCGACAAGCATGGGCATCAATTGCATCAAGAAGTCTTCATTTCTGA

Protein sequence

MGELSSLTQYIESFVDASRTASQQATSLEAIISLTKNNALTIETLVREMGMYLTITDNIIRGRGILLLGELLACLASKPLDDATIHSLMTFFTERLADWKALRGALVGCLALMRRKTNVGTVSQNDAKSVAQSYFQNLQVQSLGQHDRKLSFELLACLLEHYPNAVVSLGDDLVYGICEAIDGEKDPHCLMLTFHIVELVAKLFPDPTGTLANSSSDLFEFLGCYFPIHFTHGKEEDVDVRRNDLSQALMIAFSSTPLFEPFAVPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEKHSASIWSSVKEILFTSIGQPSLSINLESLSSPSFEGNEITTEALRLLQKMVVESNGLFLRLIINDEDIKDIFSILNIYTCYNDFPLHSRQRLNAVGHILYKSANASLASCGHVFESFFPRLLDIVGISADQPHNNKISPKNFNFGALYLCIELLVACRDLIASSDEHICFVKEKLYGMLQTFSCSMVHLLNSIFPVIVKKDLHDAEFYCAVKGLQNLATFPVGSSPVSRVIFEDILLGLMSFITADFKFASLWNHALNSLQHIGSFVDKYPESLELQSFMHVVVEKIASMFSLHEETLPLSIKLELALNIGRTGRSYMLKIVQGIEEATFFHLSEVYVNGNSKSVEILLSLLDCYSTKILPWLDEVGGFEEVILRIALNIWDQIEKCSVFSALLDKVLLDATMLAMKLSVRSCSKESQNVVIQKAFDVLLTSNFTPLKLPSSTTVPLQMEGLQLLKQKDSPLCRDEWILLLFASVTIGLRPQTQIPDVRSVIHLLMLSITRGCIPAAQALGSIINKLSLKSDKVEVSSYVSLEEAIDIIFKTKFRCFHNGSTLAGSEMFLTDLCSSIEKSSLLQVHVVVGLSWIGKGLLLCGHEKVRDITMVLLECLLSKSRTDASSLQQVILEKDYEPNFDFAIVKGAADAFHILMSDSEACLNRKFHAIVRPLYKQRFYSTMMPIFQSLVSKSDASLSRYMLYKAFAHVITDTPLTAILSDAKKLIPMLLDGLLTLSVNVIDKDVVYGLLLVLSGILTDKNGQEAVTENAHKIVDCLARLTSFSHMMLVRETAIQCLVAVSELPHARIYPMRKQVLHAISKALDDPKRAVRLEAVRARQAWASIASRSLHF
Homology
BLAST of Sed0021872 vs. NCBI nr
Match: XP_038898520.1 (MMS19 nucleotide excision repair protein homolog isoform X1 [Benincasa hispida])

HSP 1 Score: 1899.0 bits (4918), Expect = 0.0e+00
Identity = 978/1147 (85.27%), Positives = 1050/1147 (91.54%), Query Frame = 0

Query: 1    MGELSSLTQYIESFVDASRTASQQATSLEAIISLTKNNALTIETLVREMGMYLTITDNII 60
            M ELS LTQYIESFVD SRT SQQATSLEAI SL KNN LTIETLVREMGMYLTITDNII
Sbjct: 1    MAELSKLTQYIESFVDVSRTPSQQATSLEAITSLAKNNVLTIETLVREMGMYLTITDNII 60

Query: 61   RGRGILLLGELLACLASKPLDDATIHSLMTFFTERLADWKALRGALVGCLALMRRKTNVG 120
            RGRGILLLGELLACLASKPLD ATIHSL+ FFTERLADWKALRGALVGCLALMRRK+NVG
Sbjct: 61   RGRGILLLGELLACLASKPLDGATIHSLIAFFTERLADWKALRGALVGCLALMRRKSNVG 120

Query: 121  TVSQNDAKSVAQSYFQNLQVQSLGQHDRKLSFELLACLLEHYPNAVVSLGDDLVYGICEA 180
            T+SQNDAKSVAQSYFQNLQVQSLGQHDRKLSFELLACLLEHYP+AVVSLGDDLVYGICEA
Sbjct: 121  TISQNDAKSVAQSYFQNLQVQSLGQHDRKLSFELLACLLEHYPDAVVSLGDDLVYGICEA 180

Query: 181  IDGEKDPHCLMLTFHIVELVAKLFPDPTGTLANSSSDLFEFLGCYFPIHFTHGKEEDVDV 240
            IDGEKDPHCLMLTFHIVELVAKLFPDP+GTLA+SSSDLFEFLGCYFPIHFTHGKEEDVDV
Sbjct: 181  IDGEKDPHCLMLTFHIVELVAKLFPDPSGTLASSSSDLFEFLGCYFPIHFTHGKEEDVDV 240

Query: 241  RRNDLSQALMIAFSSTPLFEPFAVPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEK 300
            RRNDLSQALM AFSSTPLFEPFA+PLLLEKLSSSLPLAKIDSLKYLSDC++ YGADRM+K
Sbjct: 241  RRNDLSQALMRAFSSTPLFEPFAIPLLLEKLSSSLPLAKIDSLKYLSDCSLNYGADRMKK 300

Query: 301  HSASIWSSVKEILFTSIGQPSLSINLESLSSPSFEGNEITTEALRLLQKMVVESNGLFLR 360
            HS +IWSSVKEI+FTSIGQPSLSIN+ESL+SPSF+ NEITTEAL LLQKMVVESN  FLR
Sbjct: 301  HSEAIWSSVKEIIFTSIGQPSLSINIESLNSPSFQENEITTEALILLQKMVVESNEFFLR 360

Query: 361  LIINDEDIKDIFSILNIYTCYNDFPLHSRQRLNAVGHILYKSANASLASCGHVFESFFPR 420
            LII+DEDIKDIF+ILNIYTCYNDFPL SRQRLNAVGHILYKSANAS+ASC HVFESFFPR
Sbjct: 361  LIIDDEDIKDIFNILNIYTCYNDFPLQSRQRLNAVGHILYKSANASVASCDHVFESFFPR 420

Query: 421  LLDIVGISADQPHNNKISP-KNFNFGALYLCIELLVACRDLIASSDEHICFVKEKLYGML 480
            LLD VGIS  + HNNK+SP +NFNFGALYLCIELL ACRDLIASSDE  C VKEK Y ML
Sbjct: 421  LLDFVGISVGRSHNNKVSPSRNFNFGALYLCIELLAACRDLIASSDEPTCSVKEKSYCML 480

Query: 481  QTFSCSMVHLLNSIFPVIVKKDLHDA-EFYCAVKGLQNLATFPVGSSPVSRVIFEDILLG 540
            QT SCS+V LL+S F  IVKKDLHD  EFYCAVKGL+NL+TFPVGSSPVSRVIFEDILL 
Sbjct: 481  QTSSCSLVQLLSSTFSGIVKKDLHDTEEFYCAVKGLRNLSTFPVGSSPVSRVIFEDILLE 540

Query: 541  LMSFITADFKFASLWNHALNSLQHIGSFVDKYPESLELQSFMHVVVEKIASMFSLHEETL 600
             MSFIT +FKF SLWNHAL +LQHIGSFVDKY  S+E QS+MH+VVEKIASMF  H+E L
Sbjct: 541  FMSFITVNFKFGSLWNHALKALQHIGSFVDKYHGSVESQSYMHIVVEKIASMFCPHDEAL 600

Query: 601  PLSIKLELALNIGRTGRSYMLKIVQGIEEATFFHLSEVYVNGNSKSVEILLSLLDCYSTK 660
            PL +KLE+A +IGRTG SYMLKIV GIE+A FFHLSEVYV GN+KSVEILLSLL CYS K
Sbjct: 601  PLMLKLEMAFDIGRTGGSYMLKIVLGIEDAIFFHLSEVYVYGNAKSVEILLSLLACYSNK 660

Query: 661  ILPWLDEVGGFEEVILRIALNIWDQIEKCSVFSALLDKVLLDATMLAMKLSVRSCSKESQ 720
            +LPW DE G FEEVIL+ ALNIWDQIEKCS  S L+DKVLLDATMLA+KLSVRSCSKESQ
Sbjct: 661  VLPWFDEAGDFEEVILQFALNIWDQIEKCSTLSTLMDKVLLDATMLALKLSVRSCSKESQ 720

Query: 721  NVVIQKAFDVLLTSNFTPLKLPSSTTVPLQMEGLQLLKQKDSPLCRDEWILLLFASVTIG 780
            NV+IQKAF+VLLTS+F+PLKL  STT+P+QME LQLL+QKD+PL RDEWI  LFASV I 
Sbjct: 721  NVIIQKAFNVLLTSSFSPLKLALSTTIPVQMEDLQLLQQKDNPLSRDEWIFSLFASVIIA 780

Query: 781  LRPQTQIPDVRSVIHLLMLSITRGCIPAAQALGSIINKLSLKSDKVEVSSYVSLEEAIDI 840
            LRPQ  +PDVR V+HLLMLSITRGC+ AAQALGS+INKLS+KSDKVE SSYVSLEEA+DI
Sbjct: 781  LRPQIHVPDVRLVMHLLMLSITRGCVLAAQALGSMINKLSMKSDKVEDSSYVSLEEAMDI 840

Query: 841  IFKTKFRCFHNGSTLAGSE--MFLTDLCSSIEKSSLLQVHVVVGLSWIGKGLLLCGHEKV 900
            IFKT+FRCFHN S   GSE  MFLTDLCSSIEKSS LQVH VVGLSWIGKGLLLCGHEKV
Sbjct: 841  IFKTEFRCFHNESAGDGSEMRMFLTDLCSSIEKSSSLQVHAVVGLSWIGKGLLLCGHEKV 900

Query: 901  RDITMVLLECLLSKSRTDASSLQQVILEKDYEPNFDFAIVKGAADAFHILMSDSEACLNR 960
            RDITMV L+CL+SKSRTDAS LQQ ILEKD + N DFA+++ AADAFHILMSDSEACLNR
Sbjct: 901  RDITMVFLQCLVSKSRTDASPLQQFILEKDNQTNLDFAVMEVAADAFHILMSDSEACLNR 960

Query: 961  KFHAIVRPLYKQRFYSTMMPIFQSLVSKSDASLSRYMLYKAFAHVITDTPLTAILSDAKK 1020
            KFHAIVRPLYKQRF+STMMPIFQ+LVSKSD SLSRYMLY+AFAHVI+DTPL+A+LSDAKK
Sbjct: 961  KFHAIVRPLYKQRFFSTMMPIFQTLVSKSDTSLSRYMLYQAFAHVISDTPLSAVLSDAKK 1020

Query: 1021 LIPMLLDGLLTLSVNVIDKDVVYGLLLVLSGILTDKNGQEAVTENAHKIVDCLARLTSFS 1080
            LIPMLLDGLLTLSVN+I+KDVVY LLLVLSGIL  +NGQEAVTENAHKIVDCLA LT+FS
Sbjct: 1021 LIPMLLDGLLTLSVNIINKDVVYSLLLVLSGILMGRNGQEAVTENAHKIVDCLAGLTAFS 1080

Query: 1081 HMMLVRETAIQCLVAVSELPHARIYPMRKQVLHAISKALDDPKRAVRLEAVRARQAWASI 1140
            HMMLVRETAIQCLVAVSELPHARIYPMR+QVLHAISKALDDPKRAVR EAVR RQAWASI
Sbjct: 1081 HMMLVRETAIQCLVAVSELPHARIYPMRRQVLHAISKALDDPKRAVRQEAVRCRQAWASI 1140

Query: 1141 ASRSLHF 1144
            ASRSLHF
Sbjct: 1141 ASRSLHF 1147
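
The percentages in each HSP summary line are the identical or positive-scoring positions divided by the alignment length (for the hit above, 978/1147 and 1050/1147). The following is a minimal sketch, not part of the database output, of how such a line could be parsed and its percentages rechecked. The names `HSP_STATS`, `parse_hsp_stats` and `percent` are hypothetical helpers introduced only for illustration, and the pattern assumes the line layout shown on this page (it accepts both the "Postives" and "Positives" spellings).

```python
import re

# Assumed layout: "Identity = a/b (x%), Positives = c/d (y%), Query Frame = 0".
# "Posi?tives" tolerates the "Postives" spelling seen in some BLAST exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, length):
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(ident_len),
        "positives_length": int(pos_len),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


# Summary line of the Benincasa hispida hit, copied from the report above.
stats = parse_hsp_stats(
    "Identity = 978/1147 (85.27%), Postives = 1050/1147 (91.54%), Query Frame = 0"
)
recomputed_identity = percent(stats["identities"], stats["length"])
recomputed_positives = percent(stats["positives"], stats["positives_length"])
```

Comparing `recomputed_identity` and `recomputed_positives` with the reported `identity_pct` and `positives_pct` is a quick consistency check when these lines are copied by hand or scraped from the page.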

BLAST of Sed0021872 vs. NCBI nr
Match: XP_022953370.1 (MMS19 nucleotide excision repair protein homolog isoform X2 [Cucurbita moschata])

HSP 1 Score: 1892.5 bits (4901), Expect = 0.0e+00
Identity = 968/1144 (84.62%), Positives = 1039/1144 (90.82%), Query Frame = 0

Query: 1    MGELSSLTQYIESFVDASRTASQQATSLEAIISLTKNNALTIETLVREMGMYLTITDNII 60
            M ELS LTQYIESFVD S T SQQATSLEAIISL KNN +TI+TLV EMGMYLTITD+II
Sbjct: 1    MAELSKLTQYIESFVDVSHTPSQQATSLEAIISLVKNNVVTIKTLVTEMGMYLTITDHII 60

Query: 61   RGRGILLLGELLACLASKPLDDATIHSLMTFFTERLADWKALRGALVGCLALMRRKTNVG 120
            RGRGILLLGE+L CLASKPLDDATIHSLMTFFTERLADWKALRGAL+GCLALMRRKT VG
Sbjct: 61   RGRGILLLGEVLTCLASKPLDDATIHSLMTFFTERLADWKALRGALIGCLALMRRKTEVG 120

Query: 121  TVSQNDAKSVAQSYFQNLQVQSLGQHDRKLSFELLACLLEHYPNAVVSLGDDLVYGICEA 180
             VSQNDAKS AQSYFQNLQVQSLGQHDRKLSFELL CLLEHYP+AVVSLGDDLVYGICEA
Sbjct: 121  AVSQNDAKSFAQSYFQNLQVQSLGQHDRKLSFELLVCLLEHYPDAVVSLGDDLVYGICEA 180

Query: 181  IDGEKDPHCLMLTFHIVELVAKLFPDPTGTLANSSSDLFEFLGCYFPIHFTHGKEEDVDV 240
            IDGEKDPHCLMLTFHIVELVAKLFPDPTGTLA+SSSDLFEFLGCYFPIHFTHGKEEDVDV
Sbjct: 181  IDGEKDPHCLMLTFHIVELVAKLFPDPTGTLASSSSDLFEFLGCYFPIHFTHGKEEDVDV 240

Query: 241  RRNDLSQALMIAFSSTPLFEPFAVPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEK 300
             RNDLS+ALM+AFSS PLFEPFA+PLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEK
Sbjct: 241  TRNDLSRALMMAFSSNPLFEPFAIPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEK 300

Query: 301  HSASIWSSVKEILFTSIGQPSLSINLESLSSPSFEGNEITTEALRLLQKMVVESNGLFLR 360
            HS +IWSSVKEI+FTSI QPSLS NLESL SPSF+GNE+  EALRLLQKMVVESNG FLR
Sbjct: 301  HSEAIWSSVKEIIFTSIEQPSLSFNLESLDSPSFQGNEMIIEALRLLQKMVVESNGSFLR 360

Query: 361  LIINDEDIKDIFSILNIYTCYNDFPLHSRQRLNAVGHILYKSANASLASCGHVFESFFPR 420
            LIINDEDIK+I + LNIYTCYND PL SRQRLNAVGHILYKSANAS+ASC HVFESFFP 
Sbjct: 361  LIINDEDIKEILNSLNIYTCYNDLPLQSRQRLNAVGHILYKSANASVASCNHVFESFFPC 420

Query: 421  LLDIVGISADQPHNNKISP-KNFNFGALYLCIELLVACRDLIASSDEHICFVKEKLYGML 480
            LLD VGIS DQ  N KISP +NFNFGALYLCIELL ACRDL AS DE  C VKEK Y ML
Sbjct: 421  LLDFVGISVDQSDNYKISPSRNFNFGALYLCIELLAACRDLYASCDEQTCSVKEKSYNML 480

Query: 481  QTFSCSMVHLLNSIFPVIVKKDLHDAEFYCAVKGLQNLATFPVGSSPVSRVIFEDILLGL 540
            QTFSC++V LLNS FP I KKDLHDAEFYCAVKGL+NLA FPVGSSP+S V+FEDILLGL
Sbjct: 481  QTFSCALVQLLNSTFPGIAKKDLHDAEFYCAVKGLRNLAIFPVGSSPISSVVFEDILLGL 540

Query: 541  MSFITADFKFASLWNHALNSLQHIGSFVDKYPESLELQSFMHVVVEKIASMFSLHEETLP 600
            MSFIT + +  SLWNHAL +LQHIGSFVD+Y  S+E QS+MHVVVEKIA MFSLH+E LP
Sbjct: 541  MSFITMNLECGSLWNHALKALQHIGSFVDRYHGSVEWQSYMHVVVEKIAPMFSLHDEALP 600

Query: 601  LSIKLELALNIGRTGRSYMLKIVQGIEEATFFHLSEVYVNGNSKSVEILLSLLDCYSTKI 660
            L++KL++A +IGR+GRSYMLKIVQGIEEAT FHLSEVY NGNSKSVEILLSLLDCYSTKI
Sbjct: 601  LTLKLKMASDIGRSGRSYMLKIVQGIEEATSFHLSEVYSNGNSKSVEILLSLLDCYSTKI 660

Query: 661  LPWLDEVGGFEEVILRIALNIWDQIEKCSVFSALLDKVLLDATMLAMKLSVRSCSKESQN 720
            LPW DE G FEEVILRI +NIWDQIEKC VFS  +DK LLD+TM+A+KLSVRSCSKESQN
Sbjct: 661  LPWFDEAGDFEEVILRITINIWDQIEKCLVFSTSMDKALLDSTMMALKLSVRSCSKESQN 720

Query: 721  VVIQKAFDVLLTSNFTPLKLPSSTTVPLQMEGLQLLKQKDSPLCRDEWILLLFASVTIGL 780
            ++IQKAF+VLLTS+F+P K+  STT+P++MEGLQLL+QKDSPL RDEWIL LFASV I L
Sbjct: 721  IIIQKAFNVLLTSSFSPSKVALSTTIPVKMEGLQLLQQKDSPLSRDEWILSLFASVIIAL 780

Query: 781  RPQTQIPDVRSVIHLLMLSITRGCIPAAQALGSIINKLSLKSDKVEVSSYVSLEEAIDII 840
            RPQ  +PDVRSV+ LLMLSITRGCIPAAQALGS+INKLSLKSDKVEVS+YVSLEEAIDII
Sbjct: 781  RPQIHVPDVRSVMRLLMLSITRGCIPAAQALGSMINKLSLKSDKVEVSNYVSLEEAIDII 840

Query: 841  FKTKFRCFHNGSTLAGSEMFLTDLCSSIEKSSLLQVHVVVGLSWIGKGLLLCGHEKVRDI 900
            F TKFRCFHN ST  GSEM LTDLCSSIEK SLL VH VVGLSWIGKGLLL GHEKVRD+
Sbjct: 841  FNTKFRCFHNESTRDGSEMLLTDLCSSIEKGSLLPVHAVVGLSWIGKGLLLFGHEKVRDV 900

Query: 901  TMVLLECLLSKSRTDASSLQQVILEKDYEPNFDFAIVKGAADAFHILMSDSEACLNRKFH 960
            TMV L+CL+SKSRTDAS LQ+VILEKD E N DF ++ GAADAFHILMSDSEACLNRKFH
Sbjct: 901  TMVFLQCLVSKSRTDASPLQKVILEKDCETNLDFGVMNGAADAFHILMSDSEACLNRKFH 960

Query: 961  AIVRPLYKQRFYSTMMPIFQSLVSKSDASLSRYMLYKAFAHVITDTPLTAILSDAKKLIP 1020
            AI+RPLYKQRF+STMMPIFQSLVSKSD SLSRYMLY+AFAHVI+DTPLTAI+SDAKKLIP
Sbjct: 961  AILRPLYKQRFFSTMMPIFQSLVSKSDESLSRYMLYEAFAHVISDTPLTAIMSDAKKLIP 1020

Query: 1021 MLLDGLLTLSVNVIDKDVVYGLLLVLSGILTDKNGQEAVTENAHKIVDCLARLTSFSHMM 1080
            MLLDGLL LSVN+I+KDVVY LLLVLSGIL DKN QEAVTENAHKIVDCLA LT+F HMM
Sbjct: 1021 MLLDGLLALSVNIINKDVVYSLLLVLSGILMDKNVQEAVTENAHKIVDCLAGLTAFPHMM 1080

Query: 1081 LVRETAIQCLVAVSELPHARIYPMRKQVLHAISKALDDPKRAVRLEAVRARQAWASIASR 1140
            LVRET+IQCLVAVSELPHARIYPMR QVLHAISKALDDPKRAVR EAVR RQAWASIASR
Sbjct: 1081 LVRETSIQCLVAVSELPHARIYPMRMQVLHAISKALDDPKRAVRQEAVRCRQAWASIASR 1140

Query: 1141 SLHF 1144
            SL+F
Sbjct: 1141 SLNF 1144

BLAST of Sed0021872 vs. NCBI nr
Match: XP_008462417.1 (PREDICTED: MMS19 nucleotide excision repair protein homolog isoform X2 [Cucumis melo])

HSP 1 Score: 1892.1 bits (4900), Expect = 0.0e+00
Identity = 959/1144 (83.83%), Positives = 1050/1144 (91.78%), Query Frame = 0

Query: 1    MGELSSLTQYIESFVDASRTASQQATSLEAIISLTKNNALTIETLVREMGMYLTITDNII 60
            M +L  LTQY+ESFVD SRT SQQATSLE I SL KNN LTIETLVREMGMYLTITDNII
Sbjct: 1    MADLCKLTQYVESFVDVSRTPSQQATSLETITSLVKNNVLTIETLVREMGMYLTITDNII 60

Query: 61   RGRGILLLGELLACLASKPLDDATIHSLMTFFTERLADWKALRGALVGCLALMRRKTNVG 120
            RGRGILLLGELLACL SKPLD ATIHSL+ FFTERLADWKALRGALVGCLALMRRKTNVG
Sbjct: 61   RGRGILLLGELLACLTSKPLDSATIHSLIAFFTERLADWKALRGALVGCLALMRRKTNVG 120

Query: 121  TVSQNDAKSVAQSYFQNLQVQSLGQHDRKLSFELLACLLEHYPNAVVSLGDDLVYGICEA 180
            T+SQNDAKSVAQSYFQNLQVQSLGQHDRKLSFELLACLLEHYP+AVVSLGDDLVYGICEA
Sbjct: 121  TISQNDAKSVAQSYFQNLQVQSLGQHDRKLSFELLACLLEHYPDAVVSLGDDLVYGICEA 180

Query: 181  IDGEKDPHCLMLTFHIVELVAKLFPDPTGTLANSSSDLFEFLGCYFPIHFTHGKEEDVDV 240
            IDGEKDPHCL+LTF IVELVAKLFPDP+GTLA+SSSDLFEFLGCYFPIHFTHGKEED+DV
Sbjct: 181  IDGEKDPHCLLLTFRIVELVAKLFPDPSGTLASSSSDLFEFLGCYFPIHFTHGKEEDIDV 240

Query: 241  RRNDLSQALMIAFSSTPLFEPFAVPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEK 300
            RRNDLSQALM AFSSTPLFEPFA+PLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRM+K
Sbjct: 241  RRNDLSQALMRAFSSTPLFEPFAIPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMKK 300

Query: 301  HSASIWSSVKEILFTSIGQPSLSINLESLSSPSFEGNEITTEALRLLQKMVVESNGLFLR 360
            HS +IWSSVKEI+FTSIGQP+LSIN ESL+SPSF+ NE+TTEALRLLQKMVVESNGLFL 
Sbjct: 301  HSEAIWSSVKEIIFTSIGQPNLSINTESLNSPSFQENEMTTEALRLLQKMVVESNGLFLT 360

Query: 361  LIINDEDIKDIFSILNIYTCYNDFPLHSRQRLNAVGHILYKSANASLASCGHVFESFFPR 420
            LIINDEDIKDIF+ILNIYTCYND+PL SRQRLNAVGHILY SA+AS+ASC HVFES+F R
Sbjct: 361  LIINDEDIKDIFNILNIYTCYNDYPLQSRQRLNAVGHILYTSASASVASCDHVFESYFHR 420

Query: 421  LLDIVGISADQPHNNKISPK-NFNFGALYLCIELLVACRDLIASSDEHICFVKEKLYGML 480
            LL+ +GIS DQ HN+KISP  + NFGALYLCIE++ ACRDLIAS+DE+ C VKEK Y ML
Sbjct: 421  LLEFLGISVDQYHNDKISPVISLNFGALYLCIEVIAACRDLIASTDENTCSVKEKSYSML 480

Query: 481  QTFSCSMVHLLNSIFPVIVKKDLHDAEFYCAVKGLQNLATFPVGSSPVSRVIFEDILLGL 540
            QTFS SMV LL+S FP IVK+DLHDAEF+CAVKGL NL+TFPVGSSPVSRVIFEDILL  
Sbjct: 481  QTFSRSMVQLLSSTFPGIVKQDLHDAEFHCAVKGLLNLSTFPVGSSPVSRVIFEDILLEF 540

Query: 541  MSFITADFKFASLWNHALNSLQHIGSFVDKYPESLELQSFMHVVVEKIASMFSLHEETLP 600
            MSF+T +FKF SLWNHAL +LQHIGSFVDKYP S++ QS+MH+VVEKIASMFS H+E LP
Sbjct: 541  MSFVTVNFKFGSLWNHALKALQHIGSFVDKYPGSVDSQSYMHIVVEKIASMFSPHDEVLP 600

Query: 601  LSIKLELALNIGRTGRSYMLKIVQGIEEATFFHLSEVYVNGNSKSVEILLSLLDCYSTKI 660
            L +KLE+A++IGRTGRSYMLKIV GIEE  F++LSEVY  GNSKSVEILL+LLDCYSTKI
Sbjct: 601  LILKLEMAVDIGRTGRSYMLKIVGGIEEPIFYNLSEVYAYGNSKSVEILLTLLDCYSTKI 660

Query: 661  LPWLDEVGGFEEVILRIALNIWDQIEKCSVFSALLDKVLLDATMLAMKLSVRSCSKESQN 720
            LPW DE G FEEVILR ALNIWDQIEKCS F+ L+DKVLLDATM+A+KLSVRSCSKESQN
Sbjct: 661  LPWFDEAGDFEEVILRFALNIWDQIEKCSTFNTLMDKVLLDATMMALKLSVRSCSKESQN 720

Query: 721  VVIQKAFDVLLTSNFTPLKLPSSTTVPLQMEGLQLLKQKDSPLCRDEWILLLFASVTIGL 780
            +++QKAF+VLLTS+F+P K+  STT+P+QMEGLQ+L+QKD+P  RDEWIL LFASV I L
Sbjct: 721  IIVQKAFNVLLTSSFSPSKVALSTTIPVQMEGLQILQQKDNPTSRDEWILSLFASVIIAL 780

Query: 781  RPQTQIPDVRSVIHLLMLSITRGCIPAAQALGSIINKLSLKSDKVEVSSYVSLEEAIDII 840
            RPQ  +PDVR +IHLLMLSITRGC+PAAQALGS+INKLS+KSDKVEVSSYVSLEEAIDII
Sbjct: 781  RPQVHVPDVRLIIHLLMLSITRGCVPAAQALGSMINKLSVKSDKVEVSSYVSLEEAIDII 840

Query: 841  FKTKFRCFHNGSTLAGSEMFLTDLCSSIEKSSLLQVHVVVGLSWIGKGLLLCGHEKVRDI 900
            FKT+FRCFHN +T  GS MFLT+LCSSIEK+SLLQVH VVGLSWIGKGLLLCGH+KVRD+
Sbjct: 841  FKTEFRCFHNENTGNGSVMFLTELCSSIEKTSLLQVHAVVGLSWIGKGLLLCGHDKVRDV 900

Query: 901  TMVLLECLLSKSRTDASSLQQVILEKDYEPNFDFAIVKGAADAFHILMSDSEACLNRKFH 960
            TMV L+ L+SKSRTD   LQQ ILEKD E + DFA++KGAA+AFHILMSDSEACLNRKFH
Sbjct: 901  TMVFLQLLVSKSRTDGPPLQQFILEKDNETSLDFAVMKGAAEAFHILMSDSEACLNRKFH 960

Query: 961  AIVRPLYKQRFYSTMMPIFQSLVSKSDASLSRYMLYKAFAHVITDTPLTAILSDAKKLIP 1020
            AIVRPLYKQRF+STMMPIFQ+LVSKSD SLSRYMLY+A+AHVI+DTPLTA+L+DAKK IP
Sbjct: 961  AIVRPLYKQRFFSTMMPIFQTLVSKSDTSLSRYMLYQAYAHVISDTPLTALLTDAKKFIP 1020

Query: 1021 MLLDGLLTLSVNVIDKDVVYGLLLVLSGILTDKNGQEAVTENAHKIVDCLARLTSFSHMM 1080
            MLLDGLLTLSVN I+KDVVY LLLVLSGIL DKNGQEAVTENAHKIVDCLA LT FSHMM
Sbjct: 1021 MLLDGLLTLSVNGINKDVVYSLLLVLSGILMDKNGQEAVTENAHKIVDCLAGLTDFSHMM 1080

Query: 1081 LVRETAIQCLVAVSELPHARIYPMRKQVLHAISKALDDPKRAVRLEAVRARQAWASIASR 1140
            LVRETAIQCLVAVSELPHARIYPMR+QVLH ISKALDDPKRAVR EAVR RQAWASIASR
Sbjct: 1081 LVRETAIQCLVAVSELPHARIYPMRRQVLHTISKALDDPKRAVRQEAVRCRQAWASIASR 1140

Query: 1141 SLHF 1144
            SLHF
Sbjct: 1141 SLHF 1144

BLAST of Sed0021872 vs. NCBI nr
Match: XP_022992063.1 (MMS19 nucleotide excision repair protein homolog isoform X2 [Cucurbita maxima])

HSP 1 Score: 1891.7 bits (4899), Expect = 0.0e+00
Identity = 972/1144 (84.97%), Positives = 1039/1144 (90.82%), Query Frame = 0

Query: 1    MGELSSLTQYIESFVDASRTASQQATSLEAIISLTKNNALTIETLVREMGMYLTITDNII 60
            M ELS L QYIESFVD S T SQQATSLEAIISL KNN +TI+TLV EMGMYLTITD+II
Sbjct: 1    MAELSKLAQYIESFVDVSHTPSQQATSLEAIISLVKNNVVTIKTLVTEMGMYLTITDHII 60

Query: 61   RGRGILLLGELLACLASKPLDDATIHSLMTFFTERLADWKALRGALVGCLALMRRKTNVG 120
            RGRGILLLGE+LACLASKPLDDATIHSLMTFF ERLADWKALRGAL+GCLALMRRK  VG
Sbjct: 61   RGRGILLLGEVLACLASKPLDDATIHSLMTFFIERLADWKALRGALIGCLALMRRKMEVG 120

Query: 121  TVSQNDAKSVAQSYFQNLQVQSLGQHDRKLSFELLACLLEHYPNAVVSLGDDLVYGICEA 180
            TVSQ DAKS AQSYFQNLQVQSLGQHDRKLSFELL CLLEHYP+AVVSLGDDLVYGICEA
Sbjct: 121  TVSQTDAKSFAQSYFQNLQVQSLGQHDRKLSFELLVCLLEHYPDAVVSLGDDLVYGICEA 180

Query: 181  IDGEKDPHCLMLTFHIVELVAKLFPDPTGTLANSSSDLFEFLGCYFPIHFTHGKEEDVDV 240
            IDGEKDPHCLMLTFHIVELVAKLFPDPTGTLA+SSSDLFEFLGCYFPIHFTHGKEEDVDV
Sbjct: 181  IDGEKDPHCLMLTFHIVELVAKLFPDPTGTLASSSSDLFEFLGCYFPIHFTHGKEEDVDV 240

Query: 241  RRNDLSQALMIAFSSTPLFEPFAVPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEK 300
             RNDLSQALM+AFSS PLFEPFA+PLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEK
Sbjct: 241  TRNDLSQALMMAFSSNPLFEPFAIPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEK 300

Query: 301  HSASIWSSVKEILFTSIGQPSLSINLESLSSPSFEGNEITTEALRLLQKMVVESNGLFLR 360
            HS ++WSSVKEI+FTSI QPSLS NLESL SPSF+GNE+  EALRLLQKMVV+SNG FLR
Sbjct: 301  HSEAVWSSVKEIIFTSIEQPSLSFNLESLDSPSFQGNEMIIEALRLLQKMVVKSNGSFLR 360

Query: 361  LIINDEDIKDIFSILNIYTCYNDFPLHSRQRLNAVGHILYKSANASLASCGHVFESFFPR 420
            LIINDEDIK+IF+ILNIYTCYND PL SRQRLNAVGHILYKSANAS+ASC HVFESFF R
Sbjct: 361  LIINDEDIKEIFNILNIYTCYNDLPLQSRQRLNAVGHILYKSANASVASCNHVFESFFLR 420

Query: 421  LLDIVGISADQPHNNKISP-KNFNFGALYLCIELLVACRDLIASSDEHICFVKEKLYGML 480
            LLD VGIS DQ  N KISP +NFNFGALYLCIELL ACRDL AS DE  C VKEK Y ML
Sbjct: 421  LLDFVGISVDQSDNCKISPSRNFNFGALYLCIELLAACRDLYASCDEQTCSVKEKSYNML 480

Query: 481  QTFSCSMVHLLNSIFPVIVKKDLHDAEFYCAVKGLQNLATFPVGSSPVSRVIFEDILLGL 540
            QTFS S+V LLNS FP I KK+LHDAEFYCAVKGL+NLATFPVGSSPVS V+FE+ILLGL
Sbjct: 481  QTFSRSLVQLLNSTFPGIPKKNLHDAEFYCAVKGLRNLATFPVGSSPVSSVVFEEILLGL 540

Query: 541  MSFITADFKFASLWNHALNSLQHIGSFVDKYPESLELQSFMHVVVEKIASMFSLHEETLP 600
            MSFIT + K  SLWNHAL +LQHIGSFVD+Y  S+E QS+MHVVVEKIA MFSLH+E LP
Sbjct: 541  MSFITMNLKCGSLWNHALKALQHIGSFVDRYHGSVEWQSYMHVVVEKIAPMFSLHDEALP 600

Query: 601  LSIKLELALNIGRTGRSYMLKIVQGIEEATFFHLSEVYVNGNSKSVEILLSLLDCYSTKI 660
            L++KL++A +IGR+GRSYMLKIVQGIEEAT FHL+EVY NGNSKSVEILLSLLDCYSTKI
Sbjct: 601  LTLKLKMASDIGRSGRSYMLKIVQGIEEATSFHLTEVYSNGNSKSVEILLSLLDCYSTKI 660

Query: 661  LPWLDEVGGFEEVILRIALNIWDQIEKCSVFSALLDKVLLDATMLAMKLSVRSCSKESQN 720
            LPW DE G FEEVILRI  NIWDQIEKC VFS  +DK LLDATM+A+KLSVRSCSKESQN
Sbjct: 661  LPWFDEAGDFEEVILRITFNIWDQIEKCLVFSTSMDKALLDATMMALKLSVRSCSKESQN 720

Query: 721  VVIQKAFDVLLTSNFTPLKLPSSTTVPLQMEGLQLLKQKDSPLCRDEWILLLFASVTIGL 780
            ++IQKAF+VLLTS+F+P K+  STT+P+QMEGLQLL+QKDSPL RDEWIL LFASV I L
Sbjct: 721  IIIQKAFNVLLTSSFSPSKVALSTTIPVQMEGLQLLQQKDSPLSRDEWILSLFASVIIAL 780

Query: 781  RPQTQIPDVRSVIHLLMLSITRGCIPAAQALGSIINKLSLKSDKVEVSSYVSLEEAIDII 840
            RPQ  +PDVRSV+ LLMLSITRGCIPAAQALGS+INKLSLK DKVEVS+YVSLEEAIDII
Sbjct: 781  RPQIHVPDVRSVMRLLMLSITRGCIPAAQALGSMINKLSLKFDKVEVSNYVSLEEAIDII 840

Query: 841  FKTKFRCFHNGSTLAGSEMFLTDLCSSIEKSSLLQVHVVVGLSWIGKGLLLCGHEKVRDI 900
            F TKFRCFHNGST  GSEM LTDLCSSIEK SLL VHVVVGLSWIGKGLLL GHEKVRD+
Sbjct: 841  FNTKFRCFHNGSTRDGSEMLLTDLCSSIEKGSLLPVHVVVGLSWIGKGLLLFGHEKVRDV 900

Query: 901  TMVLLECLLSKSRTDASSLQQVILEKDYEPNFDFAIVKGAADAFHILMSDSEACLNRKFH 960
            TMV L+CL+SKSRTDAS LQ+VILEKD E N DFA++  AADAFHILMSDSEACLNRKFH
Sbjct: 901  TMVFLQCLVSKSRTDASPLQKVILEKDCETNLDFAVMNCAADAFHILMSDSEACLNRKFH 960

Query: 961  AIVRPLYKQRFYSTMMPIFQSLVSKSDASLSRYMLYKAFAHVITDTPLTAILSDAKKLIP 1020
            AI+RPLYKQRF+STMMPIFQSLVSKSD SLSRYMLY+AFAHVI+DTPLTAILSDAKKLIP
Sbjct: 961  AILRPLYKQRFFSTMMPIFQSLVSKSDESLSRYMLYEAFAHVISDTPLTAILSDAKKLIP 1020

Query: 1021 MLLDGLLTLSVNVIDKDVVYGLLLVLSGILTDKNGQEAVTENAHKIVDCLARLTSFSHMM 1080
            MLLDGLL LSVN+I+KDVVY LLLVLSGIL DKNGQE VTENAHKIVDCLA LT+F HMM
Sbjct: 1021 MLLDGLLALSVNIINKDVVYSLLLVLSGILMDKNGQEVVTENAHKIVDCLAGLTAFPHMM 1080

Query: 1081 LVRETAIQCLVAVSELPHARIYPMRKQVLHAISKALDDPKRAVRLEAVRARQAWASIASR 1140
            LVRETAIQCLVAVSELPHARIYPMR QVLHAISKALDDPKRAVR EAVR RQAWASIASR
Sbjct: 1081 LVRETAIQCLVAVSELPHARIYPMRMQVLHAISKALDDPKRAVRQEAVRCRQAWASIASR 1140

Query: 1141 SLHF 1144
            SL+F
Sbjct: 1141 SLNF 1144

BLAST of Sed0021872 vs. NCBI nr
Match: KAG6575503.1 (MMS19 nucleotide excision repair protein-like protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1889.0 bits (4892), Expect = 0.0e+00
Identity = 970/1144 (84.79%), Positives = 1035/1144 (90.47%), Query Frame = 0

Query: 1    MGELSSLTQYIESFVDASRTASQQATSLEAIISLTKNNALTIETLVREMGMYLTITDNII 60
            M ELS LTQYIESFVD S T SQQATSLEAIISL KNN +TI+TLV EMGMYLTITD+II
Sbjct: 1    MAELSKLTQYIESFVDVSHTPSQQATSLEAIISLVKNNVVTIKTLVTEMGMYLTITDHII 60

Query: 61   RGRGILLLGELLACLASKPLDDATIHSLMTFFTERLADWKALRGALVGCLALMRRKTNVG 120
            RGRGILLLGE+L CLASKPLDDATIHSLMTFFTERLADWKALRGAL+GCLALMRRKT VG
Sbjct: 61   RGRGILLLGEVLTCLASKPLDDATIHSLMTFFTERLADWKALRGALIGCLALMRRKTEVG 120

Query: 121  TVSQNDAKSVAQSYFQNLQVQSLGQHDRKLSFELLACLLEHYPNAVVSLGDDLVYGICEA 180
             VSQNDAKS AQSYFQNLQVQSLGQHDRKLSFE L CLLEHYP+AVVSLGDDLVYGICEA
Sbjct: 121  AVSQNDAKSFAQSYFQNLQVQSLGQHDRKLSFEHLVCLLEHYPDAVVSLGDDLVYGICEA 180

Query: 181  IDGEKDPHCLMLTFHIVELVAKLFPDPTGTLANSSSDLFEFLGCYFPIHFTHGKEEDVDV 240
            IDGEKDPHCLMLTFHIVELVAKLFPDPTGTLA+SSSDLFEFLGCYFPIHFTHGKEEDVDV
Sbjct: 181  IDGEKDPHCLMLTFHIVELVAKLFPDPTGTLASSSSDLFEFLGCYFPIHFTHGKEEDVDV 240

Query: 241  RRNDLSQALMIAFSSTPLFEPFAVPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEK 300
             RNDLS+ALM+AFSS PLFEPFA+PLLLEKLSSSLPLAKIDSLKYLSDCTVKY ADRMEK
Sbjct: 241  TRNDLSRALMMAFSSNPLFEPFAIPLLLEKLSSSLPLAKIDSLKYLSDCTVKYRADRMEK 300

Query: 301  HSASIWSSVKEILFTSIGQPSLSINLESLSSPSFEGNEITTEALRLLQKMVVESNGLFLR 360
            HS +IWSSVKEI+FTSI QPSLS NLESL SPSF+GNE+  EALRLLQKMVVESNG FLR
Sbjct: 301  HSEAIWSSVKEIIFTSIEQPSLSFNLESLDSPSFQGNEMIIEALRLLQKMVVESNGSFLR 360

Query: 361  LIINDEDIKDIFSILNIYTCYNDFPLHSRQRLNAVGHILYKSANASLASCGHVFESFFPR 420
            LIINDEDIK+I + LNIYTCYND PL SRQRLNAVGHILYKSANAS+ASC HVFESFFPR
Sbjct: 361  LIINDEDIKEILNSLNIYTCYNDLPLQSRQRLNAVGHILYKSANASVASCNHVFESFFPR 420

Query: 421  LLDIVGISADQPHNNKISP-KNFNFGALYLCIELLVACRDLIASSDEHICFVKEKLYGML 480
            LLD VGIS DQ  N KISP +NFNFGALYLCIELL ACRDL AS D   C VKEK Y ML
Sbjct: 421  LLDFVGISVDQSDNYKISPSRNFNFGALYLCIELLAACRDLYASCDVQTCSVKEKSYNML 480

Query: 481  QTFSCSMVHLLNSIFPVIVKKDLHDAEFYCAVKGLQNLATFPVGSSPVSRVIFEDILLGL 540
            QTFSC++V LLNS FP I KKDLHDAEFYCAVKGL+NLATFPVGSSPVS V+FEDILLGL
Sbjct: 481  QTFSCALVQLLNSTFPGIAKKDLHDAEFYCAVKGLRNLATFPVGSSPVSSVVFEDILLGL 540

Query: 541  MSFITADFKFASLWNHALNSLQHIGSFVDKYPESLELQSFMHVVVEKIASMFSLHEETLP 600
            MSFIT + +  SLWNHAL +LQHIGSFVD+Y  S+E QS+MHVVVEKIA MFSLH+E LP
Sbjct: 541  MSFITMNLECGSLWNHALKALQHIGSFVDRYHGSVEWQSYMHVVVEKIAPMFSLHDEALP 600

Query: 601  LSIKLELALNIGRTGRSYMLKIVQGIEEATFFHLSEVYVNGNSKSVEILLSLLDCYSTKI 660
            L++KL++A +IGR+GRSYMLKIVQGIEEAT FHL EVY NGNSKSVEILLSLLDCYSTKI
Sbjct: 601  LTLKLKMASDIGRSGRSYMLKIVQGIEEATSFHLYEVYSNGNSKSVEILLSLLDCYSTKI 660

Query: 661  LPWLDEVGGFEEVILRIALNIWDQIEKCSVFSALLDKVLLDATMLAMKLSVRSCSKESQN 720
            LPW DE G FEEVILRI  NIWDQIEKC VFS  +DK LLDATM+A+KLSVRSCSKESQN
Sbjct: 661  LPWFDEAGDFEEVILRITFNIWDQIEKCLVFSTSMDKALLDATMMALKLSVRSCSKESQN 720

Query: 721  VVIQKAFDVLLTSNFTPLKLPSSTTVPLQMEGLQLLKQKDSPLCRDEWILLLFASVTIGL 780
            ++IQKAF+VLLTS+F+P K+  STT+P+QMEGLQLL+QKDSPL RDEWIL LFASV I L
Sbjct: 721  IIIQKAFNVLLTSSFSPSKVALSTTIPVQMEGLQLLQQKDSPLSRDEWILSLFASVIIAL 780

Query: 781  RPQTQIPDVRSVIHLLMLSITRGCIPAAQALGSIINKLSLKSDKVEVSSYVSLEEAIDII 840
            RPQ  +PDVRSV+ LLMLSITRGCIPAAQALGS+INKLSLKSDKVEVS+YVSLEEAIDII
Sbjct: 781  RPQIHVPDVRSVMRLLMLSITRGCIPAAQALGSMINKLSLKSDKVEVSNYVSLEEAIDII 840

Query: 841  FKTKFRCFHNGSTLAGSEMFLTDLCSSIEKSSLLQVHVVVGLSWIGKGLLLCGHEKVRDI 900
            F TKFRCFHN ST  GSEM LTDLCSSIEK SLL VH VVGLSWIGKGLLL GHEKVRD+
Sbjct: 841  FNTKFRCFHNESTRDGSEMLLTDLCSSIEKGSLLPVHAVVGLSWIGKGLLLFGHEKVRDV 900

Query: 901  TMVLLECLLSKSRTDASSLQQVILEKDYEPNFDFAIVKGAADAFHILMSDSEACLNRKFH 960
            TMV L+CL+SKSRTDA  LQ+VILEKD E N DFA++ GAADAFHILMSDSEACLNRKFH
Sbjct: 901  TMVFLQCLVSKSRTDAPPLQKVILEKDCETNLDFAVMNGAADAFHILMSDSEACLNRKFH 960

Query: 961  AIVRPLYKQRFYSTMMPIFQSLVSKSDASLSRYMLYKAFAHVITDTPLTAILSDAKKLIP 1020
            AI+RPLYKQRF+STMMPIFQSLVSKSD SLSRYMLY+AFAHVI+DTPLTAILSDAKKLIP
Sbjct: 961  AILRPLYKQRFFSTMMPIFQSLVSKSDESLSRYMLYEAFAHVISDTPLTAILSDAKKLIP 1020

Query: 1021 MLLDGLLTLSVNVIDKDVVYGLLLVLSGILTDKNGQEAVTENAHKIVDCLARLTSFSHMM 1080
            MLLDGLL LSVN+I+KDVVY LLLVLSGIL DKN QEAVTENAHKIVDCLA L +F HMM
Sbjct: 1021 MLLDGLLALSVNIINKDVVYSLLLVLSGILMDKNVQEAVTENAHKIVDCLAGLAAFPHMM 1080

Query: 1081 LVRETAIQCLVAVSELPHARIYPMRKQVLHAISKALDDPKRAVRLEAVRARQAWASIASR 1140
            LVRETAIQCLVAVSELPHARIYPMR QVLHAISKALDDPKRAVR EAVR RQAWASIASR
Sbjct: 1081 LVRETAIQCLVAVSELPHARIYPMRMQVLHAISKALDDPKRAVRQEAVRCRQAWASIASR 1140

Query: 1141 SLHF 1144
            SL+F
Sbjct: 1141 SLNF 1144

BLAST of Sed0021872 vs. ExPASy Swiss-Prot
Match: Q0WVF8 (MMS19 nucleotide excision repair protein homolog OS=Arabidopsis thaliana OX=3702 GN=MMS19 PE=1 SV=1)

HSP 1 Score: 991.9 bits (2563), Expect = 6.1e-288
Identity = 546/1156 (47.23%), Positives = 769/1156 (66.52%), Query Frame = 0

Query: 1    MGELSSLTQYIESFVDASRTASQQATSLEAIISLTKNNALTIETLVREMGMYLTITDNII 60
            M E + L Q++E+FVD +R++SQQ  SL+AI S  +N++L+I  LVREM MYLT TDN++
Sbjct: 2    MVEPNQLVQHLETFVDTNRSSSQQDDSLKAIASSLENDSLSITQLVREMEMYLTTTDNLV 61

Query: 61   RGRGILLLGELLACLASKPLDDATIHSLMTFFTERLADWKALRGALVGCLALMRRKTNVG 120
            R RGILLL E+L CL +KPL+D  +H+L+ FF+E+LADW+A+ GALVGCLAL++RK   G
Sbjct: 62   RARGILLLAEILDCLKAKPLNDTIVHTLVGFFSEKLADWRAMCGALVGCLALLKRKDVAG 121

Query: 121  TVSQNDAKSVAQSYFQNLQVQSLGQHDRKLSFELLACLLEHYPNAVVSLGDDLVYGICEA 180
             V+  D +++A+S  QN+QVQ+L  H+RKL+FELL CLL+ +  A++++GD LVY +CEA
Sbjct: 122  VVTDIDVQAMAKSMIQNVQVQALALHERKLAFELLECLLQQHSEAILTMGDLLVYAMCEA 181

Query: 181  IDGEKDPHCLMLTFHIVELVAKLFPDPTGTLANSSSDLFEFLGCYFPIHFTHGKEEDVDV 240
            IDGEKDP CLM+ FH+VEL+A LFP P+G LA+ +SDLFE +GCYFP+HFTH K+++ ++
Sbjct: 182  IDGEKDPQCLMIVFHLVELLAPLFPSPSGPLASDASDLFEVIGCYFPLHFTHTKDDEANI 241

Query: 241  RRNDLSQALMIAFSSTPLFEPFAVPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEK 300
            RR DLS+ L++A SSTP FEP+A+PLLLEKLSSSLP+AK+DSLK L DC +KYG DRM+K
Sbjct: 242  RREDLSRGLLLAISSTPFFEPYAIPLLLEKLSSSLPVAKVDSLKCLKDCALKYGVDRMKK 301

Query: 301  HSASIWSSVKEILFTSIGQPSLSINLESLSSPSFEGNEITTEALRLLQKMVVESNGLFLR 360
            H  ++WS++K+  ++S G   LS  +ESL+SP FE NEI  +A+ LLQ++V +    FL 
Sbjct: 302  HYGALWSALKDTFYSSTG-THLSFAIESLTSPGFEMNEIHRDAVSLLQRLVKQDIS-FLG 361

Query: 361  LIINDEDIKDIFSILNIYTCYNDFPLHSRQRLNAVGHILYKSANASLASCGHVFESFFPR 420
             +++D  I  +F  +  Y  Y + P  S+  +  +  IL  SA AS+ SC  +FE+ F R
Sbjct: 362  FVVDDTRINTVFDTIYRYPQYKEMPDPSKLEVLVISQILSVSAKASVQSCNIIFEAIFFR 421

Query: 421  LLDIVGI-------SADQPHNNKISPKNFNFGALYLCIELLVACRDLIASSDE--HICFV 480
            L++ +GI          Q  N+ +S + ++ G L+LCIELL A +DLI   +E       
Sbjct: 422  LMNTLGIVEKTSTGDVVQNGNSTVSTRLYH-GGLHLCIELLAASKDLILGFEECSPTSGC 481

Query: 481  KEKLYGMLQTFSCSMVHLLNSIFPVIVKKDLHDAEFYCAVKGLQNLATFPVGSSPVSRVI 540
                  M+++FS  ++ +  S   V    D    + Y  VKGL  +  F  GSSPVSR  
Sbjct: 482  ANSGCSMVKSFSVPLIQVFTS--AVCRSNDDSVVDVYLGVKGLLTMGMFRGGSSPVSRTE 541

Query: 541  FEDILLGLMSFITADFKFASLWNHALNSLQHIGSFVDKYPESLELQSFMHVVVEKIASMF 600
            FE+IL+ L S ITA      +W  AL +L  IGSF+D+Y ES +  S+M +VV+ + S+ 
Sbjct: 542  FENILVTLTSIITAKSGKTVVWELALKALVCIGSFIDRYHESDKAMSYMSIVVDNLVSLA 601

Query: 601  SLHEETLPLSIKLELALNIGRTGRSYMLKIVQGIEEATFFHLSEVYVNGNSKSVEILLSL 660
                  LP  + LE    +  TG  Y+ K+VQG+EEA    LS+ YVNGN +S++    L
Sbjct: 602  CSSHCGLPYQMILEATSEVCSTGPKYVEKMVQGLEEAFCSSLSDFYVNGNFESIDNCSQL 661

Query: 661  LDCYSTKILPWLDEVGGFEEVILRIALNIWDQIEKCSVFSALLD-KVLLDATMLAMKLSV 720
            L C + K+LP + E+ G E++++  A+++W QIE C VFS   + +  ++A M  M+  V
Sbjct: 662  LKCLTNKLLPRVAEIDGLEQLLVHFAISMWKQIEFCGVFSCDFNGREFVEAAMTTMRQVV 721

Query: 721  RSCSKESQNVVIQKAFDVLLTSNFTPLKLPSSTTVPLQMEGLQLLKQKDSPLCRDEWILL 780
                 +SQN +IQKA+ V+     +   LP+  ++PL    L+ L++  S   RDE IL 
Sbjct: 722  GIALVDSQNSIIQKAYSVV-----SSCTLPAMESIPLTFVALEGLQRDLS--SRDELILS 781

Query: 781  LFASVTIGLRPQTQIPDVRSVIHLLMLSITRGCIPAAQALGSIINKLSLKSDKVEVSSYV 840
            LFASV I   P   IPD +S+IHLL++++ +G IPAAQALGS++NKL   S     S   
Sbjct: 782  LFASVIIAASPSASIPDAKSLIHLLLVTLLKGYIPAAQALGSMVNKLGSGSGGTNTSRDC 841

Query: 841  SLEEAIDIIFKTKF----RCFHNGST--LAGSEMFLTDLCSSIEKSSLLQVHVVVGLSWI 900
            SLEEA  IIF   F    +   NGS   + GSE  ++ +C     S  LQ   + GL+WI
Sbjct: 842  SLEEACAIIFHADFASGKKISSNGSAKIIVGSETTMSKICLGYCGSLDLQTRAITGLAWI 901

Query: 901  GKGLLLCGHEKVRDITMVLLECLLSKSRTDASSLQQVILEKDYEPNFDFAIVKGAADAFH 960
            GKGLL+ G+E+V +I +VL+ECL S + +  +                 + +K AADAF 
Sbjct: 902  GKGLLMRGNERVNEIALVLVECLKSNNCSGHA--------------LHPSAMKHAADAFS 961

Query: 961  ILMSDSEACLNRKFHAIVRPLYKQRFYSTMMPIFQSLVSKSDASLSRYMLYKAFAHVITD 1020
            I+MSDSE CLNRKFHA++RPLYKQR +ST++PI +SL+  S  SLSR ML+ A AHVI++
Sbjct: 962  IIMSDSEVCLNRKFHAVIRPLYKQRCFSTIVPILESLIMNSQTSLSRTMLHVALAHVISN 1021

Query: 1021 TPLTAILSDAKKLIPMLLDGLLTLSVNVIDKDVVYGLLLVLSGILTDKNGQEAVTENAHK 1080
             P+T IL + KKL P++L+GL  LS++ ++K+ ++ LLLVLSG LTD  GQ++ ++NAH 
Sbjct: 1022 VPVTVILDNTKKLQPLILEGLSVLSLDSVEKETLFSLLLVLSGTLTDTKGQQSASDNAHI 1081

Query: 1081 IVDCLARLTSFSHMMLVRETAIQCLVAVSELPHARIYPMRKQVLHAISKALDDPKRAVRL 1140
            I++CL +LTS+ H+M+VRET+IQCLVA+ ELPH RIYP R++VL AI K+LDDPKR VR 
Sbjct: 1082 IIECLIKLTSYPHLMVVRETSIQCLVALLELPHRRIYPFRREVLQAIEKSLDDPKRKVRE 1131

BLAST of Sed0021872 vs. ExPASy Swiss-Prot
Match: Q6DCF2 (MMS19 nucleotide excision repair protein homolog OS=Xenopus laevis OX=8355 GN=mms19 PE=2 SV=1)

HSP 1 Score: 208.0 bits (528), Expect = 5.7e-52
Identity = 278/1174 (23.68%), Positives = 490/1174 (41.74%), Query Frame = 0

Query: 5    SSLTQYIESFVDASRTASQQATSLEAIISLTKNNALTIETLVREMGMYLTITDNIIRGRG 64
            ++L   +E FV       +Q +    + +  K+   T+  +V  +G  L   +  +R +G
Sbjct: 6    TALWGLVEEFV-----VGEQDSKSAEVAAGVKDGVFTVLQVVESLGSCLANPEPRMRSKG 65

Query: 65   ILLLGELLACLASKPLDDATIHSLMTFFTERLADWKALR-GALVGCLALMRRKTNVGTVS 124
            + LL  +L    S+ L +  +  L+ F+  RL D   +    L G +AL    +    + 
Sbjct: 66   VQLLSRVLLECYSR-LTEKEVEVLVVFYENRLKDHHLITPHVLQGLMAL----SMCDVLP 125

Query: 125  QNDAKSVAQSYFQNLQVQSLGQHDRKLSFELLACLLEHYPNAVVSLGDDLVYGICEAIDG 184
            Q  A SV +S FQ + VQSL Q DR   + ++   ++     + +LG D  YG  + +DG
Sbjct: 126  QGVAVSVLKSVFQEVHVQSLMQIDRHTVYMIITNFMKTREEELKNLGADFTYGFIQVMDG 185

Query: 185  EKDPHCLMLTFHIV-ELVAKLFPDPTGTLANSSSDLFEFLGCYFPIHFTHGKEEDVDVRR 244
            EKDP  L++ F+IV ++V K +      L     +LFE   CYFPI FT    +   + R
Sbjct: 186  EKDPRNLLVAFYIVQDIVTKNY-----ALGPFVEELFEVTSCYFPIDFTPPPSDPHGITR 245

Query: 245  NDLSQALMIAFSSTPLFEPFAVPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEKHS 304
              L   L    +ST  F  F +PLL+EK+ S +  AK+DSL+ LS C   YG   +++  
Sbjct: 246  EHLIMGLRAVLASTSRFAEFLLPLLIEKMDSEMQSAKLDSLQTLSACCTVYGQKELKEFL 305

Query: 305  ASIWSSVKEILFTSIGQPSLSINLESLSSPSFEGNEITTEALRLLQKMVVESNGLFLRLI 364
            + +WSS++  +F +  +                  +I  E L  LQ +    +    R +
Sbjct: 306  SGLWSSIRREVFQTASE------------------KIEAEGLAALQAL----SACLSRSV 365

Query: 365  IND--EDIKDIFSILNIYTCYNDFPLHSRQRLNAVGHILYKSANASLASCGHVFESFFPR 424
            ++   ED+ D F    +  C +       + +     +L  +A  S  +C  V  +  P 
Sbjct: 366  LSPDAEDLLDSFLNNILQDCKHHLCEPDMKLVWPSAKLLQAAAGGSSRACWKVTANVLPL 425

Query: 425  LLDIVGISADQPHNNKISPKNFNFGALYLCIELLVACRDLIASSDEHICFVKEKLYGML- 484
            LL+     A   H   I      F        L +  R L    +  +  +KE L  M+ 
Sbjct: 426  LLEQYNQHAQSSHRRTILEMTLGF--------LKLQSRWLDEEDENGLGNLKESLCSMVF 485

Query: 485  --QTFSCSMVHLLN-SIFPVIVKKDLHDAEFYCAVKGLQNLATFPVGSSPVSRVIFEDI- 544
               T S + +H++   I  V+  +           +G    +   +    ++R+I +   
Sbjct: 486  SAVTDSSTQLHVVALKILTVLAMQ-----------QGFMASSDIDLVVDHLTRLILQKTD 545

Query: 545  LLGLMSFITADFKFASLWNHALNS--LQHIGSFVDKYPESLEL--------QSFMHVVVE 604
                M+ + A    A +   A  S  L  + + +   P  + L         S   + +E
Sbjct: 546  SESCMAAVEASGTLAKVHPSAFISRMLPQLCANLQTEPMDINLNESGVVPEHSIRQLCLE 605

Query: 605  KIASMFSLHEETLPLSIKLELALNIGRTGRSYMLKIVQGIEEATFFHLSEVYVNGNSKSV 664
             +A++ S H+  L  ++ + L          Y+ +   G EE    ++  V  + +  +V
Sbjct: 606  ALAAV-STHQSILKETVPILL---------DYIRRAHNGEEETNVENVVSVCKSLHRVAV 665

Query: 665  EILL--SLLDCYSTKILPWL---------DEVGGFEEVILRIALNIWDQIEKCSVFSALL 724
            +  L    L  Y   +LP L          + G    ++LR             + +A++
Sbjct: 666  QCQLDSESLQFYHQTVLPCLLSLTVQAATQDSGTSSHILLR-----------DDILTAMV 725

Query: 725  DKVLLDATMLAMKLSVRSCSKESQNVVIQKAFDVLLTSNFTPLKLPSSTTVPLQMEGLQL 784
              +    T L  +L+ +S S +  ++ +     +L  +NF      SS   P Q+ G   
Sbjct: 726  PVISAACTHLKPELASKSTS-QIVSLFLDGDISLLSENNF------SSKFQPFQVNGPTE 785

Query: 785  LKQKDSPLCRDEWILLLFASVTIGLRPQTQIPDVRSVI-HLLMLSITRGC-----IPAAQ 844
            L+ +         ++ L  +    L    +IP +R ++ HLL LS++ GC       A++
Sbjct: 786  LQNR---------LVSLLMAFICSLPRNVEIPHLRRLLQHLLSLSLS-GCSLFAYSSASK 845

Query: 845  ALGSIINKL---SLKSDKVEVSSYVSLEEAIDIIFKTKFRCFHNGSTLAGSEMFLTDLCS 904
                +INK     L  D ++V++     + ID+                           
Sbjct: 846  CFAGLINKCPQGDLLDDILKVTA-----QRIDV--------------------------G 905

Query: 905  SIEKSSLLQVHVVVGLSWIGKGLLLCGHEKVRDITMVLLECLLSKSRTDASSLQQVILEK 964
             +E+ S  Q   +  L W+ K L+L  H     +T  ++  LLS  +   S         
Sbjct: 906  LVEEPSRTQ--AITLLVWVTKALVLRYHPLSGQLTDKMIG-LLSDQQLGPS--------- 965

Query: 965  DYEPNFDFAIVKGAADAFHILMSDSEACLNRKFHAIVRPLYKQRFYSTMMP-IFQSLVSK 1024
                          A+ F +L+SDS   LN+  HA +R +++QRF++  +P + Q   S 
Sbjct: 966  -------------VANMFSLLVSDSPDILNKACHADIRIMFRQRFFTENVPKLVQGFNSA 1019

Query: 1025 SDASLSRYMLYKAFAHVITDTPLTAILSDAKKLIPMLLDGLLTLSVNVIDKDVVYGLLLV 1084
            +      Y+  KA +HV+   P   ++ +   L+ +LL+ L     +  DK V    L+ 
Sbjct: 1026 NGDDKPNYL--KALSHVLNTLPKQVLMPELPSLLSLLLEAL-----SCPDKVVQLSTLIC 1019

Query: 1085 LSGILTDKNGQEAVTENAHKIVDCLARLTSFSHMMLVRETAIQCLVAVSELPHARIYPMR 1139
            L  +L  +   E +  +   ++  L  L S S  M VR TA++C++A+++LP   + P +
Sbjct: 1086 LEPLL--QEAPETLKVHIDGLISKLLGL-SCSPAMAVRITALKCILALTKLPLHMLLPYK 1019

BLAST of Sed0021872 vs. ExPASy Swiss-Prot
Match: Q0V9L1 (MMS19 nucleotide excision repair protein homolog OS=Xenopus tropicalis OX=8364 GN=mms19 PE=2 SV=2)

HSP 1 Score: 202.2 bits (513), Expect = 3.1e-50
Identity = 276/1162 (23.75%), Positives = 478/1162 (41.14%), Query Frame = 0

Query: 5    SSLTQYIESFVDASRTASQQATSLEAIISLTKNNALTIETLVREMGMYLTITDNIIRGRG 64
            S+L   +E FV       Q + S E + +  K+   T+  +V  +G  L   +   R +G
Sbjct: 6    SALWGLVEEFV----VGEQDSRSAE-VAAGVKDGIFTVLQVVEALGSCLANPEPRSRAKG 65

Query: 65   ILLLGELLACLASKPLDDATIHSLMTFFTERLADWKALR-GALVGCLALMRRKTNVGTVS 124
            + LL  +L    S+ L +  +  L+ F+  RL D   +    L G +AL    +    + 
Sbjct: 66   MQLLSRVLLECYSR-LTEKEVEVLVLFYENRLKDHHLITPHVLKGLMAL----SMCDVLP 125

Query: 125  QNDAKSVAQSYFQNLQVQSLGQHDRKLSFELLACLLEHYPNAVVSLGDDLVYGICEAIDG 184
            Q  A SV +S FQ + VQSL Q DR   + ++   ++     + SLG D  YG  + +DG
Sbjct: 126  QGLAVSVLKSVFQEVHVQSLMQIDRHTVYMIITNFMKTREEELKSLGADFTYGFIQVMDG 185

Query: 185  EKDPHCLMLTFHIV-ELVAKLFPDPTGTLANSSSDLFEFLGCYFPIHFTHGKEEDVDVRR 244
            EKDP  L++ FHIV +++ K +      L     +LFE   CYFPI FT    +   + R
Sbjct: 186  EKDPRNLLVAFHIVQDIITKNY-----ALGPFVEELFEVTSCYFPIDFTPPPNDPHGITR 245

Query: 245  NDLSQALMIAFSSTPLFEPFAVPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEKHS 304
              L   L     ST  F  F +PLL+EK+ S +  AK+DSL+ L  C   YG   +++  
Sbjct: 246  EHLIVGLRAVLVSTSRFAEFFLPLLIEKMDSDVQSAKLDSLQTLIACCTVYGQKDLKEFL 305

Query: 305  ASIWSSVKEILFTSIGQPSLSINLESLSSPSFEGNEITTEALRLLQKMVVESNGLFLRLI 364
            + +WSS++  +F +  +                  +I  E L  LQ +    +    R I
Sbjct: 306  SGLWSSIRREVFQTASE------------------KIEAEGLAALQAL----SACLSRSI 365

Query: 365  IND--EDIKDIFSILNIYTCYNDFPLHSRQRLNAVGHILYKSANASLASCGHVFESFFPR 424
            ++   ED+ D F    +  C +       + +     +L  +A  S  +C  V  +  P 
Sbjct: 366  LSPDAEDLLDSFLNSILQDCKHHLCEPDMKLVWPSAKLLQAAAGGSSRACWKVTANVLPL 425

Query: 425  LLDIVGISADQPHNNKISPKNFNFGALYLCIELLVACRDLIASSDEHICFVKEKLYGML- 484
            LL+     A   H   I      F        L +  R L    D  +  +KE L  M+ 
Sbjct: 426  LLEQYNQHAQSSHRRTILEMTLGF--------LKLQSRWLDEEDDNGLGNLKEALCTMVF 485

Query: 485  --QTFSCSMVH--LLNSIFPVIVKKD-LHDAEFYCAVKGLQNLATFPVGSSPVSRVIFED 544
               T S + +H   + ++  + +++  L  ++    V  L  L      S      I   
Sbjct: 486  SAVTDSSTQLHQVAVRTLTVLAMQQGFLSSSDIDLVVDHLTRLILQETDSESCMAAIEAS 545

Query: 545  ILLGLMSFITADFKFASLWNHALNSLQHIGSFVDKYPESLELQSFMHVVVEKIASMFSLH 604
               G ++ +      + +      +LQ     ++     +  + F+     +  +  S H
Sbjct: 546  ---GTLAKVHPSVFISRMLPQLCANLQTEPMDINLNESRVVPERFIRQRCLEALAAVSTH 605

Query: 605  EETLPLSIKLELALNIGRTGRSYMLKIVQGIEEATFFHLSEVYVNGNSKSVEILL--SLL 664
            +  L  ++ + L          Y+ ++  G  E    ++  +  + +  +V+  L    L
Sbjct: 606  QSILKETVPILL---------DYIGRVHNGEGETNAENVVSICRSLHRVAVQCQLDSEAL 665

Query: 665  DCYSTKILPWLDEVGGFEEVILRIALNIWDQIEKCS------VFSALLDKVLLDATMLAM 724
              Y   +LP L        + L +     D    C+      V +A++  +    T L  
Sbjct: 666  QFYHEIVLPSL--------LSLTVQAATQDSGTSCNVLLRDDVLTAMVPVITAACTHLTP 725

Query: 725  KLSVRSCSKESQNVVIQKAFDVLLTSNFTPLKLPSSTTVPLQMEGLQLLKQKDSPLCRDE 784
            +L+ +S    SQ V +    DV L   F+   LPS+   P Q+ G         P     
Sbjct: 726  ELASKSV---SQVVALFLDGDVSL---FSENNLPSNFQ-PFQVNG---------PTELQN 785

Query: 785  WILLLFASVTIGLRPQTQIPDVRSVI-HLLMLSITRGCIP-----AAQALGSIINKL--- 844
             ++ L  +    L    +IP +R ++ HLL LS++ GC P     A++    +INK    
Sbjct: 786  HLVSLLMAFVCSLPRNVEIPHLRRLLQHLLSLSLS-GCSPFAYSSASKCFAGLINKCPQG 845

Query: 845  SLKSDKVEVSSYVSLEEAIDIIFKTKFRCFHNGSTLAGSEMFLTDLCSSIEKSSLLQVHV 904
             L  D ++V++     + ID+                            +++ S  +   
Sbjct: 846  DLLDDILQVTA-----QRIDV--------------------------GLVDEPS--RTRA 905

Query: 905  VVGLSWIGKGLLLCGHEKVRDITMVLLECLLSKSRTDASSLQQVILEKDYEPNFDFAIVK 964
            +  L W+ K L+L  H     +T  ++  L  K                        +  
Sbjct: 906  ITLLVWVTKALVLRYHPLSGQLTNKMIGLLSDKQ-----------------------LGP 965

Query: 965  GAADAFHILMSDSEACLNRKFHAIVRPLYKQRFYSTMMP-IFQSLVSKSDASLSRYMLYK 1024
              A+ F +L+SDS   +N+  HA +R +++QRF++  +P + Q   S +      Y+  K
Sbjct: 966  SVANMFSLLVSDSPDIINKACHADIRIMFRQRFFTENVPKLVQGFNSANRDDKPNYL--K 1019

Query: 1025 AFAHVITDTPLTAILSDAKKLIPMLLDGLLTLSVNVIDKDVVYGLLLVLSGILTDKNGQE 1084
            A +HV+   P   ++ +   L+ +LL+ L     +  DK V    L  L  +L  +   E
Sbjct: 1026 ALSHVLNALPKQVLMPELPSLLSLLLEAL-----SCPDKVVQLSTLTCLEPLL--QEAPE 1019

Query: 1085 AVTENAHKIVDCLARLTSFSHMMLVRETAIQCLVAVSELPHARIYPMRKQVLHAISKALD 1139
             +  +   ++  L  LT  S  M VR TA++C++A+++LP   + P ++QV+ A++K LD
Sbjct: 1086 TLKVHIDGLISKLVSLT-LSPAMAVRITALKCILALTKLPLHMLLPYKQQVIRALAKPLD 1019
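
Note: the percentages on each HSP statistics line are the matched-column count divided by the alignment length. The sketch below is a minimal, unofficial check of that arithmetic, assuming the line layout shown on this page. The function name check_hsp_stats and the regular expression are illustrative assumptions, not part of any BLAST tool. The pattern also accepts the "Postives" spelling that appears in some copies of this output.

```python
import re

# Hypothetical pattern for the statistics line as printed on this page.
# "Posi?tives" tolerates the misspelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_stats(line):
    """Recompute identity/positive percentages and compare with the reported ones."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    calc_ident = 100.0 * int(ident) / int(ident_len)
    calc_pos = 100.0 * int(pos) / int(pos_len)
    return {
        "identity_pct": round(calc_ident, 2),
        "positives_pct": round(calc_pos, 2),
        # Allow for rounding to two decimals in the reported values.
        "consistent": abs(calc_ident - float(ident_pct)) < 0.006
        and abs(calc_pos - float(pos_pct)) < 0.006,
    }


if __name__ == "__main__":
    line = "Identity = 276/1162 (23.75%), Positives = 478/1162 (41.14%), Query Frame = 0"
    print(check_hsp_stats(line))
```

Applied to the Xenopus tropicalis hit above, the check divides 276 and 478 by the alignment length of 1162 and compares the results with the reported values.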

BLAST of Sed0021872 vs. ExPASy Swiss-Prot
Match: E7FBU4 (MMS19 nucleotide excision repair protein homolog OS=Danio rerio OX=7955 GN=mms19 PE=3 SV=1)

HSP 1 Score: 198.0 bits (502), Expect = 5.9e-49
Identity = 274/1155 (23.72%), Positives = 474/1155 (41.04%), Query Frame = 0

Query: 11   IESFVDASRTASQQATSLEAIISLTKNNALTIETLVREMGMYLTITDNIIRGRGILLLGE 70
            +E FV     +    TS     +  KN   T+  LV  +G+ LT +    RGRG+ LL +
Sbjct: 12   VEEFVSGQVDSKAADTS-----TGVKNGQFTVLQLVEALGVSLTSSQPQTRGRGVQLLSQ 71

Query: 71   LL-ACLASKPLDDATIHSLMTFFTERLADWKALRGALVGCLALMRRKTNVGTVSQNDAKS 130
            +L  C +   L +  +  L+ F+  RL D   +   ++  L  + +      +    A S
Sbjct: 72   VLQECYSG--LSEREVEVLIAFYENRLKDHYVITPHVLRGLKALAK---CSVLPPGSAVS 131

Query: 131  VAQSYFQNLQV--QSLGQHDRKLSFELLACLLEHYPNAVVSLGDDLVYGICEAIDGEKDP 190
            + +S FQ++ V  QSL   +R   + +L  L+E     +  LG D ++G  +++DGE+DP
Sbjct: 132  ILKSIFQDVHVQQQSLMVTERSCVYNILISLMESREEELKGLGADFIFGFVQSVDGERDP 191

Query: 191  HCLMLTFHIVELVAKLFPDPTGTLANSSSDLFEFLGCYFPIHFTHGKEEDVDVRRNDLSQ 250
              L+L F + + +     D    L     +LFE   CYFPI F+    +   + + +L  
Sbjct: 192  RNLLLAFQVAKNIIYRGYD----LGKFVEELFEVTSCYFPIDFSPPPNDPHGITQEELIL 251

Query: 251  ALMIAFSSTPLFEPFAVPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEKHSASIWS 310
            +L    + TP F  F +PL++EK+ S +  AK+DS+  L+ C   Y    + +    +WS
Sbjct: 252  SLRAVLTGTPRFAEFLLPLIIEKMDSDVQSAKVDSMHTLAACGQTYSHKELAEFLPGLWS 311

Query: 311  SVKEILFTSIGQPSLSINLESLSSPSFEGNEITTEALRLLQKMVVESNGLFLRLIINDED 370
            S++  +F +  +   S  L +LSS       ++  +  +L     +S  +FL L++  + 
Sbjct: 312  SIRREVFQTASERVESAGLSALSS------LVSCLSRSVLNSDSEDSLQVFLNLVLKSD- 371

Query: 371  IKDIFSILNIYTCYNDFPLHSRQRLNAVGHILYKSANASLASCGHVFESFFPRLLDIVGI 430
                      + C  D  L     +     +L  +A AS  +     ++  P LLD    
Sbjct: 372  -------CQHHLCEPDLKL-----VWPSAKLLQAAAGASYRASLIGTQAVIPALLD---- 431

Query: 431  SADQPHNNKISPKNFNFGALYLCIELLVACRDLIASSDEHICF-----VKEKLYGMLQTF 490
                 +NN+          L   ++  V    L   +D   C       +E +    Q  
Sbjct: 432  ----QYNNRTQCAQRR--TLLEVLQGFVQPTPLSRPADGVSCISTHTHEEESVLVAFQQS 491

Query: 491  SCSMVHLLNSIFPVIVKKDLHDAEFYCAVK---GLQNLATFPVGSSPVSRVIFEDILLGL 550
             C++V   +++        +       A+    GL +          ++R+I E+    +
Sbjct: 492  LCTVV--FSALSETSAGLQVTATRVLTALSQQPGLLSQTDVENAVDHLTRLILEEEEAQV 551

Query: 551  -MSFITADFKFASLWNHALNSLQHIGSFVDKYPESLELQSFMHVVVEKIASMFSLHEETL 610
             ++ +      A L  HA      +   + +  E + L   +H V+EK  S  SL+E + 
Sbjct: 552  SLAVVECSGSLAHLHPHAF-----VSRMIPQLKEKI-LSGRVHTVMEKTLS-GSLYECSG 611

Query: 611  PLSIKLELALNIGRTGRSYMLKIVQGIEEATFFHLSEVYVNGNSKSVEILLSLLDCYSTK 670
             +  +   AL       S    +VQ         L+  +      SVE ++++  C S +
Sbjct: 612  AVRRRCVAAL----ASVSSQPSVVQESSPVLLQVLTSAHTGCCGFSVEEVIAV--CISLQ 671

Query: 671  ILPW----LDEVGG-FEEVI----LRIALNIWDQIEKCSVFSALLDKVLLDATMLAMKLS 730
             +       + +G  F ++I    L + L    Q +     S L D+ +L A +  +  +
Sbjct: 672  RIAVHARDNEAIGQFFHDIIIPRLLGLTLQAALQSKDSGHISPLTDEAVLSAIVPVISTA 731

Query: 731  VRSCSKESQNVVIQKAFDVLLTSNFTPLKLPSSTTVPLQMEGLQLLKQKDSPLCRDEWIL 790
              +   ES + +  +A  + L  + + L        P Q++ LQ       P    + + 
Sbjct: 732  CAALKPESASRMAAQAVSLFLDGDTSFL---PENAFPSQIQPLQSQADSRGP---SQLVC 791

Query: 791  LLFASVTIGLRPQTQIPDV-RSVIHLLMLSITR----GCIPAAQALGSIINKLSLKSDKV 850
            LL A V   L    +IPD+ R ++ L  LS T         A++ +  ++NK        
Sbjct: 792  LLMACV-CSLPRSVEIPDMDRLLVQLEDLSCTSPHLFSYTFASKCIAGLVNKR------- 851

Query: 851  EVSSYVSLEEAIDIIFKTKFRCFHNGSTLAGSEMFLTDLCSSIEKSSLLQVHVVVGLSWI 910
               +  +L   +D + K               E+         E SS  +      L W+
Sbjct: 852  --PAGAALNAVLDRVLKR-----------VSLEL--------EETSSTHRTQAFTLLIWV 911

Query: 911  GKGLLLCGHEKVRDITMVLLECLLSKSRTDASSLQQVILEKDYEPNFDFAIVKGAADAFH 970
             K LLL  H                        L   + +K +    D A+    AD F 
Sbjct: 912  AKALLLRYH-----------------------PLSTALTDKLFSLLSDSALGSLVADGFC 971

Query: 971  ILMSDSEACLNRKFHAIVRPLYKQRFYS-TMMPIFQSLVSKSDASLSRYMLYKAFAHVIT 1030
            +LM+DS   LNR  HA VR +Y+QRF++     + Q   S   A  S Y+  KA +H++ 
Sbjct: 972  VLMNDSPDVLNRDCHADVRIMYRQRFFTENSSKLVQGFNSAEQAKKSCYL--KALSHIVN 1031

Query: 1031 DTPLTAILSDAKKLIPMLLDGLLTLSVNVIDKDVVYGLLLVLSGILTDKNGQEAVTENAH 1090
            + P    L++   L+P+LL+ L     + +D+ V    L  L  +L +     A+     
Sbjct: 1032 NLPREVQLTELPALLPLLLEAL-----SCVDQGVQLSTLSCLQPVLLE--SPAALNTQLE 1040

Query: 1091 KIVDCLARLTSFSHMMLVRETAIQCLVAVSELPHARIYPMRKQVLHAISKALDDPKRAVR 1139
             +   L  LT+ S  M VR  +++C+ A+S LP   + P R +VL A++  LDD KR VR
Sbjct: 1092 ALFTRLLALTT-SPAMKVRMASLRCVHALSRLPEHMVMPFRARVLKALAAPLDDKKRLVR 1040

BLAST of Sed0021872 vs. ExPASy Swiss-Prot
Match: E1BP36 (MMS19 nucleotide excision repair protein homolog OS=Bos taurus OX=9913 GN=MMS19 PE=2 SV=3)

HSP 1 Score: 195.3 bits (495), Expect = 3.8e-48
Identity = 280/1177 (23.79%), Positives = 468/1177 (39.76%), Query Frame = 0

Query: 4    LSSLTQYIESFVDASRTASQQATSLEAIISLTKNNALTIETLVREMGMYLTITDNIIRGR 63
            L +L   ++ FV       QQ    + + +  K+ + T+  +V  +G  L   +   R R
Sbjct: 13   LGTLWGLVQDFV-----MGQQEGPADQVAADVKSGSYTVLQVVEALGSSLENPEPRTRAR 72

Query: 64   GILLLGELLACLASKPLDDATIHSLMTFFTERLADWKALRGALVGCLALMRRKTNVGTVS 123
            GI LL ++L    S  L+   +H L+ F+  RL D   +   +   L  +R  +   T+ 
Sbjct: 73   GIQLLSQVLLQCHSLLLEKEVVH-LILFYENRLKDHHLV---IPSVLQGLRALSLCVTLP 132

Query: 124  QNDAKSVAQSYFQNLQVQSLGQHDRKLSFELLACLLEHYPNAVVSLGDDLVYGICEAIDG 183
               A SV ++ FQ + VQSL Q DR   + ++   +      +  LG D  +G  + +DG
Sbjct: 133  PGLAVSVLKAIFQEVHVQSLPQVDRHTVYSIITNFMRAREEELKGLGADFTFGFIQVMDG 192

Query: 184  EKDPHCLMLTFHIV-ELVAKLFPDPTGTLANSSSDLFEFLGCYFPIHFTHGKEEDVDVRR 243
            EKDP  L++ FHIV +L+++ +     +L     +LFE   CYFPI FT    +   ++R
Sbjct: 193  EKDPRNLLVAFHIVYDLISRDY-----SLGPFVEELFEVTSCYFPIDFTPPPNDPHGIQR 252

Query: 244  NDLSQALMIAFSSTPLFEPFAVPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEKHS 303
             DL  +L    +STP F  F +PLL+EK+ S +  AK+DSL+ L+ C   YG   ++   
Sbjct: 253  EDLILSLRAVLASTPRFAEFLLPLLIEKVDSEILSAKLDSLQTLNACCAVYGQKELKDFL 312

Query: 304  ASIWSSVKEILFTSIGQPSLSINLESLSSPSFEGNEITTEALRLLQKMVVESNGLFLRLI 363
             S+W+S++  +F +  +                   +  E L  L  +    +   LR  
Sbjct: 313  PSLWASIRREVFQTASE------------------RVEAEGLAALNSLTACLSRSVLR-- 372

Query: 364  INDEDIKDIFSILNIYTCYNDFPLHSRQRLNAVGHILYKSANASLASCGHVFESFFPRLL 423
             + ED+ D F    +  C +       + +     +L  +A AS  +C HV  +  P LL
Sbjct: 373  ADAEDLLDSFLSNILQDCRHHLCEPDMKLVWPSAKLLQAAAGASARACDHVTSNVLPLLL 432

Query: 424  DIVGISADQPHNNKISPKNFNFGALYLCIELLVACRDLIASSDEHICFVKEKLYGMLQTF 483
                   +Q H +  S +      + L    L          +  +   K++L       
Sbjct: 433  -------EQFHKHSQSNQRRTILEMILGFLKLQQKWSYEDKDERPLSGFKDQL------- 492

Query: 484  SCSMVHLLNSIFPVIVKKDLHDAEFYCAVKGLQNLATFPVGSSPVSRVIFEDILLGLMSF 543
             CS++ +            L D      + G++ L    +G+ P   +   D+ L +   
Sbjct: 493  -CSLMFMA-----------LTDPNTQLQLVGIRTLTV--LGAQP-DLLSSGDLELAVGHL 552

Query: 544  ITADFKFASLWNHALNSLQHIGSFVDKYPESLELQSFMHVVVEKIASMFSLHEETLPLSI 603
                F      +  + +L+  G+    YP      +F   +V ++A      E       
Sbjct: 553  YRLSFLEEDSQSCRVAALEASGTLATLYP-----MAFSSHLVPRLAEDLCTEES------ 612

Query: 604  KLELALNIGRTGRSYMLKIVQGIEEATFFHLSEVYVNGNSKSVEILLS-LLDCYSTKILP 663
              +LA   G T  S   + +Q +  A   H S V      +++ +LL  L       + P
Sbjct: 613  --DLARADGPTRCSRHPRCLQAL-SAISTHPSIV-----KETLPLLLQHLCQMNRGSVSP 672

Query: 664  WLDEVGGFEEVILRIALNIWDQIEKCSVFSALLDKVLLDATMLAMKLSVRSCSKESQNVV 723
               EV    + + ++A N     E C  F          A    + L V++ + E ++ V
Sbjct: 673  GTSEVIAVCQSLQQVAENCQRDPESCWYFHQ-------TAVPCLLALVVQASAPEKEHSV 732

Query: 724  IQKAF---DVL---------LTSNFTPLKLPSSTT--VPLQMEG-LQLLKQ--------- 783
            ++K     +VL          T++ +P     S    VPL ++G +  L +         
Sbjct: 733  LKKVLLEDEVLATMASVIATATTHLSPDLASQSVAHIVPLFLDGNISFLPENSFSGRFQP 792

Query: 784  -KDSPLCRDEWILLLFASVTIGLRPQTQIPDVRSVI-HLLMLSITRGC----IPAAQALG 843
             +D    +   + LL A V   L    +IP +  ++  LL LS  + C      AA+   
Sbjct: 793  FQDGSSGQRRLVALLMAFV-CSLPRNVEIPQLNRLMGELLELSCCQSCPFSSTAAAKCFA 852

Query: 844  SIINK----------LSLKSDKVEVSSYVSLEEAIDIIFKTKFRCFHNGSTLAGSEMFLT 903
             ++NK          L L  DKVE                +   C     TL        
Sbjct: 853  GLLNKHPAGQQLDEFLQLAVDKVEAG-------------LSSGPCRSQAFTL-------- 912

Query: 904  DLCSSIEKSSLLQVHVVVGLSWIGKGLLLCGHEKVRDITMVLLECLLSKSRTDASSLQQV 963
                               L W+ K L+L  H         L  CL  +       L  +
Sbjct: 913  -------------------LLWVTKALVLRYHP--------LSSCLTER-------LMGL 972

Query: 964  ILEKDYEPNFDFAIVKGAADAFHILMSDSEACLNRKFHAIVRPLYKQRFYSTMMPIFQSL 1023
            + + +  P         AAD F +LMSD    L R  HA VR +++QRF++  +P     
Sbjct: 973  LSDPELGP--------AAADGFSLLMSDCTDVLTRAGHAEVRIMFRQRFFTDNVPALVRG 1027

Query: 1024 VSKSDASLSRYMLYKAFAHVITDTPLTAILSDAKKLIPMLLDGLLTLSVNVIDKDVVYGL 1083
               +   +    L K  +HV+   P   +L +   L+ +LL+ L     +  D  V    
Sbjct: 1033 FHAAPQDVKPNYL-KGLSHVLNRLPKPVLLPELPTLLSLLLEAL-----SCPDSVVQLST 1027

Query: 1084 LLVLSGILTDKNGQEAVTENAHKIVDCLARLTSFSHMMLVRETAIQCLVAVSELPHARIY 1139
            L  L  +L +    + ++ +   ++     L S S  M VR  A+QC+ A++ LP   + 
Sbjct: 1093 LSCLQPLLLE--APQVMSLHVDTLITKFLNL-SASPSMAVRIAALQCMHALTRLPTPVLL 1027

BLAST of Sed0021872 vs. ExPASy TrEMBL
Match: A0A6J1GPG0 (MMS19 nucleotide excision repair protein OS=Cucurbita moschata OX=3662 GN=LOC111455939 PE=3 SV=1)

HSP 1 Score: 1892.5 bits (4901), Expect = 0.0e+00
Identity = 968/1144 (84.62%), Positives = 1039/1144 (90.82%), Query Frame = 0

Query: 1    MGELSSLTQYIESFVDASRTASQQATSLEAIISLTKNNALTIETLVREMGMYLTITDNII 60
            M ELS LTQYIESFVD S T SQQATSLEAIISL KNN +TI+TLV EMGMYLTITD+II
Sbjct: 1    MAELSKLTQYIESFVDVSHTPSQQATSLEAIISLVKNNVVTIKTLVTEMGMYLTITDHII 60

Query: 61   RGRGILLLGELLACLASKPLDDATIHSLMTFFTERLADWKALRGALVGCLALMRRKTNVG 120
            RGRGILLLGE+L CLASKPLDDATIHSLMTFFTERLADWKALRGAL+GCLALMRRKT VG
Sbjct: 61   RGRGILLLGEVLTCLASKPLDDATIHSLMTFFTERLADWKALRGALIGCLALMRRKTEVG 120

Query: 121  TVSQNDAKSVAQSYFQNLQVQSLGQHDRKLSFELLACLLEHYPNAVVSLGDDLVYGICEA 180
             VSQNDAKS AQSYFQNLQVQSLGQHDRKLSFELL CLLEHYP+AVVSLGDDLVYGICEA
Sbjct: 121  AVSQNDAKSFAQSYFQNLQVQSLGQHDRKLSFELLVCLLEHYPDAVVSLGDDLVYGICEA 180

Query: 181  IDGEKDPHCLMLTFHIVELVAKLFPDPTGTLANSSSDLFEFLGCYFPIHFTHGKEEDVDV 240
            IDGEKDPHCLMLTFHIVELVAKLFPDPTGTLA+SSSDLFEFLGCYFPIHFTHGKEEDVDV
Sbjct: 181  IDGEKDPHCLMLTFHIVELVAKLFPDPTGTLASSSSDLFEFLGCYFPIHFTHGKEEDVDV 240

Query: 241  RRNDLSQALMIAFSSTPLFEPFAVPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEK 300
             RNDLS+ALM+AFSS PLFEPFA+PLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEK
Sbjct: 241  TRNDLSRALMMAFSSNPLFEPFAIPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEK 300

Query: 301  HSASIWSSVKEILFTSIGQPSLSINLESLSSPSFEGNEITTEALRLLQKMVVESNGLFLR 360
            HS +IWSSVKEI+FTSI QPSLS NLESL SPSF+GNE+  EALRLLQKMVVESNG FLR
Sbjct: 301  HSEAIWSSVKEIIFTSIEQPSLSFNLESLDSPSFQGNEMIIEALRLLQKMVVESNGSFLR 360

Query: 361  LIINDEDIKDIFSILNIYTCYNDFPLHSRQRLNAVGHILYKSANASLASCGHVFESFFPR 420
            LIINDEDIK+I + LNIYTCYND PL SRQRLNAVGHILYKSANAS+ASC HVFESFFP 
Sbjct: 361  LIINDEDIKEILNSLNIYTCYNDLPLQSRQRLNAVGHILYKSANASVASCNHVFESFFPC 420

Query: 421  LLDIVGISADQPHNNKISP-KNFNFGALYLCIELLVACRDLIASSDEHICFVKEKLYGML 480
            LLD VGIS DQ  N KISP +NFNFGALYLCIELL ACRDL AS DE  C VKEK Y ML
Sbjct: 421  LLDFVGISVDQSDNYKISPSRNFNFGALYLCIELLAACRDLYASCDEQTCSVKEKSYNML 480

Query: 481  QTFSCSMVHLLNSIFPVIVKKDLHDAEFYCAVKGLQNLATFPVGSSPVSRVIFEDILLGL 540
            QTFSC++V LLNS FP I KKDLHDAEFYCAVKGL+NLA FPVGSSP+S V+FEDILLGL
Sbjct: 481  QTFSCALVQLLNSTFPGIAKKDLHDAEFYCAVKGLRNLAIFPVGSSPISSVVFEDILLGL 540

Query: 541  MSFITADFKFASLWNHALNSLQHIGSFVDKYPESLELQSFMHVVVEKIASMFSLHEETLP 600
            MSFIT + +  SLWNHAL +LQHIGSFVD+Y  S+E QS+MHVVVEKIA MFSLH+E LP
Sbjct: 541  MSFITMNLECGSLWNHALKALQHIGSFVDRYHGSVEWQSYMHVVVEKIAPMFSLHDEALP 600

Query: 601  LSIKLELALNIGRTGRSYMLKIVQGIEEATFFHLSEVYVNGNSKSVEILLSLLDCYSTKI 660
            L++KL++A +IGR+GRSYMLKIVQGIEEAT FHLSEVY NGNSKSVEILLSLLDCYSTKI
Sbjct: 601  LTLKLKMASDIGRSGRSYMLKIVQGIEEATSFHLSEVYSNGNSKSVEILLSLLDCYSTKI 660

Query: 661  LPWLDEVGGFEEVILRIALNIWDQIEKCSVFSALLDKVLLDATMLAMKLSVRSCSKESQN 720
            LPW DE G FEEVILRI +NIWDQIEKC VFS  +DK LLD+TM+A+KLSVRSCSKESQN
Sbjct: 661  LPWFDEAGDFEEVILRITINIWDQIEKCLVFSTSMDKALLDSTMMALKLSVRSCSKESQN 720

Query: 721  VVIQKAFDVLLTSNFTPLKLPSSTTVPLQMEGLQLLKQKDSPLCRDEWILLLFASVTIGL 780
            ++IQKAF+VLLTS+F+P K+  STT+P++MEGLQLL+QKDSPL RDEWIL LFASV I L
Sbjct: 721  IIIQKAFNVLLTSSFSPSKVALSTTIPVKMEGLQLLQQKDSPLSRDEWILSLFASVIIAL 780

Query: 781  RPQTQIPDVRSVIHLLMLSITRGCIPAAQALGSIINKLSLKSDKVEVSSYVSLEEAIDII 840
            RPQ  +PDVRSV+ LLMLSITRGCIPAAQALGS+INKLSLKSDKVEVS+YVSLEEAIDII
Sbjct: 781  RPQIHVPDVRSVMRLLMLSITRGCIPAAQALGSMINKLSLKSDKVEVSNYVSLEEAIDII 840

Query: 841  FKTKFRCFHNGSTLAGSEMFLTDLCSSIEKSSLLQVHVVVGLSWIGKGLLLCGHEKVRDI 900
            F TKFRCFHN ST  GSEM LTDLCSSIEK SLL VH VVGLSWIGKGLLL GHEKVRD+
Sbjct: 841  FNTKFRCFHNESTRDGSEMLLTDLCSSIEKGSLLPVHAVVGLSWIGKGLLLFGHEKVRDV 900

Query: 901  TMVLLECLLSKSRTDASSLQQVILEKDYEPNFDFAIVKGAADAFHILMSDSEACLNRKFH 960
            TMV L+CL+SKSRTDAS LQ+VILEKD E N DF ++ GAADAFHILMSDSEACLNRKFH
Sbjct: 901  TMVFLQCLVSKSRTDASPLQKVILEKDCETNLDFGVMNGAADAFHILMSDSEACLNRKFH 960

Query: 961  AIVRPLYKQRFYSTMMPIFQSLVSKSDASLSRYMLYKAFAHVITDTPLTAILSDAKKLIP 1020
            AI+RPLYKQRF+STMMPIFQSLVSKSD SLSRYMLY+AFAHVI+DTPLTAI+SDAKKLIP
Sbjct: 961  AILRPLYKQRFFSTMMPIFQSLVSKSDESLSRYMLYEAFAHVISDTPLTAIMSDAKKLIP 1020

Query: 1021 MLLDGLLTLSVNVIDKDVVYGLLLVLSGILTDKNGQEAVTENAHKIVDCLARLTSFSHMM 1080
            MLLDGLL LSVN+I+KDVVY LLLVLSGIL DKN QEAVTENAHKIVDCLA LT+F HMM
Sbjct: 1021 MLLDGLLALSVNIINKDVVYSLLLVLSGILMDKNVQEAVTENAHKIVDCLAGLTAFPHMM 1080

Query: 1081 LVRETAIQCLVAVSELPHARIYPMRKQVLHAISKALDDPKRAVRLEAVRARQAWASIASR 1140
            LVRET+IQCLVAVSELPHARIYPMR QVLHAISKALDDPKRAVR EAVR RQAWASIASR
Sbjct: 1081 LVRETSIQCLVAVSELPHARIYPMRMQVLHAISKALDDPKRAVRQEAVRCRQAWASIASR 1140

Query: 1141 SLHF 1144
            SL+F
Sbjct: 1141 SLNF 1144

BLAST of Sed0021872 vs. ExPASy TrEMBL
Match: A0A1S3CGY0 (MMS19 nucleotide excision repair protein OS=Cucumis melo OX=3656 GN=LOC103500778 PE=3 SV=1)

HSP 1 Score: 1892.1 bits (4900), Expect = 0.0e+00
Identity = 959/1144 (83.83%), Positives = 1050/1144 (91.78%), Query Frame = 0

Query: 1    MGELSSLTQYIESFVDASRTASQQATSLEAIISLTKNNALTIETLVREMGMYLTITDNII 60
            M +L  LTQY+ESFVD SRT SQQATSLE I SL KNN LTIETLVREMGMYLTITDNII
Sbjct: 1    MADLCKLTQYVESFVDVSRTPSQQATSLETITSLVKNNVLTIETLVREMGMYLTITDNII 60

Query: 61   RGRGILLLGELLACLASKPLDDATIHSLMTFFTERLADWKALRGALVGCLALMRRKTNVG 120
            RGRGILLLGELLACL SKPLD ATIHSL+ FFTERLADWKALRGALVGCLALMRRKTNVG
Sbjct: 61   RGRGILLLGELLACLTSKPLDSATIHSLIAFFTERLADWKALRGALVGCLALMRRKTNVG 120

Query: 121  TVSQNDAKSVAQSYFQNLQVQSLGQHDRKLSFELLACLLEHYPNAVVSLGDDLVYGICEA 180
            T+SQNDAKSVAQSYFQNLQVQSLGQHDRKLSFELLACLLEHYP+AVVSLGDDLVYGICEA
Sbjct: 121  TISQNDAKSVAQSYFQNLQVQSLGQHDRKLSFELLACLLEHYPDAVVSLGDDLVYGICEA 180

Query: 181  IDGEKDPHCLMLTFHIVELVAKLFPDPTGTLANSSSDLFEFLGCYFPIHFTHGKEEDVDV 240
            IDGEKDPHCL+LTF IVELVAKLFPDP+GTLA+SSSDLFEFLGCYFPIHFTHGKEED+DV
Sbjct: 181  IDGEKDPHCLLLTFRIVELVAKLFPDPSGTLASSSSDLFEFLGCYFPIHFTHGKEEDIDV 240

Query: 241  RRNDLSQALMIAFSSTPLFEPFAVPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEK 300
            RRNDLSQALM AFSSTPLFEPFA+PLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRM+K
Sbjct: 241  RRNDLSQALMRAFSSTPLFEPFAIPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMKK 300

Query: 301  HSASIWSSVKEILFTSIGQPSLSINLESLSSPSFEGNEITTEALRLLQKMVVESNGLFLR 360
            HS +IWSSVKEI+FTSIGQP+LSIN ESL+SPSF+ NE+TTEALRLLQKMVVESNGLFL 
Sbjct: 301  HSEAIWSSVKEIIFTSIGQPNLSINTESLNSPSFQENEMTTEALRLLQKMVVESNGLFLT 360

Query: 361  LIINDEDIKDIFSILNIYTCYNDFPLHSRQRLNAVGHILYKSANASLASCGHVFESFFPR 420
            LIINDEDIKDIF+ILNIYTCYND+PL SRQRLNAVGHILY SA+AS+ASC HVFES+F R
Sbjct: 361  LIINDEDIKDIFNILNIYTCYNDYPLQSRQRLNAVGHILYTSASASVASCDHVFESYFHR 420

Query: 421  LLDIVGISADQPHNNKISPK-NFNFGALYLCIELLVACRDLIASSDEHICFVKEKLYGML 480
            LL+ +GIS DQ HN+KISP  + NFGALYLCIE++ ACRDLIAS+DE+ C VKEK Y ML
Sbjct: 421  LLEFLGISVDQYHNDKISPVISLNFGALYLCIEVIAACRDLIASTDENTCSVKEKSYSML 480

Query: 481  QTFSCSMVHLLNSIFPVIVKKDLHDAEFYCAVKGLQNLATFPVGSSPVSRVIFEDILLGL 540
            QTFS SMV LL+S FP IVK+DLHDAEF+CAVKGL NL+TFPVGSSPVSRVIFEDILL  
Sbjct: 481  QTFSRSMVQLLSSTFPGIVKQDLHDAEFHCAVKGLLNLSTFPVGSSPVSRVIFEDILLEF 540

Query: 541  MSFITADFKFASLWNHALNSLQHIGSFVDKYPESLELQSFMHVVVEKIASMFSLHEETLP 600
            MSF+T +FKF SLWNHAL +LQHIGSFVDKYP S++ QS+MH+VVEKIASMFS H+E LP
Sbjct: 541  MSFVTVNFKFGSLWNHALKALQHIGSFVDKYPGSVDSQSYMHIVVEKIASMFSPHDEVLP 600

Query: 601  LSIKLELALNIGRTGRSYMLKIVQGIEEATFFHLSEVYVNGNSKSVEILLSLLDCYSTKI 660
            L +KLE+A++IGRTGRSYMLKIV GIEE  F++LSEVY  GNSKSVEILL+LLDCYSTKI
Sbjct: 601  LILKLEMAVDIGRTGRSYMLKIVGGIEEPIFYNLSEVYAYGNSKSVEILLTLLDCYSTKI 660

Query: 661  LPWLDEVGGFEEVILRIALNIWDQIEKCSVFSALLDKVLLDATMLAMKLSVRSCSKESQN 720
            LPW DE G FEEVILR ALNIWDQIEKCS F+ L+DKVLLDATM+A+KLSVRSCSKESQN
Sbjct: 661  LPWFDEAGDFEEVILRFALNIWDQIEKCSTFNTLMDKVLLDATMMALKLSVRSCSKESQN 720

Query: 721  VVIQKAFDVLLTSNFTPLKLPSSTTVPLQMEGLQLLKQKDSPLCRDEWILLLFASVTIGL 780
            +++QKAF+VLLTS+F+P K+  STT+P+QMEGLQ+L+QKD+P  RDEWIL LFASV I L
Sbjct: 721  IIVQKAFNVLLTSSFSPSKVALSTTIPVQMEGLQILQQKDNPTSRDEWILSLFASVIIAL 780

Query: 781  RPQTQIPDVRSVIHLLMLSITRGCIPAAQALGSIINKLSLKSDKVEVSSYVSLEEAIDII 840
            RPQ  +PDVR +IHLLMLSITRGC+PAAQALGS+INKLS+KSDKVEVSSYVSLEEAIDII
Sbjct: 781  RPQVHVPDVRLIIHLLMLSITRGCVPAAQALGSMINKLSVKSDKVEVSSYVSLEEAIDII 840

Query: 841  FKTKFRCFHNGSTLAGSEMFLTDLCSSIEKSSLLQVHVVVGLSWIGKGLLLCGHEKVRDI 900
            FKT+FRCFHN +T  GS MFLT+LCSSIEK+SLLQVH VVGLSWIGKGLLLCGH+KVRD+
Sbjct: 841  FKTEFRCFHNENTGNGSVMFLTELCSSIEKTSLLQVHAVVGLSWIGKGLLLCGHDKVRDV 900

Query: 901  TMVLLECLLSKSRTDASSLQQVILEKDYEPNFDFAIVKGAADAFHILMSDSEACLNRKFH 960
            TMV L+ L+SKSRTD   LQQ ILEKD E + DFA++KGAA+AFHILMSDSEACLNRKFH
Sbjct: 901  TMVFLQLLVSKSRTDGPPLQQFILEKDNETSLDFAVMKGAAEAFHILMSDSEACLNRKFH 960

Query: 961  AIVRPLYKQRFYSTMMPIFQSLVSKSDASLSRYMLYKAFAHVITDTPLTAILSDAKKLIP 1020
            AIVRPLYKQRF+STMMPIFQ+LVSKSD SLSRYMLY+A+AHVI+DTPLTA+L+DAKK IP
Sbjct: 961  AIVRPLYKQRFFSTMMPIFQTLVSKSDTSLSRYMLYQAYAHVISDTPLTALLTDAKKFIP 1020

Query: 1021 MLLDGLLTLSVNVIDKDVVYGLLLVLSGILTDKNGQEAVTENAHKIVDCLARLTSFSHMM 1080
            MLLDGLLTLSVN I+KDVVY LLLVLSGIL DKNGQEAVTENAHKIVDCLA LT FSHMM
Sbjct: 1021 MLLDGLLTLSVNGINKDVVYSLLLVLSGILMDKNGQEAVTENAHKIVDCLAGLTDFSHMM 1080

Query: 1081 LVRETAIQCLVAVSELPHARIYPMRKQVLHAISKALDDPKRAVRLEAVRARQAWASIASR 1140
            LVRETAIQCLVAVSELPHARIYPMR+QVLH ISKALDDPKRAVR EAVR RQAWASIASR
Sbjct: 1081 LVRETAIQCLVAVSELPHARIYPMRRQVLHTISKALDDPKRAVRQEAVRCRQAWASIASR 1140

Query: 1141 SLHF 1144
            SLHF
Sbjct: 1141 SLHF 1144

BLAST of Sed0021872 vs. ExPASy TrEMBL
Match: A0A6J1JY03 (MMS19 nucleotide excision repair protein OS=Cucurbita maxima OX=3661 GN=LOC111488523 PE=3 SV=1)

HSP 1 Score: 1891.7 bits (4899), Expect = 0.0e+00
Identity = 972/1144 (84.97%), Positives = 1039/1144 (90.82%), Query Frame = 0

Query: 1    MGELSSLTQYIESFVDASRTASQQATSLEAIISLTKNNALTIETLVREMGMYLTITDNII 60
            M ELS L QYIESFVD S T SQQATSLEAIISL KNN +TI+TLV EMGMYLTITD+II
Sbjct: 1    MAELSKLAQYIESFVDVSHTPSQQATSLEAIISLVKNNVVTIKTLVTEMGMYLTITDHII 60

Query: 61   RGRGILLLGELLACLASKPLDDATIHSLMTFFTERLADWKALRGALVGCLALMRRKTNVG 120
            RGRGILLLGE+LACLASKPLDDATIHSLMTFF ERLADWKALRGAL+GCLALMRRK  VG
Sbjct: 61   RGRGILLLGEVLACLASKPLDDATIHSLMTFFIERLADWKALRGALIGCLALMRRKMEVG 120

Query: 121  TVSQNDAKSVAQSYFQNLQVQSLGQHDRKLSFELLACLLEHYPNAVVSLGDDLVYGICEA 180
            TVSQ DAKS AQSYFQNLQVQSLGQHDRKLSFELL CLLEHYP+AVVSLGDDLVYGICEA
Sbjct: 121  TVSQTDAKSFAQSYFQNLQVQSLGQHDRKLSFELLVCLLEHYPDAVVSLGDDLVYGICEA 180

Query: 181  IDGEKDPHCLMLTFHIVELVAKLFPDPTGTLANSSSDLFEFLGCYFPIHFTHGKEEDVDV 240
            IDGEKDPHCLMLTFHIVELVAKLFPDPTGTLA+SSSDLFEFLGCYFPIHFTHGKEEDVDV
Sbjct: 181  IDGEKDPHCLMLTFHIVELVAKLFPDPTGTLASSSSDLFEFLGCYFPIHFTHGKEEDVDV 240

Query: 241  RRNDLSQALMIAFSSTPLFEPFAVPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEK 300
             RNDLSQALM+AFSS PLFEPFA+PLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEK
Sbjct: 241  TRNDLSQALMMAFSSNPLFEPFAIPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEK 300

Query: 301  HSASIWSSVKEILFTSIGQPSLSINLESLSSPSFEGNEITTEALRLLQKMVVESNGLFLR 360
            HS ++WSSVKEI+FTSI QPSLS NLESL SPSF+GNE+  EALRLLQKMVV+SNG FLR
Sbjct: 301  HSEAVWSSVKEIIFTSIEQPSLSFNLESLDSPSFQGNEMIIEALRLLQKMVVKSNGSFLR 360

Query: 361  LIINDEDIKDIFSILNIYTCYNDFPLHSRQRLNAVGHILYKSANASLASCGHVFESFFPR 420
            LIINDEDIK+IF+ILNIYTCYND PL SRQRLNAVGHILYKSANAS+ASC HVFESFF R
Sbjct: 361  LIINDEDIKEIFNILNIYTCYNDLPLQSRQRLNAVGHILYKSANASVASCNHVFESFFLR 420

Query: 421  LLDIVGISADQPHNNKISP-KNFNFGALYLCIELLVACRDLIASSDEHICFVKEKLYGML 480
            LLD VGIS DQ  N KISP +NFNFGALYLCIELL ACRDL AS DE  C VKEK Y ML
Sbjct: 421  LLDFVGISVDQSDNCKISPSRNFNFGALYLCIELLAACRDLYASCDEQTCSVKEKSYNML 480

Query: 481  QTFSCSMVHLLNSIFPVIVKKDLHDAEFYCAVKGLQNLATFPVGSSPVSRVIFEDILLGL 540
            QTFS S+V LLNS FP I KK+LHDAEFYCAVKGL+NLATFPVGSSPVS V+FE+ILLGL
Sbjct: 481  QTFSRSLVQLLNSTFPGIPKKNLHDAEFYCAVKGLRNLATFPVGSSPVSSVVFEEILLGL 540

Query: 541  MSFITADFKFASLWNHALNSLQHIGSFVDKYPESLELQSFMHVVVEKIASMFSLHEETLP 600
            MSFIT + K  SLWNHAL +LQHIGSFVD+Y  S+E QS+MHVVVEKIA MFSLH+E LP
Sbjct: 541  MSFITMNLKCGSLWNHALKALQHIGSFVDRYHGSVEWQSYMHVVVEKIAPMFSLHDEALP 600

Query: 601  LSIKLELALNIGRTGRSYMLKIVQGIEEATFFHLSEVYVNGNSKSVEILLSLLDCYSTKI 660
            L++KL++A +IGR+GRSYMLKIVQGIEEAT FHL+EVY NGNSKSVEILLSLLDCYSTKI
Sbjct: 601  LTLKLKMASDIGRSGRSYMLKIVQGIEEATSFHLTEVYSNGNSKSVEILLSLLDCYSTKI 660

Query: 661  LPWLDEVGGFEEVILRIALNIWDQIEKCSVFSALLDKVLLDATMLAMKLSVRSCSKESQN 720
            LPW DE G FEEVILRI  NIWDQIEKC VFS  +DK LLDATM+A+KLSVRSCSKESQN
Sbjct: 661  LPWFDEAGDFEEVILRITFNIWDQIEKCLVFSTSMDKALLDATMMALKLSVRSCSKESQN 720

Query: 721  VVIQKAFDVLLTSNFTPLKLPSSTTVPLQMEGLQLLKQKDSPLCRDEWILLLFASVTIGL 780
            ++IQKAF+VLLTS+F+P K+  STT+P+QMEGLQLL+QKDSPL RDEWIL LFASV I L
Sbjct: 721  IIIQKAFNVLLTSSFSPSKVALSTTIPVQMEGLQLLQQKDSPLSRDEWILSLFASVIIAL 780

Query: 781  RPQTQIPDVRSVIHLLMLSITRGCIPAAQALGSIINKLSLKSDKVEVSSYVSLEEAIDII 840
            RPQ  +PDVRSV+ LLMLSITRGCIPAAQALGS+INKLSLK DKVEVS+YVSLEEAIDII
Sbjct: 781  RPQIHVPDVRSVMRLLMLSITRGCIPAAQALGSMINKLSLKFDKVEVSNYVSLEEAIDII 840

Query: 841  FKTKFRCFHNGSTLAGSEMFLTDLCSSIEKSSLLQVHVVVGLSWIGKGLLLCGHEKVRDI 900
            F TKFRCFHNGST  GSEM LTDLCSSIEK SLL VHVVVGLSWIGKGLLL GHEKVRD+
Sbjct: 841  FNTKFRCFHNGSTRDGSEMLLTDLCSSIEKGSLLPVHVVVGLSWIGKGLLLFGHEKVRDV 900

Query: 901  TMVLLECLLSKSRTDASSLQQVILEKDYEPNFDFAIVKGAADAFHILMSDSEACLNRKFH 960
            TMV L+CL+SKSRTDAS LQ+VILEKD E N DFA++  AADAFHILMSDSEACLNRKFH
Sbjct: 901  TMVFLQCLVSKSRTDASPLQKVILEKDCETNLDFAVMNCAADAFHILMSDSEACLNRKFH 960

Query: 961  AIVRPLYKQRFYSTMMPIFQSLVSKSDASLSRYMLYKAFAHVITDTPLTAILSDAKKLIP 1020
            AI+RPLYKQRF+STMMPIFQSLVSKSD SLSRYMLY+AFAHVI+DTPLTAILSDAKKLIP
Sbjct: 961  AILRPLYKQRFFSTMMPIFQSLVSKSDESLSRYMLYEAFAHVISDTPLTAILSDAKKLIP 1020

Query: 1021 MLLDGLLTLSVNVIDKDVVYGLLLVLSGILTDKNGQEAVTENAHKIVDCLARLTSFSHMM 1080
            MLLDGLL LSVN+I+KDVVY LLLVLSGIL DKNGQE VTENAHKIVDCLA LT+F HMM
Sbjct: 1021 MLLDGLLALSVNIINKDVVYSLLLVLSGILMDKNGQEVVTENAHKIVDCLAGLTAFPHMM 1080

Query: 1081 LVRETAIQCLVAVSELPHARIYPMRKQVLHAISKALDDPKRAVRLEAVRARQAWASIASR 1140
            LVRETAIQCLVAVSELPHARIYPMR QVLHAISKALDDPKRAVR EAVR RQAWASIASR
Sbjct: 1081 LVRETAIQCLVAVSELPHARIYPMRMQVLHAISKALDDPKRAVRQEAVRCRQAWASIASR 1140

Query: 1141 SLHF 1144
            SL+F
Sbjct: 1141 SLNF 1144

BLAST of Sed0021872 vs. ExPASy TrEMBL
Match: A0A6J1GN60 (MMS19 nucleotide excision repair protein OS=Cucurbita moschata OX=3662 GN=LOC111455939 PE=3 SV=1)

HSP 1 Score: 1886.7 bits (4886), Expect = 0.0e+00
Identity = 968/1148 (84.32%), Positives = 1039/1148 (90.51%), Query Frame = 0

Query: 1    MGELSSLTQYIESFVDASRTASQQATSLEAIISLTKNNALTIETLVREMGMYLTITDNII 60
            M ELS LTQYIESFVD S T SQQATSLEAIISL KNN +TI+TLV EMGMYLTITD+II
Sbjct: 1    MAELSKLTQYIESFVDVSHTPSQQATSLEAIISLVKNNVVTIKTLVTEMGMYLTITDHII 60

Query: 61   RGRGILLLGELLACLASKPLDDATIHSLMTFFTERLADWKALRGALVGCLALMRRKTNVG 120
            RGRGILLLGE+L CLASKPLDDATIHSLMTFFTERLADWKALRGAL+GCLALMRRKT VG
Sbjct: 61   RGRGILLLGEVLTCLASKPLDDATIHSLMTFFTERLADWKALRGALIGCLALMRRKTEVG 120

Query: 121  TVSQNDAKSVAQSYFQNLQVQSLGQHDRKLSFELLACLLEHYPNAVVSLGDDLVYGICEA 180
             VSQNDAKS AQSYFQNLQVQSLGQHDRKLSFELL CLLEHYP+AVVSLGDDLVYGICEA
Sbjct: 121  AVSQNDAKSFAQSYFQNLQVQSLGQHDRKLSFELLVCLLEHYPDAVVSLGDDLVYGICEA 180

Query: 181  IDGEKDPHCLMLTFHIVELVAKLFPDPTGTLANSSSDLFEFLGCYFPIHFTHGKEEDVDV 240
            IDGEKDPHCLMLTFHIVELVAKLFPDPTGTLA+SSSDLFEFLGCYFPIHFTHGKEEDVDV
Sbjct: 181  IDGEKDPHCLMLTFHIVELVAKLFPDPTGTLASSSSDLFEFLGCYFPIHFTHGKEEDVDV 240

Query: 241  RRNDLSQALMIAFSSTPLFEPFAVPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEK 300
             RNDLS+ALM+AFSS PLFEPFA+PLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEK
Sbjct: 241  TRNDLSRALMMAFSSNPLFEPFAIPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEK 300

Query: 301  HSASIWSSVKEILFTSIGQPSLSINLESLSSPSFEGNEITTEALRLLQKMVVESNGLFLR 360
            HS +IWSSVKEI+FTSI QPSLS NLESL SPSF+GNE+  EALRLLQKMVVESNG FLR
Sbjct: 301  HSEAIWSSVKEIIFTSIEQPSLSFNLESLDSPSFQGNEMIIEALRLLQKMVVESNGSFLR 360

Query: 361  LIINDEDIKDIFSILNIYTCYNDFPLHSRQRLNAVGHILYKSANASLASCGHVFESFFPR 420
            LIINDEDIK+I + LNIYTCYND PL SRQRLNAVGHILYKSANAS+ASC HVFESFFP 
Sbjct: 361  LIINDEDIKEILNSLNIYTCYNDLPLQSRQRLNAVGHILYKSANASVASCNHVFESFFPC 420

Query: 421  LLDIVGISADQPHNNKISP-KNFNFGALYLCIELLVACRDLIASSDEHICFVKEKLYGML 480
            LLD VGIS DQ  N KISP +NFNFGALYLCIELL ACRDL AS DE  C VKEK Y ML
Sbjct: 421  LLDFVGISVDQSDNYKISPSRNFNFGALYLCIELLAACRDLYASCDEQTCSVKEKSYNML 480

Query: 481  QTFSCSMVHLLNSIFPVIVKKDLHDAEFYCAVKGLQNLATFPVGSSPVSRVIFEDILLGL 540
            QTFSC++V LLNS FP I KKDLHDAEFYCAVKGL+NLA FPVGSSP+S V+FEDILLGL
Sbjct: 481  QTFSCALVQLLNSTFPGIAKKDLHDAEFYCAVKGLRNLAIFPVGSSPISSVVFEDILLGL 540

Query: 541  MSFITADFKFASLWNHALNSLQHIGSFVDKYPESLELQSFMHVVVEKIASMFSLHEETLP 600
            MSFIT + +  SLWNHAL +LQHIGSFVD+Y  S+E QS+MHVVVEKIA MFSLH+E LP
Sbjct: 541  MSFITMNLECGSLWNHALKALQHIGSFVDRYHGSVEWQSYMHVVVEKIAPMFSLHDEALP 600

Query: 601  LSIKLELALNIGRTGRSYMLKIVQGIEEATFFHLSEVYVNGNSKSVEILLSLLDCYSTKI 660
            L++KL++A +IGR+GRSYMLKIVQGIEEAT FHLSEVY NGNSKSVEILLSLLDCYSTKI
Sbjct: 601  LTLKLKMASDIGRSGRSYMLKIVQGIEEATSFHLSEVYSNGNSKSVEILLSLLDCYSTKI 660

Query: 661  LPWLDEVGGFEEVILRIALNIWDQIEKCSVFSALLDKVLLDATMLAMKLSVRSCSKESQN 720
            LPW DE G FEEVILRI +NIWDQIEKC VFS  +DK LLD+TM+A+KLSVRSCSKESQN
Sbjct: 661  LPWFDEAGDFEEVILRITINIWDQIEKCLVFSTSMDKALLDSTMMALKLSVRSCSKESQN 720

Query: 721  VVIQKAFDVLLTSNFTPLKLPSSTTVPLQMEGLQLLKQKDSPLCRDEWILLLFASVTIGL 780
            ++IQKAF+VLLTS+F+P K+  STT+P++MEGLQLL+QKDSPL RDEWIL LFASV I L
Sbjct: 721  IIIQKAFNVLLTSSFSPSKVALSTTIPVKMEGLQLLQQKDSPLSRDEWILSLFASVIIAL 780

Query: 781  RPQTQIPDVRSVIHLLMLSITRGCIPAAQALGSIINKLSLKSDKVEVSSYVSLEEAIDII 840
            RPQ  +PDVRSV+ LLMLSITRGCIPAAQALGS+INKLSLKSDKVEVS+YVSLEEAIDII
Sbjct: 781  RPQIHVPDVRSVMRLLMLSITRGCIPAAQALGSMINKLSLKSDKVEVSNYVSLEEAIDII 840

Query: 841  FKTKFRCFHNGSTLAGSEMFLTDLCSSIEKSSLLQVHVVVGLSWIGKGLLLCGHEKVRDI 900
            F TKFRCFHN ST  GSEM LTDLCSSIEK SLL VH VVGLSWIGKGLLL GHEKVRD+
Sbjct: 841  FNTKFRCFHNESTRDGSEMLLTDLCSSIEKGSLLPVHAVVGLSWIGKGLLLFGHEKVRDV 900

Query: 901  TMVLLECLLSKSRTDASSLQQVILEKDYEPNFDFAIVKGAADAFHILMSDSEACLNRKFH 960
            TMV L+CL+SKSRTDAS LQ+VILEKD E N DF ++ GAADAFHILMSDSEACLNRKFH
Sbjct: 901  TMVFLQCLVSKSRTDASPLQKVILEKDCETNLDFGVMNGAADAFHILMSDSEACLNRKFH 960

Query: 961  AIVRPLYKQRFYSTMMPIFQSLVSKSDASLSRYMLYKAFAHVITDTPLTAILSDAKK--- 1020
            AI+RPLYKQRF+STMMPIFQSLVSKSD SLSRYMLY+AFAHVI+DTPLTAI+SDAKK   
Sbjct: 961  AILRPLYKQRFFSTMMPIFQSLVSKSDESLSRYMLYEAFAHVISDTPLTAIMSDAKKVSL 1020

Query: 1021 -LIPMLLDGLLTLSVNVIDKDVVYGLLLVLSGILTDKNGQEAVTENAHKIVDCLARLTSF 1080
             LIPMLLDGLL LSVN+I+KDVVY LLLVLSGIL DKN QEAVTENAHKIVDCLA LT+F
Sbjct: 1021 NLIPMLLDGLLALSVNIINKDVVYSLLLVLSGILMDKNVQEAVTENAHKIVDCLAGLTAF 1080

Query: 1081 SHMMLVRETAIQCLVAVSELPHARIYPMRKQVLHAISKALDDPKRAVRLEAVRARQAWAS 1140
             HMMLVRET+IQCLVAVSELPHARIYPMR QVLHAISKALDDPKRAVR EAVR RQAWAS
Sbjct: 1081 PHMMLVRETSIQCLVAVSELPHARIYPMRMQVLHAISKALDDPKRAVRQEAVRCRQAWAS 1140

Query: 1141 IASRSLHF 1144
            IASRSL+F
Sbjct: 1141 IASRSLNF 1148

BLAST of Sed0021872 vs. ExPASy TrEMBL
Match: A0A6J1JNP3 (MMS19 nucleotide excision repair protein OS=Cucurbita maxima OX=3661 GN=LOC111488523 PE=3 SV=1)

HSP 1 Score: 1885.9 bits (4884), Expect = 0.0e+00
Identity = 972/1148 (84.67%), Positives = 1039/1148 (90.51%), Query Frame = 0

Query: 1    MGELSSLTQYIESFVDASRTASQQATSLEAIISLTKNNALTIETLVREMGMYLTITDNII 60
            M ELS L QYIESFVD S T SQQATSLEAIISL KNN +TI+TLV EMGMYLTITD+II
Sbjct: 1    MAELSKLAQYIESFVDVSHTPSQQATSLEAIISLVKNNVVTIKTLVTEMGMYLTITDHII 60

Query: 61   RGRGILLLGELLACLASKPLDDATIHSLMTFFTERLADWKALRGALVGCLALMRRKTNVG 120
            RGRGILLLGE+LACLASKPLDDATIHSLMTFF ERLADWKALRGAL+GCLALMRRK  VG
Sbjct: 61   RGRGILLLGEVLACLASKPLDDATIHSLMTFFIERLADWKALRGALIGCLALMRRKMEVG 120

Query: 121  TVSQNDAKSVAQSYFQNLQVQSLGQHDRKLSFELLACLLEHYPNAVVSLGDDLVYGICEA 180
            TVSQ DAKS AQSYFQNLQVQSLGQHDRKLSFELL CLLEHYP+AVVSLGDDLVYGICEA
Sbjct: 121  TVSQTDAKSFAQSYFQNLQVQSLGQHDRKLSFELLVCLLEHYPDAVVSLGDDLVYGICEA 180

Query: 181  IDGEKDPHCLMLTFHIVELVAKLFPDPTGTLANSSSDLFEFLGCYFPIHFTHGKEEDVDV 240
            IDGEKDPHCLMLTFHIVELVAKLFPDPTGTLA+SSSDLFEFLGCYFPIHFTHGKEEDVDV
Sbjct: 181  IDGEKDPHCLMLTFHIVELVAKLFPDPTGTLASSSSDLFEFLGCYFPIHFTHGKEEDVDV 240

Query: 241  RRNDLSQALMIAFSSTPLFEPFAVPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEK 300
             RNDLSQALM+AFSS PLFEPFA+PLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEK
Sbjct: 241  TRNDLSQALMMAFSSNPLFEPFAIPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEK 300

Query: 301  HSASIWSSVKEILFTSIGQPSLSINLESLSSPSFEGNEITTEALRLLQKMVVESNGLFLR 360
            HS ++WSSVKEI+FTSI QPSLS NLESL SPSF+GNE+  EALRLLQKMVV+SNG FLR
Sbjct: 301  HSEAVWSSVKEIIFTSIEQPSLSFNLESLDSPSFQGNEMIIEALRLLQKMVVKSNGSFLR 360

Query: 361  LIINDEDIKDIFSILNIYTCYNDFPLHSRQRLNAVGHILYKSANASLASCGHVFESFFPR 420
            LIINDEDIK+IF+ILNIYTCYND PL SRQRLNAVGHILYKSANAS+ASC HVFESFF R
Sbjct: 361  LIINDEDIKEIFNILNIYTCYNDLPLQSRQRLNAVGHILYKSANASVASCNHVFESFFLR 420

Query: 421  LLDIVGISADQPHNNKISP-KNFNFGALYLCIELLVACRDLIASSDEHICFVKEKLYGML 480
            LLD VGIS DQ  N KISP +NFNFGALYLCIELL ACRDL AS DE  C VKEK Y ML
Sbjct: 421  LLDFVGISVDQSDNCKISPSRNFNFGALYLCIELLAACRDLYASCDEQTCSVKEKSYNML 480

Query: 481  QTFSCSMVHLLNSIFPVIVKKDLHDAEFYCAVKGLQNLATFPVGSSPVSRVIFEDILLGL 540
            QTFS S+V LLNS FP I KK+LHDAEFYCAVKGL+NLATFPVGSSPVS V+FE+ILLGL
Sbjct: 481  QTFSRSLVQLLNSTFPGIPKKNLHDAEFYCAVKGLRNLATFPVGSSPVSSVVFEEILLGL 540

Query: 541  MSFITADFKFASLWNHALNSLQHIGSFVDKYPESLELQSFMHVVVEKIASMFSLHEETLP 600
            MSFIT + K  SLWNHAL +LQHIGSFVD+Y  S+E QS+MHVVVEKIA MFSLH+E LP
Sbjct: 541  MSFITMNLKCGSLWNHALKALQHIGSFVDRYHGSVEWQSYMHVVVEKIAPMFSLHDEALP 600

Query: 601  LSIKLELALNIGRTGRSYMLKIVQGIEEATFFHLSEVYVNGNSKSVEILLSLLDCYSTKI 660
            L++KL++A +IGR+GRSYMLKIVQGIEEAT FHL+EVY NGNSKSVEILLSLLDCYSTKI
Sbjct: 601  LTLKLKMASDIGRSGRSYMLKIVQGIEEATSFHLTEVYSNGNSKSVEILLSLLDCYSTKI 660

Query: 661  LPWLDEVGGFEEVILRIALNIWDQIEKCSVFSALLDKVLLDATMLAMKLSVRSCSKESQN 720
            LPW DE G FEEVILRI  NIWDQIEKC VFS  +DK LLDATM+A+KLSVRSCSKESQN
Sbjct: 661  LPWFDEAGDFEEVILRITFNIWDQIEKCLVFSTSMDKALLDATMMALKLSVRSCSKESQN 720

Query: 721  VVIQKAFDVLLTSNFTPLKLPSSTTVPLQMEGLQLLKQKDSPLCRDEWILLLFASVTIGL 780
            ++IQKAF+VLLTS+F+P K+  STT+P+QMEGLQLL+QKDSPL RDEWIL LFASV I L
Sbjct: 721  IIIQKAFNVLLTSSFSPSKVALSTTIPVQMEGLQLLQQKDSPLSRDEWILSLFASVIIAL 780

Query: 781  RPQTQIPDVRSVIHLLMLSITRGCIPAAQALGSIINKLSLKSDKVEVSSYVSLEEAIDII 840
            RPQ  +PDVRSV+ LLMLSITRGCIPAAQALGS+INKLSLK DKVEVS+YVSLEEAIDII
Sbjct: 781  RPQIHVPDVRSVMRLLMLSITRGCIPAAQALGSMINKLSLKFDKVEVSNYVSLEEAIDII 840

Query: 841  FKTKFRCFHNGSTLAGSEMFLTDLCSSIEKSSLLQVHVVVGLSWIGKGLLLCGHEKVRDI 900
            F TKFRCFHNGST  GSEM LTDLCSSIEK SLL VHVVVGLSWIGKGLLL GHEKVRD+
Sbjct: 841  FNTKFRCFHNGSTRDGSEMLLTDLCSSIEKGSLLPVHVVVGLSWIGKGLLLFGHEKVRDV 900

Query: 901  TMVLLECLLSKSRTDASSLQQVILEKDYEPNFDFAIVKGAADAFHILMSDSEACLNRKFH 960
            TMV L+CL+SKSRTDAS LQ+VILEKD E N DFA++  AADAFHILMSDSEACLNRKFH
Sbjct: 901  TMVFLQCLVSKSRTDASPLQKVILEKDCETNLDFAVMNCAADAFHILMSDSEACLNRKFH 960

Query: 961  AIVRPLYKQRFYSTMMPIFQSLVSKSDASLSRYMLYKAFAHVITDTPLTAILSDAKK--- 1020
            AI+RPLYKQRF+STMMPIFQSLVSKSD SLSRYMLY+AFAHVI+DTPLTAILSDAKK   
Sbjct: 961  AILRPLYKQRFFSTMMPIFQSLVSKSDESLSRYMLYEAFAHVISDTPLTAILSDAKKVSL 1020

Query: 1021 -LIPMLLDGLLTLSVNVIDKDVVYGLLLVLSGILTDKNGQEAVTENAHKIVDCLARLTSF 1080
             LIPMLLDGLL LSVN+I+KDVVY LLLVLSGIL DKNGQE VTENAHKIVDCLA LT+F
Sbjct: 1021 NLIPMLLDGLLALSVNIINKDVVYSLLLVLSGILMDKNGQEVVTENAHKIVDCLAGLTAF 1080

Query: 1081 SHMMLVRETAIQCLVAVSELPHARIYPMRKQVLHAISKALDDPKRAVRLEAVRARQAWAS 1140
             HMMLVRETAIQCLVAVSELPHARIYPMR QVLHAISKALDDPKRAVR EAVR RQAWAS
Sbjct: 1081 PHMMLVRETAIQCLVAVSELPHARIYPMRMQVLHAISKALDDPKRAVRQEAVRCRQAWAS 1140

Query: 1141 IASRSLHF 1144
            IASRSL+F
Sbjct: 1141 IASRSLNF 1148
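
The percentages in an HSP summary line are the identical and positive-scoring column counts divided by the alignment length. As a quick consistency check, the minimal Python sketch below re-derives them for the A0A6J1JNP3 match. The function name `check_hsp_line` and the regular expression are illustrative assumptions, not part of any database tooling; the counts are copied from the summary line of this record.

```python
import re

# Matches fields of the form "Label = matched/aligned (percent%)".
# The label is captured generically, so its spelling does not matter.
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def check_hsp_line(line):
    """Return {label: (matched, aligned, reported_pct, recomputed_pct)}."""
    result = {}
    for label, matched, aligned, pct in FIELD.findall(line):
        m, n = int(matched), int(aligned)
        # Recompute the percentage from the raw counts, rounded to 2 decimals.
        result[label] = (m, n, float(pct), round(100 * m / n, 2))
    return result


# Counts taken from the A0A6J1JNP3 HSP summary in this record.
line = "Identity = 972/1148 (84.67%), Positives = 1039/1148 (90.51%), Query Frame = 0"
stats = check_hsp_line(line)
for label, (m, n, reported, recomputed) in stats.items():
    print(f"{label}: {m}/{n} reported {reported}% recomputed {recomputed}%")
```

The "Query Frame" field has no count/length pair, so the pattern skips it. The same function could be applied to the other HSP summary lines in the record.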

BLAST of Sed0021872 vs. TAIR 10
Match: AT5G48120.1 (ARM repeat superfamily protein )

HSP 1 Score: 991.9 bits (2563), Expect = 4.3e-289
Identity = 546/1156 (47.23%), Positives = 769/1156 (66.52%), Query Frame = 0

Query: 1    MGELSSLTQYIESFVDASRTASQQATSLEAIISLTKNNALTIETLVREMGMYLTITDNII 60
            M E + L Q++E+FVD +R++SQQ  SL+AI S  +N++L+I  LVREM MYLT TDN++
Sbjct: 2    MVEPNQLVQHLETFVDTNRSSSQQDDSLKAIASSLENDSLSITQLVREMEMYLTTTDNLV 61

Query: 61   RGRGILLLGELLACLASKPLDDATIHSLMTFFTERLADWKALRGALVGCLALMRRKTNVG 120
            R RGILLL E+L CL +KPL+D  +H+L+ FF+E+LADW+A+ GALVGCLAL++RK   G
Sbjct: 62   RARGILLLAEILDCLKAKPLNDTIVHTLVGFFSEKLADWRAMCGALVGCLALLKRKDVAG 121

Query: 121  TVSQNDAKSVAQSYFQNLQVQSLGQHDRKLSFELLACLLEHYPNAVVSLGDDLVYGICEA 180
             V+  D +++A+S  QN+QVQ+L  H+RKL+FELL CLL+ +  A++++GD LVY +CEA
Sbjct: 122  VVTDIDVQAMAKSMIQNVQVQALALHERKLAFELLECLLQQHSEAILTMGDLLVYAMCEA 181

Query: 181  IDGEKDPHCLMLTFHIVELVAKLFPDPTGTLANSSSDLFEFLGCYFPIHFTHGKEEDVDV 240
            IDGEKDP CLM+ FH+VEL+A LFP P+G LA+ +SDLFE +GCYFP+HFTH K+++ ++
Sbjct: 182  IDGEKDPQCLMIVFHLVELLAPLFPSPSGPLASDASDLFEVIGCYFPLHFTHTKDDEANI 241

Query: 241  RRNDLSQALMIAFSSTPLFEPFAVPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMEK 300
            RR DLS+ L++A SSTP FEP+A+PLLLEKLSSSLP+AK+DSLK L DC +KYG DRM+K
Sbjct: 242  RREDLSRGLLLAISSTPFFEPYAIPLLLEKLSSSLPVAKVDSLKCLKDCALKYGVDRMKK 301

Query: 301  HSASIWSSVKEILFTSIGQPSLSINLESLSSPSFEGNEITTEALRLLQKMVVESNGLFLR 360
            H  ++WS++K+  ++S G   LS  +ESL+SP FE NEI  +A+ LLQ++V +    FL 
Sbjct: 302  HYGALWSALKDTFYSSTG-THLSFAIESLTSPGFEMNEIHRDAVSLLQRLVKQDIS-FLG 361

Query: 361  LIINDEDIKDIFSILNIYTCYNDFPLHSRQRLNAVGHILYKSANASLASCGHVFESFFPR 420
             +++D  I  +F  +  Y  Y + P  S+  +  +  IL  SA AS+ SC  +FE+ F R
Sbjct: 362  FVVDDTRINTVFDTIYRYPQYKEMPDPSKLEVLVISQILSVSAKASVQSCNIIFEAIFFR 421

Query: 421  LLDIVGI-------SADQPHNNKISPKNFNFGALYLCIELLVACRDLIASSDE--HICFV 480
            L++ +GI          Q  N+ +S + ++ G L+LCIELL A +DLI   +E       
Sbjct: 422  LMNTLGIVEKTSTGDVVQNGNSTVSTRLYH-GGLHLCIELLAASKDLILGFEECSPTSGC 481

Query: 481  KEKLYGMLQTFSCSMVHLLNSIFPVIVKKDLHDAEFYCAVKGLQNLATFPVGSSPVSRVI 540
                  M+++FS  ++ +  S   V    D    + Y  VKGL  +  F  GSSPVSR  
Sbjct: 482  ANSGCSMVKSFSVPLIQVFTS--AVCRSNDDSVVDVYLGVKGLLTMGMFRGGSSPVSRTE 541

Query: 541  FEDILLGLMSFITADFKFASLWNHALNSLQHIGSFVDKYPESLELQSFMHVVVEKIASMF 600
            FE+IL+ L S ITA      +W  AL +L  IGSF+D+Y ES +  S+M +VV+ + S+ 
Sbjct: 542  FENILVTLTSIITAKSGKTVVWELALKALVCIGSFIDRYHESDKAMSYMSIVVDNLVSLA 601

Query: 601  SLHEETLPLSIKLELALNIGRTGRSYMLKIVQGIEEATFFHLSEVYVNGNSKSVEILLSL 660
                  LP  + LE    +  TG  Y+ K+VQG+EEA    LS+ YVNGN +S++    L
Sbjct: 602  CSSHCGLPYQMILEATSEVCSTGPKYVEKMVQGLEEAFCSSLSDFYVNGNFESIDNCSQL 661

Query: 661  LDCYSTKILPWLDEVGGFEEVILRIALNIWDQIEKCSVFSALLD-KVLLDATMLAMKLSV 720
            L C + K+LP + E+ G E++++  A+++W QIE C VFS   + +  ++A M  M+  V
Sbjct: 662  LKCLTNKLLPRVAEIDGLEQLLVHFAISMWKQIEFCGVFSCDFNGREFVEAAMTTMRQVV 721

Query: 721  RSCSKESQNVVIQKAFDVLLTSNFTPLKLPSSTTVPLQMEGLQLLKQKDSPLCRDEWILL 780
                 +SQN +IQKA+ V+     +   LP+  ++PL    L+ L++  S   RDE IL 
Sbjct: 722  GIALVDSQNSIIQKAYSVV-----SSCTLPAMESIPLTFVALEGLQRDLS--SRDELILS 781

Query: 781  LFASVTIGLRPQTQIPDVRSVIHLLMLSITRGCIPAAQALGSIINKLSLKSDKVEVSSYV 840
            LFASV I   P   IPD +S+IHLL++++ +G IPAAQALGS++NKL   S     S   
Sbjct: 782  LFASVIIAASPSASIPDAKSLIHLLLVTLLKGYIPAAQALGSMVNKLGSGSGGTNTSRDC 841

Query: 841  SLEEAIDIIFKTKF----RCFHNGST--LAGSEMFLTDLCSSIEKSSLLQVHVVVGLSWI 900
            SLEEA  IIF   F    +   NGS   + GSE  ++ +C     S  LQ   + GL+WI
Sbjct: 842  SLEEACAIIFHADFASGKKISSNGSAKIIVGSETTMSKICLGYCGSLDLQTRAITGLAWI 901

Query: 901  GKGLLLCGHEKVRDITMVLLECLLSKSRTDASSLQQVILEKDYEPNFDFAIVKGAADAFH 960
            GKGLL+ G+E+V +I +VL+ECL S + +  +                 + +K AADAF 
Sbjct: 902  GKGLLMRGNERVNEIALVLVECLKSNNCSGHA--------------LHPSAMKHAADAFS 961

Query: 961  ILMSDSEACLNRKFHAIVRPLYKQRFYSTMMPIFQSLVSKSDASLSRYMLYKAFAHVITD 1020
            I+MSDSE CLNRKFHA++RPLYKQR +ST++PI +SL+  S  SLSR ML+ A AHVI++
Sbjct: 962  IIMSDSEVCLNRKFHAVIRPLYKQRCFSTIVPILESLIMNSQTSLSRTMLHVALAHVISN 1021

Query: 1021 TPLTAILSDAKKLIPMLLDGLLTLSVNVIDKDVVYGLLLVLSGILTDKNGQEAVTENAHK 1080
             P+T IL + KKL P++L+GL  LS++ ++K+ ++ LLLVLSG LTD  GQ++ ++NAH 
Sbjct: 1022 VPVTVILDNTKKLQPLILEGLSVLSLDSVEKETLFSLLLVLSGTLTDTKGQQSASDNAHI 1081

Query: 1081 IVDCLARLTSFSHMMLVRETAIQCLVAVSELPHARIYPMRKQVLHAISKALDDPKRAVRL 1140
            I++CL +LTS+ H+M+VRET+IQCLVA+ ELPH RIYP R++VL AI K+LDDPKR VR 
Sbjct: 1082 IIECLIKLTSYPHLMVVRETSIQCLVALLELPHRRIYPFRREVLQAIEKSLDDPKRKVRE 1131

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038898520.1 | 0.0e+00 | 85.27 | MMS19 nucleotide excision repair protein homolog isoform X1 [Benincasa hispida]
XP_022953370.1 | 0.0e+00 | 84.62 | MMS19 nucleotide excision repair protein homolog isoform X2 [Cucurbita moschata]
XP_008462417.1 | 0.0e+00 | 83.83 | PREDICTED: MMS19 nucleotide excision repair protein homolog isoform X2 [Cucumis ...
XP_022992063.1 | 0.0e+00 | 84.97 | MMS19 nucleotide excision repair protein homolog isoform X2 [Cucurbita maxima]
KAG6575503.1 | 0.0e+00 | 84.79 | MMS19 nucleotide excision repair protein-like protein, partial [Cucurbita argyro...

Match Name | E-value | Identity (%) | Description
Q0WVF8 | 6.1e-288 | 47.23 | MMS19 nucleotide excision repair protein homolog OS=Arabidopsis thaliana OX=3702...
Q6DCF2 | 5.7e-52 | 23.68 | MMS19 nucleotide excision repair protein homolog OS=Xenopus laevis OX=8355 GN=mm...
Q0V9L1 | 3.1e-50 | 23.75 | MMS19 nucleotide excision repair protein homolog OS=Xenopus tropicalis OX=8364 G...
E7FBU4 | 5.9e-49 | 23.72 | MMS19 nucleotide excision repair protein homolog OS=Danio rerio OX=7955 GN=mms19...
E1BP36 | 3.8e-48 | 23.79 | MMS19 nucleotide excision repair protein homolog OS=Bos taurus OX=9913 GN=MMS19 ...

Match Name | E-value | Identity (%) | Description
A0A6J1GPG0 | 0.0e+00 | 84.62 | MMS19 nucleotide excision repair protein OS=Cucurbita moschata OX=3662 GN=LOC111...
A0A1S3CGY0 | 0.0e+00 | 83.83 | MMS19 nucleotide excision repair protein OS=Cucumis melo OX=3656 GN=LOC103500778...
A0A6J1JY03 | 0.0e+00 | 84.97 | MMS19 nucleotide excision repair protein OS=Cucurbita maxima OX=3661 GN=LOC11148...
A0A6J1GN60 | 0.0e+00 | 84.32 | MMS19 nucleotide excision repair protein OS=Cucurbita moschata OX=3662 GN=LOC111...
A0A6J1JNP3 | 0.0e+00 | 84.67 | MMS19 nucleotide excision repair protein OS=Cucurbita maxima OX=3661 GN=LOC11148...

Match Name | E-value | Identity (%) | Description
AT5G48120.1 | 4.3e-289 | 47.23 | ARM repeat superfamily protein

InterPro
Analysis Name: InterPro Annotations of Chayote (edule) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011989 | Armadillo-like helical | GENE3D | 1.25.10.10 | | coord: 859..1139; e-value: 1.1E-7; score: 32.5
IPR024687 | MMS19, C-terminal | PFAM | PF12460 | MMS19_C | coord: 646..1069; e-value: 1.2E-38; score: 133.4
IPR029240 | MMS19, N-terminal | PFAM | PF14500 | MMS19_N | coord: 46..313; e-value: 9.5E-79; score: 264.6
IPR039920 | DNA repair/transcription protein MET18/MMS19 | PANTHER | PTHR12891 | DNA REPAIR/TRANSCRIPTION PROTEIN MET18/MMS19 | coord: 7..1133
IPR016024 | Armadillo-type fold | SUPERFAMILY | 48371 | ARM repeat | coord: 890..1131
IPR016024 | Armadillo-type fold | SUPERFAMILY | 48371 | ARM repeat | coord: 8..425

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sed0021872.1 | Sed0021872.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006281 | DNA repair
biological_process | GO:0016226 | iron-sulfur cluster assembly
biological_process | GO:0097428 | protein maturation by iron-sulfur cluster transfer
cellular_component | GO:0097361 | CIA complex
cellular_component | GO:0005634 | nucleus