Tan0007314 (gene) Snake gourd v1

Overview
NameTan0007314
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionEndonuclease III homolog
LocationLG10: 14855792 .. 14891244 (-)
RNA-Seq ExpressionTan0007314
SyntenyTan0007314
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTTTTTACTTGTCCTATTAGAACTCCGGCGCTTTCGATTGCATTTGCGCGAAGAATTACATGCAACGGCATGTCGAAAGGATGTTTATCTTCCTTCTCAACTAGCTCAAACAAAGTCCCTCCAGAATCAGGTCCCTAGAAACTGCTACTTGATCTTGTTTCAGTCTCCGTTAAGTTCTTTCTTCATCGTTAGTCTTTGGGGCATTTCTTAACTGGGTCTTCACTGTTTTACGAATTTCAGGTGTCAAGTCTTCCAATGGCGTTTCTGAGTCTGAAACTCGTGTATTCGTGAGGAGAAGAGTGAAAAAGAAGGCAGAAGTTCAAGATAGCGGGCTTGAAGTTGACCCTAATGTCGACGCTAAAGTAAGAGCTTCACCATCTTCTTTTCCGTCTTCCAAACATATCTTTCATTCTGGAAAATGGGTTATTTTAGCTTCAGAAAGTCGTTTTTCTGGTGATTAGCTTGATGAAGTCTATCATCTATGCAGTCATTTTACTTCATTTTATTATTTTTTTTTTCTGTCCATATACCTGATGTTTGGGTTTTGGGTTTTTAACTCCCTGTGCACTTTCTGTTTCACTCACAATTCATTGATGCATTCAGTTAATGCTAAATTAGTTCATAAACGTTCAGAGTAAATTCATACATTAGCATGATATTCTAGCAACTTATTATTTGATAAGATTATGAACCCTCGTAGATAAAGGTCTTACATGGTGGATTCTGGATTCTGTAGCGGAGAGAAACAATAGAATCTTTAGGGGAAAAGAGAGTTATCCTTTGGAAGTGTGGTCCCTCGTTAGATTCTATGTTCCTCTTTGGGTCTCGGTGACGGGGTTCTTTTGTAATTATTCTTAAATATTATTTTGCTTGATTGGAGCCCCTTTTTATAGTTGACTTTTTTTTTGTGGGCTTGTTTTTTTGTACGCCCGTGTATTCTTTCATTTTTATCTCAATGAAAGTTCGATTTTTCATAAAAAAAAAATAAAAATAGAATAAAGGTCTTACATGGTGAAGGTTAATAATTTAAACTTCTAATCTTTTTCCTGTACAGCGCTGCTGTCCTCCTAATATTGAAGATTTTGCATTCAAAAGAACAAAGGATTCCCCTGGATCAAGTAATTTCTTACCTTAATGGCACTTTAGCTTGTCTGAATTTAGCCATTGCTACTGGATGATCTATATACCCCATTTATGTATGAGAACACTCCCATGTAAATTTTTAATCTTCTTGTAAGAAAAATGAAATGAGAAAAATTTATACAAAATAATTATTTCTTAAAATTTGTTTTGAAGGACTATGTAATGTGGATATTGCACAAACTTCTCCTTGATTTTCATTTTAGTTTCTCAAGTGCAATCAATGGTGATTAACTTTTGGTTCAGTTTTACCTAGGTCTCCCTCTTGGGCACAATCCTAAATCTATTTCGTTTTAGGTTTCTGTTGTGGAGAAGGTTTGTAAGAGGCTTTCATCTTGGAAAATGACTTACTCTTATTCAATCAGTTTAGAATGGGATCCCCACTTATTTCCTCTCTCTCGTCATAGTCCCTGCGTCAGTGAGTAAGGATCTCGAAGAAGCTTACGAGAGATTTCCTTTGGAAGGGTGTTGAGGAGAGGAAAGGGGCTCATCTTGTTAATTGGGAGGAGGTGACTAAGCTCGTTGATTTTGGGGGGTTAGGTATAGGTAACTTGAGATCAAGTAAAAAGGCCCTCTTGGCAAGGTGATTGAGGCATTTTCACCTTGATTCCTTTTCTCATTGGCATAAGCTTATTGTGAGCAAGTACGGGCTGCCCCCTTTTGAGTGGACTTCGTATGGAGGTCGAGGCACGTCTAGGAACCCTTGGAAAGGCATCTCCGAGGAATATCCCTTCTTTTCTACCTTTGCCCATTGTTTCGTGGGGGATGGGAGAGATACTTTTTTATAGGAATATAAGTGGTTGGGGGATAAACCCATTTGCTCCATGTTTCCTCGCCTTTATGATTTATCTTCTATGAAAAGTCATTTGGTGGCTGATGTTCTGTCTCATCCTAGGCCTTGTCCCTCGCCTTCGTTGGGTTTCCACCGCCTTTTGTCTGGGTAGATGTCATTGACCTCCTTTCTTTGTTGGGAGAGTTTCATTGTGTTTCGAGCAGGAGGGTTTTTTGCTATTGGAGCCCTAGTTTGTCTACTGGTTTTTCGTGTAAATTCTTTTTTCTGTGCCTTGTTCATCCCTCTTCTCCCTCTTCCCTAGGAGATTCTATCTTTGCCTCTTTATGGAAGGTCAAAATCCCAAAGAAGATTAAGGTTTTTGTTTGGCAGGTTTTACCCGGAAGAATTAATACCTTAGATAGGGTGGTGAGAAAGGTGCCTTCCTTAGTGGGGTCGTCGTCTTGCATTCTTTGTAGGAGAGCTATCGAGGATATTGATCATGTGATTTGGAGCTATGAGTTGGCTTATGATGCGTGGAGTCGTTTCTTGGCAGAGTTTGGCTTTGTTACTGCTAGACAGAAGATTGTCGTGAGATGATAGAGGAGTTCATCCATTTTTCTCCCTTTTGGGAGAGAGGAAAGTTCTTGGGGCAGGCTGGGGTGTGTGCTATTTTGTGGAGGTTTTGGGGTGGGAGAAACAATAGAATCTTTCGTGGGCTTGAGAGATCTCCTAGTGATGTGTGATCCCTTATTTGGTTCTATGTTTCTCTTTGGGTTTCGGTGACTAAGCCTTACTGTAATTATGCAGTAGGTCTTATTTTGCTTGACTGTAGGCCTTTCATGTAGGGTGCCTCCCCTTTTTTTGTGTGCCATCTTTTTGTATGCCCCTGTATTCTTTCATTTTTTCTCAATGAAAGCTCAGTTAATTCTCAAAAAAAAAAAAAAAAACTTTTGGTTCAGTTTGATGTGAGAAGCACACAAAAGATGTTGCCTGGATGTGCATTTTCAAAAAAATAGGAAAATATAATTAGACATCTCAAATATAAGATTAAATGTATCATAACATAACAAGATAACAAAGAATGCATGATTGGATTTAAATATATCTTTTTCTTTTATGCTAATATAGCAGTAGAGGTGGAAATGTTTTAATGTTCACTAATAAAATATATGACCGGCATTAATGTACTTTTAGTTTACACATCGAAGTTTAATGCCCTTATTTACAAGAGATCATAATTGAAAGAAGTAAATCTAGTAAAATGTTCAAAAATCAACCATAAGGTTGCTTTGGTTCACCAAGACCAGAAAGTGGCAAAAAAGTAATCAAAAGGTAACCTAGGCTAGTGCGTGTTTTAAGGTGTGCTATGAGCCTGCGTCTGCTTGGCAGGTGGGGGTTGCTCAACTCACATCGCCTATTGCCATTGGCAGACTCAAGGTGTCACCTCAAAAACACCGATCCATCTTGAATTTGGATAATTTTTTTCATCAAAATGGCTTAGGCCTACTTGCGAAGGGCTGGCCTGACCTGTAAAGAAAAGGCCTAGATGAAGCCAATCTTGAAGGTCTTCTACCGCTGGCAAGGCACAGATTGGCAGCCAACCGGTGGAGAGGTTTGCAGGTCCTTTTGGTCTGCCGACTGCCTGGGGAAGCAATTTTATATTCATGTATGGAAGTTTGCAACAATTGCTTCCCCAGGTTTCGTATACATACTAGATGAGTGTATTGAACCGGAGGATGTACAATGAAGAACAAGTGTGTCAAGCCAACCTAGGAATTAGGGAGGTTTTAAAGCTTTCCAATTAGTCAATATCACAACCTTGGGTGATTATAATAAGCACTTTTCATTAATATTTGATGATTTTCTGAGAGGACAAGATATCTTGTTTGAAAGACGCTTCGCTCACAAAACTAATAATAGTTTGTTTTACCGAATTACTTTTGAGTATGTTTGGAAATGTTAATCGAGTTTCACATTTAAGTTTTCAACTTGTTCTCACACATGCAAGAATGTGTGCATGCTGCCTACACTCGGACGTGCGAAGATACAAAGAGTGGACATGCTTTTACATATATAATATGAAATTATTGACTCCCCGACTCCCTACCTTCTCATCCATCATTGTTTTTTATAAATAGGTACTACAGGGAAGTCAAAGCCTCCACTAGATCCTCTTCTCACAGGAATTGAAGATTCCTATCCAACTAGACAAAAAGGCATTGCAGAAGGAGGTGAGTGATACTTTTTTATTATTTACTTTTTCCTCGGTGGGATTTATGCTGCGTATTTAGCAGAGAGTTGATGTGGTAAGTCAGTGAATAAGAAACAGACTGAATAAGAAAAAGTACATTTCGTTTAATACTTTTTTTAGGAGTTTTCCCCCCTTTATGGATTGGATTATATTATTGGGAGAATTGTGTTAGTGTTTTTACTAGGTTGTGGTAGATACTTTTTTAAAGTAATTATTTAAATTATATTCCTCTCATATCTAAGTACTGCTCTTGTAAATACACATATTCTAGATATAAGAAGGTCAACAATTCACATTATTATTAGTTTTAATAAGCAATGATTTCGTTGAAGTAAATGAAACTCAATAAGAGAGAGACATGTAGGAGTTATAGAAAATATCTCCAATTGATCAATAAAGAAGATAAAGGGTAATTACTGATGAGCCCAGACGTAAGATACCGAGAAGAGTGAAATAAAATAATATTCCAAAACTGGTCCAGACTTTTCGTCATTGATTTTTTTTTTCCATTTCATCCAAGTAAAACATCTAATGCTGCTATTTGATTTCCTTAGATCCTTGGCCATACAAGCCTTGAACCGCAAACTCCTTGGCCATACAAGCCTTGAACCGCAAACTCCTTGGCCAGTAGTATGGGGAGAGGCAAGGCCAAACCAAAATTTCCGGGCAGCAAGAAGAGTTTCTTCTCTTAATTTTTTCATTTGTATTGAGGTCATCTAACAATGAAGACCACATAATAAACTTGCATTTGGTAGGAAAGTTTGGACTTCAAGAACAAATGAGATAAACAAAGCTAGTCACGAATCTTTGACCATTTAGGGGGTTTCTTTGTATATTTATTTTTCATCAATTGGATGTTTTTGTTTCCTAAGGAAAAGCTAAATTATGATATAGGTAAACCACCCATGAATTGGGAAAAAGTCCTTGAAGGAATTCGTGAAATGAGATCCTCTGAAGAAGCTCCAGTAGATACCATGGGATGTGGGCAAGCTGGGAGTACTCTTCCTCCCAAGGTAAAATCCCGATTTTTACTGGACACCCCAGTTTTAGTTTGCATGGTTGCATACTTCTTTTTAAGATCTATTATTATCTCAAGTTTGTACTTAAAATTTTTAGAGCACGGTGGATGTTATCTTGTTAGCCTTTTGTCTTCTTTCATCCCTTGGTACAAAATTAAATCTTTCTTTCATATTCCTAAATGCTATCTCTGGCCCAAACATTTATATCTTTGATGCACCGCGAAATCTATAAATTAACCAGATGTAAAGATGACGTGGTAAGCTTCTTAATCCCAATCAGTTGCCTGTTGTTCTTTAGAAGTAACAACTTGTGAATGATAGAGATGACGGTGGATCCGGCAACCAAATAGTTTTTTCATCCCAGATGTATCAGGGGCACCAAGGAGCCAGCTAAGGTGTTTTTGTAACACACTACACGTTCTATTGTCCTTTCCTCAATGGCCTTGCAGGCACCATCTACTGATGAAGGAGACCCATTTTTCTTTATTAAATTGTATATGATATATGCAGCAATATATAAAATAATAACTACTGCTTTTACTTTATTTTTATTAGGAAAGAAGATTTGCTGTCTTGGCATCTTCTCTTCTCTCAAGCCAAACCAAAGACCACGTGACTCATGGCAAGTTCTTGTACTCTGAAATTATTGCTTGATAGAGAATTTTAATTCAACACATTTTCTTGTAGTCTATCTGAAAATTCTTATGATTACTTGGTCCTCCAAGAACCGAGTAAACAAAAATTGTAAGAGGAGGAATTTGCCTTCTTCTCTGAATCTCTGATGGTTTTGACCATTGCACCATTATGAGAAATTTGGTCCTTGCCACCACCACCATCACTCCTGTAACTTCAACCATGAGTTTATCCTCCATTTCCGGGCATTTCCCCTTTTGTCAGTTATCAAATGCTCCTCATTTTCCAACTTTCCTTCACATGATTTCATGATTCTTGCTGGAACTTTCTCCCTTTGTGCCCAGCTGTTTGTGTGCTTCATTCTCCATTGATCATCTACCCTTCTAGCATATTCAACCTTTCTTTGGATGTTGTTCATGACATCTTGGCTCATTGCCTGTAAACCTACCAACTTGAAGTTATCTTTTACTTGATCCATTTTGATTTGCTAACCTTCAATATCTCATTTAGTTCTCCTTAACCTTTGTGCTCCTCTGGCCATTTTATTCTGGGTTTTCCTCTTTTGTGCAACTCCTTTTCAATGATATGCGAACCTTGGTTGAACCTGTACAGGTGCAACTTTCTTTGACCTGATCTTTTCCTTTTTACCCTTTTCTTCTTCACTCTCGCATATAATCTTCCAGTGTGTTGCTTGCTTTTTCCAGGCTTTGGTCTGGTTAGGGTACTTGTCGTACCCTTAATCTTTTTGTAAATTGCAACCCTTTACTAAATTTAGAATTGATTTTCCCCGAACAGATGTGACAGATATTAGTCTTGAGATAGAAAAGATTGCATAGCTGAGGGGGAAAAGAAAATTGTGAGGCATCTCCTACTTGATAATTGCCTCACTCTCACCCCTCCAACTCACTCTATTTATGATACATCTTTGGCGAACTCCCTAGTTAATTCCTAGTGGGTACCCTAATGACAACCTTATCACAAACACACCCACAGATGTGCAAGAAAAGGCGTGTATTATTTGGTGGAGCCAAAAAGCCTTAGGCAGTATTTTATTAATCTCCAAAAAAGGGGTACAAAAGGGGAAGCTAAGCCTTAGGCAGTATTTTATTAATCTCCAAAAAAGGGGTACAAAAGGGGAAGCCTTAGCCTTTATATATATAGGCTTAGCTATTGGGCTAATGGGCTTTAATTACAAAGGTGGTTAATTATATTAACCACAGTAACTAACTAACAACCTAACTAAGAGAACAAATAAAAATACCTAAGCTAGATTACAAAGAGAACCCTAGATATTCTAACTAACTACATCAATTCCCCCCTTCATGAAAAAGAACTCGTCCTCGAGTTAAATATGAAGTTCATCGGGAGGAAAATAGGTAGATAAATCTGAAATATTGAAGGTGTTAGATATGTTGTATCCTTCAGGGAGTGCCACCACATAGGCATTGCTGCCAATCCTTTTAAGGATCCGACATGGTCCAATTTTCTTCTTTTGCAGTTTGTTATAAGTCCCCCTGGGAAATCTTTCTTTTTTAAGATGTACAAGGACTAAATCACCTTCAGCAAACTCCTTATGCCTCCTTTTAATGTTAGCCAATTGGGCATATTTGGTATTAGCTTCTTGAAGATGGTCTGTGACTTCTTTGTGTATTCTTTGAACCCGATCAGCCATCAATTCAGCTTCTGAACTCAAATCTGCAGCACTAGGCACACTAATAAGATCAAGAGTTAAGCGAGGAGGGTTAGTATATACTACCTCAAAGGGTGTTTTGCCGGTGGATCGATTTTGCATGTTGTTGAAGGCAAATTCAGCTTGTGGTAAACAAATGTCCCATTGTTTTGGCTTATCACCGCTTAAACACCGAATTAAATTTCCCAAAGTTCTATTTATGACTTCGGTTTGACCATCAGTTTGTGGGTGACAAGTGGTGCTATACTTCAAGTGGGTATCAAACTTTCGCCAAAGACACTTCCAAAAATGGCTTAAAAATTTTATATCCCTATCACTAGTAATGGTTTTAGGAATTCCATGTAGCCGAACTACTTCTTTAAAGAAAAGGTTAGCAATATGAATAGCATCATTAGTTTTTCTACATGGTATAAAATGAGCCATTTTGCTAAAACGGTCAACCACCACAAAGGTAGAATCATTACCTCATTGGGTTCGCGGTAAACCCAACACAAAATCCATAGTGAGATCTTCCCAAATTGTTTTTGGAATTGGTAGGGGAGTGTACAAACCTGTATTTTGTCCTTGGCCTTTATTGACTTGGCAAGTTGGACATCTTTGAACAAAATTCTCAACATCCTTTCGAATGTGTGGCCAAAAGAATCGTTTGGAGATTTGTTCATATGTTTTGTCTCGGCCAAAATGTCCTCCTAAGCCACCCGAATGAATATCCCTAAGGATTTGTATCCTCAATGATGTGCGTGGAACACAAAGTACATTATCTTTAAAGAGAAATCCATTAGTCATACTAAAATCTTTATAAGCATGTCCATCCTTGCATTTAAGCCAAATGTCAGAGAAATCTTCATCAAGCTCATATAAATCAACAAGGCCATCAAATGAAATTATGTCACTCTTCAATATATTTAATAGGTGAGATCTTCTACTTAATGCATCAGCTACTTTATTAGTTTGTCCAGATTTGTGCTTGATGGAAAAATCAAAACGTTGGCATGCATCCTAGACAAGTGTTTTTGGGAATTGATATGACGAAGGGATTGATGGTCTGTATATAAAATAAACTCCTTTCCTATCAAGTAATGCTCCCACTGTTTTAAAGCTCTAACCAATGCATAAAGTTCTTGTTCATACGTAGACCATTTTTGTCTTGCTTCACTAAGTTTCTCACTAAAATATTCAATTGGTTTATTTTCTTGACTTAATACTCCACCTATTCTCGAACCCGAAGCATCAACTTCTACAACAAAAAGTTTATCAAAGCTAGGTAATGCTAAAATAGGTGCTGAAGTTAACCTTTGTTTTAGGGTTTCAAAACTGCTTTGTTGATCTTTGTCCCAATGGAACCCTTTCTTCTTAAGACATTCGGTTAAAGGTGATGCTATAGTGCTGAAATTTCGAATAAACCTCCTGTAGAAAGATGCAAGACCATGGAAGCTTTGAACTTCTTTGACATTTGAAGGTGTTGGCCATTCACTTATGGCTTTAGTCTTGAGAGGGTCGACTTCTATGCCATGCTCACCTATTATAAAGCCTAAAAAGACAAGTTTGGGTACTCCAAAAATACATTTCTTCAAGTTAATATACAAGGAATTAGCTTTAAGAGTTGAAAAAACATGCTTTAAATGAATAACATGTTCATCTAGATTTGTGCTATATACCAAGATATCATCAAAATAAACTACCACAAATTTGCCAAAAAATGGTTGTAAAACTTTGTTCATAAGACGCATGAATGTACTTGGTGCATTAGTGAGTCCAAATGGCATTACAAGCCACTCAAACAATCCCTCATTAGTTTTAAAGGCGGTTTTCCACTCATCCCCGGCCTTATACGAATTTGGTGATATCCACTTCGAAGATCAATTTTAGAAAAGATTTTTGAACCATATAATTGATCTAGAAGGTCGGGTAATCTAGGAATTGGGAATCTATATCTAATTGTAATTTTATTTATCGCCCGACTATCAATACACATCCTCCAAGTTCCATCTTTTTTTGGTGTAAGTAGGGCAGGAATCGCACAAGGACTCATACTATGACGTATTAATCCTTTTTTTAAGAGGTCATCAACTTGATCATGCAGAATTTTATATTCAGTTGGACTCATTCTATAATGTGGTAAGTTTGGTAAAGATGCACTAGGAACTAAGTCTATATGATGTTGTATATCTCTCAAAGGTGGAAGTTCACTAGGCAATTCTTGAGGATATAAGTTTTTGAAATCATGCAGTAAATTAGAAATGACTTGTGGAGTTTTAGTGTCTAGAACATCCTCGTTATGACCCTTAATAACAAGTGCAACTAAGTTTTTGTTATGTTCACCTAAAAACTCATTTGTAGGAATAATAGAAAAGGGTTTAGAAGATGTACCCGTTGAATTTACCTCCTTTGGTTTGCTAGGTAGTAAAACTATTTTCTTTCCAAGCCAAGTAAATTCATATGTGTTTTCTCTCCCTTTGTGGGTGGCTGAAAGATCAAATTGCCAAGGTCTTCCAAGCAAGATGTGGCAAGCATCCATGTCGATAACTTCACAAGTGATTTGGTCTTTGTAATGCTTTCCAATGGATAATGGAATAGTGCAAATGTCGGTAATGGCTACCTCATTTCCTTTCTTGATCCATCCTATTTTATAAGGGTTAGGATGAGGTTCTATGTTTAATCCAAGAGCCTTTACTAATGCTTTGGACACAACATTTTCACAACTCCCACTATCTATTATTACATCACACACCTTGTTATTCACGGTGCATCTTGTCTTGAATATTGAGTGCCTTTGAGTTGTATTCACTTCCATTCTTGGATTTAGTAAAAGCCTTTGAAGTACGCAAATTACTCTTTCTCCTTCATCACCTTCTTCAAGAATTTCTGGTTCATCTTCTGAGTTTCTTTCTTCATGTTCGCTGCTGTAATTTTCTTGTTCGGAGCCTTCTTCTACCAAAGCAATCTTTTTTCTTTCCGGACATTCATTAGAAAAATGACCGGTTTGTCCACACCTATAACATTTCTTTGAACTTGATGTTCTTGCATATGGATTTTCTGGTTTGGTGGCTGTTTTCTTGATATTTGTAGGCTACACACTATACCTTTCACTCCCATTCTTTGTGTGGTTTGATCTAGATGCTGTCTTTGATTCTACTCCTTCATAGGCTTCGTTAGTTTGTTCCCAATTTCTGTTAGCTTGACTAGGATCTCTTCTTGAATTAGATATCGTGCGACCAAATGGTGTCCTTCCTCCTCTATTCCAATTTGAATTTCTTGGATTATCATTCCATCTTGGTTGCCCATAGCCTCCACTTCTTTCTTTCTTGATTTTCTTCTCAACCTTCAAGGCCATTGCTACTGCTTCCCGTATTCCTCCTAAAGGAAATAAGTCAATTCGATCTTGAATTTCTTCTCGAAGGCCATGAATGTATCGTGCTACTTGTTGATAGTCGGACTCTGGCAAATCATTTCTTGCATTCAATCGGTACCACTCACCTGTAGACCCATCTACTGACCGGTTATGTTGCTTGCATTCTTGATATGATTGATACAAGATTTGCTCATAATTGTCGGGTAAGAATCTTGCCTTTAACATCTTTCTCATTTTTTCCCAATCTTTCACTGGTTTCTTGCCCATTTTTCTTCTATTTGAGGTTAGTTGGTCCCACCATGCTGCTGCCCCTCCTTTCAATTTGTATTTTACCAATTTAACTTGTCTTTTTTCGGAGATATCCATGCATTCGAAGAAGTCTTCTACTTGTTTGATCCAATCTAAGAAATTTTCCATGTCTAATCGCCCATCAAAGTACGGCAAGTCGATTTTCAGCTTGTATCCTTCGTGCTTCTCTTTATATTTTTGCCTTCTCTTTGTTTTTGAACTGCAATCGCTGCTGTCTTCATCACTTGAAAGGTAAGAACTATCACTGCTTGTTGATGAATCTGATTCTTGAGAATTTTTCTTCTTTTTCTTGGATTTTTTTGTGATTCTTGGAGCTTTTCTTGGATTTTTCTCTGCTGGAATTTTCTGCTTTTTGCCCAGAATTTTGTCCACTTTTTTCTTCTTGATGTTCTGATCCTTTCTTTTTCTTCCTTTCATACTTCTTTGCAAATAGATCAACCGATTCTTCCTCTTTGTTTTCTCCTGTTTGGCCCTCTGATTTTTCTGCAACTTTCTCTTTTCCTTTGTCAATTTGTGCACTGATTTTTGTATGTTCCCTCATTAATCCTCCAAGACTTCTTTGTAGAACTTCAATATTTTCTTCTACACTTGTCATCCGTCCGATTAAACCCTCCAAAGGAAGGATGCGATGTTGGATATCGAGCACAATCGCATAAAGCTCGTTGGAATAGGCCTCTTCCTTACCCGATTCCACATTCTTGGTGTTGGTCATCTTTCTTTGTCGGATCTTGGTCTCTGATACCAATCTTGGTGGAGCCAAAAAGCCTTAGGTAGTATTTTATTAATCTCCAAAAAGGGGTACAAAAGGGGAAGCCTTAGCCTTTATATATATAGGCTTAGCTATTGGGCTAATGGGCTTTAATTACAAAGGTGGTTAATTATATTAACCACAGTAACTAACTAACAACCTAACTAAGAGAACAAATAAAAATACCTAAGCTAGATTACAAAGAGAACCCTAGATAGTCTAACTAACTACATCATTATTCCTTTAGAAAAAGGAAAAACAAAGGTCGTGTGAGCCTTAGAAGCAACAAATTAGAATAGGCTTTTAAACAATAGAACCCACCCGTGTAACACCTTCCTAAACTAAATGGGGCACTTGTGAGTATTCCTGGATAACCTACACAGAGTTCTACGGAACAGGTTGGCTTACTTTTAGTAACTAGAGATGAGCATCTAGTGTGTTCGTATCGATAGCCCTTATTGAGTAAAACTTAATGAAGCCTGCAAGACTATGAACTGTGTTAAAATCTTTATAGGCCAAATGCATGATGTATAGACTATATTTAATATAATACTAAAATGAGTAACGTGAATTGTTAGAAGCATTTGAATATGAATGATTTATGAGCAAAATTGTGAACGTAACAATAAGCTGATAGATACTGCTTTTACACTAAGTTAAAAAAGGATCAGTTTTAAATTGAATTGGGAAGTTTTGCACAATTTATATCACGACCTTGTAGTTTCTAGGTTTGATTCACATAATTATATTGCTTTATTTGTGTGAGTTAATATATGCAGTGCAACAAAAATTTATCTGTTTGTTGTAATATAAAAAATATCAGTTTGTTGTAATATAAAAAAAATCATCAGTATGTTGTACTATGGACCTCCATTCTAAATAAAGCAAGCAGTATATAGTAAAATGCACTAGAAAAAGATTAAAGTACTTTACAGTTGGTAACTACAGAACATAGGTCAGAAATTACTAAACACTGCTTTTCCTGTTTTCACAGTAACTGTTACTGTCATTGGTAGTAAGAGGGTTGGATGTTTTTCTAGAAATCATTTTACCGCTTCAGCCTTCTGTTAAGGAGTAAATTTCTTGCCATATGATGGTGTTTTCTCAGTACTTAAGAATGTTACCCCTGCCATCAAATCCTGTGTCATAAGATACTGAGCAGAAAGAAAGTCTGCTACAAACTTGAGTTGAAGAAATTCTCGTCTCAGCTGCCATTCTGGAACTACAATTTTGAATTCCAGTTTCCTTGGCTTGCTTATTTTATATCGGGGAAGTGTAACAGTAGAATCTTGGCTTCAAATTGTTATGCCACTCTGTAAAATAAATGCGTTTTGAGAAGTGTTGGGATCAGAAAGAACTAGTATTTCATGAAACTGCTGCTGTAAAAGATGGGCCTCTTCTCCTGTACAATTCAGATTACTCACGTACAATTCAGACTACTTCTCCCCTCTCTAATTTAGTTCCTACCCGCCCTGTGTAACTGATCTCCCAACTAACTACGAGCAACCAATAGCCAACTGCTACAGCCTGGTAATTGACAAACTACCTTTGCACTTCCATCACCGACAGTACAAATCGACCCCTCAAGCTGTCTACTCCTGGCATGAGGAAGTCTGTTTGGGTGCTGGAAAGTCTATGAAGAAGCAACAACAGGCTCAGATATATAAGTTTTATCTTCATGATAATAAAACTAGAATGTTGAAAATTAATAGTTTGAAAGTGAACTTTTAGGTCCCAGGGCCATATGTTATTCGTTGTTTCCCAGAGATATTAAGCTCTCTTTTCTAATATTTTCTTTAATTGACAAAAGATCATTCAGGAGTGCAGGAGCGGTATTGCGTCTCCAGGAAAGTGGTCTTCTTACTGCTGATGCCATGGACAAAGCTGATGAAGCAACCATTAAAAGCTTGATTTACCCGGTATATTTTCTTGGAAATTATTGAGCAATTGCACAAATTTTCCCACTGCATTAGAATCACACGAACAGAGAAATCTGAAAAGTAGTAGTTATATTCATATGCCTGTTCCGTTCTTGTTGGGGCACTCCTTGAATTTCTTCTTTTCTCAACGGAAGTTCCATTCTTCCATCAATAAATTTAGGGTTTGCTTGAATCCATCTTGCAGGTTGGATTTTATTCTACAAAGGCTAGGAATTTGAAGAAGATTGCAAAAATATGTCTTATGAAGTATGGTGGGGACATACCTAGATCATTGGAAGAGCTACTTCTACTACCTGGGATAGGTCCTAAGATTGCACATTTGGTATGACAGATTGTGTATTGTTAGAAAAGAAGGAACTTATCTGGGTATGCTTTGCATGACTCCTCATCATGTTTCATATGTGGCATTTAGATCATGATTATGGCATGGAACGATGTTCAGGGGATATGTGTAGATACTCACGTGCATCGCATTTGCAATCGGATTGGATGGGTGTCTGGAAAAGGCTCAAAACAGGTAACTGTGGAATGTGTACTTGACTCCACTTTTAATAACATGGATGGATTCTGACTTCAGATTTTGAATTTCGTTCTTTTCCAAACTACCAAATTTTGCTAATGTTGCTAAACTAAAACTGTTACTGGTAGCAAGAATAAGCCACTATTGAGTATGCGTAAAACATTTTGAGGCGGTACATGTAAACTAGATGTTCTTATTGCTTCTCTTGCCAAGTCGAATGGATTTCCCTTCATCAAATGATTCAAAGTATTCCTATTAATTTTTTCATTTCAATAGATTTAATGCTTTGTGGAAGTTGTCCTCGTCATATTGTACTTGAATTGTCTGAAGCATGTTTTCATGTCCTTTTAGTGAAGATAATAGTGTCATAAAATAAATTTTTCCTTTAGCTGTGTTATGGAGGTTCTTGAAAAAGTGTAGGTAATGAACTAATTGGATTATCATGTTAGCAAGTATTATGCTCATATTGCTAATGTATTACTCTGATTTAGTTTTGACCACGTTGAAGCAAGTAAGGATGTTGGAATTTCTACCATTTAGAAATGTTTTTGTGTAAATATTAATTGATATTTGACCTTTATTTTCTTAATTCTTTTCCATATTTGTAATAAGTCTTTCTCCTATAGAAGAAGACCTAAATTGTACATATTAAGATATGATTGAGTTTATTCTCTTGAAAATTAAGATGGTATCATAGATGGGAAAATAAAAACTCTAGTTTTTTTAGTCGTGATCAAACCCTAGCAGACCTAGCCACTGCTACGGTTGGAACCCTAGCCACTGTTTGAAACGACGCACTCTACTTCTGGCTATATATCTTTGGTAGTAAGTTAGCAAGGTAGGTTGTTGGTATTCTTGTTGTGGTCGTTCGTTGTTGTCACTCGTTGGTATTGTAGTGTTTGCTGGTTTTGTCTTCTTCCACCGCAGCCTCTCAAGGAGGTGTTACAGTGTCGTTGTTGGTGTTGTAGCGGCATTGCAAATATTTGAGCGCACGTCCTTGAGCCCCACCAGTGCATTCGTCAAACTATTTCAGCCTTCTCGTTCACCAGAGCATTTCTCTTTGCGATTGCATTTTCATCTTTGTTTTGGTTTTAGCCTACATTGGGTACTTTGGTGGTCTTTGATTTCATCTGTTTCAGACTCTTTTAGGGTTACAATCAGGTTTGGTTCAAGGTTCATTCGGTTCAGTTTAGATTTCGAGCCATCTCACTTGGTTTGGTTTGGTTCAGAACCCTGTTCGAGTAAACAGATTGGTCTGATTTGGTTCAAAATCCAGTTCGAGCCATTAGATTGGTTCAACTCATAATTTGCTTTGGTTCAGTTTTAGCGTAGGTTCGATTCGATTCAGGTGCATTTTTCAATACTGTTTACTCCCTCTTTATAATGACAAGGTTGAAAATGATAGTGACTCATAGTTGATTGTGGAGAAAATTCATCTACTCTTGAAATTGCTACCACCCCATTGCCCTTCTAATCATTCATGTGTATTTTGGGAACAAACAAGTATTTATCCTAAGGCTTTCTCCTAGGCTAACCTGCCTTCGTAGATTATAGATTATGGAGCCTCTGATCATATGACAAGTTCTTCTCATCTTCTTAACTCACTCTCCTTTGTATTCTAATGAAAAAAATTAGAAATTGTCGATGATAGTTTTTCTTTTATTGCAGAAAAAAGAACCATTAAGTTGACTGAAAATCTCTCTACAATATGTTCTCCAGGTTCCAAAATTAGTATGCAATCTTTTATCTGTCAACAAACTTTCTAGAGATTCTACTTGTCGTGTTATATTCTTTGAATCTCATTGTATTTTTCAGCATCATGACTTAAAAATATGATTGAAAGTTCTAGGATGCTTGATGACCTCTATTACTTTTGTTGAAATTCTTTTAATAATAAATAACTTAGGGTCTCAGTAGTATTAGTTCTTTTTCTGTTAAAGAACAAATTATGCTTTGGTACCTTAGGCTAGGACATCCCAAATTTTCTTATCTCAAGTATTTATTTTCAAATTTATTTAAAGGAATTGATTGTTCGTCTTTTGATCGTGAAAGTTGCATTTTAACAAAAAATCATCATTCTGCATATTTACCCAAACTTTATAAAGCTACCAAACTAAAATTTGGTTTGTCATTTATGATCATACTAGTCTATCTTGGGTTTATTTAATGGAAAAAAACATCAGGTGAGAGACATTTTAAAACATTTTTACAATATAATTGAAGCTCAAGTTCAAACTAAAATTTGCATTTTACATTATGATAATGGAATTGAGTATTTTAATAAATTTTGATGATTTTTTAAAAGACAAAGGTATTGTTCATCAGTCTACTTCTTTACTTGTCGGGACCTCAACAAAATGAGATTGTTGAAAGAAAAAATAGGCATTTACTTGAAGTTGAGCGTGCTATAATGTTTTCCATGCATGTTCCATTCCAAAATATTCATGGGGGAAGCAATTGTCACAGCTGCTACTTAATAAATAGGATGCCTAGTAAAATCTTGAATTTTAAAACTCTCATTGATTGCTTTTGAGCAAAATAATGATTCAATAATCTATATTATTTTGAGAATAGAATTGGGTTCTTAAGAGACCACCCATGTACAAATATGGAATTTACAACTAAGGAAAATAATAATCTTGCATATCTACAATAAAGACATAAGATATTATTTAGTTAAATAAGGAAATAGTTAAACCAACACTCCCCCTCAAGCTGGTTTGAAGATATCTTCCATTGCCAGCTTGCTTGTCATCTTGTCAAACTGTGATTTTGGGAGTCCTTTTGTGAATACATCAAGTGATTTGTTGGTTGTAGGTAAGTAAGAAATGCATATTACACTGTATCAATTTTCTCCTTTATAAAGTGTTTATCAACCTCAATGTGTTTTGTCCTATCATGAAGAACTGGATTGTGGGCGATTGAAATTGCAGCTTTATTGTCACAATAAATCCGTATGGGTGCCTTTGTGAAGAATTTCAATTCTTCCAAGAGTCTCCTTATCCACATGCCCTCACAAATACCATGTGCTAAGGCTCTAAATTCTGCTTCAGCACTGCTTCTGGCCACTACATTTTGTTTTTTACTTCTCCAGGTGACTAGATTCCCTCCAACAAACGTACAGTACCCAGAAGTAGATCTTCTATCTATGGTACTACCAGCCCAGTCTGCATCAGTGTAAACTTCAACATGTAGATGGTCATGCTTCTTAAATAAAGATACCTTTTTCCTGGTGTACCTTTCAAGTACCTTAGGATTCTATAGACAACCTCAAAATGAGTAGATCCGGGAGCATGCATAAACTGACTTACCATACTAACAGCGAAAGAGATGTCAGGGCGTGTGTGAGACAAGTAAATTAGTCTTCCCACAAGTCTCTGATATTTTTCTCTGTCTTTTACCTCTTTCTCTGTTGCAGTTTGTAACTTCAAGTTTGGTTCAATAGGTGTCTCAGCAGTTCTACAACCAAGTAATCCAGTTTCTTCAAGCAAGTCAATGATGTACTTCCTCTGATTAACAAGGATGCCTTGTTCTGACCTAGCAAATTCCATTCCTAGGAAGTATTTTAGTGTTCCCAAGTCTTTGATTTGGAATTCATTAGCTAGTCTTTTCTTCAGATCAATCAATTCAGTCTCATCATTTCTCTGTAAGGATAATGTCATCAACATATACTATTAGTACAACAACTTTGTCTTTCCCAGTATGTTTGAAAAAAATAGTATGATCAGCTTGGCTTTGAAGAAATCCATAGCTTGTGACAGTTTTTCCAAATCGTTCAAACCACGCTCTAGGAGATTGTTTGAGGTCGTATAAAGCCTTTTTAACTTGCATATTTTGTTACGACCTGATTTTCCTCAAAACCGGTGGTAAACTCATGAATACCTCTTCTTCAAGTTCTCCATTGAGAAAAGCATTCTTGACATCGAGTTGATAAAGTGGCCAATCTGCATTTACTGAACGGATAGCGAGAACTCTAATGGAGTTAATTTTTGCAAGCCCGAGCGAATGTTTCTTGATAATCAATTCCATAGAGTCTGTGTGTGAAACCTTTAGCAACAAGTCCGGCTTTGTACCTTTCAACACTTCCATCGGCCTTGCACTTTATAGTAAACACCCATTTGCACCCTCTTTGTTTTCTTATTCTTTGGTAATTCTACAACATCCCATGTGCAATTCTCGCTTAAGAGCATTCATCTCTCCATAATCGCTAATTTCCACTTCGAGTCATCTAGGGCCTCAGATATATTTCTTGGAACAAACAGATTTGTTATCCTAGAGGTGAAGGCTCTATGGCTATCGACGACCTATGGTAGGAGAGATAGTTTGCAATAGGATATTTAGGGCAATTTCGAGTACCTTTCCTATGGGCTATTGGAATGTCAAGATCAGATATTTCAGGTGAACGATTCGATAAAGGAATGGGATTCGGAGATATGTTACCTGAATCTTCAGGATCATTCATCGGAGTATTAGATTGGTCTTGTGATAAATTCCTTTGTTTGGTCCCGATTCTTTTGACTTGTTGACCTTCTAGTATAAACCTGAAGTTCAGTATTTTTATCAATTGGTAGTGTTTCTCCCCCTGAACACAAAGTTTCCATACTTGGCATTGGAGAACTGGAACTGGAATTCTCAAAGCTAGTAATGCTAGGAAGAGGGGAAGTCTCCCAAAAATTATCTTCTAAGGTAGTTGTCTCCCCCTGAAGAGAAGTTTTGGTAAAGAAAGGTTGATTTTCCCAAAAAGACACATCCAGACATTCGAAGTACTTTTGGTTCAAGGGATCAAAACATTTGTAGGCCTTTTTGTGGGAGACATACCCTAAAAAAATGCATTTCACCGCTCGAGGATCTAATTTAGACCGTGAGAAACTAGGTACATGAACATAGGCAGTACACCCAAATACCTTAATCTGTAGATCAGAAAACAAACGAACATTGGGAAAAAAGGTTTTGAAGTGGTCCAGAGGTGTTTTAAAATCCAAAATTTTTGTAGGCATTCGGTTTATAAGATATGCAGCAGTTAGAATAGATTCTCCCCATAAATAAGCAGGGACATTCATAGAAAACATAAGGGCACGAGCGACATCAAGTAAATGACGATTTTTTCGCTCAAAGAATGCCATTTTGTTTGGGTGTATCTCTACACGTAGATTGATGAAAATTCCTTTAGCTAGCAAAAATCATTTAACTGTTCATTAAAATACTCGATTGCCATTATCGTGAGTGCAAAATACTAATTTTAGTATGAAATTGAGTTTCAATCATGTTATAAAACCGAATAAAAGTATCTTTCACCTCTGATTTTTTTGTTAACAAATAAATCCAAGTTAAACGAGTGTGATCATCAATAAAAGTAACAAACCATCGTTTTTCCACTATGTGTTAAAACTTTAGAGGACCCCACACATCACTATGAATTAAGTAAAAGGGTGCGAGAAGTCTTGTAAGGTTTGGGCAAGTAAGAAGATCTATGATGCTTTGCAAAGATGCAACTTTCACTTTGAAAAACAGAGCAATCAATGCCTTTAAATAATTTTGGGAACAAATATTTTAAGTAGAAAAAGCGGGATGTCCTAGTCTACGATGCCAAAGCATGATAGTTTCTTGAATGAAGGAGAACTAACACTACATAAGCCCCGAACTTTTTATGACTAGTTGGAGACTCATCAAAATAGTAGAGGCCATCGAGCATTTTAGCACGCCCAATCATCTCCCGAATCTCGATCCCGAAATGTGCAATAAGAGTCAAAAAGGTAACACGACAATTAGCATCCTTTGAGATTTTACTAATGCAGATGACAAATTACAAGTTAATTTTGGGACATGAAGGACGAGAATGTAGGGTAATATGTGGAGTTAAACGAATCATTCCTTTACCGACAATAGAATTAAAACTACCATATGCAATTCGGATTTTTTCATTGCAGACATATAGAGAGTATGAGTTAAATAGTTTTGAGGAACCGGTCATGTGATCGAGGTCTCGGAATCTATAATCCACGGAGAGGAGTTAAAACAAGAGAGGGCTTTAGGAGAATTACTGTTTGTGCCAAGGAAACACTAGGATTACGAAGAAGATGAAGTAGTCTTTAGCGAATTTGAAGTTGATCAACTTGCTCTTTGCTAAATGGGTTGGAGTCAACAACATTAGCTTTGGAGGATTGTTGATTGCTACTTTTCTCACTTTGCTTAGAACTCTTCCAATTAGCGGGCTTTAAAGTGCGGTTTCCAACAAGTATCCCGAGTATGTCGGGGTTTATTCTGATGGTCACACCGGACACGAGGTTTTTCATGGTTCTTATTTGATTGATCGGAGGCTTTGTATGTCGTAGCTTCAACCACTAACGCGAACTTTCAATGTAATCGATAGGCTTCTTTCCAATCATCACATTTCGACGACTTTCTTCCTACGCACTTCGGAAAAAACTTCATTAATAGTGGGCAAGGTGGTCTTACGAGTATTCGCCTCGACCTCATCAAATTCAACATTAAGGCTCGCGAGAAATTTATAAATACGACCATCTTCTACGAGTTTCTGTAGTGTTTACGATCATCTGTAGACTTCCATTCATATGAATCAAAGCAGATCGAGGTCTTGCGGATCCTCTTTAGGGAATGAAAATATTGCGTAGTTGAGTTATCTCCTTGTCGTATGTCATCTAACTTGAGATTCACTCAAATACTTGTGGTGATTACCCATATCCGAATACATCTCCGTCACACTATCCCATAGTTCCTGTGCAGTAGCATAACACATATAGTTAGAACTAATGTCTTCTACCATTGAATTAACTAGCCACGTCATCACCATGGAGTTTTCAGCATCCCACACAGCAAACAAAGGATCATCTTTGGCGGGTGCCGTTTTGTCTCCAGTAAGATATCCAATCTTCCCTTGTCCACGAATATACATTCGAACACTTTGAGACCAACGAAGAAAGTTGTCTCCATTAAGCCGAATGGTTGTAATTTGGACAGTAGGACTGTTGGAATGGGTGCGATTATCTGTAGCTTTAACAATGGAAGACTTATTTTCTGACATTTTCAAAAAAAAAGTAGCGACCAGCAATCCAAACAACAAAAACAAGACAGAGCAAGAAAGATTCAAGTGACCCAATTAGGGATTCTTGAATAGTTGACCGAGGATGGTGGTGCCCGGCGGTCAACGATGGAGCGCATGCAGGCGTCTACTTGAATCAGAGGGGTTGCAGGACTGGCCGCGCGCGCGCTGAAGCGCCGACGTGGAAACGGCGCCGAGACGTGGCGCGCGCGCGATGGAGCGGTTCGCTCGAGGAAGGTGTCGACGTGAAACGGCGTCAACTGGTGGTTCTGTCCGATGGAAGGCGTCGACCTGAAATGGCGAGGACTGGCGGATCACGGCTGCGTCAACTGGCGATTCACGGCTGCGACTCGGCGGCGGCGGAAGACTTGCGGTTCTGTCTGCACGTGAATCGCGGTAGTGGATCTGCACGACGTGATCGGCGTCGACTCGCGAACGGAGGTTGACCGGCGGCTCTGTCTGCGTTGATTTAGAGTGAGTGTGGCGCGCGGCGGCGGTAGGCAAATGGCGGCGCGGCGGCGATCAATTAGGGTTAGGGTTTTTTCTTTTTGTTTTTGAAAAAACTAGGGTTGCTCTGATACCATGAGCAAAATAATGATTCAATAATCTATATTATTTTGAGAATAGAATTGGGTTCTTAAGAGACCACCCATGTACAAATATGGAATTTACAACTAAGGAAAATAATAATCTTGCATATCTACAATAAAGACATAAGATATTATTTAGTTAAATAAGGAAATAGTTAAACCAACAGCTTTAAATATATTTTTCCAACATCTATCATATATTTGCACATACCTATCAAAGTTTTTTGGGCGTACTACTTGTGTTCACATACCTAGTTACTTTCGATCAGAACTTGATCATAAGCGTTGTTAAATACATTTTCTTGATGTATCTTCAAGTCAAAAGGGTTATAAATGTTTTGATCTTTAGATTAATTTGGGTTGTTTCTATTCCTCTTCCTAGTTTTATTTATCCCGAAGTTACTAGTCCTTCGATGCCAAGTGTAAGAACTTCCAAATCAAGTGGAGATATACTAGAATCATCCCATTGATCAAGTTCCCTAGTTTAAGTTTTATATGAGAAGAAAATTCAATCAAAGGAACCAAGACCAAAAAGTTGATCCTCTACAGAACCAATCGAAGACTTACGATGAATGTAACTGAAAAATCTGGTAATCCTGCTTCCATTCTTCTTGTTTCAACAATTCAAAATACTTTGTCAGTTGTGTCTGATTCTGATGTTCCCATAGTCATTAGAATAGGTATTCGAAGTTGTACCAAATATCCTATTGCAAAACTACATGTCATATAAAAAACTGTCAAATAGTCATAAAGTTTTCACATCTAGGATAAATGACTTGTTTGTGCCAAGGATTATCCACGAGGCGACGTTAAAGATCTAAATTGGAAACTAGCAGTTATAGAGGAGATGAATGCTCTAAATAAAATGAAACTTGGGAAATAGTAGAGTTGCCAAAAAAATAAGAAATCAGTGGTTTTGAGAGATACAGGGTCAGACTAGTAGTTAAAGGTTTCACTTAGACATATAAGATTGACCAAAAGTGAGACATGAACCTAACAACCTCCCATAACTCCCCTCAGATCTTCGATGCCTCTAAAAAGCCTCTCATTCCTCTCAAGCAGCCCCAAAGACTGGCAAAGAAGCACACCTGTGAAAGAAAAGGGGACCTTTCTCCCTGAAAGGGGGAGACAACAAAACCTCCTCCATCCAAGTCCTACAATCCCTGCTCCAAGCCAACAATACACCAATACAAAAATCTAGACCATATAAACTAAGAATAACGACAATCCCAAAGGATATGGTCAAGATTCTCTAACACACCTCTACAAAGGGAGAATAGAGGATGCTTTGTTCTCTCTCCTCTTTCCTTTCCCCTAACAACCCCCTCACCTAGGCCCCTACCGATCTCCTCCGGCCACCATGATTTTTGCCTACCCAAAGCCCCCCTCCCTTATGGAATTATTCTACCCGCTCTGTTCTTATTGGTCAGAAGACCTTCACAATTGAGTTTGATGATTCATCGAGGGGGAGCAAAGCTAAAATCATCGAACTTGGGAGATCTTCATCTCATGGTTTATCCATCTCCGGGAACTCTCTCATGTGGTTAGCCAATTCCTTTGAACTTCTACACAAAGAACCATGCTCTTACAAATTTTTTAAGAAGTATCAAATCGTTACAGATGCTGGAAAAATTAAACAACAAGTATGGTTTTTATGTGGAACTTACACAACTTAGCCACTTTGGGGTAAGGAAGAAGCTTCTCATCCCTTCATAAGAAAACAAATAAGGGTGTGTTAGGAACCCAAATACTAGGAACACGATAATGGAAATATATTGAAATATGAATGATAAAACATAACAAGATAAACCAAGTATAAGACACTTGTAACCTCCCACTTGAGTATGCTAACCCAAGCCCTAAATCATTCAACAATTGCCTACCTTCCCCTCAAACCCACTTCCTCTATTTATAACCAAAAACACTAACTAACTCCTTAACTAATTACTCTTATGCCCCTACTAATACTGTTAGTTTATCTATTTCCTTATGTAGAGAATCACCATTTATTGTATTATTATTCTGTATTTATTTGTTTCCATATTTATTTGGTATATTCCATACATGTACATGGGTATCTCTTTATAAGAACCCTAATTCTTATCAAATCAATAAGAAATACTAGATACACTATTTCTTCATGGTATCAGAGCAGCAGAAACCCTAGCCGTCACTGTCTTTTTGCTAGCTATCCGGTCAACTTCTTTGGGTGATCTAAATCACAAATTGATCTGCACCCAAACCTTACCGAAACCGTATTCCAGTGAAAAATGAGGAATAAATTTTCATCTTCTTCATGGTGAATAAACCTTACCGATCGTGAAGATCGCGACCGTGAAGATCTTCATCTTCTGACTTGCGACAGTGAAGATCGGCGACCGTGAAGATCGTGAAGACCGGCGACCTGCATCTGAAGATCCGGCGAGCCGCGTATCCTGGTGCATCTGAAGAAATCCGCGAACTCGTGAGACGACCCGGTCCTGCGTTGTAAAGATTCACCGACCCGCCGACCCGCGCCTAAGGACCCGTGACCCGCGCCTAAGGACTCGTGACCCGCGCCTAAGGATTCGCCGACCCGCGCCTAAGGACCCGTGACCCGTGACCCGAGCATTCTGTGAGTTACCACTTCGTTTCTGAATACGTGGTAACCTACTGGTATTTTTTTTCTTCCGCTTTTTATCGGGCTCACTAATCTTTTCGGGACTACTACGAAGTGTTCAGTTTTTTTGTTATTTGGTTGATTGTTATCCCAATATCATTACTGTGTGTCAAACAATGTCAGAAAATAAGGATTTGTCCACCACTAAAACTCCTGACAGTCGTGGTCACAGCCCTATTGTCCAAATTACCACGATTCGACTTAATGGAGATAATTTTTTACGCTGGTCCCAAAGTGTCAGAATGTATATTCGCGGACAAGGGAAGATTGGATATCTCACCGGCGATAAAATCACACCTGTCAAGGATGATCCACTGTTTCCTGTGTGGGATGCTGAAAACTCCATGGTTATGACATGGCTAGTCAACTCTATGGTGGAAGACATCAGTTGTAACTACATGTGTTACTCCACTGCGAAGGAGCTCGGGGTAGTGTGACCGAGATGTACTCGGATCTGGGTAATCAATCACAAGTATTTGAGTTGAATCTCAAGTTATGTGATATACGACAAGGGAGGTAACTCAATTACACAATATTTTCACTCCTTAAAAAGGATTTGGCAGGATCTTGATCTTTTTGATTCATATGAGTGGAAGTCTCTGGATGATCGAAGCATTACAGAGAAACCGGTGGAAGATGGTCGGATTTACAAATTCCTGGCAGGCCTTAATGTGGAATTCGATGAGGTTAGAGGTAGAATACTTGGTAAGACTATCCTTCCAACTATTAGTGAAGTTTTTTCTGAAGTACGCAGGGAAGAAAGTCGTCGGAATGTTATGATTGGAAAGAAATCAACTGAGCCAATTGAGAGTTCAGCTTTGGTAGTTGAAGCTACTGCACACAAGGTTTCTGATAAATCAAAGAATCATGAAAACCTCGTGTCCGTGTGACTATTGTAACAAACCCCGACATACTCGTGAGACGTGTTGGAAAATTCATGGAAAACCTGCAAATTGGAAGAGTTCGAGGCAAGGTGAAAAGAGCTGCAATCAACAGTCTTCCAACATTGGCTCTTCAAGTGCTAATGTTGTTGAGTCCAACCCATTTAACCAAGAGCAAGTGGATCAACTCCTGAAGCTACTAAAGGCCACTTCATCATCCGGTAATCCTAGTGTTTCCTTGGCACAAACAGGTAATTGTCCTCAAGCCCTAACTTGTCTCCATTCTTCTCCGTGGATTATCGATTAGGAGCTTCTGACCATATGACTAGTTCTTCAAAATTATTCGAATCATATTTTCCAATGTATTGCAATGCAAAAATTCGAATTGCCGATGGTAGTTTCAATTCTATTGCAGGAAAAGGAAATGTTAAATTAACCTCACAGATTACTTTACATTCTGTCCTTCATGTCCCTAAATTAGCCTGTAACTTGTTATCTGTTAGTAAAATCTCTAAGGATGCTAACTGTCGGGTTACCTTTTTTGACTCTCACTTTGCGCATTTCGGGATCGGGATTCGGGGAGATGATTGGGCGTGCTAGGATGCTCGATGGCCTCTACTATTTTGATGAATCTCCAACTAGCAATAAAAAAGTTTAGGGCTTGAGTAATGTTAGTTCTCCTCCTGCTCAAGAAACTATCATGCTTTGGCATCGTAGATTAGGGCATCCCAGTTTTTTTTATTTAAAGTATTTGTTTCCAAATTTATTTAAAGGAATTGATTGTTCTATTTTTCATTGCGAAAGTTGTATTTTTGCAAAAAATCATAGATCAACTTATTTGCCAAAACCTTACAAGGCGTCTGCACCTTTTTACTTAATCCATAGTGATGTCTGGGGTCCCTCTAAAGTTTTAACTCATAGTGGAAAGCGATGGTTTGTTACCTTTATTGATGATCACACCCGTTTAACTTGGATTTATTTGTTAACGAAAAAGTCAGAAGTGAAGGAGACCTTTGTTCGTTTTTACAACATGGTTGAAACTCAATTTCAGACTAAAATTAGCATCCTTCACTCTGATAATGGTGCGAGTACTTCAATGAACACCTAACTGATTTTTTACAAGTTAAGGGAATTTTTCATCAATCCACGTGTCGTGATACCCCCCAACAAAATGATATTGCAGAGCGCAAAAATCGTCACTTACTTGATGTTGCTCGCGCCTAATGTTCTCTGATGAGTTCCTTCTTATTTGTGGGGTGAAGTCGTTTCTAACACCGCATATCTTATAAATCGAATGCCTACGAAAGTCTTGGATTTTAAAACCCCTCTAGATCATTTCAAAACCTTCTTTCCCAATGTTCGCTTGTTTTCTGATTTACCTATCAAAGTGTTTGGGTGTGTTCGCTTATGTTCACATACCTAGCTTCTCTCGGTCTAAATTAGATCCTCGAGCTGTTAAATGCATTTTTTAGGGTATGTCTCTCAAAAAAAGGCTTATAAATGTTTTGATCCTGTGACCAAAAAATACTTTGAAACTCTTGATGTCTCTTTTTGGGAAAATCAATCTTTTTTTAAAAATTCTCTTCAGGGGGAGACACTACCTAATTTAAAAGATAATTTTTGGGATACCTCTCCCCTTCCTAACATCATTAGCCCGTAGAATTACCAATTCTAGTGCTTCACGCCAAATATGGAAAATTCTTGTCAGGGGAGAAACACTACAAAATAATCAAATTGATCAAAATCATGAACTTGTGTTTATACTAGAAGGATGGTGAATCGGAAGAATAGGGACGGACAATGGACTCATCACGTAACTTGATCGACACTCCGGTGAATGATCCTCGAGTTCTGTAATACACCCTCTAATCCCATTCCTTCATTAACTCACAATCTCTTACCTGAAATATCTGATCTTGATATTCCAATAGCTCATAGGAAAGGTACGCGAAACTGCCCTAACCCAAAATACTAGGAACACGATAATGGAAATATATTGAAATATGAATGATAAAACGTAACAAGATAAACCAAGTATAAGGCACTTGTAACCTCCCACTTGAGTATGCTAACCCAAGCCCTAAATCACTCAACAATTGCCTACCTTCCCCTCAAACCCACTTCCTCTATTTATAACCAAAAGCACTAACTAACTCCTTGACTAATTACTCTTATGCCCCTACTAATACCATACTACTACTATCCCTAAATCCCTAATAGTACCCTAACAGAGTGGTTCTCCTTTTATTCTCTTATCTTGGATTACCTTGGAGAAGCACAAAAGAAGTTTAATTCCCAAACTTACTGCGATGTTGTCAAAAACCCACAACAACTGGAGAGTCCCAAGCATGTTAGTGTTTCCCTCCTGACCAAAGTTCCTCTCCGTTTATTGCCTCTTCCGCTATTTCCTATGGGATCCTCATATGCCAACGTTTTTCCCAGACGGATTCTTGGCCAATTATTAGGTCAACACTTCAAGAAAGGTTCTCTCCAAGATGCACCATTAATCCTTTCAAGGACAATAAAGTTGTGATTAATACCTATGATCATCAACTCCAGAAGTTTAGTGATGCCTTTGTTTGGCATTTGGTTGGAGGAATCAAATTGAAATTCCATGAGTTCTCCAATTCTTCTTTCTTGAAGGATAGGATGATTGCCTCATATGGTGGTTGGATAGAAATTATGGATCATCCCTTATGTCTTTGGACTATCGAAATTTTCAAGTACATAGGAGATCAATGTGGAGGCTTTCTTCAAACCTCAAATTACACGGATCGGTGGTTGTTTCTTTTTGTTGCCCGTGTGAAAATCAAATCTAACACCATTGGCTTCATACCAGCGAGGATAGAATTGCTGGAATCCATCGTCGAAGTTCCCATCTCTGTTCAAATTCGATCGGTAACGGTAAGGAATCAATCACACCCATTTAATAAGGCAGTTTCGGAAGCACTAATTATGATGGATCCAAAACATAAAGTCTCGAGTGATTCAAGGAATTTGGAAGGAATGGAATCCCAAAATTCAGGAGATTTTATTCAATCTTCACAACCAAGTAAGGAGGGGAATATTGAAGCTTTAAATGCGGCAGATTTTATTCAATCTTCACAACCAAGGAGGGGAATAATGAAGCTTTAAATGCAGCAGATTTTATTCAATCTTCACAACCAAGTAAGGAGGGGAATAATGAAGCTTTAAATGCGGCAAATTTTATTCAATCTTCACAACCAAGTAAGGAGGGGGAATAATGAAGCTTTTAAATGCGACATATTTAAACCTAAGGGAGTTGGGTATTTTGGTGATTAACAAAGGTAAAGGGATCTTTAATTCGAGTGAATTAATTCTTGATTTTCCTCCTAATCCCACCACTTTAGCGATTTCCACATTAAACCTAGGGTTACCTAAGCCGCCAATCCAACCCGTATCTGATCCTCCTAAATTTTTTCGATTTTGGTGGGCCATCAAATCCTCTCTCTCCAACACTTATTCAACCAACCCTAACTTCCATATCTTTTACTGAGGGCCCCACAACACATCAACCGCCCATTCTCCAAATCGGGAAAATTCACTCTACATCAGCCGCAAATATGAATTCTTCTGTGCCCGTTAATGCATCAGATACAAAAGCTTATTTGTCTAGCCCAAACTAATACAAGTCGAGCCATTTATCTGAATTTTAGGCGGCACCTTTTGATTTGGCAATTTTTGGCGATAACCCCTCCCCTGAAACTACCCTCCCCTTAGATCTTTTACCGCTTGTTCCAGAAGATATTAATCCTCAACAGATTGACATTAACCCACAGACTATACGGACGAAACCGACTCAATTTTCATCTCCCATTAAACCGATTTCCCTAGGTGCACACCAACCTTCGGGCTCTTATCCTTTTGAATTAAGGGGCATTGCACAATTCCTTACCCAACATGGTTTATGTATTATGCCAATCCCGTCGCTACCCCCTTCACTTGTGAAGCCTAAAAAGGTATATACAACAAATGGAAAAAGGAATAAATTACAAAGGGAGTTGCAAAACTTAAAATCCACCGTCCATTATGACAAATCTGCCACTTAGGCATTATTGGAGGGACCTTCTGTTGTTAGATGAAGTTCTTGTCTTGGAATGTTCGGGGTCTTGGCTCTTGGAAAAAACAATCACTGATCAAGCAGTCTATCTTAAAGCATAGCCCAGGGATTGTTCTTTTGCAAGAGACCAAGTTAGCTTCTGTTAGTCACCATCTGATAAAGAATATTTGAAGTTCATCACATATTGGTTGGACTTCTCTCGATGCTGTTAATTCTGCGGGAGAAATTTTAATTCTTTGGAGTGAGCCAGATTTCTCGATTAAGGAAGTGATGCAGGGTTCATACACTCTTTCTATTAATGTTTTTTTTACTGATGGGTTTTCTTTTTGGCTTACATCAGTGTATGGTCCTTTTGGAAACGGGGTTTATGATGACTACTGGAGAGAATTAGATGATTTGGCTGGCTTGGGTGGCAATTGTTGGATTATTGGAGGGGACTTCAATGTCACTCGTCGATCCTGGGAGAAGTCGCACGATCCCTCTATTAGCGGAGTATGAACATGTTCAATCAATGGATTTCCAACTATAATTTGCTTGATGTTCCTATGCAAAATGGTAATTTTACTTGGTCGAGTTTTGGTCCTACACAATACTTATCTCTCTTGGATAGGTTTCTTATTACTGATGGTTGCACAGATAAATTTGGCTCTGCTTCTTTTAGAAGATTGGATAGGGTTACTTCAGATCATTATCCACTTGCCCTTTCTTTTGGTAATATTAATTGGGGTCCTTTCTCCGTTTTGTTTTGAGAATTCGTGGTTGCAAATTAAATCGATTCACGATGATGTATCTAATTGGTGGAACCAGAACCCTTTGGTTGGTTGGCCAGGACACGGTTTAATGATGAAATTAAAAGGTCTTAAAGTCTTCTTGAAACAATGGAGTAAGTCTCATCGTTTTGAGTTGACACATTTACAATCTCTTGCTTCTCAACTGAAGGTTTTGGATAATCTGGAGGATAGTACACCTTTAACAACCGAACAAGTTGAGACTCGCCGCCTTCTTCGTGAACAGATTGAGATTTTGACAGCTCAAGATCATTTATATTGGAGGCAAAGGTGCAAATTGAAATGGCTTAAGGAGGGAGATGAGAACACAAAATTCTTTCATAGAATTTTGGTTGCAAGGAAACGAAAGAATTCGATTACTGAAGTTTTTTTTCAGGAATGGGGTCAATCTTGTAACAACAAATGATATTGAGCACGAGTTTATTGATTTTTATTCGAATTTGTTTCGGAAAGATGTCAACCTTCGTTTTCTTCCAACAAATATTGATTGGAGTCCTATATCTACAGATCAAGCCATTCGTTTAGAACGTGTGTTCACAAAGGAGGAAGTTCGTCAAGCGGTTTTTTCCTTGGGCTCTTGTAAATCACTAGGTCCCGATGGTTTTACATCTGAATTTTTCAAGGTTTTTTGGGATGTCATCAAAACCGATTTACTCATCATGATTCATCACTTTTACTCTTCTGGGATTATCAACGCATCTATGAATGAGACATATATCTGCTTAATCCCCCAAAAGCTTGCATCGAAGTCGGTAAATGATTTTCTACCGATTAACCTTATCTCTTGTCTTTATAAAATTGTTGCACGAGTTCTATCTAATCGGTTGAAGTTGGTTCTTGCTTCCACTTTTGCTGAAAATCAGCTTGCTTTTGTAGCACACAGACATATCATCGATGCCTCATTAATGGCCAATGAATTGATTGGAGATTGGAGCCTTCGTAGCCTTTAGGGTGTGGTTGTGAAACTGGATTTAGAAAAAGCTTTTGATACAGTGGATTGACAATTTTTAGATGTGGTTTTATGTGCAAAGGGCTTTGGGGTTTTATGGAGGAAATGGATTTGAGGTTGTCTTTCTAGTGCTAATTACTCTACTATTATTAATGGGTGACCACGTGGGAAAATCTTTCCTACTCATGGAATTCGACAAGGTGATCCTTTATCCCCTTTCCTTTTTATATCAGTGATTGCTTAAGTCGACTAATGGCTCATAGCGCTAGTTTGGGTCATATATCAACTCATCCTTTGGGTACATCATCGTTTTATTTAAACCATTTACAATTCGCAGACGACACCTTGCTCTTTTCTACTTCCAATAGAATTGCATTGAACAATCTTTTTCAAATGGTTAAGATCTTTGTGCAAGCTTCTGGGTTGAAAGTTAATTTGGTAAAGAGTGAGATTTTGGGAATTAACGTTGATGATTCTGAGATGGAGTGGATCTTGTCGACTTTTGGTTGCAAGCAGAGATTTTGGCCAAGCACTTACCTTGAATTGCCTTTGGAAGGGAATTCAAAAACTCTTTCCTTTTGGCAACCAGTAATTGAACGTTTGCAGCAGAAGCTTCATAATTGGAAATATGCTCTCATCTCAAAAGGGGGGAGGCATACTCTAATTCAAGCCACTTTGTCCAGTATGCCTACCTACTATATGTCCTTATTTAAATTGCCTTCAAAGATTATTAAGGTCTTGAATAAGATTGTTCGTATTTTTTTTGGGGAAGGTTCAAGGTTTGATGGCGGAGTTCATAATGTTAATTGGGCAACAACACAACTTTCTAATCTAATGGGCGGTCTTGGCATTGGCAATTTTGGTCATCAGAATTTAGCCCTATTGGCTAAATGGATTTGGCGGTTTACACAAGAACCAAAATCTCTCTGGCGAAAGCTTGTTGTTGCTAAATACTATGGGGGGATGCAAACCGATGGTTAGCCTTTACCAATTAGGCATACCTCTCATAAATCGCCTTGGAAACATATTTGCACTGTTCGAGATCTGATTAAATCGCGATCCATTAGGCGTATTGGAAATGGATTTGCTATTGCTTTTTGGACTGACTCTTGGCTCAGTTGTGGCTCTCTTGATGTTGCTTTCCCACACCTCTCTCGGCTTACTATGCACCTTGATTCCTCGGTGGCTGGTTTGTGGAATTTCACAGATTCTGCTTGGGATCTAAAGCTTCGCAGGAACTCAACAGATTTGGAAACCAGTGATTGGGCATGCCTTTCTCAATTGCTTTCTCATATTAGGCTATCAGAGGGTGTTGATTCCTGGACTTGGCCTTTGGAGCCTTCTGGATGCTTTTCTATCAAGTCTCTTACTACCTGTTTGTTGAGCTCTGGAAATTTGATATTAAGTGATCTTTACTCTATTATTTGGAAGAATCAGTATCCCAAGAAGATAAAGATTTTCCTTTGGGAGCTTAGCTTGGAAGCTATTAATACTGCTGATAGGTTGCAAAGATGAATGCCTTATATGAGTCTTTCTCCTTCTTGGTGTATCATGTGTGGATTAGATTCACAGACAACTTGTCATCTGTTTGTGCATTGTTCTTTTGCTCAGAGGTTTTGGTGGTGTCTTTTGGATGCTTTTGGTTGGTCTACGACTCTACCAAAAAATATCTTCGATTTCTTTTCCTCCGTCTTGGTGGGCCATCCTTTCAAAGGAACAAAGAAAACTCTATGGTTGGCCTTTGTTCGTGTTTTTCTATGGAATCTTTGGATGAAGAGAAATGATCGCCTTTTCAGGGATGCTTCTTATACTTTTGAACGTTTGATGGACCATGTTCTTTCTATTGTTTATTTGTGGTGCAAATCTTTACCATCTTGTGGCCTTTATAGTTTATCGTTTTTTATTTCGAATTGGTGTTCTTTCTTGTAATTCACCTATGGGTGCTCGAAGTTCACTCCACTTCATTTATCAATGAAATTGTTTCTTATACAAAAAAAAAAAAAAAAAAAACTTAGACATACAAGATTAATTATCAGGAAACATTTGCTCTTGCTGCTAAAATTAACTCTATTAGAGTTCTTTTGTGTCTTGTCGCTAAATTAAGATTGACCGCTTCACCAACTTGATGGGAAAAATGTCTTCCTCAATGGCAATCTTGAGGAAGAGGTTTTTATGAGTTTACTGTCAGGTTTTAGAAATAATCTCTCAACTCTAAATGTCTCTCGTGCTAGTGGAAGACTCGTCAACCATACAAGGAAGACAAACATGGTGAGAAAATAGATCCTAAAACAAGTCACCCCTCCCAAGGAGGACAAACTAGTTCTAGTTTCTATTGAACAATTCACTAGGATAGGTAAAAGTGAGTGAGATGATATTATGGTTTCTCAGGTTGATGATTTGAACTTACCAATAGCAGCAAGGAAAGGAGTTCCTATTGGGATCTTTGTGCCATATGATGGCATGAGTGGAGACATATGGATCAAGAAATAGTGTCAAGACAGAATTTAGAGCTTCTGCTCATGGCATGGGTGAAGGATTGTTCGTCACCAATACAGATGATACGTGACAATCAAGTTATAATATCACTAAAAAATCTTGTTCATTATGATAGGACAAAACATGTTGAAGTTGATCGGCACTTCATTTTAAAAAAGATTGATAATAAAGTGGTTCGACTCAACTATGTCCCTACCAAGCACAGATAGCATATAGTCTTGCCAAGCCAAAACCTAACTTGATTTGAATAAAAAATTGGGTTTATATGACATATACCACCTATCTTAAGGAAGAGTGTTAAAAATGAATGTCAATGTTAGGGCATTCTTGTAAATAGTTCTTTAGAGAAGATTTATTATATTCTTTTATTATTATTACATATCTTTGTTCCTAAAAAGTTTAGGGGTTGATGAGAGTAATTTACTTTGTTTAAACTTCTTTTGTACTTAAGTTGTTGTAATCCTTATTTTGAATGGATGAATTTGAAATATCAGAAAATTTCATGATATGGACATCAACATTGCTATCTTGATCCTTACCTGCACTACACTCTTTCTTTCTAGATTGTTTCCAAGACATGGCATTGCTATAATAATTGCTTGCTTTTATCCTTTGTTAGAAAACATCCACTCCAGAAGAAACTCGAGTGGCATTAGAACTGTGGCTGCCAAAGGAAGAATGGGTTCCAATTAATCCTCTTCTGGTATGTAATGTTTACAAATAGTTGCAGCCTTAGGTACTATATACAATGAATTTGATAGCATTATGTCCTTATAAACGTACTAGGTAAAACACGGTACCATTTATTTTCCTCAAACATTTTGATATTCTTAGACTGTTAAACGTGTCAAAGTTGTGCATTTGCATATCCCTATCTGTATTCACATCTTGTACGTGTAGAAAACACTCTGTACTGCATCTACAATTTGGGCATCTAGGTAACTTCTAATTGCGATGTGTAACAATTTATGCTGCTTTTAACCCCAAAGTACCCTCTCTTAAGAAAATGATATCCAAGTGCAAACAAAATTGCAGATATATTGTTAATTTTATGCATTACTAACATGGGTAAAATAAGGGTCTATACTTGCTCTGACCATCCAGCATATAAACTATTGACGTAGGAGTGTAGGCGCGCAGGTCAAGTTCAGATTCATTTTTGACAATTGAATCAACAAGAACTGAAAAAAATCTATTTCAATCAAACTGTTCGATTATTTCTTGATTTTATACTATAAAGGACTGAATTGACTGAACCGGTTCGATTGGTTTTTGCTAGTCTTTTCATTTCTGGCTTGCACCCCTACATTTTTCAAGCAATTAGTGAGTTGGTGACTTAAGCTCTTTTGCTTCCTGTGATCTGAAGTTTGTTGGGTTCTTGTTGCATCTCCATGTATTCCTTTTTATCATTGTTGCCATGTGAAATAGGTCGGATTTGGACAGACTATTTGCACTCCTCTTAGACCCAAATGTGGAAATTGCAGTGTTAGTGACTTGTGCCCATCTGCATTCAAGGAATCATCAAGCCCATCTCCCAAATTAAAGCGTTCAAGTTCCACCAAAAACCTATGA

mRNA sequence

ATGCTTTTTACTTGTCCTATTAGAACTCCGGCGCTTTCGATTGCATTTGCGCGAAGAATTACATGCAACGGCATGTCGAAAGGATGTTTATCTTCCTTCTCAACTAGCTCAAACAAAGTCCCTCCAGAATCAGGTGTCAAGTCTTCCAATGGCGTTTCTGAGTCTGAAACTCGTGTATTCGTGAGGAGAAGAGTGAAAAAGAAGGCAGAAGTTCAAGATAGCGGGCTTGAAGTTGACCCTAATGTCGACGCTAAACGCTGCTGTCCTCCTAATATTGAAGATTTTGCATTCAAAAGAACAAAGGATTCCCCTGGATCAAGGAAGTCAAAGCCTCCACTAGATCCTCTTCTCACAGGAATTGAAGATTCCTATCCAACTAGACAAAAAGGCATTGCAGAAGGAGGTAAACCACCCATGAATTGGGAAAAAGTCCTTGAAGGAATTCGTGAAATGAGATCCTCTGAAGAAGCTCCAGTAGATACCATGGGATGTGGGCAAGCTGGGAGTACTCTTCCTCCCAAGGAAAGAAGATTTGCTGTCTTGGCATCTTCTCTTCTCTCAAGCCAAACCAAAGACCACGTGACTCATGGAGCGGTATTGCGTCTCCAGGAAAGTGGTCTTCTTACTGCTGATGCCATGGACAAAGCTGATGAAGCAACCATTAAAAGCTTGATTTACCCGGTTGGATTTTATTCTACAAAGGCTAGGAATTTGAAGAAGATTGCAAAAATATGTCTTATGAAGTATGGTGGGGACATACCTAGATCATTGGAAGAGCTACTTCTACTACCTGGGATAGGTCCTAAGATTGCACATTTGGTCGGATTTGGACAGACTATTTGCACTCCTCTTAGACCCAAATGTGGAAATTGCAGTGTTAGTGACTTGTGCCCATCTGCATTCAAGGAATCATCAAGCCCATCTCCCAAATTAAAGCGTTCAAGTTCCACCAAAAACCTATGA

Coding sequence (CDS)

ATGCTTTTTACTTGTCCTATTAGAACTCCGGCGCTTTCGATTGCATTTGCGCGAAGAATTACATGCAACGGCATGTCGAAAGGATGTTTATCTTCCTTCTCAACTAGCTCAAACAAAGTCCCTCCAGAATCAGGTGTCAAGTCTTCCAATGGCGTTTCTGAGTCTGAAACTCGTGTATTCGTGAGGAGAAGAGTGAAAAAGAAGGCAGAAGTTCAAGATAGCGGGCTTGAAGTTGACCCTAATGTCGACGCTAAACGCTGCTGTCCTCCTAATATTGAAGATTTTGCATTCAAAAGAACAAAGGATTCCCCTGGATCAAGGAAGTCAAAGCCTCCACTAGATCCTCTTCTCACAGGAATTGAAGATTCCTATCCAACTAGACAAAAAGGCATTGCAGAAGGAGGTAAACCACCCATGAATTGGGAAAAAGTCCTTGAAGGAATTCGTGAAATGAGATCCTCTGAAGAAGCTCCAGTAGATACCATGGGATGTGGGCAAGCTGGGAGTACTCTTCCTCCCAAGGAAAGAAGATTTGCTGTCTTGGCATCTTCTCTTCTCTCAAGCCAAACCAAAGACCACGTGACTCATGGAGCGGTATTGCGTCTCCAGGAAAGTGGTCTTCTTACTGCTGATGCCATGGACAAAGCTGATGAAGCAACCATTAAAAGCTTGATTTACCCGGTTGGATTTTATTCTACAAAGGCTAGGAATTTGAAGAAGATTGCAAAAATATGTCTTATGAAGTATGGTGGGGACATACCTAGATCATTGGAAGAGCTACTTCTACTACCTGGGATAGGTCCTAAGATTGCACATTTGGTCGGATTTGGACAGACTATTTGCACTCCTCTTAGACCCAAATGTGGAAATTGCAGTGTTAGTGACTTGTGCCCATCTGCATTCAAGGAATCATCAAGCCCATCTCCCAAATTAAAGCGTTCAAGTTCCACCAAAAACCTATGA

Protein sequence

MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPESGVKSSNGVSESETRVFVRRRVKKKAEVQDSGLEVDPNVDAKRCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKNL
Homology
BLAST of Tan0007314 vs. ExPASy Swiss-Prot
Match: B9DFZ0 (Endonuclease III homolog 2, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=NTH2 PE=2 SV=1)

HSP 1 Score: 270.4 bits (690), Expect = 2.6e-71
Identity = 168/384 (43.75%), Postives = 215/384 (55.99%), Query Frame = 0

Query: 4   TCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPESGVKSSNGVSESETRVFVRR 63
           T PI    L+    R      +S     S S  S  +  +S  ++++G SESETRV +R+
Sbjct: 9   TFPIVARVLNAMNRRMYAATTLSSA--KSISAESLNLRSDSNSEAAHGASESETRVSLRK 68

Query: 64  RVKKKAEVQD----SGLEVDPNVDAKRCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLLTG 123
           +  K+ +++     S  E     D   C  P+IED  +K+T  +  SR  K  L+  +  
Sbjct: 69  KRIKQDDLEPVKKCSARETKARKD--MCGLPDIEDSPYKKTNGTASSRTRK--LNSYIKS 128

Query: 124 IEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFA 183
            E S        A  G PP NWEKVLEGIR+M+ SEEAPV+ + C + GS LPPKERRF 
Sbjct: 129 TEASPSASSIKTAGLGIPPENWEKVLEGIRKMKPSEEAPVNAVECDRTGSFLPPKERRFY 188

Query: 184 VLASSLLSSQTKDHVTHGAVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLK 243
           VL  +LLSSQTK+H+T  AV RL ++GLLT +A+DKADE+TIK LIYPVGFY+ KA N+K
Sbjct: 189 VLIGTLLSSQTKEHITGAAVERLHQNGLLTPEAIDKADESTIKELIYPVGFYTRKATNVK 248

Query: 244 KIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH--------------------------- 303
           K+AKICLM+Y GDIPR+LEELL LPG+GPKIAH                           
Sbjct: 249 KVAKICLMEYDGDIPRTLEELLSLPGVGPKIAHLVLHVAWNDVQGICVDTHVHRICNRLG 308

Query: 304 ------------------------------------LVGFGQTICTPLRPKCGNCSVSDL 321
                                               LVGFGQTICTPLRP CG CS++++
Sbjct: 309 WVSKPGTKQKTSSPEETRVALQQWLPKGEWVAINFLLVGFGQTICTPLRPHCGTCSITEI 368

BLAST of Tan0007314 vs. ExPASy Swiss-Prot
Match: Q9SIC4 (Endonuclease III homolog 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=NTH1 PE=2 SV=2)

HSP 1 Score: 261.9 bits (668), Expect = 9.3e-69
Identity = 167/373 (44.77%), Postives = 211/373 (56.57%), Query Frame = 0

Query: 16  FARRITCNGMSKGCLSSFSTSSNKVP---PESGVKSSNGVSESETRVFVRRRVKKKAEVQ 75
           F R  T +    G +SS    S K      +S  + + G S SETRV+ R++  K+   +
Sbjct: 20  FYRIGTMSRQIHGAVSSSKHISLKTQHPLSDSNSELAYGASGSETRVYTRKKRLKQEPFE 79

Query: 76  D----SGLEVDPNVDAKRCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLLTGIEDSYPTRQ 135
                SG  V+ +   K C  P+IEDFA+K+T  SP S +S           E S     
Sbjct: 80  PLEKYSGKGVNTH---KLCGLPDIEDFAYKKTIGSPSSSRS----------TETSITV-- 139

Query: 136 KGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSS 195
             +   G PP NW +VLEGIR+MRSSE+APVD+MGC +AGS LPP ERRFAVL  +LLSS
Sbjct: 140 TSVKTAGYPPENWVEVLEGIRQMRSSEDAPVDSMGCDKAGSFLPPTERRFAVLLGALLSS 199

Query: 196 QTKDHVTHGAVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMK 255
           QTKD V + A+ RL ++GLLT +A+DKADE+TIK LIYPVGFY+ KA  +KKIA+ICL+K
Sbjct: 200 QTKDQVNNAAIHRLHQNGLLTPEAVDKADESTIKELIYPVGFYTRKATYMKKIARICLVK 259

Query: 256 YGGDIPRSLEELLLLPGIGPKIAH------------------------------------ 315
           Y GDIP SL++LL LPGIGPK+AH                                    
Sbjct: 260 YDGDIPSSLDDLLSLPGIGPKMAHLILHIAWNDVQGICVDTHVHRICNRLGWVSRPGTKQ 319

Query: 316 ---------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESS 319
                                      LVGFGQ ICTP+RP+C  CSVS LCP+AFKE+S
Sbjct: 320 KTTSPEETRVALQQWLPKEEWVAINPLLVGFGQMICTPIRPRCEACSVSKLCPAAFKETS 377

BLAST of Tan0007314 vs. ExPASy Swiss-Prot
Match: A7M7B9 (Endonuclease III-like protein 1 OS=Gallus gallus OX=9031 GN=NTHL1 PE=2 SV=1)

HSP 1 Score: 147.9 bits (372), Expect = 2.0e-34
Identity = 89/238 (37.39%), Postives = 120/238 (50.42%), Query Frame = 0

Query: 123 SYPTRQKGIAEGGK-PPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGST-LPPKERRFAV 182
           +Y    K  + G K  P NW++ LE IREMR   +APVD MG  +   T  PP+  R+ V
Sbjct: 42  AYEAEPKPESPGPKWEPENWQQQLERIREMRRHRDAPVDEMGVDKCYDTSAPPQVMRYQV 101

Query: 183 LASSLLSSQTKDHVTHGAVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKK 242
           L S +LSSQTKD VT  A+LRL++ G LT D++ + D+AT+  +IYPVGF+  K + +K+
Sbjct: 102 LLSLMLSSQTKDQVTSAAMLRLRQRG-LTVDSILQMDDATLGQIIYPVGFWRNKVKYIKQ 161

Query: 243 IAKICLMKYGGDIPRSLEELLLLPGIGPKIAH---------------------------- 301
              I   KYGGDIP ++EEL+ LPG+GPK+AH                            
Sbjct: 162 TTAILKQKYGGDIPGTVEELVKLPGVGPKMAHLAMNIAWNSVSGIAVDTHVHRITNRLKW 221

BLAST of Tan0007314 vs. ExPASy Swiss-Prot
Match: P78549 (Endonuclease III-like protein 1 OS=Homo sapiens OX=9606 GN=NTHL1 PE=1 SV=2)

HSP 1 Score: 139.8 bits (351), Expect = 5.3e-32
Identity = 86/234 (36.75%), Postives = 117/234 (50.00%), Query Frame = 0

Query: 133 EGGKP-------PMNWEKVLEGIREMRSSEEAPVDTMGCGQA-GSTLPPKERRFAVLASS 192
           EG +P       P +W++ L  IR MR+ ++APVD +G      S+ PPK RR+ VL S 
Sbjct: 77  EGAEPLKVPVWEPQDWQQQLVNIRAMRNKKDAPVDHLGTEHCYDSSAPPKVRRYQVLLSL 136

Query: 193 LLSSQTKDHVTHGAVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKI 252
           +LSSQTKD VT GA+ RL+  G LT D++ + D+AT+  LIYPVGF+ +K + +K+ + I
Sbjct: 137 MLSSQTKDQVTAGAMQRLRARG-LTVDSILQTDDATLGKLIYPVGFWRSKVKYIKQTSAI 196

Query: 253 CLMKYGGDIPRSLEELLLLPGIGPKIAH-------------------------------- 301
               YGGDIP S+ EL+ LPG+GPK+AH                                
Sbjct: 197 LQQHYGGDIPASVAELVALPGVGPKMAHLAMAVAWGTVSGIAVDTHVHRIANRLRWTKKA 256

BLAST of Tan0007314 vs. ExPASy Swiss-Prot
Match: O35980 (Endonuclease III-like protein 1 OS=Mus musculus OX=10090 GN=Nthl1 PE=2 SV=2)

HSP 1 Score: 134.0 bits (336), Expect = 2.9e-30
Identity = 81/222 (36.49%), Postives = 110/222 (49.55%), Query Frame = 0

Query: 138 PMNWEKVLEGIREMRSSEEAPVDTMGCGQA-GSTLPPKERRFAVLASSLLSSQTKDHVTH 197
           P NW++ L  IR MRS ++APVD +G      ++  PK RR+ VL S +LSSQTKD VT 
Sbjct: 77  PQNWQQQLANIRIMRSKKDAPVDQLGAEHCYDASASPKVRRYQVLLSLMLSSQTKDQVTA 136

Query: 198 GAVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRS 257
           GA+ RL+  G LT +++ + D+ T+  LIYPVGF+  K + +K+   I   +Y GDIP S
Sbjct: 137 GAMQRLRARG-LTVESILQTDDDTLGRLIYPVGFWRNKVKYIKQTTAILQQRYEGDIPAS 196

Query: 258 LEELLLLPGIGPKIAH-------------------------------------------- 301
           + EL+ LPG+GPK+AH                                            
Sbjct: 197 VAELVALPGVGPKMAHLAMAVAWGTISGIAVDTHVHRIANRLRWTKKMTKTPEETRKNLE 256

BLAST of Tan0007314 vs. NCBI nr
Match: XP_038878189.1 (endonuclease III homolog 2, chloroplastic-like isoform X4 [Benincasa hispida])

HSP 1 Score: 549.7 bits (1415), Expect = 1.7e-152
Identity = 293/383 (76.50%), Postives = 301/383 (78.59%), Query Frame = 0

Query: 1   MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPESGVKSSNGVSESETRVF 60
           M+F CPIRTPA SI FARRITC+GMSKG LSS  TSSN+VPP  GVKSSNGVSESETRVF
Sbjct: 1   MIFACPIRTPAFSITFARRITCSGMSKGSLSSLPTSSNEVPPNPGVKSSNGVSESETRVF 60

Query: 61  VRRRVKKKAEVQDSGLEVDPNVDAKRCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLLTGI 120
           VRRRVKK AE Q SGLEV+P VDAKRCCPPNIEDFAFKRTKDSPGSRKSK PLDPLL GI
Sbjct: 61  VRRRVKKNAEGQVSGLEVEPKVDAKRCCPPNIEDFAFKRTKDSPGSRKSKLPLDPLLNGI 120

Query: 121 EDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAV 180
           E S PTRQKGIAEGGKPP+NWEKVLEGIR+MRSSEEAPVDTMGCGQAGSTLPPKERRFAV
Sbjct: 121 EVSNPTRQKGIAEGGKPPVNWEKVLEGIRKMRSSEEAPVDTMGCGQAGSTLPPKERRFAV 180

Query: 181 LASSLLSSQTKDHVTHGAVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKK 240
           LASSLLSSQTKDHVTHGA LRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKA+NLKK
Sbjct: 181 LASSLLSSQTKDHVTHGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKK 240

Query: 241 IAKICLMKYGGDIPRSLEELLLLPGIGPKIAH---------------------------- 300
           IAKICLMKYGGDIPRSLEELL+LPGIGPKIAH                            
Sbjct: 241 IAKICLMKYGGDIPRSLEELLILPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGW 300

Query: 301 -----------------------------------LVGFGQTICTPLRPKCGNCSVSDLC 321
                                              LVGFGQTICTPLRPKCGNCSVSDLC
Sbjct: 301 VSGKGSKQKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLC 360

BLAST of Tan0007314 vs. NCBI nr
Match: XP_038878188.1 (endonuclease III homolog 2, chloroplastic-like isoform X3 [Benincasa hispida])

HSP 1 Score: 546.2 bits (1406), Expect = 1.9e-151
Identity = 294/386 (76.17%), Postives = 302/386 (78.24%), Query Frame = 0

Query: 1   MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPE---SGVKSSNGVSESET 60
           M+F CPIRTPA SI FARRITC+GMSKG LSS  TSSN+VPP    SGVKSSNGVSESET
Sbjct: 1   MIFACPIRTPAFSITFARRITCSGMSKGSLSSLPTSSNEVPPNPGISGVKSSNGVSESET 60

Query: 61  RVFVRRRVKKKAEVQDSGLEVDPNVDAKRCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLL 120
           RVFVRRRVKK AE Q SGLEV+P VDAKRCCPPNIEDFAFKRTKDSPGSRKSK PLDPLL
Sbjct: 61  RVFVRRRVKKNAEGQVSGLEVEPKVDAKRCCPPNIEDFAFKRTKDSPGSRKSKLPLDPLL 120

Query: 121 TGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERR 180
            GIE S PTRQKGIAEGGKPP+NWEKVLEGIR+MRSSEEAPVDTMGCGQAGSTLPPKERR
Sbjct: 121 NGIEVSNPTRQKGIAEGGKPPVNWEKVLEGIRKMRSSEEAPVDTMGCGQAGSTLPPKERR 180

Query: 181 FAVLASSLLSSQTKDHVTHGAVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARN 240
           FAVLASSLLSSQTKDHVTHGA LRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKA+N
Sbjct: 181 FAVLASSLLSSQTKDHVTHGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKN 240

Query: 241 LKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH------------------------- 300
           LKKIAKICLMKYGGDIPRSLEELL+LPGIGPKIAH                         
Sbjct: 241 LKKIAKICLMKYGGDIPRSLEELLILPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR 300

Query: 301 --------------------------------------LVGFGQTICTPLRPKCGNCSVS 321
                                                 LVGFGQTICTPLRPKCGNCSVS
Sbjct: 301 LGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVS 360

BLAST of Tan0007314 vs. NCBI nr
Match: XP_038878187.1 (endonuclease III homolog 2, chloroplastic-like isoform X2 [Benincasa hispida])

HSP 1 Score: 533.9 bits (1374), Expect = 9.7e-148
Identity = 293/413 (70.94%), Postives = 301/413 (72.88%), Query Frame = 0

Query: 1   MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPESGVKSSNGVSESETRVF 60
           M+F CPIRTPA SI FARRITC+GMSKG LSS  TSSN+VPP  GVKSSNGVSESETRVF
Sbjct: 1   MIFACPIRTPAFSITFARRITCSGMSKGSLSSLPTSSNEVPPNPGVKSSNGVSESETRVF 60

Query: 61  VRRRVKKKAEVQDSGLEVDPNVDAK------------------------------RCCPP 120
           VRRRVKK AE Q SGLEV+P VDAK                              RCCPP
Sbjct: 61  VRRRVKKNAEGQVSGLEVEPKVDAKVRASSFSFSSSKHIFHPGKCVILALDGRFLRCCPP 120

Query: 121 NIEDFAFKRTKDSPGSRKSKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIRE 180
           NIEDFAFKRTKDSPGSRKSK PLDPLL GIE S PTRQKGIAEGGKPP+NWEKVLEGIR+
Sbjct: 121 NIEDFAFKRTKDSPGSRKSKLPLDPLLNGIEVSNPTRQKGIAEGGKPPVNWEKVLEGIRK 180

Query: 181 MRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAVLRLQESGLLTA 240
           MRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGA LRLQESGLLTA
Sbjct: 181 MRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTA 240

Query: 241 DAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKI 300
           DAMDKADEATIKSLIYPVGFYSTKA+NLKKIAKICLMKYGGDIPRSLEELL+LPGIGPKI
Sbjct: 241 DAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLILPGIGPKI 300

Query: 301 AH---------------------------------------------------------- 321
           AH                                                          
Sbjct: 301 AHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWV 360

BLAST of Tan0007314 vs. NCBI nr
Match: XP_038878185.1 (endonuclease III homolog 2, chloroplastic-like isoform X1 [Benincasa hispida])

HSP 1 Score: 530.4 bits (1365), Expect = 1.1e-146
Identity = 294/416 (70.67%), Postives = 302/416 (72.60%), Query Frame = 0

Query: 1   MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPE---SGVKSSNGVSESET 60
           M+F CPIRTPA SI FARRITC+GMSKG LSS  TSSN+VPP    SGVKSSNGVSESET
Sbjct: 1   MIFACPIRTPAFSITFARRITCSGMSKGSLSSLPTSSNEVPPNPGISGVKSSNGVSESET 60

Query: 61  RVFVRRRVKKKAEVQDSGLEVDPNVDAK------------------------------RC 120
           RVFVRRRVKK AE Q SGLEV+P VDAK                              RC
Sbjct: 61  RVFVRRRVKKNAEGQVSGLEVEPKVDAKVRASSFSFSSSKHIFHPGKCVILALDGRFLRC 120

Query: 121 CPPNIEDFAFKRTKDSPGSRKSKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEG 180
           CPPNIEDFAFKRTKDSPGSRKSK PLDPLL GIE S PTRQKGIAEGGKPP+NWEKVLEG
Sbjct: 121 CPPNIEDFAFKRTKDSPGSRKSKLPLDPLLNGIEVSNPTRQKGIAEGGKPPVNWEKVLEG 180

Query: 181 IREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAVLRLQESGL 240
           IR+MRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGA LRLQESGL
Sbjct: 181 IRKMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGL 240

Query: 241 LTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIG 300
           LTADAMDKADEATIKSLIYPVGFYSTKA+NLKKIAKICLMKYGGDIPRSLEELL+LPGIG
Sbjct: 241 LTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLILPGIG 300

Query: 301 PKIAH------------------------------------------------------- 321
           PKIAH                                                       
Sbjct: 301 PKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVALELWLPKE 360

BLAST of Tan0007314 vs. NCBI nr
Match: XP_022145216.1 (endonuclease III homolog 1, chloroplastic-like isoform X2 [Momordica charantia])

HSP 1 Score: 522.7 bits (1345), Expect = 2.2e-144
Identity = 283/385 (73.51%), Postives = 296/385 (76.88%), Query Frame = 0

Query: 1   MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPE-SGVKSSNGVSESETRV 60
           M  TC I TP L IAFARRITC  MSKG LSS  TSSNK P + SGV+SSNGVSESETRV
Sbjct: 1   MFLTCAITTPTLPIAFARRITCKSMSKGSLSSLPTSSNKFPQDHSGVRSSNGVSESETRV 60

Query: 61  FVRRRVKKKAEVQDSGLEVDPNVDAKRCCPPNIEDFAFKRTKDSPGSRKSK-PPLDPLLT 120
           FVRRRVKK AEVQDSG +V+PNVDAKRCCPP+IEDFAFKRTK+SPGS KSK PPLDPL+T
Sbjct: 61  FVRRRVKKNAEVQDSGPQVEPNVDAKRCCPPDIEDFAFKRTKESPGSWKSKPPPLDPLVT 120

Query: 121 GIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRF 180
           GIE S P RQKGIAE GKPP+NWEK+LEGIREMRSSE+APVDTMGCGQA STLPPKERRF
Sbjct: 121 GIEVSNPIRQKGIAERGKPPVNWEKILEGIREMRSSEDAPVDTMGCGQAASTLPPKERRF 180

Query: 181 AVLASSLLSSQTKDHVTHGAVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNL 240
           +VLASSLLSSQTKDHVTHGA  RLQE+GLLTADAMDKADEATIKSLIYPVGFYSTKARNL
Sbjct: 181 SVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKARNL 240

Query: 241 KKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH-------------------------- 300
           KKIAKICL+KYGGDIPRSLEELLLLPGIGPKIAH                          
Sbjct: 241 KKIAKICLIKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRISNRL 300

Query: 301 -------------------------------------LVGFGQTICTPLRPKCGNCSVSD 321
                                                LVGFGQTICTPLRPKCGNCSVSD
Sbjct: 301 GWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLLVGFGQTICTPLRPKCGNCSVSD 360

BLAST of Tan0007314 vs. ExPASy TrEMBL
Match: A0A0A0KU72 (ENDO3c domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G015140 PE=3 SV=1)

HSP 1 Score: 527.3 bits (1357), Expect = 4.4e-146
Identity = 280/358 (78.21%), Postives = 289/358 (80.73%), Query Frame = 0

Query: 1   MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPESG---VKSSNGVSESET 60
           M F CPIR PALSI FARRITC+ MSKG  SS  TSSN+VPP  G   VKSSNGVSE ET
Sbjct: 1   MFFACPIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPET 60

Query: 61  RVFVRRRVKKKAEVQDSGLEVDPNVDAKRCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLL 120
           RVFVRRRVKK AE QDSG EV+P +D KR CPPNIEDFAFKRTKDSPGSRK KPPLD LL
Sbjct: 61  RVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAFKRTKDSPGSRKLKPPLDLLL 120

Query: 121 TGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERR 180
            GIEDS PT  KG AE GKPP+NWEKVL+GIREMRSSEEAPVDTMGCG+AGSTLPPKERR
Sbjct: 121 NGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERR 180

Query: 181 FAVLASSLLSSQTKDHVTHGAVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARN 240
           FAVLASSLLSSQTKDHVTHGA LRLQESGLLTADAMDKADE TIKSLIYPVGFYSTKA+N
Sbjct: 181 FAVLASSLLSSQTKDHVTHGAALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKN 240

Query: 241 LKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHL------------------------ 300
           LKKIA+ICLMKYGGDIPRSL ELLLLPGIGPKIAHL                        
Sbjct: 241 LKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR 300

Query: 301 -----------VGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKNL 321
                      VGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLK SSSTK L
Sbjct: 301 LGWVSGKGSKQVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL 358

BLAST of Tan0007314 vs. ExPASy TrEMBL
Match: A0A6J1CVP5 (Endonuclease III homolog OS=Momordica charantia OX=3673 GN=LOC111014717 PE=3 SV=1)

HSP 1 Score: 522.7 bits (1345), Expect = 1.1e-144
Identity = 283/385 (73.51%), Postives = 296/385 (76.88%), Query Frame = 0

Query: 1   MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPE-SGVKSSNGVSESETRV 60
           M  TC I TP L IAFARRITC  MSKG LSS  TSSNK P + SGV+SSNGVSESETRV
Sbjct: 1   MFLTCAITTPTLPIAFARRITCKSMSKGSLSSLPTSSNKFPQDHSGVRSSNGVSESETRV 60

Query: 61  FVRRRVKKKAEVQDSGLEVDPNVDAKRCCPPNIEDFAFKRTKDSPGSRKSK-PPLDPLLT 120
           FVRRRVKK AEVQDSG +V+PNVDAKRCCPP+IEDFAFKRTK+SPGS KSK PPLDPL+T
Sbjct: 61  FVRRRVKKNAEVQDSGPQVEPNVDAKRCCPPDIEDFAFKRTKESPGSWKSKPPPLDPLVT 120

Query: 121 GIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRF 180
           GIE S P RQKGIAE GKPP+NWEK+LEGIREMRSSE+APVDTMGCGQA STLPPKERRF
Sbjct: 121 GIEVSNPIRQKGIAERGKPPVNWEKILEGIREMRSSEDAPVDTMGCGQAASTLPPKERRF 180

Query: 181 AVLASSLLSSQTKDHVTHGAVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNL 240
           +VLASSLLSSQTKDHVTHGA  RLQE+GLLTADAMDKADEATIKSLIYPVGFYSTKARNL
Sbjct: 181 SVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKARNL 240

Query: 241 KKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH-------------------------- 300
           KKIAKICL+KYGGDIPRSLEELLLLPGIGPKIAH                          
Sbjct: 241 KKIAKICLIKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRISNRL 300

Query: 301 -------------------------------------LVGFGQTICTPLRPKCGNCSVSD 321
                                                LVGFGQTICTPLRPKCGNCSVSD
Sbjct: 301 GWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLLVGFGQTICTPLRPKCGNCSVSD 360

BLAST of Tan0007314 vs. ExPASy TrEMBL
Match: A0A6J1CTU5 (Endonuclease III homolog OS=Momordica charantia OX=3673 GN=LOC111014717 PE=3 SV=1)

HSP 1 Score: 521.5 bits (1342), Expect = 2.4e-144
Identity = 283/388 (72.94%), Postives = 296/388 (76.29%), Query Frame = 0

Query: 1   MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPE----SGVKSSNGVSESE 60
           M  TC I TP L IAFARRITC  MSKG LSS  TSSNK P +    SGV+SSNGVSESE
Sbjct: 1   MFLTCAITTPTLPIAFARRITCKSMSKGSLSSLPTSSNKFPQDHSGISGVRSSNGVSESE 60

Query: 61  TRVFVRRRVKKKAEVQDSGLEVDPNVDAKRCCPPNIEDFAFKRTKDSPGSRKSK-PPLDP 120
           TRVFVRRRVKK AEVQDSG +V+PNVDAKRCCPP+IEDFAFKRTK+SPGS KSK PPLDP
Sbjct: 61  TRVFVRRRVKKNAEVQDSGPQVEPNVDAKRCCPPDIEDFAFKRTKESPGSWKSKPPPLDP 120

Query: 121 LLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKE 180
           L+TGIE S P RQKGIAE GKPP+NWEK+LEGIREMRSSE+APVDTMGCGQA STLPPKE
Sbjct: 121 LVTGIEVSNPIRQKGIAERGKPPVNWEKILEGIREMRSSEDAPVDTMGCGQAASTLPPKE 180

Query: 181 RRFAVLASSLLSSQTKDHVTHGAVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKA 240
           RRF+VLASSLLSSQTKDHVTHGA  RLQE+GLLTADAMDKADEATIKSLIYPVGFYSTKA
Sbjct: 181 RRFSVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKA 240

Query: 241 RNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH----------------------- 300
           RNLKKIAKICL+KYGGDIPRSLEELLLLPGIGPKIAH                       
Sbjct: 241 RNLKKIAKICLIKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRIS 300

Query: 301 ----------------------------------------LVGFGQTICTPLRPKCGNCS 321
                                                   LVGFGQTICTPLRPKCGNCS
Sbjct: 301 NRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLLVGFGQTICTPLRPKCGNCS 360

BLAST of Tan0007314 vs. ExPASy TrEMBL
Match: A0A1S3BX24 (Endonuclease III homolog OS=Cucumis melo OX=3656 GN=LOC103494544 PE=3 SV=1)

HSP 1 Score: 510.0 bits (1312), Expect = 7.2e-141
Identity = 278/386 (72.02%), Postives = 287/386 (74.35%), Query Frame = 0

Query: 1   MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPESG---VKSSNGVSESET 60
           M F CPIR PALSI FARRITC+ MSKG  SS  TSSN+VPP  G   VKSSNGVSE ET
Sbjct: 1   MFFACPIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPET 60

Query: 61  RVFVRRRVKKKAEVQDSGLEVDPNVDAKRCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLL 120
           RVFVRRRVKK AE QDSG EV+P +D KR CPPNIEDFAFKRTKDSPGSR+SKPP D LL
Sbjct: 61  RVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAFKRTKDSPGSRESKPPPDLLL 120

Query: 121 TGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERR 180
            GIE S PT  KG A  GKPP NWEKVL+GIREMRSSEEAPVDTMGCG+AGSTLPPKERR
Sbjct: 121 NGIEGSTPTTHKGNAGRGKPPENWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERR 180

Query: 181 FAVLASSLLSSQTKDHVTHGAVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARN 240
           FAVLASSLLSSQTKDHVTHGA LRLQESGLLTA+AMDKADE TIKSLIYPVGFYSTKA+N
Sbjct: 181 FAVLASSLLSSQTKDHVTHGAALRLQESGLLTAEAMDKADEETIKSLIYPVGFYSTKAKN 240

Query: 241 LKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH------------------------- 300
           LKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH                         
Sbjct: 241 LKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR 300

Query: 301 --------------------------------------LVGFGQTICTPLRPKCGNCSVS 321
                                                 LVGFGQTICTPLRPKCGNCSVS
Sbjct: 301 LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVS 360

BLAST of Tan0007314 vs. ExPASy TrEMBL
Match: A0A6J1GW15 (Endonuclease III homolog OS=Cucurbita moschata OX=3662 GN=LOC111457673 PE=3 SV=1)

HSP 1 Score: 501.9 bits (1291), Expect = 2.0e-138
Identity = 273/382 (71.47%), Postives = 288/382 (75.39%), Query Frame = 0

Query: 1   MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPE-SGVKSSNGVSESETRV 60
           M   CPIR  ALSI FARRITC GMSK   SS   +SN+ PP  SGV+SSNGVS+SETRV
Sbjct: 1   MFLACPIRNSALSITFARRITCTGMSKRSSSSMPITSNESPPGISGVQSSNGVSKSETRV 60

Query: 61  FVRRRVKKKAEVQDSGLEVDPNVDAKRCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLLTG 120
           FVRR VKKKAE Q SGL+++   D KRCCPP+IEDFAFKRTKDSPGSRKSKPPLD LL  
Sbjct: 61  FVRRTVKKKAEGQYSGLDLELKDDVKRCCPPDIEDFAFKRTKDSPGSRKSKPPLDTLLPR 120

Query: 121 IEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFA 180
           IEDS PTRQKG+AEGGK P++WEKVLEGIREMRSSE APVDTMGCG+AGSTLPPKERRFA
Sbjct: 121 IEDSNPTRQKGVAEGGKTPVDWEKVLEGIREMRSSEVAPVDTMGCGRAGSTLPPKERRFA 180

Query: 181 VLASSLLSSQTKDHVTHGAVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLK 240
           VLASSLLSSQTKDHVTHGA LRLQESGLLTADAMDKADE+TIKSLIYPVGFYSTKA+NLK
Sbjct: 181 VLASSLLSSQTKDHVTHGAALRLQESGLLTADAMDKADESTIKSLIYPVGFYSTKAKNLK 240

Query: 241 KIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH--------------------------- 300
           KIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH                           
Sbjct: 241 KIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVRGICVDTHVHRICNRLG 300

Query: 301 ------------------------------------LVGFGQTICTPLRPKCGNCSVSDL 319
                                               LVGFGQTICTPLRPKCGNCSVS L
Sbjct: 301 WVSGKGSKQKTSSPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSHL 360

BLAST of Tan0007314 vs. TAIR 10
Match: AT1G05900.2 (endonuclease III 2 )

HSP 1 Score: 270.4 bits (690), Expect = 1.9e-72
Identity = 168/384 (43.75%), Postives = 215/384 (55.99%), Query Frame = 0

Query: 4   TCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPESGVKSSNGVSESETRVFVRR 63
           T PI    L+    R      +S     S S  S  +  +S  ++++G SESETRV +R+
Sbjct: 9   TFPIVARVLNAMNRRMYAATTLSSA--KSISAESLNLRSDSNSEAAHGASESETRVSLRK 68

Query: 64  RVKKKAEVQD----SGLEVDPNVDAKRCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLLTG 123
           +  K+ +++     S  E     D   C  P+IED  +K+T  +  SR  K  L+  +  
Sbjct: 69  KRIKQDDLEPVKKCSARETKARKD--MCGLPDIEDSPYKKTNGTASSRTRK--LNSYIKS 128

Query: 124 IEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFA 183
            E S        A  G PP NWEKVLEGIR+M+ SEEAPV+ + C + GS LPPKERRF 
Sbjct: 129 TEASPSASSIKTAGLGIPPENWEKVLEGIRKMKPSEEAPVNAVECDRTGSFLPPKERRFY 188

Query: 184 VLASSLLSSQTKDHVTHGAVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLK 243
           VL  +LLSSQTK+H+T  AV RL ++GLLT +A+DKADE+TIK LIYPVGFY+ KA N+K
Sbjct: 189 VLIGTLLSSQTKEHITGAAVERLHQNGLLTPEAIDKADESTIKELIYPVGFYTRKATNVK 248

Query: 244 KIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH--------------------------- 303
           K+AKICLM+Y GDIPR+LEELL LPG+GPKIAH                           
Sbjct: 249 KVAKICLMEYDGDIPRTLEELLSLPGVGPKIAHLVLHVAWNDVQGICVDTHVHRICNRLG 308

Query: 304 ------------------------------------LVGFGQTICTPLRPKCGNCSVSDL 321
                                               LVGFGQTICTPLRP CG CS++++
Sbjct: 309 WVSKPGTKQKTSSPEETRVALQQWLPKGEWVAINFLLVGFGQTICTPLRPHCGTCSITEI 368

BLAST of Tan0007314 vs. TAIR 10
Match: AT2G31450.2 (DNA glycosylase superfamily protein )

HSP 1 Score: 265.4 bits (677), Expect = 6.0e-71
Identity = 169/371 (45.55%), Postives = 211/371 (56.87%), Query Frame = 0

Query: 16  FARRITCNGMSKGCLSSFSTSSNKVP-PESGVKSSNGVSESETRVFVRRRVKKKAEVQD- 75
           F R  T +    G +SS    S K   P S   S+ G S SETRV+ R++  K+   +  
Sbjct: 20  FYRIGTMSRQIHGAVSSSKHISLKTQHPLSDSNSAYGASGSETRVYTRKKRLKQEPFEPL 79

Query: 76  ---SGLEVDPNVDAKRCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLLTGIEDSYPTRQKG 135
              SG  V+ +   K C  P+IEDFA+K+T  SP S +S           E S       
Sbjct: 80  EKYSGKGVNTH---KLCGLPDIEDFAYKKTIGSPSSSRS----------TETSITV--TS 139

Query: 136 IAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQT 195
           +   G PP NW +VLEGIR+MRSSE+APVD+MGC +AGS LPP ERRFAVL  +LLSSQT
Sbjct: 140 VKTAGYPPENWVEVLEGIRQMRSSEDAPVDSMGCDKAGSFLPPTERRFAVLLGALLSSQT 199

Query: 196 KDHVTHGAVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYG 255
           KD V + A+ RL ++GLLT +A+DKADE+TIK LIYPVGFY+ KA  +KKIA+ICL+KY 
Sbjct: 200 KDQVNNAAIHRLHQNGLLTPEAVDKADESTIKELIYPVGFYTRKATYMKKIARICLVKYD 259

Query: 256 GDIPRSLEELLLLPGIGPKIAH-------------------------------------- 315
           GDIP SL++LL LPGIGPK+AH                                      
Sbjct: 260 GDIPSSLDDLLSLPGIGPKMAHLILHIAWNDVQGICVDTHVHRICNRLGWVSRPGTKQKT 319

Query: 316 -------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSP 319
                                    LVGFGQ ICTP+RP+C  CSVS LCP+AFKE+SSP
Sbjct: 320 TSPEETRVALQQWLPKEEWVAINPLLVGFGQMICTPIRPRCEACSVSKLCPAAFKETSSP 375

BLAST of Tan0007314 vs. TAIR 10
Match: AT2G31450.1 (DNA glycosylase superfamily protein )

HSP 1 Score: 261.9 bits (668), Expect = 6.6e-70
Identity = 167/373 (44.77%), Postives = 211/373 (56.57%), Query Frame = 0

Query: 16  FARRITCNGMSKGCLSSFSTSSNKVP---PESGVKSSNGVSESETRVFVRRRVKKKAEVQ 75
           F R  T +    G +SS    S K      +S  + + G S SETRV+ R++  K+   +
Sbjct: 20  FYRIGTMSRQIHGAVSSSKHISLKTQHPLSDSNSELAYGASGSETRVYTRKKRLKQEPFE 79

Query: 76  D----SGLEVDPNVDAKRCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLLTGIEDSYPTRQ 135
                SG  V+ +   K C  P+IEDFA+K+T  SP S +S           E S     
Sbjct: 80  PLEKYSGKGVNTH---KLCGLPDIEDFAYKKTIGSPSSSRS----------TETSITV-- 139

Query: 136 KGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSS 195
             +   G PP NW +VLEGIR+MRSSE+APVD+MGC +AGS LPP ERRFAVL  +LLSS
Sbjct: 140 TSVKTAGYPPENWVEVLEGIRQMRSSEDAPVDSMGCDKAGSFLPPTERRFAVLLGALLSS 199

Query: 196 QTKDHVTHGAVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMK 255
           QTKD V + A+ RL ++GLLT +A+DKADE+TIK LIYPVGFY+ KA  +KKIA+ICL+K
Sbjct: 200 QTKDQVNNAAIHRLHQNGLLTPEAVDKADESTIKELIYPVGFYTRKATYMKKIARICLVK 259

Query: 256 YGGDIPRSLEELLLLPGIGPKIAH------------------------------------ 315
           Y GDIP SL++LL LPGIGPK+AH                                    
Sbjct: 260 YDGDIPSSLDDLLSLPGIGPKMAHLILHIAWNDVQGICVDTHVHRICNRLGWVSRPGTKQ 319

Query: 316 ---------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESS 319
                                      LVGFGQ ICTP+RP+C  CSVS LCP+AFKE+S
Sbjct: 320 KTTSPEETRVALQQWLPKEEWVAINPLLVGFGQMICTPIRPRCEACSVSKLCPAAFKETS 377

BLAST of Tan0007314 vs. TAIR 10
Match: AT1G05900.1 (endonuclease III 2 )

HSP 1 Score: 234.2 bits (596), Expect = 1.5e-61
Identity = 137/275 (49.82%), Postives = 177/275 (64.36%), Query Frame = 0

Query: 4   TCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPESGVKSSNGVSESETRVFVRR 63
           T PI    L+    R      +S     S S  S  +  +S  ++++G SESETRV +R+
Sbjct: 9   TFPIVARVLNAMNRRMYAATTLSSA--KSISAESLNLRSDSNSEAAHGASESETRVSLRK 68

Query: 64  RVKKKAEVQD----SGLEVDPNVDAKRCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLLTG 123
           +  K+ +++     S  E     D   C  P+IED  +K+T  +  SR  K  L+  +  
Sbjct: 69  KRIKQDDLEPVKKCSARETKARKD--MCGLPDIEDSPYKKTNGTASSRTRK--LNSYIKS 128

Query: 124 IEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFA 183
            E S        A  G PP NWEKVLEGIR+M+ SEEAPV+ + C + GS LPPKERRF 
Sbjct: 129 TEASPSASSIKTAGLGIPPENWEKVLEGIRKMKPSEEAPVNAVECDRTGSFLPPKERRFY 188

Query: 184 VLASSLLSSQTKDHVTHGAVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLK 243
           VL  +LLSSQTK+H+T  AV RL ++GLLT +A+DKADE+TIK LIYPVGFY+ KA N+K
Sbjct: 189 VLIGTLLSSQTKEHITGAAVERLHQNGLLTPEAIDKADESTIKELIYPVGFYTRKATNVK 248

Query: 244 KIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLV 275
           K+AKICLM+Y GDIPR+LEELL LPG+GPKIAHLV
Sbjct: 249 KVAKICLMEYDGDIPRTLEELLSLPGVGPKIAHLV 277

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
B9DFZ02.6e-7143.75Endonuclease III homolog 2, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=NTH... [more]
Q9SIC49.3e-6944.77Endonuclease III homolog 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=NTH... [more]
A7M7B92.0e-3437.39Endonuclease III-like protein 1 OS=Gallus gallus OX=9031 GN=NTHL1 PE=2 SV=1[more]
P785495.3e-3236.75Endonuclease III-like protein 1 OS=Homo sapiens OX=9606 GN=NTHL1 PE=1 SV=2[more]
O359802.9e-3036.49Endonuclease III-like protein 1 OS=Mus musculus OX=10090 GN=Nthl1 PE=2 SV=2[more]
Match NameE-valueIdentityDescription
XP_038878189.11.7e-15276.50endonuclease III homolog 2, chloroplastic-like isoform X4 [Benincasa hispida][more]
XP_038878188.11.9e-15176.17endonuclease III homolog 2, chloroplastic-like isoform X3 [Benincasa hispida][more]
XP_038878187.19.7e-14870.94endonuclease III homolog 2, chloroplastic-like isoform X2 [Benincasa hispida][more]
XP_038878185.11.1e-14670.67endonuclease III homolog 2, chloroplastic-like isoform X1 [Benincasa hispida][more]
XP_022145216.12.2e-14473.51endonuclease III homolog 1, chloroplastic-like isoform X2 [Momordica charantia][more]
Match NameE-valueIdentityDescription
A0A0A0KU724.4e-14678.21ENDO3c domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G015140 PE=3... [more]
A0A6J1CVP51.1e-14473.51Endonuclease III homolog OS=Momordica charantia OX=3673 GN=LOC111014717 PE=3 SV=... [more]
A0A6J1CTU52.4e-14472.94Endonuclease III homolog OS=Momordica charantia OX=3673 GN=LOC111014717 PE=3 SV=... [more]
A0A1S3BX247.2e-14172.02Endonuclease III homolog OS=Cucumis melo OX=3656 GN=LOC103494544 PE=3 SV=1[more]
A0A6J1GW152.0e-13871.47Endonuclease III homolog OS=Cucurbita moschata OX=3662 GN=LOC111457673 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT1G05900.21.9e-7243.75endonuclease III 2 [more]
AT2G31450.26.0e-7145.55DNA glycosylase superfamily protein [more]
AT2G31450.16.6e-7044.77DNA glycosylase superfamily protein [more]
AT1G05900.11.5e-6149.82endonuclease III 2 [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR003651Endonuclease III-like, iron-sulphur cluster loop motifSMARTSM00525ccc3coord: 280..300
e-value: 8.8E-6
score: 35.2
IPR003265HhH-GPD domainSMARTSM00478endo3endcoord: 186..317
e-value: 1.2E-8
score: 44.8
IPR003265HhH-GPD domainPFAMPF00730HhH-GPDcoord: 183..247
e-value: 3.6E-11
score: 44.0
IPR003265HhH-GPD domainCDDcd00056ENDO3ccoord: 178..278
e-value: 2.70783E-26
score: 100.009
IPR000445Helix-hairpin-helix motifPFAMPF00633HHHcoord: 248..273
e-value: 1.7E-7
score: 30.8
NoneNo IPR availableGENE3D1.10.340.30Hypothetical protein; domain 2coord: 169..276
e-value: 3.7E-27
score: 97.0
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 300..320
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 98..122
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 33..54
NoneNo IPR availablePANTHERPTHR43286:SF3ENDONUCLEASE III HOMOLOG 2, CHLOROPLASTICcoord: 273..318
NoneNo IPR availablePANTHERPTHR43286:SF3ENDONUCLEASE III HOMOLOG 2, CHLOROPLASTICcoord: 66..274
NoneNo IPR availablePANTHERPTHR43286ENDONUCLEASE III-LIKE PROTEIN 1coord: 273..318
NoneNo IPR availablePANTHERPTHR43286ENDONUCLEASE III-LIKE PROTEIN 1coord: 66..274
IPR004035Endonuclease III, iron-sulphur binding sitePROSITEPS00764ENDONUCLEASE_III_1coord: 281..297
IPR011257DNA glycosylaseSUPERFAMILY48150DNA-glycosylasecoord: 149..309

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0007314.1Tan0007314.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006284 base-excision repair
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0006281 DNA repair
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0051539 4 iron, 4 sulfur cluster binding
molecular_function GO:0003906 DNA-(apurinic or apyrimidinic site) endonuclease activity
molecular_function GO:0003677 DNA binding
molecular_function GO:0019104 DNA N-glycosylase activity
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0016829 lyase activity
molecular_function GO:0003824 catalytic activity