Tan0008131 (gene) Snake gourd v1

Overview
Name: Tan0008131
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: cleavage and polyadenylation specificity factor subunit 1
Location: LG07: 6373113 .. 6422015 (+)
RNA-Seq Expression: Tan0008131
Synteny: Tan0008131
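
The Location field above can be read programmatically. The following is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(text):
    """Split a location string such as 'LG07: 6373113 .. 6422015 (+)'
    into (sequence id, start, end, strand)."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both
    # endpoints count toward the length.
    return end - start + 1


seqid, start, end, strand = parse_location("LG07: 6373113 .. 6422015 (+)")
length = span_length(start, end)
print(seqid, start, end, strand, length)
```

Under the inclusive-coordinate assumption, the gene spans 48,903 bp on the forward strand of LG07. If the coordinates were instead 0-based half-open, the length would be one base shorter.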
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
AAGCCCTCCCGCCAAACCTCCCCAAAGCCCTAGTCTGCTGGTCCGTTTTCCCTCCACTCATTCTCATTGTCTGCAACTCCATGTTCCGTTGCCACCTCCAACTCCGTCTCTAAACCCTCAAATTCTCGAAGCCATAACTTCCTCTCAAGCTCCATAGTCTTACTCATAATATATCCAATTTCGTTGTGAACATTTTACTTTCCACTACCGGAGGATGAGTTTTGCCGCCTATAGAATGATGCACTGGCCTACGGGCATCGAGAACTGTGATTCAGGCTTCATCACCCATTCTCGCGCCGACTTCGTACCCGGTGTTACATCTCACACCGACGAACTCGAGTCCGACTGGCCGGCCCGCCGAGAAATGGGTCCAGTTCCTAATCTCGTTGTCACCGCCGGTAATGTCCTGGAGGTATATGTTGTTAGGGTTCAAGAAGAGGGTGGCAGTGAATCAAGAAGCTCAGGAGAAGTCAGGCGCGGTGGTATTATGGACGGAGTCTCTGGGGCCTCGCTCGAGCTTGTTTGCCACTATAGGTATGCTTCGAGTTTTGTCAATGTACACAAGGATTGTCATGCTAGTATGTTCAATCTTCAATGTTCTAACTAGCTTGTCAGAACATACACCGTGGAGCAGCTTTTTGTGAGCTCCTCAATTTTGGGTCTTTTTGCGTCTCATTTTGTACTTCTCATCAAATGATTGATAATATTATTTCTCGTAGGAAAGAAAAAAAACATGCGATTAGCTTTCCCGAGCATTTGCAAAGGAGGCCCCTTTATATGATCCTTAATTATCTGCAAAAAGCATATGCACCAACATCCTTCTACCTTGCTTGTTGACGCTGGATAATTTGGAAGAACATAAAAATTCCATCTACTTATTCTTATTCCTTCCATCTCTGTTGATTAAGATAAGTTCTGGAAATAGTCTGTGATTAATTTTTCTGTTTCAAAAGGGAGGGATAACCAGGAGTCCCTCATCGAAAAGAAACAAGAAGCTTAACATCTTTATAGAAAATAGAATTCTAGAAATAATTGGGTGTTCTTTTTTTAATACTCTAGCCTTCACAACTTTTTTCTTTGGGGGGGGGGGGGATTTTTATTCCAAGGAGTCGGCAGATCAATTACCATTGGCTGTTTAGTTTGTCCATCCATGGTGGATGGTGTAAGGTGTCATCAAGACAATTTAAACGATATGGTTGGAAGTCTATCTGAATCAAAGTAATGAATGCATGCCCAGGTCTTATGAAATATGAACTAATTTTTAAGTTTATTTTTTAAAATATAGAACCTAACTTTAACTAAGAGGAATGGAGTACAAAAAAAGAAAGTCATACAAAAAGATAGGCCGAAACCAAACACAAAGAACTACATAGGAAAGAAAAAGCTACAAGAAGGAACTCCAATTGTTGGTAATAGAGAACAAAGGAGTATTAGAATAAAACTTAGTCGAATAGAAACAAAGGAATCATTTAAGTGTACAAACTCCAAACCTCTCCTCAAGATCTCTCTACTCCCTTAAACACCCTTGTTTTCGTTTCTAGCAAGACGTTCCATAAAGCAGCAAAGAAGCTAATTTTCCAAAGAAATCTTCCTTTGTGTGGAAAAGAACGGTGTAGTAGGGCTTCCTATGCATATCACGTCGCTCTTGAGTTTACCACCTCCCCAAAAGATAAGAACTCGCTCATCTTCATTGTGAATGGGCAACTCTACAGAATGTGATTTAAATCCACTAATGCTTGTTTTCAAGTAATGAAGAGCATTAGTGAAGAACTAGAGTGGAAGATGTCAGATTTATTTTTGGTAAGCACTTGCTATTTTTTTTCAGTGATAATTTATTTTTCTCGCATCTTGCACAGGTTGCACGGTAATGTTGAGTCCATGGCAATTTTGTCTAGTAGAGGAGGAGATGGTTCCAAGAAGAGAGATTCAATTATATTAGTCTTCCAAGAAGCAAAAATTTCAGTGCTAGAGTTTGATGATTCTATCCATAGTCTCCGTAC
AAGGTGGGTGGAGTTGTCAAAGGAACAAGATCCTTGTTTGCCTTTTTGATTGTTCTCTACAAACATTTAGTCCTTCACATATGCAGGCTTATTTACTTTTAGACAAAGATAACGTACATAATCACAATCAGAAATAATTACATGGGATGTGTAAATTCTAAGCTTTTGGTATTTCATTTTGGCTATATGTGTTTCTACTAGAACTCTTTCTCATTGTCTGATCCATTAACACAATTGGGGACTGGGTAAGAAAAATTTCTAATTGTTGGATTCATATTTATCACAATTCTTTTGATTTTTCTATCTTTATTTTTTTGAAAGAAACATAATTGGTGCAAGCAGATAATTCTTCATTCTAGATTTGATATTTATGATGTTTTTTTTTGAAAGATGATATTCGAAGGCAATACAACTATATATTGGCGATAAAATGTATATTGGTACAGCCTTTTGGTTTTTATCAAGGTTTCTATAGAGTCCATGACAACTTGCATTTGGGTCAGATATATATATATATTTTTTGGAGAAGAAGCAAGAATTTCAATGAATGAAATTTACAAGAAGGAAGAATAAACAACCAAACATCTAAAGTGGATTACAAAAATTCTTCCACATATCTAATCAGATAATTTATGCAATATAGATGTTACTTTCTTATCAGGCTTCTGGTTTCAAACTACTTAATTGTCTTGTTTAGTTTCCCAAAATGGTTCCACATTATTAGTGAACTTCATCATCTTCTATTTAAGATTTTTGCATTTGTATTTGTTTTTTACTTTATAAAAACTTATAGAACCTAAACAATGCTGCATAACTAGAATCTTATGTTATATGTGGGAAAAGGGTTATTTGACGAGGTATTTAGTTAAGCTATCGACAACATAAAATAATAAAAAGGTTACTGTATTTGTAATTGTTAGATCAATTCATCTTTAGTAGCCTGACCAAACATTTTCTTTGACTTTAATTTTATTGTGACTTATATAATAAGACAACATACTTCTTTGTTAATGCACGACCATTGTGTAATTATTCAATCCTTCCAAATTCTTTCCTAACTGTTTCTTTCCTCCACCAGATAGATGTACAAATGTACTATGCTTGGAAAGAACTTGTGCAAGATGATACTCTGGTTATTTTATTATCTTAGTTGTATAATTTTCTTTTGCATCTGACAATTACCTTTAAGTGAATTTTCATTTGTAGCTCAATGCATTGCTTTGAGGGTCCACAATGGCTTCATTTGAAAAGAGGTCGAGAATCATTTGCAAGAGGTCCAGTGGTAAAGGTTGATCCTCAAGGCAGGTGTGGAGGAGTTCTTGTTTATGGTTTGCAAATGATAATACTTAAGGCTTCTCAGGTGTCTTTTGCCTTTCAAACTTTTTATTTGGATGTGCATGACAACTCAATGCCATGCTTTGTTGACATATAGTTCCTAATATCTTGTATAATATATTGGATAGAAAGAGAAACTATGTTGGATTAAGAAGTAGTTCTATCCACGGGCTTGTGTACTGTACTTTTATCCTGCTAAGTTTCACCATTTACAATGTTGCACACTGAACTTTGTGGAGGGGTTATCTGTACCCTTGCCCTTGGGTTGTTCTTTTGGCTTGTGTACTATATACCTTTATCCTTCAATTAAGTTTCACCATTCACAATGTTTGCTCATTGAGCTTTGGAGAGGGATTATCTCTGCCCCTTGCCCTTAGGTTGTTCTTTTGGCTTTATGTGAATATATTTCTCTATGTTTCTTATTAATAAATTGTTTGCACATTTTTTTAACAAGAAATAACTCTCCATTAAGATTATGAAAAGTGACTAATGCTCAAAAGATACAAACTTCACTCGGGAGTGAGAGGAAGCAAATAGGAACATCACACAAATCAAAAGAAACAAGAAAACAAACTACACAAAAGCCTCCCAACTAAGATAAATATCATTCAAAGAATAAGCCTCAAAAGGTTTGGAGAGTGTGCACCGAGATGGTGCCTTCAAGC
ATGTTGAATCAAATCTTTCCACTCACGAAAGACTTTTGCTTTCAAATATTCTTGAATTTCTTTCCCACCATAGTTCTGACAACAAAACCTTGATACCATTAACCCATGAACGTGTTGACTTTGTGGAGTGCTTTTATTGTTTCTTTCTAAAGCTTTAAAGTTATGTACTCTTATGTTTTATTGTCAGCTTGTTTCTTTTTTTTTTTTGGGTTTTTTTTCCTCACCTCTAAGCTTAGAGAGCACACCTTGGTGGGAGGAAACTTCAAACTCCAATCCTGAATGATCAAGAACCTATCAATTAGAGAGCACAATAATCGGTCTTGAGCAGTCTTCCGATCTTCTACTTTTTGCTTTTAGAAGCCCCTGTGAGAGCTATAAAGAAGATGTAAAATCTGGTGAGGATTTTTTTTTTTTTTTGGGGGGGGGGGGGGGATTCTGGATGCAGTCATCTTGTAAACTGGGAAAAGATCTCTCTGCCTTCGAAATATGGTGGGCCCGACCTAGATGCTTTTCAACAAAAGAATAGAGCGTTACTCTTGAAATGGCTTTGGAGATTCATTAGAGAAGAAGAAGCTCTTTGGAGGGAAGTTATTGTGAGCTACGGTCTCGATCAACCAGGTTGGTGCACCTTCCCTCCAAAAGGCAACTCAAAAGGTAGGACCTGGTTTGAGATTGCCAAAAATGGTGCTGTTATTTAATTATTTATTTTAAATAGTTTCCTTTCTTTTAAGGTGAACAAGGGCAGTCAGATTCGCTTTTGGAAAGATGTTGGAGATGCTACTCTAGCTTCTTCTTTTTCAATTATCTTTGGTATTTCGATGAAGCAAGAAGCTATTGTGGAAGAATGCTAGAGGAATGATTGGGATTTGGGCCTTAGAGGAGGCTTGTTTGATTGGGAGATTTCTTGTTGGATTGGACTGATTGAACTGATTGATGTCATGGCTAGGAGTAGGGGTTGTTGAAGCACTTTGGTCTTTAGAAAAGTCAAGGAGTTTTTCTGCAAAATCAACCCTTTTGGAGCTTACAGATTCCCCTTCCTCTTTGAGAGTTCCCTTGATTGATGCCACATAGAAGAATGGTTCTCCCAAGAAGGTGAGGGTGTTCCTTCGGTCTCTGGCCCATAGAAGAATCAATACTCATGACCTTCTCCCAAAGAAATGTAGGCATTGGGCTTTGTCGCCAAATGTTTGTTTGATGTGCTTGGGAAATGAATAATCTCTAGACCATTTGTTCCCTCATTGCTCTTTTGCCTCTCAGGCGTGGCAGTGGTGGTTCAAGTAATCAAATTTCTCCTTTTGTTTGCCTAGATGATTGGCTCGTTGAGGTTTTGGTTGGGTGGAATATGAGGAAAAGAGGTGAAGTTCTTTGGGCTATGACTGTTAGGGCTTTGTTGTGGAGTAGTTTGTGATTGGAAAGGAACCATCGCCTCTTTGAAGATAAAGCTTTCCCTTTTGCTCCTTTTTGTTTTAATGTGCAGTCTTTAGCTTCTTGATTGTGCACAAAGAACATAAAATTCTTTTGTAATTACAACTTATCCACGATCCAACTAAATTAAAAAGCATTCTTATAGTTTTTGTTTTAGGGAGGGGTTTTCGTTCTACCCTTTGCCTTTAGGCTGTCTTTTTTGTGAGCCTTGTGCATAATATCCCTCTTTGTTTCTTATCCAAAAAAAAAGAGAGAGCACACTATTCTTTCTTCCCCTAATTAATGGAAGTTATACTTGGATGAATGGGCAAGAAAGATTGGCGTGCTCTCTAATTGATAGGCTCTTAATTAATCAAGATTGGAGTCTAAAGTTTCCTCCCACCAAGGTAACAAGAATTCAAAAAATCACTTGTGACCACTCCCCTTTATTATTGCAAACAAGGACTTTCAAGTTCAAGAATATTGGGCTTAAGCATCCATCCTTGTTCAAGACCTTTGAGGAATGGTGAAGCAACACTACCTTTGAGGGATGGCCCGTTTGGGAAGGGGATGAAAATATAAGTTGTTTTCGCAAA
TATACTAATGCTAAAAGGAAAAAAAGTTTGGTTACAGAGGTCCAAAGTAGTGAGGGGAGAAGCTTGGACAAAGAGGAGGAGATAGTTGAAGAATTCATTTCTTTCTTCCAAAAGCTCTATAAGAAAAAAAAGGTCAGAATTTTAGCATTGAAGGCCAATGCAAGCAGAAGAAAGTGCTGCCTTAGAGGACCCGTTCACAGAGTTTGAGATCAAGCAATAGTGGGATGTTTTTTATTTAAAAAATTGAAGAGGAGGATTCCATAATTTGGTTCCTTAAAAAAATGAGAATCTATTTGGTTAAGTTGGCAAGTGGCAACTTCAATTCTCTTGGACAAACACAGTTTTTTTATCCAGCTGTCATTGACTCTATTTGGAAATCGAAGACTCTTAAGAAGGTTAAAGTTCTTCTCTGGATTATGTTGCAGGGGAAGCTAAATACCTTAGATACTCTCCAAAGAAAAATGTCGAATTTTCTTTTCACCCTTAGTTGATGTGTTCTACTAAAAAAAAAGCCACGAATGTCCCATTGCAATATGGCAAGGCAAAGTTGTTCAACCTTTGTGGAACAACGTAATGTGCACAACCCTTTGTGGCTCAAAAGAAACCAAAGAATTTTCAATGGGCTGGAGTTGAGCTTGGACAACACATGGGACAACATTAAGTTTTATTCTTCCTTTTGTTGCATTCGTTAAGGAATTTTGTAATAATATTTTTTGTAATTGGTCAAATAGCTTCTCTTAAAGGAGTGGCAACTAGATACGACTTTTATGAGGTCCCATGGGGAGAACCAATAGTCATCACAAGACAAAACTTCCATCATGATTGGTACACAATCATGTTAAAAAAATCTAACTTTGCAAGAGCGGGAAAGAGCATGCTAAATACATTCCATCTCAATAAAGGCTTATTGACATGGCAAGTCAATGCTCCTATGTTCTAATAAAGTATGGTGTACCATCGATAGCTTTACTGATAAGCTTGCTAGGTGAGGACTAAAGAATTTTAATCATTCCTTCCTATGGCAGGTGGATTAAATTTCTTTAGGCTCTTTTTAGTTTTTACAATGGTCTTTTCTTTTTTGCCTCGCATGCCAAGTATTTTTCATTTATCTTGATGAAAGTTTGGTTTCTCACTAAATTTTTTTGGTTGTTAAAGAATCATCGCACGACTTATAATTTATGCATGTTTTTATTGCTTGATGCATTTATTTGTACTTCTATGTTAAGTTCTCCTTATGTTCAACTTGCATTAAAATATTTGTTTGTCAGGCTGGATCTGGTTTGGTTGTGGACGATGAAGCTTTCGGTAACATAGGTGCAATTTCTGCTCGAGTTGAATCGTCATACCTAATCAACCTAAGGGATTTGGATGTCAAGCATGTAAAGGATTTTGTTTTTGTACATGGTGAGTGTACATGGGAAATTCCTTGATTTTCTATTATACTTATCTAGCTTTACTTTTTCCTCTCATGTTTTGTATTTCGTTTTATCTATTTTTATCTTGTCTCGATTCTTTGTATCTATAACATTTAAATTAGGGGGACATTTAGGAAAATTTTAAGAATTTTGACTACGTTGCCTTGACTGCCCCTTTTGTTTTACCCGATGCTCGCTTAATTGCGCCTTTTGTTGCCTTGATGCGCATTTGCCTCATGCGAAGGCAAGTCCATTTTTTTGTCCTTTTAATAAGCTATCCACCCCTACTCTTATATTAAAAGGCAAGTTGCCCTGACGATGAGCTCTCTACCTTGGGCTTTTGTTTTTGGGCGGAATCTCATTTCCTGGGGTGCTAGAAAGCAACCCCTTACTCACCTCTTCAACATCAATAAAAAAGCCTTTCCCCTTTGAATAATAATAAGGTTTGGGAGGATTATTGGCGTGAATTAGATGATGTTGTAGGTTTGGGAGGAGATCGTTGGGTTATAGGTGGTGACTTCAATGTAACCCATTGGACTTGAGAAAAATCGCATAATCATTCCATCACTAAAAGTATGCGTATCT
TTAATTAGTGGATCTCGAATTATGAGCTGTTAGATCTTCTTCTTCAAAATGACAAGTTTACTTGGTCCAGTTTTGGTGAGTCGCAATACAAATCAGTCTTGGATCGATTTCTAACCACTGATGCTTGCCTTAGTAAATTTGGTAATGCATTTCTCAAGCGTTTAGACCGTATAACCTCTGACCATTATTCGTTGGCCGTATCGTTTGTTGATATAGTTTGGGGGCCCTCTCCCTTCCATTTTGAGAATGCGTGGCTGCGACACACTTCTATTCGACAGGTTGTTGACGGTTGGTGGCATCAAAATAGTTTGGAAGGTTGGCCTGGACATGGCTTTATGATGAAATTAAAAGGCCTCAAATTTGTTCTTAAACAATGAAGTAAATCCAATCGCTCACATGCTTCTCAAATTTCCTCCTTTTTGGCTCGGTTACAGTTGTTAAATAACTTAGCAGACAATGGCCCTCTTATTAACTCTCAGATGGATTCCCGTCGTTTGCTCTGTGAACAGATTGAGGAATTGACTGTCCAAGAGCATATTTCCTGGAGTCAAAGGTGCAAGTTGAAGTGGATCAGTGAGGGGGATGAAAATACAAAATTTTTTCATTGAATTCTGGCTGCTCGTAGGCGCAAAAATTTGATCACATAGATTCTTTCTAGAGATGGCCTAAGCTTGCTGACAGCTTCTGATATTGAATCATAATTTATTGGCTTTTACACATCTTTATTTACGAAGGATATGGAAATCAAAGATTCATTCCCACCAATGTTAATTGGAGCCCTATTAGTATTGACCAAGCTTCCAGTCGGGAAGGAATCTTTACTGAGGATGAAGTATTCAAAGCTGTTTCTTCTTTGCAAAATTTTTCAAATTTTTTTGGGATACTATCAAATCTGATGCTATGGTTGTGTTACATGATTTTTTCTCCTCTGGGATTATTAATGTTTATGTGAATGAAACTTACTTATGCTTAATCCCGAAGAAGCTCACCCCCAAATCAGTATCGTCCTATTAGTCTTATTTCTTGTGCTTACAAGATTATTGCTCGAGTCTTATCTAATAGGTTGAAGGCTGTTTTGCCTTTTACTATCTTTGAGAATCAACTTGCTTATGTGACTGACCGATAAATTATTGATGCCTCTCTAATGGCTAACGAACTGATAGGTGATTGGACTGTTTCTCAACGAAGTGGTGTGGTTTTAAAGCTGGATTTAGAGAAAGCTTTTGACACAGTTGGCTGAGATTTTCTGGATGCCATTTTGCATGCCAAAAGCTTTGGTGTGAAGTGGAGATCTTGGATTAGGGGCTGTATCTCTAGTGCGAACTATTCAATCTTCATTAATGGTAGGCCTCATGGTAAGATCATTCCTTCTCGTGGTATTCGACAGGGCGATCCATTATCACCTTTTCTCTTCATTCTGGTTTCTGATTGTCTGAGTCGTCTAATGGTCCATAGTGCTTTGATTGGGGTCATCTCTACTCATCCGATTAGGAACTCTTCATTCTTTTTGAATCACTTGCAATTTGCTGATGATACTCTTTTATTCTCTACTGCTGATAGACAAGCATTGACTCGACCTTTTGAAGTGGTTAAGATCTTTGAATCAACATCTGGTTTTAAGATCAACTTGTCTAAGAGTGAGTTGCTAGGGATACATGTTGAGGAAGCGGAAATGGAGTGGATTTTGTCACATTTTGGCTGTAAAAAGGGATGTTGGCCTTCTACTTACATGGGACTTCCTCTAATGGGGAATTCCAAGTTGAGCTCGTTCTGGAATCTTGTAGTTGAGAAGATGAAACAGAAACTACATTACTGGAAGTATGCCTTTATTTCAAAAGGAGGTAGACATACCCTTATTCAGGCTACTATTGCTAGTATGCCAACCTATTATTTATCTCTTTTTAAACTTCCCTCTTCAGTTACAAAAATCATGGATAAAATTATTTGTGATTTTTTTGGGAAGGTTCTCGAGCAGATGGTGGTAGTCACAATGTGAATTG
GAAGTCGGTACAACTTCCAAAGCTGTTGGGGGGCCTTGGTATTGGTAACTTTAAGCATCGAAATTCTGCTCTTCTAGCCAAATGGGCATGGCGATTTGTTCATGAACAAGATTCTCTATGGCGGAAACTTATCGTTGCTAAATACTATGAGGCTTTACATGGGGATGGATGGCCACAAGATATTAGCTTGCGTGCTCATAAATCACCATGGCGATATATTTCACACTTGTCGAATTTGGCCTCAAGCAGGTCAACTCGTATTCTGGGCAATGGTTGTCATACTTTATTTTGGAAAGATTCTTGGCTCAGTTGTGGACCTTGTGCATTGGCTTTCCCTCGTTTATTTCGACTCACAGTAAATCCAGATAGTTTGGTGGTTAACCTTTGGAATGTCGCTAATGATGCTTGGGATTTGCAACTTCGACGTGGTTTAAATGATTTGGAAATAGAGGAATGGGCTACTCTATCCCAACTCTTGTCTCCAATTCGACTTCGAAATATCCCTGATACTTGGTCTTGGCCTCTTGATCCTTCTGGTACCTTTATGGTCAATTCTCTTATGGTATTTTTGGTAGGTGGATCTGATGAATTACGTAAGGATCTGATCTGCATCTGGAAGGATCGTTATCCAAAGAAGGTTAAAATCTTCTTGTGGGAGCTGAGCTTGCGTACAATCAATACCTTAGATCGTCTTCAGCGAAGAATGCCTCATATGTCTCTATCACCATCATGGTGTGTTATGTGTTGCTCTAATCTGGAAAGTGCTGGTCATCTGTTAATGCATTGCCCCTTTGCATCGCGATTTTGGTCTTTTATGCTCGAAGCTTTTGGTTGGTGCCAACCTTTTCTAACCACAGGTTTTGATTTTTTATCTTCAGTCTTTGTGGGACATCCTTTTCATGGCCTTAACTCTTAAGAAAACTCTGTGGCTTGCCTTCACTCGTGCTTTCTTTTGGACTTTATGGGTTGAACGAAATGGTCGAGTTTTCAGGGGAGTTTCTTCTACCTTTGATCGTTTTTTGGATTTGGTTCTTCTTAATGCGTTCCTATGGTGTAAATCTTAATATCCTTTCTCGCTCTATAGTTTATCCTTTTTATTTTCGAATTGGAAAGCCTCCTTGTAACTCACCTTTGATGTTTGAAGTTTTTCTCCTCTATTTCATATATCAATGAAATGGTTTCTTGAGAAAAATAAGTCAAGCTTTTCATGGGAGAAACTAAAGAGTTGAATGAAGGACAAGAAAAAATGGAAGAGAGAAGTATAGAAAATAGAGAAGAATAAAAAGAAAGAAGAAGGGAAGAGTTGGAAGAAGGAGAAGGAAAGACGTAATAGAAGTAAAAAAGAAAAGAATAAAGAGAGAGAAGAAGAAAAAACGTAGTAGAAGAAGAGTCTCATCCAAGACATATGTGCTAGTTGCTTGAACTGTCAAATGGCGTGTTCTTACTTAGAAGACAGTTTCTAGTAGGGGTGATAGTCGGTTCAAGTCCTTCGGTTTTAGGCCCAACCGAGAACCGAATCGAACCCCTCACTTTTTACGAAAAGAGATCTGAACCAATGTCCAATTAGGTGTAGAATAAATTCTTTCTAGAAGAATTAAATAGGAGTAGTTGCATGAAACTACCATCCGGCAATTTCCTTCCATGAATACAAGGGTTTATATTTAATACCGGAAGAGAGAAAATGATTCACCTTTAATAGCATAGATTCCCTAGGGATAGAGTCTTCATAAAGAAGTCTTTTTTCGGTGGGACTTCGAGTGACAGAGGCTTCGTTAAATACGTCATACAAATCTATTTTTTATTAATATTGTCTATTTTCTGGTTGAATGAGCTGCTCTTCTTCTTCAAGTATAATTGAAAGGATATGGCTTTTTTAATCCTCTTATTGTGGTTGAACCTTTCTCGCCTAAGGAAATACTATGAGGATTTTTGGACTTCCGACCAATGAATCCATGTGGTTTTTAAATAATTGCTCAATTGAATTTCCCTTATATTTTCTT
CCACTTCATTATTCTTATTTAGTAATAGAAATCTGATGGATGAGTCTGCTGGCAAATTAAAATTCAATATCTCTTCATCTTCCCCTTGCTTCTTTAGTAAACAAGAGCTTAAAAACGAGGACAAACTTGATGTAAGAACTTTTTGACCAGAAGTCTCTTTCTAGAATCCCTTAAACCTCTTGTATTCCATGAGATTTTGGTGTAGATTTTGCATGTACATTTTCAGGTGTGGGGTTCTGTTTTTTGATTAACATTGAGGCGGGTGGTTTTCATGCCTTTTCACGTTGTAACTCGTCAGTTGTTCAATGAAAAGTTTGTTTCATTCATTTTTTTAAAAGATAAAGTTTAGGTTTTCATGAAACGTTTAACCTTTGAATCAACTTTCGTGTATGATATTTCTGACAGAAGATTCCTCCTCGTTTAGGTTACATTGAACCTGTGATGGTGATTCTTCATGAGCAGGAGCTTACTTGGGCCGGCCGTGTTTCATGGAAGCATCATACGTGTATGATTTCTGCACTAAGTATTAGCACAACCTTGAAGCAGCATCCTCTAATATGGTCTGCCAACGTAAGTATAAAGCTGGTTCAGTCCTACCAGAAGCTGCAAAATTTCACCAACTTAAATTGTTCAGATATGGATTTTTTTTATCAAAATAATCTATTGCATTTTAACCTTTTGATTAACCCGCATGTGGAATGATTTCAAAATGATAGTTTGAACCACAGAAGAAATAATGTGATGTAGCCCAGGCGACTAAGGAGAGTTCTCTCCAAGAAGTCTATCAAGAATCTTTAAATCCATAGAGAGATTCTCTATTCTCTATTTATAGAGCATTGAGTACACTAAAATCCTAAAAAGAAACTAAAAAGATTAATCGTGGAATTATCCAAGGTTAACCACCAATCCTATCAATTCTATCCCTCCCCTCAAAAAAACTCTCCCTCGACTTTTTTAAGCTAAAGAAAGTTGGAAATTTTCCTTAAAGAAACTTCTTCCGATGATTAGAGGTTGTCTCATAACATTCTTCAAAAACTTGTTCTAGTGGTGCTTCTTTCAATGATTGGAAAAAACTCCCACCGAAAATGTTGTCTTAAGCTCATCAAAACACCATGAAAACCTCAAAGCAGCATAAAGAACCTGGAAATAGTTATATTTCGAAGTCTTCTCTGCCTTCGTTCCATCTAGCTCAATGTCGTCATCTGAAGAAGAATATAGGGATTGGTTTGGTTGTCCAACCGAGTATGAATCTTGAAATATAACGTTGGCAGATTTAAATGAGTAAATTTCATATGAATGGTGAGAAATCGGTTTGGCTTCCTTCTCATTCTCCTCCCCAATTTCTTGCTCTGTTCTAATATATTCTTGAAAGATTAAAGACTTATGTCCACTGAAATCTTGCTATTCAACTTCATAGAACAATTGAGCCAAATAAAAAGGAGAATCATCAACCATTTTGATAAATTCTTTCTCAACTTCTCTTATCAAAAAATTTCTCTCAAATTCATCTTCCTCATCAAAATAATTTTCAAATTTTAGAAAATATTCTTCTTCTTTGGTGAAAAAACCAGAATGGTAAACATCAAATCTGTGAAAAAATTTAGTTACTGATCCGAAGTGTAGGCATGGCAGAAATTTAGTTACCAATCCGATGGTCAATATCACATTGTATCAATCATTATCCATGCCTGTGGTAATATTCCCAATCCACATGCAGGTATATGGTTTCTGTATGATTATTATTTTGCCCAGAAAACGGGCATCCGTATGGCTTTTATTTCTTTTTGAATGCCATGTAAATACTAATAACTGAATAGTTTATAAATTTTTTTCCATATTGCCACGGTGTTTTTTTCACAAACCATCAGAAATTGGCCTAATGCTTGTAACCTAGACAGAGAACTTCTTTAAGATAAGGAAGCCATGTTAGGCATGACAGAAAGCTTCTTGGGTTCAACTAGGTGGGTATTATAAGTTCAGAATTCCTTGGGATTTCCCTA
GTTGTGATATCGTGGGTTAGATAGTAATCCACAATACATGGAGGAACGTGTTTGGAGTCTTTATACCTTATTGAAATGCTAGAAGAATAATTTCAATTTGTTGTCTACTTCTAGAAAATATAAACATTCACGTCTTTTCTTTTCTTTTCTTCTTCCTTTTTTTTTATATAAATATCCCTAAACATCCACTTCTGGAATTAATGTTGATATACCACGAGTTATGGGGATTCAGGCTATTTCTATTTGTTCCCCAAATTTTTTTAAGAGTAAGTGAATTACATTATTGATAATCATGATGACATGGTTTTGCCAATAAATATTTAGACAAGCAACATGTGCGGATTCTGCATGGCCTTCTGTTTTGGGCTTTGTGTGTTTTTGTTTTTTCGTTTTTTGGGCTATTTATGTTCTTCTTTTGGAAGTTCTTGTAACTTTTATTGTAATTTTTTATTTTATCAATAATATTTTTTTTCCGTTGTGAAAGGACGTCATAGTTTGATTTTGTATTTTATAATGATGTTATTTTTATTTAAATTGAGACTTTCATGTCACACTGCAGTGAGACGTGTGCCTCATCACATCTGGAGGAATAGCCTCTGGCCCACCTTTGAACTCTTAAAACATTGCTAGGGACTAACTCTTCCCGACAATATTATCTATTTGTTCCCTGCACCATTTTTCGGGTTACCAACAAAATATAAATTTTTTGTCTAATCGAAGGTATCTAAAGCATGGTTAATCTTGTGGCCTGTAATCATTTCAAATGAGTGGAAATGGTGGTAGTTGCCCAAAAGTTCTGTAAATTGATTAATGTTTGATCTAGATGGTCATTGTTCAATTGCAGCTTCCATGGTATTGGTGGGGTGAGGGGGTCTAATCAGTTGCAAGTTACTAATCTTTCTTTTCCTAGAACTCTCTCAATTGACAATAATTTTCTTTCTTTTGTTTCTTCCTTTTTCCTAACTTTGTATGTTGTAGAACCTCCCTCATGATGCTTACAAGCTTCTTGCAGTGCCATCACCAATTGGTGGTGTACTTGTTGTCAGTGCAAATAGTATTCATTATCATAGTCAGGTAACATTCCCTTTTCTATTCTTGTACTTTTAATTAACAAATTATTTAATATTTGTCTTCCCCAAAAAAGAGCTGAAGTAATTTTTTGTACGAGGGTCTCACCTATCTGGCTTTCTCCACTTTATATTTTTATATGAACTTTGGACTTGGAGTTGATCTTTTCATATTTGTGTTGTAATCTAATGAGCTGTTTATCTATTTCAGTCAGCTTCATGCATGTTGGCTTTGAATAATTATGCTGTTTCTGCTGATAGCAGGTCACACTTCTCTCTCTCTCTCTCTCTGTTTTTTTTTTTGATAAGAAACCTGTATATATTAACAAAAAAAGAGAGAAACAACCTAAGGGTTAGGGGAAGGACCCCACCCACAAAAGAAATTTAAGAAAAGCTTTTCGATTGTTTATGATCATGGAGAGGCCTAAGCTCATTTTCATTTTTTTTGTTTATTTATTTACTTTTTCCCTTTATATTTACTTGACCTTTTCCTACCTCTTATATTTACTTGACCTTTTACTACCTCTTCTCCTCCTTGAAATGGGAAAGAAACATGTTAGCTGTCAGAGCTTATATCATTTATCAATTTTTAATTTGGCATTCATGCTGGGACATTAGGTAAATCAAATTTGACAGTCAGTTCAGAAATGCTGGATAATTTATCTGTTGAACATTGTTTTTCTTTTTCCTCGAGCGATTGAGATGTTTAAGGTTGATGAGTTTTTTCCGTACTCTTTTGGCTGCGGTCCTTGGAAATTCATATTCAACGTCTAACTCATTCCCTCTGTTTTGTTAATTTCTATGTTAGGAATGGCGTTCAATTTACTGAGAACTGTGGTTGTGCAGTGCTCTTCGTTTGGTTTCTTTCACCTGGCTGTTTAATGTGTCTTTGTCTTCTGGTTCTTCCATTGCTTCTCTTGATTCCTTTCAATG
TAGGCCTCCTCGAGTCTTTATTTTTGCTCAAAATATAATGACATGGAGACTATTAATTTGGCTTCCCACCAGCATCTGCAAAGTGTCAAGCTGCCCATCTTAGTCTTGCTATTCAGGATAAAATATTGTGATTCTTGCATCCTTGGGGTTTCTTCTCTTGTGATTTCTTTCTATTTTCTTTTCTTTCTCAAAGAATCTTTCCTTCACCTCTTCTTTTACCCAGTTTATCCTCTTCCCTTTGTCGAGAAATTTTACCTTTTTTAGAGAAGCTTGCAAGAGCGAAGGTGCCTAAGAAGGTGGGGTATAATGCCTCATTACAGGGTTGAAGCAGAGTTCTTTTGTTGGTTGAAGCAGGGTTCTTTTGTTCTCCAATCTTACCTCTATTGGTTGAAGCAGGGTTCTTTTGTTCTTGGAGCTTCAGGCTTTTTTCTGACGTCTTTGGTTAGAGAGAAATAGTTTTCATTTATTGGAAGGAGGTTTGCAGTTGTGTGCAATATAATGCCTCATTACAGGCTACTTCATCTTTTTTTGTATAACATTCTCATCTCTTGATTCCCCCCCCCCCCCCCCCCATTATCTCAGTTCTTAAACAAACCACCTTTACCTTTTTCTCCCTCCTTGGAACCTTCCATTGAACCGTCTTCTTCAAGCTTCTCCACGAAACATCATACCAACACTTTTCCATGTGAGTTGCGTGACATTGCTCACCTTCTCACAGAGCATGAGTTATGTATTATGCCCATTCCAACCCTACCCTCACCCACCAAGCCTAAAAAGACTTATAATAGAAATAGGAAGAAGGACAAATTAACGAGGGAGTTACAGAATTTACAAACCACTGTTCACTATGACCAATCTGCCGCATTGGCTCTAATGGAGGGATCTTCGGTTGTAAAATGAAATTTATCACTTGGAATGTTTGGGGTTTAGGTTCTTGGAAGAAACAATCCTTGTTTAAGGATTCCATTCTCAAACAAAATTCGGGAATTGTTCAGGAAACTAAGTTATCTTCTGTCAGTCACCGGTTGATAAAGTCTATTTGGAGTTTGGTCGGGCTTTCCTTGATGCACTTATTTCTGCTGGTGGATCTTAATTTTTGGAGTGAACCGAGCTTTACTGTTAAAGAAATCTCTCAGGATTGGTAGCTTTACTTTGGTTTGAGAGAAATAGTCGTTTTTTCAGGGTTGTTTCTGCTCCTTTTGAACGTTTGATGGACCGAGTTTCTTTTAATTGTTTATTCGTGGTGCAAGATTGTTCATCCATTTAGCCTTTTTGGCTGTTCTTTTTTTATTTCCTTTTGGAATTTTTTCTTGTAATACTCCTTTTGGAGTTTTTCTCCATTTCATTCATCGATGAAATTGTTTCTTATACCAAAAAAAAAAAAAAACTTATGCTTTCATCTCTGTATTTCAGTCAAGATATGCCTAGATCAAATTTCAACGTGGAATTGGATGCTGCCAATGCTACGTGGTTGCTAAATGATGTGGCCTTGCTGTCAACCAAGACTGGGGAGCTATTACTGCTGGCACTTGTCTATGATGGACGGTGAGCACTTTAATTTTTCTCAGTGTCCATACATTCCAATTTCACAGCATTATCATCGTAATTGTCAGCTTCTATTGCTTCAGCTTTTAGGAATATTCCTTATTTTTTTAGGGGTATTTGACTGGCGAGGAACCATAGGATTTTTTGAGAGGTTGAGAGATCTTGGAGAGGAACAATAGGATTTTTAGAGGGATTGACAGATATTGGGGGGAAGTTTGGGAGCTTCCTAGGCTTAATGCATCCTTATGGGTGATGTGTTTGACATCGTGCTTTTTGTAATTACCATCTTGGTCTTATCCTTTCAATTTGATGTTATTCTTCTTGATAAGTTTTAGTCTAAACTCATGATTTTCCAACATCATTAGTTTAAATGTTTATAGTTCAGTTTGCGACACAAAAAATTAGATTATTAAACATAGCTGGTCATGTTTAAAATCATTAGAATAATAGATGAG
CGTGGAAATATTTTCTTGAAGTAGACACTTTCTCTCATTTGAACATTTTTAGCATTGAATAATATACAATGTTGATTTGCAGGGTTGTGCAGAGACTTGATCTTTCAAAGTCTAAAGCTTCAGTACTTACATCGGTCGGTTAGATATGTTGTTTTCATGCCAGGATACTTTGATACTACTTTGTCCTTCCCTTTTAGTTGTTAACTTGATTCAGAGACGATGTTTAAATATATATATCCTTTTGTTTTATTATTAAATAGGAAACATGAAATTGGTGTTTATATGTATATATTGTTGGAATGATATGTAGGGGATTGCATCAATTGGAAATTCATTATTTTTTCTGGGCAGTCGATTGGGAGATAGTTTGCTTGTGCAGTTTAGTAGTGGAGTGGGATCCTCAGGATTGGCATCTAGTCTAAAAGATGAGGTTGGTTGGGCTCCCTCAACTTACTTTTGCTTGTTACTTCAAACTTGTTCTTGTTGGTTACTTTGGACATTTCAAGTTAATGTTGTTTCCTCGTATTTTGGTGATTCTTACCTTCCCTGTTGTTCCAACTCAATTACTTGTATGCCATTTGACTTTATGCATATTGGAGAAGATTCCTTCTATTATTAATAAATATGCACATGACCTTTCATTGAAATTGAGAAGAAGGAAGTTCATTGCTTCCTTGGGTCTTTGCTTTGTTTTGGTCTTTTTGGAGTTCTTTTGTATTCTCTTTGTTTATTTGTGTTTCACCTGACTGGTTCTTTTTTTGTTGTAAGGTTTGTACCTGGAGCGTTAGTCTCCTTTCATTATATCAATGAAATCTTTGTTGCCTTGTCCAAAAAAAAAAAGAGAAGAAGGAAGTTCATTTATTGATAAAGCATGGAGTTTAATAAGTTCATTTATTATTCATGAACTCATGTCTTAAGTATTGATTAATTAAGTTTACCATAACCCATCAACTTAAACTTTTGAGTTGATTGGTTATTTAACATGGTATCAAAGTAGGTTGTTCGTAGTTCAAACGCTTCCTCTCCAATTAATATTGATTTCCACTTATTAGGCTTTTGACAAATTTTCAAGCCCAGAAGTGGGAGAATATTAAGTATTGATATAATTAAATTTACTATAGCCTATAACTCATCAACTTAAGCTTTTTGGTTGATTGATGATTTAACATCATATTTAGATTCTTTTGCCATTTTGTGGTTTAGCTTATTAAGGAGATTGTGTCATTTGGATGGTATTTGTTAGACCCCCAATTATCAAGATGTTAGGAGGGGTAAGAGGGTAATTGTAAATAGGGAATCAGGGACCACTGAGGAATGAAAGCTGTCCGTACGGTTAGGAGCATGGGAGTGACGGTTTTTTATAGTGGTTTTGTCATCCAAGTGAGGTAGGTTTTATTTTGTCAGACTTTTGCTTGTAGGAGAGAATCAGCCTCTCGAATTGGTGGGTATCTTTTGTAGTTGTTCGACAACTTTCCTCATATTATCAGCTAATGAAATAACCCCCTTCGATATTGAAGAACTATCATTATCGGAGGGTTGGTTCTTGTTATCCATTTCATTGTTGATTAGAGTGAGGTTGAGTGAGTTTGCAGGTCTTAGTTTGGGATACCTAACAGTATTCTTATTCCCTCTGGTAACCTAAATTCTTACCTGCATCCGATCGGAGATAAAGCAGTTTTATTAGATTTTCCTTTCAGGGAGAACAAGGAAGTTCATTTATTTTAAATTATGAAGTTGAATAAATTCATTTACTATTCATTTGAAATGTTCTTTTCCATGAACTTCACATGTTGAGATTAGCTTATCGTTACAAATGCCAAGTTGTCTTTCAGTGCCATGCATATTAATATTTACTACATTTTATAGGTTGGAGATATTGAAGTTGATGCTCCTACAGCAAAGCGAATGCGTAGATCATCTTCTGATGCTCTACAAGATATGGTTGGAGGAGATGAGCTGTCATTGTATGGTTCAGCTCCAAATAATGCGGAATCTGCTCA
GGTTTCTTATAATGGACTATGTGAACAAACTTTCAATGTCTCTTAACTTTTGTCTTTCCTTTCATTTTTTTTTCCTGGATAAAGTGAATAGAAACTAAGAGGTCCGATAGAAGCCAAAAAAAAAAAAAGAAAGAGAGAGAGGGATAGAGATAGAGAGAGAAGCAATCAAGGACTACCACTTGGTATCGGTTGGATTTGTAAAATACATCTAGTCTTGTTAGAACTTTAAAAGTTCCAACTTTCTACGCGTATAGGCATTTTTCCCAGATAAATGCACTAATGATTATTCATGATGTTTGCTTGTAAATATGTTTTTTTGGTATATTTTGTATGTTCACTTAGTGTTAGGATTAGTTAAGCCAGTTAATAGGTTCTCATTGCACAATTACTGTGTGTGTATGGTCACTTAGTGCTAGAATTAGTTTGTTAGTTGATTAGTTAAGCCAGTTAATAGGTTCTCATTGCACAATTACCATGTGTTAATTATTTTAAGTGTTATTGGTTTGCTGTTTTTTGACTTGTTCAAATAATGTAAAATTGACTTCTCTTTTCATTTTATCAACTTCGTTTCTTGTTTAAAATAATGTAAAATTGGCTGGTTAATATCAACAAATATAAACCCTTTTGTACACCTTCGCCAAGATTGGCATTAGAGTTTGAGAAATAAGACATATCCAACCACCCTTTTGGCAAGCTATTTTTCTTTTTGCAATGATGTGGTAAAAACTCTTCTTCTAATGAACTTCTAAGTAGTTGCATTTAAGCTTCAACCATCCTCCTCCTATTAGAGGAACTACTCGGATGAATGACAACCCTAGAGGAAAATTATGAGGGAATGTCGTTGCTATTAAATATCTTTCTTAATGATTGGTGAGCCTTAATGGCATTAAAAATTGGAGGATGGAGGATTCAAAACGAGCAACTGCATATTTGATTAGAAGAACCATTTTAGGTATAGATAGCAAAACGGGAGGAGGTTTGGGGTCTAGCATGTTTTGACATCCCTTTTTGGGTCCATTCATCAATCATTTTTTATAACTATTCTTGTGTGCTATTGATCTCAAGGCCTCTTCCTGATGTGTTCTTCCCAATCCCTTTGAGGGGTTTCTTTAGTTGATATTAACTTGAACTGGAATGGTTTTATTTGTTCTCATTATTGTTTGATTGTAGACTGTTTGGTTTTGTATTTTCTGTTTCTCTCCTTCGTGGCGTTGATTTTGGAGCATTAGTCTCTCTTTTCATTTTTTCTATGAAAGGTGCGATTTATTTCTGGTTAAAATAAAATTATTCTTGTGTGATTACCAACAGCTGGTGCTTTTTTCTACAATTTTGTTGTTACTGTGTTGGTTATGTCAAGTTTGTGTGTGTGTATGTGTTTTTAGAATAAAATAACCTCCGGTAGTTCCTTCCTTAAATAAGCTACTCAAAACATCTTAATCAATGAAACAGTAAAGGCAAGGAGGGGATCACTTTTATCTCTTCCCCCGGCGCTTTGGTTGTCCTTTTGTAATTATTACAAGTTTCTTGTTTCTTATTAAAAAAAGAGGGGATTACGATTAAGACTTTACCTTAACTTGTGGATGCTTTTATTCTATATCTTGGATTACCATCAAGAAAATTTGAGAACGTAGTACCATGAACGCATTCCTTTGTCTGATTAATCAATCTGTGCATAAACCAATTATCTGGTTAACTTGTTTTGCTAGTCTAAAATTATTTGGCTGGCTTCTTTTTTTCAAGAGAGACAAAACACGTATAGTACATTTTAGAGGGTGTGACAGAAACATACAAGATAACAGTTAACACCATGTTTGAGTTTGTTCCTATAAATGCAGATTCTCTTTTCAATGCAATCACTAGTGTTGGCTTCTCTGGTGAATCATTGCCAAGGGAATAAACGGACTTTATTTTGATTTTTGCTCAGAATTTGAGTCTTTTTTTAATTCTTGCTGCTTGGCTTCCTGACATCAGACCCTTTTCATGCATGCAGAAAAATTTTT
CTTTTGCTGTTAGAGATTCATTGATCAATATTGGCCCTCTGAAGGATTTCTCCTACGGTTTAAGAATTAATGCAGATGCTAATGCGACTGGAATTGCCAAACAAAGCAATTATGAACTTGTTAGTAACCTTATAACGATGTGAATTTTGGTATTCAGTTAAAGATTATGTAATTTCTCTTCTTTGATAATTTGTAGGTTTGTTGTTCGGGTAATGGAAAAAATGGTGCATTATGCATTCTTCGGCAGTCAATTCGCCCTGAAATGATTACAGAGGTATATTATTGTGATTTTTTTTTTTGACAATAAACAAATCATACCTTTCATTGAAAAAATGAAAAGAGAGACTAATGCTCTAAAATACAGCTCCACAAAGGAGAGAAAAAGAACACAAAAACAACCAGAAATTAATACAACCAGAAATTAATGAGAACAAATAAAAGCATTCCAATTCAAGTTAGTATCAAATAAAGAAAAAACCCTCAACGAGATTGGATAGAGCACTCCAGGAAGATTCCTTGAGATGAATAGCATCGAAATGAGCTTCCCAAGTTAAAAATTTATCTTCAAAGTTCCGTTGATTCCTTTCAAATCAAATATCTGAGAAAATAGCCTTGACAACATTTCCCTATATTATGGAAGCTTGTACAGCTAACAGAGGTCCGACAAAGGACATTGTCTTTAATACAATCAGAAAACACCCAACTGATGTTAGCTATTTGAAATAGCTTATACCAACACTTCAAAGCAAATTCACAGTGAAAGAAAATATAGAAACCGGATTCCGAGTCATTATGACAAAATAAACACATCGAGGGCTGCAACTTCATTGAAGTCAATTTCCTTTGCAACACATCTGCCGTATTTAAGTGACCATTCAGGAGAATCCATAAGAAGACTCTGACCCTCTTAGGGCCTTTAGATTTCCAAATAGCCCCATGAGCTAGCTTATCCAAAGGCAAATGATTAGCTAAGTGCTTAGACAACGAGTTGACTGAGAACAAACTAGAGGAGTCTTCCAACCATCTTCTTGAATCCTCCAACAAAGTAGGTTTGAACTTATTCAACAAAGCCAAAAATTCTGAAAAATCAGCAATTTCGTCCTCCTTTAACATCCTTCTAAGGATGTATTCCAAAAACCAGTATTAAATTCCCTAAAATCAGTCACACAACCTGATTGGGAGGTGGATATCAAAAATAAAGAACGGAACCTGTCCATGAGAGGAGCCACTCGTACCCAAGAATCTTGCCAAAACCGAATTCCCTGCCCATTGGGGAATGGTTACTAACTAATAATAAATTTATTCTTATAGAGAAAACTAATAATTAACTTATGGATGGTTTAATGATGTTTTCTTTCAAAACTCTTGTCTAATCATTCTTTGAAGGATGTATATCTTAATTGGAATGCTCTCAATTTTCTGTTGTAATTTTCAGTTTTCCATTTTATTGCTTTCTTATTTTTCTCCCTTATGGTGTTTGTACTTTTGAACACTATTCTCTTTTCATTAACCAATAAAAAGTTTTGTTTCTTGTTAAAAAAAAATTATGCATCGATTAAATCTTGATTTTTGAACCAAATAAAACTTGTGTAGTTTAGGAGAATGTTATAATTAGAGGAATCCATATGCGTTCCATTTAGGTATGAACGTGAATAAAACAATATGGTAAAATATTCTTATTTGGATTATTTTAAACAGAGCCTTAATACGTCAGAAGTTGCTACAATAAAAGCTTATGGCATCAGATTGTTTGCTATGTTGTGTATATTTGGAAGATCAAAATCGTGCATCTTGTAAATTGGGAGGTGGTCTCTAAGCCTCTTAATCAAGGGGTTTAGGGATTGGTAAGGTAAGGACAAGAAACAAGGCCCTTTTAGCTAAATGGTTATGGCGATTCTATTATGAACCCAATACCCTGTGGCACAAAATAATTGTAAGTAAGTATGGCCCTCATCCCTTCGAGTGGACTTTTGGTGGAGTTCTTGGCACTTCTAGGAATC
CGTGGAAAGTGATTTCGTCTCAGCTCCCTGTTTTTTCGCGTTTTGTTCGCTGTGTGGTGGGAAATGGGAAGGACACTTATTTTTGGGAAGATCATTGGGTGGGGGATAGACCTCTCCGTTCCTTGTATCCTCATTTATATCATCTGCCCACTTTAAAAAACCATTCAGTAGATTCGATTTTTTCTAGCCATTCTTCTGCGCATGTTATCACTCTTCTCCTTCCCTTGGATTCAGTTGTGCTTTGACCAATTGGGAAACGACCGAAGTTATTTCTTTATTGACTTTGCTTAATCAGTGTCGGGTCTACCCTCATCGGAGGGATGCCTGCCTCTCGACCCTCTCTCCTTCTAGTGGCTTTTCGTGTAGCTCTTTCTTTCATTCGTGCTTCTGTTTTCTCCTCTCTTTGGAAGGTGAAAGTTTCGAAGAAGGTTAAGTTTTTCGTCTGGCAGGTTTGGCAGGAAGGGTGAATACTTTGGATTGTTTGTTGGCTAGGGGGTCTCCTTTGGTTGGGCCATTCTGTTGTATTCTGTGCAGGAGGGTAGATGAGACCCTGAATCACATTCTCTGGAGCTGTGACTTTGCTCGAGTTATTTGAAGTTCTTTCTTCCAACAATTCAACTTTTGTTATGTTGGTTACTTGGATAGTAGAAAGATATTCATGGAGCTTCTTCTCAATTTGCCTTTCCGGAAGAAAGGCTTATTTTTGTGGCAGGCTGGGGTTTGTGCTATTTTGTGGATTCTGTGGCGGGAGAGAAACGATAGAATCTTTAGGGGAAAAGAGAGTCATCCTTTGGAAGTGTGGTCTCTTATTAGATTCTATGTTTCTCTTTGGGCTTCGGTGTCGGGGTTCTTCTGTAATTATTCTCTAAATATTATTTTGCTTGATTGGAGCCCCTTTTTATTGTTGGCTCCTTTTTTTTTGTGAGCTTCTTTTTTGTATGCCCGTGTATTCTTTCATTTTTTATCTTAATGACGGTTTTTCATCAAAATTTGGAAGATCAAAATCATTTGTTTTTTTTGGATGTTCATATGCTTTTGCTTGTTGGCACACATTGTTTCAGTTTTTTAGATTATATTGTGTATTTCTAAATGACTCAAGTAGTAATCTCCATCTTTTGGACGGTTGGGGCCTTTAAATCTCATTTAAAGCTATCATTTCGGAGATCTGGTTAGAATGCAATCATCATATATGCATATATATTGATAGGGAACAAAAACCATTTATGGTGAAATGTAAACACAATATCCTATCAGTAAATACATGGGACATTTTGAATGAGCGATGAGAGTAGACAAACTGTAATTGTAAAAAGGGGGAGTTAATTTGCTCCAACTTCCAAGATAAAGCTAAAAAAGATGTAGAGTCAATAAACTTAGATATATCTTGTTTCTTGTCTGAAAAGATGCGCTCATTCTGCTCCAACCACAAGTTCCAAAAGAAAGCCCAAAGAAAATTCAACCAAAAAGTATTTTTCTCCTCTTTAAAGGGGTGGGCCCTAAGAATGAGGGAAATAAGAGAGTAAATGTCACCGTGCCGGGTCATGTTTCACCCAAAAGATGATTGGATGAAATTCTAGAACTCTCAAAGGAACAACTACTGAAAATGTTTATCTGTGTTTCAAGTCCCTTCTGCATAAGAGGCAGCAGTTTGGAGAGAAACAACCAAGGGGATCTCTTTTGTAAATTGTCTTGCGTATTGATGCTTTTCTGGCTAACCTCCCACATAATATTTTTCCTAATCTTTTTTGGATAAGAACCACTCCAGATGTGATGATAGGGATCATCAATATTGGGTTGATCGATTTGAAAACAAGTAACCTTGCTGGGGTTTCCATCACGGGGAATCCCTGTCTCATCTATTTTCTAATGTGGATTTGCATTGGATTGTTGGTCCAAACTCTTCCAACATTTTTCTGTCAGCTGGGTATTCTTCAAGAATTTTTGGGATAATGTGTTGTAGTTACTTATTGGGCTCTCATTGTGTGAGAGTACTAGCATGA
GATAGTAGAGAGTCTTTGATTCTCCATTTCTGCCGCCAATTAATCTATTCTCTTGTCTATCCTCTTGGCTATTCTCTGTAAAATATCTTTCATTTCAAACACATCCTTCTCTATTCTCTCCATTCTTTCTTCCAGTTGTTTTTGCACCATTTTCCAGGATGAACATGCTCTGATACCAATTTATTAGGTATCCAAACCAGAAAATGCCAAGAACACAAAGGAACTAGTATATTGCAATATCAAGAAAGAAAATTACAATATCCAATAGGTTTTCGAGAGGGCTATCTCTCTCCTAAGTCCCACAATGGACATGACCAAAATGATGCACACCAAAATAAGGACCCTAACACTACTATTTATAACCAAGACACCCTAGTTAAAAACTCCAAAATACCCCTTACTAATATACTACTAATCTCACATTGTGGTTAAAAACTCCAAAATACCCCTTACTAATATACTACTAATCTCACATTGTGAGAGTACTAGTATGGGATAGTAGAGTGTATTTGTGGGAGATTAGTAGTATATTAGTAAGGGGTGTTTTGGATTTTTTAACTAGGGTGTCCCACACCTTGAGGACAAGGTGTGTTTTGAAGGGCCCAGTATTGTGAGAGTACTAGTATGGGATAGTAGAGAGTCTTTGTGGGAGATTAGTAGTATATTAGTAAGGGGTATTTTGGAGTTTTTTAACTAGGGTGTCTTGGTTATAAATAGTAGTGTTAGGGTCCTTATTTTTTTTTATTTTCTTTATTTATTTTGGATTCTTGTTTGTCGTTTGTATTTAGGTTGACTCACACCTTGAGGACAAGGTGTGTTTTGAAGGGCCCGATATTGTGAGAGTACTAGTATGAGATAGTAGAGAGTCTTTGTGGGAGATTAGTAGTATATTAGTAGGGGGTATTTTGGAGTTTTTAACTAGGGTGTCTTGGTTATAAATAGTTGTGTTAGGGTCCTTATTTTGGTATGCATCATTTTGGTAGTGTCCATTGTGGGACTTAGGAAGAGAGATAGCCCTCTCGAAAGGCTATTGGATATTGTAATTTTCTTTATTGATATTGCAATATATTAGTTCCTTTGTGTTTTTGGTATTTTCTGGTTTGGATACCTAACACATTGAATGTCAAAACCTTTCTTCTTTGGCTCAATTCGATTAAAGCCATTTTATATGAGATTTGGGTCGAAAGAAATCACAAAACTTTAGAAGACAAATTTACAAGTTCGAGTGTTCGATTTGATTCTATTCAACTCAAAGTTTCTTCCAGGTGCGCTCTTTCCATATTTTTCGAAGGCTTTTCTTGTTATGATATTAGTTTGAATTGACATGCTTTTTTTAATCCTTTGTAATTGTTTTACTTTTATTTTGGTTTTCTTCTTTGTGGAGTTTGTACTTTGAGCCTTAGTCTCTTTCCATCAATTCAACGAAAAGGGTTCGTTTCCTCCTAAAAAGACCTTGCTGGGTTGACCGATTTGAATTTGCTATATGAAATGCTCCACAGTGGCGTTCTTCATTTAAGTTATTTGTTGATTTCTCCATTTGGGATATTTGTTTAAATTGGGATGCATTCATATTCTCTACTTAGTTAGGTTATTATCTTCTTTCTTTTCTTTTTTTTTTTGTTTTTTTTACATTACTTCATTTAAAGGCTTCGGTTCCTTGTTAAAAAAATGTTATGTTAAAGTAACCTTATGGAGGTTTTGTTTCTCTTTTCTCTTTAAAGAGTAAGCGAAATTATTATGATTAATGACATTTTCATGGTATAACTTGTCCATTCCTTTGGTCTTCAATTTTATTCCAATTTGTATTTTAAAGTGAAATTCATGTCTTGGGAAGATGATGAAAACTTTGATTCTGTTTCAACAGTCTGTAAGTGATAGTGGTTCAAAAGGTGCACCTAGGTGCCTCGCCTCGAAGGAACGAGGCACTGGTAAGAAGGTGCGTCCTAGGTGCGTGTCTTCCATGAAGCCCTGAGGCACGTGCTCCACGCACCTCTAGGGCT
TATTGAATTTTATTTATTGTTTTTAATTTTTAAATCCAAATGGTGCATTTAGGTTAGTTATTATTTTTAAAAAAATTTCTAAAAACTCTAAATGCAAGGGCTTGCTACATGTACAAACACATATATATTTATGTTTTTTTTTAATATGTAGTACGCCTTACAAAAAAAAAAGTCGTGCCCTTTTTTGTGCCTAAGCCCCAAAGGGCTATTGCACTTTAGCGCGCCTTGAGCCTTTAAAAACACGGAGTGATACTAGTCTTATAACTGAACCATTGTTTTTGTGCCTAAGCCCCAAAGGGCCATTGCGCCTTGAGCCTACGGAGTGATACTAGTCGTATGACTGAACCATTGTTATTGGTTGTAGAACAATAAATATATAAGTGAAATCAATCAAATAAGTCCTACTTTATTTCAGCAAATATTTTATCGGTTTTAGTGACTAGCTGACTTGACAATATTTTGAATGTTGTTGCCGTATATGACATTTTTAGATGTGCATTTATGGGATTTTCAGGTTGAACTCCCAGGTTGTAAAGGCATTTGGACTGTTTACCACAAAAATACTCGTGGTAGTAGTGCTGATTCTTCGAGAATGGTTCCAGATGATGATGAATATCATGCATATTTGATTATAAGCCTTGAGGCTCGCACAATGGTAATTTATCTTTTCTCTCTCTTGATTTAAGTTTAGGTTTAGGGTGTCTATGGGATTTGGTGATGAATTTATTTTCTCATGATTCTAGAAATAATAGGACTCCAGTTGCACTTCTTTCTCTAAACAACGATACACAATGAATAGGACAAGAAATCTATAGAGAATAACCTTTTAGCTAATAGTTTCCGTTATGTTTCTTTTGCACTCTTTTGATTTGCTTTCCTACAATAGACATTCTTTAAGGATGCTAAGTAGAACCCTCATGAAGCCTATGACTTATGACCATAAGTTTTTAATATATGGGGGCAGAGGGGAGGGAGAGAGGGGGGACAGATTGTTTAGTTTAGAGTACAAGTGGGTACATTTCTGCTGTCGAAATTTGATCCCATGCCCTTTTTTGTTTTTGTTGGGCTTCTTTTGGGTAGGGCTTTTTTTATTTTAATTTTTATGTCCTTTGTATTCTTTCATTGTCTTGAATGAAAGTTTGGTTTCTTGAAATCTTTCCATTTATCTCAAAGCTAAAAAATCTTCAAATATTGATAGGATTATCATATGGTCCAACCGGAAAATTCTTCTGCTTGGAACAAAAGAATCATACATTATCTTTGAGCTAGATATCAGGCACCTTCTCCCTCATTCTAGAACATATAAATAGATTTGTAGGCTCCCCTTTCTGTAAGACAACTCAAGACTATTTGTTTTTTAGAAATAGAAACCATTTCATTGAAATGATTAAATTAATACAAAAGAAGGTATATTTGTAAATAATTACAAAAAAAGCACCTCGAATGAGAGCTAGACTCGGGACACAATTTTTTAGACCACATAAAATGACTAATTTTGCGCTAAAAAAGTTCTCTATATTTATGAACTTTTCTTTTGTTTTTTTCCTCCCTTGAATGTGGAATCGAAGTGCTTAATCATTATTGGGTTTTGTCTTAAACTATCCATATTGAAAGCAACTGAATAAGGGATTGTCTTTTATATGCATGTATATGTCCTCTATGACGATAATAGTTTAATAGAAAGTGAGAGAAGTTTGAGAAGCACCACTAAAATCTAATTAGCAATGCTTTATTTAAACTGAGAGAAGTTTGAGAAGCACCACTAAAATCTAATTAGCAATGCTTTATTTAAACTGAGAGAAGTTTGAGAAGCACCACTAAAATCTAATTAGCAATGCTTTATTCAGTAGAATTATTGATGTATATATGTGTGTGTCTGTATGACCTGTAAACTAAGGATTGTTGAATAACAAGACATAATTGACTATCTACATCAATACACAATAATGTACATGTTGTAACCATTGTGTTGGCAGGTACTTGAAACTGGAGATCTCCTA
ACAGAAGTCACTGAGAGTGTTGACTACTTTGTGCAAGGAAGAACAATTGCGGCAGGCAACTTGTTTGGAAGGTGAGTGCTCTCTTATTTTGTTAATGTAAAATGGTTAGTATTTGCTACATGAGAATCTAGAATGTCATCTTTTATGATGTTTGAATTTCCATGAGGGAAAACGTGAATAGATGGACGTCAAAAAAATTGCGAAGCAGCCTTATGGATGTTCAATTTATCAGTAATTTTATTGGCACGAATAAAAGGTTAATTTTTGTTAGGAATGTTAGTAGAAAAATATTAGGGACGTTAGGATAGTATATTTGTCATTAGTACGGAGTTGGTTACTTATGGGTGATAAAGGTCTTTGCAAATGAGGGAAGACGAGACAGGGGTGGAGGACAAAAAGGGGATATAAGGTTGAAGGGGAGGAAGGATAGATGGAAGGATTAAACAAGATATTTGTTCATGTTACCTAACTCAACAATCTGGTGATAAGTGTACAATGTACATGATAGGAACGCCTTCAGTGATTTGTGATATGGTGGGGATGGACTCTTTAGATGCTAAGTTCTCTGACCTGTTTGATACAAATTTCCATTCTCTCCTACAATTACATTTTATACAACCCTCCTTTTGGCATACAATTCCAGTAAGTCTAGCCTTTAATTATCAAAAACAGTTCTCAACGTATTTCTCTCTCCGACCTTTGTATTCATTCCTATATGCCTTAACTAGTTTAGCTATGCTCAAGTTGACATCTATTAAATCATACTAAACCTAAAAGCTTAATTAATGATTTATGGTAAATTTATTCTTATATCAATACTTTAACCTTCCATTCACTCGTGAGCTTAGAAATTTACCTACAACCCAACAAGTAGTTATCAACATTAACAAGGAGGAAATGACATTATAGTTGTTTTGAATATAAGACCTCTTGTTAGCTTAGAAATTTACCTACAACCCAACAAGTAGTTATCAACATTAACAAGGAGGAAATGACATTATAGTTGTTTTGAATATAAGACCTCTTGTTACGGAACCAAATAGCTTAAACTAATGGATTTTGGTAAATTTAATCTTATATCAATACTTTAACATTTTTTTAGGTAAAACCAATCGCAACAAGTCTCCCTCGTCGTGTTCAAGTCATTCGCCACACTTTTTGCACATTATTATCGAAACTTAAAGTGCTTGAGAATTTCAAGTTCTTTATTAATTTAATACCAGACTTTCATTGAGAATAAATAAAAGAATATAAAGGGATACAAAAAAGTGAGCCTAAAACAAAAAGGAGCCAAACTAAGACAAAAATAGGAGCAATCAAGCAAAATGAGACCTAAAGGGTAATTATAGAAAGATTTAATGAGTGAGGCCCAAAGAGAGACACAAAACCTAATAGAGGCTCACACATCCATAGAAGATGGCATGTCCTTTGTGGGAGAATGCAGGATTATGTTGTGGACCTTGGCTATGGAGGGAGGGGCTAAACATGACCAAGAAGCTTGAAAAGAGATGCCCCAACTTTAATCTTAGTTCCTTCTAGGGTATGATGTGTTAATGTGAAAATGAAGATGTCGACCATTCGTAATTGTTTATGTCGTGTCCTGTGGTTGGACTAGTTCTTTAGTGCCTTTAAATTGATTGGTCCCAACACCATTCAACCATCAACATTTGGTTCAAATTTGTGGCATCTGCAATTTCTTCTATTTGAAACTCATGCTTTGGTAGAATAGGGGGTTAGCCATTTCTGGGTTACGTGACTTGAGAGCACTAGAACAATAATCACAAGGAGAAAGGCCTAAGAAACTTCTAGAGAGGATATTACCTATTTGATGTTTCGTTTTGCTTTAAAGTAGAGGGGTTTATTTTGCTCGTAGGGACAAACCTCATTTAGGCGGGGAATAGGGAGAGAGTGTGGAGAAAATTTTCCCTGTTAGCTATTCCTGTCCCCGCCCTTGCTTCGTTCCTTGTCCCCACCCCAATTACTTTAATTTTTAATATATA
TATTTAATAAATCCTAAATTTGAACTTGGCTTTCCTAAAGTTTCTTTTTTTTTTTTTTAAAATCTCTTTAAATAGAAAATTACCACTAGGAATTAATCATTTAAATAATTTTTTTTTTAAAAATTAGTTAATCCAAATAAAAAAATTAAAAATCATAAATTGGGAATTATTTCCCTCGGGGAACCTGATCCATGCGAATTCCCCACGGAGAATCCTGGTTTCGATCCCCATAAAATTAAATGGAGAATTTTGCAAGGACGAGAAATGAAATAGGGGGCGAGGATGGGAAAGCCATTCTCAGCCCCGCCCAACCCCGTGGACATTTTGCTTTTAAGTCTAACCACTTATGTAATTATCCTTTTTTCTATTCTATCGACACATGAGTCTGTTTTGTAATCACTATATTCCTTGTATAATCTCTTTTACGCTTTTCATGTCCATAATCAAACGTCAATATTTACGTCTGTTGTGAGTTTTGAAGTTGCCTTGTGACACCCAAACTAGGAGGGGTACATGAATGAACTGACACCACATCAGAATGAGAGTGATCTTGAGGATATGCAAGACATGGTTAAGAAGGGATTGAAAGAATTGATAGTTACTACTTATATCAACAAGTTGTAACTTCCTTTTCGGTGACTCAATCATAGGAACTCTAAAATTAAGCGTGTTTGGCTTGGAGCAATCTTATGTTAGGTGACCTCTTGAGAATTTTCCTAGAAAACACGTGAATGAGGACAAAACATGCGGAAAGGACCTGTATTGATATGTGGGAACAATTGCAGTTCATGACAAGTAGTGGCATCGCCAGGTTGTAAGGAACGTCGGGGCCGTCAGGCGTCGATTCGGATTCTGAATCCTGAGGATTAGGCATTACAAGATGGTACTAAAGCGGAACCTTTCTCAGTCGGATGTGGTTCAAGATGAACCAAGGCAAAAGCTGGTGGATGTGACACCTCAGTTTGGAGGGATGCATGAATGGATCATCACCACATTGGAATGGGAGAGATCTTGTAGTTATGCAAGACATGGTTAAGAAAAACTTAAAAAAATCGATAGTTACTACATATACCAACAAGGTGCACCTTTCTTTTTGGTGGTTCAATCATAGGAACTTCAAAGTTAAGCATGCTTAGCTAAAACAAACCTATGTTGGGTGAACTTCTGAGAATTTTTTTAGGAAGCATGTGAGCGAGGACGAAGCATGTTGAAAAGACATGTGTTGGTCTGTGGCGGTATTCTTCGTTCTCAGAAGTAATTTCAAGAGAAGCACTTAGCGTGACTAGGATGCAAGGAAATTGTGGGATTGTCTGGTGTCAAATTTAGATTATGAATCTTGGTCCAAATTATCTCATGCTTTTTATTGACTTTGTCTCGATTCTTTCTCGTTTGTTCCATTCCATTATGTGTTATGTCTAAATGCAGTATTTGCTAGCGAGATAGTTGTAGACTCTGGCCTGTTTTTTGTATGCCCTTGTATATTCTTTCATTTCTCTCAATAAAATCTTAGTTTCTTACCAAAAAAAATTATTTGCTAGTGCGGTTACATTTTTATAAAGAATTGGTTTTATTTGTTTTTTTAGTTTTTCTTTTTTGTCTATACTTTGTTTCTCTGTGTTTCATTATTGTTGTTGGGTGAAGTTTCTATCGAAGAATAGTAAGGTTCTTTTTTCAATGTTTTGACCTCTTGCTTGCTTCTTCAGGCGTCGAGTTATCCAGGTCTATGAAAGTGGTGCACGAATTTTGGATGGATCTTTTATGACTCAAGATTTGAACATGGTAGTCACATGCAATGAATCTGGTAATGGTTCTGAAGGTTGTACTGTGTTGTCTGCATCTATTAGTGATCCATATGTCTTGCTGACTATGACGGATGGGAGTATTCGATTACTTGTTGGAGGTATGGTGAAGGATTTTCAAATTATGAATAGAGATAACACGTTATCATATTACGTTGTAGATTTTATTCATTTAATGTATTGTCATGTTAGTTGCCACTT
GCCAGAAAACTTCTTTGCGCTACTTAATTTTTTTCTTTGTTAAAATTTGATGCTTCATACTTGATAATTGGTGGAAGAACAACCCCATGTTTGGATGGCCTGATCATGGCTTTATGATGGAGCACCAAGGACTAAAATCGATGCTGGAAGCTTGGGATCTTGAAACTTTTGGAAACCTAACCAACGAGAAGACATCTCTCATTAAGGATCCCCATTAATTGATTCAATGGAAGAACAAGGACCCCTTTGTCAAGCTTGATTAGATCATCATTTGGCCATCAACGTTCAACCTCTCTCCATTCCAACAAAGGGAGAAACGATGTGGAGACAAAGATGCAAGGCTGAATGGTTGGTTGAGAGGGACAAAAACACCAAATGTTTCCACAGAACCGTGGCGACAAAGTGAGGATGAGCATGATAGTTGAGATTCTCTCTAGGGTAATAGCCTCTTGAATGACAGTGAGATTGAAAGAAACCTTTTAGATTGTTACTAGAATTTATTTGAAAAAGGAGGTACTCAATTTCTCCCCCATAATCTCCATTGGGACCTTATTTCTCAATCTTAAAGTGCGGCCTTAGAAACCTCATTCACAAAGGAGGAAGCTTTCAGAGCTGTTAGTTACTTGGGTACCAACAATAATGAATTCATTATTGTTAAGCTAATTCATTATTGCTAAGCTTCCTTAGCAATAATGAATTAACTCTTTTTGTATTCTTTGTAAAATATATTTATTTTTATGCTCAAATTATATGTTTAAAGTATCTAATATATCTTCTAAAAAGAAAAACGTGTCCACAACGTGGTCGTGTCCTACTTTTTTTTTTTAAATTTGGCGTATCGCCGTGTCCGTATCATTTCGTATCCATGTCCCGTGTATGTATCCGTGCTTCTTAGGTTGTCATCAGTGGTTTTGGGTGGTTTGGTTGTCAGGTTGTCGGGGTGATCGCTGAAGTAGTCATCGGGGGTGATTGGGTTGCTGGGGTGGTCGGGTCATTGGAGTAGTCACTAGGTGGTTGTTGGCAATGGTTGCTAGGTGGTTGGATTTTTGGGGTGATTGTGAATTATGGTTGTCGGGTCGTCGGAGCGGTCGCCAACTATGGTTGCCGGGTGGTTGGGTCGTCGAAGTGGTTGTCAGATGGTTGTTTGCAGTTGTTTGGTCAAGTTGGGTTGCAATGGTTGGGTCATTGAAGTATCGAGTTGTTGGGGTGGTTGAGTTGCAGGGGTGGTCGCCAAACTAATCTATAGAGTTCTCATAACAGTTGTCGCCAACGGTAATCGTCACCAGTGGTTGGGTTGTCGGTGTGGTTGTCAGAGTTGTCGTTGTCGGTAGTCGTCAGGTGTTCGTTGGAGTTGTCATTAGAGTTTTTGTCGTAGTAGTCATCGAAAATAGAAAACGATGAATGGAGGAGGAAAACATGGGACAAACTCCATGAGGTTGCAGTGTTGTAACGTATGAGGTTGTGTTTCAAAACTAAGGATTTTGAACACCTTGTACCAAACATGGTTTTGGGTTAGGAACCCAAAACCATGGGCCAAACCCCTTTAAATGGGTTATGCACAAGACTGAACTCTAATAGTTAATAGTTGGTACGTAGTCAGTGGTGCTGGCCTCACTTGAAAAGTTTCTCTTAATAAAATGTTAATGATAATGAACATTATTGCCACCCCATGGTAGTGCTAGGTGTCACAATCCATTGTGTGATGTAAGCGGATTGTGTCCTTGAGTGAAGTCATCGAGGACGATGACAAGTTTAAGTGGGGGAGGATGTCACAATCATACGAGAAGAGTGTATGTTGTGACCCCCACCGTGAAGACAAGTGCCATGATGCGACCGAGGAGGACGGTGCATCATCCAACATGTTAAGATAGGTGAGCATGAATGCTAAGTTGAGGGCAAGCAACATTGGGGACATCAGTGTCGTCGGAGGTGTCGGTGTGGGGGCGTATGATCTCTCGCCATATTGACGTGCAAGGGTGCCAGACCTAATAATCCTTATA
ATATAGTATAGGAGGTAACATGACGTCAACTCTCCACCACCCCTGAGCCTCGTGGCCTAAGTGAGCCACCCAATGGCGCACAAGTGTGTTATGGCGTCGGCTAGCGTTGCAAGACGAGTGTCTTGGGCGTCTTTCTTTTTGTAAATCACCAATTAACCCAAAAGCTTAAGTTGATAGGTTATGACAAATTTACTTATATCAACACCAACACTTGTGAATGAGTTTTCCAAGGACAATGTCGCTGTGGTGACATTGCACCCTTTCCCATGGGTTGGAGGTTTCAATTTAGTGCCCGCTAGTGGCCAACAATTTGAAAATGTCCTGATAAAGCCGACTCATGGGGAGGACCGAGTGAATATACACAGATGGATGACGTGTACAACCAGGTCAAGTTCTGCTGCACATGTAGTGATGCTTGGTTGAGGATGAATAACCTGGCCTAAGGTCTGACGAAGCCACGAAAGGTTGCGTGACGGAGGATGGGGCTTGAGTTCCCCTTAAGGCTGACCATGATTGGGGGAAGGATCATGTTGGCGCACGGTTGGAGAGGTGCGACTGTGATACTAGGCAAGTCGTGGGCAAAATTCAACCTCAATAGATGATAATTGGCACCGAGTCTGTAGAGTGAGCCCCACTTGCAAAGTTCTTCATGAATTCATGGTATATCACCATGATGGAGGTTAATTGCCATTTTTTTTAATGAAAATATATTTGTGACCGTACACAGTATGGGGACTTGTCAATTAAACATGATTTTTAACCTACTATGCAATGTGCAAAGCATTCGAAAGATTATATATAATTAATACAATATATTACATAATACAATCGAAATTAAACCAAACACCATATCAATGTATCCAGCCAAGGCCAAAGGCATTCTGAGAGTCATGTCACTTGCTTAAATAACGAACCTAAATATTTATCTTGGATCAATCGAGGACTTGCCAATTAAACATGATTTTTAACCTGCTATGCAATGTGCAAAGCATTCGAAAGATTATATATAATTAATACAATATATTACATAATGCAATCGAAATTAAACCAAACACCATATCAATGTATCCAACCAAGGCCAAAGACATTCTGAGAGTCGTGCCACTTGCTTAAATAGCGAACCTAAATATTTATCTTGGATCAATTGAAAATAATTTTAAAAAATATTGGGAGTTGATATTGATGACTTATTTAATTAAGAGAGACAAGCGTTGTTGTCAAAAGGGAAGGGAGAATTCAGAGAATTTTTAAAAAAATGTCTATATGTCCACGTCCTACTTTTTCTAAAATTGAGGTGTGTTTTGTCTATGTTGTGTTGTATCTATGTCACATCTCCTTGTCCATGCTGCTTGGGTTACAACTTTTGTTGTGGATAAGATCTTTGATGTAGTTTATCAATTGTATAAGGACTAAGGGAACATTATTGTTGTATTCACCACCTACATTTCTGTTTAAAATGTTTTCTTGGGTGTCAATGTTTCACATGCTTCGACTTTGTCACAGATCCTTCTTCTTGCTCTGTTTCTGTATCTACACCAGCCGCCTTTGGGAGTTCAAAAAAATGCGTATCTTGTTGTACTCTTTATCATGACAAGGGCATTGAGCCTTGGCTTCGGATGACAAGTACAGATGCATGGCTTTCTACAGGAGTTGGTGAAACAATTGATGGTACTGATGGCTCACTCCAAGATCAGGGTGACATATATTGTGTTGCTTGTTACGATAGTGGGGACCTTGAAATATTTGACGTGCCGAATTTTGTTAGCGTTTTCTATGTGGACAAATTTGTTTCTGGAAAATCACATTTAGTTGATTTTCAAATGTCGGACTTGCAGAAAAATTCTGAGAAGTTGGATCGAAATTCTCAGGAATTGAATAGCCATGGTAGGAATGAGAGTTCCCAAAATATGAAGGTAATTGAGGTAGCCATGCAGAGGTGGTCAGGGAAGCATAGTCGCCCATTTCTTTTTGGAATATTGACCGACGGGACAATTCTTTGT
TACCATGCTTATTTATTTGAAAGTACAGACAGTGCCTCTAAAATTGATGATTCGGTTTCCATGGAAAATTCTGTTAGCTCAAGCAATATGAGTTCTTCTAGATTAAGAAATTTGAGATTTCTTCGTGTCCCCTTGGACATACAAGGAAGGGATGATATGCCAAATGGAACCTTGTCTCGTAGATTATCTATTTTCAAGAATATTTCCGGTTATCAGGGGCTATTTCTCTGCGGGTCAAGACCTGCTTGGTTTATGGTGTTTAGAGAACGGCTTCGAGTTCACCCTCAGGTACTCGTCTAATCTATTTCTAATAGTGGATGAGGTTTAGACAACATGAAGTATTGGTATTCCTTTTGTTATGATGTTGTCCTGAATTCCAGTTAATTAGTTGTGCTTCAATTTTTCATGTTCATGTGAAGAAAAACGGCATCAATCATTTTTGTTTGATGTCGACAAGTTACTATGCTACCTACCTTAGTTTCCATGGAAGATCTATCATCTAGCTTTAGTCCCATTAAAGTCTGCAGGTGCACTTGAAATTATTTACTTTGTAATAGAATTCATTTATCTTTGCAAGTTATTCTGCTTCAAAAAAAAAAAAAAAAGAAAAGAAAAGAAAAGAAGATGCATTGCAAGTGTTAGTTGTCACTGCCAGACAAGCTTTCACGGCTTTTTTTGTTTTGTTTTCTGTTTTTTTTTTGAGAAAAACATGTCTTTTTTGTTTGATTTGTATATATTTTCTTTTTTATTTGTCGGGCAATGAAACGTCAGAAGATTCATCTTGGACTCTGATAATTACTCTTCTCATTCAGGCTCTTATTAACTGAGTAAGGATTATCATAATTTTATTGCTTTTCAAGCCATCTATGGAAACCTGAAAAATTGAAACACCTGTATCTTAAGATTAAGATATGGAATTTATCAGTTATTTAAACTTAGTGTAAATAACTTTAACTATAAATGATATTGGAAAATATATCCTTATTATAAACTGGATCACTTTTGTATTCTTGCTCTATTGTAATTCTGTAGTTTAAAAGCTAGTCAATGAATTGCTAAATGATAAAAATACATTTTAGGAATGGAGGTGGCATATTCTTTAATTCTTAGTCTTGTAATTTTCATTAACATGTTTTCTGGAGTGAAATAAGGATTTACTTTATGGCAATTAAAAAAGCATGTGTAACTTCCTTACTGAGAACCGCCTTGCCTTTGGCAGCTATGTGATGGACCCATCGTTGCCTTTACAGTGCTACATAATGTAAACTGTAACCATGGACTTATATATGTCACGTCACAGGTTAGAGATTCTTAGTGCCTTTTGCTGTGACGGAGTGTATGATTGTTAGCATTGATTGGTGTTTTCATTTTCTAGGGCGTTTTAAAGATTTGCCAACTTCCATCTACATCAAGCTATGATAATTATTGGCCGGTACAAAAGGTATGTTTTCCCTTCCCTCTTCTTATTTTATTTTTATTTTCGTTAACTTTAGCTTGTGCACAATGTTTAACTTCTGCTCAGGTTCCATTGAAAGGAACTCCACACCAGGTCACCTACTTCCACGAGAAGAATCTGTACCCCGTCATAATTTCAGCACCTGTGAGTAGTTCTATATTTTCTTTCTTTCAACTTTTTTTTTCTTGCATATGCTCACACTTCAACTCATGACTTGCTGTGTTCTGTGGATGACCTATCGTGTACAGAGTTGGAGCTGAGAGTACAATTTTCTTTCATTATCTTTATTTTTGTTTGCATATGAACTAATGACCTACTGCATAATAAAGCCTTTAATATTTGATAAGTTTGTACGTCATTTTCTCCCGATCTTCCATTATTGAATGTTTGTCACAAAGTTTGCACATTATGATATAACCTCTCGCTCATCATATATATTTCATATCTAGAATTTTACCTTTGTGGTAGGAACAAGGGCTTCATTTACTAGAAATCCATGGGAATAGTAATATTATTCTTTGAAGTTGTTTACTAGAAATTGTCA
AGAGTTGGAAATTGGAATTAGAAACCTATTGATATTGTTTATTATAAATTGTACGATTAATAGTCTTTTATCAGAACAATTATTAATATAAAATATGTTAAATCACCACTCAACCGGAAAGCTTAAGCAAATGAGTTATGGTAAACTTAATTATATCAACACTCTCCCTTACTCATGGACTTGGACATTTGTAAAAGGTCTAATAATTGGAAATCAATATTAATTGGGAGTAAATGAGTTTACAGGGGTTCAAATACAAGTCCTTTTATTCTAATACCACTCAATCCAAAAGTTTAAGCTAATGAGTTTTAGTAAATTTTATTATATCAACTAAATATTGTTAAATTTCAGTTTTATAATATTTAATAAGAATGAATTGTCTTTTAACAATTAAATTGATATGAATTAAGTAAAATAGATTGTCTTGGGGGACATCTACCAGTAATAATTAAGTAAAATAGATTGTCTTGGGCGTTTTGGATGAGATTCTCTTGGCTTTCCTTTATGCATTTTGCTTTTTGTTGTTTCTTTTTGGATGTTCTTTTTATATTCTCATTCTGGTCTTTCGGATTCTTGTAGGCTGTTTGCTTTGTTTTTCTGCTTCTCTTTCTTAGTCTTGTTTTGTTTTCTTTCGAATTTGTCTGTTTCGTGAGTATTGGTGGGTTTTTCTTGTTCTCCCTTGTGGAGTTGTATTTTGAGCATTAGTCTCGTTTCATTTTTTTAATGAAATCTTTTGTTTCCTTGTAAAAAAAATGATATAATGGATAAGCTAAATAGATTGTAATATTTTGTACTAAAATCACATGAACTAAACAAAACTTAGTATATTCTTTTATTTGAAGAAAAAATTAGTATAAATTTTATTAGAAGAAAAAAAATATTAATTGAAAAGTTAATCCCTATTTTTACCTTGTAACAATATTCTATTTTTAAAATTTAAATGATAAACAATATTAAATGATATTAAATAGTTAGCCGATATTCTAAAGAAAAGAGTAGTACAATTTTTTTTTTTTATTAGAAGAAAAATTATTGGTAGTGCAAACAAGATAAGAATAAAAGTTGCAAAAAAATTAATAAAAATGTCAGGAGTGTTAACAAAGTCCCACATTGTCTAGAGAAGATCTATGATATATAAGTGAAGACAATTATCTCCATTGGACAATATCATACCATTGTGGAGATAAGTAGAGGTATGTTGTCCTTATCAATATTGTTAGAAGTGCTAAAAAAAAAGACCATCGGTCTTACTAATAGATGGGAAAGAAAAAGTAAAAATAAGTTGTAAAGTAATTAAAGAAAAAGTTGTGGAGCTTATGGAGTAAGGAATGAGAATAACAACATTAACGAGGTTGTTGGAGAGGAATGAGATTAATCTTTTTGAGGAGAGTGATTCCCAAATTTGCACATTCTTTCTTGGAGCTTATGGAGTAAGGAATGAGAATAACAACATTAACGAGGTTGTTGGAGAGGAATGAGATTAATCTTTTTGAGGAGAGTGATTCCCAAATTTGCACATTCTTTCTTGTTCTTATGTAATAAGTATTCATTTTTCTATCCCTGGTGGTACTTTCCAACTAGTGCCCATTAAAGTTTGAAGGAACATGTCGTAGAGAATAAGAGGTTTACGTAGATATACCAGCAGCAAGTCATAGGAAGGGTATTGTTACCACTAGCCATGATCAAGAGAAGTTTGAAGCACAAGAAAATATCTAGCTCTTGAGTTTTAGTTAAAGTCTTGAGGAAGTATACATGGGATTGATCTGTCCATTCCCATAATTTATCTTCTCCATCACAAAAAAAAAAGAAGAGGAATACATATTCAACCAGTTGTAGGTTAAAGCCTTGAGAAAGTTGACATGGTTGGGAAATTTTCTTTGAGGAAGATATGGCTGTAAGGGCCTTCGCAAACATATTTAACCAGTTCAATTGTTGACAGGAGAGAAAAAAAAAGACGTATTGCTACATCGAGTGCTGAGGCAGAGTTTTAGAGTAAAGAG
ATGGAATCTGTGAATAATTGGGTGATTCTTAGAAGCTCTGGGGATTTGAAGGTTCAAAATAATAAGCTGATGAGTCTAGATTGTGACGAAACAATGATCAATCAAACCTTGTAGATATAGGTTAATTTTTTATTTATAAAAACAATGGAGAAAGTGTAGGTGAAGGTACTTCGTTTCCCCACTATCTGCCGCATTCAAACTTCTATAAGGCTACAAGAATGGTAAGGGTGTGTTAAATCACCAATCAAAACTGATTTTGATTGGTTATAGTACATTTTATTATATCAATACTCTAACACTCGCTTTTACTTGTGGGCTTGAAATTTGTAGAAGGCTCAAGAAGTGGAAATCAATATTAATTGGGGAGGAAATGACATTTCTAGGGTTTGAACTCAAAACTTCCTGTTTTAATATCATGTTAAAATTATCAATCAACCGAAATACTTAAACTGATAGGTTACTGCAAATTTAATTATATCAATACTCTAACAAGATGTCTAATATTACAAATGCTCGAGGGGAGTAATGGAGAAGCAAACAAATTTAATGCCATTGTATTCAAATATTAACTCTTGCATGCTGACATCCTTGATTATTTTCTTTGTGCTTCTTCATATCTCTTATTCCTGTTTGTATTAGTCGTAAATTGCAAATAATGGTAGTTGTAATGTTGTTTTTCATTTTCTGTTTAAGGTGGAATTCTTCCTATTAGTGAATAAGATAATCGTCATTTGGGAAATAGAGAATTTTCCTTTCCAGTACTATTAATGTTGACTCTTACCCCAACAATTACCTTTCTCAACTAATGACTTGCAAGCTATGGAGGTTAATTTAATTATTATATATTTTTCTTTCCTATCGGAGTGCAAACTTATGTTGTAACAGTGATGCTGCATATCTTCTGCAGGTTCATAAGCCGTTGAATCAAGTGCTTTCATCAATGGTTGATCAAGATGTTGGTCAAGTTGAGAATCATAACTTGAGTGCCGATGAGCTGCAGCAAACTTACTCTGTGGAAGAGTTTGAGATTCGGATTTTGGAACCAGAAAAATCTGGTGGCCCTTGGCAAACTAGGGCTACAATCGCTATGCACAGTTCCGAAAATGCTCTTACCATTCGCGTGGTTACACTGTTGGTCAGTTCATTTTCCCTCCCCCCCCCCTTCCCTTTTTAGTTATTCCCGATGATCCTTTATACCACAAAATTCTAGCTCTTTCAGTTGGGGTTCTCTTTGACAAGGATAGGGTGTGATGAATTGTTAATAGTTGAAGAAAAATGAAAAAGAGTATTATACTTTTATATATATATTGAAGGTTATTCAAGGCTCTTTGATATAGGAGAAAAATCATTTATAAATAAGAAAAAGCCAAAGAAAGGGAGTATTATACTTTTATATATATATTGAAGGTTATTCAAGGCTCTTTGATATAGGAGAAAAATCATTTATAAATAAGGAAAAGACAAAAGAAAAGGTAGCATAATTATTTTCATAAATATGGACCGCTACTGTTTATTAACTATTGTTATAAAAAGTTAACCCACAGACAATAAATCAATTGGGAGAGATAAGGAAAAATGCTTTCTCCTCTCTCCTCTCTCCTACTCCTAACCTTCATCCCTCAGACTTCAACCAGTTCGTCTTCTCCAGAATCTGCCTCAAGCTAAGTATGTTCATCTGATTTACTCCTTAAGGTTAGTTTTCCGATTGCTTCTTTTTAGAAATATGGAATTAACCAGTTTTTGTATCAACCATGAGTATTTTTACATATGGTTTGAGAATGTTAGTTTCCTCATAGAAGATATTGAAAGTAAGTTGGTAATCCCTCTTTCACTCTCTATTTGTGCTGGTTTGAAAGCTTTTTGGTGGAACTGTTACATCATCCGCTGCACTCGCTTTAGCTTTCAAACCAGCACTAATGAGACAGTGAAAGAGGGATTGCCAACTTACTTTTGATATCTTCTATGAGGCCCGGTGCATTCACATTTCTTTGAGCAAGATA
GAGATGAAAACGGTACTATTCGTTTATTAAAAGAGGTTGACTCATATTTTGGGGTATGATTAAAGACTTCCTCAGGAGGCACGAGGAAAACACTCAATGTTCTAAGTGCATGTCCAAGTCTCCTAGATGGGACTGATTTTGGTAAGAGAAGTTTTGTGGGGTAGTAAAAGGAAATCCAAGAACAAAAAATTTGACCTCTGACTCTGCTCTGAGGTAACAAAATCTACTCATTCATATTGGGTAAGAAAAGATCAAGACATGGTGGACATGAACTTCACTGAAGTCTTGGTTTTATCCAGGCTGTTTGTTCATGACAATTGGAAGGAAATTAAGAAGGTATTAGAGAATCATTTTAAGACCACTTTTCTAATTAATCCTTTTACGGCAGACAGAGCGTTGATTAGAATGGAGGATGGTACAAATCTTCACAATGTAGTTTCATTTGAAAAATGGAAGCTCATAGGTTATTATCATTTGAAAAATAGTAGCTGCAAATTGCGTAATTAAATGGTGGTGGCTAGGGTTTTGGTGGCTGCTAGGGTTCTAATGGTAGCTAGGGGTTTTGTGTTCAGTTAAAGAAAAACAAAAGGAAATGATATTTCCTGTATGTTCAAAGGTAAAATCCTCTTATTTAAGAGGATAACCATTGCGGATGTGAAAAGGACTAAGAATAAGAATAACATAACTATTATCATAAATATTGGCAGTTTAAAAAAGAAAAATTAAACGTTAGTGGAGACTAGTATTTCTCTAGTAGACTTGTTTCTTTATCGTTTCATGGAGTTTCTGACTTGAAAGAAACAATGGAATTTTTAGATGGGTGGAGAGGTTTTGGGAAGTGACCTAGGCTATCATTAGATTTAACACCATTTTATGGGTGTTTGTTTCAAAAGCTTTTTGTAGTTAACAGCTAGGTGTTATTCCTTCGGATTGGAGTCATTTTTTTTTTGGTAGATTAAGACTCCTTTTGTCGGGCTGTTATTTTCTATACTCTGGTATATTCTTTCATTTTTCTCACTGAAATGTTGCTTCCTTATAAAAGGAAAAAAAACATTAATGAAGACTAGTTCTTTGCTTCTTGATGTAGCTTTTTTAGACCTCCTTTTTATCAACATTAGAGAGTGTCGGGAGGTCTACTTAACGTGTAACTTTGTGGGGCCTAAGAATCTTAGTTTGTACAGAGGTTGACATGTGAATTATTTTTAGATATTTAGGGAAGATAGACTTGGATGAGAGATATGTTTTGTTTAGCTTGTATTAGGTCAGTTAGGGTAGGGAGCTTTGGAATTTCCCTTAGTAAGTACCTTCACTATAAATCAATGAATTGGTTTCTATAACTTTCCATATCTGTCTTTTAGTTTTATAAAAAAAGGTTGACATTGTAGCATTTCTTTTGTAGAACACAACCACAAAGGAGAATGAAACACTTTTAGCAGTTGGAACTGCATATGTGCAAGGGGAGGATGTTGCTGCAAGAGGAAGAGTGCTTTTATTTTCTGTTGGAAAAGATGTTGATAATTCACAGACCTTGGTATATTTATGAATTTTGTTTCTGAACTTTTTTTAATCTTATTATTATCTATGTTTTATTACATGAATTATTAATCTTTGTTCTCCTTTCCTGTTGGTTAATGTAGATAAAGAAATTTCGAAGGTTGCTAATCTAAAAATTTGCAAGTTCCAACTTCAAAAAATTCTTTCAGTGTTTGCTCATTCATATTTCAACCTTTGGTTCTCATCTTCTTTCAGGTTTCAGAGGTTTATTCGAAAGAATTGAAGGGTGCTATTTCTGCTTTAGCCTCTCTGCAAGGTCATCTATTGATAGCTTCTGGTCCTAAAATAATATTACACAAATGGACTGGTGCGGAGTTAAATGGTATTGCGTTCTATGACGTTCCACCCTTATATGTTGTGAGCTTGAACATTGTACGTTCTACCTCCTCACCACAATTTTTTTTCTTGGATATGATATTTACATGTTCAACCCACAATTTTTTTTC
TTGGATATGATATTTACATGTTCAACCTAGGCAGACGATCTTGAGTGTTGCTATTAATGTTTAAATTAGGGGTGATCATTAGTCTGCCGGCGTTAGTTTTGGGCTCATGTCGACACTGGCCACGGATATGTTGCTTTTGATTGGTCGGTGGTTGTTAGTTTTGGGAGTTATTACGGAATTGATTGACTGGCTACATGCAAAAATTAGTCGGGTTCAAGAATTCGCCAGAGCTAGGGTGTGTTTGTTTGTGGCGATATAGGCTCCAGCGAAGAGGGTTAATAAAAAAAGATTAGAGAGAGAAAATGAGACAGAGAGAAGAAGAGAGAGAGAAGGAAAAAGAAGGGGAGGAAAAAACAAAAAAAAAAGTGGCCCGCAGTGCTACAATGGCAATGGCAGCCCTAGACTCCCTTTGATGAGAGGCGGTTCTGATAAAAAGAAAAAAGAAGAGAGAGAAGATAGGATGAGCTAGAGAAAGGAGAAGAGGAGAAAGGTAAAAGAGAGAAGGAAGATGGAGGGGGAGAAGACGAAGTGAGAAAATAGGTAATCATTACCCAAAATTCAGCCCTGGTTTTGAAAACCAGCTGCAGTTTCTTCTCAAAACTGCACGGAGGACAGGGGCATGCTATTTATTTATTTTTTTAAGAGATAGATGTCGATTAGTTTGAGCCTTCTGAGGCTCACCAACCGACCAACTGGTTGGTTTTGCTATGTGCAAAACTGTCTCCGACCAACCCATATCCTTAAACAAACCGACTGAAGTTGGTTCGGTCGGTTTCGGTTGATCGGCTCTGTTTTTCCGTCTATGTTACTCACCCCTAGTTCAAATTTATTTCTAATGCTAATGCGTTTTTGGTCCATGATTTTTGAATGGGGGAAAAATCTATGACTCACAGACTTGAGCTTTGGATCTTGTCTTTATGGTTATTAACACTCAGAAAAATTCAATCTATCATGCAAGAATATATTCATTGGAACTATTTCTCAGTAGTCTCTTAAATAATGGATGTAGATGCAGAAAAAGGGAAGAAATAAATCTAGAAAATATGAAAATTTTTACTACCCACGTGGATGAAACTCCAGTCTTACATATGAACGGGAACTGCTACAGGTCAAGAATTTCATACTTCTTGGTGATATACACAAGAGCATTTACTTTCTGAGTTGGAAAGAACAGGGAGCTCAACTTAGCTTGTTGGCGAAGGATTTTGGTTCTCTAGATTGCTATGCAACAGAATTTCTGATTGATGGAAGTACTCTTAGTCTTACTGTTTCTGATGATCAAAAGAATATTCAGGTAAGCACCTTATTTTAATCTTGCACTTCCTTTAGTTAGTTCCTAGGTGTAAACTAACCTTCAATAAAGTGACATTCTTTTTAAATATTTGTGAGTAAATATTATCATGGTGGTATCCATGACCATGAGATATATATGTCAAATTGTTTGTTTACTGGAATATTTTATATATGCCACAACAGATATTTTATTATGCACCAAAGTCGACGGAGAGTTGGAAAGGGCAGAAGCTTCTATCAAGAGCTGAATTTCATGTGGGTGCTCATGTGACGAAGTTTCTACGGCTACAGATGTTGTCTACCACCTCAGACAGAGCAAGTACTACAGTTTCTGATAAGACCAATCGCTTCGCTTTGTTATTTGGCACCCTTGATGGAAGTATTGGTTGTATTGCACCTCTTGATGAACTCACATTTCGTAGACTACAGTCATTACAAAAGAAGCTTGTTGATGCCGTTCCGCATGTCGGTGGTTTAAACCCAAGATCTTTTCGCCAGTTTCTTTCAAATGGAAAGGTTCATCGACGTGGCCCAGACAGCATAGTTGATGGTGAATTACTATGCCAGTAAGTTTTCTTCCAAACTTTTTTAATTGACTCAATATGTTAGTTGATTTGGACGGTAGAATAACAGGACTAAGTTTAGATTGGAATTATATGTCAGTATTAATTATCTATCTTTAGGTTGATAACTGTGTTTCCTTTTTT
TTAAAAAAATTATTTCTTATTTATAGATAGCCTTCTTGTTCTTGATATAAACAAGACCCTAGCGAATTGACTTGAGCAATCGTTTGGTTATATGGACCTCTCAATCACAAACACTAACACCTTTCATAGACACACAAGGAACTATAGAGAATTGGCTTGAGAAGCTTCACATTTGGTGTGTCGATGGAATAAACATAATCACACCCATGAAAGTAGTTGAAAAATATATGGAAAGCCTGAAAATGGAGGAGAGTAATAAGAAAGGAAATAAATTGACCAATAGTCATGTAATGCCAATGTAGTTAAGTCAAGCTTATTCAATGAGAAGCAAATTGAAACTCAAAAGACCTTGGAATTAGAAGAGTAAAATATAGTCTCAACTTTTGTTGCCGCTTGCTTGTATTCTGTGATCAATAACTCATAATTATGTTTCCATTTGTGGCAGCTATGAGATGCTACCGTTGGAAGAGCAGCTTGATATTGCTCACCAAATTGGGACAACTCGTTCGCAAATTCTCTCAAACTTAAATGACCTCTCTCTAGGAACGAGTTTCTTATAATTGTACGAGACTTATCGATTGTATAGCCGTGTATTACAATCAAGTTGCATATTTATTGAAGAATATGAACCAAATCATGTACTGCCCTGTCTAATTTATCAACGATTGTGATGGTTAAAGCCTTAGCCTTTTTAGACTTTTTGTTCATTACTAAGTTACAATGAGTGAGACTGAAAAGGGATAGAGAGAGAGAGAGAGAGCTAAGTAGAGCACACCTTTCAAGTCATATGTGTAGCAATAGCTTGTGACAAGAATGTTAGGTTTTGATACTTCGATGCCATGGCATCTTCTTTGAATTTTTGCTGATCATTATAATATTTAATATTTTATATGTACTAGAGGC

mRNA sequence

AAGCCCTCCCGCCAAACCTCCCCAAAGCCCTAGTCTGCTGGTCCGTTTTCCCTCCACTCATTCTCATTGTCTGCAACTCCATGTTCCGTTGCCACCTCCAACTCCGTCTCTAAACCCTCAAATTCTCGAAGCCATAACTTCCTCTCAAGCTCCATAGTCTTACTCATAATATATCCAATTTCGTTGTGAACATTTTACTTTCCACTACCGGAGGATGAGTTTTGCCGCCTATAGAATGATGCACTGGCCTACGGGCATCGAGAACTGTGATTCAGGCTTCATCACCCATTCTCGCGCCGACTTCGTACCCGGTGTTACATCTCACACCGACGAACTCGAGTCCGACTGGCCGGCCCGCCGAGAAATGGGTCCAGTTCCTAATCTCGTTGTCACCGCCGGTAATGTCCTGGAGGTATATGTTGTTAGGGTTCAAGAAGAGGGTGGCAGTGAATCAAGAAGCTCAGGAGAAGTCAGGCGCGGTGGTATTATGGACGGAGTCTCTGGGGCCTCGCTCGAGCTTGTTTGCCACTATAGGTTGCACGGTAATGTTGAGTCCATGGCAATTTTGTCTAGTAGAGGAGGAGATGGTTCCAAGAAGAGAGATTCAATTATATTAGTCTTCCAAGAAGCAAAAATTTCAGTGCTAGAGTTTGATGATTCTATCCATAGTCTCCGTACAAGCTCAATGCATTGCTTTGAGGGTCCACAATGGCTTCATTTGAAAAGAGGTCGAGAATCATTTGCAAGAGGTCCAGTGGTAAAGGTTGATCCTCAAGGCAGGTGTGGAGGAGTTCTTGTTTATGGTTTGCAAATGATAATACTTAAGGCTTCTCAGGCTGGATCTGGTTTGGTTGTGGACGATGAAGCTTTCGGTAACATAGGTGCAATTTCTGCTCGAGTTGAATCGTCATACCTAATCAACCTAAGGGATTTGGATGTCAAGCATGTAAAGGATTTTGTTTTTGTACATGGTTACATTGAACCTGTGATGGTGATTCTTCATGAGCAGGAGCTTACTTGGGCCGGCCGTGTTTCATGGAAGCATCATACGTGTATGATTTCTGCACTAAGTATTAGCACAACCTTGAAGCAGCATCCTCTAATATGGTCTGCCAACAACCTCCCTCATGATGCTTACAAGCTTCTTGCAGTGCCATCACCAATTGGTGGTGTACTTGTTGTCAGTGCAAATAGTATTCATTATCATAGTCAGTCAGCTTCATGCATGTTGGCTTTGAATAATTATGCTGTTTCTGCTGATAGCAGTCAAGATATGCCTAGATCAAATTTCAACGTGGAATTGGATGCTGCCAATGCTACGTGGTTGCTAAATGATGTGGCCTTGCTGTCAACCAAGACTGGGGAGCTATTACTGCTGGCACTTGTCTATGATGGACGGGTTGTGCAGAGACTTGATCTTTCAAAGTCTAAAGCTTCAGTACTTACATCGGGGATTGCATCAATTGGAAATTCATTATTTTTTCTGGGCAGTCGATTGGGAGATAGTTTGCTTGTGCAGTTTAGTAGTGGAGTGGGATCCTCAGGATTGGCATCTAGTCTAAAAGATGAGGTTGGAGATATTGAAGTTGATGCTCCTACAGCAAAGCGAATGCGTAGATCATCTTCTGATGCTCTACAAGATATGGTTGGAGGAGATGAGCTGTCATTGTATGGTTCAGCTCCAAATAATGCGGAATCTGCTCAGAAAAATTTTTCTTTTGCTGTTAGAGATTCATTGATCAATATTGGCCCTCTGAAGGATTTCTCCTACGGTTTAAGAATTAATGCAGATGCTAATGCGACTGGAATTGCCAAACAAAGCAATTATGAACTTGTTTGTTGTTCGGGTAATGGAAAAAATGGTGCATTATGCATTCTTCGGCAGTCAATTCGCCCTGAAATGATTACAGAGGTTGAACTCCCAGGTTGTAAAGGCATTTGGACTGTTTACCACAAAAATACTCGTGGTAGTAGTGCTGATTCTTCGAGAATGGTTCCAG
ATGATGATGAATATCATGCATATTTGATTATAAGCCTTGAGGCTCGCACAATGGTACTTGAAACTGGAGATCTCCTAACAGAAGTCACTGAGAGTGTTGACTACTTTGTGCAAGGAAGAACAATTGCGGCAGGCAACTTGTTTGGAAGGCGTCGAGTTATCCAGGTCTATGAAAGTGGTGCACGAATTTTGGATGGATCTTTTATGACTCAAGATTTGAACATGGTAGTCACATGCAATGAATCTGGTAATGGTTCTGAAGGTTGTACTGTGTTGTCTGCATCTATTAGTGATCCATATGTCTTGCTGACTATGACGGATGGGAGTATTCGATTACTTGTTGGAGATCCTTCTTCTTGCTCTGTTTCTGTATCTACACCAGCCGCCTTTGGGAGTTCAAAAAAATGCGTATCTTGTTGTACTCTTTATCATGACAAGGGCATTGAGCCTTGGCTTCGGATGACAAGTACAGATGCATGGCTTTCTACAGGAGTTGGTGAAACAATTGATGGTACTGATGGCTCACTCCAAGATCAGGGTGACATATATTGTGTTGCTTGTTACGATAGTGGGGACCTTGAAATATTTGACGTGCCGAATTTTGTTAGCGTTTTCTATGTGGACAAATTTGTTTCTGGAAAATCACATTTAGTTGATTTTCAAATGTCGGACTTGCAGAAAAATTCTGAGAAGTTGGATCGAAATTCTCAGGAATTGAATAGCCATGGTAGGAATGAGAGTTCCCAAAATATGAAGGTAATTGAGGTAGCCATGCAGAGGTGGTCAGGGAAGCATAGTCGCCCATTTCTTTTTGGAATATTGACCGACGGGACAATTCTTTGTTACCATGCTTATTTATTTGAAAGTACAGACAGTGCCTCTAAAATTGATGATTCGGTTTCCATGGAAAATTCTGTTAGCTCAAGCAATATGAGTTCTTCTAGATTAAGAAATTTGAGATTTCTTCGTGTCCCCTTGGACATACAAGGAAGGGATGATATGCCAAATGGAACCTTGTCTCGTAGATTATCTATTTTCAAGAATATTTCCGGTTATCAGGGGCTATTTCTCTGCGGGTCAAGACCTGCTTGGTTTATGGTGTTTAGAGAACGGCTTCGAGTTCACCCTCAGCTATGTGATGGACCCATCGTTGCCTTTACAGTGCTACATAATGTAAACTGTAACCATGGACTTATATATGTCACGTCACAGGGCGTTTTAAAGATTTGCCAACTTCCATCTACATCAAGCTATGATAATTATTGGCCGGTACAAAAGGTTCCATTGAAAGGAACTCCACACCAGGTCACCTACTTCCACGAGAAGAATCTGTACCCCGTCATAATTTCAGCACCTGTTCATAAGCCGTTGAATCAAGTGCTTTCATCAATGGTTGATCAAGATGTTGGTCAAGTTGAGAATCATAACTTGAGTGCCGATGAGCTGCAGCAAACTTACTCTGTGGAAGAGTTTGAGATTCGGATTTTGGAACCAGAAAAATCTGGTGGCCCTTGGCAAACTAGGGCTACAATCGCTATGCACAGTTCCGAAAATGCTCTTACCATTCGCGTGGTTACACTGTTGAACACAACCACAAAGGAGAATGAAACACTTTTAGCAGTTGGAACTGCATATGTGCAAGGGGAGGATGTTGCTGCAAGAGGAAGAGTGCTTTTATTTTCTGTTGGAAAAGATGTTGATAATTCACAGACCTTGGTTTCAGAGGTTTATTCGAAAGAATTGAAGGGTGCTATTTCTGCTTTAGCCTCTCTGCAAGGTCATCTATTGATAGCTTCTGGTCCTAAAATAATATTACACAAATGGACTGGTGCGGAGTTAAATGGTATTGCGTTCTATGACGTTCCACCCTTATATGTTGTGAGCTTGAACATTGTCAAGAATTTCATACTTCTTGGTGATATACACAAGAGCATTTACTTTCTGAGTTGGAAAGAACAGGGAGCTCAACTTAGCTTGTTGGCGAAGGATTTTGGTTCTCTA
GATTGCTATGCAACAGAATTTCTGATTGATGGAAGTACTCTTAGTCTTACTGTTTCTGATGATCAAAAGAATATTCAGATATTTTATTATGCACCAAAGTCGACGGAGAGTTGGAAAGGGCAGAAGCTTCTATCAAGAGCTGAATTTCATGTGGGTGCTCATGTGACGAAGTTTCTACGGCTACAGATGTTGTCTACCACCTCAGACAGAGCAAGTACTACAGTTTCTGATAAGACCAATCGCTTCGCTTTGTTATTTGGCACCCTTGATGGAAGTATTGGTTGTATTGCACCTCTTGATGAACTCACATTTCGTAGACTACAGTCATTACAAAAGAAGCTTGTTGATGCCGTTCCGCATGTCGGTGGTTTAAACCCAAGATCTTTTCGCCAGTTTCTTTCAAATGGAAAGGTTCATCGACGTGGCCCAGACAGCATAGTTGATGGTGAATTACTATGCCACTATGAGATGCTACCGTTGGAAGAGCAGCTTGATATTGCTCACCAAATTGGGACAACTCGTTCGCAAATTCTCTCAAACTTAAATGACCTCTCTCTAGGAACGAGTTTCTTATAATTGTACGAGACTTATCGATTGTATAGCCGTGTATTACAATCAAGTTGCATATTTATTGAAGAATATGAACCAAATCATGTACTGCCCTGTCTAATTTATCAACGATTGTGATGGTTAAAGCCTTAGCCTTTTTAGACTTTTTGTTCATTACTAAGTTACAATGAGTGAGACTGAAAAGGGATAGAGAGAGAGAGAGAGAGCTAAGTAGAGCACACCTTTCAAGTCATATGTGTAGCAATAGCTTGTGACAAGAATGTTAGGTTTTGATACTTCGATGCCATGGCATCTTCTTTGAATTTTTGCTGATCATTATAATATTTAATATTTTATATGTACTAGAGGC

Coding sequence (CDS)

ATGAGTTTTGCCGCCTATAGAATGATGCACTGGCCTACGGGCATCGAGAACTGTGATTCAGGCTTCATCACCCATTCTCGCGCCGACTTCGTACCCGGTGTTACATCTCACACCGACGAACTCGAGTCCGACTGGCCGGCCCGCCGAGAAATGGGTCCAGTTCCTAATCTCGTTGTCACCGCCGGTAATGTCCTGGAGGTATATGTTGTTAGGGTTCAAGAAGAGGGTGGCAGTGAATCAAGAAGCTCAGGAGAAGTCAGGCGCGGTGGTATTATGGACGGAGTCTCTGGGGCCTCGCTCGAGCTTGTTTGCCACTATAGGTTGCACGGTAATGTTGAGTCCATGGCAATTTTGTCTAGTAGAGGAGGAGATGGTTCCAAGAAGAGAGATTCAATTATATTAGTCTTCCAAGAAGCAAAAATTTCAGTGCTAGAGTTTGATGATTCTATCCATAGTCTCCGTACAAGCTCAATGCATTGCTTTGAGGGTCCACAATGGCTTCATTTGAAAAGAGGTCGAGAATCATTTGCAAGAGGTCCAGTGGTAAAGGTTGATCCTCAAGGCAGGTGTGGAGGAGTTCTTGTTTATGGTTTGCAAATGATAATACTTAAGGCTTCTCAGGCTGGATCTGGTTTGGTTGTGGACGATGAAGCTTTCGGTAACATAGGTGCAATTTCTGCTCGAGTTGAATCGTCATACCTAATCAACCTAAGGGATTTGGATGTCAAGCATGTAAAGGATTTTGTTTTTGTACATGGTTACATTGAACCTGTGATGGTGATTCTTCATGAGCAGGAGCTTACTTGGGCCGGCCGTGTTTCATGGAAGCATCATACGTGTATGATTTCTGCACTAAGTATTAGCACAACCTTGAAGCAGCATCCTCTAATATGGTCTGCCAACAACCTCCCTCATGATGCTTACAAGCTTCTTGCAGTGCCATCACCAATTGGTGGTGTACTTGTTGTCAGTGCAAATAGTATTCATTATCATAGTCAGTCAGCTTCATGCATGTTGGCTTTGAATAATTATGCTGTTTCTGCTGATAGCAGTCAAGATATGCCTAGATCAAATTTCAACGTGGAATTGGATGCTGCCAATGCTACGTGGTTGCTAAATGATGTGGCCTTGCTGTCAACCAAGACTGGGGAGCTATTACTGCTGGCACTTGTCTATGATGGACGGGTTGTGCAGAGACTTGATCTTTCAAAGTCTAAAGCTTCAGTACTTACATCGGGGATTGCATCAATTGGAAATTCATTATTTTTTCTGGGCAGTCGATTGGGAGATAGTTTGCTTGTGCAGTTTAGTAGTGGAGTGGGATCCTCAGGATTGGCATCTAGTCTAAAAGATGAGGTTGGAGATATTGAAGTTGATGCTCCTACAGCAAAGCGAATGCGTAGATCATCTTCTGATGCTCTACAAGATATGGTTGGAGGAGATGAGCTGTCATTGTATGGTTCAGCTCCAAATAATGCGGAATCTGCTCAGAAAAATTTTTCTTTTGCTGTTAGAGATTCATTGATCAATATTGGCCCTCTGAAGGATTTCTCCTACGGTTTAAGAATTAATGCAGATGCTAATGCGACTGGAATTGCCAAACAAAGCAATTATGAACTTGTTTGTTGTTCGGGTAATGGAAAAAATGGTGCATTATGCATTCTTCGGCAGTCAATTCGCCCTGAAATGATTACAGAGGTTGAACTCCCAGGTTGTAAAGGCATTTGGACTGTTTACCACAAAAATACTCGTGGTAGTAGTGCTGATTCTTCGAGAATGGTTCCAGATGATGATGAATATCATGCATATTTGATTATAAGCCTTGAGGCTCGCACAATGGTACTTGAAACTGGAGATCTCCTAACAGAAGTCACTGAGAGTGTTGACTACTTTGTGCAAGGAAGAACAATTGCGGCAGGCAACTTGTTTGGAAGGCGTCGAGTTATCCAGGTCTATGAAAGTGGTGCACGAATTTTGGATGGATCTTTTATGACTCAAGA
TTTGAACATGGTAGTCACATGCAATGAATCTGGTAATGGTTCTGAAGGTTGTACTGTGTTGTCTGCATCTATTAGTGATCCATATGTCTTGCTGACTATGACGGATGGGAGTATTCGATTACTTGTTGGAGATCCTTCTTCTTGCTCTGTTTCTGTATCTACACCAGCCGCCTTTGGGAGTTCAAAAAAATGCGTATCTTGTTGTACTCTTTATCATGACAAGGGCATTGAGCCTTGGCTTCGGATGACAAGTACAGATGCATGGCTTTCTACAGGAGTTGGTGAAACAATTGATGGTACTGATGGCTCACTCCAAGATCAGGGTGACATATATTGTGTTGCTTGTTACGATAGTGGGGACCTTGAAATATTTGACGTGCCGAATTTTGTTAGCGTTTTCTATGTGGACAAATTTGTTTCTGGAAAATCACATTTAGTTGATTTTCAAATGTCGGACTTGCAGAAAAATTCTGAGAAGTTGGATCGAAATTCTCAGGAATTGAATAGCCATGGTAGGAATGAGAGTTCCCAAAATATGAAGGTAATTGAGGTAGCCATGCAGAGGTGGTCAGGGAAGCATAGTCGCCCATTTCTTTTTGGAATATTGACCGACGGGACAATTCTTTGTTACCATGCTTATTTATTTGAAAGTACAGACAGTGCCTCTAAAATTGATGATTCGGTTTCCATGGAAAATTCTGTTAGCTCAAGCAATATGAGTTCTTCTAGATTAAGAAATTTGAGATTTCTTCGTGTCCCCTTGGACATACAAGGAAGGGATGATATGCCAAATGGAACCTTGTCTCGTAGATTATCTATTTTCAAGAATATTTCCGGTTATCAGGGGCTATTTCTCTGCGGGTCAAGACCTGCTTGGTTTATGGTGTTTAGAGAACGGCTTCGAGTTCACCCTCAGCTATGTGATGGACCCATCGTTGCCTTTACAGTGCTACATAATGTAAACTGTAACCATGGACTTATATATGTCACGTCACAGGGCGTTTTAAAGATTTGCCAACTTCCATCTACATCAAGCTATGATAATTATTGGCCGGTACAAAAGGTTCCATTGAAAGGAACTCCACACCAGGTCACCTACTTCCACGAGAAGAATCTGTACCCCGTCATAATTTCAGCACCTGTTCATAAGCCGTTGAATCAAGTGCTTTCATCAATGGTTGATCAAGATGTTGGTCAAGTTGAGAATCATAACTTGAGTGCCGATGAGCTGCAGCAAACTTACTCTGTGGAAGAGTTTGAGATTCGGATTTTGGAACCAGAAAAATCTGGTGGCCCTTGGCAAACTAGGGCTACAATCGCTATGCACAGTTCCGAAAATGCTCTTACCATTCGCGTGGTTACACTGTTGAACACAACCACAAAGGAGAATGAAACACTTTTAGCAGTTGGAACTGCATATGTGCAAGGGGAGGATGTTGCTGCAAGAGGAAGAGTGCTTTTATTTTCTGTTGGAAAAGATGTTGATAATTCACAGACCTTGGTTTCAGAGGTTTATTCGAAAGAATTGAAGGGTGCTATTTCTGCTTTAGCCTCTCTGCAAGGTCATCTATTGATAGCTTCTGGTCCTAAAATAATATTACACAAATGGACTGGTGCGGAGTTAAATGGTATTGCGTTCTATGACGTTCCACCCTTATATGTTGTGAGCTTGAACATTGTCAAGAATTTCATACTTCTTGGTGATATACACAAGAGCATTTACTTTCTGAGTTGGAAAGAACAGGGAGCTCAACTTAGCTTGTTGGCGAAGGATTTTGGTTCTCTAGATTGCTATGCAACAGAATTTCTGATTGATGGAAGTACTCTTAGTCTTACTGTTTCTGATGATCAAAAGAATATTCAGATATTTTATTATGCACCAAAGTCGACGGAGAGTTGGAAAGGGCAGAAGCTTCTATCAAGAGCTGAATTTCATGTGGGTGCTCATGTGACGAAGTTTCTACGGCTACAGATGTTGTCTACCACCTCAGACAGAGCAA
GTACTACAGTTTCTGATAAGACCAATCGCTTCGCTTTGTTATTTGGCACCCTTGATGGAAGTATTGGTTGTATTGCACCTCTTGATGAACTCACATTTCGTAGACTACAGTCATTACAAAAGAAGCTTGTTGATGCCGTTCCGCATGTCGGTGGTTTAAACCCAAGATCTTTTCGCCAGTTTCTTTCAAATGGAAAGGTTCATCGACGTGGCCCAGACAGCATAGTTGATGGTGAATTACTATGCCACTATGAGATGCTACCGTTGGAAGAGCAGCTTGATATTGCTCACCAAATTGGGACAACTCGTTCGCAAATTCTCTCAAACTTAAATGACCTCTCTCTAGGAACGAGTTTCTTATAA

Protein sequence

MSFAAYRMMHWPTGIENCDSGFITHSRADFVPGVTSHTDELESDWPARREMGPVPNLVVTAGNVLEVYVVRVQEEGGSESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSSRGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGPVVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNIGAISARVESSYLINLRDLDVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSANNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCMLALNNYAVSADSSQDMPRSNFNVELDAANATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNSLFFLGSRLGDSLLVQFSSGVGSSGLASSLKDEVGDIEVDAPTAKRMRRSSSDALQDMVGGDELSLYGSAPNNAESAQKNFSFAVRDSLINIGPLKDFSYGLRINADANATGIAKQSNYELVCCSGNGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSSADSSRMVPDDDEYHAYLIISLEARTMVLETGDLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVYESGARILDGSFMTQDLNMVVTCNESGNGSEGCTVLSASISDPYVLLTMTDGSIRLLVGDPSSCSVSVSTPAAFGSSKKCVSCCTLYHDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCVACYDSGDLEIFDVPNFVSVFYVDKFVSGKSHLVDFQMSDLQKNSEKLDRNSQELNSHGRNESSQNMKVIEVAMQRWSGKHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSMENSVSSSNMSSSRLRNLRFLRVPLDIQGRDDMPNGTLSRRLSIFKNISGYQGLFLCGSRPAWFMVFRERLRVHPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTSSYDNYWPVQKVPLKGTPHQVTYFHEKNLYPVIISAPVHKPLNQVLSSMVDQDVGQVENHNLSADELQQTYSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTAYVQGEDVAARGRVLLFSVGKDVDNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKFLRLQMLSTTSDRASTTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAVPHVGGLNPRSFRQFLSNGKVHRRGPDSIVDGELLCHYEMLPLEEQLDIAHQIGTTRSQILSNLNDLSLGTSFL
Homology
BLAST of Tan0008131 vs. ExPASy Swiss-Prot
Match: Q9FGR0 (Cleavage and polyadenylation specificity factor subunit 1 OS=Arabidopsis thaliana OX=3702 GN=CPSF160 PE=1 SV=2)

HSP 1 Score: 2166.0 bits (5611), Expect = 0.0e+00
Identity = 1074/1459 (73.61%), Positives = 1255/1459 (86.02%), Query Frame = 0

Query: 1    MSFAAYRMMHWPTGIENCDSGFITHSRADF---VPGVTSHTDELESDWP-ARREMGPVPN 60
            MSFAAY+MMHWPTG+ENC SG+ITHS +D    +P V+ H D++E++WP  +R +GP+PN
Sbjct: 1    MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVH-DDIEAEWPNPKRGIGPLPN 60

Query: 61   LVVTAGNVLEVYVVRVQEEGGS-ESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESM 120
            +V+TA N+LEVY+VR QEEG + E R+    +RGG+MDGV G SLELVCHYRLHGNVES+
Sbjct: 61   VVITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESI 120

Query: 121  AILSSRGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRES 180
            A+L   GG+ SK RDSIIL F++AKISVLEFDDSIHSLR +SMHCFEGP WLHLKRGRES
Sbjct: 121  AVLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRES 180

Query: 181  FARGPVVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNIGAISARVESSYLI 240
            F RGP+VKVDPQGRCGGVLVYGLQMIILK SQ GSGLV DD+AF + G +SARVESSY+I
Sbjct: 181  FPRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYII 240

Query: 241  NLRDLDVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHP 300
            NLRDL++KHVKDFVF+HGYIEPV+VIL E+E TWAGRVSWKHHTC++SALSI++TLKQHP
Sbjct: 241  NLRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHP 300

Query: 301  LIWSANNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCMLALNNYAVSADSSQDMP 360
            +IWSA NLPHDAYKLLAVPSPIGGVLV+ AN+IHYHSQSASC LALNNYA SADSSQ++P
Sbjct: 301  VIWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELP 360

Query: 361  RSNFNVELDAANATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIA 420
             SNF+VELDAA+ TW+ NDVALLSTK+GELLLL L+YDGR VQRLDLSKSKASVL S I 
Sbjct: 361  ASNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDIT 420

Query: 421  SIGNSLFFLGSRLGDSLLVQFSSGVGSSGLASSLKDEVGDIEVDAPTAKRMRRSSSDALQ 480
            S+GNSLFFLGSRLGDSLLVQFS   G +     L+DE  DIE +   AKR+ R +SD  Q
Sbjct: 421  SVGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRL-RMTSDTFQ 480

Query: 481  DMVGGDELSLYGSAPNNAESAQKNFSFAVRDSLINIGPLKDFSYGLRINADANATGIAKQ 540
            D +G +ELSL+GS PNN++SAQK+FSFAVRDSL+N+GP+KDF+YGLRINADANATG++KQ
Sbjct: 481  DTIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQ 540

Query: 541  SNYELVCCSGNGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSSADSSRMVP 600
            SNYELVCCSG+GKNGALC+LRQSIRPEMITEVELPGCKGIWTVYHK++RG +ADSS+M  
Sbjct: 541  SNYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAA 600

Query: 601  DDDEYHAYLIISLEARTMVLETGDLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVYESG 660
            D+DEYHAYLIISLEARTMVLET DLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQV+E G
Sbjct: 601  DEDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHG 660

Query: 661  ARILDGSFMTQDLNMVVTCNESGNGSEGCTVLSASISDPYVLLTMTDGSIRLLVGDPSSC 720
            ARILDGSFM Q+L+   + +ES +GSE  TV S SI+DPYVLL MTD SIRLLVGDPS+C
Sbjct: 661  ARILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTC 720

Query: 721  SVSVSTPAAFGSSKKCVSCCTLYHDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQG 780
            +VS+S+P+    SK+ +S CTLYHDKG EPWLR  STDAWLS+GVGE +D  DG  QDQG
Sbjct: 721  TVSISSPSVLEGSKRKISACTLYHDKGPEPWLRKASTDAWLSSGVGEAVDSVDGGPQDQG 780

Query: 781  DIYCVACYDSGDLEIFDVPNFVSVFYVDKFVSGKSHLVDFQMSDLQKNSEKLDRNSQELN 840
            DIYCV CY+SG LEIFDVP+F  VF VDKF SG+ HL D  + +L+    +L++NS++  
Sbjct: 781  DIYCVVCYESGALEIFDVPSFNCVFSVDKFASGRRHLSDMPIHELE---YELNKNSEDNT 840

Query: 841  SHGRNESSQNMKVIEVAMQRWSGKHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSV 900
            S   ++  +N +V+E+AMQRWSG H+RPFLF +L DGTILCYHAYLF+  DS +K ++S+
Sbjct: 841  S---SKEIKNTRVVELAMQRWSGHHTRPFLFAVLADGTILCYHAYLFDGVDS-TKAENSL 900

Query: 901  SMENSVSSSNMSSSRLRNLRFLRVPLDIQGRDDMPNGTLSRRLSIFKNISGYQGLFLCGS 960
            S EN  + ++  SS+LRNL+FLR+PLD   R+   +G  S+R+++FKNISG+QG FL GS
Sbjct: 901  SSENPAALNSSGSSKLRNLKFLRIPLDTSTREGTSDGVASQRITMFKNISGHQGFFLSGS 960

Query: 961  RPAWFMVFRERLRVHPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTSSYDN 1020
            RP W M+FRERLR H QLCDG I AFTVLHNVNCNHG IYVT+QGVLKICQLPS S YDN
Sbjct: 961  RPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPSASIYDN 1020

Query: 1021 YWPVQKVPLKGTPHQVTYFHEKNLYPVIISAPVHKPLNQVLSSMVDQDVG-QVENHNLSA 1080
            YWPVQK+PLK TPHQVTY+ EKNLYP+I+S PV KPLNQVLSS+VDQ+ G Q++NHN+S+
Sbjct: 1021 YWPVQKIPLKATPHQVTYYAEKNLYPLIVSYPVSKPLNQVLSSLVDQEAGQQLDNHNMSS 1080

Query: 1081 DELQQTYSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLL 1140
            D+LQ+TY+VEEFEI+ILEPE+SGGPW+T+A I M +SE+ALT+RVVTLLN +T ENETLL
Sbjct: 1081 DDLQRTYTVEEFEIQILEPERSGGPWETKAKIPMQTSEHALTVRVVTLLNASTGENETLL 1140

Query: 1141 AVGTAYVQGEDVAARGRVLLFSVGKDVDNSQTLVSEVYSKELKGAISALASLQGHLLIAS 1200
            AVGTAYVQGEDVAARGRVLLFS GK+ DNSQ +V+EVYS+ELKGAISA+AS+QGHLLI+S
Sbjct: 1141 AVGTAYVQGEDVAARGRVLLFSFGKNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISS 1200

Query: 1201 GPKIILHKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSL 1260
            GPKIILHKW G ELNG+AF+D PPLYVVS+N+VK+FILLGD+HKSIYFLSWKEQG+QLSL
Sbjct: 1201 GPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLSL 1260

Query: 1261 LAKDFGSLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVG 1320
            LAKDF SLDC+ATEFLIDGSTLSL VSD+QKNIQ+FYYAPK  ESWKG KLLSRAEFHVG
Sbjct: 1261 LAKDFESLDCFATEFLIDGSTLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVG 1320

Query: 1321 AHVTKFLRLQMLSTTSDRASTTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1380
            AHV+KFLRLQM+S+         +DK NRFALLFGTLDGS GCIAPLDE+TFRRLQSLQK
Sbjct: 1321 AHVSKFLRLQMVSSG--------ADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQK 1380

Query: 1381 KLVDAVPHVGGLNPRSFRQFLSNGKVHRRGPDSIVDGELLCHYEMLPLEEQLDIAHQIGT 1440
            KLVDAVPHV GLNP +FRQF S+GK  R GPDSIVD ELLCHYEMLPLEEQL++AHQIGT
Sbjct: 1381 KLVDAVPHVAGLNPLAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQIGT 1440

Query: 1441 TRSQILSNLNDLSLGTSFL 1454
            TR  IL +L DLS+GTSFL
Sbjct: 1441 TRYSILKDLVDLSVGTSFL 1442

BLAST of Tan0008131 vs. ExPASy Swiss-Prot
Match: Q7XWP1 (Probable cleavage and polyadenylation specificity factor subunit 1 OS=Oryza sativa subsp. japonica OX=39947 GN=Os04g0252200 PE=3 SV=2)

HSP 1 Score: 1875.1 bits (4856), Expect = 0.0e+00
Identity = 971/1470 (66.05%), Positives = 1149/1470 (78.16%), Query Frame = 0

Query: 1    MSFAAYRMMHWPTGIENCDSGFITHSRADFVPGVTSHT------DELESDWPAR--REMG 60
            MS+AAY+MMHWPTG+++C +GF+THS +D     T+ T       +++S   A   R +G
Sbjct: 1    MSYAAYKMMHWPTGVDHCAAGFVTHSPSDAAAFFTAATVGPGPEGDIDSAAAASRPRRLG 60

Query: 61   PVPNLVVTAGNVLEVYVVRVQ---EEGGSESRSSGEVRRGGIMDGVSGASLELVCHYRLH 120
            P PNLVV A NVLEVY VR +   E+GG  ++ S     G ++DG+SGA LELVC+YRLH
Sbjct: 61   PSPNLVVAAANVLEVYAVRAETAAEDGGGGTQPSSS--SGAVLDGISGARLELVCYYRLH 120

Query: 121  GNVESMAILSSRGGDGSK-KRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLH 180
            GN+ESM +LS    DG++ +R +I L F++AKI+ LEFDD+IH LRTSSMHCFEGP+W H
Sbjct: 121  GNIESMTVLS----DGAENRRATIALAFKDAKITCLEFDDAIHGLRTSSMHCFEGPEWQH 180

Query: 181  LKRGRESFARGPVVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNIGAISAR 240
            LKRGRESFA GPV+K DP GRCG  L YGLQMIILKA+Q G  LV +DE    + + +  
Sbjct: 181  LKRGRESFAWGPVIKADPLGRCGAALAYGLQMIILKAAQVGHSLVGEDEPTCALSSTAVC 240

Query: 241  VESSYLINLRDLDVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSIS 300
            +ESSYLI+LR LD+ HVKDF FVHGYIEPV+VILHEQE TWAGR+  KHHTCMISA SIS
Sbjct: 241  IESSYLIDLRALDMNHVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFSIS 300

Query: 301  TTLKQHPLIWSANNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCMLALNNYAVSA 360
             TLKQHP+IWSA NLPHDAY+LLAVP PI GVLV+ ANSIHYHSQS SC L LNN++   
Sbjct: 301  MTLKQHPVIWSAANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSSHP 360

Query: 361  DSSQDMPRSNFNVELDAANATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKAS 420
            D S ++ +SNF VELDAA ATWL ND+ + STK GE+LLL +VYDGRVVQRLDL KSKAS
Sbjct: 361  DGSPEISKSNFQVELDAAKATWLSNDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSKAS 420

Query: 421  VLTSGIASIGNSLFFLGSRLGDSLLVQFSSGVGSSGLASSLKDEVGDIEVDAPTAKRMRR 480
            VL+S + SIGNS FFLGSRLGDSLLVQFS     S L     +   DIE D P +KR++R
Sbjct: 421  VLSSAVTSIGNSFFFLGSRLGDSLLVQFSYCASKSVLQDLTNERSADIEGDLPFSKRLKR 480

Query: 481  SSSDALQDMVGGDELSLYG-SAPNNAESAQKNFSFAVRDSLINIGPLKDFSYGLRINADA 540
              SD LQD+   +ELS     APN+ ESAQK  S+ VRD+LIN+GPLKDFSYGLR NAD 
Sbjct: 481  IPSDVLQDVTSVEELSFQNIIAPNSLESAQK-ISYIVRDALINVGPLKDFSYGLRANADP 540

Query: 541  NATGIAKQSNYELVCCSGNGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSS 600
            NA G AKQSNYELVCCSG+GKNG+L +L+QSIRP++ITEVELP C+GIWTVY+K+ RG  
Sbjct: 541  NAMGNAKQSNYELVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRGQM 600

Query: 601  ADSSRMVPDDDEYHAYLIISLEARTMVLETGDLLTEVTESVDYFVQGRTIAAGNLFGRRR 660
            A       +D+EYHAYLIISLE RTMVLETGD L EVTE+VDYFVQ  TIAAGNLFGRRR
Sbjct: 601  A-------EDNEYHAYLIISLENRTMVLETGDDLGEVTETVDYFVQASTIAAGNLFGRRR 660

Query: 661  VIQVYESGARILDGSFMTQDLNMVVTCNESGNGSEGCTVLSASISDPYVLLTMTDGSIRL 720
            VIQVY  GAR+LDGSFMTQ+LN     +ES + SE   V  ASI+DPYVLL M DGS++L
Sbjct: 661  VIQVYGKGARVLDGSFMTQELNFTTHASES-SSSEALGVACASIADPYVLLKMVDGSVQL 720

Query: 721  LVGDPSSCSVSVSTPAAFGSSKKCVSCCTLYHDKGIEPWLRMTSTDAWLSTGVGETIDGT 780
            L+GD  +C++SV+ P+ F SS + ++ CTLY D+G EPWL  T +DAWLSTG+ E IDG 
Sbjct: 721  LIGDYCTCTLSVNAPSIFISSSERIAACTLYRDRGPEPWLTKTRSDAWLSTGIAEAIDGN 780

Query: 781  DGSLQDQGDIYCVACYDSGDLEIFDVPNFVSVFYVDKFVSGKSHLVD--FQMSDLQKNSE 840
              S  DQ DIYC+ CY+SG LEIF+VP+F  VF V+ F+SG++ LVD   Q+       E
Sbjct: 781  GTSSHDQSDIYCIICYESGKLEIFEVPSFRCVFSVENFISGEALLVDKFSQLIYEDSTKE 840

Query: 841  KLDRNSQELNSHGRNESSQNMKVIEVAMQRWSGKHSRPFLFGILTDGTILCYHAYLFEST 900
            + D     L    + E+  +++++E+AM RWSG+ SRPFLFG+L DGT+LCYHA+ +E++
Sbjct: 841  RYDCTKASL----KKEAGDSIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAFSYEAS 900

Query: 901  DSASKIDDSVSMENSVSSSNMSSSRLRNLRFLRVPLDIQGRDDMPNGTLSR-RLSIFKNI 960
            +S  K    +S + S    N S SRLRNLRF RV +DI  R+D+P  TL R R++ F N+
Sbjct: 901  ESNVK-RVPLSPQGSADHHNASDSRLRNLRFHRVSIDITSREDIP--TLGRPRITTFNNV 960

Query: 961  SGYQGLFLCGSRPAWFMVFRERLRVHPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKI 1020
             GY+GLFL G+RPAW MV R+RLRVHPQLCDGPI AFTVLHNVNC+HG IYVTSQG LKI
Sbjct: 961  GGYEGLFLSGTRPAWVMVCRQRLRVHPQLCDGPIEAFTVLHNVNCSHGFIYVTSQGFLKI 1020

Query: 1021 CQLPSTSSYDNYWPVQKVPLKGTPHQVTYFHEKNLYPVIISAPVHKPLNQVLSSMVDQD- 1080
            CQLPS  +YD+YWPVQKVPL GTPHQVTY+ E++LYP+I+S PV +PLNQVLSSM DQ+ 
Sbjct: 1021 CQLPSAYNYDSYWPVQKVPLHGTPHQVTYYAEQSLYPLIVSVPVVRPLNQVLSSMADQES 1080

Query: 1081 VGQVENHNLSADELQQTYSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLL 1140
            V  ++N   S D L +TY+V+EFE+RILE EK GG W+T++TI M   ENALT+R+VTL 
Sbjct: 1081 VHHMDNDVTSTDALHKTYTVDEFEVRILELEKPGGHWETKSTIPMQLFENALTVRIVTLH 1140

Query: 1141 NTTTKENETLLAVGTAYVQGEDVAARGRVLLFSVGKDVDNSQTLVSEVYSKELKGAISAL 1200
            NTTTKENETLLA+GTAYV GEDVAARGRVLLFS  K  +NSQ LV+EVYSKE KGA+SA+
Sbjct: 1141 NTTTKENETLLAIGTAYVLGEDVAARGRVLLFSFTKS-ENSQNLVTEVYSKESKGAVSAV 1200

Query: 1201 ASLQGHLLIASGPKIILHKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFL 1260
            ASLQGHLLIASGPKI L+KWTGAEL  +AFYD  PL+VVSLNIVKNF+L GDIHKSIYFL
Sbjct: 1201 ASLQGHLLIASGPKITLNKWTGAELTAVAFYDA-PLHVVSLNIVKNFVLFGDIHKSIYFL 1260

Query: 1261 SWKEQGAQLSLLAKDFGSLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQ 1320
            SWKEQG+QLSLLAKDFGSLDC+ATEFLIDGSTLSL  SD  KN+QIFYYAPK  ESWKGQ
Sbjct: 1261 SWKEQGSQLSLLAKDFGSLDCFATEFLIDGSTLSLVASDSDKNVQIFYYAPKMVESWKGQ 1320

Query: 1321 KLLSRAEFHVGAHVTKFLRLQMLSTTSDRASTTVSDKTNRFALLFGTLDGSIGCIAPLDE 1380
            KLLSRAEFHVGAH+TKFLRLQML T         S+KTNRFALLFG LDG IGCIAP+DE
Sbjct: 1321 KLLSRAEFHVGAHITKFLRLQMLPTQG-----LSSEKTNRFALLFGNLDGGIGCIAPIDE 1380

Query: 1381 LTFRRLQSLQKKLVDAVPHVGGLNPRSFRQFLSNGKVHRRGPDSIVDGELLCHYEMLPLE 1440
            LTFRRLQSLQ+KLVDAVPHV GLNPRSFRQF SNGK HR GPD+I+D ELLC YEML L+
Sbjct: 1381 LTFRRLQSLQRKLVDAVPHVCGLNPRSFRQFHSNGKGHRPGPDNIIDFELLCSYEMLSLD 1440

Query: 1441 EQLDIAHQIGTTRSQILSNLNDLSLGTSFL 1454
            EQLD+A QIGTTRSQILSN +D+SLGTSFL
Sbjct: 1441 EQLDVAQQIGTTRSQILSNFSDISLGTSFL 1441

BLAST of Tan0008131 vs. ExPASy Swiss-Prot
Match: Q10570 (Cleavage and polyadenylation specificity factor subunit 1 OS=Homo sapiens OX=9606 GN=CPSF1 PE=1 SV=2)

HSP 1 Score: 650.6 bits (1677), Expect = 4.2e-185
Identity = 477/1492 (31.97%), Positives = 733/1492 (49.13%), Query Frame = 0

Query: 56   NLVVTAGNVLEVYVVRVQEEGGSESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESM 115
            NLVV   + L VY +    E  +++  S E +            LEL   +   GNV SM
Sbjct: 29   NLVVAGTSQLYVYRLNRDAEALTKNDRSTEGK-------AHREKLELAASFSFFGNVMSM 88

Query: 116  AILSSRGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRES 175
            A +   G     KRD+++L F++AK+SV+E+D   H L+T S+H FE P+   L+ G   
Sbjct: 89   ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 148

Query: 176  FARGPVVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNIGAISARVESSYLI 235
                P V+VDP GRC  +LVYG ++++L   +    L  + E     G  S+ +  SY+I
Sbjct: 149  NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRR--ESLAEEHEGLVGEGQRSSFL-PSYII 208

Query: 236  NLRDLDVK--HVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQ 295
            ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 209  DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 268

Query: 296  HPLIWSANNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCM-LALNNYAVSADSSQ 355
            HP+IWS  +LP D  + LAVP PIGGV+V + NS+ Y +QS     +ALN+      +  
Sbjct: 269  HPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 328

Query: 356  DMPRSNFNVELDAANATWLLNDVALLSTKTGELLLLALVYDG-RVVQRLDLSKSKASVLT 415
               +    + LD A AT++  D  ++S K GE+ +L L+ DG R V+     K+ ASVLT
Sbjct: 329  LRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 388

Query: 416  SGIASIGNSLFFLGSRLGDSLLVQFSSGVGSSGLASSLKDEVGDIEVDAPTAKRM----- 475
            + + ++     FLGSRLG+SLL++++  +      +S   E  D E      KR+     
Sbjct: 389  TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEP--PASAVREAADKEEPPSKKKRVDATAG 448

Query: 476  -RRSSSDALQDMVGGDELSLYGSAPNNAESAQKNFSFAVRDSLINIGPLKDFSYGLRINA 535
               +     QD V  DE+ +YGS   +  +    +SF V DS++NIGP  + + G     
Sbjct: 449  WSAAGKSVPQDEV--DEIEVYGSEAQSG-TQLATYSFEVCDSILNIGPCANAAVGEPAFL 508

Query: 536  DANATGIAKQSNYELVCCSGNGKNGALCILRQSIRPEMITEVELPGCKGIWTVY------ 595
                   + + + E+V CSG+GKNGAL +L++SIRP+++T  ELPGC  +WTV       
Sbjct: 509  SEEFQN-SPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKE 568

Query: 596  -HKNTRGSSAD----SSRMVPDDDEYHAYLIISLEARTMVLETGDLLTEVTESVDYFVQG 655
               N +G   +    ++    DD   H +LI+S E  TM+L+TG  + E+  S  +  QG
Sbjct: 569  EEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQG 628

Query: 656  RTIAAGNLFGRRRVIQVYESGARILDG----SFMTQDLNMVVTCNESGNGSEGCTVLSAS 715
             T+ AGN+   R ++QV   G R+L+G     F+  DL              G  ++  +
Sbjct: 629  PTVFAGNIGDNRYIVQVSPLGIRLLEGVNQLHFIPVDL--------------GAPIVQCA 688

Query: 716  ISDPYVLLTMTDGSIRLLVGDPSSCS-----VSVSTPAAFGSSKKCVSCCTLYHDKGIEP 775
            ++DPYV++   +G + + +    S       +++  P     SK    C  LY D     
Sbjct: 689  VADPYVVIMSAEGHVTMFLLKSDSYGGRHHRLALHKPPLHHQSKVITLC--LYRDLS--- 748

Query: 776  WLRMTSTDAWL--------------STGVGE----TIDGTDGSLQ-DQGDIY-------- 835
               M +T++ L              + G+G     T+D  +  L  D G ++        
Sbjct: 749  --GMFTTESRLGGARDELGGRSGPEAEGLGSETSPTVDDEEEMLYGDSGSLFSPSKEEAR 808

Query: 836  ---------------------CVACYDSGDLEIFDVPNFVSVFYVDKFVSGKSHLVDFQM 895
                                 C+   ++G +EI+ +P++  VF V  F  G+  LVD   
Sbjct: 809  RSSQPPADRDPAPFRAEPTHWCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVD--- 868

Query: 896  SDLQKNSEKLDRNSQELNSHGRNESSQNMKVIEVAMQRWSGKHSRPFLFGILTDGTILCY 955
            S   + + + +   +E    G     +   V EV +     + SRP+L  +  D  +L Y
Sbjct: 869  SSFGQPTTQGEARREEATRQG-----ELPLVKEVLLVALGSRQSRPYLL-VHVDQELLIY 928

Query: 956  HAYLFESTDSASKIDDSVSMENSVSSSNMSSSRLRNLRFLRVPLDIQGRDDMPNGTLSR- 1015
             A+  +S                +   N+       +RF +VP +I  R+  P  +  + 
Sbjct: 929  EAFPHDS---------------QLGQGNL------KVRFKKVPHNINFREKKPKPSKKKA 988

Query: 1016 ----------------RLSIFKNISGYQGLFLCGSRPAWFMVF-RERLRVHPQLCDGPIV 1075
                            R   F++I GY G+F+CG  P W +V  R  LR+HP   DGP+ 
Sbjct: 989  EGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMAIDGPVD 1048

Query: 1076 AFTVLHNVNCNHGLIYVTSQGVLKICQLPSTSSYDNYWPVQKVPLKGTPHQVTYFHEKNL 1135
            +F   HNVNC  G +Y   QG L+I  LP+  SYD  WPV+K+PL+ T H V Y  E  +
Sbjct: 1049 SFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKV 1108

Query: 1136 YPVIISAPVHKPLNQVLSSMVDQDVGQVENHNLSADELQQTYSVEEFEIRILEPEKSGGP 1195
            Y V  S       N   + +      + E   +  DE       E F I+++ P      
Sbjct: 1109 YAVATST------NTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVS---- 1168

Query: 1196 WQT--RATIAMHSSENALTIRVVTLLNTTTKEN-ETLLAVGTAYVQGEDVAARGRVLLFS 1255
            W+    A I +   E+   ++ V+L +  T    +  +A GT  +QGE+V  RGR+L+  
Sbjct: 1169 WEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMD 1228

Query: 1256 VGKDV-DNSQTLVSE----VYSKELKGAISALASLQGHLLIASGPKIILHKWTGAELNGI 1315
            V + V +  Q L       +Y KE KG ++AL    GHL+ A G KI L     +EL G+
Sbjct: 1229 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 1288

Query: 1316 AFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDCYATEFLI 1375
            AF D   LY+  +  VKNFIL  D+ KSI  L ++E+   LSL+++D   L+ Y+ +F++
Sbjct: 1289 AFIDT-QLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMV 1348

Query: 1376 DGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKFLRLQMLSTTSD 1435
            D + L   VSD  +N+ ++ Y P++ ES+ G +LL RA+FHVGAHV  F R      T  
Sbjct: 1349 DNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEG 1408

Query: 1436 RASTTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAVPHVGGLNPRSF 1444
             +  +V  + N+    F TLDG IG + P+ E T+RRL  LQ  L   +PH  GLNPR+F
Sbjct: 1409 LSKKSVVWE-NKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAF 1433

BLAST of Tan0008131 vs. ExPASy Swiss-Prot
Match: A0A0R4IC37 (Cleavage and polyadenylation specificity factor subunit 1 OS=Danio rerio OX=7955 GN=cpsf1 PE=3 SV=2)

HSP 1 Score: 649.0 bits (1673), Expect = 1.2e-184
Identity = 469/1488 (31.52%), Positives = 753/1488 (50.60%), Query Frame = 0

Query: 56   NLVVTAGNVLEVY--VVRVQEEGGSESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVE 115
            NLVV   + L VY  +  V+    SE  S G+ R+           LE V  + L GNV 
Sbjct: 29   NLVVAGTSQLYVYRIIYDVESTSKSEKSSDGKSRK---------EKLEQVASFSLFGNVM 88

Query: 116  SMAILSSRGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGR 175
            SMA +   G      RD+++L F++AK+SV+E+D   H L+T S+H FE P+   L+ G 
Sbjct: 89   SMASVQLVG----TNRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGF 148

Query: 176  ESFARGPVVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNIG-AISARVESS 235
                  P+V+VDP+ RC  +LVYG  +++L   +      + DE  G +G    +    S
Sbjct: 149  VQNVHIPMVRVDPENRCAVMLVYGTCLVVLPFRKD----TLADEQEGIVGEGQKSSFLPS 208

Query: 236  YLINLRDLDVK--HVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTT 295
            Y+I++R+LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++  
Sbjct: 209  YIIDVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIM 268

Query: 296  LKQHPLIWSANNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCM-LALNNYAVSAD 355
             K HP+IWS +NLP D  +++AVP PIGGV+V + NS+ Y +QS     ++LN+      
Sbjct: 269  QKVHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLLYLNQSVPPFGVSLNSLTNGTT 328

Query: 356  SSQDMPRSNFNVELDAANATWLLNDVALLSTKTGELLLLALVYDG-RVVQRLDLSKSKAS 415
            +    P+    + LD + A+++ +D  ++S K GE+ +L L+ DG R V+     K+ AS
Sbjct: 329  AFPLRPQEEVKITLDCSQASFITSDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAAS 388

Query: 416  VLTSGIASIGNSLFFLGSRLGDSLLVQFSSGVGSSGLASSLKDEVGDIEVDAPTAKRMRR 475
            VLT+ + ++     FLGSRLG+SLL++++  +  + +    ++E  + + + P  K+   
Sbjct: 389  VLTTCMMTMEPGYLFLGSRLGNSLLLRYTEKLQETPMEEGKENEEKEKQEEPPNKKKRVD 448

Query: 476  SS------SDALQDMVGGDELSLYGSAPNNAESAQKNFSFAVRDSLINIGPLKDFSYG-- 535
            S+         L D +  DE+ +YGS   +  +    +SF V DS++NIGP    S G  
Sbjct: 449  SNWAGCPGKGNLPDEL--DEIEVYGSEAQSG-TQLATYSFEVCDSILNIGPCASASMGEP 508

Query: 536  --LRINADANATGIAKQSNYELVCCSGNGKNGALCILRQSIRPEMITEVELPGCKGIWTV 595
              L      N      + + E+V CSG GKNGAL +L++SIRP+++T  ELPGC  +WTV
Sbjct: 509  AFLSEEFQTN-----PEPDLEVVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCHDMWTV 568

Query: 596  YHKNTR---------GSSADSSRMVP---DDDEYHAYLIISLEARTMVLETGDLLTEVTE 655
             +   +         G S +  +  P   DD + H +LI+S E  TM+L+TG  + E+  
Sbjct: 569  IYCEEKPEKPSAEGDGESPEEEKREPTIEDDKKKHGFLILSREDSTMILQTGQEIMELDT 628

Query: 656  SVDYFVQGRTIAAGNLFGRRRVIQVYESGARILDG----SFMTQDLNMVVTCNESGNGSE 715
            S  +  QG T+ AGN+   + +IQV   G R+L+G     F+  DL              
Sbjct: 629  S-GFATQGPTVYAGNIGDNKYIIQVSPMGIRLLEGVNQLHFIPVDL-------------- 688

Query: 716  GCTVLSASISDPYVLLTMTDGSIRLLV--GDP---SSCSVSVSTPAAFGSSKKCVSCCTL 775
            G  ++  S++DPYV++   +G + + V   D     S  +++  P     S + ++ C  
Sbjct: 689  GSPIVHCSVADPYVVIMTAEGVVTMFVLKNDSYMGKSHRLALQKPQIHTQS-RVITLCAY 748

Query: 776  YHDKGI-------------EPWLRMTSTDAWLSTGVGETIDGTD---------------- 835
                G+             E  +R  S    +   +  T+D  +                
Sbjct: 749  RDVSGMFTTENKVSFLAKEEIAIRTNSETETIIQDISNTVDDEEEMLYGESNPLTSPNKE 808

Query: 836  ------------------GSLQDQGDIYCVACYDSGDLEIFDVPNFVSVFYVDKFVSGKS 895
                              GS + +   +C+   ++G +EI+ +P++  VF V  F  G+ 
Sbjct: 809  ESSRGSAAASSAHTGKESGSGRQEPSHWCLLVRENGVMEIYQLPDWRLVFLVKNFPVGQR 868

Query: 896  HLVDFQMSDLQKNSEKLDRNSQELNSHGRNESSQNMKVIEVAMQRWSGKHSRPFLFGILT 955
             LVD   S   +++ + +   +E+   G         V EVA+      HSRP+L   + 
Sbjct: 869  VLVD---SSASQSATQGELKKEEVTRQG-----DIPLVKEVALVSLGYNHSRPYLLAHV- 928

Query: 956  DGTILCYHAYLFESTDSASKIDDSVSMENSVSSSNMSSSRLRNLRFLRVPLDIQGRDDMP 1015
            +  +L Y A+ ++   + S +   V  +    + N    +++ +R  + P + QG D + 
Sbjct: 929  EQELLIYEAFPYDQQQAQSNL--KVRFKKMPHNINYREKKVK-VRKDKKP-EGQGEDTLG 988

Query: 1016 NGTLSRRLSIFKNISGYQGLFLCGSRPAWFMV-FRERLRVHPQLCDGPIVAFTVLHNVNC 1075
                  R   F++ISGY G+F+CG  P W +V  R  +R+HP   DG I +F+  HN+NC
Sbjct: 989  VKGRVARFRYFQDISGYSGVFICGPSPHWMLVTSRGAMRLHPMTIDGAIESFSPFHNINC 1048

Query: 1076 NHGLIYVTSQGVLKICQLPSTSSYDNYWPVQKVPLKGTPHQVTYFHEKNLYPVIISAPVH 1135
              G +Y   QG L+I  LP+  SYD  WPV+K+PL+ T H V+Y  E  +Y V  S  V 
Sbjct: 1049 PKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTVHYVSYHVESKVYAVCTS--VK 1108

Query: 1136 KPLNQVLSSMVDQDVGQVENHNLSADELQQTYSVEEFEIRILEPEKSGGPWQTRATIAMH 1195
            +P  ++     ++     E   +  DE       ++F I+++ P        TR  + + 
Sbjct: 1109 EPCTRIPRMTGEEK----EFETIERDERYIHPQQDKFSIQLISPVSWEAIPNTR--VDLE 1168

Query: 1196 SSENALTIRVVTLLNTTTKEN-ETLLAVGTAYVQGEDVAARGRVLLFSVGKDV-DNSQTL 1255
              E+   ++ V L +  T    +  +A+GT  +QGE+V  RGR+L+  V + V +  Q L
Sbjct: 1169 EWEHVTCMKTVALKSQETVSGLKGYVALGTCLMQGEEVTCRGRILILDVIEVVPEPGQPL 1228

Query: 1256 VSE----VYSKELKGAISALASLQGHLLIASGPKIILHKWTGAELNGIAFYDVPPLYVVS 1315
                   +Y KE KG ++AL    G L+ A G KI L      +L G+AF D   LY+  
Sbjct: 1229 TKNKFKVLYEKEQKGPVTALCHCSGFLVSAIGQKIFLWSLKDNDLTGMAFIDT-QLYIHQ 1288

Query: 1316 LNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDCYATEFLIDGSTLSLTVSDD 1375
            +  +KNFIL  D+ KSI  L ++ +   LSL+++D   L+ Y+ EF++D + L   VSD 
Sbjct: 1289 MYSIKNFILAADVMKSISLLRYQPESKTLSLVSRDAKPLEVYSIEFMVDNNQLGFLVSDR 1348

Query: 1376 QKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKFLRLQMLSTTSDRASTTVSDKTNR 1435
             KN+ ++ Y P++ ES+ G +LL RA+F+VG+HV  F R+    T  D A+       N+
Sbjct: 1349 DKNLMVYMYLPEAKESFGGMRLLRRADFNVGSHVNAFWRMPCRGTL-DTANKKALTWDNK 1408

Query: 1436 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAVPHVGGLNPRSFRQFLSNGKVHRR 1452
                F TLDG +G + P+ E T+RRL  LQ  L   +PH  GLNP++FR    + +  + 
Sbjct: 1409 HITWFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPKAFRMLHCDRRTLQN 1449

BLAST of Tan0008131 vs. ExPASy Swiss-Prot
Match: Q10569 (Cleavage and polyadenylation specificity factor subunit 1 OS=Bos taurus OX=9913 GN=CPSF1 PE=1 SV=1)

HSP 1 Score: 646.7 bits (1667), Expect = 6.1e-184
Identity = 484/1545 (31.33%), Positives = 748/1545 (48.41%), Query Frame = 0

Query: 3    FAAYRMMHWPTGIE-NCDSGFITHSRADFVPGVTSHTDELESDWPARREMGPVPNLVVTA 62
            +A Y+  H PTG+E +    F  +S  + V   TS                         
Sbjct: 2    YAVYKQAHPPTGLEFSMYCNFFNNSERNLVVAGTS------------------------- 61

Query: 63   GNVLEVYVVRVQEEGGSESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSSR 122
                ++YV R+  +  SE+ +  +    G         LELV  +   GNV SMA +   
Sbjct: 62   ----QLYVYRLNRD--SEAPTKNDRSTDGKAHREHREKLELVASFSFFGNVMSMASVQLA 121

Query: 123  GGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGPV 182
            G     KRD+++L F++AK+SV+E+D   H L+T S+H FE P+   L+ G       P 
Sbjct: 122  GA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQNVHTPR 181

Query: 183  VKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNIGAISARVESSYLINLRDLD 242
            V+VDP GRC  +L+YG ++++L   +    L  + E     G  S+ +  SY+I++R LD
Sbjct: 182  VRVDPDGRCAAMLIYGTRLVVLPFRR--ESLAEEHEGLVGEGQRSSFL-PSYIIDVRALD 241

Query: 243  VK--HVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 302
             K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K HP+IWS
Sbjct: 242  EKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWS 301

Query: 303  ANNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCM-LALNNYAVSADSSQDMPRSN 362
              +LP D  + LAVP PIGGV++ + NS+ Y +QS     +ALN+      +     +  
Sbjct: 302  LTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQEG 361

Query: 363  FNVELDAANATWLLNDVALLSTKTGELLLLALVYDG-RVVQRLDLSKSKASVLTSGIASI 422
              + LD A A ++  D  ++S K GE+ +L L+ DG R V+     K+ ASVLT+ + ++
Sbjct: 362  VRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTM 421

Query: 423  GNSLFFLGSRLGDSLLVQFSSGVGSSGLASSLKDEVGDIEVDAPTAKRMRRS-----SSD 482
                 FLGSRLG+SLL++++  +      +S   E  D E      KR+  +     S  
Sbjct: 422  EPGYLFLGSRLGNSLLLKYTEKLQEP--PASTAREAADKEEPPSKKKRVDATTGWSGSKS 481

Query: 483  ALQDMVGGDELSLYGSAPNNAESAQKNFSFAVRDSLINIGPLKDFSYGLRINADANATGI 542
              QD V  DE+ +YGS   +  +    +SF V DS++NIGP  + + G            
Sbjct: 482  VPQDEV--DEIEVYGSEAQSG-TQLATYSFEVCDSILNIGPCANAAMGEPAFLSEEFQN- 541

Query: 543  AKQSNYELVCCSGNGKNGALCILRQSIRPEMITEVELPGCKGIWTV-------YHKNTRG 602
            + + + E+V CSG GKNGAL +L++SIRP+++T  ELPGC  +WTV         +  +G
Sbjct: 542  SPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEQEETLKG 601

Query: 603  SSADSSRMVP---DDDEYHAYLIISLEARTMVLETGDLLTEVTESVDYFVQGRTIAAGNL 662
               +     P   DD   H +LI+S E  TM+L+TG  + E+  S  +  QG T+ AGN+
Sbjct: 602  EGTEPEPGAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDAS-GFATQGPTVFAGNI 661

Query: 663  FGRRRVIQVYESGARILDG----SFMTQDLNMVVTCNESGNGSEGCTVLSASISDPYVLL 722
               R ++QV   G R+L+G     F+  DL              G  ++  +++DPYV++
Sbjct: 662  GDNRYIVQVSPLGIRLLEGVNQLHFIPVDL--------------GSPIVQCAVADPYVVI 721

Query: 723  TMTDGSIRLLVGDPSSCS-----VSVSTPAAFGSSKKCVSCCTLYHDKGIEPWLRMTSTD 782
               +G + + +    S       +++  P     SK    C  +Y D        M +T+
Sbjct: 722  MSAEGHVTMFLLKNDSYGGRHHRLALHKPPLHHQSKVITLC--VYRDVS-----GMFTTE 781

Query: 783  AWLSTGVGETIDGTDG------------SLQDQGDI------------------------ 842
            + L  GV + + G  G            ++ D+ ++                        
Sbjct: 782  SRLG-GVRDELGGRGGPEAEGQGAETSPTVDDEEEMLYGDSGSLFSPSKEEARRSSQPPA 841

Query: 843  -------------YCVACYDSGDLEIFDVPNFVSVFYVDKFVSGKSHLVDFQMSDLQKNS 902
                         +C+   ++G +EI+ +P++  VF V  F  G+  LVD   S   + +
Sbjct: 842  DRDPAPFRAEPTHWCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVD---SSFGQPT 901

Query: 903  EKLDRNSQELNSHGRNESSQNMKVIEVAMQRWSGKHSRPFLFGILTDGTILCYHAYLFES 962
             + +   +E    G     +   V EV +     +  RP+L  +  D  +L Y A+  +S
Sbjct: 902  TQGEARKEEATRQG-----ELPLVKEVLLVALGSRQRRPYLL-VHVDQELLIYEAFPHDS 961

Query: 963  TDSASKIDDSVSMENSVSSSNMSSSRLRNLRFLRVPLDIQGRDDMP-------------N 1022
                            +   N+       +RF +VP +I  R+  P              
Sbjct: 962  ---------------QLGQGNL------KVRFKKVPHNINFREKKPKPSKKKAEGGSTEE 1021

Query: 1023 GTLSR----RLSIFKNISGYQGLFLCGSRPAWFMVF-RERLRVHPQLCDGPIVAFTVLHN 1082
            GT  R    R   F++I GY G+F+CG  P W +V  R  LR+HP   DGPI +F   HN
Sbjct: 1022 GTGPRGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHN 1081

Query: 1083 VNCNHGLIYVTSQGVLKICQLPSTSSYDNYWPVQKVPLKGTPHQVTYFHEKNLYPVIISA 1142
            +NC  G +Y   QG L+I  LP+  SYD  WPV+K+PL+ T H V Y  E  +Y V  S 
Sbjct: 1082 INCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATST 1141

Query: 1143 PVHKPLNQVLSSMVDQDVGQVENHNLSADELQQTYSVEEFEIRILEPEKSGGPWQT--RA 1202
                P  +V     ++     E   +  DE       E F I+++ P      W+    A
Sbjct: 1142 ST--PCTRVPRMTGEEK----EFETIERDERYVHPQQEAFCIQLISPVS----WEAIPNA 1201

Query: 1203 TIAMHSSENALTIRVVTLLNTTTKEN-ETLLAVGTAYVQGEDVAARGRVLLFSVGKDV-D 1262
             I +   E+   ++ V+L +  T    +  +A GT  +QGE+V  RGR+L+  V + V +
Sbjct: 1202 RIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPE 1261

Query: 1263 NSQTLVSE----VYSKELKGAISALASLQGHLLIASGPKIILHKWTGAELNGIAFYDVPP 1322
              Q L       +Y KE KG ++AL    GHL+ A G KI L     +EL G+AF D   
Sbjct: 1262 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDT-Q 1321

Query: 1323 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDCYATEFLIDGSTLSL 1382
            LY+  +  VKNFIL  D+ KSI  L ++E+   LSL+++D   L+ Y+ +F++D + L  
Sbjct: 1322 LYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGF 1381

Query: 1383 TVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKFLRLQMLSTTSDRASTTVS 1442
             VSD  +N+ ++ Y P++ ES+ G +LL RA+FHVGAHV  F R          +  +V 
Sbjct: 1382 LVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGAAEGPSKKSVV 1434

Query: 1443 DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAVPHVGGLNPRSFRQFLSNG 1444
             + N+    F TLDG IG + P+ E T+RRL  LQ  L   +PH  GLNPR+FR    + 
Sbjct: 1442 WE-NKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDR 1434

BLAST of Tan0008131 vs. NCBI nr
Match: XP_038887722.1 (cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Benincasa hispida])

HSP 1 Score: 2792.3 bits (7237), Expect = 0.0e+00
Identity = 1406/1454 (96.70%), Positives = 1429/1454 (98.28%), Query Frame = 0

Query: 1    MSFAAYRMMHWPTGIENCDSGFITHSRADFVPGVTSHTDELESDWPARREMGPVPNLVVT 60
            MSFAAYRMMHWPTGIENCDSGFITHS ADFVPGV SH D+L+SDW  RRE+GPVPNLVVT
Sbjct: 1    MSFAAYRMMHWPTGIENCDSGFITHSPADFVPGVASHADDLDSDWQPRREIGPVPNLVVT 60

Query: 61   AGNVLEVYVVRVQEEGGSESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 120
            AGNVLEVYVVRVQEEGG ES+SSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS
Sbjct: 61   AGNVLEVYVVRVQEEGGRESKSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 120

Query: 121  RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 180
            RGGDGSKKRDSIILVFQEAKISVLEFDDS HSLRTSSMHCFEGPQWLHLKRGRESFARGP
Sbjct: 121  RGGDGSKKRDSIILVFQEAKISVLEFDDSTHSLRTSSMHCFEGPQWLHLKRGRESFARGP 180

Query: 181  VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNIGAISARVESSYLINLRDL 240
            VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGN GAISARVESSYLINLRDL
Sbjct: 181  VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNTGAISARVESSYLINLRDL 240

Query: 241  DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 300
            DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA
Sbjct: 241  DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 300

Query: 301  NNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCMLALNNYAVSADSSQDMPRSNFN 360
            NNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCMLALNNYAVSADSSQDMPRSNFN
Sbjct: 301  NNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCMLALNNYAVSADSSQDMPRSNFN 360

Query: 361  VELDAANATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS 420
            VELDAANATWL+NDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS
Sbjct: 361  VELDAANATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS 420

Query: 421  LFFLGSRLGDSLLVQFSSGVGSSGLASSLKDEVGDIEVDAPTAKRMRRSSSDALQDMVGG 480
            LFFLGSRLGDSLLVQFS GVGSSGLASSLKDEVGDIEVDA  AKRMRRSSSDALQDMVGG
Sbjct: 421  LFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAHIAKRMRRSSSDALQDMVGG 480

Query: 481  DELSLYGSAPNNAESAQKNFSFAVRDSLINIGPLKDFSYGLRINADANATGIAKQSNYEL 540
            DELSLYGSAPNN ESAQK+FSFAVRDSLINIGPLKDFSYGLRINAD NATGIAKQSNYEL
Sbjct: 481  DELSLYGSAPNNTESAQKSFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL 540

Query: 541  VCCSGNGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSSADSSRMVPDDDEY 600
            VCCSG+GKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGS ADSSRMVPDDDEY
Sbjct: 541  VCCSGHGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSGADSSRMVPDDDEY 600

Query: 601  HAYLIISLEARTMVLETGDLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVYESGARILD 660
            HAYLIISLEARTMVLETGDLLTEVTESVDYFVQGRT+AAGNLFGRRRVIQVYESGARILD
Sbjct: 601  HAYLIISLEARTMVLETGDLLTEVTESVDYFVQGRTVAAGNLFGRRRVIQVYESGARILD 660

Query: 661  GSFMTQDLNMVVTCNESGNGSEG-CTVLSASISDPYVLLTMTDGSIRLLVGDPSSCSVSV 720
            GSFMTQDLN+VVT NESGNGSEG CTVLSASISDPYVLLTMTDGSIRLLVGDPSSCSVSV
Sbjct: 661  GSFMTQDLNLVVTSNESGNGSEGCCTVLSASISDPYVLLTMTDGSIRLLVGDPSSCSVSV 720

Query: 721  STPAAFGSSKKCVSCCTLYHDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYC 780
            S PAAFGSS+KCVSCCTLYHDKGIEPWLR TSTDAWLSTG+GETIDGTDGS+QDQGDIYC
Sbjct: 721  SAPAAFGSSRKCVSCCTLYHDKGIEPWLRKTSTDAWLSTGIGETIDGTDGSVQDQGDIYC 780

Query: 781  VACYDSGDLEIFDVPNFVSVFYVDKFVSGKSHLVDFQMSDLQKNSEKLDRNSQELNSHGR 840
            VACYDSGDLEIFDVPNF+SVFYVDKFVSGKSHLVDFQ+SD+QKNSEK+D+NSQEL SHGR
Sbjct: 781  VACYDSGDLEIFDVPNFISVFYVDKFVSGKSHLVDFQVSDMQKNSEKVDQNSQELISHGR 840

Query: 841  NESSQNMKVIEVAMQRWSGKHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSMEN 900
            NESSQNMKV+EVAMQRWSG+HSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSMEN
Sbjct: 841  NESSQNMKVVEVAMQRWSGQHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSMEN 900

Query: 901  SVSSSNMSSSRLRNLRFLRVPLDIQGRDDMPNGTLSRRLSIFKNISGYQGLFLCGSRPAW 960
            SVSSSNMSSSRLRNLRF+RVPLDIQGRDDMPNGTLSRRLSIFKNISGYQGLFLCGSRPAW
Sbjct: 901  SVSSSNMSSSRLRNLRFVRVPLDIQGRDDMPNGTLSRRLSIFKNISGYQGLFLCGSRPAW 960

Query: 961  FMVFRERLRVHPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTSSYDNYWPV 1020
            FMVFRERLRVHPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTS+YDNYWPV
Sbjct: 961  FMVFRERLRVHPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTSNYDNYWPV 1020

Query: 1021 QKVPLKGTPHQVTYFHEKNLYPVIISAPVHKPLNQVLSSMVDQDVGQVENHNLSADELQQ 1080
            QKVPLKGTPHQVTYFHEKNLYPVIISAPV KPLNQVLSSMVDQDVG VENHNLSADELQQ
Sbjct: 1021 QKVPLKGTPHQVTYFHEKNLYPVIISAPVQKPLNQVLSSMVDQDVGHVENHNLSADELQQ 1080

Query: 1081 TYSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTA 1140
            TYSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTA
Sbjct: 1081 TYSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTA 1140

Query: 1141 YVQGEDVAARGRVLLFSVGKDVDNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKII 1200
            YVQGEDVAARGRVLLFSVGKD DNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKII
Sbjct: 1141 YVQGEDVAARGRVLLFSVGKDADNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKII 1200

Query: 1201 LHKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDF 1260
            LHKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDF
Sbjct: 1201 LHKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDF 1260

Query: 1261 GSLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTK 1320
            GSLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTK
Sbjct: 1261 GSLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTK 1320

Query: 1321 FLRLQMLSTTSDRASTTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDA 1380
            FLRLQMLSTTSD+AS+TVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL DA
Sbjct: 1321 FLRLQMLSTTSDKASSTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLGDA 1380

Query: 1381 VPHVGGLNPRSFRQFLSNGKVHRRGPDSIVDGELLCHYEMLPLEEQLDIAHQIGTTRSQI 1440
            VPHVGGLNPRSFRQF SNGKVHRRGPDSIVD ELLCHYEMLPLEEQLDIAHQIGTTRSQI
Sbjct: 1381 VPHVGGLNPRSFRQFQSNGKVHRRGPDSIVDCELLCHYEMLPLEEQLDIAHQIGTTRSQI 1440

Query: 1441 LSNLNDLSLGTSFL 1454
            LSNLNDLSLGTSFL
Sbjct: 1441 LSNLNDLSLGTSFL 1454

BLAST of Tan0008131 vs. NCBI nr
Match: XP_022973592.1 (cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Cucurbita maxima] >XP_022973593.1 cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Cucurbita maxima])

HSP 1 Score: 2767.3 bits (7172), Expect = 0.0e+00
Identity = 1396/1453 (96.08%), Positives = 1421/1453 (97.80%), Query Frame = 0

Query: 1    MSFAAYRMMHWPTGIENCDSGFITHSRADFVPGVTSHTDELESDWPARREMGPVPNLVVT 60
            MSFAAYRMMH PTGIENCDSGFITHSRADFVP VTSH D+LESDWP RRE+GPVPNLVVT
Sbjct: 1    MSFAAYRMMHCPTGIENCDSGFITHSRADFVPRVTSHADDLESDWPPRREIGPVPNLVVT 60

Query: 61   AGNVLEVYVVRVQEEGGSESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 120
            AGNVLEVYVVRVQEEGG ESRSSGEVRRGGIMDG+SGASLELVCHYRLHGNVESM ILSS
Sbjct: 61   AGNVLEVYVVRVQEEGGKESRSSGEVRRGGIMDGLSGASLELVCHYRLHGNVESMVILSS 120

Query: 121  RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 180
            RGGDGSKKRDSIILVF+EAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP
Sbjct: 121  RGGDGSKKRDSIILVFKEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 180

Query: 181  VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNIGAISARVESSYLINLRDL 240
            VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEA G IGA SARVESSYLINLRDL
Sbjct: 181  VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEACGKIGASSARVESSYLINLRDL 240

Query: 241  DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 300
            DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRV+WKHHTCMISALSISTTLKQHPLIWSA
Sbjct: 241  DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVAWKHHTCMISALSISTTLKQHPLIWSA 300

Query: 301  NNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCMLALNNYAVSADSSQDMPRSNFN 360
            NNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCMLALNNYAVS DSSQDMPRSNFN
Sbjct: 301  NNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCMLALNNYAVSPDSSQDMPRSNFN 360

Query: 361  VELDAANATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS 420
            VELDAA+ATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIG+S
Sbjct: 361  VELDAAHATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGSS 420

Query: 421  LFFLGSRLGDSLLVQFSSGVGSSGLASSLKDEVGDIEVDAPTAKRMRRSSSDALQDMVGG 480
            LFFLGSRLGDSLLVQFS GVGSSGLASSLKDEVGDIEVDAPTAKRMRRSSSDALQDMVGG
Sbjct: 421  LFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAPTAKRMRRSSSDALQDMVGG 480

Query: 481  DELSLYGSAPNNAESAQKNFSFAVRDSLINIGPLKDFSYGLRINADANATGIAKQSNYEL 540
            DELSLYG   NN ESAQKNFSFAVRDSLINIGPLKDFSYGLRINADANATGIAKQSNYEL
Sbjct: 481  DELSLYG-VSNNTESAQKNFSFAVRDSLINIGPLKDFSYGLRINADANATGIAKQSNYEL 540

Query: 541  VCCSGNGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSSADSSRMVPDDDEY 600
            VCCSGNGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSSADSSRM+ DDDEY
Sbjct: 541  VCCSGNGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSSADSSRMLSDDDEY 600

Query: 601  HAYLIISLEARTMVLETGDLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVYESGARILD 660
            HAYLIISLE+RTMVLETGDLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVYE+GARILD
Sbjct: 601  HAYLIISLESRTMVLETGDLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVYETGARILD 660

Query: 661  GSFMTQDLNMVVTCNESGNGSEGCTVLSASISDPYVLLTMTDGSIRLLVGDPSSCSVSVS 720
            GSFMTQDLN+VV  NESGNGSEGCTVLSASISDPYVLLTMTDGSIRLLVGDPSSCSVSVS
Sbjct: 661  GSFMTQDLNLVVPGNESGNGSEGCTVLSASISDPYVLLTMTDGSIRLLVGDPSSCSVSVS 720

Query: 721  TPAAFGSSKKCVSCCTLYHDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCV 780
             PAAFG SKKCVSCCTLYHDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCV
Sbjct: 721  APAAFGGSKKCVSCCTLYHDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCV 780

Query: 781  ACYDSGDLEIFDVPNFVSVFYVDKFVSGKSHLVDFQMSDLQKNSEKLDRNSQELNSHGRN 840
            ACYDSGDLEIFDVPNF SVFYVDKFVSGKSHLVDFQ+SDLQK+SE+LD NSQELN++GRN
Sbjct: 781  ACYDSGDLEIFDVPNFTSVFYVDKFVSGKSHLVDFQISDLQKSSERLDGNSQELNNNGRN 840

Query: 841  ESSQNMKVIEVAMQRWSGKHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSMENS 900
            ESSQNMKV EVAMQRWSG+HSRPFLFGILTDGTILCYHAYLFES+D+ASKIDDSVSMENS
Sbjct: 841  ESSQNMKVTEVAMQRWSGQHSRPFLFGILTDGTILCYHAYLFESSDTASKIDDSVSMENS 900

Query: 901  VSSSNMSSSRLRNLRFLRVPLDIQGRDDMPNGTLSRRLSIFKNISGYQGLFLCGSRPAWF 960
            VSSSNMSSSRLRNLRFLRVPLDIQGRDDMPNG LSRRLSIFKNISGYQGLFLCGSRPAWF
Sbjct: 901  VSSSNMSSSRLRNLRFLRVPLDIQGRDDMPNGALSRRLSIFKNISGYQGLFLCGSRPAWF 960

Query: 961  MVFRERLRVHPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTSSYDNYWPVQ 1020
            MVFRERLR+HPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTS+YDNYWPVQ
Sbjct: 961  MVFRERLRIHPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTSNYDNYWPVQ 1020

Query: 1021 KVPLKGTPHQVTYFHEKNLYPVIISAPVHKPLNQVLSSMVDQDVGQVENHNLSADELQQT 1080
            KVPLKGTPHQVTYFHEKNLYPVIISAPVHKPLNQVLSSMVDQD G VENHNLSADELQQT
Sbjct: 1021 KVPLKGTPHQVTYFHEKNLYPVIISAPVHKPLNQVLSSMVDQDAGHVENHNLSADELQQT 1080

Query: 1081 YSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTAY 1140
            YSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTAY
Sbjct: 1081 YSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTAY 1140

Query: 1141 VQGEDVAARGRVLLFSVGKDVDNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIIL 1200
            VQGEDVAARGRVLLFSVGKD DNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIIL
Sbjct: 1141 VQGEDVAARGRVLLFSVGKDADNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIIL 1200

Query: 1201 HKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFG 1260
            HKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFG
Sbjct: 1201 HKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFG 1260

Query: 1261 SLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKF 1320
            SLDC+ATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKF
Sbjct: 1261 SLDCFATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKF 1320

Query: 1321 LRLQMLSTTSDRASTTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV 1380
            LRLQMLSTTSDR S+TVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV
Sbjct: 1321 LRLQMLSTTSDRGSSTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV 1380

Query: 1381 PHVGGLNPRSFRQFLSNGKVHRRGPDSIVDGELLCHYEMLPLEEQLDIAHQIGTTRSQIL 1440
            PHVGGLNPRSFRQF SNGKVHR GPDSIVD ELLCHYEM+PLEEQL+IA QIGTTRSQIL
Sbjct: 1381 PHVGGLNPRSFRQFHSNGKVHRSGPDSIVDCELLCHYEMIPLEEQLEIAQQIGTTRSQIL 1440

Query: 1441 SNLNDLSLGTSFL 1454
            SNLNDLSLGTSFL
Sbjct: 1441 SNLNDLSLGTSFL 1452

BLAST of Tan0008131 vs. NCBI nr
Match: XP_023520837.1 (cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023520838.1 cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023520839.1 cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 2759.9 bits (7153), Expect = 0.0e+00
Identity = 1393/1453 (95.87%), Positives = 1420/1453 (97.73%), Query Frame = 0

Query: 1    MSFAAYRMMHWPTGIENCDSGFITHSRADFVPGVTSHTDELESDWPARREMGPVPNLVVT 60
            MSFAAYRMMH PTGIENCDSGFITHSRADFVP VTSH D+LESDWP RRE+GPVPNLVVT
Sbjct: 1    MSFAAYRMMHSPTGIENCDSGFITHSRADFVPRVTSHADDLESDWPPRREIGPVPNLVVT 60

Query: 61   AGNVLEVYVVRVQEEGGSESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 120
            AGNVLEVYVVRVQE+GG ESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESM ILSS
Sbjct: 61   AGNVLEVYVVRVQEDGGKESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMVILSS 120

Query: 121  RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 180
            RGGDGSKKRDSIILVF+EAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP
Sbjct: 121  RGGDGSKKRDSIILVFKEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 180

Query: 181  VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNIGAISARVESSYLINLRDL 240
            VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEA GN+GA SARVESSYLINLRDL
Sbjct: 181  VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEACGNVGASSARVESSYLINLRDL 240

Query: 241  DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 300
            DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRV+WKHHTCMISALSISTTLKQHPLIWSA
Sbjct: 241  DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVAWKHHTCMISALSISTTLKQHPLIWSA 300

Query: 301  NNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCMLALNNYAVSADSSQDMPRSNFN 360
            NNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCMLALNNYAVS DSSQDMPRSNFN
Sbjct: 301  NNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCMLALNNYAVSPDSSQDMPRSNFN 360

Query: 361  VELDAANATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS 420
            VELDAA+ATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIG+S
Sbjct: 361  VELDAAHATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGSS 420

Query: 421  LFFLGSRLGDSLLVQFSSGVGSSGLASSLKDEVGDIEVDAPTAKRMRRSSSDALQDMVGG 480
            LFFLGSRLGDSLLVQFS GVGSSGLASSLKDEVGDIEVDAPT+KRMRRSSSDALQDMVGG
Sbjct: 421  LFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAPTSKRMRRSSSDALQDMVGG 480

Query: 481  DELSLYGSAPNNAESAQKNFSFAVRDSLINIGPLKDFSYGLRINADANATGIAKQSNYEL 540
            DELSLYG   NN ESAQKNFSFAVRDSLINIGPLKDFSYGLRINADANATGIAKQSNYEL
Sbjct: 481  DELSLYG-VSNNTESAQKNFSFAVRDSLINIGPLKDFSYGLRINADANATGIAKQSNYEL 540

Query: 541  VCCSGNGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSSADSSRMVPDDDEY 600
            VCCSGNGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSSADSSRM+ DDDEY
Sbjct: 541  VCCSGNGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSSADSSRMLSDDDEY 600

Query: 601  HAYLIISLEARTMVLETGDLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVYESGARILD 660
            HAYLIISLEARTMVLETGDLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVYE+GAR+LD
Sbjct: 601  HAYLIISLEARTMVLETGDLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVYETGARVLD 660

Query: 661  GSFMTQDLNMVVTCNESGNGSEGCTVLSASISDPYVLLTMTDGSIRLLVGDPSSCSVSVS 720
            GSFMTQDLN+VV  NESGNGSEGCTVLSASISDPYVLLTMTDGSIRLLVGDPSSCSVSVS
Sbjct: 661  GSFMTQDLNLVVPGNESGNGSEGCTVLSASISDPYVLLTMTDGSIRLLVGDPSSCSVSVS 720

Query: 721  TPAAFGSSKKCVSCCTLYHDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCV 780
             PAAFGSSKKCVSCCTLYHDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCV
Sbjct: 721  APAAFGSSKKCVSCCTLYHDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCV 780

Query: 781  ACYDSGDLEIFDVPNFVSVFYVDKFVSGKSHLVDFQMSDLQKNSEKLDRNSQELNSHGRN 840
            ACYDSGDLEIFDVPNF SVFYVDKFVSGKSHLVDFQ+SD QK+SE+LD NSQELN++GRN
Sbjct: 781  ACYDSGDLEIFDVPNFTSVFYVDKFVSGKSHLVDFQISDSQKSSERLDGNSQELNNNGRN 840

Query: 841  ESSQNMKVIEVAMQRWSGKHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSMENS 900
            ESSQNMKV EVAMQRWSG+HSRPFLFGILTDGTILCYHAYLFES+D+ASKIDDSVSMEN 
Sbjct: 841  ESSQNMKVTEVAMQRWSGQHSRPFLFGILTDGTILCYHAYLFESSDTASKIDDSVSMEN- 900

Query: 901  VSSSNMSSSRLRNLRFLRVPLDIQGRDDMPNGTLSRRLSIFKNISGYQGLFLCGSRPAWF 960
             SSSNMSSSRLRNLRFLRVPLDIQGRDDMPNG LSRRLSIFKNISGYQGLFLCGSRPAWF
Sbjct: 901  -SSSNMSSSRLRNLRFLRVPLDIQGRDDMPNGALSRRLSIFKNISGYQGLFLCGSRPAWF 960

Query: 961  MVFRERLRVHPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTSSYDNYWPVQ 1020
            MVFRERLR+HPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTS+YDNYWPVQ
Sbjct: 961  MVFRERLRIHPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTSNYDNYWPVQ 1020

Query: 1021 KVPLKGTPHQVTYFHEKNLYPVIISAPVHKPLNQVLSSMVDQDVGQVENHNLSADELQQT 1080
            KVPLKGTPHQVTYFHEKNLYPVIISAPVHKPLNQVLSSMVDQD G VENHNLSADELQQT
Sbjct: 1021 KVPLKGTPHQVTYFHEKNLYPVIISAPVHKPLNQVLSSMVDQDAGHVENHNLSADELQQT 1080

Query: 1081 YSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTAY 1140
            YSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTAY
Sbjct: 1081 YSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTAY 1140

Query: 1141 VQGEDVAARGRVLLFSVGKDVDNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIIL 1200
            VQGEDVAARGRVLLFSVGKD DNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIIL
Sbjct: 1141 VQGEDVAARGRVLLFSVGKDADNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIIL 1200

Query: 1201 HKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFG 1260
            HKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFG
Sbjct: 1201 HKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFG 1260

Query: 1261 SLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKF 1320
            SLDC+ATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKF
Sbjct: 1261 SLDCFATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKF 1320

Query: 1321 LRLQMLSTTSDRASTTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV 1380
            LRLQMLSTTSDR S+TVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV
Sbjct: 1321 LRLQMLSTTSDRGSSTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV 1380

Query: 1381 PHVGGLNPRSFRQFLSNGKVHRRGPDSIVDGELLCHYEMLPLEEQLDIAHQIGTTRSQIL 1440
            PHVGGLNPRSFRQF SNGKVHR GPDSIVD ELLCHYEM+PLEEQL+IA QIGTTRSQIL
Sbjct: 1381 PHVGGLNPRSFRQFHSNGKVHRSGPDSIVDCELLCHYEMIPLEEQLEIAQQIGTTRSQIL 1440

Query: 1441 SNLNDLSLGTSFL 1454
            SNLNDLSLGTSFL
Sbjct: 1441 SNLNDLSLGTSFL 1450

BLAST of Tan0008131 vs. NCBI nr
Match: XP_008441850.1 (PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Cucumis melo])

HSP 1 Score: 2755.7 bits (7142), Expect = 0.0e+00
Identity = 1392/1453 (95.80%), Positives = 1419/1453 (97.66%), Query Frame = 0

Query: 1    MSFAAYRMMHWPTGIENCDSGFITHSRADFVPGVTSHTDELESDWPARREMGPVPNLVVT 60
            MSFAAYRMMHWPTGIENCDS FITHSRADFVP VTSH+D+L+SDW  RR++GPVPNLVVT
Sbjct: 1    MSFAAYRMMHWPTGIENCDSAFITHSRADFVPAVTSHSDDLDSDWHPRRDIGPVPNLVVT 60

Query: 61   AGNVLEVYVVRVQEEGGSESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 120
            AGNVLEVYVVRV EEGG ESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS
Sbjct: 61   AGNVLEVYVVRVLEEGGRESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 120

Query: 121  RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 180
            RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP
Sbjct: 121  RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 180

Query: 181  VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNIGAISARVESSYLINLRDL 240
            VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGN GAISARVESSYLINLRDL
Sbjct: 181  VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNTGAISARVESSYLINLRDL 240

Query: 241  DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 300
            DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA
Sbjct: 241  DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 300

Query: 301  NNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCMLALNNYAVSADSSQDMPRSNFN 360
            NNLPHDAYKLLAVPSPIGGVLV+SANSIHY+SQSASCMLALNNYAVSADSSQDMPRSNFN
Sbjct: 301  NNLPHDAYKLLAVPSPIGGVLVISANSIHYNSQSASCMLALNNYAVSADSSQDMPRSNFN 360

Query: 361  VELDAANATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS 420
            VELDAANATWL+NDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVL SGIASIGNS
Sbjct: 361  VELDAANATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLASGIASIGNS 420

Query: 421  LFFLGSRLGDSLLVQFSSGVGSSGLASSLKDEVGDIEVDAPTAKRMRRSSSDALQDMVGG 480
            LFFLGSRLGDSLLVQFS GVGSSGLAS+LKDE GDIEVDA TAKRMRRSSSDALQDMVGG
Sbjct: 421  LFFLGSRLGDSLLVQFSCGVGSSGLASNLKDEGGDIEVDAHTAKRMRRSSSDALQDMVGG 480

Query: 481  DELSLYGSAPNNAESAQKNFSFAVRDSLINIGPLKDFSYGLRINADANATGIAKQSNYEL 540
            DELSLYGSA NN ESAQKNFSFAVRDSLINIGPLKDFSYGLRINAD NATGIAKQSNYEL
Sbjct: 481  DELSLYGSAANNTESAQKNFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL 540

Query: 541  VCCSGNGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSSADSSRMVPDDDEY 600
            VCCSG+GKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGS ADSSRMVPDDDEY
Sbjct: 541  VCCSGHGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSIADSSRMVPDDDEY 600

Query: 601  HAYLIISLEARTMVLETGDLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVYESGARILD 660
            HAYLIISLEARTMVLETGDLLTEVTE+VDYFV GRTIAAGNLFGRRRVIQVYESGARILD
Sbjct: 601  HAYLIISLEARTMVLETGDLLTEVTETVDYFVHGRTIAAGNLFGRRRVIQVYESGARILD 660

Query: 661  GSFMTQDLNMVVTCNESGNGSEGCTVLSASISDPYVLLTMTDGSIRLLVGDPSSCSVSVS 720
            GSFMTQDLN+VV  NESGN SEGCTVLSASISDPYVLLTMTDGSIRLLVGD SSCSVSVS
Sbjct: 661  GSFMTQDLNLVVNGNESGNASEGCTVLSASISDPYVLLTMTDGSIRLLVGDSSSCSVSVS 720

Query: 721  TPAAFGSSKKCVSCCTLYHDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCV 780
             PAAFGSSKKCVS CTLYHDKG+EPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCV
Sbjct: 721  APAAFGSSKKCVSSCTLYHDKGVEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCV 780

Query: 781  ACYDSGDLEIFDVPNFVSVFYVDKFVSGKSHLVDFQMSDLQKNSEKLDRNSQELNSHGRN 840
            ACYD+GDLEIFDVPNF+SVFYVDKFVSGKSHLVD Q+SDLQK SE +D+NSQEL SHGRN
Sbjct: 781  ACYDNGDLEIFDVPNFISVFYVDKFVSGKSHLVDHQISDLQKPSE-VDQNSQELISHGRN 840

Query: 841  ESSQNMKVIEVAMQRWSGKHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSMENS 900
            ESSQNMKVIEVAMQRWSG+HSRPFLFGILTDGTILCYHAYLFE+TDSASKIDDSVS++NS
Sbjct: 841  ESSQNMKVIEVAMQRWSGQHSRPFLFGILTDGTILCYHAYLFENTDSASKIDDSVSIDNS 900

Query: 901  VSSSNMSSSRLRNLRFLRVPLDIQGRDDMPNGTLSRRLSIFKNISGYQGLFLCGSRPAWF 960
            VSSSNMSSSRLRNLRFLRVPLDIQGRDDMPNGTLSRRLSIFKNISGYQGLFLCGSRPAWF
Sbjct: 901  VSSSNMSSSRLRNLRFLRVPLDIQGRDDMPNGTLSRRLSIFKNISGYQGLFLCGSRPAWF 960

Query: 961  MVFRERLRVHPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTSSYDNYWPVQ 1020
            MVFRERLRVHPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTS+YDNYWPVQ
Sbjct: 961  MVFRERLRVHPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTSNYDNYWPVQ 1020

Query: 1021 KVPLKGTPHQVTYFHEKNLYPVIISAPVHKPLNQVLSSMVDQDVGQVENHNLSADELQQT 1080
            KVPLKGTPHQVTYFHEKNLYPVIISAPV KPLNQVLSSMVDQDVG VENHNLSADELQQT
Sbjct: 1021 KVPLKGTPHQVTYFHEKNLYPVIISAPVQKPLNQVLSSMVDQDVGHVENHNLSADELQQT 1080

Query: 1081 YSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTAY 1140
            YSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLN TTKENETLLAVGTAY
Sbjct: 1081 YSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNATTKENETLLAVGTAY 1140

Query: 1141 VQGEDVAARGRVLLFSVGKDVDNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIIL 1200
            VQGEDVAARGRVLLFSVGKD DNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIIL
Sbjct: 1141 VQGEDVAARGRVLLFSVGKDADNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIIL 1200

Query: 1201 HKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFG 1260
            HKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFG
Sbjct: 1201 HKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFG 1260

Query: 1261 SLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKF 1320
            SLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKF
Sbjct: 1261 SLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKF 1320

Query: 1321 LRLQMLSTTSDRASTTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV 1380
            LRLQMLST+SD+A +TVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL DAV
Sbjct: 1321 LRLQMLSTSSDKACSTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLGDAV 1380

Query: 1381 PHVGGLNPRSFRQFLSNGKVHRRGPDSIVDGELLCHYEMLPLEEQLDIAHQIGTTRSQIL 1440
            PHVGGLNPRSFRQF SNGKVHRRGPDSIVD ELLCHYEMLPLEEQL+IAHQIGTTRSQIL
Sbjct: 1381 PHVGGLNPRSFRQFHSNGKVHRRGPDSIVDCELLCHYEMLPLEEQLEIAHQIGTTRSQIL 1440

Query: 1441 SNLNDLSLGTSFL 1454
            SNLNDLSLGTSFL
Sbjct: 1441 SNLNDLSLGTSFL 1452

BLAST of Tan0008131 vs. NCBI nr
Match: KAG7011474.1 (Cleavage and polyadenylation specificity factor subunit 1, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 2752.6 bits (7134), Expect = 0.0e+00
Identity = 1389/1453 (95.60%), Positives = 1418/1453 (97.59%), Query Frame = 0

Query: 1    MSFAAYRMMHWPTGIENCDSGFITHSRADFVPGVTSHTDELESDWPARREMGPVPNLVVT 60
            MSFAAYRMMH PTGIENCDSGFITHSR+DFVP VTSH D+LESDWP RRE+GPVPNLVVT
Sbjct: 1    MSFAAYRMMHCPTGIENCDSGFITHSRSDFVPRVTSHADDLESDWPPRREIGPVPNLVVT 60

Query: 61   AGNVLEVYVVRVQEEGGSESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 120
            AGNVLEVYVVRVQE+GG ESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESM ILSS
Sbjct: 61   AGNVLEVYVVRVQEDGGKESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMVILSS 120

Query: 121  RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 180
            RGGDGSKKRDSIILVF+EAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP
Sbjct: 121  RGGDGSKKRDSIILVFKEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 180

Query: 181  VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNIGAISARVESSYLINLRDL 240
            VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEA GN+GA SARVESSYLINLRDL
Sbjct: 181  VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEACGNVGASSARVESSYLINLRDL 240

Query: 241  DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 300
            DVKHVKDFVFVHGYIEPVMVILHEQE TWAGRV+WKHHTCMISALSISTTLKQHPLIWSA
Sbjct: 241  DVKHVKDFVFVHGYIEPVMVILHEQEPTWAGRVAWKHHTCMISALSISTTLKQHPLIWSA 300

Query: 301  NNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCMLALNNYAVSADSSQDMPRSNFN 360
            NNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCMLALNNYAVS DSSQDMPRSNFN
Sbjct: 301  NNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCMLALNNYAVSPDSSQDMPRSNFN 360

Query: 361  VELDAANATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS 420
            VELDAA+ATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIG+S
Sbjct: 361  VELDAAHATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGSS 420

Query: 421  LFFLGSRLGDSLLVQFSSGVGSSGLASSLKDEVGDIEVDAPTAKRMRRSSSDALQDMVGG 480
            LFFLGSRLGDSLLVQFS GVGSSGLASSLKDEVGDIEVDAPT+KRMRRSSSDALQDMVGG
Sbjct: 421  LFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAPTSKRMRRSSSDALQDMVGG 480

Query: 481  DELSLYGSAPNNAESAQKNFSFAVRDSLINIGPLKDFSYGLRINADANATGIAKQSNYEL 540
            DELSLYG   NN ESAQK+FSFAVRDSLINIGPLKDFSYGLRINADANATGIAKQSNYEL
Sbjct: 481  DELSLYG-VSNNTESAQKSFSFAVRDSLINIGPLKDFSYGLRINADANATGIAKQSNYEL 540

Query: 541  VCCSGNGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSSADSSRMVPDDDEY 600
            VCCSGNGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSSADSSRM+ DDDEY
Sbjct: 541  VCCSGNGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSSADSSRMLSDDDEY 600

Query: 601  HAYLIISLEARTMVLETGDLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVYESGARILD 660
            HAYLIISLEARTMVLETGDLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVYE+GAR+LD
Sbjct: 601  HAYLIISLEARTMVLETGDLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVYETGARVLD 660

Query: 661  GSFMTQDLNMVVTCNESGNGSEGCTVLSASISDPYVLLTMTDGSIRLLVGDPSSCSVSVS 720
            GSFMTQDLN+VV  NESGNG EGCTVLSASISDPYVLLTMTDGSIRLLVGDPSSCSVSVS
Sbjct: 661  GSFMTQDLNLVVPGNESGNGPEGCTVLSASISDPYVLLTMTDGSIRLLVGDPSSCSVSVS 720

Query: 721  TPAAFGSSKKCVSCCTLYHDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCV 780
             PAAFGSSKKCVSCCTLYHDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCV
Sbjct: 721  APAAFGSSKKCVSCCTLYHDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCV 780

Query: 781  ACYDSGDLEIFDVPNFVSVFYVDKFVSGKSHLVDFQMSDLQKNSEKLDRNSQELNSHGRN 840
            ACYDSGDLEIFDVPNF SVFYVDKFVSGKSHLVDFQ+SD QK+SE+LD NSQELN++GRN
Sbjct: 781  ACYDSGDLEIFDVPNFTSVFYVDKFVSGKSHLVDFQISDSQKSSERLDGNSQELNNNGRN 840

Query: 841  ESSQNMKVIEVAMQRWSGKHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSMENS 900
            ESSQNMKV EVAMQRWSG+HSRPFLFGILTDGTILCYHAYLFES+D+ASKIDDSVSMEN 
Sbjct: 841  ESSQNMKVTEVAMQRWSGQHSRPFLFGILTDGTILCYHAYLFESSDTASKIDDSVSMEN- 900

Query: 901  VSSSNMSSSRLRNLRFLRVPLDIQGRDDMPNGTLSRRLSIFKNISGYQGLFLCGSRPAWF 960
             SSSNMSSSRLRNLRFLRVPLDIQGRDDMPNG LSRRLSIFKNISGYQGLFLCGSRPAWF
Sbjct: 901  -SSSNMSSSRLRNLRFLRVPLDIQGRDDMPNGALSRRLSIFKNISGYQGLFLCGSRPAWF 960

Query: 961  MVFRERLRVHPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTSSYDNYWPVQ 1020
            MVFRERLR+HPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTS+YDNYWPVQ
Sbjct: 961  MVFRERLRIHPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTSNYDNYWPVQ 1020

Query: 1021 KVPLKGTPHQVTYFHEKNLYPVIISAPVHKPLNQVLSSMVDQDVGQVENHNLSADELQQT 1080
            KVPLKGTPHQVTYFHEKNLYPVIISAPVHKPLNQVLSSMVDQD G VENHNLSADELQQT
Sbjct: 1021 KVPLKGTPHQVTYFHEKNLYPVIISAPVHKPLNQVLSSMVDQDAGHVENHNLSADELQQT 1080

Query: 1081 YSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTAY 1140
            YSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTAY
Sbjct: 1081 YSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTAY 1140

Query: 1141 VQGEDVAARGRVLLFSVGKDVDNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIIL 1200
            VQGEDVAARGRVLLFSVGKD DNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIIL
Sbjct: 1141 VQGEDVAARGRVLLFSVGKDADNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIIL 1200

Query: 1201 HKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFG 1260
            HKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFG
Sbjct: 1201 HKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFG 1260

Query: 1261 SLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKF 1320
            SLDC+ATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKF
Sbjct: 1261 SLDCFATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKF 1320

Query: 1321 LRLQMLSTTSDRASTTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV 1380
            LRLQMLSTTSDR S+TVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV
Sbjct: 1321 LRLQMLSTTSDRGSSTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV 1380

Query: 1381 PHVGGLNPRSFRQFLSNGKVHRRGPDSIVDGELLCHYEMLPLEEQLDIAHQIGTTRSQIL 1440
            PHVGGLNPRSFRQF SNGKVHR GPDSIVD ELLCHYEM+PLEEQL+IA QIGTTRSQIL
Sbjct: 1381 PHVGGLNPRSFRQFHSNGKVHRSGPDSIVDCELLCHYEMIPLEEQLEIAQQIGTTRSQIL 1440

Query: 1441 SNLNDLSLGTSFL 1454
            SNLNDLSLGTSFL
Sbjct: 1441 SNLNDLSLGTSFL 1450

BLAST of Tan0008131 vs. ExPASy TrEMBL
Match: A0A6J1I7X9 (cleavage and polyadenylation specificity factor subunit 1 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111472174 PE=4 SV=1)

HSP 1 Score: 2767.3 bits (7172), Expect = 0.0e+00
Identity = 1396/1453 (96.08%), Positives = 1421/1453 (97.80%), Query Frame = 0

Query: 1    MSFAAYRMMHWPTGIENCDSGFITHSRADFVPGVTSHTDELESDWPARREMGPVPNLVVT 60
            MSFAAYRMMH PTGIENCDSGFITHSRADFVP VTSH D+LESDWP RRE+GPVPNLVVT
Sbjct: 1    MSFAAYRMMHCPTGIENCDSGFITHSRADFVPRVTSHADDLESDWPPRREIGPVPNLVVT 60

Query: 61   AGNVLEVYVVRVQEEGGSESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 120
            AGNVLEVYVVRVQEEGG ESRSSGEVRRGGIMDG+SGASLELVCHYRLHGNVESM ILSS
Sbjct: 61   AGNVLEVYVVRVQEEGGKESRSSGEVRRGGIMDGLSGASLELVCHYRLHGNVESMVILSS 120

Query: 121  RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 180
            RGGDGSKKRDSIILVF+EAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP
Sbjct: 121  RGGDGSKKRDSIILVFKEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 180

Query: 181  VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNIGAISARVESSYLINLRDL 240
            VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEA G IGA SARVESSYLINLRDL
Sbjct: 181  VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEACGKIGASSARVESSYLINLRDL 240

Query: 241  DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 300
            DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRV+WKHHTCMISALSISTTLKQHPLIWSA
Sbjct: 241  DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVAWKHHTCMISALSISTTLKQHPLIWSA 300

Query: 301  NNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCMLALNNYAVSADSSQDMPRSNFN 360
            NNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCMLALNNYAVS DSSQDMPRSNFN
Sbjct: 301  NNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCMLALNNYAVSPDSSQDMPRSNFN 360

Query: 361  VELDAANATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS 420
            VELDAA+ATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIG+S
Sbjct: 361  VELDAAHATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGSS 420

Query: 421  LFFLGSRLGDSLLVQFSSGVGSSGLASSLKDEVGDIEVDAPTAKRMRRSSSDALQDMVGG 480
            LFFLGSRLGDSLLVQFS GVGSSGLASSLKDEVGDIEVDAPTAKRMRRSSSDALQDMVGG
Sbjct: 421  LFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAPTAKRMRRSSSDALQDMVGG 480

Query: 481  DELSLYGSAPNNAESAQKNFSFAVRDSLINIGPLKDFSYGLRINADANATGIAKQSNYEL 540
            DELSLYG   NN ESAQKNFSFAVRDSLINIGPLKDFSYGLRINADANATGIAKQSNYEL
Sbjct: 481  DELSLYG-VSNNTESAQKNFSFAVRDSLINIGPLKDFSYGLRINADANATGIAKQSNYEL 540

Query: 541  VCCSGNGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSSADSSRMVPDDDEY 600
            VCCSGNGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSSADSSRM+ DDDEY
Sbjct: 541  VCCSGNGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSSADSSRMLSDDDEY 600

Query: 601  HAYLIISLEARTMVLETGDLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVYESGARILD 660
            HAYLIISLE+RTMVLETGDLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVYE+GARILD
Sbjct: 601  HAYLIISLESRTMVLETGDLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVYETGARILD 660

Query: 661  GSFMTQDLNMVVTCNESGNGSEGCTVLSASISDPYVLLTMTDGSIRLLVGDPSSCSVSVS 720
            GSFMTQDLN+VV  NESGNGSEGCTVLSASISDPYVLLTMTDGSIRLLVGDPSSCSVSVS
Sbjct: 661  GSFMTQDLNLVVPGNESGNGSEGCTVLSASISDPYVLLTMTDGSIRLLVGDPSSCSVSVS 720

Query: 721  TPAAFGSSKKCVSCCTLYHDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCV 780
             PAAFG SKKCVSCCTLYHDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCV
Sbjct: 721  APAAFGGSKKCVSCCTLYHDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCV 780

Query: 781  ACYDSGDLEIFDVPNFVSVFYVDKFVSGKSHLVDFQMSDLQKNSEKLDRNSQELNSHGRN 840
            ACYDSGDLEIFDVPNF SVFYVDKFVSGKSHLVDFQ+SDLQK+SE+LD NSQELN++GRN
Sbjct: 781  ACYDSGDLEIFDVPNFTSVFYVDKFVSGKSHLVDFQISDLQKSSERLDGNSQELNNNGRN 840

Query: 841  ESSQNMKVIEVAMQRWSGKHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSMENS 900
            ESSQNMKV EVAMQRWSG+HSRPFLFGILTDGTILCYHAYLFES+D+ASKIDDSVSMENS
Sbjct: 841  ESSQNMKVTEVAMQRWSGQHSRPFLFGILTDGTILCYHAYLFESSDTASKIDDSVSMENS 900

Query: 901  VSSSNMSSSRLRNLRFLRVPLDIQGRDDMPNGTLSRRLSIFKNISGYQGLFLCGSRPAWF 960
            VSSSNMSSSRLRNLRFLRVPLDIQGRDDMPNG LSRRLSIFKNISGYQGLFLCGSRPAWF
Sbjct: 901  VSSSNMSSSRLRNLRFLRVPLDIQGRDDMPNGALSRRLSIFKNISGYQGLFLCGSRPAWF 960

Query: 961  MVFRERLRVHPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTSSYDNYWPVQ 1020
            MVFRERLR+HPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTS+YDNYWPVQ
Sbjct: 961  MVFRERLRIHPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTSNYDNYWPVQ 1020

Query: 1021 KVPLKGTPHQVTYFHEKNLYPVIISAPVHKPLNQVLSSMVDQDVGQVENHNLSADELQQT 1080
            KVPLKGTPHQVTYFHEKNLYPVIISAPVHKPLNQVLSSMVDQD G VENHNLSADELQQT
Sbjct: 1021 KVPLKGTPHQVTYFHEKNLYPVIISAPVHKPLNQVLSSMVDQDAGHVENHNLSADELQQT 1080

Query: 1081 YSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTAY 1140
            YSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTAY
Sbjct: 1081 YSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTAY 1140

Query: 1141 VQGEDVAARGRVLLFSVGKDVDNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIIL 1200
            VQGEDVAARGRVLLFSVGKD DNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIIL
Sbjct: 1141 VQGEDVAARGRVLLFSVGKDADNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIIL 1200

Query: 1201 HKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFG 1260
            HKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFG
Sbjct: 1201 HKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFG 1260

Query: 1261 SLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKF 1320
            SLDC+ATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKF
Sbjct: 1261 SLDCFATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKF 1320

Query: 1321 LRLQMLSTTSDRASTTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV 1380
            LRLQMLSTTSDR S+TVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV
Sbjct: 1321 LRLQMLSTTSDRGSSTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV 1380

Query: 1381 PHVGGLNPRSFRQFLSNGKVHRRGPDSIVDGELLCHYEMLPLEEQLDIAHQIGTTRSQIL 1440
            PHVGGLNPRSFRQF SNGKVHR GPDSIVD ELLCHYEM+PLEEQL+IA QIGTTRSQIL
Sbjct: 1381 PHVGGLNPRSFRQFHSNGKVHRSGPDSIVDCELLCHYEMIPLEEQLEIAQQIGTTRSQIL 1440

Query: 1441 SNLNDLSLGTSFL 1454
            SNLNDLSLGTSFL
Sbjct: 1441 SNLNDLSLGTSFL 1452

BLAST of Tan0008131 vs. ExPASy TrEMBL
Match: A0A1S3B4E8 (cleavage and polyadenylation specificity factor subunit 1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103485885 PE=4 SV=1)

HSP 1 Score: 2755.7 bits (7142), Expect = 0.0e+00
Identity = 1392/1453 (95.80%), Positives = 1419/1453 (97.66%), Query Frame = 0

Query: 1    MSFAAYRMMHWPTGIENCDSGFITHSRADFVPGVTSHTDELESDWPARREMGPVPNLVVT 60
            MSFAAYRMMHWPTGIENCDS FITHSRADFVP VTSH+D+L+SDW  RR++GPVPNLVVT
Sbjct: 1    MSFAAYRMMHWPTGIENCDSAFITHSRADFVPAVTSHSDDLDSDWHPRRDIGPVPNLVVT 60

Query: 61   AGNVLEVYVVRVQEEGGSESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 120
            AGNVLEVYVVRV EEGG ESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS
Sbjct: 61   AGNVLEVYVVRVLEEGGRESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 120

Query: 121  RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 180
            RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP
Sbjct: 121  RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 180

Query: 181  VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNIGAISARVESSYLINLRDL 240
            VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGN GAISARVESSYLINLRDL
Sbjct: 181  VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNTGAISARVESSYLINLRDL 240

Query: 241  DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 300
            DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA
Sbjct: 241  DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 300

Query: 301  NNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCMLALNNYAVSADSSQDMPRSNFN 360
            NNLPHDAYKLLAVPSPIGGVLV+SANSIHY+SQSASCMLALNNYAVSADSSQDMPRSNFN
Sbjct: 301  NNLPHDAYKLLAVPSPIGGVLVISANSIHYNSQSASCMLALNNYAVSADSSQDMPRSNFN 360

Query: 361  VELDAANATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS 420
            VELDAANATWL+NDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVL SGIASIGNS
Sbjct: 361  VELDAANATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLASGIASIGNS 420

Query: 421  LFFLGSRLGDSLLVQFSSGVGSSGLASSLKDEVGDIEVDAPTAKRMRRSSSDALQDMVGG 480
            LFFLGSRLGDSLLVQFS GVGSSGLAS+LKDE GDIEVDA TAKRMRRSSSDALQDMVGG
Sbjct: 421  LFFLGSRLGDSLLVQFSCGVGSSGLASNLKDEGGDIEVDAHTAKRMRRSSSDALQDMVGG 480

Query: 481  DELSLYGSAPNNAESAQKNFSFAVRDSLINIGPLKDFSYGLRINADANATGIAKQSNYEL 540
            DELSLYGSA NN ESAQKNFSFAVRDSLINIGPLKDFSYGLRINAD NATGIAKQSNYEL
Sbjct: 481  DELSLYGSAANNTESAQKNFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL 540

Query: 541  VCCSGNGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSSADSSRMVPDDDEY 600
            VCCSG+GKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGS ADSSRMVPDDDEY
Sbjct: 541  VCCSGHGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSIADSSRMVPDDDEY 600

Query: 601  HAYLIISLEARTMVLETGDLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVYESGARILD 660
            HAYLIISLEARTMVLETGDLLTEVTE+VDYFV GRTIAAGNLFGRRRVIQVYESGARILD
Sbjct: 601  HAYLIISLEARTMVLETGDLLTEVTETVDYFVHGRTIAAGNLFGRRRVIQVYESGARILD 660

Query: 661  GSFMTQDLNMVVTCNESGNGSEGCTVLSASISDPYVLLTMTDGSIRLLVGDPSSCSVSVS 720
            GSFMTQDLN+VV  NESGN SEGCTVLSASISDPYVLLTMTDGSIRLLVGD SSCSVSVS
Sbjct: 661  GSFMTQDLNLVVNGNESGNASEGCTVLSASISDPYVLLTMTDGSIRLLVGDSSSCSVSVS 720

Query: 721  TPAAFGSSKKCVSCCTLYHDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCV 780
             PAAFGSSKKCVS CTLYHDKG+EPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCV
Sbjct: 721  APAAFGSSKKCVSSCTLYHDKGVEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCV 780

Query: 781  ACYDSGDLEIFDVPNFVSVFYVDKFVSGKSHLVDFQMSDLQKNSEKLDRNSQELNSHGRN 840
            ACYD+GDLEIFDVPNF+SVFYVDKFVSGKSHLVD Q+SDLQK SE +D+NSQEL SHGRN
Sbjct: 781  ACYDNGDLEIFDVPNFISVFYVDKFVSGKSHLVDHQISDLQKPSE-VDQNSQELISHGRN 840

Query: 841  ESSQNMKVIEVAMQRWSGKHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSMENS 900
            ESSQNMKVIEVAMQRWSG+HSRPFLFGILTDGTILCYHAYLFE+TDSASKIDDSVS++NS
Sbjct: 841  ESSQNMKVIEVAMQRWSGQHSRPFLFGILTDGTILCYHAYLFENTDSASKIDDSVSIDNS 900

Query: 901  VSSSNMSSSRLRNLRFLRVPLDIQGRDDMPNGTLSRRLSIFKNISGYQGLFLCGSRPAWF 960
            VSSSNMSSSRLRNLRFLRVPLDIQGRDDMPNGTLSRRLSIFKNISGYQGLFLCGSRPAWF
Sbjct: 901  VSSSNMSSSRLRNLRFLRVPLDIQGRDDMPNGTLSRRLSIFKNISGYQGLFLCGSRPAWF 960

Query: 961  MVFRERLRVHPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTSSYDNYWPVQ 1020
            MVFRERLRVHPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTS+YDNYWPVQ
Sbjct: 961  MVFRERLRVHPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTSNYDNYWPVQ 1020

Query: 1021 KVPLKGTPHQVTYFHEKNLYPVIISAPVHKPLNQVLSSMVDQDVGQVENHNLSADELQQT 1080
            KVPLKGTPHQVTYFHEKNLYPVIISAPV KPLNQVLSSMVDQDVG VENHNLSADELQQT
Sbjct: 1021 KVPLKGTPHQVTYFHEKNLYPVIISAPVQKPLNQVLSSMVDQDVGHVENHNLSADELQQT 1080

Query: 1081 YSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTAY 1140
            YSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLN TTKENETLLAVGTAY
Sbjct: 1081 YSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNATTKENETLLAVGTAY 1140

Query: 1141 VQGEDVAARGRVLLFSVGKDVDNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIIL 1200
            VQGEDVAARGRVLLFSVGKD DNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIIL
Sbjct: 1141 VQGEDVAARGRVLLFSVGKDADNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIIL 1200

Query: 1201 HKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFG 1260
            HKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFG
Sbjct: 1201 HKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFG 1260

Query: 1261 SLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKF 1320
            SLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKF
Sbjct: 1261 SLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKF 1320

Query: 1321 LRLQMLSTTSDRASTTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV 1380
            LRLQMLST+SD+A +TVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL DAV
Sbjct: 1321 LRLQMLSTSSDKACSTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLGDAV 1380

Query: 1381 PHVGGLNPRSFRQFLSNGKVHRRGPDSIVDGELLCHYEMLPLEEQLDIAHQIGTTRSQIL 1440
            PHVGGLNPRSFRQF SNGKVHRRGPDSIVD ELLCHYEMLPLEEQL+IAHQIGTTRSQIL
Sbjct: 1381 PHVGGLNPRSFRQFHSNGKVHRRGPDSIVDCELLCHYEMLPLEEQLEIAHQIGTTRSQIL 1440

Query: 1441 SNLNDLSLGTSFL 1454
            SNLNDLSLGTSFL
Sbjct: 1441 SNLNDLSLGTSFL 1452

BLAST of Tan0008131 vs. ExPASy TrEMBL
Match: A0A6J1EY93 (cleavage and polyadenylation specificity factor subunit 1 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111439584 PE=4 SV=1)

HSP 1 Score: 2748.8 bits (7124), Expect = 0.0e+00
Identity = 1388/1453 (95.53%), Positives = 1417/1453 (97.52%), Query Frame = 0

Query: 1    MSFAAYRMMHWPTGIENCDSGFITHSRADFVPGVTSHTDELESDWPARREMGPVPNLVVT 60
            MSFAAYRMMH PTGIENCDSGFITHSR+DFVP VTSH D+LESDWP RRE+GPVPNLVVT
Sbjct: 1    MSFAAYRMMHCPTGIENCDSGFITHSRSDFVPRVTSHADDLESDWPPRREIGPVPNLVVT 60

Query: 61   AGNVLEVYVVRVQEEGGSESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 120
            AGNVLEVYVVRVQE+GG ESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESM ILSS
Sbjct: 61   AGNVLEVYVVRVQEDGGKESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMVILSS 120

Query: 121  RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 180
            RGGDGSKKRDSIILVF+EAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP
Sbjct: 121  RGGDGSKKRDSIILVFKEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 180

Query: 181  VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNIGAISARVESSYLINLRDL 240
            VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEA GN+GA SARVESSYLINLRDL
Sbjct: 181  VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEACGNVGASSARVESSYLINLRDL 240

Query: 241  DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 300
            DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRV+WKHHTCMISALSISTTLKQHPLIWSA
Sbjct: 241  DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVAWKHHTCMISALSISTTLKQHPLIWSA 300

Query: 301  NNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCMLALNNYAVSADSSQDMPRSNFN 360
            NNLPHDAYKLLAVPSPIGGVLVVSANSIHY SQSASCMLALNNYAVS DSSQDMPRSNFN
Sbjct: 301  NNLPHDAYKLLAVPSPIGGVLVVSANSIHYLSQSASCMLALNNYAVSPDSSQDMPRSNFN 360

Query: 361  VELDAANATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS 420
            VELDAA+ATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIG+S
Sbjct: 361  VELDAAHATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGSS 420

Query: 421  LFFLGSRLGDSLLVQFSSGVGSSGLASSLKDEVGDIEVDAPTAKRMRRSSSDALQDMVGG 480
            LFFLGSRLGDSLLVQFS GVGSSGLASSLKDEVGDIEVDAPT+KRMRRSSSDALQDMVGG
Sbjct: 421  LFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAPTSKRMRRSSSDALQDMVGG 480

Query: 481  DELSLYGSAPNNAESAQKNFSFAVRDSLINIGPLKDFSYGLRINADANATGIAKQSNYEL 540
            DELSLYG   NN ESAQK+FSFAVRDSLINIGPLKDFSYGLRINADANATGIAKQSNYEL
Sbjct: 481  DELSLYG-VSNNTESAQKSFSFAVRDSLINIGPLKDFSYGLRINADANATGIAKQSNYEL 540

Query: 541  VCCSGNGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSSADSSRMVPDDDEY 600
            VCCSGNGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSSADSSRM+ DDDEY
Sbjct: 541  VCCSGNGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSSADSSRMLSDDDEY 600

Query: 601  HAYLIISLEARTMVLETGDLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVYESGARILD 660
            HAYLIISLEARTMVLETGDLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVYE+GAR+LD
Sbjct: 601  HAYLIISLEARTMVLETGDLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVYETGARVLD 660

Query: 661  GSFMTQDLNMVVTCNESGNGSEGCTVLSASISDPYVLLTMTDGSIRLLVGDPSSCSVSVS 720
            GSFMTQDLN+VV  NESGNG EGCTVLSASISDPYVLLTMTDGSIRLLVGDPSSCSVSVS
Sbjct: 661  GSFMTQDLNLVVPGNESGNGPEGCTVLSASISDPYVLLTMTDGSIRLLVGDPSSCSVSVS 720

Query: 721  TPAAFGSSKKCVSCCTLYHDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCV 780
             PAAFGSSKKCVSCCTLYHDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCV
Sbjct: 721  APAAFGSSKKCVSCCTLYHDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCV 780

Query: 781  ACYDSGDLEIFDVPNFVSVFYVDKFVSGKSHLVDFQMSDLQKNSEKLDRNSQELNSHGRN 840
            ACYDSGDLEIFDVPNF SVFYVDKFVSGKSHLVDFQ+SD QK+SE+LD NSQELN++GRN
Sbjct: 781  ACYDSGDLEIFDVPNFTSVFYVDKFVSGKSHLVDFQISDSQKSSERLDGNSQELNNNGRN 840

Query: 841  ESSQNMKVIEVAMQRWSGKHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSMENS 900
            ESSQNMKV EVAMQRWSG+HSRPFLFGILTDGTILCYHAYLFES+D+ASKIDDSVSMEN 
Sbjct: 841  ESSQNMKVTEVAMQRWSGQHSRPFLFGILTDGTILCYHAYLFESSDTASKIDDSVSMEN- 900

Query: 901  VSSSNMSSSRLRNLRFLRVPLDIQGRDDMPNGTLSRRLSIFKNISGYQGLFLCGSRPAWF 960
             SSSNMSSSRLRNLRFLRVPLDIQGRDDMPNG LSRRLSIFKNISGYQGLFLCGSRPAWF
Sbjct: 901  -SSSNMSSSRLRNLRFLRVPLDIQGRDDMPNGALSRRLSIFKNISGYQGLFLCGSRPAWF 960

Query: 961  MVFRERLRVHPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTSSYDNYWPVQ 1020
            MVFRERLR+HPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTS+YDNYWPVQ
Sbjct: 961  MVFRERLRIHPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTSNYDNYWPVQ 1020

Query: 1021 KVPLKGTPHQVTYFHEKNLYPVIISAPVHKPLNQVLSSMVDQDVGQVENHNLSADELQQT 1080
            KVPLKGTPHQVTYFHEKNLYPVIISAPVHKPLNQVLSSMVDQD G VENHNLSADELQQT
Sbjct: 1021 KVPLKGTPHQVTYFHEKNLYPVIISAPVHKPLNQVLSSMVDQDAGHVENHNLSADELQQT 1080

Query: 1081 YSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTAY 1140
            YSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENE LLAVGTAY
Sbjct: 1081 YSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENEILLAVGTAY 1140

Query: 1141 VQGEDVAARGRVLLFSVGKDVDNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIIL 1200
            VQGEDVAARGRVLLFSVGKD DNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIIL
Sbjct: 1141 VQGEDVAARGRVLLFSVGKDADNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIIL 1200

Query: 1201 HKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFG 1260
            HKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFG
Sbjct: 1201 HKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFG 1260

Query: 1261 SLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKF 1320
            SLDC+ATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKF
Sbjct: 1261 SLDCFATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKF 1320

Query: 1321 LRLQMLSTTSDRASTTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV 1380
            LRLQMLSTTSDR S+TVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV
Sbjct: 1321 LRLQMLSTTSDRGSSTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV 1380

Query: 1381 PHVGGLNPRSFRQFLSNGKVHRRGPDSIVDGELLCHYEMLPLEEQLDIAHQIGTTRSQIL 1440
            PHVGGLNPRSFRQF SNGKVHR GPDSIVD ELLCHYEM+PLEEQL+IA QIGTTRSQIL
Sbjct: 1381 PHVGGLNPRSFRQFHSNGKVHRSGPDSIVDCELLCHYEMIPLEEQLEIAQQIGTTRSQIL 1440

Query: 1441 SNLNDLSLGTSFL 1454
            SNLNDLSLGTSFL
Sbjct: 1441 SNLNDLSLGTSFL 1450
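
Note: the percentages in each HSP header follow directly from the residue counts (identical or positive positions divided by alignment length). The short Python sketch below recomputes them from a header line as a sanity check. The function name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of the database or of any BLAST tool; the pattern tolerates both the "Positives" and "Postives" spellings.

```python
import re

# Hypothetical pattern for the HSP statistics line shown on this page.
# "Posi?tives" accepts both the correct spelling and the "Postives" variant.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts from an HSP header line and recompute the percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identical, length, reported_id, positives, pos_length, reported_pos = match.groups()
    identical, length = int(identical), int(length)
    positives, pos_length = int(positives), int(pos_length)
    return {
        "identical": identical,
        "positives": positives,
        "alignment_length": length,
        # Recomputed values, rounded to two decimals like the report.
        "identity_pct": round(100.0 * identical / length, 2),
        "positives_pct": round(100.0 * positives / pos_length, 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(reported_id),
        "reported_positives_pct": float(reported_pos),
    }


if __name__ == "__main__":
    header = "Identity = 1388/1453 (95.53%), Positives = 1417/1453 (97.52%), Query Frame = 0"
    stats = parse_hsp_stats(header)
    print(stats["identity_pct"], stats["positives_pct"])
```

For the Cucurbita moschata hit above, the recomputed values agree with the reported 95.53% identity and 97.52% positives over an alignment length of 1453.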

BLAST of Tan0008131 vs. ExPASy TrEMBL
Match: A0A0A0LKI9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G074280 PE=4 SV=1)

HSP 1 Score: 2738.8 bits (7098), Expect = 0.0e+00
Identity = 1385/1453 (95.32%), Positives = 1415/1453 (97.38%), Query Frame = 0

Query: 1    MSFAAYRMMHWPTGIENCDSGFITHSRADFVPGVTSHTDELESDWPARREMGPVPNLVVT 60
            MSFAAYRMMHWPTGIENCDS +ITHSRADFVP VTSH+D+L+SDW  RR++GPVPNLVVT
Sbjct: 1    MSFAAYRMMHWPTGIENCDSAYITHSRADFVPAVTSHSDDLDSDWHPRRDIGPVPNLVVT 60

Query: 61   AGNVLEVYVVRVQEEGGSESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 120
            AGNVLEVYVVRV EEGG ES+SSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS
Sbjct: 61   AGNVLEVYVVRVLEEGGRESKSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 120

Query: 121  RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 180
            RGGDGSKKRDSIILVFQEAKISVLEFDDS HSLRTSSMHCF+GPQWLHLKRGRESFARGP
Sbjct: 121  RGGDGSKKRDSIILVFQEAKISVLEFDDSTHSLRTSSMHCFDGPQWLHLKRGRESFARGP 180

Query: 181  VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNIGAISARVESSYLINLRDL 240
            VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGN GAISARVESSYLINLRDL
Sbjct: 181  VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNTGAISARVESSYLINLRDL 240

Query: 241  DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 300
            DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCM+SALSISTTLKQHPLIWSA
Sbjct: 241  DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMVSALSISTTLKQHPLIWSA 300

Query: 301  NNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCMLALNNYAVSADSSQDMPRSNFN 360
            +NLPHDAYKLLAVPSPIGGVLV+SANSIHY+SQSASCMLALNNYAVSADSSQDMPRSNFN
Sbjct: 301  SNLPHDAYKLLAVPSPIGGVLVISANSIHYNSQSASCMLALNNYAVSADSSQDMPRSNFN 360

Query: 361  VELDAANATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS 420
            VELDAANATWL+NDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS
Sbjct: 361  VELDAANATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS 420

Query: 421  LFFLGSRLGDSLLVQFSSGVGSSGLASSLKDEVGDIEVDAPTAKRMRRSSSDALQDMVGG 480
            LFFLGSRLGDSLLVQFS GVGSSGLAS+LKDE GDIEVDA TAKRMRRSSSDALQDMVGG
Sbjct: 421  LFFLGSRLGDSLLVQFSCGVGSSGLASNLKDEGGDIEVDAHTAKRMRRSSSDALQDMVGG 480

Query: 481  DELSLYGSAPNNAESAQKNFSFAVRDSLINIGPLKDFSYGLRINADANATGIAKQSNYEL 540
            DELSLYGSA NN ESAQK FSFAVRDSLINIGPLKDFSYGLRINAD NATGIAKQSNYEL
Sbjct: 481  DELSLYGSAANNTESAQKIFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL 540

Query: 541  VCCSGNGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSSADSSRMVPDDDEY 600
            VCCSG+GKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGS ADSSRMVPDDDEY
Sbjct: 541  VCCSGHGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSIADSSRMVPDDDEY 600

Query: 601  HAYLIISLEARTMVLETGDLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVYESGARILD 660
            HAYLIISLEARTMVL TG+LLTEVTESVDYFV GRTIAAGNLFGRRRVIQVYESGARILD
Sbjct: 601  HAYLIISLEARTMVLVTGELLTEVTESVDYFVHGRTIAAGNLFGRRRVIQVYESGARILD 660

Query: 661  GSFMTQDLNMVVTCNESGNGSEGCTVLSASISDPYVLLTMTDGSIRLLVGDPSSCSVSVS 720
            GSFMTQDLN+VV  NESGN SEGCTVLSASISDPYVLLTMTDGSIRLLVGD SSCSVSVS
Sbjct: 661  GSFMTQDLNLVVNGNESGNASEGCTVLSASISDPYVLLTMTDGSIRLLVGDSSSCSVSVS 720

Query: 721  TPAAFGSSKKCVSCCTLYHDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCV 780
             PAAFGSSKKCVS CTLY DKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCV
Sbjct: 721  APAAFGSSKKCVSSCTLYQDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCV 780

Query: 781  ACYDSGDLEIFDVPNFVSVFYVDKFVSGKSHLVDFQMSDLQKNSEKLDRNSQELNSHGRN 840
            ACYD+GDLEIFDVPNF SVFYVDKFVSGKSHLVD Q+SDLQK+SE +D+NSQEL SHGRN
Sbjct: 781  ACYDNGDLEIFDVPNFTSVFYVDKFVSGKSHLVDHQISDLQKSSE-VDQNSQELISHGRN 840

Query: 841  ESSQNMKVIEVAMQRWSGKHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSMENS 900
            ESSQNMKVIEVAMQRWSG+HSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVS++NS
Sbjct: 841  ESSQNMKVIEVAMQRWSGQHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSIDNS 900

Query: 901  VSSSNMSSSRLRNLRFLRVPLDIQGRDDMPNGTLSRRLSIFKNISGYQGLFLCGSRPAWF 960
            VSSSNMSSSRLRNLRFLRVPLDIQGR+DMPNGTLS RLSIFKNISGYQGLFLCGSRPAWF
Sbjct: 901  VSSSNMSSSRLRNLRFLRVPLDIQGREDMPNGTLSCRLSIFKNISGYQGLFLCGSRPAWF 960

Query: 961  MVFRERLRVHPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTSSYDNYWPVQ 1020
            MVFRERLRVHPQLCDGPIVAF VLHNVNCNHGLIYVTSQGVLKICQLPSTS+YDNYWPVQ
Sbjct: 961  MVFRERLRVHPQLCDGPIVAFAVLHNVNCNHGLIYVTSQGVLKICQLPSTSNYDNYWPVQ 1020

Query: 1021 KVPLKGTPHQVTYFHEKNLYPVIISAPVHKPLNQVLSSMVDQDVGQVENHNLSADELQQT 1080
            KVPLKGTPHQVTYFHEKNLYPVIISAPV KPLNQVLSSMVDQDVG VENHNLSADELQQT
Sbjct: 1021 KVPLKGTPHQVTYFHEKNLYPVIISAPVQKPLNQVLSSMVDQDVGHVENHNLSADELQQT 1080

Query: 1081 YSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTAY 1140
            YSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTAY
Sbjct: 1081 YSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTAY 1140

Query: 1141 VQGEDVAARGRVLLFSVGKDVDNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIIL 1200
            VQGEDVAARGRVLLFSVGKD DNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIIL
Sbjct: 1141 VQGEDVAARGRVLLFSVGKDADNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIIL 1200

Query: 1201 HKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFG 1260
            HKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFG
Sbjct: 1201 HKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFG 1260

Query: 1261 SLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKF 1320
            SLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKF
Sbjct: 1261 SLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKF 1320

Query: 1321 LRLQMLSTTSDRASTTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV 1380
            LRLQMLST+SD+A +TVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL DAV
Sbjct: 1321 LRLQMLSTSSDKACSTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLGDAV 1380

Query: 1381 PHVGGLNPRSFRQFLSNGKVHRRGPDSIVDGELLCHYEMLPLEEQLDIAHQIGTTRSQIL 1440
            PHVGGLNPRSFRQF SNGKVHRRGPDSIVD ELLCHYEMLPLEEQLDIAHQIGTTRSQIL
Sbjct: 1381 PHVGGLNPRSFRQFHSNGKVHRRGPDSIVDCELLCHYEMLPLEEQLDIAHQIGTTRSQIL 1440

Query: 1441 SNLNDLSLGTSFL 1454
            SNLNDLSLGTSFL
Sbjct: 1441 SNLNDLSLGTSFL 1452

BLAST of Tan0008131 vs. ExPASy TrEMBL
Match: A0A6J1DTM4 (cleavage and polyadenylation specificity factor subunit 1 OS=Momordica charantia OX=3673 GN=LOC111023895 PE=4 SV=1)

HSP 1 Score: 2604.3 bits (6749), Expect = 0.0e+00
Identity = 1313/1362 (96.40%), Positives = 1336/1362 (98.09%), Query Frame = 0

Query: 92   MDGVSGASLELVCHYRLHGNVESMAILSSRGGDGSKKRDSIILVFQEAKISVLEFDDSIH 151
            MDGVSGASLELVCHYRLHGNVESMAILSSRGGDGSKKRDSIILVFQEAKISVLEFDDSIH
Sbjct: 1    MDGVSGASLELVCHYRLHGNVESMAILSSRGGDGSKKRDSIILVFQEAKISVLEFDDSIH 60

Query: 152  SLRTSSMHCFEGPQWLHLKRGRESFARGPVVKVDPQGRCGGVLVYGLQMIILKASQAGSG 211
             LRTSSMHCFEGPQWLHLKRGRESFARGPVVKVDPQGRCGGVLVYGLQMIILKASQAGSG
Sbjct: 61   CLRTSSMHCFEGPQWLHLKRGRESFARGPVVKVDPQGRCGGVLVYGLQMIILKASQAGSG 120

Query: 212  LVVDDEAFGNIGAISARVESSYLINLRDLDVKHVKDFVFVHGYIEPVMVILHEQELTWAG 271
            LV  DEAFG+ GA SARVESSYL NLRDLDVKHVKDFVFVHGYIEPVMVILHEQELTWAG
Sbjct: 121  LVAVDEAFGHGGASSARVESSYLSNLRDLDVKHVKDFVFVHGYIEPVMVILHEQELTWAG 180

Query: 272  RVSWKHHTCMISALSISTTLKQHPLIWSANNLPHDAYKLLAVPSPIGGVLVVSANSIHYH 331
            RVSWKHHTCMISALSISTTLKQHPLIWSA NLPHDAYKLLAVPSPIGGVLVVSANSIHYH
Sbjct: 181  RVSWKHHTCMISALSISTTLKQHPLIWSARNLPHDAYKLLAVPSPIGGVLVVSANSIHYH 240

Query: 332  SQSASCMLALNNYAVSADSSQDMPRSNFNVELDAANATWLLNDVALLSTKTGELLLLALV 391
            SQSASCMLALNNYAVSADSSQDMPRSNFNVELDAANA+WLLNDVALLSTKTGELLLLALV
Sbjct: 241  SQSASCMLALNNYAVSADSSQDMPRSNFNVELDAANASWLLNDVALLSTKTGELLLLALV 300

Query: 392  YDGRVVQRLDLSKSKASVLTSGIASIGNSLFFLGSRLGDSLLVQFSSGVGSSGLASSLKD 451
            YDGRVVQRLDLSKSKASVLTSGIASIGNSLFFLGSRLGDSLLVQFS GVGSSGL SSLKD
Sbjct: 301  YDGRVVQRLDLSKSKASVLTSGIASIGNSLFFLGSRLGDSLLVQFSCGVGSSGLTSSLKD 360

Query: 452  EVGDIEVDAPTAKRMRRSSSDALQDMVGGDELSLYGSAPNNAESAQKNFSFAVRDSLINI 511
            EVGDIEVDAPTAKR+RRSSSDALQDMVGGDELSLYGSAPNN ESAQKNFSFAVRDSLINI
Sbjct: 361  EVGDIEVDAPTAKRIRRSSSDALQDMVGGDELSLYGSAPNNTESAQKNFSFAVRDSLINI 420

Query: 512  GPLKDFSYGLRINADANATGIAKQSNYELVCCSGNGKNGALCILRQSIRPEMITEVELPG 571
            GPLKDFSYGLRINADANATGIAKQSNYELVCCSGNGKNGALCILRQSIRPEMITEVELPG
Sbjct: 421  GPLKDFSYGLRINADANATGIAKQSNYELVCCSGNGKNGALCILRQSIRPEMITEVELPG 480

Query: 572  CKGIWTVYHKNTRGSSADSSRMVPDDDEYHAYLIISLEARTMVLETGDLLTEVTESVDYF 631
            CKGIWTVYHKNTRGSS DSSRMVPDDDEYHAYLIISLEARTMVLETGDLLTEVTESVDYF
Sbjct: 481  CKGIWTVYHKNTRGSSVDSSRMVPDDDEYHAYLIISLEARTMVLETGDLLTEVTESVDYF 540

Query: 632  VQGRTIAAGNLFGRRRVIQVYESGARILDGSFMTQDLNMVVTCNESGNGSEGCTVLSASI 691
            VQGRTIAAGNLFGRRRVIQVYESGARILDGSFMTQDLN+VVT NESG GSE CTVLSASI
Sbjct: 541  VQGRTIAAGNLFGRRRVIQVYESGARILDGSFMTQDLNLVVTSNESGTGSESCTVLSASI 600

Query: 692  SDPYVLLTMTDGSIRLLVGDPSSCSVSVSTPAAFGSSKKCVSCCTLYHDKGIEPWLRMTS 751
            +DPYVLLTMTDGSIRLLVGDPSSCSVSVSTPAAFGSSKKCVSCCTLYHDKG+EPWLRMTS
Sbjct: 601  TDPYVLLTMTDGSIRLLVGDPSSCSVSVSTPAAFGSSKKCVSCCTLYHDKGVEPWLRMTS 660

Query: 752  TDAWLSTGVGETIDGTDGSLQDQGDIYCVACYDSGDLEIFDVPNFVSVFYVDKFVSGKSH 811
            TDAWLSTGVGETI+GTDGSLQDQGDIYCVACY+SGDLEIFDVPNF+SVFYVDKFVSGKSH
Sbjct: 661  TDAWLSTGVGETIEGTDGSLQDQGDIYCVACYESGDLEIFDVPNFISVFYVDKFVSGKSH 720

Query: 812  LVDFQMSDLQKNSEKLDRNSQELNSHGRNESSQNMKVIEVAMQRWSGKHSRPFLFGILTD 871
            LVDFQ+ D QKNSEK+D N QELN+HGRNE++QNMKVIEVAMQRWSG+HSRP+LFGILTD
Sbjct: 721  LVDFQIPDSQKNSEKMDGNCQELNNHGRNETTQNMKVIEVAMQRWSGQHSRPYLFGILTD 780

Query: 872  GTILCYHAYLFESTDSASKIDDSVSMENSVSSSNMSSSRLRNLRFLRVPLDIQGRDDMPN 931
            GTILCYHAYLFESTD+ASK D SVSMENSVSSSN+S+SRLRNLRFLRVPLDIQGRDDMPN
Sbjct: 781  GTILCYHAYLFESTDNASKNDASVSMENSVSSSNISASRLRNLRFLRVPLDIQGRDDMPN 840

Query: 932  GTLSRRLSIFKNISGYQGLFLCGSRPAWFMVFRERLRVHPQLCDGPIVAFTVLHNVNCNH 991
            G LSRRLSIFKNISGYQGLFLCGSRP+WFMVFRERLRVHPQLCDGPIVAFTVLHNVNCNH
Sbjct: 841  GILSRRLSIFKNISGYQGLFLCGSRPSWFMVFRERLRVHPQLCDGPIVAFTVLHNVNCNH 900

Query: 992  GLIYVTSQGVLKICQLPSTSSYDNYWPVQKVPLKGTPHQVTYFHEKNLYPVIISAPVHKP 1051
            GLIYVTSQGVLKICQLPSTSSYDNYWPVQKVPLKGTPHQVTYFHEKNLYPVIIS PVHKP
Sbjct: 901  GLIYVTSQGVLKICQLPSTSSYDNYWPVQKVPLKGTPHQVTYFHEKNLYPVIISVPVHKP 960

Query: 1052 LNQVLSSMVDQDVGQVENHNLSADELQQTYSVEEFEIRILEPEKSGGPWQTRATIAMHSS 1111
            LNQVLSSMVDQDVGQVENHNLSADELQQTYSVEEFEIRILEPEKSGGPWQTRATIAM SS
Sbjct: 961  LNQVLSSMVDQDVGQVENHNLSADELQQTYSVEEFEIRILEPEKSGGPWQTRATIAMQSS 1020

Query: 1112 ENALTIRVVTLLNTTTKENETLLAVGTAYVQGEDVAARGRVLLFSVGKDVDNSQTLVSEV 1171
            ENALTIRVVTLLNTTTKENETLLAVGTAYVQGEDVAARGRVLLFSVGKD+DNSQTLVSEV
Sbjct: 1021 ENALTIRVVTLLNTTTKENETLLAVGTAYVQGEDVAARGRVLLFSVGKDIDNSQTLVSEV 1080

Query: 1172 YSKELKGAISALASLQGHLLIASGPKIILHKWTGAELNGIAFYDVPPLYVVSLNIVKNFI 1231
            YSKELKGAISALASLQGHLLIASGPKIILHKWTGAELNGIAFYDVPPLYVVSLNIVKNFI
Sbjct: 1081 YSKELKGAISALASLQGHLLIASGPKIILHKWTGAELNGIAFYDVPPLYVVSLNIVKNFI 1140

Query: 1232 LLGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDCYATEFLIDGSTLSLTVSDDQKNIQIFY 1291
            LLGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDCYATEFLIDGSTLSLTVSDDQKNIQIFY
Sbjct: 1141 LLGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDCYATEFLIDGSTLSLTVSDDQKNIQIFY 1200

Query: 1292 YAPKSTESWKGQKLLSRAEFHVGAHVTKFLRLQMLSTTSDRASTTVSDKTNRFALLFGTL 1351
            YAPKSTESWKGQKLLSRAEFHVGAHVTKFLRLQMLSTTSDR STTVSDKTNRFALLFGTL
Sbjct: 1201 YAPKSTESWKGQKLLSRAEFHVGAHVTKFLRLQMLSTTSDRTSTTVSDKTNRFALLFGTL 1260

Query: 1352 DGSIGCIAPLDELTFRRLQSLQKKLVDAVPHVGGLNPRSFRQFLSNGKVHRRGPDSIVDG 1411
            DGSIGCIAP+DELTFRRLQSLQKKLVDAVPHVGGLNPRSFRQF SNGKVHRRGPDSIVD 
Sbjct: 1261 DGSIGCIAPVDELTFRRLQSLQKKLVDAVPHVGGLNPRSFRQFHSNGKVHRRGPDSIVDC 1320

Query: 1412 ELLCHYEMLPLEEQLDIAHQIGTTRSQILSNLNDLSLGTSFL 1454
            ELLCHYEMLPLEEQL+IAHQIGTTRSQILSNLNDLSLGTSFL
Sbjct: 1321 ELLCHYEMLPLEEQLEIAHQIGTTRSQILSNLNDLSLGTSFL 1362

BLAST of Tan0008131 vs. TAIR 10
Match: AT5G51660.1 (cleavage and polyadenylation specificity factor 160)

HSP 1 Score: 2166.0 bits (5611), Expect = 0.0e+00
Identity = 1074/1459 (73.61%), Positives = 1255/1459 (86.02%), Query Frame = 0

Query: 1    MSFAAYRMMHWPTGIENCDSGFITHSRADF---VPGVTSHTDELESDWP-ARREMGPVPN 60
            MSFAAY+MMHWPTG+ENC SG+ITHS +D    +P V+ H D++E++WP  +R +GP+PN
Sbjct: 1    MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVH-DDIEAEWPNPKRGIGPLPN 60

Query: 61   LVVTAGNVLEVYVVRVQEEGGS-ESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESM 120
            +V+TA N+LEVY+VR QEEG + E R+    +RGG+MDGV G SLELVCHYRLHGNVES+
Sbjct: 61   VVITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESI 120

Query: 121  AILSSRGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRES 180
            A+L   GG+ SK RDSIIL F++AKISVLEFDDSIHSLR +SMHCFEGP WLHLKRGRES
Sbjct: 121  AVLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRES 180

Query: 181  FARGPVVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNIGAISARVESSYLI 240
            F RGP+VKVDPQGRCGGVLVYGLQMIILK SQ GSGLV DD+AF + G +SARVESSY+I
Sbjct: 181  FPRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYII 240

Query: 241  NLRDLDVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHP 300
            NLRDL++KHVKDFVF+HGYIEPV+VIL E+E TWAGRVSWKHHTC++SALSI++TLKQHP
Sbjct: 241  NLRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHP 300

Query: 301  LIWSANNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCMLALNNYAVSADSSQDMP 360
            +IWSA NLPHDAYKLLAVPSPIGGVLV+ AN+IHYHSQSASC LALNNYA SADSSQ++P
Sbjct: 301  VIWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELP 360

Query: 361  RSNFNVELDAANATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIA 420
             SNF+VELDAA+ TW+ NDVALLSTK+GELLLL L+YDGR VQRLDLSKSKASVL S I 
Sbjct: 361  ASNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDIT 420

Query: 421  SIGNSLFFLGSRLGDSLLVQFSSGVGSSGLASSLKDEVGDIEVDAPTAKRMRRSSSDALQ 480
            S+GNSLFFLGSRLGDSLLVQFS   G +     L+DE  DIE +   AKR+ R +SD  Q
Sbjct: 421  SVGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRL-RMTSDTFQ 480

Query: 481  DMVGGDELSLYGSAPNNAESAQKNFSFAVRDSLINIGPLKDFSYGLRINADANATGIAKQ 540
            D +G +ELSL+GS PNN++SAQK+FSFAVRDSL+N+GP+KDF+YGLRINADANATG++KQ
Sbjct: 481  DTIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQ 540

Query: 541  SNYELVCCSGNGKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSSADSSRMVP 600
            SNYELVCCSG+GKNGALC+LRQSIRPEMITEVELPGCKGIWTVYHK++RG +ADSS+M  
Sbjct: 541  SNYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAA 600

Query: 601  DDDEYHAYLIISLEARTMVLETGDLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVYESG 660
            D+DEYHAYLIISLEARTMVLET DLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQV+E G
Sbjct: 601  DEDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHG 660

Query: 661  ARILDGSFMTQDLNMVVTCNESGNGSEGCTVLSASISDPYVLLTMTDGSIRLLVGDPSSC 720
            ARILDGSFM Q+L+   + +ES +GSE  TV S SI+DPYVLL MTD SIRLLVGDPS+C
Sbjct: 661  ARILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTC 720

Query: 721  SVSVSTPAAFGSSKKCVSCCTLYHDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQG 780
            +VS+S+P+    SK+ +S CTLYHDKG EPWLR  STDAWLS+GVGE +D  DG  QDQG
Sbjct: 721  TVSISSPSVLEGSKRKISACTLYHDKGPEPWLRKASTDAWLSSGVGEAVDSVDGGPQDQG 780

Query: 781  DIYCVACYDSGDLEIFDVPNFVSVFYVDKFVSGKSHLVDFQMSDLQKNSEKLDRNSQELN 840
            DIYCV CY+SG LEIFDVP+F  VF VDKF SG+ HL D  + +L+    +L++NS++  
Sbjct: 781  DIYCVVCYESGALEIFDVPSFNCVFSVDKFASGRRHLSDMPIHELE---YELNKNSEDNT 840

Query: 841  SHGRNESSQNMKVIEVAMQRWSGKHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSV 900
            S   ++  +N +V+E+AMQRWSG H+RPFLF +L DGTILCYHAYLF+  DS +K ++S+
Sbjct: 841  S---SKEIKNTRVVELAMQRWSGHHTRPFLFAVLADGTILCYHAYLFDGVDS-TKAENSL 900

Query: 901  SMENSVSSSNMSSSRLRNLRFLRVPLDIQGRDDMPNGTLSRRLSIFKNISGYQGLFLCGS 960
            S EN  + ++  SS+LRNL+FLR+PLD   R+   +G  S+R+++FKNISG+QG FL GS
Sbjct: 901  SSENPAALNSSGSSKLRNLKFLRIPLDTSTREGTSDGVASQRITMFKNISGHQGFFLSGS 960

Query: 961  RPAWFMVFRERLRVHPQLCDGPIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSTSSYDN 1020
            RP W M+FRERLR H QLCDG I AFTVLHNVNCNHG IYVT+QGVLKICQLPS S YDN
Sbjct: 961  RPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPSASIYDN 1020

Query: 1021 YWPVQKVPLKGTPHQVTYFHEKNLYPVIISAPVHKPLNQVLSSMVDQDVG-QVENHNLSA 1080
            YWPVQK+PLK TPHQVTY+ EKNLYP+I+S PV KPLNQVLSS+VDQ+ G Q++NHN+S+
Sbjct: 1021 YWPVQKIPLKATPHQVTYYAEKNLYPLIVSYPVSKPLNQVLSSLVDQEAGQQLDNHNMSS 1080

Query: 1081 DELQQTYSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLL 1140
            D+LQ+TY+VEEFEI+ILEPE+SGGPW+T+A I M +SE+ALT+RVVTLLN +T ENETLL
Sbjct: 1081 DDLQRTYTVEEFEIQILEPERSGGPWETKAKIPMQTSEHALTVRVVTLLNASTGENETLL 1140

Query: 1141 AVGTAYVQGEDVAARGRVLLFSVGKDVDNSQTLVSEVYSKELKGAISALASLQGHLLIAS 1200
            AVGTAYVQGEDVAARGRVLLFS GK+ DNSQ +V+EVYS+ELKGAISA+AS+QGHLLI+S
Sbjct: 1141 AVGTAYVQGEDVAARGRVLLFSFGKNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISS 1200

Query: 1201 GPKIILHKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSL 1260
            GPKIILHKW G ELNG+AF+D PPLYVVS+N+VK+FILLGD+HKSIYFLSWKEQG+QLSL
Sbjct: 1201 GPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLSL 1260

Query: 1261 LAKDFGSLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVG 1320
            LAKDF SLDC+ATEFLIDGSTLSL VSD+QKNIQ+FYYAPK  ESWKG KLLSRAEFHVG
Sbjct: 1261 LAKDFESLDCFATEFLIDGSTLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVG 1320

Query: 1321 AHVTKFLRLQMLSTTSDRASTTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1380
            AHV+KFLRLQM+S+         +DK NRFALLFGTLDGS GCIAPLDE+TFRRLQSLQK
Sbjct: 1321 AHVSKFLRLQMVSSG--------ADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQK 1380

Query: 1381 KLVDAVPHVGGLNPRSFRQFLSNGKVHRRGPDSIVDGELLCHYEMLPLEEQLDIAHQIGT 1440
            KLVDAVPHV GLNP +FRQF S+GK  R GPDSIVD ELLCHYEMLPLEEQL++AHQIGT
Sbjct: 1381 KLVDAVPHVAGLNPLAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQIGT 1440

Query: 1441 TRSQILSNLNDLSLGTSFL 1454
            TR  IL +L DLS+GTSFL
Sbjct: 1441 TRYSILKDLVDLSVGTSFL 1442

BLAST of Tan0008131 vs. TAIR 10
Match: AT4G21100.1 (damaged DNA binding protein 1B)

HSP 1 Score: 112.1 bits (279), Expect = 3.8e-24
Identity = 281/1343 (20.92%), Positives = 482/1343 (35.89%), Query Frame = 0

Query: 95   VSGASLELVCHYRLHGNVESMAILSSRGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLR 154
            +S   L+ +    L+G + +M +    G    + +D + +  +  K  VL++D     L 
Sbjct: 45   LSPQGLQTILDVPLYGRIATMELFRPHG----EAQDFLFVATERYKFCVLQWDYESSELI 104

Query: 155  TSSMHCFEGPQWLHLKRGRESFARGPVVKVDPQGRCGGVLVY-GLQMIILKASQAGSGLV 214
            T +M           + GR +   G +  +DP  R  G+ +Y GL  +I           
Sbjct: 105  TRAMGDVSD------RIGRPT-DNGQIGIIDPDCRVIGLHLYDGLFKVI----------- 164

Query: 215  VDDEAFGNIGAISARVESSYLINLRDLDVKHVKDFVFVHGYIEPVMVILHEQELTWAGRV 274
                 F N G    +++ ++ I L +L V  +K   F++G  +P + +L++         
Sbjct: 165  ----PFDNKG----QLKEAFNIRLEELQVLDIK---FLYGCTKPTIAVLYQDNKD----- 224

Query: 275  SWKHHTCMISALSISTTLKQHPLIWSANNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQ 334
                H              + P  WS NNL + A  L+ VPSP+ GVL++   +I Y S 
Sbjct: 225  --ARHVKTYEVSLKDKNFVEGP--WSQNNLDNGADLLIPVPSPLCGVLIIGEETIVYCSA 284

Query: 335  SASCMLALNNYAVSADSSQDMPRSNFNVELDAANATWLLNDVALLSTKTGELLLLALVYD 394
            +A   + +      A    D+  S +                 LL    G + LL + ++
Sbjct: 285  NAFKAIPIRPSITKAYGRVDLDGSRY-----------------LLGDHAGLIHLLVITHE 344

Query: 395  GRVVQRLDLSKSKASVLTSGIASIGNSLFFLGSRLGDSLLVQFSSGVGSSGLASSLKDEV 454
               V  L +     + + S I+ + N++ F+GS  GDS L++                  
Sbjct: 345  KEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL----------------- 404

Query: 455  GDIEVDAPTAKRMRRSSSDALQDMVGGDELSLYGSAPNNAESAQKNFSFAVRDSLINIGP 514
                                                  N +   K     + +  +N+GP
Sbjct: 405  --------------------------------------NLQPDAKGSYVEILEKYVNLGP 464

Query: 515  LKDFSYGLRINADANATGIAKQSNYELVCCSGNGKNGALCILRQSIRPEMITEVELPGCK 574
            + DF              + +Q   ++V CSG  K+G+L I+R  I       VEL G K
Sbjct: 465  IVDFC----------VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK 524

Query: 575  GIWTVYHKNTRGSSADSSRMVPDDDEYHAYLIISL--EARTMVLETGDLLTEVTESVDYF 634
            G+W          S  SS     D+ +  +L++S   E R + +   D L E TE   + 
Sbjct: 525  GMW----------SLKSS----IDEAFDTFLVVSFISETRILAMNIEDELEE-TEIEGFL 584

Query: 635  VQGRTIAAGNLFGRRRVIQVYESGARILDGSFMTQDLNMVVTCNESGNGSEGCTVLSASI 694
             + +T+   +     +++QV  +  R                                  
Sbjct: 585  SEVQTLFCHDAV-YNQLVQVTSNSVR---------------------------------- 644

Query: 695  SDPYVLLTMTDGSIRLLVGDPSSCSVSVSTPAAFGSSKKCVSCCTLYHDKGIEPWLRMTS 754
                 L++ T   +R     P+  SV+V+T  A                           
Sbjct: 645  -----LVSSTTRELRNKWDAPAGFSVNVATANA--------------------------- 704

Query: 755  TDAWLSTGVGETI--DGTDGSLQDQGDI---YCVACYDSGDLEIFDVPNFVSVFYVDKFV 814
            +   L+TG G  +  +  DG+L +   +   Y V+C D     I D PN+  +  V  + 
Sbjct: 705  SQVLLATGGGHLVYLEIGDGTLTEVKHVLLEYEVSCLDIN--PIGDNPNYSQLAAVGMWT 764

Query: 815  SGKSHLVDFQMSDLQKNSEKLDRNSQELNSHGRNESSQNMKVIEVAMQRWSGKHSRPFLF 874
                 +  F + DL   ++      +EL       S     V+  A +  S      +L 
Sbjct: 765  DISVRI--FVLPDLTLITK------EELGGEIIPRS-----VLLCAFEGIS------YLL 824

Query: 875  GILTDGTILCYHAYLFESTDSASKIDDSVSMENSVSSSNMSSSRLRNLRFLRVPLDIQGR 934
              L DG     H   F+   S  K+ D                                 
Sbjct: 825  CALGDG-----HLLNFQLDTSCGKLRDR-------------------------------- 884

Query: 935  DDMPNGTLSRRLSIFKNISGYQGLFLCGSRPAWFMVFRERLRVHP-QLCDGPIVAFTVLH 994
                           K +S        G+RP     F  +   H     D P V +    
Sbjct: 885  ---------------KKVS-------LGTRPITLRTFSSKSATHVFAASDRPAVIY---- 944

Query: 995  NVNCNHGLIY--VTSQGVLKICQLPSTSSYDNYWPVQKVPLK-GTPHQVTYFHEKNLYPV 1054
              + N  L+Y  V  + V  +C   S +  D+    ++  L  GT   +   H       
Sbjct: 945  --SNNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLH------- 1004

Query: 1055 IISAPVHKPLNQVLSSMVDQDVGQVENHNLSADELQQTYSVEEFE---IRILEPEKSGGP 1114
            I + P+ +   ++          Q +    +   L+   S EE E   +R+L+ +     
Sbjct: 1005 IRTIPIGEHARRICH--------QEQTRTFAISCLRNEPSAEESESHFVRLLDAQS---- 1052

Query: 1115 WQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTAYV-QGEDVAARGRVLLFSVG 1174
            ++  ++  + + E   +I    L  + T +      VGTAYV   E+   +GR+L+F V 
Sbjct: 1065 FEFLSSYPLDAFECGCSI----LSCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIV- 1052

Query: 1175 KDVDNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWT-----GAELNGIAF 1234
               +    L++E   KE KGA+ +L +  G LL +   KI L+KW        EL     
Sbjct: 1125 --EEGRLQLITE---KETKGAVYSLNAFNGKLLASINQKIQLYKWMLRDDGTRELQSECG 1052

Query: 1235 Y--DVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDCYATEFLI 1294
            +   +  LYV +     +FI +GD+ KSI  L +K +   +   A+D+ +    A E L 
Sbjct: 1185 HHGHILALYVQTRG---DFIAVGDLMKSISLLIYKHEEGAIEERARDYNANWMTAVEILN 1052

Query: 1295 DGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKFLRLQMLSTTSD 1354
            D   L    +D+  NI       +     +  ++    E+H+G  V +F    ++    D
Sbjct: 1245 DDIYLG---TDNCFNIFTVKKNNEGATDEERARMEVVGEYHIGEFVNRFRHGSLVMKLPD 1052

Query: 1355 RASTTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAVPHVGGLNPRSF 1414
                  SD      ++FGT+ G IG IA L +  +  L+ LQ  L   +  VGGL+   +
Sbjct: 1305 ------SDIGQIPTVIFGTVSGMIGVIASLPQEQYAFLEKLQTSLRKVIKGVGGLSHEQW 1052

BLAST of Tan0008131 vs. TAIR 10
Match: AT4G05420.1 (damaged DNA binding protein 1A)

HSP 1 Score: 95.5 bits (236), Expect = 3.7e-19
Identity = 88/329 (26.75%), Positives = 153/329 (46.50%), Query Frame = 0

Query: 1127 TKENETLLAVGTAYV-QGEDVAARGRVLLFSVGKDVDNSQTLVSEVYSKELKGAISALAS 1186
            T++      VGTAYV   E+   +GR+L+F V    D    L++E   KE KGA+ +L +
Sbjct: 777  TEDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EDGRLQLIAE---KETKGAVYSLNA 836

Query: 1187 LQGHLLIASGPKIILHKWT-----GAELNGIAFY--DVPPLYVVSLNIVKNFILLGDIHK 1246
              G LL A   KI L+KW        EL     +   +  LYV +     +FI++GD+ K
Sbjct: 837  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRG---DFIVVGDLMK 896

Query: 1247 SIYFLSWKEQGAQLSLLAKDFGSLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTE 1306
            SI  L +K +   +   A+D+ +    A E L D   + L   ++   + +   +  +T+
Sbjct: 897  SISLLLYKHEEGAIEERARDYNANWMSAVEILDD--DIYLGAENNFNLLTVKKNSEGATD 956

Query: 1307 SWKGQKLLSRAEFHVGAHVTKFLRLQMLSTTSDRASTTVSDKTNRFALLFGTLDGSIGCI 1366
              +G +L    E+H+G  V +F    ++    D     +        ++FGT++G IG I
Sbjct: 957  EERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTVNGVIGVI 1016

Query: 1367 APLDELTFRRLQSLQKKLVDAVPHVGGLNPRSFRQFLSNGKVHRRGPDSIVDGELLCHYE 1426
            A L +  +  L+ LQ  L   +  VGGL+   +R F  N +       + +DG+L+  + 
Sbjct: 1017 ASLPQEQYTFLEKLQSSLRKVIKGVGGLSHEQWRSF--NNEKRTAEARNFLDGDLIESFL 1076

Query: 1427 MLPLEEQLDIAHQIGTTRSQILSNLNDLS 1448
             L   +  DI+  +     ++   + +L+
Sbjct: 1077 DLSRNKMEDISKSMNVQVEELCKRVEELT 1085

BLAST of Tan0008131 vs. TAIR 10
Match: AT4G05420.2 (damaged DNA binding protein 1A )

HSP 1 Score: 95.5 bits (236), Expect = 3.7e-19
Identity = 88/329 (26.75%), Positives = 153/329 (46.50%), Query Frame = 0

Query: 1127 TKENETLLAVGTAYV-QGEDVAARGRVLLFSVGKDVDNSQTLVSEVYSKELKGAISALAS 1186
            T++      VGTAYV   E+   +GR+L+F V    D    L++E   KE KGA+ +L +
Sbjct: 756  TEDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EDGRLQLIAE---KETKGAVYSLNA 815

Query: 1187 LQGHLLIASGPKIILHKWT-----GAELNGIAFY--DVPPLYVVSLNIVKNFILLGDIHK 1246
              G LL A   KI L+KW        EL     +   +  LYV +     +FI++GD+ K
Sbjct: 816  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRG---DFIVVGDLMK 875

Query: 1247 SIYFLSWKEQGAQLSLLAKDFGSLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTE 1306
            SI  L +K +   +   A+D+ +    A E L D   + L   ++   + +   +  +T+
Sbjct: 876  SISLLLYKHEEGAIEERARDYNANWMSAVEILDD--DIYLGAENNFNLLTVKKNSEGATD 935

Query: 1307 SWKGQKLLSRAEFHVGAHVTKFLRLQMLSTTSDRASTTVSDKTNRFALLFGTLDGSIGCI 1366
              +G +L    E+H+G  V +F    ++    D     +        ++FGT++G IG I
Sbjct: 936  EERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTVNGVIGVI 995

Query: 1367 APLDELTFRRLQSLQKKLVDAVPHVGGLNPRSFRQFLSNGKVHRRGPDSIVDGELLCHYE 1426
            A L +  +  L+ LQ  L   +  VGGL+   +R F  N +       + +DG+L+  + 
Sbjct: 996  ASLPQEQYTFLEKLQSSLRKVIKGVGGLSHEQWRSF--NNEKRTAEARNFLDGDLIESFL 1055

Query: 1427 MLPLEEQLDIAHQIGTTRSQILSNLNDLS 1448
             L   +  DI+  +     ++   + +L+
Sbjct: 1056 DLSRNKMEDISKSMNVQVEELCKRVEELT 1064

BLAST of Tan0008131 vs. TAIR 10
Match: AT3G55220.1 (Cleavage and polyadenylation specificity factor (CPSF) A subunit protein )

HSP 1 Score: 48.5 bits (114), Expect = 5.2e-05
Identity = 83/364 (22.80%), Positives = 140/364 (38.46%), Query Frame = 0

Query: 298 WSANNLPHDAYKLLAVPSPI---GGVLVVSANSIHYHSQSASCMLALNNYAVSADSSQDM 357
           WS N + + A  L+ VP       GVLV + N + Y +Q    + A+           D+
Sbjct: 223 WS-NPVDNGANMLVTVPGGADGPSGVLVCAENFVIYMNQGHPDVRAV------IPRRTDL 282

Query: 358 PRSNFNVELDAANATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGI 417
           P     + + AA          L+ T+ G++  + L ++G  V  L +       + S I
Sbjct: 283 PAERGVLVVSAAVHKQKTMFFFLIQTEYGDVFKVTLDHNGDHVSELKVKYFDTIPVASSI 342

Query: 418 ASIGNSLFFLGSRLGDSLLVQFSSGVGSSGLASSLKDEVGDIEVDAPTAKRMRRSSSDAL 477
             +     F  S  G+  L QF + +G          E  D+E           SSS  L
Sbjct: 343 CVLKLGFLFSASEFGNHGLYQFQA-IG----------EEPDVE-----------SSSSNL 402

Query: 478 QDMVGGDELSLYGSAPNNAESAQKNFSFAVRDSLINIGPLKDFSYGLRINADANATGIAK 537
            +   G +   +   P   ++  +       D + ++ PL D               I +
Sbjct: 403 METEEGFQPVFF--QPRRLKNLVR------IDQVESLMPLMDM----------KVLNIFE 462

Query: 538 QSNYELVCCSGNGKNGALCILRQSIRPEMITEVELPG-CKGIWTVYHKNTRGSSADSSRM 597
           +   ++    G G   +L ILR  +    +   +LPG    +WTV  KN           
Sbjct: 463 EETPQIFSLCGRGPRSSLRILRPGLAITEMAVSQLPGQPSAVWTV-KKNV---------- 522

Query: 598 VPDDDEYHAYLIISLEARTMVLETGDLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVYE 657
               DE+ AY+++S    T+VL  G+ + EV +S   F+      A +L G   ++QV+ 
Sbjct: 523 ---SDEFDAYIVVSFTNATLVLSIGEQVEEVNDS--GFLDTTPSLAVSLIGDDSLMQVHP 523
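
Each HSP summary line above gives identity and positives as a fraction followed by a percentage. As a consistency check, the percentage can be recomputed from the fraction. The following is a minimal Python sketch, not part of any BLAST tool: the function name parse_hsp_stats and the dictionary keys are hypothetical, and the pattern assumes the line layout shown in this report (it also accepts the "Postives" spelling that some report renderings emit).

```python
import re

# Matches summary lines such as:
#   "Identity = 88/329 (26.75%), Positives = 153/329 (46.50%), Query Frame = 0"
# "Posi?tives" also accepts the misspelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP summary line and recompute its percentages (hypothetical helper)."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "align_length": int(ident_len),
        "identity_pct_reported": float(ident_pct),
        "positives_pct_reported": float(pos_pct),
        # Recomputed from the fractions, rounded to two decimals like the report.
        "identity_pct_computed": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct_computed": round(100 * int(pos) / int(pos_len), 2),
    }


line = "Identity = 88/329 (26.75%), Positives = 153/329 (46.50%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats["identity_pct_reported"], stats["identity_pct_computed"])
print(stats["positives_pct_reported"], stats["positives_pct_computed"])
```

If the reported and recomputed values differ by more than rounding error, the line was probably damaged during extraction.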

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
Q9FGR0          0.0e+00   73.61     Cleavage and polyadenylation specificity factor subunit 1 OS=Arabidopsis thalian...
Q7XWP1          0.0e+00   66.05     Probable cleavage and polyadenylation specificity factor subunit 1 OS=Oryza sati...
Q10570          4.2e-185  31.97     Cleavage and polyadenylation specificity factor subunit 1 OS=Homo sapiens OX=960...
A0A0R4IC37      1.2e-184  31.52     Cleavage and polyadenylation specificity factor subunit 1 OS=Danio rerio OX=7955...
Q10569          6.1e-184  31.33     Cleavage and polyadenylation specificity factor subunit 1 OS=Bos taurus OX=9913 ...

Match Name      E-value   Identity  Description
XP_038887722.1  0.0e+00   96.70     cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Benincasa ...
XP_022973592.1  0.0e+00   96.08     cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Cucurbita ...
XP_023520837.1  0.0e+00   95.87     cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Cucurbita ...
XP_008441850.1  0.0e+00   95.80     PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X1 ...
KAG7011474.1    0.0e+00   95.60     Cleavage and polyadenylation specificity factor subunit 1, partial [Cucurbita ar...

Match Name      E-value   Identity  Description
A0A6J1I7X9      0.0e+00   96.08     cleavage and polyadenylation specificity factor subunit 1 isoform X1 OS=Cucurbit...
A0A1S3B4E8      0.0e+00   95.80     cleavage and polyadenylation specificity factor subunit 1 isoform X1 OS=Cucumis ...
A0A6J1EY93      0.0e+00   95.53     cleavage and polyadenylation specificity factor subunit 1 isoform X1 OS=Cucurbit...
A0A0A0LKI9      0.0e+00   95.32     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G074280 PE=4 SV=1
A0A6J1DTM4      0.0e+00   96.40     cleavage and polyadenylation specificity factor subunit 1 OS=Momordica charantia...

Match Name      E-value   Identity  Description
AT5G51660.1     0.0e+00   73.61     cleavage and polyadenylation specificity factor 160
AT4G21100.1     3.8e-24   20.92     damaged DNA binding protein 1B
AT4G05420.1     3.7e-19   26.75     damaged DNA binding protein 1A
AT4G05420.2     3.7e-19   26.75     damaged DNA binding protein 1A
AT3G55220.1     5.2e-05   22.80     Cleavage and polyadenylation specificity factor (CPSF) A subunit protein
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                                                     Source   Source Term    Source Description                                         Alignment
None       No IPR available                                                    COILS    Coil           Coil                                                       coord: 817..837
None       No IPR available                                                    PANTHER  PTHR10644      DNA REPAIR/RNA PROCESSING CPSF FAMILY                      coord: 2..1211
None       No IPR available                                                    PANTHER  PTHR10644:SF2  CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR SUBUNIT 1  coord: 2..1211
IPR018846  Cleavage/polyadenylation specificity factor, A subunit, N-terminal  PFAM     PF10433        MMS1_N                                                     coord: 130..707; e-value: 2.0E-35; score: 122.4
IPR015943  WD40/YVTN repeat-like-containing domain superfamily                 GENE3D   2.130.10.10    -                                                          coord: 53..445; e-value: 9.5E-71; score: 240.4
IPR015943  WD40/YVTN repeat-like-containing domain superfamily                 GENE3D   2.130.10.10    -                                                          coord: 974..1216; e-value: 1.3E-34; score: 122.0
IPR004871  Cleavage/polyadenylation specificity factor, A subunit, C-terminal  PFAM     PF03178        CPSF_A                                                     coord: 1086..1214; e-value: 2.5E-23; score: 82.9

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Tan0008131.1   Tan0008131.1   mRNA
Tan0008131.2   Tan0008131.2   mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006378      mRNA polyadenylation
cellular_component  GO:0005634      nucleus
molecular_function  GO:0003684      damaged DNA binding
molecular_function  GO:0005515      protein binding
molecular_function  GO:0003676      nucleic acid binding