MC09g0858 (gene) Bitter gourd (Dali-11) v1

Overview
Name: MC09g0858
Type: gene
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: transcription initiation factor TFIID subunit 1
Location: MC09: 9689868 .. 9738252 (-)
RNA-Seq Expression: MC09g0858
Synteny: MC09g0858
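
The Location value above packs the chromosome, start, end, and strand into one string. A minimal sketch of how it could be parsed is shown here. The names `Location` and `parse_location` are hypothetical helpers, not part of any database API. The length calculation assumes the coordinates are 1-based and inclusive (the usual GFF-style convention), which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for one genomic interval."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings shaped like "MC09: 9689868 .. 9738252 (-)".
_LOCATION_RE = re.compile(r"^\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> Location:
    """Parse 'seqid: start .. end (strand)' into a Location."""
    match = _LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("MC09: 9689868 .. 9738252 (-)")
print(loc.seqid, loc.start, loc.end, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 48,385 bp on the minus strand of MC09.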
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptide, CDS, utr3
TACATACTTTATATTTTGGTTGCAGATGACGATGACTATGAGGATGCTGGTGGTGGCAATCGATTCTTGGGGTTTATGTTTGGAAATGTGGATAATTCTGGTGATCTTGATGCTGATTATCTCGACGAGGTAATAATTTTGCTTCTCGGTGGCATTTTTGTAGAATTAGATTTGTTACAGGACAAACGTATGTAAGCTACGAACTGATTCTAGTAGAAGATTGCTGCACCATAGATTTGGACAAATATAGTTGATATCTTCCACATATGATTTTTGGCAGTCCGAAAATTCTGGATAAAGACTCAACTTGTTACACTTGAAATGATATATTTTTTTTTCAATAATTTGAACTTTCATCTTATTTTCACTTCCTTCTGATCAATATCATATTAATAATGCTGAGGGTATAATGTTGTGTTTTAGGATGCTAAGGAACATCTTGATGCATTGGCTGATAAGCTGGGTTCAACTTTGACAGATATCGATGTAAGAAAACTTTTATGATCCTGAATATTATATTGTTTTCATTTGGTTTTGGGGTCTCCTTATGTAGCTTTTTAGTTTGTATATAATCTCTCATTTCTCCTCGATAACATACATGATACAAATACTATATCTATTTGCGTTTGATAAGTTCTTACTCCCTTCCTACCTTTTTCTTTGATACCTGAAAATTTTCAAATTTTTCAGTTGTCAACGAAGTCAGCAAAAACACCATCGGACGCTGTTGAACCAGGTGAAGTTGTATTCATGTGAAAATTTACCTAGTAGTCTTCCCGAGTTTTGTTTTTGTGTTTATGCCTTTATTTCAAATACATTTTTTTGACACTATTACATGCACAATTTTATGATCATTAGAATGAAAGTTATTTATTTATTTATTTTTAAAACAAGAAAGAACTTTTCATTGATATAGATATATGAAAAGGAGAAAAAGTTCAAGGATACAAACTCCCGAAAGAGTGAGAGAAAGAAAAAATGTGAAAGAGAACAGAAAAGAGTCTCAACAAGGAGCAAGCATTACAGAGAAAATAAGAACAATAAAAATCAAAGACCATGACATAAGAGACCAAAAAGCTTCATAGATGAATCTTGAATCATAAGAGAAACCAAAGCCCTTCAATCAACAAATCTTGACCGCCAAAGGAATTGAAAATCTACTTGGACGAAGTGGGAATTGAAGGTGGAGAATCTTCTAAATCAATCTTCTATTCCTTCAAATAATATTTAGTAATCTGAAACATCTTTTCGAGATAAGAGACATAAAGGCGAGCTTTTTCTTCACGAATTCTGGCTAACCCTTCTTTCTGAGAAGTGATATAATCATCCAATTGAGAAGGAGGCACACTAAAAAGTAACTTGAGAGTTAAGAGAGAAGAAGGCCTACTTCCAAAGTAACAAAGATTGTGGATCGAACTAGAAGATTTGGAAGGGAGTAGAAGAGAACCCAAAGGAATTGCCGTGAAAGAAGGCTGTGAATTCAAAAAGATGAAAGTCTCAACACCAAATAGAAGCCATTAAATAACTGAACTGAGAAGAACAGCAGTCAACCATCTACTTTTTCTAAAAGGGTCTTCAATTTCAAAGGCCCAAAACTTTGTGGTGACCCTTCGAATACTGAGAAAAAACCAGATATTCTAAAATCAATATTCCACCTATCAGGAATATTGATGTTCCCACCTATCAGGAACATCAATATTCCACCAAAATCGAAGGGAAGATTTGAAAGACGGGTGAGAGAGCACATATTCTAAAACCTAGGGGCAAGGACTCTAGTTAGGAGAACCAGAATCCAGACAGATTGGCCATGGTTGAAAGTGACCCTAGGACCTAGGACATGATTAACAGTCAAGAAAGAGTCCATCCTCGACTTGTTTGACCTGCAAATAATAGAACAGTTTGGGATACTATAAGTTTTGGGGTTATGTTTTGGTCCTTGAGAATGTAATAATTACCTAATTGCTACTAGAGGTCCTAGGATAATATTCCTGTGTGTGTTT
TTACTCCTGGTTTAGTCTTATATGAGGATTTAAAAGACAGAAGTATAGAAGCTTCACCTTGAGTACTTCATCTTGATGATGTATATTTTATTATTTAGTTCATTCTGGTTTCTTATGCAAGAATTAAGTAATAATTTTAAACTTCATGCTAGATAGTTACAATCCACTAAGACAAATGATTTTTCTTGGCCAACCTGATGTAACCTAGGGTCTATTTAAATATTTGAGTCTTTATTCAACACCGAACTTAAAATAATCTTTGTAAAGCATGACACCCCATCTTATAGGCTATCTGCTGACTAACTGAACCAATTAGTAACCATACTAATTAAGACACTAACGGTAAGCTACTTACCTAATCAAATAAGTGGCATACTAGGGCAACATGGACTCTTAAAACCCCTAATCAGTAACAGGTGAACAGCTCACGCTATATCATATCCATGGGGCAGCATTCAAGCATATAGGATTATATAGTCTTGGTGTATGTTTTAAGTATGCTGTTCACCTGTACACTGCCATTGTTATCTGACTTTCTTTGATTGGGCTGGTGGGTTTTGTTTGGGTATTTGTGTTCTCAGACTATGATGCAAAGGCTGAAGACGCAGTTGATTATGAAGATATTGATGAAGAGTACGATGGTCCAGAGATTGAAGCTGCTGGTGAGGAAGATCATTTATTGCCAAAAAAGGAATATTTTTCTACTGAAGTTTCTTTGGCTACGCTGGAGCCCACAGTTTCTGTATTTGATGATGAAGACTATGATGAGGACTTTGAAAAGGTGCATGATGTTATAAATAGCAGTGTCGAAGCTCGAACTACCCATGCGTCAGGTATAGTATCTCTCATCAGTCTCTACATTATTGCTAACATACTTTAACCTGCTACTGAAGAAGCATCCCATGCAGCTAGAAGTTTTCTAATTGTGTTCTTTTTGGTTTTCACGATGGTTTTCTCTTTCCTTTTTTTTTTCTCTTTCAATTTTCTTTATTACAGAATGGATTTTGAGTTGAGTTAACTAGTTGTACATTTGCCGTGCCTGGTACTTTGTCTTTCCATTGGTGGGAGTTTTTTTAGTACAATTATCATTTTCTTATTTTACAAGATTGGATTATAACTTACAAAAGCTAAATCTTTGTTTACTCTTTAAGATTGTCCTATTTCTTATGCTATTTTCTCCCAAAATTTTTTTTCTGTGATGAAAGGGTATTTTATGGTATTCATATAAATAGAAAGGCTTTTTTGGTGAGCTTTTCTGGAATATATAGTCAATTTTGAATTATGAAGTTGGGTGTGTGTTTCTTCTTTGGAATGGTAAGTAGTAATGGTGTTTATAGCATTTGCATTAACAGTTATATGGTTGAATGAATTTGGTGAAAGTGCCATAAACTTCTTAGTGGTTCTAGTTTCTTTCTTAAGGGTTTAATAGATCAACTTTTAGTAGATTACAGTCATGAAAATTTGACAATCTCTCACCAATAAGCCTTTCAAAATGTTGTGTATACTGTATAGGCATTCTTCATAGTCTCAATGAAAGTAGGTCTTTCATCCAATAAAAAATGTTCCGTAGTTTGCGATGAGCTCAGTGCCATATTAACCATGTTAGAAATATGAGTTCAGTGCATATTATGGCAGATAAGGAAGGATGACGATCCTTGATTGGTTAATAAGGCTTTCCTCTACAAAGGCCATCGGGTAGCCATTTTAAAACAATTGGATGCTATTAAGGCTAGATCCTTATATCAATTCGGGAATGGTAATATTGCTAGCTTCTGGCATAATTTTGTTTGGCGGGAAAGCTGTCTCCCATATGCTGTCCAATATTGCCCTCTCCACCAGAGAGGACAAATGGATTTAGGAATGCAATCCATCAAAGAGCTTCTCGGTTAAATCTATGGTCAAAGAGCTGATTAACAAGTTGACTCCCATTCAGTGTTATTGAGGCGCACTCGGGTGCTCGCCTCAGGCAAGAGGCGAGGTGATTTTGCCTAGGTGCGCCT
TGCAAGGTGCCCAAGGCGAGCGCCAATCGGGCGCTCACCTTAGTGCGCCTCGATGTGCCTCTCGCCTGAGGCTAGGCGATTGCCTTATTGAAGCGAGGCGATCTGCTTTATTTTATTTATTTTTAATGTGTTTTTTTTTTTAAAGAACCTTTCTCCCTTGCGAAATACCTCAGTCCACAAACCCTAATCATTCTCTCTGCCTCTCGACATTCACCTTCACTACCCTCAATCTCCGGCCATCGCTCGCTCCCGTCTCCAACTAAACTTATTAAATTTAGTTGTTTTATGCATCTGGTTGACATTTATTATGACACTATTTTGGTATTAATTTTTTAAACTATGTTTACTATATAAATTTATGCATTATATATAAATTTTATTTATTTATTAAGGTGTGCCTCGCTTCACTCGGGCGTCGCCTTTGTATCGCCTCTCGCCTCAAGATGATCAAAGGGCTTGTCGCCTTGAGGTGTGCCTTACGCCTCGAAAAACCACACTTTTATTTTTTGTTGTTGGGAACTTAGCCATGCATCCATCATCACACTAGACGTTGTGCAAAGGAGGCTCCCAAACATGGCCATCTCTCATATGGCTGTATCGTGTGTAGTAAAAGTGCAGAGCCATTCATGAATCACGATAAACTCAACATCCTCCTTAATGCTTTTTTTGGTTGGTCATGGGTTTAACTTATCAAAGCTTTCTTGAAATCTTTTTTTGTTTGTTTGTTGATATCTGTGAGTGTCCGGGCCAGCTTTATTACGCGCACCTGGACTAATCTCACGGGGCAACTGCCTGATCGTACAACATTTTGTGCCGGGAAACGCATAGGAATTACTAATTTCTAAGATAGGTGTTCTTGAAATCTGTTTTTGCAAAGGCTCTTAGTTATTGTAAATCCTTTTAATATCCTTCTAAAAATTTTGATGTAGTTAAGTAGCTGCTTCATAGCTTTTCTTGTTTATTAGCTACTGTTATAGCTTTTATTTTAGTAGCTTTACTTTTACTGCACAGTAACAATAGTTACTCATGTAATTCTTATGCTTGAAACCTCTATAAATAGGTTTCGGCGTTTTAATGAAGAGGCATTGAATCATTTCTTAAAAAACCTTGTGAGGTACTGCATCAAATTGGTATTAGAGCACCAAGATTCTTGGGCTCTTTTTTGACATGGCAAGCAAGAAAGCTCCCACCAATGTCGTCGTAGGAGACTCTTCTCTTGTCAAGAAAGAGGCGGATCAAACCACTGCCCTCTCACCACGAGCCACAACCAAACGTTTGCCTTCGATTGAAGGAGCTATGGAAGATATCAAAAGGAATGTAGGTGAGATATGACAACTTTTGAATAAGATAGCTTGGCGACTCGAAGATTCAAACTTGCAACAAAATGAATCAAGAATGGATCAAGAAGAACAACAAAGAGGACAGGGTATTAAAATTCTTCAACACCAACGGAGAAGATTTCAAGAACCTGTTTTTGCACCAAGAATTCTACAAGAACTGCAAGCTAATCGGCAAGAATCCAATCGAAATCCTTTGTTAAGGAGAGATGAAATACATCTAGATTCTTCAATTAACCATCATGGGATGACTTAAGTGGTCAATGAGGGCCAATGTTAATAGCAAAGGACTTAAAGGGAATGAGTTCAAATCATGGTGGCTACCTATCTAGGATTTAATATCTTACAAGTTATCTTGTCAGCCAAATGTAGTAGGGTCAGACGGTTGGCCCGTGATTAGTGGTGGTGTGCGCAAGCTGGTCCGGACACACGGATTTCATAAAAAAAAAATAATCTAGATTCTTCAAGTGAGGAAGATGAGATCCTTAACTTGGGAGAAGATAGACAATTGATTAGATCCAACCAGCAGCAATTTTATGATACAAAAGACATTAATCTTCATCAAGAACAACAAAAGCAACCTATATTAACAACAAGACAACACCAGGAATAGGGGAACCAGTCCACAGAAAATATTATGAATTTTCTCCACCACGATAGATC
TTGCATTCAAGAATCCAACTTTTCTAGCAAAGAAGAAGAGTTTTTCTATGGGTTTGTTGTAGATTATTTTGATGACCCTAAAGAAGATTCTCATAGGAGTGAAAAGATTGAAGGAGATGAAGGTTCTCCTATTGAACTTCAAGAACATGGCAAGTAGACTTCAAGAATGTAGATCCAAGAAGGAGAAAGATTCAATGTGTGGGAAAGTTGAGAACCAAACTCATTCGGCCAAGACAATTTCTAGTTATTCCAAGAAAATTTTGAGAAGTATTGGGATGAAGAATAAAGTGGGAAGTTTTGAGAAGGATCCCAAGGAAAAGAATGTTGATGATTTGGAACACACTTGTATCAAAAAACTATTGCATCTGGGATTTAATTCAGTTTGGTCACTTTATCGTTGTTTTGATGAGTTAAAAAAGACATTTTCGGTTGGAGTATTTTTTTAACCACAAGAAGAAGCAAGTTTCTAGTTTGAGGATTGTTCACTGCCTTCTAACTTCACTGCTTTTAAACTCAAGGACGAGTTTCATTTTAGGGTCGGGGAAATTGATGTAGTTAAGTAGCTGCTTCATAGCTTTTCTCGTTTATTAGCTACTGTTATAGCTTTTATTTTAGTAGCTTTACTTGTACTGCACAGTAACAATAGTTACTCATGTGAGGTGCTGAATCATTTCTATAAATAGGTTTCAGCCTTTTAATGAAGAGGCATTGAATCATTTCTTAAAAAACCTTGTGAGGTGCTGCATCAAGTTTCCTCATCTCCGAGTCCAGTCCCTTGATTTGGTCCCTGTGGTTTTTGGGTGCACTGCTTTTTTCCATATCCTTTTTTATTTTTTTTTATTTTAAGAAACAAACTTTTCATTATAGATGATTAGGAAGAAAAATGTTCAAGGATACAAACTCCCAAAGGAAGTGAAAAAGGAAAATAGAGCTAAATAAAAAACAAATACCAAACGAGCAAAAGGAAAAGGGTCTAGAATGAGCAAAAGGATATAATAAATCTATCTTTCCTGAAAGGTTATTAACAATAATGCCCTATTGTGGTACCTCATGTTGATGATTCAGACTTGGCAATCGCGGTAAGGAAAAGGGTCTAAAGGTGTACCACACACCCTATTGGGAAATATGTGTCATATGATGGTTTATCGAAGTATTATCGAGCTTATTGTTGATTGTGTTCAAGTATCAGCCAACATCCATGAAACTCTTAAGCATCCTAGTAAGAGAAAAGCAGTCCAACATGAGATCGACACACTTGAAAAAATGGCAGATGGGTAGTGACAAATTTGCCCAGTGGGATGCAAGTGGATTTTAACTCTAAAGTATAAGAGAAATGGGAGCATTGAAAGATTGAAGGCTAGACTTGTTTCTCGAGGATTCACTCAATCCTATGGGATTACTACTAGGAAACATTTACTTTGGTAACTAAACTCAACACTATAAGGATAATCTTATCTCTAGCTGCCAATCTTGATTGGGGTCTTCAATAGTTAGACAGTAAAAATGCCTTCTTGAATGGCAATTTCAATTTGGAAGAAGAGGATTACATGGATATTCCTCCTTGTTGTACGGGGAACTTTAATCCCACATAGGTATGCAAGCTTAAGAAATTGCTCTACAGATTGAAGTAGTCTCCTTGTGTATGGTTCAGTAGTTTTGCAACTTTTATTAGAGCCTTAGATATTCTCGGTGCCAATTAGATCATACTTTGTTCGTCGAAAGGAGATTAGAAACTCAAATTGCTCTCTTAATAGTATATGTTGATGATATTATTCTTTTAGGAGATGATGAAACAAAATTGCAGATGTTAAAGAATCACTTTTCCCAATAATTTAAGGTCAAGGATGTAGGGGTTCTCAAATATTTCTAGGTACGGAGGTTGCTCAGTCTCATAAAGGTTCTTTTTGTCTCGCAAAGAAAATTTGTGCTATATCTTCTTGAAGAAACTGACATGCTTGGGTGCAACCAACTGATTCTCCTGTGGTCTCATGAGCAAAACT
TGGCAACTTTAGTATAAGTAAAGTAGCTGGAGTAGTTATAAGTTAGGTTGTTAGCTCACAACTTAGCTTTAAATGATCATTGTAAAAAAGCAAGGGTCCTTTAGGGTTGAGCTGGTAGAGCTTGCTTTTGGAGTATGTGGAAGGAAAGAAATCAAAGAGTGTTCAATGACACTTCTCACCTTTTTGAAAATTTTTGTGAAAATTTACAAATTAATACGGCCTCTTGGTGCCTTACTTACAGGAGATTTTTTTTTGTAATTACAGTCAAGCTTCTCTTTCACGAGATCGGCAAGCTGTAATGTGGTAGCTTTTTTGGTTAGGGCCTCCTCGGCCCTCGCCCGTAGGTTGTTCTCCCCTTTTTGGTTATTATTTATATTATATTTTGTCTCTTATCAAAAAAATAAAATAAAAGACCGTTCGTGAAAACAACCAGCAGTGGCAGTTCTCCCTTAACCCAATAAAATAGGGGATTGTATGATATTTTCAGATGATGAATAATAATTGAACTTTTCTCTATTTGAGTCTTCACCATGGACTCAATCACGAGCGCATGTCAATTCACAGGAATAAATACCAATGCCTTTTCGGGCGGCTCAACTATCTTTCTCATACATTTCCAGTCTACATAAAAGCTGTCCAGGTTACTGTTTGTTTGTATAAGGTAATCTTGTAAAATGGAGAAGTCAAAGGCATTTGTGGTTGCAAGAAGTAGTGCTGAGGCAAAATTTACAACTTTGGCTCATGGCATGTGTGAGGGCATATGGTTCAAAAGAGTCTTAAAGGGAGTTGGAAAAAGGATATATATATATATATATATATATATATTTTTATGAGAAACAATGCTTTCATTGATAAAAATGAAAATAGTTACAAAAGAACAACATAGGCATACAAAAAGACAGCCCAAAAAGAGCTCGCCCCCTACAGAAAAGGACTCCAATCCAAAAGAATGAGGCCTAACTGATGACCTTAGTGATCGAACCCCATAAGGAAGTATTATATCTAATAAGTTCCCAAACATAATCCTCCGATCTCTCCACCCCCTAAACAGTCTATTGTTCCTCTCTAACCAAATACCCCATAAAATAGTCAGAAAACAAGCCTGCCACACAAATGGAAAAGGGATATTTATCACCAATACAAATGATGTATGATTGTTTCAATCAAGTTGCAATAAGGATTGCAAAGAATCCAATTCATCATGATAGGATAAAATATGTTGAAATTGATTGACACTTTATCTTAGAAAATGTCCCTAATGAAGTGGTTCAACTCAATTATGTAGCTACAAAGCAACAGATGACAGATATTCTCTCTAAGCGATTACCAAGACCTAACTTTGAGGATCTAAATAACAAGTCTGGTTTATGATATATACAAACAACCCATCTTGAGGAGTGTTTAAAATGTGTGTTAAATTAGGGCATTCTTGTCAATAACCTTTTGGAAAGTTAGATTTATTATATCTTTTTAATTAGATTATTTTGCTTGCCTTTCCCTTTTTTATTTATCAAGGTGCAATCTGTAGTAGTAAATAATAACGTGAAAATAAATTTTCTCCCGAAAGCAACACTTATGTAGTTAATTATCACATTCTTGTAGCAAACCCTTTACTCATGTTTAGGGAAAAAAATGTTTGTGGTTTTTTCAGGTTATAAGTTGTAAGAGAAACAGAGTAATCAAGTTTAAATTACTTATTTAGTTTTCTTATGAATGTTATAGTTTCTTTCTTTTATTCATTTTCTCTGCCAACATTAAGTATTGATATTTGACTTATTTACCATGACTCTTCCAGATGAGAAAGGTGAGTGCCTTGAGGTGGCTTATGAAGGAGAAAAATCTGTTGCAGATGATGATATACAATCTGCTTCTCTCAATAATGAAGTTATAACCAGTAGTGCAGAAGAATTGCTCGAGGTCTGTTTATGTTTATCTTATATTTGATTTTTGCATTTTATATATGTTTGCTAAACGTGTATTAATTTCTTGACGTTTCTAAT
GTTACTGGATTTTGTCGTGTTATCTTTTTTCTTTTATCGTTCTCTGGATGAATTTTGTGAAAGCTTTCTTTTGGAACACTTGACAAGAAAGGAATATCGAAACTTTCAGAGACAAGGAGCGATCTTTTGATATATATATATATATATATATATATATTTTGAAGCATACTCTTACTAGTTGTATCTTGGTGTAAATGCTCCCCTTTTCATCATTATTGCTTTTCTAATCTTTTATCCAATTGGATACTTATTTTATAACTCCTTTTTCATGGGAATCTCCCCTTTTGTAATTTAATCTCATCGATAAATTGTTTCTCATAAAAAAAAAAAAGAAACTTTTTGTGACTAGAAACTTAGTTTATTGATTTTTTGTGTCACCTTGAACTATCATATCTTAACCTGATGTACATTGTGAAAGAGAAGTGTGGTAAGTGTACCTAAGAAATCTAAATTGTTCAACTAGGTCACTTTATTTAAAGGTCAAAGTTCTTCGTGGGTTAGTTCTTCATGCAACATAATTTATAAAGCCATTGTGGTTCCTATGGAAAATTACTCGTAGAGTTACTGGCTTGTTTCAGACTTTGCATTGATACTGTCATGTTTAAAGCTACTGTTACCATGTTATGGTTCTCTTGTGTGGTTGTTAACAGGAGACACCTGAAGTACAGAAAAAACTACTGGACGAGAAAGCTCATACTCCCTTACCTGTATTGTGCATGGAAAATGGGATGGCGATCTTACAGTTTTCTGAAATTTTTGGTGTTCACGACAGTTTGAAGAAAAAGGAGAAGAGAGAATCTAGATATTGTACTCGTAGAGGTACTTGTCCTGAAATTATTATATGCGGCTTCCTATTTCAAGTCCTTTTGATCTCATTCCAAATTACAAATAGCATGGTTCTTTTACGTTAGCTGTTATAAATCTGTAATTGTTTTGTACAAAATCTGGAGCTACATCATGTGAAACAACAGGGTACTATTAGCAATCTGGCCATCAACTGTTTTTCTGGATATCTATTATTTGCATTTTGCTTCAGCTTATGAATTTTTATTTCTTGTTATTTCTTCCCCATCTTTTCAGATAAATATAGGTCTGTGGATGTATCTGATATTGTTGAAGAGGATGAAGAGGCATTCTTACATGGCTTCAGTCAAGGTGTATCGTGTGTGAAACCAGCATCTGTCGTAAAAGATGATACTACTATGTTTAATCTGGATGACCCAGAATTTACTAAATTTGGTGTAGTGCAAGGGGTTGATGTAATGGCTGCAAGAGTGGACTGGCGTCAAAAAGACAATTGTTGTGGTGCAGAACCCATGAAACAAGTCTTTGCAGAAAATATTTCCATTGGATCAAATTCATTATTGTTTAAAAAATTTTACCCCCTTGATCAGCAAAATTGGGAAGAGGGGATTTTGTGGGATAATTCTCCGGTCTTAAGTAAGAACTCTGCTGGTAGTTGTGAGGTATCTGGATCTGATCTGGAAGATTCAGTTAGTAGTGATGTGGAACAACAAGTTTCTATTCAGATAGTTCGATCAGAGCACCATATAGATCCAAATGACAGGAGACAGAGGCTTTCCCAGCACGACCTTCCATTACTGGAGCCTTTTGGCTCAAGGAAATTTTCAGGGCCAGAAGAACCTTTCTCACCAGAAATGATTTATCATCCTCAAATGTTGAGACTAGAATCCTGGAAGGATGTGGACGGATCCTGCCAATCAGAGGGTACGAGGGAAAATTTTTCAGAGGAGCATCAAAGCAATGCTATAAGATGCTTTAGTAAATTTTCCCCAAAGAACAGAAGAATGTTGGAAGGTTCTTGGTTAGACAAAGTATTATGGGAATCAGATGAACCCATTGAAAAACAAAAATTTATCTTTGATCTTGAAGATGAACATATGCTTTTTGAAATCTCAGATGAAAAGGAATCTAAATACATTCAGTTTCATGCTGGAGCTATGATTTTAACACGGTCATCAATGTCAGTCAATGGAAAT
TCTTTTGAGATATCTGGCAGTGGAGGTCAAGGTGGCTGGAGATTTGTTTCTAATGACAAACACTATTCCAATAGAAAAGCATCTCAGCAACTCAAATCAAATTCTAAAAAACGTTCTGTTCATGGTGTCAAAGTTTTCCATTCGAAACCTGCGATGATGTTGCAGACAATGAAGCTAAAGTTGAGCAAGTAAGACTAGTGAAATAGTATTTCTTCTAGAGATACCATGAATGTCTGCGCCTACTATTATAAACTAATCTCAGCTTGCTAACATTTGGTATTAGAATATGTAGGATATTTAATATCTTAAAGCAGGTAGTCACCATATTTTGAATTCATGGCCTTGAAAGACAGACGCTATTACAACTTGTCTGATCAATGGGGTTCACTCTACTGTAGCTTTTTTTTTTTTTATTATAACTTTTTTTAATAACAAGAAACAACTAGAAAAACGTTCAATAAATAAAAGCTCCCAAAGGGATTGAAAATAAAAACTAACAACTAAAATTTAAAAGTAAAGAGCTCCAAGTATTTTTTTTTATGAGAAACCAAATTTTCATTGTGAAAAGAAGGATAGAGAACAAAAGAACAAACCAAGGGACTGCAGAAAAAATAGCCCAAAATAGCCTACGAGTAGTGGAGTCCCATAGCTACCATACAAAAAAAACCCAGCCTAAAATAATGATACGTAATGGATAATTACATAAGTCCTTAGAAGAGCCAAAGTACTAAAAATCTCCAAATATTACAAGAGCCGAAGTATTAAAAAAAACTTTGAAGAATGCCATGAAATCTAAATGTTGAATTGTTGAAATCTTTATGTTGTTAATTTGTTACTCTTTGGTTCTTTTTAGTTGGATTCTTATGAGAGCTCCTCGGCTTCCAACTATGTGATTGAATAAATCCACCTCAAGATGTTGCGGGTTGAGGGCTCCTTCTCCCTCCCTTTCTTCATATATATACGAAGAGGTGTTTTTTGTGTATGTATGTTTGTATATGTGTATGCATTGATATGCACATATGCGTCCATAGGTATTAACATGTTAGTCTGCTTGTGGTTTTAGTTGATCTGAATCATCTGATTAGTGTTTCTGAGAGGGGTTGAGGTGTCATTCTGCAAGGTTTCTAGGTGGAGAGAGATAGGCTGACTTAGTTTCATCTTTAGTTTTTTTATGACATGAGTGGTTTCTCCTTCGGCAAGGAAGAGTCCTTTGTTAATCTTAATTGCCTTTGTCTTCATTTGAGTTGATTTCTGGTTTGAAGATTAATAGAAGCAAATGTTTTGGCTTAGCAATTAACTCTGATCCCTCTAAGTTTGTCAGGTCGGCAACTTTGGTGGGATGTGAGGTAGGTTCCTTCCCTTTGTATCTGAGTCTTCTGGGTCTTCCCATAGGGCACTATCATAGATCTTTGTCGTTCTGGGCTCTTATTATGGAAAAGGTGCAAAAACGTCTTTCTTCGTGGAAAAAATATTTCTTATCCAAAGGGGAAGATTTTTCTTTGAACAAGAAATGATTTTTTTCATTGAAGAGATGGAAAGAAACAAGAGTTCAAATATACAAGCCCCAAAAAGGGGAAAACAAAAAGACCGAAAGCAAAATAAAACAATGATAAAGTTATCATGAGGAGGGGATAAATAAAGAACTCCCTATTCAAACACAACCTTTTGTAGAGAATAATCCGAAACACACTTGGAAAGAGCACACCATTACGACGCTTTAAGCCTAGCTGCTTCAAAACGATCCAACTAATGCCTCTGAGTACTATAAGGAACTGATTATGTTCCAACCATAGTTATGAGGTAATAGCCTTTATGATTTGGTCCATAATAAAGTTGGTCTTGGAGCCAAGGGAAGACCGATAATAAGCTGAAGAATATTATTCTGAGAATATTAGACCCATTGCAACTTGAAAATCCAGAAAAGATAAAACCAACAATATATAGCATATGAGTAACCAATGAAAACATGGTTATGTTGTTCTGGAGCTGTGGCCAAAAGA
ATCAGGTAGTGATTGTGTCAAGACTCAAGAGTCCTTACCTAGGTGTCTTGCCAATCCCCCCGCATGGACATCTTGTATGATAGCCTCCCAAAGAGAGGTATGAGGGATGCATAAGGCATTTCCCTTGAATAAGTATCCGTTTACCACATGGAAATCATTGTAAGCCGTATGTGTACTACATTGGTGCCATATGTCATGAAAGTCTGCCACCCTTGTAAATGTTAGGGAGATGGTTAAATGCAATGATTTTACCTTTTAGAAGTGTTGAGAGACTTTCTTTTCTACTCAGTGCAACAGCAGCCTTGTTAGTAATTCCAGATTTATGTTTAATAACAAAATCGAACTGTTGGAGAAAAGATAACCAACGAGCATGCACACGACTTGTGGTTTTTTGAGTGTGCAGAAACTTGAGTGGCAAATGGTCAGTGAATAAAACAAACTCCTTGCTCAAAGGATAGAGTTCCCATTGTTTGTTTCAAATCTGTAACCAAGGAATATAGCTCTGGCTCATATGTACTCCATTTTTGTCTAGAAGGGTTGAGTTTTTAACTAGAGTACTCTATAGGGTGATTGTTTTGGGACAGCACTACACCTATCCCTTTACCGGAAGCATCAACTGCTACCTCAAAAGGTTGGTTAAAGTCAGGTAATGCAAGGACAGGGGTATGGCTTAATTTAGTTTTTATAAGATGAAAGCTCTCGGCCTGGCTTTGGTCCCATGAAAAGTGGCCCTTATTTAAGCAATTTGTCAAAGGAGCAACAAGAGAATTGCAGTCCTTGATGAACTTTCGACTGAAGGAAGCAAGACCAAGGAAACATTGAACATCCTTTACTGTTTTTGGTTCTGTCCAATCGGAAATGGCATTAATCTTTATAGGATCCATTTTAACCTCGTTATGATCAATAATAAACCCTAAAAAGGAAGTTTTGGCTGCTACAAATAGGTGCTCTGATACCAATTGATGCAGCCTTGCACAATTGAATTTTGATAATAACTCAATAGCCCTTTATTAACGAGGCTGAAACCTATTTAGAGAGGTTTCAACTTTAATTGTAGAAGAGTGACTAGAGTTACAGTGCAGTAAATGCAAATCCAACTTAAACAAATCTTACAGGTAAGCTATTTAAATGATAAAGGCTATTTAACGGCATCTAACTACATCAATTTTTGTCAATGGAGGAAAGGAAACAAGTTGTCTTGAGAGGGATTTTACCGAAAAGACACCTGAAAGCTCAAGACTCCATCTTCTCTGGTAATCGTTAGTATCAATTTTCACAGCTTCTAACTCAAATAAAAGAGAGCTAACTAGTTGATTTCATCTTCCTTCAAAAATCTTCTAAAATGATCTTCCATGAGAGACAAGATAAGTCCCACAAATCTGAAAACCGGGCCATCGAAATTCAGTGACACTTTAAAAAGAAGTCGCCAAAAGTATATTCTCTTAAGAAGATCTACACAGACATTAAGAAGATAGATACACGCTAAAAGTATATTCTCTTACCATTCCAAATTTTGAAGGAAGCATAATTCTCCACACTCTTCCATGATTTGGAAATGTTGTGCCAAGGGCTGTGGGGGCTAGAATTTCTGTTTCCAGAAGAATGCCATTCAAAAGAATCAATACCATGGATGCTGACAATTACTTACCGGAGTTGGATTCAAAGATAAACCTCTAGCTCCAGTTGGCCAATAATGCTATATTCCTGGATTTTAAACCATCAATTCCTAAACCACCATTGGATCTCGTCTTTTGCTGATAGGGAGATGTTGGATCTCGTGGCCTTATTGTCCTTGGTGGGAGACTTTGTTGTTTGTCCGGGTAGCAGGGTTTCTTGTCTTTGGTCTCCTAATTCTTCTGAATGGTTTTCTTGTAAATCCTTCTTCATGGCAGATCCTTTCCCCTCTTATTGCTCCATTTTTTCTTCACTGTAGAAGGTTAAGATACCAAAGAATATCAGATTTTTTGTGTGGGATGCAGTGTTGAATTTTTTCTTTCTTT
GTAGGGGATCAGTAGAGAATCTCGATCATCTCCTTTTGTCTTGCCACGGTGCTTTCTTGGTTTGGGATTGTTTGTTTCAGGCTTTTGGGGTTTGCTTTGTTTTCTCTAGAGAGTGTAGGTTGGAGGAGGTTATCCTTCACTCTCTCTTTTGGGTTTAGGGGCGATTTTTGTGGTGCATCAGTTTTTTTGGCTGTTCTTCGGGGTATGTGAAGTAGAAACAACCAATTTTTAAAGGATCGAGAGAACTTTTGAGTAGGTTTGGACCCTTGCTCATTCATTCACTTTTCTCTTTTTGGTTCAGTTCAGTGTCTATAAATTTTTGTAATTACGTCTTAAAATTGATCTTGTTAGACTGAAGTCCTTTATTTTAGTTTGACCCATTTTGTTGTCTTTTAGTATACATTCTTCTCAGTGAAAGCATGATTTTTCATATATATATATATAATATGTAAAATATAGAATGTGGAAAAAGGGTGCTTTTGAGGAAAAGTACTGTCTTAGGGCATCCCGTTGAGACATCAGTTCTAATAAGGCAACTCTTGTTCGATTATATTATTACTTTTTTGAAAGACGTTATATGGAGCTTGATTGAGTTATAGCTTGCAATACATTATTTTTCCTGCAAAGATTGGTGCAATATGATATATTTTGTTATTTATGTGCTTAGTATTGCTTATGAGTAACAGATATTTAGTTATTCTTTTTAGATAAAAAGAAATCATATGATAAAAAGGAAGAATGTTCAGAATGGAAGGACCAGAAAGTCTCACAAAACCACAATAAAATTTGATGAATATCTTTCAATATACATGTGAAATTTGTTTTATCTTTTTGATCATGGCTATATTTCATCAAAAGCAATCTCTACTCCACCGATGTGTTCTTGTCTACCTTTTTGCAGCAAAGAGCTAGCTAATTTTCATCGGCCTAAAGCTTCATGGTATCCCCATGATAACGAGATGGCTGTCAGAGAGCTACAGAAGTTGCCTACTCAAGGACCAATGAAGATTATTCTGAAAAGTTTAGGAGGCAAAGGCAGCAAACTTTTTGTGGACTCTGAGGAGACTGTTTCGTCTCTCATGGCCAAAGCTTCAAAGAAGCTAGGTTATCAAATTTTCTTTTTTGGGTATTTTTCACAAGTTTATCCTTTCTATTTGTGTAATGTACATGCTATGCTGTATGATGACTTGTTTTCTCTCTTCGTTACTACTAAATTATGTCAAGGGGTTAAATTTTTTTATTGTTTTTTTTGATCTCCTGGTTAGATATGAAGTCATCTGAAATAGTGAAAGTATTTTATTCTGGAAAAGAGCTTGAACGAGAAAAATCTTTAGCTGCCCAAAATGTGCAACCAAATTCCTTACTTCATCTTGTTCGTTCGAAAATATTTGTAATGCCATGGACACAACATTTACGTGGTGATAGTAAATCCGTGAGATCTCCTGGGGCATTCAAAAAGAAATCTGATCTTTCTGTGAAAGACGGACATGTGTTTTTGATGGAGTAAGTATAGTGTTCTCCTCTTGTATAATATTGATGCTTCACAACATGTTGATTCTTGGACTTTCTGTATTCATTGTTGTTTCATTTATCTAGGTATTGTTCTGCTTAGTTCTTGCTCAGAGTTGAACTTTGTGGTGCATTGGCAATAGTCATCGGATTTATTGTTTGTTTATTGGCATGGTGTTAAATGAATAAGTATATCCTTTGCAAGCTAAAGGCCAGTTGCCAATCCTGTAAATATTATACTAGTTTATCCCCTCAATTAAGAGAGAGAAGGAAATGTCCCCTACAAATTAAAAATCCCATAGCAATCGATTGAAATGAATTGAAAATACTTCTTTTTTATTTTTAATTTTATTTGGTAAATGGGGGTTTTTATTTTTGGCATTTTAATGTAATTTTGAGGATTCTGTTCAGAATCCTTAATTGATGTCTGATTCTTTTACTATGGCAACTTTTCAATCATCCATCAAAAGTTCTAGTTCCTTTTATGAAAA
CAAAATTGAATTGAACCTTAAATGTCATTGATGTTTCTGATGGCCATCTATGCTTCCATGGATAATGCTTATACTGTCTGCTTTAGATAGGCAACAAATACTTTTCGGCCTGATTCATAAAATACTTCAATTAGAATGATTGTCTGAACTTCTAACGACCTACAGAATAATGTGAATCGGAATGTTTTGGTAGTTCATGTTTCTGGATGGAGTATGATTTAGCTTTTTTTTTGGTTTAAGAAACAATTTCACTGATGTGTGAAATTACAAAAGGAGTGTGACCCCAAAAACCAAAGGAGTTACATGAAACTTCTCCAATTGGTCAAAAGAGAAACCATACTATAAGAGTAAAAAAGTGGAGATGCTTTGCACCAAGACATAGCAAGAAAAATGATGATGTCAAAGAGTTGAGTGAAAGGGAGCTCTTTTTCTATGAAGCGTCTATGGTTTTCTTCATTCTAAAATGTCCAGGAAAAAAGCTCAAAGGATGTGTTTTTAGATGGTCGCCTTCTCTTTCTTGAAAGGATGGCCCACAAGAACATATAAGAGTAAATTTGTGATATCAATAGGTAAGCCACTTTCCAATCGGATGCTAGTAAAATCTCATTCCAGAACAATGTTGCAAGGGAGCACGTGAGGAAGAGGTGAGGCTGTGATTCTCCATGTCTTTTACAAAGGACGAACCATTGGGGATAAATGGCAAAACACGGAAGCTTCCCTGTAATCTATCATGGGAGTTGATGGCCTTGTGTGAAAGCTCCCATATGAAGAATTTAATCTTTTTTTGGGGTAGATTTCACCCCATATAGCTTTGGCAAGGGGGGAAGGGATGAGGTATTCCTTTGTGACCAAGTCAGTTAGGAGGGATTTGATAGAGAATAATCCATTGCTGTCTAGGGGTCATAACCAAGAGTCTTCTCCCAAAGTGAGTCGGACAGGTGAAAGCTGGTTGGAGGGTATAACCCATTCATCAATTTCACCATCCTTGAGATGTCTCTTAAAGCCCAAGTCCCAAAAGCCAAGTTCTTTGTTCCATATGTCCTTCACAGATTCTGTTTTTCTAAGAGAAACAGAGTGGAGTTGAGGGAAGATTGTCGTCAAGGGGGAACCGAGCAGCCAAGCATCAGACCAAAAAGATGTAGAAGCCCCATTTCTAACTTGTGAAGTTGTACGATCATGTATGATATGCCAATGCTTATAAATATTCTTCCATGGGCCACGTGCATGTTTGATGGATTTGGATCTGGGTTTGGTATTAAAGGATTTGGAAGAATATTTGGCCTCAATAACTCGTCTCTACAGAGCTGTTTTTCCCCTTGATATCGCCAAATCGATTTTGCAAGTAAAGGTCTATTCTTGTCTCGCATCCTATGTAGACCGAGAGCACCTTGCTCCATGGGCAGGAGAAGTTATCCCATTTGACAAGGTGTGATGATGGAAGACTCAGAGCTGCCCCTCCAAAAGTAATTCTTGAAAAGTTTCTCGATTGCTAAATCAACCTTGTTTGGCGTGGCAAATAAAGAAAGGTAGTAAATGGGGAGGTTTGATAAAGTGGCTGGGATGAGAGCGTGTCTACCACCTTTAGAAATATAGGAGTGTCATGATTGTAGTCGTCTCTCAATTTTCTCAAGAACATGGGCCCAAAAAGCTGCTGCATTTGGATTAGCATTTAATGGAAGACCAAGGTATGTGTTCAGGTTAATGTTAAGTCTCGATGCCTCTTCGAACTCCTTTATAATGGTGAAAGGATTCCGAATAGATTCATCGTCGTCAATGGAGAAGAGTATGGCAATGAATCTTGTCCCAACTCAACCCCTTTGATAAAATTTGTGGTTGCTGCTTGTGTCAAAAGGCGGCTAAGGCAATCAACTACCATGATGAATAGGAAGGGTGAGAGTGGGTCCCTTGTCTAAGCCCTCGAGAGGCTATAATTTTATCTCTAGGCCTCCCGTTGATGATGATAGAATAGTATGATTTATTTATAAAACTTGAAAATGAAC
ATAATGGGTAAGACTCAATGAGACCTACTGGAAATTTTATAAATAAAATCTCAAGCAAGGTAGGAACTCTTTATTTTAAATATTATTAAATTAAAATAATATTGATGCATTAAAGTATTCTTCATTTCTTTACCTCAAAAAAAAAAAAAAAGAATCTTCATTTATCATAAAAAAATTGATGCCATCAGAGACTTTTTTTTTTAAAATCTTTTTTACAAGAAATGAACTTTTGTATCGATATATGAAAAGGAACAGTAATTGTTTAAAGATACAAACCACAAAGGGATGAAATGCACAAATGAAATAAAAGATAAGTTCAAATACAAGATATTAGATAAATAAGTACAAATAAACCAGCCAACGGACAATTAAGGTAAACGGTAGACAAGAAGTTAATTGTCTGTTTGATAACCATTTAAAATTTTAGTTTTCCATGTTAAGTTTTGTTTGGACTTTTTGTTTTTTGCCAATTTTCCACNCAAAGTAATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTCTTTATCACTTATCTGATCATTTTTAAATGGGTATATGACTATTTTCTTATTTTTCTTATGAGTGTTATCCATTGGGTATCCCTGAGAGTCAAAAAGGGAAAATATTGCTGCTTTTTTAACATACAAGTTGGTAATGTAATGGCCAGATGTTACTTTTGATACCAAAAATCTATCAAACAACTAGTGAGTCGGCTTCTCGCCCAAAAGGAAATTTCTTCAAGCTTCAATGATTCGAATATCTCATTATACTTTCTGGAATTGAACAAGGACCAGGGAGATTTTTGTAAAGAGAAGAGAAGAATCCCACAATTTCTTCAAGCTTTCTGTACTTTCTTATGATAAAAGAAAAAAGTGAAATTACTACTTATATGTTTACCATTCACAGTTTACCTATTATCTCATCTTTGTAGGTATTGTGAAGAAAGGCCTTTACTTTTGGGCAACATTGGAATGGGGGCAAGATTGTGCACTTATTACCAAAAATCATCCCCTGATGATCAAACTGGTGCATTATTGCGAAATGGGGGTGATAGCTTAGGGCATGTCATTGTCCTTGAACCTTCTGATAAATCTCCTTACATTGGAGATATAAAAGGTGGCTCAGTACAGGCATCTCTTGAAACAAACATGTACAGGTCACCTATATTCCCCCATAAGGTGCCAATGACTGACTATATATTGGTTCGTTCTGCAAAGGGGAAGCTTTCCCTCAGACGTATTTACAGAAATTTTGCAGTTGGCCAGCAGGTTGGTTGTTTTAAATTTGTCTATCTGGATGTGCTTTATCTTATTAACTTAATTTTTTTCTTATTTTAAACTTAATTTTTTTCTTATTTTAATTTTATCAGACTTTTTCTCTAATATTTGCTTGATTTCATACTTTAATGGCTGGATTTGATGATTTGATTGTTGGCTCTATTTACATAGATATTTACTCTTAAATGTTTTTGGATATTTATATGCCTTTTCTTAAGTACGGAAAGGATGAACTTTCTTGGAACTTTTTGGTTTAAGTACAATCTTATATATTTTCAATCTGATATTGTACATGGACAGAATATACAGTCGAGTGAAAGGTTCTTATCCGCTTGCAATTAAGTTGTTCCTTTCAATCAGGAATTGCATTTTAGTCAGTGTTATTGAGGCGCACTCGGGCGCTCGCCTCAAGCGAGAGGCGAGGCGATTTCACCTAGGTGCGCCTTGCAAGGTGCTCAAGTGAGTGCCTCGGCGAGCCTTGGTGCGCTTTTGCGCGCCTCTCTCCTGAGGCCGGGCGAGACGATCTGCCTTATTATTATTGTTTTTTTAATTTTTAATGTGTTTTTTATTAACAGAATCCTTCTCCCTCGTGAAATACCTCAGTCTACAAACCCTAATCATTCTCTCTGCCTCTCGATATTCACCTTCACCACCCTCAATCTCCGGCCATCGCCCGCTTCCGTCTCTGACCATCGCTGTTGCCGCTCCCGTCTCCAACTAAAC
TTATTGAATTTGGTTGTTTTATGTATCTTGTTGACATTTACTATGACACTATTTTGGTATTTATTTTTTAAACTATGTTTACTATATAAATTTATGCACTATATATATAATTTTTATTTATTTATTAAGGTGTGCCTCGCTTCACTCGGACATCGCCTTTGTATCGCCTCTCGCCTCAAGGCGGTCAAAGGGCTTGTTGCCTTGCTCCTCGAAAACATTGATTTTAGTTTCTGTCGGGGCTCAAAAAAGAGGCGGGTTGGAATTTTTGGAAATGATGAATGAGTTTCAGTCTATTTATAAGGATATAGTTGCTTCTTCTAAGGAGATTCAAAATGATAATAAAGTGCCCAACTCTTCGGTTTTGGACACGGGGAGGAATTTTTCGATTAATACAGTTGTGGCTCCTTAAAAGGATTTTTGGATTAAAAAAGATAAGGAAGTAGTGGTATTGGATTTTCAATTAATTTTTGTAATTACAAGGCTTCATTCTTTTGAAGATTGGAAGTTAATTTCTGAAATTTGAAAAAAAGAAGTTTCAGGGAGCTGTTTGATTGAATTTATTTTCAGCTGATAAAGCTCAACTAAAAATGAGTGATTGCTAGGATATTGAGGTTAAGAGATAATTAATTATTGGCAAAAGTTTGGTGGTATTTACATCAATATCAAAAAATGGTGTGCTAAAATCATGGAAAGGTTGAGTTGGTTGGGAGCAGCTCACCTTCTCTCGGTGGTGCTACAAGGAAAGTTCTTTCGTAATTGTAGCCTTTCTATGATTATTCAAGATTGGAGAACCATTCTTATTTACTTTTGGGAGGGAGTCTTCTCATCCCCTGCCTTAGGTTGTTTTGTTTTCTTTATAAAAGAGTTGGCTGGGAGCTATGGAGCTTACCTTTCTTTAGGCATGCGGCTATAGGCAAACACTCCTTGGACGAGCATGCCCAAGACATTTTGTCCCACTTAATCGGTTGACGTTTCTATCAATTGAAGCAACTTGAAGTTGTCGATCTTCATCTTCTAAGCCTTGAGGTCTTCAACTCGTTCCCAACTGATCTCTCGATCGGGAAGGTTTTCCCACTTGACAAGAAAGTCTGAAGTCTACAAGTCGCGTTTCCCACCTTTCTAACTCTATTTGCTAGTATTTCTTTTGCTTCCTTGTCTTCTTTCGGCTTCAGGTCGATGACAGCACATAACAACATTCCGCCCCTCACTTATTGATTAGGATACTAGGGTTTCAAGTTATTCACATGGGTTACGGGGTGAATCTTCATCCATGAGGGTAACACCGCCCTATAAGTGATGTTCCCAACTTTCTTGAGAACTTTGAGATCCTTTGTACTTTTTGACTAGTCTCTGATTTATGCGGCCTTGAAAAGGATCTCTTCTGGTCTCAACTTGATAAGGATTTAGTCACCTATCCAGAACTCGAGAAGGTGATGCTTGCAATCTGCCCACTCCTTCATATGTTTGGAGGTTTTTTCTTAGTAGGCTTGGGCAATCTGATGTAATCAGTTAGTTTATATTCTGTTTTAGTTGGTTAGTTAGGTATAAACAACCTTCTGTTAGTTATCTTTTAGTTAATTCCCTGTTAGTTATGGCTAGGTGCTTATAAAAGCCAGCCATCTCTGTAATCAGGCTGATCTTGATTAGAATTTCAGTTTTCTCTAATTGGTTTACACCAAATTGGTTTCAGAGCCCTAAACAATTCTTAGGATCCAGTGTTTTCAAGGCGCAAGGCGCACCTCAAGGCGACAAGCCCTTCCATCGCCTCAAGGCGAGAGGTGATAAAAAGGCGACACCTGAGTGAAGCGAGGCGCACTACATAAAATGTTAAATAAATAATATTTATTAATAATTTATATGTAAAATAAATAAATATTCAATTTTAAAATGTGAAAATTAAAATAATAATAATATTTTAAATAAAAAATTAACTTAAAATAATTGATAAAAATAGAAATATTCGGAATAAAGAAAAAATAAGTAGAGAGAAAATTGTATTT
GAAATAAGCTGCAAAAGAATTAAATAGAAATAGTTGTATTTATTGCAAACAGCTGTATTTATAGTAAACAGCTGTGTTCATGACCATGGGAAATAGGTGACAGCTAACTTATCATGCAAGCACATAGAAAAGTGAGCTGCCTTCCACCTTTCCGTTTTTTGAAAGGACTCTTCTCCTCAGACTTTTCTTCTTCAACTTCTTTCTTCAACATGTTCTGCAACTTCTTTCGTCAACATGTTCTGCTTCAGTTTTTCTTCTTCCTCCAAGTATACCATCTCTCCCTTATCATCTCCTATGGTTCTATTTTTGTCCTCTCTTCATCTTTTGTTTATTTTCATTATGAGGTGGGTTTTTATCAAGTTAAAATGATTTAAAATTAAAAAAAAAAGATATATGCGTCCGCCTTCAACCAGGCGCTCGCCTCGCCATAAGGCAAGGGCGCGCCAAGGCACAGTGAGGGACGCCAAGGCGAGCGCCCGGTTGAAGGTGGTCGCCTTGGCACCACTCAAGGCACACCCAGGCGAAATCGCCTCGCCCCTCACCTTAGGCGCTCGCCTGGGTGCGCCTCGAAAACACTGTTAGGACCTGATAGCGGGAAAGAACATAGCAGCTTCCGGCAAAGGGTCTGCGGTGGACCCTTCTATAGCGCAAACTCTATCCCCTAGGACAACCACTAATCGCTTGCTGTTAGTTCGAGAGAATGTGCAGAATATCCACACGTTACTAAAGAGCGTGTGTAGGCAACTGAAACGAAATTCGGTTGAACAAGAGCGACTTCGACTTGATAGGGGTAATTGAGGTAACACAGGCTGTGGGGAACAACATCAAGAACGTCCAAAAATAGATCCCGTACTCCAGTTGTTGCGATGTATGTGGAAGATCGCAGACGGAGGAATCAAGACTTGCAGATAAGAGTCCAATATTCAGATTCTTCCGGAGAGGAAGATATCTTTTTTCGGGAAAACCGTCGGTTTGATAGAGATGACCACCAAAACAAGGGTGGTGTTCGAATGTATTACCGAATGAAGATCGATTTACCTACTTTTAATGGGAAGATGGATGTGGAAGCATTCTTGGATTGGATAAAGAATGTCGGAAATTTTTTCGACTATACCAACACTCTAGATGATAAAAAGGTTGCTTGGTCGCCTTCAAATGCAATCTAGCGATTCCGCATGGTTGGGCCAGCTGGGAATCAATCGGCGACGTCTGGGAAAGCGACCTATCCGCAGCTGGCCGAGGATATTACGCCTCAGTTAGTTTATATTCTGTTTTCGTTGGTTGGTTAGGTATAAACAACCTTCTGTTAGTTATCTTTTAGTTAATTCCCTGTTAGTTATGGCTAGGTGCTTATAAAAGCCAGCCATCTCTATAATCAGAGGCAGATCTTGAATTAAAATTTCAGTTTTCTCTAACTGGTTTACACCACAATCTCTGTCGTTTGCCTCCGCTATTTCGTGAAATTATCGGTAAGGAAGTGATTAGTTTCTTTACGAATCTATGCTCTTCATACGTTGGTCTCAGGTTTGCCCTAGAAGGTGTTGGGTGGAAGCCTCTTTCTGGTGGGATGAAGGATTATGAAGAGGAAATTCATAGTGATCACGAGTCTTGGTAATTTGAAGTCGCAAGTCCCGATGAAATGACAATGGAATTTTTTTTAAAATCTTGGAACATGTTGGAAAAAGATTTAGTAGCGCCAGGTGTTCCAAGATATATATATATATATATATTTTGATTTTTTTAAAAAGGGGATCATAAATAAATGTACAAACAAGACCTACATTTGCCTTATTCCAAAAAAGATGAAGGCCAATACAGTGAGCGAGTTTCAAACTATTAGCCTAACTGCGTCTCTATACAAAACCATGGCAGATTACTTTGTAAAGGGAATCGGCTTCCCCATGTTCACTGTTGAAAGAGATGCAAGGAGGGACTTATAGGAAAAGTCTTTGAAGCATCTGGAGGCCATTCCCGTCCATCTCTGCCTCCACTGATCCGAA
GTATTGAGATAGAATTCGAAAGAATTGCCCAGTCTAAATCTCATCATCTCTCAAATTCCTAGTAGGTTTAGGTCCGTGAGCTAGTCGCTCAAATCCAAGCCTCAACAATGGTGATATTCAGATTAGTAGTCAGCTTAAAGAGCTTCAGAAAGTTTGTGTTGAGAGGGCCTTGGTCCATCCAAAATCTTTCAAAAATTTGAGAGAATTACCATCACCAATTTTGCATGTAGCTTTGCTTTGGATAGTTGTGATTTCTTTTGATATGTGAAACCAAGGCACTTTGGAGGATATATGGCAGGAACTAATAGGCGAAGACATATTTATATGATCCTTGTACTTCTCTTGGATAGCCTTGCACCAGAGAGCATTTTTTTCAGCGGTGAATTTCCACAGGAAGAGCTTTCAATGTTCTTGATCCAAGGCAGGAATCCAAAATAGAGACACAAAATAGGTTGGAAGATTTGAAAGGGTTTATTGGATTAGAGTTAAACGACCCCTTTAGAGATGAAGGTGGATTTATTTTTTTATTTTTTTGGTAAGATTTAAGAAACAAATATTTCATCGAGGGTATGAAATATCCAAAAGCATGATGGCTAATTAAAAAAGCCTTCCAATTAGGGGCAAGGGCATTTAGAGAGTAGCTATTGAAAGGTGAATAAAAATATTTACACCATCCAAGAGCTAAAACAATAACCCTATAAAAAAACCTCAAAAGGGAGAGTTCGAGAGATGTAGCGTTGAAGATTCTTTTGTTTCTTTCTAACCAAAGGCTCCAAAAAATTGCCCTGATGAAATGTTCCCATAGAATTCCTCCTTCCTTATTAAAAGGATGAGCAGTCAACAAAGAGCTAAGAGGGATGGAGGGACTATTGTGTAATGCCGTATGCCATCCGAAAGTCTCGAGCATTCTGTTCCTAAACAAAGAGCTAAGACCGTCTAAAGGCGAATTTCCTATTGCAAAATAGAAAAAGGATGTGGAACTTCTATTGGGAAGGTAACTCGAAGGCTCCCTTGAAACATTTAGTCCAATGGAAAATTTCTTCCTCTCCATTGGAGGCAGGGGGCTTGGGGATTGGTGGTATACAACAGAAAAATAGGGCTTTGTTAGCCAAGTGGGAATGGCGTCTTTGTAATGAACCTTTCTCTCTCTGAAGAAAGGTGATTGTAAGATTTATGGATCGGACTCCTTTGGTTGGAAGACTTTGGAGAAGCGGTCAAGTTGTTCTCGTAGTCTTTGGATCAATATTGCTAAGCAATGGGACAAAGTGGAACATTTTTGTTCCTTTACAGTGGGTAACGACAAGCATATCGTTTTTTGGTCTCATGTTTGGCTTGGAGATAGAGCTTTGAAGGACAAATTTCCATCTCTTTACCGAATCTCTTCTAATCCAACAGGTTCGGTCTTTCATCATTGGGATTCAGCCACTCAATCTTGGAAAATTGAAACTAGACCTTTTCTAAAGGAAGTGGAAGTAGGAGCATTACGGAGTTGCTGTCTTCCCTCAGCCACATTTCATTGATTTCTAAAGATGATCAACAAAGATGGTCGGCCGGCCTTTTCACGGTTAGTTCCTTGTGCACAACCCGTCGTTCCTCCTCAATCTTTTCACAAGATTTGTTCAAAGCCATTTGGAAGTCTAAAAGCCCATAGAAGGTCAACATTCTCACTTGGATTTTGTTCCAAGGAAAGGTAAAGACAGCCAACGTCCTCCAAAAGAAGATCCATAGTATGGCCTTGTCACCTTCCATGTGCATATTGTGCAAGAATTGTGATAAACAGCAGTCTCACGTCTTCTTCCATTGCCCTTTCTACAAGAATTGTTGGGAGCTTCTATTCAATTCTCTCTCTTTACAGTTGGTGTTTGATCATGCTGTTCCAAACAACATTGCCCAACTACTCAGTATATCAGGTTTAAATTATAAAGCTTGGTTATTATGGGCAAACGGTGTCAAAGCCTTAGTTTCTGAACTATGGTTTCAACAAAATCAGTGTCTCTTTG
AAAACAAAAACAAGTCTCCGAAAGAATGCTACAGTATTGCAAAGTTGAAAGCTTCCCATTGGTGTTCTCTTGATGCTTCTTTTTCTAGTTATTCCCCTAATATGATTTGCTGTAATTGAGAGACCTTTTTAAATCCCTTTTAAGTTTTTTCTCCCTTTTGTTTCTTGTTTTCTTGCTGTTTTTTTTAAAGTTTTTCTTTTTCACTCCCTTGGGAGTTTGTATCTTTGAACAATATTGCAAAATTTTCTCTCAACTCTTTCCAGAATTGGCTACGAGAAGTGAAGGGATTTGTGGAATAACAAGATAAGTGGCTGGACATGAGTCCATTTTGCAGCATGCTGACTAGAAGAGTTTCTAGGTCAGGAACAGAAAAATTTATACCCAGCCGCTCACTTTTACCATAGTTCATATTTAGTCAAAGGCCTCTTCAAAATTTTTGATGACGTTGAACAGATTGGTGATGGCCTTTCAAATGAGGAGAAGAGAAGGTTGTCGTCAGTGAATTGCTGGTGGTTGAGGCACAAGGAAGAATTGTTCAAAGAGAAACCTTTAATATTGACCTGATTTGTGCTGTGAGAAAGTAACCGGCTGAGGCTATCTCTATCTACTACCAAAATGAAGAGAAAAAGATGGGTCTCCTTGTCCAAGGCCTCTCGACGCGATGACTTTACCTCAAGGTTTACCATTGATTATCATGGAATAGTTTGCGCTAGATACACGTCCAGAGATCCAATTCCTCCAACGATCTCCAAAGCCCTTAGCTTTTAGAACACAATCTAAATAGTTGCAATCTACCTCAAATACCTTTTCTAAATCAAGCTTTGCAACCACCCCTGCCTTTTTCTTGAAAGACCATTCTTCAATTAGCTCATTTGCAATGAGGGAGGTTTCTGTAATTTGTCTTTCACTGACAAAGCCGAAATAGTGAATGGGAGCACTCCTTTGGGATGAGACAAATGTAAGTCTCATGAAGCTTTGCATTGGTAAAACCATTCTCAAAAAGATCATGGATCAAAGTCAAAACGTCTTGCTTGATTGTGTTCCATGATTTTTTCAAATATTAAAAGGTAAAACTGTCTGGTATTGGTTTTGTTGTTGTCCAGAGAAGAGGCAGCTTGGTATACTTGGTTGGTATCCGGCTTCACCCGGTGGATCTCTGCTTCAGTTAATTATTCTTCTACTTGCTTAGCTTGCTCTTGGTGGATGGCTGCCAATCCAAATTAGAGGGAATGAACCTCTTGTTCTTATCTTTAGTGTAGAGCTGATCATTGAAATCCAGGAATTCTTTTTCCATGTCTTTCTGTTAACAAACTATTGTCATCTCTAGATAGGATTTGGTAAATATAAGTGAGTGTTTTGTCTCATTCCTTAAACCATCTCAGCTTGCATCTTTGTCTCCAAAATACTCTCATTGGCAGCAAGGTCCATTAACCGAGGCTTGACATTGTCTGACTTTCCTCTGATCTAGGGTGAAGCCCATTCCTTTCTTGTCTATCAAGATCTACCAGCTGAGTTTTCGATTTATTTTGCTAAGTAGCTAAGGCTTTAACATTCAGCTTGCTTCAGTTCTTCAAAATAAACTTTAGGCCTTTGAGTTTCTCCCTAAAACCGAAAATCGTGCCCTGGCCAACCTTTCATTGGATTTTGCTTCCACCAGGAGTCAACTATGGGGGAAAGGACTATGGCTTAGACACATTTTCTCCAAGGGGAAGGTGGTGGTCCCCATTTGATGTTGCCAAAAATGAGAGATAAGGGGTAGTGGACAGAGATAATTCTCTCCGGCCTCTATAGGTGGCCCATCCCAAGTTTTGGAGCAATTCTATTGGTGATCAGAAATCTCTCCAAAAATGAAAGAGTGGGCTGGTATTTGAAACTTTACCAAGTGAAGCAACAATTTATAAGAGGGATATCTAACGAACTGTGAGAGGAGATGAATTTATTGAATTTTCTCATGCTTGTTGAGTCTGCCTGTGATTTTTGCCATTTCCATCCGGTAACAT
TGAAATTTCCACCCAACTAAGTATTGAAGTTCAGCTAGTCGTTTCTAGCGATGCACCAACCTTGGAGGGTAATAAATACCAGAAAACCAAAAAGAAAACCCATTGGCATTAAGATATTGAGGGAAAGAGAGAGGGCACCTTTTATCACATTGATCTGAAAGAGATGGTCATTCCCCTCTTCTGTGACTTTGGAAGCTTTTCAAGTGATTACTGTTTGAAGCTTTAAGTGTTTATTCAAATCCTTTGTTTTTATTGAAAGGAAATGAAGAGTTAGAGGCTTTTTCTACGTCAAAGAAAAGCATTGGAGAAGACTTTCCTTTGGTTTACAAGAGTAGAAGACTTTCCTTCGACAATTAACAAGGATATTTTGCTAATGCATTGGGTTCTTTCTCAAAGAGTGAAAGTCTATTCCTTTTTGATGATTAACGGTCTTTTTCTATTGATGTCATAAGCTCTAACGTGCTTTTTCTTTGACAGCTTCTTCTCCTTCGTTTGGAGCCATGGGTGGCTCCAGGGGTTTTAGTTTTCTTCTAGATTATTTCATTGTGGAGTTTGCTTTTGGTTTGTTTCAGTGGAGCCTTGTGGTTTCTTGGTACTGCTGTCTCTAGATCCTTCAGTTTGATGGCTCGTTATTTTTGAATTTCTTCAGTTGATCATGTTCTTATTTTGTTTTTTGTTTTTTTATCTCTTTTACTTTTCCATCTCACCCCTCGAGTTTGTTTTCTAAACATTTCCTTTTCATATTATCAATGAAAAACTGTTTCTAGTTAAAGAAAAATGGTATTCTATGCAGGAGCCGTTGATGGAGGTGTTTTCTCCTGGAACAAAGTCTCTTCAAATGTTTATGATGAATAGATTAACTTTATATATATTTCGTGAATTCCTTGCTGCTGAAAAGCGTAGAAGACTTCCTTACATTCGTGTTGATGAACTACCTTCTCAGTTTCCGTATCTGTCTGAGACTGTGATTAGGAAGAAATTGAAGGAATATGCTCTTCAACAGGTTAGTTAATAATTTTTTTACAAGTATAATTGTAAGCTCTATTTGAACAAACGGACTACCCTGTTTTGTTCTTTAAGATGTTTTAGTTTTACTTCAAGATTATCTTTCATTATCGACTTCATTACATTTGCACTGACCTCATATACAAATCTACAGAAAAGTTCTAATGGACAAACCATTTTGATTAAGAAGCGTAATGCAAGTATTTCTTTGAAGAAAGATGCTGTGACACCCGAGGATGTAAGTATCATAAGTCCCTTATCATTAGTTCTCTTTAAATAAACTAGACATTTTAATTCTCCTGCATTGTTCGTGTTGTTAAGGAAAATGCGTACCTATCTCTTTCATTGCTGTATTTATTTACTTCTGCTCTGCATATGAAAGAAAAATTTTATATTTTGACTGTGTGTCTTTGAACAACTGGGATCTTCCCCGAAGTTGCAAGGTGTTGAAAACGGTATTCCAATTAGCTATAAAGGAAGATAAATTGTGATTAAAGCTTACTAAAGATCCATCTCATTAGTAAGGAGTAACAGATATCACTTGACAAAGTATTGCACTCCAAAACAACATTAGTCTCAGCCCTTCTTTTACTTTTCAGCCAAAGTAATTTCTTAAAGCCCAAATCAAATTCATGTTTTGATTTGCCACTTCCGAAGTTAGTAAAACAAAATATCATATTATAGAGCTCATTTTAAATATCCATAGATCACTTGCCACATCAGATTTTTGTTACTCTCGCCTAATGACAGGGACATGGGATAAACTTTAATAAAATGATAGGGACAAGATGAAATATTTAAAAGGATAGAGACAATTGGCCAAGAATTTAAATTTAATCTTTTATTTATTACTTGGTCTGTTGGTCATTGTTTGAAGTGATATTTGTAATTGGGTGGTGGGTTAGGTGCTGGCTTTGTTTTTCGTTGTCTTGGGCTTCAGTGGTTCTTTGGTCTTTGCGGTTACACTTCTGGGATAATTTTGTTTATTGTTTTTC
ACTTGGGGATTCTTATTTTATTGTTTCATAATATCAATGAAAATTTTTGTTTCTTGTTACAAAAAGACAACTTTTCTCATATTATGATACAAATTTTTCATTCATCTTTCAAAAGAAATCTCTCTTTCCCTTGTTTCTTTTACTACATGATTTAGAGTTATGTAGTCGAGCAGAGGGTGCAAAGCAAACTGTTCCTTTGCCTTGTACTATATGATTTAGGATTCTTGAGGAGCTTGAAAGAATAGGAGAATAATGAAATAATTGTCACAGCCAGTCTGCAAGACTTGAGTTGACAACGACAAGGACAGGCATAGACTAATTCTTGTCCAGATTTGAGATACCTCTCACTACGTACCGTTTCTTTTTTTTTTTTTTAACATGATACGAACTTTTCATTGATTAATGAAAAGAAACAAAATTGTTCAAAGAGAATCACAATAGAATCACAATAGAATCAACAAAAGTTAAAACACTGACTAGGGGGAGTTATAAATGCCTCCCAATTCATACATAGCATCCCAGGAGAGTGTTAGGCCCCCAATTACAAAGACATATTCTAGGAGGGGTAAGTTTGGAAATAACCTAAGGGGTATTATGGGTAATGCCTAGGGAGAGTTCTAACGGGCAGGAGAGAGAACGTATGTACCAATTTTGTGGGGAGGGAGGTAGGTATAAGGGGAGTGCGCTCATTTGAGTAGGGGGACTGTTTTTGGCTTGTACCTTATGTAGTAGGAAGGGAGGAGAGATACCCACCCTCTCGAATCGGTGGGGTAGCTTGTAATCGCCTTCGGGCTATTTTCTATATTCGAATAGTAATCGGAGAAGGAGGTCGATACCTACAGAGAGTAAGAGGAAAAGGTATCGGAAAGAACACAACATTGTTTTGCAAAGACGATGATAATATGTTGGCTGTATTGTTTGACACAATTAAAGCCTTTGAATGGCTCTCGGGATTGAAAGTAAATTGGGAGAAATCTTCTTTAAGTGGAGTCAACATGAATACTCAGAAAATTCAGAGGATGGCTTCTAAATTTTGTTGCAAAGCTGAAAAACTTCCGTTGATTTATTTGGGAATGCCATTGGGAGGAAATCCAAAGCATGACAATTTTTGGAGTCCCATTCTTGACAAGACAACAAAGAAGCTAGATCGATGGAAGAGATATCAACTTTCGAGAGGTGGAAGGTTGACCCTTTGTCAATCGGTTCTTGAACATCTTCCATTATATTACTTTTCTCTATTCAAGGTTCCTATGTGAATTTGCAAAATAATGGAGAAAAGTGTTCGATCCTTTCTTTGGGGAGATTCTTTGAAAGGTTCAGTAAATCACTTAGTGAGGTGGGACATATCTTCTTTAGATCGAGAGGAAGGTTGCCTTGGGATAGGGAATTTTACTAAAAGAAACCGAGCTTTGCTTGCCAAGTGGGGGTGGCGGTTCAGTGTTGAATCTTCAAGTTTGTGGAGGCTGGTGGTGGCTAGTATTCATGATTCTGATTTTGGTGATTGGTGCACTTTGACTAAGTTGACAGGTTCAGGTAGAAGCCCATGGAGGAATATATCCAAGGTATGGAAGCAAGTGGAATCATTTTCCCATTTAAAAATCGGGAATGGCAAAAGGTGTCTGTTTTGGCATCATCGATGGATGGGAGATTTTTCGCTTAAAGATAGATTTCCAGCTATCTTTAGGATCTCTTCTAATCCGTCCATTGTAGTTGCGGATGCTTGGGATAACACTGGTTATTCTTGGAGGCTGTCAACTAGGCGATCATTGAAGGATAATGAAGTTATTGAATTGGCAAATCTTTTATCCTTGCTTTCTTCAACCTCCATCACTGATTCAGAAGATCATCGAGTATGAAGTTTAGAAGCTCCGGGTTTACTTTCAGTTCGATCGTTGAGCAAATTCCGGCATTCGGTTTCTAATTTTCCTAAAGAAGTACTACCTATTCTATGGAACTCTTTTTGACCAAAGAGAGTTAATTGCATGGTTTGGATTCTTCTA
ATGGGCAAAGTTAACACTTCAGAGACACTACAAAAGAAAATGTCCAATTCTTTGATGCAGCCATCATGTTGTTTACTGTGCAAAGCAAGTGGGGAAACCCAAAGGCATGTCTTCTTCTCATGTAGCTATGTAGTGGTCTGTTGGAGAAAATTCCTATTAAGTTTCATGGGTGTGGGATATGGAGGTGCCTCAAATCATTTTTCAGCTTTTCAATGGGCGCAAGCAAGCTCTAAAACTAATCTGCTTTGGGCTTTTGGAATGAAAGCTCTTCTTTACGAATTGTGGTTTGAGAGAAATCAACGAATATTTGAAGGCAAGAATCGATCTCCTATGAAGTGTTTCTCAATCGCCAAATTCAAAGCCTCCCAATGGTGTGCTCGTTCCGATATCTTCTCTTCTTATTCTCCTAGTATGCTTTGTATGAATTGGGAGGCTCTTATAACTCCCCTTTAAGGAGGTTTCTTTAAGTTTATGGTTTTTATTTCTTCTAATGCTTACTTTTATTTATTTCCTGTATCTTCACTCCTATGGTAGTTTGTATCTTTGAACAATTTTGTTCATTTTCATTTATCAATGAAAAGTTCGTATCGTGTTAAAAAAAAAGTGATGCCGAAATAAGCTCCGATTGCCATTCCCTACTTTGACATGGGAAAACGACTCCACCTCTTGCCAAGCTTTCAAAATATTATGCCAAGGACTTCGACCCGATCCGACTAACTTAGGCAATGTGCACCAACCTCCATAATCCAATCCATAAATGCTAGCCACCACTGATCGCCAAAGAGATGAAGTTTCCACACCAAATCTCCAATCCCATTTTGCTAACAAGGCTTTATTCCTTTTGGAGAGATTTCCAATACCCAGACCTTCGACTTCTCTAGCATGAGAAGTTATATCCCACCTTACTAAATGATTCATCGAGCCTTTTGATGAGTCTCCCGAAAGGAAAAAGTGTATACTTTTCTCCCAGTGCTTAAAAATTCTCACCGGAGCTTGAAAAATAGAGAAATAGTATAAGGGAAGATGTTCTAGAACCGATTGACATAAAGTCAACCTACCTCCCTTCGAAAGTTGATACCATTGCATCTATCTAATTTCTTGAAAATCTTGTCTTGAATGGGGCTCCAAAAAGCTTCATGCTTAGGATTGCCCCCAGTGGAGAAGGGCTTTAAGCCTGTAGCAGAAACCACAGTTCGGAGAAAACGATTCTTATCCTCAGATTCCTGATAGGTTCATTTCGAATATGACCTTGTTTAAGTGAAGATCGAAGAGATCTTAATTTGGAGGAAATCAAATAACCGGCGAATGATCATTTGAAAATTTGGTTTCAATTAGACTTGCACAATCTTTCGAAGATAACCAAGAGTTGAAGAACCGAAAAGGACACGGTCCCCAACACGAATCCCCTGCATCCAGAAAAATGGGAAAGTGATCGGAAGTGATCCGGTTTGCTCTAAAGACCCTGGACTTGTTAAACAGAGAAATCCAATCCTGAGAGAGAGAATCTGTCAGTCAACGAATGAACGCTGTCATTCCCCCCATCTTCGACCAAGTGAAGCCTCCATTTGAGAGAGGTATTTCAACCATCTCTACATTCTCAATCATAGTACTGAAGGTCTTCATAGCGTTTGAAATTCTTCCACCAGAGGATCTATCACCAATCCATCGAGTAACATTAAAATACCCTCCATTACACAAGAATTTTTCGACGAGCCCCTTAATATCCCTGAGCGCTTGCCAGAGGAACTCTTTCTTTATAATTCATCGGCCCATAGACATTTGTCGCACACAAGTTCTGCAAGTTAGAAAAAGAACACAGAATTCTTAACGAAAAAACTCCATGATGAACCTCCTGCAATCGAATTTTACTCTCATCCCATATTGTGATCACCCCACCAGATTTGCCAACAGAGTCTATGTGAGACCAACTGGCATCCATTGAATTCCATAGAGATTTTACAATAATATCATCTATATTTTGTAATTTCGTTTCTTGCAGG
AGAACAATATCTGGACCATGCTTCTGAATAAACCTTTTAATCTTCAACTGTTTATTATAATCACCAAGACCACAGACGTTCCAAGACAGGATTTTTATTGGGATTTAATAGATGGAGCAGGTGTGAGACCAACTGTTTTGGAGAAGGTAGGAATAGAAGAATCCGTGGGATCTTTGAGGTGTGCAAGCCGAGAATCATTAAAAGTATCTTCTTTCGATTGAGATGATTGATCATGCGAAGATCCTGGTAAAGTGAACGAAGGGCCACCTCCAAACCAAAAAGATTAATAAAATCTTCACCCTAGGAAATTTCCTCTGAATTACATCTCTGATTAGCGGGAGAAGTGGAGAAGGGGTGACATCAGAACTACTTATGCTGACATTGAATTCCTTTTCATTTTCTGAAAAAATATCATTTGGAATTGGATTCATCAAAGCATTAAATGAATAAATCACACCTGACTTTAATGAATCAGTGTTGGGCCCATCATTCGGAATTGGCCCATCAACAATATGTACCGTTTCTAAGCACCACAAGAATTAACCAAAAGAGGAGATATTTGTTGAGAAGGTCCTTCAAGAATTCTGAGGAAATTCCATGTCTTGAATTAAAAATAAGCTAAAGAACCCTTCATCCCTTCGGAGGGGAATATTATTCTTCATAAAAGTCCTTAGTCAAAGGGCAAGAAAGGCACCCTTTGAAGGAGAAGAAAAGTAGTTCATTTCTATTCCCACCCTTGTAGCTGACCAGGGAGGTTTGAAATTATGAGTCTCATCGCTTCTGAAGACGGGTCTGCTTGGATCCTCCTCACCAAAGAAACTGGATTGATTTTTGTTTGCTTAGCTTTCTCTCAAGGGTTTGGTAAAAAATGTTGCAGGCTATAATTGGAATTTGGTCTTTAGAACCTTCAGGCTGTTGTAGAAAATATTGCACTGATTGTTAGTTTCCGAACCTTATAGAATTTGAATTAGTAGTCTTCAATTTACGGTAGATACCTCAAAAAGATGGACTCTTACTGATACAGGAGAATTTTGACAAGTAACTTTGAGGGGAGAGAGGGAAAATAGATTGCCTTCATGCTCTTCAATTGTTGTGGTGAACAGTTGGGAAGATTGCTCATATGATGTTAGTTGAAGCGTGCGTAGTTGTGTTGCACATCACTTTATTTTTTCCCTGCTTTTCAATCTGTGATTTATTCTATTTAGTTTTCTTAAAGCATTTCTGAATCTAGTGTTATGATCATAGCATAGGCTTACAAGATCAAAATGTTAGATGTCATTCTAGTGCATATGGTACATATTGTTGTATCATTCTATTGTTCATTGTTATATTGCCGTCATATTATATGTTAGGTATGCAAGTATGAAAGCATGCAAGCTGGTCTGTACCGCCTTAAACATTTGGGAATTACTGAGGTCCATCCCTCCGCAATTTCATCTGCAATGAGTAGACTTCCTGATGAAGCTATTACTCTGGCTGCTGCATCTCATATTGAAAGAGAGTTGCAGATAACTCCCTGGAACTTGAGTAGCAATTTTGTTGCTTGTACGACCCAGGTAAAAAATCTTTACTTATCTCCATTATAAGAAGAGTCTCAATGGTTCGTATGTGTAATCCTCCTACTGTTTTTTTTATTTTATATTTTGTTTGTGAAAGAAAAGCGTATGATAAACTGGCATAGAAAAGCAACAGCTCAAACGACTATTTTCCAAGGGGTAAAAAATAAGGGTTGACATCACTTGATACTTTATAGGTTGATAATGCATGTTGATTTGTTGATGTCTTCTATAAATGAACTGATCTTAAGTTGTCTTTGATAAGTTTGTTTCAATAAGTATTTCTCATCTTCCCTAAGAAAAGAAAAATTTTAATGGATTCTTATCACTTAAGCTACATTTTGAATTCTAAGATGGATAAATAATGGGCTTCATACGGTAGAAGCAAGTGAATAAGTTCTCAAAAAAAAAAAAAAACAAAAGAAGCAAATGGATCAAGGAAGA
TGGATAAACTGGCATAGAAAAGCAACAGCTCAAACGACTATTTTCCAAGGGGTAAAAAATAAGGGTTGACATCACTTGATACTTTATAGGTTGATAATACATGTTGATTTGTTGATGTCTTCTATAAATGAACTGATCTTAAGTTGTCTTTGATAAGTTTGTTTCAATAAGTATTTCTCATCTTCCCTAAGAAAAGAAAAATTTTAATGGATTCTTATCACTTAAGCTACATTTTGAATTCTAAGATGGATAAATAATGGGCTTCATACGGTAGAAGCAAGTGAATAAGTTCTCAAAAAAAAAAAAAAACAAAAGAAGCAAATGGATCAAGGAAGATGGATAAACTGGCATAGAAAAGCAACAGCTCAAACGACTATTTTCCAAGGGGTAAAAAATAAGGGTTGACATCACTTGATACTTTATAGGTTGATAANTTTTAAAAAAAAAAAAAAAAAAAAAACAAAGAAGCAAATGGATCAAGGAAGATGATATGATGAATTTCTGCTAATTTACAACCTTATTCTCCAACTCCCCAACCGAAACAACAAAAAAAGGGAAAAGAAAATATAGAGCGGTTGGAAATTACAGGTGTTGGTGATCCATCTGGCCGTGGACTTGGTTTCAGTTATGTTCGTTCAGTTCCAAAAGCACCTATTTCTAATGCATCTTTGAAGAAAAAAGCAGCTTCTAGTCGAGGAAGCTCTGCAGTTACAGGAACAGATGCTGACCTACGTAGGTTGAGCATGGATGCCGCAAAAGAGGTACCTTCTCATCTATTGAAAGATCGTGTTCAGCTTTATTCTGTCTTGTGAGAGTACTTTCATGTTCCACCTGAGAATTCTATGGTAGATAAAGAACTCTTTAAGAATGGAGGTGTACCAAAAAATCATCTCTTTAGCCTTTTGTATTTTATGTATGCACACTCTTAATTACATACATTATTTGTGATTAAATGATTACGTCAATTTCTTTATACTCATTCTCTGCTTTATCCTGTTATTTTCACCTTTGGTTATGGAGTCGGAAGTTTCTCTTTTAGCTATATGTTCAGAAGTAGCTTTGGCATTGTTTCCTTGTTCAAATTAAAATTACGAAGAGAGAAAGAGTGAAATTGTGGAAGTTGGAAAATATATAAGAAACAGAAGGAAAGGGGGCGGAAAAATAAAATTGGGGGGAGAGAATAGAAGAAGCAGAAGAAAATGGGAGGGGATAGAGAATAGAAGAAGCGGAGGAAAATGGAAGGGGGAAAATACAAGAAATAAAGAATAATGTTTTCCAGGTAAGCAATAAAAATAGAAAATTAGTTTTTCAATGTCACCCAACATAAGTAATGAAATAGTTTCAGTCAGCTTTTCACTACGATAGCTAACAAAAACCTAATTACCTTAAATCTTCGCCTTGTTTAATATTTCAGGTTCTTCTCAAGTTTGATGTATCTGAGGAACAGATCGCAAAACTGACCAGATGGCACCGAATTGCTATGATACGCAGGCTTTCAAGTGAGCAAGCGGCTTCTGGGGTGCAAGTTGATCCAACAACCATCAGCAAGTATGCACGTGGCCAGCGAATGTCCTTTCTGCAACTGCAGCGTCAGACGAGGGAAAAGTGTCAAGAAATTTGGGAGCGGCAAATTCAGAGTCTTTCAACCTCAGATGGTGCTGAGAATGAGAGTGACTCTGAAGGAAATAGTGACCTGGATTCCTTTGCTGGAGATTTAGAAAATTTGCTGGATGCTGAGGAATTTGAAGATGAAGTTGATACATTTGAGATCCGACATGAAAAGACTGATGGGGTCAAGGGTCTTAAAATGAGGAGACGTCCATCCATTGCTCAAACAGAAGAGGAAATTGAAGATGAAGTGGCTGAAGCTACAGAGTTGTGCAGGTTGCTGATGGACGGTATGGATAACTCTTGTATTGTGATACTTGCTTATCTTGGCTGTGGTTTCTCTGTATCATTCTTGCTTATCATTGATGGAATGGTTATTTTTTTGGTGATAA
ATGAGTGTTTTATCAGCATTGTTGGCAATAATATCATTTAAATATTGACATCTTGTTCCTTTGAATTTTATAAACTTTTCCAAAGTTTTCGGTGATATTAACATTTTATGTGTATGATTGACATTTCCGTAGAATCTTATTAAAAACCTCTTCACTTAAGTCAATATTCTCATCCTCTATTGATCCACTTTATGAAAAGGAGGGAAAAGTGTAGATGTAGCTTTTATAAAGTTCAACTTCATTTTAAAAAAGACGTGGAGTTCCTTCCCATTGTTCTAGTAAGAAGTTTTACTTCCAAACTAAATTAGTAAGTTTTGTATCTACTTTTGCCTGGTTAGTCATATATTTTCTCTGGGCATGGCATACATCAGTGGTGAAAACTTGGGAGATTCCAAGCGCTCATAAGTTGAATGCATCCGATCCCACCTATCATCGAAACTTTATTACTCCTAAAATAGCATACTTGTTTTGGAGTTAAAGATGGGAGGATTCATACCCTTATGTAGCATTAACGTGGAATATTTTTGGCAGATGAGGCTGAGAGGAGGAGGAAGAAGAAGAAGAATAAAGTCATGGGAGAAGCAATATTGGTGCCAGGCCTACAAGCTAGTTTCGGTCATGAGATTCCTCAGCAGACCAGGCATCTTGTTAGTATTGCCCAACCTGATATAGCCTATACTTCTAAAGAAAATATAAGAGATCAAAAGGAGGTATTAATATATCATCATGTGCATTTTATTTTATTTCCTTGCCTTCTTAAACATGGAAGATTTTATTTCCTCACATCTAAAGTAGATTTATAGATAATTAGTGCCATGTGACGTGATGGCAGGTGGAAAGTATTATTAATAGAAAAGACAAGTCTGGAAAGTTCAAACCCATGAAGAAAAATTATAGTTCAGAGATGAGCCTACTCAATAAGAAACTGAAGATATCAGGAGACAAAGTCAAGGTGAGTTAGTTGATTCTTCTTTATATTTTTATCTGTAGGTGGGACAATGGGATTTACTTGTTTCTCTTACTGTTTTTATATTCTGCAGATATTCTAGTTTTCAAACTAATAAAAATAGGCCTAACCTAGCCTAAGTTATAGGGTTTATCCTTAATTATGGGGATAACTTTTAATTCACATCCTTGGTGGTTTTCCCATTGCGGGATACAATTTATGGTAGAATCAGTTCTTTTCTTGGTTCATCGTGGATAGGTTTTGAGCAACTTTAGAAACCTGTAGCCACACTTTTCCCAACCTTCATCAGTAGAATTGATCTAGTTTTCATTGCTGAATACTGAGGCTCTCATTTCGTTCACTAGCATGATTTACATAACTTTTTCTATCTACCAAATACTAATAGTACTGATGTAGCATATTATCAATGTTTTAAAAATCGTAAGGCGGGTTTGAGGCTTTTTCTCGTGAAGAGGTGAGGCGTAAGTAAAAAAATGATAGGAATGTAAAATTTTGTATTATTAATATGATAGTGTTCATATAGAAGTATGAAATACATCATTTAGAACTAAAAAAGTAATTCTAAAAAATTTGTAAAATAAAAAGGAATATCGAAAATATTAAAAAACAAAAAGGAAAAACATGGAGAAAAGAAAAATTAGAAAAAACACATGAAAAAACACATCAATTACTTTTATTAGAAGTTAGAACGGAAAACTTTACTTAACCAAAGTTAAAAAACTTGTAAAATATTTCGATGTAATAATTGTTGAAAATTTTGTAATCCATATAACAGTAGAAATAAGGAAGGCTGGCAGTTGGGCTGGCAGTTGTATCACATAAAAGAAAGAGAAAACAAAACAGACAAAAAAAAATTGGTTGCTTTCCACCACCGTAAAACAAGCCAGTAGCACAAAGGAAAAGAAAGTTGAGCAATAAATGAGTCAAATACTGTCAAAAGCCACCGGATGAAGTCTTTTTGTATAATTCCACCATTGGAGCTTAAGGCTTACGCCTTTCAAGCCTCAGAGGCATAAGCACGCCTCCATCTTTA
GAGGCGGAAGCCTCATTTTATGTTGCACCGGCGTAAGCCTTGAGGCACCTCAAGTCCGCCTCGCCTCAAGGCGCGCCTCAAACGATTTTTAAAACATTGCATATTATGTGTTTAACTGGGGTTGTATTGTAATTTTGTTTAAGTTGGTTGATTCTGTTAACGAGTTTAGTTCGATTAGTGGGATTAACTTCCTTTAGGTGGAAATGGATTAACACTCTATTTATAGAGAAGTTAATCCCACTACTGGACGGTTAATGAATACATATAAGACAAATAATGAATGTGTGACATTTTGATGCTTGGCTCTTCAAACAATTTTTAAACAATTAAACTCGAGTGATAAATATTTTAATTTTTTATTTATTTATTGGACATACTGAAATTTCACAATCAAACCCTGGTTAGAACCTTGAGGAAAAACCATTAACATTCCAAATACGAAGTTCGCACAGTAGTTACCCAAGTAGATAGTTACCTAAATGGTTCGTGCTAGGGCTCAGGATGGTACATAATTTTTTTAAAAGAAGGATCATATTCTTTACGAGAGGTATCAAGTTTTTTGTCAAGTAGAAGTTTTAATAAATTTTTACAAATTTAGTATCTACAATAATCTCAGTCCAAGGTTTAGACTATAGTCCGCTGATCCAATACTGGAGAACTACCTGATAAAGGTCGTAGCTAGAGCCAAGAAAATGGAAAAGGGTGCTTAACCTTACCATAGAAGTGTTTAACCTGAAAAAATTATCCTCGGATAATTCATATCAATTTTGGAATAAGGATTGGACAATGAAGAAGCGTTTGAAGGGGTGACTTGACAATTTATCGGCAAGGGCGTAGAGGAAATAGTTCCTACCAAGCAAAGCCTTTAATGAAACCAGAAAGGGTAACTTTCTCCAAGTGGCGACTCAGCCAATCCCTGGCTTGAATCCAAGTCCTTTACCAATGGAGCCGCCCTCTTGGGGTCAATGATCATTCCACTATTTGTTTTAAAGGCTCTAGAATTTAGGATTAGAATTTAGGATTAGACAGCCATATTTTCTAGAACAGAAAGGGGCAGGCCGCCCAGATGAGATAGCTAATTTCAAGAAGAGGACGAGAAAGTGGTCTACTGGTCTGAGGTTAGGCCTTGAAAGCTCGCAAACTTTGGATTGCTGGAGCTTGTCCTCCCAATCAAGGGTGATCGTAAATCTATCTGTGGGTGGGAGAGGATCACATTATTTCTAAGGTTTGACTTAGTGAAATGGTCATTTTGAATAGGGATGTCACTAAACTTTTCATCATTTGGTTTCGCAACATTAATACCACCTAGCAGCCAAAAATGACAGCAAAACCTTTCCAATTCATTCACCTCTTGCTAGAGTACTTTCGGCCATGACTTGAGGGAGGACATAGATATCAGAGAAAGCCACTAATAGAAGCCCATCTTTCAACTCCAGATTAATAGAGGGAACTACCTTCCACCACCTCTAAGGCTTTCGTAGCTAGATCATCCCAAAGGAGAAGAATGACCCTAAAGGAGATCTAGGGCATCCTGGGAGGCCAATTCAATTATTTTAAAACTTCAGATTGACTTGGTAAGCTTGTTTTCTGTACACTTGAGTTTGGGCTTAAAGAGAACGACAACATCAGGACTAATAAGGAGGATAAGATTTTTAATTATGGCTCTTTTGTCGGGGGAACCTATCCGGAACATATCTTCCTATAGAATTAAAGACTGGAATGTTGCTCAGAACAACCTTTCTCGTTAATAGAAGAGGTTATATTTCTGAGCTCTCTATTCCCTTTGGATATTTTGTCTTAACTAGGATTTCCTAGTAATTTGCATTAAGGAAAAGTCATCGTCCTTAAGGCCATCTTGCTACGCCTGTTTGGGGATGTAGCCTCTCCTAGTGTTGAAGATCTGTGTCTTTCTTTGACTTGGGCAACCCTCATTCGGCTCTGGGGTTAGACTGACCCTCTTTTTTGGAATCATCTTAAATTGTAGGATAAAGAGGATGAAA
AGCTTCTTGAGACTTGACTAATACTGGTGGAGATTCCCTTGCTGTTTCAATGTTAGAAGTTTGATCGGCCCACTTTTTCTTTGATAACAAAGGCCTGAGGGTAGAAGCTTGAACATTTAGCCTCAAAGAGAGGGCAATAGTAAGATATACTTTTGCCTGACAGTGAACCAACTGTTGGAGTGGATTAGGAATTTCCTTTTTCTTTAGATTCATGCTTTTAGGCTGGAGATGTGCTTTCTAAGTGTGATGTAGTCTTTGGAGGTAAAGATGGATTGTCTCTGCTTTTTAATTGATCTTTATTACTTTGGCTTTTAATAATAGGCTTTGAAATAGGGTTGAGGAACGGAACTGATTACAGCTCAACGTTGCAGCCTGTCAAAAAGAAACAATTTGAGTATTGAGTAGGTTGAGCATTCAAGTATTTCAGAGCCCAAGAGGGAGGATGATCGGTCCACTGGCATTGGCTTTTCCTTTTCTTTTTCTATCATAGACTTCTGAAATAAATCTCGTGAGTCCAAGTATCAGTTACGATTCCGTGCATCCCATATCTTTGTCAGAGAGGCTGTAAAAGGCTGAGTTTTACATGGGTTTGGGCTGGCCAGACCATCCTTCCCCCCTAAAGCTATTTTAGGCATGACGCCATGTTGAATTCTTTTATAACGAGCCATTTGCTGTGGGAGTTTACATGATGATTCAATTAGCAATTTACTATAAGCACAAACGAAAATAGATCAATTCTGATGAGTAACTTATTGATCTTCCAGCCGATTTCAATATTAGCCCACCTTTAATGTATCTGATTTTTATATTTATCATCCACCAGGCATTTTTTCTTTAGGTTATCAACATGGGATCAGTTCCTATATTTTGAGTAGGGGTGTGATGAGAGCGGAGGAAAATGAGAAAGCAAACCAAACGAAAAGGAGCTGAACCAGTCAGTTTAGTCCCTTTTAAAAAATTTACTTTCCTCGGTTTGTCTTTTTAAAAGGCCACTATTTTCGGTTTTGGTTTCGTTTTTTTTTTTCCTTAAATAGGAAACAAAAAGTGAATAAAACTGAAATAATTATAAAAATTTTAAAATGAAAAATACCTCCAACCCAAGCAACCCTCCTTTGTCCCCATATCCCAAGCTCGGCCCACCCAATTTTTAAGTCGTGACTTGTGAAATCCTCTCTTTACTGCTCTCTCCCTTCAGCACTCCCTCTGGCTGACCCTCTCTTTTCCCGTCTATGCTGAAAACTCCTCTCCAAACCTATGGTGGCTGCTCAAAATGCTTCATTCCTTCCTTCTCTCGACCACCTACCGAACTCACTATAACTAGTTCAAACTGAACTGGTTTGAACCAAGCAAAGCCAGTTTGAACTGAACCAGTTTTGGGGAAAAAATGGTTTTCTCGATTTAGCTTTACCAAGGTACTATAGTTTTAATTTTGGCTAGAACCGTGCACAACCCCGGTTTTTGGGGTCTGTAAAAATGAACCAAAGATGTCTTTTTGTTAGTCAGCTTGAGTTGTAGTAGTTGGACTAGGGAGTTGGTTTTAACTGTCAGATTTTAAAAGGATAGTTTGCTACTGTATTATTCTGCGAAGTTTCATTATAATACCAATTGTTGAATTTTTCTGAACCACATGTGCGTCAGTCTTGGATCATTTCTAATGTAAAATACATATTTCATGCCAATATGATACATGCTTATCTAATGTGCCAAATTAGTTGTAGTTTAATAAGGAAAAAGCAATCGCTCTCATCATCATTTATATTATGTAATTTTTTGGTAAAAAGTAGTAGCTTAAATTCTTGATCTTCCTGTTTAATTTGTTTCTACTCTATCTCTCTTTTTTTGTTTTTGTTTTGTTTTTACATTAATCGTTAGTTTTGGTGATGGGAGTTTTGGTGACGGGAGGGATTAGGCTGGGAATAAGGTTGATACTGGTGGTTTTTTCATCATTACTTTGATATGCTAGTCGAACCACATTAATTTAACCTCTTCTTGATCAGAAT
TTCAAGGAGAAGAAATCAGCTCGAGAAAGCTTCGTGTGTGGAAAGTGTGGTCAGGTACGTGAAGTATTTAGTTTAAAATTTCCATCCTTAACTGAAGTCATTATTATCGTTAAATGTTTGGCAACTAAAGTAAAATCCAAAAATAAATATTTTTTTAAATAATGAGCTGACAATTTTTGTCGGCATAGAAATAGGTCATCACATGTAAGTATTTTTCTTCTGAACCCTCAATATGATAACCTTTTTGGACTTCTTTTACCCTCAGTTTGGACACATGCGGACAAACAAGAACTGCCCCAAGTATGGAGAAGATATGGAAACACCGGAGACAACAGATCAAGAAAAAGTATCTATAAAGTTGAATACCGTGGACCCATCAAGCCAATCCCATCAGAAAGCTCACACTAAGAAGGTTACACCTAAAACCATCACAAAAATTTCTACAACTGAGACATTTGAAGGTGAAAAATCTACATTGATGGCAAAAATGCTTCCAGTCAAATTCAAATGCAGCTCGAGTGAGAAGCTTTCTGACAATCTTAGTCCTGGAGTGCCGCAGACTTCCGACCTGCCATTCAATTCTGACAATGAAACTGGAAAATCTGTTGTCAAGGTTAACAAGATCACGTTTTCTAAGAAGAGAAATGAAGACATTCAGTTTGAATCTCATAAACCCTCAATTGTGATACGACCTCCTGATGCTAAAAAAGTTTATGCTGAAGCTCATAAGCCCTCCATTGTAATAAGACCACCAACAAGTATTGATAGAGACAGAATGGAATTTCCCAGACGCTCGGCCACCGTAGTAAGGTCACCAGCTGAAACAAAAAGAGAGAAGCTTAACAAAAAACTCATAATCAAAAGGCCAAAAGAGGTTATTGATTTGGATCAGATGGGTTTTGATGCGAGTGCTGGTATGCAATACAGGAAGACTAAAAGAATAGTTGAGCTGTCGAGTTTCGAAAATCATACAAGGCCTGGAAGCATGAGTTCAGCTGAGTCTGGAAAAAAGAAAGTTAGAGAAGATCAAATATGGTGGGAAAAGCAAGAGAGGCAGAAAAATGAAGAGAGACTAAGAGAAGAGAAGGCGAGGAGGGTTTACAAGGAAGAAATGAGGATGCGGGATGAACAAGAGAAGTTAGCAGAGATTAGAAGATTTGAAGCCAGCATTAGAAGTGATAAGGAGGAAGAAGAACGTATAAAAGCAAAGAAGAAGAAGAAGAAACGAATCCCGGAAATAATGGATGACTATATGGAGGACCCCAGATCAAGACAAAGGATTCCAGAACGAGATCGCGTTGTGAAAAGGAAGCCTATTGAGTTGGGAAGACATGGTGCAGAGCACGCATCATCGACAAAGCGCCGTAGAGGGGGAGAGGTATTTTTGTGCTTTGTATGATTTTCCTCTACCCTCGTGCGATTCACATAACTACATTCATCTACAACTCTGGTAATTTTTTATTGCAGGTCGGTTTGTCGAATATTTTGGAGCGGATTGTGGAGGCCCTCAAAGACAACTTCGAAATTTCTTATCTCTTTTTAAAACCAGTGTCCAAGAAGGAGGCTCCTGACTACCTCGATATCATAGAGCGTCCAATGGATCTTTCTACCATCAGAGAGAAGGTTAGAAAGTTGGAATACAAGACCAGGGACGAGTTCAGAACTGACGTATGGCAGATAATGTATAATGCTCACATGTATAATGATGGGCGCAACCCAGGTATTCCTCCCCTAGCAGACAAGCTTTTGATGCATTGTGACAATCTATTAAACGAAAATGACGATGAATTGACTGAAGCCGAAATCGGAATCGAGTATAGGGATTCATAGGGGTGGAGTACTTTTCGGATTGAGCCTTTTGTGTAGTATATACATGTAGTCTAGTGTAGTGGGGCAGACCCCTCATCTCCATGGCAAGTAACTTAACATATAGCGTTCAGTTTAGGAAAGGTGGTAGCAATTCGTCGAAGAGAGTTAAGGGTAAGAAGGCAGGTGTAGAA
CACGACCTTGCCCAATTGAATGTTTTGCCAAAAAGATTAAAGAGAGATTTGAGGGTTAGAACTTGTTGATTAACCAGCGTTTCATAGCGTATAGCCATGTATATAAAACCCGAGTTGAAAATTTTTTGGGGTTCAATAGAAATGTATTTTATTGTCACCATAGTGATGTCCATACGAATGTATGTAAAACCTGAGTTGAAACTTCTGTTTGGGGATTAATATTAATGTATTTTGTTTCCCCCAAAGTTGCTATTGACACATTTTATACCTAAGGTCAAGTGGTTTGTGTTTTCAGTAATTGACTGTAGAGGCAAGAAAAGAAAATAGGAAATATATTTTTAGTTGGCTGAACTTTGAATTAATTCTTGAAGAAAATAGATGGAGC

mRNA sequence

TACATACTTTATATTTTGGTTGCAGATGACGATGACTATGAGGATGCTGGTGGTGGCAATCGATTCTTGGGGTTTATGTTTGGAAATGTGGATAATTCTGGTGATCTTGATGCTGATTATCTCGACGAGGATGCTAAGGAACATCTTGATGCATTGGCTGATAAGCTGGGTTCAACTTTGACAGATATCGATTTGTCAACGAAGTCAGCAAAAACACCATCGGACGCTGTTGAACCAGACTATGATGCAAAGGCTGAAGACGCAGTTGATTATGAAGATATTGATGAAGAGTACGATGGTCCAGAGATTGAAGCTGCTGGTGAGGAAGATCATTTATTGCCAAAAAAGGAATATTTTTCTACTGAAGTTTCTTTGGCTACGCTGGAGCCCACAGTTTCTGTATTTGATGATGAAGACTATGATGAGGACTTTGAAAAGGTGCATGATGTTATAAATAGCAGTGTCGAAGCTCGAACTACCCATGCGTCAGATGAGAAAGGTGAGTGCCTTGAGGTGGCTTATGAAGGAGAAAAATCTGTTGCAGATGATGATATACAATCTGCTTCTCTCAATAATGAAGTTATAACCAGTAGTGCAGAAGAATTGCTCGAGGAGACACCTGAAGTACAGAAAAAACTACTGGACGAGAAAGCTCATACTCCCTTACCTGTATTGTGCATGGAAAATGGGATGGCGATCTTACAGTTTTCTGAAATTTTTGGTGTTCACGACAGTTTGAAGAAAAAGGAGAAGAGAGAATCTAGATATTGTACTCGTAGAGATAAATATAGGTCTGTGGATGTATCTGATATTGTTGAAGAGGATGAAGAGGCATTCTTACATGGCTTCAGTCAAGGTGTATCGTGTGTGAAACCAGCATCTGTCGTAAAAGATGATACTACTATGTTTAATCTGGATGACCCAGAATTTACTAAATTTGGTGTAGTGCAAGGGGTTGATGTAATGGCTGCAAGAGTGGACTGGCGTCAAAAAGACAATTGTTGTGGTGCAGAACCCATGAAACAAGTCTTTGCAGAAAATATTTCCATTGGATCAAATTCATTATTGTTTAAAAAATTTTACCCCCTTGATCAGCAAAATTGGGAAGAGGGGATTTTGTGGGATAATTCTCCGGTCTTAAGTAAGAACTCTGCTGGTAGTTGTGAGGTATCTGGATCTGATCTGGAAGATTCAGTTAGTAGTGATGTGGAACAACAAGTTTCTATTCAGATAGTTCGATCAGAGCACCATATAGATCCAAATGACAGGAGACAGAGGCTTTCCCAGCACGACCTTCCATTACTGGAGCCTTTTGGCTCAAGGAAATTTTCAGGGCCAGAAGAACCTTTCTCACCAGAAATGATTTATCATCCTCAAATGTTGAGACTAGAATCCTGGAAGGATGTGGACGGATCCTGCCAATCAGAGGGTACGAGGGAAAATTTTTCAGAGGAGCATCAAAGCAATGCTATAAGATGCTTTAGTAAATTTTCCCCAAAGAACAGAAGAATGTTGGAAGGTTCTTGGTTAGACAAAGTATTATGGGAATCAGATGAACCCATTGAAAAACAAAAATTTATCTTTGATCTTGAAGATGAACATATGCTTTTTGAAATCTCAGATGAAAAGGAATCTAAATACATTCAGTTTCATGCTGGAGCTATGATTTTAACACGGTCATCAATGTCAGTCAATGGAAATTCTTTTGAGATATCTGGCAGTGGAGGTCAAGGTGGCTGGAGATTTGTTTCTAATGACAAACACTATTCCAATAGAAAAGCATCTCAGCAACTCAAATCAAATTCTAAAAAACGTTCTGTTCATGGTGTCAAAGTTTTCCATTCGAAACCTGCGATGATGTTGCAGACAATGAAGCTAAAGTTGAGCAACAAAGAGCTAGCTAATTTTCATCGGCCTAAAGCTTCATGGTATCCCCATGATAACGAGATGGCTGTCAGAGAGCTACAGAAGTTGCCTACTCAAGGACCAATGAAGATTAT
TCTGAAAAGTTTAGGAGGCAAAGGCAGCAAACTTTTTGTGGACTCTGAGGAGACTGTTTCGTCTCTCATGGCCAAAGCTTCAAAGAAGCTAGATATGAAGTCATCTGAAATAGTGAAAGTATTTTATTCTGGAAAAGAGCTTGAACGAGAAAAATCTTTAGCTGCCCAAAATGTGCAACCAAATTCCTTACTTCATCTTGTTCGTTCGAAAATATTTGTAATGCCATGGACACAACATTTACGTGGTGATAGTAAATCCGTGAGATCTCCTGGGGCATTCAAAAAGAAATCTGATCTTTCTGTGAAAGACGGACATGTGTTTTTGATGGAGTATTGTGAAGAAAGGCCTTTACTTTTGGGCAACATTGGAATGGGGGCAAGATTGTGCACTTATTACCAAAAATCATCCCCTGATGATCAAACTGGTGCATTATTGCGAAATGGGGGTGATAGCTTAGGGCATGTCATTGTCCTTGAACCTTCTGATAAATCTCCTTACATTGGAGATATAAAAGGTGGCTCAGTACAGGCATCTCTTGAAACAAACATGTACAGGTCACCTATATTCCCCCATAAGGTGCCAATGACTGACTATATATTGGTTCGTTCTGCAAAGGGGAAGCTTTCCCTCAGACGTATTTACAGAAATTTTGCAGTTGGCCAGCAGGAGCCGTTGATGGAGGTGTTTTCTCCTGGAACAAAGTCTCTTCAAATGTTTATGATGAATAGATTAACTTTATATATATTTCGTGAATTCCTTGCTGCTGAAAAGCGTAGAAGACTTCCTTACATTCGTGTTGATGAACTACCTTCTCAGTTTCCGTATCTGTCTGAGACTGTGATTAGGAAGAAATTGAAGGAATATGCTCTTCAACAGAAAAGTTCTAATGGACAAACCATTTTGATTAAGAAGCGTAATGCAAGTATTTCTTTGAAGAAAGATGCTGTGACACCCGAGGATGTATGCAAGTATGAAAGCATGCAAGCTGGTCTGTACCGCCTTAAACATTTGGGAATTACTGAGGTCCATCCCTCCGCAATTTCATCTGCAATGAGTAGACTTCCTGATGAAGCTATTACTCTGGCTGCTGCATCTCATATTGAAAGAGAGTTGCAGATAACTCCCTGGAACTTGAGTAGCAATTTTGTTGCTTGTACGACCCAGGGAAAAGAAAATATAGAGCGGTTGGAAATTACAGGTGTTGGTGATCCATCTGGCCGTGGACTTGGTTTCAGTTATGTTCGTTCAGTTCCAAAAGCACCTATTTCTAATGCATCTTTGAAGAAAAAAGCAGCTTCTAGTCGAGGAAGCTCTGCAGTTACAGGAACAGATGCTGACCTACGTAGGTTGAGCATGGATGCCGCAAAAGAGGTTCTTCTCAAGTTTGATGTATCTGAGGAACAGATCGCAAAACTGACCAGATGGCACCGAATTGCTATGATACGCAGGCTTTCAAGTGAGCAAGCGGCTTCTGGGGTGCAAGTTGATCCAACAACCATCAGCAAGTATGCACGTGGCCAGCGAATGTCCTTTCTGCAACTGCAGCGTCAGACGAGGGAAAAGTGTCAAGAAATTTGGGAGCGGCAAATTCAGAGTCTTTCAACCTCAGATGGTGCTGAGAATGAGAGTGACTCTGAAGGAAATAGTGACCTGGATTCCTTTGCTGGAGATTTAGAAAATTTGCTGGATGCTGAGGAATTTGAAGATGAAGTTGATACATTTGAGATCCGACATGAAAAGACTGATGGGGTCAAGGGTCTTAAAATGAGGAGACGTCCATCCATTGCTCAAACAGAAGAGGAAATTGAAGATGAAGTGGCTGAAGCTACAGAGTTGTGCAGGTTGCTGATGGACGATGAGGCTGAGAGGAGGAGGAAGAAGAAGAAGAATAAAGTCATGGGAGAAGCAATATTGGTGCCAGGCCTACAAGCTAGTTTCGGTCATGAGATTCCTCAGCAGACCAGGCATCTTGTTAGTATTGCCCAACCTGATATAGCCT
ATACTTCTAAAGAAAATATAAGAGATCAAAAGGAGGTGGAAAGTATTATTAATAGAAAAGACAAGTCTGGAAAGTTCAAACCCATGAAGAAAAATTATAGTTCAGAGATGAGCCTACTCAATAAGAAACTGAAGATATCAGGAGACAAAGTCAAGAATTTCAAGGAGAAGAAATCAGCTCGAGAAAGCTTCGTGTGTGGAAAGTGTGGTCAGTTTGGACACATGCGGACAAACAAGAACTGCCCCAAGTATGGAGAAGATATGGAAACACCGGAGACAACAGATCAAGAAAAAGTATCTATAAAGTTGAATACCGTGGACCCATCAAGCCAATCCCATCAGAAAGCTCACACTAAGAAGGTTACACCTAAAACCATCACAAAAATTTCTACAACTGAGACATTTGAAGGTGAAAAATCTACATTGATGGCAAAAATGCTTCCAGTCAAATTCAAATGCAGCTCGAGTGAGAAGCTTTCTGACAATCTTAGTCCTGGAGTGCCGCAGACTTCCGACCTGCCATTCAATTCTGACAATGAAACTGGAAAATCTGTTGTCAAGGTTAACAAGATCACGTTTTCTAAGAAGAGAAATGAAGACATTCAGTTTGAATCTCATAAACCCTCAATTGTGATACGACCTCCTGATGCTAAAAAAGTTTATGCTGAAGCTCATAAGCCCTCCATTGTAATAAGACCACCAACAAGTATTGATAGAGACAGAATGGAATTTCCCAGACGCTCGGCCACCGTAGTAAGGTCACCAGCTGAAACAAAAAGAGAGAAGCTTAACAAAAAACTCATAATCAAAAGGCCAAAAGAGGTTATTGATTTGGATCAGATGGGTTTTGATGCGAGTGCTGGTATGCAATACAGGAAGACTAAAAGAATAGTTGAGCTGTCGAGTTTCGAAAATCATACAAGGCCTGGAAGCATGAGTTCAGCTGAGTCTGGAAAAAAGAAAGTTAGAGAAGATCAAATATGGTGGGAAAAGCAAGAGAGGCAGAAAAATGAAGAGAGACTAAGAGAAGAGAAGGCGAGGAGGGTTTACAAGGAAGAAATGAGGATGCGGGATGAACAAGAGAAGTTAGCAGAGATTAGAAGATTTGAAGCCAGCATTAGAAGTGATAAGGAGGAAGAAGAACGTATAAAAGCAAAGAAGAAGAAGAAGAAACGAATCCCGGAAATAATGGATGACTATATGGAGGACCCCAGATCAAGACAAAGGATTCCAGAACGAGATCGCGTTGTGAAAAGGAAGCCTATTGAGTTGGGAAGACATGGTGCAGAGCACGCATCATCGACAAAGCGCCGTAGAGGGGGAGAGGTCGGTTTGTCGAATATTTTGGAGCGGATTGTGGAGGCCCTCAAAGACAACTTCGAAATTTCTTATCTCTTTTTAAAACCAGTGTCCAAGAAGGAGGCTCCTGACTACCTCGATATCATAGAGCGTCCAATGGATCTTTCTACCATCAGAGAGAAGGTTAGAAAGTTGGAATACAAGACCAGGGACGAGTTCAGAACTGACGTATGGCAGATAATGTATAATGCTCACATGTATAATGATGGGCGCAACCCAGGTATTCCTCCCCTAGCAGACAAGCTTTTGATGCATTGTGACAATCTATTAAACGAAAATGACGATGAATTGACTGAAGCCGAAATCGGAATCGAGTATAGGGATTCATAGGGGTGGAGTACTTTTCGGATTGAGCCTTTTGTGTAGTATATACATGTAGTCTAGTGTAGTGGGGCAGACCCCTCATCTCCATGGCAAGTAACTTAACATATAGCGTTCAGTTTAGGAAAGGTGGTAGCAATTCGTCGAAGAGAGTTAAGGGTAAGAAGGCAGGTGTAGAACACGACCTTGCCCAATTGAATGTTTTGCCAAAAAGATTAAAGAGAGATTTGAGGGTTAGAACTTGTTGATTAACCAGCGTTTCATAGCGTATAGCCATGTATATAAAACCCGAGTTGAAAATTTTTTGGGGTTCAATAGAAA
TGTATTTTATTGTCACCATAGTGATGTCCATACGAATGTATGTAAAACCTGAGTTGAAACTTCTGTTTGGGGATTAATATTAATGTATTTTGTTTCCCCCAAAGTTGCTATTGACACATTTTATACCTAAGGTCAAGTGGTTTGTGTTTTCAGTAATTGACTGTAGAGGCAAGAAAAGAAAATAGGAAATATATTTTTAGTTGGCTGAACTTTGAATTAATTCTTGAAGAAAATAGATGGAGC

Coding sequence (CDS)

TACATACTTTATATTTTGGTTGCAGATGACGATGACTATGAGGATGCTGGTGGTGGCAATCGATTCTTGGGGTTTATGTTTGGAAATGTGGATAATTCTGGTGATCTTGATGCTGATTATCTCGACGAGGATGCTAAGGAACATCTTGATGCATTGGCTGATAAGCTGGGTTCAACTTTGACAGATATCGATTTGTCAACGAAGTCAGCAAAAACACCATCGGACGCTGTTGAACCAGACTATGATGCAAAGGCTGAAGACGCAGTTGATTATGAAGATATTGATGAAGAGTACGATGGTCCAGAGATTGAAGCTGCTGGTGAGGAAGATCATTTATTGCCAAAAAAGGAATATTTTTCTACTGAAGTTTCTTTGGCTACGCTGGAGCCCACAGTTTCTGTATTTGATGATGAAGACTATGATGAGGACTTTGAAAAGGTGCATGATGTTATAAATAGCAGTGTCGAAGCTCGAACTACCCATGCGTCAGATGAGAAAGGTGAGTGCCTTGAGGTGGCTTATGAAGGAGAAAAATCTGTTGCAGATGATGATATACAATCTGCTTCTCTCAATAATGAAGTTATAACCAGTAGTGCAGAAGAATTGCTCGAGGAGACACCTGAAGTACAGAAAAAACTACTGGACGAGAAAGCTCATACTCCCTTACCTGTATTGTGCATGGAAAATGGGATGGCGATCTTACAGTTTTCTGAAATTTTTGGTGTTCACGACAGTTTGAAGAAAAAGGAGAAGAGAGAATCTAGATATTGTACTCGTAGAGATAAATATAGGTCTGTGGATGTATCTGATATTGTTGAAGAGGATGAAGAGGCATTCTTACATGGCTTCAGTCAAGGTGTATCGTGTGTGAAACCAGCATCTGTCGTAAAAGATGATACTACTATGTTTAATCTGGATGACCCAGAATTTACTAAATTTGGTGTAGTGCAAGGGGTTGATGTAATGGCTGCAAGAGTGGACTGGCGTCAAAAAGACAATTGTTGTGGTGCAGAACCCATGAAACAAGTCTTTGCAGAAAATATTTCCATTGGATCAAATTCATTATTGTTTAAAAAATTTTACCCCCTTGATCAGCAAAATTGGGAAGAGGGGATTTTGTGGGATAATTCTCCGGTCTTAAGTAAGAACTCTGCTGGTAGTTGTGAGGTATCTGGATCTGATCTGGAAGATTCAGTTAGTAGTGATGTGGAACAACAAGTTTCTATTCAGATAGTTCGATCAGAGCACCATATAGATCCAAATGACAGGAGACAGAGGCTTTCCCAGCACGACCTTCCATTACTGGAGCCTTTTGGCTCAAGGAAATTTTCAGGGCCAGAAGAACCTTTCTCACCAGAAATGATTTATCATCCTCAAATGTTGAGACTAGAATCCTGGAAGGATGTGGACGGATCCTGCCAATCAGAGGGTACGAGGGAAAATTTTTCAGAGGAGCATCAAAGCAATGCTATAAGATGCTTTAGTAAATTTTCCCCAAAGAACAGAAGAATGTTGGAAGGTTCTTGGTTAGACAAAGTATTATGGGAATCAGATGAACCCATTGAAAAACAAAAATTTATCTTTGATCTTGAAGATGAACATATGCTTTTTGAAATCTCAGATGAAAAGGAATCTAAATACATTCAGTTTCATGCTGGAGCTATGATTTTAACACGGTCATCAATGTCAGTCAATGGAAATTCTTTTGAGATATCTGGCAGTGGAGGTCAAGGTGGCTGGAGATTTGTTTCTAATGACAAACACTATTCCAATAGAAAAGCATCTCAGCAACTCAAATCAAATTCTAAAAAACGTTCTGTTCATGGTGTCAAAGTTTTCCATTCGAAACCTGCGATGATGTTGCAGACAATGAAGCTAAAGTTGAGCAACAAAGAGCTAGCTAATTTTCATCGGCCTAAAGCTTCATGGTATCCCCATGATAACGAGATGGCTGTCAGAGAGCTACAGAAGTTGCCTACTCAAGGACCAATGAAGATTAT
TCTGAAAAGTTTAGGAGGCAAAGGCAGCAAACTTTTTGTGGACTCTGAGGAGACTGTTTCGTCTCTCATGGCCAAAGCTTCAAAGAAGCTAGATATGAAGTCATCTGAAATAGTGAAAGTATTTTATTCTGGAAAAGAGCTTGAACGAGAAAAATCTTTAGCTGCCCAAAATGTGCAACCAAATTCCTTACTTCATCTTGTTCGTTCGAAAATATTTGTAATGCCATGGACACAACATTTACGTGGTGATAGTAAATCCGTGAGATCTCCTGGGGCATTCAAAAAGAAATCTGATCTTTCTGTGAAAGACGGACATGTGTTTTTGATGGAGTATTGTGAAGAAAGGCCTTTACTTTTGGGCAACATTGGAATGGGGGCAAGATTGTGCACTTATTACCAAAAATCATCCCCTGATGATCAAACTGGTGCATTATTGCGAAATGGGGGTGATAGCTTAGGGCATGTCATTGTCCTTGAACCTTCTGATAAATCTCCTTACATTGGAGATATAAAAGGTGGCTCAGTACAGGCATCTCTTGAAACAAACATGTACAGGTCACCTATATTCCCCCATAAGGTGCCAATGACTGACTATATATTGGTTCGTTCTGCAAAGGGGAAGCTTTCCCTCAGACGTATTTACAGAAATTTTGCAGTTGGCCAGCAGGAGCCGTTGATGGAGGTGTTTTCTCCTGGAACAAAGTCTCTTCAAATGTTTATGATGAATAGATTAACTTTATATATATTTCGTGAATTCCTTGCTGCTGAAAAGCGTAGAAGACTTCCTTACATTCGTGTTGATGAACTACCTTCTCAGTTTCCGTATCTGTCTGAGACTGTGATTAGGAAGAAATTGAAGGAATATGCTCTTCAACAGAAAAGTTCTAATGGACAAACCATTTTGATTAAGAAGCGTAATGCAAGTATTTCTTTGAAGAAAGATGCTGTGACACCCGAGGATGTATGCAAGTATGAAAGCATGCAAGCTGGTCTGTACCGCCTTAAACATTTGGGAATTACTGAGGTCCATCCCTCCGCAATTTCATCTGCAATGAGTAGACTTCCTGATGAAGCTATTACTCTGGCTGCTGCATCTCATATTGAAAGAGAGTTGCAGATAACTCCCTGGAACTTGAGTAGCAATTTTGTTGCTTGTACGACCCAGGGAAAAGAAAATATAGAGCGGTTGGAAATTACAGGTGTTGGTGATCCATCTGGCCGTGGACTTGGTTTCAGTTATGTTCGTTCAGTTCCAAAAGCACCTATTTCTAATGCATCTTTGAAGAAAAAAGCAGCTTCTAGTCGAGGAAGCTCTGCAGTTACAGGAACAGATGCTGACCTACGTAGGTTGAGCATGGATGCCGCAAAAGAGGTTCTTCTCAAGTTTGATGTATCTGAGGAACAGATCGCAAAACTGACCAGATGGCACCGAATTGCTATGATACGCAGGCTTTCAAGTGAGCAAGCGGCTTCTGGGGTGCAAGTTGATCCAACAACCATCAGCAAGTATGCACGTGGCCAGCGAATGTCCTTTCTGCAACTGCAGCGTCAGACGAGGGAAAAGTGTCAAGAAATTTGGGAGCGGCAAATTCAGAGTCTTTCAACCTCAGATGGTGCTGAGAATGAGAGTGACTCTGAAGGAAATAGTGACCTGGATTCCTTTGCTGGAGATTTAGAAAATTTGCTGGATGCTGAGGAATTTGAAGATGAAGTTGATACATTTGAGATCCGACATGAAAAGACTGATGGGGTCAAGGGTCTTAAAATGAGGAGACGTCCATCCATTGCTCAAACAGAAGAGGAAATTGAAGATGAAGTGGCTGAAGCTACAGAGTTGTGCAGGTTGCTGATGGACGATGAGGCTGAGAGGAGGAGGAAGAAGAAGAAGAATAAAGTCATGGGAGAAGCAATATTGGTGCCAGGCCTACAAGCTAGTTTCGGTCATGAGATTCCTCAGCAGACCAGGCATCTTGTTAGTATTGCCCAACCTGATATAGCCT
ATACTTCTAAAGAAAATATAAGAGATCAAAAGGAGGTGGAAAGTATTATTAATAGAAAAGACAAGTCTGGAAAGTTCAAACCCATGAAGAAAAATTATAGTTCAGAGATGAGCCTACTCAATAAGAAACTGAAGATATCAGGAGACAAAGTCAAGAATTTCAAGGAGAAGAAATCAGCTCGAGAAAGCTTCGTGTGTGGAAAGTGTGGTCAGTTTGGACACATGCGGACAAACAAGAACTGCCCCAAGTATGGAGAAGATATGGAAACACCGGAGACAACAGATCAAGAAAAAGTATCTATAAAGTTGAATACCGTGGACCCATCAAGCCAATCCCATCAGAAAGCTCACACTAAGAAGGTTACACCTAAAACCATCACAAAAATTTCTACAACTGAGACATTTGAAGGTGAAAAATCTACATTGATGGCAAAAATGCTTCCAGTCAAATTCAAATGCAGCTCGAGTGAGAAGCTTTCTGACAATCTTAGTCCTGGAGTGCCGCAGACTTCCGACCTGCCATTCAATTCTGACAATGAAACTGGAAAATCTGTTGTCAAGGTTAACAAGATCACGTTTTCTAAGAAGAGAAATGAAGACATTCAGTTTGAATCTCATAAACCCTCAATTGTGATACGACCTCCTGATGCTAAAAAAGTTTATGCTGAAGCTCATAAGCCCTCCATTGTAATAAGACCACCAACAAGTATTGATAGAGACAGAATGGAATTTCCCAGACGCTCGGCCACCGTAGTAAGGTCACCAGCTGAAACAAAAAGAGAGAAGCTTAACAAAAAACTCATAATCAAAAGGCCAAAAGAGGTTATTGATTTGGATCAGATGGGTTTTGATGCGAGTGCTGGTATGCAATACAGGAAGACTAAAAGAATAGTTGAGCTGTCGAGTTTCGAAAATCATACAAGGCCTGGAAGCATGAGTTCAGCTGAGTCTGGAAAAAAGAAAGTTAGAGAAGATCAAATATGGTGGGAAAAGCAAGAGAGGCAGAAAAATGAAGAGAGACTAAGAGAAGAGAAGGCGAGGAGGGTTTACAAGGAAGAAATGAGGATGCGGGATGAACAAGAGAAGTTAGCAGAGATTAGAAGATTTGAAGCCAGCATTAGAAGTGATAAGGAGGAAGAAGAACGTATAAAAGCAAAGAAGAAGAAGAAGAAACGAATCCCGGAAATAATGGATGACTATATGGAGGACCCCAGATCAAGACAAAGGATTCCAGAACGAGATCGCGTTGTGAAAAGGAAGCCTATTGAGTTGGGAAGACATGGTGCAGAGCACGCATCATCGACAAAGCGCCGTAGAGGGGGAGAGGTCGGTTTGTCGAATATTTTGGAGCGGATTGTGGAGGCCCTCAAAGACAACTTCGAAATTTCTTATCTCTTTTTAAAACCAGTGTCCAAGAAGGAGGCTCCTGACTACCTCGATATCATAGAGCGTCCAATGGATCTTTCTACCATCAGAGAGAAGGTTAGAAAGTTGGAATACAAGACCAGGGACGAGTTCAGAACTGACGTATGGCAGATAATGTATAATGCTCACATGTATAATGATGGGCGCAACCCAGGTATTCCTCCCCTAGCAGACAAGCTTTTGATGCATTGTGACAATCTATTAAACGAAAATGACGATGAATTGACTGAAGCCGAAATCGGAATCGAGTATAGGGATTCATAG

Protein sequence

YILYILVADDDDYEDAGGGNRFLGFMFGNVDNSGDLDADYLDEDAKEHLDALADKLGSTLTDIDLSTKSAKTPSDAVEPDYDAKAEDAVDYEDIDEEYDGPEIEAAGEEDHLLPKKEYFSTEVSLATLEPTVSVFDDEDYDEDFEKVHDVINSSVEARTTHASDEKGECLEVAYEGEKSVADDDIQSASLNNEVITSSAEELLEETPEVQKKLLDEKAHTPLPVLCMENGMAILQFSEIFGVHDSLKKKEKRESRYCTRRDKYRSVDVSDIVEEDEEAFLHGFSQGVSCVKPASVVKDDTTMFNLDDPEFTKFGVVQGVDVMAARVDWRQKDNCCGAEPMKQVFAENISIGSNSLLFKKFYPLDQQNWEEGILWDNSPVLSKNSAGSCEVSGSDLEDSVSSDVEQQVSIQIVRSEHHIDPNDRRQRLSQHDLPLLEPFGSRKFSGPEEPFSPEMIYHPQMLRLESWKDVDGSCQSEGTRENFSEEHQSNAIRCFSKFSPKNRRMLEGSWLDKVLWESDEPIEKQKFIFDLEDEHMLFEISDEKESKYIQFHAGAMILTRSSMSVNGNSFEISGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVHGVKVFHSKPAMMLQTMKLKLSNKELANFHRPKASWYPHDNEMAVRELQKLPTQGPMKIILKSLGGKGSKLFVDSEETVSSLMAKASKKLDMKSSEIVKVFYSGKELEREKSLAAQNVQPNSLLHLVRSKIFVMPWTQHLRGDSKSVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGNIGMGARLCTYYQKSSPDDQTGALLRNGGDSLGHVIVLEPSDKSPYIGDIKGGSVQASLETNMYRSPIFPHKVPMTDYILVRSAKGKLSLRRIYRNFAVGQQEPLMEVFSPGTKSLQMFMMNRLTLYIFREFLAAEKRRRLPYIRVDELPSQFPYLSETVIRKKLKEYALQQKSSNGQTILIKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLGITEVHPSAISSAMSRLPDEAITLAAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSYVRSVPKAPISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMIRRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLSTSDGAENESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRPSIAQTEEEIEDEVAEATELCRLLMDDEAERRRKKKKNKVMGEAILVPGLQASFGHEIPQQTRHLVSIAQPDIAYTSKENIRDQKEVESIINRKDKSGKFKPMKKNYSSEMSLLNKKLKISGDKVKNFKEKKSARESFVCGKCGQFGHMRTNKNCPKYGEDMETPETTDQEKVSIKLNTVDPSSQSHQKAHTKKVTPKTITKISTTETFEGEKSTLMAKMLPVKFKCSSSEKLSDNLSPGVPQTSDLPFNSDNETGKSVVKVNKITFSKKRNEDIQFESHKPSIVIRPPDAKKVYAEAHKPSIVIRPPTSIDRDRMEFPRRSATVVRSPAETKREKLNKKLIIKRPKEVIDLDQMGFDASAGMQYRKTKRIVELSSFENHTRPGSMSSAESGKKKVREDQIWWEKQERQKNEERLREEKARRVYKEEMRMRDEQEKLAEIRRFEASIRSDKEEEERIKAKKKKKKRIPEIMDDYMEDPRSRQRIPERDRVVKRKPIELGRHGAEHASSTKRRRGGEVGLSNILERIVEALKDNFEISYLFLKPVSKKEAPDYLDIIERPMDLSTIREKVRKLEYKTRDEFRTDVWQIMYNAHMYNDGRNPGIPPLADKLLMHCDNLLNENDDELTEAEIGIEYRDS
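
The protein sequence above appears to be a frame-0 translation of the CDS using the standard genetic code. Below is a minimal sketch of how that correspondence can be checked. The names `CODON_TABLE`, `translate` and `cds_prefix` are illustrative assumptions, not part of this database record, and only the first 57 bases of the CDS are used.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), listed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases (e.g. N) are reported as X.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 19 codons of the CDS listed for MC09g0858 (copied from the record above).
cds_prefix = "TACATACTTTATATTTTGGTTGCAGATGACGATGACTATGAGGATGCTGGTGGTGGC"
print(translate(cds_prefix))  # prints YILYILVADDDDYEDAGGG
```

The printed residues match the start of the protein sequence listed above. The CDS ends in `TAG`, which the sketch treats as a stop codon and omits from the output.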

Homology
BLAST of MC09g0858 vs. ExPASy Swiss-Prot
Match: Q8LRK9 (Transcription initiation factor TFIID subunit 1 OS=Arabidopsis thaliana OX=3702 GN=TAF1 PE=1 SV=1)

HSP 1 Score: 1688.7 bits (4372), Expect = 0.0e+00
Identity = 1010/1942 (52.01%), Positives = 1325/1942 (68.23%), Query Frame = 0

Query: 9    DDDDYEDAGGGNRFLGFMFGNVDNSGDLDADYLDEDAKEHLDALADKLGSTLTDIDLSTK 68
            DDD+YED   G   LGF+FGNVDNSGDLDADYLDEDAKEHL ALADKLGS+L DI+L  K
Sbjct: 17   DDDEYEDNSRGFN-LGFIFGNVDNSGDLDADYLDEDAKEHLSALADKLGSSLPDINLLAK 76

Query: 69   SAKTPSDAVEPDYDAKAEDAVDYEDIDEEYDGPEIEAAGEEDHLLPKKEYFSTEVSLATL 128
            S +T SD  E DYD KAEDAVDYEDIDEEYDGPE++   EEDHLLPKKEYFST V+L +L
Sbjct: 77   SERTASDPAEQDYDRKAEDAVDYEDIDEEYDGPEVQVVSEEDHLLPKKEYFSTAVALGSL 136

Query: 129  EPTVSVFDDEDYDEDFEKVHD--VINSSVEARTTHASDEKGECLEVAYEGEKSVADDDIQ 188
            +   SVFDDEDYDE+ E+  +   +  S+E         K E   + YE E S+ D +  
Sbjct: 137  KSRASVFDDEDYDEEEEQEEEQAPVEKSLETEKREPVVLK-EDKALEYEEEASILDKE-- 196

Query: 189  SASLNNEVITSSAEELLEETPEVQKKLLDEKAHTPLPVLCMENGMAILQFSEIFGVHDSL 248
                 + + T   +E  EE  E+ +  LD+K  TPLP L +E+GM ILQFSEIF +H+  
Sbjct: 197  -----DHMDTEDVQE--EEVDELLEGTLDDKGATPLPTLYVEDGMVILQFSEIFAIHEPP 256

Query: 249  KKKEKRESRYCTRRDKYRSVDVSDIVEEDEEAFLHGFSQGVSCVKPASVVKDDTTMFNLD 308
            +K+++RE+RY T RDKY+S+D+S++VE+DEE  L    +  + V+ A +++ D      +
Sbjct: 257  QKRDRRENRYVTCRDKYKSMDISELVEDDEEVLLKSHGRIDTHVEQADLIQLDVPFPIRE 316

Query: 309  DPEFTKFGVVQGVDVMAARVDWRQKDNCCGAEPMKQVFAENISIGSNSLLFKKFYPLDQQ 368
              +  K   + G+   +       +D+C   E +KQ F ++ S    S L  + +PLDQ 
Sbjct: 317  GLQLVKASTIGGITPESREFTKLGRDSCIMGELLKQDFIDDNSSLCQSQLSMQVFPLDQH 376

Query: 369  NWEEGILWDNSPVLSKNSAGSCEVS---GSDLEDSVSSDVEQQVSIQIVRSEHHIDPNDR 428
             WE  I+W++SP +S NS    E        L    +S+ EQ+ S+ +V S   +  ++ 
Sbjct: 377  EWERRIIWEHSPEISGNSGEIFEPGLEPEGMLVKGTNSETEQE-SLNVVNSRVQVQADN- 436

Query: 429  RQRLSQHDLPLLEPFGSRKFSGPEEPFSPEMIYHPQMLRLESWKDVDGSCQSEGTRENFS 488
                      LLE FGSR      E  +    +HPQ+LRLES  D +    ++       
Sbjct: 437  -NMFVPFSANLLESFGSRGSQSTNESTNKSR-HHPQLLRLESQWDENHLSGNDEAGVKKI 496

Query: 489  EEHQSNAIRCFSKFSPKNRRMLEGSWLDKVLWESDEPIEKQKFIFDLEDEHMLFEISDEK 548
            +  + +A+  FS+   + R + + +WLD ++W+S++ + + K IFDL+DE M+FEI D +
Sbjct: 497  KRLEKDALGRFSRLVLRERDLGDEAWLDSIIWDSEKELSRSKLIFDLQDEQMVFEIFDNE 556

Query: 549  ESKYIQFHAGAMILTRSSMSVNGNSFEISGSGGQGGWRF-VSNDKHYSNRKASQQLKSNS 608
            ESK +Q HAGAMI++RSS S    +F+  G     GW+F +SNDK Y N K+SQQL++N+
Sbjct: 557  ESKNLQLHAGAMIVSRSSKS-KDETFQ-EGCESNSGWQFNLSNDKFYMNGKSSQQLQANT 616

Query: 609  KKRSVHGVKVFHSKPAMMLQTMKLKLSNKELANFHRPKASWYPHDNEMAVRELQKLPTQG 668
             K SVH ++VFHS PA+ LQTMK KLSNK++ANFHRPKA WYPHDNE+A+++  KLPT+G
Sbjct: 617  NKSSVHSLRVFHSVPAIKLQTMKSKLSNKDIANFHRPKALWYPHDNELAIKQQGKLPTRG 676

Query: 669  PMKIILKSLGGKGSKLFVDSEETVSSLMAKASKKLDMKSSEIVKVFYSGKELEREKSLAA 728
             MKII+KSLGGKGSKL V  EE+VSSL AKAS+KLD K +E VK+FY GKEL+ EKSLAA
Sbjct: 677  SMKIIVKSLGGKGSKLHVGIEESVSSLRAKASRKLDFKETEAVKMFYKGKELDDEKSLAA 736

Query: 729  QNVQPNSLLHLVRSKIFVMPWTQHLRGDSKSVRSPGAFKKKSDLSVKDGHVFLMEYCEER 788
            QNVQPNSL+HL+R+K+ + PW Q L G++KS+R PGAFKKKSDLS KDGHVFLMEYCEER
Sbjct: 737  QNVQPNSLVHLIRTKVHLWPWAQKLPGENKSLRPPGAFKKKSDLSTKDGHVFLMEYCEER 796

Query: 789  PLLLGNIGMGARLCTYYQKSSPDDQTGALLRNGGDSLGHVIVLEPSDKSPYIGDIKGGSV 848
            PL+L N GMGA LCTYYQKSSP+DQ G LLRN  D+LG+V++LEP DKSP++G+I  G  
Sbjct: 797  PLMLSNAGMGANLCTYYQKSSPEDQRGNLLRNQSDTLGNVMILEPGDKSPFLGEIHAGCS 856

Query: 849  QASLETNMYRSPIFPHKVPMTDYILVRSAKGKLSLRRIYRNFAVGQQEPLMEVFSPGTKS 908
            Q+S+ETNMY++PIFP ++  TDY+LVRS KGKLSLRRI +   VGQQEP MEV SPG+K+
Sbjct: 857  QSSVETNMYKAPIFPQRLQSTDYLLVRSPKGKLSLRRIDKIVVVGQQEPRMEVMSPGSKN 916

Query: 909  LQMFMMNRLTLYIFREFLAAEKRRRLPY-IRVDELPSQFPYLSETVIRKKLKEYALQQKS 968
            LQ +++NR+ +Y++REF    KR    + I  DEL   F  L++ +I+K +K  A  ++ 
Sbjct: 917  LQTYLVNRMLVYVYREFF---KRGGGEHPIAADELSFLFSNLTDAIIKKNMKIIACWKRD 976

Query: 969  SNGQTILIKKRN---ASISLKKDAVTPEDVCKYESMQAGLYRLKHLGITE-VHPSAISSA 1028
             NGQ+   KK +      S  K  V PE VC YESM AGLYRLKHLGIT    P++IS+A
Sbjct: 977  KNGQSYWTKKDSLLEPPESELKKLVAPEHVCSYESMLAGLYRLKHLGITRFTLPASISNA 1036

Query: 1029 MSRLPDEAITLAAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLG 1088
            +++LPDEAI LAAASHIERELQITPWNLSSNFVACT Q + NIERLEITGVGDPSGRGLG
Sbjct: 1037 LAQLPDEAIALAAASHIERELQITPWNLSSNFVACTNQDRANIERLEITGVGDPSGRGLG 1096

Query: 1089 FSYVRSVPKAPISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIA 1148
            FSYVR+ PKAP +   +KKKAA+ RG+  VTGTDADLRRLSM+AA+EVL+KF+V +E IA
Sbjct: 1097 FSYVRAAPKAPAAAGHMKKKAAAGRGAPTVTGTDADLRRLSMEAAREVLIKFNVPDEIIA 1156

Query: 1149 KLTRWHRIAMIRRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQI 1208
            K TRWHRIAMIR+LSSEQAASGV+VDPTTI KYARGQRMSFLQ+Q+Q REKCQEIW+RQ+
Sbjct: 1157 KQTRWHRIAMIRKLSSEQAASGVKVDPTTIGKYARGQRMSFLQMQQQAREKCQEIWDRQL 1216

Query: 1209 QSLSTSDGAENESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMR 1268
             SLS  DG ENES++E NSDLDSFAGDLENLLDAEE  +  ++   +++K DGVKGLKMR
Sbjct: 1217 LSLSAFDGDENESENEANSDLDSFAGDLENLLDAEEGGEGEESNISKNDKLDGVKGLKMR 1276

Query: 1269 RRPSIAQTEEEIEDEVAEATELCRLLMDDEAERRRKKKKNKVMGEAI-----LVPGLQAS 1328
            RRPS  +T+EEIEDE  E  ELCRLLM DE ++++KKKK K +GE +       P +   
Sbjct: 1277 RRPSQVETDEEIEDEATEYAELCRLLMQDE-DQKKKKKKMKGVGEGMGSYPPPRPNIALQ 1336

Query: 1329 FGHEIPQ---QTRHLVSIAQPDIAYTSKEN-IRDQKEVESIINRKDKSGKFKPMKKNYSS 1388
             G  + +     +  ++I QPD ++   E+ I+D + V+SII    K+ K K +K+N +S
Sbjct: 1337 SGEPVRKANAMDKKPIAI-QPDASFLVNESTIKDNRNVDSII----KTPKGKQVKENSNS 1396

Query: 1389 EMSLLNKKLKISGDKVKNFKEKKSARESFVCGKCGQFGHMRTNKNCPKYGEDMET-PETT 1448
               L  KK+KI  + +K FKEKKSARE+FVCG CGQ GHMRTNK+CP+Y E+ E+ PE  
Sbjct: 1397 LGQL--KKVKILNENLKVFKEKKSARENFVCGACGQHGHMRTNKHCPRYRENTESQPEGI 1456

Query: 1449 DQEKVSIKLNTVDPSSQSHQK-AHTKKVTPKTITKISTTETFEGEKSTLMAKMLPVKFKC 1508
            D +K + K ++ +PS     K     K  PK+  K S  E  +G+K +     LP+KF+ 
Sbjct: 1457 DMDKSAGKPSSSEPSGLPKLKPIKNSKAAPKSAMKTSVDEALKGDKLSSKTGGLPLKFRY 1516

Query: 1509 S-SSEKLSDNLSPGVPQTSDLPFNSDNETG-KSVVKVNKI-------------------- 1568
               +  LSD      P +S+    SD +TG KS  K++K+                    
Sbjct: 1517 GIPAGDLSDKPVSEAPGSSEQAVVSDIDTGIKSTSKISKLKISSKAKPKESKGESERRSH 1576

Query: 1569 ----TFSKKRNEDIQFESHKPSIVIRP-PDAKKVYAEAHKPSIVI-RPPTSIDRDRMEFP 1628
                TFS++R E    ESHKPS+  +P    ++  A + + +I I RP  S+D D+ E  
Sbjct: 1577 SLMPTFSRERGES---ESHKPSVSGQPLSSTERNQAASSRHTISIPRPSLSMDTDQAE-S 1636

Query: 1629 RRSATVVRSPAETKREKLNKKLIIKRPKEVIDLDQMGFDASAGMQYRKTKRIVELSSFEN 1688
            RR   V+R P  T+RE+  KKL+IKR KE+ D D    + S   + RKTKR+ EL+ F+ 
Sbjct: 1637 RRPHLVIRPP--TEREQPQKKLVIKRSKEMNDHDMSSLEESPRFESRKTKRMAELAGFQR 1696

Query: 1689 HTRPGSMSSAESGKKKVREDQIWWEKQERQKNEERLREEKARRVYKEEMRMRDEQEKLAE 1748
              +     S  S +++ +ED++WWE++E      R RE +ARR Y ++M + +E  ++AE
Sbjct: 1697 --QQSFRLSENSLERRPKEDRVWWEEEEISTG--RHREVRARRDY-DDMSVSEEPNEIAE 1756

Query: 1749 IRRFEASIRSDKEEEERIKAKKKKKKR--IPEIMDDYMED--PRSR-QRIPERDRVVKRK 1808
            IRR+E  IRS++EEEER KAKKKKKK+   PEI++ Y+ED  PR   +R+ ER R V+ +
Sbjct: 1757 IRRYEEVIRSEREEEERQKAKKKKKKKKLQPEIVEGYLEDYPPRKNDRRLSERGRNVRSR 1816

Query: 1809 PI-ELGRHGAEHASSTKRRRGGEVGLSNILERIVEALKDNFEISYLFLKPVSKKEAPDYL 1868
             + +  R GAE+A   KRR+ GEVGL+NILERIV+ L+   E+S LFLKPVSKKEAPDYL
Sbjct: 1817 YVSDFERDGAEYAPQPKRRKKGEVGLANILERIVDTLRLKEEVSRLFLKPVSKKEAPDYL 1876

Query: 1869 DIIERPMDLSTIREKVRKLEYKTRDEFRTDVWQIMYNAHMYNDGRNPGIPPLADKLLMHC 1895
            DI+E PMDLSTIR+KVRK+EY+ R++FR DVWQI YNAH+YNDGRNPGIPPLAD+LL  C
Sbjct: 1877 DIVENPMDLSTIRDKVRKIEYRNREQFRHDVWQIKYNAHLYNDGRNPGIPPLADQLLEIC 1919
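
The percentages in each HSP summary line are the match count divided by the aligned length (gaps included), so they can be rechecked from the counts shown. The following is a minimal sketch, not part of the database's own tooling: the function names `parse_hsp_stats` and `recomputed_percent` are hypothetical, and the regular expression assumes the summary-line layout used on this page.

```python
import re

# Matches "<Label> = <matches>/<aligned length> (<percent>%)" fragments.
# Fragments without a fraction, such as "Query Frame = 0", are ignored.
HSP_STAT = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return {label: (matches, aligned_length, reported_percent)} for one summary line."""
    stats = {}
    for label, matches, length, percent in HSP_STAT.findall(line):
        stats[label] = (int(matches), int(length), float(percent))
    return stats


def recomputed_percent(matches, aligned_length):
    """Recompute the percentage from the raw counts, rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)


if __name__ == "__main__":
    line = "Identity = 1010/1942 (52.01%), Positives = 1325/1942 (68.23%), Query Frame = 0"
    for label, (matches, length, reported) in parse_hsp_stats(line).items():
        print(label, reported, recomputed_percent(matches, length))
```

The pattern keys on whatever word precedes the fraction, so it does not depend on the exact spelling of the labels in the report.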

BLAST of MC09g0858 vs. ExPASy Swiss-Prot
Match: Q67W65 (Transcription initiation factor TFIID subunit 1 OS=Oryza sativa subsp. japonica OX=39947 GN=TAF1 PE=2 SV=1)

HSP 1 Score: 1470.3 bits (3805), Expect = 0.0e+00
Identity = 938/1916 (48.96%), Positives = 1229/1916 (64.14%), Query Frame = 0

Query: 9    DDDDYEDAGGGNRFLGFMFGNVDNSGDLDADYLDEDAKEHLDALADKLGSTLTDIDLSTK 68
            DD+DY++ GGGN FLGFMFGNVD+SGDLDADYLDEDAKEHL ALADKLG +L DIDL   
Sbjct: 21   DDEDYDEPGGGNHFLGFMFGNVDDSGDLDADYLDEDAKEHLFALADKLGPSLKDIDLIKP 80

Query: 69   SAKTPSDAVEPDYDAKAEDAVDYEDIDEEYDGPEIEAAGEEDHLLPKKEYFSTEVSLATL 128
            SA  P+D  E DYDAKAEDAVDYEDIDEEYDGPE+EAA EEDHLL KK+YFS+    A++
Sbjct: 81   SA-APTDPSEQDYDAKAEDAVDYEDIDEEYDGPEVEAATEEDHLLSKKDYFSSNAVYASV 140

Query: 129  EPTVSVFDDEDYDEDFE--KVHDVINSSVEARTTHASDEKGECLEVAYEGEKSVADDDIQ 188
               VSVFD+E+YDED E    +D+ + ++    T AS E+   L++A   +    +    
Sbjct: 141  NSKVSVFDEENYDEDEEPPNDNDLPSDNIVQNCTSASAEQ---LDMAPSNDNLAVEK--M 200

Query: 189  SASLNNEVITSSAEELLEETPEVQKKLLDEKAHTPLPVLCMENGMAILQFSEIFGVHDSL 248
            S+SL+    +  +E   +E   V ++ L+ K  T LPVLC+E+G  IL+FSEIFG  + +
Sbjct: 201  SSSLSEPEESFESEAFQKEM--VAEEQLESKTATSLPVLCIEDGSVILKFSEIFGAQEPV 260

Query: 249  KK-KEKRESRYCTRRDKYRSVDVSDIVEEDEEAFLHGFSQGVSCVKPASVVKDDTTMFNL 308
            +K K  R  R   +  + +  + +DIVEEDEE FL    Q +S +K    +K +      
Sbjct: 261  RKAKMDRHKRPVNK--ELQITNFTDIVEEDEEVFLRSTIQNLSALKH---IKTNDNFVES 320

Query: 309  DDPEFTKFGVVQGVDVMAARVDWRQKDNCCGAEPMKQVFAENISIGSNSLLFKKFYPLDQ 368
            D  E T            + V  R KD+C   +PMK    ++I     S +F  FYPL+ 
Sbjct: 321  DSDEST------------SDVALRLKDSCLSEQPMKD---KDIPTAVQSPVFPDFYPLEH 380

Query: 369  QNWEEGILWDNSPVLS-KNSAGSCEVSGSDLEDSVSSDVEQQVS-IQIVRSEHHIDPNDR 428
            +NWE  I+W NSP  + +    SC +S   L+D      E  VS    V+++ H      
Sbjct: 381  ENWENDIVWGNSPTTAIQPCLTSCAISKESLDDHNEDQAEGYVSGCWDVQNKFH------ 440

Query: 429  RQRLSQHDLPLLEPFGSRKFSGPEEPFSPEMIYHPQMLRLESWKDVDGSCQSEGTRENFS 488
                      + +PFG  +        SPE  Y P  LR E+ ++ +    S     N +
Sbjct: 441  ------SSSVMADPFGHTEIPDSTSYRSPENSYSP--LRKETAQENN----SLDEPNNIT 500

Query: 489  EEHQSNAIRCFSKFSPKNRRMLEGSWLDKVLWESDEPIEKQKFIFDLEDEHMLFEISDEK 548
            +  + +  R  +K S  N+ +LEGSWLD ++W+  E + K K IFDL+D+HMLFEI DEK
Sbjct: 501  QPVKIDTTRHLNKLSLLNKELLEGSWLDNIVWDPSEDVPKPKLIFDLKDDHMLFEILDEK 560

Query: 549  ESKYIQFHAGAMILTR-----SSMSVNGNSFEISGSGGQGGWRF-VSNDKHYSNRKASQQ 608
               +++ HA AMI+TR     +  +V+ N+  I+ SG     RF +SNDK YSNRK SQQ
Sbjct: 561  NGDHLRSHARAMIVTRPMKTSAVENVDHNNQAIALSG-----RFNISNDKFYSNRKMSQQ 620

Query: 609  LKSNSKKRSVHGVKVFHSKPAMMLQTMKLKLSNKELANFHRPKASWYPHDNEMAVRELQK 668
             +S++KKR+  G+K+ HS PA  LQTMK KLS KE+ANFHRPKA WYPH+N++  R    
Sbjct: 621  ARSHAKKRATMGLKLVHSVPAQKLQTMKPKLSIKEIANFHRPKAKWYPHENKLTARFQGD 680

Query: 669  LPTQGPMKIILKSLGGKGSKLFVDSEETVSSLMAKASKKLDMKSSEIVKVFYSGKELERE 728
              + GPM  I+ +LGGKG K  V++EET  S+ +KASKKL+ K SE +K+F SGKEL+ +
Sbjct: 681  ECSHGPMTAIVMTLGGKGVKFLVNAEETPLSVKSKASKKLEFKPSEKIKLFCSGKELQDD 740

Query: 729  KSLAAQNVQPNSLLHLVRSKIFVMPWTQHLRGDSKSVRSPGAFKKKSDLSVKDGHVFLME 788
             SLA QNV+PNS+LH+VR++I + P  Q L G++K +R PGAF+KKSDLSVKDGHVFLME
Sbjct: 741  ISLAMQNVRPNSILHVVRTEIHLWPKAQRLPGENKPLRPPGAFRKKSDLSVKDGHVFLME 800

Query: 789  YCEERPLLLGNIGMGARLCTYYQKSSPDDQTGALLRNGGDSLGHVIVLEPSDKSPYIGDI 848
            YCEERPLLL N GM ARLCTYYQK+SP DQT   LR+  D LG ++ ++P+DKSP++G+I
Sbjct: 801  YCEERPLLLANAGMAARLCTYYQKTSPSDQTATSLRSNSDGLGTMLAIDPADKSPFLGNI 860

Query: 849  KGGSVQASLETNMYRSPIFPHKVPMTDYILVRSAKGKLSLRRIYRNFAVGQQEPLMEVFS 908
            + GS Q+ LETNMYR+P+FPHKV  TDY+LVRS KG LSLRRI + +AVGQQEP MEVFS
Sbjct: 861  RSGSHQSCLETNMYRAPVFPHKVATTDYLLVRSPKGMLSLRRIDKLYAVGQQEPHMEVFS 920

Query: 909  PGTKSLQMFMMNRLTLYIFREFLAAEKRRRLPYIRVDELPSQFPYLSETVIRKKLKEYAL 968
            PGTK++Q +++NR+ +Y++REF A EK   +P IR DELP Q P ++E ++RK+LK  A 
Sbjct: 921  PGTKNMQNYILNRILVYVYREFRAREKPGIIPQIRADELPIQ-PPITEAIVRKRLKHCAD 980

Query: 969  QQKSSNGQTILIKKRNASISLKKD---AVTPEDVCKYESMQAGLYRLKHLGITEV-HPSA 1028
             +K   G    I++ +  I  +++    +TPE+VC YESMQAG YRLKHLGI ++  P  
Sbjct: 981  LRKGPKGHLFYIQRPDFRIPSEEELRRLLTPENVCCYESMQAGQYRLKHLGIEKLTQPVG 1040

Query: 1029 ISSAMSRLPDEAITLAAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSG 1088
            ++SAM++LPDEAI LAAA+HIERELQIT WNL+SNFVACT Q KENIERLEITGVGDPSG
Sbjct: 1041 LASAMNQLPDEAIELAAAAHIERELQITSWNLTSNFVACTNQDKENIERLEITGVGDPSG 1100

Query: 1089 RGLGFSYVRSVPKAPISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSE 1148
            RGLGFSYVR  PKAP+SN++ KKK+A+++G++ VTGTDADLRRLSMDAA+E+LLKF V E
Sbjct: 1101 RGLGFSYVRVTPKAPVSNSTHKKKSAAAKGTT-VTGTDADLRRLSMDAARELLLKFGVPE 1160

Query: 1149 EQIAKLTRWHRIAMIRRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIW 1208
            EQI KLTRWHRIAM+R+LSSEQAASGV +D   +SK+ARGQRMSFLQLQ+QT+EKCQEIW
Sbjct: 1161 EQIDKLTRWHRIAMVRKLSSEQAASGVTMDEIPVSKFARGQRMSFLQLQQQTKEKCQEIW 1220

Query: 1209 ERQIQSLSTSDGAENESDSEGNSDLDSFAGDLENLLDAEEFEDE-VDTFEIRHEKTDGVK 1268
            +RQIQSLS  DG EN SD+E NSDLDSFAGDLENLLDAEEF+DE V   +IR +K DG++
Sbjct: 1221 DRQIQSLSAMDGNENGSDTEANSDLDSFAGDLENLLDAEEFDDEDVGNTDIRSDKMDGMR 1280

Query: 1269 GLKMRRRPSIAQTEEEIEDEVAEATELCRLLMDDEAERRRKKKKNKVMGEAILVPGLQAS 1328
            GLKMRR  + +Q  EEI+D+VAEA  + +LL + +++ +RKK+  +    +  +     +
Sbjct: 1281 GLKMRRCHTQSQINEEIQDDVAEAALVEKLLEESDSDMKRKKQPVETTNYSTPM----YN 1340

Query: 1329 FGHEIPQ-QTRHLVSIAQPDIAYTSKENI-RDQKEVESIINRKDKSGKFKPMKKNYSSEM 1388
             G+++ Q +   ++  +    A T KE+I R+ KEVE+       S K +      +++ 
Sbjct: 1341 QGNKMKQGKAGQMIKSSVYAGALTPKESIPREAKEVENFAEGSLPS-KLRTKTGFDANDD 1400

Query: 1389 SLLNKKLKISGDKVKNFKEKKSAR--ESFVCGKCGQFGHMRTNKNCPKYGEDMETPETTD 1448
             +L K+  I G     FKEK+     ++ VCG CGQ GHMRTNK CPKYGED   PET++
Sbjct: 1401 IILVKRKNIPGK--DGFKEKRQGARGDTLVCGACGQLGHMRTNKLCPKYGED---PETSE 1460

Query: 1449 QEKVSIKLNTVDPSSQSHQKAHTKKVTPKTITKISTTETFEGEKSTLMAKMLPVKFKCSS 1508
             +  SI+ +  D  S +  K   K++  K  ++   T   EG +S   AK +PVKFKC +
Sbjct: 1461 MDVNSIRSHPPDIVSNAQIKTSNKRLVAKVSSEAFET---EGPESIEKAKPVPVKFKCGA 1520

Query: 1509 SEK-LSDNLSPGVPQTSDLPFNSDNETGKSVVKVNKITFSKKRNEDIQFESHKPSIVIRP 1568
             EK L  N+S      SD     D    KS  KVNKI  S K    I+++ +       P
Sbjct: 1521 PEKSLDRNMSISASLVSDKRM-MDATDSKSTGKVNKIKISNK----IKYDDY-------P 1580

Query: 1569 PDAKKVYAEAHKPSIVIRPPTSIDRDRMEFPRRSATVVRSPAETKREKLNKKLIIKRPKE 1628
            PD         KPS+VIRPP  +++D    PR                  KK+IIK+PK 
Sbjct: 1581 PDTP-------KPSVVIRPPAEVEKD---LPR------------------KKIIIKQPK- 1640

Query: 1629 VIDLDQMGFDASAGMQYRKTKRIVELSSFENHTRPGSMSSAESGKKKVREDQIWWEKQER 1688
            V+   Q   +  +G + RKT++IVELSSFE               K+ RED   +  Q  
Sbjct: 1641 VLGDQQRPTELRSGQEPRKTRKIVELSSFE---------------KRDREDDNGFSGQPI 1700

Query: 1689 QKNEERLR-------EEKARRVYKEEMRMRDEQEKLAEIRRFEASIRSDKEEEERIKAKK 1748
            Q N    R         K      E  R  +EQ +  E R  EA I   + E+E  KAKK
Sbjct: 1701 QINSSHDRGWGLVGKRSKGIMESSESWRAFEEQRERQEQRLIEARIYDARREDELQKAKK 1760

Query: 1749 K-KKKRIPEIMDDYMEDPR---SRQRIPERDRVVKRK-PIELGRHGAEHASSTKRRRGGE 1808
            K KKK+  E  DD + DPR   + +R+PER R  KR+ P ++     E+    KR RGGE
Sbjct: 1761 KNKKKKKHEFRDDDLLDPRPYKNDRRVPERGRAAKRRTPADM----TEYTPPAKRHRGGE 1809

Query: 1809 VGLSNILERIVEALKDNFEISYLFLKPVSKKEAPDYLDIIERPMDLSTIREKVRKLEYKT 1868
            V LSNILE+IV+ L+     S+LF KPV+KKEAPDY DIIERPMDL TIR+KVRK+EYK 
Sbjct: 1821 VELSNILEKIVDHLR-TMSCSFLFRKPVTKKEAPDYFDIIERPMDLGTIRDKVRKMEYKN 1809

Query: 1869 RDEFRTDVWQIMYNAHMYNDGRNPGIPPLADKLLMHCDNLLNENDDELTEAEIGIE 1892
            R++FR DV QI  NAH YN  R+P IPPLAD+LL  CD LL E+ D L +AE  IE
Sbjct: 1881 REDFRHDVAQIALNAHTYNLNRHPHIPPLADELLELCDYLLEESADVLDDAEYAIE 1809

BLAST of MC09g0858 vs. ExPASy Swiss-Prot
Match: Q6PUA2 (Transcription initiation factor TFIID subunit 1b OS=Arabidopsis thaliana OX=3702 GN=TAF1B PE=2 SV=1)

HSP 1 Score: 1363.2 bits (3527), Expect = 0.0e+00
Identity = 857/1834 (46.73%), Positives = 1161/1834 (63.30%), Query Frame = 0

Query: 89   VDYEDIDEEYDGPEIEAAGEEDHLLPKKEYFSTEVSLATLEPTVSVFDDEDYDED--FEK 148
            VDY   DEEYDGPE++   EEDHLLPK+EY S   +L+ L    SVFDDEDYDE    EK
Sbjct: 5    VDYGSNDEEYDGPELQVVTEEDHLLPKREYLSAAFALSGLNSRASVFDDEDYDEQGGQEK 64

Query: 149  VHDVINSSVEARTTHASDEKGECLEVAYEGEKSVADDDIQSASLNNEVITSSAEELLEET 208
             H  +  S ++        K E   V +E E S+  +        N++ T   +E  E  
Sbjct: 65   EHVPVEKSFDSEEREPVVLKEE-KPVKHEKEASILGN-------KNQMDTGDVQE--ELV 124

Query: 209  PEVQKKLLDEKAHTPLPVLCME-NGMAILQFSEIFGVHDSLKKKEKRESRYCTRRDKYRS 268
              + +  LDEK  TPLP L +E +GM ILQFSEIF + +  KK++KRE R  T RDKY S
Sbjct: 125  VGLSEATLDEKRVTPLPTLYLEDDGMVILQFSEIFAIQEPQKKRQKREIRCITYRDKYIS 184

Query: 269  VDVSDIVEEDEEAFLHGFSQGVSCVKPASVVKDDTTMFNLDDPEFTKFGVVQGVDVMAAR 328
            +D+S+++E+DEE  L    +  +  K    ++ D  +   +  +  K G+V+     +  
Sbjct: 185  MDISELIEDDEEVLLKSHGRIDTHGKKTDQIQLDVPLPIRERSQLVKSGIVRDTTSESRE 244

Query: 329  VDWRQKDNCCGAEPMKQVFAENISIGSNSLLFKKFYPLDQQNWEEGILWDNSPVLSKNSA 388
                 +D+C   E +KQ   ++ S    S L  + +PLDQQ WE  ILW+ SP  S N  
Sbjct: 245  FTKLGRDSCIMGELLKQDLKDDNSSLCQSQLTMEVFPLDQQEWEHLILWEISPQFSANCC 304

Query: 389  ----GSCEVSGSDLEDSVSSDVEQQVSIQIVRSEHHIDPNDRRQRLSQHDLPLLEPFGSR 448
                   E +G  ++   S+ V +Q S+ ++ S       D    L    +  LE FGSR
Sbjct: 305  EGFKSGLESAGIMVQVRASNSVTEQESLNVMNSGGQTQ-GDNNNMLEPFFVNPLESFGSR 364

Query: 449  KFSGPEEPFSPEMIYHPQMLRLESWKDVDGSCQS-EGTRENFSEEHQSNAIRCFSKFSPK 508
                  E  +    +HPQ+LRLES  D D   ++ +  REN  ++  S+A    S  + +
Sbjct: 365  GSQSTNESTNKSR-HHPQLLRLESQWDEDHYRENGDAGRENL-KQLNSDARGRLSGLALQ 424

Query: 509  NRRMLEGSWLDKVLWESDEPIEKQKFIFDLEDEHMLFEISDEKESKYIQFHAGAMILTRS 568
            +R M + SWLD ++WESD+ + + K IFDL+DE M+FE+ + KE KY+Q HAG+ I++RS
Sbjct: 425  DRDMWDESWLDSIIWESDKDLSRSKLIFDLQDEQMIFEVPNNKERKYLQLHAGSRIVSRS 484

Query: 569  SMSVNGNSFEISGSGGQGGWRF-VSNDKHYSNRKASQQLKSNSKKRSVHGVKVFHSKPAM 628
            S S +G SF+  G G   GW+F +SNDK Y N K++Q+L+ N+KK +VH ++VFHS PA+
Sbjct: 485  SKSKDG-SFQ-EGCGSNSGWQFNISNDKFYMNGKSAQKLQGNAKKSTVHSLRVFHSAPAI 544

Query: 629  MLQTMKLKLSNKELANFHRPKASWYPHDNEMAVRELQKLPTQGPMKIILKSLGGKGSKLF 688
             LQTMK+KLSNKE ANFHRPKA WYPHDNE+A+++ + LPTQG M I++KSLGGKGS L 
Sbjct: 545  KLQTMKIKLSNKERANFHRPKALWYPHDNELAIKQQKILPTQGSMTIVVKSLGGKGSLLT 604

Query: 689  VDSEETVSSLMAKASKKLDMKSSEIVKVFYSGKELEREKSLAAQNVQPNSLLHLVRSKIF 748
            V  EE+VSSL AKAS+KLD K +E VK+FY GKELE EKSLA QNVQPNSL+HL+R+K+ 
Sbjct: 605  VGREESVSSLKAKASRKLDFKETEAVKMFYMGKELEDEKSLAEQNVQPNSLVHLLRTKVH 664

Query: 749  VMPWTQHLRGDSKSVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGNIGMGARLCTYY 808
            + PW Q L G++KS+R PGAFKKKSDLS +DGHVFLMEYCEERPL+L N GMGA LCTYY
Sbjct: 665  LWPWAQKLPGENKSLRPPGAFKKKSDLSNQDGHVFLMEYCEERPLMLSNAGMGANLCTYY 724

Query: 809  QKSSPDDQTGALLRNGGDSLGHVIVLEPSDKSPYIGDIKGGSVQASLETNMYRSPIFPHK 868
            QKSSP+DQ G LLRN  D+LG VI+LE  +KSP++G++ GG  Q+S+ETNMY++P+FPH+
Sbjct: 725  QKSSPEDQHGNLLRNQSDTLGSVIILEHGNKSPFLGEVHGGCSQSSVETNMYKAPVFPHR 784

Query: 869  VPMTDYILVRSAKGKLSLRRIYRNFAVGQQEPLMEVFSPGTKSLQMFMMNRLTLYIFREF 928
            +  TDY+LVRSAKGKLSLRRI +  AVGQQEP ME+ SP +K+L  +++NR+  Y++REF
Sbjct: 785  LQSTDYLLVRSAKGKLSLRRINKIVAVGQQEPRMEIMSPASKNLHAYLVNRMMAYVYREF 844

Query: 929  LAAEKRRRLPYIRVDELPSQFPYLSETVIRKKLKEYALQQKSSNGQTILIKKRN-ASISL 988
               ++      I  DEL   F  +S+  +RK ++  +  ++ +NG+    KKR    I L
Sbjct: 845  KHRDR------IAADELSFSFSNISDATVRKYMQVCSDLERDANGKACWSKKRKFDKIPL 904

Query: 989  KKDA-VTPEDVCKYESMQAGLYRLKHLGITE-VHPSAISSAMSRLPDEAITLAAASHIER 1048
              +  V PEDVC YESM AGL+RLKHLGIT    P++IS+A+++LPDE I  AAASHI R
Sbjct: 905  GLNTLVAPEDVCSYESMLAGLFRLKHLGITRFTLPASISTALAQLPDERI--AAASHIAR 964

Query: 1049 ELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSYVRSVPKAPISNASLKK 1108
            ELQITPWNLSS+FV C TQG+ENIERLEITGVGDPSGRGLGFSYVR  PK+  ++   KK
Sbjct: 965  ELQITPWNLSSSFVTCATQGRENIERLEITGVGDPSGRGLGFSYVRVAPKSSAASEHKKK 1024

Query: 1109 KAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMIRRLSSEQA 1168
            KAA+ RG   VTGTDAD RRLSM+AA+EVLLKF+V +E IAK T+ HR AMIR++SSEQA
Sbjct: 1025 KAAACRGVPTVTGTDADPRRLSMEAAREVLLKFNVPDEIIAKQTQRHRTAMIRKISSEQA 1084

Query: 1169 ASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLSTSDGAENESDSEGNS 1228
            ASG +V PTT+  ++R QRMSFLQLQ+Q RE C EIW+RQ  SLS  D   NES++E NS
Sbjct: 1085 ASGGKVGPTTVGMFSRSQRMSFLQLQQQAREMCHEIWDRQRLSLSACDDDGNESENEANS 1144

Query: 1229 DLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRPSIAQTEEEIEDEVAEA 1288
            DLDSF GDLE+LLDAE+  +  ++ +  +EK DGVKGLKMRR PS  + +EEIEDE AE 
Sbjct: 1145 DLDSFVGDLEDLLDAEDGGEGEESNKSMNEKLDGVKGLKMRRWPSQVEKDEEIEDEAAEY 1204

Query: 1289 TELCRLLMDDEAERRRKKKKNKVMGEAI-LVPGLQASFGHEIPQQTRHLVSIAQPDIAYT 1348
             ELCRLLM DE +  +KKKK K +GE I   P  +++F   I +  +++ +         
Sbjct: 1205 VELCRLLMQDEND--KKKKKLKDVGEGIGSFPPPRSNFEPFIDK--KYIATEPDASFLIV 1264

Query: 1349 SKENIRDQKEVESIINRKDKSGKFKPMKKNYSSEMSLLNKKLKISGDKVKNFKEKKSARE 1408
            ++  ++  K V+   ++  K  + K +         +L +  K+       F  KK+AR 
Sbjct: 1265 NESTVKHTKNVDKATSKSPKDKQVKEIGTPICQMKKILKENQKV-------FMGKKTARA 1324

Query: 1409 SFVCGKCGQFGHMRTNKNCPKYGEDMET-PETTDQEKVSIKLNTVDPSSQSH-QKAHTKK 1468
            +FVCG CGQ GHM+TNK+CPKY  + E+ PE+ D +K + K ++ D S +        KK
Sbjct: 1325 NFVCGACGQHGHMKTNKHCPKYRRNTESQPESMDMKKSTGKPSSSDLSGEVWLTPIDNKK 1384

Query: 1469 VTPKTITKISTTETFE---------GEKSTLMAKMLPVKFKCSSSE-KLSDNLSPGVPQT 1528
              PK+ TKIS  E  +         G         +    K +S + K+S    P   + 
Sbjct: 1385 PAPKSATKISVNEATKVGDSTSKTPGSSDVAAVSEIDSGTKLTSRKLKISSKAKPKASKV 1444

Query: 1529 -SDLPFNSDNETGKSVVKVNKITFSKKRNEDIQFESHKPSI--VIRPPDAKKVYAEAHKP 1588
             SD PF+S               +S++R E    E H PS+   + P       A +   
Sbjct: 1445 ESDSPFHS-----------LMPAYSRERGES---ELHNPSVSGQLLPSTETDQAASSRYT 1504

Query: 1589 SIVIRPPTSIDRDRMEFPRRSATVVRSPAETKREKLNKKLIIKRPKEVIDLDQMGFDASA 1648
            + V +P  SID+D+ E   R   V+  P  T +E   KKL+IKR KE+ D D    + + 
Sbjct: 1505 TSVPQPSLSIDKDQAE-SCRPHRVIWPP--TGKEHSQKKLVIKRLKEITDHDSGSLEETP 1564

Query: 1649 GMQYRKTKRIVELSSFENHTRPG-SMSSAESGKKKVREDQIWWEKQERQKNEERLREEKA 1708
              + RKTKR+ EL+ F+   R   S +  + G K   +D+ W  ++E+  + E  RE K 
Sbjct: 1565 QFESRKTKRMAELADFQRQQRLRLSENFLDWGPK---DDRKW--RKEQDISTELHREGKV 1624

Query: 1709 RRVYKEEMRMRDEQEKLAEIRRFEASIRSDKEEEERIKAKKKKKKRIPEIMDDYMEDPRS 1768
            RR Y ++  + +E+ ++AE RR+   IRS++EEE+R KAK+KKK +   I+++Y   PR 
Sbjct: 1625 RRAY-DDSTVSEERSEIAESRRYREVIRSEREEEKRRKAKQKKKLQ-RGILENY--PPRR 1684

Query: 1769 RQRIPER--DRVVKRKPIELGRHGAEHASSTKRRRGGEVGLSNILERIVEALK-DNFEIS 1828
               I       +      +  R+  E+A   KRR+ G+VGL+NILE IV+ L+     +S
Sbjct: 1685 NDGISSESGQNINSLCVSDFERNRTEYAPQPKRRKKGQVGLANILESIVDTLRVKEVNVS 1744

Query: 1829 YLFLKPVSKKEAPDYLDIIERPMDLSTIREKVRKLEYKTRDEFRTDVWQIMYNAHMYNDG 1888
            YLFLKPV+KKEAP+YL+I++ PMDLSTIR+KVR++EY+ R +FR DVWQI +NAH+YNDG
Sbjct: 1745 YLFLKPVTKKEAPNYLEIVKCPMDLSTIRDKVRRMEYRDRQQFRHDVWQIKFNAHLYNDG 1778

Query: 1889 RNPGIPPLADKLLMHCDNLLNENDDELTEAEIGI 1891
            RN  IPPLAD+LL+ CD LL+E  DEL EAE GI
Sbjct: 1805 RNLSIPPLADELLVKCDRLLDEYRDELKEAEKGI 1778

BLAST of MC09g0858 vs. ExPASy Swiss-Prot
Match: P21675 (Transcription initiation factor TFIID subunit 1 OS=Homo sapiens OX=9606 GN=TAF1 PE=1 SV=2)

HSP 1 Score: 253.4 bits (646), Expect = 2.0e-65
Identity = 358/1504 (23.80%), Positives = 620/1504 (41.22%), Query Frame = 0

Query: 6    LVADDDDYEDAGGGNRF--LGFMFGNVDNSGDLDAD-YLDEDAKEHLDAL-ADKLGSTLT 65
            +++D D  ED+ GG  F   GF+FGN++ +G L+ +  LD++ K+HL  L A  LGS +T
Sbjct: 20   IMSDTDSDEDSAGGGPFSLAGFLFGNINGAGQLEGESVLDDECKKHLAGLGALGLGSLIT 79

Query: 66   DIDLSTKSAKTPSDAVEPD-YDAKAEDAVDYEDIDEEYDGPEIEAAGEEDHLLPKKEYFS 125
            ++  + +   T    V  + +    EDAVDY DI+E           E++     + Y  
Sbjct: 80   ELTANEELTGTDGALVNDEGWVRSTEDAVDYSDINE---------VAEDE----SRRYQQ 139

Query: 126  TEVSLATLEPTVSVFDDEDYDEDFEKVHDVINSSVEARTTHASDEKGECLEVAYEGEKSV 185
            T  SL  L    S +D++DYD D E   D+    +          K +  + +  GEK  
Sbjct: 140  TMGSLQPL--CHSDYDEDDYDADCE---DIDCKLMPPPPPPPGPMKKDKDQDSITGEKV- 199

Query: 186  ADDDIQSASLNNEVITSSAEELLEETPEVQKKLLDEKAHTPLPVLCMENGM-------AI 245
              D   S+   +E+    A +   E  ++   L     H    +L     +        +
Sbjct: 200  --DFSSSSDSESEMGPQEATQAESEDGKLTLPLAGIMQHDATKLLPSVTELFPEFRPGKV 259

Query: 246  LQFSEIFG-------VHDSLKKKEKRESRYCTRRDKYRSVDVSDIVEEDEEAFLHGFSQG 305
            L+F  +FG       V  S ++K K++ R   + ++ + V+ S +  E  +  L  +   
Sbjct: 260  LRFLRLFGPGKNVPSVWRSARRKRKKKHRELIQEEQIQEVECS-VESEVSQKSLWNYDYA 319

Query: 306  VSCVKPASVVKDDTTMFNLDDPEFTK-FGVVQGVDVMAARV-DWRQKDNCCGAEPMKQVF 365
                    +  D+ TM    + +F++  G +  V     RV +WR            +++
Sbjct: 320  PPPPPEQCLSDDEITMMAPVESKFSQSTGDIDKVTDTKPRVAEWRYGP--------ARLW 379

Query: 366  AENISIGSNSLLFKKFYPLDQQNWEEGILWDNSPVLSKNSAGSCEVSGSDLEDSVSSDVE 425
             + + +  +   F   + L +         ++ PV+              LE++  +D+ 
Sbjct: 380  YDMLGVPEDGSGFDYGFKLRKT--------EHEPVIKSRMIEEFR----KLEENNGTDLL 439

Query: 426  QQVSIQIVRSEHHIDPNDRRQRLSQHDLPLLEPFGSRKFSGPEEPFSPEMIYHPQMLRLE 485
               +  +V   H  D              ++      K  G +          PQ   L 
Sbjct: 440  ADENFLMVTQLHWED-------------DIIWDGEDVKHKGTK----------PQRASLA 499

Query: 486  SW--KDVDGSCQSEGTRENFSEEHQSNAIRCFSKFSPKNRRMLEGSWLDKVLWESD---E 545
             W    +  +  +   ++ F+     +    +S F   N  ++ G W D ++W++     
Sbjct: 500  GWLPSSMTRNAMAYNVQQGFAATLDDDK-PWYSIFPIDNEDLVYGRWEDNIIWDAQAMPR 559

Query: 546  PIEKQKFIFDLEDEHMLFEISDEKE-----------SKYIQFHAGAMILTRSSMSVNGNS 605
             +E      D  DE+++ EI DEKE            K        ++L ++ +      
Sbjct: 560  LLEPPVLTLDPNDENLILEIPDEKEEATSNSPSKESKKESSLKKSRILLGKTGVIKEEPQ 619

Query: 606  FEISGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVHGVKVFHSKPAMMLQT--MKL 665
              +S    +  W  +SND++Y         K    + +  G  + HS PA+ L+      
Sbjct: 620  QNMSQPEVKDPWN-LSNDEYY-------YPKQQGLRGTFGGNIIQHSIPAVELRQPFFPT 679

Query: 666  KLSNKELANFHRPKASWYPHDNEMAVRELQKLPTQGPMKIILKSLGGKGSKLFVDSEETV 725
             +   +L  FHRP    Y             L   GP                     +V
Sbjct: 680  HMGPIKLRQFHRPPLKKY---------SFGALSQPGP--------------------HSV 739

Query: 726  SSLMAKASKKLDMKSSEIVKVFYSGKELEREKSLAAQNVQPNSLLHLVRSKIFVMPWTQH 785
              L+    KK  M            +E ER+ S                           
Sbjct: 740  QPLLKHIKKKAKM------------REQERQASGG------------------------- 799

Query: 786  LRGDSKSVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGNIGMGARLCTYYQKSSPDD 845
              G+   +R+P       DL+ KDG + L EY EE   L+  +GM  ++  YY++    D
Sbjct: 800  --GEMFFMRTP------QDLTGKDGDLILAEYSEENGPLMMQVGMATKIKNYYKRKPGKD 859

Query: 846  QTGALLRNGGDSLGHVIVLEPSDKSPYIGDIKGGSVQASLETNMYRSPIFPHKVPMTDYI 905
                  + G     H         SP++G +  G +  + E N++R+PI+ HK+P TD++
Sbjct: 860  PGAPDCKYGETVYCHT--------SPFLGSLHPGQLLQAFENNLFRAPIYLHKMPETDFL 919

Query: 906  LVRSAKGKLSLRRIYRNFAVGQQEPLMEVFSPGTKSLQMFMMNRLTLYIFREFLAAEKRR 965
            ++R+ +G   +R +   F VGQQ PL EV  P +K     + + L ++I+R F  ++ R 
Sbjct: 920  IIRTRQG-YYIRELVDIFVVGQQCPLFEVPGPNSKRANTHIRDFLQVFIYRLFWKSKDRP 979

Query: 966  RLPYIRVDELPSQFPYLSETVIRKKLKEYALQQKSSNGQTILIKKRNASISLKKD---AV 1025
            R   IR++++   FP  SE+ IRK+LK  A  +++       + K +  +  +++    V
Sbjct: 980  R--RIRMEDIKKAFPSHSESSIRKRLKLCADFKRTGMDSNWWVLKSDFRLPTEEEIRAMV 1039

Query: 1026 TPEDVCKYESMQAGLYRLKHLGITEVHPSAISSAMSRLPDEAITLAAASHIERELQITPW 1085
            +PE  C Y SM A   RLK  G  E    A        P+E         I+ E++  PW
Sbjct: 1040 SPEQCCAYYSMIAAEQRLKDAGYGEKSFFA--------PEEENEEDFQMKIDDEVRTAPW 1099

Query: 1086 NLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSYVRSVPKAPISNASLKKKAASSRG 1145
            N +  F+A   +GK     LE+TGV DP+G G GFSYV+ +P  P      K+     + 
Sbjct: 1100 NTTRAFIA-AMKGK---CLLEVTGVADPTGCGEGFSYVK-IPNKPTQQKDDKEPQPVKK- 1159

Query: 1146 SSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMIRRLSSEQAASGVQVD 1205
               VTGTDADLRRLS+  AK++L KF V EE+I KL+RW  I ++R +S+EQA SG    
Sbjct: 1160 --TVTGTDADLRRLSLKNAKQLLRKFGVPEEEIKKLSRWEVIDVVRTMSTEQARSG---- 1219

Query: 1206 PTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLSTSD--GAENESDSEGNSDLDSF 1265
               +SK+ARG R S  + Q + +E+CQ I++ Q + LS+++    + +S S  +SD +  
Sbjct: 1220 EGPMSKFARGSRFSVAEHQERYKEECQRIFDLQNKVLSSTEVLSTDTDSSSAEDSDFEEM 1279

Query: 1266 AGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRPSIAQTEEEIEDEVAEATELCR 1325
              ++EN+L  ++   ++       E+ +  +   +    S A      +D+ A  T L  
Sbjct: 1280 GKNIENMLQNKKTSSQLSREREEQERKELQR--MLLAAGSAASGNNHRDDDTASVTSL-- 1323

Query: 1326 LLMDDEAERRRKKKKNKVMGEAILVPGLQASFGHEIPQQTRHLVSIAQPDI--AYTSKEN 1385
               +  A  R  K              +  +F  E  ++     ++ +P +  AY     
Sbjct: 1340 ---NSSATGRCLK--------------IYRTFRDEEGKEYVRCETVRKPAVIDAYVRIRT 1323

Query: 1386 IRDQKEVESII----NRKDKSGKFKPMKKNYSSEMSLLNKKLKISGDKVKNFKEKKSARE 1445
             +D++ +          +++  K +   +     +    +K K+ G   K  K+ K   +
Sbjct: 1400 TKDEEFIRKFALFDEQHREEMRKERRRIQEQLRRLKRNQEKEKLKGPPEKKPKKMKERPD 1323

Query: 1446 -SFVCGKCGQFGHMRTNKNCPKYGEDMETPET----TDQEKVSIKLNTVDPSSQSHQKAH 1455
                CG CG  GHMRTNK CP Y +    P      T++++  ++   +   ++   K  
Sbjct: 1460 LKLKCGACGAIGHMRTNKFCPLYYQTNAPPSNPVAMTEEQEEELEKTVIHNDNEELIKVE 1323


HSP 2 Score: 71.6 bits (174), Expect = 1.1e-10
Identity = 66/207 (31.88%), Positives = 101/207 (48.79%), Query Frame = 0

Query: 1689 MRDEQEKLAEIRRFEASIRSDKEEEERIKAKK-KKKKRIPEIMDDYMEDPR----SRQRI 1748
            M +EQE+  E    +  I +D EE  +++  K    K++ E  D+           +Q++
Sbjct: 1295 MTEEQEEELE----KTVIHNDNEELIKVEGTKIVLGKQLIESADEVRRKSLVLKFPKQQL 1354

Query: 1749 PERDRVVKRKPIELGRHGAEHASSTKRRRGGEVGLSNILERIVEALKDNFEISYLFLKPV 1808
            P + +      +        H S  +RR    V LS+ILE I+  ++D    +Y F  PV
Sbjct: 1355 PPKKKRRVGTTVHCDYLNRPHKSIHRRRTDPMVTLSSILESIINDMRD-LPNTYPFHTPV 1414

Query: 1809 SKKEAPDYLDIIERPMDLSTIREKVRKLEYKTRDEFRTDVWQIMYNAHMYNDGRNPGIPP 1868
            + K   DY  II RPMDL T+RE VRK  Y +R+EFR  +  I+ N+  YN G    +  
Sbjct: 1415 NAKVVKDYYKIITRPMDLQTLRENVRKRLYPSREEFREHLELIVKNSATYN-GPKHSLTQ 1474

Query: 1869 LADKLLMHCDNLLNENDDELTEAEIGI 1891
            ++  +L  CD  L E +D+L   E  I
Sbjct: 1475 ISQSMLDLCDEKLKEKEDKLARLEKAI 1495

BLAST of MC09g0858 vs. ExPASy Swiss-Prot
Match: Q8IZX4 (Transcription initiation factor TFIID subunit 1-like OS=Homo sapiens OX=9606 GN=TAF1L PE=1 SV=1)

HSP 1 Score: 253.1 bits (645), Expect = 2.6e-65
Identity = 363/1525 (23.80%), Positives = 626/1525 (41.05%), Query Frame = 0

Query: 6    LVADDDDYEDAGGGNRF--LGFMFGNVDNSGDLDAD-YLDEDAKEHLDAL-ADKLGSTLT 65
            +++D D  ED+ GG  F   G +FGN+  +G L+ +  LD++ K+HL  L A  LGS +T
Sbjct: 19   IMSDSDSEEDSSGGGPFTLAGILFGNISGAGQLEGESVLDDECKKHLAGLGALGLGSLIT 78

Query: 66   DIDLSTKSAKTPSDAVEPD-YDAKAEDAVDYEDIDEEYDGPEIEAAGEEDHLLPKKEYFS 125
            ++  + +   T    V  + +    EDAVDY DI+E                + + E   
Sbjct: 79   ELTANEELTGTGGALVNDEGWIRSTEDAVDYSDINE----------------VAEDESQR 138

Query: 126  TEVSLATLEPTV-SVFDDEDYDEDFEKVHDVINSSVEARTTHASDEKGE--CLEVAYEGE 185
             + ++ +L+P   S +D++DYD D E +   +             +K +     V+  GE
Sbjct: 139  HQQTMGSLQPLYHSDYDEDDYDADCEDIDCKLMPPPPPPPGPMKKDKDQDAITCVSESGE 198

Query: 186  KSVADDDIQSASLNNEVIT----SSAEELLEETPEVQKKLLDEKAHTPLPVLCMENGM-- 245
              +    I  + L +E +     S +E  +      Q +  D K   PL  +   +    
Sbjct: 199  DIILPSIIAPSFLASEKVDFSSYSDSESEMGPQEATQAESEDGKLTLPLAGIMQHDATKL 258

Query: 246  --------------AILQFSEIFG-------VHDSLKKKEKRESRYCTRRDKYRSVDVSD 305
                           +L+F  +FG       V  S ++K K+  R   + ++ + V+ S 
Sbjct: 259  LPSVTELFPEFRPGKVLRFLHLFGPGKNVPSVWRSARRKRKKH-RELIQEEQIQEVECS- 318

Query: 306  IVEEDEEAFLHGFSQGVSCVKPASVVKDDTTMFNLDDPEFTK-FGVVQGVDVMAARV-DW 365
            +  E  +  L  +           +  D+ TM    + +F++  G V  V     RV +W
Sbjct: 319  VESEVSQKSLWNYDYAPPPPPEQCLADDEITMMVPVESKFSQSTGDVDKVTDTKPRVAEW 378

Query: 366  RQKDNCCGAEPMKQVFAENISIGSNSLLFKKFYPLDQQNWEEGILWDNSPVLSKNSAGSC 425
            R            +++ + + +  +   F   + L +   E        PV+        
Sbjct: 379  RYGP--------ARLWYDMLGVSEDGSGFDYGFKLRKTQHE--------PVIKSRMMEEF 438

Query: 426  EVSGSDLEDSVSSDVEQQVSIQIVRSEHHIDPNDRRQRLSQHDLPLLEPFGSRKFSGPEE 485
                  LE+S  +D+    +  +V   H  D              ++      K  G + 
Sbjct: 439  R----KLEESNGTDLLADENFLMVTQLHWED-------------SIIWDGEDIKHKGTK- 498

Query: 486  PFSPEMIYHPQMLRLESW------KDVDGSCQSEGTRENFSEEHQSNAIRCFSKFSPKNR 545
                     PQ   L  W      ++V      +G      ++        +S F   N 
Sbjct: 499  ---------PQGASLAGWLPSIKTRNVMAYNVQQGFAPTLDDDKP-----WYSIFPIDNE 558

Query: 546  RMLEGSWLDKVLWESD---EPIEKQKFIFDLEDEHMLFEISDEKE-----------SKYI 605
             ++ G W D ++W++      +E      D  DE+++ EI DEKE            K  
Sbjct: 559  DLVYGRWEDNIIWDAQAMPRLLEPPVLALDPNDENLILEIPDEKEEATSNSPSKESKKES 618

Query: 606  QFHAGAMILTRSSMSVNGNSFEISGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH 665
                  ++L ++ +        +S    +  W  +SND++Y         K    + +  
Sbjct: 619  SLKKSRILLGKTGVIREEPQQNMSQPEVKDPWN-LSNDEYYFP-------KQQGLRGTFG 678

Query: 666  GVKVFHSKPAMML--QTMKLKLSNKELANFHRPKASWYPHDNEMAVRELQKLPTQGPMKI 725
            G  + HS PAM L        +   ++  FHRP    Y             L   GP   
Sbjct: 679  GNIIQHSIPAMELWQPFFPTHMGPIKIRQFHRPPLKKY---------SFGALSQPGP--- 738

Query: 726  ILKSLGGKGSKLFVDSEETVSSLMAKASKKLDMKSSEIVKVFYSGKELEREKSLAAQNVQ 785
                              +V  L+    KK  M            +E ER+ S   +   
Sbjct: 739  -----------------HSVQPLLKHIKKKAKM------------REQERQASGGGE--- 798

Query: 786  PNSLLHLVRSKIFVMPWTQHLRGDSKSVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLL 845
                       +F M             R+P       DL+ KDG + L EY EE   L+
Sbjct: 799  -----------LFFM-------------RTP------QDLTGKDGDLILAEYSEENGPLM 858

Query: 846  GNIGMGARLCTYYQKSSPDDQTGALLRNGGDSLGHVIVLEPSDKSPYIGDIKGGSVQASL 905
              +GM  ++  YY++    D      + G     H         SP++G +  G +  +L
Sbjct: 859  MQVGMATKIKNYYKRKPGKDPGAPDCKYGETVYCHT--------SPFLGSLHPGQLLQAL 918

Query: 906  ETNMYRSPIFPHKVPMTDYILVRSAKGKLSLRRIYRNFAVGQQEPLMEVFSPGTKSLQMF 965
            E N++R+P++ HK+P TD++++R+ +G   +R +   F VGQQ PL EV  P ++   M 
Sbjct: 919  ENNLFRAPVYLHKMPETDFLIIRTRQG-YYIRELVDIFVVGQQCPLFEVPGPNSRRANMH 978

Query: 966  MMNRLTLYIFREFLAAEKRRRLPYIRVDELPSQFPYLSETVIRKKLKEYALQQKSSNGQT 1025
            + + L ++I+R F  ++ R R   IR++++   FP  SE+ IRK+LK  A  +++     
Sbjct: 979  IRDFLQVFIYRLFWKSKDRPR--RIRMEDIKKAFPSHSESSIRKRLKLCADFKRTGMDSN 1038

Query: 1026 ILIKKRNASISLKKD---AVTPEDVCKYESMQAGLYRLKHLGITEVHPSAISSAMSRLPD 1085
              + K +  +  +++    V+PE  C Y SM A   RLK  G  E    A        P+
Sbjct: 1039 WWVLKSDFRLPTEEEIRAKVSPEQCCAYYSMIAAKQRLKDAGYGEKSFFA--------PE 1098

Query: 1086 EAITLAAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSYVRS 1145
            E         I+ E+   PWN +  F+A   +GK     LE+TGV DP+G G GFSYV+ 
Sbjct: 1099 EENEEDFQMKIDDEVHAAPWNTTRAFIA-AMKGK---CLLEVTGVADPTGCGEGFSYVK- 1158

Query: 1146 VPKAPISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWH 1205
            +P  P      K+  A  +    VTGTDADLRRLS+  AK++L KF V EE+I KL+RW 
Sbjct: 1159 IPNKPTQQKDDKEPQAVKK---TVTGTDADLRRLSLKNAKQLLRKFGVPEEEIKKLSRWE 1218

Query: 1206 RIAMIRRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLSTS 1265
             I ++R +S+EQA SG       +SK+ARG R S  + Q + +E+CQ I++ Q + LS++
Sbjct: 1219 VIDVVRTMSTEQAHSG----EGPMSKFARGSRFSVAEHQERYKEECQRIFDLQNKVLSST 1278

Query: 1266 D--GAENESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRPS 1325
            +    + +S S  +SD +    ++EN+L  ++   ++       E+ +  + L +    S
Sbjct: 1279 EVLSTDTDSISAEDSDFEEMGKNIENMLQNKKTSSQLSREWEEQERKELRRMLLV--AGS 1338

Query: 1326 IAQTEEEIEDEVAEATELCRLLMDDEAERRRKKKKNKVMGEAILVPGLQASFGHEIPQQT 1385
             A      +D  A  T L                K+   G  + +     +F  E  ++ 
Sbjct: 1339 AASGNNHRDDVTASMTSL----------------KSSATGHCLKI---YRTFRDEEGKEY 1342

Query: 1386 RHLVSIAQPDI--AYTSKENIRDQKEVE--SIINRKDKSGKFKPMKKNYSSEMSLLNK-- 1445
                ++ +P +  AY      +D+K ++  ++ + K +  + +  ++    ++  L +  
Sbjct: 1399 VRCETVRKPAVIDAYVRIRTTKDEKFIQKFALFDEKHRE-EMRKERRRIQEQLRRLKRNQ 1342

Query: 1446 -KLKISGDKVKNFKEKKSARE-SFVCGKCGQFGHMRTNKNCPKYGEDMETPE----TTDQ 1455
             K K+ G   K  K+ K   +    CG CG  GHMRTNK CP Y +    P      T++
Sbjct: 1459 EKEKLKGPPEKKPKKMKERPDLKLKCGACGAIGHMRTNKFCPLYYQTNVPPSKPVAMTEE 1342


HSP 2 Score: 70.1 bits (170), Expect = 3.1e-10
Identity = 63/207 (30.43%), Positives = 101/207 (48.79%), Query Frame = 0

Query: 1689 MRDEQEKLAEIRRFEASIRSDKEEEERIKAK-----KKKKKRIPEIMDDYMEDPRSRQRI 1748
            M +EQE+  E    +  I +D EE  +++       K+  + + E+    +     +Q++
Sbjct: 1314 MTEEQEEELE----KTVIHNDNEELIKVEGTKIVFGKQLIENVHEVRRKSLVLKFPKQQL 1373

Query: 1749 PERDRVVKRKPIELGRHGAEHASSTKRRRGGEVGLSNILERIVEALKDNFEISYLFLKPV 1808
            P + +      +        H S  +RR    V LS+ILE I+  ++D    ++ F  PV
Sbjct: 1374 PPKKKRRVGTTVHCDYLNIPHKSIHRRRTDPMVTLSSILESIINDMRD-LPNTHPFHTPV 1433

Query: 1809 SKKEAPDYLDIIERPMDLSTIREKVRKLEYKTRDEFRTDVWQIMYNAHMYNDGRNPGIPP 1868
            + K   DY  II RPMDL T+RE VRK  Y +R+EFR  +  I+ N+  YN G    +  
Sbjct: 1434 NAKVVKDYYKIITRPMDLQTLRENVRKCLYPSREEFREHLELIVKNSATYN-GPKHSLTQ 1493

Query: 1869 LADKLLMHCDNLLNENDDELTEAEIGI 1891
            ++  +L  CD  L E +D+L   E  I
Sbjct: 1494 ISQSMLDLCDEKLKEKEDKLARLEKAI 1514

BLAST of MC09g0858 vs. NCBI nr
Match: XP_022155093.1 (transcription initiation factor TFIID subunit 1 [Momordica charantia])

HSP 1 Score: 3625 bits (9399), Expect = 0.0
Identity = 1885/1887 (99.89%), Positives = 1886/1887 (99.95%), Query Frame = 0

Query: 9    DDDDYEDAGGGNRFLGFMFGNVDNSGDLDADYLDEDAKEHLDALADKLGSTLTDIDLSTK 68
            DDDDYEDAGGGNRFLGFMFGNVDNSGDLDADYLDEDAKEHLDALADKLGSTLTDIDLSTK
Sbjct: 2    DDDDYEDAGGGNRFLGFMFGNVDNSGDLDADYLDEDAKEHLDALADKLGSTLTDIDLSTK 61

Query: 69   SAKTPSDAVEPDYDAKAEDAVDYEDIDEEYDGPEIEAAGEEDHLLPKKEYFSTEVSLATL 128
            SAKTPSDAVEPDYDAKAEDAVDYEDIDEEYDGPEIEAAGEEDHLLPKKEYFSTEVSLATL
Sbjct: 62   SAKTPSDAVEPDYDAKAEDAVDYEDIDEEYDGPEIEAAGEEDHLLPKKEYFSTEVSLATL 121

Query: 129  EPTVSVFDDEDYDEDFEKVHDVINSSVEARTTHASDEKGECLEVAYEGEKSVADDDIQSA 188
            EPTVSVFDDEDYDEDFEKVHDVINSSVEARTTHASDEKGECLEVAYEGEKSVADDDIQSA
Sbjct: 122  EPTVSVFDDEDYDEDFEKVHDVINSSVEARTTHASDEKGECLEVAYEGEKSVADDDIQSA 181

Query: 189  SLNNEVITSSAEELLEETPEVQKKLLDEKAHTPLPVLCMENGMAILQFSEIFGVHDSLKK 248
            SLNNEVITSSAEELLEETPEVQKKLLDEKAHTPLPVLCMENGMAILQFSEIFGVHDSLKK
Sbjct: 182  SLNNEVITSSAEELLEETPEVQKKLLDEKAHTPLPVLCMENGMAILQFSEIFGVHDSLKK 241

Query: 249  KEKRESRYCTRRDKYRSVDVSDIVEEDEEAFLHGFSQGVSCVKPASVVKDDTTMFNLDDP 308
            KEKRESRYCTRRDKYRSVDVSDIVEEDEEAFLHGFSQGVSCVKPASVVKDDTTMFNLDDP
Sbjct: 242  KEKRESRYCTRRDKYRSVDVSDIVEEDEEAFLHGFSQGVSCVKPASVVKDDTTMFNLDDP 301

Query: 309  EFTKFGVVQGVDVMAARVDWRQKDNCCGAEPMKQVFAENISIGSNSLLFKKFYPLDQQNW 368
            EFTKFGVVQGVDVMAARVDWRQKDNCCGAEPMKQVFAENISIGSNSLLFKKFYPLDQQNW
Sbjct: 302  EFTKFGVVQGVDVMAARVDWRQKDNCCGAEPMKQVFAENISIGSNSLLFKKFYPLDQQNW 361

Query: 369  EEGILWDNSPVLSKNSAGSCEVSGSDLEDSVSSDVEQQVSIQIVRSEHHIDPNDRRQRLS 428
            EEGILWDNSPVLSKNSAGSCEVSGSDLEDSVSSDVEQQVSIQIVRSEHHIDPNDRRQRLS
Sbjct: 362  EEGILWDNSPVLSKNSAGSCEVSGSDLEDSVSSDVEQQVSIQIVRSEHHIDPNDRRQRLS 421

Query: 429  QHDLPLLEPFGSRKFSGPEEPFSPEMIYHPQMLRLESWKDVDGSCQSEGTRENFSEEHQS 488
            QHDLPLLEPFGSRKFSGPEEPFSPEMIYHPQMLRLESWKDVDGSCQSEGTRENFSEEHQS
Sbjct: 422  QHDLPLLEPFGSRKFSGPEEPFSPEMIYHPQMLRLESWKDVDGSCQSEGTRENFSEEHQS 481

Query: 489  NAIRCFSKFSPKNRRMLEGSWLDKVLWESDEPIEKQKFIFDLEDEHMLFEISDEKESKYI 548
            NAIRCFSKFSPKNRRMLEGSWLDKVLWESDEPIEKQKFIFDLEDEHMLFEISDEKESKYI
Sbjct: 482  NAIRCFSKFSPKNRRMLEGSWLDKVLWESDEPIEKQKFIFDLEDEHMLFEISDEKESKYI 541

Query: 549  QFHAGAMILTRSSMSVNGNSFEISGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH 608
            QFHAGAMILTRSSMSVNGNSFEISGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH
Sbjct: 542  QFHAGAMILTRSSMSVNGNSFEISGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH 601

Query: 609  GVKVFHSKPAMMLQTMKLKLSNKELANFHRPKASWYPHDNEMAVRELQKLPTQGPMKIIL 668
            GVKVFHSKPAMMLQTMKLKLSNKELANFHRPKASWYPHDNEMAVRELQKLPTQGPMKIIL
Sbjct: 602  GVKVFHSKPAMMLQTMKLKLSNKELANFHRPKASWYPHDNEMAVRELQKLPTQGPMKIIL 661

Query: 669  KSLGGKGSKLFVDSEETVSSLMAKASKKLDMKSSEIVKVFYSGKELEREKSLAAQNVQPN 728
            KSLGGKGSKLFVDSEETVS+LMAKASKKLDMKSSEIVKVFYSGKELEREKSLAAQNVQPN
Sbjct: 662  KSLGGKGSKLFVDSEETVSALMAKASKKLDMKSSEIVKVFYSGKELEREKSLAAQNVQPN 721

Query: 729  SLLHLVRSKIFVMPWTQHLRGDSKSVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGN 788
            SLLHLVRSKIFVMPWTQHLRGDSKSVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGN
Sbjct: 722  SLLHLVRSKIFVMPWTQHLRGDSKSVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGN 781

Query: 789  IGMGARLCTYYQKSSPDDQTGALLRNGGDSLGHVIVLEPSDKSPYIGDIKGGSVQASLET 848
            IGMGARLCTYYQKSSPDDQTGALLRNGGDSLGHVIVLEPSDKSPYIGDIKGGSVQASLET
Sbjct: 782  IGMGARLCTYYQKSSPDDQTGALLRNGGDSLGHVIVLEPSDKSPYIGDIKGGSVQASLET 841

Query: 849  NMYRSPIFPHKVPMTDYILVRSAKGKLSLRRIYRNFAVGQQEPLMEVFSPGTKSLQMFMM 908
            NMYRSPIFPHKVPMTDYILVRSAKGKLSLRRI RNFAVGQQEPLMEVFSPGTKSLQMFMM
Sbjct: 842  NMYRSPIFPHKVPMTDYILVRSAKGKLSLRRIDRNFAVGQQEPLMEVFSPGTKSLQMFMM 901

Query: 909  NRLTLYIFREFLAAEKRRRLPYIRVDELPSQFPYLSETVIRKKLKEYALQQKSSNGQTIL 968
            NRLTLYIFREFLAAEKRRRLPYIRVDELPSQFPYLSETVIRKKLKEYALQQKSSNGQTIL
Sbjct: 902  NRLTLYIFREFLAAEKRRRLPYIRVDELPSQFPYLSETVIRKKLKEYALQQKSSNGQTIL 961

Query: 969  IKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLGITEVHPSAISSAMSRLPDEAITL 1028
            IKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLGITEVHPSAISSAMSRLPDEAITL
Sbjct: 962  IKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLGITEVHPSAISSAMSRLPDEAITL 1021

Query: 1029 AAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSYVRSVPKAP 1088
            AAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSYVRSVPKAP
Sbjct: 1022 AAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSYVRSVPKAP 1081

Query: 1089 ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI 1148
            ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI
Sbjct: 1082 ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI 1141

Query: 1149 RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLSTSDGAEN 1208
            RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLSTSDGAEN
Sbjct: 1142 RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLSTSDGAEN 1201

Query: 1209 ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRPSIAQTEEE 1268
            ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRPSIAQTEEE
Sbjct: 1202 ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRPSIAQTEEE 1261

Query: 1269 IEDEVAEATELCRLLMDDEAERRRKKKKNKVMGEAILVPGLQASFGHEIPQQTRHLVSIA 1328
            IEDEVAEATELCRLLMDDEAERRRKKKKNKVMGEAILVPGLQASFGHEIPQQTRHLVSIA
Sbjct: 1262 IEDEVAEATELCRLLMDDEAERRRKKKKNKVMGEAILVPGLQASFGHEIPQQTRHLVSIA 1321

Query: 1329 QPDIAYTSKENIRDQKEVESIINRKDKSGKFKPMKKNYSSEMSLLNKKLKISGDKVKNFK 1388
            QPDIAYTSKENIRDQKEVESIINRKDKSGKFKPMKKNYSSEMSLLNKKLKISGDKVKNFK
Sbjct: 1322 QPDIAYTSKENIRDQKEVESIINRKDKSGKFKPMKKNYSSEMSLLNKKLKISGDKVKNFK 1381

Query: 1389 EKKSARESFVCGKCGQFGHMRTNKNCPKYGEDMETPETTDQEKVSIKLNTVDPSSQSHQK 1448
            EKKSARESFVCGKCGQFGHMRTNKNCPKYGEDMETPETTDQEKVSIKLNTVDPSSQSHQK
Sbjct: 1382 EKKSARESFVCGKCGQFGHMRTNKNCPKYGEDMETPETTDQEKVSIKLNTVDPSSQSHQK 1441

Query: 1449 AHTKKVTPKTITKISTTETFEGEKSTLMAKMLPVKFKCSSSEKLSDNLSPGVPQTSDLPF 1508
            AHTKKVTPKTITKISTTETFEGEKSTLMAKMLPVKFKCSSSEKLSDNLSPGVPQTSDLPF
Sbjct: 1442 AHTKKVTPKTITKISTTETFEGEKSTLMAKMLPVKFKCSSSEKLSDNLSPGVPQTSDLPF 1501

Query: 1509 NSDNETGKSVVKVNKITFSKKRNEDIQFESHKPSIVIRPPDAKKVYAEAHKPSIVIRPPT 1568
            NSDNETGKSVVKVNKITFSKKRNEDIQFESHKPSIVIRPPDAKKVYAEAHKPSIVIRPPT
Sbjct: 1502 NSDNETGKSVVKVNKITFSKKRNEDIQFESHKPSIVIRPPDAKKVYAEAHKPSIVIRPPT 1561

Query: 1569 SIDRDRMEFPRRSATVVRSPAETKREKLNKKLIIKRPKEVIDLDQMGFDASAGMQYRKTK 1628
            SIDRDRMEFPRRSATVVRSPAETKREKLNKKLIIKRPKEVIDLDQMGFDASAGMQYRKTK
Sbjct: 1562 SIDRDRMEFPRRSATVVRSPAETKREKLNKKLIIKRPKEVIDLDQMGFDASAGMQYRKTK 1621

Query: 1629 RIVELSSFENHTRPGSMSSAESGKKKVREDQIWWEKQERQKNEERLREEKARRVYKEEMR 1688
            RIVELSSFENHTRPGSMSSAESGKKKVREDQIWWEKQERQKNEERLREEKARRVYKEEMR
Sbjct: 1622 RIVELSSFENHTRPGSMSSAESGKKKVREDQIWWEKQERQKNEERLREEKARRVYKEEMR 1681

Query: 1689 MRDEQEKLAEIRRFEASIRSDKEEEERIKAKKKKKKRIPEIMDDYMEDPRSRQRIPERDR 1748
            MRDEQEKLAEIRRFEASIRSDKEEEERIKAKKKKKKRIPEIMDDYMEDPRSRQRIPERDR
Sbjct: 1682 MRDEQEKLAEIRRFEASIRSDKEEEERIKAKKKKKKRIPEIMDDYMEDPRSRQRIPERDR 1741

Query: 1749 VVKRKPIELGRHGAEHASSTKRRRGGEVGLSNILERIVEALKDNFEISYLFLKPVSKKEA 1808
            VVKRKPIELGRHGAEHASSTKRRRGGEVGLSNILERIVEALKDNFEISYLFLKPVSKKEA
Sbjct: 1742 VVKRKPIELGRHGAEHASSTKRRRGGEVGLSNILERIVEALKDNFEISYLFLKPVSKKEA 1801

Query: 1809 PDYLDIIERPMDLSTIREKVRKLEYKTRDEFRTDVWQIMYNAHMYNDGRNPGIPPLADKL 1868
            PDYLDIIERPMDLSTIREKVRKLEYKTRDEFRTDVWQIMYNAHMYNDGRNPGIPPLADKL
Sbjct: 1802 PDYLDIIERPMDLSTIREKVRKLEYKTRDEFRTDVWQIMYNAHMYNDGRNPGIPPLADKL 1861

Query: 1869 LMHCDNLLNENDDELTEAEIGIEYRDS 1895
            LMHCDNLLNENDDELTEAEIGIEYRDS
Sbjct: 1862 LMHCDNLLNENDDELTEAEIGIEYRDS 1888

BLAST of MC09g0858 vs. NCBI nr
Match: XP_038899368.1 (transcription initiation factor TFIID subunit 1 isoform X2 [Benincasa hispida])

HSP 1 Score: 3199 bits (8294), Expect = 0.0
Identity = 1661/1893 (87.74%), Positives = 1765/1893 (93.24%), Query Frame = 0

Query: 9    DDDDYEDAGGGNRFLGFMFGNVDNSGDLDADYLDEDAKEHLDALADKLGSTLTDIDLSTK 68
            DDDDYEDA GGNRFLGFMFGNVDNSGDLDADYLDEDAKEHL ALADKLG TLTDIDLSTK
Sbjct: 16   DDDDYEDANGGNRFLGFMFGNVDNSGDLDADYLDEDAKEHLAALADKLGPTLTDIDLSTK 75

Query: 69   SAKTPSDAVEPDYDAKAEDAVDYEDIDEEYDGPEIEAAGEEDHLLPKKEYFSTEVSLATL 128
            S K  SDAVEPDYDAKAEDA+DYEDIDEEYDGPEIEA GEEDHLLPK+EYFS EVSLATL
Sbjct: 76   SPKIRSDAVEPDYDAKAEDAIDYEDIDEEYDGPEIEATGEEDHLLPKREYFSAEVSLATL 135

Query: 129  EPTVSVFDDEDYDEDFEKVHDVINSSVEARTTHASDEKGECLEVAYEGEKSVADDDIQSA 188
            EPT SVFDDEDYDEDFEKV DV+N + EA+  HASDE+GECLE+  EGEKS A DD+QSA
Sbjct: 136  EPTASVFDDEDYDEDFEKVPDVVNGNAEAQNIHASDEQGECLEIVSEGEKSFAVDDLQSA 195

Query: 189  SLNNEVITSSAEELLEETPEVQKKLLDEKAHTPLPVLCMENGMAILQFSEIFGVHDSLKK 248
             LNNEVITS AE LLE TPEVQK+L DEK+HTPLPVLCMENGMAILQFSEIFGVHDSL K
Sbjct: 196  PLNNEVITSDAEGLLEGTPEVQKRLQDEKSHTPLPVLCMENGMAILQFSEIFGVHDSLNK 255

Query: 249  KEKRESRYCTRRDKYRSVDVSDIVEEDEEAFLHGFSQGVSCVKPASVVKDDTTMFNLDDP 308
            KEKR+SRY TRRDKYRS DVSDIVEEDEEAFLHGFS+GVS VKPA VVKDDTTMF++DDP
Sbjct: 256  KEKRDSRYSTRRDKYRSADVSDIVEEDEEAFLHGFSRGVSYVKPAYVVKDDTTMFDVDDP 315

Query: 309  EFTKFGVVQGVDVMAARVDWRQKDNCCGAEPMKQVFAENISIGSNSLLFKKFYPLDQQNW 368
            E+TKFGVVQGVDVMA+RVDWRQKD CCGAEPMKQ+ AEN++IGSNSLLFKKFYPLD QNW
Sbjct: 316  EYTKFGVVQGVDVMASRVDWRQKDRCCGAEPMKQIVAENVTIGSNSLLFKKFYPLDHQNW 375

Query: 369  EEGILWDNSPVLSKNSAGSCEVSGSDLEDSVSSDVEQQVSIQIVRSEHHIDPNDRRQRLS 428
            EE ILWDNSPV SKN++GSCE SGSD+E S +SDVE QVSIQIVRSE  I PN   Q L 
Sbjct: 376  EERILWDNSPVSSKNASGSCEASGSDIEASSNSDVEPQVSIQIVRSEDRIGPNGEGQSLY 435

Query: 429  QHDLPLLEPFGSRKFSGPEEPFSPEMIYHPQMLRLESWKDVDGSCQSEGTRENFSEEHQS 488
             H   LLEPFGSRK SG EE  SPEMIYHPQMLRLESWKDVD SCQS+G +E+ S+EHQS
Sbjct: 436  HHGFQLLEPFGSRKISGTEESVSPEMIYHPQMLRLESWKDVDDSCQSDGIKESISDEHQS 495

Query: 489  NAIRCFSKFSPKNRRMLEGSWLDKVLWESDEPIEKQKFIFDLEDEHMLFEISDEKESKYI 548
             A+R FSKFSPKNRRMLEGSWLDKVLWE+DEPIEK KFIFDLEDEHMLFEISDE ESKYI
Sbjct: 496  YAVRSFSKFSPKNRRMLEGSWLDKVLWETDEPIEKPKFIFDLEDEHMLFEISDENESKYI 555

Query: 549  QFHAGAMILTRSSMSVNGNSFEISGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH 608
            QFH+GAMILTRSSMS+NGNSFEISGSGGQGGWR VSNDKHYSNRKASQQLKSNSKKRSVH
Sbjct: 556  QFHSGAMILTRSSMSINGNSFEISGSGGQGGWRVVSNDKHYSNRKASQQLKSNSKKRSVH 615

Query: 609  GVKVFHSKPAMMLQTMKLKLSNKELANFHRPKASWYPHDNEMAVRELQKLPTQGPMKIIL 668
            G+KVFHSKPAMMLQTMKLKLSNKELANFHRPKA WYPHDNE+ VRELQKLPTQG MKII+
Sbjct: 616  GIKVFHSKPAMMLQTMKLKLSNKELANFHRPKALWYPHDNEVTVRELQKLPTQGQMKIIM 675

Query: 669  KSLGGKGSKLFVDSEETVSSLMAKASKKLDMKSSEIVKVFYSGKELEREKSLAAQNVQPN 728
            KSLGGKGSK  VD EETVSS+MAKASKKLDMK SE +K+FYSGKELEREKSLAAQNV+PN
Sbjct: 676  KSLGGKGSKHIVDPEETVSSIMAKASKKLDMKPSENIKLFYSGKELEREKSLAAQNVKPN 735

Query: 729  SLLHLVRSKIFVMPWTQHLRGDSKSVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGN 788
            SLLHLVRSKI++MP  Q+LRG+++SVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGN
Sbjct: 736  SLLHLVRSKIYIMPRAQNLRGENRSVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGN 795

Query: 789  IGMGARLCTYYQKSSPDDQTGALLRNGGDSLGHVIVLEPSDKSPYIGDIKGGSVQASLET 848
            IGMGARLCTYYQKSSPDDQTGALLRNGGDSLGHVI+LEPSDKSPY+GD+KGGS+QASLET
Sbjct: 796  IGMGARLCTYYQKSSPDDQTGALLRNGGDSLGHVIILEPSDKSPYLGDLKGGSIQASLET 855

Query: 849  NMYRSPIFPHKVPMTDYILVRSAKGKLSLRRIYRNFAVGQQEPLMEVFSPGTKSLQMFMM 908
            NMYRSP+F HKVPMTDYILVRSAKGKLSLRRI +NFAVGQQEPLMEVFSPGTKSLQ FMM
Sbjct: 856  NMYRSPVFSHKVPMTDYILVRSAKGKLSLRRIDKNFAVGQQEPLMEVFSPGTKSLQTFMM 915

Query: 909  NRLTLYIFREFLAAEKRRRLPYIRVDELPSQFPYLSETVIRKKLKEYALQQKSSNGQTIL 968
            NRLTLY+FREFLAAEKRRR+P IRV+ELPSQFPYLSET+IRKKLKEYALQQ++S+GQ IL
Sbjct: 916  NRLTLYMFREFLAAEKRRRIPDIRVEELPSQFPYLSETIIRKKLKEYALQQRNSSGQIIL 975

Query: 969  IKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLGITEVHPSAISSAMSRLPDEAITL 1028
            IKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLG++EVHPSAISSAMSRLPDEAITL
Sbjct: 976  IKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLGLSEVHPSAISSAMSRLPDEAITL 1035

Query: 1029 AAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSYVRSVPKAP 1088
            AAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSYVRSVPKAP
Sbjct: 1036 AAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSYVRSVPKAP 1095

Query: 1089 ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI 1148
            ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI
Sbjct: 1096 ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI 1155

Query: 1149 RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLSTSDGAEN 1208
            RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLS SDGAEN
Sbjct: 1156 RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLSASDGAEN 1215

Query: 1209 ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRPSIAQTEEE 1268
            ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRPSIAQTEEE
Sbjct: 1216 ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRPSIAQTEEE 1275

Query: 1269 IEDEVAEATELCRLLMDDEAERRRKKKKNKVMGEAILVPGLQASFGHEIPQQTRHLVSIA 1328
            IEDEVAEA ELCRLLMDDEAERRRKKKKNKVMGEA+L  G QASFGHE P+QTRHLVSIA
Sbjct: 1276 IEDEVAEAAELCRLLMDDEAERRRKKKKNKVMGEAVLSTGFQASFGHENPEQTRHLVSIA 1335

Query: 1329 QPDIAYTSKENIRDQKEVESIINRKDKSGKFKPMKKNYSSEMSLLNKKLKISGDKVKNFK 1388
            QPD+ Y SKENIR+QKEVE+IINRK+K GK KP KKNYSSEM+L+NKKLKISGDKVKNFK
Sbjct: 1336 QPDVTYISKENIREQKEVENIINRKEKFGKLKPTKKNYSSEMNLINKKLKISGDKVKNFK 1395

Query: 1389 EKKSARESFVCGKCGQFGHMRTNKNCPKYGEDMETPETTDQEKVSIKLNTVDPSSQSHQK 1448
            EKKSARESFVCGKCGQFGHMRTNKNCPKYGED+ETPETTDQEKVSIKLNT+DPS+QSHQK
Sbjct: 1396 EKKSARESFVCGKCGQFGHMRTNKNCPKYGEDLETPETTDQEKVSIKLNTLDPSNQSHQK 1455

Query: 1449 AHTKKVTPKTITKISTTETFEGEKSTLMAKMLPVKFKCSSSEKLSDNLSPGVPQTSDLPF 1508
            A TKKVTPK + K STTE FEGEKSTLMAK+ PVKFKCSSS++LSDNLSP +PQTSDLP 
Sbjct: 1456 AVTKKVTPKAVAKSSTTEAFEGEKSTLMAKVFPVKFKCSSSDRLSDNLSPALPQTSDLPV 1515

Query: 1509 NSDNETGKSVVKVNKITFSKKRNEDIQFESHKPSIVIRPPDAKKVYAEAHKPSIVIRPPT 1568
            NSDNETGKSV KVNKITFSKKR EDIQFESHKPSIVIRPPDAKKV  EAHKPSIVIRPPT
Sbjct: 1516 NSDNETGKSV-KVNKITFSKKRTEDIQFESHKPSIVIRPPDAKKVSLEAHKPSIVIRPPT 1575

Query: 1569 SIDRDRMEFPRRSATVVRSPAETKREKLNKKLIIKRPKEV---IDLDQMGFDASAGMQYR 1628
            ++DRD+MEF RRSAT++RS AET++E+L+KKLIIKRPKEV   IDLD+  +D S GM+YR
Sbjct: 1576 NMDRDKMEFSRRSATIIRSAAETEKEQLHKKLIIKRPKEVKEVIDLDRSAYDGSVGMEYR 1635

Query: 1629 KTKRIVELSSFENHTRPGSMSSAESGKKKVREDQIWWEKQERQKNEERLREEKARRVYKE 1688
            KTKRIVELSSFE HTR GSMSS+ESGKKKVRE+  WWEKQE+Q+NEERLREEK RRVY E
Sbjct: 1636 KTKRIVELSSFEKHTRHGSMSSSESGKKKVREEHRWWEKQEKQRNEERLREEKVRRVYNE 1695

Query: 1689 EMRMRDEQEKLAEIRRFEASIRSDKEEEERIKAKKKKKKRIPEIMDDYMEDPRSRQ---R 1748
            +M MR+EQEKLAEIRRFEASIRSDKEEEER+KAKKKKKKRIPEI+DDYMEDPRSR+   R
Sbjct: 1696 QMGMREEQEKLAEIRRFEASIRSDKEEEERLKAKKKKKKRIPEILDDYMEDPRSRRFDKR 1755

Query: 1749 IPERDRVVKRKPIELGRHGAEHASSTKRRRGGEVGLSNILERIVEALKDNFEISYLFLKP 1808
            + E++R  KRKPIELGRH AEHASSTKRRRGGEVGLSNILERIVE LKD F+ISYLFLKP
Sbjct: 1756 VLEKERSGKRKPIELGRHVAEHASSTKRRRGGEVGLSNILERIVETLKDRFDISYLFLKP 1815

Query: 1809 VSKKEAPDYLDIIERPMDLSTIREKVRKLEYKTRDEFRTDVWQIMYNAHMYNDGRNPGIP 1868
            VSKKEAPDYLDIIERPMDLSTIREKVR+LEYKTRDEFR DVWQIMYNAH+YNDGRNPGIP
Sbjct: 1816 VSKKEAPDYLDIIERPMDLSTIREKVRRLEYKTRDEFRNDVWQIMYNAHLYNDGRNPGIP 1875

Query: 1869 PLADKLLMHCDNLLNENDDELTEAEIGIEYRDS 1895
            PLADKLLM CDNLL  +D+ELTEAEIGIEYRDS
Sbjct: 1876 PLADKLLMLCDNLLKHSDEELTEAEIGIEYRDS 1907

BLAST of MC09g0858 vs. NCBI nr
Match: XP_038899362.1 (transcription initiation factor TFIID subunit 1 isoform X1 [Benincasa hispida] >XP_038899363.1 transcription initiation factor TFIID subunit 1 isoform X1 [Benincasa hispida] >XP_038899365.1 transcription initiation factor TFIID subunit 1 isoform X1 [Benincasa hispida] >XP_038899366.1 transcription initiation factor TFIID subunit 1 isoform X1 [Benincasa hispida] >XP_038899367.1 transcription initiation factor TFIID subunit 1 isoform X1 [Benincasa hispida])

HSP 1 Score: 3192 bits (8275), Expect = 0.0
Identity = 1661/1901 (87.38%), Positives = 1765/1901 (92.85%), Query Frame = 0

Query: 9    DDDDYEDAGGGNRFLGFMFGNVDNSGDLDADYLDEDAKEHLDALADKLGSTLTDIDLSTK 68
            DDDDYEDA GGNRFLGFMFGNVDNSGDLDADYLDEDAKEHL ALADKLG TLTDIDLSTK
Sbjct: 16   DDDDYEDANGGNRFLGFMFGNVDNSGDLDADYLDEDAKEHLAALADKLGPTLTDIDLSTK 75

Query: 69   SAKTPSDAVEPDYDAKAEDAVDYEDIDEEYDGPEIEAAGEEDHLLPKKEYFSTEVSLATL 128
            S K  SDAVEPDYDAKAEDA+DYEDIDEEYDGPEIEA GEEDHLLPK+EYFS EVSLATL
Sbjct: 76   SPKIRSDAVEPDYDAKAEDAIDYEDIDEEYDGPEIEATGEEDHLLPKREYFSAEVSLATL 135

Query: 129  EPTVSVFDDEDYDEDFEKVHDVINSSVEARTTHASDEKGECLEVAYEGEKSVADDDIQSA 188
            EPT SVFDDEDYDEDFEKV DV+N + EA+  HASDE+GECLE+  EGEKS A DD+QSA
Sbjct: 136  EPTASVFDDEDYDEDFEKVPDVVNGNAEAQNIHASDEQGECLEIVSEGEKSFAVDDLQSA 195

Query: 189  SLNNEVITSSAEELLEETPEVQKKLLDEKAHTPLPVLCMENGMAILQFSEIFGVHDSLKK 248
             LNNEVITS AE LLE TPEVQK+L DEK+HTPLPVLCMENGMAILQFSEIFGVHDSL K
Sbjct: 196  PLNNEVITSDAEGLLEGTPEVQKRLQDEKSHTPLPVLCMENGMAILQFSEIFGVHDSLNK 255

Query: 249  KEKRESRYCTRRDKYRSVDVSDIVEEDEEAFLHGFSQGVSCVKPASVVKDDTTMFNLDDP 308
            KEKR+SRY TRRDKYRS DVSDIVEEDEEAFLHGFS+GVS VKPA VVKDDTTMF++DDP
Sbjct: 256  KEKRDSRYSTRRDKYRSADVSDIVEEDEEAFLHGFSRGVSYVKPAYVVKDDTTMFDVDDP 315

Query: 309  EFTKFGVVQGVDVMAARVDWRQKDNCCGAEPMKQVFAENISIGSNSLLFKKFYPLDQQNW 368
            E+TKFGVVQGVDVMA+RVDWRQKD CCGAEPMKQ+ AEN++IGSNSLLFKKFYPLD QNW
Sbjct: 316  EYTKFGVVQGVDVMASRVDWRQKDRCCGAEPMKQIVAENVTIGSNSLLFKKFYPLDHQNW 375

Query: 369  EEGILWDNSPVLSKNSAGSCEVSGSDLEDSVSSDVEQQVSIQIVRSEHHIDPNDRRQRLS 428
            EE ILWDNSPV SKN++GSCE SGSD+E S +SDVE QVSIQIVRSE  I PN   Q L 
Sbjct: 376  EERILWDNSPVSSKNASGSCEASGSDIEASSNSDVEPQVSIQIVRSEDRIGPNGEGQSLY 435

Query: 429  QHDLPLLEPFGSRKFSGPEEPFSPEMIYHPQMLRLESWKDVDGSCQSEGTRENFSEEHQS 488
             H   LLEPFGSRK SG EE  SPEMIYHPQMLRLESWKDVD SCQS+G +E+ S+EHQS
Sbjct: 436  HHGFQLLEPFGSRKISGTEESVSPEMIYHPQMLRLESWKDVDDSCQSDGIKESISDEHQS 495

Query: 489  NAIRCFSKFSPKNRRMLEGSWLDKVLWESDEPIEKQKFIFDLEDEHMLFEISDEKESKYI 548
             A+R FSKFSPKNRRMLEGSWLDKVLWE+DEPIEK KFIFDLEDEHMLFEISDE ESKYI
Sbjct: 496  YAVRSFSKFSPKNRRMLEGSWLDKVLWETDEPIEKPKFIFDLEDEHMLFEISDENESKYI 555

Query: 549  QFHAGAMILTRSSMSVNGNSFEISGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH 608
            QFH+GAMILTRSSMS+NGNSFEISGSGGQGGWR VSNDKHYSNRKASQQLKSNSKKRSVH
Sbjct: 556  QFHSGAMILTRSSMSINGNSFEISGSGGQGGWRVVSNDKHYSNRKASQQLKSNSKKRSVH 615

Query: 609  GVKVFHSKPAMMLQTMKLKLSNKELANFHRPKASWYPHDNEMAVRELQKLPTQGPMKIIL 668
            G+KVFHSKPAMMLQTMKLKLSNKELANFHRPKA WYPHDNE+ VRELQKLPTQG MKII+
Sbjct: 616  GIKVFHSKPAMMLQTMKLKLSNKELANFHRPKALWYPHDNEVTVRELQKLPTQGQMKIIM 675

Query: 669  KSLGGKGSKLFVDSEETVSSLMAKASKKL--------DMKSSEIVKVFYSGKELEREKSL 728
            KSLGGKGSK  VD EETVSS+MAKASKKL        DMK SE +K+FYSGKELEREKSL
Sbjct: 676  KSLGGKGSKHIVDPEETVSSIMAKASKKLGYQNFFFLDMKPSENIKLFYSGKELEREKSL 735

Query: 729  AAQNVQPNSLLHLVRSKIFVMPWTQHLRGDSKSVRSPGAFKKKSDLSVKDGHVFLMEYCE 788
            AAQNV+PNSLLHLVRSKI++MP  Q+LRG+++SVRSPGAFKKKSDLSVKDGHVFLMEYCE
Sbjct: 736  AAQNVKPNSLLHLVRSKIYIMPRAQNLRGENRSVRSPGAFKKKSDLSVKDGHVFLMEYCE 795

Query: 789  ERPLLLGNIGMGARLCTYYQKSSPDDQTGALLRNGGDSLGHVIVLEPSDKSPYIGDIKGG 848
            ERPLLLGNIGMGARLCTYYQKSSPDDQTGALLRNGGDSLGHVI+LEPSDKSPY+GD+KGG
Sbjct: 796  ERPLLLGNIGMGARLCTYYQKSSPDDQTGALLRNGGDSLGHVIILEPSDKSPYLGDLKGG 855

Query: 849  SVQASLETNMYRSPIFPHKVPMTDYILVRSAKGKLSLRRIYRNFAVGQQEPLMEVFSPGT 908
            S+QASLETNMYRSP+F HKVPMTDYILVRSAKGKLSLRRI +NFAVGQQEPLMEVFSPGT
Sbjct: 856  SIQASLETNMYRSPVFSHKVPMTDYILVRSAKGKLSLRRIDKNFAVGQQEPLMEVFSPGT 915

Query: 909  KSLQMFMMNRLTLYIFREFLAAEKRRRLPYIRVDELPSQFPYLSETVIRKKLKEYALQQK 968
            KSLQ FMMNRLTLY+FREFLAAEKRRR+P IRV+ELPSQFPYLSET+IRKKLKEYALQQ+
Sbjct: 916  KSLQTFMMNRLTLYMFREFLAAEKRRRIPDIRVEELPSQFPYLSETIIRKKLKEYALQQR 975

Query: 969  SSNGQTILIKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLGITEVHPSAISSAMSR 1028
            +S+GQ ILIKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLG++EVHPSAISSAMSR
Sbjct: 976  NSSGQIILIKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLGLSEVHPSAISSAMSR 1035

Query: 1029 LPDEAITLAAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSY 1088
            LPDEAITLAAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSY
Sbjct: 1036 LPDEAITLAAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSY 1095

Query: 1089 VRSVPKAPISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLT 1148
            VRSVPKAPISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLT
Sbjct: 1096 VRSVPKAPISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLT 1155

Query: 1149 RWHRIAMIRRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSL 1208
            RWHRIAMIRRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSL
Sbjct: 1156 RWHRIAMIRRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSL 1215

Query: 1209 STSDGAENESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRP 1268
            S SDGAENESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRP
Sbjct: 1216 SASDGAENESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRP 1275

Query: 1269 SIAQTEEEIEDEVAEATELCRLLMDDEAERRRKKKKNKVMGEAILVPGLQASFGHEIPQQ 1328
            SIAQTEEEIEDEVAEA ELCRLLMDDEAERRRKKKKNKVMGEA+L  G QASFGHE P+Q
Sbjct: 1276 SIAQTEEEIEDEVAEAAELCRLLMDDEAERRRKKKKNKVMGEAVLSTGFQASFGHENPEQ 1335

Query: 1329 TRHLVSIAQPDIAYTSKENIRDQKEVESIINRKDKSGKFKPMKKNYSSEMSLLNKKLKIS 1388
            TRHLVSIAQPD+ Y SKENIR+QKEVE+IINRK+K GK KP KKNYSSEM+L+NKKLKIS
Sbjct: 1336 TRHLVSIAQPDVTYISKENIREQKEVENIINRKEKFGKLKPTKKNYSSEMNLINKKLKIS 1395

Query: 1389 GDKVKNFKEKKSARESFVCGKCGQFGHMRTNKNCPKYGEDMETPETTDQEKVSIKLNTVD 1448
            GDKVKNFKEKKSARESFVCGKCGQFGHMRTNKNCPKYGED+ETPETTDQEKVSIKLNT+D
Sbjct: 1396 GDKVKNFKEKKSARESFVCGKCGQFGHMRTNKNCPKYGEDLETPETTDQEKVSIKLNTLD 1455

Query: 1449 PSSQSHQKAHTKKVTPKTITKISTTETFEGEKSTLMAKMLPVKFKCSSSEKLSDNLSPGV 1508
            PS+QSHQKA TKKVTPK + K STTE FEGEKSTLMAK+ PVKFKCSSS++LSDNLSP +
Sbjct: 1456 PSNQSHQKAVTKKVTPKAVAKSSTTEAFEGEKSTLMAKVFPVKFKCSSSDRLSDNLSPAL 1515

Query: 1509 PQTSDLPFNSDNETGKSVVKVNKITFSKKRNEDIQFESHKPSIVIRPPDAKKVYAEAHKP 1568
            PQTSDLP NSDNETGKSV KVNKITFSKKR EDIQFESHKPSIVIRPPDAKKV  EAHKP
Sbjct: 1516 PQTSDLPVNSDNETGKSV-KVNKITFSKKRTEDIQFESHKPSIVIRPPDAKKVSLEAHKP 1575

Query: 1569 SIVIRPPTSIDRDRMEFPRRSATVVRSPAETKREKLNKKLIIKRPKEV---IDLDQMGFD 1628
            SIVIRPPT++DRD+MEF RRSAT++RS AET++E+L+KKLIIKRPKEV   IDLD+  +D
Sbjct: 1576 SIVIRPPTNMDRDKMEFSRRSATIIRSAAETEKEQLHKKLIIKRPKEVKEVIDLDRSAYD 1635

Query: 1629 ASAGMQYRKTKRIVELSSFENHTRPGSMSSAESGKKKVREDQIWWEKQERQKNEERLREE 1688
             S GM+YRKTKRIVELSSFE HTR GSMSS+ESGKKKVRE+  WWEKQE+Q+NEERLREE
Sbjct: 1636 GSVGMEYRKTKRIVELSSFEKHTRHGSMSSSESGKKKVREEHRWWEKQEKQRNEERLREE 1695

Query: 1689 KARRVYKEEMRMRDEQEKLAEIRRFEASIRSDKEEEERIKAKKKKKKRIPEIMDDYMEDP 1748
            K RRVY E+M MR+EQEKLAEIRRFEASIRSDKEEEER+KAKKKKKKRIPEI+DDYMEDP
Sbjct: 1696 KVRRVYNEQMGMREEQEKLAEIRRFEASIRSDKEEEERLKAKKKKKKRIPEILDDYMEDP 1755

Query: 1749 RSRQ---RIPERDRVVKRKPIELGRHGAEHASSTKRRRGGEVGLSNILERIVEALKDNFE 1808
            RSR+   R+ E++R  KRKPIELGRH AEHASSTKRRRGGEVGLSNILERIVE LKD F+
Sbjct: 1756 RSRRFDKRVLEKERSGKRKPIELGRHVAEHASSTKRRRGGEVGLSNILERIVETLKDRFD 1815

Query: 1809 ISYLFLKPVSKKEAPDYLDIIERPMDLSTIREKVRKLEYKTRDEFRTDVWQIMYNAHMYN 1868
            ISYLFLKPVSKKEAPDYLDIIERPMDLSTIREKVR+LEYKTRDEFR DVWQIMYNAH+YN
Sbjct: 1816 ISYLFLKPVSKKEAPDYLDIIERPMDLSTIREKVRRLEYKTRDEFRNDVWQIMYNAHLYN 1875

Query: 1869 DGRNPGIPPLADKLLMHCDNLLNENDDELTEAEIGIEYRDS 1895
            DGRNPGIPPLADKLLM CDNLL  +D+ELTEAEIGIEYRDS
Sbjct: 1876 DGRNPGIPPLADKLLMLCDNLLKHSDEELTEAEIGIEYRDS 1915

BLAST of MC09g0858 vs. NCBI nr
Match: XP_022959797.1 (transcription initiation factor TFIID subunit 1 isoform X1 [Cucurbita moschata] >XP_022959798.1 transcription initiation factor TFIID subunit 1 isoform X1 [Cucurbita moschata] >XP_022959799.1 transcription initiation factor TFIID subunit 1 isoform X1 [Cucurbita moschata] >XP_022959800.1 transcription initiation factor TFIID subunit 1 isoform X1 [Cucurbita moschata])

HSP 1 Score: 3170 bits (8218), Expect = 0.0
Identity = 1645/1891 (86.99%), Positives = 1757/1891 (92.91%), Query Frame = 0

Query: 9    DDDDYEDAGGGNRFLGFMFGNVDNSGDLDADYLDEDAKEHLDALADKLGSTLTDIDLSTK 68
            DDDDYEDAGGGNR LGFMFGNVDNSGDLDADYLDEDAKEHL ALADKLG TLTDIDLSTK
Sbjct: 16   DDDDYEDAGGGNRLLGFMFGNVDNSGDLDADYLDEDAKEHLAALADKLGPTLTDIDLSTK 75

Query: 69   SAKTPSDAVEPDYDAKAEDAVDYEDIDEEYDGPEIEAAGEEDHLLPKKEYFSTEVSLATL 128
            S K PSDA+EPDYDAKAEDA+DYEDIDEEYDGPEIEAAGEEDHLLPKKEYFS EVSL TL
Sbjct: 76   SPKIPSDAIEPDYDAKAEDAIDYEDIDEEYDGPEIEAAGEEDHLLPKKEYFSAEVSLPTL 135

Query: 129  EPTVSVFDDEDYDEDFEKVHDVINSSVEARTTHASDEKGECLEVAYEGEKSVADDDIQSA 188
            EPT SVFDDEDYDED EKVH+V N SV A T HASDE+GE LEV  EGEKS A+DD+ S 
Sbjct: 136  EPTASVFDDEDYDEDIEKVHEVANRSVVAPTIHASDEQGEYLEVVSEGEKSFAEDDLPSV 195

Query: 189  SLNNEVITSSAEELLEETPEVQKKLLDEKAHTPLPVLCMENGMAILQFSEIFGVHDSLKK 248
              NNEVITSS EEL EETPEV+K++ +EKAHTPLPVLCMENGMAILQFSEIFGVHDSLKK
Sbjct: 196  PFNNEVITSSPEELFEETPEVEKRMQEEKAHTPLPVLCMENGMAILQFSEIFGVHDSLKK 255

Query: 249  KEKRESRYCTRRDKYRSVDVSDIVEEDEEAFLHGFSQGVSCVKPASVVKDDTTMFNLDDP 308
            KEKR+SRYC+RRDKYRS DVSDIVEEDEEAFLHG S+GVSC+KPA VVKDDTTM  LDDP
Sbjct: 256  KEKRDSRYCSRRDKYRSADVSDIVEEDEEAFLHGSSRGVSCMKPAYVVKDDTTM--LDDP 315

Query: 309  EFTKFGVVQGVDVMAARVDWRQKDNCCGAEPMKQVFAENISIGSNSLLFKKFYPLDQQNW 368
            E+TKFG V G D MA+RV+WRQKD+CCGAEP K+V AENI+IGSNSLLF+K YPLDQQNW
Sbjct: 316  EYTKFGAVDGGDEMASRVEWRQKDHCCGAEPAKEVVAENITIGSNSLLFQKLYPLDQQNW 375

Query: 369  EEGILWDNSPVLSKNSAGSCEVSGSDLEDSVSSDVEQQVSIQIVRSEHHIDPNDRRQRLS 428
            EE ILWDNSPV SKNSAGSCEV GSD+E SVSSDVE QVSIQIV S+H IDP+D  Q L 
Sbjct: 376  EERILWDNSPVSSKNSAGSCEVFGSDMEASVSSDVEPQVSIQIVGSDHRIDPDDEDQSLY 435

Query: 429  QHDLPLLEPFGSRKFSGPEEPFSPEMIYHPQMLRLESWKDVDGSCQSEGTRENFSEEHQS 488
             H  PLLE FGSRKFSG EEP SPE+IYHPQMLRLESWKDV+ SCQS+  +EN  +E QS
Sbjct: 436  HHSFPLLEAFGSRKFSGTEEPLSPEIIYHPQMLRLESWKDVEDSCQSDCIKENNPDELQS 495

Query: 489  NAIRCFSKFSPKNRRMLEGSWLDKVLWESDEPIEKQKFIFDLEDEHMLFEISDEKESKYI 548
            NAIR FSKFSPKNRRMLEGSWLDKVLWES+EPIEK KFIFDLEDEHMLFEISDEKESKYI
Sbjct: 496  NAIRSFSKFSPKNRRMLEGSWLDKVLWESNEPIEKPKFIFDLEDEHMLFEISDEKESKYI 555

Query: 549  QFHAGAMILTRSSMSVNGNSFEISGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH 608
            QFH+GAMILTRSS+SVNG+SFEISGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH
Sbjct: 556  QFHSGAMILTRSSLSVNGHSFEISGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH 615

Query: 609  GVKVFHSKPAMMLQTMKLKLSNKELANFHRPKASWYPHDNEMAVRELQKLPTQGPMKIIL 668
            G+KVFHS+PAMMLQTMKLKLSNKELANFHRPKA WYPHDNE AVRELQKLPT GPM+IIL
Sbjct: 616  GIKVFHSRPAMMLQTMKLKLSNKELANFHRPKALWYPHDNERAVRELQKLPTHGPMQIIL 675

Query: 669  KSLGGKGSKLFVDSEETVSSLMAKASKKLDMKSSEIVKVFYSGKELEREKSLAAQNVQPN 728
            KSLGGKGSK FVD+EE VSS+MAKASKKLDMK SEIVKVFYSGKELEREKSLAAQNVQPN
Sbjct: 676  KSLGGKGSKHFVDAEEAVSSIMAKASKKLDMKPSEIVKVFYSGKELEREKSLAAQNVQPN 735

Query: 729  SLLHLVRSKIFVMPWTQHLRGDSKSVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGN 788
            SLLHLVRSKI++MP  Q+L G+++SVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGN
Sbjct: 736  SLLHLVRSKIYIMPGAQNLHGENRSVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGN 795

Query: 789  IGMGARLCTYYQKSSPDDQTGALLRNGGDSLGHVIVLEPSDKSPYIGDIKGGSVQASLET 848
            IGMGARLCTYYQKSSPDDQTGA+LRNGGDSLGHVI+LEPSDKSPY+GD+KGGS+QASLET
Sbjct: 796  IGMGARLCTYYQKSSPDDQTGAMLRNGGDSLGHVIILEPSDKSPYLGDLKGGSIQASLET 855

Query: 849  NMYRSPIFPHKVPMTDYILVRSAKGKLSLRRIYRNFAVGQQEPLMEVFSPGTKSLQMFMM 908
            NMYRSPIF HKVPMTDYILVRSAKGKLSLRRI +NFAVGQQEPLMEVFSPGTKSLQ FMM
Sbjct: 856  NMYRSPIFSHKVPMTDYILVRSAKGKLSLRRIDKNFAVGQQEPLMEVFSPGTKSLQTFMM 915

Query: 909  NRLTLYIFREFLAAEKRRRLPYIRVDELPSQFPYLSETVIRKKLKEYALQQKSSNGQTIL 968
            NRLT YIFREFLAAEK RR+PYIRVDELPSQFPYLSETVIRKKLKEYALQQ++S+GQ IL
Sbjct: 916  NRLTSYIFREFLAAEKHRRIPYIRVDELPSQFPYLSETVIRKKLKEYALQQRNSSGQIIL 975

Query: 969  IKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLGITEVHPSAISSAMSRLPDEAITL 1028
            IKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLGITEVHPSAISSAMSRLPDEAITL
Sbjct: 976  IKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLGITEVHPSAISSAMSRLPDEAITL 1035

Query: 1029 AAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSYVRSVPKAP 1088
            AAASHIERELQITPW LSSNFVACTTQGKENIER+EITGVGDPSGRGLGFSYVRSVPK P
Sbjct: 1036 AAASHIERELQITPWTLSSNFVACTTQGKENIERMEITGVGDPSGRGLGFSYVRSVPKPP 1095

Query: 1089 ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI 1148
            ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI
Sbjct: 1096 ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI 1155

Query: 1149 RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLSTSDGAEN 1208
            RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLS SDGAEN
Sbjct: 1156 RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLSASDGAEN 1215

Query: 1209 ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRPSIAQTEEE 1268
            ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRR SI QTE+E
Sbjct: 1216 ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRASIIQTEDE 1275

Query: 1269 IEDEVAEATELCRLLMDDEAERRRKKKKNKVMGEAILVPGLQASFGHEIPQQTRHLVSIA 1328
            IEDEVAEA ELCRLLMDDEAERRRKKKKNKVMGE+ L  G QASFG E  +Q RHLVSIA
Sbjct: 1276 IEDEVAEAAELCRLLMDDEAERRRKKKKNKVMGESTLTSGFQASFGLENSEQIRHLVSIA 1335

Query: 1329 QPDIAYTSKENIRDQKEVESIINRKDKSGKFKPMKKNYSSEMSLLNKKLKISGDKVKNFK 1388
            QPD+ Y SKENIRD KEVE+IINRK+KS K KPMKK YSSEMSL+NKKLKISGDKVKNFK
Sbjct: 1336 QPDVTYISKENIRDHKEVENIINRKEKSAKLKPMKKTYSSEMSLINKKLKISGDKVKNFK 1395

Query: 1389 EKKSARESFVCGKCGQFGHMRTNKNCPKYGEDMETPETTDQEKVSIKLNTVDPSSQSHQK 1448
            EKKSARESFVCGKCGQFGHMRTNKNCPKYGE++ETP+TTDQEKVSIK NT+DPS+QS+QK
Sbjct: 1396 EKKSARESFVCGKCGQFGHMRTNKNCPKYGEELETPDTTDQEKVSIKSNTMDPSNQSNQK 1455

Query: 1449 AHTKKVTPKTITKISTTETFEGEKSTLMAKMLPVKFKCSSSEKLSDNLSPGVPQTSDLPF 1508
            A TKK TPKT  KISTTE FEGEKSTLMAK+LPVKFKCSS+++LSDNLSP +PQTSDLP 
Sbjct: 1456 AQTKKATPKTAAKISTTEAFEGEKSTLMAKVLPVKFKCSSTDRLSDNLSPVLPQTSDLPV 1515

Query: 1509 NSDNETGKSVVKVNKITFSKKRNEDIQFESHKPSIVIRPPDAKKVYAEAHKPSIVIRPPT 1568
            NSDNETGKSVVKVNKITFSKKR ED+QF+SHKPSIVIRPPDAKKV  +AHKPSIVIRPPT
Sbjct: 1516 NSDNETGKSVVKVNKITFSKKRTEDVQFQSHKPSIVIRPPDAKKVSIDAHKPSIVIRPPT 1575

Query: 1569 SIDRDRMEFPRRSATVVRSPAETKREKLNKKLIIKRPKEVIDLDQMGFDASAGMQYRKTK 1628
            +IDRD++EFPRR+A ++RS AET +E+L+KKL+IKRPKEVID+D+ G+D S GM+YRKTK
Sbjct: 1576 NIDRDKLEFPRRTAAIIRSAAETDKEQLHKKLVIKRPKEVIDVDRCGYDGSVGMEYRKTK 1635

Query: 1629 RIVELSSFENHTRPGSMSSAESGKKKVREDQIWWEKQERQKNEERLREEKARRVYKEEMR 1688
            +IVELSSFE HTRPGSMSSAESGKKK RE+  WWEKQE+++ EERLREE+ARRVY EEM 
Sbjct: 1636 KIVELSSFEKHTRPGSMSSAESGKKKAREEHRWWEKQEKRRKEERLREEEARRVYNEEMG 1695

Query: 1689 MRDEQEKLAEIRRFEASIRSDKEEEERIKAKKKKKKRIPEIMDDYMEDPRSRQR----IP 1748
            MR+EQEKLAEIRRFEASIRSDKEEEER+KAKKKKKKRIPEIM+DY+EDPRSR+     + 
Sbjct: 1696 MREEQEKLAEIRRFEASIRSDKEEEERLKAKKKKKKRIPEIMNDYVEDPRSRRMDKRVVL 1755

Query: 1749 ERDRVVKRKPIELGRHGAEHASSTKRRRGGEVGLSNILERIVEALKDNFEISYLFLKPVS 1808
            ERDR +KRKPIELGRHGAEHASSTKRRR GEVGLSNILERIVE LK+ ++ISYLFLKPVS
Sbjct: 1756 ERDRSLKRKPIELGRHGAEHASSTKRRRVGEVGLSNILERIVETLKERYDISYLFLKPVS 1815

Query: 1809 KKEAPDYLDIIERPMDLSTIREKVRKLEYKTRDEFRTDVWQIMYNAHMYNDGRNPGIPPL 1868
            KKEAPDYLDIIERPMDLSTIREKVR+LEYKTRDEFR DVWQIMYNAHMYNDGRNPGIPPL
Sbjct: 1816 KKEAPDYLDIIERPMDLSTIREKVRRLEYKTRDEFRHDVWQIMYNAHMYNDGRNPGIPPL 1875

Query: 1869 ADKLLMHCDNLLNENDDELTEAEIGIEYRDS 1895
            AD+LL  CDNLL++NDDELTEAE+GIE+RDS
Sbjct: 1876 ADQLLGLCDNLLDKNDDELTEAEMGIEFRDS 1904

BLAST of MC09g0858 vs. NCBI nr
Match: XP_023004897.1 (transcription initiation factor TFIID subunit 1 isoform X1 [Cucurbita maxima] >XP_023004898.1 transcription initiation factor TFIID subunit 1 isoform X1 [Cucurbita maxima] >XP_023004899.1 transcription initiation factor TFIID subunit 1 isoform X1 [Cucurbita maxima] >XP_023004900.1 transcription initiation factor TFIID subunit 1 isoform X1 [Cucurbita maxima])

HSP 1 Score: 3166 bits (8209), Expect = 0.0
Identity = 1645/1890 (87.04%), Positives = 1755/1890 (92.86%), Query Frame = 0

Query: 9    DDDDYEDAGGGNRFLGFMFGNVDNSGDLDADYLDEDAKEHLDALADKLGSTLTDIDLSTK 68
            DDDDYEDAGGGNR LGFMFGNVDNSGDLDADYLDEDAKEHL ALADKLGSTLTDIDLSTK
Sbjct: 16   DDDDYEDAGGGNRLLGFMFGNVDNSGDLDADYLDEDAKEHLAALADKLGSTLTDIDLSTK 75

Query: 69   SAKTPSDAVEPDYDAKAEDAVDYEDIDEEYDGPEIEAAGEEDHLLPKKEYFSTEVSLATL 128
            S K PSDAVEPDYDAKAEDA+DYEDIDEEYDGPEIEAAGEEDHLLPKKEYFS EVSL TL
Sbjct: 76   SPKIPSDAVEPDYDAKAEDAIDYEDIDEEYDGPEIEAAGEEDHLLPKKEYFSAEVSLPTL 135

Query: 129  EPTVSVFDDEDYDEDFEKVHDVINSSVEARTTHASDEKGECLEVAYEGEKSVADDDIQSA 188
            EPT SVFDDEDYDED EKVH+V N SV A T HASDE+GE LEV  EGEKS A+DD+ S 
Sbjct: 136  EPTASVFDDEDYDEDIEKVHEVANRSVVAPTIHASDEQGEFLEVVSEGEKSFAEDDLPSV 195

Query: 189  SLNNEVITSSAEELLEETPEVQKKLLDEKAHTPLPVLCMENGMAILQFSEIFGVHDSLKK 248
              NNEVITSS EEL EETPEV+K++ +EKAHTPLPVLCMENGM ILQFSEIFGVHDSLKK
Sbjct: 196  PFNNEVITSSPEELFEETPEVEKRMQEEKAHTPLPVLCMENGMTILQFSEIFGVHDSLKK 255

Query: 249  KEKRESRYCTRRDKYRSVDVSDIVEEDEEAFLHGFSQGVSCVKPASVVKDDTTMFNLDDP 308
            KEKR+SRYC+RRDKYRS DVSDIVEEDEEAFLHG S+G+SC+KPA VVK DTTM  LDDP
Sbjct: 256  KEKRDSRYCSRRDKYRSADVSDIVEEDEEAFLHGSSRGISCMKPAYVVKADTTM--LDDP 315

Query: 309  EFTKFGVVQGVDVMAARVDWRQKDNCCGAEPMKQVFAENISIGSNSLLFKKFYPLDQQNW 368
            E+TKFG V G DVMA+RV+WRQKD+ CGAEP K V AENI+IGSNSLLF+K YPLDQQNW
Sbjct: 316  EYTKFGAVHGGDVMASRVEWRQKDHSCGAEPTKDVVAENITIGSNSLLFQKLYPLDQQNW 375

Query: 369  EEGILWDNSPVLSKNSAGSCEVSGSDLEDSVSSDVEQQVSIQIVRSEHHIDPNDRRQRLS 428
            EE ILWDNSPV SKNSAGSCEV GSD+E SVSSDVE QVSIQIV S H IDP+D  Q L 
Sbjct: 376  EERILWDNSPVSSKNSAGSCEVFGSDMEASVSSDVEPQVSIQIVGSGHRIDPDDEDQSLY 435

Query: 429  QHDLPLLEPFGSRKFSGPEEPFSPEMIYHPQMLRLESWKDVDGSCQSEGTRENFSEEHQS 488
             H  PLLEPFGSRKFSG EEP SPE+IYHPQMLRLESWKDV+ S QS+  +EN  +E QS
Sbjct: 436  HHSFPLLEPFGSRKFSGTEEPLSPEIIYHPQMLRLESWKDVEDSFQSDCIKENNPDELQS 495

Query: 489  NAIRCFSKFSPKNRRMLEGSWLDKVLWESDEPIEKQKFIFDLEDEHMLFEISDEKESKYI 548
            NAIR FSKFSPKNRRMLEGSWLDKVLWES+EPIEK KFIFDLEDEHMLFEISDEKESKYI
Sbjct: 496  NAIRSFSKFSPKNRRMLEGSWLDKVLWESNEPIEKPKFIFDLEDEHMLFEISDEKESKYI 555

Query: 549  QFHAGAMILTRSSMSVNGNSFEISGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH 608
            QFH+GAMILTRSS+SVNGNSFEISGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH
Sbjct: 556  QFHSGAMILTRSSLSVNGNSFEISGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH 615

Query: 609  GVKVFHSKPAMMLQTMKLKLSNKELANFHRPKASWYPHDNEMAVRELQKLPTQGPMKIIL 668
            G+KVFHS+PAMMLQTMKLKLSNKELANFHRPKA WYPHDNE AVRELQKLPT GPM+IIL
Sbjct: 616  GIKVFHSRPAMMLQTMKLKLSNKELANFHRPKALWYPHDNERAVRELQKLPTHGPMQIIL 675

Query: 669  KSLGGKGSKLFVDSEETVSSLMAKASKKLDMKSSEIVKVFYSGKELEREKSLAAQNVQPN 728
            KSLGGKGSK FVD+EE VSS+MAKASKKLDMK SEIVKVFYSGKELEREKSLAAQNVQPN
Sbjct: 676  KSLGGKGSKHFVDAEEAVSSIMAKASKKLDMKPSEIVKVFYSGKELEREKSLAAQNVQPN 735

Query: 729  SLLHLVRSKIFVMPWTQHLRGDSKSVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGN 788
            SLLHLVRSKI++MP  Q+L G+++SVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGN
Sbjct: 736  SLLHLVRSKIYIMPGAQNLHGENRSVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGN 795

Query: 789  IGMGARLCTYYQKSSPDDQTGALLRNGGDSLGHVIVLEPSDKSPYIGDIKGGSVQASLET 848
            IGMGARLCTYYQKSSPDDQTGA+LRNGGDSLGHVI+LEPSDKSPY+GD+KGGS+QASLET
Sbjct: 796  IGMGARLCTYYQKSSPDDQTGAMLRNGGDSLGHVIILEPSDKSPYLGDLKGGSIQASLET 855

Query: 849  NMYRSPIFPHKVPMTDYILVRSAKGKLSLRRIYRNFAVGQQEPLMEVFSPGTKSLQMFMM 908
            NMYRSPIF HKVPMTDYILVRSAKGKLSLRRI +NFAVGQQEPLMEVFSPGTKSLQ FMM
Sbjct: 856  NMYRSPIFSHKVPMTDYILVRSAKGKLSLRRINKNFAVGQQEPLMEVFSPGTKSLQTFMM 915

Query: 909  NRLTLYIFREFLAAEKRRRLPYIRVDELPSQFPYLSETVIRKKLKEYALQQKSSNGQTIL 968
            NRLT YIFREFLAAEK RR+PYIRVDELPSQFPYLSETVIRKKLKEYALQQ++S+GQ IL
Sbjct: 916  NRLTSYIFREFLAAEKHRRIPYIRVDELPSQFPYLSETVIRKKLKEYALQQRNSSGQIIL 975

Query: 969  IKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLGITEVHPSAISSAMSRLPDEAITL 1028
            IKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLGITEVHPSAISSAMSRLPDEAITL
Sbjct: 976  IKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLGITEVHPSAISSAMSRLPDEAITL 1035

Query: 1029 AAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSYVRSVPKAP 1088
            AAASHIERELQITPW LSSNFVACTTQGKENIER+EITGVGDPSGRGLGFSYVRSVPK P
Sbjct: 1036 AAASHIERELQITPWTLSSNFVACTTQGKENIERMEITGVGDPSGRGLGFSYVRSVPKPP 1095

Query: 1089 ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI 1148
            ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI
Sbjct: 1096 ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI 1155

Query: 1149 RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLSTSDGAEN 1208
            RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLS SDGAEN
Sbjct: 1156 RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLSASDGAEN 1215

Query: 1209 ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRPSIAQTEEE 1268
            ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRR SI QTE+E
Sbjct: 1216 ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRASIIQTEDE 1275

Query: 1269 IEDEVAEATELCRLLMDDEAERRRKKKKNKVMGEAILVPGLQASFGHEIPQQTRHLVSIA 1328
            IEDEVAEA ELCRLLMDDEAERRRKKKKNKVMGE+ L  G QASFG E  +Q RHLVSIA
Sbjct: 1276 IEDEVAEAAELCRLLMDDEAERRRKKKKNKVMGESTLTSGFQASFGLENSEQIRHLVSIA 1335

Query: 1329 QPDIAYTSKENIRDQKEVESIINRKDKSGKFKPMKKNYSSEMSLLNKKLKISGDKVKNFK 1388
            QPD+ Y SKENIRDQKEVE+IINRK+KS K KPMKK YSSEMSL+NKKLKISGDKVKNFK
Sbjct: 1336 QPDVTYISKENIRDQKEVENIINRKEKSAKLKPMKKTYSSEMSLINKKLKISGDKVKNFK 1395

Query: 1389 EKKSARESFVCGKCGQFGHMRTNKNCPKYGEDMETPETTDQEKVSIKLNTVDPSSQSHQK 1448
            EKKSARESFVCGKCGQFGHMRTNKNCPKYGE++ETP+TTDQEKVSIK N +DPS+QS+QK
Sbjct: 1396 EKKSARESFVCGKCGQFGHMRTNKNCPKYGEELETPDTTDQEKVSIKSNNMDPSNQSNQK 1455

Query: 1449 AHTKKVTPKTITKISTTETFEGEKSTLMAKMLPVKFKCSSSEKLSDNLSPGVPQTSDLPF 1508
            A TKK TPKT  KISTTE FEGEKSTLMAK+LPVKFKCSS+++LSDNLSP +PQTSDLP 
Sbjct: 1456 AQTKKATPKTAAKISTTEAFEGEKSTLMAKVLPVKFKCSSTDRLSDNLSPVLPQTSDLPV 1515

Query: 1509 NSDNETGKSVVKVNKITFSKKRNEDIQFESHKPSIVIRPPDAKKVYAEAHKPSIVIRPPT 1568
            NSDNETGKSVVKVNKITFSKKR ED+QF+SHKPSIVIRPPDAKKV  +AHKPSIVIRPPT
Sbjct: 1516 NSDNETGKSVVKVNKITFSKKRTEDVQFQSHKPSIVIRPPDAKKVSIDAHKPSIVIRPPT 1575

Query: 1569 SIDRDRMEFPRRSATVVRSPAETKREKLNKKLIIKRPKEVIDLDQMGFDASAGMQYRKTK 1628
            +IDRD++EFPRR+A ++RS AET +E+L+KKL+IKRPKEV+DLD+ G+D S GM+YRKTK
Sbjct: 1576 NIDRDKIEFPRRTAAIIRSAAETDKEQLHKKLVIKRPKEVVDLDRCGYDGSVGMEYRKTK 1635

Query: 1629 RIVELSSFENHTRPGSMSSAESGKKKVREDQIWWEKQERQKNEERLREEKARRVYKEEMR 1688
            +IVELSSFE HTRPGSMSSA+SGKKK RE+  WWEKQE+++ EERLREE+ARRVY EEM 
Sbjct: 1636 KIVELSSFEKHTRPGSMSSADSGKKKAREEHRWWEKQEKRRKEERLREEEARRVYNEEMG 1695

Query: 1689 MRDEQEKLAEIRRFEASIRSDKEEEERIKAKKKKKKRIPEIMDDYMEDPRSRQ---RIPE 1748
            MR+EQEKLAEIRRFEASIRSDKEEEER+KAKKKKKKRIPEIM+DY+EDPRSR+   R+ E
Sbjct: 1696 MREEQEKLAEIRRFEASIRSDKEEEERLKAKKKKKKRIPEIMNDYVEDPRSRRMDKRVLE 1755

Query: 1749 RDRVVKRKPIELGRHGAEHASSTKRRRGGEVGLSNILERIVEALKDNFEISYLFLKPVSK 1808
            RDR +KRKPIELGRHGAEHASSTKRRR GEVGLSNILERIVE LK+ ++ISYLFLKPVSK
Sbjct: 1756 RDRSLKRKPIELGRHGAEHASSTKRRRVGEVGLSNILERIVETLKERYDISYLFLKPVSK 1815

Query: 1809 KEAPDYLDIIERPMDLSTIREKVRKLEYKTRDEFRTDVWQIMYNAHMYNDGRNPGIPPLA 1868
            KEAPDYLDIIERPMDLSTIREKVR+LEYKTRDEFR DVWQIMYNAHMYNDGRNPGIPPLA
Sbjct: 1816 KEAPDYLDIIERPMDLSTIREKVRRLEYKTRDEFRHDVWQIMYNAHMYNDGRNPGIPPLA 1875

Query: 1869 DKLLMHCDNLLNENDDELTEAEIGIEYRDS 1895
            D+LL  CDNLL++NDDELTEAE+GIE+RDS
Sbjct: 1876 DQLLGLCDNLLDKNDDELTEAEMGIEFRDS 1903

BLAST of MC09g0858 vs. ExPASy TrEMBL
Match: A0A6J1DM24 (transcription initiation factor TFIID subunit 1 OS=Momordica charantia OX=3673 GN=LOC111022226 PE=3 SV=1)

HSP 1 Score: 3625 bits (9399), Expect = 0.0
Identity = 1885/1887 (99.89%), Positives = 1886/1887 (99.95%), Query Frame = 0

Query: 9    DDDDYEDAGGGNRFLGFMFGNVDNSGDLDADYLDEDAKEHLDALADKLGSTLTDIDLSTK 68
            DDDDYEDAGGGNRFLGFMFGNVDNSGDLDADYLDEDAKEHLDALADKLGSTLTDIDLSTK
Sbjct: 2    DDDDYEDAGGGNRFLGFMFGNVDNSGDLDADYLDEDAKEHLDALADKLGSTLTDIDLSTK 61

Query: 69   SAKTPSDAVEPDYDAKAEDAVDYEDIDEEYDGPEIEAAGEEDHLLPKKEYFSTEVSLATL 128
            SAKTPSDAVEPDYDAKAEDAVDYEDIDEEYDGPEIEAAGEEDHLLPKKEYFSTEVSLATL
Sbjct: 62   SAKTPSDAVEPDYDAKAEDAVDYEDIDEEYDGPEIEAAGEEDHLLPKKEYFSTEVSLATL 121

Query: 129  EPTVSVFDDEDYDEDFEKVHDVINSSVEARTTHASDEKGECLEVAYEGEKSVADDDIQSA 188
            EPTVSVFDDEDYDEDFEKVHDVINSSVEARTTHASDEKGECLEVAYEGEKSVADDDIQSA
Sbjct: 122  EPTVSVFDDEDYDEDFEKVHDVINSSVEARTTHASDEKGECLEVAYEGEKSVADDDIQSA 181

Query: 189  SLNNEVITSSAEELLEETPEVQKKLLDEKAHTPLPVLCMENGMAILQFSEIFGVHDSLKK 248
            SLNNEVITSSAEELLEETPEVQKKLLDEKAHTPLPVLCMENGMAILQFSEIFGVHDSLKK
Sbjct: 182  SLNNEVITSSAEELLEETPEVQKKLLDEKAHTPLPVLCMENGMAILQFSEIFGVHDSLKK 241

Query: 249  KEKRESRYCTRRDKYRSVDVSDIVEEDEEAFLHGFSQGVSCVKPASVVKDDTTMFNLDDP 308
            KEKRESRYCTRRDKYRSVDVSDIVEEDEEAFLHGFSQGVSCVKPASVVKDDTTMFNLDDP
Sbjct: 242  KEKRESRYCTRRDKYRSVDVSDIVEEDEEAFLHGFSQGVSCVKPASVVKDDTTMFNLDDP 301

Query: 309  EFTKFGVVQGVDVMAARVDWRQKDNCCGAEPMKQVFAENISIGSNSLLFKKFYPLDQQNW 368
            EFTKFGVVQGVDVMAARVDWRQKDNCCGAEPMKQVFAENISIGSNSLLFKKFYPLDQQNW
Sbjct: 302  EFTKFGVVQGVDVMAARVDWRQKDNCCGAEPMKQVFAENISIGSNSLLFKKFYPLDQQNW 361

Query: 369  EEGILWDNSPVLSKNSAGSCEVSGSDLEDSVSSDVEQQVSIQIVRSEHHIDPNDRRQRLS 428
            EEGILWDNSPVLSKNSAGSCEVSGSDLEDSVSSDVEQQVSIQIVRSEHHIDPNDRRQRLS
Sbjct: 362  EEGILWDNSPVLSKNSAGSCEVSGSDLEDSVSSDVEQQVSIQIVRSEHHIDPNDRRQRLS 421

Query: 429  QHDLPLLEPFGSRKFSGPEEPFSPEMIYHPQMLRLESWKDVDGSCQSEGTRENFSEEHQS 488
            QHDLPLLEPFGSRKFSGPEEPFSPEMIYHPQMLRLESWKDVDGSCQSEGTRENFSEEHQS
Sbjct: 422  QHDLPLLEPFGSRKFSGPEEPFSPEMIYHPQMLRLESWKDVDGSCQSEGTRENFSEEHQS 481

Query: 489  NAIRCFSKFSPKNRRMLEGSWLDKVLWESDEPIEKQKFIFDLEDEHMLFEISDEKESKYI 548
            NAIRCFSKFSPKNRRMLEGSWLDKVLWESDEPIEKQKFIFDLEDEHMLFEISDEKESKYI
Sbjct: 482  NAIRCFSKFSPKNRRMLEGSWLDKVLWESDEPIEKQKFIFDLEDEHMLFEISDEKESKYI 541

Query: 549  QFHAGAMILTRSSMSVNGNSFEISGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH 608
            QFHAGAMILTRSSMSVNGNSFEISGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH
Sbjct: 542  QFHAGAMILTRSSMSVNGNSFEISGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH 601

Query: 609  GVKVFHSKPAMMLQTMKLKLSNKELANFHRPKASWYPHDNEMAVRELQKLPTQGPMKIIL 668
            GVKVFHSKPAMMLQTMKLKLSNKELANFHRPKASWYPHDNEMAVRELQKLPTQGPMKIIL
Sbjct: 602  GVKVFHSKPAMMLQTMKLKLSNKELANFHRPKASWYPHDNEMAVRELQKLPTQGPMKIIL 661

Query: 669  KSLGGKGSKLFVDSEETVSSLMAKASKKLDMKSSEIVKVFYSGKELEREKSLAAQNVQPN 728
            KSLGGKGSKLFVDSEETVS+LMAKASKKLDMKSSEIVKVFYSGKELEREKSLAAQNVQPN
Sbjct: 662  KSLGGKGSKLFVDSEETVSALMAKASKKLDMKSSEIVKVFYSGKELEREKSLAAQNVQPN 721

Query: 729  SLLHLVRSKIFVMPWTQHLRGDSKSVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGN 788
            SLLHLVRSKIFVMPWTQHLRGDSKSVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGN
Sbjct: 722  SLLHLVRSKIFVMPWTQHLRGDSKSVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGN 781

Query: 789  IGMGARLCTYYQKSSPDDQTGALLRNGGDSLGHVIVLEPSDKSPYIGDIKGGSVQASLET 848
            IGMGARLCTYYQKSSPDDQTGALLRNGGDSLGHVIVLEPSDKSPYIGDIKGGSVQASLET
Sbjct: 782  IGMGARLCTYYQKSSPDDQTGALLRNGGDSLGHVIVLEPSDKSPYIGDIKGGSVQASLET 841

Query: 849  NMYRSPIFPHKVPMTDYILVRSAKGKLSLRRIYRNFAVGQQEPLMEVFSPGTKSLQMFMM 908
            NMYRSPIFPHKVPMTDYILVRSAKGKLSLRRI RNFAVGQQEPLMEVFSPGTKSLQMFMM
Sbjct: 842  NMYRSPIFPHKVPMTDYILVRSAKGKLSLRRIDRNFAVGQQEPLMEVFSPGTKSLQMFMM 901

Query: 909  NRLTLYIFREFLAAEKRRRLPYIRVDELPSQFPYLSETVIRKKLKEYALQQKSSNGQTIL 968
            NRLTLYIFREFLAAEKRRRLPYIRVDELPSQFPYLSETVIRKKLKEYALQQKSSNGQTIL
Sbjct: 902  NRLTLYIFREFLAAEKRRRLPYIRVDELPSQFPYLSETVIRKKLKEYALQQKSSNGQTIL 961

Query: 969  IKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLGITEVHPSAISSAMSRLPDEAITL 1028
            IKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLGITEVHPSAISSAMSRLPDEAITL
Sbjct: 962  IKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLGITEVHPSAISSAMSRLPDEAITL 1021

Query: 1029 AAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSYVRSVPKAP 1088
            AAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSYVRSVPKAP
Sbjct: 1022 AAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSYVRSVPKAP 1081

Query: 1089 ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI 1148
            ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI
Sbjct: 1082 ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI 1141

Query: 1149 RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLSTSDGAEN 1208
            RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLSTSDGAEN
Sbjct: 1142 RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLSTSDGAEN 1201

Query: 1209 ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRPSIAQTEEE 1268
            ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRPSIAQTEEE
Sbjct: 1202 ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRPSIAQTEEE 1261

Query: 1269 IEDEVAEATELCRLLMDDEAERRRKKKKNKVMGEAILVPGLQASFGHEIPQQTRHLVSIA 1328
            IEDEVAEATELCRLLMDDEAERRRKKKKNKVMGEAILVPGLQASFGHEIPQQTRHLVSIA
Sbjct: 1262 IEDEVAEATELCRLLMDDEAERRRKKKKNKVMGEAILVPGLQASFGHEIPQQTRHLVSIA 1321

Query: 1329 QPDIAYTSKENIRDQKEVESIINRKDKSGKFKPMKKNYSSEMSLLNKKLKISGDKVKNFK 1388
            QPDIAYTSKENIRDQKEVESIINRKDKSGKFKPMKKNYSSEMSLLNKKLKISGDKVKNFK
Sbjct: 1322 QPDIAYTSKENIRDQKEVESIINRKDKSGKFKPMKKNYSSEMSLLNKKLKISGDKVKNFK 1381

Query: 1389 EKKSARESFVCGKCGQFGHMRTNKNCPKYGEDMETPETTDQEKVSIKLNTVDPSSQSHQK 1448
            EKKSARESFVCGKCGQFGHMRTNKNCPKYGEDMETPETTDQEKVSIKLNTVDPSSQSHQK
Sbjct: 1382 EKKSARESFVCGKCGQFGHMRTNKNCPKYGEDMETPETTDQEKVSIKLNTVDPSSQSHQK 1441

Query: 1449 AHTKKVTPKTITKISTTETFEGEKSTLMAKMLPVKFKCSSSEKLSDNLSPGVPQTSDLPF 1508
            AHTKKVTPKTITKISTTETFEGEKSTLMAKMLPVKFKCSSSEKLSDNLSPGVPQTSDLPF
Sbjct: 1442 AHTKKVTPKTITKISTTETFEGEKSTLMAKMLPVKFKCSSSEKLSDNLSPGVPQTSDLPF 1501

Query: 1509 NSDNETGKSVVKVNKITFSKKRNEDIQFESHKPSIVIRPPDAKKVYAEAHKPSIVIRPPT 1568
            NSDNETGKSVVKVNKITFSKKRNEDIQFESHKPSIVIRPPDAKKVYAEAHKPSIVIRPPT
Sbjct: 1502 NSDNETGKSVVKVNKITFSKKRNEDIQFESHKPSIVIRPPDAKKVYAEAHKPSIVIRPPT 1561

Query: 1569 SIDRDRMEFPRRSATVVRSPAETKREKLNKKLIIKRPKEVIDLDQMGFDASAGMQYRKTK 1628
            SIDRDRMEFPRRSATVVRSPAETKREKLNKKLIIKRPKEVIDLDQMGFDASAGMQYRKTK
Sbjct: 1562 SIDRDRMEFPRRSATVVRSPAETKREKLNKKLIIKRPKEVIDLDQMGFDASAGMQYRKTK 1621

Query: 1629 RIVELSSFENHTRPGSMSSAESGKKKVREDQIWWEKQERQKNEERLREEKARRVYKEEMR 1688
            RIVELSSFENHTRPGSMSSAESGKKKVREDQIWWEKQERQKNEERLREEKARRVYKEEMR
Sbjct: 1622 RIVELSSFENHTRPGSMSSAESGKKKVREDQIWWEKQERQKNEERLREEKARRVYKEEMR 1681

Query: 1689 MRDEQEKLAEIRRFEASIRSDKEEEERIKAKKKKKKRIPEIMDDYMEDPRSRQRIPERDR 1748
            MRDEQEKLAEIRRFEASIRSDKEEEERIKAKKKKKKRIPEIMDDYMEDPRSRQRIPERDR
Sbjct: 1682 MRDEQEKLAEIRRFEASIRSDKEEEERIKAKKKKKKRIPEIMDDYMEDPRSRQRIPERDR 1741

Query: 1749 VVKRKPIELGRHGAEHASSTKRRRGGEVGLSNILERIVEALKDNFEISYLFLKPVSKKEA 1808
            VVKRKPIELGRHGAEHASSTKRRRGGEVGLSNILERIVEALKDNFEISYLFLKPVSKKEA
Sbjct: 1742 VVKRKPIELGRHGAEHASSTKRRRGGEVGLSNILERIVEALKDNFEISYLFLKPVSKKEA 1801

Query: 1809 PDYLDIIERPMDLSTIREKVRKLEYKTRDEFRTDVWQIMYNAHMYNDGRNPGIPPLADKL 1868
            PDYLDIIERPMDLSTIREKVRKLEYKTRDEFRTDVWQIMYNAHMYNDGRNPGIPPLADKL
Sbjct: 1802 PDYLDIIERPMDLSTIREKVRKLEYKTRDEFRTDVWQIMYNAHMYNDGRNPGIPPLADKL 1861

Query: 1869 LMHCDNLLNENDDELTEAEIGIEYRDS 1895
            LMHCDNLLNENDDELTEAEIGIEYRDS
Sbjct: 1862 LMHCDNLLNENDDELTEAEIGIEYRDS 1888

BLAST of MC09g0858 vs. ExPASy TrEMBL
Match: A0A6J1H5J5 (transcription initiation factor TFIID subunit 1 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111460758 PE=3 SV=1)

HSP 1 Score: 3170 bits (8218), Expect = 0.0
Identity = 1645/1891 (86.99%), Positives = 1757/1891 (92.91%), Query Frame = 0

Query: 9    DDDDYEDAGGGNRFLGFMFGNVDNSGDLDADYLDEDAKEHLDALADKLGSTLTDIDLSTK 68
            DDDDYEDAGGGNR LGFMFGNVDNSGDLDADYLDEDAKEHL ALADKLG TLTDIDLSTK
Sbjct: 16   DDDDYEDAGGGNRLLGFMFGNVDNSGDLDADYLDEDAKEHLAALADKLGPTLTDIDLSTK 75

Query: 69   SAKTPSDAVEPDYDAKAEDAVDYEDIDEEYDGPEIEAAGEEDHLLPKKEYFSTEVSLATL 128
            S K PSDA+EPDYDAKAEDA+DYEDIDEEYDGPEIEAAGEEDHLLPKKEYFS EVSL TL
Sbjct: 76   SPKIPSDAIEPDYDAKAEDAIDYEDIDEEYDGPEIEAAGEEDHLLPKKEYFSAEVSLPTL 135

Query: 129  EPTVSVFDDEDYDEDFEKVHDVINSSVEARTTHASDEKGECLEVAYEGEKSVADDDIQSA 188
            EPT SVFDDEDYDED EKVH+V N SV A T HASDE+GE LEV  EGEKS A+DD+ S 
Sbjct: 136  EPTASVFDDEDYDEDIEKVHEVANRSVVAPTIHASDEQGEYLEVVSEGEKSFAEDDLPSV 195

Query: 189  SLNNEVITSSAEELLEETPEVQKKLLDEKAHTPLPVLCMENGMAILQFSEIFGVHDSLKK 248
              NNEVITSS EEL EETPEV+K++ +EKAHTPLPVLCMENGMAILQFSEIFGVHDSLKK
Sbjct: 196  PFNNEVITSSPEELFEETPEVEKRMQEEKAHTPLPVLCMENGMAILQFSEIFGVHDSLKK 255

Query: 249  KEKRESRYCTRRDKYRSVDVSDIVEEDEEAFLHGFSQGVSCVKPASVVKDDTTMFNLDDP 308
            KEKR+SRYC+RRDKYRS DVSDIVEEDEEAFLHG S+GVSC+KPA VVKDDTTM  LDDP
Sbjct: 256  KEKRDSRYCSRRDKYRSADVSDIVEEDEEAFLHGSSRGVSCMKPAYVVKDDTTM--LDDP 315

Query: 309  EFTKFGVVQGVDVMAARVDWRQKDNCCGAEPMKQVFAENISIGSNSLLFKKFYPLDQQNW 368
            E+TKFG V G D MA+RV+WRQKD+CCGAEP K+V AENI+IGSNSLLF+K YPLDQQNW
Sbjct: 316  EYTKFGAVDGGDEMASRVEWRQKDHCCGAEPAKEVVAENITIGSNSLLFQKLYPLDQQNW 375

Query: 369  EEGILWDNSPVLSKNSAGSCEVSGSDLEDSVSSDVEQQVSIQIVRSEHHIDPNDRRQRLS 428
            EE ILWDNSPV SKNSAGSCEV GSD+E SVSSDVE QVSIQIV S+H IDP+D  Q L 
Sbjct: 376  EERILWDNSPVSSKNSAGSCEVFGSDMEASVSSDVEPQVSIQIVGSDHRIDPDDEDQSLY 435

Query: 429  QHDLPLLEPFGSRKFSGPEEPFSPEMIYHPQMLRLESWKDVDGSCQSEGTRENFSEEHQS 488
             H  PLLE FGSRKFSG EEP SPE+IYHPQMLRLESWKDV+ SCQS+  +EN  +E QS
Sbjct: 436  HHSFPLLEAFGSRKFSGTEEPLSPEIIYHPQMLRLESWKDVEDSCQSDCIKENNPDELQS 495

Query: 489  NAIRCFSKFSPKNRRMLEGSWLDKVLWESDEPIEKQKFIFDLEDEHMLFEISDEKESKYI 548
            NAIR FSKFSPKNRRMLEGSWLDKVLWES+EPIEK KFIFDLEDEHMLFEISDEKESKYI
Sbjct: 496  NAIRSFSKFSPKNRRMLEGSWLDKVLWESNEPIEKPKFIFDLEDEHMLFEISDEKESKYI 555

Query: 549  QFHAGAMILTRSSMSVNGNSFEISGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH 608
            QFH+GAMILTRSS+SVNG+SFEISGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH
Sbjct: 556  QFHSGAMILTRSSLSVNGHSFEISGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH 615

Query: 609  GVKVFHSKPAMMLQTMKLKLSNKELANFHRPKASWYPHDNEMAVRELQKLPTQGPMKIIL 668
            G+KVFHS+PAMMLQTMKLKLSNKELANFHRPKA WYPHDNE AVRELQKLPT GPM+IIL
Sbjct: 616  GIKVFHSRPAMMLQTMKLKLSNKELANFHRPKALWYPHDNERAVRELQKLPTHGPMQIIL 675

Query: 669  KSLGGKGSKLFVDSEETVSSLMAKASKKLDMKSSEIVKVFYSGKELEREKSLAAQNVQPN 728
            KSLGGKGSK FVD+EE VSS+MAKASKKLDMK SEIVKVFYSGKELEREKSLAAQNVQPN
Sbjct: 676  KSLGGKGSKHFVDAEEAVSSIMAKASKKLDMKPSEIVKVFYSGKELEREKSLAAQNVQPN 735

Query: 729  SLLHLVRSKIFVMPWTQHLRGDSKSVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGN 788
            SLLHLVRSKI++MP  Q+L G+++SVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGN
Sbjct: 736  SLLHLVRSKIYIMPGAQNLHGENRSVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGN 795

Query: 789  IGMGARLCTYYQKSSPDDQTGALLRNGGDSLGHVIVLEPSDKSPYIGDIKGGSVQASLET 848
            IGMGARLCTYYQKSSPDDQTGA+LRNGGDSLGHVI+LEPSDKSPY+GD+KGGS+QASLET
Sbjct: 796  IGMGARLCTYYQKSSPDDQTGAMLRNGGDSLGHVIILEPSDKSPYLGDLKGGSIQASLET 855

Query: 849  NMYRSPIFPHKVPMTDYILVRSAKGKLSLRRIYRNFAVGQQEPLMEVFSPGTKSLQMFMM 908
            NMYRSPIF HKVPMTDYILVRSAKGKLSLRRI +NFAVGQQEPLMEVFSPGTKSLQ FMM
Sbjct: 856  NMYRSPIFSHKVPMTDYILVRSAKGKLSLRRIDKNFAVGQQEPLMEVFSPGTKSLQTFMM 915

Query: 909  NRLTLYIFREFLAAEKRRRLPYIRVDELPSQFPYLSETVIRKKLKEYALQQKSSNGQTIL 968
            NRLT YIFREFLAAEK RR+PYIRVDELPSQFPYLSETVIRKKLKEYALQQ++S+GQ IL
Sbjct: 916  NRLTSYIFREFLAAEKHRRIPYIRVDELPSQFPYLSETVIRKKLKEYALQQRNSSGQIIL 975

Query: 969  IKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLGITEVHPSAISSAMSRLPDEAITL 1028
            IKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLGITEVHPSAISSAMSRLPDEAITL
Sbjct: 976  IKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLGITEVHPSAISSAMSRLPDEAITL 1035

Query: 1029 AAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSYVRSVPKAP 1088
            AAASHIERELQITPW LSSNFVACTTQGKENIER+EITGVGDPSGRGLGFSYVRSVPK P
Sbjct: 1036 AAASHIERELQITPWTLSSNFVACTTQGKENIERMEITGVGDPSGRGLGFSYVRSVPKPP 1095

Query: 1089 ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI 1148
            ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI
Sbjct: 1096 ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI 1155

Query: 1149 RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLSTSDGAEN 1208
            RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLS SDGAEN
Sbjct: 1156 RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLSASDGAEN 1215

Query: 1209 ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRPSIAQTEEE 1268
            ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRR SI QTE+E
Sbjct: 1216 ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRASIIQTEDE 1275

Query: 1269 IEDEVAEATELCRLLMDDEAERRRKKKKNKVMGEAILVPGLQASFGHEIPQQTRHLVSIA 1328
            IEDEVAEA ELCRLLMDDEAERRRKKKKNKVMGE+ L  G QASFG E  +Q RHLVSIA
Sbjct: 1276 IEDEVAEAAELCRLLMDDEAERRRKKKKNKVMGESTLTSGFQASFGLENSEQIRHLVSIA 1335

Query: 1329 QPDIAYTSKENIRDQKEVESIINRKDKSGKFKPMKKNYSSEMSLLNKKLKISGDKVKNFK 1388
            QPD+ Y SKENIRD KEVE+IINRK+KS K KPMKK YSSEMSL+NKKLKISGDKVKNFK
Sbjct: 1336 QPDVTYISKENIRDHKEVENIINRKEKSAKLKPMKKTYSSEMSLINKKLKISGDKVKNFK 1395

Query: 1389 EKKSARESFVCGKCGQFGHMRTNKNCPKYGEDMETPETTDQEKVSIKLNTVDPSSQSHQK 1448
            EKKSARESFVCGKCGQFGHMRTNKNCPKYGE++ETP+TTDQEKVSIK NT+DPS+QS+QK
Sbjct: 1396 EKKSARESFVCGKCGQFGHMRTNKNCPKYGEELETPDTTDQEKVSIKSNTMDPSNQSNQK 1455

Query: 1449 AHTKKVTPKTITKISTTETFEGEKSTLMAKMLPVKFKCSSSEKLSDNLSPGVPQTSDLPF 1508
            A TKK TPKT  KISTTE FEGEKSTLMAK+LPVKFKCSS+++LSDNLSP +PQTSDLP 
Sbjct: 1456 AQTKKATPKTAAKISTTEAFEGEKSTLMAKVLPVKFKCSSTDRLSDNLSPVLPQTSDLPV 1515

Query: 1509 NSDNETGKSVVKVNKITFSKKRNEDIQFESHKPSIVIRPPDAKKVYAEAHKPSIVIRPPT 1568
            NSDNETGKSVVKVNKITFSKKR ED+QF+SHKPSIVIRPPDAKKV  +AHKPSIVIRPPT
Sbjct: 1516 NSDNETGKSVVKVNKITFSKKRTEDVQFQSHKPSIVIRPPDAKKVSIDAHKPSIVIRPPT 1575

Query: 1569 SIDRDRMEFPRRSATVVRSPAETKREKLNKKLIIKRPKEVIDLDQMGFDASAGMQYRKTK 1628
            +IDRD++EFPRR+A ++RS AET +E+L+KKL+IKRPKEVID+D+ G+D S GM+YRKTK
Sbjct: 1576 NIDRDKLEFPRRTAAIIRSAAETDKEQLHKKLVIKRPKEVIDVDRCGYDGSVGMEYRKTK 1635

Query: 1629 RIVELSSFENHTRPGSMSSAESGKKKVREDQIWWEKQERQKNEERLREEKARRVYKEEMR 1688
            +IVELSSFE HTRPGSMSSAESGKKK RE+  WWEKQE+++ EERLREE+ARRVY EEM 
Sbjct: 1636 KIVELSSFEKHTRPGSMSSAESGKKKAREEHRWWEKQEKRRKEERLREEEARRVYNEEMG 1695

Query: 1689 MRDEQEKLAEIRRFEASIRSDKEEEERIKAKKKKKKRIPEIMDDYMEDPRSRQR----IP 1748
            MR+EQEKLAEIRRFEASIRSDKEEEER+KAKKKKKKRIPEIM+DY+EDPRSR+     + 
Sbjct: 1696 MREEQEKLAEIRRFEASIRSDKEEEERLKAKKKKKKRIPEIMNDYVEDPRSRRMDKRVVL 1755

Query: 1749 ERDRVVKRKPIELGRHGAEHASSTKRRRGGEVGLSNILERIVEALKDNFEISYLFLKPVS 1808
            ERDR +KRKPIELGRHGAEHASSTKRRR GEVGLSNILERIVE LK+ ++ISYLFLKPVS
Sbjct: 1756 ERDRSLKRKPIELGRHGAEHASSTKRRRVGEVGLSNILERIVETLKERYDISYLFLKPVS 1815

Query: 1809 KKEAPDYLDIIERPMDLSTIREKVRKLEYKTRDEFRTDVWQIMYNAHMYNDGRNPGIPPL 1868
            KKEAPDYLDIIERPMDLSTIREKVR+LEYKTRDEFR DVWQIMYNAHMYNDGRNPGIPPL
Sbjct: 1816 KKEAPDYLDIIERPMDLSTIREKVRRLEYKTRDEFRHDVWQIMYNAHMYNDGRNPGIPPL 1875

Query: 1869 ADKLLMHCDNLLNENDDELTEAEIGIEYRDS 1895
            AD+LL  CDNLL++NDDELTEAE+GIE+RDS
Sbjct: 1876 ADQLLGLCDNLLDKNDDELTEAEMGIEFRDS 1904

BLAST of MC09g0858 vs. ExPASy TrEMBL
Match: A0A6J1L0S9 (transcription initiation factor TFIID subunit 1 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111498075 PE=3 SV=1)

HSP 1 Score: 3166 bits (8209), Expect = 0.0
Identity = 1645/1890 (87.04%), Positives = 1755/1890 (92.86%), Query Frame = 0

Query: 9    DDDDYEDAGGGNRFLGFMFGNVDNSGDLDADYLDEDAKEHLDALADKLGSTLTDIDLSTK 68
            DDDDYEDAGGGNR LGFMFGNVDNSGDLDADYLDEDAKEHL ALADKLGSTLTDIDLSTK
Sbjct: 16   DDDDYEDAGGGNRLLGFMFGNVDNSGDLDADYLDEDAKEHLAALADKLGSTLTDIDLSTK 75

Query: 69   SAKTPSDAVEPDYDAKAEDAVDYEDIDEEYDGPEIEAAGEEDHLLPKKEYFSTEVSLATL 128
            S K PSDAVEPDYDAKAEDA+DYEDIDEEYDGPEIEAAGEEDHLLPKKEYFS EVSL TL
Sbjct: 76   SPKIPSDAVEPDYDAKAEDAIDYEDIDEEYDGPEIEAAGEEDHLLPKKEYFSAEVSLPTL 135

Query: 129  EPTVSVFDDEDYDEDFEKVHDVINSSVEARTTHASDEKGECLEVAYEGEKSVADDDIQSA 188
            EPT SVFDDEDYDED EKVH+V N SV A T HASDE+GE LEV  EGEKS A+DD+ S 
Sbjct: 136  EPTASVFDDEDYDEDIEKVHEVANRSVVAPTIHASDEQGEFLEVVSEGEKSFAEDDLPSV 195

Query: 189  SLNNEVITSSAEELLEETPEVQKKLLDEKAHTPLPVLCMENGMAILQFSEIFGVHDSLKK 248
              NNEVITSS EEL EETPEV+K++ +EKAHTPLPVLCMENGM ILQFSEIFGVHDSLKK
Sbjct: 196  PFNNEVITSSPEELFEETPEVEKRMQEEKAHTPLPVLCMENGMTILQFSEIFGVHDSLKK 255

Query: 249  KEKRESRYCTRRDKYRSVDVSDIVEEDEEAFLHGFSQGVSCVKPASVVKDDTTMFNLDDP 308
            KEKR+SRYC+RRDKYRS DVSDIVEEDEEAFLHG S+G+SC+KPA VVK DTTM  LDDP
Sbjct: 256  KEKRDSRYCSRRDKYRSADVSDIVEEDEEAFLHGSSRGISCMKPAYVVKADTTM--LDDP 315

Query: 309  EFTKFGVVQGVDVMAARVDWRQKDNCCGAEPMKQVFAENISIGSNSLLFKKFYPLDQQNW 368
            E+TKFG V G DVMA+RV+WRQKD+ CGAEP K V AENI+IGSNSLLF+K YPLDQQNW
Sbjct: 316  EYTKFGAVHGGDVMASRVEWRQKDHSCGAEPTKDVVAENITIGSNSLLFQKLYPLDQQNW 375

Query: 369  EEGILWDNSPVLSKNSAGSCEVSGSDLEDSVSSDVEQQVSIQIVRSEHHIDPNDRRQRLS 428
            EE ILWDNSPV SKNSAGSCEV GSD+E SVSSDVE QVSIQIV S H IDP+D  Q L 
Sbjct: 376  EERILWDNSPVSSKNSAGSCEVFGSDMEASVSSDVEPQVSIQIVGSGHRIDPDDEDQSLY 435

Query: 429  QHDLPLLEPFGSRKFSGPEEPFSPEMIYHPQMLRLESWKDVDGSCQSEGTRENFSEEHQS 488
             H  PLLEPFGSRKFSG EEP SPE+IYHPQMLRLESWKDV+ S QS+  +EN  +E QS
Sbjct: 436  HHSFPLLEPFGSRKFSGTEEPLSPEIIYHPQMLRLESWKDVEDSFQSDCIKENNPDELQS 495

Query: 489  NAIRCFSKFSPKNRRMLEGSWLDKVLWESDEPIEKQKFIFDLEDEHMLFEISDEKESKYI 548
            NAIR FSKFSPKNRRMLEGSWLDKVLWES+EPIEK KFIFDLEDEHMLFEISDEKESKYI
Sbjct: 496  NAIRSFSKFSPKNRRMLEGSWLDKVLWESNEPIEKPKFIFDLEDEHMLFEISDEKESKYI 555

Query: 549  QFHAGAMILTRSSMSVNGNSFEISGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH 608
            QFH+GAMILTRSS+SVNGNSFEISGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH
Sbjct: 556  QFHSGAMILTRSSLSVNGNSFEISGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH 615

Query: 609  GVKVFHSKPAMMLQTMKLKLSNKELANFHRPKASWYPHDNEMAVRELQKLPTQGPMKIIL 668
            G+KVFHS+PAMMLQTMKLKLSNKELANFHRPKA WYPHDNE AVRELQKLPT GPM+IIL
Sbjct: 616  GIKVFHSRPAMMLQTMKLKLSNKELANFHRPKALWYPHDNERAVRELQKLPTHGPMQIIL 675

Query: 669  KSLGGKGSKLFVDSEETVSSLMAKASKKLDMKSSEIVKVFYSGKELEREKSLAAQNVQPN 728
            KSLGGKGSK FVD+EE VSS+MAKASKKLDMK SEIVKVFYSGKELEREKSLAAQNVQPN
Sbjct: 676  KSLGGKGSKHFVDAEEAVSSIMAKASKKLDMKPSEIVKVFYSGKELEREKSLAAQNVQPN 735

Query: 729  SLLHLVRSKIFVMPWTQHLRGDSKSVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGN 788
            SLLHLVRSKI++MP  Q+L G+++SVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGN
Sbjct: 736  SLLHLVRSKIYIMPGAQNLHGENRSVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGN 795

Query: 789  IGMGARLCTYYQKSSPDDQTGALLRNGGDSLGHVIVLEPSDKSPYIGDIKGGSVQASLET 848
            IGMGARLCTYYQKSSPDDQTGA+LRNGGDSLGHVI+LEPSDKSPY+GD+KGGS+QASLET
Sbjct: 796  IGMGARLCTYYQKSSPDDQTGAMLRNGGDSLGHVIILEPSDKSPYLGDLKGGSIQASLET 855

Query: 849  NMYRSPIFPHKVPMTDYILVRSAKGKLSLRRIYRNFAVGQQEPLMEVFSPGTKSLQMFMM 908
            NMYRSPIF HKVPMTDYILVRSAKGKLSLRRI +NFAVGQQEPLMEVFSPGTKSLQ FMM
Sbjct: 856  NMYRSPIFSHKVPMTDYILVRSAKGKLSLRRINKNFAVGQQEPLMEVFSPGTKSLQTFMM 915

Query: 909  NRLTLYIFREFLAAEKRRRLPYIRVDELPSQFPYLSETVIRKKLKEYALQQKSSNGQTIL 968
            NRLT YIFREFLAAEK RR+PYIRVDELPSQFPYLSETVIRKKLKEYALQQ++S+GQ IL
Sbjct: 916  NRLTSYIFREFLAAEKHRRIPYIRVDELPSQFPYLSETVIRKKLKEYALQQRNSSGQIIL 975

Query: 969  IKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLGITEVHPSAISSAMSRLPDEAITL 1028
            IKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLGITEVHPSAISSAMSRLPDEAITL
Sbjct: 976  IKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLGITEVHPSAISSAMSRLPDEAITL 1035

Query: 1029 AAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSYVRSVPKAP 1088
            AAASHIERELQITPW LSSNFVACTTQGKENIER+EITGVGDPSGRGLGFSYVRSVPK P
Sbjct: 1036 AAASHIERELQITPWTLSSNFVACTTQGKENIERMEITGVGDPSGRGLGFSYVRSVPKPP 1095

Query: 1089 ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI 1148
            ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI
Sbjct: 1096 ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI 1155

Query: 1149 RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLSTSDGAEN 1208
            RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLS SDGAEN
Sbjct: 1156 RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLSASDGAEN 1215

Query: 1209 ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRPSIAQTEEE 1268
            ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRR SI QTE+E
Sbjct: 1216 ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRASIIQTEDE 1275

Query: 1269 IEDEVAEATELCRLLMDDEAERRRKKKKNKVMGEAILVPGLQASFGHEIPQQTRHLVSIA 1328
            IEDEVAEA ELCRLLMDDEAERRRKKKKNKVMGE+ L  G QASFG E  +Q RHLVSIA
Sbjct: 1276 IEDEVAEAAELCRLLMDDEAERRRKKKKNKVMGESTLTSGFQASFGLENSEQIRHLVSIA 1335

Query: 1329 QPDIAYTSKENIRDQKEVESIINRKDKSGKFKPMKKNYSSEMSLLNKKLKISGDKVKNFK 1388
            QPD+ Y SKENIRDQKEVE+IINRK+KS K KPMKK YSSEMSL+NKKLKISGDKVKNFK
Sbjct: 1336 QPDVTYISKENIRDQKEVENIINRKEKSAKLKPMKKTYSSEMSLINKKLKISGDKVKNFK 1395

Query: 1389 EKKSARESFVCGKCGQFGHMRTNKNCPKYGEDMETPETTDQEKVSIKLNTVDPSSQSHQK 1448
            EKKSARESFVCGKCGQFGHMRTNKNCPKYGE++ETP+TTDQEKVSIK N +DPS+QS+QK
Sbjct: 1396 EKKSARESFVCGKCGQFGHMRTNKNCPKYGEELETPDTTDQEKVSIKSNNMDPSNQSNQK 1455

Query: 1449 AHTKKVTPKTITKISTTETFEGEKSTLMAKMLPVKFKCSSSEKLSDNLSPGVPQTSDLPF 1508
            A TKK TPKT  KISTTE FEGEKSTLMAK+LPVKFKCSS+++LSDNLSP +PQTSDLP 
Sbjct: 1456 AQTKKATPKTAAKISTTEAFEGEKSTLMAKVLPVKFKCSSTDRLSDNLSPVLPQTSDLPV 1515

Query: 1509 NSDNETGKSVVKVNKITFSKKRNEDIQFESHKPSIVIRPPDAKKVYAEAHKPSIVIRPPT 1568
            NSDNETGKSVVKVNKITFSKKR ED+QF+SHKPSIVIRPPDAKKV  +AHKPSIVIRPPT
Sbjct: 1516 NSDNETGKSVVKVNKITFSKKRTEDVQFQSHKPSIVIRPPDAKKVSIDAHKPSIVIRPPT 1575

Query: 1569 SIDRDRMEFPRRSATVVRSPAETKREKLNKKLIIKRPKEVIDLDQMGFDASAGMQYRKTK 1628
            +IDRD++EFPRR+A ++RS AET +E+L+KKL+IKRPKEV+DLD+ G+D S GM+YRKTK
Sbjct: 1576 NIDRDKIEFPRRTAAIIRSAAETDKEQLHKKLVIKRPKEVVDLDRCGYDGSVGMEYRKTK 1635

Query: 1629 RIVELSSFENHTRPGSMSSAESGKKKVREDQIWWEKQERQKNEERLREEKARRVYKEEMR 1688
            +IVELSSFE HTRPGSMSSA+SGKKK RE+  WWEKQE+++ EERLREE+ARRVY EEM 
Sbjct: 1636 KIVELSSFEKHTRPGSMSSADSGKKKAREEHRWWEKQEKRRKEERLREEEARRVYNEEMG 1695

Query: 1689 MRDEQEKLAEIRRFEASIRSDKEEEERIKAKKKKKKRIPEIMDDYMEDPRSRQ---RIPE 1748
            MR+EQEKLAEIRRFEASIRSDKEEEER+KAKKKKKKRIPEIM+DY+EDPRSR+   R+ E
Sbjct: 1696 MREEQEKLAEIRRFEASIRSDKEEEERLKAKKKKKKRIPEIMNDYVEDPRSRRMDKRVLE 1755

Query: 1749 RDRVVKRKPIELGRHGAEHASSTKRRRGGEVGLSNILERIVEALKDNFEISYLFLKPVSK 1808
            RDR +KRKPIELGRHGAEHASSTKRRR GEVGLSNILERIVE LK+ ++ISYLFLKPVSK
Sbjct: 1756 RDRSLKRKPIELGRHGAEHASSTKRRRVGEVGLSNILERIVETLKERYDISYLFLKPVSK 1815

Query: 1809 KEAPDYLDIIERPMDLSTIREKVRKLEYKTRDEFRTDVWQIMYNAHMYNDGRNPGIPPLA 1868
            KEAPDYLDIIERPMDLSTIREKVR+LEYKTRDEFR DVWQIMYNAHMYNDGRNPGIPPLA
Sbjct: 1816 KEAPDYLDIIERPMDLSTIREKVRRLEYKTRDEFRHDVWQIMYNAHMYNDGRNPGIPPLA 1875

Query: 1869 DKLLMHCDNLLNENDDELTEAEIGIEYRDS 1895
            D+LL  CDNLL++NDDELTEAE+GIE+RDS
Sbjct: 1876 DQLLGLCDNLLDKNDDELTEAEMGIEFRDS 1903

BLAST of MC09g0858 vs. ExPASy TrEMBL
Match: A0A0A0KA05 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G357030 PE=3 SV=1)

HSP 1 Score: 3137 bits (8134), Expect = 0.0
Identity = 1632/1890 (86.35%), Positives = 1749/1890 (92.54%), Query Frame = 0

Query: 9    DDDDYEDAGGGNRFLGFMFGNVDNSGDLDADYLDEDAKEHLDALADKLGSTLTDIDLSTK 68
            DDDDYEDAGGGNRFLGFMFGNVDNSGDLDADYLDEDAKEHLDALADKLG TLTDIDLSTK
Sbjct: 16   DDDDYEDAGGGNRFLGFMFGNVDNSGDLDADYLDEDAKEHLDALADKLGPTLTDIDLSTK 75

Query: 69   SAKTPSDAVEPDYDAKAEDAVDYEDIDEEYDGPEIEAAGEEDHLLPKKEYFSTEVSLATL 128
            S++  SDAVEPDYDAKAEDAVDYEDIDEEYDGPEIEAAGEEDHLLP++EYFS EVSL+TL
Sbjct: 76   SSRIQSDAVEPDYDAKAEDAVDYEDIDEEYDGPEIEAAGEEDHLLPRREYFSAEVSLSTL 135

Query: 129  EPTVSVFDDEDYDEDFEKVHDVINSSVEARTTHASDEKGECLEVAYEGEKSVADDDIQSA 188
            EPT SVFDDEDYDEDFE V DV+N+SVE +  HASDE+GECLE+  EGEKS+A   ++SA
Sbjct: 136  EPTASVFDDEDYDEDFENVPDVVNNSVEPQIIHASDEQGECLEIVSEGEKSLA---VESA 195

Query: 189  SLNNEVITSSAEELLEETPEVQKKLLDEKAHTPLPVLCMENGMAILQFSEIFGVHDSLKK 248
             LNNEVIT  AE L E TPEVQK+L D+K+HTPLPVLCMENGMAILQFSEIFGVHDSLKK
Sbjct: 196  PLNNEVITGRAESLHEGTPEVQKRLQDDKSHTPLPVLCMENGMAILQFSEIFGVHDSLKK 255

Query: 249  KEKRESRYCTRRDKYRSVDVSDIVEEDEEAFLHGFSQGVSCVKPASVVKDDTTMFNLDDP 308
            KEKR SRY TR+DKYRS DVSDIVEEDEEAFLHGFS+GVS VKPA  VKDDTTMF++DD 
Sbjct: 256  KEKRASRYYTRKDKYRSADVSDIVEEDEEAFLHGFSRGVSYVKPAYDVKDDTTMFDVDDL 315

Query: 309  EFTKFGVVQGVDVMAARVDWRQKDNCCGAEPMKQVFAENISIGSNSLLFKKFYPLDQQNW 368
            E+ KFGVVQGVDVM +RVDW+QKD+CCGAEPMKQV AEN+ IGSN LLF  FYPLDQQNW
Sbjct: 316  EYNKFGVVQGVDVMTSRVDWQQKDHCCGAEPMKQVVAENVPIGSNFLLFNTFYPLDQQNW 375

Query: 369  EEGILWDNSPVLSKNSAGSCEVSGSDLEDSVSSDVEQQVSIQIVRSEHHIDPNDRRQRLS 428
            EE ILWD+SPV SKN+ GS + SGSD+E S + DVE QVSIQIVRSEHHI  N   Q L 
Sbjct: 376  EERILWDDSPVSSKNAVGSYKASGSDIEASPNRDVEPQVSIQIVRSEHHIGLNGDGQSLY 435

Query: 429  QHDLPLLEPFGSRKFSGPEEPFSPEMIYHPQMLRLESWKDVDGSCQSEGTRENFSEEHQS 488
              D PLLEPFGSRK S  EE  SPE+IYHPQMLRLESWKDVD SCQS+G +EN  +E QS
Sbjct: 436  HCDFPLLEPFGSRKISRTEESISPEVIYHPQMLRLESWKDVDDSCQSDGLKENIPDERQS 495

Query: 489  NAIRCFSKFSPKNRRMLEGSWLDKVLWESDEPIEKQKFIFDLEDEHMLFEISDEKESKYI 548
            NA+R FSKFSPKNRRMLEGSWLDKVLWE+DEPIEK KFIFDLEDEHMLFEISDE +SKYI
Sbjct: 496  NAVRSFSKFSPKNRRMLEGSWLDKVLWETDEPIEKPKFIFDLEDEHMLFEISDENDSKYI 555

Query: 549  QFHAGAMILTRSSMSVNGNSFEISGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH 608
            QFH+GAMILTRSSMSVNGNSFE+SGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH
Sbjct: 556  QFHSGAMILTRSSMSVNGNSFELSGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH 615

Query: 609  GVKVFHSKPAMMLQTMKLKLSNKELANFHRPKASWYPHDNEMAVRELQKLPTQGPMKIIL 668
            G+KVFHSKPAMMLQTMKLKLSNKELANFHRPKA WYPHDNEM VRELQKLPTQGPMKIIL
Sbjct: 616  GIKVFHSKPAMMLQTMKLKLSNKELANFHRPKALWYPHDNEMTVRELQKLPTQGPMKIIL 675

Query: 669  KSLGGKGSKLFVDSEETVSSLMAKASKKLDMKSSEIVKVFYSGKELEREKSLAAQNVQPN 728
            KSLGGKGSK  VD EETVSS+MAKASKKLDMK SE++K+FYSGKELEREKSLAAQNVQPN
Sbjct: 676  KSLGGKGSKHIVDPEETVSSIMAKASKKLDMKPSEMIKLFYSGKELEREKSLAAQNVQPN 735

Query: 729  SLLHLVRSKIFVMPWTQHLRGDSKSVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGN 788
            SLLHLVRS+I++MP  Q+LRG+++SVRSPGAFKKKSDLSVKDG VFLMEYCEERPLLLGN
Sbjct: 736  SLLHLVRSQIYIMPRAQNLRGENRSVRSPGAFKKKSDLSVKDGRVFLMEYCEERPLLLGN 795

Query: 789  IGMGARLCTYYQKSSPDDQTGALLRNGGDSLGHVIVLEPSDKSPYIGDIKGGSVQASLET 848
            IGMGARLCTYYQKSSPDDQTGALLRNGGDSLGHVI+LEPSDKSPY+G++KGGSVQASLET
Sbjct: 796  IGMGARLCTYYQKSSPDDQTGALLRNGGDSLGHVIILEPSDKSPYLGELKGGSVQASLET 855

Query: 849  NMYRSPIFPHKVPMTDYILVRSAKGKLSLRRIYRNFAVGQQEPLMEVFSPGTKSLQMFMM 908
            NMYR+P+F HKVPMTDYILVRSAKGKLSLRR+ RNFAVGQQEPLMEVFSPGTKSLQ+FMM
Sbjct: 856  NMYRAPVFSHKVPMTDYILVRSAKGKLSLRRVDRNFAVGQQEPLMEVFSPGTKSLQIFMM 915

Query: 909  NRLTLYIFREFLAAEKRRRLPYIRVDELPSQFPYLSETVIRKKLKEYALQQKSSNGQTIL 968
            NRLTLY+FREFLAAEKRRR+P IRVDELPSQFPYLSETVIRKKLKEYALQQ++S+GQ IL
Sbjct: 916  NRLTLYMFREFLAAEKRRRIPDIRVDELPSQFPYLSETVIRKKLKEYALQQRNSSGQIIL 975

Query: 969  IKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLGITEVHPSAISSAMSRLPDEAITL 1028
            IKKRNAS+SLKKDAVTPEDVCKYESMQAGLYRLKHLG++EVHPSAISSAMSRLPDEAITL
Sbjct: 976  IKKRNASLSLKKDAVTPEDVCKYESMQAGLYRLKHLGLSEVHPSAISSAMSRLPDEAITL 1035

Query: 1029 AAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSYVRSVPKAP 1088
            AAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSYVRSVPKAP
Sbjct: 1036 AAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSYVRSVPKAP 1095

Query: 1089 ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI 1148
            ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI
Sbjct: 1096 ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI 1155

Query: 1149 RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLSTSDGAEN 1208
            RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLS SDGAEN
Sbjct: 1156 RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLSASDGAEN 1215

Query: 1209 ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRPSIAQTEEE 1268
            ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRPSI QTEEE
Sbjct: 1216 ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRPSIVQTEEE 1275

Query: 1269 IEDEVAEATELCRLLMDDEAERRRKKKKNKVMGEAILVPGLQASFGHEIPQQTRHLVSIA 1328
            IEDEVAEA ELCRLLMDDEAERRRKKKKNKVMGEA+L  G QASF HE P+QTRHL+SIA
Sbjct: 1276 IEDEVAEAAELCRLLMDDEAERRRKKKKNKVMGEAVLSTGFQASFFHEKPEQTRHLISIA 1335

Query: 1329 QPDIAYTSKENIRDQKEVESIINRKDKSGKFKPMKKNYSSEMSLLNKKLKISGDKVKNFK 1388
            QPD+ Y SKENIR+QKEVESI NRK+KSGK KPMKKNYSSEMSL+NKKLKISGDKVKNFK
Sbjct: 1336 QPDVTYISKENIREQKEVESISNRKEKSGKLKPMKKNYSSEMSLINKKLKISGDKVKNFK 1395

Query: 1389 EKKSARESFVCGKCGQFGHMRTNKNCPKYGEDMETPETTDQEKVSIKLNTVDPSSQSHQK 1448
            EKKSARESFVCGKCGQFGHMRTNKNCPKYGED+ETPETTDQ+KVSIKLN +DPS+QSHQK
Sbjct: 1396 EKKSARESFVCGKCGQFGHMRTNKNCPKYGEDLETPETTDQDKVSIKLNAMDPSNQSHQK 1455

Query: 1449 AHTKKVTPKTITKISTTETFEGEKSTLMAKMLPVKFKCSSSEKLSDNLSPGVPQTSDLPF 1508
            A  KKVTPK I K  TTE FEGEKST  AK+LPVKFKCSS+++LSDNLSP +PQTSDLP 
Sbjct: 1456 AVVKKVTPKAIAKSFTTEAFEGEKST--AKVLPVKFKCSSADRLSDNLSPALPQTSDLPV 1515

Query: 1509 NSDNETGKSVVKVNKITFSKKRNEDIQFESHKPSIVIRPPDAKKVYAEAHKPSIVIRPPT 1568
            NSDNETGKS+VKVNKITFSKKR EDIQFESHKPSIVIRPPDAKKV  EAHKPSIVIRPPT
Sbjct: 1516 NSDNETGKSIVKVNKITFSKKRTEDIQFESHKPSIVIRPPDAKKVSLEAHKPSIVIRPPT 1575

Query: 1569 SIDRDRMEFPRRSATVVRSPAETKREKLNKKLIIKRPKEVIDLDQMGFDASAGMQYRKTK 1628
            ++DR+R EFPRRSAT++RS  ET++E+L+KKLIIKRPKEV DLD+  +D S  M+YRKTK
Sbjct: 1576 NMDRERTEFPRRSATIIRSAVETEKEQLHKKLIIKRPKEV-DLDRSAYDGSVDMEYRKTK 1635

Query: 1629 RIVELSSFENHTRPGSMSSAESGKKKVREDQIWWEKQERQKNEERLREEKARRVYKEEMR 1688
            RIVELSS E HTR GSMSS++SGKKKVRE   WWEKQE+Q+NEERLREEK RRVY E+M 
Sbjct: 1636 RIVELSSLEKHTRHGSMSSSDSGKKKVREKHRWWEKQEKQRNEERLREEKVRRVYNEQMG 1695

Query: 1689 MRDEQEKLAEIRRFEASIRSDKEEEERIKAKKKKKKRIPEIMDDYMEDPRSRQ---RIPE 1748
            MR+EQEKLAEIRRFEASIRSDKEEEER+KAKKKKKKRIPEI+DDY+EDPRSR+   R  E
Sbjct: 1696 MREEQEKLAEIRRFEASIRSDKEEEERLKAKKKKKKRIPEILDDYVEDPRSRRFDKRALE 1755

Query: 1749 RDRVVKRKPIELGRHGAEHASSTKRRRGGEVGLSNILERIVEALKDNFEISYLFLKPVSK 1808
            ++R +KRKPIELGRH  E ASSTKRRRGGEVGLSNILERIVE LKD F+ISYLF+KPVSK
Sbjct: 1756 KERSMKRKPIELGRHIPEQASSTKRRRGGEVGLSNILERIVETLKDRFDISYLFIKPVSK 1815

Query: 1809 KEAPDYLDIIERPMDLSTIREKVRKLEYKTRDEFRTDVWQIMYNAHMYNDGRNPGIPPLA 1868
            KEAPDYLDIIERPMDLSTIREKVR+LEYKTRDEFR DVWQIMYNAH+YNDGRNPGIPPLA
Sbjct: 1816 KEAPDYLDIIERPMDLSTIREKVRRLEYKTRDEFRNDVWQIMYNAHLYNDGRNPGIPPLA 1875

Query: 1869 DKLLMHCDNLLNENDDELTEAEIGIEYRDS 1895
            D+LLM CDNLL + D++LTEAEIGIEYRDS
Sbjct: 1876 DQLLMLCDNLLKQCDEDLTEAEIGIEYRDS 1899

BLAST of MC09g0858 vs. ExPASy TrEMBL
Match: A0A1S3C886 (transcription initiation factor TFIID subunit 1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103497604 PE=3 SV=1)

HSP 1 Score: 3131 bits (8118), Expect = 0.0
Identity = 1629/1890 (86.19%), Positives = 1744/1890 (92.28%), Query Frame = 0

Query: 9    DDDDYEDAGGGNRFLGFMFGNVDNSGDLDADYLDEDAKEHLDALADKLGSTLTDIDLSTK 68
            DDDDYEDAGGGNRFLGFMFGNVDNSGDLDADYLDEDAKEHLDALADKLG TLTDIDLSTK
Sbjct: 16   DDDDYEDAGGGNRFLGFMFGNVDNSGDLDADYLDEDAKEHLDALADKLGPTLTDIDLSTK 75

Query: 69   SAKTPSDAVEPDYDAKAEDAVDYEDIDEEYDGPEIEAAGEEDHLLPKKEYFSTEVSLATL 128
            S+K  SDAVEPDYDAKAEDAVDYEDIDEEYDGPEIEA GEEDHLLP++EYFS EVSLATL
Sbjct: 76   SSKIQSDAVEPDYDAKAEDAVDYEDIDEEYDGPEIEAVGEEDHLLPRREYFSAEVSLATL 135

Query: 129  EPTVSVFDDEDYDEDFEKVHDVINSSVEARTTHASDEKGECLEVAYEGEKSVADDDIQSA 188
            EPT  VFDDEDYDEDFEKV D++N+SVE +  HASDE+GE L++  EGE+  A  D+QSA
Sbjct: 136  EPTAPVFDDEDYDEDFEKVPDIVNNSVEPQIIHASDERGEGLDIVSEGEEPFAVGDLQSA 195

Query: 189  SLNNEVITSSAEELLEETPEVQKKLLDEKAHTPLPVLCMENGMAILQFSEIFGVHDSLKK 248
             LNNEV T  AE L E TPEVQK+L D+K+HTPLPVLCMENGMAILQFSEIFGVHDSLKK
Sbjct: 196  LLNNEVTTGGAEGLHEGTPEVQKRLQDDKSHTPLPVLCMENGMAILQFSEIFGVHDSLKK 255

Query: 249  KEKRESRYCTRRDKYRSVDVSDIVEEDEEAFLHGFSQGVSCVKPASVVKDDTTMFNLDDP 308
            KEKR SRY TRRDKYRS DVSDIVEEDEEAFLHGFS+GVS VKPA  VKDDTTMF++D+ 
Sbjct: 256  KEKRASRYYTRRDKYRSADVSDIVEEDEEAFLHGFSRGVSYVKPAYDVKDDTTMFDVDEL 315

Query: 309  EFTKFGVVQGVDVMAARVDWRQKDNCCGAEPMKQVFAENISIGSNSLLFKKFYPLDQQNW 368
            E+ KFGVVQGVDV A+RVDW+QKD+CCGAEPMK V AEN+ IGSN LLF KFYPLDQQNW
Sbjct: 316  EYNKFGVVQGVDVTASRVDWQQKDHCCGAEPMKPVVAENVPIGSNFLLFNKFYPLDQQNW 375

Query: 369  EEGILWDNSPVLSKNSAGSCEVSGSDLEDSVSSDVEQQVSIQIVRSEHHIDPNDRRQRLS 428
            EE ILWD+SPV SKN+AGS E SGSD+E S + DVE QVSIQIVRSEH I  N   Q L 
Sbjct: 376  EERILWDDSPVSSKNAAGSYEASGSDIEASPNRDVEPQVSIQIVRSEHRIGLNGDGQSLY 435

Query: 429  QHDLPLLEPFGSRKFSGPEEPFSPEMIYHPQMLRLESWKDVDGSCQSEGTRENFSEEHQS 488
             H  PLLEPFGSRK S  EE  SP++IYHPQMLRLESWKDVD SCQS+G +EN  +E QS
Sbjct: 436  NHGFPLLEPFGSRKISRTEESVSPDVIYHPQMLRLESWKDVDDSCQSDGIKENIPDELQS 495

Query: 489  NAIRCFSKFSPKNRRMLEGSWLDKVLWESDEPIEKQKFIFDLEDEHMLFEISDEKESKYI 548
              +R FSKFSPKNRRMLEGSWLD+VLWE+DEPIEK KFIFDLEDEHMLFEISDE ESKYI
Sbjct: 496  KTVRSFSKFSPKNRRMLEGSWLDEVLWETDEPIEKPKFIFDLEDEHMLFEISDENESKYI 555

Query: 549  QFHAGAMILTRSSMSVNGNSFEISGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH 608
            QFH+GAMILTRSSMSVNGNSFE+SGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH
Sbjct: 556  QFHSGAMILTRSSMSVNGNSFEVSGSGGQGGWRFVSNDKHYSNRKASQQLKSNSKKRSVH 615

Query: 609  GVKVFHSKPAMMLQTMKLKLSNKELANFHRPKASWYPHDNEMAVRELQKLPTQGPMKIIL 668
            G+KVFHSKPAMMLQTMKLKLSNKELANFHRPKA WYPHDNEM V+ELQKLPTQGPMKIIL
Sbjct: 616  GIKVFHSKPAMMLQTMKLKLSNKELANFHRPKALWYPHDNEMTVKELQKLPTQGPMKIIL 675

Query: 669  KSLGGKGSKLFVDSEETVSSLMAKASKKLDMKSSEIVKVFYSGKELEREKSLAAQNVQPN 728
            KSLGGKGSK  VD EETVSS+MAKASKKLDMK SE++K+FYSGKELEREKSLAAQNVQPN
Sbjct: 676  KSLGGKGSKHIVDPEETVSSIMAKASKKLDMKPSEMIKLFYSGKELEREKSLAAQNVQPN 735

Query: 729  SLLHLVRSKIFVMPWTQHLRGDSKSVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGN 788
            SLLHLVRSKI++MP TQ+LRG+++SVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLL N
Sbjct: 736  SLLHLVRSKIYIMPRTQNLRGENRSVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLSN 795

Query: 789  IGMGARLCTYYQKSSPDDQTGALLRNGGDSLGHVIVLEPSDKSPYIGDIKGGSVQASLET 848
            IGMGARLCTYYQKSSPDDQTG LLRNGGDSLGHVI+LEPSDKSPY+G++KGGS+QASLET
Sbjct: 796  IGMGARLCTYYQKSSPDDQTGTLLRNGGDSLGHVIILEPSDKSPYLGELKGGSIQASLET 855

Query: 849  NMYRSPIFPHKVPMTDYILVRSAKGKLSLRRIYRNFAVGQQEPLMEVFSPGTKSLQMFMM 908
            NMYR+P+F HKVPMTDYILVRSAKGKLSLRRI +NFAVGQQEPLMEVFSPGTKSLQ+FMM
Sbjct: 856  NMYRAPVFSHKVPMTDYILVRSAKGKLSLRRIDKNFAVGQQEPLMEVFSPGTKSLQIFMM 915

Query: 909  NRLTLYIFREFLAAEKRRRLPYIRVDELPSQFPYLSETVIRKKLKEYALQQKSSNGQTIL 968
            NRLTLY+FREFLAAEKRRR+P IRVDELPSQFPYLSETVIRKKLKEYALQQ++S+GQ IL
Sbjct: 916  NRLTLYMFREFLAAEKRRRIPDIRVDELPSQFPYLSETVIRKKLKEYALQQRNSSGQIIL 975

Query: 969  IKKRNASISLKKDAVTPEDVCKYESMQAGLYRLKHLGITEVHPSAISSAMSRLPDEAITL 1028
            IKKRNAS+SLKKDAVTPEDVCKYESMQAGLYRLKHLG++EVHPSAISSAMSRLPDEAITL
Sbjct: 976  IKKRNASLSLKKDAVTPEDVCKYESMQAGLYRLKHLGLSEVHPSAISSAMSRLPDEAITL 1035

Query: 1029 AAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSYVRSVPKAP 1088
            AAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSYVRSVPKAP
Sbjct: 1036 AAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSYVRSVPKAP 1095

Query: 1089 ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI 1148
            ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI
Sbjct: 1096 ISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMI 1155

Query: 1149 RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLSTSDGAEN 1208
            RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLS SDGAEN
Sbjct: 1156 RRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLSASDGAEN 1215

Query: 1209 ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRPSIAQTEEE 1268
            ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRPSIAQTEEE
Sbjct: 1216 ESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRPSIAQTEEE 1275

Query: 1269 IEDEVAEATELCRLLMDDEAERRRKKKKNKVMGEAILVPGLQASFGHEIPQQTRHLVSIA 1328
            IEDEVAEA E CRLLMDDE ERRRKKKKNKVMGEA+L  G QASF HE P+QTRHL+SIA
Sbjct: 1276 IEDEVAEAAEFCRLLMDDETERRRKKKKNKVMGEAVLSTGFQASFFHEKPEQTRHLISIA 1335

Query: 1329 QPDIAYTSKENIRDQKEVESIINRKDKSGKFKPMKKNYSSEMSLLNKKLKISGDKVKNFK 1388
            QPD+ Y SKENIR+QKEVESIINRK+KSGK KP KKNYSSEMSL+NKKLKISGDKVKNFK
Sbjct: 1336 QPDVTYISKENIREQKEVESIINRKEKSGKLKPTKKNYSSEMSLINKKLKISGDKVKNFK 1395

Query: 1389 EKKSARESFVCGKCGQFGHMRTNKNCPKYGEDMETPETTDQEKVSIKLNTVDPSSQSHQK 1448
            EKKSARESFVCGKCGQFGHMRTNKNCPKYGED+ETPETTDQEKVSIKLN +DPS+QSHQK
Sbjct: 1396 EKKSARESFVCGKCGQFGHMRTNKNCPKYGEDLETPETTDQEKVSIKLNAMDPSNQSHQK 1455

Query: 1449 AHTKKVTPKTITKISTTETFEGEKSTLMAKMLPVKFKCSSSEKLSDNLSPGVPQTSDLPF 1508
            A  KKVTPK I K  TTE FEGEKST  AK+LPVKFKCSS+++LSDNLSP +PQTSDLP 
Sbjct: 1456 AVVKKVTPKAIAKSFTTEAFEGEKST--AKVLPVKFKCSSADRLSDNLSPALPQTSDLPV 1515

Query: 1509 NSDNETGKSVVKVNKITFSKKRNEDIQFESHKPSIVIRPPDAKKVYAEAHKPSIVIRPPT 1568
            NSDNETGKS+VKVNKITFSKKR EDIQFESHKPSIVIRPPDAKKV  EAHKPSIVIRPPT
Sbjct: 1516 NSDNETGKSIVKVNKITFSKKRTEDIQFESHKPSIVIRPPDAKKVSLEAHKPSIVIRPPT 1575

Query: 1569 SIDRDRMEFPRRSATVVRSPAETKREKLNKKLIIKRPKEVIDLDQMGFDASAGMQYRKTK 1628
            +IDRDR EFPRRSAT++RS  ET++E+L+KKLIIKRPKEVIDLD+  +D S  M+YRKTK
Sbjct: 1576 NIDRDRTEFPRRSATIIRSAVETEKEQLHKKLIIKRPKEVIDLDRSAYDGSVDMEYRKTK 1635

Query: 1629 RIVELSSFENHTRPGSMSSAESGKKKVREDQIWWEKQERQKNEERLREEKARRVYKEEMR 1688
            RIVELSSFE HTR GSMSS++SGKKKV+E   WWEKQE+Q+NEERLREEK RRVY E+M 
Sbjct: 1636 RIVELSSFEKHTRYGSMSSSDSGKKKVKEKHRWWEKQEKQRNEERLREEKVRRVYNEQMG 1695

Query: 1689 MRDEQEKLAEIRRFEASIRSDKEEEERIKAKKKKKKRIPEIMDDYMEDPRSRQ---RIPE 1748
            MR+EQEKLAEIRRFEASIRSDKEEEER+KAKKKKKKRIPEI+DDY+EDPRSR+   R  E
Sbjct: 1696 MREEQEKLAEIRRFEASIRSDKEEEERLKAKKKKKKRIPEILDDYVEDPRSRRFDKRALE 1755

Query: 1749 RDRVVKRKPIELGRHGAEHASSTKRRRGGEVGLSNILERIVEALKDNFEISYLFLKPVSK 1808
            ++R +KRKPIELGRH  EHAS TKRRRGGEVGLSNILERIVE LKD F+ISYLFLKPVSK
Sbjct: 1756 KERSMKRKPIELGRHIPEHAS-TKRRRGGEVGLSNILERIVETLKDRFDISYLFLKPVSK 1815

Query: 1809 KEAPDYLDIIERPMDLSTIREKVRKLEYKTRDEFRTDVWQIMYNAHMYNDGRNPGIPPLA 1868
            KEAPDYLDIIERPMDLSTIREKVR+LEYKTRDEFR DVWQIMYNAH+YNDGRNPGIPPLA
Sbjct: 1816 KEAPDYLDIIERPMDLSTIREKVRRLEYKTRDEFRNDVWQIMYNAHLYNDGRNPGIPPLA 1875

Query: 1869 DKLLMHCDNLLNENDDELTEAEIGIEYRDS 1895
            D+LLM CDNLL + D++LTEAEIGIEYRD+
Sbjct: 1876 DQLLMLCDNLLKQCDEDLTEAEIGIEYRDN 1902

BLAST of MC09g0858 vs. TAIR 10
Match: AT1G32750.1 (HAC13 protein (HAC13) )

HSP 1 Score: 1688.7 bits (4372), Expect = 0.0e+00
Identity = 1010/1942 (52.01%), Positives = 1325/1942 (68.23%), Query Frame = 0

Query: 9    DDDDYEDAGGGNRFLGFMFGNVDNSGDLDADYLDEDAKEHLDALADKLGSTLTDIDLSTK 68
            DDD+YED   G   LGF+FGNVDNSGDLDADYLDEDAKEHL ALADKLGS+L DI+L  K
Sbjct: 17   DDDEYEDNSRGFN-LGFIFGNVDNSGDLDADYLDEDAKEHLSALADKLGSSLPDINLLAK 76

Query: 69   SAKTPSDAVEPDYDAKAEDAVDYEDIDEEYDGPEIEAAGEEDHLLPKKEYFSTEVSLATL 128
            S +T SD  E DYD KAEDAVDYEDIDEEYDGPE++   EEDHLLPKKEYFST V+L +L
Sbjct: 77   SERTASDPAEQDYDRKAEDAVDYEDIDEEYDGPEVQVVSEEDHLLPKKEYFSTAVALGSL 136

Query: 129  EPTVSVFDDEDYDEDFEKVHD--VINSSVEARTTHASDEKGECLEVAYEGEKSVADDDIQ 188
            +   SVFDDEDYDE+ E+  +   +  S+E         K E   + YE E S+ D +  
Sbjct: 137  KSRASVFDDEDYDEEEEQEEEQAPVEKSLETEKREPVVLK-EDKALEYEEEASILDKE-- 196

Query: 189  SASLNNEVITSSAEELLEETPEVQKKLLDEKAHTPLPVLCMENGMAILQFSEIFGVHDSL 248
                 + + T   +E  EE  E+ +  LD+K  TPLP L +E+GM ILQFSEIF +H+  
Sbjct: 197  -----DHMDTEDVQE--EEVDELLEGTLDDKGATPLPTLYVEDGMVILQFSEIFAIHEPP 256

Query: 249  KKKEKRESRYCTRRDKYRSVDVSDIVEEDEEAFLHGFSQGVSCVKPASVVKDDTTMFNLD 308
            +K+++RE+RY T RDKY+S+D+S++VE+DEE  L    +  + V+ A +++ D      +
Sbjct: 257  QKRDRRENRYVTCRDKYKSMDISELVEDDEEVLLKSHGRIDTHVEQADLIQLDVPFPIRE 316

Query: 309  DPEFTKFGVVQGVDVMAARVDWRQKDNCCGAEPMKQVFAENISIGSNSLLFKKFYPLDQQ 368
              +  K   + G+   +       +D+C   E +KQ F ++ S    S L  + +PLDQ 
Sbjct: 317  GLQLVKASTIGGITPESREFTKLGRDSCIMGELLKQDFIDDNSSLCQSQLSMQVFPLDQH 376

Query: 369  NWEEGILWDNSPVLSKNSAGSCEVS---GSDLEDSVSSDVEQQVSIQIVRSEHHIDPNDR 428
             WE  I+W++SP +S NS    E        L    +S+ EQ+ S+ +V S   +  ++ 
Sbjct: 377  EWERRIIWEHSPEISGNSGEIFEPGLEPEGMLVKGTNSETEQE-SLNVVNSRVQVQADN- 436

Query: 429  RQRLSQHDLPLLEPFGSRKFSGPEEPFSPEMIYHPQMLRLESWKDVDGSCQSEGTRENFS 488
                      LLE FGSR      E  +    +HPQ+LRLES  D +    ++       
Sbjct: 437  -NMFVPFSANLLESFGSRGSQSTNESTNKSR-HHPQLLRLESQWDENHLSGNDEAGVKKI 496

Query: 489  EEHQSNAIRCFSKFSPKNRRMLEGSWLDKVLWESDEPIEKQKFIFDLEDEHMLFEISDEK 548
            +  + +A+  FS+   + R + + +WLD ++W+S++ + + K IFDL+DE M+FEI D +
Sbjct: 497  KRLEKDALGRFSRLVLRERDLGDEAWLDSIIWDSEKELSRSKLIFDLQDEQMVFEIFDNE 556

Query: 549  ESKYIQFHAGAMILTRSSMSVNGNSFEISGSGGQGGWRF-VSNDKHYSNRKASQQLKSNS 608
            ESK +Q HAGAMI++RSS S    +F+  G     GW+F +SNDK Y N K+SQQL++N+
Sbjct: 557  ESKNLQLHAGAMIVSRSSKS-KDETFQ-EGCESNSGWQFNLSNDKFYMNGKSSQQLQANT 616

Query: 609  KKRSVHGVKVFHSKPAMMLQTMKLKLSNKELANFHRPKASWYPHDNEMAVRELQKLPTQG 668
             K SVH ++VFHS PA+ LQTMK KLSNK++ANFHRPKA WYPHDNE+A+++  KLPT+G
Sbjct: 617  NKSSVHSLRVFHSVPAIKLQTMKSKLSNKDIANFHRPKALWYPHDNELAIKQQGKLPTRG 676

Query: 669  PMKIILKSLGGKGSKLFVDSEETVSSLMAKASKKLDMKSSEIVKVFYSGKELEREKSLAA 728
             MKII+KSLGGKGSKL V  EE+VSSL AKAS+KLD K +E VK+FY GKEL+ EKSLAA
Sbjct: 677  SMKIIVKSLGGKGSKLHVGIEESVSSLRAKASRKLDFKETEAVKMFYKGKELDDEKSLAA 736

Query: 729  QNVQPNSLLHLVRSKIFVMPWTQHLRGDSKSVRSPGAFKKKSDLSVKDGHVFLMEYCEER 788
            QNVQPNSL+HL+R+K+ + PW Q L G++KS+R PGAFKKKSDLS KDGHVFLMEYCEER
Sbjct: 737  QNVQPNSLVHLIRTKVHLWPWAQKLPGENKSLRPPGAFKKKSDLSTKDGHVFLMEYCEER 796

Query: 789  PLLLGNIGMGARLCTYYQKSSPDDQTGALLRNGGDSLGHVIVLEPSDKSPYIGDIKGGSV 848
            PL+L N GMGA LCTYYQKSSP+DQ G LLRN  D+LG+V++LEP DKSP++G+I  G  
Sbjct: 797  PLMLSNAGMGANLCTYYQKSSPEDQRGNLLRNQSDTLGNVMILEPGDKSPFLGEIHAGCS 856

Query: 849  QASLETNMYRSPIFPHKVPMTDYILVRSAKGKLSLRRIYRNFAVGQQEPLMEVFSPGTKS 908
            Q+S+ETNMY++PIFP ++  TDY+LVRS KGKLSLRRI +   VGQQEP MEV SPG+K+
Sbjct: 857  QSSVETNMYKAPIFPQRLQSTDYLLVRSPKGKLSLRRIDKIVVVGQQEPRMEVMSPGSKN 916

Query: 909  LQMFMMNRLTLYIFREFLAAEKRRRLPY-IRVDELPSQFPYLSETVIRKKLKEYALQQKS 968
            LQ +++NR+ +Y++REF    KR    + I  DEL   F  L++ +I+K +K  A  ++ 
Sbjct: 917  LQTYLVNRMLVYVYREFF---KRGGGEHPIAADELSFLFSNLTDAIIKKNMKIIACWKRD 976

Query: 969  SNGQTILIKKRN---ASISLKKDAVTPEDVCKYESMQAGLYRLKHLGITE-VHPSAISSA 1028
             NGQ+   KK +      S  K  V PE VC YESM AGLYRLKHLGIT    P++IS+A
Sbjct: 977  KNGQSYWTKKDSLLEPPESELKKLVAPEHVCSYESMLAGLYRLKHLGITRFTLPASISNA 1036

Query: 1029 MSRLPDEAITLAAASHIERELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLG 1088
            +++LPDEAI LAAASHIERELQITPWNLSSNFVACT Q + NIERLEITGVGDPSGRGLG
Sbjct: 1037 LAQLPDEAIALAAASHIERELQITPWNLSSNFVACTNQDRANIERLEITGVGDPSGRGLG 1096

Query: 1089 FSYVRSVPKAPISNASLKKKAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIA 1148
            FSYVR+ PKAP +   +KKKAA+ RG+  VTGTDADLRRLSM+AA+EVL+KF+V +E IA
Sbjct: 1097 FSYVRAAPKAPAAAGHMKKKAAAGRGAPTVTGTDADLRRLSMEAAREVLIKFNVPDEIIA 1156

Query: 1149 KLTRWHRIAMIRRLSSEQAASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQI 1208
            K TRWHRIAMIR+LSSEQAASGV+VDPTTI KYARGQRMSFLQ+Q+Q REKCQEIW+RQ+
Sbjct: 1157 KQTRWHRIAMIRKLSSEQAASGVKVDPTTIGKYARGQRMSFLQMQQQAREKCQEIWDRQL 1216

Query: 1209 QSLSTSDGAENESDSEGNSDLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMR 1268
             SLS  DG ENES++E NSDLDSFAGDLENLLDAEE  +  ++   +++K DGVKGLKMR
Sbjct: 1217 LSLSAFDGDENESENEANSDLDSFAGDLENLLDAEEGGEGEESNISKNDKLDGVKGLKMR 1276

Query: 1269 RRPSIAQTEEEIEDEVAEATELCRLLMDDEAERRRKKKKNKVMGEAI-----LVPGLQAS 1328
            RRPS  +T+EEIEDE  E  ELCRLLM DE ++++KKKK K +GE +       P +   
Sbjct: 1277 RRPSQVETDEEIEDEATEYAELCRLLMQDE-DQKKKKKKMKGVGEGMGSYPPPRPNIALQ 1336

Query: 1329 FGHEIPQ---QTRHLVSIAQPDIAYTSKEN-IRDQKEVESIINRKDKSGKFKPMKKNYSS 1388
             G  + +     +  ++I QPD ++   E+ I+D + V+SII    K+ K K +K+N +S
Sbjct: 1337 SGEPVRKANAMDKKPIAI-QPDASFLVNESTIKDNRNVDSII----KTPKGKQVKENSNS 1396

Query: 1389 EMSLLNKKLKISGDKVKNFKEKKSARESFVCGKCGQFGHMRTNKNCPKYGEDMET-PETT 1448
               L  KK+KI  + +K FKEKKSARE+FVCG CGQ GHMRTNK+CP+Y E+ E+ PE  
Sbjct: 1397 LGQL--KKVKILNENLKVFKEKKSARENFVCGACGQHGHMRTNKHCPRYRENTESQPEGI 1456

Query: 1449 DQEKVSIKLNTVDPSSQSHQK-AHTKKVTPKTITKISTTETFEGEKSTLMAKMLPVKFKC 1508
            D +K + K ++ +PS     K     K  PK+  K S  E  +G+K +     LP+KF+ 
Sbjct: 1457 DMDKSAGKPSSSEPSGLPKLKPIKNSKAAPKSAMKTSVDEALKGDKLSSKTGGLPLKFRY 1516

Query: 1509 S-SSEKLSDNLSPGVPQTSDLPFNSDNETG-KSVVKVNKI-------------------- 1568
               +  LSD      P +S+    SD +TG KS  K++K+                    
Sbjct: 1517 GIPAGDLSDKPVSEAPGSSEQAVVSDIDTGIKSTSKISKLKISSKAKPKESKGESERRSH 1576

Query: 1569 ----TFSKKRNEDIQFESHKPSIVIRP-PDAKKVYAEAHKPSIVI-RPPTSIDRDRMEFP 1628
                TFS++R E    ESHKPS+  +P    ++  A + + +I I RP  S+D D+ E  
Sbjct: 1577 SLMPTFSRERGES---ESHKPSVSGQPLSSTERNQAASSRHTISIPRPSLSMDTDQAE-S 1636

Query: 1629 RRSATVVRSPAETKREKLNKKLIIKRPKEVIDLDQMGFDASAGMQYRKTKRIVELSSFEN 1688
            RR   V+R P  T+RE+  KKL+IKR KE+ D D    + S   + RKTKR+ EL+ F+ 
Sbjct: 1637 RRPHLVIRPP--TEREQPQKKLVIKRSKEMNDHDMSSLEESPRFESRKTKRMAELAGFQR 1696

Query: 1689 HTRPGSMSSAESGKKKVREDQIWWEKQERQKNEERLREEKARRVYKEEMRMRDEQEKLAE 1748
              +     S  S +++ +ED++WWE++E      R RE +ARR Y ++M + +E  ++AE
Sbjct: 1697 --QQSFRLSENSLERRPKEDRVWWEEEEISTG--RHREVRARRDY-DDMSVSEEPNEIAE 1756

Query: 1749 IRRFEASIRSDKEEEERIKAKKKKKKR--IPEIMDDYMED--PRSR-QRIPERDRVVKRK 1808
            IRR+E  IRS++EEEER KAKKKKKK+   PEI++ Y+ED  PR   +R+ ER R V+ +
Sbjct: 1757 IRRYEEVIRSEREEEERQKAKKKKKKKKLQPEIVEGYLEDYPPRKNDRRLSERGRNVRSR 1816

Query: 1809 PI-ELGRHGAEHASSTKRRRGGEVGLSNILERIVEALKDNFEISYLFLKPVSKKEAPDYL 1868
             + +  R GAE+A   KRR+ GEVGL+NILERIV+ L+   E+S LFLKPVSKKEAPDYL
Sbjct: 1817 YVSDFERDGAEYAPQPKRRKKGEVGLANILERIVDTLRLKEEVSRLFLKPVSKKEAPDYL 1876

Query: 1869 DIIERPMDLSTIREKVRKLEYKTRDEFRTDVWQIMYNAHMYNDGRNPGIPPLADKLLMHC 1895
            DI+E PMDLSTIR+KVRK+EY+ R++FR DVWQI YNAH+YNDGRNPGIPPLAD+LL  C
Sbjct: 1877 DIVENPMDLSTIRDKVRKIEYRNREQFRHDVWQIKYNAHLYNDGRNPGIPPLADQLLEIC 1919

BLAST of MC09g0858 vs. TAIR 10
Match: AT3G19040.1 (histone acetyltransferase of the TAFII250 family 2 )

HSP 1 Score: 1363.2 bits (3527), Expect = 0.0e+00
Identity = 857/1834 (46.73%), Positives = 1161/1834 (63.30%), Query Frame = 0

Query: 89   VDYEDIDEEYDGPEIEAAGEEDHLLPKKEYFSTEVSLATLEPTVSVFDDEDYDED--FEK 148
            VDY   DEEYDGPE++   EEDHLLPK+EY S   +L+ L    SVFDDEDYDE    EK
Sbjct: 5    VDYGSNDEEYDGPELQVVTEEDHLLPKREYLSAAFALSGLNSRASVFDDEDYDEQGGQEK 64

Query: 149  VHDVINSSVEARTTHASDEKGECLEVAYEGEKSVADDDIQSASLNNEVITSSAEELLEET 208
             H  +  S ++        K E   V +E E S+  +        N++ T   +E  E  
Sbjct: 65   EHVPVEKSFDSEEREPVVLKEE-KPVKHEKEASILGN-------KNQMDTGDVQE--ELV 124

Query: 209  PEVQKKLLDEKAHTPLPVLCME-NGMAILQFSEIFGVHDSLKKKEKRESRYCTRRDKYRS 268
              + +  LDEK  TPLP L +E +GM ILQFSEIF + +  KK++KRE R  T RDKY S
Sbjct: 125  VGLSEATLDEKRVTPLPTLYLEDDGMVILQFSEIFAIQEPQKKRQKREIRCITYRDKYIS 184

Query: 269  VDVSDIVEEDEEAFLHGFSQGVSCVKPASVVKDDTTMFNLDDPEFTKFGVVQGVDVMAAR 328
            +D+S+++E+DEE  L    +  +  K    ++ D  +   +  +  K G+V+     +  
Sbjct: 185  MDISELIEDDEEVLLKSHGRIDTHGKKTDQIQLDVPLPIRERSQLVKSGIVRDTTSESRE 244

Query: 329  VDWRQKDNCCGAEPMKQVFAENISIGSNSLLFKKFYPLDQQNWEEGILWDNSPVLSKNSA 388
                 +D+C   E +KQ   ++ S    S L  + +PLDQQ WE  ILW+ SP  S N  
Sbjct: 245  FTKLGRDSCIMGELLKQDLKDDNSSLCQSQLTMEVFPLDQQEWEHLILWEISPQFSANCC 304

Query: 389  ----GSCEVSGSDLEDSVSSDVEQQVSIQIVRSEHHIDPNDRRQRLSQHDLPLLEPFGSR 448
                   E +G  ++   S+ V +Q S+ ++ S       D    L    +  LE FGSR
Sbjct: 305  EGFKSGLESAGIMVQVRASNSVTEQESLNVMNSGGQTQ-GDNNNMLEPFFVNPLESFGSR 364

Query: 449  KFSGPEEPFSPEMIYHPQMLRLESWKDVDGSCQS-EGTRENFSEEHQSNAIRCFSKFSPK 508
                  E  +    +HPQ+LRLES  D D   ++ +  REN  ++  S+A    S  + +
Sbjct: 365  GSQSTNESTNKSR-HHPQLLRLESQWDEDHYRENGDAGRENL-KQLNSDARGRLSGLALQ 424

Query: 509  NRRMLEGSWLDKVLWESDEPIEKQKFIFDLEDEHMLFEISDEKESKYIQFHAGAMILTRS 568
            +R M + SWLD ++WESD+ + + K IFDL+DE M+FE+ + KE KY+Q HAG+ I++RS
Sbjct: 425  DRDMWDESWLDSIIWESDKDLSRSKLIFDLQDEQMIFEVPNNKERKYLQLHAGSRIVSRS 484

Query: 569  SMSVNGNSFEISGSGGQGGWRF-VSNDKHYSNRKASQQLKSNSKKRSVHGVKVFHSKPAM 628
            S S +G SF+  G G   GW+F +SNDK Y N K++Q+L+ N+KK +VH ++VFHS PA+
Sbjct: 485  SKSKDG-SFQ-EGCGSNSGWQFNISNDKFYMNGKSAQKLQGNAKKSTVHSLRVFHSAPAI 544

Query: 629  MLQTMKLKLSNKELANFHRPKASWYPHDNEMAVRELQKLPTQGPMKIILKSLGGKGSKLF 688
             LQTMK+KLSNKE ANFHRPKA WYPHDNE+A+++ + LPTQG M I++KSLGGKGS L 
Sbjct: 545  KLQTMKIKLSNKERANFHRPKALWYPHDNELAIKQQKILPTQGSMTIVVKSLGGKGSLLT 604

Query: 689  VDSEETVSSLMAKASKKLDMKSSEIVKVFYSGKELEREKSLAAQNVQPNSLLHLVRSKIF 748
            V  EE+VSSL AKAS+KLD K +E VK+FY GKELE EKSLA QNVQPNSL+HL+R+K+ 
Sbjct: 605  VGREESVSSLKAKASRKLDFKETEAVKMFYMGKELEDEKSLAEQNVQPNSLVHLLRTKVH 664

Query: 749  VMPWTQHLRGDSKSVRSPGAFKKKSDLSVKDGHVFLMEYCEERPLLLGNIGMGARLCTYY 808
            + PW Q L G++KS+R PGAFKKKSDLS +DGHVFLMEYCEERPL+L N GMGA LCTYY
Sbjct: 665  LWPWAQKLPGENKSLRPPGAFKKKSDLSNQDGHVFLMEYCEERPLMLSNAGMGANLCTYY 724

Query: 809  QKSSPDDQTGALLRNGGDSLGHVIVLEPSDKSPYIGDIKGGSVQASLETNMYRSPIFPHK 868
            QKSSP+DQ G LLRN  D+LG VI+LE  +KSP++G++ GG  Q+S+ETNMY++P+FPH+
Sbjct: 725  QKSSPEDQHGNLLRNQSDTLGSVIILEHGNKSPFLGEVHGGCSQSSVETNMYKAPVFPHR 784

Query: 869  VPMTDYILVRSAKGKLSLRRIYRNFAVGQQEPLMEVFSPGTKSLQMFMMNRLTLYIFREF 928
            +  TDY+LVRSAKGKLSLRRI +  AVGQQEP ME+ SP +K+L  +++NR+  Y++REF
Sbjct: 785  LQSTDYLLVRSAKGKLSLRRINKIVAVGQQEPRMEIMSPASKNLHAYLVNRMMAYVYREF 844

Query: 929  LAAEKRRRLPYIRVDELPSQFPYLSETVIRKKLKEYALQQKSSNGQTILIKKRN-ASISL 988
               ++      I  DEL   F  +S+  +RK ++  +  ++ +NG+    KKR    I L
Sbjct: 845  KHRDR------IAADELSFSFSNISDATVRKYMQVCSDLERDANGKACWSKKRKFDKIPL 904

Query: 989  KKDA-VTPEDVCKYESMQAGLYRLKHLGITE-VHPSAISSAMSRLPDEAITLAAASHIER 1048
              +  V PEDVC YESM AGL+RLKHLGIT    P++IS+A+++LPDE I  AAASHI R
Sbjct: 905  GLNTLVAPEDVCSYESMLAGLFRLKHLGITRFTLPASISTALAQLPDERI--AAASHIAR 964

Query: 1049 ELQITPWNLSSNFVACTTQGKENIERLEITGVGDPSGRGLGFSYVRSVPKAPISNASLKK 1108
            ELQITPWNLSS+FV C TQG+ENIERLEITGVGDPSGRGLGFSYVR  PK+  ++   KK
Sbjct: 965  ELQITPWNLSSSFVTCATQGRENIERLEITGVGDPSGRGLGFSYVRVAPKSSAASEHKKK 1024

Query: 1109 KAASSRGSSAVTGTDADLRRLSMDAAKEVLLKFDVSEEQIAKLTRWHRIAMIRRLSSEQA 1168
            KAA+ RG   VTGTDAD RRLSM+AA+EVLLKF+V +E IAK T+ HR AMIR++SSEQA
Sbjct: 1025 KAAACRGVPTVTGTDADPRRLSMEAAREVLLKFNVPDEIIAKQTQRHRTAMIRKISSEQA 1084

Query: 1169 ASGVQVDPTTISKYARGQRMSFLQLQRQTREKCQEIWERQIQSLSTSDGAENESDSEGNS 1228
            ASG +V PTT+  ++R QRMSFLQLQ+Q RE C EIW+RQ  SLS  D   NES++E NS
Sbjct: 1085 ASGGKVGPTTVGMFSRSQRMSFLQLQQQAREMCHEIWDRQRLSLSACDDDGNESENEANS 1144

Query: 1229 DLDSFAGDLENLLDAEEFEDEVDTFEIRHEKTDGVKGLKMRRRPSIAQTEEEIEDEVAEA 1288
            DLDSF GDLE+LLDAE+  +  ++ +  +EK DGVKGLKMRR PS  + +EEIEDE AE 
Sbjct: 1145 DLDSFVGDLEDLLDAEDGGEGEESNKSMNEKLDGVKGLKMRRWPSQVEKDEEIEDEAAEY 1204

Query: 1289 TELCRLLMDDEAERRRKKKKNKVMGEAI-LVPGLQASFGHEIPQQTRHLVSIAQPDIAYT 1348
             ELCRLLM DE +  +KKKK K +GE I   P  +++F   I +  +++ +         
Sbjct: 1205 VELCRLLMQDEND--KKKKKLKDVGEGIGSFPPPRSNFEPFIDK--KYIATEPDASFLIV 1264

Query: 1349 SKENIRDQKEVESIINRKDKSGKFKPMKKNYSSEMSLLNKKLKISGDKVKNFKEKKSARE 1408
            ++  ++  K V+   ++  K  + K +         +L +  K+       F  KK+AR 
Sbjct: 1265 NESTVKHTKNVDKATSKSPKDKQVKEIGTPICQMKKILKENQKV-------FMGKKTARA 1324

Query: 1409 SFVCGKCGQFGHMRTNKNCPKYGEDMET-PETTDQEKVSIKLNTVDPSSQSH-QKAHTKK 1468
            +FVCG CGQ GHM+TNK+CPKY  + E+ PE+ D +K + K ++ D S +        KK
Sbjct: 1325 NFVCGACGQHGHMKTNKHCPKYRRNTESQPESMDMKKSTGKPSSSDLSGEVWLTPIDNKK 1384

Query: 1469 VTPKTITKISTTETFE---------GEKSTLMAKMLPVKFKCSSSE-KLSDNLSPGVPQT 1528
              PK+ TKIS  E  +         G         +    K +S + K+S    P   + 
Sbjct: 1385 PAPKSATKISVNEATKVGDSTSKTPGSSDVAAVSEIDSGTKLTSRKLKISSKAKPKASKV 1444

Query: 1529 -SDLPFNSDNETGKSVVKVNKITFSKKRNEDIQFESHKPSI--VIRPPDAKKVYAEAHKP 1588
             SD PF+S               +S++R E    E H PS+   + P       A +   
Sbjct: 1445 ESDSPFHS-----------LMPAYSRERGES---ELHNPSVSGQLLPSTETDQAASSRYT 1504

Query: 1589 SIVIRPPTSIDRDRMEFPRRSATVVRSPAETKREKLNKKLIIKRPKEVIDLDQMGFDASA 1648
            + V +P  SID+D+ E   R   V+  P  T +E   KKL+IKR KE+ D D    + + 
Sbjct: 1505 TSVPQPSLSIDKDQAE-SCRPHRVIWPP--TGKEHSQKKLVIKRLKEITDHDSGSLEETP 1564

Query: 1649 GMQYRKTKRIVELSSFENHTRPG-SMSSAESGKKKVREDQIWWEKQERQKNEERLREEKA 1708
              + RKTKR+ EL+ F+   R   S +  + G K   +D+ W  ++E+  + E  RE K 
Sbjct: 1565 QFESRKTKRMAELADFQRQQRLRLSENFLDWGPK---DDRKW--RKEQDISTELHREGKV 1624

Query: 1709 RRVYKEEMRMRDEQEKLAEIRRFEASIRSDKEEEERIKAKKKKKKRIPEIMDDYMEDPRS 1768
            RR Y ++  + +E+ ++AE RR+   IRS++EEE+R KAK+KKK +   I+++Y   PR 
Sbjct: 1625 RRAY-DDSTVSEERSEIAESRRYREVIRSEREEEKRRKAKQKKKLQ-RGILENY--PPRR 1684

Query: 1769 RQRIPER--DRVVKRKPIELGRHGAEHASSTKRRRGGEVGLSNILERIVEALK-DNFEIS 1828
               I       +      +  R+  E+A   KRR+ G+VGL+NILE IV+ L+     +S
Sbjct: 1685 NDGISSESGQNINSLCVSDFERNRTEYAPQPKRRKKGQVGLANILESIVDTLRVKEVNVS 1744

Query: 1829 YLFLKPVSKKEAPDYLDIIERPMDLSTIREKVRKLEYKTRDEFRTDVWQIMYNAHMYNDG 1888
            YLFLKPV+KKEAP+YL+I++ PMDLSTIR+KVR++EY+ R +FR DVWQI +NAH+YNDG
Sbjct: 1745 YLFLKPVTKKEAPNYLEIVKCPMDLSTIRDKVRRMEYRDRQQFRHDVWQIKFNAHLYNDG 1778

Query: 1889 RNPGIPPLADKLLMHCDNLLNENDDELTEAEIGI 1891
            RN  IPPLAD+LL+ CD LL+E  DEL EAE GI
Sbjct: 1805 RNLSIPPLADELLVKCDRLLDEYRDELKEAEKGI 1778

BLAST of MC09g0858 vs. TAIR 10
Match: AT1G20670.1 (DNA-binding bromodomain-containing protein )

HSP 1 Score: 57.8 bits (138), Expect = 1.1e-07
Identity = 39/113 (34.51%), Positives = 60/113 (53.10%), Query Frame = 0

Query: 1781 ILERIVEALKDNFEISYLFLKPVSKKEAPDYLDIIERPMDLSTIREKVRKLEYKTRDEFR 1840
            IL+R+ +  KD + +   +  PV  +E PDY +II+ PMD ST+R K+    Y T ++F 
Sbjct: 183  ILDRLQK--KDTYGV---YSDPVDPEELPDYFEIIKNPMDFSTLRNKLDSGAYSTLEQFE 242

Query: 1841 TDVWQIMYNAHMYNDG------RNPGIPPLADKLLMHCDNLLNENDDELTEAE 1888
             DV+ I  NA  YN        +   I  LA K     +NL  ++DDE  +++
Sbjct: 243  RDVFLICTNAMEYNSADTVYYRQARAIQELAKK---DFENLRQDSDDEEPQSQ 287

BLAST of MC09g0858 vs. TAIR 10
Match: AT5G55040.1 (DNA-binding bromodomain-containing protein )

HSP 1 Score: 50.8 bits (120), Expect = 1.4e-05
Identity = 63/219 (28.77%), Positives = 103/219 (47.03%), Query Frame = 0

Query: 1658 DQIWWEKQERQKNEERLREEKARRVYK-EEMRMRDEQ--EKLAEIRR---FEASIRSD-- 1717
            D  + E++E +  EE+ R++K ++V K  + R R +   +  A +R    +E     D  
Sbjct: 50   DDYFDEEEEDEVEEEKKRQKKLKQVLKLNQSRARADPPVKSRARVRHASDYEEEDEEDDE 109

Query: 1718 -KEEEERIKAKKKKKKRIPEIMDDYMEDPRSRQRIPERDRVVKRKPIELGRHGAEHASST 1777
             +EEEE +  K++ KKR     D+  E+   +    E +     +  E G   +E     
Sbjct: 110  AEEEEEEVSEKRQVKKRKLNRQDEEEEEEEEKDYDVEEE-----EEEEEGHADSEEEDDK 169

Query: 1778 KRRRGGEVGL-------------SNILERIVEALKDNFEISYLFLKPVSKKEAPDYLDII 1837
            +R+R    G                 LE I++ L+   +I  ++ +PV  +E PDY D+I
Sbjct: 170  ERKRRSASGNQCDHSSETTPILDKKSLELILDKLQKK-DIYGVYAEPVDPEELPDYHDMI 229

Query: 1838 ERPMDLSTIREKVRKLEYKTRDEFRTDVWQIMYNAHMYN 1855
            E PMD ST+R+K+    Y T +E  +DV  I  NA  YN
Sbjct: 230  EHPMDFSTVRKKLANGSYSTLEELESDVLLICSNAMQYN 262

BLAST of MC09g0858 vs. TAIR 10
Match: AT5G55040.2 (DNA-binding bromodomain-containing protein )

HSP 1 Score: 50.8 bits (120), Expect = 1.4e-05
Identity = 63/219 (28.77%), Positives = 103/219 (47.03%), Query Frame = 0

Query: 1658 DQIWWEKQERQKNEERLREEKARRVYK-EEMRMRDEQ--EKLAEIRR---FEASIRSD-- 1717
            D  + E++E +  EE+ R++K ++V K  + R R +   +  A +R    +E     D  
Sbjct: 50   DDYFDEEEEDEVEEEKKRQKKLKQVLKLNQSRARADPPVKSRARVRHASDYEEEDEEDDE 109

Query: 1718 -KEEEERIKAKKKKKKRIPEIMDDYMEDPRSRQRIPERDRVVKRKPIELGRHGAEHASST 1777
             +EEEE +  K++ KKR     D+  E+   +    E +     +  E G   +E     
Sbjct: 110  AEEEEEEVSEKRQVKKRKLNRQDEEEEEEEEKDYDVEEE-----EEEEEGHADSEEEDDK 169

Query: 1778 KRRRGGEVGL-------------SNILERIVEALKDNFEISYLFLKPVSKKEAPDYLDII 1837
            +R+R    G                 LE I++ L+   +I  ++ +PV  +E PDY D+I
Sbjct: 170  ERKRRSASGNQCDHSSETTPILDKKSLELILDKLQKK-DIYGVYAEPVDPEELPDYHDMI 229

Query: 1838 ERPMDLSTIREKVRKLEYKTRDEFRTDVWQIMYNAHMYN 1855
            E PMD ST+R+K+    Y T +E  +DV  I  NA  YN
Sbjct: 230  EHPMDFSTVRKKLANGSYSTLEELESDVLLICSNAMQYN 262
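
The percentages reported on each HSP statistics line follow directly from the match counts and the alignment length. As a minimal sketch (the function name `hsp_percentages` is hypothetical and not part of any BLAST tool), the figures can be recomputed like this:

```python
import re

def hsp_percentages(stats_line):
    """Recompute the percentage for every 'matches/length' pair on an HSP stats line."""
    pairs = re.findall(r"(\d+)/(\d+)", stats_line)
    # Each pair is (matching positions, alignment length including gaps).
    return [round(100 * int(n) / int(total), 2) for n, total in pairs]

line = "Identity = 63/219 (28.77%), Positives = 103/219 (47.03%)"
print(hsp_percentages(line))  # [28.77, 47.03]
```

Note that the denominator is the alignment length (219 columns, gaps included), not the length of either protein.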

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q8LRK9 | 0.0e+00 | 52.01 | Transcription initiation factor TFIID subunit 1 OS=Arabidopsis thaliana OX=3702 ...
Q67W65 | 0.0e+00 | 48.96 | Transcription initiation factor TFIID subunit 1 OS=Oryza sativa subsp. japonica ...
Q6PUA2 | 0.0e+00 | 46.73 | Transcription initiation factor TFIID subunit 1b OS=Arabidopsis thaliana OX=3702...
P21675 | 2.0e-65 | 23.80 | Transcription initiation factor TFIID subunit 1 OS=Homo sapiens OX=9606 GN=TAF1 ...
Q8IZX4 | 2.6e-65 | 23.80 | Transcription initiation factor TFIID subunit 1-like OS=Homo sapiens OX=9606 GN=...

Match Name | E-value | Identity (%) | Description
XP_022155093.1 | 0.0 | 99.89 | transcription initiation factor TFIID subunit 1 [Momordica charantia]
XP_038899368.1 | 0.0 | 87.74 | transcription initiation factor TFIID subunit 1 isoform X2 [Benincasa hispida]
XP_038899362.1 | 0.0 | 87.38 | transcription initiation factor TFIID subunit 1 isoform X1 [Benincasa hispida] >...
XP_022959797.1 | 0.0 | 86.99 | transcription initiation factor TFIID subunit 1 isoform X1 [Cucurbita moschata] ...
XP_023004897.1 | 0.0 | 87.04 | transcription initiation factor TFIID subunit 1 isoform X1 [Cucurbita maxima] >X...

Match Name | E-value | Identity (%) | Description
A0A6J1DM24 | 0.0 | 99.89 | transcription initiation factor TFIID subunit 1 OS=Momordica charantia OX=3673 G...
A0A6J1H5J5 | 0.0 | 86.99 | transcription initiation factor TFIID subunit 1 isoform X1 OS=Cucurbita moschata...
A0A6J1L0S9 | 0.0 | 87.04 | transcription initiation factor TFIID subunit 1 isoform X1 OS=Cucurbita maxima O...
A0A0A0KA05 | 0.0 | 86.35 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G357030 PE=3 SV=1
A0A1S3C886 | 0.0 | 86.19 | transcription initiation factor TFIID subunit 1 isoform X1 OS=Cucumis melo OX=36...

Match Name | E-value | Identity (%) | Description
AT1G32750.1 | 0.0e+00 | 52.01 | HAC13 protein (HAC13)
AT3G19040.1 | 0.0e+00 | 46.73 | histone acetyltransferase of the TAFII250 family 2
AT1G20670.1 | 1.1e-07 | 34.51 | DNA-binding bromodomain-containing protein
AT5G55040.1 | 1.4e-05 | 28.77 | DNA-binding bromodomain-containing protein
AT5G55040.2 | 1.4e-05 | 28.77 | DNA-binding bromodomain-containing protein

InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 1262..1286
None | No IPR available | GENE3D | 3.10.20.90 | | coord: 646..736, e-value: 2.5E-12, score: 48.5
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1199..1221
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1635..1656
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1490..1514
None | No IPR available | PANTHER | PTHR13900:SF3 | BNAA01G26290D PROTEIN | coord: 9..1291; coord: 1371..1890
None | No IPR available | CDD | cd17064 | Ubl_TAFs_like | coord: 664..735, e-value: 6.97852E-30, score: 111.771
IPR001487 | Bromodomain | PRINTS | PR00503 | BROMODOMAIN | coord: 1842..1861, score: 26.85; coord: 1824..1842, score: 28.15; coord: 1794..1807, score: 31.83; coord: 1808..1824, score: 56.65
IPR001487 | Bromodomain | SMART | SM00297 | bromo_6 | coord: 1771..1881, e-value: 1.4E-24, score: 97.7
IPR001487 | Bromodomain | PFAM | PF00439 | Bromodomain | coord: 1782..1857, e-value: 7.1E-18, score: 64.5
IPR001487 | Bromodomain | PROSITE | PS50014 | BROMODOMAIN_2 | coord: 1791..1861, score: 18.971001
IPR000626 | Ubiquitin-like domain | SMART | SM00213 | ubq_7 | coord: 664..736, e-value: 1.2E-9, score: 48.1
IPR000626 | Ubiquitin-like domain | PFAM | PF00240 | ubiquitin | coord: 666..737, e-value: 4.0E-10, score: 39.3
IPR000626 | Ubiquitin-like domain | PROSITE | PS50053 | UBIQUITIN_2 | coord: 664..734, score: 12.574503
IPR022591 | Transcription initiation factor TFIID subunit 1, domain of unknown function | PFAM | PF12157 | DUF3591 | coord: 583..1147, e-value: 3.0E-125, score: 418.4
IPR009067 | TAFII-230 TBP-binding | PFAM | PF09247 | TBP-binding | coord: 11..57, e-value: 1.1E-8, score: 35.2
IPR036427 | Bromodomain-like superfamily | GENE3D | 1.20.920.10 | | coord: 1657..1893, e-value: 1.3E-26, score: 95.0
IPR036427 | Bromodomain-like superfamily | SUPERFAMILY | 47370 | Bromodomain | coord: 1768..1886
IPR041670 | Zinc knuckle | PFAM | PF15288 | zf-CCHC_6 | coord: 1399..1418, e-value: 4.7E-6, score: 26.4
IPR040240 | Transcription initiation factor TFIID subunit 1 | PANTHER | PTHR13900 | TRANSCRIPTION INITIATION FACTOR TFIID | coord: 9..1291; coord: 1371..1890
IPR018359 | Bromodomain, conserved site | PROSITE | PS00633 | BROMODOMAIN_1 | coord: 1796..1853
IPR029071 | Ubiquitin-like domain superfamily | SUPERFAMILY | 54236 | Ubiquitin-like | coord: 656..736
IPR036741 | TAFII-230 TBP-binding domain superfamily | SUPERFAMILY | 47055 | TAF(II)230 TBP-binding fragment | coord: 15..59
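
The alignment entries above give each domain hit as a `coord: start..end` range. As a rough sketch, assuming these are 1-based, inclusive positions on the predicted protein (an assumption; the page does not state the convention), the length of each hit can be derived as follows. The helper name `span_lengths` is hypothetical.

```python
import re

# Matches entries such as "coord: 1782..1857"
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def span_lengths(alignment):
    """Return (start, end, length) for every 'coord: a..b' range in the text.

    Length assumes 1-based inclusive coordinates: end - start + 1.
    """
    return [(int(s), int(e), int(e) - int(s) + 1) for s, e in COORD.findall(alignment)]

# PFAM PF00439 (Bromodomain) hit
print(span_lengths("coord: 1782..1857"))                 # [(1782, 1857, 76)]
# PANTHER PTHR13900 hit, reported as two ranges
print(span_lengths("coord: 9..1291 coord: 1371..1890"))  # [(9, 1291, 1283), (1371, 1890, 520)]
```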

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
MC09g0858.1 | MC09g0858.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0051123 | RNA polymerase II preinitiation complex assembly
biological_process | GO:0006413 | translational initiation
cellular_component | GO:0005669 | transcription factor TFIID complex
molecular_function | GO:0016251 | RNA polymerase II general transcription initiation factor activity
molecular_function | GO:0017025 | TBP-class protein binding
molecular_function | GO:0003743 | translation initiation factor activity
molecular_function | GO:0005515 | protein binding
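
For downstream use, the GO assignments listed here can be grouped by category. The following is a minimal sketch under the assumption that each row carries a category, a `GO:` accession, and a term name in that order; `group_go_terms` is a hypothetical helper, not part of any GO library.

```python
import re
from collections import defaultdict

# category, then a GO accession, then the term name; separators may be spaces or " | "
ROW = re.compile(r"(\w+)\W+(GO:\d+)\W+(.+)")

def group_go_terms(rows):
    """Group (accession, name) pairs by GO category; rows without an accession are skipped."""
    grouped = defaultdict(list)
    for row in rows:
        m = ROW.match(row.strip())
        if m:
            category, accession, name = m.groups()
            grouped[category].append((accession, name))
    return dict(grouped)

rows = [
    "biological_process GO:0051123 RNA polymerase II preinitiation complex assembly",
    "biological_process GO:0006413 translational initiation",
    "cellular_component GO:0005669 transcription factor TFIID complex",
    "molecular_function GO:0017025 TBP-class protein binding",
]
for category, terms in group_go_terms(rows).items():
    print(category, [acc for acc, _ in terms])
```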