CSPI01G06630 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI01G06630
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Transcription initiation factor TFIID subunit 2
Location: Chr1: 4132454 .. 4149485 (+)
RNA-Seq Expression: CSPI01G06630
Synteny: CSPI01G06630
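
The Location field above can be unpacked programmatically. The following is a minimal sketch, assuming the `chromosome: start .. end (strand)` layout shown in the Overview and assuming the coordinates are 1-based and inclusive (a common annotation convention, but not stated on this page). The helper names `parse_location` and `span_length` are hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'Chr1: 4132454 .. 4149485 (+)' style string into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("Chr1: 4132454 .. 4149485 (+)")
print(chrom, strand, span_length(start, end))  # Chr1 + 17032
```

Under the inclusive-coordinate assumption the gene spans 17,032 bp on the plus strand of Chr1.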
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCAAGCCTCGCAAGCCCAAGAACACCGACGACGCCAAGCCACCTGACAACTCCGGAGCTGTAGTTCGTCACCAGAAGCTCTGTCTTTCCATCGACATTGACAATCGTCGCGTTTATGGGTTCGTCTTACCTCTACCTCCTTCAATTATGATCTCTAACTCCCTCTACCCATCTCCCCATTTGCTTAATTTCTTGGTGGATTCATGGGTTTGTTTTTATTTTTCTTGCCTGTATGCGTATGAGTTTGCTTTGCTGAATCTTGGCCACCGATAACTCACCGTTCTTGTTGCTGGATTTTAGGTTCACCGAGTTGGAAATTGCGGTTCCTGATATTGGTATAGTTGGGTTGCACGCGGAGAATCTTGGGATTGTGAGTGTTTCAGTGGATGGTGACCCAACTGAATTTGAGTATTATCCGCGGCCTCAACATGTGGAAAATGAGAGGAGTTTTAAAGCAGTTTCGTCGCCGAGCTCTGCTGCAGATGCTGCAGGGTCAATCTATTTGTCTTCAATAGAGAAGGAATTGGTTCCTAATTTGTTGATAAACTGCTGCAAGGCTTTCAAGAGTGGAAGCGAGCAGCAAGACCAGCCATTTCTGGAGAATGGAGTGCAAACTGCGGATGAGGACAAGCAGGTTGTAGCTGTTAGCTATTATCACGATGACAATCTTTTATATGGGGGGGATATGTACATGGATGCGAAATGTTATACTAGTTATTTATTTGTATTCCATGTATTCTGGCATGTAACTATTGATGTTATGAAGTATATTATAACTTGTCTCTCTTTGGAAATGGTCTCTGGTACATTTATTGTTCATGCAGTTCCTATCTCTTGCAAGCTTGGTGCAAGTATTTTGAAGCATCTATTTGTAATGAGCAGAACTAATCATACGTTTGCTGTGGTAGATTTGGAAATGAAGTAGTTATCTTTGTTTAAGTATCCAATTTTATCATGAGCAGAAAGAAATCAGTATAATTCTTGTTTAGTTCAAACTGCTGAATATTTTTTTCACATAAAGCCGTGTGCATATCACAAACCGAGTTAGGCAGTTATAATTAATGTAAGTTTACGTACTTTTGTGAGAACCAAGGAACAAGGTTCTCTTATTGATATGTAATTGTTTGTGAAAGTGCTGAAGCTCTATCGTAAGTCATGTTTATTATTTCTTTTGTGGCTTCTCATGAAGGACAACTTTTGATGCAGAATGTAAGGCTGGTTCGTATTGATTACTGGGTAGAGAAATCAGAGGTAGGTATTCACTTCTATAATCGTATGGCTCACACTGACAACCAAATAAGGCGTGCAAGGTGCTGGTTCCCTTGTATGGATGATGGTCTACAACGATGCAAGTAAGGATATCATGATCTGACTGGGTTAGACAATTTTAACTTTTTACAACCCATTTATATTCTATATGCTGTCTTATGTTCAGGTATGACCTCGAGTTTACTGTCTCTCAAAATCTTGTGGCTGTCAGTAATGGAATTTTGTTGTACCAGGTGAGAAATACTCTGAAATGAGATGTTTGTTTATCAGTTATGTGGTATAAAGCAATTTTGTTGAGTAATGTGTACCATATGTTATTTTTCTTCCTCTTATTTTATTGGATAAGAAACAATTTCATTGTTTGGACGAAATAATCTAAAGGGGTAAACGAGAGCCCTCAAGGGATTAAAAAGATGTCTCCAATTTGCAACAAGAAAATTTAAACTCAAAACTGTTAAAGGCAAAAAAATGCGTGGAAGATTGGTAGATCTAAGATTTCCTTAGAGGATCTATCTTTGCATTTGAAAAGCCTAGCATTCCTCTCATCACACAATTTCCTCAAAATGGCATCGACAATATTTGTCCATTGGAGACTCTTGGTACTTTTGAAAGGTGTTCGTTCAACATCGTTCCCAACCAATCTATATAGCCATGCTGAGGTGCTATAGACTAACAAAGGTAGACAGTAATTCACCTCATATGTGAGTAGCAAACAGGCAGTGAATGAA
AATATGACTTTGACATTCATGATCATTCTGACAGAGGCAGCAACAATAGGGAGAGAGGCTCATCCATGGACATGTCTTTTGGAGCAGATCCTTAGTGTTGAGACAAGAGTGGCTAATTTCCCAAATAAGAAATTTTACTCTCTTTGGGCAATGGAATCCCCACAATTTCCTTTAAAAATCAGCATTAAGAGAGGGCTCATCAGAGGAGAGGTGTATGAGCAAAGATTTTGTGGAAAAGACACCACTGTTATCTAACCTTCTATCTCCAGTGATTAGGCTGTTTTGATAGATGAATGTTGGGATAGAGCTGTGTGAAGGAGATCGTTTAATTTATTTCATCATCTTTAAGATTTCTTCCAGTATTCAAAATCCACGAAATGTTTTCTGTGCTCCAAACTTCTGATGTAGGGGCAGATTTAGACTCTGATATTGCAAAAAGCTTGGGGAATCTACTTTTTAGAGGTTCATTACCAGTCCATTGATGAAACCAAAGTGATGTAGGTTTTCCATCACCAACAATGCACTGTATATGGCTGTAAAAGAATCCCCGTTGGTTCAAGATGTATTTCTAACGCATCTTTGCTGTTGGTTTCTTAGGACTTTTCTTGGACTATCTAAATAAGGCAAGATCTTAAACCTGGCCTAGTGGTGCCCTGGGTTTAGTAAAGGACTTAGAGGGAATGTACTCAATCCATGGTGGCCACCTACCTGTTGGTTTCTTTGCTGTTGGGTCATGCATGTTGTCTCATGAGATAGTCGAGGTTCGCGTAAGCTAGCTTGAACACTCACGAGTATCCAAAGAGAGAAAAAGGCACGATCTTTTTCCTTGAATTTTATTGTATTTGAAAGTTATGGAATTTCAAATTACTTAAATCAACGAAGTTTGTCAAGGTACAATAATCATTTTCCTTGAAGTTTATCGTATTGATCTTCATTAGCTATTCTCTTAATCCCTAGATAAAGAACTACAAAGTTATGAAATCTCAAATTGCTTCAATCAACTAAGTTCGTTCTTTGTAGTTTTTAGCCTTATTATACCTGACTTTAAATGTTCATTCACTCTTTGCTGAATTTTAATGAAATCAGTAGTCTAGTGGTCAGGATATTGATGAAAACTTGGAACGTTTAGTAGTTGACAAGATTCAATGTTGAATGGTGTTTAGTGTAAATAATCTCAAGTGTTGGCCTCTTTATTGATTCTTTTTTTTTTTCATGGAGTTGAATGCAAATTTGGCTTGAAGGAGAGCAACATCTCCTTGAAGGATGATAATTGGAAATACATAGTAAATAACCCCAAGTGTTCAAGTAGGTTATATCTGGCCACTTTTTGGAGATAATAAGTAGTACATTACTTCATGGTATTGAATGAATCCATAAAATCTTTCCTAAGTGACTAAGAAAAATTGGGTCTCCATTTGAATTACAAATTGAATTTCATGTAGTTGAAATAACCTCTTGAAAGAGAGTTCACTTTTGTAATTACATTAGATATTTTTTTGGTAAGAAAATTACATTAGATATTGTTGACGATAAATTGAGGTATCTTGCTAAATTTATCCAATTCATGGTTTCAATTATCTTGTGATTGTGGGAGTTAAGGACAAAATCAATCGAAATGTTGTTCTAAATGTTATCTGGAATAGAGACTTGTGACTAAAGATACACTCGTTCAAGCTATTAAGTACTTCTAGTACTTTGGCAATGAATGAAGTGTTATTTTTGTTACTGAACCAAGCTTTCATTGAGGAAAATGAAAGGGTTTTGTGAACAAAAGTTCATTCTTTTTGACATTTATCTAAAATCTATTTCTAAACAAGGTAACCATTGAACCCATTTAAATGAATCTTTTAATGTACATGTTTTCACGGAAAAATTTCAGGACCGGAGGCCTTTTCTTTTTTGGCTCCCTTTTATTGTGGAATTGTATTTTCTGGATGTTTGTGTAATCTTCATTTTTTCTCCATGAAAGTTGTTCATTCATAAAAAAGAGATTACAATAA
AGTTGCTCAATATTGCTGGATCACTAGTGGTCTAGTTTTTATTCAAGAAAGTTCTTGTCAGGACTAATATATGTTTGAGATTTCTCTGGGCTTTATAGATATTAATCTAATTAGAATATTGAAAGGATTGTAGAGAATTGAATTTATTTTAATTTAAACTCACTCTCTCATCTTGAATGTGCTTACGAGGGTGAGGGGTCAATATAATATGTTTTTACGGCAGAGTAGGATTTTTAGTGTATGTGTTTGAATTAAATACCTTTTAGATTGGCCTCTTTATTGGGTACAACTGTGCCTCGATCCTGGTTGGTGCCTATTTGCGGGGATAGCTCAATTGGGAGAGCATCAGATTTTAATCTGGAGGTCGCGTGTTCAATCCACGCTCACCGCACTTATTTCTGGGATCAAACGTGCCCTTTCTCCATGTTTTGAATCGATGGTTGGGCAAAAGGTTCCACCTGGAGCAATATTCTGTTTGGACGCAGTTAAATTTTATGATTTTTTTAAATAAAGAAAATGACCTTAATATCATTATCTTATATGTTCTGCTTCAAAACATATTTCCTTCCCATATTGTACTAGAATATTAGCTAATGATATTATTTGTCCTATCAAACCATGTTATGCTACATTATTTATTAAATGAAAAAACGTGAAGTTAATACTAAACTATAAAAACCATCATAGATTGGTCTCGTGCTAACAAGGAGACATAGTCTCAACCTCTAACCAAGAGGTCATGAGTTCAATCCATAGTGACCACCTACCTAGGAATTAATTTTCTATGAGTTTCTTTGATACTCAAATGTTGTGGGGTCAGACGGGTTGTCCCGTGAAATTAGTTGAGATGCTCGTAAGCTGGTCTGGACACTCATGGATATAAAAAGAATATATATATATATTAATATGAATTTGAATTTGTATTAATTGTATGGGATAGTTGCAAATTTAGCAATTAGATTCAAAATAATTAAGTATATAGCAACATTTTAAAAAATTTGCAAATATAGCAAAATTTGTCAAACTCTTTATCAATAATATAAGTCTATCACCGATAGACCATGTTATAAATATTGGTCTATCAGTAAGTCTATCAACGATACAAGTCTATCAGTGATAGTTTTGTTATTTAAAAATTTTGCTGTATTCTTAATTATTATTGTTGAAATGGTTATCAATTGCAATTTCCCTAATTGTATTCACTCTTTGAGGGAGGATCTCATTTTTGTAGATGTTCATCTAACAACCACAAAAGAATTGAAACTATGTACTGTGCTTTCAAATGCTTTGGATTACACAACTTGTGTATGTTGAAATATTATTTTGCATTCAGAGGATTATCTCAAATATCTCATCCAAAATCACCATCTTCCGGACTCTGTTTTCTTTATCTTGTAAAGGTGGTATCACTGAATAAATCAATTTAGCTAACGTTAAGATGAGCTGCACACACACATGTTGATTTGTAATGTTGGCCTTGTACAACCATGGGGGGTCTGGTTTTCATTGTCTTCCCCGTGATTGCATTTTGCTTTTCTAAGAGATACATCAGTACGAAGATCCCCTCAAATAGATATCTTGGAATTCAAAGTTAGGTAAGGGGTAGCATATTACATCTTGCTGCAAGTTGCCTCTAATCTGACTTGCATATACGTTTTCTTGTCTTGGCGAATCTTGTTTCTTATTTGCATTTCATAAGCCGGGGGCAGTTGATGTTTTAATGTATATATATCAAGTTTCTCTCTACATATCAAATCAGCTTCAAGCATTTTGGATGTTTTGACTTTCCATGTATCTCCATTCCTTTCAGGTCTTGAGCAAAGACAATCCTCCCCGCAAAACTTTTGTGTATAGGGTAGACATCCCTGTCAATGCACGTTGGATATCACTTGCTGTTGGACCGTTTGAAATCCTTGCTGATCACCAAAATGTACTTATATCACACATGTGCTCGCCTGTTAATTCTTTAAAGCTCAAGCATACGGTGGATTTTTTTCATA
GTGCATTCAGGTTTGTTTTTCAACTTTTTGAATGCTGCTGTAATGATTACAAACATAAGTTTTATGTGTGTAAATTATGTCTTTCTTTGGAATTTACTTTGCACAGCTGCTATAAGGATTACCTCTCCGTGGACTTCCCATTTGGGTCATATAAACAAATCTTCATAGAGCCGGAGATTGCAGTATCATCAGCATGTTTGGGAGTTTCCATGTGTATATTTAGTTCTCATCTTTTGTTTGATGAGAAAATCATCGATCAGGTTTGTTTATTTATAGGGCTTGTATAGCATGGGCTTAACAATTCATTTGTCTTGGAGTTTATATCTGTTGTAGGTGGACATTCAGTTTCGTGCGTGCGTGTGTGTGTGTGATATGCATCAGTATGAGTCCCACACAGGCAATACTCTAGCTCTTGCAAATATCCACAAACTTGGTGGTTATTAGGGCCGAGTAAAGGGTGAAGTAGTGAAGGGACTTAGATAAGTTGAAACTATGGTGGCTGCCTGATTAGGATGTTAATATTTTATGAATTTCTTGGTCAGCCAAATATTGTAGAGTTTGATGATTATCCTATGAGATTCGTCGAGCTATGTACAAGTTGGTTCAAATACGAGATATAGAAAAGGAAAAAAGAATCGAAATACCGATCTCTCTCTCTCTCTCCATCCCCCTCTTTCGTTTTTTTTTTATTTGGGTCTGTCCAAGTAACTGCCAACTTTTGTTGGTAAGGTGTCTGGCTCCGCAGTGCACACATCCATAATTTGTTTTTCTTTTGTCTTTTTAACAACAACAATAACATGTGTGTGTGTGTGTGTATAGGTAAGAAACCAAGAGAAATTATGTAAATGACTTGAATGCTAAAAACTCGGTTGGTTTGCTTCTTTAATACATGTGAGTTTTGATCAACATTCTTTCTCACAGACCATTGATACGAGGATTAAACTTGCTTATGCGCTGGCAAGACAGTGGTTTGGCATTTATATTACACCTGAAGCTCCAAATGATGGTAAATGGTTATCACATGTTCTCGTTACAATTTACTGTTTCATCTTCTTTCATCTTTTTTTGATCAGGCATGTTTGATTCTTATTGCTAGAGTGGTTGTTGGATGGTCTTGCTGGTTTTTTGACTGATTTATTTATTAAGAAGAACTTGGGAAATAATGAGGCACGCTACCAGAGATACAAGGTTTGCTCCACCATGTTCTTGGTTGCTTGCAGTTATTCTTCATTAGATTTCTTCATCTTTCTTTTCCTTCTTTTCTTCTTGAGGGCTTGGAGTTACTTTTCTCTACGTTGGTCATCCAAAGCAGTTTTACTATTTCTGGTGTCACCAACCATAAAATGAGGGATAATTGCAATTTGTGCCTCTGTACCTTGGCCATTTAGCGATTAAATCATGTATTGATCTGATATTTAATTTATTAATCCTACCAGTGATGTCATTTGTTATTAGTCCTCTGGTTAAATCCAACATACGAGTTTCTTGTTAAAAAAATATTTCACATTTTCAAATATCAATTTACATTGAGAAAACTCTGTTATGCAGGCAAATTGTTCTGTTTGTAGAGCAGATGATTGTGGTTTGACCACCTTGAGTTCCTCTTCTGCTTGTAAAGATTTGCATGGGACCCAATGTATTGGTATATATGGAAAAATAAGATCATGGAAGTCTGTAAGTTCACTTTCCTTATTTAGGTTTTTACCTCTCATTCAACCCTCTAATGTTTTTTTTTTTCCTTCAACAAGGTGGCAATCCTTCAAATGTTGGAAAAGCAGATGGGGCCTGAATCTTTCCGCAAGGTAACAGTTTGGGTTTATGAAGATATCAAGTTCAAGGATGTAAGGATGATTATAGATTGTGCCATTGGTTTGCATTTTAACTTCTTTCGGTCGGATAAGATAGAAGTGTTTTCTTCTCCATTTATTTCTTCAACATTTGTGGATTTGGTCGAACACAGTACTCATTTGTATACAGGGACACTCTTATGTGCCCCAATA
TGATGCAAGAAAAATAGGGGTCCATTGGGTAATTGTGTAATATTCTAGTTTAATTCCCTTTTGACCCTATTTGTAATAAAATTCTATAAATAGGAGTCTTTTCCTCTTGTTTGAAACAATTATTATCCTTCTAACAAAAGATTCACAATTTTGTTTTTCGGAGGATTAATCCTTGAGGCTACTTAGACTACATCACAATAACAAAAGACCAAAATGTCTTCTCTATTGCGGTCTCAGAACCTTGGGCTGAAAGCCATCCCCTAGCTCTTTCCAATCCAAGTTCCCCTTGGTACATTTTGATAGTTTCTTCCCACATTCTGACCATTTAACTTTGTCCAGCCATAGTGAACGCTCTTCCTGCTTCTTGTTGAGCTGTAGTTCATCCTGACGTTACACATACTGTGTGGCTCAATTTTGTCAATTAATTTTCCTTGTATGTTTGAGAACACACTGTGCTACAAAGACCTACTATTTATAGAAGATAAGTTTCTTTCTTGATCCTTGTATTGATTTCTATGGTCCAGTGGTGTCTTTGTCCTCCTTTTGAGACTATGGTTTGAAAGTAGTTCTATAATTTTTCATGGCCAAGAAACTAACATGGGCATATTTTTTGACATTTGGTCACCCTTTTTGCTTCTACAGTCTCTTACTCCATTTTTTAAAATTATTATTACTATTATGATTGCAATTGTATGCAAGCTCGTATTTACACCATTTGGGCATCTTTTTTTGTAACCATTGGTGTTTGGTTACATCTCTTCTCTCCAAACTCCCATCCTTTAGTGTTTCTTCTCTTGGTAATGTAAGTTATGTTTCTTATAAAAGAAATGATTTTCATGTTGATTATGTTTATTCTACTGTGCCAGATTCTACAAAATATTGTTTCTCATGCTAAAGATACTGGTTCTACGTCACAATTACTCAGCACGAAAGAGGTACTAATTTTCTTGTATGATGATACTTTACGAGGGCTTTTTGTTTTGGAACATCCATTTACTAAAATTGTGTTAGAATACTGAATGGTTTAGGTTGCATTAGGCTTGTGTTTTCAAACTCTACAATTTTACCTCTAAGTTTGCAATTAGCATTATTGGTCCTTGTTAAGTTCTTCATTTACAAACAGAAAAGTGGCAATACATATTTTTATATTTTCATTATTAAATTAAATGAAACATTTGTTTCATTTTATTTCCTCCTTTTCCCTCGTCTCGATCCATTTTTCCCTTCATCGTCGATGCCGGCTGCTGCCTCTTGTTGCTGCCAAGCTAACCCAGAGCTCAAATTCGACTGGTAGTATGTTGAAGATCATCTGAACAATTCAAACAAGAACAAGCAGATCCACAAATGGTTCTGGGAAGAACAACCACCAATCCAATGACCCACAAGCACAAAATTGGCGAGTCCACTTTCATAGACTCACTGGCTTAGCCAGCATAACCACGATATACAAACTTGTGACAAACGTTTCCATCATGAACCAAACGTGAAACGATTACAAACCTTCAGTGGAAACTGTTGATGCAACAAAACTACAAATATTCTCTTTCTTCAACATTCTCCAATGCCTCTTTGACTGTTTCCATTGAGCGTTCTCTGACATTTGAACTCAGATCTGAAATGAACCTAGATTTGTTTCAGATCTAAAACTTCATTTCTGTGTGTATGTTTAAAGAACAATTTTTTTCCTTCAAAATGGGAGAAAAAATAGGAAAAAAGAGGGGGAACAAGAAGCAGAAGAGGATGAAGAGGAAGGAAGAAGAAAAAAGGTCACTGACGATGTCCACTGATGTTTGACAATGGTTAGTGGCAGCCATGAGAAAAAGGAGAAAGGGTGGAAGAAACAAACAAGCAAGAAATAGATCTAAAATTATATTATTATTAGTTTTACATAATTTAATAAAAAGTTTTGGTCTATGGACATGGTGAATGAGAATGACGTCTAATGTCAGACTTCTCTATAATTTTGCGATGCAGTTTCGTCAATTGGCCAATAAGATTGG
AAACCTAGAGCGACCATTTCTTAAAGAGTTCTTTCCTCGATGGGTTGAATCATGTGGTTGTCCACTGCTGAGGTAAACAATATATCATCTTTTATGCCCTTATGTCTGTTTATTTTGCTTACCTTGGTTTGTATGTAATACCAGGATGGGATTTTCCTACAACAAGCGAAAAAATATGGTTGAAATGGCCGTTTCACGGGAATGCACAGCTACACCAGCCACAAATGTAGAAAATCGGGACAGTGATGCTGGATGGCCTGGCATGATGAGTATCAGGATTTATGAACTTGATGGTGTCTTTGATCATCCAGTTCTTCCAATGACTGGAGAGTCTTGGCAGCTACTAGAAATACAATGCCACTCCAAACTTGCTGCTAGGCGCCTCCAGAAAACTAAAAAGGGGTCAAAACCTGATGGTTCTGATGACAATGCTGATATACCTGCACTCGACATTCGCTCTAGGTTTATTATTCCCTCATCATTTGATTTTATCACTGTTAAGTCAATCTGCTTATCATTTACTTGAATGCACTATATTGTCCCTTTTCAGTGTGGAATCTCCTTTGTTATGGTTGAGAGCAGACCCTGAGATGGAATACCTTGCTGAAATTCATTTTCATCAACCTGTTCAGATGTGGGTAAGACTGAATTTTGTTATCCATAATATTTTTGTTGGTAGGGTACATGTTTGCCTGTATTTCATTTCTGGTCTTTTGATTTCGTTTTTTGGCATTTTAGGGGGACAGTCACTAGGATTTTCATTAGCATGCAGATTTTGTCTATTCATGTTGTTTGTTGTTGCTTTATTATTATTATTGTTATTATTATTAGTTCCTTTCTGTGGATTTGGGAAAAAAAATATTTTAAAGCCGTTTTTTGGTGGGAACTATTAGGAGAAGCTAAAACTAATGTTTGAAAAATGGGCTCTGATACAAAGATTTATCAGTATTTATAAAACTAATGTTTGAATCATATTATTATTCTAATAAAAGATTCATGAGCTTGCTTCTTGGAGGATTACTCCTTGAGGCTAACTTTGGTTACATTAATTTGGTTTCATAGCGACCACCTAGAACGCCGATGTTAGACTGGAAATGATGGTTTAACAATTGTGTATGTATGCATTTTTGTGTGAAATATTTACTCTAGTTTGTCTTTTACTATCAGATCAATCAATTAGAGAAGGACAAGGATGTCATTGCCCAAGCACAAGCAATTGCGACCTTAGAGATGCTGCCTCAACCATCTTTTTCTATTGTTAATGCTCTTAACAATTTCCTCAAAGACCCCAAGGTTACACATCTTTTTGAGACACGGTTCTTTTTTATTATGTATTAAACTTACAATGGATTGTGAGTTGCTGAACTGTCGTATGAAGGCCTTCTGGAGAGTAAGAATTGAAGCAGCACTGGCAATGGCCAAAACAGCCTCAGAGGTATTATTGCAATTATTTTATTTCAGAGTTGAACACCTAATGAAAACCTTCATTTCTTTATCCAAATGGGTTTTACAGCAAAATTTCTACATTTATTGACCATGAAATGTTGGAAGGTCAAATGGTGGTTCCTCATTTTTCCAACTCAGTGGAACATAGACATTCAGGAGCTTTAAATTTGTGACTTAACCTGTGTAAAGTAATGCAGGTTACTCATTCTTTTCGTTGAATTTGACTTTCTTCTGCAGGATACTGATTGGGCTGGTTTGCTTAATTTGATTAAATTTTTTAAAAGTCAGAGGTTTGATGCTGATACTGGACTCCCCAAGTAAGATCCAAAAGAGTTCTCTTTCAGTGAGTTTTCTCTTGGATCACCTATTATGACTCTTGCTTGTTCTGATATTCTTACCAGGCCAAACGAATTTCGTGATTTTCCTGAATATTTTGTTCTTGAGGTAATGCAATGAGGTAGGATTTGTAATGTGTTGTGGAATTGCCTATTTGGTTTCCACAGTTCAGAATATGGTTCTTATATAAACCATATGAACCCATAAAATTGTCACAA
TTTGCTGCATACCCTTTCTGGTAATCTTATGTGGGCACCAATTATTCAGATAATACTCATGCAAAACAACCTGATACTATGAAAATAATTAAAATGCTGATATGTAATTCCAGTTAAATCAGATACTCTGAAGTCCATATAATCTTGAAAGCTTTGCAGAATCTCTAGATTGTTTAAAAGATGATCTCATTGTGTAAATTCTTTAGTCTATCTCTGCAGTCCTTTTCTATATAGAGTTTGAATTGCCTTTTATGCAGACTACATACAAACAATGCAGATGCTTCAAATGATATAAGGACTTCTTTTTGTCAGTTTGTGGTGTGGTCATTGATGATTTCAGAATATATTGATACTGTTTGTGCTCTTACTTAATGACGTTACAATGCAGTTCCTGGTAGTCATCCCCATAGTTACCATAGTTAGTAGCATAACTATTGTTCGACTTAAATGTGACTTGTGAAGTAAAAAAGTGCATCACTGTATTGAATTAATCCTTCTTCAATCTCGTATAGATATTGGAAATCAAGGGTTGTTGGGTTGATTTTACTTTATTTGAGTGAAGATTTGTATACACTTCTATTTCTTACTATTTTTGGTTGACACCCGAAAACTAAGGTGCTGTATTTTACCTTGTGTTCGCATGTGTGGTTCTTTTTTGGGGGGTTCTTATAGGCCATTCCTCATGCTGTTGCTATGGTCAGGGGCACCGACCAAAAAAGCCCTAGAGAAGCCGTTGAGTTTGTTTTGCAACTTTTGAAGGTAGTATTCTTTCTGCTTGTTTCATAAGTGGGACAATATGTTATACCAAAAAAATCTTGATTTTCACCCTGCCTGATCTGGCTCTATGGAATATGAAGACACATGTTGTAATATGGCAATACATGTTGTAATATGACCATACATGTTATCCCAATTCTATTACATCGATACAACATTTCAAAACCATGCAACATCACAAAAATGGCCCCTAGCACTTCAAAATATCAAAATTTTGGGTAACTTTTAGATTTTTCTCAGGGAATGCCAAAAGAAGAATCGTTGTCTTATGACAACCACAGTTTTTTGTTTTTGTTTTGTTTTTTGTATTTTAAACAACCCCTGCTTGTTTTTTTGAACTTTTAAACTTAATTTTCTTCTTGATATTATGCTTCTTTTCTGCTCCTTTGTTACTTGAAAATCTTGTTTGTGATCTATTTTTTTTAATACCATCTTTGTATTTCGGTAGTACAATGACAACAATGGGAATCCTTACTCTGATGTTTTTTGGTTGGCTGCACTAGTCCAATCAGTTGGTGAGCTTGAATTTGGGCAACAGGTTGGAATGTTCTCCTCTGCTTTTATCATTTGGTAGTTTCTTGATATGATAAGTTGCGAAGTTTAATTTTTGACATCTTAGTTAATGTTTACATGACTGCATCCCATGCAAAACTTTTTAATATTTTATCTATTTCAAGTCTAATCGATGGTTGGCGTCATATTTATAAAAGTTCCATTTGACCGCAATTTCCGTTTTCTGTTTTAAAAATTGAAGTTTGTTTCCCAAACCCTTGTCTTCTTCATTCTTTCATGAAGATTAGCTCTTCAATTTCTCAATTTAAGAGGAAATTTCTCGTCTTCTGAAACCACCAGAACAATCTTGCCAATATACACGGACACCTCCACTTCAAGAAAATCTACTTGCTTTAAAATGTAAATGGCTACACTCCCAAACCCCATTCATCATAGACTTGTTCTGCAATCCCTATTCCTTTATAGTAATGGGATTTACTATACATCTGTGTGGTCTTTCACGAGTGTTTCCGGTGAGCTCTTAGGCCTTGCCTATTTGTTTAGACAATTATATTGCCTCTGCTTTTATTGGCCATGTTTTTTTCTCTACTTAGACCCCTGTTTTAATTTCTTTTCTTTTTCTTCCCTGATAAATTGCAGAGCATTCTCTTTTTAGCATCGCTTCTCAAAAGGATTGATAGACTTCTGCAGTTTGACAGGTTGGTTAACCT
ACTGGTATTTGACTAAATGAATTATGTTTATGAAGAAACTAGATCTTATGCTTCTTTAAATGATACCTGACAGGCTGATGCCTAGCTATAATGGGATATTGACCATTAGCTGCATCAGAACCTTGACCCAGATTGCACTGAAACTATCTGGACTTCTCTCTCTTGTAAGTTTTTTCTTGTCTTCATATTATTAGTCAAAAGTTGTACTTCATTTGATATTCTATTCCTCGAGAATGACTATATTGCATTTTAAACTTCCTACAGGATCGTATCATTGAACTGATAAGACCTTTTCGAGATTTTAATTCCATGTGGCAAGTTCGAATTGAGGCAACCCGATCACTTCTCGATCTGGAGTACCACTGCAACGGAATTGATGCCACATTGTTATTGTTTATTAAATATCTAGAAGAAGAGAATTCCTTGCGAGGTTTGTTTCTGTCATTTATCTTTCTGTTTTTCCCCTCATATTACTCTTCGAGTTATCCTTGGTCTTCTGTGCTTTTGGCCTCTTCTTTTGTTTAATTTTTCTTTCTTTGTATGACTATTCCATTTATTTGTTTCTTATAATGAGTTTTCTCTGAGCTCAAGGAAAGAAGTTCTCATATTGCAGGGCAAGTGAAGCTCGCAGTCCATGTTATGCGTTTATGTCAGATAATGAGAAGATCTGGTTCAAATGATGTAGTCAATAATGACACTCTTGTTGCTTTGCTCCTTCTATTGGAAGGCAATATGGCGTTTAACAACGTCTATCTTCGTCACTACTTGTTCTGTATTTTACAAGTTCTTTCAGGAAGGTAGGTACCCCTTTTCCTGGTAATGCTAGGTTGAATTAAGTAGATATGGGAATACATAATTGTTCTGTCTTATAGTTTAATGCTTGCTTTGCTATTTTGTACTTAATCATTGTATTCTTTGTTTTTCCATTTTTTAATCCCCCACCCAGACCATGTCTCTGTGGCAAATAGTAGACAAATGGACTTAACGTGTTAGGTAGTAATGCTTATAATCGTTGATAGGTTTAATAGGTTTAAGTTGTGCTTCTTCAATATCTACTACCATATAACCAAGCTTGCCTCACAGTGGATCTTTGGGGATAAGTATTCATCGGCTTCCCCAATATTCTTCATGAGAGGGCACAGTACTGTTCTCTGTTGTTTACATTTTGTTACATTCCTCTACTGTATTTGTACAGTCATGCTAATGTTTGTTTTTCATTAGTCTGACCTTAATGTTCCTGTGACGGTATGGATTTTGTTATTTATTCTTGTATAAACTCTTTTTTTACTTCATTTTCAGGTCCCCAACACTTTATGGAGTTCCTAGAGAATATAAAACATTGCATATGGGAGACACGGGAACTTTCAGTGAACAGAAAAGGATGTTAACGTCTCTCATTCCTGAATTTAACCCACCAGAGCCATCATCAGTCTCTGCAGTTGCGCCTATGCCATGTATTCCAGCAACTCTTTCATCAGAACCGCTTCATGTTCCCACACCGAGGCCTGACAACTTGGCAGTTCCCGAGCTTTCCAAGGAAGAAGGAGCAATTGCAGAGGATCCTAAGCAAGCTATGGCAATAGTTGAAGCTCCGAGAGAAGCAGCCTCTGTTTCCAGTAGTCATGAGAGGAAGTTACCAGTTGTTAAGATCAAAGTCAGATCATCTGCTGCTACAAGTAGAGCAGATGCTGATAATCTAACCACTGAAAGATCGCATGCTGCACCCCGTGAAACAGATGTTGGTCCTAGTAGTTCTGTTTCTGTTGATGCACCCCCAAGAAATACTGCTGAGGCTACAAGTATCAGCAACCGTATTCTGGAGGAGGTAAACTCCTGTCACGATCACGGGTCTCATATGACAGCCAGCATTGGCAGTGCAAAACTTGCAAGTTATGGTGATGAACTTGGGAAGGAATTCCAGTGCACTGCTGATTCCAGCAGTCGAGCTTTTGGACATTTTCAGCCAGAAGATCCTTCATCATCGAGCATCATACAAGATA
ACAATATTGATGCTGATGCACAGAAGTATGCAAGTCTTCAGACACTTTCTTTACCACAGCATGATCATGGCTTAGCCTCTTCGCAGTCTCGTCATGGGAAAAAAGAAAAGAAGAAAGACAAAGAAAAGAAACGTAAGCGAGAAAGTCATAAAGAACATCGCAACGATCCTGAATACATTGAGCGCAAGCGACTGAAGAAAGAGAAGAAACAAAAAGAAAAGGAGATGGCAAAGCTATTGAACGAAGAGGTGAAGCCACAGCCTACTGCAATGCCTCGTATAAAAGAACCACCAACCAAGTCAACACCCATGCAATTGGAAACAAATGAACCTAGTGGATCAAGATTAATCATAGGATCAGTTCATAGTAAGCCGGAGGCATCAGAAGGTACAACTTCTGCTGCCCCCAAACTTAGAATTAAATTTAAAAACCGAACGTTGAACAATTCATAATGGTTGTACTCAGCTCGTATATCAGTTGAAATCCCTACAAATTGTTGGCATTCATCCTCAGTCCTCTTTGAGGCTAGGTATATAATCTGAGCCCCACGACATAACAAAATGAAATTCAATCAATTCAAAAGCCCCTCCTTGTGAGTCACTCTTGACATTCTTTACCCTCCATCTCAAAGTCGTACAGAAATGATAATTTCTTTTGTTAGTATCTGCTTACCTAAATGATGGACCTAATCTTATCTTGTTAAGAGAAGGATGAAGTGGAACCTAGTTCACCTGAAGGAATGATGGACCTGTAAAAGGGGAGAGTTTTATGCATATCCAAAGAAAAGGACTTGTAATTTGAGTGAATCTGGGATTGAATGCGGCCATTAGCATCTCAATTTTTGAAGTAATTGCAGGGCGAAAAGAGAAAAATATTGAATATCCAAAATTGAATCAACATCAAGTGTTCTCATTTCTCAATAGTTATCCATGTGAAATCTGTATGATGGCTGCGAACAACATTAGATTCGGCTTTAATGAATGATAATGGACTACTAAAATCAGGGTGGAGAGAAGTATTTCAAGTACACGG

mRNA sequence

ATGGCCAAGCCTCGCAAGCCCAAGAACACCGACGACGCCAAGCCACCTGACAACTCCGGAGCTGTAGTTCGTCACCAGAAGCTCTGTCTTTCCATCGACATTGACAATCGTCGCGTTTATGGGTTCACCGAGTTGGAAATTGCGGTTCCTGATATTGGTATAGTTGGGTTGCACGCGGAGAATCTTGGGATTGTGAGTGTTTCAGTGGATGGTGACCCAACTGAATTTGAGTATTATCCGCGGCCTCAACATGTGGAAAATGAGAGGAGTTTTAAAGCAGTTTCGTCGCCGAGCTCTGCTGCAGATGCTGCAGGGTCAATCTATTTGTCTTCAATAGAGAAGGAATTGGTTCCTAATTTGTTGATAAACTGCTGCAAGGCTTTCAAGAGTGGAAGCGAGCAGCAAGACCAGCCATTTCTGGAGAATGGAGTGCAAACTGCGGATGAGGACAAGCAGAATGTAAGGCTGGTTCGTATTGATTACTGGGTAGAGAAATCAGAGGTAGGTATTCACTTCTATAATCGTATGGCTCACACTGACAACCAAATAAGGCGTGCAAGGTGCTGGTTCCCTTGTATGGATGATGGTCTACAACGATGCAAGTATGACCTCGAGTTTACTGTCTCTCAAAATCTTGTGGCTGTCAGTAATGGAATTTTGTTGTACCAGGTCTTGAGCAAAGACAATCCTCCCCGCAAAACTTTTGTGTATAGGGTAGACATCCCTGTCAATGCACGTTGGATATCACTTGCTGTTGGACCGTTTGAAATCCTTGCTGATCACCAAAATGTACTTATATCACACATGTGCTCGCCTGTTAATTCTTTAAAGCTCAAGCATACGGTGGATTTTTTTCATAGTGCATTCAGCTGCTATAAGGATTACCTCTCCGTGGACTTCCCATTTGGGTCATATAAACAAATCTTCATAGAGCCGGAGATTGCAGTATCATCAGCATGTTTGGGAGTTTCCATGTGTATATTTAGTTCTCATCTTTTGTTTGATGAGAAAATCATCGATCAGACCATTGATACGAGGATTAAACTTGCTTATGCGCTGGCAAGACAGTGGTTTGGCATTTATATTACACCTGAAGCTCCAAATGATGAGTGGTTGTTGGATGGTCTTGCTGGTTTTTTGACTGATTTATTTATTAAGAAGAACTTGGGAAATAATGAGGCACGCTACCAGAGATACAAGGCAAATTGTTCTGTTTGTAGAGCAGATGATTGTGGTTTGACCACCTTGAGTTCCTCTTCTGCTTGTAAAGATTTGCATGGGACCCAATGTATTGGTATATATGGAAAAATAAGATCATGGAAGTCTGTGGCAATCCTTCAAATGTTGGAAAAGCAGATGGGGCCTGAATCTTTCCGCAAGATTCTACAAAATATTGTTTCTCATGCTAAAGATACTGGTTCTACGTCACAATTACTCAGCACGAAAGAGTTTCGTCAATTGGCCAATAAGATTGGAAACCTAGAGCGACCATTTCTTAAAGAGTTCTTTCCTCGATGGGTTGAATCATGTGGTTGTCCACTGCTGAGGATGGGATTTTCCTACAACAAGCGAAAAAATATGGTTGAAATGGCCGTTTCACGGGAATGCACAGCTACACCAGCCACAAATGTAGAAAATCGGGACAGTGATGCTGGATGGCCTGGCATGATGAGTATCAGGATTTATGAACTTGATGGTGTCTTTGATCATCCAGTTCTTCCAATGACTGGAGAGTCTTGGCAGCTACTAGAAATACAATGCCACTCCAAACTTGCTGCTAGGCGCCTCCAGAAAACTAAAAAGGGGTCAAAACCTGATGGTTCTGATGACAATGCTGATATACCTGCACTCGACATTCGCTCTAGTGTGGAATCTCCTTTGTTATGGTTGAGAGCAGACCCTGAGATGGAATACCTTGCTGAAATTCATTTTCATCAACCTGTTCAGATGTGGATCAATCAATTAGAGAAGGACAAGGATGTCATTGCCCAAGCACAAGC
AATTGCGACCTTAGAGATGCTGCCTCAACCATCTTTTTCTATTGTTAATGCTCTTAACAATTTCCTCAAAGACCCCAAGGCCTTCTGGAGAGTAAGAATTGAAGCAGCACTGGCAATGGCCAAAACAGCCTCAGAGGATACTGATTGGGCTGGTTTGCTTAATTTGATTAAATTTTTTAAAAGTCAGAGGTTTGATGCTGATACTGGACTCCCCAAGCCAAACGAATTTCGTGATTTTCCTGAATATTTTGTTCTTGAGGCCATTCCTCATGCTGTTGCTATGGTCAGGGGCACCGACCAAAAAAGCCCTAGAGAAGCCGTTGAGTTTGTTTTGCAACTTTTGAAGTACAATGACAACAATGGGAATCCTTACTCTGATGTTTTTTGGTTGGCTGCACTAGTCCAATCAGTTGGTGAGCTTGAATTTGGGCAACAGAGCATTCTCTTTTTAGCATCGCTTCTCAAAAGGATTGATAGACTTCTGCAGTTTGACAGGCTGATGCCTAGCTATAATGGGATATTGACCATTAGCTGCATCAGAACCTTGACCCAGATTGCACTGAAACTATCTGGACTTCTCTCTCTTGATCGTATCATTGAACTGATAAGACCTTTTCGAGATTTTAATTCCATGTGGCAAGTTCGAATTGAGGCAACCCGATCACTTCTCGATCTGGAGTACCACTGCAACGGAATTGATGCCACATTGTTATTGTTTATTAAATATCTAGAAGAAGAGAATTCCTTGCGAGGGCAAGTGAAGCTCGCAGTCCATGTTATGCGTTTATGTCAGATAATGAGAAGATCTGGTTCAAATGATGTAGTCAATAATGACACTCTTGTTGCTTTGCTCCTTCTATTGGAAGGCAATATGGCGTTTAACAACGTCTATCTTCGTCACTACTTGTTCTGTATTTTACAAGTTCTTTCAGGAAGGTCCCCAACACTTTATGGAGTTCCTAGAGAATATAAAACATTGCATATGGGAGACACGGGAACTTTCAGTGAACAGAAAAGGATGTTAACGTCTCTCATTCCTGAATTTAACCCACCAGAGCCATCATCAGTCTCTGCAGTTGCGCCTATGCCATGTATTCCAGCAACTCTTTCATCAGAACCGCTTCATGTTCCCACACCGAGGCCTGACAACTTGGCAGTTCCCGAGCTTTCCAAGGAAGAAGGAGCAATTGCAGAGGATCCTAAGCAAGCTATGGCAATAGTTGAAGCTCCGAGAGAAGCAGCCTCTGTTTCCAGTAGTCATGAGAGGAAGTTACCAGTTGTTAAGATCAAAGTCAGATCATCTGCTGCTACAAGTAGAGCAGATGCTGATAATCTAACCACTGAAAGATCGCATGCTGCACCCCGTGAAACAGATGTTGGTCCTAGTAGTTCTGTTTCTGTTGATGCACCCCCAAGAAATACTGCTGAGGCTACAAGTATCAGCAACCGTATTCTGGAGGAGGTAAACTCCTGTCACGATCACGGGTCTCATATGACAGCCAGCATTGGCAGTGCAAAACTTGCAAGTTATGGTGATGAACTTGGGAAGGAATTCCAGTGCACTGCTGATTCCAGCAGTCGAGCTTTTGGACATTTTCAGCCAGAAGATCCTTCATCATCGAGCATCATACAAGATAACAATATTGATGCTGATGCACAGAAGTATGCAAGTCTTCAGACACTTTCTTTACCACAGCATGATCATGGCTTAGCCTCTTCGCAGTCTCGTCATGGGAAAAAAGAAAAGAAGAAAGACAAAGAAAAGAAACGTAAGCGAGAAAGTCATAAAGAACATCGCAACGATCCTGAATACATTGAGCGCAAGCGACTGAAGAAAGAGAAGAAACAAAAAGAAAAGGAGATGGCAAAGCTATTGAACGAAGAGGTGAAGCCACAGCCTACTGCAATGCCTCGTATAAAAGAACCACCAACCAAGTCAACACCCATGCAATTGGAAACAAATGAACCTAGTGGATCAAGATTAATCATAGGATCAGTTC
ATAGTAAGCCGGAGGCATCAGAAGGTACAACTTCTGCTGCCCCCAAACTTAGAATTAAATTTAAAAACCGAACGTTGAACAATTCATAATGGTTGTACTCAGCTCGTATATCAGTTGAAATCCCTACAAATTGTTGGCATTCATCCTCAGTCCTCTTTGAGGCTAGGTATATAATCTGAGCCCCACGACATAACAAAATGAAATTCAATCAATTCAAAAGCCCCTCCTTGTGAGTCACTCTTGACATTCTTTACCCTCCATCTCAAAGTCGTACAGAAATGATAATTTCTTTTGTTAGTATCTGCTTACCTAAATGATGGACCTAATCTTATCTTGTTAAGAGAAGGATGAAGTGGAACCTAGTTCACCTGAAGGAATGATGGACCTGTAAAAGGGGAGAGTTTTATGCATATCCAAAGAAAAGGACTTGTAATTTGAGTGAATCTGGGATTGAATGCGGCCATTAGCATCTCAATTTTTGAAGTAATTGCAGGGCGAAAAGAGAAAAATATTGAATATCCAAAATTGAATCAACATCAAGTGTTCTCATTTCTCAATAGTTATCCATGTGAAATCTGTATGATGGCTGCGAACAACATTAGATTCGGCTTTAATGAATGATAATGGACTACTAAAATCAGGGTGGAGAGAAGTATTTCAAGTACACGG

Coding sequence (CDS)

ATGGCCAAGCCTCGCAAGCCCAAGAACACCGACGACGCCAAGCCACCTGACAACTCCGGAGCTGTAGTTCGTCACCAGAAGCTCTGTCTTTCCATCGACATTGACAATCGTCGCGTTTATGGGTTCACCGAGTTGGAAATTGCGGTTCCTGATATTGGTATAGTTGGGTTGCACGCGGAGAATCTTGGGATTGTGAGTGTTTCAGTGGATGGTGACCCAACTGAATTTGAGTATTATCCGCGGCCTCAACATGTGGAAAATGAGAGGAGTTTTAAAGCAGTTTCGTCGCCGAGCTCTGCTGCAGATGCTGCAGGGTCAATCTATTTGTCTTCAATAGAGAAGGAATTGGTTCCTAATTTGTTGATAAACTGCTGCAAGGCTTTCAAGAGTGGAAGCGAGCAGCAAGACCAGCCATTTCTGGAGAATGGAGTGCAAACTGCGGATGAGGACAAGCAGAATGTAAGGCTGGTTCGTATTGATTACTGGGTAGAGAAATCAGAGGTAGGTATTCACTTCTATAATCGTATGGCTCACACTGACAACCAAATAAGGCGTGCAAGGTGCTGGTTCCCTTGTATGGATGATGGTCTACAACGATGCAAGTATGACCTCGAGTTTACTGTCTCTCAAAATCTTGTGGCTGTCAGTAATGGAATTTTGTTGTACCAGGTCTTGAGCAAAGACAATCCTCCCCGCAAAACTTTTGTGTATAGGGTAGACATCCCTGTCAATGCACGTTGGATATCACTTGCTGTTGGACCGTTTGAAATCCTTGCTGATCACCAAAATGTACTTATATCACACATGTGCTCGCCTGTTAATTCTTTAAAGCTCAAGCATACGGTGGATTTTTTTCATAGTGCATTCAGCTGCTATAAGGATTACCTCTCCGTGGACTTCCCATTTGGGTCATATAAACAAATCTTCATAGAGCCGGAGATTGCAGTATCATCAGCATGTTTGGGAGTTTCCATGTGTATATTTAGTTCTCATCTTTTGTTTGATGAGAAAATCATCGATCAGACCATTGATACGAGGATTAAACTTGCTTATGCGCTGGCAAGACAGTGGTTTGGCATTTATATTACACCTGAAGCTCCAAATGATGAGTGGTTGTTGGATGGTCTTGCTGGTTTTTTGACTGATTTATTTATTAAGAAGAACTTGGGAAATAATGAGGCACGCTACCAGAGATACAAGGCAAATTGTTCTGTTTGTAGAGCAGATGATTGTGGTTTGACCACCTTGAGTTCCTCTTCTGCTTGTAAAGATTTGCATGGGACCCAATGTATTGGTATATATGGAAAAATAAGATCATGGAAGTCTGTGGCAATCCTTCAAATGTTGGAAAAGCAGATGGGGCCTGAATCTTTCCGCAAGATTCTACAAAATATTGTTTCTCATGCTAAAGATACTGGTTCTACGTCACAATTACTCAGCACGAAAGAGTTTCGTCAATTGGCCAATAAGATTGGAAACCTAGAGCGACCATTTCTTAAAGAGTTCTTTCCTCGATGGGTTGAATCATGTGGTTGTCCACTGCTGAGGATGGGATTTTCCTACAACAAGCGAAAAAATATGGTTGAAATGGCCGTTTCACGGGAATGCACAGCTACACCAGCCACAAATGTAGAAAATCGGGACAGTGATGCTGGATGGCCTGGCATGATGAGTATCAGGATTTATGAACTTGATGGTGTCTTTGATCATCCAGTTCTTCCAATGACTGGAGAGTCTTGGCAGCTACTAGAAATACAATGCCACTCCAAACTTGCTGCTAGGCGCCTCCAGAAAACTAAAAAGGGGTCAAAACCTGATGGTTCTGATGACAATGCTGATATACCTGCACTCGACATTCGCTCTAGTGTGGAATCTCCTTTGTTATGGTTGAGAGCAGACCCTGAGATGGAATACCTTGCTGAAATTCATTTTCATCAACCTGTTCAGATGTGGATCAATCAATTAGAGAAGGACAAGGATGTCATTGCCCAAGCACAAGC
AATTGCGACCTTAGAGATGCTGCCTCAACCATCTTTTTCTATTGTTAATGCTCTTAACAATTTCCTCAAAGACCCCAAGGCCTTCTGGAGAGTAAGAATTGAAGCAGCACTGGCAATGGCCAAAACAGCCTCAGAGGATACTGATTGGGCTGGTTTGCTTAATTTGATTAAATTTTTTAAAAGTCAGAGGTTTGATGCTGATACTGGACTCCCCAAGCCAAACGAATTTCGTGATTTTCCTGAATATTTTGTTCTTGAGGCCATTCCTCATGCTGTTGCTATGGTCAGGGGCACCGACCAAAAAAGCCCTAGAGAAGCCGTTGAGTTTGTTTTGCAACTTTTGAAGTACAATGACAACAATGGGAATCCTTACTCTGATGTTTTTTGGTTGGCTGCACTAGTCCAATCAGTTGGTGAGCTTGAATTTGGGCAACAGAGCATTCTCTTTTTAGCATCGCTTCTCAAAAGGATTGATAGACTTCTGCAGTTTGACAGGCTGATGCCTAGCTATAATGGGATATTGACCATTAGCTGCATCAGAACCTTGACCCAGATTGCACTGAAACTATCTGGACTTCTCTCTCTTGATCGTATCATTGAACTGATAAGACCTTTTCGAGATTTTAATTCCATGTGGCAAGTTCGAATTGAGGCAACCCGATCACTTCTCGATCTGGAGTACCACTGCAACGGAATTGATGCCACATTGTTATTGTTTATTAAATATCTAGAAGAAGAGAATTCCTTGCGAGGGCAAGTGAAGCTCGCAGTCCATGTTATGCGTTTATGTCAGATAATGAGAAGATCTGGTTCAAATGATGTAGTCAATAATGACACTCTTGTTGCTTTGCTCCTTCTATTGGAAGGCAATATGGCGTTTAACAACGTCTATCTTCGTCACTACTTGTTCTGTATTTTACAAGTTCTTTCAGGAAGGTCCCCAACACTTTATGGAGTTCCTAGAGAATATAAAACATTGCATATGGGAGACACGGGAACTTTCAGTGAACAGAAAAGGATGTTAACGTCTCTCATTCCTGAATTTAACCCACCAGAGCCATCATCAGTCTCTGCAGTTGCGCCTATGCCATGTATTCCAGCAACTCTTTCATCAGAACCGCTTCATGTTCCCACACCGAGGCCTGACAACTTGGCAGTTCCCGAGCTTTCCAAGGAAGAAGGAGCAATTGCAGAGGATCCTAAGCAAGCTATGGCAATAGTTGAAGCTCCGAGAGAAGCAGCCTCTGTTTCCAGTAGTCATGAGAGGAAGTTACCAGTTGTTAAGATCAAAGTCAGATCATCTGCTGCTACAAGTAGAGCAGATGCTGATAATCTAACCACTGAAAGATCGCATGCTGCACCCCGTGAAACAGATGTTGGTCCTAGTAGTTCTGTTTCTGTTGATGCACCCCCAAGAAATACTGCTGAGGCTACAAGTATCAGCAACCGTATTCTGGAGGAGGTAAACTCCTGTCACGATCACGGGTCTCATATGACAGCCAGCATTGGCAGTGCAAAACTTGCAAGTTATGGTGATGAACTTGGGAAGGAATTCCAGTGCACTGCTGATTCCAGCAGTCGAGCTTTTGGACATTTTCAGCCAGAAGATCCTTCATCATCGAGCATCATACAAGATAACAATATTGATGCTGATGCACAGAAGTATGCAAGTCTTCAGACACTTTCTTTACCACAGCATGATCATGGCTTAGCCTCTTCGCAGTCTCGTCATGGGAAAAAAGAAAAGAAGAAAGACAAAGAAAAGAAACGTAAGCGAGAAAGTCATAAAGAACATCGCAACGATCCTGAATACATTGAGCGCAAGCGACTGAAGAAAGAGAAGAAACAAAAAGAAAAGGAGATGGCAAAGCTATTGAACGAAGAGGTGAAGCCACAGCCTACTGCAATGCCTCGTATAAAAGAACCACCAACCAAGTCAACACCCATGCAATTGGAAACAAATGAACCTAGTGGATCAAGATTAATCATAGGATCAGTTC
ATAGTAAGCCGGAGGCATCAGAAGGTACAACTTCTGCTGCCCCCAAACTTAGAATTAAATTTAAAAACCGAACGTTGAACAATTCATAA

Protein sequence

MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDIDNRRVYGFTELEIAVPDIGIVGLHAENLGIVSVSVDGDPTEFEYYPRPQHVENERSFKAVSSPSSAADAAGSIYLSSIEKELVPNLLINCCKAFKSGSEQQDQPFLENGVQTADEDKQNVRLVRIDYWVEKSEVGIHFYNRMAHTDNQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTFVYRVDIPVNARWISLAVGPFEILADHQNVLISHMCSPVNSLKLKHTVDFFHSAFSCYKDYLSVDFPFGSYKQIFIEPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGIYITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCSVCRADDCGLTTLSSSSACKDLHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSHAKDTGSTSQLLSTKEFRQLANKIGNLERPFLKEFFPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTATPATNVENRDSDAGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTKKGSKPDGSDDNADIPALDIRSSVESPLLWLRADPEMEYLAEIHFHQPVQMWINQLEKDKDVIAQAQAIATLEMLPQPSFSIVNALNNFLKDPKAFWRVRIEAALAMAKTASEDTDWAGLLNLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRGTDQKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGLLSLDRIIELIRPFRDFNSMWQVRIEATRSLLDLEYHCNGIDATLLLFIKYLEEENSLRGQVKLAVHVMRLCQIMRRSGSNDVVNNDTLVALLLLLEGNMAFNNVYLRHYLFCILQVLSGRSPTLYGVPREYKTLHMGDTGTFSEQKRMLTSLIPEFNPPEPSSVSAVAPMPCIPATLSSEPLHVPTPRPDNLAVPELSKEEGAIAEDPKQAMAIVEAPREAASVSSSHERKLPVVKIKVRSSAATSRADADNLTTERSHAAPRETDVGPSSSVSVDAPPRNTAEATSISNRILEEVNSCHDHGSHMTASIGSAKLASYGDELGKEFQCTADSSSRAFGHFQPEDPSSSSIIQDNNIDADAQKYASLQTLSLPQHDHGLASSQSRHGKKEKKKDKEKKRKRESHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKPQPTAMPRIKEPPTKSTPMQLETNEPSGSRLIIGSVHSKPEASEGTTSAAPKLRIKFKNRTLNNS*
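
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch under stated assumptions: it uses the standard nuclear genetic code, written out by hand so that no external library is needed, and the function name `translate` is illustrative. Line breaks inside the pasted CDS are removed before translating, since the sequence is wrapped across several lines on this page.

```python
# Standard genetic code, laid out in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence; stop codons are written as '*'."""
    seq = "".join(cds.split()).upper()  # drop whitespace and line wraps
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))

# First 39 bases and last 12 bases of the CDS listed above.
print(translate("ATGGCCAAGCCTCGCAAGCCCAAGAACACCGACGACGCC"))  # MAKPRKPKNTDDA
print(translate("AACAATTCATAA"))                             # NNS*
```

The two fragments reproduce the start (`MAKPRKPKNTDDA`) and the end (`NNS*`) of the protein sequence shown above; the trailing `*` corresponds to the terminal `TAA` stop codon.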
Homology
BLAST of CSPI01G06630 vs. ExPASy Swiss-Prot
Match: Q8LPF0 (Transcription initiation factor TFIID subunit 2 OS=Arabidopsis thaliana OX=3702 GN=TAF2 PE=2 SV=1)

HSP 1 Score: 1512.7 bits (3915), Expect = 0.0e+00
Identity = 829/1423 (58.26%), Positives = 1018/1423 (71.54%), Query Frame = 0

Query: 1    MAKPRKPKNTD--DAKPPDNSGAVVRHQKLCLSIDIDNRRVYGFTELEIAVPDIGIVGLH 60
            MAK RKPKN +   AK  +N+GA V HQKL LSID   R++YG+TELE++VPDIGIVGLH
Sbjct: 1    MAKARKPKNEEAPGAKTSENTGAKVLHQKLFLSIDFKKRQIYGYTELEVSVPDIGIVGLH 60

Query: 61   AENLGIVSVSVDGDPTEFEYYPRPQHVENERSFKAVSSPSSAADAAGSIYLSSIEKELVP 120
            AENLGI SV VDG+PT FEYYP  Q+ E E ++ +VS P+SAADAA   Y+  +++E   
Sbjct: 61   AENLGIESVLVDGEPTVFEYYPHHQNSETESNWNSVSDPASAADAAAMEYVGVLKREDTA 120

Query: 121  NLLINCCKAFKSGSEQQDQPFLENGVQTADEDKQNVRLVRIDYWVEKSEVGIHFYNRMAH 180
            NLLINCCK  K  SEQ D   LENG Q++ E KQNV+L+RI+YWVEK E GIHF   + H
Sbjct: 121  NLLINCCKPSKDLSEQLDSVTLENGSQSSGEAKQNVKLIRINYWVEKIESGIHFDGNIVH 180

Query: 181  TDNQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTFVYR 240
            TDNQ+RRARCWFPC+DD   RC +DLEFTV  N VAVS G LLYQV+ K++  +KT+VY 
Sbjct: 181  TDNQMRRARCWFPCIDDEYHRCSFDLEFTVPHNFVAVSVGKLLYQVMCKEDTTQKTYVYE 240

Query: 241  VDIPVNARWISLAVGPFEILADHQNVLISHMCSPVNSLKLKHTVDFFHSAFSCYKDYLSV 300
            + IP+  RW+SL  GP EIL D  N LIS++C P +  +L++T++FFH A+S Y+DYLS 
Sbjct: 241  LAIPIAPRWVSLVAGPLEILPDQTNFLISNLCLPHDLSRLRNTMEFFHEAYSYYEDYLSA 300

Query: 301  DFPFGSYKQIFIEPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWF 360
            +FPFG YKQ+F+ PE+ V+S+  G S+ IFSSH+L+DE++IDQTIDTRIKLA ALA+QWF
Sbjct: 301  NFPFGFYKQVFLPPEMVVTSSTSGASLSIFSSHILYDERVIDQTIDTRIKLASALAKQWF 360

Query: 361  GIYITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCSVCRADDCGLTTLSS 420
            G+YITPE+PND+WLLDGLAGFLTD+FIK+ LGNNEARY+RYKANC+VC+ADD G   LSS
Sbjct: 361  GVYITPESPNDDWLLDGLAGFLTDMFIKQFLGNNEARYRRYKANCAVCKADDSGAMCLSS 420

Query: 421  SSACKDLHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSHAKDTGSTSQL 480
            S +C+DL GT  IG++GKIRSWKS A+LQMLEKQMG +SFRKILQ I+S AKD  ++ + 
Sbjct: 421  SPSCRDLFGTHSIGMHGKIRSWKSGAVLQMLEKQMGSDSFRKILQKIISRAKDPSNSIRS 480

Query: 481  LSTKEFRQLANKIGNLERPFLKEFFPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTA 540
            LSTKEFRQ ANKIGNLERPFLKEFF RWV S GCP+LR+G SYNKRKN VEMA  RECTA
Sbjct: 481  LSTKEFRQFANKIGNLERPFLKEFFQRWVASYGCPVLRIGLSYNKRKNNVEMAALRECTA 540

Query: 541  T---------PATNVENRDSDAGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHS 600
                        ++ E+RD DAGWPG+MSIR+YELDG+ DHP LPM G+ WQLLE+ CHS
Sbjct: 541  ALDARLSVIGATSDSESRDVDAGWPGIMSIRVYELDGMSDHPKLPMAGDRWQLLELPCHS 600

Query: 601  KLAARRLQKTKKGSKPDGSDDNAD-IPALDIRSSVESPLLWLRADPEMEYLAEIHFHQPV 660
            KLAA+R QK KKG KPDG++DN D I  L+ ++S+ESPL W++ADPEMEY+AEIH HQP+
Sbjct: 601  KLAAKRYQKPKKGGKPDGAEDNVDAIAPLENKTSIESPLAWIKADPEMEYIAEIHLHQPL 660

Query: 661  QMWINQLEKDKDVIAQAQAIATLEMLPQPSFSIVNALNNFLKDPKAFWRVRIEAALAMAK 720
            QMW+NQLEKD DV+AQAQAIA+LE L Q SFSIVNAL N L D K FWR+RI AA A+AK
Sbjct: 661  QMWVNQLEKDGDVVAQAQAIASLEALKQHSFSIVNALKNVLTDSKVFWRIRIAAAFALAK 720

Query: 721  TASEDTDWAGLLNLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRGTDQK 780
            TASE++DWAGL +LIKF+KS+RFDA+ GLPKPN+FRDFPEYFVLEAIPHA+A+VRG + K
Sbjct: 721  TASEESDWAGLQHLIKFYKSRRFDAEIGLPKPNDFRDFPEYFVLEAIPHAIAIVRGAEGK 780

Query: 781  SPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLL 840
            SPREAVEF+LQLLKYNDN+GN YSDVFWLA LVQSVG+LEF QQS+ FLA LLKRIDRLL
Sbjct: 781  SPREAVEFILQLLKYNDNSGNSYSDVFWLAVLVQSVGDLEFCQQSLTFLAPLLKRIDRLL 840

Query: 841  QFDRLMPSYNGILTISCIRTLTQIALKLSGLLSLDRIIELIRPFRDFNSMWQVRIEATRS 900
            QFDRLMPSYNGILTISCIRTL Q ALKLS  +S D I +LI PFR+ +++ Q+RIE +R+
Sbjct: 841  QFDRLMPSYNGILTISCIRTLAQTALKLSDSISFDHICKLIEPFRNSDTILQIRIEGSRA 900

Query: 901  LLDLEYHCNGIDATLLLFIKYLEEENSLRGQVKLAVHVMRLCQIMRRSGSNDVVNNDTLV 960
            LLD+EY   GI + LLLF+KYL EE+SLRGQVKL VH MRLCQI     S+D V+  TL+
Sbjct: 901  LLDIEYQSKGISSALLLFMKYLVEESSLRGQVKLCVHTMRLCQIAVGCDSDDCVDTVTLL 960

Query: 961  ALLLLLEGNMAFNNVYLRHYLFCILQVLSGRSPTLYGVPREYKTLHMGDTGTFSEQKRML 1020
             LL L + ++ FNN  LR+YLFCI Q+L+GR PTL+GVP+E K L + D     E K + 
Sbjct: 961  DLLHLFKSHVVFNNELLRYYLFCIFQILAGRPPTLFGVPKE-KPLQLVDVEACIEPKNVF 1020

Query: 1021 TSLIPEFNPPEPSSVSA---------VAP-----------MPCIPATLSSEPL------- 1080
              L+P     EPS  +          VAP           MP +P  +  EP+       
Sbjct: 1021 --LVPGAEAGEPSLSALGDAKGQSLDVAPYGVPIIPQEMFMPIVPELMLPEPVAAYDETQ 1080

Query: 1081 HVPTPRPDNLAVPE----LSKEEGAIAEDPKQAMAIVEA---------PREAASVSSSHE 1140
            H+  PR ++   P     +  E  +  E P + +A  EA           +  SVS SHE
Sbjct: 1081 HL-EPRMESQNQPSHENPIVHEIPSDVEGPTEELAHREANPPTKEPQKEPDVVSVSVSHE 1140

Query: 1141 RKLPVVKIKVRSSAATSRADADNLTTERSH--AAPRETDVGPSSSVSVDAPPRNTAEATS 1200
             K  V++IKVR S ATSRA+    T ERS       + D G +SS SVDAP R + +A S
Sbjct: 1141 VKKSVIRIKVRPSGATSRAEGSARTIERSQGIVVRHDIDRGQTSSASVDAPQRISTDAVS 1200

Query: 1201 ISNR-ILEEVNSCHDHGSHMTASIGSAKLASYGDELGKEFQCTADS-----SSRAFGHFQ 1260
            ISN+  +EEVNSCHD GS MTASIGS K AS GD  GKE QCTA+S     S +A  + +
Sbjct: 1201 ISNQNHVEEVNSCHDVGSRMTASIGSVKFASEGDIFGKELQCTAESGKPSTSQKADNNNR 1260

Query: 1261 PEDPSSSSIIQDNNIDADA-QKYASLQTLSLPQHDHGLASSQSRHGKKEKKKDKEKKRKR 1320
               PS   +  D++++ +A QKYASLQTLS+ +             +KEKKKDKEKK K 
Sbjct: 1261 TVPPSFLPL--DHSMENEAQQKYASLQTLSIGK-------------EKEKKKDKEKKEK- 1320

Query: 1321 ESHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKPQPTAMPRIKEPPTKSTPMQLE 1363
               K  R DP Y+E+KRLKKEKK+KEKEMAKL++    P    +  + E         ++
Sbjct: 1321 ---KRKREDPVYLEKKRLKKEKKRKEKEMAKLVSSTTDPAKKKIESVAE---------VK 1380

BLAST of CSPI01G06630 vs. ExPASy Swiss-Prot
Match: Q5ZIT8 (Transcription initiation factor TFIID subunit 2 OS=Gallus gallus OX=9031 GN=TAF2 PE=2 SV=1)

HSP 1 Score: 342.4 bits (877), Expect = 2.3e-92
Identity = 269/928 (28.99%), Positives = 444/928 (47.84%), Query Frame = 0

Query: 25  HQKLCL-SIDIDNRRVYGFTELEI--AVPDIGIVGLHAENLGIVSVSVDGDPTEFEYYPR 84
           HQ +C+ +I+   + V G+ EL I   V ++  + L+++   I  V V+     F Y   
Sbjct: 20  HQVVCINNINFQRKSVVGYVELTIFPTVANLNRIKLNSKQCRIYRVRVNDLEAAFIYNDP 79

Query: 85  P----QHVENERSFKAVSSPSSAA------DAAGSIYLSSIEKELVPNLLINCCKAFKSG 144
                 H   +R+    S+  +AA      DA        +  EL  ++           
Sbjct: 80  TLEVCHHESKQRNLNYFSNAYAAAVSAVDPDAGNGELCIKVPSELWKHV----------- 139

Query: 145 SEQQDQPFLENGVQTADEDKQNVRLVRIDYWVEKSEVGIHFY---------NRMAH--TD 204
                             D+  V  V I++ +++ + G+HF           R AH  + 
Sbjct: 140 ------------------DELKVLKVHINFSLDQPKGGLHFVVPNMEGSMAERGAHVFSC 199

Query: 205 NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTFVYRVD 264
                 R WFPC+D   + C + LE+TV   +VAVSNG L+  V + D   +KTF Y + 
Sbjct: 200 GYQNSTRFWFPCVDSYSELCTWKLEYTVDAAMVAVSNGDLVETVYTHD-MRKKTFHYMLA 259

Query: 265 IPVNARWISLAVGPFEILADHQNVLISHMCSPVNSLKLKHTVDFFHSAFSCYKDYLSVDF 324
           IP  A  ISLA+GPFEIL D     ++H C P     LKHT  + H  F  Y++ L+  +
Sbjct: 260 IPTAASNISLAIGPFEILVDPYMHEVTHFCLPQLLPLLKHTTSYLHEVFEFYEEILTCRY 319

Query: 325 PFGSYKQIFIEPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGI 384
           P+  +K +FI+   A        SM IFS++LL    IID+T  TR  LA ALA+Q+FG 
Sbjct: 320 PYSCFKTVFIDE--AYVEVAAYASMSIFSTNLLHSAMIIDETPLTRRCLAQALAQQFFGC 379

Query: 385 YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQ-RYKANCSVCRADDCG---LTTL 444
           +I+  + +DEW+L G++G++  L++KK  G NE R+  + + +  V      G   L  +
Sbjct: 380 FISRMSWSDEWVLKGISGYIYGLWMKKTFGVNEYRHWIKQELDQIVAYELKTGGVLLHPI 439

Query: 445 SSSSACKD----------LHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIV 504
                 KD           H       Y  +   K+  +++++E ++  E   ++   ++
Sbjct: 440 FGGGKEKDNPASHLHFSIKHPHTLSWEYYTMFQCKAHLVMRLIENRISMEFMLQVFNKLL 499

Query: 505 SHAKDTGS--------TSQLLSTKEF-RQLANKIGNLERPFLKEFFPRWVESCGCPLLRM 564
           S A    S        +  L+ST  F + ++N  G   +P +K+    WV+  G      
Sbjct: 500 SLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPLIKQ----WVDQSGVVKFYG 559

Query: 565 GFSYNKRKNMVEMAVSRECTATPATNVENRDSDAGWPGMMSIRIYELDGVFDHPVLPMTG 624
            F++N+++N++E+ + ++ T +P T          + G + + + ELDG F+H +     
Sbjct: 560 SFAFNRKRNVLELEIKQDYT-SPGTQ--------KYVGPLKVTVQELDGSFNHTL--QIE 619

Query: 625 ESWQLLEIQCHSKLAARRLQKTKKGSKPDGSDDNADIPALDIRSSVESPLLWLRADPEME 684
           E+    +I CHSK    R  K KK    +G + + D+ A+D     +SPLLW+R DP+M 
Sbjct: 620 ENSLKHDIPCHSK---SRRNKKKKIPLMNGEEVDMDLSAMD----ADSPLLWIRIDPDMS 679

Query: 685 YLAEIHFHQPVQMWINQLEKDKDVIAQAQAIATLEMLPQPSFSIVNALNNFLKDPKAFWR 744
            L ++ F Q   MW  QL  ++DV+AQ +AI  LE  P P+  +  AL + L+  + F+R
Sbjct: 680 VLRKVEFEQSDFMWQYQLRYERDVVAQEEAILALEKFPTPASRL--ALTDILEQEQCFYR 739

Query: 745 VRIEAALAMAKTA-SEDTDWAGLLNLIKFFKSQRFDADT--GLPKPNEFRDFPEYFVLEA 804
           VR+ A   +AK A S  + W G   +   F ++ F   T   + K N F +F  YF+ + 
Sbjct: 740 VRMLACFCLAKIANSMVSTWTGPPAMKSLF-TRMFCCKTCPNIVKTNNFMNFQSYFLQKT 799

Query: 805 IPHAVAMVRGTDQKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA----ALVQSVGELEFG 864
           +P A+A++R      P+E + F+L L+KYNDN  N +SD ++ A    AL  SV      
Sbjct: 800 MPVAMALLRDVHNLCPKEVLMFILDLIKYNDNRKNKFSDNYYRAELIDALANSVTPAVSV 859

Query: 865 QQSILFLASL-------LKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGLLSLD 892
              +  L +L       L+ I R L  ++L+PSY   +T+SC++ +    L+ +G +  D
Sbjct: 860 NNEVRTLDNLNPDVRLILEEITRFLNMEKLLPSYRHTITVSCLKAIR--VLQKNGHVPSD 886

BLAST of CSPI01G06630 vs. ExPASy Swiss-Prot
Match: Q6P1X5 (Transcription initiation factor TFIID subunit 2 OS=Homo sapiens OX=9606 GN=TAF2 PE=1 SV=3)

HSP 1 Score: 342.4 bits (877), Expect = 2.3e-92
Identity = 267/919 (29.05%), Positives = 447/919 (48.64%), Query Frame = 0

Query: 25  HQKLCL-SIDIDNRRVYGFTELEI--AVPDIGIVGLHAENLGIVSVSVDGDPTEFEYY-P 84
           HQ +C+ +I+   + V GF EL I   V ++  + L+++   I  V ++     F Y  P
Sbjct: 30  HQVVCINNINFQRKSVVGFVELTIFPTVANLNRIKLNSKQCRIYRVRINDLEAAFIYNDP 89

Query: 85  RPQHVENERSFKAVSSPSSAADAAGSIYLSSIEKELVPNLLINCCKAFKSGSEQQDQPFL 144
             +   +E   + ++  S+A  AA    +S+++ +     L       K  SE       
Sbjct: 90  TLEVCHSESKQRNLNYFSNAYAAA----VSAVDPDAGNGEL-----CIKVPSELWKH--- 149

Query: 145 ENGVQTADEDKQNVRLVRIDYWVEKSEVGIHFY---------NRMAH--TDNQIRRARCW 204
                    D+  V  + I++ +++ + G+HF           R AH  +       R W
Sbjct: 150 --------VDELKVLKIHINFSLDQPKGGLHFVVPSVEGSMAERGAHVFSCGYQNSTRFW 209

Query: 205 FPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTFVYRVDIPVNARWIS 264
           FPC+D   + C + LEFTV   +VAVSNG L+  V + D   +KTF Y + IP  A  IS
Sbjct: 210 FPCVDSYSELCTWKLEFTVDAAMVAVSNGDLVETVYTHD-MRKKTFHYMLTIPTAASNIS 269

Query: 265 LAVGPFEILADHQNVLISHMCSPVNSLKLKHTVDFFHSAFSCYKDYLSVDFPFGSYKQIF 324
           LA+GPFEIL D     ++H C P     LKHT  + H  F  Y++ L+  +P+  +K +F
Sbjct: 270 LAIGPFEILVDPYMHEVTHFCLPQLLPLLKHTTSYLHEVFEFYEEILTCRYPYSCFKTVF 329

Query: 325 IEPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGIYITPEAPND 384
           I+   A        SM IFS++LL    IID+T  TR  LA +LA+Q+FG +I+  + +D
Sbjct: 330 IDE--AYVEVAAYASMSIFSTNLLHSAMIIDETPLTRRCLAQSLAQQFFGCFISRMSWSD 389

Query: 385 EWLLDGLAGFLTDLFIKKNLGNNEARY----QRYKANCSVCRADDCGLTTLSSSSACKD- 444
           EW+L G++G++  L++KK  G NE R+    +  K      +     L  +      KD 
Sbjct: 390 EWVLKGISGYIYGLWMKKTFGVNEYRHWIKEELDKIVAYELKTGGVLLHPIFGGGKEKDN 449

Query: 445 ---------LHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSHAKDTGS- 504
                     H       Y  +   K+  +++++E ++  E   ++   ++S A    S 
Sbjct: 450 PASHLHFSIKHPHTLSWEYYSMFQCKAHLVMRLIENRISMEFMLQVFNKLLSLASTASSQ 509

Query: 505 -------TSQLLSTKEF-RQLANKIGNLERPFLKEFFPRWVESCGCPLLRMGFSYNKRKN 564
                  +  L+ST  F + ++N  G   +P +K+    WV+  G       F++N+++N
Sbjct: 510 KFQSHMWSQMLVSTSGFLKSISNVSGKDIQPLIKQ----WVDQSGVVKFYGSFAFNRKRN 569

Query: 565 MVEMAVSRECTATPATNVENRDSDAGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQ 624
           ++E+ + ++ T +P T          + G + + + ELDG F+H +     E+    +I 
Sbjct: 570 VLELEIKQDYT-SPGTQ--------KYVGPLKVTVQELDGSFNHTL--QIEENSLKHDIP 629

Query: 625 CHSKLAARRLQKTKKGSKPDGSDDNADIPALDIRSSVESPLLWLRADPEMEYLAEIHFHQ 684
           CHSK    R  K KK    +G + + D+ A+D     +SPLLW+R DP+M  L ++ F Q
Sbjct: 630 CHSK---SRRNKKKKIPLMNGEEVDMDLSAMD----ADSPLLWIRIDPDMSVLRKVEFEQ 689

Query: 685 PVQMWINQLEKDKDVIAQAQAIATLEMLPQPSFSIVNALNNFLKDPKAFWRVRIEAALAM 744
              MW  QL  ++DV+AQ ++I  LE  P P+  +  AL + L+  + F+RVR+ A   +
Sbjct: 690 ADFMWQYQLRYERDVVAQQESILALEKFPTPASRL--ALTDILEQEQCFYRVRMSACFCL 749

Query: 745 AKTA-SEDTDWAGLLNLIKFFKSQRFDADT--GLPKPNEFRDFPEYFVLEAIPHAVAMVR 804
           AK A S  + W G   +   F ++ F   +   + K N F  F  YF+ + +P A+A++R
Sbjct: 750 AKIANSMVSTWTGPPAMKSLF-TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLR 809

Query: 805 GTDQKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLAS 864
                 P+E + F+L L+KYNDN  N +SD ++ A    AL  SV         +  L +
Sbjct: 810 DVHNLCPKEVLTFILDLIKYNDNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDN 869

Query: 865 L-------LKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGLLSLDRIIELIRPF 892
           L       L+ I R L  ++L+PSY   +T+SC+R +    L+ +G +  D    L + +
Sbjct: 870 LNPDVRLILEEITRFLNMEKLLPSYRHTITVSCLRAIR--VLQKNGHVPSDP--ALFKSY 896

BLAST of CSPI01G06630 vs. ExPASy Swiss-Prot
Match: Q8C176 (Transcription initiation factor TFIID subunit 2 OS=Mus musculus OX=10090 GN=Taf2 PE=2 SV=2)

HSP 1 Score: 339.0 bits (868), Expect = 2.5e-91
Identity = 265/919 (28.84%), Positives = 446/919 (48.53%), Query Frame = 0

Query: 25  HQKLCL-SIDIDNRRVYGFTELEI--AVPDIGIVGLHAENLGIVSVSVDGDPTEFEYY-P 84
           HQ +C+ +I+   + V GF EL I   V ++  + L+++   I  V ++     F Y  P
Sbjct: 20  HQVVCINNINFQRKSVVGFVELTIFPTVANLNRIKLNSKQCRIYRVRINDLEAAFIYNDP 79

Query: 85  RPQHVENERSFKAVSSPSSAADAAGSIYLSSIEKELVPNLLINCCKAFKSGSEQQDQPFL 144
             +   +E   + ++  S+A  AA    +S+++ +     L       K  SE       
Sbjct: 80  TLEVCHSESKQRNLNYFSNAYAAA----VSAVDPDAGNGEL-----CIKVPSELWKH--- 139

Query: 145 ENGVQTADEDKQNVRLVRIDYWVEKSEVGIHFY---------NRMAH--TDNQIRRARCW 204
                    D+  V  + I++ +++ + G+HF           R AH  +       R W
Sbjct: 140 --------VDELKVLKIHINFSLDQPKGGLHFVVPSVEGSMAERGAHVFSCGYQNSTRFW 199

Query: 205 FPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTFVYRVDIPVNARWIS 264
           FPC+D   + C + LEFTV   +VAVSNG L+  V + D   +KTF Y + IP  A  IS
Sbjct: 200 FPCVDSYSELCTWKLEFTVDAAMVAVSNGDLVETVYTHD-MRKKTFHYMLTIPTAASNIS 259

Query: 265 LAVGPFEILADHQNVLISHMCSPVNSLKLKHTVDFFHSAFSCYKDYLSVDFPFGSYKQIF 324
           LA+GPFEIL D     ++H C P     LKHT  + H  F  Y++ L+  +P+  +K +F
Sbjct: 260 LAIGPFEILVDPYMHEVTHFCLPQLLPLLKHTTSYIHEVFEFYEEILTCRYPYSCFKTVF 319

Query: 325 IEPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGIYITPEAPND 384
           I+   A        SM IFS++LL    IID+T  TR  LA ALA+Q+FG +I+  + +D
Sbjct: 320 IDE--AYVEVAAYASMSIFSTNLLHSAMIIDETPLTRRCLAQALAQQFFGCFISRMSWSD 379

Query: 385 EWLLDGLAGFLTDLFIKKNLGNNEARY----QRYKANCSVCRADDCGLTTLSSSSACKD- 444
           EW+L G++G++  L++KK  G NE  +    +  K      +     L  +      KD 
Sbjct: 380 EWVLKGISGYIYGLWMKKTFGVNEYHHWIKEELDKIVAYELKTGGVLLHPIFGGGKEKDN 439

Query: 445 ---------LHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSHAKDTGS- 504
                     H       Y  +   K+  +++++E ++  E   ++   ++S A    S 
Sbjct: 440 PASHLHFSIKHPHTLSWEYYTMFQCKAHLVMRLIENRISMEFMLQVFNKLLSLASTASSQ 499

Query: 505 -------TSQLLSTKEF-RQLANKIGNLERPFLKEFFPRWVESCGCPLLRMGFSYNKRKN 564
                  +  L+ST  F + ++N  G   +P +K+    W++  G       F++N+++N
Sbjct: 500 KFQSHMWSQMLVSTYGFLKSISNVSGKDIQPLIKQ----WLDQSGVVKFYGSFAFNRKRN 559

Query: 565 MVEMAVSRECTATPATNVENRDSDAGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQ 624
           ++E+ + ++ T +P T          + G + + + ELDG F+H +     E+    +I 
Sbjct: 560 VLELEIKQDYT-SPGTQ--------KYVGPLKVTVQELDGSFNHTL--QIEENSLKHDIP 619

Query: 625 CHSKLAARRLQKTKKGSKPDGSDDNADIPALDIRSSVESPLLWLRADPEMEYLAEIHFHQ 684
           CHSK    R  K KK    +G + + D+ A++     +SPLLW+R DP+M  L ++ F Q
Sbjct: 620 CHSK---SRRNKKKKIPLMNGEEVDMDLSAME----ADSPLLWIRIDPDMSVLRKVEFEQ 679

Query: 685 PVQMWINQLEKDKDVIAQAQAIATLEMLPQPSFSIVNALNNFLKDPKAFWRVRIEAALAM 744
              MW  +L  ++DV+AQ ++I  LE  P P+  +  AL + L+  + F+RVR+ A   +
Sbjct: 680 ADFMWQYELRYERDVVAQQESILALEKFPTPASRL--ALTDILEQEQCFYRVRMSACFCL 739

Query: 745 AKTA-SEDTDWAGLLNLIKFFKSQRFDADT--GLPKPNEFRDFPEYFVLEAIPHAVAMVR 804
           AK A S  + W G   +   F ++ F   T   + K N F  F  YF+ + +P A+A++R
Sbjct: 740 AKIANSMVSTWTGPPAMKSLF-TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLR 799

Query: 805 GTDQKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLAS 864
                 P+E + F+L L+KYNDN  N +SD ++ A    AL  SV         +  L +
Sbjct: 800 DVHNLCPKEVLTFILDLIKYNDNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDN 859

Query: 865 L-------LKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGLLSLDRIIELIRPF 892
           L       L+ I R L  ++L+PSY   +T+SC+R +    L+ +G +  D    L + +
Sbjct: 860 LNPDVRLILEEITRFLNMEKLLPSYRHTITVSCLRAIR--VLQKNGHVPSD--ASLFKSY 886

BLAST of CSPI01G06630 vs. ExPASy Swiss-Prot
Match: Q32PW3 (Transcription initiation factor TFIID subunit 2 OS=Danio rerio OX=7955 GN=taf2 PE=2 SV=2)

HSP 1 Score: 332.0 bits (850), Expect = 3.1e-89
Identity = 325/1197 (27.15%), Positives = 545/1197 (45.53%), Query Frame = 0

Query: 25   HQKLCL-SIDIDNRRVYGFTELEI--AVPDIGIVGLHAENLGIVSVSVDGDPTEFEYY-P 84
            HQ +C+ +++   + V G+ EL I   V ++  + L+++   I  V V+     F Y  P
Sbjct: 19   HQVVCINNVNFQRKSVIGYVELTIFPTVVNLNRIKLNSKQCRIYRVRVNDLEAPFIYNDP 78

Query: 85   RPQHVENERSFKAVSSPSSAADAAGSIYLSSIEKELVPNLLINCCKAFKSGSEQQDQPFL 144
              +   +E   + ++  SSA  AA    +S+++ +     L     + K  SE   Q   
Sbjct: 79   TLEVCHHESKQRNLNYFSSAYTAA----VSAVDPDAGNGEL-----SIKVPSELWKQ--- 138

Query: 145  ENGVQTADEDKQNVRLVRIDYWVEKSEVGIHFY---------NRMAHT---DNQIRRARC 204
                     D+  V  V I++ +++ + G+HF           R AH     NQ   +R 
Sbjct: 139  --------GDEMKVMKVYIEFSLDQPKGGLHFVVPDVEGNMAERAAHVFSFGNQ-NSSRF 198

Query: 205  WFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTFVYRVDIPVNARWI 264
            WFPC+D   + C + LEFTV  ++VAVS G L+  V + D   +KT+ Y + IP  A  I
Sbjct: 199  WFPCVDSYSELCTWKLEFTVDASMVAVSCGDLVETVYTHD-MRKKTYHYMLPIPTAAPNI 258

Query: 265  SLAVGPFEILADHQNVLISHMCSPVNSLKLKHTVDFFHSAFSCYKDYLSVDFPFGSYKQI 324
            S+AVGPFEIL D     ++H C P     LKHT+ + H  F  Y++ L+  +P+  +K +
Sbjct: 259  SMAVGPFEILVDPYMHEVTHFCLPQLLPLLKHTMSYLHEIFEFYEEILTCRYPYSCFKTV 318

Query: 325  FI-EPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGIYITPEAP 384
            F+ E  + VSS     SM IFS++LL    IIDQT  TR  LA ALA+Q+FG +I+  + 
Sbjct: 319  FVDEAYVQVSSY---ASMSIFSTNLLHSGLIIDQTPMTRSFLAQALAQQFFGCFISRMSW 378

Query: 385  NDEWLLDGLAGFLTDLFIKKNLGNNEARY----QRYKANCSVCRADDCGLTTLSSSSACK 444
             DEW+L G++G++  L++KK  G NE R+    +  K      +     L    S    K
Sbjct: 379  ADEWVLKGISGYIYGLYLKKTFGVNEYRHWIKEELDKIVEYELKIGGVLLHPTFSGGKEK 438

Query: 445  D----------LHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSHAKDTG 504
            D           H       Y K+   K+  +++++E ++  E   ++   ++S A    
Sbjct: 439  DNPTPHLHFSIKHPHTLSWEYYKMFQCKAHLVMRLIENRISMEFMLQVFNKLLSLASTAS 498

Query: 505  S--------TSQLLSTKEF-RQLANKIGNLERPFLKEFFPRWVESCGCPLLRMGFSYNKR 564
            S        +  L+ST  F + ++N  G    P +K+    WV+  G       F++N++
Sbjct: 499  SQKYQSHMWSQMLVSTSGFLKSISNVSGKDIGPLIKQ----WVDQSGVVKFFGSFAFNRK 558

Query: 565  KNMVEMAVSRECTATPATNVENRDSDAGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLE 624
            +N++E+ + ++ T++             + G + + + ELDG F+H +     E+    +
Sbjct: 559  RNVLELEIRQDYTSSGTQK---------YVGPIKVTVQELDGSFNHTL--QIEENSLKHD 618

Query: 625  IQCHSKLAARRLQKTKKGSKPDGSDDNADIPALDIRSSVESPLLWLRADPEMEYLAEIHF 684
            I CHSK    R  K KK    +G + + D+ A+D     +SPLLW+R DP+M  L ++ F
Sbjct: 619  IPCHSK---SRRNKKKKIPLMNGEEVDMDLSAMD----ADSPLLWIRIDPDMSILRKVEF 678

Query: 685  HQPVQMWINQLEKDKDVIAQAQAIATLEMLPQPSFSIVNALNNFLKDPKAFWRVRIEAAL 744
             Q   MW  QL  ++DV+AQ +AI  LE  P P      AL + L+  + F++VR+ A  
Sbjct: 679  EQADFMWQYQLRYERDVVAQEEAILALEKFPTPPSR--RALTDILEQDQCFYKVRMHACF 738

Query: 745  AMAKTA-SEDTDWAGLLNLIKFFKSQRFDADT--GLPKPNEFRDFPEYFVLEAIPHAVAM 804
             +AK A S  + W G   +   F ++ F   +   + K N F  F  YF+ + IP A+A 
Sbjct: 739  CLAKIANSMVSTWTGPPAMKSLF-TRMFCCKSCPNIVKTNNFISFQSYFLQKTIPVAMAQ 798

Query: 805  VRGTDQKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV-----------GELEFG 864
            +R      PRE + F+L L+KYNDN  N +SD ++ A L+ ++            E+   
Sbjct: 799  LRDVQNLCPREVLSFILDLIKYNDNRKNKFSDNYYRAELIDALTNSLTPAISINNEVRTV 858

Query: 865  QQSILFLASLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGLLSLDRIIELIR 924
                  +  +L+ I R L  ++L+PSY   +T+SC+R +    L+ +G +  D    L +
Sbjct: 859  DSLNADVRLILEEITRFLNMEKLLPSYRNTITVSCLRAIRM--LQKNGHIPSDP--TLFK 918

Query: 925  PFRDFNSMWQVRIEATRSLLDLEYHCNGIDATLLLFIKYLEEENSLRGQVKLAVHVMRLC 984
             + ++     VRI A  +++D  Y      ++ L ++  L + + +   V+  +  M   
Sbjct: 919  SYAEYGHFVDVRIAALEAVID--YTRVDRSSSELQWLLDLVQNDPVH-YVRHEILSMLAK 978

Query: 985  QIMRRSGSNDVVNNDTLVALLLLLEGNMAFNNVYLRHYLFCILQVLSGRSPTLYGVPREY 1044
                   ++  + N+ LV  L  L  +   ++  LR     +         TL+G+ R  
Sbjct: 979  NPPFTKAADSSLCNEALVDQLWKLMNSGTCHDWRLRCDAVNLYY-------TLFGLTRP- 1038

Query: 1045 KTLHMGDTGTFSEQKRMLTSLIPEFNPPEPSSVSAVAPMPCIPATLSSEPLHVPTPRPDN 1104
              L + + G     K     L P     +   +  + P      TL  E + V  P    
Sbjct: 1039 SCLPLPELGLVLNLKEKKAVLNPTIKSEQIPGLPEMTPSMTFINTLKEEHV-VMDPALSG 1098

Query: 1105 LAVPELS---KEEGAIAEDPKQAMAIVEAPREAASVSSSHERKLPVVKIKVRSSAATSRA 1164
            +A+  LS   K E  +   P+    + E P    ++ S  E  + +    V  S A    
Sbjct: 1099 VAMAPLSLKRKAETPVGSAPEPGQVLQEEPSAKITLKSREEEDIDM--DTVHDSQAFIYH 1145

BLAST of CSPI01G06630 vs. ExPASy TrEMBL
Match: A0A0A0LQC8 (Transcription initiation factor TFIID subunit 2 OS=Cucumis sativus OX=3659 GN=Csa_1G042170 PE=3 SV=1)

HSP 1 Score: 2679.8 bits (6945), Expect = 0.0e+00
Identity = 1358/1362 (99.71%), Positives = 1360/1362 (99.85%), Query Frame = 0

Query: 1    MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDIDNRRVYGFTELEIAVPDIGIVGLHAE 60
            MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDIDNRRVYGFTELEIAVPDIGIVGLHAE
Sbjct: 1    MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDIDNRRVYGFTELEIAVPDIGIVGLHAE 60

Query: 61   NLGIVSVSVDGDPTEFEYYPRPQHVENERSFKAVSSPSSAADAAGSIYLSSIEKELVPNL 120
            NLGIVSVSVDGDPTEFEYYPRPQHVENERSFKAVSSPSSAADAAGSIYLSSIEKELVPNL
Sbjct: 61   NLGIVSVSVDGDPTEFEYYPRPQHVENERSFKAVSSPSSAADAAGSIYLSSIEKELVPNL 120

Query: 121  LINCCKAFKSGSEQQDQPFLENGVQTADEDKQNVRLVRIDYWVEKSEVGIHFYNRMAHTD 180
            LINCCKAFKSGSEQQDQPFLENGVQTADEDKQNVRLVRIDYWVEKSEVGIHFYNRMAHTD
Sbjct: 121  LINCCKAFKSGSEQQDQPFLENGVQTADEDKQNVRLVRIDYWVEKSEVGIHFYNRMAHTD 180

Query: 181  NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTFVYRVD 240
            NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTFVYRVD
Sbjct: 181  NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTFVYRVD 240

Query: 241  IPVNARWISLAVGPFEILADHQNVLISHMCSPVNSLKLKHTVDFFHSAFSCYKDYLSVDF 300
            IPVNARWISLAVGPFEILADHQNVLISHMCSPVNSLKLKHTVDFFHSAFSCYKDYLSVDF
Sbjct: 241  IPVNARWISLAVGPFEILADHQNVLISHMCSPVNSLKLKHTVDFFHSAFSCYKDYLSVDF 300

Query: 301  PFGSYKQIFIEPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGI 360
            PFGSYKQIFIEPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGI
Sbjct: 301  PFGSYKQIFIEPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGI 360

Query: 361  YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCSVCRADDCGLTTLSSSS 420
            YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCSVCRADDCGLTTLSSSS
Sbjct: 361  YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCSVCRADDCGLTTLSSSS 420

Query: 421  ACKDLHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSHAKDTGSTSQLLS 480
            ACKDLHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSHAKDTGSTSQLLS
Sbjct: 421  ACKDLHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSHAKDTGSTSQLLS 480

Query: 481  TKEFRQLANKIGNLERPFLKEFFPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTATP 540
            TKEFRQLANKIGNLERPFLKEFFPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTATP
Sbjct: 481  TKEFRQLANKIGNLERPFLKEFFPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTATP 540

Query: 541  ATNVENRDSDAGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTK 600
            ATNVENRDSDAGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTK
Sbjct: 541  ATNVENRDSDAGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTK 600

Query: 601  KGSKPDGSDDNADIPALDIRSSVESPLLWLRADPEMEYLAEIHFHQPVQMWINQLEKDKD 660
            KGSKPDGSDDNADIPALDIRSSVESPLLWLRADPEMEYLAEIHFHQPVQMWINQLEKDKD
Sbjct: 601  KGSKPDGSDDNADIPALDIRSSVESPLLWLRADPEMEYLAEIHFHQPVQMWINQLEKDKD 660

Query: 661  VIAQAQAIATLEMLPQPSFSIVNALNNFLKDPKAFWRVRIEAALAMAKTASEDTDWAGLL 720
            VIAQAQAIATLEMLPQPSFSIVNALNNFLKDPKAFWRVRIEAALAMAKTASEDTDWAGLL
Sbjct: 661  VIAQAQAIATLEMLPQPSFSIVNALNNFLKDPKAFWRVRIEAALAMAKTASEDTDWAGLL 720

Query: 721  NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRGTDQKSPREAVEFVLQL 780
            NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRGTDQKSPREAVEFVLQL
Sbjct: 721  NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRGTDQKSPREAVEFVLQL 780

Query: 781  LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI 840
            LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI
Sbjct: 781  LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI 840

Query: 841  LTISCIRTLTQIALKLSGLLSLDRIIELIRPFRDFNSMWQVRIEATRSLLDLEYHCNGID 900
            LTISCIRTLTQIALKLSGLLSLDRIIELIRPFRDFNSMWQVRIEATRSLLDLEYHCNGID
Sbjct: 841  LTISCIRTLTQIALKLSGLLSLDRIIELIRPFRDFNSMWQVRIEATRSLLDLEYHCNGID 900

Query: 901  ATLLLFIKYLEEENSLRGQVKLAVHVMRLCQIMRRSGSNDVVNNDTLVALLLLLEGNMAF 960
            ATLLLFIKYLEEENSLRGQVKLAVHVMRLCQIMRRSGSNDVVNNDTLVALLLLLEGNMAF
Sbjct: 901  ATLLLFIKYLEEENSLRGQVKLAVHVMRLCQIMRRSGSNDVVNNDTLVALLLLLEGNMAF 960

Query: 961  NNVYLRHYLFCILQVLSGRSPTLYGVPREYKTLHMGDTGTFSEQKRMLTSLIPEFNPPEP 1020
            NNVYLRHYLF ILQVLSGRSPTLYGVPREYKTLHMGDTGTFSEQKRMLTS+IPEFNPPEP
Sbjct: 961  NNVYLRHYLFSILQVLSGRSPTLYGVPREYKTLHMGDTGTFSEQKRMLTSIIPEFNPPEP 1020

Query: 1021 SSVSAVAPMPCIPATLSSEPLHVPTPRPDNLAVPELSKEEGAIAEDPKQAMAIVEAPREA 1080
            SSVSAVAPMPCIPATLSSEPLHVPTPRPDNLAVPELSKEEGAIAEDPKQAMAIVEAPREA
Sbjct: 1021 SSVSAVAPMPCIPATLSSEPLHVPTPRPDNLAVPELSKEEGAIAEDPKQAMAIVEAPREA 1080

Query: 1081 ASVSSSHERKLPVVKIKVRSSAATSRADADNLTTERSHAAPRETDVGPSSSVSVDAPPRN 1140
            ASVSSSHERKLPVVKIKVRSSAATSRADADNLTTERSHAAPRETDVGPSSSVSVDAPPRN
Sbjct: 1081 ASVSSSHERKLPVVKIKVRSSAATSRADADNLTTERSHAAPRETDVGPSSSVSVDAPPRN 1140

Query: 1141 TAEATSISNRILEEVNSCHDHGSHMTASIGSAKLASYGDELGKEFQCTADSSSRAFGHFQ 1200
            TAEATSISNRILEEVNSCHDHGSHMTASIGSAKLASYGDELGKEFQCTADSSSRAFGHFQ
Sbjct: 1141 TAEATSISNRILEEVNSCHDHGSHMTASIGSAKLASYGDELGKEFQCTADSSSRAFGHFQ 1200

Query: 1201 PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQHDHGLASSQSRHGKKEKKKDKEKKRKRE 1260
            PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQHDHGLASS SRHGKKEKKKDKEKKRKRE
Sbjct: 1201 PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQHDHGLASSHSRHGKKEKKKDKEKKRKRE 1260

Query: 1261 SHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKPQPTAMPRIKEPPTKSTPMQLET 1320
            SHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKPQPTAMPRIKEPPTKSTP+QLET
Sbjct: 1261 SHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKPQPTAMPRIKEPPTKSTPVQLET 1320

Query: 1321 NEPSGSRLIIGSVHSKPEASEGTTSAAPKLRIKFKNRTLNNS 1363
            NEPSGSRLIIGSVHSKPEASEGTTSAAPKLRIKFKNRTLNNS
Sbjct: 1321 NEPSGSRLIIGSVHSKPEASEGTTSAAPKLRIKFKNRTLNNS 1362

BLAST of CSPI01G06630 vs. ExPASy TrEMBL
Match: A0A1S3C357 (Transcription initiation factor TFIID subunit 2 OS=Cucumis melo OX=3656 GN=LOC103496328 PE=3 SV=1)

HSP 1 Score: 2616.6 bits (6781), Expect = 0.0e+00
Identity = 1328/1362 (97.50%), Positives = 1343/1362 (98.60%), Query Frame = 0

Query: 1    MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDIDNRRVYGFTELEIAVPDIGIVGLHAE 60
            MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDIDNRRVYGFTELEIAVPDIGIVGLHAE
Sbjct: 1    MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDIDNRRVYGFTELEIAVPDIGIVGLHAE 60

Query: 61   NLGIVSVSVDGDPTEFEYYPRPQHVENERSFKAVSSPSSAADAAGSIYLSSIEKELVPNL 120
            NLGI+SVSVDGDPTEFEYYPRPQHVE+ERSFKAV+SPSSAADAAGSIY+SSIEKELVPNL
Sbjct: 61   NLGILSVSVDGDPTEFEYYPRPQHVESERSFKAVASPSSAADAAGSIYMSSIEKELVPNL 120

Query: 121  LINCCKAFKSGSEQQDQPFLENGVQTADEDKQNVRLVRIDYWVEKSEVGIHFYNRMAHTD 180
            LINCCKAFKSGSEQQDQPFLENGVQ ADEDKQNVRLVRIDYWVEKSEVGIHFYNR+AHTD
Sbjct: 121  LINCCKAFKSGSEQQDQPFLENGVQPADEDKQNVRLVRIDYWVEKSEVGIHFYNRLAHTD 180

Query: 181  NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTFVYRVD 240
            NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKT+VYRVD
Sbjct: 181  NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTYVYRVD 240

Query: 241  IPVNARWISLAVGPFEILADHQNVLISHMCSPVNSLKLKHTVDFFHSAFSCYKDYLSVDF 300
            IPVNARWISLAVGPFEILADHQNVLISHMCSPVNSLKLKHTVDFFH AFSCYKDYLSVDF
Sbjct: 241  IPVNARWISLAVGPFEILADHQNVLISHMCSPVNSLKLKHTVDFFHGAFSCYKDYLSVDF 300

Query: 301  PFGSYKQIFIEPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGI 360
            PFGSYKQIFIEPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGI
Sbjct: 301  PFGSYKQIFIEPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGI 360

Query: 361  YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCSVCRADDCGLTTLSSSS 420
            YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANC+VCRADD GLTTLSSS+
Sbjct: 361  YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCAVCRADDSGLTTLSSSA 420

Query: 421  ACKDLHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSHAKDTGSTSQLLS 480
            ACKDLHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSHAKDTGS SQLLS
Sbjct: 421  ACKDLHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSHAKDTGSASQLLS 480

Query: 481  TKEFRQLANKIGNLERPFLKEFFPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTATP 540
            TKEFRQLANKIGNLERPFLKEFFPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTATP
Sbjct: 481  TKEFRQLANKIGNLERPFLKEFFPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTATP 540

Query: 541  ATNVENRDSDAGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTK 600
            AT+VENRDSDAGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTK
Sbjct: 541  ATSVENRDSDAGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTK 600

Query: 601  KGSKPDGSDDNADIPALDIRSSVESPLLWLRADPEMEYLAEIHFHQPVQMWINQLEKDKD 660
            KGSKPDGSDDNADIPALDIRSSVESPLLWLRADPEMEYLAEIHFHQPVQMWINQLEKDKD
Sbjct: 601  KGSKPDGSDDNADIPALDIRSSVESPLLWLRADPEMEYLAEIHFHQPVQMWINQLEKDKD 660

Query: 661  VIAQAQAIATLEMLPQPSFSIVNALNNFLKDPKAFWRVRIEAALAMAKTASEDTDWAGLL 720
            VIAQAQAIATLEMLPQPSFSIVNALNNFL+DPKAFWRVRIEAALAMAKTASEDTDWAGLL
Sbjct: 661  VIAQAQAIATLEMLPQPSFSIVNALNNFLRDPKAFWRVRIEAALAMAKTASEDTDWAGLL 720

Query: 721  NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRGTDQKSPREAVEFVLQL 780
            NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRGTDQKSPREAVEFVLQL
Sbjct: 721  NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRGTDQKSPREAVEFVLQL 780

Query: 781  LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI 840
            LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI
Sbjct: 781  LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI 840

Query: 841  LTISCIRTLTQIALKLSGLLSLDRIIELIRPFRDFNSMWQVRIEATRSLLDLEYHCNGID 900
            LTISCIRTLTQIALKLSGLLSLDRIIELIRPFRDFNS+WQVRIEATRSLLDLEYHCNGID
Sbjct: 841  LTISCIRTLTQIALKLSGLLSLDRIIELIRPFRDFNSIWQVRIEATRSLLDLEYHCNGID 900

Query: 901  ATLLLFIKYLEEENSLRGQVKLAVHVMRLCQIMRRSGSNDVVNNDTLVALLLLLEGNMAF 960
            ATLLLFIKYLEEENSLRGQ KLAVHVMRLCQIMRRSGSNDVVNNDTLVALLLLLEGNMAF
Sbjct: 901  ATLLLFIKYLEEENSLRGQAKLAVHVMRLCQIMRRSGSNDVVNNDTLVALLLLLEGNMAF 960

Query: 961  NNVYLRHYLFCILQVLSGRSPTLYGVPREYKTLHMGDTGTFSEQKRMLTSLIPEFNPPEP 1020
            NNV LRHYLFCILQVL+GR PTLYGVPREYKTLHMGDTGT SEQKRMLTSLIPEFNPPE 
Sbjct: 961  NNVCLRHYLFCILQVLAGRPPTLYGVPREYKTLHMGDTGTCSEQKRMLTSLIPEFNPPEQ 1020

Query: 1021 SSVSAVAPMPCIPATLSSEPLHVPTPRPDNLAVPELSKEEGAIAEDPKQAMAIVEAPREA 1080
            SSVSAVAPMPCIPA+LSSEPLH PTPRPD+LAVPELSKE G IAE PKQAMAIVEAPREA
Sbjct: 1021 SSVSAVAPMPCIPASLSSEPLHAPTPRPDSLAVPELSKEGGEIAEVPKQAMAIVEAPREA 1080

Query: 1081 ASVSSSHERKLPVVKIKVRSSAATSRADADNLTTERSHAAPRETDVGPSSSVSVDAPPRN 1140
            ASVSSSHERKLPVVKIKVRSSAATSRADADNLTTERSHA PRETDVGPSSSVSVDAPPRN
Sbjct: 1081 ASVSSSHERKLPVVKIKVRSSAATSRADADNLTTERSHAVPRETDVGPSSSVSVDAPPRN 1140

Query: 1141 TAEATSISNRILEEVNSCHDHGSHMTASIGSAKLASYGDELGKEFQCTADSSSRAFGHFQ 1200
             AEATSISNRILEEVNSCHDHGSHMTASIGSAKLASYGDELGKEFQCTADSSSRAFGHFQ
Sbjct: 1141 IAEATSISNRILEEVNSCHDHGSHMTASIGSAKLASYGDELGKEFQCTADSSSRAFGHFQ 1200

Query: 1201 PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQHDHGLASSQSRHGKKEKKKDKEKKRKRE 1260
            PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQ+DHGLASS SRHGKKEKKKDKEKKRKRE
Sbjct: 1201 PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQNDHGLASSHSRHGKKEKKKDKEKKRKRE 1260

Query: 1261 SHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKPQPTAMPRIKEPPTKSTPMQLET 1320
            SHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKPQP AMPRIKEPPTKSTPMQLET
Sbjct: 1261 SHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKPQPLAMPRIKEPPTKSTPMQLET 1320

Query: 1321 NEPSGSRLIIGSVHSKPEASEGTTSAAPKLRIKFKNRTLNNS 1363
            NEPSGSRLIIG V SKPEASEGTTSAAPKLRIKFKNRTLNNS
Sbjct: 1321 NEPSGSRLIIG-VQSKPEASEGTTSAAPKLRIKFKNRTLNNS 1361

BLAST of CSPI01G06630 vs. ExPASy TrEMBL
Match: A0A6J1ED82 (Transcription initiation factor TFIID subunit 2 OS=Cucurbita moschata OX=3662 GN=LOC111431475 PE=3 SV=1)

HSP 1 Score: 2501.1 bits (6481), Expect = 0.0e+00
Identity = 1271/1362 (93.32%), Positives = 1308/1362 (96.04%), Query Frame = 0

Query: 1    MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDIDNRRVYGFTELEIAVPDIGIVGLHAE 60
            MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDID RRVYGFTELEIAVPDIGIVGLHAE
Sbjct: 1    MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDIDKRRVYGFTELEIAVPDIGIVGLHAE 60

Query: 61   NLGIVSVSVDGDPTEFEYYPRPQHVENERSFKAVSSPSSAADAAGSIYLSSIEKELVPNL 120
            NLGIVSVSVDGDPTEFEYYPR QHVE+E+SFKAVSSPSSAADAAGSIYLSSIEKELVPNL
Sbjct: 61   NLGIVSVSVDGDPTEFEYYPRTQHVESEKSFKAVSSPSSAADAAGSIYLSSIEKELVPNL 120

Query: 121  LINCCKAFKSGSEQQDQPFLENGVQTADEDKQNVRLVRIDYWVEKSEVGIHFYNRMAHTD 180
            LINCCKAFK+GSE+QDQPFLENGVQ A EDKQN+RLVRIDYWVEKS+VGIHF + +AHTD
Sbjct: 121  LINCCKAFKNGSEKQDQPFLENGVQPAVEDKQNIRLVRIDYWVEKSDVGIHFNDHIAHTD 180

Query: 181  NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTFVYRVD 240
            NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSK+NPP KT+VYRVD
Sbjct: 181  NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKENPPCKTYVYRVD 240

Query: 241  IPVNARWISLAVGPFEILADHQNVLISHMCSPVNSLKLKHTVDFFHSAFSCYKDYLSVDF 300
            IPVNA WISLAVGPFEILADHQN  ISHMCSPVNSLKLKHTV+FFHSAFSCYKDYLSVDF
Sbjct: 241  IPVNAHWISLAVGPFEILADHQNGFISHMCSPVNSLKLKHTVEFFHSAFSCYKDYLSVDF 300

Query: 301  PFGSYKQIFIEPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGI 360
            PFGSYKQIFIEPEIAVSSACLGVSMCIFSSH LFDEKIIDQTIDTRIKLAYALARQWFGI
Sbjct: 301  PFGSYKQIFIEPEIAVSSACLGVSMCIFSSHFLFDEKIIDQTIDTRIKLAYALARQWFGI 360

Query: 361  YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCSVCRADDCGLTTLSSSS 420
            YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANC+VCRADD G+TTLSSSS
Sbjct: 361  YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCAVCRADDSGITTLSSSS 420

Query: 421  ACKDLHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSHAKDTGSTSQLLS 480
            ACKDL+G Q IGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVS AKDT STSQ LS
Sbjct: 421  ACKDLYGIQRIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSRAKDTTSTSQSLS 480

Query: 481  TKEFRQLANKIGNLERPFLKEFFPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTATP 540
            TKEFR LANKIGNLERPFLKEF PRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECT TP
Sbjct: 481  TKEFRHLANKIGNLERPFLKEFIPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTVTP 540

Query: 541  ATNVENRDSDAGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTK 600
            AT +ENRDSD GWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTK
Sbjct: 541  ATILENRDSDTGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTK 600

Query: 601  KGSKPDGSDDNADIPALDIRSSVESPLLWLRADPEMEYLAEIHFHQPVQMWINQLEKDKD 660
            KGSKPDGSDDNADIPALD+RSSVESPLLWLRADPEMEYLAEI FHQPVQMWINQLEKDKD
Sbjct: 601  KGSKPDGSDDNADIPALDVRSSVESPLLWLRADPEMEYLAEIQFHQPVQMWINQLEKDKD 660

Query: 661  VIAQAQAIATLEMLPQPSFSIVNALNNFLKDPKAFWRVRIEAALAMAKTASEDTDWAGLL 720
            VIAQAQAIATLEMLPQPSFS+VNALNNFL+DPKAFWRVRIEAALAMAKTASEDTDWAGLL
Sbjct: 661  VIAQAQAIATLEMLPQPSFSVVNALNNFLRDPKAFWRVRIEAALAMAKTASEDTDWAGLL 720

Query: 721  NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRGTDQKSPREAVEFVLQL 780
            NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVR TDQKSPREAVEFVLQL
Sbjct: 721  NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRATDQKSPREAVEFVLQL 780

Query: 781  LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI 840
            LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI
Sbjct: 781  LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI 840

Query: 841  LTISCIRTLTQIALKLSGLLSLDRIIELIRPFRDFNSMWQVRIEATRSLLDLEYHCNGID 900
            LTISCIRTLTQIALKLSGLLSLDRIIELIRPFR+FNS WQVRIEATR+LLDLEYHCNGID
Sbjct: 841  LTISCIRTLTQIALKLSGLLSLDRIIELIRPFRNFNSTWQVRIEATRALLDLEYHCNGID 900

Query: 901  ATLLLFIKYLEEENSLRGQVKLAVHVMRLCQIMRRSGSNDVVNNDTLVALLLLLEGNMAF 960
            A LLLFIKYLEEE SLRGQVKL VHVMRLCQIMRRS SND VNNDTLVALLLLLEG+MAF
Sbjct: 901  AVLLLFIKYLEEEKSLRGQVKLGVHVMRLCQIMRRSDSNDSVNNDTLVALLLLLEGHMAF 960

Query: 961  NNVYLRHYLFCILQVLSGRSPTLYGVPREYKTLHMGDTGTFSEQKRMLTSLIPEFNPPEP 1020
            NNV LRHYLFCILQVL+GR PTLYGVPREYKTLHMGD+GT SEQKR+LTSLIPEFNPPEP
Sbjct: 961  NNVCLRHYLFCILQVLAGRPPTLYGVPREYKTLHMGDSGTCSEQKRVLTSLIPEFNPPEP 1020

Query: 1021 SSVSAVAPMPCIPATLSSEPLHVPTPRPDNLAVPELSKEEGAIAEDPKQAMAIVEAPREA 1080
            SSVSAVAP+PCIP TLSSEPLH P PRPD+LA+PE+SKE  A+ E PKQA AI EAPREA
Sbjct: 1021 SSVSAVAPVPCIPVTLSSEPLHTPKPRPDSLAIPEVSKEGAAVVEVPKQATAIAEAPREA 1080

Query: 1081 ASVSSSHERKLPVVKIKVRSSAATSRADADNLTTERSHAAPRETDVGPSSSVSVDAPPRN 1140
            ASVS+SHERKLPVVKIKVRSSAATSRADADN TTERSHAAPRETDVGPSSSVSVDAP RN
Sbjct: 1081 ASVSNSHERKLPVVKIKVRSSAATSRADADNQTTERSHAAPRETDVGPSSSVSVDAPQRN 1140

Query: 1141 TAEATSISNRILEEVNSCHDHGSHMTASIGSAKLASYGDELGKEFQCTADSSSRAFGHFQ 1200
             AEATSISN ILEEVNSCHDHGSHMTASIGSAK ASYGD+LGKEFQCTAD SSRAFGHFQ
Sbjct: 1141 IAEATSISNHILEEVNSCHDHGSHMTASIGSAKPASYGDDLGKEFQCTAD-SSRAFGHFQ 1200

Query: 1201 PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQHDHGLASSQSRHGKKEKKKDKEKKRKRE 1260
            PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQHDHGLAS  SRHGKKEKKKDKEKKRKR+
Sbjct: 1201 PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQHDHGLASLHSRHGKKEKKKDKEKKRKRD 1260

Query: 1261 SHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKPQPTAMPRIKEPPTKSTPMQLET 1320
            SHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKP P AMPR+KEPPTKSTP+QLE+
Sbjct: 1261 SHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKPLPIAMPRMKEPPTKSTPVQLES 1320

Query: 1321 NEPSGSRLIIGSVHSKPEASEGTTSAAPKLRIKFKNRTLNNS 1363
            NEPSGSRLI+G V+SKPEASEGT SAAPKLRIKFKNRTLNNS
Sbjct: 1321 NEPSGSRLIVG-VNSKPEASEGTASAAPKLRIKFKNRTLNNS 1360

BLAST of CSPI01G06630 vs. ExPASy TrEMBL
Match: A0A6J1KGW7 (Transcription initiation factor TFIID subunit 2 OS=Cucurbita maxima OX=3661 GN=LOC111495155 PE=3 SV=1)

HSP 1 Score: 2500.7 bits (6480), Expect = 0.0e+00
Identity = 1268/1362 (93.10%), Positives = 1310/1362 (96.18%), Query Frame = 0

Query: 1    MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDIDNRRVYGFTELEIAVPDIGIVGLHAE 60
            MAKPRKPKNTDD+KPPDNSGAVVRHQKLCLSIDID RRVYGFTELEIAVPDIGIVGLHAE
Sbjct: 1    MAKPRKPKNTDDSKPPDNSGAVVRHQKLCLSIDIDKRRVYGFTELEIAVPDIGIVGLHAE 60

Query: 61   NLGIVSVSVDGDPTEFEYYPRPQHVENERSFKAVSSPSSAADAAGSIYLSSIEKELVPNL 120
            NLGIVSVSVDGDPTEFEYYPR QHVE+E+S+KAVSSPSSAADAAGSIYLSSIEKELVPNL
Sbjct: 61   NLGIVSVSVDGDPTEFEYYPRTQHVESEKSYKAVSSPSSAADAAGSIYLSSIEKELVPNL 120

Query: 121  LINCCKAFKSGSEQQDQPFLENGVQTADEDKQNVRLVRIDYWVEKSEVGIHFYNRMAHTD 180
            LINCCKAFK+GSE+QDQPFLENGVQ A EDKQN+RLVRIDYWVEKS+VGIHF + +AHTD
Sbjct: 121  LINCCKAFKNGSEKQDQPFLENGVQPAVEDKQNIRLVRIDYWVEKSDVGIHFNDHIAHTD 180

Query: 181  NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTFVYRVD 240
            NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPP KT+VYRVD
Sbjct: 181  NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPCKTYVYRVD 240

Query: 241  IPVNARWISLAVGPFEILADHQNVLISHMCSPVNSLKLKHTVDFFHSAFSCYKDYLSVDF 300
            IPVNA WISLAVGPFEILADHQN  ISHMCSPVNSLKLKHTV+FFHSAFSCYKDYLSVDF
Sbjct: 241  IPVNAHWISLAVGPFEILADHQNGFISHMCSPVNSLKLKHTVEFFHSAFSCYKDYLSVDF 300

Query: 301  PFGSYKQIFIEPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGI 360
            PFGSYKQIFIEPEIAVSSACLGVSMCIFSSH LFDEKIIDQTIDTRIKL+YALARQWFGI
Sbjct: 301  PFGSYKQIFIEPEIAVSSACLGVSMCIFSSHFLFDEKIIDQTIDTRIKLSYALARQWFGI 360

Query: 361  YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCSVCRADDCGLTTLSSSS 420
            YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANC+VCRADD G+TTLSSSS
Sbjct: 361  YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCAVCRADDSGITTLSSSS 420

Query: 421  ACKDLHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSHAKDTGSTSQLLS 480
            ACKDL+GTQ IGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVS AKDT STSQ LS
Sbjct: 421  ACKDLYGTQRIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSRAKDTTSTSQSLS 480

Query: 481  TKEFRQLANKIGNLERPFLKEFFPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTATP 540
            TKEFR LANKIGNLERPFLKEF PRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECT TP
Sbjct: 481  TKEFRHLANKIGNLERPFLKEFIPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTVTP 540

Query: 541  ATNVENRDSDAGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTK 600
            AT +ENRDSD GWPGMMSIRIYELDGVFDHPVLPMTGE WQLLEIQCHSKLAARRLQKTK
Sbjct: 541  ATILENRDSDTGWPGMMSIRIYELDGVFDHPVLPMTGEPWQLLEIQCHSKLAARRLQKTK 600

Query: 601  KGSKPDGSDDNADIPALDIRSSVESPLLWLRADPEMEYLAEIHFHQPVQMWINQLEKDKD 660
            KGSKPDGSDDNADIPALD+RSSVESPLLWLRADPEMEYLAEI FHQPVQMWINQLEKDKD
Sbjct: 601  KGSKPDGSDDNADIPALDVRSSVESPLLWLRADPEMEYLAEIQFHQPVQMWINQLEKDKD 660

Query: 661  VIAQAQAIATLEMLPQPSFSIVNALNNFLKDPKAFWRVRIEAALAMAKTASEDTDWAGLL 720
            VIAQAQA+ATLEMLPQPSFS+VNALNNFL+DPKAFWRVRIEAALAMAKTASEDTDWAGLL
Sbjct: 661  VIAQAQAVATLEMLPQPSFSVVNALNNFLRDPKAFWRVRIEAALAMAKTASEDTDWAGLL 720

Query: 721  NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRGTDQKSPREAVEFVLQL 780
            NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVR TDQKSPREAVEFVLQL
Sbjct: 721  NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRATDQKSPREAVEFVLQL 780

Query: 781  LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI 840
            LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI
Sbjct: 781  LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI 840

Query: 841  LTISCIRTLTQIALKLSGLLSLDRIIELIRPFRDFNSMWQVRIEATRSLLDLEYHCNGID 900
            LTISCIRTLTQIALKLSGLLSLDRIIELIRPFR+FNS WQVRIEATR+LLDLEYHCNGID
Sbjct: 841  LTISCIRTLTQIALKLSGLLSLDRIIELIRPFRNFNSTWQVRIEATRALLDLEYHCNGID 900

Query: 901  ATLLLFIKYLEEENSLRGQVKLAVHVMRLCQIMRRSGSNDVVNNDTLVALLLLLEGNMAF 960
            A LLLFIKYLEEE SLRGQVKL VHVMRLCQIMRRS SND VNNDTLVALLLLLEG+MAF
Sbjct: 901  AVLLLFIKYLEEEKSLRGQVKLGVHVMRLCQIMRRSDSNDSVNNDTLVALLLLLEGHMAF 960

Query: 961  NNVYLRHYLFCILQVLSGRSPTLYGVPREYKTLHMGDTGTFSEQKRMLTSLIPEFNPPEP 1020
            NNV LRHYLFCILQVL+GR PTLYGVPREYKTLHMGD+GT SEQKR+LTSLIPEFNPPEP
Sbjct: 961  NNVCLRHYLFCILQVLAGRPPTLYGVPREYKTLHMGDSGTCSEQKRVLTSLIPEFNPPEP 1020

Query: 1021 SSVSAVAPMPCIPATLSSEPLHVPTPRPDNLAVPELSKEEGAIAEDPKQAMAIVEAPREA 1080
            SSVSAVAP+PCIPATLSSEPLH+P P+PD+LA+PE+SKE  A+ E PKQA AI EAPREA
Sbjct: 1021 SSVSAVAPVPCIPATLSSEPLHIPKPKPDSLAIPEVSKEGAAVVEVPKQATAIAEAPREA 1080

Query: 1081 ASVSSSHERKLPVVKIKVRSSAATSRADADNLTTERSHAAPRETDVGPSSSVSVDAPPRN 1140
            ASVS+SHERKLPVVKIKVRSSAATSRADADNLTTERSHAAPRETDVGPSSSVSVDAP RN
Sbjct: 1081 ASVSNSHERKLPVVKIKVRSSAATSRADADNLTTERSHAAPRETDVGPSSSVSVDAPQRN 1140

Query: 1141 TAEATSISNRILEEVNSCHDHGSHMTASIGSAKLASYGDELGKEFQCTADSSSRAFGHFQ 1200
             AEATSISN ILEEVNSCHDHGSHMTASIGSAK ASYGD+LGKEFQCTAD SSRAFGHFQ
Sbjct: 1141 IAEATSISNHILEEVNSCHDHGSHMTASIGSAKPASYGDDLGKEFQCTAD-SSRAFGHFQ 1200

Query: 1201 PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQHDHGLASSQSRHGKKEKKKDKEKKRKRE 1260
            PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQHDHGL S  SRHGKKEKKKDKEKKRKR+
Sbjct: 1201 PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQHDHGLFSLHSRHGKKEKKKDKEKKRKRD 1260

Query: 1261 SHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKPQPTAMPRIKEPPTKSTPMQLET 1320
            SHKEHRNDPEYIER+RLKKEKKQKEKEMAKLLNEEVKP P A+PRIKEPPTKSTP+QLET
Sbjct: 1261 SHKEHRNDPEYIERRRLKKEKKQKEKEMAKLLNEEVKPLPIAIPRIKEPPTKSTPVQLET 1320

Query: 1321 NEPSGSRLIIGSVHSKPEASEGTTSAAPKLRIKFKNRTLNNS 1363
            NEPSGSRLI+G V+SKPEASEGT SAAPKLRIKFKNRTLNNS
Sbjct: 1321 NEPSGSRLIVG-VNSKPEASEGTASAAPKLRIKFKNRTLNNS 1360

BLAST of CSPI01G06630 vs. ExPASy TrEMBL
Match: A0A6J1C691 (Transcription initiation factor TFIID subunit 2 OS=Momordica charantia OX=3673 GN=LOC111008817 PE=3 SV=1)

HSP 1 Score: 2462.6 bits (6381), Expect = 0.0e+00
Identity = 1251/1362 (91.85%), Positives = 1297/1362 (95.23%), Query Frame = 0

Query: 1    MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDIDNRRVYGFTELEIAVPDIGIVGLHAE 60
            MAKPRKPKNTDDAKPPDNSGA+V HQKLC+SIDID RR+YGFTELEIAVPDIGIVGLHAE
Sbjct: 1    MAKPRKPKNTDDAKPPDNSGAIVHHQKLCISIDIDKRRIYGFTELEIAVPDIGIVGLHAE 60

Query: 61   NLGIVSVSVDGDPTEFEYYPRPQHVENERSFKAVSSPSSAADAAGSIYLSSIEKELVPNL 120
            NLGIVSVSVDGDPTEFEYYPR QH E+ERSFKAVSSPSSAAD+AGSIYLSSIEKELVPNL
Sbjct: 61   NLGIVSVSVDGDPTEFEYYPRHQHAESERSFKAVSSPSSAADSAGSIYLSSIEKELVPNL 120

Query: 121  LINCCKAFKSGSEQQDQPFLENGVQTADEDKQNVRLVRIDYWVEKSEVGIHFYNRMAHTD 180
            LINCCKAFK+GSEQQDQPFLENG+Q A EDKQN+RLVRIDYWVEKSEVGIHF + MAHTD
Sbjct: 121  LINCCKAFKNGSEQQDQPFLENGLQPAGEDKQNIRLVRIDYWVEKSEVGIHFSDHMAHTD 180

Query: 181  NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTFVYRVD 240
            NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKT+VY+VD
Sbjct: 181  NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTYVYQVD 240

Query: 241  IPVNARWISLAVGPFEILADHQNVLISHMCSPVNSLKLKHTVDFFHSAFSCYKDYLSVDF 300
            IPVNA WISLAVGPFEILADHQN LISHMC PVNSLKLKHTV+FFHSAFSCYKDYLSVDF
Sbjct: 241  IPVNAHWISLAVGPFEILADHQNGLISHMCLPVNSLKLKHTVEFFHSAFSCYKDYLSVDF 300

Query: 301  PFGSYKQIFIEPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGI 360
            PFGSYKQ+F+EPE+A+SSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGI
Sbjct: 301  PFGSYKQVFVEPEMALSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGI 360

Query: 361  YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCSVCRADDCGLTTLSSSS 420
            YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANC+VCRADD GLTTLSSSS
Sbjct: 361  YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCAVCRADDSGLTTLSSSS 420

Query: 421  ACKDLHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSHAKDTGSTSQLLS 480
             CKDL+GTQ IGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVS AKDT STS+ LS
Sbjct: 421  TCKDLYGTQRIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSRAKDTTSTSRSLS 480

Query: 481  TKEFRQLANKIGNLERPFLKEFFPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTATP 540
            TKEFR LANKIGNLERPFLKEF PRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECT TP
Sbjct: 481  TKEFRHLANKIGNLERPFLKEFIPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTVTP 540

Query: 541  ATNVENRDSDAGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTK 600
            AT+V+NRDSD GWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTK
Sbjct: 541  ATSVDNRDSDTGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTK 600

Query: 601  KGSKPDGSDDNADIPALDIRSSVESPLLWLRADPEMEYLAEIHFHQPVQMWINQLEKDKD 660
            KGSKPDGSDDNADIPALDIRSSVESPLLWLRADPEMEYLAEIHFHQPVQMWINQLEKDKD
Sbjct: 601  KGSKPDGSDDNADIPALDIRSSVESPLLWLRADPEMEYLAEIHFHQPVQMWINQLEKDKD 660

Query: 661  VIAQAQAIATLEMLPQPSFSIVNALNNFLKDPKAFWRVRIEAALAMAKTASEDTDWAGLL 720
            VIAQAQAI TLEMLPQPSFS+VNALNNFL DPKAFWRVRIEAALAMAKTASEDTDWAGLL
Sbjct: 661  VIAQAQAITTLEMLPQPSFSVVNALNNFLTDPKAFWRVRIEAALAMAKTASEDTDWAGLL 720

Query: 721  NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRGTDQKSPREAVEFVLQL 780
            NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVR TDQKSPREAVEFVLQL
Sbjct: 721  NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRATDQKSPREAVEFVLQL 780

Query: 781  LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI 840
            LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI
Sbjct: 781  LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI 840

Query: 841  LTISCIRTLTQIALKLSGLLSLDRIIELIRPFRDFNSMWQVRIEATRSLLDLEYHCNGID 900
            LTISCIRTLTQIALKLSGLLSLDR+ ELIRPFR+FNSMWQVRIEATR+LLDLEYHCNGID
Sbjct: 841  LTISCIRTLTQIALKLSGLLSLDRVFELIRPFRNFNSMWQVRIEATRALLDLEYHCNGID 900

Query: 901  ATLLLFIKYLEEENSLRGQVKLAVHVMRLCQIMRRSGSNDVVNNDTLVALLLLLEGNMAF 960
            A LLLFIKYLEEE SLRGQVKL VHVMRLCQIMRRS SND+VN+DTLVALLLLLEG+MAF
Sbjct: 901  AALLLFIKYLEEEKSLRGQVKLGVHVMRLCQIMRRSDSNDLVNSDTLVALLLLLEGHMAF 960

Query: 961  NNVYLRHYLFCILQVLSGRSPTLYGVPREYKTLHMGDTGTFSEQKRMLTSLIPEFNPPEP 1020
            NNVYLRHYLFCILQVL GR PTLYGVPREYKTLHMG+TGT SEQK++LTSLIPEF+PPEP
Sbjct: 961  NNVYLRHYLFCILQVLGGRPPTLYGVPREYKTLHMGETGTCSEQKKVLTSLIPEFSPPEP 1020

Query: 1021 SSVSAVAPMPCIPATLSSEPLHVPTPRPDNLAVPELSKEEGAIAEDPKQAMAIVEAPREA 1080
            SSVSAVAPMP I ATLSSEPL  P  R D+LA+PE+SK  GAI E PKQAMAIVE PREA
Sbjct: 1021 SSVSAVAPMPSIAATLSSEPLDAPKQRSDSLAIPEVSKGGGAIPEVPKQAMAIVEPPREA 1080

Query: 1081 ASVSSSHERKLPVVKIKVRSSAATSRADADNLTTERSHAAPRETDVGPSSSVSVDAPPRN 1140
            ASVS+S+ERKLPVVKIKVRSSAATSRA+ADN T ERSHAAP ETDVGPSSSVSVDAP RN
Sbjct: 1081 ASVSNSYERKLPVVKIKVRSSAATSRAEADNQTIERSHAAPHETDVGPSSSVSVDAPQRN 1140

Query: 1141 TAEATSISNRILEEVNSCHDHGSHMTASIGSAKLASYGDELGKEFQCTADSSSRAFGHFQ 1200
             AE TSISN+ LEEVNSCHD GS MTASIGSAKLASYGDELGK+FQCTAD SSRAFGHFQ
Sbjct: 1141 IAETTSISNQNLEEVNSCHDQGSRMTASIGSAKLASYGDELGKDFQCTAD-SSRAFGHFQ 1200

Query: 1201 PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQHDHGLASSQSRHGKKEKKKDKEKKRKRE 1260
            PEDPSSSSIIQDNN+DADAQKYASLQTLSLPQHDHGLAS  SRHGKKEKKKD+EKKRKRE
Sbjct: 1201 PEDPSSSSIIQDNNVDADAQKYASLQTLSLPQHDHGLASLHSRHGKKEKKKDREKKRKRE 1260

Query: 1261 SHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKPQPTAMPRIKEPPTKSTPMQLET 1320
            SHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKP P A+P IKEPP K T +QLET
Sbjct: 1261 SHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKPLPIAIPPIKEPPIKPTLVQLET 1320

Query: 1321 NEPSGSRLIIGSVHSKPEASEGTTSAAPKLRIKFKNRTLNNS 1363
            NEPSGSRLI+G+  SKPEASEGTTSAAPKLRIKFKNR LNNS
Sbjct: 1321 NEPSGSRLIVGT-QSKPEASEGTTSAAPKLRIKFKNRMLNNS 1360

BLAST of CSPI01G06630 vs. NCBI nr
Match: XP_004137463.1 (transcription initiation factor TFIID subunit 2 [Cucumis sativus] >KGN64105.1 hypothetical protein Csa_013753 [Cucumis sativus])

HSP 1 Score: 2679.8 bits (6945), Expect = 0.0e+00
Identity = 1358/1362 (99.71%), Positives = 1360/1362 (99.85%), Query Frame = 0

Query: 1    MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDIDNRRVYGFTELEIAVPDIGIVGLHAE 60
            MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDIDNRRVYGFTELEIAVPDIGIVGLHAE
Sbjct: 1    MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDIDNRRVYGFTELEIAVPDIGIVGLHAE 60

Query: 61   NLGIVSVSVDGDPTEFEYYPRPQHVENERSFKAVSSPSSAADAAGSIYLSSIEKELVPNL 120
            NLGIVSVSVDGDPTEFEYYPRPQHVENERSFKAVSSPSSAADAAGSIYLSSIEKELVPNL
Sbjct: 61   NLGIVSVSVDGDPTEFEYYPRPQHVENERSFKAVSSPSSAADAAGSIYLSSIEKELVPNL 120

Query: 121  LINCCKAFKSGSEQQDQPFLENGVQTADEDKQNVRLVRIDYWVEKSEVGIHFYNRMAHTD 180
            LINCCKAFKSGSEQQDQPFLENGVQTADEDKQNVRLVRIDYWVEKSEVGIHFYNRMAHTD
Sbjct: 121  LINCCKAFKSGSEQQDQPFLENGVQTADEDKQNVRLVRIDYWVEKSEVGIHFYNRMAHTD 180

Query: 181  NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTFVYRVD 240
            NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTFVYRVD
Sbjct: 181  NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTFVYRVD 240

Query: 241  IPVNARWISLAVGPFEILADHQNVLISHMCSPVNSLKLKHTVDFFHSAFSCYKDYLSVDF 300
            IPVNARWISLAVGPFEILADHQNVLISHMCSPVNSLKLKHTVDFFHSAFSCYKDYLSVDF
Sbjct: 241  IPVNARWISLAVGPFEILADHQNVLISHMCSPVNSLKLKHTVDFFHSAFSCYKDYLSVDF 300

Query: 301  PFGSYKQIFIEPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGI 360
            PFGSYKQIFIEPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGI
Sbjct: 301  PFGSYKQIFIEPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGI 360

Query: 361  YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCSVCRADDCGLTTLSSSS 420
            YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCSVCRADDCGLTTLSSSS
Sbjct: 361  YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCSVCRADDCGLTTLSSSS 420

Query: 421  ACKDLHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSHAKDTGSTSQLLS 480
            ACKDLHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSHAKDTGSTSQLLS
Sbjct: 421  ACKDLHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSHAKDTGSTSQLLS 480

Query: 481  TKEFRQLANKIGNLERPFLKEFFPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTATP 540
            TKEFRQLANKIGNLERPFLKEFFPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTATP
Sbjct: 481  TKEFRQLANKIGNLERPFLKEFFPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTATP 540

Query: 541  ATNVENRDSDAGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTK 600
            ATNVENRDSDAGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTK
Sbjct: 541  ATNVENRDSDAGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTK 600

Query: 601  KGSKPDGSDDNADIPALDIRSSVESPLLWLRADPEMEYLAEIHFHQPVQMWINQLEKDKD 660
            KGSKPDGSDDNADIPALDIRSSVESPLLWLRADPEMEYLAEIHFHQPVQMWINQLEKDKD
Sbjct: 601  KGSKPDGSDDNADIPALDIRSSVESPLLWLRADPEMEYLAEIHFHQPVQMWINQLEKDKD 660

Query: 661  VIAQAQAIATLEMLPQPSFSIVNALNNFLKDPKAFWRVRIEAALAMAKTASEDTDWAGLL 720
            VIAQAQAIATLEMLPQPSFSIVNALNNFLKDPKAFWRVRIEAALAMAKTASEDTDWAGLL
Sbjct: 661  VIAQAQAIATLEMLPQPSFSIVNALNNFLKDPKAFWRVRIEAALAMAKTASEDTDWAGLL 720

Query: 721  NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRGTDQKSPREAVEFVLQL 780
            NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRGTDQKSPREAVEFVLQL
Sbjct: 721  NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRGTDQKSPREAVEFVLQL 780

Query: 781  LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI 840
            LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI
Sbjct: 781  LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI 840

Query: 841  LTISCIRTLTQIALKLSGLLSLDRIIELIRPFRDFNSMWQVRIEATRSLLDLEYHCNGID 900
            LTISCIRTLTQIALKLSGLLSLDRIIELIRPFRDFNSMWQVRIEATRSLLDLEYHCNGID
Sbjct: 841  LTISCIRTLTQIALKLSGLLSLDRIIELIRPFRDFNSMWQVRIEATRSLLDLEYHCNGID 900

Query: 901  ATLLLFIKYLEEENSLRGQVKLAVHVMRLCQIMRRSGSNDVVNNDTLVALLLLLEGNMAF 960
            ATLLLFIKYLEEENSLRGQVKLAVHVMRLCQIMRRSGSNDVVNNDTLVALLLLLEGNMAF
Sbjct: 901  ATLLLFIKYLEEENSLRGQVKLAVHVMRLCQIMRRSGSNDVVNNDTLVALLLLLEGNMAF 960

Query: 961  NNVYLRHYLFCILQVLSGRSPTLYGVPREYKTLHMGDTGTFSEQKRMLTSLIPEFNPPEP 1020
            NNVYLRHYLF ILQVLSGRSPTLYGVPREYKTLHMGDTGTFSEQKRMLTS+IPEFNPPEP
Sbjct: 961  NNVYLRHYLFSILQVLSGRSPTLYGVPREYKTLHMGDTGTFSEQKRMLTSIIPEFNPPEP 1020

Query: 1021 SSVSAVAPMPCIPATLSSEPLHVPTPRPDNLAVPELSKEEGAIAEDPKQAMAIVEAPREA 1080
            SSVSAVAPMPCIPATLSSEPLHVPTPRPDNLAVPELSKEEGAIAEDPKQAMAIVEAPREA
Sbjct: 1021 SSVSAVAPMPCIPATLSSEPLHVPTPRPDNLAVPELSKEEGAIAEDPKQAMAIVEAPREA 1080

Query: 1081 ASVSSSHERKLPVVKIKVRSSAATSRADADNLTTERSHAAPRETDVGPSSSVSVDAPPRN 1140
            ASVSSSHERKLPVVKIKVRSSAATSRADADNLTTERSHAAPRETDVGPSSSVSVDAPPRN
Sbjct: 1081 ASVSSSHERKLPVVKIKVRSSAATSRADADNLTTERSHAAPRETDVGPSSSVSVDAPPRN 1140

Query: 1141 TAEATSISNRILEEVNSCHDHGSHMTASIGSAKLASYGDELGKEFQCTADSSSRAFGHFQ 1200
            TAEATSISNRILEEVNSCHDHGSHMTASIGSAKLASYGDELGKEFQCTADSSSRAFGHFQ
Sbjct: 1141 TAEATSISNRILEEVNSCHDHGSHMTASIGSAKLASYGDELGKEFQCTADSSSRAFGHFQ 1200

Query: 1201 PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQHDHGLASSQSRHGKKEKKKDKEKKRKRE 1260
            PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQHDHGLASS SRHGKKEKKKDKEKKRKRE
Sbjct: 1201 PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQHDHGLASSHSRHGKKEKKKDKEKKRKRE 1260

Query: 1261 SHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKPQPTAMPRIKEPPTKSTPMQLET 1320
            SHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKPQPTAMPRIKEPPTKSTP+QLET
Sbjct: 1261 SHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKPQPTAMPRIKEPPTKSTPVQLET 1320

Query: 1321 NEPSGSRLIIGSVHSKPEASEGTTSAAPKLRIKFKNRTLNNS 1363
            NEPSGSRLIIGSVHSKPEASEGTTSAAPKLRIKFKNRTLNNS
Sbjct: 1321 NEPSGSRLIIGSVHSKPEASEGTTSAAPKLRIKFKNRTLNNS 1362

BLAST of CSPI01G06630 vs. NCBI nr
Match: XP_008456406.1 (PREDICTED: transcription initiation factor TFIID subunit 2 [Cucumis melo])

HSP 1 Score: 2616.6 bits (6781), Expect = 0.0e+00
Identity = 1328/1362 (97.50%), Positives = 1343/1362 (98.60%), Query Frame = 0

Query: 1    MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDIDNRRVYGFTELEIAVPDIGIVGLHAE 60
            MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDIDNRRVYGFTELEIAVPDIGIVGLHAE
Sbjct: 1    MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDIDNRRVYGFTELEIAVPDIGIVGLHAE 60

Query: 61   NLGIVSVSVDGDPTEFEYYPRPQHVENERSFKAVSSPSSAADAAGSIYLSSIEKELVPNL 120
            NLGI+SVSVDGDPTEFEYYPRPQHVE+ERSFKAV+SPSSAADAAGSIY+SSIEKELVPNL
Sbjct: 61   NLGILSVSVDGDPTEFEYYPRPQHVESERSFKAVASPSSAADAAGSIYMSSIEKELVPNL 120

Query: 121  LINCCKAFKSGSEQQDQPFLENGVQTADEDKQNVRLVRIDYWVEKSEVGIHFYNRMAHTD 180
            LINCCKAFKSGSEQQDQPFLENGVQ ADEDKQNVRLVRIDYWVEKSEVGIHFYNR+AHTD
Sbjct: 121  LINCCKAFKSGSEQQDQPFLENGVQPADEDKQNVRLVRIDYWVEKSEVGIHFYNRLAHTD 180

Query: 181  NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTFVYRVD 240
            NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKT+VYRVD
Sbjct: 181  NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTYVYRVD 240

Query: 241  IPVNARWISLAVGPFEILADHQNVLISHMCSPVNSLKLKHTVDFFHSAFSCYKDYLSVDF 300
            IPVNARWISLAVGPFEILADHQNVLISHMCSPVNSLKLKHTVDFFH AFSCYKDYLSVDF
Sbjct: 241  IPVNARWISLAVGPFEILADHQNVLISHMCSPVNSLKLKHTVDFFHGAFSCYKDYLSVDF 300

Query: 301  PFGSYKQIFIEPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGI 360
            PFGSYKQIFIEPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGI
Sbjct: 301  PFGSYKQIFIEPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGI 360

Query: 361  YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCSVCRADDCGLTTLSSSS 420
            YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANC+VCRADD GLTTLSSS+
Sbjct: 361  YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCAVCRADDSGLTTLSSSA 420

Query: 421  ACKDLHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSHAKDTGSTSQLLS 480
            ACKDLHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSHAKDTGS SQLLS
Sbjct: 421  ACKDLHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSHAKDTGSASQLLS 480

Query: 481  TKEFRQLANKIGNLERPFLKEFFPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTATP 540
            TKEFRQLANKIGNLERPFLKEFFPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTATP
Sbjct: 481  TKEFRQLANKIGNLERPFLKEFFPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTATP 540

Query: 541  ATNVENRDSDAGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTK 600
            AT+VENRDSDAGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTK
Sbjct: 541  ATSVENRDSDAGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTK 600

Query: 601  KGSKPDGSDDNADIPALDIRSSVESPLLWLRADPEMEYLAEIHFHQPVQMWINQLEKDKD 660
            KGSKPDGSDDNADIPALDIRSSVESPLLWLRADPEMEYLAEIHFHQPVQMWINQLEKDKD
Sbjct: 601  KGSKPDGSDDNADIPALDIRSSVESPLLWLRADPEMEYLAEIHFHQPVQMWINQLEKDKD 660

Query: 661  VIAQAQAIATLEMLPQPSFSIVNALNNFLKDPKAFWRVRIEAALAMAKTASEDTDWAGLL 720
            VIAQAQAIATLEMLPQPSFSIVNALNNFL+DPKAFWRVRIEAALAMAKTASEDTDWAGLL
Sbjct: 661  VIAQAQAIATLEMLPQPSFSIVNALNNFLRDPKAFWRVRIEAALAMAKTASEDTDWAGLL 720

Query: 721  NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRGTDQKSPREAVEFVLQL 780
            NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRGTDQKSPREAVEFVLQL
Sbjct: 721  NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRGTDQKSPREAVEFVLQL 780

Query: 781  LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI 840
            LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI
Sbjct: 781  LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI 840

Query: 841  LTISCIRTLTQIALKLSGLLSLDRIIELIRPFRDFNSMWQVRIEATRSLLDLEYHCNGID 900
            LTISCIRTLTQIALKLSGLLSLDRIIELIRPFRDFNS+WQVRIEATRSLLDLEYHCNGID
Sbjct: 841  LTISCIRTLTQIALKLSGLLSLDRIIELIRPFRDFNSIWQVRIEATRSLLDLEYHCNGID 900

Query: 901  ATLLLFIKYLEEENSLRGQVKLAVHVMRLCQIMRRSGSNDVVNNDTLVALLLLLEGNMAF 960
            ATLLLFIKYLEEENSLRGQ KLAVHVMRLCQIMRRSGSNDVVNNDTLVALLLLLEGNMAF
Sbjct: 901  ATLLLFIKYLEEENSLRGQAKLAVHVMRLCQIMRRSGSNDVVNNDTLVALLLLLEGNMAF 960

Query: 961  NNVYLRHYLFCILQVLSGRSPTLYGVPREYKTLHMGDTGTFSEQKRMLTSLIPEFNPPEP 1020
            NNV LRHYLFCILQVL+GR PTLYGVPREYKTLHMGDTGT SEQKRMLTSLIPEFNPPE 
Sbjct: 961  NNVCLRHYLFCILQVLAGRPPTLYGVPREYKTLHMGDTGTCSEQKRMLTSLIPEFNPPEQ 1020

Query: 1021 SSVSAVAPMPCIPATLSSEPLHVPTPRPDNLAVPELSKEEGAIAEDPKQAMAIVEAPREA 1080
            SSVSAVAPMPCIPA+LSSEPLH PTPRPD+LAVPELSKE G IAE PKQAMAIVEAPREA
Sbjct: 1021 SSVSAVAPMPCIPASLSSEPLHAPTPRPDSLAVPELSKEGGEIAEVPKQAMAIVEAPREA 1080

Query: 1081 ASVSSSHERKLPVVKIKVRSSAATSRADADNLTTERSHAAPRETDVGPSSSVSVDAPPRN 1140
            ASVSSSHERKLPVVKIKVRSSAATSRADADNLTTERSHA PRETDVGPSSSVSVDAPPRN
Sbjct: 1081 ASVSSSHERKLPVVKIKVRSSAATSRADADNLTTERSHAVPRETDVGPSSSVSVDAPPRN 1140

Query: 1141 TAEATSISNRILEEVNSCHDHGSHMTASIGSAKLASYGDELGKEFQCTADSSSRAFGHFQ 1200
             AEATSISNRILEEVNSCHDHGSHMTASIGSAKLASYGDELGKEFQCTADSSSRAFGHFQ
Sbjct: 1141 IAEATSISNRILEEVNSCHDHGSHMTASIGSAKLASYGDELGKEFQCTADSSSRAFGHFQ 1200

Query: 1201 PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQHDHGLASSQSRHGKKEKKKDKEKKRKRE 1260
            PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQ+DHGLASS SRHGKKEKKKDKEKKRKRE
Sbjct: 1201 PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQNDHGLASSHSRHGKKEKKKDKEKKRKRE 1260

Query: 1261 SHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKPQPTAMPRIKEPPTKSTPMQLET 1320
            SHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKPQP AMPRIKEPPTKSTPMQLET
Sbjct: 1261 SHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKPQPLAMPRIKEPPTKSTPMQLET 1320

Query: 1321 NEPSGSRLIIGSVHSKPEASEGTTSAAPKLRIKFKNRTLNNS 1363
            NEPSGSRLIIG V SKPEASEGTTSAAPKLRIKFKNRTLNNS
Sbjct: 1321 NEPSGSRLIIG-VQSKPEASEGTTSAAPKLRIKFKNRTLNNS 1361

BLAST of CSPI01G06630 vs. NCBI nr
Match: XP_038894650.1 (transcription initiation factor TFIID subunit 2 isoform X1 [Benincasa hispida])

HSP 1 Score: 2519.6 bits (6529), Expect = 0.0e+00
Identity = 1285/1363 (94.28%), Positives = 1316/1363 (96.55%), Query Frame = 0

Query: 1    MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDIDNRRVYGFTELEIAVPDIGIVGLHAE 60
            MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDID RRVYGFTELEIAVPDIGIVGLHAE
Sbjct: 1    MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDIDKRRVYGFTELEIAVPDIGIVGLHAE 60

Query: 61   NLGIVSVSVDGDPTEFEYYPRPQHVENERSFKAVSSPSSAADAAGSIYLSSIEKELVPNL 120
            NLGIVSVSVDGDPTEFEYYPRPQHVE+E+SFKAVSSP+SAAD A SIY+SSIEKELVPNL
Sbjct: 61   NLGIVSVSVDGDPTEFEYYPRPQHVESEKSFKAVSSPNSAADVASSIYMSSIEKELVPNL 120

Query: 121  LINCCKAFKSGSEQQDQPFLENGVQTADEDKQNVRLVRIDYWVEKSEVGIHFYNRMAHTD 180
            LINCCKAFKSGSEQ DQPFLENGVQ ADEDKQNVRLVRIDYWVEKSEVGIHFYN MAHTD
Sbjct: 121  LINCCKAFKSGSEQHDQPFLENGVQPADEDKQNVRLVRIDYWVEKSEVGIHFYNHMAHTD 180

Query: 181  NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTFVYRVD 240
            NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKT+VY+VD
Sbjct: 181  NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTYVYQVD 240

Query: 241  IPVNARWISLAVGPFEILADHQNVLISHMCSPVNSLKLKHTVDFFHSAFSCYKDYLSVDF 300
            IPVNA WISLAVGPFEILADHQN LISHMCSPVNSLKLKHTV+FFHSAFSCYKDYLSVDF
Sbjct: 241  IPVNAHWISLAVGPFEILADHQNGLISHMCSPVNSLKLKHTVEFFHSAFSCYKDYLSVDF 300

Query: 301  PFGSYKQIFIEPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGI 360
            PFGSYKQIFIEPEIAVSS CLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGI
Sbjct: 301  PFGSYKQIFIEPEIAVSSICLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGI 360

Query: 361  YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCSVCRADDCGLTTLSSSS 420
            YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANC+VCRADD GLTTLSSSS
Sbjct: 361  YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCAVCRADDSGLTTLSSSS 420

Query: 421  ACKDLHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSHAKDTGSTSQLLS 480
            ACKDL+GTQ IGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVS A+DT STSQ LS
Sbjct: 421  ACKDLYGTQRIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSRARDTASTSQSLS 480

Query: 481  TKEFRQLANKIGNLERPFLKEFFPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTATP 540
            TKEFRQLANKIGNLERPFLKEFFPRWVESCGCP+LRMGFSYNKRKNMVEMAVSRECT +P
Sbjct: 481  TKEFRQLANKIGNLERPFLKEFFPRWVESCGCPVLRMGFSYNKRKNMVEMAVSRECTVSP 540

Query: 541  ATNVENRDSDAGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTK 600
            AT+VENRDSD GWPGMMSIRIYELDGVFDHPVLPMTGE+WQLLEIQCHSKLAARRLQKTK
Sbjct: 541  ATSVENRDSDTGWPGMMSIRIYELDGVFDHPVLPMTGETWQLLEIQCHSKLAARRLQKTK 600

Query: 601  KGSKPDGSDDNADIPALDIRSSVESPLLWLRADPEMEYLAEIHFHQPVQMWINQLEKDKD 660
            KGSKPDGSDDNADIPALD+RSSVESPLLWLRADPEMEYLAEIHFHQP+QMWINQLEKDKD
Sbjct: 601  KGSKPDGSDDNADIPALDVRSSVESPLLWLRADPEMEYLAEIHFHQPIQMWINQLEKDKD 660

Query: 661  VIAQAQAIATLEMLPQPSFSIVNALNNFLKDPKAFWRVRIEAALAMAKTASEDTDWAGLL 720
            VIAQAQAIATLEMLPQPSF++VNALNNFL+DPKAFWRVRIEAA+AMAKTASEDTDWAGLL
Sbjct: 661  VIAQAQAIATLEMLPQPSFAVVNALNNFLRDPKAFWRVRIEAAVAMAKTASEDTDWAGLL 720

Query: 721  NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRGTDQKSPREAVEFVLQL 780
            NLIKFFKSQRFDADTGLPKPNEFRDFPEYF+LEAIPHAVAMVR +DQKSPREAVEFVLQL
Sbjct: 721  NLIKFFKSQRFDADTGLPKPNEFRDFPEYFLLEAIPHAVAMVRTSDQKSPREAVEFVLQL 780

Query: 781  LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI 840
            LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI
Sbjct: 781  LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI 840

Query: 841  LTISCIRTLTQIALKLSGLLSLDRIIELIRPFRDFNSMWQVRIEATRSLLDLEYHCNGID 900
            LTISCIRTLTQIALKLSGLLSLDRI ELIRPFRDF+SMWQVRIEATR+LLDLEYHCNGID
Sbjct: 841  LTISCIRTLTQIALKLSGLLSLDRISELIRPFRDFSSMWQVRIEATRALLDLEYHCNGID 900

Query: 901  ATLLLFIKYLEEENSLRGQVKLAVHVMRLCQIMRRSGSNDVVNNDTLVALLLLLEGNMAF 960
            A LLLFIKYLEEE+SLRGQVKL V VMRLCQIMRRS SNDVVNNDTLVALLLLLEG MAF
Sbjct: 901  AALLLFIKYLEEESSLRGQVKLGVQVMRLCQIMRRSDSNDVVNNDTLVALLLLLEGQMAF 960

Query: 961  NNVYLRHYLFCILQVLSGRSPTLYGVPREYKTLHMGDTGTFSEQKRMLTSLIPEFNPPEP 1020
            NNVYLRHYLFCILQVL+GR PTLYGVPREYKTLHMGDTGT SEQKRMLTSLIPEFNPPEP
Sbjct: 961  NNVYLRHYLFCILQVLAGRPPTLYGVPREYKTLHMGDTGTCSEQKRMLTSLIPEFNPPEP 1020

Query: 1021 SSVSAVAPMPCIPATLSSEPLHVPTPRPDNLAVPELSKEEGAIAEDPKQAMAIVEAPREA 1080
            SSVSAVAPMP IPATLSSEPLH PT RPD+LA+PELSKE GAIAE PKQA AI EAPREA
Sbjct: 1021 SSVSAVAPMPYIPATLSSEPLHTPTLRPDSLAIPELSKEGGAIAEVPKQARAIAEAPREA 1080

Query: 1081 ASVSSSHERKLPVVKIKVRSSAATSRADADNLTTERSHAAPRETDVGPSSSVSVDAPPRN 1140
            ASVS+SHERKLPVVKIKVRSSAATSRADADNLTTERSHAAPRETDVGPSSSVSVDAP RN
Sbjct: 1081 ASVSNSHERKLPVVKIKVRSSAATSRADADNLTTERSHAAPRETDVGPSSSVSVDAPQRN 1140

Query: 1141 TAEATSISNRILEEVNSCHDHGSHMTASIGSAKLASYGDELGKEFQCTADSSSRAFGHFQ 1200
             AEATSISN ILEEVNSCHD GSHMTASIGSAKL SYGDEL KEFQCTAD SSRAFGHFQ
Sbjct: 1141 IAEATSISNHILEEVNSCHDRGSHMTASIGSAKLTSYGDELEKEFQCTAD-SSRAFGHFQ 1200

Query: 1201 PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQHDHGLASSQSRHGKKE-KKKDKEKKRKR 1260
            PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQHDHGLASS SRHGKKE KKKDKEKKRKR
Sbjct: 1201 PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQHDHGLASSHSRHGKKEKKKKDKEKKRKR 1260

Query: 1261 ESHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKPQPTAMPRIKEPPTKSTPMQLE 1320
            E HKEHRNDPEYIERKRLKKEKKQKEKEMAKLL EEVKP P AMPRIKEPPTKSTP+QLE
Sbjct: 1261 E-HKEHRNDPEYIERKRLKKEKKQKEKEMAKLLIEEVKPLPIAMPRIKEPPTKSTPVQLE 1320

Query: 1321 TNEPSGSRLIIGSVHSKPEASEGTTSAAPKLRIKFKNRTLNNS 1363
             NEPSGSRLI+G VHSKPEASEGTTSAAPKLRIKFKNRTLNNS
Sbjct: 1321 ANEPSGSRLIVG-VHSKPEASEGTTSAAPKLRIKFKNRTLNNS 1360

BLAST of CSPI01G06630 vs. NCBI nr
Match: XP_023519716.1 (transcription initiation factor TFIID subunit 2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 2502.2 bits (6484), Expect = 0.0e+00
Identity = 1272/1362 (93.39%), Positives = 1309/1362 (96.11%), Query Frame = 0

Query: 1    MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDIDNRRVYGFTELEIAVPDIGIVGLHAE 60
            MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDID RRVYGFTELEIAVPDIGIVGLHAE
Sbjct: 1    MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDIDKRRVYGFTELEIAVPDIGIVGLHAE 60

Query: 61   NLGIVSVSVDGDPTEFEYYPRPQHVENERSFKAVSSPSSAADAAGSIYLSSIEKELVPNL 120
            NLGIVSVSVDGDPTEFEYYPR QHVE+E+SFKAVSSP+SAADAAGSIYLSSIEKELVPNL
Sbjct: 61   NLGIVSVSVDGDPTEFEYYPRTQHVESEKSFKAVSSPTSAADAAGSIYLSSIEKELVPNL 120

Query: 121  LINCCKAFKSGSEQQDQPFLENGVQTADEDKQNVRLVRIDYWVEKSEVGIHFYNRMAHTD 180
            LINCCKAFK+GSE+QDQPFLENGVQ A EDKQN+RLVRIDYWVEKS+VGIHF + +AHTD
Sbjct: 121  LINCCKAFKNGSEKQDQPFLENGVQPAVEDKQNIRLVRIDYWVEKSDVGIHFNDHIAHTD 180

Query: 181  NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTFVYRVD 240
            NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSK+NPP KT+VYRVD
Sbjct: 181  NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKENPPCKTYVYRVD 240

Query: 241  IPVNARWISLAVGPFEILADHQNVLISHMCSPVNSLKLKHTVDFFHSAFSCYKDYLSVDF 300
            IPVNA WISLAVGPFEILADHQN  ISHMCSPVNSLKLKHTV+FFHSAFSCYKDYLSVDF
Sbjct: 241  IPVNAHWISLAVGPFEILADHQNGFISHMCSPVNSLKLKHTVEFFHSAFSCYKDYLSVDF 300

Query: 301  PFGSYKQIFIEPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGI 360
            PFGSYKQIFIEPEIAVSSACLGVSMCIFSSH LFDEKIIDQTIDTRIKLAYALARQWFGI
Sbjct: 301  PFGSYKQIFIEPEIAVSSACLGVSMCIFSSHFLFDEKIIDQTIDTRIKLAYALARQWFGI 360

Query: 361  YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCSVCRADDCGLTTLSSSS 420
            YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANC+VCRADD G+TTLSSSS
Sbjct: 361  YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCAVCRADDSGITTLSSSS 420

Query: 421  ACKDLHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSHAKDTGSTSQLLS 480
            ACKDL+GTQ IGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVS AKDT STSQ LS
Sbjct: 421  ACKDLYGTQRIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSRAKDTTSTSQSLS 480

Query: 481  TKEFRQLANKIGNLERPFLKEFFPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTATP 540
            TKEFR LANKIGNLERPFLKEF PRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECT TP
Sbjct: 481  TKEFRHLANKIGNLERPFLKEFIPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTVTP 540

Query: 541  ATNVENRDSDAGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTK 600
            AT +ENRDSD GWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTK
Sbjct: 541  ATILENRDSDTGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTK 600

Query: 601  KGSKPDGSDDNADIPALDIRSSVESPLLWLRADPEMEYLAEIHFHQPVQMWINQLEKDKD 660
            KGSKPDGSDDNADIPALD+RSSVESPLLWLRADPEMEYLAEI FHQPVQMWINQLEKDKD
Sbjct: 601  KGSKPDGSDDNADIPALDVRSSVESPLLWLRADPEMEYLAEIQFHQPVQMWINQLEKDKD 660

Query: 661  VIAQAQAIATLEMLPQPSFSIVNALNNFLKDPKAFWRVRIEAALAMAKTASEDTDWAGLL 720
            VIAQAQAIATLEMLPQPSFS+VNALNNFL+DPKAFWRVRIEAALAMAKTASEDTDWAGLL
Sbjct: 661  VIAQAQAIATLEMLPQPSFSVVNALNNFLRDPKAFWRVRIEAALAMAKTASEDTDWAGLL 720

Query: 721  NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRGTDQKSPREAVEFVLQL 780
            NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVR TDQKSPREAVEFVLQL
Sbjct: 721  NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRATDQKSPREAVEFVLQL 780

Query: 781  LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI 840
            LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSIL LASLLKRIDRLLQFDRLMPSYNGI
Sbjct: 781  LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILVLASLLKRIDRLLQFDRLMPSYNGI 840

Query: 841  LTISCIRTLTQIALKLSGLLSLDRIIELIRPFRDFNSMWQVRIEATRSLLDLEYHCNGID 900
            LTISCIRTLTQIALKLSGLLSLDRIIELIRPFR+FNS WQVRIEATR+LLDLEYHCNGID
Sbjct: 841  LTISCIRTLTQIALKLSGLLSLDRIIELIRPFRNFNSTWQVRIEATRALLDLEYHCNGID 900

Query: 901  ATLLLFIKYLEEENSLRGQVKLAVHVMRLCQIMRRSGSNDVVNNDTLVALLLLLEGNMAF 960
            A LLLFIKYLEEE SLRGQVKL VHVMRLCQIMRRS SND VNNDTLVALLLLLEG+MAF
Sbjct: 901  AVLLLFIKYLEEEKSLRGQVKLGVHVMRLCQIMRRSDSNDSVNNDTLVALLLLLEGHMAF 960

Query: 961  NNVYLRHYLFCILQVLSGRSPTLYGVPREYKTLHMGDTGTFSEQKRMLTSLIPEFNPPEP 1020
            NNV LRHYLFCILQVL+GR PTLYGVPREYKTLHMGD+GT SEQKR+LTSLIPEFNPPEP
Sbjct: 961  NNVCLRHYLFCILQVLAGRPPTLYGVPREYKTLHMGDSGTCSEQKRVLTSLIPEFNPPEP 1020

Query: 1021 SSVSAVAPMPCIPATLSSEPLHVPTPRPDNLAVPELSKEEGAIAEDPKQAMAIVEAPREA 1080
            SSVSAVAP+PCIPATLSSEPLH P PRPD+LA+PE+SKE  A+ E PKQA AI EAPREA
Sbjct: 1021 SSVSAVAPVPCIPATLSSEPLHTPKPRPDSLAIPEVSKEGAAVVEVPKQATAIAEAPREA 1080

Query: 1081 ASVSSSHERKLPVVKIKVRSSAATSRADADNLTTERSHAAPRETDVGPSSSVSVDAPPRN 1140
            ASVS+SHERKLPVVKIKVRSSAATSRADADN TTERSHAAPRETDVGPSSSVSVDAP RN
Sbjct: 1081 ASVSNSHERKLPVVKIKVRSSAATSRADADNQTTERSHAAPRETDVGPSSSVSVDAPQRN 1140

Query: 1141 TAEATSISNRILEEVNSCHDHGSHMTASIGSAKLASYGDELGKEFQCTADSSSRAFGHFQ 1200
             AEATSISN ILEEVNSCHDHGSHMTASIGSAK ASYGD+LGKEFQCTAD SSRAFGHFQ
Sbjct: 1141 IAEATSISNHILEEVNSCHDHGSHMTASIGSAKPASYGDDLGKEFQCTAD-SSRAFGHFQ 1200

Query: 1201 PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQHDHGLASSQSRHGKKEKKKDKEKKRKRE 1260
            PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQHDHGLAS  SRHGKKEKKKDKEKKRKR+
Sbjct: 1201 PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQHDHGLASLHSRHGKKEKKKDKEKKRKRD 1260

Query: 1261 SHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKPQPTAMPRIKEPPTKSTPMQLET 1320
            SHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKP P AMPRIKEPPTKSTP+QLE+
Sbjct: 1261 SHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKPLPIAMPRIKEPPTKSTPVQLES 1320

Query: 1321 NEPSGSRLIIGSVHSKPEASEGTTSAAPKLRIKFKNRTLNNS 1363
            NEPSGSRLI+G V+SKPEASEGT SAAPKLRIKFKNRTLNNS
Sbjct: 1321 NEPSGSRLIVG-VNSKPEASEGTASAAPKLRIKFKNRTLNNS 1360

BLAST of CSPI01G06630 vs. NCBI nr
Match: KAG6584373.1 (Transcription initiation factor TFIID subunit 2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 2501.5 bits (6482), Expect = 0.0e+00
Identity = 1271/1362 (93.32%), Positives = 1308/1362 (96.04%), Query Frame = 0

Query: 1    MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDIDNRRVYGFTELEIAVPDIGIVGLHAE 60
            MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDID RRVYGFTELEIAVPDIGIVGLHAE
Sbjct: 1    MAKPRKPKNTDDAKPPDNSGAVVRHQKLCLSIDIDKRRVYGFTELEIAVPDIGIVGLHAE 60

Query: 61   NLGIVSVSVDGDPTEFEYYPRPQHVENERSFKAVSSPSSAADAAGSIYLSSIEKELVPNL 120
            NLGIVSVSVDGDPTEFEYYPR QHVE+E+SFKAVSSPSSAADAAGSIYLSSIEKELVPNL
Sbjct: 61   NLGIVSVSVDGDPTEFEYYPRTQHVESEKSFKAVSSPSSAADAAGSIYLSSIEKELVPNL 120

Query: 121  LINCCKAFKSGSEQQDQPFLENGVQTADEDKQNVRLVRIDYWVEKSEVGIHFYNRMAHTD 180
            LINCCKAFK+GSE+QDQPFLENGVQ A EDKQN+RLVRIDYWVEKS+VGIHF + +AHTD
Sbjct: 121  LINCCKAFKNGSEKQDQPFLENGVQPAVEDKQNIRLVRIDYWVEKSDVGIHFNDHIAHTD 180

Query: 181  NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTFVYRVD 240
            NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSK+NPP KT+VYRVD
Sbjct: 181  NQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKENPPCKTYVYRVD 240

Query: 241  IPVNARWISLAVGPFEILADHQNVLISHMCSPVNSLKLKHTVDFFHSAFSCYKDYLSVDF 300
            IPVNA WISLAVGPFEILADHQN  ISHMCSPVNSLKLKHTV+FFHSAFSCYKDYLSVDF
Sbjct: 241  IPVNAHWISLAVGPFEILADHQNGFISHMCSPVNSLKLKHTVEFFHSAFSCYKDYLSVDF 300

Query: 301  PFGSYKQIFIEPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWFGI 360
            PFGSYKQIFIEPEIAVSSACLGVSMCIFSSH LFDEKIIDQTIDTRIKLAYALARQWFGI
Sbjct: 301  PFGSYKQIFIEPEIAVSSACLGVSMCIFSSHFLFDEKIIDQTIDTRIKLAYALARQWFGI 360

Query: 361  YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCSVCRADDCGLTTLSSSS 420
            YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANC+VCRADD G+TTLSSSS
Sbjct: 361  YITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCAVCRADDSGITTLSSSS 420

Query: 421  ACKDLHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSHAKDTGSTSQLLS 480
            ACKDL+GTQ IGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVS AKDT STSQ LS
Sbjct: 421  ACKDLYGTQRIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSRAKDTTSTSQSLS 480

Query: 481  TKEFRQLANKIGNLERPFLKEFFPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTATP 540
            TKEFR LANKIGNLERPFLKEF PRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECT TP
Sbjct: 481  TKEFRHLANKIGNLERPFLKEFIPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTVTP 540

Query: 541  ATNVENRDSDAGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTK 600
            AT +ENRDSD GWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTK
Sbjct: 541  ATILENRDSDTGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTK 600

Query: 601  KGSKPDGSDDNADIPALDIRSSVESPLLWLRADPEMEYLAEIHFHQPVQMWINQLEKDKD 660
            KGSKPDGSDDNADIPALD+RSSVESPLLWLRADPEMEYLAEI FHQPVQMWINQLEKDKD
Sbjct: 601  KGSKPDGSDDNADIPALDVRSSVESPLLWLRADPEMEYLAEIQFHQPVQMWINQLEKDKD 660

Query: 661  VIAQAQAIATLEMLPQPSFSIVNALNNFLKDPKAFWRVRIEAALAMAKTASEDTDWAGLL 720
            VIAQAQA ATLEMLPQPSFS+VNALNNFL+DPKAFWRVRIEAALAMAKTASEDTDWAGLL
Sbjct: 661  VIAQAQATATLEMLPQPSFSVVNALNNFLRDPKAFWRVRIEAALAMAKTASEDTDWAGLL 720

Query: 721  NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRGTDQKSPREAVEFVLQL 780
            NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVR TDQKSPREAVEFVLQL
Sbjct: 721  NLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRATDQKSPREAVEFVLQL 780

Query: 781  LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI 840
            LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI
Sbjct: 781  LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGI 840

Query: 841  LTISCIRTLTQIALKLSGLLSLDRIIELIRPFRDFNSMWQVRIEATRSLLDLEYHCNGID 900
            LTISCIRTLTQIALKLSGLLSLDRIIELIRPFR+FNS WQVRIEATR+LLDLEYHCNGID
Sbjct: 841  LTISCIRTLTQIALKLSGLLSLDRIIELIRPFRNFNSTWQVRIEATRALLDLEYHCNGID 900

Query: 901  ATLLLFIKYLEEENSLRGQVKLAVHVMRLCQIMRRSGSNDVVNNDTLVALLLLLEGNMAF 960
            A LLLFIKYLEEE SLRGQVKL VHVMRLCQIMRRS SND VNNDTLVALLLLLEG+MAF
Sbjct: 901  AVLLLFIKYLEEEKSLRGQVKLGVHVMRLCQIMRRSDSNDSVNNDTLVALLLLLEGHMAF 960

Query: 961  NNVYLRHYLFCILQVLSGRSPTLYGVPREYKTLHMGDTGTFSEQKRMLTSLIPEFNPPEP 1020
            NNV LRHYLFCILQVL+GR PTLYGVPREYKTLHMGD+GT SEQKR+LTSLIPEFNPPEP
Sbjct: 961  NNVCLRHYLFCILQVLAGRPPTLYGVPREYKTLHMGDSGTCSEQKRVLTSLIPEFNPPEP 1020

Query: 1021 SSVSAVAPMPCIPATLSSEPLHVPTPRPDNLAVPELSKEEGAIAEDPKQAMAIVEAPREA 1080
            SSVSAVAP+PCIP TLSSEPLH P PRPD+LA+PE+SKE  A+ E PKQA AI EAPREA
Sbjct: 1021 SSVSAVAPVPCIPVTLSSEPLHTPKPRPDSLAIPEVSKEGAAVVEVPKQATAIAEAPREA 1080

Query: 1081 ASVSSSHERKLPVVKIKVRSSAATSRADADNLTTERSHAAPRETDVGPSSSVSVDAPPRN 1140
            ASVS+SHERKLPVVKIKVRSSAATSRADADN TTERSHAAPRETDVGPSSSVSVDAP RN
Sbjct: 1081 ASVSNSHERKLPVVKIKVRSSAATSRADADNQTTERSHAAPRETDVGPSSSVSVDAPQRN 1140

Query: 1141 TAEATSISNRILEEVNSCHDHGSHMTASIGSAKLASYGDELGKEFQCTADSSSRAFGHFQ 1200
             AEATSISN ILEEVNSCHDHGSHMTASIGSAK ASYGD+LGKEFQCTAD SSRAFGHFQ
Sbjct: 1141 IAEATSISNHILEEVNSCHDHGSHMTASIGSAKPASYGDDLGKEFQCTAD-SSRAFGHFQ 1200

Query: 1201 PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQHDHGLASSQSRHGKKEKKKDKEKKRKRE 1260
            PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQHDHGLAS  SRHGKKEKKKDKEKKRKR+
Sbjct: 1201 PEDPSSSSIIQDNNIDADAQKYASLQTLSLPQHDHGLASLHSRHGKKEKKKDKEKKRKRD 1260

Query: 1261 SHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKPQPTAMPRIKEPPTKSTPMQLET 1320
            SHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKP P AMPR+KEPPTKSTP+QLE+
Sbjct: 1261 SHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKPLPIAMPRMKEPPTKSTPVQLES 1320

Query: 1321 NEPSGSRLIIGSVHSKPEASEGTTSAAPKLRIKFKNRTLNNS 1363
            NEPSGSRLI+G V+SKPEASEGT SAAPKLRIKFKNRTLNNS
Sbjct: 1321 NEPSGSRLIVG-VNSKPEASEGTASAAPKLRIKFKNRTLNNS 1360

BLAST of CSPI01G06630 vs. TAIR 10
Match: AT1G73960.1 (TBP-associated factor 2 )

HSP 1 Score: 1512.7 bits (3915), Expect = 0.0e+00
Identity = 829/1423 (58.26%), Positives = 1018/1423 (71.54%), Query Frame = 0

Query: 1    MAKPRKPKNTD--DAKPPDNSGAVVRHQKLCLSIDIDNRRVYGFTELEIAVPDIGIVGLH 60
            MAK RKPKN +   AK  +N+GA V HQKL LSID   R++YG+TELE++VPDIGIVGLH
Sbjct: 1    MAKARKPKNEEAPGAKTSENTGAKVLHQKLFLSIDFKKRQIYGYTELEVSVPDIGIVGLH 60

Query: 61   AENLGIVSVSVDGDPTEFEYYPRPQHVENERSFKAVSSPSSAADAAGSIYLSSIEKELVP 120
            AENLGI SV VDG+PT FEYYP  Q+ E E ++ +VS P+SAADAA   Y+  +++E   
Sbjct: 61   AENLGIESVLVDGEPTVFEYYPHHQNSETESNWNSVSDPASAADAAAMEYVGVLKREDTA 120

Query: 121  NLLINCCKAFKSGSEQQDQPFLENGVQTADEDKQNVRLVRIDYWVEKSEVGIHFYNRMAH 180
            NLLINCCK  K  SEQ D   LENG Q++ E KQNV+L+RI+YWVEK E GIHF   + H
Sbjct: 121  NLLINCCKPSKDLSEQLDSVTLENGSQSSGEAKQNVKLIRINYWVEKIESGIHFDGNIVH 180

Query: 181  TDNQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTFVYR 240
            TDNQ+RRARCWFPC+DD   RC +DLEFTV  N VAVS G LLYQV+ K++  +KT+VY 
Sbjct: 181  TDNQMRRARCWFPCIDDEYHRCSFDLEFTVPHNFVAVSVGKLLYQVMCKEDTTQKTYVYE 240

Query: 241  VDIPVNARWISLAVGPFEILADHQNVLISHMCSPVNSLKLKHTVDFFHSAFSCYKDYLSV 300
            + IP+  RW+SL  GP EIL D  N LIS++C P +  +L++T++FFH A+S Y+DYLS 
Sbjct: 241  LAIPIAPRWVSLVAGPLEILPDQTNFLISNLCLPHDLSRLRNTMEFFHEAYSYYEDYLSA 300

Query: 301  DFPFGSYKQIFIEPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWF 360
            +FPFG YKQ+F+ PE+ V+S+  G S+ IFSSH+L+DE++IDQTIDTRIKLA ALA+QWF
Sbjct: 301  NFPFGFYKQVFLPPEMVVTSSTSGASLSIFSSHILYDERVIDQTIDTRIKLASALAKQWF 360

Query: 361  GIYITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCSVCRADDCGLTTLSS 420
            G+YITPE+PND+WLLDGLAGFLTD+FIK+ LGNNEARY+RYKANC+VC+ADD G   LSS
Sbjct: 361  GVYITPESPNDDWLLDGLAGFLTDMFIKQFLGNNEARYRRYKANCAVCKADDSGAMCLSS 420

Query: 421  SSACKDLHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSHAKDTGSTSQL 480
            S +C+DL GT  IG++GKIRSWKS A+LQMLEKQMG +SFRKILQ I+S AKD  ++ + 
Sbjct: 421  SPSCRDLFGTHSIGMHGKIRSWKSGAVLQMLEKQMGSDSFRKILQKIISRAKDPSNSIRS 480

Query: 481  LSTKEFRQLANKIGNLERPFLKEFFPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTA 540
            LSTKEFRQ ANKIGNLERPFLKEFF RWV S GCP+LR+G SYNKRKN VEMA  RECTA
Sbjct: 481  LSTKEFRQFANKIGNLERPFLKEFFQRWVASYGCPVLRIGLSYNKRKNNVEMAALRECTA 540

Query: 541  T---------PATNVENRDSDAGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHS 600
                        ++ E+RD DAGWPG+MSIR+YELDG+ DHP LPM G+ WQLLE+ CHS
Sbjct: 541  ALDARLSVIGATSDSESRDVDAGWPGIMSIRVYELDGMSDHPKLPMAGDRWQLLELPCHS 600

Query: 601  KLAARRLQKTKKGSKPDGSDDNAD-IPALDIRSSVESPLLWLRADPEMEYLAEIHFHQPV 660
            KLAA+R QK KKG KPDG++DN D I  L+ ++S+ESPL W++ADPEMEY+AEIH HQP+
Sbjct: 601  KLAAKRYQKPKKGGKPDGAEDNVDAIAPLENKTSIESPLAWIKADPEMEYIAEIHLHQPL 660

Query: 661  QMWINQLEKDKDVIAQAQAIATLEMLPQPSFSIVNALNNFLKDPKAFWRVRIEAALAMAK 720
            QMW+NQLEKD DV+AQAQAIA+LE L Q SFSIVNAL N L D K FWR+RI AA A+AK
Sbjct: 661  QMWVNQLEKDGDVVAQAQAIASLEALKQHSFSIVNALKNVLTDSKVFWRIRIAAAFALAK 720

Query: 721  TASEDTDWAGLLNLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRGTDQK 780
            TASE++DWAGL +LIKF+KS+RFDA+ GLPKPN+FRDFPEYFVLEAIPHA+A+VRG + K
Sbjct: 721  TASEESDWAGLQHLIKFYKSRRFDAEIGLPKPNDFRDFPEYFVLEAIPHAIAIVRGAEGK 780

Query: 781  SPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLL 840
            SPREAVEF+LQLLKYNDN+GN YSDVFWLA LVQSVG+LEF QQS+ FLA LLKRIDRLL
Sbjct: 781  SPREAVEFILQLLKYNDNSGNSYSDVFWLAVLVQSVGDLEFCQQSLTFLAPLLKRIDRLL 840

Query: 841  QFDRLMPSYNGILTISCIRTLTQIALKLSGLLSLDRIIELIRPFRDFNSMWQVRIEATRS 900
            QFDRLMPSYNGILTISCIRTL Q ALKLS  +S D I +LI PFR+ +++ Q+RIE +R+
Sbjct: 841  QFDRLMPSYNGILTISCIRTLAQTALKLSDSISFDHICKLIEPFRNSDTILQIRIEGSRA 900

Query: 901  LLDLEYHCNGIDATLLLFIKYLEEENSLRGQVKLAVHVMRLCQIMRRSGSNDVVNNDTLV 960
            LLD+EY   GI + LLLF+KYL EE+SLRGQVKL VH MRLCQI     S+D V+  TL+
Sbjct: 901  LLDIEYQSKGISSALLLFMKYLVEESSLRGQVKLCVHTMRLCQIAVGCDSDDCVDTVTLL 960

Query: 961  ALLLLLEGNMAFNNVYLRHYLFCILQVLSGRSPTLYGVPREYKTLHMGDTGTFSEQKRML 1020
             LL L + ++ FNN  LR+YLFCI Q+L+GR PTL+GVP+E K L + D     E K + 
Sbjct: 961  DLLHLFKSHVVFNNELLRYYLFCIFQILAGRPPTLFGVPKE-KPLQLVDVEACIEPKNVF 1020

Query: 1021 TSLIPEFNPPEPSSVSA---------VAP-----------MPCIPATLSSEPL------- 1080
              L+P     EPS  +          VAP           MP +P  +  EP+       
Sbjct: 1021 --LVPGAEAGEPSLSALGDAKGQSLDVAPYGVPIIPQEMFMPIVPELMLPEPVAAYDETQ 1080

Query: 1081 HVPTPRPDNLAVPE----LSKEEGAIAEDPKQAMAIVEA---------PREAASVSSSHE 1140
            H+  PR ++   P     +  E  +  E P + +A  EA           +  SVS SHE
Sbjct: 1081 HL-EPRMESQNQPSHENPIVHEIPSDVEGPTEELAHREANPPTKEPQKEPDVVSVSVSHE 1140

Query: 1141 RKLPVVKIKVRSSAATSRADADNLTTERSH--AAPRETDVGPSSSVSVDAPPRNTAEATS 1200
             K  V++IKVR S ATSRA+    T ERS       + D G +SS SVDAP R + +A S
Sbjct: 1141 VKKSVIRIKVRPSGATSRAEGSARTIERSQGIVVRHDIDRGQTSSASVDAPQRISTDAVS 1200

Query: 1201 ISNR-ILEEVNSCHDHGSHMTASIGSAKLASYGDELGKEFQCTADS-----SSRAFGHFQ 1260
            ISN+  +EEVNSCHD GS MTASIGS K AS GD  GKE QCTA+S     S +A  + +
Sbjct: 1201 ISNQNHVEEVNSCHDVGSRMTASIGSVKFASEGDIFGKELQCTAESGKPSTSQKADNNNR 1260

Query: 1261 PEDPSSSSIIQDNNIDADA-QKYASLQTLSLPQHDHGLASSQSRHGKKEKKKDKEKKRKR 1320
               PS   +  D++++ +A QKYASLQTLS+ +             +KEKKKDKEKK K 
Sbjct: 1261 TVPPSFLPL--DHSMENEAQQKYASLQTLSIGK-------------EKEKKKDKEKKEK- 1320

Query: 1321 ESHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKPQPTAMPRIKEPPTKSTPMQLE 1363
               K  R DP Y+E+KRLKKEKK+KEKEMAKL++    P    +  + E         ++
Sbjct: 1321 ---KRKREDPVYLEKKRLKKEKKRKEKEMAKLVSSTTDPAKKKIESVAE---------VK 1380

BLAST of CSPI01G06630 vs. TAIR 10
Match: AT1G73960.2 (TBP-associated factor 2 )

HSP 1 Score: 1475.3 bits (3818), Expect = 0.0e+00
Identity = 815/1423 (57.27%), Positives = 1002/1423 (70.41%), Query Frame = 0

Query: 1    MAKPRKPKNTD--DAKPPDNSGAVVRHQKLCLSIDIDNRRVYGFTELEIAVPDIGIVGLH 60
            MAK RKPKN +   AK  +N+GA V HQKL LSID   R++YG+TELE++VPDIGIVGLH
Sbjct: 1    MAKARKPKNEEAPGAKTSENTGAKVLHQKLFLSIDFKKRQIYGYTELEVSVPDIGIVGLH 60

Query: 61   AENLGIVSVSVDGDPTEFEYYPRPQHVENERSFKAVSSPSSAADAAGSIYLSSIEKELVP 120
            AENLGI SV VDG+PT FEYYP  Q+ E E ++ +VS P+SAADAA   Y+  +++E   
Sbjct: 61   AENLGIESVLVDGEPTVFEYYPHHQNSETESNWNSVSDPASAADAAAMEYVGVLKREDTA 120

Query: 121  NLLINCCKAFKSGSEQQDQPFLENGVQTADEDKQNVRLVRIDYWVEKSEVGIHFYNRMAH 180
            NLLINCCK  K  SEQ D   LENG Q++ E KQNV+L+RI+YWVEK E GIHF   + H
Sbjct: 121  NLLINCCKPSKDLSEQLDSVTLENGSQSSGEAKQNVKLIRINYWVEKIESGIHFDGNIVH 180

Query: 181  TDNQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTFVYR 240
            TDNQ+RRARCWFPC+DD   RC +DLEFTV  N VAVS G LLYQV+ K++  +KT+VY 
Sbjct: 181  TDNQMRRARCWFPCIDDEYHRCSFDLEFTVPHNFVAVSVGKLLYQVMCKEDTTQKTYVYE 240

Query: 241  VDIPVNARWISLAVGPFEILADHQNVLISHMCSPVNSLKLKHTVDFFHSAFSCYKDYLSV 300
            + IP+  RW+SL  GP EIL D  N LIS++C P +  +L++T++FFH A+S Y+DYLS 
Sbjct: 241  LAIPIAPRWVSLVAGPLEILPDQTNFLISNLCLPHDLSRLRNTMEFFHEAYSYYEDYLSA 300

Query: 301  DFPFGSYKQIFIEPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIKLAYALARQWF 360
            +FPFG YKQ+F+ PE+ V+S+  G S+ IFSSH+L+DE++IDQTIDTRIKLA ALA+QWF
Sbjct: 301  NFPFGFYKQVFLPPEMVVTSSTSGASLSIFSSHILYDERVIDQTIDTRIKLASALAKQWF 360

Query: 361  GIYITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCSVCRADDCGLTTLSS 420
            G+YITPE+PND+WLLDGLAGFLTD+FIK+ LGNNEARY+RYKANC+VC+ADD G   LSS
Sbjct: 361  GVYITPESPNDDWLLDGLAGFLTDMFIKQFLGNNEARYRRYKANCAVCKADDSGAMCLSS 420

Query: 421  SSACKDLHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKILQNIVSHAKDTGSTSQL 480
            S +C+DL GT  IG++GKIRSWKS A+LQMLEKQMG +SFRKILQ I+S AKD  ++ + 
Sbjct: 421  SPSCRDLFGTHSIGMHGKIRSWKSGAVLQMLEKQMGSDSFRKILQKIISRAKDPSNSIRS 480

Query: 481  LSTKEFRQLANKIGNLERPFLKEFFPRWVESCGCPLLRMGFSYNKRKNMVEMAVSRECTA 540
            LSTKEFRQ ANKIGNLERPFLKEFF RWV S GCP+LR+G SYNKRKN VEMA  RECTA
Sbjct: 481  LSTKEFRQFANKIGNLERPFLKEFFQRWVASYGCPVLRIGLSYNKRKNNVEMAALRECTA 540

Query: 541  T---------PATNVENRDSDAGWPGMMSIRIYELDGVFDHPVLPMTGESWQLLEIQCHS 600
                        ++ E+RD DAGWPG+MSIR+YELDG+ DHP LPM G+ WQLLE+ CHS
Sbjct: 541  ALDARLSVIGATSDSESRDVDAGWPGIMSIRVYELDGMSDHPKLPMAGDRWQLLELPCHS 600

Query: 601  KLAARRLQKTKKGSKPDGSDDNAD-IPALDIRSSVESPLLWLRADPEMEYLAEIHFHQPV 660
            KLAA+R QK KKG KPDG++DN D I  L+ ++S+ESPL W++ADPEMEY+AEIH HQP+
Sbjct: 601  KLAAKRYQKPKKGGKPDGAEDNVDAIAPLENKTSIESPLAWIKADPEMEYIAEIHLHQPL 660

Query: 661  QMWINQLEKDKDVIAQAQAIATLEMLPQPSFSIVNALNNFLKDPKAFWRVRIEAALAMAK 720
            QMW+NQLEKD DV+AQAQAIA+LE L Q SFSIVNAL N L D K FWR+RI AA A+AK
Sbjct: 661  QMWVNQLEKDGDVVAQAQAIASLEALKQHSFSIVNALKNVLTDSKVFWRIRIAAAFALAK 720

Query: 721  TASEDTDWAGLLNLIKFFKSQRFDADTGLPKPNEFRDFPEYFVLEAIPHAVAMVRGTDQK 780
            TASE++DWAGL +LIKF+KS+RFDA+ GLPKPN+FRDFPEYFVLEAIPHA+A+VRG + K
Sbjct: 721  TASEESDWAGLQHLIKFYKSRRFDAEIGLPKPNDFRDFPEYFVLEAIPHAIAIVRGAEGK 780

Query: 781  SPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLASLLKRIDRLL 840
            SPREAVEF+LQLLKYNDN+GN YSDVFWLA LVQSVG+LEF QQS+ FLA LLKRIDRLL
Sbjct: 781  SPREAVEFILQLLKYNDNSGNSYSDVFWLAVLVQSVGDLEFCQQSLTFLAPLLKRIDRLL 840

Query: 841  QFDRLMPSYNGILTISCIRTLTQIALKLSGLLSLDRIIELIRPFRDFNSMWQVRIEATRS 900
            QFDRLMPSYNGILTISCIRTL Q ALKLS  +S D I +LI PFR+ +++ Q+RIE +R+
Sbjct: 841  QFDRLMPSYNGILTISCIRTLAQTALKLSDSISFDHICKLIEPFRNSDTILQIRIEGSRA 900

Query: 901  LLDLEYHCNGIDATLLLFIKYLEEENSLRGQVKLAVHVMRLCQIMRRSGSNDVVNNDTLV 960
            LLD+EY                      +GQVKL VH MRLCQI     S+D V+  TL+
Sbjct: 901  LLDIEYQS--------------------KGQVKLCVHTMRLCQIAVGCDSDDCVDTVTLL 960

Query: 961  ALLLLLEGNMAFNNVYLRHYLFCILQVLSGRSPTLYGVPREYKTLHMGDTGTFSEQKRML 1020
             LL L + ++ FNN  LR+YLFCI Q+L+GR PTL+GVP+E K L + D     E K + 
Sbjct: 961  DLLHLFKSHVVFNNELLRYYLFCIFQILAGRPPTLFGVPKE-KPLQLVDVEACIEPKNVF 1020

Query: 1021 TSLIPEFNPPEPSSVSA---------VAP-----------MPCIPATLSSEPL------- 1080
              L+P     EPS  +          VAP           MP +P  +  EP+       
Sbjct: 1021 --LVPGAEAGEPSLSALGDAKGQSLDVAPYGVPIIPQEMFMPIVPELMLPEPVAAYDETQ 1080

Query: 1081 HVPTPRPDNLAVPE----LSKEEGAIAEDPKQAMAIVEA---------PREAASVSSSHE 1140
            H+  PR ++   P     +  E  +  E P + +A  EA           +  SVS SHE
Sbjct: 1081 HL-EPRMESQNQPSHENPIVHEIPSDVEGPTEELAHREANPPTKEPQKEPDVVSVSVSHE 1140

Query: 1141 RKLPVVKIKVRSSAATSRADADNLTTERSH--AAPRETDVGPSSSVSVDAPPRNTAEATS 1200
             K  V++IKVR S ATSRA+    T ERS       + D G +SS SVDAP R + +A S
Sbjct: 1141 VKKSVIRIKVRPSGATSRAEGSARTIERSQGIVVRHDIDRGQTSSASVDAPQRISTDAVS 1200

Query: 1201 ISNR-ILEEVNSCHDHGSHMTASIGSAKLASYGDELGKEFQCTADS-----SSRAFGHFQ 1260
            ISN+  +EEVNSCHD GS MTASIGS K AS GD  GKE QCTA+S     S +A  + +
Sbjct: 1201 ISNQNHVEEVNSCHDVGSRMTASIGSVKFASEGDIFGKELQCTAESGKPSTSQKADNNNR 1260

Query: 1261 PEDPSSSSIIQDNNIDADA-QKYASLQTLSLPQHDHGLASSQSRHGKKEKKKDKEKKRKR 1320
               PS   +  D++++ +A QKYASLQTLS+ +             +KEKKKDKEKK K 
Sbjct: 1261 TVPPSFLPL--DHSMENEAQQKYASLQTLSIGK-------------EKEKKKDKEKKEK- 1320

Query: 1321 ESHKEHRNDPEYIERKRLKKEKKQKEKEMAKLLNEEVKPQPTAMPRIKEPPTKSTPMQLE 1363
               K  R DP Y+E+KRLKKEKK+KEKEMAKL++    P    +  + E         ++
Sbjct: 1321 ---KRKREDPVYLEKKRLKKEKKRKEKEMAKLVSSTTDPAKKKIESVAE---------VK 1370

BLAST of CSPI01G06630 vs. TAIR 10
Match: AT4G33090.1 (aminopeptidase M1 )

HSP 1 Score: 74.3 bits (181), Expect = 8.3e-13
Identity = 76/313 (24.28%), Positives = 132/313 (42.17%), Query Frame = 0

Query: 176 MAHTDNQIRRARCWFPCMDDGLQRCKYDLEFTVSQNLVAVSNGILLYQVLSKDNPPRKTF 235
           MA T  +   AR  FPC D+   +  + +   V  +LVA+SN  ++ +   K N   K  
Sbjct: 132 MAVTQFEPADARRCFPCWDEPACKATFKITLEVPTDLVALSNMPIMEE---KVNGNLKIV 191

Query: 236 VYRVDIPVNARWISLAVGPFEILADH--QNVLISHMCSPVNSLKLKHTVDFFHSAFSCYK 295
            Y+    ++   +++ VG F+ + DH    + +   C    + + K  +         +K
Sbjct: 192 SYQESPIMSTYLVAIVVGLFDYVEDHTSDGIKVRVYCQVGKADQGKFALHVGAKTLDLFK 251

Query: 296 DYLSVDFPFGSYKQIFIEPEIAVSSACLGVSMCIFSSHLLFDEKIIDQTIDTRIK--LAY 355
           +Y +V +P      I I P+ A  +      +    + LL+DE+    +   R+   +A+
Sbjct: 252 EYFAVPYPLPKMDMIAI-PDFAAGAMENYGLVTYRETALLYDEQHSAASNKQRVATVVAH 311

Query: 356 ALARQWFGIYITPEAPNDEWLLDGLAGFLTDLFIKKNLGNNEARYQRYKANCSVCRADDC 415
            LA QWFG  +T E     WL +G A +++ L         +   Q    +    R D  
Sbjct: 312 ELAHQWFGNLVTMEWWTHLWLNEGFATWVSYLATDSLFPEWKIWTQFLDESTEGLRLD-- 371

Query: 416 GLTTLSSSSACKDLHGTQCIGIYGKIRSWKSVAILQMLEKQMGPESFRKIL-QNIVSHAK 475
           GL   S     +  H  +   I+  I   K  ++++ML+  +G E F+K L   I +HA 
Sbjct: 372 GLEE-SHPIEVEVNHAAEIDEIFDAISYRKGASVIRMLQSYLGAEVFQKSLAAYIKNHAY 431

Query: 476 DTGSTSQLLSTKE 484
               T  L +  E
Sbjct: 432 SNAKTEDLWAALE 437
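
The percentages in each HSP header above are the identity and positive counts divided by the alignment length. The following is a minimal Python sketch, not part of the database output, that parses such a header line and recomputes both percentages. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, chosen only for illustration; the regular expression assumes the header layout shown in this record.

```python
import re

# Matches lines such as:
#   Identity = 1271/1362 (93.32%), Positives = 1308/1362 (96.04%), Query Frame = 0
# "Pos\w+" also tolerates the misspelling "Postives" that some exports contain.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP statistics line and recompute its percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        # Recomputed values: count / alignment length, as a percentage.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the header, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 1271/1362 (93.32%), Positives = 1308/1362 (96.04%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positive_pct"])
```

Comparing the recomputed and reported values is a quick consistency check when these records are scraped in bulk.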

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
Q8LPF0          0.0e+00   58.26     Transcription initiation factor TFIID subunit 2 OS=Arabidopsis thaliana OX=3702 ...
Q5ZIT8          2.3e-92   28.99     Transcription initiation factor TFIID subunit 2 OS=Gallus gallus OX=9031 GN=TAF2...
Q6P1X5          2.3e-92   29.05     Transcription initiation factor TFIID subunit 2 OS=Homo sapiens OX=9606 GN=TAF2 ...
Q8C176          2.5e-91   28.84     Transcription initiation factor TFIID subunit 2 OS=Mus musculus OX=10090 GN=Taf2...
Q32PW3          3.1e-89   27.15     Transcription initiation factor TFIID subunit 2 OS=Danio rerio OX=7955 GN=taf2 P...

Match Name      E-value   Identity  Description
A0A0A0LQC8      0.0e+00   99.71     Transcription initiation factor TFIID subunit 2 OS=Cucumis sativus OX=3659 GN=Cs...
A0A1S3C357      0.0e+00   97.50     Transcription initiation factor TFIID subunit 2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1ED82      0.0e+00   93.32     Transcription initiation factor TFIID subunit 2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1KGW7      0.0e+00   93.10     Transcription initiation factor TFIID subunit 2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1C691      0.0e+00   91.85     Transcription initiation factor TFIID subunit 2 OS=Momordica charantia OX=3673 G...

Match Name      E-value   Identity  Description
XP_004137463.1  0.0e+00   99.71     transcription initiation factor TFIID subunit 2 [Cucumis sativus] >KGN64105.1 hy...
XP_008456406.1  0.0e+00   97.50     PREDICTED: transcription initiation factor TFIID subunit 2 [Cucumis melo]
XP_038894650.1  0.0e+00   94.28     transcription initiation factor TFIID subunit 2 isoform X1 [Benincasa hispida]
XP_023519716.1  0.0e+00   93.39     transcription initiation factor TFIID subunit 2 [Cucurbita pepo subsp. pepo]
KAG6584373.1    0.0e+00   93.32     Transcription initiation factor TFIID subunit 2, partial [Cucurbita argyrosperma...

Match Name      E-value   Identity  Description
AT1G73960.1     0.0e+00   58.26     TBP-associated factor 2
AT1G73960.2     0.0e+00   57.27     TBP-associated factor 2
AT4G33090.1     8.3e-13   24.28     aminopeptidase M1
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                                  Source       Source Term   Source Description                              Alignment
None       No IPR available                                 COILS        Coil          Coil                                            coord: 1272..1297
None       No IPR available                                 MOBIDB_LITE  mobidb-lite   disorder_prediction                             coord: 1233..1323
None       No IPR available                                 MOBIDB_LITE  mobidb-lite   disorder_prediction                             coord: 1124..1143
None       No IPR available                                 MOBIDB_LITE  mobidb-lite   disorder_prediction                             coord: 596..615
None       No IPR available                                 MOBIDB_LITE  mobidb-lite   disorder_prediction                             coord: 1261..1293
None       No IPR available                                 MOBIDB_LITE  mobidb-lite   disorder_prediction                             coord: 599..613
None       No IPR available                                 MOBIDB_LITE  mobidb-lite   disorder_prediction                             coord: 1098..1143
None       No IPR available                                 MOBIDB_LITE  mobidb-lite   disorder_prediction                             coord: 1..20
None       No IPR available                                 MOBIDB_LITE  mobidb-lite   disorder_prediction                             coord: 1335..1362
None       No IPR available                                 CDD          cd09839       M1_like_TAF2                                    coord: 25..507; e-value: 2.02945E-119; score: 381.192
None       No IPR available                                 SUPERFAMILY  55486         Metalloproteases ("zincins"), catalytic domain  coord: 263..516
IPR014782  Peptidase M1, membrane alanine aminopeptidase    PFAM         PF01433       Peptidase_M1                                    coord: 282..480; e-value: 5.9E-13; score: 48.9
IPR027268  Peptidase M4/M1, CTD superfamily                 GENE3D       1.10.390.10   Neutral Protease Domain 2                       coord: 259..511; e-value: 2.0E-31; score: 111.2
IPR042097  Aminopeptidase N-like, N-terminal                GENE3D       2.60.40.1730  tricorn interacting factor f3 domain            coord: 21..253; e-value: 4.5E-23; score: 84.5
IPR042097  Aminopeptidase N-like, N-terminal                SUPERFAMILY  63737         Leukotriene A4 hydrolase N-terminal domain      coord: 12..254
IPR037813  Transcription initiation factor TFIID subunit 2  PANTHER      PTHR15137     TRANSCRIPTION INITIATION FACTOR TFIID           coord: 20..1294
IPR016024  Armadillo-type fold                              SUPERFAMILY  48371         ARM repeat                                      coord: 443..977

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CSPI01G06630.1    CSPI01G06630.1    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis
biological_process  GO:0006367      transcription initiation from RNA polymerase II promoter
biological_process  GO:0006413      translational initiation
cellular_component  GO:0005669      transcription factor TFIID complex
molecular_function  GO:0003682      chromatin binding
molecular_function  GO:0008237      metallopeptidase activity
molecular_function  GO:0016251      RNA polymerase II general transcription initiation factor activity
molecular_function  GO:0000976      transcription cis-regulatory region binding
molecular_function  GO:0003743      translation initiation factor activity
molecular_function  GO:0008270      zinc ion binding
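
For downstream use, the GO assignments listed above can be grouped by category. The following is a minimal Python sketch under the assumption that each row is whitespace-separated as category, accession, then term name; the helper `group_go_terms` and the `GO_LINES` constant are hypothetical names used only for this illustration and are not part of the database.

```python
from collections import defaultdict

# GO rows copied from this record: category, accession, term name.
GO_LINES = """\
biological_process GO:0006508 proteolysis
biological_process GO:0006367 transcription initiation from RNA polymerase II promoter
biological_process GO:0006413 translational initiation
cellular_component GO:0005669 transcription factor TFIID complex
molecular_function GO:0003682 chromatin binding
molecular_function GO:0008237 metallopeptidase activity
molecular_function GO:0016251 RNA polymerase II general transcription initiation factor activity
molecular_function GO:0000976 transcription cis-regulatory region binding
molecular_function GO:0003743 translation initiation factor activity
molecular_function GO:0008270 zinc ion binding
""".splitlines()


def group_go_terms(lines):
    """Group (accession, term name) pairs by GO category."""
    grouped = defaultdict(list)
    for line in lines:
        line = line.strip()
        if not line:
            continue
        # Split on runs of whitespace; the term name keeps its inner spaces.
        category, accession, name = line.split(maxsplit=2)
        grouped[category].append((accession, name))
    return dict(grouped)


terms = group_go_terms(GO_LINES)
for category, entries in terms.items():
    print(category, len(entries))
```

Because the split is on runs of whitespace, the same helper works whether the columns are separated by single spaces or padded for alignment.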