Csa1G084260 (gene) Cucumber (Chinese Long) v2

Name: Csa1G084260
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: DNA-directed RNA polymerase; contains IPR009010 (Aspartate decarboxylase-like domain), IPR015700 (DNA-directed RNA polymerase III largest subunit)
Location: Chr1 : 7983188 .. 8003774 (-)
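
The location string above can be split into chromosome, start, end and strand. The sketch below is illustrative only: `parse_location` and `span_length` are hypothetical helper names, and the span is computed on the assumption that the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(text):
    """Split a 'Chr : start .. end (strand)' string into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1


chrom, start, end, strand = parse_location("Chr1 : 7983188 .. 8003774 (-)")
length = span_length(start, end)
print(chrom, strand, length)
```

The `(-)` strand means the feature is read from the reverse strand of Chr1; the span length itself does not depend on the strand.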
   



The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGTGCTGGTATTGGCATCAATTTTTTTTTGATAAGTTTATGTATATATATATTCATATTTTGCATAAGGTACAGATACCAATGGTTATTGAACTTATGCTTTATGATTCTATTGTAGGTTCGGTGAAGAAGGCTGTATCTATGTTGGGCATACTGCATTATCGTGCGAGATCCAAGGATGCTGGGGTGGTATCAGAGGATCTTAGGGCACCCTATAATGTGTCTAACGATATTCTGAACCCTTTTAGAGTTCTTTGTCTTTTCCAAAGGATGTCAGATGAGGTGTATATTGATAATCTTTAGGGTCTTGGTTTATTGTGAGGCACCCACCCAAGGCACTTGCTTTGGATGTGGTGGCCGTTGCAATTAGTTATGTATTGTATAACACTTCTCTACGAGAAACATGTTATTATCTTTTTAGTTTTGTTCTTTCCATTTTTATCCTCTTCACATATTTTTTTGTGCAGGACTGTGAGTTGCTTTTTCTATCCAATAGACCTGAGAAACTCATAATTACTAATGTTCTAGTGCCTCCTATAGCCATTCGTCCTTCTGTAATCATGGATGGTTCTCAAAGGTTCTTTTCTTTTTATTCCCCTCCATTGAACTACAGTGAAATCTATGTTTTTTATTGATTTTTTTTGTAATTGAAGACTTCATTATTTTTTCCGAAGGAAACAAAATTTTTCATTATTGAAGAGAAAACATCCTATTGCAATCACTTTGTAAGCCATCTGAAGGTGGTATTTATAAATGATTCTATTTAATTTGATTTTTCTTGCTGCAGCAATGAAAATGATATAACTGAGAGGTTAAAACGAATTATTCAACAGAATGCTAGTGTCAGCCAAGAGTTATCAACATCAAACTCACAAGCTAAATGCCTGGTACACTTTGGACTTTGGTACTTCTCTTTTAAGTGTGTGTGTGTGTACAAACACATTATTTCATTACCTTTGCATTTCTTTCAAATAATAAACATTATGTTAAAGTATTTGGATTCGTTGTAACCTATTAGCTTGAGCGTTTGGATTATTTATTTATTTTTCTAAAAATATAATTACTTGTTGCTCCTTGGGCGAAATTTCTAGCCACATGTGAGGGGAGTTTTAAAGGATTTAAATTTAATCCATCAACTTAAGCTTTTGAGTTCGTAGTGATTACATTACTTTGGTTAAGGTAATCTTTTGTGAAGCTTTTTTATTCTCTACATCAATTGGTGAAAAAACCCAAACAAACCAGGTTGGGCCATTTTTGGGAAATGATTAAGATTGTCAAAGAAAATTTGAAGGAAGAGAGAGTATATACTGAGGAAGTAGCATGCTTGTGTTGTAAATGATTTAAAATTACTTTTAAGGTTCTTTATTGTGATAAAAATCCATTTTTTTGAAACAAGTAACACTTCTCATTGATGTAATGAAATCAAATAAAAAATTTAGAGATACAAACTCCCAAAAGGAGAAAAAGAGAAGATGCAAACAACAAATACAACCCAATATAGATAAGTATTTTACGACTTCGACATACATGCCTCCTTCAAATCCGTCAAATCCTTCAAATTCTTCAAATCCTTCAAATAATAATTCATTTAATTGGTGGATCGGTCTTTTGTATATCTTTCACCAGTTTCTTAAAGAAAAGAAAAAGGTCCATTTAGTTGATGGTATTAAATTACAAAAGTGAGAGTGAAAACCCCATTCCAAGGGAGTTGCACAAAACTTTTCCAATTAGATAAAAGAGGAAAATGTATAGGAATTTTCTTTTAAATTGGAAACAGAAAATTTTCATTGAAAAAAATGAAAAGAGTTTAATGGTTGAAATTACGAATTAAGAATATGTATAGTAATGGGAAATATGGAGGGCTTTTATGTTAGGTAATAGAAAAAAAAGAAGAAAAAGAAAAGAAACATAAGTGTCCAAAAACTGATTAATGTCCAAAAACTGATTAAAGGGAAGCTCCATATCTTCAAAAGAATCTTTGATTCCTTTCCTTCGAAG
TCGGCCTGTAAAATGTCCTAAAAATTGCATCCAGACTTTTTGTAAGGAGGACCTATGTCTGAGGAGGCACGTCGGGTCATTAGATTGCAAGAAGCATGTGTACAATCTGGATGAGTTCTCTTCTCTTCTTTTGTATATTTCTTGGGAAGTATTTGGCTGGTAAGTTTTCAAGCATGAATGAAAAATTCTCCTCATCAGTGGAGAGCAAAGATCTCTGGTAAAGATTTATCTGGGTTGGATGTTTTTGAAGCAGCCGATTGCCTCTTCTTGGTGATCCCTCCTTTCAAATTGTAGCATGTGGAATAAAAAATTTCAGTCTGAAATGGACGCTTTTCTACAAAGTGTTTGTATTTTCTTTGTATTAAGCTCTTAACTAATGTTTCAATTCTTTGTTGCTTTGGTTTTTTTTTGTTGTTTCTCCTTTGTGATCGTTTTCCCTTTTGTGCTGCCCGAGTTCATCTTTGTTTAGTTTGTTTGGATGATATTGTCTTTTGCTTATTTTGCTCTTAGTATAATACTCTTGTACTTTGAACCTTAGGCTTGTCTCTGTTTAAAAAAAAAAAGAAAATTTCTCCTTTCTTGTATAAAAAACAGACAAATATCAGTAAAGTTACAAGAATATATTTTTTTTCTTGATTCTATTTGTGATATTTATTTTAACTTGGAATCTAGGTAATGTTGTAGATAGTCATATGAAGGGTATAATTATGCTTCATCTCATGCAATACTTTTTCTTGTCTAGGAATCTTGGGATATGCTTCAGAGCGAAGTTGCGCAACTAATAAATAGTGATGTCCGTGGTATTCCATTTTCAATGCAAGTTAGCAAACCGTTGGCTGGTTTTGTTCAACGTCTCAAAGGAAAGCAGGGACGGTTTAGAGGAAATTTGTGTGGGAAGCGTGTTGAGTTTACGGGCAGAACTGTCATATCACCTGATCCTAATCTGAAAATTACAGAGGTCAATATTATATGCTCTTTTTTGACTACTTATGAAGGGTACCGTGTCAAAAGAAAACTAGGCATATTTCTGATTTTTATATGAAATACCTATGGTATTGTAAATATAGTTTCTTAATGATTGTTGGAAATTAGAAGGCTTTCCTTTGTTAGATTGGAGGGCAATCTCTACCTTTCGTCACTTGCATTGTTTCTTTTGGGTTCTTTTGTGTAATAATATGTATGTCTGTTTCTTATAAAAAATAGTTACATTTGGGTTATTTTGTAATATTGTTTCAACTTATTTTTGTTCACTTTGTCTAGGTTGCAGTCCCCATACACATGGCTCGAATTTTAACTTATCCGGAACGTGTTACCAGACACAATATAGAAAAGTTGAGACAATGTGTCAGCAATGGTCCTGATAAATATCCTGGTGCAAGGATGCTTAGGCATCTTGATGGTTCTATGAGGTTACAAATTTTACAATCATAATTTTATTATGTGCGAACTTTGCTATTAATTAGTTGTTTGGGATGTTCAATGTCTTCTTATTTGCATTGTAGGTCATTGATGATTTCAGGTAGAAAGCGGCTTGCTGATGAACTGAAATATGGTGAAATAGTTGAACGTCATCTTGAAGATGGAGACGTTGTTCTTTTTAACAGACAGCCTAGTCTACATCGAATGTCTATCATGTGTCATAGGGTGACTAAAATGTCTAAGAAGTTTTTGAATGATTACTCTTTGATATTTGCTTATTACGGTATCAGATTTTACCATGTCATGGTTGCAGGTAAGAGTTATGCCTTGGAGAACGTTGAGATTCAATGAATCTGTTTGCAATCCCTATAATGCTGATTTTGATGGTGATGAGATGAATATGCATGTTCCACAAACAGAGGAGGCTCGCACAGAGGCAATTCTGTTGATGGGGGTAAATTCAAATGGAAATCCCTATATTTTTTTATTTATTCACCTAGGGCGCATTGCTGGTTGATTTTAGGAGGATTAGAGGTGATTTGAAAGAATCTGAACCGTTTGGTAAAAGAAATTGTGATCAA
CAATTAAAATCACTTTTAAACCACACATTCAAGGTATGTTTGAAATACATTCTCAAGTGTTTAATTTAAAAAACAAGTCATTTTAAAAAAAATTGGAGTGTTAACCCCTCAAATAGCTATTGAAGTGTATTTTTAAACAATTTTTTATGAAAAAAATCTAAATAAAAGTGAGTTTTTCGAAAAACATTTTTTTCTTAAATTAATCCAAACGGGCCCTCAATTTTCATAATTTTTTAAAATTAGTTTCTATTAACTTTTAAATTCTTGTAAAATTATTTTTTAATAATTTTCAGTTTTATCTTTCAAAATTAAGATTTATTTTATTATATCTAGTATCTTTCTCTCATAAACAAAATTTATTCCCTCATCTTTCTTAGTTTATTTATTTATTTTTATTATTAAATAAATCACTTTCAACCATCTGGATTGATCTAGCGGTAAGAAAGGAGACATAGTCTCGTTAACTAACTAAGAGGTCATAAGTTTAATCCATGACAATCACTATTTTATGAATTAATTTTCTACGTGTTTTCTTGACATTCAAATGTTGTAGGGTCAGACAAGTTGTCCCATTCTTAAAGTGATTGATTCTAAAAGCACTTTTAAAACACAACCCTCATTTTTCATAGTTTGATGCAACCATTTTCTTAAAATCATACCATATTAAAAATTTATTTTTATTTTATTTTTTAAAATTAAAGCCTTTACCCTTAATACTTATCAACTAATCTAGAGAAAACAACATATAACATTTCAAACAATTATGTAAAAACTATGAATTATAAAGAAAATTGCAATTTTTTAAGAAGAAAAATGCAAATTTTAATAGTAGGTTTATTTTAGTTATTATAGCTAAAAACACAACAATATAGTATCTCTTTATCAAGTTTTTGATGAAATGTTTCTTACCTAAAAAAAATAGTATAGCATTTATTTACCAAACACTATACTATAACCAATGTGTTTAGATCAGAAGCTAGAGTTTTCCAAACGCTAATTTGTATCATATTAAAGCTCATTGTTCTAAACGTACAGTTGTTAATTTTAAAGTATATGTATAGAAGGCATTTCTCTCTAAGATCATGGACTCGTATCTCAAATAGACCAATGTAAATTTTAAATTAGGAAAACATTCTTTCCCAAATCTTTTCAGCTTGATGATATTAAGAGGAAATTTTAAGTTTAGTCATTTTAACCATCTATGACATGTCATGTTCTGCTTTCCAAGCCTTCATTACATCCTGTTGTTTGGATCTAGTTGGTTTGTTTTTATCCAGCGGTGTGATCATCGTTCAATATGCCGTGTATAAACATTGTTATCTACTTGGACCTTCTGAGATTATTTTTGCGGTTTTGTGATTGGTAATGGATAAGCTCAGTTCATCAGAGTGTTTCTGTTTATTATCGAATGGTTTCCTTTTGGTATTTGAGTTGATTATCCCTCTTTGGTTCAGTATTTCACTCACGACTGTTTTAAAAAAATGTAGTTTCTGTGTGCAGGTGCAAAATAATCTATGCACTCCCAAAAATGGGGAGATTTTAGTTGCTTCAACTCAGGATTTCTTGACATCATCTTTTCTCATAACAAGAAAAGACACATTTTATGACCGTGCAGCATTTTCTCTTATGTGTTCTTACATGGGAGATGGCATGGATTTGGTTGACTTGCCTACACCTGCTCTGGTCAAAGTGAGTTGATGATTTATTGTTTACCTTTGAATAGCAGTGACTAGATATGTTATTGTTCATGTTACAATATCCATTTACTATTTGTGTTCTCTGGCTTGTGTTATCAATCTTTATTGCAGATATTTCAATTAAAAGTCGTTATTCTTGGATTAGTGTTATGGTTAAGACATTTACTCGTACTCCTTGTGGGCTGGTGGTTCAAACCTTCACTCATTCATTTGTTGTTCAAATTCCTAAAATAAAAAAAGAAAAAGAGATCCTTGAAATGTATTCATCATTGTATCTATATGTATATGTATATCTTTTTTTTC
CTTTTATATTCTTGAGTGTTCAAGCTAGCTTACCACTTCGGCTAATCTTACGGGACAACCCGCCTGATCTTACAACTTTTGGGTGTTATGGAAACAAATATTTAATAAAAAACAAATAAAATTTATGAAAATCTTCTTATATAGCTCAACAAGCTCAACTGATTAAGAGACAGTCTTGATCAAGCGATTAGAGTTTGAATCCCCCACCTCCATCTGTTGAGTTCTAAAAAGAGACACTTATGTCATATTCTTATCCTTAGGATAGACTCATTGTTATATTATTGTTTGTGCCTCACAAATGGTTCTCTATTTTTTTGTGCTATTGAATTTATCTGATTTTTTGCTCATTCAATCACAAAATATAAATGTGGTATCTTAGCAAGATTTTTCCACCTTAACATGGTGGTTTTTCTAAATTAGGTTGGTATCTGTTGCAGCCTATTGAGCTTTGGACTGGCAAACAATTATTTAGTGTCCTGGTTCGCCCACATGCAAGTATGAAGGTTTATCTGAATCTTACAGTTAAGGAGAAAAGCTACAGTAAAGTCAAAGGAAATGAAAAAGAACGAGAAACTATGTGTCCTAATGATGGATTTGTCTACTTCCGAAATAGTGAGCTAATATCAGGCCAAGTTGGGAAGGCTACTTTAGGTAATTTTATTATGTAATTCTTAGTTTCTATAGCGGCAACAAAAACAAATAACCCTTGATATTTCTGCTTATGCTGTATCTCATTATAACTTCACGTCGTCTGCGTCTTTATTGGATGAGTCGTCTGCGTCTTTGGTAGCGTTTGAGGAACAAGTTGATTTCAAAATTTTAGAACTTAAACTTCACTTGATGAAGTCATTTTTGATGTTTACAAATCTGTCTCTATTCCAGGCAATGGCAACAAGGATGGGCTTTATTCTGTTCTACTTAGAGATTACAAGGCACATGCTGCTGCTGTTTGCATGAATCGTCTTGCGAAGCTGAGGTAATTGGTAACCGTAACAGGTTCAGTGCTCTTGCACCCGATTTTGTAAATTGTTTGCATGTGTTGGATTTTTGTTTTTTCCCCTCAGTTTTTCTTATTACAGCGAGGATGAGAAAACCAAACCTTTGCTTGAAAGAATATAAGAGGCATTAAAAAAGAAAGCCCACACCAAAGAGAGCCAAACTATATCTATAACTTAGAGTAACTATTAAAGATCTCAAAGATTGAAATTAGAAAAAATCTCAAAGATTGATGAAGCCTGAAAGTAGACATGGAACGTGGAAAGGACCACACCTCTCCTCAGGTCTCTCAATCCCTTGAAAATTCTGTTGTTCCTTTTGTTAAAAGCGATAAAGTAAAGAGACACCAAGTTATGTGGAAACTCGAGTACTGGGAGAAAAAGCAAGATTTATTTTTCTTATTAATTTCTCATATTAACAAATGATACAAGAGAGGGGATAAATAGGTTACAACATATGATAAAAAAGAAAAGGACATTAGGGTAAATCCTTCCTTGGGCCAAGCCCACTAAATCTAACACCTTTCTTTCCGACTAAAAGACTTCACAAAATTGCATAAATCCAGCACGCCAGTAATCTCTTGGACTCATTAAAGAGTGGGTGAAGAAGGAGCTCTCCAGTCAGATAGCTACACCTAATAGTGGGTGAATAATGCCTTTATGTTGCTCTCGTAACTTAATTATGTCTTAATGTTCCTTTTATTTCTTGTTATTACTTTTTAATCTTACTTCTCCTATTGGGGATGCACTTTCCTGAAGAGGTGCTTTCATTTTGTCTATTCTTATTCTCACTGCTTTTTTCTTCTTTCTTTTGAACAATTGATAATTGATTATTGATAAATGAGAGAGAGAGAGAGAGAGTTCCTCTGCCGTCTCTCGTGTTGATGGCCTCCGATGCTGTTTCAAAAGTGTCTTTGTGTTATGTTTCTGATGTGGGATGCCAACATTTCATCCGATTATTTCTTCATCCCCGACTGACTTCATCCTCTCCGATCATCTCCTCAC
TGAACGACTATGGGGACATCAAAACGGATTTCGACAAAAATCGAGAATAAATCCTTCTCCTGCGAGTTTGATCCGAATAGTAGAGGAAGGGTGATACGCCTAACCGAAGCTCATTTGAATAAGTCTTATACCCTTTTTGGAGAAGAATCTCTTTTTGGCTGGCTGGCTGATTCCTTGGAGGAAGTTCTGCAGACCCCGAAGCCCCAAAAATTTTTCAGAAAGACACTGTAATGGCGGTTTCACCTGGATCCAAAAGACGACAAACAAGAGAGGAAGCTTCTTGGAGATTTCGAAGGTGCTGAAGTCGGGGAAAAAAGGAAACATAGTGGTCCCAGCTGGTGTTGATTTAAAAGGATGGGAAAGTTTTAGGCGCTTACTGATGGATTTCTTGAATGACGAAGTCGACTCTAGGGCCCTCCATTCTGAGAAGAAATTAATCCATAAGCCTTCCTTTGGAAGCAGACGGGAATCTCATTATCAGCTTCGTCATCCTTTGCGAATCTCTAATCATGGAGGGTATCCAAGGGTTTCTCATAGTCAAGAGGAAGGCAAAAGCAAAGGAGGAAGAAGAGTTAGAAAGATTTGCTGGGAGGACACTCTGGTGGTCACTAAGCGCGATTTCCATGACGATTGGCAGAGGATTTTGGATGTCCTTCAAAATCAAACACAAAAAACCTTTATTAGTAATCCGTTCCACGCGGACAAAGCTTTGCTCAAATGCCCTGATCAGACGATAGCTAGACTTCTCACGATGAACAAGGACTGGGCTACCTACGGCCCTTTAACCTTGAAAATGGAATGGTGGGATCCAAAAAAACATGGGAGAATGGCGCTTGTTCCATCTTATGGAGGATGGATCAAACTAAGAAACTTTCCCTTGCATCTGTGGAACGAGGAGGTGTTTAAGAGGGTTGGTGATAAGTTAGGAGGTTTCATTGAGTTTGCGGAAGAAAATTCGAGCCTCATTAACTGCTTGGAAGTGAATATCAAAGTTAAAGGGAGCTACTGCGGCTTCATTCCGGCGGACTTTGAGTTGATAGAAGGGCCCGATTCTTATCTTGTGCAGGTGGTTTCCTTCCAAGATCCCAATCTTCTTATTGATAAAGTCGCCGGAATCCATGGATCCTTCTCGCCGGAACAAGCTGAGAAGTTCTTCAAAGGTCCGGGTGGCCCGGATCCGAATCCGATTGATATTTGGTGTGTGGAGAAGTCGAACTCAAGGCTGGCAGTTTTCGCTTTTCAAAATTCTGAGATGGTGAGTACGGCGACAGAAGAAACTGAGAACTTTTGTCCTTCTCTGCAAACTGACGGAGAGACTAGAGGAACAAATTCAAATTTACCGCCCAAGCAAACCCTAATGGACTTGACTAATAGATTGGGCTTGACCAACGAAGTGGGCCTCATTACAGATGTGGGCCTCCCTGATTTAAAAGGGAAAGGCATAGCAGTGGATGAAGTTCTTATTCAACAGCCCACTAATATTAAAAAGCAAACTGGGCTTTTTATTAGAGAAAACTCCCAAGCTACCCGTGCCAACCAACCCCTTTTACTAAAATTACCCAAGCCCACCAAAATAGTTTGCAGAAAAGGAAGAAATTCGACCTTTCTTTAAAAACAGTGTGATCTGCTCCACAAACCAACCTGACGGCAAGGGAGGCCCTTCAACCTTAACGTTCACATCAGACTGCATTCATGGAGAAATAGATCCAACTTTGGACGTTCCTATTTCTAGCCCGGGCAGTCTCATATCTGAAAATAAATTTGCTCCTCTTGAAGCCGACGACGTGCAGCACATCTCGGATTTTTCTTCCGACGTCCAAAGACTCTTCCACGAGGTAAGCACCACGGAAAGAGGCAGGGAAGAGGGTCTCCAGATAATAGCTGTTGAGGATACCAATGAGCTTCCTTTTTCCAATGAAGAATTAATTGAAATTGAAAGCGTAACCGAGTTATATAACTATTTGTACTTTTGAGCATTAGACTCTTTTCATTTTGTCAAT
TAAAAGTGTCCGCTCTATCCTTTCAAAAAAGAGAAACAAAAACAAAACCACAGAGCTACACTTTCTAGGCTGGGTTAGATGAAAACTGAAATCCTGAAAGAAGCAGCTCCAAATAAAAAAAACAGTCTCAGCTCCCTTAAAAGGAACCCTAAATGGAAAATTGGTGTGAAGTAGTCTTTGTTCTATCCAAACACCATGCCAAACAAATCAGAGCTCCCATTTCTCACTTTGCAGAAGATATTTTTGTAAATATGGCCATTGGCTTAAGAATGAAGTTCTTGGACTTTTGGAGAAAACTTTTGATTTTGATTTGAGATCAAAATGGTTAATGTGGAATTTGGCTCCTATCACTTTTCTCCAAGGTATCGTTTTCTAATTGAAATTCCAAATTCATTTGGCTAAGAGCCCTGTTCTGAAGTTTCAAGTCAATGATGCCAATCCCACCTTGCAACAAAAAGCTTTTAACCTAATTGCATTTTAGGAGGTAATAGATCCCTTTACCCCTAATCTGATATGAAAACTGTGACTGGAGATTCTGTCATGTCCCTTTTTACTGAAAATCTTGGAGAGCCCTCATCTGCACCTTAGGTATGATGGAGTAGGCCAGCGTCTGGAATTTGCTCCAGGTGAGGCAGAGGAGCCAGGGTAGTCGATGTGAAAACTGTGAATGGAGATTCTATCATGTGCTTCATGTCTTCTTTTACTGGAAATCCTGGGGAGCTCTCATCTCTAACTTTTCTGGATTTGTTCTTCGGTCCTATTGCTAATTATTTTACCTCCATTTTCTTTTTGTTTCGTGACACTAAGTTTAGTCTAGTAATTGGTGTCTAATATGTCTGATTTATGTTATATATGGTCAGATATTGTTCATCTTTTGGCAGCTGCATCTACAAGATCAATAAATCCATGTTGTTTAACTAACTTAAAAGTTATTTATTAGTTATTCATAGGTATTGAGTGCAGTCTTTGTTTTAAGTTGGACTGGTTATGTTGGAAGTTTGACTGCTTAGAGGTAGCTGCCAATTGATTAGCAATTATACTTGTTTCATATAAAAAGTGAATGGTTAAAATGTAATAATTTTGTTGACATTAATGGTACTATTAAAAATATTTGCACAAATGCAGTGCTCGGTGGATAGGTAACCATGGTTTCTCTATTGGAATTGATGATGTCCAACCAGGAGATCAATTGGTTAAAAAAAAACAAACAACTATATTGGAGGGTTATCGTGACTGTGATAAGCAGATCAATTTATTCAACACAGGAAATTTACCTCCTGAAGCTGGTTGTGATGCTGCTCAAAGTTTGGAATCTAAGATAACTCAAATTTTAAATGGTATTCGGGAGGCCACTGCAAATGTAAGATTGATTAGTTTATTTCCGTTCTTATATTATAAGTCCTATCATCTGCCATCATCATGTCATGAATCTTACAGTGTATAAAGAATAACATATCATGCAGTGACTGATATATTGTTCCTGAGGGATAAATGTTGTCTTCATATAATTTGTTGACTGTTTTTTCTTTTTTGAAATGGAGACAAATATAGTTTGTTGACTGTTTATACCATTACTTGTACTTCTTTTGAAGTTTGTTTTAATTCTAAATCATGTTTGTTATGCAATCCACTATATCTGATCTTGGAATTTTCTTCTGCAGGTGTGCATGCAAAACCTACATTGGAGAAATAGTCCGTTGATTATGTCTCAATGTGGCTCCAAAGGATCTCCTATTAACATTAGCCAGATGGTTGCTTGTGTTGGTCAGCAGTCAGTTGGGGGCCGTCGTGCGCCAGATGGATTCATTGATCGGAGCCTTCCCCATTTTCGTAGGAAAGCAAAAACTCCGGCCGTGAGTTTTGCATCTTCTTCTCTGCCTTATTTGGAAAACTTAATTATTATTATTATCTTTCGTTTTTAACCGAAAACAAAATATTTGCCTTGAAGAAATGAAAAGAGACTAATCCTAAAAGATACAAACCTCCAAAAAGATTGAAAG
AAAGAGATAAAACAAAATTATATGAGAAATATAGAAGCATTCAAACAAATATCTCGCAGCAGCTTGGTGCTCTCTAAACAAGGAGTTTGTGAATTACTCCACTCAAGATCTGTCTCAACTGGGCTGTTTTAGCTCTCAACCAGCCCTAGGCCCTGTTCTGTTTTGGTGACTTTGTTTTGCATTTGGCAGCTTCATTTTCCTCTCTGGCCAGCCTTCTTTGGAGCATAGTTTATGTTTTCTACACAGTAGCAGCTTTTGCTTTTGTTTATAGACTCAGCCTTTGTACTACTGTGTTTGCCCCATTCCCCTTCTTTATTACTTTGTCTCAACTTTCTGTATTAGGCTGGACTATGTCTCCCTTGTATTTTGGCATATTTTATTTCTAGCTTTAATTAGGACATGTTGTTTTGGTGCTATGAGGGTGTCAACCTAATTGAGATGTCCAAGTGCACCTTCTAATCCCCCTCAATTATTGCTCTCTAGGCTTCTGTATTACGCTCATTGTATAACTCTCTTGTACTTTGAGTTGTTATTGTTAATAAAGTAGTTTGTCCTCGTTTCAAAAAAAATATCTTGCAATGGAAAACCAATAAAATGATTAGAAAGCACCAATGAGAAGCCTTTAGCTTAGTTGATGGATCAAATCAATGCCATTGTTTATTTTAAAAAATCCATTTATTTTTCTTAATTGGATTCCAAGGCTATATGGAATCAATACCCCAAAAAGATTGAATTTTTTTGTGGGAACTTGCACAGAAGCTCGTCCTTACTCTTGACACAAGACAAGCTACAACATCGTCTACCATCTTTGGTGATCTCCCCTAGCTGGTGCAATCTTTGTAAGATGGATAATGAAACTCCAAGCCATCTCTCTTAATATTCTACCCTTTCTCAAAAATATTTTGGACCCACATTCTTTACGCTTTAGTTGGTCTCTGGTCTTTCCTTCGGATATCACTAGCTCCTATCTCTTGTGATCAAGGGACATCCATTCAAGGGTAGAAAGAGTCTCCTTTGGATGCACGGCTCTGTAGAGCCGTTTTGTGGTTGGTATGAATTGAACGTAATAGTTGTCTCTTCAATGACAAAGCACAACCCTACGACCTTTTTATTGAAGAATGTAAGAGACCTCTTCTAGAGGAAAGTTCACTGATATTAATTAAAGGCGGTGTTACAATATCACTCCACTGATCGAGAGAGTGGCAGGCTCCTTCAGTTGGCTAAACCAAAATAAACCGAAAAGCCTCCCTAAACATTACCCAAAATACATTTATATATTAACCCACTCACCTACTACCCTTCCTGACGTAAGTGGGTGGATTTCCCCTACTACCCCTCCTAACATAAGAGTGTATTACTGGAGGCCTAACAAACATGTTCTCCACCTTGCTATCTCTTGGCATGAATTGTCTAAATTATTCCATTCTTAGTGTTTTGATTCACTTTTAACCAAATGGAGATTGAGATGTCGTTTGTAACTCCTTTGGTTTGGGGCTTCTTTCCCCCTTCCCTTTTGTAAGTTCATAAATCAATGAAATTCTCTTACAAAAAAAAAAAAATCAATGTAATATTAATAGCCTTGGAAGAACTGATCGCAATTGTAGGGTGGAGTAATTCATTTTTGTTACCAAGATCATTTTGCTCTTTATTTGCCACTAATTTGGTATTGACATAACACCTGTAGTTGCAATAAGAAGAGAACAAATAGTATTTGACTAGAATTTTGAAGAGATAAGTTATTATATTTTCTTCTATTTTTTATATCAAATACAAATCCTTCTTATACGATAGAATGACCATCAACCAAAGATATATGACAAAGTTGTGCTTAATACTTAGCATATATTACAAAAGATTGTAAGCAAAATTCGACAAAGATAAATGATACAACTTTTGGTCCTCTCCCCATTAGATGCTCCAGAGCCCTTCAAAGTTTCCTTCGGCCAAGCTCCAATCAAATCTTCATCATTAGGCTCTTTCTCCTCATTTGCCCTCCACTC
ATCAACTAGATCGGGGTCAAGATTAAAACTGAAAAGAAAGAAATCTCCAAAGGTCTTCCTGCATTGTCTCAAGCCCTTTTTAAAGTATTTTATTTGAAGGAAAAAATCAAGGAAGATCTCATTTAAGCGTTCCTTAAGAGCCCATATTCTTGAAGACCAAGTTCCAGACTTGTTAGCGGCAAACTGTGTCTTTGCTCAGCCCTTAGTTCATTCCAGCAGTTTCAAGGAGTTTTTAGAGACTAATCCAGATTTATTGGAAGTTAGTTGTACCCAATTGCAAGCTCTAGCCCGTTCTGGAAAGTCAGATTCTAATTGCTCTCATCCCATGCAGTTTAAGTATGCTGAGTTTTCTTTACTCAACTCAAAGGTGAAATTCTTGAGGGGCTCCCCGAGCAAAACCCCTCTAGAAAAAGGGGAAGTCCTTTTCCGACTCTTCTTTCAGCATTAGTAGTGCAGGAATGGAGCAGTTCGATAAGACAACCGAGCATGAGGAGCAGAGACATTTTAAGCCTCTAGAAACTGATCTTAATGCCCTCTTCCAGTGCGAGGAAGACCCAATTTTTGAAAAACTGCCCCCTGCTACTCATCCTTTTCCCAGCTTTGCAAAATACTAGATAATCTAAAGTCAGTAGTTGCTGATTGCGCTCTTATTCTAGTATGATCCTTCAGACATCCATTGTAGTCCCTGTGGAACTATTGCCGCCTTTCCCAGATTTCTCTCGTGTGGGTGGATGAGTTTGAGTGTTGCTGTTTGTGGTAGTCTGCTTGATATTGTTTTTTTGTTTTGGCCTTCGTTTTTATGGCTTGCCAAAGGGGAATATTTTCTGGGGAGTTCTTTTGGAATCAGTTTCTGCTTGTTTTGGAAGATTTATTCGAGTTCCTCTGGGGTCTGGATGGCACTGACAAGGACAGTTTTGAGGTGGCTGACTGCTTCTCTTTCTCCTCTCATATTGAAGATTTGGGTGTGCAAATTATTATATTCGAATGGACTCCTTTTGAGAGAGTTGTCATGTTTTGCTTTGTATTTAGCTTTCTACGGATGTTTCGCCTTGGAAAGGCTTTATTCTGTTTGTTACTTTTCTGTTTTGACTTTGTTTGTTCAGGTTGCCTTCTTTTTGTTTCCTAAGTTTTCACTTTAGTTTGTATTCTCTCCTAGTTCTTTTGCTCTTAGTTCGACACTCTTGTACTTTGAGCTTAAGTCGTATATATTATCATTAATAATGAGGCTTGTCTCCGTTTCAAAAAATAAATCAGACAATAACATTTTTTCTTCAGCTTCTTTGTCAAATTGATAATTATGTAACATATAACTTAATGGTTAAGTTTTCTACATTAAGCAAGATATTTCATGTTGATTTTTTAAATCTATTTATTTTATTTATTATTTGACAAATAACTGTAATGCAGGCCAAAGGCTTTGTTGCAAATTCATTCTACAGTGGCTTGACAGCTACAGAGTTTTTCTTTCACACGATGGGAGGACGAGAAGGCCTTGTGGATACAGCGGTAGGGCTAACATCCATTAGCATTATAATTACTCAATGTTTATAGGAAATTTAGTTTTACATTTATTTTCCCATTGTTTCTAGGAAATAAGCATTTCAAACTGAGTGAATAAATTATTCAATGAGATGTTGACTCTGATTGTGTCACAAATAATAACAAATAAGGAAAAGTGCTACAATAATGCATAAAGGAATATGTGAGAATGATGCATCTCATTCTGTTGTATGAATAGCATACATGGAAGGGCTAAATTAGTAATAAATTTACGGTAATGGAACTTTTTTCTTTTTTTTACTAGAGACAAGAGACAAATCTTTTTATGAATATATGATAAAAAAGAGTGAAAATGTAAAAATAAAGAGACATACAAAGAAATAGGAAATCACAGACAATCTTAAAAAAACACAATTGGTAGTTTGCAGGTTTTATTGTTACTTTAAGATCTTAACAGAAAAATTCGTTAGTTTTAAGAAAATTATAAAGTTAGAATAGAG
GACAAAAGATTGGTAAACTCCTATTTATAGATATACACTATAACTGAAAGAGTATCTTTTTTCCTTCAATATATTCCCAAAATTTTTAATTAATCAATACTTTTTATTATCAATTTAAACTTCCTGCTTGATCCTTTAAAGCTTATTGATTATAAACTTTTCTTATGAAGTTGGTCAGTGTGTTACTTGATTAATATTACTTGGTAGCTTATCTCAGCCTTGACTATTTTGCCAATCCTCAAGATTGAAGCTGAAGATTATTATTTATAAATACATCGCATAAGTAGGGATATAAACTATTAGATTAAATATTACTTACAAATAAGATTAAGCTTAAACACGAGTAAACTAAGAAGCACAAACGAAGACACAGACACTAGTTGATTCAAATGCACTGAGTCCATCAAAATGTGAGTATCTCGTATATACAGGTCATGATCACTTCGTCACATACCAAAACTGAAGTGTCTATGTGTCATAACTTGTCAACAAATTAAACTTTTAGAACCTGGTGAATCTTTGAAGACTAATCCGAGTGTTGGGCCCCTTTTAAAGTGCTAGAGCAGTAGAGTTGGTCATTTCTTGAGCAATATAAAATTCGGGTTAGTCTTGAATTCCTAATGGGACGAGTGTAAAAGTATAGTATTAGACTGAATTTTGAGATTCGTTATTGTTGAGATGAGTGAGTTGAAAGTTATTTTATTATTTATTTATTGTTTTTTCTTTGTGTAAAACTGAGTAAGTTGAAAGTTTTGAGGTACATGTTGAGAATAGTACTTTTCAGGTGAAAACAGCCGACACTGGTTACATGTCTCGTAGATTGATCAAAGCACTGGAGGACTTATCAATTCATTACGACAGCTCTGTACGAAATGCTGGTGGTTGTATAGTTCAATTCTGTTATGGAGATGATGGAATGGATCCAGCACAAATGGAAGGAAAAAGTGGAGCACCTTTGAATTTTGAGCGGCTGTTTTTGAAAGCCAAGGTTTGGAAGCATATCCCATATGCGACATTTTTTTCTTTTCTGCAAAGTTATTACATTTTGGATATCAGCATAGTAAAAGTATCATTTTCGTTTTTACAGGCTACCTGTCCAAGTGACGGAAACAAAATCCTGTCTCCATCAGAATTTTCTGAGACGGTAGAAGACAGGCTCTCAAAAGATGATGCTTCTCCTGAGTGTGGTTGCTCTCCAGCATTTGTTGGTTCTCTCAAGATTTTCCTCAACAAATACGTTGAAGCACAAAAAAAATCATGGGGCACATTGTTGGCAGATAATGAATCAGCTGTAGACAAAAGCATCATCAGTAGTTCTGATAATGACAATATTGTGATTCGGAATAAGGTAGTCCAGAATATAGCTGGTGTCACACACAGACAACTTCAGGTAGTTTTACAATGTTGCATAGTTTTCTTCATTATCTTCACTTCATGTTGTTCTACCAAGGAGTAGTTTATTAGATTGTACCTTGGGAAAGATTATAAAATAACAGTCTTCTCTTTTACTGGAAATTACTTTTGATTTGGGTTAATTATAGGTTTTTTTGGACACTTGCTTGTCCCGCTACCATACCAAAAAAATTGAAGCTGGAACTGCCATTGGTGCCATTGGAGCTCAAAGTATTGGCGAACCTGGGACACAAATGACATTAAAAACTTTTCATTTTGCAGGAGTTGCTAGCATGAGTATCCTTTGGTTATTTTTTCTTTTTCCATCTTGTTTTCAATGTTAACATATTCTATACAATATTTGGGCGGTCCTTGGATATTTTTGGTTTCTCTTGTGATCTTTTAACTTGTTGATAGACGTTACACTTGGTGTTCCACGTATCAAAGAAATTATAAATGGAGCCAAAAGGATCAGCACTCCAATTGTCACTGCAGCACTTACGCATGATGATAATGTCAATATTGCTCGGATGGTAAAAGCTCGAATTGAAAAAACAAATCTAGGACAGGTATATAATTAAACTTGGACTATACCATTTTACTTTGGT
ATTATGCTTTTTCCTTATTAGTTTTTCATTTTTTTTTAATGAAAAACATCACTTTCATTGAGAATAAATGAAAGAAACACACGGGCATTCAAAAAAGCAAGCCCACAAAAAGGGAGCTCTCTCTAAAAGAAAGGACTCCAACTAAACAAGATAGTATCAATCAAATAGTCACAGAAGGTTTACGAAATCGAAGCCCACAAAGAAATGTGATAATGAACTAAAGACCAAATATCCTCAAGATCCCTCTCCACACCACACCCCTAAACATCCTACTATTAGGCTCGCCCCATAACACCCAATTTGAACAGAGCCCTGCCATCCATAAAAAGCATCCTTCTCTGGAAAAGGCGAGAAGAGGAGGAACTCTTCAATCATAGCACTAGCTTCTTTGTAACGAACATGCTCAAGTTTAATCAACAAATCTTTTATTAAAAGAAATATTGTTTAAGTCTGAACTCAAACGTTGAATATGAGGATTGGGTTGAAATAAAATTCCATGTATGGGTTTGGAAGAGGGGGTTTGGGTGCTTGAAATAAGATTTCCTGGATTGGGTGCTTGGTGTTCTCTGTCCAAGCATTTTGCTGATTTTTCTGTTCAAGATTTATGTTTGAATTGGAGTGCTTTTATTTTCTCGGTTTAGTTTCTGGTTGTGTTCGGAATTTTTAGATTTTGTTTTGGTTGAGTTCATTTTTTTTTTACTCATTGTATTTTTAGCATTGAACTCTTTTCACTGTATCAATAAAAAGTTATGTTTCAACGAATTCATTGTTTTCTTTAATTTTTTTTTAATGAAAATTGTTAAATTTGATAGTATGGGGTTCCATCTCAAAACCAATTGGCACATAAAGGAGTAATCCATCTATGTTATATAGAATATGAGTCGCCTTGGTTTTCCATTGTGGGTCTCTCAACTTCTCCAATATGCCCCCTCAAGATGGTGCCTCGTTGGGTTCACCTATCTTGGATCGAATACTCGTTTAGGTTTAATGGGCTCTGATACCATATTAAATTTGATAGTATGAGATTCCATCTCAAAACCAATTGACAATGAAAAGAGTAACCCATCTATCTTATATAGAGTATGAGTCCCCTTGGTTTTTCAATGTGGGATTCTGAACTTCTCCAATAAAAATAAGACTTTTCAGTAATGAAATGAAAAGAGAGTCCAATGATCATAATACAAAGAAACATAATAAGCAAAGTAACGAAAATATAACCATAAGAAGCCCTTAACTAGACGAAGTCAGAAAAGTAAAGCAATTTTAATCAAGTAACCCACAACGGAACAAATTGGCTCTTGGAACAAAAACCATTGAACCATGTGTAGCAAAGACCATGCGTTGTGAAAAAAGACCAAAACTTCAAAAACAAATGTAGACCGAAACTTCAACACTGCTTCAGAAATCTTGAGACACAAGATCTTTCCAACTGAAAAATGTTCCATCAAAGGCAAAATTTGTAACTAAGGCTTTACAGCTTTGGCAACAAGGATGCTTGAAAATGAAGCCAAAGCAGACTGGACTGCTTGGATCTCAAACAAAGCACCAAGATGAAAGATTATCCAGCAGCAATTTTGCCCAAGAGAATAACTGCTTCAGGTATATGAACTCGAATGACCTCTTGAACATGAAAAGGATGCTTCACTAATCGCTTGGAAGGGGACCTCTCAACCCCTTGCCCTTAGGCCCGTGTTCCTCTTTTGACTTGTTTGAATATACTATTTTGTTTCCTGTTAGAAAAGAGAGAAAAAAAATACCTCCTTTATAGATGCAAACATAATTCAGCCCCTTTTCAACATAAGAGACGCTGTAGCTATCAATTTCATAAAATACAGCAGCAACTTGCAATTAAGCGATAGCCATCTCGCTCCACAAGACTAATGATTTTCATCTGAAGAAGACTTTAAACTTTATATTCTGAAATCATATCAAGTATTTTCTTCCTTTTTTTGTGGTGAGCCTAATGGTTTTCATCTGAACTGATCTTTTTTGGTATCTCT
CTATTGATCAGATTGCCAAATGCATCCAAATTGTAATGAGTTCAAGATCAGCTTTAATAGAAATCAAACTTGACATGGAAAAAATCAGAGATGCAGAACTGTACGTAGATGCCAATGTTGTCAAACAGGCAATTCTCGTTACTCCAAAACTGAAACTGAAACATGAGGTGAGTTAAATTATTGATTCTTTCACTCATTTGACCAATTTCAATTCCATTTTATCTCAAATACCACATTGTTTTGCAGCATATCAATGTTTTGGATGATAGAAAGTTACGTGTTCTTCCTCAAGATGCAGATAGGAATAAACTTCATTTTAATCTACACTTTCTTAAAAACATGCTTCCTGGAGTTGTTGTAAAGGTGTGCATACTAATTTTTGTAATCAGTATAGACCAAAGTTTTGTTTTGTTTTGTTTTTTTTCTTTTAACGTTTAGGAGATACATAACAAACTTCTGTATGTCAGGGTATAAAAACTGTAGGGCGTGCTGTCATCAAAGAGGAAAAAGACAAAGCAAGAAACGCCAAAAAGTTCAGTTTGTTGGTTGAAGGGTTAGTTTATTTCTTTTTCCTGCTTCAAATTTGA

mRNA sequence

ATGAGTGCTGGTTCGGTGAAGAAGGCTGTATCTATGTTGGGCATACTGCATTATCGTGCGAGATCCAAGGATGCTGGGGTGGTATCAGAGGATCTTAGGGCACCCTATAATGTGTCTAACGATATTCTGAACCCTTTTAGAGTTCTTTGTCTTTTCCAAAGGATGTCAGATGAGGACTGTGAGTTGCTTTTTCTATCCAATAGACCTGAGAAACTCATAATTACTAATGTTCTAGTGCCTCCTATAGCCATTCGTCCTTCTGTAATCATGGATGGTTCTCAAAGCAATGAAAATGATATAACTGAGAGGTTAAAACGAATTATTCAACAGAATGCTAGTGTCAGCCAAGAGTTATCAACATCAAACTCACAAGCTAAATGCCTGGAATCTTGGGATATGCTTCAGAGCGAAGTTGCGCAACTAATAAATAGTGATGTCCGTGGTATTCCATTTTCAATGCAAGTTAGCAAACCGTTGGCTGGTTTTGTTCAACGTCTCAAAGGAAAGCAGGGACGGTTTAGAGGAAATTTGTGTGGGAAGCGTGTTGAGTTTACGGGCAGAACTGTCATATCACCTGATCCTAATCTGAAAATTACAGAGGTTGCAGTCCCCATACACATGGCTCGAATTTTAACTTATCCGGAACGTGTTACCAGACACAATATAGAAAAGTTGAGACAATGTGTCAGCAATGGTCCTGATAAATATCCTGGTGCAAGGATGCTTAGGCATCTTGATGGTTCTATGAGGTCATTGATGATTTCAGGTAGAAAGCGGCTTGCTGATGAACTGAAATATGGTGAAATAGTTGAACGTCATCTTGAAGATGGAGACGTTGTTCTTTTTAACAGACAGCCTAGTCTACATCGAATGTCTATCATGTGTCATAGGGTAAGAGTTATGCCTTGGAGAACGTTGAGATTCAATGAATCTGTTTGCAATCCCTATAATGCTGATTTTGATGGTGATGAGATGAATATGCATGTTCCACAAACAGAGGAGGCTCGCACAGAGGCAATTCTGTTGATGGGGGTGCAAAATAATCTATGCACTCCCAAAAATGGGGAGATTTTAGTTGCTTCAACTCAGGATTTCTTGACATCATCTTTTCTCATAACAAGAAAAGACACATTTTATGACCGTGCAGCATTTTCTCTTATGTGTTCTTACATGGGAGATGGCATGGATTTGGTTGACTTGCCTACACCTGCTCTGGTCAAACCTATTGAGCTTTGGACTGGCAAACAATTATTTAGTGTCCTGGTTCGCCCACATGCAAGTATGAAGGTTTATCTGAATCTTACAGTTAAGGAGAAAAGCTACAGTAAAGTCAAAGGAAATGAAAAAGAACGAGAAACTATGTGTCCTAATGATGGATTTGTCTACTTCCGAAATAGTGAGCTAATATCAGGCCAAGTTGGGAAGGCTACTTTAGGCAATGGCAACAAGGATGGGCTTTATTCTGTTCTACTTAGAGATTACAAGGCACATGCTGCTGCTGTTTGCATGAATCGTCTTGCGAAGCTGAGTGCTCGGTGGATAGGTAACCATGGTTTCTCTATTGGAATTGATGATGTCCAACCAGGAGATCAATTGGTTAAAAAAAAACAAACAACTATATTGGAGGGTTATCGTGACTGTGATAAGCAGATCAATTTATTCAACACAGGAAATTTACCTCCTGAAGCTGGTTGTGATGCTGCTCAAAGTTTGGAATCTAAGATAACTCAAATTTTAAATGGTATTCGGGAGGCCACTGCAAATGTGTGCATGCAAAACCTACATTGGAGAAATAGTCCGTTGATTATGTCTCAATGTGGCTCCAAAGGATCTCCTATTAACATTAGCCAGATGGTTGCTTGTGTTGGTCAGCAGTCAGTTGGGGGCCGTCGTGCGCCAGATGGATTCATTGATCGGAGCCTTCCCCATTTTCGTAGGAAAGCAAAAACTCCGGCCGCCAAAGGCTTTGTTGCAAATTCATTCTACAGTGGCTTGACAGC
TACAGAGTTTTTCTTTCACACGATGGGAGGACGAGAAGGCCTTGTGGATACAGCGGTGAAAACAGCCGACACTGGTTACATGTCTCGTAGATTGATCAAAGCACTGGAGGACTTATCAATTCATTACGACAGCTCTGTACGAAATGCTGGTGGTTGTATAGTTCAATTCTGTTATGGAGATGATGGAATGGATCCAGCACAAATGGAAGGAAAAAGTGGAGCACCTTTGAATTTTGAGCGGCTGTTTTTGAAAGCCAAGGCTACCTGTCCAAGTGACGGAAACAAAATCCTGTCTCCATCAGAATTTTCTGAGACGGTAGAAGACAGGCTCTCAAAAGATGATGCTTCTCCTGAGTGTGGTTGCTCTCCAGCATTTGTTGGTTCTCTCAAGATTTTCCTCAACAAATACGTTGAAGCACAAAAAAAATCATGGGGCACATTGTTGGCAGATAATGAATCAGCTGTAGACAAAAGCATCATCAGTAGTTCTGATAATGACAATATTGTGATTCGGAATAAGGTAGTCCAGAATATAGCTGGTGTCACACACAGACAACTTCAGGTTTTTTTGGACACTTGCTTGTCCCGCTACCATACCAAAAAAATTGAAGCTGGAACTGCCATTGGTGCCATTGGAGCTCAAAGTATTGGCGAACCTGGGACACAAATGACATTAAAAACTTTTCATTTTGCAGGAGTTGCTAGCATGAACGTTACACTTGGTGTTCCACGTATCAAAGAAATTATAAATGGAGCCAAAAGGATCAGCACTCCAATTGTCACTGCAGCACTTACGCATGATGATAATGTCAATATTGCTCGGATGGTAAAAGCTCGAATTGAAAAAACAAATCTAGGACAGATTGCCAAATGCATCCAAATTGTAATGAGTTCAAGATCAGCTTTAATAGAAATCAAACTTGACATGGAAAAAATCAGAGATGCAGAACTGTACGTAGATGCCAATGTTGTCAAACAGGCAATTCTCGTTACTCCAAAACTGAAACTGAAACATGAGCATATCAATGTTTTGGATGATAGAAAGTTACGTGTTCTTCCTCAAGATGCAGATAGGAATAAACTTCATTTTAATCTACACTTTCTTAAAAACATGCTTCCTGGAGTTGTTGTAAAGGGTATAAAAACTGTAGGGCGTGCTGTCATCAAAGAGGAAAAAGACAAAGCAAGAAACGCCAAAAAGTTCAGTTTGTTGGTTGAAGGGTTAGTTTATTTCTTTTTCCTGCTTCAAATTTGA

Coding sequence (CDS)

ATGAGTGCTGGTTCGGTGAAGAAGGCTGTATCTATGTTGGGCATACTGCATTATCGTGCGAGATCCAAGGATGCTGGGGTGGTATCAGAGGATCTTAGGGCACCCTATAATGTGTCTAACGATATTCTGAACCCTTTTAGAGTTCTTTGTCTTTTCCAAAGGATGTCAGATGAGGACTGTGAGTTGCTTTTTCTATCCAATAGACCTGAGAAACTCATAATTACTAATGTTCTAGTGCCTCCTATAGCCATTCGTCCTTCTGTAATCATGGATGGTTCTCAAAGCAATGAAAATGATATAACTGAGAGGTTAAAACGAATTATTCAACAGAATGCTAGTGTCAGCCAAGAGTTATCAACATCAAACTCACAAGCTAAATGCCTGGAATCTTGGGATATGCTTCAGAGCGAAGTTGCGCAACTAATAAATAGTGATGTCCGTGGTATTCCATTTTCAATGCAAGTTAGCAAACCGTTGGCTGGTTTTGTTCAACGTCTCAAAGGAAAGCAGGGACGGTTTAGAGGAAATTTGTGTGGGAAGCGTGTTGAGTTTACGGGCAGAACTGTCATATCACCTGATCCTAATCTGAAAATTACAGAGGTTGCAGTCCCCATACACATGGCTCGAATTTTAACTTATCCGGAACGTGTTACCAGACACAATATAGAAAAGTTGAGACAATGTGTCAGCAATGGTCCTGATAAATATCCTGGTGCAAGGATGCTTAGGCATCTTGATGGTTCTATGAGGTCATTGATGATTTCAGGTAGAAAGCGGCTTGCTGATGAACTGAAATATGGTGAAATAGTTGAACGTCATCTTGAAGATGGAGACGTTGTTCTTTTTAACAGACAGCCTAGTCTACATCGAATGTCTATCATGTGTCATAGGGTAAGAGTTATGCCTTGGAGAACGTTGAGATTCAATGAATCTGTTTGCAATCCCTATAATGCTGATTTTGATGGTGATGAGATGAATATGCATGTTCCACAAACAGAGGAGGCTCGCACAGAGGCAATTCTGTTGATGGGGGTGCAAAATAATCTATGCACTCCCAAAAATGGGGAGATTTTAGTTGCTTCAACTCAGGATTTCTTGACATCATCTTTTCTCATAACAAGAAAAGACACATTTTATGACCGTGCAGCATTTTCTCTTATGTGTTCTTACATGGGAGATGGCATGGATTTGGTTGACTTGCCTACACCTGCTCTGGTCAAACCTATTGAGCTTTGGACTGGCAAACAATTATTTAGTGTCCTGGTTCGCCCACATGCAAGTATGAAGGTTTATCTGAATCTTACAGTTAAGGAGAAAAGCTACAGTAAAGTCAAAGGAAATGAAAAAGAACGAGAAACTATGTGTCCTAATGATGGATTTGTCTACTTCCGAAATAGTGAGCTAATATCAGGCCAAGTTGGGAAGGCTACTTTAGGCAATGGCAACAAGGATGGGCTTTATTCTGTTCTACTTAGAGATTACAAGGCACATGCTGCTGCTGTTTGCATGAATCGTCTTGCGAAGCTGAGTGCTCGGTGGATAGGTAACCATGGTTTCTCTATTGGAATTGATGATGTCCAACCAGGAGATCAATTGGTTAAAAAAAAACAAACAACTATATTGGAGGGTTATCGTGACTGTGATAAGCAGATCAATTTATTCAACACAGGAAATTTACCTCCTGAAGCTGGTTGTGATGCTGCTCAAAGTTTGGAATCTAAGATAACTCAAATTTTAAATGGTATTCGGGAGGCCACTGCAAATGTGTGCATGCAAAACCTACATTGGAGAAATAGTCCGTTGATTATGTCTCAATGTGGCTCCAAAGGATCTCCTATTAACATTAGCCAGATGGTTGCTTGTGTTGGTCAGCAGTCAGTTGGGGGCCGTCGTGCGCCAGATGGATTCATTGATCGGAGCCTTCCCCATTTTCGTAGGAAAGCAAAAACTCCGGCCGCCAAAGGCTTTGTTGCAAATTCATTCTACAGTGGCTTGACAGC
TACAGAGTTTTTCTTTCACACGATGGGAGGACGAGAAGGCCTTGTGGATACAGCGGTGAAAACAGCCGACACTGGTTACATGTCTCGTAGATTGATCAAAGCACTGGAGGACTTATCAATTCATTACGACAGCTCTGTACGAAATGCTGGTGGTTGTATAGTTCAATTCTGTTATGGAGATGATGGAATGGATCCAGCACAAATGGAAGGAAAAAGTGGAGCACCTTTGAATTTTGAGCGGCTGTTTTTGAAAGCCAAGGCTACCTGTCCAAGTGACGGAAACAAAATCCTGTCTCCATCAGAATTTTCTGAGACGGTAGAAGACAGGCTCTCAAAAGATGATGCTTCTCCTGAGTGTGGTTGCTCTCCAGCATTTGTTGGTTCTCTCAAGATTTTCCTCAACAAATACGTTGAAGCACAAAAAAAATCATGGGGCACATTGTTGGCAGATAATGAATCAGCTGTAGACAAAAGCATCATCAGTAGTTCTGATAATGACAATATTGTGATTCGGAATAAGGTAGTCCAGAATATAGCTGGTGTCACACACAGACAACTTCAGGTTTTTTTGGACACTTGCTTGTCCCGCTACCATACCAAAAAAATTGAAGCTGGAACTGCCATTGGTGCCATTGGAGCTCAAAGTATTGGCGAACCTGGGACACAAATGACATTAAAAACTTTTCATTTTGCAGGAGTTGCTAGCATGAACGTTACACTTGGTGTTCCACGTATCAAAGAAATTATAAATGGAGCCAAAAGGATCAGCACTCCAATTGTCACTGCAGCACTTACGCATGATGATAATGTCAATATTGCTCGGATGGTAAAAGCTCGAATTGAAAAAACAAATCTAGGACAGATTGCCAAATGCATCCAAATTGTAATGAGTTCAAGATCAGCTTTAATAGAAATCAAACTTGACATGGAAAAAATCAGAGATGCAGAACTGTACGTAGATGCCAATGTTGTCAAACAGGCAATTCTCGTTACTCCAAAACTGAAACTGAAACATGAGCATATCAATGTTTTGGATGATAGAAAGTTACGTGTTCTTCCTCAAGATGCAGATAGGAATAAACTTCATTTTAATCTACACTTTCTTAAAAACATGCTTCCTGGAGTTGTTGTAAAGGGTATAAAAACTGTAGGGCGTGCTGTCATCAAAGAGGAAAAAGACAAAGCAAGAAACGCCAAAAAGTTCAGTTTGTTGGTTGAAGGGTTAGTTTATTTCTTTTTCCTGCTTCAAATTTGA

Protein sequence

MSAGSVKKAVSMLGILHYRARSKDAGVVSEDLRAPYNVSNDILNPFRVLCLFQRMSDEDCELLFLSNRPEKLIITNVLVPPIAIRPSVIMDGSQSNENDITERLKRIIQQNASVSQELSTSNSQAKCLESWDMLQSEVAQLINSDVRGIPFSMQVSKPLAGFVQRLKGKQGRFRGNLCGKRVEFTGRTVISPDPNLKITEVAVPIHMARILTYPERVTRHNIEKLRQCVSNGPDKYPGARMLRHLDGSMRSLMISGRKRLADELKYGEIVERHLEDGDVVLFNRQPSLHRMSIMCHRVRVMPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAILLMGVQNNLCTPKNGEILVASTQDFLTSSFLITRKDTFYDRAAFSLMCSYMGDGMDLVDLPTPALVKPIELWTGKQLFSVLVRPHASMKVYLNLTVKEKSYSKVKGNEKERETMCPNDGFVYFRNSELISGQVGKATLGNGNKDGLYSVLLRDYKAHAAAVCMNRLAKLSARWIGNHGFSIGIDDVQPGDQLVKKKQTTILEGYRDCDKQINLFNTGNLPPEAGCDAAQSLESKITQILNGIREATANVCMQNLHWRNSPLIMSQCGSKGSPINISQMVACVGQQSVGGRRAPDGFIDRSLPHFRRKAKTPAAKGFVANSFYSGLTATEFFFHTMGGREGLVDTAVKTADTGYMSRRLIKALEDLSIHYDSSVRNAGGCIVQFCYGDDGMDPAQMEGKSGAPLNFERLFLKAKATCPSDGNKILSPSEFSETVEDRLSKDDASPECGCSPAFVGSLKIFLNKYVEAQKKSWGTLLADNESAVDKSIISSSDNDNIVIRNKVVQNIAGVTHRQLQVFLDTCLSRYHTKKIEAGTAIGAIGAQSIGEPGTQMTLKTFHFAGVASMNVTLGVPRIKEIINGAKRISTPIVTAALTHDDNVNIARMVKARIEKTNLGQIAKCIQIVMSSRSALIEIKLDMEKIRDAELYVDANVVKQAILVTPKLKLKHEHINVLDDRKLRVLPQDADRNKLHFNLHFLKNMLPGVVVKGIKTVGRAVIKEEKDKARNAKKFSLLVEGLVYFFFLLQI*
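
The protein sequence above corresponds to a codon-by-codon translation of the CDS. As a minimal sketch, assuming the standard genetic code (the record does not name a translation table) and using a hypothetical helper `translate`, the first and last codons of the CDS can be checked against the start and end of the protein:

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 27 and last 12 nucleotides of the CDS listed above.
protein_start = translate("ATGAGTGCTGGTTCGGTGAAGAAGGCT")
protein_end = translate("CTTCAAATTTGA")
print(protein_start, protein_end)
```

The two fragments agree with the beginning (`MSAGSVKKA`) and the end (`LQI*`) of the protein sequence, where `*` is the TGA stop codon.
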
BLAST of Csa1G084260 vs. Swiss-Prot
Match: NRPC1_ARATH (DNA-directed RNA polymerase III subunit 1 OS=Arabidopsis thaliana GN=NRPC1 PE=2 SV=1)

HSP 1 Score: 1332.8 bits (3448), Expect = 0.0e+00
Identity = 703/1085 (64.79%), Positives = 828/1085 (76.31%), Query Frame = 1

Query: 4    GSVKKAVSMLGILHYRARSKDAGVVSEDLR----------APYNVSNDILNPFRVLCLFQ 63
            G VKK  +  GI     RSK  G   ++ +          A  N    +L+P  VL LF+
Sbjct: 175  GMVKKIAAQFGIGISHDRSKIHGGEIDECKSAISHTKQSTAAINPLTYVLDPNLVLGLFK 234

Query: 64   RMSDEDCELLFLSNRPEKLIITNVLVPPIAIRPSVIMDGSQSNENDITERLKRIIQQNAS 123
            RMSD+DCELL+++ RPE LIIT +LVPP++IRPSV++ G QSNEND+T RLK+II  NAS
Sbjct: 235  RMSDKDCELLYIAYRPENLIITCMLVPPLSIRPSVMIGGIQSNENDLTARLKQIILGNAS 294

Query: 124  VSQELSTSNSQAKCLESWDMLQSEVAQLINSDVRGIPFSMQVSKPLAGFVQRLKGKQGRF 183
            + + LS   S  K ++ WD +Q EVA+ INS+VRG   +     PL+G +QRLKGK GRF
Sbjct: 295  LHKILSQPTSSPKNMQVWDTVQIEVARYINSEVRGCQ-NQPEEHPLSGILQRLKGKGGRF 354

Query: 184  RGNLCGKRVEFTGRTVISPDPNLKITEVAVPIHMARILTYPERVTRHNIEKLRQCVSNGP 243
            R NL GKRVEFTGRTVISPDPNLKITEV +PI MA+ILT+PE V+RHNIEKLRQCV NGP
Sbjct: 355  RANLSGKRVEFTGRTVISPDPNLKITEVGIPILMAQILTFPECVSRHNIEKLRQCVRNGP 414

Query: 244  DKYPGARMLRHLDGSMRSLMISGRKRLADELKYGEIVERHLEDGDVVLFNRQPSLHRMSI 303
            +KYPGAR +R+ DGS R+L+   RKR+ADEL  G IV+RHL++GDVVLFNRQPSLHRMSI
Sbjct: 415  NKYPGARNVRYPDGSSRTLVGDYRKRIADELAIGCIVDRHLQEGDVVLFNRQPSLHRMSI 474

Query: 304  MCHRVRVMPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAILLMGVQNNLCTPK 363
            MCHR R+MPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAI LMGVQNNLCTPK
Sbjct: 475  MCHRARIMPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAITLMGVQNNLCTPK 534

Query: 364  NGEILVASTQDFLTSSFLITRKDTFYDRAAFSLMCSYMGDGMDLVDLPTPALVKPIELWT 423
            NGEILVASTQDFLTSSFLITRKDTFYDRAAFSL+CSYMGDGMD +DLPTP ++KPIELWT
Sbjct: 535  NGEILVASTQDFLTSSFLITRKDTFYDRAAFSLICSYMGDGMDSIDLPTPTILKPIELWT 594

Query: 424  GKQLFSVLVRPHASMKVYLNLTVKEKSYSKVKGNEKERETMCPNDGFVYFRNSELISGQV 483
            GKQ+FSVL+RP+AS++VY+ L VKEK++ K  G     ETMC NDG+VYFRNSELISGQ+
Sbjct: 595  GKQIFSVLLRPNASIRVYVTLNVKEKNFKK--GEHGFDETMCINDGWVYFRNSELISGQL 654

Query: 484  GKATLGNGNKDGLYSVLLRDYKAHAAAVCMNRLAKLSARWIGNHGFSIGIDDVQPGDQLV 543
            GKATLGNGNKDGLYS+LLRDY +HAAAVCMNRLAKLSARWIG HGFSIGIDDVQPG++L 
Sbjct: 655  GKATLGNGNKDGLYSILLRDYNSHAAAVCMNRLAKLSARWIGIHGFSIGIDDVQPGEELS 714

Query: 544  KKKQTTILEGYRDCDKQINLFNTGNLPPEAGCDAAQSLESKITQILNGIREATANVCMQN 603
            K+++ +I  GY  C ++I  FN GNL  +AG D A+SLE++IT ILN IREAT   CM  
Sbjct: 715  KERKDSIQFGYDQCHRKIEEFNRGNLQLKAGLDGAKSLEAEITGILNTIREATGKACMSG 774

Query: 604  LHWRNSPLIMSQCGSKGSPINISQMVACVGQQSVGGRRAPDGFIDRSLPHFRRKAKTPAA 663
            LHWRNSPLIMSQCGSKGSPINISQMVACVGQQ+V G RAPDGFIDRSLPHF R +K+PAA
Sbjct: 775  LHWRNSPLIMSQCGSKGSPINISQMVACVGQQTVNGHRAPDGFIDRSLPHFPRMSKSPAA 834

Query: 664  KGFVANSFYSGLTATEFFFHTMGGREGLVDTAVKTADTGYMSRRLIKALEDLSIHYDSSV 723
            KGFVANSFYSGLTATEFFFHTMGGREGLVDTAVKTA TGYMSRRL+KALEDL +HYD++V
Sbjct: 835  KGFVANSFYSGLTATEFFFHTMGGREGLVDTAVKTASTGYMSRRLMKALEDLLVHYDNTV 894

Query: 724  RNAGGCIVQFCYGDDGMDPAQMEGKSGAPLNFERLFLKAKATC-PSDGNKILSPSEFSET 783
            RNA GCI+QF YGDDGMDPA MEGK GAPLNF RLFLK +ATC P   +  LS  E S+ 
Sbjct: 895  RNASGCILQFTYGDDGMDPALMEGKDGAPLNFNRLFLKVQATCPPRSHHTYLSSEELSQK 954

Query: 784  VEDRLSKDDASPECGCSPAFVGSLKIFLNKYVEAQKKSWGTLLADNESAVDKSIISSSDN 843
             E+ L + D S    C+ AFV SL+ F++             L   +SA           
Sbjct: 955  FEEELVRHDKSRV--CTDAFVKSLREFVS-------------LLGVKSASPP-------- 1014

Query: 844  DNIVIRNKVVQNIAGVTHRQLQVFLDTCLSRYHTKKIEAGTAIGAIGAQSIGEPGTQMTL 903
                   +V+   +GVT +QL+VF+  C+ RY  KKIEAGTAIG IGAQSIGEPGTQMTL
Sbjct: 1015 -------QVLYKASGVTDKQLEVFVKICVFRYREKKIEAGTAIGTIGAQSIGEPGTQMTL 1074

Query: 904  KTFHFAGVASMNVTLGVPRIKEIINGAKRISTPIVTAALTHDDNVNIARMVKARIEKTNL 963
            KTFHFAGVASMN+T GVPRI EIIN +K ISTP+++A L +   +  AR VK RIEKT L
Sbjct: 1075 KTFHFAGVASMNITQGVPRINEIINASKNISTPVISAELENPLELTSARWVKGRIEKTTL 1134

Query: 964  GQIAKCIQIVMSSRSALIEIKLDMEKIRDAELYVDANVVKQAILVTPKLKLKHEHINVLD 1023
            GQ+A+ I+++M+S SA + I LD + I +A L +    VK +IL TP++KL    I VL 
Sbjct: 1135 GQVAESIEVLMTSTSASVRIILDNKIIEEACLSITPWSVKNSILKTPRIKLNDNDIRVL- 1194

Query: 1024 DRKLRVLPQDADRNKLHFNLHFLKNMLPGVVVKGIKTVGRAVIKEEKDKAR---NAKKFS 1075
            D  L + P   D+++ HFNLH LKN+LP ++V GIKTV R V+ E+ DK++      K+ 
Sbjct: 1195 DTGLDITPV-VDKSRAHFNLHNLKNVLPNIIVNGIKTVERVVVAEDMDKSKQIDGKTKWK 1224

BLAST of Csa1G084260 vs. Swiss-Prot
Match: RPC1_HUMAN (DNA-directed RNA polymerase III subunit RPC1 OS=Homo sapiens GN=POLR3A PE=1 SV=2)

HSP 1 Score: 977.2 bits (2525), Expect = 1.4e-283
Identity = 535/1040 (51.44%), Positives = 697/1040 (67.02%), Query Frame = 1

Query: 43   LNPFRVLCLFQRMSDEDCELLFLS---NRPEKLIITNVLVPPIAIRPSVIMD-GSQSNEN 102
            LNP  VL LF+R+  ED  LL ++    +P  LI+T +LVPP+ IRPSV+ D  S +NE+
Sbjct: 218  LNPLVVLNLFKRIPAEDVPLLLMNPEAGKPSDLILTRLLVPPLCIRPSVVSDLKSGTNED 277

Query: 103  DITERLKRIIQQNASVSQELSTSNSQAKCLESWDMLQSEVAQLINSDVRGIPFSMQVSKP 162
            D+T +L  II  N  + +   +       +E WD LQ + A  INS++ GIP +M   K 
Sbjct: 278  DLTMKLTEIIFLNDVIKKHRISGAKTQMIMEDWDFLQLQCALYINSELSGIPLNMAPKKW 337

Query: 163  LAGFVQRLKGKQGRFRGNLCGKRVEFTGRTVISPDPNLKITEVAVPIHMARILTYPERVT 222
              GFVQRLKGKQGRFRGNL GKRV+F+GRTVISPDPNL+I EVAVP+H+A+ILT+PE+V 
Sbjct: 338  TRGFVQRLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAKILTFPEKVN 397

Query: 223  RHNIEKLRQCVSNGPDKYPGARMLRHLDGSM-RSLMISGRKRLADELKYGEIVERHLEDG 282
            + NI  LR+ V NGP+ +PGA  ++     M R L    R+++A ELKYG+IVERHL DG
Sbjct: 398  KANINFLRKLVQNGPEVHPGANFIQQRHTQMKRFLKYGNREKMAQELKYGDIVERHLIDG 457

Query: 283  DVVLFNRQPSLHRMSIMCHRVRVMPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEART 342
            DVVLFNRQPSLH++SIM H  RV P RT RFNE VC PYNADFDGDEMN+H+PQTEEA+ 
Sbjct: 458  DVVLFNRQPSLHKLSIMAHLARVKPHRTFRFNECVCTPYNADFDGDEMNLHLPQTEEAKA 517

Query: 343  EAILLMGVQNNLCTPKNGEILVASTQDFLTSSFLITRKDTFYDRAAFSLMCSYMGDGMD- 402
            EA++LMG + NL TP+NGE L+A+ QDFLT ++L+T KDTF+DRA    + + +  G D 
Sbjct: 518  EALVLMGTKANLVTPRNGEPLIAAIQDFLTGAYLLTLKDTFFDRAKACQIIASILVGKDE 577

Query: 403  --LVDLPTPALVKPIELWTGKQLFSVLVRPHASMKVYLNLTVKEKSYSKVKGNEKERETM 462
               V LP P ++KP+ LWTGKQ+FSV++RP     V  NL  K K Y   KG     E +
Sbjct: 578  KIKVRLPPPTILKPVTLWTGKQIFSVILRPSDDNPVRANLRTKGKQYCG-KG-----EDL 637

Query: 463  CPNDGFVYFRNSELISGQVGKATLGNGNKDGLYSVLLRDYKAHAAAVCMNRLAKLSARWI 522
            C ND +V  +NSEL+SG + K TLG+G+K+ ++ +LLRD+  + AA  M+RLA+L+  ++
Sbjct: 638  CANDSYVTIQNSELMSGSMDKGTLGSGSKNNIFYILLRDWGQNEAADAMSRLARLAPVYL 697

Query: 523  GNHGFSIGIDDVQPGDQLVKKKQTTILEGYRDCDKQINLFNTGNLPPEAGCDAAQSLESK 582
             N GFSIGI DV PG  L+K K   +  GY+ CD+ I   NTG L  + GC A ++LE+ 
Sbjct: 698  SNRGFSIGIGDVTPGQGLLKAKYELLNAGYKKCDEYIEALNTGKLQQQPGCTAEETLEAL 757

Query: 583  ITQILNGIREATANVCMQNLHWRNSPLIMSQCGSKGSPINISQMVACVGQQSVGGRRAPD 642
            I + L+ IR+   + C++ L   NSPL M+ CGSKGS INISQM+ACVGQQ++ G R PD
Sbjct: 758  ILKELSVIRDHAGSACLRELDKSNSPLTMALCGSKGSFINISQMIACVGQQAISGSRVPD 817

Query: 643  GFIDRSLPHFRRKAKTPAAKGFVANSFYSGLTATEFFFHTMGGREGLVDTAVKTADTGYM 702
            GF +RSLPHF + +K PAAKGFVANSFYSGLT TEFFFHTM GREGLVDTAVKTA+TGYM
Sbjct: 818  GFENRSLPHFEKHSKLPAAKGFVANSFYSGLTPTEFFFHTMAGREGLVDTAVKTAETGYM 877

Query: 703  SRRLIKALEDLSIHYDSSVRNAGGCIVQFCYGDDGMDPAQMEGKSGAPLNFERLFLKAKA 762
             RRL+K+LEDL   YD +VR++ G I+QF YG DG+DPA MEGK   PL F+R+    KA
Sbjct: 878  QRRLVKSLEDLCSQYDLTVRSSTGDIIQFIYGGDGLDPAAMEGKD-EPLEFKRVLDNIKA 937

Query: 763  TCPSDGNKILSPSEFSETVEDRLSKDDASPECGCSPAFVGSLKIFLNKYVEAQKKSWGTL 822
              P      LS +E   T E  + K   S    C  +F+  +K F+    E  KK+    
Sbjct: 938  VFPCPSEPALSKNELILTTESIMKK---SEFLCCQDSFLQEIKKFIKGVSEKIKKT---- 997

Query: 823  LADNESAVDKSIISSSDNDNIVIRNKVVQNIAGVTHRQLQVFLDTCLSRYHTKKIEAGTA 882
                    DK  I    NDN     +V+  +  +T  Q++ FL+TC  +Y   ++E G+A
Sbjct: 998  -------RDKYGI----NDNGTTEPRVLYQLDRITPTQVEKFLETCRDKYMRAQMEPGSA 1057

Query: 883  IGAIGAQSIGEPGTQMTLKTFHFAGVASMNVTLGVPRIKEIINGAKRISTPIVTAALTHD 942
            +GA+ AQSIGEPGTQMTLKTFHFAGVASMN+TLGVPRIKEIIN +K ISTPI+TA L  D
Sbjct: 1058 VGALCAQSIGEPGTQMTLKTFHFAGVASMNITLGVPRIKEIINASKAISTPIITAQLDKD 1117

Query: 943  DNVNIARMVKARIEKTNLGQIAKCIQIVMSSRSALIEIKLDMEKIRDAELYVDANVVKQA 1002
            D+ + AR+VK RIEKT LG+I++ I+ V       I +KL +E+IR   L V+A  V+ +
Sbjct: 1118 DDADYARLVKGRIEKTLLGEISEYIEEVFLPDDCFILVKLSLERIRLLRLEVNAETVRYS 1177

Query: 1003 ILVTPKLKLKHEHINVLDDRKLRVLPQDADRNKLHFNLHFLKNMLPGVVVKGIKTVGRAV 1062
            I  T KL++K   + V  +  + V P++  ++ +++ L FLK  LP VVV+GI  V RAV
Sbjct: 1178 I-CTSKLRVKPGDVAVHGEAVVCVTPRENSKSSMYYVLQFLKEDLPKVVVQGIPEVSRAV 1229

Query: 1063 IKEEKDKARNAKKFSLLVEG 1075
            I    D+    +K+ LLVEG
Sbjct: 1238 I--HIDEQSGKEKYKLLVEG 1229

BLAST of Csa1G084260 vs. Swiss-Prot
Match: RPC1_BOVIN (DNA-directed RNA polymerase III subunit RPC1 OS=Bos taurus GN=POLR3A PE=2 SV=1)

HSP 1 Score: 973.4 bits (2515), Expect = 2.1e-282
Identity = 529/1040 (50.87%), Positives = 698/1040 (67.12%), Query Frame = 1

Query: 43   LNPFRVLCLFQRMSDEDCELLFLS---NRPEKLIITNVLVPPIAIRPSVIMD-GSQSNEN 102
            LNP  VL LF+R+  ED  LL ++    +P  LI+T +LVPP+ IRPSV+ D  S +NE+
Sbjct: 218  LNPLVVLNLFKRIPAEDIPLLLMNPEAGKPSDLILTRLLVPPLCIRPSVVSDLKSGTNED 277

Query: 103  DITERLKRIIQQNASVSQELSTSNSQAKCLESWDMLQSEVAQLINSDVRGIPFSMQVSKP 162
            D+T +L  II  N  + +   +       +E WD LQ + A  INS++ GIP +M   K 
Sbjct: 278  DLTMKLTEIIFLNDVIKKHRISGAKTQMIMEDWDFLQLQCALYINSELSGIPLNMAPKKW 337

Query: 163  LAGFVQRLKGKQGRFRGNLCGKRVEFTGRTVISPDPNLKITEVAVPIHMARILTYPERVT 222
              GFVQRLKGKQGRFRGNL GKRV+F+GRTVISPDPNL+I EVAVP+H+A+ILT+PE+V 
Sbjct: 338  TRGFVQRLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAKILTFPEKVN 397

Query: 223  RHNIEKLRQCVSNGPDKYPGARMLRHLDGSM-RSLMISGRKRLADELKYGEIVERHLEDG 282
            + NI  LR+ V NGP+ +PGA  ++     M R L    R+++A ELK+G+IVERHL DG
Sbjct: 398  KANINFLRKLVRNGPEVHPGANFIQQRHTQMKRFLKYGNREKMAQELKFGDIVERHLIDG 457

Query: 283  DVVLFNRQPSLHRMSIMCHRVRVMPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEART 342
            DVVLFNRQPSLH++SIM H  RV P RT RFNE VC PYNADFDGDEMN+H+PQTEEA+ 
Sbjct: 458  DVVLFNRQPSLHKLSIMAHLARVKPHRTFRFNECVCTPYNADFDGDEMNLHLPQTEEAKA 517

Query: 343  EAILLMGVQNNLCTPKNGEILVASTQDFLTSSFLITRKDTFYDRAAFSLMCSYMGDGMD- 402
            EA++LMG + NL TP+NGE L+A+ QDFLT ++L+T KDTF+DRA    + + +  G D 
Sbjct: 518  EALVLMGTKANLVTPRNGEPLIAAIQDFLTGAYLLTLKDTFFDRAKACQIIASILVGKDE 577

Query: 403  --LVDLPTPALVKPIELWTGKQLFSVLVRPHASMKVYLNLTVKEKSYSKVKGNEKERETM 462
               V LP P ++KP+ LWTGKQ+FSV++RP     V  NL  K K Y          E +
Sbjct: 578  KIKVRLPPPTILKPVTLWTGKQIFSVILRPSDDNPVRANLRTKGKQYCG------RGEDL 637

Query: 463  CPNDGFVYFRNSELISGQVGKATLGNGNKDGLYSVLLRDYKAHAAAVCMNRLAKLSARWI 522
            C ND +V  +NSEL+SG + K TLG+G+K+ ++ +LLRD+  + AA  M+RLA+L+  ++
Sbjct: 638  CVNDSYVTIQNSELMSGSMDKGTLGSGSKNNIFYILLRDWGQNEAADAMSRLARLAPVYL 697

Query: 523  GNHGFSIGIDDVQPGDQLVKKKQTTILEGYRDCDKQINLFNTGNLPPEAGCDAAQSLESK 582
             N GFSIGI DV PG  L+K K   +  GY+ CD+ I   NTG L  + GC A ++LE+ 
Sbjct: 698  SNRGFSIGIGDVTPGQGLLKAKYELLNAGYKKCDEYIEALNTGKLQQQPGCTAEETLEAL 757

Query: 583  ITQILNGIREATANVCMQNLHWRNSPLIMSQCGSKGSPINISQMVACVGQQSVGGRRAPD 642
            I + L+ IR+   + C++ L   NSPL M+ CGSKGS INISQM+ACVGQQ++ G R PD
Sbjct: 758  ILKELSVIRDHAGSACLRELDKSNSPLTMALCGSKGSFINISQMIACVGQQAISGSRVPD 817

Query: 643  GFIDRSLPHFRRKAKTPAAKGFVANSFYSGLTATEFFFHTMGGREGLVDTAVKTADTGYM 702
            GF +RSLPHF + +K PAAKGFVANSFYSGLT TEFFFHTM GREGLVDTAVKTA+TGYM
Sbjct: 818  GFENRSLPHFEKHSKLPAAKGFVANSFYSGLTPTEFFFHTMAGREGLVDTAVKTAETGYM 877

Query: 703  SRRLIKALEDLSIHYDSSVRNAGGCIVQFCYGDDGMDPAQMEGKSGAPLNFERLFLKAKA 762
             RRL+K+LEDL   YD +VR++ G I+QF YG DG+DPA MEGK   PL F+R+    KA
Sbjct: 878  QRRLVKSLEDLCSQYDLTVRSSTGDIIQFIYGGDGLDPAAMEGKD-EPLEFKRVLDNIKA 937

Query: 763  TCPSDGNKILSPSEFSETVEDRLSKDDASPECGCSPAFVGSLKIFLNKYVEAQKKSWGTL 822
              P      LS +E   + E  + K++      C  +F+  +K F+ +  E  KK+    
Sbjct: 938  VFPCRSEPALSKNELLLSAESIMKKNEF---LCCQDSFLQEIKKFIKEVSEKIKKT---- 997

Query: 823  LADNESAVDKSIISSSDNDNIVIRNKVVQNIAGVTHRQLQVFLDTCLSRYHTKKIEAGTA 882
                    DK  I    NDN     +V+  +  +T  Q++ FL+TC  +Y   ++E G+A
Sbjct: 998  -------RDKYGI----NDNGTTEPRVLYQLDRITPTQIEKFLETCRDKYMRAQMEPGSA 1057

Query: 883  IGAIGAQSIGEPGTQMTLKTFHFAGVASMNVTLGVPRIKEIINGAKRISTPIVTAALTHD 942
            +GA+ AQSIGEPGTQMTLKTFHFAGVASMN+TLGVPRIKEIIN +K ISTPI+TA L  D
Sbjct: 1058 VGALCAQSIGEPGTQMTLKTFHFAGVASMNITLGVPRIKEIINASKAISTPIITAQLDKD 1117

Query: 943  DNVNIARMVKARIEKTNLGQIAKCIQIVMSSRSALIEIKLDMEKIRDAELYVDANVVKQA 1002
            D+ + AR+VK RIEKT LG+I++ I+ V       I +KL +E+IR   L V+A  V+ +
Sbjct: 1118 DDADYARLVKGRIEKTLLGEISEYIEEVFLPDDCFILVKLSLERIRLLRLEVNAETVRYS 1177

Query: 1003 ILVTPKLKLKHEHINVLDDRKLRVLPQDADRNKLHFNLHFLKNMLPGVVVKGIKTVGRAV 1062
            I ++ KL++K   + V  +  + V P++  ++ +++ L FLK  LP VVV+GI  V RAV
Sbjct: 1178 ICMS-KLRVKPGDVAVHGEAVVCVTPRENSKSSMYYVLQFLKEDLPKVVVQGIPEVSRAV 1229

Query: 1063 IKEEKDKARNAKKFSLLVEG 1075
            I    D+    +K+ LLVEG
Sbjct: 1238 I--HIDEQSGKEKYKLLVEG 1229

BLAST of Csa1G084260 vs. Swiss-Prot
Match: RPC1_CHICK (DNA-directed RNA polymerase III subunit RPC1 OS=Gallus gallus GN=POLR3A PE=2 SV=1)

HSP 1 Score: 969.1 bits (2504), Expect = 3.9e-281
Identity = 531/1040 (51.06%), Positives = 698/1040 (67.12%), Query Frame = 1

Query: 43   LNPFRVLCLFQRMSDEDCELLFLS---NRPEKLIITNVLVPPIAIRPSVIMD-GSQSNEN 102
            LNP  VL LF+R+  ED  LL ++    +P  LI+T +LVPP+ IRPSV+ D  S +NE+
Sbjct: 218  LNPLVVLNLFKRIPAEDIPLLLMNPEAGKPSDLILTRLLVPPLCIRPSVVSDLKSGTNED 277

Query: 103  DITERLKRIIQQNASVSQELSTSNSQAKCLESWDMLQSEVAQLINSDVRGIPFSMQVSKP 162
            D+T +L  II  N  + +   +       +E WD LQ + A  INS++ GIP +M   K 
Sbjct: 278  DLTMKLTEIIFLNDVIKKHRISGAKTQMIMEDWDFLQLQCALYINSELSGIPLNMAPKKW 337

Query: 163  LAGFVQRLKGKQGRFRGNLCGKRVEFTGRTVISPDPNLKITEVAVPIHMARILTYPERVT 222
              GFVQRLKGKQGRFRGNL GKRV+F+GRTVISPDPNL+I EVAVPIH+A+ILT+PE+V 
Sbjct: 338  TRGFVQRLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPIHVAKILTFPEKVN 397

Query: 223  RHNIEKLRQCVSNGPDKYPGARMLRHLDGSM-RSLMISGRKRLADELKYGEIVERHLEDG 282
            + NI  +R+ V NGPD +PGA  ++     M R L    R+++A ELK+G+IVERHL DG
Sbjct: 398  KANINFMRKLVRNGPDVHPGANFIQQRHTQMKRFLKYGNREKMAQELKFGDIVERHLIDG 457

Query: 283  DVVLFNRQPSLHRMSIMCHRVRVMPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEART 342
            D+VLFNRQPSLH++SIM H  RV P RT RFNE VC PYNADFDGDEMN+H+PQTEEA+ 
Sbjct: 458  DIVLFNRQPSLHKLSIMAHIARVKPHRTFRFNECVCTPYNADFDGDEMNLHLPQTEEAKA 517

Query: 343  EAILLMGVQNNLCTPKNGEILVASTQDFLTSSFLITRKDTFYDRAAFSLMCSYMGDGMD- 402
            EA++LMG + NL TP+NGE L+A+ QDFLT ++L+T KDTF+DRA    + + +  G D 
Sbjct: 518  EALVLMGTKANLVTPRNGEPLIAAIQDFLTGAYLLTLKDTFFDRAKACQIIASILVGKDE 577

Query: 403  --LVDLPTPALVKPIELWTGKQLFSVLVRPHASMKVYLNLTVKEKSYSKVKGNEKERETM 462
               V LP PA++KP+ LWTGKQ+FS +++P     V  NL  K K Y   KG     E +
Sbjct: 578  KIKVRLPPPAILKPVTLWTGKQVFSSILKPSDDCPVKANLRTKGKQYCG-KG-----EDL 637

Query: 463  CPNDGFVYFRNSELISGQVGKATLGNGNKDGLYSVLLRDYKAHAAAVCMNRLAKLSARWI 522
            C ND +V  +NSEL+SG + K TLG+G+K+ ++ +LLRD+    AA  M+RLA+L+  ++
Sbjct: 638  CYNDSYVTIQNSELMSGSMDKGTLGSGSKNNIFYILLRDWGQVEAADAMSRLARLAPVYL 697

Query: 523  GNHGFSIGIDDVQPGDQLVKKKQTTILEGYRDCDKQINLFNTGNLPPEAGCDAAQSLESK 582
             N GFSIGI DV PG  L+K K   +  GY+ CD+ I   NTG L  + GC A ++LE+ 
Sbjct: 698  SNRGFSIGIGDVTPGQGLLKAKYELLHAGYKKCDEYIEALNTGKLQQQPGCTAEETLEAL 757

Query: 583  ITQILNGIREATANVCMQNLHWRNSPLIMSQCGSKGSPINISQMVACVGQQSVGGRRAPD 642
            I + L+ IR+   + C++ L   NSPLIM+ CGSKGS INISQM+ACVGQQ++ G R PD
Sbjct: 758  ILKELSVIRDHAGSACLRELDKSNSPLIMALCGSKGSFINISQMIACVGQQAISGSRVPD 817

Query: 643  GFIDRSLPHFRRKAKTPAAKGFVANSFYSGLTATEFFFHTMGGREGLVDTAVKTADTGYM 702
            GF +RSLPHF + +K PAAKGFVANSFYSGLT TEFFFHTM GREGLVDTAVKTA+TGYM
Sbjct: 818  GFENRSLPHFEKHSKLPAAKGFVANSFYSGLTPTEFFFHTMAGREGLVDTAVKTAETGYM 877

Query: 703  SRRLIKALEDLSIHYDSSVRNAGGCIVQFCYGDDGMDPAQMEGKSGAPLNFERLFLKAKA 762
             RRL+K+LEDL   YD +VR++ G I+QF YG DG+DPA MEGK   PL F+R+    +A
Sbjct: 878  QRRLVKSLEDLCSQYDLTVRSSTGDIIQFIYGGDGLDPAAMEGKD-EPLEFKRVLDNIRA 937

Query: 763  TCPSDGNKILSPSEFSETVEDRLSKDDASPECGCSPAFVGSLKIFLNKYVEAQKKSWGTL 822
              P      LS +E   T E  + K++      C  +F+  +K F+    E  KK+    
Sbjct: 938  VYPCRSEPALSKNELVLTSESIMKKNEF---LCCRDSFLQEIKKFIKGVSEKIKKT---- 997

Query: 823  LADNESAVDKSIISSSDNDNIVIRNKVVQNIAGVTHRQLQVFLDTCLSRYHTKKIEAGTA 882
                    DK  I    NDN     +V+  +  +T  QL+ FL+TC  +Y   ++E G+A
Sbjct: 998  -------RDKYGI----NDNGTTEPRVLYQLDRITPTQLEKFLETCRDKYMRAQMEPGSA 1057

Query: 883  IGAIGAQSIGEPGTQMTLKTFHFAGVASMNVTLGVPRIKEIINGAKRISTPIVTAALTHD 942
            +GA+ AQSIGEPGTQMTLKTFHFAGVASMN+TLGVPRIKEIIN +K ISTPI+TA L  D
Sbjct: 1058 VGALCAQSIGEPGTQMTLKTFHFAGVASMNITLGVPRIKEIINASKAISTPIITAQLDKD 1117

Query: 943  DNVNIARMVKARIEKTNLGQIAKCIQIVMSSRSALIEIKLDMEKIRDAELYVDANVVKQA 1002
            D+ + AR+VK RIEKT LG+I++ I+ V       I +KL +E+IR   L V+A  V+ +
Sbjct: 1118 DDPDFARLVKGRIEKTLLGEISEYIEEVFLPDDCFILVKLSLERIRLLRLEVNAETVRYS 1177

Query: 1003 ILVTPKLKLKHEHINVLDDRKLRVLPQDADRNKLHFNLHFLKNMLPGVVVKGIKTVGRAV 1062
            I ++ KL++K   + V  +  + V P++  ++ +++ L  LK  LP VVV+GI  V RAV
Sbjct: 1178 ICIS-KLRVKPGDVAVHGEAVVCVTPRENSKSSMYYVLQSLKEELPKVVVQGIPEVSRAV 1229

Query: 1063 IKEEKDKARNAKKFSLLVEG 1075
            I    D+    +K+ LLVEG
Sbjct: 1238 I--HIDEQSGKEKYKLLVEG 1229

BLAST of Csa1G084260 vs. Swiss-Prot
Match: RPC1_SCHPO (DNA-directed RNA polymerase III subunit rpc1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=rpc1 PE=2 SV=1)

HSP 1 Score: 961.8 bits (2485), Expect = 6.2e-279
Identity = 510/1043 (48.90%), Positives = 692/1043 (66.35%), Query Frame = 1

Query: 31   DLRAPYNVSNDILNPFRVLCLFQRMSDEDCELLFLS---NRPEKLIITNVLVPPIAIRPS 90
            +L+   + ++D LNP +VL LF++++  DCELL +     RPE L+   V  PP+ IRPS
Sbjct: 202  ELKMHLSKAHDDLNPLKVLNLFKQITPVDCELLGMDPEHGRPENLLWRYVPAPPVCIRPS 261

Query: 91   VIMDGSQSNENDITERLKRIIQQNASVSQELSTSNSQAKCLESWDMLQSEVAQLINSDVR 150
            V  +G+ + E+D+T ++  II  ++ +   LS     +  +E W+ +Q  +A  INS++ 
Sbjct: 262  VAQEGA-TTEDDLTVKITEIIWTSSLIRAALSKGTPISNLMEQWEFMQLSIAMYINSEMP 321

Query: 151  GIPFSMQVSKPLAGFVQRLKGKQGRFRGNLCGKRVEFTGRTVISPDPNLKITEVAVPIHM 210
            G+  S   SKP+ GF QRLKGKQGRFRGNL GKRV+F+GRTVISPDPNL+I +VAVP  +
Sbjct: 322  GLRPSDMPSKPIRGFCQRLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDQVAVPYRI 381

Query: 211  ARILTYPERVTRHNIEKLRQCVSNGPDKYPGAR-MLRHLDGSMRSLMISGRKRLADELKY 270
            A+ILT+PERVT  N + L+ C+ NGPD +PGA  ++    G  R L    R R+AD+LK 
Sbjct: 382  AKILTFPERVTTQNKKHLQDCIRNGPDVHPGANYVIDRESGFKRFLRFGNRNRIADDLKI 441

Query: 271  GEIVERHLEDGDVVLFNRQPSLHRMSIMCHRVRVMPWRTLRFNESVCNPYNADFDGDEMN 330
            G+IVERHL D DVVLFNRQPSLH++SIM H V+V PWRTLRFNE VC PYNADFDGDEMN
Sbjct: 442  GDIVERHLHDNDVVLFNRQPSLHKLSIMAHLVKVRPWRTLRFNECVCGPYNADFDGDEMN 501

Query: 331  MHVPQTEEARTEAILLMGVQNNLCTPKNGEILVASTQDFLTSSFLITRKDTFYDRAAFSL 390
            +HVPQTEEA+TEA+ LMG++NNL +P+NGE ++A+TQDF+T+++L++ KDTF DR + S 
Sbjct: 502  LHVPQTEEAKTEALELMGIKNNLVSPRNGEPIIAATQDFITAAYLLSLKDTFLDRKSISN 561

Query: 391  MCSYMGDGMDLVDLPTPALVKPIELWTGKQLFSVLVRPHASMKVYLNLTVKEKSYSKVKG 450
            +C YM D    +DLP PA++KP  LWTGKQ+F+VL++P+   KV +NL  K +S+S++K 
Sbjct: 562  ICCYMMDASTHIDLPPPAIIKPRCLWTGKQVFTVLMKPNRFSKVLVNLDAKTRSFSRIKS 621

Query: 451  NEKERETMCPNDGFVYFRNSELISGQVGKATLGNGNKDGLYSVLLRDYKAHAAAVCMNRL 510
               E   MCP DG++  RNSE+I+G V K+ +G+G KD L+ V+LRDY A  AA  + RL
Sbjct: 622  KTPE---MCPKDGYLMIRNSEIIAGVVDKSVVGDGKKDSLFYVILRDYGALEAAEAITRL 681

Query: 511  AKLSARWIGNHGFSIGIDDVQPGDQLVKKKQTTILEGYRDCDKQINLFNTGNLPPEAGCD 570
            +K+ AR++GN GFSIGI+DVQPG  L  +K+  + + Y   D  I  +  G L  + G D
Sbjct: 682  SKMCARFLGNRGFSIGIEDVQPGKSLSSQKEILVNKAYATSDDFIMQYAKGILECQPGMD 741

Query: 571  AAQSLESKITQILNGIREATANVCMQNLHWRNSPLIMSQCGSKGSPINISQMVACVGQQS 630
               +LE+KI+  L+ +R+    +CM  L   NSPLIM+ CGSKGS IN+SQMVACVGQQ 
Sbjct: 742  QEATLEAKISSTLSKVRDDVGEICMDELGPANSPLIMATCGSKGSKINVSQMVACVGQQI 801

Query: 631  VGGRRAPDGFIDRSLPHFRRKAKTPAAKGFVANSFYSGLTATEFFFHTMGGREGLVDTAV 690
            + G+R PDGF DRSLPHF + +K P AKGFV+NSFYSGLT TEF FH + GREGLVDTAV
Sbjct: 802  ISGKRVPDGFQDRSLPHFHKNSKHPLAKGFVSNSFYSGLTPTEFLFHAISGREGLVDTAV 861

Query: 691  KTADTGYMSRRLIKALEDLSIHYDSSVRNAGGCIVQFCYGDDGMDPAQMEGKSGAPLNFE 750
            KTA+TGYMSRRL+K+LEDLS  YD +VR++   +VQF YGDDG+DP  MEG  G  + F+
Sbjct: 862  KTAETGYMSRRLMKSLEDLSSAYDGTVRSSNSDVVQFVYGDDGLDPTYMEG-DGQAVEFK 921

Query: 751  RLFLKAKATCPSDGNKILSPSEFSETVEDRLSKDDASPECGCSPAFVGSLKIF----LNK 810
            R ++ +        +  + P E  + V   L  DD      C+  F+ +++ F    + K
Sbjct: 922  RTWIHSVNLNYDRHDSAMLPYEIIDYVNRAL--DDPKFLTNCNRDFIETIRTFVIENIAK 981

Query: 811  YVEAQKKSWGTLLADNESAVDKSIISSSDNDNIVIRNKVVQNIAGVTHRQLQVFLDTCLS 870
            Y+ + ++         E  +D       D    V + K V+NI  VT +QL+ F+D C  
Sbjct: 982  YLASVRERRDLAPMLEEPDMDDLDDMEGDEFAPVAKRKSVENIIRVTEKQLRSFVDRCWE 1041

Query: 871  RYHTKKIEAGTAIGAIGAQSIGEPGTQMTLKTFHFAGVASMNVTLGVPRIKEIINGAKRI 930
            +Y   K+E GTA+GAIGAQSIGEPGTQMTLKTFHFAGVA+   TLGVPRIKEIIN AK I
Sbjct: 1042 KYMRAKVEPGTAVGAIGAQSIGEPGTQMTLKTFHFAGVAA-QTTLGVPRIKEIINAAKTI 1101

Query: 931  STPIVTAALTHDDNVNIARMVKARIEKTNLGQIAKCIQIVMSSRSALIEIKLDMEKIRDA 990
            STPI+T  L +D +   AR+VK RIEKT L  +   I+ V    +  + I+++ + I   
Sbjct: 1102 STPIITGQLINDRDERSARVVKGRIEKTYLKDVTSYIEEVYGPVTTYLSIQVNFDTISKL 1161

Query: 991  ELYVDANVVKQAILVTPKLKLKHEHINVLDDRKLRVLPQDAD----RNKLHFNLHFLKNM 1050
            +L +    +  AI  TPKLK+  + + V +  +   +   +D      ++++ L   K +
Sbjct: 1162 QLDITLADIAAAIWNTPKLKIPSQQVTVNNTLQQIHVHTSSDGKSSETEVYYRLQTYKRV 1221

Query: 1051 LPGVVVKGIKTVGRAVIKEEKDK 1062
            LP VVV GI T+ R+VI +E  K
Sbjct: 1222 LPDVVVAGIPTINRSVINQESGK 1236

BLAST of Csa1G084260 vs. TrEMBL
Match: A0A061FU60_THECC (DNA-directed RNA polymerase subunit OS=Theobroma cacao GN=TCM_012066 PE=3 SV=1)

HSP 1 Score: 1536.2 bits (3976), Expect = 0.0e+00
Identity = 784/1090 (71.93%), Positives = 898/1090 (82.39%), Query Frame = 1

Query: 4    GSVKKAVSMLGILHYRARSKDAGVV--------SEDLRAPYNVSNDILNPFRVLCLFQRM 63
            G+VKKAV+MLGI+H R++  D  +         +++ +A +NV+  +LNP +VL LF+RM
Sbjct: 171  GTVKKAVAMLGIIHDRSKINDNSLEEFRSAISHTKESKASFNVATYVLNPVKVLSLFKRM 230

Query: 64   SDEDCELLFLSNRPEKLIITNVLVPPIAIRPSVIMDGSQSNENDITERLKRIIQQNASVS 123
            +D DCELL+LS+RPEKLIITN+ VPPI IRPSVIMDGSQSNENDITERLKRIIQ NAS+ 
Sbjct: 231  TDLDCELLYLSDRPEKLIITNIAVPPIPIRPSVIMDGSQSNENDITERLKRIIQANASLR 290

Query: 124  QELSTSNSQAKCLESWDMLQSEVAQLINSDVRGIPFSMQVSKPLAGFVQRLKGKQGRFRG 183
            QEL  +N+  KCL  W+MLQ EVAQ INSDVRG+PFSMQVSKPL+GFVQR+KGK GRFRG
Sbjct: 291  QELVETNAAFKCLGGWEMLQVEVAQYINSDVRGVPFSMQVSKPLSGFVQRIKGKHGRFRG 350

Query: 184  NLCGKRVEFTGRTVISPDPNLKITEVAVPIHMARILTYPERVTRHNIEKLRQCVSNGPDK 243
            NL GKRVE+TGRTVISPDPNLKITEVA+PIHMARILTYPERV+ HNIEKLRQCV NGP K
Sbjct: 351  NLSGKRVEYTGRTVISPDPNLKITEVAIPIHMARILTYPERVSNHNIEKLRQCVRNGPSK 410

Query: 244  YPGARMLRHLDGSMRSLMISGRKRLADELKYGEIVERHLEDGDVVLFNRQPSLHRMSIMC 303
            YPGARM+R+ DGS R L+   RKRLADELK+G +V+RHLEDGD+VLFNRQPSLHRMSIMC
Sbjct: 411  YPGARMVRYPDGSARLLIGDYRKRLADELKFGCVVDRHLEDGDIVLFNRQPSLHRMSIMC 470

Query: 304  HRVRVMPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAILLMGVQNNLCTPKNG 363
            HR R+MPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEA++LMGVQNNLCTPKNG
Sbjct: 471  HRARIMPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEALMLMGVQNNLCTPKNG 530

Query: 364  EILVASTQDFLTSSFLITRKDTFYDRAAFSLMCSYMGDGMDLVDLPTPALVKPIELWTGK 423
            EILVASTQDFLTSSFLITRKD FYDRAAFSL+CSYMGDGMDL+DLPTPAL+KPIELWTGK
Sbjct: 531  EILVASTQDFLTSSFLITRKDIFYDRAAFSLICSYMGDGMDLIDLPTPALLKPIELWTGK 590

Query: 424  QLFSVLVRPHASMKVYLNLTVKEKSYSK-----VKGNEKERETMCPNDGFVYFRNSELIS 483
            QLFSVL+RPHAS++VYLNL VKE++YSK     +   E E ETMCP+DGFVY RNSELI 
Sbjct: 591  QLFSVLLRPHASVRVYLNLIVKERNYSKKIIKRIGNKEIEVETMCPDDGFVYIRNSELIC 650

Query: 484  GQVGKATLGNGNKDGLYSVLLRDYKAHAAAVCMNRLAKLSARWIGNHGFSIGIDDVQPGD 543
            GQ+GKATLGNGNKDGLYSVLLRDY AHAAA CMNRLAKLSARWIGNHGFSIGIDDVQPG 
Sbjct: 651  GQLGKATLGNGNKDGLYSVLLRDYNAHAAAACMNRLAKLSARWIGNHGFSIGIDDVQPGK 710

Query: 544  QLVKKKQTTILEGYRDCDKQINLFNTGNLPPEAGCDAAQSLESKITQILNGIREATANVC 603
            +L  +K  TI   Y+ CD++I  FN G L P+ G DAAQ+LE+ +T ILN IR+ T  VC
Sbjct: 711  RLNDEKALTISGDYKKCDEEIQTFNEGKLKPKPGYDAAQTLEANVTAILNNIRDKTGKVC 770

Query: 604  MQNLHWRNSPLIMSQCGSKGSPINISQMVACVGQQSVGGRRAPDGFIDRSLPHFRRKAKT 663
            M+ LHWRNSPLIMSQCGSKGS INISQM+ACVGQQSVGGRRAP+GFIDRSLPHF R +KT
Sbjct: 771  MKELHWRNSPLIMSQCGSKGSAINISQMIACVGQQSVGGRRAPNGFIDRSLPHFHRGSKT 830

Query: 664  PAAKGFVANSFYSGLTATEFFFHTMGGREGLVDTAVKTADTGYMSRRLIKALEDLSIHYD 723
            PAAKGFVANSFYSGLTATEFFFHTM GREGLVDTAVKTA+TGYMSRRLIKALEDLSIHYD
Sbjct: 831  PAAKGFVANSFYSGLTATEFFFHTMAGREGLVDTAVKTAETGYMSRRLIKALEDLSIHYD 890

Query: 724  SSVRNAGGCIVQFCYGDDGMDPAQMEGKSGAPLNFERLFLKAKATCPSDGNKILSPSEFS 783
            ++VRNA GCIVQF YGDDGMDPA MEGKSG PLNF+RL +K KATCP    K L      
Sbjct: 891  NTVRNASGCIVQFIYGDDGMDPACMEGKSGFPLNFDRLLMKVKATCPPIEQKCLHVGSIM 950

Query: 784  ETVEDRLSKDDASPECGCSPAFVGSLKIFLNKYVEAQKKSWGTLLADNESAVDKSIISSS 843
            + +E++L+K D  P   CS AF  SLK FL    ++Q      ++    +   KS     
Sbjct: 951  QMLEEQLAKHD--PAGVCSEAFKKSLKGFL----KSQTNELDRVMKLVNNCAQKS----- 1010

Query: 844  DNDNIVIRNKVVQNIAGVTHRQLQVFLDTCLSRYHTKKIEAGTAIGAIGAQSIGEPGTQM 903
                  I  KV   I+G++ RQL+VF+ TC+SRY +K IEAGTAIGAIGAQSIGEPGTQM
Sbjct: 1011 -----EILEKVGHKISGISDRQLEVFVSTCISRYRSKVIEAGTAIGAIGAQSIGEPGTQM 1070

Query: 904  TLKTFHFAGVASMNVTLGVPRIKEIINGAKRISTPIVTAALTHDDNVNIARMVKARIEKT 963
            TLKTFHFAGVASMN+T GVPRIKEIIN AKRISTP++TA L  DDN NIA++VK RIEKT
Sbjct: 1071 TLKTFHFAGVASMNITQGVPRIKEIINAAKRISTPVITAELEFDDNPNIAQIVKGRIEKT 1130

Query: 964  NLGQIAKCIQIVMSSRSALIEIKLDMEKIRDAELYVDANVVKQAILVTPKLKLKHEHINV 1023
             LGQ+AK I+IV++SRSA + I LDME I DAELY+DAN+VK++IL TPK+KLK +H+ V
Sbjct: 1131 VLGQVAKSIKIVITSRSASVVITLDMEIILDAELYIDANIVKESILQTPKIKLKEQHVKV 1190

Query: 1024 LDDRKLRVLPQDADRNKLHFNLHFLKNMLPGVVVKGIKTVGRAVIKEEKDKARNAKK--- 1075
            LD RKL V+P  ADR+++HF LH LKN+LP VVVKGIKTV R V+ ++  + +N K+   
Sbjct: 1191 LDGRKLEVVP-PADRSQIHFELHSLKNLLPLVVVKGIKTVERTVVYDKNKEKKNQKEEET 1243

BLAST of Csa1G084260 vs. TrEMBL
Match: I1LK95_SOYBN (DNA-directed RNA polymerase subunit OS=Glycine max GN=GLYMA_U034000 PE=3 SV=2)

HSP 1 Score: 1523.5 bits (3943), Expect = 0.0e+00
Identity = 775/1090 (71.10%), Positives = 893/1090 (81.93%), Query Frame = 1

Query: 4    GSVKKAVSMLGILH-------YRARSKDAGVVS-EDLRAPYNVSNDILNPFRVLCLFQRM 63
            GSVKK  + L I+H       Y     D+ +   +D RA  NVSN ILNPF+VL LF+RM
Sbjct: 173  GSVKKLPASLTIIHDCSKCRNYIVEELDSALSRMKDSRATTNVSNRILNPFQVLSLFKRM 232

Query: 64   SDEDCELLFLSNRPEKLIITNVLVPPIAIRPSVIMDGSQSNENDITERLKRIIQQNASVS 123
             DEDCELL+++ RPEKLI+TNV+VPPIAIRPSV+MD S SNENDITERLK IIQ NA + 
Sbjct: 233  LDEDCELLYVAERPEKLIMTNVVVPPIAIRPSVVMDESLSNENDITERLKNIIQANAVLR 292

Query: 124  QELSTSNSQAKCLESWDMLQSEVAQLINSDVRGIPFSMQVSKPLAGFVQRLKGKQGRFRG 183
            QEL  S   +K L+ WD+LQ+EVAQ INSDVRGIPF MQ +K LAGFVQRLKGK GRFRG
Sbjct: 293  QELQESTFSSKFLDGWDILQNEVAQFINSDVRGIPFYMQPTKQLAGFVQRLKGKHGRFRG 352

Query: 184  NLCGKRVEFTGRTVISPDPNLKITEVAVPIHMARILTYPERVTRHNIEKLRQCVSNGPDK 243
            NL GKRVE+TGRTVISPDPNLKI+EVA+PIHMARILTYPERVT HNIEKLRQCV NGPDK
Sbjct: 353  NLSGKRVEYTGRTVISPDPNLKISEVAIPIHMARILTYPERVTHHNIEKLRQCVRNGPDK 412

Query: 244  YPGARMLRHLDGSMRSLMISGRKRLADELKYGEIVERHLEDGDVVLFNRQPSLHRMSIMC 303
            YPGARMLR   G   SL +  RKR ADEL+ G+IV+RHLEDGD+VLFNRQPSLHRMSIMC
Sbjct: 413  YPGARMLRRDGGHSWSLKVLCRKRAADELRIGDIVDRHLEDGDIVLFNRQPSLHRMSIMC 472

Query: 304  HRVRVMPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAILLMGVQNNLCTPKNG 363
            HR R+MPWRTLRFNESVCNPYNADFDGDEMN+HVPQTEEARTEAILLMGV+NNLCTPKNG
Sbjct: 473  HRARIMPWRTLRFNESVCNPYNADFDGDEMNLHVPQTEEARTEAILLMGVENNLCTPKNG 532

Query: 364  EILVASTQDFLTSSFLITRKDTFYDRAAFSLMCSYMGDGMDLVDLPTPALVKPIELWTGK 423
            EILVASTQDFLTSSFLITRKDTFYDR+ FSL+CSY+GDGMD +DLPTPA+VKP+ELW+GK
Sbjct: 533  EILVASTQDFLTSSFLITRKDTFYDRSTFSLICSYIGDGMDPIDLPTPAIVKPVELWSGK 592

Query: 424  QLFSVLVRPHASMKVYLNLTVKEKSYS---KVKGNEKERETMCPNDGFVYFRNSELISGQ 483
            QLFS+++RPHA+M+VY+NLTVKE++Y+   K+K  + E +T+CPNDGFVYFRNSELISGQ
Sbjct: 593  QLFSIILRPHANMRVYVNLTVKERNYTEDKKIKDKKIEWKTLCPNDGFVYFRNSELISGQ 652

Query: 484  VGKATLGNGNKDGLYSVLLRDYKAHAAAVCMNRLAKLSARWIGNHGFSIGIDDVQPGDQL 543
            VGK TLGNGNKDGL+SVLLRDY+AHAAA CMNRLAKLSARWIGNHGFSIGIDDVQP + L
Sbjct: 653  VGKVTLGNGNKDGLFSVLLRDYRAHAAASCMNRLAKLSARWIGNHGFSIGIDDVQPKEIL 712

Query: 544  VKKKQTTILEGYRDCDKQINLFNTGNLPPEAGCDAAQSLESKITQILNGIREATANVCMQ 603
            + KK  TI EGYR+CDK I  FN G L   AGCDAAQ+LE++IT +LNG+R+    VCMQ
Sbjct: 713  INKKDETISEGYRECDKHIEAFNKGKLELLAGCDAAQTLETRITGVLNGLRDTAGKVCMQ 772

Query: 604  NLHWRNSPLIMSQCGSKGSPINISQMVACVGQQSVGGRRAPDGFIDRSLPHFRRKAKTPA 663
             LHWRNSPLIMSQCGSKGS INISQMVACVGQQSVGGRR P+GFIDRSLPHF RK+KTPA
Sbjct: 773  TLHWRNSPLIMSQCGSKGSSINISQMVACVGQQSVGGRRTPNGFIDRSLPHFPRKSKTPA 832

Query: 664  AKGFVANSFYSGLTATEFFFHTMGGREGLVDTAVKTADTGYMSRRLIKALEDLSIHYDSS 723
            AKGFVANSFYSGL+ATEFFFHTMGGREGLVDTAVKTADTGYMSR+L+K+LEDL +HYD +
Sbjct: 833  AKGFVANSFYSGLSATEFFFHTMGGREGLVDTAVKTADTGYMSRQLMKSLEDLFLHYDYT 892

Query: 724  VRNAGGCIVQFCYGDDGMDPAQMEGKSGAPLNFERLFLKAKATCPSD-GNKILSPSEFSE 783
            VRNAGG IVQFCYGDDGMDPA MEGK+G PLNFERLFLK+KA CP+D  ++ILS S+ S+
Sbjct: 893  VRNAGGSIVQFCYGDDGMDPAGMEGKNGKPLNFERLFLKSKAICPNDEDDEILSSSDVSK 952

Query: 784  TVEDRLSKDDASP-------ECGCSPAFVGSLKIFLNKYVEAQKKSWGTLLADNESAVDK 843
             V ++LS+ D S        E G S  FV SL+ F+                DN    ++
Sbjct: 953  VVHEKLSEFDMSRLAEKGVFEVGFSADFVESLQSFIK---------------DNAKLTEE 1012

Query: 844  SIISSSDNDNIVIRNKVVQNIAGVTHRQLQVFLDTCLSRYHTKKIEAGTAIGAIGAQSIG 903
               +   + N+    K  Q I+G+T +QL VFL+ CLSRYH+KK+EAG  +GA GA SIG
Sbjct: 1013 G-FTDEHSQNL---KKFGQRISGITRKQLDVFLNICLSRYHSKKMEAGAPVGATGAHSIG 1072

Query: 904  EPGTQMTLKTFHFAGVASMNVTLGVPRIKEIINGAKRISTPIVTAALTHDDNVNIARMVK 963
            EPGTQMTLKTFHFAGVASMNVTLGVPR+KEI+NG K+ISTPI+TA L  DDN N AR+VK
Sbjct: 1073 EPGTQMTLKTFHFAGVASMNVTLGVPRVKEIMNGNKKISTPIITAILERDDNANTARIVK 1132

Query: 964  ARIEKTNLGQIAKCIQIVMSSRSALIEIKLDMEKIRDAELYVDANVVKQAILVTPKLKLK 1023
             RIEKTNLGQ+AK I++VM+SRSA + I LDM++I+DA L +DAN+VK++IL T K KLK
Sbjct: 1133 GRIEKTNLGQVAKSIKVVMTSRSASVVITLDMKRIQDAHLNIDANIVKESILRTKKTKLK 1192

Query: 1024 HEHINVLDDRKLRVLPQDADRNKLHFNLHFLKNMLPGVVVKGIKTVGRAVIKEEKDKARN 1075
             EHI +LD +KL V+PQD DR+K+HF LH+LKN+LP VVVKGIKTV R VI     K   
Sbjct: 1193 PEHIKILDIKKLEVVPQDVDRSKIHFQLHYLKNLLPTVVVKGIKTVDRVVI----SKDTK 1239

BLAST of Csa1G084260 vs. TrEMBL
Match: W9QSP3_9ROSA (DNA-directed RNA polymerase subunit OS=Morus notabilis GN=L484_007172 PE=3 SV=1)

HSP 1 Score: 1515.0 bits (3921), Expect = 0.0e+00
Identity = 767/1030 (74.47%), Positives = 867/1030 (84.17%), Query Frame = 1

Query: 59   DCELLFLSNRPEKLIITNVLVPPIAIRPSVIMDGSQSNENDITERLKRIIQQNASVSQEL 118
            DCELL+LS RPE LI+TN+ VPPIAIRPSVIMDGSQSNEND+TERLK+IIQ NA++ Q+L
Sbjct: 6    DCELLYLSVRPENLILTNISVPPIAIRPSVIMDGSQSNENDLTERLKQIIQVNATLRQDL 65

Query: 119  STSNSQAKCLESWDMLQSEVAQLINSDVRGIPFSMQVSKPLAGFVQRLKGKQGRFRGNLC 178
            S  ++ AK LE+WD LQ+ VA  INSDVRGIP      K  +GF+QRLKGKQGRFRGNL 
Sbjct: 66   SEGSAAAKFLENWDALQAHVALYINSDVRGIPPPKPGEKQYSGFIQRLKGKQGRFRGNLS 125

Query: 179  GKRVEFTGRTVISPDPNLKITEVAVPIHMARILTYPERVTRHNIEKLRQCVSNGPDKYPG 238
            GKRVE+TGRTVISPDPNLKITEVA+PI MARILTYPERV+ HNIEKLRQCVSNGPDKYPG
Sbjct: 126  GKRVEYTGRTVISPDPNLKITEVAIPIRMARILTYPERVSHHNIEKLRQCVSNGPDKYPG 185

Query: 239  ARMLRHLDGSMRSLMIS------------GRKRLADELKYGEIVERHLEDGDVVLFNRQP 298
            ARMLR  DGS    +I              RKRLADELKYG+IVERHLEDGD+VLFNRQP
Sbjct: 186  ARMLRRADGSTWQGLICFYYLCGHVSIRVSRKRLADELKYGDIVERHLEDGDIVLFNRQP 245

Query: 299  SLHRMSIMCHRVRVMPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAILLMGVQ 358
            SLHRMSIMCHR RVMPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAILLMGVQ
Sbjct: 246  SLHRMSIMCHRARVMPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAILLMGVQ 305

Query: 359  NNLCTPKNGEILVASTQDFLTSSFLITRKDTFYDRAAFSLMCSYMGDGMDLVDLPTPALV 418
            NNLCTPKNGEILVASTQDFLTSSFLITRKDTFYDRAAFSLMCSYMGD M+  DLPTPA++
Sbjct: 306  NNLCTPKNGEILVASTQDFLTSSFLITRKDTFYDRAAFSLMCSYMGDAMEHNDLPTPAVI 365

Query: 419  KPIELWTGKQLFSVLVRPHASMKVYLNLTVKEKSYS-KVKGNEKERETMCPNDGFVYFRN 478
            KP+ELWTGKQLFSVLVRP+A+++V+LNLTV+EK+YS K++ NE+E ETMCP DGFV FRN
Sbjct: 366  KPVELWTGKQLFSVLVRPNANVRVFLNLTVREKNYSNKLQENEREFETMCPADGFVCFRN 425

Query: 479  SELISGQVGKATLGNGNKDGLYSVLLRDYKAHAAAVCMNRLAKLSARWIGNHGFSIGIDD 538
            SELISGQ+GK TLGNGNKDGLYS+LLRDYKAHAAA CMNRLAKLSARWIGNHGFSIGIDD
Sbjct: 426  SELISGQLGKGTLGNGNKDGLYSILLRDYKAHAAAACMNRLAKLSARWIGNHGFSIGIDD 485

Query: 539  VQPGDQLVKKKQTTILEGYRDCDKQINLFNTGNLPPEAGCDAAQSLESKITQILNGIREA 598
            VQP D+L  +K   I EG +DCD +I  +  GNLPPE GCDAAQSLE+ IT ILN IRE 
Sbjct: 486  VQPSDKLRDEKDQKINEGAKDCDDKIKSYKEGNLPPEPGCDAAQSLEAVITGILNKIREE 545

Query: 599  TANVCMQNLHWRNSPLIMSQCGSKGSPINISQMVACVGQQSVGGRRAPDGFIDRSLPHFR 658
            T  +CM+ L WRNSPLIMSQCGSKGS INISQMVACVGQQSVGGRRAP+GFIDRSLPHF 
Sbjct: 546  TGKLCMKALPWRNSPLIMSQCGSKGSLINISQMVACVGQQSVGGRRAPNGFIDRSLPHFP 605

Query: 659  RKAKTPAAKGFVANSFYSGLTATEFFFHTMGGREGLVDTAVKTADTGYMSRRLIKALEDL 718
            RK KTPAAKGFV++SFY GL+ATEFFFHTMGGREGLVDTAVKTADTGYMSRRLIKALEDL
Sbjct: 606  RKDKTPAAKGFVSHSFYDGLSATEFFFHTMGGREGLVDTAVKTADTGYMSRRLIKALEDL 665

Query: 719  SIHYDSSVRNAGGCIVQFCYGDDGMDPAQMEGKSGAPLNFERLFLKAKATCPSDGNKILS 778
             +HYD++VRNA  CI+QFCYGDDGMDPA MEGK+GAPLNFERLFLKAKATCP+ G + LS
Sbjct: 666  FVHYDNTVRNASACIIQFCYGDDGMDPAHMEGKNGAPLNFERLFLKAKATCPAGGTEKLS 725

Query: 779  PSEFSETVEDRLSKDDASPECGCSPAFVGSLKIFLNKYVEAQKKSWGTLLADNESAVDKS 838
            P + SE VE RLSK D +PE GCS AF  SLK FL+ YV+A K++W            +S
Sbjct: 726  PEKVSELVESRLSKKDMTPEGGCSAAFKNSLKSFLDNYVKAFKRTWDPC---------ES 785

Query: 839  IISSSDNDNIVIRNKVVQNIAGVTHRQLQVFLDTCLSRYHTKKIEAGTAIGAIGAQSIGE 898
                +  +N      +V  ++G T RQL+VFLDTC+SRYH+KKIEAGTAIGAIGAQSIGE
Sbjct: 786  TEGHAKMENSATTKNIVLQLSGATARQLEVFLDTCISRYHSKKIEAGTAIGAIGAQSIGE 845

Query: 899  PGTQMTLKTFHFAGVASMNVTLGVPRIKEIINGAKRISTPIVTAALTHDDNVNIARMVKA 958
            PGTQMTLKTFHFAGVASMNVTLGVPRIKEIINGAK+ISTPI+TA L  D+NVN AR+V+ 
Sbjct: 846  PGTQMTLKTFHFAGVASMNVTLGVPRIKEIINGAKKISTPIITAILECDNNVNFARIVRG 905

Query: 959  RIEKTNLGQIAKCIQIVMSSRSALIEIKLDMEKIRDAELYVDANVVKQAILVTPKLKLKH 1018
            RIEK +LGQ+AK I+ VM+SR A + I LDME+I+DA L +DANVV+ +IL TP++KLK 
Sbjct: 906  RIEKISLGQVAKSIKTVMTSRVASVVITLDMERIQDALLSIDANVVRDSILQTPRIKLKK 965

Query: 1019 EHINVLDDRKLRVLPQDADRNKLHFNLHFLKNMLPGVVVKGIKTVGRAVIKEEKDKARN- 1075
            EHI VLD +KL +LP +++R+KL+ +L FL  ML  ++VKGIKTV RAVI  EKDK  + 
Sbjct: 966  EHIRVLDFKKLEILPYESNRSKLYHHLQFLAKMLQNILVKGIKTVERAVISSEKDKKTDI 1025

BLAST of Csa1G084260 vs. TrEMBL
Match: A0A067GCD6_CITSI (DNA-directed RNA polymerase subunit OS=Citrus sinensis GN=CISIN_1g000828mg PE=3 SV=1)

HSP 1 Score: 1509.6 bits (3907), Expect = 0.0e+00
Identity = 784/1092 (71.79%), Positives = 891/1092 (81.59%), Query Frame = 1

Query: 4    GSVKKAVSMLGILHYRARSKD-------AGVVSEDLRAPYNVSNDILNPFRVLCLFQRMS 63
            G VKKAV++LGI+H R++  +       A   +++ +A  NV+  ILNP  VL LF+RM+
Sbjct: 40   GMVKKAVAVLGIIHDRSKVTESLQEFASAITHTKESKAAVNVATYILNPVNVLFLFKRMT 99

Query: 64   DEDCELLFLSNRPEKLIITNVLVPPIAIRPSVIMDGSQSNENDITERLKRIIQQNASVSQ 123
            D DCE+L+LS RPEKLIITN+ VPPIAIRPSVIMDGSQSNENDITERLKRIIQ NAS+ Q
Sbjct: 100  DTDCEVLYLSERPEKLIITNIAVPPIAIRPSVIMDGSQSNENDITERLKRIIQTNASLQQ 159

Query: 124  ELSTSNSQAKCLESWDMLQSEVAQLINSDVRGIPFSMQVSKPLAGFVQRLKGKQGRFRGN 183
            EL  +NS  K L  W+ LQ EVAQ INSDVRG+PFSMQV++PL+GFVQRLKGKQGRFRGN
Sbjct: 160  ELVEANSAFKSLAGWETLQVEVAQYINSDVRGVPFSMQVARPLSGFVQRLKGKQGRFRGN 219

Query: 184  LCGKRVEFTGRTVISPDPNLKITEVAVPIHMARILTYPERVTRHNIEKLRQCVSNGPDKY 243
            L GKRVE+TGRTVISPDPNLKITEVA+PI MARILTYPERV+ HNIEKLRQC+ NGPDKY
Sbjct: 220  LSGKRVEYTGRTVISPDPNLKITEVAIPIRMARILTYPERVSDHNIEKLRQCIQNGPDKY 279

Query: 244  PGARMLRHLDGSMRSLMISGRKRLADELKYGEIVERHLEDGDVVLFNRQPSLHRMSIMCH 303
            PGARM+R+ DG+ R L    R +LA ELK G IV+RHLEDGDVVLFNRQPSLHRMSIMCH
Sbjct: 280  PGARMIRYPDGTARVLYGKFRNQLAVELKSGCIVDRHLEDGDVVLFNRQPSLHRMSIMCH 339

Query: 304  RVRVMPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAILLMGVQNNLCTPKNGE 363
            R R+MPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEA+LLMGVQNNLCTPKNGE
Sbjct: 340  RARIMPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEALLLMGVQNNLCTPKNGE 399

Query: 364  ILVASTQDFLTSSFLITRKDTFYDRAAFSLMCSYMGDGMDLVDLPTPALVKPIELWTGKQ 423
            ILVASTQDFLTSSFLITRKDTFYDRAAFSLMC YMGDGMD VDLPTPA++KP+ELWTGKQ
Sbjct: 400  ILVASTQDFLTSSFLITRKDTFYDRAAFSLMCCYMGDGMDRVDLPTPAILKPVELWTGKQ 459

Query: 424  LFSVLVRPHASMKVYLNLTVKEKSYS----KVKGNEKER-ETMCPNDGFVYFRNSELISG 483
            LFSVL+RPHA+M+VY+NLTVKEK+YS    + +G+E+ R ETMCPNDGFVYFRNSELISG
Sbjct: 460  LFSVLIRPHANMRVYVNLTVKEKTYSNKLIRTEGDEEIRIETMCPNDGFVYFRNSELISG 519

Query: 484  QVGKATLGNGNKDGLYSVLLRDYKAHAAAVCMNRLAKLSARWIGNHGFSIGIDDVQPGDQ 543
            Q+GKATLGNGNKDGLYSVLLRDY AHA + CMNRLAKLSARWIGNHGFSIGIDDVQP  +
Sbjct: 520  QLGKATLGNGNKDGLYSVLLRDYGAHATSGCMNRLAKLSARWIGNHGFSIGIDDVQPKKE 579

Query: 544  LVKKKQTTILEGYRDCDKQINLFNTGNLPPEAGCDAAQSLESKITQILNGIREATANVCM 603
            L  KK   I E Y  C+ +I  +N G L  + GCDAAQ+LE+ IT ILN IRE     CM
Sbjct: 580  LSDKKGKLISENYEVCNVKIKEYNEGKLQLKPGCDAAQTLEAVITDILNRIREDAGKACM 639

Query: 604  QNLHWRNSPLIMSQCGSKGSPINISQMVACVGQQSVGGRRAPDGFIDRSLPHFRRKAKTP 663
             +L WRNSPLIMSQCGSKGSPINISQMVACVGQQSVGGRRAP+GFIDRSLPHF RKAK P
Sbjct: 640  GSLPWRNSPLIMSQCGSKGSPINISQMVACVGQQSVGGRRAPNGFIDRSLPHFPRKAKEP 699

Query: 664  AAKGFVANSFYSGLTATEFFFHTMGGREGLVDTAVKTADTGYMSRRLIKALEDLSIHYDS 723
            AAKGFVANSFYSGL+ATEFFFHTMGGREGLVDTAVKTADTGYMSRRLIKALEDLSI YD+
Sbjct: 700  AAKGFVANSFYSGLSATEFFFHTMGGREGLVDTAVKTADTGYMSRRLIKALEDLSIQYDN 759

Query: 724  SVRNAGGCIVQFCYGDDGMDPAQMEGKSGAPLNFERLFLKAKATCPSDGNKILSPSEFSE 783
            SVRNAGGCIVQF YGDDGMDPA MEGKSG PLNF+RL +K KATCP  G + LSP + SE
Sbjct: 760  SVRNAGGCIVQFLYGDDGMDPANMEGKSGEPLNFDRLLMKVKATCPPAGQRYLSPQQVSE 819

Query: 784  TVEDRLSKDDASPECGCSPAFVGSLKIFLNKYVEAQKKSWGTLLADNESAVDKSIISSSD 843
             VE +L+   A  +  CS AF+ SL+    K+ E Q           +  +DK I    D
Sbjct: 820  IVEKQLA---AYGKESCSEAFLNSLR----KFFEGQ-----------QDKLDKKIKFVED 879

Query: 844  ---NDNIVIRNKVVQNIAGVTHRQLQVFLDTCLSRYHTKKIEAGTAIGAIGAQSIGEPGT 903
               +D   I  +V    +G+T +QL+VF+ TC SRY  K++EAGTAIGAIGAQSIGEPGT
Sbjct: 880  IGWDDKSQILEEVTHKTSGITEKQLEVFIQTCFSRYRVKRVEAGTAIGAIGAQSIGEPGT 939

Query: 904  QMTLKTFHFAGVASMNVTLGVPRIKEIINGAKRISTPIVTAALTHDDNVNIARMVKARIE 963
            QMTLKTFHFAGVASMN+T GVPRIKEIINGAKRISTPI+TA L  +DN N AR+VK RIE
Sbjct: 940  QMTLKTFHFAGVASMNITQGVPRIKEIINGAKRISTPIITAELECNDNENAARVVKGRIE 999

Query: 964  KTNLGQIAKCIQIVMSSRSALIEIKLDMEKIRDAELYVDANVVKQAILVTPKLKLKHEHI 1023
            KT LGQ+AK I+IVM+SR A I I LDME I+DA L ++A++VK++I+ TPK+KLK +HI
Sbjct: 1000 KTLLGQVAKSIKIVMTSRLASIVIALDMETIQDAHLCINADIVKESIVQTPKIKLKQQHI 1059

Query: 1024 NVLDDRKLRVLPQDADRNKLHFNLHFLKNMLPGVVVKGIKTVGRAVIKE-EKDKAR---- 1075
             VLD RKL + P   D++K+HF L+ LKN+LP V+VKGIKTV R VI E EK+K +    
Sbjct: 1060 KVLDFRKLEIFP-PVDKSKIHFELYSLKNVLPMVIVKGIKTVERVVIAEKEKEKRKVKEN 1112

BLAST of Csa1G084260 vs. TrEMBL
Match: V7AFW1_PHAVU (DNA-directed RNA polymerase subunit OS=Phaseolus vulgaris GN=PHAVU_011G070100g PE=3 SV=1)

HSP 1 Score: 1507.3 bits (3901), Expect = 0.0e+00
Identity = 759/1088 (69.76%), Positives = 891/1088 (81.89%), Query Frame = 1

Query: 4    GSVKKAVSMLGILHYRARSKDAGVVSE---------DLRAPYNVSNDILNPFRVLCLFQR 63
            GSVKK  + L I+H  ++ K+  +V E         D +A  NVSN ILNPF+VL LF++
Sbjct: 173  GSVKKLPASLIIMHDCSKCKN-NIVEELESTLSRIKDSKATANVSNRILNPFQVLSLFRK 232

Query: 64   MSDEDCELLFLSNRPEKLIITNVLVPPIAIRPSVIMDGSQSNENDITERLKRIIQQNASV 123
            M DEDCELL+++ RPEKLIITN++VPPIAIRPSV+MD S SNENDITERLK IIQ NA +
Sbjct: 233  MLDEDCELLYVAERPEKLIITNIVVPPIAIRPSVVMDESLSNENDITERLKNIIQANAVL 292

Query: 124  SQELSTSNSQAKCLESWDMLQSEVAQLINSDVRGIPFSMQVSKPLAGFVQRLKGKQGRFR 183
             QEL  S+  +K L+ W++LQ+EVAQ INS+VRGIPF MQ +K LAGFVQRLKGK GRFR
Sbjct: 293  RQELQESSVSSKFLDGWEILQNEVAQFINSEVRGIPFYMQSTKQLAGFVQRLKGKHGRFR 352

Query: 184  GNLCGKRVEFTGRTVISPDPNLKITEVAVPIHMARILTYPERVTRHNIEKLRQCVSNGPD 243
            GNL GKRVE+TGRTVISPDPNLKI+EVA+PI MA ILTYPERVT HNIEKLRQCV NGPD
Sbjct: 353  GNLSGKRVEYTGRTVISPDPNLKISEVAIPILMASILTYPERVTHHNIEKLRQCVRNGPD 412

Query: 244  KYPGARMLRHLDGSMRSLMISGRKRLADELKYGEIVERHLEDGDVVLFNRQPSLHRMSIM 303
            KYPGARMLR   G   SL +  RKR ADEL+ G+IV+RHLEDGD+VLFNRQPSLHRMSIM
Sbjct: 413  KYPGARMLRRDGGHSWSLKVLCRKRAADELRIGDIVDRHLEDGDIVLFNRQPSLHRMSIM 472

Query: 304  CHRVRVMPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAILLMGVQNNLCTPKN 363
             HR R+MPWRTLRFNESVCNPYNADFDGDEMN+HVPQTEEARTEAILLMGVQNNLCTPKN
Sbjct: 473  SHRARIMPWRTLRFNESVCNPYNADFDGDEMNLHVPQTEEARTEAILLMGVQNNLCTPKN 532

Query: 364  GEILVASTQDFLTSSFLITRKDTFYDRAAFSLMCSYMGDGMDLVDLPTPALVKPIELWTG 423
            GEILVASTQDFLTSSFL+TRKDTFYDR+AF+ +C+++GDG+DL+DLPTPA+VKP+ELW+G
Sbjct: 533  GEILVASTQDFLTSSFLVTRKDTFYDRSAFTNICTFIGDGLDLIDLPTPAIVKPVELWSG 592

Query: 424  KQLFSVLVRPHASMKVYLNLTVKEKSYSKVKGNEKERETMCPNDGFVYFRNSELISGQVG 483
            KQLFS+L+RPHA+ KVY+NLTVKEK+Y+K+   ++E +T+CPNDGFVYFRN+ELISGQ+G
Sbjct: 593  KQLFSLLLRPHANFKVYVNLTVKEKTYTKLDDKKRELKTLCPNDGFVYFRNTELISGQIG 652

Query: 484  KATLGNGNKDGLYSVLLRDYKAHAAAVCMNRLAKLSARWIGNHGFSIGIDDVQPGDQLVK 543
            K TLGNGNKDGL+SVLLRDYKAHAAA CMNRLAKLSARWIGNHGFSIGIDDVQP + L+K
Sbjct: 653  KVTLGNGNKDGLFSVLLRDYKAHAAASCMNRLAKLSARWIGNHGFSIGIDDVQPKEILIK 712

Query: 544  KKQTTILEGYRDCDKQINLFNTGNLPPEAGCDAAQSLESKITQILNGIREATANVCMQNL 603
            KK  T+ EGY+ CD  I  FN G L   AGCDA Q+LE++IT +LNG+R+    VCMQ L
Sbjct: 713  KKDETLSEGYKKCDNHIQAFNKGKLELLAGCDAPQTLETQITGVLNGLRDMAGKVCMQTL 772

Query: 604  HWRNSPLIMSQCGSKGSPINISQMVACVGQQSVGGRRAPDGFIDRSLPHFRRKAKTPAAK 663
            HWRNSPLIMSQCGSKGSPINISQMVACVGQQSVGGRRAP+GF+DRSLPHF   AKTPAAK
Sbjct: 773  HWRNSPLIMSQCGSKGSPINISQMVACVGQQSVGGRRAPNGFLDRSLPHFPLNAKTPAAK 832

Query: 664  GFVANSFYSGLTATEFFFHTMGGREGLVDTAVKTADTGYMSRRLIKALEDLSIHYDSSVR 723
            GFVANSFYSGL+ATEFFFHTMGGREGLVDTAVKTADTGYMSR+L+K++EDL +HYD +VR
Sbjct: 833  GFVANSFYSGLSATEFFFHTMGGREGLVDTAVKTADTGYMSRQLMKSMEDLFLHYDYTVR 892

Query: 724  NAGGCIVQFCYGDDGMDPAQMEGKSGAPLNFERLFLKAKATCPS-DGNKILSPSEFSETV 783
            NAGG IVQFCYGDDGMDP  MEGK+G PLNFERLFLK+KA CP+ D +++LS S+  + V
Sbjct: 893  NAGGSIVQFCYGDDGMDPGGMEGKNGKPLNFERLFLKSKAICPNKDDDEVLSSSDVCKVV 952

Query: 784  EDRLSKDDASP-------ECGCSPAFVGSLKIFLNKYVEAQKKSWGTLLADNESAVDKSI 843
            +++LS+   S        E G S  FV SL+ F+                DN    +++ 
Sbjct: 953  QEKLSEFGVSREVEKGVLEVGFSADFVQSLQSFIK---------------DNTKLTEETF 1012

Query: 844  ISSSDNDNIVIRNKVVQNIAGVTHRQLQVFLDTCLSRYHTKKIEAGTAIGAIGAQSIGEP 903
                 +DN  I  K  + I+G+T  QL+VFL+ CLSRYH+KKIEAG  +GA GA SIGEP
Sbjct: 1013 ----TDDNSQILKKFGERISGITRAQLEVFLNICLSRYHSKKIEAGAPVGATGAHSIGEP 1072

Query: 904  GTQMTLKTFHFAGVASMNVTLGVPRIKEIINGAKRISTPIVTAALTHDDNVNIARMVKAR 963
            GTQMTLKTFHFAGVASMNVTLGVPR+KEI+NG K+ISTPI+TA L   D  N AR+VK R
Sbjct: 1073 GTQMTLKTFHFAGVASMNVTLGVPRVKEIMNGNKKISTPIITAILERTDCANTARIVKGR 1132

Query: 964  IEKTNLGQIAKCIQIVMSSRSALIEIKLDMEKIRDAELYVDANVVKQAILVTPKLKLKHE 1023
            IEKTNLGQ+AK I++V++SR A + I LDME+I+DA L +DAN+VK++IL T K KLK E
Sbjct: 1133 IEKTNLGQVAKSIKVVVTSRLASVVITLDMERIQDAHLNIDANIVKESILQTKKAKLKPE 1192

Query: 1024 HINVLDDRKLRVLPQDADRNKLHFNLHFLKNMLPGVVVKGIKTVGRAVIKEEKDKARNAK 1075
            HI +LD +KLRV+PQD DR+KLHF L++LKN+LP VVVKGIKT  R VI +E+DK   A+
Sbjct: 1193 HIKILDVKKLRVVPQDGDRSKLHFQLNYLKNLLPSVVVKGIKTADRVVISKEEDKITKAE 1240

BLAST of Csa1G084260 vs. TAIR10
Match: AT5G60040.2 (AT5G60040.2 nuclear RNA polymerase C1)

HSP 1 Score: 1321.2 bits (3418), Expect = 0.0e+00
Identity = 698/1076 (64.87%), Positives = 820/1076 (76.21%), Query Frame = 1

Query: 4    GSVKKAVSMLGILHYRARSKDAGVVSEDLR----------APYNVSNDILNPFRVLCLFQ 63
            G VKK  +  GI     RSK  G   ++ +          A  N    +L+P  VL LF+
Sbjct: 185  GMVKKIAAQFGIGISHDRSKIHGGEIDECKSAISHTKQSTAAINPLTYVLDPNLVLGLFK 244

Query: 64   RMSDEDCELLFLSNRPEKLIITNVLVPPIAIRPSVIMDGSQSNENDITERLKRIIQQNAS 123
            RMSD+DCELL+++ RPE LIIT +LVPP++IRPSV++ G QSNEND+T RLK+II  NAS
Sbjct: 245  RMSDKDCELLYIAYRPENLIITCMLVPPLSIRPSVMIGGIQSNENDLTARLKQIILGNAS 304

Query: 124  VSQELSTSNSQAKCLESWDMLQSEVAQLINSDVRGIPFSMQVSKPLAGFVQRLKGKQGRF 183
            + + LS   S  K ++ WD +Q EVA+ INS+VRG   +     PL+G +QRLKGK GRF
Sbjct: 305  LHKILSQPTSSPKNMQVWDTVQIEVARYINSEVRGCQ-NQPEEHPLSGILQRLKGKGGRF 364

Query: 184  RGNLCGKRVEFTGRTVISPDPNLKITEVAVPIHMARILTYPERVTRHNIEKLRQCVSNGP 243
            R NL GKRVEFTGRTVISPDPNLKITEV +PI MA+ILT+PE V+RHNIEKLRQCV NGP
Sbjct: 365  RANLSGKRVEFTGRTVISPDPNLKITEVGIPILMAQILTFPECVSRHNIEKLRQCVRNGP 424

Query: 244  DKYPGARMLRHLDGSMRSLMISGRKRLADELKYGEIVERHLEDGDVVLFNRQPSLHRMSI 303
            +KYPGAR +R+ DGS R+L+   RKR+ADEL  G IV+RHL++GDVVLFNRQPSLHRMSI
Sbjct: 425  NKYPGARNVRYPDGSSRTLVGDYRKRIADELAIGCIVDRHLQEGDVVLFNRQPSLHRMSI 484

Query: 304  MCHRVRVMPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAILLMGVQNNLCTPK 363
            MCHR R+MPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAI LMGVQNNLCTPK
Sbjct: 485  MCHRARIMPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAITLMGVQNNLCTPK 544

Query: 364  NGEILVASTQDFLTSSFLITRKDTFYDRAAFSLMCSYMGDGMDLVDLPTPALVKPIELWT 423
            NGEILVASTQDFLTSSFLITRKDTFYDRAAFSL+CSYMGDGMD +DLPTP ++KPIELWT
Sbjct: 545  NGEILVASTQDFLTSSFLITRKDTFYDRAAFSLICSYMGDGMDSIDLPTPTILKPIELWT 604

Query: 424  GKQLFSVLVRPHASMKVYLNLTVKEKSYSKVKGNEKERETMCPNDGFVYFRNSELISGQV 483
            GKQ+FSVL+RP+AS++VY+ L VKEK++   KG     ETMC NDG+VYFRNSELISGQ+
Sbjct: 605  GKQIFSVLLRPNASIRVYVTLNVKEKNFK--KGEHGFDETMCINDGWVYFRNSELISGQL 664

Query: 484  GKAT-------LGNGNKDGLYSVLLRDYKAHAAAVCMNRLAKLSARWIGNHGFSIGIDDV 543
            GKAT       LGNGNKDGLYS+LLRDY +HAAAVCMNRLAKLSARWIG HGFSIGIDDV
Sbjct: 665  GKATLALDIFPLGNGNKDGLYSILLRDYNSHAAAVCMNRLAKLSARWIGIHGFSIGIDDV 724

Query: 544  QPGDQLVKKKQTTILEGYRDCDKQINLFNTGNLPPEAGCDAAQSLESKITQILNGIREAT 603
            QPG++L K+++ +I  GY  C ++I  FN GNL  +AG D A+SLE++IT ILN IREAT
Sbjct: 725  QPGEELSKERKDSIQFGYDQCHRKIEEFNRGNLQLKAGLDGAKSLEAEITGILNTIREAT 784

Query: 604  ANVCMQNLHWRNSPLIMSQCGSKGSPINISQMVACVGQQSVGGRRAPDGFIDRSLPHFRR 663
               CM  LHWRNSPLIMSQCGSKGSPINISQMVACVGQQ+V G RAPDGFIDRSLPHF R
Sbjct: 785  GKACMSGLHWRNSPLIMSQCGSKGSPINISQMVACVGQQTVNGHRAPDGFIDRSLPHFPR 844

Query: 664  KAKTPAAKGFVANSFYSGLTATEFFFHTMGGREGLVDTAVKTADTGYMSRRLIKALEDLS 723
             +K+PAAKGFVANSFYSGLTATEFFFHTMGGREGLVDTAVKTA TGYMSRRL+KALEDL 
Sbjct: 845  MSKSPAAKGFVANSFYSGLTATEFFFHTMGGREGLVDTAVKTASTGYMSRRLMKALEDLL 904

Query: 724  IHYDSSVRNAGGCIVQFCYGDDGMDPAQMEGKSGAPLNFERLFLKAKATC-PSDGNKILS 783
            +HYD++VRNA GCI+QF YGDDGMDPA MEGK GAPLNF RLFLK +ATC P   +  LS
Sbjct: 905  VHYDNTVRNASGCILQFTYGDDGMDPALMEGKDGAPLNFNRLFLKVQATCPPRSHHTYLS 964

Query: 784  PSEFSETVEDRLSKDDASPECGCSPAFVGSLKIFLNKYVEAQKKSWGTLLADNESAVDKS 843
              E S+  E+ L + D S    C+ AFV SL+ F++             L   +SA    
Sbjct: 965  SEELSQKFEEELVRHDKSRV--CTDAFVKSLREFVS-------------LLGVKSASPP- 1024

Query: 844  IISSSDNDNIVIRNKVVQNIAGVTHRQLQVFLDTCLSRYHTKKIEAGTAIGAIGAQSIGE 903
                          +V+   +GVT +QL+VF+  C+ RY  KKIEAGTAIG IGAQSIGE
Sbjct: 1025 --------------QVLYKASGVTDKQLEVFVKICVFRYREKKIEAGTAIGTIGAQSIGE 1084

Query: 904  PGTQMTLKTFHFAGVASMNVTLGVPRIKEIINGAKRISTPIVTAALTHDDNVNIARMVKA 963
            PGTQMTLKTFHFAGVASMN+T GVPRI EIIN +K ISTP+++A L +   +  AR VK 
Sbjct: 1085 PGTQMTLKTFHFAGVASMNITQGVPRINEIINASKNISTPVISAELENPLELTSARWVKG 1144

Query: 964  RIEKTNLGQIAKCIQIVMSSRSALIEIKLDMEKIRDAELYVDANVVKQAILVTPKLKLKH 1023
            RIEKT LGQ+A+ I+++M+S SA + I LD + I +A L +    VK +IL TP++KL  
Sbjct: 1145 RIEKTTLGQVAESIEVLMTSTSASVRIILDNKIIEEACLSITPWSVKNSILKTPRIKLND 1204

Query: 1024 EHINVLDDRKLRVLPQDADRNKLHFNLHFLKNMLPGVVVKGIKTVGRAVIKEEKDK 1062
              I VL D  L + P   D+++ HFNLH LKN+LP ++V GIKTV R V+ E+ DK
Sbjct: 1205 NDIRVL-DTGLDITPV-VDKSRAHFNLHNLKNVLPNIIVNGIKTVERVVVAEDMDK 1225

BLAST of Csa1G084260 vs. TAIR10
Match: AT4G35800.1 (AT4G35800.1 RNA polymerase II large subunit)

HSP 1 Score: 503.4 bits (1295), Expect = 3.4e-142
Identity = 288/700 (41.14%), Positives = 407/700 (58.14%), Query Frame = 1

Query: 47  RVLCLFQRMSDEDCELLFLSN---RPEKLIITNVLVPPIAIRPSVIMDGSQSNENDITER 106
           RVL + +R+SD DC+LL  +    RP+ +I+  + +PP  +RPSV+MD +  +E+D+T +
Sbjct: 217 RVLSVLKRISDADCQLLGFNPKFARPDWMILEVLPIPPPPVRPSVMMDATSRSEDDLTHQ 276

Query: 107 LKRIIQQNASVSQELSTSNSQAKCLESWDMLQSEVAQLINSDVRGIPFSMQVS-KPLAGF 166
           L  II+ N ++ ++           E   +LQ  +A   ++++ G P + Q S +P+   
Sbjct: 277 LAMIIRHNENLKRQEKNGAPAHIISEFTQLLQFHIATYFDNELPGQPRATQKSGRPIKSI 336

Query: 167 VQRLKGKQGRFRGNLCGKRVEFTGRTVISPDPNLKITEVAVPIHMARILTYPERVTRHNI 226
             RLK K+GR RGNL GKRV+F+ RTVI+PDP + I E+ VP  +A  LTYPE VT +NI
Sbjct: 337 CSRLKAKEGRIRGNLMGKRVDFSARTVITPDPTINIDELGVPWSIALNLTYPETVTPYNI 396

Query: 227 EKLRQCVSNGPDKYPGARMLRHL--DGSMRSLMISGRKRLADELKYGEIVERHLEDGDVV 286
           E+L++ V  GP   PG    +++  D   R  +   +K     L+ G  VERHL+DGD V
Sbjct: 397 ERLKELVDYGPHPPPGKTGAKYIIRDDGQRLDLRYLKKSSDQHLELGYKVERHLQDGDFV 456

Query: 287 LFNRQPSLHRMSIMCHRVRVMPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAI 346
           LFNRQPSLH+MSIM HR+R+MP+ T R N SV +PYNADFDGDEMNMHVPQ+ E R E +
Sbjct: 457 LFNRQPSLHKMSIMGHRIRIMPYSTFRLNLSVTSPYNADFDGDEMNMHVPQSFETRAEVL 516

Query: 347 LLMGVQNNLCTPKNGEILVASTQDFLTSSFLITRKDTFYDRAAF--SLMCSYMGDGMDLV 406
            LM V   + +P+    ++   QD L     IT++DTF ++  F  +LM     DG    
Sbjct: 517 ELMMVPKCIVSPQANRPVMGIVQDTLLGCRKITKRDTFIEKDVFMNTLMWWEDFDGK--- 576

Query: 407 DLPTPALVKPIELWTGKQLFSVLVRPHASMKVYLNLTVKEKSYSKVKGNEKERETMCPND 466
            +P PA++KP  LWTGKQ+F++++    ++  Y                + E   + P D
Sbjct: 577 -VPAPAILKPRPLWTGKQVFNLIIPKQINLLRY-----------SAWHADTETGFITPGD 636

Query: 467 GFVYFRNSELISGQVGKATLGNGNKDGLYSVLLRDYKAHAAAVCMNRLAKLSARWIGNHG 526
             V     EL++G + K TLG  N   L  V+  +    AA   +     L   W+  +G
Sbjct: 637 TQVRIERGELLAGTLCKKTLGTSN-GSLVHVIWEEVGPDAARKFLGHTQWLVNYWLLQNG 696

Query: 527 FSIGIDDVQPGDQLVKKKQTTILEGYRDCDKQINLFNTGNLPPEAGCDAAQSLESKITQI 586
           F+IGI D       ++K   TI          I  F    L PE G     + E+++ Q+
Sbjct: 697 FTIGIGDTIADSSTMEKINETISNAKTAVKDLIRQFQGKELDPEPGRTMRDTFENRVNQV 756

Query: 587 LNGIREATANVCMQNLHWRNSPLIMSQCGSKGSPINISQMVACVGQQSVGGRRAPDGFID 646
           LN  R+   +   ++L   N+   M   GSKGS INISQM ACVGQQ+V G+R P GF  
Sbjct: 757 LNKARDDAGSSAQKSLAETNNLKAMVTAGSKGSFINISQMTACVGQQNVEGKRIPFGFDG 816

Query: 647 RSLPHFRRKAKTPAAKGFVANSFYSGLTATEFFFHTMGGREGLVDTAVKTADTGYMSRRL 706
           R+LPHF +    P ++GFV NS+  GLT  EFFFH MGGREGL+DTAVKT++TGY+ RRL
Sbjct: 817 RTLPHFTKDDYGPESRGFVENSYLRGLTPQEFFFHAMGGREGLIDTAVKTSETGYIQRRL 876

Query: 707 IKALEDLSIHYDSSVRNAGGCIVQFCYGDDGMDPAQMEGK 739
           +KA+ED+ + YD +VRN+ G ++QF YG+DGMD   +E +
Sbjct: 877 VKAMEDIMVKYDGTVRNSLGDVIQFLYGEDGMDAVWIESQ 900


HSP 2 Score: 96.3 bits (238), Expect = 1.2e-19
Identity = 66/196 (33.67%), Positives = 98/196 (50.00%), Query Frame = 1

Query: 862  SRYHTKKIEAGTAIGAIGAQSIGEPGTQMTLKTFHFAGVASMNVTLGVPRIKEIINGAKR 921
            SR+    +  G  IG + AQSIGEP TQMTL TFH+AGV++ NVTLGVPR++EIIN AKR
Sbjct: 1071 SRFLQSLVAPGEMIGCVAAQSIGEPATQMTLNTFHYAGVSAKNVTLGVPRLREIINVAKR 1130

Query: 922  ISTPIVTAALTHDDNVNI--ARMVKARIEKTNLGQIAKCIQIVMSSRSALIEIKLDMEKI 981
            I TP ++  LT + + +   A+ V+  +E T L  + +  ++          I+ D E +
Sbjct: 1131 IKTPSLSVYLTPEASKSKEGAKTVQCALEYTTLRSVTQATEVWYDPDPMSTIIEEDFEFV 1190

Query: 982  RDAELYVDANV-------------VKQAILVTPKLKLKH--EHINVLDDRKLRVLPQDAD 1041
            R      D +V             + + ++V  KL +    E IN+  D  L  +  D +
Sbjct: 1191 RSYYEMPDEDVSPDKISPWLLRIELNREMMVDKKLSMADIAEKINLEFDDDLTCIFNDDN 1250

BLAST of Csa1G084260 vs. TAIR10
Match: AT3G57660.1 (AT3G57660.1 nuclear RNA polymerase A1)

HSP 1 Score: 407.5 bits (1046), Expect = 2.5e-113
Identity = 302/966 (31.26%), Positives = 453/966 (46.89%), Query Frame = 1

Query: 74   ITNVLVPPIAIRPSVIMDGSQSNENDITERLKRIIQQNASVSQELSTSNSQAKCLESWDM 133
            + +VLVPP   RP     G    E+  T  L ++I+ N  +    +    Q+K +  W  
Sbjct: 343  LESVLVPPTKFRPPTT-GGDSVMEHPQTVGLNKVIESNNILGNACTNKLDQSKVIFRWRN 402

Query: 134  LQSEVAQLINSDVRGIPFSMQVSKPLAGFVQRLKGKQGRFRGNLCGKRVEFTGRTVISPD 193
            LQ  V  L +S       ++Q  +  +G  Q L+ K+G FR  + GKRV    R+VISPD
Sbjct: 403  LQESVNVLFDSKTA----TVQSQRDSSGICQLLEKKEGLFRQKMMGKRVNHACRSVISPD 462

Query: 194  PNLKITEVAVPIHMARILTYPERVTRHNIEKLRQCVSNGPDKYPGARMLRHLDGSMRSLM 253
            P + + ++ +P   A  LTYPERVT  N+EKLR+ + NGPD +PGA        +M+   
Sbjct: 463  PYIAVNDIGIPPCFALKLTYPERVTPWNVEKLREAIINGPDIHPGATHYSDKSSTMKLPS 522

Query: 254  IS-GRKRLADELKY-----------------GEIVERHLEDGDVVLFNRQPSLHRMSIMC 313
                R+ +A +L                   G+ V RH+ DGD+VL NRQP+LH+ S+M 
Sbjct: 523  TEKARRAIARKLLSSRGATTELGKTCDINFEGKTVHRHMRDGDIVLVNRQPTLHKPSLMA 582

Query: 314  HRVRVMPW-RTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAILLMGVQNNLCTPKN 373
            H+VRV+   +TLR + + C+ YNADFDGDEMN+H PQ E +R EA  ++   N    P N
Sbjct: 583  HKVRVLKGEKTLRLHYANCSTYNADFDGDEMNVHFPQDEISRAEAYNIVNANNQYARPSN 642

Query: 374  GEILVASTQDFLTSSFLITRKDTFYDRAAFS-----------LMCSYMGDGMDLVDLP-- 433
            GE L A  QD + SS L+T++DTF D+  F+           ++ ++ G     V +   
Sbjct: 643  GEPLRALIQDHIVSSVLLTKRDTFLDKDHFNQLLFSSGVTDMVLSTFSGRSGKKVMVSAS 702

Query: 434  -------TPALVKPIELWTGKQLFSVLV------RPHASMKVYLNL----------TVKE 493
                   TPA++KP+ LWTGKQ+ + ++       P  +++    L           VK 
Sbjct: 703  DAELLTVTPAILKPVPLWTGKQVITAVLNQITKGHPPFTVEKATKLPVDFFKCRSREVKP 762

Query: 494  KSYSKVKGNE-KERETMCPNDGFVYFRNSELISGQVGKATLGNGNKDGLYSVLLRDYKAH 553
             S    K  E  E      N+  ++ R +E + G + KA   +    GL   +   Y ++
Sbjct: 763  NSGDLTKKKEIDESWKQNLNEDKLHIRKNEFVCGVIDKAQFAD---YGLVHTVHELYGSN 822

Query: 554  AAAVCMNRLAKLSARWIGNHGFSIGIDDV----QPGDQLVKKKQTTILEGYRDCDKQINL 613
            AA   ++  ++L   ++  HGF+ G+DD+       ++  K+ Q     G R   K   +
Sbjct: 823  AAGNLLSVFSRLFTVFLQTHGFTCGVDDLIILKDMDEERTKQLQECENVGERVLRKTFGI 882

Query: 614  FNTGNLPP------------EAGCDAAQSLESKITQILNGIREATANVCMQNLHWRNSPL 673
                 + P            E G  A  SL+  I   LN          + +     +P 
Sbjct: 883  DVDVQIDPQDMRSRIERILYEDGESALASLDRSIVNYLNQCSSKGVMNDLLSDGLLKTPG 942

Query: 674  -----IMSQCGSKGSPINISQMVACVGQQSVGGRRAPDGFIDRSLPHFRRKAKTPAAKGF 733
                 +M+  G+KGS +N  Q+ + +GQQ + G+R P     ++LP F     +P A GF
Sbjct: 943  RNCISLMTISGAKGSKVNFQQISSHLGQQDLEGKRVPRMVSGKTLPCFHPWDWSPRAGGF 1002

Query: 734  VANSFYSGLTATEFFFHTMGGREGLVDTAVKTADTGYMSRRLIKALEDLSIHYDSSVRNA 793
            +++ F SGL   E++FH M GREGLVDTAVKT+ +GY+ R L+K LE L ++YD +VR+A
Sbjct: 1003 ISDRFLSGLRPQEYYFHCMAGREGLVDTAVKTSRSGYLQRCLMKNLESLKVNYDCTVRDA 1062

Query: 794  GGCIVQFCYGDDGMDPAQMEGKSGAPLNFERLFLKAKATCPSDGNKILSPSEFSETVEDR 853
             G I+QF YG+DG+D      +S     F+ L +                    + V  +
Sbjct: 1063 DGSIIQFQYGEDGVD----VHRSSFIEKFKELTIN------------------QDMVLQK 1122

Query: 854  LSKDDASPECGCSPAFVGSLKIFLNKYVEAQKKSWGTLLADNESAVDKSIISSSDNDNIV 913
             S+D  S           SLK    K+VEA           NE    K +          
Sbjct: 1123 CSEDMLSGASSYISDLPISLKKGAEKFVEAMPM--------NERIASKFV---------- 1182

Query: 914  IRNKVVQNIAGVTHRQLQVFLDTCLSRYHTKKIEAGTAIGAIGAQSIGEPGTQMTLKTFH 962
                           + +  L    S++     + G  +G + AQS+GEP TQMTL TFH
Sbjct: 1183 ---------------RQEELLKLVKSKFFASLAQPGEPVGVLAAQSVGEPSTQMTLNTFH 1242

BLAST of Csa1G084260 vs. TAIR10
Match: AT1G63020.1 (AT1G63020.1 nuclear RNA polymerase D1A)

HSP 1 Score: 164.9 bits (416), Expect = 2.9e-40
Identity = 172/602 (28.57%), Positives = 261/602 (43.36%), Query Frame = 1

Query: 177 LCGKRVEFTGRTVISPDPNLKITEVAVPIHMARILTYPERVTRHNIEKLRQCVSNGPDKY 236
           L GKR + T RTV+  DP+LK+ E+ +P  +A+ L   E + + N E+L    S  P   
Sbjct: 313 LLGKRSDHTFRTVVVGDPSLKLNEIGIPESIAKRLQVSEHLNQCNKERL--VTSFVPTLL 372

Query: 237 PGARMLRHLDGSMRSLMISGRKRLADELKYGEIVERHLEDGDVVLFNRQPSLHRMSIMCH 296
               M  H+    R + I       ++L+ G+ + R L DGD VL NR PS+H+ S++  
Sbjct: 373 DNKEM--HVRRGDRLVAIQ-----VNDLQTGDKIFRSLMDGDTVLMNRPPSIHQHSLIAM 432

Query: 297 RVRVMPWRTL-RFNESVCNPYNADFDGDEMNMHVPQTEEARTEAILLMGVQNNLCTPKNG 356
            VR++P  ++   N   C P+  DFDGD ++ +VPQ+ +A+ E   L+ +   L   +NG
Sbjct: 433 TVRILPTTSVVSLNPICCLPFRGDFDGDCLHGYVPQSIQAKVELDELVALDKQLINRQNG 492

Query: 357 EILVASTQDFLTSSFLI-TRKDTFYDRAAFSLMCSYMGDGMDLVDLPTPALVKPI----- 416
             L++  QD LT+++L+   K+ + +RA    +  Y         LP PA++K       
Sbjct: 493 RNLLSLGQDSLTAAYLVNVEKNCYLNRAQMQQLQMYCP-----FQLPPPAIIKASPSSTE 552

Query: 417 ELWTGKQLFSVLVRPHASMKVYLNLTVKEKSYSKVKGNEKERETMCPNDGFVYFRNSELI 476
             WTG QLF +L  P       LN  V                            N EL+
Sbjct: 553 PQWTGMQLFGMLFPPGFDYTYPLNNVV--------------------------VSNGELL 612

Query: 477 SGQVGKATLGNGNKDGLYSVLLRDYKAHAAAVCMNRLAKLSARWIGNHGFSIGIDDVQPG 536
           S   G A L +G  + +  +L  D K     +  +    LS +W+   G S+ + D+   
Sbjct: 613 SFSEGSAWLRDGEGNFIERLLKHD-KGKVLDIIYSAQEMLS-QWLLMRGLSVSLADLYLS 672

Query: 537 DQLVKKKQTT--ILEGYRDCDKQIN------------LFNTGNLPPEAG-------CDAA 596
             L  +K  T  I  G R+ ++  N            L   G    E         C   
Sbjct: 673 SDLQSRKNLTEEISYGLREAEQVCNKQQLMVESWRDFLAVNGEDKEEDSVSDLARFCYER 732

Query: 597 QSLESKITQILNGIREATANVCMQNLHWR-----NSPLIMSQCGSKGSPINISQMVACVG 656
           Q   +     ++  ++A  +V  Q L +R     NS LIMS+ GSKG+   + Q   C+G
Sbjct: 733 QKSATLSELAVSAFKDAYRDV--QALAYRYGDQSNSFLIMSKAGSKGNIGKLVQHSMCIG 792

Query: 657 QQ------SVGGRR--APDGFIDRSLPHFRRKAKTPAAK------GFVANSFYSGLTATE 716
            Q      S G  R      + D + P    K K           G + NSF +GL   E
Sbjct: 793 LQNSAVSLSFGFPRELTCAAWNDPNSPLRGAKGKDSTTTESYVPYGVIENSFLTGLNPLE 852

Query: 717 FFFHTMGGREGLVDTAVKTAD-TGYMSRRLIKALEDLSIHYDSSVRNA-GGCIVQFCYGD 730
            F H++  R+    +    AD  G +SRRL+  + D+   YD +VRN+ G  +VQF Y  
Sbjct: 853 SFVHSVTSRD---SSFSGNADLPGTLSRRLMFFMRDIYAAYDGTVRNSFGNQLVQFTYET 867

BLAST of Csa1G084260 vs. TAIR10
Match: AT2G40030.1 (AT2G40030.1 nuclear RNA polymerase D1B)

HSP 1 Score: 141.7 bits (356), Expect = 2.6e-33
Identity = 170/727 (23.38%), Positives = 294/727 (40.44%), Query Frame = 1

Query: 33  RAPYNVSNDILNPF---RVLCLFQRMSDEDCELLFLSNR--PEKLIITNVLVPPIAIRPS 92
           R  Y   +D   P     V  + +R+ +E  + L        E  I+  + VPP  +   
Sbjct: 162 RYGYRYGSDYTRPLLAREVKEILRRIPEESRKKLTAKGHIPQEGYILEYLPVPPNCLSVP 221

Query: 93  VIMDGSQSNEND-----ITERLKRIIQQNASVSQELSTSNSQAKCLESWDMLQSEV-AQL 152
              DG  +   D     + + LK++I   +S S E +  + +A+  E + ++ + +  + 
Sbjct: 222 EASDGFSTMSVDPSRIELKDVLKKVIAIKSSRSGETNFESHKAEASEMFRVVDTYLQVRG 281

Query: 153 INSDVRGIPFSMQVSKPLAGFVQRLKGKQGRFRGNLCGKRVEFTGRTVISPDPNLKITEV 212
                R I     VSK         K    + R     K   F+ R+VI+ D    + EV
Sbjct: 282 TAKAARNIDMRYGVSK--ISDSSSSKAWTEKMRTLFIRKGSGFSSRSVITGDAYRHVNEV 341

Query: 213 AVPIHMARILTYPERVTRHNIEKLRQCVSNGPDKYPGARMLRHLDGSMRSLMISGRKRLA 272
            +PI +A+ +T+ ERV+ HN   L++ V    DK      L +  GS    +  G K   
Sbjct: 342 GIPIEIAQRITFEERVSVHNRGYLQKLVD---DKL----CLSYTQGSTTYSLRDGSKGHT 401

Query: 273 DELKYGEIVERHLEDGDVVLFNRQPSLHRMSIMCHRVRVMPWRTLRFNESVCNPYNADFD 332
            ELK G++V R + DGDVV  NR P+ H+ S+   RV V    T++ N  +C+P +ADFD
Sbjct: 402 -ELKPGQVVHRRVMDGDVVFINRPPTTHKHSLQALRVYVHEDNTVKINPLMCSPLSADFD 461

Query: 333 GDEMNMHVPQTEEARTEAILLMGVQNNLCTPKNGEILVASTQDFLTSSFLITRKDTFYDR 392
           GD +++  PQ+  A+ E + L  V+  L +   G++++    D L S  ++  +  F D+
Sbjct: 462 GDCVHLFYPQSLSAKAEVMELFSVEKQLLSSHTGQLILQMGSDSLLSLRVMLER-VFLDK 521

Query: 393 AAFSLMCSYMGDGMDLVDLPTPALVKPIELWTGKQLFSVLVRPHASMKVYLNLTVKEKSY 452
           A    +  Y       + LP PAL K  +      +F +           L L   E+  
Sbjct: 522 ATAQQLAMY-----GSLSLPPPALRKSSKSGPAWTVFQI-----------LQLAFPER-- 581

Query: 453 SKVKGNEKERETMCPNDGFVYFRNSELISGQVGKATLGNGNKDGLYSVLLRDYKAHAAAV 512
                        C  D F+    S+L+    G   +G+   + + S+ L          
Sbjct: 582 -----------LSCKGDRFL-VDGSDLLKFDFGVDAMGSIINEIVTSIFLEKGPKETLGF 641

Query: 513 CMNRLAKLSARWIGNHGFSIGIDDVQPGDQLVKKKQTTILEGYRDCDKQINLFNTGNLPP 572
             + L  L    +   GFS+ ++D+                   D D   NL      P 
Sbjct: 642 -FDSLQPLLMESLFAEGFSLSLEDLS--------------MSRADMDVIHNLIIREISPM 701

Query: 573 EAGCDAAQSLESKITQILNGIREATANVCMQNLHWRNSPLIMSQCGSKGSPINISQMVAC 632
            +    +   E ++   ++ ++E  AN  +++   RN    +    S  +   + Q    
Sbjct: 702 VSRLRLSYRDELQLENSIHKVKEVAANFMLKSYSIRN----LIDIKSNSAITKLVQQTGF 761

Query: 633 VGQQSVGGRRAPDGFIDRSLPHFRR----KAKTPAAKGFVANSFYSGLTATEFFFHTMGG 692
           +G Q    ++     +   +  F +    +  +    G V   F+ GL   E   H++  
Sbjct: 762 LGLQLSDKKKFYTKTLVEDMAIFCKRKYGRISSSGDFGIVKGCFFHGLDPYEEMAHSIAA 821

Query: 693 REGLVDTAVKTADTGYMSRRLIKALEDLSIHYDSSVRN-AGGCIVQFCYGDDGMDPAQME 744
           RE +V ++   A+ G + + L+  L D+ I  D +VRN     ++QF YG D     Q  
Sbjct: 822 REVIVRSSRGLAEPGTLFKNLMAVLRDIVITNDGTVRNTCSNSVIQFKYGVDSERGHQGL 828

BLAST of Csa1G084260 vs. NCBI nr
Match: gi|700209631|gb|KGN64727.1| (hypothetical protein Csa_1G084260 [Cucumis sativus])

HSP 1 Score: 2153.3 bits (5578), Expect = 0.0e+00
Identity = 1084/1084 (100.00%), Positives = 1084/1084 (100.00%), Query Frame = 1

Query: 1    MSAGSVKKAVSMLGILHYRARSKDAGVVSEDLRAPYNVSNDILNPFRVLCLFQRMSDEDC 60
            MSAGSVKKAVSMLGILHYRARSKDAGVVSEDLRAPYNVSNDILNPFRVLCLFQRMSDEDC
Sbjct: 1    MSAGSVKKAVSMLGILHYRARSKDAGVVSEDLRAPYNVSNDILNPFRVLCLFQRMSDEDC 60

Query: 61   ELLFLSNRPEKLIITNVLVPPIAIRPSVIMDGSQSNENDITERLKRIIQQNASVSQELST 120
            ELLFLSNRPEKLIITNVLVPPIAIRPSVIMDGSQSNENDITERLKRIIQQNASVSQELST
Sbjct: 61   ELLFLSNRPEKLIITNVLVPPIAIRPSVIMDGSQSNENDITERLKRIIQQNASVSQELST 120

Query: 121  SNSQAKCLESWDMLQSEVAQLINSDVRGIPFSMQVSKPLAGFVQRLKGKQGRFRGNLCGK 180
            SNSQAKCLESWDMLQSEVAQLINSDVRGIPFSMQVSKPLAGFVQRLKGKQGRFRGNLCGK
Sbjct: 121  SNSQAKCLESWDMLQSEVAQLINSDVRGIPFSMQVSKPLAGFVQRLKGKQGRFRGNLCGK 180

Query: 181  RVEFTGRTVISPDPNLKITEVAVPIHMARILTYPERVTRHNIEKLRQCVSNGPDKYPGAR 240
            RVEFTGRTVISPDPNLKITEVAVPIHMARILTYPERVTRHNIEKLRQCVSNGPDKYPGAR
Sbjct: 181  RVEFTGRTVISPDPNLKITEVAVPIHMARILTYPERVTRHNIEKLRQCVSNGPDKYPGAR 240

Query: 241  MLRHLDGSMRSLMISGRKRLADELKYGEIVERHLEDGDVVLFNRQPSLHRMSIMCHRVRV 300
            MLRHLDGSMRSLMISGRKRLADELKYGEIVERHLEDGDVVLFNRQPSLHRMSIMCHRVRV
Sbjct: 241  MLRHLDGSMRSLMISGRKRLADELKYGEIVERHLEDGDVVLFNRQPSLHRMSIMCHRVRV 300

Query: 301  MPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAILLMGVQNNLCTPKNGEILVA 360
            MPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAILLMGVQNNLCTPKNGEILVA
Sbjct: 301  MPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAILLMGVQNNLCTPKNGEILVA 360

Query: 361  STQDFLTSSFLITRKDTFYDRAAFSLMCSYMGDGMDLVDLPTPALVKPIELWTGKQLFSV 420
            STQDFLTSSFLITRKDTFYDRAAFSLMCSYMGDGMDLVDLPTPALVKPIELWTGKQLFSV
Sbjct: 361  STQDFLTSSFLITRKDTFYDRAAFSLMCSYMGDGMDLVDLPTPALVKPIELWTGKQLFSV 420

Query: 421  LVRPHASMKVYLNLTVKEKSYSKVKGNEKERETMCPNDGFVYFRNSELISGQVGKATLGN 480
            LVRPHASMKVYLNLTVKEKSYSKVKGNEKERETMCPNDGFVYFRNSELISGQVGKATLGN
Sbjct: 421  LVRPHASMKVYLNLTVKEKSYSKVKGNEKERETMCPNDGFVYFRNSELISGQVGKATLGN 480

Query: 481  GNKDGLYSVLLRDYKAHAAAVCMNRLAKLSARWIGNHGFSIGIDDVQPGDQLVKKKQTTI 540
            GNKDGLYSVLLRDYKAHAAAVCMNRLAKLSARWIGNHGFSIGIDDVQPGDQLVKKKQTTI
Sbjct: 481  GNKDGLYSVLLRDYKAHAAAVCMNRLAKLSARWIGNHGFSIGIDDVQPGDQLVKKKQTTI 540

Query: 541  LEGYRDCDKQINLFNTGNLPPEAGCDAAQSLESKITQILNGIREATANVCMQNLHWRNSP 600
            LEGYRDCDKQINLFNTGNLPPEAGCDAAQSLESKITQILNGIREATANVCMQNLHWRNSP
Sbjct: 541  LEGYRDCDKQINLFNTGNLPPEAGCDAAQSLESKITQILNGIREATANVCMQNLHWRNSP 600

Query: 601  LIMSQCGSKGSPINISQMVACVGQQSVGGRRAPDGFIDRSLPHFRRKAKTPAAKGFVANS 660
            LIMSQCGSKGSPINISQMVACVGQQSVGGRRAPDGFIDRSLPHFRRKAKTPAAKGFVANS
Sbjct: 601  LIMSQCGSKGSPINISQMVACVGQQSVGGRRAPDGFIDRSLPHFRRKAKTPAAKGFVANS 660

Query: 661  FYSGLTATEFFFHTMGGREGLVDTAVKTADTGYMSRRLIKALEDLSIHYDSSVRNAGGCI 720
            FYSGLTATEFFFHTMGGREGLVDTAVKTADTGYMSRRLIKALEDLSIHYDSSVRNAGGCI
Sbjct: 661  FYSGLTATEFFFHTMGGREGLVDTAVKTADTGYMSRRLIKALEDLSIHYDSSVRNAGGCI 720

Query: 721  VQFCYGDDGMDPAQMEGKSGAPLNFERLFLKAKATCPSDGNKILSPSEFSETVEDRLSKD 780
            VQFCYGDDGMDPAQMEGKSGAPLNFERLFLKAKATCPSDGNKILSPSEFSETVEDRLSKD
Sbjct: 721  VQFCYGDDGMDPAQMEGKSGAPLNFERLFLKAKATCPSDGNKILSPSEFSETVEDRLSKD 780

Query: 781  DASPECGCSPAFVGSLKIFLNKYVEAQKKSWGTLLADNESAVDKSIISSSDNDNIVIRNK 840
            DASPECGCSPAFVGSLKIFLNKYVEAQKKSWGTLLADNESAVDKSIISSSDNDNIVIRNK
Sbjct: 781  DASPECGCSPAFVGSLKIFLNKYVEAQKKSWGTLLADNESAVDKSIISSSDNDNIVIRNK 840

Query: 841  VVQNIAGVTHRQLQVFLDTCLSRYHTKKIEAGTAIGAIGAQSIGEPGTQMTLKTFHFAGV 900
            VVQNIAGVTHRQLQVFLDTCLSRYHTKKIEAGTAIGAIGAQSIGEPGTQMTLKTFHFAGV
Sbjct: 841  VVQNIAGVTHRQLQVFLDTCLSRYHTKKIEAGTAIGAIGAQSIGEPGTQMTLKTFHFAGV 900

Query: 901  ASMNVTLGVPRIKEIINGAKRISTPIVTAALTHDDNVNIARMVKARIEKTNLGQIAKCIQ 960
            ASMNVTLGVPRIKEIINGAKRISTPIVTAALTHDDNVNIARMVKARIEKTNLGQIAKCIQ
Sbjct: 901  ASMNVTLGVPRIKEIINGAKRISTPIVTAALTHDDNVNIARMVKARIEKTNLGQIAKCIQ 960

Query: 961  IVMSSRSALIEIKLDMEKIRDAELYVDANVVKQAILVTPKLKLKHEHINVLDDRKLRVLP 1020
            IVMSSRSALIEIKLDMEKIRDAELYVDANVVKQAILVTPKLKLKHEHINVLDDRKLRVLP
Sbjct: 961  IVMSSRSALIEIKLDMEKIRDAELYVDANVVKQAILVTPKLKLKHEHINVLDDRKLRVLP 1020

Query: 1021 QDADRNKLHFNLHFLKNMLPGVVVKGIKTVGRAVIKEEKDKARNAKKFSLLVEGLVYFFF 1080
            QDADRNKLHFNLHFLKNMLPGVVVKGIKTVGRAVIKEEKDKARNAKKFSLLVEGLVYFFF
Sbjct: 1021 QDADRNKLHFNLHFLKNMLPGVVVKGIKTVGRAVIKEEKDKARNAKKFSLLVEGLVYFFF 1080

Query: 1081 LLQI 1085
            LLQI
Sbjct: 1081 LLQI 1084

BLAST of Csa1G084260 vs. NCBI nr
Match: gi|449462093|ref|XP_004148776.1| (PREDICTED: DNA-directed RNA polymerase III subunit rpc1 [Cucumis sativus])

HSP 1 Score: 2129.0 bits (5515), Expect = 0.0e+00
Identity = 1071/1071 (100.00%), Positives = 1071/1071 (100.00%), Query Frame = 1

Query: 4    GSVKKAVSMLGILHYRARSKDAGVVSEDLRAPYNVSNDILNPFRVLCLFQRMSDEDCELL 63
            GSVKKAVSMLGILHYRARSKDAGVVSEDLRAPYNVSNDILNPFRVLCLFQRMSDEDCELL
Sbjct: 178  GSVKKAVSMLGILHYRARSKDAGVVSEDLRAPYNVSNDILNPFRVLCLFQRMSDEDCELL 237

Query: 64   FLSNRPEKLIITNVLVPPIAIRPSVIMDGSQSNENDITERLKRIIQQNASVSQELSTSNS 123
            FLSNRPEKLIITNVLVPPIAIRPSVIMDGSQSNENDITERLKRIIQQNASVSQELSTSNS
Sbjct: 238  FLSNRPEKLIITNVLVPPIAIRPSVIMDGSQSNENDITERLKRIIQQNASVSQELSTSNS 297

Query: 124  QAKCLESWDMLQSEVAQLINSDVRGIPFSMQVSKPLAGFVQRLKGKQGRFRGNLCGKRVE 183
            QAKCLESWDMLQSEVAQLINSDVRGIPFSMQVSKPLAGFVQRLKGKQGRFRGNLCGKRVE
Sbjct: 298  QAKCLESWDMLQSEVAQLINSDVRGIPFSMQVSKPLAGFVQRLKGKQGRFRGNLCGKRVE 357

Query: 184  FTGRTVISPDPNLKITEVAVPIHMARILTYPERVTRHNIEKLRQCVSNGPDKYPGARMLR 243
            FTGRTVISPDPNLKITEVAVPIHMARILTYPERVTRHNIEKLRQCVSNGPDKYPGARMLR
Sbjct: 358  FTGRTVISPDPNLKITEVAVPIHMARILTYPERVTRHNIEKLRQCVSNGPDKYPGARMLR 417

Query: 244  HLDGSMRSLMISGRKRLADELKYGEIVERHLEDGDVVLFNRQPSLHRMSIMCHRVRVMPW 303
            HLDGSMRSLMISGRKRLADELKYGEIVERHLEDGDVVLFNRQPSLHRMSIMCHRVRVMPW
Sbjct: 418  HLDGSMRSLMISGRKRLADELKYGEIVERHLEDGDVVLFNRQPSLHRMSIMCHRVRVMPW 477

Query: 304  RTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAILLMGVQNNLCTPKNGEILVASTQ 363
            RTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAILLMGVQNNLCTPKNGEILVASTQ
Sbjct: 478  RTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAILLMGVQNNLCTPKNGEILVASTQ 537

Query: 364  DFLTSSFLITRKDTFYDRAAFSLMCSYMGDGMDLVDLPTPALVKPIELWTGKQLFSVLVR 423
            DFLTSSFLITRKDTFYDRAAFSLMCSYMGDGMDLVDLPTPALVKPIELWTGKQLFSVLVR
Sbjct: 538  DFLTSSFLITRKDTFYDRAAFSLMCSYMGDGMDLVDLPTPALVKPIELWTGKQLFSVLVR 597

Query: 424  PHASMKVYLNLTVKEKSYSKVKGNEKERETMCPNDGFVYFRNSELISGQVGKATLGNGNK 483
            PHASMKVYLNLTVKEKSYSKVKGNEKERETMCPNDGFVYFRNSELISGQVGKATLGNGNK
Sbjct: 598  PHASMKVYLNLTVKEKSYSKVKGNEKERETMCPNDGFVYFRNSELISGQVGKATLGNGNK 657

Query: 484  DGLYSVLLRDYKAHAAAVCMNRLAKLSARWIGNHGFSIGIDDVQPGDQLVKKKQTTILEG 543
            DGLYSVLLRDYKAHAAAVCMNRLAKLSARWIGNHGFSIGIDDVQPGDQLVKKKQTTILEG
Sbjct: 658  DGLYSVLLRDYKAHAAAVCMNRLAKLSARWIGNHGFSIGIDDVQPGDQLVKKKQTTILEG 717

Query: 544  YRDCDKQINLFNTGNLPPEAGCDAAQSLESKITQILNGIREATANVCMQNLHWRNSPLIM 603
            YRDCDKQINLFNTGNLPPEAGCDAAQSLESKITQILNGIREATANVCMQNLHWRNSPLIM
Sbjct: 718  YRDCDKQINLFNTGNLPPEAGCDAAQSLESKITQILNGIREATANVCMQNLHWRNSPLIM 777

Query: 604  SQCGSKGSPINISQMVACVGQQSVGGRRAPDGFIDRSLPHFRRKAKTPAAKGFVANSFYS 663
            SQCGSKGSPINISQMVACVGQQSVGGRRAPDGFIDRSLPHFRRKAKTPAAKGFVANSFYS
Sbjct: 778  SQCGSKGSPINISQMVACVGQQSVGGRRAPDGFIDRSLPHFRRKAKTPAAKGFVANSFYS 837

Query: 664  GLTATEFFFHTMGGREGLVDTAVKTADTGYMSRRLIKALEDLSIHYDSSVRNAGGCIVQF 723
            GLTATEFFFHTMGGREGLVDTAVKTADTGYMSRRLIKALEDLSIHYDSSVRNAGGCIVQF
Sbjct: 838  GLTATEFFFHTMGGREGLVDTAVKTADTGYMSRRLIKALEDLSIHYDSSVRNAGGCIVQF 897

Query: 724  CYGDDGMDPAQMEGKSGAPLNFERLFLKAKATCPSDGNKILSPSEFSETVEDRLSKDDAS 783
            CYGDDGMDPAQMEGKSGAPLNFERLFLKAKATCPSDGNKILSPSEFSETVEDRLSKDDAS
Sbjct: 898  CYGDDGMDPAQMEGKSGAPLNFERLFLKAKATCPSDGNKILSPSEFSETVEDRLSKDDAS 957

Query: 784  PECGCSPAFVGSLKIFLNKYVEAQKKSWGTLLADNESAVDKSIISSSDNDNIVIRNKVVQ 843
            PECGCSPAFVGSLKIFLNKYVEAQKKSWGTLLADNESAVDKSIISSSDNDNIVIRNKVVQ
Sbjct: 958  PECGCSPAFVGSLKIFLNKYVEAQKKSWGTLLADNESAVDKSIISSSDNDNIVIRNKVVQ 1017

Query: 844  NIAGVTHRQLQVFLDTCLSRYHTKKIEAGTAIGAIGAQSIGEPGTQMTLKTFHFAGVASM 903
            NIAGVTHRQLQVFLDTCLSRYHTKKIEAGTAIGAIGAQSIGEPGTQMTLKTFHFAGVASM
Sbjct: 1018 NIAGVTHRQLQVFLDTCLSRYHTKKIEAGTAIGAIGAQSIGEPGTQMTLKTFHFAGVASM 1077

Query: 904  NVTLGVPRIKEIINGAKRISTPIVTAALTHDDNVNIARMVKARIEKTNLGQIAKCIQIVM 963
            NVTLGVPRIKEIINGAKRISTPIVTAALTHDDNVNIARMVKARIEKTNLGQIAKCIQIVM
Sbjct: 1078 NVTLGVPRIKEIINGAKRISTPIVTAALTHDDNVNIARMVKARIEKTNLGQIAKCIQIVM 1137

Query: 964  SSRSALIEIKLDMEKIRDAELYVDANVVKQAILVTPKLKLKHEHINVLDDRKLRVLPQDA 1023
            SSRSALIEIKLDMEKIRDAELYVDANVVKQAILVTPKLKLKHEHINVLDDRKLRVLPQDA
Sbjct: 1138 SSRSALIEIKLDMEKIRDAELYVDANVVKQAILVTPKLKLKHEHINVLDDRKLRVLPQDA 1197

Query: 1024 DRNKLHFNLHFLKNMLPGVVVKGIKTVGRAVIKEEKDKARNAKKFSLLVEG 1075
            DRNKLHFNLHFLKNMLPGVVVKGIKTVGRAVIKEEKDKARNAKKFSLLVEG
Sbjct: 1198 DRNKLHFNLHFLKNMLPGVVVKGIKTVGRAVIKEEKDKARNAKKFSLLVEG 1248

BLAST of Csa1G084260 vs. NCBI nr
Match: gi|659130677|ref|XP_008465290.1| (PREDICTED: LOW QUALITY PROTEIN: DNA-directed RNA polymerase III subunit rpc1 [Cucumis melo])

HSP 1 Score: 2110.9 bits (5468), Expect = 0.0e+00
Identity = 1058/1071 (98.79%), Positives = 1067/1071 (99.63%), Query Frame = 1

Query: 4    GSVKKAVSMLGILHYRARSKDAGVVSEDLRAPYNVSNDILNPFRVLCLFQRMSDEDCELL 63
            GSVKKAVSMLGILHYRARSKDAGVVSEDLRAPYNVSNDILNPFRVLCLFQRMSDEDCELL
Sbjct: 178  GSVKKAVSMLGILHYRARSKDAGVVSEDLRAPYNVSNDILNPFRVLCLFQRMSDEDCELL 237

Query: 64   FLSNRPEKLIITNVLVPPIAIRPSVIMDGSQSNENDITERLKRIIQQNASVSQELSTSNS 123
            FLS+RPEKLIITNVLVPPIAIRPSVIMDGSQSNENDITERLKRIIQQNASVSQELSTSNS
Sbjct: 238  FLSDRPEKLIITNVLVPPIAIRPSVIMDGSQSNENDITERLKRIIQQNASVSQELSTSNS 297

Query: 124  QAKCLESWDMLQSEVAQLINSDVRGIPFSMQVSKPLAGFVQRLKGKQGRFRGNLCGKRVE 183
            QAKCLESWDMLQSEVAQLINSDVRGIPF+MQVSKPLAGFVQRLKGKQGRFRGNLCGKRVE
Sbjct: 298  QAKCLESWDMLQSEVAQLINSDVRGIPFTMQVSKPLAGFVQRLKGKQGRFRGNLCGKRVE 357

Query: 184  FTGRTVISPDPNLKITEVAVPIHMARILTYPERVTRHNIEKLRQCVSNGPDKYPGARMLR 243
            FTGRTVISPDPNLKITEVAVPIHMARILTYPERVT HNIEKLRQCVSNGPDKYPGARMLR
Sbjct: 358  FTGRTVISPDPNLKITEVAVPIHMARILTYPERVTSHNIEKLRQCVSNGPDKYPGARMLR 417

Query: 244  HLDGSMRSLMISGRKRLADELKYGEIVERHLEDGDVVLFNRQPSLHRMSIMCHRVRVMPW 303
            HLDGSMRSLMISGRKRLADELKYGEIVERHLEDGDVVLFNRQPSLHRMSIMCHRVRVMPW
Sbjct: 418  HLDGSMRSLMISGRKRLADELKYGEIVERHLEDGDVVLFNRQPSLHRMSIMCHRVRVMPW 477

Query: 304  RTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAILLMGVQNNLCTPKNGEILVASTQ 363
            RTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAILLMGVQNNLCTPKNGEILVASTQ
Sbjct: 478  RTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAILLMGVQNNLCTPKNGEILVASTQ 537

Query: 364  DFLTSSFLITRKDTFYDRAAFSLMCSYMGDGMDLVDLPTPALVKPIELWTGKQLFSVLVR 423
            DFLTSSFLITRKDTFYDRAAFSLMCSYMGDGMDL+DLPTPAL+KPIELWTGKQLFSVLVR
Sbjct: 538  DFLTSSFLITRKDTFYDRAAFSLMCSYMGDGMDLIDLPTPALIKPIELWTGKQLFSVLVR 597

Query: 424  PHASMKVYLNLTVKEKSYSKVKGNEKERETMCPNDGFVYFRNSELISGQVGKATLGNGNK 483
            PHASMKVYLNLTVKEKSYSKVKGNEKERETMCPNDGFVYFRNSELISGQVGKATLGNGNK
Sbjct: 598  PHASMKVYLNLTVKEKSYSKVKGNEKERETMCPNDGFVYFRNSELISGQVGKATLGNGNK 657

Query: 484  DGLYSVLLRDYKAHAAAVCMNRLAKLSARWIGNHGFSIGIDDVQPGDQLVKKKQTTILEG 543
            DGLYSVLLRDYKAHAAAVCMNRLAKLSARWIGNHGFSIGIDDVQPGDQLV KKQTTILEG
Sbjct: 658  DGLYSVLLRDYKAHAAAVCMNRLAKLSARWIGNHGFSIGIDDVQPGDQLVXKKQTTILEG 717

Query: 544  YRDCDKQINLFNTGNLPPEAGCDAAQSLESKITQILNGIREATANVCMQNLHWRNSPLIM 603
            YRDCDKQINLFNTGNLPPEAGCDAAQSLE+KITQILNGIREATANVCMQNLHWRNSPLIM
Sbjct: 718  YRDCDKQINLFNTGNLPPEAGCDAAQSLEAKITQILNGIREATANVCMQNLHWRNSPLIM 777

Query: 604  SQCGSKGSPINISQMVACVGQQSVGGRRAPDGFIDRSLPHFRRKAKTPAAKGFVANSFYS 663
            SQCGSKGSPINISQMVACVGQQSVGGRRAPDGFIDRSLPHFRRKAKTPAAKGFVANSFYS
Sbjct: 778  SQCGSKGSPINISQMVACVGQQSVGGRRAPDGFIDRSLPHFRRKAKTPAAKGFVANSFYS 837

Query: 664  GLTATEFFFHTMGGREGLVDTAVKTADTGYMSRRLIKALEDLSIHYDSSVRNAGGCIVQF 723
            GLTATEFFFHTMGGREGLVDTAVKTADTGYMSRRLIKALEDLSI+YDSSVRNAGGCIVQF
Sbjct: 838  GLTATEFFFHTMGGREGLVDTAVKTADTGYMSRRLIKALEDLSIYYDSSVRNAGGCIVQF 897

Query: 724  CYGDDGMDPAQMEGKSGAPLNFERLFLKAKATCPSDGNKILSPSEFSETVEDRLSKDDAS 783
            CYGDDGMDPAQMEGKSGAPLNFERLFLKAKATCPSDGNKILSPSEFSETVEDRLSKDDAS
Sbjct: 898  CYGDDGMDPAQMEGKSGAPLNFERLFLKAKATCPSDGNKILSPSEFSETVEDRLSKDDAS 957

Query: 784  PECGCSPAFVGSLKIFLNKYVEAQKKSWGTLLADNESAVDKSIISSSDNDNIVIRNKVVQ 843
            PECGCSPAF+GSLKIFLNKY+EAQKKSW TLLADNE+AVDKSIISSSDNDNIVIRN VVQ
Sbjct: 958  PECGCSPAFIGSLKIFLNKYIEAQKKSWSTLLADNETAVDKSIISSSDNDNIVIRNNVVQ 1017

Query: 844  NIAGVTHRQLQVFLDTCLSRYHTKKIEAGTAIGAIGAQSIGEPGTQMTLKTFHFAGVASM 903
            NIAGVTHRQLQVFLDTCLSRYHTKKIEAGTAIGAIGAQSIGEPGTQMTLKTFHFAGVASM
Sbjct: 1018 NIAGVTHRQLQVFLDTCLSRYHTKKIEAGTAIGAIGAQSIGEPGTQMTLKTFHFAGVASM 1077

Query: 904  NVTLGVPRIKEIINGAKRISTPIVTAALTHDDNVNIARMVKARIEKTNLGQIAKCIQIVM 963
            NVTLGVPRIKEIINGAKRISTPIVTAALTHDDNVNIARMVKARIEKTNLGQIAKCIQIVM
Sbjct: 1078 NVTLGVPRIKEIINGAKRISTPIVTAALTHDDNVNIARMVKARIEKTNLGQIAKCIQIVM 1137

Query: 964  SSRSALIEIKLDMEKIRDAELYVDANVVKQAILVTPKLKLKHEHINVLDDRKLRVLPQDA 1023
            SSRSALIEIKLDMEKIRDAELYVDANVVKQAILVTPKLKLKHEHINVLDDRKLRVLPQDA
Sbjct: 1138 SSRSALIEIKLDMEKIRDAELYVDANVVKQAILVTPKLKLKHEHINVLDDRKLRVLPQDA 1197

Query: 1024 DRNKLHFNLHFLKNMLPGVVVKGIKTVGRAVIKEEKDKARNAKKFSLLVEG 1075
            DRNKLHFNLHFLKNMLPGVVVKGIKTVGRAVIKEEKDKARNAKKFSLLVEG
Sbjct: 1198 DRNKLHFNLHFLKNMLPGVVVKGIKTVGRAVIKEEKDKARNAKKFSLLVEG 1248
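
The percentages on each HSP summary line follow from the reported counts: identical (or positive-scoring) positions divided by the alignment length. The following is a minimal Python sketch for checking them, assuming the summary line keeps the layout shown in these reports. The helper name `parse_hsp_summary` is hypothetical and not part of any BLAST tool.

```python
import re

# Hypothetical helper (not part of BLAST): parses a summary line such as
# "Identity = 1058/1071 (98.79%), Positives = 1067/1071 (99.63%), Query Frame = 1".
# The pattern also accepts the "Postives" spelling found in some reports.
_HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_summary(line):
    """Return recomputed and reported percentages for one HSP summary line."""
    m = _HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        # Recomputed from the counts, rounded to two decimals like the report.
        "identity": round(100 * int(ident) / int(ident_len), 2),
        "positives": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity": float(ident_pct),
        "reported_positives": float(pos_pct),
        "alignment_length": int(ident_len),
    }

if __name__ == "__main__":
    line = "Identity = 1058/1071 (98.79%), Positives = 1067/1071 (99.63%), Query Frame = 1"
    print(parse_hsp_summary(line))
```

Comparing the recomputed values with the reported ones is a quick consistency check when copying alignment statistics into other tables.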

BLAST of Csa1G084260 vs. NCBI nr
Match: gi|1009115388|ref|XP_015874201.1| (PREDICTED: DNA-directed RNA polymerase III subunit 1 [Ziziphus jujuba])

HSP 1 Score: 1574.3 bits (4075), Expect = 0.0e+00
Identity = 785/1080 (72.69%), Positives = 911/1080 (84.35%), Query Frame = 1

Query: 4    GSVKKAVSMLGILHYRARS---------KDAGVVSEDLRAPYNVSNDILNPFRVLCLFQR 63
            G VKK+ +  G+  +  RS         K A    +D +   ++SN ++NP  V  LF+R
Sbjct: 177  GKVKKSTASQGVKIFHDRSRLFDGLEDYKSAISHIKDSKISLDMSNHVINPAMVHFLFKR 236

Query: 64   MSDEDCELLFLSNRPEKLIITNVLVPPIAIRPSVIMDGSQSNENDITERLKRIIQQNASV 123
            M DEDCELL+LS+RPEKL++ N+ VPPI IRPSVI DG +SNENDITERLKR++Q NAS+
Sbjct: 237  MLDEDCELLYLSDRPEKLMMVNIAVPPIPIRPSVIADGDRSNENDITERLKRVVQVNASL 296

Query: 124  SQELSTSNSQAKCLESWDMLQSEVAQLINSDVRGIPFSMQVSKPLAGFVQRLKGKQGRFR 183
             QEL  +N  A+ L  WD LQ EVAQ INSDVRG PF+MQV+KP+ GFVQRLKGKQGRFR
Sbjct: 297  QQELLEANCAAR-LSGWDDLQVEVAQYINSDVRGGPFAMQVAKPMGGFVQRLKGKQGRFR 356

Query: 184  GNLCGKRVEFTGRTVISPDPNLKITEVAVPIHMARILTYPERVTRHNIEKLRQCVSNGPD 243
            GNL GKRVE+TGRTVISPDPNLKITEVA+PIHMARILTYPERV+ HNIEKLRQCVSNGPD
Sbjct: 357  GNLSGKRVEYTGRTVISPDPNLKITEVAIPIHMARILTYPERVSYHNIEKLRQCVSNGPD 416

Query: 244  KYPGARMLRHLDGSMRSLMISGRKRLADELKYGEIVERHLEDGDVVLFNRQPSLHRMSIM 303
            KYPGARMLR  DGS  +L ++ RKR ADELKYG+IV+RHLEDGD+VLFNRQPSLHRMSIM
Sbjct: 417  KYPGARMLRRADGSQWNLNVA-RKRRADELKYGDIVDRHLEDGDIVLFNRQPSLHRMSIM 476

Query: 304  CHRVRVMPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAILLMGVQNNLCTPKN 363
            CHR RVMPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAILLMGVQNNLCTPKN
Sbjct: 477  CHRARVMPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAILLMGVQNNLCTPKN 536

Query: 364  GEILVASTQDFLTSSFLITRKDTFYDRAAFSLMCSYMGDGMDLVDLPTPALVKPIELWTG 423
            GEILVASTQDFLTSSFLITRKDTFYDRAAFSLMCSYMGDGMD +DLPTPA++KP+ELWTG
Sbjct: 537  GEILVASTQDFLTSSFLITRKDTFYDRAAFSLMCSYMGDGMDHIDLPTPAVIKPVELWTG 596

Query: 424  KQLFSVLVRPHASMKVYLNLTVKEKSYSKVKGNEKERETMCPNDGFVYFRNSELISGQVG 483
            KQLFS+LVRP A+++V+LNLTV+EK+Y+K+  N +E E +CPNDGFVYFRNSELISGQ+G
Sbjct: 597  KQLFSILVRPDANVRVFLNLTVREKNYTKLLVNGREIEALCPNDGFVYFRNSELISGQLG 656

Query: 484  KATLGNGNKDGLYSVLLRDYKAHAAAVCMNRLAKLSARWIGNHGFSIGIDDVQPGDQLVK 543
            KATLGNGNKDGLY VLLRDYK HAAA CMNRLAKLSARWIGNHGFSIGIDDVQP  +L  
Sbjct: 657  KATLGNGNKDGLYFVLLRDYKTHAAAACMNRLAKLSARWIGNHGFSIGIDDVQPSKRLHD 716

Query: 544  KKQTTILEGYRDCDKQINLFNTGNLPPEAGCDAAQSLESKITQILNGIREATANVCMQNL 603
            +KQ  ILEGY  CD++INL+N G+LPPE GC+AAQ+LE+KI+QILNGIR+AT  +CMQ L
Sbjct: 717  QKQELILEGYGKCDEKINLYNEGSLPPEPGCNAAQTLEAKISQILNGIRDATGKLCMQEL 776

Query: 604  HWRNSPLIMSQCGSKGSPINISQMVACVGQQSVGGRRAPDGFIDRSLPHFRRKAKTPAAK 663
            HWRNSPLIMSQCGSKGSPINISQM+ACVGQQSVGGRRAPDGFIDRSLPHF R  KTP AK
Sbjct: 777  HWRNSPLIMSQCGSKGSPINISQMIACVGQQSVGGRRAPDGFIDRSLPHFHRNQKTPGAK 836

Query: 664  GFVANSFYSGLTATEFFFHTMGGREGLVDTAVKTADTGYMSRRLIKALEDLSIHYDSSVR 723
            GFVANSFYSGL+ATEFFFHTMGGREGLVDTAVKTADTGYMSRRL+KA+EDL +HYD +VR
Sbjct: 837  GFVANSFYSGLSATEFFFHTMGGREGLVDTAVKTADTGYMSRRLMKAMEDLCVHYDDAVR 896

Query: 724  NAGGCIVQFCYGDDGMDPAQMEGKSGAPLNFERLFLKAKATCPSDGNKILSPSEFSETVE 783
            NA GCIVQF YGDD MDPA MEGK+GAPLNFERLFLKAKATCP+  N+ LSP + SE VE
Sbjct: 897  NASGCIVQFRYGDDNMDPANMEGKNGAPLNFERLFLKAKATCPAGENEKLSPEKVSEIVE 956

Query: 784  DRLSKDDASPECGCSPAFVGSLKIFLNKYVEAQKKSWGTLLADNESAVDKSIISSSDNDN 843
             R+S+ D +PE GCS AF  SLK FL++YV A K++           + +S+ +  + + 
Sbjct: 957  SRISQQDMTPEWGCSLAFKTSLKCFLDEYVMALKRT------QERFGLIESLAAMENFET 1016

Query: 844  IVIRNKVVQNIAGVTHRQLQVFLDTCLSRYHTKKIEAGTAIGAIGAQSIGEPGTQMTLKT 903
            +    K+V +I+GVT RQL+VFLDTC+SRYH KKIE GTAIGAIG  SIGEPGTQMTLKT
Sbjct: 1017 V---EKIVLHISGVTARQLEVFLDTCISRYHMKKIEPGTAIGAIGGHSIGEPGTQMTLKT 1076

Query: 904  FHFAGVASMNVTLGVPRIKEIINGAKRISTPIVTAALTHDDNVNIARMVKARIEKTNLGQ 963
            FHFAGVASMN+T GVPRIKEIIN AK+ISTP++TA L  DDNVN+AR+V+ RIEKT L  
Sbjct: 1077 FHFAGVASMNITQGVPRIKEIINAAKKISTPVITATLECDDNVNVARVVRGRIEKTRLCD 1136

Query: 964  IAKCIQIVMSSRSALIEIKLDMEKIRDAELYVDANVVKQAILVTPKLKLKHEHINVLDDR 1023
            +AK ++ VM++RSA I I LDM+ I+DA+L +DANVV+++IL TPK+KLK EHI VL+ +
Sbjct: 1137 VAKTLKTVMTNRSAAIVITLDMDMIQDAQLSIDANVVQESILKTPKIKLKQEHIKVLEVK 1196

Query: 1024 KLRVLPQDADRNKLHFNLHFLKNMLPGVVVKGIKTVGRAVIKEEKDKARNAKKFSLLVEG 1075
            KL +LPQ+ DR+K+HFNL++LK++LP V+VKGIKT+ R +IK+E+DK     K  LLVEG
Sbjct: 1197 KLEILPQEVDRSKIHFNLNYLKSILPTVIVKGIKTIERVIIKKEEDKKTQMSKCKLLVEG 1245

BLAST of Csa1G084260 vs. NCBI nr
Match: gi|645216740|ref|XP_008222562.1| (PREDICTED: DNA-directed RNA polymerase III subunit rpc1 [Prunus mume])

HSP 1 Score: 1559.7 bits (4037), Expect = 0.0e+00
Identity = 793/1094 (72.49%), Positives = 903/1094 (82.54%), Query Frame = 1

Query: 4    GSVKKAVSMLGILHYRARSKDAGVVSE---------DLRAPYNVSNDILNPFRVLCLFQR 63
            GSVKKAV +LGI+H   RSK  GV+ +         + +A +NV+  +LNP R+  LF+R
Sbjct: 182  GSVKKAVGVLGIIH--DRSKLNGVMDDFRTTLSHTKESKASFNVATHMLNPARIYSLFKR 241

Query: 64   MSDEDCELLFLSNRPEKLIITNVLVPPIAIRPSVIMDGSQSNENDITERLKRIIQQNASV 123
            M DEDCELL LSNRPE LI+ N+ VPPIAIRPSVI+D S SNENDITERLKRIIQ NA +
Sbjct: 242  MVDEDCELLNLSNRPENLIMKNIAVPPIAIRPSVIVDRSLSNENDITERLKRIIQSNAIL 301

Query: 124  SQELSTSNSQAKCLESWDMLQSEVAQLINSDVRGIPFSMQVSKPLAGFVQRLKGKQGRFR 183
             ++L  + S  KCL SWD+LQ EVAQ INSD+RG+PFSMQ +KPL+GFVQRLKGK GRFR
Sbjct: 302  RRDLLEAKSAPKCLASWDILQVEVAQYINSDIRGVPFSMQTAKPLSGFVQRLKGKGGRFR 361

Query: 184  GNLCGKRVEFTGRTVISPDPNLKITEVAVPIHMARILTYPERVTRHNIEKLRQCVSNGPD 243
            GNL GKRVE+TGRTVISPDPNLKITEV +PI MA+ILTYPERV+ HNIEKLRQCVSNGPD
Sbjct: 362  GNLSGKRVEYTGRTVISPDPNLKITEVGIPIKMAQILTYPERVSHHNIEKLRQCVSNGPD 421

Query: 244  KYPGARMLRHLDGSMRSLMISGRKRLADELKYGEIVERHLEDGDVVLFNRQPSLHRMSIM 303
            KYPGARMLR+ DG+  SL ++ RK  AD LKYG+IVERHLEDGD+VLFNRQPSLHRMSIM
Sbjct: 422  KYPGARMLRNPDGTEWSLKVN-RKNAADGLKYGDIVERHLEDGDIVLFNRQPSLHRMSIM 481

Query: 304  CHRVRVMPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAILLMGVQNNLCTPKN 363
            CHR +VMPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEA+LLMGVQNNLCTPKN
Sbjct: 482  CHRAKVMPWRTLRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAVLLMGVQNNLCTPKN 541

Query: 364  GEILVASTQDFLTSSFLITRKDTFYDRAAFSLMCSYMGDGMDLVDLPTPALVKPIELWTG 423
            GE+LVASTQDFLTSSFLITRKDTFYDRA+FSLMCSYMGDG D VDLPTPA++KPIELWTG
Sbjct: 542  GEVLVASTQDFLTSSFLITRKDTFYDRASFSLMCSYMGDGTDPVDLPTPAVIKPIELWTG 601

Query: 424  KQLFSVLVRPHASMKVYLNLTVKEKSYSKVKGNEKERETMCPNDGFVYFRNSELISGQVG 483
            KQLFSVLVRP+++++VYLN TV EKSYSK +      E MCPNDGFVYFRNSELI+GQ+G
Sbjct: 602  KQLFSVLVRPNSNVRVYLNFTVNEKSYSKTEDGGP--EAMCPNDGFVYFRNSELIAGQLG 661

Query: 484  KATLGNGNKDGLYSVLLRDYKAHAAAVCMNRLAKLSARWIGNHGFSIGIDDVQPGDQLVK 543
            K TLGNGNKDGLYSVLLRDYKAHAAA CMNRLAKLSARWIGNHGFSIGI DVQP D L  
Sbjct: 662  KGTLGNGNKDGLYSVLLRDYKAHAAASCMNRLAKLSARWIGNHGFSIGISDVQPSDNLYN 721

Query: 544  KKQTTILEGYRDCDKQINLFNTGNLPPEAGCDAAQSLESKITQILNGIREATANVCMQNL 603
            +K+  I EGY  C  +I L+N G LP E GCDAAQSLES IT+ILN IR+ T  +CMQ L
Sbjct: 722  EKEKIIKEGYNKCAGKIKLYNEGQLPLEPGCDAAQSLESGITKILNNIRDQTGKLCMQTL 781

Query: 604  HWRNSPLIMSQCGSKGSPINISQMVACVGQQSVGGRRAPDGFIDRSLPHFRRKAKTPAAK 663
            HWRNSPLIMSQCGSKGSPINISQM+ACVGQQSVGGRRAP+GFIDRSLPHF RKAKTPAAK
Sbjct: 782  HWRNSPLIMSQCGSKGSPINISQMIACVGQQSVGGRRAPNGFIDRSLPHFPRKAKTPAAK 841

Query: 664  GFVANSFYSGLTATEFFFHTMGGREGLVDTAVKTADTGYMSRRLIKALEDLSIHYDSSVR 723
            GFVA+SFYSGLTATEFFFHTMGGREGLVDTAVKTADTGYMSRRL K LEDLS+ YD++VR
Sbjct: 842  GFVASSFYSGLTATEFFFHTMGGREGLVDTAVKTADTGYMSRRLSKILEDLSVQYDNTVR 901

Query: 724  NAGGCIVQFCYGDDGMDPAQMEGKSGAPLNFERLFLKAKATCPSDGNKILSPSEFSETVE 783
            NA GCIVQFCYGDDGMDPA MEG+ GAPL+F RLFLKAKATCP+  N+ LS  E SE V+
Sbjct: 902  NASGCIVQFCYGDDGMDPAMMEGEGGAPLDFRRLFLKAKATCPAGENESLSLEEVSEIVK 961

Query: 784  DRLSKDDASPECGCSPAFVGSLKIFLNKYVEAQKKSWGTLLADNESAVDKSIISSSDNDN 843
            DRLSK D +P+ GCS  F  SL+ FL+KY +  +K+  T + D   A  +   S      
Sbjct: 962  DRLSKQDMTPDRGCSAGFKSSLEEFLDKYAKELRKTHDTFVLDQSPAWKEKSAS------ 1021

Query: 844  IVIRNKVVQNIAGVTHRQLQVFLDTCLSRYHTKKIEAGTAIGAIGAQSIGEPGTQMTLKT 903
                 K+VQNI+GVT +QL+VFL+TC+SRYH+KK+EAGTAIG IGAQSIGEPGTQMTLKT
Sbjct: 1022 ---LEKIVQNISGVTCKQLEVFLNTCISRYHSKKVEAGTAIGVIGAQSIGEPGTQMTLKT 1081

Query: 904  FHFAGVASMNVTLGVPRIKEIINGAKRISTPIVTAALTHDDNVNIARMVKARIEKTNLGQ 963
            FHFAGVASMNVTLGVPRIKEIINGAK+ISTPIVTA L H++N   AR+V  RIEKT LGQ
Sbjct: 1082 FHFAGVASMNVTLGVPRIKEIINGAKKISTPIVTAILEHNNNAKFARVVAGRIEKTMLGQ 1141

Query: 964  IAKCIQIVMSSRSALIEIKLDMEKIRDAELYVDANVVKQAILVTPKLKLKHEHINVLDDR 1023
            ++K I+IVM+SRSA I I LDM  I+DA L +DANVVK++IL TP++KLK EH+ VLD R
Sbjct: 1142 VSKSIKIVMTSRSASIVITLDMVMIQDAHLSIDANVVKESILRTPRIKLKQEHVKVLDIR 1201

Query: 1024 KLRVLPQDADRNKLHFNLHFLKNMLPGVVVKGIKTVGRAVI------KEEKD-------- 1075
            KL +LPQ+ADR++LHFNL++LK++LP V+V+GI TV R VI      K +KD        
Sbjct: 1202 KLEILPQEADRSRLHFNLYYLKSVLPKVIVRGISTVQRVVIDAKEVKKNKKDLEVTCADG 1261

The following BLAST results are available for this feature:
Match Name     E-value     Identity   Description
NRPC1_ARATH    0.0e+00     64.79      DNA-directed RNA polymerase III subunit 1 OS=Arabidopsis thaliana GN=NRPC1 PE=2 ...
RPC1_HUMAN     1.4e-283    51.44      DNA-directed RNA polymerase III subunit RPC1 OS=Homo sapiens GN=POLR3A PE=1 SV=2
RPC1_BOVIN     2.1e-282    50.87      DNA-directed RNA polymerase III subunit RPC1 OS=Bos taurus GN=POLR3A PE=2 SV=1
RPC1_CHICK     3.9e-281    51.06      DNA-directed RNA polymerase III subunit RPC1 OS=Gallus gallus GN=POLR3A PE=2 SV=...
RPC1_SCHPO     6.2e-279    48.90      DNA-directed RNA polymerase III subunit rpc1 OS=Schizosaccharomyces pombe (strai...

Match Name          E-value    Identity   Description
A0A061FU60_THECC    0.0e+00    71.93      DNA-directed RNA polymerase subunit OS=Theobroma cacao GN=TCM_012066 PE=3 SV=1
I1LK95_SOYBN        0.0e+00    71.10      DNA-directed RNA polymerase subunit OS=Glycine max GN=GLYMA_U034000 PE=3 SV=2
W9QSP3_9ROSA        0.0e+00    74.47      DNA-directed RNA polymerase subunit OS=Morus notabilis GN=L484_007172 PE=3 SV=1
A0A067GCD6_CITSI    0.0e+00    71.79      DNA-directed RNA polymerase subunit OS=Citrus sinensis GN=CISIN_1g000828mg PE=3 ...
V7AFW1_PHAVU        0.0e+00    69.76      DNA-directed RNA polymerase subunit OS=Phaseolus vulgaris GN=PHAVU_011G070100g P...

Match Name     E-value     Identity   Description
AT5G60040.2    0.0e+00     64.87      nuclear RNA polymerase C1
AT4G35800.1    3.4e-142    41.14      RNA polymerase II large subunit
AT3G57660.1    2.5e-113    31.26      nuclear RNA polymerase A1
AT1G63020.1    2.9e-40     28.57      nuclear RNA polymerase D1A
AT2G40030.1    2.6e-33     23.38      nuclear RNA polymerase D1B

Match Name                           E-value    Identity   Description
gi|700209631|gb|KGN64727.1|          0.0e+00    100.00     hypothetical protein Csa_1G084260 [Cucumis sativus]
gi|449462093|ref|XP_004148776.1|     0.0e+00    100.00     PREDICTED: DNA-directed RNA polymerase III subunit rpc1 [Cucumis sativus]
gi|659130677|ref|XP_008465290.1|     0.0e+00    98.79      PREDICTED: LOW QUALITY PROTEIN: DNA-directed RNA polymerase III subunit rpc1 [Cu...
gi|1009115388|ref|XP_015874201.1|    0.0e+00    72.69      PREDICTED: DNA-directed RNA polymerase III subunit 1 [Ziziphus jujuba]
gi|645216740|ref|XP_008222562.1|     0.0e+00    72.49      PREDICTED: DNA-directed RNA polymerase III subunit rpc1 [Prunus mume]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term          Definition
IPR000722     RNA_pol_asu
IPR006592     RNA_pol_N
IPR007066     RNA_pol_Rpb1_3
IPR007080     RNA_pol_Rpb1_1
IPR007081     RNA_pol_Rpb1_5
IPR007083     RNA_pol_Rpb1_4
Vocabulary: Biological Process
Term          Definition
GO:0006351    transcription, DNA-templated
Vocabulary: Molecular Function
Term          Definition
GO:0003677    DNA binding
GO:0003899    DNA-directed RNA polymerase activity
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0006144        purine nucleobase metabolic process
biological_process    GO:0006206        pyrimidine nucleobase metabolic process
biological_process    GO:0006351        transcription, DNA-templated
cellular_component    GO:0005730        nucleolus
molecular_function    GO:0003677        DNA binding
molecular_function    GO:0003899        DNA-directed RNA polymerase activity
molecular_function    GO:0032549        ribonucleoside binding
molecular_function    GO:0008270        zinc ion binding
This gene is associated with the following unigenes:
Unigene Name    Analysis Name                          Sequence type in Unigene
CU124784        cucumber EST collection version 3.0    transcribed_cluster
CU139293        cucumber EST collection version 3.0    transcribed_cluster
CU144651        cucumber EST collection version 3.0    transcribed_cluster
CU178088        cucumber EST collection version 3.0    transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
Csa1G084260.1    Csa1G084260.1    mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name    Unique Name    Type
CU139293        CU139293       transcribed_cluster
CU144651        CU144651       transcribed_cluster
CU178088        CU178088       transcribed_cluster
CU124784        CU124784       transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term     IPR Description                  Source     Source Term            Source Description                                              Alignment
IPR000722    RNA polymerase, alpha subunit    PFAM       PF00623                RNA_pol_Rpb1_2                                                  coord: 179..345, score: 3.8
IPR006592    RNA polymerase, N-terminal       SMART      SM00663                rpolaneu7                                                       coord: 70..373, score: 1.1E
IPR007066    RNA polymerase Rpb1, domain 3    PFAM       PF04983                RNA_pol_Rpb1_3                                                  coord: 348..526, score: 5.6
IPR007080    RNA polymerase Rpb1, domain 1    PFAM       PF04997                RNA_pol_Rpb1_1                                                  coord: 36..177, score: 6.2
IPR007081    RNA polymerase Rpb1, domain 5    PFAM       PF04998                RNA_pol_Rpb1_5                                                  coord: 664..1075, score: 2.7
IPR007083    RNA polymerase Rpb1, domain 4    PFAM       PF05000                RNA_pol_Rpb1_4                                                  coord: 557..657, score: 6.1
None         No IPR available                 GENE3D     G3DSA:2.40.40.20                                                                       coord: 276..348, score: 2.0E-42; coord: 184..211, score: 2.0
None         No IPR available                 GENE3D     G3DSA:3.30.1490.180                                                                    coord: 212..274, score: 1.7
None         No IPR available                 PANTHER    PTHR19376              DNA-DIRECTED RNA POLYMERASE                                     coord: 1..1074, score:
None         No IPR available                 PANTHER    PTHR19376:SF32         DNA-DIRECTED RNA POLYMERASE III SUBUNIT RPC1                    coord: 1..1074, score:
None         No IPR available                 unknown    SSF64484               beta and beta-prime subunits of DNA dependent RNA-polymerase    coord: 36..1059, score:
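
The `coord:` values in the InterPro table give the start and end of each domain match on the protein. A minimal Python sketch for turning them into spans and lengths is given below. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the helper names `domain_spans` and `span_length` are hypothetical.

```python
import re

# Hypothetical helpers for the "coord: start..end" entries in the table above.
# Only digit runs joined by ".." are matched, so scores such as "2.0E-42"
# are ignored.
_COORD_RE = re.compile(r"(\d+)\.\.(\d+)")

def domain_spans(text):
    """Return every (start, end) pair found in the given text."""
    return [(int(start), int(end)) for start, end in _COORD_RE.findall(text)]

def span_length(span):
    """Length of a span, assuming 1-based inclusive coordinates."""
    start, end = span
    return end - start + 1

if __name__ == "__main__":
    for span in domain_spans("coord: 179..345 coord: 348..526"):
        print(span, span_length(span))
```

Under the inclusive-coordinate assumption, the PF00623 match at 179..345 covers 167 residues.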

The following gene(s) are orthologous to this gene:
Gene           Orthologue      Organism                     Block
Csa1G084260    CSPI01G12610    Wild cucumber (PI 183967)    cpicuB000
Csa1G084260    Cucsa.226960    Cucumber (Gy14) v1           cgycuB323
The following gene(s) are paralogous to this gene:

None