ClCG00G000380 (gene) Watermelon (Charleston Gray)

NameClCG00G000380
Typegene
OrganismCitrullus lanatus (Watermelon (Charleston Gray))
DescriptionDNA-directed RNA polymerase IV subunit 1
LocationCG_Chr00 : 526082 .. 546292 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
GTGGTGAACTGGCTTGGGGGGGAGAAGAAGTTGTATGAGTTCTGCTGGAGACTTTCGACCAGGCCAGAAGGTATAATTTTCAGGACGATATTGCAGTTTAGTCTTGAACCTGGTTCTAGAGTTGAATCATCCCTTTAACTTGTATTGTATTTAGCAACTCATTATCTATTTCTCTCGTTTAAATTTTCCACTTATGTCTCTTTTTACTAGAAGCTTCTAGAGTGTTAAGTCATTACCTTAGGCTAGTAGTACTAGTCTCTGCTTGTCTTTAGCATGTCTTAATGCCTTGCTATTCTTACATACTTGCAAAATCTTTGTTAGTTCATCTATTAGATTTTCCAACATCAATTGGTATTCTTCTTCTTGTCTTCTTCTTCTTCTTCTTCTAGTGAAGTCTCAGTTTCAGGAATCTCCATAAAGATCAAGTTTTACTTAGCCTTCCAATATCCCCTTCTTGTCCATGTTATTTTCTGTCCTCATAATTGTTTCTTGTCAAATAAAGGATCAAATGCTCTAGAAAAATTTTCATTAAATGTTGTTATTGTGACTCTTTCTTTTGGGTTCCCGACCAACATCTCTCCATGCTTTCCTTCATGTTATAGATGCAATTTAGTTTATTTAATTTATTTCAAATGCACTAGGTGATGATCCATATGGAAGATGAACAGGATGGTGAGCTACCAATTCCATCTGGTCTCGTTACTGGCATAAACTTTAGTGTCTCAACTCAGCAAGATACAGTAAGTATTTTAATTCTGATATAGTATCCTCATATACTGCTTTAGGAAGTCAATAATTTTCAAGACAATCAATTGTCAATAAGTGCGAGTGTAGGTTCAAGCCACAATGGACTACTTATCTAGGATTTAATATCCTATGAACTTCCTTGGTAATTAAATGTAGCCAGGTTAGGTGGTTATCTCATAAGAATAGTCAAGGAGCATGCAAGCTTGCCTCAAAACTCACAATGTAAACAAACAAATTATAACAATTCATATTGGTGAAGGGTTCCATCATAATAAGTAATAACCCAAGTTCAAGACTGGTTTGTGAATTTTAATGGTAAAGGAGTTTAATTGCAATAATTTCATAAGTGGGGGGGCATTGTTCAAATCTATAAAACTCATGGTGTGATTACTCTATATTGGAAGGTGAACCTTTTTTTTCTTCTTCCTCTTTTTCCCCGACATTTATCTTAATTTTTATTCCATGTGCTTTTGTTTGTAAAGTAAGGCAATTTTTTTTTTTCATATATAAAAATTGTTGTTGGATAAAAAACCAAAGGTGCCTTAAACTTGTGAGTTTATACGTGCCTTTAATTTTGTATGGTGTCATGTTCATTACATACAGGATTTTATTTTTGATATTCTGATGTGAAAAGAAACTTTATCGATGTAACAGGAGAATATAGCAGTAATGACAGTTGATGCATCCAGCGAGGTATCTGATCCTAAGTTGGGACTTCCGAATCCATCTTATCAGTGCACCACATGTGGTGCTAGCTCCCTAAAATCTTGTGAAGGTAATTAACTAGTAACGTTACAAAGATGTTACTTGTTAAATTTTCTCACACATGTGTTCTAACATTTTTGTTTGAATGGTTTTGGAAGGACATTTTGGGGTTATCAAATTCCCATATACTATAATCCATCCTTATTTTCTCTCGGAAGTTGCACAAGTGTTGAATAAAGTTTGTCCAGGGTGTAAATCTATCAGGCAGGAACTATGGGGCAAGGTCAGGTACCTCACTTATTGTGATGAACCTTGAATTATTGTAATTAATTTAAAAGTTATGGCTTTCTGCTATTAAACAACGCAGTTATCTTGCCCATGTGGATATGTAGATAATTCTCGTTAAAAAATGTCTTTTCTTTTTTCCATCCATGTTCTGAGGAAAATTCTTTGAATTCTGATTCAATTGAAGAAACATGTACAGGGGTTTTAGTACCCTCCGTCAGAATGTGCCTTACCACTCTCCCCACCCCACCCACTTATCTTACCCCTCCACAACCAGTTTGTCTTCACTCCTTTGGATCCAACTTTAGGTTTAACAAGAGGAGTTATTCACTCGACAAAGGACTCTCCACCTCATGGTGACCTCTTTGAGGTTGACTCCGAGGTAAGTTTGAGTTGTGTGGAATCAGTCATTTAGCCTGTGGATTTCAGTATGGAGTTTGATGTTGTTGAAGATTCTTTATCTGAAGCTTTTGACTTACTATTTTGGGATTGTGACAAGAGGACTCCTCCTCCTATTCCTTCATCCATGCCGGCCAAATTTGCTTCGCTGATGGAAGCTTGTGGTTTTGAGTTTTGTGAAATTCCTCCTTTCTCTCGAGAAAGAAGGCATGTCATGGTTTAGCTTTGCGAAGTTTTGTTTTTCTTTCATCTCTTAGACGTTTGGATTGACTTGCATGAGGAGATTTTTTATTTCTTTGCTGGGGATGGTTCTTCCAAAGCTTCTTTCAATGGGTTAAAGTTGTTGCAGAGGTGAAGAGTTTTTCCAGAAATGGTTTGAACTTTGGGAGTTTGATGGTATCCATCTATTTGGTATTGAGAAGCTTTATGATTTCGTTGCAACAGGTTTCTTTATGATTGGTTATAAAAGGTTTTTTTTGGTTGGTTGGTAGTTGTTTCTTGGTTTTCTAGAAGAATTGAAGTAGCTATCAGTTTTCAGTGTTTGTTGGAGGCCTTACTTATAGTTTCATCGTTCATTCACAAGCTCCCTTAGTCTTACAGTCAAAGATTTTGGAAAAGTTCTTGGTTCTTTAAATGTTTATTTATTTTCTGGCCAATCTCTATAAGATTGTATTTGCTGTTTGTATTATTTCTTTTTCACTCCCGTTGGGAGTTTGTATCCCTGAACTCTTCTTCCTTTTCATTATATCAACGAGAAGTTGTTTCTTGTAAAAGAAAACATAATAAACAAACAAATAAATCAAAGGTATAATAGCGTGTTTAGAAGTGTTTCTAAGGGTATACTTGCATTTTTTTCTTCTTGCGCTTTTGAGTGGAAAAAACTGACCGCTTCTTCTAAAAGCACCCATGTCTCGTATGTAGCTGAATGCCATGAAAAATCAAAACACCTTTTTCAATTCATGAAAACCATGGGGTGGGTGGGTGGTTGATAGAAAAGGAATGGAGAAAAAAAAAAAAATAGAAACCAAGTGACTTATATTTATTCTTCCAAATGCCGGAGTTACCAAAAATCTAAACGAACTTTTCATTGATGTATTAAAAGATCAAAAAATTTTCTAAGATACAACAAACTCCTAAGTGGGGAGTAAAAAGGAAATAAAACAACCACATAAAGAACAAAAATCCAAATATTACGATAGTCTAGAAATAAAACGTTGAAGAACAAAGCACCAAAGATCTTGAAACCTCAAAGCCAAAGCCAAGAGGAATTTCAAAAGCACTATCCCATGAATGAAACCTCAAAGCCAAGATGAATTTGCAAAAGAGGCTCCTAACTCCCTACAAAGCTCCTGAACACTCAGGTGACTAGAGGAAGGAGCAGCCTACTTTGAACCTTCTTTGGAACATCTTTGGCAGAAGGCAAAAGGAGAAGGCTCCATTAAGAACTCCCCATAAGAATTAGCAACCCAAGAAGCAGTTATCCACAAAATTTAGACTTTAAATGAAATCGTAGACTGACAACACCAAGATATCTGGCCCATCTCTTGGAAAAAGACATTCGTTAGACTTGATCTGATGTGCTCATTTCTATGAAATTCTTTGAAAGGTCTCATAAATACAGCCTCAAGTTTTAAAGATGCCATGAGTTAACAAAGAATTCAATATTAAAACCATCAAGTCACAAGATCCATGCCTTATTAGCAGCTGAAAGAAAGCTAAGTTCTTTTTTTTTTAAATCATCATCACTATGAAAAAAAAGCTAAGCTCTTCAGTGAGTTTGAATTTCTTTTTTTGAAAAGGAAACATCATTTTTTCATTCATCTACTGAAGAGGATGAATGATTAGCTTGAATATATGTCGAGTTGGTTGAATAGTAAGTCTCCATGCCGAGAAGTGAGCTGAGAAGTGAGAACTATTCTTATAATAGTTTCCTCTTGGCACACTTTATCTACACCCATTTGTAGTTTCTCCAATCTTCCCCCTTTATTTCTAATGGAGAGAGTCTTTTGTAGCTCGATTTGCTTGGCAGTGCATCTTTCCCACAGTCCCATTCTTAATTTATGTCTCTTCAATTGATGGAAATTTGTTTCTTATCAGGAAGGGGGAAAAAAAAAAAAAAAGAAAAGAAAAAAGAAAAAAGAAAAAAAGATGTCATTGTGCCATTGGAATTCCACCCTTTGATATTTTTATCTGGGTTGGTTCAATATGGGACTAACCATGGTCACATATATAACTGCCTGGTGGTGGTGGTGGTAGCTTCTGCATTTTTTTTCTTAATCTAATCTAAGATTTATTTTTAAATTTGTATTTTGGTTCTCTTTACATGTACCAGGTTGAGGATCCAACATCTGAATATCATCGACCTAAAGGTTGCAGATATTGTTTTGTAAGTTCATTTAGCACACATTGTTTTATTTCTGTCTTAACTGTTTATCATTTCAAAACTGTTCTTGTGAAAAAGTTTTATTTGAAACCATGCTTTATATGGTGAACAGGGAAGTCTAAAGGATTGGTATCCGCCTATGAGGTTTAAGCTTTCAACTACTGATATGTTCAAAAAAAGTATGATTATGGTGGAAGTGAAAGAAAATATGTCGAAGAAATATCAAAAGAGGGTGGCTAAAGGAGGCTTGCCTTCAGATTATTGGAATTTTATCCCTAAGGATGAACAACAGGAAGAAAGTTATTGTAGACCAAACAGGAAAATCCTAACGCATGCTCAGGTATTGGGCCTCTTTATGCATCTGAGTCCTGACTAGATTTAGTTTATTCAATGTTTTCATCTGCGGAAATTTCCATTGTTTTCTCAATTTGTTGTCCTAATCGTCAGAAACCATTGTATTTCTTGATCACTGAACCCCATGAAACCCTGATTTGTTTAGTGATGTGATTAGTATGTTCTCTTTTCTTACTAAAGAAAAAAAGACAAGAAAAAAGAAATTCAAATCAGGAACCAGAGAGAGAGAGTGAGAGAGAAAGAGATGTGTCAAAATTATAAATGCATAAGCTCAACGACTTTAAAGATTATAAAAATAATAGATGTCCTAGTCTTTTACTTCATTGGACCCCTTTTCTAAAATGGGTGTTTTGTGGGCTTGATTTTTTATTTATTTATTTTTTAATGCCCTTGGATGGCCTTTCATTTTTTTCAATGTAGTTATTTCAAAAAAAAAAAAAATAAATAATAAATAAATAAAGATGTCCTGCCATCTAGTTCATGTCGTGTTGTGCTATTCTTGAGTTTTTGTTGTTGTGCTTCTTATGTCTTAGGTCTTTTTTATCATTATCGTTCTTTTAGTTTAAACTGTTCCCTTAGTATAATAATCTCAATCCCCTCTTTAATTTATTGCTAAGAATAAGTCTTACCCCTAATTCTGACCCTAAGAAACTCTATGATGCTGATATTTGTATTAGTGCTTTCACTTTCAATTTTACCATTAGCATGTTGCGGTTTTTCAGTAGGTTTTCTATTTTTTTATTATTTTTTAAATATATATTATAACTTTGAAAAAGAGAAAGTGGATGAACTAAAAAAAGAAAGTGAAAGTGGACGAACTTTGCAGTCTTGAAAACCTTGAACAAATTGGAGAGTTTAACGAATATTATAAACTAGTGGAGTTTGTTGTCTGTAATGTGTTGCTCTAGTCACTACTGATGCATTTCAAAATTAACTTGGCTTCAATTTTATAATTTTGATGTTTCTTTGTCATTCATACCTTACCCATTTCATTCTGGCTACTGCAAATCTTTTGCATTATTGTAAACCTGTAATATTTTCCTTCCCTATAATGATACAATATTTAAGTGATACGTTTTTCTTGGATTTCAGGTTCATTATTTGTTGAAAGACATTGACCCAAAGTTTCTCAAAAAGTTTGTGCCTGCAATAGATTCACTGTTTCTAAACTCTTTCCCTGTTACTCCAAACAGTCATCGTGTGACTGAAATGACACATTCATTTTCAAATGGACAGCGATTGATCTTTGTAAGACTTTTCTTCTTACTGCCCTGTTGTTAGGTTGATGACTGACGCTATTCAAATGGAAGGGTGTTTGAATGACATTTGGCTCTTTCTAACGTGTTGCTTTTAAACATGTTTGATTTAAGGATGAAAGGACCAGGGCTTACAAGAAAGTGGTTGATTTCAGAGGGACAGCTAACGAGTTAGGTTCTCGCGTTCTCGATTGTCTCAAAATTTCGAAGGCAATTTACAATATCCTTTTCTTTTGCCTTAATCATGTTAATTATTATGTGAAACTTTTGCTCTTGCTTATATTGTCACTTCTTCTATGCAGCTTAGCCCAGAGAAGTTACAGAGTAAAGATTTGGTTTACCAGCAAAAGAAAATTAAGGATACTGCTACTAGCTCATATGGTTTAAGATGGATCAAAGATGTTGTTCTTGGAAAGCGGAGTGACCATTGTTTTCGCATGGTTGTTGTTGGCGATCCAAACATTGAGTTAAGTGAAATTGGCATACCATGTCATGTTGCAGAGAGGTTGCAAATATCTGAACATCTGAGTTCTTGGAATATGAAGAAATTAAGCACTTCTTGTTACCTTCATCTTGTTGAAAAGGGAGAGATCTTTGTTCGTCGTGAAGGTCGTCTAGTTCGTGTACGTAATGTTCTTGAACTTAATATGGGGGACACTATATATAGGCCCCTAGCTGATGGGGATGTTGTGCTGGTTAATCGACCACCATCCATACATCAGCACTCACTTATTGCTTTATCTGTCAAGCTTCTTCCTGTCTCCTCAGTTCTTTCCTTAAACCCACTTTGTTGTTCTCCTTTCCGTGGAGATTTTGATGGTGACTGCCTTCATGGTTATGTTCCTCAATCACTCGAAGCCCGAGTGGAAGTTAGAGAGCTGGTTTCTCTAGATAGACAGCTAATTAATGGCCAAAGTGGTAGAAATCTGCTGTCACTTAGTCATGATAGTTTAACTGCTGCTCATTTAATTATGGAAGATGGAGTTTCGTTAAATCTTTTCCAGATGCAGCAGTTGCAAATGCTCGCTTTACATCAGTTGTTGCCCCCAGCAATTTTAAAAGCTCCTTTGCTTAGAAATTGCGCTTGGACTGGTAAACAGTTATTCAGCACCCTCCTACCTCCTGATTTTGATTATTCTTCTCCTTCTCACTGTGTCCTTATTGAAAATGGAGAATTAATATCTTCGGAAGGATCTTACTGGCTCCGCGATAGTGGCAGAAACCTCTTCCAAGCGCTAATAGAACACTGTGAAGGCAAGACCCTTGACTACTTGCACGATGCTCAAGGGGTTCTTTGTGAATGGTTATCAATGAGGGGCTTGAGTGTTTCATTGTCAGACTTGTACCTATCCGTGGATTCATACTCTCACAAAAACATGATGGATGATATCTTTTGTGGGTTACAGGAAGCTGAGGAAACATGTAATTTAAAGCAGCTGATGGTGGATGCACATAAAGATATCCTTACTGAAGATGACGAAGATAATCAACACGTGTTGTCTATTGCTGTGGATCGTTTAAGTTATGAGAAGCAGAAATCTGCTGCTCTAAATCAAGCTTCTGTTGATGCTTTCAAGAAAGTTTTTCGTGATATACAAAATCTAGTTTACAAGTATTCTGGTAAAGACAATTCACTTCTTACCATGTTCAAGGCTGGAAGCAAGGGTAATTTGCTAAAACTAGTTCAGCATAGCATGTGTCTTGGCTTGCAACACTCTTTGGTTACTCTATCCTTTAGCCTTCCACATAAGCTTACGTGTTCTGCATGGAACAGCCAGAAGATGCCTCGTTATACTCAGAAGGATGGTCTTCCTGACCGTACGTCGTCTTTCATACCATATGCTGTGGTTGAAAGTTCCTTTCTCTCAGGGCTTAATCCGTTTGAATGTTTTGCTCATTCGGTGACAAATCGAGATAGCTCTTTCAGCGACAATGCTGAAGTTCCTGGCACTTTGACACGAAAACTTACATTCCTAATGCGGGATATATATACTGCATATGATGGAACAGTGAGGAATGCATATGGAAATCAGCTGGTTCAGTTTTCTTATGACATTGATAGACCTACTAGCGTCTCTAATGAATTGGATAGCGAGAACAATAATAGAGATCATGATATAGGTGGTCATCCTGTTGGGTCATTGGCTGCCTGTGCCATGTCAGAAGCTGCATATAGTGCTCTGGACCAACCAATTAGTCTACTTGAAGCTTCCCCATTGCTAAACCTAAAGGTACGATAGCCTTCTCCTCTCAATATAACAATTTAAGTCAAGGATTTGGTAAATATATATATTATTTTTATTATTCTATTATCTGTTTGTTGAATCACCAATTAACTGATGGGTTTTGATAGAATAAATCTTATATTCAAAGACTCTTGGAATGGCATTTATATCAATAACCACATGTTGGGCATTATGGAAATTTTAAGTCCACAAGTGTGGAAGAGTGTTAAAGTTTCAACATAAAATAATTAAATTTAACAGTAGCCCTTAAACTTAAGCTTTTGGGTTTAGTGTTGATTTAACCATTTGCAAAGTATTTTTATGTTTTTTGCTACATGCTATATATTTGAAAACAATTTTTAAAACTCATCTTACCCAATGAGTTTTCTACCTTAAGAAACCTTTTTTTTTTTTTTTTTTAAATGGTTGAAAAGTTCTCTAGTATTTAGAAAAAATACTAGAAAACTTTTCAAGCATAACTTCTAAACAGGTCTCTAGTTTCTGATACCAAGACTAATGGTTGTGATATATTTGGCTTATGCAGAGAGTGCTGGAGTGTGGTTCAAAGAGGAATAGTACCAAACAAACATTTTCATTGTTCTTATCAGAGAAACTTTCTAAACGAAGTTATGGATTTGAGTATGGAGCATTAGGAGTTAAGAACCATTTAGAAAGAGTAATGTTTAAAGATATTGTGTCTAATGTCATGATAATGTAAGTTATATATTTTCAATGCTTTTTATATTTGTTGTCAGCTGTGTTTATAGTATCTTTTGATTGACATTATTAGCGATGTTTTCTATGGATATTAGCTTCTCCCCACAGCCCTCCCGGAAAAAGCATTTTAGTCCTTGGGTTTGCCACTTTCATGTATGCAAGGTAATGCAATCTGGAACCTGTCTTTCTTTATATGACGCAAGTGTATACTGTTAGTGTCCTCTGTATCTGGTATCAACTATAATTTATCCATCCAGGAAATTTTGAAGAAAAGAAGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGTGTGTATTTCTGTCCATTACCTTTCTGACAAGTTCGGCTTTGGTTCTTGCATCAGGAACTTTTGCATGCTTCTAATCTAACCATTCAAATATTTTAATGCAGAGTTTGGAGTCTGCAACGTTGGATATTGGTAAAACAATACGTCTTGAACATTTGCTGCTTGTTGCAAATTCTCTTTCGGCTACAGGAGAGTTTGTTGGCTTAAATGTGAAAGGATTGTCACATCAAAGGGAACATGCTTTGGTCAAAACACCCTTTATGCAAGCTTGCTTCTCGGTTAGTTATCTACTTGCTTATCTCATTTGAATCATCATAGTGAGCAAGCAGTGAATTTAAATTTAATCCATGGTTGGTTTTCTGCAGAGTCCTGGTGCTTGTTTTGTTAAAGCTGCCAAGGCTGGAATTAAGGACAACCTGTCAGGAAGTTTAGATGCCTTGGCATGGGGGAGAATTCCTTCGCTGGGAACCGGGGGACAGTTTGATATCCTATATTCTGGGAGGGTGAGATTATATGTTCCGACTTCATATTCAAAAACAAAATGTTTCAGCTCATAGATGCTTAAGACAAAAAAAATGTTTTTATGGGATTTCAGAAGTATTCCTTTGTTTTTTTATTAGGTAACAATTTCATTGATGTTGTGTGCGTGCAATCTTTTGAATTATATCTGTTATTTGGAATACGGTTTAAGCTAATGAGTTTTCAAACGTGTGGTAAATGAATTCACGATAATCCTAAGATGTTTGTCTTCTAATAATAAAAATGTTCCTATTTCATAGATGTGAGGATTTCTTCTCCTGCTAATATTCCACGTTATAGGCTTTTTGTTCTTATCCTCCTACTGACTCTTGCAGGTCTCTTTAATATTATACGTACATATATGCATATATATTATATTTAACCATTAGTTTGGTCACATAATTACGGTGCTTCTTGTTCAAGTTTAGCGCGGCATTTCTTTTCTTTTTTTATTGAAATGTTGTTAATGAGTATACATAAAAATATCCAATTTATCCAAAAAAAAAAAAAAAAAGAAAAAGAAAATCAAAGTTCAAACCATGAGTGATAATTAAAATGTATACTATATTATCTCATTCTAGATGCATGGAAATTTTCATTTCTACCAGTCTATTCAATGGTCACTGTGCGTGTTATCTGTTGCTTCAGGTTTTTGCGGTACTTCAGCTATCTAATGGTTTCAAATTGTTCTTACTCTTCAGGGGCATGAGCTTAATAAGCCTGTCGATGTTTATAATCTACTGGGTGGCCAAAGCATTTGTGAGAAGCAGAATGCAAAGATCGAATCCCTTGATAAGAACAATATATCTGAGAAATATAGTGCTCAGTTAGTGCTTATAAATGGTGGTTCTACCATTAAAGGACTTAAAAAGCTGGATAGTGTATCTAAATCAATTTTAAGGGAATTTTTGACACTGAACGATATTCAGAAGCTGTCGTTTGCATTGAGAACCATTTTACACAAGTTAGTTCTTCCTCTTTGGGCACTGCTCAGCTCTTACAATTGGCATCATTTCATTAGATGTGTCCTTTTTTCTATTTGGACATTTTATTTTATTTATTATTTCCATTTCGGAAAGAAAATAAGAAAACACCAATCCATGTTCTAGCTTAATGAATAATTGTTTGTTGCTTGAAAATCACTTACATTTCAGATTGTTCCTTTTTAAAATTTCTAATGTTGCTGAAATGCCATATGCAGAATGGAGTCGGAGTTCTAAAAGGTTTCACTTTTGTGTGTAGGTACTCTTTAAACGAAAGATTAAATGAAGTGGACAAATCAACTTTGATGATGGCTTTATACTTTCATCCTCATAGGGATGAAAAGATTGGTGTTGGAGCACAGGACATAAAGGTATTTTCTATAGAAGATGAAACTCTTTTGGCGTATTGCATCTATTCCCCCTCCCTGAACCCCTAAACAGAAATAATATAATAAACAAAGAAAGGAGAAACAAGAACAAGTTTCTTTTGTGATTCATACCTTTTTATCTTCCTTTGCTCTGGGCAATATTCTGCCTTTCATTCTTGTCCAGGTTCCAGCAATTATGGACAGACTTTTCTTTTGAATTTTTTAGTTCGATAATATGTGGGGGTGCGGGGATTCAAACCTTTGATGTGTAGGTCAGCTGTGCATGTTAGTTGAGCTATGCTCAAATTAGTGACTTTTCTAAACAAGGGAGAAAATCCTTGTAGGATGATAGAGATCTGGGAGTTGGAACTTTGCCTGGTGGAAACGGAGCATTTTAGAGTAGTTTAACTAGATGTAAGATCCCATTATTTTCTTGTGCAGCCCGCTATGCAAATGTTTATTTTCTTTGGGATCAATATCTAAGGTGCGGCTCAGTGGCAGGCTCAAAAAGGTTAGATAGGCCGGAGCCCAAGTTAGCAAGGAGAAGTCATCTCATTGACTGCCTTTCAGACCGAGAGCTATTAAGCATTGTTTATGTTCACTGGAAGCCCAAGAATTTGTTTGAAGAATAAGCACCACTTTGAGCATATTATTTATTTACTGTAGCCACGAGGTCCTATGGTGCCACGTGAGGGTGACAGTATGCAGTTTCACATCCATATGATGAAGATGAACATCTCTTGAGGCAACTGAATGTTGAGTGCCTAAGAGGAGCAAGTGGACACTTGTTCCTTGTCTGTTCCAATCTTGTGAAGAGTCTCAGCTCCAATCCTACCCCACAGTATTCCATTTCGGCTCTCACTAGCTCTTTTCTGGGTCTGTACAAAGAGCATCCCCACAGGGTTTTAACCTCTGACCTTTCATAGAATTGGTTGGCTCAGCTATTAGAACTTGACTTTGGTAACACACTAGGTTGATGTTGAACAAGTATAGAGTTTCATGGCTTCTTATAAGGATGGATTTTTAGAATTAAAAAGAGCACAAGTTCTAGTCACCCTTAGCTTAAGTTCCATTGTTTTCGACATAATTGAAACCACATGAGATGTTATTACCTTCTCTTTTGCCATACAGTCAAGGTTACAATAAAAAATCTGATTGGATCTGAATGCACTTAAAGTAGAAATATGTAGTAAGAAATTCAACTCTTTCAGCTCTCTGAAAATGGTCTACATCAGACTGGATTCTGCCCCCAACCAGGACTCAGATAGGACTTGGCTTGCAAAAGGAGACGAGATTAGAATAAAGTGCAGTTGGTGAAATAGGTGGTTCACTACATGTCCAAGCATCTGACTAATATCAGTGACCTAAAGGAGAGTGAGGTTCATCGTTCATGCAATGGGTTCTTCAAGTGAGGCAGTAGGGATCCAGTTGGTCAGCTGATGACACTGGGGAAGACAAATTACAGTCACTTCAAAGTAGTACTTTAATTCTTAAATTTCAGATTCTAGCTTCCCTCAGAAGCAGGATCCAGTTGGTCAGCTGATGACACTTGCTGGTGGAGACAAATAACCGTCACTTCAGATGACCACTCAAATTTCAAATTCTAGCCTCCTTTAGAAGCAAACCCCAATGGTCAAGAAACTTCCATGACCCCACTCACTATCACTCTATATAAATCCCCTATGAACTGTAACTTTGAAATCTGAAGCTATTCTCCTGTATCAATGGTAATAGCCTTATTGTGTAGGGAAGGGATCCAATACTTTAAACAACTGACATATGGACTACCAACACGTTCGATGTTTTTATCCTTTCCTCAGTTAATCAGTGTTTTTGTCCTAAACTGTGGCTCAAATTATGTAGTTGATCTTGATGCGGTTATTTTATTTCCTGCTACCTGTTTAATTATTCATGGAATTAATGTAATCATGTCTACCTGCTACCTGGTTGTACTTTTGCAGGTTGGTAACCACTCAAAGTATCAAAATACACGTTGCTTCGTATTGATACGATCAGATGGAACGACAGAAGATTTTTCGTACCACAAGTGTGTTTTAGGTGCTTTGGAGATCATTGCCCCGCATAGAGTAAAGGGTTATCAGTCTAAATGGATGCAAGAA

mRNA sequence

GTGGTGAACTGGCTTGGGGGGGAGAAGAAGTTGTATGAGTTCTGCTGGAGACTTTCGACCAGGCCAGAAGTCTCAGTTTCAGGAATCTCCATAAAGATCAAGTTTTACTTAGCCTTCCAATATCCCCTTCTTGTCCATGTGATGATCCATATGGAAGATGAACAGGATGGTGAGCTACCAATTCCATCTGGTCTCGTTACTGGCATAAACTTTAGTGTCTCAACTCAGCAAGATACAGAGAATATAGCAGTAATGACAGTTGATGCATCCAGCGAGGTATCTGATCCTAAGTTGGGACTTCCGAATCCATCTTATCAGTGCACCACATGTGGTGCTAGCTCCCTAAAATCTTGTGAAGGACATTTTGGGGTTATCAAATTCCCATATACTATAATCCATCCTTATTTTCTCTCGGAAGTTGCACAAGTGTTGAATAAAGTTTGTCCAGGGTGTAAATCTATCAGGCAGGAACTATGGGGCAAGGTTGAGGATCCAACATCTGAATATCATCGACCTAAAGGTTGCAGATATTGTTTTGGAAGTCTAAAGGATTGGTATCCGCCTATGAGGTTTAAGCTTTCAACTACTGATATGTTCAAAAAAAGTATGATTATGGTGGAAGTGAAAGAAAATATGTCGAAGAAATATCAAAAGAGGGTGGCTAAAGGAGGCTTGCCTTCAGATTATTGGAATTTTATCCCTAAGGATGAACAACAGGAAGAAAGTTATTGTAGACCAAACAGGAAAATCCTAACGCATGCTCAGGTTCATTATTTGTTGAAAGACATTGACCCAAAGTTTCTCAAAAAGTTTGTGCCTGCAATAGATTCACTGTTTCTAAACTCTTTCCCTGTTACTCCAAACAGTCATCGTGTGACTGAAATGACACATTCATTTTCAAATGGACAGCGATTGATCTTTCTTAGCCCAGAGAAGTTACAGAGTAAAGATTTGGTTTACCAGCAAAAGAAAATTAAGGATACTGCTACTAGCTCATATGGTTTAAGATGGATCAAAGATGTTGTTCTTGGAAAGCGGAGTGACCATTGTTTTCGCATGGTTGTTGTTGGCGATCCAAACATTGAGTTAAGTGAAATTGGCATACCATGTCATGTTGCAGAGAGGTTGCAAATATCTGAACATCTGAGTTCTTGGAATATGAAGAAATTAAGCACTTCTTGTTACCTTCATCTTGTTGAAAAGGGAGAGATCTTTGTTCGTCGTGAAGGTCGTCTAGTTCGTGTACGTAATGTTCTTGAACTTAATATGGGGGACACTATATATAGGCCCCTAGCTGATGGGGATGTTGTGCTGGTTAATCGACCACCATCCATACATCAGCACTCACTTATTGCTTTATCTGTCAAGCTTCTTCCTGTCTCCTCAGTTCTTTCCTTAAACCCACTTTGTTGTTCTCCTTTCCGTGGAGATTTTGATGGTGACTGCCTTCATGGTTATGTTCCTCAATCACTCGAAGCCCGAGTGGAAGTTAGAGAGCTGGTTTCTCTAGATAGACAGCTAATTAATGGCCAAAGTGGTAGAAATCTGCTGTCACTTAGTCATGATAGTTTAACTGCTGCTCATTTAATTATGGAAGATGGAGTTTCGTTAAATCTTTTCCAGATGCAGCAGTTGCAAATGCTCGCTTTACATCAGTTGTTGCCCCCAGCAATTTTAAAAGCTCCTTTGCTTAGAAATTGCGCTTGGACTGGTAAACAGTTATTCAGCACCCTCCTACCTCCTGATTTTGATTATTCTTCTCCTTCTCACTGTGTCCTTATTGAAAATGGAGAATTAATATCTTCGGAAGGATCTTACTGGCTCCGCGATAGTGGCAGAAACCTCTTCCAAGCGCTAATAGAACACTGTGAAGGCAAGACCCTTGACTACTTGCACGATGCTCAAGGGGTTCTTTGTGAATGGTTATCAATGAGGGGCTTGAGTGTTTCATTGTCAGACTTGTACCTATCCGTGGATTCATACTCTCACAAAAACATGATGGATGATATCTTTTGTGGGTTACAGGAAGCTGAGGAAACATGTAATTTAAAGCAGCTGATGGTGGATGCACATAAAGATATCCTTACTGAAGATGACGAAGATAATCAACACGTGTTGTCTATTGCTGTGGATCGTTTAAGTTATGAGAAGCAGAAATCTGCTGCTCTAAATCAAGCTTCTGTTGATGCTTTCAAGAAAGTTTTTCGTGATATACAAAATCTAGTTTACAAGTATTCTGGTAAAGACAATTCACTTCTTACCATGTTCAAGGCTGGAAGCAAGGGTAATTTGCTAAAACTAGTTCAGCATAGCATGTGTCTTGGCTTGCAACACTCTTTGGTTACTCTATCCTTTAGCCTTCCACATAAGCTTACGTGTTCTGCATGGAACAGCCAGAAGATGCCTCGTTATACTCAGAAGGATGGTCTTCCTGACCGTACGTCGTCTTTCATACCATATGCTGTGGTTGAAAGTTCCTTTCTCTCAGGGCTTAATCCGTTTGAATGTTTTGCTCATTCGGTGACAAATCGAGATAGCTCTTTCAGCGACAATGCTGAAGTTCCTGGCACTTTGACACGAAAACTTACATTCCTAATGCGGGATATATATACTGCATATGATGGAACAGTGAGGAATGCATATGGAAATCAGCTGGTTCAGTTTTCTTATGACATTGATAGACCTACTAGCGTCTCTAATGAATTGGATAGCGAGAACAATAATAGAGATCATGATATAGGTGGTCATCCTGTTGGGTCATTGGCTGCCTGTGCCATGTCAGAAGCTGCATATAGTGCTCTGGACCAACCAATTAGTCTACTTGAAGCTTCCCCATTGCTAAACCTAAAGAGAGTGCTGGAGTGTGGTTCAAAGAGGAATAGTACCAAACAAACATTTTCATTGTTCTTATCAGAGAAACTTTCTAAACGAAGTTATGGATTTGAGTATGGAGCATTAGGAGTTAAGAACCATTTAGAAAGAGTAATGTTTAAAGATATTGTGTCTAATGTCATGATAATCTTCTCCCCACAGCCCTCCCGGAAAAAGCATTTTAGTCCTTGGGTTTGCCACTTTCATGTATGCAAGAGTTTGGAGTCTGCAACGTTGGATATTGGTAAAACAATACGTCTTGAACATTTGCTGCTTGTTGCAAATTCTCTTTCGGCTACAGGAGAGTTTGTTGGCTTAAATGTGAAAGGATTGTCACATCAAAGGGAACATGCTTTGGTCAAAACACCCTTTATGCAAGCTTGCTTCTCGAGTCCTGGTGCTTGTTTTGTTAAAGCTGCCAAGGCTGGAATTAAGGACAACCTGTCAGGAAGTTTAGATGCCTTGGCATGGGGGAGAATTCCTTCGCTGGGAACCGGGGGACAGTTTGATATCCTATATTCTGGGAGGGGGCATGAGCTTAATAAGCCTGTCGATGTTTATAATCTACTGGGTGGCCAAAGCATTTGTGAGAAGCAGAATGCAAAGATCGAATCCCTTGATAAGAACAATATATCTGAGAAATATAGTGCTCAGTTAGTGCTTATAAATGGTGGTTCTACCATTAAAGGACTTAAAAAGCTGGATAGTGTATCTAAATCAATTTTAAGGGAATTTTTGACACTGAACGATATTCAGAAGCTGTCGTTTGCATTGAGAACCATTTTACACAAGTACTCTTTAAACGAAAGATTAAATGAAGTGGACAAATCAACTTTGATGATGGCTTTATACTTTCATCCTCATAGGGATGAAAAGATTGGTGTTGGAGCACAGGACATAAAGTTAATCAGTGTTTTTGTCCTAAACTGTGGCTCAAATTATGTAGTTGATCTTGATGCGGTTGGTAACCACTCAAAGTATCAAAATACACGTTGCTTCGTATTGATACGATCAGATGGAACGACAGAAGATTTTTCGTACCACAAGTGTGTTTTAGGTGCTTTGGAGATCATTGCCCCGCATAGAGTAAAGGGTTATCAGTCTAAATGGATGCAAGAA

Coding sequence (CDS)

GTGGTGAACTGGCTTGGGGGGGAGAAGAAGTTGTATGAGTTCTGCTGGAGACTTTCGACCAGGCCAGAAGTCTCAGTTTCAGGAATCTCCATAAAGATCAAGTTTTACTTAGCCTTCCAATATCCCCTTCTTGTCCATGTGATGATCCATATGGAAGATGAACAGGATGGTGAGCTACCAATTCCATCTGGTCTCGTTACTGGCATAAACTTTAGTGTCTCAACTCAGCAAGATACAGAGAATATAGCAGTAATGACAGTTGATGCATCCAGCGAGGTATCTGATCCTAAGTTGGGACTTCCGAATCCATCTTATCAGTGCACCACATGTGGTGCTAGCTCCCTAAAATCTTGTGAAGGACATTTTGGGGTTATCAAATTCCCATATACTATAATCCATCCTTATTTTCTCTCGGAAGTTGCACAAGTGTTGAATAAAGTTTGTCCAGGGTGTAAATCTATCAGGCAGGAACTATGGGGCAAGGTTGAGGATCCAACATCTGAATATCATCGACCTAAAGGTTGCAGATATTGTTTTGGAAGTCTAAAGGATTGGTATCCGCCTATGAGGTTTAAGCTTTCAACTACTGATATGTTCAAAAAAAGTATGATTATGGTGGAAGTGAAAGAAAATATGTCGAAGAAATATCAAAAGAGGGTGGCTAAAGGAGGCTTGCCTTCAGATTATTGGAATTTTATCCCTAAGGATGAACAACAGGAAGAAAGTTATTGTAGACCAAACAGGAAAATCCTAACGCATGCTCAGGTTCATTATTTGTTGAAAGACATTGACCCAAAGTTTCTCAAAAAGTTTGTGCCTGCAATAGATTCACTGTTTCTAAACTCTTTCCCTGTTACTCCAAACAGTCATCGTGTGACTGAAATGACACATTCATTTTCAAATGGACAGCGATTGATCTTTCTTAGCCCAGAGAAGTTACAGAGTAAAGATTTGGTTTACCAGCAAAAGAAAATTAAGGATACTGCTACTAGCTCATATGGTTTAAGATGGATCAAAGATGTTGTTCTTGGAAAGCGGAGTGACCATTGTTTTCGCATGGTTGTTGTTGGCGATCCAAACATTGAGTTAAGTGAAATTGGCATACCATGTCATGTTGCAGAGAGGTTGCAAATATCTGAACATCTGAGTTCTTGGAATATGAAGAAATTAAGCACTTCTTGTTACCTTCATCTTGTTGAAAAGGGAGAGATCTTTGTTCGTCGTGAAGGTCGTCTAGTTCGTGTACGTAATGTTCTTGAACTTAATATGGGGGACACTATATATAGGCCCCTAGCTGATGGGGATGTTGTGCTGGTTAATCGACCACCATCCATACATCAGCACTCACTTATTGCTTTATCTGTCAAGCTTCTTCCTGTCTCCTCAGTTCTTTCCTTAAACCCACTTTGTTGTTCTCCTTTCCGTGGAGATTTTGATGGTGACTGCCTTCATGGTTATGTTCCTCAATCACTCGAAGCCCGAGTGGAAGTTAGAGAGCTGGTTTCTCTAGATAGACAGCTAATTAATGGCCAAAGTGGTAGAAATCTGCTGTCACTTAGTCATGATAGTTTAACTGCTGCTCATTTAATTATGGAAGATGGAGTTTCGTTAAATCTTTTCCAGATGCAGCAGTTGCAAATGCTCGCTTTACATCAGTTGTTGCCCCCAGCAATTTTAAAAGCTCCTTTGCTTAGAAATTGCGCTTGGACTGGTAAACAGTTATTCAGCACCCTCCTACCTCCTGATTTTGATTATTCTTCTCCTTCTCACTGTGTCCTTATTGAAAATGGAGAATTAATATCTTCGGAAGGATCTTACTGGCTCCGCGATAGTGGCAGAAACCTCTTCCAAGCGCTAATAGAACACTGTGAAGGCAAGACCCTTGACTACTTGCACGATGCTCAAGGGGTTCTTTGTGAATGGTTATCAATGAGGGGCTTGAGTGTTTCATTGTCAGACTTGTACCTATCCGTGGATTCATACTCTCACAAAAACATGATGGATGATATCTTTTGTGGGTTACAGGAAGCTGAGGAAACATGTAATTTAAAGCAGCTGATGGTGGATGCACATAAAGATATCCTTACTGAAGATGACGAAGATAATCAACACGTGTTGTCTATTGCTGTGGATCGTTTAAGTTATGAGAAGCAGAAATCTGCTGCTCTAAATCAAGCTTCTGTTGATGCTTTCAAGAAAGTTTTTCGTGATATACAAAATCTAGTTTACAAGTATTCTGGTAAAGACAATTCACTTCTTACCATGTTCAAGGCTGGAAGCAAGGGTAATTTGCTAAAACTAGTTCAGCATAGCATGTGTCTTGGCTTGCAACACTCTTTGGTTACTCTATCCTTTAGCCTTCCACATAAGCTTACGTGTTCTGCATGGAACAGCCAGAAGATGCCTCGTTATACTCAGAAGGATGGTCTTCCTGACCGTACGTCGTCTTTCATACCATATGCTGTGGTTGAAAGTTCCTTTCTCTCAGGGCTTAATCCGTTTGAATGTTTTGCTCATTCGGTGACAAATCGAGATAGCTCTTTCAGCGACAATGCTGAAGTTCCTGGCACTTTGACACGAAAACTTACATTCCTAATGCGGGATATATATACTGCATATGATGGAACAGTGAGGAATGCATATGGAAATCAGCTGGTTCAGTTTTCTTATGACATTGATAGACCTACTAGCGTCTCTAATGAATTGGATAGCGAGAACAATAATAGAGATCATGATATAGGTGGTCATCCTGTTGGGTCATTGGCTGCCTGTGCCATGTCAGAAGCTGCATATAGTGCTCTGGACCAACCAATTAGTCTACTTGAAGCTTCCCCATTGCTAAACCTAAAGAGAGTGCTGGAGTGTGGTTCAAAGAGGAATAGTACCAAACAAACATTTTCATTGTTCTTATCAGAGAAACTTTCTAAACGAAGTTATGGATTTGAGTATGGAGCATTAGGAGTTAAGAACCATTTAGAAAGAGTAATGTTTAAAGATATTGTGTCTAATGTCATGATAATCTTCTCCCCACAGCCCTCCCGGAAAAAGCATTTTAGTCCTTGGGTTTGCCACTTTCATGTATGCAAGAGTTTGGAGTCTGCAACGTTGGATATTGGTAAAACAATACGTCTTGAACATTTGCTGCTTGTTGCAAATTCTCTTTCGGCTACAGGAGAGTTTGTTGGCTTAAATGTGAAAGGATTGTCACATCAAAGGGAACATGCTTTGGTCAAAACACCCTTTATGCAAGCTTGCTTCTCGAGTCCTGGTGCTTGTTTTGTTAAAGCTGCCAAGGCTGGAATTAAGGACAACCTGTCAGGAAGTTTAGATGCCTTGGCATGGGGGAGAATTCCTTCGCTGGGAACCGGGGGACAGTTTGATATCCTATATTCTGGGAGGGGGCATGAGCTTAATAAGCCTGTCGATGTTTATAATCTACTGGGTGGCCAAAGCATTTGTGAGAAGCAGAATGCAAAGATCGAATCCCTTGATAAGAACAATATATCTGAGAAATATAGTGCTCAGTTAGTGCTTATAAATGGTGGTTCTACCATTAAAGGACTTAAAAAGCTGGATAGTGTATCTAAATCAATTTTAAGGGAATTTTTGACACTGAACGATATTCAGAAGCTGTCGTTTGCATTGAGAACCATTTTACACAAGTACTCTTTAAACGAAAGATTAAATGAAGTGGACAAATCAACTTTGATGATGGCTTTATACTTTCATCCTCATAGGGATGAAAAGATTGGTGTTGGAGCACAGGACATAAAGTTAATCAGTGTTTTTGTCCTAAACTGTGGCTCAAATTATGTAGTTGATCTTGATGCGGTTGGTAACCACTCAAAGTATCAAAATACACGTTGCTTCGTATTGATACGATCAGATGGAACGACAGAAGATTTTTCGTACCACAAGTGTGTTTTAGGTGCTTTGGAGATCATTGCCCCGCATAGAGTAAAGGGTTATCAGTCTAAATGGATGCAAGAA

Protein sequence

VVNWLGGEKKLYEFCWRLSTRPEVSVSGISIKIKFYLAFQYPLLVHVMIHMEDEQDGELPIPSGLVTGINFSVSTQQDTENIAVMTVDASSEVSDPKLGLPNPSYQCTTCGASSLKSCEGHFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRQELWGKVEDPTSEYHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSKKYQKRVAKGGLPSDYWNFIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPAIDSLFLNSFPVTPNSHRVTEMTHSFSNGQRLIFLSPEKLQSKDLVYQQKKIKDTATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEKGEIFVRREGRLVRVRNVLELNMGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQLQMLALHQLLPPAILKAPLLRNCAWTGKQLFSTLLPPDFDYSSPSHCVLIENGELISSEGSYWLRDSGRNLFQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKNMMDDIFCGLQEAEETCNLKQLMVDAHKDILTEDDEDNQHVLSIAVDRLSYEKQKSAALNQASVDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFSLPHKLTCSAWNSQKMPRYTQKDGLPDRTSSFIPYAVVESSFLSGLNPFECFAHSVTNRDSSFSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELDSENNNRDHDIGGHPVGSLAACAMSEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFSLFLSEKLSKRSYGFEYGALGVKNHLERVMFKDIVSNVMIIFSPQPSRKKHFSPWVCHFHVCKSLESATLDIGKTIRLEHLLLVANSLSATGEFVGLNVKGLSHQREHALVKTPFMQACFSSPGACFVKAAKAGIKDNLSGSLDALAWGRIPSLGTGGQFDILYSGRGHELNKPVDVYNLLGGQSICEKQNAKIESLDKNNISEKYSAQLVLINGGSTIKGLKKLDSVSKSILREFLTLNDIQKLSFALRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIKLISVFVLNCGSNYVVDLDAVGNHSKYQNTRCFVLIRSDGTTEDFSYHKCVLGALEIIAPHRVKGYQSKWMQE
BLAST of ClCG00G000380 vs. Swiss-Prot
Match: NRPD1_ARATH (DNA-directed RNA polymerase IV subunit 1 OS=Arabidopsis thaliana GN=NRPD1 PE=1 SV=1)

HSP 1 Score: 1005.7 bits (2599), Expect = 4.6e-292
Identity = 534/1022 (52.25%), Postives = 696/1022 (68.10%), Query Frame = 1

Query: 51   MEDEQDGELPIPSGLVTGINFSVSTQQDTENIAVMTVDASSEVSDPKLGLPNPSYQCTTC 110
            MED+ + EL +P G +T I FS+S   D + ++V+ V+A ++V+D +LGLPNP   C TC
Sbjct: 1    MEDDCE-ELQVPVGTLTSIGFSISNNNDRDKMSVLEVEAPNQVTDSRLGLPNPDSVCRTC 60

Query: 111  GASSLKSCEGHFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRQELWGKVEDPTSEYH 170
            G+   K CEGHFGVI F Y+II+PYFL EVA +LNK+CPGCK IR++ +   ED      
Sbjct: 61   GSKDRKVCEGHFGVINFAYSIINPYFLKEVAALLNKICPGCKYIRKKQFQITED------ 120

Query: 171  RPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSKKYQKRVAKGGLPSDYW 230
            +P+ CRYC  +L   YP M+F+++T ++F++S I+VEV E    K +KR     LP DYW
Sbjct: 121  QPERCRYC--TLNTGYPLMKFRVTTKEVFRRSGIVVEVNEESLMKLKKRGVL-TLPPDYW 180

Query: 231  NFIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPAIDSLFLNSFPVTPNSH 290
            +F+P+D   +ES  +P R+I+THAQV+ LL  ID + +KK +P  +SL L SFPVTPN +
Sbjct: 181  SFLPQDSNIDESCLKPTRRIITHAQVYALLLGIDQRLIKKDIPMFNSLGLTSFPVTPNGY 240

Query: 291  RVTEMTHSFSNGQRLIFLSPEKLQSK----------------DLVYQQKKIKDTATSSYG 350
            RVTE+ H F NG RLIF    ++  K                + +   +   +T +SS  
Sbjct: 241  RVTEIVHQF-NGARLIFDERTRIYKKLVGFEGNTLELSSRVMECMQYSRLFSETVSSS-- 300

Query: 351  LRWIKDVV--LGKRSD--------------------HCFRMVVVGDPNIELSEIGIPCHV 410
                KD      K+SD                    H FR VVVGDP+++L+EIGIP  +
Sbjct: 301  ----KDSANPYQKKSDTPKLCGLRFMKDVLLGKRSDHTFRTVVVGDPSLKLNEIGIPESI 360

Query: 411  AERLQISEHLSSWNMKKLSTSCYLHLVEKGEIFVRREGRLVRVRNVLELNMGDTIYRPLA 470
            A+RLQ+SEHL+  N ++L TS    L++  E+ VRR  RLV ++ V +L  GD I+R L 
Sbjct: 361  AKRLQVSEHLNQCNKERLVTSFVPTLLDNKEMHVRRGDRLVAIQ-VNDLQTGDKIFRSLM 420

Query: 471  DGDVVLVNRPPSIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLE 530
            DGD VL+NRPPSIHQHSLIA++V++LP +SV+SLNP+CC PFRGDFDGDCLHGYVPQS++
Sbjct: 421  DGDTVLMNRPPSIHQHSLIAMTVRILPTTSVVSLNPICCLPFRGDFDGDCLHGYVPQSIQ 480

Query: 531  ARVEVRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLI-MEDGVSLNLFQMQQLQMLALH 590
            A+VE+ ELV+LD+QLIN Q+GRNLLSL  DSLTAA+L+ +E    LN  QMQQLQM    
Sbjct: 481  AKVELDELVALDKQLINRQNGRNLLSLGQDSLTAAYLVNVEKNCYLNRAQMQQLQMYCPF 540

Query: 591  QLLPPAILKA-PLLRNCAWTGKQLFSTLLPPDFDYSSPSHCVLIENGELIS-SEGSYWLR 650
            QL PPAI+KA P      WTG QLF  L PP FDY+ P + V++ NGEL+S SEGS WLR
Sbjct: 541  QLPPPAIIKASPSSTEPQWTGMQLFGMLFPPGFDYTYPLNNVVVSNGELLSFSEGSAWLR 600

Query: 651  DSGRNLFQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKNMMDD 710
            D   N  + L++H +GK LD ++ AQ +L +WL MRGLSVSL+DLYLS D  S KN+ ++
Sbjct: 601  DGEGNFIERLLKHDKGKVLDIIYSAQEMLSQWLLMRGLSVSLADLYLSSDLQSRKNLTEE 660

Query: 711  IFCGLQEAEETCNLKQLMVDAHKDILTEDDEDNQHVLSIAVDRLSYEKQKSAALNQASVD 770
            I  GL+EAE+ CN +QLMV++ +D L  + ED +      + R  YE+QKSA L++ +V 
Sbjct: 661  ISYGLREAEQVCNKQQLMVESWRDFLAVNGEDKEEDSVSDLARFCYERQKSATLSELAVS 720

Query: 771  AFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFSLPH 830
            AFK  +RD+Q L Y+Y  + NS L M KAGSKGN+ KLVQHSMC+GLQ+S V+LSF  P 
Sbjct: 721  AFKDAYRDVQALAYRYGDQSNSFLIMSKAGSKGNIGKLVQHSMCIGLQNSAVSLSFGFPR 780

Query: 831  KLTCSAWNSQKMPRYTQKDGLPDRTSSFIPYAVVESSFLSGLNPFECFAHSVTNRDSSFS 890
            +LTC+AWN    P    K      T S++PY V+E+SFL+GLNP E F HSVT+RDSSFS
Sbjct: 781  ELTCAAWNDPNSPLRGAKGKDSTTTESYVPYGVIENSFLTGLNPLESFVHSVTSRDSSFS 840

Query: 891  DNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELDSENNNR 950
             NA++PGTL+R+L F MRDIY AYDGTVRN++GNQLVQF+Y+ D P              
Sbjct: 841  GNADLPGTLSRRLMFFMRDIYAAYDGTVRNSFGNQLVQFTYETDGPV------------- 900

Query: 951  DHDIGGHPVGSLAACAMSEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFSLF 1010
              DI G  +GSL+ACA+SEAAYSALDQPISLLE SPLLNLK VLECGSK+   +QT SL+
Sbjct: 901  -EDITGEALGSLSACALSEAAYSALDQPISLLETSPLLNLKNVLECGSKKGQREQTMSLY 960

Query: 1011 LSEKLSKRSYGFEYGALGVKNHLERVMFKDIVSNVMIIFSPQPSRKKHFSPWVCHFHVCK 1032
            LSE LSK+ +GFEYG+L +KNHLE++ F +IVS  MIIFSP  + K   SPWVCHFH+ +
Sbjct: 961  LSEYLSKKKHGFEYGSLEIKNHLEKLSFSEIVSTSMIIFSPSSNTKVPLSPWVCHFHISE 990


HSP 2 Score: 244.2 bits (622), Expect = 8.1e-63
Identity = 135/302 (44.70%), Postives = 181/302 (59.93%), Query Frame = 1

Query: 1030 SLESATLDIGKTIRLEHLLLVANSLSATGEFVGLNVKGLSHQREHALVKTPFMQACFSSP 1089
            +LESA  D GK I  EHLLLVA+SLS TGEFV LN KG S QR+      PF QACFSSP
Sbjct: 1166 NLESAVSDTGKEILREHLLLVADSLSVTGEFVALNAKGWSKQRQVESTPAPFTQACFSSP 1225

Query: 1090 GACFVKAAKAGIKDNLSGSLDALAWGRIPSLGTGGQFDILYSGRGHELNKPVDVYNLLGG 1149
              CF+KAAK G++D+L GS+DALAWG++P  GTG QF+I+ S + H    PVDVY+LL  
Sbjct: 1226 SQCFLKAAKEGVRDDLQGSIDALAWGKVPGFGTGDQFEIIISPKVHGFTTPVDVYDLLSS 1285

Query: 1150 QSICEKQNAKIESLDKNNISEKYSAQLVLINGGSTIKGLKKLD--SVSKSILREFLTLND 1209
                 + N+  +       S+K + Q   +   + +K +K LD   +  S+LR   T  +
Sbjct: 1286 TKTMRRTNSAPK-------SDKATVQPFGLLHSAFLKDIKVLDGKGIPMSLLRTIFTWKN 1345

Query: 1210 IQKLSFALRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIKLISVFVLNC 1269
            I+ LS +L+ ILH Y +NE LNE D+  + M L  HP+  EKIG G + I++        
Sbjct: 1346 IELLSQSLKRILHSYEINELLNERDEGLVKMVLQLHPNSVEKIGPGVKGIRVAK------ 1405

Query: 1270 GSNYVVDLDAVGNHSKYQNTRCFVLIRSDGTTEDFSYHKCVLGALEIIAPHRVKGYQSKW 1329
                          SK+ ++ CF ++R DGT EDFSYHKCVLGA +IIAP ++  Y+SK+
Sbjct: 1406 --------------SKHGDSCCFEVVRIDGTFEDFSYHKCVLGATKIIAPKKMNFYKSKY 1440

BLAST of ClCG00G000380 vs. Swiss-Prot
Match: NRPE1_ARATH (DNA-directed RNA polymerase V subunit 1 OS=Arabidopsis thaliana GN=NRPE1 PE=1 SV=1)

HSP 1 Score: 239.2 bits (609), Expect = 2.6e-61
Identity = 217/733 (29.60%), Postives = 330/733 (45.02%), Query Frame = 1

Query: 317  DLVYQQKKIKDTATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERL 376
            D+ Y   KI D+++S      ++ + + K S    R V+ GD    ++E+GIP  +A+R+
Sbjct: 290  DMRYGVSKISDSSSSKAWTEKMRTLFIRKGSGFSSRSVITGDAYRHVNEVGIPIEIAQRI 349

Query: 377  QISEHLSSWN----MKKLSTSCYLHLVEKGEIFVRREGRLVRVRNVLELNMGDTIYRPLA 436
               E +S  N     K +     L   +    +  R+G     +   EL  G  ++R + 
Sbjct: 350  TFEERVSVHNRGYLQKLVDDKLCLSYTQGSTTYSLRDGS----KGHTELKPGQVVHRRVM 409

Query: 437  DGDVVLVNRPPSIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLE 496
            DGDVV +NRPP+ H+HSL AL V +   ++V  +NPL CSP   DFDGDC+H + PQSL 
Sbjct: 410  DGDVVFINRPPTTHKHSLQALRVYVHEDNTV-KINPLMCSPLSADFDGDCVHLFYPQSLS 469

Query: 497  ARVEVRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQLQMLALHQ 556
            A+ EV EL S+++QL++  +G+ +L +  DSL +  +++E  V L+    QQL M     
Sbjct: 470  AKAEVMELFSVEKQLLSSHTGQLILQMGSDSLLSLRVMLE-RVFLDKATAQQLAMYGSLS 529

Query: 557  LLPPAILKAPLLRNCAWTGKQLFSTLLPPDFDYSSPSHCVLIENGELISSEGSYWLRDSG 616
            L PPA+ K+      AWT  Q+     P     S      L++  +L+  +       S 
Sbjct: 530  LPPPALRKSS-KSGPAWTVFQILQLAFPERL--SCKGDRFLVDGSDLLKFDFGVDAMGSI 589

Query: 617  RN--LFQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKNMMDDI 676
             N  +    +E    +TL +    Q +L E L   G S+SL DL     S S  +M  D+
Sbjct: 590  INEIVTSIFLEKGPKETLGFFDSLQPLLMESLFAEGFSLSLEDL-----SMSRADM--DV 649

Query: 677  FCGLQEAEETCNLKQLMVDAHKDILTEDDEDNQHVLSIAVDRLSYEKQKSAALNQASVDA 736
               L   E +  + +L +    ++  E+               S  K K  A N      
Sbjct: 650  IHNLIIREISPMVSRLRLSYRDELQLEN---------------SIHKVKEVAAN------ 709

Query: 737  FKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFSLPHK 796
            F      I+NL+     K NS +T           KLVQ +  LGLQ S     ++    
Sbjct: 710  FMLKSYSIRNLI---DIKSNSAIT-----------KLVQQTGFLGLQLSDKKKFYTKTLV 769

Query: 797  LTCSAWNSQKMPRYTQKDGLPDRTSSFIPYAVVESSFLSGLNPFECFAHSVTNRD--SSF 856
               + +  +K            R SS   + +V+  F  GL+P+E  AHS+  R+     
Sbjct: 770  EDMAIFCKRKY----------GRISSSGDFGIVKGCFFHGLDPYEEMAHSIAAREVIVRS 829

Query: 857  SDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELDSENNN 916
            S     PGTL + L  ++RDI    DGTVRN   N ++QF Y +          DSE  +
Sbjct: 830  SRGLAEPGTLFKNLMAVLRDIVITNDGTVRNTCSNSVIQFKYGV----------DSERGH 889

Query: 917  RDHDIGGHPVGSLAACAMSEAAYSALDQPISLLEASPLLN-----LKRVLEC--GSKRNS 976
            +     G PVG LAA AMS  AY A      +L++SP  N     +K VL C    +  +
Sbjct: 890  QGLFEAGEPVGVLAATAMSNPAYKA------VLDSSPNSNSSWELMKEVLLCKVNFQNTT 945

Query: 977  TKQTFSLFLSEKLSKRSYGFEYGALGVKNHLERVMFKDIVSNVMIIFSPQPSRKKHFSPW 1032
              +   L+L+E    + +  E  A  V+N L +V  KD     ++ +  QP+  + F   
Sbjct: 950  NDRRVILYLNECHCGKRFCQENAACTVRNKLNKVSLKDTAVEFLVEYRKQPTISEIFGID 945


HSP 2 Score: 72.4 bits (176), Expect = 4.2e-11
Identity = 40/130 (30.77%), Postives = 69/130 (53.08%), Query Frame = 1

Query: 1031 LESATLDIGKTIRLEHLLLVANSLSATGEFVGLNVKGLSHQREHALVKTPFMQACFSSPG 1090
            L ++   + K +  EH++L+AN+++ +G  +G N  G         +K PF +A   +P 
Sbjct: 1132 LSASVRMVSKGVLKEHIILLANNMTCSGTMLGFNSGGYKALTRSLNIKAPFTEATLIAPR 1191

Query: 1091 ACFVKAAKAGIKDNLSGSLDALAWGRIPSLGTGGQFDILYSGR--GHELNKPVDVYNLLG 1150
             CF KAA+    D+LS  + + +WG+   +GTG QF++L++ +  G +  +  DVY+ L 
Sbjct: 1192 KCFEKAAEKCHTDSLSTVVGSCSWGKRVDVGTGSQFELLWNQKETGLDDKEETDVYSFL- 1251

Query: 1151 GQSICEKQNA 1159
             Q +    NA
Sbjct: 1252 -QMVISTTNA 1259


HSP 3 Score: 70.5 bits (171), Expect = 1.6e-10
Identity = 36/108 (33.33%), Postives = 62/108 (57.41%), Query Frame = 1

Query: 51  MEDEQDGELPIPSGLVTGINFSVSTQQDT--ENIAVMTVDASSEVSDPKLGLPNPSYQCT 110
           ME+E   E  I  G + GI F++++  +   ++I+   ++  S++++  LGLP    +C 
Sbjct: 1   MEEESTSE--ILDGEIVGITFALASHHEICIQSISESAINHPSQLTNAFLGLPLEFGKCE 60

Query: 111 TCGASSLKSCEGHFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRQ 157
           +CGA+    CEGHFG I+ P  I HP  ++E+ Q+L+ +C  C  I++
Sbjct: 61  SCGATEPDKCEGHFGYIQLPVPIYHPAHVNELKQMLSLLCLKCLKIKK 106


HSP 4 Score: 54.3 bits (129), Expect = 1.2e-05
Identity = 39/142 (27.46%), Postives = 65/142 (45.77%), Query Frame = 1

Query: 1189 KKLDSVSKSILREFLTLNDIQKLSFALRTILHK--YSLNERLNEVDKS-TLMMALYFHPH 1248
            ++LDS +     E   L+D++ +   LR I+H   Y   + +++ DK+  L   L FHP 
Sbjct: 1727 QRLDSFTSE---EQELLSDVEPVMRTLRKIMHPSAYPDGDPISDDDKTFVLEKILNFHPQ 1786

Query: 1249 RDEKIGVGAQDIKLISVFVLNCGSNYVVDLDAVGNHSKYQNTRCFVLIRSDGTTEDFSYH 1308
            ++ K+G G                   VD   V  H+ + ++RCF ++ +DG  +DFSY 
Sbjct: 1787 KETKLGSG-------------------VDFITVDKHTIFSDSRCFFVVSTDGAKQDFSYR 1846

Query: 1309 KCVLGALEIIAPHRVKGYQSKW 1328
            K +   L    P R + +  K+
Sbjct: 1847 KSLNNYLMKKYPDRAEEFIDKY 1846

BLAST of ClCG00G000380 vs. Swiss-Prot
Match: RPB1_DICDI (DNA-directed RNA polymerase II subunit rpb1 OS=Dictyostelium discoideum GN=polr2a PE=2 SV=2)

HSP 1 Score: 193.7 bits (491), Expect = 1.2e-47
Identity = 166/598 (27.76%), Postives = 269/598 (44.98%), Query Frame = 1

Query: 338 IKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLH 397
           I+  ++GKR D   R V+  DPN+ + ++G+P  +A  L   E ++ +N+ K+       
Sbjct: 341 IRGNLMGKRVDFSARTVITADPNLSIDQVGVPRSIALNLTYPETVTPFNIDKMR-----E 400

Query: 398 LVEKG-------EIFVRREG-----RLVRVRNVLELNMGDTIYRPLADGDVVLVNRPPSI 457
           L+  G       +  +R +G     R V+  +   L  G  + R + DGDVV+ NR PS+
Sbjct: 401 LIRNGPSEHPGAKYIIREDGTRFDLRFVKKVSDTHLECGYKVERHINDGDVVIFNRQPSL 460

Query: 458 HQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVSLDR 517
           H+ S++   +K++P S+   LN    SP+  DFDGD ++ +VPQ+LE R EV E++ + R
Sbjct: 461 HKMSMMGHRIKVMPYST-FRLNLSVTSPYNADFDGDEMNLHVPQTLETRAEVIEIMMVPR 520

Query: 518 QLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQLQMLALHQLL-------PPAI 577
           Q+++ QS R ++ +  D+L  + L  +     + F  + L M  L  L        PPAI
Sbjct: 521 QIVSPQSNRPVMGIVQDTLLGSRLFTKR----DCFMEKDLVMNILMWLPSWDGKVPPPAI 580

Query: 578 LKAPLLRNCAWTGKQLFSTLLPP--------DFDYSSPSHC------VLIENGELISSEG 637
           LK   L    WTGKQLFS ++P           +   P+ C      V+IE GEL++  G
Sbjct: 581 LKPKQL----WTGKQLFSLIIPDINLIRFTSTHNDKEPNECSAGDTRVIIERGELLA--G 640

Query: 638 SYWLRD----SGRNLFQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDS 697
               R     +G  +   + EH       ++   Q V+  WL  RG ++ + D     DS
Sbjct: 641 ILCKRSLGAANGSIIHVVMNEHGHDTCRLFIDQTQTVVNHWLINRGFTMGIGDTI--ADS 700

Query: 698 YSHKNMMDDIFCGLQEAEET---CNLKQLMVDAHKDILTEDDEDNQHVLSIAVDRLSYEK 757
            +   +   I     + +E       KQ      K ++   ++    VL+ A D      
Sbjct: 701 ATMAKVTLTISSAKNQVKELIIKAQNKQFECQPGKSVIETFEQKVNQVLNKARDTAGSSA 760

Query: 758 QKSAALNQASVDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQ 817
           Q S +                         +DN+L  M  AGSKG+ + + Q   C+G Q
Sbjct: 761 QDSLS-------------------------EDNNLKAMVTAGSKGSFINISQMMACVGQQ 820

Query: 818 HSLVTLSFSLPHKLTCSAWNSQKMPRYTQKDGLPDRTSSFIPYAVVESSFLSGLNPFECF 877
                   ++  K     + S+ +P +T+ D  P+          VE+S+L GL P E F
Sbjct: 821 --------NVEGKRIPFGFQSRTLPHFTKDDYGPESR------GFVENSYLRGLTPQEFF 880

Query: 878 AHSVTNRDSSFSDNAEV--PGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDID 894
            H++  R+       +    G + R+L   M D+   YD TVRN+ G+ ++QF+Y  D
Sbjct: 881 FHAMGGREGLIDTAVKTSETGYIQRRLVKAMEDVSIKYDATVRNSLGD-VIQFAYGED 880

BLAST of ClCG00G000380 vs. Swiss-Prot
Match: RPB1_SCHPO (DNA-directed RNA polymerase II subunit rpb1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=rpb1 PE=1 SV=1)

HSP 1 Score: 191.8 bits (486), Expect = 4.7e-47
Identity = 165/593 (27.82%), Postives = 277/593 (46.71%), Query Frame = 1

Query: 342 VLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEK 401
           ++GKR D   R V+ GDPN+ L E+G+P  +A+ L   E ++ +N+ +L       LV  
Sbjct: 346 LMGKRVDFSARTVITGDPNLSLDELGVPRSIAKTLTYPETVTPYNIYQLQ-----ELVRN 405

Query: 402 G-------EIFVRREGRLVRVR-----NVLELNMGDTIYRPLADGDVVLVNRPPSIHQHS 461
           G       +  +R  G  + +R       + L  G  + R + DGDVV+ NR PS+H+ S
Sbjct: 406 GPDEHPGAKYIIRDTGERIDLRYHKRAGDIPLRYGWRVERHIRDGDVVIFNRQPSLHKMS 465

Query: 462 LIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVSLDRQLIN 521
           ++   ++++P S+   LN    SP+  DFDGD ++ +VPQS E R E++E+  + +Q+++
Sbjct: 466 MMGHRIRVMPYST-FRLNLSVTSPYNADFDGDEMNMHVPQSEETRAEIQEITMVPKQIVS 525

Query: 522 GQSGRNLLSLSHDSLTAAH--LIMEDGVSLNLFQMQQLQMLALHQLLPPAILKAPLLRNC 581
            QS + ++ +  D+L       + ++ ++ N      L +     +LPP ++  P     
Sbjct: 526 PQSNKPVMGIVQDTLAGVRKFSLRDNFLTRNAVMNIMLWVPDWDGILPPPVILKP---KV 585

Query: 582 AWTGKQLFSTLLPP------DFDYSSPSH----CVLIENGELI----------SSEG--- 641
            WTGKQ+ S ++P       D D  S S+     +LIENGE+I          +S+G   
Sbjct: 586 LWTGKQILSLIIPKGINLIRDDDKQSLSNPTDSGMLIENGEIIYGVVDKKTVGASQGGLV 645

Query: 642 -SYWLRDSGRNLFQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSH 701
            + W ++ G        E C+G    + +  Q V+  WL   G S+ + D     D+   
Sbjct: 646 HTIW-KEKGP-------EICKG----FFNGIQRVVNYWLLHNGFSIGIGDTIADADT--- 705

Query: 702 KNMMDDIFCGLQEAEETCNLKQLMVDAHKDILTEDDEDNQHVLSIAVDRLSYEKQKSAAL 761
              M ++   ++EA     + + + DA  + L  +             R S+E + S  L
Sbjct: 706 ---MKEVTRTVKEARR--QVAECIQDAQHNRLKPEPG--------MTLRESFEAKVSRIL 765

Query: 762 NQASVDAFKKVFRDIQNLVYKYSGKD-NSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVT 821
           NQA         RD      ++S KD N++  M  AGSKG+ + + Q S C+G Q     
Sbjct: 766 NQA---------RDNAGRSAEHSLKDSNNVKQMVAAGSKGSFINISQMSACVGQQ----- 825

Query: 822 LSFSLPHKLTCSAWNSQKMPRYTQKDGLPDRTSSFIPYAVVESSFLSGLNPFECFAHSVT 881
               +  K     +  + +P + + D  P+          +E+S+L GL P E F H++ 
Sbjct: 826 ---IVEGKRIPFGFKYRTLPHFPKDDDSPESR------GFIENSYLRGLTPQEFFFHAMA 877

Query: 882 NRDSSFSDNAEV--PGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDID 894
            R+       +    G + R+L   M D+   YDGTVRNA G+ ++QF+Y  D
Sbjct: 886 GREGLIDTAVKTAETGYIQRRLVKAMEDVMVRYDGTVRNAMGD-IIQFAYGED 877

BLAST of ClCG00G000380 vs. Swiss-Prot
Match: RPB1A_TRYBB (DNA-directed RNA polymerase II subunit RPB1-A OS=Trypanosoma brucei brucei GN=TRP4.8 PE=1 SV=1)

HSP 1 Score: 190.7 bits (483), Expect = 1.1e-46
Identity = 163/598 (27.26%), Postives = 279/598 (46.66%), Query Frame = 1

Query: 323 KKIKDTATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHL 382
           K + +     YG   ++  ++GKR D   R V+ GDPNI++ E+G+P  VA  L   E +
Sbjct: 331 KSLTERLKGKYGR--LRGNLMGKRVDFSARTVITGDPNIDVDEVGVPFSVAMTLTFPERV 390

Query: 383 SSWNMKKLSTSCYLHLVEKGEIFVRREG-----RLVRVRNVLELNMGDTIYRPLADGDVV 442
           ++ N K+L+      +           G      L+R R+ + LN+GD + R + +GDVV
Sbjct: 391 NTVNKKRLTEFARRTVYPSANYIHHPNGTITKLALLRDRSKVTLNIGDVVERHVINGDVV 450

Query: 443 LVNRPPSIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVEV 502
           L NR P++H+ S++   V++L  S+   LN  C +P+  DFDGD ++ +VPQSL  + E+
Sbjct: 451 LFNRQPTLHRMSMMGHRVRVLNYST-FRLNLSCTTPYNADFDGDEMNLHVPQSLLTKAEL 510

Query: 503 RELVSLDRQLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQLQM-LALHQLLPP 562
            E++ + +  ++       + +  DSL  ++ + +    L+ + +Q + + L L QL  P
Sbjct: 511 IEMMMVPKNFVSPNKSAPCMGIVQDSLLGSYRLTDKDTFLDKYFVQSVALWLDLWQLPIP 570

Query: 563 AILKAPLLRNCAWTGKQLFSTLLP-----------PDFDYSSPSHCVLIENGELISSEGS 622
           AILK   L    WTGKQ+FS +LP           P F ++     V+I  G+L+    +
Sbjct: 571 AILKPRPL----WTGKQVFSLILPEVNHPATPQDRPPFPHN--DSVVMIRRGQLLCGPIT 630

Query: 623 YWLRDS--GRNLFQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSH 682
             +  +  G  +     EH   +   +++  Q V   +L   G SV + D     D+   
Sbjct: 631 KSIVGAAPGSLIHVIFNEHGSDEVARFINGVQRVTTFFLLNFGFSVGVQDTVADSDTL-- 690

Query: 683 KNMMDDIFCGLQEAEETCNLKQLMVDAHKDILTEDDEDNQHVLSIAVDRLSYEKQKSAAL 742
              M+D+    +      N++++   A+   L    +    +L       S+E   ++AL
Sbjct: 691 -RQMNDVLVKTRR-----NVEKIGAAANNRTLNR--KAGMTLLQ------SFEADVNSAL 750

Query: 743 NQASVDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQH---SL 802
           N+   +A KK   +++        + NS   M +AGSKG  L + Q ++ +G Q+   S 
Sbjct: 751 NKCREEAAKKALSNVR--------RTNSFKVMIEAGSKGTDLNICQIAVFVGQQNVAGSR 810

Query: 803 VTLSF---SLPHKLTCSAWNSQKMPRYTQKDGLPDRTSSFIPYAVVESSFLSGLNPFECF 862
           +   F   +LPH +      + +        G+ +R             ++ GL P E F
Sbjct: 811 IPFGFRRRTLPHFMLDDYGETSR--------GMANR------------GYVEGLKPHEFF 870

Query: 863 AHSVTNRDSSFSDNAEV--PGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDID 894
            H++  R+       +    G L RKL   + D++ AYDGTVRNA  ++L+QF Y  D
Sbjct: 871 FHTMAGREGLIDTAVKTSDTGYLQRKLIKALEDVHAAYDGTVRNA-NDELIQFMYGED 874

BLAST of ClCG00G000380 vs. TrEMBL
Match: A0A0A0L2L4_CUCSA (DNA-directed RNA polymerase subunit OS=Cucumis sativus GN=Csa_3G039340 PE=3 SV=1)

HSP 1 Score: 1368.2 bits (3540), Expect = 0.0e+00
Identity = 679/724 (93.78%), Postives = 701/724 (96.82%), Query Frame = 1

Query: 308  LSPEKLQSKDLVYQQKKIKDTATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIG 367
            LSPEKLQ+KDLVYQQKKIKDTATSS GLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIG
Sbjct: 306  LSPEKLQNKDLVYQQKKIKDTATSSSGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIG 365

Query: 368  IPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEKGEIFVRREGRLVRVRNVLELNMGDTI 427
            IPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEKGEI+VRREGRLVRVRNVLELNMGDTI
Sbjct: 366  IPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEKGEIYVRREGRLVRVRNVLELNMGDTI 425

Query: 428  YRPLADGDVVLVNRPPSIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYV 487
            YRPLADGD+VLVNRPPSIHQHSLIALSVKLLPVS+VLSLNPLCCSPFRGDFDGDCLHGYV
Sbjct: 426  YRPLADGDIVLVNRPPSIHQHSLIALSVKLLPVSAVLSLNPLCCSPFRGDFDGDCLHGYV 485

Query: 488  PQSLEARVEVRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQLQM 547
            PQSLEARVEVRELVSLD+QL NGQSGRNLLSLSHDSLTAAHLI+EDGVSLNLFQMQQLQM
Sbjct: 486  PQSLEARVEVRELVSLDKQLTNGQSGRNLLSLSHDSLTAAHLILEDGVSLNLFQMQQLQM 545

Query: 548  LALHQLLPPAILKAPLLRNCAWTGKQLFSTLLPPDFDYSSPSHCVLIENGELISSEGSYW 607
            L LHQLLPPAI+K+PLLRNCAWTGKQLFS LLPPDFDYSSPSH V IE GELISSEGSYW
Sbjct: 546  LTLHQLLPPAIVKSPLLRNCAWTGKQLFSILLPPDFDYSSPSHNVFIEKGELISSEGSYW 605

Query: 608  LRDSGRNLFQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKNMM 667
            LRDSGRNLFQALIEHCEGKTLDYL DAQGVLCEWLS RGLSVSLSDLYLSVDSYSH+NMM
Sbjct: 606  LRDSGRNLFQALIEHCEGKTLDYLRDAQGVLCEWLSTRGLSVSLSDLYLSVDSYSHENMM 665

Query: 668  DDIFCGLQEAEETCNLKQLMVDAHKDILTEDDEDNQHVLSIAVDRLSYEKQKSAALNQAS 727
            DDIFCGLQEAEETCNLKQLMVD+HK+IL  +DEDNQH+LSIAV+RL YEKQKSAALNQAS
Sbjct: 666  DDIFCGLQEAEETCNLKQLMVDSHKEILIGNDEDNQHLLSIAVERLIYEKQKSAALNQAS 725

Query: 728  VDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFSL 787
            VDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNL+KLVQHSMCLGLQHSLVTLSFSL
Sbjct: 726  VDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLMKLVQHSMCLGLQHSLVTLSFSL 785

Query: 788  PHKLTCSAWNSQKMPRYTQKDGLPDRTSSFIPYAVVESSFLSGLNPFECFAHSVTNRDSS 847
            PHKL+C+AWNSQKMPRY QKDGLPDRT SFIPYAVVE+SFLSGLNPFECFAHSVTNRDSS
Sbjct: 786  PHKLSCAAWNSQKMPRYIQKDGLPDRTQSFIPYAVVENSFLSGLNPFECFAHSVTNRDSS 845

Query: 848  FSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELDSENN 907
            FSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQF YDIDRPTS   E +SENN
Sbjct: 846  FSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFCYDIDRPTS---ESESENN 905

Query: 908  NRDHDIGGHPVGSLAACAMSEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFS 967
            NRD  IGGHPVGSLAACA+SEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFS
Sbjct: 906  NRDRGIGGHPVGSLAACAISEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFS 965

Query: 968  LFLSEKLSKRSYGFEYGALGVKNHLERVMFKDIVSNVMIIFSPQPSRKKHFSPWVCHFHV 1027
            LFLSEKLSKRSYGFEYGALGVKNHLERVMFKDIVS+VMIIFSP PSRKKHFSPWVCHFHV
Sbjct: 966  LFLSEKLSKRSYGFEYGALGVKNHLERVMFKDIVSSVMIIFSPLPSRKKHFSPWVCHFHV 1025

Query: 1028 CKSL 1032
            CK +
Sbjct: 1026 CKEI 1026

BLAST of ClCG00G000380 vs. TrEMBL
Match: A0A0A0L2L4_CUCSA (DNA-directed RNA polymerase subunit OS=Cucumis sativus GN=Csa_3G039340 PE=3 SV=1)

HSP 1 Score: 528.9 bits (1361), Expect = 1.8e-146
Identity = 267/301 (88.70%), Postives = 277/301 (92.03%), Query Frame = 1

Query: 1030 SLESATLDIGKTIRLEHLLLVANSLSATGEFVGLNVKGLSHQREHALVKTPFMQACFSSP 1089
            SLESATLD+GKTIRLEHLLLV+NSLSATGEFVGLNVKGL+HQREHALVKTPFMQACFSSP
Sbjct: 1199 SLESATLDVGKTIRLEHLLLVSNSLSATGEFVGLNVKGLTHQREHALVKTPFMQACFSSP 1258

Query: 1090 GACFVKAAKAGIKDNLSGSLDALAWGRIPSLGTGGQFDILYSGRGHELNKPVDVYNLLGG 1149
            GAC +KAAKAGIKDNLSGSLDALAWGR+PSLGTGGQFDILYSG+GHELNKPVDVYNLLGG
Sbjct: 1259 GACMIKAAKAGIKDNLSGSLDALAWGRMPSLGTGGQFDILYSGKGHELNKPVDVYNLLGG 1318

Query: 1150 QSICEKQNAKIESLDKNNISEKYSAQLVLINGGSTIKGLKKLDSVSKSILREFLTLNDIQ 1209
            QS CEKQN KIESLDKN ISEKYSAQL+L NGGSTIKGLK+LDSVSKSILR+FLTLNDIQ
Sbjct: 1319 QSTCEKQNTKIESLDKNTISEKYSAQLMLKNGGSTIKGLKRLDSVSKSILRKFLTLNDIQ 1378

Query: 1210 KLSFALRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIKLISVFVLNCGS 1269
            KLSFALRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIK           
Sbjct: 1379 KLSFALRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIK----------- 1438

Query: 1270 NYVVDLDAVGNHSKYQNTRCFVLIRSDGTTEDFSYHKCVLGALEIIAPHRVKGYQSKWMQ 1329
                    VG+HSKYQNTRCFVLIRSDGTTEDFSYHKCVLGALEIIAPHRVKGYQSKWMQ
Sbjct: 1439 --------VGSHSKYQNTRCFVLIRSDGTTEDFSYHKCVLGALEIIAPHRVKGYQSKWMQ 1480

Query: 1330 E 1331
            E
Sbjct: 1499 E 1480


HSP 2 Score: 526.9 bits (1356), Expect = 6.9e-146
Identity = 249/261 (95.40%), Postives = 257/261 (98.47%), Query Frame = 1

Query: 47  VMIHMEDEQDGELPIPSGLVTGINFSVSTQQDTENIAVMTVDASSEVSDPKLGLPNPSYQ 106
           VMIHMEDEQDGELPIPSGL+TGINFSVS QQD ENIAV+TVDA++EVSDPKLGLPNPSYQ
Sbjct: 13  VMIHMEDEQDGELPIPSGLLTGINFSVSNQQDIENIAVITVDAANEVSDPKLGLPNPSYQ 72

Query: 107 CTTCGASSLKSCEGHFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRQELWGKVEDPT 166
           CTTCGASSLK CEGHFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKS+RQELWGKVEDPT
Sbjct: 73  CTTCGASSLKFCEGHFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKSVRQELWGKVEDPT 132

Query: 167 SEYHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSKKYQKRVAKGGLP 226
           S+Y+RPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSKKYQKRVAKGGLP
Sbjct: 133 SDYNRPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSKKYQKRVAKGGLP 192

Query: 227 SDYWNFIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPAIDSLFLNSFPVT 286
           SDYW+FIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPAIDSLFLNSFPVT
Sbjct: 193 SDYWDFIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPAIDSLFLNSFPVT 252

Query: 287 PNSHRVTEMTHSFSNGQRLIF 308
           PNSHRVTEM HSFSNGQRLIF
Sbjct: 253 PNSHRVTEMAHSFSNGQRLIF 273


HSP 3 Score: 1281.2 bits (3314), Expect = 0.0e+00
Identity = 625/985 (63.45%), Postives = 781/985 (79.29%), Query Frame = 1

Query: 51   MEDEQDGELPIPSGLVTGINFSVSTQQDTENIAVMTVDASSEVSDPKLGLPNPSYQCTTC 110
            M+++   E  +PSGL+ GI F VST++D E I+VM +DA +E++DPKLG+PNPS QC+TC
Sbjct: 1    MDNDFLEEQQVPSGLLIGIKFDVSTEEDMEKISVMKIDAVNEITDPKLGVPNPSCQCSTC 60

Query: 111  GASSLKSCEGHFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRQELWGKVEDPTSEYH 170
            GA   K CEGHFGVIKFP+TI+HPYFL+EV Q+LNK+CPGCKS RQ  W KV        
Sbjct: 61   GAKDTKKCEGHFGVIKFPFTILHPYFLTEVVQILNKICPGCKSTRQGQWVKVRRL----- 120

Query: 171  RPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSKKYQKRVAKGGLPSDYW 230
            R KGC+YC  +  DWYP M+FK+S+ D+F+K+ I+VE+ E + KK QK+  +  LP DYW
Sbjct: 121  RSKGCKYCAANSNDWYPTMKFKVSSKDLFRKTAIIVEMNEKLPKKLQKKSFRPVLPLDYW 180

Query: 231  NFIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPAIDSLFLNSFPVTPNSH 290
            +FIPKD QQEE+   PNR++L+HAQVHYLLKDIDP F+K+FV  +DS FLN  PVTPN+H
Sbjct: 181  DFIPKDPQQEENCLNPNRRVLSHAQVHYLLKDIDPGFIKEFVSRMDSFFLNCLPVTPNNH 240

Query: 291  RVTEMTHSFSNGQRLIFLSPEKLQSKDLVYQQKKIK---DTATSSYGLRWIKDVVLGKRS 350
            RVTE+TH+ SNGQ LIF    +   K + ++    +    +A+   GL+WIK+V+LGKR+
Sbjct: 241  RVTEITHALSNGQTLIFDQHSRAYKKLVDFRGTANELSCHSASKMSGLKWIKEVLLGKRT 300

Query: 351  DHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEKGEIFVR 410
            +H FRM+VVGDP + LSEIGIPCH+AE L ISEHL+SWN +K++  C L L+EKG+ +VR
Sbjct: 301  NHSFRMIVVGDPKLRLSEIGIPCHIAEELLISEHLNSWNWEKVTNGCNLRLLEKGQTYVR 360

Query: 411  REGRLVRVRNVLELNMGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVKLLPVSSVLSLN 470
            R+G L  VR + +   GD IYRPL DGD+VL+NRPPSIHQHS+IALSVK+LP++SV+S+N
Sbjct: 361  RKGTLAPVRRMNDFQAGDIIYRPLTDGDIVLINRPPSIHQHSVIALSVKVLPLNSVVSIN 420

Query: 471  PLCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVSLDRQLINGQSGRNLLSLSHDSLTAA 530
            PLCCSPFRGDFDGDCLHGY+PQS+++RVE+ ELV+L+RQLIN QSGRNLLSLS DSL+AA
Sbjct: 421  PLCCSPFRGDFDGDCLHGYIPQSVDSRVELSELVALNRQLINRQSGRNLLSLSQDSLSAA 480

Query: 531  HLIMEDGVSLNLFQMQQLQMLALHQLLPPAILKAPLLRNCAWTGKQLFSTLLPPDFDYSS 590
            HL+MEDGV LNLFQMQQL+M   +QL  PAI+KAPLL    WTGKQLFS LLPP F+Y  
Sbjct: 481  HLVMEDGVLLNLFQMQQLEMFCPYQLQSPAIIKAPLLDTQVWTGKQLFSMLLPPGFNYVF 540

Query: 591  PSHCVLIENGELI-SSEGSYWLRDSGRNLFQALIEHCEGKTLDYLHDAQGVLCEWLSMRG 650
            P + V I +GELI SS+GS WLRD   NLF +L++ C+GK LD+L+ AQ VLCEWLSMRG
Sbjct: 541  PLNGVRISDGELISSSDGSAWLRDIDGNLFSSLVKDCQGKALDFLYAAQEVLCEWLSMRG 600

Query: 651  LSVSLSDLYLSVDSYSHKNMMDDIFCGLQEAEETCNLKQLMVDAHKDILTEDDEDNQHVL 710
            LSVSLSD+YLS DS S KNM+D++FCGL  AE+TC+ KQL+VD+ ++ L    E+NQ+ +
Sbjct: 601  LSVSLSDIYLSSDSISRKNMIDEVFCGLLVAEQTCHFKQLLVDSSQNFLIGSGENNQNGV 660

Query: 711  SIAVDRLSYEKQKSAALNQASVDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLLK 770
               V  L YE+Q SAAL Q+SV AFK+ FRDIQNLVY+Y+ KDNSLL M KAGSKGNLLK
Sbjct: 661  VPDVQSLWYERQGSAALCQSSVCAFKQKFRDIQNLVYQYANKDNSLLAMLKAGSKGNLLK 720

Query: 771  LVQHSMCLGLQHSLVTLSFSLPHKLTCSAWNSQKMPRYTQKDGLPDRTSSFIPYAVVESS 830
            LVQ  +CLGLQHSLV LSF +PH+L+C+AWN QK+P   Q D   +   S+IPYAVVE+S
Sbjct: 721  LVQQGLCLGLQHSLVPLSFKIPHQLSCAAWNKQKVPGLIQND-TSEYAESYIPYAVVENS 780

Query: 831  FLSGLNPFECFAHSVTNRDSSFSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLV 890
            FL GLNP ECF HSVT+RDSSFSDNA++PGTLTR+L F MRD+Y AYDGTVRNAYGNQLV
Sbjct: 781  FLMGLNPLECFVHSVTSRDSSFSDNADLPGTLTRRLMFFMRDLYIAYDGTVRNAYGNQLV 840

Query: 891  QFSYDIDRPTSVSNELDSENNNRDHDIGGHPVGSLAACAMSEAAYSALDQPISLLEASPL 950
            QFSY+I+  ++ S+ ++ +     +D+GG PVGS++ACA+SEAAYSALDQPISLLE SPL
Sbjct: 841  QFSYNIEHTSTPSDGINED--TCAYDMGGQPVGSISACAISEAAYSALDQPISLLEPSPL 900

Query: 951  LNLKRVLECGSKRNSTKQTFSLFLSEKLSKRSYGFEYGALGVKNHLERVMFKDIVSNVMI 1010
            LNLKRVLECG ++++  +T SLFLS+KL KR +GFEYGAL VKNHLE+++F DIVS VMI
Sbjct: 901  LNLKRVLECGLRKSTADRTVSLFLSKKLEKRKHGFEYGALEVKNHLEKLLFSDIVSTVMI 960

Query: 1011 IFSPQPSRKKHFSPWVCHFHVCKSL 1032
            +FSPQ   K HFSPWVCHFHVC+ +
Sbjct: 961  VFSPQNGSKTHFSPWVCHFHVCEEI 977

BLAST of ClCG00G000380 vs. TrEMBL
Match: F6HUI3_VITVI (DNA-directed RNA polymerase subunit OS=Vitis vinifera GN=VIT_02s0025g04530 PE=3 SV=1)

HSP 1 Score: 356.7 bits (914), Expect = 1.2e-94
Identity = 179/300 (59.67%), Postives = 218/300 (72.67%), Query Frame = 1

Query: 1030 SLESATLDIGKTIRLEHLLLVANSLSATGEFVGLNVKGLSHQREHALVKTPFMQACFSSP 1089
            SL+SA  DIGKT+  EHLLLVA+ LSATGEFVGLN KG++ Q+E   + +PFMQ CFSSP
Sbjct: 1138 SLKSAISDIGKTVLPEHLLLVASCLSATGEFVGLNAKGMARQKELTSISSPFMQGCFSSP 1197

Query: 1090 GACFVKAAKAGIKDNLSGSLDALAWGRIPSLGTGGQFDILYSGRGHELNKPVDVYNLLGG 1149
            G+CF+KA K  + DNL GSLDALAWG+IPS+G+GG FDILYS +GHEL +P D+Y LLG 
Sbjct: 1198 GSCFIKAGKRAVADNLHGSLDALAWGKIPSVGSGGHFDILYSAKGHELARPEDIYKLLGS 1257

Query: 1150 QSICEKQNAKIE-SLDKNNISEKYSAQLVLINGGSTIKGLKKLDSVSKSILREFLTLNDI 1209
            Q+ C +QN K++  +     + K  AQLV  NG S  KG K L+ +SKS+LR FL+LNDI
Sbjct: 1258 QTSCHEQNLKVKVPITCYQTTTKCGAQLVYANGDSASKGCKSLEKISKSVLRSFLSLNDI 1317

Query: 1210 QKLSFALRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIKLISVFVLNCG 1269
            QKLS  L+ IL KY +N +L+E+DK+TLMMALYFHP RDEKIG GAQ+IK          
Sbjct: 1318 QKLSRRLKFILQKYPINHQLSEIDKTTLMMALYFHPRRDEKIGPGAQNIK---------- 1377

Query: 1270 SNYVVDLDAVGNHSKYQNTRCFVLIRSDGTTEDFSYHKCVLGALEIIAPHRVKGYQSKWM 1329
                     V  HSKY NTRCF L+R+DGT EDFSYHKCV GALEII P R + YQS+W+
Sbjct: 1378 ---------VRYHSKYHNTRCFSLVRTDGTEEDFSYHKCVHGALEIIDPRRARSYQSRWL 1418


HSP 2 Score: 1210.7 bits (3131), Expect = 0.0e+00
Identity = 611/1004 (60.86%), Postives = 763/1004 (76.00%), Query Frame = 1

Query: 61   IPSGLVTGINFSVSTQQDTENIAVMTVDASSEVSDPKLGLPNPSYQCTTCGASSLKSCEG 120
            +PS L+T I F VST+ + E ++V+T+D  SEV+D KLGLPNP+ QC+TCG+  LKSCEG
Sbjct: 12   LPSALLTAITFGVSTEAEKEKLSVLTIDTVSEVTDSKLGLPNPTNQCSTCGSKDLKSCEG 71

Query: 121  HFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRQELWGKVEDPTSEYHRPKGCRYCFG 180
            HFGVIKFP+TI+HPY+LSEV ++LN+VCP CKSIR+E   KV      +  PK       
Sbjct: 72   HFGVIKFPFTILHPYYLSEVVRILNQVCPKCKSIRKE--SKVR--CLNHLNPKLPVLLI- 131

Query: 181  SLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSKKYQKRVAKGGLPSDYWNFIPKDEQQE 240
             L  WYP M+F +S+ ++F+K++I+ +  E  + K QKR  K  L +DYW+ IPKDEQQE
Sbjct: 132  -LLCWYPAMKFSVSSEEIFRKNVIIAKFSERPTNKSQKRGFKKKLAADYWDIIPKDEQQE 191

Query: 241  ESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPAIDSLFLNSFPVTPNSHRVTEMTHSFS 300
            E+  RPN+++L+HAQV +LL++IDP F++KFV   DS+FLN F VTPN HRVTE+TH+FS
Sbjct: 192  ENITRPNQRVLSHAQVIHLLENIDPNFIRKFVLKRDSIFLNCFSVTPNCHRVTEVTHAFS 251

Query: 301  NGQRLIF--------------------------------LSPEK-LQSKDLVYQQKKIKD 360
            NGQRL+F                                ++P+K + + D +  Q+K+ D
Sbjct: 252  NGQRLVFDDRTRAYKKMVDFRGIAKELSFRVLDCLKTSKINPDKSVNNDDYMALQRKMND 311

Query: 361  TATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNM 420
            +++SS GLRWIKDVVLGKR+D+ FRMVVVGDPNI+ SEIGIPC +AERLQISEHL++WN 
Sbjct: 312  SSSSSSGLRWIKDVVLGKRNDNSFRMVVVGDPNIKFSEIGIPCPIAERLQISEHLTTWNW 371

Query: 421  KKLSTSCYLHLVEKGEIFVRREGRLVRVRNVLELNMGDTIYRPLADGDVVLVNRPPSIHQ 480
             KL+T C + L+EKG++ VRREG+LVRVR   EL +GD IYRPL DGD VL+NRPPSIHQ
Sbjct: 372  DKLNTCCEVRLLEKGDMHVRREGKLVRVRRTKELRIGDIIYRPLNDGDTVLINRPPSIHQ 431

Query: 481  HSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVSLDRQL 540
            HSLIALSVK+LP +SVL++NPL C+PFRGDFDGDCLHGYVPQS++ RVE+RELV+LD+QL
Sbjct: 432  HSLIALSVKVLPATSVLAINPLICAPFRGDFDGDCLHGYVPQSVDTRVELRELVALDKQL 491

Query: 541  INGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQLQMLALHQLLPPAILKAPLLRNC 600
            IN Q+GRNLLS S DSL AAHL+MEDGV L+L QMQQLQM   HQL  PA+ KAP L  C
Sbjct: 492  INVQNGRNLLSFSQDSLVAAHLVMEDGVLLSLQQMQQLQMFCPHQLFSPAVRKAPSLNGC 551

Query: 601  AWTGKQLFSTLLPPDFDYSSPSHCVLIENGELISSEGSYWLRDSGRNLFQALIEHCEGKT 660
            AWTGKQL S LLP  FD+  PS  V I +GELISSEGS+WLRD+  NLFQ+LI+ C+ + 
Sbjct: 552  AWTGKQLISMLLPRGFDHECPSSDVYIRDGELISSEGSFWLRDTDGNLFQSLIKQCQDQV 611

Query: 661  LDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKNMMDDIFCGLQEAEETCNLKQLM 720
            LD+L+ AQ VLCEWLSMRGLSVSLSDLYL  DS S +NMMD++  GLQ+A+ TCN+KQ M
Sbjct: 612  LDFLYIAQEVLCEWLSMRGLSVSLSDLYLCPDSDSRENMMDEVLFGLQDAKGTCNMKQFM 671

Query: 721  VDAHKDILTEDDEDNQHVLSIAVDRLSYEKQKSAALNQASVDAFKKVFRDIQNLVYKYSG 780
            VD+ +D L   DED Q+ ++  V+ L +EKQ+SAAL+QASVDAFK VFRDIQ L YKY+ 
Sbjct: 672  VDSCRDFLASIDEDEQYSVNFDVEHLCHEKQRSAALSQASVDAFKHVFRDIQTLGYKYAS 731

Query: 781  KDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFSLPHKLTCSAWNSQKMPRYTQK 840
            KDN+L+ MFK+GSKGNLLK+VQHSMCLGLQHSLV LSF +P +L+C AWN QK       
Sbjct: 732  KDNALMAMFKSGSKGNLLKVVQHSMCLGLQHSLVPLSFRMPLQLSCDAWNKQK------A 791

Query: 841  DGLPDRTSSFIPYAVVESSFLSGLNPFECFAHSVTNRDSSFSDNAEVPGTLTRKLTFLMR 900
            +   +   S+IP AVVE  FL+GLNP ECF HSVT+R+SSFSDNA++PGTLTR+L F MR
Sbjct: 792  ENAVECARSYIPSAVVEGCFLTGLNPLECFVHSVTSRESSFSDNADLPGTLTRRLMFFMR 851

Query: 901  DIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELDSENNNRDHDIGGHPVGSLAACAMS 960
            D++ AYDG+VR+AYGNQL+QFSY+ID   S      ++  +    + G PVGSLAAC++S
Sbjct: 852  DVHAAYDGSVRSAYGNQLIQFSYNIDEGRSAETYGTAKIVDNYDGMAGKPVGSLAACSIS 911

Query: 961  EAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFSLFLSEKLSKRSYGFEYGALG 1020
            EAAYSALDQPISLLE SPLLNLK VLECG K+++  ++ SLFLSEKL +R +GFEYGAL 
Sbjct: 912  EAAYSALDQPISLLEKSPLLNLKNVLECGLKKSNAHKSMSLFLSEKLGRRRHGFEYGALK 971

Query: 1021 VKNHLERVMFKDIVSNVMIIFSPQPSRKKHFSPWVCHFHVCKSL 1032
            V++HLER++F DIVS   IIFS Q   K  FSPWVCHFHV K +
Sbjct: 972  VQDHLERLLFSDIVSVSRIIFSSQSESKTCFSPWVCHFHVYKEI 1003

BLAST of ClCG00G000380 vs. TrEMBL
Match: B9RC12_RICCO (DNA-directed RNA polymerase subunit OS=Ricinus communis GN=RCOM_1683300 PE=3 SV=1)

HSP 1 Score: 152.1 bits (383), Expect = 4.6e-33
Identity = 73/103 (70.87%), Postives = 87/103 (84.47%), Query Frame = 1

Query: 1031 LESATLDIGKTIRLEHLLLVANSLSATGEFVGLNVKGLSHQREHALVKTPFMQACFSSPG 1090
            LESA  D+GK++  EH+LLVAN LS TGEFVGLN KG   QRE A V +PF+QACFSSPG
Sbjct: 1174 LESAISDVGKSVLPEHMLLVANCLSVTGEFVGLNAKGWKRQREDASVSSPFVQACFSSPG 1233

Query: 1091 ACFVKAAKAGIKDNLSGSLDALAWGRIPSLGTGGQFDILYSGR 1134
             CF+KAAKAG+KD+L GSLDALAWG++PS+GT GQFDI+YSG+
Sbjct: 1234 NCFIKAAKAGVKDDLQGSLDALAWGKVPSVGT-GQFDIVYSGK 1275


HSP 2 Score: 110.5 bits (275), Expect = 1.5e-20
Identity = 56/111 (50.45%), Postives = 72/111 (64.86%), Query Frame = 1

Query: 1211 LSFALRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIKLISVFVLNCGSN 1270
            L   L  +L  YS++++LNE DK TL MALYFHP ++EKIG G +DIK++          
Sbjct: 1302 LETPLINLLVWYSVDQQLNEADKCTLTMALYFHPRKEEKIGSGFKDIKVVK--------- 1361

Query: 1271 YVVDLDAVGNHSKYQNTRCFVLIRSDGTTEDFSYHKCVLGALEIIAPHRVK 1322
                      H +YQ++RCF L+RSDGT EDFSY KCV GALEIIAPH+ +
Sbjct: 1362 ----------HPEYQDSRCFSLVRSDGTIEDFSYRKCVYGALEIIAPHKAR 1393


HSP 3 Score: 1167.9 bits (3020), Expect = 0.0e+00
Identity = 576/929 (62.00%), Postives = 718/929 (77.29%), Query Frame = 1

Query: 51  MEDEQDGELPIPSGLVTGINFSVSTQQDTENIAVMTVDASSEVSDPKLGLPNPSYQCTTC 110
           M+++   E  +PSGL+ GI F VST++D E I+VM +DA +E++DPKLG+PNPS QC+TC
Sbjct: 1   MDNDFLEEQQVPSGLLIGIKFDVSTEEDMEKISVMKIDAVNEITDPKLGVPNPSCQCSTC 60

Query: 111 GASSLKSCEGHFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRQELWGKVEDPTSEYH 170
           GA   K CEGHFGVIKFP+TI+HPYFL+EV Q+LNK+CPGCKS RQ  W K  D  S   
Sbjct: 61  GAKDTKKCEGHFGVIKFPFTILHPYFLTEVVQILNKICPGCKSTRQGQWVKGADSGSRRL 120

Query: 171 RPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSKKYQKRVAKGGLPSDYW 230
           R KGC+YC  +  DWYP M+FK+S+ D+F+K+ I+VE+ E + KK QK+  +  LP DYW
Sbjct: 121 RSKGCKYCAANSNDWYPTMKFKVSSKDLFRKTAIIVEMNEKLPKKLQKKSFRPVLPLDYW 180

Query: 231 NFIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPAIDSLFLNSFPVTPNSH 290
           +FIPKD QQEE+   PNR++L+HAQVHYLLKDIDP F+K+FV  +DS FLN  PVTPN+H
Sbjct: 181 DFIPKDPQQEENCLNPNRRVLSHAQVHYLLKDIDPGFIKEFVSRMDSFFLNCLPVTPNNH 240

Query: 291 RVTEMTHSFSNGQRLIFLSPEKLQSK-------------------------DLVYQQKKI 350
           RVTE+TH+ SNGQ LIF    +   K                         +L  ++   
Sbjct: 241 RVTEITHALSNGQTLIFDQHSRAYKKLVDFRGTANELSCRVLDCLKTSKASNLRSEKSTS 300

Query: 351 KDTATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSW 410
           KD+A+   GL+WIK+V+LGKR++H FRM+VVGDP + LSEIGIPCH+AE L ISEHL+SW
Sbjct: 301 KDSASKMSGLKWIKEVLLGKRTNHSFRMIVVGDPKLRLSEIGIPCHIAEELLISEHLNSW 360

Query: 411 NMKKLSTSCYLHLVEKGEIFVRREGRLVRVRNVLELNMGDTIYRPLADGDVVLVNRPPSI 470
           N +K++  C L L+EKG+ +VRR+G L  VR + +   GD IYRPL DGD+VL+NRPPSI
Sbjct: 361 NWEKVTNGCNLRLLEKGQTYVRRKGTLAPVRRMNDFQAGDIIYRPLTDGDIVLINRPPSI 420

Query: 471 HQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVSLDR 530
           HQHS+IALSVK+LP++SV+S+NPLCCSPFRGDFDGDCLHGY+PQS+++RVE+ ELV+L+R
Sbjct: 421 HQHSVIALSVKVLPLNSVVSINPLCCSPFRGDFDGDCLHGYIPQSVDSRVELSELVALNR 480

Query: 531 QLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQLQMLALHQLLPPAILKAPLLR 590
           QLIN QSGRNLLSLS DSL+AAHL+MEDGV LNLFQMQQL+M   +QL  PAI+KAPLL 
Sbjct: 481 QLINRQSGRNLLSLSQDSLSAAHLVMEDGVLLNLFQMQQLEMFCPYQLQSPAIIKAPLLD 540

Query: 591 NCAWTGKQLFSTLLPPDFDYSSPSHCVLIENGELI-SSEGSYWLRDSGRNLFQALIEHCE 650
              WTGKQLFS LLPP F+Y  P + V I +GELI SS+GS WLRD   NLF +L++ C+
Sbjct: 541 TQVWTGKQLFSMLLPPGFNYVFPLNGVRISDGELISSSDGSAWLRDIDGNLFSSLVKDCQ 600

Query: 651 GKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKNMMDDIFCGLQEAEETCNLK 710
           GK LD+L+ AQ VLCEWLSMRGLSVSLSD+YLS DS S KNM+D++FCGL  AE+TC+ K
Sbjct: 601 GKALDFLYAAQEVLCEWLSMRGLSVSLSDIYLSSDSISRKNMIDEVFCGLLVAEQTCHFK 660

Query: 711 QLMVDAHKDILTEDDEDNQHVLSIAVDRLSYEKQKSAALNQASVDAFKKVFRDIQNLVYK 770
           QL+VD+ ++ L    E+NQ+ +   V  L YE+Q SAAL Q+SV AFK+ FRDIQNLVY+
Sbjct: 661 QLLVDSSQNFLIGSGENNQNGVVPDVQSLWYERQGSAALCQSSVCAFKQKFRDIQNLVYQ 720

Query: 771 YSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFSLPHKLTCSAWNSQKMPRY 830
           Y+ KDNSLL M KAGSKGNLLKLVQ  +CLGLQHSLV LSF +PH+L+C+AWN QK+P  
Sbjct: 721 YANKDNSLLAMLKAGSKGNLLKLVQQGLCLGLQHSLVPLSFKIPHQLSCAAWNKQKVPGL 780

Query: 831 TQKDGLPDRTSSFIPYAVVESSFLSGLNPFECFAHSVTNRDSSFSDNAEVPGTLTRKLTF 890
            Q D   +   S+IPYAVVE+SFL GLNP ECF HSVT+RDSSFSDNA++PGTLTR+L F
Sbjct: 781 IQND-TSEYAESYIPYAVVENSFLMGLNPLECFVHSVTSRDSSFSDNADLPGTLTRRLMF 840

Query: 891 LMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELDSENNNRDHDIGGHPVGSLAAC 950
            MRD+Y AYDGTVRNAYGNQLVQFSY+I+  ++ S+ ++ +     +D+GG PVGS++AC
Sbjct: 841 FMRDLYIAYDGTVRNAYGNQLVQFSYNIEHTSTPSDGINED--TCAYDMGGQPVGSISAC 900

Query: 951 AMSEAAYSALDQPISLLEASPLLNLKRVL 954
           A+SEAAYSALDQPISLLE SPLLNLK  L
Sbjct: 901 AISEAAYSALDQPISLLEPSPLLNLKNHL 926

BLAST of ClCG00G000380 vs. TrEMBL
Match: A5BZZ3_VITVI (DNA-directed RNA polymerase subunit OS=Vitis vinifera GN=VITISV_011232 PE=3 SV=1)

HSP 1 Score: 125.6 bits (314), Expect = 4.7e-25
Identity = 56/85 (65.88%), Postives = 70/85 (82.35%), Query Frame = 1

Query: 947  LNLKRVLECGSKRNSTKQTFSLFLSEKLSKRSYGFEYGALGVKNHLERVMFKDIVSNVMI 1006
            L L+RVLECG ++++  +T SLFLS+KL KR +GFEYGAL VKNHLE+++F DIVS VMI
Sbjct: 1065 LILQRVLECGLRKSTADRTVSLFLSKKLEKRKHGFEYGALEVKNHLEKLLFSDIVSTVMI 1124

Query: 1007 IFSPQPSRKKHFSPWVCHFHVCKSL 1032
            +FSPQ   K HFSPWVCHFHVC+ +
Sbjct: 1125 VFSPQNGSKTHFSPWVCHFHVCEEI 1149


HSP 2 Score: 78.6 bits (192), Expect = 6.5e-11
Identity = 38/58 (65.52%), Postives = 46/58 (79.31%), Query Frame = 1

Query: 1030 SLESATLDIGKTIRLEHLLLVANSLSATGEFVGLNVKGLSHQREHALVKTPFMQACFS 1088
            SL+SA  DIGKT+  EHLLLVA+ LSATGEFVGLN KG++ Q+E   + +PFMQ CFS
Sbjct: 1321 SLKSAISDIGKTVLPEHLLLVASCLSATGEFVGLNAKGMARQKELTSISSPFMQGCFS 1378


HSP 3 Score: 1161.4 bits (3003), Expect = 0.0e+00
Identity = 593/1009 (58.77%), Postives = 741/1009 (73.44%), Query Frame = 1

Query: 59   LPIPSGLVTGINFSVSTQQDTENIAVMTVDASSEVSDPKLGLPNPSYQCTTCGASSLKSC 118
            L +PSG++TGI+  +ST  + E ++VM + A SEV++P+LGLPNP+ +C++CGA   K+C
Sbjct: 10   LEVPSGILTGISLGISTDTEKEKLSVMEIGAVSEVTNPRLGLPNPTNECSSCGAKDRKAC 69

Query: 119  EGHFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRQELWGKVEDPTSEYHRPKGCRYC 178
            EGHFG IKFP+TI+HPYFLS++A++LN +CP CK+IR+E   +    +S   +P+ C+YC
Sbjct: 70   EGHFGFIKFPFTILHPYFLSDIAKLLNSICPKCKTIRKE--RQKGAGSSRKEQPRVCKYC 129

Query: 179  FGSLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSKKYQKRVAKGGLPSDYWNFIPKDEQ 238
              +   WYP MRFKLS+ D+  K+ I+VE+ E +SKK +K      LP DYW FIP D Q
Sbjct: 130  VRNPAQWYPRMRFKLSSKDLSGKTAIIVEIDEKLSKKNKK------LPDDYWGFIPFDAQ 189

Query: 239  QEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPAIDSLFLNSFPVTPNSHRVTEMTHS 298
            QEE+  +PNRK+L+  QV YLLKD+DP   ++F+ + D+ FL  FPVTPN+HRVTE+ H+
Sbjct: 190  QEENSVKPNRKVLSCKQVSYLLKDVDPSIREEFILSKDAPFLKCFPVTPNNHRVTEVPHA 249

Query: 299  FSNGQRLIF----------------------LSPEKLQSKDLVYQQKKIKDTATSSYGL- 358
            FS+ ++L F                      L  + L+   L   +   KD+A       
Sbjct: 250  FSHEKKLFFDNWTRHLKKMVDYRGRDIELSHLVQDCLKISKLHLDKSSRKDSAEVRQKKN 309

Query: 359  ---------RWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWN 418
                     RWIKDVVLGKR+D CFRMVV GDPNI+L EIGIPC VAERLQISE L+SWN
Sbjct: 310  IDISNSSGLRWIKDVVLGKRNDDCFRMVVTGDPNIKLKEIGIPCQVAERLQISERLNSWN 369

Query: 419  MKKLSTSCYLHLVEKGEIFVRREGRLVRVRNVLELNMGDTIYRPLADGDVVLVNRPPSIH 478
             ++LS      L+EKGE++V R+G LVR+R +  L +GD IYRPL DGD+VL+NRPPSIH
Sbjct: 370  WERLSVCISFRLLEKGELYVCRKGGLVRIRRIDALELGDIIYRPLTDGDIVLINRPPSIH 429

Query: 479  QHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVSLDRQ 538
             HSLIAL+VK+LP+SSV+++NPLCCSPF GDFDGDCLHGY+PQ++ ARVE+ ELV+LD+Q
Sbjct: 430  PHSLIALTVKVLPISSVVTINPLCCSPFHGDFDGDCLHGYIPQAIGARVELTELVALDKQ 489

Query: 539  LINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQLQMLALHQLLPPAILKAPLLRN 598
            LIN QSGRNLLSL  DSLTAAHL+MEDGV L+  QMQQLQM   H+ L P I K  + ++
Sbjct: 490  LINQQSGRNLLSLGQDSLTAAHLLMEDGVLLSHLQMQQLQMFCPHRFLSPDIFK--ISKD 549

Query: 599  CAWTGKQLFSTLLPPDFDYSSPSHCVLIENGELISSEGSYWLRDSGRNLFQALIEHCEGK 658
              W+GKQLFS LLPPDF+Y+ PS  V I  G+LIS+EGS WLRD   NLFQ LI+    K
Sbjct: 550  SVWSGKQLFSMLLPPDFEYTFPSKDVYISGGKLISAEGSSWLRDYEGNLFQYLIKRYRDK 609

Query: 659  TLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKNMMDDIFCGLQEAEETCNLKQL 718
             LD+L+ AQ VLCEWLS+RG++VSLSDLYL+  S S K +MD+IF GL+EA++TCN +QL
Sbjct: 610  VLDFLYAAQEVLCEWLSVRGMTVSLSDLYLASHSCSRKILMDEIFYGLREAQDTCNFQQL 669

Query: 719  MVDAHKDILTEDDEDNQHVLSIAVDRLSYEKQKSAALNQASVDAFKKVFRDIQNLVYKYS 778
            MVD+H + L    +D++   S+  + LSYEKQ+SAAL+QASVDAFK VF DIQNL YKY 
Sbjct: 670  MVDSHMNFLM-SAKDSESTRSLQGEHLSYEKQRSAALSQASVDAFKHVFWDIQNLAYKYG 729

Query: 779  GKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFSLPHKLTCSAWNSQKMPRYTQ 838
             KDNSLL MFKAGSKGNLLKLVQHS+CLGLQHSL  LSF  PH+L+C+AWN  +    T 
Sbjct: 730  SKDNSLLGMFKAGSKGNLLKLVQHSLCLGLQHSLAPLSFRFPHELSCAAWNRLRAGDNT- 789

Query: 839  KDGLPDRTSSFIPYAVVESSFLSGLNPFECFAHSVTNRDSSFSDNAEVPGTLTRKLTFLM 898
                 +   S+IP AVVE+SFL+GLNP ECF HSVT+RDSSFSDNA++PGTLTR+L F M
Sbjct: 790  -----ECAKSYIPSAVVENSFLTGLNPLECFIHSVTSRDSSFSDNADLPGTLTRRLMFFM 849

Query: 899  RDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELDSENNNRDHDIGGHPVGSLAACAM 958
            RD+ TAYDGTVRNAYGNQ+VQFSY+I+  ++ + E           IG  PVGSL+ACA+
Sbjct: 850  RDLCTAYDGTVRNAYGNQIVQFSYNIEGTSTPTGE-----------IGDQPVGSLSACAI 909

Query: 959  SEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFSLFLSEKLSKRSYGFEYGAL 1018
            SEAAYSALDQPISLLE SPLLNLK VLECGSK+++  QT SLFLS KL KR +GFEYGAL
Sbjct: 910  SEAAYSALDQPISLLETSPLLNLKNVLECGSKKSNADQTMSLFLSNKLGKRRHGFEYGAL 969

Query: 1019 GVKNHLERVMFKDIVSNVMIIFSPQPSRKKHFSPWVCHFHVCKSLESAT 1036
             VKNHLE +M  DIVS  MII+SPQ    KHFSPW+CHFHV K +   T
Sbjct: 970  EVKNHLECLMLSDIVSTSMIIYSPQTGSMKHFSPWICHFHVRKEIMKRT 990

BLAST of ClCG00G000380 vs. TAIR10
Match: AT1G63020.1 (AT1G63020.1 nuclear RNA polymerase D1A)

HSP 1 Score: 1005.7 bits (2599), Expect = 2.6e-293
Identity = 534/1022 (52.25%), Postives = 696/1022 (68.10%), Query Frame = 1

Query: 51   MEDEQDGELPIPSGLVTGINFSVSTQQDTENIAVMTVDASSEVSDPKLGLPNPSYQCTTC 110
            MED+ + EL +P G +T I FS+S   D + ++V+ V+A ++V+D +LGLPNP   C TC
Sbjct: 1    MEDDCE-ELQVPVGTLTSIGFSISNNNDRDKMSVLEVEAPNQVTDSRLGLPNPDSVCRTC 60

Query: 111  GASSLKSCEGHFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRQELWGKVEDPTSEYH 170
            G+   K CEGHFGVI F Y+II+PYFL EVA +LNK+CPGCK IR++ +   ED      
Sbjct: 61   GSKDRKVCEGHFGVINFAYSIINPYFLKEVAALLNKICPGCKYIRKKQFQITED------ 120

Query: 171  RPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSKKYQKRVAKGGLPSDYW 230
            +P+ CRYC  +L   YP M+F+++T ++F++S I+VEV E    K +KR     LP DYW
Sbjct: 121  QPERCRYC--TLNTGYPLMKFRVTTKEVFRRSGIVVEVNEESLMKLKKRGVL-TLPPDYW 180

Query: 231  NFIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPAIDSLFLNSFPVTPNSH 290
            +F+P+D   +ES  +P R+I+THAQV+ LL  ID + +KK +P  +SL L SFPVTPN +
Sbjct: 181  SFLPQDSNIDESCLKPTRRIITHAQVYALLLGIDQRLIKKDIPMFNSLGLTSFPVTPNGY 240

Query: 291  RVTEMTHSFSNGQRLIFLSPEKLQSK----------------DLVYQQKKIKDTATSSYG 350
            RVTE+ H F NG RLIF    ++  K                + +   +   +T +SS  
Sbjct: 241  RVTEIVHQF-NGARLIFDERTRIYKKLVGFEGNTLELSSRVMECMQYSRLFSETVSSS-- 300

Query: 351  LRWIKDVV--LGKRSD--------------------HCFRMVVVGDPNIELSEIGIPCHV 410
                KD      K+SD                    H FR VVVGDP+++L+EIGIP  +
Sbjct: 301  ----KDSANPYQKKSDTPKLCGLRFMKDVLLGKRSDHTFRTVVVGDPSLKLNEIGIPESI 360

Query: 411  AERLQISEHLSSWNMKKLSTSCYLHLVEKGEIFVRREGRLVRVRNVLELNMGDTIYRPLA 470
            A+RLQ+SEHL+  N ++L TS    L++  E+ VRR  RLV ++ V +L  GD I+R L 
Sbjct: 361  AKRLQVSEHLNQCNKERLVTSFVPTLLDNKEMHVRRGDRLVAIQ-VNDLQTGDKIFRSLM 420

Query: 471  DGDVVLVNRPPSIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLE 530
            DGD VL+NRPPSIHQHSLIA++V++LP +SV+SLNP+CC PFRGDFDGDCLHGYVPQS++
Sbjct: 421  DGDTVLMNRPPSIHQHSLIAMTVRILPTTSVVSLNPICCLPFRGDFDGDCLHGYVPQSIQ 480

Query: 531  ARVEVRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLI-MEDGVSLNLFQMQQLQMLALH 590
            A+VE+ ELV+LD+QLIN Q+GRNLLSL  DSLTAA+L+ +E    LN  QMQQLQM    
Sbjct: 481  AKVELDELVALDKQLINRQNGRNLLSLGQDSLTAAYLVNVEKNCYLNRAQMQQLQMYCPF 540

Query: 591  QLLPPAILKA-PLLRNCAWTGKQLFSTLLPPDFDYSSPSHCVLIENGELIS-SEGSYWLR 650
            QL PPAI+KA P      WTG QLF  L PP FDY+ P + V++ NGEL+S SEGS WLR
Sbjct: 541  QLPPPAIIKASPSSTEPQWTGMQLFGMLFPPGFDYTYPLNNVVVSNGELLSFSEGSAWLR 600

Query: 651  DSGRNLFQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKNMMDD 710
            D   N  + L++H +GK LD ++ AQ +L +WL MRGLSVSL+DLYLS D  S KN+ ++
Sbjct: 601  DGEGNFIERLLKHDKGKVLDIIYSAQEMLSQWLLMRGLSVSLADLYLSSDLQSRKNLTEE 660

Query: 711  IFCGLQEAEETCNLKQLMVDAHKDILTEDDEDNQHVLSIAVDRLSYEKQKSAALNQASVD 770
            I  GL+EAE+ CN +QLMV++ +D L  + ED +      + R  YE+QKSA L++ +V 
Sbjct: 661  ISYGLREAEQVCNKQQLMVESWRDFLAVNGEDKEEDSVSDLARFCYERQKSATLSELAVS 720

Query: 771  AFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFSLPH 830
            AFK  +RD+Q L Y+Y  + NS L M KAGSKGN+ KLVQHSMC+GLQ+S V+LSF  P 
Sbjct: 721  AFKDAYRDVQALAYRYGDQSNSFLIMSKAGSKGNIGKLVQHSMCIGLQNSAVSLSFGFPR 780

Query: 831  KLTCSAWNSQKMPRYTQKDGLPDRTSSFIPYAVVESSFLSGLNPFECFAHSVTNRDSSFS 890
            +LTC+AWN    P    K      T S++PY V+E+SFL+GLNP E F HSVT+RDSSFS
Sbjct: 781  ELTCAAWNDPNSPLRGAKGKDSTTTESYVPYGVIENSFLTGLNPLESFVHSVTSRDSSFS 840

Query: 891  DNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELDSENNNR 950
             NA++PGTL+R+L F MRDIY AYDGTVRN++GNQLVQF+Y+ D P              
Sbjct: 841  GNADLPGTLSRRLMFFMRDIYAAYDGTVRNSFGNQLVQFTYETDGPV------------- 900

Query: 951  DHDIGGHPVGSLAACAMSEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFSLF 1010
              DI G  +GSL+ACA+SEAAYSALDQPISLLE SPLLNLK VLECGSK+   +QT SL+
Sbjct: 901  -EDITGEALGSLSACALSEAAYSALDQPISLLETSPLLNLKNVLECGSKKGQREQTMSLY 960

Query: 1011 LSEKLSKRSYGFEYGALGVKNHLERVMFKDIVSNVMIIFSPQPSRKKHFSPWVCHFHVCK 1032
            LSE LSK+ +GFEYG+L +KNHLE++ F +IVS  MIIFSP  + K   SPWVCHFH+ +
Sbjct: 961  LSEYLSKKKHGFEYGSLEIKNHLEKLSFSEIVSTSMIIFSPSSNTKVPLSPWVCHFHISE 990


HSP 2 Score: 244.2 bits (622), Expect = 4.5e-64
Identity = 135/302 (44.70%), Postives = 181/302 (59.93%), Query Frame = 1

Query: 1030 SLESATLDIGKTIRLEHLLLVANSLSATGEFVGLNVKGLSHQREHALVKTPFMQACFSSP 1089
            +LESA  D GK I  EHLLLVA+SLS TGEFV LN KG S QR+      PF QACFSSP
Sbjct: 1166 NLESAVSDTGKEILREHLLLVADSLSVTGEFVALNAKGWSKQRQVESTPAPFTQACFSSP 1225

Query: 1090 GACFVKAAKAGIKDNLSGSLDALAWGRIPSLGTGGQFDILYSGRGHELNKPVDVYNLLGG 1149
              CF+KAAK G++D+L GS+DALAWG++P  GTG QF+I+ S + H    PVDVY+LL  
Sbjct: 1226 SQCFLKAAKEGVRDDLQGSIDALAWGKVPGFGTGDQFEIIISPKVHGFTTPVDVYDLLSS 1285

Query: 1150 QSICEKQNAKIESLDKNNISEKYSAQLVLINGGSTIKGLKKLD--SVSKSILREFLTLND 1209
                 + N+  +       S+K + Q   +   + +K +K LD   +  S+LR   T  +
Sbjct: 1286 TKTMRRTNSAPK-------SDKATVQPFGLLHSAFLKDIKVLDGKGIPMSLLRTIFTWKN 1345

Query: 1210 IQKLSFALRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIKLISVFVLNC 1269
            I+ LS +L+ ILH Y +NE LNE D+  + M L  HP+  EKIG G + I++        
Sbjct: 1346 IELLSQSLKRILHSYEINELLNERDEGLVKMVLQLHPNSVEKIGPGVKGIRVAK------ 1405

Query: 1270 GSNYVVDLDAVGNHSKYQNTRCFVLIRSDGTTEDFSYHKCVLGALEIIAPHRVKGYQSKW 1329
                          SK+ ++ CF ++R DGT EDFSYHKCVLGA +IIAP ++  Y+SK+
Sbjct: 1406 --------------SKHGDSCCFEVVRIDGTFEDFSYHKCVLGATKIIAPKKMNFYKSKY 1440

BLAST of ClCG00G000380 vs. TAIR10
Match: AT2G40030.1 (AT2G40030.1 nuclear RNA polymerase D1B)

HSP 1 Score: 239.2 bits (609), Expect = 1.5e-62
Identity = 217/733 (29.60%), Postives = 330/733 (45.02%), Query Frame = 1

Query: 317  DLVYQQKKIKDTATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERL 376
            D+ Y   KI D+++S      ++ + + K S    R V+ GD    ++E+GIP  +A+R+
Sbjct: 290  DMRYGVSKISDSSSSKAWTEKMRTLFIRKGSGFSSRSVITGDAYRHVNEVGIPIEIAQRI 349

Query: 377  QISEHLSSWN----MKKLSTSCYLHLVEKGEIFVRREGRLVRVRNVLELNMGDTIYRPLA 436
               E +S  N     K +     L   +    +  R+G     +   EL  G  ++R + 
Sbjct: 350  TFEERVSVHNRGYLQKLVDDKLCLSYTQGSTTYSLRDGS----KGHTELKPGQVVHRRVM 409

Query: 437  DGDVVLVNRPPSIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLE 496
            DGDVV +NRPP+ H+HSL AL V +   ++V  +NPL CSP   DFDGDC+H + PQSL 
Sbjct: 410  DGDVVFINRPPTTHKHSLQALRVYVHEDNTV-KINPLMCSPLSADFDGDCVHLFYPQSLS 469

Query: 497  ARVEVRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQLQMLALHQ 556
            A+ EV EL S+++QL++  +G+ +L +  DSL +  +++E  V L+    QQL M     
Sbjct: 470  AKAEVMELFSVEKQLLSSHTGQLILQMGSDSLLSLRVMLE-RVFLDKATAQQLAMYGSLS 529

Query: 557  LLPPAILKAPLLRNCAWTGKQLFSTLLPPDFDYSSPSHCVLIENGELISSEGSYWLRDSG 616
            L PPA+ K+      AWT  Q+     P     S      L++  +L+  +       S 
Sbjct: 530  LPPPALRKSS-KSGPAWTVFQILQLAFPERL--SCKGDRFLVDGSDLLKFDFGVDAMGSI 589

Query: 617  RN--LFQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKNMMDDI 676
             N  +    +E    +TL +    Q +L E L   G S+SL DL     S S  +M  D+
Sbjct: 590  INEIVTSIFLEKGPKETLGFFDSLQPLLMESLFAEGFSLSLEDL-----SMSRADM--DV 649

Query: 677  FCGLQEAEETCNLKQLMVDAHKDILTEDDEDNQHVLSIAVDRLSYEKQKSAALNQASVDA 736
               L   E +  + +L +    ++  E+               S  K K  A N      
Sbjct: 650  IHNLIIREISPMVSRLRLSYRDELQLEN---------------SIHKVKEVAAN------ 709

Query: 737  FKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFSLPHK 796
            F      I+NL+     K NS +T           KLVQ +  LGLQ S     ++    
Sbjct: 710  FMLKSYSIRNLI---DIKSNSAIT-----------KLVQQTGFLGLQLSDKKKFYTKTLV 769

Query: 797  LTCSAWNSQKMPRYTQKDGLPDRTSSFIPYAVVESSFLSGLNPFECFAHSVTNRD--SSF 856
               + +  +K            R SS   + +V+  F  GL+P+E  AHS+  R+     
Sbjct: 770  EDMAIFCKRKY----------GRISSSGDFGIVKGCFFHGLDPYEEMAHSIAAREVIVRS 829

Query: 857  SDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELDSENNN 916
            S     PGTL + L  ++RDI    DGTVRN   N ++QF Y +          DSE  +
Sbjct: 830  SRGLAEPGTLFKNLMAVLRDIVITNDGTVRNTCSNSVIQFKYGV----------DSERGH 889

Query: 917  RDHDIGGHPVGSLAACAMSEAAYSALDQPISLLEASPLLN-----LKRVLEC--GSKRNS 976
            +     G PVG LAA AMS  AY A      +L++SP  N     +K VL C    +  +
Sbjct: 890  QGLFEAGEPVGVLAATAMSNPAYKA------VLDSSPNSNSSWELMKEVLLCKVNFQNTT 945

Query: 977  TKQTFSLFLSEKLSKRSYGFEYGALGVKNHLERVMFKDIVSNVMIIFSPQPSRKKHFSPW 1032
              +   L+L+E    + +  E  A  V+N L +V  KD     ++ +  QP+  + F   
Sbjct: 950  NDRRVILYLNECHCGKRFCQENAACTVRNKLNKVSLKDTAVEFLVEYRKQPTISEIFGID 945


HSP 2 Score: 72.4 bits (176), Expect = 2.4e-12
Identity = 40/130 (30.77%), Postives = 69/130 (53.08%), Query Frame = 1

Query: 1031 LESATLDIGKTIRLEHLLLVANSLSATGEFVGLNVKGLSHQREHALVKTPFMQACFSSPG 1090
            L ++   + K +  EH++L+AN+++ +G  +G N  G         +K PF +A   +P 
Sbjct: 1132 LSASVRMVSKGVLKEHIILLANNMTCSGTMLGFNSGGYKALTRSLNIKAPFTEATLIAPR 1191

Query: 1091 ACFVKAAKAGIKDNLSGSLDALAWGRIPSLGTGGQFDILYSGR--GHELNKPVDVYNLLG 1150
             CF KAA+    D+LS  + + +WG+   +GTG QF++L++ +  G +  +  DVY+ L 
Sbjct: 1192 KCFEKAAEKCHTDSLSTVVGSCSWGKRVDVGTGSQFELLWNQKETGLDDKEETDVYSFL- 1251

Query: 1151 GQSICEKQNA 1159
             Q +    NA
Sbjct: 1252 -QMVISTTNA 1259


HSP 3 Score: 70.5 bits (171), Expect = 9.0e-12
Identity = 36/108 (33.33%), Postives = 62/108 (57.41%), Query Frame = 1

Query: 51  MEDEQDGELPIPSGLVTGINFSVSTQQDT--ENIAVMTVDASSEVSDPKLGLPNPSYQCT 110
           ME+E   E  I  G + GI F++++  +   ++I+   ++  S++++  LGLP    +C 
Sbjct: 1   MEEESTSE--ILDGEIVGITFALASHHEICIQSISESAINHPSQLTNAFLGLPLEFGKCE 60

Query: 111 TCGASSLKSCEGHFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRQ 157
           +CGA+    CEGHFG I+ P  I HP  ++E+ Q+L+ +C  C  I++
Sbjct: 61  SCGATEPDKCEGHFGYIQLPVPIYHPAHVNELKQMLSLLCLKCLKIKK 106


HSP 4 Score: 54.3 bits (129), Expect = 6.7e-07
Identity = 39/142 (27.46%), Postives = 65/142 (45.77%), Query Frame = 1

Query: 1189 KKLDSVSKSILREFLTLNDIQKLSFALRTILHK--YSLNERLNEVDKS-TLMMALYFHPH 1248
            ++LDS +     E   L+D++ +   LR I+H   Y   + +++ DK+  L   L FHP 
Sbjct: 1727 QRLDSFTSE---EQELLSDVEPVMRTLRKIMHPSAYPDGDPISDDDKTFVLEKILNFHPQ 1786

Query: 1249 RDEKIGVGAQDIKLISVFVLNCGSNYVVDLDAVGNHSKYQNTRCFVLIRSDGTTEDFSYH 1308
            ++ K+G G                   VD   V  H+ + ++RCF ++ +DG  +DFSY 
Sbjct: 1787 KETKLGSG-------------------VDFITVDKHTIFSDSRCFFVVSTDGAKQDFSYR 1846

Query: 1309 KCVLGALEIIAPHRVKGYQSKW 1328
            K +   L    P R + +  K+
Sbjct: 1847 KSLNNYLMKKYPDRAEEFIDKY 1846

BLAST of ClCG00G000380 vs. TAIR10
Match: AT4G35800.1 (AT4G35800.1 RNA polymerase II large subunit)

HSP 1 Score: 176.8 bits (447), Expect = 8.9e-44
Identity = 163/607 (26.85%), Postives = 277/607 (45.63%), Query Frame = 1

Query: 338 IKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLH 397
           I+  ++GKR D   R V+  DP I + E+G+P  +A  L   E ++ +N+++L       
Sbjct: 347 IRGNLMGKRVDFSARTVITPDPTINIDELGVPWSIALNLTYPETVTPYNIERLK-----E 406

Query: 398 LVEKG----------EIFVRREGRLVRVRNVLE-----LNMGDTIYRPLADGDVVLVNRP 457
           LV+ G          +  +R +G+ + +R + +     L +G  + R L DGD VL NR 
Sbjct: 407 LVDYGPHPPPGKTGAKYIIRDDGQRLDLRYLKKSSDQHLELGYKVERHLQDGDFVLFNRQ 466

Query: 458 PSIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVS 517
           PS+H+ S++   ++++P S+   LN    SP+  DFDGD ++ +VPQS E R EV EL+ 
Sbjct: 467 PSLHKMSIMGHRIRIMPYST-FRLNLSVTSPYNADFDGDEMNMHVPQSFETRAEVLELMM 526

Query: 518 LDRQLINGQSGRNLLSLSHDSLTAAHLI--MEDGVSLNLFQMQQLQMLALHQLLP-PAIL 577
           + + +++ Q+ R ++ +  D+L     I   +  +  ++F    +        +P PAIL
Sbjct: 527 VPKCIVSPQANRPVMGIVQDTLLGCRKITKRDTFIEKDVFMNTLMWWEDFDGKVPAPAIL 586

Query: 578 KAPLLRNCAWTGKQLFSTLLPPDFD--------------YSSPSHC-VLIENGELISSE- 637
           K   L    WTGKQ+F+ ++P   +              + +P    V IE GEL++   
Sbjct: 587 KPRPL----WTGKQVFNLIIPKQINLLRYSAWHADTETGFITPGDTQVRIERGELLAGTL 646

Query: 638 GSYWLRDSGRNLFQALIEHC-EGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYS 697
               L  S  +L   + E         +L   Q ++  WL   G ++ +       D+ +
Sbjct: 647 CKKTLGTSNGSLVHVIWEEVGPDAARKFLGHTQWLVNYWLLQNGFTIGIG------DTIA 706

Query: 698 HKNMMDDIFCGLQEAEETCNLKQLMVDAHKDILTEDDEDNQHVLSIAVDRLSYEKQKSAA 757
             + M+ I       E   N K     A KD++ +              R ++E + +  
Sbjct: 707 DSSTMEKI------NETISNAK----TAVKDLIRQFQGKELDPEPGRTMRDTFENRVNQV 766

Query: 758 LNQASVDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVT 817
           LN+A  DA     + +         + N+L  M  AGSKG+ + + Q + C+G Q     
Sbjct: 767 LNKARDDAGSSAQKSL--------AETNNLKAMVTAGSKGSFINISQMTACVGQQ----- 826

Query: 818 LSFSLPHKLTCSAWNSQKMPRYTQKDGLPDRTSSFIPYAVVESSFLSGLNPFECFAHSVT 877
              ++  K     ++ + +P +T+ D  P+          VE+S+L GL P E F H++ 
Sbjct: 827 ---NVEGKRIPFGFDGRTLPHFTKDDYGPESR------GFVENSYLRGLTPQEFFFHAMG 886

Query: 878 NRDSSFSDNAEV--PGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSV-- 905
            R+       +    G + R+L   M DI   YDGTVRN+ G+ ++QF Y  D   +V  
Sbjct: 887 GREGLIDTAVKTSETGYIQRRLVKAMEDIMVKYDGTVRNSLGD-VIQFLYGEDGMDAVWI 904

BLAST of ClCG00G000380 vs. TAIR10
Match: AT5G60040.2 (AT5G60040.2 nuclear RNA polymerase C1)

HSP 1 Score: 114.0 bits (284), Expect = 7.1e-25
Identity = 88/253 (34.78%), Postives = 130/253 (51.38%), Query Frame = 1

Query: 344 GKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEK-- 403
           GKR +   R V+  DPN++++E+GIP  +A+ L   E +S  N++KL   C  +   K  
Sbjct: 369 GKRVEFTGRTVISPDPNLKITEVGIPILMAQILTFPECVSRHNIEKL-RQCVRNGPNKYP 428

Query: 404 GEIFVR----REGRLV---RVRNVLELNMGDTIYRPLADGDVVLVNRPPSIHQHSLIALS 463
           G   VR        LV   R R   EL +G  + R L +GDVVL NR PS+H+ S++   
Sbjct: 429 GARNVRYPDGSSRTLVGDYRKRIADELAIGCIVDRHLQEGDVVLFNRQPSLHRMSIMCHR 488

Query: 464 VKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVSLDRQLINGQSGR 523
            +++P  + L  N   C+P+  DFDGD ++ +VPQ+ EAR E   L+ +   L   ++G 
Sbjct: 489 ARIMPWRT-LRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAITLMGVQNNLCTPKNGE 548

Query: 524 NLLSLSHDSLTAAHLIME-----DGVSLNLFQMQQLQMLALHQLLPPAILKAPLLRNCAW 583
            L++ + D LT++ LI       D  + +L        +    L  P ILK   L    W
Sbjct: 549 ILVASTQDFLTSSFLITRKDTFYDRAAFSLICSYMGDGMDSIDLPTPTILKPIEL----W 608

BLAST of ClCG00G000380 vs. TAIR10
Match: AT3G57660.1 (AT3G57660.1 nuclear RNA polymerase A1)

HSP 1 Score: 100.1 bits (248), Expect = 1.1e-20
Identity = 65/230 (28.26%), Postives = 108/230 (46.96%), Query Frame = 1

Query: 342 VLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNMKKLSTSCY------ 401
           ++GKR +H  R V+  DP I +++IGIP   A +L   E ++ WN++KL  +        
Sbjct: 441 MMGKRVNHACRSVISPDPYIAVNDIGIPPCFALKLTYPERVTPWNVEKLREAIINGPDIH 500

Query: 402 ---LHLVEKGEI----------------FVRREGRLVRVRNVLELNM-GDTIYRPLADGD 461
               H  +K                    +   G    +    ++N  G T++R + DGD
Sbjct: 501 PGATHYSDKSSTMKLPSTEKARRAIARKLLSSRGATTELGKTCDINFEGKTVHRHMRDGD 560

Query: 462 VVLVNRPPSIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARV 521
           +VLVNR P++H+ SL+A  V++L     L L+   CS +  DFDGD ++ + PQ   +R 
Sbjct: 561 IVLVNRQPTLHKPSLMAHKVRVLKGEKTLRLHYANCSTYNADFDGDEMNVHFPQDEISRA 620

Query: 522 EVRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQL 546
           E   +V+ + Q     +G  L +L  D + ++ L+ +    L+     QL
Sbjct: 621 EAYNIVNANNQYARPSNGEPLRALIQDHIVSSVLLTKRDTFLDKDHFNQL 670

BLAST of ClCG00G000380 vs. NCBI nr
Match: gi|778675679|ref|XP_011650451.1| (PREDICTED: DNA-directed RNA polymerase IV subunit 1 isoform X3 [Cucumis sativus])

HSP 1 Score: 1368.2 bits (3540), Expect = 0.0e+00
Identity = 679/724 (93.78%), Postives = 701/724 (96.82%), Query Frame = 1

Query: 308  LSPEKLQSKDLVYQQKKIKDTATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIG 367
            LSPEKLQ+KDLVYQQKKIKDTATSS GLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIG
Sbjct: 184  LSPEKLQNKDLVYQQKKIKDTATSSSGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIG 243

Query: 368  IPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEKGEIFVRREGRLVRVRNVLELNMGDTI 427
            IPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEKGEI+VRREGRLVRVRNVLELNMGDTI
Sbjct: 244  IPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEKGEIYVRREGRLVRVRNVLELNMGDTI 303

Query: 428  YRPLADGDVVLVNRPPSIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYV 487
            YRPLADGD+VLVNRPPSIHQHSLIALSVKLLPVS+VLSLNPLCCSPFRGDFDGDCLHGYV
Sbjct: 304  YRPLADGDIVLVNRPPSIHQHSLIALSVKLLPVSAVLSLNPLCCSPFRGDFDGDCLHGYV 363

Query: 488  PQSLEARVEVRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQLQM 547
            PQSLEARVEVRELVSLD+QL NGQSGRNLLSLSHDSLTAAHLI+EDGVSLNLFQMQQLQM
Sbjct: 364  PQSLEARVEVRELVSLDKQLTNGQSGRNLLSLSHDSLTAAHLILEDGVSLNLFQMQQLQM 423

Query: 548  LALHQLLPPAILKAPLLRNCAWTGKQLFSTLLPPDFDYSSPSHCVLIENGELISSEGSYW 607
            L LHQLLPPAI+K+PLLRNCAWTGKQLFS LLPPDFDYSSPSH V IE GELISSEGSYW
Sbjct: 424  LTLHQLLPPAIVKSPLLRNCAWTGKQLFSILLPPDFDYSSPSHNVFIEKGELISSEGSYW 483

Query: 608  LRDSGRNLFQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKNMM 667
            LRDSGRNLFQALIEHCEGKTLDYL DAQGVLCEWLS RGLSVSLSDLYLSVDSYSH+NMM
Sbjct: 484  LRDSGRNLFQALIEHCEGKTLDYLRDAQGVLCEWLSTRGLSVSLSDLYLSVDSYSHENMM 543

Query: 668  DDIFCGLQEAEETCNLKQLMVDAHKDILTEDDEDNQHVLSIAVDRLSYEKQKSAALNQAS 727
            DDIFCGLQEAEETCNLKQLMVD+HK+IL  +DEDNQH+LSIAV+RL YEKQKSAALNQAS
Sbjct: 544  DDIFCGLQEAEETCNLKQLMVDSHKEILIGNDEDNQHLLSIAVERLIYEKQKSAALNQAS 603

Query: 728  VDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFSL 787
            VDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNL+KLVQHSMCLGLQHSLVTLSFSL
Sbjct: 604  VDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLMKLVQHSMCLGLQHSLVTLSFSL 663

Query: 788  PHKLTCSAWNSQKMPRYTQKDGLPDRTSSFIPYAVVESSFLSGLNPFECFAHSVTNRDSS 847
            PHKL+C+AWNSQKMPRY QKDGLPDRT SFIPYAVVE+SFLSGLNPFECFAHSVTNRDSS
Sbjct: 664  PHKLSCAAWNSQKMPRYIQKDGLPDRTQSFIPYAVVENSFLSGLNPFECFAHSVTNRDSS 723

Query: 848  FSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELDSENN 907
            FSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQF YDIDRPTS   E +SENN
Sbjct: 724  FSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFCYDIDRPTS---ESESENN 783

Query: 908  NRDHDIGGHPVGSLAACAMSEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFS 967
            NRD  IGGHPVGSLAACA+SEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFS
Sbjct: 784  NRDRGIGGHPVGSLAACAISEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFS 843

Query: 968  LFLSEKLSKRSYGFEYGALGVKNHLERVMFKDIVSNVMIIFSPQPSRKKHFSPWVCHFHV 1027
            LFLSEKLSKRSYGFEYGALGVKNHLERVMFKDIVS+VMIIFSP PSRKKHFSPWVCHFHV
Sbjct: 844  LFLSEKLSKRSYGFEYGALGVKNHLERVMFKDIVSSVMIIFSPLPSRKKHFSPWVCHFHV 903

Query: 1028 CKSL 1032
            CK +
Sbjct: 904  CKEI 904

BLAST of ClCG00G000380 vs. NCBI nr
Match: gi|778675679|ref|XP_011650451.1| (PREDICTED: DNA-directed RNA polymerase IV subunit 1 isoform X3 [Cucumis sativus])

HSP 1 Score: 528.9 bits (1361), Expect = 2.6e-146
Identity = 267/301 (88.70%), Postives = 277/301 (92.03%), Query Frame = 1

Query: 1030 SLESATLDIGKTIRLEHLLLVANSLSATGEFVGLNVKGLSHQREHALVKTPFMQACFSSP 1089
            SLESATLD+GKTIRLEHLLLV+NSLSATGEFVGLNVKGL+HQREHALVKTPFMQACFSSP
Sbjct: 1077 SLESATLDVGKTIRLEHLLLVSNSLSATGEFVGLNVKGLTHQREHALVKTPFMQACFSSP 1136

Query: 1090 GACFVKAAKAGIKDNLSGSLDALAWGRIPSLGTGGQFDILYSGRGHELNKPVDVYNLLGG 1149
            GAC +KAAKAGIKDNLSGSLDALAWGR+PSLGTGGQFDILYSG+GHELNKPVDVYNLLGG
Sbjct: 1137 GACMIKAAKAGIKDNLSGSLDALAWGRMPSLGTGGQFDILYSGKGHELNKPVDVYNLLGG 1196

Query: 1150 QSICEKQNAKIESLDKNNISEKYSAQLVLINGGSTIKGLKKLDSVSKSILREFLTLNDIQ 1209
            QS CEKQN KIESLDKN ISEKYSAQL+L NGGSTIKGLK+LDSVSKSILR+FLTLNDIQ
Sbjct: 1197 QSTCEKQNTKIESLDKNTISEKYSAQLMLKNGGSTIKGLKRLDSVSKSILRKFLTLNDIQ 1256

Query: 1210 KLSFALRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIKLISVFVLNCGS 1269
            KLSFALRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIK           
Sbjct: 1257 KLSFALRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIK----------- 1316

Query: 1270 NYVVDLDAVGNHSKYQNTRCFVLIRSDGTTEDFSYHKCVLGALEIIAPHRVKGYQSKWMQ 1329
                    VG+HSKYQNTRCFVLIRSDGTTEDFSYHKCVLGALEIIAPHRVKGYQSKWMQ
Sbjct: 1317 --------VGSHSKYQNTRCFVLIRSDGTTEDFSYHKCVLGALEIIAPHRVKGYQSKWMQ 1358

Query: 1330 E 1331
            E
Sbjct: 1377 E 1358


HSP 2 Score: 305.8 bits (782), Expect = 3.6e-79
Identity = 143/148 (96.62%), Postives = 147/148 (99.32%), Query Frame = 1

Query: 160 GKVEDPTSEYHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSKKYQKR 219
           G+VEDPTS+Y+RPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSKKYQKR
Sbjct: 4   GQVEDPTSDYNRPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSKKYQKR 63

Query: 220 VAKGGLPSDYWNFIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPAIDSLF 279
           VAKGGLPSDYW+FIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPAIDSLF
Sbjct: 64  VAKGGLPSDYWDFIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPAIDSLF 123

Query: 280 LNSFPVTPNSHRVTEMTHSFSNGQRLIF 308
           LNSFPVTPNSHRVTEM HSFSNGQRLIF
Sbjct: 124 LNSFPVTPNSHRVTEMAHSFSNGQRLIF 151


HSP 3 Score: 1368.2 bits (3540), Expect = 0.0e+00
Identity = 679/724 (93.78%), Postives = 701/724 (96.82%), Query Frame = 1

Query: 308  LSPEKLQSKDLVYQQKKIKDTATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIG 367
            LSPEKLQ+KDLVYQQKKIKDTATSS GLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIG
Sbjct: 306  LSPEKLQNKDLVYQQKKIKDTATSSSGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIG 365

Query: 368  IPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEKGEIFVRREGRLVRVRNVLELNMGDTI 427
            IPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEKGEI+VRREGRLVRVRNVLELNMGDTI
Sbjct: 366  IPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEKGEIYVRREGRLVRVRNVLELNMGDTI 425

Query: 428  YRPLADGDVVLVNRPPSIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYV 487
            YRPLADGD+VLVNRPPSIHQHSLIALSVKLLPVS+VLSLNPLCCSPFRGDFDGDCLHGYV
Sbjct: 426  YRPLADGDIVLVNRPPSIHQHSLIALSVKLLPVSAVLSLNPLCCSPFRGDFDGDCLHGYV 485

Query: 488  PQSLEARVEVRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQLQM 547
            PQSLEARVEVRELVSLD+QL NGQSGRNLLSLSHDSLTAAHLI+EDGVSLNLFQMQQLQM
Sbjct: 486  PQSLEARVEVRELVSLDKQLTNGQSGRNLLSLSHDSLTAAHLILEDGVSLNLFQMQQLQM 545

Query: 548  LALHQLLPPAILKAPLLRNCAWTGKQLFSTLLPPDFDYSSPSHCVLIENGELISSEGSYW 607
            L LHQLLPPAI+K+PLLRNCAWTGKQLFS LLPPDFDYSSPSH V IE GELISSEGSYW
Sbjct: 546  LTLHQLLPPAIVKSPLLRNCAWTGKQLFSILLPPDFDYSSPSHNVFIEKGELISSEGSYW 605

Query: 608  LRDSGRNLFQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKNMM 667
            LRDSGRNLFQALIEHCEGKTLDYL DAQGVLCEWLS RGLSVSLSDLYLSVDSYSH+NMM
Sbjct: 606  LRDSGRNLFQALIEHCEGKTLDYLRDAQGVLCEWLSTRGLSVSLSDLYLSVDSYSHENMM 665

Query: 668  DDIFCGLQEAEETCNLKQLMVDAHKDILTEDDEDNQHVLSIAVDRLSYEKQKSAALNQAS 727
            DDIFCGLQEAEETCNLKQLMVD+HK+IL  +DEDNQH+LSIAV+RL YEKQKSAALNQAS
Sbjct: 666  DDIFCGLQEAEETCNLKQLMVDSHKEILIGNDEDNQHLLSIAVERLIYEKQKSAALNQAS 725

Query: 728  VDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFSL 787
            VDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNL+KLVQHSMCLGLQHSLVTLSFSL
Sbjct: 726  VDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLMKLVQHSMCLGLQHSLVTLSFSL 785

Query: 788  PHKLTCSAWNSQKMPRYTQKDGLPDRTSSFIPYAVVESSFLSGLNPFECFAHSVTNRDSS 847
            PHKL+C+AWNSQKMPRY QKDGLPDRT SFIPYAVVE+SFLSGLNPFECFAHSVTNRDSS
Sbjct: 786  PHKLSCAAWNSQKMPRYIQKDGLPDRTQSFIPYAVVENSFLSGLNPFECFAHSVTNRDSS 845

Query: 848  FSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELDSENN 907
            FSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQF YDIDRPTS   E +SENN
Sbjct: 846  FSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFCYDIDRPTS---ESESENN 905

Query: 908  NRDHDIGGHPVGSLAACAMSEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFS 967
            NRD  IGGHPVGSLAACA+SEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFS
Sbjct: 906  NRDRGIGGHPVGSLAACAISEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFS 965

Query: 968  LFLSEKLSKRSYGFEYGALGVKNHLERVMFKDIVSNVMIIFSPQPSRKKHFSPWVCHFHV 1027
            LFLSEKLSKRSYGFEYGALGVKNHLERVMFKDIVS+VMIIFSP PSRKKHFSPWVCHFHV
Sbjct: 966  LFLSEKLSKRSYGFEYGALGVKNHLERVMFKDIVSSVMIIFSPLPSRKKHFSPWVCHFHV 1025

Query: 1028 CKSL 1032
            CK +
Sbjct: 1026 CKEI 1026

BLAST of ClCG00G000380 vs. NCBI nr
Match: gi|778675668|ref|XP_011650447.1| (PREDICTED: DNA-directed RNA polymerase IV subunit 1 isoform X1 [Cucumis sativus])

HSP 1 Score: 528.9 bits (1361), Expect = 2.6e-146
Identity = 267/301 (88.70%), Postives = 277/301 (92.03%), Query Frame = 1

Query: 1030 SLESATLDIGKTIRLEHLLLVANSLSATGEFVGLNVKGLSHQREHALVKTPFMQACFSSP 1089
            SLESATLD+GKTIRLEHLLLV+NSLSATGEFVGLNVKGL+HQREHALVKTPFMQACFSSP
Sbjct: 1199 SLESATLDVGKTIRLEHLLLVSNSLSATGEFVGLNVKGLTHQREHALVKTPFMQACFSSP 1258

Query: 1090 GACFVKAAKAGIKDNLSGSLDALAWGRIPSLGTGGQFDILYSGRGHELNKPVDVYNLLGG 1149
            GAC +KAAKAGIKDNLSGSLDALAWGR+PSLGTGGQFDILYSG+GHELNKPVDVYNLLGG
Sbjct: 1259 GACMIKAAKAGIKDNLSGSLDALAWGRMPSLGTGGQFDILYSGKGHELNKPVDVYNLLGG 1318

Query: 1150 QSICEKQNAKIESLDKNNISEKYSAQLVLINGGSTIKGLKKLDSVSKSILREFLTLNDIQ 1209
            QS CEKQN KIESLDKN ISEKYSAQL+L NGGSTIKGLK+LDSVSKSILR+FLTLNDIQ
Sbjct: 1319 QSTCEKQNTKIESLDKNTISEKYSAQLMLKNGGSTIKGLKRLDSVSKSILRKFLTLNDIQ 1378

Query: 1210 KLSFALRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIKLISVFVLNCGS 1269
            KLSFALRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIK           
Sbjct: 1379 KLSFALRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIK----------- 1438

Query: 1270 NYVVDLDAVGNHSKYQNTRCFVLIRSDGTTEDFSYHKCVLGALEIIAPHRVKGYQSKWMQ 1329
                    VG+HSKYQNTRCFVLIRSDGTTEDFSYHKCVLGALEIIAPHRVKGYQSKWMQ
Sbjct: 1439 --------VGSHSKYQNTRCFVLIRSDGTTEDFSYHKCVLGALEIIAPHRVKGYQSKWMQ 1480

Query: 1330 E 1331
            E
Sbjct: 1499 E 1480


HSP 2 Score: 526.9 bits (1356), Expect = 9.9e-146
Identity = 249/261 (95.40%), Postives = 257/261 (98.47%), Query Frame = 1

Query: 47  VMIHMEDEQDGELPIPSGLVTGINFSVSTQQDTENIAVMTVDASSEVSDPKLGLPNPSYQ 106
           VMIHMEDEQDGELPIPSGL+TGINFSVS QQD ENIAV+TVDA++EVSDPKLGLPNPSYQ
Sbjct: 13  VMIHMEDEQDGELPIPSGLLTGINFSVSNQQDIENIAVITVDAANEVSDPKLGLPNPSYQ 72

Query: 107 CTTCGASSLKSCEGHFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRQELWGKVEDPT 166
           CTTCGASSLK CEGHFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKS+RQELWGKVEDPT
Sbjct: 73  CTTCGASSLKFCEGHFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKSVRQELWGKVEDPT 132

Query: 167 SEYHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSKKYQKRVAKGGLP 226
           S+Y+RPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSKKYQKRVAKGGLP
Sbjct: 133 SDYNRPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSKKYQKRVAKGGLP 192

Query: 227 SDYWNFIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPAIDSLFLNSFPVT 286
           SDYW+FIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPAIDSLFLNSFPVT
Sbjct: 193 SDYWDFIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPAIDSLFLNSFPVT 252

Query: 287 PNSHRVTEMTHSFSNGQRLIF 308
           PNSHRVTEM HSFSNGQRLIF
Sbjct: 253 PNSHRVTEMAHSFSNGQRLIF 273


HSP 3 Score: 1368.2 bits (3540), Expect = 0.0e+00
Identity = 679/724 (93.78%), Postives = 701/724 (96.82%), Query Frame = 1

Query: 308  LSPEKLQSKDLVYQQKKIKDTATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIG 367
            LSPEKLQ+KDLVYQQKKIKDTATSS GLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIG
Sbjct: 185  LSPEKLQNKDLVYQQKKIKDTATSSSGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIG 244

Query: 368  IPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEKGEIFVRREGRLVRVRNVLELNMGDTI 427
            IPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEKGEI+VRREGRLVRVRNVLELNMGDTI
Sbjct: 245  IPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEKGEIYVRREGRLVRVRNVLELNMGDTI 304

Query: 428  YRPLADGDVVLVNRPPSIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYV 487
            YRPLADGD+VLVNRPPSIHQHSLIALSVKLLPVS+VLSLNPLCCSPFRGDFDGDCLHGYV
Sbjct: 305  YRPLADGDIVLVNRPPSIHQHSLIALSVKLLPVSAVLSLNPLCCSPFRGDFDGDCLHGYV 364

Query: 488  PQSLEARVEVRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQLQM 547
            PQSLEARVEVRELVSLD+QL NGQSGRNLLSLSHDSLTAAHLI+EDGVSLNLFQMQQLQM
Sbjct: 365  PQSLEARVEVRELVSLDKQLTNGQSGRNLLSLSHDSLTAAHLILEDGVSLNLFQMQQLQM 424

Query: 548  LALHQLLPPAILKAPLLRNCAWTGKQLFSTLLPPDFDYSSPSHCVLIENGELISSEGSYW 607
            L LHQLLPPAI+K+PLLRNCAWTGKQLFS LLPPDFDYSSPSH V IE GELISSEGSYW
Sbjct: 425  LTLHQLLPPAIVKSPLLRNCAWTGKQLFSILLPPDFDYSSPSHNVFIEKGELISSEGSYW 484

Query: 608  LRDSGRNLFQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKNMM 667
            LRDSGRNLFQALIEHCEGKTLDYL DAQGVLCEWLS RGLSVSLSDLYLSVDSYSH+NMM
Sbjct: 485  LRDSGRNLFQALIEHCEGKTLDYLRDAQGVLCEWLSTRGLSVSLSDLYLSVDSYSHENMM 544

Query: 668  DDIFCGLQEAEETCNLKQLMVDAHKDILTEDDEDNQHVLSIAVDRLSYEKQKSAALNQAS 727
            DDIFCGLQEAEETCNLKQLMVD+HK+IL  +DEDNQH+LSIAV+RL YEKQKSAALNQAS
Sbjct: 545  DDIFCGLQEAEETCNLKQLMVDSHKEILIGNDEDNQHLLSIAVERLIYEKQKSAALNQAS 604

Query: 728  VDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFSL 787
            VDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNL+KLVQHSMCLGLQHSLVTLSFSL
Sbjct: 605  VDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLMKLVQHSMCLGLQHSLVTLSFSL 664

Query: 788  PHKLTCSAWNSQKMPRYTQKDGLPDRTSSFIPYAVVESSFLSGLNPFECFAHSVTNRDSS 847
            PHKL+C+AWNSQKMPRY QKDGLPDRT SFIPYAVVE+SFLSGLNPFECFAHSVTNRDSS
Sbjct: 665  PHKLSCAAWNSQKMPRYIQKDGLPDRTQSFIPYAVVENSFLSGLNPFECFAHSVTNRDSS 724

Query: 848  FSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELDSENN 907
            FSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQF YDIDRPTS   E +SENN
Sbjct: 725  FSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFCYDIDRPTS---ESESENN 784

Query: 908  NRDHDIGGHPVGSLAACAMSEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFS 967
            NRD  IGGHPVGSLAACA+SEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFS
Sbjct: 785  NRDRGIGGHPVGSLAACAISEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFS 844

Query: 968  LFLSEKLSKRSYGFEYGALGVKNHLERVMFKDIVSNVMIIFSPQPSRKKHFSPWVCHFHV 1027
            LFLSEKLSKRSYGFEYGALGVKNHLERVMFKDIVS+VMIIFSP PSRKKHFSPWVCHFHV
Sbjct: 845  LFLSEKLSKRSYGFEYGALGVKNHLERVMFKDIVSSVMIIFSPLPSRKKHFSPWVCHFHV 904

Query: 1028 CKSL 1032
            CK +
Sbjct: 905  CKEI 905

BLAST of ClCG00G000380 vs. NCBI nr
Match: gi|778675676|ref|XP_011650450.1| (PREDICTED: DNA-directed RNA polymerase IV subunit 1 isoform X2 [Cucumis sativus])

HSP 1 Score: 528.9 bits (1361), Expect = 2.6e-146
Identity = 267/301 (88.70%), Postives = 277/301 (92.03%), Query Frame = 1

Query: 1030 SLESATLDIGKTIRLEHLLLVANSLSATGEFVGLNVKGLSHQREHALVKTPFMQACFSSP 1089
            SLESATLD+GKTIRLEHLLLV+NSLSATGEFVGLNVKGL+HQREHALVKTPFMQACFSSP
Sbjct: 1078 SLESATLDVGKTIRLEHLLLVSNSLSATGEFVGLNVKGLTHQREHALVKTPFMQACFSSP 1137

Query: 1090 GACFVKAAKAGIKDNLSGSLDALAWGRIPSLGTGGQFDILYSGRGHELNKPVDVYNLLGG 1149
            GAC +KAAKAGIKDNLSGSLDALAWGR+PSLGTGGQFDILYSG+GHELNKPVDVYNLLGG
Sbjct: 1138 GACMIKAAKAGIKDNLSGSLDALAWGRMPSLGTGGQFDILYSGKGHELNKPVDVYNLLGG 1197

Query: 1150 QSICEKQNAKIESLDKNNISEKYSAQLVLINGGSTIKGLKKLDSVSKSILREFLTLNDIQ 1209
            QS CEKQN KIESLDKN ISEKYSAQL+L NGGSTIKGLK+LDSVSKSILR+FLTLNDIQ
Sbjct: 1198 QSTCEKQNTKIESLDKNTISEKYSAQLMLKNGGSTIKGLKRLDSVSKSILRKFLTLNDIQ 1257

Query: 1210 KLSFALRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIKLISVFVLNCGS 1269
            KLSFALRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIK           
Sbjct: 1258 KLSFALRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIK----------- 1317

Query: 1270 NYVVDLDAVGNHSKYQNTRCFVLIRSDGTTEDFSYHKCVLGALEIIAPHRVKGYQSKWMQ 1329
                    VG+HSKYQNTRCFVLIRSDGTTEDFSYHKCVLGALEIIAPHRVKGYQSKWMQ
Sbjct: 1318 --------VGSHSKYQNTRCFVLIRSDGTTEDFSYHKCVLGALEIIAPHRVKGYQSKWMQ 1359

Query: 1330 E 1331
            E
Sbjct: 1378 E 1359


HSP 2 Score: 303.5 bits (776), Expect = 1.8e-78
Identity = 142/147 (96.60%), Postives = 146/147 (99.32%), Query Frame = 1

Query: 161 KVEDPTSEYHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSKKYQKRV 220
           +VEDPTS+Y+RPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSKKYQKRV
Sbjct: 6   QVEDPTSDYNRPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSKKYQKRV 65

Query: 221 AKGGLPSDYWNFIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPAIDSLFL 280
           AKGGLPSDYW+FIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPAIDSLFL
Sbjct: 66  AKGGLPSDYWDFIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPAIDSLFL 125

Query: 281 NSFPVTPNSHRVTEMTHSFSNGQRLIF 308
           NSFPVTPNSHRVTEM HSFSNGQRLIF
Sbjct: 126 NSFPVTPNSHRVTEMAHSFSNGQRLIF 152


HSP 3 Score: 1353.2 bits (3501), Expect = 0.0e+00
Identity = 672/709 (94.78%), Postives = 691/709 (97.46%), Query Frame = 1

Query: 323  KKIKDTATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHL 382
            KKIKDTATSS GLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHL
Sbjct: 325  KKIKDTATSSSGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHL 384

Query: 383  SSWNMKKLSTSCYLHLVEKGEIFVRREGRLVRVRNVLELNMGDTIYRPLADGDVVLVNRP 442
            SSWNMKKLSTSCYLHLVEKGEI+VRREGRLVRVRNVLELNMGDTIYRPLADGDVVLVNRP
Sbjct: 385  SSWNMKKLSTSCYLHLVEKGEIYVRREGRLVRVRNVLELNMGDTIYRPLADGDVVLVNRP 444

Query: 443  PSIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVS 502
            PSIHQHSLIALSVKLLPVS+VLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVS
Sbjct: 445  PSIHQHSLIALSVKLLPVSAVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVS 504

Query: 503  LDRQLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQLQMLALHQLLPPAILKAP 562
            LDRQLINGQSGRNLLSLSHDSLTAAHLI+EDGVSLNLFQMQQLQML LHQLLPPAI+K+P
Sbjct: 505  LDRQLINGQSGRNLLSLSHDSLTAAHLILEDGVSLNLFQMQQLQMLTLHQLLPPAIVKSP 564

Query: 563  LLRNCAWTGKQLFSTLLPPDFDYSSPSHCVLIENGELISSEGSYWLRDSGRNLFQALIEH 622
            LLRNCAWTGKQLFS LLPPDF+YSSPSH V IE GELISSEGSYWLRDSGRNLFQALIEH
Sbjct: 565  LLRNCAWTGKQLFSILLPPDFEYSSPSHNVFIEKGELISSEGSYWLRDSGRNLFQALIEH 624

Query: 623  CEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKNMMDDIFCGLQEAEETCN 682
            CEGKTLDYL DAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKNMMDDIFCGLQEAEETCN
Sbjct: 625  CEGKTLDYLRDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKNMMDDIFCGLQEAEETCN 684

Query: 683  LKQLMVDAHKDILTEDDEDNQHVLSIAVDRLSYEKQKSAALNQASVDAFKKVFRDIQNLV 742
            LKQLMVD+HK+ILT +DEDNQH+LSIAV+ L YEKQKSAALNQASVDAFKKVFRDIQNLV
Sbjct: 685  LKQLMVDSHKEILTGNDEDNQHLLSIAVEHLIYEKQKSAALNQASVDAFKKVFRDIQNLV 744

Query: 743  YKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFSLPHKLTCSAWNSQKMP 802
            +KYSGKDNSLLTMFKAGSKGNL+KLVQHSMCLGLQHSLVTLSFSLPHKL+CSAWNSQKMP
Sbjct: 745  HKYSGKDNSLLTMFKAGSKGNLMKLVQHSMCLGLQHSLVTLSFSLPHKLSCSAWNSQKMP 804

Query: 803  RYTQKDGLPDRTSSFIPYAVVESSFLSGLNPFECFAHSVTNRDSSFSDNAEVPGTLTRKL 862
            RY Q+DGLPDRT SFIPYAVVE+SFLSGLNPFECFAHSVTNRDSSFSDNAEVPGTLTRKL
Sbjct: 805  RYIQEDGLPDRTQSFIPYAVVENSFLSGLNPFECFAHSVTNRDSSFSDNAEVPGTLTRKL 864

Query: 863  TFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELDSENNNRDHDIGGHPVGSLA 922
            TFLMRDIYTAYDGTVRNAYGNQLVQF YDIDRPTSVS+E DSE NNRD DIGGHPVGSLA
Sbjct: 865  TFLMRDIYTAYDGTVRNAYGNQLVQFCYDIDRPTSVSSESDSE-NNRDRDIGGHPVGSLA 924

Query: 923  ACAMSEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFSLFLSEKLSKRSYGFE 982
            ACA SEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFSLFLSEKLSKRSYGFE
Sbjct: 925  ACAFSEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFSLFLSEKLSKRSYGFE 984

Query: 983  YGALGVKNHLERVMFKDIVSNVMIIFSPQPSRKKHFSPWVCHFHVCKSL 1032
            YGALGVKNHLERVMFKDIVS+VMIIFSPQPSRKKHFSPWVCHFHVCK +
Sbjct: 985  YGALGVKNHLERVMFKDIVSSVMIIFSPQPSRKKHFSPWVCHFHVCKDI 1032

BLAST of ClCG00G000380 vs. NCBI nr
Match: gi|659096195|ref|XP_008448971.1| (PREDICTED: LOW QUALITY PROTEIN: DNA-directed RNA polymerase IV subunit 1 [Cucumis melo])

HSP 1 Score: 532.3 bits (1370), Expect = 2.4e-147
Identity = 270/301 (89.70%), Postives = 278/301 (92.36%), Query Frame = 1

Query: 1030 SLESATLDIGKTIRLEHLLLVANSLSATGEFVGLNVKGLSHQREHALVKTPFMQACFSSP 1089
            SLE ATLDIGKTIRLEHLLLVANSLSATGEFVGLNVKGL+HQREHALVKTPFMQACFSSP
Sbjct: 1205 SLECATLDIGKTIRLEHLLLVANSLSATGEFVGLNVKGLTHQREHALVKTPFMQACFSSP 1264

Query: 1090 GACFVKAAKAGIKDNLSGSLDALAWGRIPSLGTGGQFDILYSGRGHELNKPVDVYNLLGG 1149
            GAC +KAAKAGIKDNLSGSLDALAWGR+PSLGTGGQFDILYSG+GHELNKPVDVYNLLGG
Sbjct: 1265 GACLIKAAKAGIKDNLSGSLDALAWGRMPSLGTGGQFDILYSGKGHELNKPVDVYNLLGG 1324

Query: 1150 QSICEKQNAKIESLDKNNISEKYSAQLVLINGGSTIKGLKKLDSVSKSILREFLTLNDIQ 1209
            QS CEKQNAKIES+DKNNISEKYSAQLVL NGGSTIKGLK+LDSVSKSILR+FLTLNDIQ
Sbjct: 1325 QSTCEKQNAKIESVDKNNISEKYSAQLVLKNGGSTIKGLKRLDSVSKSILRKFLTLNDIQ 1384

Query: 1210 KLSFALRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIKLISVFVLNCGS 1269
            KLSFALRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIK           
Sbjct: 1385 KLSFALRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIK----------- 1444

Query: 1270 NYVVDLDAVGNHSKYQNTRCFVLIRSDGTTEDFSYHKCVLGALEIIAPHRVKGYQSKWMQ 1329
                    VG+HSKYQNTRCFVLIRSDGTTEDFSYHKCVLGALEIIAPHRVKGYQSKWMQ
Sbjct: 1445 --------VGSHSKYQNTRCFVLIRSDGTTEDFSYHKCVLGALEIIAPHRVKGYQSKWMQ 1486

Query: 1330 E 1331
            E
Sbjct: 1505 E 1486


HSP 2 Score: 526.2 bits (1354), Expect = 1.7e-145
Identity = 250/261 (95.79%), Postives = 256/261 (98.08%), Query Frame = 1

Query: 47  VMIHMEDEQDGELPIPSGLVTGINFSVSTQQDTENIAVMTVDASSEVSDPKLGLPNPSYQ 106
           VMIHMEDEQDGELPIPSG +TGINFSVS QQD ENIAV+TVDA+SEVSDPKLGLPNPSYQ
Sbjct: 13  VMIHMEDEQDGELPIPSGRLTGINFSVSNQQDIENIAVITVDAASEVSDPKLGLPNPSYQ 72

Query: 107 CTTCGASSLKSCEGHFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRQELWGKVEDPT 166
           CTTCGASSLK CEGHFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRQELWGKVEDPT
Sbjct: 73  CTTCGASSLKFCEGHFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRQELWGKVEDPT 132

Query: 167 SEYHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSKKYQKRVAKGGLP 226
           S+Y+RPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSKKYQKRVAKGGLP
Sbjct: 133 SDYNRPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSKKYQKRVAKGGLP 192

Query: 227 SDYWNFIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPAIDSLFLNSFPVT 286
           SDYW+FIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPAIDSLFLNSFPVT
Sbjct: 193 SDYWDFIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPAIDSLFLNSFPVT 252

Query: 287 PNSHRVTEMTHSFSNGQRLIF 308
           PNSHRVTEM HSFSNGQRLIF
Sbjct: 253 PNSHRVTEMAHSFSNGQRLIF 273


HSP 3 Score: 1287.7 bits (3331), Expect = 0.0e+00
Identity = 629/1004 (62.65%), Postives = 785/1004 (78.19%), Query Frame = 1

Query: 51   MEDEQDGELPIPSGLVTGINFSVSTQQDTENIAVMTVDASSEVSDPKLGLPNPSYQCTTC 110
            M+++   E  +PSGL+ GI F VST++D E I+VM +DA +E++DPKLG+PNPS QC+TC
Sbjct: 1    MDNDFLEEQQVPSGLLIGIKFDVSTEEDMEKISVMKIDAVNEITDPKLGVPNPSCQCSTC 60

Query: 111  GASSLKSCEGHFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRQELWGKVEDPTSEYH 170
            GA   K CEGHFGVIKFP+TI+HPYFL+EV Q+LNK+CPGCKS RQ  W K  D  S   
Sbjct: 61   GAKDTKKCEGHFGVIKFPFTILHPYFLTEVVQILNKICPGCKSTRQGQWVKGADSGSRRL 120

Query: 171  RPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSKKYQKRVAKGGLPSDYW 230
            R KGC+YC  +  DWYP M+FK+S+ D+F+K+ I+VE+ E + KK QK+  +  LP DYW
Sbjct: 121  RSKGCKYCAANSNDWYPTMKFKVSSKDLFRKTAIIVEMNEKLPKKLQKKSFRPVLPLDYW 180

Query: 231  NFIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPAIDSLFLNSFPVTPNSH 290
            +FIPKD QQEE+   PNR++L+HAQVHYLLKDIDP F+K+FV  +DS FLN  PVTPN+H
Sbjct: 181  DFIPKDPQQEENCLNPNRRVLSHAQVHYLLKDIDPGFIKEFVSRMDSFFLNCLPVTPNNH 240

Query: 291  RVTEMTHSFSNGQRLIFLSPEK----------------------LQSKDLVYQQKKIKDT 350
            RVTE+TH+ SNGQ LIF    +                      L++  L  ++   KD+
Sbjct: 241  RVTEITHALSNGQTLIFDQHSRAYKKLVDFRGTANELSCRVLDCLKTSKLRSEKSTSKDS 300

Query: 351  ATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNMK 410
            A+   GL+WIK+V+LGKR++H FRM+VVGDP + LSEIGIPCH+AE L ISEHL+SWN +
Sbjct: 301  ASKMSGLKWIKEVLLGKRTNHSFRMIVVGDPKLRLSEIGIPCHIAEELLISEHLNSWNWE 360

Query: 411  KLSTSCYLHLVEKGEIFVRREGRLVRVRNVLELNMGDTIYRPLADGDVVLVNRPPSIHQH 470
            K++  C L L+EKG+ +VRR+G L  VR + +   GD IYRPL DGD+VL+NRPPSIHQH
Sbjct: 361  KVTNGCNLRLLEKGQTYVRRKGTLAPVRRMNDFQAGDIIYRPLTDGDIVLINRPPSIHQH 420

Query: 471  SLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVSLDRQLI 530
            S+IALSVK+LP++SV+S+NPLCCSPFRGDFDGDCLHGY+PQS+++RVE+ ELV+L+RQLI
Sbjct: 421  SVIALSVKVLPLNSVVSINPLCCSPFRGDFDGDCLHGYIPQSVDSRVELSELVALNRQLI 480

Query: 531  NGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQLQMLALHQLLPPAILKAPLLRNCA 590
            N QSGRNLLSLS DSL+AAHL+MEDGV LNLFQMQQL+M   +QL  PAI+KAPLL    
Sbjct: 481  NRQSGRNLLSLSQDSLSAAHLVMEDGVLLNLFQMQQLEMFCPYQLQSPAIIKAPLLDTQV 540

Query: 591  WTGKQLFSTLLPPDFDYSSPSHCVLIENGELI-SSEGSYWLRDSGRNLFQALIEHCEGKT 650
            WTGKQLFS LLPP F+Y  P + V I +GELI SS+GS WLRD   NLF +L++ C+GK 
Sbjct: 541  WTGKQLFSMLLPPGFNYVFPLNGVRISDGELISSSDGSAWLRDIDGNLFSSLVKDCQGKA 600

Query: 651  LDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKNMMDDIFCGLQEAEETCNLKQLM 710
            LD+L+ AQ VLCEWLSMRGLSVSLSD+YLS DS S KNM+D++FCGL  AE+TC+ KQL+
Sbjct: 601  LDFLYAAQEVLCEWLSMRGLSVSLSDIYLSSDSISRKNMIDEVFCGLLVAEQTCHFKQLL 660

Query: 711  VDAHKDILTEDDEDNQHVLSIAVDRLSYEKQKSAALNQASVDAFKKVFRDIQNLVYKYSG 770
            VD+ ++ L    E+NQ+ +   V  L YE+Q SAAL Q+SV AFK+ FRDIQNLVY+Y+ 
Sbjct: 661  VDSSQNFLIGSGENNQNGVVPDVQSLWYERQGSAALCQSSVCAFKQKFRDIQNLVYQYAN 720

Query: 771  KDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFSLPHKLTCSAWNSQKMPRYTQK 830
            KDNSLL M KAGSKGNLLKLVQ  +CLGLQHSLV LSF +PH+L+C+AWN QK+P   Q 
Sbjct: 721  KDNSLLAMLKAGSKGNLLKLVQQGLCLGLQHSLVPLSFKIPHQLSCAAWNKQKVPGLIQN 780

Query: 831  DGLPDRTSSFIPYAVVESSFLSGLNPFECFAHSVTNRDSSFSDNAEVPGTLTRKLTFLMR 890
            D   +   S+IPYAVVE+SFL GLNP ECF HSVT+RDSSFSDNA++PGTLTR+L F MR
Sbjct: 781  D-TSEYAESYIPYAVVENSFLMGLNPLECFVHSVTSRDSSFSDNADLPGTLTRRLMFFMR 840

Query: 891  DIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELDSENNNRDHDIGGHPVGSLAACAMS 950
            D+Y AYDGTVRNAYGNQLVQFSY+I+  ++ S+ ++ +     +D+GG PVGS++ACA+S
Sbjct: 841  DLYIAYDGTVRNAYGNQLVQFSYNIEHTSTPSDGINED--TCAYDMGGQPVGSISACAIS 900

Query: 951  EAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFSLFLSEKLSKRSYGFEYGALG 1010
            EAAYSALDQPISLLE SPLLNLKRVLECG ++++  +T SLFLS+KL KR +GFEYGAL 
Sbjct: 901  EAAYSALDQPISLLEPSPLLNLKRVLECGLRKSTADRTVSLFLSKKLEKRKHGFEYGALE 960

Query: 1011 VKNHLERVMFKDIVSNVMIIFSPQPSRKKHFSPWVCHFHVCKSL 1032
            VKNHLE+++F DIVS VMI+FSPQ   K HFSPWVCHFHVC+ +
Sbjct: 961  VKNHLEKLLFSDIVSTVMIVFSPQNGSKTHFSPWVCHFHVCEEI 1001

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
NRPD1_ARATH4.6e-29252.25DNA-directed RNA polymerase IV subunit 1 OS=Arabidopsis thaliana GN=NRPD1 PE=1 S... [more]
NRPE1_ARATH2.6e-6129.60DNA-directed RNA polymerase V subunit 1 OS=Arabidopsis thaliana GN=NRPE1 PE=1 SV... [more]
RPB1_DICDI1.2e-4727.76DNA-directed RNA polymerase II subunit rpb1 OS=Dictyostelium discoideum GN=polr2... [more]
RPB1_SCHPO4.7e-4727.82DNA-directed RNA polymerase II subunit rpb1 OS=Schizosaccharomyces pombe (strain... [more]
RPB1A_TRYBB1.1e-4627.26DNA-directed RNA polymerase II subunit RPB1-A OS=Trypanosoma brucei brucei GN=TR... [more]
Match NameE-valueIdentityDescription
A0A0A0L2L4_CUCSA0.0e+0093.78DNA-directed RNA polymerase subunit OS=Cucumis sativus GN=Csa_3G039340 PE=3 SV=1[more]
A0A0A0L2L4_CUCSA1.8e-14688.70DNA-directed RNA polymerase subunit OS=Cucumis sativus GN=Csa_3G039340 PE=3 SV=1[more]
F6HUI3_VITVI1.2e-9459.67DNA-directed RNA polymerase subunit OS=Vitis vinifera GN=VIT_02s0025g04530 PE=3 ... [more]
B9RC12_RICCO4.6e-3370.87DNA-directed RNA polymerase subunit OS=Ricinus communis GN=RCOM_1683300 PE=3 SV=... [more]
A5BZZ3_VITVI4.7e-2565.88DNA-directed RNA polymerase subunit OS=Vitis vinifera GN=VITISV_011232 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT1G63020.12.6e-29352.25 nuclear RNA polymerase D1A[more]
AT2G40030.11.5e-6229.60 nuclear RNA polymerase D1B[more]
AT4G35800.18.9e-4426.85 RNA polymerase II large subunit[more]
AT5G60040.27.1e-2534.78 nuclear RNA polymerase C1[more]
AT3G57660.11.1e-2028.26 nuclear RNA polymerase A1[more]
Match NameE-valueIdentityDescription
gi|778675679|ref|XP_011650451.1|0.0e+0093.78PREDICTED: DNA-directed RNA polymerase IV subunit 1 isoform X3 [Cucumis sativus][more]
gi|778675679|ref|XP_011650451.1|2.6e-14688.70PREDICTED: DNA-directed RNA polymerase IV subunit 1 isoform X3 [Cucumis sativus][more]
gi|778675668|ref|XP_011650447.1|2.6e-14688.70PREDICTED: DNA-directed RNA polymerase IV subunit 1 isoform X1 [Cucumis sativus][more]
gi|778675676|ref|XP_011650450.1|2.6e-14688.70PREDICTED: DNA-directed RNA polymerase IV subunit 1 isoform X2 [Cucumis sativus][more]
gi|659096195|ref|XP_008448971.1|2.4e-14789.70PREDICTED: LOW QUALITY PROTEIN: DNA-directed RNA polymerase IV subunit 1 [Cucumi... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR000722RNA_pol_asu
IPR006592RNA_pol_N
IPR007066RNA_pol_Rpb1_3
IPR007080RNA_pol_Rpb1_1
IPR007083RNA_pol_Rpb1_4
IPR021602Protein of unknown function DUF3223
Vocabulary: Biological Process
TermDefinition
GO:0006351transcription, DNA-templated
Vocabulary: Molecular Function
TermDefinition
GO:0003677DNA binding
GO:0003899DNA-directed RNA polymerase activity
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009308 amine metabolic process
biological_process GO:0006351 transcription, DNA-templated
biological_process GO:0010495 long-distance posttranscriptional gene silencing
biological_process GO:0006346 methylation-dependent chromatin silencing
biological_process GO:0030422 production of siRNA involved in RNA interference
biological_process GO:0006144 purine nucleobase metabolic process
biological_process GO:0006206 pyrimidine nucleobase metabolic process
biological_process GO:0007267 cell-cell signaling
biological_process GO:0016246 RNA interference
cellular_component GO:0000418 DNA-directed RNA polymerase IV complex
cellular_component GO:0005730 nucleolus
cellular_component GO:0005575 cellular_component
cellular_component GO:0005654 nucleoplasm
molecular_function GO:0016779 nucleotidyltransferase activity
molecular_function GO:0005507 copper ion binding
molecular_function GO:0048038 quinone binding
molecular_function GO:0003899 DNA-directed RNA polymerase activity
molecular_function GO:0003677 DNA binding
molecular_function GO:0016740 transferase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
ClCG00G000380.1ClCG00G000380.1mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000722RNA polymerase, alpha subunitPFAMPF00623RNA_pol_Rpb1_2coord: 344..502
score: 1.1
IPR006592RNA polymerase, N-terminalSMARTSM00663rpolaneu7coord: 276..531
score: 1.9
IPR007066RNA polymerase Rpb1, domain 3PFAMPF04983RNA_pol_Rpb1_3coord: 507..654
score: 3.4
IPR007080RNA polymerase Rpb1, domain 1PFAMPF04997RNA_pol_Rpb1_1coord: 93..289
score: 3.8
IPR007083RNA polymerase Rpb1, domain 4PFAMPF05000RNA_pol_Rpb1_4coord: 718..781
score: 6.9
IPR021602Protein of unknown function DUF3223PFAMPF11523DUF3223coord: 1216..1308
score: 2.2
NoneNo IPR availableGENE3DG3DSA:2.40.40.20coord: 351..376
score: 2.9E-25coord: 433..505
score: 2.9
NoneNo IPR availableGENE3DG3DSA:3.10.450.40coord: 1278..1313
score: 1.6E-10coord: 1196..1259
score: 2.3
NoneNo IPR availablePANTHERPTHR19376DNA-DIRECTED RNA POLYMERASEcoord: 815..1131
score: 0.0coord: 53..798
score: 0.0coord: 1217..1294
score:
NoneNo IPR availablePANTHERPTHR19376:SF40DNA-DIRECTED RNA POLYMERASE IV SUBUNIT 1coord: 53..798
score: 0.0coord: 815..1131
score: 0.0coord: 1217..1294
score:
NoneNo IPR availableunknownSSF64484beta and beta-prime subunits of DNA dependent RNA-polymerasecoord: 66..686
score: 4.35E-142coord: 723..931
score: 4.35E