Cp4.1LG09g02310 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG09g02310
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: DNA-directed RNA polymerase
Location: Cp4.1LG09 : 1371774 .. 1385477 (+)
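The location field above can be turned into a feature length with a few lines of Python. This is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and the length assumes the coordinates are 1-based and inclusive (the usual GFF-style convention, which this page does not state explicitly).

```python
import re

# Matches e.g. "Cp4.1LG09 : 1371774 .. 1385477 (+)", with or without a
# leading "Location" label (hypothetical helper, not a database API).
LOCATION_RE = re.compile(
    r"(?:Location:?\s*)?(\S+?)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)"
)

def parse_location(text):
    """Return sequence id, start, end, strand and length of a location field."""
    match = LOCATION_RE.search(text)
    if match is None:
        raise ValueError("unrecognised location field: %r" % text)
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "seqid": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }

print(parse_location("Cp4.1LG09 : 1371774 .. 1385477 (+)"))
```

Under the inclusive-coordinate assumption the gene spans 13704 bp on the forward strand of Cp4.1LG09.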
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGTGAAGGAATAAATGTTGGAGAAAACGTAGGTTTTAGGAGGGTTTAATGGCCGCCGCTGTGGTTCTCTTCTCCGTTCTCCCACCATTTTGAGATCTCTACCTTCATCTCTACCTTCATCTCTACCTTCATCTCTCATCCTTCCCGCGTTTGAATTCTTTGTATTCAACTTTGTGTTCTTAGTTTCTCACTGCCTGAGAACATTTCGGAGAAACTGCATATTCCGACTATCAGTTCTTCGCATTTTCCATCTCCTTCACCTGCGTTTTGAATTGCTTGAACTTGTTGTTGTGGGAATTCGGAATCCCTACTACCCGAGAATGTGGAGAAACATTGTCAAAAGAGCTGCTTCAAGGAGGGATACGTTCTTCTCCAGAAAAGTTATTGAAGACTCTATCTTTGTTGAGGAAATTAGATCCTCCAGGACACTCGGAAGCTTGAAATCATGTTGCCTGTTAGAGGGCTCTCGTGGAATCAGCTTTTTCCATTCCCGTGAAGTTGGGTTTGCGAAATCCAATTTCGCACATTCAAGCGATCCCGTAGCTTTTTATGGGGTTCTAAACCATGCCAATGGCTACGCAACTGCTGCGGAGGCTGCCATTTCCGATGAGGACTTATCAGGGTCGGAAGAAATTCAGGAATTGATGGAGGAACTAAGCAAACAAGATAAGGTGGAGTCTCACTTTAAGCAGCCTAAGAAAATGGTGGATGGAATGAGGGTAGGTAAGTACAATATTCTACGGAAGAGACAGATAAAGATGGAGACGGAGGCTTGGGAAGAGGCTGCCAGAGAGTATCAAGAGCTATTAACGGATATGTGTGAGCAGAAGTTGGCGCCCAATTTACCTTACATGAAGTCTTTATTCCTTGGTTGGTTTGAACCCTTGCGTGATGCAATTGCTGCAGACCAGGAGTTTTGCAAGAAGAAGAGTAGAGTGTCTCATTTTGCTTATTTTGATCTTCTGCCGGCGGATATGATGGCTGTGATTACAATGCATAAGTTAATGGGGTTGTTGATGACTAACAGTGGAGGAAACAGTAGTGTCAGGGTAGTCCAAGCTGCTTGTCAGATAGGAGAAGCCATTGAACACGAGGTTGGTTAGTTTCAAATTCTTTTACTTTATTGTCTAAAATTTTGCATTTGTTTTAGATTTCCCATTGATATGAAGTTCGCTAGGTATCAAAGTAAAAAAACCAGTCATGGTTTTCTTTCCAATATTGAGAGCTATCGATCATATAGCAAGGTTTCTTACCTCATTCTGTCAGTTTTTCTTTCTCTTATTTAGATATGTGATATATTTGTTCTTGTTTGGCAATGAACATCTGAACTCTGATTGACATTTTTAACAAAAGTGTTGAAGTAATTGAAATATTTGTTTGGACAATTTTCTTTTCTACGAAATAGGTTAGAATACACAAGTTCTTTGAGAATATGAAGAAGAAGAAGAGTAATGAAAAAACTACTGAAGGAGACACTGAACCTGTGGTTGAAGACCAGGAGAAAGTGGCCAAAGAACAAGACAAATTGAGGAAAAAAGTCACCAACCTGATGAAAAAACAGAAGTTGCGGCAAGTTAGGATGATAGTCAAGGAACACGACCATTTAAAGCCTTGGGGCCAGGATGCGCATGTGAAGGTTTCTGTTTTTAATTTTTTGCCCATAGTCTCAATGTGAAGTAATGCGTAAATTCTGGTGTACTTGCTGTGAGCTACTGTAGATGTTGATGTACACATATTAACAGCGAAAAATTGTTTAATTCTGGTTTCTATTGCCTTCCAGGTTGGTTGTCGTTTGATTCAGCTATTGATTGAAACAGCGTATATTCAACCTCCAATGGATCAGTTAGGAGGGGGTCCTCCTGATATTCGTCCCGCATTTGTCCATACTCTTAAAACCATCACAAAAGAAGCACAGTTAATGAGCCTTCTGCTCTCGTTGTTTATTTTTCATATTTATCTCACTTCAATTTTTCTACATATTCTTTCTATGTCTGATCT
GTATTACTACTACTACTAATTATTACCATTTTGTTTCTTCCCGCAGAAAGACTAGCAGAAGATATGGTGTTATTGAATGCGATCCACTTGTTCGCAGAGGCCTGGAGAAAACTGTAAGTAAAATCATACATTGTGGGCTGTTCCGTACTCGGTGTGAATTTGGTTGCTTTTCTATAAATAGTTTGTAGACATTATTTTTGCTGTATGATTATGCAACTCTGTAGTTCTGCAAGGTTCTCTCTCTCTCAAAATTACTCAGTTTATGTCTTGGATTAGTGATTGTCTTCAGTTTTGTCTCAGGCAAGACACATGGTCATACCATATATGCCTATGCTGGTGCCTCCCCTTAATTGGACAGGGTAGGTTACTCTATATTTTAATCTATTTCAATAAATTCTAATAGTTTCCATAATATTCAGAGAATGAAAACCTAGTTTCATGTAACGATATTTACTGGTTCATTTGTTGAAGAGTATATATATCTGAATGATATGTATTTTCTGAAGCTTAAGTATCATTTGTGCTGTATGCTTGTTATGTTTGGTGGTTGATGTCTGCTATTCGTNTTGCTTGAACTTGTTGTTGTGGGAATTCGGAATCCCTACTACCAGAGAATGTGGAGAAACATTGTCAAAAGAGCTGCTTCAAGGAGGGATACGTTCTTCTCCAGAAAAGTTATTGAAGACTCTATCTCTGTTGAGGAAATTAGATCCTCCAGGACACTTGGAAGCTTGAAATCATGTTGCCTGTTAGAGGGCTCTCGTGGAATCAGCTTTTTCCATTCCCGTGAAGTTGGGTTTGCGAAATCCAATTTCGCACATTCAAGCGAGCCCGTAGCTTTTTATGGGGTTCTAAACCATGCCAAAGGCTACGCAACTGCTGCGGAGGCTGCGATTTCCGAGGAGGACTTATCAGGGTCGGAAGAAATTCAGGAATTGATGGAGGAACTAAGCAAACAAGATAAGGTGGAGTCTCACTTTAAGCAGCCTAAGAAAATGGTGGATGGAATGAGGGTAGGTAAGTACAATATTCTACGGAAGAGACAGATAAAGATGGAGACGGAGGCTTGGGAAGAGGCTGCCAGAGAGTATCAAGAGCTATTAACGGATATGTGTGAGCAGAAGTTGGCGCCCAATTTACCTTACATAAAGTCTTTATTCCTTGGTTGGTTTGAACCCTTGCGTGATGCAATTGCTGCAGACCAGGAGTTTTGCAAGAAGAAGAGNATATACCATGTTGAAAAATTAATATCACTGGACTCTAGAGACGATCATAAGATATTAGGACTCTGGTACTTCCTCTTTGTCGAAGAAGATTTTTGCTTTGACTATATGTTTCATGGGCTTTGTCTTTGAAGTCGCGATACCATCTTNTAGTTTCATGTAACGATATTTACTGGTTCATTTGTTGAAGAGAATATATATCTGAATGATGCGTATTCTCTGAAGCTTAAGTTTCATTTGTGCTGTATGCTTGTTATGTTTGGTGTTTGATGTCTGCTATTCATGTGTAGGTATGATAAAGGAGCACACTTATTCCTACCATCATATGTTATGCGAATACATGGGGCAAAGCAGCAACGTGAAGCAGTTAAAAGGGTTCCAAAGAAACAACTAGAGCCTGTTTTTGAGGTTTGAGAGTTCTGATGATTGTTTCCTAGTTTAATGATTTACATACACACCTTTGGTTTTGCCTGACCTCTTCCTCCATTCCCATATACCATGTTGAAAAATTAATATCACTGGACTCTAGAGACGATCATAAGATATTAGGACTCTGGTACTTCCTCTTTGTCGAAGAAGGTTTTTGATTTGACTATATGTTTCATGGGCTTTGTCTTTGAAGTCGCGATACCATCTTTAGAAGGAACTCCTAAGATTTTCCCATAGAAAAAATCATCTTGCAGTTTCTTTTTGAACTTCAGTTACGTAAGAATTCTTAATATATGATAATTTTTGCATTTTTAGGCACTTGATACGCTTGGAAGAACCAAGTGGAGG
GTGAACAAAAGAGTTCTTAGCATTATTGACAGGATATGGGCCAGTGGCGGTCGTCTTGCTGATTTGGTTGACCGTGAAGATGTATGTTACAGAAGAAGAATGCTAGTTGTTCCTATTTGCATGTTTTCCTATCTTATGAAATTTATATCTTTTTTTGTAACTTTAACTTTAACTTCAATTTGTACAGGCTNTACTGAAGGAGACACTGAACCTGTGGTTGAAGACCAGGAGAAAGTGGCCAAAGAACAAGACAAATTGAGGAAAAAAGTCACCAACCTGATGAAAAAACAGAAGTTGCGGCAAGTTAGGATGATAGTCAAGGAACACGACCATTTAAAGCCTTGGGGCCAGGATGCGCATGTGAAGGTTTCTGTTTTTAATTTTTTGCCCATAGTCTCAATGTGAAGTAATGCGTAAATTCTGGTGTACTTGCTGTGAGCTACTGTAGATGTTGATGTACACATATTAACAGCGAAAAATTGTTTAATTCTGGTTTCTATTGCCTTCCAGGTTGGTTGTCGTTTGATTCAGCTATTGATTGAAACAGCGTATATTCAACCTCCAATGGATCAGTTAGGAGGGGGTCCTCCTGATATTCGTCCCGCATTTGTCCATACTCTTAAAACCATCACAAAAGAAGCACAGTTAATGAGCCTTCTGCTCTCTTGGTTTATTTTTCATATTTATCTCACTTCGATTTTTCTACATATTCTTTCTATGTCTGATCTGTATTACTACTACTACTAATTATTACCATTTTGTTTCTTCCCGCAGAAAGACTAGCAGAAGATATGGTGTTATTGAATGCGATCCACTTGTTCGCAGAGGCCTGGAGAAAACTGTAAGTAAAATCATACATTGTGGGCTGTTCCGTACTCGGTGTGAATTTGGTTGCTTTTCTATAAATAGTTTGTAGACATTATTTTTGCTGTATGATTATGCAACTCTGTAGTTCTGCAAGGTTCTCTCTCTCTCAAAATTACTCAGTTTATGTCTTGGATTAGTGATTGTCTTCAGTTTTGTCTCAGGCAAGACACATGGTCATACCATATATGCCTATGCTGGTGCCTCCCCTTAATTGGACAGGGTAGGTTACTCTATATTTTAATCTATTTCAATAAATTCTAATAGTTTCCATACTATTCAGAGATTGAAAACCTAGTTTCATGTAACGATATTTACTGGTTCATTTGTTGAAGAGAATATATATCTGAATGATGCGTATTCTCTGAAGCTTAAGTTTCATTTGTGCTGTATGCTTGTTATGTTTGGTGTTTGATGTCTGCTATTCATGTGTAGGTATGATAAAGGAGCACACTTATTCCTACCATCATATGTTATGCGAATACATGGGGCAAAGCAGCAACGTGAAGCAGTTAAAAGGGTTCCAAAGAAACAACTAGAGCCTGTTTTTGAGGTTTGAGAGTTCTGATGATTGTTTCCTAGTTTAATGATTTACATACACACCTTTGGTTTTGCCTGACCTCTTCCTCCATTCCCCTATACCATGTTGAAAAATTAATATCACTGGACTCTAGAGACGATCATAAGATATTAGGACTCTGGTACTTCCTCTTTGTCGAAGAAGGTTTTTGATTTGACTATATGTTTCATGGGCTTTGTCTTTGAAGTTGCGATACCATCTTAGAAGGAACTCCTAAGATTTTCCCATAGAAAAAATCATCTTGCAGTTTCTTTTTGAACTTCAGTTACTAAAGAATTCGTAATATATGATAATTTTTGCATTTTTAGGCACTTGATACTCTTGGAAGAACCAAGTGGAGGGTGAACAAAAGAGTCCTTAGCATTATTGACAGGATATGGACCAGTGGTGGTCGTCTTGCTNAGCTACTTCCATTTTTACTTGGCTATTTCTTATTCTGCGTGTTCGGGGATTGAAAATACCTCAATGCTTGCTTATTCGAAGCTAAAAGCACCTGAATTTAAATAATGCTTCTTGTACTANTTGACCGTGAAGATGTATGTTACAGAAGAAGAAT
GCTAGTTGGTCCTATTTGCATGTTTTCCTATCTTATGAAATTTATTTTTTTTGTAACTTTAACTTCAATTTTTACAGGTTCCTCTACCAGAGGAGCCAACTGTGGAAGATGAAGCAGAAATTCGAAAATGGAAGTGGAAACTCAAGGCTGCAAAAAAAGAGAATAGTGAGAGGCATTCACAGCGGTGTGACATTGAGCTTAAGCTTGCAGTAAGAACTTATTGAATGAATAGAATTTAAATTTGAATACTCTTGTCAAATACATTTGAATTTACTTTTATTGTGAGCAGGTGGCCAGGAAAATGAAAGAAGAGGAGGGCTTTTACTATCCTCACAACTTAGATTTTCGAGGTCGTGCATACCCAATGCATCCACATTTGAATCATCTTGGTTCTGATATGTGTCGAGGAATTCTAGAATTTGCGGAGGGACGGCCACTTGGCGAGTCAGGGTTACGCTGGTTGAAGATACATTTGGCAAATCTATACGCTGGTGGTGTGGACAAGTTATCTTACAAGGATCGAATATCATTTACTGAGAATCATTTGGATGAGATTTTTGATTCGGCAGACAGGCCTCTAGAAGGAAATCGTTGGTGGTTGGGCGCAGAGGATCCTTTTCAGTGCTTGGCAGTGTGTATTAATCTCTCAGAGGCTTTAAGAAGTCCATCGCCGGAAACAACTCTTTCCCATATGCCTGTACACCAGGTACTACTTACATTATTCTTGCAATTCGTCCCCAAATTATGAATGCAATTTTTGTTCTCTGCAACTTTATCCCTTTCGAAACCATAATCATATAGCTATTCTGCAGTATATATCTTTGTTTATCCTTATTTATTTCCCCATGATTTGGCAGGATGGTTCCTGCAATGGCTTGCAACACTATGCAGCTCTCGGGAGGGACAAGGTACTTTTAATTTCTCTAGTTAGATAAGATTAAGATACAAAGTCAGCTGAGCTACTTCCATTTTTACTTGGCTATTTCTTATTCTGCGTGTTCGGGGATTGAAAATACCTCAATGCTTGCTTATTCGAAGCTAAAAGCACCTGAATTTAAATAATGCTTCTTGTACTACACATACGAAGAACTTTAAAAATATTTCTTTTAACTATCTCTATTTTTAGGAACTCCTTAATTTTTACAAAAAAAGTCTTCTTTGAGATGACACGGTCTGTTTCTTAATTAGCATTTAATATTGTGTCATTTTTCTTCAGTTGGGAGCAGCGGCAGTTAACCTTGTAGCAGGAGATAAGCCCGCAGATGTGTACTCAGGAATTGCTGCCAGGTTACTGCATCTCTTGTTTGCATGTTTGATTGTAGGGAGGATTATTTAAGTGTTCCTCATTTTTTTTTTAATGTTTTTTTAAAATTGCTCTTTGGTGTTTTTTACATATGAAAGTTGAGTCTTAATAACCCGGCAGGCGTCAATGCCATGTTTGTTTGAATAAGCTGAAAAGTATTTTGAAGCCTGGTCATTATGATAAGGGAGCTAATATATGCGTTCCGCGAACTATAGAAAATGAAAGTATGCTCAATGCAAGTGATATTTTTTAGATCAAATTGAAGTGACTCTGCTCTCTGCATTCAACTTTGGAAGCTTCGTTTACCAACTAGTATGAAAGGAAAAATACCTATTTAGACAGTATTCTCCATGCTTTCGGTATAAAAGTTTGCAAATGACTTGTGTTTTAGAGCATATTTTTGGCGTTCGGAGGGGATCTAGTTTCTTTGTTTATTTGTATATTTTTCTGGGTGCAGAGTTCTTGACATAATACGAAGTGATGCAGAGAAAGATCCTGCAACTAATCCAAATGCGTTGCATGCTAGACTTTTAATCAATCAGGTCTGGTTACTAATAATTCTATGAGAATAATGTTTTTTAGAACTGTTTCTAATAGCTGCTTGTATGTTTTTCCTCTAATGGATTTATATTTGTGACTGCAATTAGGTGGATCGTAAACTGGTGAAACAAACAGTCATGACATCCGTGTATGGTGT
CACGTATATGGGTGCACGGGATCAGATTAAGAAAAGGTTGAAAGAACGAGCATCCATTGCTGATGATTCACAATTATTTGCAGCTTCTTGCTATGCAGCTAGAGTGAGTCAATAAACTGCCATTTGCTTGTGAGAAAAATGTAGCACATTTATGCCTATTTGCATGCATATTATAGTTTTGTTTCTTTTTTTTACACAACTTTTTCTTTTATATTTACAAGAAAAATGTGTTACTACAATTATTATTATTTTGAAATGAACGTTCTGCTGATTCGAAGACTTATCGTGCAGACTACCTTGACTGCCTTGGGGGAAATGTTTGAAGCAGCAAGAAGTATCATGAGCTGGCTTGGTGAATGTGCAAAGGTGCGGTCCTTAATTAGTGATATTGTCATTATTTTGGTTATTTTGATAAAGTTCGTAGTATCACTAGCTTGAAGTGGGAATTCTTCGATAGTTCCCCTCCTCTTGTAAATTGTGTTCCTTCCTAAAAAGAAAGGACTTTTCATTGATTGGTGAAACCTTCACGCTGAAGCCGTCATAAACATGCACCCACATGGGACTTACAGAGTCTTTACTAACCCACTTTTTTCAACATCATTTACATTACTTAGGCTCATTTGGTTTTCTCCAGCCATAAGTGAACTGGATAGTCTTCCCATCAGATCTTACCAGCTTCAAAAATAGATGTTTAGGAGAAAACTGCCTTAATGCATCGACCTTCCATCTTCACTCGATAAATCTCCCAACTCCTCAGTTTCCTATCCTACTAAATTTCTTCCAGTCAAGATTTTCCTCTATTTCGTTTCAATTTCTCACGTAGAGATTGAGAAATTTAACATCAATGGTTGGGTGACCAGCCATATTTCCTAGAACCTCACCCTTTACCCAAACCCACATCGAGCTTAATTTCGTCTTCGAACAAAGATCTTAATTTTGAAATACAACTACAAGGGTTGTGATCTCCTCCATATTTCTACAATTTTCAGACTCATGTTCCCTCGTCTTAGCCGTAGGTACTTCAAATAATTTGATTTTATAACATCTTGAGGAGTGGGTTATTTTAGCAATGTTACGTTTCTTATCTTAGTTAAGTATAGAAATCTGATTTTATGTGCATTTCAGCAAGCGTTCCTCTGCCTTTTTGTCACTCGACTAAGGTGTAGCTTTTCACACATTTAATTTTCAACAGGTAATAGCTTCAGAAAATCAGCCAGTTCGATGGACAACTCCTCTTGGACTGCCAGTGGTACAACCTTACCGGCAACTAGGAAGACATCTTGTAAAGTTCTAATAAATTTGTGAACTTTAAGCTAACAATGATTTGGACTTCTCCAACAAACTGAAACTGTAATTGACTTATTGCAGATCAAGACTTCCTTGCAAGTGTTGGCTCTACAACGAGAAACTGACAAGGTAAGGCCAACCAAATCAGGAATTATTGGGTAGACTCGATGTGTTTAAAACCTTTTGAAATTGAGACGCTTCAAGTTTATTAACTCTCTTGAGATGGGTCACTGTATGCAGTAGTGCAATTCATTGGTTGCATGATATGTTTACTAGCCGGATAATTCCTATTATTTTCTGTAGGTCATGGCTACGCGTCAGAAAACAGCTTTTCCTCCAAATTTTGTACACTCCCTTGATGGTTCTCATATGATGATGACTGCTGTCGCCTGCCGAAGGGCAGGCCTTAACTTTGCAGGTTCACTTCATTTTTCTTTTACTTCTCTATTGATTGGTTCCTTTATCTCAGTGAACTTATGTGAATTCTCGAGTGTTCAATCGTGGCCATATCTCTTCTTACCATGTTTGATTACAGGTGTCCATGATTCCTATTGGACCCATGCATGCGATGTTGACGTAATGAACAGGCTACTAAGGGAGAAATTTGTTGAGCTATACGAGGCTCCTATTCTGGAGAATGTAAGTATCTCGCCCCATGTGTACTTGATCGGCTAACTTATCCTGGTTATTATTGGCAAAATATATTCCTCTTG
AGACAAAATAGAGCACATTGACAGTATTATCCTATAGATACAATCTTTGGATCAATTAGAACAAAGATCTATTTGTCTTCTTCTGTCCTGGAATTTCTTTTAACTTATAAATTGAACGGTTTTTGCCTTTACAGTTACTGAAAAATTTCCAAAAGTCCTTCCCCACTTTAAAGTTTCCGCCCTTACCCGATCGGGGAGACTTCGATCTCAAGGACGTCCTGCAATCTTCCTATTTCTTCAATTAGTGCAGCCTAGATACTTCAGTCGGATGCTCTTTCTCATATGTGCTACACGTCTGTGGCTGACATACAGTCCTGTCTACGACTCTTGAAATCAGACTCTGGAGGCGAGTTTGCTACTTCGAGGTCAGTGCTGTCCAGCTACTTGTACATAACACATTCTCCAACCATTGGTAGTCTAAATCTTGGAACTTGGGAGTTCAATCTGCCATTCACGTATTCAACTTTTACCTTACAAAATAAAGATAGGTTATATTTGGAAGCTACGGAATTTCTACCTTCATTGTACTCTGATCCGGACTTAAGTTCTGACACTGCAATCAATTAGAGGTTCAGCGATGCTGAGAGACATGGCTTTCAAAGTGTGATCAATGAAGTAAGATCTCGGTGTAATCTCCCAAAGATCAGTTGGAGTGTTGCCATTTAGTTTTCATAGGATCATGGCTTAACTTCATGACAAAAGGAAGAATGGGCTGCTGAGAAGAAAGTCGTGGTGCATACTCATACCATAGCGAGGTATTGCAGGTTGTTTCATTCCCAAAAATAAGGAAAAATGAAGAAGATTGTTGTTCTAGAAATACCAGTAAATTTTAGTCTATGCCGATTGTATTCAATTTTTTAAACAAGGCTGACATTGGATGACGACTCTATATAGTACTGGGGATGGCTGGCATTCCCTTGNGCTCCTATTCTGGAGAATGTAAGTATCTCGCCCCATGTGTACTTGATCGGCTAACTTATCCTGGTTATTATTGGCAAAATATATTCCTCTTGAGACAAAATAGAGCACATTGACAGTATTATCCTATAGATACAATCTTTGGATCAATTAGAACAAAGATCTATTTGTCTTCTTCTGTCCTGGAATTTCTTTTAACTTATAAATTGAACGGTTTTTGCCTTTACAGTTACTGAAAAATTTCCAAAAGTCCTTCCCCACTTTAAAGTTTCCGCCCTTACCCGATCGGGGAGACTTCGATCTCAAGGACGTCCTGCAATCTTCCTATTTCTTCAATTAGTGCAGCCTAGATACTTCAGTCGGATGCTCTTTCTCATATGTGCTACACGTCTGTGGCTGACATACAGTCCTGTCTACGACTCTTGAAATCAGACTCTGGAGGCGAGTTTGCTACTTCGAGGTCAGTGCTGTCCAGCTACTTGTACATAACACATTCTCCAACCATTGGTAGTCTAAATCTTGGAACTTGGGAGTTCAATCTGCCATTCACGTATTCAACTTTTACCTTACAAAATAAAGATAGGTTATATTTGGAAGCTACGGAATTTCTACCTTCATTGTACTCTGATCCGGACTTAAGTTCTGACACTGCAATCAATTAGAGGTTCAGCGATGCTGAGAGACATGGCTTTCAAAGTGTGATCAATGAAGTAAGATCTCGGTGTAATCTCCCAAAGATCAGTTGGAGTGTTGCCATTTAGTTTTCATAGGATCATGGCTTAACTTCATGACAAAAGGAAGAATGGGCTGCTGAGAAGAAAGTCGTGGTGCATACTCATACCATAGCGAGGTATTGCAGGTTGTTTCATTCCCAAAAATAAGGAAAAATGAAGAAGATTGTTGTTCTAGAAATACCAGTAAATTTTAGTCTATGCCGATTGTATTCAATTTTTTAAACAAGGCTGACATTGGATGACGACTCTATATAGTACTGGGGATGGCTGGCATTCCCTTGCTCGCAGGTGTACAGAATTTACAGATGAATTTGTTAAATCGATTATGGTTGTTTGGAATTATTCTAC
TTGCGGATGAACTTAACTTCCTTTTGGAGCTCTTTTTCTTAGTTAGAAACAGAAGGCGATGCAGAAACGAAGTGCCTTCTGAATCCCCACCTGCTACTAAAAATTTCCTTTGTATTCTGAAGCTTTCATGGCCGACTGTCTGATTCAGCTGCTACTATTATACTTCCCTGGTATGCTCATCTCTTATACTTCAGGACTGATTGTTTATGTTCGGTTGAAAATAGAATGTAAAACCGTAAGCTTCCAAGTAGTGGGGGCAACTAGGACGAGTACTCATAAGCAGTGACTTGGTTAGGAGTTCTCTCTCAGCTTAGTTCTTTTTGTTTCGTTTGATAGATGGTCGACAAGTATGATGGTGTCTTAACGATGAATATGCACGTATTTTGTCTTTTATTGGCGAATTTGAAAACCTCAGTAATTGACTTGGGTTACAAGTGTGGTTTTGGTTGTAACGACCCCTTCCTGTTGTGTAACTGGTTATTCCTTACCTCGAGACCAAATTAAGTATTAGGTAGTCAAAACTAACAGGTATACTGTGAGACAATCTATCTTGACGTGTCTTTTTTTTACCGTTTATACAGGAGCAAAAATTCATTGGACAAAAGAAAGAGACATGGAAGTGGCTAAATGCTTGTTTGAATTAAGTTTAAAATAATAATATTATGATAAGTCGATTTTTTATTTAATTTTCTAATAATCTTTCCAATAATAGCTCTTAAAATTGTTGGTAGAATAAGCCCTTAAAATTGTAGTTTGTTAGAAATAATCTTAAGTTTTTTTCTTTCTTTTTAAAAAAATGACGTGTTCTATTAGTGTGACGTGTTTTCTATTAGTGTGACGTGTCGATGCTTGTTTATTTGGCTGAATTGCTTTCTTATTGGCCACGCAAGACGTCTTCTTTGTGTAAAAATGAAAATCCTCCCCGTCAGGAAAAATTTATAAATTAAGTCAAATTCTGACGTGTTCTGTTGCTACTTAGGTCATGCTACTTATTTTTCTTGAACGATTTTCGAGAATGAAAACTATTCCGATAAACCAGCATGATTATCAATAATATTCTCGATAGGTCACATAACATAGAATTGTTTCAAGTCAAACACGCTCTGTTTTAATATTTCTATAATTAAATTATACAAAAAGAAAAATGTATTTTATTGATATAAATAATAAACTTGTCCCGACCATGTGTTACATTTGAAGGTCTACTCTTATTTGGATGTCGTCTTGACCTATTCGTGTAATGGATATATCAATATTTTGATGAAACGTGGGTCTTCCATTCTTGGAAGCTAAGTAAAGTCACATGTTGTATTTCTTGTTCTCAAATTAAATAACATTATATATTTGACGAAAATATTAAAGAATCAAATCTTCACTTTTACATATATTCGACATAACAAAACCCGATGAAACTAAGGTCTTCCATGTTTGGAAACTAAGTCAGTCACAATTTCAAGCAGTCCAAACCAAGAAGAACACGACACCCGAACGTGTCTTGCACGTTCCAGAGCTACATAAAATACACGTAACGTTCAACACGTGGGATGTAACCGAGTAGAGGCTTAATTGAACTTGACCTTGTGAGCAGAAGAGAAATGCTTGGCTGCAGCAACCATGGCGGCCTCTGGTCCAATTGGAGGAGGAGGAGGTGCAGCCATGGCTGCCTCTGGGCTTCCGCCATTAAAGCAGCCAAGCAAGCAATAA

mRNA sequence

ATGGTTTCTCACTGCCTGAGAACATTTCGGAGAAACTGCATATTCCGACTATCAGTTCTTCGCATTTTCCATCTCCTTCACCTGCGTTTTGAATTGCTTGAACTTGTTGTTGTGGGAATTCGGAATCCCTACTACCCGAGAATGTGGAGAAACATTGTCAAAAGAGCTGCTTCAAGGAGGGATACGTTCTTCTCCAGAAAAGTTATTGAAGACTCTATCTTTGTTGAGGAAATTAGATCCTCCAGGACACTCGGAAGCTTGAAATCATGTTGCCTGTTAGAGGGCTCTCGTGGAATCAGCTTTTTCCATTCCCGTGAAGTTGGGTTTGCGAAATCCAATTTCGCACATTCAAGCGATCCCGTAGCTTTTTATGGGGTTCTAAACCATGCCAATGGCTACGCAACTGCTGCGGAGGCTGCCATTTCCGATGAGGACTTATCAGGGTCGGAAGAAATTCAGGAATTGATGGAGGAACTAAGCAAACAAGATAAGGTGGAGTCTCACTTTAAGCAGCCTAAGAAAATGGTGGATGGAATGAGGGTAGGTAAGTACAATATTCTACGGAAGAGACAGATAAAGATGGAGACGGAGGCTTGGGAAGAGGCTGCCAGAGAGTATCAAGAGCTATTAACGGATATGTGTGAGCAGAAGTTGGCGCCCAATTTACCTTACATGAAGTCTTTATTCCTTGGTTGGTTTGAACCCTTGCGTGATGCAATTGCTGCAGACCAGGAGTTTTGCAAGAAGAAGAGTAGAGTGTCTCATTTTGCTTATTTTGATCTTCTGCCGGCGGATATGATGGCTGTGATTACAATGCATAAGTTAATGGGGTTGTTGATGACTAACAGTGGAGGAAACAGTAGTGTCAGGGTAGTCCAAGCTGCTTGTCAGATAGGAGAAGCCATTGAACACGAGGTTAGAATACACAAGTTCTTTGAGAATATGAAGAAGAAGAAGAGTAATGAAAAAACTACTGAAGGAGACACTGAACCTGTGGTTGAAGACCAGGAGAAAGTGGCCAAAGAACAAGACAAATTGAGGAAAAAAGTCACCAACCTGATGAAAAAACAGAAGTTGCGGCAAGTTAGGATGATAGTCAAGGAACACGACCATTTAAAGCCTTGGGGCCAGGATGCGCATGTGAAGGTTGGTTGTCGTTTGATTCAGCTATTGATTGAAACAGCGTATATTCAACCTCCAATGGATCAGTTAGGAGGGGGTCCTCCTGATATTCGTCCCGCATTTGTCCATACTCTTAAAACCATCACAAAAGAAGCACAAAAGACTAGCAGAAGATATGGTGTTATTGAATGCGATCCACTTGTTCGCAGAGGCCTGGAGAAAACTAGAATGTGGAGAAACATTGTCAAAAGAGCTGCTTCAAGGAGGGATACGTTCTTCTCCAGAAAAGTTATTGAAGACTCTATCTCTGTTGAGGAAATTAGATCCTCCAGGACACTTGGAAGCTTGAAATCATGTTGCCTGTTAGAGGGCTCTCGTGGAATCAGCTTTTTCCATTCCCGTGAAGTTGGGTTTGCGAAATCCAATTTCGCACATTCAAGCGAGCCCGTAGCTTTTTATGGGGTTCTAAACCATGCCAAAGGCTACGCAACTGCTGCGGAGGCTGCGATTTCCGAGGAGGACTTATCAGGGTCGGAAGAAATTCAGGAATTGATGGAGGAACTAAGCAAACAAGATAAGGTGGAGTCTCACTTTAAGCAGCCTAAGAAAATGGTGGATGGAATGAGGGTAGGTAAGTACAATATTCTACGGAAGAGACAGATAAAGATGGAGACGGAGGCTTGGGAAGAGGCTGCCAGAGAGTATCAAGAGCTATTAACGGATATGTGTGAGCAGAAGTTGGCGCCCAATTTACCTTACATAAAGTCTTTATTCCTTGGTTGGTTTGAACCCTTGCGTGATGCAATTGCTGCAGACCAGGAGTATGATAAAGGAGCACACTTATTCCTACCATCATATGTTATGCGAATACATGGGGC
AAAGCAGCAACGTGAAGCAGTTAAAAGGGTTCCAAAGAAACAACTAGAGCCTGTTTTTGAGGCACTTGATACGCTTGGAAGAACCAAGTGGAGGGTGAACAAAAGAGTTCTTAGCATTATTGACAGGATATGGGCCAGTGGCGGTCGTCTTGCTGATTTGGTTGACCGCTNTACTGAAGGAGACACTGAACCTGTGGTTGAAGACCAGGAGAAAGTGGCCAAAGAACAAGACAAATTGAGGAAAAAAGTCACCAACCTGATGAAAAAACAGAAGTTGCGGCAAGTTAGGATGATAGTCAAGGAACACGACCATTTAAAGCCTTGGGGCCAGGATGCGCATGTGAAGGTTGGTTGTCGTTTGATTCAGCTATTGATTGAAACAGCGTATATTCAACCTCCAATGGATCAGTTAGGAGGGGGTCCTCCTGATATTCGTCCCGCATTTGTCCATACTCTTAAAACCATCACAAAAGAAGCACAAAAGACTAGCAGAAGATATGGTGTTATTGAATGCGATCCACTTGTTCGCAGAGGCCTGGAGAAAACTGCAAGACACATGGTCATACCATATATGCCTATGCTGGTGCCTCCCCTTAATTGGACAGGGTATGATAAAGGAGCACACTTATTCCTACCATCATATGTTATGCGAATACATGGGGCAAAGCAGCAACGTGAAGCAGTTAAAAGGGTTCCAAAGAAACAACTAGAGCCTGTTTTTGAGGCACTTGATACTCTTGGAAGAACCAAGTGGAGGGTTCCTCTACCAGAGGAGCCAACTGTGGAAGATGAAGCAGAAATTCGAAAATGGAAGTGGAAACTCAAGGCTGCAAAAAAAGAGAATAGTGAGAGGCATTCACAGCGGTGTGACATTGAGCTTAAGCTTGCAGTGGCCAGGAAAATGAAAGAAGAGGAGGGCTTTTACTATCCTCACAACTTAGATTTTCGAGGTCGTGCATACCCAATGCATCCACATTTGAATCATCTTGGTTCTGATATGTGTCGAGGAATTCTAGAATTTGCGGAGGGACGGCCACTTGGCGAGTCAGGGTTACGCTGGTTGAAGATACATTTGGCAAATCTATACGCTGGTGGTGTGGACAAGTTATCTTACAAGGATCGAATATCATTTACTGAGAATCATTTGGATGAGATTTTTGATTCGGCAGACAGGCCTCTAGAAGGAAATCGTTGGTGGTTGGGCGCAGAGGATCCTTTTCAGTGCTTGGCAGTGTGTATTAATCTCTCAGAGGCTTTAAGAAGTCCATCGCCGGAAACAACTCTTTCCCATATGCCTGTACACCAGGATGGTTCCTGCAATGGCTTGCAACACTATGCAGCTCTCGGGAGGGACAAGTTGAGTCTTAATAACCCGGCAGGCGTCAATGCCATAGTTCTTGACATAATACGAAGTGATGCAGAGAAAGATCCTGCAACTAATCCAAATGCGTTGCATGCTAGACTTTTAATCAATCAGGTGGATCGTAAACTGGTGAAACAAACAGTCATGACATCCGTGTATGGTGTCACGTATATGGGTGCACGGGATCAGATTAAGAAAAGGTTGAAAGAACGAGCATCCATTGCTGATGATTCACAATTATTTGCAGCTTCTTGCTATGCAGCTAGAGTAATAGCTTCAGAAAATCAGCCAGTTCGATGGACAACTCCTCTTGGACTGCCAGTGGTACAACCTTACCGGCAACTAGGAAGACATCTTATCAAGACTTCCTTGCAAGTGTTGGCTCTACAACGAGAAACTGACAAGGTCATGGCTACGCGTCAGAAAACAGCTTTTCCTCCAAATTTTGTACACTCCCTTGATGGTTCTCATATGATGATGACTGCTGTCGCCTGCCGAAGGGCAGGCCTTAACTTTGCAGGTGTCCATGATTCCTATTGGACCCATGCATGCGATGTTGACGTAATGAACAGGCTACTAAGGGAGAAATTTGTTGAGCTATACGAGGCTCCTATTCTGGAGAATATACTTCAGTCGG
ATGCTCTTTCTCATATGTGCTACACGTCTGTGGCTGACATACAGTCCTGTCTACGACTCTTGAAATCAGACTCTGGAGGCGAGTTTGCTACTTCGAGAATCAAATCTTCACTTTTACATATATTCGACATAACAAAACCCGATGAAACTAAGGTCTTCCATGTTTGGAAACTAAGTCAGTCACAATTTCAAGCAGTCCAAACCAAGAAGAACACGACACCCGAACAAGAGAAATGCTTGGCTGCAGCAACCATGGCGGCCTCTGGTCCAATTGGAGGAGGAGGAGGTGCAGCCATGGCTGCCTCTGGGCTTCCGCCATTAAAGCAGCCAAGCAAGCAATAA

Coding sequence (CDS)

ATGGTTTCTCACTGCCTGAGAACATTTCGGAGAAACTGCATATTCCGACTATCAGTTCTTCGCATTTTCCATCTCCTTCACCTGCGTTTTGAATTGCTTGAACTTGTTGTTGTGGGAATTCGGAATCCCTACTACCCGAGAATGTGGAGAAACATTGTCAAAAGAGCTGCTTCAAGGAGGGATACGTTCTTCTCCAGAAAAGTTATTGAAGACTCTATCTTTGTTGAGGAAATTAGATCCTCCAGGACACTCGGAAGCTTGAAATCATGTTGCCTGTTAGAGGGCTCTCGTGGAATCAGCTTTTTCCATTCCCGTGAAGTTGGGTTTGCGAAATCCAATTTCGCACATTCAAGCGATCCCGTAGCTTTTTATGGGGTTCTAAACCATGCCAATGGCTACGCAACTGCTGCGGAGGCTGCCATTTCCGATGAGGACTTATCAGGGTCGGAAGAAATTCAGGAATTGATGGAGGAACTAAGCAAACAAGATAAGGTGGAGTCTCACTTTAAGCAGCCTAAGAAAATGGTGGATGGAATGAGGGTAGGTAAGTACAATATTCTACGGAAGAGACAGATAAAGATGGAGACGGAGGCTTGGGAAGAGGCTGCCAGAGAGTATCAAGAGCTATTAACGGATATGTGTGAGCAGAAGTTGGCGCCCAATTTACCTTACATGAAGTCTTTATTCCTTGGTTGGTTTGAACCCTTGCGTGATGCAATTGCTGCAGACCAGGAGTTTTGCAAGAAGAAGAGTAGAGTGTCTCATTTTGCTTATTTTGATCTTCTGCCGGCGGATATGATGGCTGTGATTACAATGCATAAGTTAATGGGGTTGTTGATGACTAACAGTGGAGGAAACAGTAGTGTCAGGGTAGTCCAAGCTGCTTGTCAGATAGGAGAAGCCATTGAACACGAGGTTAGAATACACAAGTTCTTTGAGAATATGAAGAAGAAGAAGAGTAATGAAAAAACTACTGAAGGAGACACTGAACCTGTGGTTGAAGACCAGGAGAAAGTGGCCAAAGAACAAGACAAATTGAGGAAAAAAGTCACCAACCTGATGAAAAAACAGAAGTTGCGGCAAGTTAGGATGATAGTCAAGGAACACGACCATTTAAAGCCTTGGGGCCAGGATGCGCATGTGAAGGTTGGTTGTCGTTTGATTCAGCTATTGATTGAAACAGCGTATATTCAACCTCCAATGGATCAGTTAGGAGGGGGTCCTCCTGATATTCGTCCCGCATTTGTCCATACTCTTAAAACCATCACAAAAGAAGCACAAAAGACTAGCAGAAGATATGGTGTTATTGAATGCGATCCACTTGTTCGCAGAGGCCTGGAGAAAACTAGAATGTGGAGAAACATTGTCAAAAGAGCTGCTTCAAGGAGGGATACGTTCTTCTCCAGAAAAGTTATTGAAGACTCTATCTCTGTTGAGGAAATTAGATCCTCCAGGACACTTGGAAGCTTGAAATCATGTTGCCTGTTAGAGGGCTCTCGTGGAATCAGCTTTTTCCATTCCCGTGAAGTTGGGTTTGCGAAATCCAATTTCGCACATTCAAGCGAGCCCGTAGCTTTTTATGGGGTTCTAAACCATGCCAAAGGCTACGCAACTGCTGCGGAGGCTGCGATTTCCGAGGAGGACTTATCAGGGTCGGAAGAAATTCAGGAATTGATGGAGGAACTAAGCAAACAAGATAAGGTGGAGTCTCACTTTAAGCAGCCTAAGAAAATGGTGGATGGAATGAGGGTAGGTAAGTACAATATTCTACGGAAGAGACAGATAAAGATGGAGACGGAGGCTTGGGAAGAGGCTGCCAGAGAGTATCAAGAGCTATTAACGGATATGTGTGAGCAGAAGTTGGCGCCCAATTTACCTTACATAAAGTCTTTATTCCTTGGTTGGTTTGAACCCTTGCGTGATGCAATTGCTGCAGACCAGGAGTATGATAAAGGAGCACACTTATTCCTACCATCATATGTTATGCGAATACATGGGGC
AAAGCAGCAACGTGAAGCAGTTAAAAGGGTTCCAAAGAAACAACTAGAGCCTGTTTTTGAGGCACTTGATACGCTTGGAAGAACCAAGTGGAGGGTGAACAAAAGAGTTCTTAGCATTATTGACAGGATATGGGCCAGTGGCGGTCGTCTTGCTGATTTGGTTGACCGCTNTACTGAAGGAGACACTGAACCTGTGGTTGAAGACCAGGAGAAAGTGGCCAAAGAACAAGACAAATTGAGGAAAAAAGTCACCAACCTGATGAAAAAACAGAAGTTGCGGCAAGTTAGGATGATAGTCAAGGAACACGACCATTTAAAGCCTTGGGGCCAGGATGCGCATGTGAAGGTTGGTTGTCGTTTGATTCAGCTATTGATTGAAACAGCGTATATTCAACCTCCAATGGATCAGTTAGGAGGGGGTCCTCCTGATATTCGTCCCGCATTTGTCCATACTCTTAAAACCATCACAAAAGAAGCACAAAAGACTAGCAGAAGATATGGTGTTATTGAATGCGATCCACTTGTTCGCAGAGGCCTGGAGAAAACTGCAAGACACATGGTCATACCATATATGCCTATGCTGGTGCCTCCCCTTAATTGGACAGGGTATGATAAAGGAGCACACTTATTCCTACCATCATATGTTATGCGAATACATGGGGCAAAGCAGCAACGTGAAGCAGTTAAAAGGGTTCCAAAGAAACAACTAGAGCCTGTTTTTGAGGCACTTGATACTCTTGGAAGAACCAAGTGGAGGGTTCCTCTACCAGAGGAGCCAACTGTGGAAGATGAAGCAGAAATTCGAAAATGGAAGTGGAAACTCAAGGCTGCAAAAAAAGAGAATAGTGAGAGGCATTCACAGCGGTGTGACATTGAGCTTAAGCTTGCAGTGGCCAGGAAAATGAAAGAAGAGGAGGGCTTTTACTATCCTCACAACTTAGATTTTCGAGGTCGTGCATACCCAATGCATCCACATTTGAATCATCTTGGTTCTGATATGTGTCGAGGAATTCTAGAATTTGCGGAGGGACGGCCACTTGGCGAGTCAGGGTTACGCTGGTTGAAGATACATTTGGCAAATCTATACGCTGGTGGTGTGGACAAGTTATCTTACAAGGATCGAATATCATTTACTGAGAATCATTTGGATGAGATTTTTGATTCGGCAGACAGGCCTCTAGAAGGAAATCGTTGGTGGTTGGGCGCAGAGGATCCTTTTCAGTGCTTGGCAGTGTGTATTAATCTCTCAGAGGCTTTAAGAAGTCCATCGCCGGAAACAACTCTTTCCCATATGCCTGTACACCAGGATGGTTCCTGCAATGGCTTGCAACACTATGCAGCTCTCGGGAGGGACAAGTTGAGTCTTAATAACCCGGCAGGCGTCAATGCCATAGTTCTTGACATAATACGAAGTGATGCAGAGAAAGATCCTGCAACTAATCCAAATGCGTTGCATGCTAGACTTTTAATCAATCAGGTGGATCGTAAACTGGTGAAACAAACAGTCATGACATCCGTGTATGGTGTCACGTATATGGGTGCACGGGATCAGATTAAGAAAAGGTTGAAAGAACGAGCATCCATTGCTGATGATTCACAATTATTTGCAGCTTCTTGCTATGCAGCTAGAGTAATAGCTTCAGAAAATCAGCCAGTTCGATGGACAACTCCTCTTGGACTGCCAGTGGTACAACCTTACCGGCAACTAGGAAGACATCTTATCAAGACTTCCTTGCAAGTGTTGGCTCTACAACGAGAAACTGACAAGGTCATGGCTACGCGTCAGAAAACAGCTTTTCCTCCAAATTTTGTACACTCCCTTGATGGTTCTCATATGATGATGACTGCTGTCGCCTGCCGAAGGGCAGGCCTTAACTTTGCAGGTGTCCATGATTCCTATTGGACCCATGCATGCGATGTTGACGTAATGAACAGGCTACTAAGGGAGAAATTTGTTGAGCTATACGAGGCTCCTATTCTGGAGAATATACTTCAGTCGG
ATGCTCTTTCTCATATGTGCTACACGTCTGTGGCTGACATACAGTCCTGTCTACGACTCTTGAAATCAGACTCTGGAGGCGAGTTTGCTACTTCGAGAATCAAATCTTCACTTTTACATATATTCGACATAACAAAACCCGATGAAACTAAGGTCTTCCATGTTTGGAAACTAAGTCAGTCACAATTTCAAGCAGTCCAAACCAAGAAGAACACGACACCCGAACAAGAGAAATGCTTGGCTGCAGCAACCATGGCGGCCTCTGGTCCAATTGGAGGAGGAGGAGGTGCAGCCATGGCTGCCTCTGGGCTTCCGCCATTAAAGCAGCCAAGCAAGCAATAA

Protein sequence

MVSHCLRTFRRNCIFRLSVLRIFHLLHLRFELLELVVVGIRNPYYPRMWRNIVKRAASRRDTFFSRKVIEDSIFVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREVGFAKSNFAHSSDPVAFYGVLNHANGYATAAEAAISDEDLSGSEEIQELMEELSKQDKVESHFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYMKSLFLGWFEPLRDAIAADQEFCKKKSRVSHFAYFDLLPADMMAVITMHKLMGLLMTNSGGNSSVRVVQAACQIGEAIEHEVRIHKFFENMKKKKSNEKTTEGDTEPVVEDQEKVAKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTRMWRNIVKRAASRRDTFFSRKVIEDSISVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREVGFAKSNFAHSSEPVAFYGVLNHAKGYATAAEAAISEEDLSGSEEIQELMEELSKQDKVESHFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYIKSLFLGWFEPLRDAIAADQEYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVNKRVLSIIDRIWASGGRLADLVDRXTEGDTEPVVEDQEKVAKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTARHMVIPYMPMLVPPLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVPLPEEPTVEDEAEIRKWKWKLKAAKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGNRWWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSLNNPAGVNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTSVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAARVIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFPPNFVHSLDGSHMMMTAVACRRAGLNFAGVHDSYWTHACDVDVMNRLLREKFVELYEAPILENILQSDALSHMCYTSVADIQSCLRLLKSDSGGEFATSRIKSSLLHIFDITKPDETKVFHVWKLSQSQFQAVQTKKNTTPEQEKCLAAATMAASGPIGGGGGAAMAASGLPPLKQPSKQ
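As a rough consistency check, the protein sequence above can be compared against a translation of the CDS. The sketch below assumes the standard nuclear genetic code and reports any codon containing an ambiguous base (such as N) as X; the function name `translate` is hypothetical, and only short excerpts of the CDS are used as examples.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence in frame 1; unknown codons become 'X'."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    # Walk the sequence codon by codon, ignoring a trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        protein.append(CODON_TABLE.get(cds[i:i + 3], "X"))
    return "".join(protein)

# First ten codons of the CDS listed above.
print(translate("ATGGTTTCTCACTGCCTGAGAACATTTCGG"))  # MVSHCLRTFR
# A codon with an ambiguous base, as found in the CDS above.
print(translate("CGCTNTACT"))  # RXT
```

Stop codons are returned as `*`; the protein record above omits the terminal stop, so `translate(cds).rstrip("*")` is the form to compare against it.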
BLAST of Cp4.1LG09g02310 vs. Swiss-Prot
Match: RPO1B_TOBAC (DNA-directed RNA polymerase 1B, mitochondrial OS=Nicotiana tabacum GN=RPOT1-TOM PE=2 SV=2)

HSP 1 Score: 1117.1 bits (2888), Expect = 0.0e+00
Identity = 581/883 (65.80%), Positives = 670/883 (75.88%), Query Frame = 1

Query: 528  GVLNHAKGYATAAEAAIS--EEDLSGSEEIQELMEELSKQDKVESHFKQPK--KMVDGMR 587
            G L   + Y +AAEA +S  EED+   +EIQEL+EE+ K+++      QPK  K + GM 
Sbjct: 106  GSLGRLRSYGSAAEAIVSTSEEDI---DEIQELIEEMDKENEALKANLQPKQPKTIGGMG 165

Query: 588  VGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYIKSLFLGWFEPLRDAI 647
            VGKYN LR+RQIK+ETEAWEEAA+EYQELL DMCEQKLAPNLPY+KSLFLGWFEPLRDAI
Sbjct: 166  VGKYNFLRRRQIKVETEAWEEAAKEYQELLMDMCEQKLAPNLPYMKSLFLGWFEPLRDAI 225

Query: 648  AADQEY-----DKGAHL----FLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGR 707
            AA+Q+      ++GA+      LP+ +M +    +    +          V +A   +G 
Sbjct: 226  AAEQKLCDEGKNRGAYAPFFDQLPAEMMAVITMHKLMGLLMTGGGTGSARVVQAASYIGE 285

Query: 708  TKWRVNKRVLSIIDRIWASGGRLADLVDRXTEGDTEPVVEDQEKVAKEQDKLRKKVTNLM 767
                   R+   +++   S     DL +  T GD          + KE+++LRKKV  LM
Sbjct: 286  AIEH-EARIHRFLEKTKKSNALSGDLEE--TPGD----------MMKERERLRKKVKILM 345

Query: 768  KKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLGGGPPDIRPA 827
            KKQKLRQVR IVK+ D  KPWGQD  VKVGCRLIQ+L+ETAYIQPP DQL  GPPDIRPA
Sbjct: 346  KKQKLRQVRKIVKQQDDEKPWGQDNLVKVGCRLIQILMETAYIQPPNDQLDDGPPDIRPA 405

Query: 828  FVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTARHMVIPYMPMLVPPLNWTGYDKGA 887
            FVHTLKT+  E  K SRRYGVI+CDPLVR+GL+KTARHMVIPYMPMLVPP +W GYDKG 
Sbjct: 406  FVHTLKTV--ETMKGSRRYGVIQCDPLVRKGLDKTARHMVIPYMPMLVPPQSWLGYDKGG 465

Query: 888  HLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRV-------------- 947
            +LFLPSY+MR HGAKQQREAVKRVPKKQLEPVF+ALDTLG TKWRV              
Sbjct: 466  YLFLPSYIMRTHGAKQQREAVKRVPKKQLEPVFQALDTLGNTKWRVNRKVLGIVDRIWAS 525

Query: 948  -------------PLPEEPTVEDEAEIRKWKWKLKAAKKENSERHSQRCDIELKLAVARK 1007
                         PLPE P  EDEAEIRKWKWK+K  KKEN ERHSQRCDIELKLAVARK
Sbjct: 526  GGRLADLVDREDVPLPEAPDTEDEAEIRKWKWKVKGVKKENCERHSQRCDIELKLAVARK 585

Query: 1008 MKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGILEFAEGRPLGESGLRWLKIHLAN 1067
            MK+E+GFYYPHNLDFRGRAYPMHP+LNHLGSD+CRGILEFAEGRPLG SGLRWLKIHLAN
Sbjct: 586  MKDEDGFYYPHNLDFRGRAYPMHPYLNHLGSDLCRGILEFAEGRPLGTSGLRWLKIHLAN 645

Query: 1068 LYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGNRWWLGAEDPFQCLAVCINLSEALR 1127
            +Y GGVDKLSY+ R++F+ENHL++IFDSA+RPLEG RWWLGAEDPFQCLA CIN++EALR
Sbjct: 646  VYGGGVDKLSYEGRVAFSENHLEDIFDSAERPLEGKRWWLGAEDPFQCLATCINIAEALR 705

Query: 1128 SPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSL--------NNPA----GVNAIVLDI 1187
            SPSPET +S+MP+HQDGSCNGLQHYAALGRDKL          + PA    G+ A VLDI
Sbjct: 706  SPSPETAISYMPIHQDGSCNGLQHYAALGRDKLGAAAVNLVAGDKPADVYSGIAARVLDI 765

Query: 1188 IRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTSVYGVTYMGARDQIKKRLKERASI 1247
            ++ DA KDPA +PN + ARLLINQVDRKLVKQTVMTSVYGVTY+GARDQIKKRLKER  I
Sbjct: 766  MKRDAAKDPANDPNVMRARLLINQVDRKLVKQTVMTSVYGVTYIGARDQIKKRLKERGVI 825

Query: 1248 ADDSQLFAASCYAAR-------------------------VIASENQPVRWTTPLGLPVV 1307
             DD++LFAA+CYAA+                         +IA EN PVRWTTPLGLPVV
Sbjct: 826  EDDNELFAAACYAAKTTLTALGEMFEAARSIMSWLGDCAKIIAMENHPVRWTTPLGLPVV 885

Query: 1308 QPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFPPNFVHSLDGSHMMMTAVACRRAG 1334
            QPYR+LGRHLIKTSLQ+L LQRETDKVM  RQ+TAFPPNFVHSLDGSHMMMTA+AC+ +G
Sbjct: 886  QPYRKLGRHLIKTSLQILTLQRETDKVMVKRQRTAFPPNFVHSLDGSHMMMTAIACKESG 945
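The identity and positives percentages in an HSP summary line follow directly from the match counts and the alignment length. The sketch below recomputes them from the summary line of the alignment above; `parse_hsp_stats` is a hypothetical helper, and the pattern is written to accept both the "Positives" and "Postives" spellings of the label.

```python
import re

# Captures "Identity = a/n" and "Positives = p/n" from an HSP summary line.
HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)

def parse_hsp_stats(line):
    """Return match counts and recomputed percentages for one HSP line."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, length, positives, length2 = (int(g) for g in match.groups())
    if length != length2:
        raise ValueError("identity and positives use different alignment lengths")
    return {
        "identical": identical,
        "positives": positives,
        "alignment_length": length,
        "identity_pct": round(100.0 * identical / length, 2),
        "positives_pct": round(100.0 * positives / length, 2),
    }

stats = parse_hsp_stats(
    "Identity = 581/883 (65.80%), Positives = 670/883 (75.88%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For the RPO1B_TOBAC match, 581 identical and 670 positive positions over an 883-column alignment give the 65.80% and 75.88% reported in its summary line.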

Match: RPOT1_NICSY (DNA-directed RNA polymerase 1, mitochondrial OS=Nicotiana sylvestris GN=RPOT1 PE=2 SV=1)

HSP 1 Score: 1108.2 bits (2865), Expect = 0.0e+00
Identity = 576/883 (65.23%), Positives = 672/883 (76.10%), Query Frame = 1

Query: 528  GVLNHAKGYATAAEA--AISEEDLSGSEEIQELMEELSKQDKVESHFKQPK--KMVDGMR 587
            G L   + Y +AAEA  + SEED+   +EIQEL+EE++K+++      QPK  K + GM 
Sbjct: 106  GSLGFLRSYGSAAEAIASTSEEDI---DEIQELIEEMNKENEALKTNLQPKQPKTIGGMG 165

Query: 588  VGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYIKSLFLGWFEPLRDAI 647
            VGKYN+LR+RQIK+ETEAWEEAA+EYQELL DMCEQKLAPNLPY+KSLFLGWFEPLRDAI
Sbjct: 166  VGKYNLLRRRQIKVETEAWEEAAKEYQELLMDMCEQKLAPNLPYMKSLFLGWFEPLRDAI 225

Query: 648  AADQEY-----DKGAHL----FLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGR 707
            AA+Q+      ++GA+      LP+ +M +    +    +          V +A   +G 
Sbjct: 226  AAEQKLCDEGKNRGAYAPFFDQLPAEMMAVITMHKLMGLLMTGGGTGSARVVQAASHIGE 285

Query: 708  TKWRVNKRVLSIIDRIWASGGRLADLVDRXTEGDTEPVVEDQEKVAKEQDKLRKKVTNLM 767
                   R+   +++   S     DL D  T GD          + KE++++RKKV  LM
Sbjct: 286  AIEH-EARIHRFLEKTKKSNALSGDLED--TPGD----------IMKERERVRKKVKILM 345

Query: 768  KKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLGGGPPDIRPA 827
            KKQKL+QVR IVK+ D  KPWGQD  VKVGCRLIQ+L+ETAYIQPP DQL   PPDIRPA
Sbjct: 346  KKQKLQQVRKIVKQQDDEKPWGQDNLVKVGCRLIQILMETAYIQPPNDQLDDCPPDIRPA 405

Query: 828  FVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTARHMVIPYMPMLVPPLNWTGYDKGA 887
            FVHTLKT+  E  K SRRYGVI+CDPLVR+GL+KTARHMVIPYMPMLVPP +W GYDKGA
Sbjct: 406  FVHTLKTV--ETMKGSRRYGVIQCDPLVRKGLDKTARHMVIPYMPMLVPPQSWLGYDKGA 465

Query: 888  HLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWR--------------- 947
            +LFLPSY+MR HGAKQQREAVKRVPKKQLEPVF+ALDTLG TKWR               
Sbjct: 466  YLFLPSYIMRTHGAKQQREAVKRVPKKQLEPVFQALDTLGNTKWRLNRKVLGIVDRIWAS 525

Query: 948  ------------VPLPEEPTVEDEAEIRKWKWKLKAAKKENSERHSQRCDIELKLAVARK 1007
                        VPLPEEP  EDEA+IRKWKWK+K  KKEN ERHSQRCDIELKLAVARK
Sbjct: 526  GGRLADLVDREDVPLPEEPDAEDEAQIRKWKWKVKGVKKENCERHSQRCDIELKLAVARK 585

Query: 1008 MKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGILEFAEGRPLGESGLRWLKIHLAN 1067
            MK+E+GFYYPHNLDFRGRAYPMHP+LNHLGSD+CRGILEFAEGRPLG+SGLRWLKIHLAN
Sbjct: 586  MKDEDGFYYPHNLDFRGRAYPMHPYLNHLGSDLCRGILEFAEGRPLGKSGLRWLKIHLAN 645

Query: 1068 LYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGNRWWLGAEDPFQCLAVCINLSEALR 1127
            +Y GGVDKLSY+ R++F+ENH+++IFDSA+RPLEG RWWLGAEDPFQCLA CIN++EALR
Sbjct: 646  VYGGGVDKLSYEGRVAFSENHVEDIFDSAERPLEGKRWWLGAEDPFQCLATCINIAEALR 705

Query: 1128 SPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSL--------NNPA----GVNAIVLDI 1187
            SPSPET +S+MP+HQDGSCNGLQHYAALGRD L          + PA    G+ A VLDI
Sbjct: 706  SPSPETAISYMPIHQDGSCNGLQHYAALGRDTLGAAAVNLVAGDKPADVYSGIAARVLDI 765

Query: 1188 IRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTSVYGVTYMGARDQIKKRLKERASI 1247
            ++ DA KDPA +PN + ARLLINQVDRKLVKQTVMTSVYGVTY+GARDQIK+RLKER  I
Sbjct: 766  MKRDAAKDPANDPNVMRARLLINQVDRKLVKQTVMTSVYGVTYIGARDQIKRRLKERGVI 825

Query: 1248 ADDSQLFAASCYAAR-------------------------VIASENQPVRWTTPLGLPVV 1307
             DD++LFAA+CYAA+                         +IA EN PVRWTTPLGLPVV
Sbjct: 826  EDDNELFAAACYAAKTTLTALGEMFEAARSIMSWLGDCAKIIAMENHPVRWTTPLGLPVV 885

Query: 1308 QPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFPPNFVHSLDGSHMMMTAVACRRAG 1334
            QPYR+LGRHLIKTSLQ+L LQRETDKVM  RQ+TAFPPNFVHSLDGSHMMMTA+AC+ +G
Sbjct: 886  QPYRKLGRHLIKTSLQILTLQRETDKVMVKRQRTAFPPNFVHSLDGSHMMMTAIACKESG 945

Match: RPOT2_NICSY (DNA-directed RNA polymerase 2, chloroplastic/mitochondrial OS=Nicotiana sylvestris GN=RPOT2 PE=2 SV=2)

HSP 1 Score: 1076.6 bits (2783), Expect = 0.0e+00
Identity = 574/968 (59.30%), Positives = 690/968 (71.28%), Query Frame = 1

Query: 451  MWRNIVKRAASRRDT---FFSRKVIEDSISVEEIRSSRTLGSLKSCCLLEGSRGISFFHS 510
            MWRNI+K+ +SR      F S+      +   +    +     +S   +  S  +  F +
Sbjct: 35   MWRNIIKQLSSRTPQKLLFSSKNRTYSFLGFGQDSIFKDNTKFRSLIPISCSNIVMGFQN 94

Query: 511  REVGFAKSNFAHSSEPVAFYGVLNH---AKGYATAAEAAI-----SEEDLSGSEEIQELM 570
                     F   S P+    V N+    K YA+ AEA       +EED+S  +E+ EL+
Sbjct: 95   LGEYLPGDEFL--SRPLIKNQVNNNFCCRKSYASVAEAVAVSSTDAEEDVSVVDEVHELL 154

Query: 571  EELSKQDKVESHFKQPKK--MVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMC 630
             EL K++K +  F++ K+  +  GM   KY  L++RQ+K+ETEAWE+AA+EY+ELL DMC
Sbjct: 155  TELKKEEKKQFAFRRRKQRMLTSGMGHRKYQTLKRRQVKVETEAWEQAAKEYKELLFDMC 214

Query: 631  EQKLAPNLPYIKSLFLGWFEPLRDAIAADQEYDKGAHLFLPSYVMRIHGAKQQREAVKRV 690
            EQKLAPNLPY+KSLFLGWFEPLRD IA +QE          +Y    +       AV  +
Sbjct: 215  EQKLAPNLPYVKSLFLGWFEPLRDKIAEEQELCSQGKS-KAAYAKYFYQLPADMMAVITM 274

Query: 691  PKKQLEPVFEALDTLG-RTKWRVNKRVLSIIDRIWASGGRLADLVDRXTE--GDTEPVVE 750
             K     +   L T G     RV +  L I D I     R+ + +++  +   + +   E
Sbjct: 275  HK-----LMGLLMTGGDHGTARVVQAALVIGDAI-EQEVRIHNFLEKTKKQKAEKDKQKE 334

Query: 751  DQEKVAKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIET 810
            D E V +EQ+KLRKKVTNLMKKQKLR V  IV+  D  KPWGQDA  KVG RLI LL++T
Sbjct: 335  DGEHVTQEQEKLRKKVTNLMKKQKLRAVGQIVRRQDDSKPWGQDARAKVGSRLIDLLLQT 394

Query: 811  AYIQPPMDQLGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTARHMV 870
            AYIQPP +QL   PPDIRPAFVH+++T+ KE +  SRRYG+I+CD LV +GLE+TARHMV
Sbjct: 395  AYIQPPANQLAVDPPDIRPAFVHSVRTVAKETKSASRRYGIIQCDELVFKGLERTARHMV 454

Query: 871  IPYMPMLVPPLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLG 930
            IPYMPMLVPP+ WTGYDKG HL+LPSYVMR HGA+QQREAVKR  + QL+PVFEALDTLG
Sbjct: 455  IPYMPMLVPPVKWTGYDKGGHLYLPSYVMRTHGARQQREAVKRASRNQLQPVFEALDTLG 514

Query: 931  RTKWRV---------------------------PLPEEPTVEDEAEIRKWKWKLKAAKKE 990
             TKWR+                           PLPEEP  EDEA   KW+WK+K+ KKE
Sbjct: 515  NTKWRINKRVLSVVDRIWAGGGRLADLVDRDDAPLPEEPDTEDEALRTKWRWKVKSVKKE 574

Query: 991  NSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGILEF 1050
            N ERHSQRCDIELKLAVARKMK+EE F+YPHN+DFRGRAYPMHPHLNHLGSD+CRG+LEF
Sbjct: 575  NRERHSQRCDIELKLAVARKMKDEESFFYPHNVDFRGRAYPMHPHLNHLGSDICRGVLEF 634

Query: 1051 AEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGNRWWL 1110
            AEGRPLGESGLRWLKIHLANL+AGGV+KLS + RI FTENH+D+IFDS+D+PLEG RWWL
Sbjct: 635  AEGRPLGESGLRWLKIHLANLFAGGVEKLSLEGRIGFTENHMDDIFDSSDKPLEGRRWWL 694

Query: 1111 GAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSL----- 1170
             AEDPFQCLAVCINLSEA+RS SPET++SH+PVHQDGSCNGLQHYAALGRDKL       
Sbjct: 695  NAEDPFQCLAVCINLSEAVRSSSPETSVSHIPVHQDGSCNGLQHYAALGRDKLGAAAVNL 754

Query: 1171 ---NNPA----GVNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTSVYG 1230
                 PA    G+ A VLDI++ DA++DPA  P+A+ AR+L+NQVDRKLVKQTVMTSVYG
Sbjct: 755  VAGEKPADVYSGIAARVLDIMKRDAQRDPAEFPDAVRARVLVNQVDRKLVKQTVMTSVYG 814

Query: 1231 VTYMGARDQIKKRLKERASIADDSQLFAASCYAA-------------------------R 1290
            VTY+GARDQIK+RLKER +IADDS+LF A+CYAA                         +
Sbjct: 815  VTYIGARDQIKRRLKERGAIADDSELFGAACYAAKVTLTALGEMFEAARSIMTWLGECAK 874

Query: 1291 VIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFPPNF 1339
            +IASEN+PVRWTTPLGLPVVQPYR++GRHLIKTSLQ+L LQRET+KVM  RQ+TAFPPNF
Sbjct: 875  IIASENEPVRWTTPLGLPVVQPYRKIGRHLIKTSLQILTLQRETEKVMVKRQRTAFPPNF 934

BLAST of Cp4.1LG09g02310 vs. Swiss-Prot
Match: RPO2B_TOBAC (DNA-directed RNA polymerase 2B, chloroplastic/mitochondrial OS=Nicotiana tabacum GN=RPOT2-TOM PE=2 SV=2)

HSP 1 Score: 1072.8 bits (2773), Expect = 2.8e-312
Identity = 571/972 (58.74%), Positives = 697/972 (71.71%), Query Frame = 1

Query: 451  MWRNIVKRAASR--RDTFFSRK--------VIEDSISVEEIRSSRTLGSLKSCCLLEGSR 510
            MWRNI+K+ +SR  +   FS K          +DS+  +  +  R+L  +    ++ G +
Sbjct: 35   MWRNIIKQLSSRTPQKLLFSSKNRTYSFLGFGQDSVFKDNTKF-RSLIPISCSNIVMGFQ 94

Query: 511  GISFFHSREVGFAKSNFAHSSEPVAFYGVLNHAKGYATAAEAAI-----SEEDLSGSEEI 570
             +  +   +   ++    +      F       K YA+ AEA       +EED+S  +E+
Sbjct: 95   NLGEYLPGDEFLSRPLLKNQVNSNDFCC----RKSYASVAEAVAVSSTDAEEDVSVVDEV 154

Query: 571  QELMEELSKQDKVESHFKQPKK--MVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELL 630
            QEL+ EL K++K +  F++ K+  +  GM   KY  L++RQ+K+ETEAWE+AA+EY+ELL
Sbjct: 155  QELLTELKKEEKKQFAFRRRKQRMLTSGMGHRKYQTLKRRQVKVETEAWEQAAKEYKELL 214

Query: 631  TDMCEQKLAPNLPYIKSLFLGWFEPLRDAIAADQEYDKGAHLFLPSYVMRIHGAKQQREA 690
             DMCEQKLAPNLPY+KSLFLGWFEPLRD IA +QE          +Y   ++       A
Sbjct: 215  FDMCEQKLAPNLPYVKSLFLGWFEPLRDKIAEEQELCSQGKS-KAAYAKYLYQLPADMMA 274

Query: 691  VKRVPKKQLEPVFEALDTLG-RTKWRVNKRVLSIIDRIWASGGRLADLVDRXTE--GDTE 750
            V  + K     +   L T G     RV +  L I D I     R+ + +++  +   + +
Sbjct: 275  VITMHK-----LMGLLMTGGDHGTARVVQAALVIGDAI-EQEVRIHNFLEKTKKQKAEKD 334

Query: 751  PVVEDQEKVAKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQL 810
               ED E V +EQ+KLRKKVTNLMKKQKLR V  IV+  D  KPWGQDA  KVG RLI+L
Sbjct: 335  KQKEDGEHVTQEQEKLRKKVTNLMKKQKLRAVGQIVRRQDDSKPWGQDAKAKVGSRLIEL 394

Query: 811  LIETAYIQPPMDQLGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTA 870
            L++TAYIQPP +QL   PPDIRPAF+H+++T+ KE +  SRRYG+I+CD LV +GLE+TA
Sbjct: 395  LLQTAYIQPPANQLAVDPPDIRPAFLHSVRTVAKETKSASRRYGIIQCDELVFKGLERTA 454

Query: 871  RHMVIPYMPMLVPPLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEAL 930
            RHMVIPYMPMLVPP+ WTGYDKG HL+LPSYVMR HGA+QQREAVKR  + QL+PVFEAL
Sbjct: 455  RHMVIPYMPMLVPPVKWTGYDKGGHLYLPSYVMRTHGARQQREAVKRASRNQLQPVFEAL 514

Query: 931  DTLGRTKWRV---------------------------PLPEEPTVEDEAEIRKWKWKLKA 990
            DTLG TKWR+                           PLPEEP  EDEA   KW+WK+K+
Sbjct: 515  DTLGSTKWRINKRVLSVIDRIWAGGGRLADLVDRDDAPLPEEPDTEDEALRTKWRWKVKS 574

Query: 991  AKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRG 1050
             KKEN ERHSQRCDIELKLAVARKMK+EEGF+YPHN+DFRGRAYPMHPHLNHLGSD+CRG
Sbjct: 575  VKKENRERHSQRCDIELKLAVARKMKDEEGFFYPHNVDFRGRAYPMHPHLNHLGSDICRG 634

Query: 1051 ILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGN 1110
            +L FAEGRPLGESGLRWLKIHLANL+AGGV+KLS + RI+FTENH+D+IFDSAD+PLEG 
Sbjct: 635  VLVFAEGRPLGESGLRWLKIHLANLFAGGVEKLSLEGRIAFTENHMDDIFDSADKPLEGR 694

Query: 1111 RWWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSL- 1170
            RWWL AEDPFQCLAVCINLSEA+RS SPET++SH+PVHQDGSCNGLQHYAALGRD+L   
Sbjct: 695  RWWLNAEDPFQCLAVCINLSEAVRSSSPETSISHIPVHQDGSCNGLQHYAALGRDELGAA 754

Query: 1171 -------NNPA----GVNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMT 1230
                     PA    G+ A VLDI++ DA++DPA  P+A+ AR L+NQVDRKLVKQTVMT
Sbjct: 755  AVNLVAGEKPADVYSGIAARVLDIMKRDAQRDPAEFPDAVRARALVNQVDRKLVKQTVMT 814

Query: 1231 SVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAA---------------------- 1290
            SVYGVTY+GARDQIK+RLKER +IADDS+LF A+CYAA                      
Sbjct: 815  SVYGVTYIGARDQIKRRLKERGAIADDSELFGAACYAAKVTLTALGEMFEAARSIMTWLG 874

Query: 1291 ---RVIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAF 1339
               ++IASEN+PVRWTTPLGLPVVQPYR++GRHLIKTSLQ+L LQ+ET+KVM  RQ+TAF
Sbjct: 875  ECAKIIASENEPVRWTTPLGLPVVQPYRKIGRHLIKTSLQILTLQQETEKVMVKRQRTAF 934

BLAST of Cp4.1LG09g02310 vs. Swiss-Prot
Match: RPOT2_ARATH (DNA-directed RNA polymerase 2, chloroplastic/mitochondrial OS=Arabidopsis thaliana GN=RPOT2 PE=1 SV=1)

HSP 1 Score: 1009.6 bits (2609), Expect = 3.4e-293
Identity = 546/964 (56.64%), Positives = 655/964 (67.95%), Query Frame = 1

Query: 451  MWRNIVKRAASRRDTFFSRK------VIEDSISVEEIRSSRTLGSLKSCCLLEGSRGISF 510
            MWRNI K+A SR     +        ++    S+     S     L S C  +G R +S 
Sbjct: 40   MWRNIAKQAISRSAARLNVSSQTRGLLVSSPESIFSKNLSFRFPVLGSPCHGKGFRCLSG 99

Query: 511  FHSREVGFAKSNFAHSSEPVAFYGVLNHAKGYATAAEAAISEEDLSGSEEIQELMEELSK 570
               RE  F+KS    S       G L  A+GY + AE  +   D+    E+ EL++E+ K
Sbjct: 100  ITRREE-FSKSERCLS-------GTL--ARGYTSVAEEEVLSTDVEEEPEVDELLKEMKK 159

Query: 571  QDKVESHFKQPKKMVD--GMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLA 630
            + K ESH     K  D  GM   K+  L +RQ+K+ETE WE AA EY ELLTDMCEQKLA
Sbjct: 160  EKKRESHRSWRMKKQDQFGMGRTKFQNLWRRQVKIETEEWERAAAEYMELLTDMCEQKLA 219

Query: 631  PNLPYIKSLFLGWFEPLRDAIAADQEYDKGAHLFLPSYVMRIHGAKQQ-REAVKRVPKKQ 690
            PNLPY+KSLFLGWFEPLRDAIA DQE            + R+  +K      + ++P  +
Sbjct: 220  PNLPYVKSLFLGWFEPLRDAIAKDQE------------LYRLGKSKATYAHYLDQLPADK 279

Query: 691  LEPVFEALDTLGRTKWRVNKRVLSIIDRIWASGG------RLADLVDRXTEGDT--EPVV 750
            +  V      +G      +   + ++      G       R+   +D+  +GD   E   
Sbjct: 280  IS-VITMHKLMGHLMTGGDNGCVKVVHAACTVGDAIEQEIRICTFLDKKKKGDDNEESGG 339

Query: 751  EDQEKVAKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIE 810
             + E   KEQDKLRKKV  L+KKQKL  VR I++ HD+ KPW  D   KVG RLI+LL+ 
Sbjct: 340  VENETSMKEQDKLRKKVNELIKKQKLSAVRKILQSHDYTKPWIADVRAKVGSRLIELLVR 399

Query: 811  TAYIQPPMDQLGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTARHM 870
            TAYIQ P DQ     PD+RPAFVHT K + K +  + R+YGVIECDPLVR+GLEK+ R+ 
Sbjct: 400  TAYIQSPADQQDNDLPDVRPAFVHTFK-VAKGSMNSGRKYGVIECDPLVRKGLEKSGRYA 459

Query: 871  VIPYMPMLVPPLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTL 930
            V+PYMPMLVPPL W+GYDKGA+LFL SY+M+ HGAKQQREA+K  PK QL+PVFEALDTL
Sbjct: 460  VMPYMPMLVPPLKWSGYDKGAYLFLTSYIMKTHGAKQQREALKSAPKGQLQPVFEALDTL 519

Query: 931  GRTKWRV---------------------------PLPEEPTVEDEAEIRKWKWKLKAAKK 990
            G TKWRV                           PLPE+P  EDE  ++KWKW++K+AKK
Sbjct: 520  GSTKWRVNKRVLTVVDRIWSSGGCVADMVDRSDVPLPEKPDTEDEGILKKWKWEVKSAKK 579

Query: 991  ENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGILE 1050
             NSERHSQRCD ELKL+VARKMK+EE FYYPHN+DFRGRAYPM PHLNHLGSD+CRG+LE
Sbjct: 580  VNSERHSQRCDTELKLSVARKMKDEEAFYYPHNMDFRGRAYPMPPHLNHLGSDLCRGVLE 639

Query: 1051 FAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGNRWW 1110
            FAEGRP+G SGLRWLKIHLANLYAGGVDKLS   R++FTENHLD+IFDSADRPLEG+RWW
Sbjct: 640  FAEGRPMGISGLRWLKIHLANLYAGGVDKLSLDGRLAFTENHLDDIFDSADRPLEGSRWW 699

Query: 1111 LGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSLN--- 1170
            L AEDPFQCLAVCI+L+EALRSPSPET LSH+P+HQDGSCNGLQHYAALGRD L      
Sbjct: 700  LQAEDPFQCLAVCISLTEALRSPSPETVLSHIPIHQDGSCNGLQHYAALGRDTLGAEAVN 759

Query: 1171 -----NPA----GVNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTSVY 1230
                  PA    G+   VLDI+R DA++DP   P AL AR L+NQVDRKLVKQTVMTSVY
Sbjct: 760  LVAGEKPADVYSGIATRVLDIMRRDADRDPEVFPEALRARKLLNQVDRKLVKQTVMTSVY 819

Query: 1231 GVTYMGARDQIKKRLKERASIADDSQLFAASCYAARV----------------------- 1290
            GVTY+GARDQIK+RLKER+   D+ ++F A+CYAA+V                       
Sbjct: 820  GVTYIGARDQIKRRLKERSDFGDEKEVFGAACYAAKVTLAAIDEMFQAARAIMRWFGECA 879

Query: 1291 --IASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFPPN 1334
              IASEN+ VRWTTPLGLPVVQPY Q+G  L+KTSLQ L+LQ ETD+V+  RQ+TAFPPN
Sbjct: 880  KIIASENETVRWTTPLGLPVVQPYHQMGTKLVKTSLQTLSLQHETDQVIVRRQRTAFPPN 939

BLAST of Cp4.1LG09g02310 vs. TrEMBL
Match: A0A0A0KR04_CUCSA (DNA-directed RNA polymerase OS=Cucumis sativus GN=Csa_5G607420 PE=3 SV=1)

HSP 1 Score: 1254.2 bits (3244), Expect = 0.0e+00
Identity = 651/953 (68.31%), Positives = 738/953 (77.44%), Query Frame = 1

Query: 451  MWRNIVKRAASRRDTFFS-------RKVIEDSISVEEIRSSRTLGSLKSCCLLEGSRGIS 510
            MWRN+ K AASR+   FS       RKV ED I +++IRSSR+L +L +CC   GS  I 
Sbjct: 1    MWRNVFKTAASRKAKLFSEFSSSTSRKVTEDHIFLDQIRSSRSLETLNTCCQSGGSSRII 60

Query: 511  FFHSREVGFAKSNFAHSSEPVAFYGVLNHAKGYATAAEAAISEEDLSGSEEIQELMEELS 570
            F H ++VGF  SNF HSS PV FYGVL HAKGYATAAEAAI E DLSGSEEIQE+ME L+
Sbjct: 61   FLHPQKVGFTNSNFPHSSNPVPFYGVLYHAKGYATAAEAAIFEGDLSGSEEIQEIMEGLN 120

Query: 571  KQDKVESHFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAP 630
            KQDKVE HFKQPK MVDG R  KY++LRKRQIK+ETEAWEEAA+EYQ+L+ D+CEQKLAP
Sbjct: 121  KQDKVELHFKQPKGMVDGNRDTKYDMLRKRQIKIETEAWEEAAKEYQDLIADICEQKLAP 180

Query: 631  NLPYIKSLFLGWFEPLRDAIAADQEYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLE 690
            NLPY+KSLFLGWF+PLRDAI A+QE  K      PS+ +  H       AV  +   +L 
Sbjct: 181  NLPYMKSLFLGWFQPLRDAIVAEQESVKFKRSS-PSHALYFHLLPADMMAV--ITMHKLM 240

Query: 691  PVFEALDTLGRTKWRVNKRVLSIIDRIWASGGRLADLVDRXTEGDTEPVVEDQEKVAKEQ 750
             +  + D  G    RV +    I + I  +  R+ +  ++  +          E++A+  
Sbjct: 241  GLLMS-DIEGGGSVRVTQAASGIGEAI-ENEVRIRNFFEKTKK--------QPEQLAEGH 300

Query: 751  DKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQ 810
            DKLRKK+T LMK+QKL++V  IVK HD  KPWG DAHVKVGCRLIQLLIETAYIQPP+DQ
Sbjct: 301  DKLRKKLTKLMKQQKLQKVNFIVKNHDDSKPWGTDAHVKVGCRLIQLLIETAYIQPPVDQ 360

Query: 811  LGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTARHMVIPYMPMLVP 870
            +G  PPD+RPAFVH+LKT  KE+Q+  +RYGVIECDPLV RG+ KTA HM+IPYMPMLVP
Sbjct: 361  IGDAPPDLRPAFVHSLKTSLKESQRLGKRYGVIECDPLVYRGMVKTAGHMIIPYMPMLVP 420

Query: 871  PLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRV--- 930
            P  WTGYD+GAH FLPSYVMRI GA+QQREAVKR  KKQL PVF+ALD LG TKWRV   
Sbjct: 421  PRKWTGYDQGAHFFLPSYVMRIRGARQQREAVKRASKKQLGPVFKALDILGSTKWRVNKR 480

Query: 931  ------------------------PLPEEPTVEDEAEIRKWKWKLKAAKKENSERHSQRC 990
                                    PLPE+P VEDEAEIR WKWK+KA K+ENSERHSQRC
Sbjct: 481  VLSVIEKIWASGGRLADLVDREDMPLPEQPMVEDEAEIRNWKWKVKAVKRENSERHSQRC 540

Query: 991  DIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGILEFAEGRPLGES 1050
            D ELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNH+GSD CRG LEFAEGRPLGES
Sbjct: 541  DTELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHIGSDFCRGTLEFAEGRPLGES 600

Query: 1051 GLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGNRWWLGAEDPFQCL 1110
            GLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEG+RWWLGAEDPFQCL
Sbjct: 601  GLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGSRWWLGAEDPFQCL 660

Query: 1111 AVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSL--------NNPA-- 1170
            AVCI+LSEALRSPSPETT+SHMPVHQDGSCNGLQHYAALGRDKL          + PA  
Sbjct: 661  AVCIDLSEALRSPSPETTISHMPVHQDGSCNGLQHYAALGRDKLGAEAVNLAAGDKPADV 720

Query: 1171 --GVNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTSVYGVTYMGARDQ 1230
              G+ + VLDI+RSDA KDPA+NPNALHARLLINQVDRKLVKQTVMTSVYGVTY GA+DQ
Sbjct: 721  YSGIASRVLDIMRSDAAKDPASNPNALHARLLINQVDRKLVKQTVMTSVYGVTYAGAKDQ 780

Query: 1231 IKKRLKERASIADDSQLFAASCYA-------------------------ARVIASENQPV 1290
            I++RLKER+SI ++  LF ASCYA                         A+VIASENQ V
Sbjct: 781  IRQRLKERSSIENERHLFTASCYAAKTTLTAIGEMFEAAKSIMNWLGECAKVIASENQAV 840

Query: 1291 RWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFPPNFVHSLDGSHM 1333
            RWTTPLGLPVVQPYR+LGRHL+KTSLQ+L+LQRETDKVMA RQ+TAFPPN++HSLD SHM
Sbjct: 841  RWTTPLGLPVVQPYRKLGRHLVKTSLQMLSLQRETDKVMAMRQRTAFPPNYIHSLDSSHM 900

BLAST of Cp4.1LG09g02310 vs. TrEMBL
Match: M5XKH1_PRUPE (DNA-directed RNA polymerase OS=Prunus persica GN=PRUPE_ppa000780mg PE=3 SV=1)

HSP 1 Score: 1226.5 bits (3172), Expect = 0.0e+00
Identity = 641/979 (65.47%), Positives = 738/979 (75.38%), Query Frame = 1

Query: 451  MWRNIVKRAASRRDTFFSRKVI-----------EDSISVEEIRSSRTLGSLKSCCLLEGS 510
            MWRN+ K+ ASR+    S+              ++S  +++ R       + +  L+ G 
Sbjct: 1    MWRNLAKQVASRKTNLSSQSHFGSPSSTSMIFSQESSFLDKARHFEARKCINNRILVMGF 60

Query: 511  RGISFFHSREVGFAKSNFAHSSEPVAFYGVLNHAKGYATAAEAAIS---EEDLSGSEEIQ 570
            R +    S++    + +  + S P    G  N+AKGYA+ AEA  S   EED SGSEEIQ
Sbjct: 61   RQVGDMASQKEELGRCSSLNPSYPYGISGFCNYAKGYASVAEAIASTDGEEDSSGSEEIQ 120

Query: 571  ELMEELSKQDK-VESHFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTD 630
            E++E+L +++  VESHFKQPK++V GM VGKYN+LRKRQIK+ETEAW+EAA+EYQELL D
Sbjct: 121  EMLEDLIRENNMVESHFKQPKRVVVGMGVGKYNLLRKRQIKLETEAWQEAAKEYQELLAD 180

Query: 631  MCEQKLAPNLPYIKSLFLGWFEPLRDAIAADQEYDKGAHLFLPSYVMRIHGAKQQREAVK 690
            MCEQKLAPNLPYIKSLFLGWFEPLRDAIAADQ+  K  +        + H          
Sbjct: 181  MCEQKLAPNLPYIKSLFLGWFEPLRDAIAADQDSCKQPNS------RQSHAPYFDHLPAD 240

Query: 691  RVPKKQLEPVFEALDTL--GRTKWRVNKRVLSI---------IDRIWASGGRLADLVDRX 750
            ++    +  +   L T   G    RV +   +I         I R      +  + +D+ 
Sbjct: 241  KMAVITMHKLMGLLMTNNGGIGSVRVVQAACAIGEAIEHEVRIHRFLEKTKKKKNTIDKK 300

Query: 751  TEGDTEPVV-------EDQEKVAKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQ 810
             E D+ PV        ++QEK+ KEQ++LRKKV  L+K+QK++QVR IVKE + LKPWGQ
Sbjct: 301  AEADSVPVTIEQEKLADEQEKLTKEQERLRKKVNKLIKRQKMQQVRGIVKEQEDLKPWGQ 360

Query: 811  DAHVKVGCRLIQLLIETAYIQPPMDQLGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIE 870
            +AHVKVGCRLIQLL++TAYIQPP+DQ+G GPPDIRPAFVH LKTIT++ QKTSRRYGVIE
Sbjct: 361  EAHVKVGCRLIQLLMDTAYIQPPVDQIGDGPPDIRPAFVHNLKTITRDTQKTSRRYGVIE 420

Query: 871  CDPLVRRGLEKTARHMVIPYMPMLVPPLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKR 930
            CDP+VR+G+EKTARHMV+PYMPMLVPP+NWTGYD+GA+LFLPSYVMR HGAKQQRE VKR
Sbjct: 421  CDPIVRKGMEKTARHMVMPYMPMLVPPINWTGYDRGAYLFLPSYVMRTHGAKQQREVVKR 480

Query: 931  VPKKQLEPVFEALDTLGRTKWRV---------------------------PLPEEPTVED 990
             P+KQLEPVFEALDTLG TKWRV                           PLPEEP  ED
Sbjct: 481  TPRKQLEPVFEALDTLGSTKWRVNKRVLGVIDRIWASGGRLADLVDREDVPLPEEPDTED 540

Query: 991  EAEIRKWKWKLKAAKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMH 1050
            EAEIRKWKWKLKAAKKENSERHSQRCDIELKLA +RKMK+EEGFYYPHNLDFRGRAYPMH
Sbjct: 541  EAEIRKWKWKLKAAKKENSERHSQRCDIELKLAASRKMKDEEGFYYPHNLDFRGRAYPMH 600

Query: 1051 PHLNHLGSDMCRGILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLD 1110
            P+LNHLGSDMCRGILEF+EGR LG+SGLRWLKIHLANLYAGGVDKLS+ DR +FTENH+D
Sbjct: 601  PYLNHLGSDMCRGILEFSEGRHLGKSGLRWLKIHLANLYAGGVDKLSFDDRAAFTENHVD 660

Query: 1111 EIFDSADRPLEGNRWWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQ 1170
            EIFDSADRPLEG RWWLGAEDPFQCLA CINL EALRSPSPETT+S+MPVHQDGSCNGLQ
Sbjct: 661  EIFDSADRPLEGRRWWLGAEDPFQCLAACINLCEALRSPSPETTISYMPVHQDGSCNGLQ 720

Query: 1171 HYAALGRDKLSL--------NNPA----GVNAIVLDIIRSDAEKDPATNPNALHARLLIN 1230
            HYAALGRDKL          + PA    G+ A VLDI+R+DAEKDPATNPNALHARLLIN
Sbjct: 721  HYAALGRDKLGAAAVNLVGGDKPADVYSGIAARVLDIMRNDAEKDPATNPNALHARLLIN 780

Query: 1231 QVDRKLVKQTVMTSVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAA--------- 1290
            QVDRKLVKQTVMTSVYGVTY+GARDQIK+RLKER SIADD+ LFAA+CYAA         
Sbjct: 781  QVDRKLVKQTVMTSVYGVTYVGARDQIKRRLKERGSIADDTALFAAACYAARTTLTALGE 840

Query: 1291 ----------------RVIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRE 1333
                            +VIASENQPVRW TPLGLPVVQPYRQLGRHLIKTSLQVL LQRE
Sbjct: 841  MFEAARSIMSWLGECAKVIASENQPVRWITPLGLPVVQPYRQLGRHLIKTSLQVLTLQRE 900

BLAST of Cp4.1LG09g02310 vs. TrEMBL
Match: A0A0D2T267_GOSRA (DNA-directed RNA polymerase OS=Gossypium raimondii GN=B456_008G174700 PE=3 SV=1)

HSP 1 Score: 1204.1 bits (3114), Expect = 0.0e+00
Identity = 628/969 (64.81%), Positives = 723/969 (74.61%), Query Frame = 1

Query: 451  MWRNIVKRAASRRDTFFSRKVIEDSIS-------------VEEIRSSRTLGSLKSCCLLE 510
            MWR+++K+A+    + F  +    S+S             + + R S+ +    S C + 
Sbjct: 1    MWRSLLKQASHTHKSKFVPESCTSSLSSSSSTGFCQTSAFINKFRPSKAINHGFSGCPIL 60

Query: 511  GSRGISFFHSREVGFAKSNFAHSSEPVAFYGVLNHAKGYATAAEAAIS-EEDLSGSEEIQ 570
            G R    F S++    +  F+HS  P+ F G   + KGYA+AAEA +S E+DLSGSEEI 
Sbjct: 61   GFRENYVFPSQKDCLGRFGFSHSGNPLTFSGKFWNFKGYASAAEAIVSNEDDLSGSEEIN 120

Query: 571  ELMEELSKQDKVESHFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDM 630
            EL+E + K+++ ES  KQPK MV GM V KYN L++RQIK+ETEAWEEAA+EYQEL+ DM
Sbjct: 121  ELVEAMIKEERKESFSKQPKIMVGGMGVAKYNTLKRRQIKIETEAWEEAAKEYQELIADM 180

Query: 631  CEQKLAPNLPYIKSLFLGWFEPLRDAIAADQEYDKGAHLFLPSYVMRIHGAKQQREAVKR 690
            C+QKLAPNLPY+KSLFLGWFEPLRD+IAA+QE  KG   F  S+    +       AV  
Sbjct: 181  CQQKLAPNLPYVKSLFLGWFEPLRDSIAAEQEVCKGN--FKISHAAYFNELSADMMAVVT 240

Query: 691  VPKKQLEPVFEALDTLGRTKWRVNKRVLSIIDRI--------WASGGRLADLVDRXTEGD 750
            + K          +T G    RV +    I + I        +    +  +  D+ +  +
Sbjct: 241  MHKLM---GLLMTNTAGTGGIRVVQAACQIGEAIENEARIQKFLEKTKKKNTTDKKSVTE 300

Query: 751  TEPVVEDQEKVAKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLI 810
            +EP   +Q K+AK ++KLRKKVT LMKKQK+ QVR IVK  D  KPWGQ+AHVKVGCRLI
Sbjct: 301  SEPETTEQGKLAKNEEKLRKKVTQLMKKQKVHQVREIVKGRDTSKPWGQEAHVKVGCRLI 360

Query: 811  QLLIETAYIQPPMDQLGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEK 870
            QLL+E AYIQPP+DQ+G GPPDIRPAFVH LK + K+  K SRRYGVIECDPLVR+GLEK
Sbjct: 361  QLLMENAYIQPPVDQIGDGPPDIRPAFVHALKNVIKDGNKGSRRYGVIECDPLVRKGLEK 420

Query: 871  TARHMVIPYMPMLVPPLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFE 930
            TA+HMVIPYMPMLVPP NWTGYD+GA+LFLPSYVMR HGAKQQRE VKR P+KQLEPVFE
Sbjct: 421  TAKHMVIPYMPMLVPPQNWTGYDQGAYLFLPSYVMRTHGAKQQRETVKRTPRKQLEPVFE 480

Query: 931  ALDTLGRTKWR---------------------------VPLPEEPTVEDEAEIRKWKWKL 990
            ALDTLG TKWR                           VPLPEEP  EDE EIRKWKWK+
Sbjct: 481  ALDTLGNTKWRINRRILGVVDRLWANGGRLADLVDREDVPLPEEPDTEDETEIRKWKWKV 540

Query: 991  KAAKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMC 1050
            KA KKEN+ERHSQRCD+ELKLAVARKMK+E GFYYPHNLDFRGRAYPMHP+LNHLGSD+C
Sbjct: 541  KAVKKENNERHSQRCDVELKLAVARKMKDEVGFYYPHNLDFRGRAYPMHPYLNHLGSDLC 600

Query: 1051 RGILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLE 1110
            RG+LEFAEGRPLG+SGLRWLKIHLANLYAGGVDKLSY+ R+ FTE+HLD+IFDSADRPLE
Sbjct: 601  RGVLEFAEGRPLGKSGLRWLKIHLANLYAGGVDKLSYEGRVEFTESHLDDIFDSADRPLE 660

Query: 1111 GNRWWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLS 1170
            G RWWL AEDPFQCLA CINLSEALRS  PE T+SHMPVHQDGSCNGLQHYAALGRDKL 
Sbjct: 661  GKRWWLSAEDPFQCLAACINLSEALRSSIPEATISHMPVHQDGSCNGLQHYAALGRDKLG 720

Query: 1171 L--------NNPA----GVNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTV 1230
                     + PA    G+ A VLDI++ DA++DPATNPNALHARLLINQVDRKLVKQTV
Sbjct: 721  AAAVNLVAGDKPADVYSGIAARVLDIMKRDAQEDPATNPNALHARLLINQVDRKLVKQTV 780

Query: 1231 MTSVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAA-------------------- 1290
            MTSVYGVTY+GARDQIK+RLKER +IADD+QLF ASCYAA                    
Sbjct: 781  MTSVYGVTYVGARDQIKRRLKERGAIADDTQLFVASCYAARTTLTALGEMFQAARSIMGW 840

Query: 1291 -----RVIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKT 1334
                 +VIASENQPVRW TPLGLPVVQPYRQLGRHLIKTSLQVL LQRETDKVM  RQ+T
Sbjct: 841  LGECAKVIASENQPVRWVTPLGLPVVQPYRQLGRHLIKTSLQVLTLQRETDKVMVKRQRT 900

BLAST of Cp4.1LG09g02310 vs. TrEMBL
Match: A0A0D2T7T6_GOSRA (DNA-directed RNA polymerase OS=Gossypium raimondii GN=B456_008G174700 PE=3 SV=1)

HSP 1 Score: 1201.4 bits (3107), Expect = 0.0e+00
Identity = 626/967 (64.74%), Positives = 721/967 (74.56%), Query Frame = 1

Query: 451  MWRNIVKRAASRRDTFFSRKVIEDSIS-------------VEEIRSSRTLGSLKSCCLLE 510
            MWR+++K+A+    + F  +    S+S             + + R S+ +    S C + 
Sbjct: 1    MWRSLLKQASHTHKSKFVPESCTSSLSSSSSTGFCQTSAFINKFRPSKAINHGFSGCPIL 60

Query: 511  GSRGISFFHSREVGFAKSNFAHSSEPVAFYGVLNHAKGYATAAEAAIS-EEDLSGSEEIQ 570
            G R    F S++    +  F+HS  P+ F G   + KGYA+AAEA +S E+DLSGSEEI 
Sbjct: 61   GFRENYVFPSQKDCLGRFGFSHSGNPLTFSGKFWNFKGYASAAEAIVSNEDDLSGSEEIN 120

Query: 571  ELMEELSKQDKVESHFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDM 630
            EL+E + K+++ ES  KQPK MV GM V KYN L++RQIK+ETEAWEEAA+EYQEL+ DM
Sbjct: 121  ELVEAMIKEERKESFSKQPKIMVGGMGVAKYNTLKRRQIKIETEAWEEAAKEYQELIADM 180

Query: 631  CEQKLAPNLPYIKSLFLGWFEPLRDAIAADQEYDKGAHLFLPSYVMRIHGAKQQREAVKR 690
            C+QKLAPNLPY+KSLFLGWFEPLRD+IAA+QE  KG   F  S+    +       AV  
Sbjct: 181  CQQKLAPNLPYVKSLFLGWFEPLRDSIAAEQEVCKGN--FKISHAAYFNELSADMMAVVT 240

Query: 691  VPKKQLEPVFEALDTLGRTKWRVNKRVLSIIDRI--------WASGGRLADLVDRXTEGD 750
            + K          +T G    RV +    I + I        +    +  +  D+ +  +
Sbjct: 241  MHKLM---GLLMTNTAGTGGIRVVQAACQIGEAIENEARIQKFLEKTKKKNTTDKKSVTE 300

Query: 751  TEPVVEDQEKVAKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLI 810
            +EP   +Q K+AK ++KLRKKVT LMKKQK+ QVR IVK  D  KPWGQ+AHVKVGCRLI
Sbjct: 301  SEPETTEQGKLAKNEEKLRKKVTQLMKKQKVHQVREIVKGRDTSKPWGQEAHVKVGCRLI 360

Query: 811  QLLIETAYIQPPMDQLGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEK 870
            QLL+E AYIQPP+DQ+G GPPDIRPAFVH LK + K+  K SRRYGVIECDPLVR+GLEK
Sbjct: 361  QLLMENAYIQPPVDQIGDGPPDIRPAFVHALKNVIKDGNKGSRRYGVIECDPLVRKGLEK 420

Query: 871  TARHMVIPYMPMLVPPLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFE 930
            TA+HMVIPYMPMLVPP NWTGYD+GA+LFLPSYVMR HGAKQQRE VKR P+KQLEPVFE
Sbjct: 421  TAKHMVIPYMPMLVPPQNWTGYDQGAYLFLPSYVMRTHGAKQQRETVKRTPRKQLEPVFE 480

Query: 931  ALDTLGRTKWR---------------------------VPLPEEPTVEDEAEIRKWKWKL 990
            ALDTLG TKWR                           VPLPEEP  EDE EIRKWKWK+
Sbjct: 481  ALDTLGNTKWRINRRILGVVDRLWANGGRLADLVDREDVPLPEEPDTEDETEIRKWKWKV 540

Query: 991  KAAKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMC 1050
            KA KKEN+ERHSQRCD+ELKLAVARKMK+E GFYYPHNLDFRGRAYPMHP+LNHLGSD+C
Sbjct: 541  KAVKKENNERHSQRCDVELKLAVARKMKDEVGFYYPHNLDFRGRAYPMHPYLNHLGSDLC 600

Query: 1051 RGILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLE 1110
            RG+LEFAEGRPLG+SGLRWLKIHLANLYAGGVDKLSY+ R+ FTE+HLD+IFDSADRPLE
Sbjct: 601  RGVLEFAEGRPLGKSGLRWLKIHLANLYAGGVDKLSYEGRVEFTESHLDDIFDSADRPLE 660

Query: 1111 GNRWWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLS 1170
            G RWWL AEDPFQCLA CINLSEALRS  PE T+SHMPVHQDGSCNGLQHYAALGRDKL 
Sbjct: 661  GKRWWLSAEDPFQCLAACINLSEALRSSIPEATISHMPVHQDGSCNGLQHYAALGRDKLG 720

Query: 1171 L--------NNPA----GVNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTV 1230
                     + PA    G+ A VLDI++ DA++DPATNPNALHARLLINQVDRKLVKQTV
Sbjct: 721  AAAVNLVAGDKPADVYSGIAARVLDIMKRDAQEDPATNPNALHARLLINQVDRKLVKQTV 780

Query: 1231 MTSVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAA-------------------- 1290
            MTSVYGVTY+GARDQIK+RLKER +IADD+QLF ASCYAA                    
Sbjct: 781  MTSVYGVTYVGARDQIKRRLKERGAIADDTQLFVASCYAARTTLTALGEMFQAARSIMGW 840

Query: 1291 -----RVIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKT 1332
                 +VIASENQPVRW TPLGLPVVQPYRQLGRHLIKTSLQVL LQRETDKVM  RQ+T
Sbjct: 841  LGECAKVIASENQPVRWVTPLGLPVVQPYRQLGRHLIKTSLQVLTLQRETDKVMVKRQRT 900

BLAST of Cp4.1LG09g02310 vs. TrEMBL
Match: B9HIV5_POPTR (DNA-directed RNA polymerase OS=Populus trichocarpa GN=POPTR_0008s11200g PE=3 SV=2)

HSP 1 Score: 1183.7 bits (3061), Expect = 0.0e+00
Identity = 629/976 (64.45%), Positives = 722/976 (73.98%), Query Frame = 1

Query: 451  MWRNIVKRAASRRDTFFSRK--VIEDSISVEEIRSSRTLGSLKSCCLLEGSRGISFFHSR 510
            MWR + K + S++  F S      +DS  +E IRS      L S      SR I  F   
Sbjct: 1    MWRTLAKCSPSKQLKFPSNSSNFFKDSTFIENIRSPDAKKCLNSAFSFLCSRQIGVFPQN 60

Query: 511  EVGFAKSNFAHSSEPVAFYGVLNHAKGYATAAEAAI----SEEDLSGSEEIQELMEELSK 570
            +     S+F   ++P          KGYATAA A +     E DLSGS++ Q LME+++K
Sbjct: 61   DK-LCNSSFGDLTKPFDLSPFF--FKGYATAAAADVIPSNDESDLSGSDDFQGLMEQVNK 120

Query: 571  Q-DKVESHFK-QPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLA 630
               K+E  F+ Q KKMV GM +GKY IL++RQIKMETEAWE+AA+EYQE+L DMCEQKLA
Sbjct: 121  HFQKMEPQFRPQEKKMVAGMGIGKYAILKRRQIKMETEAWEQAAQEYQEMLEDMCEQKLA 180

Query: 631  PNLPYIKSLFLGWFEPLRDAIAADQEYDKG-------AHLF-LPSYVMRI---------- 690
            PNLPY+KSLFLGWFEPLRDAI A+QE  K        AH   LP+ +M +          
Sbjct: 181  PNLPYVKSLFLGWFEPLRDAIVAEQELCKRNLRVSHRAHFSDLPADMMAVITMHKLMGLL 240

Query: 691  ---HGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVNKRVLSIIDRIWASGGRLADLV 750
               +G       V+         V EA++  GR            I +      +  ++ 
Sbjct: 241  MTGNGGSASIRVVQAA-----SVVGEAIEHEGR------------IHKFLEKTKKRKNVE 300

Query: 751  DRXTEGDTEPVVEDQEKVAKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHV 810
             + +EG+++  +E+ EK++KEQ+KLRKKVT L+KKQK++QVR IVK HD  +PWGQ+ HV
Sbjct: 301  AKISEGESDAAIEEGEKLSKEQEKLRKKVTTLIKKQKVQQVRRIVKGHDDSRPWGQEEHV 360

Query: 811  KVGCRLIQLLIETAYIQPPMDQLGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPL 870
            KVG RLIQL+IETAYIQPP+DQ+G GPPDIRPAFVHTLKTITK+ QK+SRRYGVIECDPL
Sbjct: 361  KVGSRLIQLMIETAYIQPPIDQIGDGPPDIRPAFVHTLKTITKDTQKSSRRYGVIECDPL 420

Query: 871  VRRGLEKTARHMVIPYMPMLVPPLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKK 930
            VR+GLEK+ARHMVIPYMPMLVPPLNWTGYD+GAHLFLPSYVMRIHG+KQQR+AVKR  + 
Sbjct: 421  VRKGLEKSARHMVIPYMPMLVPPLNWTGYDQGAHLFLPSYVMRIHGSKQQRDAVKRASRN 480

Query: 931  QLEPVFEALDTLGRTKWRV---------------------------PLPEEPTVEDEAEI 990
            QLEPVF+ALDTLG TKWR+                           PLPEEP  EDEAEI
Sbjct: 481  QLEPVFKALDTLGNTKWRINKRVLVVVDRIWASGGHLAGLVDREDAPLPEEPQTEDEAEI 540

Query: 991  RKWKWKLKAAKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLN 1050
            RKW WK+++ KKENSERHSQRCDIELKLAVARKMK+EEGFYYPHNLDFRGRAYPMHP+LN
Sbjct: 541  RKWTWKVRSVKKENSERHSQRCDIELKLAVARKMKDEEGFYYPHNLDFRGRAYPMHPYLN 600

Query: 1051 HLGSDMCRGILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFD 1110
            HLGSD+CRGILEFAEGRPLG+SGLRWLKIHLANLYAGGVDKLSY  RISFTENHLD+IFD
Sbjct: 601  HLGSDVCRGILEFAEGRPLGKSGLRWLKIHLANLYAGGVDKLSYDGRISFTENHLDDIFD 660

Query: 1111 SADRPLEGNRWWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAA 1170
            SAD+PLEG RWWLGAEDPFQCLA CINLSEALRSPSPET  SH PVHQDGSCNGLQHYAA
Sbjct: 661  SADQPLEGRRWWLGAEDPFQCLAACINLSEALRSPSPETATSHTPVHQDGSCNGLQHYAA 720

Query: 1171 LGRDKLSL--------NNPA----GVNAIVLDIIRSDAEKDPATNPNALHARLLINQVDR 1230
            LGRDKL            PA    G+   VLDI++ DAEKDPA NPN++HA+LL+NQVDR
Sbjct: 721  LGRDKLGAAAVNLVGGEKPADVYSGIATRVLDIMQRDAEKDPAINPNSVHAKLLVNQVDR 780

Query: 1231 KLVKQTVMTSVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAA------------- 1290
            KLVKQTVMTSVYGVTY+GARDQIK+RLKER  IADD QL++A+CYAA             
Sbjct: 781  KLVKQTVMTSVYGVTYIGARDQIKRRLKERCIIADDPQLYSAACYAAKTTLMALEEMFEG 840

Query: 1291 ------------RVIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKV 1334
                        +VIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVL L+RETDKV
Sbjct: 841  ARGIMAWLGECAKVIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLTLKRETDKV 900

BLAST of Cp4.1LG09g02310 vs. TAIR10
Match: AT5G15700.2 (AT5G15700.2 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 1009.6 bits (2609), Expect = 1.9e-294
Identity = 546/964 (56.64%), Positives = 655/964 (67.95%), Query Frame = 1

Query: 451  MWRNIVKRAASRRDTFFSRK------VIEDSISVEEIRSSRTLGSLKSCCLLEGSRGISF 510
            MWRNI K+A SR     +        ++    S+     S     L S C  +G R +S 
Sbjct: 40   MWRNIAKQAISRSAARLNVSSQTRGLLVSSPESIFSKNLSFRFPVLGSPCHGKGFRCLSG 99

Query: 511  FHSREVGFAKSNFAHSSEPVAFYGVLNHAKGYATAAEAAISEEDLSGSEEIQELMEELSK 570
               RE  F+KS    S       G L  A+GY + AE  +   D+    E+ EL++E+ K
Sbjct: 100  ITRREE-FSKSERCLS-------GTL--ARGYTSVAEEEVLSTDVEEEPEVDELLKEMKK 159

Query: 571  QDKVESHFKQPKKMVD--GMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLA 630
            + K ESH     K  D  GM   K+  L +RQ+K+ETE WE AA EY ELLTDMCEQKLA
Sbjct: 160  EKKRESHRSWRMKKQDQFGMGRTKFQNLWRRQVKIETEEWERAAAEYMELLTDMCEQKLA 219

Query: 631  PNLPYIKSLFLGWFEPLRDAIAADQEYDKGAHLFLPSYVMRIHGAKQQ-REAVKRVPKKQ 690
            PNLPY+KSLFLGWFEPLRDAIA DQE            + R+  +K      + ++P  +
Sbjct: 220  PNLPYVKSLFLGWFEPLRDAIAKDQE------------LYRLGKSKATYAHYLDQLPADK 279

Query: 691  LEPVFEALDTLGRTKWRVNKRVLSIIDRIWASGG------RLADLVDRXTEGDT--EPVV 750
            +  V      +G      +   + ++      G       R+   +D+  +GD   E   
Sbjct: 280  IS-VITMHKLMGHLMTGGDNGCVKVVHAACTVGDAIEQEIRICTFLDKKKKGDDNEESGG 339

Query: 751  EDQEKVAKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIE 810
             + E   KEQDKLRKKV  L+KKQKL  VR I++ HD+ KPW  D   KVG RLI+LL+ 
Sbjct: 340  VENETSMKEQDKLRKKVNELIKKQKLSAVRKILQSHDYTKPWIADVRAKVGSRLIELLVR 399

Query: 811  TAYIQPPMDQLGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTARHM 870
            TAYIQ P DQ     PD+RPAFVHT K + K +  + R+YGVIECDPLVR+GLEK+ R+ 
Sbjct: 400  TAYIQSPADQQDNDLPDVRPAFVHTFK-VAKGSMNSGRKYGVIECDPLVRKGLEKSGRYA 459

Query: 871  VIPYMPMLVPPLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTL 930
            V+PYMPMLVPPL W+GYDKGA+LFL SY+M+ HGAKQQREA+K  PK QL+PVFEALDTL
Sbjct: 460  VMPYMPMLVPPLKWSGYDKGAYLFLTSYIMKTHGAKQQREALKSAPKGQLQPVFEALDTL 519

Query: 931  GRTKWRV---------------------------PLPEEPTVEDEAEIRKWKWKLKAAKK 990
            G TKWRV                           PLPE+P  EDE  ++KWKW++K+AKK
Sbjct: 520  GSTKWRVNKRVLTVVDRIWSSGGCVADMVDRSDVPLPEKPDTEDEGILKKWKWEVKSAKK 579

Query: 991  ENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGILE 1050
             NSERHSQRCD ELKL+VARKMK+EE FYYPHN+DFRGRAYPM PHLNHLGSD+CRG+LE
Sbjct: 580  VNSERHSQRCDTELKLSVARKMKDEEAFYYPHNMDFRGRAYPMPPHLNHLGSDLCRGVLE 639

Query: 1051 FAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGNRWW 1110
            FAEGRP+G SGLRWLKIHLANLYAGGVDKLS   R++FTENHLD+IFDSADRPLEG+RWW
Sbjct: 640  FAEGRPMGISGLRWLKIHLANLYAGGVDKLSLDGRLAFTENHLDDIFDSADRPLEGSRWW 699

Query: 1111 LGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSLN--- 1170
            L AEDPFQCLAVCI+L+EALRSPSPET LSH+P+HQDGSCNGLQHYAALGRD L      
Sbjct: 700  LQAEDPFQCLAVCISLTEALRSPSPETVLSHIPIHQDGSCNGLQHYAALGRDTLGAEAVN 759

Query: 1171 -----NPA----GVNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTSVY 1230
                  PA    G+   VLDI+R DA++DP   P AL AR L+NQVDRKLVKQTVMTSVY
Sbjct: 760  LVAGEKPADVYSGIATRVLDIMRRDADRDPEVFPEALRARKLLNQVDRKLVKQTVMTSVY 819

Query: 1231 GVTYMGARDQIKKRLKERASIADDSQLFAASCYAARV----------------------- 1290
            GVTY+GARDQIK+RLKER+   D+ ++F A+CYAA+V                       
Sbjct: 820  GVTYIGARDQIKRRLKERSDFGDEKEVFGAACYAAKVTLAAIDEMFQAARAIMRWFGECA 879

Query: 1291 --IASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFPPN 1334
              IASEN+ VRWTTPLGLPVVQPY Q+G  L+KTSLQ L+LQ ETD+V+  RQ+TAFPPN
Sbjct: 880  KIIASENETVRWTTPLGLPVVQPYHQMGTKLVKTSLQTLSLQHETDQVIVRRQRTAFPPN 939

BLAST of Cp4.1LG09g02310 vs. TAIR10
Match: AT1G68990.2 (AT1G68990.2 male gametophyte defective 3)

HSP 1 Score: 914.8 bits (2363), Expect = 6.5e-266
Identity = 453/677 (66.91%), Positives = 524/677 (77.40%), Query Frame = 1

Query: 728  DTEPVVEDQEKVAKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRL 787
            +TE     +E VAKE +K RK+VT LM+K KLRQV+ +V++HD  KPWGQ+A VKVG RL
Sbjct: 275  NTEAENVSEEIVAKETEKARKQVTVLMEKNKLRQVKALVRKHDSFKPWGQEAQVKVGARL 334

Query: 788  IQLLIETAYIQPPMDQLGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLE 847
            IQLL+E AYIQPP +Q   GPPDIRPAF    +T+T E  KTSRRYG IECDPLV +GL+
Sbjct: 335  IQLLMENAYIQPPAEQFDDGPPDIRPAFKQNFRTVTLENTKTSRRYGCIECDPLVLKGLD 394

Query: 848  KT-------ARHMVIPYMPMLVPPLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPK 907
            K+       ARHMVIPY+PML+PP NWTGYD+GAH FLPSYVMR HGAKQQR  +KR PK
Sbjct: 395  KSVSRIVDYARHMVIPYLPMLIPPQNWTGYDQGAHFFLPSYVMRTHGAKQQRTVMKRTPK 454

Query: 908  KQLEPVFEALDTLGRTKWR---------------------------VPLPEEPTVEDEAE 967
            +QLEPV+EALDTLG TKW+                           VP+PEEP  ED+ +
Sbjct: 455  EQLEPVYEALDTLGNTKWKINKKVLSLVDRIWANGGRIGGLVDREDVPIPEEPEREDQEK 514

Query: 968  IRKWKWKLKAAKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHL 1027
             + W+W+ K A K+N+ERHSQRCDIELKL VARKMK+EEGFYYPHN+DFRGRAYP+HP+L
Sbjct: 515  FKNWRWESKKAIKQNNERHSQRCDIELKLEVARKMKDEEGFYYPHNVDFRGRAYPIHPYL 574

Query: 1028 NHLGSDMCRGILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIF 1087
            NHLGSD+CRGILEF EG+PLG+SGLRWLKIH+ANLYAGGVDKL+Y+DRI+FTE+HL++IF
Sbjct: 575  NHLGSDLCRGILEFCEGKPLGKSGLRWLKIHIANLYAGGVDKLAYEDRIAFTESHLEDIF 634

Query: 1088 DSADRPLEGNRWWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYA 1147
            DS+DRPLEG RWWL AEDPFQCLA CINLSEALRSP PE  +SH+P+HQDGSCNGLQHYA
Sbjct: 635  DSSDRPLEGKRWWLNAEDPFQCLAACINLSEALRSPFPEAAISHIPIHQDGSCNGLQHYA 694

Query: 1148 ALGRDKLSLN--------NPAGV----NAIVLDIIRSDAEKDPATNPNALHARLLINQVD 1207
            ALGRDKL  +         PA V     A VL I++ DAE+DP T PNA +A+L+++QVD
Sbjct: 695  ALGRDKLGADAVNLVTGEKPADVYTEIAARVLKIMQQDAEEDPETFPNATYAKLMLDQVD 754

Query: 1208 RKLVKQTVMTSVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYA------------- 1267
            RKLVKQTVMTSVYGVTY GARDQIKKRLKER +  DDS  F ASCYA             
Sbjct: 755  RKLVKQTVMTSVYGVTYSGARDQIKKRLKERGTFEDDSLTFHASCYAAKITLKALEEMFE 814

Query: 1268 ------------ARVIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDK 1327
                        A++IASEN  V WTTPLGLPVVQPYR+ GRHL+KT+LQVL L RETDK
Sbjct: 815  AARAIKSWFGDCAKIIASENNAVCWTTPLGLPVVQPYRKPGRHLVKTTLQVLTLSRETDK 874

Query: 1328 VMATRQKTAFPPNFVHSLDGSHMMMTAVACRRAGLNFAGVHDSYWTHACDVDVMNRLLRE 1334
            VMA RQ TAF PNF+HSLDGSHMMMTAVAC RAGL+FAGVHDS+WTHACDVDVMN +LRE
Sbjct: 875  VMARRQMTAFAPNFIHSLDGSHMMMTAVACNRAGLSFAGVHDSFWTHACDVDVMNTILRE 934
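
The Identity and Positives percentages on each HSP summary line are the matching residue counts divided by the aligned length (for the hit above, 453/677 and 524/677). A minimal sketch of parsing such a line and recomputing the percentages follows; the function name `parse_hsp_stats` and the returned dictionary keys are illustrative assumptions and are not part of any BLAST tool or of this database.

```python
import re

# Matches the summary line printed under each "HSP n Score" header.
# "Posi?tives" also tolerates the "Postives" spelling found in some report dumps.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return reported and recomputed percentages for one HSP summary line.

    Hypothetical helper for illustration only.
    """
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positive": int(pos),
        "aligned_length": int(ident_len),
        "identity_reported": float(ident_pct),
        "positives_reported": float(pos_pct),
        # Percentages recomputed from the raw counts, rounded like the report.
        "identity_recomputed": round(100.0 * int(ident) / int(ident_len), 2),
        "positives_recomputed": round(100.0 * int(pos) / int(pos_len), 2),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 453/677 (66.91%), Positives = 524/677 (77.40%), Query Frame = 1"
    )
    print(stats)
```

Comparing the reported and recomputed values is a quick consistency check when these summary lines are copied between reports.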

BLAST of Cp4.1LG09g02310 vs. TAIR10
Match: AT2G24120.1 (AT2G24120.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 835.9 bits (2158), Expect = 3.8e-242
Identity = 445/851 (52.29%), Positives = 570/851 (66.98%), Query Frame = 1

Query: 559  MEELSKQDKVESHFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCE 618
            ++ LSK   V+   K  +K +D     K++ LR+RQ+K ETEAWE    EY++L  +MCE
Sbjct: 136  LKGLSKM--VDQTLKIERKDIDKR---KFDSLRRRQVKEETEAWERMVDEYRDLEKEMCE 195

Query: 619  QKLAPNLPYIKSLFLGWFEPLRDAIAADQEYDK----------GAHL-FLPSYVMRIHGA 678
            + LAPNLPY+K +FLGWF+PL+D I  +Q+  K            H+  LP+  M +   
Sbjct: 196  KNLAPNLPYVKHMFLGWFQPLKDVIEREQKLQKNKSKKVRAAYAPHIELLPADKMAVIVM 255

Query: 679  KQQREAVKRVPKKQLEPVFEALDTLGRTKWRVNKRVLSIIDRIWASGGRLADLVDRXTEG 738
             +    V    +     V +A  ++G          ++I   +     R+ + + R  + 
Sbjct: 256  HKMMGLVMSGHEDGCIQVVQAAVSIG----------IAIEQEV-----RIHNFLKRTRKN 315

Query: 739  DTEPVVEDQEKVAKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRL 798
            +      D ++  KE+  LRK+V +L++++++     +VK  +  KPWG+    K+G RL
Sbjct: 316  N----AGDSQEELKEKQLLRKRVNSLIRRKRIIDALKVVKS-EGTKPWGRATQAKLGSRL 375

Query: 799  IQLLIETAYIQPPMDQLGGGPPDIRPAFVHTLKTITK-EAQKTSRRYGVIECDPLVRRGL 858
            ++LLIE AY+QPP+ Q G   P+ RPAF H  KT+TK    K  RRYGVIECD L+  GL
Sbjct: 376  LELLIEAAYVQPPLTQSGDSIPEFRPAFRHRFKTVTKYPGSKLVRRYGVIECDSLLLAGL 435

Query: 859  EKTARHMVIPYMPMLVPPLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPV 918
            +K+A+HM+IPY+PMLVPP  W GYDKG +LFLPSY+MR HG+K+Q++A+K +  K    V
Sbjct: 436  DKSAKHMLIPYVPMLVPPKRWKGYDKGGYLFLPSYIMRTHGSKKQQDALKDISHKTAHRV 495

Query: 919  FEALDTLGRTKWRVP--------------------LPEEPTVEDEAEIRKWKWKLKAAKK 978
            FEALDTLG TKWRV                     +  E     E    +   +L++ K 
Sbjct: 496  FEALDTLGNTKWRVNRNILDVVERLWADGGNIAGLVNREDVPIPEKPSSEDPEELQSWKW 555

Query: 979  E-------NSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSD 1038
                    N ERHS RCD+ELKL+VARKMK+EEGFYYPHNLDFRGRAYPMHPHLNHL SD
Sbjct: 556  SARKANKINRERHSLRCDVELKLSVARKMKDEEGFYYPHNLDFRGRAYPMHPHLNHLSSD 615

Query: 1039 MCRGILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRP 1098
            +CRG LEFAEGRPLG+SGL WLKIHLANLYAGGV+KLS+  R++F ENHLD+I DSA+ P
Sbjct: 616  LCRGTLEFAEGRPLGKSGLHWLKIHLANLYAGGVEKLSHDARLAFVENHLDDIMDSAENP 675

Query: 1099 LEGNRWWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDK 1158
            + G RWWL AEDPFQCLA C+ L++AL+SPSP + +SH+P+HQDGSCNGLQHYAALGRD 
Sbjct: 676  IHGKRWWLKAEDPFQCLAACVILTQALKSPSPYSVISHLPIHQDGSCNGLQHYAALGRDS 735

Query: 1159 L---SLNNPAG---------VNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQ 1218
                ++N  AG         ++  V +I++ D+ KDP +NP A  A++LI QVDRKLVKQ
Sbjct: 736  FEAAAVNLVAGEKPADVYSEISRRVHEIMKKDSSKDPESNPTAALAKILITQVDRKLVKQ 795

Query: 1219 TVMTSVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAARV---------------- 1278
            TVMTSVYGVTY+GAR+QIK+RL+E+  I D+  LFAA+CY+A+V                
Sbjct: 796  TVMTSVYGVTYVGAREQIKRRLEEKGVITDERMLFAAACYSAKVTLAALGEIFEAARAIM 855

Query: 1279 ---------IASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQ 1334
                     IAS+N PVRW TPLGLPVVQPY +  RHLI+TSLQVLALQRE + V   +Q
Sbjct: 856  SWLGDCAKIIASDNHPVRWITPLGLPVVQPYCRSERHLIRTSLQVLALQREGNTVDVRKQ 915

BLAST of Cp4.1LG09g02310 vs. NCBI nr
Match: gi|659091310|ref|XP_008446483.1| (PREDICTED: DNA-directed RNA polymerase 1B, mitochondrial isoform X2 [Cucumis melo])

HSP 1 Score: 1268.1 bits (3280), Expect = 0.0e+00
Identity = 656/954 (68.76%), Positives = 748/954 (78.41%), Query Frame = 1

Query: 451  MWRNIVKRAASRRDTFFS-------RKVIEDSISVEEIRSSRTLGSLKSCCLLEGSRGIS 510
            MWRN++KRAASR+ T FS       RKV ED I +++IR SR+L +L +CC    S  IS
Sbjct: 1    MWRNVLKRAASRKATLFSELSSSTSRKVTEDHIFLDQIRYSRSLETLNTCCQSGDSSRIS 60

Query: 511  FFHSREVGFAKSNFAHSSEPVAFYGVLNHAKGYATAAEAAISEEDLSGSEEIQELMEELS 570
            F H ++VGF  SNF HSS PVAFYGVLNHA+GYATAAEAAISE DLSGSEEIQE+ME +S
Sbjct: 61   FLHPQKVGFTNSNFPHSSNPVAFYGVLNHAEGYATAAEAAISEGDLSGSEEIQEIMEGIS 120

Query: 571  KQDKVESHFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAP 630
            KQDKVE HFK+PK+MVD      Y++LRKRQIK+ETEAWEEAAREYQEL+ ++CEQKL+P
Sbjct: 121  KQDKVEPHFKKPKRMVDRKGEATYDMLRKRQIKIETEAWEEAAREYQELIAEICEQKLSP 180

Query: 631  NLPYIKSLFLGWFEPLRDAIAADQEYDKG-AHLFLPSYVMRIHGAKQQREAVKRVPKKQL 690
            NLPY+KSLFLGWF+P RDAI A+QE  K  +  F PS+    +       AV  +   +L
Sbjct: 181  NLPYMKSLFLGWFQPFRDAIVAEQESIKSKSKNFCPSHAPYFNLLPADMMAV--ITMHKL 240

Query: 691  EPVFEALDTLGRTKWRVNKRVLSIIDRIWASGGRLADLVDRXTEGDTEPVVEDQEKVAKE 750
              V  + D  G    +V +    I + I  +  R+    ++  +          EK+A++
Sbjct: 241  VGVMMS-DFEGNGIVKVTQAATHIGEAI-ENEVRIRHFFEKKKQ---------PEKLAED 300

Query: 751  QDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMD 810
            QDKLRKKVT LMK+QKL++V  IVK+HD LKPWG D HVKVGC LI+ LIETAYIQPP+D
Sbjct: 301  QDKLRKKVTKLMKQQKLQKVNFIVKKHDDLKPWGTDVHVKVGCTLIKFLIETAYIQPPVD 360

Query: 811  QLGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTARHMVIPYMPMLV 870
            QLG  PPD+RPAFVH LKT  KE QK  RRYGVIECDPLVRRG+ KTA HMVIPYMPMLV
Sbjct: 361  QLGDAPPDLRPAFVHYLKTSPKETQKLGRRYGVIECDPLVRRGMVKTAGHMVIPYMPMLV 420

Query: 871  PPLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRV-- 930
            PPLNWTGYDKGA+ FLPSYVMRI GAKQQREAV+R P+ QLEPVF+ALD LG TKWRV  
Sbjct: 421  PPLNWTGYDKGAYFFLPSYVMRIRGAKQQREAVRRAPRTQLEPVFKALDILGSTKWRVNK 480

Query: 931  -------------------------PLPEEPTVEDEAEIRKWKWKLKAAKKENSERHSQR 990
                                     PLPEEP+VEDEAEIRKWKWK+KA KKENSERHSQR
Sbjct: 481  RVLSIIERIWASGGRLADLVDREDLPLPEEPSVEDEAEIRKWKWKVKAVKKENSERHSQR 540

Query: 991  CDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGILEFAEGRPLGE 1050
            CD+ELKLAVARKMK+EEGFYYPHNLDFRGRAYPMHP+LNH+GSD+CRGILEFAEGRPLGE
Sbjct: 541  CDVELKLAVARKMKDEEGFYYPHNLDFRGRAYPMHPNLNHIGSDLCRGILEFAEGRPLGE 600

Query: 1051 SGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGNRWWLGAEDPFQC 1110
            SGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEG+RWWLGAEDPFQC
Sbjct: 601  SGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGSRWWLGAEDPFQC 660

Query: 1111 LAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSL--------NNPA- 1170
            LAVCINLSEALRSPSPETT+SH+P+HQDGSCNGLQHYAALGRDKL          + PA 
Sbjct: 661  LAVCINLSEALRSPSPETTISHLPIHQDGSCNGLQHYAALGRDKLGAEAVNLAAGDKPAD 720

Query: 1171 ---GVNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTSVYGVTYMGARD 1230
               G+ + VLDIIRSDA KDPA+ PNALHA+LLINQVDRKLVKQTVMTSVYGVT +GA+D
Sbjct: 721  VYSGIASRVLDIIRSDALKDPASYPNALHAKLLINQVDRKLVKQTVMTSVYGVTLVGAKD 780

Query: 1231 QIKKRLKERASIADDSQLFAASCYA-------------------------ARVIASENQP 1290
            QI +RLKERASI ++ QLF+ASCYA                         A+VIASEN+ 
Sbjct: 781  QISQRLKERASIGNERQLFSASCYAAKTTLTAIGEMFEEAKSIMNWLGECAKVIASENKD 840

Query: 1291 VRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFPPNFVHSLDGSH 1333
            VRWTTPLGLPVVQPYR+ GRH++KTSLQVL+LQRETDKVMA RQK+AFPPNF+HSLD SH
Sbjct: 841  VRWTTPLGLPVVQPYRKPGRHIVKTSLQVLSLQRETDKVMAARQKSAFPPNFIHSLDSSH 900

BLAST of Cp4.1LG09g02310 vs. NCBI nr
Match: gi|659091308|ref|XP_008446482.1| (PREDICTED: DNA-directed RNA polymerase 1B, mitochondrial isoform X1 [Cucumis melo])

HSP 1 Score: 1263.1 bits (3267), Expect = 0.0e+00
Identity = 656/956 (68.62%), Positives = 748/956 (78.24%), Query Frame = 1

Query: 451  MWRNIVKRAASRRDTFFS-------RKVIEDSISVEEIRSSRTLGSLKSCCLLEGSRGIS 510
            MWRN++KRAASR+ T FS       RKV ED I +++IR SR+L +L +CC    S  IS
Sbjct: 1    MWRNVLKRAASRKATLFSELSSSTSRKVTEDHIFLDQIRYSRSLETLNTCCQSGDSSRIS 60

Query: 511  FFHSREVGFAKSNFAHSSEPVAFYGVLNHAKGYATAAEAAISEEDLSGSEEIQELMEELS 570
            F H ++VGF  SNF HSS PVAFYGVLNHA+GYATAAEAAISE DLSGSEEIQE+ME +S
Sbjct: 61   FLHPQKVGFTNSNFPHSSNPVAFYGVLNHAEGYATAAEAAISEGDLSGSEEIQEIMEGIS 120

Query: 571  KQDKVESHFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAP 630
            KQDKVE HFK+PK+MVD      Y++LRKRQIK+ETEAWEEAAREYQEL+ ++CEQKL+P
Sbjct: 121  KQDKVEPHFKKPKRMVDRKGEATYDMLRKRQIKIETEAWEEAAREYQELIAEICEQKLSP 180

Query: 631  NLPYIKSLFLGWFEPLRDAIAADQEYDKG-AHLFLPSYVMRIHGAKQQREAVKRVPKKQL 690
            NLPY+KSLFLGWF+P RDAI A+QE  K  +  F PS+    +       AV  +   +L
Sbjct: 181  NLPYMKSLFLGWFQPFRDAIVAEQESIKSKSKNFCPSHAPYFNLLPADMMAV--ITMHKL 240

Query: 691  EPVFEALDTLGRTKWRVNKRVLSIIDRIWASGGRLADLVDRXTEGDTEPVVEDQEKVAKE 750
              V  + D  G    +V +    I + I  +  R+    ++  +          EK+A++
Sbjct: 241  VGVMMS-DFEGNGIVKVTQAATHIGEAI-ENEVRIRHFFEKKKQ---------PEKLAED 300

Query: 751  QDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMD 810
            QDKLRKKVT LMK+QKL++V  IVK+HD LKPWG D HVKVGC LI+ LIETAYIQPP+D
Sbjct: 301  QDKLRKKVTKLMKQQKLQKVNFIVKKHDDLKPWGTDVHVKVGCTLIKFLIETAYIQPPVD 360

Query: 811  QLGGGPPDIRPAFVHTLKTITKEAQ--KTSRRYGVIECDPLVRRGLEKTARHMVIPYMPM 870
            QLG  PPD+RPAFVH LKT  KE Q  K  RRYGVIECDPLVRRG+ KTA HMVIPYMPM
Sbjct: 361  QLGDAPPDLRPAFVHYLKTSPKETQLTKLGRRYGVIECDPLVRRGMVKTAGHMVIPYMPM 420

Query: 871  LVPPLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRV 930
            LVPPLNWTGYDKGA+ FLPSYVMRI GAKQQREAV+R P+ QLEPVF+ALD LG TKWRV
Sbjct: 421  LVPPLNWTGYDKGAYFFLPSYVMRIRGAKQQREAVRRAPRTQLEPVFKALDILGSTKWRV 480

Query: 931  ---------------------------PLPEEPTVEDEAEIRKWKWKLKAAKKENSERHS 990
                                       PLPEEP+VEDEAEIRKWKWK+KA KKENSERHS
Sbjct: 481  NKRVLSIIERIWASGGRLADLVDREDLPLPEEPSVEDEAEIRKWKWKVKAVKKENSERHS 540

Query: 991  QRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGILEFAEGRPL 1050
            QRCD+ELKLAVARKMK+EEGFYYPHNLDFRGRAYPMHP+LNH+GSD+CRGILEFAEGRPL
Sbjct: 541  QRCDVELKLAVARKMKDEEGFYYPHNLDFRGRAYPMHPNLNHIGSDLCRGILEFAEGRPL 600

Query: 1051 GESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGNRWWLGAEDPF 1110
            GESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEG+RWWLGAEDPF
Sbjct: 601  GESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGSRWWLGAEDPF 660

Query: 1111 QCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSL--------NNP 1170
            QCLAVCINLSEALRSPSPETT+SH+P+HQDGSCNGLQHYAALGRDKL          + P
Sbjct: 661  QCLAVCINLSEALRSPSPETTISHLPIHQDGSCNGLQHYAALGRDKLGAEAVNLAAGDKP 720

Query: 1171 A----GVNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTSVYGVTYMGA 1230
            A    G+ + VLDIIRSDA KDPA+ PNALHA+LLINQVDRKLVKQTVMTSVYGVT +GA
Sbjct: 721  ADVYSGIASRVLDIIRSDALKDPASYPNALHAKLLINQVDRKLVKQTVMTSVYGVTLVGA 780

Query: 1231 RDQIKKRLKERASIADDSQLFAASCYA-------------------------ARVIASEN 1290
            +DQI +RLKERASI ++ QLF+ASCYA                         A+VIASEN
Sbjct: 781  KDQISQRLKERASIGNERQLFSASCYAAKTTLTAIGEMFEEAKSIMNWLGECAKVIASEN 840

Query: 1291 QPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFPPNFVHSLDG 1333
            + VRWTTPLGLPVVQPYR+ GRH++KTSLQVL+LQRETDKVMA RQK+AFPPNF+HSLD 
Sbjct: 841  KDVRWTTPLGLPVVQPYRKPGRHIVKTSLQVLSLQRETDKVMAARQKSAFPPNFIHSLDS 900

BLAST of Cp4.1LG09g02310 vs. NCBI nr
Match: gi|778705699|ref|XP_004135424.2| (PREDICTED: DNA-directed RNA polymerase 1B, mitochondrial [Cucumis sativus])

HSP 1 Score: 1254.2 bits (3244), Expect = 0.0e+00
Identity = 651/953 (68.31%), Positives = 738/953 (77.44%), Query Frame = 1

Query: 451  MWRNIVKRAASRRDTFFS-------RKVIEDSISVEEIRSSRTLGSLKSCCLLEGSRGIS 510
            MWRN+ K AASR+   FS       RKV ED I +++IRSSR+L +L +CC   GS  I 
Sbjct: 1    MWRNVFKTAASRKAKLFSEFSSSTSRKVTEDHIFLDQIRSSRSLETLNTCCQSGGSSRII 60

Query: 511  FFHSREVGFAKSNFAHSSEPVAFYGVLNHAKGYATAAEAAISEEDLSGSEEIQELMEELS 570
            F H ++VGF  SNF HSS PV FYGVL HAKGYATAAEAAI E DLSGSEEIQE+ME L+
Sbjct: 61   FLHPQKVGFTNSNFPHSSNPVPFYGVLYHAKGYATAAEAAIFEGDLSGSEEIQEIMEGLN 120

Query: 571  KQDKVESHFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAP 630
            KQDKVE HFKQPK MVDG R  KY++LRKRQIK+ETEAWEEAA+EYQ+L+ D+CEQKLAP
Sbjct: 121  KQDKVELHFKQPKGMVDGNRDTKYDMLRKRQIKIETEAWEEAAKEYQDLIADICEQKLAP 180

Query: 631  NLPYIKSLFLGWFEPLRDAIAADQEYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLE 690
            NLPY+KSLFLGWF+PLRDAI A+QE  K      PS+ +  H       AV  +   +L 
Sbjct: 181  NLPYMKSLFLGWFQPLRDAIVAEQESVKFKRSS-PSHALYFHLLPADMMAV--ITMHKLM 240

Query: 691  PVFEALDTLGRTKWRVNKRVLSIIDRIWASGGRLADLVDRXTEGDTEPVVEDQEKVAKEQ 750
             +  + D  G    RV +    I + I  +  R+ +  ++  +          E++A+  
Sbjct: 241  GLLMS-DIEGGGSVRVTQAASGIGEAI-ENEVRIRNFFEKTKK--------QPEQLAEGH 300

Query: 751  DKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQ 810
            DKLRKK+T LMK+QKL++V  IVK HD  KPWG DAHVKVGCRLIQLLIETAYIQPP+DQ
Sbjct: 301  DKLRKKLTKLMKQQKLQKVNFIVKNHDDSKPWGTDAHVKVGCRLIQLLIETAYIQPPVDQ 360

Query: 811  LGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTARHMVIPYMPMLVP 870
            +G  PPD+RPAFVH+LKT  KE+Q+  +RYGVIECDPLV RG+ KTA HM+IPYMPMLVP
Sbjct: 361  IGDAPPDLRPAFVHSLKTSLKESQRLGKRYGVIECDPLVYRGMVKTAGHMIIPYMPMLVP 420

Query: 871  PLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRV--- 930
            P  WTGYD+GAH FLPSYVMRI GA+QQREAVKR  KKQL PVF+ALD LG TKWRV   
Sbjct: 421  PRKWTGYDQGAHFFLPSYVMRIRGARQQREAVKRASKKQLGPVFKALDILGSTKWRVNKR 480

Query: 931  ------------------------PLPEEPTVEDEAEIRKWKWKLKAAKKENSERHSQRC 990
                                    PLPE+P VEDEAEIR WKWK+KA K+ENSERHSQRC
Sbjct: 481  VLSVIEKIWASGGRLADLVDREDMPLPEQPMVEDEAEIRNWKWKVKAVKRENSERHSQRC 540

Query: 991  DIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGILEFAEGRPLGES 1050
            D ELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNH+GSD CRG LEFAEGRPLGES
Sbjct: 541  DTELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHIGSDFCRGTLEFAEGRPLGES 600

Query: 1051 GLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGNRWWLGAEDPFQCL 1110
            GLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEG+RWWLGAEDPFQCL
Sbjct: 601  GLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGSRWWLGAEDPFQCL 660

Query: 1111 AVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSL--------NNPA-- 1170
            AVCI+LSEALRSPSPETT+SHMPVHQDGSCNGLQHYAALGRDKL          + PA  
Sbjct: 661  AVCIDLSEALRSPSPETTISHMPVHQDGSCNGLQHYAALGRDKLGAEAVNLAAGDKPADV 720

Query: 1171 --GVNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTSVYGVTYMGARDQ 1230
              G+ + VLDI+RSDA KDPA+NPNALHARLLINQVDRKLVKQTVMTSVYGVTY GA+DQ
Sbjct: 721  YSGIASRVLDIMRSDAAKDPASNPNALHARLLINQVDRKLVKQTVMTSVYGVTYAGAKDQ 780

Query: 1231 IKKRLKERASIADDSQLFAASCYA-------------------------ARVIASENQPV 1290
            I++RLKER+SI ++  LF ASCYA                         A+VIASENQ V
Sbjct: 781  IRQRLKERSSIENERHLFTASCYAAKTTLTAIGEMFEAAKSIMNWLGECAKVIASENQAV 840

Query: 1291 RWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFPPNFVHSLDGSHM 1333
            RWTTPLGLPVVQPYR+LGRHL+KTSLQ+L+LQRETDKVMA RQ+TAFPPN++HSLD SHM
Sbjct: 841  RWTTPLGLPVVQPYRKLGRHLVKTSLQMLSLQRETDKVMAMRQRTAFPPNYIHSLDSSHM 900

BLAST of Cp4.1LG09g02310 vs. NCBI nr
Match: gi|596284970|ref|XP_007225383.1| (hypothetical protein PRUPE_ppa000780mg [Prunus persica])

HSP 1 Score: 1226.5 bits (3172), Expect = 0.0e+00
Identity = 641/979 (65.47%), Positives = 738/979 (75.38%), Query Frame = 1

Query: 451  MWRNIVKRAASRRDTFFSRKVI-----------EDSISVEEIRSSRTLGSLKSCCLLEGS 510
            MWRN+ K+ ASR+    S+              ++S  +++ R       + +  L+ G 
Sbjct: 1    MWRNLAKQVASRKTNLSSQSHFGSPSSTSMIFSQESSFLDKARHFEARKCINNRILVMGF 60

Query: 511  RGISFFHSREVGFAKSNFAHSSEPVAFYGVLNHAKGYATAAEAAIS---EEDLSGSEEIQ 570
            R +    S++    + +  + S P    G  N+AKGYA+ AEA  S   EED SGSEEIQ
Sbjct: 61   RQVGDMASQKEELGRCSSLNPSYPYGISGFCNYAKGYASVAEAIASTDGEEDSSGSEEIQ 120

Query: 571  ELMEELSKQDK-VESHFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTD 630
            E++E+L +++  VESHFKQPK++V GM VGKYN+LRKRQIK+ETEAW+EAA+EYQELL D
Sbjct: 121  EMLEDLIRENNMVESHFKQPKRVVVGMGVGKYNLLRKRQIKLETEAWQEAAKEYQELLAD 180

Query: 631  MCEQKLAPNLPYIKSLFLGWFEPLRDAIAADQEYDKGAHLFLPSYVMRIHGAKQQREAVK 690
            MCEQKLAPNLPYIKSLFLGWFEPLRDAIAADQ+  K  +        + H          
Sbjct: 181  MCEQKLAPNLPYIKSLFLGWFEPLRDAIAADQDSCKQPNS------RQSHAPYFDHLPAD 240

Query: 691  RVPKKQLEPVFEALDTL--GRTKWRVNKRVLSI---------IDRIWASGGRLADLVDRX 750
            ++    +  +   L T   G    RV +   +I         I R      +  + +D+ 
Sbjct: 241  KMAVITMHKLMGLLMTNNGGIGSVRVVQAACAIGEAIEHEVRIHRFLEKTKKKKNTIDKK 300

Query: 751  TEGDTEPVV-------EDQEKVAKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQ 810
             E D+ PV        ++QEK+ KEQ++LRKKV  L+K+QK++QVR IVKE + LKPWGQ
Sbjct: 301  AEADSVPVTIEQEKLADEQEKLTKEQERLRKKVNKLIKRQKMQQVRGIVKEQEDLKPWGQ 360

Query: 811  DAHVKVGCRLIQLLIETAYIQPPMDQLGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIE 870
            +AHVKVGCRLIQLL++TAYIQPP+DQ+G GPPDIRPAFVH LKTIT++ QKTSRRYGVIE
Sbjct: 361  EAHVKVGCRLIQLLMDTAYIQPPVDQIGDGPPDIRPAFVHNLKTITRDTQKTSRRYGVIE 420

Query: 871  CDPLVRRGLEKTARHMVIPYMPMLVPPLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKR 930
            CDP+VR+G+EKTARHMV+PYMPMLVPP+NWTGYD+GA+LFLPSYVMR HGAKQQRE VKR
Sbjct: 421  CDPIVRKGMEKTARHMVMPYMPMLVPPINWTGYDRGAYLFLPSYVMRTHGAKQQREVVKR 480

Query: 931  VPKKQLEPVFEALDTLGRTKWRV---------------------------PLPEEPTVED 990
             P+KQLEPVFEALDTLG TKWRV                           PLPEEP  ED
Sbjct: 481  TPRKQLEPVFEALDTLGSTKWRVNKRVLGVIDRIWASGGRLADLVDREDVPLPEEPDTED 540

Query: 991  EAEIRKWKWKLKAAKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMH 1050
            EAEIRKWKWKLKAAKKENSERHSQRCDIELKLA +RKMK+EEGFYYPHNLDFRGRAYPMH
Sbjct: 541  EAEIRKWKWKLKAAKKENSERHSQRCDIELKLAASRKMKDEEGFYYPHNLDFRGRAYPMH 600

Query: 1051 PHLNHLGSDMCRGILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLD 1110
            P+LNHLGSDMCRGILEF+EGR LG+SGLRWLKIHLANLYAGGVDKLS+ DR +FTENH+D
Sbjct: 601  PYLNHLGSDMCRGILEFSEGRHLGKSGLRWLKIHLANLYAGGVDKLSFDDRAAFTENHVD 660

Query: 1111 EIFDSADRPLEGNRWWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQ 1170
            EIFDSADRPLEG RWWLGAEDPFQCLA CINL EALRSPSPETT+S+MPVHQDGSCNGLQ
Sbjct: 661  EIFDSADRPLEGRRWWLGAEDPFQCLAACINLCEALRSPSPETTISYMPVHQDGSCNGLQ 720

Query: 1171 HYAALGRDKLSL--------NNPA----GVNAIVLDIIRSDAEKDPATNPNALHARLLIN 1230
            HYAALGRDKL          + PA    G+ A VLDI+R+DAEKDPATNPNALHARLLIN
Sbjct: 721  HYAALGRDKLGAAAVNLVGGDKPADVYSGIAARVLDIMRNDAEKDPATNPNALHARLLIN 780

Query: 1231 QVDRKLVKQTVMTSVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAA--------- 1290
            QVDRKLVKQTVMTSVYGVTY+GARDQIK+RLKER SIADD+ LFAA+CYAA         
Sbjct: 781  QVDRKLVKQTVMTSVYGVTYVGARDQIKRRLKERGSIADDTALFAAACYAARTTLTALGE 840

Query: 1291 ----------------RVIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRE 1333
                            +VIASENQPVRW TPLGLPVVQPYRQLGRHLIKTSLQVL LQRE
Sbjct: 841  MFEAARSIMSWLGECAKVIASENQPVRWITPLGLPVVQPYRQLGRHLIKTSLQVLTLQRE 900

BLAST of Cp4.1LG09g02310 vs. NCBI nr
Match: gi|645277464|ref|XP_008243783.1| (PREDICTED: DNA-directed RNA polymerase 1, mitochondrial [Prunus mume])

HSP 1 Score: 1221.8 bits (3160), Expect = 0.0e+00
Identity = 642/979 (65.58%), Positives = 733/979 (74.87%), Query Frame = 1

Query: 451  MWRNIVKRAASRRDTFFSRKVIEDSISVEEIRSSRTL-----------GSLKSCCLLEGS 510
            MWRN+ K+ ASR+    S+       S   I S  +              + +  L+ G 
Sbjct: 1    MWRNLAKQVASRKTNLSSQSHFSSPSSTSMIFSQESSFLEKAWHFEARKCINNRILVMGF 60

Query: 511  RGISFFHSREVGFAKSNFAHSSEPVAFYGVLNHAKGYATAAEAAIS---EEDLSGSEEIQ 570
            R +    S++    + +  + S P    G  N+AKGYA+ AEA  S   EED SGSEEIQ
Sbjct: 61   RQVGDMASQKDELGRCSSLNPSYPYGISGFCNYAKGYASVAEAIASTDGEEDSSGSEEIQ 120

Query: 571  ELMEELSKQDK-VESHFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTD 630
            E++E+L +++  VESHFKQPK++V GM VGKYN+LRKRQIK+ETEAW+EAA+EYQELL D
Sbjct: 121  EMLEDLIRENNMVESHFKQPKRVVVGMGVGKYNLLRKRQIKLETEAWQEAAKEYQELLAD 180

Query: 631  MCEQKLAPNLPYIKSLFLGWFEPLRDAIAADQEYDKGAHLFLPSYVMRIHGAKQQREAVK 690
            MCEQKLAPNLPYIKSLFLGWFEPLRDAIAADQ+  K  +        + H          
Sbjct: 181  MCEQKLAPNLPYIKSLFLGWFEPLRDAIAADQDSCKQTNS------RQSHAPYFDHLPAD 240

Query: 691  RVPKKQLEPVFEALDTL--GRTKWRVNKRVLSI---------IDRIWASGGRLADLVDRX 750
            ++    +  +   L T   G    RV +   +I         I R      +  +  D+ 
Sbjct: 241  KMAVITMHKLMGLLMTNNGGVGSVRVVQAACAIGEAIEHEVRIHRFLEKTKKKKNTTDKK 300

Query: 751  TEGDTEPVVEDQEKVAKEQ-------DKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQ 810
             E D+ PV ++QEK+A EQ       +KLRKKV  L+K+QK++QVR IVKE + LKPWGQ
Sbjct: 301  AEADSVPVTDEQEKLADEQGKLADEQEKLRKKVNKLIKRQKMQQVRGIVKEQEDLKPWGQ 360

Query: 811  DAHVKVGCRLIQLLIETAYIQPPMDQLGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIE 870
            +AHVKVGCRLIQLL++TAYIQPP+DQ+G GPPDIRPAFVH LKTITK+ QKTSRRYGVIE
Sbjct: 361  EAHVKVGCRLIQLLMDTAYIQPPVDQIGDGPPDIRPAFVHNLKTITKDTQKTSRRYGVIE 420

Query: 871  CDPLVRRGLEKTARHMVIPYMPMLVPPLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKR 930
            CDPLVR+G++KTARHMV+PYMPMLVPP+NWTGYD+GA+LFLPSYVMR HGAKQQRE VKR
Sbjct: 421  CDPLVRKGMDKTARHMVMPYMPMLVPPINWTGYDRGAYLFLPSYVMRTHGAKQQREVVKR 480

Query: 931  VPKKQLEPVFEALDTLGRTKWRV---------------------------PLPEEPTVED 990
             P+KQLEPVFEALDTLG TKWRV                           PLPEEP  ED
Sbjct: 481  TPRKQLEPVFEALDTLGSTKWRVNKRVLGVIDRIWASGGRLADLVDREDVPLPEEPDTED 540

Query: 991  EAEIRKWKWKLKAAKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMH 1050
            EAEIRKWKWKLKAAKKENSERHSQRCDIELKLA +RKMK+EEGFYYPHNLDFRGRAYPMH
Sbjct: 541  EAEIRKWKWKLKAAKKENSERHSQRCDIELKLAASRKMKDEEGFYYPHNLDFRGRAYPMH 600

Query: 1051 PHLNHLGSDMCRGILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLD 1110
            P+LNHLGSDMCRGILEF+EGR LG+SGLRWLKIHLANLYAGGVDKLS+ DR +FTENH+D
Sbjct: 601  PYLNHLGSDMCRGILEFSEGRHLGKSGLRWLKIHLANLYAGGVDKLSFDDRAAFTENHVD 660

Query: 1111 EIFDSADRPLEGNRWWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQ 1170
            EIFDSADRPLEG RWWLGAEDPFQCLA CINL EALRSPSPETT+S+MPVHQDGSCNGLQ
Sbjct: 661  EIFDSADRPLEGRRWWLGAEDPFQCLAACINLCEALRSPSPETTISYMPVHQDGSCNGLQ 720

Query: 1171 HYAALGRDKLSL--------NNPA----GVNAIVLDIIRSDAEKDPATNPNALHARLLIN 1230
            HYAALGRDKL          + PA    G+ A VLDI+R+DAEKDPATNPNALHARLLIN
Sbjct: 721  HYAALGRDKLGAAAVNLVGGDKPADVYSGIAARVLDIMRNDAEKDPATNPNALHARLLIN 780

Query: 1231 QVDRKLVKQTVMTSVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAA--------- 1290
            QVDRKLVKQTVMTSVYGVTY+GAR+QIK+RLKER SIADD+ LF A+CYAA         
Sbjct: 781  QVDRKLVKQTVMTSVYGVTYVGAREQIKRRLKERGSIADDTALFVAACYAARTTLTALGE 840

Query: 1291 ----------------RVIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRE 1333
                            +VIASENQPVRW TPLGLPVVQPYRQLGRHLIKTSLQVL LQRE
Sbjct: 841  MFEAARSIMSWLGECAKVIASENQPVRWITPLGLPVVQPYRQLGRHLIKTSLQVLTLQRE 900

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
RPO1B_TOBAC	0.0e+00	65.80	DNA-directed RNA polymerase 1B, mitochondrial OS=Nicotiana tabacum GN=RPOT1-TOM ...
RPOT1_NICSY	0.0e+00	65.23	DNA-directed RNA polymerase 1, mitochondrial OS=Nicotiana sylvestris GN=RPOT1 PE...
RPOT2_NICSY	0.0e+00	59.30	DNA-directed RNA polymerase 2, chloroplastic/mitochondrial OS=Nicotiana sylvestr...
RPO2B_TOBAC	2.8e-312	58.74	DNA-directed RNA polymerase 2B, chloroplastic/mitochondrial OS=Nicotiana tabacum...
RPOT2_ARATH	3.4e-293	56.64	DNA-directed RNA polymerase 2, chloroplastic/mitochondrial OS=Arabidopsis thalia...

Match Name	E-value	Identity	Description
A0A0A0KR04_CUCSA	0.0e+00	68.31	DNA-directed RNA polymerase OS=Cucumis sativus GN=Csa_5G607420 PE=3 SV=1
M5XKH1_PRUPE	0.0e+00	65.47	DNA-directed RNA polymerase OS=Prunus persica GN=PRUPE_ppa000780mg PE=3 SV=1
A0A0D2T267_GOSRA	0.0e+00	64.81	DNA-directed RNA polymerase OS=Gossypium raimondii GN=B456_008G174700 PE=3 SV=1
A0A0D2T7T6_GOSRA	0.0e+00	64.74	DNA-directed RNA polymerase OS=Gossypium raimondii GN=B456_008G174700 PE=3 SV=1
B9HIV5_POPTR	0.0e+00	64.45	DNA-directed RNA polymerase OS=Populus trichocarpa GN=POPTR_0008s11200g PE=3 SV=...

Match Name	E-value	Identity	Description
AT5G15700.2	1.9e-294	56.64	DNA/RNA polymerases superfamily protein
AT1G68990.2	6.5e-266	66.91	male gametophyte defective 3
AT2G24120.1	3.8e-242	52.29	DNA/RNA polymerases superfamily protein

Match Name	E-value	Identity	Description
gi|659091310|ref|XP_008446483.1|	0.0e+00	68.76	PREDICTED: DNA-directed RNA polymerase 1B, mitochondrial isoform X2 [Cucumis mel...
gi|659091308|ref|XP_008446482.1|	0.0e+00	68.62	PREDICTED: DNA-directed RNA polymerase 1B, mitochondrial isoform X1 [Cucumis mel...
gi|778705699|ref|XP_004135424.2|	0.0e+00	68.31	PREDICTED: DNA-directed RNA polymerase 1B, mitochondrial [Cucumis sativus]
gi|596284970|ref|XP_007225383.1|	0.0e+00	65.47	hypothetical protein PRUPE_ppa000780mg [Prunus persica]
gi|645277464|ref|XP_008243783.1|	0.0e+00	65.58	PREDICTED: DNA-directed RNA polymerase 1, mitochondrial [Prunus mume]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term	Definition
GO:0003899	DNA-directed RNA polymerase activity
GO:0003677	DNA binding
Vocabulary: Biological Process
Term	Definition
GO:0006351	transcription, DNA-templated
Vocabulary: INTERPRO
Term	Definition
IPR024075	DNA-dir_RNA_pol_helix_hairp_sf
IPR002092	DNA-dir_Rpol_phage-type
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006144	purine nucleobase metabolic process
biological_process	GO:0006206	pyrimidine nucleobase metabolic process
biological_process	GO:0006351	transcription, DNA-templated
biological_process	GO:0009793	embryo development ending in seed dormancy
biological_process	GO:0048481	plant ovule development
biological_process	GO:0009860	pollen tube growth
biological_process	GO:0016567	protein ubiquitination
biological_process	GO:0006390	transcription from mitochondrial promoter
cellular_component	GO:0005730	nucleolus
cellular_component	GO:0034245	mitochondrial DNA-directed RNA polymerase complex
molecular_function	GO:0003677	DNA binding
molecular_function	GO:0003899	DNA-directed RNA polymerase activity
molecular_function	GO:0004842	ubiquitin-protein transferase activity

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG09g02310.1	Cp4.1LG09g02310.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR002092	DNA-directed RNA polymerase, phage-type	PANTHER	PTHR10102	DNA-DIRECTED RNA POLYMERASE, MITOCHONDRIAL	coord: 807..1333, score: 0.0; coord: 132..410, score:
IPR002092	DNA-directed RNA polymerase, phage-type	PFAM	PF00940	RNA_pol	coord: 1215..1367, score: 3.6E-50; coord: 1011..1213, score: 3.4
IPR002092	DNA-directed RNA polymerase, phage-type	PROSITE	PS00489	RNA_POL_PHAGE_2	coord: 1160..1174, score:
IPR002092	DNA-directed RNA polymerase, phage-type	PROSITE	PS00900	RNA_POL_PHAGE_1	coord: 1099..1110, score:
IPR024075	DNA-directed RNA polymerase, helix hairpin domain	GENE3D	G3DSA:1.10.287.260		coord: 695..751, score: 4.9E-9; coord: 919..971, score: 2.0
None	No IPR available	unknown	Coil	Coil	coord: 189..209, score: -; coord: 592..612, score:
None	No IPR available	GENE3D	G3DSA:1.10.150.20		coord: 1212..1272, score: 1.2E-19; coord: 1129..1211, score: 8.1
None	No IPR available	GENE3D	G3DSA:1.10.287.280		coord: 1011..1091, score: 2.7
None	No IPR available	GENE3D	G3DSA:3.30.70.370		coord: 1273..1333, score: 5.2E-54; coord: 981..1010, score: 5.2E-54; coord: 1094..1119, score: 5.2
None	No IPR available	PANTHER	PTHR10102:SF3	SUBFAMILY NOT NAMED	coord: 807..1333, score: 0.0; coord: 132..410, score:
None	No IPR available	unknown	SSF56672	DNA/RNA polymerases	coord: 741..1340, score: 2.27E-206; coord: 177..454, score: 3.56E-32; coord: 580..753, score: 4.89

The following gene(s) are orthologous to this gene:
Gene	Orthologue	Organism	Block
Cp4.1LG09g02310	CsaV3_5G038780	Cucumber (Chinese Long) v3	cpecucB0050
Cp4.1LG09g02310	Bhi06G000267	Wax gourd	cpewgoB0023
Cp4.1LG09g02310	CsGy5G024030	Cucumber (Gy14) v2	cgybcpeB581
Cp4.1LG09g02310	CsGy5G029080	Cucumber (Gy14) v2	cgybcpeB587
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene	Organism	Block
Cp4.1LG09g02310	Cucurbita pepo (Zucchini)	cpecpeB026
Cp4.1LG09g02310	Cucurbita pepo (Zucchini)	cpecpeB040
Cp4.1LG09g02310	Cucurbita pepo (Zucchini)	cpecpeB054
Cp4.1LG09g02310	Cucumber (Gy14) v1	cgycpeB0248
Cp4.1LG09g02310	Cucumber (Gy14) v1	cgycpeB0745
Cp4.1LG09g02310	Cucurbita maxima (Rimu)	cmacpeB072
Cp4.1LG09g02310	Cucurbita maxima (Rimu)	cmacpeB105
Cp4.1LG09g02310	Cucurbita maxima (Rimu)	cmacpeB415
Cp4.1LG09g02310	Cucurbita maxima (Rimu)	cmacpeB700
Cp4.1LG09g02310	Cucurbita moschata (Rifu)	cmocpeB046
Cp4.1LG09g02310	Cucurbita moschata (Rifu)	cmocpeB081
Cp4.1LG09g02310	Cucurbita moschata (Rifu)	cmocpeB376
Cp4.1LG09g02310	Cucurbita moschata (Rifu)	cmocpeB651
Cp4.1LG09g02310	Wild cucumber (PI 183967)	cpecpiB018
Cp4.1LG09g02310	Wild cucumber (PI 183967)	cpecpiB036
Cp4.1LG09g02310	Cucumber (Chinese Long) v2	cpecuB041
Cp4.1LG09g02310	Bottle gourd (USVL1VR-Ls)	cpelsiB024
Cp4.1LG09g02310	Bottle gourd (USVL1VR-Ls)	cpelsiB032
Cp4.1LG09g02310	Bottle gourd (USVL1VR-Ls)	cpelsiB041
Cp4.1LG09g02310	Bottle gourd (USVL1VR-Ls)	cpelsiB044
Cp4.1LG09g02310	Watermelon (Charleston Gray)	cpewcgB045
Cp4.1LG09g02310	Watermelon (Charleston Gray)	cpewcgB049
Cp4.1LG09g02310	Watermelon (97103) v1	cpewmB021
Cp4.1LG09g02310	Watermelon (97103) v1	cpewmB031
Cp4.1LG09g02310	Watermelon (97103) v1	cpewmB059
Cp4.1LG09g02310	Melon (DHL92) v3.5.1	cpemeB024
Cp4.1LG09g02310	Melon (DHL92) v3.5.1	cpemeB034
Cp4.1LG09g02310	Melon (DHL92) v3.5.1	cpemeB036
Cp4.1LG09g02310	Melon (DHL92) v3.6.1	cpemedB028
Cp4.1LG09g02310	Melon (DHL92) v3.6.1	cpemedB039
Cp4.1LG09g02310	Melon (DHL92) v3.6.1	cpemedB042
Cp4.1LG09g02310	Silver-seed gourd	carcpeB0146
Cp4.1LG09g02310	Silver-seed gourd	carcpeB0645
Cp4.1LG09g02310	Silver-seed gourd	carcpeB0841
Cp4.1LG09g02310	Silver-seed gourd	carcpeB1169
Cp4.1LG09g02310	Cucumber (Chinese Long) v3	cpecucB0020
Cp4.1LG09g02310	Cucumber (Chinese Long) v3	cpecucB0043
Cp4.1LG09g02310	Wax gourd	cpewgoB0058