Cp4.1LG09g02310 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG09g02310
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: DNA-directed RNA polymerase
Location: Cp4.1LG09: 1371774 .. 1385477 (+)
RNA-Seq Expression: Cp4.1LG09g02310
Synteny: Cp4.1LG09g02310
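
The span covered by the feature can be derived from the Location field above. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome annotations, but not stated on this page); the helper name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Split a 'chrom: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Cp4.1LG09: 1371774 .. 1385477 (+)")
# Assumption: 1-based inclusive coordinates, so the length is end - start + 1.
gene_length = end - start + 1
print(chrom, strand, gene_length)
```

Under that assumption the feature spans 13,704 positions on the plus strand of Cp4.1LG09.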
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGTGAAGGAATAAATGTTGGAGAAAACGTAGGTTTTAGGAGGGTTTAATGGCCGCCGCTGTGGTTCTCTTCTCCGTTCTCCCACCATTTTGAGATCTCTACCTTCATCTCTACCTTCATCTCTACCTTCATCTCTCATCCTTCCCGCGTTTGAATTCTTTGTATTCAACTTTGTGTTCTTAGTTTCTCACTGCCTGAGAACATTTCGGAGAAACTGCATATTCCGACTATCAGTTCTTCGCATTTTCCATCTCCTTCACCTGCGTTTTGAATTGCTTGAACTTGTTGTTGTGGGAATTCGGAATCCCTACTACCCGAGAATGTGGAGAAACATTGTCAAAAGAGCTGCTTCAAGGAGGGATACGTTCTTCTCCAGAAAAGTTATTGAAGACTCTATCTTTGTTGAGGAAATTAGATCCTCCAGGACACTCGGAAGCTTGAAATCATGTTGCCTGTTAGAGGGCTCTCGTGGAATCAGCTTTTTCCATTCCCGTGAAGTTGGGTTTGCGAAATCCAATTTCGCACATTCAAGCGATCCCGTAGCTTTTTATGGGGTTCTAAACCATGCCAATGGCTACGCAACTGCTGCGGAGGCTGCCATTTCCGATGAGGACTTATCAGGGTCGGAAGAAATTCAGGAATTGATGGAGGAACTAAGCAAACAAGATAAGGTGGAGTCTCACTTTAAGCAGCCTAAGAAAATGGTGGATGGAATGAGGGTAGGTAAGTACAATATTCTACGGAAGAGACAGATAAAGATGGAGACGGAGGCTTGGGAAGAGGCTGCCAGAGAGTATCAAGAGCTATTAACGGATATGTGTGAGCAGAAGTTGGCGCCCAATTTACCTTACATGAAGTCTTTATTCCTTGGTTGGTTTGAACCCTTGCGTGATGCAATTGCTGCAGACCAGGAGTTTTGCAAGAAGAAGAGTAGAGTGTCTCATTTTGCTTATTTTGATCTTCTGCCGGCGGATATGATGGCTGTGATTACAATGCATAAGTTAATGGGGTTGTTGATGACTAACAGTGGAGGAAACAGTAGTGTCAGGGTAGTCCAAGCTGCTTGTCAGATAGGAGAAGCCATTGAACACGAGGTTGGTTAGTTTCAAATTCTTTTACTTTATTGTCTAAAATTTTGCATTTGTTTTAGATTTCCCATTGATATGAAGTTCGCTAGGTATCAAAGTAAAAAAACCAGTCATGGTTTTCTTTCCAATATTGAGAGCTATCGATCATATAGCAAGGTTTCTTACCTCATTCTGTCAGTTTTTCTTTCTCTTATTTAGATATGTGATATATTTGTTCTTGTTTGGCAATGAACATCTGAACTCTGATTGACATTTTTAACAAAAGTGTTGAAGTAATTGAAATATTTGTTTGGACAATTTTCTTTTCTACGAAATAGGTTAGAATACACAAGTTCTTTGAGAATATGAAGAAGAAGAAGAGTAATGAAAAAACTACTGAAGGAGACACTGAACCTGTGGTTGAAGACCAGGAGAAAGTGGCCAAAGAACAAGACAAATTGAGGAAAAAAGTCACCAACCTGATGAAAAAACAGAAGTTGCGGCAAGTTAGGATGATAGTCAAGGAACACGACCATTTAAAGCCTTGGGGCCAGGATGCGCATGTGAAGGTTTCTGTTTTTAATTTTTTGCCCATAGTCTCAATGTGAAGTAATGCGTAAATTCTGGTGTACTTGCTGTGAGCTACTGTAGATGTTGATGTACACATATTAACAGCGAAAAATTGTTTAATTCTGGTTTCTATTGCCTTCCAGGTTGGTTGTCGTTTGATTCAGCTATTGATTGAAACAGCGTATATTCAACCTCCAATGGATCAGTTAGGAGGGGGTCCTCCTGATATTCGTCCCGCATTTGTCCATACTCTTAAAACCATCACAAAAGAAGCACAGTTAATGAGCCTTCTGCTCTCGTTGTTTATTTTTCATATTTATCTCACTTCAATTTTTCTACATATTCTTTCTATGTCTGATCT
GTATTACTACTACTACTAATTATTACCATTTTGTTTCTTCCCGCAGAAAGACTAGCAGAAGATATGGTGTTATTGAATGCGATCCACTTGTTCGCAGAGGCCTGGAGAAAACTGTAAGTAAAATCATACATTGTGGGCTGTTCCGTACTCGGTGTGAATTTGGTTGCTTTTCTATAAATAGTTTGTAGACATTATTTTTGCTGTATGATTATGCAACTCTGTAGTTCTGCAAGGTTCTCTCTCTCTCAAAATTACTCAGTTTATGTCTTGGATTAGTGATTGTCTTCAGTTTTGTCTCAGGCAAGACACATGGTCATACCATATATGCCTATGCTGGTGCCTCCCCTTAATTGGACAGGGTAGGTTACTCTATATTTTAATCTATTTCAATAAATTCTAATAGTTTCCATAATATTCAGAGAATGAAAACCTAGTTTCATGTAACGATATTTACTGGTTCATTTGTTGAAGAGTATATATATCTGAATGATATGTATTTTCTGAAGCTTAAGTATCATTTGTGCTGTATGCTTGTTATGTTTGGTGGTTGATGTCTGCTATTCGTNTTGCTTGAACTTGTTGTTGTGGGAATTCGGAATCCCTACTACCAGAGAATGTGGAGAAACATTGTCAAAAGAGCTGCTTCAAGGAGGGATACGTTCTTCTCCAGAAAAGTTATTGAAGACTCTATCTCTGTTGAGGAAATTAGATCCTCCAGGACACTTGGAAGCTTGAAATCATGTTGCCTGTTAGAGGGCTCTCGTGGAATCAGCTTTTTCCATTCCCGTGAAGTTGGGTTTGCGAAATCCAATTTCGCACATTCAAGCGAGCCCGTAGCTTTTTATGGGGTTCTAAACCATGCCAAAGGCTACGCAACTGCTGCGGAGGCTGCGATTTCCGAGGAGGACTTATCAGGGTCGGAAGAAATTCAGGAATTGATGGAGGAACTAAGCAAACAAGATAAGGTGGAGTCTCACTTTAAGCAGCCTAAGAAAATGGTGGATGGAATGAGGGTAGGTAAGTACAATATTCTACGGAAGAGACAGATAAAGATGGAGACGGAGGCTTGGGAAGAGGCTGCCAGAGAGTATCAAGAGCTATTAACGGATATGTGTGAGCAGAAGTTGGCGCCCAATTTACCTTACATAAAGTCTTTATTCCTTGGTTGGTTTGAACCCTTGCGTGATGCAATTGCTGCAGACCAGGAGTTTTGCAAGAAGAAGAGNATATACCATGTTGAAAAATTAATATCACTGGACTCTAGAGACGATCATAAGATATTAGGACTCTGGTACTTCCTCTTTGTCGAAGAAGATTTTTGCTTTGACTATATGTTTCATGGGCTTTGTCTTTGAAGTCGCGATACCATCTTNTAGTTTCATGTAACGATATTTACTGGTTCATTTGTTGAAGAGAATATATATCTGAATGATGCGTATTCTCTGAAGCTTAAGTTTCATTTGTGCTGTATGCTTGTTATGTTTGGTGTTTGATGTCTGCTATTCATGTGTAGGTATGATAAAGGAGCACACTTATTCCTACCATCATATGTTATGCGAATACATGGGGCAAAGCAGCAACGTGAAGCAGTTAAAAGGGTTCCAAAGAAACAACTAGAGCCTGTTTTTGAGGTTTGAGAGTTCTGATGATTGTTTCCTAGTTTAATGATTTACATACACACCTTTGGTTTTGCCTGACCTCTTCCTCCATTCCCATATACCATGTTGAAAAATTAATATCACTGGACTCTAGAGACGATCATAAGATATTAGGACTCTGGTACTTCCTCTTTGTCGAAGAAGGTTTTTGATTTGACTATATGTTTCATGGGCTTTGTCTTTGAAGTCGCGATACCATCTTTAGAAGGAACTCCTAAGATTTTCCCATAGAAAAAATCATCTTGCAGTTTCTTTTTGAACTTCAGTTACGTAAGAATTCTTAATATATGATAATTTTTGCATTTTTAGGCACTTGATACGCTTGGAAGAACCAAGTGGAGG
GTGAACAAAAGAGTTCTTAGCATTATTGACAGGATATGGGCCAGTGGCGGTCGTCTTGCTGATTTGGTTGACCGTGAAGATGTATGTTACAGAAGAAGAATGCTAGTTGTTCCTATTTGCATGTTTTCCTATCTTATGAAATTTATATCTTTTTTTGTAACTTTAACTTTAACTTCAATTTGTACAGGCTNTACTGAAGGAGACACTGAACCTGTGGTTGAAGACCAGGAGAAAGTGGCCAAAGAACAAGACAAATTGAGGAAAAAAGTCACCAACCTGATGAAAAAACAGAAGTTGCGGCAAGTTAGGATGATAGTCAAGGAACACGACCATTTAAAGCCTTGGGGCCAGGATGCGCATGTGAAGGTTTCTGTTTTTAATTTTTTGCCCATAGTCTCAATGTGAAGTAATGCGTAAATTCTGGTGTACTTGCTGTGAGCTACTGTAGATGTTGATGTACACATATTAACAGCGAAAAATTGTTTAATTCTGGTTTCTATTGCCTTCCAGGTTGGTTGTCGTTTGATTCAGCTATTGATTGAAACAGCGTATATTCAACCTCCAATGGATCAGTTAGGAGGGGGTCCTCCTGATATTCGTCCCGCATTTGTCCATACTCTTAAAACCATCACAAAAGAAGCACAGTTAATGAGCCTTCTGCTCTCTTGGTTTATTTTTCATATTTATCTCACTTCGATTTTTCTACATATTCTTTCTATGTCTGATCTGTATTACTACTACTACTAATTATTACCATTTTGTTTCTTCCCGCAGAAAGACTAGCAGAAGATATGGTGTTATTGAATGCGATCCACTTGTTCGCAGAGGCCTGGAGAAAACTGTAAGTAAAATCATACATTGTGGGCTGTTCCGTACTCGGTGTGAATTTGGTTGCTTTTCTATAAATAGTTTGTAGACATTATTTTTGCTGTATGATTATGCAACTCTGTAGTTCTGCAAGGTTCTCTCTCTCTCAAAATTACTCAGTTTATGTCTTGGATTAGTGATTGTCTTCAGTTTTGTCTCAGGCAAGACACATGGTCATACCATATATGCCTATGCTGGTGCCTCCCCTTAATTGGACAGGGTAGGTTACTCTATATTTTAATCTATTTCAATAAATTCTAATAGTTTCCATACTATTCAGAGATTGAAAACCTAGTTTCATGTAACGATATTTACTGGTTCATTTGTTGAAGAGAATATATATCTGAATGATGCGTATTCTCTGAAGCTTAAGTTTCATTTGTGCTGTATGCTTGTTATGTTTGGTGTTTGATGTCTGCTATTCATGTGTAGGTATGATAAAGGAGCACACTTATTCCTACCATCATATGTTATGCGAATACATGGGGCAAAGCAGCAACGTGAAGCAGTTAAAAGGGTTCCAAAGAAACAACTAGAGCCTGTTTTTGAGGTTTGAGAGTTCTGATGATTGTTTCCTAGTTTAATGATTTACATACACACCTTTGGTTTTGCCTGACCTCTTCCTCCATTCCCCTATACCATGTTGAAAAATTAATATCACTGGACTCTAGAGACGATCATAAGATATTAGGACTCTGGTACTTCCTCTTTGTCGAAGAAGGTTTTTGATTTGACTATATGTTTCATGGGCTTTGTCTTTGAAGTTGCGATACCATCTTAGAAGGAACTCCTAAGATTTTCCCATAGAAAAAATCATCTTGCAGTTTCTTTTTGAACTTCAGTTACTAAAGAATTCGTAATATATGATAATTTTTGCATTTTTAGGCACTTGATACTCTTGGAAGAACCAAGTGGAGGGTGAACAAAAGAGTCCTTAGCATTATTGACAGGATATGGACCAGTGGTGGTCGTCTTGCTNAGCTACTTCCATTTTTACTTGGCTATTTCTTATTCTGCGTGTTCGGGGATTGAAAATACCTCAATGCTTGCTTATTCGAAGCTAAAAGCACCTGAATTTAAATAATGCTTCTTGTACTANTTGACCGTGAAGATGTATGTTACAGAAGAAGAAT
GCTAGTTGGTCCTATTTGCATGTTTTCCTATCTTATGAAATTTATTTTTTTTGTAACTTTAACTTCAATTTTTACAGGTTCCTCTACCAGAGGAGCCAACTGTGGAAGATGAAGCAGAAATTCGAAAATGGAAGTGGAAACTCAAGGCTGCAAAAAAAGAGAATAGTGAGAGGCATTCACAGCGGTGTGACATTGAGCTTAAGCTTGCAGTAAGAACTTATTGAATGAATAGAATTTAAATTTGAATACTCTTGTCAAATACATTTGAATTTACTTTTATTGTGAGCAGGTGGCCAGGAAAATGAAAGAAGAGGAGGGCTTTTACTATCCTCACAACTTAGATTTTCGAGGTCGTGCATACCCAATGCATCCACATTTGAATCATCTTGGTTCTGATATGTGTCGAGGAATTCTAGAATTTGCGGAGGGACGGCCACTTGGCGAGTCAGGGTTACGCTGGTTGAAGATACATTTGGCAAATCTATACGCTGGTGGTGTGGACAAGTTATCTTACAAGGATCGAATATCATTTACTGAGAATCATTTGGATGAGATTTTTGATTCGGCAGACAGGCCTCTAGAAGGAAATCGTTGGTGGTTGGGCGCAGAGGATCCTTTTCAGTGCTTGGCAGTGTGTATTAATCTCTCAGAGGCTTTAAGAAGTCCATCGCCGGAAACAACTCTTTCCCATATGCCTGTACACCAGGTACTACTTACATTATTCTTGCAATTCGTCCCCAAATTATGAATGCAATTTTTGTTCTCTGCAACTTTATCCCTTTCGAAACCATAATCATATAGCTATTCTGCAGTATATATCTTTGTTTATCCTTATTTATTTCCCCATGATTTGGCAGGATGGTTCCTGCAATGGCTTGCAACACTATGCAGCTCTCGGGAGGGACAAGGTACTTTTAATTTCTCTAGTTAGATAAGATTAAGATACAAAGTCAGCTGAGCTACTTCCATTTTTACTTGGCTATTTCTTATTCTGCGTGTTCGGGGATTGAAAATACCTCAATGCTTGCTTATTCGAAGCTAAAAGCACCTGAATTTAAATAATGCTTCTTGTACTACACATACGAAGAACTTTAAAAATATTTCTTTTAACTATCTCTATTTTTAGGAACTCCTTAATTTTTACAAAAAAAGTCTTCTTTGAGATGACACGGTCTGTTTCTTAATTAGCATTTAATATTGTGTCATTTTTCTTCAGTTGGGAGCAGCGGCAGTTAACCTTGTAGCAGGAGATAAGCCCGCAGATGTGTACTCAGGAATTGCTGCCAGGTTACTGCATCTCTTGTTTGCATGTTTGATTGTAGGGAGGATTATTTAAGTGTTCCTCATTTTTTTTTTAATGTTTTTTTAAAATTGCTCTTTGGTGTTTTTTACATATGAAAGTTGAGTCTTAATAACCCGGCAGGCGTCAATGCCATGTTTGTTTGAATAAGCTGAAAAGTATTTTGAAGCCTGGTCATTATGATAAGGGAGCTAATATATGCGTTCCGCGAACTATAGAAAATGAAAGTATGCTCAATGCAAGTGATATTTTTTAGATCAAATTGAAGTGACTCTGCTCTCTGCATTCAACTTTGGAAGCTTCGTTTACCAACTAGTATGAAAGGAAAAATACCTATTTAGACAGTATTCTCCATGCTTTCGGTATAAAAGTTTGCAAATGACTTGTGTTTTAGAGCATATTTTTGGCGTTCGGAGGGGATCTAGTTTCTTTGTTTATTTGTATATTTTTCTGGGTGCAGAGTTCTTGACATAATACGAAGTGATGCAGAGAAAGATCCTGCAACTAATCCAAATGCGTTGCATGCTAGACTTTTAATCAATCAGGTCTGGTTACTAATAATTCTATGAGAATAATGTTTTTTAGAACTGTTTCTAATAGCTGCTTGTATGTTTTTCCTCTAATGGATTTATATTTGTGACTGCAATTAGGTGGATCGTAAACTGGTGAAACAAACAGTCATGACATCCGTGTATGGTGT
CACGTATATGGGTGCACGGGATCAGATTAAGAAAAGGTTGAAAGAACGAGCATCCATTGCTGATGATTCACAATTATTTGCAGCTTCTTGCTATGCAGCTAGAGTGAGTCAATAAACTGCCATTTGCTTGTGAGAAAAATGTAGCACATTTATGCCTATTTGCATGCATATTATAGTTTTGTTTCTTTTTTTTACACAACTTTTTCTTTTATATTTACAAGAAAAATGTGTTACTACAATTATTATTATTTTGAAATGAACGTTCTGCTGATTCGAAGACTTATCGTGCAGACTACCTTGACTGCCTTGGGGGAAATGTTTGAAGCAGCAAGAAGTATCATGAGCTGGCTTGGTGAATGTGCAAAGGTGCGGTCCTTAATTAGTGATATTGTCATTATTTTGGTTATTTTGATAAAGTTCGTAGTATCACTAGCTTGAAGTGGGAATTCTTCGATAGTTCCCCTCCTCTTGTAAATTGTGTTCCTTCCTAAAAAGAAAGGACTTTTCATTGATTGGTGAAACCTTCACGCTGAAGCCGTCATAAACATGCACCCACATGGGACTTACAGAGTCTTTACTAACCCACTTTTTTCAACATCATTTACATTACTTAGGCTCATTTGGTTTTCTCCAGCCATAAGTGAACTGGATAGTCTTCCCATCAGATCTTACCAGCTTCAAAAATAGATGTTTAGGAGAAAACTGCCTTAATGCATCGACCTTCCATCTTCACTCGATAAATCTCCCAACTCCTCAGTTTCCTATCCTACTAAATTTCTTCCAGTCAAGATTTTCCTCTATTTCGTTTCAATTTCTCACGTAGAGATTGAGAAATTTAACATCAATGGTTGGGTGACCAGCCATATTTCCTAGAACCTCACCCTTTACCCAAACCCACATCGAGCTTAATTTCGTCTTCGAACAAAGATCTTAATTTTGAAATACAACTACAAGGGTTGTGATCTCCTCCATATTTCTACAATTTTCAGACTCATGTTCCCTCGTCTTAGCCGTAGGTACTTCAAATAATTTGATTTTATAACATCTTGAGGAGTGGGTTATTTTAGCAATGTTACGTTTCTTATCTTAGTTAAGTATAGAAATCTGATTTTATGTGCATTTCAGCAAGCGTTCCTCTGCCTTTTTGTCACTCGACTAAGGTGTAGCTTTTCACACATTTAATTTTCAACAGGTAATAGCTTCAGAAAATCAGCCAGTTCGATGGACAACTCCTCTTGGACTGCCAGTGGTACAACCTTACCGGCAACTAGGAAGACATCTTGTAAAGTTCTAATAAATTTGTGAACTTTAAGCTAACAATGATTTGGACTTCTCCAACAAACTGAAACTGTAATTGACTTATTGCAGATCAAGACTTCCTTGCAAGTGTTGGCTCTACAACGAGAAACTGACAAGGTAAGGCCAACCAAATCAGGAATTATTGGGTAGACTCGATGTGTTTAAAACCTTTTGAAATTGAGACGCTTCAAGTTTATTAACTCTCTTGAGATGGGTCACTGTATGCAGTAGTGCAATTCATTGGTTGCATGATATGTTTACTAGCCGGATAATTCCTATTATTTTCTGTAGGTCATGGCTACGCGTCAGAAAACAGCTTTTCCTCCAAATTTTGTACACTCCCTTGATGGTTCTCATATGATGATGACTGCTGTCGCCTGCCGAAGGGCAGGCCTTAACTTTGCAGGTTCACTTCATTTTTCTTTTACTTCTCTATTGATTGGTTCCTTTATCTCAGTGAACTTATGTGAATTCTCGAGTGTTCAATCGTGGCCATATCTCTTCTTACCATGTTTGATTACAGGTGTCCATGATTCCTATTGGACCCATGCATGCGATGTTGACGTAATGAACAGGCTACTAAGGGAGAAATTTGTTGAGCTATACGAGGCTCCTATTCTGGAGAATGTAAGTATCTCGCCCCATGTGTACTTGATCGGCTAACTTATCCTGGTTATTATTGGCAAAATATATTCCTCTTG
AGACAAAATAGAGCACATTGACAGTATTATCCTATAGATACAATCTTTGGATCAATTAGAACAAAGATCTATTTGTCTTCTTCTGTCCTGGAATTTCTTTTAACTTATAAATTGAACGGTTTTTGCCTTTACAGTTACTGAAAAATTTCCAAAAGTCCTTCCCCACTTTAAAGTTTCCGCCCTTACCCGATCGGGGAGACTTCGATCTCAAGGACGTCCTGCAATCTTCCTATTTCTTCAATTAGTGCAGCCTAGATACTTCAGTCGGATGCTCTTTCTCATATGTGCTACACGTCTGTGGCTGACATACAGTCCTGTCTACGACTCTTGAAATCAGACTCTGGAGGCGAGTTTGCTACTTCGAGGTCAGTGCTGTCCAGCTACTTGTACATAACACATTCTCCAACCATTGGTAGTCTAAATCTTGGAACTTGGGAGTTCAATCTGCCATTCACGTATTCAACTTTTACCTTACAAAATAAAGATAGGTTATATTTGGAAGCTACGGAATTTCTACCTTCATTGTACTCTGATCCGGACTTAAGTTCTGACACTGCAATCAATTAGAGGTTCAGCGATGCTGAGAGACATGGCTTTCAAAGTGTGATCAATGAAGTAAGATCTCGGTGTAATCTCCCAAAGATCAGTTGGAGTGTTGCCATTTAGTTTTCATAGGATCATGGCTTAACTTCATGACAAAAGGAAGAATGGGCTGCTGAGAAGAAAGTCGTGGTGCATACTCATACCATAGCGAGGTATTGCAGGTTGTTTCATTCCCAAAAATAAGGAAAAATGAAGAAGATTGTTGTTCTAGAAATACCAGTAAATTTTAGTCTATGCCGATTGTATTCAATTTTTTAAACAAGGCTGACATTGGATGACGACTCTATATAGTACTGGGGATGGCTGGCATTCCCTTGNGCTCCTATTCTGGAGAATGTAAGTATCTCGCCCCATGTGTACTTGATCGGCTAACTTATCCTGGTTATTATTGGCAAAATATATTCCTCTTGAGACAAAATAGAGCACATTGACAGTATTATCCTATAGATACAATCTTTGGATCAATTAGAACAAAGATCTATTTGTCTTCTTCTGTCCTGGAATTTCTTTTAACTTATAAATTGAACGGTTTTTGCCTTTACAGTTACTGAAAAATTTCCAAAAGTCCTTCCCCACTTTAAAGTTTCCGCCCTTACCCGATCGGGGAGACTTCGATCTCAAGGACGTCCTGCAATCTTCCTATTTCTTCAATTAGTGCAGCCTAGATACTTCAGTCGGATGCTCTTTCTCATATGTGCTACACGTCTGTGGCTGACATACAGTCCTGTCTACGACTCTTGAAATCAGACTCTGGAGGCGAGTTTGCTACTTCGAGGTCAGTGCTGTCCAGCTACTTGTACATAACACATTCTCCAACCATTGGTAGTCTAAATCTTGGAACTTGGGAGTTCAATCTGCCATTCACGTATTCAACTTTTACCTTACAAAATAAAGATAGGTTATATTTGGAAGCTACGGAATTTCTACCTTCATTGTACTCTGATCCGGACTTAAGTTCTGACACTGCAATCAATTAGAGGTTCAGCGATGCTGAGAGACATGGCTTTCAAAGTGTGATCAATGAAGTAAGATCTCGGTGTAATCTCCCAAAGATCAGTTGGAGTGTTGCCATTTAGTTTTCATAGGATCATGGCTTAACTTCATGACAAAAGGAAGAATGGGCTGCTGAGAAGAAAGTCGTGGTGCATACTCATACCATAGCGAGGTATTGCAGGTTGTTTCATTCCCAAAAATAAGGAAAAATGAAGAAGATTGTTGTTCTAGAAATACCAGTAAATTTTAGTCTATGCCGATTGTATTCAATTTTTTAAACAAGGCTGACATTGGATGACGACTCTATATAGTACTGGGGATGGCTGGCATTCCCTTGCTCGCAGGTGTACAGAATTTACAGATGAATTTGTTAAATCGATTATGGTTGTTTGGAATTATTCTAC
TTGCGGATGAACTTAACTTCCTTTTGGAGCTCTTTTTCTTAGTTAGAAACAGAAGGCGATGCAGAAACGAAGTGCCTTCTGAATCCCCACCTGCTACTAAAAATTTCCTTTGTATTCTGAAGCTTTCATGGCCGACTGTCTGATTCAGCTGCTACTATTATACTTCCCTGGTATGCTCATCTCTTATACTTCAGGACTGATTGTTTATGTTCGGTTGAAAATAGAATGTAAAACCGTAAGCTTCCAAGTAGTGGGGGCAACTAGGACGAGTACTCATAAGCAGTGACTTGGTTAGGAGTTCTCTCTCAGCTTAGTTCTTTTTGTTTCGTTTGATAGATGGTCGACAAGTATGATGGTGTCTTAACGATGAATATGCACGTATTTTGTCTTTTATTGGCGAATTTGAAAACCTCAGTAATTGACTTGGGTTACAAGTGTGGTTTTGGTTGTAACGACCCCTTCCTGTTGTGTAACTGGTTATTCCTTACCTCGAGACCAAATTAAGTATTAGGTAGTCAAAACTAACAGGTATACTGTGAGACAATCTATCTTGACGTGTCTTTTTTTTACCGTTTATACAGGAGCAAAAATTCATTGGACAAAAGAAAGAGACATGGAAGTGGCTAAATGCTTGTTTGAATTAAGTTTAAAATAATAATATTATGATAAGTCGATTTTTTATTTAATTTTCTAATAATCTTTCCAATAATAGCTCTTAAAATTGTTGGTAGAATAAGCCCTTAAAATTGTAGTTTGTTAGAAATAATCTTAAGTTTTTTTCTTTCTTTTTAAAAAAATGACGTGTTCTATTAGTGTGACGTGTTTTCTATTAGTGTGACGTGTCGATGCTTGTTTATTTGGCTGAATTGCTTTCTTATTGGCCACGCAAGACGTCTTCTTTGTGTAAAAATGAAAATCCTCCCCGTCAGGAAAAATTTATAAATTAAGTCAAATTCTGACGTGTTCTGTTGCTACTTAGGTCATGCTACTTATTTTTCTTGAACGATTTTCGAGAATGAAAACTATTCCGATAAACCAGCATGATTATCAATAATATTCTCGATAGGTCACATAACATAGAATTGTTTCAAGTCAAACACGCTCTGTTTTAATATTTCTATAATTAAATTATACAAAAAGAAAAATGTATTTTATTGATATAAATAATAAACTTGTCCCGACCATGTGTTACATTTGAAGGTCTACTCTTATTTGGATGTCGTCTTGACCTATTCGTGTAATGGATATATCAATATTTTGATGAAACGTGGGTCTTCCATTCTTGGAAGCTAAGTAAAGTCACATGTTGTATTTCTTGTTCTCAAATTAAATAACATTATATATTTGACGAAAATATTAAAGAATCAAATCTTCACTTTTACATATATTCGACATAACAAAACCCGATGAAACTAAGGTCTTCCATGTTTGGAAACTAAGTCAGTCACAATTTCAAGCAGTCCAAACCAAGAAGAACACGACACCCGAACGTGTCTTGCACGTTCCAGAGCTACATAAAATACACGTAACGTTCAACACGTGGGATGTAACCGAGTAGAGGCTTAATTGAACTTGACCTTGTGAGCAGAAGAGAAATGCTTGGCTGCAGCAACCATGGCGGCCTCTGGTCCAATTGGAGGAGGAGGAGGTGCAGCCATGGCTGCCTCTGGGCTTCCGCCATTAAAGCAGCCAAGCAAGCAATAA

mRNA sequence

ATGGTTTCTCACTGCCTGAGAACATTTCGGAGAAACTGCATATTCCGACTATCAGTTCTTCGCATTTTCCATCTCCTTCACCTGCGTTTTGAATTGCTTGAACTTGTTGTTGTGGGAATTCGGAATCCCTACTACCCGAGAATGTGGAGAAACATTGTCAAAAGAGCTGCTTCAAGGAGGGATACGTTCTTCTCCAGAAAAGTTATTGAAGACTCTATCTTTGTTGAGGAAATTAGATCCTCCAGGACACTCGGAAGCTTGAAATCATGTTGCCTGTTAGAGGGCTCTCGTGGAATCAGCTTTTTCCATTCCCGTGAAGTTGGGTTTGCGAAATCCAATTTCGCACATTCAAGCGATCCCGTAGCTTTTTATGGGGTTCTAAACCATGCCAATGGCTACGCAACTGCTGCGGAGGCTGCCATTTCCGATGAGGACTTATCAGGGTCGGAAGAAATTCAGGAATTGATGGAGGAACTAAGCAAACAAGATAAGGTGGAGTCTCACTTTAAGCAGCCTAAGAAAATGGTGGATGGAATGAGGGTAGGTAAGTACAATATTCTACGGAAGAGACAGATAAAGATGGAGACGGAGGCTTGGGAAGAGGCTGCCAGAGAGTATCAAGAGCTATTAACGGATATGTGTGAGCAGAAGTTGGCGCCCAATTTACCTTACATGAAGTCTTTATTCCTTGGTTGGTTTGAACCCTTGCGTGATGCAATTGCTGCAGACCAGGAGTTTTGCAAGAAGAAGAGTAGAGTGTCTCATTTTGCTTATTTTGATCTTCTGCCGGCGGATATGATGGCTGTGATTACAATGCATAAGTTAATGGGGTTGTTGATGACTAACAGTGGAGGAAACAGTAGTGTCAGGGTAGTCCAAGCTGCTTGTCAGATAGGAGAAGCCATTGAACACGAGGTTAGAATACACAAGTTCTTTGAGAATATGAAGAAGAAGAAGAGTAATGAAAAAACTACTGAAGGAGACACTGAACCTGTGGTTGAAGACCAGGAGAAAGTGGCCAAAGAACAAGACAAATTGAGGAAAAAAGTCACCAACCTGATGAAAAAACAGAAGTTGCGGCAAGTTAGGATGATAGTCAAGGAACACGACCATTTAAAGCCTTGGGGCCAGGATGCGCATGTGAAGGTTGGTTGTCGTTTGATTCAGCTATTGATTGAAACAGCGTATATTCAACCTCCAATGGATCAGTTAGGAGGGGGTCCTCCTGATATTCGTCCCGCATTTGTCCATACTCTTAAAACCATCACAAAAGAAGCACAAAAGACTAGCAGAAGATATGGTGTTATTGAATGCGATCCACTTGTTCGCAGAGGCCTGGAGAAAACTAGAATGTGGAGAAACATTGTCAAAAGAGCTGCTTCAAGGAGGGATACGTTCTTCTCCAGAAAAGTTATTGAAGACTCTATCTCTGTTGAGGAAATTAGATCCTCCAGGACACTTGGAAGCTTGAAATCATGTTGCCTGTTAGAGGGCTCTCGTGGAATCAGCTTTTTCCATTCCCGTGAAGTTGGGTTTGCGAAATCCAATTTCGCACATTCAAGCGAGCCCGTAGCTTTTTATGGGGTTCTAAACCATGCCAAAGGCTACGCAACTGCTGCGGAGGCTGCGATTTCCGAGGAGGACTTATCAGGGTCGGAAGAAATTCAGGAATTGATGGAGGAACTAAGCAAACAAGATAAGGTGGAGTCTCACTTTAAGCAGCCTAAGAAAATGGTGGATGGAATGAGGGTAGGTAAGTACAATATTCTACGGAAGAGACAGATAAAGATGGAGACGGAGGCTTGGGAAGAGGCTGCCAGAGAGTATCAAGAGCTATTAACGGATATGTGTGAGCAGAAGTTGGCGCCCAATTTACCTTACATAAAGTCTTTATTCCTTGGTTGGTTTGAACCCTTGCGTGATGCAATTGCTGCAGACCAGGAGTATGATAAAGGAGCACACTTATTCCTACCATCATATGTTATGCGAATACATGGGGC
AAAGCAGCAACGTGAAGCAGTTAAAAGGGTTCCAAAGAAACAACTAGAGCCTGTTTTTGAGGCACTTGATACGCTTGGAAGAACCAAGTGGAGGGTGAACAAAAGAGTTCTTAGCATTATTGACAGGATATGGGCCAGTGGCGGTCGTCTTGCTGATTTGGTTGACCGCTNTACTGAAGGAGACACTGAACCTGTGGTTGAAGACCAGGAGAAAGTGGCCAAAGAACAAGACAAATTGAGGAAAAAAGTCACCAACCTGATGAAAAAACAGAAGTTGCGGCAAGTTAGGATGATAGTCAAGGAACACGACCATTTAAAGCCTTGGGGCCAGGATGCGCATGTGAAGGTTGGTTGTCGTTTGATTCAGCTATTGATTGAAACAGCGTATATTCAACCTCCAATGGATCAGTTAGGAGGGGGTCCTCCTGATATTCGTCCCGCATTTGTCCATACTCTTAAAACCATCACAAAAGAAGCACAAAAGACTAGCAGAAGATATGGTGTTATTGAATGCGATCCACTTGTTCGCAGAGGCCTGGAGAAAACTGCAAGACACATGGTCATACCATATATGCCTATGCTGGTGCCTCCCCTTAATTGGACAGGGTATGATAAAGGAGCACACTTATTCCTACCATCATATGTTATGCGAATACATGGGGCAAAGCAGCAACGTGAAGCAGTTAAAAGGGTTCCAAAGAAACAACTAGAGCCTGTTTTTGAGGCACTTGATACTCTTGGAAGAACCAAGTGGAGGGTTCCTCTACCAGAGGAGCCAACTGTGGAAGATGAAGCAGAAATTCGAAAATGGAAGTGGAAACTCAAGGCTGCAAAAAAAGAGAATAGTGAGAGGCATTCACAGCGGTGTGACATTGAGCTTAAGCTTGCAGTGGCCAGGAAAATGAAAGAAGAGGAGGGCTTTTACTATCCTCACAACTTAGATTTTCGAGGTCGTGCATACCCAATGCATCCACATTTGAATCATCTTGGTTCTGATATGTGTCGAGGAATTCTAGAATTTGCGGAGGGACGGCCACTTGGCGAGTCAGGGTTACGCTGGTTGAAGATACATTTGGCAAATCTATACGCTGGTGGTGTGGACAAGTTATCTTACAAGGATCGAATATCATTTACTGAGAATCATTTGGATGAGATTTTTGATTCGGCAGACAGGCCTCTAGAAGGAAATCGTTGGTGGTTGGGCGCAGAGGATCCTTTTCAGTGCTTGGCAGTGTGTATTAATCTCTCAGAGGCTTTAAGAAGTCCATCGCCGGAAACAACTCTTTCCCATATGCCTGTACACCAGGATGGTTCCTGCAATGGCTTGCAACACTATGCAGCTCTCGGGAGGGACAAGTTGAGTCTTAATAACCCGGCAGGCGTCAATGCCATAGTTCTTGACATAATACGAAGTGATGCAGAGAAAGATCCTGCAACTAATCCAAATGCGTTGCATGCTAGACTTTTAATCAATCAGGTGGATCGTAAACTGGTGAAACAAACAGTCATGACATCCGTGTATGGTGTCACGTATATGGGTGCACGGGATCAGATTAAGAAAAGGTTGAAAGAACGAGCATCCATTGCTGATGATTCACAATTATTTGCAGCTTCTTGCTATGCAGCTAGAGTAATAGCTTCAGAAAATCAGCCAGTTCGATGGACAACTCCTCTTGGACTGCCAGTGGTACAACCTTACCGGCAACTAGGAAGACATCTTATCAAGACTTCCTTGCAAGTGTTGGCTCTACAACGAGAAACTGACAAGGTCATGGCTACGCGTCAGAAAACAGCTTTTCCTCCAAATTTTGTACACTCCCTTGATGGTTCTCATATGATGATGACTGCTGTCGCCTGCCGAAGGGCAGGCCTTAACTTTGCAGGTGTCCATGATTCCTATTGGACCCATGCATGCGATGTTGACGTAATGAACAGGCTACTAAGGGAGAAATTTGTTGAGCTATACGAGGCTCCTATTCTGGAGAATATACTTCAGTCGG
ATGCTCTTTCTCATATGTGCTACACGTCTGTGGCTGACATACAGTCCTGTCTACGACTCTTGAAATCAGACTCTGGAGGCGAGTTTGCTACTTCGAGAATCAAATCTTCACTTTTACATATATTCGACATAACAAAACCCGATGAAACTAAGGTCTTCCATGTTTGGAAACTAAGTCAGTCACAATTTCAAGCAGTCCAAACCAAGAAGAACACGACACCCGAACAAGAGAAATGCTTGGCTGCAGCAACCATGGCGGCCTCTGGTCCAATTGGAGGAGGAGGAGGTGCAGCCATGGCTGCCTCTGGGCTTCCGCCATTAAAGCAGCCAAGCAAGCAATAA

Coding sequence (CDS)

ATGGTTTCTCACTGCCTGAGAACATTTCGGAGAAACTGCATATTCCGACTATCAGTTCTTCGCATTTTCCATCTCCTTCACCTGCGTTTTGAATTGCTTGAACTTGTTGTTGTGGGAATTCGGAATCCCTACTACCCGAGAATGTGGAGAAACATTGTCAAAAGAGCTGCTTCAAGGAGGGATACGTTCTTCTCCAGAAAAGTTATTGAAGACTCTATCTTTGTTGAGGAAATTAGATCCTCCAGGACACTCGGAAGCTTGAAATCATGTTGCCTGTTAGAGGGCTCTCGTGGAATCAGCTTTTTCCATTCCCGTGAAGTTGGGTTTGCGAAATCCAATTTCGCACATTCAAGCGATCCCGTAGCTTTTTATGGGGTTCTAAACCATGCCAATGGCTACGCAACTGCTGCGGAGGCTGCCATTTCCGATGAGGACTTATCAGGGTCGGAAGAAATTCAGGAATTGATGGAGGAACTAAGCAAACAAGATAAGGTGGAGTCTCACTTTAAGCAGCCTAAGAAAATGGTGGATGGAATGAGGGTAGGTAAGTACAATATTCTACGGAAGAGACAGATAAAGATGGAGACGGAGGCTTGGGAAGAGGCTGCCAGAGAGTATCAAGAGCTATTAACGGATATGTGTGAGCAGAAGTTGGCGCCCAATTTACCTTACATGAAGTCTTTATTCCTTGGTTGGTTTGAACCCTTGCGTGATGCAATTGCTGCAGACCAGGAGTTTTGCAAGAAGAAGAGTAGAGTGTCTCATTTTGCTTATTTTGATCTTCTGCCGGCGGATATGATGGCTGTGATTACAATGCATAAGTTAATGGGGTTGTTGATGACTAACAGTGGAGGAAACAGTAGTGTCAGGGTAGTCCAAGCTGCTTGTCAGATAGGAGAAGCCATTGAACACGAGGTTAGAATACACAAGTTCTTTGAGAATATGAAGAAGAAGAAGAGTAATGAAAAAACTACTGAAGGAGACACTGAACCTGTGGTTGAAGACCAGGAGAAAGTGGCCAAAGAACAAGACAAATTGAGGAAAAAAGTCACCAACCTGATGAAAAAACAGAAGTTGCGGCAAGTTAGGATGATAGTCAAGGAACACGACCATTTAAAGCCTTGGGGCCAGGATGCGCATGTGAAGGTTGGTTGTCGTTTGATTCAGCTATTGATTGAAACAGCGTATATTCAACCTCCAATGGATCAGTTAGGAGGGGGTCCTCCTGATATTCGTCCCGCATTTGTCCATACTCTTAAAACCATCACAAAAGAAGCACAAAAGACTAGCAGAAGATATGGTGTTATTGAATGCGATCCACTTGTTCGCAGAGGCCTGGAGAAAACTAGAATGTGGAGAAACATTGTCAAAAGAGCTGCTTCAAGGAGGGATACGTTCTTCTCCAGAAAAGTTATTGAAGACTCTATCTCTGTTGAGGAAATTAGATCCTCCAGGACACTTGGAAGCTTGAAATCATGTTGCCTGTTAGAGGGCTCTCGTGGAATCAGCTTTTTCCATTCCCGTGAAGTTGGGTTTGCGAAATCCAATTTCGCACATTCAAGCGAGCCCGTAGCTTTTTATGGGGTTCTAAACCATGCCAAAGGCTACGCAACTGCTGCGGAGGCTGCGATTTCCGAGGAGGACTTATCAGGGTCGGAAGAAATTCAGGAATTGATGGAGGAACTAAGCAAACAAGATAAGGTGGAGTCTCACTTTAAGCAGCCTAAGAAAATGGTGGATGGAATGAGGGTAGGTAAGTACAATATTCTACGGAAGAGACAGATAAAGATGGAGACGGAGGCTTGGGAAGAGGCTGCCAGAGAGTATCAAGAGCTATTAACGGATATGTGTGAGCAGAAGTTGGCGCCCAATTTACCTTACATAAAGTCTTTATTCCTTGGTTGGTTTGAACCCTTGCGTGATGCAATTGCTGCAGACCAGGAGTATGATAAAGGAGCACACTTATTCCTACCATCATATGTTATGCGAATACATGGGGC
AAAGCAGCAACGTGAAGCAGTTAAAAGGGTTCCAAAGAAACAACTAGAGCCTGTTTTTGAGGCACTTGATACGCTTGGAAGAACCAAGTGGAGGGTGAACAAAAGAGTTCTTAGCATTATTGACAGGATATGGGCCAGTGGCGGTCGTCTTGCTGATTTGGTTGACCGCTNTACTGAAGGAGACACTGAACCTGTGGTTGAAGACCAGGAGAAAGTGGCCAAAGAACAAGACAAATTGAGGAAAAAAGTCACCAACCTGATGAAAAAACAGAAGTTGCGGCAAGTTAGGATGATAGTCAAGGAACACGACCATTTAAAGCCTTGGGGCCAGGATGCGCATGTGAAGGTTGGTTGTCGTTTGATTCAGCTATTGATTGAAACAGCGTATATTCAACCTCCAATGGATCAGTTAGGAGGGGGTCCTCCTGATATTCGTCCCGCATTTGTCCATACTCTTAAAACCATCACAAAAGAAGCACAAAAGACTAGCAGAAGATATGGTGTTATTGAATGCGATCCACTTGTTCGCAGAGGCCTGGAGAAAACTGCAAGACACATGGTCATACCATATATGCCTATGCTGGTGCCTCCCCTTAATTGGACAGGGTATGATAAAGGAGCACACTTATTCCTACCATCATATGTTATGCGAATACATGGGGCAAAGCAGCAACGTGAAGCAGTTAAAAGGGTTCCAAAGAAACAACTAGAGCCTGTTTTTGAGGCACTTGATACTCTTGGAAGAACCAAGTGGAGGGTTCCTCTACCAGAGGAGCCAACTGTGGAAGATGAAGCAGAAATTCGAAAATGGAAGTGGAAACTCAAGGCTGCAAAAAAAGAGAATAGTGAGAGGCATTCACAGCGGTGTGACATTGAGCTTAAGCTTGCAGTGGCCAGGAAAATGAAAGAAGAGGAGGGCTTTTACTATCCTCACAACTTAGATTTTCGAGGTCGTGCATACCCAATGCATCCACATTTGAATCATCTTGGTTCTGATATGTGTCGAGGAATTCTAGAATTTGCGGAGGGACGGCCACTTGGCGAGTCAGGGTTACGCTGGTTGAAGATACATTTGGCAAATCTATACGCTGGTGGTGTGGACAAGTTATCTTACAAGGATCGAATATCATTTACTGAGAATCATTTGGATGAGATTTTTGATTCGGCAGACAGGCCTCTAGAAGGAAATCGTTGGTGGTTGGGCGCAGAGGATCCTTTTCAGTGCTTGGCAGTGTGTATTAATCTCTCAGAGGCTTTAAGAAGTCCATCGCCGGAAACAACTCTTTCCCATATGCCTGTACACCAGGATGGTTCCTGCAATGGCTTGCAACACTATGCAGCTCTCGGGAGGGACAAGTTGAGTCTTAATAACCCGGCAGGCGTCAATGCCATAGTTCTTGACATAATACGAAGTGATGCAGAGAAAGATCCTGCAACTAATCCAAATGCGTTGCATGCTAGACTTTTAATCAATCAGGTGGATCGTAAACTGGTGAAACAAACAGTCATGACATCCGTGTATGGTGTCACGTATATGGGTGCACGGGATCAGATTAAGAAAAGGTTGAAAGAACGAGCATCCATTGCTGATGATTCACAATTATTTGCAGCTTCTTGCTATGCAGCTAGAGTAATAGCTTCAGAAAATCAGCCAGTTCGATGGACAACTCCTCTTGGACTGCCAGTGGTACAACCTTACCGGCAACTAGGAAGACATCTTATCAAGACTTCCTTGCAAGTGTTGGCTCTACAACGAGAAACTGACAAGGTCATGGCTACGCGTCAGAAAACAGCTTTTCCTCCAAATTTTGTACACTCCCTTGATGGTTCTCATATGATGATGACTGCTGTCGCCTGCCGAAGGGCAGGCCTTAACTTTGCAGGTGTCCATGATTCCTATTGGACCCATGCATGCGATGTTGACGTAATGAACAGGCTACTAAGGGAGAAATTTGTTGAGCTATACGAGGCTCCTATTCTGGAGAATATACTTCAGTCGG
ATGCTCTTTCTCATATGTGCTACACGTCTGTGGCTGACATACAGTCCTGTCTACGACTCTTGAAATCAGACTCTGGAGGCGAGTTTGCTACTTCGAGAATCAAATCTTCACTTTTACATATATTCGACATAACAAAACCCGATGAAACTAAGGTCTTCCATGTTTGGAAACTAAGTCAGTCACAATTTCAAGCAGTCCAAACCAAGAAGAACACGACACCCGAACAAGAGAAATGCTTGGCTGCAGCAACCATGGCGGCCTCTGGTCCAATTGGAGGAGGAGGAGGTGCAGCCATGGCTGCCTCTGGGCTTCCGCCATTAAAGCAGCCAAGCAAGCAATAA

Protein sequence

MVSHCLRTFRRNCIFRLSVLRIFHLLHLRFELLELVVVGIRNPYYPRMWRNIVKRAASRRDTFFSRKVIEDSIFVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREVGFAKSNFAHSSDPVAFYGVLNHANGYATAAEAAISDEDLSGSEEIQELMEELSKQDKVESHFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYMKSLFLGWFEPLRDAIAADQEFCKKKSRVSHFAYFDLLPADMMAVITMHKLMGLLMTNSGGNSSVRVVQAACQIGEAIEHEVRIHKFFENMKKKKSNEKTTEGDTEPVVEDQEKVAKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTRMWRNIVKRAASRRDTFFSRKVIEDSISVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREVGFAKSNFAHSSEPVAFYGVLNHAKGYATAAEAAISEEDLSGSEEIQELMEELSKQDKVESHFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYIKSLFLGWFEPLRDAIAADQEYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVNKRVLSIIDRIWASGGRLADLVDRXTEGDTEPVVEDQEKVAKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTARHMVIPYMPMLVPPLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVPLPEEPTVEDEAEIRKWKWKLKAAKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGNRWWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSLNNPAGVNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTSVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAARVIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFPPNFVHSLDGSHMMMTAVACRRAGLNFAGVHDSYWTHACDVDVMNRLLREKFVELYEAPILENILQSDALSHMCYTSVADIQSCLRLLKSDSGGEFATSRIKSSLLHIFDITKPDETKVFHVWKLSQSQFQAVQTKKNTTPEQEKCLAAATMAASGPIGGGGGAAMAASGLPPLKQPSKQ
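
The relationship between the coding sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below assumes the standard nuclear genetic code and maps any codon containing an ambiguous base (such as N) to X; the function name `translate` and the short test fragments are illustrative only, copied from the start, the N-containing region, and the end of the CDS above.

```python
from itertools import product

# Standard genetic code in the compact TCAG ordering (assumption: nuclear code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence; codons not in the table become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))

print(translate("ATGGTTTCTCACTGCCTGAGAACATTTCGG"))  # first ten codons of the CDS
print(translate("GACCGCTNTACTGAA"))                 # fragment containing an N
print(translate("AAGCAGCCAAGCAAGCAATAA"))           # last seven codons of the CDS
```

The three fragments translate to residues that agree with the start, the X position, and the end of the protein sequence shown above, with `*` marking the stop codon.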
Homology
BLAST of Cp4.1LG09g02310 vs. ExPASy Swiss-Prot
Match: Q8L6J5 (DNA-directed RNA polymerase 1B, mitochondrial OS=Nicotiana tabacum OX=4097 GN=RPOT1-TOM PE=2 SV=2)

HSP 1 Score: 1117.1 bits (2888), Expect = 0.0e+00
Identity = 581/883 (65.80%), Positives = 670/883 (75.88%), Query Frame = 0

Query: 528  GVLNHAKGYATAAEAAI--SEEDLSGSEEIQELMEELSKQDKVESHFKQPK--KMVDGMR 587
            G L   + Y +AAEA +  SEED+   +EIQEL+EE+ K+++      QPK  K + GM 
Sbjct: 106  GSLGRLRSYGSAAEAIVSTSEEDI---DEIQELIEEMDKENEALKANLQPKQPKTIGGMG 165

Query: 588  VGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYIKSLFLGWFEPLRDAI 647
            VGKYN LR+RQIK+ETEAWEEAA+EYQELL DMCEQKLAPNLPY+KSLFLGWFEPLRDAI
Sbjct: 166  VGKYNFLRRRQIKVETEAWEEAAKEYQELLMDMCEQKLAPNLPYMKSLFLGWFEPLRDAI 225

Query: 648  AADQEY-----DKGAHL----FLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGR 707
            AA+Q+      ++GA+      LP+ +M +    +    +          V +A   +G 
Sbjct: 226  AAEQKLCDEGKNRGAYAPFFDQLPAEMMAVITMHKLMGLLMTGGGTGSARVVQAASYIGE 285

Query: 708  TKWRVNKRVLSIIDRIWASGGRLADLVDRXTEGDTEPVVEDQEKVAKEQDKLRKKVTNLM 767
                   R+   +++   S     DL +  T GD          + KE+++LRKKV  LM
Sbjct: 286  A-IEHEARIHRFLEKTKKSNALSGDLEE--TPGD----------MMKERERLRKKVKILM 345

Query: 768  KKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLGGGPPDIRPA 827
            KKQKLRQVR IVK+ D  KPWGQD  VKVGCRLIQ+L+ETAYIQPP DQL  GPPDIRPA
Sbjct: 346  KKQKLRQVRKIVKQQDDEKPWGQDNLVKVGCRLIQILMETAYIQPPNDQLDDGPPDIRPA 405

Query: 828  FVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTARHMVIPYMPMLVPPLNWTGYDKGA 887
            FVHTLKT+  E  K SRRYGVI+CDPLVR+GL+KTARHMVIPYMPMLVPP +W GYDKG 
Sbjct: 406  FVHTLKTV--ETMKGSRRYGVIQCDPLVRKGLDKTARHMVIPYMPMLVPPQSWLGYDKGG 465

Query: 888  HLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWR--------------- 947
            +LFLPSY+MR HGAKQQREAVKRVPKKQLEPVF+ALDTLG TKWR               
Sbjct: 466  YLFLPSYIMRTHGAKQQREAVKRVPKKQLEPVFQALDTLGNTKWRVNRKVLGIVDRIWAS 525

Query: 948  ------------VPLPEEPTVEDEAEIRKWKWKLKAAKKENSERHSQRCDIELKLAVARK 1007
                        VPLPE P  EDEAEIRKWKWK+K  KKEN ERHSQRCDIELKLAVARK
Sbjct: 526  GGRLADLVDREDVPLPEAPDTEDEAEIRKWKWKVKGVKKENCERHSQRCDIELKLAVARK 585

Query: 1008 MKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGILEFAEGRPLGESGLRWLKIHLAN 1067
            MK+E+GFYYPHNLDFRGRAYPMHP+LNHLGSD+CRGILEFAEGRPLG SGLRWLKIHLAN
Sbjct: 586  MKDEDGFYYPHNLDFRGRAYPMHPYLNHLGSDLCRGILEFAEGRPLGTSGLRWLKIHLAN 645

Query: 1068 LYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGNRWWLGAEDPFQCLAVCINLSEALR 1127
            +Y GGVDKLSY+ R++F+ENHL++IFDSA+RPLEG RWWLGAEDPFQCLA CIN++EALR
Sbjct: 646  VYGGGVDKLSYEGRVAFSENHLEDIFDSAERPLEGKRWWLGAEDPFQCLATCINIAEALR 705

Query: 1128 SPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSL--------NNPA----GVNAIVLDI 1187
            SPSPET +S+MP+HQDGSCNGLQHYAALGRDKL          + PA    G+ A VLDI
Sbjct: 706  SPSPETAISYMPIHQDGSCNGLQHYAALGRDKLGAAAVNLVAGDKPADVYSGIAARVLDI 765

Query: 1188 IRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTSVYGVTYMGARDQIKKRLKERASI 1247
            ++ DA KDPA +PN + ARLLINQVDRKLVKQTVMTSVYGVTY+GARDQIKKRLKER  I
Sbjct: 766  MKRDAAKDPANDPNVMRARLLINQVDRKLVKQTVMTSVYGVTYIGARDQIKKRLKERGVI 825

Query: 1248 ADDSQLFAASCYA-------------------------ARVIASENQPVRWTTPLGLPVV 1307
             DD++LFAA+CYA                         A++IA EN PVRWTTPLGLPVV
Sbjct: 826  EDDNELFAAACYAAKTTLTALGEMFEAARSIMSWLGDCAKIIAMENHPVRWTTPLGLPVV 885

Query: 1308 QPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFPPNFVHSLDGSHMMMTAVACRRAG 1334
            QPYR+LGRHLIKTSLQ+L LQRETDKVM  RQ+TAFPPNFVHSLDGSHMMMTA+AC+ +G
Sbjct: 886  QPYRKLGRHLIKTSLQILTLQRETDKVMVKRQRTAFPPNFVHSLDGSHMMMTAIACKESG 945
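
The percentages in the HSP summary lines of this Homology section are simple ratios of matching positions to alignment length. A minimal sketch that recomputes them from the counts reported for each hit (the helper name `percent` is hypothetical, and rounding to two decimals is assumed from the displayed values):

```python
def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)

# (identical positions, positive-scoring positions, alignment length)
# copied from the HSP summaries of the three Swiss-Prot hits.
hsps = {
    "Q8L6J5": (581, 670, 883),
    "Q93Y94": (576, 672, 883),
    "Q8VWF8": (578, 697, 975),
}
for accession, (identities, positives, length) in hsps.items():
    print(accession, percent(identities, length), percent(positives, length))
```

The recomputed values agree with the percentages printed in the summaries.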

BLAST of Cp4.1LG09g02310 vs. ExPASy Swiss-Prot
Match: Q93Y94 (DNA-directed RNA polymerase 1, mitochondrial OS=Nicotiana sylvestris OX=4096 GN=RPOT1 PE=2 SV=1)

HSP 1 Score: 1108.2 bits (2865), Expect = 0.0e+00
Identity = 576/883 (65.23%), Positives = 672/883 (76.10%), Query Frame = 0

Query: 528  GVLNHAKGYATAAE--AAISEEDLSGSEEIQELMEELSKQDKVESHFKQPK--KMVDGMR 587
            G L   + Y +AAE  A+ SEED+   +EIQEL+EE++K+++      QPK  K + GM 
Sbjct: 106  GSLGFLRSYGSAAEAIASTSEEDI---DEIQELIEEMNKENEALKTNLQPKQPKTIGGMG 165

Query: 588  VGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYIKSLFLGWFEPLRDAI 647
            VGKYN+LR+RQIK+ETEAWEEAA+EYQELL DMCEQKLAPNLPY+KSLFLGWFEPLRDAI
Sbjct: 166  VGKYNLLRRRQIKVETEAWEEAAKEYQELLMDMCEQKLAPNLPYMKSLFLGWFEPLRDAI 225

Query: 648  AADQEY-----DKGAHL----FLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGR 707
            AA+Q+      ++GA+      LP+ +M +    +    +          V +A   +G 
Sbjct: 226  AAEQKLCDEGKNRGAYAPFFDQLPAEMMAVITMHKLMGLLMTGGGTGSARVVQAASHIGE 285

Query: 708  TKWRVNKRVLSIIDRIWASGGRLADLVDRXTEGDTEPVVEDQEKVAKEQDKLRKKVTNLM 767
                   R+   +++   S     DL D  T GD          + KE++++RKKV  LM
Sbjct: 286  A-IEHEARIHRFLEKTKKSNALSGDLED--TPGD----------IMKERERVRKKVKILM 345

Query: 768  KKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLGGGPPDIRPA 827
            KKQKL+QVR IVK+ D  KPWGQD  VKVGCRLIQ+L+ETAYIQPP DQL   PPDIRPA
Sbjct: 346  KKQKLQQVRKIVKQQDDEKPWGQDNLVKVGCRLIQILMETAYIQPPNDQLDDCPPDIRPA 405

Query: 828  FVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTARHMVIPYMPMLVPPLNWTGYDKGA 887
            FVHTLKT+  E  K SRRYGVI+CDPLVR+GL+KTARHMVIPYMPMLVPP +W GYDKGA
Sbjct: 406  FVHTLKTV--ETMKGSRRYGVIQCDPLVRKGLDKTARHMVIPYMPMLVPPQSWLGYDKGA 465

Query: 888  HLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWR--------------- 947
            +LFLPSY+MR HGAKQQREAVKRVPKKQLEPVF+ALDTLG TKWR               
Sbjct: 466  YLFLPSYIMRTHGAKQQREAVKRVPKKQLEPVFQALDTLGNTKWRLNRKVLGIVDRIWAS 525

Query: 948  ------------VPLPEEPTVEDEAEIRKWKWKLKAAKKENSERHSQRCDIELKLAVARK 1007
                        VPLPEEP  EDEA+IRKWKWK+K  KKEN ERHSQRCDIELKLAVARK
Sbjct: 526  GGRLADLVDREDVPLPEEPDAEDEAQIRKWKWKVKGVKKENCERHSQRCDIELKLAVARK 585

Query: 1008 MKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGILEFAEGRPLGESGLRWLKIHLAN 1067
            MK+E+GFYYPHNLDFRGRAYPMHP+LNHLGSD+CRGILEFAEGRPLG+SGLRWLKIHLAN
Sbjct: 586  MKDEDGFYYPHNLDFRGRAYPMHPYLNHLGSDLCRGILEFAEGRPLGKSGLRWLKIHLAN 645

Query: 1068 LYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGNRWWLGAEDPFQCLAVCINLSEALR 1127
            +Y GGVDKLSY+ R++F+ENH+++IFDSA+RPLEG RWWLGAEDPFQCLA CIN++EALR
Sbjct: 646  VYGGGVDKLSYEGRVAFSENHVEDIFDSAERPLEGKRWWLGAEDPFQCLATCINIAEALR 705

Query: 1128 SPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSL--------NNPA----GVNAIVLDI 1187
            SPSPET +S+MP+HQDGSCNGLQHYAALGRD L          + PA    G+ A VLDI
Sbjct: 706  SPSPETAISYMPIHQDGSCNGLQHYAALGRDTLGAAAVNLVAGDKPADVYSGIAARVLDI 765

Query: 1188 IRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTSVYGVTYMGARDQIKKRLKERASI 1247
            ++ DA KDPA +PN + ARLLINQVDRKLVKQTVMTSVYGVTY+GARDQIK+RLKER  I
Sbjct: 766  MKRDAAKDPANDPNVMRARLLINQVDRKLVKQTVMTSVYGVTYIGARDQIKRRLKERGVI 825

Query: 1248 ADDSQLFAASCYA-------------------------ARVIASENQPVRWTTPLGLPVV 1307
             DD++LFAA+CYA                         A++IA EN PVRWTTPLGLPVV
Sbjct: 826  EDDNELFAAACYAAKTTLTALGEMFEAARSIMSWLGDCAKIIAMENHPVRWTTPLGLPVV 885

Query: 1308 QPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFPPNFVHSLDGSHMMMTAVACRRAG 1334
            QPYR+LGRHLIKTSLQ+L LQRETDKVM  RQ+TAFPPNFVHSLDGSHMMMTA+AC+ +G
Sbjct: 886  QPYRKLGRHLIKTSLQILTLQRETDKVMVKRQRTAFPPNFVHSLDGSHMMMTAIACKESG 945

BLAST of Cp4.1LG09g02310 vs. ExPASy Swiss-Prot
Match: Q8VWF8 (DNA-directed RNA polymerase 2, chloroplastic/mitochondrial OS=Nicotiana sylvestris OX=4096 GN=RPOT2 PE=2 SV=2)

HSP 1 Score: 1077.4 bits (2785), Expect = 0.0e+00
Identity = 578/975 (59.28%), Positives = 697/975 (71.49%), Query Frame = 0

Query: 451  MWRNIVKRAASR--RDTFFSRK--------VIEDSISVEEIRSSRTLGSLKSCCLLEGSR 510
            MWRNI+K+ +SR  +   FS K          +DSI  +  +  R+L  +    ++ G +
Sbjct: 35   MWRNIIKQLSSRTPQKLLFSSKNRTYSFLGFGQDSIFKDNTK-FRSLIPISCSNIVMGFQ 94

Query: 511  GISFFHSREVGFAKSNFAHSSEPVAFYGVLNH---AKGYATAAEAAI-----SEEDLSGS 570
             +  +   +           S P+    V N+    K YA+ AEA       +EED+S  
Sbjct: 95   NLGEYLPGD--------EFLSRPLIKNQVNNNFCCRKSYASVAEAVAVSSTDAEEDVSVV 154

Query: 571  EEIQELMEELSKQDKVESHFKQPKK--MVDGMRVGKYNILRKRQIKMETEAWEEAAREYQ 630
            +E+ EL+ EL K++K +  F++ K+  +  GM   KY  L++RQ+K+ETEAWE+AA+EY+
Sbjct: 155  DEVHELLTELKKEEKKQFAFRRRKQRMLTSGMGHRKYQTLKRRQVKVETEAWEQAAKEYK 214

Query: 631  ELLTDMCEQKLAPNLPYIKSLFLGWFEPLRDAIAADQEYDKGAHLFLPSYVMRIHGAKQQ 690
            ELL DMCEQKLAPNLPY+KSLFLGWFEPLRD IA +QE          +Y    +     
Sbjct: 215  ELLFDMCEQKLAPNLPYVKSLFLGWFEPLRDKIAEEQELCSQGK-SKAAYAKYFYQLPAD 274

Query: 691  REAVKRVPKKQLEPVFEALDTLG-RTKWRVNKRVLSIIDRIWASGGRLADLVD--RXTEG 750
              AV  + K     +   L T G     RV +  L I D I     R+ + ++  +  + 
Sbjct: 275  MMAVITMHK-----LMGLLMTGGDHGTARVVQAALVIGDAI-EQEVRIHNFLEKTKKQKA 334

Query: 751  DTEPVVEDQEKVAKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRL 810
            + +   ED E V +EQ+KLRKKVTNLMKKQKLR V  IV+  D  KPWGQDA  KVG RL
Sbjct: 335  EKDKQKEDGEHVTQEQEKLRKKVTNLMKKQKLRAVGQIVRRQDDSKPWGQDARAKVGSRL 394

Query: 811  IQLLIETAYIQPPMDQLGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLE 870
            I LL++TAYIQPP +QL   PPDIRPAFVH+++T+ KE +  SRRYG+I+CD LV +GLE
Sbjct: 395  IDLLLQTAYIQPPANQLAVDPPDIRPAFVHSVRTVAKETKSASRRYGIIQCDELVFKGLE 454

Query: 871  KTARHMVIPYMPMLVPPLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVF 930
            +TARHMVIPYMPMLVPP+ WTGYDKG HL+LPSYVMR HGA+QQREAVKR  + QL+PVF
Sbjct: 455  RTARHMVIPYMPMLVPPVKWTGYDKGGHLYLPSYVMRTHGARQQREAVKRASRNQLQPVF 514

Query: 931  EALDTLGRTKWRV---------------------------PLPEEPTVEDEAEIRKWKWK 990
            EALDTLG TKWR+                           PLPEEP  EDEA   KW+WK
Sbjct: 515  EALDTLGNTKWRINKRVLSVVDRIWAGGGRLADLVDRDDAPLPEEPDTEDEALRTKWRWK 574

Query: 991  LKAAKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDM 1050
            +K+ KKEN ERHSQRCDIELKLAVARKMK+EE F+YPHN+DFRGRAYPMHPHLNHLGSD+
Sbjct: 575  VKSVKKENRERHSQRCDIELKLAVARKMKDEESFFYPHNVDFRGRAYPMHPHLNHLGSDI 634

Query: 1051 CRGILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPL 1110
            CRG+LEFAEGRPLGESGLRWLKIHLANL+AGGV+KLS + RI FTENH+D+IFDS+D+PL
Sbjct: 635  CRGVLEFAEGRPLGESGLRWLKIHLANLFAGGVEKLSLEGRIGFTENHMDDIFDSSDKPL 694

Query: 1111 EGNRWWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKL 1170
            EG RWWL AEDPFQCLAVCINLSEA+RS SPET++SH+PVHQDGSCNGLQHYAALGRDKL
Sbjct: 695  EGRRWWLNAEDPFQCLAVCINLSEAVRSSSPETSVSHIPVHQDGSCNGLQHYAALGRDKL 754

Query: 1171 SL--------NNPA----GVNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQT 1230
                        PA    G+ A VLDI++ DA++DPA  P+A+ AR+L+NQVDRKLVKQT
Sbjct: 755  GAAAVNLVAGEKPADVYSGIAARVLDIMKRDAQRDPAEFPDAVRARVLVNQVDRKLVKQT 814

Query: 1231 VMTSVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAARV----------------- 1290
            VMTSVYGVTY+GARDQIK+RLKER +IADDS+LF A+CYAA+V                 
Sbjct: 815  VMTSVYGVTYIGARDQIKRRLKERGAIADDSELFGAACYAAKVTLTALGEMFEAARSIMT 874

Query: 1291 --------IASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQK 1339
                    IASEN+PVRWTTPLGLPVVQPYR++GRHLIKTSLQ+L LQRET+KVM  RQ+
Sbjct: 875  WLGECAKIIASENEPVRWTTPLGLPVVQPYRKIGRHLIKTSLQILTLQRETEKVMVKRQR 934

BLAST of Cp4.1LG09g02310 vs. ExPASy Swiss-Prot
Match: Q8L6J3 (DNA-directed RNA polymerase 2B, chloroplastic/mitochondrial OS=Nicotiana tabacum OX=4097 GN=RPOT2-TOM PE=2 SV=2)

HSP 1 Score: 1073.2 bits (2774), Expect = 2.9e-312
Identity = 572/972 (58.85%), Positives = 697/972 (71.71%), Query Frame = 0

Query: 451  MWRNIVKRAASR--RDTFFSRK--------VIEDSISVEEIRSSRTLGSLKSCCLLEGSR 510
            MWRNI+K+ +SR  +   FS K          +DS+  +  +  R+L  +    ++ G +
Sbjct: 35   MWRNIIKQLSSRTPQKLLFSSKNRTYSFLGFGQDSVFKDNTK-FRSLIPISCSNIVMGFQ 94

Query: 511  GISFFHSREVGFAKSNFAHSSEPVAFYGVLNHAKGYATAAEAAI-----SEEDLSGSEEI 570
             +  +   +   ++    +      F       K YA+ AEA       +EED+S  +E+
Sbjct: 95   NLGEYLPGDEFLSRPLLKNQVNSNDFC----CRKSYASVAEAVAVSSTDAEEDVSVVDEV 154

Query: 571  QELMEELSKQDKVESHFKQPKK--MVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELL 630
            QEL+ EL K++K +  F++ K+  +  GM   KY  L++RQ+K+ETEAWE+AA+EY+ELL
Sbjct: 155  QELLTELKKEEKKQFAFRRRKQRMLTSGMGHRKYQTLKRRQVKVETEAWEQAAKEYKELL 214

Query: 631  TDMCEQKLAPNLPYIKSLFLGWFEPLRDAIAADQEYDKGAHLFLPSYVMRIHGAKQQREA 690
             DMCEQKLAPNLPY+KSLFLGWFEPLRD IA +QE          +Y   ++       A
Sbjct: 215  FDMCEQKLAPNLPYVKSLFLGWFEPLRDKIAEEQELCSQGK-SKAAYAKYLYQLPADMMA 274

Query: 691  VKRVPKKQLEPVFEALDTLG-RTKWRVNKRVLSIIDRIWASGGRLADLVD--RXTEGDTE 750
            V  + K     +   L T G     RV +  L I D I     R+ + ++  +  + + +
Sbjct: 275  VITMHK-----LMGLLMTGGDHGTARVVQAALVIGDAI-EQEVRIHNFLEKTKKQKAEKD 334

Query: 751  PVVEDQEKVAKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQL 810
               ED E V +EQ+KLRKKVTNLMKKQKLR V  IV+  D  KPWGQDA  KVG RLI+L
Sbjct: 335  KQKEDGEHVTQEQEKLRKKVTNLMKKQKLRAVGQIVRRQDDSKPWGQDAKAKVGSRLIEL 394

Query: 811  LIETAYIQPPMDQLGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTA 870
            L++TAYIQPP +QL   PPDIRPAF+H+++T+ KE +  SRRYG+I+CD LV +GLE+TA
Sbjct: 395  LLQTAYIQPPANQLAVDPPDIRPAFLHSVRTVAKETKSASRRYGIIQCDELVFKGLERTA 454

Query: 871  RHMVIPYMPMLVPPLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEAL 930
            RHMVIPYMPMLVPP+ WTGYDKG HL+LPSYVMR HGA+QQREAVKR  + QL+PVFEAL
Sbjct: 455  RHMVIPYMPMLVPPVKWTGYDKGGHLYLPSYVMRTHGARQQREAVKRASRNQLQPVFEAL 514

Query: 931  DTLGRTKWRV---------------------------PLPEEPTVEDEAEIRKWKWKLKA 990
            DTLG TKWR+                           PLPEEP  EDEA   KW+WK+K+
Sbjct: 515  DTLGSTKWRINKRVLSVIDRIWAGGGRLADLVDRDDAPLPEEPDTEDEALRTKWRWKVKS 574

Query: 991  AKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRG 1050
             KKEN ERHSQRCDIELKLAVARKMK+EEGF+YPHN+DFRGRAYPMHPHLNHLGSD+CRG
Sbjct: 575  VKKENRERHSQRCDIELKLAVARKMKDEEGFFYPHNVDFRGRAYPMHPHLNHLGSDICRG 634

Query: 1051 ILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGN 1110
            +L FAEGRPLGESGLRWLKIHLANL+AGGV+KLS + RI+FTENH+D+IFDSAD+PLEG 
Sbjct: 635  VLVFAEGRPLGESGLRWLKIHLANLFAGGVEKLSLEGRIAFTENHMDDIFDSADKPLEGR 694

Query: 1111 RWWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSL- 1170
            RWWL AEDPFQCLAVCINLSEA+RS SPET++SH+PVHQDGSCNGLQHYAALGRD+L   
Sbjct: 695  RWWLNAEDPFQCLAVCINLSEAVRSSSPETSISHIPVHQDGSCNGLQHYAALGRDELGAA 754

Query: 1171 -------NNPA----GVNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMT 1230
                     PA    G+ A VLDI++ DA++DPA  P+A+ AR L+NQVDRKLVKQTVMT
Sbjct: 755  AVNLVAGEKPADVYSGIAARVLDIMKRDAQRDPAEFPDAVRARALVNQVDRKLVKQTVMT 814

Query: 1231 SVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAARV-------------------- 1290
            SVYGVTY+GARDQIK+RLKER +IADDS+LF A+CYAA+V                    
Sbjct: 815  SVYGVTYIGARDQIKRRLKERGAIADDSELFGAACYAAKVTLTALGEMFEAARSIMTWLG 874

Query: 1291 -----IASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAF 1339
                 IASEN+PVRWTTPLGLPVVQPYR++GRHLIKTSLQ+L LQ+ET+KVM  RQ+TAF
Sbjct: 875  ECAKIIASENEPVRWTTPLGLPVVQPYRKIGRHLIKTSLQILTLQQETEKVMVKRQRTAF 934

BLAST of Cp4.1LG09g02310 vs. ExPASy Swiss-Prot
Match: P92969 (DNA-directed RNA polymerase 1, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=RPOT1 PE=2 SV=1)

HSP 1 Score: 1034.6 bits (2674), Expect = 1.0e-300
Identity = 556/966 (57.56%), Positives = 669/966 (69.25%), Query Frame = 0

Query: 451  MWRNIVKRAASRRDTFFSRKVIEDSISVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREV 510
            MWRNI+ RA+ R+  F S    + S S      +R  G L S  L     G+S     E+
Sbjct: 1    MWRNILGRASLRKVKFLS----DSSSSGTHYPVNRVRGILSSVNLSGVRNGLSINPVNEM 60

Query: 511  GFAKSNFAHSSEPVAFYGVLNHAKGYATAAEAAIS---EEDLSGSEEIQELMEELSKQ-- 570
            G   S+F H        G     +GYATAA+A  S   E++ SGS+E+ EL+ E+ K+  
Sbjct: 61   G-GLSSFRH--------GQCYVFEGYATAAQAIDSTDPEDESSGSDEVNELITEMEKETE 120

Query: 571  ---DKVESHFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLA 630
                K       PK+++ GM   K+ +L++RQ+KMETE WE AARE +E+L DMCEQKLA
Sbjct: 121  RIRKKARLAAIPPKRVIAGMGAQKFYMLKQRQVKMETEEWERAARECREILADMCEQKLA 180

Query: 631  PNLPYIKSLFLGWFEPLRDAIAADQEYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQ- 690
            PNLPY+KSLFLGWFEP+R+AI  D +  K     +P Y   +      + AV  + K   
Sbjct: 181  PNLPYMKSLFLGWFEPVRNAIQDDLDTFKIKKGKIP-YAPFMEQLPADKMAVITMHKMMG 240

Query: 691  ----------LEPVFEALDTLGRTKWRVNKRVLSIIDRIWASGGRLADLVDRXTEGDTEP 750
                      +  +  A   +G        R+ S + +         +  D+    + E 
Sbjct: 241  LLMTNAEGVGIVKLVNAATQIGEAV-EQEVRINSFLQK-----KNKKNATDKTINTEAEN 300

Query: 751  VVEDQEKVAKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLL 810
            V E  E VAKE +K RK+VT LM+K KLRQV+ +V++HD  KPWGQ+A VKVG RLIQLL
Sbjct: 301  VSE--EIVAKETEKARKQVTVLMEKNKLRQVKALVRKHDSFKPWGQEAQVKVGARLIQLL 360

Query: 811  IETAYIQPPMDQLGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTAR 870
            +E AYIQPP +Q   GPPDIRPAF    +T+T E  KTSRRYG IECDPLV +GL+K+AR
Sbjct: 361  MENAYIQPPAEQFDDGPPDIRPAFKQNFRTVTLENTKTSRRYGCIECDPLVLKGLDKSAR 420

Query: 871  HMVIPYMPMLVPPLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALD 930
            HMVIPY+PML+PP NWTGYD+GAH FLPSYVMR HGAKQQR  +KR PK+QLEPV+EALD
Sbjct: 421  HMVIPYLPMLIPPQNWTGYDQGAHFFLPSYVMRTHGAKQQRTVMKRTPKEQLEPVYEALD 480

Query: 931  TLGRTKWR---------------------------VPLPEEPTVEDEAEIRKWKWKLKAA 990
            TLG TKW+                           VP+PEEP  ED+ + + W+W+ K A
Sbjct: 481  TLGNTKWKINKKVLSLVDRIWANGGRIGGLVDREDVPIPEEPEREDQEKFKNWRWESKKA 540

Query: 991  KKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGI 1050
             K+N+ERHSQRCDIELKL VARKMK+EEGFYYPHN+DFRGRAYP+HP+LNHLGSD+CRGI
Sbjct: 541  IKQNNERHSQRCDIELKLEVARKMKDEEGFYYPHNVDFRGRAYPIHPYLNHLGSDLCRGI 600

Query: 1051 LEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGNR 1110
            LEF EG+PLG+SGLRWLKIH+ANLYAGGVDKL+Y+DRI+FTE+HL++IFDS+DRPLEG R
Sbjct: 601  LEFCEGKPLGKSGLRWLKIHIANLYAGGVDKLAYEDRIAFTESHLEDIFDSSDRPLEGKR 660

Query: 1111 WWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSLN- 1170
            WWL AEDPFQCLA CINLSEALRSP PE  +SH+P+HQDGSCNGLQHYAALGRDKL  + 
Sbjct: 661  WWLNAEDPFQCLAACINLSEALRSPFPEAAISHIPIHQDGSCNGLQHYAALGRDKLGADA 720

Query: 1171 -------NPAGV----NAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTS 1230
                    PA V     A VL I++ DAE+DP T PNA +A+L+++QVDRKLVKQTVMTS
Sbjct: 721  VNLVTGEKPADVYTEIAARVLKIMQQDAEEDPETFPNATYAKLMLDQVDRKLVKQTVMTS 780

Query: 1231 VYGVTYMGARDQIKKRLKERASIADDSQLFAASCYA------------------------ 1290
            VYGVTY GARDQIKKRLKER +  DDS  F ASCYA                        
Sbjct: 781  VYGVTYSGARDQIKKRLKERGTFEDDSLTFHASCYAAKITLKALEEMFEAARAIKSWFGD 840

Query: 1291 -ARVIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFP 1334
             A++IASEN  V WTTPLGLPVVQPYR+ GRHL+KT+LQVL L RETDKVMA RQ TAF 
Sbjct: 841  CAKIIASENNAVCWTTPLGLPVVQPYRKPGRHLVKTTLQVLTLSRETDKVMARRQMTAFA 900

BLAST of Cp4.1LG09g02310 vs. NCBI nr
Match: KAG6574029.1 (DNA-directed RNA polymerase 2, chloroplastic/mitochondrial, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1596 bits (4132), Expect = 0.0
Identity = 907/1386 (65.44%), Positives = 931/1386 (67.17%), Query Frame = 0

Query: 28   LRFELLELVVVGIRNPYYPRMWRNIVKRAASRRDTFFSRKVIEDSIFVEEIRSSRTLGSL 87
            LRFELLELVVVGIRNPYY RMWRNIVKRAASRRDTF SRKVIEDSIFVEEIRSSRTLGSL
Sbjct: 365  LRFELLELVVVGIRNPYYQRMWRNIVKRAASRRDTFLSRKVIEDSIFVEEIRSSRTLGSL 424

Query: 88   KSCCLLEGSRGISFFHSREVGFAKSNFAHSSDPVAFYGVLNHANGYATAAEAAISDEDLS 147
            KSCCLLEGSRGIS FHSREVGFAKSNFAHSSDPVAFYGVLNHANGYATAAEAAIS+EDLS
Sbjct: 425  KSCCLLEGSRGISLFHSREVGFAKSNFAHSSDPVAFYGVLNHANGYATAAEAAISEEDLS 484

Query: 148  GSEEIQELMEELSKQDKVESHFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQ 207
            GSEEIQELMEELSKQDKVESHF+QPKKMVDG+ VGKYNILRKRQIKMETEAWEEAAREYQ
Sbjct: 485  GSEEIQELMEELSKQDKVESHFQQPKKMVDGVGVGKYNILRKRQIKMETEAWEEAAREYQ 544

Query: 208  ELLTDMCEQKLAPNLPYMKSLFLGWFEPLRDAIAADQEFCKKKSRVSHFAYFDLLPADMM 267
            ELLTDMCEQKLAPNLPY+KSLFLGWFEPLRDAIAADQEFCKKKSRVSH  YFDLLPADMM
Sbjct: 545  ELLTDMCEQKLAPNLPYIKSLFLGWFEPLRDAIAADQEFCKKKSRVSHAPYFDLLPADMM 604

Query: 268  AVITMHKLMGLLMTNSGGNSSVRVVQAACQIGEAIEHEVRIHKFFENMKKKKSNEKTTEG 327
            AVITMHKLMGLLMT+SGG+SSVRVVQAACQIGEAIEHEVRIHKFFEN KKKKSNEKTTEG
Sbjct: 605  AVITMHKLMGLLMTSSGGSSSVRVVQAACQIGEAIEHEVRIHKFFENTKKKKSNEKTTEG 664

Query: 328  DTEPVVEDQEKVAKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRL 387
            DTEPVVEDQ+K+AKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRL
Sbjct: 665  DTEPVVEDQKKLAKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRL 724

Query: 388  IQLLIETAYIQPPMDQLGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLE 447
            IQLLIETAYIQPPMDQLGGGPPDIRPAFVHTLKTIT E QKTSRRYGVIECDPLVRRGLE
Sbjct: 725  IQLLIETAYIQPPMDQLGGGPPDIRPAFVHTLKTITTEVQKTSRRYGVIECDPLVRRGLE 784

Query: 448  KTRMWRNIVKRAASRRDTFFSRKVIEDSISVEEIRSSRTLGSLKSCCLLEGSRGISFFHS 507
            KT   R+IV                                                   
Sbjct: 785  KTA--RHIV--------------------------------------------------- 844

Query: 508  REVGFAKSNFAHSSEPVAFYGVLNHAKGYATAAEAAISEEDLSGSEEIQELMEELSKQDK 567
                                                                        
Sbjct: 845  ------------------------------------------------------------ 904

Query: 568  VESHFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPY 627
                                                                     +PY
Sbjct: 905  ---------------------------------------------------------IPY 964

Query: 628  IKSLF--LGWFEPLRDAIAADQEYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPV 687
            +  L   L W             YDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPV
Sbjct: 965  MPMLVPPLNW-----------TGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPV 1024

Query: 688  FEALDTLGRTKWRVNKRVLSIIDRIWASGGRLADLVDRXTEGDTEPVVEDQEKVAKEQDK 747
            FEALDTLGRTKWRVNKRVLSIIDRIWASGGRLADLVDR                      
Sbjct: 1025 FEALDTLGRTKWRVNKRVLSIIDRIWASGGRLADLVDR---------------------- 1084

Query: 748  LRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLG 807
                                           +DA                          
Sbjct: 1085 -------------------------------EDA-------------------------- 1144

Query: 808  GGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTARHMVIPYMPMLVPPL 867
                                                                        
Sbjct: 1145 ------------------------------------------------------------ 1204

Query: 868  NWTGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVPLPEE 927
                                                                   PLPEE
Sbjct: 1205 -------------------------------------------------------PLPEE 1264

Query: 928  PTVEDEAEIRKWKWKLKAAKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGR 987
            PTVEDE EIRKWKWKLKAAKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGR
Sbjct: 1265 PTVEDEEEIRKWKWKLKAAKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGR 1324

Query: 988  AYPMHPHLNHLGSDMCRGILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFT 1047
            AYPMHPHLNHLGSDMCRGILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDR+SFT
Sbjct: 1325 AYPMHPHLNHLGSDMCRGILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRMSFT 1375

Query: 1048 ENHLDEIFDSADRPLEGNRWWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGS 1107
            ENHLDEIFDSADRPLEGNRWWLGAEDPFQCLAVCINLSEALRS SPETTLSHMPVHQDGS
Sbjct: 1385 ENHLDEIFDSADRPLEGNRWWLGAEDPFQCLAVCINLSEALRSSSPETTLSHMPVHQDGS 1375

Query: 1108 CNGLQHYAALGRDKLSL--------NNPA----GVNAIVLDIIRSDAEKDPATNPNALHA 1167
            CNGLQHYAALGRDKL          + PA    G+ A VLDI+RSDAEKDPATNPNALHA
Sbjct: 1445 CNGLQHYAALGRDKLGAAAVNLVAGDKPADVYSGIAARVLDIMRSDAEKDPATNPNALHA 1375

Query: 1168 RLLINQVDRKLVKQTVMTSVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAAR--- 1227
            RLLINQVDRKLVKQTVMTSVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAAR   
Sbjct: 1505 RLLINQVDRKLVKQTVMTSVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAARTTL 1375

Query: 1228 ----------------------VIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVL 1287
                                  VIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVL
Sbjct: 1565 TALGEMFEAARSIMSWLGECAKVIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVL 1375

Query: 1288 ALQRETDKVMATRQKTAFPPNFVHSLDGSHMMMTAVACRRAGLNFAGVHDSYWTHACDVD 1347
            ALQRETDKVMATRQKTAFPPNFVHSLDGSHMMMTAVACRRAGLNFAGVHDSYWTHACDVD
Sbjct: 1625 ALQRETDKVMATRQKTAFPPNFVHSLDGSHMMMTAVACRRAGLNFAGVHDSYWTHACDVD 1375

Query: 1348 VMNRLLREKFVELYEAPILENILQS--DALSHMCYTSVADI------QSCLRLLKSDSGG 1366
            VMNRLLREKFVELYEAPILEN+L++   +   + +  + D        SCLRLLKSDSGG
Sbjct: 1685 VMNRLLREKFVELYEAPILENLLKTFQKSFPSLKFPPLPDRGDFDLKDSCLRLLKSDSGG 1375

BLAST of Cp4.1LG09g02310 vs. NCBI nr
Match: XP_023542839.1 (DNA-directed RNA polymerase 1B, mitochondrial-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1575 bits (4078), Expect = 0.0
Identity = 888/1325 (67.02%), Positives = 899/1325 (67.85%), Query Frame = 0

Query: 48   MWRNIVKRAASRRDTFFSRKVIEDSIFVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREV 107
            MWRNIVKRAASRRDTFFSRKVIEDSIFVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREV
Sbjct: 1    MWRNIVKRAASRRDTFFSRKVIEDSIFVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREV 60

Query: 108  GFAKSNFAHSSDPVAFYGVLNHANGYATAAEAAISDEDLSGSEEIQELMEELSKQDKVES 167
            GFAKSNFAHSSDPVAFYGVLNHANGYATAAEAAISDEDLSGSEEIQELMEELSKQDKVES
Sbjct: 61   GFAKSNFAHSSDPVAFYGVLNHANGYATAAEAAISDEDLSGSEEIQELMEELSKQDKVES 120

Query: 168  HFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYMKS 227
            HFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYMKS
Sbjct: 121  HFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYMKS 180

Query: 228  LFLGWFEPLRDAIAADQEFCKKKSRVSHFAYFDLLPADMMAVITMHKLMGLLMTNSGGNS 287
            LFLGWFEPLRDAIAADQEFCKKKSRVSHFAYFDLLPADMMAVITMHKLMGLLMTNSGGNS
Sbjct: 181  LFLGWFEPLRDAIAADQEFCKKKSRVSHFAYFDLLPADMMAVITMHKLMGLLMTNSGGNS 240

Query: 288  SVRVVQAACQIGEAIEHEVRIHKFFENMKKKKSNEKTTEGDTEPVVEDQEKVAKEQDKLR 347
            SVRVVQAACQIGEAIEHEVRIHKFFENMKKKKSNEKTTEGDTEPVVEDQEKVAKEQDKLR
Sbjct: 241  SVRVVQAACQIGEAIEHEVRIHKFFENMKKKKSNEKTTEGDTEPVVEDQEKVAKEQDKLR 300

Query: 348  KKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLGGG 407
            KKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLGGG
Sbjct: 301  KKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLGGG 360

Query: 408  PPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTRMWRNIVKRAASRRDTFF 467
            PPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKT                  
Sbjct: 361  PPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKT------------------ 420

Query: 468  SRKVIEDSISVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREVGFAKSNFAHSSEPVAFY 527
            +R ++                                                       
Sbjct: 421  ARHMV------------------------------------------------------- 480

Query: 528  GVLNHAKGYATAAEAAISEEDLSGSEEIQELMEELSKQDKVESHFKQPKKMVDGMRVGKY 587
                                                                        
Sbjct: 481  ------------------------------------------------------------ 540

Query: 588  NILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYIKSLF--LGWFEPLRDAIAA 647
                                                 +PY+  L   L W          
Sbjct: 541  -------------------------------------IPYMPMLVPPLNW---------- 600

Query: 648  DQEYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVNKRVLS 707
               YDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVNKRVLS
Sbjct: 601  -TGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVNKRVLS 660

Query: 708  IIDRIWASGGRLADLVDRXTEGDTEPVVEDQEKVAKEQDKLRKKVTNLMKKQKLRQVRMI 767
            IIDRIWASGGRLADLVDR                                          
Sbjct: 661  IIDRIWASGGRLADLVDR------------------------------------------ 720

Query: 768  VKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLGGGPPDIRPAFVHTLKTITKE 827
                       +D                                               
Sbjct: 721  -----------ED----------------------------------------------- 780

Query: 828  AQKTSRRYGVIECDPLVRRGLEKTARHMVIPYMPMLVPPLNWTGYDKGAHLFLPSYVMRI 887
                                                                        
Sbjct: 781  ------------------------------------------------------------ 840

Query: 888  HGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVPLPEEPTVEDEAEIRKWKWKLKAAK 947
                                              VPLPEEPTVEDEAEIRKWKWKLKAAK
Sbjct: 841  ----------------------------------VPLPEEPTVEDEAEIRKWKWKLKAAK 900

Query: 948  KENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGIL 1007
            KENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGIL
Sbjct: 901  KENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGIL 950

Query: 1008 EFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGNRW 1067
            EFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGNRW
Sbjct: 961  EFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGNRW 950

Query: 1068 WLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSL--- 1127
            WLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKL     
Sbjct: 1021 WLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLGAAAV 950

Query: 1128 -----NNPA----GVNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTSV 1187
                 + PA    G+ A VLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTSV
Sbjct: 1081 NLVAGDKPADVYSGIAARVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTSV 950

Query: 1188 YGVTYMGARDQIKKRLKERASIADDSQLFAASCYAAR----------------------- 1247
            YGVTYMGARDQIKKRLKERASIADDSQLFAASCYAAR                       
Sbjct: 1141 YGVTYMGARDQIKKRLKERASIADDSQLFAASCYAARTTLTALGEMFEAARSIMSWLGEC 950

Query: 1248 --VIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFPP 1307
              VIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFPP
Sbjct: 1201 AKVIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFPP 950

Query: 1308 NFVHSLDGSHMMMTAVACRRAGLNFAGVHDSYWTHACDVDVMNRLLREKFVELYEAPILE 1333
            NFVHSLDGSHMMMTAVACRRAGLNFAGVHDSYWTHACDVDVMNRLLREKFVELYEAPILE
Sbjct: 1261 NFVHSLDGSHMMMTAVACRRAGLNFAGVHDSYWTHACDVDVMNRLLREKFVELYEAPILE 950

BLAST of Cp4.1LG09g02310 vs. NCBI nr
Match: XP_022944908.1 (DNA-directed RNA polymerase 1B, mitochondrial-like [Cucurbita moschata])

HSP 1 Score: 1549 bits (4010), Expect = 0.0
Identity = 874/1325 (65.96%), Positives = 893/1325 (67.40%), Query Frame = 0

Query: 48   MWRNIVKRAASRRDTFFSRKVIEDSIFVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREV 107
            MWRNIVKRAASRRDTFFSRKVIEDSIFVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREV
Sbjct: 1    MWRNIVKRAASRRDTFFSRKVIEDSIFVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREV 60

Query: 108  GFAKSNFAHSSDPVAFYGVLNHANGYATAAEAAISDEDLSGSEEIQELMEELSKQDKVES 167
            GFAKSNFAHSSDPVAFYGVLNHANGYATAAEAAIS+EDLSGSEEIQELMEELSKQDKVES
Sbjct: 61   GFAKSNFAHSSDPVAFYGVLNHANGYATAAEAAISEEDLSGSEEIQELMEELSKQDKVES 120

Query: 168  HFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYMKS 227
            +F+QPKKMVDGM VGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYMKS
Sbjct: 121  YFQQPKKMVDGMGVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYMKS 180

Query: 228  LFLGWFEPLRDAIAADQEFCKKKSRVSHFAYFDLLPADMMAVITMHKLMGLLMTNSGGNS 287
            LFLGWFEPLRDAIAADQEFCKKKSRVSH  YFDLLPADMMAVITMHKLMGLLMT+SGGNS
Sbjct: 181  LFLGWFEPLRDAIAADQEFCKKKSRVSHAPYFDLLPADMMAVITMHKLMGLLMTSSGGNS 240

Query: 288  SVRVVQAACQIGEAIEHEVRIHKFFENMKKKKSNEKTTEGDTEPVVEDQEKVAKEQDKLR 347
            SVRVVQAACQIGEAIEHEVRIHKFFEN KKKKSNEKTTEGDTEPVV+DQEK+AKEQDKLR
Sbjct: 241  SVRVVQAACQIGEAIEHEVRIHKFFENTKKKKSNEKTTEGDTEPVVKDQEKLAKEQDKLR 300

Query: 348  KKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLGGG 407
            KKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLGGG
Sbjct: 301  KKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLGGG 360

Query: 408  PPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTRMWRNIVKRAASRRDTFF 467
            PPDIRPAFVHTLKTIT EAQKTSRRYGVIECDPLVRRGLEKT                  
Sbjct: 361  PPDIRPAFVHTLKTITTEAQKTSRRYGVIECDPLVRRGLEKT------------------ 420

Query: 468  SRKVIEDSISVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREVGFAKSNFAHSSEPVAFY 527
            +R ++                                                       
Sbjct: 421  ARHMV------------------------------------------------------- 480

Query: 528  GVLNHAKGYATAAEAAISEEDLSGSEEIQELMEELSKQDKVESHFKQPKKMVDGMRVGKY 587
                                                                        
Sbjct: 481  ------------------------------------------------------------ 540

Query: 588  NILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYIKSLF--LGWFEPLRDAIAA 647
                                                 +PY+  L   L W          
Sbjct: 541  -------------------------------------IPYMPMLVPPLNW---------- 600

Query: 648  DQEYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVNKRVLS 707
               YDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVNKRVLS
Sbjct: 601  -TGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVNKRVLS 660

Query: 708  IIDRIWASGGRLADLVDRXTEGDTEPVVEDQEKVAKEQDKLRKKVTNLMKKQKLRQVRMI 767
            IIDRIWASGGRLADLVDR                                          
Sbjct: 661  IIDRIWASGGRLADLVDR------------------------------------------ 720

Query: 768  VKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLGGGPPDIRPAFVHTLKTITKE 827
                       +DA                                              
Sbjct: 721  -----------EDA---------------------------------------------- 780

Query: 828  AQKTSRRYGVIECDPLVRRGLEKTARHMVIPYMPMLVPPLNWTGYDKGAHLFLPSYVMRI 887
                                                                        
Sbjct: 781  ------------------------------------------------------------ 840

Query: 888  HGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVPLPEEPTVEDEAEIRKWKWKLKAAK 947
                                               PLPEEPTVEDE EIRKWKWKLKAAK
Sbjct: 841  -----------------------------------PLPEEPTVEDEEEIRKWKWKLKAAK 900

Query: 948  KENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGIL 1007
            KENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGIL
Sbjct: 901  KENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGIL 950

Query: 1008 EFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGNRW 1067
            EFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDR+SFTENHLDEIFDSADRPLEGNRW
Sbjct: 961  EFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRMSFTENHLDEIFDSADRPLEGNRW 950

Query: 1068 WLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSL--- 1127
            WLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKL     
Sbjct: 1021 WLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLGAAAV 950

Query: 1128 -----NNPA----GVNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTSV 1187
                 + PA    G+ A VLDI+RSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTSV
Sbjct: 1081 NLVAGDKPADVYSGIAARVLDIMRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTSV 950

Query: 1188 YGVTYMGARDQIKKRLKERASIADDSQLFAASCYAAR----------------------- 1247
            YGVTYMGARDQIKKRLKERASIADDSQLFAASCYAAR                       
Sbjct: 1141 YGVTYMGARDQIKKRLKERASIADDSQLFAASCYAARTTLTALGEMFEAARSIMSWLGEC 950

Query: 1248 --VIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFPP 1307
              VIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFPP
Sbjct: 1201 AKVIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFPP 950

Query: 1308 NFVHSLDGSHMMMTAVACRRAGLNFAGVHDSYWTHACDVDVMNRLLREKFVELYEAPILE 1333
            NFVHSLDGSHMMMTAVACRRAGLNFAGVHDSYWTHACDVDVMNRLLREKFVELYEAPILE
Sbjct: 1261 NFVHSLDGSHMMMTAVACRRAGLNFAGVHDSYWTHACDVDVMNRLLREKFVELYEAPILE 950

BLAST of Cp4.1LG09g02310 vs. NCBI nr
Match: XP_023542836.1 (DNA-directed RNA polymerase 1B, mitochondrial-like [Cucurbita pepo subsp. pepo] >XP_023542837.1 DNA-directed RNA polymerase 1B, mitochondrial-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1547 bits (4006), Expect = 0.0
Identity = 876/1327 (66.01%), Positives = 891/1327 (67.14%), Query Frame = 0

Query: 48   MWRNIVKRAASRRDTFFSRKVIEDSIFVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREV 107
            MWRNIVKRAASRRDTFFSRKVIEDSI VEEIRSSRTLGSLKSCCLLEGSRGISFFHSREV
Sbjct: 1    MWRNIVKRAASRRDTFFSRKVIEDSISVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREV 60

Query: 108  GFAKSNFAHSSDPVAFYGVLNHANGYATAAEAAISDEDLSGSEEIQELMEELSKQDKVES 167
            GFAKSNFAHSS+PVAFYGVLNHA GYATAAEAAIS+EDLSGSEEIQELMEELSKQDKVES
Sbjct: 61   GFAKSNFAHSSEPVAFYGVLNHAKGYATAAEAAISEEDLSGSEEIQELMEELSKQDKVES 120

Query: 168  HFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYMKS 227
            HFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPY+KS
Sbjct: 121  HFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYIKS 180

Query: 228  LFLGWFEPLRDAIAADQEFCKKKSRVSHFAYFDLLPADMMAVITMHKLMGLLMTNSGGNS 287
            LFLGWFEPLRDAIAADQEFCKKKSRVSH  YFDLLPADMMAVITMHKLMGLLMTNSGGNS
Sbjct: 181  LFLGWFEPLRDAIAADQEFCKKKSRVSHAPYFDLLPADMMAVITMHKLMGLLMTNSGGNS 240

Query: 288  SVRVVQAACQIGEAIEHEVRIHKFFENMKKKK--SNEKTTEGDTEPVVEDQEKVAKEQDK 347
            SVRVVQAACQIGEAIEHEVRIHKFFENMKKKK  SNEKTTEGDTEPVVEDQEKVAKEQDK
Sbjct: 241  SVRVVQAACQIGEAIEHEVRIHKFFENMKKKKRNSNEKTTEGDTEPVVEDQEKVAKEQDK 300

Query: 348  LRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLG 407
            LRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLG
Sbjct: 301  LRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLG 360

Query: 408  GGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTRMWRNIVKRAASRRDT 467
            GGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKT                
Sbjct: 361  GGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKT---------------- 420

Query: 468  FFSRKVIEDSISVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREVGFAKSNFAHSSEPVA 527
              +R ++                                                     
Sbjct: 421  --ARHMV----------------------------------------------------- 480

Query: 528  FYGVLNHAKGYATAAEAAISEEDLSGSEEIQELMEELSKQDKVESHFKQPKKMVDGMRVG 587
                                                                        
Sbjct: 481  ------------------------------------------------------------ 540

Query: 588  KYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYIKSLF--LGWFEPLRDAI 647
                                                   +PY+  L   L W        
Sbjct: 541  ---------------------------------------IPYMPMLVPPLNW-------- 600

Query: 648  AADQEYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVNKRV 707
                 YDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVNKRV
Sbjct: 601  ---TGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVNKRV 660

Query: 708  LSIIDRIWASGGRLADLVDRXTEGDTEPVVEDQEKVAKEQDKLRKKVTNLMKKQKLRQVR 767
            LSIIDRIW SGGRLADLVDR                                        
Sbjct: 661  LSIIDRIWTSGGRLADLVDR---------------------------------------- 720

Query: 768  MIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLGGGPPDIRPAFVHTLKTIT 827
                         +D                                             
Sbjct: 721  -------------ED--------------------------------------------- 780

Query: 828  KEAQKTSRRYGVIECDPLVRRGLEKTARHMVIPYMPMLVPPLNWTGYDKGAHLFLPSYVM 887
                                                                        
Sbjct: 781  ------------------------------------------------------------ 840

Query: 888  RIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVPLPEEPTVEDEAEIRKWKWKLKA 947
                                                VPLPEEPTVEDEAEIRKWKWKLKA
Sbjct: 841  ------------------------------------VPLPEEPTVEDEAEIRKWKWKLKA 900

Query: 948  AKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRG 1007
            AKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRG
Sbjct: 901  AKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRG 952

Query: 1008 ILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGN 1067
            ILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGN
Sbjct: 961  ILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGN 952

Query: 1068 RWWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSL- 1127
            RWWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKL   
Sbjct: 1021 RWWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLGAA 952

Query: 1128 -------NNPA----GVNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMT 1187
                   + PA    G+ A VLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMT
Sbjct: 1081 AVNLVAGDKPADVYSGIAARVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMT 952

Query: 1188 SVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAAR--------------------- 1247
            SVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAAR                     
Sbjct: 1141 SVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAARTTLTALGEMFEAARSIMSWLG 952

Query: 1248 ----VIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAF 1307
                VIASENQPVRWTTPLGLPVVQPYRQLGRH IKTSLQ+L LQRETDKVMA RQKTAF
Sbjct: 1201 ECAKVIASENQPVRWTTPLGLPVVQPYRQLGRHFIKTSLQMLTLQRETDKVMAKRQKTAF 952

Query: 1308 PPNFVHSLDGSHMMMTAVACRRAGLNFAGVHDSYWTHACDVDVMNRLLREKFVELYEAPI 1333
            PPNFVHSLDGSHMMMTAVACRRAGLNFAGVHDSYWTHACDVDVMNRLLREKFVELYEAPI
Sbjct: 1261 PPNFVHSLDGSHMMMTAVACRRAGLNFAGVHDSYWTHACDVDVMNRLLREKFVELYEAPI 952

BLAST of Cp4.1LG09g02310 vs. NCBI nr
Match: KAG6574025.1 (DNA-directed RNA polymerase 2, chloroplastic/mitochondrial, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1536 bits (3978), Expect = 0.0
Identity = 870/1327 (65.56%), Positives = 890/1327 (67.07%), Query Frame = 0

Query: 48   MWRNIVKRAASRRDTFFSRKVIEDSIFVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREV 107
            MWRNIVKRAASRRDTFFSRKVIEDSIFVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREV
Sbjct: 1    MWRNIVKRAASRRDTFFSRKVIEDSIFVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREV 60

Query: 108  GFAKSNFAHSSDPVAFYGVLNHANGYATAAEAAISDEDLSGSEEIQELMEELSKQDKVES 167
            GFAKSNFAHSS+PVAFYGVLNHA GYATAAEAAIS+EDLSGSEEIQELMEELSKQDKVES
Sbjct: 61   GFAKSNFAHSSEPVAFYGVLNHAKGYATAAEAAISEEDLSGSEEIQELMEELSKQDKVES 120

Query: 168  HFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYMKS 227
            HFKQPKKMVDGM VGKYNILRKRQIKMETEAWEEAA+EYQELLTDMCEQKLAPNLPY+KS
Sbjct: 121  HFKQPKKMVDGMGVGKYNILRKRQIKMETEAWEEAAKEYQELLTDMCEQKLAPNLPYIKS 180

Query: 228  LFLGWFEPLRDAIAADQEFCKKKSRVSHFAYFDLLPADMMAVITMHKLMGLLMTNSGGNS 287
            LFLGWFEPLRDAIAADQEFCKKKSRVSH  YFDLLPADMMAVITMHKLMGLLMT+SGGNS
Sbjct: 181  LFLGWFEPLRDAIAADQEFCKKKSRVSHAPYFDLLPADMMAVITMHKLMGLLMTSSGGNS 240

Query: 288  SVRVVQAACQIGEAIEHEVRIHKFFENMKKKK--SNEKTTEGDTEPVVEDQEKVAKEQDK 347
            SVRVVQAACQIGEAIEHEVRIHKFFEN KKKK  SNEKTTEGDTEPVVEDQEK+AKEQDK
Sbjct: 241  SVRVVQAACQIGEAIEHEVRIHKFFENTKKKKKNSNEKTTEGDTEPVVEDQEKLAKEQDK 300

Query: 348  LRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLG 407
            LRKKVTNLMKKQKLRQVRMIVKEHD LKPWGQ+AHVKVGCRLIQLLIETAYIQPPMDQLG
Sbjct: 301  LRKKVTNLMKKQKLRQVRMIVKEHDDLKPWGQEAHVKVGCRLIQLLIETAYIQPPMDQLG 360

Query: 408  GGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTRMWRNIVKRAASRRDT 467
            GGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKT                
Sbjct: 361  GGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKT---------------- 420

Query: 468  FFSRKVIEDSISVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREVGFAKSNFAHSSEPVA 527
              +R ++                                                     
Sbjct: 421  --ARHMV----------------------------------------------------- 480

Query: 528  FYGVLNHAKGYATAAEAAISEEDLSGSEEIQELMEELSKQDKVESHFKQPKKMVDGMRVG 587
                                                                        
Sbjct: 481  ------------------------------------------------------------ 540

Query: 588  KYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYIKSLF--LGWFEPLRDAI 647
                                                   +PY+  L   L W        
Sbjct: 541  ---------------------------------------IPYMPMLVPPLNW-------- 600

Query: 648  AADQEYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVNKRV 707
                 YDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVNKRV
Sbjct: 601  ---TGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVNKRV 660

Query: 708  LSIIDRIWASGGRLADLVDRXTEGDTEPVVEDQEKVAKEQDKLRKKVTNLMKKQKLRQVR 767
            LSIIDRIWASGGRLADLVDR                                        
Sbjct: 661  LSIIDRIWASGGRLADLVDR---------------------------------------- 720

Query: 768  MIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLGGGPPDIRPAFVHTLKTIT 827
                         +D                                             
Sbjct: 721  -------------ED--------------------------------------------- 780

Query: 828  KEAQKTSRRYGVIECDPLVRRGLEKTARHMVIPYMPMLVPPLNWTGYDKGAHLFLPSYVM 887
                                                                        
Sbjct: 781  ------------------------------------------------------------ 840

Query: 888  RIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVPLPEEPTVEDEAEIRKWKWKLKA 947
                                                VPLPEEPTVEDEAEIRKWKWKLKA
Sbjct: 841  ------------------------------------VPLPEEPTVEDEAEIRKWKWKLKA 900

Query: 948  AKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRG 1007
            AKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRG
Sbjct: 901  AKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRG 952

Query: 1008 ILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGN 1067
            ILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGN
Sbjct: 961  ILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGN 952

Query: 1068 RWWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSL- 1127
            RWWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKL   
Sbjct: 1021 RWWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLGAE 952

Query: 1128 -------NNPA----GVNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMT 1187
                   + PA    G+ A VLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMT
Sbjct: 1081 AVNLVAGDKPADVYSGIAARVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMT 952

Query: 1188 SVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAAR--------------------- 1247
            SVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAA+                     
Sbjct: 1141 SVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAAKTTLTALGEMFEAARSIMSWLG 952

Query: 1248 ----VIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAF 1307
                VIASENQPVRWTTPLGLPVVQPYRQLGRH IKTSLQ+L LQRETDKVMA RQKTAF
Sbjct: 1201 ECAKVIASENQPVRWTTPLGLPVVQPYRQLGRHFIKTSLQMLTLQRETDKVMAKRQKTAF 952

Query: 1308 PPNFVHSLDGSHMMMTAVACRRAGLNFAGVHDSYWTHACDVDVMNRLLREKFVELYEAPI 1333
            PPNFVHSLDGSHMMMTAVACRRAGLNFAGVHDSYWTHACDVDVMNRLLREKFVELYEAPI
Sbjct: 1261 PPNFVHSLDGSHMMMTAVACRRAGLNFAGVHDSYWTHACDVDVMNRLLREKFVELYEAPI 952

BLAST of Cp4.1LG09g02310 vs. ExPASy TrEMBL
Match: A0A6J1FZC4 (DNA-directed RNA polymerase OS=Cucurbita moschata OX=3662 GN=LOC111449299 PE=3 SV=1)

HSP 1 Score: 1549 bits (4010), Expect = 0.0
Identity = 874/1325 (65.96%), Positives = 893/1325 (67.40%), Query Frame = 0

Query: 48   MWRNIVKRAASRRDTFFSRKVIEDSIFVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREV 107
            MWRNIVKRAASRRDTFFSRKVIEDSIFVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREV
Sbjct: 1    MWRNIVKRAASRRDTFFSRKVIEDSIFVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREV 60

Query: 108  GFAKSNFAHSSDPVAFYGVLNHANGYATAAEAAISDEDLSGSEEIQELMEELSKQDKVES 167
            GFAKSNFAHSSDPVAFYGVLNHANGYATAAEAAIS+EDLSGSEEIQELMEELSKQDKVES
Sbjct: 61   GFAKSNFAHSSDPVAFYGVLNHANGYATAAEAAISEEDLSGSEEIQELMEELSKQDKVES 120

Query: 168  HFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYMKS 227
            +F+QPKKMVDGM VGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYMKS
Sbjct: 121  YFQQPKKMVDGMGVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYMKS 180

Query: 228  LFLGWFEPLRDAIAADQEFCKKKSRVSHFAYFDLLPADMMAVITMHKLMGLLMTNSGGNS 287
            LFLGWFEPLRDAIAADQEFCKKKSRVSH  YFDLLPADMMAVITMHKLMGLLMT+SGGNS
Sbjct: 181  LFLGWFEPLRDAIAADQEFCKKKSRVSHAPYFDLLPADMMAVITMHKLMGLLMTSSGGNS 240

Query: 288  SVRVVQAACQIGEAIEHEVRIHKFFENMKKKKSNEKTTEGDTEPVVEDQEKVAKEQDKLR 347
            SVRVVQAACQIGEAIEHEVRIHKFFEN KKKKSNEKTTEGDTEPVV+DQEK+AKEQDKLR
Sbjct: 241  SVRVVQAACQIGEAIEHEVRIHKFFENTKKKKSNEKTTEGDTEPVVKDQEKLAKEQDKLR 300

Query: 348  KKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLGGG 407
            KKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLGGG
Sbjct: 301  KKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLGGG 360

Query: 408  PPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTRMWRNIVKRAASRRDTFF 467
            PPDIRPAFVHTLKTIT EAQKTSRRYGVIECDPLVRRGLEKT                  
Sbjct: 361  PPDIRPAFVHTLKTITTEAQKTSRRYGVIECDPLVRRGLEKT------------------ 420

Query: 468  SRKVIEDSISVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREVGFAKSNFAHSSEPVAFY 527
            +R ++                                                       
Sbjct: 421  ARHMV------------------------------------------------------- 480

Query: 528  GVLNHAKGYATAAEAAISEEDLSGSEEIQELMEELSKQDKVESHFKQPKKMVDGMRVGKY 587
                                                                        
Sbjct: 481  ------------------------------------------------------------ 540

Query: 588  NILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYIKSLF--LGWFEPLRDAIAA 647
                                                 +PY+  L   L W          
Sbjct: 541  -------------------------------------IPYMPMLVPPLNW---------- 600

Query: 648  DQEYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVNKRVLS 707
               YDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVNKRVLS
Sbjct: 601  -TGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVNKRVLS 660

Query: 708  IIDRIWASGGRLADLVDRXTEGDTEPVVEDQEKVAKEQDKLRKKVTNLMKKQKLRQVRMI 767
            IIDRIWASGGRLADLVDR                                          
Sbjct: 661  IIDRIWASGGRLADLVDR------------------------------------------ 720

Query: 768  VKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLGGGPPDIRPAFVHTLKTITKE 827
                       +DA                                              
Sbjct: 721  -----------EDA---------------------------------------------- 780

Query: 828  AQKTSRRYGVIECDPLVRRGLEKTARHMVIPYMPMLVPPLNWTGYDKGAHLFLPSYVMRI 887
                                                                        
Sbjct: 781  ------------------------------------------------------------ 840

Query: 888  HGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVPLPEEPTVEDEAEIRKWKWKLKAAK 947
                                               PLPEEPTVEDE EIRKWKWKLKAAK
Sbjct: 841  -----------------------------------PLPEEPTVEDEEEIRKWKWKLKAAK 900

Query: 948  KENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGIL 1007
            KENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGIL
Sbjct: 901  KENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGIL 950

Query: 1008 EFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGNRW 1067
            EFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDR+SFTENHLDEIFDSADRPLEGNRW
Sbjct: 961  EFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRMSFTENHLDEIFDSADRPLEGNRW 950

Query: 1068 WLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSL--- 1127
            WLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKL     
Sbjct: 1021 WLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLGAAAV 950

Query: 1128 -----NNPA----GVNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTSV 1187
                 + PA    G+ A VLDI+RSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTSV
Sbjct: 1081 NLVAGDKPADVYSGIAARVLDIMRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTSV 950

Query: 1188 YGVTYMGARDQIKKRLKERASIADDSQLFAASCYAAR----------------------- 1247
            YGVTYMGARDQIKKRLKERASIADDSQLFAASCYAAR                       
Sbjct: 1141 YGVTYMGARDQIKKRLKERASIADDSQLFAASCYAARTTLTALGEMFEAARSIMSWLGEC 950

Query: 1248 --VIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFPP 1307
              VIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFPP
Sbjct: 1201 AKVIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFPP 950

Query: 1308 NFVHSLDGSHMMMTAVACRRAGLNFAGVHDSYWTHACDVDVMNRLLREKFVELYEAPILE 1333
            NFVHSLDGSHMMMTAVACRRAGLNFAGVHDSYWTHACDVDVMNRLLREKFVELYEAPILE
Sbjct: 1261 NFVHSLDGSHMMMTAVACRRAGLNFAGVHDSYWTHACDVDVMNRLLREKFVELYEAPILE 950

BLAST of Cp4.1LG09g02310 vs. ExPASy TrEMBL
Match: A0A6J1FZD9 (DNA-directed RNA polymerase OS=Cucurbita moschata OX=3662 GN=LOC111449300 PE=3 SV=1)

HSP 1 Score: 1526 bits (3951), Expect = 0.0
Identity = 867/1327 (65.34%), Positives = 885/1327 (66.69%), Query Frame = 0

Query: 48   MWRNIVKRAASRRDTFFSRKVIEDSIFVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREV 107
            MWRNIVKRAASRRDTFFSRKVIEDSIFVEEIRSSRTLGSLKS CLLEGSRGISFFHSREV
Sbjct: 1    MWRNIVKRAASRRDTFFSRKVIEDSIFVEEIRSSRTLGSLKSSCLLEGSRGISFFHSREV 60

Query: 108  GFAKSNFAHSSDPVAFYGVLNHANGYATAAEAAISDEDLSGSEEIQELMEELSKQDKVES 167
            GFAKSNFAHSS+ VAFYGVLNHA GYATAAEAAIS+EDLSGSEEIQELMEELSKQDKV S
Sbjct: 61   GFAKSNFAHSSEHVAFYGVLNHAKGYATAAEAAISEEDLSGSEEIQELMEELSKQDKVGS 120

Query: 168  HFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYMKS 227
            HFKQPKKMVDGM VGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPY+KS
Sbjct: 121  HFKQPKKMVDGMGVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYIKS 180

Query: 228  LFLGWFEPLRDAIAADQEFCKKKSRVSHFAYFDLLPADMMAVITMHKLMGLLMTNSGGNS 287
            LFLGWFEPLRDAIAADQEFCKKKSRVSH  YFDLLPADMMAVITMHKLMGLLMT+SGGNS
Sbjct: 181  LFLGWFEPLRDAIAADQEFCKKKSRVSHAPYFDLLPADMMAVITMHKLMGLLMTSSGGNS 240

Query: 288  SVRVVQAACQIGEAIEHEVRIHKFFENMKKKK--SNEKTTEGDTEPVVEDQEKVAKEQDK 347
            SVRVVQAACQIGEAIEHEVRIHKFFEN KKKK  SNEKTTEGDTEPVVEDQEK+AKEQDK
Sbjct: 241  SVRVVQAACQIGEAIEHEVRIHKFFENTKKKKKNSNEKTTEGDTEPVVEDQEKLAKEQDK 300

Query: 348  LRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLG 407
            LRKKVTNLMKKQKLRQVRMIVKEHD LKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLG
Sbjct: 301  LRKKVTNLMKKQKLRQVRMIVKEHDDLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLG 360

Query: 408  GGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTRMWRNIVKRAASRRDT 467
            GGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKT                
Sbjct: 361  GGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKT---------------- 420

Query: 468  FFSRKVIEDSISVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREVGFAKSNFAHSSEPVA 527
              +R ++                                                     
Sbjct: 421  --ARHMV----------------------------------------------------- 480

Query: 528  FYGVLNHAKGYATAAEAAISEEDLSGSEEIQELMEELSKQDKVESHFKQPKKMVDGMRVG 587
                                                                        
Sbjct: 481  ------------------------------------------------------------ 540

Query: 588  KYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYIKSLF--LGWFEPLRDAI 647
                                                   +PY+  L   L W        
Sbjct: 541  ---------------------------------------IPYMPMLVPPLNW-------- 600

Query: 648  AADQEYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVNKRV 707
                 YDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVNKRV
Sbjct: 601  ---TGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVNKRV 660

Query: 708  LSIIDRIWASGGRLADLVDRXTEGDTEPVVEDQEKVAKEQDKLRKKVTNLMKKQKLRQVR 767
            LSIIDRIW SGGRLADLVDR                                        
Sbjct: 661  LSIIDRIWTSGGRLADLVDR---------------------------------------- 720

Query: 768  MIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLGGGPPDIRPAFVHTLKTIT 827
                         +D                                             
Sbjct: 721  -------------ED--------------------------------------------- 780

Query: 828  KEAQKTSRRYGVIECDPLVRRGLEKTARHMVIPYMPMLVPPLNWTGYDKGAHLFLPSYVM 887
                                                                        
Sbjct: 781  ------------------------------------------------------------ 840

Query: 888  RIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVPLPEEPTVEDEAEIRKWKWKLKA 947
                                                VPLPEEPTVEDEAEIRKWKWKLK 
Sbjct: 841  ------------------------------------VPLPEEPTVEDEAEIRKWKWKLKG 900

Query: 948  AKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRG 1007
            AKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRG
Sbjct: 901  AKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRG 952

Query: 1008 ILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGN 1067
            ILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGN
Sbjct: 961  ILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGN 952

Query: 1068 RWWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSL- 1127
            RWWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKL   
Sbjct: 1021 RWWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLGAA 952

Query: 1128 -------NNPA----GVNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMT 1187
                   + PA    G+ A VLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMT
Sbjct: 1081 AVNLVAGDKPADVYSGIAARVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMT 952

Query: 1188 SVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAAR--------------------- 1247
            SVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAA+                     
Sbjct: 1141 SVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAAKTTLTALGEMFEAARSIMSWLG 952

Query: 1248 ----VIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAF 1307
                VIASENQPVRWTTPLGLPVVQPYRQLGRH IKTSLQ+L LQRETDKVMA RQKTAF
Sbjct: 1201 ECAKVIASENQPVRWTTPLGLPVVQPYRQLGRHFIKTSLQMLTLQRETDKVMAKRQKTAF 952

Query: 1308 PPNFVHSLDGSHMMMTAVACRRAGLNFAGVHDSYWTHACDVDVMNRLLREKFVELYEAPI 1333
            PPNFVHSLDGSHMMMTAVACRRAGLNFAGVHDSYWTHACDVDVMNRLLREKFVELYEAPI
Sbjct: 1261 PPNFVHSLDGSHMMMTAVACRRAGLNFAGVHDSYWTHACDVDVMNRLLREKFVELYEAPI 952

BLAST of Cp4.1LG09g02310 vs. ExPASy TrEMBL
Match: A0A6J1HS83 (DNA-directed RNA polymerase OS=Cucurbita maxima OX=3661 GN=LOC111467347 PE=3 SV=1)

HSP 1 Score: 1506 bits (3900), Expect = 0.0
Identity = 853/1326 (64.33%), Positives = 884/1326 (66.67%), Query Frame = 0

Query: 48   MWRNIVKRAASRRDTFFSRKVIEDSIFVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREV 107
            MWRNIVKRAASRRDTFFSRKVIEDSIFVEEIRSSRTLGSLKSCCLLEGSRGI+FFHSREV
Sbjct: 1    MWRNIVKRAASRRDTFFSRKVIEDSIFVEEIRSSRTLGSLKSCCLLEGSRGINFFHSREV 60

Query: 108  GFAKSNFAHSSDPVAFYGVLNHANGYATAAEAAISDEDLSGSEEIQELMEELSKQDKVES 167
            GFAKSN AHSS+P+AFYGVLNHA GYATAAEAAIS+EDLSGSEEIQELMEELSKQDKVES
Sbjct: 61   GFAKSNCAHSSEPLAFYGVLNHAKGYATAAEAAISEEDLSGSEEIQELMEELSKQDKVES 120

Query: 168  HFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYMKS 227
            HFKQPKKMVDGM +GKYNILRKRQIKMETEAWE+AAREYQELLTDMCEQKL PNLPYMKS
Sbjct: 121  HFKQPKKMVDGMGIGKYNILRKRQIKMETEAWEKAAREYQELLTDMCEQKLVPNLPYMKS 180

Query: 228  LFLGWFEPLRDAIAADQEFCK-KKSRVSHFAYFDLLPADMMAVITMHKLMGLLMTNSGGN 287
            LFLGWFEPLRDAIAADQEF K KK+RVSH  YFDLLPADMMAVITMHKLMGLLMTNSGGN
Sbjct: 181  LFLGWFEPLRDAIAADQEFFKDKKTRVSHAPYFDLLPADMMAVITMHKLMGLLMTNSGGN 240

Query: 288  SSVRVVQAACQIGEAIEHEVRIHKFFENMKKKKSNEKTTEGDTEPVVEDQEKVAKEQDKL 347
            SSVRVVQAACQIGEAIEHEVRIHKFFE  KKK SNEKTTEGDTEPVVEDQEK+AK+QDKL
Sbjct: 241  SSVRVVQAACQIGEAIEHEVRIHKFFEKTKKKNSNEKTTEGDTEPVVEDQEKLAKQQDKL 300

Query: 348  RKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLGG 407
            RKKVTNLMKKQKLRQVRMIVKEHD LKPWGQDAHVKVGCRLIQLLIETAYIQPP+DQLG 
Sbjct: 301  RKKVTNLMKKQKLRQVRMIVKEHDDLKPWGQDAHVKVGCRLIQLLIETAYIQPPIDQLGE 360

Query: 408  GPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTRMWRNIVKRAASRRDTF 467
            GPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKT                 
Sbjct: 361  GPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKT----------------- 420

Query: 468  FSRKVIEDSISVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREVGFAKSNFAHSSEPVAF 527
             +R ++                                                      
Sbjct: 421  -ARHMV------------------------------------------------------ 480

Query: 528  YGVLNHAKGYATAAEAAISEEDLSGSEEIQELMEELSKQDKVESHFKQPKKMVDGMRVGK 587
                                                                        
Sbjct: 481  ------------------------------------------------------------ 540

Query: 588  YNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYIKSLF--LGWFEPLRDAIA 647
                                                  +PY+  L   L W         
Sbjct: 541  --------------------------------------IPYMPMLVPPLYW--------- 600

Query: 648  ADQEYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVNKRVL 707
                YDKGA+LFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVNKR+L
Sbjct: 601  --TGYDKGAYLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVNKRIL 660

Query: 708  SIIDRIWASGGRLADLVDRXTEGDTEPVVEDQEKVAKEQDKLRKKVTNLMKKQKLRQVRM 767
            SIIDRIWASGGRLADLVDR                                         
Sbjct: 661  SIIDRIWASGGRLADLVDR----------------------------------------- 720

Query: 768  IVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLGGGPPDIRPAFVHTLKTITK 827
                        +D                                              
Sbjct: 721  ------------ED---------------------------------------------- 780

Query: 828  EAQKTSRRYGVIECDPLVRRGLEKTARHMVIPYMPMLVPPLNWTGYDKGAHLFLPSYVMR 887
                                                                        
Sbjct: 781  ------------------------------------------------------------ 840

Query: 888  IHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVPLPEEPTVEDEAEIRKWKWKLKAA 947
                                               VPLPEEPTVEDEAEIRKWKWK+KAA
Sbjct: 841  -----------------------------------VPLPEEPTVEDEAEIRKWKWKVKAA 900

Query: 948  KKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGI 1007
            KKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHP+LNHLGSDMCRGI
Sbjct: 901  KKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPYLNHLGSDMCRGI 951

Query: 1008 LEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGNR 1067
            LEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGNR
Sbjct: 961  LEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGNR 951

Query: 1068 WWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSL-- 1127
            WWLGAEDPFQCLAVCINLSEALRSPSPETT+SHMPVHQDGSCNGLQHYAALGRDKL    
Sbjct: 1021 WWLGAEDPFQCLAVCINLSEALRSPSPETTISHMPVHQDGSCNGLQHYAALGRDKLGAAA 951

Query: 1128 ------NNPA----GVNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTS 1187
                  + PA    G+ A VLDI+RSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTS
Sbjct: 1081 VNLVAGDKPADVYSGIAARVLDIMRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTS 951

Query: 1188 VYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAAR---------------------- 1247
            VYGVTYMGARDQIK+RLKERASIADDSQLFAA+CYAA+                      
Sbjct: 1141 VYGVTYMGARDQIKRRLKERASIADDSQLFAAACYAAKTTLTALGEMFEAARSIMSWLGE 951

Query: 1248 ---VIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFP 1307
               VIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVL LQRETDKVMA RQKTAFP
Sbjct: 1201 CAKVIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLTLQRETDKVMAKRQKTAFP 951

Query: 1308 PNFVHSLDGSHMMMTAVACRRAGLNFAGVHDSYWTHACDVDVMNRLLREKFVELYEAPIL 1333
            PNFVHSLDGSHMMMTAVAC+RAGLNFAGVHDSYWTHACDVD MNRLLREKFVELYEAPIL
Sbjct: 1261 PNFVHSLDGSHMMMTAVACKRAGLNFAGVHDSYWTHACDVDEMNRLLREKFVELYEAPIL 951

BLAST of Cp4.1LG09g02310 vs. ExPASy TrEMBL
Match: A0A6J1D747 (DNA-directed RNA polymerase OS=Momordica charantia OX=3673 GN=LOC111017540 PE=3 SV=1)

HSP 1 Score: 1374 bits (3557), Expect = 0.0
Identity = 782/1331 (58.75%), Positives = 841/1331 (63.19%), Query Frame = 0

Query: 48   MWRNIVKRAASRRDTFFS-------RKVIEDSIFVEEIRSSRTLGSLKSCCLLEGSRGIS 107
            MWRNIVKRAASR+D  FS       RKVIED IF+E+IRSSRT   L +C  L G R I 
Sbjct: 1    MWRNIVKRAASRKDILFSQPSSSRFRKVIEDHIFLEQIRSSRTHRGLNTCRQLAGFRRIG 60

Query: 108  FFHSREVGFAKSNFAHSSDPVAFYGVLNHANGYATAAEAAISDEDLSGSEEIQELMEELS 167
            F H            HSS+P+ F+GVLNHA GYATAAEA IS+EDLSGSEEIQELMEEL+
Sbjct: 61   FLHPDN--------PHSSEPLLFHGVLNHAKGYATAAEATISEEDLSGSEEIQELMEELT 120

Query: 168  KQDKVESHFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAP 227
            KQ+K+E HFKQPK+MVDGM V KYN+LRKRQ+KMETEAWEEAAREYQELLTDMCEQKLAP
Sbjct: 121  KQEKLEHHFKQPKRMVDGMGVSKYNMLRKRQVKMETEAWEEAAREYQELLTDMCEQKLAP 180

Query: 228  NLPYMKSLFLGWFEPLRDAIAADQEFCK-KKSRVSHFAYFDLLPADMMAVITMHKLMGLL 287
            NLPY+KSLFLGWFEPLRDAIAADQE+CK KK+RVSH  YFDL+PADMMAVITMHKLMGLL
Sbjct: 181  NLPYVKSLFLGWFEPLRDAIAADQEYCKDKKNRVSHAPYFDLIPADMMAVITMHKLMGLL 240

Query: 288  MTNSGGNSSVRVVQAACQIGEAIEHEVRIHKFFENMKKKKSNEKTTEGDTEPVVEDQEKV 347
            MTNSGGN SVRVVQAACQIGEAIEHEVRIHKFFE  KKK  NEK TE + +PV E+QEK+
Sbjct: 241  MTNSGGNGSVRVVQAACQIGEAIEHEVRIHKFFEKTKKKNVNEKNTEAEADPVAEEQEKL 300

Query: 348  AKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQP 407
            AKEQDKLRKKVT LMKKQKL+QVRMIVKE D LKPWGQDAHVKVGCRL+QLL+ETAYIQP
Sbjct: 301  AKEQDKLRKKVTKLMKKQKLQQVRMIVKEQDDLKPWGQDAHVKVGCRLMQLLMETAYIQP 360

Query: 408  PMDQLGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTRMWRNIVKRA 467
            P+DQLG GPPDIRPAFVHTLKTIT+EAQKTSRRYGVIECDPLVRRGLEKT          
Sbjct: 361  PIDQLGNGPPDIRPAFVHTLKTITREAQKTSRRYGVIECDPLVRRGLEKT---------- 420

Query: 468  ASRRDTFFSRKVIEDSISVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREVGFAKSNFAH 527
                                                                        
Sbjct: 421  ------------------------------------------------------------ 480

Query: 528  SSEPVAFYGVLNHAKGYATAAEAAISEEDLSGSEEIQELMEELSKQDKVESHFKQPKKMV 587
                                                                        
Sbjct: 481  ------------------------------------------------------------ 540

Query: 588  DGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAPNLPYIKSLFLGWFEPL 647
                                                     + P LP +    L W    
Sbjct: 541  -------------------------------------ARHTVIPYLPMLVPP-LNW---- 600

Query: 648  RDAIAADQEYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRV 707
                     YDKGA+LFLPSYVMRIHGAKQQREAVKR PKKQLEPVFEALDTLGRTKWRV
Sbjct: 601  -------TGYDKGAYLFLPSYVMRIHGAKQQREAVKRAPKKQLEPVFEALDTLGRTKWRV 660

Query: 708  NKRVLSIIDRIWASGGRLADLVDRXTEGDTEPVVEDQEKVAKEQDKLRKKVTNLMKKQKL 767
            NKRVLS+IDRIWASGGRLADLVDR                                    
Sbjct: 661  NKRVLSVIDRIWASGGRLADLVDR------------------------------------ 720

Query: 768  RQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMDQLGGGPPDIRPAFVHTL 827
                             +D                                         
Sbjct: 721  -----------------ED----------------------------------------- 780

Query: 828  KTITKEAQKTSRRYGVIECDPLVRRGLEKTARHMVIPYMPMLVPPLNWTGYDKGAHLFLP 887
                                                                        
Sbjct: 781  ------------------------------------------------------------ 840

Query: 888  SYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRVPLPEEPTVEDEAEIRKWKW 947
                                                    VPLPEEP V+DEAEIRKWKW
Sbjct: 841  ----------------------------------------VPLPEEPNVDDEAEIRKWKW 900

Query: 948  KLKAAKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSD 1007
            K+KAAKKENSERHSQRCDIELKLAVARKMKEE+GFYYPHNLDFRGRAYPMHP+LNHLGSD
Sbjct: 901  KVKAAKKENSERHSQRCDIELKLAVARKMKEEDGFYYPHNLDFRGRAYPMHPYLNHLGSD 950

Query: 1008 MCRGILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRP 1067
            +CRG+LEFAEGRPLGESGL WLKIHLANLYAGGVDKLSYKDR+SFTENHL+EIFDSADRP
Sbjct: 961  LCRGVLEFAEGRPLGESGLYWLKIHLANLYAGGVDKLSYKDRVSFTENHLEEIFDSADRP 950

Query: 1068 LEGNRWWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDK 1127
            LEGNRWWLGAEDPFQCLAVCINLSEALRSPSPETT+SHMPVHQDGSCNGLQHYAALGRDK
Sbjct: 1021 LEGNRWWLGAEDPFQCLAVCINLSEALRSPSPETTISHMPVHQDGSCNGLQHYAALGRDK 950

Query: 1128 LSL--------NNPA----GVNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQ 1187
            L          + PA    G+ + VLDI+RSDAEKDPATNPNALHARLLINQVDRKLVKQ
Sbjct: 1081 LGAAAVNLVAGDKPADVYSGIASRVLDIMRSDAEKDPATNPNALHARLLINQVDRKLVKQ 950

Query: 1188 TVMTSVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAAR----------------- 1247
            TVMTSVYGVTY+GAR+QIK+RLKER+SI+DDS+LFAASCYAA+                 
Sbjct: 1141 TVMTSVYGVTYVGAREQIKRRLKERSSISDDSKLFAASCYAAKTTLTALGEMFEAARSIM 950

Query: 1248 --------VIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQ 1307
                    VIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVL LQRETD+VM  RQ
Sbjct: 1201 NWLGECAKVIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLTLQRETDQVMVKRQ 950

Query: 1308 KTAFPPNFVHSLDGSHMMMTAVACRRAGLNFAGVHDSYWTHACDVDVMNRLLREKFVELY 1333
            +TAFPPNFVHSLDGSHMMMTAVAC+RAGLNFAGVHDSYWTHACDVD MNR+LREKFVELY
Sbjct: 1261 RTAFPPNFVHSLDGSHMMMTAVACKRAGLNFAGVHDSYWTHACDVDEMNRILREKFVELY 950

BLAST of Cp4.1LG09g02310 vs. ExPASy TrEMBL
Match: A0A5A7SYG2 (DNA-directed RNA polymerase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold475G00320 PE=3 SV=1)

HSP 1 Score: 1256 bits (3251), Expect = 0.0
Identity = 657/954 (68.87%), Positives = 748/954 (78.41%), Query Frame = 0

Query: 451  MWRNIVKRAASRRDTFFS-------RKVIEDSISVEEIRSSRTLGSLKSCCLLEGSRGIS 510
            MWRN++KRAASR+ T FS       RKV ED I +++IR SR+L +L +CC    S  IS
Sbjct: 1    MWRNVLKRAASRKATLFSELSSSTSRKVTEDHIFLDQIRYSRSLETLNTCCQSGDSSRIS 60

Query: 511  FFHSREVGFAKSNFAHSSEPVAFYGVLNHAKGYATAAEAAISEEDLSGSEEIQELMEELS 570
            F H ++VGF  SNF HSS PVAFYGVLNHA+GYATAAEAAISE DLSGSEEIQE+ME +S
Sbjct: 61   FLHPQKVGFTNSNFPHSSNPVAFYGVLNHAEGYATAAEAAISEGDLSGSEEIQEIMEGIS 120

Query: 571  KQDKVESHFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLAP 630
            KQDKVE HFK+PK+MVD      Y++LRKRQIK+ETEAWEEAAREYQEL+ ++CEQKL+P
Sbjct: 121  KQDKVEPHFKKPKRMVDRKGEATYDMLRKRQIKIETEAWEEAAREYQELIAEICEQKLSP 180

Query: 631  NLPYIKSLFLGWFEPLRDAIAADQEYDKG-AHLFLPSYVMRIHGAKQQREAVKRVPKKQL 690
            NLPY+KSLFLGWF+P RDAI A+QE  K  +  F PS+    +       AV  + K  L
Sbjct: 181  NLPYMKSLFLGWFQPFRDAIVAEQESIKSKSKNFCPSHAPYFNLLPADMMAVITMHK--L 240

Query: 691  EPVFEALDTLGRTKWRVNKRVLSIIDRIWASGGRLADLVDRXTEGDTEPVVEDQEKVAKE 750
              V  + D  G    +V +    I + I  +  R+    ++  +          EK+A++
Sbjct: 241  VGVMMS-DFEGNGIVKVTQAATHIGEAI-ENEVRIRHFFEKKKQP---------EKLAED 300

Query: 751  QDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIETAYIQPPMD 810
            QDKLRKKVT LMK+QKL++V  IVK+HD LKPWG D HVKVGC LI+ LIETAYIQPP+D
Sbjct: 301  QDKLRKKVTKLMKQQKLQKVNFIVKKHDDLKPWGTDVHVKVGCTLIKFLIETAYIQPPVD 360

Query: 811  QLGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTARHMVIPYMPMLV 870
            QLG  PPD+RPAFVH LKT  KE QK  RRYGVIECDPLVRRG+ KTA HMVIPYMPMLV
Sbjct: 361  QLGDAPPDLRPAFVHYLKTSPKETQKLGRRYGVIECDPLVRRGMVKTAGHMVIPYMPMLV 420

Query: 871  PPLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTLGRTKWRV-- 930
            PPLNWTGYDKGA+ FLPSYVMRI GAKQQREAV+R P+ QLEPVF+ALD LG TKWRV  
Sbjct: 421  PPLNWTGYDKGAYFFLPSYVMRIRGAKQQREAVRRAPRTQLEPVFKALDILGSTKWRVNK 480

Query: 931  -------------------------PLPEEPTVEDEAEIRKWKWKLKAAKKENSERHSQR 990
                                     PLPEEP+VEDEAEIRKWKWK+KA KKENSERHSQR
Sbjct: 481  RVLSIIERIWASGGRLADLVDREDLPLPEEPSVEDEAEIRKWKWKVKAVKKENSERHSQR 540

Query: 991  CDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGILEFAEGRPLGE 1050
            CD+ELKLAVARKMK+EEGFYYPHNLDFRGRAYPMHP+LNH+GSD+CRGILEFAEGRPLGE
Sbjct: 541  CDVELKLAVARKMKDEEGFYYPHNLDFRGRAYPMHPNLNHIGSDLCRGILEFAEGRPLGE 600

Query: 1051 SGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGNRWWLGAEDPFQC 1110
            SGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEG+RWWLGAEDPFQC
Sbjct: 601  SGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGSRWWLGAEDPFQC 660

Query: 1111 LAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSL--------NNPA- 1170
            LAVCINLSEALRSPSPETT+SH+P+HQDGSCNGLQHYAALGRDKL          + PA 
Sbjct: 661  LAVCINLSEALRSPSPETTISHLPIHQDGSCNGLQHYAALGRDKLGAEAVNLAAGDKPAD 720

Query: 1171 ---GVNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTSVYGVTYMGARD 1230
               G+ + VLDIIRSDA KDPA+ PNALHA+LLINQVDRKLVKQTVMTSVYGVT +GA+D
Sbjct: 721  VYSGIASRVLDIIRSDALKDPASYPNALHAKLLINQVDRKLVKQTVMTSVYGVTLVGAKD 780

Query: 1231 QIKKRLKERASIADDSQLFAASCYAAR-------------------------VIASENQP 1290
            QI +RLKERASI ++ QLF+ASCYAA+                         VIASEN+ 
Sbjct: 781  QISQRLKERASIGNERQLFSASCYAAKTTLTAIGEMFEEAKSIMNWLGECAKVIASENKD 840

Query: 1291 VRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFPPNFVHSLDGSH 1332
            VRWTTPLGLPVVQPYR+ GRH++KTSLQVL+LQRETDKVMA RQK+AFPPNF+HSLD SH
Sbjct: 841  VRWTTPLGLPVVQPYRKPGRHIVKTSLQVLSLQRETDKVMAARQKSAFPPNFIHSLDSSH 900

BLAST of Cp4.1LG09g02310 vs. TAIR 10
Match: AT1G68990.1 (male gametophyte defective 3)

HSP 1 Score: 1034.6 bits (2674), Expect = 7.3e-302
Identity = 556/966 (57.56%), Positives = 669/966 (69.25%), Query Frame = 0

Query: 451  MWRNIVKRAASRRDTFFSRKVIEDSISVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREV 510
            MWRNI+ RA+ R+  F S    + S S      +R  G L S  L     G+S     E+
Sbjct: 1    MWRNILGRASLRKVKFLS----DSSSSGTHYPVNRVRGILSSVNLSGVRNGLSINPVNEM 60

Query: 511  GFAKSNFAHSSEPVAFYGVLNHAKGYATAAEAAIS---EEDLSGSEEIQELMEELSKQ-- 570
            G   S+F H        G     +GYATAA+A  S   E++ SGS+E+ EL+ E+ K+  
Sbjct: 61   G-GLSSFRH--------GQCYVFEGYATAAQAIDSTDPEDESSGSDEVNELITEMEKETE 120

Query: 571  ---DKVESHFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLA 630
                K       PK+++ GM   K+ +L++RQ+KMETE WE AARE +E+L DMCEQKLA
Sbjct: 121  RIRKKARLAAIPPKRVIAGMGAQKFYMLKQRQVKMETEEWERAARECREILADMCEQKLA 180

Query: 631  PNLPYIKSLFLGWFEPLRDAIAADQEYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQ- 690
            PNLPY+KSLFLGWFEP+R+AI  D +  K     +P Y   +      + AV  + K   
Sbjct: 181  PNLPYMKSLFLGWFEPVRNAIQDDLDTFKIKKGKIP-YAPFMEQLPADKMAVITMHKMMG 240

Query: 691  ----------LEPVFEALDTLGRTKWRVNKRVLSIIDRIWASGGRLADLVDRXTEGDTEP 750
                      +  +  A   +G        R+ S + +         +  D+    + E 
Sbjct: 241  LLMTNAEGVGIVKLVNAATQIGEAV-EQEVRINSFLQK-----KNKKNATDKTINTEAEN 300

Query: 751  VVEDQEKVAKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLL 810
            V E  E VAKE +K RK+VT LM+K KLRQV+ +V++HD  KPWGQ+A VKVG RLIQLL
Sbjct: 301  VSE--EIVAKETEKARKQVTVLMEKNKLRQVKALVRKHDSFKPWGQEAQVKVGARLIQLL 360

Query: 811  IETAYIQPPMDQLGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTAR 870
            +E AYIQPP +Q   GPPDIRPAF    +T+T E  KTSRRYG IECDPLV +GL+K+AR
Sbjct: 361  MENAYIQPPAEQFDDGPPDIRPAFKQNFRTVTLENTKTSRRYGCIECDPLVLKGLDKSAR 420

Query: 871  HMVIPYMPMLVPPLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALD 930
            HMVIPY+PML+PP NWTGYD+GAH FLPSYVMR HGAKQQR  +KR PK+QLEPV+EALD
Sbjct: 421  HMVIPYLPMLIPPQNWTGYDQGAHFFLPSYVMRTHGAKQQRTVMKRTPKEQLEPVYEALD 480

Query: 931  TLGRTKWR---------------------------VPLPEEPTVEDEAEIRKWKWKLKAA 990
            TLG TKW+                           VP+PEEP  ED+ + + W+W+ K A
Sbjct: 481  TLGNTKWKINKKVLSLVDRIWANGGRIGGLVDREDVPIPEEPEREDQEKFKNWRWESKKA 540

Query: 991  KKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGI 1050
             K+N+ERHSQRCDIELKL VARKMK+EEGFYYPHN+DFRGRAYP+HP+LNHLGSD+CRGI
Sbjct: 541  IKQNNERHSQRCDIELKLEVARKMKDEEGFYYPHNVDFRGRAYPIHPYLNHLGSDLCRGI 600

Query: 1051 LEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGNR 1110
            LEF EG+PLG+SGLRWLKIH+ANLYAGGVDKL+Y+DRI+FTE+HL++IFDS+DRPLEG R
Sbjct: 601  LEFCEGKPLGKSGLRWLKIHIANLYAGGVDKLAYEDRIAFTESHLEDIFDSSDRPLEGKR 660

Query: 1111 WWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSLN- 1170
            WWL AEDPFQCLA CINLSEALRSP PE  +SH+P+HQDGSCNGLQHYAALGRDKL  + 
Sbjct: 661  WWLNAEDPFQCLAACINLSEALRSPFPEAAISHIPIHQDGSCNGLQHYAALGRDKLGADA 720

Query: 1171 -------NPAGV----NAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTS 1230
                    PA V     A VL I++ DAE+DP T PNA +A+L+++QVDRKLVKQTVMTS
Sbjct: 721  VNLVTGEKPADVYTEIAARVLKIMQQDAEEDPETFPNATYAKLMLDQVDRKLVKQTVMTS 780

Query: 1231 VYGVTYMGARDQIKKRLKERASIADDSQLFAASCYA------------------------ 1290
            VYGVTY GARDQIKKRLKER +  DDS  F ASCYA                        
Sbjct: 781  VYGVTYSGARDQIKKRLKERGTFEDDSLTFHASCYAAKITLKALEEMFEAARAIKSWFGD 840

Query: 1291 -ARVIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFP 1334
             A++IASEN  V WTTPLGLPVVQPYR+ GRHL+KT+LQVL L RETDKVMA RQ TAF 
Sbjct: 841  CAKIIASENNAVCWTTPLGLPVVQPYRKPGRHLVKTTLQVLTLSRETDKVMARRQMTAFA 900

BLAST of Cp4.1LG09g02310 vs. TAIR 10
Match: AT1G68990.2 (male gametophyte defective 3)

HSP 1 Score: 1027.7 bits (2656), Expect = 9.0e-300
Identity = 556/973 (57.14%), Positives = 669/973 (68.76%), Query Frame = 0

Query: 451  MWRNIVKRAASRRDTFFSRKVIEDSISVEEIRSSRTLGSLKSCCLLEGSRGISFFHSREV 510
            MWRNI+ RA+ R+  F S    + S S      +R  G L S  L     G+S     E+
Sbjct: 1    MWRNILGRASLRKVKFLS----DSSSSGTHYPVNRVRGILSSVNLSGVRNGLSINPVNEM 60

Query: 511  GFAKSNFAHSSEPVAFYGVLNHAKGYATAAEAAIS---EEDLSGSEEIQELMEELSKQ-- 570
            G   S+F H        G     +GYATAA+A  S   E++ SGS+E+ EL+ E+ K+  
Sbjct: 61   G-GLSSFRH--------GQCYVFEGYATAAQAIDSTDPEDESSGSDEVNELITEMEKETE 120

Query: 571  ---DKVESHFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLA 630
                K       PK+++ GM   K+ +L++RQ+KMETE WE AARE +E+L DMCEQKLA
Sbjct: 121  RIRKKARLAAIPPKRVIAGMGAQKFYMLKQRQVKMETEEWERAARECREILADMCEQKLA 180

Query: 631  PNLPYIKSLFLGWFEPLRDAIAADQEYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQ- 690
            PNLPY+KSLFLGWFEP+R+AI  D +  K     +P Y   +      + AV  + K   
Sbjct: 181  PNLPYMKSLFLGWFEPVRNAIQDDLDTFKIKKGKIP-YAPFMEQLPADKMAVITMHKMMG 240

Query: 691  ----------LEPVFEALDTLGRTKWRVNKRVLSIIDRIWASGGRLADLVDRXTEGDTEP 750
                      +  +  A   +G        R+ S + +         +  D+    + E 
Sbjct: 241  LLMTNAEGVGIVKLVNAATQIGEAV-EQEVRINSFLQK-----KNKKNATDKTINTEAEN 300

Query: 751  VVEDQEKVAKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLL 810
            V E  E VAKE +K RK+VT LM+K KLRQV+ +V++HD  KPWGQ+A VKVG RLIQLL
Sbjct: 301  VSE--EIVAKETEKARKQVTVLMEKNKLRQVKALVRKHDSFKPWGQEAQVKVGARLIQLL 360

Query: 811  IETAYIQPPMDQLGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKT-- 870
            +E AYIQPP +Q   GPPDIRPAF    +T+T E  KTSRRYG IECDPLV +GL+K+  
Sbjct: 361  MENAYIQPPAEQFDDGPPDIRPAFKQNFRTVTLENTKTSRRYGCIECDPLVLKGLDKSVS 420

Query: 871  -----ARHMVIPYMPMLVPPLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLE 930
                 ARHMVIPY+PML+PP NWTGYD+GAH FLPSYVMR HGAKQQR  +KR PK+QLE
Sbjct: 421  RIVDYARHMVIPYLPMLIPPQNWTGYDQGAHFFLPSYVMRTHGAKQQRTVMKRTPKEQLE 480

Query: 931  PVFEALDTLGRTKWR---------------------------VPLPEEPTVEDEAEIRKW 990
            PV+EALDTLG TKW+                           VP+PEEP  ED+ + + W
Sbjct: 481  PVYEALDTLGNTKWKINKKVLSLVDRIWANGGRIGGLVDREDVPIPEEPEREDQEKFKNW 540

Query: 991  KWKLKAAKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLG 1050
            +W+ K A K+N+ERHSQRCDIELKL VARKMK+EEGFYYPHN+DFRGRAYP+HP+LNHLG
Sbjct: 541  RWESKKAIKQNNERHSQRCDIELKLEVARKMKDEEGFYYPHNVDFRGRAYPIHPYLNHLG 600

Query: 1051 SDMCRGILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSAD 1110
            SD+CRGILEF EG+PLG+SGLRWLKIH+ANLYAGGVDKL+Y+DRI+FTE+HL++IFDS+D
Sbjct: 601  SDLCRGILEFCEGKPLGKSGLRWLKIHIANLYAGGVDKLAYEDRIAFTESHLEDIFDSSD 660

Query: 1111 RPLEGNRWWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGR 1170
            RPLEG RWWL AEDPFQCLA CINLSEALRSP PE  +SH+P+HQDGSCNGLQHYAALGR
Sbjct: 661  RPLEGKRWWLNAEDPFQCLAACINLSEALRSPFPEAAISHIPIHQDGSCNGLQHYAALGR 720

Query: 1171 DKLSLN--------NPAGV----NAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLV 1230
            DKL  +         PA V     A VL I++ DAE+DP T PNA +A+L+++QVDRKLV
Sbjct: 721  DKLGADAVNLVTGEKPADVYTEIAARVLKIMQQDAEEDPETFPNATYAKLMLDQVDRKLV 780

Query: 1231 KQTVMTSVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYA----------------- 1290
            KQTVMTSVYGVTY GARDQIKKRLKER +  DDS  F ASCYA                 
Sbjct: 781  KQTVMTSVYGVTYSGARDQIKKRLKERGTFEDDSLTFHASCYAAKITLKALEEMFEAARA 840

Query: 1291 --------ARVIASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMAT 1334
                    A++IASEN  V WTTPLGLPVVQPYR+ GRHL+KT+LQVL L RETDKVMA 
Sbjct: 841  IKSWFGDCAKIIASENNAVCWTTPLGLPVVQPYRKPGRHLVKTTLQVLTLSRETDKVMAR 900

BLAST of Cp4.1LG09g02310 vs. TAIR 10
Match: AT5G15700.1 (DNA/RNA polymerases superfamily protein)

HSP 1 Score: 1009.2 bits (2608), Expect = 3.3e-294
Identity = 545/964 (56.54%), Positives = 657/964 (68.15%), Query Frame = 0

Query: 451  MWRNIVKRAASRRDTFFSRK------VIEDSISVEEIRSSRTLGSLKSCCLLEGSRGISF 510
            MWRNI K+A SR     +        ++    S+     S     L S C  +G R +S 
Sbjct: 40   MWRNIAKQAISRSAARLNVSSQTRGLLVSSPESIFSKNLSFRFPVLGSPCHGKGFRCLSG 99

Query: 511  FHSREVGFAKSNFAHSSEPVAFYGVLNHAKGYATAAEAAISEEDLSGSEEIQELMEELSK 570
               RE  F+KS    S       G L  A+GY + AE  +   D+    E+ EL++E+ K
Sbjct: 100  ITRRE-EFSKSERCLS-------GTL--ARGYTSVAEEEVLSTDVEEEPEVDELLKEMKK 159

Query: 571  QDKVESH--FKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLA 630
            + K ESH  ++  K+   GM   K+  L +RQ+K+ETE WE AA EY ELLTDMCEQKLA
Sbjct: 160  EKKRESHRSWRMKKQDQFGMGRTKFQNLWRRQVKIETEEWERAAAEYMELLTDMCEQKLA 219

Query: 631  PNLPYIKSLFLGWFEPLRDAIAADQEYDKGAHLFLPSYVMRIHGAK-QQREAVKRVPKKQ 690
            PNLPY+KSLFLGWFEPLRDAIA DQE            + R+  +K      + ++P  +
Sbjct: 220  PNLPYVKSLFLGWFEPLRDAIAKDQE------------LYRLGKSKATYAHYLDQLPADK 279

Query: 691  LEPVFEALDTLGRTKWRVNKRVLSIIDRIWASGG------RLADLVDRXTEGD--TEPVV 750
            +  V      +G      +   + ++      G       R+   +D+  +GD   E   
Sbjct: 280  IS-VITMHKLMGHLMTGGDNGCVKVVHAACTVGDAIEQEIRICTFLDKKKKGDDNEESGG 339

Query: 751  EDQEKVAKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIE 810
             + E   KEQDKLRKKV  L+KKQKL  VR I++ HD+ KPW  D   KVG RLI+LL+ 
Sbjct: 340  VENETSMKEQDKLRKKVNELIKKQKLSAVRKILQSHDYTKPWIADVRAKVGSRLIELLVR 399

Query: 811  TAYIQPPMDQLGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTARHM 870
            TAYIQ P DQ     PD+RPAFVHT K + K +  + R+YGVIECDPLVR+GLEK+ R+ 
Sbjct: 400  TAYIQSPADQQDNDLPDVRPAFVHTFK-VAKGSMNSGRKYGVIECDPLVRKGLEKSGRYA 459

Query: 871  VIPYMPMLVPPLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTL 930
            V+PYMPMLVPPL W+GYDKGA+LFL SY+M+ HGAKQQREA+K  PK QL+PVFEALDTL
Sbjct: 460  VMPYMPMLVPPLKWSGYDKGAYLFLTSYIMKTHGAKQQREALKSAPKGQLQPVFEALDTL 519

Query: 931  GRTKWR---------------------------VPLPEEPTVEDEAEIRKWKWKLKAAKK 990
            G TKWR                           VPLPE+P  EDE  ++KWKW++K+AKK
Sbjct: 520  GSTKWRVNKRVLTVVDRIWSSGGCVADMVDRSDVPLPEKPDTEDEGILKKWKWEVKSAKK 579

Query: 991  ENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGILE 1050
             NSERHSQRCD ELKL+VARKMK+EE FYYPHN+DFRGRAYPM PHLNHLGSD+CRG+LE
Sbjct: 580  VNSERHSQRCDTELKLSVARKMKDEEAFYYPHNMDFRGRAYPMPPHLNHLGSDLCRGVLE 639

Query: 1051 FAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGNRWW 1110
            FAEGRP+G SGLRWLKIHLANLYAGGVDKLS   R++FTENHLD+IFDSADRPLEG+RWW
Sbjct: 640  FAEGRPMGISGLRWLKIHLANLYAGGVDKLSLDGRLAFTENHLDDIFDSADRPLEGSRWW 699

Query: 1111 LGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSL---- 1170
            L AEDPFQCLAVCI+L+EALRSPSPET LSH+P+HQDGSCNGLQHYAALGRD L      
Sbjct: 700  LQAEDPFQCLAVCISLTEALRSPSPETVLSHIPIHQDGSCNGLQHYAALGRDTLGAEAVN 759

Query: 1171 ----NNPA----GVNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTSVY 1230
                  PA    G+   VLDI+R DA++DP   P AL AR L+NQVDRKLVKQTVMTSVY
Sbjct: 760  LVAGEKPADVYSGIATRVLDIMRRDADRDPEVFPEALRARKLLNQVDRKLVKQTVMTSVY 819

Query: 1231 GVTYMGARDQIKKRLKERASIADDSQLFAASCYAARV----------------------- 1290
            GVTY+GARDQIK+RLKER+   D+ ++F A+CYAA+V                       
Sbjct: 820  GVTYIGARDQIKRRLKERSDFGDEKEVFGAACYAAKVTLAAIDEMFQAARAIMRWFGECA 879

Query: 1291 --IASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFPPN 1334
              IASEN+ VRWTTPLGLPVVQPY Q+G  L+KTSLQ L+LQ ETD+V+  RQ+TAFPPN
Sbjct: 880  KIIASENETVRWTTPLGLPVVQPYHQMGTKLVKTSLQTLSLQHETDQVIVRRQRTAFPPN 939

BLAST of Cp4.1LG09g02310 vs. TAIR 10
Match: AT5G15700.2 (DNA/RNA polymerases superfamily protein)

HSP 1 Score: 1009.2 bits (2608), Expect = 3.3e-294
Identity = 545/964 (56.54%), Positives = 657/964 (68.15%), Query Frame = 0

Query: 451  MWRNIVKRAASRRDTFFSRK------VIEDSISVEEIRSSRTLGSLKSCCLLEGSRGISF 510
            MWRNI K+A SR     +        ++    S+     S     L S C  +G R +S 
Sbjct: 40   MWRNIAKQAISRSAARLNVSSQTRGLLVSSPESIFSKNLSFRFPVLGSPCHGKGFRCLSG 99

Query: 511  FHSREVGFAKSNFAHSSEPVAFYGVLNHAKGYATAAEAAISEEDLSGSEEIQELMEELSK 570
               RE  F+KS    S       G L  A+GY + AE  +   D+    E+ EL++E+ K
Sbjct: 100  ITRRE-EFSKSERCLS-------GTL--ARGYTSVAEEEVLSTDVEEEPEVDELLKEMKK 159

Query: 571  QDKVESH--FKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCEQKLA 630
            + K ESH  ++  K+   GM   K+  L +RQ+K+ETE WE AA EY ELLTDMCEQKLA
Sbjct: 160  EKKRESHRSWRMKKQDQFGMGRTKFQNLWRRQVKIETEEWERAAAEYMELLTDMCEQKLA 219

Query: 631  PNLPYIKSLFLGWFEPLRDAIAADQEYDKGAHLFLPSYVMRIHGAK-QQREAVKRVPKKQ 690
            PNLPY+KSLFLGWFEPLRDAIA DQE            + R+  +K      + ++P  +
Sbjct: 220  PNLPYVKSLFLGWFEPLRDAIAKDQE------------LYRLGKSKATYAHYLDQLPADK 279

Query: 691  LEPVFEALDTLGRTKWRVNKRVLSIIDRIWASGG------RLADLVDRXTEGD--TEPVV 750
            +  V      +G      +   + ++      G       R+   +D+  +GD   E   
Sbjct: 280  IS-VITMHKLMGHLMTGGDNGCVKVVHAACTVGDAIEQEIRICTFLDKKKKGDDNEESGG 339

Query: 751  EDQEKVAKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRLIQLLIE 810
             + E   KEQDKLRKKV  L+KKQKL  VR I++ HD+ KPW  D   KVG RLI+LL+ 
Sbjct: 340  VENETSMKEQDKLRKKVNELIKKQKLSAVRKILQSHDYTKPWIADVRAKVGSRLIELLVR 399

Query: 811  TAYIQPPMDQLGGGPPDIRPAFVHTLKTITKEAQKTSRRYGVIECDPLVRRGLEKTARHM 870
            TAYIQ P DQ     PD+RPAFVHT K + K +  + R+YGVIECDPLVR+GLEK+ R+ 
Sbjct: 400  TAYIQSPADQQDNDLPDVRPAFVHTFK-VAKGSMNSGRKYGVIECDPLVRKGLEKSGRYA 459

Query: 871  VIPYMPMLVPPLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPVFEALDTL 930
            V+PYMPMLVPPL W+GYDKGA+LFL SY+M+ HGAKQQREA+K  PK QL+PVFEALDTL
Sbjct: 460  VMPYMPMLVPPLKWSGYDKGAYLFLTSYIMKTHGAKQQREALKSAPKGQLQPVFEALDTL 519

Query: 931  GRTKWR---------------------------VPLPEEPTVEDEAEIRKWKWKLKAAKK 990
            G TKWR                           VPLPE+P  EDE  ++KWKW++K+AKK
Sbjct: 520  GSTKWRVNKRVLTVVDRIWSSGGCVADMVDRSDVPLPEKPDTEDEGILKKWKWEVKSAKK 579

Query: 991  ENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSDMCRGILE 1050
             NSERHSQRCD ELKL+VARKMK+EE FYYPHN+DFRGRAYPM PHLNHLGSD+CRG+LE
Sbjct: 580  VNSERHSQRCDTELKLSVARKMKDEEAFYYPHNMDFRGRAYPMPPHLNHLGSDLCRGVLE 639

Query: 1051 FAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRPLEGNRWW 1110
            FAEGRP+G SGLRWLKIHLANLYAGGVDKLS   R++FTENHLD+IFDSADRPLEG+RWW
Sbjct: 640  FAEGRPMGISGLRWLKIHLANLYAGGVDKLSLDGRLAFTENHLDDIFDSADRPLEGSRWW 699

Query: 1111 LGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDKLSL---- 1170
            L AEDPFQCLAVCI+L+EALRSPSPET LSH+P+HQDGSCNGLQHYAALGRD L      
Sbjct: 700  LQAEDPFQCLAVCISLTEALRSPSPETVLSHIPIHQDGSCNGLQHYAALGRDTLGAEAVN 759

Query: 1171 ----NNPA----GVNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQTVMTSVY 1230
                  PA    G+   VLDI+R DA++DP   P AL AR L+NQVDRKLVKQTVMTSVY
Sbjct: 760  LVAGEKPADVYSGIATRVLDIMRRDADRDPEVFPEALRARKLLNQVDRKLVKQTVMTSVY 819

Query: 1231 GVTYMGARDQIKKRLKERASIADDSQLFAASCYAARV----------------------- 1290
            GVTY+GARDQIK+RLKER+   D+ ++F A+CYAA+V                       
Sbjct: 820  GVTYIGARDQIKRRLKERSDFGDEKEVFGAACYAAKVTLAAIDEMFQAARAIMRWFGECA 879

Query: 1291 --IASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQKTAFPPN 1334
              IASEN+ VRWTTPLGLPVVQPY Q+G  L+KTSLQ L+LQ ETD+V+  RQ+TAFPPN
Sbjct: 880  KIIASENETVRWTTPLGLPVVQPYHQMGTKLVKTSLQTLSLQHETDQVIVRRQRTAFPPN 939

BLAST of Cp4.1LG09g02310 vs. TAIR 10
Match: AT2G24120.1 (DNA/RNA polymerases superfamily protein)

HSP 1 Score: 872.5 bits (2253), Expect = 4.8e-253
Identity = 453/851 (53.23%), Positives = 579/851 (68.04%), Query Frame = 0

Query: 559  MEELSKQDKVESHFKQPKKMVDGMRVGKYNILRKRQIKMETEAWEEAAREYQELLTDMCE 618
            ++ LSK   V+   K  +K +D     K++ LR+RQ+K ETEAWE    EY++L  +MCE
Sbjct: 136  LKGLSKM--VDQTLKIERKDIDKR---KFDSLRRRQVKEETEAWERMVDEYRDLEKEMCE 195

Query: 619  QKLAPNLPYIKSLFLGWFEPLRDAIAADQEYDK----------GAHL-FLPSYVMRIHGA 678
            + LAPNLPY+K +FLGWF+PL+D I  +Q+  K            H+  LP+  M +   
Sbjct: 196  KNLAPNLPYVKHMFLGWFQPLKDVIEREQKLQKNKSKKVRAAYAPHIELLPADKMAVIVM 255

Query: 679  KQQREAVKRVPKKQLEPVFEALDTLGRTKWRVNKRVLSIIDRIWASGGRLADLVDRXTEG 738
             +    V    +     V +A  ++G          ++I   +     R+ + + R  + 
Sbjct: 256  HKMMGLVMSGHEDGCIQVVQAAVSIG----------IAIEQEV-----RIHNFLKRTRKN 315

Query: 739  DTEPVVEDQEKVAKEQDKLRKKVTNLMKKQKLRQVRMIVKEHDHLKPWGQDAHVKVGCRL 798
            +      D ++  KE+  LRK+V +L++++++     +VK  +  KPWG+    K+G RL
Sbjct: 316  N----AGDSQEELKEKQLLRKRVNSLIRRKRIIDALKVVKS-EGTKPWGRATQAKLGSRL 375

Query: 799  IQLLIETAYIQPPMDQLGGGPPDIRPAFVHTLKTITK-EAQKTSRRYGVIECDPLVRRGL 858
            ++LLIE AY+QPP+ Q G   P+ RPAF H  KT+TK    K  RRYGVIECD L+  GL
Sbjct: 376  LELLIEAAYVQPPLTQSGDSIPEFRPAFRHRFKTVTKYPGSKLVRRYGVIECDSLLLAGL 435

Query: 859  EKTARHMVIPYMPMLVPPLNWTGYDKGAHLFLPSYVMRIHGAKQQREAVKRVPKKQLEPV 918
            +K+A+HM+IPY+PMLVPP  W GYDKG +LFLPSY+MR HG+K+Q++A+K +  K    V
Sbjct: 436  DKSAKHMLIPYVPMLVPPKRWKGYDKGGYLFLPSYIMRTHGSKKQQDALKDISHKTAHRV 495

Query: 919  FEALDTLGRTKWR---------------------------VPLPEEPTVEDEAEIRKWKW 978
            FEALDTLG TKWR                           VP+PE+P+ ED  E++ WKW
Sbjct: 496  FEALDTLGNTKWRVNRNILDVVERLWADGGNIAGLVNREDVPIPEKPSSEDPEELQSWKW 555

Query: 979  KLKAAKKENSERHSQRCDIELKLAVARKMKEEEGFYYPHNLDFRGRAYPMHPHLNHLGSD 1038
              + A K N ERHS RCD+ELKL+VARKMK+EEGFYYPHNLDFRGRAYPMHPHLNHL SD
Sbjct: 556  SARKANKINRERHSLRCDVELKLSVARKMKDEEGFYYPHNLDFRGRAYPMHPHLNHLSSD 615

Query: 1039 MCRGILEFAEGRPLGESGLRWLKIHLANLYAGGVDKLSYKDRISFTENHLDEIFDSADRP 1098
            +CRG LEFAEGRPLG+SGL WLKIHLANLYAGGV+KLS+  R++F ENHLD+I DSA+ P
Sbjct: 616  LCRGTLEFAEGRPLGKSGLHWLKIHLANLYAGGVEKLSHDARLAFVENHLDDIMDSAENP 675

Query: 1099 LEGNRWWLGAEDPFQCLAVCINLSEALRSPSPETTLSHMPVHQDGSCNGLQHYAALGRDK 1158
            + G RWWL AEDPFQCLA C+ L++AL+SPSP + +SH+P+HQDGSCNGLQHYAALGRD 
Sbjct: 676  IHGKRWWLKAEDPFQCLAACVILTQALKSPSPYSVISHLPIHQDGSCNGLQHYAALGRDS 735

Query: 1159 L---SLNNPAG---------VNAIVLDIIRSDAEKDPATNPNALHARLLINQVDRKLVKQ 1218
                ++N  AG         ++  V +I++ D+ KDP +NP A  A++LI QVDRKLVKQ
Sbjct: 736  FEAAAVNLVAGEKPADVYSEISRRVHEIMKKDSSKDPESNPTAALAKILITQVDRKLVKQ 795

Query: 1219 TVMTSVYGVTYMGARDQIKKRLKERASIADDSQLFAASCYAARV---------------- 1278
            TVMTSVYGVTY+GAR+QIK+RL+E+  I D+  LFAA+CY+A+V                
Sbjct: 796  TVMTSVYGVTYVGAREQIKRRLEEKGVITDERMLFAAACYSAKVTLAALGEIFEAARAIM 855

Query: 1279 ---------IASENQPVRWTTPLGLPVVQPYRQLGRHLIKTSLQVLALQRETDKVMATRQ 1334
                     IAS+N PVRW TPLGLPVVQPY +  RHLI+TSLQVLALQRE + V   +Q
Sbjct: 856  SWLGDCAKIIASDNHPVRWITPLGLPVVQPYCRSERHLIRTSLQVLALQREGNTVDVRKQ 915
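
Each alignment above is introduced by a two-line HSP summary (bit score, raw score, E-value, identity and positives counts). The following is a minimal sketch of how those summary lines could be read programmatically. It assumes the exact text layout shown on this page; the names `HSP_RE` and `parse_hsp` are hypothetical and not part of any BLAST tool, and the pattern tolerates spelling variants of the "Positives" label.

```python
import re

# Pattern for the two-line HSP summary as it appears on this page (assumed layout).
HSP_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+) bits \((?P<raw>\d+)\), Expect = (?P<evalue>\S+)\s+"
    r"Identity = (?P<ident>\d+)/(?P<length>\d+) \([\d.]+%\), "
    r"Pos\w*tives = (?P<pos>\d+)/\d+"
)


def parse_hsp(text):
    """Return score, E-value and recomputed percentages for the first HSP in text."""
    m = HSP_RE.search(text)
    if m is None:
        raise ValueError("no HSP summary found")
    length = int(m["length"])  # alignment length, the denominator of both ratios
    return {
        "bits": float(m["bits"]),
        "raw_score": int(m["raw"]),
        "evalue": float(m["evalue"]),
        "identity_pct": round(100 * int(m["ident"]) / length, 2),
        "positives_pct": round(100 * int(m["pos"]) / length, 2),
    }


# Summary lines copied from the AT1G68990.1 hit on this page.
example = (
    "HSP 1 Score: 1034.6 bits (2674), Expect = 7.3e-302\n"
    "Identity = 556/966 (57.56%), Postives = 669/966 (69.25%), Query Frame = 0"
)
print(parse_hsp(example))
```

The percentages are recomputed from the counts rather than read from the parentheses, which gives a quick consistency check against the printed values.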

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q8L6J5 | 0.0e+00 | 65.80 | DNA-directed RNA polymerase 1B, mitochondrial OS=Nicotiana tabacum OX=4097 GN=RP...
Q93Y94 | 0.0e+00 | 65.23 | DNA-directed RNA polymerase 1, mitochondrial OS=Nicotiana sylvestris OX=4096 GN=...
Q8VWF8 | 0.0e+00 | 59.28 | DNA-directed RNA polymerase 2, chloroplastic/mitochondrial OS=Nicotiana sylvestr...
Q8L6J3 | 2.9e-312 | 58.85 | DNA-directed RNA polymerase 2B, chloroplastic/mitochondrial OS=Nicotiana tabacum...
P92969 | 1.0e-300 | 57.56 | DNA-directed RNA polymerase 1, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=...

Match Name | E-value | Identity | Description
KAG6574029.1 | 0.0 | 65.44 | DNA-directed RNA polymerase 2, chloroplastic/mitochondrial, partial [Cucurbita a...
XP_023542839.1 | 0.0 | 67.02 | DNA-directed RNA polymerase 1B, mitochondrial-like [Cucurbita pepo subsp. pepo]
XP_022944908.1 | 0.0 | 65.96 | DNA-directed RNA polymerase 1B, mitochondrial-like [Cucurbita moschata]
XP_023542836.1 | 0.0 | 66.01 | DNA-directed RNA polymerase 1B, mitochondrial-like [Cucurbita pepo subsp. pepo] ...
KAG6574025.1 | 0.0 | 65.56 | DNA-directed RNA polymerase 2, chloroplastic/mitochondrial, partial [Cucurbita a...

Match Name | E-value | Identity | Description
A0A6J1FZC4 | 0.0 | 65.96 | DNA-directed RNA polymerase OS=Cucurbita moschata OX=3662 GN=LOC111449299 PE=3 S...
A0A6J1FZD9 | 0.0 | 65.34 | DNA-directed RNA polymerase OS=Cucurbita moschata OX=3662 GN=LOC111449300 PE=3 S...
A0A6J1HS83 | 0.0 | 64.33 | DNA-directed RNA polymerase OS=Cucurbita maxima OX=3661 GN=LOC111467347 PE=3 SV=...
A0A6J1D747 | 0.0 | 58.75 | DNA-directed RNA polymerase OS=Momordica charantia OX=3673 GN=LOC111017540 PE=3 ...
A0A5A7SYG2 | 0.0 | 68.87 | DNA-directed RNA polymerase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaf...

Match Name | E-value | Identity | Description
AT1G68990.1 | 7.3e-302 | 57.56 | male gametophyte defective 3
AT1G68990.2 | 9.0e-300 | 57.14 | male gametophyte defective 3
AT5G15700.1 | 3.3e-294 | 56.54 | DNA/RNA polymerases superfamily protein
AT5G15700.2 | 3.3e-294 | 56.54 | DNA/RNA polymerases superfamily protein
AT2G24120.1 | 4.8e-253 | 53.23 | DNA/RNA polymerases superfamily protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 592..612
None | No IPR available | COILS | Coil | Coil | coord: 189..209
None | No IPR available | GENE3D | 1.10.287.280 | | coord: 1010..1088; e-value: 4.7E-28; score: 98.8
None | No IPR available | GENE3D | 1.10.150.20 | | coord: 1124..1214; e-value: 1.1E-15; score: 59.9
None | No IPR available | GENE3D | 3.30.70.370 | | coord: 1232..1349; e-value: 1.6E-25; score: 92.0
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1425..1445
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 317..340
None | No IPR available | PANTHER | PTHR10102:SF7 | DNA-DIRECTED RNA POLYMERASE | coord: 649..725
None | No IPR available | PANTHER | PTHR10102:SF7 | DNA-DIRECTED RNA POLYMERASE | coord: 1208..1332
None | No IPR available | PANTHER | PTHR10102:SF7 | DNA-DIRECTED RNA POLYMERASE | coord: 65..450
    coord: 468..648
None | No IPR available | PANTHER | PTHR10102:SF7 | DNA-DIRECTED RNA POLYMERASE | coord: 919..1209
    coord: 726..919
IPR029262 | DNA-directed RNA polymerase, N-terminal | SMART | SM01311 | RPOL_N_2 | coord: 189..489; e-value: 3.4E-67; score: 239.2
    coord: 592..914; e-value: 1.6E-85; score: 300.1
IPR029262 | DNA-directed RNA polymerase, N-terminal | PFAM | PF14700 | RPOL_N | coord: 190..451; e-value: 2.2E-51; score: 175.3
    coord: 737..914; e-value: 3.4E-44; score: 151.6
    coord: 648..694; e-value: 1.9E-11; score: 44.0
    coord: 593..648; e-value: 7.2E-9; score: 35.5
IPR037159 | DNA-directed RNA polymerase, N-terminal domain superfamily | GENE3D | 1.10.1320.10 | | coord: 731..913; e-value: 3.8E-35; score: 123.5
    coord: 570..649; e-value: 2.3E-14; score: 55.3
    coord: 167..461; e-value: 4.5E-54; score: 185.7
IPR002092 | DNA-directed RNA polymerase, phage-type | PFAM | PF00940 | RNA_pol | coord: 1214..1346; e-value: 1.4E-50; score: 172.4
    coord: 1010..1212; e-value: 2.4E-71; score: 240.9
IPR002092 | DNA-directed RNA polymerase, phage-type | PANTHER | PTHR10102 | DNA-DIRECTED RNA POLYMERASE, MITOCHONDRIAL | coord: 1208..1332
IPR002092 | DNA-directed RNA polymerase, phage-type | PANTHER | PTHR10102 | DNA-DIRECTED RNA POLYMERASE, MITOCHONDRIAL | coord: 726..919
IPR002092 | DNA-directed RNA polymerase, phage-type | PANTHER | PTHR10102 | DNA-DIRECTED RNA POLYMERASE, MITOCHONDRIAL | coord: 65..450
    coord: 468..648
    coord: 649..725
    coord: 919..1209
IPR002092 | DNA-directed RNA polymerase, phage-type | PROSITE | PS00900 | RNA_POL_PHAGE_1 | coord: 1098..1109
IPR002092 | DNA-directed RNA polymerase, phage-type | PROSITE | PS00489 | RNA_POL_PHAGE_2 | coord: 1159..1173
IPR043502 | DNA/RNA polymerase superfamily | SUPERFAMILY | 56672 | DNA/RNA polymerases | coord: 580..744
IPR043502 | DNA/RNA polymerase superfamily | SUPERFAMILY | 56672 | DNA/RNA polymerases | coord: 740..1339
IPR043502 | DNA/RNA polymerase superfamily | SUPERFAMILY | 56672 | DNA/RNA polymerases | coord: 177..454

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG09g02310.1 | Cp4.1LG09g02310.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006390 | mitochondrial transcription
biological_process | GO:0006351 | transcription, DNA-templated
cellular_component | GO:0034245 | mitochondrial DNA-directed RNA polymerase complex
molecular_function | GO:0003677 | DNA binding
molecular_function | GO:0003899 | DNA-directed 5'-3' RNA polymerase activity