Carg21647 (gene) Silver-seed gourd (SMH-JMG-627) v2

Overview
NameCarg21647
Typegene
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionDNA-directed RNA polymerase
LocationCarg_Chr06: 3720946 .. 3733810 (+)
RNA-Seq ExpressionCarg21647
SyntenyCarg21647
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GAACAGGGAAGAGGCCGGCGTCGATACTCAATCCCTTTCCGCCCCCTTCTTCTCTTCCCTCTTCCCTCTTCCCTCTTCCCTCTTCCCCGTTTTGGAACAACCCCTTTTCCCTTTCTTTCCTTCACTTTTCCATAACGGTTTTACAGCTCCCCCACCCCCTTTGCCAGAACTGTAAACCAAGAAATTTCCTTCCTGGGGTTGCGCGCTTTGATTCCTCTGAAGCCCCACATTGAACCACCTCCTGCTCTTTCTCTCACTTTCGAGTATGTTATCTCTCCTTTCTAGATTTTGATTTCTGGGTTCTAATGGGGTTTTCCTTTGCATGTCTAAAATGTGGGATCTTTGCACTTGTACCACTGCATTGCATTTGGGTTTCTTGCTTTTGGTTGATCATGCCGATTTTCTGCATTTTCTTGGTGTTCTTGGTTCCTGGGATCATTCGGATGGTGAACGGAATGAGTGTTGTTTGTTTGGAGGCTATATAGCATGTCGATCATGAAGTTTTGAGGTTTTATAAAGGAGCCCTTTGCCCCAAATGTTGAAATTTTCCCATCAAAGAGTTAAGCTCATTGAACTAATGGAAGCAACAAGTTTTGCTTTACTTTTGAAACATTCATGATTCTGTTTATGTTTCTTTATACATCAACATTCTCTAATGCCGAGGACTTAAACTTTTATGTGCAAAATCAGTGCATTTTCTACTTCAAGTCCTAAGGATGAGTCAAAGTTATAATATTTGAAGGCACAACAGCTTGGCCTGACACATGAATATGAGTTTGTTGTCTTATTTATTTCTTCAGCTGCTGAATTGGCTTTGCTCGGCGAAGAAGTTGTATGAATTTCGCTGGAGGCTTTCGATCGGGCCAGAAGGTATAATGTTTAGGATGCTGTAGTTCTAGAGTTGAATAATCCCTTAACTTATATTGTATTAAGCAACTCGTCGTATACTTTTATCGTCTAAATTTTCGACTTAAACATGTCTCTTTTTCTCTTGAGGCTTCTAGAGTATGTAGAGTTTGCCTAGGCCTGTTGAACTGATCTCTGCTTATCTTCATCATGCCCTAATGCTTCGCTATTCTTACTTGCACAAGCTTTGTTAGTTCATCAACTTTATTTTCCATCATCAAGGGCTATTATTTTGCTTCTTTCTTTTTATCTTTCTAATAATGTTTTTGTTGTAGCTAGAGCCCTTTTAGTTTCAGTACGCTCCAGATCAAGTTTTACATCGACTTCCTATATCACCTTTTTAAAGTTACAGAAGGCTTCTATTAAATGTCATTGTTATTTTGGGTTTCCGATCAACATCTCTCCACGTCTCCCTTTGTGATTTAGATGCTGTATAGCTTATTTAATTTATTTCAAATGCACTAGGCGATGAACCATATGGAAGACGAACAGGATAGCGAGCTACAAATTCCATCTGGTGTCCTTGTTGGCGTAAACTTTAGTGTCTCAACTCAGCAAGATATGGTAAGTGTTTTAATGGTGATATGATAAATACATATACTGATTTAGGAAGTCAATAATTTTAAAAAACAATCACTTGTTATGAAAAGTTTGTTTTTTGTTCAGAAAAGAAAACCGAACAATCACTATCTTGTTATTTTCGGGGCCAGGGTTTATGATAGTGTTTTGGGTTTTGGATTACCTGGATATATATGTTATGAACAGTACTAGTTGAATGGGTTAGTTGAATGGATTTGAGCTTTTCATCAAAGGAACAGTCGCTTGCTACTAAATGGCGGTGGATTCGCTCAAGAAAAAGGATCTCAATGAAGGAGAGTTATTGCTAGCATATATTAGACAATGCCCTCGGTTGGCTACCTTTAAATCCAATAGGTAAATCGAGGGAGGGCCTAGGTTCGAAATTGCAAAGAATATCTTCTCCCTCAAATTTATGGCTTTCAAGGTTAACCATGGGTTTTGCAACTTTCGGTGTCGATGCATTTCTTCAAACAATTTCGAGACAACAATTGTACAATCCTGTCTTTTGAAACAAAGGCACCCAACAGGGAGGGTGGTTCTTTGAATTCATTGTCTGGGCATGTACTGGAGGAAAGAAGCTTTTACAATATTTTGGGTCAGGCAGCAGTGGAAAAATTAAGTCTCCTACCATCAATTATGATGTTGTGAGGGTGAAAAAATATTTTGGACGTCTTTCCTTAGAGCTATATTTAGTTGGTGCCAGTCGTCACATTTGCTTCAATATCACAAGAAACTTCTACATCATCCCAACATTTTTGGCTGGGATACAGTGTTCTAGCAATCTCTCAACCAACATGGGATCTTGACTGCAACACAACTAACATTCTTCATTTCATTTTAGTGGGTTTCCAATATAAGTATGATGGACAACTCCCTACTTTCTCTCGTTGCCCTACACATTTTCTTATCTTAGTTTTTATTCCATTTATTTTTGTTTCATTTACAAAAAAATTGTTGTTGGATAAGAAACCGAAGGTGCCCTTGTGAGTTCATACGTGCCTTAAATTTTGTATGTCTCTGTGTCGCTACATATATACTGGATTTGACTTTGATATTTCTGATGTGATCGATAAAACAGGAGAATATAGCAGTGATAAACATTGAGGCAGCCTGCGAGGTGTCTGATCCTAAGTTGGGACTTCCTAATCCATCTTATCAGTGCACTACATGTGGTGCGAGCGTTCTCAAATGTTGTGAAGGTAATTAACTAGTCACAGTATAAAGATATTACTTGTCGACTTTCCTTTCACACATTGGAGTGTACTATCGTTGTTTTCCTTGAATTGTTTTGTAAGGTCATTTTGGGGCTATCAAATTTCCGTATACTATAATCCATCCGTATTTTCTCTCGGAAGTTGCACAAGTGTTGAATAAAGTTTGTCCAGGGTGTAAATCTATCCGGCGGGAACTATGGGGCAAGGTCAGGTACCTCACTTATTGTGATGAACCTTGAATTGTTATGATTAGTTTAAAAGTTACGAATTTCTGCTATTAAACAATGTGGTTACCTTCACCACGTGAATATGTAGATTCTAGTAGGGGACAAGAAACAGTATCAATAACTGAAAGAAAAACCGGAAATTTGTTTCTCGTCTTTGAATCTAAATATAAAAATTATTTTTAAATGCATATTTTGATTATGTTTACATGTACCAGGTTGAGGATCCCACGTCTGATTTTCATCGACCTAAAGGTTGCAGATATTGTTTTGTAAGTTCGTTTAGCCCTCTTTGTTTTTCTATTTCGTTCTTAATTGTTTATCGTTTGAGAACTGTTCTGATGAAAAAAGAATCTTTTGAAACCATGATCTATGTTGCTCAACAGGGAAGTCTTAAGGATTGGTACCCGCCTATGAGGTTTAAGCTTTCAACTACTGATATGTTCAGAAAAAGTATGATTATGGTGGAAGTGAAAGAAAACATGTCGAAGAAGTACCAAAAGAGGGTGGCTAGGGGAGGATTGCCTCCTGATTATTGGAATTTTATCCCGAAAGATGAACAGCAGGAGGAAAGTTATTGTAGACCGAACAGGAAAGTCCTAACACATGCTCAGGTATTGGGCCTCTTTATGAAATTGACTAGATTTGGTTTGTTTAGGCTTTTCGGCTATACACATTTTCATTGTTTTCTCTATTTGTTTTCGTTATAAACTTTAGAAACCATTATTTTTCTTGATACAAATGAACCTGATTTGCTTAATGATGCAAGTACTATGTGTTGAAGTATTGATATAATTAAATTTACCATAACCCACTCGGTTAAGCTTTTGGGTGGATCGTGATTTAACAATATATTCTCTTTTCATAGGGGGTGTTTCCAATTTTTTAAAAATAATGTGGAAGACCATCCTTATCCAAAGCAATAAAGGATTGGAAGTCCTTATTGTAGTCTTCTTGGAAGCTTCACCACTTCTCTTGGCTAATGTGGTACCCTTGAGGAATTAGCTCTTTTCGGCTAAGCACTTAGTTCACTTTCCGTAGCCTCCATAGTTTTCTTTTGCAGACAACTTATTACTCGGGGTATAGTTTTTCAGGCGACTTTTCCATGGTGAAACCAACACCAATGATAAAGCAAGGAATCGGTCTTTCTCCCTTAATCTTTCATAATGCAGCTTTCGGAATACTCGTGGGAAAATTAGGTTAATCTTTTCCTATGCCCTGTTTGCCTAAGGGTTAAAATCGATCAATTTATTCAACTTTCATGTCCCTTAGTGTTTGTATGTGAGGTCATCCTCAATTTTTGTTGTAATGACATGAAAGATCAAGCTAAATTCTCGTGGATCGAAGGTTTTTAAGCGCTTTGGGTGCATTGGTTAGAAAAGAGTAGAGTTTTAAACTTATGTGTTTAGAAACCTTTGCATAGAGAGTCATTGTGAACAAACTCGGCCCGTTCCCTTTTGTATGGATCTCGAGTGGAATCAAAGGCACTCCCAAAGGGGATAGAGCTCTCTGCGCTTTGTTTCCTCGTTTATATCAATTATCCTTGACGAGGAACTGTTTGGTTGTAACGGTTTGTCTCAACCCCATGAACCTTGCCTTCCTTAGGGTTTTGTTGTGCTTTGTTCAATATGAGTCCTTCTGACACTTGTTATTGTTAGATGTGTCAAATATTTAGCCTAATAAACAAATGAATGAAATCTATAAGGAATTGAATATTGCTCAGAGAATACTTAAGCCTTCTATAAACTTTATCGTTTCTCACAACACGCTTCAACAGATATTTTGCCAAATAAAAATCATCTCTCCATAGCCCAAATGCTCGATTCGATCACGGAGATAACTTGCCAAATGAAAAACTTAATCTTCTTGAAAATTTTTACCTTCCACAATGAGACAAAGACAAAGACATCCAAGGTAGAGGTATTAATTAGACAAGAAAAAGAAGGATTAACATGGAAAACCGTTGGTCGCCCTAGAACTCTAAATACCGAAATCCTTACTCCTTGAAGTTCAACAGAACTTCACAAGCAATTAAAGAAAATCAATAACATTTGTTGTTTCCCTATTAAACAATGGACTAAGGAATTCCCAAAATAGGAGATAATGTCGGATGAGAAAAATAGTAGCCTCCGACTGATTCTTTATTTGATAAACGGTAAGACTTGAAAATACAAATGCAGAGGGGACTATCCCTAACCAATAGTCCTCCCAAAAGAAAAGTATCCCTTTGTTCCCCTCACAAAACAATGGAAGAAAAGAGAACAGAGAAAGGAGTGGACTATCCTGACCCAATAAACTCTATGATGCTGATACTCATATTAGTGCTTTAAAATTATCAATTTTACCAATACCATGTCATGGTTTTTCTGTGTTGCAGTGTCAAAAGTCTTGAACAGACTGGAGAATTTAACGAAAATTATAAATTAGTGGTGTTCGTTATCTTGTACTTTGCTCCACCGATGCATTTTAAAATTGATTTGGTTTCAACTTTTCAGTTTTAGTGTTTCTTTGTCATTTATACCTTCTGCATTTCTAGTTGGAAGCTTTTAAATCCTTTACAGTATTGGAAACCTATGATATTTTGTCTTTTCTGATACTGAACCGATATTTTAGTGATACATTTCTCTTGGATTTCAGGTCCATTATTTGTTGAAAGACATCGACCCAAAATTTCTTAAAAAGTTCGTGTCTGCGACAGATTCATTGTTTCTAAACTCTTTCCCTGTGACTCCAAACTGTCATCGTGTGACTGAAATGACACATTCATTTTCAAGTGGACAGCGTTTGGTCTTTGTAAGACTTGTCTTCTCTACTGCTCTATTGTTAGATCGAGGTTATTCAAATAGAAGGATATTTGAATGATATTTGTGTCTTTCTAATGTGTTTCTTTTGAACCTGTTTAACGATTTAAGGATGAAAGGACCAGGGCTTACAAGAAGCTGGTTGATTTCAGAGGGACAGCTAACGAGTTAGGTTCTCGTGTTCTCGATTGTCTCAAAATTTCCAAGGCAATTTTTAACATCTTTTTCTTTTGCCTTAATTGACTTAACTATTATGTGAAACTTTTGTTCTTGCGTATAATGACACTTCTTTTATGCAGCTTAGCCCAGAGAAGTTAGAAAGTAAAGATTTGATTTACCAGCAAAAGAAAATTAAGGATACTGCTACTAGCTCAAATGGTTTGAGATGGATCAAAGATGTTGTTCTTGGAAAGCGGAGTGACCATTGTTTCCGCATGGTTGTTGTTGGTGATCCAAACATTGAGTTAAGTGAAATCGGTATACCTTGTCATGTTGCAGAGCGGTTGCAAATATCCGAGCATCTAAGTTCTTGGAATATGAAGAAACTAAGCACTTCTTGTTACCTTCGTCTTGTTGAAAAGGGAGAGATTTTTGTTCGTCGTGAAGGTCGTCTGGTTCGTGTACGTCACGTTCTTGAACTTAGTATGGGGGATACAATATATAGGCCCCTAGCTGATGGGGATGTTGTATTGGTTAATCGACCTCCATCCATACATCAACACTCCCTTATTGCCCTATCTGTCAGGGTTCTTCCAGTCTCCTCAGTTCTTTCCTTAAACCCACTTTGTTGTTCTCCTTTCCGTGGAGATTTCGATGGCGATTGCCTTCACGGTTATGTTCCTCAATCACTTGAGGCCCGAGTGGAGCTTAGAGAGCTGGTTGCACTAGATAGACAGCTAGTCAATGGCCAAAGTGGTAGAAATCTGCTGTCACTTAGTCATGACAGTTTAACTGCCGCTCATTTAATTATGGAAGATGGAGTTTCTTTAAATCTTTTCCAAATTCAGCAGTTGCAAATGTTTGCTTTACATCAGTTGTTGCCCCCAGCAATTGTAAAAGCTCCTTCATTTAGAAGTTGTGCGTGGACAGGAAAACAATTATTCAGCATCTTCCTACCTCCTGATTTTGATTATTCTTCTCCGTCTCATCGTGTTCACATTAACAATGGAGAACTTCTATCTTCTGAAGGATCTTATTGGCTTCGCGACACTGGCAGAAACCCTTTTCAAGCTCTGATAGAACATTGTGAGGGAAGGACCCTGAACTACTTGCACATCGCTCAAAGAGTTCTTTGTGAATGGTTATCAATGAGGGGATTGAGTGTTTCACTATCAGATTTGTATCTCTCAGTCGATTCACACTCACATAAAAACATGATGGATGATATCTTTTGTGGGTTACAAGAAGCGGAGGAAACGTGTAATTTAATACAGCTGATGGTGGATTCACATAAAGATGTCCTTACTGGAGATGATGAAGGTAATCAACACGTTCTATCTATCGAAGTGGAGCATTTAAGTTATGAGAAGCAGAAATCTGCTGCTCTAAATCAAGCTTCGGTTGATGCTTTCAAGAGAGTTTTCCGTGAGATACAAAATCTTGTGTACAAGTATTCTGGAAAAGACAATTCACTTTTAACCATGTTCAAGGCTGGAAGCAAGGGTAATTTGTTGAAATTAGTTCAGCATAGCATGTGTCTTGGCTTGCAACACTCTTTGGTTACTCTGTCCTTCGGCCTTCCACATAAACTCTCATGTTCTTCATGGAATAGTCAAAAGATGCCTCGTTATATTCAGAAGGATGGTCTTGCTGACCGTACTCAGTCTTTCATTCCATATGCTGTGGTTGAAAATTCCTTTCTCTCAGGGCTTAATCCTTTCGAGTGTTTTGCTCATTCAGTGACAAATCGAGATAGCTCTTTCAGTGACAATGCTGAAGTTCCTGGGACTTTGACACGAAAACTAACATTTCTAATGCGGGATATATATAATGCATATGATCGAACAGTGAGGAATGCATATGGAAATCAACTGGTTCAGTTTTCTTATGATACTGATAGCTCCACGAGCATCTCTAATGAATTGGATGGTGAGAATAATAATACAAACCGTGATATCGGTGGTCAGCCTGTTGGTTCATTGGCTGCTTGTGCCCTTTCAGAAGCTGCATATAGTGCTTTGGACCAGCCAATTAGTCTACTTGAAACTTCTCCATTGCTAAACCTTAAGGTACTGCAGCCTGCTCCTCTCAATTTGACAATTTAAATCAAGGATTTGTATATATTTATAAACAAGTCTTTAGTTTCTAATATCAAGACTAATGATCTGTTATATATTTGATTTATGCAGAAAGTTCTGGAGTGTGGTTCAAAGAGGAATAGTCCCAAACAAACATTTTCCTTGTTCTTATTGGAGAAACTTTCGAAACGAAGTTATGGATTCGAGTATGGAGCTTTAGGAGTTAAGAACCATTTAGAAAGAGTAATATTTAAAGATATTGTGTCTAGTGTTATGATTATGTAAGTTATTATGTTTTCTTTGATTCTTTTTTGTTGTATTTTCGTTAGCTATGTTTATCTTTGGTTTGACGTTATTAACAACGTTTTCTACGATATTAGCTTCGCCCCAGAGCCCTCCCGGAAAAGACATTTTAGTCCATGGGTTTGCCACTTTCATGTATGCAAGGTAATGCAATCTAGAATATGTCTTTCATTATATGATGTATGTATATATTGTTTCAGTCCTCTATATCTGGTATCAACTCCGGCTCATCGATCCAGGAAATTTTGAAGAAAAGAAGATTGAAAATTAGTTCCGTCATCCATTCCCTTAATATGCGGTGCGACTCTGTGAGACAAGAAGCAAAAATAAATTTGCCCTTTTTGCACATATCTACCCAGTAAGTTCATTGTTCAATTGACACTTAGATCTCCCCTTTCATGTACTTGGGGGTTTTTTTTCCTTACCGAAGCAAGTTTAACAATTGTCATCTCACTTGAGCTGTATGTCGTTTTTTGGCTTGGCTATTAAGGGATTGTTCTCTAGCTGATTCATCGAGAGAAGATGGTGATACCGTGTGCTTAACTGTTACAATAGCTGAAAACACAAAAAACTCTTTCCTGCAATTAGATTTCATTCAAGATTTGCTGATTCATTTCCTTCTTGGTACAGTTATAAGAGGTCTGTACTGTTTGTTTCCTTCCAATATTTCGTTTTGACTTGATAATATTTCGTTAGATATAAATCTTATCATCGTTCGTTTTTTTTTTCTTGAATTTAGAGTTGTAGTTACAGTGTGTTAAAGACATCTTCAGTGATATGTTTTTGGGCTCTTTCCATATCTGGTTATAATTGGTATGCTTTAATTTTGGCAACCTGGTTCATTCTATTTGAATAATCTACAAGTCGAGAGCAAGTGAGGTTGTAGTGGGGTCGAGTTTTTGTAAAAGAGAGCTTTATTTTGAAAAGACCTAAGACAAGGCCAATTGTTCGGTTATTAATGGTTTGAGCTTTAGATGGAGGACGTGTTTGTGCGTGTGTATATCTATATATATGTATATATATATCACGTGCCATGCCTTATATTTGACGAGATTTAGATGATATTTACAATAAATTGCTAGGTTTCTCTTGATTTCTCATTGTTTCAACTGGCATAACTTTCTTGAAATATGAAGCGACTTTTTAAGTTGGTGTGTTGTACGCCTGAATATGGTTTTTATTTGGCTAGATTTACCAAAATTTTATGTTTTTTTCCGCTTTGGTTTTCGTATTAGGCTTTGCTGAGATTGACAAGGTAGATATCTCATGGAATGACCGACCAAAGGTACCAAAACCTCATTGTAAATCTCACGGGGAGCTCTACTTGCGAGTGACCATGTCGGGAGAAGGAACTTCTAGATTCTGGGCAACTCTTATGAATCATTGCCTCCCGATAATGGATTTGATTGATTGGTCTCGTAGTCATCCAGATAACATCCATAGTTTCTGCATGGCATATGGAATAGATTCTGGAAGGAATTACTTTCTCAATGTGAGTAATTTTCCGTCCTCTACTCTCGCTACAAGTTTGACCCGGTTCTTGCATCTGACCGTTCACGGGAACTTTTGAATGCTTCTAATTTAACCATTCAAAGATTTTAATGTAGAGTTTGGAGTCTGCAACATTGGATATCGGTAAAACAATACGTCATGAACATTTGCTGCTCGTTGCAAATACTCTTTCGGCTACCGGAGAGTTTGTTGGCTTAAATGTTAAAGGAGTATCACGGCAAAGGGAACATGCTTTGGTCAAAACGCCCTTTATGCAAGCTTGCTTCTCAGTTAGTTATCTAATTGCTCATATTCATTTGAATCCTCATGAGCAAGCAATGAATGTAAATTTAATCCGTACTCGGTTTTCTGCAGAGTCCTGGTGCTTCCTTTGTTAAAGCTGCTAAGGCTGGAATTAAGGACAGTCTTTCAGGAAGTCTAGACGCCTTGGCGTGGGGGAAAATACCTTCTATGGGAACCGGGGGACAGTTCGATATCCTTTACTCTGGGAAGGTGAGATTATAACATTATGGTGTCATATCACCGTGCCTGTTATTTGTTGCGGTACTTCAACGATCTAATGTTTTCTAATTGTTCTTACTCTTCAGGGGCACGAGCTTAATAAGCCCGTCGATGTTTATAATCTACTGGGTAGCCAAGGCATTTGTGAGAAGCCGAACGTGAAGATTGAATCTCTTGATAAAAACACTATATATGAGAAATATAGTGCTGTAGTGCATAAAAATGGTGGCTCTACCATTAAAGGACTTAAAAAGCTGGATAGTGTTTCTAAATCAATTTTAAGGGAGTTCTTGACATTGAATGATATTCAGAAGCTGTCTCATACATTGAGAAGCATTTTACGCAAGTTAGTTCTTCCTCTTTTTGCATTTCTTGGCTCGTACAATTGGCATCGTTATATTAGATGCGTCGTTTTTTCTATTTGGACATGTTCTAGATCGATTACACTTAGCTAGTATCTTAGAATTATTTCACTTGTGTGTATAGGTATTCTTTAAATGAAAGATTAAATGAAGTGGACAAGTCAACTTTGATGATGGCTTTATACTTTCATCCTCAAAGGGATGAAAAAATTGGCGTTGGAGCTCAGGATATAAAGGTATTTTCTATAGAACATCAAATTCTTTTAGCATATTTCATGTATCCCCCCTCCCTGCACCCCCGAACAATAATAAAATTATGAATAATTTAAGTAAAAACAATGAAAGAGGACACCGGGCAATGGTTTTTGTAACAGCCTAAGCCCACCTCTAGCAGATATTGTCCTTTTTGGGCTTTCTCTTTCGGGTTTCCCTTCAAAGTTTCTAAAATGCGTCTGTTAGGGAGAGGTTTCCACACCCTTATAAAGAATGGTTCGTTCTCCTCCCTAATTGATGTGGGATCTCAGTCTTGAACTCTTAGGGGTGCATGCTCCATTCTGTCGTACCGTTCATAACCCTTAACCATTCTTATCTTCCTCTGCTCTGGGCGATAGTCTTTGACTCTCGGGTTAGCATGCTCCATTCTGCCGAACCATTCTTGTCCGGGTTCCAGCAATTATGGACAGACTTTTCTTTGGAATTAAGAAGGGAGAACATCCTTGAAGGATGATAGAGATCTGGGATTGGGAGCTGCTGGGAACGGAGCATTCTCGAGCAGTCTAATCTGGATGCAAGGCACCGTATCTGCTCACTATTATTTTCTGCGAATAATGCAATTTTGCATTCTCCAGGATAAATATCAGAGGTGTGTCTCAGTGGTAGGTTCAAATAGGTTAGATAGGACGGAGCCCAAGTTGGCATGGCGAAGTCATCTCACTGATTGCCTCTCCGACTCGGAGCAATTAAGCCTTGAGATCTCTTACTTGTTTATGTTCACTGAAAAGTCTAAGATATTGTTTGAAGAATAAGTACACCTTGATCATATTATTTACCATACCCACAACTCAAGGACCATGAGGGTGCAAGGTATGCACTTCACATGGCCATATGATGAAGAGATGAAACACGTCTTGAGCTAACTGGATGTATAGTGCCTATGAGAAAATTGTTGACAATTGTTGAGAGGGAGTCCCACATTGACTAATTTAGGGAATGATCATGGGTTTATAAGTAAGGAATACATCTCCATAGGTATGAGACCTTTTGGGGAAGCCCAAAGCAAAGCCAAAGTGACAATATCATACCATTGTGGAGATCCTTGATTCCTAATAGAAATTTGTTCCAATCCTACACGACATGGTATGTCATCTTGGCCCCCTCTAGATCCTTGCTTGGCTGTAGAAAGAGCATGCCCACAGATTGAATGAGTATAGAATTTGATTACGCAGGAAACACCCTGAAGGATGCTTCTTCTTACAAGGATGGATTTTAGGATTTAAAAAAAAGATAGTTCTAGTTACCTGGTACCTGTTTATTCATGTAATTTATCTAATCATTTCTCGACCCGCAACCTGGTTGTTCTTTTGCAGGTTGGTAGCCACTCAAAATACTCGAATACGCGTTGCTTCATATTGGTACGATCAGACGGGACGACAGAAGATTTCTCTTATCACAAGTGTGTTTTAGGTGCGTTGGAGATCATTGCGCCTCATCGAGTGAAGGCATATCAGTCAAAATGGATGCAAGACAAGTTCGAGTGATCTATCTATCATGTACATATTCCGAAGCTCAGATATGTACAGTATCCAGGTTTTGAATCTAAACTATCCTGAAGTTGTTGGGTGTTTGAACGCAGTTGAGTCGTCGTCTTTGACTTAGCCTATTGTCTTTATATCGAGCAATGCCGAAAAAAGGTATTGGTTGTTGCTTTGCTGACCTGTTTCGCTATTAGTATTGATGAATTTTCACTTCGACTGTGACGGTGGTGGTTCATCAGATTTTGAATATGTATACGCTGTTATGATCTCGCCGATAGGTAGTAGTATAGGATGTAGAATATATGTGAAGAGGAAGAGAGCTAGATGAGCAGTTAACCATTGTGTTTCTTTGTTTCTTTTTGTCAATAAACATAATAAACTCTCATCTGTATATTTGAATTCCTGAGCTTCTTTTTGCTCCTTCTACTCTAACTTATTTGCCTTCAATATAAGCACTAGGCAAAAAAGATCT

mRNA sequence

GAACAGGGAAGAGGCCGGCGTCGATACTCAATCCCTTTCCGCCCCCTTCTTCTCTTCCCTCTTCCCTCTTCCCTCTTCCCTCTTCCCCGTTTTGGAACAACCCCTTTTCCCTTTCTTTCCTTCACTTTTCCATAACGGTTTTACAGCTCCCCCACCCCCTTTGCCAGAACTGTAAACCAAGAAATTTCCTTCCTGGGGTTGCGCGCTTTGATTCCTCTGAAGCCCCACATTGAACCACCTCCTGCTCTTTCTCTCACTTTCGACTGCTGAATTGGCTTTGCTCGGCGAAGAAGTTGTATGAATTTCGCTGGAGGCTTTCGATCGGGCCAGAAGGCGATGAACCATATGGAAGACGAACAGGATAGCGAGCTACAAATTCCATCTGGTGTCCTTGTTGGCGTAAACTTTAGTGTCTCAACTCAGCAAGATATGGAGAATATAGCAGTGATAAACATTGAGGCAGCCTGCGAGGTGTCTGATCCTAAGTTGGGACTTCCTAATCCATCTTATCAGTGCACTACATGTGGTGCGAGCGTTCTCAAATGTTGTGAAGGTCATTTTGGGGCTATCAAATTTCCGTATACTATAATCCATCCGTATTTTCTCTCGGAAGTTGCACAAGTGTTGAATAAAGTTTGTCCAGGGTGTAAATCTATCCGGCGGGAACTATGGGGCAAGGTTGAGGATCCCACGTCTGATTTTCATCGACCTAAAGGTTGCAGATATTGTTTTGGAAGTCTTAAGGATTGGTACCCGCCTATGAGGTTTAAGCTTTCAACTACTGATATGTTCAGAAAAAGTATGATTATGGTGGAAGTGAAAGAAAACATGTCGAAGAAGTACCAAAAGAGGGTGGCTAGGGGAGGATTGCCTCCTGATTATTGGAATTTTATCCCGAAAGATGAACAGCAGGAGGAAAGTTATTGTAGACCGAACAGGAAAGTCCTAACACATGCTCAGGTCCATTATTTGTTGAAAGACATCGACCCAAAATTTCTTAAAAAGTTCGTGTCTGCGACAGATTCATTGTTTCTAAACTCTTTCCCTGTGACTCCAAACTGTCATCGTGTGACTGAAATGACACATTCATTTTCAAGTGGACAGCGTTTGGTCTTTGATGAAAGGACCAGGGCTTACAAGAAGCTGGTTGATTTCAGAGGGACAGCTAACGAGTTAGGTTCTCGTGTTCTCGATTGTCTCAAAATTTCCAAGCTTAGCCCAGAGAAGTTAGAAAGTAAAGATTTGATTTACCAGCAAAAGAAAATTAAGGATACTGCTACTAGCTCAAATGGTTTGAGATGGATCAAAGATGTTGTTCTTGGAAAGCGGAGTGACCATTGTTTCCGCATGGTTGTTGTTGGTGATCCAAACATTGAGTTAAGTGAAATCGGTATACCTTGTCATGTTGCAGAGCGGTTGCAAATATCCGAGCATCTAAGTTCTTGGAATATGAAGAAACTAAGCACTTCTTGTTACCTTCGTCTTGTTGAAAAGGGAGAGATTTTTGTTCGTCGTGAAGGTCGTCTGGTTCGTGTACGTCACGTTCTTGAACTTAGTATGGGGGATACAATATATAGGCCCCTAGCTGATGGGGATGTTGTATTGGTTAATCGACCTCCATCCATACATCAACACTCCCTTATTGCCCTATCTGTCAGGGTTCTTCCAGTCTCCTCAGTTCTTTCCTTAAACCCACTTTGTTGTTCTCCTTTCCGTGGAGATTTCGATGGCGATTGCCTTCACGGTTATGTTCCTCAATCACTTGAGGCCCGAGTGGAGCTTAGAGAGCTGGTTGCACTAGATAGACAGCTAGTCAATGGCCAAAGTGGTAGAAATCTGCTGTCACTTAGTCATGACAGTTTAACTGCCGCTCATTTAATTATGGAAGATGGAGTTTCTTTAAATCTTTTCCAAATTCAGCAGTTGCAAATGTTTGCTTTACATCAGTTGTTGCCCCCAGCAATTGTAAAAGCTCCTTCATTTAGAAGTTGTGCGTGGACAGGAAAACAATTATTCAGCATCTTCCTACCTCCTGATTTTGATTATTCTTCTCCGTCTCATCGTGTTCACATTAACAATGGAGAACTTCTATCTTCTGAAGGATCTTATTGGCTTCGCGACACTGGCAGAAACCCTTTTCAAGCTCTGATAGAACATTGTGAGGGAAGGACCCTGAACTACTTGCACATCGCTCAAAGAGTTCTTTGTGAATGGTTATCAATGAGGGGATTGAGTGTTTCACTATCAGATTTGTATCTCTCAGTCGATTCACACTCACATAAAAACATGATGGATGATATCTTTTGTGGGTTACAAGAAGCGGAGGAAACGTGTAATTTAATACAGCTGATGGTGGATTCACATAAAGATGTCCTTACTGGAGATGATGAAGGTAATCAACACGTTCTATCTATCGAAGTGGAGCATTTAAGTTATGAGAAGCAGAAATCTGCTGCTCTAAATCAAGCTTCGGTTGATGCTTTCAAGAGAGTTTTCCGTGAGATACAAAATCTTGTGTACAAGTATTCTGGAAAAGACAATTCACTTTTAACCATGTTCAAGGCTGGAAGCAAGGGTAATTTGTTGAAATTAGTTCAGCATAGCATGTGTCTTGGCTTGCAACACTCTTTGGTTACTCTGTCCTTCGGCCTTCCACATAAACTCTCATGTTCTTCATGGAATAGTCAAAAGATGCCTCGTTATATTCAGAAGGATGGTCTTGCTGACCGTACTCAGTCTTTCATTCCATATGCTGTGGTTGAAAATTCCTTTCTCTCAGGGCTTAATCCTTTCGAGTGTTTTGCTCATTCAGTGACAAATCGAGATAGCTCTTTCAGTGACAATGCTGAAGTTCCTGGGACTTTGACACGAAAACTAACATTTCTAATGCGGGATATATATAATGCATATGATCGAACAGTGAGGAATGCATATGGAAATCAACTGGTTCAGTTTTCTTATGATACTGATAGCTCCACGAGCATCTCTAATGAATTGGATGGTGAGAATAATAATACAAACCGTGATATCGGTGGTCAGCCTGTTGGTTCATTGGCTGCTTGTGCCCTTTCAGAAGCTGCATATAGTGCTTTGGACCAGCCAATTAGTCTACTTGAAACTTCTCCATTGCTAAACCTTAAGAAAGTTCTGGAGTGTGGTTCAAAGAGGAATAGTCCCAAACAAACATTTTCCTTGTTCTTATTGGAGAAACTTTCGAAACGAAGTTATGGATTCGAGTATGGAGCTTTAGGAGTTAAGAACCATTTAGAAAGAGTAATATTTAAAGATATTGTGTCTAGTGTTATGATTATCTTCGCCCCAGAGCCCTCCCGGAAAAGACATTTTAGTCCATGGGTTTGCCACTTTCATGTATGCAAGGAAATTTTGAAGAAAAGAAGATTGAAAATTAGTTCCGTCATCCATTCCCTTAATATGCGGTGCGACTCTGTGAGACAAGAAGCAAAAATAAATTTGCCCTTTTTGCACATATCTACCCAGGATTGTTCTCTAGCTGATTCATCGAGAGAAGATGGTGATACCGTGTGCTTAACTGTTACAATAGCTGAAAACACAAAAAACTCTTTCCTGCAATTAGATTTCATTCAAGATTTGCTGATTCATTTCCTTCTTGGTACAGTTATAAGAGGCTTTGCTGAGATTGACAAGGTAGATATCTCATGGAATGACCGACCAAAGGTACCAAAACCTCATTGTAAATCTCACGGGGAGCTCTACTTGCGAGTGACCATGTCGGGAGAAGGAACTTCTAGATTCTGGGCAACTCTTATGAATCATTGCCTCCCGATAATGGATTTGATTGATTGGTCTCGTAGTCATCCAGATAACATCCATAGTTTCTGCATGGCATATGGAATAGATTCTGGAAGGAATTACTTTCTCAATAGTTTGGAGTCTGCAACATTGGATATCGGTAAAACAATACGTCATGAACATTTGCTGCTCGTTGCAAATACTCTTTCGGCTACCGGAGAGTTTGTTGGCTTAAATGTTAAAGGAGTATCACGGCAAAGGGAACATGCTTTGGTCAAAACGCCCTTTATGCAAGCTTGCTTCTCAAGTCCTGGTGCTTCCTTTGTTAAAGCTGCTAAGGCTGGAATTAAGGACAGTCTTTCAGGAAGTCTAGACGCCTTGGCGTGGGGGAAAATACCTTCTATGGGAACCGGGGGACAGTTCGATATCCTTTACTCTGGGAAGGGGCACGAGCTTAATAAGCCCGTCGATGTTTATAATCTACTGGGTAGCCAAGGCATTTGTGAGAAGCCGAACGTGAAGATTGAATCTCTTGATAAAAACACTATATATGAGAAATATAGTGCTGTAGTGCATAAAAATGGTGGCTCTACCATTAAAGGACTTAAAAAGCTGGATAGTGTTTCTAAATCAATTTTAAGGGAGTTCTTGACATTGAATGATATTCAGAAGCTGTCTCATACATTGAGAAGCATTTTACGCAAGTATTCTTTAAATGAAAGATTAAATGAAGTGGACAAGTCAACTTTGATGATGGCTTTATACTTTCATCCTCAAAGGGATGAAAAAATTGGCGTTGGAGCTCAGGATATAAAGGTTGGTAGCCACTCAAAATACTCGAATACGCGTTGCTTCATATTGGTACGATCAGACGGGACGACAGAAGATTTCTCTTATCACAAGTGTGTTTTAGGTGCGTTGGAGATCATTGCGCCTCATCGAGTGAAGGCATATCAGTCAAAATGGATGCAAGACAAGTTCGAGTGATCTATCTATCATGTACATATTCCGAAGCTCAGATATGTACAGTATCCAGGTTTTGAATCTAAACTATCCTGAAGTTGTTGGGTGTTTGAACGCAGTTGAGTCGTCGTCTTTGACTTAGCCTATTGTCTTTATATCGAGCAATGCCGAAAAAAGGTATTGGTTGTTGCTTTGCTGACCTGTTTCGCTATTAGTATTGATGAATTTTCACTTCGACTGTGACGGTGGTGGTTCATCAGATTTTGAATATGTATACGCTGTTATGATCTCGCCGATAGGTAGTAGTATAGGATGTAGAATATATGTGAAGAGGAAGAGAGCTAGATGAGCAGTTAACCATTGTGTTTCTTTGTTTCTTTTTGTCAATAAACATAATAAACTCTCATCTGTATATTTGAATTCCTGAGCTTCTTTTTGCTCCTTCTACTCTAACTTATTTGCCTTCAATATAAGCACTAGGCAAAAAAGATCT

Coding sequence (CDS)

ATGAATTTCGCTGGAGGCTTTCGATCGGGCCAGAAGGCGATGAACCATATGGAAGACGAACAGGATAGCGAGCTACAAATTCCATCTGGTGTCCTTGTTGGCGTAAACTTTAGTGTCTCAACTCAGCAAGATATGGAGAATATAGCAGTGATAAACATTGAGGCAGCCTGCGAGGTGTCTGATCCTAAGTTGGGACTTCCTAATCCATCTTATCAGTGCACTACATGTGGTGCGAGCGTTCTCAAATGTTGTGAAGGTCATTTTGGGGCTATCAAATTTCCGTATACTATAATCCATCCGTATTTTCTCTCGGAAGTTGCACAAGTGTTGAATAAAGTTTGTCCAGGGTGTAAATCTATCCGGCGGGAACTATGGGGCAAGGTTGAGGATCCCACGTCTGATTTTCATCGACCTAAAGGTTGCAGATATTGTTTTGGAAGTCTTAAGGATTGGTACCCGCCTATGAGGTTTAAGCTTTCAACTACTGATATGTTCAGAAAAAGTATGATTATGGTGGAAGTGAAAGAAAACATGTCGAAGAAGTACCAAAAGAGGGTGGCTAGGGGAGGATTGCCTCCTGATTATTGGAATTTTATCCCGAAAGATGAACAGCAGGAGGAAAGTTATTGTAGACCGAACAGGAAAGTCCTAACACATGCTCAGGTCCATTATTTGTTGAAAGACATCGACCCAAAATTTCTTAAAAAGTTCGTGTCTGCGACAGATTCATTGTTTCTAAACTCTTTCCCTGTGACTCCAAACTGTCATCGTGTGACTGAAATGACACATTCATTTTCAAGTGGACAGCGTTTGGTCTTTGATGAAAGGACCAGGGCTTACAAGAAGCTGGTTGATTTCAGAGGGACAGCTAACGAGTTAGGTTCTCGTGTTCTCGATTGTCTCAAAATTTCCAAGCTTAGCCCAGAGAAGTTAGAAAGTAAAGATTTGATTTACCAGCAAAAGAAAATTAAGGATACTGCTACTAGCTCAAATGGTTTGAGATGGATCAAAGATGTTGTTCTTGGAAAGCGGAGTGACCATTGTTTCCGCATGGTTGTTGTTGGTGATCCAAACATTGAGTTAAGTGAAATCGGTATACCTTGTCATGTTGCAGAGCGGTTGCAAATATCCGAGCATCTAAGTTCTTGGAATATGAAGAAACTAAGCACTTCTTGTTACCTTCGTCTTGTTGAAAAGGGAGAGATTTTTGTTCGTCGTGAAGGTCGTCTGGTTCGTGTACGTCACGTTCTTGAACTTAGTATGGGGGATACAATATATAGGCCCCTAGCTGATGGGGATGTTGTATTGGTTAATCGACCTCCATCCATACATCAACACTCCCTTATTGCCCTATCTGTCAGGGTTCTTCCAGTCTCCTCAGTTCTTTCCTTAAACCCACTTTGTTGTTCTCCTTTCCGTGGAGATTTCGATGGCGATTGCCTTCACGGTTATGTTCCTCAATCACTTGAGGCCCGAGTGGAGCTTAGAGAGCTGGTTGCACTAGATAGACAGCTAGTCAATGGCCAAAGTGGTAGAAATCTGCTGTCACTTAGTCATGACAGTTTAACTGCCGCTCATTTAATTATGGAAGATGGAGTTTCTTTAAATCTTTTCCAAATTCAGCAGTTGCAAATGTTTGCTTTACATCAGTTGTTGCCCCCAGCAATTGTAAAAGCTCCTTCATTTAGAAGTTGTGCGTGGACAGGAAAACAATTATTCAGCATCTTCCTACCTCCTGATTTTGATTATTCTTCTCCGTCTCATCGTGTTCACATTAACAATGGAGAACTTCTATCTTCTGAAGGATCTTATTGGCTTCGCGACACTGGCAGAAACCCTTTTCAAGCTCTGATAGAACATTGTGAGGGAAGGACCCTGAACTACTTGCACATCGCTCAAAGAGTTCTTTGTGAATGGTTATCAATGAGGGGATTGAGTGTTTCACTATCAGATTTGTATCTCTCAGTCGATTCACACTCACATAAAAACATGATGGATGATATCTTTTGTGGGTTACAAGAAGCGGAGGAAACGTGTAATTTAATACAGCTGATGGTGGATTCACATAAAGATGTCCTTACTGGAGATGATGAAGGTAATCAACACGTTCTATCTATCGAAGTGGAGCATTTAAGTTATGAGAAGCAGAAATCTGCTGCTCTAAATCAAGCTTCGGTTGATGCTTTCAAGAGAGTTTTCCGTGAGATACAAAATCTTGTGTACAAGTATTCTGGAAAAGACAATTCACTTTTAACCATGTTCAAGGCTGGAAGCAAGGGTAATTTGTTGAAATTAGTTCAGCATAGCATGTGTCTTGGCTTGCAACACTCTTTGGTTACTCTGTCCTTCGGCCTTCCACATAAACTCTCATGTTCTTCATGGAATAGTCAAAAGATGCCTCGTTATATTCAGAAGGATGGTCTTGCTGACCGTACTCAGTCTTTCATTCCATATGCTGTGGTTGAAAATTCCTTTCTCTCAGGGCTTAATCCTTTCGAGTGTTTTGCTCATTCAGTGACAAATCGAGATAGCTCTTTCAGTGACAATGCTGAAGTTCCTGGGACTTTGACACGAAAACTAACATTTCTAATGCGGGATATATATAATGCATATGATCGAACAGTGAGGAATGCATATGGAAATCAACTGGTTCAGTTTTCTTATGATACTGATAGCTCCACGAGCATCTCTAATGAATTGGATGGTGAGAATAATAATACAAACCGTGATATCGGTGGTCAGCCTGTTGGTTCATTGGCTGCTTGTGCCCTTTCAGAAGCTGCATATAGTGCTTTGGACCAGCCAATTAGTCTACTTGAAACTTCTCCATTGCTAAACCTTAAGAAAGTTCTGGAGTGTGGTTCAAAGAGGAATAGTCCCAAACAAACATTTTCCTTGTTCTTATTGGAGAAACTTTCGAAACGAAGTTATGGATTCGAGTATGGAGCTTTAGGAGTTAAGAACCATTTAGAAAGAGTAATATTTAAAGATATTGTGTCTAGTGTTATGATTATCTTCGCCCCAGAGCCCTCCCGGAAAAGACATTTTAGTCCATGGGTTTGCCACTTTCATGTATGCAAGGAAATTTTGAAGAAAAGAAGATTGAAAATTAGTTCCGTCATCCATTCCCTTAATATGCGGTGCGACTCTGTGAGACAAGAAGCAAAAATAAATTTGCCCTTTTTGCACATATCTACCCAGGATTGTTCTCTAGCTGATTCATCGAGAGAAGATGGTGATACCGTGTGCTTAACTGTTACAATAGCTGAAAACACAAAAAACTCTTTCCTGCAATTAGATTTCATTCAAGATTTGCTGATTCATTTCCTTCTTGGTACAGTTATAAGAGGCTTTGCTGAGATTGACAAGGTAGATATCTCATGGAATGACCGACCAAAGGTACCAAAACCTCATTGTAAATCTCACGGGGAGCTCTACTTGCGAGTGACCATGTCGGGAGAAGGAACTTCTAGATTCTGGGCAACTCTTATGAATCATTGCCTCCCGATAATGGATTTGATTGATTGGTCTCGTAGTCATCCAGATAACATCCATAGTTTCTGCATGGCATATGGAATAGATTCTGGAAGGAATTACTTTCTCAATAGTTTGGAGTCTGCAACATTGGATATCGGTAAAACAATACGTCATGAACATTTGCTGCTCGTTGCAAATACTCTTTCGGCTACCGGAGAGTTTGTTGGCTTAAATGTTAAAGGAGTATCACGGCAAAGGGAACATGCTTTGGTCAAAACGCCCTTTATGCAAGCTTGCTTCTCAAGTCCTGGTGCTTCCTTTGTTAAAGCTGCTAAGGCTGGAATTAAGGACAGTCTTTCAGGAAGTCTAGACGCCTTGGCGTGGGGGAAAATACCTTCTATGGGAACCGGGGGACAGTTCGATATCCTTTACTCTGGGAAGGGGCACGAGCTTAATAAGCCCGTCGATGTTTATAATCTACTGGGTAGCCAAGGCATTTGTGAGAAGCCGAACGTGAAGATTGAATCTCTTGATAAAAACACTATATATGAGAAATATAGTGCTGTAGTGCATAAAAATGGTGGCTCTACCATTAAAGGACTTAAAAAGCTGGATAGTGTTTCTAAATCAATTTTAAGGGAGTTCTTGACATTGAATGATATTCAGAAGCTGTCTCATACATTGAGAAGCATTTTACGCAAGTATTCTTTAAATGAAAGATTAAATGAAGTGGACAAGTCAACTTTGATGATGGCTTTATACTTTCATCCTCAAAGGGATGAAAAAATTGGCGTTGGAGCTCAGGATATAAAGGTTGGTAGCCACTCAAAATACTCGAATACGCGTTGCTTCATATTGGTACGATCAGACGGGACGACAGAAGATTTCTCTTATCACAAGTGTGTTTTAGGTGCGTTGGAGATCATTGCGCCTCATCGAGTGAAGGCATATCAGTCAAAATGGATGCAAGACAAGTTCGAGTGA

Protein sequence

MNFAGGFRSGQKAMNHMEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVSDPKLGLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRRELWGKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKRVARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSATDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDCLKISKLSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELSMGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQIQQLQMFALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHINNGELLSSEGSYWLRDTGRNPFQALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHSHKNMMDDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAALNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFGLPHKLSCSSWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVTNRDSSFSDNAEVPGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDSSTSISNELDGENNNTNRDIGGQPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSPKQTFSLFLLEKLSKRSYGFEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWVCHFHVCKEILKKRRLKISSVIHSLNMRCDSVRQEAKINLPFLHISTQDCSLADSSREDGDTVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCKSHGELYLRVTMSGEGTSRFWATLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYFLNSLESATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFSSPGASFVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLLGSQGICEKPNVKIESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDIQKLSHTLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNTRCFILVRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE
Homology
BLAST of Carg21647 vs. NCBI nr
Match: KAG7028326.1 (DNA-directed RNA polymerase IV subunit 1 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 3011.5 bits (7806), Expect = 0.0e+00
Identity = 1486/1486 (100.00%), Postives = 1486/1486 (100.00%), Query Frame = 0

Query: 1    MNFAGGFRSGQKAMNHMEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVS 60
            MNFAGGFRSGQKAMNHMEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVS
Sbjct: 1    MNFAGGFRSGQKAMNHMEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVS 60

Query: 61   DPKLGLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSI 120
            DPKLGLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSI
Sbjct: 61   DPKLGLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSI 120

Query: 121  RRELWGKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSK 180
            RRELWGKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSK
Sbjct: 121  RRELWGKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSK 180

Query: 181  KYQKRVARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSA 240
            KYQKRVARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSA
Sbjct: 181  KYQKRVARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSA 240

Query: 241  TDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDC 300
            TDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDC
Sbjct: 241  TDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDC 300

Query: 301  LKISKLSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIE 360
            LKISKLSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIE
Sbjct: 301  LKISKLSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIE 360

Query: 361  LSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELS 420
            LSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELS
Sbjct: 361  LSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELS 420

Query: 421  MGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDC 480
            MGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDC
Sbjct: 421  MGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDC 480

Query: 481  LHGYVPQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQI 540
            LHGYVPQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQI
Sbjct: 481  LHGYVPQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQI 540

Query: 541  QQLQMFALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHINNGELLSS 600
            QQLQMFALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHINNGELLSS
Sbjct: 541  QQLQMFALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHINNGELLSS 600

Query: 601  EGSYWLRDTGRNPFQALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHS 660
            EGSYWLRDTGRNPFQALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHS
Sbjct: 601  EGSYWLRDTGRNPFQALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHS 660

Query: 661  HKNMMDDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAA 720
            HKNMMDDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAA
Sbjct: 661  HKNMMDDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAA 720

Query: 721  LNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVT 780
            LNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVT
Sbjct: 721  LNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVT 780

Query: 781  LSFGLPHKLSCSSWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVT 840
            LSFGLPHKLSCSSWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVT
Sbjct: 781  LSFGLPHKLSCSSWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVT 840

Query: 841  NRDSSFSDNAEVPGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDSSTSISNEL 900
            NRDSSFSDNAEVPGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDSSTSISNEL
Sbjct: 841  NRDSSFSDNAEVPGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDSSTSISNEL 900

Query: 901  DGENNNTNRDIGGQPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSP 960
            DGENNNTNRDIGGQPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSP
Sbjct: 901  DGENNNTNRDIGGQPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSP 960

Query: 961  KQTFSLFLLEKLSKRSYGFEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWV 1020
            KQTFSLFLLEKLSKRSYGFEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWV
Sbjct: 961  KQTFSLFLLEKLSKRSYGFEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWV 1020

Query: 1021 CHFHVCKEILKKRRLKISSVIHSLNMRCDSVRQEAKINLPFLHISTQDCSLADSSREDGD 1080
            CHFHVCKEILKKRRLKISSVIHSLNMRCDSVRQEAKINLPFLHISTQDCSLADSSREDGD
Sbjct: 1021 CHFHVCKEILKKRRLKISSVIHSLNMRCDSVRQEAKINLPFLHISTQDCSLADSSREDGD 1080

Query: 1081 TVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCK 1140
            TVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCK
Sbjct: 1081 TVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCK 1140

Query: 1141 SHGELYLRVTMSGEGTSRFWATLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYF 1200
            SHGELYLRVTMSGEGTSRFWATLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYF
Sbjct: 1141 SHGELYLRVTMSGEGTSRFWATLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYF 1200

Query: 1201 LNSLESATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFS 1260
            LNSLESATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFS
Sbjct: 1201 LNSLESATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFS 1260

Query: 1261 SPGASFVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLL 1320
            SPGASFVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLL
Sbjct: 1261 SPGASFVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLL 1320

Query: 1321 GSQGICEKPNVKIESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDI 1380
            GSQGICEKPNVKIESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDI
Sbjct: 1321 GSQGICEKPNVKIESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDI 1380

Query: 1381 QKLSHTLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNT 1440
            QKLSHTLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNT
Sbjct: 1381 QKLSHTLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNT 1440

Query: 1441 RCFILVRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE 1487
            RCFILVRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE
Sbjct: 1441 RCFILVRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE 1486

BLAST of Carg21647 vs. NCBI nr
Match: KAG6596800.1 (DNA-directed RNA polymerase IV subunit 1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 3003.0 bits (7784), Expect = 0.0e+00
Identity = 1481/1486 (99.66%), Postives = 1482/1486 (99.73%), Query Frame = 0

Query: 1    MNFAGGFRSGQKAMNHMEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVS 60
            MNFAGGFRSGQKAMNHMEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVS
Sbjct: 1    MNFAGGFRSGQKAMNHMEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVS 60

Query: 61   DPKLGLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSI 120
            DPKLGLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSI
Sbjct: 61   DPKLGLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSI 120

Query: 121  RRELWGKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSK 180
            RRELWGKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSK
Sbjct: 121  RRELWGKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSK 180

Query: 181  KYQKRVARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSA 240
            KYQKRVARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSA
Sbjct: 181  KYQKRVARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSA 240

Query: 241  TDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDC 300
            TDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDC
Sbjct: 241  TDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDC 300

Query: 301  LKISKLSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIE 360
            LKISKLSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIE
Sbjct: 301  LKISKLSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIE 360

Query: 361  LSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELS 420
            LSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELS
Sbjct: 361  LSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELS 420

Query: 421  MGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDC 480
            MGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDC
Sbjct: 421  MGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDC 480

Query: 481  LHGYVPQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQI 540
            LHGYVPQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQI
Sbjct: 481  LHGYVPQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQI 540

Query: 541  QQLQMFALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHINNGELLSS 600
            QQLQMFALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHI NGELLSS
Sbjct: 541  QQLQMFALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHIKNGELLSS 600

Query: 601  EGSYWLRDTGRNPFQALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHS 660
            EGSYWLRDTGRNPFQALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHS
Sbjct: 601  EGSYWLRDTGRNPFQALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHS 660

Query: 661  HKNMMDDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAA 720
            HKNMMDDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAA
Sbjct: 661  HKNMMDDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAA 720

Query: 721  LNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVT 780
            LNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVT
Sbjct: 721  LNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVT 780

Query: 781  LSFGLPHKLSCSSWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVT 840
            LSFGLPHKLSCSSWNSQKMPRYIQKDGL DRTQSFIPYAVVENSFLSGLNPFECFAHSVT
Sbjct: 781  LSFGLPHKLSCSSWNSQKMPRYIQKDGLPDRTQSFIPYAVVENSFLSGLNPFECFAHSVT 840

Query: 841  NRDSSFSDNAEVPGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDSSTSISNEL 900
            NRDSSFSDNAEVPGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDSSTS SNEL
Sbjct: 841  NRDSSFSDNAEVPGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDSSTSTSNEL 900

Query: 901  DGENNNTNRDIGGQPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSP 960
            DGENNNTNRDIGGQPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSP
Sbjct: 901  DGENNNTNRDIGGQPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSP 960

Query: 961  KQTFSLFLLEKLSKRSYGFEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWV 1020
            KQTFSLFLLEKLSKRSYGFEYGALGVKNHLERVIFKDIVSSVM+IFAPEPSRKRHFSPWV
Sbjct: 961  KQTFSLFLLEKLSKRSYGFEYGALGVKNHLERVIFKDIVSSVMVIFAPEPSRKRHFSPWV 1020

Query: 1021 CHFHVCKEILKKRRLKISSVIHSLNMRCDSVRQEAKINLPFLHISTQDCSLADSSREDGD 1080
            CHFHVCKEILKKRRLKISSVIHSLNMRCDSVRQEAKINLPFLHISTQDCSLADSSREDGD
Sbjct: 1021 CHFHVCKEILKKRRLKISSVIHSLNMRCDSVRQEAKINLPFLHISTQDCSLADSSREDGD 1080

Query: 1081 TVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCK 1140
            TVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCK
Sbjct: 1081 TVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCK 1140

Query: 1141 SHGELYLRVTMSGEGTSRFWATLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYF 1200
            SHGELYLRVTMSGEG SRFWATLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYF
Sbjct: 1141 SHGELYLRVTMSGEGNSRFWATLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYF 1200

Query: 1201 LNSLESATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFS 1260
            LNSLESATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFS
Sbjct: 1201 LNSLESATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFS 1260

Query: 1261 SPGASFVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLL 1320
            SPGASFVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLL
Sbjct: 1261 SPGASFVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLL 1320

Query: 1321 GSQGICEKPNVKIESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDI 1380
            GSQGICEKPNVKIESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDI
Sbjct: 1321 GSQGICEKPNVKIESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDI 1380

Query: 1381 QKLSHTLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNT 1440
            QKLSHTLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNT
Sbjct: 1381 QKLSHTLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNT 1440

Query: 1441 RCFILVRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE 1487
            RCFILVRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE
Sbjct: 1441 RCFILVRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE 1486

BLAST of Carg21647 vs. NCBI nr
Match: XP_022938810.1 (DNA-directed RNA polymerase IV subunit 1 isoform X1 [Cucurbita moschata])

HSP 1 Score: 3001.1 bits (7779), Expect = 0.0e+00
Identity = 1481/1486 (99.66%), Postives = 1481/1486 (99.66%), Query Frame = 0

Query: 1    MNFAGGFRSGQKAMNHMEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVS 60
            MNFAGGFRSGQKAMNHMEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVS
Sbjct: 1    MNFAGGFRSGQKAMNHMEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVS 60

Query: 61   DPKLGLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSI 120
            DPKLGLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSI
Sbjct: 61   DPKLGLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSI 120

Query: 121  RRELWGKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSK 180
            RRELWGKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSK
Sbjct: 121  RRELWGKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSK 180

Query: 181  KYQKRVARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSA 240
            KYQKRVARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSA
Sbjct: 181  KYQKRVARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSA 240

Query: 241  TDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDC 300
            TDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDC
Sbjct: 241  TDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDC 300

Query: 301  LKISKLSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIE 360
            LKISKLSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIE
Sbjct: 301  LKISKLSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIE 360

Query: 361  LSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELS 420
            LSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELS
Sbjct: 361  LSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELS 420

Query: 421  MGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDC 480
            MGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDC
Sbjct: 421  MGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDC 480

Query: 481  LHGYVPQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQI 540
            LHGYVPQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQI
Sbjct: 481  LHGYVPQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQI 540

Query: 541  QQLQMFALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHINNGELLSS 600
            QQLQMFALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHI NGELLSS
Sbjct: 541  QQLQMFALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHIKNGELLSS 600

Query: 601  EGSYWLRDTGRNPFQALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHS 660
            EGSYWLRDTGRNPFQALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHS
Sbjct: 601  EGSYWLRDTGRNPFQALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHS 660

Query: 661  HKNMMDDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAA 720
            HKNMMDDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAA
Sbjct: 661  HKNMMDDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAA 720

Query: 721  LNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVT 780
            LNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVT
Sbjct: 721  LNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVT 780

Query: 781  LSFGLPHKLSCSSWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVT 840
            LSFGLPHKLSCSSWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVT
Sbjct: 781  LSFGLPHKLSCSSWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVT 840

Query: 841  NRDSSFSDNAEVPGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDSSTSISNEL 900
            NRDSSFSDNAEVPGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDS  S SNEL
Sbjct: 841  NRDSSFSDNAEVPGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDSPMSTSNEL 900

Query: 901  DGENNNTNRDIGGQPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSP 960
            DGENNNTNRDIGGQPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSP
Sbjct: 901  DGENNNTNRDIGGQPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSP 960

Query: 961  KQTFSLFLLEKLSKRSYGFEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWV 1020
            KQTFSLFLLEKLSKRSYGFEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWV
Sbjct: 961  KQTFSLFLLEKLSKRSYGFEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWV 1020

Query: 1021 CHFHVCKEILKKRRLKISSVIHSLNMRCDSVRQEAKINLPFLHISTQDCSLADSSREDGD 1080
            CHFHVCKEILKKRRLKISSVIHSLNMRCDSVRQEAKINLPFLHISTQDCSLADSSREDGD
Sbjct: 1021 CHFHVCKEILKKRRLKISSVIHSLNMRCDSVRQEAKINLPFLHISTQDCSLADSSREDGD 1080

Query: 1081 TVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCK 1140
            TVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCK
Sbjct: 1081 TVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCK 1140

Query: 1141 SHGELYLRVTMSGEGTSRFWATLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYF 1200
            SHGELYLRVTMSGEG SRFWATLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYF
Sbjct: 1141 SHGELYLRVTMSGEGNSRFWATLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYF 1200

Query: 1201 LNSLESATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFS 1260
            LNSLESATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFS
Sbjct: 1201 LNSLESATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFS 1260

Query: 1261 SPGASFVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLL 1320
            SPGASFVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLL
Sbjct: 1261 SPGASFVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLL 1320

Query: 1321 GSQGICEKPNVKIESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDI 1380
            GSQGICEKPNVKIESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDI
Sbjct: 1321 GSQGICEKPNVKIESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDI 1380

Query: 1381 QKLSHTLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNT 1440
            QKLSHTLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNT
Sbjct: 1381 QKLSHTLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNT 1440

Query: 1441 RCFILVRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE 1487
            RCFILVRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE
Sbjct: 1441 RCFILVRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE 1486

BLAST of Carg21647 vs. NCBI nr
Match: XP_023539358.1 (DNA-directed RNA polymerase IV subunit 1 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 2994.5 bits (7762), Expect = 0.0e+00
Identity = 1478/1486 (99.46%), Postives = 1480/1486 (99.60%), Query Frame = 0

Query: 1    MNFAGGFRSGQKAMNHMEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVS 60
            MNFAGGFRS QKAMNHMEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVS
Sbjct: 1    MNFAGGFRSSQKAMNHMEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVS 60

Query: 61   DPKLGLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSI 120
            DPKLGLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSI
Sbjct: 61   DPKLGLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSI 120

Query: 121  RRELWGKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSK 180
            RRELWGKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSK
Sbjct: 121  RRELWGKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSK 180

Query: 181  KYQKRVARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSA 240
            KYQKRVARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSA
Sbjct: 181  KYQKRVARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSA 240

Query: 241  TDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDC 300
            TDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDC
Sbjct: 241  TDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDC 300

Query: 301  LKISKLSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIE 360
            LKISKLSPEKLESKD IYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIE
Sbjct: 301  LKISKLSPEKLESKDSIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIE 360

Query: 361  LSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELS 420
            LSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELS
Sbjct: 361  LSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELS 420

Query: 421  MGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDC 480
            MGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDC
Sbjct: 421  MGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDC 480

Query: 481  LHGYVPQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQI 540
            LHGYVPQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQI
Sbjct: 481  LHGYVPQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQI 540

Query: 541  QQLQMFALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHINNGELLSS 600
            QQLQMFALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHI NGELLSS
Sbjct: 541  QQLQMFALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHIKNGELLSS 600

Query: 601  EGSYWLRDTGRNPFQALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHS 660
            EGSYWLRDTGRNPFQALIEHCEG TLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHS
Sbjct: 601  EGSYWLRDTGRNPFQALIEHCEGMTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHS 660

Query: 661  HKNMMDDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAA 720
            HKNMMDDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAA
Sbjct: 661  HKNMMDDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAA 720

Query: 721  LNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVT 780
            LNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVT
Sbjct: 721  LNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVT 780

Query: 781  LSFGLPHKLSCSSWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVT 840
            LSFGLPHKLSCSSWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVT
Sbjct: 781  LSFGLPHKLSCSSWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVT 840

Query: 841  NRDSSFSDNAEVPGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDSSTSISNEL 900
            NRDSSFSDNAEVPGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDS TSISNEL
Sbjct: 841  NRDSSFSDNAEVPGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDSPTSISNEL 900

Query: 901  DGENNNTNRDIGGQPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSP 960
            DGENNNTNRDIGGQPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSP
Sbjct: 901  DGENNNTNRDIGGQPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSP 960

Query: 961  KQTFSLFLLEKLSKRSYGFEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWV 1020
            KQTFSLFLLEKLSKRSYGFEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWV
Sbjct: 961  KQTFSLFLLEKLSKRSYGFEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWV 1020

Query: 1021 CHFHVCKEILKKRRLKISSVIHSLNMRCDSVRQEAKINLPFLHISTQDCSLADSSREDGD 1080
            CHFHVCKEILKKRRLKISSVIHSLNMRCDSVRQEAKINLPFLHISTQDCSLADSSREDGD
Sbjct: 1021 CHFHVCKEILKKRRLKISSVIHSLNMRCDSVRQEAKINLPFLHISTQDCSLADSSREDGD 1080

Query: 1081 TVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCK 1140
            TVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCK
Sbjct: 1081 TVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCK 1140

Query: 1141 SHGELYLRVTMSGEGTSRFWATLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYF 1200
            SHGELYLRVTMSGEG SRFWATLMN+CLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYF
Sbjct: 1141 SHGELYLRVTMSGEGNSRFWATLMNNCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYF 1200

Query: 1201 LNSLESATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFS 1260
            LNSLESATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGV+RQREHALVKTPFMQACFS
Sbjct: 1201 LNSLESATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVTRQREHALVKTPFMQACFS 1260

Query: 1261 SPGASFVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLL 1320
            SPGASFVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLL
Sbjct: 1261 SPGASFVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLL 1320

Query: 1321 GSQGICEKPNVKIESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDI 1380
            GSQGICEKPNVKIESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDI
Sbjct: 1321 GSQGICEKPNVKIESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDI 1380

Query: 1381 QKLSHTLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNT 1440
            QKLSHTLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNT
Sbjct: 1381 QKLSHTLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNT 1440

Query: 1441 RCFILVRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE 1487
            RCFILVRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE
Sbjct: 1441 RCFILVRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE 1486

BLAST of Carg21647 vs. NCBI nr
Match: XP_023005247.1 (DNA-directed RNA polymerase IV subunit 1 isoform X1 [Cucurbita maxima])

HSP 1 Score: 2986.1 bits (7740), Expect = 0.0e+00
Identity = 1472/1486 (99.06%), Postives = 1479/1486 (99.53%), Query Frame = 0

Query: 1    MNFAGGFRSGQKAMNHMEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVS 60
            MNFAGGFRSGQKAMNHMEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVS
Sbjct: 1    MNFAGGFRSGQKAMNHMEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVS 60

Query: 61   DPKLGLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSI 120
            DPKLGLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSI
Sbjct: 61   DPKLGLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSI 120

Query: 121  RRELWGKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSK 180
            RRELWGKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSK
Sbjct: 121  RRELWGKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSK 180

Query: 181  KYQKRVARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSA 240
            KYQKRVARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSA
Sbjct: 181  KYQKRVARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSA 240

Query: 241  TDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDC 300
            TDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDC
Sbjct: 241  TDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDC 300

Query: 301  LKISKLSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIE 360
            LKISKLSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIE
Sbjct: 301  LKISKLSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIE 360

Query: 361  LSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELS 420
            LSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELS
Sbjct: 361  LSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELS 420

Query: 421  MGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDC 480
            MGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDC
Sbjct: 421  MGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDC 480

Query: 481  LHGYVPQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQI 540
            LHGYVPQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQI
Sbjct: 481  LHGYVPQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQI 540

Query: 541  QQLQMFALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHINNGELLSS 600
            QQLQMFALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHI NGELLSS
Sbjct: 541  QQLQMFALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHIKNGELLSS 600

Query: 601  EGSYWLRDTGRNPFQALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHS 660
            EGSYWLRDTGRNPFQALIEHCEG TLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHS
Sbjct: 601  EGSYWLRDTGRNPFQALIEHCEGMTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHS 660

Query: 661  HKNMMDDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAA 720
            HKNMMDDIFCGLQEAEETCNLIQLMVDSHKD LTGDDEGNQHVLSIEVEHLSYEKQKSAA
Sbjct: 661  HKNMMDDIFCGLQEAEETCNLIQLMVDSHKDALTGDDEGNQHVLSIEVEHLSYEKQKSAA 720

Query: 721  LNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVT 780
            LNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVT
Sbjct: 721  LNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVT 780

Query: 781  LSFGLPHKLSCSSWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVT 840
            LSFGLPHKLSCSSWNSQKMPRYI+KDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVT
Sbjct: 781  LSFGLPHKLSCSSWNSQKMPRYIRKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVT 840

Query: 841  NRDSSFSDNAEVPGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDSSTSISNEL 900
            NRDSSFSDNAEVPGTLTRKLTFLMRDIYNAYD TVRNAYGNQLVQFSYDTDS TSISNEL
Sbjct: 841  NRDSSFSDNAEVPGTLTRKLTFLMRDIYNAYDGTVRNAYGNQLVQFSYDTDSPTSISNEL 900

Query: 901  DGENNNTNRDIGGQPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSP 960
            DGENNNTNRDIGGQPVGSLAACA+SEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSP
Sbjct: 901  DGENNNTNRDIGGQPVGSLAACAISEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSP 960

Query: 961  KQTFSLFLLEKLSKRSYGFEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWV 1020
            KQ FSLFLLEKLSKRSYG+EYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWV
Sbjct: 961  KQIFSLFLLEKLSKRSYGYEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWV 1020

Query: 1021 CHFHVCKEILKKRRLKISSVIHSLNMRCDSVRQEAKINLPFLHISTQDCSLADSSREDGD 1080
            CHFHVCKEILKKRRLKISSVIHSLNMRCDS+RQEAKINLPFLHISTQDCSLADSSREDGD
Sbjct: 1021 CHFHVCKEILKKRRLKISSVIHSLNMRCDSMRQEAKINLPFLHISTQDCSLADSSREDGD 1080

Query: 1081 TVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCK 1140
            TVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCK
Sbjct: 1081 TVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCK 1140

Query: 1141 SHGELYLRVTMSGEGTSRFWATLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYF 1200
            SHGELYLRVTMSGEG SRFWATLMN+CLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYF
Sbjct: 1141 SHGELYLRVTMSGEGNSRFWATLMNNCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYF 1200

Query: 1201 LNSLESATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFS 1260
            LNSLESATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFS
Sbjct: 1201 LNSLESATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFS 1260

Query: 1261 SPGASFVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLL 1320
            SPGASFVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHEL+KPVDVYNLL
Sbjct: 1261 SPGASFVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELSKPVDVYNLL 1320

Query: 1321 GSQGICEKPNVKIESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDI 1380
            GSQGICEKPNVK+ESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDI
Sbjct: 1321 GSQGICEKPNVKMESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDI 1380

Query: 1381 QKLSHTLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNT 1440
            QKLSHTLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNT
Sbjct: 1381 QKLSHTLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNT 1440

Query: 1441 RCFILVRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE 1487
            RCFILVRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE
Sbjct: 1441 RCFILVRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE 1486

BLAST of Carg21647 vs. ExPASy Swiss-Prot
Match: Q9LQ02 (DNA-directed RNA polymerase IV subunit 1 OS=Arabidopsis thaliana OX=3702 GN=NRPD1 PE=1 SV=1)

HSP 1 Score: 1482.6 bits (3837), Expect = 0.0e+00
Identity = 781/1476 (52.91%), Postives = 1033/1476 (69.99%), Query Frame = 0

Query: 17   MEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVSDPKLGLPNPSYQCTTC 76
            MED+ + ELQ+P G L  + FS+S   D + ++V+ +EA  +V+D +LGLPNP   C TC
Sbjct: 1    MEDDCE-ELQVPVGTLTSIGFSISNNNDRDKMSVLEVEAPNQVTDSRLGLPNPDSVCRTC 60

Query: 77   GASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRRELWGKVEDPTSDFH 136
            G+   K CEGHFG I F Y+II+PYFL EVA +LNK+CPGCK IR++ +   ED      
Sbjct: 61   GSKDRKVCEGHFGVINFAYSIINPYFLKEVAALLNKICPGCKYIRKKQFQITED------ 120

Query: 137  RPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKRVARGGLPPDYW 196
            +P+ CRYC  +L   YP M+F+++T ++FR+S I+VEV E    K +KR     LPPDYW
Sbjct: 121  QPERCRYC--TLNTGYPLMKFRVTTKEVFRRSGIVVEVNEESLMKLKKRGVL-TLPPDYW 180

Query: 197  NFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSATDSLFLNSFPVTPNCH 256
            +F+P+D   +ES  +P R+++THAQV+ LL  ID + +KK +   +SL L SFPVTPN +
Sbjct: 181  SFLPQDSNIDESCLKPTRRIITHAQVYALLLGIDQRLIKKDIPMFNSLGLTSFPVTPNGY 240

Query: 257  RVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDCLKISKLSPEKL-ESKD 316
            RVTE+ H F +G RL+FDERTR YKKLV F G   EL SRV++C++ S+L  E +  SKD
Sbjct: 241  RVTEIVHQF-NGARLIFDERTRIYKKLVGFEGNTLELSSRVMECMQYSRLFSETVSSSKD 300

Query: 317  LIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQ 376
                 +K  DT     GLR++KDV+LGKRSDH FR VVVGDP+++L+EIGIP  +A+RLQ
Sbjct: 301  SANPYQKKSDTPKLC-GLRFMKDVLLGKRSDHTFRTVVVGDPSLKLNEIGIPESIAKRLQ 360

Query: 377  ISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELSMGDTIYRPLADGDVV 436
            +SEHL+  N ++L TS    L++  E+ VRR  RLV ++ V +L  GD I+R L DGD V
Sbjct: 361  VSEHLNQCNKERLVTSFVPTLLDNKEMHVRRGDRLVAIQ-VNDLQTGDKIFRSLMDGDTV 420

Query: 437  LVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVEL 496
            L+NRPPSIHQHSLIA++VR+LP +SV+SLNP+CC PFRGDFDGDCLHGYVPQS++A+VEL
Sbjct: 421  LMNRPPSIHQHSLIAMTVRILPTTSVVSLNPICCLPFRGDFDGDCLHGYVPQSIQAKVEL 480

Query: 497  RELVALDRQLVNGQSGRNLLSLSHDSLTAAHLI-MEDGVSLNLFQIQQLQMFALHQLLPP 556
             ELVALD+QL+N Q+GRNLLSL  DSLTAA+L+ +E    LN  Q+QQLQM+   QL PP
Sbjct: 481  DELVALDKQLINRQNGRNLLSLGQDSLTAAYLVNVEKNCYLNRAQMQQLQMYCPFQLPPP 540

Query: 557  AIVKA-PSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHINNGELLS-SEGSYWLRDTGRN 616
            AI+KA PS     WTG QLF +  PP FDY+ P + V ++NGELLS SEGS WLRD   N
Sbjct: 541  AIIKASPSSTEPQWTGMQLFGMLFPPGFDYTYPLNNVVVSNGELLSFSEGSAWLRDGEGN 600

Query: 617  PFQALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHSHKNMMDDIFCGL 676
              + L++H +G+ L+ ++ AQ +L +WL MRGLSVSL+DLYLS D  S KN+ ++I  GL
Sbjct: 601  FIERLLKHDKGKVLDIIYSAQEMLSQWLLMRGLSVSLADLYLSSDLQSRKNLTEEISYGL 660

Query: 677  QEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAALNQASVDAFKRV 736
            +EAE+ CN  QLMV+S +D L  + E  +     ++    YE+QKSA L++ +V AFK  
Sbjct: 661  REAEQVCNKQQLMVESWRDFLAVNGEDKEEDSVSDLARFCYERQKSATLSELAVSAFKDA 720

Query: 737  FREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFGLPHKLSCS 796
            +R++Q L Y+Y  + NS L M KAGSKGN+ KLVQHSMC+GLQ+S V+LSFG P +L+C+
Sbjct: 721  YRDVQALAYRYGDQSNSFLIMSKAGSKGNIGKLVQHSMCIGLQNSAVSLSFGFPRELTCA 780

Query: 797  SWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVTNRDSSFSDNAEV 856
            +WN    P    K   +  T+S++PY V+ENSFL+GLNP E F HSVT+RDSSFS NA++
Sbjct: 781  AWNDPNSPLRGAKGKDSTTTESYVPYGVIENSFLTGLNPLESFVHSVTSRDSSFSGNADL 840

Query: 857  PGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDSSTSISNELDGENNNTNRDIG 916
            PGTL+R+L F MRDIY AYD TVRN++GNQLVQF+Y+TD                  DI 
Sbjct: 841  PGTLSRRLMFFMRDIYAAYDGTVRNSFGNQLVQFTYETDGPV--------------EDIT 900

Query: 917  GQPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSPKQTFSLFLLEKL 976
            G+ +GSL+ACALSEAAYSALDQPISLLETSPLLNLK VLECGSK+   +QT SL+L E L
Sbjct: 901  GEALGSLSACALSEAAYSALDQPISLLETSPLLNLKNVLECGSKKGQREQTMSLYLSEYL 960

Query: 977  SKRSYGFEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWVCHFHVCKEILKK 1036
            SK+ +GFEYG+L +KNHLE++ F +IVS+ MIIF+P  + K   SPWVCHFH+ +++LK+
Sbjct: 961  SKKKHGFEYGSLEIKNHLEKLSFSEIVSTSMIIFSPSSNTKVPLSPWVCHFHISEKVLKR 1020

Query: 1037 RRLKISSVIHSLNMRCDSVRQEAKINLPFLHI-STQDCSLADSSREDGDTVCLTVTIAEN 1096
            ++L   SV+ SLN +  S  +E K+++  L I +T  CS  D + +D D VC+TVT+ E 
Sbjct: 1021 KQLSAESVVSSLNEQYKSRNRELKLDIVDLDIQNTNHCSSDDQAMKD-DNVCITVTVVEA 1080

Query: 1097 TKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCKS-HGELYLRVT 1156
            +K+S L+LD I+ +LI FLL + ++G   I KV+I W DRPK PK +     GELYL+VT
Sbjct: 1081 SKHSVLELDAIRLVLIPFLLDSPVKGDQGIKKVNILWTDRPKAPKRNGNHLAGELYLKVT 1140

Query: 1157 MSGE-GTSRFWATLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYFLNSLESATL 1216
            M G+ G    W  L+  CLPIMD+IDW RSHPDNI   C  YGID+GR+ F+ +LESA  
Sbjct: 1141 MYGDRGKRNCWTALLETCLPIMDMIDWGRSHPDNIRQCCSVYGIDAGRSIFVANLESAVS 1200

Query: 1217 DIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFSSPGASFVKA 1276
            D GK I  EHLLLVA++LS TGEFV LN KG S+QR+      PF QACFSSP   F+KA
Sbjct: 1201 DTGKEILREHLLLVADSLSVTGEFVALNAKGWSKQRQVESTPAPFTQACFSSPSQCFLKA 1260

Query: 1277 AKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLLGSQGICEKP 1336
            AK G++D L GS+DALAWGK+P  GTG QF+I+ S K H    PVDVY+LL S     + 
Sbjct: 1261 AKEGVRDDLQGSIDALAWGKVPGFGTGDQFEIIISPKVHGFTTPVDVYDLLSSTKTMRRT 1320

Query: 1337 NVKIESLDKNTIYEKYSAVVHKNGGSTIKGLKKLD--SVSKSILREFLTLNDIQKLSHTL 1396
            N   +S DK T+  +   ++H    + +K +K LD   +  S+LR   T  +I+ LS +L
Sbjct: 1321 NSAPKS-DKATV--QPFGLLH---SAFLKDIKVLDGKGIPMSLLRTIFTWKNIELLSQSL 1380

Query: 1397 RSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNTRCFILVR 1456
            + IL  Y +NE LNE D+  + M L  HP   EKIG G + I+V + SK+ ++ CF +VR
Sbjct: 1381 KRILHSYEINELLNERDEGLVKMVLQLHPNSVEKIGPGVKGIRV-AKSKHGDSCCFEVVR 1440

Query: 1457 SDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQD 1484
             DGT EDFSYHKCVLGA +IIAP ++  Y+SK++++
Sbjct: 1441 IDGTFEDFSYHKCVLGATKIIAPKKMNFYKSKYLKN 1441

BLAST of Carg21647 vs. ExPASy Swiss-Prot
Match: Q5D869 (DNA-directed RNA polymerase V subunit 1 OS=Arabidopsis thaliana OX=3702 GN=NRPE1 PE=1 SV=1)

HSP 1 Score: 399.4 bits (1025), Expect = 1.7e-109
Identity = 368/1361 (27.04%), Postives = 610/1361 (44.82%), Query Frame = 0

Query: 20   EQDSELQIPSGVLVGVNFSVSTQQD--MENIAVINIEAACEVSDPKLGLPNPSYQCTTCG 79
            E++S  +I  G +VG+ F++++  +  +++I+   I    ++++  LGLP    +C +CG
Sbjct: 2    EEESTSEILDGEIVGITFALASHHEICIQSISESAINHPSQLTNAFLGLPLEFGKCESCG 61

Query: 80   ASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRRELWGKVEDPTSDFHR 139
            A+    CEGHFG I+ P  I HP  ++E+ Q+L+ +C  C  I+               +
Sbjct: 62   ATEPDKCEGHFGYIQLPVPIYHPAHVNELKQMLSLLCLKCLKIK---------------K 121

Query: 140  PKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKY--QKRVARGGLPPDY 199
             KG     G L D       +L        S I ++ + +    Y   K  +R  L P  
Sbjct: 122  AKGTS---GGLAD-------RLLGVCCEEASQISIKDRASDGASYLELKLPSRSRLQPGC 181

Query: 200  WNFIPK-DEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVS----ATDSLFLNSFP 259
            WNF+ +   +    Y RP    L   +V  +L+ I  +  KK  +      +   L   P
Sbjct: 182  WNFLERYGYRYGSDYTRP----LLAREVKEILRRIPEESRKKLTAKGHIPQEGYILEYLP 241

Query: 260  VTPNCHRVTEMTHSFSSG-------------QRLVFDERTRAYK-KLVDFRGTANELGSR 319
            V PNC  V E +  FS+              ++++  + +R+ +      +  A+E+  R
Sbjct: 242  VPPNCLSVPEASDGFSTMSVDPSRIELKDVLKKVIAIKSSRSGETNFESHKAEASEM-FR 301

Query: 320  VLDCLKISKLSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGD 379
            V+D     + + +   + D+ Y   KI D+++S      ++ + + K S    R V+ GD
Sbjct: 302  VVDTYLQVRGTAKAARNIDMRYGVSKISDSSSSKAWTEKMRTLFIRKGSGFSSRSVITGD 361

Query: 380  PNIELSEIGIPCHVAERLQISEHLSSWN----MKKLSTSCYLRLVEKGEIFVRREGRLVR 439
                ++E+GIP  +A+R+   E +S  N     K +     L   +    +  R+G    
Sbjct: 362  AYRHVNEVGIPIEIAQRITFEERVSVHNRGYLQKLVDDKLCLSYTQGSTTYSLRDGS--- 421

Query: 440  VRHVLELSMGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPF 499
             +   EL  G  ++R + DGDVV +NRPP+ H+HSL AL V V   ++V  +NPL CSP 
Sbjct: 422  -KGHTELKPGQVVHRRVMDGDVVFINRPPTTHKHSLQALRVYVHEDNTV-KINPLMCSPL 481

Query: 500  RGDFDGDCLHGYVPQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDG 559
              DFDGDC+H + PQSL A+ E+ EL ++++QL++  +G+ +L +  DSL +  +++E  
Sbjct: 482  SADFDGDCVHLFYPQSLSAKAEVMELFSVEKQLLSSHTGQLILQMGSDSLLSLRVMLE-R 541

Query: 560  VSLNLFQIQQLQMFALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHI 619
            V L+    QQL M+    L PPA+ K+ S    AWT  Q+  +  P     S    R  +
Sbjct: 542  VFLDKATAQQLAMYGSLSLPPPALRKS-SKSGPAWTVFQILQLAFPERL--SCKGDRFLV 601

Query: 620  NNGELLSSEGSYWLRDTGRNPF--QALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLS 679
            +  +LL  +       +  N       +E     TL +    Q +L E L   G S+SL 
Sbjct: 602  DGSDLLKFDFGVDAMGSIINEIVTSIFLEKGPKETLGFFDSLQPLLMESLFAEGFSLSLE 661

Query: 680  DLYLS-VDSHSHKNMMDDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVE 739
            DL +S  D     N++      ++E     + ++L   S++D              +++E
Sbjct: 662  DLSMSRADMDVIHNLI------IREISPMVSRLRL---SYRD-------------ELQLE 721

Query: 740  HLSYEKQKSAALNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHS 799
            + S  K K  A N      F      I+NL+     K NS +T           KLVQ +
Sbjct: 722  N-SIHKVKEVAAN------FMLKSYSIRNLI---DIKSNSAIT-----------KLVQQT 781

Query: 800  MCLGLQHSLVTLSFGLPHKLSCSSWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGL 859
              LGLQ S          K   +    + M  + ++     R  S   + +V+  F  GL
Sbjct: 782  GFLGLQLS--------DKKKFYTKTLVEDMAIFCKRK--YGRISSSGDFGIVKGCFFHGL 841

Query: 860  NPFECFAHSVTNRD--SSFSDNAEVPGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFS 919
            +P+E  AHS+  R+     S     PGTL + L  ++RDI    D TVRN   N ++QF 
Sbjct: 842  DPYEEMAHSIAAREVIVRSSRGLAEPGTLFKNLMAVLRDIVITNDGTVRNTCSNSVIQFK 901

Query: 920  YDTDSSTSISNELDGENNNTNRDIGGQPVGSLAACALSEAAYSALDQPISLLETSPLLN- 979
            Y  DS          E  +      G+PVG LAA A+S  AY A      +L++SP  N 
Sbjct: 902  YGVDS----------ERGHQGLFEAGEPVGVLAATAMSNPAYKA------VLDSSPNSNS 961

Query: 980  ----LKKVLEC--GSKRNSPKQTFSLFLLEKLSKRSYGFEYGALGVKNHLERVIFKDIVS 1039
                +K+VL C    +  +  +   L+L E    + +  E  A  V+N L +V  KD   
Sbjct: 962  SWELMKEVLLCKVNFQNTTNDRRVILYLNECHCGKRFCQENAACTVRNKLNKVSLKDTAV 1021

Query: 1040 SVMIIFAPEPSRKRHFSPWVC---HFHVCKEILKKRRLKISSVIHSLNMRCDSV------ 1099
              ++ +  +P+    F    C   H H+ K +L+   + +  +    + +C+ V      
Sbjct: 1022 EFLVEYRKQPTISEIFGIDSCLHGHIHLNKTLLQDWNISMQDI----HQKCEDVINSLGQ 1081

Query: 1100 RQEAKINLPFLHIS---TQDCSLADSSREDG-DTVCLTVTIAENTKNSFLQLDFIQDLLI 1159
            +++ K    F   S   ++ CS  D     G D  CLT +      +    LD + + + 
Sbjct: 1082 KKKKKATDDFKRTSLSVSECCSFRDPCGSKGSDMPCLTFSYNATDPDLERTLDVLCNTVY 1141

Query: 1160 HFLLGTVIRGFAEIDKVDISWNDRPK---VPKPHCKSHGELYLRVTMSGEGTSR---FWA 1219
              LL  VI+G + I   +I WN       +   H    GE  L VT+      +    W 
Sbjct: 1142 PVLLEIVIKGDSRICSANIIWNSSDMTTWIRNRHASRRGEWVLDVTVEKSAVKQSGDAWR 1201

Query: 1220 TLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYFLNSLESATLDIGKTIRHEHLL 1279
             +++ CL ++ LID  RS P ++       G+       +  L ++   + K +  EH++
Sbjct: 1202 VVIDSCLSVLHLIDTKRSIPYSVKQVQELLGLSCAFEQAVQRLSASVRMVSKGVLKEHII 1250

Query: 1280 LVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFSSPGASFVKAAKAGIKDSLSGS 1321
            L+AN ++ +G  +G N  G         +K PF +A   +P   F KAA+    DSLS  
Sbjct: 1262 LLANNMTCSGTMLGFNSGGYKALTRSLNIKAPFTEATLIAPRKCFEKAAEKCHTDSLSTV 1250


HSP 2 Score: 68.6 bits (166), Expect = 7.0e-10
Identity = 41/123 (33.33%), Postives = 68/123 (55.28%), Query Frame = 0

Query: 1361 KKLDSVSKSILREFLTLNDIQKLSHTLRSILR--KYSLNERLNEVDKS-TLMMALYFHPQ 1420
            ++LDS +     E   L+D++ +  TLR I+    Y   + +++ DK+  L   L FHPQ
Sbjct: 1727 QRLDSFTS---EEQELLSDVEPVMRTLRKIMHPSAYPDGDPISDDDKTFVLEKILNFHPQ 1786

Query: 1421 RDEKIGVGAQDIKVGSHSKYSNTRCFILVRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQ 1480
            ++ K+G G   I V  H+ +S++RCF +V +DG  +DFSY K +   L    P R + + 
Sbjct: 1787 KETKLGSGVDFITVDKHTIFSDSRCFFVVSTDGAKQDFSYRKSLNNYLMKKYPDRAEEFI 1846

BLAST of Carg21647 vs. ExPASy Swiss-Prot
Match: P36594 (DNA-directed RNA polymerase II subunit rpb1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=rpb1 PE=1 SV=1)

HSP 1 Score: 221.1 bits (562), Expect = 8.4e-56
Identity = 225/944 (23.83%), Postives = 408/944 (43.22%), Query Frame = 0

Query: 17  MEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIE------------AACEVSDPKL 76
           M   Q S   +P   +  V F + + +++ +++V  IE                + DP+L
Sbjct: 1   MSGIQFSPSSVPLRRVEEVQFGILSPEEIRSMSVAKIEFPETMDESGQRPRVGGLLDPRL 60

Query: 77  GLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRREL 136
           G  +  ++C TCG ++   C GHFG I+    + H  FLS++ ++L  VC  C  ++ + 
Sbjct: 61  GTIDRQFKCQTCGETMAD-CPGHFGHIELAKPVFHIGFLSKIKKILECVCWNCGKLKIDS 120

Query: 137 WGKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLST-TDMFRKSMIMVEVKENMSKKYQ 196
                + T  +  PK       ++          LS  +D F  S     +        Q
Sbjct: 121 SNPKFNDTQRYRDPKNRLNAVWNVCKTKMVCDTGLSAGSDNFDLSNPSANMGHGGCGAAQ 180

Query: 197 KRVARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLL-----KDIDPKFLKKFV 256
             + + GL    W    + + + +    P +++L+  +VH +      +D+    L +  
Sbjct: 181 PTIRKDGL--RLWGSWKRGKDESD---LPEKRLLSPLEVHTIFTHISSEDLAHLGLNEQY 240

Query: 257 SATDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSR-- 316
           +  D + +   PV P   R +      S G+  +  + +   K   + R    E      
Sbjct: 241 ARPDWMIITVLPVPPPSVRPSISVDGTSRGEDDLTHKLSDIIKANANVRRCEQEGAPAHI 300

Query: 317 VLDCLKISKLSPEKLESKDLIYQQKKIKDTATSSNGLR--------WIKDVVLGKRSDHC 376
           V +  ++ +         ++  Q + ++ +      +R         ++  ++GKR D  
Sbjct: 301 VSEYEQLLQFHVATYMDNEIAGQPQALQKSGRPLKSIRARLKGKEGRLRGNLMGKRVDFS 360

Query: 377 FRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKG-------E 436
            R V+ GDPN+ L E+G+P  +A+ L   E ++ +N+ +L       LV  G       +
Sbjct: 361 ARTVITGDPNLSLDELGVPRSIAKTLTYPETVTPYNIYQLQ-----ELVRNGPDEHPGAK 420

Query: 437 IFVRREGRLVRVRH-----VLELSMGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVRVL 496
             +R  G  + +R+      + L  G  + R + DGDVV+ NR PS+H+ S++   +RV+
Sbjct: 421 YIIRDTGERIDLRYHKRAGDIPLRYGWRVERHIRDGDVVIFNRQPSLHKMSMMGHRIRVM 480

Query: 497 PVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVELRELVALDRQLVNGQSGRNLLS 556
           P S+   LN    SP+  DFDGD ++ +VPQS E R E++E+  + +Q+V+ QS + ++ 
Sbjct: 481 PYST-FRLNLSVTSPYNADFDGDEMNMHVPQSEETRAEIQEITMVPKQIVSPQSNKPVMG 540

Query: 557 LSHDSLTAAH--LIMEDGVSLNLFQIQQLQMFALHQLLPPAIVKAPSFRSCAWTGKQLFS 616
           +  D+L       + ++ ++ N      L +     +LPP ++  P      WTGKQ+ S
Sbjct: 541 IVQDTLAGVRKFSLRDNFLTRNAVMNIMLWVPDWDGILPPPVILKP---KVLWTGKQILS 600

Query: 617 IFLPP------DFDYSSPSHRVH----INNGELL----------SSEG----SYWLRDTG 676
           + +P       D D  S S+       I NGE++          +S+G    + W     
Sbjct: 601 LIIPKGINLIRDDDKQSLSNPTDSGMLIENGEIIYGVVDKKTVGASQGGLVHTIWKE--- 660

Query: 677 RNPFQALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHSHKNMMDDIFC 736
           + P     E C+G    + +  QRV+  WL   G S+ + D     D+      M ++  
Sbjct: 661 KGP-----EICKG----FFNGIQRVVNYWLLHNGFSIGIGDTIADADT------MKEVTR 720

Query: 737 GLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAALNQASVDAFK 796
            ++EA     + + + D+  + L  +               S+E + S  LNQA  +A +
Sbjct: 721 TVKEARR--QVAECIQDAQHNRLKPEPGMTLRE--------SFEAKVSRILNQARDNAGR 780

Query: 797 RVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQ-HSLVTLSFGLPHKL 856
                +++         N++  M  AGSKG+ + + Q S C+G Q      + FG  ++ 
Sbjct: 781 SAEHSLKD--------SNNVKQMVAAGSKGSFINISQMSACVGQQIVEGKRIPFGFKYR- 840

Query: 857 SCSSWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVTNRDSSFSDN 892
                    +P +  KD  +  ++ FI     ENS+L GL P E F H++  R+      
Sbjct: 841 --------TLPHF-PKDDDSPESRGFI-----ENSYLRGLTPQEFFFHAMAGREGLIDTA 877

BLAST of Carg21647 vs. ExPASy Swiss-Prot
Match: P11414 (DNA-directed RNA polymerase II subunit RPB1 OS=Cricetulus griseus OX=10029 GN=POLR2A PE=1 SV=2)

HSP 1 Score: 210.3 bits (534), Expect = 1.5e-52
Identity = 226/913 (24.75%), Postives = 395/913 (43.26%), Query Frame = 0

Query: 61  DPKLGLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVC------ 120
           DP+ G+   + +C TC  ++ + C GHFG I+    + H  FL +  +VL  VC      
Sbjct: 59  DPRQGVIERTGRCQTCAGNMTE-CPGHFGHIELAKPVFHVGFLVKTMKVLRCVCFFCSKL 118

Query: 121 ------PGCKSIRRELWGKVEDP-TSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRK 180
                 P  K I  +  G+ +   T  +   KG   C G        M  K         
Sbjct: 119 LVDSNNPKIKDILAKSKGQPKKRLTHVYDLCKGKNICEGG-----EEMDNKFGVEQPEGD 178

Query: 181 SMIMVEVKENMSKKYQKRVARGGLP-PDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLL 240
             +  E       +YQ R+ R GL     W  + +D Q+++    P R       VH + 
Sbjct: 179 EDLTKEKGHGGCGRYQPRIRRSGLELYAEWKHVNEDSQEKKILLSPER-------VHEIF 238

Query: 241 KDIDPK-----FLKKFVSATDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYK 300
           K I  +      ++   +  + + +   PV P   R   +    +  Q    D+ T    
Sbjct: 239 KRISDEECFVLGMEPRYARPEWMIVTVLPVPPLSVRPAVVMQGSARNQ----DDLTHKLA 298

Query: 301 KLVDF-----RGTANELGSRVL-DCLKISKLSPEKLESKDLIYQQKKIKDTATSSNGLRW 360
            +V       R   N   + V+ + +K+ +     +   +L    + ++ +      L+ 
Sbjct: 299 DIVKINNQLRRNEQNGAAAHVIAEDVKLLQFHVATMVDNELPGLPRAMQKSGRPLKSLKQ 358

Query: 361 --------IKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNMKK 420
                   ++  ++GKR D   R V+  DPN+ + ++G+P  +A  +  +E ++ +N+ +
Sbjct: 359 RLKGKEGRVRGNLMGKRVDFSARTVITPDPNLSIDQVGVPRSIAANMTFAEIVTPFNIDR 418

Query: 421 LSTSCYLRLVEKG-------EIFVRREGRLVRVR-----HVLELSMGDTIYRPLADGDVV 480
           L       LV +G       +  +R  G  + +R       L L  G  + R + DGD+V
Sbjct: 419 LQ-----ELVRRGNSQYPGAKYIIRDNGDRIDLRFHPKPSDLHLQTGYKVERHMCDGDIV 478

Query: 481 LVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVEL 540
           + NR P++H+ S++   VR+LP S+   LN    +P+  DFDGD ++ ++PQSLE R E+
Sbjct: 479 IFNRQPTLHKMSMMGHRVRILPWST-FRLNLSVTTPYNADFDGDEMNLHLPQSLETRAEI 538

Query: 541 RELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQIQQLQMFAL---HQLL 600
           +EL  + R +V  QS R ++ +  D+LTA     +  V L   ++  L MF      ++ 
Sbjct: 539 QELAMVPRMIVTPQSNRPVMGIVQDTLTAVRKFTKRDVFLERGEVMNLLMFLSTWDGKVP 598

Query: 601 PPAIVKAPSFRSCAWTGKQLFSIFLP------------PDFDYSSP-------SHRVHIN 660
            PAI+K        WTGKQ+FS+ +P            PD + S P         +V + 
Sbjct: 599 QPAILKPRPL----WTGKQIFSLIIPGHINCIRTHSTHPDDEDSGPYKHISPGDTKVVVE 658

Query: 661 NGELLSSEGSYWLRDTGRNPFQAL-IEHCE-GRTLNYLHIA--QRVLCEWLSMRGLSVSL 720
           NGEL+   G    +  G +    + I + E G  +  L  +  Q V+  WL + G ++ +
Sbjct: 659 NGELIM--GILCKKSLGTSAGSLVHISYLEMGHDITRLFYSNIQTVINNWLLIEGHTIGI 718

Query: 721 SDLYLSVDSHSHKNMMDDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVE 780
            D     DS +++++ + I    ++A++  ++I+++  +H + L     GN         
Sbjct: 719 GDSI--ADSKTYQDIQNTI----KKAKQ--DVIEVIEKAHNNELE-PTPGN-------TL 778

Query: 781 HLSYEKQKSAALNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHS 840
             ++E Q +  LN    DA  +     Q  + +Y    N+  +M  +G+KG+ + + Q  
Sbjct: 779 RQTFENQVNRILN----DARDKTGSSAQKSLSEY----NNFKSMVVSGAKGSKINISQVI 838

Query: 841 MCLGLQH-SLVTLSFGLPHKLSCSSWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSG 900
             +G Q+     + FG  H+          +P +I KD     ++ F     VENS+L+G
Sbjct: 839 AVVGQQNVEGKRIPFGFKHR---------TLPHFI-KDDYGPESRGF-----VENSYLAG 898

BLAST of Carg21647 vs. ExPASy Swiss-Prot
Match: P24928 (DNA-directed RNA polymerase II subunit RPB1 OS=Homo sapiens OX=9606 GN=POLR2A PE=1 SV=2)

HSP 1 Score: 210.3 bits (534), Expect = 1.5e-52
Identity = 226/913 (24.75%), Postives = 395/913 (43.26%), Query Frame = 0

Query: 61  DPKLGLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVC------ 120
           DP+ G+   + +C TC  ++ + C GHFG I+    + H  FL +  +VL  VC      
Sbjct: 59  DPRQGVIERTGRCQTCAGNMTE-CPGHFGHIELAKPVFHVGFLVKTMKVLRCVCFFCSKL 118

Query: 121 ------PGCKSIRRELWGKVEDP-TSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRK 180
                 P  K I  +  G+ +   T  +   KG   C G        M  K         
Sbjct: 119 LVDSNNPKIKDILAKSKGQPKKRLTHVYDLCKGKNICEGG-----EEMDNKFGVEQPEGD 178

Query: 181 SMIMVEVKENMSKKYQKRVARGGLP-PDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLL 240
             +  E       +YQ R+ R GL     W  + +D Q+++    P R       VH + 
Sbjct: 179 EDLTKEKGHGGCGRYQPRIRRSGLELYAEWKHVNEDSQEKKILLSPER-------VHEIF 238

Query: 241 KDIDPK-----FLKKFVSATDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYK 300
           K I  +      ++   +  + + +   PV P   R   +    +  Q    D+ T    
Sbjct: 239 KRISDEECFVLGMEPRYARPEWMIVTVLPVPPLSVRPAVVMQGSARNQ----DDLTHKLA 298

Query: 301 KLVDF-----RGTANELGSRVL-DCLKISKLSPEKLESKDLIYQQKKIKDTATSSNGLRW 360
            +V       R   N   + V+ + +K+ +     +   +L    + ++ +      L+ 
Sbjct: 299 DIVKINNQLRRNEQNGAAAHVIAEDVKLLQFHVATMVDNELPGLPRAMQKSGRPLKSLKQ 358

Query: 361 --------IKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNMKK 420
                   ++  ++GKR D   R V+  DPN+ + ++G+P  +A  +  +E ++ +N+ +
Sbjct: 359 RLKGKEGRVRGNLMGKRVDFSARTVITPDPNLSIDQVGVPRSIAANMTFAEIVTPFNIDR 418

Query: 421 LSTSCYLRLVEKG-------EIFVRREGRLVRVR-----HVLELSMGDTIYRPLADGDVV 480
           L       LV +G       +  +R  G  + +R       L L  G  + R + DGD+V
Sbjct: 419 LQ-----ELVRRGNSQYPGAKYIIRDNGDRIDLRFHPKPSDLHLQTGYKVERHMCDGDIV 478

Query: 481 LVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVEL 540
           + NR P++H+ S++   VR+LP S+   LN    +P+  DFDGD ++ ++PQSLE R E+
Sbjct: 479 IFNRQPTLHKMSMMGHRVRILPWST-FRLNLSVTTPYNADFDGDEMNLHLPQSLETRAEI 538

Query: 541 RELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQIQQLQMFAL---HQLL 600
           +EL  + R +V  QS R ++ +  D+LTA     +  V L   ++  L MF      ++ 
Sbjct: 539 QELAMVPRMIVTPQSNRPVMGIVQDTLTAVRKFTKRDVFLERGEVMNLLMFLSTWDGKVP 598

Query: 601 PPAIVKAPSFRSCAWTGKQLFSIFLP------------PDFDYSSP-------SHRVHIN 660
            PAI+K        WTGKQ+FS+ +P            PD + S P         +V + 
Sbjct: 599 QPAILKPRPL----WTGKQIFSLIIPGHINCIRTHSTHPDDEDSGPYKHISPGDTKVVVE 658

Query: 661 NGELLSSEGSYWLRDTGRNPFQAL-IEHCE-GRTLNYLHIA--QRVLCEWLSMRGLSVSL 720
           NGEL+   G    +  G +    + I + E G  +  L  +  Q V+  WL + G ++ +
Sbjct: 659 NGELIM--GILCKKSLGTSAGSLVHISYLEMGHDITRLFYSNIQTVINNWLLIEGHTIGI 718

Query: 721 SDLYLSVDSHSHKNMMDDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVE 780
            D     DS +++++ + I    ++A++  ++I+++  +H + L     GN         
Sbjct: 719 GDSI--ADSKTYQDIQNTI----KKAKQ--DVIEVIEKAHNNELE-PTPGN-------TL 778

Query: 781 HLSYEKQKSAALNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHS 840
             ++E Q +  LN    DA  +     Q  + +Y    N+  +M  +G+KG+ + + Q  
Sbjct: 779 RQTFENQVNRILN----DARDKTGSSAQKSLSEY----NNFKSMVVSGAKGSKINISQVI 838

Query: 841 MCLGLQH-SLVTLSFGLPHKLSCSSWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSG 900
             +G Q+     + FG  H+          +P +I KD     ++ F     VENS+L+G
Sbjct: 839 AVVGQQNVEGKRIPFGFKHR---------TLPHFI-KDDYGPESRGF-----VENSYLAG 898

BLAST of Carg21647 vs. ExPASy TrEMBL
Match: A0A6J1FKU9 (DNA-directed RNA polymerase OS=Cucurbita moschata OX=3662 GN=LOC111444908 PE=4 SV=1)

HSP 1 Score: 3001.1 bits (7779), Expect = 0.0e+00
Identity = 1481/1486 (99.66%), Postives = 1481/1486 (99.66%), Query Frame = 0

Query: 1    MNFAGGFRSGQKAMNHMEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVS 60
            MNFAGGFRSGQKAMNHMEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVS
Sbjct: 1    MNFAGGFRSGQKAMNHMEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVS 60

Query: 61   DPKLGLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSI 120
            DPKLGLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSI
Sbjct: 61   DPKLGLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSI 120

Query: 121  RRELWGKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSK 180
            RRELWGKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSK
Sbjct: 121  RRELWGKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSK 180

Query: 181  KYQKRVARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSA 240
            KYQKRVARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSA
Sbjct: 181  KYQKRVARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSA 240

Query: 241  TDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDC 300
            TDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDC
Sbjct: 241  TDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDC 300

Query: 301  LKISKLSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIE 360
            LKISKLSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIE
Sbjct: 301  LKISKLSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIE 360

Query: 361  LSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELS 420
            LSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELS
Sbjct: 361  LSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELS 420

Query: 421  MGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDC 480
            MGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDC
Sbjct: 421  MGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDC 480

Query: 481  LHGYVPQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQI 540
            LHGYVPQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQI
Sbjct: 481  LHGYVPQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQI 540

Query: 541  QQLQMFALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHINNGELLSS 600
            QQLQMFALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHI NGELLSS
Sbjct: 541  QQLQMFALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHIKNGELLSS 600

Query: 601  EGSYWLRDTGRNPFQALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHS 660
            EGSYWLRDTGRNPFQALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHS
Sbjct: 601  EGSYWLRDTGRNPFQALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHS 660

Query: 661  HKNMMDDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAA 720
            HKNMMDDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAA
Sbjct: 661  HKNMMDDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAA 720

Query: 721  LNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVT 780
            LNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVT
Sbjct: 721  LNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVT 780

Query: 781  LSFGLPHKLSCSSWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVT 840
            LSFGLPHKLSCSSWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVT
Sbjct: 781  LSFGLPHKLSCSSWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVT 840

Query: 841  NRDSSFSDNAEVPGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDSSTSISNEL 900
            NRDSSFSDNAEVPGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDS  S SNEL
Sbjct: 841  NRDSSFSDNAEVPGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDSPMSTSNEL 900

Query: 901  DGENNNTNRDIGGQPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSP 960
            DGENNNTNRDIGGQPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSP
Sbjct: 901  DGENNNTNRDIGGQPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSP 960

Query: 961  KQTFSLFLLEKLSKRSYGFEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWV 1020
            KQTFSLFLLEKLSKRSYGFEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWV
Sbjct: 961  KQTFSLFLLEKLSKRSYGFEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWV 1020

Query: 1021 CHFHVCKEILKKRRLKISSVIHSLNMRCDSVRQEAKINLPFLHISTQDCSLADSSREDGD 1080
            CHFHVCKEILKKRRLKISSVIHSLNMRCDSVRQEAKINLPFLHISTQDCSLADSSREDGD
Sbjct: 1021 CHFHVCKEILKKRRLKISSVIHSLNMRCDSVRQEAKINLPFLHISTQDCSLADSSREDGD 1080

Query: 1081 TVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCK 1140
            TVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCK
Sbjct: 1081 TVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCK 1140

Query: 1141 SHGELYLRVTMSGEGTSRFWATLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYF 1200
            SHGELYLRVTMSGEG SRFWATLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYF
Sbjct: 1141 SHGELYLRVTMSGEGNSRFWATLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYF 1200

Query: 1201 LNSLESATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFS 1260
            LNSLESATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFS
Sbjct: 1201 LNSLESATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFS 1260

Query: 1261 SPGASFVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLL 1320
            SPGASFVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLL
Sbjct: 1261 SPGASFVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLL 1320

Query: 1321 GSQGICEKPNVKIESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDI 1380
            GSQGICEKPNVKIESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDI
Sbjct: 1321 GSQGICEKPNVKIESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDI 1380

Query: 1381 QKLSHTLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNT 1440
            QKLSHTLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNT
Sbjct: 1381 QKLSHTLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNT 1440

Query: 1441 RCFILVRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE 1487
            RCFILVRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE
Sbjct: 1441 RCFILVRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE 1486

BLAST of Carg21647 vs. ExPASy TrEMBL
Match: A0A6J1KSL8 (DNA-directed RNA polymerase OS=Cucurbita maxima OX=3661 GN=LOC111498320 PE=4 SV=1)

HSP 1 Score: 2986.1 bits (7740), Expect = 0.0e+00
Identity = 1472/1486 (99.06%), Postives = 1479/1486 (99.53%), Query Frame = 0

Query: 1    MNFAGGFRSGQKAMNHMEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVS 60
            MNFAGGFRSGQKAMNHMEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVS
Sbjct: 1    MNFAGGFRSGQKAMNHMEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVS 60

Query: 61   DPKLGLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSI 120
            DPKLGLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSI
Sbjct: 61   DPKLGLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSI 120

Query: 121  RRELWGKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSK 180
            RRELWGKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSK
Sbjct: 121  RRELWGKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSK 180

Query: 181  KYQKRVARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSA 240
            KYQKRVARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSA
Sbjct: 181  KYQKRVARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSA 240

Query: 241  TDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDC 300
            TDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDC
Sbjct: 241  TDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDC 300

Query: 301  LKISKLSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIE 360
            LKISKLSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIE
Sbjct: 301  LKISKLSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIE 360

Query: 361  LSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELS 420
            LSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELS
Sbjct: 361  LSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELS 420

Query: 421  MGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDC 480
            MGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDC
Sbjct: 421  MGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDC 480

Query: 481  LHGYVPQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQI 540
            LHGYVPQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQI
Sbjct: 481  LHGYVPQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQI 540

Query: 541  QQLQMFALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHINNGELLSS 600
            QQLQMFALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHI NGELLSS
Sbjct: 541  QQLQMFALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHIKNGELLSS 600

Query: 601  EGSYWLRDTGRNPFQALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHS 660
            EGSYWLRDTGRNPFQALIEHCEG TLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHS
Sbjct: 601  EGSYWLRDTGRNPFQALIEHCEGMTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHS 660

Query: 661  HKNMMDDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAA 720
            HKNMMDDIFCGLQEAEETCNLIQLMVDSHKD LTGDDEGNQHVLSIEVEHLSYEKQKSAA
Sbjct: 661  HKNMMDDIFCGLQEAEETCNLIQLMVDSHKDALTGDDEGNQHVLSIEVEHLSYEKQKSAA 720

Query: 721  LNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVT 780
            LNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVT
Sbjct: 721  LNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVT 780

Query: 781  LSFGLPHKLSCSSWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVT 840
            LSFGLPHKLSCSSWNSQKMPRYI+KDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVT
Sbjct: 781  LSFGLPHKLSCSSWNSQKMPRYIRKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVT 840

Query: 841  NRDSSFSDNAEVPGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDSSTSISNEL 900
            NRDSSFSDNAEVPGTLTRKLTFLMRDIYNAYD TVRNAYGNQLVQFSYDTDS TSISNEL
Sbjct: 841  NRDSSFSDNAEVPGTLTRKLTFLMRDIYNAYDGTVRNAYGNQLVQFSYDTDSPTSISNEL 900

Query: 901  DGENNNTNRDIGGQPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSP 960
            DGENNNTNRDIGGQPVGSLAACA+SEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSP
Sbjct: 901  DGENNNTNRDIGGQPVGSLAACAISEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSP 960

Query: 961  KQTFSLFLLEKLSKRSYGFEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWV 1020
            KQ FSLFLLEKLSKRSYG+EYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWV
Sbjct: 961  KQIFSLFLLEKLSKRSYGYEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWV 1020

Query: 1021 CHFHVCKEILKKRRLKISSVIHSLNMRCDSVRQEAKINLPFLHISTQDCSLADSSREDGD 1080
            CHFHVCKEILKKRRLKISSVIHSLNMRCDS+RQEAKINLPFLHISTQDCSLADSSREDGD
Sbjct: 1021 CHFHVCKEILKKRRLKISSVIHSLNMRCDSMRQEAKINLPFLHISTQDCSLADSSREDGD 1080

Query: 1081 TVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCK 1140
            TVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCK
Sbjct: 1081 TVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCK 1140

Query: 1141 SHGELYLRVTMSGEGTSRFWATLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYF 1200
            SHGELYLRVTMSGEG SRFWATLMN+CLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYF
Sbjct: 1141 SHGELYLRVTMSGEGNSRFWATLMNNCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYF 1200

Query: 1201 LNSLESATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFS 1260
            LNSLESATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFS
Sbjct: 1201 LNSLESATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFS 1260

Query: 1261 SPGASFVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLL 1320
            SPGASFVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHEL+KPVDVYNLL
Sbjct: 1261 SPGASFVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELSKPVDVYNLL 1320

Query: 1321 GSQGICEKPNVKIESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDI 1380
            GSQGICEKPNVK+ESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDI
Sbjct: 1321 GSQGICEKPNVKMESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDI 1380

Query: 1381 QKLSHTLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNT 1440
            QKLSHTLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNT
Sbjct: 1381 QKLSHTLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNT 1440

Query: 1441 RCFILVRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE 1487
            RCFILVRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE
Sbjct: 1441 RCFILVRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE 1486

BLAST of Carg21647 vs. ExPASy TrEMBL
Match: A0A6J1FF54 (DNA-directed RNA polymerase OS=Cucurbita moschata OX=3662 GN=LOC111444908 PE=4 SV=1)

HSP 1 Score: 2738.8 bits (7098), Expect = 0.0e+00
Identity = 1355/1361 (99.56%), Postives = 1356/1361 (99.63%), Query Frame = 0

Query: 126  GKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKR 185
            G+VEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKR
Sbjct: 4    GQVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKR 63

Query: 186  VARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSATDSLF 245
            VARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSATDSLF
Sbjct: 64   VARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSATDSLF 123

Query: 246  LNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDCLKISK 305
            LNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDCLKISK
Sbjct: 124  LNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDCLKISK 183

Query: 306  LSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIG 365
            LSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIG
Sbjct: 184  LSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIG 243

Query: 366  IPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELSMGDTI 425
            IPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELSMGDTI
Sbjct: 244  IPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELSMGDTI 303

Query: 426  YRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDCLHGYV 485
            YRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDCLHGYV
Sbjct: 304  YRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDCLHGYV 363

Query: 486  PQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQIQQLQM 545
            PQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQIQQLQM
Sbjct: 364  PQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQIQQLQM 423

Query: 546  FALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHINNGELLSSEGSYW 605
            FALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHI NGELLSSEGSYW
Sbjct: 424  FALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHIKNGELLSSEGSYW 483

Query: 606  LRDTGRNPFQALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHSHKNMM 665
            LRDTGRNPFQALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHSHKNMM
Sbjct: 484  LRDTGRNPFQALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHSHKNMM 543

Query: 666  DDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAALNQAS 725
            DDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAALNQAS
Sbjct: 544  DDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAALNQAS 603

Query: 726  VDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFGL 785
            VDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFGL
Sbjct: 604  VDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFGL 663

Query: 786  PHKLSCSSWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVTNRDSS 845
            PHKLSCSSWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVTNRDSS
Sbjct: 664  PHKLSCSSWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVTNRDSS 723

Query: 846  FSDNAEVPGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDSSTSISNELDGENN 905
            FSDNAEVPGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDS  S SNELDGENN
Sbjct: 724  FSDNAEVPGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDSPMSTSNELDGENN 783

Query: 906  NTNRDIGGQPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSPKQTFS 965
            NTNRDIGGQPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSPKQTFS
Sbjct: 784  NTNRDIGGQPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSPKQTFS 843

Query: 966  LFLLEKLSKRSYGFEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWVCHFHV 1025
            LFLLEKLSKRSYGFEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWVCHFHV
Sbjct: 844  LFLLEKLSKRSYGFEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWVCHFHV 903

Query: 1026 CKEILKKRRLKISSVIHSLNMRCDSVRQEAKINLPFLHISTQDCSLADSSREDGDTVCLT 1085
            CKEILKKRRLKISSVIHSLNMRCDSVRQEAKINLPFLHISTQDCSLADSSREDGDTVCLT
Sbjct: 904  CKEILKKRRLKISSVIHSLNMRCDSVRQEAKINLPFLHISTQDCSLADSSREDGDTVCLT 963

Query: 1086 VTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCKSHGEL 1145
            VTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCKSHGEL
Sbjct: 964  VTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCKSHGEL 1023

Query: 1146 YLRVTMSGEGTSRFWATLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYFLNSLE 1205
            YLRVTMSGEG SRFWATLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYFLNSLE
Sbjct: 1024 YLRVTMSGEGNSRFWATLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYFLNSLE 1083

Query: 1206 SATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFSSPGAS 1265
            SATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFSSPGAS
Sbjct: 1084 SATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFSSPGAS 1143

Query: 1266 FVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLLGSQGI 1325
            FVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLLGSQGI
Sbjct: 1144 FVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLLGSQGI 1203

Query: 1326 CEKPNVKIESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDIQKLSH 1385
            CEKPNVKIESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDIQKLSH
Sbjct: 1204 CEKPNVKIESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDIQKLSH 1263

Query: 1386 TLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNTRCFIL 1445
            TLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNTRCFIL
Sbjct: 1264 TLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNTRCFIL 1323

Query: 1446 VRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE 1487
            VRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE
Sbjct: 1324 VRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE 1364

BLAST of Carg21647 vs. ExPASy TrEMBL
Match: A0A6J1L1N1 (DNA-directed RNA polymerase OS=Cucurbita maxima OX=3661 GN=LOC111498320 PE=4 SV=1)

HSP 1 Score: 2723.7 bits (7059), Expect = 0.0e+00
Identity = 1346/1361 (98.90%), Postives = 1354/1361 (99.49%), Query Frame = 0

Query: 126  GKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKR 185
            G+VEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKR
Sbjct: 4    GQVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKR 63

Query: 186  VARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSATDSLF 245
            VARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSATDSLF
Sbjct: 64   VARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSATDSLF 123

Query: 246  LNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDCLKISK 305
            LNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDCLKISK
Sbjct: 124  LNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDCLKISK 183

Query: 306  LSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIG 365
            LSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIG
Sbjct: 184  LSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIG 243

Query: 366  IPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELSMGDTI 425
            IPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELSMGDTI
Sbjct: 244  IPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELSMGDTI 303

Query: 426  YRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDCLHGYV 485
            YRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDCLHGYV
Sbjct: 304  YRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDCLHGYV 363

Query: 486  PQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQIQQLQM 545
            PQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQIQQLQM
Sbjct: 364  PQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQIQQLQM 423

Query: 546  FALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHINNGELLSSEGSYW 605
            FALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHI NGELLSSEGSYW
Sbjct: 424  FALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHIKNGELLSSEGSYW 483

Query: 606  LRDTGRNPFQALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHSHKNMM 665
            LRDTGRNPFQALIEHCEG TLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHSHKNMM
Sbjct: 484  LRDTGRNPFQALIEHCEGMTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHSHKNMM 543

Query: 666  DDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAALNQAS 725
            DDIFCGLQEAEETCNLIQLMVDSHKD LTGDDEGNQHVLSIEVEHLSYEKQKSAALNQAS
Sbjct: 544  DDIFCGLQEAEETCNLIQLMVDSHKDALTGDDEGNQHVLSIEVEHLSYEKQKSAALNQAS 603

Query: 726  VDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFGL 785
            VDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFGL
Sbjct: 604  VDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFGL 663

Query: 786  PHKLSCSSWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVTNRDSS 845
            PHKLSCSSWNSQKMPRYI+KDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVTNRDSS
Sbjct: 664  PHKLSCSSWNSQKMPRYIRKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVTNRDSS 723

Query: 846  FSDNAEVPGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDSSTSISNELDGENN 905
            FSDNAEVPGTLTRKLTFLMRDIYNAYD TVRNAYGNQLVQFSYDTDS TSISNELDGENN
Sbjct: 724  FSDNAEVPGTLTRKLTFLMRDIYNAYDGTVRNAYGNQLVQFSYDTDSPTSISNELDGENN 783

Query: 906  NTNRDIGGQPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSPKQTFS 965
            NTNRDIGGQPVGSLAACA+SEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSPKQ FS
Sbjct: 784  NTNRDIGGQPVGSLAACAISEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSPKQIFS 843

Query: 966  LFLLEKLSKRSYGFEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWVCHFHV 1025
            LFLLEKLSKRSYG+EYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWVCHFHV
Sbjct: 844  LFLLEKLSKRSYGYEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWVCHFHV 903

Query: 1026 CKEILKKRRLKISSVIHSLNMRCDSVRQEAKINLPFLHISTQDCSLADSSREDGDTVCLT 1085
            CKEILKKRRLKISSVIHSLNMRCDS+RQEAKINLPFLHISTQDCSLADSSREDGDTVCLT
Sbjct: 904  CKEILKKRRLKISSVIHSLNMRCDSMRQEAKINLPFLHISTQDCSLADSSREDGDTVCLT 963

Query: 1086 VTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCKSHGEL 1145
            VTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCKSHGEL
Sbjct: 964  VTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCKSHGEL 1023

Query: 1146 YLRVTMSGEGTSRFWATLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYFLNSLE 1205
            YLRVTMSGEG SRFWATLMN+CLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYFLNSLE
Sbjct: 1024 YLRVTMSGEGNSRFWATLMNNCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYFLNSLE 1083

Query: 1206 SATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFSSPGAS 1265
            SATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFSSPGAS
Sbjct: 1084 SATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFSSPGAS 1143

Query: 1266 FVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLLGSQGI 1325
            FVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHEL+KPVDVYNLLGSQGI
Sbjct: 1144 FVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELSKPVDVYNLLGSQGI 1203

Query: 1326 CEKPNVKIESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDIQKLSH 1385
            CEKPNVK+ESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDIQKLSH
Sbjct: 1204 CEKPNVKMESLDKNTIYEKYSAVVHKNGGSTIKGLKKLDSVSKSILREFLTLNDIQKLSH 1263

Query: 1386 TLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNTRCFIL 1445
            TLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNTRCFIL
Sbjct: 1264 TLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNTRCFIL 1323

Query: 1446 VRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE 1487
            VRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE
Sbjct: 1324 VRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE 1364

BLAST of Carg21647 vs. ExPASy TrEMBL
Match: A0A0A0L2L4 (DNA-directed RNA polymerase OS=Cucumis sativus OX=3659 GN=Csa_3G039340 PE=4 SV=1)

HSP 1 Score: 2668.3 bits (6915), Expect = 0.0e+00
Identity = 1306/1487 (87.83%), Postives = 1391/1487 (93.54%), Query Frame = 0

Query: 1    MNFAGGFRSGQKAMNHMEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVS 60
            M+    FR GQK M HMEDEQD EL IPSG+L G+NFSVS QQD+ENIAVI ++AA EVS
Sbjct: 1    MSSVEDFRPGQKVMIHMEDEQDGELPIPSGLLTGINFSVSNQQDIENIAVITVDAANEVS 60

Query: 61   DPKLGLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSI 120
            DPKLGLPNPSYQCTTCGAS LK CEGHFG IKFPYTIIHPYFLSEVAQVLNKVCPGCKS+
Sbjct: 61   DPKLGLPNPSYQCTTCGASSLKFCEGHFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKSV 120

Query: 121  RRELWGKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSK 180
            R+ELWGKVEDPTSD++RPKGCRYCFGSLKDWYPPMRFKLSTTDMF+KSMIMVEVKENMSK
Sbjct: 121  RQELWGKVEDPTSDYNRPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSK 180

Query: 181  KYQKRVARGGLPPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSA 240
            KYQKRVA+GGLP DYW+FIPKDEQQEESYCRPNRK+LTHAQVHYLLKDIDPKFLKKFV A
Sbjct: 181  KYQKRVAKGGLPSDYWDFIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPA 240

Query: 241  TDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDC 300
             DSLFLNSFPVTPN HRVTEM HSFS+GQRL+FDERTRAYKK+VDFRGTANELGSRVLDC
Sbjct: 241  IDSLFLNSFPVTPNSHRVTEMAHSFSNGQRLIFDERTRAYKKVVDFRGTANELGSRVLDC 300

Query: 301  LKISKLSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIE 360
            LKISKLSPEKL++KDL+YQQKKIKDTATSS+GLRWIKDVVLGKRSDHCFRMVVVGDPNIE
Sbjct: 301  LKISKLSPEKLQNKDLVYQQKKIKDTATSSSGLRWIKDVVLGKRSDHCFRMVVVGDPNIE 360

Query: 361  LSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELS 420
            LSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYL LVEKGEI+VRREGRLVRVR+VLEL+
Sbjct: 361  LSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEKGEIYVRREGRLVRVRNVLELN 420

Query: 421  MGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDC 480
            MGDTIYRPLADGD+VLVNRPPSIHQHSLIALSV++LPVS+VLSLNPLCCSPFRGDFDGDC
Sbjct: 421  MGDTIYRPLADGDIVLVNRPPSIHQHSLIALSVKLLPVSAVLSLNPLCCSPFRGDFDGDC 480

Query: 481  LHGYVPQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQI 540
            LHGYVPQSLEARVE+RELV+LD+QL NGQSGRNLLSLSHDSLTAAHLI+EDGVSLNLFQ+
Sbjct: 481  LHGYVPQSLEARVEVRELVSLDKQLTNGQSGRNLLSLSHDSLTAAHLILEDGVSLNLFQM 540

Query: 541  QQLQMFALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHINNGELLSS 600
            QQLQM  LHQLLPPAIVK+P  R+CAWTGKQLFSI LPPDFDYSSPSH V I  GEL+SS
Sbjct: 541  QQLQMLTLHQLLPPAIVKSPLLRNCAWTGKQLFSILLPPDFDYSSPSHNVFIEKGELISS 600

Query: 601  EGSYWLRDTGRNPFQALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHS 660
            EGSYWLRD+GRN FQALIEHCEG+TL+YL  AQ VLCEWLS RGLSVSLSDLYLSVDS+S
Sbjct: 601  EGSYWLRDSGRNLFQALIEHCEGKTLDYLRDAQGVLCEWLSTRGLSVSLSDLYLSVDSYS 660

Query: 661  HKNMMDDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAA 720
            H+NMMDDIFCGLQEAEETCNL QLMVDSHK++L G+DE NQH+LSI VE L YEKQKSAA
Sbjct: 661  HENMMDDIFCGLQEAEETCNLKQLMVDSHKEILIGNDEDNQHLLSIAVERLIYEKQKSAA 720

Query: 721  LNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVT 780
            LNQASVDAFK+VFR+IQNLVYKYSGKDNSLLTMFKAGSKGNL+KLVQHSMCLGLQHSLVT
Sbjct: 721  LNQASVDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLMKLVQHSMCLGLQHSLVT 780

Query: 781  LSFGLPHKLSCSSWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVT 840
            LSF LPHKLSC++WNSQKMPRYIQKDGL DRTQSFIPYAVVENSFLSGLNPFECFAHSVT
Sbjct: 781  LSFSLPHKLSCAAWNSQKMPRYIQKDGLPDRTQSFIPYAVVENSFLSGLNPFECFAHSVT 840

Query: 841  NRDSSFSDNAEVPGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDSSTSISNEL 900
            NRDSSFSDNAEVPGTLTRKLTFLMRDIY AYD TVRNAYGNQLVQF YD D  TS   E 
Sbjct: 841  NRDSSFSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFCYDIDRPTS---ES 900

Query: 901  DGENNNTNRDIGGQPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSP 960
            + ENNN +R IGG PVGSLAACA+SEAAYSALDQPISLLE SPLLNLK+VLECGSKRNS 
Sbjct: 901  ESENNNRDRGIGGHPVGSLAACAISEAAYSALDQPISLLEASPLLNLKRVLECGSKRNST 960

Query: 961  KQTFSLFLLEKLSKRSYGFEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWV 1020
            KQTFSLFL EKLSKRSYGFEYGALGVKNHLERV+FKDIVSSVMIIF+P PSRK+HFSPWV
Sbjct: 961  KQTFSLFLSEKLSKRSYGFEYGALGVKNHLERVMFKDIVSSVMIIFSPLPSRKKHFSPWV 1020

Query: 1021 CHFHVCKEILKKRRLKISSVIHSLNMRCDSVRQEAKINLPFLHISTQDCSLADSSREDGD 1080
            CHFHVCKEILKKRRLK++SVIHSLNMRCDS+RQE ++NLP L I TQDC LADS  EDGD
Sbjct: 1021 CHFHVCKEILKKRRLKMNSVIHSLNMRCDSMRQEGRMNLPSLQIITQDCPLADSLTEDGD 1080

Query: 1081 TVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCK 1140
            TVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGF EID+VDI+WNDRPKVPKP C 
Sbjct: 1081 TVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFTEIDRVDITWNDRPKVPKPRC- 1140

Query: 1141 SHGELYLRVTMSGEGTSRFWATLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYF 1200
            SHGELYLRVTMSGEG SRFWATLMN+CLPIMDLIDW+RSHPDN HS C+AYGIDSG  YF
Sbjct: 1141 SHGELYLRVTMSGEGNSRFWATLMNNCLPIMDLIDWTRSHPDNTHSLCLAYGIDSGWKYF 1200

Query: 1201 LNSLESATLDIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFS 1260
            LNSLESATLD+GKTIR EHLLLV+N+LSATGEFVGLNVKG++ QREHALVKTPFMQACFS
Sbjct: 1201 LNSLESATLDVGKTIRLEHLLLVSNSLSATGEFVGLNVKGLTHQREHALVKTPFMQACFS 1260

Query: 1261 SPGASFVKAAKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLL 1320
            SPGA  +KAAKAGIKD+LSGSLDALAWG++PS+GTGGQFDILYSGKGHELNKPVDVYNLL
Sbjct: 1261 SPGACMIKAAKAGIKDNLSGSLDALAWGRMPSLGTGGQFDILYSGKGHELNKPVDVYNLL 1320

Query: 1321 GSQGICEKPNVKIESLDKNTIYEKYSA-VVHKNGGSTIKGLKKLDSVSKSILREFLTLND 1380
            G Q  CEK N KIESLDKNTI EKYSA ++ KNGGSTIKGLK+LDSVSKSILR+FLTLND
Sbjct: 1321 GGQSTCEKQNTKIESLDKNTISEKYSAQLMLKNGGSTIKGLKRLDSVSKSILRKFLTLND 1380

Query: 1381 IQKLSHTLRSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSN 1440
            IQKLS  LR+IL KYSLNERLNEVDKSTLMMALYFHP RDEKIGVGAQDIKVGSHSKY N
Sbjct: 1381 IQKLSFALRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIKVGSHSKYQN 1440

Query: 1441 TRCFILVRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQDKFE 1487
            TRCF+L+RSDGTTEDFSYHKCVLGALEIIAPHRVK YQSKWMQ+KFE
Sbjct: 1441 TRCFVLIRSDGTTEDFSYHKCVLGALEIIAPHRVKGYQSKWMQEKFE 1483

BLAST of Carg21647 vs. TAIR 10
Match: AT1G63020.1 (nuclear RNA polymerase D1A )

HSP 1 Score: 1482.6 bits (3837), Expect = 0.0e+00
Identity = 781/1476 (52.91%), Postives = 1033/1476 (69.99%), Query Frame = 0

Query: 17   MEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVSDPKLGLPNPSYQCTTC 76
            MED+ + ELQ+P G L  + FS+S   D + ++V+ +EA  +V+D +LGLPNP   C TC
Sbjct: 1    MEDDCE-ELQVPVGTLTSIGFSISNNNDRDKMSVLEVEAPNQVTDSRLGLPNPDSVCRTC 60

Query: 77   GASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRRELWGKVEDPTSDFH 136
            G+   K CEGHFG I F Y+II+PYFL EVA +LNK+CPGCK IR++ +   ED      
Sbjct: 61   GSKDRKVCEGHFGVINFAYSIINPYFLKEVAALLNKICPGCKYIRKKQFQITED------ 120

Query: 137  RPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKRVARGGLPPDYW 196
            +P+ CRYC  +L   YP M+F+++T ++FR+S I+VEV E    K +KR     LPPDYW
Sbjct: 121  QPERCRYC--TLNTGYPLMKFRVTTKEVFRRSGIVVEVNEESLMKLKKRGVL-TLPPDYW 180

Query: 197  NFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSATDSLFLNSFPVTPNCH 256
            +F+P+D   +ES  +P R+++THAQV+ LL  ID + +KK +   +SL L SFPVTPN +
Sbjct: 181  SFLPQDSNIDESCLKPTRRIITHAQVYALLLGIDQRLIKKDIPMFNSLGLTSFPVTPNGY 240

Query: 257  RVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDCLKISKLSPEKL-ESKD 316
            RVTE+ H F +G RL+FDERTR YKKLV F G   EL SRV++C++ S+L  E +  SKD
Sbjct: 241  RVTEIVHQF-NGARLIFDERTRIYKKLVGFEGNTLELSSRVMECMQYSRLFSETVSSSKD 300

Query: 317  LIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQ 376
                 +K  DT     GLR++KDV+LGKRSDH FR VVVGDP+++L+EIGIP  +A+RLQ
Sbjct: 301  SANPYQKKSDTPKLC-GLRFMKDVLLGKRSDHTFRTVVVGDPSLKLNEIGIPESIAKRLQ 360

Query: 377  ISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELSMGDTIYRPLADGDVV 436
            +SEHL+  N ++L TS    L++  E+ VRR  RLV ++ V +L  GD I+R L DGD V
Sbjct: 361  VSEHLNQCNKERLVTSFVPTLLDNKEMHVRRGDRLVAIQ-VNDLQTGDKIFRSLMDGDTV 420

Query: 437  LVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVEL 496
            L+NRPPSIHQHSLIA++VR+LP +SV+SLNP+CC PFRGDFDGDCLHGYVPQS++A+VEL
Sbjct: 421  LMNRPPSIHQHSLIAMTVRILPTTSVVSLNPICCLPFRGDFDGDCLHGYVPQSIQAKVEL 480

Query: 497  RELVALDRQLVNGQSGRNLLSLSHDSLTAAHLI-MEDGVSLNLFQIQQLQMFALHQLLPP 556
             ELVALD+QL+N Q+GRNLLSL  DSLTAA+L+ +E    LN  Q+QQLQM+   QL PP
Sbjct: 481  DELVALDKQLINRQNGRNLLSLGQDSLTAAYLVNVEKNCYLNRAQMQQLQMYCPFQLPPP 540

Query: 557  AIVKA-PSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHINNGELLS-SEGSYWLRDTGRN 616
            AI+KA PS     WTG QLF +  PP FDY+ P + V ++NGELLS SEGS WLRD   N
Sbjct: 541  AIIKASPSSTEPQWTGMQLFGMLFPPGFDYTYPLNNVVVSNGELLSFSEGSAWLRDGEGN 600

Query: 617  PFQALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHSHKNMMDDIFCGL 676
              + L++H +G+ L+ ++ AQ +L +WL MRGLSVSL+DLYLS D  S KN+ ++I  GL
Sbjct: 601  FIERLLKHDKGKVLDIIYSAQEMLSQWLLMRGLSVSLADLYLSSDLQSRKNLTEEISYGL 660

Query: 677  QEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAALNQASVDAFKRV 736
            +EAE+ CN  QLMV+S +D L  + E  +     ++    YE+QKSA L++ +V AFK  
Sbjct: 661  REAEQVCNKQQLMVESWRDFLAVNGEDKEEDSVSDLARFCYERQKSATLSELAVSAFKDA 720

Query: 737  FREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFGLPHKLSCS 796
            +R++Q L Y+Y  + NS L M KAGSKGN+ KLVQHSMC+GLQ+S V+LSFG P +L+C+
Sbjct: 721  YRDVQALAYRYGDQSNSFLIMSKAGSKGNIGKLVQHSMCIGLQNSAVSLSFGFPRELTCA 780

Query: 797  SWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVTNRDSSFSDNAEV 856
            +WN    P    K   +  T+S++PY V+ENSFL+GLNP E F HSVT+RDSSFS NA++
Sbjct: 781  AWNDPNSPLRGAKGKDSTTTESYVPYGVIENSFLTGLNPLESFVHSVTSRDSSFSGNADL 840

Query: 857  PGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDSSTSISNELDGENNNTNRDIG 916
            PGTL+R+L F MRDIY AYD TVRN++GNQLVQF+Y+TD                  DI 
Sbjct: 841  PGTLSRRLMFFMRDIYAAYDGTVRNSFGNQLVQFTYETDGPV--------------EDIT 900

Query: 917  GQPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSPKQTFSLFLLEKL 976
            G+ +GSL+ACALSEAAYSALDQPISLLETSPLLNLK VLECGSK+   +QT SL+L E L
Sbjct: 901  GEALGSLSACALSEAAYSALDQPISLLETSPLLNLKNVLECGSKKGQREQTMSLYLSEYL 960

Query: 977  SKRSYGFEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWVCHFHVCKEILKK 1036
            SK+ +GFEYG+L +KNHLE++ F +IVS+ MIIF+P  + K   SPWVCHFH+ +++LK+
Sbjct: 961  SKKKHGFEYGSLEIKNHLEKLSFSEIVSTSMIIFSPSSNTKVPLSPWVCHFHISEKVLKR 1020

Query: 1037 RRLKISSVIHSLNMRCDSVRQEAKINLPFLHI-STQDCSLADSSREDGDTVCLTVTIAEN 1096
            ++L   SV+ SLN +  S  +E K+++  L I +T  CS  D + +D D VC+TVT+ E 
Sbjct: 1021 KQLSAESVVSSLNEQYKSRNRELKLDIVDLDIQNTNHCSSDDQAMKD-DNVCITVTVVEA 1080

Query: 1097 TKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCKS-HGELYLRVT 1156
            +K+S L+LD I+ +LI FLL + ++G   I KV+I W DRPK PK +     GELYL+VT
Sbjct: 1081 SKHSVLELDAIRLVLIPFLLDSPVKGDQGIKKVNILWTDRPKAPKRNGNHLAGELYLKVT 1140

Query: 1157 MSGE-GTSRFWATLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYFLNSLESATL 1216
            M G+ G    W  L+  CLPIMD+IDW RSHPDNI   C  YGID+GR+ F+ +LESA  
Sbjct: 1141 MYGDRGKRNCWTALLETCLPIMDMIDWGRSHPDNIRQCCSVYGIDAGRSIFVANLESAVS 1200

Query: 1217 DIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFSSPGASFVKA 1276
            D GK I  EHLLLVA++LS TGEFV LN KG S+QR+      PF QACFSSP   F+KA
Sbjct: 1201 DTGKEILREHLLLVADSLSVTGEFVALNAKGWSKQRQVESTPAPFTQACFSSPSQCFLKA 1260

Query: 1277 AKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLLGSQGICEKP 1336
            AK G++D L GS+DALAWGK+P  GTG QF+I+ S K H    PVDVY+LL S     + 
Sbjct: 1261 AKEGVRDDLQGSIDALAWGKVPGFGTGDQFEIIISPKVHGFTTPVDVYDLLSSTKTMRRT 1320

Query: 1337 NVKIESLDKNTIYEKYSAVVHKNGGSTIKGLKKLD--SVSKSILREFLTLNDIQKLSHTL 1396
            N   +S DK T+  +   ++H    + +K +K LD   +  S+LR   T  +I+ LS +L
Sbjct: 1321 NSAPKS-DKATV--QPFGLLH---SAFLKDIKVLDGKGIPMSLLRTIFTWKNIELLSQSL 1380

Query: 1397 RSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNTRCFILVR 1456
            + IL  Y +NE LNE D+  + M L  HP   EKIG G + I+V + SK+ ++ CF +VR
Sbjct: 1381 KRILHSYEINELLNERDEGLVKMVLQLHPNSVEKIGPGVKGIRV-AKSKHGDSCCFEVVR 1440

Query: 1457 SDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQD 1484
             DGT EDFSYHKCVLGA +IIAP ++  Y+SK++++
Sbjct: 1441 IDGTFEDFSYHKCVLGATKIIAPKKMNFYKSKYLKN 1441

BLAST of Carg21647 vs. TAIR 10
Match: AT1G63020.2 (nuclear RNA polymerase D1A )

HSP 1 Score: 1482.6 bits (3837), Expect = 0.0e+00
Identity = 781/1476 (52.91%), Postives = 1033/1476 (69.99%), Query Frame = 0

Query: 17   MEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVSDPKLGLPNPSYQCTTC 76
            MED+ + ELQ+P G L  + FS+S   D + ++V+ +EA  +V+D +LGLPNP   C TC
Sbjct: 1    MEDDCE-ELQVPVGTLTSIGFSISNNNDRDKMSVLEVEAPNQVTDSRLGLPNPDSVCRTC 60

Query: 77   GASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRRELWGKVEDPTSDFH 136
            G+   K CEGHFG I F Y+II+PYFL EVA +LNK+CPGCK IR++ +   ED      
Sbjct: 61   GSKDRKVCEGHFGVINFAYSIINPYFLKEVAALLNKICPGCKYIRKKQFQITED------ 120

Query: 137  RPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKRVARGGLPPDYW 196
            +P+ CRYC  +L   YP M+F+++T ++FR+S I+VEV E    K +KR     LPPDYW
Sbjct: 121  QPERCRYC--TLNTGYPLMKFRVTTKEVFRRSGIVVEVNEESLMKLKKRGVL-TLPPDYW 180

Query: 197  NFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSATDSLFLNSFPVTPNCH 256
            +F+P+D   +ES  +P R+++THAQV+ LL  ID + +KK +   +SL L SFPVTPN +
Sbjct: 181  SFLPQDSNIDESCLKPTRRIITHAQVYALLLGIDQRLIKKDIPMFNSLGLTSFPVTPNGY 240

Query: 257  RVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDCLKISKLSPEKL-ESKD 316
            RVTE+ H F +G RL+FDERTR YKKLV F G   EL SRV++C++ S+L  E +  SKD
Sbjct: 241  RVTEIVHQF-NGARLIFDERTRIYKKLVGFEGNTLELSSRVMECMQYSRLFSETVSSSKD 300

Query: 317  LIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQ 376
                 +K  DT     GLR++KDV+LGKRSDH FR VVVGDP+++L+EIGIP  +A+RLQ
Sbjct: 301  SANPYQKKSDTPKLC-GLRFMKDVLLGKRSDHTFRTVVVGDPSLKLNEIGIPESIAKRLQ 360

Query: 377  ISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELSMGDTIYRPLADGDVV 436
            +SEHL+  N ++L TS    L++  E+ VRR  RLV ++ V +L  GD I+R L DGD V
Sbjct: 361  VSEHLNQCNKERLVTSFVPTLLDNKEMHVRRGDRLVAIQ-VNDLQTGDKIFRSLMDGDTV 420

Query: 437  LVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVEL 496
            L+NRPPSIHQHSLIA++VR+LP +SV+SLNP+CC PFRGDFDGDCLHGYVPQS++A+VEL
Sbjct: 421  LMNRPPSIHQHSLIAMTVRILPTTSVVSLNPICCLPFRGDFDGDCLHGYVPQSIQAKVEL 480

Query: 497  RELVALDRQLVNGQSGRNLLSLSHDSLTAAHLI-MEDGVSLNLFQIQQLQMFALHQLLPP 556
             ELVALD+QL+N Q+GRNLLSL  DSLTAA+L+ +E    LN  Q+QQLQM+   QL PP
Sbjct: 481  DELVALDKQLINRQNGRNLLSLGQDSLTAAYLVNVEKNCYLNRAQMQQLQMYCPFQLPPP 540

Query: 557  AIVKA-PSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHINNGELLS-SEGSYWLRDTGRN 616
            AI+KA PS     WTG QLF +  PP FDY+ P + V ++NGELLS SEGS WLRD   N
Sbjct: 541  AIIKASPSSTEPQWTGMQLFGMLFPPGFDYTYPLNNVVVSNGELLSFSEGSAWLRDGEGN 600

Query: 617  PFQALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHSHKNMMDDIFCGL 676
              + L++H +G+ L+ ++ AQ +L +WL MRGLSVSL+DLYLS D  S KN+ ++I  GL
Sbjct: 601  FIERLLKHDKGKVLDIIYSAQEMLSQWLLMRGLSVSLADLYLSSDLQSRKNLTEEISYGL 660

Query: 677  QEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAALNQASVDAFKRV 736
            +EAE+ CN  QLMV+S +D L  + E  +     ++    YE+QKSA L++ +V AFK  
Sbjct: 661  REAEQVCNKQQLMVESWRDFLAVNGEDKEEDSVSDLARFCYERQKSATLSELAVSAFKDA 720

Query: 737  FREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFGLPHKLSCS 796
            +R++Q L Y+Y  + NS L M KAGSKGN+ KLVQHSMC+GLQ+S V+LSFG P +L+C+
Sbjct: 721  YRDVQALAYRYGDQSNSFLIMSKAGSKGNIGKLVQHSMCIGLQNSAVSLSFGFPRELTCA 780

Query: 797  SWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVTNRDSSFSDNAEV 856
            +WN    P    K   +  T+S++PY V+ENSFL+GLNP E F HSVT+RDSSFS NA++
Sbjct: 781  AWNDPNSPLRGAKGKDSTTTESYVPYGVIENSFLTGLNPLESFVHSVTSRDSSFSGNADL 840

Query: 857  PGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDSSTSISNELDGENNNTNRDIG 916
            PGTL+R+L F MRDIY AYD TVRN++GNQLVQF+Y+TD                  DI 
Sbjct: 841  PGTLSRRLMFFMRDIYAAYDGTVRNSFGNQLVQFTYETDGPV--------------EDIT 900

Query: 917  GQPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSPKQTFSLFLLEKL 976
            G+ +GSL+ACALSEAAYSALDQPISLLETSPLLNLK VLECGSK+   +QT SL+L E L
Sbjct: 901  GEALGSLSACALSEAAYSALDQPISLLETSPLLNLKNVLECGSKKGQREQTMSLYLSEYL 960

Query: 977  SKRSYGFEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWVCHFHVCKEILKK 1036
            SK+ +GFEYG+L +KNHLE++ F +IVS+ MIIF+P  + K   SPWVCHFH+ +++LK+
Sbjct: 961  SKKKHGFEYGSLEIKNHLEKLSFSEIVSTSMIIFSPSSNTKVPLSPWVCHFHISEKVLKR 1020

Query: 1037 RRLKISSVIHSLNMRCDSVRQEAKINLPFLHI-STQDCSLADSSREDGDTVCLTVTIAEN 1096
            ++L   SV+ SLN +  S  +E K+++  L I +T  CS  D + +D D VC+TVT+ E 
Sbjct: 1021 KQLSAESVVSSLNEQYKSRNRELKLDIVDLDIQNTNHCSSDDQAMKD-DNVCITVTVVEA 1080

Query: 1097 TKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCKS-HGELYLRVT 1156
            +K+S L+LD I+ +LI FLL + ++G   I KV+I W DRPK PK +     GELYL+VT
Sbjct: 1081 SKHSVLELDAIRLVLIPFLLDSPVKGDQGIKKVNILWTDRPKAPKRNGNHLAGELYLKVT 1140

Query: 1157 MSGE-GTSRFWATLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYFLNSLESATL 1216
            M G+ G    W  L+  CLPIMD+IDW RSHPDNI   C  YGID+GR+ F+ +LESA  
Sbjct: 1141 MYGDRGKRNCWTALLETCLPIMDMIDWGRSHPDNIRQCCSVYGIDAGRSIFVANLESAVS 1200

Query: 1217 DIGKTIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFSSPGASFVKA 1276
            D GK I  EHLLLVA++LS TGEFV LN KG S+QR+      PF QACFSSP   F+KA
Sbjct: 1201 DTGKEILREHLLLVADSLSVTGEFVALNAKGWSKQRQVESTPAPFTQACFSSPSQCFLKA 1260

Query: 1277 AKAGIKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLLGSQGICEKP 1336
            AK G++D L GS+DALAWGK+P  GTG QF+I+ S K H    PVDVY+LL S     + 
Sbjct: 1261 AKEGVRDDLQGSIDALAWGKVPGFGTGDQFEIIISPKVHGFTTPVDVYDLLSSTKTMRRT 1320

Query: 1337 NVKIESLDKNTIYEKYSAVVHKNGGSTIKGLKKLD--SVSKSILREFLTLNDIQKLSHTL 1396
            N   +S DK T+  +   ++H    + +K +K LD   +  S+LR   T  +I+ LS +L
Sbjct: 1321 NSAPKS-DKATV--QPFGLLH---SAFLKDIKVLDGKGIPMSLLRTIFTWKNIELLSQSL 1380

Query: 1397 RSILRKYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDIKVGSHSKYSNTRCFILVR 1456
            + IL  Y +NE LNE D+  + M L  HP   EKIG G + I+V + SK+ ++ CF +VR
Sbjct: 1381 KRILHSYEINELLNERDEGLVKMVLQLHPNSVEKIGPGVKGIRV-AKSKHGDSCCFEVVR 1440

Query: 1457 SDGTTEDFSYHKCVLGALEIIAPHRVKAYQSKWMQD 1484
             DGT EDFSYHKCVLGA +IIAP ++  Y+SK++++
Sbjct: 1441 IDGTFEDFSYHKCVLGATKIIAPKKMNFYKSKYLKN 1441

BLAST of Carg21647 vs. TAIR 10
Match: AT2G40030.1 (nuclear RNA polymerase D1B )

HSP 1 Score: 399.4 bits (1025), Expect = 1.2e-110
Identity = 368/1361 (27.04%), Postives = 610/1361 (44.82%), Query Frame = 0

Query: 20   EQDSELQIPSGVLVGVNFSVSTQQD--MENIAVINIEAACEVSDPKLGLPNPSYQCTTCG 79
            E++S  +I  G +VG+ F++++  +  +++I+   I    ++++  LGLP    +C +CG
Sbjct: 2    EEESTSEILDGEIVGITFALASHHEICIQSISESAINHPSQLTNAFLGLPLEFGKCESCG 61

Query: 80   ASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRRELWGKVEDPTSDFHR 139
            A+    CEGHFG I+ P  I HP  ++E+ Q+L+ +C  C  I+               +
Sbjct: 62   ATEPDKCEGHFGYIQLPVPIYHPAHVNELKQMLSLLCLKCLKIK---------------K 121

Query: 140  PKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKY--QKRVARGGLPPDY 199
             KG     G L D       +L        S I ++ + +    Y   K  +R  L P  
Sbjct: 122  AKGTS---GGLAD-------RLLGVCCEEASQISIKDRASDGASYLELKLPSRSRLQPGC 181

Query: 200  WNFIPK-DEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVS----ATDSLFLNSFP 259
            WNF+ +   +    Y RP    L   +V  +L+ I  +  KK  +      +   L   P
Sbjct: 182  WNFLERYGYRYGSDYTRP----LLAREVKEILRRIPEESRKKLTAKGHIPQEGYILEYLP 241

Query: 260  VTPNCHRVTEMTHSFSSG-------------QRLVFDERTRAYK-KLVDFRGTANELGSR 319
            V PNC  V E +  FS+              ++++  + +R+ +      +  A+E+  R
Sbjct: 242  VPPNCLSVPEASDGFSTMSVDPSRIELKDVLKKVIAIKSSRSGETNFESHKAEASEM-FR 301

Query: 320  VLDCLKISKLSPEKLESKDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGD 379
            V+D     + + +   + D+ Y   KI D+++S      ++ + + K S    R V+ GD
Sbjct: 302  VVDTYLQVRGTAKAARNIDMRYGVSKISDSSSSKAWTEKMRTLFIRKGSGFSSRSVITGD 361

Query: 380  PNIELSEIGIPCHVAERLQISEHLSSWN----MKKLSTSCYLRLVEKGEIFVRREGRLVR 439
                ++E+GIP  +A+R+   E +S  N     K +     L   +    +  R+G    
Sbjct: 362  AYRHVNEVGIPIEIAQRITFEERVSVHNRGYLQKLVDDKLCLSYTQGSTTYSLRDGS--- 421

Query: 440  VRHVLELSMGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPF 499
             +   EL  G  ++R + DGDVV +NRPP+ H+HSL AL V V   ++V  +NPL CSP 
Sbjct: 422  -KGHTELKPGQVVHRRVMDGDVVFINRPPTTHKHSLQALRVYVHEDNTV-KINPLMCSPL 481

Query: 500  RGDFDGDCLHGYVPQSLEARVELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDG 559
              DFDGDC+H + PQSL A+ E+ EL ++++QL++  +G+ +L +  DSL +  +++E  
Sbjct: 482  SADFDGDCVHLFYPQSLSAKAEVMELFSVEKQLLSSHTGQLILQMGSDSLLSLRVMLE-R 541

Query: 560  VSLNLFQIQQLQMFALHQLLPPAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHI 619
            V L+    QQL M+    L PPA+ K+ S    AWT  Q+  +  P     S    R  +
Sbjct: 542  VFLDKATAQQLAMYGSLSLPPPALRKS-SKSGPAWTVFQILQLAFPERL--SCKGDRFLV 601

Query: 620  NNGELLSSEGSYWLRDTGRNPF--QALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLS 679
            +  +LL  +       +  N       +E     TL +    Q +L E L   G S+SL 
Sbjct: 602  DGSDLLKFDFGVDAMGSIINEIVTSIFLEKGPKETLGFFDSLQPLLMESLFAEGFSLSLE 661

Query: 680  DLYLS-VDSHSHKNMMDDIFCGLQEAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVE 739
            DL +S  D     N++      ++E     + ++L   S++D              +++E
Sbjct: 662  DLSMSRADMDVIHNLI------IREISPMVSRLRL---SYRD-------------ELQLE 721

Query: 740  HLSYEKQKSAALNQASVDAFKRVFREIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHS 799
            + S  K K  A N      F      I+NL+     K NS +T           KLVQ +
Sbjct: 722  N-SIHKVKEVAAN------FMLKSYSIRNLI---DIKSNSAIT-----------KLVQQT 781

Query: 800  MCLGLQHSLVTLSFGLPHKLSCSSWNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGL 859
              LGLQ S          K   +    + M  + ++     R  S   + +V+  F  GL
Sbjct: 782  GFLGLQLS--------DKKKFYTKTLVEDMAIFCKRK--YGRISSSGDFGIVKGCFFHGL 841

Query: 860  NPFECFAHSVTNRD--SSFSDNAEVPGTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFS 919
            +P+E  AHS+  R+     S     PGTL + L  ++RDI    D TVRN   N ++QF 
Sbjct: 842  DPYEEMAHSIAAREVIVRSSRGLAEPGTLFKNLMAVLRDIVITNDGTVRNTCSNSVIQFK 901

Query: 920  YDTDSSTSISNELDGENNNTNRDIGGQPVGSLAACALSEAAYSALDQPISLLETSPLLN- 979
            Y  DS          E  +      G+PVG LAA A+S  AY A      +L++SP  N 
Sbjct: 902  YGVDS----------ERGHQGLFEAGEPVGVLAATAMSNPAYKA------VLDSSPNSNS 961

Query: 980  ----LKKVLEC--GSKRNSPKQTFSLFLLEKLSKRSYGFEYGALGVKNHLERVIFKDIVS 1039
                +K+VL C    +  +  +   L+L E    + +  E  A  V+N L +V  KD   
Sbjct: 962  SWELMKEVLLCKVNFQNTTNDRRVILYLNECHCGKRFCQENAACTVRNKLNKVSLKDTAV 1021

Query: 1040 SVMIIFAPEPSRKRHFSPWVC---HFHVCKEILKKRRLKISSVIHSLNMRCDSV------ 1099
              ++ +  +P+    F    C   H H+ K +L+   + +  +    + +C+ V      
Sbjct: 1022 EFLVEYRKQPTISEIFGIDSCLHGHIHLNKTLLQDWNISMQDI----HQKCEDVINSLGQ 1081

Query: 1100 RQEAKINLPFLHIS---TQDCSLADSSREDG-DTVCLTVTIAENTKNSFLQLDFIQDLLI 1159
            +++ K    F   S   ++ CS  D     G D  CLT +      +    LD + + + 
Sbjct: 1082 KKKKKATDDFKRTSLSVSECCSFRDPCGSKGSDMPCLTFSYNATDPDLERTLDVLCNTVY 1141

Query: 1160 HFLLGTVIRGFAEIDKVDISWNDRPK---VPKPHCKSHGELYLRVTMSGEGTSR---FWA 1219
              LL  VI+G + I   +I WN       +   H    GE  L VT+      +    W 
Sbjct: 1142 PVLLEIVIKGDSRICSANIIWNSSDMTTWIRNRHASRRGEWVLDVTVEKSAVKQSGDAWR 1201

Query: 1220 TLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYFLNSLESATLDIGKTIRHEHLL 1279
             +++ CL ++ LID  RS P ++       G+       +  L ++   + K +  EH++
Sbjct: 1202 VVIDSCLSVLHLIDTKRSIPYSVKQVQELLGLSCAFEQAVQRLSASVRMVSKGVLKEHII 1250

Query: 1280 LVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFSSPGASFVKAAKAGIKDSLSGS 1321
            L+AN ++ +G  +G N  G         +K PF +A   +P   F KAA+    DSLS  
Sbjct: 1262 LLANNMTCSGTMLGFNSGGYKALTRSLNIKAPFTEATLIAPRKCFEKAAEKCHTDSLSTV 1250


HSP 2 Score: 68.6 bits (166), Expect = 5.0e-11
Identity = 41/123 (33.33%), Postives = 68/123 (55.28%), Query Frame = 0

Query: 1361 KKLDSVSKSILREFLTLNDIQKLSHTLRSILR--KYSLNERLNEVDKS-TLMMALYFHPQ 1420
            ++LDS +     E   L+D++ +  TLR I+    Y   + +++ DK+  L   L FHPQ
Sbjct: 1727 QRLDSFTS---EEQELLSDVEPVMRTLRKIMHPSAYPDGDPISDDDKTFVLEKILNFHPQ 1786

Query: 1421 RDEKIGVGAQDIKVGSHSKYSNTRCFILVRSDGTTEDFSYHKCVLGALEIIAPHRVKAYQ 1480
            ++ K+G G   I V  H+ +S++RCF +V +DG  +DFSY K +   L    P R + + 
Sbjct: 1787 KETKLGSGVDFITVDKHTIFSDSRCFFVVSTDGAKQDFSYRKSLNNYLMKKYPDRAEEFI 1846

BLAST of Carg21647 vs. TAIR 10
Match: AT4G35800.1 (RNA polymerase II large subunit )

HSP 1 Score: 184.1 bits (466), Expect = 8.1e-46
Identity = 224/952 (23.53%), Postives = 397/952 (41.70%), Query Frame = 0

Query: 35  VNFSVSTQQDMENIAVINIE----------AACEVSDPKLGLPNPSYQCTTCGASVLKCC 94
           V F + +  ++  ++VI++E              +SD +LG  +   +C TC A++ + C
Sbjct: 18  VQFGILSPDEIRQMSVIHVEHSETTEKGKPKVGGLSDTRLGTIDRKVKCETCMANMAE-C 77

Query: 95  EGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSI-------RRELWGKVEDPTSDFHR 154
            GHFG ++    + H  F+  V  ++  VC  C  I       + +   K+++P +   +
Sbjct: 78  PGHFGYLELAKPMYHVGFMKTVLSIMRCVCFNCSKILADEEEHKFKQAMKIKNPKNRLKK 137

Query: 155 ----PKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKRVARGGLPP 214
                K    C G   D    ++   ST +  +KS      +     +  K    G    
Sbjct: 138 ILDACKNKTKCDGG--DDIDDVQ-SHSTDEPVKKS------RGGCGAQQPKLTIEGMKMI 197

Query: 215 DYWNFIPKDEQQEESYCRP--NRKVLTHAQVHYLLKDI----------DPKFLKKFVSAT 274
             +    K   + +    P   ++ L   +V  +LK I          +PKF +      
Sbjct: 198 AEYKIQRKKNDEPDQLPEPAERKQTLGADRVLSVLKRISDADCQLLGFNPKFAR-----P 257

Query: 275 DSLFLNSFPVTPNCHRVTEMTHSFSSGQ-----RLVFDERTRAYKKLVDFRGTANELGSR 334
           D + L   P+ P   R + M  + S  +     +L    R     K  +  G    + S 
Sbjct: 258 DWMILEVLPIPPPPVRPSVMMDATSRSEDDLTHQLAMIIRHNENLKRQEKNGAPAHIISE 317

Query: 335 VLDCLK--ISKLSPEKLESKDLIYQQ-----KKIKDTATSSNGLRWIKDVVLGKRSDHCF 394
               L+  I+     +L  +    Q+     K I     +  G   I+  ++GKR D   
Sbjct: 318 FTQLLQFHIATYFDNELPGQPRATQKSGRPIKSICSRLKAKEGR--IRGNLMGKRVDFSA 377

Query: 395 RMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLRLVEKG--------- 454
           R V+  DP I + E+G+P  +A  L   E ++ +N+++L       LV+ G         
Sbjct: 378 RTVITPDPTINIDELGVPWSIALNLTYPETVTPYNIERLK-----ELVDYGPHPPPGKTG 437

Query: 455 -EIFVRREGRLVRVRHVLE-----LSMGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVR 514
            +  +R +G+ + +R++ +     L +G  + R L DGD VL NR PS+H+ S++   +R
Sbjct: 438 AKYIIRDDGQRLDLRYLKKSSDQHLELGYKVERHLQDGDFVLFNRQPSLHKMSIMGHRIR 497

Query: 515 VLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVELRELVALDRQLVNGQSGRNL 574
           ++P S+   LN    SP+  DFDGD ++ +VPQS E R E+ EL+ + + +V+ Q+ R +
Sbjct: 498 IMPYST-FRLNLSVTSPYNADFDGDEMNMHVPQSFETRAEVLELMMVPKCIVSPQANRPV 557

Query: 575 LSLSHDSLTAAHLI--MEDGVSLNLFQIQQLQMFALHQLLP-PAIVKAPSFRSCAWTGKQ 634
           + +  D+L     I   +  +  ++F    +        +P PAI+K        WTGKQ
Sbjct: 558 MGIVQDTLLGCRKITKRDTFIEKDVFMNTLMWWEDFDGKVPAPAILKPRPL----WTGKQ 617

Query: 635 LFSIFLPPDFD---YSS------------PSHRVHINNGELLSSEGSYWLRDTGRNPFQA 694
           +F++ +P   +   YS+               +V I  GELL+         T       
Sbjct: 618 VFNLIIPKQINLLRYSAWHADTETGFITPGDTQVRIERGELLAGTLCKKTLGTSNGSLVH 677

Query: 695 LI--EHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHSHKNMMDDIFCGLQE 754
           +I  E        +L   Q ++  WL   G ++ + D   ++   S    +++     + 
Sbjct: 678 VIWEEVGPDAARKFLGHTQWLVNYWLLQNGFTIGIGD---TIADSSTMEKINETISNAKT 737

Query: 755 AEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAALNQASVDAFKRVFR 814
           A +  +LI+       D   G                ++E + +  LN+A  DA      
Sbjct: 738 AVK--DLIRQFQGKELDPEPG-----------RTMRDTFENRVNQVLNKARDDA------ 797

Query: 815 EIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFGLPHKLSCSSW 874
              +   K   + N+L  M  AGSKG+ + + Q + C+G Q+        +  K     +
Sbjct: 798 --GSSAQKSLAETNNLKAMVTAGSKGSFINISQMTACVGQQN--------VEGKRIPFGF 857

Query: 875 NSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVTNRDSSFSDNAEV-- 902
           + + +P +  KD     ++ F     VENS+L GL P E F H++  R+       +   
Sbjct: 858 DGRTLPHF-TKDDYGPESRGF-----VENSYLRGLTPQEFFFHAMGGREGLIDTAVKTSE 903

BLAST of Carg21647 vs. TAIR 10
Match: AT5G60040.1 (nuclear RNA polymerase C1 )

HSP 1 Score: 164.1 bits (414), Expect = 8.7e-40
Identity = 307/1393 (22.04%), Postives = 530/1393 (38.05%), Query Frame = 0

Query: 61   DPKLGLPNPSYQCTTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSI 120
            DP++G PN    CTTC  +  + C GH+G +K    + +  + + +  +L  +C  C ++
Sbjct: 63   DPRMGPPNKKSICTTCEGN-FQNCPGHYGYLKLDLPVYNVGYFNFILDILKCICKRCSNM 122

Query: 121  RRELWGKVEDPTSDFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSK 180
                   +++   + H  K        LK            + M  + +I  +    ++ 
Sbjct: 123  ------LLDEKLYEDHLRKMRNPRMEPLKKTELAKAVVKKCSTMASQRIITCKKCGYLNG 182

Query: 181  KYQKRVARGGL------------PPDYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKD 240
              +K  A+ G+              D         +Q  +   P   VL    V  L K 
Sbjct: 183  MVKKIAAQFGIGISHDRSKIHGGEIDECKSAISHTKQSTAAINPLTYVLDPNLVLGLFKR 242

Query: 241  IDPKFLKKFVSA--TDSLFLNSFPVTPNCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDF 300
            +  K  +    A   ++L +    V P   R + M     S +    ++ T   K+++  
Sbjct: 243  MSDKDCELLYIAYRPENLIITCMLVPPLSIRPSVMIGGIQSNE----NDLTARLKQIILG 302

Query: 301  RGTANELGSRVLDCLKISKLSPEKLESKDLI------YQQKKIKDTATS------SNGLR 360
              + +++ S+          SP+ ++  D +      Y   +++           S  L+
Sbjct: 303  NASLHKILSQPTS-------SPKNMQVWDTVQIEVARYINSEVRGCQNQPEEHPLSGILQ 362

Query: 361  WIKDV-------VLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNMKK 420
             +K         + GKR +   R V+  DPN++++E+GIP  +A+ L   E +S  N++K
Sbjct: 363  RLKGKGGRFRANLSGKRVEFTGRTVISPDPNLKITEVGIPILMAQILTFPECVSRHNIEK 422

Query: 421  LSTSCYLRLVEK--GEIFVR----REGRLV---RVRHVLELSMGDTIYRPLADGDVVLVN 480
            L   C      K  G   VR        LV   R R   EL++G  + R L +GDVVL N
Sbjct: 423  L-RQCVRNGPNKYPGARNVRYPDGSSRTLVGDYRKRIADELAIGCIVDRHLQEGDVVLFN 482

Query: 481  RPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVELREL 540
            R PS+H+ S++    R++P  + L  N   C+P+  DFDGD ++ +VPQ+ EAR E   L
Sbjct: 483  RQPSLHRMSIMCHRARIMPWRT-LRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAITL 542

Query: 541  VALDRQLVNGQSGRNLLSLSHDSLTAAHLIME-----DGVSLNLFQIQQLQMFALHQLLP 600
            + +   L   ++G  L++ + D LT++ LI       D  + +L             L  
Sbjct: 543  MGVQNNLCTPKNGEILVASTQDFLTSSFLITRKDTFYDRAAFSLICSYMGDGMDSIDLPT 602

Query: 601  PAIVKAPSFRSCAWTGKQLFSIFLPP-------------DFDYSSPSHR----------- 660
            P I+K        WTGKQ+FS+ L P             + ++    H            
Sbjct: 603  PTILKPIEL----WTGKQIFSVLLRPNASIRVYVTLNVKEKNFKKGEHGFDETMCINDGW 662

Query: 661  VHINNGELLSSE-GSYWLRDTGRNPFQALI-----EHCEGRTLNYLHIAQRVLCEWLSMR 720
            V+  N EL+S + G   L +  ++   +++      H     +N L    ++   W+ + 
Sbjct: 663  VYFRNSELISGQLGKATLGNGNKDGLYSILLRDYNSHAAAVCMNRL---AKLSARWIGIH 722

Query: 721  GLSVSLSDLYLSVDSHSHKNMMDDIFCGLQEAEETCNLIQLMVDS-HKDVLTGDDEGNQH 780
            G S+ +                DD+  G + ++E  + IQ   D  H+ +    +E N+ 
Sbjct: 723  GFSIGI----------------DDVQPGEELSKERKDSIQFGYDQCHRKI----EEFNRG 782

Query: 781  VLSIEVEHLSYEKQKSAALNQASVDAFKRVFREIQ---NLVYKYSGK--------DNSLL 840
             L +                +A +D  K +  EI    N + + +GK         NS L
Sbjct: 783  NLQL----------------KAGLDGAKSLEAEITGILNTIREATGKACMSGLHWRNSPL 842

Query: 841  TMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFGLPHKLSCSSWNSQKMPRYIQKDGLADR 900
             M + GSKG+ + + Q   C+G Q                 + N  + P     DG  DR
Sbjct: 843  IMSQCGSKGSPINISQMVACVGQQ-----------------TVNGHRAP-----DGFIDR 902

Query: 901  TQSFIP--------YAVVENSFLSGLNPFECFAHSVTNRDSSFSDNAEVPGT--LTRKLT 960
            +    P           V NSF SGL   E F H++  R+       +   T  ++R+L 
Sbjct: 903  SLPHFPRMSKSPAAKGFVANSFYSGLTATEFFFHTMGGREGLVDTAVKTASTGYMSRRLM 962

Query: 961  FLMRDIYNAYDRTVRNAYGNQLVQFSYDTDS-STSISNELDGENNNTNRDI----GGQPV 1020
              + D+   YD TVRNA G  ++QF+Y  D    ++    DG   N NR         P 
Sbjct: 963  KALEDLLVHYDNTVRNASG-CILQFTYGDDGMDPALMEGKDGAPLNFNRLFLKVQATCPP 1022

Query: 1021 GSLAACALSEAAYSALDQPISLLETSPLLN---LKKVLE----CGSKRNSPKQTF----- 1080
             S      SE      ++ +   + S +     +K + E     G K  SP Q       
Sbjct: 1023 RSHHTYLSSEELSQKFEEELVRHDKSRVCTDAFVKSLREFVSLLGVKSASPPQVLYKASG 1082

Query: 1081 ----SLFLLEKL-------SKRSYGFEYGALGVKN-------------HLERVIFKDIVS 1140
                 L +  K+        K   G   G +G ++             H   V   +I  
Sbjct: 1083 VTDKQLEVFVKICVFRYREKKIEAGTAIGTIGAQSIGEPGTQMTLKTFHFAGVASMNITQ 1142

Query: 1141 SVMIIFAPEPSRKRHFSPWV-CHFHVCKEILKKRRLK-------ISSVIHSLNMRCDSVR 1200
             V  I     + K   +P +        E+   R +K       +  V  S+ +   S  
Sbjct: 1143 GVPRINEIINASKNISTPVISAELENPLELTSARWVKGRIEKTTLGQVAESIEVLMTSTS 1202

Query: 1201 QEAKINLPFLHISTQDCSLADSSREDGDTVCLTVTIAENTKNSFLQLDFIQDLL------ 1260
               +I L    I     S+   S +  +++  T  I  N  N    LD   D+       
Sbjct: 1203 ASVRIILDNKIIEEACLSITPWSVK--NSILKTPRIKLN-DNDIRVLDTGLDITPVVDKS 1262

Query: 1261 -IHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCKSHGELYLRVTMSGEGTSRFWATLMN 1297
              HF L  +      I    I   +R  V +   KS         + G+   + +    N
Sbjct: 1263 RAHFNLHNLKNVLPNIIVNGIKTVERVVVAEDMDKSK-------QIDGKTKWKLFVEGTN 1322

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAG7028326.10.0e+00100.00DNA-directed RNA polymerase IV subunit 1 [Cucurbita argyrosperma subsp. argyrosp... [more]
KAG6596800.10.0e+0099.66DNA-directed RNA polymerase IV subunit 1, partial [Cucurbita argyrosperma subsp.... [more]
XP_022938810.10.0e+0099.66DNA-directed RNA polymerase IV subunit 1 isoform X1 [Cucurbita moschata][more]
XP_023539358.10.0e+0099.46DNA-directed RNA polymerase IV subunit 1 isoform X1 [Cucurbita pepo subsp. pepo][more]
XP_023005247.10.0e+0099.06DNA-directed RNA polymerase IV subunit 1 isoform X1 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
Q9LQ020.0e+0052.91DNA-directed RNA polymerase IV subunit 1 OS=Arabidopsis thaliana OX=3702 GN=NRPD... [more]
Q5D8691.7e-10927.04DNA-directed RNA polymerase V subunit 1 OS=Arabidopsis thaliana OX=3702 GN=NRPE1... [more]
P365948.4e-5623.83DNA-directed RNA polymerase II subunit rpb1 OS=Schizosaccharomyces pombe (strain... [more]
P114141.5e-5224.75DNA-directed RNA polymerase II subunit RPB1 OS=Cricetulus griseus OX=10029 GN=PO... [more]
P249281.5e-5224.75DNA-directed RNA polymerase II subunit RPB1 OS=Homo sapiens OX=9606 GN=POLR2A PE... [more]
Match NameE-valueIdentityDescription
A0A6J1FKU90.0e+0099.66DNA-directed RNA polymerase OS=Cucurbita moschata OX=3662 GN=LOC111444908 PE=4 S... [more]
A0A6J1KSL80.0e+0099.06DNA-directed RNA polymerase OS=Cucurbita maxima OX=3661 GN=LOC111498320 PE=4 SV=... [more]
A0A6J1FF540.0e+0099.56DNA-directed RNA polymerase OS=Cucurbita moschata OX=3662 GN=LOC111444908 PE=4 S... [more]
A0A6J1L1N10.0e+0098.90DNA-directed RNA polymerase OS=Cucurbita maxima OX=3661 GN=LOC111498320 PE=4 SV=... [more]
A0A0A0L2L40.0e+0087.83DNA-directed RNA polymerase OS=Cucumis sativus OX=3659 GN=Csa_3G039340 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G63020.10.0e+0052.91nuclear RNA polymerase D1A [more]
AT1G63020.20.0e+0052.91nuclear RNA polymerase D1A [more]
AT2G40030.11.2e-11027.04nuclear RNA polymerase D1B [more]
AT4G35800.18.1e-4623.53RNA polymerase II large subunit [more]
AT5G60040.18.7e-4022.04nuclear RNA polymerase C1 [more]
InterPro
Analysis Name: InterPro Annotations of Silver-seed gourd (SMH-JMG-627) v2
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR006592RNA polymerase, N-terminalSMARTSM00663rpolaneu7coord: 242..529
e-value: 4.9E-29
score: 112.5
IPR000722RNA polymerase, alpha subunitPFAMPF00623RNA_pol_Rpb1_2coord: 342..499
e-value: 6.6E-34
score: 117.5
NoneNo IPR availableGENE3D2.40.40.20coord: 337..508
e-value: 9.0E-42
score: 145.0
NoneNo IPR availableGENE3D3.10.450.40coord: 1371..1463
e-value: 1.5E-24
score: 88.3
NoneNo IPR availableGENE3D3.30.1490.180RNA polymerase iicoord: 375..427
e-value: 9.0E-42
score: 145.0
NoneNo IPR availablePFAMPF11523DUF3223coord: 1389..1461
e-value: 7.1E-25
score: 87.5
NoneNo IPR availablePANTHERPTHR19376DNA-DIRECTED RNA POLYMERASEcoord: 17..1368
NoneNo IPR availablePANTHERPTHR19376:SF36DNA-DIRECTED RNA POLYMERASE IV SUBUNIT 1coord: 17..1368
NoneNo IPR availableSUPERFAMILY64484beta and beta-prime subunits of DNA dependent RNA-polymerasecoord: 26..1303
IPR038120RNA polymerase Rpb1, funnel domain superfamilyGENE3D1.10.132.30coord: 674..829
e-value: 2.5E-10
score: 42.5
IPR044893RNA polymerase Rpb1, clamp domain superfamilyGENE3D4.10.860.120RNA polymerase II, clamp domaincoord: 28..125
e-value: 2.4E-11
score: 45.4
IPR007083RNA polymerase Rpb1, domain 4PFAMPF05000RNA_pol_Rpb1_4coord: 721..777
e-value: 4.0E-10
score: 39.6
IPR007066RNA polymerase Rpb1, domain 3PFAMPF04983RNA_pol_Rpb1_3coord: 505..652
e-value: 1.2E-11
score: 44.7
IPR042102RNA polymerase Rpb1, domain 3 superfamilyGENE3D1.10.274.100RNA polymerase Rpb1, domain 3coord: 509..657
e-value: 1.3E-18
score: 69.2
IPR040403DNA-directed RNA polymerase IV/V subunit 1, N-terminalCDDcd10506RNAP_IV_RPD1_Ncoord: 40..876
e-value: 0.0
score: 1218.02

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Carg21647-RACarg21647-RAmRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006351 transcription, DNA-templated
cellular_component GO:0005666 RNA polymerase III complex
molecular_function GO:0003677 DNA binding
molecular_function GO:0003899 DNA-directed 5'-3' RNA polymerase activity
molecular_function GO:0046872 metal ion binding