MC04g1461 (gene) Bitter gourd (Dali-11) v1

Overview
Name: MC04g1461
Type: gene
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: protein RNA-directed DNA methylation 3
Location: MC04: 22505760 .. 22515103 (-)
RNA-Seq Expression: MC04g1461
Synteny: MC04g1461
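
The Location field packs the chromosome, coordinates, and strand into a single string. A minimal sketch of how it could be parsed is shown here; the names `Location` and `parse_location` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which the page itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'SEQID: START .. END (STRAND)' string."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive at both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location string such as 'MC04: 22505760 .. 22515103 (-)'."""
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("MC04: 22505760 .. 22515103 (-)")
print(loc.seqid, loc.start, loc.end, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 9344 bp on the minus strand of MC04.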
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: utr5, polypeptide, CDS
GTTTAATTTAAAAAATATAAAAAATGGTACAAATTCAACTTTTCTGAAAGAGGAGAAAAAGAAAAAAGAAAAAAGAAAAAAAAAAGAGCGAAAACGCAGCGTAAAAAAGGGAGGGAACCCATCAGAGCAGAGCTCGCAGTGGACGATTCAGACTCTCAACGCCTGAGGGGCGCAGTCCATTTCTCTCTCAATTTGCATGTTCCTTCTGGTTTCTGCCATTTCCTTACTCCAAATTCGAATCTTCCCCTCTCCGCCATAGCCATGGCGTCCAAGGGCAAGGGGATTGCCAAGGACACATCCTCTGGAAAGCGGAAGCAACGCGAAGACAACAATGGCGCCGACCTCGCCCGTAAGAGGAGGGACCGGAGCGTTCTTCAGTTCTTCGAGGACGTCGCTCCGGAGGTTGGTGGCGAAAGTGACGATAGCGATTTTCTCGATGGTACATTATCTCTCATTCTCGTTCACTCATCATCACTGTTTTCGCCTCTTTTGGGGGTTTCGAGGGCTGTTTCGAGCTTCTAGTTTTTAATATGTTTGCTCTGCATTTTTTTCGGTCTTGGGGGAGTAATTGGTGAAAATGGTCACGTTTTGGCTCGTTTTTTCCTCCTAGAAGATCCGTCTTAAACCCTTTTACTCCGTTGAAAAAGCCTGTTAGATTATAAATTTGTGCGCCTATCTCATTCTATAGGTTGATGATAATTTTGACATATTTTCTGAACATGTGGAGAGGCCCGCGCTAATTGTATGATGCTTCTAGCGAGGTCGATTTAAGTCGCTAGAGTGCATCTGGTGCGGCTTTCTCTGACTTTGATTTTGGATGTCTTAAGTTTGTTAATGGATAGATGGACTGATCTCGGAACTCAAAGTCAGCATTGGCTGTAACTTTGTGGGTCGTTTTTCTTCCAAATTCCTCATGTACGTGTGTTATAATCACCGTTTTGTATCTTTGTATCACAAGGTTTTGGTTTGGTTCTTGACAATACGGAGATATGACCAGTTCGGTAGTATTGATTTGGTTTGGTGGTCAGTTTTGTTGCTTTTGTTCAGACAATCAGTTCAGGTTTTCAAAAAACTGGTTTTAGCTTCGGTTTGTTCCTTAAAACGATGGGTCGGTTTGGTGACAATTAAGATCCTAAAAGTCAAGCACAATGGTTGAGTTCTTTCATATCGTGCTTCTTCACGTTTAAACTTTAAACTTTCTTCTTCTGGGTATTCATTTTGTTGCCATTTCTATTGGTTTTAATTTTTGATAAATTCTTAATTTGGTGTATATCGTCTTTTAAGGTTTTGGATTGTGAGAAGGTTGCTTTTCATTCTGAAGTCTAGGTATATTTTCCCTTTATGCAATTTCCTTGTGCTGTTATGAATATGCTGAAAGTTTGATTTCCTTACATTATTTTACATTTGTCTTTGTAAGAGAATGGTCGGTTTTTTTGGGTGAGTGGTTGGCTTGTGCCATGGGCTCACCTCGAGGCAAGGTGTAAATTGTTAGCCTCAATGCTTATGCGTACTTTGGTTTATAAGAGGTTTTTCCCTCCATGAAAATGCATTAAGGGATGGAGTCTTTCCACCCCATGTAATGGCCAGAAGAATAACGTTGATTTATTGTATATTTTTCAAAATATAATAGTCCACTTATGTTTTTAATATTGATAGAACATAGATTGTGGTAACTCACTCATTGTTACTTTAACCTCCAAGGCTTAAAAAATTTATTGAAGATTAAAGTGATAGAATAAGTTGATCCTATTAGGAATGACCTTAAGAATCATTATTTAAGAAAGTAATCTAATCTTTAGAATTTCTCAGTGCTAAAGGTTATGAAATCATTTGGTTGTCTTGTGGGATGAATTGAGGCATGTATAAGTTGGCTCAATGCCCGCAGTTAAAAAAAAAATCAATGTTTCTTCTATCCAAACTGTAGACTCAATTCTGCCTTTTCATTTCCTTTTACTCAACCCTATTATTTGACAAAAACAGATTTCATGGAGGAAGAGTTT
GATACAGAACCGGCATTTAAGAATGATGCTGCAAAAGATCAAAATATTCCATTCTTCCCAAAAGAAGAGGAAATGAATGAAGAGGAGTTTGATAGAATTATGGAGGAGCACTACAATCAAGGTCCTGGACTCGGTGCATTTGCAGAAGAAAATTATGAGAACAAAAATTCTACTGGAAGAAACCCTCTTCAGCCGTCTTCCAGGGATACTATCTCTCTGTGGAAAGTTAAATGCATGGTATGGAGATTAATCAATGGAAACAAATGCTTCAGCTAAGGATTGAGTATCCAGTCTTCATTTCCATTTTTTCTAACTCCATATATTGATTTTTACTGCCTGGAAATTTTCTTTGTAGGTTGGACGTGAGCGGCAATCAGTTTTTTGTCTTATGCAGAAATTTGTTGATTTGCACTCATTTGGTACCAAGCTACAGATAAAATCTGCATTTTATGTAGACCATGTAAAAGGTTTTATTTACATAGAAGCTCCTAGGCAGTATGATTTAATTGAGGTAACACGAGATTTTCTTGTAGTGCTAGCATAATTTCTTTAGTCAATAATCAGTACAACATTTGACAAGTTACTCTGGATCCAGGCATGTAAAGGGATCAGTGGCATATATTCTACTCGCATAGCTTCTGTTCCCGAAAATGACATCTCTCAGTTGCTTACTGTTCGAAGTAGAGTCAGTGAAGTTTCTGTAGGTACAATGGCCCGTGTAAAAAATGGAAAATACAAGGGAGACCTTGCTCAGGTAAAGTGTACTAACTTGTAGACAGACTGTGCTTTTGCATCTATGAGCACTACTTCCCCCTTTGTTTTTCAAAAATAATATTTAGTGATTTCTGCAGATTGTTGCTGTCAACAATGCACGCAAGAGAGCGACTGTGAAGCTTGTTCCAAGGATTGATCTCCAAGCTATGGCTGCAAAATTTGTATGAAATTTTGTTTACCTTTTGAAATCTTATATCTTAAAAGAGTGTAGTATTATTCTACTTGCTGCAACTAGCTATATGATACTACATTACTGGGAGCTTGTGAGGTTACCCTACAGACTCTACCTGGCCTATTCCATTTATATATTTCAGATAATTCGTACCTATTTGATTATCACTATTAGTACACCACAAAAATGTGGGTTCTTTTTATTTCCAAAAATCGGACTCTTTTTATTATTTTTAATTGTTATATAGGGAAATGCAGTTATGATGTTGGTAACTGCTTTGTGTACTTGAGATGCATTGAAATGGAGAGTTCGTACTTTCTTTAAAATCTTTTTTTATTAAAGTTATCCCCCGTTCTTTATTCAGGACATTTGCAGAAAGCTTTTTTTAATAAATTGCCTCAGCCTTTTGTCTTTGGGTTTGATTTCTTACCCTTTGTAACCTTGAGGGACATTCCTTTTGACGATTTTTTTGGTTTATTGGAGAATATTATTAGCTATAGTTTGGATCTGTCATTTTCTTGACTTATATTTTGAATCGTTAGCCTTTCTGCCTTTTGTTTTCAAGTGATTCTATGATGCTTTGAACGGTAGTTTTCAGGTGCTGCTCCTCGAGCAATTTATCTTAACAGCTACACTAAAATTATGTTAACAACTTAACACTAGGGAAGCACAGACTTGATGTCAAATTAAATATCATAAATTAGTAAACTTGATATATAAGCATAGATGATTTTGATTTTGTTGTAAGGTTTGTATTTATCACGTAAAGAATAAGGTCTGAAAAGTGTGTGCGTTTGTTCGGGGGGTGTGTGGGGGGGGGGGATTGGTTCTTGGACGTGCCTTTGATCAACACCAACCATATTATATTGGAATGAACAATGTAGTTGTACGAATATATGTAGTAGTCAATTGAAAAAGCTCTCTCATTTCAGACGTCGGTTTTCCGCACGAAAGAATTTGGTTGTAGATTTTTTATGGAAAGTTGAATTGTAAAATCTGTAGTCAATTAACAAAGTTTGTGCTATATACGTCTTATTTTGTGTTTGAAGTTTCT
TCTATTATATATATGTACTTGGTGCCTTTAAGTTTAAAGAGAAAAATGTTATGAATACATATTCCATGTGGAATAATATATCTTCTCATTATAGGGTGGAGGAGTTGCTGCTAAGAAAACTACCAATCCTGCACCACGATTGATCAACTCTAGTGAACTTGAGTAATAAGAAATAATTTGTTGCTTTCATTATTATTTTTTCTATCTGGAGTACATTGACATTGATATTTTTTTTTCTTTGCTTTTTGTAGTGAATTTCGACCTCTCATGCAATTTAGGCGTGACCGTGAAACTGGGAAACTTTTTGAGTTTCTTGATGGGATGATGCTCAAGGATGGATATCTATTCAAAAAAATATCTTTAGATTCATTGAACTGCTGGGGCGTAATGCCATCTGAAGATGAGCTCTTAAAGTTCAAGCCTCCTGAGAGCAACGAGTCTAATGATCTAGAGTGGCTTTCTCAACTTTATGGTGAAAAAAGGAAAAAGAAGATCATTAGGACTGAAAAGGGGGGTGGAAAAGGAGAGGGCACATCAGGATCTAGTTCCATGAACAGCTTTGCAGATCATGACCTTGTTTGTTTTGGGTGAGTCCTTGGTCTATCTACTCATAAAAATCGTGAAATACTTGCAGCTAATAGATTTCTGTTGACTCGTTACTCAATTACTTTGCAATGGTAAACGTTGCTGTACCATGCAGCCGGAAAGATTTTGGGATGATATTAGGAACGGAAAAAGATGACAGTTACAAGGTATATATATCTCTAAAAGTTTACACTGATTTATTTTTTACCATAGGGGCTTTTACTTACTTCCATTACTTGTATTTGATCTAGATTTTGAAGGAAGGCCCTGATGGGTCTATTGTGGTGAATGTGCAACGGAAAGAGTTGAAAAGTGGGCCTTTAGAGGGTAAATTTACCGCTGCTGATCACAATGGAAAGATCATCTCTGTTTCAGACAATGTCAAGGTGTTGGAAGGATCACTTAAGGTTTGTTATTTTTTTTATACAATTTACCTGTTCATGTACCATATAAAGTATGCTGTCTTATGCAATCACTTTTAACTTACAGGATAAGCAAGGGATTGTTAAGCATGTTTATAGACACACGGTGTTTGTATATGATGAGAATGAGGTGGACAACGATGGTTATTTCTGCTGCAAATCTAACATGTGCGAGAAAATCAAGATTTCTTATGATGCACCTAGTGGAAAGGTCGAGTGAACTTCCCCTTTATTTTGTTGATCAGTTGTTTAGCTTGGGCCCAGTCCAACAATTTTTTCTTTTCTCCATTTAGGATGATGACAAAGGTTTCTCTGGTTTTGAGGATTTCTCCTTTTCTCCCAAGTCACCCCTATCACCTAAAAAGCCATGGGCAGAGAAGGACAGTGGTCGTGAATGTATGTATTAATATTGCTGATGCATAGGCATTTAACCCATATATATTTCTGTTGAAGACATTCTCTGGATGCAGACAACCGTGATGATAAAGATGGAATGTTCTCTATTGGTCAAACCTTGAGAATACGTGTCGGTCCTTTAAAGGGATACCTATGCCGTGTTATAGCCGTACGTAAAAAAGATGTTACAGTGAAGCTTGATTCTCAACAGAAGGTTCTTACAGGTATGTTGGTTGTGATTGTATCTTGGCCCTGTTGAATACTCTGTCAATACTGGTAAGGTTGAGGGCTAGGCAACATTTCATCTATTATTTATTTTCTTCAATTTATTTCCCACAGTCAGATCTGATTTTCTTTCTGAGGTATATCGGAAGACCTCTACTGTGTCTCTCAGGTATTTTTAACTAAAAAATCTCAATGTGTTCTTGTGTAGTTTACTATAATTTGTACTGCTATGTCTATATCTCTCTCTCACAGAATACGAATATCATTTTGTAGTGAGGATACAGAATTCGGTTCTCTAAAACCTTTCGATATACTGGGAAATGAAGGCAGTTCCCAAGGTTTGAATTCGTGTTTCAGCCTAGAAATTTGAAA
CTTTCTTACATGAATAGGTGCACTTGAATGTTTGAGAGCTTTTTTAATCTCTTTCAAATGTATTTTTTAAAACTAAAAATTCAGATTTATTGCCTATGCACAGAATCAGTTTTATTTTACCAAAAAATGCAGATTTATTTCCTATGTGCAAGTTTTTATTTTTCGTTGCAAGTTCTCTCTCTCTCTCTCTCTGGGAAATGATTACTAGTTTGACACGCTTGTTCTTTTTTTTAAGATTGGATGGGTGGCACGGGGTCATCCACAGGTGGTGATGGCTGGAATTCTGCTGGACCTTCTTCAGAAAGGTATGTTTCTGCTACCAATTTACATGGATATCCAATTTTTTTTGCTTGTGTGTGTGTTTTCCATTTCTTGATCTAAAATTTAGAAAGAAGAAGAAAAAGGAAAAGAAAAAGAGAAAAGAAACTGTGGAATGCTGTCCTATGACAATCAAGTCTGAAATTTGTAGAATGTGATTATCTGCCTGGTTTTCAATGTAATACTGTAGTTCATAGAAAGTTTTATTGACGCGGTTCATTATCTACTTCAAGCTATATGAAATCACGATCAGGAAAATAAAGACAACAGTAGTTATTTATGGATGGACCTGCAAGTGTGTATTTTTGTAAAATGCATTTATTGAACTTTCCTTGCATTTGTTTTTGCGTTAAAAGAGGTTGTCCCATTAACGCCTCTTTTGAAATTTTTATTTCCTCGACATTTTTATGTCAATCTTAGGTCGCCTATCTTCTTCTCCAAGAGAACATTGATTGCGTCTCTGCTTTTACGTGTCTTTTTGCTGCTTGTAGAAATAAGCTTCTTGTTTCAATCTGGATAGATCACTGAAAGTAGTTGATAGATATGATCATGAGAAATTTATAATGTAATAGTCCCAAAAAAAATTGTCTTGGACATATTCTAACTATGCATGCATCAAGATGTGGAATGGTTGGTGAGTGAAACGTTGAAATTGTCGTTTGTTTGCATGTATAAATAATTTTCTTTGATTAGTTTTATCTCATATAGGAACCAATAACTGTTTACACGTCTTTTAAAAAATTACTTTTTCGTTTTACCTTATATAAGATCCAATAACTGCATAGTGAGACCTTTGTAATTATAAATTATGAATTTCACTACTTTCTCTATGTGCCTTCATAAGTTTGTTTATGCAAATGTACTTCCCTGCAGAAGTCCTTGGCCCAGTTTCCCAGAGTCTGGTACCTCGGTTAGTTTTTTTATTCCATCTTCTTCTCTCTCTTCTTTGTGCTGTTTAATGTTAATATTATAAAAGAATTCCATGCATTTTATTTCTTTCCCTTTCTGGTTGCAGAATTGTCCAGGGTCCAGTTCTAATCCTTTCGGTTCTGAAAGCCTTGATGCTCAAAAAGGTACCATGAATAGGGCATTTACTTTAGAAATTGATGCTTTATTCATGAATAGAAATTTTTGATAAGCTGAGTTATGTTGCTTCCATGATAATTACAGATGTTGAAGATTCTCCTTGGGTTAGCAAGACGGTTCCAAATGCAAGCACTTCCTGGGGTGCTGCAAAATCAAATGTTGACGATACTGTTAATGATGACCAAGCCCCTGGATGGGGGAAGAGCGAGTCATGGGACAAAGCTACTGCTAAAACTAGTTCAGATGGCAATGCCTCTGGTGCCTGGGACAAAAAAGTAGTACCTGGTGGAGATTGTGCAGGCCCTACAGATCAAGCTGAGGACAAGTGGGACAAGGGAAAGCGTGTGTCTTCTGATAATCAGACTGGCAACTGGGGTGATGGAACTTCTGGTAAGAATGAACCCAGTGCATGGAGCAGAGATAAAGATTCGGAATCCGGGGGGTGGAAAAAGAGTCAAAATGCTAGTTTTGGTGATGAAAATAATTCAGCAGAAGCCTCAGCTGATAAGTGGAGCAGCAAAAATCGATCAAGTGGAGGCTGGGGAGATTGTAATGCTTCAACCACTGTCTCTGAGATCAAGCCAGCTGGCAAGAGC
AATGCAGGTAGTTCTGCTTGGAATATAAGTGACGAAAACTCTTCCTGGAATACCCAAAAACAAGAATCAAATCGTGGCAGTTGGGGGAAGCCTAGTGACCGTGGAGGTACTAGGTCTTCTGAAAGTGGAAGAGGTGGAAATCAAGTCAGGGGTGGATCATCTGGTCAAGACAGCTCATCTGATTGGAATAAATCTACCACTACTGATGGAGCTGGAAAAAAGGACGGTTGGAATAAACCAAATCTTGCTAGTGAGGATGAAAGTATTGGGAAAAAAGGATGGGGACAGGGTAATGAAGCCAATGATAGTGGTAACAAATGGCAGAGTTCAGGGTCTGATGGTGGAAAAAAATGGGGCACTAATGAATCAGAACGTGATGGTGGAAACAAAAGTTGGAATACATCCAAGTCATCTGATGGGGATACTGGCGGTGGGAATTCAGCTATTTGGAAAGACAAATCTGATTCCTCAAGTTTGACAGCCCCAAAAGGAGACCAATGGGGCGGAGATTGGGATAAGCAGCACAGTTCAAATGATACCAAAGCCTCTGAGGACAATTCTCCTTGGAATAAGAAATCTGTTGAAAGTGGCAAAGATGGTGAGCGTAAGGATCAAGGCAGTGGCTGGAATGTTGGAAAAAGTTCTGGTGGAGATTCTGCATCTGGATGGGGTCAAACCAGCAAGGAAGCTGGTTCAAGTGACCAAGCAGGTAGCTGGGGTTCTAATTGGAAAAAGAATTCTAATGCTGGGAATGAGGATTCTAGTTGGGCCAAGAAAAGCAACTGGAACTCAGGAAATGAATCTAATGATAATCAATTCACAGGTGATACCTCAGGCCATGGAAGTTGGGGGGGAGATGGTAGTGATAGAGGAGGCTTTAGAGGTAGAGGTGGTTTCAGAGGAAGGGGAGAAAGGGGCAGGTTTGGTGGTAGAGGCAGATCTGATAGAGGTGGATCAGACAGAGGAGGATTTGGAGGCAGAGGCCGTGGAAGATGGAACAGTGAAGGTGGTTCAAATGATGGGGAGAATAGAGGGTGGAGTAGTGGTGGTGGTGGAGGTGACTGGGAAAAATCAGGATCAGACCGAGGAGGATTTGGTGGAGGCAGAGGCCGTGGAAGATGGAACCAAGAAGCTGGTTCAAATGACGGTGACAGTGGAGGGTGGAGTGGTGGTGGTGGCGGACATAGAGGCAGAGGCCGGGGAAGATGGAATCAAAATGGTGATTCAAATTCAAATGATGGTGACAATGGAGGTTGGAGTAGCGGCGGTGGCAGAGGCGGATTTGGAGGTGGTAGAGGCCGTGGAAGATGGAATCAAGAAGGTGATTCAAATTCAAATGATGGT

mRNA sequence

GTTTAATTTAAAAAATATAAAAAATGGTACAAATTCAACTTTTCTGAAAGAGGAGAAAAAGAAAAAAGAAAAAAGAAAAAAAAAAGAGCGAAAACGCAGCGTAAAAAAGGGAGGGAACCCATCAGAGCAGAGCTCGCAGTGGACGATTCAGACTCTCAACGCCTGAGGGGCGCAGTCCATTTCTCTCTCAATTTGCATGTTCCTTCTGGTTTCTGCCATTTCCTTACTCCAAATTCGAATCTTCCCCTCTCCGCCATAGCCATGGCGTCCAAGGGCAAGGGGATTGCCAAGGACACATCCTCTGGAAAGCGGAAGCAACGCGAAGACAACAATGGCGCCGACCTCGCCCGTAAGAGGAGGGACCGGAGCGTTCTTCAGTTCTTCGAGGACGTCGCTCCGGAGGTTGGTGGCGAAAGTGACGATAGCGATTTTCTCGATGATTTCATGGAGGAAGAGTTTGATACAGAACCGGCATTTAAGAATGATGCTGCAAAAGATCAAAATATTCCATTCTTCCCAAAAGAAGAGGAAATGAATGAAGAGGAGTTTGATAGAATTATGGAGGAGCACTACAATCAAGGTCCTGGACTCGGTGCATTTGCAGAAGAAAATTATGAGAACAAAAATTCTACTGGAAGAAACCCTCTTCAGCCGTCTTCCAGGGATACTATCTCTCTGTGGAAAGTTAAATGCATGGTTGGACGTGAGCGGCAATCAGTTTTTTGTCTTATGCAGAAATTTGTTGATTTGCACTCATTTGGTACCAAGCTACAGATAAAATCTGCATTTTATGTAGACCATGTAAAAGGTTTTATTTACATAGAAGCTCCTAGGCAGTATGATTTAATTGAGGCATGTAAAGGGATCAGTGGCATATATTCTACTCGCATAGCTTCTGTTCCCGAAAATGACATCTCTCAGTTGCTTACTGTTCGAAGTAGAGTCAGTGAAGTTTCTGTAGGTACAATGGCCCGTGTAAAAAATGGAAAATACAAGGGAGACCTTGCTCAGATTGTTGCTGTCAACAATGCACGCAAGAGAGCGACTGTGAAGCTTGTTCCAAGGATTGATCTCCAAGCTATGGCTGCAAAATTTGGTGGAGGAGTTGCTGCTAAGAAAACTACCAATCCTGCACCACGATTGATCAACTCTAGTGAACTTGATGAATTTCGACCTCTCATGCAATTTAGGCGTGACCGTGAAACTGGGAAACTTTTTGAGTTTCTTGATGGGATGATGCTCAAGGATGGATATCTATTCAAAAAAATATCTTTAGATTCATTGAACTGCTGGGGCGTAATGCCATCTGAAGATGAGCTCTTAAAGTTCAAGCCTCCTGAGAGCAACGAGTCTAATGATCTAGAGTGGCTTTCTCAACTTTATGGTGAAAAAAGGAAAAAGAAGATCATTAGGACTGAAAAGGGGGGTGGAAAAGGAGAGGGCACATCAGGATCTAGTTCCATGAACAGCTTTGCAGATCATGACCTTGTTTGTTTTGGCCGGAAAGATTTTGGGATGATATTAGGAACGGAAAAAGATGACAGTTACAAGATTTTGAAGGAAGGCCCTGATGGGTCTATTGTGGTGAATGTGCAACGGAAAGAGTTGAAAAGTGGGCCTTTAGAGGGTAAATTTACCGCTGCTGATCACAATGGAAAGATCATCTCTGTTTCAGACAATGTCAAGGTGTTGGAAGGATCACTTAAGGATAAGCAAGGGATTGTTAAGCATGTTTATAGACACACGGTGTTTGTATATGATGAGAATGAGGTGGACAACGATGGTTATTTCTGCTGCAAATCTAACATGTGCGAGAAAATCAAGATTTCTTATGATGCACCTAGTGGAAAGGATGATGACAAAGGTTTCTCTGGTTTTGAGGATTTCTCCTTTTCTCCCAAGTCACCCCTATCACCTAAAAAGCCATGGGCAGAGAAGGACAGTGGTCGTGAATACAACCGTGATGATAAAGATGGAATGTTCTCTATTGGTCAAACCTT
GAGAATACGTGTCGGTCCTTTAAAGGGATACCTATGCCGTGTTATAGCCGTACGTAAAAAAGATGTTACAGTGAAGCTTGATTCTCAACAGAAGGTTCTTACAGTCAGATCTGATTTTCTTTCTGAGGTATATCGGAAGACCTCTACTGTGTCTCTCAGAATACGAATATCATTTTGTAGTGAGGATACAGAATTCGGTTCTCTAAAACCTTTCGATATACTGGGAAATGAAGGCAGTTCCCAAGATTGGATGGGTGGCACGGGGTCATCCACAGGTGGTGATGGCTGGAATTCTGCTGGACCTTCTTCAGAAAGAAGTCCTTGGCCCAGTTTCCCAGAGTCTGGTACCTCGAATTGTCCAGGGTCCAGTTCTAATCCTTTCGGTTCTGAAAGCCTTGATGCTCAAAAAGATGTTGAAGATTCTCCTTGGGTTAGCAAGACGGTTCCAAATGCAAGCACTTCCTGGGGTGCTGCAAAATCAAATGTTGACGATACTGTTAATGATGACCAAGCCCCTGGATGGGGGAAGAGCGAGTCATGGGACAAAGCTACTGCTAAAACTAGTTCAGATGGCAATGCCTCTGGTGCCTGGGACAAAAAAGTAGTACCTGGTGGAGATTGTGCAGGCCCTACAGATCAAGCTGAGGACAAAGATAAAGATTCGGAATCCGGGGGGTGGAAAAAGAGTCAAAATGCTAGTTTTGGTGATGAAAATAATTCAGCAGAAGCCTCAGCTGATAAGTGGAGCAGCAAAAATCGATCAAGTGGAGGCTGGGGAGATTGTAATGCTTCAACCACTGTCTCTGAGATCAAGCCAGCTGGCAAGAGCAATGCAGGTAGTTCTGCTTGGAATATAAGTGACGAAAACTCTTCCTGGAATACCCAAAAACAAGAATCAAATCGTGGCAGTTGGGGGAAGCCTAGTGACCGTGGAGGTACTAGCTCATCTGATTGGAATAAATCTACCACTACTGATGGAGCTGGAAAAAAGGACGGTTGGAATAAACCAAATCTTGCTAGTGAGGATGAAAGTATTGGGAAAAAAGGATGGGGACAGGGTAATGAAGCCAATGATAGTGGTAACAAATGGCAGAGTTCAGGGTCTGATGGTGGAAAAAAATGGGGCACTAATGAATCAGAACGTGATGGTGGAAACAAAAGTTGGAATACATCCAAGTCATCTGATGGGGATACTGGCGGTGGGAATTCAGCTATTTGGAAAGACAAATCTGATTCCTCAAGTTTGACAGCCCCAAAAGGAGACCAATGGGGCGGAGATTGGGATAAGCAGCACAGTTCAAATGATACCAAAGCCTCTGAGGACAATTCTCCTTGGAATAAGAAATCTGTTGAAAGTGGCAAAGATGGTGAGCGTAAGGATCAAGGCAGTGGCTGGAATGTTGGAAAAAGTTCTGGTGGAGATTCTGCATCTGGATGGGGTCAAACCAGCAAGGAAGCTGGTTCAAGTGACCAAGCAGGTAGCTGGGGTTCTAATTGGAAAAAGAATTCTAATGCTGGGAATGAGGATTCTAGTTGGGCCAAGAAAAGCAACTGGAACTCAGGAAATGAATCTAATGATAATCAATTCACAGGTGATACCTCAGGCCATGGAAGTTGGGGGGGAGATGGTAGTGATAGAGGAGGCTTTAGAGGTAGAGGTGGTTTCAGAGGAAGGGGAGAAAGGGGCAGGTTTGGTGGTAGAGGCAGATCTGATAGAGGTGGATCAGACAGAGGAGGATTTGGAGGCAGAGGCCGTGGAAGATGGAACAGTGAAGGTGGTTCAAATGATGGGGAGAATAGAGGGTGGAGTAGTGGTGGTGGTGGAGGTGACTGGGAAAAATCAGGATCAGACCGAGGAGGATTTGGTGGAGGCAGAGGCCGTGGAAGATGGAACCAAGAAGCTGGTTCAAATGACGGTGACAGTGGAGGGTGGAGTGGTGGTGGTGGCGGACATAGAGGCAGAGGCCGGGGAAGATGGAATCAAAATGGTGATTCAAATT
CAAATGATGGTGACAATGGAGGTTGGAGTAGCGGCGGTGGCAGAGGCGGATTTGGAGGTGGTAGAGGCCGTGGAAGATGGAATCAAGAAGGTGATTCAAATTCAAATGATGGT

Coding sequence (CDS)

ATGGCGTCCAAGGGCAAGGGGATTGCCAAGGACACATCCTCTGGAAAGCGGAAGCAACGCGAAGACAACAATGGCGCCGACCTCGCCCGTAAGAGGAGGGACCGGAGCGTTCTTCAGTTCTTCGAGGACGTCGCTCCGGAGGTTGGTGGCGAAAGTGACGATAGCGATTTTCTCGATGATTTCATGGAGGAAGAGTTTGATACAGAACCGGCATTTAAGAATGATGCTGCAAAAGATCAAAATATTCCATTCTTCCCAAAAGAAGAGGAAATGAATGAAGAGGAGTTTGATAGAATTATGGAGGAGCACTACAATCAAGGTCCTGGACTCGGTGCATTTGCAGAAGAAAATTATGAGAACAAAAATTCTACTGGAAGAAACCCTCTTCAGCCGTCTTCCAGGGATACTATCTCTCTGTGGAAAGTTAAATGCATGGTTGGACGTGAGCGGCAATCAGTTTTTTGTCTTATGCAGAAATTTGTTGATTTGCACTCATTTGGTACCAAGCTACAGATAAAATCTGCATTTTATGTAGACCATGTAAAAGGTTTTATTTACATAGAAGCTCCTAGGCAGTATGATTTAATTGAGGCATGTAAAGGGATCAGTGGCATATATTCTACTCGCATAGCTTCTGTTCCCGAAAATGACATCTCTCAGTTGCTTACTGTTCGAAGTAGAGTCAGTGAAGTTTCTGTAGGTACAATGGCCCGTGTAAAAAATGGAAAATACAAGGGAGACCTTGCTCAGATTGTTGCTGTCAACAATGCACGCAAGAGAGCGACTGTGAAGCTTGTTCCAAGGATTGATCTCCAAGCTATGGCTGCAAAATTTGGTGGAGGAGTTGCTGCTAAGAAAACTACCAATCCTGCACCACGATTGATCAACTCTAGTGAACTTGATGAATTTCGACCTCTCATGCAATTTAGGCGTGACCGTGAAACTGGGAAACTTTTTGAGTTTCTTGATGGGATGATGCTCAAGGATGGATATCTATTCAAAAAAATATCTTTAGATTCATTGAACTGCTGGGGCGTAATGCCATCTGAAGATGAGCTCTTAAAGTTCAAGCCTCCTGAGAGCAACGAGTCTAATGATCTAGAGTGGCTTTCTCAACTTTATGGTGAAAAAAGGAAAAAGAAGATCATTAGGACTGAAAAGGGGGGTGGAAAAGGAGAGGGCACATCAGGATCTAGTTCCATGAACAGCTTTGCAGATCATGACCTTGTTTGTTTTGGCCGGAAAGATTTTGGGATGATATTAGGAACGGAAAAAGATGACAGTTACAAGATTTTGAAGGAAGGCCCTGATGGGTCTATTGTGGTGAATGTGCAACGGAAAGAGTTGAAAAGTGGGCCTTTAGAGGGTAAATTTACCGCTGCTGATCACAATGGAAAGATCATCTCTGTTTCAGACAATGTCAAGGTGTTGGAAGGATCACTTAAGGATAAGCAAGGGATTGTTAAGCATGTTTATAGACACACGGTGTTTGTATATGATGAGAATGAGGTGGACAACGATGGTTATTTCTGCTGCAAATCTAACATGTGCGAGAAAATCAAGATTTCTTATGATGCACCTAGTGGAAAGGATGATGACAAAGGTTTCTCTGGTTTTGAGGATTTCTCCTTTTCTCCCAAGTCACCCCTATCACCTAAAAAGCCATGGGCAGAGAAGGACAGTGGTCGTGAATACAACCGTGATGATAAAGATGGAATGTTCTCTATTGGTCAAACCTTGAGAATACGTGTCGGTCCTTTAAAGGGATACCTATGCCGTGTTATAGCCGTACGTAAAAAAGATGTTACAGTGAAGCTTGATTCTCAACAGAAGGTTCTTACAGTCAGATCTGATTTTCTTTCTGAGGTATATCGGAAGACCTCTACTGTGTCTCTCAGAATACGAATATCATTTTGTAGTGAGGATACAGAATTCGGTTCTCTAAAACCTTTCGATATACTGGGAAATGAAGGCAGTTCCCAAGATTGGATGGGTGGCAC
GGGGTCATCCACAGGTGGTGATGGCTGGAATTCTGCTGGACCTTCTTCAGAAAGAAGTCCTTGGCCCAGTTTCCCAGAGTCTGGTACCTCGAATTGTCCAGGGTCCAGTTCTAATCCTTTCGGTTCTGAAAGCCTTGATGCTCAAAAAGATGTTGAAGATTCTCCTTGGGTTAGCAAGACGGTTCCAAATGCAAGCACTTCCTGGGGTGCTGCAAAATCAAATGTTGACGATACTGTTAATGATGACCAAGCCCCTGGATGGGGGAAGAGCGAGTCATGGGACAAAGCTACTGCTAAAACTAGTTCAGATGGCAATGCCTCTGGTGCCTGGGACAAAAAAGTAGTACCTGGTGGAGATTGTGCAGGCCCTACAGATCAAGCTGAGGACAAAGATAAAGATTCGGAATCCGGGGGGTGGAAAAAGAGTCAAAATGCTAGTTTTGGTGATGAAAATAATTCAGCAGAAGCCTCAGCTGATAAGTGGAGCAGCAAAAATCGATCAAGTGGAGGCTGGGGAGATTGTAATGCTTCAACCACTGTCTCTGAGATCAAGCCAGCTGGCAAGAGCAATGCAGGTAGTTCTGCTTGGAATATAAGTGACGAAAACTCTTCCTGGAATACCCAAAAACAAGAATCAAATCGTGGCAGTTGGGGGAAGCCTAGTGACCGTGGAGGTACTAGCTCATCTGATTGGAATAAATCTACCACTACTGATGGAGCTGGAAAAAAGGACGGTTGGAATAAACCAAATCTTGCTAGTGAGGATGAAAGTATTGGGAAAAAAGGATGGGGACAGGGTAATGAAGCCAATGATAGTGGTAACAAATGGCAGAGTTCAGGGTCTGATGGTGGAAAAAAATGGGGCACTAATGAATCAGAACGTGATGGTGGAAACAAAAGTTGGAATACATCCAAGTCATCTGATGGGGATACTGGCGGTGGGAATTCAGCTATTTGGAAAGACAAATCTGATTCCTCAAGTTTGACAGCCCCAAAAGGAGACCAATGGGGCGGAGATTGGGATAAGCAGCACAGTTCAAATGATACCAAAGCCTCTGAGGACAATTCTCCTTGGAATAAGAAATCTGTTGAAAGTGGCAAAGATGGTGAGCGTAAGGATCAAGGCAGTGGCTGGAATGTTGGAAAAAGTTCTGGTGGAGATTCTGCATCTGGATGGGGTCAAACCAGCAAGGAAGCTGGTTCAAGTGACCAAGCAGGTAGCTGGGGTTCTAATTGGAAAAAGAATTCTAATGCTGGGAATGAGGATTCTAGTTGGGCCAAGAAAAGCAACTGGAACTCAGGAAATGAATCTAATGATAATCAATTCACAGGTGATACCTCAGGCCATGGAAGTTGGGGGGGAGATGGTAGTGATAGAGGAGGCTTTAGAGGTAGAGGTGGTTTCAGAGGAAGGGGAGAAAGGGGCAGGTTTGGTGGTAGAGGCAGATCTGATAGAGGTGGATCAGACAGAGGAGGATTTGGAGGCAGAGGCCGTGGAAGATGGAACAGTGAAGGTGGTTCAAATGATGGGGAGAATAGAGGGTGGAGTAGTGGTGGTGGTGGAGGTGACTGGGAAAAATCAGGATCAGACCGAGGAGGATTTGGTGGAGGCAGAGGCCGTGGAAGATGGAACCAAGAAGCTGGTTCAAATGACGGTGACAGTGGAGGGTGGAGTGGTGGTGGTGGCGGACATAGAGGCAGAGGCCGGGGAAGATGGAATCAAAATGGTGATTCAAATTCAAATGATGGTGACAATGGAGGTTGGAGTAGCGGCGGTGGCAGAGGCGGATTTGGAGGTGGTAGAGGCCGTGGAAGATGGAATCAAGAAGGTGATTCAAATTCAAATGATGGT

Protein sequence

MASKGKGIAKDTSSGKRKQREDNNGADLARKRRDRSVLQFFEDVAPEVGGESDDSDFLDDFMEEEFDTEPAFKNDAAKDQNIPFFPKEEEMNEEEFDRIMEEHYNQGPGLGAFAEENYENKNSTGRNPLQPSSRDTISLWKVKCMVGRERQSVFCLMQKFVDLHSFGTKLQIKSAFYVDHVKGFIYIEAPRQYDLIEACKGISGIYSTRIASVPENDISQLLTVRSRVSEVSVGTMARVKNGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMAAKFGGGVAAKKTTNPAPRLINSSELDEFRPLMQFRRDRETGKLFEFLDGMMLKDGYLFKKISLDSLNCWGVMPSEDELLKFKPPESNESNDLEWLSQLYGEKRKKKIIRTEKGGGKGEGTSGSSSMNSFADHDLVCFGRKDFGMILGTEKDDSYKILKEGPDGSIVVNVQRKELKSGPLEGKFTAADHNGKIISVSDNVKVLEGSLKDKQGIVKHVYRHTVFVYDENEVDNDGYFCCKSNMCEKIKISYDAPSGKDDDKGFSGFEDFSFSPKSPLSPKKPWAEKDSGREYNRDDKDGMFSIGQTLRIRVGPLKGYLCRVIAVRKKDVTVKLDSQQKVLTVRSDFLSEVYRKTSTVSLRIRISFCSEDTEFGSLKPFDILGNEGSSQDWMGGTGSSTGGDGWNSAGPSSERSPWPSFPESGTSNCPGSSSNPFGSESLDAQKDVEDSPWVSKTVPNASTSWGAAKSNVDDTVNDDQAPGWGKSESWDKATAKTSSDGNASGAWDKKVVPGGDCAGPTDQAEDKDKDSESGGWKKSQNASFGDENNSAEASADKWSSKNRSSGGWGDCNASTTVSEIKPAGKSNAGSSAWNISDENSSWNTQKQESNRGSWGKPSDRGGTSSSDWNKSTTTDGAGKKDGWNKPNLASEDESIGKKGWGQGNEANDSGNKWQSSGSDGGKKWGTNESERDGGNKSWNTSKSSDGDTGGGNSAIWKDKSDSSSLTAPKGDQWGGDWDKQHSSNDTKASEDNSPWNKKSVESGKDGERKDQGSGWNVGKSSGGDSASGWGQTSKEAGSSDQAGSWGSNWKKNSNAGNEDSSWAKKSNWNSGNESNDNQFTGDTSGHGSWGGDGSDRGGFRGRGGFRGRGERGRFGGRGRSDRGGSDRGGFGGRGRGRWNSEGGSNDGENRGWSSGGGGGDWEKSGSDRGGFGGGRGRGRWNQEAGSNDGDSGGWSGGGGGHRGRGRGRWNQNGDSNSNDGDNGGWSSGGGRGGFGGGRGRGRWNQEGDSNSNDG
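
The relationship between the coding sequence and the protein sequence listed above can be checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code (a reasonable but unconfirmed assumption for this nuclear gene); `translate` and `CODON_TABLE` are illustrative names, not part of any database tool. Only the first 33 nucleotides of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X';
    trailing bases that do not fill a codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nt of the CDS shown on this page.
print(translate("ATGGCGTCCAAGGGCAAGGGGATTGCCAAGGAC"))
```

The result, MASKGKGIAKD, matches the first eleven residues of the protein sequence above.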
Homology
BLAST of MC04g1461 vs. ExPASy Swiss-Prot
Match: F4JW79 (Protein RNA-directed DNA methylation 3 OS=Arabidopsis thaliana OX=3702 GN=RDM3 PE=1 SV=1)

HSP 1 Score: 658.3 bits (1697), Expect = 1.8e-187
Identity = 546/1363 (40.06%), Positives = 747/1363 (54.81%), Query Frame = 0

Query: 1    MASKGKG---IAKDTSSGKRKQREDNNGAD---LARKRRDRSVLQFFEDVAP--EVGGES 60
            M  KGKG      D+ SG +K++      D     +KR++  VLQFFE+ A     GG S
Sbjct: 1    MDRKGKGKQVAGSDSYSGGQKRKNSVEFRDEGLRIKKRKNPEVLQFFEESAEVGYYGGSS 60

Query: 61   DDSD----FLDDFMEEEFDTEPAFK-NDAAKDQNIPFFPKEEEMNEEEFDRIMEEHYNQG 120
            D+ D    FL+D ME+E + E + K     K ++   FPKEE++NEEEFDRIMEE Y  G
Sbjct: 61   DEDDDGLGFLND-MEDEPEVEESSKAGKGEKGKSSFVFPKEEDLNEEEFDRIMEERYKPG 120

Query: 121  PGLGAFAEENYENKNSTGRNPLQPSSRDTISLWKVKCMVGRERQSVFCLMQKFVDLHSFG 180
             G   +A+++   K++   + L P+S+D   +WKVKC +GRER+SVFCLM KFV+L   G
Sbjct: 121  SGFLRYADDDI--KDAIEMDALAPTSKDP-PIWKVKCAIGRERRSVFCLMHKFVELRKIG 180

Query: 181  TKLQIKSAFYVDHVKGFIYIEAPRQYDLIEACKGISGIYSTRIASVPENDISQLLTVRSR 240
            TKL+I S F VDHVKGFI+IEA +++D++EACK + GIY+TR+  +P+ +   LLTV+ +
Sbjct: 181  TKLEIISVFSVDHVKGFIFIEADKEHDVLEACKSLVGIYATRMVLLPKAETPNLLTVQKK 240

Query: 241  VSEVSVGTMARVKNGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMAAKFGGGVAAKKT 300
              +VS GT ARVKNGKYKGDLAQIVAV++ R +A +KL+PRID+QA+  K+GGGV  +K 
Sbjct: 241  TKKVSEGTWARVKNGKYKGDLAQIVAVSDTRNKALIKLIPRIDIQALTQKYGGGVTVQKG 300

Query: 301  TNPAPRLINSSELDEFRPLMQFRRDRETGKLFEFLDGMMLKDGYLFKKISLDSLNCWGVM 360
              PAPRLI+SSEL+EFRPL+Q RRDR+TG  FE LD +MLKDGYL+KK+SLDS++ WGV+
Sbjct: 301  QTPAPRLISSSELEEFRPLIQVRRDRDTGITFEHLDSLMLKDGYLYKKVSLDSISSWGVI 360

Query: 361  PSEDELLKFKPPESNESNDLEWLSQLYGEKRKKKIIRTEKGGGKGEGTSGS--------- 420
            P++DELLKF P +  E+ D+EW+S++YGE+RKKKI+ T + GGKGEG+ G          
Sbjct: 361  PTKDELLKFTPVDRKETGDVEWISEIYGEERKKKILPTCREGGKGEGSGGGKGEGSGGGK 420

Query: 421  ---------------SSMNSFADHDLVCFGRKDFGMILGT-EKDDSYKILKEGPDGSIVV 480
                            S +S+  ++LVCF RKDFG+I+G  +K D YK+LKEG DG +VV
Sbjct: 421  GEGSRGGKGEGSSDFKSESSYELYNLVCFSRKDFGLIVGVDDKGDGYKVLKEGIDGPVVV 480

Query: 481  NVQRKELKSGPLEGKFTAADHNGKIISVSDNVKVLEGSLKDKQGIVKHVYRHTVFVYDEN 540
             V +KE+++GP + KFTA D N K ISV+D VK+ +G  + KQG+V+ VYR  +F+YDE+
Sbjct: 481  TVGKKEMQNGPFDSKFTALDLNKKQISVNDVVKISKGPSEGKQGVVRQVYRGIIFLYDES 540

Query: 541  EVDNDGYFCCKSNMCEKIKISYDAPSGKDDDKGFSGFEDFSFSPKSPLSPKKPWAEKDSG 600
            E +N GYFCCKS  CEK+K+  +  + K      + FEDF  SPKSPLSP+K W  ++  
Sbjct: 541  EEENGGYFCCKSQSCEKVKLFTEESNEKTGGFDGTAFEDFVSSPKSPLSPEKEWQPRERY 600

Query: 601  REYNRDDKDGMFSIGQTLRIRVGPLKGYLCRVIAVRKKDVTVKLDSQQKVLTVRSDFLSE 660
               N+ D    +SIGQ LRIRVGPLKGYLCRVIA+R  DVTVKLDSQ K+ TV+S+ L+E
Sbjct: 601  NSSNQGDIGSTYSIGQKLRIRVGPLKGYLCRVIALRYSDVTVKLDSQHKIFTVKSEHLAE 660

Query: 661  VYRKTSTVSLRIRISFCSEDTEFGSLKPFDILGNEGSSQDWMGGTGSSTGGDGWNSAGPS 720
            V  + + +S        S D   GS +PF +LG E S+ DW  G G+S+ G  WN  GPS
Sbjct: 661  VRDRNTVLS-------TSGDAGTGSFQPFGMLGTESSTGDWAIGAGTSSEGGNWNIGGPS 720

Query: 721  SERSPWPSFPESGTSNCPGSSSNPFGSESLDAQKDVEDSPWVSKTVPNASTS-WGAAKSN 780
            ++     +   +    C     NP+G       K   D   VS TV + +TS W  A + 
Sbjct: 721  TDSHESLNIERNMVQLC--REKNPWG-----GSKPTSD---VSPTVADDNTSAWANAAAE 780

Query: 781  VDDTVNDDQAPG---WGKSESWDKATAKTSSDGNAS----GAWDKKVVPGGDCAGPTDQA 840
                   DQ  G   WGK+ + +  T     D +AS     +W+K+   G   +   D  
Sbjct: 781  NKPASASDQPGGWNPWGKTPASEAGTVSGWGDTSASNVEASSWEKQ---GASTSNVADLG 840

Query: 841  EDKDKDSESGGWKKSQNASFGDENNSAEASADK----WSSKNRSSG--GWG--DCNASTT 900
                    SGG K+ +++ +G    ++E+S  K    W  K  S G   WG  D N+S +
Sbjct: 841  SWGTHGGSSGGNKQDEDSVWGKLCEASESSQKKEESSWGKKGGSDGESSWGNKDGNSSAS 900

Query: 901  VSEIKPAGKSNAGSSAWNISDENSSWNTQKQESNRGSWGKPSDRGGTSSSDWNKSTTTDG 960
              +    G+ + GS     S   S+W+ Q      G +G    + G  SS WNKS     
Sbjct: 901  KKDGVSWGQQDKGSDE---SKGGSAWSNQ-----CGDFGSGKKKDG--SSGWNKSAEDSN 960

Query: 961  AGKK--DGWNKPNLASEDESIGKKG-----WGQGNEANDSGNKWQSSGSDGGKKWG-TNE 1020
            A  K    W +PN   +  S GKKG     WG+ ++    G K   +  DGG  WG  ++
Sbjct: 961  ANSKGVPDWGQPN---DGSSWGKKGDGAASWGKKDDGGSWGKKDDGNKDDGGSSWGKKDD 1020

Query: 1021 SERDGGNKSWNTSKSSDGDTGGG----NSAIWKDKSDSSSLTAPKGDQWGGDWDKQHSSN 1080
             ++D G  SW   K  DG +  G      + W  K D  SL   K D  G  W K+    
Sbjct: 1021 GQKDDGGSSW--EKKFDGGSSWGKKDDGGSSWGKKDDGGSLWGKK-DDGGSSWGKEDDGG 1080

Query: 1081 DT--KASEDNSPWNKKSVESGKDGERKDQGSGWNVGKSSGGDSASGWGQTSKEAGSSDQA 1140
                K  +  S W KK       G++ D GS W   K  GG S   + +  +  G     
Sbjct: 1081 SLWGKKDDGESSWGKKDDGESSWGKKDDGGSSWG-KKDEGGYSEQTFDRGGRGFGGRRGG 1140

Query: 1141 GSWG--SNWKKNSNAGNED--SSWAKKSNWNSGNESNDNQFTGDTSGHGSWGGDGSDRGG 1200
            G  G    + + S+ GN +  + W+K S  +S  + +     GD  G  SWG +    GG
Sbjct: 1141 GRRGGRDQFGRGSSFGNSEDPAPWSKPSGGSSWGKQD-----GD-GGGSSWGKENDAGGG 1200

Query: 1201 FRGRGGFRGRGERGRFGGRGRSDRGGSDRGGFGGRGRG-RWNSEGGSNDGENRGWSSGGG 1260
                 G +  G    +G +     GGS  G     G G  W  +    DG + G   GGG
Sbjct: 1201 --SSWGKQDNGVGSSWGKQNDGSGGGSSWGKQNDAGGGSSWGKQDSGGDGSSWGKQDGGG 1260

Query: 1261 --GGDWEKSGSDRGGFGGGR-----GRGRWNQEAGSNDGDSGGWSGGGGGHRGRGRGRWN 1282
              G  W K  +  GG   G+     G   W ++ G   G S G   GGGG  G   G+ N
Sbjct: 1261 DSGSAWGKQNNTSGGSSWGKQSDAGGGSSWGKQDGGGGGSSWGKQDGGGG-SGSAWGKQN 1313
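
The percentages in the HSP summary for the F4JW79 match are simply the matched-column counts divided by the alignment length. The sketch below recomputes them from a summary line; `parse_hsp_stats` is a hypothetical helper written for this page's text layout, not part of BLAST or any BLAST parsing library.

```python
import re


def parse_hsp_stats(line: str) -> dict:
    """Extract 'Label = matched/total' pairs from an HSP summary line.

    Returns {label: (matched, total, percent)}, with the percentage
    recomputed from the counts and rounded to two decimals.
    """
    stats = {}
    for label, matched, total in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        matched_n, total_n = int(matched), int(total)
        stats[label] = (matched_n, total_n, round(100 * matched_n / total_n, 2))
    return stats


summary = "Identity = 546/1363 (40.06%), Positives = 747/1363 (54.81%), Query Frame = 0"
for label, (matched, total, percent) in parse_hsp_stats(summary).items():
    print(f"{label}: {matched}/{total} = {percent}%")
```

The recomputed values agree with the reported 40.06% identity and 54.81% positives over 1363 aligned columns, which include gap positions.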

BLAST of MC04g1461 vs. ExPASy Swiss-Prot
Match: Q9STN3 (Putative transcription elongation factor SPT5 homolog 1 OS=Arabidopsis thaliana OX=3702 GN=At4g08350 PE=1 SV=2)

HSP 1 Score: 266.5 bits (680), Expect = 1.5e-69
Identity = 248/844 (29.38%), Positives = 378/844 (44.79%), Query Frame = 0

Query: 4   KGKGIAKDTSSGKRKQREDNN---------GADLARKRRDRSVLQFFEDVAPEVG--GES 63
           +G+    D  + +  Q ED++         G   A KR+  S   F +  A +V    E 
Sbjct: 45  RGRSNFIDDYAEEDSQEEDDDDEDYGSSRGGKGAASKRKKPSASIFLDREAHQVDDEDEE 104

Query: 64  DDSDFLDDFMEEEFDTEPAFKNDAAKDQNIPFFPKEE-EMNEEEFDRIMEEHYNQGPGLG 123
           ++ +  DDF+ +     P  + D   ++   F P++E + + E+ +R ++E ++      
Sbjct: 105 EEDEAEDDFIVDNGTDLPDERGDRRYERR--FLPRDENDEDVEDLERRIQERFS-----S 164

Query: 124 AFAEENYENKNSTGRNPLQPSSRDTISLWKVKCMVGRERQSVFCLMQKFVDLHSFGTKLQ 183
              EE  E      +  L PS RD   LW VKC +GRER+   CLMQKF+D    G  LQ
Sbjct: 165 RHHEEYDEEATEVEQQALLPSVRDP-KLWMVKCAIGREREVAVCLMQKFIDR---GADLQ 224

Query: 184 IKSAFYVDHVKGFIYIEAPRQYDLIEACKGISGIYST-RIASVPENDISQLLTVRSRVSE 243
           I+S   +DH+K FIY+EA ++  + EA KG+  IY+  +I  VP  +++ +L+V S+  +
Sbjct: 225 IRSVVALDHLKNFIYVEADKEAHVKEAIKGMRNIYANQKILLVPIREMTDVLSVESKAID 284

Query: 244 VSVGTMARVKNGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMAAKFGG-GVAAKKTTN 303
           +S  T  R+K G YKGDLA++V V+N R+R TVKL+PRIDLQA+A+K  G  V+ KK   
Sbjct: 285 LSRDTWVRMKIGTYKGDLAKVVDVDNVRQRVTVKLIPRIDLQALASKLDGREVSKKKAFV 344

Query: 304 PAPRLINSSELDEFRPLMQFRRDRETGKLFEFLDGMMLKDGYLFKKISLDSLNCWGVMPS 363
           P PR +N  E  E    ++ RRD  TG  FE + GM+ KDG+ +K++SL S+    V P+
Sbjct: 345 PPPRFMNIDEARELHIRVERRRDHMTGDYFENIGGMLFKDGFHYKQVSLKSITVQNVTPT 404

Query: 364 EDELLKFKPPESNESNDLEWLSQLYGEKRK-----------------------------K 423
            DEL KF  P  N   D   LS L+  ++K                              
Sbjct: 405 FDELEKFNKPSENGEGDFGGLSTLFANRKKGHFMKGDAVIVIKGDLKNLKGWVEKVDEEN 464

Query: 424 KIIRTEKGG--------------------------GKGEG-------------------- 483
            +IR+E  G                          G  EG                    
Sbjct: 465 VLIRSEVKGLPDPLAVNERELCKYFEPGNHVKVVSGTHEGATGMVVKVDQHVLIILSDTT 524

Query: 484 -----------------TSGSSSMNSFADHDLVCFGRKDFGMILGTEKDDSYKILKEGPD 543
                            T+G + +  +  HDLV      FG+I+  E ++++++LK  PD
Sbjct: 525 KEHVRVFADHVVESSEVTTGVTKIGDYELHDLVLLDNLSFGVIIRLE-NEAFQVLKGVPD 584

Query: 544 GSIVVNVQRKELKSGPLEGKFTAADHNGKIISVSDNVKVLEGSLKDKQGIVKHVYRHTVF 603
              V  V+ +E+K   LE K    D    +I+V D+V+V+EG  K KQG VKH+Y+  +F
Sbjct: 585 RPEVALVKLREIKC-KLEKKINVQDRYKNVIAVKDDVRVIEGPSKGKQGPVKHIYKGVLF 644

Query: 604 VYDENEVDNDGYFCCKSNMCEKIKISYDAPSGKDDDKGFSGFEDFSFSPKSPLSPKKPWA 663
           +YD + +++ G+ C K   C  +  S    +    D   S + +F      P SP +   
Sbjct: 645 IYDRHHLEHAGFICAKCTSCIVVGGSRSGANRNGGD-SLSRYGNFKAPAPVPSSPGR--F 704

Query: 664 EKDSGREYNRD--------DKDGMFSIGQTLRIRVGPLKGYLCRVIAVRKKDVTVKLDSQ 712
           ++  G  YN           +     +G T++IR+GP KGY   V+ V+   V V+L  +
Sbjct: 705 QRGRGGGYNNSGGRHGGGRGRGDDSLLGTTVKIRLGPFKGYRGPVVEVKGNSVRVEL--E 764

BLAST of MC04g1461 vs. ExPASy Swiss-Prot
Match: O80770 (Putative transcription elongation factor SPT5 homolog 2 OS=Arabidopsis thaliana OX=3702 GN=At2g34210 PE=3 SV=2)

HSP 1 Score: 235.3 bits (599), Expect = 3.7e-60
Identity = 238/846 (28.13%), Positives = 371/846 (43.85%), Query Frame = 0

Query: 14  SGKRKQR--EDNNGADLARKRRDRSVLQFFE-DVAPEVGGESDDSDFLD--------DF- 73
           SGK++ R   D++G   ++K+   S    +E +V  +V  + DD D  D        DF 
Sbjct: 36  SGKKRGRSNSDSDGRRGSKKKSSGSAFIDWEVEVDDDVEDDDDDVDVEDGKQQLKFGDFS 95

Query: 74  ----MEEEFDTEPAFKNDAAKDQNIPFFPKEEEMNEEEFDRIMEEHYNQGPGLGAFAEEN 133
               +  E D      +   +     F P EE+++E E  R +E    +      +A+++
Sbjct: 96  LCFIVSGEADLPNEDSDHRRQYYQRGFHPHEEDVDELE-KRTLERLSTK------YAKDD 155

Query: 134 YE--NKNSTGRNPLQPSSRDTISLWKVKCMVGRERQSVFCLMQKFVDLHSFGTKLQIKSA 193
           YE  + N   +  L PS RD   LW VKC +GRER+   CLMQK VD    G++ +I+SA
Sbjct: 156 YELDDVNDVDQQALLPSVRDP-KLWLVKCAIGREREVAVCLMQKIVDR---GSEFKIRSA 215

Query: 194 FYVDHVKGFIYIEAPRQYDLIEACKGISGIYST-RIASVPENDISQLLTVRSRVSEVSVG 253
             +DH++ ++YIEA  +  + EA KG+  IY+  +I  VP  +++ +L+V S+  ++S  
Sbjct: 216 IALDHLQNYVYIEADMEAHVKEAIKGMRNIYANQKILLVPIKEMTAVLSVESKAIDLSRD 275

Query: 254 TMARVKNGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMAAKFGGGV-AAKKTTNPAPR 313
           +  R+K G YKGDLAQ+V V+N RKR TVKL+PRIDLQA+A K  G     KK   P PR
Sbjct: 276 SWVRMKLGIYKGDLAQVVDVDNVRKRVTVKLIPRIDLQALANKLEGTENVKKKAFAPPPR 335

Query: 314 LINSSELDEFRPLMQFRRDRETGKLFEFLDGMMLKDGYLFKKISLDSLNCWGVMPSEDEL 373
            +N  E  E    ++ RRD  TG  FE + GM+ KDG+L+KK+S  S+    V P+ DEL
Sbjct: 336 FMNIDEARELHIRVEHRRDPMTGDYFENIGGMLFKDGFLYKKVSTKSIAAQNVTPTFDEL 395

Query: 374 LKFKPPESNESNDLEWLSQLYGEKRK-----------------------------KKIIR 433
            +FK P  N   D    S L+  ++K                               +IR
Sbjct: 396 ERFKRPNENGEIDFVDESTLFANRKKGHFMKGDAVIVIKGDLKNLKGWIEKVDEENVLIR 455

Query: 434 TE-------------------------------KGGGKG--------------------- 493
           +E                                 GG G                     
Sbjct: 456 SEMKDLPNPIAVNGRELCKYFEPGNFVKVVSGIHEGGTGMIVKVDQHMLIILSDTTKEHI 515

Query: 494 -----------EGTSGSSSMNSFADHDLVCFGRKDFGMILGTEKDDSYKILKEGPDGSIV 553
                      E T G + +  +  HDLV      FG+IL  +  ++ +ILK  PD S V
Sbjct: 516 CVFADHVAKSAEVTKGVTKIGDYELHDLVILSDFSFGVILKLD-SEAIQILKGVPDSSEV 575

Query: 554 VNVQRKELKSGPLEGKFTAADHNGKIISVSDNVKVLEGSLKDKQGIVKHVYRHTVFVYDE 613
             V+  E+K   +  K    D    +++V D V+V+EG  K KQG V  +Y+  +F++D 
Sbjct: 576 SIVKASEIKY-KIWKKINVQDRYKNVVAVKDVVRVIEGPSKGKQGPVVQIYKGVLFIHDR 635

Query: 614 NEVDNDGYFCCKSNMCEKIKISYDAPSGKDDDKGFSGFEDFSFSPKSPLSPKKPWAEKDS 673
           + +++ G+ C + + C     ++  P+            D  ++P +             
Sbjct: 636 HNLEHTGFICTRCSSCVLAGGNFKTPALVPPSPRRFQRADMGYNPGA-------GGRHQG 695

Query: 674 GREYNRDDKDGMFSIGQTLRIRVGPLKGYLCRVIAVRKKDVTVKLDSQQKVLTVRSDFLS 733
           GR    DD      +G  ++IR+GP KGY  R++ V+ K V V+L++  K++TV    +S
Sbjct: 696 GRGRRGDD----HLVGTYVKIRLGPFKGYSGRLVEVKDKLVRVELEA--KIVTVERKAIS 755

BLAST of MC04g1461 vs. ExPASy Swiss-Prot
Match: O13936 (Transcription elongation factor spt5 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=spt5 PE=1 SV=1)

HSP 1 Score: 124.8 bits (312), Expect = 7.1e-27
Identity = 223/934 (23.88%), Positives = 383/934 (41.01%), Query Frame = 0

Query: 10  KDTSSGKRKQREDNNGADLARKRRDRSVLQFFEDVAPEVGGESDDSDFLDDFMEEEFDTE 69
           +D S G R++R  ++     R+ +   +    ++   E+  E D+    D F+EEE   +
Sbjct: 127 EDESGGGRRKRARHD-----RRNQFLDIEAEVDEDEEELEDEEDEIGREDGFIEEEVGAD 186

Query: 70  PAFKNDAAKDQNIPFFPKEEEMNEEEFDRIMEEHYNQGPGLGAFAEENYENKNSTGRNPL 129
             +  D  + + +    + +E+   + +R+ EE Y +  G    ++    + ++  +  L
Sbjct: 187 --YVGDDRRHRELD--RQRQELQSVDAERLAEE-YREKYGR---SQTVVGDTSNVPQRLL 246

Query: 130 QPSSRDTISLWKVKCMVGRERQSVFCLMQKFVDLHSFGTKLQIKSAFYVDHVKGFIYIEA 189
            PS  D  ++W V+C +G+E+  VF +M+K +DL    + L+I SAF  D + G+IY+EA
Sbjct: 247 LPSVNDP-NIWAVRCKIGKEKDIVFTIMRKAMDLQYTSSPLEIISAFQRDSLVGYIYVEA 306

Query: 190 PRQYDLIEACKGISGIYSTRIASVPENDISQLLTVRSRVSEVSVGTMARVKNGKYKGDLA 249
            +Q  +++A  G+  +Y+  +  VP  ++  LL V+ +V E+  G   R++ GKY GDLA
Sbjct: 307 RKQSHVLDALNGVLNVYTNNMILVPIKEMPDLLKVQKQVVELLPGAYVRIRRGKYAGDLA 366

Query: 250 QIVAVNNARKRATVKLVPRIDLQAMAAKFGGGVAAKKT-TNPAPRLINSSELDEFRPLMQ 309
           Q+  ++     A V++VPRID       +  G+  K + T P  RL N SE  +  P   
Sbjct: 367 QVDNLSENGLTARVRIVPRID-------YSDGLKRKNSATRPQARLFNESEAFKSNPSKF 426

Query: 310 FRRDRETGKLFEFLDGMMLKDGYLFKKISLDSLNCWGVMPSEDELLKFKPPESNESNDLE 369
            +R     +LF F +    +DG+L K I + SL   GV P+ DE+ KF P  +NE  DL 
Sbjct: 427 SKRG---PRLFLF-NNEEFEDGFLVKDIRISSLITEGVNPTLDEVSKFNP--NNEDLDLS 486

Query: 370 WLS----------------QLY-GEK--------------------------------RK 429
            L+                ++Y GE+                                RK
Sbjct: 487 SLALSVKGGHAEFQPGDHVEVYVGEQTGVSGVVENVRGSVITMVSSDGLRLDVPSRGLRK 546

Query: 430 K-------------------KIIRTEK-----------------GGGKGEGTSGSSSMNS 489
           +                    ++R  K                     GE +S  +  ++
Sbjct: 547 RFRHGDYVKVIAGKYKDDTGMVVRISKDEVTFLSDTLMTELTVFSRDLGEASSAQAVNSA 606

Query: 490 FADHDLVCFGRKDFGMILGTEKDDSYKILKEGPDGSIVVNVQRKELKSGPLEGKFTAADH 549
           +  HDLV         I   ++ D+YK++ +      V  V   ++       +  A D 
Sbjct: 607 YELHDLVQLDVNTVACIFSVDR-DTYKVIDQNGG---VRTVLASQITMRHSNRRGVATDR 666

Query: 550 NGKIISVSDNVKVLEGSLKDKQGIVKHVYRHTVFVYDENEVDNDGYFCCKSNMCEKIKIS 609
           NG  I + D VK + G  + KQG + H+YR  VF+++ +  +N+G F  +S     I   
Sbjct: 667 NGAEIRIGDKVKEVGG--EGKQGTILHIYRAFVFLHNRDIAENNGVFSARSRNVATIAA- 726

Query: 610 YDAPSGKDDDKGFSGFEDFS-FSPKSPLSPKKPWAEKDSGREYNRDDKDGMFSIGQTLRI 669
                     KG     D +  +P     P  P    +  R   RD      +IG T+RI
Sbjct: 727 ----------KGARISADLTKMNPALSNGPALP-PVANLKRTIGRDK-----AIGATVRI 786

Query: 670 RVGPLKGYLCRVIAVRKKDVTVKLDSQQKVLTVRSDFLSEVYRKTSTVSLRIRISFCSED 729
           R GP+KG L  +      +  V+L +  K++T+  +    +   T T  L   IS+    
Sbjct: 787 RRGPMKGLLGVIKDTTDANARVELHTGNKMVTIPKE---NLLYTTKTGEL---ISYTEFI 846

Query: 730 TEFGSLKPFDILGNEGSS-QDWMGGTGSSTGGDG-----WNSAGPSSERSPWPSFPESGT 789
                ++P  I   +G +  +W  G  +    +G     WN+    S    W S      
Sbjct: 847 ERSRGIRPGSISTADGPNVPNWAQGARTPAVANGSRTPAWNT---GSRTPAWNS------ 906

Query: 790 SNCPGSSSNPFGSESLDAQKDVEDSPWVSKTVPNASTSWGAAKSNVDDTVNDDQAPGW-- 841
               GS +  + S S           W S    N + +W A  S      + ++ P W  
Sbjct: 907 ----GSKTPAWNSGS-------RTPAWNS---GNKTPAWNAG-SRTPAWNSGNKTPAWNV 966

BLAST of MC04g1461 vs. ExPASy Swiss-Prot
Match: O55201 (Transcription elongation factor SPT5 OS=Mus musculus OX=10090 GN=Supt5h PE=1 SV=1)

HSP 1 Score: 119.4 bits (298), Expect = 3.0e-25
Identity = 191/827 (23.10%), Positives = 329/827 (39.78%), Query Frame = 0

Query: 18  KQREDNNGADLARKRRDRSVLQFFEDVAPEVGGESDDSDFLDDFMEEEFDTEPAFKNDAA 77
           ++ E+ +     +K R    +    DV  E   E    D  +D +E+E + E +  ++  
Sbjct: 54  EEEEEEDDDRPPKKPRHGGFILDEADVDDEYEDEDQWEDGAEDILEKE-EIEASNIDNVV 113

Query: 78  KDQNIPFFPKEEEMNEEEFDRIMEEHYNQGPGLGAFAEENY----ENKNSTGRNPLQPSS 137
            D++     + + +  ++ +  + E+Y +     +  E  Y    E  +   +  L P  
Sbjct: 114 LDEDRSGARRLQNLWRDQREEELGEYYMKKYAKSSVGETVYGGSDELSDDITQQQLLPGV 173

Query: 138 RDTISLWKVKCMVGRERQSVFCLMQKFVDLHSFGTKLQIKSAFYVDHVKGFIYIEAPRQY 197
           +D  +LW VKC +G ER +   LM+KF+      T LQIKS    +HVKG+IY+EA +Q 
Sbjct: 174 KDP-NLWTVKCKIGEERATAISLMRKFIAYQFTDTPLQIKSVVAPEHVKGYIYVEAYKQT 233

Query: 198 DLIEACKGIS----GIYSTRIASVPENDISQLLTVRSRVSEVSVGTMARVKNGKYKGDLA 257
            + +A +G+     G ++ ++  VP  +++ +L V   V+ +   +  R+K G YK D+A
Sbjct: 234 HVKQAIEGVGNLRLGYWNQQM--VPIKEMTDVLKVVKEVANLKPKSWVRLKRGIYKDDIA 293

Query: 258 QIVAVNNARKRATVKLVPRIDLQAMAAKFG---GGVAAKKTTNPAPRLINSSELDEFRPL 317
           Q+  V  ++   ++K++PRID   + A+          KK   P  RL ++ ++      
Sbjct: 294 QVDYVEPSQNTISLKMIPRIDYDRIKARMSLKDWFAKRKKFKRPPQRLFDAEKIRSLGGD 353

Query: 318 MQFRRDRETGKLFEFLDGMMLKDGYLFKKISLDSLNCWGVMPSEDELLKFKPPESNESND 377
           +        G    F      + G+LFK  ++ ++   GV P+  EL KF+  +  E  D
Sbjct: 354 V-----ASDGDFLIFEGNRYSRKGFLFKSFAMSAVITEGVKPTLSELEKFE--DQPEGID 413

Query: 378 LEWLSQLYGEKRKKKI-----IRTEKG-----GGKGEGTSGS-----------SSMNSF- 437
           LE +++  G++R+        +   +G      GK     G+             M  F 
Sbjct: 414 LEVVTESTGKEREHNFQPGDNVEVCEGELINLQGKVLSVDGNKITIMPKHEDLKDMLEFP 473

Query: 438 ----------ADHDLVCFGR--KDFGMILGTEKD-----------------DSYKILKEG 497
                      DH  V  GR   D G+I+  E++                    ++  E 
Sbjct: 474 AQELRKYFKMGDHVKVIAGRFEGDTGLIVRVEENFVILFSDLTMHELKVLPRDLQLCSET 533

Query: 498 PDG--------------------SIVVNVQRKELKSGPLEGKF----------------- 557
             G                     ++V ++R+  +   + GK                  
Sbjct: 534 ASGVDVGGQHEWGELVQLDPRTVGVIVRLERETFQVLNMHGKVVTVRHQAVTQKKDNRFA 593

Query: 558 TAADHNGKIISVSDNVKVLEGSLKDKQGIVKHVYRHTVFVYDENEVDNDGYFCCKSNMCE 617
            A D +   I V D VKV++G    ++G ++H+YR   F++ +  V+N G F CK+    
Sbjct: 594 VALDSDQNNIHVKDIVKVIDGPHSGREGEIRHLYRSFAFLHCKKLVENGGMFVCKAR--- 653

Query: 618 KIKISYDAPSGKDDDKGFSGFEDFSFSPKSP--LSPKKPWAE-------KDSGREYNRDD 677
                +   +G    +  +      F+P SP   SP  P AE          G    R  
Sbjct: 654 -----HLVLAGGSKPRDVTNLTVGGFTPMSPRISSPMHPSAEGQHGGFGSPGGMSRGRGR 713

Query: 678 KDGMFSIGQTLRIRVGPLKGYLCRVIAVRKKDVTVKLDSQQKVLTVRSDFLSEV--YRKT 734
           +D    IGQT+RI  GP KGY+  V    +    V+L S  + ++V    L+ V   R  
Sbjct: 714 RDNEL-IGQTVRISQGPYKGYIGVVKDATESTARVELHSTCQTISVDRQRLTTVDSQRPG 773

BLAST of MC04g1461 vs. NCBI nr
Match: XP_022133448.1 (protein RNA-directed DNA methylation 3, partial [Momordica charantia])

HSP 1 Score: 2343 bits (6071), Expect = 0.0
Identity = 1230/1288 (95.50%), Positives = 1230/1288 (95.50%), Query Frame = 0

Query: 1    MASKGKGIAKDTSSGKRKQREDNNGADLARKRRDRSVLQFFEDVAPEVGGESDDSDFLDD 60
            MASKGKGIAKDTSSGKRKQREDNNGADLARKRRDRSVLQFFEDVAPEVGGESDDSDFLDD
Sbjct: 1    MASKGKGIAKDTSSGKRKQREDNNGADLARKRRDRSVLQFFEDVAPEVGGESDDSDFLDD 60

Query: 61   FMEEEFDTEPAFKNDAAKDQNIPFFPKEEEMNEEEFDRIMEEHYNQGPGLGAFAEENYEN 120
            FMEEEFDTEPAFKNDAAKDQNIPFFPKEEEMNEEEFDRIMEEHYNQGPGLGAFAEENYEN
Sbjct: 61   FMEEEFDTEPAFKNDAAKDQNIPFFPKEEEMNEEEFDRIMEEHYNQGPGLGAFAEENYEN 120

Query: 121  KNSTGRNPLQPSSRDTISLWKVKCMVGRERQSVFCLMQKFVDLHSFGTKLQIKSAFYVDH 180
            KNSTGRNPLQPSSRDTISLWKVKCMVGRERQSVFCLMQKFVDLHSFGTKLQIKSAFYVDH
Sbjct: 121  KNSTGRNPLQPSSRDTISLWKVKCMVGRERQSVFCLMQKFVDLHSFGTKLQIKSAFYVDH 180

Query: 181  VKGFIYIEAPRQYDLIEACKGISGIYSTRIASVPENDISQLLTVRSRVSEVSVGTMARVK 240
            VKGFIYIEAPRQYDLIEACKGISGIYSTRIASVPENDISQLLTVRSRVSEVSVGTMARVK
Sbjct: 181  VKGFIYIEAPRQYDLIEACKGISGIYSTRIASVPENDISQLLTVRSRVSEVSVGTMARVK 240

Query: 241  NGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMAAKFGGGVAAKKTTNPAPRLINSSEL 300
            NGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMAAKFGGGVAAKKTTNPAPRLINSSEL
Sbjct: 241  NGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMAAKFGGGVAAKKTTNPAPRLINSSEL 300

Query: 301  DEFRPLMQFRRDRETGKLFEFLDGMMLKDGYLFKKISLDSLNCWGVMPSEDELLKFKPPE 360
            DEFRPLMQFRRDRETGKLFEFLDGMMLKDGYLFKKISLDSLNCWGVMPSEDELLKFKPPE
Sbjct: 301  DEFRPLMQFRRDRETGKLFEFLDGMMLKDGYLFKKISLDSLNCWGVMPSEDELLKFKPPE 360

Query: 361  SNESNDLEWLSQLYGEKRKKKIIRTEKGGGKGEGTSGSSSMNSFADHDLVCFGRKDFGMI 420
            SNESNDLEWLSQLYGEKRKKKIIRTEKGGGKGEGTSGSSSMNSFADHDLVCFGRKDFGMI
Sbjct: 361  SNESNDLEWLSQLYGEKRKKKIIRTEKGGGKGEGTSGSSSMNSFADHDLVCFGRKDFGMI 420

Query: 421  LGTEKDDSYKILKEGPDGSIVVNVQRKELKSGPLEGKFTAADHNGKIISVSDNVKVLEGS 480
            LGTEKDDSYKILKEGPDGSIVVNVQRKELKSGPLEGKFTAADHNGKIISVSDNVKVLEGS
Sbjct: 421  LGTEKDDSYKILKEGPDGSIVVNVQRKELKSGPLEGKFTAADHNGKIISVSDNVKVLEGS 480

Query: 481  LKDKQGIVKHVYRHTVFVYDENEVDNDGYFCCKSNMCEKIKISYDAPSGKDDDKGFSGFE 540
            LKDKQGIVKHVYRHTVFVYDENEVDNDGYFCCKSNMCEKIKISYDAPSGKDDDKGFSGFE
Sbjct: 481  LKDKQGIVKHVYRHTVFVYDENEVDNDGYFCCKSNMCEKIKISYDAPSGKDDDKGFSGFE 540

Query: 541  DFSFSPKSPLSPKKPWAEKDSGREYNRDDKDGMFSIGQTLRIRVGPLKGYLCRVIAVRKK 600
            DFSFSPKSPLSPKKPWAEKDSGREYNRDDKDGMFSIGQTLRIRVGPLKGYLCRVIAVRKK
Sbjct: 541  DFSFSPKSPLSPKKPWAEKDSGREYNRDDKDGMFSIGQTLRIRVGPLKGYLCRVIAVRKK 600

Query: 601  DVTVKLDSQQKVLTVRSDFLSEVYRKTSTVSLRIRISFCSEDTEFGSLKPFDILGNEGSS 660
            DVTVKLDSQQKVLTVRSDFLSEVYRKTSTVSL       SEDTEFGSLKPFDILGNEGSS
Sbjct: 601  DVTVKLDSQQKVLTVRSDFLSEVYRKTSTVSL-------SEDTEFGSLKPFDILGNEGSS 660

Query: 661  QDWMGGTGSSTGGDGWNSAGPSSERSPWPSFPESGTSNCPGSSSNPFGSESLDAQKDVED 720
            QDWMGGTGSSTGGDGWNSAGPSSERSPWPSFPESGTSNCPGSSSNPFGSESLDAQKDVED
Sbjct: 661  QDWMGGTGSSTGGDGWNSAGPSSERSPWPSFPESGTSNCPGSSSNPFGSESLDAQKDVED 720

Query: 721  SPWVSKTVPNASTSWGAAKSNVDDTVNDDQAPGWGKSESWDKATAKTSSDGNASGAWDKK 780
            SPWVSKTVPNASTSWGAAKSNVDDTVNDDQAPGWGKSESWDKATAKTSSDGNASGAWDKK
Sbjct: 721  SPWVSKTVPNASTSWGAAKSNVDDTVNDDQAPGWGKSESWDKATAKTSSDGNASGAWDKK 780

Query: 781  VVPGGDCAGPTDQAEDK-------------------------------DKDSESGGWKKS 840
            VVPGGDCAGPTDQAEDK                               DKDSESGGWKKS
Sbjct: 781  VVPGGDCAGPTDQAEDKWDKGKRVSSDNQTGNWGDGTSGKNEPSAWSRDKDSESGGWKKS 840

Query: 841  QNASFGDENNSAEASADKWSSKNRSSGGWGDCNASTTVSEIKPAGKSNAGSSAWNISDEN 900
            QNASFGDENNSAEASADKWSSKNRSSGGWGDCNASTTVSEIKPAGKSNAGSSAWNISDEN
Sbjct: 841  QNASFGDENNSAEASADKWSSKNRSSGGWGDCNASTTVSEIKPAGKSNAGSSAWNISDEN 900

Query: 901  SSWNTQKQESNRGSWGKPSDRGGT--------------------SSSDWNKSTTTDGAGK 960
            SSWNTQKQESNRGSWGKPSDRGGT                    SSSDWNKSTTTDGAGK
Sbjct: 901  SSWNTQKQESNRGSWGKPSDRGGTRSSESGRGGNQVRGGSSGQDSSSDWNKSTTTDGAGK 960

Query: 961  KDGWNKPNLASEDESIGKKGWGQGNEANDSGNKWQSSGSDGGKKWGTNESERDGGNKSWN 1020
            KDGWNKPNLASEDESIGKKGWGQGNEANDSGNKWQSSGSDGGKKWGTNESERDGGNKSWN
Sbjct: 961  KDGWNKPNLASEDESIGKKGWGQGNEANDSGNKWQSSGSDGGKKWGTNESERDGGNKSWN 1020

Query: 1021 TSKSSDGDTGGGNSAIWKDKSDSSSLTAPKGDQWGGDWDKQHSSNDTKASEDNSPWNKKS 1080
            TSKSSDGDTGGGNSAIWKDKSDSSSLTAPKGDQWGGDWDKQHSSNDTKASEDNSPWNKKS
Sbjct: 1021 TSKSSDGDTGGGNSAIWKDKSDSSSLTAPKGDQWGGDWDKQHSSNDTKASEDNSPWNKKS 1080

Query: 1081 VESGKDGERKDQGSGWNVGKSSGGDSASGWGQTSKEAGSSDQAGSWGSNWKKNSNAGNED 1140
            VESGKDGERKDQGSGWNVGKSSGGDSASGWGQTSKEAGSSDQAGSWGSNWKKNSNAGNED
Sbjct: 1081 VESGKDGERKDQGSGWNVGKSSGGDSASGWGQTSKEAGSSDQAGSWGSNWKKNSNAGNED 1140

Query: 1141 SSWAKKSNWNSGNESNDNQFTGDTSGHGSWGGDGSDRGGFRGRGGFRGRGERGRFGGRGR 1200
            SSWAKKSNWNSGNESNDNQFTGDTSGHGSWGGDGSDRGGFRGRGGFRGRGERGRFGGRGR
Sbjct: 1141 SSWAKKSNWNSGNESNDNQFTGDTSGHGSWGGDGSDRGGFRGRGGFRGRGERGRFGGRGR 1200

Query: 1201 SDRGGSDRGGFGGRGRGRWNSEGGSNDGENRGWSSGGGGGDWEKSGSDRGGFGGGRGRGR 1237
            SDRGGSDRGGFGGRGRGRWNSEGGSNDGENRGWSSGGGGGDWEKSGSDRGGFGGGRGRGR
Sbjct: 1201 SDRGGSDRGGFGGRGRGRWNSEGGSNDGENRGWSSGGGGGDWEKSGSDRGGFGGGRGRGR 1260

BLAST of MC04g1461 vs. NCBI nr
Match: XP_023524523.1 (protein RNA-directed DNA methylation 3 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1796 bits (4651), Expect = 0.0
Identity = 1015/1359 (74.69%), Positives = 1089/1359 (80.13%), Query Frame = 0

Query: 1    MASKGKGIAKDTSSGKRKQREDNNGADLARKRRDRSVLQFFEDVAPEVGGESDDSDFLDD 60
            MASKGKGIAKDTSSGKRK R+D + +  ARKRRDRSVLQFFEDV+PE+G  SDDSDFLDD
Sbjct: 1    MASKGKGIAKDTSSGKRKLRDDTDSS-AARKRRDRSVLQFFEDVSPELGAYSDDSDFLDD 60

Query: 61   FMEEEFDTEPAFKNDAAKDQNIPFFPKEEEMNEEEFDRIMEEHYNQGPGLGAFAEENYEN 120
            FMEEEF+  PAFKND +K QNIPFFPKEEEMNEEEFDR+MEEHY +GPGLGAFAEENYEN
Sbjct: 61   FMEEEFEPVPAFKNDDSKAQNIPFFPKEEEMNEEEFDRMMEEHYTRGPGLGAFAEENYEN 120

Query: 121  KNSTGRNPLQPSSRDTISLWKVKCMVGRERQSVFCLMQKFVDLHSFGTKLQIKSAFYVDH 180
            K STGRNP Q S+RD + LWK+KCMVGRERQSVFCLMQKFVDLHSFGTKLQIKSAF V+H
Sbjct: 121  KISTGRNPPQQSARDIVFLWKLKCMVGRERQSVFCLMQKFVDLHSFGTKLQIKSAFCVEH 180

Query: 181  VKGFIYIEAPRQYDLIEACKGISGIYSTRIASVPENDISQLLTVRSRVSEVSVGTMARVK 240
            +KGFIYIEAPRQYDLIEACKGI+GIYSTRIASVPENDISQLLTV+SRVSEV+VGTMARVK
Sbjct: 181  IKGFIYIEAPRQYDLIEACKGINGIYSTRIASVPENDISQLLTVQSRVSEVAVGTMARVK 240

Query: 241  NGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMAAKFGGGVAAKKTTNPAPRLINSSEL 300
            NGKYKGDLAQIVAVNNARKRATVKL+PRIDLQ+MA KFGGGV AKK TNPAPRLINSSEL
Sbjct: 241  NGKYKGDLAQIVAVNNARKRATVKLIPRIDLQSMAEKFGGGVVAKKATNPAPRLINSSEL 300

Query: 301  DEFRPLMQFRRDRETGKLFEFLDGMMLKDGYLFKKISLDSLNCWGVMPSEDELLKFKPPE 360
            +EFRPLMQFRRDRETGKLFEFLDGMMLKDGYL+KK+SLDSLNCWGVMPSEDELLKFKPPE
Sbjct: 301  EEFRPLMQFRRDRETGKLFEFLDGMMLKDGYLYKKVSLDSLNCWGVMPSEDELLKFKPPE 360

Query: 361  SNESNDLEWLSQLYGEKRKKKIIRTEKGGGKGEGTSGSSSMNSFADHDLVCFGRKDFGMI 420
            SNE+NDLEWLSQLYGEK+KKKIIRTEKGGGKGEG+SGSSSM SF DHDL+CFGRKDFGMI
Sbjct: 361  SNEANDLEWLSQLYGEKKKKKIIRTEKGGGKGEGSSGSSSMISFGDHDLICFGRKDFGMI 420

Query: 421  LGTEKDDSYKILKEGPDGSIVVNVQRKELKSGPLEGKFTAADHNGKIISVSDNVKVLEGS 480
            LGTEKDDSYKILK+GPDGS+VVNVQRKELKSGPL+ KFT+ DHNGKIISVSDNVKVLEGS
Sbjct: 421  LGTEKDDSYKILKDGPDGSVVVNVQRKELKSGPLDAKFTSVDHNGKIISVSDNVKVLEGS 480

Query: 481  LKDKQGIVKHVYRHTVFVYDENEVDNDGYFCCKSNMCEKIKISYDAPSGKDDDKGFSGFE 540
            LKDKQGIVKHVYRHTVFVYDENEVDNDGYFCCKSN+C KIKISYDAPSGK+DDKGFSGFE
Sbjct: 481  LKDKQGIVKHVYRHTVFVYDENEVDNDGYFCCKSNLCAKIKISYDAPSGKEDDKGFSGFE 540

Query: 541  DFSFSPKSPLSPKKPWAEKDSGREYNRDDK-DGMFSIGQTLRIRVGPLKGYLCRVIAVRK 600
            DFS SPKSPLSPKKPWAEK+ GREYNRDD+ DGMFSIGQTLRIRVGPLKGYLCR IAVRK
Sbjct: 541  DFSSSPKSPLSPKKPWAEKE-GREYNRDDRGDGMFSIGQTLRIRVGPLKGYLCRAIAVRK 600

Query: 601  KDVTVKLDSQQKVLTVRSDFLSEVYRKTSTVSLRIRISFCSEDTEFGSLKPFDILGNEGS 660
            +DVTVKLDSQQKVLTVRSD L+EV+RK+S VS+       SED EF SLKPFDILGNEGS
Sbjct: 601  RDVTVKLDSQQKVLTVRSDLLTEVHRKSSAVSV-------SEDPEFSSLKPFDILGNEGS 660

Query: 661  SQDWMGGTGSSTGGDGWNSAGPSSERSPWPSFPESGTSNCPGSSS-NPFGSESLDAQKDV 720
            SQDWMGGTGSS  GDGWNSAGPSSERSPWPSFPESGT NCPGSSS NPFG+E+LDA++DV
Sbjct: 661  SQDWMGGTGSSAAGDGWNSAGPSSERSPWPSFPESGTLNCPGSSSTNPFGTENLDAKEDV 720

Query: 721  EDSPWVSKTVPNASTSWGAAKSNVDDTVNDDQAPGWGKSESWDKATAKTSSDGNASGAWD 780
            EDSPWVSK+   A+TSWGAAKS+VD T NDDQA GWGKSESWDKATAKT SDGN SGAW 
Sbjct: 721  EDSPWVSKSTAEANTSWGAAKSSVD-TANDDQACGWGKSESWDKATAKTISDGNVSGAWG 780

Query: 781  KKVVPGGDCAGPTDQAE-------------------------DKDKDSESGGWKKSQNAS 840
            K VVP GD AG     E                          +DKDSESGGWKK+ + S
Sbjct: 781  KSVVPSGDSAGDKWDKEKHVTSDNQTGKWGGGTSGKNEFSEWSRDKDSESGGWKKNPSVS 840

Query: 841  FGDENNSAEASADKWSSKNRSSGGWGDCNASTTVSEIKPAGKSNAGS------------S 900
             GD N  AE SADKW SKNRS+G WGD N STTVSEI+ AGK N G             S
Sbjct: 841  VGDNNTPAETSADKWGSKNRSNGSWGDQNVSTTVSEIQTAGKGNVGGWTKPGAENKINPS 900

Query: 901  AWNISD-----ENSSWNTQKQESNRGSWGKPSDRGGTSSSDWNKSTTTDGAGKKDGWNKP 960
             WN        + S+W  Q +    G WGKP + G   SS WNKST              
Sbjct: 901  GWNEDTSMKGGQTSNWGNQDEA---GGWGKPMNVGDGGSSAWNKSTAC------------ 960

Query: 961  NLASEDESIGKKGWGQGNEANDSGNKWQSSGSDGGKKWGTNESERDGGNKSWNTSKSSDG 1020
                          G GN++      W+ S          NE ER+GGN+SWN SKSSDG
Sbjct: 961  --------------GDGNDS------WKRS----------NELEREGGNRSWNASKSSDG 1020

Query: 1021 DTGGGNSAIWKDKSDSSSLTAPKGDQWGGDWDKQHSSNDTKASEDNSPWNKKSVESGKDG 1080
            D     SAIWKDKSDSSSLTA KGDQWGG W+KQHSSN+TKASEDNSPWNKKSVESGKD 
Sbjct: 1021 D-----SAIWKDKSDSSSLTASKGDQWGGGWNKQHSSNETKASEDNSPWNKKSVESGKDN 1080

Query: 1081 ERKDQGSGWNVGKSSGGDSASGWGQTSKEAGSSDQAGSWGSNWKKNSNAGNEDSS----- 1140
            E ++QGSGWNVGK+SG DSASGWGQTSKEAGSSDQ  SWGSNWKK S+ GNEDSS     
Sbjct: 1081 EVENQGSGWNVGKTSG-DSASGWGQTSKEAGSSDQGSSWGSNWKKKSDTGNEDSSSAKKS 1140

Query: 1141 ----------WAKKSNWNSGNESNDNQFT----GDTSGHGSWGGDGSDRGGFRGRGGFRG 1200
                      W +KSNWNSGNE N N         T     W G+ SDRGGFRGRG FRG
Sbjct: 1141 NWSSGSGNSNWGEKSNWNSGNEFNANHSNDGAEAQTEVSNDWRGESSDRGGFRGRGSFRG 1200

Query: 1201 RGERGRFGGRGRSDRGG-----SDRGGFGGRGRGRWNSEGGSNDGENRGWSSGGGGGDWE 1260
            RGERGRFGGRGRSDRGG     SD GGFGGRGRGRWNSEGGSNDG+N+GWS  GGGGDWE
Sbjct: 1201 RGERGRFGGRGRSDRGGFGRGGSDGGGFGGRGRGRWNSEGGSNDGDNKGWS--GGGGDWE 1260

Query: 1261 KSGSDRGGFGGGRGRGRWNQEAGSNDGDSGGWSGGGG----------GHRGRGRGRWNQN 1280
            KS SDRGGFGG RGRGRWN+E GSNDG++ GWS GGG          G RGRGRGRWNQ 
Sbjct: 1261 KSSSDRGGFGG-RGRGRWNRERGSNDGENRGWSSGGGDRERSGSDRGGFRGRGRGRWNQE 1290

BLAST of MC04g1461 vs. NCBI nr
Match: KAG6602288.1 (Protein RNA-directed DNA methylation 3, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1794 bits (4647), Expect = 0.0
Identity = 1010/1349 (74.87%), Positives = 1086/1349 (80.50%), Query Frame = 0

Query: 1    MASKGKGIAKDTSSGKRKQREDNNGADLARKRRDRSVLQFFEDVAPEVGGESDDSDFLDD 60
            MASKGKGIAKDTSSGKRK R+D + +  ARKRRDRSVLQFFEDV+PE+G  SDDSDFLDD
Sbjct: 1    MASKGKGIAKDTSSGKRKLRDDTDSS-AARKRRDRSVLQFFEDVSPELGAYSDDSDFLDD 60

Query: 61   FMEEEFDTEPAFKNDAAKDQNIPFFPKEEEMNEEEFDRIMEEHYNQGPGLGAFAEENYEN 120
            FMEEEF+  PAFKND +K QNIPFFPKEEEMNEEEFDR+MEEHY +GPGLGAFAEENYEN
Sbjct: 61   FMEEEFEPVPAFKNDDSKAQNIPFFPKEEEMNEEEFDRMMEEHYTRGPGLGAFAEENYEN 120

Query: 121  KNSTGRNPLQPSSRDTISLWKVKCMVGRERQSVFCLMQKFVDLHSFGTKLQIKSAFYVDH 180
            K STGRNP Q S+RD ISLWKVKCMVGRERQSVFCLMQKFVDLHSFGTKLQIKSAF V+H
Sbjct: 121  KISTGRNPPQQSARDNISLWKVKCMVGRERQSVFCLMQKFVDLHSFGTKLQIKSAFCVEH 180

Query: 181  VKGFIYIEAPRQYDLIEACKGISGIYSTRIASVPENDISQLLTVRSRVSEVSVGTMARVK 240
            +KGFIY+EAPRQYDLIEACKGI+GIYSTRIASVPENDI QLLTV+SRVSEV+VGTMARVK
Sbjct: 181  IKGFIYVEAPRQYDLIEACKGINGIYSTRIASVPENDIPQLLTVQSRVSEVAVGTMARVK 240

Query: 241  NGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMAAKFGGGVAAKKTTNPAPRLINSSEL 300
            NGKYKGDLAQIVAVNNARKRATVKL+PRIDLQ+MA KFGGGV AKK TNPAPRLINSSEL
Sbjct: 241  NGKYKGDLAQIVAVNNARKRATVKLIPRIDLQSMAEKFGGGVVAKKATNPAPRLINSSEL 300

Query: 301  DEFRPLMQFRRDRETGKLFEFLDGMMLKDGYLFKKISLDSLNCWGVMPSEDELLKFKPPE 360
            +EFRPLMQFRRDRETGKLFEFLDGMMLKDGYL+KK+SLDSLNCWGVMPSEDELLKFKPPE
Sbjct: 301  EEFRPLMQFRRDRETGKLFEFLDGMMLKDGYLYKKVSLDSLNCWGVMPSEDELLKFKPPE 360

Query: 361  SNESNDLEWLSQLYGEKRKKKIIRTEKGGGKGEGTSGSSSMNSFADHDLVCFGRKDFGMI 420
            SNE+NDLEWLSQLYGEK+KKKIIRTEKGGGKGEG+SGSSSM SF DHDLVCFGRKDFGMI
Sbjct: 361  SNEANDLEWLSQLYGEKKKKKIIRTEKGGGKGEGSSGSSSMISFGDHDLVCFGRKDFGMI 420

Query: 421  LGTEKDDSYKILKEGPDGSIVVNVQRKELKSGPLEGKFTAADHNGKIISVSDNVKVLEGS 480
            LG EKDDSYKILK+GPDGS+VVNVQRKELKSGPL+ KFT+ D NGKIISVSDNVKVLEGS
Sbjct: 421  LGMEKDDSYKILKDGPDGSVVVNVQRKELKSGPLDAKFTSVDLNGKIISVSDNVKVLEGS 480

Query: 481  LKDKQGIVKHVYRHTVFVYDENEVDNDGYFCCKSNMCEKIKISYDAPSGKDDDKGFSGFE 540
            LKD+QGIVKHVYRHTVFVYDENEVDNDGYFCCKSN+C KIKISYDAPSGK+DDKGFSGFE
Sbjct: 481  LKDEQGIVKHVYRHTVFVYDENEVDNDGYFCCKSNLCAKIKISYDAPSGKEDDKGFSGFE 540

Query: 541  DFSFSPKSPLSPKKPWAEKDSGREYNRDDK-DGMFSIGQTLRIRVGPLKGYLCRVIAVRK 600
            DFS SPKSPLSPKKPWAEK+ GREYNRDD+ DGMFSIGQTLRIRVGPLKGYLCRVIAVRK
Sbjct: 541  DFSSSPKSPLSPKKPWAEKE-GREYNRDDRGDGMFSIGQTLRIRVGPLKGYLCRVIAVRK 600

Query: 601  KDVTVKLDSQQKVLTVRSDFLSEVYRKTSTVSLRIRISFCSEDTEFGSLKPFDILGNEGS 660
            +DVTVKLDSQQKVLTVRSD L+EV+RK+S VS+       SED EF SLKPFDILGNEGS
Sbjct: 601  RDVTVKLDSQQKVLTVRSDLLTEVHRKSSAVSV-------SEDPEFSSLKPFDILGNEGS 660

Query: 661  SQDWMGGTGSSTGGDGWNSAGPSSERSPWPSFPESGTSNCPGSSS-NPFGSESLDAQKDV 720
            SQDWMGGTGSS  GDGWNSAGPSSERSPWPSFPESGT NCPGSSS NPFGSE LDA+KDV
Sbjct: 661  SQDWMGGTGSSAAGDGWNSAGPSSERSPWPSFPESGTLNCPGSSSTNPFGSEDLDAKKDV 720

Query: 721  EDSPWVSKTVPNASTSWGAAKSNVDDTVNDDQAPGWGKSESWDKATAKTSSDGNASGAWD 780
            EDSPWVSK+   A+TSWGAA S+VD T NDDQA GWGKSESWDKATAKT SDGN SGAW 
Sbjct: 721  EDSPWVSKSTAEANTSWGAANSSVD-TANDDQACGWGKSESWDKATAKTISDGNVSGAWG 780

Query: 781  KKVVPGGDCAGP---------TDQ----------------AEDKDKDSESGGWKKSQNAS 840
            K VVP GD AG          +D                 A  +DKDSESGGWKK+ + S
Sbjct: 781  KSVVPSGDSAGDKWDKGKHVTSDNLTGKWGGETSGKNELSAWSRDKDSESGGWKKNPSVS 840

Query: 841  FGDENNSAEASADKWSSKNRSSGGWGDCNASTTVSEIKPAGKSNAGSSAWNISDENSSWN 900
             GD N  AE SADKW SKNRS+G WGD N STTVSEI+ AGK N G              
Sbjct: 841  VGDNNTPAETSADKWGSKNRSNGSWGDQNVSTTVSEIQTAGKDNVGG------------- 900

Query: 901  TQKQESNRGSWGKPSDRGGTSSSDWNKSTTTDGA-----GKKD---GWNKPNLASEDESI 960
                      W KP      + S WN+ T+  G      G +D   GW KP    +    
Sbjct: 901  ----------WIKPGAENKINPSGWNEDTSMKGGQTSNWGNRDEAGGWGKPMNVGDG--- 960

Query: 961  GKKGWGQGNEANDSGNKWQSSGSDGGKKWGTNESERDGGNKSWNTSKSSDGDTGGGNSAI 1020
            G   W +     D  + W+ S          NESER+GGN+SWN SKSSDGD     SAI
Sbjct: 961  GNSAWNKSTACGDGNDSWKKS----------NESEREGGNRSWNASKSSDGD-----SAI 1020

Query: 1021 WKDKSDSSSLTAPKGDQWGGDWDKQHSSNDTKASEDNSPWNKKSVESGKDGERKDQGSGW 1080
            WK+KSDSSSLTA KGDQWGG W+KQHSSN+TKASEDNSPWNKKSVESGKD E ++QGSGW
Sbjct: 1021 WKEKSDSSSLTASKGDQWGGGWNKQHSSNETKASEDNSPWNKKSVESGKDNELENQGSGW 1080

Query: 1081 NVGKSSGGDSASGWGQTSKEAGSSDQAGSWGSNWKKNSNAGNEDSS-------------- 1140
            NVGK+SG DSASGWGQTSKEAGSSDQ  SWGSNWKK S+ GNEDSS              
Sbjct: 1081 NVGKTSG-DSASGWGQTSKEAGSSDQGSSWGSNWKKKSDTGNEDSSSAKKSNWSSGSGNS 1140

Query: 1141 -WAKKSNWNSGNESNDNQFT----GDTSGHGSWGGDGSDRGGFRGRGGFRGRGERGRFGG 1200
             W +KSNWNSGNE N N         T     W G+ SDRGGFRGRG FRGRGERGRFGG
Sbjct: 1141 NWGEKSNWNSGNEFNANHSNDGAEAQTEVSNDWRGESSDRGGFRGRGSFRGRGERGRFGG 1200

Query: 1201 RGRSDRGG-----SDRGGFGGRGRGRWNSEGGSNDGENRGWSSGGGGGDWEKSGSDRGGF 1260
            RGRSDRGG     SDRGGFGGRGRGRWNSEGGSNDG+N+GWSSGGG  DWEKS SDRGGF
Sbjct: 1201 RGRSDRGGLGRGGSDRGGFGGRGRGRWNSEGGSNDGDNKGWSSGGG--DWEKSSSDRGGF 1260

Query: 1261 GGGRGRGRWNQEAGSNDGDSGGWSGGGG----------GHRGRGRGRWNQNGDSNSNDGD 1280
            GG RGRGRWN+E+GSNDG++ GWS GGG          G RGRGRGRWNQ G S + D +
Sbjct: 1261 GG-RGRGRWNRESGSNDGENRGWSSGGGDRERSGSDRGGFRGRGRGRWNQEGGSRNGD-N 1290

BLAST of MC04g1461 vs. NCBI nr
Match: XP_022957824.1 (protein RNA-directed DNA methylation 3-like [Cucurbita moschata])

HSP 1 Score: 1790 bits (4637), Expect = 0.0
Identity = 1010/1350 (74.81%), Positives = 1086/1350 (80.44%), Query Frame = 0

Query: 1    MASKGKGIAKDTSSGKRKQREDNNGADLARKRRDRSVLQFFEDVAPEVGGESDDSDFLDD 60
            MASKGKGIAKDTS+GKRK R+D + +  ARKRRDRSVLQFFEDV+PE+G  SDDSDFLDD
Sbjct: 1    MASKGKGIAKDTSAGKRKLRDDTDSS-AARKRRDRSVLQFFEDVSPELGAYSDDSDFLDD 60

Query: 61   FMEEEFDTEPAFKNDAAKDQNIPFFPKEEEMNEEEFDRIMEEHYNQGPGLGAFAEENYEN 120
            FMEEEF+  PAFKND +K QNIPFFPKEEEMNEEEFDR+MEEHY +GPGLGAFAEENYEN
Sbjct: 61   FMEEEFEPVPAFKNDDSKAQNIPFFPKEEEMNEEEFDRMMEEHYTRGPGLGAFAEENYEN 120

Query: 121  KNSTGRNPLQPSSRDTISLWKVKCMVGRERQSVFCLMQKFVDLHSFGTKLQIKSAFYVDH 180
            K STGRNP Q S+RD ISLWKVKCMVG ERQSVFCLMQKFVDLHSFGTKLQIKSAF V+H
Sbjct: 121  KISTGRNPPQQSARDNISLWKVKCMVGHERQSVFCLMQKFVDLHSFGTKLQIKSAFCVEH 180

Query: 181  VKGFIYIEAPRQYDLIEACKGISGIYSTRIASVPENDISQLLTVRSRVSEVSVGTMARVK 240
            +KGFIYIEAPRQYDLIEACKGI+GIYSTRIASVPENDISQLLTV+SRVSEV+VGTMARVK
Sbjct: 181  IKGFIYIEAPRQYDLIEACKGINGIYSTRIASVPENDISQLLTVQSRVSEVAVGTMARVK 240

Query: 241  NGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMAAKFGGGVAAKKTTNPAPRLINSSEL 300
            NGKYKGDLAQIVAVNNARKRATVKL+PRIDLQ+MA KFGGGV AKK TNPAPRLINSSEL
Sbjct: 241  NGKYKGDLAQIVAVNNARKRATVKLIPRIDLQSMAEKFGGGVVAKKATNPAPRLINSSEL 300

Query: 301  DEFRPLMQFRRDRETGKLFEFLDGMMLKDGYLFKKISLDSLNCWGVMPSEDELLKFKPPE 360
            +EFRPLMQFRRDRETGKLFEF DGMMLKDGYL+KK+SLDSLNCWGVMPSEDELLKFKPPE
Sbjct: 301  EEFRPLMQFRRDRETGKLFEFFDGMMLKDGYLYKKVSLDSLNCWGVMPSEDELLKFKPPE 360

Query: 361  SNESNDLEWLSQLYGEKRKKKIIRTEKGGGKGEGTSGSSSMNSFADHDLVCFGRKDFGMI 420
            SNE+NDLEWLSQLYGEK+KKKIIRTEKGGGKGEG+SGSSSM SF DHDLVCFGRKDFGMI
Sbjct: 361  SNEANDLEWLSQLYGEKKKKKIIRTEKGGGKGEGSSGSSSMISFGDHDLVCFGRKDFGMI 420

Query: 421  LGTEKDDSYKILKEGPDGSIVVNVQRKELKSGPLEGKFTAADHNGKIISVSDNVKVLEGS 480
            LGTEKDDSYKILK+GPDGS+VVNVQRKELKSGPL+ KFT+ D NGKIISVSDNVKVLEGS
Sbjct: 421  LGTEKDDSYKILKDGPDGSVVVNVQRKELKSGPLDAKFTSVDLNGKIISVSDNVKVLEGS 480

Query: 481  LKDKQGIVKHVYRHTVFVYDENEVDNDGYFCCKSNMCEKIKISYDAPSGKDDDKGFSGFE 540
            LKDKQGIVKHVYRHTVFVYDENEVDNDGYFCCKSN+C KIKISYDAPSGK+DDKGFSGFE
Sbjct: 481  LKDKQGIVKHVYRHTVFVYDENEVDNDGYFCCKSNLCAKIKISYDAPSGKEDDKGFSGFE 540

Query: 541  DFSFSPKSPLSPKKPWAEKDSGREYNRDDK-DGMFSIGQTLRIRVGPLKGYLCRVIAVRK 600
            DFS SPKSPLSP+KPWAEK+ GREYNRDD+ DGMFSIGQTLRIRVGPLKGYLCRVIAV K
Sbjct: 541  DFSSSPKSPLSPQKPWAEKE-GREYNRDDRGDGMFSIGQTLRIRVGPLKGYLCRVIAVHK 600

Query: 601  KDVTVKLDSQQKVLTVRSDFLSEVYRKTSTVSLRIRISFCSEDTEFGSLKPFDILGNEGS 660
            +DVTVKLDSQQKVLTVRSD L+EV+RK+S VS+       SED EF SLKPFDILGNEGS
Sbjct: 601  RDVTVKLDSQQKVLTVRSDLLAEVHRKSSAVSV-------SEDPEFSSLKPFDILGNEGS 660

Query: 661  SQDWMGGTGSSTGGDGWNSAGPSSERSPWPSFPESGTSNCPGSSS-NPFGSESLDAQKDV 720
            SQDWMGGTGSS   DGWNSAGPSSERSPWPSFPESGT NCPGSSS NPFGSE+LDA+KDV
Sbjct: 661  SQDWMGGTGSSAAADGWNSAGPSSERSPWPSFPESGTLNCPGSSSTNPFGSENLDAKKDV 720

Query: 721  EDSPWVSKTVPNASTSWGAAKSNVDDTVNDDQAPGWGKSESWDKATAKTSSDGNASGAWD 780
            EDSPWVSK+   A+TSWGAA S+VD T NDDQA GWGKSESWDKATAKT SDGN SGAW 
Sbjct: 721  EDSPWVSKSTAEANTSWGAANSSVD-TANDDQACGWGKSESWDKATAKTISDGNVSGAWG 780

Query: 781  KKVVPGGDCAGP---------TDQ----------------AEDKDKDSESGGWKKSQNAS 840
            K VVP GD AG          +D                 A  +DKDSESGGWKK+ + S
Sbjct: 781  KSVVPSGDSAGDKWDKGKHVTSDNQTGKWGGGTSGKNELSAWSRDKDSESGGWKKNPSVS 840

Query: 841  FGDENNSAEASADKWSSKNRSSGGWGDCNASTTVSEIKPAGKSNAGSSAWNISDENSSWN 900
             GD N  AE SADKW SKNRS+G WGD N STTVSEI+ AGK N G              
Sbjct: 841  VGDNNTPAETSADKWGSKNRSNGSWGDQNVSTTVSEIQTAGKDNVGG------------- 900

Query: 901  TQKQESNRGSWGKPSDRGGTSSSDWNKSTTTDGA-----GKKD---GWNKPNLASEDESI 960
                      W KP      + S WN+ T+  G      G +D   GW KP    +    
Sbjct: 901  ----------WTKPGAESKINPSGWNEDTSMKGGQTSNWGNRDEAGGWGKPMNVGDG--- 960

Query: 961  GKKGWGQGNEANDSGNKWQSSGSDGGKKWGTNESERDGGNKSWNTSKSSDGDTGGGNSAI 1020
            G   W +     D  + W+ S          NESER+GGN+SWN SKSSDGD     SAI
Sbjct: 961  GNSAWNKSTACGDGNDSWKKS----------NESEREGGNRSWNASKSSDGD-----SAI 1020

Query: 1021 WKDKSDSSSLTAPKGDQWGGDWDKQHSSNDTKASEDNSPWNKKSVESGKDGERKDQGSGW 1080
            WK+KSDSSSLTA KGDQWGG W+KQHSSN+TKASEDNSPWNKKSVESGKD E ++QGSGW
Sbjct: 1021 WKEKSDSSSLTASKGDQWGGGWNKQHSSNETKASEDNSPWNKKSVESGKDNELENQGSGW 1080

Query: 1081 NVGKSSGGDSASGWGQTSKEAGSSDQAGSWGSNWKKNSNAGNEDSS-------------- 1140
            NVGK+SG DSASGWGQTS+EAGSSDQ  SWGSNWKK S+ GNEDSS              
Sbjct: 1081 NVGKTSG-DSASGWGQTSREAGSSDQGSSWGSNWKKKSDTGNEDSSSAKKSNWSSGSGNS 1140

Query: 1141 -WAKKSNWNSGNESNDNQFT----GDTSGHGSWGGDGSDRGGFRGRGGFRGRGERGRFGG 1200
             W +KSNWNSGNE N N         T     W G+ SDRGGFRGRG FRGRGERGRFGG
Sbjct: 1141 NWGEKSNWNSGNEFNANHSNDGAEAQTEVSNDWRGESSDRGGFRGRGSFRGRGERGRFGG 1200

Query: 1201 RGRSDRGG-----SDRGGFGGRGRGRWNSEGGSNDGENRGWSSGGGGGDWEKSGSDRGGF 1260
            RGRSDRGG     SDRGGFGGRGRGRWNSEGGSNDG+N+GWSSGGG  DWEKS SDRGGF
Sbjct: 1201 RGRSDRGGFGRGGSDRGGFGGRGRGRWNSEGGSNDGDNKGWSSGGG--DWEKSSSDRGGF 1260

Query: 1261 GGGRGRGRWNQEAGSNDGDSGGWSGGGG----------GHRGRGRGRWNQNGDSNSNDGD 1280
            GG RGRGRWN+E+GSNDG++ GWS GGG          G RGRGRGRWNQ G S   DGD
Sbjct: 1261 GG-RGRGRWNRESGSNDGENRGWSSGGGDRERSGSDRGGFRGRGRGRWNQEGGSR--DGD 1290

BLAST of MC04g1461 vs. NCBI nr
Match: KAG7032970.1 (Protein RNA-directed DNA methylation 3, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1789 bits (4633), Expect = 0.0
Identity = 1010/1350 (74.81%), Positives = 1086/1350 (80.44%), Query Frame = 0

Query: 1    MASKGKGIAKDTSSGKRKQREDNNGADLARKRRDRSVLQFFEDVAPEVGGESDDSDFLDD 60
            MASKGKGIAKDTSSGKRK R+D + +  ARKRRDRSVLQFFEDV+PE+G  SDDSDFLDD
Sbjct: 1    MASKGKGIAKDTSSGKRKLRDDTDSS-AARKRRDRSVLQFFEDVSPELGAYSDDSDFLDD 60

Query: 61   FMEEEFDTEPAFKNDAAKDQNIPFFPKEEEMNEEEFDRIMEEHYNQGPGLGAFAEENYEN 120
            FMEEEF+  PAFKND +K QNIPFFPKEEEMNEEEFDR+MEEHY +GPGLGAFAEENYEN
Sbjct: 61   FMEEEFEPVPAFKNDDSKAQNIPFFPKEEEMNEEEFDRMMEEHYTRGPGLGAFAEENYEN 120

Query: 121  KNSTGRNPLQPSSRDTISLWKVKCMVGRERQSVFCLMQKFVDLHSFGTKLQIKSAFYVDH 180
            K STGRNP Q S+RD ISLWKVKCMVGRERQSVFCLMQKFVDLHSFGTKLQIKSAF V+H
Sbjct: 121  KISTGRNPPQQSARDNISLWKVKCMVGRERQSVFCLMQKFVDLHSFGTKLQIKSAFCVEH 180

Query: 181  VKGFIYIEAPRQYDLIEACKGISGIYSTRIASVPENDISQLLTVRSRVSEVSVGTMARVK 240
            +KGFIYIEAPRQYDLIEACKGI+GIYSTRIASVPENDISQLLTV+SRVSEV+VGTMARVK
Sbjct: 181  IKGFIYIEAPRQYDLIEACKGINGIYSTRIASVPENDISQLLTVQSRVSEVAVGTMARVK 240

Query: 241  NGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMAAKFGGGVAAKKTTNPAPRLINSSEL 300
            NGKYKGDLAQIVAVNNARKRATVKL+PRIDLQ+MA KFGGGV AKK TNPAPRLINSSEL
Sbjct: 241  NGKYKGDLAQIVAVNNARKRATVKLIPRIDLQSMAEKFGGGVVAKKATNPAPRLINSSEL 300

Query: 301  DEFRPLMQFRRDRETGKLFEFLDGMMLKDGYLFKKISLDSLNCWGVMPSEDELLKFKPPE 360
            +EFRPLMQFRRDRETGKLFEF DGMMLKDGYL+KK+SLDSLNCWGVMPSEDELLKFKPPE
Sbjct: 301  EEFRPLMQFRRDRETGKLFEFFDGMMLKDGYLYKKVSLDSLNCWGVMPSEDELLKFKPPE 360

Query: 361  SNESNDLEWLSQLYGEKRKKKIIRTEKGGGKGEGTSGSSSMNSFADHDLVCFGRKDFGMI 420
            SNE+NDLEWLSQLYGEK+KKKIIRTEKGGGKGEG+SGSSSM SF DHDLVCFGRKDFGMI
Sbjct: 361  SNEANDLEWLSQLYGEKKKKKIIRTEKGGGKGEGSSGSSSMISFGDHDLVCFGRKDFGMI 420

Query: 421  LGTEKDDSYKILKEGPDGSIVVNVQRKELKSGPLEGKFTAADHNGKIISVSDNVKVLEGS 480
            LGTEKDDSYKILK+GPDGS+VVNVQRKELKSGPL+ KFT+ D NGKIISVSDNVKVLEGS
Sbjct: 421  LGTEKDDSYKILKDGPDGSVVVNVQRKELKSGPLDAKFTSVDLNGKIISVSDNVKVLEGS 480

Query: 481  LKDKQGIVKHVYRHTVFVYDENEVDNDGYFCCKSNMCEKIKISYDAPSGKDDDKGFSGFE 540
            LKDKQGIVKHVYRHTVFVYDENEVDNDGYFCCKSN+C KIKISYDAPSGK+DDKGFSGFE
Sbjct: 481  LKDKQGIVKHVYRHTVFVYDENEVDNDGYFCCKSNLCAKIKISYDAPSGKEDDKGFSGFE 540

Query: 541  DFSFSPKSPLSPKKPWAEKDSGREYNRDDK-DGMFSIGQTLRIRVGPLKGYLCRVIAVRK 600
            DFS SPKSPLSP+KPWAEK+    +NRDD+ DGMFSIGQTLRIRVGPLKGYLCRVIAVRK
Sbjct: 541  DFSSSPKSPLSPQKPWAEKEG---HNRDDRGDGMFSIGQTLRIRVGPLKGYLCRVIAVRK 600

Query: 601  KDVTVKLDSQQKVLTVRSDFLSEVYRKTSTVSLRIRISFCSEDTEFGSLKPFDILGNEGS 660
            +DVTVKLDSQQKVLTVRSD L+EV+RK+S VS+       SED EF SLKPFDILGNEGS
Sbjct: 601  RDVTVKLDSQQKVLTVRSDLLTEVHRKSSAVSV-------SEDPEFSSLKPFDILGNEGS 660

Query: 661  SQDWMGGTGSSTGGDGWNSAGPSSERSPWPSFPESGTSNCPGSSS-NPFGSESLDAQKDV 720
            SQDWMGGTGSS  GDGWNSAGPSSERSPWPSFPESGT NCPGSSS NPFGSE+LDA+KDV
Sbjct: 661  SQDWMGGTGSSAAGDGWNSAGPSSERSPWPSFPESGTLNCPGSSSTNPFGSENLDAKKDV 720

Query: 721  EDSPWVSKTVPNASTSWGAAKSNVDDTVNDDQAPGWGKSESWDKATAKTSSDGNASGAWD 780
            EDSPWVSK+   A+TSWGAA S+VD T NDDQA GWGKSESWDKATAKT SDGN SGAW 
Sbjct: 721  EDSPWVSKSTAEANTSWGAANSSVD-TANDDQACGWGKSESWDKATAKTISDGNVSGAWG 780

Query: 781  KKVVPGGDCAGP---------TDQ----------------AEDKDKDSESGGWKKSQNAS 840
            K VVP GD AG          +D                 A  +DKDSESGGWKK+ + S
Sbjct: 781  KSVVPSGDSAGDKWDKGKHVTSDNQTGKWGGGTSGKNELSAWSRDKDSESGGWKKNPSVS 840

Query: 841  FGDENNSAEASADKWSSKNRSSGGWGDCNASTTVSEIKPAGKSNAGSSAWNISDENSSWN 900
             GD N  AE SADKW SKNRS+G WGD N STTVSEI+ AGK N G              
Sbjct: 841  VGDNNTPAETSADKWGSKNRSNGSWGDQNVSTTVSEIQTAGKDNVGG------------- 900

Query: 901  TQKQESNRGSWGKPSDRGGTSSSDWNKSTTTDGA-----GKKD---GWNKPNLASEDESI 960
                      W KP      + S WN+ T+  G      G +D   GW KP    +    
Sbjct: 901  ----------WTKPGAENKINPSGWNEDTSMKGGQTSNWGNRDEAGGWGKPMNVGDG--- 960

Query: 961  GKKGWGQGNEANDSGNKWQSSGSDGGKKWGTNESERDGGNKSWNTSKSSDGDTGGGNSAI 1020
            G   W +     D  + W+ S          NESER+GGN+SWN SKSSDGD     SAI
Sbjct: 961  GNSAWNKSTACGDGNDSWKKS----------NESEREGGNRSWNASKSSDGD-----SAI 1020

Query: 1021 WKDKSDSSSLTAPKGDQWGGDWDKQHSSNDTKASEDNSPWNKKSVESGKDGERKDQGSGW 1080
            WK+KSDSSSLTA KGDQWGG W+KQHSSN+TKASEDNSPWNKKSVESGKD E ++QGSGW
Sbjct: 1021 WKEKSDSSSLTASKGDQWGGGWNKQHSSNETKASEDNSPWNKKSVESGKDNELENQGSGW 1080

Query: 1081 NVGKSSGGDSASGWGQTSKEAGSSDQAGSWGSNWKKNSNAGNEDSS-------------- 1140
            NVGK+SG DSASGWGQTS+EAGSSDQ  SWGSNWKK S+ GNEDSS              
Sbjct: 1081 NVGKTSG-DSASGWGQTSREAGSSDQGSSWGSNWKKKSDTGNEDSSSAKKSNWSSGSGNS 1140

Query: 1141 -WAKKSNWNSGNESNDNQFT----GDTSGHGSWGGDGSDRGGFRGRGGFRGRGERGRFGG 1200
             W +KSNWNSGNE N N         T     W G+ SDRGGFRGRG FRGRGERGRFGG
Sbjct: 1141 NWGEKSNWNSGNEFNANHSNDGAEAQTEVSNDWRGESSDRGGFRGRGSFRGRGERGRFGG 1200

Query: 1201 RGRSDRGG-----SDRGGFGGRGRGRWNSEGGSNDGENRGWSSGGGGGDWEKSGSDRGGF 1260
            RGRSDRGG     SDRGGFGGRGRGRWNSEGGSNDG+N+GWSSGGG  DWEKS SDRGGF
Sbjct: 1201 RGRSDRGGFGRGGSDRGGFGGRGRGRWNSEGGSNDGDNKGWSSGGG--DWEKSSSDRGGF 1260

Query: 1261 GGGRGRGRWNQEAGSNDGDSGGWSGGGG----------GHRGRGRGRWNQNGDSNSNDGD 1280
            GG RGRGRWN+E+GSNDG++ GWS GGG          G RGRGRGRWNQ G S   DGD
Sbjct: 1261 GG-RGRGRWNRESGSNDGENRGWSSGGGDRERSGSDRGGFRGRGRGRWNQEGGSR--DGD 1288

BLAST of MC04g1461 vs. ExPASy TrEMBL
Match: A0A6J1BZ56 (protein RNA-directed DNA methylation 3 OS=Momordica charantia OX=3673 GN=LOC111006022 PE=4 SV=1)

HSP 1 Score: 2343 bits (6071), Expect = 0.0
Identity = 1230/1288 (95.50%), Positives = 1230/1288 (95.50%), Query Frame = 0

Query: 1    MASKGKGIAKDTSSGKRKQREDNNGADLARKRRDRSVLQFFEDVAPEVGGESDDSDFLDD 60
            MASKGKGIAKDTSSGKRKQREDNNGADLARKRRDRSVLQFFEDVAPEVGGESDDSDFLDD
Sbjct: 1    MASKGKGIAKDTSSGKRKQREDNNGADLARKRRDRSVLQFFEDVAPEVGGESDDSDFLDD 60

Query: 61   FMEEEFDTEPAFKNDAAKDQNIPFFPKEEEMNEEEFDRIMEEHYNQGPGLGAFAEENYEN 120
            FMEEEFDTEPAFKNDAAKDQNIPFFPKEEEMNEEEFDRIMEEHYNQGPGLGAFAEENYEN
Sbjct: 61   FMEEEFDTEPAFKNDAAKDQNIPFFPKEEEMNEEEFDRIMEEHYNQGPGLGAFAEENYEN 120

Query: 121  KNSTGRNPLQPSSRDTISLWKVKCMVGRERQSVFCLMQKFVDLHSFGTKLQIKSAFYVDH 180
            KNSTGRNPLQPSSRDTISLWKVKCMVGRERQSVFCLMQKFVDLHSFGTKLQIKSAFYVDH
Sbjct: 121  KNSTGRNPLQPSSRDTISLWKVKCMVGRERQSVFCLMQKFVDLHSFGTKLQIKSAFYVDH 180

Query: 181  VKGFIYIEAPRQYDLIEACKGISGIYSTRIASVPENDISQLLTVRSRVSEVSVGTMARVK 240
            VKGFIYIEAPRQYDLIEACKGISGIYSTRIASVPENDISQLLTVRSRVSEVSVGTMARVK
Sbjct: 181  VKGFIYIEAPRQYDLIEACKGISGIYSTRIASVPENDISQLLTVRSRVSEVSVGTMARVK 240

Query: 241  NGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMAAKFGGGVAAKKTTNPAPRLINSSEL 300
            NGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMAAKFGGGVAAKKTTNPAPRLINSSEL
Sbjct: 241  NGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMAAKFGGGVAAKKTTNPAPRLINSSEL 300

Query: 301  DEFRPLMQFRRDRETGKLFEFLDGMMLKDGYLFKKISLDSLNCWGVMPSEDELLKFKPPE 360
            DEFRPLMQFRRDRETGKLFEFLDGMMLKDGYLFKKISLDSLNCWGVMPSEDELLKFKPPE
Sbjct: 301  DEFRPLMQFRRDRETGKLFEFLDGMMLKDGYLFKKISLDSLNCWGVMPSEDELLKFKPPE 360

Query: 361  SNESNDLEWLSQLYGEKRKKKIIRTEKGGGKGEGTSGSSSMNSFADHDLVCFGRKDFGMI 420
            SNESNDLEWLSQLYGEKRKKKIIRTEKGGGKGEGTSGSSSMNSFADHDLVCFGRKDFGMI
Sbjct: 361  SNESNDLEWLSQLYGEKRKKKIIRTEKGGGKGEGTSGSSSMNSFADHDLVCFGRKDFGMI 420

Query: 421  LGTEKDDSYKILKEGPDGSIVVNVQRKELKSGPLEGKFTAADHNGKIISVSDNVKVLEGS 480
            LGTEKDDSYKILKEGPDGSIVVNVQRKELKSGPLEGKFTAADHNGKIISVSDNVKVLEGS
Sbjct: 421  LGTEKDDSYKILKEGPDGSIVVNVQRKELKSGPLEGKFTAADHNGKIISVSDNVKVLEGS 480

Query: 481  LKDKQGIVKHVYRHTVFVYDENEVDNDGYFCCKSNMCEKIKISYDAPSGKDDDKGFSGFE 540
            LKDKQGIVKHVYRHTVFVYDENEVDNDGYFCCKSNMCEKIKISYDAPSGKDDDKGFSGFE
Sbjct: 481  LKDKQGIVKHVYRHTVFVYDENEVDNDGYFCCKSNMCEKIKISYDAPSGKDDDKGFSGFE 540

Query: 541  DFSFSPKSPLSPKKPWAEKDSGREYNRDDKDGMFSIGQTLRIRVGPLKGYLCRVIAVRKK 600
            DFSFSPKSPLSPKKPWAEKDSGREYNRDDKDGMFSIGQTLRIRVGPLKGYLCRVIAVRKK
Sbjct: 541  DFSFSPKSPLSPKKPWAEKDSGREYNRDDKDGMFSIGQTLRIRVGPLKGYLCRVIAVRKK 600

Query: 601  DVTVKLDSQQKVLTVRSDFLSEVYRKTSTVSLRIRISFCSEDTEFGSLKPFDILGNEGSS 660
            DVTVKLDSQQKVLTVRSDFLSEVYRKTSTVSL       SEDTEFGSLKPFDILGNEGSS
Sbjct: 601  DVTVKLDSQQKVLTVRSDFLSEVYRKTSTVSL-------SEDTEFGSLKPFDILGNEGSS 660

Query: 661  QDWMGGTGSSTGGDGWNSAGPSSERSPWPSFPESGTSNCPGSSSNPFGSESLDAQKDVED 720
            QDWMGGTGSSTGGDGWNSAGPSSERSPWPSFPESGTSNCPGSSSNPFGSESLDAQKDVED
Sbjct: 661  QDWMGGTGSSTGGDGWNSAGPSSERSPWPSFPESGTSNCPGSSSNPFGSESLDAQKDVED 720

Query: 721  SPWVSKTVPNASTSWGAAKSNVDDTVNDDQAPGWGKSESWDKATAKTSSDGNASGAWDKK 780
            SPWVSKTVPNASTSWGAAKSNVDDTVNDDQAPGWGKSESWDKATAKTSSDGNASGAWDKK
Sbjct: 721  SPWVSKTVPNASTSWGAAKSNVDDTVNDDQAPGWGKSESWDKATAKTSSDGNASGAWDKK 780

Query: 781  VVPGGDCAGPTDQAEDK-------------------------------DKDSESGGWKKS 840
            VVPGGDCAGPTDQAEDK                               DKDSESGGWKKS
Sbjct: 781  VVPGGDCAGPTDQAEDKWDKGKRVSSDNQTGNWGDGTSGKNEPSAWSRDKDSESGGWKKS 840

Query: 841  QNASFGDENNSAEASADKWSSKNRSSGGWGDCNASTTVSEIKPAGKSNAGSSAWNISDEN 900
            QNASFGDENNSAEASADKWSSKNRSSGGWGDCNASTTVSEIKPAGKSNAGSSAWNISDEN
Sbjct: 841  QNASFGDENNSAEASADKWSSKNRSSGGWGDCNASTTVSEIKPAGKSNAGSSAWNISDEN 900

Query: 901  SSWNTQKQESNRGSWGKPSDRGGT--------------------SSSDWNKSTTTDGAGK 960
            SSWNTQKQESNRGSWGKPSDRGGT                    SSSDWNKSTTTDGAGK
Sbjct: 901  SSWNTQKQESNRGSWGKPSDRGGTRSSESGRGGNQVRGGSSGQDSSSDWNKSTTTDGAGK 960

Query: 961  KDGWNKPNLASEDESIGKKGWGQGNEANDSGNKWQSSGSDGGKKWGTNESERDGGNKSWN 1020
            KDGWNKPNLASEDESIGKKGWGQGNEANDSGNKWQSSGSDGGKKWGTNESERDGGNKSWN
Sbjct: 961  KDGWNKPNLASEDESIGKKGWGQGNEANDSGNKWQSSGSDGGKKWGTNESERDGGNKSWN 1020

Query: 1021 TSKSSDGDTGGGNSAIWKDKSDSSSLTAPKGDQWGGDWDKQHSSNDTKASEDNSPWNKKS 1080
            TSKSSDGDTGGGNSAIWKDKSDSSSLTAPKGDQWGGDWDKQHSSNDTKASEDNSPWNKKS
Sbjct: 1021 TSKSSDGDTGGGNSAIWKDKSDSSSLTAPKGDQWGGDWDKQHSSNDTKASEDNSPWNKKS 1080

Query: 1081 VESGKDGERKDQGSGWNVGKSSGGDSASGWGQTSKEAGSSDQAGSWGSNWKKNSNAGNED 1140
            VESGKDGERKDQGSGWNVGKSSGGDSASGWGQTSKEAGSSDQAGSWGSNWKKNSNAGNED
Sbjct: 1081 VESGKDGERKDQGSGWNVGKSSGGDSASGWGQTSKEAGSSDQAGSWGSNWKKNSNAGNED 1140

Query: 1141 SSWAKKSNWNSGNESNDNQFTGDTSGHGSWGGDGSDRGGFRGRGGFRGRGERGRFGGRGR 1200
            SSWAKKSNWNSGNESNDNQFTGDTSGHGSWGGDGSDRGGFRGRGGFRGRGERGRFGGRGR
Sbjct: 1141 SSWAKKSNWNSGNESNDNQFTGDTSGHGSWGGDGSDRGGFRGRGGFRGRGERGRFGGRGR 1200

Query: 1201 SDRGGSDRGGFGGRGRGRWNSEGGSNDGENRGWSSGGGGGDWEKSGSDRGGFGGGRGRGR 1237
            SDRGGSDRGGFGGRGRGRWNSEGGSNDGENRGWSSGGGGGDWEKSGSDRGGFGGGRGRGR
Sbjct: 1201 SDRGGSDRGGFGGRGRGRWNSEGGSNDGENRGWSSGGGGGDWEKSGSDRGGFGGGRGRGR 1260
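
The percentages in each HSP summary line follow directly from the counts printed beside them (matches divided by alignment length). The sketch below is a hedged illustration, not part of the BLAST report: it parses a summary line and re-derives the two percentages. The names `HSP_SUMMARY`, `parse_hsp_summary` and `is_consistent` are hypothetical helpers introduced here, and the pattern assumes the line layout shown in this report.

```python
import re

# Pattern for the HSP summary line as laid out in this report (an assumption).
# "Posi?tives" accepts both "Positives" and the variant spelling "Postives".
HSP_SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return the counts and reported percentages from one summary line."""
    m = HSP_SUMMARY.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    id_n, id_len, id_pct, pos_n, pos_len, pos_pct = m.groups()
    return {
        "identical": int(id_n),
        "positives": int(pos_n),
        "align_len": int(id_len),
        "positives_len": int(pos_len),
        "identity_pct": float(id_pct),
        "positives_pct": float(pos_pct),
    }


def is_consistent(summary, tol=0.01):
    """Check that each reported percentage equals count / length * 100."""
    id_calc = 100.0 * summary["identical"] / summary["align_len"]
    pos_calc = 100.0 * summary["positives"] / summary["positives_len"]
    return (
        abs(id_calc - summary["identity_pct"]) <= tol
        and abs(pos_calc - summary["positives_pct"]) <= tol
    )


if __name__ == "__main__":
    line = "Identity = 1010/1350 (74.81%), Positives = 1086/1350 (80.44%), Query Frame = 0"
    print(is_consistent(parse_hsp_summary(line)))
```

Applied to the Cucurbita moschata hit, for instance, 1010/1350 identical positions and 1086/1350 positive positions reproduce the reported 74.81% and 80.44%.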

BLAST of MC04g1461 vs. ExPASy TrEMBL
Match: A0A6J1H340 (protein RNA-directed DNA methylation 3-like OS=Cucurbita moschata OX=3662 GN=LOC111459247 PE=4 SV=1)

HSP 1 Score: 1790 bits (4637), Expect = 0.0
Identity = 1010/1350 (74.81%), Positives = 1086/1350 (80.44%), Query Frame = 0

Query: 1    MASKGKGIAKDTSSGKRKQREDNNGADLARKRRDRSVLQFFEDVAPEVGGESDDSDFLDD 60
            MASKGKGIAKDTS+GKRK R+D + +  ARKRRDRSVLQFFEDV+PE+G  SDDSDFLDD
Sbjct: 1    MASKGKGIAKDTSAGKRKLRDDTDSS-AARKRRDRSVLQFFEDVSPELGAYSDDSDFLDD 60

Query: 61   FMEEEFDTEPAFKNDAAKDQNIPFFPKEEEMNEEEFDRIMEEHYNQGPGLGAFAEENYEN 120
            FMEEEF+  PAFKND +K QNIPFFPKEEEMNEEEFDR+MEEHY +GPGLGAFAEENYEN
Sbjct: 61   FMEEEFEPVPAFKNDDSKAQNIPFFPKEEEMNEEEFDRMMEEHYTRGPGLGAFAEENYEN 120

Query: 121  KNSTGRNPLQPSSRDTISLWKVKCMVGRERQSVFCLMQKFVDLHSFGTKLQIKSAFYVDH 180
            K STGRNP Q S+RD ISLWKVKCMVG ERQSVFCLMQKFVDLHSFGTKLQIKSAF V+H
Sbjct: 121  KISTGRNPPQQSARDNISLWKVKCMVGHERQSVFCLMQKFVDLHSFGTKLQIKSAFCVEH 180

Query: 181  VKGFIYIEAPRQYDLIEACKGISGIYSTRIASVPENDISQLLTVRSRVSEVSVGTMARVK 240
            +KGFIYIEAPRQYDLIEACKGI+GIYSTRIASVPENDISQLLTV+SRVSEV+VGTMARVK
Sbjct: 181  IKGFIYIEAPRQYDLIEACKGINGIYSTRIASVPENDISQLLTVQSRVSEVAVGTMARVK 240

Query: 241  NGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMAAKFGGGVAAKKTTNPAPRLINSSEL 300
            NGKYKGDLAQIVAVNNARKRATVKL+PRIDLQ+MA KFGGGV AKK TNPAPRLINSSEL
Sbjct: 241  NGKYKGDLAQIVAVNNARKRATVKLIPRIDLQSMAEKFGGGVVAKKATNPAPRLINSSEL 300

Query: 301  DEFRPLMQFRRDRETGKLFEFLDGMMLKDGYLFKKISLDSLNCWGVMPSEDELLKFKPPE 360
            +EFRPLMQFRRDRETGKLFEF DGMMLKDGYL+KK+SLDSLNCWGVMPSEDELLKFKPPE
Sbjct: 301  EEFRPLMQFRRDRETGKLFEFFDGMMLKDGYLYKKVSLDSLNCWGVMPSEDELLKFKPPE 360

Query: 361  SNESNDLEWLSQLYGEKRKKKIIRTEKGGGKGEGTSGSSSMNSFADHDLVCFGRKDFGMI 420
            SNE+NDLEWLSQLYGEK+KKKIIRTEKGGGKGEG+SGSSSM SF DHDLVCFGRKDFGMI
Sbjct: 361  SNEANDLEWLSQLYGEKKKKKIIRTEKGGGKGEGSSGSSSMISFGDHDLVCFGRKDFGMI 420

Query: 421  LGTEKDDSYKILKEGPDGSIVVNVQRKELKSGPLEGKFTAADHNGKIISVSDNVKVLEGS 480
            LGTEKDDSYKILK+GPDGS+VVNVQRKELKSGPL+ KFT+ D NGKIISVSDNVKVLEGS
Sbjct: 421  LGTEKDDSYKILKDGPDGSVVVNVQRKELKSGPLDAKFTSVDLNGKIISVSDNVKVLEGS 480

Query: 481  LKDKQGIVKHVYRHTVFVYDENEVDNDGYFCCKSNMCEKIKISYDAPSGKDDDKGFSGFE 540
            LKDKQGIVKHVYRHTVFVYDENEVDNDGYFCCKSN+C KIKISYDAPSGK+DDKGFSGFE
Sbjct: 481  LKDKQGIVKHVYRHTVFVYDENEVDNDGYFCCKSNLCAKIKISYDAPSGKEDDKGFSGFE 540

Query: 541  DFSFSPKSPLSPKKPWAEKDSGREYNRDDK-DGMFSIGQTLRIRVGPLKGYLCRVIAVRK 600
            DFS SPKSPLSP+KPWAEK+ GREYNRDD+ DGMFSIGQTLRIRVGPLKGYLCRVIAV K
Sbjct: 541  DFSSSPKSPLSPQKPWAEKE-GREYNRDDRGDGMFSIGQTLRIRVGPLKGYLCRVIAVHK 600

Query: 601  KDVTVKLDSQQKVLTVRSDFLSEVYRKTSTVSLRIRISFCSEDTEFGSLKPFDILGNEGS 660
            +DVTVKLDSQQKVLTVRSD L+EV+RK+S VS+       SED EF SLKPFDILGNEGS
Sbjct: 601  RDVTVKLDSQQKVLTVRSDLLAEVHRKSSAVSV-------SEDPEFSSLKPFDILGNEGS 660

Query: 661  SQDWMGGTGSSTGGDGWNSAGPSSERSPWPSFPESGTSNCPGSSS-NPFGSESLDAQKDV 720
            SQDWMGGTGSS   DGWNSAGPSSERSPWPSFPESGT NCPGSSS NPFGSE+LDA+KDV
Sbjct: 661  SQDWMGGTGSSAAADGWNSAGPSSERSPWPSFPESGTLNCPGSSSTNPFGSENLDAKKDV 720

Query: 721  EDSPWVSKTVPNASTSWGAAKSNVDDTVNDDQAPGWGKSESWDKATAKTSSDGNASGAWD 780
            EDSPWVSK+   A+TSWGAA S+VD T NDDQA GWGKSESWDKATAKT SDGN SGAW 
Sbjct: 721  EDSPWVSKSTAEANTSWGAANSSVD-TANDDQACGWGKSESWDKATAKTISDGNVSGAWG 780

Query: 781  KKVVPGGDCAGP---------TDQ----------------AEDKDKDSESGGWKKSQNAS 840
            K VVP GD AG          +D                 A  +DKDSESGGWKK+ + S
Sbjct: 781  KSVVPSGDSAGDKWDKGKHVTSDNQTGKWGGGTSGKNELSAWSRDKDSESGGWKKNPSVS 840

Query: 841  FGDENNSAEASADKWSSKNRSSGGWGDCNASTTVSEIKPAGKSNAGSSAWNISDENSSWN 900
             GD N  AE SADKW SKNRS+G WGD N STTVSEI+ AGK N G              
Sbjct: 841  VGDNNTPAETSADKWGSKNRSNGSWGDQNVSTTVSEIQTAGKDNVGG------------- 900

Query: 901  TQKQESNRGSWGKPSDRGGTSSSDWNKSTTTDGA-----GKKD---GWNKPNLASEDESI 960
                      W KP      + S WN+ T+  G      G +D   GW KP    +    
Sbjct: 901  ----------WTKPGAESKINPSGWNEDTSMKGGQTSNWGNRDEAGGWGKPMNVGDG--- 960

Query: 961  GKKGWGQGNEANDSGNKWQSSGSDGGKKWGTNESERDGGNKSWNTSKSSDGDTGGGNSAI 1020
            G   W +     D  + W+ S          NESER+GGN+SWN SKSSDGD     SAI
Sbjct: 961  GNSAWNKSTACGDGNDSWKKS----------NESEREGGNRSWNASKSSDGD-----SAI 1020

Query: 1021 WKDKSDSSSLTAPKGDQWGGDWDKQHSSNDTKASEDNSPWNKKSVESGKDGERKDQGSGW 1080
            WK+KSDSSSLTA KGDQWGG W+KQHSSN+TKASEDNSPWNKKSVESGKD E ++QGSGW
Sbjct: 1021 WKEKSDSSSLTASKGDQWGGGWNKQHSSNETKASEDNSPWNKKSVESGKDNELENQGSGW 1080

Query: 1081 NVGKSSGGDSASGWGQTSKEAGSSDQAGSWGSNWKKNSNAGNEDSS-------------- 1140
            NVGK+SG DSASGWGQTS+EAGSSDQ  SWGSNWKK S+ GNEDSS              
Sbjct: 1081 NVGKTSG-DSASGWGQTSREAGSSDQGSSWGSNWKKKSDTGNEDSSSAKKSNWSSGSGNS 1140

Query: 1141 -WAKKSNWNSGNESNDNQFT----GDTSGHGSWGGDGSDRGGFRGRGGFRGRGERGRFGG 1200
             W +KSNWNSGNE N N         T     W G+ SDRGGFRGRG FRGRGERGRFGG
Sbjct: 1141 NWGEKSNWNSGNEFNANHSNDGAEAQTEVSNDWRGESSDRGGFRGRGSFRGRGERGRFGG 1200

Query: 1201 RGRSDRGG-----SDRGGFGGRGRGRWNSEGGSNDGENRGWSSGGGGGDWEKSGSDRGGF 1260
            RGRSDRGG     SDRGGFGGRGRGRWNSEGGSNDG+N+GWSSGGG  DWEKS SDRGGF
Sbjct: 1201 RGRSDRGGFGRGGSDRGGFGGRGRGRWNSEGGSNDGDNKGWSSGGG--DWEKSSSDRGGF 1260

Query: 1261 GGGRGRGRWNQEAGSNDGDSGGWSGGGG----------GHRGRGRGRWNQNGDSNSNDGD 1280
            GG RGRGRWN+E+GSNDG++ GWS GGG          G RGRGRGRWNQ G S   DGD
Sbjct: 1261 GG-RGRGRWNRESGSNDGENRGWSSGGGDRERSGSDRGGFRGRGRGRWNQEGGSR--DGD 1290

BLAST of MC04g1461 vs. ExPASy TrEMBL
Match: A0A6J1JRE1 (protein RNA-directed DNA methylation 3-like isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111489134 PE=4 SV=1)

HSP 1 Score: 1778 bits (4604), Expect = 0.0
Identity = 1018/1450 (70.21%), Positives = 1096/1450 (75.59%), Query Frame = 0

Query: 1    MASKGKGIAKDTSSGKRKQREDNNGADLARKRRDRSVLQFFEDVAPEVGGESDDSDFLDD 60
            MASKGKGIAKD S GKRK R+DNN +  ARKRRDRSVLQFFEDVAPE GGESDDSDF D 
Sbjct: 1    MASKGKGIAKDASPGKRKLRDDNNSS-AARKRRDRSVLQFFEDVAPEFGGESDDSDFFDG 60

Query: 61   FMEEEFDTEPAFKNDAAKDQNIPFFPKEEEMNEEEFDRIMEEHYNQGPGLGAFAEENYEN 120
            FMEE+F+T PAFKND AK QNIPFFPKEEEMNEEEFDR+MEEHYNQGPG GAFAEENYEN
Sbjct: 61   FMEEDFETIPAFKNDDAKSQNIPFFPKEEEMNEEEFDRMMEEHYNQGPGFGAFAEENYEN 120

Query: 121  KNSTGRNPLQPSSRDTISLWKVKCMVGRERQSVFCLMQKFVDLHSFGTKLQIKSAFYVDH 180
            KNSTGRNP   S+RDTISLWKVKCMVGRERQSVFCLMQKFVDL+SFGTKLQIK+AF V+H
Sbjct: 121  KNSTGRNPPLQSARDTISLWKVKCMVGRERQSVFCLMQKFVDLNSFGTKLQIKAAFCVEH 180

Query: 181  VKGFIYIEAPRQYDLIEACKGISGIYSTRIASVPENDISQLLTVRSRVSEVSVGTMARVK 240
            VKGFIYIEAPRQYDLIEACKGISGIYSTR+ASVPENDISQLLTVRSRVSEV+VGTMARVK
Sbjct: 181  VKGFIYIEAPRQYDLIEACKGISGIYSTRVASVPENDISQLLTVRSRVSEVAVGTMARVK 240

Query: 241  NGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMAAKFGGGVAAKKTTNPAPRLINSSEL 300
            NGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMA KFGGG AAKKT+NPAPRLINSSEL
Sbjct: 241  NGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMAEKFGGGSAAKKTSNPAPRLINSSEL 300

Query: 301  DEFRPLMQFRRDRETGKLFEFLDGMMLKDGYLFKKISLDSLNCWGVMPSEDELLKFKPPE 360
            DEFRPLMQFRRDRETGKLFEFLDGMMLKDGYL+KKISLDSLNCWGVMPSEDELLKFKPPE
Sbjct: 301  DEFRPLMQFRRDRETGKLFEFLDGMMLKDGYLYKKISLDSLNCWGVMPSEDELLKFKPPE 360

Query: 361  SNESNDLEWLSQLYGEKRKKKIIRTEKGGGKGEGTSGSSSMNSFADHDLVCFGRKDFGMI 420
            SNES+DLEWLSQLYGEK+KKKIIRTEKGGGKGEG+SGSSS +SF DHDL+CFGRKDFGMI
Sbjct: 361  SNESHDLEWLSQLYGEKKKKKIIRTEKGGGKGEGSSGSSSTSSFGDHDLICFGRKDFGMI 420

Query: 421  LGTEKDDSYKILKEGPDGSIVVNVQRKELKSGPLEGKFTAADHNGKIISVSDNVKVLEGS 480
            LG EKDDSYKILK+ P+GS+VVNVQRKELK G L+ KFTAADHNGK+ISVSDNVKVLEGS
Sbjct: 421  LGMEKDDSYKILKDSPNGSVVVNVQRKELKGGSLDAKFTAADHNGKMISVSDNVKVLEGS 480

Query: 481  LKDKQGIVKHVYRHTVFVYDENEVDNDGYFCCKSNMCEKIKISYDAPSGKDDDKGFSGFE 540
            LKDKQGIVKHVYRHTVFVYDENEV+NDGYFCCKSN CEKIKISYDAP GK+DDKGFSGFE
Sbjct: 481  LKDKQGIVKHVYRHTVFVYDENEVENDGYFCCKSNKCEKIKISYDAPCGKEDDKGFSGFE 540

Query: 541  DFSFSPKSPLSPKKPWAEKDSGREYNRDDK-DGMFSIGQTLRIRVGPLKGYLCRVIAVRK 600
            DFS SPKSPLSPKKPWAEK++GREYNRDD+ DGMFSIGQTLRIRVGPLKGYLCRVIAVRK
Sbjct: 541  DFSSSPKSPLSPKKPWAEKETGREYNRDDRGDGMFSIGQTLRIRVGPLKGYLCRVIAVRK 600

Query: 601  KDVTVKLDSQQKVLTVRSDFLSEVYRKTSTVSLRIRISFCSEDTEFGSLKPFDILGNEGS 660
            +DVTVKLDSQQKVLTVRSDFLSEV RK+S VSL       SED     LKPFDILGNEGS
Sbjct: 601  RDVTVKLDSQQKVLTVRSDFLSEVQRKSSAVSL-------SEDP----LKPFDILGNEGS 660

Query: 661  SQDWMGGTGSSTGGDGWNSAGPSSERSPWPSFPESGTSNCPGSSS-NPFGSESLDAQKDV 720
            SQDWMGGTGSS GGDGWNS GPSSE +PWPSFPES T N PGSSS NP GSES DA KD 
Sbjct: 661  SQDWMGGTGSSAGGDGWNSTGPSSEGNPWPSFPESSTLNGPGSSSTNPIGSESFDANKD- 720

Query: 721  EDSPWVSKTVPNASTSWGAAKSNVDDTVNDDQAPGWGKSESWDKATAKTSSDGNASGAWD 780
            EDSPWVSK+ P+ASTSWGAAKS+VD T N+ QA GWGKS+SW K  AKT SDGNASGAW 
Sbjct: 721  EDSPWVSKSTPDASTSWGAAKSSVD-TANNGQASGWGKSDSWGKTIAKTCSDGNASGAWG 780

Query: 781  KKVVPGGDCAGPTDQAEDK-------------------------------DKDSESGGWK 840
            K  VP GD AG T+   DK                               DKD+ESGGWK
Sbjct: 781  KTAVPSGDSAGLTENTWDKWDKGKQVSSDNQTGNWDNGTSGKNEHSAWSRDKDAESGGWK 840

Query: 841  KSQNASFGDENNSAEASADKWSSKNRSSGGWGDCNASTTVSEIKPAGKSNAGSSAWNISD 900
            K+Q+A+F D+   AE++ D W++                 +++ P+G  N G+S      
Sbjct: 841  KTQSANFDDDKTPAESAGD-WTNPE-------------AENKVNPSGW-NEGTSM--KGS 900

Query: 901  ENSSWNTQKQESNRGSWGKPSDRGGTSSSDWNKSTTTDGAGKKDGWNKPNLASEDESIGK 960
            + S+W  Q +    G WGKP + G   SS WNKST+ DGA + D WNKP L S DESIGK
Sbjct: 901  QTSNWGNQDET---GGWGKPKNVGNGGSSAWNKSTSGDGAVENDSWNKPKLFSHDESIGK 960

Query: 961  KGWGQGNEANDSGNKWQSSGSDGGKKWGTNESERDGGNKSWNTSKSSDGDTGGGNSAIWK 1020
            KGWGQ NEA+D+GNKWQSS SDGG KWGTNESE +GG                      K
Sbjct: 961  KGWGQSNEASDNGNKWQSSRSDGGTKWGTNESEHEGG----------------------K 1020

Query: 1021 DKSDSSSLTAPKGDQWGGDWDKQHSSNDTKASEDNSPWNKKSVESGKDGERKDQGSGWNV 1080
            DKSDSSSLT P+GDQ  G WDKQ SSNDTKASE+NSPWNKKSVESGKDGE K+QGSGWNV
Sbjct: 1021 DKSDSSSLTTPRGDQSVGGWDKQRSSNDTKASEENSPWNKKSVESGKDGELKNQGSGWNV 1080

Query: 1081 GKSSGGDSASGWGQTSKEAGSSDQAGSWGSNWKKNSNAGNEDSS---------------W 1140
            GK+SGGDSASGWGQ SKEAGSSD  G+WGSNWKKNS+ GNEDSS               W
Sbjct: 1081 GKTSGGDSASGWGQASKEAGSSDLVGNWGSNWKKNSDVGNEDSSLAKKSNWSSGSGNSNW 1140

Query: 1141 AKKSNWNSGNESNDNQFTG----------DTSGHGSWGGDGSDRGGFRGRGGFRGRGERG 1200
             +KSNWNSGNE N N  TG          DTSG+GSW G+ SDRGG+RGRGGFRGRGERG
Sbjct: 1141 GEKSNWNSGNEYNANHSTGGAEAQTDVSNDTSGYGSWRGENSDRGGYRGRGGFRGRGERG 1200

Query: 1201 RFGGRGRSDRGGSDRGGFGGRGRGRWNSEGGSNDGENRGWSSGGGGGD------------ 1260
            RFGGRGRSDRGG  RGGFGGRGRGRWNSEGGSN G+N+GWSSGGGGGD            
Sbjct: 1201 RFGGRGRSDRGGFGRGGFGGRGRGRWNSEGGSNGGDNKGWSSGGGGGDNKGWSSGGGGGD 1260

Query: 1261 ----------------------------------------------------------WE 1284
                                                                      WE
Sbjct: 1261 NRGWSSGGGGDDNKGWSSGGAGDNKGWGGGGGGDNKGWSSGGGDNKGWSGGGGGGSSDWE 1320

BLAST of MC04g1461 vs. ExPASy TrEMBL
Match: A0A6J1JYW2 (protein RNA-directed DNA methylation 3-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111489134 PE=4 SV=1)

HSP 1 Score: 1778 bits (4604), Expect = 0.0
Identity = 1018/1450 (70.21%), Positives = 1096/1450 (75.59%), Query Frame = 0

Query: 1    MASKGKGIAKDTSSGKRKQREDNNGADLARKRRDRSVLQFFEDVAPEVGGESDDSDFLDD 60
            MASKGKGIAKD S GKRK R+DNN +  ARKRRDRSVLQFFEDVAPE GGESDDSDF D 
Sbjct: 1    MASKGKGIAKDASPGKRKLRDDNNSS-AARKRRDRSVLQFFEDVAPEFGGESDDSDFFDG 60

Query: 61   FMEEEFDTEPAFKNDAAKDQNIPFFPKEEEMNEEEFDRIMEEHYNQGPGLGAFAEENYEN 120
            FMEE+F+T PAFKND AK QNIPFFPKEEEMNEEEFDR+MEEHYNQGPG GAFAEENYEN
Sbjct: 61   FMEEDFETIPAFKNDDAKSQNIPFFPKEEEMNEEEFDRMMEEHYNQGPGFGAFAEENYEN 120

Query: 121  KNSTGRNPLQPSSRDTISLWKVKCMVGRERQSVFCLMQKFVDLHSFGTKLQIKSAFYVDH 180
            KNSTGRNP   S+RDTISLWKVKCMVGRERQSVFCLMQKFVDL+SFGTKLQIK+AF V+H
Sbjct: 121  KNSTGRNPPLQSARDTISLWKVKCMVGRERQSVFCLMQKFVDLNSFGTKLQIKAAFCVEH 180

Query: 181  VKGFIYIEAPRQYDLIEACKGISGIYSTRIASVPENDISQLLTVRSRVSEVSVGTMARVK 240
            VKGFIYIEAPRQYDLIEACKGISGIYSTR+ASVPENDISQLLTVRSRVSEV+VGTMARVK
Sbjct: 181  VKGFIYIEAPRQYDLIEACKGISGIYSTRVASVPENDISQLLTVRSRVSEVAVGTMARVK 240

Query: 241  NGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMAAKFGGGVAAKKTTNPAPRLINSSEL 300
            NGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMA KFGGG AAKKT+NPAPRLINSSEL
Sbjct: 241  NGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMAEKFGGGSAAKKTSNPAPRLINSSEL 300

Query: 301  DEFRPLMQFRRDRETGKLFEFLDGMMLKDGYLFKKISLDSLNCWGVMPSEDELLKFKPPE 360
            DEFRPLMQFRRDRETGKLFEFLDGMMLKDGYL+KKISLDSLNCWGVMPSEDELLKFKPPE
Sbjct: 301  DEFRPLMQFRRDRETGKLFEFLDGMMLKDGYLYKKISLDSLNCWGVMPSEDELLKFKPPE 360

Query: 361  SNESNDLEWLSQLYGEKRKKKIIRTEKGGGKGEGTSGSSSMNSFADHDLVCFGRKDFGMI 420
            SNES+DLEWLSQLYGEK+KKKIIRTEKGGGKGEG+SGSSS +SF DHDL+CFGRKDFGMI
Sbjct: 361  SNESHDLEWLSQLYGEKKKKKIIRTEKGGGKGEGSSGSSSTSSFGDHDLICFGRKDFGMI 420

Query: 421  LGTEKDDSYKILKEGPDGSIVVNVQRKELKSGPLEGKFTAADHNGKIISVSDNVKVLEGS 480
            LG EKDDSYKILK+ P+GS+VVNVQRKELK G L+ KFTAADHNGK+ISVSDNVKVLEGS
Sbjct: 421  LGMEKDDSYKILKDSPNGSVVVNVQRKELKGGSLDAKFTAADHNGKMISVSDNVKVLEGS 480

Query: 481  LKDKQGIVKHVYRHTVFVYDENEVDNDGYFCCKSNMCEKIKISYDAPSGKDDDKGFSGFE 540
            LKDKQGIVKHVYRHTVFVYDENEV+NDGYFCCKSN CEKIKISYDAP GK+DDKGFSGFE
Sbjct: 481  LKDKQGIVKHVYRHTVFVYDENEVENDGYFCCKSNKCEKIKISYDAPCGKEDDKGFSGFE 540

Query: 541  DFSFSPKSPLSPKKPWAEKDSGREYNRDDK-DGMFSIGQTLRIRVGPLKGYLCRVIAVRK 600
            DFS SPKSPLSPKKPWAEK++GREYNRDD+ DGMFSIGQTLRIRVGPLKGYLCRVIAVRK
Sbjct: 541  DFSSSPKSPLSPKKPWAEKETGREYNRDDRGDGMFSIGQTLRIRVGPLKGYLCRVIAVRK 600

Query: 601  KDVTVKLDSQQKVLTVRSDFLSEVYRKTSTVSLRIRISFCSEDTEFGSLKPFDILGNEGS 660
            +DVTVKLDSQQKVLTVRSDFLSEV RK+S VSL       SED     LKPFDILGNEGS
Sbjct: 601  RDVTVKLDSQQKVLTVRSDFLSEVQRKSSAVSL-------SEDP----LKPFDILGNEGS 660

Query: 661  SQDWMGGTGSSTGGDGWNSAGPSSERSPWPSFPESGTSNCPGSSS-NPFGSESLDAQKDV 720
            SQDWMGGTGSS GGDGWNS GPSSE +PWPSFPES T N PGSSS NP GSES DA KD 
Sbjct: 661  SQDWMGGTGSSAGGDGWNSTGPSSEGNPWPSFPESSTLNGPGSSSTNPIGSESFDANKD- 720

Query: 721  EDSPWVSKTVPNASTSWGAAKSNVDDTVNDDQAPGWGKSESWDKATAKTSSDGNASGAWD 780
            EDSPWVSK+ P+ASTSWGAAKS+VD T N+ QA GWGKS+SW K  AKT SDGNASGAW 
Sbjct: 721  EDSPWVSKSTPDASTSWGAAKSSVD-TANNGQASGWGKSDSWGKTIAKTCSDGNASGAWG 780

Query: 781  KKVVPGGDCAGPTDQAEDK-------------------------------DKDSESGGWK 840
            K  VP GD AG T+   DK                               DKD+ESGGWK
Sbjct: 781  KTAVPSGDSAGLTENTWDKWDKGKQVSSDNQTGNWDNGTSGKNEHSAWSRDKDAESGGWK 840

Query: 841  KSQNASFGDENNSAEASADKWSSKNRSSGGWGDCNASTTVSEIKPAGKSNAGSSAWNISD 900
            K+Q+A+F D+   AE++ D W++                 +++ P+G  N G+S      
Sbjct: 841  KTQSANFDDDKTPAESAGD-WTNPE-------------AENKVNPSGW-NEGTSM--KGS 900

Query: 901  ENSSWNTQKQESNRGSWGKPSDRGGTSSSDWNKSTTTDGAGKKDGWNKPNLASEDESIGK 960
            + S+W  Q +    G WGKP + G   SS WNKST+ DGA + D WNKP L S DESIGK
Sbjct: 901  QTSNWGNQDET---GGWGKPKNVGNGGSSAWNKSTSGDGAVENDSWNKPKLFSHDESIGK 960

Query: 961  KGWGQGNEANDSGNKWQSSGSDGGKKWGTNESERDGGNKSWNTSKSSDGDTGGGNSAIWK 1020
            KGWGQ NEA+D+GNKWQSS SDGG KWGTNESE +GG                      K
Sbjct: 961  KGWGQSNEASDNGNKWQSSRSDGGTKWGTNESEHEGG----------------------K 1020

Query: 1021 DKSDSSSLTAPKGDQWGGDWDKQHSSNDTKASEDNSPWNKKSVESGKDGERKDQGSGWNV 1080
            DKSDSSSLT P+GDQ  G WDKQ SSNDTKASE+NSPWNKKSVESGKDGE K+QGSGWNV
Sbjct: 1021 DKSDSSSLTTPRGDQSVGGWDKQRSSNDTKASEENSPWNKKSVESGKDGELKNQGSGWNV 1080

Query: 1081 GKSSGGDSASGWGQTSKEAGSSDQAGSWGSNWKKNSNAGNEDSS---------------W 1140
            GK+SGGDSASGWGQ SKEAGSSD  G+WGSNWKKNS+ GNEDSS               W
Sbjct: 1081 GKTSGGDSASGWGQASKEAGSSDLVGNWGSNWKKNSDVGNEDSSLAKKSNWSSGSGNSNW 1140

Query: 1141 AKKSNWNSGNESNDNQFTG----------DTSGHGSWGGDGSDRGGFRGRGGFRGRGERG 1200
             +KSNWNSGNE N N  TG          DTSG+GSW G+ SDRGG+RGRGGFRGRGERG
Sbjct: 1141 GEKSNWNSGNEYNANHSTGGAEAQTDVSNDTSGYGSWRGENSDRGGYRGRGGFRGRGERG 1200

Query: 1201 RFGGRGRSDRGGSDRGGFGGRGRGRWNSEGGSNDGENRGWSSGGGGGD------------ 1260
            RFGGRGRSDRGG  RGGFGGRGRGRWNSEGGSN G+N+GWSSGGGGGD            
Sbjct: 1201 RFGGRGRSDRGGFGRGGFGGRGRGRWNSEGGSNGGDNKGWSSGGGGGDNKGWSSGGGGGD 1260

Query: 1261 ----------------------------------------------------------WE 1284
                                                                      WE
Sbjct: 1261 NRGWSSGGGGDDNKGWSSGGAGDNKGWGGGGGGDNKGWSSGGGDNKGWSGGGGGGSSDWE 1320

BLAST of MC04g1461 vs. ExPASy TrEMBL
Match: A0A6J1JV02 (protein RNA-directed DNA methylation 3-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111489134 PE=4 SV=1)

HSP 1 Score: 1778 bits (4604), Expect = 0.0
Identity = 1018/1450 (70.21%), Positives = 1096/1450 (75.59%), Query Frame = 0

Query: 1    MASKGKGIAKDTSSGKRKQREDNNGADLARKRRDRSVLQFFEDVAPEVGGESDDSDFLDD 60
            MASKGKGIAKD S GKRK R+DNN +  ARKRRDRSVLQFFEDVAPE GGESDDSDF D 
Sbjct: 1    MASKGKGIAKDASPGKRKLRDDNNSS-AARKRRDRSVLQFFEDVAPEFGGESDDSDFFDG 60

Query: 61   FMEEEFDTEPAFKNDAAKDQNIPFFPKEEEMNEEEFDRIMEEHYNQGPGLGAFAEENYEN 120
            FMEE+F+T PAFKND AK QNIPFFPKEEEMNEEEFDR+MEEHYNQGPG GAFAEENYEN
Sbjct: 61   FMEEDFETIPAFKNDDAKSQNIPFFPKEEEMNEEEFDRMMEEHYNQGPGFGAFAEENYEN 120

Query: 121  KNSTGRNPLQPSSRDTISLWKVKCMVGRERQSVFCLMQKFVDLHSFGTKLQIKSAFYVDH 180
            KNSTGRNP   S+RDTISLWKVKCMVGRERQSVFCLMQKFVDL+SFGTKLQIK+AF V+H
Sbjct: 121  KNSTGRNPPLQSARDTISLWKVKCMVGRERQSVFCLMQKFVDLNSFGTKLQIKAAFCVEH 180

Query: 181  VKGFIYIEAPRQYDLIEACKGISGIYSTRIASVPENDISQLLTVRSRVSEVSVGTMARVK 240
            VKGFIYIEAPRQYDLIEACKGISGIYSTR+ASVPENDISQLLTVRSRVSEV+VGTMARVK
Sbjct: 181  VKGFIYIEAPRQYDLIEACKGISGIYSTRVASVPENDISQLLTVRSRVSEVAVGTMARVK 240

Query: 241  NGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMAAKFGGGVAAKKTTNPAPRLINSSEL 300
            NGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMA KFGGG AAKKT+NPAPRLINSSEL
Sbjct: 241  NGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMAEKFGGGSAAKKTSNPAPRLINSSEL 300

Query: 301  DEFRPLMQFRRDRETGKLFEFLDGMMLKDGYLFKKISLDSLNCWGVMPSEDELLKFKPPE 360
            DEFRPLMQFRRDRETGKLFEFLDGMMLKDGYL+KKISLDSLNCWGVMPSEDELLKFKPPE
Sbjct: 301  DEFRPLMQFRRDRETGKLFEFLDGMMLKDGYLYKKISLDSLNCWGVMPSEDELLKFKPPE 360

Query: 361  SNESNDLEWLSQLYGEKRKKKIIRTEKGGGKGEGTSGSSSMNSFADHDLVCFGRKDFGMI 420
            SNES+DLEWLSQLYGEK+KKKIIRTEKGGGKGEG+SGSSS +SF DHDL+CFGRKDFGMI
Sbjct: 361  SNESHDLEWLSQLYGEKKKKKIIRTEKGGGKGEGSSGSSSTSSFGDHDLICFGRKDFGMI 420

Query: 421  LGTEKDDSYKILKEGPDGSIVVNVQRKELKSGPLEGKFTAADHNGKIISVSDNVKVLEGS 480
            LG EKDDSYKILK+ P+GS+VVNVQRKELK G L+ KFTAADHNGK+ISVSDNVKVLEGS
Sbjct: 421  LGMEKDDSYKILKDSPNGSVVVNVQRKELKGGSLDAKFTAADHNGKMISVSDNVKVLEGS 480

Query: 481  LKDKQGIVKHVYRHTVFVYDENEVDNDGYFCCKSNMCEKIKISYDAPSGKDDDKGFSGFE 540
            LKDKQGIVKHVYRHTVFVYDENEV+NDGYFCCKSN CEKIKISYDAP GK+DDKGFSGFE
Sbjct: 481  LKDKQGIVKHVYRHTVFVYDENEVENDGYFCCKSNKCEKIKISYDAPCGKEDDKGFSGFE 540

Query: 541  DFSFSPKSPLSPKKPWAEKDSGREYNRDDK-DGMFSIGQTLRIRVGPLKGYLCRVIAVRK 600
            DFS SPKSPLSPKKPWAEK++GREYNRDD+ DGMFSIGQTLRIRVGPLKGYLCRVIAVRK
Sbjct: 541  DFSSSPKSPLSPKKPWAEKETGREYNRDDRGDGMFSIGQTLRIRVGPLKGYLCRVIAVRK 600

Query: 601  KDVTVKLDSQQKVLTVRSDFLSEVYRKTSTVSLRIRISFCSEDTEFGSLKPFDILGNEGS 660
            +DVTVKLDSQQKVLTVRSDFLSEV RK+S VSL       SED     LKPFDILGNEGS
Sbjct: 601  RDVTVKLDSQQKVLTVRSDFLSEVQRKSSAVSL-------SEDP----LKPFDILGNEGS 660

Query: 661  SQDWMGGTGSSTGGDGWNSAGPSSERSPWPSFPESGTSNCPGSSS-NPFGSESLDAQKDV 720
            SQDWMGGTGSS GGDGWNS GPSSE +PWPSFPES T N PGSSS NP GSES DA KD 
Sbjct: 661  SQDWMGGTGSSAGGDGWNSTGPSSEGNPWPSFPESSTLNGPGSSSTNPIGSESFDANKD- 720

Query: 721  EDSPWVSKTVPNASTSWGAAKSNVDDTVNDDQAPGWGKSESWDKATAKTSSDGNASGAWD 780
            EDSPWVSK+ P+ASTSWGAAKS+VD T N+ QA GWGKS+SW K  AKT SDGNASGAW 
Sbjct: 721  EDSPWVSKSTPDASTSWGAAKSSVD-TANNGQASGWGKSDSWGKTIAKTCSDGNASGAWG 780

Query: 781  KKVVPGGDCAGPTDQAEDK-------------------------------DKDSESGGWK 840
            K  VP GD AG T+   DK                               DKD+ESGGWK
Sbjct: 781  KTAVPSGDSAGLTENTWDKWDKGKQVSSDNQTGNWDNGTSGKNEHSAWSRDKDAESGGWK 840

Query: 841  KSQNASFGDENNSAEASADKWSSKNRSSGGWGDCNASTTVSEIKPAGKSNAGSSAWNISD 900
            K+Q+A+F D+   AE++ D W++                 +++ P+G  N G+S      
Sbjct: 841  KTQSANFDDDKTPAESAGD-WTNPE-------------AENKVNPSGW-NEGTSM--KGS 900

Query: 901  ENSSWNTQKQESNRGSWGKPSDRGGTSSSDWNKSTTTDGAGKKDGWNKPNLASEDESIGK 960
            + S+W  Q +    G WGKP + G   SS WNKST+ DGA + D WNKP L S DESIGK
Sbjct: 901  QTSNWGNQDET---GGWGKPKNVGNGGSSAWNKSTSGDGAVENDSWNKPKLFSHDESIGK 960

Query: 961  KGWGQGNEANDSGNKWQSSGSDGGKKWGTNESERDGGNKSWNTSKSSDGDTGGGNSAIWK 1020
            KGWGQ NEA+D+GNKWQSS SDGG KWGTNESE +GG                      K
Sbjct: 961  KGWGQSNEASDNGNKWQSSRSDGGTKWGTNESEHEGG----------------------K 1020

Query: 1021 DKSDSSSLTAPKGDQWGGDWDKQHSSNDTKASEDNSPWNKKSVESGKDGERKDQGSGWNV 1080
            DKSDSSSLT P+GDQ  G WDKQ SSNDTKASE+NSPWNKKSVESGKDGE K+QGSGWNV
Sbjct: 1021 DKSDSSSLTTPRGDQSVGGWDKQRSSNDTKASEENSPWNKKSVESGKDGELKNQGSGWNV 1080

Query: 1081 GKSSGGDSASGWGQTSKEAGSSDQAGSWGSNWKKNSNAGNEDSS---------------W 1140
            GK+SGGDSASGWGQ SKEAGSSD  G+WGSNWKKNS+ GNEDSS               W
Sbjct: 1081 GKTSGGDSASGWGQASKEAGSSDLVGNWGSNWKKNSDVGNEDSSLAKKSNWSSGSGNSNW 1140

Query: 1141 AKKSNWNSGNESNDNQFTG----------DTSGHGSWGGDGSDRGGFRGRGGFRGRGERG 1200
             +KSNWNSGNE N N  TG          DTSG+GSW G+ SDRGG+RGRGGFRGRGERG
Sbjct: 1141 GEKSNWNSGNEYNANHSTGGAEAQTDVSNDTSGYGSWRGENSDRGGYRGRGGFRGRGERG 1200

Query: 1201 RFGGRGRSDRGGSDRGGFGGRGRGRWNSEGGSNDGENRGWSSGGGGGD------------ 1260
            RFGGRGRSDRGG  RGGFGGRGRGRWNSEGGSN G+N+GWSSGGGGGD            
Sbjct: 1201 RFGGRGRSDRGGFGRGGFGGRGRGRWNSEGGSNGGDNKGWSSGGGGGDNKGWSSGGGGGD 1260

Query: 1261 ----------------------------------------------------------WE 1284
                                                                      WE
Sbjct: 1261 NRGWSSGGGGDDNKGWSSGGAGDNKGWGGGGGGDNKGWSSGGGDNKGWSGGGGGGSSDWE 1320

BLAST of MC04g1461 vs. TAIR 10
Match: AT5G04290.1 (kow domain-containing transcription factor 1)

HSP 1 Score: 658.3 bits (1697), Expect = 1.3e-188
Identity = 546/1363 (40.06%), Positives = 747/1363 (54.81%), Query Frame = 0

Query: 1    MASKGKG---IAKDTSSGKRKQREDNNGAD---LARKRRDRSVLQFFEDVAP--EVGGES 60
            M  KGKG      D+ SG +K++      D     +KR++  VLQFFE+ A     GG S
Sbjct: 1    MDRKGKGKQVAGSDSYSGGQKRKNSVEFRDEGLRIKKRKNPEVLQFFEESAEVGYYGGSS 60

Query: 61   DDSD----FLDDFMEEEFDTEPAFK-NDAAKDQNIPFFPKEEEMNEEEFDRIMEEHYNQG 120
            D+ D    FL+D ME+E + E + K     K ++   FPKEE++NEEEFDRIMEE Y  G
Sbjct: 61   DEDDDGLGFLND-MEDEPEVEESSKAGKGEKGKSSFVFPKEEDLNEEEFDRIMEERYKPG 120

Query: 121  PGLGAFAEENYENKNSTGRNPLQPSSRDTISLWKVKCMVGRERQSVFCLMQKFVDLHSFG 180
             G   +A+++   K++   + L P+S+D   +WKVKC +GRER+SVFCLM KFV+L   G
Sbjct: 121  SGFLRYADDDI--KDAIEMDALAPTSKDP-PIWKVKCAIGRERRSVFCLMHKFVELRKIG 180

Query: 181  TKLQIKSAFYVDHVKGFIYIEAPRQYDLIEACKGISGIYSTRIASVPENDISQLLTVRSR 240
            TKL+I S F VDHVKGFI+IEA +++D++EACK + GIY+TR+  +P+ +   LLTV+ +
Sbjct: 181  TKLEIISVFSVDHVKGFIFIEADKEHDVLEACKSLVGIYATRMVLLPKAETPNLLTVQKK 240

Query: 241  VSEVSVGTMARVKNGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMAAKFGGGVAAKKT 300
              +VS GT ARVKNGKYKGDLAQIVAV++ R +A +KL+PRID+QA+  K+GGGV  +K 
Sbjct: 241  TKKVSEGTWARVKNGKYKGDLAQIVAVSDTRNKALIKLIPRIDIQALTQKYGGGVTVQKG 300

Query: 301  TNPAPRLINSSELDEFRPLMQFRRDRETGKLFEFLDGMMLKDGYLFKKISLDSLNCWGVM 360
              PAPRLI+SSEL+EFRPL+Q RRDR+TG  FE LD +MLKDGYL+KK+SLDS++ WGV+
Sbjct: 301  QTPAPRLISSSELEEFRPLIQVRRDRDTGITFEHLDSLMLKDGYLYKKVSLDSISSWGVI 360

Query: 361  PSEDELLKFKPPESNESNDLEWLSQLYGEKRKKKIIRTEKGGGKGEGTSGS--------- 420
            P++DELLKF P +  E+ D+EW+S++YGE+RKKKI+ T + GGKGEG+ G          
Sbjct: 361  PTKDELLKFTPVDRKETGDVEWISEIYGEERKKKILPTCREGGKGEGSGGGKGEGSGGGK 420

Query: 421  ---------------SSMNSFADHDLVCFGRKDFGMILGT-EKDDSYKILKEGPDGSIVV 480
                            S +S+  ++LVCF RKDFG+I+G  +K D YK+LKEG DG +VV
Sbjct: 421  GEGSRGGKGEGSSDFKSESSYELYNLVCFSRKDFGLIVGVDDKGDGYKVLKEGIDGPVVV 480

Query: 481  NVQRKELKSGPLEGKFTAADHNGKIISVSDNVKVLEGSLKDKQGIVKHVYRHTVFVYDEN 540
             V +KE+++GP + KFTA D N K ISV+D VK+ +G  + KQG+V+ VYR  +F+YDE+
Sbjct: 481  TVGKKEMQNGPFDSKFTALDLNKKQISVNDVVKISKGPSEGKQGVVRQVYRGIIFLYDES 540

Query: 541  EVDNDGYFCCKSNMCEKIKISYDAPSGKDDDKGFSGFEDFSFSPKSPLSPKKPWAEKDSG 600
            E +N GYFCCKS  CEK+K+  +  + K      + FEDF  SPKSPLSP+K W  ++  
Sbjct: 541  EEENGGYFCCKSQSCEKVKLFTEESNEKTGGFDGTAFEDFVSSPKSPLSPEKEWQPRERY 600

Query: 601  REYNRDDKDGMFSIGQTLRIRVGPLKGYLCRVIAVRKKDVTVKLDSQQKVLTVRSDFLSE 660
               N+ D    +SIGQ LRIRVGPLKGYLCRVIA+R  DVTVKLDSQ K+ TV+S+ L+E
Sbjct: 601  NSSNQGDIGSTYSIGQKLRIRVGPLKGYLCRVIALRYSDVTVKLDSQHKIFTVKSEHLAE 660

Query: 661  VYRKTSTVSLRIRISFCSEDTEFGSLKPFDILGNEGSSQDWMGGTGSSTGGDGWNSAGPS 720
            V  + + +S        S D   GS +PF +LG E S+ DW  G G+S+ G  WN  GPS
Sbjct: 661  VRDRNTVLS-------TSGDAGTGSFQPFGMLGTESSTGDWAIGAGTSSEGGNWNIGGPS 720

Query: 721  SERSPWPSFPESGTSNCPGSSSNPFGSESLDAQKDVEDSPWVSKTVPNASTS-WGAAKSN 780
            ++     +   +    C     NP+G       K   D   VS TV + +TS W  A + 
Sbjct: 721  TDSHESLNIERNMVQLC--REKNPWG-----GSKPTSD---VSPTVADDNTSAWANAAAE 780

Query: 781  VDDTVNDDQAPG---WGKSESWDKATAKTSSDGNAS----GAWDKKVVPGGDCAGPTDQA 840
                   DQ  G   WGK+ + +  T     D +AS     +W+K+   G   +   D  
Sbjct: 781  NKPASASDQPGGWNPWGKTPASEAGTVSGWGDTSASNVEASSWEKQ---GASTSNVADLG 840

Query: 841  EDKDKDSESGGWKKSQNASFGDENNSAEASADK----WSSKNRSSG--GWG--DCNASTT 900
                    SGG K+ +++ +G    ++E+S  K    W  K  S G   WG  D N+S +
Sbjct: 841  SWGTHGGSSGGNKQDEDSVWGKLCEASESSQKKEESSWGKKGGSDGESSWGNKDGNSSAS 900

Query: 901  VSEIKPAGKSNAGSSAWNISDENSSWNTQKQESNRGSWGKPSDRGGTSSSDWNKSTTTDG 960
              +    G+ + GS     S   S+W+ Q      G +G    + G  SS WNKS     
Sbjct: 901  KKDGVSWGQQDKGSDE---SKGGSAWSNQ-----CGDFGSGKKKDG--SSGWNKSAEDSN 960

Query: 961  AGKK--DGWNKPNLASEDESIGKKG-----WGQGNEANDSGNKWQSSGSDGGKKWG-TNE 1020
            A  K    W +PN   +  S GKKG     WG+ ++    G K   +  DGG  WG  ++
Sbjct: 961  ANSKGVPDWGQPN---DGSSWGKKGDGAASWGKKDDGGSWGKKDDGNKDDGGSSWGKKDD 1020

Query: 1021 SERDGGNKSWNTSKSSDGDTGGG----NSAIWKDKSDSSSLTAPKGDQWGGDWDKQHSSN 1080
             ++D G  SW   K  DG +  G      + W  K D  SL   K D  G  W K+    
Sbjct: 1021 GQKDDGGSSW--EKKFDGGSSWGKKDDGGSSWGKKDDGGSLWGKK-DDGGSSWGKEDDGG 1080

Query: 1081 DT--KASEDNSPWNKKSVESGKDGERKDQGSGWNVGKSSGGDSASGWGQTSKEAGSSDQA 1140
                K  +  S W KK       G++ D GS W   K  GG S   + +  +  G     
Sbjct: 1081 SLWGKKDDGESSWGKKDDGESSWGKKDDGGSSWG-KKDEGGYSEQTFDRGGRGFGGRRGG 1140

Query: 1141 GSWG--SNWKKNSNAGNED--SSWAKKSNWNSGNESNDNQFTGDTSGHGSWGGDGSDRGG 1200
            G  G    + + S+ GN +  + W+K S  +S  + +     GD  G  SWG +    GG
Sbjct: 1141 GRRGGRDQFGRGSSFGNSEDPAPWSKPSGGSSWGKQD-----GD-GGGSSWGKENDAGGG 1200

Query: 1201 FRGRGGFRGRGERGRFGGRGRSDRGGSDRGGFGGRGRG-RWNSEGGSNDGENRGWSSGGG 1260
                 G +  G    +G +     GGS  G     G G  W  +    DG + G   GGG
Sbjct: 1201 --SSWGKQDNGVGSSWGKQNDGSGGGSSWGKQNDAGGGSSWGKQDSGGDGSSWGKQDGGG 1260

Query: 1261 --GGDWEKSGSDRGGFGGGR-----GRGRWNQEAGSNDGDSGGWSGGGGGHRGRGRGRWN 1282
              G  W K  +  GG   G+     G   W ++ G   G S G   GGGG  G   G+ N
Sbjct: 1261 DSGSAWGKQNNTSGGSSWGKQSDAGGGSSWGKQDGGGGGSSWGKQDGGGG-SGSAWGKQN 1313

BLAST of MC04g1461 vs. TAIR 10
Match: AT4G08350.1 (global transcription factor group A2)

HSP 1 Score: 266.5 bits (680), Expect = 1.1e-70
Identity = 248/844 (29.38%), Positives = 378/844 (44.79%), Query Frame = 0

Query: 4   KGKGIAKDTSSGKRKQREDNN---------GADLARKRRDRSVLQFFEDVAPEVG--GES 63
           +G+    D  + +  Q ED++         G   A KR+  S   F +  A +V    E 
Sbjct: 45  RGRSNFIDDYAEEDSQEEDDDDEDYGSSRGGKGAASKRKKPSASIFLDREAHQVDDEDEE 104

Query: 64  DDSDFLDDFMEEEFDTEPAFKNDAAKDQNIPFFPKEE-EMNEEEFDRIMEEHYNQGPGLG 123
           ++ +  DDF+ +     P  + D   ++   F P++E + + E+ +R ++E ++      
Sbjct: 105 EEDEAEDDFIVDNGTDLPDERGDRRYERR--FLPRDENDEDVEDLERRIQERFS-----S 164

Query: 124 AFAEENYENKNSTGRNPLQPSSRDTISLWKVKCMVGRERQSVFCLMQKFVDLHSFGTKLQ 183
              EE  E      +  L PS RD   LW VKC +GRER+   CLMQKF+D    G  LQ
Sbjct: 165 RHHEEYDEEATEVEQQALLPSVRDP-KLWMVKCAIGREREVAVCLMQKFIDR---GADLQ 224

Query: 184 IKSAFYVDHVKGFIYIEAPRQYDLIEACKGISGIYST-RIASVPENDISQLLTVRSRVSE 243
           I+S   +DH+K FIY+EA ++  + EA KG+  IY+  +I  VP  +++ +L+V S+  +
Sbjct: 225 IRSVVALDHLKNFIYVEADKEAHVKEAIKGMRNIYANQKILLVPIREMTDVLSVESKAID 284

Query: 244 VSVGTMARVKNGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMAAKFGG-GVAAKKTTN 303
           +S  T  R+K G YKGDLA++V V+N R+R TVKL+PRIDLQA+A+K  G  V+ KK   
Sbjct: 285 LSRDTWVRMKIGTYKGDLAKVVDVDNVRQRVTVKLIPRIDLQALASKLDGREVSKKKAFV 344

Query: 304 PAPRLINSSELDEFRPLMQFRRDRETGKLFEFLDGMMLKDGYLFKKISLDSLNCWGVMPS 363
           P PR +N  E  E    ++ RRD  TG  FE + GM+ KDG+ +K++SL S+    V P+
Sbjct: 345 PPPRFMNIDEARELHIRVERRRDHMTGDYFENIGGMLFKDGFHYKQVSLKSITVQNVTPT 404

Query: 364 EDELLKFKPPESNESNDLEWLSQLYGEKRK-----------------------------K 423
            DEL KF  P  N   D   LS L+  ++K                              
Sbjct: 405 FDELEKFNKPSENGEGDFGGLSTLFANRKKGHFMKGDAVIVIKGDLKNLKGWVEKVDEEN 464

Query: 424 KIIRTEKGG--------------------------GKGEG-------------------- 483
            +IR+E  G                          G  EG                    
Sbjct: 465 VLIRSEVKGLPDPLAVNERELCKYFEPGNHVKVVSGTHEGATGMVVKVDQHVLIILSDTT 524

Query: 484 -----------------TSGSSSMNSFADHDLVCFGRKDFGMILGTEKDDSYKILKEGPD 543
                            T+G + +  +  HDLV      FG+I+  E ++++++LK  PD
Sbjct: 525 KEHVRVFADHVVESSEVTTGVTKIGDYELHDLVLLDNLSFGVIIRLE-NEAFQVLKGVPD 584

Query: 544 GSIVVNVQRKELKSGPLEGKFTAADHNGKIISVSDNVKVLEGSLKDKQGIVKHVYRHTVF 603
              V  V+ +E+K   LE K    D    +I+V D+V+V+EG  K KQG VKH+Y+  +F
Sbjct: 585 RPEVALVKLREIKC-KLEKKINVQDRYKNVIAVKDDVRVIEGPSKGKQGPVKHIYKGVLF 644

Query: 604 VYDENEVDNDGYFCCKSNMCEKIKISYDAPSGKDDDKGFSGFEDFSFSPKSPLSPKKPWA 663
           +YD + +++ G+ C K   C  +  S    +    D   S + +F      P SP +   
Sbjct: 645 IYDRHHLEHAGFICAKCTSCIVVGGSRSGANRNGGD-SLSRYGNFKAPAPVPSSPGR--F 704

Query: 664 EKDSGREYNRD--------DKDGMFSIGQTLRIRVGPLKGYLCRVIAVRKKDVTVKLDSQ 712
           ++  G  YN           +     +G T++IR+GP KGY   V+ V+   V V+L  +
Sbjct: 705 QRGRGGGYNNSGGRHGGGRGRGDDSLLGTTVKIRLGPFKGYRGPVVEVKGNSVRVEL--E 764

BLAST of MC04g1461 vs. TAIR 10
Match: AT2G34210.1 (Transcription elongation factor Spt5)

HSP 1 Score: 235.3 bits (599), Expect = 2.7e-61
Identity = 238/846 (28.13%), Positives = 371/846 (43.85%), Query Frame = 0

Query: 14  SGKRKQR--EDNNGADLARKRRDRSVLQFFE-DVAPEVGGESDDSDFLD--------DF- 73
           SGK++ R   D++G   ++K+   S    +E +V  +V  + DD D  D        DF 
Sbjct: 36  SGKKRGRSNSDSDGRRGSKKKSSGSAFIDWEVEVDDDVEDDDDDVDVEDGKQQLKFGDFS 95

Query: 74  ----MEEEFDTEPAFKNDAAKDQNIPFFPKEEEMNEEEFDRIMEEHYNQGPGLGAFAEEN 133
               +  E D      +   +     F P EE+++E E  R +E    +      +A+++
Sbjct: 96  LCFIVSGEADLPNEDSDHRRQYYQRGFHPHEEDVDELE-KRTLERLSTK------YAKDD 155

Query: 134 YE--NKNSTGRNPLQPSSRDTISLWKVKCMVGRERQSVFCLMQKFVDLHSFGTKLQIKSA 193
           YE  + N   +  L PS RD   LW VKC +GRER+   CLMQK VD    G++ +I+SA
Sbjct: 156 YELDDVNDVDQQALLPSVRDP-KLWLVKCAIGREREVAVCLMQKIVDR---GSEFKIRSA 215

Query: 194 FYVDHVKGFIYIEAPRQYDLIEACKGISGIYST-RIASVPENDISQLLTVRSRVSEVSVG 253
             +DH++ ++YIEA  +  + EA KG+  IY+  +I  VP  +++ +L+V S+  ++S  
Sbjct: 216 IALDHLQNYVYIEADMEAHVKEAIKGMRNIYANQKILLVPIKEMTAVLSVESKAIDLSRD 275

Query: 254 TMARVKNGKYKGDLAQIVAVNNARKRATVKLVPRIDLQAMAAKFGGGV-AAKKTTNPAPR 313
           +  R+K G YKGDLAQ+V V+N RKR TVKL+PRIDLQA+A K  G     KK   P PR
Sbjct: 276 SWVRMKLGIYKGDLAQVVDVDNVRKRVTVKLIPRIDLQALANKLEGTENVKKKAFAPPPR 335

Query: 314 LINSSELDEFRPLMQFRRDRETGKLFEFLDGMMLKDGYLFKKISLDSLNCWGVMPSEDEL 373
            +N  E  E    ++ RRD  TG  FE + GM+ KDG+L+KK+S  S+    V P+ DEL
Sbjct: 336 FMNIDEARELHIRVEHRRDPMTGDYFENIGGMLFKDGFLYKKVSTKSIAAQNVTPTFDEL 395

Query: 374 LKFKPPESNESNDLEWLSQLYGEKRK-----------------------------KKIIR 433
            +FK P  N   D    S L+  ++K                               +IR
Sbjct: 396 ERFKRPNENGEIDFVDESTLFANRKKGHFMKGDAVIVIKGDLKNLKGWIEKVDEENVLIR 455

Query: 434 TE-------------------------------KGGGKG--------------------- 493
           +E                                 GG G                     
Sbjct: 456 SEMKDLPNPIAVNGRELCKYFEPGNFVKVVSGIHEGGTGMIVKVDQHMLIILSDTTKEHI 515

Query: 494 -----------EGTSGSSSMNSFADHDLVCFGRKDFGMILGTEKDDSYKILKEGPDGSIV 553
                      E T G + +  +  HDLV      FG+IL  +  ++ +ILK  PD S V
Sbjct: 516 CVFADHVAKSAEVTKGVTKIGDYELHDLVILSDFSFGVILKLD-SEAIQILKGVPDSSEV 575

Query: 554 VNVQRKELKSGPLEGKFTAADHNGKIISVSDNVKVLEGSLKDKQGIVKHVYRHTVFVYDE 613
             V+  E+K   +  K    D    +++V D V+V+EG  K KQG V  +Y+  +F++D 
Sbjct: 576 SIVKASEIKY-KIWKKINVQDRYKNVVAVKDVVRVIEGPSKGKQGPVVQIYKGVLFIHDR 635

Query: 614 NEVDNDGYFCCKSNMCEKIKISYDAPSGKDDDKGFSGFEDFSFSPKSPLSPKKPWAEKDS 673
           + +++ G+ C + + C     ++  P+            D  ++P +             
Sbjct: 636 HNLEHTGFICTRCSSCVLAGGNFKTPALVPPSPRRFQRADMGYNPGA-------GGRHQG 695

Query: 674 GREYNRDDKDGMFSIGQTLRIRVGPLKGYLCRVIAVRKKDVTVKLDSQQKVLTVRSDFLS 733
           GR    DD      +G  ++IR+GP KGY  R++ V+ K V V+L++  K++TV    +S
Sbjct: 696 GRGRRGDD----HLVGTYVKIRLGPFKGYSGRLVEVKDKLVRVELEA--KIVTVERKAIS 755
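
The percentages on each HSP statistics line follow directly from the residue counts printed beside them. As a minimal sketch (the function name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of any BLAST toolkit), the following Python recomputes them from a statistics line in the form used in this report:

```python
import re

# Pattern for the statistics line as printed in this report. It also accepts
# the "Postives" spelling that some report generators emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def parse_hsp_stats(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: " + line)
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (round(100 * ident / ident_len, 2),
            round(100 * pos / pos_len, 2))

# Statistics for the AT4G08350.1 hit shown above.
line = "Identity = 248/844 (29.38%), Positives = 378/844 (44.79%), Query Frame = 0"
print(parse_hsp_stats(line))  # (29.38, 44.79)
```

Recomputing from the counts is a quick consistency check when alignment reports are copied between tools.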

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
F4JW79 | 1.8e-187 | 40.06 | Protein RNA-directed DNA methylation 3 OS=Arabidopsis thaliana OX=3702 GN=RDM3 P...
Q9STN3 | 1.5e-69 | 29.38 | Putative transcription elongation factor SPT5 homolog 1 OS=Arabidopsis thaliana ...
O80770 | 3.7e-60 | 28.13 | Putative transcription elongation factor SPT5 homolog 2 OS=Arabidopsis thaliana ...
O13936 | 7.1e-27 | 23.88 | Transcription elongation factor spt5 OS=Schizosaccharomyces pombe (strain 972 / ...
O55201 | 3.0e-25 | 23.10 | Transcription elongation factor SPT5 OS=Mus musculus OX=10090 GN=Supt5h PE=1 SV=...

Match Name | E-value | Identity | Description
XP_022133448.1 | 0.0 | 95.50 | protein RNA-directed DNA methylation 3, partial [Momordica charantia]
XP_023524523.1 | 0.0 | 74.69 | protein RNA-directed DNA methylation 3 [Cucurbita pepo subsp. pepo]
KAG6602288.1 | 0.0 | 74.87 | Protein RNA-directed DNA methylation 3, partial [Cucurbita argyrosperma subsp. s...
XP_022957824.1 | 0.0 | 74.81 | protein RNA-directed DNA methylation 3-like [Cucurbita moschata]
KAG7032970.1 | 0.0 | 74.81 | Protein RNA-directed DNA methylation 3, partial [Cucurbita argyrosperma subsp. a...

Match Name | E-value | Identity | Description
A0A6J1BZ56 | 0.0 | 95.50 | protein RNA-directed DNA methylation 3 OS=Momordica charantia OX=3673 GN=LOC1110...
A0A6J1H340 | 0.0 | 74.81 | protein RNA-directed DNA methylation 3-like OS=Cucurbita moschata OX=3662 GN=LOC...
A0A6J1JRE1 | 0.0 | 70.21 | protein RNA-directed DNA methylation 3-like isoform X3 OS=Cucurbita maxima OX=36...
A0A6J1JYW2 | 0.0 | 70.21 | protein RNA-directed DNA methylation 3-like isoform X1 OS=Cucurbita maxima OX=36...
A0A6J1JV02 | 0.0 | 70.21 | protein RNA-directed DNA methylation 3-like isoform X2 OS=Cucurbita maxima OX=36...

Match Name | E-value | Identity | Description
AT5G04290.1 | 1.3e-188 | 40.06 | kow domain-containing transcription factor 1
AT4G08350.1 | 1.1e-70 | 29.38 | global transcription factor group A2
AT2G34210.1 | 2.7e-61 | 28.13 | Transcription elongation factor Spt5
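
The E-values in these summary tables are plain scientific-notation strings, so hits can be screened numerically once parsed. A minimal sketch, using the three TAIR 10 hits listed above and a hypothetical helper `hits_below` with an arbitrary illustrative cutoff:

```python
# TAIR 10 hits copied from the summary table: (match name, E-value, % identity).
tair_hits = [
    ("AT5G04290.1", "1.3e-188", 40.06),
    ("AT4G08350.1", "1.1e-70", 29.38),
    ("AT2G34210.1", "2.7e-61", 28.13),
]

def hits_below(hits, max_evalue):
    """Return the names of hits whose E-value is at or below max_evalue."""
    return [name for name, evalue, _ in hits if float(evalue) <= max_evalue]

# The 1e-100 cutoff is an assumption chosen only to illustrate the filter.
print(hits_below(tair_hits, 1e-100))  # ['AT5G04290.1']
```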
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR005824 | KOW | SMART | SM00739 | kow_9 | coord: 467..494; e-value: 0.0083; score: 25.3
IPR005824 | KOW | SMART | SM00739 | kow_9 | coord: 573..600; e-value: 26.0; score: 11.3
IPR005824 | KOW | SMART | SM00739 | kow_9 | coord: 230..257; e-value: 0.089; score: 21.9
IPR005824 | KOW | PFAM | PF00467 | KOW | coord: 472..499; e-value: 3.6E-5; score: 23.5
IPR014722 | Ribosomal protein L2, domain 2 | GENE3D | 2.30.30.30 |  | coord: 232..272; e-value: 2.5E-5; score: 26.0
IPR014722 | Ribosomal protein L2, domain 2 | GENE3D | 2.30.30.30 |  | coord: 558..626; e-value: 8.0E-16; score: 59.3
IPR036735 | NusG, N-terminal domain superfamily | GENE3D | 3.30.70.940 |  | coord: 133..231; e-value: 8.7E-30; score: 104.6
IPR005100 | NGN domain | PFAM | PF03439 | Spt5-NGN | coord: 138..222; e-value: 2.1E-19; score: 69.2
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 543..570
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1239..1254
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 658..1284
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 806..908
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 660..711
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 10..32
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1129..1158
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1004..1042
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 725..747
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 791..805
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 961..999
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 553..570
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 934..953
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1044..1117
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..32
None | No IPR available | PANTHER | PTHR11125:SF8 | PROTEIN RNA-DIRECTED DNA METHYLATION 3 | coord: 2..1278
IPR039659 | Transcription elongation factor SPT5 | PANTHER | PTHR11125 | SUPPRESSOR OF TY 5 | coord: 2..1278
IPR041977 | Spt5, KOW domain repeat 4 | CDD | cd06084 | KOW_Spt5_4 | coord: 472..513; e-value: 2.38016E-12; score: 60.6113
IPR041973 | Spt5, KOW domain repeat 1 | CDD | cd06081 | KOW_Spt5_1 | coord: 234..271; e-value: 1.53721E-14; score: 66.7234
IPR039385 | NGN domain, eukaryotic | CDD | cd09888 | NGN_Euk | coord: 138..223; e-value: 6.35318E-34; score: 123.41
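
Several of the MobiDB-lite disorder predictions listed above overlap or nest within one another (for example, 10..32 lies inside 1..32). A minimal sketch, assuming the coordinates are inclusive 1-based residue positions and using a hypothetical helper `merge_intervals`, shows how the listed ranges collapse into non-overlapping regions:

```python
# MobiDB-lite coordinates copied from the InterPro table, in listed order.
disorder = [
    (543, 570), (1239, 1254), (658, 1284), (806, 908), (660, 711),
    (10, 32), (1129, 1158), (1004, 1042), (725, 747), (791, 805),
    (961, 999), (553, 570), (934, 953), (1044, 1117), (1, 32),
]

def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or abuts the previous region: extend it if needed.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(region) for region in merged]

regions = merge_intervals(disorder)
total = sum(end - start + 1 for start, end in regions)
print(regions)  # [(1, 32), (543, 570), (658, 1284)]
print(total)    # 687
```

Under that assumption, the fifteen listed predictions reduce to three distinct regions covering 687 residues.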

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
MC04g1461.1 | MC04g1461.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0032784 | regulation of DNA-templated transcription, elongation
biological_process | GO:0006357 | regulation of transcription by RNA polymerase II
biological_process | GO:0006355 | regulation of transcription, DNA-templated
cellular_component | GO:0005634 | nucleus