Sed0009206 (gene) Chayote v1

Overview
Name: Sed0009206
Type: gene
Organism: Sechium edule (Chayote v1)
Description: Histone-lysine N-methyltransferase ASHH2
Location: LG03: 2582365 .. 2602181 (+)
RNA-Seq Expression: Sed0009206
Synteny: Sed0009206
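The Location value above can be read programmatically. The following is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(text):
    """Split a string like 'LG03: 2582365 .. 2602181 (+)' into its parts.

    Returns (seqid, start, end, strand). The pattern is an assumption based
    only on the single Location value shown in this record.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("LG03: 2582365 .. 2602181 (+)")
print(seqid, strand, span_length(start, end))  # prints: LG03 + 19817
```

Under that assumption the gene spans 19,817 positions on the plus strand of LG03. If the coordinates turn out to be 0-based or half-open, the `+ 1` in `span_length` would need adjusting.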
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CGCTCGGTCGGAAACTGTGGCCAGAACGAACAACGTCTAGAAAGCGTAAACATCCCTCTTCGATTCATCTTCTTCTTTTCTCTCTCTCTCTCTCTCTCTCTTTGCAAACGGATTTCGCAAGAACAGCAATTTTGACGCCATTGAATTAACTGCACGATTTCATCCGTTTTCTCAGTTCCGTCGTCCCCCAGGTAATTTGCCGCCGTTCTTTCTGACCTTTTCTTTCTGTATTTGTTCTCCCTTTTTGCAGAACTCTGTGTTCTCCGGCGCGGTTGTTTCTGCATTGCTTTGCAAATTCTGAAACTAGGGTTTTAATGCTGCAGTGAATCCGCATAGTCCCTTTTTTCGTGTGTTATCTGGCGGTTGGAATTTGTTATCTGACGGGGGTTGGTCTTGTTTTCTCCGATTGAGGATGTTGTTAGTCAGCAGAATCGAGATTTGATTGAAGGAGATTGATATACACATTGTTTGATTGAAGGAGAATAGGGAATTTTTTGAGTTACGGGTTTTGATTGGATTTTGTTATCGACAAGGTAGATGAGTTCATTCATGTCATGACCCGAACCATTTCGCGGCTCTGTTGCTCGGCTGGTCGGCTGTTTGAGTCAACCTCTTCTTCCCCAGCAGTAGTCTCGCCTTGAGATGGCTTCATCTCCGTCTAGTTCAAGGATGTGTGAGCCAGATAGGGAATTGGGAGTGATTACTACCACTATCTGCGCAAATGTGTCGGAGGTAGCTGCTGCAGGGGAGGATTGCACATTCAGAGGCCTCGTACGTGCCGATACTTTATCACTGGATGAACGGTTCAATAGTGATTCCGGAGATATCGGTCTCGGTCTAAACGAAGAGGATGAGTCTTACAACTTGGGGAATGAAACATTCAGCTTGGATATGGAAGAGCCTCAAGATGATGAGGGTTTGGTTGATATTTTGGGCTGTAAAACTACCATGGAAATGATGTCTTTAACTGGGTCATTAGTAAACTCTGTTGAACTCAATGATGATAGTTTTGTAGTTGATACCGTTGCAAAGGTTGTAAGAGATGACTTAAAATCTCCTTCTCACATCTGCGAGATTGTTTCTAATTCAGCTTCTGCTGATGGATTGCCAAGTGACTTCATACAACTAAACGAGCTGGATAATGATGGTGGTTGTTCATTTTCAGATGTTATGGATAACAGGATAAGTGATGGTTCCATTGTCACAGAAGCAGAAATGTTGAATGAGATGTCCCCTTTACGTAGTGGTCAAATACTATCTGTACATGTGGGACAATTAGTTGCCAATTGTGACCAGTATATTTGCGAGATGGATGGGAAAAACTTAAGTGGTACCTCTGGTGAAACAATTAATGAAGTTACTGATATGAACGACATTCATGAATCATGCTTGCAAATGTTGCCTTCACATGGCTGCAAAAAAAGGGAAAGGTTGCAATCTAATAGTTCACCGCTTACCATCCATGCTTTAGAGAATGATCAATGTGGCGAAATGAGTAATTCCTCACCCAAGTACATTTTAGAGGTTGTTGAAGATGTTGATGTCTCGACTACTAATAATAATGCTGATGCTGAACAGTATGTAGATCCCAAGATTAAAAATAACTATAATCTGGAGGAAGCTACTATTCAATTGAGCCTCAACTGTGTAGAGCTACTTGCATCGCCCTTACCTTCTCAATTTCCTAATTGTGAAAGAGATGAATTTCATGAGATGTTGAATGTAGCAGATATTCCAATAAAGGATATTAGTTCTGTCAACTCATGCAGCATCGGTGATCAAGACAACAATGACGTAGAGAAAGTTGGCTGTGTTTCTGAAGTTAAATGTCTTGAAACAGTTATCCCATTCTCCAAGAGGAGTGGTCGAAGAAAAACATCAAGCCAAAAAACTGTAACAAAAAGAGCACCCAGGAAAAGCAGGAAAAAAGTGCCAAATGCACTGATTTTTGACACTGCAAGAAGGAGAAGAAGCTCTATATCTAGACCTGCTCGTCCTTC
ACTCTGGGGATCACTGGACTATATTATTCAGTCGTTTGAAAACAATGAAGATGTTTGGGTAAATCAAAGCCAGAAGCAAGGAAATAAGAAACCTAAAGGTAATCGGGGAGGTACCAAGCTGAATAATAAACAACCAAGTGAAAGTTCACATAGATCACGAAAAGGGACCCAAGTAAATTCTGCTACTTCAACTTCAACCAACCGTATTCGTTTGAAGGTTAAATTAGGAAAGAATATGGGTCATAATTTTCTGAACATTGTGGTTCCGGAGGTTGTTGGTTCATCATTGTCTGCCAAGGGTATCAATAGCAACTATGGGAACAAATCATATTGGGGAGGTAATTTGGAATTTCCTCCATCAACCATTGGCGTTGATGATCAAAAGCTTGAAGAGGGACCTTTAAGAAAAATCTTTTGCTACAATAGGAATCAGGAGAAAGAAGAGAAATGTTCTGATCCTTATATTGTGAAGGAACAGTGTGCTACTAATGATTCAAGTTGCACTAATATTGTAAGCAAGTTGTCTGTAGAACATGCAGATGACAGTTTTGCTGTTTCGTCCGATTTGGTTGAGCTTGTAGAACATGCAAGTGATACCAGGAATTTGGATCCTGGAACTTCACCTGATTCAGAAGTGATAAATTCAATTTTAGATATTCAAGTTGGAGCAGTTCGCCAGGAAAATTTGCAGGAATCAGTTTTGGCGTCCTCAGAAGATTTTGCTGCTTCTGGAAATGTTACCAGTAGTAAGAAAGGAAGAAAGAAGGAAAAGCCTTATCAGGCAGTTAGTTGTTCTCAAGAGGGGGGGACATGTGCTTCTGCTTGCAGTAATGGGTCCAAATCGTCAAAGAAGCATGGAACAAGACTGAATGTAGACAATCAGCTTGGTGCTGGTGAGACATTTACTTCTGTTGGTGAAAACGTTTTAAACGACTCTTTCACCTTTAAGGAATTGTCAACAGAGTCGACAGAGATTGAACATCCAGAAGAGGCTTTGAAAGTGGAGCGCATTCTCGATGTCAAAGAATGTTGCAAAACAGAGGCTTGCAGTGTGTTTCCTGAATCAGAGAATTTGAAAATGTTTCTTCCTTCTCAATCTGCTAGGAAGAAACATCCCAAAAGCTCAAAACCTATTAAAACTAGTGATTGCAAGCCCAAGGATCCTGGCTCAAAAAACAAAATAAAGAATGCTTCTAAAGAGAGGGTTTACCAACGGAAGTTTGTCAATAAGAGTAAAATCAAGAAGGATATATGTGAACAAGTTGTGACCGAAACTGAAAGTCACCAAATAGTAGGTAATTTTACTTTATTTGATTAAGAGTACAGTTGCAGATGCAACTCTTCTGAACTTTCTTGTCGTTGATATTTATGAAATTGTAATTTTCTTTTCTAATTTTACTTGAATGTTTTAGGAAATTTTCTTGTAGACAAGCCTGCAAAAGTTGATGACATCACTGCATCCACAGTGGCAATAAATTTGAATGCGGTACAGGGTGTTGTGAATGAGCAGTATACACCTCCTCGCAATGCTTGGGTGCTCTGTGATGATTGTCAAAAATGGCGACGCATAGCAGCTTCTCTTGTTGATAGTTTAGGACATGCAAGTTGCACTTGGTACACATTTACTAATCCCACAACACTTGAAGAGTGGCAGTGATTCAAAAGTCTTTTTCAAATTGTTAAAAAAATTTCCTTTTAGATTCACAATATGCATTAGAATATTAGAATATATTCTCAGACAACATCTTCCTAAATGGCATTCTACTATGCCCATTTGTATGCACGAGCAAACTTAGGAATGATGGTGAATTGGGATTGAAGAAACTTTAACTATGGGTGTTAATAGACAAATATAGGTTGGGAGTTGGGTAAGTGCCAGTTTTATTTGTCTTGTTAGGTTGCTTACTCTTTGAAGTTTGAACTACCCTACCATGTCTTTAGATTAATTGCTGTAAATATGAACATATTTTTCTTTCTTTCTCGAAACAAAGTCTTCTACT
ATTTACCTATTCCTATCTTTCCTCTGTTCTGGAATGTTTTCTTTGAAAGCTCTCTGTTTTCGAATGTTATTGAACTTACAAGAGAAAACGACCAAATCATACCAGATGCCAGGTTGGCTTGATTTTCAATCTTGTGCTAAATGTAGTTCCCTTTTTTTCACTTTTCCCTTTTGGAGAAAATCAAGGAGGATGTGCAGGCTCTTGTTAGGTTTCATGTCCCTCTTTGGACTTTGGCTGTTAATTTGTTTTGCAATTACCTTTTTGGTTTCATTTTAGTTGATTGGAGCCTTTTTTTTTAGTTTGGCTCATTTTGTTAGGGGGGTTGTTATTTTTTTCCTACCACCTTGTATTTTCAAAATTTTCTCAAAAACAGCTCAACAATGCATTAAAAAAATGCTTTTTACTTTATTCTTACCCCTTTCTAACGTAAAAGTTCTCAAAATTTTTCATGAATGAACTTTATTAAACTATCACACCATTCTAATATTTTCTTGTTATGGAGTGTATTTTGTTCATATGTTGAATTATGTTAAATAACGAATCAGCCCAAAAGCTTAAGTTGATGGGTTGGGTTAAATTTAATTATGTCAACAACACTCCCCCTCACTTGTGAGCTTGGAAATTTGAGAAATGCCCTACAAGTGGAATTCAATTTTAATTGGGGAAGAAAATGACTCGGCAGAGACTTGAACTCAGAATCTCCTGCCCTGATACCATATTGAATTTGTTAAATCAAGAATCAACCAAAAAGTTTAAGCTGATGGGTTGGGTTAACCTTTTTTTAATAGCAACTAATGTACTTTCATATATATATTTAATTATATTAACCAACAACATATATATTACAAGGATATGTATATATGTGTGTGTATAGATAAATTTAAAATCTTAACTTATAGTTATTTGTCTTTGAAGGACTTGTAAGGATAATGTGGATAAAGCTTTTGCTGATTGCTTAATCCCGCAAGAGAAGTCAAATGCAGAGATTAATGCAGAGTTGGAAATATCAGATGAATCTGGGGAAGAAAATGCTTCTAATAAACGGCTTACTTATAGAGAATTTGAGAGTTTTCATCCATTAACAGGTACTTGGATGTAGCTGGGGTGCCTCAATAGTACTATAAATTTTAGAATTAAGCTACAATTTTCCTTGGGAAGTACATTTAAAAAACAACAATTACCTTTTTTTTAGCCTAGTATACTGCATTTCTTAGATTCAAAATGAGACATCTGTACTTGTTTGCAGTGACTGCAGTTCTTCAAGAGAACAAATTTGCTTCCATTAGCAGCAATCAGTTTTTGCATCGTAGTCGTAAAACTCAAAATATTGATGAGGTGCATTTTTCTAAATGATGTTTCGTCTTGATTTATTATTATTAAGAGTTTTATGATGTGTCCTCAAGGTTCAGAGTGGGTTAACTCGTATGCTCAAACTTTATCAAGTGCATTGGTTGGGAAAATTGGACAATGGTAAATGGAAGAACATTTGAAAAGTGTGAAGTGAGGTTTTGAGGATTTATATATTCTTGTGTATTTAGTAAAGAAGATTTTGTTCTCTCTAAAAAAGAAGTAATGGAGTTGGAGATTTTGCTGTATTCTCATGTTCTTGATTTTTGTGAGTGTTTTTTAGCTCTATTGCTTTGGTTAGGGTTAGACCAATGTTTTAAAAGGCTCGTGCAGGTGCCATAGCTCAAGGAATAAGTTTAGCACCTTGCCTTGCGCCTTTAAACCCAAGAGGCATTAAACCTTACTAACTCCCAATTCTATTCGCTAGATCTTTCAACCTCCCTAAAAATTCTATTATTTCTCTAAATCCAAACTCCCCACGAGATTGCAAAAACTAGCATGCTACACCACTTTGCCTTTATCACAAAAAGGCAACGCCAAAAGATCCTCCTCAATCATTTTAAGAGGTATGGGAGTTAGCAAGATTTAACAAAAAGGCGACTCCAATAGAACTTCCTCAATATCAAGCTTTGTTAAAGCTTGAAACTGAAAAG
ACCTTCAATTGTCTCGATTGGGATTTATTGTAGTCCTATTGTTGATTTCTTTTATATTTTTTTCTTGTCTAGTAGGATAGTGAACAAAATTATCTACCTTATCTTGCTAGGTAAAATATAATTTTTTTAGTGCATATAACTGTGGTTTGAATCATTATCTTGTTGGCATTTTTGTTATGTTTGTTTTGTACATATTATAGGTCATTAATCTTAGTCGTATCATGATTTGTCAGTGTTGGAAACCTTTGGGTAGCCAATTTTGTTATTGGATTAACTTCATCTAAGCCAAAGTTATGTTTTCCTGATTTTATATATTTATGAATCTTAAACAGGTAATGGTTTGTCATTGCAAACCAACTTTGGATGGTCGGTTAGGTTGTGCAAATGAATGCTTGAACCGAATGCTCAATATTGAATGCGTCCGAGGTACATGTCCTTGTGGAGACCTTTGTTCTAATCAGCAGGTATAACAATTTTTTCTTAGAGGTTTAAGTTGACCTGTTTAGTGAATAAGTAAACTTCGGATGAAAGGATGAATTCAATTTTTCAATATGTGGTTTCAGTTTAATTCATATGTTTCACTTTCGTGGAGCAAGCTTTCTCTGCCTTTTATGATTTAGCTACCCTTTTATTCACTTTCATTATATTAATGAGAAAATCCTAAACCCTAAACCTAGAGCAAGTTTTTTTAATCTTGTTCTTTTTTATAAAAAAATCTTTGATAGGATCTACTTTCGTATATATTTCACTTACCAATGAAATTGTTCTTGTTTTATATAAAAATTGTGGCTTCAGTTCCAGAAACGCAAGTATGCCAAATTACAGTGGTTGCGATGTGGAAAGAAAGGTTATGGGCTACAGTTGCTTGAAGATATCTCAAAAGGGAAGTTTCTCATTGAGTATGTTGGAGAGGTAACTCTGAAGTCTTATCCATAGTCTAAAACATTCTATAGGAAGATTATCATGATCTTTGAATATTTTTCATATGTCCTGAGAGGTGCAGGTATATTTCTTGCTCCATTTCTTTTGTCTATCATGTTGATTGCATGTGCAGGTAAGTAACTGTTGTATTGTGGGACTAATAATGTAATCTCTCCTTTATGACTTCACTCTTTCGTCTGACTACAAAAATTGTGTAATAGTGTGAATATGTCATGAACATGTGAAGCTCTAGGTTGAATTTTCATAATTGTAAAATCCCTTGTTGTCTCCCGAGTCCTGGCCCAGGGATGGGCACGAGTGCCCCAGGGTATAGTGGAGCGAAGCTCCGATATCCCGGTTTCAAAAAAAGAACAAACAAAGCAATGGAAGGATAGTAGGCTAAGTGATGAGAACGTGAGTTCCAAGTCCCTTATAGTATGCTTTTTTTTCGTTCAAAAAAATGAAATTTCTTTTTCCAACTAGAGGTGTTATCTGCAACCTTTTTGGTATGGGTTTTGACTTACCCCTTTCTTGTAATTTCATTACATTAACTTGATTTTACCTTACACACCCATTTTGATGTTTATGAGGTTTTATGATTTCATGTTTTTCATATTCATTTATCTTTTAATCTTAACAAATTCATTCATATTTATTTTTTATTTCTATATTCGGTCTACACAGGTCCTTGACATGCATGCTTATGAATCACGTCAAAAGGAGTATGCATTTAATGGCCATAGACATTTTTACTTCATGACACTAAATGGCAGTGAGGTTAAATCTTTATTTTCTTTGCTACTGTTCTTATTGTTGAGCCATCCATTATTATCTGATGTCTTTGTTTTGCGGCCTTTCGTTTTTTTTTGTATGCCTCTTTTTTGTATGTTCTTTCGTTCTTTTTCAATGAAAGCTTGGTTTGCATCGAAAAAAATCATTGAATATCTTTCCATAAAAGAGTTCTTGTTCTTAGTGGCTGATCATCTTTTTGGAGGTTTGGATGAAGTATGTTGTCGTTTTCCTTAGTGTTATTCAATGGTTTATTTATCTTTTAACCATATTCTAAGACGGTGGAGTTG
GGGGTTAGACATTGGTAATTTGGAAGCTCGAAATAGAGCTCTCTAAGAAAAATGATAATTTTTCAGGTTGAGTTAGTTGTCTACTTAGTTGGATTTTCATAGCTCAGTTTTTTGGCGGCTTGTAGCTTTGCCACTCCATCATTTGCTTCCAACACTTTATCTTAGTGTTGCTGTTCGTAGCTGTGTTTCATCTATGAAGTTTGAAGCATGGAGCTTTATTGGATGCTTTCTTTTCTATTTCGTGTGGCATTCTGTTGTTTTGCTATTGTTTAGTCCTTTTGCCTTGTTTTGGTCTTTTTTCTTTTTTGTATCTCTTGTTATCTCCTTTTTTTAGTCCATTTTTTAGGTTTGTATCTAAAGCCTTTGCTTCTTTTCATTCGCTTAATGAAAAGCTTATTTTTTGTTTTAAAAAAACTTTGCTTGATTTATTGATCTTGGGTCATCCAATTTCTTAAAGGAAAGACTTGTTTGTGAATTTATTTTTTTATTAGATAACTTGTTTGTGAATTTATATCATCAAAGACTTTGTTGGTCCATTTGACTAGAGAGTGAAACTGCATCTTCGAAGATAAGACACAAGAATTCAAATCTTTCAAGTGATCCTCTCGCTTGGAGTAATTTTTCTCCTTACATTTCATTGAAAAACTTGCTTGTTTTCTATGCACCCATGTGGAGAGATACTAAAAGGAACCTGTTGAAGGAATCTATTTTTCTCCTTCACCTGAATTCCTTTTAACTAATGGTTTGAGAATATAAGTATGCCTTCTTGGTTACTACCAAGTGAAAATTATCTTATCGGAATTGTTTTTGTTCCTATTGCATTCTATTAGTAGGGCTAACATTTTTTTGTGTAAACAAATTATAGCTTGGGATTTTATCATATTCCATACAGTCCAATAACACCGAAGAGGAACTAAAATATGGACGAGTTTTCTTAATTGCATCATTTTTCCTGTCCCCAAGGGGTGGCACAGTAGTTCAAGACTTGGGCTTTGAAGGTATGCTCCCTTCAAGGTCCTAGGTTCGAGACTTGTCTGTGACATTGCTTCTTCTATGTCTCCCGGTGCCTGGCCTAGGGACGGGCGTGGTTGCCCTTGTTTCAAAAAAAAAAAATTTGTATCAGTTTTCCCCTTTATTTCCCCTATTGGTGAAAGTTATTTCAAGTCTGTTATTGGTCTATACATGTTTTTTTTCCCAGGTCATAGATGCATGTGGAAAGGGAAATTTGGGGCGTTTCATTAACCACAGTTGTGATCCAAATTGCCGCACGGAAAAGGTTTATATTCTATTCTCTAAAACATTGTAGCATATTGTTATGTTTTCTTGATATATTGTCGTTGTCTTCTTTCTTCTTCTTACTCGTGTGTGTGAGAGAACTTTCATCTGGGAAGGTAGGACTGATATGCAGAAGTGGAATTGGCATGCTATTGCTCGACGTCATCCTATGTTTTGCTTTTGCCTCATAGGTTGAGCGAAGAGTTTCCTAGTCTGATTACAGAGACTCGAAGGGCTTTTTATTCCAAGGAGACGAATTCTTGATAAAGTTCTTCTAGATAGTGAGGTGATTGAGGATTATAGGAGTTCAAGCATGGAGATGTATTTTTCATCTTTGTTTCTTATAAAAGTAGTGTTGGTTTTGGTTGATATAATTACATTTACCCCAATCCATCAACCTAAGTTTTTGAGTTAATTGACAATTTCACAAATGAGGGGGAGTGTTGGTTGATATAATTAAATTTACTCCAAGCCATCGGCTAAAGCTTTTGAGTTGACTATTATTACTTCTTGGATCAGACAATTGCATTTGATAAATTATTCTAATTCATGCATGTCAATGAAAGATAATAGTTTCAGACAAAACCATACTTTATTAAAATGGAACTGGAGATACCCTTTAATTATTCTATAATCTATAATATTTCTGCCAAAAGGAGTAGCTCAACGACTTCAATGTATACTTATGAATCAAAAGGTCATGGATCGAACCCCCACCCCATTAC
TGAACTCAAAAAAATTATTCTAGCATATTTCTCACAAAATACACTAACATCATTGGATGAGAGACTAGTGGTGGCCTTTTGGAATATTTATCTTTAAACTCTAGAAGTTGGCCTTTTGGAATAAACTAGAAGAGACTAGAGGTGGTCATAGGGAATGATGCTATGTTGTGGTTAAATAGTGAAAGAATGTCCGTTAAATATATAGTCGTGAAGAATAGATATCAGTATACCATTCTTTAGAATTTTTCCAATCATTTTTTTTTGGATAAGCCGAAAGGAGTAGTACTCCAGTCCAGTGGCATCATGTATCATGCAGTAACCAGTTAGATAGGGTTCGAGTACGAGTACCAAATGTTGTTGAATTCAAAAGAAAAGAAGAATTTGGATAAAGATAAAAAATTTCATGGGGTAACGAATGAAAGAACATTATGAAGAGATAAACAAAAACATAGCCCGAATAGGGAGCAATCCACTTAAAGGCAGAGTCTCCAATCAAGTAAAATGACTCCTAGAGAATAACTACAAAAAAAATTGGACACAGAGACCTAAAGAGAATCATTGAATCTAACGAGAGCTCAAATATCCATCAAAGAACTCTCTGTCACTCTAAAGATTCTATTATTTCTCTCACCTCAAAGACCCCACAATATAGCACAGTGCACACAACCCAATTTTCTACAAAAAATTCTCTTTTCTCACTAAAGGGAGGGAGACAAAAAAAAAATTCTGGAAACTATGTTCAACTTTGTTCTTATGTATTTACATGCCAGCCAACCTTAATTTTCAACAGTTTGAGATTTTTAATATTTCTGTGTTGACCAGGTTAGTTTCTACTTCTCTATAGCAATGAGAATTTTATTTTCCAAATTTTCCTATCACCGAAGGCAGTGTTAGCGATTTAAATTTAAAATTTCAGAGATTGATTAAAGAATTTACTATTGTAGTTCTCTTTTATTTTAATTCTCATTTTCTTGGTTTTGGAACAGTGGATGGTGAATGGAGAAATTTGCATTGGTCTTTTTGCACTAAGGGATATTAAGAAGGTATATTTAATGTTTTCTTTTGTTACCTTTGTTTTCATTTACTGAAACATTAACAGAATGAATATCCATAAGTTATGATTTGGATGACTTTTGGTTGGAAGGAGTGATAGAATTTTTAATTAGAGTAGAGTGGTTGTGTTGTCGATAAGCTTGGTTTAATGCTGCATTTTGTGTTTTGTTTCAAATTCTCTTTTTGATTATTATAACTCAAGTAATGCTAATTGGAGAGTGTTCTTTTAAACTCTTGGAAGGTGTTTGGATGATAAGTTGTGGAGAAGTCAGTAGAGGGATTATAATAATAGCTCGTTTGGATGTGGTTGTCTGTAGAGGAGTTATAATAATATTTTGTTTGGATGTGATTGTTTCGAAAGGATTTGTAGGAGAGTTTGTTATTTATGAATGATGTGACATTTATGATATATTTTTTTTCAATAAAAGATAATGTCTTTTTATTTGTTTATTATTTCTAATCTCAAATATTATATTTATGGTTAGAAGGTTCTTTGATATTTAAGCTGAGCTGTAATGAAGTCAAGCTGTAATTAATTAGTTTTGTTCACATTATCCACAGAAGCAGGACCATGCAATTTCATAACTTTTTGTAGTTCTCTTTTGAAATAAAAGTCCAGTCTTATAGACAATCTTTTTACTTGATCCTGGAACTATATGTATGGATGGGTGAGACTGAATAGCCTTGACTTTTCATTCATATCGTAGACTGTATATTATATCTTTCTTAAAAATTATCCTTTCCTTTTCTTAAAAATTTCATGGTTGATTCAGAATTTTGGACATTCCTTCTTTTGTTTGTTCAAAGCTCGGTATCTGTTACTTTTGTGTCATTTTGGCGAGCAAGAGTTTTATTTGTTTTTTTTTGGTTAAATAAAGTTTAATAAGAATAAATAGAATGTTTGAAAATAGTCATCACTTATTTTTTTCATGAAACAAGAAACAAGC
TTTTCATGAAAAGAGTCTGAAACTCAAAGAAACAAACTCCCAAAGGTGTGAAAAAAAGGAGAAAACCAACAGAACATAAAGAAACATAGACATCACTTATTCAAGTCATATTCAGAGCTCACTTGCATTTTTTGCTAGTTAACAAATCTTGATTGGTTGTATCAGATGTATTTTGTGGTCCAAACCTTTTCTTGTTGGTCCCTTCTTTACTGAAGAGTATTAATGCTTTCTATTTGATTGTTGTCTATATCCGATCAATTTCATACTTGAGTGCCTGGCTTTGCATTTGCAAGAAGCTTTATCCTCTCCGGTTCTTTTCTGTTTCAGGGTGAAGAAGTGACATTTGACTACAATTATGTAAGGGTATGTGGAGCTGCAGCCAAAAAATGTTATTGTGGGTCTTCCCAATGTCGAGGTTATATAGGTGGAGACCCTCACAATTCTGAGGTCATTATTCATAGTGATCCAGATGAAGAATTTCCAGAACCAGTGATGCTTCGTGCATATGGTAGAAGTTCAAATGGTAACTTGCCAACTGCAGCTAGTTCGGTGGATGGTGCTAAAATGCAACTCTCAGAGCGTATAAAAGGGGTTAGGAATAAGAAAGAACAACCTACTGGTATAGCTATCCAAATGAAGATTCTAGAAGAAAAAGAGGAACCCTTTCAGCTTTCTGCTTTGAAGATTTCAGAGGATCCTCCAAAGCTTTCCGCTTCAAAGATTTCTGAAGAACAAGAGGATCATCATAACCTTTCTGCCTTAATTATATCACCATTGCACAGTTCATTGGAATTTGAAGACTCGAAGGTAGCATCATCAATTCGGCTGCCAGAAATTACTCAACAGACTGGAGATGTGACAAGCATACCGGTCTTTGTTGATCAGACAGAAATACCTCTTGTGAACAATATTTCTGACAAAAATACATGCTCCATTGAGCAGGAGGCGAAGTTATCATTTGATGACATTGATGCTCGCAAGAACTCCGAGCTGGATGCTATTGAAGATAAGCAAGTGTATATAAATTCGCATCCTCAGATGAAAACTTCGCGTAAACAAGGTTCCATCAAGAAAGGAAAAGTTAGCTCTGTAGAAAAAGTAAAAATAACTAACAAGCCTCAGATTTTGTCTTTAAAGTCCAAGCGATTGTTTGAAGGTTCTCCGGGTAACCGCTTTGAGGCAGGTTAGTCATGTTTTTGGGATACTAGCTTTTGCTTAAATAGCTGTGATGAGCATATTAATAGTTAATGCTATTTAATGCTTCAGTCGAGGAGAAGCTTAATGAGCTCCTGGATGCTGAAGGGGGGATTAGCAAAAGAAAAGTGAGTCATCATGACATTCCACATTTTGATTTGTTGAATTTTTTGCTCTTCTATTGGCTTTAATTGCTGGTGGCAATGAGTGGTTATTTTATTTTGGCTCTTTTATGCTTAACTAACTGCTACTCTTTCTTAATGTAATTTTGTTGCGTTTTTTTCAAGGCTCAAAATATTGGTTTCTGCTGAATGTGTTTTTTAACCAAACCCAAAGGATAAACAAAGTGGATTTGCATCTGCAGCAGTGTGCTTGTTTTTTGTTGAGGAATACATGCATAGCGTTCAAGTACTCATAAGCTTTTTCCTCCCTCTAACAGGATGCTCCAAAAGGGTACTTAAAGCTTCTTCTCCTGACTGCTGCATCAGGTGCAAGTGCCAGTGGTGAAGCAATTCAGAGGTTGCCTTCGATCAGTTTTATATTTTTGATGCTCTAAGTTATAAACTTTATAGCCTGCATTCGCAGTCTTTTGAACTCACTTATGTCATGCATGTGAATTAAATGCCTTTTTTTTACAGCAACCGTGATCTTTCAATGCTCCTTGATGCTCTTTTGAAGACAAAATCACGAGTAGTATTGACTGACATAATAAACAAAAATGGTATCCCCCCTTGGTCTTGGAGTATATTTTCTTTAATCTTGCTCACTTATAGTACCATTTATTTTATAGAACACTACAAACTATT
AATTCCCCCATTTTGTTTGCTCTCAGGTTTACGGATGCTGCATAATATAATGAAACAATACAGAAGTGACTTCAAAAAGATACCTATTCTTCGGAAGCTTTTGAAGGTAGTACTCTCGATAGAAATTGTAATATTTCTTTTCTGAGATTGGCAAGACTTTTCCCAGCTCAATATATGTATTATGTATATATAATTGTTTCCTTGTCTTCATTATGTGACTAATGAAAAGTTGAGTTAAATATTTGTTTTAGTACTCCACTTTTAATCGAGAATAAACATGAGTCAAGTGCGTAATTAGCAAAGAAACTGACGGTATTGAAAAGAACTTGCTATTATATTGAAAGCTAACTATAAGAAATGTTTGTTTCTAAATGATGGATGTTGTGAATGGCAAGTCTCTTGGCTTTGAATATGGAGGACTAGAAATTTTGGTAAATATTGGCTGGCTTCCTGCTTTGGGCTGGTCTTTCTCTTTTCCTTTCTTTTCTCCTTGTCTTTGCACTTCCACTCAGAATAATGAATTTTTTTTTTTGATTTTATTTGACGTGCCTTTATATCCCTTGCTTAAAATCTTTATGTTTTTTTAATTCATACTCCTTATGTGAATATGTATCCTTTGAATATTTTTGTTCCTTTTCATTTTACCCATAAAAATTTCATTGCTAAAAGATGATAAGCTCAAACATTTTCAGACAAACAATTGAATTTTGAAGAATTTCATGGAATGTGGAGTTAAATTACTTAATTACATGTGCTAAAACCTAAAGGACCTAAACAATCGGATTTTGCCGGGCTTCATAAAATTTAGTTAAATTACTTTTTTTTAATAGCTAACCTTTTTTTATTTTAGTTTATATTAATAAATCAACTTTGAAAGCCTACATTAGTAAGCTTTATATTTTCTATTATTTTCCAGCTAATATCTAAACCCAAAAATGTATTTAGATTCATTTAAATTTTTCTCCATTCCAAACAAGATAAAGAATTTGAAATCACTTTTCAGATTTTAAATCACATTTCATAGAATTTCTTGATGTTAGTGGGTTCATAATCGAACTCTTTTGCAGGTTTTAGAATATTTGGTAACAAGGGAGATACTCACCTCAGAGCATATTTATGGAAGTCCCCCATGCCCTGGAATGGAAAGGTTAGAGATTTGATGCTTAGTTCCAAACAGTGGTTTCCTAATCATTATGCAGATAGCGTTGAAATATGTGGTAGTTTAAATCTGTTGAGATGTGTTGATGAAAACTTCAATTTCTTCATGATGTTGCCTTTTGTAGTTTTAAAATAGAAGGTACATCATGAAAGAATACAAAAGAGATGATAAGAAATTCTCATTTATCACCTCGATCCTCATATGAAAGCACTCCAATCGAGATTCAACACCAATAGCTAAGAGAACAGATACAAAAAGGTGTAATTTTTTTTATAAGAATTTCATTGACGTATTAAAGTTACAGGTTACAAAATGAATAATAAACCCACGAGTTACAAAAAAAAAAATCCTAATTTGTCAAAAGGGAGAAAAGGCTATAAGAAGCGAAGGCGTTAGACAATTTATCCCAAGAGACAACATGACAAACAACACAAAAGGGAGAAAAGGCTATAAGAAGCGAAGGCGTTAGACAATTTATCCCGAGAGACAACATGACAAACAACAACGTCCAGGAGTTTGTTGGGAGGAAGCTCCTCATCTAGGAAAACTCTTTGGTTTCGTTCTTTCCACAAAATCCAAAAGGTGCCCTATTGAGGCTCTTCCACTATAGATAATAGTTCTTTACCCCTTGATGGTTTCATTTTGAAAAAGTATCTTGGAAACTCTTGTAATGTCAAATTTCATAATGTTCCAACATTAAAAAAAAATGAAATTTGTAAAATTTCTGATTTGTAGTCATTTAGACAGGAGTTTTTTTCAGTACCAAAAAAGCTTCAGAACTTTCCATATCTCATCCTTAGTCAAAGCAACCTCTAACAAACCAGCTTTGGTGACTGTTGAT
GGGGCTTCATGAAATGTCAGAAGGACAACCAGACAAGTTATCCTGCTTGGTATAGAGATCCTAGTAAAAATTCCCAAATTTTGACACAATTTCACCTTCCCCATTAAGCTTCTTCCTTGTACTGAAAGTAACTCCAATATGGTACTTTTATGCTTTTGGTGGTGATTATATGATGGAGAAAGTTGGTGTTTGTATCACCTTCCTCGAGCCATCTTTTCTTGCACTTTTGCCTCCATTTTCGCTCCTTGGCAGCCTAGGATAAAAGTTTATCCTTCATAAGCTTTCTGTGATTATGGTGTAAAGGAGTAGAACTAACTTCCATTTGATCGAGCAAAGTAACTTCAGTGATTAGTTGATTCTTTTTAGTGGTAGTAAACCCAAAACCTTCAAAGTTCCATCTCTTCATCCTGTCTTTAATCTCTTTCAACTTCATAATAAAGCCATCACCCGACCAATCCGCAAGAGGTGTTTTTCTCCAATACTCAAACCATCTGGTCTAGCCACATATTTTCAAATCGTGAAGGGGCCGCTAGACCCTATTTGATTCGACTCATGGAAAAAGTAGTAGGGTATGTTTGAAGTCGGACAATCCAATCTTTTTACATTGTCATTTACCAATTTTGTAAGGAGTTCTCTGTAGCCAAAATTTTGTTTACTGGAGTCATGGTGGGGGAGTTCGATTATCAGATGGTGTATATACAGTTTTGGAGGAGTAGATCGATGAGCTCGATATCATTGATAAATTTATTTAAAAGCCTCACACCCCTAGTATCCTTGCCACTGGATTTTTCATGAACCCAGCGAGATATATTTTAAAGTCACTAGCCAAGAGCCAATTCTTTGAGCAAAGACAAGATTAGTCGTAGCTCGTATATTTCCTGCAAAAAGATAGACCTCTCTATACATGGATGGTCTGTAGACCCCTGAAATATAGAAACTAAACCCATCAACAAAAATGATAAATGAAAATAGACCTTTAGTGACTTTGATGATAGGCATAGATGAGTCATTCCACATAGTAAGACTCCTTCCCGAAGATCCAGTCTAGAAGGAAAATGAGAATGTCAAGACTGATATGTGTGAAGGCAAAGTTGCTTACAGTATTTGACATCTAATGTGGGCAAGTAGTTCAAGTAGTATGCCATTGTTTTTCTTCACTCTTGGTATGAACTCCAAAAATAAACGAGCTTTGCCGGTTCTTGCATGCACAAATGTGTTTGTTCTTTTCTTTTAGATGTCTGCTTCCAAAAGGGAAAAGGCAACTTTCATCGAAGTATTTGATGTGCGTATGTGTACCCATACCCATCTTTTATCAAAGGTTTTGACATGCGTATCTGTATCTTTAGGTAATCATATCCTTGTGCAATGAGTGATGATTTGCAAGTTCTTCTTTTGTGCGGGTATATTTTTGTGATCCTTACATTATAACTTTTATGCAGTTTGAGGGAGTCTTTACTATCACTGACAGAGCATGATGACAAACAGGTATTAGTTACAAACTATTACAGTATTGATTGACCTATTCCAGTCATAGGTGTTGAGTTGTATGTTTTTTAGCGTTCCTATAGCACAATATTGCAAAGATTAGAGTATAGGAACTTCAGCTACAAATGAAAGTCTTAAATAATATATATATATATATACACGGTGCAGTAAAATGGGTCCCTATAGCTTTACTATGTTAAAGAGTAAAGTGACAAATATATAGCAGTTCACAGTTGCCAGTTGTAGTGTATCAAGAAAGGAGTTACTGCGTTAAATCCTCCTTTGAGCACTAAGATAAGTCGGTATAAAATATTTTATAGTGTATACTGCATCTATACAATTTGCTAAAAGAGCACCTCTAATTCTCTATGCAAAGGTCAAAAAGCATTCAAATATAATCCTCTATCCATTTCCTGTGAAATGAAAAGTCTTTAAAGCAGGCATTATCTATGGGCAAAGCCTTCAAAAAGTTTTGATGCGTTAAGCATGGACCTGCAGGTACATCAAATTGCACG
AGGTTTTCGAGACAGATGGTTCCCCAGACATAATAGAAAATTTGGTTACTCTGGGAGGGCGGATGGGAGATTGGAAGCTTACAGGGGTACAAACTGCAGTAGGTTTACAGCATCACCCAGTTATTGTCATGATCAGGATTTCAGACCCTCTGAAGCAATCGACTGTAATAAGCAGCCGTCAACACCAACATCTCTGTCAGATTTTCATCCTGCAGAGGTCTGTTCAGTGCCTTCTACAGCTGGTCACTCATTGGATGGGCAAAGAATTCGTAAGCGTAAGAGTCGATGGGATCAGCCTGCAGACACAAGCATAGATCTGAGATCAAAGGAGCAGAAGCTTGAATCAACATCGGTGCAGCGATTTGATTCCAGCCAATCAAATAGTGTTGGAGTGGCATCAATGTTGGTAGACAAGATAAACAGTGACAATATGGGCTCCTCCCTCTCTGGTTCTGTTGAAATTTGTTGTCGCCGAGATGAAGATATTCGGTTAGACAGTGCAGTGCATAACACCCCTGAAGATATTCCTCCTGGATTTTCATCTCCCTTCAACCTCCCCGTGGCTTCCTCAAGTCCTTTTTCAACCATTTTAGATCCTCCTCGACAGAGCATAGACAATTTGAGTTGTACTTTTTCTACAGTTGGGCACCCACAGGAAAGATTTATTTCTCGATTGCCGGTGTCCTATGGAATTCCGTTTTCTATTGTCGAGCAATGTGGAACATCCGATTCAGAAAATCTGGAGTTTTGGTGTTGGGATGTTGCTCCTGGAGTGCCTTTTCATCCTTTTCCACCTTTGCCACCATATCCCCGAGGTAAGCGAGGCCCGTCAACGTCTGCCTGTGGTACTGCAGTTACACAATCTGATCGAGAAAGGCAGGTCAAGAGCCATGATTCTCAAACTTCTTTCTCAGAAGAAAGCGCTCCTAGTACAAGTACTAATTACCAACAGGGTTTGCGCATTCTATCAAACAACCAACAGACATTGAAACAGGCTGAGGAATCATCATATAATCTGGGGAGAAGGTACTTTAGGCAGCAAAAGTGGCGCAATACAAAGTTTAGGCCCTCTTGGTCACAAAGAAAGAGTCAATGGAGATACCAGGGGAACTTCAGGGGTGAGGTGAGCACTATGAGTGATGACAGTACACCAAACCAAAGGGATAAGCCCATATTGCTCGGATGAGGCAAGTGTTAGAGTGGACAAAGCTAACCAAAATCAGAGTTGATTCTCAGGAAACATCATGAACTGCTGAAATTTCAATAGAACCCAATGTTTTTTCCTAGGTTTGATAATCAAAGGAAAGTTTGTTCAACTTCAGTACTTACTCCAGCTTTAGTAGTACAATGCATGAGGTTGATCTTTGCTTCCATCAGCATATGAGGTTTTGGATCTTTATCCATCAATCGGCATCTCACAGAGGTAGGCTTTCTTATTTGTATATCTTTTTCTCACAAATATAAAATTGTGACACGCAACGTTCAGAGATCTGGGGAAAATAAAATTTTCATTGATTCTTGACCTCATTTGCCTAAGATTATTGGAGATTCTAAACATTTCTTTTTCATTTGTCCATAAGCTAAACTATCATAGATTGACCTAGTAATTAAATACCATTGTAAATTATGAAGTGTTTAGAAATTATGGGTTCAATATATAATGGCTACCTATCGAGGAATTGATTTCCCACAAATTACCTTGACTATCAAAGATAGAGGATCAGGCTTGTCATGTAAAAATAGTCGATATGCGTGTAAGCTAGTCTGAATACCGATATCAACAACAAGAAAAAAGGTTTACCTTTCAAGTACTTGGC

mRNA sequence

CGCTCGGTCGGAAACTGTGGCCAGAACGAACAACGTCTAGAAAGCGTAAACATCCCTCTTCGATTCATCTTCTTCTTTTCTCTCTCTCTCTCTCTCTCTCTTTGCAAACGGATTTCGCAAGAACAGCAATTTTGACGCCATTGAATTAACTGCACGATTTCATCCGTTTTCTCAGTTCCGTCGTCCCCCAGTGAATCCGCATAGTCCCTTTTTTCGTGTGTTATCTGGCGGTTGGAATTTGTTATCTGACGGGGGTTGGTCTTGTTTTCTCCGATTGAGGATGTTGTTAGTCAGCAGAATCGAGATTTGATTGAAGGAGATTGATATACACATTGTTTGATTGAAGGAGAATAGGGAATTTTTTGAGTTACGGGTTTTGATTGGATTTTGTTATCGACAAGGTAGATGAGTTCATTCATGTCATGACCCGAACCATTTCGCGGCTCTGTTGCTCGGCTGGTCGGCTGTTTGAGTCAACCTCTTCTTCCCCAGCAGTAGTCTCGCCTTGAGATGGCTTCATCTCCGTCTAGTTCAAGGATGTGTGAGCCAGATAGGGAATTGGGAGTGATTACTACCACTATCTGCGCAAATGTGTCGGAGGTAGCTGCTGCAGGGGAGGATTGCACATTCAGAGGCCTCGTACGTGCCGATACTTTATCACTGGATGAACGGTTCAATAGTGATTCCGGAGATATCGGTCTCGGTCTAAACGAAGAGGATGAGTCTTACAACTTGGGGAATGAAACATTCAGCTTGGATATGGAAGAGCCTCAAGATGATGAGGGTTTGGTTGATATTTTGGGCTGTAAAACTACCATGGAAATGATGTCTTTAACTGGGTCATTAGTAAACTCTGTTGAACTCAATGATGATAGTTTTGTAGTTGATACCGTTGCAAAGGTTGTAAGAGATGACTTAAAATCTCCTTCTCACATCTGCGAGATTGTTTCTAATTCAGCTTCTGCTGATGGATTGCCAAGTGACTTCATACAACTAAACGAGCTGGATAATGATGGTGGTTGTTCATTTTCAGATGTTATGGATAACAGGATAAGTGATGGTTCCATTGTCACAGAAGCAGAAATGTTGAATGAGATGTCCCCTTTACGTAGTGGTCAAATACTATCTGTACATGTGGGACAATTAGTTGCCAATTGTGACCAGTATATTTGCGAGATGGATGGGAAAAACTTAAGTGGTACCTCTGGTGAAACAATTAATGAAGTTACTGATATGAACGACATTCATGAATCATGCTTGCAAATGTTGCCTTCACATGGCTGCAAAAAAAGGGAAAGGTTGCAATCTAATAGTTCACCGCTTACCATCCATGCTTTAGAGAATGATCAATGTGGCGAAATGAGTAATTCCTCACCCAAGTACATTTTAGAGGTTGTTGAAGATGTTGATGTCTCGACTACTAATAATAATGCTGATGCTGAACAGTATGTAGATCCCAAGATTAAAAATAACTATAATCTGGAGGAAGCTACTATTCAATTGAGCCTCAACTGTGTAGAGCTACTTGCATCGCCCTTACCTTCTCAATTTCCTAATTGTGAAAGAGATGAATTTCATGAGATGTTGAATGTAGCAGATATTCCAATAAAGGATATTAGTTCTGTCAACTCATGCAGCATCGGTGATCAAGACAACAATGACGTAGAGAAAGTTGGCTGTGTTTCTGAAGTTAAATGTCTTGAAACAGTTATCCCATTCTCCAAGAGGAGTGGTCGAAGAAAAACATCAAGCCAAAAAACTGTAACAAAAAGAGCACCCAGGAAAAGCAGGAAAAAAGTGCCAAATGCACTGATTTTTGACACTGCAAGAAGGAGAAGAAGCTCTATATCTAGACCTGCTCGTCCTTCACTCTGGGGATCACTGGACTATATTATTCAGTCGTTTGAAAACAATGAAGATGTTTGGGTAAATCAAAGCCAGAAGCAAGGAAATAAGAAACCTAAAGGTAATCGGGGAGGTACCAAGCTGAATAATAAACA
ACCAAGTGAAAGTTCACATAGATCACGAAAAGGGACCCAAGTAAATTCTGCTACTTCAACTTCAACCAACCGTATTCGTTTGAAGGTTAAATTAGGAAAGAATATGGGTCATAATTTTCTGAACATTGTGGTTCCGGAGGTTGTTGGTTCATCATTGTCTGCCAAGGGTATCAATAGCAACTATGGGAACAAATCATATTGGGGAGGTAATTTGGAATTTCCTCCATCAACCATTGGCGTTGATGATCAAAAGCTTGAAGAGGGACCTTTAAGAAAAATCTTTTGCTACAATAGGAATCAGGAGAAAGAAGAGAAATGTTCTGATCCTTATATTGTGAAGGAACAGTGTGCTACTAATGATTCAAGTTGCACTAATATTGTAAGCAAGTTGTCTGTAGAACATGCAGATGACAGTTTTGCTGTTTCGTCCGATTTGGTTGAGCTTGTAGAACATGCAAGTGATACCAGGAATTTGGATCCTGGAACTTCACCTGATTCAGAAGTGATAAATTCAATTTTAGATATTCAAGTTGGAGCAGTTCGCCAGGAAAATTTGCAGGAATCAGTTTTGGCGTCCTCAGAAGATTTTGCTGCTTCTGGAAATGTTACCAGTAGTAAGAAAGGAAGAAAGAAGGAAAAGCCTTATCAGGCAGTTAGTTGTTCTCAAGAGGGGGGGACATGTGCTTCTGCTTGCAGTAATGGGTCCAAATCGTCAAAGAAGCATGGAACAAGACTGAATGTAGACAATCAGCTTGGTGCTGGTGAGACATTTACTTCTGTTGGTGAAAACGTTTTAAACGACTCTTTCACCTTTAAGGAATTGTCAACAGAGTCGACAGAGATTGAACATCCAGAAGAGGCTTTGAAAGTGGAGCGCATTCTCGATGTCAAAGAATGTTGCAAAACAGAGGCTTGCAGTGTGTTTCCTGAATCAGAGAATTTGAAAATGTTTCTTCCTTCTCAATCTGCTAGGAAGAAACATCCCAAAAGCTCAAAACCTATTAAAACTAGTGATTGCAAGCCCAAGGATCCTGGCTCAAAAAACAAAATAAAGAATGCTTCTAAAGAGAGGGTTTACCAACGGAAGTTTGTCAATAAGAGTAAAATCAAGAAGGATATATGTGAACAAGTTGTGACCGAAACTGAAAGTCACCAAATAGTAGGAAATTTTCTTGTAGACAAGCCTGCAAAAGTTGATGACATCACTGCATCCACAGTGGCAATAAATTTGAATGCGGTACAGGGTGTTGTGAATGAGCAGTATACACCTCCTCGCAATGCTTGGGTGCTCTGTGATGATTGTCAAAAATGGCGACGCATAGCAGCTTCTCTTGTTGATAGTTTAGGACATGCAAGTTGCACTTGGACTTGTAAGGATAATGTGGATAAAGCTTTTGCTGATTGCTTAATCCCGCAAGAGAAGTCAAATGCAGAGATTAATGCAGAGTTGGAAATATCAGATGAATCTGGGGAAGAAAATGCTTCTAATAAACGGCTTACTTATAGAGAATTTGAGAGTTTTCATCCATTAACAGTGACTGCAGTTCTTCAAGAGAACAAATTTGCTTCCATTAGCAGCAATCAGTTTTTGCATCGTAGTCGTAAAACTCAAAATATTGATGAGGTAATGGTTTGTCATTGCAAACCAACTTTGGATGGTCGGTTAGGTTGTGCAAATGAATGCTTGAACCGAATGCTCAATATTGAATGCGTCCGAGGTACATGTCCTTGTGGAGACCTTTGTTCTAATCAGCAGTTCCAGAAACGCAAGTATGCCAAATTACAGTGGTTGCGATGTGGAAAGAAAGGTTATGGGCTACAGTTGCTTGAAGATATCTCAAAAGGGAAGTTTCTCATTGAGTATGTTGGAGAGGTCCTTGACATGCATGCTTATGAATCACGTCAAAAGGAGTATGCATTTAATGGCCATAGACATTTTTACTTCATGACACTAAATGGCAGTGAGGTCATAGATGCATGTGGAAAGGGAAATTTGGGGC
GTTTCATTAACCACAGTTGTGATCCAAATTGCCGCACGGAAAAGTGGATGGTGAATGGAGAAATTTGCATTGGTCTTTTTGCACTAAGGGATATTAAGAAGGGTGAAGAAGTGACATTTGACTACAATTATGTAAGGGTATGTGGAGCTGCAGCCAAAAAATGTTATTGTGGGTCTTCCCAATGTCGAGGTTATATAGGTGGAGACCCTCACAATTCTGAGGTCATTATTCATAGTGATCCAGATGAAGAATTTCCAGAACCAGTGATGCTTCGTGCATATGGTAGAAGTTCAAATGGTAACTTGCCAACTGCAGCTAGTTCGGTGGATGGTGCTAAAATGCAACTCTCAGAGCGTATAAAAGGGGTTAGGAATAAGAAAGAACAACCTACTGGTATAGCTATCCAAATGAAGATTCTAGAAGAAAAAGAGGAACCCTTTCAGCTTTCTGCTTTGAAGATTTCAGAGGATCCTCCAAAGCTTTCCGCTTCAAAGATTTCTGAAGAACAAGAGGATCATCATAACCTTTCTGCCTTAATTATATCACCATTGCACAGTTCATTGGAATTTGAAGACTCGAAGGAGGCGAAGTTATCATTTGATGACATTGATGCTCGCAAGAACTCCGAGCTGGATGCTATTGAAGATAAGCAAGTGTATATAAATTCGCATCCTCAGATGAAAACTTCGCGTAAACAAGGTTCCATCAAGAAAGGAAAAGTTAGCTCTGTAGAAAAAGTAAAAATAACTAACAAGCCTCAGATTTTGTCTTTAAAGTCCAAGCGATTGTTTGAAGGTTCTCCGGGTAACCGCTTTGAGGCAGTCGAGGAGAAGCTTAATGAGCTCCTGGATGCTGAAGGGGGGATTAGCAAAAGAAAAGATGCTCCAAAAGGGTACTTAAAGCTTCTTCTCCTGACTGCTGCATCAGGTGCAAGTGCCAGTGGTGAAGCAATTCAGAGCAACCGTGATCTTTCAATGCTCCTTGATGCTCTTTTGAAGACAAAATCACGAGTAGTATTGACTGACATAATAAACAAAAATGGTTTACGGATGCTGCATAATATAATGAAACAATACAGAAGTGACTTCAAAAAGATACCTATTCTTCGGAAGCTTTTGAAGGTTTTAGAATATTTGGTAACAAGGGAGATACTCACCTCAGAGCATATTTATGGAAGTCCCCCATGCCCTGGAATGGAAAGTTTGAGGGAGTCTTTACTATCACTGACAGAGCATGATGACAAACAGGTACATCAAATTGCACGAGGTTTTCGAGACAGATGGTTCCCCAGACATAATAGAAAATTTGGTTACTCTGGGAGGGCGGATGGGAGATTGGAAGCTTACAGGGGTACAAACTGCAGTAGGTTTACAGCATCACCCAGTTATTGTCATGATCAGGATTTCAGACCCTCTGAAGCAATCGACTGTAATAAGCAGCCGTCAACACCAACATCTCTGTCAGATTTTCATCCTGCAGAGGTCTGTTCAGTGCCTTCTACAGCTGGTCACTCATTGGATGGGCAAAGAATTCGTAAGCGTAAGAGTCGATGGGATCAGCCTGCAGACACAAGCATAGATCTGAGATCAAAGGAGCAGAAGCTTGAATCAACATCGGTGCAGCGATTTGATTCCAGCCAATCAAATAGTGTTGGAGTGGCATCAATGTTGGTAGACAAGATAAACAGTGACAATATGGGCTCCTCCCTCTCTGGTTCTGTTGAAATTTGTTGTCGCCGAGATGAAGATATTCGGTTAGACAGTGCAGTGCATAACACCCCTGAAGATATTCCTCCTGGATTTTCATCTCCCTTCAACCTCCCCGTGGCTTCCTCAAGTCCTTTTTCAACCATTTTAGATCCTCCTCGACAGAGCATAGACAATTTGAGTTGTACTTTTTCTACAGTTGGGCACCCACAGGAAAGATTTATTTCTCGATTGCCGGTGTCCTATGGAATTCCGTTTTCTATTGTCGAGCAATGTGGAACATCCGATTCAGAA
AATCTGGAGTTTTGGTGTTGGGATGTTGCTCCTGGAGTGCCTTTTCATCCTTTTCCACCTTTGCCACCATATCCCCGAGGTAAGCGAGGCCCGTCAACGTCTGCCTGTGGTACTGCAGTTACACAATCTGATCGAGAAAGGCAGGTCAAGAGCCATGATTCTCAAACTTCTTTCTCAGAAGAAAGCGCTCCTAGTACAAGTACTAATTACCAACAGGGTTTGCGCATTCTATCAAACAACCAACAGACATTGAAACAGGCTGAGGAATCATCATATAATCTGGGGAGAAGGTACTTTAGGCAGCAAAAGTGGCGCAATACAAAGTTTAGGCCCTCTTGGTCACAAAGAAAGAGTCAATGGAGATACCAGGGGAACTTCAGGGGTGAGGTGAGCACTATGAGTGATGACAGTACACCAAACCAAAGGGATAAGCCCATATTGCTCGGATGAGGCAAGTGTTAGAGTGGACAAAGCTAACCAAAATCAGAGTTGATTCTCAGGAAACATCATGAACTGCTGAAATTTCAATAGAACCCAATGTTTTTTCCTAGGTTTGATAATCAAAGGAAAGTTTGTTCAACTTCAGTACTTACTCCAGCTTTAGTAGTACAATGCATGAGGTTGATCTTTGCTTCCATCAGCATATGAGGTTTTGGATCTTTATCCATCAATCGGCATCTCACAGAGGTAGGCTTTCTTATTTGTATATCTTTTTCTCACAAATATAAAATTGTGACACGCAACGTTCAGAGATCTGGGGAAAATAAAATTTTCATTGATTCTTGACCTCATTTGCCTAAGATTATTGGAGATTCTAAACATTTCTTTTTCATTTGTCCATAAGCTAAACTATCATAGATTGACCTAGTAATTAAATACCATTGTAAATTATGAAGTGTTTAGAAATTATGGGTTCAATATATAATGGCTACCTATCGAGGAATTGATTTCCCACAAATTACCTTGACTATCAAAGATAGAGGATCAGGCTTGTCATGTAAAAATAGTCGATATGCGTGTAAGCTAGTCTGAATACCGATATCAACAACAAGAAAAAAGGTTTACCTTTCAAGTACTTGGC

Coding sequence (CDS)

ATGGCTTCATCTCCGTCTAGTTCAAGGATGTGTGAGCCAGATAGGGAATTGGGAGTGATTACTACCACTATCTGCGCAAATGTGTCGGAGGTAGCTGCTGCAGGGGAGGATTGCACATTCAGAGGCCTCGTACGTGCCGATACTTTATCACTGGATGAACGGTTCAATAGTGATTCCGGAGATATCGGTCTCGGTCTAAACGAAGAGGATGAGTCTTACAACTTGGGGAATGAAACATTCAGCTTGGATATGGAAGAGCCTCAAGATGATGAGGGTTTGGTTGATATTTTGGGCTGTAAAACTACCATGGAAATGATGTCTTTAACTGGGTCATTAGTAAACTCTGTTGAACTCAATGATGATAGTTTTGTAGTTGATACCGTTGCAAAGGTTGTAAGAGATGACTTAAAATCTCCTTCTCACATCTGCGAGATTGTTTCTAATTCAGCTTCTGCTGATGGATTGCCAAGTGACTTCATACAACTAAACGAGCTGGATAATGATGGTGGTTGTTCATTTTCAGATGTTATGGATAACAGGATAAGTGATGGTTCCATTGTCACAGAAGCAGAAATGTTGAATGAGATGTCCCCTTTACGTAGTGGTCAAATACTATCTGTACATGTGGGACAATTAGTTGCCAATTGTGACCAGTATATTTGCGAGATGGATGGGAAAAACTTAAGTGGTACCTCTGGTGAAACAATTAATGAAGTTACTGATATGAACGACATTCATGAATCATGCTTGCAAATGTTGCCTTCACATGGCTGCAAAAAAAGGGAAAGGTTGCAATCTAATAGTTCACCGCTTACCATCCATGCTTTAGAGAATGATCAATGTGGCGAAATGAGTAATTCCTCACCCAAGTACATTTTAGAGGTTGTTGAAGATGTTGATGTCTCGACTACTAATAATAATGCTGATGCTGAACAGTATGTAGATCCCAAGATTAAAAATAACTATAATCTGGAGGAAGCTACTATTCAATTGAGCCTCAACTGTGTAGAGCTACTTGCATCGCCCTTACCTTCTCAATTTCCTAATTGTGAAAGAGATGAATTTCATGAGATGTTGAATGTAGCAGATATTCCAATAAAGGATATTAGTTCTGTCAACTCATGCAGCATCGGTGATCAAGACAACAATGACGTAGAGAAAGTTGGCTGTGTTTCTGAAGTTAAATGTCTTGAAACAGTTATCCCATTCTCCAAGAGGAGTGGTCGAAGAAAAACATCAAGCCAAAAAACTGTAACAAAAAGAGCACCCAGGAAAAGCAGGAAAAAAGTGCCAAATGCACTGATTTTTGACACTGCAAGAAGGAGAAGAAGCTCTATATCTAGACCTGCTCGTCCTTCACTCTGGGGATCACTGGACTATATTATTCAGTCGTTTGAAAACAATGAAGATGTTTGGGTAAATCAAAGCCAGAAGCAAGGAAATAAGAAACCTAAAGGTAATCGGGGAGGTACCAAGCTGAATAATAAACAACCAAGTGAAAGTTCACATAGATCACGAAAAGGGACCCAAGTAAATTCTGCTACTTCAACTTCAACCAACCGTATTCGTTTGAAGGTTAAATTAGGAAAGAATATGGGTCATAATTTTCTGAACATTGTGGTTCCGGAGGTTGTTGGTTCATCATTGTCTGCCAAGGGTATCAATAGCAACTATGGGAACAAATCATATTGGGGAGGTAATTTGGAATTTCCTCCATCAACCATTGGCGTTGATGATCAAAAGCTTGAAGAGGGACCTTTAAGAAAAATCTTTTGCTACAATAGGAATCAGGAGAAAGAAGAGAAATGTTCTGATCCTTATATTGTGAAGGAACAGTGTGCTACTAATGATTCAAGTTGCACTAATATTGTAAGCAAGTTGTCTGTAGAACATGCAGATGACAGTTTTGCTGTTTCGTCCGATTTGGTTGAGCTTGTAGAACATGCAAGTGATACCAGGAATTTGGATCCTGGAACTTCACCTGATTCAGAAGTGATAAA
TTCAATTTTAGATATTCAAGTTGGAGCAGTTCGCCAGGAAAATTTGCAGGAATCAGTTTTGGCGTCCTCAGAAGATTTTGCTGCTTCTGGAAATGTTACCAGTAGTAAGAAAGGAAGAAAGAAGGAAAAGCCTTATCAGGCAGTTAGTTGTTCTCAAGAGGGGGGGACATGTGCTTCTGCTTGCAGTAATGGGTCCAAATCGTCAAAGAAGCATGGAACAAGACTGAATGTAGACAATCAGCTTGGTGCTGGTGAGACATTTACTTCTGTTGGTGAAAACGTTTTAAACGACTCTTTCACCTTTAAGGAATTGTCAACAGAGTCGACAGAGATTGAACATCCAGAAGAGGCTTTGAAAGTGGAGCGCATTCTCGATGTCAAAGAATGTTGCAAAACAGAGGCTTGCAGTGTGTTTCCTGAATCAGAGAATTTGAAAATGTTTCTTCCTTCTCAATCTGCTAGGAAGAAACATCCCAAAAGCTCAAAACCTATTAAAACTAGTGATTGCAAGCCCAAGGATCCTGGCTCAAAAAACAAAATAAAGAATGCTTCTAAAGAGAGGGTTTACCAACGGAAGTTTGTCAATAAGAGTAAAATCAAGAAGGATATATGTGAACAAGTTGTGACCGAAACTGAAAGTCACCAAATAGTAGGAAATTTTCTTGTAGACAAGCCTGCAAAAGTTGATGACATCACTGCATCCACAGTGGCAATAAATTTGAATGCGGTACAGGGTGTTGTGAATGAGCAGTATACACCTCCTCGCAATGCTTGGGTGCTCTGTGATGATTGTCAAAAATGGCGACGCATAGCAGCTTCTCTTGTTGATAGTTTAGGACATGCAAGTTGCACTTGGACTTGTAAGGATAATGTGGATAAAGCTTTTGCTGATTGCTTAATCCCGCAAGAGAAGTCAAATGCAGAGATTAATGCAGAGTTGGAAATATCAGATGAATCTGGGGAAGAAAATGCTTCTAATAAACGGCTTACTTATAGAGAATTTGAGAGTTTTCATCCATTAACAGTGACTGCAGTTCTTCAAGAGAACAAATTTGCTTCCATTAGCAGCAATCAGTTTTTGCATCGTAGTCGTAAAACTCAAAATATTGATGAGGTAATGGTTTGTCATTGCAAACCAACTTTGGATGGTCGGTTAGGTTGTGCAAATGAATGCTTGAACCGAATGCTCAATATTGAATGCGTCCGAGGTACATGTCCTTGTGGAGACCTTTGTTCTAATCAGCAGTTCCAGAAACGCAAGTATGCCAAATTACAGTGGTTGCGATGTGGAAAGAAAGGTTATGGGCTACAGTTGCTTGAAGATATCTCAAAAGGGAAGTTTCTCATTGAGTATGTTGGAGAGGTCCTTGACATGCATGCTTATGAATCACGTCAAAAGGAGTATGCATTTAATGGCCATAGACATTTTTACTTCATGACACTAAATGGCAGTGAGGTCATAGATGCATGTGGAAAGGGAAATTTGGGGCGTTTCATTAACCACAGTTGTGATCCAAATTGCCGCACGGAAAAGTGGATGGTGAATGGAGAAATTTGCATTGGTCTTTTTGCACTAAGGGATATTAAGAAGGGTGAAGAAGTGACATTTGACTACAATTATGTAAGGGTATGTGGAGCTGCAGCCAAAAAATGTTATTGTGGGTCTTCCCAATGTCGAGGTTATATAGGTGGAGACCCTCACAATTCTGAGGTCATTATTCATAGTGATCCAGATGAAGAATTTCCAGAACCAGTGATGCTTCGTGCATATGGTAGAAGTTCAAATGGTAACTTGCCAACTGCAGCTAGTTCGGTGGATGGTGCTAAAATGCAACTCTCAGAGCGTATAAAAGGGGTTAGGAATAAGAAAGAACAACCTACTGGTATAGCTATCCAAATGAAGATTCTAGAAGAAAAAGAGGAACCCTTTCAGCTTTCTGCTTTGAAGATTTCAGAGGATCCTCCAAAGCTTTCCGCTTCAAAGATTTCTGAAGAACAAG
AGGATCATCATAACCTTTCTGCCTTAATTATATCACCATTGCACAGTTCATTGGAATTTGAAGACTCGAAGGAGGCGAAGTTATCATTTGATGACATTGATGCTCGCAAGAACTCCGAGCTGGATGCTATTGAAGATAAGCAAGTGTATATAAATTCGCATCCTCAGATGAAAACTTCGCGTAAACAAGGTTCCATCAAGAAAGGAAAAGTTAGCTCTGTAGAAAAAGTAAAAATAACTAACAAGCCTCAGATTTTGTCTTTAAAGTCCAAGCGATTGTTTGAAGGTTCTCCGGGTAACCGCTTTGAGGCAGTCGAGGAGAAGCTTAATGAGCTCCTGGATGCTGAAGGGGGGATTAGCAAAAGAAAAGATGCTCCAAAAGGGTACTTAAAGCTTCTTCTCCTGACTGCTGCATCAGGTGCAAGTGCCAGTGGTGAAGCAATTCAGAGCAACCGTGATCTTTCAATGCTCCTTGATGCTCTTTTGAAGACAAAATCACGAGTAGTATTGACTGACATAATAAACAAAAATGGTTTACGGATGCTGCATAATATAATGAAACAATACAGAAGTGACTTCAAAAAGATACCTATTCTTCGGAAGCTTTTGAAGGTTTTAGAATATTTGGTAACAAGGGAGATACTCACCTCAGAGCATATTTATGGAAGTCCCCCATGCCCTGGAATGGAAAGTTTGAGGGAGTCTTTACTATCACTGACAGAGCATGATGACAAACAGGTACATCAAATTGCACGAGGTTTTCGAGACAGATGGTTCCCCAGACATAATAGAAAATTTGGTTACTCTGGGAGGGCGGATGGGAGATTGGAAGCTTACAGGGGTACAAACTGCAGTAGGTTTACAGCATCACCCAGTTATTGTCATGATCAGGATTTCAGACCCTCTGAAGCAATCGACTGTAATAAGCAGCCGTCAACACCAACATCTCTGTCAGATTTTCATCCTGCAGAGGTCTGTTCAGTGCCTTCTACAGCTGGTCACTCATTGGATGGGCAAAGAATTCGTAAGCGTAAGAGTCGATGGGATCAGCCTGCAGACACAAGCATAGATCTGAGATCAAAGGAGCAGAAGCTTGAATCAACATCGGTGCAGCGATTTGATTCCAGCCAATCAAATAGTGTTGGAGTGGCATCAATGTTGGTAGACAAGATAAACAGTGACAATATGGGCTCCTCCCTCTCTGGTTCTGTTGAAATTTGTTGTCGCCGAGATGAAGATATTCGGTTAGACAGTGCAGTGCATAACACCCCTGAAGATATTCCTCCTGGATTTTCATCTCCCTTCAACCTCCCCGTGGCTTCCTCAAGTCCTTTTTCAACCATTTTAGATCCTCCTCGACAGAGCATAGACAATTTGAGTTGTACTTTTTCTACAGTTGGGCACCCACAGGAAAGATTTATTTCTCGATTGCCGGTGTCCTATGGAATTCCGTTTTCTATTGTCGAGCAATGTGGAACATCCGATTCAGAAAATCTGGAGTTTTGGTGTTGGGATGTTGCTCCTGGAGTGCCTTTTCATCCTTTTCCACCTTTGCCACCATATCCCCGAGGTAAGCGAGGCCCGTCAACGTCTGCCTGTGGTACTGCAGTTACACAATCTGATCGAGAAAGGCAGGTCAAGAGCCATGATTCTCAAACTTCTTTCTCAGAAGAAAGCGCTCCTAGTACAAGTACTAATTACCAACAGGGTTTGCGCATTCTATCAAACAACCAACAGACATTGAAACAGGCTGAGGAATCATCATATAATCTGGGGAGAAGGTACTTTAGGCAGCAAAAGTGGCGCAATACAAAGTTTAGGCCCTCTTGGTCACAAAGAAAGAGTCAATGGAGATACCAGGGGAACTTCAGGGGTGAGGTGAGCACTATGAGTGATGACAGTACACCAAACCAAAGGGATAAGCCCATATTGCTCGGATGA

Protein sequence

MASSPSSSRMCEPDRELGVITTTICANVSEVAAAGEDCTFRGLVRADTLSLDERFNSDSGDIGLGLNEEDESYNLGNETFSLDMEEPQDDEGLVDILGCKTTMEMMSLTGSLVNSVELNDDSFVVDTVAKVVRDDLKSPSHICEIVSNSASADGLPSDFIQLNELDNDGGCSFSDVMDNRISDGSIVTEAEMLNEMSPLRSGQILSVHVGQLVANCDQYICEMDGKNLSGTSGETINEVTDMNDIHESCLQMLPSHGCKKRERLQSNSSPLTIHALENDQCGEMSNSSPKYILEVVEDVDVSTTNNNADAEQYVDPKIKNNYNLEEATIQLSLNCVELLASPLPSQFPNCERDEFHEMLNVADIPIKDISSVNSCSIGDQDNNDVEKVGCVSEVKCLETVIPFSKRSGRRKTSSQKTVTKRAPRKSRKKVPNALIFDTARRRRSSISRPARPSLWGSLDYIIQSFENNEDVWVNQSQKQGNKKPKGNRGGTKLNNKQPSESSHRSRKGTQVNSATSTSTNRIRLKVKLGKNMGHNFLNIVVPEVVGSSLSAKGINSNYGNKSYWGGNLEFPPSTIGVDDQKLEEGPLRKIFCYNRNQEKEEKCSDPYIVKEQCATNDSSCTNIVSKLSVEHADDSFAVSSDLVELVEHASDTRNLDPGTSPDSEVINSILDIQVGAVRQENLQESVLASSEDFAASGNVTSSKKGRKKEKPYQAVSCSQEGGTCASACSNGSKSSKKHGTRLNVDNQLGAGETFTSVGENVLNDSFTFKELSTESTEIEHPEEALKVERILDVKECCKTEACSVFPESENLKMFLPSQSARKKHPKSSKPIKTSDCKPKDPGSKNKIKNASKERVYQRKFVNKSKIKKDICEQVVTETESHQIVGNFLVDKPAKVDDITASTVAINLNAVQGVVNEQYTPPRNAWVLCDDCQKWRRIAASLVDSLGHASCTWTCKDNVDKAFADCLIPQEKSNAEINAELEISDESGEENASNKRLTYREFESFHPLTVTAVLQENKFASISSNQFLHRSRKTQNIDEVMVCHCKPTLDGRLGCANECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGKFLIEYVGEVLDMHAYESRQKEYAFNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVCGAAAKKCYCGSSQCRGYIGGDPHNSEVIIHSDPDEEFPEPVMLRAYGRSSNGNLPTAASSVDGAKMQLSERIKGVRNKKEQPTGIAIQMKILEEKEEPFQLSALKISEDPPKLSASKISEEQEDHHNLSALIISPLHSSLEFEDSKEAKLSFDDIDARKNSELDAIEDKQVYINSHPQMKTSRKQGSIKKGKVSSVEKVKITNKPQILSLKSKRLFEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMLLDALLKTKSRVVLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTREILTSEHIYGSPPCPGMESLRESLLSLTEHDDKQVHQIARGFRDRWFPRHNRKFGYSGRADGRLEAYRGTNCSRFTASPSYCHDQDFRPSEAIDCNKQPSTPTSLSDFHPAEVCSVPSTAGHSLDGQRIRKRKSRWDQPADTSIDLRSKEQKLESTSVQRFDSSQSNSVGVASMLVDKINSDNMGSSLSGSVEICCRRDEDIRLDSAVHNTPEDIPPGFSSPFNLPVASSSPFSTILDPPRQSIDNLSCTFSTVGHPQERFISRLPVSYGIPFSIVEQCGTSDSENLEFWCWDVAPGVPFHPFPPLPPYPRGKRGPSTSACGTAVTQSDRERQVKSHDSQTSFSEESAPSTSTNYQQGLRILSNNQQTLKQAEESSYNLGRRYFRQQKWRNTKFRPSWSQRKSQWRYQGNFRGEVSTMSDDSTPNQRDKPILLG
Homology
BLAST of Sed0009206 vs. NCBI nr
Match: XP_038885001.1 (histone-lysine N-methyltransferase ASHH2 isoform X3 [Benincasa hispida])

HSP 1 Score: 2901.7 bits (7521), Expect = 0.0e+00
Identity = 1536/2008 (76.49%), Positives = 1682/2008 (83.76%), Query Frame = 0

Query: 1    MASSPSSS---RMCEPDRELGVITTTICANVSEVAAAGEDCTFRGLVRADTLSLDERFNS 60
            MAS PS S   +M EPDR L V TT +C N SE   AGED TFRG   ADTL +D+R + 
Sbjct: 38   MASFPSDSSDGQMFEPDRGLEVTTTCVCTNASESGTAGEDGTFRGFEHADTLLMDKRLDG 97

Query: 61   DSGDIGLGLNEEDESYNLGNETFSLDMEEPQDDEGLVDILGCKTTMEMMSLTGSLVNSV- 120
            DSGD G  LNE+ E+ N GN T SLDM+E QD +GLVDILGCKTTMEMMSL GSLV+SV 
Sbjct: 98   DSGDSGPCLNEDKEACNGGNRTLSLDMKESQDVDGLVDILGCKTTMEMMSLNGSLVDSVK 157

Query: 121  --ELNDDSFVVDTVAKVVR-----------------DDLKSPSHICEIVSNSASADGLPS 180
              +L+++S ++D  AKV R                 D+LKSP ++CEIVSNSASADGLPS
Sbjct: 158  PEDLDNNSCIIDAPAKVERDNTVANGPVLARMGTCTDNLKSP-YVCEIVSNSASADGLPS 217

Query: 181  DFIQLNELDNDG-GCSFSDVMDNRISDGSIVTEAEMLNEMSPLRSGQILSVHVGQLVANC 240
            DFIQ NEL+NDG GCSFS+  D RI++ S+  EA++LNEMSPL+SGQIL  ++   VAN 
Sbjct: 218  DFIQQNELENDGAGCSFSETAD-RITEASVEIEADVLNEMSPLQSGQILPTYMELSVANF 277

Query: 241  DQYICEMDGKNLSGTSGETINEVTDMNDIHESCLQMLPSHGCKK-RERLQSNSSPLTIHA 300
            DQY+C+M+GK+LSGTSGET+ EV  MN   E CLQMLPS  C++  E LQS+ SPLTI A
Sbjct: 278  DQYVCQMEGKSLSGTSGETVIEVAAMNSNPEVCLQMLPSQECERIGECLQSDGSPLTIDA 337

Query: 301  LENDQCGE-MSNSSPKYILEVVEDVDVSTTNNNADAEQYVDPKIKNNYNLEEATIQLSLN 360
             END C E   N+S KYI EVVED     TNNN+D  Q++ P I+N+ NLEE TIQ++ N
Sbjct: 338  SENDWCDEKRDNNSSKYITEVVEDDIDVLTNNNSDGGQHIVPGIENDRNLEEGTIQVNHN 397

Query: 361  CVELLASPLPSQFPNCERDEFHEMLNVADIPIKDISSVNSCSIGDQDNNDVEKVGCVSEV 420
            CVELLASPL SQ PN E+DEF+ MLN AD PIKDISSVNSCS+GDQD+ND+EKVGCVSEV
Sbjct: 398  CVELLASPLLSQPPNSEKDEFYGMLNGADFPIKDISSVNSCSVGDQDHNDIEKVGCVSEV 457

Query: 421  KCLETVIPFSKRSGRRKTSSQKTVTKRAPRKSRKKVPNALIFDTARRRRSSISRPARPSL 480
            KC ETVI  SKRSG+R+TS+QK VTKRA RKS+KKVP  LIFDTARRRRSS+SRPARPS 
Sbjct: 458  KCPETVITSSKRSGQRRTSNQKAVTKRASRKSKKKVPEPLIFDTARRRRSSLSRPARPSP 517

Query: 481  WGSLDYIIQSFENNEDVWVNQSQKQGNKKPKGNRGGTKLNNKQPSESSHRSRKGTQVNSA 540
            WGSL YIIQSFE  +DV +NQSQKQGN K K N+GG K N K+P ESSHRSRKGTQ   A
Sbjct: 518  WGSLGYIIQSFEEIDDVLINQSQKQGNDKSKSNQGGIKRNKKKPKESSHRSRKGTQGKCA 577

Query: 541  TSTSTNRIRLKVKLGKNMGHNFLNIVVPEVVGSSLSAKGINSNYGNKSYWGGNLEFPPST 600
            TSTSTNRIRLKVKLGKN+GHNFLNIVVPE+V SSLSAKGIN NYGN+SYW GNLEFPPST
Sbjct: 578  TSTSTNRIRLKVKLGKNVGHNFLNIVVPEIVDSSLSAKGINCNYGNESYWEGNLEFPPST 637

Query: 601  IGVDDQKLEEGPLRKIFCYNRNQEKEEKCSDPYIVKEQCATNDSSCTNIVSKLSVEHADD 660
            +GVDDQK EEGPL+KIFCY+RNQ+KEEKC D  +V EQCA NDSSCT  + K S +HADD
Sbjct: 638  LGVDDQKPEEGPLKKIFCYSRNQDKEEKCPDASVVNEQCANNDSSCTINIDKSSAKHADD 697

Query: 661  SFAVSSDLVELVEHASDTRNLDPGTSPDSEVINSILDIQVGAVRQENLQESVLASSEDFA 720
            +  VS  LVE VE  SDTRN DPGTSPDSEVINSILDI VGA+R+E LQ+SVLAS EDF+
Sbjct: 698  NLCVSPHLVEPVERVSDTRNSDPGTSPDSEVINSILDIPVGAMRREILQDSVLASLEDFS 757

Query: 721  ASGNVTSSKKGRKKEKPYQAVSCSQEGGTCASACSNGSKSSKKHGTRLNVDNQLGAGETF 780
            ASGN  S+ KGRKKEKP QAVSCS+EGGT ASACSN SKSSKKHG R NVDNQ G+GETF
Sbjct: 758  ASGNAVST-KGRKKEKPCQAVSCSEEGGTGASACSNRSKSSKKHGRRRNVDNQHGSGETF 817

Query: 781  TSVGENVLNDSFTFKELSTES----TEIEHPEEALKVERILDVKECCKTEACSVFPESEN 840
            T    N+LN + T KELS E     TEIE PEE LK + IL  KECC+T+  SVFPESEN
Sbjct: 818  TYTDANILNYALTVKELSMEQVPLLTEIELPEEVLKADNILKDKECCRTDVGSVFPESEN 877

Query: 841  LKMFLPSQSARKKHPKSSKPIKTSDCKPKDPGSKNKIKNASKERVYQRKFVNKSKIKKDI 900
             K FLPSQSA+KKHPK SK IKTS  K K PGSKNKIKNASKERVYQRK  NKSKIK+D+
Sbjct: 878  SKTFLPSQSAKKKHPKGSKSIKTSKDKLKAPGSKNKIKNASKERVYQRKSFNKSKIKEDL 937

Query: 901  CEQVVTETESHQIVGNFLVDKPAKVDDITASTVAINLNAVQGVVNEQYTPPRNAWVLCDD 960
            C++VVTE  SHQI+GN  VDK  K DDI ASTVA+NL+ VQG  NEQY PPRNAWVLCDD
Sbjct: 938  CDRVVTEMGSHQILGNCFVDKHEKSDDIIASTVAVNLSVVQGATNEQYMPPRNAWVLCDD 997

Query: 961  CQKWRRIAASLVDSLGHASCTWTCKDNVDKAFADCLIPQEKSNAEINAELEISDESGEEN 1020
            C KWRRI ASLVDSLGHASCTWTCKDNVDKAFA C IPQEKSNAEINAELEISDESGEEN
Sbjct: 998  CHKWRRIPASLVDSLGHASCTWTCKDNVDKAFAHCSIPQEKSNAEINAELEISDESGEEN 1057

Query: 1021 ASNKRLTYREFESFHPLTVTAVLQENKFASISSNQFLHRSRKTQNIDEVMVCHCKPTLDG 1080
            ASNKRLTYRE ESFHP TVTAV QENKFASISSNQFLHRSRKTQ IDE+MVCHCKP LDG
Sbjct: 1058 ASNKRLTYRELESFHPTTVTAVPQENKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDG 1117

Query: 1081 RLGCANECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDIS 1140
            RLGC +ECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDIS
Sbjct: 1118 RLGCGDECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDIS 1177

Query: 1141 KGKFLIEYVGEVLDMHAYESRQKEYAFNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSC 1200
            KG+FLIEYVGEVLDMHAYE+RQKEYA NGHRHFYFMTLNGSEVIDAC KGNLGRFINHSC
Sbjct: 1178 KGQFLIEYVGEVLDMHAYEARQKEYALNGHRHFYFMTLNGSEVIDACRKGNLGRFINHSC 1237

Query: 1201 DPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVCGAAAKKCYCGSSQCRGYIG 1260
            DPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRV GAAAKKCYCGS QCRGYIG
Sbjct: 1238 DPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVFGAAAKKCYCGSFQCRGYIG 1297

Query: 1261 GDPHNSEVIIHSDPDEEFPEPVMLRAYGRSSNGNLPTAASSVDGAKMQLSERIKGVRNKK 1320
            GDP NSEVII SD DEEFPEPVMLRA GRS N N+PTA SS+D AKMQ S  IKG+R+K+
Sbjct: 1298 GDPLNSEVIIQSDSDEEFPEPVMLRADGRSWNNNVPTAVSSLDVAKMQPSGHIKGIRDKR 1357

Query: 1321 EQPTGIAIQMKILEEKEEPFQLSALKIS---EDPPKLSASKISEEQEDHHNLSALIISPL 1380
            +QP  IAI+ KI EEK +  +LS  KIS   ED   LSASKISEE+E+H NLSA  ISPL
Sbjct: 1358 DQPIRIAIESKISEEKVDTLKLSVSKISEEKEDSLNLSASKISEEKEEHLNLSASTISPL 1417

Query: 1381 HSSLEFEDSKEAKLSFDDIDARKNSELDAIEDKQVYINSHPQMKTSRKQGSIKKGKVSSV 1440
            HSSLEFEDSKEAKLS DDID RK S+LDAIEDKQVYI SHPQMKTSRK GSIKKGKVSSV
Sbjct: 1418 HSSLEFEDSKEAKLSVDDIDGRKKSKLDAIEDKQVYIKSHPQMKTSRKPGSIKKGKVSSV 1477

Query: 1441 EKVKITNKPQILSLKSKRLFEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLL 1500
            EK++ITN+ QI S+K KRL EGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLL
Sbjct: 1478 EKIQITNRSQISSVKPKRLIEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLL 1537

Query: 1501 LTAASGASASGEAIQSNRDLSMLLDALLKTKSRVVLTDIINKNGLRMLHNIMKQYRSDFK 1560
            LTAASGASASGEAIQSNRDLSM+LDALLKTKSRVVLTDIINKNGLRMLHNIMKQYRSDFK
Sbjct: 1538 LTAASGASASGEAIQSNRDLSMILDALLKTKSRVVLTDIINKNGLRMLHNIMKQYRSDFK 1597

Query: 1561 KIPILRKLLKVLEYLVTREILTSEHIYGSPPCPGMESLRESLLSLTEHDDKQVHQIARGF 1620
            KIPILRKLLKVLEYLVTREILTSEHI G PPCPGMESLRESLLSLTEHDDKQVHQIAR F
Sbjct: 1598 KIPILRKLLKVLEYLVTREILTSEHINGGPPCPGMESLRESLLSLTEHDDKQVHQIARSF 1657

Query: 1621 RDRWFPRHNRKFGYSGRADGRLEAYRGTNCSRFTASPSYCHDQDFRPSEAIDCNKQPSTP 1680
            RDRWFPRH+RKFGYS R DGRLE YRG+NCSRFTAS SY HDQD RP++AIDC KQ S P
Sbjct: 1658 RDRWFPRHSRKFGYSEREDGRLEVYRGSNCSRFTASHSYRHDQDSRPTDAIDCVKQ-SMP 1717

Query: 1681 TSLSDFHPAEVCSVPSTAGHSLDGQRIRKRKSRWDQPADTSIDLRSKEQKLESTSVQRFD 1740
            TSL D HP EVCSV STAGHS +GQ++RKRKSRWDQPADTS+DLRSKEQKLESTSVQ+ +
Sbjct: 1718 TSLPDAHPVEVCSVASTAGHSSNGQKVRKRKSRWDQPADTSLDLRSKEQKLESTSVQQLN 1777

Query: 1741 SSQSNSVGVASMLVDKINSDNMGSSLSGSVEICCRRDEDIRLDSAVHNTPEDIPPGFSSP 1800
            SSQ N VG+ASML+DK+N+D+  SSLS SV + CR+DEDIR DSAV N PEDIPPGFSSP
Sbjct: 1778 SSQLNCVGMASMLIDKVNNDDKDSSLSDSVGVRCRQDEDIRADSAVQNVPEDIPPGFSSP 1837

Query: 1801 FNLPVASSSPFSTILDPPRQSIDNLSCTFSTVGHPQERFISRLPVSYGIPFSIVEQCGTS 1860
            FN PVASSS FST+LDPPRQ+I +L C FSTVGHPQERFISR+PVSYGIPFSI+EQCGTS
Sbjct: 1838 FNPPVASSSAFSTVLDPPRQNICDLGCAFSTVGHPQERFISRMPVSYGIPFSIIEQCGTS 1897

Query: 1861 DSENLEFWCWDVAPGVPFHPFPPLPPYPRGKRGPSTSACGTAVTQSDRERQVKSHDSQTS 1920
             +ENLE  CWDVAPGVPFHPFPPLPPYPRGKRGP TSACGTAV QS +E QV SHDS+TS
Sbjct: 1898 HAENLE--CWDVAPGVPFHPFPPLPPYPRGKRGPPTSACGTAVGQSSQEGQVNSHDSRTS 1957

Query: 1921 FSEESAPSTSTNYQQGLRILSNNQQTLKQAEESSYNLGRRYFRQQKWRNTKFRPSWSQRK 1976
            FSEES PSTSTNYQ  L   SNNQQ   + +ESSY+LGRRYFRQQKWRNTK+ P W Q++
Sbjct: 1958 FSEESPPSTSTNYQPDLCTSSNNQQIPNRTKESSYDLGRRYFRQQKWRNTKYGPPWLQKR 2017

BLAST of Sed0009206 vs. NCBI nr
Match: XP_038884997.1 (histone-lysine N-methyltransferase ASHH2 isoform X1 [Benincasa hispida] >XP_038884998.1 histone-lysine N-methyltransferase ASHH2 isoform X1 [Benincasa hispida] >XP_038884999.1 histone-lysine N-methyltransferase ASHH2 isoform X1 [Benincasa hispida])

HSP 1 Score: 2880.1 bits (7465), Expect = 0.0e+00
Identity = 1536/2053 (74.82%), Positives = 1682/2053 (81.93%), Query Frame = 0

Query: 1    MASSPSSS---RMCEPDRELGVITTTICANVSEVAAAGEDCTFRGLVRADTLSLDERFNS 60
            MAS PS S   +M EPDR L V TT +C N SE   AGED TFRG   ADTL +D+R + 
Sbjct: 38   MASFPSDSSDGQMFEPDRGLEVTTTCVCTNASESGTAGEDGTFRGFEHADTLLMDKRLDG 97

Query: 61   DSGDIGLGLNEEDESYNLGNETFSLDMEEPQDDEGLVDILGCKTTMEMMSLTGSLVNSV- 120
            DSGD G  LNE+ E+ N GN T SLDM+E QD +GLVDILGCKTTMEMMSL GSLV+SV 
Sbjct: 98   DSGDSGPCLNEDKEACNGGNRTLSLDMKESQDVDGLVDILGCKTTMEMMSLNGSLVDSVK 157

Query: 121  --ELNDDSFVVDTVAKVVR-----------------DDLKSPSHICEIVSNSASADGLPS 180
              +L+++S ++D  AKV R                 D+LKSP ++CEIVSNSASADGLPS
Sbjct: 158  PEDLDNNSCIIDAPAKVERDNTVANGPVLARMGTCTDNLKSP-YVCEIVSNSASADGLPS 217

Query: 181  DFIQLNELDNDG-GCSFSDVMDNRISDGSIVTEAEMLNEMSPLRSGQILSVHVGQLVANC 240
            DFIQ NEL+NDG GCSFS+  D RI++ S+  EA++LNEMSPL+SGQIL  ++   VAN 
Sbjct: 218  DFIQQNELENDGAGCSFSETAD-RITEASVEIEADVLNEMSPLQSGQILPTYMELSVANF 277

Query: 241  DQYICEMDGKNLSGTSGETINEVTDMNDIHESCLQMLPSHGCKK-RERLQSNSSPLTIHA 300
            DQY+C+M+GK+LSGTSGET+ EV  MN   E CLQMLPS  C++  E LQS+ SPLTI A
Sbjct: 278  DQYVCQMEGKSLSGTSGETVIEVAAMNSNPEVCLQMLPSQECERIGECLQSDGSPLTIDA 337

Query: 301  LENDQCGE-MSNSSPKYILEVVEDVDVSTTNNNADAEQYVDPKIKNNYNLEEATIQLSLN 360
             END C E   N+S KYI EVVED     TNNN+D  Q++ P I+N+ NLEE TIQ++ N
Sbjct: 338  SENDWCDEKRDNNSSKYITEVVEDDIDVLTNNNSDGGQHIVPGIENDRNLEEGTIQVNHN 397

Query: 361  CVELLASPLPSQFPNCERDEFHEMLNVADIPIKDISSVNSCSIGDQDNNDVEKVGCVSEV 420
            CVELLASPL SQ PN E+DEF+ MLN AD PIKDISSVNSCS+GDQD+ND+EKVGCVSEV
Sbjct: 398  CVELLASPLLSQPPNSEKDEFYGMLNGADFPIKDISSVNSCSVGDQDHNDIEKVGCVSEV 457

Query: 421  KCLETVIPFSKRSGRRKTSSQKTVTKRAPRKSRKKVPNALIFDTARRRRSSISRPARPSL 480
            KC ETVI  SKRSG+R+TS+QK VTKRA RKS+KKVP  LIFDTARRRRSS+SRPARPS 
Sbjct: 458  KCPETVITSSKRSGQRRTSNQKAVTKRASRKSKKKVPEPLIFDTARRRRSSLSRPARPSP 517

Query: 481  WGSLDYIIQSFENNEDVWVNQSQKQGNKKPKGNRGGTKLNNKQPSESSHRSRKGTQVNSA 540
            WGSL YIIQSFE  +DV +NQSQKQGN K K N+GG K N K+P ESSHRSRKGTQ   A
Sbjct: 518  WGSLGYIIQSFEEIDDVLINQSQKQGNDKSKSNQGGIKRNKKKPKESSHRSRKGTQGKCA 577

Query: 541  TSTSTNRIRLKVKLGKNMGHNFLNIVVPEVVGSSLSAKGINSNYGNKSYWGGNLEFPPST 600
            TSTSTNRIRLKVKLGKN+GHNFLNIVVPE+V SSLSAKGIN NYGN+SYW GNLEFPPST
Sbjct: 578  TSTSTNRIRLKVKLGKNVGHNFLNIVVPEIVDSSLSAKGINCNYGNESYWEGNLEFPPST 637

Query: 601  IGVDDQKLEEGPLRKIFCYNRNQEKEEKCSDPYIVKEQCATNDSSCTNIVSKLSVEHADD 660
            +GVDDQK EEGPL+KIFCY+RNQ+KEEKC D  +V EQCA NDSSCT  + K S +HADD
Sbjct: 638  LGVDDQKPEEGPLKKIFCYSRNQDKEEKCPDASVVNEQCANNDSSCTINIDKSSAKHADD 697

Query: 661  SFAVSSDLVELVEHASDTRNLDPGTSPDSEVINSILDIQVGAVRQENLQESVLASSEDFA 720
            +  VS  LVE VE  SDTRN DPGTSPDSEVINSILDI VGA+R+E LQ+SVLAS EDF+
Sbjct: 698  NLCVSPHLVEPVERVSDTRNSDPGTSPDSEVINSILDIPVGAMRREILQDSVLASLEDFS 757

Query: 721  ASGNVTSSKKGRKKEKPYQAVSCSQEGGTCASACSNGSKSSKKHGTRLNVDNQLGAGETF 780
            ASGN  S+ KGRKKEKP QAVSCS+EGGT ASACSN SKSSKKHG R NVDNQ G+GETF
Sbjct: 758  ASGNAVST-KGRKKEKPCQAVSCSEEGGTGASACSNRSKSSKKHGRRRNVDNQHGSGETF 817

Query: 781  TSVGENVLNDSFTFKELSTES----TEIEHPEEALKVERILDVKECCKTEACSVFPESEN 840
            T    N+LN + T KELS E     TEIE PEE LK + IL  KECC+T+  SVFPESEN
Sbjct: 818  TYTDANILNYALTVKELSMEQVPLLTEIELPEEVLKADNILKDKECCRTDVGSVFPESEN 877

Query: 841  LKMFLPSQSARKKHPKSSKPIKTSDCKPKDPGSKNKIKNASKERVYQRKFVNKSKIKKDI 900
             K FLPSQSA+KKHPK SK IKTS  K K PGSKNKIKNASKERVYQRK  NKSKIK+D+
Sbjct: 878  SKTFLPSQSAKKKHPKGSKSIKTSKDKLKAPGSKNKIKNASKERVYQRKSFNKSKIKEDL 937

Query: 901  CEQVVTETESHQIVGNFLVDKPAKVDDITASTVAINLNAVQGVVNEQYTPPRNAWVLCDD 960
            C++VVTE  SHQI+GN  VDK  K DDI ASTVA+NL+ VQG  NEQY PPRNAWVLCDD
Sbjct: 938  CDRVVTEMGSHQILGNCFVDKHEKSDDIIASTVAVNLSVVQGATNEQYMPPRNAWVLCDD 997

Query: 961  CQKWRRIAASLVDSLGHASCTWTCKDNVDKAFADCLIPQEKSNAEINAELEISDESGEEN 1020
            C KWRRI ASLVDSLGHASCTWTCKDNVDKAFA C IPQEKSNAEINAELEISDESGEEN
Sbjct: 998  CHKWRRIPASLVDSLGHASCTWTCKDNVDKAFAHCSIPQEKSNAEINAELEISDESGEEN 1057

Query: 1021 ASNKRLTYREFESFHPLTVTAVLQENKFASISSNQFLHRSRKTQNIDEVMVCHCKPTLDG 1080
            ASNKRLTYRE ESFHP TVTAV QENKFASISSNQFLHRSRKTQ IDE+MVCHCKP LDG
Sbjct: 1058 ASNKRLTYRELESFHPTTVTAVPQENKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDG 1117

Query: 1081 RLGCANECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDIS 1140
            RLGC +ECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDIS
Sbjct: 1118 RLGCGDECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDIS 1177

Query: 1141 KGKFLIEYVGEVLDMHAYESRQKEYAFNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSC 1200
            KG+FLIEYVGEVLDMHAYE+RQKEYA NGHRHFYFMTLNGSEVIDAC KGNLGRFINHSC
Sbjct: 1178 KGQFLIEYVGEVLDMHAYEARQKEYALNGHRHFYFMTLNGSEVIDACRKGNLGRFINHSC 1237

Query: 1201 DPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVCGAAAKKCYCGSSQCRGYIG 1260
            DPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRV GAAAKKCYCGS QCRGYIG
Sbjct: 1238 DPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVFGAAAKKCYCGSFQCRGYIG 1297

Query: 1261 GDPHNSEVIIHSDPDEEFPEPVMLRAYGRSSNGNLPTAASSVDGAKMQLSERIKGVRNKK 1320
            GDP NSEVII SD DEEFPEPVMLRA GRS N N+PTA SS+D AKMQ S  IKG+R+K+
Sbjct: 1298 GDPLNSEVIIQSDSDEEFPEPVMLRADGRSWNNNVPTAVSSLDVAKMQPSGHIKGIRDKR 1357

Query: 1321 EQPTGIAIQMKILEEKEEPFQLSALKIS---EDPPKLSASKISEEQEDHHNLSALIISPL 1380
            +QP  IAI+ KI EEK +  +LS  KIS   ED   LSASKISEE+E+H NLSA  ISPL
Sbjct: 1358 DQPIRIAIESKISEEKVDTLKLSVSKISEEKEDSLNLSASKISEEKEEHLNLSASTISPL 1417

Query: 1381 HSSLEFEDSK---------------------------------------------EAKLS 1440
            HSSLEFEDSK                                             EAKLS
Sbjct: 1418 HSSLEFEDSKVASPTPLPDITHQTEDVTSKPVFVDQTEISLVDNISDKNTCSIEQEAKLS 1477

Query: 1441 FDDIDARKNSELDAIEDKQVYINSHPQMKTSRKQGSIKKGKVSSVEKVKITNKPQILSLK 1500
             DDID RK S+LDAIEDKQVYI SHPQMKTSRK GSIKKGKVSSVEK++ITN+ QI S+K
Sbjct: 1478 VDDIDGRKKSKLDAIEDKQVYIKSHPQMKTSRKPGSIKKGKVSSVEKIQITNRSQISSVK 1537

Query: 1501 SKRLFEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQ 1560
             KRL EGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQ
Sbjct: 1538 PKRLIEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQ 1597

Query: 1561 SNRDLSMLLDALLKTKSRVVLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYL 1620
            SNRDLSM+LDALLKTKSRVVLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYL
Sbjct: 1598 SNRDLSMILDALLKTKSRVVLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYL 1657

Query: 1621 VTREILTSEHIYGSPPCPGMESLRESLLSLTEHDDKQVHQIARGFRDRWFPRHNRKFGYS 1680
            VTREILTSEHI G PPCPGMESLRESLLSLTEHDDKQVHQIAR FRDRWFPRH+RKFGYS
Sbjct: 1658 VTREILTSEHINGGPPCPGMESLRESLLSLTEHDDKQVHQIARSFRDRWFPRHSRKFGYS 1717

Query: 1681 GRADGRLEAYRGTNCSRFTASPSYCHDQDFRPSEAIDCNKQPSTPTSLSDFHPAEVCSVP 1740
             R DGRLE YRG+NCSRFTAS SY HDQD RP++AIDC KQ S PTSL D HP EVCSV 
Sbjct: 1718 EREDGRLEVYRGSNCSRFTASHSYRHDQDSRPTDAIDCVKQ-SMPTSLPDAHPVEVCSVA 1777

Query: 1741 STAGHSLDGQRIRKRKSRWDQPADTSIDLRSKEQKLESTSVQRFDSSQSNSVGVASMLVD 1800
            STAGHS +GQ++RKRKSRWDQPADTS+DLRSKEQKLESTSVQ+ +SSQ N VG+ASML+D
Sbjct: 1778 STAGHSSNGQKVRKRKSRWDQPADTSLDLRSKEQKLESTSVQQLNSSQLNCVGMASMLID 1837

Query: 1801 KINSDNMGSSLSGSVEICCRRDEDIRLDSAVHNTPEDIPPGFSSPFNLPVASSSPFSTIL 1860
            K+N+D+  SSLS SV + CR+DEDIR DSAV N PEDIPPGFSSPFN PVASSS FST+L
Sbjct: 1838 KVNNDDKDSSLSDSVGVRCRQDEDIRADSAVQNVPEDIPPGFSSPFNPPVASSSAFSTVL 1897

Query: 1861 DPPRQSIDNLSCTFSTVGHPQERFISRLPVSYGIPFSIVEQCGTSDSENLEFWCWDVAPG 1920
            DPPRQ+I +L C FSTVGHPQERFISR+PVSYGIPFSI+EQCGTS +ENLE  CWDVAPG
Sbjct: 1898 DPPRQNICDLGCAFSTVGHPQERFISRMPVSYGIPFSIIEQCGTSHAENLE--CWDVAPG 1957

Query: 1921 VPFHPFPPLPPYPRGKRGPSTSACGTAVTQSDRERQVKSHDSQTSFSEESAPSTSTNYQQ 1976
            VPFHPFPPLPPYPRGKRGP TSACGTAV QS +E QV SHDS+TSFSEES PSTSTNYQ 
Sbjct: 1958 VPFHPFPPLPPYPRGKRGPPTSACGTAVGQSSQEGQVNSHDSRTSFSEESPPSTSTNYQP 2017

BLAST of Sed0009206 vs. NCBI nr
Match: XP_038885000.1 (histone-lysine N-methyltransferase ASHH2 isoform X2 [Benincasa hispida])

HSP 1 Score: 2846.2 bits (7377), Expect = 0.0e+00
Identity = 1521/2049 (74.23%), Positives = 1665/2049 (81.26%), Query Frame = 0

Query: 1    MASSPSSS---RMCEPDRELGVITTTICANVSEVAAAGEDCTFRGLVRADTLSLDERFNS 60
            MAS PS S   +M EPDR L V TT +C N SE   AGED TFRG   ADTL +D+R + 
Sbjct: 38   MASFPSDSSDGQMFEPDRGLEVTTTCVCTNASESGTAGEDGTFRGFEHADTLLMDKRLDG 97

Query: 61   DSGDIGLGLNEEDESYNLGNETFSLDMEEPQDDEGLVDILGCKTTMEMMSLTGSLVNSV- 120
            DSGD G  LNE+ E+ N GN T SLDM+E QD +GLVDILGCKTTMEMMSL GSLV+SV 
Sbjct: 98   DSGDSGPCLNEDKEACNGGNRTLSLDMKESQDVDGLVDILGCKTTMEMMSLNGSLVDSVK 157

Query: 121  --ELNDDSFVVDTVAKVVR-----------------DDLKSPSHICEIVSNSASADGLPS 180
              +L+++S ++D  AKV R                 D+LKSP ++CEIVSNSASADGLPS
Sbjct: 158  PEDLDNNSCIIDAPAKVERDNTVANGPVLARMGTCTDNLKSP-YVCEIVSNSASADGLPS 217

Query: 181  DFIQLNELDNDG-GCSFSDVMDNRISDGSIVTEAEMLNEMSPLRSGQILSVHVGQLVANC 240
            DFIQ NEL+NDG GCSFS+  D RI++ S+  EA++LNEMSPL+SGQIL  ++   VAN 
Sbjct: 218  DFIQQNELENDGAGCSFSETAD-RITEASVEIEADVLNEMSPLQSGQILPTYMELSVANF 277

Query: 241  DQYICEMDGKNLSGTSGETINEVTDMNDIHESCLQMLPSHGCKK-RERLQSNSSPLTIHA 300
            DQY+C+M+GK+LSGTSGET+ EV  MN   E CLQMLPS  C++  E LQS+ SPLTI A
Sbjct: 278  DQYVCQMEGKSLSGTSGETVIEVAAMNSNPEVCLQMLPSQECERIGECLQSDGSPLTIDA 337

Query: 301  LENDQCGE-MSNSSPKYILEVVEDVDVSTTNNNADAEQYVDPKIKNNYNLEEATIQLSLN 360
             END C E   N+S KYI EVVED     TNNN+D  Q++ P I+N+ NLEE TIQ++ N
Sbjct: 338  SENDWCDEKRDNNSSKYITEVVEDDIDVLTNNNSDGGQHIVPGIENDRNLEEGTIQVNHN 397

Query: 361  CVELLASPLPSQFPNCERDEFHEMLNVADIPIKDISSVNSCSIGDQDNNDVEKVGCVSEV 420
            CVELLASPL SQ PN E+DEF+ MLN AD PIKDISSVNSCS+GDQD+ND+EKVGCVSEV
Sbjct: 398  CVELLASPLLSQPPNSEKDEFYGMLNGADFPIKDISSVNSCSVGDQDHNDIEKVGCVSEV 457

Query: 421  KCLETVIPFSKRSGRRKTSSQKTVTKRAPRKSRKKVPNALIFDTARRRRSSISRPARPSL 480
            KC ETVI  SKRSG+R+TS+QK VTKRA RKS+KKVP  LIFDTARRRRSS+SRPARPS 
Sbjct: 458  KCPETVITSSKRSGQRRTSNQKAVTKRASRKSKKKVPEPLIFDTARRRRSSLSRPARPSP 517

Query: 481  WGSLDYIIQSFENNEDVWVNQSQKQGNKKPKGNRGGTKLNNKQPSESSHRSRKGTQVNSA 540
            WGSL YIIQSFE  +DV +NQSQKQGN K K N+GG K N K+P ESSHRSRKGTQ   A
Sbjct: 518  WGSLGYIIQSFEEIDDVLINQSQKQGNDKSKSNQGGIKRNKKKPKESSHRSRKGTQGKCA 577

Query: 541  TSTSTNRIRLKVKLGKNMGHNFLNIVVPEVVGSSLSAKGINSNYGNKSYWGGNLEFPPST 600
            TSTSTNRIRLKVKLGKN+GHNFLNIVVPE+V SSLSAKGIN NYGN+SYW GNLEFPPST
Sbjct: 578  TSTSTNRIRLKVKLGKNVGHNFLNIVVPEIVDSSLSAKGINCNYGNESYWEGNLEFPPST 637

Query: 601  IGVDDQKLEEGPLRKIFCYNRNQEKEEKCSDPYIVKEQCATNDSSCTNIVSKLSVEHADD 660
            +GVDDQK EEGPL+KIFCY+RNQ+KEEKC D  +V EQCA NDSSCT  + K S +HADD
Sbjct: 638  LGVDDQKPEEGPLKKIFCYSRNQDKEEKCPDASVVNEQCANNDSSCTINIDKSSAKHADD 697

Query: 661  SFAVSSDLVELVEHASDTRNLDPGTSPDSEVINSILDIQVGAVRQENLQESVLASSEDFA 720
            +  VS  LVE VE  SDTRN DPGTSPDSEVINSILDI VGA+R+E LQ+SVLAS EDF+
Sbjct: 698  NLCVSPHLVEPVERVSDTRNSDPGTSPDSEVINSILDIPVGAMRREILQDSVLASLEDFS 757

Query: 721  ASGNVTSSKKGRKKEKPYQAVSCSQEGGTCASACSNGSKSSKKHGTRLNVDNQLGAGETF 780
            ASGN  S+ KGRKKEKP QAVSCS+EGGT ASACSN SKSSKKHG R NVDNQ G     
Sbjct: 758  ASGNAVST-KGRKKEKPCQAVSCSEEGGTGASACSNRSKSSKKHGRRRNVDNQHG----- 817

Query: 781  TSVGENVLNDSFTFKELSTESTEIEHPEEALKVERILDVKECCKTEACSVFPESENLKMF 840
                                 +EIE PEE LK + IL  KECC+T+  SVFPESEN K F
Sbjct: 818  ---------------------SEIELPEEVLKADNILKDKECCRTDVGSVFPESENSKTF 877

Query: 841  LPSQSARKKHPKSSKPIKTSDCKPKDPGSKNKIKNASKERVYQRKFVNKSKIKKDICEQV 900
            LPSQSA+KKHPK SK IKTS  K K PGSKNKIKNASKERVYQRK  NKSKIK+D+C++V
Sbjct: 878  LPSQSAKKKHPKGSKSIKTSKDKLKAPGSKNKIKNASKERVYQRKSFNKSKIKEDLCDRV 937

Query: 901  VTETESHQIVGNFLVDKPAKVDDITASTVAINLNAVQGVVNEQYTPPRNAWVLCDDCQKW 960
            VTE  SHQI+GN  VDK  K DDI ASTVA+NL+ VQG  NEQY PPRNAWVLCDDC KW
Sbjct: 938  VTEMGSHQILGNCFVDKHEKSDDIIASTVAVNLSVVQGATNEQYMPPRNAWVLCDDCHKW 997

Query: 961  RRIAASLVDSLGHASCTWTCKDNVDKAFADCLIPQEKSNAEINAELEISDESGEENASNK 1020
            RRI ASLVDSLGHASCTWTCKDNVDKAFA C IPQEKSNAEINAELEISDESGEENASNK
Sbjct: 998  RRIPASLVDSLGHASCTWTCKDNVDKAFAHCSIPQEKSNAEINAELEISDESGEENASNK 1057

Query: 1021 RLTYREFESFHPLTVTAVLQENKFASISSNQFLHRSRKTQNIDEVMVCHCKPTLDGRLGC 1080
            RLTYRE ESFHP TVTAV QENKFASISSNQFLHRSRKTQ IDE+MVCHCKP LDGRLGC
Sbjct: 1058 RLTYRELESFHPTTVTAVPQENKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGRLGC 1117

Query: 1081 ANECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGKF 1140
             +ECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKG+F
Sbjct: 1118 GDECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQF 1177

Query: 1141 LIEYVGEVLDMHAYESRQKEYAFNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNC 1200
            LIEYVGEVLDMHAYE+RQKEYA NGHRHFYFMTLNGSEVIDAC KGNLGRFINHSCDPNC
Sbjct: 1178 LIEYVGEVLDMHAYEARQKEYALNGHRHFYFMTLNGSEVIDACRKGNLGRFINHSCDPNC 1237

Query: 1201 RTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVCGAAAKKCYCGSSQCRGYIGGDPH 1260
            RTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRV GAAAKKCYCGS QCRGYIGGDP 
Sbjct: 1238 RTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVFGAAAKKCYCGSFQCRGYIGGDPL 1297

Query: 1261 NSEVIIHSDPDEEFPEPVMLRAYGRSSNGNLPTAASSVDGAKMQLSERIKGVRNKKEQPT 1320
            NSEVII SD DEEFPEPVMLRA GRS N N+PTA SS+D AKMQ S  IKG+R+K++QP 
Sbjct: 1298 NSEVIIQSDSDEEFPEPVMLRADGRSWNNNVPTAVSSLDVAKMQPSGHIKGIRDKRDQPI 1357

Query: 1321 GIAIQMKILEEKEEPFQLSALKIS---EDPPKLSASKISEEQEDHHNLSALIISPLHSSL 1380
             IAI+ KI EEK +  +LS  KIS   ED   LSASKISEE+E+H NLSA  ISPLHSSL
Sbjct: 1358 RIAIESKISEEKVDTLKLSVSKISEEKEDSLNLSASKISEEKEEHLNLSASTISPLHSSL 1417

Query: 1381 EFEDSK---------------------------------------------EAKLSFDDI 1440
            EFEDSK                                             EAKLS DDI
Sbjct: 1418 EFEDSKVASPTPLPDITHQTEDVTSKPVFVDQTEISLVDNISDKNTCSIEQEAKLSVDDI 1477

Query: 1441 DARKNSELDAIEDKQVYINSHPQMKTSRKQGSIKKGKVSSVEKVKITNKPQILSLKSKRL 1500
            D RK S+LDAIEDKQVYI SHPQMKTSRK GSIKKGKVSSVEK++ITN+ QI S+K KRL
Sbjct: 1478 DGRKKSKLDAIEDKQVYIKSHPQMKTSRKPGSIKKGKVSSVEKIQITNRSQISSVKPKRL 1537

Query: 1501 FEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRD 1560
             EGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRD
Sbjct: 1538 IEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRD 1597

Query: 1561 LSMLLDALLKTKSRVVLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTRE 1620
            LSM+LDALLKTKSRVVLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTRE
Sbjct: 1598 LSMILDALLKTKSRVVLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTRE 1657

Query: 1621 ILTSEHIYGSPPCPGMESLRESLLSLTEHDDKQVHQIARGFRDRWFPRHNRKFGYSGRAD 1680
            ILTSEHI G PPCPGMESLRESLLSLTEHDDKQVHQIAR FRDRWFPRH+RKFGYS R D
Sbjct: 1658 ILTSEHINGGPPCPGMESLRESLLSLTEHDDKQVHQIARSFRDRWFPRHSRKFGYSERED 1717

Query: 1681 GRLEAYRGTNCSRFTASPSYCHDQDFRPSEAIDCNKQPSTPTSLSDFHPAEVCSVPSTAG 1740
            GRLE YRG+NCSRFTAS SY HDQD RP++AIDC KQ S PTSL D HP EVCSV STAG
Sbjct: 1718 GRLEVYRGSNCSRFTASHSYRHDQDSRPTDAIDCVKQ-SMPTSLPDAHPVEVCSVASTAG 1777

Query: 1741 HSLDGQRIRKRKSRWDQPADTSIDLRSKEQKLESTSVQRFDSSQSNSVGVASMLVDKINS 1800
            HS +GQ++RKRKSRWDQPADTS+DLRSKEQKLESTSVQ+ +SSQ N VG+ASML+DK+N+
Sbjct: 1778 HSSNGQKVRKRKSRWDQPADTSLDLRSKEQKLESTSVQQLNSSQLNCVGMASMLIDKVNN 1837

Query: 1801 DNMGSSLSGSVEICCRRDEDIRLDSAVHNTPEDIPPGFSSPFNLPVASSSPFSTILDPPR 1860
            D+  SSLS SV + CR+DEDIR DSAV N PEDIPPGFSSPFN PVASSS FST+LDPPR
Sbjct: 1838 DDKDSSLSDSVGVRCRQDEDIRADSAVQNVPEDIPPGFSSPFNPPVASSSAFSTVLDPPR 1897

Query: 1861 QSIDNLSCTFSTVGHPQERFISRLPVSYGIPFSIVEQCGTSDSENLEFWCWDVAPGVPFH 1920
            Q+I +L C FSTVGHPQERFISR+PVSYGIPFSI+EQCGTS +ENLE  CWDVAPGVPFH
Sbjct: 1898 QNICDLGCAFSTVGHPQERFISRMPVSYGIPFSIIEQCGTSHAENLE--CWDVAPGVPFH 1957

Query: 1921 PFPPLPPYPRGKRGPSTSACGTAVTQSDRERQVKSHDSQTSFSEESAPSTSTNYQQGLRI 1976
            PFPPLPPYPRGKRGP TSACGTAV QS +E QV SHDS+TSFSEES PSTSTNYQ  L  
Sbjct: 1958 PFPPLPPYPRGKRGPPTSACGTAVGQSSQEGQVNSHDSRTSFSEESPPSTSTNYQPDLCT 2017

BLAST of Sed0009206 vs. NCBI nr
Match: XP_011657417.1 (histone-lysine N-methyltransferase ASHH2 isoform X1 [Cucumis sativus] >XP_011657418.1 histone-lysine N-methyltransferase ASHH2 isoform X1 [Cucumis sativus] >KAE8647263.1 hypothetical protein Csa_018308 [Cucumis sativus])

HSP 1 Score: 2785.0 bits (7218), Expect = 0.0e+00
Identity = 1495/2051 (72.89%), Positives = 1652/2051 (80.55%), Query Frame = 0

Query: 1    MASSPSSSR---MCEPDRELGVITTTICANVSEVAAAGEDCTFRGLVRADTLSLDERFNS 60
            MAS  S+SR   M EPDR L V T ++C N S+   +GED T RG   AD+L +D+R + 
Sbjct: 38   MASFSSNSREGQMFEPDRGLEVTTASLCTNASDPDTSGEDGTLRGFEHADSLLMDKRLDG 97

Query: 61   DSGDIGLGLNEEDESYNLGNETFSLDMEEPQDDEGLVDILGCKTTMEMMSLTGSLVNSV- 120
            DSG     LN ++ES N GN+T SLDM+E +D +GLVDILGC  TMEM+SLT SLVNSV 
Sbjct: 98   DSGGSDPCLNLDNESCNEGNKTLSLDMKESEDVDGLVDILGCDATMEMISLTESLVNSVK 157

Query: 121  --ELNDDSFVVDTVAKVVR-----------------DDLKSPSHICEIVSNSASADGLPS 180
              EL+++S ++D  AKV R                 DDLKS S++CEIVSNSASADGLP+
Sbjct: 158  PEELDNNSCIIDAPAKVERDDTAQNGPILAGTGTRTDDLKS-SYVCEIVSNSASADGLPN 217

Query: 181  DFIQLNELDNDG-GCSFSDVMDNRISDGSIVTEAEMLNEMSPLRSGQILSVHVGQLVANC 240
            DFIQ NEL+NDG GCSFS+V D RI++ S+  EA+MLNEMSPL+SGQIL +HVGQ +AN 
Sbjct: 218  DFIQKNELENDGAGCSFSEVAD-RITEASVELEADMLNEMSPLQSGQILPIHVGQSIANY 277

Query: 241  DQYICEMDGKNLSGTSGETINEVTDMNDIHESCLQMLPSHGCKK-RERLQSNSSPLTIHA 300
            D+Y+C MDGK+LS TSGET+  V DMN   E CLQMLPS GC +  E LQS+  PLTI+A
Sbjct: 278  DRYVCRMDGKSLSSTSGETVTVVADMNSNPEGCLQMLPSQGCDRIGECLQSDGLPLTINA 337

Query: 301  LENDQCGEM--SNSSPKYILEVVEDVDVSTTNNNADAEQYVDPKIKNNYNLEEATIQLSL 360
             END C E   SNSS KY+ +V  D     TNNN+D  Q+  P I N++NLE+AT+Q++ 
Sbjct: 338  SENDLCEEKHDSNSSSKYVPDVGGDDSDVLTNNNSDGGQHTVPGIGNDHNLEDATVQVNH 397

Query: 361  NCVELLASPLPSQFPNCERDEFHEMLNVADIPIKDISSVNSCSIGDQDNNDVEKVGCVSE 420
            +CVELL+SPLPSQ PN E+DEF+ MLN ADIPIK ISSVNSCS+GDQDNND+EKVGCVSE
Sbjct: 398  DCVELLSSPLPSQLPNSEKDEFYGMLNGADIPIKYISSVNSCSVGDQDNNDIEKVGCVSE 457

Query: 421  VKCLETVIPFSKRSGRRKTSSQKTVTKRAPRKSRKKVPNALIFDTARRRRSSISRPARPS 480
            VKC ETVI  SKRSGRR+TSSQKTVTKRA RK++KKVP  LIFDTARRRRSSISRPARPS
Sbjct: 458  VKCPETVITSSKRSGRRRTSSQKTVTKRASRKTKKKVPEPLIFDTARRRRSSISRPARPS 517

Query: 481  LWGSLDYIIQSFENNEDVWVNQSQKQGNKKPKGNRGGTKLNNKQPSESSHRSRKGTQVNS 540
             WGSL +IIQSFE  +DV VNQ+QKQGN+K KGN+GG K N KQ SESSHRSRKGTQ  S
Sbjct: 518  PWGSLGHIIQSFEEIDDVLVNQTQKQGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQGKS 577

Query: 541  ATSTSTNRIRLKVKLGKNMGHNFLNIVVPEVVGSSLSAKGINSNYGNKSYWGGNLEFPPS 600
            ATSTSTNRIRLKVKLGKN+GHNFLNIVVPE+V SSLSAKG+N NYGN+SYW GNLEFPPS
Sbjct: 578  ATSTSTNRIRLKVKLGKNVGHNFLNIVVPEIVDSSLSAKGVNCNYGNESYWEGNLEFPPS 637

Query: 601  TIGVDDQKL-EEGPLRKIFCYNRNQEKEEKCSDPYIVKEQCATNDSSCTNIVSKLSVEHA 660
             +GVDDQK  EEGPLRKIFCY+RNQ+KE+ C D  +V EQC  NDSSC   + K S +HA
Sbjct: 638  NLGVDDQKAEEEGPLRKIFCYSRNQDKEDNCPDASVVNEQCTNNDSSCIVGIDKSSEKHA 697

Query: 661  DDSFAVSSDLVELVEHASDTRNLDPGTSPDSEVINSILDIQVGAVRQENLQESVLASSED 720
            DD+  VSS LV+ V   SD R+LDPGTSPDSEVINS+LDIQVGA RQE LQ+SVLAS ED
Sbjct: 698  DDNLCVSSHLVDPVA-TSDARSLDPGTSPDSEVINSVLDIQVGAARQEILQDSVLASLED 757

Query: 721  FAASGNVTSSKKGRKKEKPYQAVSCSQEGGTCASACSNGSKSSKKHGTRLNVDNQLGAGE 780
            FAASGN   SKKGRKK+KP + VSCS+E G   SACSN SKSSKKHG R NVDNQL    
Sbjct: 758  FAASGNAPGSKKGRKKDKPSRVVSCSEERGISVSACSNRSKSSKKHGRRHNVDNQL---- 817

Query: 781  TFTSVGENVLNDSFTFKELSTESTEIEHPEEALKVERILDVKECCKTEACSVFPESENLK 840
                                  S+EIE PEE LK E IL+ KECC+ +  SVF ESEN K
Sbjct: 818  ----------------------SSEIELPEETLKAEDILNDKECCRADVGSVFSESENSK 877

Query: 841  MFLPSQSARKKHPKSSKPIKTSDCKPKDPGSKNKIKNASKERVYQRKFVNKSKIKKDICE 900
             FLPSQSA+KKHPK SK IKTS  K K PGSKNKIKNAS ERVYQRK    SK K+ +C+
Sbjct: 878  TFLPSQSAKKKHPKGSKSIKTSKGKSKAPGSKNKIKNASNERVYQRKSFKNSKSKEALCD 937

Query: 901  QVVTETESHQIVGNFLVDKPAKVDDITASTVAINLNAVQGVVNEQYTPPRNAWVLCDDCQ 960
            QVVTETESHQI+GN LVDKP K D+I ASTVA++L+ VQG VNEQY PPRNAWVLCDDC 
Sbjct: 938  QVVTETESHQIIGNCLVDKPEKSDNIIASTVAVDLSVVQGAVNEQYMPPRNAWVLCDDCH 997

Query: 961  KWRRIAASLVDSLGHASCTWTCKDNVDKAFADCLIPQEKSNAEINAELEISDESGEENAS 1020
            KWRRI ASLVDSLGHASCTWTCKDNVDKAFA+C IPQEKSNAEINAELEISDESGEEN S
Sbjct: 998  KWRRIPASLVDSLGHASCTWTCKDNVDKAFANCSIPQEKSNAEINAELEISDESGEENGS 1057

Query: 1021 NKRLTYREFESFHPLTVTAVLQENKFASISSNQFLHRSRKTQNIDEVMVCHCKPTLDGRL 1080
             KRLTYRE ESFHP TV AV Q+NKFASISSNQFLHRSRKTQ IDE+MVCHCKP LDGRL
Sbjct: 1058 KKRLTYRELESFHPATVNAVPQQNKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGRL 1117

Query: 1081 GCANECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKG 1140
            GC +ECLNRMLNIECVRGTCPCG+LCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKG
Sbjct: 1118 GCGDECLNRMLNIECVRGTCPCGELCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKG 1177

Query: 1141 KFLIEYVGEVLDMHAYESRQKEYAFNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDP 1200
            +FLIEYVGEVLDMHAYE+RQKEYA NGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDP
Sbjct: 1178 QFLIEYVGEVLDMHAYEARQKEYALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDP 1237

Query: 1201 NCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVCGAAAKKCYCGSSQCRGYIGGD 1260
            NCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRV GAAAKKCYCGS  CRGYIGGD
Sbjct: 1238 NCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGD 1297

Query: 1261 PHNSEVIIHSDPDEEFPEPVMLRAYGRSSNGNLPTAASSVDGAKMQLSERIKGVRNKKEQ 1320
            P NSEVII SD DEEFPEPVMLR  GRS N NL TA SS+D AKMQ SE +KG R+K++Q
Sbjct: 1298 PLNSEVIIQSDSDEEFPEPVMLRGDGRSLNSNLSTAVSSMDVAKMQSSEHLKGNRDKRDQ 1357

Query: 1321 PTGIAIQMKILEEKEEPFQLSALKIS---EDPPKLSASKISEEQEDHHNLSALIISPLHS 1380
            P  IA ++KI EEK +P +LSA KIS   EDP KLSA+KISEE+ED  NLSA  ISPLHS
Sbjct: 1358 PIRIASELKISEEKVDPLKLSASKISEEKEDPLKLSATKISEEKEDPLNLSASTISPLHS 1417

Query: 1381 SLEFEDSK---------------------------------------------EAKLSFD 1440
            SLEFEDSK                                             EAKLS D
Sbjct: 1418 SLEFEDSKVASPIPVPDITHQTEDVTSQPIFVDQTEISLLDNIPDKNTCSIEQEAKLSVD 1477

Query: 1441 DIDARKNSELDAIEDKQVYINSHPQMKTSRKQGSIKKGKVSSVEKVKITNKPQILSLKSK 1500
            DIDARK S+LD++EDKQVYI SHP+MKTSRK GSIKKGKVSS EK++ITN+ QI S+K K
Sbjct: 1478 DIDARKKSKLDSVEDKQVYIKSHPRMKTSRKLGSIKKGKVSSAEKIQITNRSQISSVKPK 1537

Query: 1501 RLFEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSN 1560
            RL EGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSN
Sbjct: 1538 RLIEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSN 1597

Query: 1561 RDLSMLLDALLKTKSRVVLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVT 1620
            RDLSM+LDALLKTKSR+VLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVT
Sbjct: 1598 RDLSMILDALLKTKSRLVLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVT 1657

Query: 1621 REILTSEHIYGSPPCPGMESLRESLLSLTEHDDKQVHQIARGFRDRWFPRHNRKFGYSGR 1680
            REILTSEHI G PPCPGMESLRESLLSLTEHDDKQVHQIAR FRDRWFPRH RKFGYS R
Sbjct: 1658 REILTSEHINGGPPCPGMESLRESLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFGYSER 1717

Query: 1681 ADGRLEAYRGTNCSRFTASPSYCHDQDFRPSEAIDCNKQPSTPTSLSDFHPAEVCSVPST 1740
             DGRLE YRG+N SRFTAS S+ HDQD RP++AIDC KQ S PTSL D HPAEVCS+ S 
Sbjct: 1718 EDGRLEVYRGSNSSRFTASHSFRHDQDCRPTDAIDCIKQ-SMPTSLPDAHPAEVCSLASA 1777

Query: 1741 AGHSLDGQRIRKRKSRWDQPADTSIDLRSKEQKLESTSVQRFDSSQSNSVGVASMLVDKI 1800
            A HS++GQ++RKRKSRWDQPADTS+DLRSKEQKLESTSVQ  +SSQ NSVG ASML+DK+
Sbjct: 1778 ASHSVNGQKVRKRKSRWDQPADTSLDLRSKEQKLESTSVQELNSSQLNSVGAASMLIDKV 1837

Query: 1801 NSDNMGSSLSGSVEICCRRDEDIRLDSAVHNTPEDIPPGFSSPFNLPVASSSPFSTILDP 1860
            N+D+   SLS SV + CR+DEDIR DSAV N PEDIPPGFSSPFN PVASSS FS +LDP
Sbjct: 1838 NNDDKDISLSDSVGVPCRQDEDIRADSAVPNIPEDIPPGFSSPFNPPVASSSAFSAVLDP 1897

Query: 1861 PRQSIDNLSCTFSTVGHPQERFISRLPVSYGIPFSIVEQCGTSDSENLEFWCWDVAPGVP 1920
            PRQ+I +LSC FSTVGH QERFISRLPVSYGIPFSI+EQCGTS +ENLE  CWDVAPGVP
Sbjct: 1898 PRQNIGDLSCAFSTVGHLQERFISRLPVSYGIPFSIIEQCGTSHAENLE--CWDVAPGVP 1957

Query: 1921 FHPFPPLPPYPRGKRGPSTSACGTAVTQSDRERQVKSHDSQTSFSEESAPSTSTNYQQGL 1976
            FHPFPPLPPYPRG RG  TSACGTA  QS +E QV SHDS+TSFSEES PSTSTNYQ  L
Sbjct: 1958 FHPFPPLPPYPRGMRGLPTSACGTA-GQSSQEGQVNSHDSRTSFSEESPPSTSTNYQTDL 2017

BLAST of Sed0009206 vs. NCBI nr
Match: XP_031744047.1 (histone-lysine N-methyltransferase ASHH2 isoform X2 [Cucumis sativus])

HSP 1 Score: 2785.0 bits (7218), Expect = 0.0e+00
Identity = 1495/2051 (72.89%), Positives = 1652/2051 (80.55%), Query Frame = 0

Query: 1    MASSPSSSR---MCEPDRELGVITTTICANVSEVAAAGEDCTFRGLVRADTLSLDERFNS 60
            MAS  S+SR   M EPDR L V T ++C N S+   +GED T RG   AD+L +D+R + 
Sbjct: 1    MASFSSNSREGQMFEPDRGLEVTTASLCTNASDPDTSGEDGTLRGFEHADSLLMDKRLDG 60

Query: 61   DSGDIGLGLNEEDESYNLGNETFSLDMEEPQDDEGLVDILGCKTTMEMMSLTGSLVNSV- 120
            DSG     LN ++ES N GN+T SLDM+E +D +GLVDILGC  TMEM+SLT SLVNSV 
Sbjct: 61   DSGGSDPCLNLDNESCNEGNKTLSLDMKESEDVDGLVDILGCDATMEMISLTESLVNSVK 120

Query: 121  --ELNDDSFVVDTVAKVVR-----------------DDLKSPSHICEIVSNSASADGLPS 180
              EL+++S ++D  AKV R                 DDLKS S++CEIVSNSASADGLP+
Sbjct: 121  PEELDNNSCIIDAPAKVERDDTAQNGPILAGTGTRTDDLKS-SYVCEIVSNSASADGLPN 180

Query: 181  DFIQLNELDNDG-GCSFSDVMDNRISDGSIVTEAEMLNEMSPLRSGQILSVHVGQLVANC 240
            DFIQ NEL+NDG GCSFS+V D RI++ S+  EA+MLNEMSPL+SGQIL +HVGQ +AN 
Sbjct: 181  DFIQKNELENDGAGCSFSEVAD-RITEASVELEADMLNEMSPLQSGQILPIHVGQSIANY 240

Query: 241  DQYICEMDGKNLSGTSGETINEVTDMNDIHESCLQMLPSHGCKK-RERLQSNSSPLTIHA 300
            D+Y+C MDGK+LS TSGET+  V DMN   E CLQMLPS GC +  E LQS+  PLTI+A
Sbjct: 241  DRYVCRMDGKSLSSTSGETVTVVADMNSNPEGCLQMLPSQGCDRIGECLQSDGLPLTINA 300

Query: 301  LENDQCGEM--SNSSPKYILEVVEDVDVSTTNNNADAEQYVDPKIKNNYNLEEATIQLSL 360
             END C E   SNSS KY+ +V  D     TNNN+D  Q+  P I N++NLE+AT+Q++ 
Sbjct: 301  SENDLCEEKHDSNSSSKYVPDVGGDDSDVLTNNNSDGGQHTVPGIGNDHNLEDATVQVNH 360

Query: 361  NCVELLASPLPSQFPNCERDEFHEMLNVADIPIKDISSVNSCSIGDQDNNDVEKVGCVSE 420
            +CVELL+SPLPSQ PN E+DEF+ MLN ADIPIK ISSVNSCS+GDQDNND+EKVGCVSE
Sbjct: 361  DCVELLSSPLPSQLPNSEKDEFYGMLNGADIPIKYISSVNSCSVGDQDNNDIEKVGCVSE 420

Query: 421  VKCLETVIPFSKRSGRRKTSSQKTVTKRAPRKSRKKVPNALIFDTARRRRSSISRPARPS 480
            VKC ETVI  SKRSGRR+TSSQKTVTKRA RK++KKVP  LIFDTARRRRSSISRPARPS
Sbjct: 421  VKCPETVITSSKRSGRRRTSSQKTVTKRASRKTKKKVPEPLIFDTARRRRSSISRPARPS 480

Query: 481  LWGSLDYIIQSFENNEDVWVNQSQKQGNKKPKGNRGGTKLNNKQPSESSHRSRKGTQVNS 540
             WGSL +IIQSFE  +DV VNQ+QKQGN+K KGN+GG K N KQ SESSHRSRKGTQ  S
Sbjct: 481  PWGSLGHIIQSFEEIDDVLVNQTQKQGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQGKS 540

Query: 541  ATSTSTNRIRLKVKLGKNMGHNFLNIVVPEVVGSSLSAKGINSNYGNKSYWGGNLEFPPS 600
            ATSTSTNRIRLKVKLGKN+GHNFLNIVVPE+V SSLSAKG+N NYGN+SYW GNLEFPPS
Sbjct: 541  ATSTSTNRIRLKVKLGKNVGHNFLNIVVPEIVDSSLSAKGVNCNYGNESYWEGNLEFPPS 600

Query: 601  TIGVDDQKL-EEGPLRKIFCYNRNQEKEEKCSDPYIVKEQCATNDSSCTNIVSKLSVEHA 660
             +GVDDQK  EEGPLRKIFCY+RNQ+KE+ C D  +V EQC  NDSSC   + K S +HA
Sbjct: 601  NLGVDDQKAEEEGPLRKIFCYSRNQDKEDNCPDASVVNEQCTNNDSSCIVGIDKSSEKHA 660

Query: 661  DDSFAVSSDLVELVEHASDTRNLDPGTSPDSEVINSILDIQVGAVRQENLQESVLASSED 720
            DD+  VSS LV+ V   SD R+LDPGTSPDSEVINS+LDIQVGA RQE LQ+SVLAS ED
Sbjct: 661  DDNLCVSSHLVDPVA-TSDARSLDPGTSPDSEVINSVLDIQVGAARQEILQDSVLASLED 720

Query: 721  FAASGNVTSSKKGRKKEKPYQAVSCSQEGGTCASACSNGSKSSKKHGTRLNVDNQLGAGE 780
            FAASGN   SKKGRKK+KP + VSCS+E G   SACSN SKSSKKHG R NVDNQL    
Sbjct: 721  FAASGNAPGSKKGRKKDKPSRVVSCSEERGISVSACSNRSKSSKKHGRRHNVDNQL---- 780

Query: 781  TFTSVGENVLNDSFTFKELSTESTEIEHPEEALKVERILDVKECCKTEACSVFPESENLK 840
                                  S+EIE PEE LK E IL+ KECC+ +  SVF ESEN K
Sbjct: 781  ----------------------SSEIELPEETLKAEDILNDKECCRADVGSVFSESENSK 840

Query: 841  MFLPSQSARKKHPKSSKPIKTSDCKPKDPGSKNKIKNASKERVYQRKFVNKSKIKKDICE 900
             FLPSQSA+KKHPK SK IKTS  K K PGSKNKIKNAS ERVYQRK    SK K+ +C+
Sbjct: 841  TFLPSQSAKKKHPKGSKSIKTSKGKSKAPGSKNKIKNASNERVYQRKSFKNSKSKEALCD 900

Query: 901  QVVTETESHQIVGNFLVDKPAKVDDITASTVAINLNAVQGVVNEQYTPPRNAWVLCDDCQ 960
            QVVTETESHQI+GN LVDKP K D+I ASTVA++L+ VQG VNEQY PPRNAWVLCDDC 
Sbjct: 901  QVVTETESHQIIGNCLVDKPEKSDNIIASTVAVDLSVVQGAVNEQYMPPRNAWVLCDDCH 960

Query: 961  KWRRIAASLVDSLGHASCTWTCKDNVDKAFADCLIPQEKSNAEINAELEISDESGEENAS 1020
            KWRRI ASLVDSLGHASCTWTCKDNVDKAFA+C IPQEKSNAEINAELEISDESGEEN S
Sbjct: 961  KWRRIPASLVDSLGHASCTWTCKDNVDKAFANCSIPQEKSNAEINAELEISDESGEENGS 1020

Query: 1021 NKRLTYREFESFHPLTVTAVLQENKFASISSNQFLHRSRKTQNIDEVMVCHCKPTLDGRL 1080
             KRLTYRE ESFHP TV AV Q+NKFASISSNQFLHRSRKTQ IDE+MVCHCKP LDGRL
Sbjct: 1021 KKRLTYRELESFHPATVNAVPQQNKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGRL 1080

Query: 1081 GCANECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKG 1140
            GC +ECLNRMLNIECVRGTCPCG+LCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKG
Sbjct: 1081 GCGDECLNRMLNIECVRGTCPCGELCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKG 1140

Query: 1141 KFLIEYVGEVLDMHAYESRQKEYAFNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDP 1200
            +FLIEYVGEVLDMHAYE+RQKEYA NGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDP
Sbjct: 1141 QFLIEYVGEVLDMHAYEARQKEYALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDP 1200

Query: 1201 NCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVCGAAAKKCYCGSSQCRGYIGGD 1260
            NCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRV GAAAKKCYCGS  CRGYIGGD
Sbjct: 1201 NCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGD 1260

Query: 1261 PHNSEVIIHSDPDEEFPEPVMLRAYGRSSNGNLPTAASSVDGAKMQLSERIKGVRNKKEQ 1320
            P NSEVII SD DEEFPEPVMLR  GRS N NL TA SS+D AKMQ SE +KG R+K++Q
Sbjct: 1261 PLNSEVIIQSDSDEEFPEPVMLRGDGRSLNSNLSTAVSSMDVAKMQSSEHLKGNRDKRDQ 1320

Query: 1321 PTGIAIQMKILEEKEEPFQLSALKIS---EDPPKLSASKISEEQEDHHNLSALIISPLHS 1380
            P  IA ++KI EEK +P +LSA KIS   EDP KLSA+KISEE+ED  NLSA  ISPLHS
Sbjct: 1321 PIRIASELKISEEKVDPLKLSASKISEEKEDPLKLSATKISEEKEDPLNLSASTISPLHS 1380

Query: 1381 SLEFEDSK---------------------------------------------EAKLSFD 1440
            SLEFEDSK                                             EAKLS D
Sbjct: 1381 SLEFEDSKVASPIPVPDITHQTEDVTSQPIFVDQTEISLLDNIPDKNTCSIEQEAKLSVD 1440

Query: 1441 DIDARKNSELDAIEDKQVYINSHPQMKTSRKQGSIKKGKVSSVEKVKITNKPQILSLKSK 1500
            DIDARK S+LD++EDKQVYI SHP+MKTSRK GSIKKGKVSS EK++ITN+ QI S+K K
Sbjct: 1441 DIDARKKSKLDSVEDKQVYIKSHPRMKTSRKLGSIKKGKVSSAEKIQITNRSQISSVKPK 1500

Query: 1501 RLFEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSN 1560
            RL EGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSN
Sbjct: 1501 RLIEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSN 1560

Query: 1561 RDLSMLLDALLKTKSRVVLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVT 1620
            RDLSM+LDALLKTKSR+VLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVT
Sbjct: 1561 RDLSMILDALLKTKSRLVLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVT 1620

Query: 1621 REILTSEHIYGSPPCPGMESLRESLLSLTEHDDKQVHQIARGFRDRWFPRHNRKFGYSGR 1680
            REILTSEHI G PPCPGMESLRESLLSLTEHDDKQVHQIAR FRDRWFPRH RKFGYS R
Sbjct: 1621 REILTSEHINGGPPCPGMESLRESLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFGYSER 1680

Query: 1681 ADGRLEAYRGTNCSRFTASPSYCHDQDFRPSEAIDCNKQPSTPTSLSDFHPAEVCSVPST 1740
             DGRLE YRG+N SRFTAS S+ HDQD RP++AIDC KQ S PTSL D HPAEVCS+ S 
Sbjct: 1681 EDGRLEVYRGSNSSRFTASHSFRHDQDCRPTDAIDCIKQ-SMPTSLPDAHPAEVCSLASA 1740

Query: 1741 AGHSLDGQRIRKRKSRWDQPADTSIDLRSKEQKLESTSVQRFDSSQSNSVGVASMLVDKI 1800
            A HS++GQ++RKRKSRWDQPADTS+DLRSKEQKLESTSVQ  +SSQ NSVG ASML+DK+
Sbjct: 1741 ASHSVNGQKVRKRKSRWDQPADTSLDLRSKEQKLESTSVQELNSSQLNSVGAASMLIDKV 1800

Query: 1801 NSDNMGSSLSGSVEICCRRDEDIRLDSAVHNTPEDIPPGFSSPFNLPVASSSPFSTILDP 1860
            N+D+   SLS SV + CR+DEDIR DSAV N PEDIPPGFSSPFN PVASSS FS +LDP
Sbjct: 1801 NNDDKDISLSDSVGVPCRQDEDIRADSAVPNIPEDIPPGFSSPFNPPVASSSAFSAVLDP 1860

Query: 1861 PRQSIDNLSCTFSTVGHPQERFISRLPVSYGIPFSIVEQCGTSDSENLEFWCWDVAPGVP 1920
            PRQ+I +LSC FSTVGH QERFISRLPVSYGIPFSI+EQCGTS +ENLE  CWDVAPGVP
Sbjct: 1861 PRQNIGDLSCAFSTVGHLQERFISRLPVSYGIPFSIIEQCGTSHAENLE--CWDVAPGVP 1920

Query: 1921 FHPFPPLPPYPRGKRGPSTSACGTAVTQSDRERQVKSHDSQTSFSEESAPSTSTNYQQGL 1976
            FHPFPPLPPYPRG RG  TSACGTA  QS +E QV SHDS+TSFSEES PSTSTNYQ  L
Sbjct: 1921 FHPFPPLPPYPRGMRGLPTSACGTA-GQSSQEGQVNSHDSRTSFSEESPPSTSTNYQTDL 1980

BLAST of Sed0009206 vs. ExPASy Swiss-Prot
Match: Q2LAE1 (Histone-lysine N-methyltransferase ASHH2 OS=Arabidopsis thaliana OX=3702 GN=ASHH2 PE=1 SV=1)

HSP 1 Score: 741.5 bits (1913), Expect = 2.5e-212
Identity = 639/1818 (35.15%), Positives = 866/1818 (47.63%), Query Frame = 0

Query: 149  SASADGLPSDFIQLNELDNDGGCSFSDVMDNRISDGSIVTEAEMLNEMSPLRSGQIL--- 208
            S+  D   SD I L++       SF D + +    G   TE+ + + +    +G I+   
Sbjct: 228  SSDLDTGSSDDISLSQ-----SFSFPDSLLDSSVFGCSATESYLEDAIDIEGNGTIVVSP 287

Query: 209  SVHVGQLVANCDQYICEMDGKNLSGTSGETINEVTDMNDIHESCLQMLPSHGCKKRERLQ 268
            S+ + +++ N D  +C  D   ++ T  ETIN                P     + +RL 
Sbjct: 288  SLAITEMLNNDDGGLCSHDLNKITVT--ETIN----------------PDLKLVREDRLD 347

Query: 269  SNSSPLTIHALENDQCGEMSNSSPKYILEVVEDVDVSTTNNNADAEQYVDPKIKNNYNLE 328
            ++ S +    L+N   G+ S+ S    L +          NN  A          +  ++
Sbjct: 348  TDLSVMNEKMLKN-HVGDSSSESAVAALSM----------NNGMAADLRAENFSQSSPID 407

Query: 329  EATIQLSLNCVELLASPLPSQFPNCERDEFHEMLNVADIPIKDISSVNSCSIGDQDNNDV 388
            E T+ +  N   +  S L   FP        E+ N         ++V    I D +    
Sbjct: 408  EKTLDMEANS-PITDSSLIWNFPLNFGSGGIEVCNPE-------NAVEPLRIVDDNGRIG 467

Query: 389  EKVGCVSEVKCLETVIPFSKRSGRR----KTSSQKTVTKRAPRKSRKKVPN---ALIFDT 448
             +V   S     E  +  S+R  R     K    KT  +   + SRKK        IF  
Sbjct: 468  GEVASASGSDFCEAGMSSSRRKARDGKQCKVVQTKTSARHLRKSSRKKQSERDIESIFKC 527

Query: 449  ARRRRSSISRPARPSLWGSLDYIIQSFENNEDVWVNQSQKQGNKKPKGNRGGTKLNNKQP 508
            ++++RSS+ + +R S WG      + F  + ++  +       ++ +GN     LNN + 
Sbjct: 528  SKQKRSSLLKTSRSSEWGLPSKTTEIFLQSNNIPYDGPPHHEPQRSQGN-----LNNGEH 587

Query: 509  SESSHRSRKGTQVNSATSTSTNRIRLKVKLGKNMGHNFLNIVVPEVVGSSLSAKGINSNY 568
            + SSH         +  ++S + +RLKVK GK+ G N LNI V +V G+SL   GI    
Sbjct: 588  NRSSHNGNVEGSNRNIQASSGSCLRLKVKFGKSGGQNPLNITVSKVSGNSLPGNGIVKA- 647

Query: 569  GNKSYWGGNLEFPPSTIGVDDQKLEEGPLRKIFCYNRNQEKEEKCSDPYIVKEQCATNDS 628
                  G  LE P S    +D+         +   +   EK         ++++    D+
Sbjct: 648  ------GTCLELPGSAHFGEDKMQTVETKEDLVEKSNPVEKVSYLQSSDSMRDKKYNQDA 707

Query: 629  SCTNIVSKLSVEHADDSFAVSS-DLVELVEHASDTRNLDPGTSPDSEVINSILDIQVGAV 688
                +  K+  +  DD   +SS  +VE  E A+ T++LD  TSPDSEVINS+ D  V   
Sbjct: 708  G--GLCRKVGGDVLDDDPHLSSIRMVEECERATGTQSLDAETSPDSEVINSVPDSIVNIE 767

Query: 689  RQENLQESVLASSEDFAASGNVTSSKKGRKKEKPYQAVSCSQEGGTCASACSNGSKSSKK 748
             +E L     ++ ED          KK R  EK  +                  SKS  +
Sbjct: 768  HKEGLHHGFFSTPEDVV--------KKNRVLEKEDEL---------------RASKSPSE 827

Query: 749  HGTRLNVDNQLGAGETFTSVGENVLNDSFTFKELSTESTEIEHPEEALKVERILDVKECC 808
            +G+ L                                                       
Sbjct: 828  NGSHL------------------------------------------------------- 887

Query: 809  KTEACSVFPESENLKMFLPSQSARKKHPKSSKPIKTSDCKPK-DPGSKNKIKNASKERVY 868
                             +P+ + + KHPK SK   T   K K    +K+  KN S E V 
Sbjct: 888  -----------------IPN-AKKAKHPK-SKSNGTKKGKSKFSESAKDGRKNESHEGVE 947

Query: 869  QRKFVNKSKIKKDICEQVVTETESHQIVGNFL---VDKPAKVDDITASTVAINLNAVQGV 928
            QRK +N S  + D     V   ESH+  G  L   + K +      +S V      V   
Sbjct: 948  QRKSLNTSMGRDDSDYPEVGRIESHKTTGALLDADIGKTSATYGTISSDVTHGEMVVDVT 1007

Query: 929  VNEQYTPPRNAWVLCDDCQKWRRIAASLVDSLGHASCTWTCKDNVDKAFADCLIPQEKSN 988
            + + Y+   +AWV CDDC KWRRI AS+V S+  +S  W C +N DK FADC   QE SN
Sbjct: 1008 IEDSYS-TESAWVRCDDCFKWRRIPASVVGSIDESS-RWICMNNSDKRFADCSKSQEMSN 1067

Query: 989  AEINAELEISDESGEENASNKRLTYREFESFHPLTVTAVLQENKFASISSNQFLHRSRKT 1048
             EIN EL I  +  E +A +     R  E           Q+  F +I +NQFLHR+RK+
Sbjct: 1068 EEINEELGIGQD--EADAYDCDAAKRGKEKEQKSKRLTGKQKACFKAIKTNQFLHRNRKS 1127

Query: 1049 QNIDEVMVCHCKPTLDGRLGCANECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLQW 1108
            Q IDE+MVCHCKP+ DGRLGC  ECLNRMLNIEC++GTCP GDLCSNQQFQKRKY K + 
Sbjct: 1128 QTIDEIMVCHCKPSPDGRLGCGEECLNRMLNIECLQGTCPAGDLCSNQQFQKRKYVKFER 1187

Query: 1109 LRCGKKGYGLQLLEDISKGKFLIEYVGEVLDMHAYESRQKEYAFNGHRHFYFMTLNGSEV 1168
             + GKKGYGL+LLED+ +G+FLIEYVGEVLDM +YE+RQKEYAF G +HFYFMTLNG+EV
Sbjct: 1188 FQSGKKGYGLRLLEDVREGQFLIEYVGEVLDMQSYETRQKEYAFKGQKHFYFMTLNGNEV 1247

Query: 1169 IDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVCGA 1228
            IDA  KGNLGRFINHSC+PNCRTEKWMVNGEIC+G+F+++D+KKG+E+TFDYNYVRV GA
Sbjct: 1248 IDAGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKKGQELTFDYNYVRVFGA 1307

Query: 1229 AAKKCYCGSSQCRGYIGGDPHNSEVIIHSDPDEEFPEPVMLRAYGRSSNGNL-PTAASSV 1288
            AAKKCYCGSS CRGYIGGDP N +VII SD DEE+PE V+L     S  G L  T+ +  
Sbjct: 1308 AAKKCYCGSSHCRGYIGGDPLNGDVIIQSDSDEEYPELVILDD-DESGEGILGATSRTFT 1367

Query: 1289 DGAKMQLSERIKGVRNKKE-QPTGIAIQMKI---LEEKEEPFQLSALKISEDPPKLSA-- 1348
            D A  Q+ +  + V   K+  P     Q  +   L E+E P  L  L+ +E   +LS+  
Sbjct: 1368 DDADEQMPQSFEKVNGYKDLAPDNTQTQSSVSVKLPEREIPPPL--LQPTEVLKELSSGI 1427

Query: 1349 SKISEEQEDHHNLSALIISPLHSSLEFEDSKEAKLSFDDIDARKNSELDAIEDKQVYINS 1408
            S  + +QE          SP  SSL       +++S    ++ K ++  + EDK++    
Sbjct: 1428 SITAVQQEVPAEKKTKSTSPTSSSL-------SRMSPGGTNSDKTTKHGSGEDKKILPRP 1487

Query: 1409 HPQMKTSRKQGSIKKGK---VSSVEKVKI--TNKPQILSLKSKRLFEGSPGNRFEAVEEK 1468
             P+MKTSR   S K+ K      V K ++   NK Q   +KSK   + SP    E  E K
Sbjct: 1488 RPRMKTSRSSESSKRDKGGIYPGVNKAQVIPVNKLQQQPIKSKGSEKVSPS--IETFEGK 1547

Query: 1469 LNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMLLDALLKTKSRV 1528
            LNELLDA GGISKR+D+ KGYLKLLLLTAAS      E I SNRDLSM+LDALLKTKS+ 
Sbjct: 1548 LNELLDAVGGISKRRDSAKGYLKLLLLTAAS-RGTDEEGIYSNRDLSMILDALLKTKSKS 1607

Query: 1529 VLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTREILTSEHIYGSPPCPG 1588
            VL DIINKNG                                              P  G
Sbjct: 1608 VLVDIINKNG----------------------------------------------PFAG 1667

Query: 1589 MESLRESLLSLTEHDDKQVHQIARGFRDRWFPRHNRKFGYSGRADGRLEAYRGTNCSRFT 1648
            MES ++S+LS TEHDD  VH IAR FRDRW P+H RK     R + R E+ R     RF 
Sbjct: 1668 MESFKDSVLSFTEHDDYTVHNIARSFRDRWIPKHFRKPWRINREE-RSESMRSPINRRFR 1727

Query: 1649 AS--PSYCHDQDFRPSE--AIDCNKQPSTP--TSLSDFHPAEVCSVPSTAGHSLDGQRIR 1708
            AS  P Y H Q  RP+E  A   + + +TP   S+S+ +      +P T G        R
Sbjct: 1728 ASQEPRYDH-QSPRPAEPAASVTSSKAATPETASVSEGYSEPNSGLPETNG--------R 1727

Query: 1709 KRKSRWDQPADTSIDLRSKEQKLESTSVQRFDSSQSNSVGVASMLVDKINSDNMGSSLSG 1768
            KRKSRWDQP+ T      KEQ++ +   Q+ D +  N                       
Sbjct: 1788 KRKSRWDQPSKT------KEQRIMTILSQQTDETNGN----------------------- 1727

Query: 1769 SVEICCRRDEDIRLDSAVHNTPEDIPPGFSSPFNLPVASSSPFSTILDPPRQSIDNLSCT 1828
                     +D++         +D+PPGFSSP               D P          
Sbjct: 1848 ---------QDVQ---------DDLPPGFSSP-------------CTDVPD--------- 1727

Query: 1829 FSTVGHPQERFISRLPVSYGIPFSIVEQCGTSDSENLEFWCWDVAPGVPFHPFPPLPPYP 1888
             +    PQ++F+SRLPVSYGIP SIV Q G+   E+     W VAPG+PF+PFPPLPP  
Sbjct: 1908 -AITAQPQQKFLSRLPVSYGIPLSIVHQFGSPGKEDPT--TWSVAPGMPFYPFPPLPPVS 1727

Query: 1889 RGKRGPSTSACGTAVTQSDRERQVKSHDSQTSFSEESAPSTSTNYQQGLRILSNNQQTLK 1934
             G+     +            R   S     ++S E  P+T          ++++    +
Sbjct: 1968 HGEFFAKRNV-----------RACSSSMGNLTYSNEILPATP---------VTDSTAPTR 1727

BLAST of Sed0009206 vs. ExPASy Swiss-Prot
Match: E9Q5F9 (Histone-lysine N-methyltransferase SETD2 OS=Mus musculus OX=10090 GN=Setd2 PE=1 SV=1)

HSP 1 Score: 221.1 bits (562), Expect = 1.1e-55
Identity = 124/336 (36.90%), Positives = 186/336 (55.36%), Query Frame = 0

Query: 1018 FASISSNQFLHRSRKTQNIDEV--MVCHCKP-----TLDGRLGCANECLNRMLNIECVRG 1077
            F  I  N +L   +K ++  ++  M C C P        G + C  +CLNR+L IEC   
Sbjct: 1447 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEVACGEDCLNRLLMIEC-SS 1506

Query: 1078 TCPCGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGKFLIEYVGEVLDMHAYES 1137
             CP GD CSN++FQ++++A ++ +   KKG+GL+  +D+    F++EY GEVLD   +++
Sbjct: 1507 RCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKA 1566

Query: 1138 RQKEYAFNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLF 1197
            R KEYA N + H+YFM L   E+IDA  KGN  RF+NHSC+PNC T+KW VNG++ +G F
Sbjct: 1567 RVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFF 1626

Query: 1198 ALRDIKKGEEVTFDYNYVRVCGAAAKKCYCGSSQCRGYIGGDPHNSEVIIHSDPDEEFPE 1257
              + +  G E+TFDY + R  G  A+KC+CGS+ CRGY+GG+                  
Sbjct: 1627 TTKLVPSGSELTFDYQFQRY-GKEAQKCFCGSANCRGYLGGE-----------------N 1686

Query: 1258 PVMLRAYGRSSNGNLPTAASSVDGAKMQLSERIKGVRNKKE--QPTGIAIQMKILEEKEE 1317
             V +RA G            SVDG    L E  +G+ +K +    + + ++++ LE+K  
Sbjct: 1687 RVSIRAAGGKMKKERSRKKDSVDGELEALMENGEGLSDKNQVLSLSRLMVRIETLEQK-- 1746

Query: 1318 PFQLSALKISEDPPKLSASKISEEQEDHHNLSALII 1345
               L+ LK+ ++    S  K   E+   H LS L I
Sbjct: 1747 ---LTCLKLIQNTHSQSCLKSFLER---HGLSLLWI 1755

BLAST of Sed0009206 vs. ExPASy Swiss-Prot
Match: Q9BYW2 (Histone-lysine N-methyltransferase SETD2 OS=Homo sapiens OX=9606 GN=SETD2 PE=1 SV=3)

HSP 1 Score: 219.9 bits (559), Expect = 2.5e-55
Identity = 123/336 (36.61%), Positives = 186/336 (55.36%), Query Frame = 0

Query: 1018 FASISSNQFLHRSRKTQNIDEV--MVCHCKP-----TLDGRLGCANECLNRMLNIECVRG 1077
            F  I  N +L   +K ++  ++  M C C P        G + C  +CLNR+L IEC   
Sbjct: 1473 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIEC-SS 1532

Query: 1078 TCPCGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGKFLIEYVGEVLDMHAYES 1137
             CP GD CSN++FQ++++A ++ +   KKG+GL+  +D+    F++EY GEVLD   +++
Sbjct: 1533 RCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKA 1592

Query: 1138 RQKEYAFNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLF 1197
            R KEYA N + H+YFM L   E+IDA  KGN  RF+NHSC+PNC T+KW VNG++ +G F
Sbjct: 1593 RVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFF 1652

Query: 1198 ALRDIKKGEEVTFDYNYVRVCGAAAKKCYCGSSQCRGYIGGDPHNSEVIIHSDPDEEFPE 1257
              + +  G E+TFDY + R  G  A+KC+CGS+ CRGY+GG+                  
Sbjct: 1653 TTKLVPSGSELTFDYQFQRY-GKEAQKCFCGSANCRGYLGGE-----------------N 1712

Query: 1258 PVMLRAYGRSSNGNLPTAASSVDGAKMQLSERIKGVRNKKE--QPTGIAIQMKILEEKEE 1317
             V +RA G            SVDG    L E  +G+ +K +    + + ++++ LE+K  
Sbjct: 1713 RVSIRAAGGKMKKERSRKKDSVDGELEALMENGEGLSDKNQVLSLSRLMVRIETLEQK-- 1772

Query: 1318 PFQLSALKISEDPPKLSASKISEEQEDHHNLSALII 1345
               L+ L++ ++    S  K   E+   H LS L I
Sbjct: 1773 ---LTCLELIQNTHSQSCLKSFLER---HGLSLLWI 1781

BLAST of Sed0009206 vs. ExPASy Swiss-Prot
Match: Q9VYD1 (Probable histone-lysine N-methyltransferase CG1716 OS=Drosophila melanogaster OX=7227 GN=Set2 PE=1 SV=2)

HSP 1 Score: 204.1 bits (518), Expect = 1.4e-50
Identity = 114/284 (40.14%), Positives = 161/284 (56.69%), Query Frame = 0

Query: 1016 NKFASISSNQFLHRSRKTQNIDEVMVCHCKPTLD----GRLGCANECLNRMLNIECVRGT 1075
            N F  +  N F   +R+    +  M C C  T D    G L C   C+NRML IEC    
Sbjct: 1287 NTFQLLKEN-FYRCARQVSQENAEMQCDCFLTGDEEAQGHLSCGAGCINRMLMIEC-GPL 1346

Query: 1076 CPCGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGKFLIEYVGEVLDMHAYESR 1135
            C  G  C+N++FQ+ +    +  R  KKG G+     I  G+F++EYVGEV+D   +E R
Sbjct: 1347 CSNGARCTNKRFQQHQCWPCRVFRTEKKGCGITAELLIPPGEFIMEYVGEVIDSEEFERR 1406

Query: 1136 QKEYAFNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFA 1195
            Q  Y+ + +RH+YFM L G  VIDA  KGN+ R+INHSCDPN  T+KW VNGE+ IG F+
Sbjct: 1407 QHLYSKDRNRHYYFMALRGEAVIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFS 1466

Query: 1196 LRDIKKGEEVTFDYNYVRVCGAAAKKCYCGSSQCRGYIGGDPHNSE---VIIHSDPDEEF 1255
            ++ I+ GEE+TFDY Y+R  G  A++CYC ++ CRG+IGG+P + E   +   SD D E 
Sbjct: 1467 VKPIQPGEEITFDYQYLRY-GRDAQRCYCEAANCRGWIGGEPDSDEGEQLDEESDSDAEM 1526

Query: 1256 PEPVMLRAYGRSSNGNLPTAASSVDGAKMQLSERIKGVRNKKEQ 1293
             E  +         G    +A +   +K++    +   R +KEQ
Sbjct: 1527 DEEEL---EAEPEEGQPRKSAKAKAKSKLKAKLPLATGRKRKEQ 1564

BLAST of Sed0009206 vs. ExPASy Swiss-Prot
Match: Q84WW6 (Histone-lysine N-methyltransferase ASHH1 OS=Arabidopsis thaliana OX=3702 GN=ASHH1 PE=1 SV=1)

HSP 1 Score: 198.7 bits (504), Expect = 6.0e-49
Identity = 93/235 (39.57%), Positives = 134/235 (57.02%), Query Frame = 0

Query: 1017 KFASISSNQFLHRSRKTQNIDEVMVCHCKPTL-DGRLGCANECLNRMLNIECVRGTCPCG 1076
            ++  I  N F +R  K Q  +++ +C CK    D    C   CLN + N EC  G CPCG
Sbjct: 16   QYEHIYQNDFSYRKHKKQKEEDISICECKFDFGDPDSACGERCLNVITNTECTPGYCPCG 75

Query: 1077 DLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGKFLIEYVGEVLDMHAYESRQKEY 1136
              C NQ+FQK +YAK + ++C  +G+GL  LE+I  G+F++EY GEV+     + R + Y
Sbjct: 76   VYCKNQKFQKCEYAKTKLIKCEGRGWGLVALEEIKAGQFIMEYCGEVISWKEAKKRAQTY 135

Query: 1137 AFNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDI 1196
              +G +  Y ++LN SE IDA  KG+L RFINHSC PNC T KW V GE+ +G+FA   I
Sbjct: 136  ETHGVKDAYIISLNASEAIDATKKGSLARFINHSCRPNCETRKWNVLGEVRVGIFAKESI 195

Query: 1197 KKGEEVTFDYNYVRVCGAAAKKCYCGSSQCRGYIGGDPH--NSEVIIHSDPDEEF 1249
                E+ +DYN+    G A  +C CG+  C G++G        +  +  D D+ +
Sbjct: 196  SPRTELAYDYNF-EWYGGAKVRCLCGAVACSGFLGAKSRGFQEDTYVWEDGDDRY 249

BLAST of Sed0009206 vs. ExPASy TrEMBL
Match: A0A0A0KDR4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G376290 PE=4 SV=1)

HSP 1 Score: 2785.0 bits (7218), Expect = 0.0e+00
Identity = 1495/2051 (72.89%), Positives = 1652/2051 (80.55%), Query Frame = 0

Query: 1    MASSPSSSR---MCEPDRELGVITTTICANVSEVAAAGEDCTFRGLVRADTLSLDERFNS 60
            MAS  S+SR   M EPDR L V T ++C N S+   +GED T RG   AD+L +D+R + 
Sbjct: 107  MASFSSNSREGQMFEPDRGLEVTTASLCTNASDPDTSGEDGTLRGFEHADSLLMDKRLDG 166

Query: 61   DSGDIGLGLNEEDESYNLGNETFSLDMEEPQDDEGLVDILGCKTTMEMMSLTGSLVNSV- 120
            DSG     LN ++ES N GN+T SLDM+E +D +GLVDILGC  TMEM+SLT SLVNSV 
Sbjct: 167  DSGGSDPCLNLDNESCNEGNKTLSLDMKESEDVDGLVDILGCDATMEMISLTESLVNSVK 226

Query: 121  --ELNDDSFVVDTVAKVVR-----------------DDLKSPSHICEIVSNSASADGLPS 180
              EL+++S ++D  AKV R                 DDLKS S++CEIVSNSASADGLP+
Sbjct: 227  PEELDNNSCIIDAPAKVERDDTAQNGPILAGTGTRTDDLKS-SYVCEIVSNSASADGLPN 286

Query: 181  DFIQLNELDNDG-GCSFSDVMDNRISDGSIVTEAEMLNEMSPLRSGQILSVHVGQLVANC 240
            DFIQ NEL+NDG GCSFS+V D RI++ S+  EA+MLNEMSPL+SGQIL +HVGQ +AN 
Sbjct: 287  DFIQKNELENDGAGCSFSEVAD-RITEASVELEADMLNEMSPLQSGQILPIHVGQSIANY 346

Query: 241  DQYICEMDGKNLSGTSGETINEVTDMNDIHESCLQMLPSHGCKK-RERLQSNSSPLTIHA 300
            D+Y+C MDGK+LS TSGET+  V DMN   E CLQMLPS GC +  E LQS+  PLTI+A
Sbjct: 347  DRYVCRMDGKSLSSTSGETVTVVADMNSNPEGCLQMLPSQGCDRIGECLQSDGLPLTINA 406

Query: 301  LENDQCGEM--SNSSPKYILEVVEDVDVSTTNNNADAEQYVDPKIKNNYNLEEATIQLSL 360
             END C E   SNSS KY+ +V  D     TNNN+D  Q+  P I N++NLE+AT+Q++ 
Sbjct: 407  SENDLCEEKHDSNSSSKYVPDVGGDDSDVLTNNNSDGGQHTVPGIGNDHNLEDATVQVNH 466

Query: 361  NCVELLASPLPSQFPNCERDEFHEMLNVADIPIKDISSVNSCSIGDQDNNDVEKVGCVSE 420
            +CVELL+SPLPSQ PN E+DEF+ MLN ADIPIK ISSVNSCS+GDQDNND+EKVGCVSE
Sbjct: 467  DCVELLSSPLPSQLPNSEKDEFYGMLNGADIPIKYISSVNSCSVGDQDNNDIEKVGCVSE 526

Query: 421  VKCLETVIPFSKRSGRRKTSSQKTVTKRAPRKSRKKVPNALIFDTARRRRSSISRPARPS 480
            VKC ETVI  SKRSGRR+TSSQKTVTKRA RK++KKVP  LIFDTARRRRSSISRPARPS
Sbjct: 527  VKCPETVITSSKRSGRRRTSSQKTVTKRASRKTKKKVPEPLIFDTARRRRSSISRPARPS 586

Query: 481  LWGSLDYIIQSFENNEDVWVNQSQKQGNKKPKGNRGGTKLNNKQPSESSHRSRKGTQVNS 540
             WGSL +IIQSFE  +DV VNQ+QKQGN+K KGN+GG K N KQ SESSHRSRKGTQ  S
Sbjct: 587  PWGSLGHIIQSFEEIDDVLVNQTQKQGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQGKS 646

Query: 541  ATSTSTNRIRLKVKLGKNMGHNFLNIVVPEVVGSSLSAKGINSNYGNKSYWGGNLEFPPS 600
            ATSTSTNRIRLKVKLGKN+GHNFLNIVVPE+V SSLSAKG+N NYGN+SYW GNLEFPPS
Sbjct: 647  ATSTSTNRIRLKVKLGKNVGHNFLNIVVPEIVDSSLSAKGVNCNYGNESYWEGNLEFPPS 706

Query: 601  TIGVDDQKL-EEGPLRKIFCYNRNQEKEEKCSDPYIVKEQCATNDSSCTNIVSKLSVEHA 660
             +GVDDQK  EEGPLRKIFCY+RNQ+KE+ C D  +V EQC  NDSSC   + K S +HA
Sbjct: 707  NLGVDDQKAEEEGPLRKIFCYSRNQDKEDNCPDASVVNEQCTNNDSSCIVGIDKSSEKHA 766

Query: 661  DDSFAVSSDLVELVEHASDTRNLDPGTSPDSEVINSILDIQVGAVRQENLQESVLASSED 720
            DD+  VSS LV+ V   SD R+LDPGTSPDSEVINS+LDIQVGA RQE LQ+SVLAS ED
Sbjct: 767  DDNLCVSSHLVDPVA-TSDARSLDPGTSPDSEVINSVLDIQVGAARQEILQDSVLASLED 826

Query: 721  FAASGNVTSSKKGRKKEKPYQAVSCSQEGGTCASACSNGSKSSKKHGTRLNVDNQLGAGE 780
            FAASGN   SKKGRKK+KP + VSCS+E G   SACSN SKSSKKHG R NVDNQL    
Sbjct: 827  FAASGNAPGSKKGRKKDKPSRVVSCSEERGISVSACSNRSKSSKKHGRRHNVDNQL---- 886

Query: 781  TFTSVGENVLNDSFTFKELSTESTEIEHPEEALKVERILDVKECCKTEACSVFPESENLK 840
                                  S+EIE PEE LK E IL+ KECC+ +  SVF ESEN K
Sbjct: 887  ----------------------SSEIELPEETLKAEDILNDKECCRADVGSVFSESENSK 946

Query: 841  MFLPSQSARKKHPKSSKPIKTSDCKPKDPGSKNKIKNASKERVYQRKFVNKSKIKKDICE 900
             FLPSQSA+KKHPK SK IKTS  K K PGSKNKIKNAS ERVYQRK    SK K+ +C+
Sbjct: 947  TFLPSQSAKKKHPKGSKSIKTSKGKSKAPGSKNKIKNASNERVYQRKSFKNSKSKEALCD 1006

Query: 901  QVVTETESHQIVGNFLVDKPAKVDDITASTVAINLNAVQGVVNEQYTPPRNAWVLCDDCQ 960
            QVVTETESHQI+GN LVDKP K D+I ASTVA++L+ VQG VNEQY PPRNAWVLCDDC 
Sbjct: 1007 QVVTETESHQIIGNCLVDKPEKSDNIIASTVAVDLSVVQGAVNEQYMPPRNAWVLCDDCH 1066

Query: 961  KWRRIAASLVDSLGHASCTWTCKDNVDKAFADCLIPQEKSNAEINAELEISDESGEENAS 1020
            KWRRI ASLVDSLGHASCTWTCKDNVDKAFA+C IPQEKSNAEINAELEISDESGEEN S
Sbjct: 1067 KWRRIPASLVDSLGHASCTWTCKDNVDKAFANCSIPQEKSNAEINAELEISDESGEENGS 1126

Query: 1021 NKRLTYREFESFHPLTVTAVLQENKFASISSNQFLHRSRKTQNIDEVMVCHCKPTLDGRL 1080
             KRLTYRE ESFHP TV AV Q+NKFASISSNQFLHRSRKTQ IDE+MVCHCKP LDGRL
Sbjct: 1127 KKRLTYRELESFHPATVNAVPQQNKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGRL 1186

Query: 1081 GCANECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKG 1140
            GC +ECLNRMLNIECVRGTCPCG+LCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKG
Sbjct: 1187 GCGDECLNRMLNIECVRGTCPCGELCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKG 1246

Query: 1141 KFLIEYVGEVLDMHAYESRQKEYAFNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDP 1200
            +FLIEYVGEVLDMHAYE+RQKEYA NGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDP
Sbjct: 1247 QFLIEYVGEVLDMHAYEARQKEYALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDP 1306

Query: 1201 NCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVCGAAAKKCYCGSSQCRGYIGGD 1260
            NCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRV GAAAKKCYCGS  CRGYIGGD
Sbjct: 1307 NCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGD 1366

Query: 1261 PHNSEVIIHSDPDEEFPEPVMLRAYGRSSNGNLPTAASSVDGAKMQLSERIKGVRNKKEQ 1320
            P NSEVII SD DEEFPEPVMLR  GRS N NL TA SS+D AKMQ SE +KG R+K++Q
Sbjct: 1367 PLNSEVIIQSDSDEEFPEPVMLRGDGRSLNSNLSTAVSSMDVAKMQSSEHLKGNRDKRDQ 1426

Query: 1321 PTGIAIQMKILEEKEEPFQLSALKIS---EDPPKLSASKISEEQEDHHNLSALIISPLHS 1380
            P  IA ++KI EEK +P +LSA KIS   EDP KLSA+KISEE+ED  NLSA  ISPLHS
Sbjct: 1427 PIRIASELKISEEKVDPLKLSASKISEEKEDPLKLSATKISEEKEDPLNLSASTISPLHS 1486

Query: 1381 SLEFEDSK---------------------------------------------EAKLSFD 1440
            SLEFEDSK                                             EAKLS D
Sbjct: 1487 SLEFEDSKVASPIPVPDITHQTEDVTSQPIFVDQTEISLLDNIPDKNTCSIEQEAKLSVD 1546

Query: 1441 DIDARKNSELDAIEDKQVYINSHPQMKTSRKQGSIKKGKVSSVEKVKITNKPQILSLKSK 1500
            DIDARK S+LD++EDKQVYI SHP+MKTSRK GSIKKGKVSS EK++ITN+ QI S+K K
Sbjct: 1547 DIDARKKSKLDSVEDKQVYIKSHPRMKTSRKLGSIKKGKVSSAEKIQITNRSQISSVKPK 1606

Query: 1501 RLFEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSN 1560
            RL EGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSN
Sbjct: 1607 RLIEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSN 1666

Query: 1561 RDLSMLLDALLKTKSRVVLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVT 1620
            RDLSM+LDALLKTKSR+VLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVT
Sbjct: 1667 RDLSMILDALLKTKSRLVLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVT 1726

Query: 1621 REILTSEHIYGSPPCPGMESLRESLLSLTEHDDKQVHQIARGFRDRWFPRHNRKFGYSGR 1680
            REILTSEHI G PPCPGMESLRESLLSLTEHDDKQVHQIAR FRDRWFPRH RKFGYS R
Sbjct: 1727 REILTSEHINGGPPCPGMESLRESLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFGYSER 1786

Query: 1681 ADGRLEAYRGTNCSRFTASPSYCHDQDFRPSEAIDCNKQPSTPTSLSDFHPAEVCSVPST 1740
             DGRLE YRG+N SRFTAS S+ HDQD RP++AIDC KQ S PTSL D HPAEVCS+ S 
Sbjct: 1787 EDGRLEVYRGSNSSRFTASHSFRHDQDCRPTDAIDCIKQ-SMPTSLPDAHPAEVCSLASA 1846

Query: 1741 AGHSLDGQRIRKRKSRWDQPADTSIDLRSKEQKLESTSVQRFDSSQSNSVGVASMLVDKI 1800
            A HS++GQ++RKRKSRWDQPADTS+DLRSKEQKLESTSVQ  +SSQ NSVG ASML+DK+
Sbjct: 1847 ASHSVNGQKVRKRKSRWDQPADTSLDLRSKEQKLESTSVQELNSSQLNSVGAASMLIDKV 1906

Query: 1801 NSDNMGSSLSGSVEICCRRDEDIRLDSAVHNTPEDIPPGFSSPFNLPVASSSPFSTILDP 1860
            N+D+   SLS SV + CR+DEDIR DSAV N PEDIPPGFSSPFN PVASSS FS +LDP
Sbjct: 1907 NNDDKDISLSDSVGVPCRQDEDIRADSAVPNIPEDIPPGFSSPFNPPVASSSAFSAVLDP 1966

Query: 1861 PRQSIDNLSCTFSTVGHPQERFISRLPVSYGIPFSIVEQCGTSDSENLEFWCWDVAPGVP 1920
            PRQ+I +LSC FSTVGH QERFISRLPVSYGIPFSI+EQCGTS +ENLE  CWDVAPGVP
Sbjct: 1967 PRQNIGDLSCAFSTVGHLQERFISRLPVSYGIPFSIIEQCGTSHAENLE--CWDVAPGVP 2026

Query: 1921 FHPFPPLPPYPRGKRGPSTSACGTAVTQSDRERQVKSHDSQTSFSEESAPSTSTNYQQGL 1976
            FHPFPPLPPYPRG RG  TSACGTA  QS +E QV SHDS+TSFSEES PSTSTNYQ  L
Sbjct: 2027 FHPFPPLPPYPRGMRGLPTSACGTA-GQSSQEGQVNSHDSRTSFSEESPPSTSTNYQTDL 2086

BLAST of Sed0009206 vs. ExPASy TrEMBL
Match: A0A5A7UPQ6 (Histone-lysine N-methyltransferase ASHH2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold221G001140 PE=4 SV=1)

HSP 1 Score: 2783.4 bits (7214), Expect = 0.0e+00
Identity = 1500/2054 (73.03%), Positives = 1652/2054 (80.43%), Query Frame = 0

Query: 1    MASSPSSSR---MCEPDRELGVITTTICANVSEVAAAGEDCTFRGLVRADTLSLDERFNS 60
            MAS  SSSR   M EPDR LGV T ++C N S+    GED T      AD+L +D+R + 
Sbjct: 38   MASFSSSSREGQMFEPDRGLGVTTASVCMNASDPDTYGEDGTLGAFEHADSLLMDKRLDG 97

Query: 61   DSGDIGLGLNEEDESYNLGNETFSLDMEEPQDDEGLVDILGCKTTMEMMSLTGSLVNSV- 120
            D G     LN E+ES N GN T SLDM+E +D +G VDILGC  TMEM+SLT SLVNSV 
Sbjct: 98   DFGGSDPCLNLENESCNEGNRTLSLDMKESEDVDGFVDILGCDATMEMISLTESLVNSVK 157

Query: 121  --ELNDDSFVVDTVAKVVR-----------------DDLKSPSHICEIVSNSASADGLPS 180
              EL+ +S + D  AKV R                 DDLKS S++CEIVSNSASADGLP+
Sbjct: 158  PEELDKNSCIFDAPAKVERDDTVQNGPILVGTGTRTDDLKS-SYVCEIVSNSASADGLPN 217

Query: 181  DFIQLNELDNDG-GCSFSDVMDNRISDGSIVTEAEMLNEMSPLRSGQILSVHVGQLVANC 240
            DFIQ N+++NDG GCSFS+V D RI++ S+  EA+MLNE+SPL+SGQIL + VGQ +ANC
Sbjct: 218  DFIQQNKMENDGAGCSFSEVAD-RITEASVELEADMLNEISPLQSGQILPIDVGQSIANC 277

Query: 241  DQYICEMDGKNLSGTSGETINEVTDMNDIHESCLQMLPSHGCKK-RERLQSNSSPLTIHA 300
            D+Y+C+MDGK+LS TSGET+ EV DMN   E CLQMLPS GC +  E LQS+  PLTIHA
Sbjct: 278  DRYVCQMDGKSLSSTSGETVIEVADMNSNPEVCLQMLPSQGCDRIGECLQSDGLPLTIHA 337

Query: 301  LENDQCGEM--SNSSPKYILEVVEDVDVSTTNNNADAEQYVDPKIKNNYNLEEATIQLSL 360
             END C E   SNSS KYI +V  D     TNNN+D  Q+V P I N++NLE+AT+Q++ 
Sbjct: 338  SENDLCEEKHDSNSSSKYIPDVGGDDSDVLTNNNSDGGQHVVPGIGNDHNLEDATVQVNH 397

Query: 361  NCVELLASPLPSQFPNCERDEFHEMLNVADIPIKDISSVNSCSIGDQDNNDVEKVGCVSE 420
            NCVELLASPLPSQ PN E+DEF+  L   DIPIK ISSVNS  +GDQDNND+ KVGCVSE
Sbjct: 398  NCVELLASPLPSQPPNSEKDEFYGTLK-EDIPIKYISSVNSRCLGDQDNNDIGKVGCVSE 457

Query: 421  VKCLETVIPFSKRSGRRKTSSQKTVTKRAPRKSRKKVPNALIFDTARRRRSSISRPARPS 480
            VKC ETVI  SKRSGRR+TSSQK VTKRA RK++KKVP  LIFDT RRRRSSISR ARPS
Sbjct: 458  VKCPETVIMSSKRSGRRRTSSQKAVTKRASRKTKKKVPEPLIFDTTRRRRSSISRSARPS 517

Query: 481  LWGSLDYIIQSFENNEDVWVNQSQKQGNKKPKGNRGGTKLNNKQPSESSHRSRKGTQVNS 540
             WGSL +IIQSFE  +DV VNQ+QKQGN+K KGN+GG K N KQ SESSHRSRKGTQ   
Sbjct: 518  PWGSLGHIIQSFEEIDDVLVNQTQKQGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQGKP 577

Query: 541  ATSTSTNRIRLKVKLGKNMGHNFLNIVVPEVVGSSLSAKGINSNYGNKSYWGGNLEFPPS 600
            ATSTSTNRIRLKVKLGKN+GHNFLNIVVPE+V SSLSAKG+N NYGN SYW GNLEFPPS
Sbjct: 578  ATSTSTNRIRLKVKLGKNVGHNFLNIVVPEIVDSSLSAKGVNCNYGNDSYWEGNLEFPPS 637

Query: 601  TIGVDDQKLEEGPLRKIFCYNRNQEKEEKCSDPYIVKEQCATNDSSCTNIVSKLSVEHAD 660
            T+GVDDQK+EEGPLRKIFCY+RNQ+KEEKC D  +V EQC  NDSSC   + K S +HAD
Sbjct: 638  TLGVDDQKVEEGPLRKIFCYSRNQDKEEKCPDASVVNEQCTNNDSSCIIGIDKSSEKHAD 697

Query: 661  DSFAVSSDLVELVEHASDTRNLDPGTSPDSEVINSILDIQVGAVRQENLQESVLASSEDF 720
            D+  VSS LVE VE  SDTR+LDPGTSPDSEVINS+LDIQVGA RQE L +SVLAS EDF
Sbjct: 698  DNLCVSSHLVEPVERTSDTRSLDPGTSPDSEVINSVLDIQVGAARQEILPDSVLASLEDF 757

Query: 721  AASGNVTSSKKGRKKEKPYQAVSCSQEGGTCASACSNGSKSSKKHGTRLNVDNQLGAGET 780
            AASGN   SKKGRKK+KP +AVSCS E G   SACSN SKSSKKHG R NVDNQLG+GET
Sbjct: 758  AASGNAPGSKKGRKKDKPSRAVSCSGERGISVSACSNRSKSSKKHGRRQNVDNQLGSGET 817

Query: 781  FTSVGENVLNDSFTFKELSTES----TEIEHPEEALKVERILDVKECCKTEACSVFPESE 840
            FT    NVLN S T +ELS E     TEIE PE+ LK + IL+ KECC+ +  S FPESE
Sbjct: 818  FTYSDANVLNYSLTVEELSMEQVSLLTEIELPEDTLKADDILNDKECCRADVGSTFPESE 877

Query: 841  NLKMFLPSQSARKKHPKSSKPIKTSDCKPKDPGSKNKIKNASKERVYQRKFVNKSKIKKD 900
            N K FLPSQSA+KKHPK SK IKTS  K K PGSKNKIKNAS ERVYQRK   KSK K+ 
Sbjct: 878  NSKTFLPSQSAKKKHPKGSKSIKTSKGKSKAPGSKNKIKNASNERVYQRKSFKKSKSKEA 937

Query: 901  ICEQVVTETESHQIVGNFLVDKPAKVDDITASTVAINLNAVQGVVNEQYTPPRNAWVLCD 960
            +C++VVTETESHQI+GN LVDKP K D+I ASTVA++L+ VQG VNEQY PPRNAWVLCD
Sbjct: 938  LCDRVVTETESHQIIGNCLVDKPEKSDNIIASTVAVDLSVVQGAVNEQYMPPRNAWVLCD 997

Query: 961  DCQKWRRIAASLVDSLGHASCTWTCKDNVDKAFADCLIPQEKSNAEINAELEISDESGEE 1020
            DC KWRRI ASLVDSLGHASCTWTCKDNVDKAFA+C IPQEKSNAEINAELEISDESGEE
Sbjct: 998  DCHKWRRIPASLVDSLGHASCTWTCKDNVDKAFANCSIPQEKSNAEINAELEISDESGEE 1057

Query: 1021 NASNKRLTYREFESFHPLTVTAVLQENKFASISSNQFLHRSRKTQNIDEVMVCHCKPTLD 1080
            N S KRLTYRE ESFHP TVTA+ QENKFASISSNQFLHRSRKTQ IDE+MVCHCKP LD
Sbjct: 1058 NGSKKRLTYRELESFHPATVTAIPQENKFASISSNQFLHRSRKTQTIDEIMVCHCKPALD 1117

Query: 1081 GRLGCANECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDI 1140
            GRLGC +ECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDI
Sbjct: 1118 GRLGCGDECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDI 1177

Query: 1141 SKGKFLIEYVGEVLDMHAYESRQKEYAFNGHRHFYFMTLNGSEVIDACGKGNLGRFINHS 1200
            SKG+FLIEYVGEVLDM+AYE+RQKEYA NGHRHFYFMTLNGSEVIDACGKGNLGRFINHS
Sbjct: 1178 SKGQFLIEYVGEVLDMNAYEARQKEYALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHS 1237

Query: 1201 CDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVCGAAAKKCYCGSSQCRGYI 1260
            CDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRV GAAAKKCYCGS  CRGYI
Sbjct: 1238 CDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYI 1297

Query: 1261 GGDPHNSEVIIHSDPDEEFPEPVMLRAYGRSSNGNLPTAASSVDGAKMQLSERIKGVRNK 1320
            GGDP NSEVII SD DEEFPEPVMLRA GRS N NL TA SS+D AKMQ SE +KG R+K
Sbjct: 1298 GGDPLNSEVIIQSDSDEEFPEPVMLRADGRSWNNNLSTAVSSMDVAKMQPSEHLKGNRDK 1357

Query: 1321 KEQPTGIAIQMKILEEKEEPFQLSALKIS---EDPPKLSASKISEEQEDHHNLSALIISP 1380
            ++QP  IA ++KI EEK +  +L A KIS   EDP KLSA K SEE+ED  NLSA  ISP
Sbjct: 1358 RDQPIRIASELKISEEKVDTLKLPASKISEEKEDPLKLSALKTSEEKEDPLNLSASTISP 1417

Query: 1381 LHSSLEFEDSK---------------------------------------------EAKL 1440
            LHSSLEFEDSK                                             EAKL
Sbjct: 1418 LHSSLEFEDSKVASPIPVPDITHQTEDVTSKPIFVDQTGISLLDNISDKNTCSIEQEAKL 1477

Query: 1441 SFDDIDARKNSELDAIEDKQVYINSHPQMKTSRKQGSIKKGKVSSVEKVKITNKPQILSL 1500
            S DDIDARK S+LD++EDK+VYI SHP+MKTSRK GS+KKGKVSSVEK++ITN+  I S+
Sbjct: 1478 SVDDIDARKKSKLDSVEDKKVYIKSHPRMKTSRKPGSVKKGKVSSVEKIQITNRSLISSV 1537

Query: 1501 KSKRLFEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAI 1560
            K KRL EGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAI
Sbjct: 1538 KPKRLIEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAI 1597

Query: 1561 QSNRDLSMLLDALLKTKSRVVLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEY 1620
            QSNRDLSM+LDALLKTKSR+VLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEY
Sbjct: 1598 QSNRDLSMILDALLKTKSRLVLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEY 1657

Query: 1621 LVTREILTSEHIYGSPPCPGMESLRESLLSLTEHDDKQVHQIARGFRDRWFPRHNRKFGY 1680
            LVTREILTSEHI G PPCPGMESLRESLLSLTEHDDKQVHQIAR FRDRWFPRH RKFGY
Sbjct: 1658 LVTREILTSEHINGGPPCPGMESLRESLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFGY 1717

Query: 1681 SGRADGRLEAYRGTNCSRFTASPSYCHDQDFRPSEAIDCNKQPSTPTSLSDFHPAEVCSV 1740
            S R DGRLE YRG+N SRFTAS SY HDQD RP++AIDC KQ S PT L D H AEVCS+
Sbjct: 1718 SEREDGRLEVYRGSNSSRFTASHSYRHDQDCRPTDAIDCIKQ-SMPTPLPDAHTAEVCSL 1777

Query: 1741 PSTAGHSLDGQRIRKRKSRWDQPADTSIDLRSKEQKLESTSVQRFDSSQSNSVGVASMLV 1800
             S AG S++GQ++RKRKSRWDQPADTS+DLRSKEQKLESTSVQ  +SSQ NSV VASML+
Sbjct: 1778 ASVAGPSVNGQKVRKRKSRWDQPADTSLDLRSKEQKLESTSVQELNSSQLNSVRVASMLI 1837

Query: 1801 DKINSDNMGSSLSGSVEICCRRDEDIRLDSAVHNTPEDIPPGFSSPFNLPVASSSPFSTI 1860
            DK+N+D+  SSLS SV + CR+DED R DSAV N PEDIPPGFSSPFN  VASSS FS +
Sbjct: 1838 DKVNNDDKDSSLSDSVGVPCRQDEDTRADSAVPNIPEDIPPGFSSPFNPSVASSSAFSAV 1897

Query: 1861 LDPPRQSIDNLSCTFSTVGHPQERFISRLPVSYGIPFSIVEQCGTSDSENLEFWCWDVAP 1920
            LDPP+Q+I  LSC FSTVGH QERFISRLPVSYGIPFSI+EQCGTS +ENLE  CWDVAP
Sbjct: 1898 LDPPQQNIGYLSCAFSTVGHLQERFISRLPVSYGIPFSIIEQCGTSRAENLE--CWDVAP 1957

Query: 1921 GVPFHPFPPLPPYPRGKRGPSTSACGTAVTQSDRERQVKSHDSQTSFSEESAPSTSTNYQ 1976
            GVPFHPFPPLPPYPRG  GP TSACGTA  QS +E QV SHDS+TSFSEES PSTSTNYQ
Sbjct: 1958 GVPFHPFPPLPPYPRGMSGPRTSACGTA-GQSSQEGQVNSHDSRTSFSEESPPSTSTNYQ 2017

BLAST of Sed0009206 vs. ExPASy TrEMBL
Match: A0A6J1JXF7 (histone-lysine N-methyltransferase ASHH2-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111489728 PE=4 SV=1)

HSP 1 Score: 2768.8 bits (7176), Expect = 0.0e+00
Identity = 1502/2051 (73.23%), Positives = 1652/2051 (80.55%), Query Frame = 0

Query: 1    MASSPSSS---RMCEPDRELGVITTTICANVSEVAAAGEDCTFRGLVRADTLSLDERFNS 60
            M+S PS+S   +M EPDR LGV  T IC N SE   AGED TFRG   ADTL LD+R + 
Sbjct: 38   MSSLPSNSSDCQMSEPDRGLGVTATNICMNASEPDTAGEDGTFRGFEHADTLLLDKRLDC 97

Query: 61   DSGDIGLGLNEEDESYNLGNETFSLDMEEPQDDEGLVDILGCKTTMEMMSLTGSLVNSVE 120
            DSGD    LNEE+E+ N+ N T  LDM+E QD + LVDILGCKTTMEMMSL GSL+NSV+
Sbjct: 98   DSGDSDPCLNEENEACNVENRTLKLDMKESQDVDDLVDILGCKTTMEMMSLAGSLMNSVK 157

Query: 121  ---LNDDSFVVDTVAKV-----------------VRDDLKSPSHICEIVSNSASADGLPS 180
               L+ +S ++D   KV                   DDLKSP HICEIVS+SASADGL S
Sbjct: 158  SEGLDHNSCIIDASEKVESGDIVENGPLLARIGTCTDDLKSP-HICEIVSSSASADGLSS 217

Query: 181  DFIQLNELDNDG-GCSFSDVMDNRISDGSIVTEAEMLNEMSPLRSGQILSVHVGQLVANC 240
            DFI   +L+NDG GCSFS+V D R+++  +  +A+MLNEMSPL+S QIL  H+G+ VANC
Sbjct: 218  DFIHQKQLENDGAGCSFSEVAD-RLTEALVEIDADMLNEMSPLQSDQILPTHMGRSVANC 277

Query: 241  DQYICEMDGKNLSGTSGETINEVTDMNDIHESCLQMLPSHGCKK-RERLQSNSSPLTIHA 300
            +QYIC+MDGK+LSGTSGET+NE  DMN   E CLQMLPS GC++ RE LQ++ SPLTIH+
Sbjct: 278  EQYICQMDGKSLSGTSGETVNEFADMNSNPELCLQMLPSQGCERIRECLQADDSPLTIHS 337

Query: 301  LENDQCGEM--SNSSPKYILEVVEDVDVSTTNNNADAEQYVDPKIKNNYNLEEATIQLSL 360
             E ++C E   SNS PKYI EV +D  V  T+ N D  Q++ P ++NN NLEEA+IQ + 
Sbjct: 338  PEINRCDEKHDSNSLPKYIPEVGDDDFVVLTDINGDGGQHIVPDMENNCNLEEASIQENT 397

Query: 361  NCVELLASPLPSQFPNCERDEFHEMLNVADIPIKDISSVNSCSIGDQDNNDVEKVGCVSE 420
            NCVELLASPLPSQ  N E+ EF+ ML  AD+PIKD    NSCS+ DQDNND EKVG VSE
Sbjct: 398  NCVELLASPLPSQPFNSEKYEFYGMLIGADMPIKD----NSCSVSDQDNNDTEKVGRVSE 457

Query: 421  VKCLETVIPFSKRSGRRKTSSQKTVTKRAPRKSRKKVPNALIFDTARRRRSSISRPARPS 480
            VKC ETV+  SKRSGRR+ SSQK VTKRA RKS+K VP  LIFDT RRRRSSISRPARP 
Sbjct: 458  VKCPETVLMSSKRSGRRRMSSQKNVTKRASRKSKKIVPEPLIFDTTRRRRSSISRPARPL 517

Query: 481  LWGSLDYIIQSFENNEDVWVNQSQKQGNKKPKGNRGGTKLNNKQPSESSHRSRKGTQVNS 540
             WGSL +IIQSFE  +DV VNQS+KQGN+K KGN+GGTK + KQPSESSHRSRKGTQ   
Sbjct: 518  PWGSLGFIIQSFEKIDDVLVNQSKKQGNEKSKGNQGGTKRSKKQPSESSHRSRKGTQGKC 577

Query: 541  ATSTSTNRIRLKVKLGKNMGHNFLNIVVPEVVGSSLSAKGINSNYGNKSYWGGNLEFPPS 600
             TSTSTNRIRLKVKLGKN+GHNFLNIVVPE+V SSLSAKGIN +YGN+SYW GNLEFPPS
Sbjct: 578  DTSTSTNRIRLKVKLGKNVGHNFLNIVVPEIVDSSLSAKGINCHYGNESYWEGNLEFPPS 637

Query: 601  TIGVDDQKLEEGPLRKIFCYNRNQEKEEKCSDPYIVKEQCATNDSSCTNIVSKLSVEHAD 660
               VDDQK EEGPLRKIFCY++NQ KEEKC D  +V EQCA NDSSCT  + K S +HAD
Sbjct: 638  ---VDDQKPEEGPLRKIFCYSKNQGKEEKCPDASVVNEQCANNDSSCTVTIDKSSTKHAD 697

Query: 661  DSFAVSSDLVELVEHASDTRNLDPGTSPDSEVINSILDIQVGAVRQENLQESVLASSEDF 720
            D+  VSS LVE VE ASDTR LDPGTSPDSEVINS+LDIQVGA+RQE LQ+SVL S EDF
Sbjct: 698  DNLCVSSHLVEPVERASDTRCLDPGTSPDSEVINSMLDIQVGAMRQEKLQDSVLPSLEDF 757

Query: 721  AASGNVTSSKKGRKKEKPYQAVSCSQEGGTCASACSNGSKSSKKHGTRLNVDNQLGAGET 780
            AASGN TSSKKGRKKEKP QAVSCS E GT ASAC+N SKSSKKHG RLNVDNQLG+GET
Sbjct: 758  AASGNATSSKKGRKKEKPCQAVSCSDEAGTGASACNNRSKSSKKHGRRLNVDNQLGSGET 817

Query: 781  FTSVGENVLNDSFTFKELSTE----STEIEHPEEALKVERILDVKECCKTEACSVFPESE 840
            FT    NV+N S T KELS +    STEIE PEEALK + IL+ KEC +T+  SVFPESE
Sbjct: 818  FTYTDANVVNYSLTVKELSMDQVPLSTEIELPEEALKADGILEDKECYRTDVGSVFPESE 877

Query: 841  NLKMFLPSQSARKKHPKSSKPIKTSDCKPKDPGSKNKIKNASKERVYQRKFVNKSKIKKD 900
            N K FLPSQSA KKHPK SK IKTS  K K PGSKNKIKNASKERVY+RK  NKS I + 
Sbjct: 878  NSKTFLPSQSAGKKHPKGSKSIKTSKGKSKAPGSKNKIKNASKERVYRRKSFNKS-ITEA 937

Query: 901  ICEQVVTETESHQIVGNFLVDKPAKVDDITASTVAINLNAVQGVVNEQYTPPRNAWVLCD 960
            +C+QVVTETESHQIVGN+LVDKP K +DI ASTVA+NLN VQG VNEQY PPRNAWVLCD
Sbjct: 938  LCDQVVTETESHQIVGNYLVDKPEKSNDIIASTVAVNLNVVQGAVNEQYMPPRNAWVLCD 997

Query: 961  DCQKWRRIAASLVDSLGHASCTWTCKDNVDKAFADCLIPQEKSNAEINAELEISDESGEE 1020
            DC KWRRI ASLVDSLGHASCTWTCK+NVDKAFADC IPQEKSNAEINAELEISDESGEE
Sbjct: 998  DCHKWRRIPASLVDSLGHASCTWTCKENVDKAFADCSIPQEKSNAEINAELEISDESGEE 1057

Query: 1021 NASNKRLTYREFESFHPLTVTAVLQENKFASISSNQFLHRSRKTQNIDEVMVCHCKPTLD 1080
            NASNKRLTYRE +SFHP TVTAV QENKF SISSN FLHRSRKTQ IDE+MVCHCKP LD
Sbjct: 1058 NASNKRLTYRELDSFHPTTVTAVPQENKFTSISSNHFLHRSRKTQTIDEIMVCHCKPALD 1117

Query: 1081 GRLGCANECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDI 1140
            GRLGC +ECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQ +EDI
Sbjct: 1118 GRLGCGDECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQSVEDI 1177

Query: 1141 SKGKFLIEYVGEVLDMHAYESRQKEYAFNGHRHFYFMTLNGSEVIDACGKGNLGRFINHS 1200
            SKG+FLIEYVGEVLDMHAYE+RQKEYA NGHRHFYFMTL+GSEVIDACGKGNLGRFINHS
Sbjct: 1178 SKGQFLIEYVGEVLDMHAYEARQKEYALNGHRHFYFMTLDGSEVIDACGKGNLGRFINHS 1237

Query: 1201 CDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVCGAAAKKCYCGSSQCRGYI 1260
            CDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRV GAAAKKCYCGS QCRGYI
Sbjct: 1238 CDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVFGAAAKKCYCGSFQCRGYI 1297

Query: 1261 GGDPHNSEVIIHSDPDEEFPEPVMLRAYGRSSNGNLPTAASSVDGAKMQLSERIKGVRNK 1320
            GGDP NSEVII SD DEEFPEPVMLR  GRS N NLPT  S +DG K Q SE IKGVR+K
Sbjct: 1298 GGDPLNSEVIIQSDSDEEFPEPVMLRPDGRSWNNNLPTTVSLLDGVKKQPSEHIKGVRDK 1357

Query: 1321 KEQPTGIAIQMKILEEKEEPFQLSALKISEDPPKLSASKISEEQEDHHNLSALIISPLHS 1380
            K+QP+  +++ KI +EK            ED  KLSASKISE +ED  NLSA  ISPLHS
Sbjct: 1358 KDQPSRTSVESKISDEK------------EDTLKLSASKISEAKEDPLNLSASTISPLHS 1417

Query: 1381 SLEFEDSK---------------------------------------------EAKLSFD 1440
            SLEFEDSK                                             EAKLS  
Sbjct: 1418 SLEFEDSKVASPTPLADITHQTEDVTSQPVFVDQPEISPGDNNSDKNTCSIEQEAKLSVA 1477

Query: 1441 DIDARKNSELDAIEDKQVYINSHPQMKTSRKQGSIKKGKVSSVEKVKITNKPQILSLKSK 1500
            DIDARK S+L AIEDK+VYI SH +MKTSRK GSIKKGKVSSVEKV+I N+PQI S+K K
Sbjct: 1478 DIDARKKSKLVAIEDKKVYIKSHLRMKTSRKPGSIKKGKVSSVEKVQIANRPQISSVKPK 1537

Query: 1501 RLFEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSN 1560
            RL +GSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSN
Sbjct: 1538 RLVDGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSN 1597

Query: 1561 RDLSMLLDALLKTKSRVVLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVT 1620
            RDLSM+LDALLKTKSRVVLTDI+NKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLV 
Sbjct: 1598 RDLSMILDALLKTKSRVVLTDIMNKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVM 1657

Query: 1621 REILTSEHIYGSPPCPGMESLRESLLSLTEHDDKQVHQIARGFRDRWFPRHNRKFGYSGR 1680
            REILTSEHI G PPCPGMESLR+SLLSLTEHDDKQVHQIAR FRDRWFPRH RKF YS R
Sbjct: 1658 REILTSEHINGGPPCPGMESLRDSLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFVYSER 1717

Query: 1681 ADGRLEAYRGTNCSRFTASPSYCHDQDFRPSEAIDCNKQPSTPTSLSDFHPAEVCSVPST 1740
             DGRLE YRG+NCSRFTAS SY  DQD RP++AIDC KQ S  TSL D HPAEVCS+ ST
Sbjct: 1718 EDGRLEVYRGSNCSRFTASHSYRRDQDSRPTDAIDCVKQ-SLSTSLPDAHPAEVCSMAST 1777

Query: 1741 AGHSLDGQRIRKRKSRWDQPADTSIDLRSKEQKLESTSVQRFDSSQSNSVGVASMLVDKI 1800
            AGHSL+GQ++ KRKSRWDQPADTS+DLRSKEQKLES SVQ+F+SSQ +SVGV SML+DK+
Sbjct: 1778 AGHSLNGQKVCKRKSRWDQPADTSLDLRSKEQKLESKSVQQFNSSQLSSVGVVSMLIDKV 1837

Query: 1801 NSDNMGSSLSGSVEICCRRDEDIRLDSAVHNTPEDIPPGFSSPFNLPVASSSPFSTILDP 1860
            NSD+   SLS SV +   +DEDIR DSAV N PEDIPPGF  PF+LPVASSS FST+LDP
Sbjct: 1838 NSDDKDFSLSDSVGVRGSQDEDIRADSAVQNIPEDIPPGF-FPFSLPVASSSAFSTVLDP 1897

Query: 1861 PRQSIDNLSCTFSTVGHPQERFISRLPVSYGIPFSIVEQCGTSDSENLEFWCWDVAPGVP 1920
            PRQSI  LSC FSTVG+PQE+FIS LPVSYGIPFSIVEQCGTS +ENLE  CWDVAPG+P
Sbjct: 1898 PRQSIGKLSCAFSTVGYPQEKFISCLPVSYGIPFSIVEQCGTSCAENLE--CWDVAPGMP 1957

Query: 1921 FHPFPPLPPYPRGKRGPSTSACGTAVTQSDRERQVKSHDSQTSFSEESAPSTSTNYQQGL 1976
            FHPFPPLPPYPRGKRG  TSACGTAV QS +E QV  HDS+TSFSEE+ PSTSTNYQQ L
Sbjct: 1958 FHPFPPLPPYPRGKRGLPTSACGTAVRQSSQEMQVNCHDSRTSFSEETPPSTSTNYQQDL 2017

BLAST of Sed0009206 vs. ExPASy TrEMBL
Match: A0A6J1K197 (histone-lysine N-methyltransferase ASHH2-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111489728 PE=4 SV=1)

HSP 1 Score: 2753.4 bits (7136), Expect = 0.0e+00
Identity = 1498/2051 (73.04%), Positives = 1647/2051 (80.30%), Query Frame = 0

Query: 1    MASSPSSS---RMCEPDRELGVITTTICANVSEVAAAGEDCTFRGLVRADTLSLDERFNS 60
            M+S PS+S   +M EPDR LGV  T IC N SE   AGED TFRG   ADTL LD+R + 
Sbjct: 38   MSSLPSNSSDCQMSEPDRGLGVTATNICMNASEPDTAGEDGTFRGFEHADTLLLDKRLDC 97

Query: 61   DSGDIGLGLNEEDESYNLGNETFSLDMEEPQDDEGLVDILGCKTTMEMMSLTGSLVNSVE 120
            DSGD    LNEE+E+ N+ N T  LDM+E QD + LVDILGCKTTMEMMSL GSL+NSV+
Sbjct: 98   DSGDSDPCLNEENEACNVENRTLKLDMKESQDVDDLVDILGCKTTMEMMSLAGSLMNSVK 157

Query: 121  ---LNDDSFVVDTVAKV-----------------VRDDLKSPSHICEIVSNSASADGLPS 180
               L+ +S ++D   KV                   DDLKSP HICEIVS+SASADGL S
Sbjct: 158  SEGLDHNSCIIDASEKVESGDIVENGPLLARIGTCTDDLKSP-HICEIVSSSASADGLSS 217

Query: 181  DFIQLNELDNDG-GCSFSDVMDNRISDGSIVTEAEMLNEMSPLRSGQILSVHVGQLVANC 240
            DFI   +L+NDG GCSFS+V D R+++  +  +A+MLNEMSPL+S QIL  H+G+ VANC
Sbjct: 218  DFIHQKQLENDGAGCSFSEVAD-RLTEALVEIDADMLNEMSPLQSDQILPTHMGRSVANC 277

Query: 241  DQYICEMDGKNLSGTSGETINEVTDMNDIHESCLQMLPSHGCKK-RERLQSNSSPLTIHA 300
            +QYIC+MDGK+LSGTSGET+NE  DMN   E CLQMLPS GC++ RE LQ++ SPLTIH+
Sbjct: 278  EQYICQMDGKSLSGTSGETVNEFADMNSNPELCLQMLPSQGCERIRECLQADDSPLTIHS 337

Query: 301  LENDQCGEM--SNSSPKYILEVVEDVDVSTTNNNADAEQYVDPKIKNNYNLEEATIQLSL 360
             E ++C E   SNS PKYI EV +D  V  T+ N D  Q++ P ++NN NLEEA+IQ + 
Sbjct: 338  PEINRCDEKHDSNSLPKYIPEVGDDDFVVLTDINGDGGQHIVPDMENNCNLEEASIQENT 397

Query: 361  NCVELLASPLPSQFPNCERDEFHEMLNVADIPIKDISSVNSCSIGDQDNNDVEKVGCVSE 420
            NCVELLASPLPSQ  N E+ EF+ ML  AD+PIKD    NSCS+ DQDNND EKVG VSE
Sbjct: 398  NCVELLASPLPSQPFNSEKYEFYGMLIGADMPIKD----NSCSVSDQDNNDTEKVGRVSE 457

Query: 421  VKCLETVIPFSKRSGRRKTSSQKTVTKRAPRKSRKKVPNALIFDTARRRRSSISRPARPS 480
            VKC ETV+  SKRSGRR+ SSQK VTKRA RKS+K VP  LIFDT RRRRSSISRPARP 
Sbjct: 458  VKCPETVLMSSKRSGRRRMSSQKNVTKRASRKSKKIVPEPLIFDTTRRRRSSISRPARPL 517

Query: 481  LWGSLDYIIQSFENNEDVWVNQSQKQGNKKPKGNRGGTKLNNKQPSESSHRSRKGTQVNS 540
             WGSL +IIQSFE  +DV VNQS+KQGN+K KGN+GGTK + KQPSESSHRSRKGTQ   
Sbjct: 518  PWGSLGFIIQSFEKIDDVLVNQSKKQGNEKSKGNQGGTKRSKKQPSESSHRSRKGTQGKC 577

Query: 541  ATSTSTNRIRLKVKLGKNMGHNFLNIVVPEVVGSSLSAKGINSNYGNKSYWGGNLEFPPS 600
             TSTSTNRIRLKVKLGKN+GHNFLNIVVPE+V SSLSAKGIN +YGN+SYW GNLEFPPS
Sbjct: 578  DTSTSTNRIRLKVKLGKNVGHNFLNIVVPEIVDSSLSAKGINCHYGNESYWEGNLEFPPS 637

Query: 601  TIGVDDQKLEEGPLRKIFCYNRNQEKEEKCSDPYIVKEQCATNDSSCTNIVSKLSVEHAD 660
               VDDQK EEGPLRKIFCY++NQ KEEKC D  +V EQCA NDSSCT  + K S +HAD
Sbjct: 638  ---VDDQKPEEGPLRKIFCYSKNQGKEEKCPDASVVNEQCANNDSSCTVTIDKSSTKHAD 697

Query: 661  DSFAVSSDLVELVEHASDTRNLDPGTSPDSEVINSILDIQVGAVRQENLQESVLASSEDF 720
            D+  VSS LVE VE ASDTR LDPGTSPDSEVINS+LDIQVGA+RQE LQ+SVL S EDF
Sbjct: 698  DNLCVSSHLVEPVERASDTRCLDPGTSPDSEVINSMLDIQVGAMRQEKLQDSVLPSLEDF 757

Query: 721  AASGNVTSSKKGRKKEKPYQAVSCSQEGGTCASACSNGSKSSKKHGTRLNVDNQLGAGET 780
            AASGN TSSKKGRKKEKP QAVSCS E GT ASAC+N SKSSKKHG RLNVDNQLG+GET
Sbjct: 758  AASGNATSSKKGRKKEKPCQAVSCSDEAGTGASACNNRSKSSKKHGRRLNVDNQLGSGET 817

Query: 781  FTSVGENVLNDSFTFKELSTE----STEIEHPEEALKVERILDVKECCKTEACSVFPESE 840
            FT    NV+N S T KELS +    STEIE PEEALK + IL+ KEC +T+  SVFPESE
Sbjct: 818  FTYTDANVVNYSLTVKELSMDQVPLSTEIELPEEALKADGILEDKECYRTDVGSVFPESE 877

Query: 841  NLKMFLPSQSARKKHPKSSKPIKTSDCKPKDPGSKNKIKNASKERVYQRKFVNKSKIKKD 900
            N K FLPSQSA KKHPK SK IKTS  K K PGSKNKIKNASKERVY+RK  NKS I + 
Sbjct: 878  NSKTFLPSQSAGKKHPKGSKSIKTSKGKSKAPGSKNKIKNASKERVYRRKSFNKS-ITEA 937

Query: 901  ICEQVVTETESHQIVGNFLVDKPAKVDDITASTVAINLNAVQGVVNEQYTPPRNAWVLCD 960
            +C+QVVTETESHQI     VDKP K +DI ASTVA+NLN VQG VNEQY PPRNAWVLCD
Sbjct: 938  LCDQVVTETESHQI-----VDKPEKSNDIIASTVAVNLNVVQGAVNEQYMPPRNAWVLCD 997

Query: 961  DCQKWRRIAASLVDSLGHASCTWTCKDNVDKAFADCLIPQEKSNAEINAELEISDESGEE 1020
            DC KWRRI ASLVDSLGHASCTWTCK+NVDKAFADC IPQEKSNAEINAELEISDESGEE
Sbjct: 998  DCHKWRRIPASLVDSLGHASCTWTCKENVDKAFADCSIPQEKSNAEINAELEISDESGEE 1057

Query: 1021 NASNKRLTYREFESFHPLTVTAVLQENKFASISSNQFLHRSRKTQNIDEVMVCHCKPTLD 1080
            NASNKRLTYRE +SFHP TVTAV QENKF SISSN FLHRSRKTQ IDE+MVCHCKP LD
Sbjct: 1058 NASNKRLTYRELDSFHPTTVTAVPQENKFTSISSNHFLHRSRKTQTIDEIMVCHCKPALD 1117

Query: 1081 GRLGCANECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDI 1140
            GRLGC +ECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQ +EDI
Sbjct: 1118 GRLGCGDECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQSVEDI 1177

Query: 1141 SKGKFLIEYVGEVLDMHAYESRQKEYAFNGHRHFYFMTLNGSEVIDACGKGNLGRFINHS 1200
            SKG+FLIEYVGEVLDMHAYE+RQKEYA NGHRHFYFMTL+GSEVIDACGKGNLGRFINHS
Sbjct: 1178 SKGQFLIEYVGEVLDMHAYEARQKEYALNGHRHFYFMTLDGSEVIDACGKGNLGRFINHS 1237

Query: 1201 CDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVCGAAAKKCYCGSSQCRGYI 1260
            CDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRV GAAAKKCYCGS QCRGYI
Sbjct: 1238 CDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVFGAAAKKCYCGSFQCRGYI 1297

Query: 1261 GGDPHNSEVIIHSDPDEEFPEPVMLRAYGRSSNGNLPTAASSVDGAKMQLSERIKGVRNK 1320
            GGDP NSEVII SD DEEFPEPVMLR  GRS N NLPT  S +DG K Q SE IKGVR+K
Sbjct: 1298 GGDPLNSEVIIQSDSDEEFPEPVMLRPDGRSWNNNLPTTVSLLDGVKKQPSEHIKGVRDK 1357

Query: 1321 KEQPTGIAIQMKILEEKEEPFQLSALKISEDPPKLSASKISEEQEDHHNLSALIISPLHS 1380
            K+QP+  +++ KI +EK            ED  KLSASKISE +ED  NLSA  ISPLHS
Sbjct: 1358 KDQPSRTSVESKISDEK------------EDTLKLSASKISEAKEDPLNLSASTISPLHS 1417

Query: 1381 SLEFEDSK---------------------------------------------EAKLSFD 1440
            SLEFEDSK                                             EAKLS  
Sbjct: 1418 SLEFEDSKVASPTPLADITHQTEDVTSQPVFVDQPEISPGDNNSDKNTCSIEQEAKLSVA 1477

Query: 1441 DIDARKNSELDAIEDKQVYINSHPQMKTSRKQGSIKKGKVSSVEKVKITNKPQILSLKSK 1500
            DIDARK S+L AIEDK+VYI SH +MKTSRK GSIKKGKVSSVEKV+I N+PQI S+K K
Sbjct: 1478 DIDARKKSKLVAIEDKKVYIKSHLRMKTSRKPGSIKKGKVSSVEKVQIANRPQISSVKPK 1537

Query: 1501 RLFEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSN 1560
            RL +GSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSN
Sbjct: 1538 RLVDGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSN 1597

Query: 1561 RDLSMLLDALLKTKSRVVLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVT 1620
            RDLSM+LDALLKTKSRVVLTDI+NKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLV 
Sbjct: 1598 RDLSMILDALLKTKSRVVLTDIMNKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVM 1657

Query: 1621 REILTSEHIYGSPPCPGMESLRESLLSLTEHDDKQVHQIARGFRDRWFPRHNRKFGYSGR 1680
            REILTSEHI G PPCPGMESLR+SLLSLTEHDDKQVHQIAR FRDRWFPRH RKF YS R
Sbjct: 1658 REILTSEHINGGPPCPGMESLRDSLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFVYSER 1717

Query: 1681 ADGRLEAYRGTNCSRFTASPSYCHDQDFRPSEAIDCNKQPSTPTSLSDFHPAEVCSVPST 1740
             DGRLE YRG+NCSRFTAS SY  DQD RP++AIDC KQ S  TSL D HPAEVCS+ ST
Sbjct: 1718 EDGRLEVYRGSNCSRFTASHSYRRDQDSRPTDAIDCVKQ-SLSTSLPDAHPAEVCSMAST 1777

Query: 1741 AGHSLDGQRIRKRKSRWDQPADTSIDLRSKEQKLESTSVQRFDSSQSNSVGVASMLVDKI 1800
            AGHSL+GQ++ KRKSRWDQPADTS+DLRSKEQKLES SVQ+F+SSQ +SVGV SML+DK+
Sbjct: 1778 AGHSLNGQKVCKRKSRWDQPADTSLDLRSKEQKLESKSVQQFNSSQLSSVGVVSMLIDKV 1837

Query: 1801 NSDNMGSSLSGSVEICCRRDEDIRLDSAVHNTPEDIPPGFSSPFNLPVASSSPFSTILDP 1860
            NSD+   SLS SV +   +DEDIR DSAV N PEDIPPGF  PF+LPVASSS FST+LDP
Sbjct: 1838 NSDDKDFSLSDSVGVRGSQDEDIRADSAVQNIPEDIPPGF-FPFSLPVASSSAFSTVLDP 1897

Query: 1861 PRQSIDNLSCTFSTVGHPQERFISRLPVSYGIPFSIVEQCGTSDSENLEFWCWDVAPGVP 1920
            PRQSI  LSC FSTVG+PQE+FIS LPVSYGIPFSIVEQCGTS +ENLE  CWDVAPG+P
Sbjct: 1898 PRQSIGKLSCAFSTVGYPQEKFISCLPVSYGIPFSIVEQCGTSCAENLE--CWDVAPGMP 1957

Query: 1921 FHPFPPLPPYPRGKRGPSTSACGTAVTQSDRERQVKSHDSQTSFSEESAPSTSTNYQQGL 1976
            FHPFPPLPPYPRGKRG  TSACGTAV QS +E QV  HDS+TSFSEE+ PSTSTNYQQ L
Sbjct: 1958 FHPFPPLPPYPRGKRGLPTSACGTAVRQSSQEMQVNCHDSRTSFSEETPPSTSTNYQQDL 2017

BLAST of Sed0009206 vs. ExPASy TrEMBL
Match: A0A6J1FEZ0 (histone-lysine N-methyltransferase ASHH2 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111444881 PE=4 SV=1)

HSP 1 Score: 2752.6 bits (7134), Expect = 0.0e+00
Identity = 1492/2047 (72.89%), Positives = 1648/2047 (80.51%), Query Frame = 0

Query: 1    MASSPSSS---RMCEPDRELGVITTTICANVSEVAAAGEDCTFRGLVRADTLSLDERFNS 60
            M+S PS+S   +M EPDR LGV  T IC N SE   AGED T RG   ADTL LD+R + 
Sbjct: 38   MSSLPSNSSDCQMLEPDRGLGVTATNICTNASEPDTAGEDGTSRGFKHADTLLLDKRLDC 97

Query: 61   DSGDIGLGLNEEDESYNLGNETFSLDMEEPQDDEGLVDILGCKTTMEMMSLTGSLVNSVE 120
            DSGD    LNEE+E+ N+GN   SLDM+E QD + LVDILGCKTTMEMMSL GSL+NSV+
Sbjct: 98   DSGDSDPCLNEENEACNVGNGALSLDMKESQDVDDLVDILGCKTTMEMMSLAGSLMNSVK 157

Query: 121  ---LNDDSFVVDTVAKV-----------------VRDDLKSPSHICEIVSNSASADGLPS 180
               L+++S ++D   KV                   DDLKSP HICEIVS+SASADGL S
Sbjct: 158  PEGLDNNSCIIDASEKVESGDIVENGPLLSRIGTCTDDLKSP-HICEIVSSSASADGLSS 217

Query: 181  DFIQLNELDNDG-GCSFSDVMDNRISDGSIVTEAEMLNEMSPLRSGQILSVHVGQLVANC 240
            DFI   +L+NDG GCSFS+V D R+++  +  EA++LNEMSPL+S QIL +H+ + VANC
Sbjct: 218  DFIHQKQLENDGAGCSFSEVAD-RLTEALVEIEADILNEMSPLQSDQILPIHMARSVANC 277

Query: 241  DQYICEMDGKNLSGTSGETINEVTDMNDIHESCLQMLPSHGCKK-RERLQSNSSPLTIHA 300
            +QYIC+MDGK LSGTSGET+NE  DMN   E CLQ+LPS GC++ RE LQ++ SPLTIH+
Sbjct: 278  EQYICQMDGKGLSGTSGETVNEFADMNSNPELCLQILPSQGCERIRECLQADDSPLTIHS 337

Query: 301  LENDQCGEM--SNSSPKYILEVVEDVDVSTTNNNADAEQYVDPKIKNNYNLEEATIQLSL 360
             E ++C E   SNS  KYI EVVED  V  T+NN D  Q++ P ++NN NLEEA+IQ + 
Sbjct: 338  PEINRCDEKHDSNSLSKYIPEVVEDDFVVLTDNNGDGGQHIVPDMENNCNLEEASIQENT 397

Query: 361  NCVELLASPLPSQFPNCERDEFHEMLNVADIPIKDISSVNSCSIGDQDNNDVEKVGCVSE 420
            NCVELLASPLP Q  N E+ EF+ ML  AD+PIKD    NSCS+ DQDNND EKVG VSE
Sbjct: 398  NCVELLASPLPFQPFNSEKYEFYGMLIGADMPIKD----NSCSVSDQDNNDTEKVGHVSE 457

Query: 421  VKCLETVIPFSKRSGRRKTSSQKTVTKRAPRKSRKKVPNALIFDTARRRRSSISRPARPS 480
            VKC ETV+  SKRSGRR+ SSQK VTKRA RKS+K VP  LIFDT RRRRSSISRPARP 
Sbjct: 458  VKCPETVLMSSKRSGRRRPSSQKNVTKRASRKSKKIVPEPLIFDTTRRRRSSISRPARPL 517

Query: 481  LWGSLDYIIQSFENNEDVWVNQSQKQGNKKPKGNRGGTKLNNKQPSESSHRSRKGTQVNS 540
             WGSL +IIQSFE  +D   NQS+KQGN+K KGN+GGTK + KQPSESSHRSRKGTQ   
Sbjct: 518  PWGSLGFIIQSFEKIDDALANQSKKQGNEKSKGNQGGTKRSKKQPSESSHRSRKGTQGKC 577

Query: 541  ATSTSTNRIRLKVKLGKNMGHNFLNIVVPEVVGSSLSAKGINSNYGNKSYWGGNLEFPPS 600
             TSTSTNRIRLKVKLGKN+GHNFLNIVVPE+V SSLSAKGIN +YGN+SYW GNLEFPPS
Sbjct: 578  DTSTSTNRIRLKVKLGKNVGHNFLNIVVPEIVDSSLSAKGINCHYGNESYWEGNLEFPPS 637

Query: 601  TIGVDDQKLEEGPLRKIFCYNRNQEKEEKCSDPYIVKEQCATNDSSCTNIVSKLSVEHAD 660
               VDDQK EEG LRKIFCY++N++KE+KC D  +V EQCA NDSSCT  + K S +HAD
Sbjct: 638  ---VDDQKPEEGSLRKIFCYSKNEDKEKKCPDASVVNEQCANNDSSCTVTIEKSSTKHAD 697

Query: 661  DSFAVSSDLVELVEHASDTRNLDPGTSPDSEVINSILDIQVGAVRQENLQESVLASSEDF 720
            D+  VSS +VE VE A DTR+LDPGTSPDSEVINS+LDIQVGA+RQE LQ+SVL S EDF
Sbjct: 698  DNLCVSSHMVEPVERAIDTRSLDPGTSPDSEVINSMLDIQVGAMRQEKLQDSVLPSLEDF 757

Query: 721  AASGNVTSSKKGRKKEKPYQAVSCSQEGGTCASACSNGSKSSKKHGTRLNVDNQLGAGET 780
            AASGN +SSKKGRKKEKP QAVSCS E GT ASAC+N SKSSKKHG RLN DNQLG+GET
Sbjct: 758  AASGNASSSKKGRKKEKPCQAVSCSDEAGTGASACNNRSKSSKKHGRRLNADNQLGSGET 817

Query: 781  FTSVGENVLNDSFTFKELSTE----STEIEHPEEALKVERILDVKECCKTEACSVFPESE 840
            FT    NV+N S T KELS +    STEIE PEEALK + IL+ KEC +T+  SVF ESE
Sbjct: 818  FTYNDANVVNYSLTVKELSIDQVPLSTEIELPEEALKADGILEDKECYRTDVGSVFLESE 877

Query: 841  NLKMFLPSQSARKKHPKSSKPIKTSDCKPKDPGSKNKIKNASKERVYQRKFVNKSKIKKD 900
            N K FLPSQSARKKHPK SK IKTS  K K PGSKNKIKNASKERVY+RK  NKS IK+ 
Sbjct: 878  NSKTFLPSQSARKKHPKGSKSIKTSKGKSKAPGSKNKIKNASKERVYRRKSFNKS-IKEA 937

Query: 901  ICEQVVTETESHQIVGNFLVDKPAKVDDITASTVAINLNAVQGVVNEQYTPPRNAWVLCD 960
            +C++VVTETESHQIVGN+LVDKP K +DI  STVA+NLN VQG VNEQY PPRNAWVLCD
Sbjct: 938  LCDRVVTETESHQIVGNYLVDKPEKSNDIIESTVAVNLNVVQGAVNEQYMPPRNAWVLCD 997

Query: 961  DCQKWRRIAASLVDSLGHASCTWTCKDNVDKAFADCLIPQEKSNAEINAELEISDESGEE 1020
            DC KWRRI ASLVDSLGHASCTWTCKDNVDKAFADC IPQEKSNAEINAELEISDESGEE
Sbjct: 998  DCHKWRRIPASLVDSLGHASCTWTCKDNVDKAFADCSIPQEKSNAEINAELEISDESGEE 1057

Query: 1021 NASNKRLTYREFESFHPLTVTAVLQENKFASISSNQFLHRSRKTQNIDEVMVCHCKPTLD 1080
            NASNKRLTYRE +SFHP TVTAV QENKF SISSN FLHRSRKTQ IDE+MVCHCKP LD
Sbjct: 1058 NASNKRLTYRELDSFHPTTVTAVPQENKFTSISSNHFLHRSRKTQTIDEIMVCHCKPALD 1117

Query: 1081 GRLGCANECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDI 1140
            GRLGC +ECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKL+WLRCGKKGYGLQ +EDI
Sbjct: 1118 GRLGCGDECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLRWLRCGKKGYGLQSVEDI 1177

Query: 1141 SKGKFLIEYVGEVLDMHAYESRQKEYAFNGHRHFYFMTLNGSEVIDACGKGNLGRFINHS 1200
            SKG+FLIEYVGEVLDMHAYE+RQKEYA NGHRHFYFMTL+GSEVIDACGKGNLGRFINHS
Sbjct: 1178 SKGQFLIEYVGEVLDMHAYEARQKEYALNGHRHFYFMTLDGSEVIDACGKGNLGRFINHS 1237

Query: 1201 CDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVCGAAAKKCYCGSSQCRGYI 1260
            CDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRV GAAAKKCYCGS QCRGYI
Sbjct: 1238 CDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVFGAAAKKCYCGSFQCRGYI 1297

Query: 1261 GGDPHNSEVIIHSDPDEEFPEPVMLRAYGRSSNGNLPTAASSVDGAKMQLSERIKGVRNK 1320
            GGDP NSEVII SD DEEFPEPVMLR  GRS N NLPTA S +DG K Q SE IKGVR+K
Sbjct: 1298 GGDPLNSEVIIQSDSDEEFPEPVMLRPDGRSWNNNLPTAVSLLDGVKKQPSEHIKGVRDK 1357

Query: 1321 KEQPTGIAIQMKILEEKEEPFQLSALKISEDPPKLSASKISEEQEDHHNLSALIISPLHS 1380
            K+QP   A++ KI +EK            ED PKLSASKISE +ED  NLSA  ISPLHS
Sbjct: 1358 KDQPIRTAVESKISDEK------------EDTPKLSASKISEAKEDPLNLSASTISPLHS 1417

Query: 1381 SLEFEDSK---------------------------------------------EAKLSFD 1440
            SLEFEDSK                                             EAKLS  
Sbjct: 1418 SLEFEDSKVASPTPLADITHQTEDVTSQPVFVDQPEISPGDNNSDKNTCSIEQEAKLSVA 1477

Query: 1441 DIDARKNSELDAIEDKQVYINSHPQMKTSRKQGSIKKGKVSSVEKVKITNKPQILSLKSK 1500
            +IDARK S+LDA+EDK+VYI SH +MKTSRK GSIKKGKVSSVEKV+ITN+PQI S+K K
Sbjct: 1478 EIDARKKSKLDAVEDKKVYIKSHLRMKTSRKPGSIKKGKVSSVEKVQITNRPQISSVKPK 1537

Query: 1501 RLFEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSN 1560
            RL +GSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSN
Sbjct: 1538 RLVDGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSN 1597

Query: 1561 RDLSMLLDALLKTKSRVVLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVT 1620
            RDLSM+LDALLKTKSRVVLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLV 
Sbjct: 1598 RDLSMILDALLKTKSRVVLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVM 1657

Query: 1621 REILTSEHIYGSPPCPGMESLRESLLSLTEHDDKQVHQIARGFRDRWFPRHNRKFGYSGR 1680
            REILTSEHI G PPCPGMESLR+SLLSLTEHDDKQVHQIAR FRDRWFPRH RKF YS R
Sbjct: 1658 REILTSEHINGGPPCPGMESLRDSLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFVYSER 1717

Query: 1681 ADGRLEAYRGTNCSRFTASPSYCHDQDFRPSEAIDCNKQPSTPTSLSDFHPAEVCSVPST 1740
             DGRLE YRG+NCSRFTAS SY  DQD RP++AIDC KQ S  TSL D HPAEVC++ ST
Sbjct: 1718 EDGRLEVYRGSNCSRFTASHSYRRDQDSRPTDAIDCVKQ-SMSTSLPDAHPAEVCTMAST 1777

Query: 1741 AGHSLDGQRIRKRKSRWDQPADTSIDLRSKEQKLESTSVQRFDSSQSNSVGVASMLVDKI 1800
            AGHSL+GQ++ KRKSRWDQPADTS+DLRSKEQKLES SVQ+F+SSQ +SVGV SML+DK+
Sbjct: 1778 AGHSLNGQKVCKRKSRWDQPADTSLDLRSKEQKLESKSVQQFNSSQLSSVGVVSMLIDKV 1837

Query: 1801 NSDNMGSSLSGSVEICCRRDEDIRLDSAVHNTPEDIPPGFSSPFNLPVASSSPFSTILDP 1860
            NSD+   SLS SV +   +D+DIR DSAV N PEDIPPGF SPF+LPVASSS FST+LDP
Sbjct: 1838 NSDDKDFSLSDSVGVRGSQDDDIRADSAVQNIPEDIPPGF-SPFSLPVASSSAFSTVLDP 1897

Query: 1861 PRQSIDNLSCTFSTVGHPQERFISRLPVSYGIPFSIVEQCGTSDSENLEFWCWDVAPGVP 1920
            PRQSI  LS  FSTVG+PQE+FIS LPVSYGIPFSIVEQCGTS +ENLE  CWDVAPGVP
Sbjct: 1898 PRQSIGKLSSAFSTVGYPQEKFISCLPVSYGIPFSIVEQCGTSRAENLE--CWDVAPGVP 1957

Query: 1921 FHPFPPLPPYPRGKRGPSTSACGTAVTQSDRERQVKSHDSQTSFSEESAPSTSTNYQQGL 1972
            FHPFPPLPPYPRGKRGP TSACGTAV QS +E QV  HDS+TSFSEE+ PSTS NYQQ  
Sbjct: 1958 FHPFPPLPPYPRGKRGPPTSACGTAVEQSSQEMQVNCHDSRTSFSEENPPSTS-NYQQDS 2017

BLAST of Sed0009206 vs. TAIR 10
Match: AT1G77300.1 (histone methyltransferases (H3-K4 specific); histone methyltransferases (H3-K36 specific))

HSP 1 Score: 830.9 bits (2145), Expect = 2.2e-240
Identity = 674/1818 (37.07%), Positives = 905/1818 (49.78%), Query Frame = 0

Query: 149  SASADGLPSDFIQLNELDNDGGCSFSDVMDNRISDGSIVTEAEMLNEMSPLRSGQIL--- 208
            S+  D   SD I L++       SF D + +    G   TE+ + + +    +G I+   
Sbjct: 228  SSDLDTGSSDDISLSQ-----SFSFPDSLLDSSVFGCSATESYLEDAIDIEGNGTIVVSP 287

Query: 209  SVHVGQLVANCDQYICEMDGKNLSGTSGETINEVTDMNDIHESCLQMLPSHGCKKRERLQ 268
            S+ + +++ N D  +C  D   ++ T  ETIN                P     + +RL 
Sbjct: 288  SLAITEMLNNDDGGLCSHDLNKITVT--ETIN----------------PDLKLVREDRLD 347

Query: 269  SNSSPLTIHALENDQCGEMSNSSPKYILEVVEDVDVSTTNNNADAEQYVDPKIKNNYNLE 328
            ++ S +    L+N   G+ S+ S    L +          NN  A          +  ++
Sbjct: 348  TDLSVMNEKMLKN-HVGDSSSESAVAALSM----------NNGMAADLRAENFSQSSPID 407

Query: 329  EATIQLSLNCVELLASPLPSQFPNCERDEFHEMLNVADIPIKDISSVNSCSIGDQDNNDV 388
            E T+ +  N   +  S L   FP        E+ N         ++V    I D +    
Sbjct: 408  EKTLDMEANS-PITDSSLIWNFPLNFGSGGIEVCNPE-------NAVEPLRIVDDNGRIG 467

Query: 389  EKVGCVSEVKCLETVIPFSKRSGRR----KTSSQKTVTKRAPRKSRKKVPN---ALIFDT 448
             +V   S     E  +  S+R  R     K    KT  +   + SRKK        IF  
Sbjct: 468  GEVASASGSDFCEAGMSSSRRKARDGKQCKVVQTKTSARHLRKSSRKKQSERDIESIFKC 527

Query: 449  ARRRRSSISRPARPSLWGSLDYIIQSFENNEDVWVNQSQKQGNKKPKGNRGGTKLNNKQP 508
            ++++RSS+ + +R S WG      + F  + ++  +       ++ +GN     LNN + 
Sbjct: 528  SKQKRSSLLKTSRSSEWGLPSKTTEIFLQSNNIPYDGPPHHEPQRSQGN-----LNNGEH 587

Query: 509  SESSHRSRKGTQVNSATSTSTNRIRLKVKLGKNMGHNFLNIVVPEVVGSSLSAKGINSNY 568
            + SSH         +  ++S + +RLKVK GK+ G N LNI V +V G+SL   GI    
Sbjct: 588  NRSSHNGNVEGSNRNIQASSGSCLRLKVKFGKSGGQNPLNITVSKVSGNSLPGNGIVKA- 647

Query: 569  GNKSYWGGNLEFPPSTIGVDDQKLEEGPLRKIFCYNRNQEKEEKCSDPYIVKEQCATNDS 628
                  G  LE P S    +D+         +   +   EK         ++++    D+
Sbjct: 648  ------GTCLELPGSAHFGEDKMQTVETKEDLVEKSNPVEKVSYLQSSDSMRDKKYNQDA 707

Query: 629  SCTNIVSKLSVEHADDSFAVSS-DLVELVEHASDTRNLDPGTSPDSEVINSILDIQVGAV 688
                +  K+  +  DD   +SS  +VE  E A+ T++LD  TSPDSEVINS+ D  V   
Sbjct: 708  G--GLCRKVGGDVLDDDPHLSSIRMVEECERATGTQSLDAETSPDSEVINSVPDSIVNIE 767

Query: 689  RQENLQESVLASSEDFAASGNVTSSKKGRKKEKPYQAVSCSQEGGTCASACSNGSKSSKK 748
             +E L     ++ ED          KK R  EK  +                  SKS  +
Sbjct: 768  HKEGLHHGFFSTPEDVV--------KKNRVLEKEDEL---------------RASKSPSE 827

Query: 749  HGTRLNVDNQLGAGETFTSVGENVLNDSFTFKELSTESTEIEHPEEALKVERILDVKECC 808
            +G+ L                                                       
Sbjct: 828  NGSHL------------------------------------------------------- 887

Query: 809  KTEACSVFPESENLKMFLPSQSARKKHPKSSKPIKTSDCKPK-DPGSKNKIKNASKERVY 868
                             +P+ + + KHPK SK   T   K K    +K+  KN S E V 
Sbjct: 888  -----------------IPN-AKKAKHPK-SKSNGTKKGKSKFSESAKDGRKNESHEGVE 947

Query: 869  QRKFVNKSKIKKDICEQVVTETESHQIVGNFL---VDKPAKVDDITASTVAINLNAVQGV 928
            QRK +N S  + D     V   ESH+  G  L   + K +      +S V      V   
Sbjct: 948  QRKSLNTSMGRDDSDYPEVGRIESHKTTGALLDADIGKTSATYGTISSDVTHGEMVVDVT 1007

Query: 929  VNEQYTPPRNAWVLCDDCQKWRRIAASLVDSLGHASCTWTCKDNVDKAFADCLIPQEKSN 988
            + + Y+   +AWV CDDC KWRRI AS+V S+  +S  W C +N DK FADC   QE SN
Sbjct: 1008 IEDSYS-TESAWVRCDDCFKWRRIPASVVGSIDESS-RWICMNNSDKRFADCSKSQEMSN 1067

Query: 989  AEINAELEISDESGEENASNKRLTYREFESFHPLTVTAVLQENKFASISSNQFLHRSRKT 1048
             EIN EL I  +  E +A +     R  E           Q+  F +I +NQFLHR+RK+
Sbjct: 1068 EEINEELGIGQD--EADAYDCDAAKRGKEKEQKSKRLTGKQKACFKAIKTNQFLHRNRKS 1127

Query: 1049 QNIDEVMVCHCKPTLDGRLGCANECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLQW 1108
            Q IDE+MVCHCKP+ DGRLGC  ECLNRMLNIEC++GTCP GDLCSNQQFQKRKY K + 
Sbjct: 1128 QTIDEIMVCHCKPSPDGRLGCGEECLNRMLNIECLQGTCPAGDLCSNQQFQKRKYVKFER 1187

Query: 1109 LRCGKKGYGLQLLEDISKGKFLIEYVGEVLDMHAYESRQKEYAFNGHRHFYFMTLNGSEV 1168
             + GKKGYGL+LLED+ +G+FLIEYVGEVLDM +YE+RQKEYAF G +HFYFMTLNG+EV
Sbjct: 1188 FQSGKKGYGLRLLEDVREGQFLIEYVGEVLDMQSYETRQKEYAFKGQKHFYFMTLNGNEV 1247

Query: 1169 IDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVCGA 1228
            IDA  KGNLGRFINHSC+PNCRTEKWMVNGEIC+G+F+++D+KKG+E+TFDYNYVRV GA
Sbjct: 1248 IDAGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKKGQELTFDYNYVRVFGA 1307

Query: 1229 AAKKCYCGSSQCRGYIGGDPHNSEVIIHSDPDEEFPEPVMLRAYGRSSNGNL-PTAASSV 1288
            AAKKCYCGSS CRGYIGGDP N +VII SD DEE+PE V+L     S  G L  T+ +  
Sbjct: 1308 AAKKCYCGSSHCRGYIGGDPLNGDVIIQSDSDEEYPELVILDD-DESGEGILGATSRTFT 1367

Query: 1289 DGAKMQLSERIKGVRNKKE-QPTGIAIQMKI---LEEKEEPFQLSALKISEDPPKLSA-- 1348
            D A  Q+ +  + V   K+  P     Q  +   L E+E P  L  L+ +E   +LS+  
Sbjct: 1368 DDADEQMPQSFEKVNGYKDLAPDNTQTQSSVSVKLPEREIPPPL--LQPTEVLKELSSGI 1427

Query: 1349 SKISEEQEDHHNLSALIISPLHSSLEFEDSKEAKLSFDDIDARKNSELDAIEDKQVYINS 1408
            S  + +QE          SP  SSL       +++S    ++ K ++  + EDK++    
Sbjct: 1428 SITAVQQEVPAEKKTKSTSPTSSSL-------SRMSPGGTNSDKTTKHGSGEDKKILPRP 1487

Query: 1409 HPQMKTSRKQGSIKKGK---VSSVEKVKI--TNKPQILSLKSKRLFEGSPGNRFEAVEEK 1468
             P+MKTSR   S K+ K      V K ++   NK Q   +KSK   + SP    E  E K
Sbjct: 1488 RPRMKTSRSSESSKRDKGGIYPGVNKAQVIPVNKLQQQPIKSKGSEKVSPS--IETFEGK 1547

Query: 1469 LNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMLLDALLKTKSRV 1528
            LNELLDA GGISKR+D+ KGYLKLLLLTAAS      E I SNRDLSM+LDALLKTKS+ 
Sbjct: 1548 LNELLDAVGGISKRRDSAKGYLKLLLLTAAS-RGTDEEGIYSNRDLSMILDALLKTKSKS 1607

Query: 1529 VLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTREILTSEHIYGSPPCPG 1588
            VL DIINKNGL+MLHNIMKQYR DFK+IPI+RKLLKVLEYL TR+IL  EHI   PP  G
Sbjct: 1608 VLVDIINKNGLQMLHNIMKQYRGDFKRIPIIRKLLKVLEYLATRKILALEHIIRRPPFAG 1667

Query: 1589 MESLRESLLSLTEHDDKQVHQIARGFRDRWFPRHNRKFGYSGRADGRLEAYRGTNCSRFT 1648
            MES ++S+LS TEHDD  VH IAR FRDRW P+H RK     R + R E+ R     RF 
Sbjct: 1668 MESFKDSVLSFTEHDDYTVHNIARSFRDRWIPKHFRKPWRINREE-RSESMRSPINRRFR 1727

Query: 1649 AS--PSYCHDQDFRPSE--AIDCNKQPSTP--TSLSDFHPAEVCSVPSTAGHSLDGQRIR 1708
            AS  P Y H Q  RP+E  A   + + +TP   S+S+ +      +P T G        R
Sbjct: 1728 ASQEPRYDH-QSPRPAEPAASVTSSKAATPETASVSEGYSEPNSGLPETNG--------R 1773

Query: 1709 KRKSRWDQPADTSIDLRSKEQKLESTSVQRFDSSQSNSVGVASMLVDKINSDNMGSSLSG 1768
            KRKSRWDQP+ T      KEQ++ +   Q+ D +  N                       
Sbjct: 1788 KRKSRWDQPSKT------KEQRIMTILSQQTDETNGN----------------------- 1773

Query: 1769 SVEICCRRDEDIRLDSAVHNTPEDIPPGFSSPFNLPVASSSPFSTILDPPRQSIDNLSCT 1828
                     +D++         +D+PPGFSSP               D P          
Sbjct: 1848 ---------QDVQ---------DDLPPGFSSP-------------CTDVPD--------- 1773

Query: 1829 FSTVGHPQERFISRLPVSYGIPFSIVEQCGTSDSENLEFWCWDVAPGVPFHPFPPLPPYP 1888
             +    PQ++F+SRLPVSYGIP SIV Q G+   E+     W VAPG+PF+PFPPLPP  
Sbjct: 1908 -AITAQPQQKFLSRLPVSYGIPLSIVHQFGSPGKEDPT--TWSVAPGMPFYPFPPLPPVS 1773

Query: 1889 RGKRGPSTSACGTAVTQSDRERQVKSHDSQTSFSEESAPSTSTNYQQGLRILSNNQQTLK 1934
             G+     +            R   S     ++S E  P+T          ++++    +
Sbjct: 1968 HGEFFAKRNV-----------RACSSSMGNLTYSNEILPATP---------VTDSTAPTR 1773

BLAST of Sed0009206 vs. TAIR 10
Match: AT1G77300.2 (histone methyltransferases(H3-K4 specific);histone methyltransferases(H3-K36 specific) )

HSP 1 Score: 625.5 bits (1612), Expect = 1.4e-178
Identity = 513/1391 (36.88%), Positives = 694/1391 (49.89%), Query Frame = 0

Query: 149  SASADGLPSDFIQLNELDNDGGCSFSDVMDNRISDGSIVTEAEMLNEMSPLRSGQIL--- 208
            S+  D   SD I L++       SF D + +    G   TE+ + + +    +G I+   
Sbjct: 228  SSDLDTGSSDDISLSQ-----SFSFPDSLLDSSVFGCSATESYLEDAIDIEGNGTIVVSP 287

Query: 209  SVHVGQLVANCDQYICEMDGKNLSGTSGETINEVTDMNDIHESCLQMLPSHGCKKRERLQ 268
            S+ + +++ N D  +C  D   ++ T  ETIN                P     + +RL 
Sbjct: 288  SLAITEMLNNDDGGLCSHDLNKITVT--ETIN----------------PDLKLVREDRLD 347

Query: 269  SNSSPLTIHALENDQCGEMSNSSPKYILEVVEDVDVSTTNNNADAEQYVDPKIKNNYNLE 328
            ++ S +    L+N   G+ S+ S    L +          NN  A          +  ++
Sbjct: 348  TDLSVMNEKMLKN-HVGDSSSESAVAALSM----------NNGMAADLRAENFSQSSPID 407

Query: 329  EATIQLSLNCVELLASPLPSQFPNCERDEFHEMLNVADIPIKDISSVNSCSIGDQDNNDV 388
            E T+ +  N   +  S L   FP        E+ N         ++V    I D +    
Sbjct: 408  EKTLDMEANS-PITDSSLIWNFPLNFGSGGIEVCNPE-------NAVEPLRIVDDNGRIG 467

Query: 389  EKVGCVSEVKCLETVIPFSKRSGRR----KTSSQKTVTKRAPRKSRKKVPN---ALIFDT 448
             +V   S     E  +  S+R  R     K    KT  +   + SRKK        IF  
Sbjct: 468  GEVASASGSDFCEAGMSSSRRKARDGKQCKVVQTKTSARHLRKSSRKKQSERDIESIFKC 527

Query: 449  ARRRRSSISRPARPSLWGSLDYIIQSFENNEDVWVNQSQKQGNKKPKGNRGGTKLNNKQP 508
            ++++RSS+ + +R S WG      + F  + ++  +       ++ +GN     LNN + 
Sbjct: 528  SKQKRSSLLKTSRSSEWGLPSKTTEIFLQSNNIPYDGPPHHEPQRSQGN-----LNNGEH 587

Query: 509  SESSHRSRKGTQVNSATSTSTNRIRLKVKLGKNMGHNFLNIVVPEVVGSSLSAKGINSNY 568
            + SSH         +  ++S + +RLKVK GK+ G N LNI V +V G+SL   GI    
Sbjct: 588  NRSSHNGNVEGSNRNIQASSGSCLRLKVKFGKSGGQNPLNITVSKVSGNSLPGNGIVKA- 647

Query: 569  GNKSYWGGNLEFPPSTIGVDDQKLEEGPLRKIFCYNRNQEKEEKCSDPYIVKEQCATNDS 628
                  G  LE P S    +D+         +   +   EK         ++++    D+
Sbjct: 648  ------GTCLELPGSAHFGEDKMQTVETKEDLVEKSNPVEKVSYLQSSDSMRDKKYNQDA 707

Query: 629  SCTNIVSKLSVEHADDSFAVSS-DLVELVEHASDTRNLDPGTSPDSEVINSILDIQVGAV 688
                +  K+  +  DD   +SS  +VE  E A+ T++LD  TSPDSEVINS+ D  V   
Sbjct: 708  G--GLCRKVGGDVLDDDPHLSSIRMVEECERATGTQSLDAETSPDSEVINSVPDSIVNIE 767

Query: 689  RQENLQESVLASSEDFAASGNVTSSKKGRKKEKPYQAVSCSQEGGTCASACSNGSKSSKK 748
             +E L     ++ ED          KK R  EK  +                  SKS  +
Sbjct: 768  HKEGLHHGFFSTPEDVV--------KKNRVLEKEDEL---------------RASKSPSE 827

Query: 749  HGTRLNVDNQLGAGETFTSVGENVLNDSFTFKELSTESTEIEHPEEALKVERILDVKECC 808
            +G+ L                                                       
Sbjct: 828  NGSHL------------------------------------------------------- 887

Query: 809  KTEACSVFPESENLKMFLPSQSARKKHPKSSKPIKTSDCKPK-DPGSKNKIKNASKERVY 868
                             +P+ + + KHPK SK   T   K K    +K+  KN S E V 
Sbjct: 888  -----------------IPN-AKKAKHPK-SKSNGTKKGKSKFSESAKDGRKNESHEGVE 947

Query: 869  QRKFVNKSKIKKDICEQVVTETESHQIVGNFL---VDKPAKVDDITASTVAINLNAVQGV 928
            QRK +N S  + D     V   ESH+  G  L   + K +      +S V      V   
Sbjct: 948  QRKSLNTSMGRDDSDYPEVGRIESHKTTGALLDADIGKTSATYGTISSDVTHGEMVVDVT 1007

Query: 929  VNEQYTPPRNAWVLCDDCQKWRRIAASLVDSLGHASCTWTCKDNVDKAFADCLIPQEKSN 988
            + + Y+   +AWV CDDC KWRRI AS+V S+  +S  W C +N DK FADC   QE SN
Sbjct: 1008 IEDSYS-TESAWVRCDDCFKWRRIPASVVGSIDESS-RWICMNNSDKRFADCSKSQEMSN 1067

Query: 989  AEINAELEISDESGEENASNKRLTYREFESFHPLTVTAVLQENKFASISSNQFLHRSRKT 1048
             EIN EL I  +  E +A +     R  E           Q+  F +I +NQFLHR+RK+
Sbjct: 1068 EEINEELGIGQD--EADAYDCDAAKRGKEKEQKSKRLTGKQKACFKAIKTNQFLHRNRKS 1127

Query: 1049 QNIDEVMVCHCKPTLDGRLGCANECLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLQW 1108
            Q IDE+MVCHCKP+ DGRLGC  ECLNRMLNIEC++GTCP GDLCSNQQFQKRKY K + 
Sbjct: 1128 QTIDEIMVCHCKPSPDGRLGCGEECLNRMLNIECLQGTCPAGDLCSNQQFQKRKYVKFER 1187

Query: 1109 LRCGKKGYGLQLLEDISKGKFLIEYVGEVLDMHAYESRQKEYAFNGHRHFYFMTLNGSEV 1168
             + GKKGYGL+LLED+ +G+FLIEYVGEVLDM +YE+RQKEYAF G +HFYFMTLNG+EV
Sbjct: 1188 FQSGKKGYGLRLLEDVREGQFLIEYVGEVLDMQSYETRQKEYAFKGQKHFYFMTLNGNEV 1247

Query: 1169 IDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVCGA 1228
            IDA  KGNLGRFINHSC+PNCRTEKWMVNGEIC+G+F+++D+KKG+E+TFDYNYVRV GA
Sbjct: 1248 IDAGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKKGQELTFDYNYVRVFGA 1307

Query: 1229 AAKKCYCGSSQCRGYIGGDPHNSEVIIHSDPDEEFPEPVMLRAYGRSSNGNL-PTAASSV 1288
            AAKKCYCGSS CRGYIGGDP N +VII SD DEE+PE V+L     S  G L  T+ +  
Sbjct: 1308 AAKKCYCGSSHCRGYIGGDPLNGDVIIQSDSDEEYPELVILDD-DESGEGILGATSRTFT 1367

Query: 1289 DGAKMQLSERIKGVRNKKE-QPTGIAIQMKI---LEEKEEPFQLSALKISEDPPKLSA-- 1348
            D A  Q+ +  + V   K+  P     Q  +   L E+E P  L  L+ +E   +LS+  
Sbjct: 1368 DDADEQMPQSFEKVNGYKDLAPDNTQTQSSVSVKLPEREIPPPL--LQPTEVLKELSSGI 1427

Query: 1349 SKISEEQEDHHNLSALIISPLHSSLEFEDSKEAKLSFDDIDARKNSELDAIEDKQVYINS 1408
            S  + +QE          SP  SSL       +++S    ++ K ++  + EDK++    
Sbjct: 1428 SITAVQQEVPAEKKTKSTSPTSSSL-------SRMSPGGTNSDKTTKHGSGEDKKILPRP 1448

Query: 1409 HPQMKTSRKQGSIKKGK---VSSVEKVKI--TNKPQILSLKSKRLFEGSPGNRFEAVEEK 1468
             P+MKTSR   S K+ K      V K ++   NK Q   +KSK   + SP    E  E K
Sbjct: 1488 RPRMKTSRSSESSKRDKGGIYPGVNKAQVIPVNKLQQQPIKSKGSEKVSPS--IETFEGK 1448

Query: 1469 LNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMLLDALLKTKSRV 1513
            LNELLDA GGISKR+D+ KGYLKLLLLTAAS      E I SNRDLSM+LDALLKTKS+ 
Sbjct: 1548 LNELLDAVGGISKRRDSAKGYLKLLLLTAAS-RGTDEEGIYSNRDLSMILDALLKTKSKS 1448

BLAST of Sed0009206 vs. TAIR 10
Match: AT1G76710.1 (SET domain group 26 )

HSP 1 Score: 198.7 bits (504), Expect = 4.2e-50
Identity = 93/235 (39.57%), Positives = 134/235 (57.02%), Query Frame = 0

Query: 1017 KFASISSNQFLHRSRKTQNIDEVMVCHCKPTL-DGRLGCANECLNRMLNIECVRGTCPCG 1076
            ++  I  N F +R  K Q  +++ +C CK    D    C   CLN + N EC  G CPCG
Sbjct: 16   QYEHIYQNDFSYRKHKKQKEEDISICECKFDFGDPDSACGERCLNVITNTECTPGYCPCG 75

Query: 1077 DLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGKFLIEYVGEVLDMHAYESRQKEY 1136
              C NQ+FQK +YAK + ++C  +G+GL  LE+I  G+F++EY GEV+     + R + Y
Sbjct: 76   VYCKNQKFQKCEYAKTKLIKCEGRGWGLVALEEIKAGQFIMEYCGEVISWKEAKKRAQTY 135

Query: 1137 AFNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDI 1196
              +G +  Y ++LN SE IDA  KG+L RFINHSC PNC T KW V GE+ +G+FA   I
Sbjct: 136  ETHGVKDAYIISLNASEAIDATKKGSLARFINHSCRPNCETRKWNVLGEVRVGIFAKESI 195

Query: 1197 KKGEEVTFDYNYVRVCGAAAKKCYCGSSQCRGYIGGDPH--NSEVIIHSDPDEEF 1249
                E+ +DYN+    G A  +C CG+  C G++G        +  +  D D+ +
Sbjct: 196  SPRTELAYDYNF-EWYGGAKVRCLCGAVACSGFLGAKSRGFQEDTYVWEDGDDRY 249

BLAST of Sed0009206 vs. TAIR 10
Match: AT1G76710.2 (SET domain group 26 )

HSP 1 Score: 198.7 bits (504), Expect = 4.2e-50
Identity = 93/235 (39.57%), Positives = 134/235 (57.02%), Query Frame = 0

Query: 1017 KFASISSNQFLHRSRKTQNIDEVMVCHCKPTL-DGRLGCANECLNRMLNIECVRGTCPCG 1076
            ++  I  N F +R  K Q  +++ +C CK    D    C   CLN + N EC  G CPCG
Sbjct: 16   QYEHIYQNDFSYRKHKKQKEEDISICECKFDFGDPDSACGERCLNVITNTECTPGYCPCG 75

Query: 1077 DLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGKFLIEYVGEVLDMHAYESRQKEY 1136
              C NQ+FQK +YAK + ++C  +G+GL  LE+I  G+F++EY GEV+     + R + Y
Sbjct: 76   VYCKNQKFQKCEYAKTKLIKCEGRGWGLVALEEIKAGQFIMEYCGEVISWKEAKKRAQTY 135

Query: 1137 AFNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDI 1196
              +G +  Y ++LN SE IDA  KG+L RFINHSC PNC T KW V GE+ +G+FA   I
Sbjct: 136  ETHGVKDAYIISLNASEAIDATKKGSLARFINHSCRPNCETRKWNVLGEVRVGIFAKESI 195

Query: 1197 KKGEEVTFDYNYVRVCGAAAKKCYCGSSQCRGYIGGDPH--NSEVIIHSDPDEEF 1249
                E+ +DYN+    G A  +C CG+  C G++G        +  +  D D+ +
Sbjct: 196  SPRTELAYDYNF-EWYGGAKVRCLCGAVACSGFLGAKSRGFQEDTYVWEDGDDRY 249

BLAST of Sed0009206 vs. TAIR 10
Match: AT2G44150.1 (histone-lysine N-methyltransferase ASHH3 )

HSP 1 Score: 162.5 bits (410), Expect = 3.4e-39
Identity = 83/220 (37.73%), Positives = 124/220 (56.36%), Query Frame = 0

Query: 1021 ISSNQFLHRSRKTQNIDEVMVCHCKPTLDGRLG--CANECLNRMLNIECVRGTCPCGDLC 1080
            I  N +L +  K +  D+ + C C  +  G     C + C   ML   C   +C CG  C
Sbjct: 47   IRRNIYLTKKVKRRVEDDGIFCSCSSSSPGSSSTVCGSNCHCGMLFSSC-SSSCKCGSEC 106

Query: 1081 SNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGKFLIEYVGEVLDMHAYESRQKEYAFN 1140
            +N+ FQ+R   K++ ++  K G G+   E+I  G+F+IEYVGEV+D    E R  +    
Sbjct: 107  NNKPFQQRHVKKMKLIQTEKCGSGIVAEEEIEAGEFIIEYVGEVIDDKTCEERLWKMKHR 166

Query: 1141 GHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKKG 1200
            G  +FY   +    VIDA  KGN  R+INHSC+PN + +KW+++GE  IG+FA R IKKG
Sbjct: 167  GETNFYLCEITRDMVIDATHKGNKSRYINHSCNPNTQMQKWIIDGETRIGIFATRGIKKG 226

Query: 1201 EEVTFDYNYVRVCGAAAKKCYCGSSQCRGYIGGDPHNSEV 1239
            E +T+DY +V+    A + C+CG+  CR  +G  P   ++
Sbjct: 227  EHLTYDYQFVQF--GADQDCHCGAVGCRRKLGVKPSKPKI 263
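
The Identity and Positives percentages in the HSP summaries above are ratios of matching (or conservatively substituted) columns to the alignment length. The following is a minimal sketch, assuming the summary-line layout used in this record, that recomputes both percentages from the raw counts. The function name `parse_hsp_stats` is hypothetical and not part of any BLAST tool; the pattern also tolerates the "Postives" misspelling that some exports of this page carry.

```python
import re

# Matches e.g. "Identity = 93/235 (39.57%), Positives = 134/235 (57.02%)".
# "Posi?tives" also accepts the misspelled form "Postives".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return recomputed and reported percentages for one HSP summary line."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        # Percentages recomputed from the raw column counts.
        "identity": round(100 * int(ident) / int(ident_len), 2),
        "positives": round(100 * int(pos) / int(pos_len), 2),
        # Percentages as printed in the record, for comparison.
        "reported_identity": float(ident_pct),
        "reported_positives": float(pos_pct),
    }

if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 83/220 (37.73%), Positives = 124/220 (56.36%), Query Frame = 0"
    )
    print(stats)
```

Comparing the recomputed values with the reported ones is a quick sanity check when alignment text has been copied or re-exported.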

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
XP_038885001.1	0.0e+00	76.49	histone-lysine N-methyltransferase ASHH2 isoform X3 [Benincasa hispida]
XP_038884997.1	0.0e+00	74.82	histone-lysine N-methyltransferase ASHH2 isoform X1 [Benincasa hispida] >XP_0388...
XP_038885000.1	0.0e+00	74.23	histone-lysine N-methyltransferase ASHH2 isoform X2 [Benincasa hispida]
XP_011657417.1	0.0e+00	72.89	histone-lysine N-methyltransferase ASHH2 isoform X1 [Cucumis sativus] >XP_011657...
XP_031744047.1	0.0e+00	72.89	histone-lysine N-methyltransferase ASHH2 isoform X2 [Cucumis sativus]

Match Name	E-value	Identity (%)	Description
Q2LAE1	2.5e-212	35.15	Histone-lysine N-methyltransferase ASHH2 OS=Arabidopsis thaliana OX=3702 GN=ASHH...
E9Q5F9	1.1e-55	36.90	Histone-lysine N-methyltransferase SETD2 OS=Mus musculus OX=10090 GN=Setd2 PE=1 ...
Q9BYW2	2.5e-55	36.61	Histone-lysine N-methyltransferase SETD2 OS=Homo sapiens OX=9606 GN=SETD2 PE=1 S...
Q9VYD1	1.4e-50	40.14	Probable histone-lysine N-methyltransferase CG1716 OS=Drosophila melanogaster OX...
Q84WW6	6.0e-49	39.57	Histone-lysine N-methyltransferase ASHH1 OS=Arabidopsis thaliana OX=3702 GN=ASHH...

Match Name	E-value	Identity (%)	Description
A0A0A0KDR4	0.0e+00	72.89	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G376290 PE=4 SV=1
A0A5A7UPQ6	0.0e+00	73.03	Histone-lysine N-methyltransferase ASHH2 OS=Cucumis melo var. makuwa OX=1194695 ...
A0A6J1JXF7	0.0e+00	73.23	histone-lysine N-methyltransferase ASHH2-like isoform X1 OS=Cucurbita maxima OX=...
A0A6J1K197	0.0e+00	73.04	histone-lysine N-methyltransferase ASHH2-like isoform X2 OS=Cucurbita maxima OX=...
A0A6J1FEZ0	0.0e+00	72.89	histone-lysine N-methyltransferase ASHH2 isoform X1 OS=Cucurbita moschata OX=366...

Match Name	E-value	Identity (%)	Description
AT1G77300.1	2.2e-240	37.07	histone methyltransferases(H3-K4 specific);histone methyltransferases(H3-K36 spe...
AT1G77300.2	1.4e-178	36.88	histone methyltransferases(H3-K4 specific);histone methyltransferases(H3-K36 spe...
AT1G76710.1	4.2e-50	39.57	SET domain group 26
AT1G76710.2	4.2e-50	39.57	SET domain group 26
AT2G44150.1	3.4e-39	37.73	histone-lysine N-methyltransferase ASHH3

InterPro
Analysis Name: InterPro Annotations of Chayote (edule) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR006560	AWS domain	SMART	SM00570	shorttest3	coord: 1037..1088; e-value: 6.9E-23; score: 92.0
IPR006560	AWS domain	PFAM	PF17907	AWS	coord: 1052..1086; e-value: 7.1E-13; score: 48.4
IPR006560	AWS domain	PROSITE	PS51215	AWS	coord: 1037..1087; score: 18.230049
IPR003616	Post-SET domain	SMART	SM00508	PostSET_3	coord: 1214..1230; e-value: 5.6E-4; score: 29.2
IPR003616	Post-SET domain	PROSITE	PS50868	POST_SET	coord: 1214..1230; score: 10.08443
IPR001214	SET domain	SMART	SM00317	set_7	coord: 1089..1212; e-value: 2.1E-37; score: 140.3
IPR001214	SET domain	PFAM	PF00856	SET	coord: 1100..1206; e-value: 5.2E-18; score: 66.0
IPR001214	SET domain	PROSITE	PS50280	SET	coord: 1089..1206; score: 19.262857
None	No IPR available	GENE3D	2.170.270.10	SET domain	coord: 994..1243; e-value: 3.9E-82; score: 277.8
None	No IPR available	GENE3D	3.30.40.100		coord: 922..982; e-value: 2.6E-17; score: 64.7
None	No IPR available	PIRSR	PIRSR009343-2	PIRSR009343-2	coord: 1066..1229; e-value: 3.6E-27; score: 93.0
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 409..428
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 430..444
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1851..1900
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1664..1706
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1670..1701
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1955..1971
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 472..519
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 815..850
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 406..449
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 829..850
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1885..1900
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1955..1979
None	No IPR available	PANTHER	PTHR22884	SET DOMAIN PROTEINS	coord: 413..1916
None	No IPR available	PANTHER	PTHR22884:SF413	HISTONE-LYSINE N-METHYLTRANSFERASE, H3 LYSINE-36 SPECIFIC	coord: 413..1916
None	No IPR available	SUPERFAMILY	82199	SET domain	coord: 1018..1227
IPR011124	Zinc finger, CW-type	PFAM	PF07496	zf-CW	coord: 925..970; e-value: 2.9E-10; score: 40.0
IPR011124	Zinc finger, CW-type	PROSITE	PS51050	ZF_CW	coord: 919..973; score: 12.173474
IPR044437	SETD2/Set2, SET domain	CDD	cd19172	SET_SETD2	coord: 1088..1230; e-value: 7.87834E-86; score: 274.458
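
The coord values above are residue ranges on the predicted protein. Treating them as 1-based and inclusive (the usual InterProScan convention, assumed here), a domain's length is end - start + 1. The sketch below uses hypothetical helper names (`parse_coord`, `span_length`, `overlaps`) to read three of the SMART ranges from this table and check that the AWS, SET and Post-SET hits lie one after another without overlapping.

```python
import re

# Matches the "coord: START..END" field used in the annotation table.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def parse_coord(text):
    """Extract (start, end) from a 'coord: START..END' field."""
    m = COORD_RE.search(text)
    if m is None:
        raise ValueError("no coord field in %r" % text)
    start, end = int(m.group(1)), int(m.group(2))
    if start > end:
        raise ValueError("start exceeds end in %r" % text)
    return start, end

def span_length(span):
    """Length of a 1-based inclusive range."""
    start, end = span
    return end - start + 1

def overlaps(a, b):
    """True if two inclusive ranges share at least one residue."""
    return a[0] <= b[1] and b[0] <= a[1]

if __name__ == "__main__":
    # SMART rows copied from the table above.
    aws = parse_coord("coord: 1037..1088")
    set_dom = parse_coord("coord: 1089..1212")
    post_set = parse_coord("coord: 1214..1230")
    print(span_length(aws), span_length(set_dom), span_length(post_set))  # 52 124 17
    print(overlaps(aws, set_dom), overlaps(set_dom, post_set))  # False False
```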

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Sed0009206.1	Sed0009206.1	mRNA
Sed0009206.2	Sed0009206.2	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0010452	histone H3-K36 methylation
cellular_component	GO:0005634	nucleus
molecular_function	GO:0046975	histone methyltransferase activity (H3-K36 specific)
molecular_function	GO:0005515	protein binding
molecular_function	GO:0008270	zinc ion binding
molecular_function	GO:0018024	histone-lysine N-methyltransferase activity