Sgr021830 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr021830
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: histone-lysine N-methyltransferase ATX2-like isoform X1
Location: tig00153840: 334032 .. 369781 (+)
RNA-Seq Expression: Sgr021830
Synteny: Sgr021830
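The Location field above packs the contig, start, end, and strand into one string. As a minimal sketch, it can be split into its parts and the genomic span computed. The helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Split 'contig: start .. end (strand)' into its components.

    Assumption: coordinates are 1-based and inclusive, so the
    span length is end - start + 1.
    """
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "contig": contig,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,  # inclusive span, per the assumption above
    }

loc = parse_location("tig00153840: 334032 .. 369781 (+)")
print(loc["contig"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 35,750 bp on the forward strand of contig tig00153840, introns included.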
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCATTCCCTCTTGAGCAGCGACCGAAGCCGCAAGTTATGGATGGGGAAGACGGAGAAGATATCAATATCGATGTTTATAATGCTGGGACTCCGGTTCGGTACCTCTCGCTTGATCATGTCTACTCCACTACGTCTCCGTTTGTCAGTACAAGTGGGTCTTCGAATGTCATGTCCAAGAAGGTGAAAGCCCGGAGGCTTATGGTGAATCGCTTTGACGATCTTAATTTCAAGCCGCCTCGTCTGCTCCATGTCTATTCTCGCCGCCGTAAGAAACCCCGCCATTCGTCTGCCACTCCCTCCTTCTACGACTCTTTGGTTGAGAAGGTCGAATTGGGGTCGAAAGCTGTTCTGAAATCCGAAGCTCGTGAGATGGATGAGATGGTGAATAGTGCAGACGACCATGCAGACGACTTCGAAGTTGACAGAATGCCGAAGAAGAAGAAAAAGAAGAAAGATAAGTTTGGGTTCAATGAGCTTGTTAAATTGGAGGTCGATTCCAGTGTTTTTCGTGCGATGAATGGTCCTAGGTTGAGAGACTGCCGCACTCATAACAATAACAATCTAAACGTTAGTAATTCCGGACGGAGAAGAAAGCGCAATTCTTCAAAGATTTCTGAGAAGACTGTATTGAAATCTCCTTCTGCTAAGAGATGGGTCAGGTACTGGTTTTTGATTTTCGAGTTGTAATTTAAGGCCTCGGAGGGAGCAATAAGCTTCTTCGAAGCTGAAATGTGTCTATGAAGTATTTGCTAGATAGTTTTTCCATCTTGATTTTTACGTAGTCGTTTCTCACTTAGTCAATTGTGTTTACGGGAATATCATTAATTATAAACTCACAGGTTAAGTTTTGAGGATGTCGACCCGAAAGTATATATTGGATTACAATGCAAGGCAAGTTCTTCTGTTTATACTGCAGCGTTCGATTTGGATCACCTTTTTGGGATTTGGTTGAAATAATATGGTTCTTTGGGACATCACAGGTTTATTGGCCATTGGATGCTGATTGGTACTGTGGTCGTGTTGTGGGTTATACATCAGAGACTAATCGTCATCATGTATGTTCTCACATATTTAGTTTTTGAACCTCACAGTGGGTGTTTGAAAAAGTACTTTTTGTTAACTGTAAATTATGTGTCTTCTCTTGTATTGATCGCACCAGTTGGCTTTTGGTTGTTGGTGATCCCATTTATTCTTACATACAAAATTCAATGTAGATTGAATATGAAGATGATGACAAAGAAGATTTGATTCTTTCAAATGAGAAAGTCAAATTTTATATTTCTGGTGAAGAGATGCAGTCTTTGAACTTGAGTTTTGGTGTTGATAGCGTAGATAGTGATGCTTATGACTACAATGAGATGCTTGTTTTAGCAGCAAGCTTGGATGACTGCCTGGAACCTGAACCTGGGGATATTGTCTGGGCCAAACTTACTGGTCTGTGCTTCATCCTCTTCCCTTTTCCCCTTTTTTCTACTCTTTTTCCCACTCTTTTGCATGATTAGGAAGAAATGCCCCAGGCAAGAAAACTGAAGTAACATGTGATATTTGAAATTAAGAAAGATAGAGAGGTCACCGTGCTTCCTTGTTTAATTTAGAGGTTGAATTTTGCTTGACATGGGTTTAATATTTTCACATTTTATCACATAATGATCATTTGAAACTGAAATATTCAATGCAATTTCTGTTGTTTTTAGATTAATTTAAAACCCTATAGTTTGGATCAATAGCAAAGCATTATATCCAATACTAACCATTTACTTATTGGATTATATATATATCTATCTATCAATCTATCTGTATATATTCCAAAAAGTATGGTTCAATTAATTTTACTTGTATTATACTCCCACAAGTACAAACCATATGATTGGAAGGCTCTTGGTGGTGTTCCAAACAGCTTACTGCCTCTTGGTGGTGTTCAAACCATACAAAATTCTTTTGTAACTACAGTCTTTAATGATTATTAGCGATTGGAAGGCTCTCCTTCGGTAGTTCTT
TGGGCAGGGGTTACCTCTATCCCCGGCCCCTAGGTTGTACTTTTGGTGGCCTTCTTTTGGAATATAGTTCTCATTTCTTATATATAAAAAAGTACCCCCACAAGATTAGAAATAATCCATTTTCCATTACTGGTATTTTTTGGTGATGTGAAATGGTGACATTTCATTCAATATGAAGTAGATGTATTGCATGAAAGAAGTGGTTTTGGACGTGAAAACTTGACACCACATGCATCTTATGGTGAAGAAATAAAATCCATTTATCCAAACATGAGAATTTACAAGTATGGTCTTTAATTCTTACAGTAAGGAAAAGTGGTATAGACCACAACACAAGAGTTAGACACCCGGGAAGGCATGGGGGTAAATACCTAGTTCAAATTTCTAATGTTATTTTGTAGCTGTCTCGTAGACCATATTGATCATTTTCCTAGTAGTTTCCTCAAAATTAAGATGTCAATCCAAATATTCTAAAAGTTTCTCTCTCCTACATAGCTCTAAATAATTGAAAAAAGTTTCTCAAATGGTCTCCCTTTGTTTTGATGGTGTTTTATTTACTCAGGGGGACGAGACTCGTGCTGGTTGAGAGGATGAATGGATTGACAATGATCCCCACTTCCTTTCTCTGTCTTTACTGTTTGTCATGAGGCACTGCACTAGTGACCCCATTGCTGCTTTGTTCTCCTCTTCACTGTTTTGAAAAGCGCGCCTAGGCGTGCGCCTCAAAAAAGCCCACACAGGGCGCCTTGCTTCAAAGAAGCGAGGCGCCCTCACCAAGGCGCGCGCCTTAGCGCCTCAAACCTTTTTTTTTTTAAATTTTCTTTTTTTTTTAAATTTTTATGTAAAGAAATTAAAATTATCCTTCAAAATCATTTTTTCTCTAAATAATTTCTTAGAAAACTCCTAATTTCTTTATTTTTTCATTGATTCTTTTATATATATAAATAATTATCTACATTTTTCTATTGTGCGTCTCACAAAAAAAAGCCCTCGCTTTTTTTGGCGCCTTACGCTTAAGCTCCGGAGGGCTATTGCGCTTTGGTGTGCCTTGCGCTTTAAAAAACACTGCTCCTCTTGGTCGGTTTCTTAGAATTTTGGTTTTGACGTGCTGTTAATGATAGGAAGGCTTGCGAATTAGCCTCCTTGTTGTCTTTTATTTTTTAGTTTATTCTTGAGGAGAGTTTGGCATTGTTGGTCTCTTGATTTCTTCTGGAATTTCTCCTTTGGTTCTTTCCTTCTGGCCTCCTCCTTTTGGAGGCTTTATTGGATTCTCTCCCAAAACGCTTAGTAAGGTAGAAAACTTGGAGGGTTGTGGTAGAGTGCCAGACAATTATTGTAAATTCGTCCCCTTAGAAATTAGAGGCCAGAAGGATGAAACCTCCTTAATTGCCCAAACGGTCACCTTCAATGATTGGAGATTGCTTGTTGGAAGAATGGTTGAAATATGTGGTGGTTCCTCAAAGGATGAGCCTGTTAGATATCTTTATCAGAATGGCAAGGAGAAAGATCTTTGCCCAGTGGATATTAGAAGCTCATAGGATAATAGCTCCATTCCTTGTGTTTCCTTAAAAAGTACTGATAGGGTGGCTCAAGAAAAAGAAAACTTGCTTCTTCCACATGACAGAGGACCTTTGGGTTGCAAAGATGATTTGCTTTCTTTGGTAGATAAATTTGGTGATACAACAGAGGAGTTCATGGTTTTGTGCAGGGTTAGAACTTGCTACAAATGATGGGTCGCATCAAATATCCCATTTATAAGCTAAGATAAAAAACGGGCCCACAATGCAAGAGGAAAACTTGGCCTTTGGCCCTTTTGCCCCTAAAAAATAGGTCGTAGGAAAGTCCTGGATTAAAACTTTCCTAAAGAAAGGCAACATTGATTCCTCTTCCTCTTCTTTAAAAATTGATCTTATTACTCTAAAAGGGAGCCCCTTTTATGAAGGAGGACCTAATTTCAACCCTTCCAGCTGTGGTGGAGATAGCAAAGATTTTAATGGGGA
AGGAGAAAAGCTTTGATAATAATGGTTGAGGACCATTGTGAAGCTGTCGTAGACTCTTTAGTGAAGATAACCTTGCTTTAAATGACATTGAAAATTAAAGGTGCCCAACTCCCCAAATTCAGGGGCAAGACCCAAACTTCCCGGTCATTTAGATGGAGGAACGAATTTCAATGTTGCCCTTCGATGCGACAAGTCTCTTGATTGCAGGCGAGAGGTGACAAAAAGGCATTGCTTAGGTGAAGTGAGACTAATAAATTCAAGAAAATATATAGTATAAATTTATTTAGGAAATAAAGCCGCAAAAAGCCTTACAGAATAAGATTTAATACATCAAAATGTCAAGTGAAAAATTAAAAAAAACAAGGAAGAAAAATTAAAAAACAGTAACGTTTTAAAAATAAAGAAGAACGAGTATCTAAAAACATTAATAAAAGAGAGCTGGTTTTCCCCAAAAAAGAAGAAATAAAGGAAATAGAAAATATATTAGAAATAAGAAATAGCTAAGAATCATCTATAGATGTTAAGAGTCCTCTGTTGTGAAAACAATGACTCTTGGAGTGAACCTATCGTGAGAGGCTGAGAATCAATTTTTGAACGAAAATCAAAAGGAAATTGAAAAACATGAATCTGTTGTACCTTTTTGTGATGCTTCAAGCTTCAAAGTGTTGCAATGGAGAGTAGATGGGTTTTGTTCATAAGTGTAAGAACTTGCTGCGTTTGCCCACGCCATCTTCTCTAAAATCGGAAAGCCGAATAGATTGCTAAATTAAAATTGAAAGTTGAAATTGATAAGGGAAGAATAAGCGAGAATTGAAGAAAGAATGAAAAGTGGGGCTAAAATTGCAGAAGGAAGCAAAATCAAGGATTGAAGAAGGAAGATGAAAGAAATCGTTGGGCATAAACACATACCTTCATGTAGGCGAGTGCTTTTGTTGCCATGATGGATAAAAATGCAAAGATGGAAAACTCCTCATTTCTCTTAATAACAGTTGGTTTCTTATAAATATTGATAATTCTTGTAAAGGGATGATTTTTGTACTTGAATATATTTTTATTACTTCTTTTGTTAGTTAAATGCAGGTCATGCTATGTGGCCAGCAATTATAGTGGATGAATCACTCATCGGTGATCGGAAGGGCTTAAGAAATATTTCAGGAGGAAGAACAGTCCCTGTTCAATTTTTTGGTACACACGACTTTGCAAGGTTTGTGAGCTTTCTGGTTTAAAATGAACTATATCTGTCTGGTATAATGTACTCCAAAACTTTGAACCAAAAAAATGCAGAAACATTTTGTTTGGAATGCGTGTACTGACCAGTTTCTTTACTGTACAGGATTAAAGTAAAACAGGCGATGTCATTTCTCAAAGGTCTTCTTTCCTCTTTCCACCTGAAATGCAAGAAACCGCACTTCATTCGGAGCCTAGAAGAGGCAAAAATGTAATGCTGTAATTGGTTTGGGTTCTTATATCAAAAAAGATATGATTATGACATTTCTTGCTCACACAGTCCCTATAATTGTTCATCTCATATGACACTCATACTACTGTCCGACTGCTGATTGTTGTTTGCGGATCATCTCTGTCAACATGCAAAAGTACATGATCAGACCAACTTTACTTTGAGTTGGCAAATTCCTATTGATTTAGTTTTTTTTTTTTTTTTGATAAAAGATTCCTATTGATGTAGTTTTTCCCGCACCCCTGTAATGCAATGTCACTGGGAAATTAAATTTTCATATAGGTACTTAGAGAGCATGGTTCTATGTATATCTGTCTCCATTTTCAATTTAATACGGCCATAGCCTCATGGAGATTCCAATAATGTAAAATTAAACTGAGTATGATGACGGAAGGGGGATTTGGTTGGTTCTTGTTACTTCTTGCTAAGCTGTGTTCTCTCTCTAAGATGGTGACTGGTAGAAATATTCTTTACTGACTTATTCATGTCATTGAAGCAGGTATCTCAGTGAGCAGAAACTTCCACCAAGTATGCTACAGTT
GCAAAATGGAATTGAAGTGGATGATTTTGCAAGTGCAAGTGGAGAGGAAGAAGCGACAACAGATTCAGGGGAAGAATGCCTAAATGAAGGAGGGATGCCTTGTCCACACAATGGATATGGATCATCTCCATTTATAGTTGGGGATCTAGAAATAGTAAGCCTTGGTAAGATTTGATTTGCAATTTTTCCTAATGTATTATAGGATAGTTTGAGACGTCGATTTTCATACACCGATTTGGAAATTGTAGGCCATGATAAGATTTGAATTGCTATTCTTCCTTAAGTTATATAGGTTAAATTGACATGTCAGTTTTCAAGAAGCAAGAGAGTGACGCAGGAAAAATGATCGAAATCGTCATTTTACTTATGGCACATAACACACCTTTGAGGTAGTATGGTAACCTTATGCTGTGAAGTAACATCAGTATTAATCTTTCTATGAGCAAACAGACCAAACAAAAAAAAAGTTGTTCCCTTTGGTAAGCTATCATAGATCTTTCTGTGAGTAGCACACACCCACACAAACACGATTCTTTGGCAAGCTATCACTAATCCTTATGTGAGCAACTCACCGAACAAAACAAAATTGAACTTTTGTTGGGACTTTTGTTTGCCAAATCATCGATTAAGATAATGCGTACTTTTCTTTTTTGCATTACCATCTTTATAAATGTGGTGGGTTGATGGCTAGTTGTCAAAATATGTGGCATCACAAGGGGTAATTGTATTTTAAGCTTTGGCAAACTGTTTAGTAATGTTCCTGAGCATTTTTTTGGTTCTCAGGGAAGATTGTCAAAGATTCTAAATATTTTCAGAATGATGGGTCTGTATGGCCTGAAGGGTATACAGCTGTGAGGAAATTTTCTTCTTTAACTGGTGGATATTTGAACTTTTGTAATTATTATTTTTGGGATTATCTTTCTCTTGCAGATATTTCTAGTTTATCTTTTGACTGTGATGATTTCTAATAACATTCTTGTCATCTCTTTCGCTTGTTTCCAGATCCCAATGTCTGTACCTTATATAAAATGGAAGTTTTGAGAGATTTTGAATCAAAATTTCGACCTTTATGTAGGGTAACTTTGGATAATGGAGAGCAGGTCAGTTCGTTCAATTGGATAGATATACTTGCATTTGCATGTTAATTCTCTGTTTACTGCATTATTTGATGGCTTCTTTGGTGCTTTAAAGTTACAACTATAAATTTACTTTTCTTGCCACTAGATATATCTGGTTTATTATTATTATTTTTTAATTGGATGGAACCCAGATACTGTATTTTTTTTTTTTTGGATAAGAAACTGGTATTTCATTAATGAATGAAATATACAAAAAGGGGATGGACTCCCAATGAACTGGATTACAAAAAAAGCCTTCCAGTTTTTCATGAGGTCATTTAGACTATATTCTGTGAAAGGGAAGAAAAAACATTTACACCATCCAAAAGCCATAACAAAAATATTATCAAGGAATCTGTCAAAGGGTAGAGATTTGTTGTTGAAGATTCTGTTGTTCCTCTCCATCCATAGGCATAAAAAATAAGCTCTGATTATCTGCAACTCAGATACTGTAATTACTTACAAACTCAGATCAAAGTGCCGTTTTAATGGAAAATCATTATGGAAAACCAGTCAATTTGAGAAGTGTAAATCTCTAAGAATTCTTTATTGTTATGCATTCTGAGAATATCAAACCTTATCTTCCATTGCCCTTTTAGTCTTACATCTTAGCTGTTTGGGATTCACTTACCATGAATTACTTGTTAACTAATGCTACTTTTTTTGAACCTCATATGCAGCAAAGGCTGTCAACGTTCTCTCTTTTGCATGTGCTATTCTTCACATCAGATTCTGCATTTCATTAAAATTTTTTGTTAGCATTACTTTAATATTGTAACTTCTTAAGTAATGAGTTTCTCTATACTGGATTTTCATTGTAGAGGTTTTTTGATTGCCTTCTTTCCTATTAAAAAATGTCAAGGTTATTAATGCAAGCCTTTT
CCCTTGTTCGTCAACTCCTGTGTATTGGTATAGTTTAAAGGATCCTCTCCATCTGCTTGCTGGAATAAAATATACAAAAGGATGAGGAAAATACAACATATTTCTGATGCTTCTACCGAAGGTAGAGGGGAAACTGTATACAAGTCCGGTTCTGACATGTTTGGTTTCTCTAATCCGGATGTTAAGAAACTCATCCAGGTATACATCAATTTTCTTTTCCAATTCAAATGTCTTCCTCTGCCTTTTTTTTTTTTCCCTTTTCGTTTGGTATTGCAATTGTATCAATTTGGTGCACAAGTATAATTTCTGCTTACACCTTTTGTTTTGCTAAGAAACAAAGAAATTCTTGTTTCTTGTAAAGAAAAAAAAAACAAGAATTTCTTTGAATGAATGGAATTACAGAGGAGGAACTACACCTACAAACGCCAAGGAGTTGTAAAGCTCTCTAATTGGAAATAATAGAAGAAAAGGGTATTGACAAAAAGAATTAGAAAGAAAGCTTCAAGACGAGGCTAGACAAACTAGAAGGTCCCATATCTCATGTCAATCTTTTTCTACCCTTGAAAAATTCTCTCATTCCTCTTTTGTGACACTCGGCAAAAAAAAAATCTGTAGTTGCATTCAAATGTAGCACTTCACCCTCCTTTTGAAGTGGTGAACACAGAGATTCATGTCTAACCTTGCTGGAACATTGAAGGGACGGCAACAGCAAGCCTAAACTTTTCCACAAATTCCACCAACATTTAAGAGTAAAAGGCTGATATACATAAAATGATGATCTCAAATTTCAGCAAAGGAATAAGAATACACGAATTTGGGCAGATACTGATTGAAGGATTTTTTTGAAAGGGAAACAAAGATGTGTATTAAACCAGCCACTCAAAAAACAACCTAAAGGAGTCCCTATGCCAAGGAGTAATCACAAAAAAAAACCTCCCCTTTAAATTAATTAGAAGGGAAGGGGGTATTACAAAAGGAATTGAGAAGAGCACTCCAATTGAGGCCGCATATTCTACACGATTGCAAAAATTTGCAAAAGAATTAATTTATCTAAGAAGACAATAGTGTTTCTTCCAAAGGAGCCAAAGGAGGGCTCTGCTAGCATTTGTCCACAAAACTCTTGCTTTTACTTTGAACCACCAGCTGGAGAGAATTTCTATAATTCTGTCTTCCATCGTTTTAGGGGGCACCAAGCAAGATCATCGTTGGCATATGAAATTCCATCCCTTAGCAGAAAATTACAGTGAAAAAAGAGATGGTTAATTGATTCCTTGCTGCCGCCGCACATCAAACAAATTTGAGGGGATAATGAAAGTTGGGGATTCCTTTTTGGATTCTATCTGCGGTATTTATACTGCCATGAGCTAAATAAAGTCCAGATGAATTTATAGAAGGATTTTGAAGCTGGATTTTGTCAAAAGTGTTTAAACGTCCATGGGCTGCAGCCTTAAGAACACTCCACCATTTTTAGGGTTGGAATGGCTCCAAAGGACCTTAAAAAAATGGGCGCTAATGCCTAACCCACTAACTGTTAGCTTATTGAATAAGGACATAACTGAAAAGGAACTGAGCTCTCTAGCAGTCAAACTTTGTTCACCCTCCCTCTAGGTGTAGTGAGATTCCCATGTTCAATGAGAAGTATCATCTTGCAATTTCTAAGTTGTAACTTCTGATTGCTCATTTCTTCAATCCACAAACTATTGATGGACTTGTTGTGCTTGCAGTTAGTGTGTAAACCCCTGGAAAATAGATGCTAAAAGGGGATTCTTCAAGCAACTACCTTTCTAGAAAAGGTGTGACGACCACTCCCTGCCTTGTGAGTTGAATCCTGCTTGTCAATATTTCTAGATTCAGCAATGGCAGGCCAAGGACTTCTTGAGTTGGCATTTTTTTTGAACAAGAAACAACTTCTTTCATTGATAGATAAAAAGAGACAAAATGTTCAAGGAGAAATAGAAAAGCCAACAACCAACCAAAAGAAAAATGCCAAAACAACAGCC
CCTAAAAAGAAATAAATCAAGAACAAACCAAAAGAAAGAAAAAAAACACCAGCCATAACACCAAAAACAAACCAGAAAACATCAAAATCAGAGAACTCCTTCGTAATAACCAAAAACCAAACGATAGACCAGCAAACTAAAAAACCCCAAAAAAATTGGAACTCAACTCAGAACGTTTCACCCTTTTGGAATCTGAAGCTGCATATATGTCTTTATGCTTGTTATATATGCTTGGGGTTATACTCCTAATTGGGGTTTTGTTGGAATGTTTGATGAGCTCCTAATAGGAGTCTTTTTTAATTTTACTTATATCTTTTCATAACATCAACGAAATGTTTGTTTCCTTTAAAAAGAAAGAAGTAACTGGACTGTATCTTGAATACTATTAATGTTAAAACAATGAATTTCTTTGTTATTTCTTAAAGCCTTAGTCTATTAGTTGCTAGGCTGCATTATATTGTAATAAATCTTTGGTTGTTTAATTGTGCCTTGTTTCACTCCAGCAACACTTGGTGTGCCTCTTTGCCTCCAGGAGGCCAAGAAGCTTGTGCCCTTGGAGTGTATCTTCTAATTTAAAAATACTGATTGAAATACATTGTTTGTCTTTTTTTTTTTAATAAATAAACTCTTAACTCTTCAGTATTATTATATGGATATATCATATATATATTTCTTGGGTGAGGTTGGGACCATCATGTCCTTTGAGAAAAACTTACTATTGGCTTTTTTTGCTTGTAATAAACTGTGACGTTTCAAAGTAAAATTCCTTTTTTCTTTTGACAGGGGATATCTAAATCTGGACTTTCTTCTTCCAGATCCTTGAGCAAAGTAGCCTCTAAAAAATACAAAGATTTTCCCGTTGGTTATAGACCTGTTCGTGTTGATTGGAAAGACCTTGACAAGTGCAGTGTTTGTCATATGGATGAGGTGAGAGCTGGCCTCTAGTCTTTTATATTCATTTAATTCCTTTGTCTTTGTAATCACTCTTTTTATAGAAATGACATCTTTTCTTATCCATAGTAAAGTTACTTGTGTCTTTTTTTAATATCTTTTTTATAGCCTTCATCCGTTGCTGATATATGTTTTCCTTTTTCTTTTCCTTTTAAATATAATGTTTGGGACTGTGCAAGTGAAGTTGTTCCAGTGGTTATTAGTTTGTAGAAAGTTAATGTTTGTAATTTACCTTAGTAAATGAAGTACAAAATTCCCAATCAGTGTGTAGTTTGAAGGGCTTTCTTTTCTTTTTTGGATCAATCAGATTTGGGGTAGATGGATGCCTTACTTTTAGTTGCTAATTGTAATAAAATAAAGATTAGTAATATTTTGTTTCCTAAAAAAATGAAGGAAACTCTAGGACTCTTCTGTTGTTTTCTTAATATTTCTTAAGAGATTTTGACAGGTTCCATAGCCAAGAAAAAAAATATTTATTACTCGAAGATAGGAGAGAAAGACACTCAAAAATATTCTCAAATAAGTTTGATTCTTTTGTAAATTTTTGTGACTGTATATGGTGTATAGCCTCCAATTGGAGTGCTCTTCACAATTCCTTTTGTAATACTCCGTTCTTTCTAATCAATTTAGATTAGAGGCCTTTTGTGTGATTCCTCCTTCTCAGCATAGGTTCCTTGTCTCTATGGCCTCTAGGTTGTTCTTTCTGTTTGGCTGTTTTATATACATCGTTTTTCTCATTAAAAATAAAAAAAGAAAATAGGAACGCTGTTAGTTACAGGTGGATGATGGCCTTTTTTCTGGGTTGTCTCTCTCCTGCTAGCTCACAACCAATATTTACTTACCTCATCCTCTATAGCCTTCCCTTATTTGAAACTTTCTTACCAAAATCTGCTTAACTCACTCTTAATCTACTCTTTTTTGTGCATTAGATCATGTTGTTGAGACTTTTTTTTAACATCCATGTTATTGAGACTGAATTTGTCCCATTTACCCCTTTTTGAGTAAGGGGCTTGAATGGGCTCTTTTATTCCTCGACATTCATTTTTCGA
TTCTAACAATGTTGTTGTTGCCGTTAATGCTGTATAAGTGTTATGCTAATTGAGCTTCCCCTTCTTTTTCATAGGAGTATGAAAATAATCTCTTCTTGCAGTGTGACAAATGCAGAATGATGGTATGTTTACTTTTTCTTTTCATAATTTTAATTCCAAATCTCATGTAAACTGGTTTTATTGAGGTGATTTAGCTTCAGGTTGAAATCAAGAATGATAATGATCCACGAAGGCAAAAAATTTAATGAATTATAAATGCATACGTATAACTGTAGCAGTTGGTTAGAATGTCACATCTTTAATTTCACTGGTTTATGTGGATTGCATTAGTTTAGCATTCGTGAAATGAAATTTCTTGCATGGATTATGCATCAAATAGATTTATGAGCTAATCTGTTTCCTTGTTTCCTTTGTTCTTTACCCTTTTAATCTCTCTATCCTGCCTTCGTAAAGATATTTTGTATTGTTCTTTTAGATTGTCATATTGAAGTTTACCTGGAAATTCTTAATGTGTTTTATGTTAGGTCCATGCTAGGTGCTACGGAGAACTAGAACCAGTTGACGGAGTGTTATGGCTGTGCAACTTGTGTCGGCCTGGGTCTCCTGATTGTCCCCCACCATGCTGCCTTTGTCCTGTCATAGGTATTTAAATTGTCGAATTATATTGTACAGCTGTATTATTGTGTAGTTATTTCTTTATCTATATAGAGCTAGATACTGTTTCCTTGTATGTTCTATCCTTCTAAATCTGTCCTCCTTGTAATATCTATTTAGCCTAGCATTGTATTCATCCTTATAATCAAATAAGAATATTTTTTCTTATTGTTGACATGGTATCAGAGCCTTGATGCTCTTTTCGATTGGCCTTCATTGCCGATTAACCCTCTTTTTTGATTGGCCTTCATTGCTGATAGAACTTTTGCGTGGGTATTCAAAGTTTGCAACTTCTCTTCCTTTGATGGGTATTCAGCCTTCTTTTTCCGGTTAATTGTTTGGGTATTCTGTAGCCTTGTTCGAAGTTCCATTCTGAACGTGTGGGTTTCTCCTTCTACGGTTGATGCTCGTTAGTGGTTCCTTGTTTGTTCGCTTGTTTGTGGCTATTGACGCTCCACTCCATAGGCTTGTTCGTGGGTATTGATATTCACGTGGGTATTGACATTTGCGTGGGTATTGGTGTGTGTTCGCTGGAGGTTTGTGGGTTTTCTTCGCGGCTCCACTCCACTTCGTTTGTTTGCCTGCTTCTTCTGCTACCTTGCATCTTCTTCAACTGATGAACGTCCTGGTGTTTTTTTTCCCTCGATTGGATCTGTCTTAACAACTCTGTGAGTTCTCTTGGACTCAACTAAAAATACCAATGAAGCTCTTACGGAGGAGACCAATAAAGTCAGCTATAGGAGACCTTCAGAACATTCATCCTAGTTAAGGTTGGATGAAAGAAATTATCTCAAATGGTCTCAACTTGTTCGCATGTTTCTAAAGGGAAAAGGAAAGCTGAATCATATACTTGGGTTGGGACCCAAATGTGGAGATCCAAAGTTTGATGTGTGGGATGAAGAAGACTCGCTGATTATGTCTTGGCTGTGGAATTCGATGACTCCCACTGTTAGTGATACGTGTATGTTTCTTGACACTGCCAAGGATATATGGGAAATGCTGAAACAAACTTGTTCAAAAGTAAAAGATGCCGCACAAATATATGACATCAAGACTCATATTTTGGTAACAAAACAAGGGAGTCGAACAGTGACAGAATGTTCCTCCATTCTTCCAGGTTTGTGGCAAGAGTTGGATCATTATCAGGTGTTGAAATGAAATGTAGTGAGGATGCTGCTATCCTTAAAAAATTTGTTGAAAAAGATCAAATTTATACTTTTCTTGCAGGTCTTAATGTTGAATTTGATGTGGTTCGAATACAAATTTTGGGAAAAAGATCTATCATCTCTAAATGAGACTATTGGCACGGTTAGAAGTGAAGAATCTGTCGCGGTGTTATGCTAGAA
CTACACACAGAAGGATCCGCCATGAAGATAGGAAGTTTCAAAACAAATGATGCCGAGCAAAATGCACAAGCTGTAGCTATAAAGGAAAATTGGGAAGGAGGAAAATCAGGAAGTAAAGATCATGTAATCTGCTCTAGGGGTGATCATCGGTTGGTCGACGTTTTTGGGCTCAAACCGATGCCAAAATCCGACCATTCGGTTTCGGTTGGTTGGAAACCAGCAGCTTTTTAAGGCTTGTACAAACCGACCAACAGACCATCGGTTCGGTTGGCCGTTTTTGGCCCAAATTGAAATTGAAATGGGGGCTGGTCTGTTTTGAACGCTGGAAAAAGCTGTAGATCGATCTGTCTTGAACGCTTGAGGTAGATGAAGGTAGATTAAAGAGAAGAAAACGTAAGAGAGGGAGAGTGATTGCTGGAGGGAGGAGAGAGCTGGAGAAGGCAACGTTAGAGAGGGAGGAAAGGGCTGGAGAGAAGAAGGGGAAAGTCGGAAGGAGGGGAGACAAAAAATAAGAAATGAAAGGGGAAAAGAGATTTTATGAAATTAAAAAAAAGAGAGTAATAATTTAACTTAGCAGCCACTGGTTGAAAACCAACGGCTGCTTCTCATTGCTTTCCATAAAACCAGCTACTGATTTTATGGAAACCAACAACTGCTGTCAGCAAAGCAGTTTTTTTTTTAGCTATAAGCGGACGGTCGGTCAGTTGTCCCAATTTTGGCTATTTCGACCGACCGACCGGTCGATTTTGCATATCTGCAAACCGACGCCGACCGAACCGGGGTATAGACGGACCGACCGATGTCGGTTTGGTCGGTTCCTGTCGGTTGGCCTGGTTTTTGGAAATTAATGTGCACCCCTAATCTGCTCTTATTGCAAAAAACCAAGGCATACCAAGGATAAATATTGGAAGCTGCATGGAAAACCCACCAATTTTCAGAATTCAAATTCCAATAATGGTCAAATAAAGGTAATGCAAATAAAGGAAATGGACAAGCCCATACTTCGTCAACTCATAAAGGTGAAATGACTGAAGAGAAGTTTGAAAAAAGAGAGCTAAACACAAATGAGATTGAAAAGCTAAGGAGTCTGTTGGAAACCCTACAAAAAGCACCTAGTCAAGGTACATGTTCGGTGGCTTTTTTAGGTACAACCTCTCAACCTAAATTTTCTATGTCTACTACCTGGATTGTTGATTTGGGTGCAATTGATCGTATGACAAATACTAGCTGCAATTTTATTTCTTATAGTCTCTATCCAAGCAGTAAGAAAATATCTAAAGCAGATGGAACCTTAGTAACAGTGGCTGGTCAAGGAGATATCAAAATAAATGCAAACATCACATTAAAGAACGTGCTTCATATACCACGCTTAATTGTTAACTTGACTTCAATAAAAAATTTGATCAAAGATTTGGGATGCAAAGTAACGTTTTTTCATTCTTATTGTATTTTTCAGGACTAGGGCAATGGGAGGACGATTGGACTTGCTAAAGAAAAAAATGGACTTTATTACCTTGATGAACCTACTGCTCAGGACAGTATTAGGGAGATACTCAAGTCTCCCTCAAGTCTCCCTCATCTCTCAGACTCAACTTCCTTCGAATAAGCCCTGGCTGTACCATTATAGACTAGGGCATTCCTCTTTTAGGCCTCTCAAGATTTTATTCCCTGATTCATAAAAAAACCTTGATATTGAGTCAATTCAATGTGGTGTTTGTGAATTTGCTAAACACAAGTGCGCTTCATTTCATGGTAACAATAAACAAGTATCTGTTCCTTTTTCTCTTATTCATAGTGATGTTTGGGAACCTTCTAATGTGTTCAACATTCCAAATCTCGTTGGTTTATTGTCTTCATTGATGATTGTACTCGAGTTTTGTGGGTCTATCTTTTGAAATAGAAATCTAGTATTAGTACTGTTGTTCCAAACTTCTTTCACATGATAAAAAATCAATTTGAAGTTCAACCTAAACAAATTCATTCTGATAATGCCTGAGGTT
ACTTCAATCAAGCCCTAACATGCTTCTTTCAAAAACAAGGAGTCATCTATGAGTCGTCTTGTGTGGAAACTCCACAATAAAATGGGGTGGCTGAAAGGAAAATTGGTCATATTCTTTCTGTGACTCGTGCCCTTCTTTTCCAAAGAAACGTCCTAAATCCTTTTGGGGGGAGGCTGTATTGACAGATGCCTACTTGATCAATTGTTTACCCACCGCAACTCTTGATCAGAAAACTCCCATGACTATCCTATCAAGTTTTTATCTTGACTTCTACACGACAAAGAACTTGATTCCTTGTGTGTTCGGCTCTGTCGCCTTTGTTCATATTCATAATCATTAGCGAAGTAAACTTGATCCACGAGCACTCAAATGAGGTATAAATATTATCATCCACCTTTTAAAAAATACCTTATCTCTGCCTACGTTACCTTTTACGAGACAACAGTTTTTTTACCTCACCTTATCTGTAGGGGGAGAATACATTGTTAGAAGATAAGGGTCTTCCATTTCCTAATCTCACCTTCCCTTCATCTTCTTTACCACCCGCTTCACCTTTACCGAACTTTGTTGAAATTAACTCCAACCATCCTTCACCTTTACCATCCATAGATACCACCATTTCAAGTCCATCTAACCCTAATCCTACTGCACCGTTACAGGTTTATTCAAGAAGGAACAATGGTTAGACTGAGCAAGTCCAAGAGTTTAAACATACCCCTGGACTGAAACTAATGAGAAAGGAATCAGTGATAATTCAGACATGGAATTACCTACTAATGATGAGTCAAATGTGGATTTATCTATTGCTTTGAGGAAGGGAATACGAACTTGTCCTCAACAATCGTTATAACCCTTGTCAAGACTTTGTATCCTATGAAAGACTCAAGTAAGTATAAAAGTTTTGTGGTTAATCTAACCATATAAATATTCCAAAGACATATATAGAAGCATTACAATATGAAGAATGGAGACAAGGAATGAATCTTGAAATGCAGGCCTTAAAAAAAAAATCAAACTTGGGAGTTGGTCACTATGCCTAAAGATAAAAGCAGTCGGATGTTGGTGGGTATATACTGTCAAATATAAAGCTGATGGAACACTTGAGAGGTACAAAGCACAGTTGGTTGCCAAGGGACATACTCAAACTTATGGAGTCGATTACCTTGAAACATTTGCTCCTGTTGCCAAGATGAATACAGTACGAATACTTTTATCCCTTGCTATTAATCGTGGTTGGGACATGTTACAATATGATGTAAAGAATGCTTTTCTACATGGTAACTTAAAAGAAAAAATTTATATGGAAATTCCACTGGGGTATGAATAGGCAGGAAATAAGGTTTGCAAATTGAGGAAAGCACTGTATGGATTGAAACAATCTCTTAGGGCTTGGTTTGGAAGACTTTCTTAGTTTATGAAGAAAATGGGCTACAAACAAAGTCAAGGAGATCATATATTGTTTATAGAACACTCTACTGCAGGGGGGATTATAGCATTGATTGTGTATGTTGATGATATTATTGTTACTGGAAACGATAGTAATGAACGAGATAAGTTGAGAAGATATTTGCTTCATGAATTTGAAGTTAAAGAACTTGGAAAGCTGAAATATTTCCTTGGGATAGAAGTTGCCTATTCTAAACAAGGTATATTCTTATCACAACATAAATACGTGATTGACTTACTCTTTGAAACAGGGAAGCTTGGTAATAAACCAGTTGAAACTCCGATTGATCAAAATCATAGATTATGTATACCTGTGGAGAGCCCTTCAGTAAATAAAGGAACTTACCAGAGGTTTGTTGGAAAACTAATATATCTTTCTCATACAAGACCAAATATTGCCTATGCTGTGAGAGTGGTTAGTCAATTTATGCATAATCCAAAGGAAGTTCATCTTCAAGCATTGTATAGAATACTTCATTACTTGAAAAACTCAATTGGGAGAGGTTTATTGTTTAAGGAGATTAACTCAACGTAGAAGTTTATACAGATGCAGA
TCATGCAGGATCAATAGATGATAGAAAATCTACTTTCGGCTATTGTACCTTCTTGGGTGGAAATTTAGTAACGTGGAGAAGTAAAAAGCAAAATGTAGTTGCAAGATCGAGTTCAGAAGCAAAATTTAGAGCAATGGCCTTGGGTATTTGCGAATTGCTTTGGATGAAAAATATTTTAGAAGATTTGAAGATTCCATGGGATGGAACTATGAAACTTTATTGTGATAATAAGTCTGCAATTAGTATTGCTCATAATCCAGTTCAACATGATAGAACCAGCATATTGAGGTTGATAGACATTTTATCAAAGAGAAGTTGGACATCGGTCTTATATGTACCCCATTTGTGCCAACCAACAATCAAATAGCAGATATTCTCACAAAAGGATTGAATCTTAGCAGCTTCGAGAATTTGGTTAGCAAGCTAGGAATGGAAAACATTCACTCACCAGCTTGAGGGGGAGTGTCAAATTATATTGTACAACTGTATTATTGTGTAGTTATTTCTTTATCTATAATCAGCTCCTAATTGGTAGATGATTGATAGAGCTAGATACTGTTTCCTTGTATGTTCTATCCTTCTAAATGTGTCCTCCTTGAAATTTCTATTTAGCCTAGCATTGTATTCATCCTTATAATCAAGTAAGAATATTTTTTCTTATTGTTGGCAAAATTCATACCATTAAAATATAATTTTTGTATATCATCCTTGTGCTGTATTGAATTGTGAATTCATATTTTCCAGGGGGTGCAATGAAGCCTACAACAGATGGACGCTGGGCTCATCTTGCTTGTGCTATATGGATACCTGGTTTGTGAATTTATTCTTTGGAGTTTTTAGATAATTTAAAATCATAAAATGTATTTTGACTTCCCACATTCTGCAGAAACGTGTTTATCTGATATCAAGAAAATGGAGCCTATTGACGGTCTTAGTAGAATCAATAAGGTAACATCTATGAGTTTCTTAGTGCATAGATAACTTGGTCTTTAGATTCATCGCACCTTTTTCCCCTTCGAACATTTACATAAAAACTACTTAATGCAAAAGTTGGATGTGCTTTGTTTATTACTAATGGAAAAAAACCTAGATTGTGCATTTGATTTATTTGATATAAAATCAAGTCATACATATAGTCTTCAATCTTTATGCTTCCACGAGTATTAATGCCAAAGGTTTTTTTTTTTTTGATAAGAAACAATTTCATTGATTGAATGAAATTACAAAAGAAGGGGAGAACCCCAATCCAAGGGAGTTACAAAAAAACCTCTCTAATTGGATAAAAGAGAAGAAAAACTATAAAGAATGAAGGAGAGGGTTGCATTAACACTAAGAGATAACTAAAAAAAACTATAATGTCGTAAAATTGAGCAAAGGGTAACTCTTTATCTTTGAAGATCCTATGATTCCTTTCTTGCCAAGTAAACCAAAAGAAAGCTCTCATGACGTGTAACCAAAGGAGCTTCTTCTCCTTAAACAGGTGCTCCAACAGAACAAAGTAGAGAAGAGCTAAGGGATTGATGGGAAGAACCATAGACCATCCAAAGGCTATCAGCGTATTCCTTGCTTTGTGTCTTGGAGCTCTGTAGGAACGCCAAAAAGGAGAGATGATAATGCAAGGCATTCTCCTTTGAAGCTTATCCTGAGTATTAATCGCTTTGTGGCTCAACTCCCATAGAAAAAATTTCACTTTCTTAGGGTAGTGCTCTTTCCATATTGCATCATATAAGAGTGGAGACTACTCGGATCTATTGTGGGGGAGATCTTTCAGAAGAGAGCTAACGGTAAACACCCCATTCGATTCCAGCTTCCCAAACATTTTATCCTTCACTTGTGTCAAATGAAGTGTGGATAGGAGTTGTAACAGTGATGCCCATTCTTCAATTTCAGCTTTCTTTAGGTTTCTTCTCATGCATGTATTCCACATTCCAGATTCTACGAGCCAAAATTTTTTTACAGCAGCTGTTTTCTTTGTAGAGAGACCAAAAAGAAGGGGAAAT
CTTCTGTTTAATGGCCCGTCCAGCAGCCAAGCATCATGCCAAAATGAGATAGAAGCACCGTCTCCAACCTTGAGAGCGACCCGGTCAAATAACAAGTTCTGAAGTTTCAAGATGGATCTCCAAGGACCCCTAGCTGTGGCTAAAGAAAAAGTTCCCATATTTTTCAGAGAGGAAGCCAACCCATATTTTGCATCAATCACCTTTCTCCGAAGGGCCGAATGTTCCTGATGATATCTCCAAATCCATTTCACAAGAAGAGCTTCATTTTTTTTTTAAGGCCGTGCAAGCCAAGCCCCCCATCCACCAATGGAAGAAGAATGTCATCCCACTTAACAAGGTGAAGGCCCGACTTGTCACTATTACCCTTCCAAAGGAATTTCTATAAAGTTTCTCCACAGTTTTTCTAACCTTTGATGGCATGGGATAGAGTGACATGTAATATGTTAGGAGATTGGTGAGTGTTGCTTGTACAAGGGCGAGCCTTCCTCCCTTAGAAATGAGAGAGTTCCTCTAAGAATGCAACCTTCTCTCCACTTTCTCGATGAAAGGCTCCCAAAAAGAAAGAGAATGATGTTTATCATTGAGAGGGAGTCCCAAGTAGGAGTTAGGCCAATTTCCAATCTTGCAACCGTATTTAGAAGCCAAAGCCTTCGACTCCGGGAAATCCACATTTATGCCCATGAATTCTGATTGAGTGATTAATAGACAGCCCCGAAACTTCCTCAAAGATTCTAACAAGGCTGAACAGATTAGATAGCGCCCAATCATCGGATATAGAGATGAGAACGGTGTCCTCTGCGAATTGAAGATGGTGAATATCCAAAGAACCCTTCCCCAAATCAAAGCCTTTAATCGAATTATTCTTTGCAGCATGAGACAAAATCCTACTAAAGCAATCAGCAACCATAATGAAAAGAAAGGGAGAGAGGGGATCACCTTGTGGAAGACCCTTGTAGCTAGAATCTTTCCTCTAGGACGGTAATTTATGATAATCAAGAAGTTGGTTGAGGAGATACAACCTCTTATCCATTGCCTCCATCGTAAACCAAAACCTTTTACCGAAAGAATCTCATCAAGGAATTCCCAATCAACTTTATCAAAAGCCTTTTCTATGTCAAGCGTGATGACCACACCAGTCTTCCTTTTTCTCTTCCATTCATCAATGAGTTTGTTGGCTATAAGGGAAGCATCCAGGATTTGTCTACCTGCGACAAAGGCAGATTGAAATTCAGAGATGGTGTGGGGGAGCACTTTTAAAAGCCTTTTTATAAAACCCTAGCAATGACTTTATATAGGCAAGAGATGAGGCTAATGGGCCGGAAATCCCCAGCTATTCTCGCATAAGTATTAATGCCAAAGGTAATAGCTATTCAATTGATTTGAATAACTAAAGCAGAATAATTGCAAAATTTCTTGGAAATAGAATTTCACCTAGAGCAAGAACTATGATACTATTGGAGAGAATTGAAATAGAAGTGGTTATCCTAAAAAATAGTGTGCTTCTCTCTAGCCGAAGGGACTAAAATTCAGCTGCAAAGGCAGCATTCCATAAGTTCCTAAGACTACTTAGAAAAATCAATGTGAAAGACTAACTCATGCTCTGTGCCCTATAAGCTTCTTAAATCATATTTTCATCTCATGGTTCCCCTTTCATAAGCTTTCATTGAACTCCCTTCATGTGCGATAGAATGTTTGGTCAAATTTGTATTTGCTCTTGATTCTTTTTGAGAAGCAGCAAATTGTAAGGTGTATAATCCACAAATAGAAAATGCATGTAAATAGATCAAGTATACTTATTGATACGGAAACACCATTATCTACTTGGACATATATACATTTTTCTCTCGGTAAAGAACCAAGTGGATGGATATGCACAGTTCGCCTTAATACCTTGTGCTGATCGACTAATACCTATTCTTGTCTTTGTAGTTTGTACTTATTTTGCCTTAAACGTTTGAATTTAAGTTGCATAATCTATGTTCATTTTGTTATAATTTTAT
AATCCTATTATCGAACACGCAACCTTCTGATCTGGAGTCAGACGCGCTACCATTGCGTCATGGATCTCTTGATAATTTTATAATCCTAAAAATCTTATTTGTTTATTTTCTTTTATTATAAATTTTAAAACAGGATCGTTGGAAGCTGTTATGTAGCATCTGTGGGGTCTCTTACGGAGCTTGCATTCAAGTAAGTTTGTTATTAATTTAAGCATGGGTGTGTGTGATAGCTATCTTCCTGATTTGATGGGTGTGTTATAATTTCTTTAATCTTTATAGTCTTGGATCTCTGTGCTATCATATGATATTACCATTTTGAGTATTAACTATGATATGGTTGGGTCGCCTCAATACCAAGTGATATCTTAATGGGAACAGCAAAAATTTTAAAAAAGGACACTAATGGTTTTCTCTGGGGAAAATTTGGTGGGCCTCTATGGTCTTTCTTCCAGGACAAATGGACCTTCTTTTTGAGTGGAGCTATTTGACGCTGCTGGCTTGTGTATATGGACAATTGGGTGCTTGGAGGTGACCTTGATGTGGTTAGGTGGCCTAGTGAGATGTCTTCTACCGATGACAATTCTTGAAATGGGGAATCTCTTGTGGGGAAGGTTTTCTTGGTGTTGAAAATGGATGGCTTTCTCGCCCCCATTTTCAATCTCTTTGTGATGCTTGGTGGAAGAGGGTGAGGGTGTCAATCAAGCTAATTAGAAAACTATGTCTCGTGGTTCTATCGGGTTTAGGGACTAGTAATATATTAAGAAAGGGAAAATATTGCTCTTGATACGTAATAACTTTAGAATATCATTTGGATGATGTTCTGTAGCACAAAATTATCAAGAACAAGTATGGTGAACATCCCTATCGTTGCTTCTCAAGCAGTGGGAGTTTTCCTAACATAAGAGTTCCATGGTTGGATGTGATGATTTCCATTGGACGTTAATTTGTAGATAATACTATTCTTTATTTTCTAAGGTTAAGAGAAAGAGATTCTCTTTACACTATGAATGCTTAGAGAGCTAAAAAGCACGGACACGGACACGGACACGGGACATGGATGCAACACGACACGGACACAACTGCACACCAATTTCGAAAAAAGTAGGATACAGACATGTTGGGGGACACAGACACGTTGGGGGGCATGTTCTTTTTTTAAAAGTATATTTATATATTTAATACTTCATATTAGTCGGAAGAGCTTGAAGATATAGAAGGATTTGATTGGCTATGGCTAGAGGAAGATGTCATCCTAATAATTTTGATCAACATATAGGTTTTGTTAGAACTTAGAATGATTAGAAACTAGAAGTGTTCCAAGAACTTATGTTTTTTTTTATGAAGTATATAAAAAAAGAAAAAGACCAAACCTTGCGGTAGTGGGCGGTGGTTGACAAAGTGGCGATGCTTACGATTAGGCCAATCAACGATGGTTTCCTTTCTCCACTTAATATATATATATATATATATATATATATATTTTTTTTTTAAAATTTTAAGGGTTTGGGGGTTGTTTTCCTTTTCTTTTTTAGGTTGGGCTTAATTGGCTTACTCATGGGCTGCCAAGGTGGGATTTATTTTTTTTTGCAGATGGCGTGTTCAAGGTGTGTCCAAAACGTGTCCATGCATGTCCGGGAAATAATAATAATAATAAAATATTGGACCCGTATTTTGGCGTGTCGGACACGTGTCTGGAGCGTGTCCGAACGAATCCATATCCAACACGTGTTCGACACGGACACTCTGCCTAAAACAGAGTGTTTGTGCTTCATAGCTAGAGAGTGAAGTTGGATCTCAAGGTCAACTATATTAAAGGTAATATTGTTGGTAATTGGTATTAACATTTCTTTGGATATGGTAAATGGATATTGCTATTAAGCACTCATGTGAACCACCAGGCATTGGGGCATGCAAAAAAAGTATAGATATTTTGGAGAATGGGTTCAAGTCTCTACTTAAGATATTAAAATCCTACGAGTTTTTTAACAACTAAATGTTGT
AGGGTCTAATAGTTGTCCTCTGTGATTAGTTGAGGTGCGTGTAAGCAGGCTCAGATGCTTACGGATTAAAAAAAGAAAAGAGAACACATCCAAATCTTCCTCTTGGTATGAATCTTCACTCTATGGAGTCCTGGGGTCTTATGGTAAATTTCATAAGTAGAAGTATAGTTTCTTCTCTCCATGGAGAAGACTAATACTCACTCACGCAATAGTATCAAGTATGCTGCTCTACTACTTGTCCTTTTAAAATGCCCAAGAAAGTTGGTTGCATCTTGAGGAAGTCCTTAAAGGGATTTCTTTGAGGGTGGCTCGCAGTCCACAAAGCTAGCCACCAAATAAATCTGAAAATAGTTTGCATGTCAAAAGAATATGGAGGCCCCAGTATTAGAGATTTGGAGTAGAAGAATGTTTTTTGCCTGCTAAATGGACTTTGGAGATCCCTCTTGAAAAGGAAATATTTGGCACGAAAACCAAATATGATCCATCTTCACCAGGGTGGTGGTACTTTGGTGAAGATAAAAGATGTAGAACAAGAAGCTCATGGATATGTCCACTTATAGAAGCATTTTGGAACAAATTATAAATTTCAAGCTTGGGAATGGAAAAAATTGATGGGTGTCTGTTCGGAGCCCTTTTGGGTTGGTGGTTAAAAGACAAAGCAAGTGTTGTGGTATTGTGTGGTTAGAGCTTTTTTGTGGCTGTATGGCTAGAAAGAAATCAGAGGATTTTTCAAAATAAAGAGAAGTCCTTCATTGGCCACCCTTTTAAAGGAGAGAGAGAGAGAAGCTTTACGGTCCAATCTTTCTCAACGGGGCTTTCTTTTGGAAGATTTGGTTAGAAAGAAAAAACAGAATCTTTGGCGACTCCAAGATGCCTTTTGGAACTTTATTGAACTCTGTAATTACCCTTGCCTTATCTTGGCATAAATACTCCACCCCCCCCCCCCCCCCAATTGTAGTCTAGCATTCTTTTGGCCAATTGGAAGGGTTTTTTGTAATTCTTTGGGTGGGGCCCTTCCCCTTTTGTAATTTCATTTATCCATGAAATTTATGTTTCTTTTTTTAAAAAAAAGAAGATAGCTTTTGTGATCATGTATCTGGCCTCTTGTTCGAGTGCCTTCACAAGAGTTTTTTTTTTTATAATCACAATGTTTTTTTTTTTTTTATGAACCTCGAGGGGTGCTACATGTAATAGAGTTTTCTTTGCAAATGCCTTGGTGGAAGGCTAGAAAGGGCTACATCAGATCATTACCCCATTCTTTTATCCATTTGGCATTGATAAATGGATCCCCCTTCCCTTCCAGTTCGAGAATATGTGGCTTCAACACTGTAGTTTCTACCCATGGTGGCAATTGGTGGAAAAACACTCCCCTTTGAGGCTGGCCCAAGCATTCTTCTTGCATAAATTAAAAGGCCTTAAATCCATTATTCGCACATGGAACAAGGAGACCTTTGGTTGCATTGCACTACAACTTAGAGAAACCAACTCCATATGAACTCTCATTGCTTGTTTCATTGGAGGAAATAGGGGATCTTCAATCGGTCCATGGTATTTAAAGGAGGTCTATCAAGATTCTACTTTTTTCCTTGGCTGTCAAGGCAGAGACTTTTTGGAGACAAAAATGCAAAGCTAAATGGTTGCATGATGGGGACATTAATTCTAGCTTTTTCCACTATTTCGTGGCAACAAGAAAGAGGCAAAACTCTATCCACAAGTTACTTTCTCGAGATGGTAAAAGTTTATTGATTGATCAAGACATTGAGTTAGAATTCATAGAGTTCTATAAACATCTTTACTCTAGAAAGGAGGGCTCCAATTTTCTTGCTACAATTCAGTCAATGGGATCCTATTTCCACCCTCGCTTTGAGCTAGAGAATGCTTTCATTGAAGAAGAAGTGTGGCTCGCTGTGAATGAATTGGGCACCAACAAATCCCCCGACCTAGATGGATACACCGCGAAATTCTATAAAAAATTTTAGAACATCCTCAAGGACGACATT
TTCCAGTACTTCTTTAAGGATGAGATTATCAAGGCGAGTCTTAATGAAACCTATATATGTCTCATCCCCAAGAAAGTGGATACCCGTTTTGTTGGAGACTGTGGACCCATTAGTCTCATCTCATATATGTACAAAATTGTGGCTTGGGTATTATCTGAGAGGCTAAAAAAGGCCCTTCCATTTACCATTACGGATTATCAATCAGCTTTTGTTGCGGGTCGTTAAATCCTTGGTGCTTCACTTATCGCAAATGAACTCATAGGCCAATGGAAGCGGAAAAAGAAAAAGGGGGTTATCATCAAGCTTGATATAGAAAAGGCGTTTGACAAAGTGGATTGGGAATTCCTTGAAGAAATCCTTGTGGCAAAAGGGTTTGGAATCAAATAGAGGAAATGGACTAGAGGGTGCATTTCTTCTACAAACTTCTCTATTATTATCAACTGCCTTAGGGGTAAGATTTTTGCTTCTAGAGGGCTTAGATAAGGTGATTCACTCTCACCCTTCCTTTTTATCATTGTGGTTGATTGCTTTAGTAGAATTATGTCCCACGCCGATTCCAAGAATTTGATTAAGGGCTTTGAGATGGGAGATGGATCTATCAGCATACATCACCAACGCCAATGATACCATTCTCTTCTCATCACATGAAGATATGGCCATACAAAACCTTTTCAACACTGTATATGCCTTTGAAGAGGCATTGGGGCTTAATATCAATCACCAAACGTAAAAAATAATGGGCCTTAACATGGACCCCCATTCGGTTGACATCATTGCTAATAGGTTTGGTTGTAAGGTTGGTGTGGGGCTGTACTCCTACCTAGGCCTTCCATTGAATGGCAATCCACGTGCTTTATCCTTTTGGGCACCGATCATTGAGAAGATTGAAAGAAGACTCCATTATTGGAGGAACTGTCATATTTCGAAGGGAGGGAGGATTACTCTTATACAGGGACCTTCTGATTTACTACATGTCTCTCTATTCTATATGCCGTCCAAGGTGGTTCACTCTATAGGAAGGATTTTTAGAAATCTCCTTTGGAAAGGTATCAGTGACAAATCTGGTATCCATCTTGTTAAATGGGAAAAAGCCCATCTTCCTATTGATCAAGGTGGTCTTGGCCTTCATAGCATGAAAGAGAAGAATAATCTTGCTAAATGGCTTTGGCATTACCATGGTGAGAAAGAGGCTTTATGGAGAAAGGTTAAAGATGCTAAATATGGAACCACTCATTTGAATATAAAGCCTGGAAAAGCATCCATTGCCACTGCCAGAGGGCCTTGGAAATACATCTTGAAGTATCAAGACCTCATCCATGACCGTATGGTTTGGAAAGTTGGAAATGGAGCCTCCACATCCTTCTGGAATGACAAATGGCTACTGGATGAATCATTAGCATCTCGATATCCCCTTATTTACTCCATCACTCTTTGTAAACAAGCTGCAGTTAAAGATCTTGGAAGGCAGAAGATGGGATTTGAGGATGAGAAGAAACTTAAAGGATGCTGAAATTTGAGGAATGGGCTGACCTTATGCATTATTTATCCTTCATTCACTTGACTCAATCTACTGATTTATGGGTTTGGAAGCTTGACAGCAAAGGTACCTTCTCTACTAAATCTTTATTGCTTGACATTGGTCCTAAAGGTGATGATTCATCGACCCTTTTATACAAGTCTATTGGAGTGACCATTACCCAAAGAAGATAAAGTTCTTTCTTTGGGGGCTTAGCCATTGAGCCATCAACACATTTGATAAGCTTTAATGGAGACTGCCTTACATGTCTATGTCCCCACAATGGTATCCACTTTGCAAGAATAGTGTTGAGTCACAAAGCCACCTCTTTCTCTCTTGTAGCACTGCAACCTCGTTTTGGAAGGAAATTCTTTCATCCTTCAAATGGTCAATGGCTTCACCAAATGACCCAAGGAGCATCCTTTCTCTTACATTATTGGGCCACCGGCTTAAAAAGTAACAGAAGATCCTTTGGATGCACTTC
ATCATAGTGTTATTTTTGGCTACATGGAATGAAAGGAATCAAAGGATTTTTAATGATAAGGAGCAAACTTTTGATGGGCTTTTTGTTTCTATAGCATTTTTGGCTTTCACTTGGTGTAAATTATCTCCGCTCTTCCATTACTATAGTTGTTCTTCACTTTTGTCCAAGTGGAGAAGTTTCATGTAACTCCCTTGGATTGGGGTTGAATTCTCCCTATCTTTTGTAATTTCATATTATCAATGAAATTGTTTCTTATTATTATTTTTTTTAAGGGAAATTGTTTCTTATAAAAAAAAAAAAGTTTCTCAACCTGTGGACAGGGTGGTTGGAGAGTTTGGACATTGATACTTCTGGACAATTCTCTTGCTCCTATTTGGTCAATCATATTGTTGGACGGGCTTTTTATTTCATGCTCTTTTGCTCCTACTTTTTGGCACTAGCACCTTTGCTTCCAATTAGTTGGTGCACCTTCGCATAGTTAGAGTGTTACTGTTTTTTCTTTTGTTTTTTGTTTTTTGTTCTGAGCACCTTTGATATGGGATTTTTTTTTGACAAGTCGATAACATTTTGAGACTTTTGGTTGCGGGTTTTTTTGGGTGATTTGGATGGATAGAAATCAACATGTATTTGAGGACCCCTATTACGAGAAACTTATGATAAATTCGATTTTCAGATGACGTAGAACCCTTTCTTGGATTCTTTTCTGATTATTACTCTTTGACATCCAAGTCTATAGAAGAGCTTTCTGTAACTCCTTGTTCAATTCGTGTTAATCTCTTGACTTAATATATTTTTTTCCCTAAAGAAGCTTGCCATTTGTTAATGCAGTTTGCATTAATCAATTTATAATATGATAAAATCATTATCGGTTGGTTCATGGGACAAAGGGGCAAGTCTTATTTGTCTCCTACTTAATTTTATCTCAGGACTCTGGAGTTTCTTAATGTCAAATACTGTTGTGTTAGACAGAGTTTTGTTGGAGTAGTTGACACAGGTGTAAGTTGACTTAGACAACTATAGTTACTTCAGAAAAAATGAAAAAAGAACAATTCGATATAATTATGTGCTTATGACAAAGTACCTGTTCCTTCTTTACATTTATTTATATTGTTACTTTAACTGTTTCTTATTTTGTAACAGTGTTCAAACAATACTTGTTATGTAGCATATCACCCTCTTTGTGCACGAGCTGCTGGTCTTTGTGTTGAGGTATATCTTCCATTCATTGTGCGTGCATTTGTTCAAGTTTTTATTTAATATTTATATCAACATATGTTGAAGTGTTGACATAGTATCTATCTTGTATTTTTTTTTTTCTTAATTTCAAAGCTTGAGGACGATGACAGGCTCCATCTACTTGCTGCGGATGAAGATGAAGAAGATCAGTGCATTCGCTTACTTTCCTTTTGCAAGAAACACAGGCCACCATCTAATGAACGTTTAATGGCTGAGGATCGTATAGGGCAAGCTGGACAGCAGTGTTCTAATTATACTCCACCATGCAATCCATCTGGTTGTGCTCGTACAGGTTTGACACTTCTCTGGTTGAAGCAAATGTGCGATTAGTTGCATCACTGTTGGATATGGAAAATTTTTGAGACTGTATATGGCTATTTTAAAGTTAATGTGACTGGTTTTGCTCCTCCTGTGGGTGTTCTTTGAACGAGAGATCAACTTTTTTTTTTTTTGTTCATAGTAACTGGAAAGCCTTTGGTAACCTCCTTGGTACTTAGGAGATAATCCTTGTCTCTCTTTTCTGTACTTTTATTGATTTCAATCTACTGTTTACTTTTTCCAAAATGAAATGGTACTTATATTGTACTCATGATCGAGCTGGTAGACAATAGGTTAGTGTGGGCATCGTGTTCAAATTAATTTCAAAATGAGAAAAAGTAGGAACTGTACTTATTAACCTCAAGTATCAATTAGGGGGAAAGTGGTATGGAAATAATCAATAGGCTGACCAAAAGTTTAGAATAGGTGTCATGCTTGTGCCTTTG
TTGTTTATTTTTCTCCCTACATCAGCTATGGTTTATTTTGCGGTATTGAATAGGTTTTGTTAGTTCTCATTGAACTGTTTTATCTTAATAAATTGTCCTCTCCTTGGCAGAGCCCTATAATTACTTTGGGAGAAGAGGGCGCAAAGCCCCTGAAGCCCTTGCTGCTGCATCCTTGAAACGCTTGTTTGTTGAAAATCAGCCTTATATAGCGAGCGGTTATTCCCAACATTTGTTATCAGGGAACTTATTGCCTTCCAGTGGAGTCCTAGGCATGAAGTTTTCTCTTCAACATTTGAAAACCTGTCAGCTTGATCCCCGTAACATACTTTCTGTGGCTGAAAAATACAAATTTATGAGGGAGACCTTCAGGAAGAGACTTGCATTTGGTAAGGATTAGATTGCAATAATTGTTTTTTTATAAACGATTTATGGTTGTTAAATGTTGGTTTATAACTTATTCAGTTACCATGTTTTTTATATTTATCTTAGTGTATTAGTTAAATGTGTTTTTTTTTAATGAGAAATGAATAATCACTTAAATCAGAACATTTTCATTTCAATTCTTGAACTTATTGTTCCAAGCAATGAGATTGAGAAGTAACGAAATATTGTTGTTCATGTGATTTTAAAGCAATCTCCTAGTGGGCTGGTTGTTTCAATGGTTATGACGGGTTCAAAGGGTTTGCCTTTGATCTTTTACGGCTCACTATATTCTCATGCTTTCAGAGATTTTCTATCTTGACCAAGAGATGATCTATTATTTTTGAATTATCACATATCTTGAAAATAGAGCTTACACAAAAAAAGATTTTGTGTTTGAGATGCAAAGTTTAGGGTATTTATGTTGTGTGTTCTCTACTAGATGGGGTTTTATTTTTATTACACTGCATTTGTTTAGCTCGGCTTTATTATTATAAAGTGCAGCAAGCCTGTTTTAAATTTCATTACTCTGTCATCTTGGATAATCTGAATGCTTGGAGCTTCTAAGTTATATCGTTTATCTTTAATATCGTCTGAGCAAGAAAAGAATAAAATTCAGGTGATACCTAGCTAGTCGCACTGTCACTTATTTATTTCTTCAGAGAAATCATGGTCATGTTTTTTCTTACTTTAACAATCACACCTTTATGATTATACAACACAGGTTTACTTGAGAATGCATTTAATTAGTTCTCATATCCATCTGACTGGCTTTGTTTCAATTGGAATCACTTTTGGCTCTAGGGAAGTCTGGGATTCATGGATTTGGAATCTTTGCTAAGCATCCACACAGAGCAGGGGATATGGTAAGAAAATGTTATTCTCTTTTGATATTTCCTGTTTTCAGTAAGTATATGAAATTTGGTCTCTACAGGTAATCGAATACACGGGAGAAATTGTTAGACCTCCCATTGCTGACAGGAGAGAGCGGTTCATATATAATTTATTGGTGGTAAGTTCATTAGTCTGCTCATTGTCATTAGCTATAGGTTTCTATGTTGTTGATTCATATTTATCATGTCATCCCCATGCAAGATAAATTTCCGTTTAGAAATTAGAATGAGCTGTCTTCATCGGTTAATGTTAAGTGGGTTTGTCTTTGCTTGAACTGAGTTGATTAGAATATTTATCGCATCATTATTTTGAATTTTTGTTCTTTGATTTAGTTGAAGTTGAATACGATGATTTTTTGCAAAACAACCATAATTTGAGAACTTACTATGCGTTAGTTTCATTTTAGCTTTCTTGTAATCTGACTGTAATAAGGTGTGCTTAAGTTGCAACATCGTCTAATCAGGCAAGTTTTTCTTGAGCACATAGTGGTTCTTATGATCTGCTTATGAGGTGGTTTAGGAGTGTAAAATTAAAGGTAAGATAGATTCTTCTTTTAATTGATCCAGATTTCAAATAGTCACTTGGGCTGGAACCTTTTGAAAATCTACTTTAAAATCGGTTTTCTGGACGGAAGTGAATTTTTAGTTTCTTGAATTTGGTAATTTAGGAGCATTCCTCTATGTATC
CTCTTGTAGATGTTGATTTTTCCCTTCTCCCACTGTAACGAAAACAGACTTTGAAAAGTTAACGGGTTCAATTTAATCCAGGGTGCTGGCACTTACATGTTTAGAATTGATGATGAACGAGTAATTGATGCTACAAGGGCCGGAAGCATTGCTCATCTGATCAACCACTCTTGTGAAGTTTGTATTACTGCTTTAGTTTTTCATTTATTTTCCTTTTGGCCTTTTAACTTGTTAGCGTTCTTTATCTTTATGAAATGAACCAATGAATTAGATATAAGTAATCTTATACCCATATTGTTGTATGAAAAAGATTCAAGTAGCTGACATTTTTCCCTTGGATGCTTGGTAAGAGTAACTTAGAACTCCCATTTTTCTCTCAACTAAGCTGCCTTACACAGTAACTGTTGCTCTCTGTGGAGTTGTATTAGTAAAAAGCAAATTTCAAGTAAGAAGGGGTGGGTTGCATACTAAGTTCATTTGTGGAATCCAAGGACATTAACCATAATCCAATTCTTAGAATTAATTGATAGTAGATTAAGATCAAATTTCATTGCAGTTACCTCCCATGATAAACTACATTCATACATTGTGAATGGACCTTAAAAAAATTTGTGTAATGACCATCATGGAAATATCTTGTATGTGGCAGTTGTCATATCCCCAAAAAAATATTGACCATTTGGCAATAGTTTACTTATGGAAATTGGTTTGCCCTTTTTTGTTGGTCTGTTTCCAATTTTTTCCATTTCTTCAAAACAAGAGATTCTCATTGATTATTCATACTAATGTTTCCTTTATGCAGCCAAATTGCTATTCGAGAGTTATAACTGTCAATGGAGATGAACACATTATAATCTTTGCAAAGAGAGACATTAAGCGATGGGAAGAACTTACATACGATTATAGGTCTGCTCAAACTTAGTGATTATATCTTACTTTCTGGCATTCTTTAGATGCTGACATGTCAATCTTCATTCAGATTTTTTTCCATTGATGAACAACTAGCGTGCTACTGCGGCTTTCCTAGATGTCGGGGTGTAGTCAATGATACTGAAGAAGAAGAGCGAGTTGCAAAGCTATATGTACCTCGAACAGATTTAGTAGATTGGAGAGGTGAATGAGGCAGGTCAGCGAAACTAGAATAGATTCTCATTTGATCCCTGTTTTCGGAGTCTATTTAGATGTAGAGTAGTCAAATTGCATTTTCTTTCCTCTTTTTCTTTTAAAAACTATAGTAACCATCACACCAAACATGTATAAAATCAACTAACTACTGATAATGATAATAATTGCATAATGACATATTTTAGTTGATATTGTATATCGATTTTAAGACATTATTTTGTTGCCATTGGTTAAATCTGCAGTTCGGTATGACGCTAATATTTTCTCACTTTCGACATATAGAGAGAACATGCATGATCTTGTGTAAAATGTATGAGCTGCACATTTACTTTCTCAGATTTTCAAGGCATCTGCCATCGCTGCCTATACCTTTGTGATCCTCTCGCATGATAAGAAATATCTCGTGTATATACATTTTTTCTGACATACTTTGTTTGGCTACACTCACAACTCTTTCAGATTCTCTCCACTTCTGTTTCAGCATGATCCTGCTGAATGTGCCCTGGTTCTTTGCCTTGTTTCATTGTCATTGTAGTTGCTGATGTTTCTCTGTTTTTCTTTGGCAGAACTATCATATAGAGTTTGCCCATGGAAGCTCACCTCCAGCAAATTTGCCCATGACGTTGACATTTCTGCGATGCGCACTGGAAGTCGGTTTTTGAGTTCCAGTTTCACAACTCCGCAATGTCGAATCAGGCAATTCTTTATCAAAAAAATCACCCAATTTTTTTTTCAATCCCATATTTCCTCATTGTACGAACATAATTTGTGAGAGATGTCGATTTTAGCGATCGCAAATTTATAATGTTAGCAGCATCATAGGAAAACTCATTCTTAGTCTGGTTGAGATGGGTCCATTGCCCTTGGTTTTGTAG
ATTTGCGATAGTTGGGCATCAAGTGCTTCCATTGTTAACCATTAATTTTCTGTACAGAATACTCTGAAAGTAATGGAAAATAATTGTTCTAAAGCTTTCTTGTTATGCCTTTTTCCGATCATTTTCCAGATATTTACCACGTGAAGTGAGCAAAAACACAGGAATAATTTGGGGTTCCATTCTGATTAGACTTCTTTTTATGGTTATTCTGAATAGACTTGAATGACTTTAAATAATTATTTTAACTTCCAATTGGTGGGCCAATTACTTCACCAATTACTTCCCTTTTACTTTTGCTTACGATAACACTCTTAGGAATCTTATCTTTTAGAATATAATATTACTATTCAAATAAAGAGTGAAGTTTTAAACTTATGATTTATAAAAGAGGTTACTTATAACTATAATCTCATGATTATTAATGATTGGAAGACTCTTCCTTTATAGCTCTCTGGGCGGGAGTTTTCTCCACCCTTGGTTTACTCTGTTCTTTCTTCCCTTTTGTTTAAGTTGATATTAACACTTTGGGTTCACTGTTGAAGGTTGTGTTTGGGATTTTCAGCATGACTGAAGTTCTTGTGCAACAATATTTCACAACCAGTTTAAAAAGTATAATCTTATATTTTTCCTTTTACAAAAGTGGTTCTAATTCTTTTACATATTTTTTATTCAGTTACATTTCAAATTAATGTCTTTTAAATATGTTAGTATTTTTATATTTTTGTGAACAAGGCACAAATGTAATGATATAACATGTGAGATATTGCGATTGATTTCTTATTTTATATTTTACATTTCATTTATTTTTGTAAACTTACAAATGGATTGATATGTACTAATTTTATTATGTTAGTTTTGTAATTTCACAATCTTTTTATATTGCAAACAATTTGGTAGATAAATATTTATGGTTACAAAAAGTAGTGGATCTTTATGAAGAAAAAAACAAATGAACATTCAAAGAGTGACCCACAATTGAAAACTTAAACCTCACTTTATATGTCAAACTAAGAACTAAAATATAAATTCCTTCCTCTTCACTCTGATATTGATTCAATGTTTGTTAGTTTGATTGAGTCTAGATTAAGGGAAAACCATATCTTTCACCATCAATATGAATTGAAGCAAGTGAAATCCATTGGTGGCTTGGTCTTCAAGAGTGGCAAAAAGGGGAGCCCCTACTTTGATGTTAACATATCTTGGCCCATAGGCCCAACCCTAAACCCACAAGATTAGACCTTCAATCAAAGAGAAAGAGAGGGTGGGAAAGAAAGAGTTTTTAGAGAGAGAAAGAGATAATATTAGGTTGAATTGCACTGTTGAACTAACTATCTAACTAATACCCACCCCCATAATTGAAGATTCTCTTGCTCCTCTCTGTCTTCTAAGTACCAAACTTTGGGAGTAGGTCTTGTATTTTCCACATGTAAATCATTCTATCTTGCTTCTAAAGCCAGCTTTTGGTAACATAACAAACCCCTCTTTTTTTCATTTTCATACTTTCCATATCCTTAACTACAAGAACCTTATCATTTTGTTTCAAGATTCTTCTTCTATAGTTTCTCTCTTCACTTTGCATTTAACCTCTTCTACCTTTCCCTTGTTGAAAACATGAGAGCCGTCGGAAAATTTGCAGCTCTTCCCTTCTTCTCTGGTTGCGCTTCTCGCTCCAGCGTTGCGACGGGTACGTCCATTTCTACGACTCAAATCCAAGAGCTTAAGGCTGCAGAGATAAACCCAAGTGTAGCAA

mRNA sequence

ATGGCATTCCCTCTTGAGCAGCGACCGAAGCCGCAAGTTATGGATGGGGAAGACGGAGAAGATATCAATATCGATGTTTATAATGCTGGGACTCCGGTTCGGTACCTCTCGCTTGATCATGTCTACTCCACTACGTCTCCGTTTGTCAGTACAAGTGGGTCTTCGAATGTCATGTCCAAGAAGGTGAAAGCCCGGAGGCTTATGGTGAATCGCTTTGACGATCTTAATTTCAAGCCGCCTCGTCTGCTCCATGTCTATTCTCGCCGCCGTAAGAAACCCCGCCATTCGTCTGCCACTCCCTCCTTCTACGACTCTTTGGTTGAGAAGGTCGAATTGGGGTCGAAAGCTGTTCTGAAATCCGAAGCTCGTGAGATGGATGAGATGGTGAATAGTGCAGACGACCATGCAGACGACTTCGAAGTTGACAGAATGCCGAAGAAGAAGAAAAAGAAGAAAGATAAGTTTGGGTTCAATGAGCTTGTTAAATTGGAGGTCGATTCCAGTGTTTTTCGTGCGATGAATGGTCCTAGGTTGAGAGACTGCCGCACTCATAACAATAACAATCTAAACGTTAGTAATTCCGGACGGAGAAGAAAGCGCAATTCTTCAAAGATTTCTGAGAAGACTGTATTGAAATCTCCTTCTGCTAAGAGATGGGTCAGGTTAAGTTTTGAGGATGTCGACCCGAAAGTATATATTGGATTACAATGCAAGGTTTATTGGCCATTGGATGCTGATTGGTACTGTGGTCGTGTTGTGGGTTATACATCAGAGACTAATCGTCATCATATTGAATATGAAGATGATGACAAAGAAGATTTGATTCTTTCAAATGAGAAAGTCAAATTTTATATTTCTGGTGAAGAGATGCAGTCTTTGAACTTGAGTTTTGGTGTTGATAGCGTAGATAGTGATGCTTATGACTACAATGAGATGCTTGTTTTAGCAGCAAGCTTGGATGACTGCCTGGAACCTGAACCTGGGGATATTGTCTGGGCCAAACTTACTGGTCATGCTATGTGGCCAGCAATTATAGTGGATGAATCACTCATCGGTGATCGGAAGGGCTTAAGAAATATTTCAGGAGGAAGAACAGTCCCTGTTCAATTTTTTGGTACACACGACTTTGCAAGGATTAAAGTAAAACAGGCGATGTCATTTCTCAAAGGTCTTCTTTCCTCTTTCCACCTGAAATGCAAGAAACCGCACTTCATTCGGAGCCTAGAAGAGGCAAAAATGTATCTCAGTGAGCAGAAACTTCCACCAAGTATGCTACAGTTGCAAAATGGAATTGAAGTGGATGATTTTGCAAGTGCAAGTGGAGAGGAAGAAGCGACAACAGATTCAGGGGAAGAATGCCTAAATGAAGGAGGGATGCCTTGTCCACACAATGGATATGGATCATCTCCATTTATAGTTGGGGATCTAGAAATAGTAAGCCTTGGGAAGATTGTCAAAGATTCTAAATATTTTCAGAATGATGGGTCTGTATGGCCTGAAGGGTATACAGCTGTGAGGAAATTTTCTTCTTTAACTGATCCCAATGTCTGTACCTTATATAAAATGGAAGTTTTGAGAGATTTTGAATCAAAATTTCGACCTTTATGTAGGGTAACTTTGGATAATGGAGAGCAGTTTAAAGGATCCTCTCCATCTGCTTGCTGGAATAAAATATACAAAAGGATGAGGAAAATACAACATATTTCTGATGCTTCTACCGAAGGTAGAGGGGAAACTGTATACAAGTCCGGTTCTGACATGTTTGGTTTCTCTAATCCGGATGTTAAGAAACTCATCCAGGGGATATCTAAATCTGGACTTTCTTCTTCCAGATCCTTGAGCAAAGTAGCCTCTAAAAAATACAAAGATTTTCCCGTTGGTTATAGACCTGTTCGTGTTGATTGGAAAGACCTTGACAAGTGCAGTGTTTGTCATATGGATGAGGAGTATGAAAATAATCTCTTCTTGCAGTGTGACAAATGCAGAATGATGGTCCATGC
TAGGTGCTACGGAGAACTAGAACCAGTTGACGGAGTGTTATGGCTGTGCAACTTGTGTCGGCCTGGGTCTCCTGATTGTCCCCCACCATGCTGCCTTTGTCCTGTCATAGGGGGTGCAATGAAGCCTACAACAGATGGACGCTGGGCTCATCTTGCTTGTGCTATATGGATACCTGAAACGTGTTTATCTGATATCAAGAAAATGGAGCCTATTGACGGTCTTAGTAGAATCAATAAGGATCGTTGGAAGCTGTTATGTAGCATCTGTGGGGTCTCTTACGGAGCTTGCATTCAATGTTCAAACAATACTTGTTATGTAGCATATCACCCTCTTTGTGCACGAGCTGCTGGTCTTTGTGTTGAGCTTGAGGACGATGACAGGCTCCATCTACTTGCTGCGGATGAAGATGAAGAAGATCAGTGCATTCGCTTACTTTCCTTTTGCAAGAAACACAGGCCACCATCTAATGAACGTTTAATGGCTGAGGATCGTATAGGGCAAGCTGGACAGCAGTGTTCTAATTATACTCCACCATGCAATCCATCTGGTTGTGCTCGTACAGAGCCCTATAATTACTTTGGGAGAAGAGGGCGCAAAGCCCCTGAAGCCCTTGCTGCTGCATCCTTGAAACGCTTGTTTGTTGAAAATCAGCCTTATATAGCGAGCGGTTATTCCCAACATTTGTTATCAGGGAACTTATTGCCTTCCAGTGGAGTCCTAGGCATGAAGTTTTCTCTTCAACATTTGAAAACCTGTCAGCTTGATCCCCGTAACATACTTTCTGTGGCTGAAAAATACAAATTTATGAGGGAGACCTTCAGGAAGAGACTTGCATTTGGGAAGTCTGGGATTCATGGATTTGGAATCTTTGCTAAGCATCCACACAGAGCAGGGGATATGGTAATCGAATACACGGGAGAAATTGTTAGACCTCCCATTGCTGACAGGAGAGAGCGGTTCATATATAATTTATTGGTGGGTGCTGGCACTTACATGTTTAGAATTGATGATGAACGAGTAATTGATGCTACAAGGGCCGGAAGCATTGCTCATCTGATCAACCACTCTTGTGAACCAAATTGCTATTCGAGAGTTATAACTGTCAATGGAGATGAACACATTATAATCTTTGCAAAGAGAGACATTAAGCGATGGGAAGAACTTACATACGATTATAGATTTTTTTCCATTGATGAACAACTAGCGTGCTACTGCGGCTTTCCTAGATGTCGGGGTGTAGTCAATGATACTGAAGAAGAAGAGCGAGTTGCAAAGCTATATGTACCTCGAACAGATTTAGTAGATTGGAGAGAACTATCATATAGAGTTTGCCCATGGAAGCTCACCTCCAGCAAATTTGCCCATGACGTTGACATTTCTGCGATGCGCACTGGAAGTCGGTTTTTGAGTTCCAGTTTCACAACTCCGCAATCTCTTCCCTTCTTCTCTGGTTGCGCTTCTCGCTCCAGCGTTGCGACGGGTACGTCCATTTCTACGACTCAAATCCAAGAGCTTAAGGCTGCAGAGATAAACCCAAGTGTAGCAA

Coding sequence (CDS)

ATGGCATTCCCTCTTGAGCAGCGACCGAAGCCGCAAGTTATGGATGGGGAAGACGGAGAAGATATCAATATCGATGTTTATAATGCTGGGACTCCGGTTCGGTACCTCTCGCTTGATCATGTCTACTCCACTACGTCTCCGTTTGTCAGTACAAGTGGGTCTTCGAATGTCATGTCCAAGAAGGTGAAAGCCCGGAGGCTTATGGTGAATCGCTTTGACGATCTTAATTTCAAGCCGCCTCGTCTGCTCCATGTCTATTCTCGCCGCCGTAAGAAACCCCGCCATTCGTCTGCCACTCCCTCCTTCTACGACTCTTTGGTTGAGAAGGTCGAATTGGGGTCGAAAGCTGTTCTGAAATCCGAAGCTCGTGAGATGGATGAGATGGTGAATAGTGCAGACGACCATGCAGACGACTTCGAAGTTGACAGAATGCCGAAGAAGAAGAAAAAGAAGAAAGATAAGTTTGGGTTCAATGAGCTTGTTAAATTGGAGGTCGATTCCAGTGTTTTTCGTGCGATGAATGGTCCTAGGTTGAGAGACTGCCGCACTCATAACAATAACAATCTAAACGTTAGTAATTCCGGACGGAGAAGAAAGCGCAATTCTTCAAAGATTTCTGAGAAGACTGTATTGAAATCTCCTTCTGCTAAGAGATGGGTCAGGTTAAGTTTTGAGGATGTCGACCCGAAAGTATATATTGGATTACAATGCAAGGTTTATTGGCCATTGGATGCTGATTGGTACTGTGGTCGTGTTGTGGGTTATACATCAGAGACTAATCGTCATCATATTGAATATGAAGATGATGACAAAGAAGATTTGATTCTTTCAAATGAGAAAGTCAAATTTTATATTTCTGGTGAAGAGATGCAGTCTTTGAACTTGAGTTTTGGTGTTGATAGCGTAGATAGTGATGCTTATGACTACAATGAGATGCTTGTTTTAGCAGCAAGCTTGGATGACTGCCTGGAACCTGAACCTGGGGATATTGTCTGGGCCAAACTTACTGGTCATGCTATGTGGCCAGCAATTATAGTGGATGAATCACTCATCGGTGATCGGAAGGGCTTAAGAAATATTTCAGGAGGAAGAACAGTCCCTGTTCAATTTTTTGGTACACACGACTTTGCAAGGATTAAAGTAAAACAGGCGATGTCATTTCTCAAAGGTCTTCTTTCCTCTTTCCACCTGAAATGCAAGAAACCGCACTTCATTCGGAGCCTAGAAGAGGCAAAAATGTATCTCAGTGAGCAGAAACTTCCACCAAGTATGCTACAGTTGCAAAATGGAATTGAAGTGGATGATTTTGCAAGTGCAAGTGGAGAGGAAGAAGCGACAACAGATTCAGGGGAAGAATGCCTAAATGAAGGAGGGATGCCTTGTCCACACAATGGATATGGATCATCTCCATTTATAGTTGGGGATCTAGAAATAGTAAGCCTTGGGAAGATTGTCAAAGATTCTAAATATTTTCAGAATGATGGGTCTGTATGGCCTGAAGGGTATACAGCTGTGAGGAAATTTTCTTCTTTAACTGATCCCAATGTCTGTACCTTATATAAAATGGAAGTTTTGAGAGATTTTGAATCAAAATTTCGACCTTTATGTAGGGTAACTTTGGATAATGGAGAGCAGTTTAAAGGATCCTCTCCATCTGCTTGCTGGAATAAAATATACAAAAGGATGAGGAAAATACAACATATTTCTGATGCTTCTACCGAAGGTAGAGGGGAAACTGTATACAAGTCCGGTTCTGACATGTTTGGTTTCTCTAATCCGGATGTTAAGAAACTCATCCAGGGGATATCTAAATCTGGACTTTCTTCTTCCAGATCCTTGAGCAAAGTAGCCTCTAAAAAATACAAAGATTTTCCCGTTGGTTATAGACCTGTTCGTGTTGATTGGAAAGACCTTGACAAGTGCAGTGTTTGTCATATGGATGAGGAGTATGAAAATAATCTCTTCTTGCAGTGTGACAAATGCAGAATGATGGTCCATGC
TAGGTGCTACGGAGAACTAGAACCAGTTGACGGAGTGTTATGGCTGTGCAACTTGTGTCGGCCTGGGTCTCCTGATTGTCCCCCACCATGCTGCCTTTGTCCTGTCATAGGGGGTGCAATGAAGCCTACAACAGATGGACGCTGGGCTCATCTTGCTTGTGCTATATGGATACCTGAAACGTGTTTATCTGATATCAAGAAAATGGAGCCTATTGACGGTCTTAGTAGAATCAATAAGGATCGTTGGAAGCTGTTATGTAGCATCTGTGGGGTCTCTTACGGAGCTTGCATTCAATGTTCAAACAATACTTGTTATGTAGCATATCACCCTCTTTGTGCACGAGCTGCTGGTCTTTGTGTTGAGCTTGAGGACGATGACAGGCTCCATCTACTTGCTGCGGATGAAGATGAAGAAGATCAGTGCATTCGCTTACTTTCCTTTTGCAAGAAACACAGGCCACCATCTAATGAACGTTTAATGGCTGAGGATCGTATAGGGCAAGCTGGACAGCAGTGTTCTAATTATACTCCACCATGCAATCCATCTGGTTGTGCTCGTACAGAGCCCTATAATTACTTTGGGAGAAGAGGGCGCAAAGCCCCTGAAGCCCTTGCTGCTGCATCCTTGAAACGCTTGTTTGTTGAAAATCAGCCTTATATAGCGAGCGGTTATTCCCAACATTTGTTATCAGGGAACTTATTGCCTTCCAGTGGAGTCCTAGGCATGAAGTTTTCTCTTCAACATTTGAAAACCTGTCAGCTTGATCCCCGTAACATACTTTCTGTGGCTGAAAAATACAAATTTATGAGGGAGACCTTCAGGAAGAGACTTGCATTTGGGAAGTCTGGGATTCATGGATTTGGAATCTTTGCTAAGCATCCACACAGAGCAGGGGATATGGTAATCGAATACACGGGAGAAATTGTTAGACCTCCCATTGCTGACAGGAGAGAGCGGTTCATATATAATTTATTGGTGGGTGCTGGCACTTACATGTTTAGAATTGATGATGAACGAGTAATTGATGCTACAAGGGCCGGAAGCATTGCTCATCTGATCAACCACTCTTGTGAACCAAATTGCTATTCGAGAGTTATAACTGTCAATGGAGATGAACACATTATAATCTTTGCAAAGAGAGACATTAAGCGATGGGAAGAACTTACATACGATTATAGATTTTTTTCCATTGATGAACAACTAGCGTGCTACTGCGGCTTTCCTAGATGTCGGGGTGTAGTCAATGATACTGAAGAAGAAGAGCGAGTTGCAAAGCTATATGTACCTCGAACAGATTTAGTAGATTGGAGAGAACTATCATATAGAGTTTGCCCATGGAAGCTCACCTCCAGCAAATTTGCCCATGACGTTGACATTTCTGCGATGCGCACTGGAAGTCGGTTTTTGAGTTCCAGTTTCACAACTCCGCAATCTCTTCCCTTCTTCTCTGGTTGCGCTTCTCGCTCCAGCGTTGCGACGGGTACGTCCATTTCTACGACTCAAATCCAAGAGCTTAAGGCTGCAGAGATAAACCCAAGTGTAGCAA

Protein sequence

MAFPLEQRPKPQVMDGEDGEDINIDVYNAGTPVRYLSLDHVYSTTSPFVSTSGSSNVMSKKVKARRLMVNRFDDLNFKPPRLLHVYSRRRKKPRHSSATPSFYDSLVEKVELGSKAVLKSEAREMDEMVNSADDHADDFEVDRMPKKKKKKKDKFGFNELVKLEVDSSVFRAMNGPRLRDCRTHNNNNLNVSNSGRRRKRNSSKISEKTVLKSPSAKRWVRLSFEDVDPKVYIGLQCKVYWPLDADWYCGRVVGYTSETNRHHIEYEDDDKEDLILSNEKVKFYISGEEMQSLNLSFGVDSVDSDAYDYNEMLVLAASLDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNISGGRTVPVQFFGTHDFARIKVKQAMSFLKGLLSSFHLKCKKPHFIRSLEEAKMYLSEQKLPPSMLQLQNGIEVDDFASASGEEEATTDSGEECLNEGGMPCPHNGYGSSPFIVGDLEIVSLGKIVKDSKYFQNDGSVWPEGYTAVRKFSSLTDPNVCTLYKMEVLRDFESKFRPLCRVTLDNGEQFKGSSPSACWNKIYKRMRKIQHISDASTEGRGETVYKSGSDMFGFSNPDVKKLIQGISKSGLSSSRSLSKVASKKYKDFPVGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDKCRMMVHARCYGELEPVDGVLWLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAHLACAIWIPETCLSDIKKMEPIDGLSRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCARAAGLCVELEDDDRLHLLAADEDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCSNYTPPCNPSGCARTEPYNYFGRRGRKAPEALAAASLKRLFVENQPYIASGYSQHLLSGNLLPSSGVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVITVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGVVNDTEEEERVAKLYVPRTDLVDWRELSYRVCPWKLTSSKFAHDVDISAMRTGSRFLSSSFTTPQSLPFFSGCASRSSVATGTSISTTQIQELKAAEINPSVAX
Homology
BLAST of Sgr021830 vs. NCBI nr
Match: XP_038884706.1 (histone-lysine N-methyltransferase ATX2-like [Benincasa hispida] >XP_038884708.1 histone-lysine N-methyltransferase ATX2-like [Benincasa hispida])

HSP 1 Score: 2117.4 bits (5485), Expect = 0.0e+00
Identity = 1018/1105 (92.13%), Positives = 1063/1105 (96.20%), Query Frame = 0

Query: 1    MAFPLEQRPKPQVMDGEDGEDINIDVYNAGTPVRYLSLDHVYSTTSPFVSTSGSSNVMSK 60
            MA PL+QRPKP ++DGEDG+DINIDVYNAGTP+RYLSLDHVYSTTSPFVSTSGSSNVMSK
Sbjct: 1    MALPLQQRPKPPIVDGEDGDDINIDVYNAGTPIRYLSLDHVYSTTSPFVSTSGSSNVMSK 60

Query: 61   KVKARRLMVNRFDDLNFKPPRLLHVYSRRRKKPRHSSATPSFYDSLVEKVELGSKAVLKS 120
            KVKARRL+VN FDDLNFKPPRLLHVYSRRRKKPRHSSA  S Y+SLVE+VELGS+ V++S
Sbjct: 61   KVKARRLVVNHFDDLNFKPPRLLHVYSRRRKKPRHSSAGSSVYESLVEQVELGSRTVVES 120

Query: 121  EAREMDEMVNSADDHADDFEVDRMPKKKKKKKDKFGFNELVKLEVDSSVFRAMNGPRLRD 180
            EARE+DEMVN  DDH ++ EVDR PKKKKK+KDKFG NELVKLEVDSSV RAMNGPRLRD
Sbjct: 121  EAREIDEMVNGVDDHEEESEVDRTPKKKKKRKDKFGCNELVKLEVDSSVIRAMNGPRLRD 180

Query: 181  CRTHNNNNLNVSNSGRRRKRNSSKISEKTVLKSPSAKRWVRLSFEDVDPKVYIGLQCKVY 240
            CRTH+NNN    N G+R+KRNSS++SEKT+ KSP+AKRWVRLSFEDVDPKVYIGLQCKVY
Sbjct: 181  CRTHSNNN---KNPGQRKKRNSSQLSEKTMFKSPTAKRWVRLSFEDVDPKVYIGLQCKVY 240

Query: 241  WPLDADWYCGRVVGYTSETNRHHIEYEDDDKEDLILSNEKVKFYISGEEMQSLNLSFGVD 300
            WPLDADWYCG VVGY SET RHHIEYED+D+EDLILSNEKVKF+ISGEEMQSLNL+FGVD
Sbjct: 241  WPLDADWYCGCVVGYNSETGRHHIEYEDEDREDLILSNEKVKFHISGEEMQSLNLNFGVD 300

Query: 301  SVDSDAYDYNEMLVLAASLDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNI 360
            SVDSDAYDYNEMLVLAASLDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNI
Sbjct: 301  SVDSDAYDYNEMLVLAASLDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNI 360

Query: 361  SGGRTVPVQFFGTHDFARIKVKQAMSFLKGLLSSFHLKCKKPHFIRSLEEAKMYLSEQKL 420
            SGGRTVPVQFFGTHDFARIKVKQA+SFLKGLLS FH KCKKPHF+RSLEEAKMYLSEQKL
Sbjct: 361  SGGRTVPVQFFGTHDFARIKVKQAISFLKGLLSFFHQKCKKPHFMRSLEEAKMYLSEQKL 420

Query: 421  PPSMLQLQNGIEVDDFASASGEEEATTDSGEECLNE-GGMPCPHNGYGSSPFIVGDLEIV 480
            PPSMLQLQNGIEVDDFA+ASGEEE TTDSGEECLNE GGM C  N YGSSPFIVGDLEI+
Sbjct: 421  PPSMLQLQNGIEVDDFATASGEEEGTTDSGEECLNEGGGMHCLLNEYGSSPFIVGDLEII 480

Query: 481  SLGKIVKDSKYFQNDGSVWPEGYTAVRKFSSLTDPNVCTLYKMEVLRDFESKFRPLCRVT 540
            SLGKIVKDSKYFQNDGSVWPEGYTAVRKFSSLTDPNVCTLY+MEVLRDFESKFRPL RVT
Sbjct: 481  SLGKIVKDSKYFQNDGSVWPEGYTAVRKFSSLTDPNVCTLYRMEVLRDFESKFRPLFRVT 540

Query: 541  LDNGEQFKGSSPSACWNKIYKRMRKIQHISDASTEGRGETVYKSGSDMFGFSNPDVKKLI 600
            LDNGEQFKGSSPSACWNKIYKRMRK+QHISDA+ E +GE VYKSGSDMFGFSNPDVKKLI
Sbjct: 541  LDNGEQFKGSSPSACWNKIYKRMRKVQHISDAAAESKGEFVYKSGSDMFGFSNPDVKKLI 600

Query: 601  QGISKSGLSSSRSLSKVASKKYKDFPVGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCD 660
            QGISKSGLSSSRS  KVASKKYKDFP+GYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCD
Sbjct: 601  QGISKSGLSSSRSSGKVASKKYKDFPIGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCD 660

Query: 661  KCRMMVHARCYGELEPVDGVLWLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAHLA 720
            KCRMMVHARCYGELEPVDGV+WLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAHLA
Sbjct: 661  KCRMMVHARCYGELEPVDGVIWLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAHLA 720

Query: 721  CAIWIPETCLSDIKKMEPIDGLSRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLC 780
            CAIWIPETCLSDIKKMEPIDGL+RINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLC
Sbjct: 721  CAIWIPETCLSDIKKMEPIDGLNRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLC 780

Query: 781  ARAAGLCVELEDDDRLHLLAADEDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQC 840
            ARAAGLCVELE+DDRLHLLAADEDEE QCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQC
Sbjct: 781  ARAAGLCVELEEDDRLHLLAADEDEEHQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQC 840

Query: 841  SNYTPPCNPSGCARTEPYNYFGRRGRKAPEALAAASLKRLFVENQPYIASGYSQHLLSGN 900
            SNYTPPCNPSGCARTEPYNYFGRRGRKAPEA+AAASLKRLFVENQPYIASGYSQHLLSGN
Sbjct: 841  SNYTPPCNPSGCARTEPYNYFGRRGRKAPEAVAAASLKRLFVENQPYIASGYSQHLLSGN 900

Query: 901  LLPSSGVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAK 960
            LLP  GVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAK
Sbjct: 901  LLPCIGVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAK 960

Query: 961  HPHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHL 1020
            HPHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHL
Sbjct: 961  HPHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHL 1020

Query: 1021 INHSCEPNCYSRVITVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRG 1080
            INHSCEPNCYSRVI+VNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCG+PRCRG
Sbjct: 1021 INHSCEPNCYSRVISVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGYPRCRG 1080

Query: 1081 VVNDTEEEERVAKLYVPRTDLVDWR 1105
            VVND +EEERV+KL+V RTDLVDW+
Sbjct: 1081 VVNDMDEEERVSKLHVSRTDLVDWK 1102

BLAST of Sgr021830 vs. NCBI nr
Match: KAG7025293.1 (Histone-lysine N-methyltransferase ATX2 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 2113.2 bits (5474), Expect = 0.0e+00
Identity = 1037/1182 (87.73%), Positives = 1092/1182 (92.39%), Query Frame = 0

Query: 1    MAFPLEQRPKPQVMDGEDGEDINIDVYNAGTPVRYLSLDHVYSTTSPFVSTSGSSNVMSK 60
            MAFPLEQRPKP ++DGEDG+DINIDVYNAGTP+RYLSLDHVYSTTSPFVSTSGSSNVMSK
Sbjct: 1    MAFPLEQRPKPPILDGEDGDDINIDVYNAGTPIRYLSLDHVYSTTSPFVSTSGSSNVMSK 60

Query: 61   KVKARRLMVNRFDDLNFKPPRLLHVYSRRRKKPRHSSATPSFYDSLVEKVELGSKAVLKS 120
            KVKARRL+VN FDDLNFKPPRLLHVYSRRRKKPRHSS + S YDSLVE+VELGSK V+KS
Sbjct: 61   KVKARRLLVNHFDDLNFKPPRLLHVYSRRRKKPRHSSVSSSVYDSLVEEVELGSKTVMKS 120

Query: 121  EAREMDEMVNSADDHADDFEVDRMPKKKKKKKDKFGFNELVKLEVDSSVFRAMNGPRLRD 180
            EA E+DEMVN  DD   +FEVDR P  KKKKKD FG NELVKLEV+SSV RAMNGPRLRD
Sbjct: 121  EAFEIDEMVNGVDDLVGEFEVDRTP--KKKKKDNFGCNELVKLEVNSSVIRAMNGPRLRD 180

Query: 181  CRTHNNNNLNVSNSGRRRKRNSSKISEKTVLKSPSAKRWVRLSFEDVDPKVYIGLQCKVY 240
            CRTH+NNN    NSGRR+KRNSS+ISEKT+ KSP+AKRWVRLSFEDVDP        KVY
Sbjct: 181  CRTHSNNN---KNSGRRKKRNSSQISEKTMFKSPTAKRWVRLSFEDVDP--------KVY 240

Query: 241  WPLDADWYCGRVVGYTSETNRHHIEYEDDDKEDLILSNEKVKFYISGEEMQSLNLSFGVD 300
            WPLDADWY GRVV Y SET RH+IEYEDDDKEDL+LSNEKVKFYISGEEMQSLNLSFGVD
Sbjct: 241  WPLDADWYHGRVVDYDSETGRHNIEYEDDDKEDLVLSNEKVKFYISGEEMQSLNLSFGVD 300

Query: 301  SVDSDAYDYNEMLVLAASLDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNI 360
             +DSDAY+YNEMLVLAA+LDD LEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNI
Sbjct: 301  GIDSDAYEYNEMLVLAATLDDYLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNI 360

Query: 361  SGGRTVPVQFFGTHDFARIKVKQAMSFLKGLLSSFHLKCKKPHFIRSLEEAKMYLSEQKL 420
            SGGRTVPVQFFGTHDFARIKVKQA+SFLKGLLS FH KCKKPHFIRSLEEAKMYLSEQKL
Sbjct: 361  SGGRTVPVQFFGTHDFARIKVKQAISFLKGLLSFFHQKCKKPHFIRSLEEAKMYLSEQKL 420

Query: 421  PPSMLQLQNGIEVDDFASASGEEEATTDSGEECLNEGGMPCPHNGYGSSPFIVGDLEIVS 480
            PPSMLQLQNGIEVDDFASASGEEE TTDSGEECLNE GMPCP NGYGS PF+VGDLEI+S
Sbjct: 421  PPSMLQLQNGIEVDDFASASGEEEGTTDSGEECLNEAGMPCPPNGYGSCPFMVGDLEILS 480

Query: 481  LGKIVKDSKYFQNDGSVWPEGYTAVRKFSSLTDPNVCTLYKMEVLRDFESKFRPLCRVTL 540
            LGK+VK+SKYFQNDGS+WPEGYTAVRKFSSLTDPNV T YKMEVLRDFESKFRPL RVTL
Sbjct: 481  LGKVVKNSKYFQNDGSIWPEGYTAVRKFSSLTDPNVRTKYKMEVLRDFESKFRPLFRVTL 540

Query: 541  DNGEQFKGSSPSACWNKIYKRMRKIQHISDASTEGRGETVYKSGSDMFGFSNPDVKKLIQ 600
            DNGEQFKGSSPSACWNKIYKRMRKIQHISD S E +GE VYKSGSDMFGFSNPDVKKLIQ
Sbjct: 541  DNGEQFKGSSPSACWNKIYKRMRKIQHISDTSAEVKGEIVYKSGSDMFGFSNPDVKKLIQ 600

Query: 601  GISKSGLSSSRSLSKVASKKYKDFPVGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDK 660
            GISKSGLSSSRSL KVASKKYK+FP+GYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDK
Sbjct: 601  GISKSGLSSSRSLGKVASKKYKNFPIGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDK 660

Query: 661  CRMMVHARCYGELEPVDGVLWLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAHLAC 720
            CRMMVHARCYGELEPVDGVLWLCNLCRPGSPDC PPCCLCPVIGGAMKPTTDGRWAHLAC
Sbjct: 661  CRMMVHARCYGELEPVDGVLWLCNLCRPGSPDCLPPCCLCPVIGGAMKPTTDGRWAHLAC 720

Query: 721  AIWIPETCLSDIKKMEPIDGLSRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCA 780
            AIWIPETCLSD+KKMEPIDGL+RINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCA
Sbjct: 721  AIWIPETCLSDVKKMEPIDGLNRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCA 780

Query: 781  RAAGLCVELEDDDRLHLLAADEDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCS 840
            RAAGLCVELE+DDRLHLLAAD+DEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCS
Sbjct: 781  RAAGLCVELEEDDRLHLLAADDDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCS 840

Query: 841  NYTPPCNPSGCARTEPYNYFGRRGRKAPEALAAASLKRLFVENQPYIASGYSQHLLSGNL 900
            NYTPPCNPSGCARTEPYNYFGRRGRKAPEA+AAASLKRLFVENQP+IASGYSQHL SGNL
Sbjct: 841  NYTPPCNPSGCARTEPYNYFGRRGRKAPEAVAAASLKRLFVENQPHIASGYSQHLSSGNL 900

Query: 901  LPSSGVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAKH 960
            LPSSGVLG+KFSLQHLKTCQLDP+NILS+AEKYKFMRETFRKRLAFGKSGIHGFGIFAK+
Sbjct: 901  LPSSGVLGLKFSLQHLKTCQLDPQNILSMAEKYKFMRETFRKRLAFGKSGIHGFGIFAKY 960

Query: 961  PHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLI 1020
            PHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLI
Sbjct: 961  PHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLI 1020

Query: 1021 NHSCEPNCYSRVITVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGV 1080
            NHSCEPNCYSRVI+VNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGV
Sbjct: 1021 NHSCEPNCYSRVISVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGV 1080

Query: 1081 VNDTEEEERVAKLYVPRTDLVDWRELSYRVCPWKLTSSKFAHDVDISAMRTGSRFLSSSF 1140
            VNDTEEEERV+KL+V RTD      L+Y V   KL ++   HD+              +F
Sbjct: 1081 VNDTEEEERVSKLHVSRTD------LNYHV---KLMAAHLQHDI--------------AF 1140

Query: 1141 TTPQSLPFFSGCASRSSVATGTSISTTQIQELKAAEINPSVA 1183
                +LPFFSGCASRS+VATGT +STTQ +EL  +++NPSVA
Sbjct: 1141 AKFVALPFFSGCASRSTVATGTPVSTTQGRELNGSDVNPSVA 1146

BLAST of Sgr021830 vs. NCBI nr
Match: XP_023004925.1 (histone-lysine N-methyltransferase ATX2-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 2109.0 bits (5463), Expect = 0.0e+00
Identity = 1018/1104 (92.21%), Positives = 1060/1104 (96.01%), Query Frame = 0

Query: 1    MAFPLEQRPKPQVMDGEDGEDINIDVYNAGTPVRYLSLDHVYSTTSPFVSTSGSSNVMSK 60
            MAFPLEQRPKP ++DGEDG+DINIDVYNAGTP+RYLSLDHVYSTTSPFVSTSGSSNVMSK
Sbjct: 1    MAFPLEQRPKPPILDGEDGDDINIDVYNAGTPIRYLSLDHVYSTTSPFVSTSGSSNVMSK 60

Query: 61   KVKARRLMVNRFDDLNFKPPRLLHVYSRRRKKPRHSSATPSFYDSLVEKVELGSKAVLKS 120
            KVKARRL+VN FDDLNFKPPRLLHVYSRRRKKPRHSS + S YDSLVE+VELGSK V+KS
Sbjct: 61   KVKARRLLVNHFDDLNFKPPRLLHVYSRRRKKPRHSSVSSSVYDSLVEEVELGSKTVMKS 120

Query: 121  EAREMDEMVNSADDHADDFEVDRMPKKKKKKKDKFGFNELVKLEVDSSVFRAMNGPRLRD 180
            EA E+DEMVN  DD   +FEVDR P  KKKKKD FG NELVKLEV+SSV RAMNGPRLRD
Sbjct: 121  EACEIDEMVNGVDDLVGEFEVDRTP--KKKKKDNFGCNELVKLEVNSSVIRAMNGPRLRD 180

Query: 181  CRTHNNNNLNVSNSGRRRKRNSSKISEKTVLKSPSAKRWVRLSFEDVDPKVYIGLQCKVY 240
            CRTH+NNN    NSGRR+KRNSS+ISEKT+ KSP+AKRWVRLSFEDVDPKVYIGLQCKVY
Sbjct: 181  CRTHSNNN---KNSGRRKKRNSSQISEKTMFKSPTAKRWVRLSFEDVDPKVYIGLQCKVY 240

Query: 241  WPLDADWYCGRVVGYTSETNRHHIEYEDDDKEDLILSNEKVKFYISGEEMQSLNLSFGVD 300
            WPLDADWY GRVVGY SET RH+IEYEDDDKEDL+LSNEKVKFYISGEEMQSLNLSFGVD
Sbjct: 241  WPLDADWYHGRVVGYDSETGRHNIEYEDDDKEDLVLSNEKVKFYISGEEMQSLNLSFGVD 300

Query: 301  SVDSDAYDYNEMLVLAASLDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNI 360
             +DSDAY+YNEMLVLAA+LDD LEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNI
Sbjct: 301  GIDSDAYEYNEMLVLAATLDDYLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNI 360

Query: 361  SGGRTVPVQFFGTHDFARIKVKQAMSFLKGLLSSFHLKCKKPHFIRSLEEAKMYLSEQKL 420
            SGGRTVPVQFFGTHDFARIKVKQA+SFLKGLLS FH KCKKPHFIRSLEEAKMYLSEQKL
Sbjct: 361  SGGRTVPVQFFGTHDFARIKVKQAISFLKGLLSFFHQKCKKPHFIRSLEEAKMYLSEQKL 420

Query: 421  PPSMLQLQNGIEVDDFASASGEEEATTDSGEECLNEGGMPCPHNGYGSSPFIVGDLEIVS 480
            PPSMLQLQNGIEVDDFASASGEEE TTDSGEECLNE GMPCP NGYGSSPF+VGDLEI+S
Sbjct: 421  PPSMLQLQNGIEVDDFASASGEEEGTTDSGEECLNEAGMPCPPNGYGSSPFMVGDLEILS 480

Query: 481  LGKIVKDSKYFQNDGSVWPEGYTAVRKFSSLTDPNVCTLYKMEVLRDFESKFRPLCRVTL 540
            LGK+VK+SKYFQNDGSVWPEGYTAVRKFSSLTDPNV T YKMEVLRDFESKFRPL RVTL
Sbjct: 481  LGKVVKNSKYFQNDGSVWPEGYTAVRKFSSLTDPNVRTSYKMEVLRDFESKFRPLFRVTL 540

Query: 541  DNGEQFKGSSPSACWNKIYKRMRKIQHISDASTEGRGETVYKSGSDMFGFSNPDVKKLIQ 600
            DNGEQFKGSSPSACWNKIYKRMRKIQHISD S E +GE VYKSGSDMFGFSNPDVKKLIQ
Sbjct: 541  DNGEQFKGSSPSACWNKIYKRMRKIQHISDTSAEVKGEIVYKSGSDMFGFSNPDVKKLIQ 600

Query: 601  GISKSGLSSSRSLSKVASKKYKDFPVGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDK 660
            GISKSGLSSSRSL KVASKKYK+FP+GYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDK
Sbjct: 601  GISKSGLSSSRSLGKVASKKYKNFPIGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDK 660

Query: 661  CRMMVHARCYGELEPVDGVLWLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAHLAC 720
            CRMMVHARCYGELEPVDGVLWLCNLCRPGSPDC PPCCLCPVIGGAMKPTTDGRWAHLAC
Sbjct: 661  CRMMVHARCYGELEPVDGVLWLCNLCRPGSPDCLPPCCLCPVIGGAMKPTTDGRWAHLAC 720

Query: 721  AIWIPETCLSDIKKMEPIDGLSRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCA 780
            AIWIPETCLSD+KKMEPIDGL+RINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCA
Sbjct: 721  AIWIPETCLSDVKKMEPIDGLNRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCA 780

Query: 781  RAAGLCVELEDDDRLHLLAADEDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCS 840
            RAAGLCVELE+DDRLHLLAAD+DEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCS
Sbjct: 781  RAAGLCVELEEDDRLHLLAADDDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCS 840

Query: 841  NYTPPCNPSGCARTEPYNYFGRRGRKAPEALAAASLKRLFVENQPYIASGYSQHLLSGNL 900
            NYTPPCNPSGCARTEPYNYFGRRGRKAPEA+AAASLKRLFVENQP+IASGYSQHL SGNL
Sbjct: 841  NYTPPCNPSGCARTEPYNYFGRRGRKAPEAVAAASLKRLFVENQPHIASGYSQHLSSGNL 900

Query: 901  LPSSGVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAKH 960
            LPSSGVLG+KFSLQHLKTCQLDP+NILS+AEKYKFMRETFRKRLAFGKSGIHGFGIFAK+
Sbjct: 901  LPSSGVLGLKFSLQHLKTCQLDPQNILSMAEKYKFMRETFRKRLAFGKSGIHGFGIFAKY 960

Query: 961  PHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLI 1020
            PHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLI
Sbjct: 961  PHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLI 1020

Query: 1021 NHSCEPNCYSRVITVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGV 1080
            NHSCEPNCYSRVI+VNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGV
Sbjct: 1021 NHSCEPNCYSRVISVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGV 1080

Query: 1081 VNDTEEEERVAKLYVPRTDLVDWR 1105
            VNDTEEEERV+KL+V RTDLVDWR
Sbjct: 1081 VNDTEEEERVSKLHVSRTDLVDWR 1099

BLAST of Sgr021830 vs. NCBI nr
Match: XP_023513866.1 (histone-lysine N-methyltransferase ATX2-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 2108.6 bits (5462), Expect = 0.0e+00
Identity = 1018/1104 (92.21%), Positives = 1060/1104 (96.01%), Query Frame = 0

Query: 1    MAFPLEQRPKPQVMDGEDGEDINIDVYNAGTPVRYLSLDHVYSTTSPFVSTSGSSNVMSK 60
            MAFPLEQRPKP ++DGEDG+DINIDVYNAGTP+RYLSLDHVYSTTSPFVSTSGSSNVMSK
Sbjct: 1    MAFPLEQRPKPPILDGEDGDDINIDVYNAGTPIRYLSLDHVYSTTSPFVSTSGSSNVMSK 60

Query: 61   KVKARRLMVNRFDDLNFKPPRLLHVYSRRRKKPRHSSATPSFYDSLVEKVELGSKAVLKS 120
            KVKARRL+VN FDDLNFKPPRLLHVYSRRRKKPRHSS + S YDSLVE+VELGSK V+KS
Sbjct: 61   KVKARRLLVNHFDDLNFKPPRLLHVYSRRRKKPRHSSVSSSVYDSLVEEVELGSKTVMKS 120

Query: 121  EAREMDEMVNSADDHADDFEVDRMPKKKKKKKDKFGFNELVKLEVDSSVFRAMNGPRLRD 180
            EA E+DEMVN  DD   +FEVDR P  KKKKKD FG NELVKLEV+SSV RAMNGPRLRD
Sbjct: 121  EACEIDEMVNGVDDLVREFEVDRTP--KKKKKDNFGCNELVKLEVNSSVIRAMNGPRLRD 180

Query: 181  CRTHNNNNLNVSNSGRRRKRNSSKISEKTVLKSPSAKRWVRLSFEDVDPKVYIGLQCKVY 240
            CRTH+NNN    NSGRR+KRNSS+ISEKT+ KSP+AKRWVRLSFEDVDPKVYIGLQCKVY
Sbjct: 181  CRTHSNNN---KNSGRRKKRNSSQISEKTMFKSPTAKRWVRLSFEDVDPKVYIGLQCKVY 240

Query: 241  WPLDADWYCGRVVGYTSETNRHHIEYEDDDKEDLILSNEKVKFYISGEEMQSLNLSFGVD 300
            WPLDADWY GRVVGY SET RH+IEYEDDDKEDL+LSNEKVKFYISGEEMQSLNLSFGVD
Sbjct: 241  WPLDADWYHGRVVGYDSETGRHNIEYEDDDKEDLVLSNEKVKFYISGEEMQSLNLSFGVD 300

Query: 301  SVDSDAYDYNEMLVLAASLDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNI 360
             +DSDAY+YNEMLVLAA+LDD LEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNI
Sbjct: 301  GIDSDAYEYNEMLVLAATLDDYLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNI 360

Query: 361  SGGRTVPVQFFGTHDFARIKVKQAMSFLKGLLSSFHLKCKKPHFIRSLEEAKMYLSEQKL 420
            SGGRTVPVQFFGTHDFARIKVKQA+SFLKGLLS FH KCKKPHFIRSLEEAKMYLSEQKL
Sbjct: 361  SGGRTVPVQFFGTHDFARIKVKQAISFLKGLLSFFHQKCKKPHFIRSLEEAKMYLSEQKL 420

Query: 421  PPSMLQLQNGIEVDDFASASGEEEATTDSGEECLNEGGMPCPHNGYGSSPFIVGDLEIVS 480
            PPSMLQLQNGIEVDDFASASGEEE TTDSGEECLNE GMPCP NGYGSSPF+VGDLEI+S
Sbjct: 421  PPSMLQLQNGIEVDDFASASGEEEGTTDSGEECLNEAGMPCPPNGYGSSPFMVGDLEILS 480

Query: 481  LGKIVKDSKYFQNDGSVWPEGYTAVRKFSSLTDPNVCTLYKMEVLRDFESKFRPLCRVTL 540
            LGK+VK+SKYFQNDGSVWPEGYTAVRKFSSLTDPNV T YKMEVLRDFESKFRPL RVTL
Sbjct: 481  LGKVVKNSKYFQNDGSVWPEGYTAVRKFSSLTDPNVRTSYKMEVLRDFESKFRPLFRVTL 540

Query: 541  DNGEQFKGSSPSACWNKIYKRMRKIQHISDASTEGRGETVYKSGSDMFGFSNPDVKKLIQ 600
            DNGEQFKGSSPSACWNKIYKRMRKIQHISD S E +GE VYKSGSDMFGFSNPDVKKLIQ
Sbjct: 541  DNGEQFKGSSPSACWNKIYKRMRKIQHISDTSAEVKGEIVYKSGSDMFGFSNPDVKKLIQ 600

Query: 601  GISKSGLSSSRSLSKVASKKYKDFPVGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDK 660
            GISKSGLSSSRSL KVASKKYK+FP+GYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDK
Sbjct: 601  GISKSGLSSSRSLGKVASKKYKNFPIGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDK 660

Query: 661  CRMMVHARCYGELEPVDGVLWLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAHLAC 720
            CRMMVHARCYGELEPVDGVLWLCNLCRPGSPDC PPCCLCPVIGGAMKPTTDGRWAHLAC
Sbjct: 661  CRMMVHARCYGELEPVDGVLWLCNLCRPGSPDCLPPCCLCPVIGGAMKPTTDGRWAHLAC 720

Query: 721  AIWIPETCLSDIKKMEPIDGLSRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCA 780
            AIWIPETCLSD+KKMEPIDGL+RINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCA
Sbjct: 721  AIWIPETCLSDVKKMEPIDGLNRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCA 780

Query: 781  RAAGLCVELEDDDRLHLLAADEDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCS 840
            RAAGLCVELE+DDRLHLLAAD+DEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCS
Sbjct: 781  RAAGLCVELEEDDRLHLLAADDDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCS 840

Query: 841  NYTPPCNPSGCARTEPYNYFGRRGRKAPEALAAASLKRLFVENQPYIASGYSQHLLSGNL 900
            NYTPPCNPSGCARTEPYNYFGRRGRKAPEA+AAASLKRLFVENQP+IASGYSQHL SGNL
Sbjct: 841  NYTPPCNPSGCARTEPYNYFGRRGRKAPEAVAAASLKRLFVENQPHIASGYSQHLSSGNL 900

Query: 901  LPSSGVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAKH 960
            LPSSGVLG+KFSLQHLKTCQLDP+NILS+AEKYKFMRETFRKRLAFGKSGIHGFGIFAK+
Sbjct: 901  LPSSGVLGLKFSLQHLKTCQLDPQNILSMAEKYKFMRETFRKRLAFGKSGIHGFGIFAKY 960

Query: 961  PHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLI 1020
            PHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLI
Sbjct: 961  PHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLI 1020

Query: 1021 NHSCEPNCYSRVITVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGV 1080
            NHSCEPNCYSRVI+VNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGV
Sbjct: 1021 NHSCEPNCYSRVISVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGV 1080

Query: 1081 VNDTEEEERVAKLYVPRTDLVDWR 1105
            VNDTEEEERV+KL+V RTDLVDWR
Sbjct: 1081 VNDTEEEERVSKLHVSRTDLVDWR 1099

BLAST of Sgr021830 vs. NCBI nr
Match: XP_011656480.1 (histone-lysine N-methyltransferase ATX2 [Cucumis sativus] >KGN45919.1 hypothetical protein Csa_005420 [Cucumis sativus])

HSP 1 Score: 2107.8 bits (5460), Expect = 0.0e+00
Identity = 1017/1104 (92.12%), Positives = 1059/1104 (95.92%), Query Frame = 0

Query: 2    AFPLEQRPKPQVMDGEDGEDINIDVYNAGTPVRYLSLDHVYSTTSPFVSTSGSSNVMSKK 61
            AF L QRPKP ++DGEDG+DINIDVYNAGTP+RYLSLDHVYSTTSPFVSTSGSSNVMSKK
Sbjct: 3    AFSLHQRPKPPIVDGEDGDDINIDVYNAGTPIRYLSLDHVYSTTSPFVSTSGSSNVMSKK 62

Query: 62   VKARRLMVNRFDDLNFKPPRLLHVYSRRRKKPRHSSATPSFYDSLVEKVELGSKAVLKSE 121
            VKARRLMVN FDDLNFKPPRLLHVYSRRRKKPRHSSA+ S YDSLVE+VELGS  V++SE
Sbjct: 63   VKARRLMVNHFDDLNFKPPRLLHVYSRRRKKPRHSSASSSMYDSLVEQVELGSTTVMESE 122

Query: 122  AREMDEMVNSADDHADDFEVDRMPKKKKKKKDKFGFNELVKLEVDSSVFRAMNGPRLRDC 181
            A E DEMVN  D HA++FEVDR PK KKKK DKFG NELVKLEVDSSV R MNGPRLRDC
Sbjct: 123  ACETDEMVNGVDGHAEEFEVDRTPKNKKKKNDKFGCNELVKLEVDSSVIRTMNGPRLRDC 182

Query: 182  RTHNNNNLNVSNSGRRRKRNSSKISEKTVLKSPSAKRWVRLSFEDVDPKVYIGLQCKVYW 241
            RTH+NNN   +NSG+ +KRNSS+ISEKT  KSP+AKRWVRLSFEDVDPKVY+GLQCKVYW
Sbjct: 183  RTHSNNN---NNSGQSKKRNSSQISEKTTFKSPTAKRWVRLSFEDVDPKVYVGLQCKVYW 242

Query: 242  PLDADWYCGRVVGYTSETNRHHIEYEDDDKEDLILSNEKVKFYISGEEMQSLNLSFGVDS 301
            PLDA WYCGRVVGY SET+ HHIEYED D+EDL+LSNEKVKF+ISGEEMQ+LNL+FGVDS
Sbjct: 243  PLDAQWYCGRVVGYNSETSCHHIEYEDGDREDLVLSNEKVKFHISGEEMQTLNLNFGVDS 302

Query: 302  VDSDAYDYNEMLVLAASLDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNIS 361
            VDSDAYDYNEMLVLAA+LDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNIS
Sbjct: 303  VDSDAYDYNEMLVLAATLDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNIS 362

Query: 362  GGRTVPVQFFGTHDFARIKVKQAMSFLKGLLSSFHLKCKKPHFIRSLEEAKMYLSEQKLP 421
            GGRTVPVQFFGTHDFARIKVKQA+SFLKGLLS FH KCKKPHF+RSLEEAKMYLSEQKLP
Sbjct: 363  GGRTVPVQFFGTHDFARIKVKQAISFLKGLLSFFHQKCKKPHFMRSLEEAKMYLSEQKLP 422

Query: 422  PSMLQLQNGIEVDDFASASGEEEATTDSGEECLNE-GGMPCPHNGYGSSPFIVGDLEIVS 481
            PSMLQLQNGIEVDDFASASGEEE TTDSGEECLNE GG+ C  NGY  SPF VGDLEI+S
Sbjct: 423  PSMLQLQNGIEVDDFASASGEEEGTTDSGEECLNEGGGVRCALNGY-RSPFKVGDLEIIS 482

Query: 482  LGKIVKDSKYFQNDGSVWPEGYTAVRKFSSLTDPNVCTLYKMEVLRDFESKFRPLCRVTL 541
            LGKIVKDSKYFQNDGSVWPEGYTAVRKFSSLTDPNVCTLY+MEVLRDFESKFRPL RVTL
Sbjct: 483  LGKIVKDSKYFQNDGSVWPEGYTAVRKFSSLTDPNVCTLYRMEVLRDFESKFRPLFRVTL 542

Query: 542  DNGEQFKGSSPSACWNKIYKRMRKIQHISDASTEGRGETVYKSGSDMFGFSNPDVKKLIQ 601
            DNGEQFKGSSPSACWNKIYKRM+KIQH SDASTE +GE VYKSGSDMFGFSNPDVKKLIQ
Sbjct: 543  DNGEQFKGSSPSACWNKIYKRMKKIQHTSDASTETKGEFVYKSGSDMFGFSNPDVKKLIQ 602

Query: 602  GISKSGLSSSRSLSKVASKKYKDFPVGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDK 661
            GISKSGLSSSRSLSKVASKKYKDFP+GYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDK
Sbjct: 603  GISKSGLSSSRSLSKVASKKYKDFPIGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDK 662

Query: 662  CRMMVHARCYGELEPVDGVLWLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAHLAC 721
            CRMMVHARCYGELEPVDGV+WLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAHLAC
Sbjct: 663  CRMMVHARCYGELEPVDGVIWLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAHLAC 722

Query: 722  AIWIPETCLSDIKKMEPIDGLSRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCA 781
            AIWIPETCLSDIKKMEPIDGL+RINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCA
Sbjct: 723  AIWIPETCLSDIKKMEPIDGLNRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCA 782

Query: 782  RAAGLCVELEDDDRLHLLAADEDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCS 841
            RAAGLCVELE+DDRLHLLAADEDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCS
Sbjct: 783  RAAGLCVELEEDDRLHLLAADEDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCS 842

Query: 842  NYTPPCNPSGCARTEPYNYFGRRGRKAPEALAAASLKRLFVENQPYIASGYSQHLLSGNL 901
            NYTPPCNPSGCARTEPYNYF RRGRKAPEA+AAA+LKRLFVENQPYIASGYSQHLLSGNL
Sbjct: 843  NYTPPCNPSGCARTEPYNYFERRGRKAPEAVAAAALKRLFVENQPYIASGYSQHLLSGNL 902

Query: 902  LPSSGVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAKH 961
            LPSSGVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAKH
Sbjct: 903  LPSSGVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAKH 962

Query: 962  PHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLI 1021
            PHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLI
Sbjct: 963  PHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLI 1022

Query: 1022 NHSCEPNCYSRVITVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGV 1081
            NHSCEPNCYSRVI+VNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCG+PRCRGV
Sbjct: 1023 NHSCEPNCYSRVISVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGYPRCRGV 1082

Query: 1082 VNDTEEEERVAKLYVPRTDLVDWR 1105
            VNDT+EEERV+KL+V RTDLVDWR
Sbjct: 1083 VNDTDEEERVSKLHVSRTDLVDWR 1102

BLAST of Sgr021830 vs. ExPASy Swiss-Prot
Match: P0CB22 (Histone-lysine N-methyltransferase ATX2 OS=Arabidopsis thaliana OX=3702 GN=ATX2 PE=1 SV=1)

HSP 1 Score: 1325.5 bits (3429), Expect = 0.0e+00
Identity = 676/1096 (61.68%), Positives = 808/1096 (73.72%), Query Frame = 0

Query: 17   EDGEDINIDV----YNAGTPVRYLSLDHVYSTTSPFVSTSGSSNVMS-KKVKARRL-MVN 76
            E+GED  I      + A  PVRY SL+ VYS +S   S    +   S KKV A +L M +
Sbjct: 11   EEGEDTQIKTELHDHAADNPVRYASLESVYSVSSSSSSLCCKTAAGSHKKVNALKLPMSD 70

Query: 77   RFDDLNFKPPRLLHVYSRRRKKPRHSSATPSFYDSLVEKVELGSKAVLKSEAREMDEMVN 136
             F+    + P ++HVY RR+++ R         +S +E       A+L++E  E D+ + 
Sbjct: 71   SFELQPHRRPEIVHVYCRRKRRRRRRR------ESFLE------LAILQNEGVERDDRIV 130

Query: 137  SADDHADDFEVDRMPKKKKKKKDKFGFNELVKLEVDSSVFRAMNGPRLRDCRTHNNNNLN 196
              +    D E +   KKKK+KK + G  EL+KL VDS+       P LR CR     + N
Sbjct: 131  KIESAELDDEKEEENKKKKQKKRRIGNGELMKLGVDSTTLSVSATPPLRGCRIKAVCSGN 190

Query: 197  VSNSGRRRKRNSSKISEKTVLKSPSAKRWVRLSFEDVDPKVYIGLQCKVYWPLDADWYCG 256
              +   R KRN+ K  EK V  S +AK+WVRLS++ VDPK +IGLQCKV+WPLDA WY G
Sbjct: 191  KQDGSSRSKRNTVKNQEKVVTASATAKKWVRLSYDGVDPKHFIGLQCKVFWPLDAVWYPG 250

Query: 257  RVVGYTSETNRHHIEYEDDDKEDLILSNEKVKFYISGEEMQSLNLSFGVDSVDSDAYDYN 316
             +VGY  ET  H ++Y D D E+L L  EK+KF IS ++M+ LN+ FG + V  D  DY+
Sbjct: 251  SIVGYNVETKHHIVKYGDGDGEELALRREKIKFLISRDDMELLNMKFGTNDVVVDGQDYD 310

Query: 317  EMLVLAASLDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRN-ISGGRTVPVQ 376
            E+++LAAS ++C + EP DI+WAKLTGHAMWPAIIVDES+I  RKGL N ISGGR+V VQ
Sbjct: 311  ELVILAASFEECQDFEPRDIIWAKLTGHAMWPAIIVDESVIVKRKGLNNKISGGRSVLVQ 370

Query: 377  FFGTHDFARIKVKQAMSFLKGLLSSFHLKCKKPHFIRSLEEAKMYLSEQKLPPSMLQLQN 436
            FFGTHDFARI+VKQA+SFLKGLLS   LKCK+P F  ++EEAKMYL E KLP  M QLQ 
Sbjct: 371  FFGTHDFARIQVKQAVSFLKGLLSRSPLKCKQPRFEEAMEEAKMYLKEYKLPGRMDQLQK 430

Query: 437  GIEVDDFASASGEEEATTDSGEECLNEGGMPCPHNGYGSSPFIVGDLEIVSLGKIVKDSK 496
              + D     +  EE +++SG++   +G +       G     +GDL+I++LG+IV DS+
Sbjct: 431  VADTDCSERINSGEEDSSNSGDDYTKDGEVWLRPTELGDCLHRIGDLQIINLGRIVTDSE 490

Query: 497  YFQNDGSVWPEGYTAVRKFSSLTDPNVCTLYKMEVLRDFESKFRPLCRVTLDNGEQFKGS 556
            +F++    WPEGYTA RKF SL DPN   +YKMEVLRD ESK RP+ RVT ++GEQFKG 
Sbjct: 491  FFKDSKHTWPEGYTATRKFISLKDPNASAMYKMEVLRDAESKTRPVFRVTTNSGEQFKGD 550

Query: 557  SPSACWNKIYKRMRKIQHISDASTEGRGETVYKSGSDMFGFSNPDVKKLIQGISKSGLSS 616
            +PSACWNKIY R++KIQ  SD + +  GE +++SG+DMFGFSNP+V KLIQG+ +S   S
Sbjct: 551  TPSACWNKIYNRIKKIQIASD-NPDVLGEGLHESGTDMFGFSNPEVDKLIQGLLQSRPPS 610

Query: 617  SRSLSKVASKKYKDFPVGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDKCRMMVHARC 676
              S  K +S KY+D P GYRPVRV+WKDLDKC+VCHMDEEYENNLFLQCDKCRMMVH RC
Sbjct: 611  KVSQRKYSSGKYQDHPTGYRPVRVEWKDLDKCNVCHMDEEYENNLFLQCDKCRMMVHTRC 670

Query: 677  YGELEPVDGVLWLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAHLACAIWIPETCL 736
            YG+LEP +G+LWLCNLCRP + D PP CCLCPV+GGAMKPTTDGRWAHLACAIWIPETCL
Sbjct: 671  YGQLEPHNGILWLCNLCRPVALDIPPRCCLCPVVGGAMKPTTDGRWAHLACAIWIPETCL 730

Query: 737  SDIKKMEPIDGLSRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCARAAGLCVEL 796
             D+KKMEPIDG+ +++KDRWKLLCSICGVSYGACIQCSNNTC VAYHPLCARAAGLCVEL
Sbjct: 731  LDVKKMEPIDGVKKVSKDRWKLLCSICGVSYGACIQCSNNTCRVAYHPLCARAAGLCVEL 790

Query: 797  EDDDRLHLLAADEDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCSNYTPPCNPS 856
             D+DRL LL+ D+DE DQCIRLLSFCK+HR  SN  L  E  I  A    + Y PP NPS
Sbjct: 791  ADEDRLFLLSMDDDEADQCIRLLSFCKRHRQTSNYHLETEYMIKPA-HNIAEYLPPPNPS 850

Query: 857  GCARTEPYNYFGRRGRKAPEALAAASLKRLFVENQPYIASGYSQHLLSGNLLPSSGVLGM 916
            GCARTEPYNY GRRGRK PEALA AS KRLFVENQPYI  GYS+H  S        + G 
Sbjct: 851  GCARTEPYNYLGRRGRKEPEALAGASSKRLFVENQPYIVGGYSRHEFS----TYERIYGS 910

Query: 917  KFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVI 976
            K S   + T    P NILS+AEKY FM+ET+RKRLAFGKSGIHGFGIFAK PHRAGDMVI
Sbjct: 911  KMS--QITT----PSNILSMAEKYTFMKETYRKRLAFGKSGIHGFGIFAKLPHRAGDMVI 970

Query: 977  EYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCY 1036
            EYTGE+VRPPIAD+RE  IYN +VGAGTYMFRID+ERVIDATR GSIAHLINHSCEPNCY
Sbjct: 971  EYTGELVRPPIADKREHLIYNSMVGAGTYMFRIDNERVIDATRTGSIAHLINHSCEPNCY 1030

Query: 1037 SRVITVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGVVNDTEEEER 1096
            SRVI+VNGDEHIIIFAKRD+ +WEELTYDYRFFSIDE+LACYCGFPRCRGVVNDTE EER
Sbjct: 1031 SRVISVNGDEHIIIFAKRDVAKWEELTYDYRFFSIDERLACYCGFPRCRGVVNDTEAEER 1082

Query: 1097 VAKLYVPRTDLVDWRE 1106
             A ++  R +L +W E
Sbjct: 1091 QANIHASRCELKEWTE 1082

BLAST of Sgr021830 vs. ExPASy Swiss-Prot
Match: Q9C5X4 (Histone H3-lysine(4) N-trimethyltransferase ATX1 OS=Arabidopsis thaliana OX=3702 GN=ATX1 PE=1 SV=2)

HSP 1 Score: 1282.7 bits (3318), Expect = 0.0e+00
Identity = 660/1106 (59.67%), Positives = 806/1106 (72.88%), Query Frame = 0

Query: 22   INIDVYN-AGTPVRYLSLDHVYSTTSP---FVSTSGSSNVMSKKVKARRL-MVNRFD--- 81
            I IDV++    P+RY S++ +YS  S     V+  GS ++MSKKVKA++L M+ +F+   
Sbjct: 10   IEIDVHDLVEAPIRYDSIESIYSIPSSALCCVNAVGSHSLMSKKVKAQKLPMIEQFEIEG 69

Query: 82   ---------------DLNFKPPRLLHVYSRRRKKPRHSSATPSFYDSLVEKVELGSKAVL 141
                            L  + P ++ VY RRRK+P             + +  L     +
Sbjct: 70   SGVSASDDCCRSDDYKLRIQRPEIVRVYYRRRKRP-------------LRECLLDQAVAV 129

Query: 142  KSEAREMDEMVNSADDHADDFEVDRMPKKKKKKKDKFGFNELVKLEVDSSVFRAMNGPRL 201
            K+E+ E+DE+        D FE        +KK+ K G  ELVK          M    L
Sbjct: 130  KTESVELDEI--------DCFE--------EKKRRKIGNCELVK--------SGMESIGL 189

Query: 202  RDCRTHNNNNLNVSNSGRRRKRNSSKISEKTVLKSPSAKRWVRLSFEDVDPKVYIGLQCK 261
            R C+ +N  + N  N   RRK +SSK  +K  L S SAK+WVRLS++ VDP  +IGLQCK
Sbjct: 190  RRCKENNAFSGNKQNGSSRRKGSSSKNQDKATLASRSAKKWVRLSYDGVDPTSFIGLQCK 249

Query: 262  VYWPLDADWYCGRVVGYTSETNRHHIEYEDDDKEDLILSNEKVKFYISGEEMQSLNLSFG 321
            V+WPLDA WY G +VGY++E  R+ ++Y D   ED++   E +KF +S EEM+ L+L F 
Sbjct: 250  VFWPLDALWYEGSIVGYSAERKRYTVKYRDGCDEDIVFDREMIKFLVSREEMELLHLKFC 309

Query: 322  VDSVDSDAYDYNEMLVLAASLDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLR 381
              +V  D  DY+EM+VLAA+LD+C + EPGDIVWAKL GHAMWPA+IVDES+IG+RKGL 
Sbjct: 310  TSNVTVDGRDYDEMVVLAATLDECQDFEPGDIVWAKLAGHAMWPAVIVDESIIGERKGLN 369

Query: 382  N-ISGGRTVPVQFFGTHDFARIKVKQAMSFLKGLLSSFHLKCKKPHFIRSLEEAKMYLSE 441
            N +SGG ++ VQFFGTHDFARIKVKQA+SF+KGLLS  HLKCK+P F   ++EAKMYL  
Sbjct: 370  NKVSGGGSLLVQFFGTHDFARIKVKQAISFIKGLLSPSHLKCKQPRFEEGMQEAKMYLKA 429

Query: 442  QKLPPSMLQLQNGIEVDDFASASGEEEATTDSGEECLNEGGMPCPHNGYGSSPFIVGDLE 501
             +LP  M QLQ G +  D   A+  EE   +SG + LN+G +      +     I+GDL 
Sbjct: 430  HRLPERMSQLQKGADSVDSDMANSTEEG--NSGGDLLNDGEVWLRPTEHVDFRHIIGDLL 489

Query: 502  IVSLGKIVKDSKYFQNDGSVWPEGYTAVRKFSSLTDPNVCTLYKMEVLRDFESKFRPLCR 561
            I++LGK+V DS++F+++  +WPEGYTA+RKF+SLTD +   LYKMEVLRD E+K  PL  
Sbjct: 490  IINLGKVVTDSQFFKDENHIWPEGYTAMRKFTSLTDHSASALYKMEVLRDAETKTHPLFI 549

Query: 562  VTLDNGEQFKGSSPSACWNKIYKRMRKIQHISDASTEGRGETVYKSGSDMFGFSNPDVKK 621
            VT D+GEQFKG +PSACWNKIY R++K+Q+ SD S    GE +  SG+DMFG SNP+V K
Sbjct: 550  VTADSGEQFKGPTPSACWNKIYNRIKKVQN-SD-SPNILGEELNGSGTDMFGLSNPEVIK 609

Query: 622  LIQGISKSGLSSSRSLSKVASKKYKDFPVGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQ 681
            L+Q +SKS  SS  S+ K +  ++++ P GYRPVRVDWKDLDKC+VCHMDEEYENNLFLQ
Sbjct: 610  LVQDLSKSRPSSHVSMCKNSLGRHQNQPTGYRPVRVDWKDLDKCNVCHMDEEYENNLFLQ 669

Query: 682  CDKCRMMVHARCYGELEPVDGVLWLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAH 741
            CDKCRMMVHA+CYGELEP DG LWLCNLCRPG+PD PP CCLCPV+GGAMKPTTDGRWAH
Sbjct: 670  CDKCRMMVHAKCYGELEPCDGALWLCNLCRPGAPDMPPRCCLCPVVGGAMKPTTDGRWAH 729

Query: 742  LACAIWIPETCLSDIKKMEPIDGLSRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHP 801
            LACAIWIPETCLSD+KKMEPIDG+++++KDRWKL+C+ICGVSYGACIQCSNN+C VAYHP
Sbjct: 730  LACAIWIPETCLSDVKKMEPIDGVNKVSKDRWKLMCTICGVSYGACIQCSNNSCRVAYHP 789

Query: 802  LCARAAGLCVELEDDDRLHLLAADEDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQ 861
            LCARAAGLCVELE+D     ++ + +E DQCIR+LSFCK+HR  S   L +EDRI  A  
Sbjct: 790  LCARAAGLCVELEND-----MSVEGEEADQCIRMLSFCKRHRQTSTACLGSEDRIKSATH 849

Query: 862  QCSNYTPPCNPSGCARTEPYNYFGRRGRKAPEALAAASLKRLFVENQPYIASGYSQHLLS 921
            + S Y PP NPSGCARTEPYN FGRRGRK PEALAAAS KRLFVENQPY+  GYS+   S
Sbjct: 850  KTSEYLPPPNPSGCARTEPYNCFGRRGRKEPEALAAASSKRLFVENQPYVIGGYSRLEFS 909

Query: 922  GNLLPSSGVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIF 981
                    + G K S  +       P NILS+AEKY++MRET+RKRLAFGKSGIHGFGIF
Sbjct: 910  ----TYKSIHGSKVSQMN------TPSNILSMAEKYRYMRETYRKRLAFGKSGIHGFGIF 969

Query: 982  AKHPHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIA 1041
            AK PHRAGDM+IEYTGE+VRP IAD+RE+ IYN +VGAGTYMFRIDDERVIDATR GSIA
Sbjct: 970  AKLPHRAGDMMIEYTGELVRPSIADKREQLIYNSMVGAGTYMFRIDDERVIDATRTGSIA 1029

Query: 1042 HLINHSCEPNCYSRVITVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRC 1101
            HLINHSC PNCYSRVITVNGDEHIIIFAKR I +WEELTYDYRFFSI E+L+C CGFP C
Sbjct: 1030 HLINHSCVPNCYSRVITVNGDEHIIIFAKRHIPKWEELTYDYRFFSIGERLSCSCGFPGC 1059

Query: 1102 RGVVNDTEEEERVAKLYVPRTDLVDW 1104
            RGVVNDTE EE+ AK+ VPR DL+DW
Sbjct: 1090 RGVVNDTEAEEQHAKICVPRCDLIDW 1059

BLAST of Sgr021830 vs. ExPASy Swiss-Prot
Match: Q6K431 (Histone-lysine N-methyltransferase TRX1 OS=Oryza sativa subsp. japonica OX=39947 GN=TRX1 PE=1 SV=1)

HSP 1 Score: 1070.8 bits (2768), Expect = 1.1e-311
Identity = 575/1072 (53.64%), Positives = 713/1072 (66.51%), Query Frame = 0

Query: 32   PVRYLSLDHVYSTTSPFVSTSGSSNVMSKKVKARRLMVNRFDDLNFKPPRLLHVYSRRRK 91
            P+RYL L  VYS+++P          + KK ++           + KPP +++ Y RRRK
Sbjct: 19   PIRYLPLGRVYSSSAPC--------PLPKKPRSAE---------DGKPPVIVY-YRRRRK 78

Query: 92   KPRHSSATPSFYDSLVEKVELGSKAVLKSEAREMDEMVNSADDHADDFEVDRMPKKKKKK 151
            KPR     PS            + A      RE DE          D EV R       +
Sbjct: 79   KPRVEGPPPS-----------PATAPPMLHPREDDE----------DEEVTR-------R 138

Query: 152  KDKFGFNELVKLEVDSSVFRAMNGPRLRDCRTHNNNNLNVSNSGRRRKRNSSKISEKTVL 211
            K    +  L   +   ++      P  R C   +            ++R    + ++   
Sbjct: 139  KGSLKYELLSLGQAPPALGGDGEEPARRRCLRRSGGAERRGYFSEPKRRQRQGVHKEAA- 198

Query: 212  KSPSAKRWVRLSFEDVDPKVYIGLQCKVYWPLDADWYCGRVVGYTSETNRHHIEYEDDDK 271
             S + +RW+ L  E  DP  ++GL CKV+WPLD DWY G + GY   T +H ++Y+D + 
Sbjct: 199  -SSAGRRWLELEIEAADPLAFVGLGCKVFWPLDEDWYKGSITGYNEATKKHSVKYDDGES 258

Query: 272  EDLILSNEKVKFYISGEEMQSLNLSFGVDSVDSDAYDYNEMLVLAASLDDCLEPEPGDIV 331
            EDL L++E++KF IS EEM+  NL FG+ +++   YD  E+L LA SL D    +PGD+V
Sbjct: 259  EDLNLADERIKFSISSEEMKCRNLKFGISNLNKRGYD--ELLALAVSLHDYQGLDPGDLV 318

Query: 332  WAKLTGHAMWPAIIVDESLIGDRKGLRNISGGRTVPVQFFGTHDFARIKVKQAMSFLKGL 391
            WAKLTGHAMWPA++VDES +   + L+     +++ VQFFGTHDFARIK+KQA+ FL GL
Sbjct: 319  WAKLTGHAMWPAVVVDESNVPANRALKPGRLDQSILVQFFGTHDFARIKLKQAVPFLNGL 378

Query: 392  LSSFHLKCKKPHFIRSLEEAKMYLSEQKLPPSMLQLQNGIEVDDFASASGEEEATTD--S 451
            LSS HLKCK+  F RSLEEAK +L  Q LP +MLQLQ  +E     + S ++  + D  S
Sbjct: 379  LSSLHLKCKQARFYRSLEEAKEFLCTQLLPENMLQLQKSMEKGSSDANSNKDVHSCDNLS 438

Query: 452  GEECLNEGGMPCPHNGYGSSPFIVGDLEIVSLGKIVKDSKYFQNDGSVWPEGYTAVRKFS 511
             ++    GG     +    +P  +G+L +  LG+IV DS YF N   +WPEGYTA RKF 
Sbjct: 439  EDKTAESGG-----DYDEMTPIELGNLRVSKLGRIVTDSDYFHNKKHIWPEGYTAFRKFR 498

Query: 512  SLTDPNVCTLYKMEVLRDFESKFRPLCRVTLDNGEQFKGSSPSACWNKIYKRMRKIQHIS 571
            S+ DP+V  LYKMEVLR+ + K RPL RVT ++G Q  GS+P+ CW +IY R+++ Q  +
Sbjct: 499  SVKDPHVVILYKMEVLRNSDIKARPLFRVTSEDGTQIDGSTPNTCWKEIYCRLKEKQR-N 558

Query: 572  DASTEGRGETVYKSGSDMFGFSNPDVKKLIQGISKSGLSSSRSLSKVASKKYKDFPVGYR 631
             AS   R +    SGS MFGFSNP +++LIQ      L ++RS  K        F  GYR
Sbjct: 559  VASGLDR-DVCQGSGSYMFGFSNPQIRQLIQ-----ELPNARSCLKYFENAGDTFR-GYR 618

Query: 632  PVRVDWKDLDKCSVCHMDEEYENNLFLQCDKCRMMVHARCYGELEPVDGVLWLCNLCRPG 691
             V V+WKDLD CSVC MDEEYE+NLFLQCDKCRMMVHARCYGELEP++GVLWLCNLCRP 
Sbjct: 619  AVHVNWKDLDYCSVCDMDEEYEDNLFLQCDKCRMMVHARCYGELEPLNGVLWLCNLCRPE 678

Query: 692  SPDCPPPCCLCPVIGGAMKPTTDGRWAHLACAIWIPETCLSDIKKMEPIDGLSRINKDRW 751
            +P   P CCLCPV GGAMKPTTDGRWAHLACAIWIPETCL D+K+MEPIDGLSRINKDRW
Sbjct: 679  APRVSPRCCLCPVTGGAMKPTTDGRWAHLACAIWIPETCLKDVKRMEPIDGLSRINKDRW 738

Query: 752  KLLCSICGVSYGACIQCSNNTCYVAYHPLCARAAGLCVELEDDDRLHLLAADEDEEDQCI 811
            KLLCSICGV+YGACIQCS+ TC VAYHPLCARAA LCVELEDDD++HL+  DED ED CI
Sbjct: 739  KLLCSICGVAYGACIQCSHPTCRVAYHPLCARAADLCVELEDDDKIHLMLLDED-EDPCI 798

Query: 812  RLLSFCKKHRPPSNERLMAEDRIGQAGQQCSNYTPPCNPSGCARTEPYNYFGRRGRKAPE 871
            RLLS+CKKHR PS ER   E  + +          P  PSGCARTEPYN  GRRG+K P+
Sbjct: 799  RLLSYCKKHRQPSTERPSLESNLAKPAVVVQTDAVP--PSGCARTEPYNIHGRRGQKQPQ 858

Query: 872  ALAAASLKRLFVENQPYIASGYSQHLLSGNLLPSSGVLGMKF-SLQHLKTCQLDPRNILS 931
             +A AS+KRL+VEN PYI SG+ Q+ +  + + S  +  + F  + H    Q    N+ S
Sbjct: 859  VMATASVKRLYVENMPYIVSGFCQNRVGHDAI-SEPIQSVGFLDVAH----QEAVGNVSS 918

Query: 932  VAEKYKFMRETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGEIVRPPIADRRERFI 991
            + EKYK M+ TFR+RLAFGKS IHGFG+FAK  H+AGDM+IEY GE+VRPPI+D RER I
Sbjct: 919  MIEKYKSMKATFRRRLAFGKSRIHGFGVFAKVSHKAGDMMIEYIGELVRPPISDIRERRI 978

Query: 992  YNLLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVITVNGDEHIIIFAKRD 1051
            YN LVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVI+V GDEHIIIFAKRD
Sbjct: 979  YNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVLGDEHIIIFAKRD 1019

Query: 1052 IKRWEELTYDYRFFSIDEQLACYCGFPRCRGVVNDTEEEERVAKLYVPRTDL 1101
            I  WEELTYDYRF S D++L CYCGFP+CRGVVND E E + AK+ V R++L
Sbjct: 1039 INPWEELTYDYRFVSSDQRLPCYCGFPKCRGVVNDVEAEGQSAKIRVNRSEL 1019

BLAST of Sgr021830 vs. ExPASy Swiss-Prot
Match: Q8GZ42 (Histone-lysine N-methyltransferase ATX5 OS=Arabidopsis thaliana OX=3702 GN=ATX5 PE=2 SV=1)

HSP 1 Score: 267.3 bits (682), Expect = 8.2e-70
Identity = 166/487 (34.09%), Positives = 238/487 (48.87%), Query Frame = 0

Query: 628  YRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDKCRMMVHARCYGELEPVDGVLWLCNLCR 687
            Y PV V W   ++C+VC   E+++ N  + C++C++ VH  CYG     D   W+C  C 
Sbjct: 598  YEPVNVKW-TTERCAVCRWVEDWDYNKIIICNRCQIAVHQECYGTRNVRDFTSWVCKACE 657

Query: 688  PGSPDCPPPCCLCPVIGGAMKPT-TDGRWAHLACAIWIPETCLSDIKKMEPIDGLSRINK 747
              +P+    CCLCPV GGA+KPT  +  W H+ CA + PE C +  +KMEP  G+  I  
Sbjct: 658  --TPEIKRECCLCPVKGGALKPTDVETLWVHVTCAWFQPEVCFASEEKMEPALGILSIPS 717

Query: 748  DRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCARAAGLCVELEDDDRLHLLAADEDEED 807
              +  +C IC   +G+C QC    C   YH +CA  AG  +E      LH L   E    
Sbjct: 718  SNFVKICVICKQIHGSCTQCCK--CSTYYHAMCASRAGYRME------LHCL---EKNGR 777

Query: 808  QCIRLLSFCKKHRPP--------------------------SNERLMAEDRIGQAGQQCS 867
            Q  +++S+C  HR P                          S  RL+  +R  +  +  +
Sbjct: 778  QITKMVSYCSYHRAPNPDTVLIIQTPSGVFSAKSLVQNKKKSGTRLILANR-EEIEESAA 837

Query: 868  NYTPPCNPSGCARTEPYNYFGRRGRKAPEALAAASLKRLFVENQPYIASGYSQHLLSGNL 927
              T P +P   AR   Y                 S KR   E  P+   G   H      
Sbjct: 838  EDTIPIDPFSSARCRLYK------------RTVNSKKRTKEEGIPHYTGGLRHH------ 897

Query: 928  LPSSGVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAKH 987
             PS+ +     +L   +    +P++  S  E+   ++ T  +R+ FG+SGIHG+G+FA+ 
Sbjct: 898  -PSAAIQ----TLNAFRHVAEEPKSFSSFRERLHHLQRTEMERVCFGRSGIHGWGLFARR 957

Query: 988  PHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLI 1047
              + G+MV+EY GE VR  IAD RE        G   Y+F+I +E V+DAT  G+IA LI
Sbjct: 958  NIQEGEMVLEYRGEQVRGIIADLREARYRR--EGKDCYLFKISEEVVVDATEKGNIARLI 1017

Query: 1048 NHSCEPNCYSRVITVNGDE-HIIIFAKRDIKRWEELTYDYRFFSIDE----QLACYCGFP 1083
            NHSC PNCY+R+++V  DE  I++ AK  +   EELTYDY  F  DE    ++ C C  P
Sbjct: 1018 NHSCMPNCYARIMSVGDDESRIVLIAKTTVASCEELTYDY-LFDPDEPDEFKVPCLCKSP 1043

BLAST of Sgr021830 vs. ExPASy Swiss-Prot
Match: Q9SUE7 (Histone-lysine N-methyltransferase ATX4 OS=Arabidopsis thaliana OX=3702 GN=ATX4 PE=2 SV=3)

HSP 1 Score: 263.8 bits (673), Expect = 9.0e-69
Identity = 163/485 (33.61%), Positives = 239/485 (49.28%), Query Frame = 0

Query: 628  YRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDKCRMMVHARCYGELEPVDGVLWLCNLCR 687
            Y PV   W   ++C+VC   E+++ N  + C++C++ VH  CYG     D   W+C  C 
Sbjct: 583  YEPVNAKW-TTERCAVCRWVEDWDYNKIIICNRCQIAVHQECYGARHVRDFTSWVCKACE 642

Query: 688  PGSPDCPPPCCLCPVIGGAMKPT-TDGRWAHLACAIWIPETCLSDIKKMEPIDGLSRINK 747
               PD    CCLCPV GGA+KPT  +  W H+ CA + PE C +  +KMEP  G+  I  
Sbjct: 643  --RPDIKRECCLCPVKGGALKPTDVETLWVHVTCAWFQPEVCFASEEKMEPAVGILSIPS 702

Query: 748  DRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCARAAGLCVELEDDDRLHLLAADEDEED 807
              +  +C IC   +G+C QC    C   YH +CA  AG  +E      LH L   E    
Sbjct: 703  TNFVKICVICKQIHGSCTQCCK--CSTYYHAMCASRAGYRME------LHCL---EKNGQ 762

Query: 808  QCIRLLSFCKKHRPPSNERLMAEDR------------------------IGQAGQQCSNY 867
            Q  +++S+C  HR P+ + ++                            I +  +  +  
Sbjct: 763  QITKMVSYCAYHRAPNPDNVLIIQTPSGAFSAKSLVQNKKKGGSRLISLIREDDEAPAEN 822

Query: 868  TPPCNPSGCARTEPYNYFGRRGRKAPEALAAASLKRLFVENQPYIASGYSQHLLSGNLLP 927
            T  C+P   AR   +       RK        S KR+  E  P+   G   H        
Sbjct: 823  TITCDPFSAARCRVFK------RK------INSKKRIEEEAIPHHTRGPRHH-------- 882

Query: 928  SSGVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAKHPH 987
            +S  +    + +H+     +P++  S  E+   ++ T   R+ FG+SGIHG+G+FA+   
Sbjct: 883  ASAAIQTLNTFRHVPE---EPKSFSSFRERLHHLQRTEMDRVCFGRSGIHGWGLFARRNI 942

Query: 988  RAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLINH 1047
            + G+MV+EY GE VR  IAD RE       VG   Y+F+I +E V+DAT  G+IA LINH
Sbjct: 943  QEGEMVLEYRGEQVRGSIADLREARYRR--VGKDCYLFKISEEVVVDATDKGNIARLINH 1002

Query: 1048 SCEPNCYSRVITVNGDE-HIIIFAKRDIKRWEELTYDYRFFSIDE----QLACYCGFPRC 1083
            SC PNCY+R+++V  +E  I++ AK ++   EELTYDY  F  DE    ++ C C  P C
Sbjct: 1003 SCTPNCYARIMSVGDEESRIVLIAKANVAVGEELTYDY-LFDPDEAEELKVPCLCKAPNC 1027

BLAST of Sgr021830 vs. ExPASy TrEMBL
Match: A0A6J1KVZ1 (histone-lysine N-methyltransferase ATX2-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111498085 PE=4 SV=1)

HSP 1 Score: 2109.0 bits (5463), Expect = 0.0e+00
Identity = 1018/1104 (92.21%), Positives = 1060/1104 (96.01%), Query Frame = 0

Query: 1    MAFPLEQRPKPQVMDGEDGEDINIDVYNAGTPVRYLSLDHVYSTTSPFVSTSGSSNVMSK 60
            MAFPLEQRPKP ++DGEDG+DINIDVYNAGTP+RYLSLDHVYSTTSPFVSTSGSSNVMSK
Sbjct: 1    MAFPLEQRPKPPILDGEDGDDINIDVYNAGTPIRYLSLDHVYSTTSPFVSTSGSSNVMSK 60

Query: 61   KVKARRLMVNRFDDLNFKPPRLLHVYSRRRKKPRHSSATPSFYDSLVEKVELGSKAVLKS 120
            KVKARRL+VN FDDLNFKPPRLLHVYSRRRKKPRHSS + S YDSLVE+VELGSK V+KS
Sbjct: 61   KVKARRLLVNHFDDLNFKPPRLLHVYSRRRKKPRHSSVSSSVYDSLVEEVELGSKTVMKS 120

Query: 121  EAREMDEMVNSADDHADDFEVDRMPKKKKKKKDKFGFNELVKLEVDSSVFRAMNGPRLRD 180
            EA E+DEMVN  DD   +FEVDR P  KKKKKD FG NELVKLEV+SSV RAMNGPRLRD
Sbjct: 121  EACEIDEMVNGVDDLVGEFEVDRTP--KKKKKDNFGCNELVKLEVNSSVIRAMNGPRLRD 180

Query: 181  CRTHNNNNLNVSNSGRRRKRNSSKISEKTVLKSPSAKRWVRLSFEDVDPKVYIGLQCKVY 240
            CRTH+NNN    NSGRR+KRNSS+ISEKT+ KSP+AKRWVRLSFEDVDPKVYIGLQCKVY
Sbjct: 181  CRTHSNNN---KNSGRRKKRNSSQISEKTMFKSPTAKRWVRLSFEDVDPKVYIGLQCKVY 240

Query: 241  WPLDADWYCGRVVGYTSETNRHHIEYEDDDKEDLILSNEKVKFYISGEEMQSLNLSFGVD 300
            WPLDADWY GRVVGY SET RH+IEYEDDDKEDL+LSNEKVKFYISGEEMQSLNLSFGVD
Sbjct: 241  WPLDADWYHGRVVGYDSETGRHNIEYEDDDKEDLVLSNEKVKFYISGEEMQSLNLSFGVD 300

Query: 301  SVDSDAYDYNEMLVLAASLDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNI 360
             +DSDAY+YNEMLVLAA+LDD LEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNI
Sbjct: 301  GIDSDAYEYNEMLVLAATLDDYLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNI 360

Query: 361  SGGRTVPVQFFGTHDFARIKVKQAMSFLKGLLSSFHLKCKKPHFIRSLEEAKMYLSEQKL 420
            SGGRTVPVQFFGTHDFARIKVKQA+SFLKGLLS FH KCKKPHFIRSLEEAKMYLSEQKL
Sbjct: 361  SGGRTVPVQFFGTHDFARIKVKQAISFLKGLLSFFHQKCKKPHFIRSLEEAKMYLSEQKL 420

Query: 421  PPSMLQLQNGIEVDDFASASGEEEATTDSGEECLNEGGMPCPHNGYGSSPFIVGDLEIVS 480
            PPSMLQLQNGIEVDDFASASGEEE TTDSGEECLNE GMPCP NGYGSSPF+VGDLEI+S
Sbjct: 421  PPSMLQLQNGIEVDDFASASGEEEGTTDSGEECLNEAGMPCPPNGYGSSPFMVGDLEILS 480

Query: 481  LGKIVKDSKYFQNDGSVWPEGYTAVRKFSSLTDPNVCTLYKMEVLRDFESKFRPLCRVTL 540
            LGK+VK+SKYFQNDGSVWPEGYTAVRKFSSLTDPNV T YKMEVLRDFESKFRPL RVTL
Sbjct: 481  LGKVVKNSKYFQNDGSVWPEGYTAVRKFSSLTDPNVRTSYKMEVLRDFESKFRPLFRVTL 540

Query: 541  DNGEQFKGSSPSACWNKIYKRMRKIQHISDASTEGRGETVYKSGSDMFGFSNPDVKKLIQ 600
            DNGEQFKGSSPSACWNKIYKRMRKIQHISD S E +GE VYKSGSDMFGFSNPDVKKLIQ
Sbjct: 541  DNGEQFKGSSPSACWNKIYKRMRKIQHISDTSAEVKGEIVYKSGSDMFGFSNPDVKKLIQ 600

Query: 601  GISKSGLSSSRSLSKVASKKYKDFPVGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDK 660
            GISKSGLSSSRSL KVASKKYK+FP+GYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDK
Sbjct: 601  GISKSGLSSSRSLGKVASKKYKNFPIGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDK 660

Query: 661  CRMMVHARCYGELEPVDGVLWLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAHLAC 720
            CRMMVHARCYGELEPVDGVLWLCNLCRPGSPDC PPCCLCPVIGGAMKPTTDGRWAHLAC
Sbjct: 661  CRMMVHARCYGELEPVDGVLWLCNLCRPGSPDCLPPCCLCPVIGGAMKPTTDGRWAHLAC 720

Query: 721  AIWIPETCLSDIKKMEPIDGLSRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCA 780
            AIWIPETCLSD+KKMEPIDGL+RINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCA
Sbjct: 721  AIWIPETCLSDVKKMEPIDGLNRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCA 780

Query: 781  RAAGLCVELEDDDRLHLLAADEDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCS 840
            RAAGLCVELE+DDRLHLLAAD+DEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCS
Sbjct: 781  RAAGLCVELEEDDRLHLLAADDDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCS 840

Query: 841  NYTPPCNPSGCARTEPYNYFGRRGRKAPEALAAASLKRLFVENQPYIASGYSQHLLSGNL 900
            NYTPPCNPSGCARTEPYNYFGRRGRKAPEA+AAASLKRLFVENQP+IASGYSQHL SGNL
Sbjct: 841  NYTPPCNPSGCARTEPYNYFGRRGRKAPEAVAAASLKRLFVENQPHIASGYSQHLSSGNL 900

Query: 901  LPSSGVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAKH 960
            LPSSGVLG+KFSLQHLKTCQLDP+NILS+AEKYKFMRETFRKRLAFGKSGIHGFGIFAK+
Sbjct: 901  LPSSGVLGLKFSLQHLKTCQLDPQNILSMAEKYKFMRETFRKRLAFGKSGIHGFGIFAKY 960

Query: 961  PHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLI 1020
            PHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLI
Sbjct: 961  PHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLI 1020

Query: 1021 NHSCEPNCYSRVITVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGV 1080
            NHSCEPNCYSRVI+VNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGV
Sbjct: 1021 NHSCEPNCYSRVISVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGV 1080

Query: 1081 VNDTEEEERVAKLYVPRTDLVDWR 1105
            VNDTEEEERV+KL+V RTDLVDWR
Sbjct: 1081 VNDTEEEERVSKLHVSRTDLVDWR 1099

BLAST of Sgr021830 vs. ExPASy TrEMBL
Match: A0A0A0KAQ5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G022310 PE=3 SV=1)

HSP 1 Score: 2107.8 bits (5460), Expect = 0.0e+00
Identity = 1017/1104 (92.12%), Positives = 1059/1104 (95.92%), Query Frame = 0

Query: 2    AFPLEQRPKPQVMDGEDGEDINIDVYNAGTPVRYLSLDHVYSTTSPFVSTSGSSNVMSKK 61
            AF L QRPKP ++DGEDG+DINIDVYNAGTP+RYLSLDHVYSTTSPFVSTSGSSNVMSKK
Sbjct: 3    AFSLHQRPKPPIVDGEDGDDINIDVYNAGTPIRYLSLDHVYSTTSPFVSTSGSSNVMSKK 62

Query: 62   VKARRLMVNRFDDLNFKPPRLLHVYSRRRKKPRHSSATPSFYDSLVEKVELGSKAVLKSE 121
            VKARRLMVN FDDLNFKPPRLLHVYSRRRKKPRHSSA+ S YDSLVE+VELGS  V++SE
Sbjct: 63   VKARRLMVNHFDDLNFKPPRLLHVYSRRRKKPRHSSASSSMYDSLVEQVELGSTTVMESE 122

Query: 122  AREMDEMVNSADDHADDFEVDRMPKKKKKKKDKFGFNELVKLEVDSSVFRAMNGPRLRDC 181
            A E DEMVN  D HA++FEVDR PK KKKK DKFG NELVKLEVDSSV R MNGPRLRDC
Sbjct: 123  ACETDEMVNGVDGHAEEFEVDRTPKNKKKKNDKFGCNELVKLEVDSSVIRTMNGPRLRDC 182

Query: 182  RTHNNNNLNVSNSGRRRKRNSSKISEKTVLKSPSAKRWVRLSFEDVDPKVYIGLQCKVYW 241
            RTH+NNN   +NSG+ +KRNSS+ISEKT  KSP+AKRWVRLSFEDVDPKVY+GLQCKVYW
Sbjct: 183  RTHSNNN---NNSGQSKKRNSSQISEKTTFKSPTAKRWVRLSFEDVDPKVYVGLQCKVYW 242

Query: 242  PLDADWYCGRVVGYTSETNRHHIEYEDDDKEDLILSNEKVKFYISGEEMQSLNLSFGVDS 301
            PLDA WYCGRVVGY SET+ HHIEYED D+EDL+LSNEKVKF+ISGEEMQ+LNL+FGVDS
Sbjct: 243  PLDAQWYCGRVVGYNSETSCHHIEYEDGDREDLVLSNEKVKFHISGEEMQTLNLNFGVDS 302

Query: 302  VDSDAYDYNEMLVLAASLDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNIS 361
            VDSDAYDYNEMLVLAA+LDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNIS
Sbjct: 303  VDSDAYDYNEMLVLAATLDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNIS 362

Query: 362  GGRTVPVQFFGTHDFARIKVKQAMSFLKGLLSSFHLKCKKPHFIRSLEEAKMYLSEQKLP 421
            GGRTVPVQFFGTHDFARIKVKQA+SFLKGLLS FH KCKKPHF+RSLEEAKMYLSEQKLP
Sbjct: 363  GGRTVPVQFFGTHDFARIKVKQAISFLKGLLSFFHQKCKKPHFMRSLEEAKMYLSEQKLP 422

Query: 422  PSMLQLQNGIEVDDFASASGEEEATTDSGEECLNE-GGMPCPHNGYGSSPFIVGDLEIVS 481
            PSMLQLQNGIEVDDFASASGEEE TTDSGEECLNE GG+ C  NGY  SPF VGDLEI+S
Sbjct: 423  PSMLQLQNGIEVDDFASASGEEEGTTDSGEECLNEGGGVRCALNGY-RSPFKVGDLEIIS 482

Query: 482  LGKIVKDSKYFQNDGSVWPEGYTAVRKFSSLTDPNVCTLYKMEVLRDFESKFRPLCRVTL 541
            LGKIVKDSKYFQNDGSVWPEGYTAVRKFSSLTDPNVCTLY+MEVLRDFESKFRPL RVTL
Sbjct: 483  LGKIVKDSKYFQNDGSVWPEGYTAVRKFSSLTDPNVCTLYRMEVLRDFESKFRPLFRVTL 542

Query: 542  DNGEQFKGSSPSACWNKIYKRMRKIQHISDASTEGRGETVYKSGSDMFGFSNPDVKKLIQ 601
            DNGEQFKGSSPSACWNKIYKRM+KIQH SDASTE +GE VYKSGSDMFGFSNPDVKKLIQ
Sbjct: 543  DNGEQFKGSSPSACWNKIYKRMKKIQHTSDASTETKGEFVYKSGSDMFGFSNPDVKKLIQ 602

Query: 602  GISKSGLSSSRSLSKVASKKYKDFPVGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDK 661
            GISKSGLSSSRSLSKVASKKYKDFP+GYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDK
Sbjct: 603  GISKSGLSSSRSLSKVASKKYKDFPIGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDK 662

Query: 662  CRMMVHARCYGELEPVDGVLWLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAHLAC 721
            CRMMVHARCYGELEPVDGV+WLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAHLAC
Sbjct: 663  CRMMVHARCYGELEPVDGVIWLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAHLAC 722

Query: 722  AIWIPETCLSDIKKMEPIDGLSRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCA 781
            AIWIPETCLSDIKKMEPIDGL+RINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCA
Sbjct: 723  AIWIPETCLSDIKKMEPIDGLNRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCA 782

Query: 782  RAAGLCVELEDDDRLHLLAADEDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCS 841
            RAAGLCVELE+DDRLHLLAADEDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCS
Sbjct: 783  RAAGLCVELEEDDRLHLLAADEDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCS 842

Query: 842  NYTPPCNPSGCARTEPYNYFGRRGRKAPEALAAASLKRLFVENQPYIASGYSQHLLSGNL 901
            NYTPPCNPSGCARTEPYNYF RRGRKAPEA+AAA+LKRLFVENQPYIASGYSQHLLSGNL
Sbjct: 843  NYTPPCNPSGCARTEPYNYFERRGRKAPEAVAAAALKRLFVENQPYIASGYSQHLLSGNL 902

Query: 902  LPSSGVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAKH 961
            LPSSGVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAKH
Sbjct: 903  LPSSGVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAKH 962

Query: 962  PHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLI 1021
            PHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLI
Sbjct: 963  PHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLI 1022

Query: 1022 NHSCEPNCYSRVITVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGV 1081
            NHSCEPNCYSRVI+VNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCG+PRCRGV
Sbjct: 1023 NHSCEPNCYSRVISVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGYPRCRGV 1082

Query: 1082 VNDTEEEERVAKLYVPRTDLVDWR 1105
            VNDT+EEERV+KL+V RTDLVDWR
Sbjct: 1083 VNDTDEEERVSKLHVSRTDLVDWR 1102

BLAST of Sgr021830 vs. ExPASy TrEMBL
Match: A0A6J1H5D6 (histone-lysine N-methyltransferase ATX2-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111460646 PE=4 SV=1)

HSP 1 Score: 2107.0 bits (5458), Expect = 0.0e+00
Identity = 1017/1104 (92.12%), Positives = 1059/1104 (95.92%), Query Frame = 0

Query: 1    MAFPLEQRPKPQVMDGEDGEDINIDVYNAGTPVRYLSLDHVYSTTSPFVSTSGSSNVMSK 60
            MAFPLEQRPKP ++DGEDG+DINIDVYNAGTP+RYLSLDHVYSTTSPFVSTSGSSNVMSK
Sbjct: 1    MAFPLEQRPKPPILDGEDGDDINIDVYNAGTPIRYLSLDHVYSTTSPFVSTSGSSNVMSK 60

Query: 61   KVKARRLMVNRFDDLNFKPPRLLHVYSRRRKKPRHSSATPSFYDSLVEKVELGSKAVLKS 120
            KVKARRL+VN FDDLNFKPPRLLHVYSRRRKKPRHSS + S YDSLVE+VELGSK V+KS
Sbjct: 61   KVKARRLLVNHFDDLNFKPPRLLHVYSRRRKKPRHSSVSSSVYDSLVEEVELGSKTVMKS 120

Query: 121  EAREMDEMVNSADDHADDFEVDRMPKKKKKKKDKFGFNELVKLEVDSSVFRAMNGPRLRD 180
            EA E+DEMVN  DD   +FEVDR P  KKKKKD FG NELVKLEV+SSV RAMNGPRLRD
Sbjct: 121  EAFEIDEMVNGVDDLVGEFEVDRTP--KKKKKDNFGCNELVKLEVNSSVIRAMNGPRLRD 180

Query: 181  CRTHNNNNLNVSNSGRRRKRNSSKISEKTVLKSPSAKRWVRLSFEDVDPKVYIGLQCKVY 240
            CRTH+NNN    NSGRR+KRNSS+ISEKT+ KSP+AKRWVRLSFEDVDPKVYIGLQCKVY
Sbjct: 181  CRTHSNNN---KNSGRRKKRNSSQISEKTMFKSPTAKRWVRLSFEDVDPKVYIGLQCKVY 240

Query: 241  WPLDADWYCGRVVGYTSETNRHHIEYEDDDKEDLILSNEKVKFYISGEEMQSLNLSFGVD 300
            WPLDADWY GRVVGY SET RH+IEYEDDDKEDL+LSNEKVKFYISGEEMQSLNLSFGVD
Sbjct: 241  WPLDADWYHGRVVGYDSETGRHNIEYEDDDKEDLVLSNEKVKFYISGEEMQSLNLSFGVD 300

Query: 301  SVDSDAYDYNEMLVLAASLDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNI 360
             +DSDAY+YNEMLVLAA+LDD LEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNI
Sbjct: 301  GIDSDAYEYNEMLVLAATLDDYLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNI 360

Query: 361  SGGRTVPVQFFGTHDFARIKVKQAMSFLKGLLSSFHLKCKKPHFIRSLEEAKMYLSEQKL 420
            SGGRTVPVQFFGTHDFARIKVKQA+SFLKGLLS FH KCKKPHFIRSLEEAKMYLSEQKL
Sbjct: 361  SGGRTVPVQFFGTHDFARIKVKQAISFLKGLLSFFHQKCKKPHFIRSLEEAKMYLSEQKL 420

Query: 421  PPSMLQLQNGIEVDDFASASGEEEATTDSGEECLNEGGMPCPHNGYGSSPFIVGDLEIVS 480
            PPSMLQLQNGIEVDDFASASGEEE TTDSGEECLNE GMPCP NGYGS PF+VGDLEI+S
Sbjct: 421  PPSMLQLQNGIEVDDFASASGEEEGTTDSGEECLNEAGMPCPPNGYGSCPFMVGDLEILS 480

Query: 481  LGKIVKDSKYFQNDGSVWPEGYTAVRKFSSLTDPNVCTLYKMEVLRDFESKFRPLCRVTL 540
            LGK+VK+SKYFQNDGSVWPEGYTAVRKFSSLTDPNV T YKMEVLRDFESKFRPL RVTL
Sbjct: 481  LGKVVKNSKYFQNDGSVWPEGYTAVRKFSSLTDPNVRTSYKMEVLRDFESKFRPLFRVTL 540

Query: 541  DNGEQFKGSSPSACWNKIYKRMRKIQHISDASTEGRGETVYKSGSDMFGFSNPDVKKLIQ 600
            DNGEQFKGSSPSACWNKIYKRMRKIQHISD S E +GE VYKSGSDMFGFSNPDVKKLIQ
Sbjct: 541  DNGEQFKGSSPSACWNKIYKRMRKIQHISDTSAEVKGEIVYKSGSDMFGFSNPDVKKLIQ 600

Query: 601  GISKSGLSSSRSLSKVASKKYKDFPVGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDK 660
            GISKSGLSSSRSL KVASKKYK+FP+GYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDK
Sbjct: 601  GISKSGLSSSRSLGKVASKKYKNFPIGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDK 660

Query: 661  CRMMVHARCYGELEPVDGVLWLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAHLAC 720
            CRMMVHARCYGELEPVDGVLWLCNLCRPGSPDC PPCCLCPVIGGAMKPTTDGRWAHLAC
Sbjct: 661  CRMMVHARCYGELEPVDGVLWLCNLCRPGSPDCLPPCCLCPVIGGAMKPTTDGRWAHLAC 720

Query: 721  AIWIPETCLSDIKKMEPIDGLSRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCA 780
            AIWIPETCLSD+KKMEPIDGL+RINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCA
Sbjct: 721  AIWIPETCLSDVKKMEPIDGLNRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCA 780

Query: 781  RAAGLCVELEDDDRLHLLAADEDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCS 840
            RAAGLCVELE+DDRLHLLAAD+DEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCS
Sbjct: 781  RAAGLCVELEEDDRLHLLAADDDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCS 840

Query: 841  NYTPPCNPSGCARTEPYNYFGRRGRKAPEALAAASLKRLFVENQPYIASGYSQHLLSGNL 900
            NYTPPCNPSGCARTEPYNYFGRRGRKAPEA+AAASLKRLFVENQP+IASGYSQHL SGNL
Sbjct: 841  NYTPPCNPSGCARTEPYNYFGRRGRKAPEAVAAASLKRLFVENQPHIASGYSQHLSSGNL 900

Query: 901  LPSSGVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAKH 960
            LPSSGVLG+KFSLQHLKTCQLDP+NILS+AEKYKFMRETFRKRLAFGKSGIHGFGIFAK+
Sbjct: 901  LPSSGVLGLKFSLQHLKTCQLDPQNILSMAEKYKFMRETFRKRLAFGKSGIHGFGIFAKY 960

Query: 961  PHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLI 1020
            PHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLI
Sbjct: 961  PHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLI 1020

Query: 1021 NHSCEPNCYSRVITVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGV 1080
            NHSCEPNCYSRVI+VNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGV
Sbjct: 1021 NHSCEPNCYSRVISVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGV 1080

Query: 1081 VNDTEEEERVAKLYVPRTDLVDWR 1105
            VNDTEEEERV+KL+V RTDLVDWR
Sbjct: 1081 VNDTEEEERVSKLHVSRTDLVDWR 1099

BLAST of Sgr021830 vs. ExPASy TrEMBL
Match: A0A1S3CLC3 (histone-lysine N-methyltransferase ATX2-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103502242 PE=3 SV=1)

HSP 1 Score: 2102.4 bits (5446), Expect = 0.0e+00
Identity = 1011/1104 (91.58%), Positives = 1058/1104 (95.83%), Query Frame = 0

Query: 2    AFPLEQRPKPQVMDGEDGEDINIDVYNAGTPVRYLSLDHVYSTTSPFVSTSGSSNVMSKK 61
            AF L QRPKP ++DGEDG+DINIDVYNAGTP+RYLSLDHVYSTTSPFVSTSGSSNVMSKK
Sbjct: 3    AFSLHQRPKPPIVDGEDGDDINIDVYNAGTPIRYLSLDHVYSTTSPFVSTSGSSNVMSKK 62

Query: 62   VKARRLMVNRFDDLNFKPPRLLHVYSRRRKKPRHSSATPSFYDSLVEKVELGSKAVLKSE 121
            VKARRL+VN FDDLNFKPPRLLHVYSRRRKK RHSSA+ S YDSLVE+VELGS  V++SE
Sbjct: 63   VKARRLVVNHFDDLNFKPPRLLHVYSRRRKKARHSSASSSMYDSLVEQVELGSTTVMESE 122

Query: 122  AREMDEMVNSADDHADDFEVDRMPKKKKKKKDKFGFNELVKLEVDSSVFRAMNGPRLRDC 181
            A E DEM N  DDHA++FEVDR PK KKK+ DKFG NELVKLEVDSSV RAMNGPRLRDC
Sbjct: 123  ACETDEMENGVDDHAEEFEVDRSPKNKKKRTDKFGCNELVKLEVDSSVIRAMNGPRLRDC 182

Query: 182  RTHNNNNLNVSNSGRRRKRNSSKISEKTVLKSPSAKRWVRLSFEDVDPKVYIGLQCKVYW 241
            RT +NNN   +NSG+R+KRNSS+ISEK + KSP+AKRWVRLSFEDVDPKVY+GLQCKVYW
Sbjct: 183  RTPSNNN---NNSGQRKKRNSSQISEKIMFKSPTAKRWVRLSFEDVDPKVYVGLQCKVYW 242

Query: 242  PLDADWYCGRVVGYTSETNRHHIEYEDDDKEDLILSNEKVKFYISGEEMQSLNLSFGVDS 301
            PLDA WYCGRVVGY SET+ HHIEYED D+EDLILSNEKVKF+ISGEEMQ+LNL+FGVDS
Sbjct: 243  PLDAQWYCGRVVGYNSETSSHHIEYEDGDREDLILSNEKVKFHISGEEMQTLNLNFGVDS 302

Query: 302  VDSDAYDYNEMLVLAASLDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNIS 361
            VDSDAYDYNEMLVLAA+LDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNIS
Sbjct: 303  VDSDAYDYNEMLVLAATLDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNIS 362

Query: 362  GGRTVPVQFFGTHDFARIKVKQAMSFLKGLLSSFHLKCKKPHFIRSLEEAKMYLSEQKLP 421
            GGRTVPVQFFGTHDFARIKVKQA+SFLKGLLS FH KCKKPHF+RSLEEAKMYLSEQKLP
Sbjct: 363  GGRTVPVQFFGTHDFARIKVKQAISFLKGLLSFFHQKCKKPHFMRSLEEAKMYLSEQKLP 422

Query: 422  PSMLQLQNGIEVDDFASASGEEEATTDSGEECLNE-GGMPCPHNGYGSSPFIVGDLEIVS 481
            PSMLQLQNGIEVDDFASASGEEE TTDSGEECLNE GGM C  NGY +SPF VGDLEI+S
Sbjct: 423  PSMLQLQNGIEVDDFASASGEEEGTTDSGEECLNEGGGMRCALNGYRASPFKVGDLEIIS 482

Query: 482  LGKIVKDSKYFQNDGSVWPEGYTAVRKFSSLTDPNVCTLYKMEVLRDFESKFRPLCRVTL 541
            LGKIVKDSKYFQNDGSVWPEGYTAVRKFSS+TDPNVCTLY+MEVLRDFESKFRPL RVTL
Sbjct: 483  LGKIVKDSKYFQNDGSVWPEGYTAVRKFSSITDPNVCTLYRMEVLRDFESKFRPLFRVTL 542

Query: 542  DNGEQFKGSSPSACWNKIYKRMRKIQHISDASTEGRGETVYKSGSDMFGFSNPDVKKLIQ 601
            DNGEQFKGSSPSACWNKIYKRM+KIQH SDA TE +GE V+KSGSDMFGFSNPDVKKLIQ
Sbjct: 543  DNGEQFKGSSPSACWNKIYKRMKKIQHTSDACTESKGEFVFKSGSDMFGFSNPDVKKLIQ 602

Query: 602  GISKSGLSSSRSLSKVASKKYKDFPVGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDK 661
            GISKSGLSSSR LSKVASKKYKDFP+GYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDK
Sbjct: 603  GISKSGLSSSRFLSKVASKKYKDFPIGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDK 662

Query: 662  CRMMVHARCYGELEPVDGVLWLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAHLAC 721
            CRMMVHARCYGELEPVDGV+WLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAHLAC
Sbjct: 663  CRMMVHARCYGELEPVDGVIWLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAHLAC 722

Query: 722  AIWIPETCLSDIKKMEPIDGLSRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCA 781
            AIWIPETCLSDIKKMEPIDGL+RINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCA
Sbjct: 723  AIWIPETCLSDIKKMEPIDGLNRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCA 782

Query: 782  RAAGLCVELEDDDRLHLLAADEDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCS 841
            RAAGLCVELE+DDRLHLLAADEDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCS
Sbjct: 783  RAAGLCVELEEDDRLHLLAADEDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCS 842

Query: 842  NYTPPCNPSGCARTEPYNYFGRRGRKAPEALAAASLKRLFVENQPYIASGYSQHLLSGNL 901
            NYTPPCNPSGCARTEPYNYFGRRGRK PEA+AAASLKRLFVENQPYIASGYSQHLLSGNL
Sbjct: 843  NYTPPCNPSGCARTEPYNYFGRRGRKEPEAVAAASLKRLFVENQPYIASGYSQHLLSGNL 902

Query: 902  LPSSGVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAKH 961
            LPSSGVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAKH
Sbjct: 903  LPSSGVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAKH 962

Query: 962  PHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLI 1021
            PHRAGDMVIEYTGE+VRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLI
Sbjct: 963  PHRAGDMVIEYTGEVVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLI 1022

Query: 1022 NHSCEPNCYSRVITVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGV 1081
            NHSCEPNCYSRV++VNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCG+PRCRGV
Sbjct: 1023 NHSCEPNCYSRVLSVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGYPRCRGV 1082

Query: 1082 VNDTEEEERVAKLYVPRTDLVDWR 1105
            VNDT+EEERV+KL+V RTDLVDWR
Sbjct: 1083 VNDTDEEERVSKLHVSRTDLVDWR 1103

BLAST of Sgr021830 vs. ExPASy TrEMBL
Match: A0A6J1D999 (histone-lysine N-methyltransferase ATX2 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111018552 PE=4 SV=1)

HSP 1 Score: 2100.5 bits (5441), Expect = 0.0e+00
Identity = 1024/1103 (92.84%), Positives = 1053/1103 (95.47%), Query Frame = 0

Query: 2    AFPLEQRPKPQVMDGEDGEDINIDVYNAGTPVRYLSLDHVYSTTSPFVSTSGSSNVMSKK 61
            A PLE+R KP +MDGEDG+DINIDVYNAGTPVRYLSLDHVYSTTSPFVSTSGSSNVMSKK
Sbjct: 3    ALPLERRLKPPIMDGEDGDDINIDVYNAGTPVRYLSLDHVYSTTSPFVSTSGSSNVMSKK 62

Query: 62   VKARRLMVNRFDDLNFKPPRLLHVYSRRRKKPRHSSATPSFYDSLVEKVELGSKAVLKSE 121
            VKARRL+VN FDDLNFKPPRLLHVYSRRRKKPRHS          VE VELG    +KSE
Sbjct: 63   VKARRLVVNHFDDLNFKPPRLLHVYSRRRKKPRHSP---------VENVELGP---VKSE 122

Query: 122  AREMDEMVNSADDHADDFEVDRMPKKKKKKKDKFGFNELVKLEVDSSVFRAMNGPRLRDC 181
             REMDEMVN  D+HA +FEV+RM +KKKKK+DKFG NELVKLEVDSSV RAMNGPRLRDC
Sbjct: 123  VREMDEMVNGDDEHAGEFEVERMSQKKKKKRDKFGCNELVKLEVDSSVIRAMNGPRLRDC 182

Query: 182  RTHNNNNLNVSNSGRRRKRNSSKISEKTVLKSPSAKRWVRLSFEDVDPKVYIGLQCKVYW 241
            RTHNNN+   SNSGRRRKRNSS ++EKT+ KSPS+KRWVRLSFEDVDPKVYIGLQCK+YW
Sbjct: 183  RTHNNNS---SNSGRRRKRNSSGMAEKTIFKSPSSKRWVRLSFEDVDPKVYIGLQCKIYW 242

Query: 242  PLDADWYCGRVVGYTSETNRHHIEYEDDDKEDLILSNEKVKFYISGEEMQSLNLSFGVDS 301
            PLDADWY GRVVGY SETNRHHIEYED DKEDLILSNEKVKFYISGEEMQSLNLSFGVDS
Sbjct: 243  PLDADWYSGRVVGYDSETNRHHIEYEDGDKEDLILSNEKVKFYISGEEMQSLNLSFGVDS 302

Query: 302  VDSDAYDYNEMLVLAASLDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNIS 361
            VDSDAYDYNEMLVLAASLDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNIS
Sbjct: 303  VDSDAYDYNEMLVLAASLDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNIS 362

Query: 362  GGRTVPVQFFGTHDFARIKVKQAMSFLKGLLSSFHLKCKKPHFIRSLEEAKMYLSEQKLP 421
            GGRTVPVQFFGTHDFARIKVKQA+SFLKGLLSSFHLKCKKPHFIRSLEEAKMYLSEQKLP
Sbjct: 363  GGRTVPVQFFGTHDFARIKVKQAISFLKGLLSSFHLKCKKPHFIRSLEEAKMYLSEQKLP 422

Query: 422  PSMLQLQNGIEVDDFASASGEEEATTDSGEECLNEGGMPCPHNGYGSSPFIVGDLEIVSL 481
            PSMLQLQNGIEVDD ASASGEEE TTDSGEECLNEGGMP P NGYGSSPFIVGDLEIVSL
Sbjct: 423  PSMLQLQNGIEVDDSASASGEEEVTTDSGEECLNEGGMPRPPNGYGSSPFIVGDLEIVSL 482

Query: 482  GKIVKDSKYFQNDGSVWPEGYTAVRKFSSLTDPNVCTLYKMEVLRDFESKFRPLCRVTLD 541
            GKIVKDSKYFQNDGSVWPEGYTAVRKFSSLTDPNV TLYKMEVLRD+ESKFRPL RVTLD
Sbjct: 483  GKIVKDSKYFQNDGSVWPEGYTAVRKFSSLTDPNVRTLYKMEVLRDYESKFRPLFRVTLD 542

Query: 542  NGEQFKGSSPSACWNKIYKRMRKIQHISDASTEGRGETVYKSGSDMFGFSNPDVKKLIQG 601
            NGEQFKGSSPSACWNKIYKRM+KIQH SDASTEG GE VYKSGSDMFGFSNPDVKKLI+G
Sbjct: 543  NGEQFKGSSPSACWNKIYKRMKKIQHTSDASTEGGGEIVYKSGSDMFGFSNPDVKKLIKG 602

Query: 602  ISKSGLSSSRSLSKVASKKYKDFPVGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDKC 661
            ISKSGLSSSRSLSKV SKKYKDFPVGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDKC
Sbjct: 603  ISKSGLSSSRSLSKVTSKKYKDFPVGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDKC 662

Query: 662  RMMVHARCYGELEPVDGVLWLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAHLACA 721
            RMMVHARCYGELEPVDGVLWLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAHLACA
Sbjct: 663  RMMVHARCYGELEPVDGVLWLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAHLACA 722

Query: 722  IWIPETCLSDIKKMEPIDGLSRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCAR 781
            IWIPETCLSDIKKMEPIDGL+RI KDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCAR
Sbjct: 723  IWIPETCLSDIKKMEPIDGLNRITKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCAR 782

Query: 782  AAGLCVELEDDDRLHLLAADEDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCSN 841
            AAGLCVELE+DDRLHLLAADEDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQ+GQQCSN
Sbjct: 783  AAGLCVELEEDDRLHLLAADEDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQSGQQCSN 842

Query: 842  YTPPCNPSGCARTEPYNYFGRRGRKAPEALAAASLKRLFVENQPYIASGYSQHLLSGNLL 901
            YTPPCNPSG ARTEPYNYFGRRGRKAPEALAAASLKRLFVENQPYI SGYSQHLLSGNLL
Sbjct: 843  YTPPCNPSGSARTEPYNYFGRRGRKAPEALAAASLKRLFVENQPYIVSGYSQHLLSGNLL 902

Query: 902  PSSGVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAKHP 961
            PS+GVLG+KFSLQ+LKT QLDPRNILSVA+KYKFMRETFRKRLAFGKSGIHGFGIFAKHP
Sbjct: 903  PSTGVLGLKFSLQNLKTSQLDPRNILSVADKYKFMRETFRKRLAFGKSGIHGFGIFAKHP 962

Query: 962  HRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLIN 1021
            HRAGDMVIEY+GEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLIN
Sbjct: 963  HRAGDMVIEYSGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLIN 1022

Query: 1022 HSCEPNCYSRVITVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGVV 1081
            HSCEPNCYSRVI+VNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGVV
Sbjct: 1023 HSCEPNCYSRVISVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGVV 1082

Query: 1082 NDTEEEERVAKLYVPRTDLVDWR 1105
            ND EEEERVAKLYV RTDLVDWR
Sbjct: 1083 NDMEEEERVAKLYVSRTDLVDWR 1090

BLAST of Sgr021830 vs. TAIR 10
Match: AT1G05830.1 (trithorax-like protein 2)

HSP 1 Score: 1325.5 bits (3429), Expect = 0.0e+00
Identity = 676/1096 (61.68%), Positives = 808/1096 (73.72%), Query Frame = 0

Query: 17   EDGEDINIDV----YNAGTPVRYLSLDHVYSTTSPFVSTSGSSNVMS-KKVKARRL-MVN 76
            E+GED  I      + A  PVRY SL+ VYS +S   S    +   S KKV A +L M +
Sbjct: 11   EEGEDTQIKTELHDHAADNPVRYASLESVYSVSSSSSSLCCKTAAGSHKKVNALKLPMSD 70

Query: 77   RFDDLNFKPPRLLHVYSRRRKKPRHSSATPSFYDSLVEKVELGSKAVLKSEAREMDEMVN 136
             F+    + P ++HVY RR+++ R         +S +E       A+L++E  E D+ + 
Sbjct: 71   SFELQPHRRPEIVHVYCRRKRRRRRRR------ESFLE------LAILQNEGVERDDRIV 130

Query: 137  SADDHADDFEVDRMPKKKKKKKDKFGFNELVKLEVDSSVFRAMNGPRLRDCRTHNNNNLN 196
              +    D E +   KKKK+KK + G  EL+KL VDS+       P LR CR     + N
Sbjct: 131  KIESAELDDEKEEENKKKKQKKRRIGNGELMKLGVDSTTLSVSATPPLRGCRIKAVCSGN 190

Query: 197  VSNSGRRRKRNSSKISEKTVLKSPSAKRWVRLSFEDVDPKVYIGLQCKVYWPLDADWYCG 256
              +   R KRN+ K  EK V  S +AK+WVRLS++ VDPK +IGLQCKV+WPLDA WY G
Sbjct: 191  KQDGSSRSKRNTVKNQEKVVTASATAKKWVRLSYDGVDPKHFIGLQCKVFWPLDAVWYPG 250

Query: 257  RVVGYTSETNRHHIEYEDDDKEDLILSNEKVKFYISGEEMQSLNLSFGVDSVDSDAYDYN 316
             +VGY  ET  H ++Y D D E+L L  EK+KF IS ++M+ LN+ FG + V  D  DY+
Sbjct: 251  SIVGYNVETKHHIVKYGDGDGEELALRREKIKFLISRDDMELLNMKFGTNDVVVDGQDYD 310

Query: 317  EMLVLAASLDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRN-ISGGRTVPVQ 376
            E+++LAAS ++C + EP DI+WAKLTGHAMWPAIIVDES+I  RKGL N ISGGR+V VQ
Sbjct: 311  ELVILAASFEECQDFEPRDIIWAKLTGHAMWPAIIVDESVIVKRKGLNNKISGGRSVLVQ 370

Query: 377  FFGTHDFARIKVKQAMSFLKGLLSSFHLKCKKPHFIRSLEEAKMYLSEQKLPPSMLQLQN 436
            FFGTHDFARI+VKQA+SFLKGLLS   LKCK+P F  ++EEAKMYL E KLP  M QLQ 
Sbjct: 371  FFGTHDFARIQVKQAVSFLKGLLSRSPLKCKQPRFEEAMEEAKMYLKEYKLPGRMDQLQK 430

Query: 437  GIEVDDFASASGEEEATTDSGEECLNEGGMPCPHNGYGSSPFIVGDLEIVSLGKIVKDSK 496
              + D     +  EE +++SG++   +G +       G     +GDL+I++LG+IV DS+
Sbjct: 431  VADTDCSERINSGEEDSSNSGDDYTKDGEVWLRPTELGDCLHRIGDLQIINLGRIVTDSE 490

Query: 497  YFQNDGSVWPEGYTAVRKFSSLTDPNVCTLYKMEVLRDFESKFRPLCRVTLDNGEQFKGS 556
            +F++    WPEGYTA RKF SL DPN   +YKMEVLRD ESK RP+ RVT ++GEQFKG 
Sbjct: 491  FFKDSKHTWPEGYTATRKFISLKDPNASAMYKMEVLRDAESKTRPVFRVTTNSGEQFKGD 550

Query: 557  SPSACWNKIYKRMRKIQHISDASTEGRGETVYKSGSDMFGFSNPDVKKLIQGISKSGLSS 616
            +PSACWNKIY R++KIQ  SD + +  GE +++SG+DMFGFSNP+V KLIQG+ +S   S
Sbjct: 551  TPSACWNKIYNRIKKIQIASD-NPDVLGEGLHESGTDMFGFSNPEVDKLIQGLLQSRPPS 610

Query: 617  SRSLSKVASKKYKDFPVGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDKCRMMVHARC 676
              S  K +S KY+D P GYRPVRV+WKDLDKC+VCHMDEEYENNLFLQCDKCRMMVH RC
Sbjct: 611  KVSQRKYSSGKYQDHPTGYRPVRVEWKDLDKCNVCHMDEEYENNLFLQCDKCRMMVHTRC 670

Query: 677  YGELEPVDGVLWLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAHLACAIWIPETCL 736
            YG+LEP +G+LWLCNLCRP + D PP CCLCPV+GGAMKPTTDGRWAHLACAIWIPETCL
Sbjct: 671  YGQLEPHNGILWLCNLCRPVALDIPPRCCLCPVVGGAMKPTTDGRWAHLACAIWIPETCL 730

Query: 737  SDIKKMEPIDGLSRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCARAAGLCVEL 796
             D+KKMEPIDG+ +++KDRWKLLCSICGVSYGACIQCSNNTC VAYHPLCARAAGLCVEL
Sbjct: 731  LDVKKMEPIDGVKKVSKDRWKLLCSICGVSYGACIQCSNNTCRVAYHPLCARAAGLCVEL 790

Query: 797  EDDDRLHLLAADEDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCSNYTPPCNPS 856
             D+DRL LL+ D+DE DQCIRLLSFCK+HR  SN  L  E  I  A    + Y PP NPS
Sbjct: 791  ADEDRLFLLSMDDDEADQCIRLLSFCKRHRQTSNYHLETEYMIKPA-HNIAEYLPPPNPS 850

Query: 857  GCARTEPYNYFGRRGRKAPEALAAASLKRLFVENQPYIASGYSQHLLSGNLLPSSGVLGM 916
            GCARTEPYNY GRRGRK PEALA AS KRLFVENQPYI  GYS+H  S        + G 
Sbjct: 851  GCARTEPYNYLGRRGRKEPEALAGASSKRLFVENQPYIVGGYSRHEFS----TYERIYGS 910

Query: 917  KFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVI 976
            K S   + T    P NILS+AEKY FM+ET+RKRLAFGKSGIHGFGIFAK PHRAGDMVI
Sbjct: 911  KMS--QITT----PSNILSMAEKYTFMKETYRKRLAFGKSGIHGFGIFAKLPHRAGDMVI 970

Query: 977  EYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCY 1036
            EYTGE+VRPPIAD+RE  IYN +VGAGTYMFRID+ERVIDATR GSIAHLINHSCEPNCY
Sbjct: 971  EYTGELVRPPIADKREHLIYNSMVGAGTYMFRIDNERVIDATRTGSIAHLINHSCEPNCY 1030

Query: 1037 SRVITVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGVVNDTEEEER 1096
            SRVI+VNGDEHIIIFAKRD+ +WEELTYDYRFFSIDE+LACYCGFPRCRGVVNDTE EER
Sbjct: 1031 SRVISVNGDEHIIIFAKRDVAKWEELTYDYRFFSIDERLACYCGFPRCRGVVNDTEAEER 1082

Query: 1097 VAKLYVPRTDLVDWRE 1106
             A ++  R +L +W E
Sbjct: 1091 QANIHASRCELKEWTE 1082

BLAST of Sgr021830 vs. TAIR 10
Match: AT1G05830.2 (trithorax-like protein 2)

HSP 1 Score: 1325.5 bits (3429), Expect = 0.0e+00
Identity = 676/1096 (61.68%), Positives = 808/1096 (73.72%), Query Frame = 0

Query: 17   EDGEDINIDV----YNAGTPVRYLSLDHVYSTTSPFVSTSGSSNVMS-KKVKARRL-MVN 76
            E+GED  I      + A  PVRY SL+ VYS +S   S    +   S KKV A +L M +
Sbjct: 11   EEGEDTQIKTELHDHAADNPVRYASLESVYSVSSSSSSLCCKTAAGSHKKVNALKLPMSD 70

Query: 77   RFDDLNFKPPRLLHVYSRRRKKPRHSSATPSFYDSLVEKVELGSKAVLKSEAREMDEMVN 136
             F+    + P ++HVY RR+++ R         +S +E       A+L++E  E D+ + 
Sbjct: 71   SFELQPHRRPEIVHVYCRRKRRRRRRR------ESFLE------LAILQNEGVERDDRIV 130

Query: 137  SADDHADDFEVDRMPKKKKKKKDKFGFNELVKLEVDSSVFRAMNGPRLRDCRTHNNNNLN 196
              +    D E +   KKKK+KK + G  EL+KL VDS+       P LR CR     + N
Sbjct: 131  KIESAELDDEKEEENKKKKQKKRRIGNGELMKLGVDSTTLSVSATPPLRGCRIKAVCSGN 190

Query: 197  VSNSGRRRKRNSSKISEKTVLKSPSAKRWVRLSFEDVDPKVYIGLQCKVYWPLDADWYCG 256
              +   R KRN+ K  EK V  S +AK+WVRLS++ VDPK +IGLQCKV+WPLDA WY G
Sbjct: 191  KQDGSSRSKRNTVKNQEKVVTASATAKKWVRLSYDGVDPKHFIGLQCKVFWPLDAVWYPG 250

Query: 257  RVVGYTSETNRHHIEYEDDDKEDLILSNEKVKFYISGEEMQSLNLSFGVDSVDSDAYDYN 316
             +VGY  ET  H ++Y D D E+L L  EK+KF IS ++M+ LN+ FG + V  D  DY+
Sbjct: 251  SIVGYNVETKHHIVKYGDGDGEELALRREKIKFLISRDDMELLNMKFGTNDVVVDGQDYD 310

Query: 317  EMLVLAASLDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRN-ISGGRTVPVQ 376
            E+++LAAS ++C + EP DI+WAKLTGHAMWPAIIVDES+I  RKGL N ISGGR+V VQ
Sbjct: 311  ELVILAASFEECQDFEPRDIIWAKLTGHAMWPAIIVDESVIVKRKGLNNKISGGRSVLVQ 370

Query: 377  FFGTHDFARIKVKQAMSFLKGLLSSFHLKCKKPHFIRSLEEAKMYLSEQKLPPSMLQLQN 436
            FFGTHDFARI+VKQA+SFLKGLLS   LKCK+P F  ++EEAKMYL E KLP  M QLQ 
Sbjct: 371  FFGTHDFARIQVKQAVSFLKGLLSRSPLKCKQPRFEEAMEEAKMYLKEYKLPGRMDQLQK 430

Query: 437  GIEVDDFASASGEEEATTDSGEECLNEGGMPCPHNGYGSSPFIVGDLEIVSLGKIVKDSK 496
              + D     +  EE +++SG++   +G +       G     +GDL+I++LG+IV DS+
Sbjct: 431  VADTDCSERINSGEEDSSNSGDDYTKDGEVWLRPTELGDCLHRIGDLQIINLGRIVTDSE 490

Query: 497  YFQNDGSVWPEGYTAVRKFSSLTDPNVCTLYKMEVLRDFESKFRPLCRVTLDNGEQFKGS 556
            +F++    WPEGYTA RKF SL DPN   +YKMEVLRD ESK RP+ RVT ++GEQFKG 
Sbjct: 491  FFKDSKHTWPEGYTATRKFISLKDPNASAMYKMEVLRDAESKTRPVFRVTTNSGEQFKGD 550

Query: 557  SPSACWNKIYKRMRKIQHISDASTEGRGETVYKSGSDMFGFSNPDVKKLIQGISKSGLSS 616
            +PSACWNKIY R++KIQ  SD + +  GE +++SG+DMFGFSNP+V KLIQG+ +S   S
Sbjct: 551  TPSACWNKIYNRIKKIQIASD-NPDVLGEGLHESGTDMFGFSNPEVDKLIQGLLQSRPPS 610

Query: 617  SRSLSKVASKKYKDFPVGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDKCRMMVHARC 676
              S  K +S KY+D P GYRPVRV+WKDLDKC+VCHMDEEYENNLFLQCDKCRMMVH RC
Sbjct: 611  KVSQRKYSSGKYQDHPTGYRPVRVEWKDLDKCNVCHMDEEYENNLFLQCDKCRMMVHTRC 670

Query: 677  YGELEPVDGVLWLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAHLACAIWIPETCL 736
            YG+LEP +G+LWLCNLCRP + D PP CCLCPV+GGAMKPTTDGRWAHLACAIWIPETCL
Sbjct: 671  YGQLEPHNGILWLCNLCRPVALDIPPRCCLCPVVGGAMKPTTDGRWAHLACAIWIPETCL 730

Query: 737  SDIKKMEPIDGLSRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCARAAGLCVEL 796
             D+KKMEPIDG+ +++KDRWKLLCSICGVSYGACIQCSNNTC VAYHPLCARAAGLCVEL
Sbjct: 731  LDVKKMEPIDGVKKVSKDRWKLLCSICGVSYGACIQCSNNTCRVAYHPLCARAAGLCVEL 790

Query: 797  EDDDRLHLLAADEDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCSNYTPPCNPS 856
             D+DRL LL+ D+DE DQCIRLLSFCK+HR  SN  L  E  I  A    + Y PP NPS
Sbjct: 791  ADEDRLFLLSMDDDEADQCIRLLSFCKRHRQTSNYHLETEYMIKPA-HNIAEYLPPPNPS 850

Query: 857  GCARTEPYNYFGRRGRKAPEALAAASLKRLFVENQPYIASGYSQHLLSGNLLPSSGVLGM 916
            GCARTEPYNY GRRGRK PEALA AS KRLFVENQPYI  GYS+H  S        + G 
Sbjct: 851  GCARTEPYNYLGRRGRKEPEALAGASSKRLFVENQPYIVGGYSRHEFS----TYERIYGS 910

Query: 917  KFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVI 976
            K S   + T    P NILS+AEKY FM+ET+RKRLAFGKSGIHGFGIFAK PHRAGDMVI
Sbjct: 911  KMS--QITT----PSNILSMAEKYTFMKETYRKRLAFGKSGIHGFGIFAKLPHRAGDMVI 970

Query: 977  EYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCY 1036
            EYTGE+VRPPIAD+RE  IYN +VGAGTYMFRID+ERVIDATR GSIAHLINHSCEPNCY
Sbjct: 971  EYTGELVRPPIADKREHLIYNSMVGAGTYMFRIDNERVIDATRTGSIAHLINHSCEPNCY 1030

Query: 1037 SRVITVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGVVNDTEEEER 1096
            SRVI+VNGDEHIIIFAKRD+ +WEELTYDYRFFSIDE+LACYCGFPRCRGVVNDTE EER
Sbjct: 1031 SRVISVNGDEHIIIFAKRDVAKWEELTYDYRFFSIDERLACYCGFPRCRGVVNDTEAEER 1082

Query: 1097 VAKLYVPRTDLVDWRE 1106
             A ++  R +L +W E
Sbjct: 1091 QANIHASRCELKEWTE 1082

BLAST of Sgr021830 vs. TAIR 10
Match: AT2G31650.1 (homologue of trithorax)

HSP 1 Score: 1282.7 bits (3318), Expect = 0.0e+00
Identity = 660/1106 (59.67%), Positives = 806/1106 (72.88%), Query Frame = 0

Query: 22   INIDVYN-AGTPVRYLSLDHVYSTTSP---FVSTSGSSNVMSKKVKARRL-MVNRFD--- 81
            I IDV++    P+RY S++ +YS  S     V+  GS ++MSKKVKA++L M+ +F+   
Sbjct: 10   IEIDVHDLVEAPIRYDSIESIYSIPSSALCCVNAVGSHSLMSKKVKAQKLPMIEQFEIEG 69

Query: 82   ---------------DLNFKPPRLLHVYSRRRKKPRHSSATPSFYDSLVEKVELGSKAVL 141
                            L  + P ++ VY RRRK+P             + +  L     +
Sbjct: 70   SGVSASDDCCRSDDYKLRIQRPEIVRVYYRRRKRP-------------LRECLLDQAVAV 129

Query: 142  KSEAREMDEMVNSADDHADDFEVDRMPKKKKKKKDKFGFNELVKLEVDSSVFRAMNGPRL 201
            K+E+ E+DE+        D FE        +KK+ K G  ELVK          M    L
Sbjct: 130  KTESVELDEI--------DCFE--------EKKRRKIGNCELVK--------SGMESIGL 189

Query: 202  RDCRTHNNNNLNVSNSGRRRKRNSSKISEKTVLKSPSAKRWVRLSFEDVDPKVYIGLQCK 261
            R C+ +N  + N  N   RRK +SSK  +K  L S SAK+WVRLS++ VDP  +IGLQCK
Sbjct: 190  RRCKENNAFSGNKQNGSSRRKGSSSKNQDKATLASRSAKKWVRLSYDGVDPTSFIGLQCK 249

Query: 262  VYWPLDADWYCGRVVGYTSETNRHHIEYEDDDKEDLILSNEKVKFYISGEEMQSLNLSFG 321
            V+WPLDA WY G +VGY++E  R+ ++Y D   ED++   E +KF +S EEM+ L+L F 
Sbjct: 250  VFWPLDALWYEGSIVGYSAERKRYTVKYRDGCDEDIVFDREMIKFLVSREEMELLHLKFC 309

Query: 322  VDSVDSDAYDYNEMLVLAASLDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLR 381
              +V  D  DY+EM+VLAA+LD+C + EPGDIVWAKL GHAMWPA+IVDES+IG+RKGL 
Sbjct: 310  TSNVTVDGRDYDEMVVLAATLDECQDFEPGDIVWAKLAGHAMWPAVIVDESIIGERKGLN 369

Query: 382  N-ISGGRTVPVQFFGTHDFARIKVKQAMSFLKGLLSSFHLKCKKPHFIRSLEEAKMYLSE 441
            N +SGG ++ VQFFGTHDFARIKVKQA+SF+KGLLS  HLKCK+P F   ++EAKMYL  
Sbjct: 370  NKVSGGGSLLVQFFGTHDFARIKVKQAISFIKGLLSPSHLKCKQPRFEEGMQEAKMYLKA 429

Query: 442  QKLPPSMLQLQNGIEVDDFASASGEEEATTDSGEECLNEGGMPCPHNGYGSSPFIVGDLE 501
             +LP  M QLQ G +  D   A+  EE   +SG + LN+G +      +     I+GDL 
Sbjct: 430  HRLPERMSQLQKGADSVDSDMANSTEEG--NSGGDLLNDGEVWLRPTEHVDFRHIIGDLL 489

Query: 502  IVSLGKIVKDSKYFQNDGSVWPEGYTAVRKFSSLTDPNVCTLYKMEVLRDFESKFRPLCR 561
            I++LGK+V DS++F+++  +WPEGYTA+RKF+SLTD +   LYKMEVLRD E+K  PL  
Sbjct: 490  IINLGKVVTDSQFFKDENHIWPEGYTAMRKFTSLTDHSASALYKMEVLRDAETKTHPLFI 549

Query: 562  VTLDNGEQFKGSSPSACWNKIYKRMRKIQHISDASTEGRGETVYKSGSDMFGFSNPDVKK 621
            VT D+GEQFKG +PSACWNKIY R++K+Q+ SD S    GE +  SG+DMFG SNP+V K
Sbjct: 550  VTADSGEQFKGPTPSACWNKIYNRIKKVQN-SD-SPNILGEELNGSGTDMFGLSNPEVIK 609

Query: 622  LIQGISKSGLSSSRSLSKVASKKYKDFPVGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQ 681
            L+Q +SKS  SS  S+ K +  ++++ P GYRPVRVDWKDLDKC+VCHMDEEYENNLFLQ
Sbjct: 610  LVQDLSKSRPSSHVSMCKNSLGRHQNQPTGYRPVRVDWKDLDKCNVCHMDEEYENNLFLQ 669

Query: 682  CDKCRMMVHARCYGELEPVDGVLWLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAH 741
            CDKCRMMVHA+CYGELEP DG LWLCNLCRPG+PD PP CCLCPV+GGAMKPTTDGRWAH
Sbjct: 670  CDKCRMMVHAKCYGELEPCDGALWLCNLCRPGAPDMPPRCCLCPVVGGAMKPTTDGRWAH 729

Query: 742  LACAIWIPETCLSDIKKMEPIDGLSRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHP 801
            LACAIWIPETCLSD+KKMEPIDG+++++KDRWKL+C+ICGVSYGACIQCSNN+C VAYHP
Sbjct: 730  LACAIWIPETCLSDVKKMEPIDGVNKVSKDRWKLMCTICGVSYGACIQCSNNSCRVAYHP 789

Query: 802  LCARAAGLCVELEDDDRLHLLAADEDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQ 861
            LCARAAGLCVELE+D     ++ + +E DQCIR+LSFCK+HR  S   L +EDRI  A  
Sbjct: 790  LCARAAGLCVELEND-----MSVEGEEADQCIRMLSFCKRHRQTSTACLGSEDRIKSATH 849

Query: 862  QCSNYTPPCNPSGCARTEPYNYFGRRGRKAPEALAAASLKRLFVENQPYIASGYSQHLLS 921
            + S Y PP NPSGCARTEPYN FGRRGRK PEALAAAS KRLFVENQPY+  GYS+   S
Sbjct: 850  KTSEYLPPPNPSGCARTEPYNCFGRRGRKEPEALAAASSKRLFVENQPYVIGGYSRLEFS 909

Query: 922  GNLLPSSGVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIF 981
                    + G K S  +       P NILS+AEKY++MRET+RKRLAFGKSGIHGFGIF
Sbjct: 910  ----TYKSIHGSKVSQMN------TPSNILSMAEKYRYMRETYRKRLAFGKSGIHGFGIF 969

Query: 982  AKHPHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIA 1041
            AK PHRAGDM+IEYTGE+VRP IAD+RE+ IYN +VGAGTYMFRIDDERVIDATR GSIA
Sbjct: 970  AKLPHRAGDMMIEYTGELVRPSIADKREQLIYNSMVGAGTYMFRIDDERVIDATRTGSIA 1029

Query: 1042 HLINHSCEPNCYSRVITVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRC 1101
            HLINHSC PNCYSRVITVNGDEHIIIFAKR I +WEELTYDYRFFSI E+L+C CGFP C
Sbjct: 1030 HLINHSCVPNCYSRVITVNGDEHIIIFAKRHIPKWEELTYDYRFFSIGERLSCSCGFPGC 1059

Query: 1102 RGVVNDTEEEERVAKLYVPRTDLVDW 1104
            RGVVNDTE EE+ AK+ VPR DL+DW
Sbjct: 1090 RGVVNDTEAEEQHAKICVPRCDLIDW 1059

BLAST of Sgr021830 vs. TAIR 10
Match: AT5G53430.1 (SET domain group 29)

HSP 1 Score: 267.3 bits (682), Expect = 5.8e-71
Identity = 166/487 (34.09%), Positives = 238/487 (48.87%), Query Frame = 0

Query: 628  YRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDKCRMMVHARCYGELEPVDGVLWLCNLCR 687
            Y PV V W   ++C+VC   E+++ N  + C++C++ VH  CYG     D   W+C  C 
Sbjct: 598  YEPVNVKW-TTERCAVCRWVEDWDYNKIIICNRCQIAVHQECYGTRNVRDFTSWVCKACE 657

Query: 688  PGSPDCPPPCCLCPVIGGAMKPT-TDGRWAHLACAIWIPETCLSDIKKMEPIDGLSRINK 747
              +P+    CCLCPV GGA+KPT  +  W H+ CA + PE C +  +KMEP  G+  I  
Sbjct: 658  --TPEIKRECCLCPVKGGALKPTDVETLWVHVTCAWFQPEVCFASEEKMEPALGILSIPS 717

Query: 748  DRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCARAAGLCVELEDDDRLHLLAADEDEED 807
              +  +C IC   +G+C QC    C   YH +CA  AG  +E      LH L   E    
Sbjct: 718  SNFVKICVICKQIHGSCTQCCK--CSTYYHAMCASRAGYRME------LHCL---EKNGR 777

Query: 808  QCIRLLSFCKKHRPP--------------------------SNERLMAEDRIGQAGQQCS 867
            Q  +++S+C  HR P                          S  RL+  +R  +  +  +
Sbjct: 778  QITKMVSYCSYHRAPNPDTVLIIQTPSGVFSAKSLVQNKKKSGTRLILANR-EEIEESAA 837

Query: 868  NYTPPCNPSGCARTEPYNYFGRRGRKAPEALAAASLKRLFVENQPYIASGYSQHLLSGNL 927
              T P +P   AR   Y                 S KR   E  P+   G   H      
Sbjct: 838  EDTIPIDPFSSARCRLYK------------RTVNSKKRTKEEGIPHYTGGLRHH------ 897

Query: 928  LPSSGVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAKH 987
             PS+ +     +L   +    +P++  S  E+   ++ T  +R+ FG+SGIHG+G+FA+ 
Sbjct: 898  -PSAAIQ----TLNAFRHVAEEPKSFSSFRERLHHLQRTEMERVCFGRSGIHGWGLFARR 957

Query: 988  PHRAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLI 1047
              + G+MV+EY GE VR  IAD RE        G   Y+F+I +E V+DAT  G+IA LI
Sbjct: 958  NIQEGEMVLEYRGEQVRGIIADLREARYRR--EGKDCYLFKISEEVVVDATEKGNIARLI 1017

Query: 1048 NHSCEPNCYSRVITVNGDE-HIIIFAKRDIKRWEELTYDYRFFSIDE----QLACYCGFP 1083
            NHSC PNCY+R+++V  DE  I++ AK  +   EELTYDY  F  DE    ++ C C  P
Sbjct: 1018 NHSCMPNCYARIMSVGDDESRIVLIAKTTVASCEELTYDY-LFDPDEPDEFKVPCLCKSP 1043

BLAST of Sgr021830 vs. TAIR 10
Match: AT4G27910.1 (SET domain protein 16)

HSP 1 Score: 263.8 bits (673), Expect = 6.4e-70
Identity = 163/485 (33.61%), Positives = 239/485 (49.28%), Query Frame = 0

Query: 628  YRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDKCRMMVHARCYGELEPVDGVLWLCNLCR 687
            Y PV   W   ++C+VC   E+++ N  + C++C++ VH  CYG     D   W+C  C 
Sbjct: 583  YEPVNAKW-TTERCAVCRWVEDWDYNKIIICNRCQIAVHQECYGARHVRDFTSWVCKACE 642

Query: 688  PGSPDCPPPCCLCPVIGGAMKPT-TDGRWAHLACAIWIPETCLSDIKKMEPIDGLSRINK 747
               PD    CCLCPV GGA+KPT  +  W H+ CA + PE C +  +KMEP  G+  I  
Sbjct: 643  --RPDIKRECCLCPVKGGALKPTDVETLWVHVTCAWFQPEVCFASEEKMEPAVGILSIPS 702

Query: 748  DRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCARAAGLCVELEDDDRLHLLAADEDEED 807
              +  +C IC   +G+C QC    C   YH +CA  AG  +E      LH L   E    
Sbjct: 703  TNFVKICVICKQIHGSCTQCCK--CSTYYHAMCASRAGYRME------LHCL---EKNGQ 762

Query: 808  QCIRLLSFCKKHRPPSNERLMAEDR------------------------IGQAGQQCSNY 867
            Q  +++S+C  HR P+ + ++                            I +  +  +  
Sbjct: 763  QITKMVSYCAYHRAPNPDNVLIIQTPSGAFSAKSLVQNKKKGGSRLISLIREDDEAPAEN 822

Query: 868  TPPCNPSGCARTEPYNYFGRRGRKAPEALAAASLKRLFVENQPYIASGYSQHLLSGNLLP 927
            T  C+P   AR   +       RK        S KR+  E  P+   G   H        
Sbjct: 823  TITCDPFSAARCRVFK------RK------INSKKRIEEEAIPHHTRGPRHH-------- 882

Query: 928  SSGVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAKHPH 987
            +S  +    + +H+     +P++  S  E+   ++ T   R+ FG+SGIHG+G+FA+   
Sbjct: 883  ASAAIQTLNTFRHVPE---EPKSFSSFRERLHHLQRTEMDRVCFGRSGIHGWGLFARRNI 942

Query: 988  RAGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLINH 1047
            + G+MV+EY GE VR  IAD RE       VG   Y+F+I +E V+DAT  G+IA LINH
Sbjct: 943  QEGEMVLEYRGEQVRGSIADLREARYRR--VGKDCYLFKISEEVVVDATDKGNIARLINH 1002

Query: 1048 SCEPNCYSRVITVNGDE-HIIIFAKRDIKRWEELTYDYRFFSIDE----QLACYCGFPRC 1083
            SC PNCY+R+++V  +E  I++ AK ++   EELTYDY  F  DE    ++ C C  P C
Sbjct: 1003 SCTPNCYARIMSVGDEESRIVLIAKANVAVGEELTYDY-LFDPDEAEELKVPCLCKAPNC 1027

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038884706.1 | 0.0e+00 | 92.13 | histone-lysine N-methyltransferase ATX2-like [Benincasa hispida] >XP_038884708.1...
KAG7025293.1 | 0.0e+00 | 87.73 | Histone-lysine N-methyltransferase ATX2 [Cucurbita argyrosperma subsp. argyrospe...
XP_023004925.1 | 0.0e+00 | 92.21 | histone-lysine N-methyltransferase ATX2-like isoform X1 [Cucurbita maxima]
XP_023513866.1 | 0.0e+00 | 92.21 | histone-lysine N-methyltransferase ATX2-like isoform X1 [Cucurbita pepo subsp. p...
XP_011656480.1 | 0.0e+00 | 92.12 | histone-lysine N-methyltransferase ATX2 [Cucumis sativus] >KGN45919.1 hypothetic...
Match Name | E-value | Identity | Description
P0CB22 | 0.0e+00 | 61.68 | Histone-lysine N-methyltransferase ATX2 OS=Arabidopsis thaliana OX=3702 GN=ATX2 ...
Q9C5X4 | 0.0e+00 | 59.67 | Histone H3-lysine(4) N-trimethyltransferase ATX1 OS=Arabidopsis thaliana OX=3702...
Q6K431 | 1.1e-311 | 53.64 | Histone-lysine N-methyltransferase TRX1 OS=Oryza sativa subsp. japonica OX=39947...
Q8GZ42 | 8.2e-70 | 34.09 | Histone-lysine N-methyltransferase ATX5 OS=Arabidopsis thaliana OX=3702 GN=ATX5 ...
Q9SUE7 | 9.0e-69 | 33.61 | Histone-lysine N-methyltransferase ATX4 OS=Arabidopsis thaliana OX=3702 GN=ATX4 ...
Match Name | E-value | Identity | Description
A0A6J1KVZ1 | 0.0e+00 | 92.21 | histone-lysine N-methyltransferase ATX2-like isoform X1 OS=Cucurbita maxima OX=3...
A0A0A0KAQ5 | 0.0e+00 | 92.12 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G022310 PE=3 SV=1
A0A6J1H5D6 | 0.0e+00 | 92.12 | histone-lysine N-methyltransferase ATX2-like isoform X1 OS=Cucurbita moschata OX...
A0A1S3CLC3 | 0.0e+00 | 91.58 | histone-lysine N-methyltransferase ATX2-like isoform X1 OS=Cucumis melo OX=3656 ...
A0A6J1D999 | 0.0e+00 | 92.84 | histone-lysine N-methyltransferase ATX2 isoform X1 OS=Momordica charantia OX=367...
Match Name | E-value | Identity | Description
AT1G05830.1 | 0.0e+00 | 61.68 | trithorax-like protein 2
AT1G05830.2 | 0.0e+00 | 61.68 | trithorax-like protein 2
AT2G31650.1 | 0.0e+00 | 59.67 | homologue of trithorax
AT5G53430.1 | 5.8e-71 | 34.09 | SET domain group 29
AT4G27910.1 | 6.4e-70 | 33.61 | SET domain protein 16
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR003889 | FY-rich, C-terminal | SMART | SM00542 | fyrc_3 | coord: 535..627, e-value: 1.6E-10, score: 51.0
IPR003889 | FY-rich, C-terminal | PFAM | PF05965 | FYRC | coord: 534..601, e-value: 1.8E-10, score: 40.8
IPR003889 | FY-rich, C-terminal | PROSITE | PS51543 | FYRC | coord: 531..615, score: 18.108181
IPR001214 | SET domain | SMART | SM00317 | set_7 | coord: 942..1066, e-value: 6.4E-35, score: 132.0
IPR001214 | SET domain | PFAM | PF00856 | SET | coord: 953..1059, e-value: 1.1E-16, score: 61.7
IPR001214 | SET domain | PROSITE | PS50280 | SET | coord: 942..1060, score: 17.794878
IPR001965 | Zinc finger, PHD-type | SMART | SM00249 | PHD_3 | coord: 640..687, e-value: 2.3E-6, score: 37.1; coord: 752..819, e-value: 0.29, score: 20.2
IPR003888 | FY-rich, N-terminal | SMART | SM00541 | fyrn_3 | coord: 483..527, e-value: 3.9E-9, score: 46.3
IPR003888 | FY-rich, N-terminal | PFAM | PF05964 | FYRN | coord: 474..524, e-value: 1.5E-13, score: 50.3
IPR003888 | FY-rich, N-terminal | PROSITE | PS51542 | FYRN | coord: 468..527, score: 25.171341
IPR000313 | PWWP domain | SMART | SM00293 | PWWP_4 | coord: 325..388, e-value: 3.1E-8, score: 43.4
IPR000313 | PWWP domain | PFAM | PF00855 | PWWP | coord: 326..415, e-value: 1.4E-14, score: 54.3
IPR000313 | PWWP domain | PROSITE | PS50812 | PWWP | coord: 327..390, score: 14.444873
IPR013083 | Zinc finger, RING/FYVE/PHD-type | GENE3D | 3.30.40.10 | Zinc/RING finger domain, C3HC4 (zinc finger) | coord: 631..690, e-value: 3.3E-8, score: 35.0
IPR013083 | Zinc finger, RING/FYVE/PHD-type | GENE3D | 3.30.40.10 | Zinc/RING finger domain, C3HC4 (zinc finger) | coord: 691..820, e-value: 4.5E-22, score: 80.5
None | No IPR available | PFAM | PF13831 | PHD_2 | coord: 653..687, e-value: 8.6E-8, score: 31.5
None | No IPR available | GENE3D | 2.170.270.10 | SET domain | coord: 913..1094, e-value: 6.3E-50, score: 172.2
None | No IPR available | GENE3D | 3.30.160.360 |  | coord: 469..606, e-value: 3.5E-29, score: 103.5
None | No IPR available | GENE3D | 2.30.30.140 |  | coord: 314..421, e-value: 1.9E-19, score: 71.7
None | No IPR available | GENE3D | 2.30.30.140 |  | coord: 230..282, e-value: 1.5E-6, score: 29.7
None | No IPR available | PFAM | PF13832 | zf-HC5HC2H_2 | coord: 696..818, e-value: 8.5E-33, score: 112.8
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 177..207
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 180..196
None | No IPR available | PANTHER | PTHR13793 | PHD FINGER PROTEINS | coord: 17..1104
None | No IPR available | PANTHER | PTHR13793:SF147 | HISTONE-LYSINE N-METHYLTRANSFERASE ATX2 | coord: 17..1104
None | No IPR available | CDD | cd10518 | SET_SETD1-like | coord: 929..1078, e-value: 4.90993E-82, score: 262.148
None | No IPR available | SUPERFAMILY | 63748 | Tudor/PWWP/MBT | coord: 322..430
None | No IPR available | SUPERFAMILY | 82199 | SET domain | coord: 940..1079
IPR019786 | Zinc finger, PHD-type, conserved site | PROSITE | PS01359 | ZF_PHD_1 | coord: 641..686
IPR034732 | Extended PHD (ePHD) domain | PROSITE | PS51805 | EPHD | coord: 694..819, score: 34.951328
IPR003616 | Post-SET domain | PROSITE | PS50868 | POST_SET | coord: 1066..1082, score: 9.298017
IPR019787 | Zinc finger, PHD-finger | PROSITE | PS50016 | ZF_PHD_2 | coord: 638..689, score: 8.867
IPR042010 | ATX1/2, PHD domain | CDD | cd15494 | PHD_ATX1_2_like | coord: 640..686, e-value: 2.08127E-26, score: 100.602
IPR041956 | ATX1/2, ePHD domain | CDD | cd15662 | ePHD_ATX1_2_like | coord: 702..818, e-value: 3.42264E-63, score: 208.097
IPR011011 | Zinc finger, FYVE/PHD-type | SUPERFAMILY | 57903 | FYVE/PHD zinc finger | coord: 632..705

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sgr021830.1 | Sgr021830.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006325 | chromatin organization
biological_process | GO:0051568 | histone H3-K4 methylation
biological_process | GO:0048578 | positive regulation of long-day photoperiodism, flowering
biological_process | GO:0006355 | regulation of transcription, DNA-templated
cellular_component | GO:0000785 | chromatin
cellular_component | GO:0005634 | nucleus
molecular_function | GO:0003682 | chromatin binding
molecular_function | GO:0018024 | histone-lysine N-methyltransferase activity
molecular_function | GO:0046872 | metal ion binding
molecular_function | GO:0005515 | protein binding