CsGy6G019510 (gene) Cucumber (Gy14) v2.1

Overview
Name: CsGy6G019510
Type: gene
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: Histone-lysine N-methyltransferase ASHH2
Location: Gy14Chr6: 19968922 .. 19992281 (-)
RNA-Seq Expression: CsGy6G019510
Synteny: CsGy6G019510
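
As a quick check on the coordinates in the Location field, the following minimal sketch parses the location string and computes the gene span. It assumes the coordinates are 1-based and inclusive, which is a common convention for genome annotations but is not stated on this page. The `parse_location` helper is a hypothetical name introduced for illustration.

```python
import re

def parse_location(text):
    """Parse 'Chrom: start .. end (strand)' into (chrom, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Gy14Chr6: 19968922 .. 19992281 (-)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the gene spans 23,360 bp on the minus strand of Gy14Chr6.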
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CGAAAATACCCTAAAATGTACGATCAGAGCGTGTAGGGCTAAAACGGTCTCTACCGACTACCAATCAACACCAAATTCCTTCTCTCGGTCGGGAACTTGAAACTTAGAACAACGTTTATAAGGCCTAAAACATCCTTCTTCGATTCTTTTTCCTTTTCGTAAACCGAATTCCTAAGAGCAGCAATCTTGGACGCCATTCAAAAACGCCTCACCTTCATTTCATCTTCATTTACTCAATTTCATCTTCATTTACTCAATTTCATCTTCATTCTCAGGTAATTTCCCCTCTTTCTTTGATTCTTTCTCACTTTTCTTTTTCTCTTTCTTCTCGTTTTTGGCACAACTTTGTGTTTTTCTGCACCGGATGTTTCTGCATTGTGTTTCAAATTCTGAAATTAGGGTTTTAATGCTGCAGAAAGTTGGGATAGTGCCTTCTTTTTTGGTGGGTCATCTGCCCCGGTTAAGATTCAATTTGAATTCGACGAGAAAGTATTGTTCTTGTTTTCTCCGGTTGTGGATGTTGTGAGCTACCAGAACTGGGATTTGATTGAAGAAGATTTGATACACACATTTTACTTTGTTTCGGGTTAGTCGCGTTGTAGGCTTCTTTTCTTTTGTTGCTTTATGCAATTTTTGAGTTATGGCTATCCATTGGTTTTCGCTATCGATAATTTAGATGGGTTCATGTGATGACCCGGCTGTGATCGGGGAACCGTTTCGTGCCTCTGTTACTCGGCTGGTCAGGTGTTCGAGTCAACCTCTTCCCGAGCATCAGTCGCACCAGGAGATGGCTTCCTTTTCATCTAATTCTCGTGAGGGCCAGATGTTTGAGCCAGATAGGGGGCTGGAAGTGACTACGGCTTCTCTCTGTACGAATGCATCGGACCCTGATACTTCTGGGGAGGATGGGACGCTTAGAGGCTTCGAACATGCAGATAGTTTGCTAATGGATAAAAGATTGGATGGTGATTCTGGCGGTAGTGATCCCTGTCTAAACTTGGATAACGAGTCTTGCAACGAGGGGAATAAAACATTGAGCTTGGATATGAAAGAGTCTGAAGACGTTGATGGTTTGGTTGATATTTTGGGATGCGATGCTACCATGGAAATGATTTCTTTAACTGAGTCATTAGTAAATTCTGTGAAACCTGAAGAACTTGATAATAATAGTTGTATAATTGATGCTCCTGCAAAAGTTGAAAGGGATGATACTGCACAAAATGGTCCTATTTTAGCAGGGACGGGTACTCGTACAGATGACTTAAAATCTTCTTATGTCTGTGAAATTGTTTCTAATTCAGCTTCGGCTGATGGACTGCCAAATGATTTCATACAGAAAAACGAGCTGGAAAATGATGGTGCTGGTTGTTCATTTTCTGAGGTTGCAGATAGGATAACTGAGGCTTCGGTTGAACTAGAAGCAGATATGTTGAATGAGATGTCCCCTTTACAGAGTGGTCAAATACTACCAATACATGTGGGGCAATCAATTGCCAACTATGATCGGTATGTTTGCCGGATGGACGGGAAGAGCTTAAGTAGCACCTCTGGAGAAACAGTAACTGTAGTTGCTGATATGAACAGCAACCCTGAAGGATGCCTGCAAATGTTGCCTTCCCAGGGATGCGATAGGATTGGGGAATGCTTGCAATCTGATGGTTTACCACTAACTATTAATGCTTCAGAGAATGATTTGTGTGAGGAAAAGCATGACAGTAATTCCTCATCCAAGTACGTTCCAGATGTTGGGGGAGATGATAGTGATGTCTTGACTAACAATAATAGTGATGGTGGACAACATACAGTTCCGGGGATTGGAAATGACCATAATCTGGAGGACGCTACTGTTCAAGTGAACCACGACTGTGTCGAACTACTTTCATCGCCCTTGCCTTCTCAGCTCCCTAATTCTGAGAAAGATGAATTTTATGGAATGTTGAATGGAGCAGATATTCCAATAAAATATATTAGTTCTGTTAATTCATGTAGCGTCGGTGATC
AAGACAACAATGACATAGAGAAAGTTGGCTGTGTTTCTGAAGTTAAATGTCCTGAAACAGTTATCACGTCTTCTAAGAGGAGTGGCCGAAGGAGAACATCAAGCCAAAAAACTGTGACAAAAAGGGCTTCCAGGAAAACAAAAAAAAAGGTGCCAGAGCCACTGATTTTTGACACTGCAAGAAGGAGGAGAAGCTCTATATCTAGACCTGCTCGTCCCTCACCCTGGGGATCACTGGGCCATATTATTCAGTCATTTGAAGAAATTGATGATGTTCTGGTAAATCAAACCCAGAAGCAAGGAAATGAGAAATCTAAAGGTAATCAAGGAGGTGCCAAGCGGAATAAGAAACAGCTAAGTGAAAGTTCACATAGATCAAGAAAAGGGACCCAAGGAAAATCTGCTACTTCAACTTCAACCAACCGTATTCGTCTGAAGGTTAAATTAGGTAAAAACGTGGGTCATAATTTTCTGAACATTGTGGTTCCCGAAATTGTTGATTCATCATTGTCTGCCAAGGGTGTCAACTGCAATTATGGCAATGAATCATATTGGGAAGGTAATTTGGAATTCCCTCCATCAAACCTTGGTGTTGATGACCAAAAGGCCGAGGAGGAGGGGCCTTTAAGAAAGATCTTCTGCTACAGCAGGAATCAGGATAAAGAAGATAATTGTCCTGATGCTTCTGTTGTGAATGAACAATGTACTAATAATGATTCAAGTTGCATCGTTGGTATAGACAAGTCCTCTGAAAAACATGCAGATGATAATCTCTGTGTTTCCTCCCATTTGGTTGATCCTGTAGCGACAAGTGATGCCAGGAGTTTGGATCCTGGAACTTCGCCTGATTCAGAAGTGATAAATTCAGTTTTAGATATTCAAGTCGGAGCAGCACGTCAGGAAATTTTGCAGGACTCGGTTTTGGCATCCTTAGAAGATTTTGCCGCTTCTGGAAATGCTCCCGGTAGTAAGAAAGGTAGGAAGAAGGACAAACCCTCTCGGGTAGTTAGTTGTTCTGAGGAACGTGGCATAAGTGTTTCTGCTTGCAGTAACAGGTCCAAGTCATCAAAGAAGCACGGAAGAAGACATAATGTGGACAATCAACTTAGTTCTGGTGAGACACTTACTTACGCTGATGCAAATGTTTTAAACTACTCTTTAACTGTTAAGGAATTGTCTATGGAGCAAGTGTCTTTGTTGACAGAGATTGAACTTCCTGAAGAGACTTTGAAAGCAGAAGACATTCTCAATGATAAAGAATGTTGCAGAGCAGATGTTGGCAGTGTGTTTTCTGAATCAGAGAATTCAAAAACATTTCTTCCTTCTCAATCCGCAAAGAAGAAACATCCCAAAGGTTCTAAATCTATTAAAACTAGCAAAGGCAAGTCAAAGGCTCCTGGCTCAAAAAACAAAATAAAGAATGCTTCAAATGAAAGGGTTTACCAACGGAAGTCTTTTAAAAATAGTAAAAGCAAGGAGGCTCTATGTGACCAAGTTGTGACTGAAACAGAAAGTCACCAAATCATAGGTAATTTTTCTTCGATTGATTAAGGGACACTTACAATCTTCTCCTTCTCTGAACATGGAGATGTTCGTGTTCTACTGTTTTTGTGCTAGATTTGTACTAACACATTGGAAAAATAAAAAACTTCTTTCTAATTTGTTTTGAATGTTTTAGGAAATTGTCTTGTCGACAAGCCTGAGAAAAGCGATAACATTATTGCATCCACGGTGGCAGTAGATTTGAGTGTGGTACAGGGTGCTGTGAATGAGCAGTATATGCCTCCTCGCAATGCTTGGGTGCTCTGCGATGACTGTCATAAATGGCGACGCATACCAGCTTCTCTTGTTGATAGTTTAGGACATGCAAGTTGCACATGGTACGCATTTATTTTACCATTTATTTCTCATATATGTATGTATGTGTGTATAATAACCACATTAAGAAAAAAGGAAAAGAAATACAAGGGCGGTCATGATTCAGAAATCTTCTTCAAATCA
TTTAATAATTTCCTTTTTAAACTTGCAGCATGCATTAGAATATTAGATTAGATTCTCATGCAACATCTTCACAAGTCAGCATTCTACTATATCATTTGTACGCATGAGCAAATTTGGAAATAGGGGGGATTGGGATTGAATAAAATTTAACTAAATCTGGGTTTCAATAAGCAAACTTAGGTTGGAAGTTGGGTAAATGTCAACCTCTTGTTTTGCTAAGTTGCTTACTCTTTGAGTTATATCTTCTTAATTTAGTGTTCTTTTTGTTGCAAAAATAGATATAAATATGCCAATTTATTTTTTCCTGGAAATAGAATGTTCAGTGTATTACCTATTGGCATATTTCCTTCATTTTTGGGATATTATTGAACTTACGAGAGAAGGACCAAGTCATACCATTTGCCAGTCTGGTGGGATTTCTAATCACGTGCTTAATGTAGTTCCATGTTCTTCCTTTTCAGTGATATCGTAAAAGTTCTCTAGTTTTTCCATGAAATTAATAAATCATCATGACATTATAATATTTTATCGATGGGAATGTATTTTGTTCATAATTTTTCAAATATGCCTTTTACAAATTAATGCAGTTGAAATAGTTATTTTATGGTATCCTCAAGTCTTTAGTTCACTGAATCTGTTTGTTTTCATGCAATTCATTTTCCTTTTCTAGAACCAATATTCTACAAGGATTTTATTTTATGTAAATTCAATACTCTGTATATATATGTATACATGCACTACAATGATTATTGTTATTTTATTTTAAAATTATTTTTTGATTTATTATTTCTAGTATTTTTAATTTATAGTCGTTTGTCCTTGAAGGACTTGTAAGGACAATGTGGATAAAGCTTTTGCTAATTGCTCAATCCCACAAGAGAAGTCGAATGCAGAGATTAATGCAGAGTTGGAAATATCAGATGAATCTGGGGAAGAAAATGGTTCCAAAAAACGGCTAACTTACAGGGAATTGGAGAGTTTTCATCCAGCAACAGGTACTTGGATTTTGCTGGGATGCCTTAATAGTGCCGTAAACTTGAATTAAATTTAATTTTCCTTAGGGAATGCATGCAATAATAATAGTAGTAACTACTTTTGAAGTCCATAATCCTGCATTATATAGTTGAAATGTGACCTCTGTACTTGTTTGCAGTGAATGCTGTTCCTCAACAGAACAAATTTGCTTCCATTAGCAGCAATCAGTTTTTGCACCGCAGTCGTAAAACCCAGACTATTGATGAGGTGAATTTTTCTAAATGATGTTTTTACTTGATTTGTGATTATCATCATATTTTGAGCATGCGTCCGCAGTCCTAAAACCCAGACAAAGTATGTTTAAATTGTATGGTTGTAATTGATCAAGTGAATTGGTTGGGATTGGAAACCTTTGAAATTGGAGGTGGTGGAACATTTGAGAAGTGTGCTGTGAAGTTTGGATGATTTTCATTTTCTTATGCCTTTAGTAATGGAAAATTTTGTTGTGTTCTCAGGTTCTTGATTTTCCTGAGTGCATTTTAGCTTTGTTGCGTTGGTTACGGTTAGATACCTAGCCTTTTGGCATACAAGTATATTTTTTTTAGAAGAAAGTGTGTGATGTTTCTATTTATTTTCTTTTCTTGATGAAAATAACCACTTTCATTGAGAAAATGAAAGAATACACGGACATACAGAGAAACCAAGCCTACAAAAAGGATAGACACCCTTTACAAGAAGAGGATCCAACCATGCAAAATCATTCCTAAAGAGTAATTACAAAAGATCTTTGAAATCGAAGCCTTGAGATGCATGGAAACAAACAAGGGGCCAAAGCTTGCTGGGTCCCTCTCCACCCCCCTAAACACCCTCCTATTTCTCTCTCATCCTACAAAACCTATAACACCCATATGGGGCGTTTCTACTGATTTATCCCATAGTCCTTCCATTAAATTGGAGTCTACCTCTAAAGTTATTATTTTTTTTCCCTTGAATAAGGCAATTCATTAACTCACTATAGAGGTACA
AGGTGGGAGATGCAGGATCTCCTTGTACTACTAAAGCAATTAGAGAACGACCCTTGAATTTCAATGAAAAGAAAGGAGCACAGAAGTCTACTTATTGGCAACTTGATTTTGAAGCTAGGAACGAAGTAATTTGTGTTCTAGGAAATCCCTGTAAACATCCTCGAAAATTTCCTATTTCTTTCTGACCAATGGCCTTAGGTCCTTGCAAGCACTAGATTGGCCCACAAATCTTCCCCTTGCACATTACACTGTGGTCGCCCAAACTTGACTGCAGAAAATCTCTTAGGATAGCACTGGCATGTATAAACTGGTACCATGGAGAATAAAATGCTGAATTGTGCTCCACAAATTCCCACACAGGCAGCCTGGACCAGTGGAGAGGAATAAGTGAAGAATTTTCCTCTGATTTTTTCCTCAGTTGATTATTCCATGGAACAATGAGCAGATAAAAGATTCACGAAGTTGTTGAGGGTTTCTTTTGTTTTCTATTTCTTTTGGACAGCTTTCTATCCTATATGTGTGATTGTAATACATATTTTTTATTTGAAGTTCAGTTTCTCATAAAAAGTGTAATTCCTACCAAGTAAAGGATCCATGTAATATGCTTTCTAGTAGGGGAGAATATTTGAAATGGAACTTATCAATGCAATGCATTTAGATTTAATAAGGCGAGGCAATCTGCACACCTGTGTGTGTGTGTATTTGATAAGACAGTCTACAGCATGCTTCACAAAGCTGGTGTGAAAAGTTAAAAACTGAAAACTGAAACTTTCTGACTGTGCTTCTGAAAGTAATATGTAAAGGAACACTTTGTGGAGTTATGGACAGGGAATGCACAAAAAGTAGTCAACAAAACTTAAATTAATGGCAAATTTACAATGGCTGGAATGTTCTAATGCACCTGAATTGTGGTCATTCTATTCATGTAAAGTGTGGTTGCTGATGCTGCTTCTTCTGTTAGTCCTGTTGTTAATTGGTTAGCTTTTTTTCCTTTTTATAAGATGGGAACAAAAATACAAATCTTCTCTTTTGCTGGGATAAAACTAATTTTGTTAGCTTTCTTGTGTTTTTTTGCACATATTACCAGTGGTTGAAATTGGTTGTTTGTAGGCATTTTTGTTTGTTCTGTTTTGTATATTTTATGGTTCTTAATCTTATAATGATTTGTCAGTGTAAGTGGTCAACTCTCGAGTTTCAGTGGCCAATTAGGTTACAAGACTGAATACATTTAGGTCAATGTGGCAATATCCTTATTTTATATGTTTATACATCTCTTGAACAGATAATGGTTTGTCACTGCAAACCAGCTTTGGATGGTCGGTTAGGTTGTGGAGATGAATGCTTGAACCGAATGCTCAATATTGAATGTGTACGAGGTACATGTCCTTGTGGGGAACTTTGTTCTAATCAACAGGTATTGAAGCTTTTCCCTACAGTTCTCTCTCTTTCTCTCTCTCTCTCTCACACATACATTATTGATCTATCTTTTTGGTCGAGTTAATTGAGTCCTTATGAAAATCTTATTTAATTTTTGAACATGTGGTTTCAGTTCCAGAAGCGTAAGTATGCTAAATTACAGTGGTTGCGATGTGGAAAGAAAGGTTATGGGCTACAGTTGCTTGAGGATATATCAAAAGGACAATTTCTCATTGAGTATGTTGGAGAGGTAATTGTGATGTATCCATGGTTGAAGACATATCCTATAGAACAATCTATCTCGATCGTTGGTTACTTTTCATGTGCCTTGAAAGTTGCATATATATGTTTCTTGCTTTGCTTGATTTATCTATCATGTTAATTGCATCGGTAAGAAATGATAATATTTGTGGACCAATAATGTAATCTCTTTTATGACATCATTTTTTTGGCTGAATAAAAATTTGTGCAACTGTGTGAAGATGTCAAGCTGTAGGTTGAATTTGGATAAGATTCTTTATGACTTCATGACGGGTTATATAAACATAACAGTGGAAGGGTAGAATGTAAGTTAGTAGAACGTG
TGTAGAGTTCCAAGTCCTTTGTTAGTATATGTTTTTTTTTATAATTACAGTGTTCATTTTTTTTTTGTCCAACTGGAGATATTATTTGTTATCTTCTTTGGTATAGGTTCCTCCTTTTTTGCAATTTCATCTCATTAATAAAAGTACCTTACACACACGGACACACATATTTTGTTGTGTTTCAATTTTTGTGATTTTATGTCATTCATGTTTGTTTATTTTTATTTGGGGTTCTCCCCTGATTTCATTTATCAATAAAATGTTTCTTTTACAAATATATGTGTGTGTATATATATTTTATTCTATTATTTCATCTACTCAGGTCCTTGACATGCATGCTTATGAGGCACGCCAAAAGGAATATGCATTGAATGGTCATCGGCATTTTTACTTCATGACCCTAAATGGCAGCGAGGTTGGATCTTCATTTTCTTTGCTAGTGTTCTTAATTGAATCATGTTCTTAATGCGCCTTTCCTTTTGAGGCTAGGCTATCATCTTGTTATTGTTGTACTTCATTTTGATCATTAAGTTGTGAGTTGCAGAATGTGGGAATTCTTTGTAGTAATTCATTACTGTATATTGCTTCTGATATTCCTCGTCGGTTTGATTCTGGGTTGTTGCTTATGCTTTATTACGTAACTTTAGTTTGCTTTATGTTGTTGCTTATGCTTCATTGTAGTAGCTTCTTTGTTTAGTTATTTTAGTTTCTTCTTGTATCCTTGAGCATTAGTCTCTTTTTATTTCCTTCCTAAAAAATTTCATCTCTTTCTCAAAAAATAAAAGTGACTACTCAAAGACTGAAATATATATAACGTTTAAACGTTAACACTAGAATGTTTTATAACCTATTATGATAGATTCCTTCTATCTTATGTAAATATCCTTATGGGTGTCATGTGGTGTTTTTTTTTATAAATTTTGACGAAGAAACTTCTCTCTATCCTTAGAGGGTTCTATACTCTATTGACAGAGGTTTTTCTTTTGATGCATGAGATGATGCATTTGTGTCACATGCGCAATTCGTTAGAGCTTAGAAATACCACAATTGCGTTGTCAGTTTGCTCGCTCCATTCGAGCTACCTTTCCAGTTATTTGATGTCTGTGTGGCAGGTATAGGGACTTAGGCTAAGAAGCATGGACACAGACACGACGACACGTCATTTTTTTAAAATCTAGGACACAACATGGCAATGACACGTTTATAAAATGAATTGATGCATTTATATGCTTACTTAGCTTGATGTATTCCACACTAACAAGTGTCCTAGTGTTTAACACGTGGTGGACATGGACAAGCTAGCCAAACTAAAGTGTCCGGCTTCTTAGGACTTAGGTTTATGGTTGAGGAGGATCTCGAGCATTCTCCCTTTTGTAACAGGAGTCAACTTTCTCGGCAAGCCAATTCTTTTTTCTATTTTGGGTTAACCTTTTTCATTGGGCTGTCTTTTTTGTATGCCTGTCTGTGTTCTTTCAATTTTTCTCAAGATAGTCTGGTTTTTCATTTGAAAGAGGTTTGTTTAACTTTTGGCTCCATTGTAATGTTTCAGATTCTTACTTGTTTGTTTTCTTTCGGAAAAGAAACAAAAGAAGGAACATGTTGATAGAATCTATTTTATCGTTCACCTGGACAGCTCTTAAGTAATGCTTTAAGAATATAACTAGGTCTAATTACTGTAAAGTTCAAATGATTTTCATTCTTACTGTATCTTGTTAGGACTAACATTTCATTGTACAAGATCTATAGCTTGAGGTTTATCATATTTCACATTGTTCTAGACTACTGACAAGGAACTGATACGAGGACCACTCCTCTTAATTGCATCATCTTTTCCTTCATTTCACCTTTGGTGAAAGTTATTTAAAGTCTGTAATTCATCTGTATATGTGTTACAGGTCATAGACGCATGTGGAAAGGGAAATTTGGGGCGTTTCATTAATCACAGTTGTGATCCAAATTGCCGCACGGAGAAGGTCTATACCCTATTCACTAAGCATTGT
TGCATATTGCGAACATTGTAGCATACTGTTACATGGTAAAATGATGTTTGTGTGTGTTTTAAACAAGTATTTTCACTAGGGATGTAAACGGCAATACAACCCAACCCTTAAAATATGGGTTGGGTTGTTGGGTTGCCATCCCTAAGTATCAACCAACCTTTTCTTTTACCCCATTTTCTTCAATTAATAATAAAATATATATATCATTAATTTGGGTTGGGTTGGGTTGAATTGAATTTTTCAACCCCCAATCTGAGATCCAACCCAACCCGAAAAAAACTAAAATTTTGAACCCAACCCAACCCACATTTTAAACTAGCCCAACCCAATTTTTATACTATGGGTTGGGTAGTCTAGGTTGTCGGGTTATTCATGCACTTCCTAATTTTCATCCTAGGATGTAGACTTCTATGCAGCACTGAGACTTCTGTAGTCTGATTACCAAGTGTTAATGGTTTTATTTAATGTTTTATTTATTACCAAAATGGCATATTCTGGACCAAGTTCTTATGCATGAGGTGATAGAAGATCATAGAAGCAGAATTCTGACTTTTTTCTTATTTAAGATTTTTGTGTCGGCTATCAAGATTGTAGAGAAGTTTGTGAAAGACTTTTGAATTTGGTGGTTTAGATATAGGTAATTTGAGATTGCACAATGAAATTCTCTTGGTTTTATTATGGCATTTCTTGGCTCCATGCGACAGATTTGTTATTCAGCAAGTATGAGCCTCATTCTTTTGTGCAACTGTGAGTAGCCTTTCGGTTCTAGATTGAAAGACACATAGGACCATAGGGGAAGGGGCCAACAAAAAGAAGATAAGAAACATAAAGATTTAAAATAGATTGACGATGTTATAAGCCCACAAATAAGGGGAAAACTAAACCAATCAACATTGCAAAGAACTTCAAATTGAAGCACCCCATAAGCAGATCAGAAAACAAGCCAAAAGCACCCTTTGGTGGCTCATTTAAAGCCAAAAAGAGGAGGGGAAATTGCTTAGCCATCAAGCTGACAATAACCCCTCTTGAAGCAAAGATGTAGCATTTTTCGCATAAGAGATCGAGCATTCCAAGTTTCAGACCTCTCTAAGGCTTTTCTTTTGTTTCACTGTTTGCTTATTATTTTTCTTTGTACTAGCGGGTAATAGGTATAACCATAATACAGAGGCCTTGTCTCTGAACCAAGACGCCAGTGCAAGTATATTTGGCCCCGAATTGAAAAACCCGTTTGTCGGCTGAGGTATGAAATCATACGATCATGATTCATGAGGCTTATGGAAGGAGTGAAGTCAGGTAGAGTTGGGGCTTGAAAGGTGAAATTCCGTATCAGCGTTCGAAGGCAGAACCACAACGGTAGTAGAGTGGATGGATAACGGCAGGGCCAACCACAATGCTCTTCTCTTGTTTCACTGTGTGCTTGGTGTTTTTCTGTCCTAGCGGGTATAGCCATAATACAGAGGCCTTATTTCTGAACCAAGATGCCGATGCAACTGTATGAGGCCCTGAATTGAAAAACCTTTTTGTTGGTGGAGGTATGAAAGTATACGATCGTGCTTCATGAGGCTCATAGAAGGATGAGTGGAGTTAGGTGGAGTTGGGGCTTGAAAGGTGAAATTTTGTATTGGCTTTCAAAGGCACAACCACAATGGTAGTGTAGTGGATAACGCCAGGGAAAATTGTCTTAGGGGGATTCTGAATCACATTTGAAAGGTGCTGTTTCGTATTTCTATCCATGATCTCTTTGCTTCTGTTTTGGTGGGGTACCCATTCTATGGTTTGAAGAAGACTCTTTGGTTGGTTCTTGTCCTTGCATTCTTCTGGCATCTTTGGGGAAACAAAATGATTGCATTTTTAGGGATGCATCCTCTTCCCTTGAGCACTATCTGGATAGGGTTCTTTCTACTGCTCTCTTTTGGTGTAAAAATCAGCACTCTTATTGCACTTTATAGCTTATCTTTTTTTCATCTCCAATTGGAATTCTTTCTTATAACTTGTATTAC
TCCTTCATTTCATTTAAAGTTTTCGTTCTTTAAAACAAAAATCCATTTCTTACAAAAAGAAACAAAAGAGTTGTGCCCTCATTGAGATCTTTGAGAAGGATTTGATCATGGAGTCAAAACTATGAGCAGTTTGTTTTCCTAGAATCCCTCCCATCATAAAGATGGCTAGACGACAAGCAAACCATCAATGGAATTTACTTTGAAAACTCATGGTCTATCACGTTGATATTATGTATATACACGTTAACCAATCTTAGTTATTTACAGCTGATTAAGATTTTTAATATTTCTGTGTTGATCGGGTTAGCAAGTAGCAACTTAAACATTAATTTTCTGAGATTTGTTTGAGAATTTACGAGTTTCTAGTCCTCTTTTTTTTTCAATTCTTATTTTCTTGGTCTTGGAACAGTGGATGGTGAATGGAGAAATCTGCATTGGGCTCTTTGCACTAAGGGATATTAAGAAGGTACATTTACTGTTTGCTTTGTTTCCTTCTGAATCATTCACAGAAAGAATATCCGTGACTTATGATTGGGGGACTTTTGGTTGAAAATGATGGTGTCATTTTTAATGAGAAGCTCGGTTTGGTATATATATCTTTTAAGATAATTTTTTTAGTTATTATTATTATAATAATTAAAACCCTTAGGCCACAAGTAATGCTAATTGGAGTGAGTTCATTTGAGCTCCGAGGATTGGAATTTTGGTTCTCTTCTGTAAATTTTTTACTACGTCAATGAAAAGGGACATGCATGTGTTTTCCTTAATTCTTTAAAAGCTCAGTACCTTTTGACTTGATGGGAGAGAATAATTCTGTAATCTCTATCTGTTTAGGGCTTCTGAAATGCCATTGTGGATTTTCTATCTTTTTGTACCATTTGCTGATCTGTCCTTCAGTACAAATTACGATTATAGAACATTTTCACAGCCCTCTTGTCATAGTGTATATCCCCATTTCTCCAGCTATAGTTTCAGTGGTCTTTGAGATGAAAAATAATGTCTTGATGAAAACTTGACATCAGATTATCAATGAATTGTTTCTTTTACCAAAAAAGAAGTAGAAAAAGAAAAAAGATTGACATCTGATTACGTTATATCTGTTTTAATTTATTTTTCAACAAGTAAATTGAAGCTGAGGTGCAATAAAATAGTTTTGGTGGCATCATTCGCCAAAGCAAGACCATGTGATTCTATAATTGTTGTATTTCTCTTTTGATAAAGAAAATCGAGTTGTTCATAGAATATCTTTTCACATGATTCTGCAATTTTGAGTATGAATGTGGATGAAACTGAACAGCCATGATGATGACATTTCCATTCATATCTTAGATATATTTTTTAAAAATTTATCCTTTCCTTAAATGTTTTCTCCTAATTTCTTAGTTGATTTGGACATTCCTGCCCTTCTTGTTCAAAGCTCGATATATGATGCTTTGGCATTATTTTGATCAGCAACATTTTTTGTAAAAAAAATAAATCCCTTTGATCTTCTTCCTTTTTTTACGAATTCATTTTCTGCCTTACATTGCATTAGTAGAAGCTGTATCCTCATCAATTCTTTCCTATTTCAGGGGGAAGAGGTGACATTTGACTACAATTATGTAAGGGTATTTGGGGCTGCTGCAAAAAAATGTTATTGTGGTTCTTTCCATTGTCGAGGTTATATAGGTGGAGACCCCCTCAATTCTGAGGTCATTATTCAAAGTGATTCAGATGAAGAATTCCCCGAACCTGTGATGCTTCGAGGAGATGGAAGAAGTTTGAATAGTAACTTGTCAACTGCAGTTAGTTCAATGGATGTTGCTAAAATGCAATCCTCTGAGCATCTAAAAGGGAATAGGGATAAGAGAGATCAACCTATCAGAATTGCTAGTGAATTGAAGATTTCAGAAGAAAAAGTGGATCCTCTTAAGCTTTCTGCGTCGAAGATTTCAGAAGAAAAAGAGGATCCTCTTAAACTTTCTGCAACGAAGATTTCAGAAGAAAAAGAAGATCCTCTT
AACCTTTCTGCCTCTACCATTTCTCCATTGCATAGTTCATTGGAATTCGAAGATTCGAAGGTAGCATCACCTATTCCAGTGCCAGATATTACCCATCAGACTGAGGATGTGACAAGCCAACCTATCTTTGTTGATCAGACAGAAATATCTCTTTTGGACAATATTCCTGACAAAAATACATGCTCTATTGAGCAGGAGGCAAAGTTATCAGTGGATGACATTGACGCTCGTAAGAAGTCCAAGCTGGATTCTGTTGAAGATAAGCAAGTGTATATAAAATCGCATCCTCGGATGAAAACTTCGCGTAAACTAGGTTCCATCAAGAAAGGAAAAGTTAGCTCTGCAGAAAAAATACAAATAACTAACAGGTCCCAGATTTCCTCTGTAAAGCCCAAGCGATTGATTGAAGGTTCTCCGGGTAACCGCTTTGAAGCAGGTTAGTCGTGCTTGGAGTACTAGCTTTTGCTTAAATAGCTGTGATGGGCTTATTAATGGTTAATTGTATTTAATACAGTTGAGGAGAAGCTTAATGAACTTCTGGATGCTGAAGGGGGAATTAGCAAAAGAAAAGTGAGTTATTGTGACTTACCACATGTTGACTACTTGAATATTCTGCTATTCTATTGGTTTTAATTGCCGGTGGCAACGATGGTCAGTGGGCTCTTTTATGCTTTACCATCAACTGCTCTTTCTTCATGTAACTCTGCTCTGCATTTTTCATGGATCAGAGTATTGGTTTTTAGTACTTTATTTGGCCTATTCTTCTGTACTTTTGTTCTTTCTTGATAAAAGGTGTCGTTTTAGGTGGATTTGTGCTGAAGTGTGCTCATTTTCCGTTCAGAGATATTTGCAGTGTTCAAGAACTTATAAGTTTTTTCCTTCCTCTAACAGGATGCCCCTAAAGGGTACTTAAAGCTTCTTCTTCTGACTGCTGCATCGGGTGCAAGTGCTAGCGGTGAAGCAATTCAGAGGTTGCCTGAGATTATTACTATGTTTTTATGTTCCAAATTGTGACAGTCATTTATGTAATGTATATCATAATATTAAATGCTTTTTGTTATTTTTTTTTGACAGCAATCGGGATCTTTCAATGATCCTTGATGCTCTTTTGAAGACAAAGTCACGATTAGTATTGACTGACATAATAAACAAAAATGGTATAGTTTTTTTCTTAAATAATTTTTATGCTCTCCGTGTATATTTTATGTGGCCTTCTTGTGTCATTTGGTTTATAGAATACCATAAACTATTCATTTCCCCTCCTTTTTGCTCTCAGGTTTGCGGATGCTGCATAACATAATGAAGCAATATAGAAGTGACTTCAAAAAGATACCTATTCTTCGGAAGCTTTTGAAGGTATTACTTTTATGCCAATTTCAAGATTCCTTTTAATGAGTCTGGGAAAAAACTTCCCAGTTCAGTATATATATATATATGATCATTTCCTTGTCTTAATCATTTTGTATTTTGGAGTATTAGTTTCTTTTAATTATATCAATAAAAAATCTCTGTTTGAAAACGAAAATTCATTTCCTTGTCTTCATAATATGACTAATGAAGTCACATTTGAATGTTTGAGTAAACAAGGATCAAATATGTAATTAGCAAAGAAACTGAAATTGAGTATAAAAACTGTTAAATTTAACACTAATTGCCAATTGGTTTTGAGATGGAACCTCATACTATCAAATTTAACATGGTATCAGAGCCCATTAAACCCAATTGAGTATTCGGTCCAAGATAGGTGAACCCAAAGAGGCACCATCTTGAGGGGCATATTGGAGAAGTTGGGAGTCCCACCTTGGAAAAACTAACTCCTCTAATTGCTAATTGGTTCTGAGATGGAACTCCATACTATCAAATTTAACAAAAACGATGTGCGATTATATTTAAGTGAACTCTATTACAAATGTTCGAAATGATGAGGGGGTAAGTGCCAACCTTTTTGGTTTTGTATAGGGGAGAGGGACTAGGAAAATGTGGTACAGATTTCAATCTCT
GGTCAGACTTGGTATTGAAAGTATAGTTAGCGCTTTTGTTATTGCAACCAAAAAGATATATACATAATTGGTGTTTACTTCAGATGAAATATATCTTATTTAGTTAACATATTACTGTGTAGCATTAGGACTTCAGTTTTTTCTGAGGAAAGATAAGATAAGACATGTCGAACCTGTATGAGATTGCTTATAGGCACTAGTCATTTTCACCTTTTCTAGTTAGGTATCCTGGAAGTGCAAACATATCCTGGAGGGAGCATGGCCTTTTTCTTCTCTTCTCTTTTCTTTTATCTTCTCTTCTCTTTTCTTTTATCTTCTCTTCTCTTTTCTTTTATCTCCTCTTCTCTTTTCTTTTATCTCCTCTTCTCTTCTCTTTTCTTTTATCTCCTCTTCTCTTCTCTTTTCTTTTATCTTCTCTTCTCTTCTTTTTCTTTTTTATTTTCTTTCTTTCTTCCTTCCTTTCTTTTATCTATTTATTTTGCATGGCTTTTATGCAATTGAGTGTTTGTTTTCCCCCCTTCTTTTTTTGATCTGGTTCTTGATGGGTCTGGTAGACTTGCTTGAGAAACGAGAAGGATATACTATCCAATGCCCTCATAGGTTGTGGCATGGCCAAAGTTAACCATTTTTTTAAAAACAGAAACCACATTTTTCATTGATGTAATTAAAATAAACTTATGCTCTAACTAAAATGAGATACGAATAGTGGGAAGATATCTCAACTAGGTTAACACACTCATAACGTCCGAATCACGTCCCCTTACAAAGAAAGCATTAAAGTAAAGTACAAAATTACATATTGATTTAAATAGTGAAATACATAGTTAACTTGGAGTAATGAAAGCATTCCAATTTGATCATATATCCTGGATAGAGTAATCAACAATTTTTTTGGAAAGGGAACACCGTGATGAAGCTACGAGCTGAGCAGAACCAAAAGACCATTCAATGGGTTTGTCTTGAAATACCCTTCGTTCAAACCAAATTTATGAAACCAAGGCTTTGGCTGCATTTGTTCATATTAATTGAGCTTTAAAGGACAATAAAGGGCCAACGAACAGAAGCCCTTTGCAATCAATTAAGTAGAGAATCCATATATTAAATTACCAGGAAAGAGATGGAGTGGAACTGGTGAAAAAACAAACTCCATCTTCAATGATCTGGATGAAAAACCATTCTTATAATTGATCGGGACAACAACTTAGATAAATCTCCACAAACTTCAAGATGAAATCTAAATTTTCAACTCGCTCTAGGAGGAGACAAATTTGGGCCTACTTTTGAAATCTCAATTATGGACCTTCATCATCAAGGCCTACTTTTGGCTCATATAGAATGAATGGACCTTCATCATCAAAGCCTACTTTTGTATATTGCTCTTACTTGGTGTAAATTCACCCCATCCTTTTGTGATCACAACCTCACTTTTCTATTTCACAGTGGAATAGTATAATGTAATCTCACCTGGCAATTTTTCCTTTTGTAATTTCATACTATCGATGAGATTATTATTGACTGTTTCTTGTAAAAATAATTGCTGGAGGGAGGCAATTGTTGATTGGTTTTGGGATTGGGTTGGCATACTCAAGAAGGGTGGTTCCAACTGCCTTTATACTTGGTTTATCTTGTAGTTCTTCATCCTTTAGAAAATTTCGTTTTTTTGGAAGATCATCCCAAATGCTGTTGTTATGTTCAGTACCATGATTGACATTCCAACCTGTTTGAAGGATTTTGCCAAGAAACCTGAGAGAGTTCTTAAAGCCAGATCTAAGTATCTGCTTCTTCATTGAAACCCATAAAGGATTGGACATGTGAGGAGATGTCCATGCGAAAAGCAAAGAAAATCTCTTGAGATTCCATCCAAAGTCTTTGTGTGAGGGTCTCATGTGCAACCTTTCTCTGTGATCAAGTTAACCCTCCTCCTCCTGTCTTTTATTCTATATGCCATGGAAGTTTTAAATCTGCAAGAATGTGATTTCCCCCTCCCTTCTTGTCTTTGC
AAGTTCTACGCGACAAAGTTAACATCGAGGATTGGACTCTTTGAGGTCTTCTTGTTTGATTGATGTGAATGTAGTGTTGTCCTTTTTAAGGAAGCTGCAGAGGTTCTCGTTCATCTTTTATGGTTTTATGATTATGCTTTGGCTATGTAGACATGTTTTTCAATCCTTTAGCATTCATGCGGCTCGACCTAAAGTTTGTAGTTCAATGCTCGAGGAGTCCTTACTTAATTTTCCCTTTAGCAAAAGGGTAGGTTTTTTTGGATAGCTGACATTTGTGATCTCCTCCTCCTGTCTTTTATTCTATATGCCATGGAAGTTTTAAATCTGCAAGAATGTGATTTCCCCCTCCCTTCTTGTCTTTGCAAGTTCTACGCGACAAAGTTAACATCGAGGATTGGACTCTTTGAGGTCTTCTTGTTTGATTGATGTGAATGTAGTGTTGTCCTTTTTAAGGAAGCTGCAGAGGTTCTCGTTCATCTTTTATGGTTTTATGATTATGCTTTGGCTATGTAGACATGTTTTTCAATCCTTTAGCATTCATGCGGCTCGACCTAAAGTTTGTAGTTCAATGCTCGAGGAGTCCTTGCTTAATTTTCCCTTTAGCAAAAGGGTAGGTTTTTTTGGATAGCTGACATTTGTGATTTGTTATAAAATCTTCGTGGTGAGAGGAACGATGGATCTTTCAAAGTCACAATCAGGGTAATAAGACATTCATCAGAGAGCAAATGTATGGCATAAGACATTGTGAGGAATTTCGAACTGGAGCTGTTTTGTAAAACTTTTCTTGATTAGAATGATGAAGTATTTTTCTAACCTTGTCCTATTTTAATTTTATATAATATCAAAATAAAAGTCTATATTAGTAAATTTTGTATATATATATATATTTTTTTTTTAAGAAACAATTTCATTAATGAAATGAAATTACAAGACAAGAGGGGGAAGAGGCCCCAAAACAACAAAGCTATTTGTATTTTTTTTCCTTTTCTCCTTTACCATTCAGACGAAGAAATTTGATTAGGATTATTTCTAACTTTTTTTTTTACGTTCCAAACAATTCTTTATAATCACTTCTCAAATTTTAAATCATAGGTTTTTAAATAATTTCTTGAAATTTCTTGACACCGAACGGAGTGTTTAATAATCAAACTCTTCTGCAGGTCCTGGAATACTTGGTGACAAGAGAGATACTCACTTCAGAGCATATCAATGGCGGTCCCCCTTGCCCTGGAATGGAAAGGTTAGAGATTGTGATGCTTAATTTCAATAACTATTTCCTATCATTCTGTAGGTGTCGGTGAATTATGTAGTAGTTTAATTTGTTGAGATGTGCTAATGAAAACTTTAGTTTCTTTACTGGGGTATACTTTGGAGGGGGGTTGGGTTGGAGACCATGTTTGGGTCATTTGGGTTGCTAACTTGACCAACCCAAGTTTTTGGGTTGGTCCAAAAATGTGCCACAACCCAACCCATGTACAACTTTAACTTTCAGTGTATATTTTTTTACCACTTTCCTTCTAAAGTTCAATTTATAGAAATTTTATCATGAAGATCTAGTTGAATGTTTAGTTTTCTTTTCTTTTATTTTCCTTTTTCGTTTCTAAGATGAGAAACAACGTACGTCTCTGGATCCCCAGGTTCTTTTACTCATTGTATAACTCTCTTGTACTTTGAACTTTTCTCTTGTTATTATCATTAATAAAAGAGACGTGTTTCCCTTTAGAAAAAGAAGAAAAACAACATTGAGTATCAAAAAATTTGGGCATTATATGCCACCAATTGAGACCAACTTGCAATTAAAGAAGATATGATCCACTGTTGTCTTGTTTTGTCTCTTGTATTTGATATAATCCACTGTTTCATTATCTATTTTGTACAAAATGCAGCAGCTAGGAGAAATTGGAATATTAGGTTTATGTTTTCTTGACGTTTTCAGCTCTTCAATTGAGTGAATTCCTCTCTAACTTAGAGACAAATGAAAAAGTTGATGCTGTTAGGAG
CTTATGATTTTCAGATTCTTACAAATGAGAAGAATTTATAAGGACGTACCCCCTTATCTATTTTCTTCCATTTGCTTTCGAAACTCTATCAAGAACTGCCAATCCACTCTTTTATTGGCATTTTCTAGATCAACTTTGATTATCAACATTTTTAATATAATATTTTTGTTTCTTATAAAAATTACTTTGATTACTCAACTTTTTCTTTTTTAGTTTTCTTCATTTTATCAAATGTGCAAATTGAAATTAGGCTTTGCTTTGTCATTCTATTCCGAATATAATTGTGAGATTTTTACACTGCATGTACATGCATTTGAATCCCTTTAAAAGAATGATTGTGCCCTTGTCATTGAATTGTTGTTTTATTCTAGAGCATGGATGCATTTGAATCCCAACGATATTGGAAGTTGCACAGGAGACGTACTATTGTTTAAGTTTTTGGTGGCCAAACACATAAGTTATATCTCTTGCCAGGGCTGACCATGTGGTGGGACTCGTAGAGTTTTTGTTTAGAGAAAATTGTAAACAAGAAGTGGGTGTAACAAAATATTTTTGTTTGCTAAAACTTGGTATTGTAATGTTTATCTTTGTTAAAACAACCACCCCGACAAGAGTGGTTCTGAAGTAAAGTGAAACTCTAGTGAGATTCCTTTGGTTAGGTAGGTCGTACAAAGGTTTATCTTTCAGATTATTGAGGATCTTGTACAAAATAATATGAAACAGATTAAAATGGGATGATTATAACTGATAGGGTGTCAAATTTGCCAAGACAAAGTTGCTGAAAGTACTTTACATCTAATGCTGGCAATTAGTTCTAGACTAGTTGTAAGTTGTTTTTCTTCACTCTTGAGTAACTCCAAAAAAGAACAGGCATTTCTAGTTTATGCTGGCACTGATGTGTTTTTTCTTTTGACATGCTTATTCTGTATCTTCAGCTAATTATTTTCTTTTGCCATGAGTGACAATTTGTAGATCCTTGCGCTTGCTGTATTTTAGTGATCCTTACATGATTAATTTTTTTGCAGTTTGAGAGAGTCCTTATTGTCACTGACAGAGCATGATGACAAACAGGTATCATTTACAAATTATTAGAGTATCGATTGACCTATTCTATTAAAATGAATAGTAGTGCAAAGAGTAGAAGAGTATAAGAACTTTAACTACAATAAAAAATTCTATATAAAAATAAACAAACAGACAAATAAATAATATATAGGTGTAATTATGGTGGAGTAAAAGGGCTTCCCATTGCTTAACTATATTAAAGAGTGAAGTCACAAACAGGGTAATAGGCCATTTATCATCAAATGAAGATCTATCCAAAAGTCACACTCATGACTAAATCACAGTTCACATTTGCAATATATCAAAAACAGAGCAATGATGTGAAAACCTCCGGTTTGCTTGAAGATGTTGGTAAAAATGATTTTTATTTCAAGTGTATGCCAAATCTAGACAATTTTTCCAAAGACCACCACTGACAAAATGTCCAAAAACATTCAAATATAATTTACAATTCTTTTCCTAAGTCTTAGTGTAGGTGTGATCTTTGGCCAAAAGCTTTTAAACTTCAGTTTTGATGAGTTACGCATGACCTGCAGGTACATCAAATTGCCCGAAGTTTTCGGGACAGATGGTTCCCAAGACATACTAGAAAATTTGGTTATTCTGAGAGAGAGGATGGGAGATTGGAAGTTTACAGGGGTTCAAACAGCAGTAGGTTTACTGCATCACACAGTTTTCGGCACGATCAGGATTGCAGACCCACAGATGCAATTGATTGTATCAAGCAGTCGATGCCTACATCTTTGCCAGATGCTCATCCTGCAGAGGTCTGTTCTCTGGCTTCCGCAGCTAGTCACTCAGTGAATGGACAAAAAGTCCGTAAGCGTAAGAGTCGATGGGATCAGCCTGCAGATACGAGCCTAGATCTGAGATCTAAGGAGCAGAAGCTTGAATCAACATCGGTGCAGGAATTGAATTCCAGCCAATTAAATAGT
GTTGGAGCAGCATCAATGCTGATAGACAAAGTAAACAATGATGATAAGGACATCTCCCTCTCTGATTCTGTTGGAGTACCTTGTCGGCAAGATGAAGATATAAGGGCTGACAGTGCAGTGCCAAACATCCCTGAAGATATTCCTCCTGGATTTTCATCTCCCTTCAATCCCCCTGTTGCTTCCTCAAGTGCTTTTTCAGCAGTTTTAGATCCTCCTCGACAGAATATTGGCGATTTGAGTTGTGCTTTTTCCACAGTTGGGCATCTACAGGAAAGATTTATTTCTCGCTTGCCTGTATCCTATGGAATTCCATTTTCTATCATTGAGCAATGTGGAACATCTCATGCAGAGAATCTGGAGTGTTGGGATGTTGCTCCTGGAGTGCCTTTTCATCCTTTTCCACCCCTACCACCATATCCCCGAGGTATGAGAGGCCTGCCTACATCTGCCTGTGGTACTGCTGGACAATCGTCTCAGGAAGGGCAGGTGAACAGCCATGATTCTCGAACTTCTTTCTCAGAAGAAAGCCCTCCTAGTACAAGTACTAATTACCAAACAGATTTGTGCACTCCATCAAACAACCAGCAGATAGCGAAACGGGCAAAAGAATCATCATGTGACTTGGGACGAAGATACTTTAGGCAGCAGAAGTGGCGTAATACAAAGTTTGGCCCCCCTTGGTTACAGAGAAGAAGTCAGTGGGGATGCCAGGGGAACTTCAGGGGTGGGGTGAGCACTATAGGTGATGAAAATATTCCCGACGAGGAAATAAGTCCATATTGCTCGGATGAAGCAAGCGGTAGAGTGGATAAAGCTAATGGTGATTTTTATCAGCATTTGCAGAACCAAAATCTGCGTTAGTCTTAGGAAAATAATGATCTGAAATTTCAATAGAACAATGTATTCTCAGTTCTTTTTCTTACTTAGGTTTGTACTCGATGAAGTTTCTTCAACTTCAGTATGGTAGTGCTCTTACTCCAGCATTAGTAGTAGTACTATTCATGAGGTTGATTATTGCTTCCATTAGCATATGAGGTTTTGGATCTTTCTCCATCAATCAACACCTCAAAGAGGTAGGCATTCTTATTTGTATATCATTTTCTCACAAAAATTAAATTGTGACACGCAATGTTTAAAAATCTCGGAAAAATAAAATTTTCATTTATTCTTGCTGTCATTTGCCTATGATTATTGGAGATTCAAAATATTTCTTCTTGATTTTCTCTTAAGCTATTTCAACTATCCTGTAAATTACATGGACCCATGTCGGATGAAAGTATTTTGAGCATTTGTATGAATGAACACTGCTTTTATTTTTACCCTTTACTTGTTACACAGGAAATGATTTATTAGAAGTTTT

mRNA sequence

CGAAAATACCCTAAAATGTACGATCAGAGCGTGTAGGGCTAAAACGGTCTCTACCGACTACCAATCAACACCAAATTCCTTCTCTCGGTCGGGAACTTGAAACTTAGAACAACGTTTATAAGGCCTAAAACATCCTTCTTCGATTCTTTTTCCTTTTCGTAAACCGAATTCCTAAGAGCAGCAATCTTGGACGCCATTCAAAAACGCCTCACCTTCATTTCATCTTCATTTACTCAATTTCATCTTCATTTACTCAATTTCATCTTCATTCTCAGAAAGTTGGGATAGTGCCTTCTTTTTTGGTGGGTCATCTGCCCCGGTTAAGATTCAATTTGAATTCGACGAGAAAGTATTGTTCTTGTTTTCTCCGGTTGTGGATGTTGTGAGCTACCAGAACTGGGATTTGATTGAAGAAGATTTGATACACACATTTTACTTTGTTTCGGGTTAGTCGCGTTGTAGGCTTCTTTTCTTTTGTTGCTTTATGCAATTTTTGAGTTATGGCTATCCATTGGTTTTCGCTATCGATAATTTAGATGGGTTCATGTGATGACCCGGCTGTGATCGGGGAACCGTTTCGTGCCTCTGTTACTCGGCTGGTCAGGTGTTCGAGTCAACCTCTTCCCGAGCATCAGTCGCACCAGGAGATGGCTTCCTTTTCATCTAATTCTCGTGAGGGCCAGATGTTTGAGCCAGATAGGGGGCTGGAAGTGACTACGGCTTCTCTCTGTACGAATGCATCGGACCCTGATACTTCTGGGGAGGATGGGACGCTTAGAGGCTTCGAACATGCAGATAGTTTGCTAATGGATAAAAGATTGGATGGTGATTCTGGCGGTAGTGATCCCTGTCTAAACTTGGATAACGAGTCTTGCAACGAGGGGAATAAAACATTGAGCTTGGATATGAAAGAGTCTGAAGACGTTGATGGTTTGGTTGATATTTTGGGATGCGATGCTACCATGGAAATGATTTCTTTAACTGAGTCATTAGTAAATTCTGTGAAACCTGAAGAACTTGATAATAATAGTTGTATAATTGATGCTCCTGCAAAAGTTGAAAGGGATGATACTGCACAAAATGGTCCTATTTTAGCAGGGACGGGTACTCGTACAGATGACTTAAAATCTTCTTATGTCTGTGAAATTGTTTCTAATTCAGCTTCGGCTGATGGACTGCCAAATGATTTCATACAGAAAAACGAGCTGGAAAATGATGGTGCTGGTTGTTCATTTTCTGAGGTTGCAGATAGGATAACTGAGGCTTCGGTTGAACTAGAAGCAGATATGTTGAATGAGATGTCCCCTTTACAGAGTGGTCAAATACTACCAATACATGTGGGGCAATCAATTGCCAACTATGATCGGTATGTTTGCCGGATGGACGGGAAGAGCTTAAGTAGCACCTCTGGAGAAACAGTAACTGTAGTTGCTGATATGAACAGCAACCCTGAAGGATGCCTGCAAATGTTGCCTTCCCAGGGATGCGATAGGATTGGGGAATGCTTGCAATCTGATGGTTTACCACTAACTATTAATGCTTCAGAGAATGATTTGTGTGAGGAAAAGCATGACAGTAATTCCTCATCCAAGTACGTTCCAGATGTTGGGGGAGATGATAGTGATGTCTTGACTAACAATAATAGTGATGGTGGACAACATACAGTTCCGGGGATTGGAAATGACCATAATCTGGAGGACGCTACTGTTCAAGTGAACCACGACTGTGTCGAACTACTTTCATCGCCCTTGCCTTCTCAGCTCCCTAATTCTGAGAAAGATGAATTTTATGGAATGTTGAATGGAGCAGATATTCCAATAAAATATATTAGTTCTGTTAATTCATGTAGCGTCGGTGATCAAGACAACAATGACATAGAGAAAGTTGGCTGTGTTTCTGAAGTTAAATGTCCTGAAACAGTTATCACGTCTTCTAAGAGGAGTGGCCGAAGGAGAACATCAAGCCAAAAAACTGTGACAAAAAGGGCTTCCAGGAAAACA
AAAAAAAAGGTGCCAGAGCCACTGATTTTTGACACTGCAAGAAGGAGGAGAAGCTCTATATCTAGACCTGCTCGTCCCTCACCCTGGGGATCACTGGGCCATATTATTCAGTCATTTGAAGAAATTGATGATGTTCTGGTAAATCAAACCCAGAAGCAAGGAAATGAGAAATCTAAAGGTAATCAAGGAGGTGCCAAGCGGAATAAGAAACAGCTAAGTGAAAGTTCACATAGATCAAGAAAAGGGACCCAAGGAAAATCTGCTACTTCAACTTCAACCAACCGTATTCGTCTGAAGGTTAAATTAGGTAAAAACGTGGGTCATAATTTTCTGAACATTGTGGTTCCCGAAATTGTTGATTCATCATTGTCTGCCAAGGGTGTCAACTGCAATTATGGCAATGAATCATATTGGGAAGGTAATTTGGAATTCCCTCCATCAAACCTTGGTGTTGATGACCAAAAGGCCGAGGAGGAGGGGCCTTTAAGAAAGATCTTCTGCTACAGCAGGAATCAGGATAAAGAAGATAATTGTCCTGATGCTTCTGTTGTGAATGAACAATGTACTAATAATGATTCAAGTTGCATCGTTGGTATAGACAAGTCCTCTGAAAAACATGCAGATGATAATCTCTGTGTTTCCTCCCATTTGGTTGATCCTGTAGCGACAAGTGATGCCAGGAGTTTGGATCCTGGAACTTCGCCTGATTCAGAAGTGATAAATTCAGTTTTAGATATTCAAGTCGGAGCAGCACGTCAGGAAATTTTGCAGGACTCGGTTTTGGCATCCTTAGAAGATTTTGCCGCTTCTGGAAATGCTCCCGGTAGTAAGAAAGGTAGGAAGAAGGACAAACCCTCTCGGGTAGTTAGTTGTTCTGAGGAACGTGGCATAAGTGTTTCTGCTTGCAGTAACAGGTCCAAGTCATCAAAGAAGCACGGAAGAAGACATAATGTGGACAATCAACTTAGTTCTGGTGAGACACTTACTTACGCTGATGCAAATGTTTTAAACTACTCTTTAACTGTTAAGGAATTGTCTATGGAGCAAGTGTCTTTGTTGACAGAGATTGAACTTCCTGAAGAGACTTTGAAAGCAGAAGACATTCTCAATGATAAAGAATGTTGCAGAGCAGATGTTGGCAGTGTGTTTTCTGAATCAGAGAATTCAAAAACATTTCTTCCTTCTCAATCCGCAAAGAAGAAACATCCCAAAGGTTCTAAATCTATTAAAACTAGCAAAGGCAAGTCAAAGGCTCCTGGCTCAAAAAACAAAATAAAGAATGCTTCAAATGAAAGGGTTTACCAACGGAAGTCTTTTAAAAATAGTAAAAGCAAGGAGGCTCTATGTGACCAAGTTGTGACTGAAACAGAAAGTCACCAAATCATAGGAAATTGTCTTGTCGACAAGCCTGAGAAAAGCGATAACATTATTGCATCCACGGTGGCAGTAGATTTGAGTGTGGTACAGGGTGCTGTGAATGAGCAGTATATGCCTCCTCGCAATGCTTGGGTGCTCTGCGATGACTGTCATAAATGGCGACGCATACCAGCTTCTCTTGTTGATAGTTTAGGACATGCAAGTTGCACATGGACTTGTAAGGACAATGTGGATAAAGCTTTTGCTAATTGCTCAATCCCACAAGAGAAGTCGAATGCAGAGATTAATGCAGAGTTGGAAATATCAGATGAATCTGGGGAAGAAAATGGTTCCAAAAAACGGCTAACTTACAGGGAATTGGAGAGTTTTCATCCAGCAACAGTGAATGCTGTTCCTCAACAGAACAAATTTGCTTCCATTAGCAGCAATCAGTTTTTGCACCGCAGTCGTAAAACCCAGACTATTGATGAGATAATGGTTTGTCACTGCAAACCAGCTTTGGATGGTCGGTTAGGTTGTGGAGATGAATGCTTGAACCGAATGCTCAATATTGAATGTGTACGAGGTACATGTCCTTGTGGGGAACTTTGTTCTAATCAACAGTTCCAGAAGCGTAAGTATGC
TAAATTACAGTGGTTGCGATGTGGAAAGAAAGGTTATGGGCTACAGTTGCTTGAGGATATATCAAAAGGACAATTTCTCATTGAGTATGTTGGAGAGGTCCTTGACATGCATGCTTATGAGGCACGCCAAAAGGAATATGCATTGAATGGTCATCGGCATTTTTACTTCATGACCCTAAATGGCAGCGAGGTCATAGACGCATGTGGAAAGGGAAATTTGGGGCGTTTCATTAATCACAGTTGTGATCCAAATTGCCGCACGGAGAAGTGGATGGTGAATGGAGAAATCTGCATTGGGCTCTTTGCACTAAGGGATATTAAGAAGGGGGAAGAGGTGACATTTGACTACAATTATGTAAGGGTATTTGGGGCTGCTGCAAAAAAATGTTATTGTGGTTCTTTCCATTGTCGAGGTTATATAGGTGGAGACCCCCTCAATTCTGAGGTCATTATTCAAAGTGATTCAGATGAAGAATTCCCCGAACCTGTGATGCTTCGAGGAGATGGAAGAAGTTTGAATAGTAACTTGTCAACTGCAGTTAGTTCAATGGATGTTGCTAAAATGCAATCCTCTGAGCATCTAAAAGGGAATAGGGATAAGAGAGATCAACCTATCAGAATTGCTAGTGAATTGAAGATTTCAGAAGAAAAAGTGGATCCTCTTAAGCTTTCTGCGTCGAAGATTTCAGAAGAAAAAGAGGATCCTCTTAAACTTTCTGCAACGAAGATTTCAGAAGAAAAAGAAGATCCTCTTAACCTTTCTGCCTCTACCATTTCTCCATTGCATAGTTCATTGGAATTCGAAGATTCGAAGGTAGCATCACCTATTCCAGTGCCAGATATTACCCATCAGACTGAGGATGTGACAAGCCAACCTATCTTTGTTGATCAGACAGAAATATCTCTTTTGGACAATATTCCTGACAAAAATACATGCTCTATTGAGCAGGAGGCAAAGTTATCAGTGGATGACATTGACGCTCGTAAGAAGTCCAAGCTGGATTCTGTTGAAGATAAGCAAGTGTATATAAAATCGCATCCTCGGATGAAAACTTCGCGTAAACTAGGTTCCATCAAGAAAGGAAAAGTTAGCTCTGCAGAAAAAATACAAATAACTAACAGGTCCCAGATTTCCTCTGTAAAGCCCAAGCGATTGATTGAAGGTTCTCCGGGTAACCGCTTTGAAGCAGTTGAGGAGAAGCTTAATGAACTTCTGGATGCTGAAGGGGGAATTAGCAAAAGAAAAGATGCCCCTAAAGGGTACTTAAAGCTTCTTCTTCTGACTGCTGCATCGGGTGCAAGTGCTAGCGGTGAAGCAATTCAGAGCAATCGGGATCTTTCAATGATCCTTGATGCTCTTTTGAAGACAAAGTCACGATTAGTATTGACTGACATAATAAACAAAAATGGTTTGCGGATGCTGCATAACATAATGAAGCAATATAGAAGTGACTTCAAAAAGATACCTATTCTTCGGAAGCTTTTGAAGGTCCTGGAATACTTGGTGACAAGAGAGATACTCACTTCAGAGCATATCAATGGCGGTCCCCCTTGCCCTGGAATGGAAAGTTTGAGAGAGTCCTTATTGTCACTGACAGAGCATGATGACAAACAGGTACATCAAATTGCCCGAAGTTTTCGGGACAGATGGTTCCCAAGACATACTAGAAAATTTGGTTATTCTGAGAGAGAGGATGGGAGATTGGAAGTTTACAGGGGTTCAAACAGCAGTAGGTTTACTGCATCACACAGTTTTCGGCACGATCAGGATTGCAGACCCACAGATGCAATTGATTGTATCAAGCAGTCGATGCCTACATCTTTGCCAGATGCTCATCCTGCAGAGGTCTGTTCTCTGGCTTCCGCAGCTAGTCACTCAGTGAATGGACAAAAAGTCCGTAAGCGTAAGAGTCGATGGGATCAGCCTGCAGATACGAGCCTAGATCTGAGATCTAAGGAGCAGAAGCTTGAATCAACATCGGTGCAGGAATTGAATTCCA
GCCAATTAAATAGTGTTGGAGCAGCATCAATGCTGATAGACAAAGTAAACAATGATGATAAGGACATCTCCCTCTCTGATTCTGTTGGAGTACCTTGTCGGCAAGATGAAGATATAAGGGCTGACAGTGCAGTGCCAAACATCCCTGAAGATATTCCTCCTGGATTTTCATCTCCCTTCAATCCCCCTGTTGCTTCCTCAAGTGCTTTTTCAGCAGTTTTAGATCCTCCTCGACAGAATATTGGCGATTTGAGTTGTGCTTTTTCCACAGTTGGGCATCTACAGGAAAGATTTATTTCTCGCTTGCCTGTATCCTATGGAATTCCATTTTCTATCATTGAGCAATGTGGAACATCTCATGCAGAGAATCTGGAGTGTTGGGATGTTGCTCCTGGAGTGCCTTTTCATCCTTTTCCACCCCTACCACCATATCCCCGAGGTATGAGAGGCCTGCCTACATCTGCCTGTGGTACTGCTGGACAATCGTCTCAGGAAGGGCAGGTGAACAGCCATGATTCTCGAACTTCTTTCTCAGAAGAAAGCCCTCCTAGTACAAGTACTAATTACCAAACAGATTTGTGCACTCCATCAAACAACCAGCAGATAGCGAAACGGGCAAAAGAATCATCATGTGACTTGGGACGAAGATACTTTAGGCAGCAGAAGTGGCGTAATACAAAGTTTGGCCCCCCTTGGTTACAGAGAAGAAGTCAGTGGGGATGCCAGGGGAACTTCAGGGGTGGGGTGAGCACTATAGGTGATGAAAATATTCCCGACGAGGAAATAAGTCCATATTGCTCGGATGAAGCAAGCGGTAGAGTGGATAAAGCTAATGGTGATTTTTATCAGCATTTGCAGAACCAAAATCTGCGTTAGTCTTAGGAAAATAATGATCTGAAATTTCAATAGAACAATGTATTCTCAGTTCTTTTTCTTACTTAGGTTTGTACTCGATGAAGTTTCTTCAACTTCAGTATGGTAGTGCTCTTACTCCAGCATTAGTAGTAGTACTATTCATGAGGTTGATTATTGCTTCCATTAGCATATGAGGTTTTGGATCTTTCTCCATCAATCAACACCTCAAAGAGGTAGGCATTCTTATTTGTATATCATTTTCTCACAAAAATTAAATTGTGACACGCAATGTTTAAAAATCTCGGAAAAATAAAATTTTCATTTATTCTTGCTGTCATTTGCCTATGATTATTGGAGATTCAAAATATTTCTTCTTGATTTTCTCTTAAGCTATTTCAACTATCCTGTAAATTACATGGACCCATGTCGGATGAAAGTATTTTGAGCATTTGTATGAATGAACACTGCTTTTATTTTTACCCTTTACTTGTTACACAGGAAATGATTTATTAGAAGTTTT
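
The mRNA sequence above contains the coding sequence listed in the next section, preceded by a 5' UTR. The sketch below shows one way to locate the CDS within the mRNA and translate its first codons. It uses short excerpts copied from this record rather than the full sequences, and it assumes the standard genetic code. The variable names and the `translate` helper are hypothetical.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(seq):
    """Translate complete codons of an uppercase DNA string ('*' marks a stop)."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# Excerpts only: the mRNA around the annotated start codon, and the CDS start.
mrna_excerpt = "GCTATCGATAATTTAGATGGGTTCATGTGATGACCCGGCTGTG"
cds_excerpt = "ATGGGTTCATGTGATGACCCGGCTGTG"

# Offset of the CDS within the mRNA excerpt; with the full sequences this
# offset would equal the length of the 5' UTR.
offset = mrna_excerpt.find(cds_excerpt)
protein_start = translate(cds_excerpt)
print(offset, protein_start)
```

Applying the same `find` call to the full mRNA and CDS strings would give the 5' UTR length directly. Note that the mRNA may contain ATG triplets upstream of the annotated start, so the offset should come from matching the annotated CDS rather than from searching for the first ATG.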

Coding sequence (CDS)

ATGGGTTCATGTGATGACCCGGCTGTGATCGGGGAACCGTTTCGTGCCTCTGTTACTCGGCTGGTCAGGTGTTCGAGTCAACCTCTTCCCGAGCATCAGTCGCACCAGGAGATGGCTTCCTTTTCATCTAATTCTCGTGAGGGCCAGATGTTTGAGCCAGATAGGGGGCTGGAAGTGACTACGGCTTCTCTCTGTACGAATGCATCGGACCCTGATACTTCTGGGGAGGATGGGACGCTTAGAGGCTTCGAACATGCAGATAGTTTGCTAATGGATAAAAGATTGGATGGTGATTCTGGCGGTAGTGATCCCTGTCTAAACTTGGATAACGAGTCTTGCAACGAGGGGAATAAAACATTGAGCTTGGATATGAAAGAGTCTGAAGACGTTGATGGTTTGGTTGATATTTTGGGATGCGATGCTACCATGGAAATGATTTCTTTAACTGAGTCATTAGTAAATTCTGTGAAACCTGAAGAACTTGATAATAATAGTTGTATAATTGATGCTCCTGCAAAAGTTGAAAGGGATGATACTGCACAAAATGGTCCTATTTTAGCAGGGACGGGTACTCGTACAGATGACTTAAAATCTTCTTATGTCTGTGAAATTGTTTCTAATTCAGCTTCGGCTGATGGACTGCCAAATGATTTCATACAGAAAAACGAGCTGGAAAATGATGGTGCTGGTTGTTCATTTTCTGAGGTTGCAGATAGGATAACTGAGGCTTCGGTTGAACTAGAAGCAGATATGTTGAATGAGATGTCCCCTTTACAGAGTGGTCAAATACTACCAATACATGTGGGGCAATCAATTGCCAACTATGATCGGTATGTTTGCCGGATGGACGGGAAGAGCTTAAGTAGCACCTCTGGAGAAACAGTAACTGTAGTTGCTGATATGAACAGCAACCCTGAAGGATGCCTGCAAATGTTGCCTTCCCAGGGATGCGATAGGATTGGGGAATGCTTGCAATCTGATGGTTTACCACTAACTATTAATGCTTCAGAGAATGATTTGTGTGAGGAAAAGCATGACAGTAATTCCTCATCCAAGTACGTTCCAGATGTTGGGGGAGATGATAGTGATGTCTTGACTAACAATAATAGTGATGGTGGACAACATACAGTTCCGGGGATTGGAAATGACCATAATCTGGAGGACGCTACTGTTCAAGTGAACCACGACTGTGTCGAACTACTTTCATCGCCCTTGCCTTCTCAGCTCCCTAATTCTGAGAAAGATGAATTTTATGGAATGTTGAATGGAGCAGATATTCCAATAAAATATATTAGTTCTGTTAATTCATGTAGCGTCGGTGATCAAGACAACAATGACATAGAGAAAGTTGGCTGTGTTTCTGAAGTTAAATGTCCTGAAACAGTTATCACGTCTTCTAAGAGGAGTGGCCGAAGGAGAACATCAAGCCAAAAAACTGTGACAAAAAGGGCTTCCAGGAAAACAAAAAAAAAGGTGCCAGAGCCACTGATTTTTGACACTGCAAGAAGGAGGAGAAGCTCTATATCTAGACCTGCTCGTCCCTCACCCTGGGGATCACTGGGCCATATTATTCAGTCATTTGAAGAAATTGATGATGTTCTGGTAAATCAAACCCAGAAGCAAGGAAATGAGAAATCTAAAGGTAATCAAGGAGGTGCCAAGCGGAATAAGAAACAGCTAAGTGAAAGTTCACATAGATCAAGAAAAGGGACCCAAGGAAAATCTGCTACTTCAACTTCAACCAACCGTATTCGTCTGAAGGTTAAATTAGGTAAAAACGTGGGTCATAATTTTCTGAACATTGTGGTTCCCGAAATTGTTGATTCATCATTGTCTGCCAAGGGTGTCAACTGCAATTATGGCAATGAATCATATTGGGAAGGTAATTTGGAATTCCCTCCATCAAACCTTGGTGTTGATGACCAAAAGGCCGAGGAGGAGGGGCCTTTAAGAAAGATCTTCTGCTACAGCAGGAATCAGGATAAAGAAGATAATTGTCC
TGATGCTTCTGTTGTGAATGAACAATGTACTAATAATGATTCAAGTTGCATCGTTGGTATAGACAAGTCCTCTGAAAAACATGCAGATGATAATCTCTGTGTTTCCTCCCATTTGGTTGATCCTGTAGCGACAAGTGATGCCAGGAGTTTGGATCCTGGAACTTCGCCTGATTCAGAAGTGATAAATTCAGTTTTAGATATTCAAGTCGGAGCAGCACGTCAGGAAATTTTGCAGGACTCGGTTTTGGCATCCTTAGAAGATTTTGCCGCTTCTGGAAATGCTCCCGGTAGTAAGAAAGGTAGGAAGAAGGACAAACCCTCTCGGGTAGTTAGTTGTTCTGAGGAACGTGGCATAAGTGTTTCTGCTTGCAGTAACAGGTCCAAGTCATCAAAGAAGCACGGAAGAAGACATAATGTGGACAATCAACTTAGTTCTGGTGAGACACTTACTTACGCTGATGCAAATGTTTTAAACTACTCTTTAACTGTTAAGGAATTGTCTATGGAGCAAGTGTCTTTGTTGACAGAGATTGAACTTCCTGAAGAGACTTTGAAAGCAGAAGACATTCTCAATGATAAAGAATGTTGCAGAGCAGATGTTGGCAGTGTGTTTTCTGAATCAGAGAATTCAAAAACATTTCTTCCTTCTCAATCCGCAAAGAAGAAACATCCCAAAGGTTCTAAATCTATTAAAACTAGCAAAGGCAAGTCAAAGGCTCCTGGCTCAAAAAACAAAATAAAGAATGCTTCAAATGAAAGGGTTTACCAACGGAAGTCTTTTAAAAATAGTAAAAGCAAGGAGGCTCTATGTGACCAAGTTGTGACTGAAACAGAAAGTCACCAAATCATAGGAAATTGTCTTGTCGACAAGCCTGAGAAAAGCGATAACATTATTGCATCCACGGTGGCAGTAGATTTGAGTGTGGTACAGGGTGCTGTGAATGAGCAGTATATGCCTCCTCGCAATGCTTGGGTGCTCTGCGATGACTGTCATAAATGGCGACGCATACCAGCTTCTCTTGTTGATAGTTTAGGACATGCAAGTTGCACATGGACTTGTAAGGACAATGTGGATAAAGCTTTTGCTAATTGCTCAATCCCACAAGAGAAGTCGAATGCAGAGATTAATGCAGAGTTGGAAATATCAGATGAATCTGGGGAAGAAAATGGTTCCAAAAAACGGCTAACTTACAGGGAATTGGAGAGTTTTCATCCAGCAACAGTGAATGCTGTTCCTCAACAGAACAAATTTGCTTCCATTAGCAGCAATCAGTTTTTGCACCGCAGTCGTAAAACCCAGACTATTGATGAGATAATGGTTTGTCACTGCAAACCAGCTTTGGATGGTCGGTTAGGTTGTGGAGATGAATGCTTGAACCGAATGCTCAATATTGAATGTGTACGAGGTACATGTCCTTGTGGGGAACTTTGTTCTAATCAACAGTTCCAGAAGCGTAAGTATGCTAAATTACAGTGGTTGCGATGTGGAAAGAAAGGTTATGGGCTACAGTTGCTTGAGGATATATCAAAAGGACAATTTCTCATTGAGTATGTTGGAGAGGTCCTTGACATGCATGCTTATGAGGCACGCCAAAAGGAATATGCATTGAATGGTCATCGGCATTTTTACTTCATGACCCTAAATGGCAGCGAGGTCATAGACGCATGTGGAAAGGGAAATTTGGGGCGTTTCATTAATCACAGTTGTGATCCAAATTGCCGCACGGAGAAGTGGATGGTGAATGGAGAAATCTGCATTGGGCTCTTTGCACTAAGGGATATTAAGAAGGGGGAAGAGGTGACATTTGACTACAATTATGTAAGGGTATTTGGGGCTGCTGCAAAAAAATGTTATTGTGGTTCTTTCCATTGTCGAGGTTATATAGGTGGAGACCCCCTCAATTCTGAGGTCATTATTCAAAGTGATTCAGATGAAGAATTCCCCGAACCTGTGATGCTTCGAGGAGATGGAAGAAGTTTGAATAGTAACTTGTCAACTG
CAGTTAGTTCAATGGATGTTGCTAAAATGCAATCCTCTGAGCATCTAAAAGGGAATAGGGATAAGAGAGATCAACCTATCAGAATTGCTAGTGAATTGAAGATTTCAGAAGAAAAAGTGGATCCTCTTAAGCTTTCTGCGTCGAAGATTTCAGAAGAAAAAGAGGATCCTCTTAAACTTTCTGCAACGAAGATTTCAGAAGAAAAAGAAGATCCTCTTAACCTTTCTGCCTCTACCATTTCTCCATTGCATAGTTCATTGGAATTCGAAGATTCGAAGGTAGCATCACCTATTCCAGTGCCAGATATTACCCATCAGACTGAGGATGTGACAAGCCAACCTATCTTTGTTGATCAGACAGAAATATCTCTTTTGGACAATATTCCTGACAAAAATACATGCTCTATTGAGCAGGAGGCAAAGTTATCAGTGGATGACATTGACGCTCGTAAGAAGTCCAAGCTGGATTCTGTTGAAGATAAGCAAGTGTATATAAAATCGCATCCTCGGATGAAAACTTCGCGTAAACTAGGTTCCATCAAGAAAGGAAAAGTTAGCTCTGCAGAAAAAATACAAATAACTAACAGGTCCCAGATTTCCTCTGTAAAGCCCAAGCGATTGATTGAAGGTTCTCCGGGTAACCGCTTTGAAGCAGTTGAGGAGAAGCTTAATGAACTTCTGGATGCTGAAGGGGGAATTAGCAAAAGAAAAGATGCCCCTAAAGGGTACTTAAAGCTTCTTCTTCTGACTGCTGCATCGGGTGCAAGTGCTAGCGGTGAAGCAATTCAGAGCAATCGGGATCTTTCAATGATCCTTGATGCTCTTTTGAAGACAAAGTCACGATTAGTATTGACTGACATAATAAACAAAAATGGTTTGCGGATGCTGCATAACATAATGAAGCAATATAGAAGTGACTTCAAAAAGATACCTATTCTTCGGAAGCTTTTGAAGGTCCTGGAATACTTGGTGACAAGAGAGATACTCACTTCAGAGCATATCAATGGCGGTCCCCCTTGCCCTGGAATGGAAAGTTTGAGAGAGTCCTTATTGTCACTGACAGAGCATGATGACAAACAGGTACATCAAATTGCCCGAAGTTTTCGGGACAGATGGTTCCCAAGACATACTAGAAAATTTGGTTATTCTGAGAGAGAGGATGGGAGATTGGAAGTTTACAGGGGTTCAAACAGCAGTAGGTTTACTGCATCACACAGTTTTCGGCACGATCAGGATTGCAGACCCACAGATGCAATTGATTGTATCAAGCAGTCGATGCCTACATCTTTGCCAGATGCTCATCCTGCAGAGGTCTGTTCTCTGGCTTCCGCAGCTAGTCACTCAGTGAATGGACAAAAAGTCCGTAAGCGTAAGAGTCGATGGGATCAGCCTGCAGATACGAGCCTAGATCTGAGATCTAAGGAGCAGAAGCTTGAATCAACATCGGTGCAGGAATTGAATTCCAGCCAATTAAATAGTGTTGGAGCAGCATCAATGCTGATAGACAAAGTAAACAATGATGATAAGGACATCTCCCTCTCTGATTCTGTTGGAGTACCTTGTCGGCAAGATGAAGATATAAGGGCTGACAGTGCAGTGCCAAACATCCCTGAAGATATTCCTCCTGGATTTTCATCTCCCTTCAATCCCCCTGTTGCTTCCTCAAGTGCTTTTTCAGCAGTTTTAGATCCTCCTCGACAGAATATTGGCGATTTGAGTTGTGCTTTTTCCACAGTTGGGCATCTACAGGAAAGATTTATTTCTCGCTTGCCTGTATCCTATGGAATTCCATTTTCTATCATTGAGCAATGTGGAACATCTCATGCAGAGAATCTGGAGTGTTGGGATGTTGCTCCTGGAGTGCCTTTTCATCCTTTTCCACCCCTACCACCATATCCCCGAGGTATGAGAGGCCTGCCTACATCTGCCTGTGGTACTGCTGGACAATCGTCTCAGGAAGGGCAGGTGAACAGCCATGATTCTCGAACTTCTTTCTCAGAA
GAAAGCCCTCCTAGTACAAGTACTAATTACCAAACAGATTTGTGCACTCCATCAAACAACCAGCAGATAGCGAAACGGGCAAAAGAATCATCATGTGACTTGGGACGAAGATACTTTAGGCAGCAGAAGTGGCGTAATACAAAGTTTGGCCCCCCTTGGTTACAGAGAAGAAGTCAGTGGGGATGCCAGGGGAACTTCAGGGGTGGGGTGAGCACTATAGGTGATGAAAATATTCCCGACGAGGAAATAAGTCCATATTGCTCGGATGAAGCAAGCGGTAGAGTGGATAAAGCTAATGGTGATTTTTATCAGCATTTGCAGAACCAAAATCTGCGTTAG

Protein sequence

MGSCDDPAVIGEPFRASVTRLVRCSSQPLPEHQSHQEMASFSSNSREGQMFEPDRGLEVTTASLCTNASDPDTSGEDGTLRGFEHADSLLMDKRLDGDSGGSDPCLNLDNESCNEGNKTLSLDMKESEDVDGLVDILGCDATMEMISLTESLVNSVKPEELDNNSCIIDAPAKVERDDTAQNGPILAGTGTRTDDLKSSYVCEIVSNSASADGLPNDFIQKNELENDGAGCSFSEVADRITEASVELEADMLNEMSPLQSGQILPIHVGQSIANYDRYVCRMDGKSLSSTSGETVTVVADMNSNPEGCLQMLPSQGCDRIGECLQSDGLPLTINASENDLCEEKHDSNSSSKYVPDVGGDDSDVLTNNNSDGGQHTVPGIGNDHNLEDATVQVNHDCVELLSSPLPSQLPNSEKDEFYGMLNGADIPIKYISSVNSCSVGDQDNNDIEKVGCVSEVKCPETVITSSKRSGRRRTSSQKTVTKRASRKTKKKVPEPLIFDTARRRRSSISRPARPSPWGSLGHIIQSFEEIDDVLVNQTQKQGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQGKSATSTSTNRIRLKVKLGKNVGHNFLNIVVPEIVDSSLSAKGVNCNYGNESYWEGNLEFPPSNLGVDDQKAEEEGPLRKIFCYSRNQDKEDNCPDASVVNEQCTNNDSSCIVGIDKSSEKHADDNLCVSSHLVDPVATSDARSLDPGTSPDSEVINSVLDIQVGAARQEILQDSVLASLEDFAASGNAPGSKKGRKKDKPSRVVSCSEERGISVSACSNRSKSSKKHGRRHNVDNQLSSGETLTYADANVLNYSLTVKELSMEQVSLLTEIELPEETLKAEDILNDKECCRADVGSVFSESENSKTFLPSQSAKKKHPKGSKSIKTSKGKSKAPGSKNKIKNASNERVYQRKSFKNSKSKEALCDQVVTETESHQIIGNCLVDKPEKSDNIIASTVAVDLSVVQGAVNEQYMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWTCKDNVDKAFANCSIPQEKSNAEINAELEISDESGEENGSKKRLTYRELESFHPATVNAVPQQNKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCPCGELCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDMHAYEARQKEYALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGDPLNSEVIIQSDSDEEFPEPVMLRGDGRSLNSNLSTAVSSMDVAKMQSSEHLKGNRDKRDQPIRIASELKISEEKVDPLKLSASKISEEKEDPLKLSATKISEEKEDPLNLSASTISPLHSSLEFEDSKVASPIPVPDITHQTEDVTSQPIFVDQTEISLLDNIPDKNTCSIEQEAKLSVDDIDARKKSKLDSVEDKQVYIKSHPRMKTSRKLGSIKKGKVSSAEKIQITNRSQISSVKPKRLIEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSRLVLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTREILTSEHINGGPPCPGMESLRESLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFGYSEREDGRLEVYRGSNSSRFTASHSFRHDQDCRPTDAIDCIKQSMPTSLPDAHPAEVCSLASAASHSVNGQKVRKRKSRWDQPADTSLDLRSKEQKLESTSVQELNSSQLNSVGAASMLIDKVNNDDKDISLSDSVGVPCRQDEDIRADSAVPNIPEDIPPGFSSPFNPPVASSSAFSAVLDPPRQNIGDLSCAFSTVGHLQERFISRLPVSYGIPFSIIEQCGTSHAENLECWDVAPGVPFHPFPPLPPYPRGMRGLPTSACGTAGQSSQEGQVNSHDSRTSFSE
ESPPSTSTNYQTDLCTPSNNQQIAKRAKESSCDLGRRYFRQQKWRNTKFGPPWLQRRSQWGCQGNFRGGVSTIGDENIPDEEISPYCSDEASGRVDKANGDFYQHLQNQNLR*
Homology
BLAST of CsGy6G019510 vs. ExPASy Swiss-Prot
Match: Q2LAE1 (Histone-lysine N-methyltransferase ASHH2 OS=Arabidopsis thaliana OX=3702 GN=ASHH2 PE=1 SV=1)

HSP 1 Score: 764.2 bits (1972), Expect = 3.8e-219
Identity = 590/1624 (36.33%), Positives = 792/1624 (48.77%), Query Frame = 0

Query: 460  ETVITSSKRSGRR-------RTSSQKTVTKRASRKTKKKVPEPLIFDTARRRRSSISRPA 519
            E  ++SS+R  R        +T +     +++SRK + +     IF  ++++RSS+ + +
Sbjct: 438  EAGMSSSRRKARDGKQCKVVQTKTSARHLRKSSRKKQSERDIESIFKCSKQKRSSLLKTS 497

Query: 520  RPSPWGSLGHIIQSFEEIDDVLVNQTQKQGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQ 579
            R S WG      + F + +++  +       ++S+GN    + N+     SSH       
Sbjct: 498  RSSEWGLPSKTTEIFLQSNNIPYDGPPHHEPQRSQGNLNNGEHNR-----SSHNGNVEGS 557

Query: 580  GKSATSTSTNRIRLKVKLGKNVGHNFLNIVVPEIVDSSLSAKGVNCNYGNESYWEGNLEF 639
             ++  ++S + +RLKVK GK+ G N LNI V ++  +SL   G+    G      G+  F
Sbjct: 558  NRNIQASSGSCLRLKVKFGKSGGQNPLNITVSKVSGNSLPGNGI-VKAGTCLELPGSAHF 617

Query: 640  PPSNLGVDDQK---AEEEGPLRKIFCYSRNQDKEDNCPDASVVNEQCTNNDSSCIVGIDK 699
                +   + K    E+  P+ K+  Y ++ D          + ++  N D+  +     
Sbjct: 618  GEDKMQTVETKEDLVEKSNPVEKV-SYLQSSDS---------MRDKKYNQDAGGLCRKVG 677

Query: 700  SSEKHADDNLCVSSHLVDPVATSDARSLDPGTSPDSEVINSVLDIQVGAARQEILQDSVL 759
                  D +L     + +    +  +SLD  TSPDSEVINSV                  
Sbjct: 678  GDVLDDDPHLSSIRMVEECERATGTQSLDAETSPDSEVINSV------------------ 737

Query: 760  ASLEDFAASGNAPGSKKGRKKDKPSRVVSCSEERGISVSACSNRSKSSKKHGRRHNVDNQ 819
                                   P  +V+   + G+              HG        
Sbjct: 738  -----------------------PDSIVNIEHKEGL-------------HHG-------- 797

Query: 820  LSSGETLTYADANVLNYSLTVKELSMEQVSLLTEIELPEETLKAEDILNDKECCRADVGS 879
                                                 PE+ +K   +L  ++  RA    
Sbjct: 798  ---------------------------------FFSTPEDVVKKNRVLEKEDELRASK-- 857

Query: 880  VFSESENSKTFLPSQSAKKKHPKGSKSIKTSKGKSK-APGSKNKIKNASNERVYQRKSFK 939
              S SEN    +P+ + K KHPK SKS  T KGKSK +  +K+  KN S+E V QRKS  
Sbjct: 858  --SPSENGSHLIPN-AKKAKHPK-SKSNGTKKGKSKFSESAKDGRKNESHEGVEQRKSLN 917

Query: 940  NSKSKEALCDQVVTETESHQIIGNCL---VDKPEKSDNIIASTVAVDLSVVQGAVNEQYM 999
             S  ++      V   ESH+  G  L   + K   +   I+S V     VV   + + Y 
Sbjct: 918  TSMGRDDSDYPEVGRIESHKTTGALLDADIGKTSATYGTISSDVTHGEMVVDVTIEDSY- 977

Query: 1000 PPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWTCKDNVDKAFANCSIPQEKSNAEINAE 1059
               +AWV CDDC KWRRIPAS+V S+  +S  W C +N DK FA+CS  QE SN EIN E
Sbjct: 978  STESAWVRCDDCFKWRRIPASVVGSIDESS-RWICMNNSDKRFADCSKSQEMSNEEINEE 1037

Query: 1060 LEISDESGEENGSKKRLTYRELESFHPATVNAVPQQNKFASISSNQFLHRSRKTQTIDEI 1119
            L I  +  +          +E E           Q+  F +I +NQFLHR+RK+QTIDEI
Sbjct: 1038 LGIGQDEADAYDCDAAKRGKEKEQKSKRLTG--KQKACFKAIKTNQFLHRNRKSQTIDEI 1097

Query: 1120 MVCHCKPALDGRLGCGDECLNRMLNIECVRGTCPCGELCSNQQFQKRKYAKLQWLRCGKK 1179
            MVCHCKP+ DGRLGCG+ECLNRMLNIEC++GTCP G+LCSNQQFQKRKY K +  + GKK
Sbjct: 1098 MVCHCKPSPDGRLGCGEECLNRMLNIECLQGTCPAGDLCSNQQFQKRKYVKFERFQSGKK 1157

Query: 1180 GYGLQLLEDISKGQFLIEYVGEVLDMHAYEARQKEYALNGHRHFYFMTLNGSEVIDACGK 1239
            GYGL+LLED+ +GQFLIEYVGEVLDM +YE RQKEYA  G +HFYFMTLNG+EVIDA  K
Sbjct: 1158 GYGLRLLEDVREGQFLIEYVGEVLDMQSYETRQKEYAFKGQKHFYFMTLNGNEVIDAGAK 1217

Query: 1240 GNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVFGAAAKKCY 1299
            GNLGRFINHSC+PNCRTEKWMVNGEIC+G+F+++D+KKG+E+TFDYNYVRVFGAAAKKCY
Sbjct: 1218 GNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKKGQELTFDYNYVRVFGAAAKKCY 1277

Query: 1300 CGSFHCRGYIGGDPLNSEVIIQSDSDEEFPEPVMLRGDGRS---LNSNLSTAVSSMDVAK 1359
            CGS HCRGYIGGDPLN +VIIQSDSDEE+PE V+L  D      L +   T     D   
Sbjct: 1278 CGSSHCRGYIGGDPLNGDVIIQSDSDEEYPELVILDDDESGEGILGATSRTFTDDADEQM 1337

Query: 1360 MQSSEHLKGNRDKRDQPIRIAS--ELKISEEKVDPLKLSASKISEEKEDPLKLSATKISE 1419
             QS E + G +D      +  S   +K+ E ++ P  L  +++ +E    + ++A     
Sbjct: 1338 PQSFEKVNGYKDLAPDNTQTQSSVSVKLPEREIPPPLLQPTEVLKELSSGISITAV---- 1397

Query: 1420 EKEDPLNLSASTISPLHSSLEFEDSKVASPIPVPDITHQTEDVTSQPIFVDQTEISLLDN 1479
            ++E P      + SP  SSL                                        
Sbjct: 1398 QQEVPAEKKTKSTSPTSSSL---------------------------------------- 1457

Query: 1480 IPDKNTCSIEQEAKLSVDDIDARKKSKLDSVEDKQVYIKSHPRMKTSRKLGSIKKGK--- 1539
                        +++S    ++ K +K  S EDK++  +  PRMKTSR   S K+ K   
Sbjct: 1458 ------------SRMSPGGTNSDKTTKHGSGEDKKILPRPRPRMKTSRSSESSKRDKGGI 1517

Query: 1540 ---VSSAEKIQITNRSQISSVKPKRLIEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPK 1599
               V+ A+ I + N+ Q   +K K   + SP    E  E KLNELLDA GGISKR+D+ K
Sbjct: 1518 YPGVNKAQVIPV-NKLQQQPIKSKGSEKVSPS--IETFEGKLNELLDAVGGISKRRDSAK 1577

Query: 1600 GYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSRLVLTDIINKNGLRMLHNIMK 1659
            GYLKLLLLTAAS      E I SNRDLSMILDALLKTKS+ VL DIINKNG         
Sbjct: 1578 GYLKLLLLTAAS-RGTDEEGIYSNRDLSMILDALLKTKSKSVLVDIINKNG--------- 1637

Query: 1660 QYRSDFKKIPILRKLLKVLEYLVTREILTSEHINGGPPCPGMESLRESLLSLTEHDDKQV 1719
                                                 P  GMES ++S+LS TEHDD  V
Sbjct: 1638 -------------------------------------PFAGMESFKDSVLSFTEHDDYTV 1697

Query: 1720 HQIARSFRDRWFPRHTRKFGYSEREDGRLEVYRGSNSSRFTASHSFRHD-QDCRPTDAID 1779
            H IARSFRDRW P+H RK     RE+ R E  R   + RF AS   R+D Q  RP +   
Sbjct: 1698 HNIARSFRDRWIPKHFRKPWRINREE-RSESMRSPINRRFRASQEPRYDHQSPRPAEPAA 1735

Query: 1780 CIKQSMPTSLPDAHPAEVCSLASAASHSVNGQKVRKRKSRWDQPADTSLDLRSKEQKLES 1839
             +  S   +   A  +E  S  ++     NG   RKRKSRWDQP+ T      KEQ++ +
Sbjct: 1758 SVTSSKAATPETASVSEGYSEPNSGLPETNG---RKRKSRWDQPSKT------KEQRIMT 1735

Query: 1840 TSVQELNSSQLNSVGAASMLIDKVNNDDKDISLSDSVGVPCRQDEDIRADSAVPNIPEDI 1899
               Q+ + +  N                                          ++ +D+
Sbjct: 1818 ILSQQTDETNGNQ-----------------------------------------DVQDDL 1735

Query: 1900 PPGFSSPFNPPVASSSAFSAVLDPPRQNIGDLSCAFSTVGHLQERFISRLPVSYGIPFSI 1959
            PPGFSSP       +    A+   P                 Q++F+SRLPVSYGIP SI
Sbjct: 1878 PPGFSSP------CTDVPDAITAQP-----------------QQKFLSRLPVSYGIPLSI 1735

Query: 1960 IEQCGTSHAENLECWDVAPGVPFHPFPPLPPYPRG--MRGLPTSACGTAGQSSQEGQVNS 2019
            + Q G+   E+   W VAPG+PF+PFPPLPP   G         AC     SS  G +  
Sbjct: 1938 VHQFGSPGKEDPTTWSVAPGMPFYPFPPLPPVSHGEFFAKRNVRAC-----SSSMGNL-- 1735

Query: 2020 HDSRTSFSEESPPSTSTNYQTDLCTPSNNQQIAKRAKESSCDLGRRYFRQQKWRNTKFGP 2056
                 ++S E  P+T     TD   P+  +++       S D+G  YFRQQK    +  P
Sbjct: 1998 -----TYSNEILPATPV---TDSTAPTRKREL------FSSDIGTTYFRQQK----QSVP 1735

BLAST of CsGy6G019510 vs. ExPASy Swiss-Prot
Match: Q9BYW2 (Histone-lysine N-methyltransferase SETD2 OS=Homo sapiens OX=9606 GN=SETD2 PE=1 SV=3)

HSP 1 Score: 221.9 bits (564), Expect = 7.0e-56
Identity = 108/260 (41.54%), Positives = 159/260 (61.15%), Query Frame = 0

Query: 1084 FASISSNQFLHRSRKTQTIDEI--MVCHCKP-----ALDGRLGCGDECLNRMLNIECVRG 1143
            F  I  N +L   +K ++  +I  M C C P        G + CG++CLNR+L IEC   
Sbjct: 1473 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIEC-SS 1532

Query: 1144 TCPCGELCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDMHAYEA 1203
             CP G+ CSN++FQ++++A ++ +   KKG+GL+  +D+    F++EY GEVLD   ++A
Sbjct: 1533 RCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKA 1592

Query: 1204 RQKEYALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLF 1263
            R KEYA N + H+YFM L   E+IDA  KGN  RF+NHSC+PNC T+KW VNG++ +G F
Sbjct: 1593 RVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFF 1652

Query: 1264 ALRDIKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGD----------PLNSEVII 1323
              + +  G E+TFDY + R +G  A+KC+CGS +CRGY+GG+           +  E   
Sbjct: 1653 TTKLVPSGSELTFDYQFQR-YGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKKERSR 1712

Query: 1324 QSDSDEEFPEPVMLRGDGRS 1327
            + DS +   E +M  G+G S
Sbjct: 1713 KKDSVDGELEALMENGEGLS 1730

BLAST of CsGy6G019510 vs. ExPASy Swiss-Prot
Match: Q84WW6 (Histone-lysine N-methyltransferase ASHH1 OS=Arabidopsis thaliana OX=3702 GN=ASHH1 PE=1 SV=1)

HSP 1 Score: 207.2 bits (526), Expect = 1.8e-51
Identity = 96/235 (40.85%), Positives = 137/235 (58.30%), Query Frame = 0

Query: 1083 KFASISSNQFLHRSRKTQTIDEIMVCHCKPAL-DGRLGCGDECLNRMLNIECVRGTCPCG 1142
            ++  I  N F +R  K Q  ++I +C CK    D    CG+ CLN + N EC  G CPCG
Sbjct: 16   QYEHIYQNDFSYRKHKKQKEEDISICECKFDFGDPDSACGERCLNVITNTECTPGYCPCG 75

Query: 1143 ELCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDMHAYEARQKEY 1202
              C NQ+FQK +YAK + ++C  +G+GL  LE+I  GQF++EY GEV+     + R + Y
Sbjct: 76   VYCKNQKFQKCEYAKTKLIKCEGRGWGLVALEEIKAGQFIMEYCGEVISWKEAKKRAQTY 135

Query: 1203 ALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDI 1262
              +G +  Y ++LN SE IDA  KG+L RFINHSC PNC T KW V GE+ +G+FA   I
Sbjct: 136  ETHGVKDAYIISLNASEAIDATKKGSLARFINHSCRPNCETRKWNVLGEVRVGIFAKESI 195

Query: 1263 KKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGDP--LNSEVIIQSDSDEEF 1315
                E+ +DYN+   +G A  +C CG+  C G++G        +  +  D D+ +
Sbjct: 196  SPRTELAYDYNF-EWYGGAKVRCLCGAVACSGFLGAKSRGFQEDTYVWEDGDDRY 249

BLAST of CsGy6G019510 vs. ExPASy Swiss-Prot
Match: E9Q5F9 (Histone-lysine N-methyltransferase SETD2 OS=Mus musculus OX=10090 GN=Setd2 PE=1 SV=1)

HSP 1 Score: 207.2 bits (526), Expect = 1.8e-51
Identity = 141/439 (32.12%), Positives = 222/439 (50.57%), Query Frame = 0

Query: 1084 FASISSNQFLHRSRKTQTIDEI--MVCHCKP-----ALDGRLGCGDECLNRMLNIECVRG 1143
            F  I  N +L   +K ++  +I  M C C P        G + CG++CLNR+L IEC   
Sbjct: 1447 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEVACGEDCLNRLLMIEC-SS 1506

Query: 1144 TCPCGELCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDMHAYEA 1203
             CP G+ CSN++FQ++++A ++ +   KKG+GL+  +D+    F++EY GEVLD   ++A
Sbjct: 1507 RCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKA 1566

Query: 1204 RQKEYALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLF 1263
            R KEYA N + H+YFM L   E+IDA  KGN  RF+NHSC+PNC T+KW VNG++ +G F
Sbjct: 1567 RVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFF 1626

Query: 1264 ALRDIKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGDPLNSEVIIQSDSDEEFPE 1323
              + +  G E+TFDY + R +G  A+KC+CGS +CRGY+GG+                  
Sbjct: 1627 TTKLVPSGSELTFDYQFQR-YGKEAQKCFCGSANCRGYLGGE-----------------N 1686

Query: 1324 PVMLRGDGRSLNSNLSTAVSSMDVAKMQSSEHLKGNRDKRD--QPIRIASELKISEEKVD 1383
             V +R  G  +    S    S+D       E+ +G  DK       R+   ++  E+K+ 
Sbjct: 1687 RVSIRAAGGKMKKERSRKKDSVDGELEALMENGEGLSDKNQVLSLSRLMVRIETLEQKLT 1746

Query: 1384 PLKL-------SASKISEEKE--DPLKLSATKISEEKEDPLNLSASTIS-----PLHSSL 1443
             LKL       S  K   E+     L +   ++ + +E    L    I      P+ +  
Sbjct: 1747 CLKLIQNTHSQSCLKSFLERHGLSLLWIWMAELGDGRESNQKLQEEIIKTLEHLPIPTKN 1806

Query: 1444 EFEDSKVASPIPVPDITHQTEDVTSQPIFVDQTEISLLDNIPDKNTCSIEQEAKLSVDDI 1500
              E+SKV     +P I   ++  T+ P      ++S  D    +NT        L+  D 
Sbjct: 1807 MLEESKV-----LPIIQRWSQTKTAVP------QLSEGDGYSSENTS--RAHTPLNTPDP 1853

BLAST of CsGy6G019510 vs. ExPASy Swiss-Prot
Match: Q9VYD1 (Probable histone-lysine N-methyltransferase CG1716 OS=Drosophila melanogaster OX=7227 GN=Set2 PE=1 SV=2)

HSP 1 Score: 202.6 bits (514), Expect = 4.4e-50
Identity = 128/348 (36.78%), Positives = 184/348 (52.87%), Query Frame = 0

Query: 1073 ATVNAVPQQ-------NKFASISSNQFLHRSRKTQTIDEIMVCHC----KPALDGRLGCG 1132
            A + A+ +Q       N F  +  N F   +R+    +  M C C         G L CG
Sbjct: 1271 ANIEAINEQFLRSEGLNTFQLLKEN-FYRCARQVSQENAEMQCDCFLTGDEEAQGHLSCG 1330

Query: 1133 DECLNRMLNIECVRGTCPCGELCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFL 1192
              C+NRML IEC    C  G  C+N++FQ+ +    +  R  KKG G+     I  G+F+
Sbjct: 1331 AGCINRMLMIEC-GPLCSNGARCTNKRFQQHQCWPCRVFRTEKKGCGITAELLIPPGEFI 1390

Query: 1193 IEYVGEVLDMHAYEARQKEYALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCR 1252
            +EYVGEV+D   +E RQ  Y+ + +RH+YFM L G  VIDA  KGN+ R+INHSCDPN  
Sbjct: 1391 MEYVGEVIDSEEFERRQHLYSKDRNRHYYFMALRGEAVIDATSKGNISRYINHSCDPNAE 1450

Query: 1253 TEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGDPLN 1312
            T+KW VNGE+ IG F+++ I+ GEE+TFDY Y+R +G  A++CYC + +CRG+IGG+P +
Sbjct: 1451 TQKWTVNGELRIGFFSVKPIQPGEEITFDYQYLR-YGRDAQRCYCEAANCRGWIGGEPDS 1510

Query: 1313 SE---VIIQSDSDEEFPEPVMLRGDGRSLNSNLSTAVSSMDVAKMQSSEHLKGNRDKRDQ 1372
             E   +  +SDSD E  E  +   +          +  +   +K+++   L   R +++Q
Sbjct: 1511 DEGEQLDEESDSDAEMDEEEL---EAEPEEGQPRKSAKAKAKSKLKAKLPLATGRKRKEQ 1570

Query: 1373 PIRIASELKISEEKVDPLKLSASKISEEKE-DPLKLSATKISEEKEDP 1406
                  E K        LK SA+  S   E  P K    K     EDP
Sbjct: 1571 TKPKDREYKAGRW----LKPSATGSSSSAEKPPKKPKVNKFQAMLEDP 1608

BLAST of CsGy6G019510 vs. NCBI nr
Match: XP_011657417.1 (histone-lysine N-methyltransferase ASHH2 isoform X1 [Cucumis sativus] >XP_011657418.1 histone-lysine N-methyltransferase ASHH2 isoform X1 [Cucumis sativus] >KAE8647263.1 hypothetical protein Csa_018308 [Cucumis sativus])

HSP 1 Score: 4068 bits (10549), Expect = 0.0
Identity = 2082/2112 (98.58%), Positives = 2082/2112 (98.58%), Query Frame = 0

Query: 1    MGSCDDPAVIGEPFRASVTRLVRCSSQPLPEHQSHQEMASFSSNSREGQMFEPDRGLEVT 60
            MGSCDDPAVIGEPFRASVTRLVRCSSQPLPEHQSHQEMASFSSNSREGQMFEPDRGLEVT
Sbjct: 1    MGSCDDPAVIGEPFRASVTRLVRCSSQPLPEHQSHQEMASFSSNSREGQMFEPDRGLEVT 60

Query: 61   TASLCTNASDPDTSGEDGTLRGFEHADSLLMDKRLDGDSGGSDPCLNLDNESCNEGNKTL 120
            TASLCTNASDPDTSGEDGTLRGFEHADSLLMDKRLDGDSGGSDPCLNLDNESCNEGNKTL
Sbjct: 61   TASLCTNASDPDTSGEDGTLRGFEHADSLLMDKRLDGDSGGSDPCLNLDNESCNEGNKTL 120

Query: 121  SLDMKESEDVDGLVDILGCDATMEMISLTESLVNSVKPEELDNNSCIIDAPAKVERDDTA 180
            SLDMKESEDVDGLVDILGCDATMEMISLTESLVNSVKPEELDNNSCIIDAPAKVERDDTA
Sbjct: 121  SLDMKESEDVDGLVDILGCDATMEMISLTESLVNSVKPEELDNNSCIIDAPAKVERDDTA 180

Query: 181  QNGPILAGTGTRTDDLKSSYVCEIVSNSASADGLPNDFIQKNELENDGAGCSFSEVADRI 240
            QNGPILAGTGTRTDDLKSSYVCEIVSNSASADGLPNDFIQKNELENDGAGCSFSEVADRI
Sbjct: 181  QNGPILAGTGTRTDDLKSSYVCEIVSNSASADGLPNDFIQKNELENDGAGCSFSEVADRI 240

Query: 241  TEASVELEADMLNEMSPLQSGQILPIHVGQSIANYDRYVCRMDGKSLSSTSGETVTVVAD 300
            TEASVELEADMLNEMSPLQSGQILPIHVGQSIANYDRYVCRMDGKSLSSTSGETVTVVAD
Sbjct: 241  TEASVELEADMLNEMSPLQSGQILPIHVGQSIANYDRYVCRMDGKSLSSTSGETVTVVAD 300

Query: 301  MNSNPEGCLQMLPSQGCDRIGECLQSDGLPLTINASENDLCEEKHDSNSSSKYVPDVGGD 360
            MNSNPEGCLQMLPSQGCDRIGECLQSDGLPLTINASENDLCEEKHDSNSSSKYVPDVGGD
Sbjct: 301  MNSNPEGCLQMLPSQGCDRIGECLQSDGLPLTINASENDLCEEKHDSNSSSKYVPDVGGD 360

Query: 361  DSDVLTNNNSDGGQHTVPGIGNDHNLEDATVQVNHDCVELLSSPLPSQLPNSEKDEFYGM 420
            DSDVLTNNNSDGGQHTVPGIGNDHNLEDATVQVNHDCVELLSSPLPSQLPNSEKDEFYGM
Sbjct: 361  DSDVLTNNNSDGGQHTVPGIGNDHNLEDATVQVNHDCVELLSSPLPSQLPNSEKDEFYGM 420

Query: 421  LNGADIPIKYISSVNSCSVGDQDNNDIEKVGCVSEVKCPETVITSSKRSGRRRTSSQKTV 480
            LNGADIPIKYISSVNSCSVGDQDNNDIEKVGCVSEVKCPETVITSSKRSGRRRTSSQKTV
Sbjct: 421  LNGADIPIKYISSVNSCSVGDQDNNDIEKVGCVSEVKCPETVITSSKRSGRRRTSSQKTV 480

Query: 481  TKRASRKTKKKVPEPLIFDTARRRRSSISRPARPSPWGSLGHIIQSFEEIDDVLVNQTQK 540
            TKRASRKTKKKVPEPLIFDTARRRRSSISRPARPSPWGSLGHIIQSFEEIDDVLVNQTQK
Sbjct: 481  TKRASRKTKKKVPEPLIFDTARRRRSSISRPARPSPWGSLGHIIQSFEEIDDVLVNQTQK 540

Query: 541  QGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQGKSATSTSTNRIRLKVKLGKNVGHNFLN 600
            QGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQGKSATSTSTNRIRLKVKLGKNVGHNFLN
Sbjct: 541  QGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQGKSATSTSTNRIRLKVKLGKNVGHNFLN 600

Query: 601  IVVPEIVDSSLSAKGVNCNYGNESYWEGNLEFPPSNLGVDDQKAEEEGPLRKIFCYSRNQ 660
            IVVPEIVDSSLSAKGVNCNYGNESYWEGNLEFPPSNLGVDDQKAEEEGPLRKIFCYSRNQ
Sbjct: 601  IVVPEIVDSSLSAKGVNCNYGNESYWEGNLEFPPSNLGVDDQKAEEEGPLRKIFCYSRNQ 660

Query: 661  DKEDNCPDASVVNEQCTNNDSSCIVGIDKSSEKHADDNLCVSSHLVDPVATSDARSLDPG 720
            DKEDNCPDASVVNEQCTNNDSSCIVGIDKSSEKHADDNLCVSSHLVDPVATSDARSLDPG
Sbjct: 661  DKEDNCPDASVVNEQCTNNDSSCIVGIDKSSEKHADDNLCVSSHLVDPVATSDARSLDPG 720

Query: 721  TSPDSEVINSVLDIQVGAARQEILQDSVLASLEDFAASGNAPGSKKGRKKDKPSRVVSCS 780
            TSPDSEVINSVLDIQVGAARQEILQDSVLASLEDFAASGNAPGSKKGRKKDKPSRVVSCS
Sbjct: 721  TSPDSEVINSVLDIQVGAARQEILQDSVLASLEDFAASGNAPGSKKGRKKDKPSRVVSCS 780

Query: 781  EERGISVSACSNRSKSSKKHGRRHNVDNQLSSGETLTYADANVLNYSLTVKELSMEQVSL 840
            EERGISVSACSNRSKSSKKHGRRHNVDNQLSS                            
Sbjct: 781  EERGISVSACSNRSKSSKKHGRRHNVDNQLSS---------------------------- 840

Query: 841  LTEIELPEETLKAEDILNDKECCRADVGSVFSESENSKTFLPSQSAKKKHPKGSKSIKTS 900
              EIELPEETLKAEDILNDKECCRADVGSVFSESENSKTFLPSQSAKKKHPKGSKSIKTS
Sbjct: 841  --EIELPEETLKAEDILNDKECCRADVGSVFSESENSKTFLPSQSAKKKHPKGSKSIKTS 900

Query: 901  KGKSKAPGSKNKIKNASNERVYQRKSFKNSKSKEALCDQVVTETESHQIIGNCLVDKPEK 960
            KGKSKAPGSKNKIKNASNERVYQRKSFKNSKSKEALCDQVVTETESHQIIGNCLVDKPEK
Sbjct: 901  KGKSKAPGSKNKIKNASNERVYQRKSFKNSKSKEALCDQVVTETESHQIIGNCLVDKPEK 960

Query: 961  SDNIIASTVAVDLSVVQGAVNEQYMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWTC 1020
            SDNIIASTVAVDLSVVQGAVNEQYMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWTC
Sbjct: 961  SDNIIASTVAVDLSVVQGAVNEQYMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWTC 1020

Query: 1021 KDNVDKAFANCSIPQEKSNAEINAELEISDESGEENGSKKRLTYRELESFHPATVNAVPQ 1080
            KDNVDKAFANCSIPQEKSNAEINAELEISDESGEENGSKKRLTYRELESFHPATVNAVPQ
Sbjct: 1021 KDNVDKAFANCSIPQEKSNAEINAELEISDESGEENGSKKRLTYRELESFHPATVNAVPQ 1080

Query: 1081 QNKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCPC 1140
            QNKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCPC
Sbjct: 1081 QNKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCPC 1140

Query: 1141 GELCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDMHAYEARQKE 1200
            GELCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDMHAYEARQKE
Sbjct: 1141 GELCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDMHAYEARQKE 1200

Query: 1201 YALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRD 1260
            YALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRD
Sbjct: 1201 YALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRD 1260

Query: 1261 IKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGDPLNSEVIIQSDSDEEFPEPVML 1320
            IKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGDPLNSEVIIQSDSDEEFPEPVML
Sbjct: 1261 IKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGDPLNSEVIIQSDSDEEFPEPVML 1320

Query: 1321 RGDGRSLNSNLSTAVSSMDVAKMQSSEHLKGNRDKRDQPIRIASELKISEEKVDPLKLSA 1380
            RGDGRSLNSNLSTAVSSMDVAKMQSSEHLKGNRDKRDQPIRIASELKISEEKVDPLKLSA
Sbjct: 1321 RGDGRSLNSNLSTAVSSMDVAKMQSSEHLKGNRDKRDQPIRIASELKISEEKVDPLKLSA 1380

Query: 1381 SKISEEKEDPLKLSATKISEEKEDPLNLSASTISPLHSSLEFEDSKVASPIPVPDITHQT 1440
            SKISEEKEDPLKLSATKISEEKEDPLNLSASTISPLHSSLEFEDSKVASPIPVPDITHQT
Sbjct: 1381 SKISEEKEDPLKLSATKISEEKEDPLNLSASTISPLHSSLEFEDSKVASPIPVPDITHQT 1440

Query: 1441 EDVTSQPIFVDQTEISLLDNIPDKNTCSIEQEAKLSVDDIDARKKSKLDSVEDKQVYIKS 1500
            EDVTSQPIFVDQTEISLLDNIPDKNTCSIEQEAKLSVDDIDARKKSKLDSVEDKQVYIKS
Sbjct: 1441 EDVTSQPIFVDQTEISLLDNIPDKNTCSIEQEAKLSVDDIDARKKSKLDSVEDKQVYIKS 1500

Query: 1501 HPRMKTSRKLGSIKKGKVSSAEKIQITNRSQISSVKPKRLIEGSPGNRFEAVEEKLNELL 1560
            HPRMKTSRKLGSIKKGKVSSAEKIQITNRSQISSVKPKRLIEGSPGNRFEAVEEKLNELL
Sbjct: 1501 HPRMKTSRKLGSIKKGKVSSAEKIQITNRSQISSVKPKRLIEGSPGNRFEAVEEKLNELL 1560

Query: 1561 DAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSRLVLTDI 1620
            DAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSRLVLTDI
Sbjct: 1561 DAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSRLVLTDI 1620

Query: 1621 INKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTREILTSEHINGGPPCPGMESLR 1680
            INKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTREILTSEHINGGPPCPGMESLR
Sbjct: 1621 INKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTREILTSEHINGGPPCPGMESLR 1680

Query: 1681 ESLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFGYSEREDGRLEVYRGSNSSRFTASHSF 1740
            ESLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFGYSEREDGRLEVYRGSNSSRFTASHSF
Sbjct: 1681 ESLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFGYSEREDGRLEVYRGSNSSRFTASHSF 1740

Query: 1741 RHDQDCRPTDAIDCIKQSMPTSLPDAHPAEVCSLASAASHSVNGQKVRKRKSRWDQPADT 1800
            RHDQDCRPTDAIDCIKQSMPTSLPDAHPAEVCSLASAASHSVNGQKVRKRKSRWDQPADT
Sbjct: 1741 RHDQDCRPTDAIDCIKQSMPTSLPDAHPAEVCSLASAASHSVNGQKVRKRKSRWDQPADT 1800

Query: 1801 SLDLRSKEQKLESTSVQELNSSQLNSVGAASMLIDKVNNDDKDISLSDSVGVPCRQDEDI 1860
            SLDLRSKEQKLESTSVQELNSSQLNSVGAASMLIDKVNNDDKDISLSDSVGVPCRQDEDI
Sbjct: 1801 SLDLRSKEQKLESTSVQELNSSQLNSVGAASMLIDKVNNDDKDISLSDSVGVPCRQDEDI 1860

Query: 1861 RADSAVPNIPEDIPPGFSSPFNPPVASSSAFSAVLDPPRQNIGDLSCAFSTVGHLQERFI 1920
            RADSAVPNIPEDIPPGFSSPFNPPVASSSAFSAVLDPPRQNIGDLSCAFSTVGHLQERFI
Sbjct: 1861 RADSAVPNIPEDIPPGFSSPFNPPVASSSAFSAVLDPPRQNIGDLSCAFSTVGHLQERFI 1920

Query: 1921 SRLPVSYGIPFSIIEQCGTSHAENLECWDVAPGVPFHPFPPLPPYPRGMRGLPTSACGTA 1980
            SRLPVSYGIPFSIIEQCGTSHAENLECWDVAPGVPFHPFPPLPPYPRGMRGLPTSACGTA
Sbjct: 1921 SRLPVSYGIPFSIIEQCGTSHAENLECWDVAPGVPFHPFPPLPPYPRGMRGLPTSACGTA 1980

Query: 1981 GQSSQEGQVNSHDSRTSFSEESPPSTSTNYQTDLCTPSNNQQIAKRAKESSCDLGRRYFR 2040
            GQSSQEGQVNSHDSRTSFSEESPPSTSTNYQTDLCTPSNNQQIAKRAKESSCDLGRRYFR
Sbjct: 1981 GQSSQEGQVNSHDSRTSFSEESPPSTSTNYQTDLCTPSNNQQIAKRAKESSCDLGRRYFR 2040

Query: 2041 QQKWRNTKFGPPWLQRRSQWGCQGNFRGGVSTIGDENIPDEEISPYCSDEASGRVDKANG 2100
            QQKWRNTKFGPPWLQRRSQWGCQGNFRGGVSTIGDENIPDEEISPYCSDEASGRVDKANG
Sbjct: 2041 QQKWRNTKFGPPWLQRRSQWGCQGNFRGGVSTIGDENIPDEEISPYCSDEASGRVDKANG 2082

Query: 2101 DFYQHLQNQNLR 2112
            DFYQHLQNQNLR
Sbjct: 2101 DFYQHLQNQNLR 2082

BLAST of CsGy6G019510 vs. NCBI nr
Match: XP_031744047.1 (histone-lysine N-methyltransferase ASHH2 isoform X2 [Cucumis sativus])

HSP 1 Score: 3991 bits (10351), Expect = 0.0
Identity = 2045/2075 (98.55%), Positives = 2045/2075 (98.55%), Query Frame = 0

Query: 38   MASFSSNSREGQMFEPDRGLEVTTASLCTNASDPDTSGEDGTLRGFEHADSLLMDKRLDG 97
            MASFSSNSREGQMFEPDRGLEVTTASLCTNASDPDTSGEDGTLRGFEHADSLLMDKRLDG
Sbjct: 1    MASFSSNSREGQMFEPDRGLEVTTASLCTNASDPDTSGEDGTLRGFEHADSLLMDKRLDG 60

Query: 98   DSGGSDPCLNLDNESCNEGNKTLSLDMKESEDVDGLVDILGCDATMEMISLTESLVNSVK 157
            DSGGSDPCLNLDNESCNEGNKTLSLDMKESEDVDGLVDILGCDATMEMISLTESLVNSVK
Sbjct: 61   DSGGSDPCLNLDNESCNEGNKTLSLDMKESEDVDGLVDILGCDATMEMISLTESLVNSVK 120

Query: 158  PEELDNNSCIIDAPAKVERDDTAQNGPILAGTGTRTDDLKSSYVCEIVSNSASADGLPND 217
            PEELDNNSCIIDAPAKVERDDTAQNGPILAGTGTRTDDLKSSYVCEIVSNSASADGLPND
Sbjct: 121  PEELDNNSCIIDAPAKVERDDTAQNGPILAGTGTRTDDLKSSYVCEIVSNSASADGLPND 180

Query: 218  FIQKNELENDGAGCSFSEVADRITEASVELEADMLNEMSPLQSGQILPIHVGQSIANYDR 277
            FIQKNELENDGAGCSFSEVADRITEASVELEADMLNEMSPLQSGQILPIHVGQSIANYDR
Sbjct: 181  FIQKNELENDGAGCSFSEVADRITEASVELEADMLNEMSPLQSGQILPIHVGQSIANYDR 240

Query: 278  YVCRMDGKSLSSTSGETVTVVADMNSNPEGCLQMLPSQGCDRIGECLQSDGLPLTINASE 337
            YVCRMDGKSLSSTSGETVTVVADMNSNPEGCLQMLPSQGCDRIGECLQSDGLPLTINASE
Sbjct: 241  YVCRMDGKSLSSTSGETVTVVADMNSNPEGCLQMLPSQGCDRIGECLQSDGLPLTINASE 300

Query: 338  NDLCEEKHDSNSSSKYVPDVGGDDSDVLTNNNSDGGQHTVPGIGNDHNLEDATVQVNHDC 397
            NDLCEEKHDSNSSSKYVPDVGGDDSDVLTNNNSDGGQHTVPGIGNDHNLEDATVQVNHDC
Sbjct: 301  NDLCEEKHDSNSSSKYVPDVGGDDSDVLTNNNSDGGQHTVPGIGNDHNLEDATVQVNHDC 360

Query: 398  VELLSSPLPSQLPNSEKDEFYGMLNGADIPIKYISSVNSCSVGDQDNNDIEKVGCVSEVK 457
            VELLSSPLPSQLPNSEKDEFYGMLNGADIPIKYISSVNSCSVGDQDNNDIEKVGCVSEVK
Sbjct: 361  VELLSSPLPSQLPNSEKDEFYGMLNGADIPIKYISSVNSCSVGDQDNNDIEKVGCVSEVK 420

Query: 458  CPETVITSSKRSGRRRTSSQKTVTKRASRKTKKKVPEPLIFDTARRRRSSISRPARPSPW 517
            CPETVITSSKRSGRRRTSSQKTVTKRASRKTKKKVPEPLIFDTARRRRSSISRPARPSPW
Sbjct: 421  CPETVITSSKRSGRRRTSSQKTVTKRASRKTKKKVPEPLIFDTARRRRSSISRPARPSPW 480

Query: 518  GSLGHIIQSFEEIDDVLVNQTQKQGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQGKSAT 577
            GSLGHIIQSFEEIDDVLVNQTQKQGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQGKSAT
Sbjct: 481  GSLGHIIQSFEEIDDVLVNQTQKQGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQGKSAT 540

Query: 578  STSTNRIRLKVKLGKNVGHNFLNIVVPEIVDSSLSAKGVNCNYGNESYWEGNLEFPPSNL 637
            STSTNRIRLKVKLGKNVGHNFLNIVVPEIVDSSLSAKGVNCNYGNESYWEGNLEFPPSNL
Sbjct: 541  STSTNRIRLKVKLGKNVGHNFLNIVVPEIVDSSLSAKGVNCNYGNESYWEGNLEFPPSNL 600

Query: 638  GVDDQKAEEEGPLRKIFCYSRNQDKEDNCPDASVVNEQCTNNDSSCIVGIDKSSEKHADD 697
            GVDDQKAEEEGPLRKIFCYSRNQDKEDNCPDASVVNEQCTNNDSSCIVGIDKSSEKHADD
Sbjct: 601  GVDDQKAEEEGPLRKIFCYSRNQDKEDNCPDASVVNEQCTNNDSSCIVGIDKSSEKHADD 660

Query: 698  NLCVSSHLVDPVATSDARSLDPGTSPDSEVINSVLDIQVGAARQEILQDSVLASLEDFAA 757
            NLCVSSHLVDPVATSDARSLDPGTSPDSEVINSVLDIQVGAARQEILQDSVLASLEDFAA
Sbjct: 661  NLCVSSHLVDPVATSDARSLDPGTSPDSEVINSVLDIQVGAARQEILQDSVLASLEDFAA 720

Query: 758  SGNAPGSKKGRKKDKPSRVVSCSEERGISVSACSNRSKSSKKHGRRHNVDNQLSSGETLT 817
            SGNAPGSKKGRKKDKPSRVVSCSEERGISVSACSNRSKSSKKHGRRHNVDNQLSS     
Sbjct: 721  SGNAPGSKKGRKKDKPSRVVSCSEERGISVSACSNRSKSSKKHGRRHNVDNQLSS----- 780

Query: 818  YADANVLNYSLTVKELSMEQVSLLTEIELPEETLKAEDILNDKECCRADVGSVFSESENS 877
                                     EIELPEETLKAEDILNDKECCRADVGSVFSESENS
Sbjct: 781  -------------------------EIELPEETLKAEDILNDKECCRADVGSVFSESENS 840

Query: 878  KTFLPSQSAKKKHPKGSKSIKTSKGKSKAPGSKNKIKNASNERVYQRKSFKNSKSKEALC 937
            KTFLPSQSAKKKHPKGSKSIKTSKGKSKAPGSKNKIKNASNERVYQRKSFKNSKSKEALC
Sbjct: 841  KTFLPSQSAKKKHPKGSKSIKTSKGKSKAPGSKNKIKNASNERVYQRKSFKNSKSKEALC 900

Query: 938  DQVVTETESHQIIGNCLVDKPEKSDNIIASTVAVDLSVVQGAVNEQYMPPRNAWVLCDDC 997
            DQVVTETESHQIIGNCLVDKPEKSDNIIASTVAVDLSVVQGAVNEQYMPPRNAWVLCDDC
Sbjct: 901  DQVVTETESHQIIGNCLVDKPEKSDNIIASTVAVDLSVVQGAVNEQYMPPRNAWVLCDDC 960

Query: 998  HKWRRIPASLVDSLGHASCTWTCKDNVDKAFANCSIPQEKSNAEINAELEISDESGEENG 1057
            HKWRRIPASLVDSLGHASCTWTCKDNVDKAFANCSIPQEKSNAEINAELEISDESGEENG
Sbjct: 961  HKWRRIPASLVDSLGHASCTWTCKDNVDKAFANCSIPQEKSNAEINAELEISDESGEENG 1020

Query: 1058 SKKRLTYRELESFHPATVNAVPQQNKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGR 1117
            SKKRLTYRELESFHPATVNAVPQQNKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGR
Sbjct: 1021 SKKRLTYRELESFHPATVNAVPQQNKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGR 1080

Query: 1118 LGCGDECLNRMLNIECVRGTCPCGELCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISK 1177
            LGCGDECLNRMLNIECVRGTCPCGELCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISK
Sbjct: 1081 LGCGDECLNRMLNIECVRGTCPCGELCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISK 1140

Query: 1178 GQFLIEYVGEVLDMHAYEARQKEYALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCD 1237
            GQFLIEYVGEVLDMHAYEARQKEYALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCD
Sbjct: 1141 GQFLIEYVGEVLDMHAYEARQKEYALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCD 1200

Query: 1238 PNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGG 1297
            PNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGG
Sbjct: 1201 PNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGG 1260

Query: 1298 DPLNSEVIIQSDSDEEFPEPVMLRGDGRSLNSNLSTAVSSMDVAKMQSSEHLKGNRDKRD 1357
            DPLNSEVIIQSDSDEEFPEPVMLRGDGRSLNSNLSTAVSSMDVAKMQSSEHLKGNRDKRD
Sbjct: 1261 DPLNSEVIIQSDSDEEFPEPVMLRGDGRSLNSNLSTAVSSMDVAKMQSSEHLKGNRDKRD 1320

Query: 1358 QPIRIASELKISEEKVDPLKLSASKISEEKEDPLKLSATKISEEKEDPLNLSASTISPLH 1417
            QPIRIASELKISEEKVDPLKLSASKISEEKEDPLKLSATKISEEKEDPLNLSASTISPLH
Sbjct: 1321 QPIRIASELKISEEKVDPLKLSASKISEEKEDPLKLSATKISEEKEDPLNLSASTISPLH 1380

Query: 1418 SSLEFEDSKVASPIPVPDITHQTEDVTSQPIFVDQTEISLLDNIPDKNTCSIEQEAKLSV 1477
            SSLEFEDSKVASPIPVPDITHQTEDVTSQPIFVDQTEISLLDNIPDKNTCSIEQEAKLSV
Sbjct: 1381 SSLEFEDSKVASPIPVPDITHQTEDVTSQPIFVDQTEISLLDNIPDKNTCSIEQEAKLSV 1440

Query: 1478 DDIDARKKSKLDSVEDKQVYIKSHPRMKTSRKLGSIKKGKVSSAEKIQITNRSQISSVKP 1537
            DDIDARKKSKLDSVEDKQVYIKSHPRMKTSRKLGSIKKGKVSSAEKIQITNRSQISSVKP
Sbjct: 1441 DDIDARKKSKLDSVEDKQVYIKSHPRMKTSRKLGSIKKGKVSSAEKIQITNRSQISSVKP 1500

Query: 1538 KRLIEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQS 1597
            KRLIEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQS
Sbjct: 1501 KRLIEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQS 1560

Query: 1598 NRDLSMILDALLKTKSRLVLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLV 1657
            NRDLSMILDALLKTKSRLVLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLV
Sbjct: 1561 NRDLSMILDALLKTKSRLVLTDIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLV 1620

Query: 1658 TREILTSEHINGGPPCPGMESLRESLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFGYSE 1717
            TREILTSEHINGGPPCPGMESLRESLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFGYSE
Sbjct: 1621 TREILTSEHINGGPPCPGMESLRESLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFGYSE 1680

Query: 1718 REDGRLEVYRGSNSSRFTASHSFRHDQDCRPTDAIDCIKQSMPTSLPDAHPAEVCSLASA 1777
            REDGRLEVYRGSNSSRFTASHSFRHDQDCRPTDAIDCIKQSMPTSLPDAHPAEVCSLASA
Sbjct: 1681 REDGRLEVYRGSNSSRFTASHSFRHDQDCRPTDAIDCIKQSMPTSLPDAHPAEVCSLASA 1740

Query: 1778 ASHSVNGQKVRKRKSRWDQPADTSLDLRSKEQKLESTSVQELNSSQLNSVGAASMLIDKV 1837
            ASHSVNGQKVRKRKSRWDQPADTSLDLRSKEQKLESTSVQELNSSQLNSVGAASMLIDKV
Sbjct: 1741 ASHSVNGQKVRKRKSRWDQPADTSLDLRSKEQKLESTSVQELNSSQLNSVGAASMLIDKV 1800

Query: 1838 NNDDKDISLSDSVGVPCRQDEDIRADSAVPNIPEDIPPGFSSPFNPPVASSSAFSAVLDP 1897
            NNDDKDISLSDSVGVPCRQDEDIRADSAVPNIPEDIPPGFSSPFNPPVASSSAFSAVLDP
Sbjct: 1801 NNDDKDISLSDSVGVPCRQDEDIRADSAVPNIPEDIPPGFSSPFNPPVASSSAFSAVLDP 1860

Query: 1898 PRQNIGDLSCAFSTVGHLQERFISRLPVSYGIPFSIIEQCGTSHAENLECWDVAPGVPFH 1957
            PRQNIGDLSCAFSTVGHLQERFISRLPVSYGIPFSIIEQCGTSHAENLECWDVAPGVPFH
Sbjct: 1861 PRQNIGDLSCAFSTVGHLQERFISRLPVSYGIPFSIIEQCGTSHAENLECWDVAPGVPFH 1920

Query: 1958 PFPPLPPYPRGMRGLPTSACGTAGQSSQEGQVNSHDSRTSFSEESPPSTSTNYQTDLCTP 2017
            PFPPLPPYPRGMRGLPTSACGTAGQSSQEGQVNSHDSRTSFSEESPPSTSTNYQTDLCTP
Sbjct: 1921 PFPPLPPYPRGMRGLPTSACGTAGQSSQEGQVNSHDSRTSFSEESPPSTSTNYQTDLCTP 1980

Query: 2018 SNNQQIAKRAKESSCDLGRRYFRQQKWRNTKFGPPWLQRRSQWGCQGNFRGGVSTIGDEN 2077
            SNNQQIAKRAKESSCDLGRRYFRQQKWRNTKFGPPWLQRRSQWGCQGNFRGGVSTIGDEN
Sbjct: 1981 SNNQQIAKRAKESSCDLGRRYFRQQKWRNTKFGPPWLQRRSQWGCQGNFRGGVSTIGDEN 2040

Query: 2078 IPDEEISPYCSDEASGRVDKANGDFYQHLQNQNLR 2112
            IPDEEISPYCSDEASGRVDKANGDFYQHLQNQNLR
Sbjct: 2041 IPDEEISPYCSDEASGRVDKANGDFYQHLQNQNLR 2045

BLAST of CsGy6G019510 vs. NCBI nr
Match: KAA0055531.1 (histone-lysine N-methyltransferase ASHH2 [Cucumis melo var. makuwa])

HSP 1 Score: 3907 bits (10133), Expect = 0.0
Identity = 2004/2113 (94.84%), Positives = 2037/2113 (96.40%), Query Frame = 0

Query: 1    MGSCDDPAVIGEPFRASVTRLVRCSSQPLPEHQSHQEMASFSSNSREGQMFEPDRGLEVT 60
            MGSCDDPAVIGEPFRASVTRLVRCSSQPLPEHQSHQEMASFSS+SREGQMFEPDRGL VT
Sbjct: 1    MGSCDDPAVIGEPFRASVTRLVRCSSQPLPEHQSHQEMASFSSSSREGQMFEPDRGLGVT 60

Query: 61   TASLCTNASDPDTSGEDGTLRGFEHADSLLMDKRLDGDSGGSDPCLNLDNESCNEGNKTL 120
            TAS+C NASDPDT GEDGTL  FEHADSLLMDKRLDGD GGSDPCLNL+NESCNEGN+TL
Sbjct: 61   TASVCMNASDPDTYGEDGTLGAFEHADSLLMDKRLDGDFGGSDPCLNLENESCNEGNRTL 120

Query: 121  SLDMKESEDVDGLVDILGCDATMEMISLTESLVNSVKPEELDNNSCIIDAPAKVERDDTA 180
            SLDMKESEDVDG VDILGCDATMEMISLTESLVNSVKPEELD NSCI DAPAKVERDDT 
Sbjct: 121  SLDMKESEDVDGFVDILGCDATMEMISLTESLVNSVKPEELDKNSCIFDAPAKVERDDTV 180

Query: 181  QNGPILAGTGTRTDDLKSSYVCEIVSNSASADGLPNDFIQKNELENDGAGCSFSEVADRI 240
            QNGPIL GTGTRTDDLKSSYVCEIVSNSASADGLPNDFIQ+N++ENDGAGCSFSEVADRI
Sbjct: 181  QNGPILVGTGTRTDDLKSSYVCEIVSNSASADGLPNDFIQQNKMENDGAGCSFSEVADRI 240

Query: 241  TEASVELEADMLNEMSPLQSGQILPIHVGQSIANYDRYVCRMDGKSLSSTSGETVTVVAD 300
            TEASVELEADMLNE+SPLQSGQILPI VGQSIAN DRYVC+MDGKSLSSTSGETV  VAD
Sbjct: 241  TEASVELEADMLNEISPLQSGQILPIDVGQSIANCDRYVCQMDGKSLSSTSGETVIEVAD 300

Query: 301  MNSNPEGCLQMLPSQGCDRIGECLQSDGLPLTINASENDLCEEKHDSNSSSKYVPDVGGD 360
            MNSNPE CLQMLPSQGCDRIGECLQSDGLPLTI+ASENDLCEEKHDSNSSSKY+PDVGGD
Sbjct: 301  MNSNPEVCLQMLPSQGCDRIGECLQSDGLPLTIHASENDLCEEKHDSNSSSKYIPDVGGD 360

Query: 361  DSDVLTNNNSDGGQHTVPGIGNDHNLEDATVQVNHDCVELLSSPLPSQLPNSEKDEFYGM 420
            DSDVLTNNNSDGGQH VPGIGNDHNLEDATVQVNH+CVELL+SPLPSQ PNSEKDEFYG 
Sbjct: 361  DSDVLTNNNSDGGQHVVPGIGNDHNLEDATVQVNHNCVELLASPLPSQPPNSEKDEFYGT 420

Query: 421  LNGADIPIKYISSVNSCSVGDQDNNDIEKVGCVSEVKCPETVITSSKRSGRRRTSSQKTV 480
            L   DIPIKYISSVNS  +GDQDNNDI KVGCVSEVKCPETVI SSKRSGRRRTSSQK V
Sbjct: 421  LK-EDIPIKYISSVNSRCLGDQDNNDIGKVGCVSEVKCPETVIMSSKRSGRRRTSSQKAV 480

Query: 481  TKRASRKTKKKVPEPLIFDTARRRRSSISRPARPSPWGSLGHIIQSFEEIDDVLVNQTQK 540
            TKRASRKTKKKVPEPLIFDT RRRRSSISR ARPSPWGSLGHIIQSFEEIDDVLVNQTQK
Sbjct: 481  TKRASRKTKKKVPEPLIFDTTRRRRSSISRSARPSPWGSLGHIIQSFEEIDDVLVNQTQK 540

Query: 541  QGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQGKSATSTSTNRIRLKVKLGKNVGHNFLN 600
            QGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQGK ATSTSTNRIRLKVKLGKNVGHNFLN
Sbjct: 541  QGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQGKPATSTSTNRIRLKVKLGKNVGHNFLN 600

Query: 601  IVVPEIVDSSLSAKGVNCNYGNESYWEGNLEFPPSNLGVDDQKAEEEGPLRKIFCYSRNQ 660
            IVVPEIVDSSLSAKGVNCNYGN+SYWEGNLEFPPS LGVDDQK EE GPLRKIFCYSRNQ
Sbjct: 601  IVVPEIVDSSLSAKGVNCNYGNDSYWEGNLEFPPSTLGVDDQKVEE-GPLRKIFCYSRNQ 660

Query: 661  DKEDNCPDASVVNEQCTNNDSSCIVGIDKSSEKHADDNLCVSSHLVDPVA-TSDARSLDP 720
            DKE+ CPDASVVNEQCTNNDSSCI+GIDKSSEKHADDNLCVSSHLV+PV  TSD RSLDP
Sbjct: 661  DKEEKCPDASVVNEQCTNNDSSCIIGIDKSSEKHADDNLCVSSHLVEPVERTSDTRSLDP 720

Query: 721  GTSPDSEVINSVLDIQVGAARQEILQDSVLASLEDFAASGNAPGSKKGRKKDKPSRVVSC 780
            GTSPDSEVINSVLDIQVGAARQEIL DSVLASLEDFAASGNAPGSKKGRKKDKPSR VSC
Sbjct: 721  GTSPDSEVINSVLDIQVGAARQEILPDSVLASLEDFAASGNAPGSKKGRKKDKPSRAVSC 780

Query: 781  SEERGISVSACSNRSKSSKKHGRRHNVDNQLSSGETLTYADANVLNYSLTVKELSMEQVS 840
            S ERGISVSACSNRSKSSKKHGRR NVDNQL SGET TY+DANVLNYSLTV+ELSMEQVS
Sbjct: 781  SGERGISVSACSNRSKSSKKHGRRQNVDNQLGSGETFTYSDANVLNYSLTVEELSMEQVS 840

Query: 841  LLTEIELPEETLKAEDILNDKECCRADVGSVFSESENSKTFLPSQSAKKKHPKGSKSIKT 900
            LLTEIELPE+TLKA+DILNDKECCRADVGS F ESENSKTFLPSQSAKKKHPKGSKSIKT
Sbjct: 841  LLTEIELPEDTLKADDILNDKECCRADVGSTFPESENSKTFLPSQSAKKKHPKGSKSIKT 900

Query: 901  SKGKSKAPGSKNKIKNASNERVYQRKSFKNSKSKEALCDQVVTETESHQIIGNCLVDKPE 960
            SKGKSKAPGSKNKIKNASNERVYQRKSFK SKSKEALCD+VVTETESHQIIGNCLVDKPE
Sbjct: 901  SKGKSKAPGSKNKIKNASNERVYQRKSFKKSKSKEALCDRVVTETESHQIIGNCLVDKPE 960

Query: 961  KSDNIIASTVAVDLSVVQGAVNEQYMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWT 1020
            KSDNIIASTVAVDLSVVQGAVNEQYMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWT
Sbjct: 961  KSDNIIASTVAVDLSVVQGAVNEQYMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWT 1020

Query: 1021 CKDNVDKAFANCSIPQEKSNAEINAELEISDESGEENGSKKRLTYRELESFHPATVNAVP 1080
            CKDNVDKAFANCSIPQEKSNAEINAELEISDESGEENGSKKRLTYRELESFHPATV A+P
Sbjct: 1021 CKDNVDKAFANCSIPQEKSNAEINAELEISDESGEENGSKKRLTYRELESFHPATVTAIP 1080

Query: 1081 QQNKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCP 1140
            Q+NKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCP
Sbjct: 1081 QENKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCP 1140

Query: 1141 CGELCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDMHAYEARQK 1200
            CG+LCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDM+AYEARQK
Sbjct: 1141 CGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDMNAYEARQK 1200

Query: 1201 EYALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALR 1260
            EYALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALR
Sbjct: 1201 EYALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALR 1260

Query: 1261 DIKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGDPLNSEVIIQSDSDEEFPEPVM 1320
            DIKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGDPLNSEVIIQSDSDEEFPEPVM
Sbjct: 1261 DIKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGDPLNSEVIIQSDSDEEFPEPVM 1320

Query: 1321 LRGDGRSLNSNLSTAVSSMDVAKMQSSEHLKGNRDKRDQPIRIASELKISEEKVDPLKLS 1380
            LR DGRS N+NLSTAVSSMDVAKMQ SEHLKGNRDKRDQPIRIASELKISEEKVD LKL 
Sbjct: 1321 LRADGRSWNNNLSTAVSSMDVAKMQPSEHLKGNRDKRDQPIRIASELKISEEKVDTLKLP 1380

Query: 1381 ASKISEEKEDPLKLSATKISEEKEDPLNLSASTISPLHSSLEFEDSKVASPIPVPDITHQ 1440
            ASKISEEKEDPLKLSA K SEEKEDPLNLSASTISPLHSSLEFEDSKVASPIPVPDITHQ
Sbjct: 1381 ASKISEEKEDPLKLSALKTSEEKEDPLNLSASTISPLHSSLEFEDSKVASPIPVPDITHQ 1440

Query: 1441 TEDVTSQPIFVDQTEISLLDNIPDKNTCSIEQEAKLSVDDIDARKKSKLDSVEDKQVYIK 1500
            TEDVTS+PIFVDQT ISLLDNI DKNTCSIEQEAKLSVDDIDARKKSKLDSVEDK+VYIK
Sbjct: 1441 TEDVTSKPIFVDQTGISLLDNISDKNTCSIEQEAKLSVDDIDARKKSKLDSVEDKKVYIK 1500

Query: 1501 SHPRMKTSRKLGSIKKGKVSSAEKIQITNRSQISSVKPKRLIEGSPGNRFEAVEEKLNEL 1560
            SHPRMKTSRK GS+KKGKVSS EKIQITNRS ISSVKPKRLIEGSPGNRFEAVEEKLNEL
Sbjct: 1501 SHPRMKTSRKPGSVKKGKVSSVEKIQITNRSLISSVKPKRLIEGSPGNRFEAVEEKLNEL 1560

Query: 1561 LDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSRLVLTD 1620
            LDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSRLVLTD
Sbjct: 1561 LDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSRLVLTD 1620

Query: 1621 IINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTREILTSEHINGGPPCPGMESL 1680
            IINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTREILTSEHINGGPPCPGMESL
Sbjct: 1621 IINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTREILTSEHINGGPPCPGMESL 1680

Query: 1681 RESLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFGYSEREDGRLEVYRGSNSSRFTASHS 1740
            RESLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFGYSEREDGRLEVYRGSNSSRFTASHS
Sbjct: 1681 RESLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFGYSEREDGRLEVYRGSNSSRFTASHS 1740

Query: 1741 FRHDQDCRPTDAIDCIKQSMPTSLPDAHPAEVCSLASAASHSVNGQKVRKRKSRWDQPAD 1800
            +RHDQDCRPTDAIDCIKQSMPT LPDAH AEVCSLAS A  SVNGQKVRKRKSRWDQPAD
Sbjct: 1741 YRHDQDCRPTDAIDCIKQSMPTPLPDAHTAEVCSLASVAGPSVNGQKVRKRKSRWDQPAD 1800

Query: 1801 TSLDLRSKEQKLESTSVQELNSSQLNSVGAASMLIDKVNNDDKDISLSDSVGVPCRQDED 1860
            TSLDLRSKEQKLESTSVQELNSSQLNSV  ASMLIDKVNNDDKD SLSDSVGVPCRQDED
Sbjct: 1801 TSLDLRSKEQKLESTSVQELNSSQLNSVRVASMLIDKVNNDDKDSSLSDSVGVPCRQDED 1860

Query: 1861 IRADSAVPNIPEDIPPGFSSPFNPPVASSSAFSAVLDPPRQNIGDLSCAFSTVGHLQERF 1920
             RADSAVPNIPEDIPPGFSSPFNP VASSSAFSAVLDPP+QNIG LSCAFSTVGHLQERF
Sbjct: 1861 TRADSAVPNIPEDIPPGFSSPFNPSVASSSAFSAVLDPPQQNIGYLSCAFSTVGHLQERF 1920

Query: 1921 ISRLPVSYGIPFSIIEQCGTSHAENLECWDVAPGVPFHPFPPLPPYPRGMRGLPTSACGT 1980
            ISRLPVSYGIPFSIIEQCGTS AENLECWDVAPGVPFHPFPPLPPYPRGM G  TSACGT
Sbjct: 1921 ISRLPVSYGIPFSIIEQCGTSRAENLECWDVAPGVPFHPFPPLPPYPRGMSGPRTSACGT 1980

Query: 1981 AGQSSQEGQVNSHDSRTSFSEESPPSTSTNYQTDLCTPSNNQQIAKRAKESSCDLGRRYF 2040
            AGQSSQEGQVNSHDSRTSFSEESPPSTSTNYQTDLCTPSNNQQI KR KESSCDLGRRYF
Sbjct: 1981 AGQSSQEGQVNSHDSRTSFSEESPPSTSTNYQTDLCTPSNNQQITKRPKESSCDLGRRYF 2040

Query: 2041 RQQKWRNTKFGPPWLQRRSQWGCQGNFRGGVSTIGDENIPDEEISPYCSDEASGRVDKAN 2100
            RQQKWRNTKFGPPWLQRRSQWGCQGNFRGGVSTIGDENIPDEEISPYCSDEASGRVDKAN
Sbjct: 2041 RQQKWRNTKFGPPWLQRRSQWGCQGNFRGGVSTIGDENIPDEEISPYCSDEASGRVDKAN 2100

Query: 2101 GDFYQHLQNQNLR 2112
            GDFYQHLQNQNLR
Sbjct: 2101 GDFYQHLQNQNLR 2111

BLAST of CsGy6G019510 vs. NCBI nr
Match: XP_008441612.1 (PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase ASHH2 [Cucumis melo])

HSP 1 Score: 3831 bits (9934), Expect = 0.0
Identity = 1975/2114 (93.42%), Positives = 2006/2114 (94.89%), Query Frame = 0

Query: 1    MGSCDDPAVIGEPFRASVTRLVRCSSQPLPEHQSHQEMASFSSNSREGQMFEPDRGLEVT 60
            MGSCDDPAVIGEPFRASVTRLVRCSSQPLPEHQSHQEMASFSS+SREGQMFEPDRGL VT
Sbjct: 1    MGSCDDPAVIGEPFRASVTRLVRCSSQPLPEHQSHQEMASFSSSSREGQMFEPDRGLGVT 60

Query: 61   TASLCTNASDPDTSGEDGTLRGFEHADSLLMDKRLDGDSGGSDPCLNLDNESCNEGNKTL 120
            TAS+C NASDPDT GEDGTL  FEHADSLLMDKRLDGD GGSDPCLNL+NESCNEGN+TL
Sbjct: 61   TASVCMNASDPDTYGEDGTLGAFEHADSLLMDKRLDGDFGGSDPCLNLENESCNEGNRTL 120

Query: 121  SLDMKESEDVDGLVDILGCDATMEMISLTESLVNSVKPEELDNNSCIIDAPAKVERDDTA 180
            SLDMKESEDVDG VDILGCDATMEMISLTESLVNSVKPEELD NSCI DAPAKVERDDT 
Sbjct: 121  SLDMKESEDVDGFVDILGCDATMEMISLTESLVNSVKPEELDKNSCIFDAPAKVERDDTV 180

Query: 181  QNGPILAGTGTRTDDLKSSYVCEIVSNSASADGLPNDFIQKNELENDGAGCSFSEVADRI 240
            QNGPIL GTGTRTDDLKSSYVCEIVSNSASADGLPNDFIQ+N++ENDGAGCSFSEVADRI
Sbjct: 181  QNGPILVGTGTRTDDLKSSYVCEIVSNSASADGLPNDFIQQNKMENDGAGCSFSEVADRI 240

Query: 241  TEASVELEADMLNEMSPLQSGQILPIHVGQSIANYDRYVCRMDGKSLSSTSGETVTVVAD 300
            TEASVELEADMLNE+SPLQSGQILPI VGQSIAN DRYVC+MDGKSLSSTSGETV  VAD
Sbjct: 241  TEASVELEADMLNEISPLQSGQILPIDVGQSIANCDRYVCQMDGKSLSSTSGETVIEVAD 300

Query: 301  MNSNPEGCLQMLPSQGCDRIGECLQSDGLPLTINASENDLCEEKHDSNSSSKYVPDVGGD 360
            MNSNPE CLQMLPSQGCDRIGECLQSDGLPLTI+ASENDLCEEKHDSNSSSKY+PDVGGD
Sbjct: 301  MNSNPEVCLQMLPSQGCDRIGECLQSDGLPLTIHASENDLCEEKHDSNSSSKYIPDVGGD 360

Query: 361  DSDVLTNNNSDGGQHTVPGIGNDHNLEDATVQVNHDCVELLSSPLPSQLPNSEKDEFYGM 420
            DSDVLTNNNSDGGQH VPGIGNDHNLEDATVQVNH+CVELL+SPLPSQ PNSEKDEFYG 
Sbjct: 361  DSDVLTNNNSDGGQHVVPGIGNDHNLEDATVQVNHNCVELLASPLPSQPPNSEKDEFYGT 420

Query: 421  LNGADIPIKYISSVNSCSVGDQDNNDIEKVGCVSEVKCPETVITSSKRSGRRRTSSQKTV 480
            L   DIPIKYISSVNS  +GDQDNNDI KVGCVSEVKCPETVI SSKRSGRRRTSSQK V
Sbjct: 421  LK-EDIPIKYISSVNSRCLGDQDNNDIGKVGCVSEVKCPETVIMSSKRSGRRRTSSQKAV 480

Query: 481  TKRASRKTKKKVPEPLIFDTARRRRSSISRPARPSPWGSLGHIIQSFEEIDDVLVNQTQK 540
            TKRASRKTKKKVPEPLIFDT RRRRSSISR ARPSPWGSLGHIIQSFEEIDDVLVNQTQK
Sbjct: 481  TKRASRKTKKKVPEPLIFDTTRRRRSSISRSARPSPWGSLGHIIQSFEEIDDVLVNQTQK 540

Query: 541  QGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQGKSATSTSTNRIRLKVKLGKNVGHNFLN 600
            QGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQGK ATSTSTNRIRLKVKLGKNVGHNFLN
Sbjct: 541  QGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQGKPATSTSTNRIRLKVKLGKNVGHNFLN 600

Query: 601  IVVPEIVDSSLSAKGVNCNYGNESYWEGNLEFPPSNLGVDDQKAEEEGPLRKIFCYSRNQ 660
            IVVPEIVDSSLSAKGVNCNYGN+SYWEGNLEFPPS LGVDDQK EE GPLRKIFCYSRNQ
Sbjct: 601  IVVPEIVDSSLSAKGVNCNYGNDSYWEGNLEFPPSTLGVDDQKVEE-GPLRKIFCYSRNQ 660

Query: 661  DKEDNCPDASVVNEQCTNNDSSCIVGIDKSSEKHADDNLCVSSHLVDPVA-TSDARSLDP 720
            DKE+ CPDASVVNEQCTNNDSSCI+GIDKSSEKHADDNLCVSSHLV+PV  TSD RSLDP
Sbjct: 661  DKEEKCPDASVVNEQCTNNDSSCIIGIDKSSEKHADDNLCVSSHLVEPVERTSDTRSLDP 720

Query: 721  GTSPDSEVINSVLDIQVGAARQEILQDSVLASLEDFAASGNAPGSKKGRKKDKPSRVVSC 780
            GTSPDSEVINSVLDIQVGAARQEIL DSVLASLEDFAASGNAPGSKKGRKKDKPSR VSC
Sbjct: 721  GTSPDSEVINSVLDIQVGAARQEILPDSVLASLEDFAASGNAPGSKKGRKKDKPSRAVSC 780

Query: 781  SEERGISVSACSNRSKSSKKHGRRHNVDNQLSSGETLTYADANVLNYSLTVKELSMEQVS 840
            S ERGISVSACSNRSKSSKKHGRR NVDNQL S                           
Sbjct: 781  SGERGISVSACSNRSKSSKKHGRRQNVDNQLGS--------------------------- 840

Query: 841  LLTEIELPEETLKAEDILNDKECCRADVGSVFSESENSKTFLPSQSAKKKHPKGSKSIKT 900
               EIELPE+TLKA+DILNDKECCRADVGS F ESENSKTFLPSQSAKKKHPKGSKSIKT
Sbjct: 841  ---EIELPEDTLKADDILNDKECCRADVGSTFPESENSKTFLPSQSAKKKHPKGSKSIKT 900

Query: 901  SKGKSKAPGSKNKIKNASNERVYQRKSFKNSKSKEALCDQVVTETESHQIIGNCLVDKPE 960
            SKGKSKAPGSKNKIKNASNERVYQRKSFK SKSKEALCD+VVTETESHQIIGNCLVDKPE
Sbjct: 901  SKGKSKAPGSKNKIKNASNERVYQRKSFKKSKSKEALCDRVVTETESHQIIGNCLVDKPE 960

Query: 961  KSDNIIASTVAVDLSVVQGAVNEQYMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWT 1020
            KSDNIIASTVAVDLSVVQGAVNEQYMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWT
Sbjct: 961  KSDNIIASTVAVDLSVVQGAVNEQYMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWT 1020

Query: 1021 CKDNVDKAFANCSIPQEKSNAEINAELEISDESGEENGSKKRLTYRELESFHPATVNAVP 1080
            CKDNVDKAFANCSIPQEKSNAEINAELEISDESGEENGSKKRLTYRELESFHPATV A+P
Sbjct: 1021 CKDNVDKAFANCSIPQEKSNAEINAELEISDESGEENGSKKRLTYRELESFHPATVTAIP 1080

Query: 1081 QQNKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCP 1140
            Q+NKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCP
Sbjct: 1081 QENKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCP 1140

Query: 1141 CGELCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDMHAYEARQK 1200
            CG+LCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDM+AYEARQK
Sbjct: 1141 CGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDMNAYEARQK 1200

Query: 1201 EYALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALR 1260
            EYALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALR
Sbjct: 1201 EYALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALR 1260

Query: 1261 DIKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGDPLNSEVIIQSDSDEEFPEPVM 1320
            DIKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGDPLNSEVIIQSDSDEEFPEPVM
Sbjct: 1261 DIKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGDPLNSEVIIQSDSDEEFPEPVM 1320

Query: 1321 LRGDGRSLNSNLSTAVSSMDVAKMQSSEHLKGNRDKRDQPIRIASELKISEEKVDPLKLS 1380
            LR DGRS N+NLSTAVSSMDVAKMQ SEHLKGNRDKRDQPIRIASELKISEEKVD LKL 
Sbjct: 1321 LRADGRSWNNNLSTAVSSMDVAKMQPSEHLKGNRDKRDQPIRIASELKISEEKVDTLKLP 1380

Query: 1381 ASKISEEKEDPLKLSATKISEEKEDPLNLSASTISPLHSSLEFEDSKVASPIPVPDITHQ 1440
            ASKISEEKEDPLKLSA K SEEKEDPLNLSASTISPLHSSLEFEDSKVASPIPVPDITHQ
Sbjct: 1381 ASKISEEKEDPLKLSALKTSEEKEDPLNLSASTISPLHSSLEFEDSKVASPIPVPDITHQ 1440

Query: 1441 TEDVTSQPIFVDQTEISLLDNIPDKNTCSIEQEAKLSVDDIDARKKSKLDSVEDKQVYIK 1500
            TEDVTS+PIFVDQT ISLLDNI DKNTCSIEQEAKLSVDDIDARKKSKLDSVEDK+VYIK
Sbjct: 1441 TEDVTSKPIFVDQTGISLLDNISDKNTCSIEQEAKLSVDDIDARKKSKLDSVEDKKVYIK 1500

Query: 1501 SHPRMKTSRKLGSIKKGK-VSSAEKIQITNRSQISSVKPKRLIEGSPGNRFEAVEEKLNE 1560
            SHPRMKTSRK GS+KK K  SS EKIQITNRS ISSVKPKRLIEGSPGNRFEAVEEKLNE
Sbjct: 1501 SHPRMKTSRKPGSVKKRKSXSSVEKIQITNRSLISSVKPKRLIEGSPGNRFEAVEEKLNE 1560

Query: 1561 LLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSRLVLT 1620
            LLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSRLVLT
Sbjct: 1561 LLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSRLVLT 1620

Query: 1621 DIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTREILTSEHINGGPPCPGMES 1680
            DIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTREILTSEHINGGPPCPGMES
Sbjct: 1621 DIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTREILTSEHINGGPPCPGMES 1680

Query: 1681 LRESLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFGYSEREDGRLEVYRGSNSSRFTASH 1740
            LRESLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFGYSEREDGRLEVYRGSNSSRFTASH
Sbjct: 1681 LRESLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFGYSEREDGRLEVYRGSNSSRFTASH 1740

Query: 1741 SFRHDQDCRPTDAIDCIKQSMPTSLPDAHPAEVCSLASAASHSVNGQKVRKRKSRWDQPA 1800
            S+RHDQDCRPTDAIDCIKQSMPT LPDAH AEVCSLAS A  SVNGQKVRKRKSRWDQPA
Sbjct: 1741 SYRHDQDCRPTDAIDCIKQSMPTPLPDAHTAEVCSLASVAGPSVNGQKVRKRKSRWDQPA 1800

Query: 1801 DTSLDLRSKEQKLESTSVQELNSSQLNSVGAASMLIDKVNNDDKDISLSDSVGVPCRQDE 1860
            DTSLDLRSKEQKLESTSVQELNSSQLNSV  ASMLIDKVNNDDKD SLSDSVGVPCRQDE
Sbjct: 1801 DTSLDLRSKEQKLESTSVQELNSSQLNSVRVASMLIDKVNNDDKDSSLSDSVGVPCRQDE 1860

Query: 1861 DIRADSAVPNIPEDIPPGFSSPFNPPVASSSAFSAVLDPPRQNIGDLSCAFSTVGHLQER 1920
            D RADSAVPNIPEDIPPGFSSPFNP VASSSAFSAVLDPP+QNIG LSCAFSTVGHLQER
Sbjct: 1861 DTRADSAVPNIPEDIPPGFSSPFNPSVASSSAFSAVLDPPQQNIGYLSCAFSTVGHLQER 1920

Query: 1921 FISRLPVSYGIPFSIIEQCGTSHAENLECWDVAPGVPFHPFPPLPPYPRGMRGLPTSACG 1980
            FISRLPVSYGIPFSIIEQCGTS AENLECWDVAPGVPFHPFPPLPPYPRGM G  TSACG
Sbjct: 1921 FISRLPVSYGIPFSIIEQCGTSRAENLECWDVAPGVPFHPFPPLPPYPRGMSGPRTSACG 1980

Query: 1981 TAGQSSQEGQVNSHDSRTSFSEESPPSTSTNYQTDLCTPSNNQQIAKRAKESSCDLGRRY 2040
            TAGQSSQEGQVNSHDSRTSFSEESPPSTSTNYQTDLCTPSNNQQI KR KESSCDLGRRY
Sbjct: 1981 TAGQSSQEGQVNSHDSRTSFSEESPPSTSTNYQTDLCTPSNNQQITKRPKESSCDLGRRY 2040

Query: 2041 FRQQKWRNTKFGPPWLQRRSQWGCQGNFRGGVSTIGDENIPDEEISPYCSDEASGRVDKA 2100
            FRQQKWRNTKFGPPWLQRRSQWGCQGNFRGGVSTIGDENIPDEEISPYCSDEASGRVDKA
Sbjct: 2041 FRQQKWRNTKFGPPWLQRRSQWGCQGNFRGGVSTIGDENIPDEEISPYCSDEASGRVDKA 2082

Query: 2101 NGDFYQHLQNQNLR 2112
            NGDFYQHLQNQNLR
Sbjct: 2101 NGDFYQHLQNQNLR 2082

BLAST of CsGy6G019510 vs. NCBI nr
Match: XP_038884997.1 (histone-lysine N-methyltransferase ASHH2 isoform X1 [Benincasa hispida] >XP_038884998.1 histone-lysine N-methyltransferase ASHH2 isoform X1 [Benincasa hispida] >XP_038884999.1 histone-lysine N-methyltransferase ASHH2 isoform X1 [Benincasa hispida])

HSP 1 Score: 3639 bits (9437), Expect = 0.0
Identity = 1863/2114 (88.13%), Positives = 1960/2114 (92.72%), Query Frame = 0

Query: 1    MGSCDDPAVIGEPFRASVTRLVRCSSQPLPEHQSHQEMASFSSNSREGQMFEPDRGLEVT 60
            MGSCDDPAVIGEPFR SVTRLVRCSSQPLP+HQS QEMASF S+S +GQMFEPDRGLEVT
Sbjct: 1    MGSCDDPAVIGEPFRGSVTRLVRCSSQPLPKHQSRQEMASFPSDSSDGQMFEPDRGLEVT 60

Query: 61   TASLCTNASDPDTSGEDGTLRGFEHADSLLMDKRLDGDSGGSDPCLNLDNESCNEGNKTL 120
            T  +CTNAS+  T+GEDGT RGFEHAD+LLMDKRLDGDSG S PCLN D E+CN GN+TL
Sbjct: 61   TTCVCTNASESGTAGEDGTFRGFEHADTLLMDKRLDGDSGDSGPCLNEDKEACNGGNRTL 120

Query: 121  SLDMKESEDVDGLVDILGCDATMEMISLTESLVNSVKPEELDNNSCIIDAPAKVERDDTA 180
            SLDMKES+DVDGLVDILGC  TMEM+SL  SLV+SVKPE+LDNNSCIIDAPAKVERD+T 
Sbjct: 121  SLDMKESQDVDGLVDILGCKTTMEMMSLNGSLVDSVKPEDLDNNSCIIDAPAKVERDNTV 180

Query: 181  QNGPILAGTGTRTDDLKSSYVCEIVSNSASADGLPNDFIQKNELENDGAGCSFSEVADRI 240
             NGP+LA  GT TD+LKS YVCEIVSNSASADGLP+DFIQ+NELENDGAGCSFSE ADRI
Sbjct: 181  ANGPVLARMGTCTDNLKSPYVCEIVSNSASADGLPSDFIQQNELENDGAGCSFSETADRI 240

Query: 241  TEASVELEADMLNEMSPLQSGQILPIHVGQSIANYDRYVCRMDGKSLSSTSGETVTVVAD 300
            TEASVE+EAD+LNEMSPLQSGQILP ++  S+AN+D+YVC+M+GKSLS TSGETV  VA 
Sbjct: 241  TEASVEIEADVLNEMSPLQSGQILPTYMELSVANFDQYVCQMEGKSLSGTSGETVIEVAA 300

Query: 301  MNSNPEGCLQMLPSQGCDRIGECLQSDGLPLTINASENDLCEEKHDSNSSSKYVPDVGGD 360
            MNSNPE CLQMLPSQ C+RIGECLQSDG PLTI+ASEND C+EK D+NSS KY+ +V  D
Sbjct: 301  MNSNPEVCLQMLPSQECERIGECLQSDGSPLTIDASENDWCDEKRDNNSS-KYITEVVED 360

Query: 361  DSDVLTNNNSDGGQHTVPGIGNDHNLEDATVQVNHDCVELLSSPLPSQLPNSEKDEFYGM 420
            D DVLTNNNSDGGQH VPGI ND NLE+ T+QVNH+CVELL+SPL SQ PNSEKDEFYGM
Sbjct: 361  DIDVLTNNNSDGGQHIVPGIENDRNLEEGTIQVNHNCVELLASPLLSQPPNSEKDEFYGM 420

Query: 421  LNGADIPIKYISSVNSCSVGDQDNNDIEKVGCVSEVKCPETVITSSKRSGRRRTSSQKTV 480
            LNGAD PIK ISSVNSCSVGDQD+NDIEKVGCVSEVKCPETVITSSKRSG+RRTS+QK V
Sbjct: 421  LNGADFPIKDISSVNSCSVGDQDHNDIEKVGCVSEVKCPETVITSSKRSGQRRTSNQKAV 480

Query: 481  TKRASRKTKKKVPEPLIFDTARRRRSSISRPARPSPWGSLGHIIQSFEEIDDVLVNQTQK 540
            TKRASRK+KKKVPEPLIFDTARRRRSS+SRPARPSPWGSLG+IIQSFEEIDDVL+NQ+QK
Sbjct: 481  TKRASRKSKKKVPEPLIFDTARRRRSSLSRPARPSPWGSLGYIIQSFEEIDDVLINQSQK 540

Query: 541  QGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQGKSATSTSTNRIRLKVKLGKNVGHNFLN 600
            QGN+KSK NQGG KRNKK+  ESSHRSRKGTQGK ATSTSTNRIRLKVKLGKNVGHNFLN
Sbjct: 541  QGNDKSKSNQGGIKRNKKKPKESSHRSRKGTQGKCATSTSTNRIRLKVKLGKNVGHNFLN 600

Query: 601  IVVPEIVDSSLSAKGVNCNYGNESYWEGNLEFPPSNLGVDDQKAEEEGPLRKIFCYSRNQ 660
            IVVPEIVDSSLSAKG+NCNYGNESYWEGNLEFPPS LGVDDQK EE GPL+KIFCYSRNQ
Sbjct: 601  IVVPEIVDSSLSAKGINCNYGNESYWEGNLEFPPSTLGVDDQKPEE-GPLKKIFCYSRNQ 660

Query: 661  DKEDNCPDASVVNEQCTNNDSSCIVGIDKSSEKHADDNLCVSSHLVDPVA-TSDARSLDP 720
            DKE+ CPDASVVNEQC NNDSSC + IDKSS KHADDNLCVS HLV+PV   SD R+ DP
Sbjct: 661  DKEEKCPDASVVNEQCANNDSSCTINIDKSSAKHADDNLCVSPHLVEPVERVSDTRNSDP 720

Query: 721  GTSPDSEVINSVLDIQVGAARQEILQDSVLASLEDFAASGNAPGSKKGRKKDKPSRVVSC 780
            GTSPDSEVINS+LDI VGA R+EILQDSVLASLEDF+ASGNA  S KGRKK+KP + VSC
Sbjct: 721  GTSPDSEVINSILDIPVGAMRREILQDSVLASLEDFSASGNAV-STKGRKKEKPCQAVSC 780

Query: 781  SEERGISVSACSNRSKSSKKHGRRHNVDNQLSSGETLTYADANVLNYSLTVKELSMEQVS 840
            SEE G   SACSNRSKSSKKHGRR NVDNQ  SGET TY DAN+LNY+LTVKELSMEQV 
Sbjct: 781  SEEGGTGASACSNRSKSSKKHGRRRNVDNQHGSGETFTYTDANILNYALTVKELSMEQVP 840

Query: 841  LLTEIELPEETLKAEDILNDKECCRADVGSVFSESENSKTFLPSQSAKKKHPKGSKSIKT 900
            LLTEIELPEE LKA++IL DKECCR DVGSVF ESENSKTFLPSQSAKKKHPKGSKSIKT
Sbjct: 841  LLTEIELPEEVLKADNILKDKECCRTDVGSVFPESENSKTFLPSQSAKKKHPKGSKSIKT 900

Query: 901  SKGKSKAPGSKNKIKNASNERVYQRKSFKNSKSKEALCDQVVTETESHQIIGNCLVDKPE 960
            SK K KAPGSKNKIKNAS ERVYQRKSF  SK KE LCD+VVTE  SHQI+GNC VDK E
Sbjct: 901  SKDKLKAPGSKNKIKNASKERVYQRKSFNKSKIKEDLCDRVVTEMGSHQILGNCFVDKHE 960

Query: 961  KSDNIIASTVAVDLSVVQGAVNEQYMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWT 1020
            KSD+IIASTVAV+LSVVQGA NEQYMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWT
Sbjct: 961  KSDDIIASTVAVNLSVVQGATNEQYMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWT 1020

Query: 1021 CKDNVDKAFANCSIPQEKSNAEINAELEISDESGEENGSKKRLTYRELESFHPATVNAVP 1080
            CKDNVDKAFA+CSIPQEKSNAEINAELEISDESGEEN S KRLTYRELESFHP TV AVP
Sbjct: 1021 CKDNVDKAFAHCSIPQEKSNAEINAELEISDESGEENASNKRLTYRELESFHPTTVTAVP 1080

Query: 1081 QQNKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCP 1140
            Q+NKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCP
Sbjct: 1081 QENKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCP 1140

Query: 1141 CGELCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDMHAYEARQK 1200
            CG+LCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDMHAYEARQK
Sbjct: 1141 CGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDMHAYEARQK 1200

Query: 1201 EYALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALR 1260
            EYALNGHRHFYFMTLNGSEVIDAC KGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALR
Sbjct: 1201 EYALNGHRHFYFMTLNGSEVIDACRKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALR 1260

Query: 1261 DIKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGDPLNSEVIIQSDSDEEFPEPVM 1320
            DIKKGEEVTFDYNYVRVFGAAAKKCYCGSF CRGYIGGDPLNSEVIIQSDSDEEFPEPVM
Sbjct: 1261 DIKKGEEVTFDYNYVRVFGAAAKKCYCGSFQCRGYIGGDPLNSEVIIQSDSDEEFPEPVM 1320

Query: 1321 LRGDGRSLNSNLSTAVSSMDVAKMQSSEHLKGNRDKRDQPIRIASELKISEEKVDPLKLS 1380
            LR DGRS N+N+ TAVSS+DVAKMQ S H+KG RDKRDQPIRIA E KISEEKVD LKLS
Sbjct: 1321 LRADGRSWNNNVPTAVSSLDVAKMQPSGHIKGIRDKRDQPIRIAIESKISEEKVDTLKLS 1380

Query: 1381 ASKISEEKEDPLKLSATKISEEKEDPLNLSASTISPLHSSLEFEDSKVASPIPVPDITHQ 1440
             SKISEEKED L LSA+KISEEKE+ LNLSASTISPLHSSLEFEDSKVASP P+PDITHQ
Sbjct: 1381 VSKISEEKEDSLNLSASKISEEKEEHLNLSASTISPLHSSLEFEDSKVASPTPLPDITHQ 1440

Query: 1441 TEDVTSQPIFVDQTEISLLDNIPDKNTCSIEQEAKLSVDDIDARKKSKLDSVEDKQVYIK 1500
            TEDVTS+P+FVDQTEISL+DNI DKNTCSIEQEAKLSVDDID RKKSKLD++EDKQVYIK
Sbjct: 1441 TEDVTSKPVFVDQTEISLVDNISDKNTCSIEQEAKLSVDDIDGRKKSKLDAIEDKQVYIK 1500

Query: 1501 SHPRMKTSRKLGSIKKGKVSSAEKIQITNRSQISSVKPKRLIEGSPGNRFEAVEEKLNEL 1560
            SHP+MKTSRK GSIKKGKVSS EKIQITNRSQISSVKPKRLIEGSPGNRFEAVEEKLNEL
Sbjct: 1501 SHPQMKTSRKPGSIKKGKVSSVEKIQITNRSQISSVKPKRLIEGSPGNRFEAVEEKLNEL 1560

Query: 1561 LDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSRLVLTD 1620
            LDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSR+VLTD
Sbjct: 1561 LDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSRVVLTD 1620

Query: 1621 IINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTREILTSEHINGGPPCPGMESL 1680
            IINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTREILTSEHINGGPPCPGMESL
Sbjct: 1621 IINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTREILTSEHINGGPPCPGMESL 1680

Query: 1681 RESLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFGYSEREDGRLEVYRGSNSSRFTASHS 1740
            RESLLSLTEHDDKQVHQIARSFRDRWFPRH+RKFGYSEREDGRLEVYRGSN SRFTASHS
Sbjct: 1681 RESLLSLTEHDDKQVHQIARSFRDRWFPRHSRKFGYSEREDGRLEVYRGSNCSRFTASHS 1740

Query: 1741 FRHDQDCRPTDAIDCIKQSMPTSLPDAHPAEVCSLASAASHSVNGQKVRKRKSRWDQPAD 1800
            +RHDQD RPTDAIDC+KQSMPTSLPDAHP EVCS+AS A HS NGQKVRKRKSRWDQPAD
Sbjct: 1741 YRHDQDSRPTDAIDCVKQSMPTSLPDAHPVEVCSVASTAGHSSNGQKVRKRKSRWDQPAD 1800

Query: 1801 TSLDLRSKEQKLESTSVQELNSSQLNSVGAASMLIDKVNNDDKDISLSDSVGVPCRQDED 1860
            TSLDLRSKEQKLESTSVQ+LNSSQLN VG ASMLIDKVNNDDKD SLSDSVGV CRQDED
Sbjct: 1801 TSLDLRSKEQKLESTSVQQLNSSQLNCVGMASMLIDKVNNDDKDSSLSDSVGVRCRQDED 1860

Query: 1861 IRADSAVPNIPEDIPPGFSSPFNPPVASSSAFSAVLDPPRQNIGDLSCAFSTVGHLQERF 1920
            IRADSAV N+PEDIPPGFSSPFNPPVASSSAFS VLDPPRQNI DL CAFSTVGH QERF
Sbjct: 1861 IRADSAVQNVPEDIPPGFSSPFNPPVASSSAFSTVLDPPRQNICDLGCAFSTVGHPQERF 1920

Query: 1921 ISRLPVSYGIPFSIIEQCGTSHAENLECWDVAPGVPFHPFPPLPPYPRGMRGLPTSACGT 1980
            ISR+PVSYGIPFSIIEQCGTSHAENLECWDVAPGVPFHPFPPLPPYPRG RG PTSACGT
Sbjct: 1921 ISRMPVSYGIPFSIIEQCGTSHAENLECWDVAPGVPFHPFPPLPPYPRGKRGPPTSACGT 1980

Query: 1981 A-GQSSQEGQVNSHDSRTSFSEESPPSTSTNYQTDLCTPSNNQQIAKRAKESSCDLGRRY 2040
            A GQSSQEGQVNSHDSRTSFSEESPPSTSTNYQ DLCT SNNQQI  R KESS DLGRRY
Sbjct: 1981 AVGQSSQEGQVNSHDSRTSFSEESPPSTSTNYQPDLCTSSNNQQIPNRTKESSYDLGRRY 2040

Query: 2041 FRQQKWRNTKFGPPWLQRRSQWGCQGNFRGGVSTIGDENIPDEEISPYCSDEASGRVDKA 2100
            FRQQKWRNTK+GPPWLQ+R+QWGCQGNFRGGVS I D+NIP+EEISPYCSDEASGR+DKA
Sbjct: 2041 FRQQKWRNTKYGPPWLQKRNQWGCQGNFRGGVSAIVDDNIPNEEISPYCSDEASGRLDKA 2100

Query: 2101 NGDFYQHLQNQNLR 2112
            N +FYQHLQNQNLR
Sbjct: 2101 NDEFYQHLQNQNLR 2111

BLAST of CsGy6G019510 vs. ExPASy TrEMBL
Match: A0A0A0KDR4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G376290 PE=4 SV=1)

HSP 1 Score: 4021 bits (10429), Expect = 0.0
Identity = 2059/2089 (98.56%), Positives = 2059/2089 (98.56%), Query Frame = 0

Query: 24   CSSQPLPEHQSHQEMASFSSNSREGQMFEPDRGLEVTTASLCTNASDPDTSGEDGTLRGF 83
            CSSQPLPEHQSHQEMASFSSNSREGQMFEPDRGLEVTTASLCTNASDPDTSGEDGTLRGF
Sbjct: 93   CSSQPLPEHQSHQEMASFSSNSREGQMFEPDRGLEVTTASLCTNASDPDTSGEDGTLRGF 152

Query: 84   EHADSLLMDKRLDGDSGGSDPCLNLDNESCNEGNKTLSLDMKESEDVDGLVDILGCDATM 143
            EHADSLLMDKRLDGDSGGSDPCLNLDNESCNEGNKTLSLDMKESEDVDGLVDILGCDATM
Sbjct: 153  EHADSLLMDKRLDGDSGGSDPCLNLDNESCNEGNKTLSLDMKESEDVDGLVDILGCDATM 212

Query: 144  EMISLTESLVNSVKPEELDNNSCIIDAPAKVERDDTAQNGPILAGTGTRTDDLKSSYVCE 203
            EMISLTESLVNSVKPEELDNNSCIIDAPAKVERDDTAQNGPILAGTGTRTDDLKSSYVCE
Sbjct: 213  EMISLTESLVNSVKPEELDNNSCIIDAPAKVERDDTAQNGPILAGTGTRTDDLKSSYVCE 272

Query: 204  IVSNSASADGLPNDFIQKNELENDGAGCSFSEVADRITEASVELEADMLNEMSPLQSGQI 263
            IVSNSASADGLPNDFIQKNELENDGAGCSFSEVADRITEASVELEADMLNEMSPLQSGQI
Sbjct: 273  IVSNSASADGLPNDFIQKNELENDGAGCSFSEVADRITEASVELEADMLNEMSPLQSGQI 332

Query: 264  LPIHVGQSIANYDRYVCRMDGKSLSSTSGETVTVVADMNSNPEGCLQMLPSQGCDRIGEC 323
            LPIHVGQSIANYDRYVCRMDGKSLSSTSGETVTVVADMNSNPEGCLQMLPSQGCDRIGEC
Sbjct: 333  LPIHVGQSIANYDRYVCRMDGKSLSSTSGETVTVVADMNSNPEGCLQMLPSQGCDRIGEC 392

Query: 324  LQSDGLPLTINASENDLCEEKHDSNSSSKYVPDVGGDDSDVLTNNNSDGGQHTVPGIGND 383
            LQSDGLPLTINASENDLCEEKHDSNSSSKYVPDVGGDDSDVLTNNNSDGGQHTVPGIGND
Sbjct: 393  LQSDGLPLTINASENDLCEEKHDSNSSSKYVPDVGGDDSDVLTNNNSDGGQHTVPGIGND 452

Query: 384  HNLEDATVQVNHDCVELLSSPLPSQLPNSEKDEFYGMLNGADIPIKYISSVNSCSVGDQD 443
            HNLEDATVQVNHDCVELLSSPLPSQLPNSEKDEFYGMLNGADIPIKYISSVNSCSVGDQD
Sbjct: 453  HNLEDATVQVNHDCVELLSSPLPSQLPNSEKDEFYGMLNGADIPIKYISSVNSCSVGDQD 512

Query: 444  NNDIEKVGCVSEVKCPETVITSSKRSGRRRTSSQKTVTKRASRKTKKKVPEPLIFDTARR 503
            NNDIEKVGCVSEVKCPETVITSSKRSGRRRTSSQKTVTKRASRKTKKKVPEPLIFDTARR
Sbjct: 513  NNDIEKVGCVSEVKCPETVITSSKRSGRRRTSSQKTVTKRASRKTKKKVPEPLIFDTARR 572

Query: 504  RRSSISRPARPSPWGSLGHIIQSFEEIDDVLVNQTQKQGNEKSKGNQGGAKRNKKQLSES 563
            RRSSISRPARPSPWGSLGHIIQSFEEIDDVLVNQTQKQGNEKSKGNQGGAKRNKKQLSES
Sbjct: 573  RRSSISRPARPSPWGSLGHIIQSFEEIDDVLVNQTQKQGNEKSKGNQGGAKRNKKQLSES 632

Query: 564  SHRSRKGTQGKSATSTSTNRIRLKVKLGKNVGHNFLNIVVPEIVDSSLSAKGVNCNYGNE 623
            SHRSRKGTQGKSATSTSTNRIRLKVKLGKNVGHNFLNIVVPEIVDSSLSAKGVNCNYGNE
Sbjct: 633  SHRSRKGTQGKSATSTSTNRIRLKVKLGKNVGHNFLNIVVPEIVDSSLSAKGVNCNYGNE 692

Query: 624  SYWEGNLEFPPSNLGVDDQKAEEEGPLRKIFCYSRNQDKEDNCPDASVVNEQCTNNDSSC 683
            SYWEGNLEFPPSNLGVDDQKAEEEGPLRKIFCYSRNQDKEDNCPDASVVNEQCTNNDSSC
Sbjct: 693  SYWEGNLEFPPSNLGVDDQKAEEEGPLRKIFCYSRNQDKEDNCPDASVVNEQCTNNDSSC 752

Query: 684  IVGIDKSSEKHADDNLCVSSHLVDPVATSDARSLDPGTSPDSEVINSVLDIQVGAARQEI 743
            IVGIDKSSEKHADDNLCVSSHLVDPVATSDARSLDPGTSPDSEVINSVLDIQVGAARQEI
Sbjct: 753  IVGIDKSSEKHADDNLCVSSHLVDPVATSDARSLDPGTSPDSEVINSVLDIQVGAARQEI 812

Query: 744  LQDSVLASLEDFAASGNAPGSKKGRKKDKPSRVVSCSEERGISVSACSNRSKSSKKHGRR 803
            LQDSVLASLEDFAASGNAPGSKKGRKKDKPSRVVSCSEERGISVSACSNRSKSSKKHGRR
Sbjct: 813  LQDSVLASLEDFAASGNAPGSKKGRKKDKPSRVVSCSEERGISVSACSNRSKSSKKHGRR 872

Query: 804  HNVDNQLSSGETLTYADANVLNYSLTVKELSMEQVSLLTEIELPEETLKAEDILNDKECC 863
            HNVDNQLSS                              EIELPEETLKAEDILNDKECC
Sbjct: 873  HNVDNQLSS------------------------------EIELPEETLKAEDILNDKECC 932

Query: 864  RADVGSVFSESENSKTFLPSQSAKKKHPKGSKSIKTSKGKSKAPGSKNKIKNASNERVYQ 923
            RADVGSVFSESENSKTFLPSQSAKKKHPKGSKSIKTSKGKSKAPGSKNKIKNASNERVYQ
Sbjct: 933  RADVGSVFSESENSKTFLPSQSAKKKHPKGSKSIKTSKGKSKAPGSKNKIKNASNERVYQ 992

Query: 924  RKSFKNSKSKEALCDQVVTETESHQIIGNCLVDKPEKSDNIIASTVAVDLSVVQGAVNEQ 983
            RKSFKNSKSKEALCDQVVTETESHQIIGNCLVDKPEKSDNIIASTVAVDLSVVQGAVNEQ
Sbjct: 993  RKSFKNSKSKEALCDQVVTETESHQIIGNCLVDKPEKSDNIIASTVAVDLSVVQGAVNEQ 1052

Query: 984  YMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWTCKDNVDKAFANCSIPQEKSNAEIN 1043
            YMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWTCKDNVDKAFANCSIPQEKSNAEIN
Sbjct: 1053 YMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWTCKDNVDKAFANCSIPQEKSNAEIN 1112

Query: 1044 AELEISDESGEENGSKKRLTYRELESFHPATVNAVPQQNKFASISSNQFLHRSRKTQTID 1103
            AELEISDESGEENGSKKRLTYRELESFHPATVNAVPQQNKFASISSNQFLHRSRKTQTID
Sbjct: 1113 AELEISDESGEENGSKKRLTYRELESFHPATVNAVPQQNKFASISSNQFLHRSRKTQTID 1172

Query: 1104 EIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCPCGELCSNQQFQKRKYAKLQWLRCG 1163
            EIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCPCGELCSNQQFQKRKYAKLQWLRCG
Sbjct: 1173 EIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCPCGELCSNQQFQKRKYAKLQWLRCG 1232

Query: 1164 KKGYGLQLLEDISKGQFLIEYVGEVLDMHAYEARQKEYALNGHRHFYFMTLNGSEVIDAC 1223
            KKGYGLQLLEDISKGQFLIEYVGEVLDMHAYEARQKEYALNGHRHFYFMTLNGSEVIDAC
Sbjct: 1233 KKGYGLQLLEDISKGQFLIEYVGEVLDMHAYEARQKEYALNGHRHFYFMTLNGSEVIDAC 1292

Query: 1224 GKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVFGAAAKK 1283
            GKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVFGAAAKK
Sbjct: 1293 GKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVFGAAAKK 1352

Query: 1284 CYCGSFHCRGYIGGDPLNSEVIIQSDSDEEFPEPVMLRGDGRSLNSNLSTAVSSMDVAKM 1343
            CYCGSFHCRGYIGGDPLNSEVIIQSDSDEEFPEPVMLRGDGRSLNSNLSTAVSSMDVAKM
Sbjct: 1353 CYCGSFHCRGYIGGDPLNSEVIIQSDSDEEFPEPVMLRGDGRSLNSNLSTAVSSMDVAKM 1412

Query: 1344 QSSEHLKGNRDKRDQPIRIASELKISEEKVDPLKLSASKISEEKEDPLKLSATKISEEKE 1403
            QSSEHLKGNRDKRDQPIRIASELKISEEKVDPLKLSASKISEEKEDPLKLSATKISEEKE
Sbjct: 1413 QSSEHLKGNRDKRDQPIRIASELKISEEKVDPLKLSASKISEEKEDPLKLSATKISEEKE 1472

Query: 1404 DPLNLSASTISPLHSSLEFEDSKVASPIPVPDITHQTEDVTSQPIFVDQTEISLLDNIPD 1463
            DPLNLSASTISPLHSSLEFEDSKVASPIPVPDITHQTEDVTSQPIFVDQTEISLLDNIPD
Sbjct: 1473 DPLNLSASTISPLHSSLEFEDSKVASPIPVPDITHQTEDVTSQPIFVDQTEISLLDNIPD 1532

Query: 1464 KNTCSIEQEAKLSVDDIDARKKSKLDSVEDKQVYIKSHPRMKTSRKLGSIKKGKVSSAEK 1523
            KNTCSIEQEAKLSVDDIDARKKSKLDSVEDKQVYIKSHPRMKTSRKLGSIKKGKVSSAEK
Sbjct: 1533 KNTCSIEQEAKLSVDDIDARKKSKLDSVEDKQVYIKSHPRMKTSRKLGSIKKGKVSSAEK 1592

Query: 1524 IQITNRSQISSVKPKRLIEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLT 1583
            IQITNRSQISSVKPKRLIEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLT
Sbjct: 1593 IQITNRSQISSVKPKRLIEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPKGYLKLLLLT 1652

Query: 1584 AASGASASGEAIQSNRDLSMILDALLKTKSRLVLTDIINKNGLRMLHNIMKQYRSDFKKI 1643
            AASGASASGEAIQSNRDLSMILDALLKTKSRLVLTDIINKNGLRMLHNIMKQYRSDFKKI
Sbjct: 1653 AASGASASGEAIQSNRDLSMILDALLKTKSRLVLTDIINKNGLRMLHNIMKQYRSDFKKI 1712

Query: 1644 PILRKLLKVLEYLVTREILTSEHINGGPPCPGMESLRESLLSLTEHDDKQVHQIARSFRD 1703
            PILRKLLKVLEYLVTREILTSEHINGGPPCPGMESLRESLLSLTEHDDKQVHQIARSFRD
Sbjct: 1713 PILRKLLKVLEYLVTREILTSEHINGGPPCPGMESLRESLLSLTEHDDKQVHQIARSFRD 1772

Query: 1704 RWFPRHTRKFGYSEREDGRLEVYRGSNSSRFTASHSFRHDQDCRPTDAIDCIKQSMPTSL 1763
            RWFPRHTRKFGYSEREDGRLEVYRGSNSSRFTASHSFRHDQDCRPTDAIDCIKQSMPTSL
Sbjct: 1773 RWFPRHTRKFGYSEREDGRLEVYRGSNSSRFTASHSFRHDQDCRPTDAIDCIKQSMPTSL 1832

Query: 1764 PDAHPAEVCSLASAASHSVNGQKVRKRKSRWDQPADTSLDLRSKEQKLESTSVQELNSSQ 1823
            PDAHPAEVCSLASAASHSVNGQKVRKRKSRWDQPADTSLDLRSKEQKLESTSVQELNSSQ
Sbjct: 1833 PDAHPAEVCSLASAASHSVNGQKVRKRKSRWDQPADTSLDLRSKEQKLESTSVQELNSSQ 1892

Query: 1824 LNSVGAASMLIDKVNNDDKDISLSDSVGVPCRQDEDIRADSAVPNIPEDIPPGFSSPFNP 1883
            LNSVGAASMLIDKVNNDDKDISLSDSVGVPCRQDEDIRADSAVPNIPEDIPPGFSSPFNP
Sbjct: 1893 LNSVGAASMLIDKVNNDDKDISLSDSVGVPCRQDEDIRADSAVPNIPEDIPPGFSSPFNP 1952

Query: 1884 PVASSSAFSAVLDPPRQNIGDLSCAFSTVGHLQERFISRLPVSYGIPFSIIEQCGTSHAE 1943
            PVASSSAFSAVLDPPRQNIGDLSCAFSTVGHLQERFISRLPVSYGIPFSIIEQCGTSHAE
Sbjct: 1953 PVASSSAFSAVLDPPRQNIGDLSCAFSTVGHLQERFISRLPVSYGIPFSIIEQCGTSHAE 2012

Query: 1944 NLECWDVAPGVPFHPFPPLPPYPRGMRGLPTSACGTAGQSSQEGQVNSHDSRTSFSEESP 2003
            NLECWDVAPGVPFHPFPPLPPYPRGMRGLPTSACGTAGQSSQEGQVNSHDSRTSFSEESP
Sbjct: 2013 NLECWDVAPGVPFHPFPPLPPYPRGMRGLPTSACGTAGQSSQEGQVNSHDSRTSFSEESP 2072

Query: 2004 PSTSTNYQTDLCTPSNNQQIAKRAKESSCDLGRRYFRQQKWRNTKFGPPWLQRRSQWGCQ 2063
            PSTSTNYQTDLCTPSNNQQIAKRAKESSCDLGRRYFRQQKWRNTKFGPPWLQRRSQWGCQ
Sbjct: 2073 PSTSTNYQTDLCTPSNNQQIAKRAKESSCDLGRRYFRQQKWRNTKFGPPWLQRRSQWGCQ 2132

Query: 2064 GNFRGGVSTIGDENIPDEEISPYCSDEASGRVDKANGDFYQHLQNQNLR 2112
            GNFRGGVSTIGDENIPDEEISPYCSDEASGRVDKANGDFYQHLQNQNLR
Sbjct: 2133 GNFRGGVSTIGDENIPDEEISPYCSDEASGRVDKANGDFYQHLQNQNLR 2151

BLAST of CsGy6G019510 vs. ExPASy TrEMBL
Match: A0A5A7UPQ6 (Histone-lysine N-methyltransferase ASHH2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold221G001140 PE=4 SV=1)

HSP 1 Score: 3907 bits (10133), Expect = 0.0
Identity = 2004/2113 (94.84%), Positives = 2037/2113 (96.40%), Query Frame = 0

Query: 1    MGSCDDPAVIGEPFRASVTRLVRCSSQPLPEHQSHQEMASFSSNSREGQMFEPDRGLEVT 60
            MGSCDDPAVIGEPFRASVTRLVRCSSQPLPEHQSHQEMASFSS+SREGQMFEPDRGL VT
Sbjct: 1    MGSCDDPAVIGEPFRASVTRLVRCSSQPLPEHQSHQEMASFSSSSREGQMFEPDRGLGVT 60

Query: 61   TASLCTNASDPDTSGEDGTLRGFEHADSLLMDKRLDGDSGGSDPCLNLDNESCNEGNKTL 120
            TAS+C NASDPDT GEDGTL  FEHADSLLMDKRLDGD GGSDPCLNL+NESCNEGN+TL
Sbjct: 61   TASVCMNASDPDTYGEDGTLGAFEHADSLLMDKRLDGDFGGSDPCLNLENESCNEGNRTL 120

Query: 121  SLDMKESEDVDGLVDILGCDATMEMISLTESLVNSVKPEELDNNSCIIDAPAKVERDDTA 180
            SLDMKESEDVDG VDILGCDATMEMISLTESLVNSVKPEELD NSCI DAPAKVERDDT 
Sbjct: 121  SLDMKESEDVDGFVDILGCDATMEMISLTESLVNSVKPEELDKNSCIFDAPAKVERDDTV 180

Query: 181  QNGPILAGTGTRTDDLKSSYVCEIVSNSASADGLPNDFIQKNELENDGAGCSFSEVADRI 240
            QNGPIL GTGTRTDDLKSSYVCEIVSNSASADGLPNDFIQ+N++ENDGAGCSFSEVADRI
Sbjct: 181  QNGPILVGTGTRTDDLKSSYVCEIVSNSASADGLPNDFIQQNKMENDGAGCSFSEVADRI 240

Query: 241  TEASVELEADMLNEMSPLQSGQILPIHVGQSIANYDRYVCRMDGKSLSSTSGETVTVVAD 300
            TEASVELEADMLNE+SPLQSGQILPI VGQSIAN DRYVC+MDGKSLSSTSGETV  VAD
Sbjct: 241  TEASVELEADMLNEISPLQSGQILPIDVGQSIANCDRYVCQMDGKSLSSTSGETVIEVAD 300

Query: 301  MNSNPEGCLQMLPSQGCDRIGECLQSDGLPLTINASENDLCEEKHDSNSSSKYVPDVGGD 360
            MNSNPE CLQMLPSQGCDRIGECLQSDGLPLTI+ASENDLCEEKHDSNSSSKY+PDVGGD
Sbjct: 301  MNSNPEVCLQMLPSQGCDRIGECLQSDGLPLTIHASENDLCEEKHDSNSSSKYIPDVGGD 360

Query: 361  DSDVLTNNNSDGGQHTVPGIGNDHNLEDATVQVNHDCVELLSSPLPSQLPNSEKDEFYGM 420
            DSDVLTNNNSDGGQH VPGIGNDHNLEDATVQVNH+CVELL+SPLPSQ PNSEKDEFYG 
Sbjct: 361  DSDVLTNNNSDGGQHVVPGIGNDHNLEDATVQVNHNCVELLASPLPSQPPNSEKDEFYGT 420

Query: 421  LNGADIPIKYISSVNSCSVGDQDNNDIEKVGCVSEVKCPETVITSSKRSGRRRTSSQKTV 480
            L   DIPIKYISSVNS  +GDQDNNDI KVGCVSEVKCPETVI SSKRSGRRRTSSQK V
Sbjct: 421  LK-EDIPIKYISSVNSRCLGDQDNNDIGKVGCVSEVKCPETVIMSSKRSGRRRTSSQKAV 480

Query: 481  TKRASRKTKKKVPEPLIFDTARRRRSSISRPARPSPWGSLGHIIQSFEEIDDVLVNQTQK 540
            TKRASRKTKKKVPEPLIFDT RRRRSSISR ARPSPWGSLGHIIQSFEEIDDVLVNQTQK
Sbjct: 481  TKRASRKTKKKVPEPLIFDTTRRRRSSISRSARPSPWGSLGHIIQSFEEIDDVLVNQTQK 540

Query: 541  QGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQGKSATSTSTNRIRLKVKLGKNVGHNFLN 600
            QGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQGK ATSTSTNRIRLKVKLGKNVGHNFLN
Sbjct: 541  QGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQGKPATSTSTNRIRLKVKLGKNVGHNFLN 600

Query: 601  IVVPEIVDSSLSAKGVNCNYGNESYWEGNLEFPPSNLGVDDQKAEEEGPLRKIFCYSRNQ 660
            IVVPEIVDSSLSAKGVNCNYGN+SYWEGNLEFPPS LGVDDQK EE GPLRKIFCYSRNQ
Sbjct: 601  IVVPEIVDSSLSAKGVNCNYGNDSYWEGNLEFPPSTLGVDDQKVEE-GPLRKIFCYSRNQ 660

Query: 661  DKEDNCPDASVVNEQCTNNDSSCIVGIDKSSEKHADDNLCVSSHLVDPVA-TSDARSLDP 720
            DKE+ CPDASVVNEQCTNNDSSCI+GIDKSSEKHADDNLCVSSHLV+PV  TSD RSLDP
Sbjct: 661  DKEEKCPDASVVNEQCTNNDSSCIIGIDKSSEKHADDNLCVSSHLVEPVERTSDTRSLDP 720

Query: 721  GTSPDSEVINSVLDIQVGAARQEILQDSVLASLEDFAASGNAPGSKKGRKKDKPSRVVSC 780
            GTSPDSEVINSVLDIQVGAARQEIL DSVLASLEDFAASGNAPGSKKGRKKDKPSR VSC
Sbjct: 721  GTSPDSEVINSVLDIQVGAARQEILPDSVLASLEDFAASGNAPGSKKGRKKDKPSRAVSC 780

Query: 781  SEERGISVSACSNRSKSSKKHGRRHNVDNQLSSGETLTYADANVLNYSLTVKELSMEQVS 840
            S ERGISVSACSNRSKSSKKHGRR NVDNQL SGET TY+DANVLNYSLTV+ELSMEQVS
Sbjct: 781  SGERGISVSACSNRSKSSKKHGRRQNVDNQLGSGETFTYSDANVLNYSLTVEELSMEQVS 840

Query: 841  LLTEIELPEETLKAEDILNDKECCRADVGSVFSESENSKTFLPSQSAKKKHPKGSKSIKT 900
            LLTEIELPE+TLKA+DILNDKECCRADVGS F ESENSKTFLPSQSAKKKHPKGSKSIKT
Sbjct: 841  LLTEIELPEDTLKADDILNDKECCRADVGSTFPESENSKTFLPSQSAKKKHPKGSKSIKT 900

Query: 901  SKGKSKAPGSKNKIKNASNERVYQRKSFKNSKSKEALCDQVVTETESHQIIGNCLVDKPE 960
            SKGKSKAPGSKNKIKNASNERVYQRKSFK SKSKEALCD+VVTETESHQIIGNCLVDKPE
Sbjct: 901  SKGKSKAPGSKNKIKNASNERVYQRKSFKKSKSKEALCDRVVTETESHQIIGNCLVDKPE 960

Query: 961  KSDNIIASTVAVDLSVVQGAVNEQYMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWT 1020
            KSDNIIASTVAVDLSVVQGAVNEQYMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWT
Sbjct: 961  KSDNIIASTVAVDLSVVQGAVNEQYMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWT 1020

Query: 1021 CKDNVDKAFANCSIPQEKSNAEINAELEISDESGEENGSKKRLTYRELESFHPATVNAVP 1080
            CKDNVDKAFANCSIPQEKSNAEINAELEISDESGEENGSKKRLTYRELESFHPATV A+P
Sbjct: 1021 CKDNVDKAFANCSIPQEKSNAEINAELEISDESGEENGSKKRLTYRELESFHPATVTAIP 1080

Query: 1081 QQNKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCP 1140
            Q+NKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCP
Sbjct: 1081 QENKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCP 1140

Query: 1141 CGELCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDMHAYEARQK 1200
            CG+LCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDM+AYEARQK
Sbjct: 1141 CGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDMNAYEARQK 1200

Query: 1201 EYALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALR 1260
            EYALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALR
Sbjct: 1201 EYALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALR 1260

Query: 1261 DIKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGDPLNSEVIIQSDSDEEFPEPVM 1320
            DIKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGDPLNSEVIIQSDSDEEFPEPVM
Sbjct: 1261 DIKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGDPLNSEVIIQSDSDEEFPEPVM 1320

Query: 1321 LRGDGRSLNSNLSTAVSSMDVAKMQSSEHLKGNRDKRDQPIRIASELKISEEKVDPLKLS 1380
            LR DGRS N+NLSTAVSSMDVAKMQ SEHLKGNRDKRDQPIRIASELKISEEKVD LKL 
Sbjct: 1321 LRADGRSWNNNLSTAVSSMDVAKMQPSEHLKGNRDKRDQPIRIASELKISEEKVDTLKLP 1380

Query: 1381 ASKISEEKEDPLKLSATKISEEKEDPLNLSASTISPLHSSLEFEDSKVASPIPVPDITHQ 1440
            ASKISEEKEDPLKLSA K SEEKEDPLNLSASTISPLHSSLEFEDSKVASPIPVPDITHQ
Sbjct: 1381 ASKISEEKEDPLKLSALKTSEEKEDPLNLSASTISPLHSSLEFEDSKVASPIPVPDITHQ 1440

Query: 1441 TEDVTSQPIFVDQTEISLLDNIPDKNTCSIEQEAKLSVDDIDARKKSKLDSVEDKQVYIK 1500
            TEDVTS+PIFVDQT ISLLDNI DKNTCSIEQEAKLSVDDIDARKKSKLDSVEDK+VYIK
Sbjct: 1441 TEDVTSKPIFVDQTGISLLDNISDKNTCSIEQEAKLSVDDIDARKKSKLDSVEDKKVYIK 1500

Query: 1501 SHPRMKTSRKLGSIKKGKVSSAEKIQITNRSQISSVKPKRLIEGSPGNRFEAVEEKLNEL 1560
            SHPRMKTSRK GS+KKGKVSS EKIQITNRS ISSVKPKRLIEGSPGNRFEAVEEKLNEL
Sbjct: 1501 SHPRMKTSRKPGSVKKGKVSSVEKIQITNRSLISSVKPKRLIEGSPGNRFEAVEEKLNEL 1560

Query: 1561 LDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSRLVLTD 1620
            LDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSRLVLTD
Sbjct: 1561 LDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSRLVLTD 1620

Query: 1621 IINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTREILTSEHINGGPPCPGMESL 1680
            IINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTREILTSEHINGGPPCPGMESL
Sbjct: 1621 IINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTREILTSEHINGGPPCPGMESL 1680

Query: 1681 RESLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFGYSEREDGRLEVYRGSNSSRFTASHS 1740
            RESLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFGYSEREDGRLEVYRGSNSSRFTASHS
Sbjct: 1681 RESLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFGYSEREDGRLEVYRGSNSSRFTASHS 1740

Query: 1741 FRHDQDCRPTDAIDCIKQSMPTSLPDAHPAEVCSLASAASHSVNGQKVRKRKSRWDQPAD 1800
            +RHDQDCRPTDAIDCIKQSMPT LPDAH AEVCSLAS A  SVNGQKVRKRKSRWDQPAD
Sbjct: 1741 YRHDQDCRPTDAIDCIKQSMPTPLPDAHTAEVCSLASVAGPSVNGQKVRKRKSRWDQPAD 1800

Query: 1801 TSLDLRSKEQKLESTSVQELNSSQLNSVGAASMLIDKVNNDDKDISLSDSVGVPCRQDED 1860
            TSLDLRSKEQKLESTSVQELNSSQLNSV  ASMLIDKVNNDDKD SLSDSVGVPCRQDED
Sbjct: 1801 TSLDLRSKEQKLESTSVQELNSSQLNSVRVASMLIDKVNNDDKDSSLSDSVGVPCRQDED 1860

Query: 1861 IRADSAVPNIPEDIPPGFSSPFNPPVASSSAFSAVLDPPRQNIGDLSCAFSTVGHLQERF 1920
             RADSAVPNIPEDIPPGFSSPFNP VASSSAFSAVLDPP+QNIG LSCAFSTVGHLQERF
Sbjct: 1861 TRADSAVPNIPEDIPPGFSSPFNPSVASSSAFSAVLDPPQQNIGYLSCAFSTVGHLQERF 1920

Query: 1921 ISRLPVSYGIPFSIIEQCGTSHAENLECWDVAPGVPFHPFPPLPPYPRGMRGLPTSACGT 1980
            ISRLPVSYGIPFSIIEQCGTS AENLECWDVAPGVPFHPFPPLPPYPRGM G  TSACGT
Sbjct: 1921 ISRLPVSYGIPFSIIEQCGTSRAENLECWDVAPGVPFHPFPPLPPYPRGMSGPRTSACGT 1980

Query: 1981 AGQSSQEGQVNSHDSRTSFSEESPPSTSTNYQTDLCTPSNNQQIAKRAKESSCDLGRRYF 2040
            AGQSSQEGQVNSHDSRTSFSEESPPSTSTNYQTDLCTPSNNQQI KR KESSCDLGRRYF
Sbjct: 1981 AGQSSQEGQVNSHDSRTSFSEESPPSTSTNYQTDLCTPSNNQQITKRPKESSCDLGRRYF 2040

Query: 2041 RQQKWRNTKFGPPWLQRRSQWGCQGNFRGGVSTIGDENIPDEEISPYCSDEASGRVDKAN 2100
            RQQKWRNTKFGPPWLQRRSQWGCQGNFRGGVSTIGDENIPDEEISPYCSDEASGRVDKAN
Sbjct: 2041 RQQKWRNTKFGPPWLQRRSQWGCQGNFRGGVSTIGDENIPDEEISPYCSDEASGRVDKAN 2100

Query: 2101 GDFYQHLQNQNLR 2112
            GDFYQHLQNQNLR
Sbjct: 2101 GDFYQHLQNQNLR 2111

BLAST of CsGy6G019510 vs. ExPASy TrEMBL
Match: A0A1S3B3U9 (LOW QUALITY PROTEIN: histone-lysine N-methyltransferase ASHH2 OS=Cucumis melo OX=3656 GN=LOC103485694 PE=4 SV=1)

HSP 1 Score: 3831 bits (9934), Expect = 0.0
Identity = 1975/2114 (93.42%), Positives = 2006/2114 (94.89%), Query Frame = 0

Query: 1    MGSCDDPAVIGEPFRASVTRLVRCSSQPLPEHQSHQEMASFSSNSREGQMFEPDRGLEVT 60
            MGSCDDPAVIGEPFRASVTRLVRCSSQPLPEHQSHQEMASFSS+SREGQMFEPDRGL VT
Sbjct: 1    MGSCDDPAVIGEPFRASVTRLVRCSSQPLPEHQSHQEMASFSSSSREGQMFEPDRGLGVT 60

Query: 61   TASLCTNASDPDTSGEDGTLRGFEHADSLLMDKRLDGDSGGSDPCLNLDNESCNEGNKTL 120
            TAS+C NASDPDT GEDGTL  FEHADSLLMDKRLDGD GGSDPCLNL+NESCNEGN+TL
Sbjct: 61   TASVCMNASDPDTYGEDGTLGAFEHADSLLMDKRLDGDFGGSDPCLNLENESCNEGNRTL 120

Query: 121  SLDMKESEDVDGLVDILGCDATMEMISLTESLVNSVKPEELDNNSCIIDAPAKVERDDTA 180
            SLDMKESEDVDG VDILGCDATMEMISLTESLVNSVKPEELD NSCI DAPAKVERDDT 
Sbjct: 121  SLDMKESEDVDGFVDILGCDATMEMISLTESLVNSVKPEELDKNSCIFDAPAKVERDDTV 180

Query: 181  QNGPILAGTGTRTDDLKSSYVCEIVSNSASADGLPNDFIQKNELENDGAGCSFSEVADRI 240
            QNGPIL GTGTRTDDLKSSYVCEIVSNSASADGLPNDFIQ+N++ENDGAGCSFSEVADRI
Sbjct: 181  QNGPILVGTGTRTDDLKSSYVCEIVSNSASADGLPNDFIQQNKMENDGAGCSFSEVADRI 240

Query: 241  TEASVELEADMLNEMSPLQSGQILPIHVGQSIANYDRYVCRMDGKSLSSTSGETVTVVAD 300
            TEASVELEADMLNE+SPLQSGQILPI VGQSIAN DRYVC+MDGKSLSSTSGETV  VAD
Sbjct: 241  TEASVELEADMLNEISPLQSGQILPIDVGQSIANCDRYVCQMDGKSLSSTSGETVIEVAD 300

Query: 301  MNSNPEGCLQMLPSQGCDRIGECLQSDGLPLTINASENDLCEEKHDSNSSSKYVPDVGGD 360
            MNSNPE CLQMLPSQGCDRIGECLQSDGLPLTI+ASENDLCEEKHDSNSSSKY+PDVGGD
Sbjct: 301  MNSNPEVCLQMLPSQGCDRIGECLQSDGLPLTIHASENDLCEEKHDSNSSSKYIPDVGGD 360

Query: 361  DSDVLTNNNSDGGQHTVPGIGNDHNLEDATVQVNHDCVELLSSPLPSQLPNSEKDEFYGM 420
            DSDVLTNNNSDGGQH VPGIGNDHNLEDATVQVNH+CVELL+SPLPSQ PNSEKDEFYG 
Sbjct: 361  DSDVLTNNNSDGGQHVVPGIGNDHNLEDATVQVNHNCVELLASPLPSQPPNSEKDEFYGT 420

Query: 421  LNGADIPIKYISSVNSCSVGDQDNNDIEKVGCVSEVKCPETVITSSKRSGRRRTSSQKTV 480
            L   DIPIKYISSVNS  +GDQDNNDI KVGCVSEVKCPETVI SSKRSGRRRTSSQK V
Sbjct: 421  LK-EDIPIKYISSVNSRCLGDQDNNDIGKVGCVSEVKCPETVIMSSKRSGRRRTSSQKAV 480

Query: 481  TKRASRKTKKKVPEPLIFDTARRRRSSISRPARPSPWGSLGHIIQSFEEIDDVLVNQTQK 540
            TKRASRKTKKKVPEPLIFDT RRRRSSISR ARPSPWGSLGHIIQSFEEIDDVLVNQTQK
Sbjct: 481  TKRASRKTKKKVPEPLIFDTTRRRRSSISRSARPSPWGSLGHIIQSFEEIDDVLVNQTQK 540

Query: 541  QGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQGKSATSTSTNRIRLKVKLGKNVGHNFLN 600
            QGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQGK ATSTSTNRIRLKVKLGKNVGHNFLN
Sbjct: 541  QGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQGKPATSTSTNRIRLKVKLGKNVGHNFLN 600

Query: 601  IVVPEIVDSSLSAKGVNCNYGNESYWEGNLEFPPSNLGVDDQKAEEEGPLRKIFCYSRNQ 660
            IVVPEIVDSSLSAKGVNCNYGN+SYWEGNLEFPPS LGVDDQK EE GPLRKIFCYSRNQ
Sbjct: 601  IVVPEIVDSSLSAKGVNCNYGNDSYWEGNLEFPPSTLGVDDQKVEE-GPLRKIFCYSRNQ 660

Query: 661  DKEDNCPDASVVNEQCTNNDSSCIVGIDKSSEKHADDNLCVSSHLVDPVA-TSDARSLDP 720
            DKE+ CPDASVVNEQCTNNDSSCI+GIDKSSEKHADDNLCVSSHLV+PV  TSD RSLDP
Sbjct: 661  DKEEKCPDASVVNEQCTNNDSSCIIGIDKSSEKHADDNLCVSSHLVEPVERTSDTRSLDP 720

Query: 721  GTSPDSEVINSVLDIQVGAARQEILQDSVLASLEDFAASGNAPGSKKGRKKDKPSRVVSC 780
            GTSPDSEVINSVLDIQVGAARQEIL DSVLASLEDFAASGNAPGSKKGRKKDKPSR VSC
Sbjct: 721  GTSPDSEVINSVLDIQVGAARQEILPDSVLASLEDFAASGNAPGSKKGRKKDKPSRAVSC 780

Query: 781  SEERGISVSACSNRSKSSKKHGRRHNVDNQLSSGETLTYADANVLNYSLTVKELSMEQVS 840
            S ERGISVSACSNRSKSSKKHGRR NVDNQL S                           
Sbjct: 781  SGERGISVSACSNRSKSSKKHGRRQNVDNQLGS--------------------------- 840

Query: 841  LLTEIELPEETLKAEDILNDKECCRADVGSVFSESENSKTFLPSQSAKKKHPKGSKSIKT 900
               EIELPE+TLKA+DILNDKECCRADVGS F ESENSKTFLPSQSAKKKHPKGSKSIKT
Sbjct: 841  ---EIELPEDTLKADDILNDKECCRADVGSTFPESENSKTFLPSQSAKKKHPKGSKSIKT 900

Query: 901  SKGKSKAPGSKNKIKNASNERVYQRKSFKNSKSKEALCDQVVTETESHQIIGNCLVDKPE 960
            SKGKSKAPGSKNKIKNASNERVYQRKSFK SKSKEALCD+VVTETESHQIIGNCLVDKPE
Sbjct: 901  SKGKSKAPGSKNKIKNASNERVYQRKSFKKSKSKEALCDRVVTETESHQIIGNCLVDKPE 960

Query: 961  KSDNIIASTVAVDLSVVQGAVNEQYMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWT 1020
            KSDNIIASTVAVDLSVVQGAVNEQYMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWT
Sbjct: 961  KSDNIIASTVAVDLSVVQGAVNEQYMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWT 1020

Query: 1021 CKDNVDKAFANCSIPQEKSNAEINAELEISDESGEENGSKKRLTYRELESFHPATVNAVP 1080
            CKDNVDKAFANCSIPQEKSNAEINAELEISDESGEENGSKKRLTYRELESFHPATV A+P
Sbjct: 1021 CKDNVDKAFANCSIPQEKSNAEINAELEISDESGEENGSKKRLTYRELESFHPATVTAIP 1080

Query: 1081 QQNKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCP 1140
            Q+NKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCP
Sbjct: 1081 QENKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCP 1140

Query: 1141 CGELCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDMHAYEARQK 1200
            CG+LCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDM+AYEARQK
Sbjct: 1141 CGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDMNAYEARQK 1200

Query: 1201 EYALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALR 1260
            EYALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALR
Sbjct: 1201 EYALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALR 1260

Query: 1261 DIKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGDPLNSEVIIQSDSDEEFPEPVM 1320
            DIKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGDPLNSEVIIQSDSDEEFPEPVM
Sbjct: 1261 DIKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGDPLNSEVIIQSDSDEEFPEPVM 1320

Query: 1321 LRGDGRSLNSNLSTAVSSMDVAKMQSSEHLKGNRDKRDQPIRIASELKISEEKVDPLKLS 1380
            LR DGRS N+NLSTAVSSMDVAKMQ SEHLKGNRDKRDQPIRIASELKISEEKVD LKL 
Sbjct: 1321 LRADGRSWNNNLSTAVSSMDVAKMQPSEHLKGNRDKRDQPIRIASELKISEEKVDTLKLP 1380

Query: 1381 ASKISEEKEDPLKLSATKISEEKEDPLNLSASTISPLHSSLEFEDSKVASPIPVPDITHQ 1440
            ASKISEEKEDPLKLSA K SEEKEDPLNLSASTISPLHSSLEFEDSKVASPIPVPDITHQ
Sbjct: 1381 ASKISEEKEDPLKLSALKTSEEKEDPLNLSASTISPLHSSLEFEDSKVASPIPVPDITHQ 1440

Query: 1441 TEDVTSQPIFVDQTEISLLDNIPDKNTCSIEQEAKLSVDDIDARKKSKLDSVEDKQVYIK 1500
            TEDVTS+PIFVDQT ISLLDNI DKNTCSIEQEAKLSVDDIDARKKSKLDSVEDK+VYIK
Sbjct: 1441 TEDVTSKPIFVDQTGISLLDNISDKNTCSIEQEAKLSVDDIDARKKSKLDSVEDKKVYIK 1500

Query: 1501 SHPRMKTSRKLGSIKKGK-VSSAEKIQITNRSQISSVKPKRLIEGSPGNRFEAVEEKLNE 1560
            SHPRMKTSRK GS+KK K  SS EKIQITNRS ISSVKPKRLIEGSPGNRFEAVEEKLNE
Sbjct: 1501 SHPRMKTSRKPGSVKKRKSXSSVEKIQITNRSLISSVKPKRLIEGSPGNRFEAVEEKLNE 1560

Query: 1561 LLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSRLVLT 1620
            LLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSRLVLT
Sbjct: 1561 LLDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSRLVLT 1620

Query: 1621 DIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTREILTSEHINGGPPCPGMES 1680
            DIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTREILTSEHINGGPPCPGMES
Sbjct: 1621 DIINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTREILTSEHINGGPPCPGMES 1680

Query: 1681 LRESLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFGYSEREDGRLEVYRGSNSSRFTASH 1740
            LRESLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFGYSEREDGRLEVYRGSNSSRFTASH
Sbjct: 1681 LRESLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFGYSEREDGRLEVYRGSNSSRFTASH 1740

Query: 1741 SFRHDQDCRPTDAIDCIKQSMPTSLPDAHPAEVCSLASAASHSVNGQKVRKRKSRWDQPA 1800
            S+RHDQDCRPTDAIDCIKQSMPT LPDAH AEVCSLAS A  SVNGQKVRKRKSRWDQPA
Sbjct: 1741 SYRHDQDCRPTDAIDCIKQSMPTPLPDAHTAEVCSLASVAGPSVNGQKVRKRKSRWDQPA 1800

Query: 1801 DTSLDLRSKEQKLESTSVQELNSSQLNSVGAASMLIDKVNNDDKDISLSDSVGVPCRQDE 1860
            DTSLDLRSKEQKLESTSVQELNSSQLNSV  ASMLIDKVNNDDKD SLSDSVGVPCRQDE
Sbjct: 1801 DTSLDLRSKEQKLESTSVQELNSSQLNSVRVASMLIDKVNNDDKDSSLSDSVGVPCRQDE 1860

Query: 1861 DIRADSAVPNIPEDIPPGFSSPFNPPVASSSAFSAVLDPPRQNIGDLSCAFSTVGHLQER 1920
            D RADSAVPNIPEDIPPGFSSPFNP VASSSAFSAVLDPP+QNIG LSCAFSTVGHLQER
Sbjct: 1861 DTRADSAVPNIPEDIPPGFSSPFNPSVASSSAFSAVLDPPQQNIGYLSCAFSTVGHLQER 1920

Query: 1921 FISRLPVSYGIPFSIIEQCGTSHAENLECWDVAPGVPFHPFPPLPPYPRGMRGLPTSACG 1980
            FISRLPVSYGIPFSIIEQCGTS AENLECWDVAPGVPFHPFPPLPPYPRGM G  TSACG
Sbjct: 1921 FISRLPVSYGIPFSIIEQCGTSRAENLECWDVAPGVPFHPFPPLPPYPRGMSGPRTSACG 1980

Query: 1981 TAGQSSQEGQVNSHDSRTSFSEESPPSTSTNYQTDLCTPSNNQQIAKRAKESSCDLGRRY 2040
            TAGQSSQEGQVNSHDSRTSFSEESPPSTSTNYQTDLCTPSNNQQI KR KESSCDLGRRY
Sbjct: 1981 TAGQSSQEGQVNSHDSRTSFSEESPPSTSTNYQTDLCTPSNNQQITKRPKESSCDLGRRY 2040

Query: 2041 FRQQKWRNTKFGPPWLQRRSQWGCQGNFRGGVSTIGDENIPDEEISPYCSDEASGRVDKA 2100
            FRQQKWRNTKFGPPWLQRRSQWGCQGNFRGGVSTIGDENIPDEEISPYCSDEASGRVDKA
Sbjct: 2041 FRQQKWRNTKFGPPWLQRRSQWGCQGNFRGGVSTIGDENIPDEEISPYCSDEASGRVDKA 2082

Query: 2101 NGDFYQHLQNQNLR 2112
            NGDFYQHLQNQNLR
Sbjct: 2101 NGDFYQHLQNQNLR 2082

BLAST of CsGy6G019510 vs. ExPASy TrEMBL
Match: A0A6J1JXF7 (histone-lysine N-methyltransferase ASHH2-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111489728 PE=4 SV=1)

HSP 1 Score: 3358 bits (8707), Expect = 0.0
Identity = 1750/2114 (82.78%), Positives = 1874/2114 (88.65%), Query Frame = 0

Query: 1    MGSCDDPAVIGEPFRASVTRLVRCSSQPLPEHQSHQEMASFSSNSREGQMFEPDRGLEVT 60
            MGSCDDPAVIGEP R S+T L  CSSQPLP++QS QEM+S  SNS + QM EPDRGL VT
Sbjct: 1    MGSCDDPAVIGEPVRGSITPLFGCSSQPLPKYQSRQEMSSLPSNSSDCQMSEPDRGLGVT 60

Query: 61   TASLCTNASDPDTSGEDGTLRGFEHADSLLMDKRLDGDSGGSDPCLNLDNESCNEGNKTL 120
              ++C NAS+PDT+GEDGT RGFEHAD+LL+DKRLD DSG SDPCLN +NE+CN  N+TL
Sbjct: 61   ATNICMNASEPDTAGEDGTFRGFEHADTLLLDKRLDCDSGDSDPCLNEENEACNVENRTL 120

Query: 121  SLDMKESEDVDGLVDILGCDATMEMISLTESLVNSVKPEELDNNSCIIDAPAKVERDDTA 180
             LDMKES+DVD LVDILGC  TMEM+SL  SL+NSVK E LD+NSCIIDA  KVE  D  
Sbjct: 121  KLDMKESQDVDDLVDILGCKTTMEMMSLAGSLMNSVKSEGLDHNSCIIDASEKVESGDIV 180

Query: 181  QNGPILAGTGTRTDDLKSSYVCEIVSNSASADGLPNDFIQKNELENDGAGCSFSEVADRI 240
            +NGP+LA  GT TDDLKS ++CEIVS+SASADGL +DFI + +LENDGAGCSFSEVADR+
Sbjct: 181  ENGPLLARIGTCTDDLKSPHICEIVSSSASADGLSSDFIHQKQLENDGAGCSFSEVADRL 240

Query: 241  TEASVELEADMLNEMSPLQSGQILPIHVGQSIANYDRYVCRMDGKSLSSTSGETVTVVAD 300
            TEA VE++ADMLNEMSPLQS QILP H+G+S+AN ++Y+C+MDGKSLS TSGETV   AD
Sbjct: 241  TEALVEIDADMLNEMSPLQSDQILPTHMGRSVANCEQYICQMDGKSLSGTSGETVNEFAD 300

Query: 301  MNSNPEGCLQMLPSQGCDRIGECLQSDGLPLTINASENDLCEEKHDSNSSSKYVPDVGGD 360
            MNSNPE CLQMLPSQGC+RI ECLQ+D  PLTI++ E + C+EKHDSNS  KY+P+VG D
Sbjct: 301  MNSNPELCLQMLPSQGCERIRECLQADDSPLTIHSPEINRCDEKHDSNSLPKYIPEVGDD 360

Query: 361  DSDVLTNNNSDGGQHTVPGIGNDHNLEDATVQVNHDCVELLSSPLPSQLPNSEKDEFYGM 420
            D  VLT+ N DGGQH VP + N+ NLE+A++Q N +CVELL+SPLPSQ  NSEK EFYGM
Sbjct: 361  DFVVLTDINGDGGQHIVPDMENNCNLEEASIQENTNCVELLASPLPSQPFNSEKYEFYGM 420

Query: 421  LNGADIPIKYISSVNSCSVGDQDNNDIEKVGCVSEVKCPETVITSSKRSGRRRTSSQKTV 480
            L GAD+PIK     NSCSV DQDNND EKVG VSEVKCPETV+ SSKRSGRRR SSQK V
Sbjct: 421  LIGADMPIKD----NSCSVSDQDNNDTEKVGRVSEVKCPETVLMSSKRSGRRRMSSQKNV 480

Query: 481  TKRASRKTKKKVPEPLIFDTARRRRSSISRPARPSPWGSLGHIIQSFEEIDDVLVNQTQK 540
            TKRASRK+KK VPEPLIFDT RRRRSSISRPARP PWGSLG IIQSFE+IDDVLVNQ++K
Sbjct: 481  TKRASRKSKKIVPEPLIFDTTRRRRSSISRPARPLPWGSLGFIIQSFEKIDDVLVNQSKK 540

Query: 541  QGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQGKSATSTSTNRIRLKVKLGKNVGHNFLN 600
            QGNEKSKGNQGG KR+KKQ SESSHRSRKGTQGK  TSTSTNRIRLKVKLGKNVGHNFLN
Sbjct: 541  QGNEKSKGNQGGTKRSKKQPSESSHRSRKGTQGKCDTSTSTNRIRLKVKLGKNVGHNFLN 600

Query: 601  IVVPEIVDSSLSAKGVNCNYGNESYWEGNLEFPPSNLGVDDQKAEEEGPLRKIFCYSRNQ 660
            IVVPEIVDSSLSAKG+NC+YGNESYWEGNLEFPPS   VDDQK EE GPLRKIFCYS+NQ
Sbjct: 601  IVVPEIVDSSLSAKGINCHYGNESYWEGNLEFPPS---VDDQKPEE-GPLRKIFCYSKNQ 660

Query: 661  DKEDNCPDASVVNEQCTNNDSSCIVGIDKSSEKHADDNLCVSSHLVDPVA-TSDARSLDP 720
             KE+ CPDASVVNEQC NNDSSC V IDKSS KHADDNLCVSSHLV+PV   SD R LDP
Sbjct: 661  GKEEKCPDASVVNEQCANNDSSCTVTIDKSSTKHADDNLCVSSHLVEPVERASDTRCLDP 720

Query: 721  GTSPDSEVINSVLDIQVGAARQEILQDSVLASLEDFAASGNAPGSKKGRKKDKPSRVVSC 780
            GTSPDSEVINS+LDIQVGA RQE LQDSVL SLEDFAASGNA  SKKGRKK+KP + VSC
Sbjct: 721  GTSPDSEVINSMLDIQVGAMRQEKLQDSVLPSLEDFAASGNATSSKKGRKKEKPCQAVSC 780

Query: 781  SEERGISVSACSNRSKSSKKHGRRHNVDNQLSSGETLTYADANVLNYSLTVKELSMEQVS 840
            S+E G   SAC+NRSKSSKKHGRR NVDNQL SGET TY DANV+NYSLTVKELSM+QV 
Sbjct: 781  SDEAGTGASACNNRSKSSKKHGRRLNVDNQLGSGETFTYTDANVVNYSLTVKELSMDQVP 840

Query: 841  LLTEIELPEETLKAEDILNDKECCRADVGSVFSESENSKTFLPSQSAKKKHPKGSKSIKT 900
            L TEIELPEE LKA+ IL DKEC R DVGSVF ESENSKTFLPSQSA KKHPKGSKSIKT
Sbjct: 841  LSTEIELPEEALKADGILEDKECYRTDVGSVFPESENSKTFLPSQSAGKKHPKGSKSIKT 900

Query: 901  SKGKSKAPGSKNKIKNASNERVYQRKSFKNSKSKEALCDQVVTETESHQIIGNCLVDKPE 960
            SKGKSKAPGSKNKIKNAS ERVY+RKSF N    EALCDQVVTETESHQI+GN LVDKPE
Sbjct: 901  SKGKSKAPGSKNKIKNASKERVYRRKSF-NKSITEALCDQVVTETESHQIVGNYLVDKPE 960

Query: 961  KSDNIIASTVAVDLSVVQGAVNEQYMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWT 1020
            KS++IIASTVAV+L+VVQGAVNEQYMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWT
Sbjct: 961  KSNDIIASTVAVNLNVVQGAVNEQYMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWT 1020

Query: 1021 CKDNVDKAFANCSIPQEKSNAEINAELEISDESGEENGSKKRLTYRELESFHPATVNAVP 1080
            CK+NVDKAFA+CSIPQEKSNAEINAELEISDESGEEN S KRLTYREL+SFHP TV AVP
Sbjct: 1021 CKENVDKAFADCSIPQEKSNAEINAELEISDESGEENASNKRLTYRELDSFHPTTVTAVP 1080

Query: 1081 QQNKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCP 1140
            Q+NKF SISSN FLHRSRKTQTIDEIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCP
Sbjct: 1081 QENKFTSISSNHFLHRSRKTQTIDEIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCP 1140

Query: 1141 CGELCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDMHAYEARQK 1200
            CG+LCSNQQFQKRKYAKLQWLRCGKKGYGLQ +EDISKGQFLIEYVGEVLDMHAYEARQK
Sbjct: 1141 CGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQSVEDISKGQFLIEYVGEVLDMHAYEARQK 1200

Query: 1201 EYALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALR 1260
            EYALNGHRHFYFMTL+GSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALR
Sbjct: 1201 EYALNGHRHFYFMTLDGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALR 1260

Query: 1261 DIKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGDPLNSEVIIQSDSDEEFPEPVM 1320
            DIKKGEEVTFDYNYVRVFGAAAKKCYCGSF CRGYIGGDPLNSEVIIQSDSDEEFPEPVM
Sbjct: 1261 DIKKGEEVTFDYNYVRVFGAAAKKCYCGSFQCRGYIGGDPLNSEVIIQSDSDEEFPEPVM 1320

Query: 1321 LRGDGRSLNSNLSTAVSSMDVAKMQSSEHLKGNRDKRDQPIRIASELKISEEKVDPLKLS 1380
            LR DGRS N+NL T VS +D  K Q SEH+KG RDK+DQP R + E              
Sbjct: 1321 LRPDGRSWNNNLPTTVSLLDGVKKQPSEHIKGVRDKKDQPSRTSVE-------------- 1380

Query: 1381 ASKISEEKEDPLKLSATKISEEKEDPLNLSASTISPLHSSLEFEDSKVASPIPVPDITHQ 1440
             SKIS+EKED LKLSA+KISE KEDPLNLSASTISPLHSSLEFEDSKVASP P+ DITHQ
Sbjct: 1381 -SKISDEKEDTLKLSASKISEAKEDPLNLSASTISPLHSSLEFEDSKVASPTPLADITHQ 1440

Query: 1441 TEDVTSQPIFVDQTEISLLDNIPDKNTCSIEQEAKLSVDDIDARKKSKLDSVEDKQVYIK 1500
            TEDVTSQP+FVDQ EIS  DN  DKNTCSIEQEAKLSV DIDARKKSKL ++EDK+VYIK
Sbjct: 1441 TEDVTSQPVFVDQPEISPGDNNSDKNTCSIEQEAKLSVADIDARKKSKLVAIEDKKVYIK 1500

Query: 1501 SHPRMKTSRKLGSIKKGKVSSAEKIQITNRSQISSVKPKRLIEGSPGNRFEAVEEKLNEL 1560
            SH RMKTSRK GSIKKGKVSS EK+QI NR QISSVKPKRL++GSPGNRFEAVEEKLNEL
Sbjct: 1501 SHLRMKTSRKPGSIKKGKVSSVEKVQIANRPQISSVKPKRLVDGSPGNRFEAVEEKLNEL 1560

Query: 1561 LDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSRLVLTD 1620
            LDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSR+VLTD
Sbjct: 1561 LDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSRVVLTD 1620

Query: 1621 IINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTREILTSEHINGGPPCPGMESL 1680
            I+NKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLV REILTSEHINGGPPCPGMESL
Sbjct: 1621 IMNKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVMREILTSEHINGGPPCPGMESL 1680

Query: 1681 RESLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFGYSEREDGRLEVYRGSNSSRFTASHS 1740
            R+SLLSLTEHDDKQVHQIARSFRDRWFPRHTRKF YSEREDGRLEVYRGSN SRFTASHS
Sbjct: 1681 RDSLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFVYSEREDGRLEVYRGSNCSRFTASHS 1740

Query: 1741 FRHDQDCRPTDAIDCIKQSMPTSLPDAHPAEVCSLASAASHSVNGQKVRKRKSRWDQPAD 1800
            +R DQD RPTDAIDC+KQS+ TSLPDAHPAEVCS+AS A HS+NGQKV KRKSRWDQPAD
Sbjct: 1741 YRRDQDSRPTDAIDCVKQSLSTSLPDAHPAEVCSMASTAGHSLNGQKVCKRKSRWDQPAD 1800

Query: 1801 TSLDLRSKEQKLESTSVQELNSSQLNSVGAASMLIDKVNNDDKDISLSDSVGVPCRQDED 1860
            TSLDLRSKEQKLES SVQ+ NSSQL+SVG  SMLIDKVN+DDKD SLSDSVGV   QDED
Sbjct: 1801 TSLDLRSKEQKLESKSVQQFNSSQLSSVGVVSMLIDKVNSDDKDFSLSDSVGVRGSQDED 1860

Query: 1861 IRADSAVPNIPEDIPPGFSSPFNPPVASSSAFSAVLDPPRQNIGDLSCAFSTVGHLQERF 1920
            IRADSAV NIPEDIPPGF  PF+ PVASSSAFS VLDPPRQ+IG LSCAFSTVG+ QE+F
Sbjct: 1861 IRADSAVQNIPEDIPPGFF-PFSLPVASSSAFSTVLDPPRQSIGKLSCAFSTVGYPQEKF 1920

Query: 1921 ISRLPVSYGIPFSIIEQCGTSHAENLECWDVAPGVPFHPFPPLPPYPRGMRGLPTSACGT 1980
            IS LPVSYGIPFSI+EQCGTS AENLECWDVAPG+PFHPFPPLPPYPRG RGLPTSACGT
Sbjct: 1921 ISCLPVSYGIPFSIVEQCGTSCAENLECWDVAPGMPFHPFPPLPPYPRGKRGLPTSACGT 1980

Query: 1981 A-GQSSQEGQVNSHDSRTSFSEESPPSTSTNYQTDLCTPSNNQQIAKRAKESSCDLGRRY 2040
            A  QSSQE QVN HDSRTSFSEE+PPSTSTNYQ DLC  SN QQ +KRAKESS DLGR+Y
Sbjct: 1981 AVRQSSQEMQVNCHDSRTSFSEETPPSTSTNYQQDLCNLSNIQQTSKRAKESSYDLGRKY 2040

Query: 2041 FRQQKWRNTKFGPPWLQRRSQWGCQGNFRGGVSTIGDENIPDEEISPYCSDEASGRVDKA 2100
            FRQ+KWRNTKFGPP   R  QWG QGNFR GVST+GD+NIP+E I PYCSDEASGRVDKA
Sbjct: 2041 FRQEKWRNTKFGPP---RTDQWGYQGNFRSGVSTLGDDNIPNEGIRPYCSDEASGRVDKA 2086

Query: 2101 NGDFYQHLQNQNLR 2112
            N DFYQHLQNQN R
Sbjct: 2101 NDDFYQHLQNQNQR 2086

BLAST of CsGy6G019510 vs. ExPASy TrEMBL
Match: A0A6J1K197 (histone-lysine N-methyltransferase ASHH2-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111489728 PE=4 SV=1)

HSP 1 Score: 3345 bits (8674), Expect = 0.0
Identity = 1746/2114 (82.59%), Positives = 1870/2114 (88.46%), Query Frame = 0

Query: 1    MGSCDDPAVIGEPFRASVTRLVRCSSQPLPEHQSHQEMASFSSNSREGQMFEPDRGLEVT 60
            MGSCDDPAVIGEP R S+T L  CSSQPLP++QS QEM+S  SNS + QM EPDRGL VT
Sbjct: 1    MGSCDDPAVIGEPVRGSITPLFGCSSQPLPKYQSRQEMSSLPSNSSDCQMSEPDRGLGVT 60

Query: 61   TASLCTNASDPDTSGEDGTLRGFEHADSLLMDKRLDGDSGGSDPCLNLDNESCNEGNKTL 120
              ++C NAS+PDT+GEDGT RGFEHAD+LL+DKRLD DSG SDPCLN +NE+CN  N+TL
Sbjct: 61   ATNICMNASEPDTAGEDGTFRGFEHADTLLLDKRLDCDSGDSDPCLNEENEACNVENRTL 120

Query: 121  SLDMKESEDVDGLVDILGCDATMEMISLTESLVNSVKPEELDNNSCIIDAPAKVERDDTA 180
             LDMKES+DVD LVDILGC  TMEM+SL  SL+NSVK E LD+NSCIIDA  KVE  D  
Sbjct: 121  KLDMKESQDVDDLVDILGCKTTMEMMSLAGSLMNSVKSEGLDHNSCIIDASEKVESGDIV 180

Query: 181  QNGPILAGTGTRTDDLKSSYVCEIVSNSASADGLPNDFIQKNELENDGAGCSFSEVADRI 240
            +NGP+LA  GT TDDLKS ++CEIVS+SASADGL +DFI + +LENDGAGCSFSEVADR+
Sbjct: 181  ENGPLLARIGTCTDDLKSPHICEIVSSSASADGLSSDFIHQKQLENDGAGCSFSEVADRL 240

Query: 241  TEASVELEADMLNEMSPLQSGQILPIHVGQSIANYDRYVCRMDGKSLSSTSGETVTVVAD 300
            TEA VE++ADMLNEMSPLQS QILP H+G+S+AN ++Y+C+MDGKSLS TSGETV   AD
Sbjct: 241  TEALVEIDADMLNEMSPLQSDQILPTHMGRSVANCEQYICQMDGKSLSGTSGETVNEFAD 300

Query: 301  MNSNPEGCLQMLPSQGCDRIGECLQSDGLPLTINASENDLCEEKHDSNSSSKYVPDVGGD 360
            MNSNPE CLQMLPSQGC+RI ECLQ+D  PLTI++ E + C+EKHDSNS  KY+P+VG D
Sbjct: 301  MNSNPELCLQMLPSQGCERIRECLQADDSPLTIHSPEINRCDEKHDSNSLPKYIPEVGDD 360

Query: 361  DSDVLTNNNSDGGQHTVPGIGNDHNLEDATVQVNHDCVELLSSPLPSQLPNSEKDEFYGM 420
            D  VLT+ N DGGQH VP + N+ NLE+A++Q N +CVELL+SPLPSQ  NSEK EFYGM
Sbjct: 361  DFVVLTDINGDGGQHIVPDMENNCNLEEASIQENTNCVELLASPLPSQPFNSEKYEFYGM 420

Query: 421  LNGADIPIKYISSVNSCSVGDQDNNDIEKVGCVSEVKCPETVITSSKRSGRRRTSSQKTV 480
            L GAD+PIK     NSCSV DQDNND EKVG VSEVKCPETV+ SSKRSGRRR SSQK V
Sbjct: 421  LIGADMPIKD----NSCSVSDQDNNDTEKVGRVSEVKCPETVLMSSKRSGRRRMSSQKNV 480

Query: 481  TKRASRKTKKKVPEPLIFDTARRRRSSISRPARPSPWGSLGHIIQSFEEIDDVLVNQTQK 540
            TKRASRK+KK VPEPLIFDT RRRRSSISRPARP PWGSLG IIQSFE+IDDVLVNQ++K
Sbjct: 481  TKRASRKSKKIVPEPLIFDTTRRRRSSISRPARPLPWGSLGFIIQSFEKIDDVLVNQSKK 540

Query: 541  QGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQGKSATSTSTNRIRLKVKLGKNVGHNFLN 600
            QGNEKSKGNQGG KR+KKQ SESSHRSRKGTQGK  TSTSTNRIRLKVKLGKNVGHNFLN
Sbjct: 541  QGNEKSKGNQGGTKRSKKQPSESSHRSRKGTQGKCDTSTSTNRIRLKVKLGKNVGHNFLN 600

Query: 601  IVVPEIVDSSLSAKGVNCNYGNESYWEGNLEFPPSNLGVDDQKAEEEGPLRKIFCYSRNQ 660
            IVVPEIVDSSLSAKG+NC+YGNESYWEGNLEFPPS   VDDQK EE GPLRKIFCYS+NQ
Sbjct: 601  IVVPEIVDSSLSAKGINCHYGNESYWEGNLEFPPS---VDDQKPEE-GPLRKIFCYSKNQ 660

Query: 661  DKEDNCPDASVVNEQCTNNDSSCIVGIDKSSEKHADDNLCVSSHLVDPVA-TSDARSLDP 720
             KE+ CPDASVVNEQC NNDSSC V IDKSS KHADDNLCVSSHLV+PV   SD R LDP
Sbjct: 661  GKEEKCPDASVVNEQCANNDSSCTVTIDKSSTKHADDNLCVSSHLVEPVERASDTRCLDP 720

Query: 721  GTSPDSEVINSVLDIQVGAARQEILQDSVLASLEDFAASGNAPGSKKGRKKDKPSRVVSC 780
            GTSPDSEVINS+LDIQVGA RQE LQDSVL SLEDFAASGNA  SKKGRKK+KP + VSC
Sbjct: 721  GTSPDSEVINSMLDIQVGAMRQEKLQDSVLPSLEDFAASGNATSSKKGRKKEKPCQAVSC 780

Query: 781  SEERGISVSACSNRSKSSKKHGRRHNVDNQLSSGETLTYADANVLNYSLTVKELSMEQVS 840
            S+E G   SAC+NRSKSSKKHGRR NVDNQL SGET TY DANV+NYSLTVKELSM+QV 
Sbjct: 781  SDEAGTGASACNNRSKSSKKHGRRLNVDNQLGSGETFTYTDANVVNYSLTVKELSMDQVP 840

Query: 841  LLTEIELPEETLKAEDILNDKECCRADVGSVFSESENSKTFLPSQSAKKKHPKGSKSIKT 900
            L TEIELPEE LKA+ IL DKEC R DVGSVF ESENSKTFLPSQSA KKHPKGSKSIKT
Sbjct: 841  LSTEIELPEEALKADGILEDKECYRTDVGSVFPESENSKTFLPSQSAGKKHPKGSKSIKT 900

Query: 901  SKGKSKAPGSKNKIKNASNERVYQRKSFKNSKSKEALCDQVVTETESHQIIGNCLVDKPE 960
            SKGKSKAPGSKNKIKNAS ERVY+RKSF N    EALCDQVVTETESHQI+     DKPE
Sbjct: 901  SKGKSKAPGSKNKIKNASKERVYRRKSF-NKSITEALCDQVVTETESHQIV-----DKPE 960

Query: 961  KSDNIIASTVAVDLSVVQGAVNEQYMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWT 1020
            KS++IIASTVAV+L+VVQGAVNEQYMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWT
Sbjct: 961  KSNDIIASTVAVNLNVVQGAVNEQYMPPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWT 1020

Query: 1021 CKDNVDKAFANCSIPQEKSNAEINAELEISDESGEENGSKKRLTYRELESFHPATVNAVP 1080
            CK+NVDKAFA+CSIPQEKSNAEINAELEISDESGEEN S KRLTYREL+SFHP TV AVP
Sbjct: 1021 CKENVDKAFADCSIPQEKSNAEINAELEISDESGEENASNKRLTYRELDSFHPTTVTAVP 1080

Query: 1081 QQNKFASISSNQFLHRSRKTQTIDEIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCP 1140
            Q+NKF SISSN FLHRSRKTQTIDEIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCP
Sbjct: 1081 QENKFTSISSNHFLHRSRKTQTIDEIMVCHCKPALDGRLGCGDECLNRMLNIECVRGTCP 1140

Query: 1141 CGELCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDMHAYEARQK 1200
            CG+LCSNQQFQKRKYAKLQWLRCGKKGYGLQ +EDISKGQFLIEYVGEVLDMHAYEARQK
Sbjct: 1141 CGDLCSNQQFQKRKYAKLQWLRCGKKGYGLQSVEDISKGQFLIEYVGEVLDMHAYEARQK 1200

Query: 1201 EYALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALR 1260
            EYALNGHRHFYFMTL+GSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALR
Sbjct: 1201 EYALNGHRHFYFMTLDGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALR 1260

Query: 1261 DIKKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGDPLNSEVIIQSDSDEEFPEPVM 1320
            DIKKGEEVTFDYNYVRVFGAAAKKCYCGSF CRGYIGGDPLNSEVIIQSDSDEEFPEPVM
Sbjct: 1261 DIKKGEEVTFDYNYVRVFGAAAKKCYCGSFQCRGYIGGDPLNSEVIIQSDSDEEFPEPVM 1320

Query: 1321 LRGDGRSLNSNLSTAVSSMDVAKMQSSEHLKGNRDKRDQPIRIASELKISEEKVDPLKLS 1380
            LR DGRS N+NL T VS +D  K Q SEH+KG RDK+DQP R + E              
Sbjct: 1321 LRPDGRSWNNNLPTTVSLLDGVKKQPSEHIKGVRDKKDQPSRTSVE-------------- 1380

Query: 1381 ASKISEEKEDPLKLSATKISEEKEDPLNLSASTISPLHSSLEFEDSKVASPIPVPDITHQ 1440
             SKIS+EKED LKLSA+KISE KEDPLNLSASTISPLHSSLEFEDSKVASP P+ DITHQ
Sbjct: 1381 -SKISDEKEDTLKLSASKISEAKEDPLNLSASTISPLHSSLEFEDSKVASPTPLADITHQ 1440

Query: 1441 TEDVTSQPIFVDQTEISLLDNIPDKNTCSIEQEAKLSVDDIDARKKSKLDSVEDKQVYIK 1500
            TEDVTSQP+FVDQ EIS  DN  DKNTCSIEQEAKLSV DIDARKKSKL ++EDK+VYIK
Sbjct: 1441 TEDVTSQPVFVDQPEISPGDNNSDKNTCSIEQEAKLSVADIDARKKSKLVAIEDKKVYIK 1500

Query: 1501 SHPRMKTSRKLGSIKKGKVSSAEKIQITNRSQISSVKPKRLIEGSPGNRFEAVEEKLNEL 1560
            SH RMKTSRK GSIKKGKVSS EK+QI NR QISSVKPKRL++GSPGNRFEAVEEKLNEL
Sbjct: 1501 SHLRMKTSRKPGSIKKGKVSSVEKVQIANRPQISSVKPKRLVDGSPGNRFEAVEEKLNEL 1560

Query: 1561 LDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSRLVLTD 1620
            LDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSR+VLTD
Sbjct: 1561 LDAEGGISKRKDAPKGYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSRVVLTD 1620

Query: 1621 IINKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVTREILTSEHINGGPPCPGMESL 1680
            I+NKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLV REILTSEHINGGPPCPGMESL
Sbjct: 1621 IMNKNGLRMLHNIMKQYRSDFKKIPILRKLLKVLEYLVMREILTSEHINGGPPCPGMESL 1680

Query: 1681 RESLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFGYSEREDGRLEVYRGSNSSRFTASHS 1740
            R+SLLSLTEHDDKQVHQIARSFRDRWFPRHTRKF YSEREDGRLEVYRGSN SRFTASHS
Sbjct: 1681 RDSLLSLTEHDDKQVHQIARSFRDRWFPRHTRKFVYSEREDGRLEVYRGSNCSRFTASHS 1740

Query: 1741 FRHDQDCRPTDAIDCIKQSMPTSLPDAHPAEVCSLASAASHSVNGQKVRKRKSRWDQPAD 1800
            +R DQD RPTDAIDC+KQS+ TSLPDAHPAEVCS+AS A HS+NGQKV KRKSRWDQPAD
Sbjct: 1741 YRRDQDSRPTDAIDCVKQSLSTSLPDAHPAEVCSMASTAGHSLNGQKVCKRKSRWDQPAD 1800

Query: 1801 TSLDLRSKEQKLESTSVQELNSSQLNSVGAASMLIDKVNNDDKDISLSDSVGVPCRQDED 1860
            TSLDLRSKEQKLES SVQ+ NSSQL+SVG  SMLIDKVN+DDKD SLSDSVGV   QDED
Sbjct: 1801 TSLDLRSKEQKLESKSVQQFNSSQLSSVGVVSMLIDKVNSDDKDFSLSDSVGVRGSQDED 1860

Query: 1861 IRADSAVPNIPEDIPPGFSSPFNPPVASSSAFSAVLDPPRQNIGDLSCAFSTVGHLQERF 1920
            IRADSAV NIPEDIPPGF  PF+ PVASSSAFS VLDPPRQ+IG LSCAFSTVG+ QE+F
Sbjct: 1861 IRADSAVQNIPEDIPPGFF-PFSLPVASSSAFSTVLDPPRQSIGKLSCAFSTVGYPQEKF 1920

Query: 1921 ISRLPVSYGIPFSIIEQCGTSHAENLECWDVAPGVPFHPFPPLPPYPRGMRGLPTSACGT 1980
            IS LPVSYGIPFSI+EQCGTS AENLECWDVAPG+PFHPFPPLPPYPRG RGLPTSACGT
Sbjct: 1921 ISCLPVSYGIPFSIVEQCGTSCAENLECWDVAPGMPFHPFPPLPPYPRGKRGLPTSACGT 1980

Query: 1981 A-GQSSQEGQVNSHDSRTSFSEESPPSTSTNYQTDLCTPSNNQQIAKRAKESSCDLGRRY 2040
            A  QSSQE QVN HDSRTSFSEE+PPSTSTNYQ DLC  SN QQ +KRAKESS DLGR+Y
Sbjct: 1981 AVRQSSQEMQVNCHDSRTSFSEETPPSTSTNYQQDLCNLSNIQQTSKRAKESSYDLGRKY 2040

Query: 2041 FRQQKWRNTKFGPPWLQRRSQWGCQGNFRGGVSTIGDENIPDEEISPYCSDEASGRVDKA 2100
            FRQ+KWRNTKFGPP   R  QWG QGNFR GVST+GD+NIP+E I PYCSDEASGRVDKA
Sbjct: 2041 FRQEKWRNTKFGPP---RTDQWGYQGNFRSGVSTLGDDNIPNEGIRPYCSDEASGRVDKA 2081

Query: 2101 NGDFYQHLQNQNLR 2112
            N DFYQHLQNQN R
Sbjct: 2101 NDDFYQHLQNQNQR 2081

BLAST of CsGy6G019510 vs. TAIR 10
Match: AT1G77300.1 (histone methyltransferases(H3-K4 specific);histone methyltransferases(H3-K36 specific) )

HSP 1 Score: 852.8 bits (2202), Expect = 5.8e-247
Identity = 625/1624 (38.49%), Positives = 831/1624 (51.17%), Query Frame = 0

Query: 460  ETVITSSKRSGRR-------RTSSQKTVTKRASRKTKKKVPEPLIFDTARRRRSSISRPA 519
            E  ++SS+R  R        +T +     +++SRK + +     IF  ++++RSS+ + +
Sbjct: 438  EAGMSSSRRKARDGKQCKVVQTKTSARHLRKSSRKKQSERDIESIFKCSKQKRSSLLKTS 497

Query: 520  RPSPWGSLGHIIQSFEEIDDVLVNQTQKQGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQ 579
            R S WG      + F + +++  +       ++S+GN    + N+     SSH       
Sbjct: 498  RSSEWGLPSKTTEIFLQSNNIPYDGPPHHEPQRSQGNLNNGEHNR-----SSHNGNVEGS 557

Query: 580  GKSATSTSTNRIRLKVKLGKNVGHNFLNIVVPEIVDSSLSAKGVNCNYGNESYWEGNLEF 639
             ++  ++S + +RLKVK GK+ G N LNI V ++  +SL   G+    G      G+  F
Sbjct: 558  NRNIQASSGSCLRLKVKFGKSGGQNPLNITVSKVSGNSLPGNGI-VKAGTCLELPGSAHF 617

Query: 640  PPSNLGVDDQK---AEEEGPLRKIFCYSRNQDKEDNCPDASVVNEQCTNNDSSCIVGIDK 699
                +   + K    E+  P+ K+  Y ++ D          + ++  N D+  +     
Sbjct: 618  GEDKMQTVETKEDLVEKSNPVEKV-SYLQSSDS---------MRDKKYNQDAGGLCRKVG 677

Query: 700  SSEKHADDNLCVSSHLVDPVATSDARSLDPGTSPDSEVINSVLDIQVGAARQEILQDSVL 759
                  D +L     + +    +  +SLD  TSPDSEVINSV                  
Sbjct: 678  GDVLDDDPHLSSIRMVEECERATGTQSLDAETSPDSEVINSV------------------ 737

Query: 760  ASLEDFAASGNAPGSKKGRKKDKPSRVVSCSEERGISVSACSNRSKSSKKHGRRHNVDNQ 819
                                   P  +V+   + G+              HG        
Sbjct: 738  -----------------------PDSIVNIEHKEGL-------------HHG-------- 797

Query: 820  LSSGETLTYADANVLNYSLTVKELSMEQVSLLTEIELPEETLKAEDILNDKECCRADVGS 879
                                                 PE+ +K   +L  ++  RA    
Sbjct: 798  ---------------------------------FFSTPEDVVKKNRVLEKEDELRASK-- 857

Query: 880  VFSESENSKTFLPSQSAKKKHPKGSKSIKTSKGKSK-APGSKNKIKNASNERVYQRKSFK 939
              S SEN    +P+ + K KHPK SKS  T KGKSK +  +K+  KN S+E V QRKS  
Sbjct: 858  --SPSENGSHLIPN-AKKAKHPK-SKSNGTKKGKSKFSESAKDGRKNESHEGVEQRKSLN 917

Query: 940  NSKSKEALCDQVVTETESHQIIGNCL---VDKPEKSDNIIASTVAVDLSVVQGAVNEQYM 999
             S  ++      V   ESH+  G  L   + K   +   I+S V     VV   + + Y 
Sbjct: 918  TSMGRDDSDYPEVGRIESHKTTGALLDADIGKTSATYGTISSDVTHGEMVVDVTIEDSY- 977

Query: 1000 PPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWTCKDNVDKAFANCSIPQEKSNAEINAE 1059
               +AWV CDDC KWRRIPAS+V S+  +S  W C +N DK FA+CS  QE SN EIN E
Sbjct: 978  STESAWVRCDDCFKWRRIPASVVGSIDESS-RWICMNNSDKRFADCSKSQEMSNEEINEE 1037

Query: 1060 LEISDESGEENGSKKRLTYRELESFHPATVNAVPQQNKFASISSNQFLHRSRKTQTIDEI 1119
            L I  +  +          +E E           Q+  F +I +NQFLHR+RK+QTIDEI
Sbjct: 1038 LGIGQDEADAYDCDAAKRGKEKEQKSKRLTG--KQKACFKAIKTNQFLHRNRKSQTIDEI 1097

Query: 1120 MVCHCKPALDGRLGCGDECLNRMLNIECVRGTCPCGELCSNQQFQKRKYAKLQWLRCGKK 1179
            MVCHCKP+ DGRLGCG+ECLNRMLNIEC++GTCP G+LCSNQQFQKRKY K +  + GKK
Sbjct: 1098 MVCHCKPSPDGRLGCGEECLNRMLNIECLQGTCPAGDLCSNQQFQKRKYVKFERFQSGKK 1157

Query: 1180 GYGLQLLEDISKGQFLIEYVGEVLDMHAYEARQKEYALNGHRHFYFMTLNGSEVIDACGK 1239
            GYGL+LLED+ +GQFLIEYVGEVLDM +YE RQKEYA  G +HFYFMTLNG+EVIDA  K
Sbjct: 1158 GYGLRLLEDVREGQFLIEYVGEVLDMQSYETRQKEYAFKGQKHFYFMTLNGNEVIDAGAK 1217

Query: 1240 GNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVFGAAAKKCY 1299
            GNLGRFINHSC+PNCRTEKWMVNGEIC+G+F+++D+KKG+E+TFDYNYVRVFGAAAKKCY
Sbjct: 1218 GNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKKGQELTFDYNYVRVFGAAAKKCY 1277

Query: 1300 CGSFHCRGYIGGDPLNSEVIIQSDSDEEFPEPVMLRGDGRS---LNSNLSTAVSSMDVAK 1359
            CGS HCRGYIGGDPLN +VIIQSDSDEE+PE V+L  D      L +   T     D   
Sbjct: 1278 CGSSHCRGYIGGDPLNGDVIIQSDSDEEYPELVILDDDESGEGILGATSRTFTDDADEQM 1337

Query: 1360 MQSSEHLKGNRDKRDQPIRIAS--ELKISEEKVDPLKLSASKISEEKEDPLKLSATKISE 1419
             QS E + G +D      +  S   +K+ E ++ P  L  +++ +E    + ++A     
Sbjct: 1338 PQSFEKVNGYKDLAPDNTQTQSSVSVKLPEREIPPPLLQPTEVLKELSSGISITAV---- 1397

Query: 1420 EKEDPLNLSASTISPLHSSLEFEDSKVASPIPVPDITHQTEDVTSQPIFVDQTEISLLDN 1479
            ++E P      + SP  SSL                                        
Sbjct: 1398 QQEVPAEKKTKSTSPTSSSL---------------------------------------- 1457

Query: 1480 IPDKNTCSIEQEAKLSVDDIDARKKSKLDSVEDKQVYIKSHPRMKTSRKLGSIKKGK--- 1539
                        +++S    ++ K +K  S EDK++  +  PRMKTSR   S K+ K   
Sbjct: 1458 ------------SRMSPGGTNSDKTTKHGSGEDKKILPRPRPRMKTSRSSESSKRDKGGI 1517

Query: 1540 ---VSSAEKIQITNRSQISSVKPKRLIEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPK 1599
               V+ A+ I + N+ Q   +K K   + SP    E  E KLNELLDA GGISKR+D+ K
Sbjct: 1518 YPGVNKAQVIPV-NKLQQQPIKSKGSEKVSPS--IETFEGKLNELLDAVGGISKRRDSAK 1577

Query: 1600 GYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSRLVLTDIINKNGLRMLHNIMK 1659
            GYLKLLLLTAAS      E I SNRDLSMILDALLKTKS+ VL DIINKNGL+MLHNIMK
Sbjct: 1578 GYLKLLLLTAAS-RGTDEEGIYSNRDLSMILDALLKTKSKSVLVDIINKNGLQMLHNIMK 1637

Query: 1660 QYRSDFKKIPILRKLLKVLEYLVTREILTSEHINGGPPCPGMESLRESLLSLTEHDDKQV 1719
            QYR DFK+IPI+RKLLKVLEYL TR+IL  EHI   PP  GMES ++S+LS TEHDD  V
Sbjct: 1638 QYRGDFKRIPIIRKLLKVLEYLATRKILALEHIIRRPPFAGMESFKDSVLSFTEHDDYTV 1697

Query: 1720 HQIARSFRDRWFPRHTRKFGYSEREDGRLEVYRGSNSSRFTASHSFRHD-QDCRPTDAID 1779
            H IARSFRDRW P+H RK     RE+ R E  R   + RF AS   R+D Q  RP +   
Sbjct: 1698 HNIARSFRDRWIPKHFRKPWRINREE-RSESMRSPINRRFRASQEPRYDHQSPRPAEPAA 1757

Query: 1780 CIKQSMPTSLPDAHPAEVCSLASAASHSVNGQKVRKRKSRWDQPADTSLDLRSKEQKLES 1839
             +  S   +   A  +E  S  ++     NG   RKRKSRWDQP+ T      KEQ++ +
Sbjct: 1758 SVTSSKAATPETASVSEGYSEPNSGLPETNG---RKRKSRWDQPSKT------KEQRIMT 1781

Query: 1840 TSVQELNSSQLNSVGAASMLIDKVNNDDKDISLSDSVGVPCRQDEDIRADSAVPNIPEDI 1899
               Q+ + +  N                                          ++ +D+
Sbjct: 1818 ILSQQTDETNGNQ-----------------------------------------DVQDDL 1781

Query: 1900 PPGFSSPFNPPVASSSAFSAVLDPPRQNIGDLSCAFSTVGHLQERFISRLPVSYGIPFSI 1959
            PPGFSSP       +    A+   P                 Q++F+SRLPVSYGIP SI
Sbjct: 1878 PPGFSSP------CTDVPDAITAQP-----------------QQKFLSRLPVSYGIPLSI 1781

Query: 1960 IEQCGTSHAENLECWDVAPGVPFHPFPPLPPYPRG--MRGLPTSACGTAGQSSQEGQVNS 2019
            + Q G+   E+   W VAPG+PF+PFPPLPP   G         AC     SS  G +  
Sbjct: 1938 VHQFGSPGKEDPTTWSVAPGMPFYPFPPLPPVSHGEFFAKRNVRAC-----SSSMGNL-- 1781

Query: 2020 HDSRTSFSEESPPSTSTNYQTDLCTPSNNQQIAKRAKESSCDLGRRYFRQQKWRNTKFGP 2056
                 ++S E  P+T     TD   P+  +++       S D+G  YFRQQK    +  P
Sbjct: 1998 -----TYSNEILPATPV---TDSTAPTRKREL------FSSDIGTTYFRQQK----QSVP 1781

BLAST of CsGy6G019510 vs. TAIR 10
Match: AT1G77300.2 (histone methyltransferases(H3-K4 specific);histone methyltransferases(H3-K36 specific) )

HSP 1 Score: 637.1 bits (1642), Expect = 5.0e-182
Identity = 460/1192 (38.59%), Positives = 617/1192 (51.76%), Query Frame = 0

Query: 460  ETVITSSKRSGRR-------RTSSQKTVTKRASRKTKKKVPEPLIFDTARRRRSSISRPA 519
            E  ++SS+R  R        +T +     +++SRK + +     IF  ++++RSS+ + +
Sbjct: 438  EAGMSSSRRKARDGKQCKVVQTKTSARHLRKSSRKKQSERDIESIFKCSKQKRSSLLKTS 497

Query: 520  RPSPWGSLGHIIQSFEEIDDVLVNQTQKQGNEKSKGNQGGAKRNKKQLSESSHRSRKGTQ 579
            R S WG      + F + +++  +       ++S+GN    + N+     SSH       
Sbjct: 498  RSSEWGLPSKTTEIFLQSNNIPYDGPPHHEPQRSQGNLNNGEHNR-----SSHNGNVEGS 557

Query: 580  GKSATSTSTNRIRLKVKLGKNVGHNFLNIVVPEIVDSSLSAKGVNCNYGNESYWEGNLEF 639
             ++  ++S + +RLKVK GK+ G N LNI V ++  +SL   G+    G      G+  F
Sbjct: 558  NRNIQASSGSCLRLKVKFGKSGGQNPLNITVSKVSGNSLPGNGI-VKAGTCLELPGSAHF 617

Query: 640  PPSNLGVDDQK---AEEEGPLRKIFCYSRNQDKEDNCPDASVVNEQCTNNDSSCIVGIDK 699
                +   + K    E+  P+ K+  Y ++ D          + ++  N D+  +     
Sbjct: 618  GEDKMQTVETKEDLVEKSNPVEKV-SYLQSSDS---------MRDKKYNQDAGGLCRKVG 677

Query: 700  SSEKHADDNLCVSSHLVDPVATSDARSLDPGTSPDSEVINSVLDIQVGAARQEILQDSVL 759
                  D +L     + +    +  +SLD  TSPDSEVINSV                  
Sbjct: 678  GDVLDDDPHLSSIRMVEECERATGTQSLDAETSPDSEVINSV------------------ 737

Query: 760  ASLEDFAASGNAPGSKKGRKKDKPSRVVSCSEERGISVSACSNRSKSSKKHGRRHNVDNQ 819
                                   P  +V+   + G+              HG        
Sbjct: 738  -----------------------PDSIVNIEHKEGL-------------HHG-------- 797

Query: 820  LSSGETLTYADANVLNYSLTVKELSMEQVSLLTEIELPEETLKAEDILNDKECCRADVGS 879
                                                 PE+ +K   +L  ++  RA    
Sbjct: 798  ---------------------------------FFSTPEDVVKKNRVLEKEDELRASK-- 857

Query: 880  VFSESENSKTFLPSQSAKKKHPKGSKSIKTSKGKSK-APGSKNKIKNASNERVYQRKSFK 939
              S SEN    +P+ + K KHPK SKS  T KGKSK +  +K+  KN S+E V QRKS  
Sbjct: 858  --SPSENGSHLIPN-AKKAKHPK-SKSNGTKKGKSKFSESAKDGRKNESHEGVEQRKSLN 917

Query: 940  NSKSKEALCDQVVTETESHQIIGNCL---VDKPEKSDNIIASTVAVDLSVVQGAVNEQYM 999
             S  ++      V   ESH+  G  L   + K   +   I+S V     VV   + + Y 
Sbjct: 918  TSMGRDDSDYPEVGRIESHKTTGALLDADIGKTSATYGTISSDVTHGEMVVDVTIEDSY- 977

Query: 1000 PPRNAWVLCDDCHKWRRIPASLVDSLGHASCTWTCKDNVDKAFANCSIPQEKSNAEINAE 1059
               +AWV CDDC KWRRIPAS+V S+  +S  W C +N DK FA+CS  QE SN EIN E
Sbjct: 978  STESAWVRCDDCFKWRRIPASVVGSIDESS-RWICMNNSDKRFADCSKSQEMSNEEINEE 1037

Query: 1060 LEISDESGEENGSKKRLTYRELESFHPATVNAVPQQNKFASISSNQFLHRSRKTQTIDEI 1119
            L I  +  +          +E E           Q+  F +I +NQFLHR+RK+QTIDEI
Sbjct: 1038 LGIGQDEADAYDCDAAKRGKEKEQKSKRLTG--KQKACFKAIKTNQFLHRNRKSQTIDEI 1097

Query: 1120 MVCHCKPALDGRLGCGDECLNRMLNIECVRGTCPCGELCSNQQFQKRKYAKLQWLRCGKK 1179
            MVCHCKP+ DGRLGCG+ECLNRMLNIEC++GTCP G+LCSNQQFQKRKY K +  + GKK
Sbjct: 1098 MVCHCKPSPDGRLGCGEECLNRMLNIECLQGTCPAGDLCSNQQFQKRKYVKFERFQSGKK 1157

Query: 1180 GYGLQLLEDISKGQFLIEYVGEVLDMHAYEARQKEYALNGHRHFYFMTLNGSEVIDACGK 1239
            GYGL+LLED+ +GQFLIEYVGEVLDM +YE RQKEYA  G +HFYFMTLNG+EVIDA  K
Sbjct: 1158 GYGLRLLEDVREGQFLIEYVGEVLDMQSYETRQKEYAFKGQKHFYFMTLNGNEVIDAGAK 1217

Query: 1240 GNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVFGAAAKKCY 1299
            GNLGRFINHSC+PNCRTEKWMVNGEIC+G+F+++D+KKG+E+TFDYNYVRVFGAAAKKCY
Sbjct: 1218 GNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKKGQELTFDYNYVRVFGAAAKKCY 1277

Query: 1300 CGSFHCRGYIGGDPLNSEVIIQSDSDEEFPEPVMLRGDGRS---LNSNLSTAVSSMDVAK 1359
            CGS HCRGYIGGDPLN +VIIQSDSDEE+PE V+L  D      L +   T     D   
Sbjct: 1278 CGSSHCRGYIGGDPLNGDVIIQSDSDEEYPELVILDDDESGEGILGATSRTFTDDADEQM 1337

Query: 1360 MQSSEHLKGNRDKRDQPIRIAS--ELKISEEKVDPLKLSASKISEEKEDPLKLSATKISE 1419
             QS E + G +D      +  S   +K+ E ++ P  L  +++ +E    + ++A     
Sbjct: 1338 PQSFEKVNGYKDLAPDNTQTQSSVSVKLPEREIPPPLLQPTEVLKELSSGISITAV---- 1397

Query: 1420 EKEDPLNLSASTISPLHSSLEFEDSKVASPIPVPDITHQTEDVTSQPIFVDQTEISLLDN 1479
            ++E P      + SP  SSL                                        
Sbjct: 1398 QQEVPAEKKTKSTSPTSSSL---------------------------------------- 1448

Query: 1480 IPDKNTCSIEQEAKLSVDDIDARKKSKLDSVEDKQVYIKSHPRMKTSRKLGSIKKGK--- 1539
                        +++S    ++ K +K  S EDK++  +  PRMKTSR   S K+ K   
Sbjct: 1458 ------------SRMSPGGTNSDKTTKHGSGEDKKILPRPRPRMKTSRSSESSKRDKGGI 1448

Query: 1540 ---VSSAEKIQITNRSQISSVKPKRLIEGSPGNRFEAVEEKLNELLDAEGGISKRKDAPK 1599
               V+ A+ I + N+ Q   +K K   + SP    E  E KLNELLDA GGISKR+D+ K
Sbjct: 1518 YPGVNKAQVIPV-NKLQQQPIKSKGSEKVSPS--IETFEGKLNELLDAVGGISKRRDSAK 1448

Query: 1600 GYLKLLLLTAASGASASGEAIQSNRDLSMILDALLKTKSRLVLTDIINKNGL 1627
            GYLKLLLLTAAS      E I SNRDLSMILDALLKTKS+ VL DIINKNGL
Sbjct: 1578 GYLKLLLLTAAS-RGTDEEGIYSNRDLSMILDALLKTKSKSVLVDIINKNGL 1448

BLAST of CsGy6G019510 vs. TAIR 10
Match: AT1G76710.1 (SET domain group 26 )

HSP 1 Score: 207.2 bits (526), Expect = 1.3e-52
Identity = 96/235 (40.85%), Positives = 137/235 (58.30%), Query Frame = 0

Query: 1083 KFASISSNQFLHRSRKTQTIDEIMVCHCKPAL-DGRLGCGDECLNRMLNIECVRGTCPCG 1142
            ++  I  N F +R  K Q  ++I +C CK    D    CG+ CLN + N EC  G CPCG
Sbjct: 16   QYEHIYQNDFSYRKHKKQKEEDISICECKFDFGDPDSACGERCLNVITNTECTPGYCPCG 75

Query: 1143 ELCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDMHAYEARQKEY 1202
              C NQ+FQK +YAK + ++C  +G+GL  LE+I  GQF++EY GEV+     + R + Y
Sbjct: 76   VYCKNQKFQKCEYAKTKLIKCEGRGWGLVALEEIKAGQFIMEYCGEVISWKEAKKRAQTY 135

Query: 1203 ALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDI 1262
              +G +  Y ++LN SE IDA  KG+L RFINHSC PNC T KW V GE+ +G+FA   I
Sbjct: 136  ETHGVKDAYIISLNASEAIDATKKGSLARFINHSCRPNCETRKWNVLGEVRVGIFAKESI 195

Query: 1263 KKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGDP--LNSEVIIQSDSDEEF 1315
                E+ +DYN+   +G A  +C CG+  C G++G        +  +  D D+ +
Sbjct: 196  SPRTELAYDYNF-EWYGGAKVRCLCGAVACSGFLGAKSRGFQEDTYVWEDGDDRY 249

BLAST of CsGy6G019510 vs. TAIR 10
Match: AT1G76710.2 (SET domain group 26 )

HSP 1 Score: 207.2 bits (526), Expect = 1.3e-52
Identity = 96/235 (40.85%), Positives = 137/235 (58.30%), Query Frame = 0

Query: 1083 KFASISSNQFLHRSRKTQTIDEIMVCHCKPAL-DGRLGCGDECLNRMLNIECVRGTCPCG 1142
            ++  I  N F +R  K Q  ++I +C CK    D    CG+ CLN + N EC  G CPCG
Sbjct: 16   QYEHIYQNDFSYRKHKKQKEEDISICECKFDFGDPDSACGERCLNVITNTECTPGYCPCG 75

Query: 1143 ELCSNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDMHAYEARQKEY 1202
              C NQ+FQK +YAK + ++C  +G+GL  LE+I  GQF++EY GEV+     + R + Y
Sbjct: 76   VYCKNQKFQKCEYAKTKLIKCEGRGWGLVALEEIKAGQFIMEYCGEVISWKEAKKRAQTY 135

Query: 1203 ALNGHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDI 1262
              +G +  Y ++LN SE IDA  KG+L RFINHSC PNC T KW V GE+ +G+FA   I
Sbjct: 136  ETHGVKDAYIISLNASEAIDATKKGSLARFINHSCRPNCETRKWNVLGEVRVGIFAKESI 195

Query: 1263 KKGEEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGDP--LNSEVIIQSDSDEEF 1315
                E+ +DYN+   +G A  +C CG+  C G++G        +  +  D D+ +
Sbjct: 196  SPRTELAYDYNF-EWYGGAKVRCLCGAVACSGFLGAKSRGFQEDTYVWEDGDDRY 249

BLAST of CsGy6G019510 vs. TAIR 10
Match: AT2G44150.1 (histone-lysine N-methyltransferase ASHH3 )

HSP 1 Score: 167.5 bits (423), Expect = 1.1e-40
Identity = 90/230 (39.13%), Positives = 130/230 (56.52%), Query Frame = 0

Query: 1087 ISSNQFLHRSRKTQTIDEIMVCHCKPALDGRLG--CGDECLNRMLNIECVRGTCPCGELC 1146
            I  N +L +  K +  D+ + C C  +  G     CG  C   ML   C   +C CG  C
Sbjct: 47   IRRNIYLTKKVKRRVEDDGIFCSCSSSSPGSSSTVCGSNCHCGMLFSSC-SSSCKCGSEC 106

Query: 1147 SNQQFQKRKYAKLQWLRCGKKGYGLQLLEDISKGQFLIEYVGEVLDMHAYEARQKEYALN 1206
            +N+ FQ+R   K++ ++  K G G+   E+I  G+F+IEYVGEV+D    E R  +    
Sbjct: 107  NNKPFQQRHVKKMKLIQTEKCGSGIVAEEEIEAGEFIIEYVGEVIDDKTCEERLWKMKHR 166

Query: 1207 GHRHFYFMTLNGSEVIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKKG 1266
            G  +FY   +    VIDA  KGN  R+INHSC+PN + +KW+++GE  IG+FA R IKKG
Sbjct: 167  GETNFYLCEITRDMVIDATHKGNKSRYINHSCNPNTQMQKWIIDGETRIGIFATRGIKKG 226

Query: 1267 EEVTFDYNYVRVFGAAAKKCYCGSFHCRGYIGGDPLNSEVIIQSDSDEEF 1315
            E +T+DY +V+ FG A + C+CG+  CR  +G  P   ++     SDE F
Sbjct: 227  EHLTYDYQFVQ-FG-ADQDCHCGAVGCRRKLGVKPSKPKIA----SDEAF 269
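
The Identity and Positives figures in each HSP header above are ratios of matched alignment columns to alignment length. As a minimal sketch, the percentages can be recomputed from the raw counts. The helper name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of this database or of any BLAST tool.

```python
import re

# Hypothetical pattern for an HSP statistics line. "Posi?tives" also
# tolerates the misspelling "Postives" found in some report renderings.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP statistics line and recompute both percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "align_len": int(ident_len),
        # Recomputed from the raw counts, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 90/230 (39.13%), Positives = 130/230 (56.52%), Query Frame = 0"
)
```

For the AT2G44150.1 hit, 90/230 and 130/230 recompute to 39.13% and 56.52%, in agreement with the reported values.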

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q2LAE1 | 3.8e-219 | 36.33 | Histone-lysine N-methyltransferase ASHH2 OS=Arabidopsis thaliana OX=3702 GN=ASHH...
Q9BYW2 | 7.0e-56 | 41.54 | Histone-lysine N-methyltransferase SETD2 OS=Homo sapiens OX=9606 GN=SETD2 PE=1 S...
Q84WW6 | 1.8e-51 | 40.85 | Histone-lysine N-methyltransferase ASHH1 OS=Arabidopsis thaliana OX=3702 GN=ASHH...
E9Q5F9 | 1.8e-51 | 32.12 | Histone-lysine N-methyltransferase SETD2 OS=Mus musculus OX=10090 GN=Setd2 PE=1 ...
Q9VYD1 | 4.4e-50 | 36.78 | Probable histone-lysine N-methyltransferase CG1716 OS=Drosophila melanogaster OX...

Match Name | E-value | Identity | Description
XP_011657417.1 | 0.0 | 98.58 | histone-lysine N-methyltransferase ASHH2 isoform X1 [Cucumis sativus] >XP_011657...
XP_031744047.1 | 0.0 | 98.55 | histone-lysine N-methyltransferase ASHH2 isoform X2 [Cucumis sativus]
KAA0055531.1 | 0.0 | 94.84 | histone-lysine N-methyltransferase ASHH2 [Cucumis melo var. makuwa]
XP_008441612.1 | 0.0 | 93.42 | PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase ASHH2 [Cucumi...
XP_038884997.1 | 0.0 | 88.13 | histone-lysine N-methyltransferase ASHH2 isoform X1 [Benincasa hispida] >XP_0388...

Match Name | E-value | Identity | Description
A0A0A0KDR4 | 0.0 | 98.56 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G376290 PE=4 SV=1
A0A5A7UPQ6 | 0.0 | 94.84 | Histone-lysine N-methyltransferase ASHH2 OS=Cucumis melo var. makuwa OX=1194695 ...
A0A1S3B3U9 | 0.0 | 93.42 | LOW QUALITY PROTEIN: histone-lysine N-methyltransferase ASHH2 OS=Cucumis melo OX...
A0A6J1JXF7 | 0.0 | 82.78 | histone-lysine N-methyltransferase ASHH2-like isoform X1 OS=Cucurbita maxima OX=...
A0A6J1K197 | 0.0 | 82.59 | histone-lysine N-methyltransferase ASHH2-like isoform X2 OS=Cucurbita maxima OX=...

Match Name | E-value | Identity | Description
AT1G77300.1 | 5.8e-247 | 38.49 | histone methyltransferases(H3-K4 specific);histone methyltransferases(H3-K36 spe...
AT1G77300.2 | 5.0e-182 | 38.59 | histone methyltransferases(H3-K4 specific);histone methyltransferases(H3-K36 spe...
AT1G76710.1 | 1.3e-52 | 40.85 | SET domain group 26
AT1G76710.2 | 1.3e-52 | 40.85 | SET domain group 26
AT2G44150.1 | 1.1e-40 | 39.13 | histone-lysine N-methyltransferase ASHH3
InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR006560 | AWS domain | SMART | SM00570 | shorttest3 | coord: 1103..1154; e-value: 5.0E-24; score: 95.8
IPR006560 | AWS domain | PFAM | PF17907 | AWS | coord: 1117..1152; e-value: 1.7E-14; score: 53.6
IPR006560 | AWS domain | PROSITE | PS51215 | AWS | coord: 1103..1153; score: 18.919239
IPR003616 | Post-SET domain | SMART | SM00508 | PostSET_3 | coord: 1280..1296; e-value: 0.008; score: 25.4
IPR003616 | Post-SET domain | PROSITE | PS50868 | POST_SET | coord: 1280..1296; score: 9.642072
IPR001214 | SET domain | SMART | SM00317 | set_7 | coord: 1155..1278; e-value: 2.7E-39; score: 146.5
IPR001214 | SET domain | PFAM | PF00856 | SET | coord: 1166..1272; e-value: 3.7E-18; score: 66.5
IPR001214 | SET domain | PROSITE | PS50280 | SET | coord: 1155..1272; score: 19.507521
None | No IPR available | GENE3D | 3.30.40.100 | - | coord: 988..1048; e-value: 9.6E-19; score: 69.3
None | No IPR available | GENE3D | 2.170.270.10 | SET domain | coord: 1053..1312; e-value: 6.0E-83; score: 280.5
None | No IPR available | PIRSR | PIRSR009343-2 | PIRSR009343-2 | coord: 1132..1295; e-value: 1.2E-27; score: 93.9
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 463..520
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 27..49
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 539..584
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 471..489
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 878..927
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 27..50
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 490..506
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 757..780
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1777..1803
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 539..557
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 342..383
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 362..376
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1788..1803
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 570..584
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 900..916
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1970..2023
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1976..2023
None | No IPR available | PANTHER | PTHR22884 | SET DOMAIN PROTEINS | coord: 475..2028
None | No IPR available | PANTHER | PTHR22884:SF413 | HISTONE-LYSINE N-METHYLTRANSFERASE, H3 LYSINE-36 SPECIFIC | coord: 475..2028
None | No IPR available | SUPERFAMILY | 82199 | SET domain | coord: 1082..1293
IPR011124 | Zinc finger, CW-type | PFAM | PF07496 | zf-CW | coord: 991..1036; e-value: 1.3E-12; score: 47.5
IPR011124 | Zinc finger, CW-type | PROSITE | PS51050 | ZF_CW | coord: 985..1039; score: 13.255798
IPR044437 | SETD2/Set2, SET domain | CDD | cd19172 | SET_SETD2 | coord: 1154..1296; e-value: 5.13093E-87; score: 277.925
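
The `coord:` ranges in the InterPro annotations can be converted into hit lengths. The following is a minimal sketch, assuming the coordinates are 1-based, inclusive residue positions on the protein (the page does not state the convention, so this is an assumption). The helper name `span_length` is hypothetical.

```python
import re

# Matches an alignment entry such as "coord: 1155..1278".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(alignment):
    """Return the number of residues covered by a 'coord: a..b' entry.

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    on the page), so the length is end - start + 1.
    """
    m = COORD.search(alignment)
    if m is None:
        raise ValueError("no coord range in %r" % alignment)
    start, end = int(m.group(1)), int(m.group(2))
    if end < start:
        raise ValueError("end precedes start in %r" % alignment)
    return end - start + 1


set_len = span_length("coord: 1155..1278")  # SMART SM00317 hit
aws_len = span_length("coord: 1103..1154")  # SMART SM00570 hit
```

Under that assumption, the SMART SET domain hit (1155..1278) spans 124 residues and the SMART AWS hit (1103..1154) spans 52.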

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CsGy6G019510.2 | CsGy6G019510.2 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0010452 | histone H3-K36 methylation
cellular_component | GO:0005634 | nucleus
molecular_function | GO:0046975 | histone methyltransferase activity (H3-K36 specific)
molecular_function | GO:0005515 | protein binding
molecular_function | GO:0008270 | zinc ion binding
molecular_function | GO:0018024 | histone-lysine N-methyltransferase activity