Sgr026330 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr026330
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: histone-lysine N-methyltransferase SUVR5
Location: tig00153031: 4146280 .. 4168851 (+)
RNA-Seq Expression: Sgr026330
Synteny: Sgr026330
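As a quick check on the Location field, the genomic span of the feature can be computed from its start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is common for genome browser records but is not stated on this page; the function name `span_length` is hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Return the length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field (tig00153031, + strand).
print(span_length(4146280, 4168851))
```

Under a 0-based, half-open convention the `+ 1` would be dropped, so the result depends on the assumption above.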
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TGGAGAGGGAAGTGGCAGGCTGGAATTAGATGTGCAAGGGCTGACTGGCCATTATCTACTTTAAAAGCCAAACCTACACATGACAGGAAGAAGTATTTTGTGGTCTTTTTTCCACACACAAGGAATTATTCTTGGGCAGATGCATTGCTTGTTCGTTCTATTGAAGAATTTCCTCAGCCTATTGCATACAAGAGCCACAAAGCTGGTTTAAAATTGGTTGAAGATGTAAAAGTTGCTAGGAGATTTATAATGAAAAAACTTGCCGTTAGCATGCTAAATATCATAGACCAATTTCACCTCGAGGTTTGTATATGGTCCACACTTGTGTATTCTATTTATCATTGTAATTAGTTGTGAAGTTACTTGAGTTTTTATTTTCCTGCTTGTTCATTTTTTAGTCGAGTTCAAATTGATTGATGTAGGCTCTGATAGAGAGTGCTCGTGATGTAATGACTTGGAAAGAGTTTGCCATGGAAGCTTCACGCTGTAATGGTTATTCCGATCTTGGAAGAATGCTCCTGAAGCTGCAGAATGTAAACTTCTATTCTTCTTGATGGTATATGAGATGGAAATAACAAAAATTTTATTTTGATTTCATGGATTATTATTTCCTGTTTTTTGTGATTAGGAGTTTATGTTACTGAGTGCGTTTACAATTTAATCTGATTTAGATGCACCACTATATTTCTGGCCCTTAAATATTTTTGTTGTGGCATCCATTTGTTATGATATAAATGCATCCATTTGTTATGATATAAATATTAAGGCTAGCAAAGATTATTCAGAGGAAAGATCTTAGGAGAGCAATGTGTTTCCATCTAGTAACTTGAGTTCAAATATGATGTGGCTATAAGTGTTAGCAATGACATGGCCAAATAATTAGTGGTGTATACACTTCGCTTCAATTTGGTTTTCTTTAGAACGAAAATTGGGCTGTGATGTTCAATTTTCCATATGTGCCAACCGAATAGTTTTTTCCTTTTGTACAAACTGGTTTAAACCAAATTGGTTTTAGTTTTGGTTTCTTGTGTTGGGCCTCATTCACATACTAGTTTGAAGCCCAATTTTGTGTTTAATTTTAGTTTTTTATCGTTTAAGCTCAATCTCTAGCATTTCTTTTTCTTTTTTTCATTCTAAGACCAACTTAGATTCCTCCAAACATATTCAAATTTCAGAAAATTAACACTTAGAACATTTGGATAACCATTTAAGCTCAATTTTCAGCTATATTTTTGTTTTCTTTTTTCATTTCAATATTAAGTTTTCGGTTTGGATTTTTGATACCTAATAAACCAAATGAAATGATTTATTGACATGGCGAATACATGATTCCAAATGTTCATTTGAAGATCTGTAACTGAAATTATTTATTGACAATTGAGATGGTGAATGAATGATGAACTCTATGGAATGTTTTTAATTGGTCTAAGAAATAGTATTTATTATATTATCATTTTATTATTATTATTTTATTCTCCAAGATCCTAGTATTATCTCTGTATTTGAGATTTTCTTTAGTTTGTTATACCAAATTTTAGGCTCTCCCTTGCCTTTCTCTATCAGATGTTTTATGGTTAGTGTCATACTGTCAATGGAAAGTAGACAGACTGGTTTACAGTTAATGAAATTAATTAGACTAAGGTTATTATTTCAATTTTGTTTTTGATACCCAATATTTTCCAACCTGTTTCTGTTCACTGGATTGAGGCTGTACAACAGTTGATTGTTTGAACTCGTGAAATAATTAATATTCCTTATGTATAGACCCAAGTCTTTGAACCCTTAATCACTAGGCCAGATCTAGGTATCTTTTATCACTTTATCTCACTTTGGCGGATGTTCAATATACCGAGTTGCTTGCCTTAATACTATGACTTGAAATTGCTTATTTGAATCTAAAAATGATTCTTTCTAATTTTGGATAACACATATGACTCAGGCATAAATGACCTGTGCAATTCATGTTGGACATTTAGGTCCTATTTAGTTCGTGATCCATA
ACAAGGTTTATATAGATATATATTTAAATTTATATATTTGCAATAGAAAACAAGCTTGTATTTATTGTGTTTCTTGAATTGAAAATGCATTTAGTAGCATATTGAAAACTGATTTTTGAATTTAGAAATTGGAAAACTGTGTTGGTGGTGAATTTTGGAAACATAAAGAATAAATATGTTTGGTAGAATCCAATTTTTTTTAATTATAAAATTGTTTTTTTTTCTTATTACCAAAATAAATAAACAAATACAATGGTTTCATTAAATCCACAGATCTGATTCCAATAGAGGCAAGTAATTAGGAGTTGAATTGGGGGGGAAATTAAAACAACTGATCTGAACCGCCTACTAATCTTATTGTCTAAACCATAAAGATAAAGGATTATGGAACTGAACCTTCTAAGATATCCAGCAACTTCTTAGCAACTTCTAGATTTGTAGCCAAATCTGAAGACCCTTAGTTACTTGAGATTTTATATATCTGTTAATGGATCTAGAGAGAGAAAGAGAGGAAGGGAGAGGTTGCTTAGGAAGATAGAAAGAGGGGAGAGGATGCTTAGAGGGAGAGAAACGATATGCTTTATGAGAAAACATGAAGCCTCATTTTTCAGGAAAACCTCACTTTGTGTTTTCATAATTTTGAAATATTGAATTTGAAAGCATTAATAGAATGTTTTAAAAATTGGCTTAATTGGATTACTATTTTTGAAATAATACTCAAATGTGAACTCAGTATATTTATAGCGTCCCTTGTTTTCAAAAACATGAAAATGAAAATGGATTAGATACCAAGCTCCTTACTTTTGAACTTCCGATTCTAGGCCTTTTGTCTGGCTCAATTCCATGTTCTCTTTTGCATCTGCCCATAGTTGGTTGAAAGGTCTAGTTTTATTCTAGGGTTTGTCTCAACTATGCTCAGGAGAGTTTTTCTTTTTAAGATTTCTACAATGTTTGGACTCTGCTCTTCAACTTCTGCTCTTAACTCTCCTATTCCTGTGACTTCTCTTTTAGATAGTGATAAACTAATGTTCGTCCCTTTGCAGATGATAGTGCAGTGCTTCATAAATTCAGATTGGCTTCATCATTCTTTGCATTCTTGGGTACAACGATGTCAAAATGCTCAAACTGCAGAAATTATTGAAATGCTCAAGGAGGTAATGCCTTGACTGAGAACCTGGTTTTTTTGCCCCATATTTCTGCTTGTCTCAAACATGTTTTAGATCGAGAATGAGTGGGAATTTGCGGTGAGAGCATATGATTTTTATGTCAACATGTTATTTTTATGGAGAAGGGGGAAAAAATTCCAGGGATGAACCTATCCTTTTTTATTTTGATGGGAAAATGCAACGATGAGCACATCCTCTTTTCCCTCCTCTCCTTTTCCCGTTGGGGAAAGGAAGAACATCCGGTTTTCTTTCACTTGATGAAAAAGAAAACAATGAAATAGAGTAGATGCTTGTATGTAAGTAGCATAATGTCAATAATTTGTGTTCTTGAACTTGCTTAGTGCCACAGTATTTTGAGAGGTTGTGAAAGCCTTCACCAATGACACCTTTTTTCCCCCTCATATCGAATGACCTCACACTCAAGATCAAGTGTATAGGTTCTAAGTGAAACCTCGGAACAAAGTGGGATGAAGGTAGGATAAGAATAAGGAAGAAAGAACTCACCTATGCATTCTTACTCTTGACTCGACATCGTTTCTCTGTTAACAAAGGAACAAGCCTCCTTTTGGACTTTTGGAAAATAAGCTCTCTCTTATTGCTCACCTTCCTCTTCCTTGCCTACAAAATGGTTGTGAATCTCTTTAGATAGAATCTCCACCCATAGACTGGTTTACAAGATTTAAACTACCCTAAGTTTGATAAGTTAAGATTAAGAACTCTTGGACAACAACTCTGTCCATTCAGGTTTTGGAACTTAATTCAAACATCCTTAATTTGAATTCCCTATCTCTTAGCTTTGGCCTTCTAGAGTCCATGTTGTTGAGGATTCTCCAT
GATTTGGCGATCATTCTTTACAACCGCATGCTTATCCAATCTTCCTAGATTCATAGGGAGTTCATTACAAATGAACCTTCTTGTTCGCACCTTCCTCGATCCACACAACTACCTTATCTCCTCCTTGGATAAGCTATCCTTGGATTTGGCTTCAAAGATATCCTTCACTTATCTCACCCATTTAAACGATACTAAGATTCAACTTAAATTTCAAATTTCACATGTCCGCCAGCCTGTTTGCCCATCACATGCTTGCTGCTTGCCTATGTGTTCAACATAATCTTGCTTGCATGTCACCTTGCACTACATGCCCTGCGCTGCACCTTGCTGCATACATACATCTGCATGTGCACTGCCATGCGCATGGTGCTGTGCCTACACGAGCACCCCAATGCTCGCAGCTTGGTGCATGGAACCTCACTGCTTACATCCTGCGCACCTATGCCATGACAATCAATCTCACGGCCAGTCTCATCATGCTTGCCCACAAGCTCCATGCTAGTCATGACTTGTTAGACACTCATCTCTAGAATAGTTACTTGGTCAAGTACCTCACCTCGATCTTCTTGTACTGCAACAAGTGTAGTGACATAGGTGCTCTCCTCTTTCTCTTTTCTTCATTTACCTACTAAGTTGAGGGCTGCTAGTCTCCTACACTCCCGCCATTGGTTGCTATTAGGTCTACGTAGGTAGTAGTCAGATTAGGAAAAAGAGAATTTCAAACTTTGATAGCTTGTTAGCAACCCCACATGCATGGGTTGTAGATTCAGGGAAAATTTTAGAGAATCCATGGTCCCGCACCATTCTTGAGCCCCAACAAAGGTGCTTTTAGATGATTGTAAGCATGATCTGCATACAACGGCATGTTGCCATTGCCCCAAATCAGTAACAAATAGATTAGGGAAAAGAAAAATTTCAAACTCTGACAACTGTTAGCAACTCCAAACGAGGTATGGGTTGTACTTTGTAGGTTCAGGGAAAAATTTATATGACCCACACATTCTTGAACCATAACAATTTTTTTAAATGGTTAAAAAACACGCATATAATACTCATGTTTGGCTACAAGTTAGAAGCACATTTAGAAAAAGAAAAAGAATAAAGAAATGCTTTCTAAGAAAACAAATGATAGATGCTTTATTAAATGCTTTATCCTTAATTTATACTTCATAAATCACATTTTTTTTCACTAACTTTTTTTTTAAAAAAATATTCGTGAGTTTCTTGGCTAGCTTACGTGCACTTTAATTAATCTTATGAGACAATTATTAGAATTTACAATATTTAATAATCTAGGCAAGTGGCCAGCATGGGTTTGTACCATTCTCATTCACGAGAGTTTTTAACTTCTCTCACATAGTTTCCCTTGCGTGATACATTATCCCGTTTGGACTTCTAAGAGTTACCATCACACTTTCTGTCCAAACTTTTTCCCTCTAAGAAATTCCTATTTGCTTTCTTCATGTTTGACCATTATTCATGTCATTTTGAAAATATGCCGCTAGCTTGAAGTCCACTAACTCACCAACGCATTTGCTGCTGCAAACACTAACTCGCCAAGGCATTTGCTGCTGCAAAAGTCGTTTGAAGGTCTTTCACCTATCTATGCAATTATGTGTCCTGTCTTTAATCTTGACATAAAAATTGCACAATTTATTCTCCTTTGGCATGTTGGTTATGTCCAATATATCGAGTTGAAGTTCTTCATGTACTCACGAATTTTGCCGGTAGGTTTGAGTTTCCTGGGGTGTCCTTGCAATCAAAGATGTACCGCAAGGAAGGAACTTATCCCTTGCTTCCTACTTTAGACTTTCCTAAGTCGCTATCTTGGGTTTGCCTTCATTGGCATCATCAAGAGCATGAGTCCACCAAAGTTCTGCATCACTATATATATATATATATATTCGTGGATACTTGCTCTTCAGACGGAGTACGGCCTACACCAAAGTATTGCTTCTAAAGATTCTGTCTCCAACTCCTTTGCGTCCCACAAGCTGCTA
AAAGCTTTAGGCTTTGGAACCTTGACTTTGAATGGACGCCTTTCCCTAGGGTTTGGCATTGCCATTGCTTTTTTCATCAAAGCTATGTCTTCCTTTATGGAGTCTAATCTTGTTTTCCGTTGTGTGCATAATGCTGCCAAGTCTCTTGCCAGCAAAGCCAATCGGCTTCTTGTATGTTATACTAAAACTTGGATGTCCTCTTTCACTTTCTGTAGAGATGCATGCTGTTGCTCTGATTGTGGTAATGTTGGAGGCACATCATCAGATTGCAGAATGCCTACAAATTCCTCCAATTTGCTAACTCAATCTCTTATGGTTTCCAACATTTGTAATCTCCTATCAGATGAACTATTTTCTCTCGGATTTTGATATTTTCCCCAAACCCTAATTGAAGCTCTGATACCAAAATTGTCATGGTCTTTTGAAACGTTGGAAGCCCACATCAATTGCATTTTTCTGTCTCACATGGAATGACCCCTTCTCTAGCACATGTGGCCTAGATGGTAAGACCTCACCTAGTCAGCTCTCACCATCATGATCGAAGTGTATACATAGGTCCAAGGGTAAGCCTCATTCCCAAGTGGGATAAAGGTAGAATAAGAGTAAAGGAGAAAGAACTTGCCCAACCCTTGCTTTAGCCATACAAACACTTTACCGTAAGCCTTCCCGATCTTTATTAAAAATTAACAAAGGAGTAAGCGGCCTTGTAAATTTTTGGAACATAAGCTTTGACTCTCTGTCGCTCACCTTCCTCCTCCTTGCCTACAATATGGCCTTCCTTAAATTTAATTTCCCTATCTCTAAGCCTTAGCCTTCTAGAGCCCATGCTATTGTGGAGTCTCCATGGTTTGGCCGATTCCTGATTCTCTTCAACTGCATGCTTATCAGATCCTCCTAGAATCATGGGGAGTTTATGAAATCTCCTCGTGGTGGCTTCCTTGATCCATGAAAACTTGTTATATTTTCACCTTGTATAGGATCTCCTTGGATTTGGCTTCAAAGATATCCATCACTTATCTCATCCATTTCTGCAGATTAGGAAATCTAACTCAAATTTCAATCACTACTTTTCTAGCCCATGTGTACATCACATGCTTGTTGCTTGCCCATGTGCCCAGCCCAAGCCTACTCGCACACTGCCTTGTGTCTCACGCCCATTCGTCGTGCTTTGTTGTTTGCTTCCTTGCCCCATGTGCACTGCCTAATGCATGGCGCTGCACCTGTGCTGTCATCCCTGGCACTAAGTAGCTGCTCAGTCTCCCCCCAGCACTCTGGTGGGTAGGTAGTTTGCCTCCATGGCCGCCCAAATTGTGTCATGCTAACCTTCTTAACATGTTGCTGACCCTCCAAAGTCAAATTCGGCCCTGTTTCACACACGTAGCCCTCCTTGTTTTATTGATAGCTTGACATGTCTGGATTAGTTCGCTAATATTACTATCAGCTAGCGTGCTTAGATTTTTAAATAGCTTGCATATGGGAAATAGGAATAAAATTGGATTTTCTGACCTAGGTTTTACCTTTCAATAATATTCATATGCCCCTTTGCAGCTCAACTAGCTCGTCGGAACCACTTTATTTAAATTAAAATTCTTTTATGGTAAATTTCTGGTTCTTTCTTTTGAAGTGCTGGGTAATTTGTAGAGACAGCATCAACTGCTCAACTTTTTTCTTAATTAGGAAATAGAGAAATGTGGTATTGAGAAATAAGCTTCTATTATCCAGTACTGATGGCCAAAAGTTAATTCTATATCAAAGGTTCAGTAATCTACTTTCTCTTTCCTTTTATAGGAATTGGCTGATGCTATTTTGTGGGACGAAGTGAACTCTCATGATGATGCACCAGTGCAGCCTACTTTTAGTTCTGTGTGGAAAACATGGAAGCATGAAGTTACAAAATGGTTTTCAATATCTCCCACCCTTCCCATTATCAGAGACAAAGAGCAGCAGACTGTTGAAGCTTTCTTAGCTACAACTCTCCAAGTTAGCAGGAAGAGGCCCAA
GCTTGAAGTTCGTCGTGCAGAGGCACATGCTTCACTGGTCGAATCAAAGTGCTCAGATGTAGCTATGGCTCTTCATATTGATTCTGGTTTTTCAATAGCCAGAATAGTTTAAATGCTAAATTAGCATCAGAATCTCACAAAGTAGAGGCAAGGAAGGTTGCTAAATCAGCAGACTCACTCAGTACCGTACCTGGTAGGTTGGGTGGGATTGTAGTTCAAACTGGAAATTCGGAGCTAGCCTTTTGCAAGGATGTGGAACTGACGCCTCTTACTGAAGTAGTAGCAGAAAAACCCTTAAATTTTGGTAATAAGAATCGACAATGCATAGCATTTATTGAATCCAAGGGAAGGCAGTGTGTTAGGTGGGCCAATGAGGGTGATGTTTACTGTTGTGTGCATTTATCCTCTCGTTTCACAGGCAACTCTGATAAGAAAGAACACACTCGTTCTGTTGAATCGCCAATGTGCCAAGGTACTACTGTTCTTGGAACTAGGTGCAAGCATCGATCTTTATTTGGCTCCTATTCTGTAAGAAGCACAGACCAAGGAGTGAAACAAATCAGAATCAAATTCCCTTGAAAATAAGCTTATCGAGAAGCAACAGGACATTTATGGTGTAGAAGCTACTGGTTATAAAGAAATAAAGTTTGTTGGAGATGTTGGAAATCCCCTTGGAGTGGATGAGGGTGATGTGACCAATAATGGAAATAGCTCATCTGATAAGCTTGGGCATCATGGAAAAGACTCTATTGCCTCAGAGGTCCGACACTGTATTGGCTCTTGTGAACATATTGACAGCAATCCATGTTTAGAAAGCCCAAAACGTCATTCTCTATATTGTGAAAAGCATCTACCAAGCTGGCTTAAACGTGCAAGAAATGGTAAGAGTAGAGTAATATCGAAGGAAGTATTCATGGATCTTTTAAGAGACTGTATCTCACAGGAGCAAAAGATACATCTGCATCAAGCCTGTGAGCTATTTTACAGGCTTTTCAAAAGTATTTTATCACTGAGGAATCCAGTTCCTGTGGAGGTTCAATTTCAGTGGGCGCTGTCTGAAGCTTCTAAAAATTTTGGAGTTGGGGAACAGTTTATGAAATTAGTTTGTCGTGAAAAGGAAAGATTGAAAAGAATATGGGGATTTGATGCTGAAGAAGCACAACTTTCCTCATATTCGATGGAAGTGCCAACTTCAGGGCCATTATTAGCTTCAGGTAATCACGATGATGATATGGGTATGAGATGCAAAATTTGCTCTGAAAATTTCTTGATGACCAAGCACTCAGTACTCACTTCATGGATAGTCATAAAAAGGAAGCACAGTGGCTGTTCAGAGGTTATGCTTGTGCCATCTGCCTGGATTCGTTCACCAATAAGAAAGTTCTAGAAACTCATGTACAGGAGAGACACCATGCACCATTTGTTGAGCAATGCATGCTTCTCCAGTGTATTCCTTGTGGCAGCCATTTTGGGAATACTGAACAATTATGGTTACATGTAGTTGCTGTTCATCCTGTTGATTTCAGATTGTCAAATTCTACTCAACAGCATAATTCTTCAGATGATGAGGATTCTCCAGTTAAACCCGAGCAGTGTAATATAGTTTCTCAGGAGAATGACAACAAGAACGTTGGTGGTTTACGAAAGTTTATTTGTAGGTTCTGTGGTTTGAAGTTTGATTTATTGCCTGATCTTGGTCGTCACCATCAAGCTGCGCACATGGGGCCAGGTTTAGTTAACTCCCGACCTGCAAAGAGGGGATTACATTATTTTGCTTATAAATTAAAATCTGGGAAACTTGGTCATCCTAGATTTAAGAAGACTCTAGCAGGTGCATCAAATAGGATCAGAAACAGAACAAAAGCAAGCATGAAAAAACATATTCAAGCTTCAAAATTACTAAGCACAGGAAGCATAAACCTTCAACCTCATGTGTCTCAGTCAGCAAGTTCTCGTAAATTGACCCAAGGTTCAACTGTTGCGAAGGCATTGGTTTCTGA
GATTCAGAAAAGAAAATTATCACCTACCAATATTGATATTTTGTCTATTGCTCGCTCTGCCTGTTGCAAGGTCAATTTTAAAGTCCTGTTGGAACAGATTTTGGAGTATTACCTGAATATTTTTATCTTAAGGCAGCTGAGTTATGCAAGGATAAAGTGAAGTTAACTGGTACGTCAAGGGGTTTGTTTGTCCTAAAGGATGTGAGACATTTGAGGATCCTCTTTTGCTTCCCCATGTGATGCCTCATCCACATGGTTTTGGGAACCACAAGAATGCACGCACTCCTGATACTGTGAGTAGTAAATGGAAAGTCACAGATGTGGTTATGTCATCGGTTCTCATCTTTCAAGCCAGCAGGTAAAGGAAAAGGTCATCATCTTGTGTGAAGATATAAGCTTTGGCCAGGAATTAGTTCCTGTGGTCTGTGTAGCTGATGAAGGTCAAAGGAACTCACTTCACATACCTCTAGCTAACTCTGATGATCAAAATGCTAGATACTCCATGCCTTGGGAAAATTTTTCCTATATTAAGAAACCATTGCTTGATAAGTCCCTCGCTAGTCATACAGAGGTAGTTTTATATGAATTACTTTTTGTCTCAAGAGTAGTGCTAAACTGATATAATTTCTTGGTTTCAATGTACTGTTTCTTCTATATGAAAATTATGTAGATTAAATGTGAGTTCTCTTAGGGTGTTATTGTTCATGACACATATCCTTACTTTTTTTAGTGTTTATCATAGAGGTCCTTATGAGTTAACTACCTAATGGGTAGATATAAATTTCCTTGTACAATCTGTATATGATCCTTAGTTGAGAGACAGCCAGATGTTTTATTCGTTTGTACCATCCAGGATAATGCATATTGTTCTGGTTTTGGCCGGCATCAAAATGCTTTTCTTTTTATGAGCAGCATCAAAATGCTTTTATATAACTTCTGAATATCATTTCTGAATAAAACCTAATTATGATTGTAAATTTTCAATGTACACAATGTAAAGTCATTTTTGTATTTGACATTATGCACAGAATTTGCAGTTCGGATGTGCCTGCCCACATTCACTCTGTTCTTCTGAAACATGTGATCACGTATACCTCTTTGATAGTGATTATGAAGACCCAAAGGACATTTATGGGAATCCCATGCGTCGCAGGTTCCCATATGATGAGAATGGTCGGATGATTCTAGAGGTATGGTTTCCATTACTTTCCAGCCACACAGTCACATTTCTTGAAGTAATAATCGTTTTTGAACTGCTACCATAATCTTTATATTGTTTATTCCCACCTCCCCTCTCTTTGGAGACTTTGGTTTTATATAATTGCTAGTTTTGTTAAGTTTGACTTGATAATCTGATTTACTCATGCCGATTTGCTGCTTGACCATGACAGTTTGTAATTTCTTCTTTATATGAAATATAAATGTGACTAAGAGTGCTTTGTTGGCATTAAATAAACTTGCAATCTCTTTTGTTTTTTATTCATTCATCTGCTGCTGTATCATCTTCTCTGGACCATTACAAAGCAATTGGCCTTGGTTCTAGGAAGCCTTTAAATTTTAAAACCTTAATACTTCTTGTTCCGACCTTTTAATGTATGATTTCTGAGATTGAGTTATGTCTTTGAATGCTCGCATTTATAACTAATGAAATTGTGCAGGAAGGTTACCTTGTCTATGAGTGTAATGAAAGGTGCAGCTGTAGTCAAACCTGTCCAAATAGAGTGTTGCAAAATGGAGTTCAAGTGAAACTTGAAGTCTTCATGACAGAAACAAAGGCAATTTCTTCTTTATGCGCTTTTTATTTTCTTTTTCCGCTAACTGGTATATAACCTTTAATTTCCTAGATGGTGTTTTCAGTAGTTTCTTATTTCTTATATGAAGTTGTAAAAACATCACAGGGATGGGCAGTGAGGGCTGGTGAAGCCATCCTGCGTGGCACATTTATTTGTGAGTACATTGGGGAGGTGTTGGATGAGCAGGAAGCAAACAGAAGACGTAAC
AGGTTTGTATTATTTCTTGAAAGTTCTGTTTAAAATCAGATTTTCCCTATAATTGATCCATTATATCTTTTATCAGGTATAACAGTGAAGGCAACTGCTATTTCTTGGATGTGGATGCTCATATTAATGACATTAGCAGATTAATTGAAGGATCGGCTAGATATATTATTGATGCCACAAATTACGGAAATGTTTCGAGATTCATAAATCACTGGTGAGCAGAGATTTGTGTTTCACATGTTCTTTTTACTTGTTTAGACATTTGGTAATTTTAGTACGTATTTTGAATCTTTTGTGACCATTTTAAGACCAACCAATTTGTGAGCAACAAATTTATTCTATGCTCGGGATCATTTCTTTTGAATTTCCAGAAAGCTGCTAATCAGTTCCTTTAGATCTATTTTTACCACATATTATATGCAAAATACATCATTTATGAAACAGGTGGTGTTATTTACCTTTTAACTTTTTTTTATGCTAAGATTATTATTTATCTAATAGCATCTAACTGATTGTATTGTTAACAATTATATAAATTATACATTCTTTTTTACATCATTGTTCAATGTGAAGTTTTGAGCTTTTTTGAGCCAGTTAATGTTGTATCGTGCATATCTCAAGATGTACATCTCTACTTTATTACAATTGACATTCTTCGTTGTTTGCAGTTGCTCACCAAATCTTGTAACTTACCAAGTCCTCGTAGAAAGCATGGAATATCAACGCTCACATATTGGATTGTATGCAAACCGGGATGTAAGTACTCACTTTAATGGCATGATGGATTTCAGAGCATATTTGAGTTCTAATTATTCTGCAACTGGTTTTGACATTTGAGATATAAATATAACCTAGTGAACCTTAACTTGAGATCTATATTAGGAATCTTATCTCAGAAACATTCTGTATAATCATTGACAATCTTCATATAGAAAATATGTATACTTGTACTTGTGCAGATAGCTACTGGTGAAGAGCTGACATTTGACTATCGACGCGAGCTATTGCCTGGAGGAAATCACGGCTGTGAATCTTCGAATTGCTGAGACCACCTTTATTAAAACATTTGAAGAACTTGTACATCCTATCATTTGAGGGAAAAAGATCTGGAGCTTAAAAGTTTGGAAAGGGAGCAAGAGAGATTCTGAGAGCGCATGCCCTGCACTTTTTTGAGAATTATTAATGGTCTTATTCCCTCATATAGCCAAGCAACTCCCTCAGCCTGGAAACTGAGTTGGAATTAGAGAAAAATGGTTAACTAAATTTTCTGTATGTAGGAACTCGTGAACGGTTGTAGAATTGTCGATGTCCCCCGACCACCTCTAGTTTGGGATTTATCTTATGTTCACTGCTGAACCAAGTTAGTTTAGCAAATTGTAAATGATTTTTTTTGTTCTCCAAAGTTTAAGGAAGTTTTAGGAATAATATGTTCCTTTTTTTTTGGCTGTTATGATGGGTTGTTTTAAAGTGTATCATATTCATAGTCATAATGTGGCAGATCATTCAATTGTCCCATCAGAATGGATCTATTTTATCAGGATTTAGCATTACAAATTGGAAACTACATGGAACTAAGGCAACCAACGTTGAACTGAAAAATACTCTCTCCTTTTGAACTTTTTTGAGTAAGATTTGTCCAACTACATGATTCAGAATTTCATTTTCCCCTTTTGAACCAAGAAGAAGAAAATGGGAAGTCCATAACCAAATGCCACCAACGTTGCCATTCCTGCAATCACAATTCCATCCTCGAAATCAGAAATACATTCCAATTGAAAACGTTCTTTAGGCCAAAAAATAGATATCTAATGTATCATATAAATGGATAAAAAACGCGCTTATGTGAAATAGTTTTTTATGCAAAATTTTGGAATTAGATGCTTGAAAGTAAAAAGGACTAAATAGAATTGGAATTTAAAAAAAAAAAACTGTTGAACAAAAAGAGTGGAATGAGCTAGCCTTACCATATGGGAGATATCTTGGAAACACCCCTTTACTTT
TAGTTCTGTGAATTTTTGGGGATGATGATGCTGCTATCAGCTGTAAAATCATGGAAGACACAACCAACCCACTTCCTGTTTTCTCCCTCACCACTCTCTCACTTAAATCTAGAGAGAGAAACCCTGCAATTCCATTCACATAATCCTCTGCTGCCACCTTCACCCACAAATATATTCCAAGTCCCAGTGTTGCCTTGGCAACATAACGCATGGCTTCTGAGGAGATAACCCAGCTGACAACATTCACAGCACAAGCACATCCATAAAGTGGAACCACAAATACCAATCTGCCCAAGTTGAGATCAAAATTTGAGATTCAACCAAACGTTTATTTAAATCGATTCTCGAATCGAGTTCAAGTTTCTGAAAGGATAGTCTAGGTTTTATTAAATCCTTCGTTTTCAGGCCTAAAATTTTGAAAAGTTGGATCTTTCATTTCCCAGTTTGATAATTGAAGGAAGAAAAGAGAAGGGTTTGGTGTACCAGGATCGTTAAACTGAACAGCAGTGGAATAAGCAAATAGAAGCGCCATTAACAGAGAGCAGCTGCTGTATAACTTGGAGGGTGTTACCATTTTCTCCTTTTTCCAGAGCTCCGATCCAACAGCAACCCTTTATGGTCAACGCCGTTTTGAATATGTTTTTTTAATATGATTAAGTAAACCATAAGTTTAGGTGTACCAGAAATGGCTTAAAGTTTCCTTTGTGGGTTGAAAATTAACATTCACTCACATCGGTTTGGTGGAAAGTTGATAATTAATCATCTGTAAGGTTTTGTTAATCGCCGTTTAATCTAAAACTTGGACAGATCAACTGTCAAATTTTGAACAAAGTTGAAAATGCTTAAATTAATCTGTATAGAATGTGTTTATTTTTTTAAAAAAATCTCAAGCTTAAATTTTAGGAATATCCATGTGAATCAACTTGAAACAAGTTTGTAATCCAAGGAAATCCAATTAAACTAGATTGATTAGATATCTCAAATCATACAATTTCTTTCTTGAGTACTCAAATAGTTAAGAGATTTCGTGAAATTGAGGAATGAGAATAATAACATATGATGATGATGAGTGAGAAGGGAAGATGCAATAAAAGTGAAGGAATGGGCATGCAATGCCCTTATGATTTGGCAGCACAACATTGTTCTTATTGCATGGTCCTTGCCTTGTTTTGTTTGAGGTAAGAGATTCTTTTGTCATATGTTTTGTAACATTTACTGAAAAATGTCAGATGAATGAAACAACTGAAAAGTTCAAATTGAACAAACTTGTATCAATTGCTTTAAATCAGGCAGTTGAAGTTGCAGTGAATGTAGCTCTGATGCCAAACCATAAGTTTTGTATGGCATGCATTAAGATGTAACCAAAAAAAAGAAAAGGAAGTTTTGTTGTGGGTCATATCATAATGAGAGTATTAGAATGCTTTTTCCTCCTCTATATTTGTTGGATAGTGTTGTTCATTTTGTAAAATTCAACGTTAGTTTTCACTGTAATAGTTTGATAGTATATTTGCAATCGATGGTTACCTGCATAATTGCGATGGAATAGTAGTAGTTGTAGATAGTGATGAGTTATCAAAAAAAAACTAAATACTTGTATAGTTCAATTGGAGGATTACAATTGTGAGAACACTTCAATTATATGAGACGACCCTATTATTAATTATTGGAATTCAAGTTACATGAACTGAAGACAAGTTTTAAAATGGCTTATGTGACCCTTGATAGTAGGATAATGCATATTCACGTTGAACACAATCCTAACAAAGCAGTTATTAGTGTCATCAAATTGGGTTGGGGAGAAAGGCCATTATTTCTGGGGTGGCACCCTCACAAGCTTCTTTCCCCATAATGTGGCCCCTTCTACCCATGTAGTCTTCTATGATGAGAAAGTGTAAGTGGAGACAACATACTAATATTATCCAAGGAGACAACACTTTTCAAATAAAGACCCATTTTGATGAACACTTTGAATAAATCTAGAATGTGTCTCCAAAATTGAT
GGTTGCTAGATGATATTTAATATAGTTTATCATATATGAGCCGAATTCAATTGAATGCTCTTGAAAGTGATTCATGTTGAAGATGACTTCTTTGTAAGAAAGGAGCCAAATTTGTAGAGTAAGAAAGAAAATCTTGACAAAGATGAGAGTTTAGTTGATAAGTAATTCAAGAAATTTGTTTCGGCCTACAATATACCATTATACTAACAAAAAAAAAGTTATACCGAAATTATACATTTTAATAACTATCATTTTTTCAATCTCAAACGCATACATAATATTTTTTCCACACATATTGAAATCCTATTCAATAAATCATCTCTCTATATTAGATCACTCATGACATTTAACATGGTATTAAAATAAGAGATGTTATGTTCAAACTCTTGTCCTCCTTAATTTAATTAGTGTTGATAGTTTGTTTAGTAGTGTCGTGCCAAATTCGAAACCCACGAATGAAAATGAGGAAGAAAGGTTGGTGAGAAGGGTTAGTGAGTTGCATTAAAAGTGAAAGGTTTTGCATATGGGAAGGGCTAAGTTTGTAGGCTAGAAAATTAAATAGAGCATGTTGAAAGGAAAGGTTTGCCAACGAGGCCACGACCTATGTAACATTTAGAGTATACGCCATTGGCCCATTTTTACACCAATCACTACTCATCCATCCTTCTCAATTATTTCCCTTCTCCTACCCCATTTGTTAATCACAATACCTTCTCTTTTTTTCTTTTTGAGTTCAATAATTGCAGGGTGGGGATCGAACTATCGACTTTTGAGATGGTAATAGGTACCTTATTCACTAAACTATGCTTGAATTGACTTTTCTCTTCTTCTTAATATATATTTTATTTTAAAAAAATTACACGAGTCACCATAAACTATAAAAAGGGTTGATATAATTATACCTCGGCTTCTAATTTAAACAATTATTCATCAAACTTATGCAATGAAATCTTATCGATTGATCTAGGGTTGTCAATTAGTAGATAAGTTTTTAACTACCAACTTCTCTATGGTTTATTGGTTGTTACAGAGAAATGTATGATTCAAGTTCATTTTATTTTGCTACTTTTATCATTATGATTATTGTGCTGTCATATATTTTAATTTAGTCCTAATATTTTTGTAAACGTATCACTTTAGTCATTAACGTTGCTTTTTAAATTAAACTGTAATTTAAATTGATATAATATCACACGTGGAAGCTAACATGGTACATTAATATGCTACAGGGTTGTATATTTGTGAGAGTTAGTTTTTTTTTTATTGTGTTGGATGTCTCTTAGTTGGATTTGGACATTTTGTCGCGCTTTTTTTATCTTTATGATTAGGTTCAAGTATTTTTGTGTTGGGCATTTGTGCATTTCCATAACATGATTTTGAATATTTTGTTGGTTGATTTTTGTTTGATTAATTATTTATTGATTCATGACCAATATTTTAATTTAGTTTTGAAGAGTATTAACTGCATCAGTTATTTAATTTACTGTTTAATTGGATGAAAATAAAAGTTTAAAATTAAAATTGAGGATAAAAAACTTATAAACAAACTTATATTTTTTACATAAAATTTTTTACAAGGACAAAATGGGTAATCAAAATCATAGCATGTGATGAATATATCCATTTAAAACTATTTCTTACTTGCCACACTCAATAATTGCACTCAAGTCACATTATAATTCATCAACCACAAAACCTAGAAATCAAAACGAAAAATTAAAAAAGAGAACAAGTCAAAAAGCTTCCCAAGACAATGTTTCATATTTGGTCATCTGATTTGTCTGCATGATGGTGATGTTGTCTACCTACTCTCTCTCTCTCTCTCCTCTCTCTCTCTCAAGCAAATTCAGCAAAAATAAAAAGAAGTGTAAAATGAAAAAGAAAGGGGAGGAACAAAACAAGATAGCAAAAATCTTCAAGGAGGTTTTGAAGTTTAATTATTACTTAAAATAATGGGCCCAACACGGAGTTTTGACAAGTGTGACATGGAATTATAGAAA
GCAAGTTTTGATTGGAGGAGAGCATTGTGAAAATTGAGCTTTGACAAATGAGGTTGCTTTGGGGGAGATGAGGTTGTTGTGGGGAGGACTTGACTTGCCATTATTTAGAATCCAAGTTGAATTTGCTTTGACTAGATTGTGTATCTCAAGCTAGCTTAAGTTTGCATTTCAAAATTAATTTAATCCAAATCTCAGTTTTCTTAATTCAATTTGGCCCACCAACATCATTCATATCGCTTTTGTTTTTTCTTATCACATCACACTTTTCATATATCTTAATCAAATTGTTAACTTGATTTTCAAGTTAAACAATTTTGAAATGTGAAGTTTGCAAACAAAACAGGTGAGAACAGGGAGCATGCAGTGAAGCTCCAAAGAAGACATTTGCATGCTGTTAGGATTTTGGACCATCCATCGTTGTCATCATCCCCCCCTATGAAATCCAAACCGCTAATCCGTAGCTTTCACTGAGAAAATCAAAAAATGTTCTGTTCTGGAACTGGAGATGCCCTTTTCCTCCATTATTGCTCCCACTTTGAGCTCAACTTAGCAATGTCGTGAGGGTATTTCGGTCACTGTTCTGACAGTACACATACAGACATATTACCACCGTACCCACCACACCAAGTGAAAGCGAAGGCCTTCTCATTTTGGGCATGCCTAGAATATCCACAGATGCAACAAAGCATTGAGGAACCCAATAAATTTTAAAGTAGAATTGGAAATGATGAGGGGCGTGCCTGCTTGATTGCTGAGAAAGCTTTGTGGGTGCGTGTAACAGAGAGAGAGATTGCAGAAAAAAAAAAAGTTGTGCAAATTCTGAGAGACCCACATGGAGATACAATCATCAAAAGTGAGATGGAGGCAAACCGGTAAGTGATAAAGTAGCCATACCCACCCCTTCTTTCACTTTTTTTTTTTCTTTTTCTTTTGTGGGTCTTCTGGTTTTTTCTTTACTTCAGAAAGTAAAAGAGTCTCTTCACTAATTATTCCACTTACAAGGCTTTGTTCGCTGGTTCCCTTGATGCTTGACACCGGAGCTCCCTTTTCAATCATACTACGCAGTCGTGGTTTCGAATAAGAGGTGGGTTTTCTCAGACTTCTCAGCCTTCACTTTGGCACTGGTATTGACACTGTTGAGAACAGATGGTGGTGAGTTTTGGCTCTTTGGGAAGCTTTAAGAATCTCTAGTCTGCAAATGAACAATGAAATTCAGTACGCGGGTGTTTGAGTTTTTTAGGTTTTCTTCAAGGGAAATGTACGAGGCTGCGAATCTGTAGGTGATGCTTTTAATGCTTTCGATTCTGAAGCAAATTAGGGGCGAAGGAGCGATACGAATACAAAGTGCTTTTGGCTGAAAATACCGTTCTTCCATTTGGCCTTCCACTTCATTTTTCTGCTAAATTTAGATCGTTTGTCGTCTGATCTGATCTTCTTATGTCAGAAATTGGGTACACTTCAGTTTAAGTTTGAGTTGCAATTTGTGCATGTCGTTGTTCAGTTCAAAGCTTGGCGTCTTTGATTCAGAAAGTAGGAGGCTGTAAGTTTCAATTATCTTACTTTCCTCTTTTATCGTCGAGCACATATTTTTTCCTTGTAGAAAGCTTTTGCGGCTGTCCTGAGGTGCTCCGAGTCTTTTGTGCTTAATGTTCCGTTTTCCCATAGTCAAACAAGCTGCAATAAATCAATAAGAATGGCTGAAATAAGGTTAACTAGAGTAGAACAAGGCCAAACCAAGATTAAAAATGTTCCAATTGCTGTTACCCCAGAAGGTTTTTGGTGCTGCCCTTCTCCTGTTGTTTTCCAAAAGACCCTCAAAGGTCAAAATGCTCTAAACAAACCGAAACCTGCCTCACCGACCCCTAAGAGCCCTGTTGAGAAGAAACCGACCCCAGTGACTGATAGGAAGCCAGCGCTTACAAGATCACGCTCGGCTGCTGTTTCTGACGATGACCGAAAATGCAATGCTGATAATTCTGGCTTTAGTGCCCCAGAGGTCG
TACACAGGGTACCACGGCCTAAGATTGAAAATATGCCAAGAAAAATAGCAATTGAGTTTGGTGAGCCAGGGACAAGTAATATAAAGGTTGTTTTACTCGGGAAGCAAGGATTTTCGGTGAAGTTGAGTGTTCATAAGAACGTTTTGATGGATAATAGTACTTTTTTTGCCAATAAACTTTCTGACAAAGAAGGTTCCTCTCTGGAAATTGGTGATTGTGAAGATGTTGAAATATACGTTGAAACCGTGGGATTGATGTACTGCAAAGAAATGAAGCAATGGCTGATGAAGCAAAATGTTTCTCGTGTTCTGCGAATTCTTAAGGTAAGATGCAGATTCTTATACTCATAGTATGTTAAAATCATAAATTACATAATATTCTTTGCCCTTCTGTGGGAATGTTCTTATAACGATTTTGATATTTCAGTTGATTGTTTGTGGTTCTGATTAAAAATTATAAATCACATTATAATGAGTAGGATCTTTTCTACTGGGAAGGCATGAGATGGTTATACGTGTTGCAGTCCTAGTTGTTCAAAAACAGTAGTTCGCTAAATAAATGTACTTGATAGATTTTGGTGGGTGCGTCATGAGAAAATCATTGAAGGATTAACTGTCTCAAAATGAATCGGTGGCAAGTGCTTGGTTTTGTTCATGGCTTGCTTCTTCTCCAAAGTAAGAGGATGGTATAGGAGGGTAGGATCCCATACATACAAACCTGTATGAGTGCACTGTGCAAACTGCAAAAGCATAGCACTCTGTAGACCCCTCGTTATGACATAATTGGACAGTCTTGCAACTTAGCTAAGTGGTTTTAAGACCTTCAAATGCCAAAAAATTTTCTGTTTCGTTTGAATGCCATTTGGCTCAATTTTTACTTCATTCTTTCTTAGTTTTGGAAAAAAAAACGAAATAAAAATAAAAATCCTGGTGATGTTTAGTGGCTATAGTTGAAAGGATGTCATTGGTCTGGCTAGTGGAATGATGGAGCCATCCTTTTTGATATGTATCTATAGATGCTTGGAGAAGCTATGTGGTCTGTATATGTATATATGAGGATTAGATGATAATAAAGACACAGAAGACCCTTAGATAACTCTATTTTAATAACCATATTGGGGTTAATATATCCTTATTTCCTACTCCACTCCACTTGAAAGTTGCATATATCGTAAAGGTTAGTATTGTTCGATTATGTTTGAAGTATTCAAGCAATTTTGGATAAATCAAATCAGAAAATTGGTTACTATTGTCAAGGCATATAAATTGAAGTTTGGGAAGCACATGGGTTACAGGAAAAGTCTAGGGGAAGAACGATATCAGTGTGGATGTGGCTCAATCTGGAAGTACCTTGCTACTTCCAGGACATTTAAATTTAGGTATCTGTGGAAAGATTTTCTGGGCTTGTTCTAATCTGAATAAAATCTAATGATCTTCATGGATGTTTTATTAGTCTTATCTTTTATCGCCAGGCAATGGCTGTGGCATACGCATGCTTATACAGCTCCCTCGCCATTTTGAGAGATCCTTTTAATGTTTGTTACCTTGAATTCTTTTACTGTGTAATGGACTTCAAACTTACCAAAAATACTTATCTTGCAGGTTGCTGAATTCCTTGGTTTTAAATCATGCATGCAGTCTTGTCTTGAATATTTGGAAGCAGTCCCTTGGGTTGGTGACGAAGAAGAAAAGGTTGTCACATCAATCCTGCGCCTTCAAAGTGAGGGCATTGGAGTGAGCCCGGTATTGAAACGAGTGTCTGCGGATGTGTCTAAACCCCATAAAGATACTCTTTCCCATATCATCGAACTTGTTCTTAGAAGCAACGAGGAGAGAGGCCGACGTGAAATGAAATTGGTGGTACTGAGGCTGCTTAGGGAGAACCAGAGTGTCCCGAGCCATGCGAGTTCTGCTGACATTTGCAATGAAATTATTTACTCTTCTTGTAGAAGCTGTTTGGGGTCGCTATTGTTCCTGTTCCAGCAGGCTGCTGAAACTGAT
TTTACAGATAGATCAGTGGACAGGAAAGAACCAGTGCTGAAGCAAATTACTCTAGAGGCCGATAACCTTTCGTGGTTGCTTGAGATTTTAGCTGACAGGCAAGCAGCCGATGAGTTTGCGTTATATGGTCAAACCTTCAGGAACTAGCAGCTCTCCATGCAAAGTTGCCTATCGTTTCACGTTACCATGTTAGCTGCATAACAGCACGGTTATTTGTTGGCATTGGCAAAGGAGAGCTCTTACCAGCAAAGGATACCCGAAAGTTGCTGTTACATACATGGTTGGAGCCGTTGATCAATGATTATAGCTGGTTAAAACATGGTTGTGGGTCGTTTGATCGAAAGGTCGTGGAGGAAGGAATTGGTCGGACGATCCTCACTCTCCCTCTAGAGGATCAGCAAAACATTTTGCTGACTTGGTTGGGCAGTTTTCTGAAAGTTGGAGATAGTTGCCCAAATCTCCAAAGAGCGTTCGAGGTATGGTGGCGTCGGACCTTCATTCGACCTTACGTTGAGACAGAAGGAAGCATTCGACAGCAAGATTGCTCAATCACACCCCAGTTGGAGCCTTGA

mRNA sequence

TGGAGAGGGAAGTGGCAGGCTGGAATTAGATGTGCAAGGGCTGACTGGCCATTATCTACTTTAAAAGCCAAACCTACACATGACAGGAAGAAGTATTTTGTGGTCTTTTTTCCACACACAAGGAATTATTCTTGGGCAGATGCATTGCTTGTTCGTTCTATTGAAGAATTTCCTCAGCCTATTGCATACAAGAGCCACAAAGCTGGTTTAAAATTGGTTGAAGATGTAAAAGTTGCTAGGAGATTTATAATGAAAAAACTTGCCGTTAGCATGCTAAATATCATAGACCAATTTCACCTCGAGGCTCTGATAGAGAGTGCTCGTGATGTAATGACTTGGAAAGAGTTTGCCATGGAAGCTTCACGCTGTAATGGTTATTCCGATCTTGGAAGAATGCTCCTGAAGCTGCAGAATATGATAGTGCAGTGCTTCATAAATTCAGATTGGCTTCATCATTCTTTGCATTCTTGGGTACAACGATGTCAAAATGCTCAAACTGCAGAAATTATTGAAATGCTCAAGGAGGAATTGGCTGATGCTATTTTGTGGGACGAAGTGAACTCTCATGATGATGCACCAGTGCAGCCTACTTTTAGTTCTGTGTGGAAAACATGGAAGCATGAAGTTACAAAATGGTTTTCAATATCTCCCACCCTTCCCATTATCAGAGACAAAGAGCAGCAGACTGTTGAAGCTTTCTTAGCTACAACTCTCCAAGTTAGCAGGAAGAGGCCCAAGCTTGAAGTTCGTCGTGCAGAGGCACATGCTTCACTGGTCGAATCAAAGTGCTCAGATGTAGCTATGGCTCTTCATATTGATTCTGAATCTCACAAAGTAGAGGCAAGGAAGGTTGCTAAATCAGCAGACTCACTCAGTACCGTACCTGGTAGGTTGGGTGGGATTGTAGTTCAAACTGGAAATTCGGAGCTAGCCTTTTGCAAGGATGTGGAACTGACGCCTCTTACTGAAGTAGTAGCAGAAAAACCCTTAAATTTTGGTAATAAGAATCGACAATGCATAGCATTTATTGAATCCAAGGGAAGGCAGTGTGTTAGGTGGGCCAATGAGGGTGATGTTTACTGTTGTGTGCATTTATCCTCTCGTTTCACAGGCAACTCTGATAAGAAAGAACACACTCGTTCTGTTGAATCGCCAATGTGCCAAGGTACTACTGTTCTTGGAACTAGGTGCAAGCATCGATCTTTATTTGGCTCCTATTCTGACATTTATGGTGTAGAAGCTACTGGTTATAAAGAAATAAAGTTTGTTGGAGATGTTGGAAATCCCCTTGGAGTGGATGAGGGTGATGTGACCAATAATGGAAATAGCTCATCTGATAAGCTTGGGCATCATGGAAAAGACTCTATTGCCTCAGAGGTCCGACACTGTATTGGCTCTTGTGAACATATTGACAGCAATCCATGTTTAGAAAGCCCAAAACGTCATTCTCTATATTGTGAAAAGCATCTACCAAGCTGGCTTAAACGTGCAAGAAATGGTAAGAGTAGAGTAATATCGAAGGAAGTATTCATGGATCTTTTAAGAGACTGTATCTCACAGGAGCAAAAGATACATCTGCATCAAGCCTGTGAGCTATTTTACAGGCTTTTCAAAAGTATTTTATCACTGAGGAATCCAGTTCCTGTGGAGGTTCAATTTCAGTGGGCGCTGTCTGAAGCTTCTAAAAATTTTGGAGTTGGGGAACAGTTTATGAAATTAGTTTGTCGTGAAAAGGAAAGATTGAAAAGAATATGGGGATTTGATGCTGAAGAAGCACAACTTTCCTCATATTCGATGGAAGTGCCAACTTCAGGGCCATTATTAGCTTCAGGTAATCACGATGATGATATGGCACTCAGTACTCACTTCATGGATAGTCATAAAAAGGAAGCACAGTGGCTGTTCAGAGGTTATGCTTGTGCCATCTGCCTGGATTCGTTCACCAATAAGAAAGTTCTAGAAACTCATGTACAGGAGAGACACCATGCACCATTTGTTGA
GCAATGCATGCTTCTCCAGTGTATTCCTTGTGGCAGCCATTTTGGGAATACTGAACAATTATGGTTACATGTAGTTGCTGTTCATCCTGTTGATTTCAGATTGTCAAATTCTACTCAACAGCATAATTCTTCAGATGATGAGGATTCTCCAGTTAAACCCGAGCAGTGTAATATAGTTTCTCAGGAGAATGACAACAAGAACGTTGGTGGTTTACGAAAGTTTATTTGTAGGTTCTGTGGTTTGAAGTTTGATTTATTGCCTGATCTTGGTCGTCACCATCAAGCTGCGCACATGGGGCCAGGTTTAGTTAACTCCCGACCTGCAAAGAGGGGATTACATTATTTTGCTTATAAATTAAAATCTGGGAAACTTGGTCATCCTAGATTTAAGAAGACTCTAGCAGGTGCATCAAATAGGATCAGAAACAGAACAAAAGCAAGCATGAAAAAACATATTCAAGCTTCAAAATTACTAAGCACAGGAAGCATAAACCTTCAACCTCATGTGTCTCAGTCAGCAAGTTCTCGTAAATTGACCCAAGGTTCAACTGTTGCGAAGGCATTGGTTTCTGAGATTCAGAAAAGAAAATTATCACCTACCAATATTGATATTTTGTCTATTGCTCGCTCTGCCTGTTGCAAGGGGTTTGTTTGTCCTAAAGGATGTGAGACATTTGAGGATCCTCTTTTGCTTCCCCATGTGATGCCTCATCCACATGGTTTTGGGAACCACAAGAATGCACGCACTCCTGATACTGTAAAGGAAAAGGTCATCATCTTGTGTGAAGATATAAGCTTTGGCCAGGAATTAGTTCCTGTGGTCTGTGTAGCTGATGAAGGTCAAAGGAACTCACTTCACATACCTCTAGCTAACTCTGATGATCAAAATGCTAGATACTCCATGCCTTGGGAAAATTTTTCCTATATTAAGAAACCATTGCTTGATAAGTCCCTCGCTAGTCATACAGAGAATTTGCAGTTCGGATGTGCCTGCCCACATTCACTCTGTTCTTCTGAAACATGTGATCACGTATACCTCTTTGATAGTGATTATGAAGACCCAAAGGACATTTATGGGAATCCCATGCGTCGCAGGTTCCCATATGATGAGAATGGTCGGATGATTCTAGAGGAAGGTTACCTTGTCTATGAGTGTAATGAAAGGTGCAGCTGTAGTCAAACCTGTCCAAATAGAGTGTTGCAAAATGGAGTTCAAGTGAAACTTGAAGTCTTCATGACAGAAACAAAGGGATGGGCAGTGAGGGCTGGTGAAGCCATCCTGCGTGGCACATTTATTTGTGAGTACATTGGGGAGGTGTTGGATGAGCAGGAAGCAAACAGAAGACGTAACAGGTATAACAGTGAAGGCAACTGCTATTTCTTGGATGTGGATGCTCATATTAATGACATTAGCAGATTAATTGAAGGATCGGCTAGATATATTATTGATGCCACAAATTACGGAAATGTTTCGAGATTCATAAATCACTGTTGCTCACCAAATCTTGTAACTTACCAAGTCCTCGTAGAAAGCATGGAATATCAACGCTCACATATTGGATTGTATGCAAACCGGGATATAGCTACTGGTGAAGAGCTGACATTTGACTATCGACGCGAGCTATTGCCTGGAGGAAATCACGGCTGTGAGAACAGGGAGCATGCAGTGAAGCTCCAAAGAAGACATTTGCATGCTGTTAGGATTTTGGACCATCCATCGTTAATTGGAAATGATGAGGGGCGTGCCTGCTTGATTGCTGAGAAAGCTTTGTGGGTGCGTGTAACAGAGAGAGAGATTGCAGAAAAAAAAAAAGTTGTGCAAATTCTGAGAGACCCACATGGAGATACAATCATCAAAAGTGAGATGGAGGCAAACCGAAATTGGGTACACTTCAGTTTAAGTTTGAGTTGCAATTTGTGCATGTCGTTGTTCAGTTCAAAGCTTGGCGTCTTTGATTCAGAAAGTAGGAGGCTGTGCTCCGAGTCTTTTGTGCTTAATG
TTCCGTTTTCCCATAGTCAAACAAGCTGCAATAAATCAATAAGAATGGCTGAAATAAGGTTAACTAGAGTAGAACAAGGCCAAACCAAGATTAAAAATGTTCCAATTGCTGTTACCCCAGAAGGTTTTTGGTGCTGCCCTTCTCCTGTTGTTTTCCAAAAGACCCTCAAAGGTCAAAATGCTCTAAACAAACCGAAACCTGCCTCACCGACCCCTAAGAGCCCTGTTGAGAAGAAACCGACCCCAGTGACTGATAGGAAGCCAGCGCTTACAAGATCACGCTCGGCTGCTGTTTCTGACGATGACCGAAAATGCAATGCTGATAATTCTGGCTTTAGTGCCCCAGAGGTCGTACACAGGGTACCACGGCCTAAGATTGAAAATATGCCAAGAAAAATAGCAATTGAGTTTGGTGAGCCAGGGACAAGTAATATAAAGGTTGTTTTACTCGGGAAGCAAGGATTTTCGGTGAAGTTGAGTGTTCATAAGAACGTTTTGATGGATAATAGTACTTTTTTTGCCAATAAACTTTCTGACAAAGAAGGTTCCTCTCTGGAAATTGGTGATTGTGAAGATGTTGAAATATACGTTGAAACCGTGGGATTGATGTACTGCAAAGAAATGAAGCAATGGCTGATGAAGCAAAATGTTTCTCGTGTTCTGCGAATTCTTAAGGTTGCTGAATTCCTTGGTTTTAAATCATGCATGCAGTCTTGTCTTGAATATTTGGAAGCAGTCCCTTGGGTTGGTGACGAAGAAGAAAAGGTTGTCACATCAATCCTGCGCCTTCAAAGTGAGGGCATTGGAGTGAGCCCGGTATTGAAACGAGTGTCTGCGGATGTGTCTAAACCCCATAAAGATACTCTTTCCCATATCATCGAACTTGTTCTTAGAAGCAACGAGGAGAGAGGCCGACGTGAAATGAAATTGGTGGTACTGAGGCTGCTTAGGGAGAACCAGAGTGTCCCGAGCCATGCGAGTTCTGCTGACATTTGCAATGAAATTATTTACTCTTCTTGTAGAAGCTGTTTGGGGTCGCTATTGTTCCTGTTCCAGCAGGCTGCTGAAACTGATTTTACAGATAGATCAGTGGACAGGAAAGAACCAGTGCTGAAGCAAATTACTCTAGAGGCCGATAACCTTTCGTGGTTGCTTGAGATTTTAGCTGACAGGCAAGCAGCCGATGAGTTTGCGTTATATGCACGGTTATTTGTTGGCATTGGCAAAGGAGAGCTCTTACCAGCAAAGGATACCCGAAAGTTGCTGTTACATACATGGTTGGAGCCGTTGATCAATGATTATAGCTGGTTAAAACATGGTTGTGGGTCGTTTGATCGAAAGGTCGTGGAGGAAGGAATTGGTCGGACGATCCTCACTCTCCCTCTAGAGGATCAGCAAAACATTTTGCTGACTTGGTTGGGCAGTTTTCTGAAAGTTGGAGATAGTTGCCCAAATCTCCAAAGAGCGTTCGAGGTATGGTGGCGTCGGACCTTCATTCGACCTTACGTTGAGACAGAAGGAAGCATTCGACAGCAAGATTGCTCAATCACACCCCAGTTGGAGCCTTGA

Coding sequence (CDS)

TGGAGAGGGAAGTGGCAGGCTGGAATTAGATGTGCAAGGGCTGACTGGCCATTATCTACTTTAAAAGCCAAACCTACACATGACAGGAAGAAGTATTTTGTGGTCTTTTTTCCACACACAAGGAATTATTCTTGGGCAGATGCATTGCTTGTTCGTTCTATTGAAGAATTTCCTCAGCCTATTGCATACAAGAGCCACAAAGCTGGTTTAAAATTGGTTGAAGATGTAAAAGTTGCTAGGAGATTTATAATGAAAAAACTTGCCGTTAGCATGCTAAATATCATAGACCAATTTCACCTCGAGGCTCTGATAGAGAGTGCTCGTGATGTAATGACTTGGAAAGAGTTTGCCATGGAAGCTTCACGCTGTAATGGTTATTCCGATCTTGGAAGAATGCTCCTGAAGCTGCAGAATATGATAGTGCAGTGCTTCATAAATTCAGATTGGCTTCATCATTCTTTGCATTCTTGGGTACAACGATGTCAAAATGCTCAAACTGCAGAAATTATTGAAATGCTCAAGGAGGAATTGGCTGATGCTATTTTGTGGGACGAAGTGAACTCTCATGATGATGCACCAGTGCAGCCTACTTTTAGTTCTGTGTGGAAAACATGGAAGCATGAAGTTACAAAATGGTTTTCAATATCTCCCACCCTTCCCATTATCAGAGACAAAGAGCAGCAGACTGTTGAAGCTTTCTTAGCTACAACTCTCCAAGTTAGCAGGAAGAGGCCCAAGCTTGAAGTTCGTCGTGCAGAGGCACATGCTTCACTGGTCGAATCAAAGTGCTCAGATGTAGCTATGGCTCTTCATATTGATTCTGAATCTCACAAAGTAGAGGCAAGGAAGGTTGCTAAATCAGCAGACTCACTCAGTACCGTACCTGGTAGGTTGGGTGGGATTGTAGTTCAAACTGGAAATTCGGAGCTAGCCTTTTGCAAGGATGTGGAACTGACGCCTCTTACTGAAGTAGTAGCAGAAAAACCCTTAAATTTTGGTAATAAGAATCGACAATGCATAGCATTTATTGAATCCAAGGGAAGGCAGTGTGTTAGGTGGGCCAATGAGGGTGATGTTTACTGTTGTGTGCATTTATCCTCTCGTTTCACAGGCAACTCTGATAAGAAAGAACACACTCGTTCTGTTGAATCGCCAATGTGCCAAGGTACTACTGTTCTTGGAACTAGGTGCAAGCATCGATCTTTATTTGGCTCCTATTCTGACATTTATGGTGTAGAAGCTACTGGTTATAAAGAAATAAAGTTTGTTGGAGATGTTGGAAATCCCCTTGGAGTGGATGAGGGTGATGTGACCAATAATGGAAATAGCTCATCTGATAAGCTTGGGCATCATGGAAAAGACTCTATTGCCTCAGAGGTCCGACACTGTATTGGCTCTTGTGAACATATTGACAGCAATCCATGTTTAGAAAGCCCAAAACGTCATTCTCTATATTGTGAAAAGCATCTACCAAGCTGGCTTAAACGTGCAAGAAATGGTAAGAGTAGAGTAATATCGAAGGAAGTATTCATGGATCTTTTAAGAGACTGTATCTCACAGGAGCAAAAGATACATCTGCATCAAGCCTGTGAGCTATTTTACAGGCTTTTCAAAAGTATTTTATCACTGAGGAATCCAGTTCCTGTGGAGGTTCAATTTCAGTGGGCGCTGTCTGAAGCTTCTAAAAATTTTGGAGTTGGGGAACAGTTTATGAAATTAGTTTGTCGTGAAAAGGAAAGATTGAAAAGAATATGGGGATTTGATGCTGAAGAAGCACAACTTTCCTCATATTCGATGGAAGTGCCAACTTCAGGGCCATTATTAGCTTCAGGTAATCACGATGATGATATGGCACTCAGTACTCACTTCATGGATAGTCATAAAAAGGAAGCACAGTGGCTGTTCAGAGGTTATGCTTGTGCCATCTGCCTGGATTCGTTCACCAATAAGAAAGTTCTAGAAACTCATGTACAGGAGAGACACCATGCACCATTTGTTGA
GCAATGCATGCTTCTCCAGTGTATTCCTTGTGGCAGCCATTTTGGGAATACTGAACAATTATGGTTACATGTAGTTGCTGTTCATCCTGTTGATTTCAGATTGTCAAATTCTACTCAACAGCATAATTCTTCAGATGATGAGGATTCTCCAGTTAAACCCGAGCAGTGTAATATAGTTTCTCAGGAGAATGACAACAAGAACGTTGGTGGTTTACGAAAGTTTATTTGTAGGTTCTGTGGTTTGAAGTTTGATTTATTGCCTGATCTTGGTCGTCACCATCAAGCTGCGCACATGGGGCCAGGTTTAGTTAACTCCCGACCTGCAAAGAGGGGATTACATTATTTTGCTTATAAATTAAAATCTGGGAAACTTGGTCATCCTAGATTTAAGAAGACTCTAGCAGGTGCATCAAATAGGATCAGAAACAGAACAAAAGCAAGCATGAAAAAACATATTCAAGCTTCAAAATTACTAAGCACAGGAAGCATAAACCTTCAACCTCATGTGTCTCAGTCAGCAAGTTCTCGTAAATTGACCCAAGGTTCAACTGTTGCGAAGGCATTGGTTTCTGAGATTCAGAAAAGAAAATTATCACCTACCAATATTGATATTTTGTCTATTGCTCGCTCTGCCTGTTGCAAGGGGTTTGTTTGTCCTAAAGGATGTGAGACATTTGAGGATCCTCTTTTGCTTCCCCATGTGATGCCTCATCCACATGGTTTTGGGAACCACAAGAATGCACGCACTCCTGATACTGTAAAGGAAAAGGTCATCATCTTGTGTGAAGATATAAGCTTTGGCCAGGAATTAGTTCCTGTGGTCTGTGTAGCTGATGAAGGTCAAAGGAACTCACTTCACATACCTCTAGCTAACTCTGATGATCAAAATGCTAGATACTCCATGCCTTGGGAAAATTTTTCCTATATTAAGAAACCATTGCTTGATAAGTCCCTCGCTAGTCATACAGAGAATTTGCAGTTCGGATGTGCCTGCCCACATTCACTCTGTTCTTCTGAAACATGTGATCACGTATACCTCTTTGATAGTGATTATGAAGACCCAAAGGACATTTATGGGAATCCCATGCGTCGCAGGTTCCCATATGATGAGAATGGTCGGATGATTCTAGAGGAAGGTTACCTTGTCTATGAGTGTAATGAAAGGTGCAGCTGTAGTCAAACCTGTCCAAATAGAGTGTTGCAAAATGGAGTTCAAGTGAAACTTGAAGTCTTCATGACAGAAACAAAGGGATGGGCAGTGAGGGCTGGTGAAGCCATCCTGCGTGGCACATTTATTTGTGAGTACATTGGGGAGGTGTTGGATGAGCAGGAAGCAAACAGAAGACGTAACAGGTATAACAGTGAAGGCAACTGCTATTTCTTGGATGTGGATGCTCATATTAATGACATTAGCAGATTAATTGAAGGATCGGCTAGATATATTATTGATGCCACAAATTACGGAAATGTTTCGAGATTCATAAATCACTGTTGCTCACCAAATCTTGTAACTTACCAAGTCCTCGTAGAAAGCATGGAATATCAACGCTCACATATTGGATTGTATGCAAACCGGGATATAGCTACTGGTGAAGAGCTGACATTTGACTATCGACGCGAGCTATTGCCTGGAGGAAATCACGGCTGTGAGAACAGGGAGCATGCAGTGAAGCTCCAAAGAAGACATTTGCATGCTGTTAGGATTTTGGACCATCCATCGTTAATTGGAAATGATGAGGGGCGTGCCTGCTTGATTGCTGAGAAAGCTTTGTGGGTGCGTGTAACAGAGAGAGAGATTGCAGAAAAAAAAAAAGTTGTGCAAATTCTGAGAGACCCACATGGAGATACAATCATCAAAAGTGAGATGGAGGCAAACCGAAATTGGGTACACTTCAGTTTAAGTTTGAGTTGCAATTTGTGCATGTCGTTGTTCAGTTCAAAGCTTGGCGTCTTTGATTCAGAAAGTAGGAGGCTGTGCTCCGAGTCTTTTGTGCTTAATG
TTCCGTTTTCCCATAGTCAAACAAGCTGCAATAAATCAATAAGAATGGCTGAAATAAGGTTAACTAGAGTAGAACAAGGCCAAACCAAGATTAAAAATGTTCCAATTGCTGTTACCCCAGAAGGTTTTTGGTGCTGCCCTTCTCCTGTTGTTTTCCAAAAGACCCTCAAAGGTCAAAATGCTCTAAACAAACCGAAACCTGCCTCACCGACCCCTAAGAGCCCTGTTGAGAAGAAACCGACCCCAGTGACTGATAGGAAGCCAGCGCTTACAAGATCACGCTCGGCTGCTGTTTCTGACGATGACCGAAAATGCAATGCTGATAATTCTGGCTTTAGTGCCCCAGAGGTCGTACACAGGGTACCACGGCCTAAGATTGAAAATATGCCAAGAAAAATAGCAATTGAGTTTGGTGAGCCAGGGACAAGTAATATAAAGGTTGTTTTACTCGGGAAGCAAGGATTTTCGGTGAAGTTGAGTGTTCATAAGAACGTTTTGATGGATAATAGTACTTTTTTTGCCAATAAACTTTCTGACAAAGAAGGTTCCTCTCTGGAAATTGGTGATTGTGAAGATGTTGAAATATACGTTGAAACCGTGGGATTGATGTACTGCAAAGAAATGAAGCAATGGCTGATGAAGCAAAATGTTTCTCGTGTTCTGCGAATTCTTAAGGTTGCTGAATTCCTTGGTTTTAAATCATGCATGCAGTCTTGTCTTGAATATTTGGAAGCAGTCCCTTGGGTTGGTGACGAAGAAGAAAAGGTTGTCACATCAATCCTGCGCCTTCAAAGTGAGGGCATTGGAGTGAGCCCGGTATTGAAACGAGTGTCTGCGGATGTGTCTAAACCCCATAAAGATACTCTTTCCCATATCATCGAACTTGTTCTTAGAAGCAACGAGGAGAGAGGCCGACGTGAAATGAAATTGGTGGTACTGAGGCTGCTTAGGGAGAACCAGAGTGTCCCGAGCCATGCGAGTTCTGCTGACATTTGCAATGAAATTATTTACTCTTCTTGTAGAAGCTGTTTGGGGTCGCTATTGTTCCTGTTCCAGCAGGCTGCTGAAACTGATTTTACAGATAGATCAGTGGACAGGAAAGAACCAGTGCTGAAGCAAATTACTCTAGAGGCCGATAACCTTTCGTGGTTGCTTGAGATTTTAGCTGACAGGCAAGCAGCCGATGAGTTTGCGTTATATGCACGGTTATTTGTTGGCATTGGCAAAGGAGAGCTCTTACCAGCAAAGGATACCCGAAAGTTGCTGTTACATACATGGTTGGAGCCGTTGATCAATGATTATAGCTGGTTAAAACATGGTTGTGGGTCGTTTGATCGAAAGGTCGTGGAGGAAGGAATTGGTCGGACGATCCTCACTCTCCCTCTAGAGGATCAGCAAAACATTTTGCTGACTTGGTTGGGCAGTTTTCTGAAAGTTGGAGATAGTTGCCCAAATCTCCAAAGAGCGTTCGAGGTATGGTGGCGTCGGACCTTCATTCGACCTTACGTTGAGACAGAAGGAAGCATTCGACAGCAAGATTGCTCAATCACACCCCAGTTGGAGCCTTGA

Protein sequence

WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVFFPHTRNYSWADALLVRSIEEFPQPIAYKSHKAGLKLVEDVKVARRFIMKKLAVSMLNIIDQFHLEALIESARDVMTWKEFAMEASRCNGYSDLGRMLLKLQNMIVQCFINSDWLHHSLHSWVQRCQNAQTAEIIEMLKEELADAILWDEVNSHDDAPVQPTFSSVWKTWKHEVTKWFSISPTLPIIRDKEQQTVEAFLATTLQVSRKRPKLEVRRAEAHASLVESKCSDVAMALHIDSESHKVEARKVAKSADSLSTVPGRLGGIVVQTGNSELAFCKDVELTPLTEVVAEKPLNFGNKNRQCIAFIESKGRQCVRWANEGDVYCCVHLSSRFTGNSDKKEHTRSVESPMCQGTTVLGTRCKHRSLFGSYSDIYGVEATGYKEIKFVGDVGNPLGVDEGDVTNNGNSSSDKLGHHGKDSIASEVRHCIGSCEHIDSNPCLESPKRHSLYCEKHLPSWLKRARNGKSRVISKEVFMDLLRDCISQEQKIHLHQACELFYRLFKSILSLRNPVPVEVQFQWALSEASKNFGVGEQFMKLVCREKERLKRIWGFDAEEAQLSSYSMEVPTSGPLLASGNHDDDMALSTHFMDSHKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAPFVEQCMLLQCIPCGSHFGNTEQLWLHVVAVHPVDFRLSNSTQQHNSSDDEDSPVKPEQCNIVSQENDNKNVGGLRKFICRFCGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRGLHYFAYKLKSGKLGHPRFKKTLAGASNRIRNRTKASMKKHIQASKLLSTGSINLQPHVSQSASSRKLTQGSTVAKALVSEIQKRKLSPTNIDILSIARSACCKGFVCPKGCETFEDPLLLPHVMPHPHGFGNHKNARTPDTVKEKVIILCEDISFGQELVPVVCVADEGQRNSLHIPLANSDDQNARYSMPWENFSYIKKPLLDKSLASHTENLQFGCACPHSLCSSETCDHVYLFDSDYEDPKDIYGNPMRRRFPYDENGRMILEEGYLVYECNERCSCSQTCPNRVLQNGVQVKLEVFMTETKGWAVRAGEAILRGTFICEYIGEVLDEQEANRRRNRYNSEGNCYFLDVDAHINDISRLIEGSARYIIDATNYGNVSRFINHCCSPNLVTYQVLVESMEYQRSHIGLYANRDIATGEELTFDYRRELLPGGNHGCENREHAVKLQRRHLHAVRILDHPSLIGNDEGRACLIAEKALWVRVTEREIAEKKKVVQILRDPHGDTIIKSEMEANRNWVHFSLSLSCNLCMSLFSSKLGVFDSESRRLCSESFVLNVPFSHSQTSCNKSIRMAEIRLTRVEQGQTKIKNVPIAVTPEGFWCCPSPVVFQKTLKGQNALNKPKPASPTPKSPVEKKPTPVTDRKPALTRSRSAAVSDDDRKCNADNSGFSAPEVVHRVPRPKIENMPRKIAIEFGEPGTSNIKVVLLGKQGFSVKLSVHKNVLMDNSTFFANKLSDKEGSSLEIGDCEDVEIYVETVGLMYCKEMKQWLMKQNVSRVLRILKVAEFLGFKSCMQSCLEYLEAVPWVGDEEEKVVTSILRLQSEGIGVSPVLKRVSADVSKPHKDTLSHIIELVLRSNEERGRREMKLVVLRLLRENQSVPSHASSADICNEIIYSSCRSCLGSLLFLFQQAAETDFTDRSVDRKEPVLKQITLEADNLSWLLEILADRQAADEFALYARLFVGIGKGELLPAKDTRKLLLHTWLEPLINDYSWLKHGCGSFDRKVVEEGIGRTILTLPLEDQQNILLTWLGSFLKVGDSCPNLQRAFEVWWRRTFIRPYVETEGSIRQQDCSITPQLEP
Homology
BLAST of Sgr026330 vs. NCBI nr
Match: XP_022132628.1 (histone-lysine N-methyltransferase SUVR5 isoform X1 [Momordica charantia])

HSP 1 Score: 2211.8 bits (5730), Expect = 0.0e+00
Identity = 1114/1328 (83.89%), Positives = 1152/1328 (86.75%), Query Frame = 0

Query: 1    WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVFFPHTRNYSWADALLVRSIEEFPQP 60
            WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVFFPHTRNYSWADALLVRSIEEFPQP
Sbjct: 165  WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVFFPHTRNYSWADALLVRSIEEFPQP 224

Query: 61   IAYKSHKAGLKLVEDVKVARRFIMKKLAVSMLNIIDQFHLEALIESARDVMTWKEFAMEA 120
            IAYKSHKAGLKLVEDVKVARRFIMKKLAV MLNIIDQFHLEALIESARDVMTWKEFAMEA
Sbjct: 225  IAYKSHKAGLKLVEDVKVARRFIMKKLAVGMLNIIDQFHLEALIESARDVMTWKEFAMEA 284

Query: 121  SRCNGYSDLGRMLLKLQNMIVQCFINSDWLHHSLHSWVQRCQNAQTAEIIEMLKEELADA 180
            SRCNGYSDLGRMLLKLQNMIVQCF+NSDWL +SL+SWVQRCQNAQTAEIIEMLKEELADA
Sbjct: 285  SRCNGYSDLGRMLLKLQNMIVQCFMNSDWLQNSLNSWVQRCQNAQTAEIIEMLKEELADA 344

Query: 181  ILWDEVNSHDDAPVQPTFSSVWKTWKHEVTKWFSISPTLPIIRDKEQQTVEAFLATTLQV 240
            ILW EVNSH DAPVQPTFSSVWKTWKHEVTKWFSISP LPIIRDKEQ++VEAFLATTLQV
Sbjct: 345  ILWKEVNSHGDAPVQPTFSSVWKTWKHEVTKWFSISPILPIIRDKEQRSVEAFLATTLQV 404

Query: 241  SRKRPKLEVRRAEAHASLVESKCSDVAMALHID---------------SESHKVEARKVA 300
            SRKRPKLE+RRAEAHASLVESKCS  AMAL+ID               SE+HKVEAR+VA
Sbjct: 405  SRKRPKLEIRRAEAHASLVESKCSGDAMALNIDSGFFKSRNSLNAKLASEAHKVEAREVA 464

Query: 301  KSADSLSTVPGRLGGIVVQTGNSELAFCKDVELTPLTEVVAEKPLNFGNKNRQCIAFIES 360
             S DSLS VPGR GG  VQTGN +LA CKDVEL P T+VVAEKP N GN+NRQCIAFIES
Sbjct: 465  TSVDSLSIVPGRSGG--VQTGNLQLASCKDVELMPHTDVVAEKPFNSGNRNRQCIAFIES 524

Query: 361  KGRQCVRWANEGDVYCCVHLSSRFTGNSDKKEHTRSVESPMCQGTTVLGTRCKHRSLFGS 420
            KGRQCVRWANEGDVYCCVHLSSRFTGNSDKKE TRSVESPMCQGTTVLG+RCKHRSLFGS
Sbjct: 525  KGRQCVRWANEGDVYCCVHLSSRFTGNSDKKEQTRSVESPMCQGTTVLGSRCKHRSLFGS 584

Query: 421  --------------------------YSDIYGVEATGYKEIKFVGDVGNPLGVDEGDVTN 480
                                        DIY VEA GYKEIKF  DVGN LGVD GDVTN
Sbjct: 585  SFCKKHRPRSETKSESNSLENKLIEKQQDIYSVEAIGYKEIKF-ADVGNTLGVDNGDVTN 644

Query: 481  NGNSSSDKLGHHGKDSIASEVRHCIGSCEHIDSNPCLESPKRHSLYCEKHLPSWLKRARN 540
            NGNSSSDKL H GK+SIA+EVRHCIGSC  IDSNPCLESPKRHSLYCEKHLPSWLKRARN
Sbjct: 645  NGNSSSDKLEHRGKESIATEVRHCIGSC--IDSNPCLESPKRHSLYCEKHLPSWLKRARN 704

Query: 541  GKSRVISKEVFMDLLRDCISQEQKIHLHQACELFYRLFKSILSLRNPVPVEVQFQWALSE 600
            GKSRVISKEVFMDLLRD  SQE KIHLHQACELFYRLFKSILSLRNPVP+EVQFQWALSE
Sbjct: 705  GKSRVISKEVFMDLLRDSSSQELKIHLHQACELFYRLFKSILSLRNPVPMEVQFQWALSE 764

Query: 601  ASKNFGVGEQFMKLVCREKERLKRIWGFDAEEAQLSSYSMEVPTSGPL---------LAS 660
            ASK FGVGEQFMKLVCREKERLKRIWGFDAEEAQLSSYSMEVPTSG           + S
Sbjct: 765  ASKTFGVGEQFMKLVCREKERLKRIWGFDAEEAQLSSYSMEVPTSGNCNDDMGIRCKICS 824

Query: 661  GNHDDDMALSTHFMDSHKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAPFVEQCM 720
                DD ALSTHFMD HKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAPFVEQCM
Sbjct: 825  EEFLDDQALSTHFMDGHKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAPFVEQCM 884

Query: 721  LLQCIPCGSHFGNTEQLWLHVVAVHPVDFRLSNSTQQHNSSDDEDSPVKPEQCNIVSQEN 780
            LLQCIPCGSHFGNTEQLWLHVVAVHPVDFRLSNSTQQHNSS  EDSPVKPEQCNIVSQEN
Sbjct: 885  LLQCIPCGSHFGNTEQLWLHVVAVHPVDFRLSNSTQQHNSSAGEDSPVKPEQCNIVSQEN 944

Query: 781  DNKNVGGLRKFICRFCGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRGLHYFAYKLKSGK 840
            D KNVGGLRKFICRFCGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRGLHY+AYKLKSGK
Sbjct: 945  DKKNVGGLRKFICRFCGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRGLHYYAYKLKSGK 1004

Query: 841  LGHPRFKKTLAGASNRIRNRTKASMKKHIQASKLLSTGSINLQPHVSQSASSRKLTQGST 900
            LGHPRFKKTLAGASNR RNRTKASMKKHIQASKL STGSINLQPHV Q ASSRKLTQGST
Sbjct: 1005 LGHPRFKKTLAGASNRNRNRTKASMKKHIQASKLRSTGSINLQPHVPQLASSRKLTQGST 1064

Query: 901  VAKALVSEIQKRKLSPTNIDILSIARSACC------------------------------ 960
            VAKALVSEIQKRKLSP NIDILSIARSACC                              
Sbjct: 1065 VAKALVSEIQKRKLSPINIDILSIARSACCKVNFKVLLEQKFGVLPEYIYLKAAELCREK 1124

Query: 961  -------KGFVCPKGCETFEDPLLLPHVMPHPHGFGNHKNARTPD--------------- 1020
                   KGFVCP+GCETFEDPLLLPH+MPHP+GFG+H+NA +PD               
Sbjct: 1125 GEVNWHIKGFVCPEGCETFEDPLLLPHLMPHPNGFGDHENACSPDPVSCKWEARRCGYVI 1184

Query: 1021 -------TVKEKVIILCEDISFGQELVPVVCVADEGQRNSLHIPLANSDDQNARYSMPWE 1080
                    VKE VIILCEDISFGQELVPVVCVADEG+RNS  IP+ANSDDQNARY MPWE
Sbjct: 1185 GSHLSSQQVKENVIILCEDISFGQELVPVVCVADEGRRNSPDIPIANSDDQNARYFMPWE 1244

Query: 1081 NFSYIKKPLLDKSLASHTENLQFGCACPHSLCSSETCDHVYLFDSDYEDPKDIYGNPMRR 1140
            NF+YIKKPLLDKSLA HTE+LQFGCACP SLCSSETCDHVYLF+SDYEDPKDIYGNPMRR
Sbjct: 1245 NFTYIKKPLLDKSLAIHTESLQFGCACPQSLCSSETCDHVYLFNSDYEDPKDIYGNPMRR 1304

Query: 1141 RFPYDENGRMILEEGYLVYECNERCSCSQTCPNRVLQNGVQVKLEVFMTETKGWAVRAGE 1200
            RFPYDENGR+ILEEGYLVYECNERCSCS+TCPNRVLQNGVQVKLEVFMTETKGWAVRAGE
Sbjct: 1305 RFPYDENGRIILEEGYLVYECNERCSCSRTCPNRVLQNGVQVKLEVFMTETKGWAVRAGE 1364

Query: 1201 AILRGTFICEYIGEVLDEQEANRRRNRYNSEGNCYFLDVDAHINDISRLIEGSARYIIDA 1219
             ILRGTF+CEYIGEVLDEQEANRRR+RYN+EGNCYFLDVDAHINDISRL+EGSARYIIDA
Sbjct: 1365 PILRGTFVCEYIGEVLDEQEANRRRDRYNTEGNCYFLDVDAHINDISRLVEGSARYIIDA 1424
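
The percentages in each HSP header are the raw match counts divided by the alignment length. The following is a minimal sketch of that arithmetic, not part of the database output: the function name `hsp_percentages` and the regular expression are illustrative assumptions, and the pattern accepts both the "Positives" and "Postives" spellings of the label.

```python
import re

# Hypothetical helper: pulls the identity and positives counts out of an
# HSP statistics line. "Posi?tives" matches both spellings of the label.
HSP_STATS = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")


def hsp_percentages(header: str) -> tuple[str, str]:
    """Return (identity %, positives %) formatted to two decimals."""
    match = HSP_STATS.search(header)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    # Each percentage is matches / alignment length, scaled to 100.
    return (
        f"{100 * ident / ident_len:.2f}",
        f"{100 * pos / pos_len:.2f}",
    )


# Header of the first hit (XP_022132628.1), copied from the output above.
line = "Identity = 1114/1328 (83.89%), Positives = 1152/1328 (86.75%), Query Frame = 0"
print(hsp_percentages(line))
```

For the first hit, 1114 identical residues over an alignment length of 1328 gives 83.89%, and 1152 positive-scoring positions give 86.75%, matching the reported values.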

BLAST of Sgr026330 vs. NCBI nr
Match: XP_038881305.1 (histone-lysine N-methyltransferase SUVR5 isoform X1 [Benincasa hispida])

HSP 1 Score: 2169.4 bits (5620), Expect = 0.0e+00
Identity = 1089/1333 (81.70%), Positives = 1132/1333 (84.92%), Query Frame = 0

Query: 1    WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVFFPHTRNYSWADALLVRSIEEFPQP 60
            WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVV+FPHTRNYSWADALLVRSIEEFPQP
Sbjct: 167  WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVYFPHTRNYSWADALLVRSIEEFPQP 226

Query: 61   IAYKSHKAGLKLVEDVKVARRFIMKKLAVSMLNIIDQFHLEALIESARDVMTWKEFAMEA 120
            IAYKSHKAGLKLVEDVKVARRFIMKKL+V MLNIIDQFHLEALIESARDV TWKEFAMEA
Sbjct: 227  IAYKSHKAGLKLVEDVKVARRFIMKKLSVGMLNIIDQFHLEALIESARDVTTWKEFAMEA 286

Query: 121  SRCNGYSDLGRMLLKLQNMIVQCFINSDWLHHSLHSWVQRCQNAQTAEIIEMLKEELADA 180
            SRCNGYSDLGRML+KLQNMIVQCFINSDWL +SLHSWVQRCQNAQTAE+IEMLKEELADA
Sbjct: 287  SRCNGYSDLGRMLMKLQNMIVQCFINSDWLQNSLHSWVQRCQNAQTAEMIEMLKEELADA 346

Query: 181  ILWDEVNSHDDAPVQPTFSSVWKTWKHEVTKWFSISPTLPIIRDKEQQTVEAFLATTLQV 240
            ILWD+V SH DAPVQ TFSSVWKTWKHEVTKWFSISPTLPI RDKEQQTVEAFLAT LQV
Sbjct: 347  ILWDKVKSHGDAPVQHTFSSVWKTWKHEVTKWFSISPTLPITRDKEQQTVEAFLATALQV 406

Query: 241  SRKRPKLEVRRAEAHASLVESKCSDVAMALHID---------------SESHKVEARKVA 300
            SRKRPKLEVRRAEAHASLVESKCSD AMA+ ID               SESHK EAR++ 
Sbjct: 407  SRKRPKLEVRRAEAHASLVESKCSDQAMAVDIDSVFFNNRNSLNAKLASESHKGEAREIV 466

Query: 301  KSADSLSTVPGRLGGIVVQTGNSELAFCKDVELTPLTEVVAEKPLNFGNKNRQCIAFIES 360
             SA SLSTVP RL GIVVQTGN +LA CKDVEL P  EVVAEK L +GNKNRQCIAFIES
Sbjct: 467  TSAGSLSTVPCRLTGIVVQTGNLDLASCKDVELMPRAEVVAEKSLTYGNKNRQCIAFIES 526

Query: 361  KGRQCVRWANEGDVYCCVHLSSRFTGNSDKKEHTRSVESPMCQGTTVLGTRCKHRSLFGS 420
            KGRQCVRWANEGDVYCCVHLSSRFTGN+DKKE TRSVESPMCQGTTVLG+RCKHRSLFGS
Sbjct: 527  KGRQCVRWANEGDVYCCVHLSSRFTGNADKKEQTRSVESPMCQGTTVLGSRCKHRSLFGS 586

Query: 421  --------------------------YSDIYGVEATGYKEIKFVGDVGNPLGVDEGDVTN 480
                                        DIY VE T  KE        NPLGVDE DV N
Sbjct: 587  SFCKKHRPRSETKTESTSLGNKLIEKQQDIYSVEDTSNKE--------NPLGVDEADVIN 646

Query: 481  NGNSSSDKLGHHGKDSIASEVRHCIGSCEHIDSNPCLESPKRHSLYCEKHLPSWLKRARN 540
            NGNSSSDKL HHGKDSIASE+RHCIGSCEHIDSNPCLESPKRHSLYCEKHLPSWLKRARN
Sbjct: 647  NGNSSSDKLEHHGKDSIASELRHCIGSCEHIDSNPCLESPKRHSLYCEKHLPSWLKRARN 706

Query: 541  GKSRVISKEVFMDLLRDCISQEQKIHLHQACELFYRLFKSILSLRNPVPVEVQFQWALSE 600
            GKSRVISKEVFMDLLRDC SQE KIHLHQACELFYRLFKSILSLRNPVP+EVQFQWALSE
Sbjct: 707  GKSRVISKEVFMDLLRDCNSQEPKIHLHQACELFYRLFKSILSLRNPVPMEVQFQWALSE 766

Query: 601  ASKNFGVGEQFMKLVCREKERLKRIWGFDAEEAQLSSYSMEVPTSGPLLASGNHDDDM-- 660
            ASKN GVGEQF+KLV REKERLKRIWGFDAE+AQLSS SME  TSGPLL SGN  DDM  
Sbjct: 767  ASKNLGVGEQFLKLVGREKERLKRIWGFDAEDAQLSSPSMEAATSGPLLTSGNCGDDMSI 826

Query: 661  -------------ALSTHFMDSHKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAP 720
                         ALSTHFMD HKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAP
Sbjct: 827  RCKICSEEFLDDQALSTHFMDGHKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAP 886

Query: 721  FVEQCMLLQCIPCGSHFGNTEQLWLHVVAVHPVDFRLSNSTQQHNSSDDEDSPVKPEQCN 780
            FVEQCMLLQCIPCGSHFGNTEQLWLHVV VHP+DFRLSNS++Q NSS  EDSPVKP QCN
Sbjct: 887  FVEQCMLLQCIPCGSHFGNTEQLWLHVVTVHPIDFRLSNSSRQQNSSSGEDSPVKPTQCN 946

Query: 781  IVSQENDNKNVGGLRKFICRFCGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRGLHYFAY 840
            IVS+  DNKNVGGLRKF CRFCGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRG HY+AY
Sbjct: 947  IVSKAKDNKNVGGLRKFNCRFCGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRGFHYYAY 1006

Query: 841  KLKSGKLGHPRFKKTLAGASNRIRNRTKASMKKHIQASKLLSTGSINLQPHVSQSASSRK 900
            K KSGKLGHPRFKKT AG SNRIRNRTKASMKKHIQASKLLSTGSINLQPHVSQ ASSRK
Sbjct: 1007 KSKSGKLGHPRFKKTKAGVSNRIRNRTKASMKKHIQASKLLSTGSINLQPHVSQLASSRK 1066

Query: 901  LTQGSTVAKALVSEIQKRKLSPTNIDILSIARSACC------------------------ 960
            LTQGSTVAKALVSEIQKRKLSPTNIDILSIA+SACC                        
Sbjct: 1067 LTQGSTVAKALVSEIQKRKLSPTNIDILSIAQSACCKVNFKVLLEQKFGVLPEYFYLKAA 1126

Query: 961  -------------KGFVCPKGCETFEDPLLLPHVMPHPHGFGNHKNARTPDTV------- 1020
                         KGFVCP GCETFEDPLLL H+MPHP+ FG+++NA TPD V       
Sbjct: 1127 ELCREKGKVNWYIKGFVCPNGCETFEDPLLLAHLMPHPNSFGDNENAHTPDPVSSKWKSH 1186

Query: 1021 ---------------KEKVIILCEDISFGQELVPVVCVADEGQRNSLHIPLANSDDQNAR 1080
                           +EK ++LCEDISFGQELVPVVCVAD+ QRN  H+ LANS  QN  
Sbjct: 1187 GCSYVSGSHLSSQQFREKAVVLCEDISFGQELVPVVCVADDCQRNPHHMSLANSGAQNVG 1246

Query: 1081 YSMPWENFSYIKKPLLDKSLASHTENLQFGCACPHSLCSSETCDHVYLFDSDYEDPKDIY 1140
            YSMPWENF+YIKKPLLDKSLA HTE+LQFGCACPHSLCSSETCDHVYLF+SDYEDPKDIY
Sbjct: 1247 YSMPWENFTYIKKPLLDKSLAIHTESLQFGCACPHSLCSSETCDHVYLFNSDYEDPKDIY 1306

Query: 1141 GNPMRRRFPYDENGRMILEEGYLVYECNERCSCSQTCPNRVLQNGVQVKLEVFMTETKGW 1200
            GNPM RRFPYDENGR+ILEEGYLVYECNE CSCS+TCPNRVLQNGVQVKLEVFMTETKGW
Sbjct: 1307 GNPMLRRFPYDENGRIILEEGYLVYECNEMCSCSRTCPNRVLQNGVQVKLEVFMTETKGW 1366

Query: 1201 AVRAGEAILRGTFICEYIGEVLDEQEANRRRNRYNSEGNCYFLDVDAHINDISRLIEGSA 1219
            AVRAGEAILRGTF+CEYIGEVLDEQEANRRR RYNSEG+CYFLDVDAHINDISRL+EGSA
Sbjct: 1367 AVRAGEAILRGTFVCEYIGEVLDEQEANRRRYRYNSEGSCYFLDVDAHINDISRLVEGSA 1426

BLAST of Sgr026330 vs. NCBI nr
Match: XP_038881307.1 (histone-lysine N-methyltransferase SUVR5 isoform X2 [Benincasa hispida])

HSP 1 Score: 2163.3 bits (5604), Expect = 0.0e+00
Identity = 1088/1333 (81.62%), Positives = 1131/1333 (84.85%), Query Frame = 0

Query: 1    WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVFFPHTRNYSWADALLVRSIEEFPQP 60
            WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVV+FPHTRNYSWADALLVRSIEEFPQP
Sbjct: 167  WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVYFPHTRNYSWADALLVRSIEEFPQP 226

Query: 61   IAYKSHKAGLKLVEDVKVARRFIMKKLAVSMLNIIDQFHLEALIESARDVMTWKEFAMEA 120
            IAYKSHKAGLKLVEDVKVARRFIMKKL+V MLNIIDQFHLEALIESARDV TWKEFAMEA
Sbjct: 227  IAYKSHKAGLKLVEDVKVARRFIMKKLSVGMLNIIDQFHLEALIESARDVTTWKEFAMEA 286

Query: 121  SRCNGYSDLGRMLLKLQNMIVQCFINSDWLHHSLHSWVQRCQNAQTAEIIEMLKEELADA 180
            SRCNGYSDLGRML+KLQNMIVQCFINSDWL +SLHSWVQRCQNAQTAE+IEMLKEELADA
Sbjct: 287  SRCNGYSDLGRMLMKLQNMIVQCFINSDWLQNSLHSWVQRCQNAQTAEMIEMLKEELADA 346

Query: 181  ILWDEVNSHDDAPVQPTFSSVWKTWKHEVTKWFSISPTLPIIRDKEQQTVEAFLATTLQV 240
            ILWD+V SH DAPVQ TFSSVWKTWKHEVTKWFSISPTLPI RDKEQQTVEAFLAT LQV
Sbjct: 347  ILWDKVKSHGDAPVQHTFSSVWKTWKHEVTKWFSISPTLPITRDKEQQTVEAFLATALQV 406

Query: 241  SRKRPKLEVRRAEAHASLVESKCSDVAMALHID---------------SESHKVEARKVA 300
            SRKRPKLEVRRAEAHASLVESKCSD AMA+ ID               SESHK EAR++ 
Sbjct: 407  SRKRPKLEVRRAEAHASLVESKCSDQAMAVDIDSVFFNNRNSLNAKLASESHKGEAREIV 466

Query: 301  KSADSLSTVPGRLGGIVVQTGNSELAFCKDVELTPLTEVVAEKPLNFGNKNRQCIAFIES 360
             SA SLSTVP RL GIVVQTGN +LA CKDVEL P  EVVAEK L +GNKNRQCIAFIES
Sbjct: 467  TSAGSLSTVPCRLTGIVVQTGNLDLASCKDVELMPRAEVVAEKSLTYGNKNRQCIAFIES 526

Query: 361  KGRQCVRWANEGDVYCCVHLSSRFTGNSDKKEHTRSVESPMCQGTTVLGTRCKHRSLFGS 420
            KGRQCVRWANEGDVYCCVHLSSRFTGN+DKKE TRSVESPMCQGTTVLG+RCKHRSLFGS
Sbjct: 527  KGRQCVRWANEGDVYCCVHLSSRFTGNADKKEQTRSVESPMCQGTTVLGSRCKHRSLFGS 586

Query: 421  --------------------------YSDIYGVEATGYKEIKFVGDVGNPLGVDEGDVTN 480
                                        DIY VE T  KE        NPLGVDE DV N
Sbjct: 587  SFCKKHRPRSETKTESTSLGNKLIEKQQDIYSVEDTSNKE--------NPLGVDEADVIN 646

Query: 481  NGNSSSDKLGHHGKDSIASEVRHCIGSCEHIDSNPCLESPKRHSLYCEKHLPSWLKRARN 540
            NGNSSSDKL HHGKDSIASE+RHCIGSCEHIDSNPCLESPKRHSLYCEKHLPSWLKRARN
Sbjct: 647  NGNSSSDKLEHHGKDSIASELRHCIGSCEHIDSNPCLESPKRHSLYCEKHLPSWLKRARN 706

Query: 541  GKSRVISKEVFMDLLRDCISQEQKIHLHQACELFYRLFKSILSLRNPVPVEVQFQWALSE 600
            GKSRVISKEVFMDLLRDC SQE KIHLHQACELFYRLFKSILSLRNPVP+EVQFQWALSE
Sbjct: 707  GKSRVISKEVFMDLLRDCNSQEPKIHLHQACELFYRLFKSILSLRNPVPMEVQFQWALSE 766

Query: 601  ASKNFGVGEQFMKLVCREKERLKRIWGFDAEEAQLSSYSMEVPTSGPLLASGNHDDDM-- 660
            ASKN GVGEQF+KLV REKERLKRIWGFDAE+AQLSS SME  TSGPLL SGN  DDM  
Sbjct: 767  ASKNLGVGEQFLKLVGREKERLKRIWGFDAEDAQLSSPSMEAATSGPLLTSGNCGDDMSI 826

Query: 661  -------------ALSTHFMDSHKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAP 720
                         ALSTHFMD HKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAP
Sbjct: 827  RCKICSEEFLDDQALSTHFMDGHKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAP 886

Query: 721  FVEQCMLLQCIPCGSHFGNTEQLWLHVVAVHPVDFRLSNSTQQHNSSDDEDSPVKPEQCN 780
            FVEQCMLLQCIPCGSHFGNTEQLWLHVV VHP+DFRLSNS++Q NSS  EDSPVKP QCN
Sbjct: 887  FVEQCMLLQCIPCGSHFGNTEQLWLHVVTVHPIDFRLSNSSRQQNSSSGEDSPVKPTQCN 946

Query: 781  IVSQENDNKNVGGLRKFICRFCGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRGLHYFAY 840
            IVS+  DNKNVGGLRKF CRFCGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRG HY+AY
Sbjct: 947  IVSKAKDNKNVGGLRKFNCRFCGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRGFHYYAY 1006

Query: 841  KLKSGKLGHPRFKKTLAGASNRIRNRTKASMKKHIQASKLLSTGSINLQPHVSQSASSRK 900
            K KSGKLGHPRFKKT AG SNRIRNRTKASMKKHIQASKLLSTGSINLQPHVSQ ASSRK
Sbjct: 1007 KSKSGKLGHPRFKKTKAGVSNRIRNRTKASMKKHIQASKLLSTGSINLQPHVSQLASSRK 1066

Query: 901  LTQGSTVAKALVSEIQKRKLSPTNIDILSIARSACC------------------------ 960
            LTQGSTVAKALVSEIQKRKLSPTNIDILSIA+SACC                        
Sbjct: 1067 LTQGSTVAKALVSEIQKRKLSPTNIDILSIAQSACCKVNFKVLLEQKFGVLPEYFYLKAA 1126

Query: 961  -------------KGFVCPKGCETFEDPLLLPHVMPHPHGFGNHKNARTPDTV------- 1020
                         KGFVCP GCETFEDPLLL H+MPHP+ FG+++NA TPD V       
Sbjct: 1127 ELCREKGKVNWYIKGFVCPNGCETFEDPLLLAHLMPHPNSFGDNENAHTPDPVSSKWKSH 1186

Query: 1021 ---------------KEKVIILCEDISFGQELVPVVCVADEGQRNSLHIPLANSDDQNAR 1080
                           +EK ++LCEDISFGQELVPVVCVAD+ QRN  H+ LANS  QN  
Sbjct: 1187 GCSYVSGSHLSSQQFREKAVVLCEDISFGQELVPVVCVADDCQRNPHHMSLANSGAQNVG 1246

Query: 1081 YSMPWENFSYIKKPLLDKSLASHTENLQFGCACPHSLCSSETCDHVYLFDSDYEDPKDIY 1140
            YSMPWENF+YIKKPLLDKSLA HTE+LQFGCACPHSLCSSETCDHVYLF+SDYEDPKDIY
Sbjct: 1247 YSMPWENFTYIKKPLLDKSLAIHTESLQFGCACPHSLCSSETCDHVYLFNSDYEDPKDIY 1306

Query: 1141 GNPMRRRFPYDENGRMILEEGYLVYECNERCSCSQTCPNRVLQNGVQVKLEVFMTETKGW 1200
            GNPM RRFPYDENGR+ILEEGYLVYECNE CSCS+TCPNRVLQNGVQVKLEVFMTETKGW
Sbjct: 1307 GNPMLRRFPYDENGRIILEEGYLVYECNEMCSCSRTCPNRVLQNGVQVKLEVFMTETKGW 1366

Query: 1201 AVRAGEAILRGTFICEYIGEVLDEQEANRRRNRYNSEGNCYFLDVDAHINDISRLIEGSA 1219
            AVRAGEAILRGTF+CEYIGEVLDEQEANRR  RYNSEG+CYFLDVDAHINDISRL+EGSA
Sbjct: 1367 AVRAGEAILRGTFVCEYIGEVLDEQEANRR--RYNSEGSCYFLDVDAHINDISRLVEGSA 1426

BLAST of Sgr026330 vs. NCBI nr
Match: XP_022132629.1 (histone-lysine N-methyltransferase SUVR5 isoform X2 [Momordica charantia])

HSP 1 Score: 2162.1 bits (5601), Expect = 0.0e+00
Identity = 1089/1313 (82.94%), Positives = 1124/1313 (85.61%), Query Frame = 0

Query: 1    WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVFFPHTRNYSWADALLVRSIEEFPQP 60
            WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVFFPHTRNYSWADALLVRSIEEFPQP
Sbjct: 165  WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVFFPHTRNYSWADALLVRSIEEFPQP 224

Query: 61   IAYKSHKAGLKLVEDVKVARRFIMKKLAVSMLNIIDQFHLEALIESARDVMTWKEFAMEA 120
            IAYKSHKAGLKLVEDVKVARRFIMKKLAV MLNIIDQFHLEALIESARDVMTWKEFAMEA
Sbjct: 225  IAYKSHKAGLKLVEDVKVARRFIMKKLAVGMLNIIDQFHLEALIESARDVMTWKEFAMEA 284

Query: 121  SRCNGYSDLGRMLLKLQNMIVQCFINSDWLHHSLHSWVQRCQNAQTAEIIEMLKEELADA 180
            SRCNGYSDLGRMLLKLQNMIVQCF+NSDWL +SL+SWVQRCQNAQTAEIIEMLKEELADA
Sbjct: 285  SRCNGYSDLGRMLLKLQNMIVQCFMNSDWLQNSLNSWVQRCQNAQTAEIIEMLKEELADA 344

Query: 181  ILWDEVNSHDDAPVQPTFSSVWKTWKHEVTKWFSISPTLPIIRDKEQQTVEAFLATTLQV 240
            ILW EVNSH DAPVQPTFSSVWKTWKHEVTKWFSISP LPIIRDKEQ++VEAFLATTLQV
Sbjct: 345  ILWKEVNSHGDAPVQPTFSSVWKTWKHEVTKWFSISPILPIIRDKEQRSVEAFLATTLQV 404

Query: 241  SRKRPKLEVRRAEAHASLVESKCSDVAMALHIDSESHKVEARKVAKSADSLSTVPGRLGG 300
            SRKRPKLE+RRAEAHASLVESKCS                                  GG
Sbjct: 405  SRKRPKLEIRRAEAHASLVESKCS----------------------------------GG 464

Query: 301  IVVQTGNSELAFCKDVELTPLTEVVAEKPLNFGNKNRQCIAFIESKGRQCVRWANEGDVY 360
              VQTGN +LA CKDVEL P T+VVAEKP N GN+NRQCIAFIESKGRQCVRWANEGDVY
Sbjct: 465  --VQTGNLQLASCKDVELMPHTDVVAEKPFNSGNRNRQCIAFIESKGRQCVRWANEGDVY 524

Query: 361  CCVHLSSRFTGNSDKKEHTRSVESPMCQGTTVLGTRCKHRSLFGS--------------- 420
            CCVHLSSRFTGNSDKKE TRSVESPMCQGTTVLG+RCKHRSLFGS               
Sbjct: 525  CCVHLSSRFTGNSDKKEQTRSVESPMCQGTTVLGSRCKHRSLFGSSFCKKHRPRSETKSE 584

Query: 421  -----------YSDIYGVEATGYKEIKFVGDVGNPLGVDEGDVTNNGNSSSDKLGHHGKD 480
                         DIY VEA GYKEIKF  DVGN LGVD GDVTNNGNSSSDKL H GK+
Sbjct: 585  SNSLENKLIEKQQDIYSVEAIGYKEIKF-ADVGNTLGVDNGDVTNNGNSSSDKLEHRGKE 644

Query: 481  SIASEVRHCIGSCEHIDSNPCLESPKRHSLYCEKHLPSWLKRARNGKSRVISKEVFMDLL 540
            SIA+EVRHCIGSC  IDSNPCLESPKRHSLYCEKHLPSWLKRARNGKSRVISKEVFMDLL
Sbjct: 645  SIATEVRHCIGSC--IDSNPCLESPKRHSLYCEKHLPSWLKRARNGKSRVISKEVFMDLL 704

Query: 541  RDCISQEQKIHLHQACELFYRLFKSILSLRNPVPVEVQFQWALSEASKNFGVGEQFMKLV 600
            RD  SQE KIHLHQACELFYRLFKSILSLRNPVP+EVQFQWALSEASK FGVGEQFMKLV
Sbjct: 705  RDSSSQELKIHLHQACELFYRLFKSILSLRNPVPMEVQFQWALSEASKTFGVGEQFMKLV 764

Query: 601  CREKERLKRIWGFDAEEAQLSSYSMEVPTSGPL---------LASGNHDDDMALSTHFMD 660
            CREKERLKRIWGFDAEEAQLSSYSMEVPTSG           + S    DD ALSTHFMD
Sbjct: 765  CREKERLKRIWGFDAEEAQLSSYSMEVPTSGNCNDDMGIRCKICSEEFLDDQALSTHFMD 824

Query: 661  SHKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAPFVEQCMLLQCIPCGSHFGNTE 720
             HKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAPFVEQCMLLQCIPCGSHFGNTE
Sbjct: 825  GHKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAPFVEQCMLLQCIPCGSHFGNTE 884

Query: 721  QLWLHVVAVHPVDFRLSNSTQQHNSSDDEDSPVKPEQCNIVSQENDNKNVGGLRKFICRF 780
            QLWLHVVAVHPVDFRLSNSTQQHNSS  EDSPVKPEQCNIVSQEND KNVGGLRKFICRF
Sbjct: 885  QLWLHVVAVHPVDFRLSNSTQQHNSSAGEDSPVKPEQCNIVSQENDKKNVGGLRKFICRF 944

Query: 781  CGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRGLHYFAYKLKSGKLGHPRFKKTLAGASN 840
            CGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRGLHY+AYKLKSGKLGHPRFKKTLAGASN
Sbjct: 945  CGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRGLHYYAYKLKSGKLGHPRFKKTLAGASN 1004

Query: 841  RIRNRTKASMKKHIQASKLLSTGSINLQPHVSQSASSRKLTQGSTVAKALVSEIQKRKLS 900
            R RNRTKASMKKHIQASKL STGSINLQPHV Q ASSRKLTQGSTVAKALVSEIQKRKLS
Sbjct: 1005 RNRNRTKASMKKHIQASKLRSTGSINLQPHVPQLASSRKLTQGSTVAKALVSEIQKRKLS 1064

Query: 901  PTNIDILSIARSACC-------------------------------------KGFVCPKG 960
            P NIDILSIARSACC                                     KGFVCP+G
Sbjct: 1065 PINIDILSIARSACCKVNFKVLLEQKFGVLPEYIYLKAAELCREKGEVNWHIKGFVCPEG 1124

Query: 961  CETFEDPLLLPHVMPHPHGFGNHKNARTPD----------------------TVKEKVII 1020
            CETFEDPLLLPH+MPHP+GFG+H+NA +PD                       VKE VII
Sbjct: 1125 CETFEDPLLLPHLMPHPNGFGDHENACSPDPVSCKWEARRCGYVIGSHLSSQQVKENVII 1184

Query: 1021 LCEDISFGQELVPVVCVADEGQRNSLHIPLANSDDQNARYSMPWENFSYIKKPLLDKSLA 1080
            LCEDISFGQELVPVVCVADEG+RNS  IP+ANSDDQNARY MPWENF+YIKKPLLDKSLA
Sbjct: 1185 LCEDISFGQELVPVVCVADEGRRNSPDIPIANSDDQNARYFMPWENFTYIKKPLLDKSLA 1244

Query: 1081 SHTENLQFGCACPHSLCSSETCDHVYLFDSDYEDPKDIYGNPMRRRFPYDENGRMILEEG 1140
             HTE+LQFGCACP SLCSSETCDHVYLF+SDYEDPKDIYGNPMRRRFPYDENGR+ILEEG
Sbjct: 1245 IHTESLQFGCACPQSLCSSETCDHVYLFNSDYEDPKDIYGNPMRRRFPYDENGRIILEEG 1304

Query: 1141 YLVYECNERCSCSQTCPNRVLQNGVQVKLEVFMTETKGWAVRAGEAILRGTFICEYIGEV 1200
            YLVYECNERCSCS+TCPNRVLQNGVQVKLEVFMTETKGWAVRAGE ILRGTF+CEYIGEV
Sbjct: 1305 YLVYECNERCSCSRTCPNRVLQNGVQVKLEVFMTETKGWAVRAGEPILRGTFVCEYIGEV 1364

Query: 1201 LDEQEANRRRNRYNSEGNCYFLDVDAHINDISRLIEGSARYIIDATNYGNVSRFINHCCS 1219
            LDEQEANRRR+RYN+EGNCYFLDVDAHINDISRL+EGSARYIIDATNYGNVSRFINH CS
Sbjct: 1365 LDEQEANRRRDRYNTEGNCYFLDVDAHINDISRLVEGSARYIIDATNYGNVSRFINHSCS 1424

BLAST of Sgr026330 vs. NCBI nr
Match: XP_023517545.1 (histone-lysine N-methyltransferase SUVR5 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 2149.4 bits (5568), Expect = 0.0e+00
Identity = 1072/1331 (80.54%), Positives = 1122/1331 (84.30%), Query Frame = 0

Query: 1    WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVFFPHTRNYSWADALLVRSIEEFPQP 60
            WRGKWQAGIRCARADWPLSTLKAKPTH+RKKYFVVFFPHTRNYSWADALLVRSIEEFPQP
Sbjct: 166  WRGKWQAGIRCARADWPLSTLKAKPTHERKKYFVVFFPHTRNYSWADALLVRSIEEFPQP 225

Query: 61   IAYKSHKAGLKLVEDVKVARRFIMKKLAVSMLNIIDQFHLEALIESARDVMTWKEFAMEA 120
            IAYKSHKAGLKLVEDVKVARRFIMKKLAV MLNIIDQFHLEALIESARDVM WKEF+MEA
Sbjct: 226  IAYKSHKAGLKLVEDVKVARRFIMKKLAVGMLNIIDQFHLEALIESARDVMNWKEFSMEA 285

Query: 121  SRCNGYSDLGRMLLKLQNMIVQCFINSDWLHHSLHSWVQRCQNAQTAEIIEMLKEELADA 180
            SRCNGYSDLGRMLLKLQNMI+QCF+N DWL +SLHSWVQRCQNAQTAE+IEMLKEEL DA
Sbjct: 286  SRCNGYSDLGRMLLKLQNMILQCFVNPDWLQNSLHSWVQRCQNAQTAEVIEMLKEELTDA 345

Query: 181  ILWDEVNSHDDAPVQPTFSSVWKTWKHEVTKWFSISPTLPIIRDKEQQTVEAFLATTLQV 240
            ILWD+V SH DAPVQPTFSSVWKTWKHEVTKWFSI PTLPI RDKEQQTVEAFLAT L+V
Sbjct: 346  ILWDKVKSHGDAPVQPTFSSVWKTWKHEVTKWFSIYPTLPISRDKEQQTVEAFLATALEV 405

Query: 241  SRKRPKLEVRRAEAHASLVESKCSDVAMA---------------LHIDSESHKVEARKVA 300
            SRKRPKLE+RRAE  ASL+ESKCSD AMA                 + SESHKVE RK+ 
Sbjct: 406  SRKRPKLEIRRAETQASLMESKCSDEAMAPDNDSGFFNNQTSLNAKLGSESHKVEVRKIV 465

Query: 301  KSADSLSTVPGRLGGIVVQTGNSELAFCKDVELTPLTEVVAEKPLNFGNKNRQCIAFIES 360
             SA  LS VPGRL GIV QTG+ +LA CKDVEL P TE   EK L++GNKNRQCIAFIES
Sbjct: 466  TSAGPLSIVPGRLAGIVAQTGSLDLASCKDVELRPHTETATEKLLHYGNKNRQCIAFIES 525

Query: 361  KGRQCVRWANEGDVYCCVHLSSRFTGNSDKKEHTRSVESPMCQGTTVLGTRCKHRSLFGS 420
            KGRQCVRWANEGDVYCCVHLSSRFTGN+DKKE TR VESPMCQGTTVLG+RCKHRSLFGS
Sbjct: 526  KGRQCVRWANEGDVYCCVHLSSRFTGNNDKKEQTRFVESPMCQGTTVLGSRCKHRSLFGS 585

Query: 421  --------------------------YSDIYGVEATGYKEIKFVGDVGNPLGVDEGDVTN 480
                                        DIY VE T  KEIKF  D GNPLGVDEGDVTN
Sbjct: 586  SFCKKHRPRSETNMESTSYENKLIEKQQDIYRVEDTRNKEIKFDRDAGNPLGVDEGDVTN 645

Query: 481  NGNSSSDKLGHHGKDSIASEVRHCIGSCEHIDSNPCLESPKRHSLYCEKHLPSWLKRARN 540
            NGNSSSDKL HHGKDSIASEVRHCIGS EHIDSNPCLESPKRHSLYCEKHLPSWLKRARN
Sbjct: 646  NGNSSSDKLEHHGKDSIASEVRHCIGSSEHIDSNPCLESPKRHSLYCEKHLPSWLKRARN 705

Query: 541  GKSRVISKEVFMDLLRDCISQEQKIHLHQACELFYRLFKSILSLRNPVPVEVQFQWALSE 600
            GKSRVISKEVFMDLLRDC S+EQKIHLHQACELFYRLFKSILSLRNPVP+EVQFQWALSE
Sbjct: 706  GKSRVISKEVFMDLLRDCNSEEQKIHLHQACELFYRLFKSILSLRNPVPMEVQFQWALSE 765

Query: 601  ASKNFGVGEQFMKLVCREKERLKRIWGFDAEEAQLSSYSMEVPTSGPLLASGNHD----- 660
            ASKN GVGEQFMKLVC EKERLKR+WGFDAE AQLSS SMEVPT+GPLL SGN +     
Sbjct: 766  ASKNLGVGEQFMKLVCHEKERLKRLWGFDAEGAQLSSPSMEVPTAGPLLTSGNCNDGSSI 825

Query: 661  ----------DDMALSTHFMDSHKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAP 720
                      DD ALSTHFMD HKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAP
Sbjct: 826  RCKICSEEFLDDQALSTHFMDGHKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAP 885

Query: 721  FVEQCMLLQCIPCGSHFGNTEQLWLHVVAVHPVDFRLSNSTQQHNSSDDEDSPVKPEQCN 780
            FVEQCMLLQCIPCGSHFGNT+QLWLHVVAVHP+DFRLSNST+QHNSS  EDSPVKP++CN
Sbjct: 886  FVEQCMLLQCIPCGSHFGNTDQLWLHVVAVHPIDFRLSNSTRQHNSSSGEDSPVKPKECN 945

Query: 781  IVSQENDNKNVGGLRKFICRFCGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRGLHYFAY 840
            IVS+ NDNKNVGGLRKF CRFCGLKFDLLPDLGRHHQAAHMGPGL NSR AKRG HY+AY
Sbjct: 946  IVSKSNDNKNVGGLRKFNCRFCGLKFDLLPDLGRHHQAAHMGPGLANSRTAKRGFHYYAY 1005

Query: 841  KLKSGKLGHPRFKKTLAGASNRIRNRTKASMKKHIQASKLLSTGSINLQPHVSQSASSRK 900
            KLKSGKLGHPRFKKTLAGASNRIRNRTKASMKKHIQ SKLLS+GSINLQPH S  ASSRK
Sbjct: 1006 KLKSGKLGHPRFKKTLAGASNRIRNRTKASMKKHIQTSKLLSSGSINLQPHESHLASSRK 1065

Query: 901  LTQGSTVAKALVSEIQKRKLSPTNIDILSIARSACC------------------------ 960
            LTQGSTV+KALVSEIQK KL PTN+DILSIA SACC                        
Sbjct: 1066 LTQGSTVSKALVSEIQKIKLFPTNVDILSIAHSACCKVNFKVLLEQKFGVLPEYFYLKAA 1125

Query: 961  -----------KGFVCPKGCETFEDPLLLPHVMPHPHGFGNHKNARTPD----------- 1020
                       KGFVCPKGCET +DPLL P++MPHP+GFG HKNA TPD           
Sbjct: 1126 ELCREKVNWYIKGFVCPKGCETLKDPLLHPNLMPHPNGFGLHKNAHTPDPVSSKWEAHGC 1185

Query: 1021 -----------TVKEKVIILCEDISFGQELVPVVCVADEGQRNSLHIPLANSDDQNARYS 1080
                        VKEK +ILCEDISFGQE VPVVCVADEG RNS HI LANSD Q   YS
Sbjct: 1186 SYAIGSHLSSHQVKEKAVILCEDISFGQEFVPVVCVADEGLRNSPHISLANSDSQEVGYS 1245

Query: 1081 MPWENFSYIKKPLLDKSLASHTENLQFGCACPHSLCSSETCDHVYLFDSDYEDPKDIYGN 1140
            MPWE+F+YIKK LL+KSLA  TE+LQFGCAC HSLCSSETCDHVYLFDSDYEDPKDIYGN
Sbjct: 1246 MPWESFTYIKKSLLNKSLAIDTESLQFGCACAHSLCSSETCDHVYLFDSDYEDPKDIYGN 1305

Query: 1141 PMRRRFPYDENGRMILEEGYLVYECNERCSCSQTCPNRVLQNGVQVKLEVFMTETKGWAV 1200
            PM RRFPYDENGR+ILEEGYLVYECNERC+CS+TCPNRVLQNGV VKLEVFMTETKGW V
Sbjct: 1306 PMSRRFPYDENGRIILEEGYLVYECNERCNCSRTCPNRVLQNGVHVKLEVFMTETKGWTV 1365

Query: 1201 RAGEAILRGTFICEYIGEVLDEQEANRRRNRYNSEGNCYFLDVDAHINDISRLIEGSARY 1219
            RAGEAILRGTF+CEYIGEVL+EQEANRRR+RYN EGN YFLDVDAHINDISRLIEGSARY
Sbjct: 1366 RAGEAILRGTFVCEYIGEVLEEQEANRRRDRYNCEGNGYFLDVDAHINDISRLIEGSARY 1425

BLAST of Sgr026330 vs. ExPASy Swiss-Prot
Match: O64827 (Histone-lysine N-methyltransferase SUVR5 OS=Arabidopsis thaliana OX=3702 GN=SUVR5 PE=1 SV=3)

HSP 1 Score: 1191.8 bits (3082), Expect = 0.0e+00
Identity = 656/1328 (49.40%), Positives = 836/1328 (62.95%), Query Frame = 0

Query: 1    WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVFFPHTRNYSWADALLVRSIEEFPQP 60
            WRGKWQAGIRCA+ADWPL+TL+ KPTHDRKKY V+FFPHT+NYSWAD  LVRSI EFP P
Sbjct: 76   WRGKWQAGIRCAKADWPLTTLRGKPTHDRKKYCVIFFPHTKNYSWADMQLVRSINEFPDP 135

Query: 61   IAYKSHKAGLKLVEDVKVARRFIMKKLAVSMLNIIDQFHLEALIESARDVMTWKEFAMEA 120
            IAYKSHK GLKLV+D+  ARR+IM+KL V M NI+DQF  E + E+ARD++ WKEFAMEA
Sbjct: 136  IAYKSHKIGLKLVKDLTAARRYIMRKLTVGMFNIVDQFPSEVVSEAARDIIIWKEFAMEA 195

Query: 121  SRCNGYSDLGRMLLKLQNMIVQCFINSDWLHHSLHSWVQRCQNAQTAEIIEMLKEELADA 180
            +R   Y DLG ML+KL +MI+Q +++  WL +S   WVQ+C NA  AE IE+L EE  + 
Sbjct: 196  TRSTSYHDLGIMLVKLHSMILQRYMDPIWLENSFPLWVQKCNNAVNAESIELLNEEFDNC 255

Query: 181  ILWDEVNSHDDAPVQPTFSSVWKTWKHEVTKWFSISPTLPIIRDKEQQTVEAFLATTLQV 240
            I W+EV S  ++P+QP   S WKTWKH++ KWFSIS     + +  Q   ++   + +Q 
Sbjct: 256  IKWNEVKSLSESPMQPMLLSEWKTWKHDIAKWFSISRR--GVGEIAQPDSKSVFNSDVQA 315

Query: 241  SRKRPKLEVRRAE-AHASLVESKCSDVAMALHIDSE----SHKVEARKVAKSADSLSTVP 300
            SRKRPKLE+RRAE  +A+ +ES  S   ++  IDSE         + +  K  + +   P
Sbjct: 316  SRKRPKLEIRRAETTNATHMESDTSPQGLSA-IDSEFFSSRGNTNSPETMKEENPVMNTP 375

Query: 301  GR----LGGIVVQTGNSELAFCKDV------ELTPLTEVVAEKPLNFGNKNRQCIAFIES 360
                    GIVV+ G S+    K+       +   + E V +KP   GNK++QCIAFIES
Sbjct: 376  ENGLDLWDGIVVEAGGSQFMKTKETNGLSHPQDQHINESVLKKPFGSGNKSQQCIAFIES 435

Query: 361  KGRQCVRWANEGDVYCCVHLSSRFTGNSDKKEHTRSVESPMCQGTTVLGTRCKHRSLFG- 420
            KGRQCVRWANEGDVYCCVHL+SRFT  S K E + +VE+PMC G TVLGT+CKHRSL G 
Sbjct: 436  KGRQCVRWANEGDVYCCVHLASRFTTKSMKNEGSPAVEAPMCGGVTVLGTKCKHRSLPGF 495

Query: 421  ----SYSDIYGV----EATGYKEIKFVGDVGNPLGVDE-GDVTNNG---NSSSDKLGHHG 480
                 +    G+    +++ +   + V ++ + L  ++  D+   G     S +K   HG
Sbjct: 496  LYCKKHRPHTGMVKPDDSSSFLVKRKVSEIMSTLETNQCQDLVPFGEPEGPSFEKQEPHG 555

Query: 481  KDSIASEVRH-------CIGSCEHIDSNPCLESPKRHSLYCEKHLPSWLKRARNGKSRVI 540
              S      H       CIGSC       C E   +HSLYCE+HLP+WLKRARNGKSR+I
Sbjct: 556  ATSFTEMFEHCSQEDNLCIGSCSENSYISCSEFSTKHSLYCEQHLPNWLKRARNGKSRII 615

Query: 541  SKEVFMDLLRDCISQEQKIHLHQACELFYRLFKSILSLRNPVPVEVQFQWALSEASKN-- 600
            SKEVF+DLLR C+S+E+K+ LHQAC++FY+LFKS+LSLRN VP+EVQ  WA +EAS+N  
Sbjct: 616  SKEVFVDLLRGCLSREEKLALHQACDIFYKLFKSVLSLRNSVPMEVQIDWAKTEASRNAD 675

Query: 601  FGVGEQFMKLVCREKERLKRIWGF----DAEEAQLSSYSMEVPTSGPLLASGNHDDDMAL 660
             GVGE  MKLV  E+ERL RIWGF    D E+  LS Y         LLA  N  DD   
Sbjct: 676  AGVGEFLMKLVSNERERLTRIWGFATGADEEDVSLSEY------PNRLLAITNTCDD--- 735

Query: 661  STHFMDSHKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAPFVEQCMLLQCIPCGS 720
                 D  K+  +W F G+ACAICLDSF  +K+LE HV+ERHH  F E+CMLLQCIPCGS
Sbjct: 736  -----DDDKE--KWSFSGFACAICLDSFVRRKLLEIHVEERHHVQFAEKCMLLQCIPCGS 795

Query: 721  HFGNTEQLWLHVVAVHPVDFRLSNSTQQHNSSDDEDSPVKPE--QCNIVSQENDNKNVGG 780
            HFG+ EQL +HV AVHP + +      + N ++ E S  KPE     IV  +N N+N  G
Sbjct: 796  HFGDKEQLLVHVQAVHPSECKSLTVASECNLTNGEFSQ-KPEAGSSQIVVSQN-NENTSG 855

Query: 781  LRKFICRFCGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRGLHYFAYKLKSGKLGHP-RF 840
            + KF+C+FCGLKF+LLPDLGRHHQA HMGP LV SR  K+G+ +  Y++KSG+L  P +F
Sbjct: 856  VHKFVCKFCGLKFNLLPDLGRHHQAEHMGPSLVGSRGPKKGIRFNTYRMKSGRLSRPNKF 915

Query: 841  KKTLAGASNRIRNRTKASMKKHIQASKLLST---GSINLQPHVSQSASSRKLTQG--STV 900
            KK+L   S RIRNR   +MK+ +Q SK L T       + P +  S +   +T    S V
Sbjct: 916  KKSLGAVSYRIRNRAGVNMKRRMQGSKSLGTEGNTEAGVSPPLDDSRNFDGVTDAHCSVV 975

Query: 901  AKALVSEIQKRKLSPTNIDILSIARSACCK------------------------------ 960
            +  L+S++QK K  P N+DILS ARSACC+                              
Sbjct: 976  SDILLSKVQKAKHRPNNLDILSAARSACCRVSVETSLEAKFGDLPDRIYLKAAKLCGEQG 1035

Query: 961  --------GFVCPKGCETFEDPLLLPHVMPHPHG--FGNHKNARTPDTVKEKV------- 1020
                    G++C  GC+  +DP LL  ++P      FG   +A     ++ +V       
Sbjct: 1036 VQVQWHQEGYICSNGCKPVKDPNLLHPLIPRQENDRFGIAVDAGQHSNIELEVDECHCIM 1095

Query: 1021 -------------IILCEDISFGQELVPVVCVADEGQRNSLHIPLANSDDQNARYSMPWE 1080
                          +LC+DISFG+E VP +CV D+   NS              Y MPWE
Sbjct: 1096 EAHHFSKRPFGNTAVLCKDISFGKESVP-ICVVDDDLWNS-----------EKPYEMPWE 1155

Query: 1081 NFSYIKKPLLDKSLASHTENLQFGCACPHSLCSSETCDHVYLFDSDYEDPKDIYGNPMRR 1140
             F+Y+   +L  S+    ENLQ  C+C  S+CS  TCDHVYLF +D+ED +DIYG  MR 
Sbjct: 1156 CFTYVTNSILHPSMDLVKENLQLRCSCRSSVCSPVTCDHVYLFGNDFEDARDIYGKSMRC 1215

Query: 1141 RFPYDENGRMILEEGYLVYECNERCSCSQTCPNRVLQNGVQVKLEVFMTETKGWAVRAGE 1200
            RFPYD   R+ILEEGY VYECN+ C CS+TC NRVLQNG++ KLEVF TE+KGW +RA E
Sbjct: 1216 RFPYDGKQRIILEEGYPVYECNKFCGCSRTCQNRVLQNGIRAKLEVFRTESKGWGLRACE 1275

Query: 1201 AILRGTFICEYIGEVLDEQEANRRRNRYNSEGNCYFLDVDAHINDISRLIEGSARYIIDA 1217
             ILRGTF+CEYIGEVLD+QEAN+RRN+Y +    Y LD+DA+INDI RL+E    Y IDA
Sbjct: 1276 HILRGTFVCEYIGEVLDQQEANKRRNQYGNGDCSYILDIDANINDIGRLMEEELDYAIDA 1335

BLAST of Sgr026330 vs. ExPASy Swiss-Prot
Match: Q9SVM0 (BTB/POZ domain-containing protein At3g50780 OS=Arabidopsis thaliana OX=3702 GN=At3g50780 PE=2 SV=1)

HSP 1 Score: 595.9 bits (1535), Expect = 1.6e-168
Identity = 315/527 (59.77%), Positives = 394/527 (74.76%), Query Frame = 0

Query: 1349 MAEIRLTRVEQGQTKIKNVPIAVTPEGFWCCPSPVVFQKTLKGQNALNKPKPASPTPKSP 1408
            MAEI+  +VEQ QTKI+NVP+AVTPEGFWCCPSPV FQKTLK  N+L K K +SP  + P
Sbjct: 1    MAEIKNAKVEQRQTKIRNVPVAVTPEGFWCCPSPVAFQKTLKSHNSLTKHKQSSPALQPP 60

Query: 1409 VEKKPTPVTDRKPALTRSRSAAVSDDDRKCNADN----SGFSAPEVVHRVP-RPKIENMP 1468
               KP    ++KP+ T  RS   SD+ ++ N  +       + P  V   P R K+E +P
Sbjct: 61   ---KP----EKKPSSTTIRSVIASDETQQ-NLGSFDTVHSIAVPATVQERPQRQKVETLP 120

Query: 1469 RKIAIEFGEPGTSNIKVVLLGKQGFSVKLSVHKNVLMDNSTFFANKLSDKEG--SSLEIG 1528
            RK+AIEFGEPG+S+ KV+L+GKQGF VKLSVHK VL+D+S FFA KL++K+   + LEI 
Sbjct: 121  RKVAIEFGEPGSSDAKVILVGKQGFCVKLSVHKKVLVDHSCFFAKKLAEKDSVFACLEIE 180

Query: 1529 DCEDVEIYVETVGLMYCKEMKQWLMKQNVSRVLRILKVAEFLGFKSCMQSCLEYLEAVPW 1588
             CED E+YVET+GLMYCK+MKQ LMKQNVSRVLR+LKVAE LGF SC+QSCL+YLEAVPW
Sbjct: 181  SCEDAELYVETIGLMYCKDMKQRLMKQNVSRVLRVLKVAELLGFSSCIQSCLDYLEAVPW 240

Query: 1589 VG-DEEEKVVTSILRLQSEGIGVSPVLKRVSADVSKPHKDTLSHIIELVLRSNEERGRRE 1648
            VG +EEEKV++SILRL++EG+GV+PVLKRV+++   P K+TLS IIELVLRS EE+ RRE
Sbjct: 241  VGEEEEEKVISSILRLKTEGVGVTPVLKRVASNAVDPPKETLSRIIELVLRSKEEKSRRE 300

Query: 1649 MKLVVLRLLRENQSVPSHASSADICNEIIYSSCRSCLGSLLFLFQQAAETDFTDRSVDRK 1708
            MK +VL+LLRE       A+ AD  N+ IYSSC++CL S+L LF+QA+E        ++ 
Sbjct: 301  MKSIVLKLLREQNG----ANVADNFNDTIYSSCQTCLDSVLSLFKQASEG-------EKP 360

Query: 1709 EPVLKQITLEADNLSWLLEILADRQAADEFA--------------------------LYA 1768
            E   KQI +EADNL+WLL++LA+RQAA+EF+                          + +
Sbjct: 361  ETDTKQIAVEADNLTWLLDVLAERQAAEEFSVTWANQKELALLHEKLPLMSRYHISRVTS 420

Query: 1769 RLFVGIGKGELLPAKDTRKLLLHTWLEPLINDYSWLKHGCGSFDRKVVEEGIGRTILTLP 1828
            RLF+GIG+GELLP+KDTR LLL TWL+PL NDY+WL+HGC SFD K+VEEGIGRTILTLP
Sbjct: 421  RLFIGIGRGELLPSKDTRLLLLTTWLQPLFNDYNWLQHGCRSFDGKLVEEGIGRTILTLP 480

Query: 1829 LEDQQNILLTWLGSFLKVGDSCPNLQRAFEVWWRRTFIRPYVETEGS 1842
            LEDQQ+ILL+WLGSFL  GD CPNLQRAFEVWWRR+FIRPY + + +
Sbjct: 481  LEDQQSILLSWLGSFLNGGDGCPNLQRAFEVWWRRSFIRPYSDRQAN 508

BLAST of Sgr026330 vs. ExPASy Swiss-Prot
Match: Q9CAJ9 (BTB/POZ domain-containing protein At1g63850 OS=Arabidopsis thaliana OX=3702 GN=At1g63850 PE=1 SV=1)

HSP 1 Score: 305.8 bits (782), Expect = 3.3e-81
Identity = 170/461 (36.88%), Positives = 257/461 (55.75%), Query Frame = 0

Query: 1421 PALTRSRSAAVSDDDRKCNADNSGFSAPE----VVHRVPRPKIENMPRKIAIEFGEPGTS 1480
            P+ + S S+A +   R  N ++   SA +     + R+    +   P     +F +P +S
Sbjct: 85   PSSSSSSSSAAATAARTTNVNHLVISAQDKQALAMQRISDLLVIRSPGN---QFNDPNSS 144

Query: 1481 NIKVVLLGKQGFSVKLSVHKNVLMDNSTFFANKLSDKEGSS----------LEIGDCEDV 1540
            ++K+ L  K G S+ + VH+ +L+ +S FFA KLSD+              +EI DC+DV
Sbjct: 145  DVKLTLSSKDGISITMCVHRQILVAHSRFFAMKLSDRWSKQQLPPSSSPYIVEISDCDDV 204

Query: 1541 EIYVETVGLMYCKEMKQWLMKQNVSRVLRILKVAEFLGFKSCMQSCLEYLEAVPWVGDEE 1600
            E+Y+ET+ LMYC+++++ +M+ +VSRVL ILKV+  +GF + + SCLEYLEA PW  DEE
Sbjct: 205  EVYIETLMLMYCRDLRKKMMRHDVSRVLGILKVSAAIGFDAGVLSCLEYLEAAPWSEDEE 264

Query: 1601 EKVVTSILRLQSEGIGVSPVLKRVSADVSK--------PHKDTLSHIIELVLRSNEERGR 1660
             ++ + +  L  E +G + VL+RVS + S          + + L +++ +VL   +E+ R
Sbjct: 265  YRIASLLSELHLENVGATEVLRRVSVEASNGNNGSNGGSNDEVLLNLLHIVLEGKDEKAR 324

Query: 1661 REMKLVVLRLLRENQSVPSHASSADICNEIIYSSCRSCLGSLLFLFQQAAETDFTDRSVD 1720
            R+MK +V ++LREN      +S  D+  E +Y +C  CL  L   F QAAE+D  +    
Sbjct: 325  RDMKTLVSKMLREN------SSGNDLRKESLYLACDGCLHKLKRQFLQAAESDLEN---- 384

Query: 1721 RKEPVLKQITLEADNLSWLLEILADRQAADEFALY------------------------- 1780
                 + QI  +ADNL W+L+IL DRQ A++F +                          
Sbjct: 385  -----VDQIARQADNLHWILDILIDRQIAEDFIVMWASLSELSEVHSKVPVVHRFEISRV 444

Query: 1781 -ARLFVGIGKGELLPAKDTRKLLLHTWLEPLINDYSWLKHGCGSFDRKVVEEGIGRTILT 1834
             AR+FVGIGKG++L  K+ R LLL  WL P  +D+ W++      DR ++E+G+  TILT
Sbjct: 445  TARIFVGIGKGQILTPKEVRCLLLRNWLTPFYDDFGWMRRASKGLDRYLIEDGLSNTILT 504

BLAST of Sgr026330 vs. ExPASy Swiss-Prot
Match: Q9LVG9 (BTB/POZ domain-containing protein At5g60050 OS=Arabidopsis thaliana OX=3702 GN=At5g60050 PE=2 SV=1)

HSP 1 Score: 275.4 bits (703), Expect = 4.7e-72
Identity = 150/396 (37.88%), Positives = 229/396 (57.83%), Query Frame = 0

Query: 1474 GTSNIKVVLLGKQGFSVKLSVHKNVLMDNSTFFANKLSDKE----GSSLEIGDCEDVEIY 1533
            G  ++K+ ++GK G+ V + VH+ VL + S FF  K++ +        +EI +C+D+EIY
Sbjct: 97   GPGDVKLTVVGKDGYRVTMDVHRKVLSEKSRFFMEKMNSRREKGVSHMVEISECDDLEIY 156

Query: 1534 VETVGLMYCKEMKQWLMKQNVSRVLRILKVAEFLGFKSCMQSCLEYLEAVPWVGDEEEKV 1593
            VETV LMY  ++K+ L+ +NV ++L +LKV+  + F   + SCLE+LEAVPW  DEEE V
Sbjct: 157  VETVVLMYSDDLKKKLIGENVIKILALLKVSAAISFDEGVMSCLEHLEAVPWSEDEEETV 216

Query: 1594 VTSILRLQSEGIGVSPVLKRVSADVSKPH-----KDTLSHIIELVLRSNEERGRREMKLV 1653
            VT +  L      V+ +L+RVS+  S         D  S ++  VL++ +++ RREMK++
Sbjct: 217  VTCLEELHLPDDSVTLILQRVSSQPSTSSTRTRTDDIFSKLLTGVLQAKDDKARREMKVL 276

Query: 1654 VLRLLRENQSVPSHASSADICNEIIYSSCRSCLGSLLFLFQQAAETDFTDRSVDRKEPVL 1713
            + +L+RE        +  D+  + +Y  C  CL SL+    +   T   D   DR   ++
Sbjct: 277  IFKLVREE-------ADYDVSRDTLYGLCHRCLTSLVLCLSEVT-TQMNDPGKDR-GALM 336

Query: 1714 KQITLEADNLSWLLEILADRQAADEFA--------------------------LYARLFV 1773
             +I  EADN+ W+++IL +++   EF                           + A++ V
Sbjct: 337  GEIAREADNMLWMVDILIEKKLCSEFVKLWADQKELANLHSKIPTMYRHEISKITAQICV 396

Query: 1774 GIGKGELLPAKDTRKLLLHTWLEPLINDYSWLKH-GCGSFDRKVVEEGIGRTILTLPLED 1833
            GIGKG +L  ++TR  +L+TWLE L +D+ W++     S DRK+VE+G+ +TILTL L  
Sbjct: 397  GIGKGRILVNRETRFAVLNTWLEALYDDFGWMRRLSSRSLDRKLVEDGLSQTILTLSLRQ 456

BLAST of Sgr026330 vs. ExPASy Swiss-Prot
Match: Q9SKH2 (BTB/POZ domain-containing protein At2g13690 OS=Arabidopsis thaliana OX=3702 GN=PRL1-IFG PE=2 SV=2)

HSP 1 Score: 190.7 bits (483), Expect = 1.5e-46
Identity = 138/499 (27.66%), Positives = 241/499 (48.30%), Query Frame = 0

Query: 1381 SPVVFQKTLKGQNALNKPKPASPTPKSPVEKKPTPVTDRKPALTRSRSAAVSDDDRKCNA 1440
            SP   +  L   N ++  +  SP   SP++  PT  T ++   T+         D   N 
Sbjct: 62   SPQSSKSALNIVNRIDPRRILSPGRVSPIDSDPTVTTMQETETTQEEEDDAVVVDSTPNL 121

Query: 1441 DNSGFSAPEVVHRVPRPKIENMPRKIAIEFGEPGTS---NIKVVLLGKQGFSV-KLSVHK 1500
             +  F AP+                  IE    G S   + ++ L G+ G  V  L +  
Sbjct: 122  RSESFRAPK------------------IEVTGSGLSEGYDARLSLKGRNGGGVLVLELSL 181

Query: 1501 NVLMDNSTFFANKLSDKEGSS-------------LEIGDCEDVEIYVETVGLMY--CKEM 1560
             VL  NS  F+  +++++  S             +E+ D E++ ++ ETV LM+     +
Sbjct: 182  EVLAANSDVFSGLIAEEKKCSSSSSSLGLKNTCRIEVCDVENLGVFRETVELMFEESNVI 241

Query: 1561 KQWLMKQNVSRVLRILKVAEFLGFKSCMQSCLEYLEAVPWVGDEEEKVVTSILRLQSEGI 1620
             +  M   V R + +L+VA  + F   + SCL+YLEAVPW  DEEEK+   +     +  
Sbjct: 242  IKKFMTMGVYRAIDVLEVAAGIKFSRAVLSCLKYLEAVPWTEDEEEKLRRLLGIYSFDDD 301

Query: 1621 GVSPVLKRVSADVSKPHKDTLS-HIIELVLRSNEERGRREMKLVVLRLLRENQSVPSHAS 1680
             VS +L R +++ ++  +D+LS  ++  +   ++   R E+K +V  LL   +S      
Sbjct: 302  AVSEILARFNSNETENLQDSLSKKLVWSITSCSDVNPRNELKSLVKGLL--CKSSVYEKE 361

Query: 1681 SADICNEIIYSSCRSCLGSLLFLFQQAAETDFTDRSVDRKEPVLKQITLEADNLSWLLEI 1740
              +I  E IY + + C+ SL  LF++ + +     S  +++P+++ I+ E +N++WLLEI
Sbjct: 362  QPEINKEDIYRAGKCCVDSLAKLFEEGSSSS----SSKKEKPLIESISREVENINWLLEI 421

Query: 1741 LADRQAADEFA--------------------------LYARLFVGIGKGELLPAKDTRKL 1800
            + DR+ A+EF                           +   +F+ +GK  +    + R  
Sbjct: 422  MIDREIAEEFVEIWGKQRRLVEMHERVSPMVRYEVSRVTGAIFIAMGKRRVQCGGEARAG 481

Query: 1801 LLHTWLEPLINDYSWLKHGCGSFDRKVVEEGIGRTILTLPLEDQQNILLTWLGSFLKVGD 1834
            L+  W +P++ D+ WL+      D + VEEG+G+T+LTLP+++Q  + + W   F K G 
Sbjct: 482  LVEAWFKPMLVDFGWLQRCKKGLDMREVEEGMGQTLLTLPVKEQYQVFMEWFRWFSKHGT 536

BLAST of Sgr026330 vs. ExPASy TrEMBL
Match: A0A6J1BWT1 (histone-lysine N-methyltransferase SUVR5 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111005444 PE=4 SV=1)

HSP 1 Score: 2211.8 bits (5730), Expect = 0.0e+00
Identity = 1114/1328 (83.89%), Positives = 1152/1328 (86.75%), Query Frame = 0

Query: 1    WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVFFPHTRNYSWADALLVRSIEEFPQP 60
            WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVFFPHTRNYSWADALLVRSIEEFPQP
Sbjct: 165  WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVFFPHTRNYSWADALLVRSIEEFPQP 224

Query: 61   IAYKSHKAGLKLVEDVKVARRFIMKKLAVSMLNIIDQFHLEALIESARDVMTWKEFAMEA 120
            IAYKSHKAGLKLVEDVKVARRFIMKKLAV MLNIIDQFHLEALIESARDVMTWKEFAMEA
Sbjct: 225  IAYKSHKAGLKLVEDVKVARRFIMKKLAVGMLNIIDQFHLEALIESARDVMTWKEFAMEA 284

Query: 121  SRCNGYSDLGRMLLKLQNMIVQCFINSDWLHHSLHSWVQRCQNAQTAEIIEMLKEELADA 180
            SRCNGYSDLGRMLLKLQNMIVQCF+NSDWL +SL+SWVQRCQNAQTAEIIEMLKEELADA
Sbjct: 285  SRCNGYSDLGRMLLKLQNMIVQCFMNSDWLQNSLNSWVQRCQNAQTAEIIEMLKEELADA 344

Query: 181  ILWDEVNSHDDAPVQPTFSSVWKTWKHEVTKWFSISPTLPIIRDKEQQTVEAFLATTLQV 240
            ILW EVNSH DAPVQPTFSSVWKTWKHEVTKWFSISP LPIIRDKEQ++VEAFLATTLQV
Sbjct: 345  ILWKEVNSHGDAPVQPTFSSVWKTWKHEVTKWFSISPILPIIRDKEQRSVEAFLATTLQV 404

Query: 241  SRKRPKLEVRRAEAHASLVESKCSDVAMALHID---------------SESHKVEARKVA 300
            SRKRPKLE+RRAEAHASLVESKCS  AMAL+ID               SE+HKVEAR+VA
Sbjct: 405  SRKRPKLEIRRAEAHASLVESKCSGDAMALNIDSGFFKSRNSLNAKLASEAHKVEAREVA 464

Query: 301  KSADSLSTVPGRLGGIVVQTGNSELAFCKDVELTPLTEVVAEKPLNFGNKNRQCIAFIES 360
             S DSLS VPGR GG  VQTGN +LA CKDVEL P T+VVAEKP N GN+NRQCIAFIES
Sbjct: 465  TSVDSLSIVPGRSGG--VQTGNLQLASCKDVELMPHTDVVAEKPFNSGNRNRQCIAFIES 524

Query: 361  KGRQCVRWANEGDVYCCVHLSSRFTGNSDKKEHTRSVESPMCQGTTVLGTRCKHRSLFGS 420
            KGRQCVRWANEGDVYCCVHLSSRFTGNSDKKE TRSVESPMCQGTTVLG+RCKHRSLFGS
Sbjct: 525  KGRQCVRWANEGDVYCCVHLSSRFTGNSDKKEQTRSVESPMCQGTTVLGSRCKHRSLFGS 584

Query: 421  --------------------------YSDIYGVEATGYKEIKFVGDVGNPLGVDEGDVTN 480
                                        DIY VEA GYKEIKF  DVGN LGVD GDVTN
Sbjct: 585  SFCKKHRPRSETKSESNSLENKLIEKQQDIYSVEAIGYKEIKF-ADVGNTLGVDNGDVTN 644

Query: 481  NGNSSSDKLGHHGKDSIASEVRHCIGSCEHIDSNPCLESPKRHSLYCEKHLPSWLKRARN 540
            NGNSSSDKL H GK+SIA+EVRHCIGSC  IDSNPCLESPKRHSLYCEKHLPSWLKRARN
Sbjct: 645  NGNSSSDKLEHRGKESIATEVRHCIGSC--IDSNPCLESPKRHSLYCEKHLPSWLKRARN 704

Query: 541  GKSRVISKEVFMDLLRDCISQEQKIHLHQACELFYRLFKSILSLRNPVPVEVQFQWALSE 600
            GKSRVISKEVFMDLLRD  SQE KIHLHQACELFYRLFKSILSLRNPVP+EVQFQWALSE
Sbjct: 705  GKSRVISKEVFMDLLRDSSSQELKIHLHQACELFYRLFKSILSLRNPVPMEVQFQWALSE 764

Query: 601  ASKNFGVGEQFMKLVCREKERLKRIWGFDAEEAQLSSYSMEVPTSGPL---------LAS 660
            ASK FGVGEQFMKLVCREKERLKRIWGFDAEEAQLSSYSMEVPTSG           + S
Sbjct: 765  ASKTFGVGEQFMKLVCREKERLKRIWGFDAEEAQLSSYSMEVPTSGNCNDDMGIRCKICS 824

Query: 661  GNHDDDMALSTHFMDSHKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAPFVEQCM 720
                DD ALSTHFMD HKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAPFVEQCM
Sbjct: 825  EEFLDDQALSTHFMDGHKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAPFVEQCM 884

Query: 721  LLQCIPCGSHFGNTEQLWLHVVAVHPVDFRLSNSTQQHNSSDDEDSPVKPEQCNIVSQEN 780
            LLQCIPCGSHFGNTEQLWLHVVAVHPVDFRLSNSTQQHNSS  EDSPVKPEQCNIVSQEN
Sbjct: 885  LLQCIPCGSHFGNTEQLWLHVVAVHPVDFRLSNSTQQHNSSAGEDSPVKPEQCNIVSQEN 944

Query: 781  DNKNVGGLRKFICRFCGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRGLHYFAYKLKSGK 840
            D KNVGGLRKFICRFCGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRGLHY+AYKLKSGK
Sbjct: 945  DKKNVGGLRKFICRFCGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRGLHYYAYKLKSGK 1004

Query: 841  LGHPRFKKTLAGASNRIRNRTKASMKKHIQASKLLSTGSINLQPHVSQSASSRKLTQGST 900
            LGHPRFKKTLAGASNR RNRTKASMKKHIQASKL STGSINLQPHV Q ASSRKLTQGST
Sbjct: 1005 LGHPRFKKTLAGASNRNRNRTKASMKKHIQASKLRSTGSINLQPHVPQLASSRKLTQGST 1064

Query: 901  VAKALVSEIQKRKLSPTNIDILSIARSACC------------------------------ 960
            VAKALVSEIQKRKLSP NIDILSIARSACC                              
Sbjct: 1065 VAKALVSEIQKRKLSPINIDILSIARSACCKVNFKVLLEQKFGVLPEYIYLKAAELCREK 1124

Query: 961  -------KGFVCPKGCETFEDPLLLPHVMPHPHGFGNHKNARTPD--------------- 1020
                   KGFVCP+GCETFEDPLLLPH+MPHP+GFG+H+NA +PD               
Sbjct: 1125 GEVNWHIKGFVCPEGCETFEDPLLLPHLMPHPNGFGDHENACSPDPVSCKWEARRCGYVI 1184

Query: 1021 -------TVKEKVIILCEDISFGQELVPVVCVADEGQRNSLHIPLANSDDQNARYSMPWE 1080
                    VKE VIILCEDISFGQELVPVVCVADEG+RNS  IP+ANSDDQNARY MPWE
Sbjct: 1185 GSHLSSQQVKENVIILCEDISFGQELVPVVCVADEGRRNSPDIPIANSDDQNARYFMPWE 1244

Query: 1081 NFSYIKKPLLDKSLASHTENLQFGCACPHSLCSSETCDHVYLFDSDYEDPKDIYGNPMRR 1140
            NF+YIKKPLLDKSLA HTE+LQFGCACP SLCSSETCDHVYLF+SDYEDPKDIYGNPMRR
Sbjct: 1245 NFTYIKKPLLDKSLAIHTESLQFGCACPQSLCSSETCDHVYLFNSDYEDPKDIYGNPMRR 1304

Query: 1141 RFPYDENGRMILEEGYLVYECNERCSCSQTCPNRVLQNGVQVKLEVFMTETKGWAVRAGE 1200
            RFPYDENGR+ILEEGYLVYECNERCSCS+TCPNRVLQNGVQVKLEVFMTETKGWAVRAGE
Sbjct: 1305 RFPYDENGRIILEEGYLVYECNERCSCSRTCPNRVLQNGVQVKLEVFMTETKGWAVRAGE 1364

Query: 1201 AILRGTFICEYIGEVLDEQEANRRRNRYNSEGNCYFLDVDAHINDISRLIEGSARYIIDA 1219
             ILRGTF+CEYIGEVLDEQEANRRR+RYN+EGNCYFLDVDAHINDISRL+EGSARYIIDA
Sbjct: 1365 PILRGTFVCEYIGEVLDEQEANRRRDRYNTEGNCYFLDVDAHINDISRLVEGSARYIIDA 1424

BLAST of Sgr026330 vs. ExPASy TrEMBL
Match: A0A6J1BTN5 (histone-lysine N-methyltransferase SUVR5 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111005444 PE=4 SV=1)

HSP 1 Score: 2162.1 bits (5601), Expect = 0.0e+00
Identity = 1089/1313 (82.94%), Positives = 1124/1313 (85.61%), Query Frame = 0

Query: 1    WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVFFPHTRNYSWADALLVRSIEEFPQP 60
            WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVFFPHTRNYSWADALLVRSIEEFPQP
Sbjct: 165  WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVFFPHTRNYSWADALLVRSIEEFPQP 224

Query: 61   IAYKSHKAGLKLVEDVKVARRFIMKKLAVSMLNIIDQFHLEALIESARDVMTWKEFAMEA 120
            IAYKSHKAGLKLVEDVKVARRFIMKKLAV MLNIIDQFHLEALIESARDVMTWKEFAMEA
Sbjct: 225  IAYKSHKAGLKLVEDVKVARRFIMKKLAVGMLNIIDQFHLEALIESARDVMTWKEFAMEA 284

Query: 121  SRCNGYSDLGRMLLKLQNMIVQCFINSDWLHHSLHSWVQRCQNAQTAEIIEMLKEELADA 180
            SRCNGYSDLGRMLLKLQNMIVQCF+NSDWL +SL+SWVQRCQNAQTAEIIEMLKEELADA
Sbjct: 285  SRCNGYSDLGRMLLKLQNMIVQCFMNSDWLQNSLNSWVQRCQNAQTAEIIEMLKEELADA 344

Query: 181  ILWDEVNSHDDAPVQPTFSSVWKTWKHEVTKWFSISPTLPIIRDKEQQTVEAFLATTLQV 240
            ILW EVNSH DAPVQPTFSSVWKTWKHEVTKWFSISP LPIIRDKEQ++VEAFLATTLQV
Sbjct: 345  ILWKEVNSHGDAPVQPTFSSVWKTWKHEVTKWFSISPILPIIRDKEQRSVEAFLATTLQV 404

Query: 241  SRKRPKLEVRRAEAHASLVESKCSDVAMALHIDSESHKVEARKVAKSADSLSTVPGRLGG 300
            SRKRPKLE+RRAEAHASLVESKCS                                  GG
Sbjct: 405  SRKRPKLEIRRAEAHASLVESKCS----------------------------------GG 464

Query: 301  IVVQTGNSELAFCKDVELTPLTEVVAEKPLNFGNKNRQCIAFIESKGRQCVRWANEGDVY 360
              VQTGN +LA CKDVEL P T+VVAEKP N GN+NRQCIAFIESKGRQCVRWANEGDVY
Sbjct: 465  --VQTGNLQLASCKDVELMPHTDVVAEKPFNSGNRNRQCIAFIESKGRQCVRWANEGDVY 524

Query: 361  CCVHLSSRFTGNSDKKEHTRSVESPMCQGTTVLGTRCKHRSLFGS--------------- 420
            CCVHLSSRFTGNSDKKE TRSVESPMCQGTTVLG+RCKHRSLFGS               
Sbjct: 525  CCVHLSSRFTGNSDKKEQTRSVESPMCQGTTVLGSRCKHRSLFGSSFCKKHRPRSETKSE 584

Query: 421  -----------YSDIYGVEATGYKEIKFVGDVGNPLGVDEGDVTNNGNSSSDKLGHHGKD 480
                         DIY VEA GYKEIKF  DVGN LGVD GDVTNNGNSSSDKL H GK+
Sbjct: 585  SNSLENKLIEKQQDIYSVEAIGYKEIKF-ADVGNTLGVDNGDVTNNGNSSSDKLEHRGKE 644

Query: 481  SIASEVRHCIGSCEHIDSNPCLESPKRHSLYCEKHLPSWLKRARNGKSRVISKEVFMDLL 540
            SIA+EVRHCIGSC  IDSNPCLESPKRHSLYCEKHLPSWLKRARNGKSRVISKEVFMDLL
Sbjct: 645  SIATEVRHCIGSC--IDSNPCLESPKRHSLYCEKHLPSWLKRARNGKSRVISKEVFMDLL 704

Query: 541  RDCISQEQKIHLHQACELFYRLFKSILSLRNPVPVEVQFQWALSEASKNFGVGEQFMKLV 600
            RD  SQE KIHLHQACELFYRLFKSILSLRNPVP+EVQFQWALSEASK FGVGEQFMKLV
Sbjct: 705  RDSSSQELKIHLHQACELFYRLFKSILSLRNPVPMEVQFQWALSEASKTFGVGEQFMKLV 764

Query: 601  CREKERLKRIWGFDAEEAQLSSYSMEVPTSGPL---------LASGNHDDDMALSTHFMD 660
            CREKERLKRIWGFDAEEAQLSSYSMEVPTSG           + S    DD ALSTHFMD
Sbjct: 765  CREKERLKRIWGFDAEEAQLSSYSMEVPTSGNCNDDMGIRCKICSEEFLDDQALSTHFMD 824

Query: 661  SHKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAPFVEQCMLLQCIPCGSHFGNTE 720
             HKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAPFVEQCMLLQCIPCGSHFGNTE
Sbjct: 825  GHKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAPFVEQCMLLQCIPCGSHFGNTE 884

Query: 721  QLWLHVVAVHPVDFRLSNSTQQHNSSDDEDSPVKPEQCNIVSQENDNKNVGGLRKFICRF 780
            QLWLHVVAVHPVDFRLSNSTQQHNSS  EDSPVKPEQCNIVSQEND KNVGGLRKFICRF
Sbjct: 885  QLWLHVVAVHPVDFRLSNSTQQHNSSAGEDSPVKPEQCNIVSQENDKKNVGGLRKFICRF 944

Query: 781  CGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRGLHYFAYKLKSGKLGHPRFKKTLAGASN 840
            CGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRGLHY+AYKLKSGKLGHPRFKKTLAGASN
Sbjct: 945  CGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRGLHYYAYKLKSGKLGHPRFKKTLAGASN 1004

Query: 841  RIRNRTKASMKKHIQASKLLSTGSINLQPHVSQSASSRKLTQGSTVAKALVSEIQKRKLS 900
            R RNRTKASMKKHIQASKL STGSINLQPHV Q ASSRKLTQGSTVAKALVSEIQKRKLS
Sbjct: 1005 RNRNRTKASMKKHIQASKLRSTGSINLQPHVPQLASSRKLTQGSTVAKALVSEIQKRKLS 1064

Query: 901  PTNIDILSIARSACC-------------------------------------KGFVCPKG 960
            P NIDILSIARSACC                                     KGFVCP+G
Sbjct: 1065 PINIDILSIARSACCKVNFKVLLEQKFGVLPEYIYLKAAELCREKGEVNWHIKGFVCPEG 1124

Query: 961  CETFEDPLLLPHVMPHPHGFGNHKNARTPD----------------------TVKEKVII 1020
            CETFEDPLLLPH+MPHP+GFG+H+NA +PD                       VKE VII
Sbjct: 1125 CETFEDPLLLPHLMPHPNGFGDHENACSPDPVSCKWEARRCGYVIGSHLSSQQVKENVII 1184

Query: 1021 LCEDISFGQELVPVVCVADEGQRNSLHIPLANSDDQNARYSMPWENFSYIKKPLLDKSLA 1080
            LCEDISFGQELVPVVCVADEG+RNS  IP+ANSDDQNARY MPWENF+YIKKPLLDKSLA
Sbjct: 1185 LCEDISFGQELVPVVCVADEGRRNSPDIPIANSDDQNARYFMPWENFTYIKKPLLDKSLA 1244

Query: 1081 SHTENLQFGCACPHSLCSSETCDHVYLFDSDYEDPKDIYGNPMRRRFPYDENGRMILEEG 1140
             HTE+LQFGCACP SLCSSETCDHVYLF+SDYEDPKDIYGNPMRRRFPYDENGR+ILEEG
Sbjct: 1245 IHTESLQFGCACPQSLCSSETCDHVYLFNSDYEDPKDIYGNPMRRRFPYDENGRIILEEG 1304

Query: 1141 YLVYECNERCSCSQTCPNRVLQNGVQVKLEVFMTETKGWAVRAGEAILRGTFICEYIGEV 1200
            YLVYECNERCSCS+TCPNRVLQNGVQVKLEVFMTETKGWAVRAGE ILRGTF+CEYIGEV
Sbjct: 1305 YLVYECNERCSCSRTCPNRVLQNGVQVKLEVFMTETKGWAVRAGEPILRGTFVCEYIGEV 1364

Query: 1201 LDEQEANRRRNRYNSEGNCYFLDVDAHINDISRLIEGSARYIIDATNYGNVSRFINHCCS 1219
            LDEQEANRRR+RYN+EGNCYFLDVDAHINDISRL+EGSARYIIDATNYGNVSRFINH CS
Sbjct: 1365 LDEQEANRRRDRYNTEGNCYFLDVDAHINDISRLVEGSARYIIDATNYGNVSRFINHSCS 1424

BLAST of Sgr026330 vs. ExPASy TrEMBL
Match: A0A6J1HDR3 (histone-lysine N-methyltransferase SUVR5 OS=Cucurbita moschata OX=3662 GN=LOC111463235 PE=4 SV=1)

HSP 1 Score: 2147.5 bits (5563), Expect = 0.0e+00
Identity = 1071/1331 (80.47%), Positives = 1121/1331 (84.22%), Query Frame = 0

Query: 1    WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVFFPHTRNYSWADALLVRSIEEFPQP 60
            WRGKWQAGIRCARADWPLSTLKAKPTH+RKKYFVVFFPHTRNYSWADALLVRSIEEFPQP
Sbjct: 166  WRGKWQAGIRCARADWPLSTLKAKPTHERKKYFVVFFPHTRNYSWADALLVRSIEEFPQP 225

Query: 61   IAYKSHKAGLKLVEDVKVARRFIMKKLAVSMLNIIDQFHLEALIESARDVMTWKEFAMEA 120
            IAYKSHKAGLKLVEDVKVARRFIMKKLAV MLNIIDQFHLEALIESARDVM WKEFA+EA
Sbjct: 226  IAYKSHKAGLKLVEDVKVARRFIMKKLAVGMLNIIDQFHLEALIESARDVMNWKEFAIEA 285

Query: 121  SRCNGYSDLGRMLLKLQNMIVQCFINSDWLHHSLHSWVQRCQNAQTAEIIEMLKEELADA 180
            SRCNGYSDLGRMLLKLQNMI+QCF+N DWL +SLHSWVQRCQNAQTAE+IEMLKEELADA
Sbjct: 286  SRCNGYSDLGRMLLKLQNMILQCFVNPDWLQNSLHSWVQRCQNAQTAEVIEMLKEELADA 345

Query: 181  ILWDEVNSHDDAPVQPTFSSVWKTWKHEVTKWFSISPTLPIIRDKEQQTVEAFLATTLQV 240
            ILWD+V SH DAPVQPTFSSVWKTWKHEVTKWFSI PTLPI RDKEQQTVEAFLAT L+V
Sbjct: 346  ILWDKVKSHGDAPVQPTFSSVWKTWKHEVTKWFSIYPTLPISRDKEQQTVEAFLATALEV 405

Query: 241  SRKRPKLEVRRAEAHASLVESKCSDVAMA---------------LHIDSESHKVEARKVA 300
            SRKRPKLE+RRAE  ASL+ESKCSD AMA                 + SESHKVE RK+ 
Sbjct: 406  SRKRPKLEIRRAETQASLMESKCSDEAMAPDNDSGFFNNQTSLNAKLGSESHKVEVRKIV 465

Query: 301  KSADSLSTVPGRLGGIVVQTGNSELAFCKDVELTPLTEVVAEKPLNFGNKNRQCIAFIES 360
             SA  LS VPGRL GIV QTG+ +LA CKDVEL P TE   EK L++GNKNRQCIAFIES
Sbjct: 466  TSAGPLSIVPGRLAGIVAQTGSLDLASCKDVELRPHTETATEKLLHYGNKNRQCIAFIES 525

Query: 361  KGRQCVRWANEGDVYCCVHLSSRFTGNSDKKEHTRSVESPMCQGTTVLGTRCKHRSLFGS 420
            KGRQCVRWANEGDVYCCVHLSSRFTGN+DKKE TRSVESPMCQGTTVLG+RCKHRSLFGS
Sbjct: 526  KGRQCVRWANEGDVYCCVHLSSRFTGNNDKKEQTRSVESPMCQGTTVLGSRCKHRSLFGS 585

Query: 421  --------------------------YSDIYGVEATGYKEIKFVGDVGNPLGVDEGDVTN 480
                                        DIY VE T  KEIKF  D GNPLGVDEGDVTN
Sbjct: 586  SFCKKHRPRSETNMESTSYENKLIEKQQDIYRVEDTRNKEIKFDRDAGNPLGVDEGDVTN 645

Query: 481  NGNSSSDKLGHHGKDSIASEVRHCIGSCEHIDSNPCLESPKRHSLYCEKHLPSWLKRARN 540
            NGNSSSDKL HHGKDSIASEVRHCIGS EHIDSNPCLESPKRHSLYCEKHLPSWLKRARN
Sbjct: 646  NGNSSSDKLEHHGKDSIASEVRHCIGSSEHIDSNPCLESPKRHSLYCEKHLPSWLKRARN 705

Query: 541  GKSRVISKEVFMDLLRDCISQEQKIHLHQACELFYRLFKSILSLRNPVPVEVQFQWALSE 600
            GKSRVISKEVFMDLLRDC S+EQKIHLHQACELFYRLFKSILSLRNPVP+EVQFQWALSE
Sbjct: 706  GKSRVISKEVFMDLLRDCNSEEQKIHLHQACELFYRLFKSILSLRNPVPMEVQFQWALSE 765

Query: 601  ASKNFGVGEQFMKLVCREKERLKRIWGFDAEEAQLSSYSMEVPTSGPLLASGNHD----- 660
            ASKN GVGEQFMKLVC EKERLKR+WGFDAE AQLSS SMEVPT+GPLL SGN +     
Sbjct: 766  ASKNLGVGEQFMKLVCHEKERLKRLWGFDAEGAQLSSPSMEVPTAGPLLTSGNCNDGSSI 825

Query: 661  ----------DDMALSTHFMDSHKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAP 720
                      DD ALSTHFMD HKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAP
Sbjct: 826  RCKICSEEFLDDQALSTHFMDGHKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAP 885

Query: 721  FVEQCMLLQCIPCGSHFGNTEQLWLHVVAVHPVDFRLSNSTQQHNSSDDEDSPVKPEQCN 780
            FVEQCMLLQCIPCGSHFGNT+QLWLHVVAVHP+DFRLSNST+QHNSS  EDSPVKP++CN
Sbjct: 886  FVEQCMLLQCIPCGSHFGNTDQLWLHVVAVHPIDFRLSNSTRQHNSSSGEDSPVKPKECN 945

Query: 781  IVSQENDNKNVGGLRKFICRFCGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRGLHYFAY 840
            IVS+ NDNKNVGGLRKF CRFCGLKFDLLPDLGRHHQAAHMGPGL NSR AKRG HY+AY
Sbjct: 946  IVSKSNDNKNVGGLRKFNCRFCGLKFDLLPDLGRHHQAAHMGPGLANSRTAKRGFHYYAY 1005

Query: 841  KLKSGKLGHPRFKKTLAGASNRIRNRTKASMKKHIQASKLLSTGSINLQPHVSQSASSRK 900
            KLKSGKLGHPRFKKTLAGASNRIRNRTKASMKKHIQ SKLLSTGSINLQPH S  ASSRK
Sbjct: 1006 KLKSGKLGHPRFKKTLAGASNRIRNRTKASMKKHIQTSKLLSTGSINLQPHESHLASSRK 1065

Query: 901  LTQGSTVAKALVSEIQKRKLSPTNIDILSIARSACC------------------------ 960
            LTQGSTV+KALVSEIQK KL PTN+DILSIA SACC                        
Sbjct: 1066 LTQGSTVSKALVSEIQKIKLFPTNVDILSIAHSACCKVNFKVLLEQKFGVLPEYFYLKAA 1125

Query: 961  -----------KGFVCPKGCETFEDPLLLPHVMPHPHGFGNHKNARTPDTV--------- 1020
                       KGFVCPKGCET +DPLL P++M HP+GFG HKNA T D V         
Sbjct: 1126 ELCREKVNWYIKGFVCPKGCETLKDPLLTPNLMSHPNGFGGHKNAHTSDPVSSKWEAHGC 1185

Query: 1021 -------------KEKVIILCEDISFGQELVPVVCVADEGQRNSLHIPLANSDDQNARYS 1080
                         KEK +ILCEDISFGQE VPVVCVADEG RNS HI LANSD Q   YS
Sbjct: 1186 SYAIGSHLSSQQLKEKAVILCEDISFGQEFVPVVCVADEGLRNSPHISLANSDSQEVGYS 1245

Query: 1081 MPWENFSYIKKPLLDKSLASHTENLQFGCACPHSLCSSETCDHVYLFDSDYEDPKDIYGN 1140
            MPWE+F+YIKK LL+KSLA  TE+LQFGCAC HSLCSSETCDHVYLFDSDYEDPKDIYGN
Sbjct: 1246 MPWESFTYIKKSLLNKSLAIDTESLQFGCACAHSLCSSETCDHVYLFDSDYEDPKDIYGN 1305

Query: 1141 PMRRRFPYDENGRMILEEGYLVYECNERCSCSQTCPNRVLQNGVQVKLEVFMTETKGWAV 1200
            PM RRFPYDENGR+ILEEGYLVYECNERC+CS+TCPNRVLQNGV VKLEVF+TETKGW V
Sbjct: 1306 PMSRRFPYDENGRIILEEGYLVYECNERCNCSRTCPNRVLQNGVHVKLEVFLTETKGWTV 1365

Query: 1201 RAGEAILRGTFICEYIGEVLDEQEANRRRNRYNSEGNCYFLDVDAHINDISRLIEGSARY 1219
            RAGE ILRGTF+CEYIGEVL+EQEANRRR+RYN EGN YFLDVDAHINDISRLIEGSARY
Sbjct: 1366 RAGEVILRGTFVCEYIGEVLEEQEANRRRDRYNCEGNGYFLDVDAHINDISRLIEGSARY 1425

BLAST of Sgr026330 vs. ExPASy TrEMBL
Match: A0A6J1KT71 (histone-lysine N-methyltransferase SUVR5 OS=Cucurbita maxima OX=3661 GN=LOC111497014 PE=4 SV=1)

HSP 1 Score: 2143.6 bits (5553), Expect = 0.0e+00
Identity = 1070/1331 (80.39%), Positives = 1120/1331 (84.15%), Query Frame = 0

Query: 1    WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVFFPHTRNYSWADALLVRSIEEFPQP 60
            WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVFFPHTRNYSWADALLVRSIEEFPQP
Sbjct: 166  WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVFFPHTRNYSWADALLVRSIEEFPQP 225

Query: 61   IAYKSHKAGLKLVEDVKVARRFIMKKLAVSMLNIIDQFHLEALIESARDVMTWKEFAMEA 120
            IAYKSHKAGLKLVED+KVARRFIMKKLAV MLNI+DQFHLEALIESARDVM WKEFAMEA
Sbjct: 226  IAYKSHKAGLKLVEDIKVARRFIMKKLAVGMLNIMDQFHLEALIESARDVMNWKEFAMEA 285

Query: 121  SRCNGYSDLGRMLLKLQNMIVQCFINSDWLHHSLHSWVQRCQNAQTAEIIEMLKEELADA 180
            SRCNGYSDLGRMLLKLQNMI+ CF+N DWL +SLHSWVQRCQNAQTAE+IEMLKEELADA
Sbjct: 286  SRCNGYSDLGRMLLKLQNMILXCFVNPDWLQNSLHSWVQRCQNAQTAEVIEMLKEELADA 345

Query: 181  ILWDEVNSHDDAPVQPTFSSVWKTWKHEVTKWFSISPTLPIIRDKEQQTVEAFLATTLQV 240
            ILWD+V SH DAPVQPTFSSVWKTWKHEVTKWFSI PTLPI RDKEQQTVEAFLAT L+V
Sbjct: 346  ILWDKVKSHCDAPVQPTFSSVWKTWKHEVTKWFSIYPTLPISRDKEQQTVEAFLATALEV 405

Query: 241  SRKRPKLEVRRAEAHASLVESKCSDVAMA---------------LHIDSESHKVEARKVA 300
            SRKRPKLE+RRAE  ASL+ESKCSD AMA                 + SESHKVE R++ 
Sbjct: 406  SRKRPKLEIRRAETQASLMESKCSDEAMAPDNDSGFFNNQTSLNAKLGSESHKVEVREIV 465

Query: 301  KSADSLSTVPGRLGGIVVQTGNSELAFCKDVELTPLTEVVAEKPLNFGNKNRQCIAFIES 360
             SA  LS VPGRL GIV QTG+ +LA CKDVEL P TE   EK L++GNKNRQCIAFIES
Sbjct: 466  TSAGPLSIVPGRLAGIVAQTGSLDLASCKDVELKPHTETATEKLLHYGNKNRQCIAFIES 525

Query: 361  KGRQCVRWANEGDVYCCVHLSSRFTGNSDKKEHTRSVESPMCQGTTVLGTRCKHRSLFGS 420
            KGRQCVRWANEGDVYCCVHLSSRFTGN+DKKE TRSVESPMCQGTTVLG+RCKHRSLFGS
Sbjct: 526  KGRQCVRWANEGDVYCCVHLSSRFTGNNDKKEQTRSVESPMCQGTTVLGSRCKHRSLFGS 585

Query: 421  --------------------------YSDIYGVEATGYKEIKFVGDVGNPLGVDEGDVTN 480
                                        DIY VE T  KEIKF  D GNPLGVDEGDVTN
Sbjct: 586  SFCKKHRPRSETNMESTSYENKLIEKQQDIYRVEDTRNKEIKFDRDAGNPLGVDEGDVTN 645

Query: 481  NGNSSSDKLGHHGKDSIASEVRHCIGSCEHIDSNPCLESPKRHSLYCEKHLPSWLKRARN 540
            NGNSSSDKL HHGKDSIASEVRHCIGS EHIDSNPCLESPKRHSLYCEKHLPSWLKRARN
Sbjct: 646  NGNSSSDKLEHHGKDSIASEVRHCIGSSEHIDSNPCLESPKRHSLYCEKHLPSWLKRARN 705

Query: 541  GKSRVISKEVFMDLLRDCISQEQKIHLHQACELFYRLFKSILSLRNPVPVEVQFQWALSE 600
            GKSRVISKEVFMDLLRDC S+EQKI+LHQACELFYRLFKSILSLRNPVP+EVQFQWALSE
Sbjct: 706  GKSRVISKEVFMDLLRDCNSEEQKINLHQACELFYRLFKSILSLRNPVPMEVQFQWALSE 765

Query: 601  ASKNFGVGEQFMKLVCREKERLKRIWGFDAEEAQLSSYSMEVPTSGPLLASGNHD----- 660
            ASKN GVGEQFMKLVC EKERLKR+WGFDAE AQLSS SMEVPT+GPLL SGN +     
Sbjct: 766  ASKNLGVGEQFMKLVCHEKERLKRLWGFDAEGAQLSSPSMEVPTAGPLLTSGNCNDGSSI 825

Query: 661  ----------DDMALSTHFMDSHKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAP 720
                      DD ALSTHFMD HKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAP
Sbjct: 826  RCKICSEEFLDDQALSTHFMDGHKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAP 885

Query: 721  FVEQCMLLQCIPCGSHFGNTEQLWLHVVAVHPVDFRLSNSTQQHNSSDDEDSPVKPEQCN 780
            FVEQCMLLQCIPCGSHFGNTEQLWLHVVAVHP+DFRLSNST+QHN S  EDSPVKP++CN
Sbjct: 886  FVEQCMLLQCIPCGSHFGNTEQLWLHVVAVHPIDFRLSNSTRQHNFSSGEDSPVKPKECN 945

Query: 781  IVSQENDNKNVGGLRKFICRFCGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRGLHYFAY 840
            IVS+ NDNKNVGGLRKF CRFCGLKFDLLPDLGRHHQAAHMGPGL NSR AKRG HY+AY
Sbjct: 946  IVSKSNDNKNVGGLRKFNCRFCGLKFDLLPDLGRHHQAAHMGPGLANSRTAKRGFHYYAY 1005

Query: 841  KLKSGKLGHPRFKKTLAGASNRIRNRTKASMKKHIQASKLLSTGSINLQPHVSQSASSRK 900
            KLKSGKLGHPRFKKTLAGASNRIRNRTKASMKKHIQ SKLLSTGSINLQPH S  ASSRK
Sbjct: 1006 KLKSGKLGHPRFKKTLAGASNRIRNRTKASMKKHIQTSKLLSTGSINLQPHESHLASSRK 1065

Query: 901  LTQGSTVAKALVSEIQKRKLSPTNIDILSIARSACC------------------------ 960
            LTQGSTV+KALVSEIQK KL PTN+DILSIA SACC                        
Sbjct: 1066 LTQGSTVSKALVSEIQKIKLFPTNVDILSIAHSACCKVNFKVLLEQKFGVLPEYFYLKAA 1125

Query: 961  -----------KGFVCPKGCETFEDPLLLPHVMPHPHGFGNHKNARTPDTV--------- 1020
                       KGFVCPKGCET +DPLL P++MPHP+GFG HKNA TP  V         
Sbjct: 1126 ELCREKVNWYIKGFVCPKGCETLKDPLLHPNLMPHPNGFGGHKNAHTPGPVSSKWEGHGC 1185

Query: 1021 -------------KEKVIILCEDISFGQELVPVVCVADEGQRNSLHIPLANSDDQNARYS 1080
                         KE  +ILCEDISFGQE VPVVCVADEG RNS HI LANSD Q   YS
Sbjct: 1186 SYAIGSHLSSQQLKETAVILCEDISFGQEFVPVVCVADEGLRNSPHISLANSDSQEVGYS 1245

Query: 1081 MPWENFSYIKKPLLDKSLASHTENLQFGCACPHSLCSSETCDHVYLFDSDYEDPKDIYGN 1140
            MPWE+F+YIKK LL+KSLA  TE+LQFGCAC HSLCSSETCDHVYLFDSDYEDPKDIYGN
Sbjct: 1246 MPWESFTYIKKSLLNKSLAIDTESLQFGCACAHSLCSSETCDHVYLFDSDYEDPKDIYGN 1305

Query: 1141 PMRRRFPYDENGRMILEEGYLVYECNERCSCSQTCPNRVLQNGVQVKLEVFMTETKGWAV 1200
            PM RRFPYDENGR+ILEEGYLVYECNERC+CS+TCPNRVLQNGV VKLEVFMTETKGW V
Sbjct: 1306 PMSRRFPYDENGRIILEEGYLVYECNERCNCSRTCPNRVLQNGVHVKLEVFMTETKGWTV 1365

Query: 1201 RAGEAILRGTFICEYIGEVLDEQEANRRRNRYNSEGNCYFLDVDAHINDISRLIEGSARY 1219
            RAGEAILRGTF+CEYIGEVL+EQEANRRR+RYN EGN YFLDVDAHINDISRLIEGSARY
Sbjct: 1366 RAGEAILRGTFVCEYIGEVLEEQEANRRRDRYNCEGNGYFLDVDAHINDISRLIEGSARY 1425

BLAST of Sgr026330 vs. ExPASy TrEMBL
Match: A0A5D3CM86 (Histone-lysine N-methyltransferase SUVR5 isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G005360 PE=4 SV=1)

HSP 1 Score: 2108.2 bits (5461), Expect = 0.0e+00
Identity = 1063/1331 (79.86%), Positives = 1114/1331 (83.70%), Query Frame = 0

Query: 1    WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVFFPHTRNYSWADALLVRSIEEFPQP 60
            WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVFFPHTRNYSWADA LVRSIEEFPQP
Sbjct: 172  WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVFFPHTRNYSWADAFLVRSIEEFPQP 231

Query: 61   IAYKSHKAGLKLVEDVKVARRFIMKKLAVSMLNIIDQFHLEALIESARDVMTWKEFAMEA 120
            IAYKSHKAGLKLVEDVKVARRFIMKKL+V MLNIIDQFHLEALIESARDV+TWKEFAMEA
Sbjct: 232  IAYKSHKAGLKLVEDVKVARRFIMKKLSVGMLNIIDQFHLEALIESARDVVTWKEFAMEA 291

Query: 121  SRCNGYSDLGRMLLKLQNMIVQCFINSDWLHHSLHSWVQRCQNAQTAEIIEMLKEELADA 180
            SRCNGYSDLGRML+KLQNMIVQCFINSDWL +SLHSWVQ CQNAQTAEIIEMLKEELADA
Sbjct: 292  SRCNGYSDLGRMLIKLQNMIVQCFINSDWLQNSLHSWVQGCQNAQTAEIIEMLKEELADA 351

Query: 181  ILWDEVNSHDDAPVQPTFSSVWKTWKHEVTKWFSISPTLPIIRDKEQQTVEAFLATTLQV 240
            ILWD+V SH DAPVQPTFSSVWKTWKHEVTKWFSISPTLPI +DKEQQTVEAFLAT LQV
Sbjct: 352  ILWDKVKSHGDAPVQPTFSSVWKTWKHEVTKWFSISPTLPITKDKEQQTVEAFLATALQV 411

Query: 241  SRKRPKLEVRRAEAHASLVESKCSDVAMALHID---------------SESHKVEARKVA 300
            SRKRPKLEVRRAEAHASLVESKCSD AMAL ID               SESHK EAR++A
Sbjct: 412  SRKRPKLEVRRAEAHASLVESKCSDQAMALDIDSGFFNNQNSLNAKLASESHKGEAREIA 471

Query: 301  KSADSLSTVPGRLGGIVVQTGNSELAFCKDVELTPLTEVVAEKPLNFGNKNRQCIAFIES 360
             SA SL+T+PGRL GI+ QTGN +LA CKDVEL P  EV AEK L +GNKNRQCIAFIES
Sbjct: 472  TSAGSLNTIPGRLTGIIAQTGNLDLASCKDVELMPRAEVAAEKSLTYGNKNRQCIAFIES 531

Query: 361  KGRQCVRWANEGDVYCCVHLSSRFTGNSDKKEHTRSVESPMCQGTTVLGTRCKHRSLFGS 420
            KGRQCVRWANEGDVYCCVHLSSRFTGNSDKKE TRSVESPMCQGTTVLG+RCKHRSLFGS
Sbjct: 532  KGRQCVRWANEGDVYCCVHLSSRFTGNSDKKEQTRSVESPMCQGTTVLGSRCKHRSLFGS 591

Query: 421  --------------------------YSDIYGVEATGYKEIKFVGDVGNPLGVDEGDVTN 480
                                        DIY VE    KE        NPLG+DEGDVTN
Sbjct: 592  SFCKKHRPRGETKTESTSVGNKLIEKQHDIYSVEDASNKE--------NPLGLDEGDVTN 651

Query: 481  NGNSSSDKLGHHGKDSIASEVRHCIGSCEHIDSNPCLESPKRHSLYCEKHLPSWLKRARN 540
            NGNSSSDKL HHGKDSIASE+RHCIGSCEHIDSNPCLESPKRHSLYCEKHLPSWLKRARN
Sbjct: 652  NGNSSSDKLEHHGKDSIASELRHCIGSCEHIDSNPCLESPKRHSLYCEKHLPSWLKRARN 711

Query: 541  GKSRVISKEVFMDLLRDCISQEQKIHLHQACELFYRLFKSILSLRNPVPVEVQFQWALSE 600
            GKSRVISKEVFMDLLRDC SQE KIHLHQACELFYRLFKSILSLRNPVP+EVQFQWALSE
Sbjct: 712  GKSRVISKEVFMDLLRDCNSQEPKIHLHQACELFYRLFKSILSLRNPVPMEVQFQWALSE 771

Query: 601  ASKNFGVGEQFMKLVCREKERLKRIWGFDAEEAQLSSYSMEVPTSGPLLASGNHDDDM-- 660
            ASKN GVGEQF+KLVCREKERLKRIWGFDAE+AQLSS SM   TSG LL SGN  DDM  
Sbjct: 772  ASKNLGVGEQFLKLVCREKERLKRIWGFDAEDAQLSSPSMGAATSGALLTSGNCGDDMSI 831

Query: 661  -------------ALSTHFMDSHKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAP 720
                         ALS HFMD HKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAP
Sbjct: 832  RCKICSEEFLDDQALSAHFMDGHKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAP 891

Query: 721  FVEQCMLLQCIPCGSHFGNTEQLWLHVVAVHPVDFRLSNSTQQHNSSDDEDSPVKPEQCN 780
            FVEQCMLLQCIPCGSHFGN+EQLWLHVVAVHPVDFRLSNS+++ NSS  EDSPVKP+QC 
Sbjct: 892  FVEQCMLLQCIPCGSHFGNSEQLWLHVVAVHPVDFRLSNSSRRQNSSSGEDSPVKPKQCK 951

Query: 781  IVSQENDNKNVGGLRKFICRFCGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRGLHYFAY 840
            IVS+ENDNKNVGGLRKF CRFCGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRG HY++Y
Sbjct: 952  IVSKENDNKNVGGLRKFNCRFCGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRGFHYYSY 1011

Query: 841  KLKSGKLGHPRFKKTLAGASNRIRNRTKASMKKHIQASKLLSTGSINLQPHVSQSASSRK 900
            K KSGKLGHPRFKKT AG SNRIRNRTKASMKKHIQASKLLSTGS++LQPHVSQ ASSRK
Sbjct: 1012 KSKSGKLGHPRFKKTKAGVSNRIRNRTKASMKKHIQASKLLSTGSVDLQPHVSQLASSRK 1071

Query: 901  LTQGSTVAKALVSEIQKRKLSPTNIDILSIARSACC------------------------ 960
            LTQGS VAKA VSEIQKRKLSPTNIDILSIA SACC                        
Sbjct: 1072 LTQGSIVAKAFVSEIQKRKLSPTNIDILSIASSACCKVNFKVLLEQKFGVLPEYFYLKAA 1131

Query: 961  -------------KGFVCPKGCETFEDPLLLPHVMPHPHGFGNHKNARTPDTVK------ 1020
                         KGFVCPKGCET+      P +MPH +GFG++KNA TPD  K      
Sbjct: 1132 ELCREKGEVNWYMKGFVCPKGCETY------PLLMPHRNGFGDNKNACTPDPSKWKDHGC 1191

Query: 1021 --------------EKVIILCEDISFGQELVPVVCVADEGQRNSLHIPLANSDDQNARYS 1080
                          EK ++LCEDISFGQELVPVVCVAD              D QN   S
Sbjct: 1192 SYVSGSHLSSQQSREKTVVLCEDISFGQELVPVVCVAD--------------DSQNVGDS 1251

Query: 1081 MPWENFSYIKKPLLDKSLASHTENLQFGCACPHSLCSSETCDHVYLFDSDYEDPKDIYGN 1140
            +PWENF YIKKPLLDKSLA  TE+LQFGCAC H LCSSETCDHVYLF+SDYEDPKDIYGN
Sbjct: 1252 VPWENFIYIKKPLLDKSLAIDTESLQFGCACSHLLCSSETCDHVYLFNSDYEDPKDIYGN 1311

Query: 1141 PMRRRFPYDENGRMILEEGYLVYECNERCSCSQTCPNRVLQNGVQVKLEVFMTETKGWAV 1200
            PMRRRFPYDENG++ILEEGYLVYECNERCSCS+TCPNRVLQNGVQVKLEVFMTETKGWAV
Sbjct: 1312 PMRRRFPYDENGQIILEEGYLVYECNERCSCSRTCPNRVLQNGVQVKLEVFMTETKGWAV 1371

Query: 1201 RAGEAILRGTFICEYIGEVLDEQEANRRRNRYNSEGNCYFLDVDAHINDISRLIEGSARY 1219
            RAGEAI+RGTF+CEYIGEVLDEQEANRRR++YNSEGNCYFLDVDAHINDISRL++GSARY
Sbjct: 1372 RAGEAIMRGTFVCEYIGEVLDEQEANRRRDKYNSEGNCYFLDVDAHINDISRLVDGSARY 1431
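
The percentages in each HSP header follow from the reported counts. A minimal Python sketch, assuming the denominator is the alignment length including gaps (the usual BLAST convention); the helper name `percent` is hypothetical and only the numbers are taken from the A0A5D3CM86 header above:

```python
def percent(matches: int, alignment_length: int) -> str:
    """Return matches / alignment_length as a percentage with two decimals."""
    return f"{100 * matches / alignment_length:.2f}"


# Counts copied from the A0A5D3CM86 HSP header above.
identical, positive, aligned = 1063, 1114, 1331

identity_pct = percent(identical, aligned)
positive_pct = percent(positive, aligned)

print(f"Identity = {identical}/{aligned} ({identity_pct}%)")
print(f"Positives = {positive}/{aligned} ({positive_pct}%)")
```

The same helper applies to any other HSP header on this page, for example `percent(656, 1328)` for the AT2G23740.2 hit.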

BLAST of Sgr026330 vs. TAIR 10
Match: AT2G23740.2 (nucleic acid binding;sequence-specific DNA binding transcription factors;zinc ion binding )

HSP 1 Score: 1191.8 bits (3082), Expect = 0.0e+00
Identity = 656/1328 (49.40%), Positives = 836/1328 (62.95%), Query Frame = 0

Query: 1    WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVFFPHTRNYSWADALLVRSIEEFPQP 60
            WRGKWQAGIRCA+ADWPL+TL+ KPTHDRKKY V+FFPHT+NYSWAD  LVRSI EFP P
Sbjct: 76   WRGKWQAGIRCAKADWPLTTLRGKPTHDRKKYCVIFFPHTKNYSWADMQLVRSINEFPDP 135

Query: 61   IAYKSHKAGLKLVEDVKVARRFIMKKLAVSMLNIIDQFHLEALIESARDVMTWKEFAMEA 120
            IAYKSHK GLKLV+D+  ARR+IM+KL V M NI+DQF  E + E+ARD++ WKEFAMEA
Sbjct: 136  IAYKSHKIGLKLVKDLTAARRYIMRKLTVGMFNIVDQFPSEVVSEAARDIIIWKEFAMEA 195

Query: 121  SRCNGYSDLGRMLLKLQNMIVQCFINSDWLHHSLHSWVQRCQNAQTAEIIEMLKEELADA 180
            +R   Y DLG ML+KL +MI+Q +++  WL +S   WVQ+C NA  AE IE+L EE  + 
Sbjct: 196  TRSTSYHDLGIMLVKLHSMILQRYMDPIWLENSFPLWVQKCNNAVNAESIELLNEEFDNC 255

Query: 181  ILWDEVNSHDDAPVQPTFSSVWKTWKHEVTKWFSISPTLPIIRDKEQQTVEAFLATTLQV 240
            I W+EV S  ++P+QP   S WKTWKH++ KWFSIS     + +  Q   ++   + +Q 
Sbjct: 256  IKWNEVKSLSESPMQPMLLSEWKTWKHDIAKWFSISRR--GVGEIAQPDSKSVFNSDVQA 315

Query: 241  SRKRPKLEVRRAE-AHASLVESKCSDVAMALHIDSE----SHKVEARKVAKSADSLSTVP 300
            SRKRPKLE+RRAE  +A+ +ES  S   ++  IDSE         + +  K  + +   P
Sbjct: 316  SRKRPKLEIRRAETTNATHMESDTSPQGLSA-IDSEFFSSRGNTNSPETMKEENPVMNTP 375

Query: 301  GR----LGGIVVQTGNSELAFCKDV------ELTPLTEVVAEKPLNFGNKNRQCIAFIES 360
                    GIVV+ G S+    K+       +   + E V +KP   GNK++QCIAFIES
Sbjct: 376  ENGLDLWDGIVVEAGGSQFMKTKETNGLSHPQDQHINESVLKKPFGSGNKSQQCIAFIES 435

Query: 361  KGRQCVRWANEGDVYCCVHLSSRFTGNSDKKEHTRSVESPMCQGTTVLGTRCKHRSLFG- 420
            KGRQCVRWANEGDVYCCVHL+SRFT  S K E + +VE+PMC G TVLGT+CKHRSL G 
Sbjct: 436  KGRQCVRWANEGDVYCCVHLASRFTTKSMKNEGSPAVEAPMCGGVTVLGTKCKHRSLPGF 495

Query: 421  ----SYSDIYGV----EATGYKEIKFVGDVGNPLGVDE-GDVTNNG---NSSSDKLGHHG 480
                 +    G+    +++ +   + V ++ + L  ++  D+   G     S +K   HG
Sbjct: 496  LYCKKHRPHTGMVKPDDSSSFLVKRKVSEIMSTLETNQCQDLVPFGEPEGPSFEKQEPHG 555

Query: 481  KDSIASEVRH-------CIGSCEHIDSNPCLESPKRHSLYCEKHLPSWLKRARNGKSRVI 540
              S      H       CIGSC       C E   +HSLYCE+HLP+WLKRARNGKSR+I
Sbjct: 556  ATSFTEMFEHCSQEDNLCIGSCSENSYISCSEFSTKHSLYCEQHLPNWLKRARNGKSRII 615

Query: 541  SKEVFMDLLRDCISQEQKIHLHQACELFYRLFKSILSLRNPVPVEVQFQWALSEASKN-- 600
            SKEVF+DLLR C+S+E+K+ LHQAC++FY+LFKS+LSLRN VP+EVQ  WA +EAS+N  
Sbjct: 616  SKEVFVDLLRGCLSREEKLALHQACDIFYKLFKSVLSLRNSVPMEVQIDWAKTEASRNAD 675

Query: 601  FGVGEQFMKLVCREKERLKRIWGF----DAEEAQLSSYSMEVPTSGPLLASGNHDDDMAL 660
             GVGE  MKLV  E+ERL RIWGF    D E+  LS Y         LLA  N  DD   
Sbjct: 676  AGVGEFLMKLVSNERERLTRIWGFATGADEEDVSLSEY------PNRLLAITNTCDD--- 735

Query: 661  STHFMDSHKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAPFVEQCMLLQCIPCGS 720
                 D  K+  +W F G+ACAICLDSF  +K+LE HV+ERHH  F E+CMLLQCIPCGS
Sbjct: 736  -----DDDKE--KWSFSGFACAICLDSFVRRKLLEIHVEERHHVQFAEKCMLLQCIPCGS 795

Query: 721  HFGNTEQLWLHVVAVHPVDFRLSNSTQQHNSSDDEDSPVKPE--QCNIVSQENDNKNVGG 780
            HFG+ EQL +HV AVHP + +      + N ++ E S  KPE     IV  +N N+N  G
Sbjct: 796  HFGDKEQLLVHVQAVHPSECKSLTVASECNLTNGEFSQ-KPEAGSSQIVVSQN-NENTSG 855

Query: 781  LRKFICRFCGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRGLHYFAYKLKSGKLGHP-RF 840
            + KF+C+FCGLKF+LLPDLGRHHQA HMGP LV SR  K+G+ +  Y++KSG+L  P +F
Sbjct: 856  VHKFVCKFCGLKFNLLPDLGRHHQAEHMGPSLVGSRGPKKGIRFNTYRMKSGRLSRPNKF 915

Query: 841  KKTLAGASNRIRNRTKASMKKHIQASKLLST---GSINLQPHVSQSASSRKLTQG--STV 900
            KK+L   S RIRNR   +MK+ +Q SK L T       + P +  S +   +T    S V
Sbjct: 916  KKSLGAVSYRIRNRAGVNMKRRMQGSKSLGTEGNTEAGVSPPLDDSRNFDGVTDAHCSVV 975

Query: 901  AKALVSEIQKRKLSPTNIDILSIARSACCK------------------------------ 960
            +  L+S++QK K  P N+DILS ARSACC+                              
Sbjct: 976  SDILLSKVQKAKHRPNNLDILSAARSACCRVSVETSLEAKFGDLPDRIYLKAAKLCGEQG 1035

Query: 961  --------GFVCPKGCETFEDPLLLPHVMPHPHG--FGNHKNARTPDTVKEKV------- 1020
                    G++C  GC+  +DP LL  ++P      FG   +A     ++ +V       
Sbjct: 1036 VQVQWHQEGYICSNGCKPVKDPNLLHPLIPRQENDRFGIAVDAGQHSNIELEVDECHCIM 1095

Query: 1021 -------------IILCEDISFGQELVPVVCVADEGQRNSLHIPLANSDDQNARYSMPWE 1080
                          +LC+DISFG+E VP +CV D+   NS              Y MPWE
Sbjct: 1096 EAHHFSKRPFGNTAVLCKDISFGKESVP-ICVVDDDLWNS-----------EKPYEMPWE 1155

Query: 1081 NFSYIKKPLLDKSLASHTENLQFGCACPHSLCSSETCDHVYLFDSDYEDPKDIYGNPMRR 1140
             F+Y+   +L  S+    ENLQ  C+C  S+CS  TCDHVYLF +D+ED +DIYG  MR 
Sbjct: 1156 CFTYVTNSILHPSMDLVKENLQLRCSCRSSVCSPVTCDHVYLFGNDFEDARDIYGKSMRC 1215

Query: 1141 RFPYDENGRMILEEGYLVYECNERCSCSQTCPNRVLQNGVQVKLEVFMTETKGWAVRAGE 1200
            RFPYD   R+ILEEGY VYECN+ C CS+TC NRVLQNG++ KLEVF TE+KGW +RA E
Sbjct: 1216 RFPYDGKQRIILEEGYPVYECNKFCGCSRTCQNRVLQNGIRAKLEVFRTESKGWGLRACE 1275

Query: 1201 AILRGTFICEYIGEVLDEQEANRRRNRYNSEGNCYFLDVDAHINDISRLIEGSARYIIDA 1217
             ILRGTF+CEYIGEVLD+QEAN+RRN+Y +    Y LD+DA+INDI RL+E    Y IDA
Sbjct: 1276 HILRGTFVCEYIGEVLDQQEANKRRNQYGNGDCSYILDIDANINDIGRLMEEELDYAIDA 1335

BLAST of Sgr026330 vs. TAIR 10
Match: AT2G23740.1 (nucleic acid binding;sequence-specific DNA binding transcription factors;zinc ion binding )

HSP 1 Score: 1182.5 bits (3058), Expect = 0.0e+00
Identity = 654/1328 (49.25%), Positives = 833/1328 (62.73%), Query Frame = 0

Query: 1    WRGKWQAGIRCARADWPLSTLKAKPTHDRKKYFVVFFPHTRNYSWADALLVRSIEEFPQP 60
            WRGKWQAGIRCA+ADWPL+TL+ KPTHDRKKY V+FFPHT+NYSWAD  LVRSI EFP P
Sbjct: 76   WRGKWQAGIRCAKADWPLTTLRGKPTHDRKKYCVIFFPHTKNYSWADMQLVRSINEFPDP 135

Query: 61   IAYKSHKAGLKLVEDVKVARRFIMKKLAVSMLNIIDQFHLEALIESARDVMTWKEFAMEA 120
            IAYKSHK GLKLV+D+  ARR+IM+KL V M NI+DQF  E + E+ARD++ WKEFAMEA
Sbjct: 136  IAYKSHKIGLKLVKDLTAARRYIMRKLTVGMFNIVDQFPSEVVSEAARDIIIWKEFAMEA 195

Query: 121  SRCNGYSDLGRMLLKLQNMIVQCFINSDWLHHSLHSWVQRCQNAQTAEIIEMLKEELADA 180
            +R   Y DLG ML+KL +MI+Q +++  WL +S   WVQ+C NA  AE IE+L E     
Sbjct: 196  TRSTSYHDLGIMLVKLHSMILQRYMDPIWLENSFPLWVQKCNNAVNAESIELLNE----- 255

Query: 181  ILWDEVNSHDDAPVQPTFSSVWKTWKHEVTKWFSISPTLPIIRDKEQQTVEAFLATTLQV 240
              W+EV S  ++P+QP   S WKTWKH++ KWFSIS     + +  Q   ++   + +Q 
Sbjct: 256  --WNEVKSLSESPMQPMLLSEWKTWKHDIAKWFSISRR--GVGEIAQPDSKSVFNSDVQA 315

Query: 241  SRKRPKLEVRRAE-AHASLVESKCSDVAMALHIDSE----SHKVEARKVAKSADSLSTVP 300
            SRKRPKLE+RRAE  +A+ +ES  S   ++  IDSE         + +  K  + +   P
Sbjct: 316  SRKRPKLEIRRAETTNATHMESDTSPQGLSA-IDSEFFSSRGNTNSPETMKEENPVMNTP 375

Query: 301  GR----LGGIVVQTGNSELAFCKDV------ELTPLTEVVAEKPLNFGNKNRQCIAFIES 360
                    GIVV+ G S+    K+       +   + E V +KP   GNK++QCIAFIES
Sbjct: 376  ENGLDLWDGIVVEAGGSQFMKTKETNGLSHPQDQHINESVLKKPFGSGNKSQQCIAFIES 435

Query: 361  KGRQCVRWANEGDVYCCVHLSSRFTGNSDKKEHTRSVESPMCQGTTVLGTRCKHRSLFG- 420
            KGRQCVRWANEGDVYCCVHL+SRFT  S K E + +VE+PMC G TVLGT+CKHRSL G 
Sbjct: 436  KGRQCVRWANEGDVYCCVHLASRFTTKSMKNEGSPAVEAPMCGGVTVLGTKCKHRSLPGF 495

Query: 421  ----SYSDIYGV----EATGYKEIKFVGDVGNPLGVDE-GDVTNNG---NSSSDKLGHHG 480
                 +    G+    +++ +   + V ++ + L  ++  D+   G     S +K   HG
Sbjct: 496  LYCKKHRPHTGMVKPDDSSSFLVKRKVSEIMSTLETNQCQDLVPFGEPEGPSFEKQEPHG 555

Query: 481  KDSIASEVRH-------CIGSCEHIDSNPCLESPKRHSLYCEKHLPSWLKRARNGKSRVI 540
              S      H       CIGSC       C E   +HSLYCE+HLP+WLKRARNGKSR+I
Sbjct: 556  ATSFTEMFEHCSQEDNLCIGSCSENSYISCSEFSTKHSLYCEQHLPNWLKRARNGKSRII 615

Query: 541  SKEVFMDLLRDCISQEQKIHLHQACELFYRLFKSILSLRNPVPVEVQFQWALSEASKN-- 600
            SKEVF+DLLR C+S+E+K+ LHQAC++FY+LFKS+LSLRN VP+EVQ  WA +EAS+N  
Sbjct: 616  SKEVFVDLLRGCLSREEKLALHQACDIFYKLFKSVLSLRNSVPMEVQIDWAKTEASRNAD 675

Query: 601  FGVGEQFMKLVCREKERLKRIWGF----DAEEAQLSSYSMEVPTSGPLLASGNHDDDMAL 660
             GVGE  MKLV  E+ERL RIWGF    D E+  LS Y         LLA  N  DD   
Sbjct: 676  AGVGEFLMKLVSNERERLTRIWGFATGADEEDVSLSEY------PNRLLAITNTCDD--- 735

Query: 661  STHFMDSHKKEAQWLFRGYACAICLDSFTNKKVLETHVQERHHAPFVEQCMLLQCIPCGS 720
                 D  K+  +W F G+ACAICLDSF  +K+LE HV+ERHH  F E+CMLLQCIPCGS
Sbjct: 736  -----DDDKE--KWSFSGFACAICLDSFVRRKLLEIHVEERHHVQFAEKCMLLQCIPCGS 795

Query: 721  HFGNTEQLWLHVVAVHPVDFRLSNSTQQHNSSDDEDSPVKPE--QCNIVSQENDNKNVGG 780
            HFG+ EQL +HV AVHP + +      + N ++ E S  KPE     IV  +N N+N  G
Sbjct: 796  HFGDKEQLLVHVQAVHPSECKSLTVASECNLTNGEFSQ-KPEAGSSQIVVSQN-NENTSG 855

Query: 781  LRKFICRFCGLKFDLLPDLGRHHQAAHMGPGLVNSRPAKRGLHYFAYKLKSGKLGHP-RF 840
            + KF+C+FCGLKF+LLPDLGRHHQA HMGP LV SR  K+G+ +  Y++KSG+L  P +F
Sbjct: 856  VHKFVCKFCGLKFNLLPDLGRHHQAEHMGPSLVGSRGPKKGIRFNTYRMKSGRLSRPNKF 915

Query: 841  KKTLAGASNRIRNRTKASMKKHIQASKLLST---GSINLQPHVSQSASSRKLTQG--STV 900
            KK+L   S RIRNR   +MK+ +Q SK L T       + P +  S +   +T    S V
Sbjct: 916  KKSLGAVSYRIRNRAGVNMKRRMQGSKSLGTEGNTEAGVSPPLDDSRNFDGVTDAHCSVV 975

Query: 901  AKALVSEIQKRKLSPTNIDILSIARSACCK------------------------------ 960
            +  L+S++QK K  P N+DILS ARSACC+                              
Sbjct: 976  SDILLSKVQKAKHRPNNLDILSAARSACCRVSVETSLEAKFGDLPDRIYLKAAKLCGEQG 1035

Query: 961  --------GFVCPKGCETFEDPLLLPHVMPHPHG--FGNHKNARTPDTVKEKV------- 1020
                    G++C  GC+  +DP LL  ++P      FG   +A     ++ +V       
Sbjct: 1036 VQVQWHQEGYICSNGCKPVKDPNLLHPLIPRQENDRFGIAVDAGQHSNIELEVDECHCIM 1095

Query: 1021 -------------IILCEDISFGQELVPVVCVADEGQRNSLHIPLANSDDQNARYSMPWE 1080
                          +LC+DISFG+E VP +CV D+   NS              Y MPWE
Sbjct: 1096 EAHHFSKRPFGNTAVLCKDISFGKESVP-ICVVDDDLWNS-----------EKPYEMPWE 1155

Query: 1081 NFSYIKKPLLDKSLASHTENLQFGCACPHSLCSSETCDHVYLFDSDYEDPKDIYGNPMRR 1140
             F+Y+   +L  S+    ENLQ  C+C  S+CS  TCDHVYLF +D+ED +DIYG  MR 
Sbjct: 1156 CFTYVTNSILHPSMDLVKENLQLRCSCRSSVCSPVTCDHVYLFGNDFEDARDIYGKSMRC 1215

Query: 1141 RFPYDENGRMILEEGYLVYECNERCSCSQTCPNRVLQNGVQVKLEVFMTETKGWAVRAGE 1200
            RFPYD   R+ILEEGY VYECN+ C CS+TC NRVLQNG++ KLEVF TE+KGW +RA E
Sbjct: 1216 RFPYDGKQRIILEEGYPVYECNKFCGCSRTCQNRVLQNGIRAKLEVFRTESKGWGLRACE 1275

Query: 1201 AILRGTFICEYIGEVLDEQEANRRRNRYNSEGNCYFLDVDAHINDISRLIEGSARYIIDA 1217
             ILRGTF+CEYIGEVLD+QEAN+RRN+Y +    Y LD+DA+INDI RL+E    Y IDA
Sbjct: 1276 HILRGTFVCEYIGEVLDQQEANKRRNQYGNGDCSYILDIDANINDIGRLMEEELDYAIDA 1335

BLAST of Sgr026330 vs. TAIR 10
Match: AT3G50780.1 (BEST Arabidopsis thaliana protein match is: BTB/POZ domain-containing protein (TAIR:AT1G63850.1); Has 298 Blast hits to 298 proteins in 22 species: Archae - 0; Bacteria - 0; Metazoa - 10; Fungi - 0; Plants - 287; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 595.9 bits (1535), Expect = 1.1e-169
Identity = 315/527 (59.77%), Positives = 394/527 (74.76%), Query Frame = 0

Query: 1349 MAEIRLTRVEQGQTKIKNVPIAVTPEGFWCCPSPVVFQKTLKGQNALNKPKPASPTPKSP 1408
            MAEI+  +VEQ QTKI+NVP+AVTPEGFWCCPSPV FQKTLK  N+L K K +SP  + P
Sbjct: 1    MAEIKNAKVEQRQTKIRNVPVAVTPEGFWCCPSPVAFQKTLKSHNSLTKHKQSSPALQPP 60

Query: 1409 VEKKPTPVTDRKPALTRSRSAAVSDDDRKCNADN----SGFSAPEVVHRVP-RPKIENMP 1468
               KP    ++KP+ T  RS   SD+ ++ N  +       + P  V   P R K+E +P
Sbjct: 61   ---KP----EKKPSSTTIRSVIASDETQQ-NLGSFDTVHSIAVPATVQERPQRQKVETLP 120

Query: 1469 RKIAIEFGEPGTSNIKVVLLGKQGFSVKLSVHKNVLMDNSTFFANKLSDKEG--SSLEIG 1528
            RK+AIEFGEPG+S+ KV+L+GKQGF VKLSVHK VL+D+S FFA KL++K+   + LEI 
Sbjct: 121  RKVAIEFGEPGSSDAKVILVGKQGFCVKLSVHKKVLVDHSCFFAKKLAEKDSVFACLEIE 180

Query: 1529 DCEDVEIYVETVGLMYCKEMKQWLMKQNVSRVLRILKVAEFLGFKSCMQSCLEYLEAVPW 1588
             CED E+YVET+GLMYCK+MKQ LMKQNVSRVLR+LKVAE LGF SC+QSCL+YLEAVPW
Sbjct: 181  SCEDAELYVETIGLMYCKDMKQRLMKQNVSRVLRVLKVAELLGFSSCIQSCLDYLEAVPW 240

Query: 1589 VG-DEEEKVVTSILRLQSEGIGVSPVLKRVSADVSKPHKDTLSHIIELVLRSNEERGRRE 1648
            VG +EEEKV++SILRL++EG+GV+PVLKRV+++   P K+TLS IIELVLRS EE+ RRE
Sbjct: 241  VGEEEEEKVISSILRLKTEGVGVTPVLKRVASNAVDPPKETLSRIIELVLRSKEEKSRRE 300

Query: 1649 MKLVVLRLLRENQSVPSHASSADICNEIIYSSCRSCLGSLLFLFQQAAETDFTDRSVDRK 1708
            MK +VL+LLRE       A+ AD  N+ IYSSC++CL S+L LF+QA+E        ++ 
Sbjct: 301  MKSIVLKLLREQNG----ANVADNFNDTIYSSCQTCLDSVLSLFKQASEG-------EKP 360

Query: 1709 EPVLKQITLEADNLSWLLEILADRQAADEFA--------------------------LYA 1768
            E   KQI +EADNL+WLL++LA+RQAA+EF+                          + +
Sbjct: 361  ETDTKQIAVEADNLTWLLDVLAERQAAEEFSVTWANQKELALLHEKLPLMSRYHISRVTS 420

Query: 1769 RLFVGIGKGELLPAKDTRKLLLHTWLEPLINDYSWLKHGCGSFDRKVVEEGIGRTILTLP 1828
            RLF+GIG+GELLP+KDTR LLL TWL+PL NDY+WL+HGC SFD K+VEEGIGRTILTLP
Sbjct: 421  RLFIGIGRGELLPSKDTRLLLLTTWLQPLFNDYNWLQHGCRSFDGKLVEEGIGRTILTLP 480

Query: 1829 LEDQQNILLTWLGSFLKVGDSCPNLQRAFEVWWRRTFIRPYVETEGS 1842
            LEDQQ+ILL+WLGSFL  GD CPNLQRAFEVWWRR+FIRPY + + +
Sbjct: 481  LEDQQSILLSWLGSFLNGGDGCPNLQRAFEVWWRRSFIRPYSDRQAN 508

BLAST of Sgr026330 vs. TAIR 10
Match: AT1G63850.1 (BTB/POZ domain-containing protein )

HSP 1 Score: 305.8 bits (782), Expect = 2.3e-82
Identity = 170/461 (36.88%), Positives = 257/461 (55.75%), Query Frame = 0

Query: 1421 PALTRSRSAAVSDDDRKCNADNSGFSAPE----VVHRVPRPKIENMPRKIAIEFGEPGTS 1480
            P+ + S S+A +   R  N ++   SA +     + R+    +   P     +F +P +S
Sbjct: 85   PSSSSSSSSAAATAARTTNVNHLVISAQDKQALAMQRISDLLVIRSPGN---QFNDPNSS 144

Query: 1481 NIKVVLLGKQGFSVKLSVHKNVLMDNSTFFANKLSDKEGSS----------LEIGDCEDV 1540
            ++K+ L  K G S+ + VH+ +L+ +S FFA KLSD+              +EI DC+DV
Sbjct: 145  DVKLTLSSKDGISITMCVHRQILVAHSRFFAMKLSDRWSKQQLPPSSSPYIVEISDCDDV 204

Query: 1541 EIYVETVGLMYCKEMKQWLMKQNVSRVLRILKVAEFLGFKSCMQSCLEYLEAVPWVGDEE 1600
            E+Y+ET+ LMYC+++++ +M+ +VSRVL ILKV+  +GF + + SCLEYLEA PW  DEE
Sbjct: 205  EVYIETLMLMYCRDLRKKMMRHDVSRVLGILKVSAAIGFDAGVLSCLEYLEAAPWSEDEE 264

Query: 1601 EKVVTSILRLQSEGIGVSPVLKRVSADVSK--------PHKDTLSHIIELVLRSNEERGR 1660
             ++ + +  L  E +G + VL+RVS + S          + + L +++ +VL   +E+ R
Sbjct: 265  YRIASLLSELHLENVGATEVLRRVSVEASNGNNGSNGGSNDEVLLNLLHIVLEGKDEKAR 324

Query: 1661 REMKLVVLRLLRENQSVPSHASSADICNEIIYSSCRSCLGSLLFLFQQAAETDFTDRSVD 1720
            R+MK +V ++LREN      +S  D+  E +Y +C  CL  L   F QAAE+D  +    
Sbjct: 325  RDMKTLVSKMLREN------SSGNDLRKESLYLACDGCLHKLKRQFLQAAESDLEN---- 384

Query: 1721 RKEPVLKQITLEADNLSWLLEILADRQAADEFALY------------------------- 1780
                 + QI  +ADNL W+L+IL DRQ A++F +                          
Sbjct: 385  -----VDQIARQADNLHWILDILIDRQIAEDFIVMWASLSELSEVHSKVPVVHRFEISRV 444

Query: 1781 -ARLFVGIGKGELLPAKDTRKLLLHTWLEPLINDYSWLKHGCGSFDRKVVEEGIGRTILT 1834
             AR+FVGIGKG++L  K+ R LLL  WL P  +D+ W++      DR ++E+G+  TILT
Sbjct: 445  TARIFVGIGKGQILTPKEVRCLLLRNWLTPFYDDFGWMRRASKGLDRYLIEDGLSNTILT 504

BLAST of Sgr026330 vs. TAIR 10
Match: AT5G60050.1 (BTB/POZ domain-containing protein )

HSP 1 Score: 275.4 bits (703), Expect = 3.3e-73
Identity = 150/396 (37.88%), Positives = 229/396 (57.83%), Query Frame = 0

Query: 1474 GTSNIKVVLLGKQGFSVKLSVHKNVLMDNSTFFANKLSDKE----GSSLEIGDCEDVEIY 1533
            G  ++K+ ++GK G+ V + VH+ VL + S FF  K++ +        +EI +C+D+EIY
Sbjct: 97   GPGDVKLTVVGKDGYRVTMDVHRKVLSEKSRFFMEKMNSRREKGVSHMVEISECDDLEIY 156

Query: 1534 VETVGLMYCKEMKQWLMKQNVSRVLRILKVAEFLGFKSCMQSCLEYLEAVPWVGDEEEKV 1593
            VETV LMY  ++K+ L+ +NV ++L +LKV+  + F   + SCLE+LEAVPW  DEEE V
Sbjct: 157  VETVVLMYSDDLKKKLIGENVIKILALLKVSAAISFDEGVMSCLEHLEAVPWSEDEEETV 216

Query: 1594 VTSILRLQSEGIGVSPVLKRVSADVSKPH-----KDTLSHIIELVLRSNEERGRREMKLV 1653
            VT +  L      V+ +L+RVS+  S         D  S ++  VL++ +++ RREMK++
Sbjct: 217  VTCLEELHLPDDSVTLILQRVSSQPSTSSTRTRTDDIFSKLLTGVLQAKDDKARREMKVL 276

Query: 1654 VLRLLRENQSVPSHASSADICNEIIYSSCRSCLGSLLFLFQQAAETDFTDRSVDRKEPVL 1713
            + +L+RE        +  D+  + +Y  C  CL SL+    +   T   D   DR   ++
Sbjct: 277  IFKLVREE-------ADYDVSRDTLYGLCHRCLTSLVLCLSEVT-TQMNDPGKDR-GALM 336

Query: 1714 KQITLEADNLSWLLEILADRQAADEFA--------------------------LYARLFV 1773
             +I  EADN+ W+++IL +++   EF                           + A++ V
Sbjct: 337  GEIAREADNMLWMVDILIEKKLCSEFVKLWADQKELANLHSKIPTMYRHEISKITAQICV 396

Query: 1774 GIGKGELLPAKDTRKLLLHTWLEPLINDYSWLKH-GCGSFDRKVVEEGIGRTILTLPLED 1833
            GIGKG +L  ++TR  +L+TWLE L +D+ W++     S DRK+VE+G+ +TILTL L  
Sbjct: 397  GIGKGRILVNRETRFAVLNTWLEALYDDFGWMRRLSSRSLDRKLVEDGLSQTILTLSLRQ 456

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_022132628.1	0.0e+00	83.89	histone-lysine N-methyltransferase SUVR5 isoform X1 [Momordica charantia]
XP_038881305.1	0.0e+00	81.70	histone-lysine N-methyltransferase SUVR5 isoform X1 [Benincasa hispida]
XP_038881307.1	0.0e+00	81.62	histone-lysine N-methyltransferase SUVR5 isoform X2 [Benincasa hispida]
XP_022132629.1	0.0e+00	82.94	histone-lysine N-methyltransferase SUVR5 isoform X2 [Momordica charantia]
XP_023517545.1	0.0e+00	80.54	histone-lysine N-methyltransferase SUVR5 [Cucurbita pepo subsp. pepo]

Match Name	E-value	Identity	Description
O64827	0.0e+00	49.40	Histone-lysine N-methyltransferase SUVR5 OS=Arabidopsis thaliana OX=3702 GN=SUVR...
Q9SVM0	1.6e-168	59.77	BTB/POZ domain-containing protein At3g50780 OS=Arabidopsis thaliana OX=3702 GN=A...
Q9CAJ9	3.3e-81	36.88	BTB/POZ domain-containing protein At1g63850 OS=Arabidopsis thaliana OX=3702 GN=A...
Q9LVG9	4.7e-72	37.88	BTB/POZ domain-containing protein At5g60050 OS=Arabidopsis thaliana OX=3702 GN=A...
Q9SKH2	1.5e-46	27.66	BTB/POZ domain-containing protein At2g13690 OS=Arabidopsis thaliana OX=3702 GN=P...

Match Name	E-value	Identity	Description
A0A6J1BWT1	0.0e+00	83.89	histone-lysine N-methyltransferase SUVR5 isoform X1 OS=Momordica charantia OX=36...
A0A6J1BTN5	0.0e+00	82.94	histone-lysine N-methyltransferase SUVR5 isoform X2 OS=Momordica charantia OX=36...
A0A6J1HDR3	0.0e+00	80.47	histone-lysine N-methyltransferase SUVR5 OS=Cucurbita moschata OX=3662 GN=LOC111...
A0A6J1KT71	0.0e+00	80.39	histone-lysine N-methyltransferase SUVR5 OS=Cucurbita maxima OX=3661 GN=LOC11149...
A0A5D3CM86	0.0e+00	79.86	Histone-lysine N-methyltransferase SUVR5 isoform X2 OS=Cucumis melo var. makuwa ...

Match Name	E-value	Identity	Description
AT2G23740.2	0.0e+00	49.40	nucleic acid binding;sequence-specific DNA binding transcription factors;zinc io...
AT2G23740.1	0.0e+00	49.25	nucleic acid binding;sequence-specific DNA binding transcription factors;zinc io...
AT3G50780.1	1.1e-169	59.77	BEST Arabidopsis thaliana protein match is: BTB/POZ domain-containing protein (T...
AT1G63850.1	2.3e-82	36.88	BTB/POZ domain-containing protein
AT5G60050.1	3.3e-73	37.88	BTB/POZ domain-containing protein

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR007728	Pre-SET domain	SMART	SM00468	preset_2	coord: 928..1057; e-value: 9.9E-5; score: 30.5
IPR007728	Pre-SET domain	PFAM	PF05033	Pre-SET	coord: 949..1065; e-value: 3.7E-15; score: 56.5
IPR007728	Pre-SET domain	PROSITE	PS50867	PRE_SET	coord: 994..1070; score: 9.086351
IPR013087	Zinc finger C2H2-type	SMART	SM00355	c2h2final6	coord: 638..661; e-value: 0.11; score: 21.6
IPR013087	Zinc finger C2H2-type	SMART	SM00355	c2h2final6	coord: 672..695; e-value: 6.6; score: 14.9
IPR013087	Zinc finger C2H2-type	SMART	SM00355	c2h2final6	coord: 741..764; e-value: 1.7; score: 17.7
IPR013087	Zinc finger C2H2-type	PROSITE	PS00028	ZINC_FINGER_C2H2_1	coord: 743..764
IPR013087	Zinc finger C2H2-type	PROSITE	PS00028	ZINC_FINGER_C2H2_1	coord: 674..695
IPR013087	Zinc finger C2H2-type	PROSITE	PS50157	ZINC_FINGER_C2H2_2	coord: 741..769; score: 8.953676
IPR013087	Zinc finger C2H2-type	PROSITE	PS50157	ZINC_FINGER_C2H2_2	coord: 638..661; score: 8.787411
IPR001214	SET domain	SMART	SM00317	set_7	coord: 1073..1211; e-value: 1.4E-37; score: 140.9
IPR001214	SET domain	PFAM	PF00856	SET	coord: 1084..1204; e-value: 1.8E-19; score: 70.8
IPR001214	SET domain	PROSITE	PS50280	SET	coord: 1073..1205; score: 18.377905
IPR040689	SUVR5, C2H2-type Zinc finger, 3 repeats	PFAM	PF18868	zf-C2H2_3rep	coord: 640..765; e-value: 1.0E-67; score: 226.6
None	No IPR available	GENE3D	2.170.270.10	SET domain	coord: 908..1208; e-value: 7.7E-82; score: 276.9
None	No IPR available	GENE3D	3.30.160.60	Classic Zinc Finger	coord: 635..719; e-value: 8.4E-6; score: 27.7
None	No IPR available	PIRSR	PIRSR009343-2	PIRSR009343-2	coord: 968..1207; e-value: 1.4E-53; score: 179.5
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1392..1444
None	No IPR available	PANTHER	PTHR47325:SF1	HISTONE-LYSINE N-METHYLTRANSFERASE SUVR5	coord: 1..881
None	No IPR available	PANTHER	PTHR47325:SF1	HISTONE-LYSINE N-METHYLTRANSFERASE SUVR5	coord: 882..1218
None	No IPR available	PANTHER	PTHR47325	HISTONE-LYSINE N-METHYLTRANSFERASE SUVR5	coord: 1..881
None	No IPR available	PANTHER	PTHR47325	HISTONE-LYSINE N-METHYLTRANSFERASE SUVR5	coord: 882..1218
None	No IPR available	SUPERFAMILY	82199	SET domain	coord: 919..1207
IPR011333	SKP1/BTB/POZ domain superfamily	GENE3D	3.30.710.10	Potassium Channel Kv1.1; Chain A	coord: 1478..1579; e-value: 5.9E-6; score: 28.7

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Sgr026330.1	Sgr026330.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0034968 histone lysine methylation
cellular_component GO:0005634 nucleus
molecular_function GO:0018024 histone-lysine N-methyltransferase activity
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding