Sgr021134 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr021134
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: protein SET DOMAIN GROUP 40 isoform X1
Location: tig00153640: 992031 .. 1008092 (+)
RNA-Seq Expression: Sgr021134
Synteny: Sgr021134
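
The Location field above packs a contig name, a start, an end, and a strand into one string. The following is a minimal sketch of how such a string could be split and the genomic span computed. The function names `parse_location` and `span_length` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Pattern for strings shaped like "tig00153640: 992031 .. 1008092 (+)".
LOCATION = re.compile(r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split a location string into (contig, start, end, strand)."""
    match = LOCATION.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return contig, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end, strand = parse_location("tig00153640: 992031 .. 1008092 (+)")
print(contig, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the span for this gene is 1008092 - 992031 + 1 = 16062 bp.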
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAACAGAAGGAAGTCTTGAAAGCCTGCTGAGATGGGCGGCGGATCATGGAATTTCAGATTCTGTCGAGAAACAGAGTTCACATTCTTGTCTAGGTCGTTCATTATGTGTCTCTTTCTTCCCTGATGCTGGCGGGTGGGTATACTCTCATGTTTCGCTTTCTATTTTCATCACTCTTTTTTCACTTCGCCGCACGTAAGTTCTTAATTTTGAGGGAAATGGTGCTTCAGGAGAGGTTTGGGGGCTGTTCGTAATCTTAACAAAGGAGATTTAGTACTAAGAGTTCCAAAATCTGTCTTGTTGACGACCCAGAGTTTGTTGTTGGAAGATGAGAAGCTCTCCATGGCTCTGAAGAGATACCCATCTCTTTCTTCAACTCAGGTTCTTCCTTTGCTAACTTGGACTTTTCCTGTTCTTTTCACTCTCGATGATGCATAGTTAGCACGAATTTATGTTGAACTTGAACATGATCACTTTGGAGGTGGGTGCTTGGTGAGAATAAGGTGCAAGTACGAGTGAGATAAGTTATATAACTTATCAAAAAAATATTAAACTTTGAAAGCTTATGAATTCCTTGCACATTTAACATGAAGAGCATCTAAAATCAAGAGAGTTAAAGTCTTTTGATGTTAGGAGTACGTTAAGTCACAAACACAAATAATTTGTTCCCAGTCATAGAAGGAGAGTAAACTACCTATCAGAACTTGGAAAAGGATGCAATGGCTAATGCGGGAGTGGTTCTTCATATTAATTATGAGCAACTGTTCTACGAAAGCATCAATATTGTAGACAAAGATTTTGGTCATTGAGTCATTGTACAGAGGAGTTTTATGGGTTAAGTGCCTGAAATAAACTAGTTGAGAGTGAGTATTAACCAATTGCCCAACTTAAGGTTTAAGGTTGAATATTCAAGAAAGTTGGAACTCCATCTTATCAGATGTTTAACTGATGCAATAAGCCTGGCACATAAGATTGAAAAACAGCTAGAGAAGAGATACTAAAAGAATCGGTCACCTAAAAGAGTGAATTGAGACAAAGATGGATTAAATGGATGATGGGCTGTGTTAAGGGCACTAAATTCTCTATTTTCATTAACGGTCGGCCTAGAGGAAGAATCCAAGCAACAAGAGGTTTAAGGCAAGGTGACCCCTATCACCTTTCCTCTTTCTAATGGTTGGAGATGTGTTGGTAGTCTTATTGATAACATCCACTTGAAAGGTTCTTTTGAAGGTTTTATTGTTGGCAGCAAGTCAGTACACATTTCATGCCTTCAATTTGCTGATGACACTTTACTATTTTGTAAAGACAAAGATTCTATGTTGGAAACTTTGATCTGGAGCTTTTGAATGGGTATCAGGACTGAAAGTTAGTTGGGATAAATAAGCCATTTGTGGTACTTCTTCACATTTGATTCAAATGGCATCTAGACTTCATTGCAAGGTGGAGACGCCACCAATCTCTTATTTGGGCATGCCTCTAGGAAGTAATCCGAGATCCTTTAATTTTTAGCTTCCTATTACTGAAAAAATTAAAAAGAAATTGGATAGATGGAAGAGATTCCAACTTTCTAGAGGTGGAAGATTGACGTTATGAATACGGTCCTCTCTAATATTTTTGTCTTTGTTTCAGAAAAAAAAAAAAGTGAACTGATACCTTCCAGGAAATATACAAACAAAGGGAGTAGTAGATGATCAAAAGAAGTTCCAAATTGGGAGTTCTTCCACCTTTAATAATCAAAGATTTAAAGAAGAATGCCCAACCAAAAATATAAAGAAGCAAAGGGTACAGAAAACAGTACGATAAGGAAGTCCAATGATCCTTGCAACAAACCTACTTTGGGTGAATGTTTTAGGTATGGACAGTAGGAGCATCTTCCAAATGAGTCCTCTGAAGAAGATCAGTTGCTACCATTAAGGAAATTTATGAAGACAATCTAAATGAAAGTGAGAGTAAGACGAGGAATTATCCCACATAAAACTTGATGAGAGGAACAACTAT
CTTGTGCAGTACAAAGAATTTCAGAAACAAGGATCACAATATATTTCCGTCTCCACGAATTTCTTCTAGGTGTTTACACAAAACAGAACAAGAATGTAGGAAAGCGTTCGACTTCATGTTATTCAGTTCATGCAGAGGATAACATGTCACACAATCAATTGATTAAAAAGACAGCAGATGTCTGCAATTTCAGCTCAAATAATTTACATATATTGTGTGGGAACCCTCATTTCTTCCCATTCTTCTTGAGCCCAGCACCTCCAAAGATCCCCTTCTGTTGGGCCTTTGCTCTAAGCTCCTGAGCGCCTTTCCTTTTCCTTCTTCTTTTGAATATTGGCCAAATCAACTCGTTATAATCTTACTTCTCAACCTTGGGTTGCTTCAGAGGTGTAACTAAAGGTTTGAGAGAATCAATGATTCTATTCCATTCACTAAAAGATATATATAGACAAGTATACAAGGTGAACATAAAGCAAAGAATGTAAAATGACGATAAAGGACAAATGACTAAAACTAAGATATTTACAATAATATAAAATCTAATAATATAAGCACTATAACACTCCCCTTCAAGTTGGAGCATATATATTAATCATGCCCAGCTGATTAGAGAGATAATCTATACGTGCTCCATTTAATGCTTTCGTAAAGATATTCCTAATTACTCTCCAGTCTTTACATATCCTGTGGACACCAAACCTTGCTGTATTTTTCACGTACAAAATGACAATCAATCTCAATGTGTTTGGTTCGTTCATGAAATACTGGATTAGATGCAATATGGAGAGTTGCTTGATTATCACACCTTTGGTTGGGGTTGTGATATCAAATCCCAATTTAGTCAGAAGTTGATATATCACACTAATTCACACACAGATTGTGTCATCGCTCTATATTCTGATTCAGCACTTGAACGTGACACCACATTTTGTTTCTTACTCTTCCAAAAACTAGATTACCTCTACAAATACACAATATCCTGAAGTTGATCTTCTGTCTTCCTTAGATCCTGCCCAGTCACATCTGAGAAACATTCAATATTAGTATAACCATGATCTTTATATAATAAACCACGCCCAGGAGCAGCTTTCAAATAACATAGAATTTGTTATAATGCAGCCCAATGGTCAATAGTAGGAGAAGACATATACTGACTCACAATACGCACTGCATAAGCTATGTTCGGCCTAGTCACTGTAAGATAATTTAGCTTTCCTTCTAACCTCTTATACCTTTCAGGATCTTTCAGTAATTCTCCCTCTTTTGTGAGTTGTAAATTAAGCATCATTGGGGTACTACGTGGCTTAGCACTTAACTTCCCTGTCTCGGTTAACAAATCAAGTACATATTTTCTCTATGATAATAAAATTCCCTTCTTACTTTGTATTACCTCAATTCTCAAAAAGTATTTCAACATTCCCAAATCTTTTGTATGGAATTGACTATGAAGAAAAGTCTTAAGAGATAGAATACTTGATGTATCATTACTAGTAATAACAATATCATCAACATATACAACTAGTAAGATGACACCAGTATCAGATCGTTTATAAAAGACATAATGATCTGACTTGCTTTTCTTCATTCCAAAGTTCTCAATCACCTGACTGAATTTTCAAACCACGCTCGTGGACTTTGCTTTAAGCCATACAAGGATTTACGAAGATGACATACCTTTCCATTCTCCCCTGGCAACAAAACCCGGTGGTTGCTCCATATACACTTCTTTAAGATCACCATGTAGAAAGACATTTTAATATCAAGCTGATGCAAGGGCCAATGATAGATTGATGCCAACGAAATGAATAGCCTGACAGAAGCCAATTAGCAACAGGAGAAAAATATATCAGAATAGTCAACTCCATAAGTCTGCACGTAGCCTTTAGCAACAAGACGAGCTTTCAATTGCGCGACAGATCTGTCAGGATTAACTTTAATGCAAACACCCATTTACAACCAATAGGCTTCTTTCCTGCAGGAAGAAAACTAAATCCTAAGTA
CCATTGTCACCTAAGGTAGTCATCTCCTCTAACATTGCAGCACGCCAACCAGAATGAGACAACGCTTCATGAACAGTTTTAGAACAGAGAAAGACTCAAGAGATGCAATGAACGAACAAGTAGGAATGACAAATGATTATATGAAGCAAAAAGGGAAACAGGATAGCACACTGGCGTTTACCTTTGTGTAGAGCAATAGGAAGATCATCGCTCGTTCCGGATCCAATGACGAATAAGCCTCTAGTATAGGGCATGTGACCGAAGGAGGTTCCGGGAAATAACTTGAGTAATAGATATGAATGCATGCCAAATGGGTTCGAGTATGAATAGAAGATGACTTTTAAGACTAAAGAAGGCTTATTTAAATGGATGGTGATCCCATTTGGGCTATAGATACCCTTAGCACTTCATGCGCCTAATGAACTATGTTTTGCAAAGTGATTGATTGCATCCCATAAAGTGACTACATCTCAAAGTAGGAGAGACTTGGGAAGAAAAAATAATTAAGAATTACAAGCATGCCTGTCACTTGAAATGGAAAAAGAAGAGGGGTATTGTTCTAAGTTCCTTTATACTTGAAGAAGATGACAGGTACGCTTAAAGATGTTTTATATTCCATTCTTTGTTAGAGCAACAAAGATAATGGAGATTTAACTGATTTTAGTACTTTTTAAAATTAAAAGTTGATTGTCTAGAGCATTTCTAGTGCGCTTTATCTGAGACATTACCCATTTTCCTTTGTTAGAATGAAGTTCTTGAAGGTGTACTAATATTAGTTCTACTTTTTTCTGCACTTAAATGCAGAAGTTGACCTTCTGTTTACTCTATGAGATTGGTAAAGGAAGCAGTTCTTGGTGGTTCACTTACTTCAAGCACTTGCCCCACAGTTATGACATGCTGGCAACTTTTGGAGAATTTGAAAAGCAAGCGCTGCAGGTTCAGTTTTCCTTTGTGACATTTTTATTTACTTCTTTAGATTATATTCATAGTCATGAATATATCCAATAAGAACTTTTTACTTGTGTTTTTGACATTCTTTGATAATAAAATAGTGAAAGTCATTAACAGGTGGATTACGCCATCTGGGCAACAGAAAAGGCTGCTTTGAAGTCTCATACAGAGTGGGTAGGAGTTAAAGGACTAATGGAAGAATCTAATAGTAAAAGCCAAATCAAAACATTCAAGGCATGGCTTTGGGCCTCTGCAACTGTAAGCACACTGGATTCATGCTAGAAGAATGTCCTACACACTTGGTTAGATAAGACTTGGATTTATACAAAGTAGTGAAACCTACCATGCCATTTTCTTATTACGTTTCAAATCTTTTTTCCATTTTTATTTATGTTTGGTGAAAAATTTACGCACCATCTAAAGATGGTTTAGCTGGAAGTCTAGTACGGGTAAATTATGCAAGAAAACTATCCAAAGTCAGTGCAAATATAATTTTGGTTTTGAAAGATAAAACCACTGAGTGACACCCATTTGCTATAATCTATCCTTCCATTACCCATGACCGCCTATTCATCATTTTCTGGTGTTTCTTACCTTCATTTCTTTTTTAATATTTCAAGGAAGATGGTAGAATACAATTTTGTATTTCATTTGAATCATTTCATTACAAAACGCACAAAAAAAGCTACTATAATTACAAAAGTTCCCATGTGAGGCACTTTAAATTGAGGCCATCCCGTATACATATAGCCAAAAGATCAAAATTTTTGTATCCATCACTAGAAATCGTATTGTTTCTTTCTAACCACAAGGACCAGAGATTCTCTCTCTCCTGTCCATCACTAGAAATCGTATTAAGTTGACGGTGCCTAAAGGGGAAGCAGAAGGCTCTCTAAGATTGGAAAGGATAACTCTTTCTCATCTCTTGGATTCAAAAATGTTTTCAGACCTGCTCCTCTTTCCTCTTAACCATTTTTTTTTTCCGAGAACTCTGAAGTGAGGATTATGTGGCTTGCATTGAGTAGATTATGTGGCTTGCATTGAGTAAA
CCTCCAACGGGAAAGACATCTGGGAGGTCCATTGTAGTCACTGTTTTAAAAGCGCGCTTTCCAGAAAAGAAAAGCGAGGTAAGGACGCCTCGCTTCAAAGAAGCGAGGTGTCCCTAATAAGGCGCCTTGAGGCATGCATCTCTCTACATGTTGGCGTGTGCCTCTGCTCCTTTATTCCTTTTTTATTTATTTTTTGTTTTTTAATTCTTTTTATAAATAAATCCTTTAATACTTTAAAAAAATGTTAATTTTTCTCATTATCTTCTAAAAATTGTAATTTTTTTAAAATTTTCATTATCTTATCATTTATATTACTATATTTTTTCATATATAAAAATTAATTTACATTTTCTATTGTGTGCCTCACATAAAATAGCCCTTGCTTTTTTTTGTGCCTCGTGCTTAAGCTCCGGAAGACTATTGCGCTTTAGTGCGCTTCACGCTTTTAAAAACACTGATTGTAGTGAGAAGAAGATTCTTCCATGACGATCGGTATACAGTTTCAATTATAATATGGAGGTGGAAACATTTTTAGAAAAGGTGAGTTTGTTTTTACTCCCCTAGCAAAAGCACCTCCCCCCTCACCTTGACTTTACTCCCCTGAAGATTGCCCCTCCAGCCCCTGCTTGCTTCAAGATGAGGAATCTGTTGCTAAACTCCCAATAGTTACGAAAAGAAACTCTTATCGTTCTTGAATAGAATTTAGGATTCACATCCCTTCAAATTCCCCATAGCGTTGTGTTGCCATAGAATTTTGCTACCATCCTCTATTTTTTATGGATGCCAATTGTAGAGTGCTTGTTTTGTAATCTTCTTCAGCCCTTCTTTTGGAGGATATATACCATCCTCACTGCTGTCTTTTTTCTTTTTAATATGAATCAATTATTCGTTTCTTATGGAAAAACAAGTGTTACATAATTTGTCTTATATTGTATGATGAATCTAAATTTGACAACATTTTGGTCCTTCGTAAATTTCTTCAATCTTAATCTCAGTTTCTTTCCCTTCTTTCTACATTTAGTAGTTAAAAACTAAAAATATATGTTACCCTTTGAGAGATATCATCTAGGGCATTGTATGTACCATGGGATGAGGCTGGATGTCTATGTCCAGTTGGTGACTTGTTTAACTATGCTGCACCTGAGGGGGAATCCCTTGATATTAGGGATGTTTCATCTTTTTCACTACATGCTTCTTGAGCGGAAACATGACTACTGATGAGTTACACGAAGAGCAACGAGATACTCAGTGGGCTTTGACCGATGGCGGATTTGAGGAAAATGTTTCTGCATACTGCTTCTATGCAAGGGAAAGTTATAAGAAGGGACAGCAGGTATTTTCGGCTTTCTTGGTGTCATTTAATATTTTGCACTCGTATGTAAGTGGGGTTGTTGCGAAATACATTAATTGGAGGCTTAACATTTAATTAGTTGGTTGATTCCCAAGATGAAGAAGGTTGAAATTTACATTAACTTTTTTATTTGTTCTCTGATGCATTCTTGTTTATCTATAACTTGAGATGTTCATAAATGAAACTAGATATTGTTTTCTTTCCGTTTCAGATTGTTATTACACTGATTTGATTTGCATTTTTCAGCACTTCCTTTAAAAAAAACTCTTTTTTTAGGTTTTTTTTTTTTCAAATGTATGTTATCCTCATTATAGTTCCTAGGTAGTTTTTTTTTGGGTCTTATCCATCTTTCCAATTCTTTGTATTACTAACAATAACATAACCTGTTATGTGAGATCCCAAACGCTACTCTATACCTCATCATTAGGTTTACCCATCTTTGGGATCTTGTGGATAGGTGTGCTTTCCAACAGAAAAAAATTTGATTTGATGTTAGGTGGATAGCAATATTTTTAAAGTGCAAGGTGCCCTCGAAGTGACTCGGCTTTTGATTGCTTGGAGGCGAGTGATGCAAAAAAGTAATTCCTGAGTGAAGCAAGGCATACTTAATAAATTAATTAATAATATTATCTATAGCAATAATTTGTAA
CACGATAAATACATATTTAAGACAGGTACATCAATAAAACCATTTAATGTCAAAACAAGAGTTTAAGTTGTTAATAAAGTAAAAAGATGAATATAAAACATAATTTAGATTAAATTACAAATTTAGTCTCCGAACAATTTTGATGGTTGAAATCTTTCTGTTGTTTGCTGCAGGTTCTTTTAAGCTATGGTACATACACAAACTTAGAGCTTCTTGAATATTATGGGTTTCTTCTACAGGAAAATCCAAATGACAAGTTTTTATTCCTTTGGAACATGTCATTTATACTACCAGTTCTTGGCCCAAGGAGTCTCTTTATATTCATCAAAATGGAAATCCATCTTTTGCTCTACTTTCAGCTTTGCGATTATGGGCAACCCGCCCGAACAAGCGTAGAGGCGTTGGACACCTTGCTTATGCCGGTCTCAACTCTCCGTCGAAAATGAAAGATTTGTCATGCAGTGGTTATCAAAGAACTGTCATGCTGTTTTAAACAATCTGCCAACGTCTATTGAAGAAGATAATCAGCTTCTGTGCAACATCTGCAAAATCCAGGATCTGCAGATACCAAGGGAGCTTGGGAAGATACTATCGACTTATGGAGGCGAGTTTTCTGCTTTCTTAGAGACCAATGGTCTGATGAATGGAGATGAAGCGAGTTACATTAACTGGGAAAACTAAACATTCTCTGGAGAGATGGAAGTTAGCAGTCCAGTGGAGGCTCTTGTATAAGAAGGCTTTGGTTGGTTGTATAAGTTACTGCACCAGAACTATTTGTTCTTTATCTTCTTAATCTGATCCAGCTTGTGACTATTAGGTTGTAAGTTAACTTATTATCTATTAAATGAACTTTATAGGAATTGAAAGATTAGAATGCTATGAAGATTGTAGCAAATGACCTCTTGCTTCTGTTTCGCTTTACACCCTTGATTTGTCAAAATGTGCAGGTGCAGAAAGCAATTTCCTATCTTCTAGAATGGGCTATGATGTTTTGGCTCTGTTAGCATTTTAGTTCTTTGATTATTTTGTATCGATTTAGTTCGTATTTTAAAAAATGTAATAATTGAGATCTTGATGTGAAAATCATATTGTAATTGAGCATACAAATGAGCAGAGAATTGAACCATCGTTTAATTTGGATTGTAGATCTGTTTTTCGGCTTCATTTACTAGCTCCCACATTTCATGTATAAATTGGTTTTTTTTTTTTTGGTTAAAATTTAAATTCAAGGACTTAAAGGTTATATTTTTAAAATACGGACTAAATTGCCAGTACAAGGATCAAATTGCATTCTCATAATCTCTCATAATCTCTGCTATGCGACCCATCTCCCCTTGTGAAATATGGAGCTTATGAACTTTCTTTTTCTTTTTTTTAATTTTTTTTCTCGAGGAGATATGCAACGAATAATGCGAAGTCATATTACTGTTGCATAATCAAGGAAAAAATTAACATTCAATTTATAAAGATTAAATACAGAATGAGTCGTCAAAATAAAATTAGTAAAGAATAAAAGGCAAGTACTAGACTAGAAAAGGTGAATTAGATACGTCCTTGAACTTTACATTCTCCTTCTCGATTGAGAATTTTTCTCTTTTTGCCTCTGTAGCTGCCTCCCTTTATTATTATTATTTTTTTATTCCCTTCCCCCCTTATTTTGCCAACACCCTTTAAAATTTACTTTAACCTATTTTTCTTAAATGTAGCATTGGTATGGCGATACTGAAGAAAAGACTACTAGATTTCAGTTGAGTGATGCAACATTGAGCTGAGTGCTTAATTCTCTAAATTTTACAAATGGTCATTGAATTGTCAACATTTGTATCTTTTCTTTCATTCTTATTTGTTTTTCCTTCCTACAAAACACACAAAAGAAGCATAATCTACACTAAAAATGTGTAAGACTTGCAATGACGTGCTTACAAATATCTCTAACCTCAAAAGGAAGATGAACGCCAATAACTAAAAAGAAAATTTGGTAGGATTAAACTCAGGTCTCA
ACATAAGGGTATCAAACTCAAAGGAAACAATTCTTATACTTTCAAACCTAATGAATTTTCATAAACATTTATGTTTGTCTTTATTGTACTTACTAAAAACCTTAGGCTATCTATTTTCTAATTCAATAACTAAATATACATAGCTTTGGAACACAAGCTCGTGTGGTATTATGCAAAGCAAAACAAAAATGAATTTTTATGTAAAGGACAGGAATCAAAACAAAACAATTGTACAATGACTGAACTGGAATAACTATGAAAATACATGAACTAAAATAAAATTTCAATCATTTGACAATAATATAAAATTTAAATCTTATAGTATCTACACAAAAGAGAGCATATTGAGAAGACTAATGAATGTTTCTATATCATATTTCGTAGACCCATTTATAGAACCTTACCACCTCATTATGGATCTCCATTTCCTATCCTCTTCCCACATGTCCTATCTCTCTCATCTCTCACATATTTTCCTCACTCTTCCCCACTAATTTTTTTAAAGGAAAACATGCTTTCATTGAGATAAAATGAAAGAATATAGAAGGGCATACAAAAAACAGCCCAACAATAGGAACCGCCCAAACCTAGACTAGTTATACAAAAAAGGGCTCCAATTCAAAAGGACAAGGCCTAAAGGATAGTTACAAAAAACCTTAGAAACAGACACCCATCACCAACTTAATATACATAGGTTTGGAACACAAACTCATGAACTCCACCCCACCCCCCCCCCCCCATCACCAACTTGTGCAGAGCCAATACTCCTTCCATTTTCAAATCCCATATCCTCCAGTTCCCCTCCTTTCTTCTGCTGTTTCAGCGGCCATCCATTGCCTCTACTCTTACCAGTACCACCAGGCTTCAACCCTTCAACCCTTTTAGACTTCTCTCTTTCAATGCACCCTTTTGGCAACTCCTTTTCACCTCCATTCGCTGAAACTTCAGTCACTAATGGTGCAAACACCCTCTCAATCTCAAGTCTTATACAGTCGTACAATTTGAAGAGTTTCAAATTTGAGCTGATCAAGGAATCAATGGTGTCCCATGAAATGATCCACAGGTAGCATGATTATATGGAATGAACAAGATTGGAGAGTTTCCATCTTGCATTCATAAATGGGAAAGAGTTGTTGAGGAATAGAAATGGTTTATAAATGGAAGAGCGAGGCTATGATGTTTCAAGGCACTGTATTTGTCAATTCTTTATAAAAAGTTAATAATAAAATGGCAGGTGCCGTGAAGCGAGACACCCTCCTCGTGCTTTTTGGGAAGCACACCTAAACGTGCTTCTCAAAACACTGATGTTATAAACCATTATGAAAATGTACGTAGCAAAATCCATTCAGCTGGACTTCAAAAAAAGCTTCCTATAATAGATAGATCTTCAACAGCATAAATCTATGGAAAAATATTAGGCTGCAATAGTATTTTGGTTCTGTACTTTGAGCTTTTGTTCATTTTGGTCATTGTATTTTCAAAACGTTGATTTTGGTCCCTATACTTTCAACTTTTGTCCATTTTGGTCCTAAACTTTCAACTTTTGTCATTTTGGTTCCTATACGTTCAAAATATCCATTTTGTTCTTATACTTTTAAAAAGTAACTGTTTTGGTCCCTATTTGCAAAATTCCAACACAAATTCTATATATAATGGAAACCTTTATTACAAAGCTATGACAACATTTCAAGAAGTTTATTAAGTATATTGTTGTATTGAAATTATGATAAAAAAACAGATAAAAAAGAAAACAGTTACTTTTTAAAAGTACAAGACCAAAAGAACGTTTTGAAATTATAGAGACCAAAATAGACAAAACTGAAAGTACAGGGATCAAAATAGACGTTTTGAATACAGGATCAAAATGAACAAAAGCTTAAAGTACAAGGACAAAATAGGATTTAAACGAAAAAATTATATTTATCTGATTGAAGAATAAAAATGTCGCATCTTGTCTAGATAATACAAATCTTTAAGAATCAAAAACCAAATAACAAAAA
GATGGAGTGAGTCAAAACCACCATGGCTTTCGCTGGCTATATTTTTAAGTGATTTATCAAAGATTCCTAAAGAATTTATGAATGATGATCCAATTCCAAAGCTATACAATATAAATGTTGTGGCCTGTGATTTATGCAATGAGTTTCTATGTGTTATTACATTAACCAGAGGGAAAAAAAAAACAAAGAATAATGGAACATCTAATCTCATCAAATCAACTGAAAAATCTAAGTAAAAGGGCAGAATTCACCTCTTATCTTGAGAACTTAGTGTTAGAGAATCCTGAAACCAACTCTTCTACAAGCATCAATGGAACTCCTTCTTCACAGCTTCTGATGAGATTCAATAAGCAAAGCATTAAGCATAATATGAATGGCAGCTTCATATACAACACCATCGTAATGCATCCCATCCACTGTACAGCGAACCCCACAATTCCAACTCAAAGTCTCAATGTCCAGCAATAAGAGGGGGCCAGCGGATGAACGCAGAAGCTTGCTCTTGCGGAGCGATTCATCATAAGCTGCCCTCATTGTATCGGTCATCTTCTTCCTCTTCTCCTCAGTGTTTAACATGGAGTTTATCAGCGTTGGCATTCCGATCCAGAACAAGTGCGGGGTTCTAATGGAGACAGAGCCCGTTAGAGGGCCTTCAGAACCCAGCTCTGGAGTCAATGGAAGCAGTGAAACAACCGAAGTTCCGAGAGATTCTAAGGAAAATCGAAAGTCTGATGCATTGGTAAAATGAAGCATATGCCATAGCCCCGACCCCATGATTAGAACATCTGGGTAGCTACGATTTTGCTTGAATTCCGCCATTAAATCAGTCAGATTGGAAGCATAAGGAGCCCAAATGAAATCCAATTTCATCCCAGTTTCGCTGATCAGAATTTGATAATTGCTATGCCGCTTGAACAGATCGCCCCGGATGGCTTCCATTCGATCGGAATCCAACGTCAAATTCAGCAGTGAGAGAGTCATGAGTCGCGCCTGGGAGTCGCCGGCGACAACCACCCATGATCCGTTAAGCAAGTCGGCGGCGTCCATGGGAGCCAACTTCTGAAAGTCACAGGCGATAACAGGAGCAGATCTCCCCCATTTCCAGGATTTCTCATAATTAACAGAGTTGTTACTCAAAGGGTTAAATCCAGCGCTCTGCTTGACTTCCCAAACAACTTCATGTAACTGGGAAATGCACGAGGCTCGGCTGTCGAAGACGACGACGGAGGAGACGGTGGTGACAAAATCAGAAACTGAGGGAATGTACGGCGTGAGATGAACGCCGATAGCAAGGGTGATGAAGAGGGCGCCGCTGAAGAAGAGCATCTTGTTGCGGCTCAAATGCCATCCCGCCATTCCCATCGGCACAAACAGCACCACGCACGCAGCTAAAACTCCCAATTGAACACCGCCTAACATAATAACTCTCAAAATTCTAGCTTGCCCCTACAAAAATTGTCTCCAACAGAACCAAATGATCCAATTTCAGAGGAAGATTGAGATGAAAAAACTGGATTTGGGAGGTGGGTATTTCTGGGTTTGTTGTGTTCATTGAGAATATTTGAGTTCTTTGGGAGGGAGATCCATATAGGTCGAAGGATGGAGAGAGGTTGGTGGGAAGGTTTCTGGCTTGAAAAATAAGGAAGTTGTAAAGTTGGGAGAAATAAATATTACGTGCCGCCGGTTCAGTTCCGGGATCGGATGAAATGAAAGAGGGGTTCGGACATGAGATGTTGACGAAGTTTATTGAGAATGTTCGCCGGTGGCCGGAAGCCTGGGGCAAATTAGTAACGCCGGGCGGCTTTCTATACGTAGTCTTTAATACGAAGGGAAGTTAAAAGAATATACTTTTTTAATAAATTAAATATTTATTTTATTTTGTAGATATGAGTGTAACATAAATAATACGACCGATGGTTTTACGCAATTAAAATTTATGACAAGAAATCGAAAAATATAGCTTAAACATGCAGAAGGCAATTTTATAACCTGATTTTTTT
TAGTTCAAAAGGGGTGAGGTAGAAATTTGAACCGATTTTTAGGGAGATTATCGGTGCTTTAAGCTTTATTCAGTTTGACCATATAATCTAATCTCTACTTTTTAAGTTTAAATCTTTATCTTTATATTTGTGTTGTAATAATTTTTTAAAAATAAAACAGTCACATTTGATAATTTTTCCTAATTTTGAGGTTGTGTGGTCTAAGTTTTTTCTTTCAAATTTACCCTTTCTCTAATTTTTAATTTAGATATACTTTTGTAATTTCAAATTGGTCCTTAGCATTTACTAGTTTTAATTAAGATTATAATCCAAAATTAATAACTGATATTGAACGTGGCAATTATGTAGGGAATCGGGATAATAGAAAGTGGTCAGTTTTTTGTCGTATTTGAGTGCAATTATCTTAAATATTAATATGGGTCTTTTTCTTTTGTAATATATGTCGTATGGCTTGTAGAATAAATTGGTAAATTTTTGCAAAGTTGAGATGCTCAAGCCCTCAATCCCTCGTGTTCCAATGATTTTAAGCACTCTATCTAATTCAACTTGACAATCAAATTTAGACAGACTCTTCCTTTTCTTAGTTTAGCCTATTCAAAACCCATCTAGTCATGTCTACTATAAATCCAGTCTCAAATATCATCAACTATTCATCACAATCAATATTCCACGACTATAACCTTTGGACATCAAGTATCATGTGACCATCTTCGCCTGCCCCAACTCACTTACTATATAAGGATTAAGGAGTTTTGAATATCAATAAAGAAAAACTAAAGTATACATATTTTTTATCCTGACTCTCTTAGAATTCTCTTCTCACTCCCATCTTTAGAGATAACTTGACAGTGTCTAAGATTCCATAGTCATTCTTTTTGTTTGCAAATATACCCAATAAAGAATTTCTTACCTCTACAAAAAATAGGTTTATTATCTAAATTTTTTATTTGTTTTATTCATTGTGGTTTTCTTTTAATTTTGCAGGAAAGGAGGAAGTACGGATTGTGAGTATTGGTAAGGGAGATAAGCTAGTGACTATATCTAATGGACACTTGAATACTTCAAATGTTAACAAAAAAAATTTAAAACGTTGAGCTTTAGATTACACAATATTAGACTTTGAGTTTTATATATGAAAATTGGAGGATTTGTGCCTTGTATATGTAATATTTTGGAGGAGATTGATGCTTAGCTCGAAATGTTCGTCGACATAATTAAGTAATGACAATTAATATTTCAAAAAATTCTTAAATCTTAATGAGAGCTCCTTGTTGTTTAAGGGTAGTTTGTTTGGTAGGATTTAAGATGTCTATATTATGGATGTCCAGATTCCATGTTTGGATTATGGTTATCTCAGGATAGTCGGATATCTTAAAGATGTCCTAAAAAGCTATCTTTTAAGCTATGATTCAAAATGGTTTCGTGAAAGGAAACGCAACGAATGTTGTTGGTTCGTGGCCTGAGTCATGGAGGTGAAGCAGCAACAGAAATTTGAAAAATCGAAGGTGGAACGAATGCTAATTGCAGACTATGCCAATTTGCCTCCCCTCCCCCTCTAGATTTCCGTCTTTCTCCCCCATTCCTTCCGCAACTGGTTTTTCTCAAGAATTTGACGAGATGAGTAGATTCTGGCAATGGACTTTTCGGAGTTTCTGTTTATTCGTTTGAAGATCAAGAAAACTGACCTACAATTTTTCTTCATTTCCTGCTAATTGAAGATATAGCAATATAACTTACCACGCGGTGCGCTCAAATCAGTGTTGCTCCGCCGCAATCGAAGAGATCGACGCCCCTGTTCTACCGTCTGATTCGTCTTCCGATGCTCGACAACTCACATTCATCTATGATCGTCGGTACACTCCGTGAAATCTGCGTCATTCCGGCGTACCTGTCGCCGACGGCACCGAGCAATTGGAGATACTCGACGACGAACACCATATTTATAGCTTTGGCGTGGGAAGTCAGTGGGAGAGAAGAAAAGGAGGGAAATCCGGGGAAGA
TTGGAGCTTTTAAAGAAAAAAAAAATTTTTTTTTTTCTTTTTCTTTTTTTAAAGCTTTTTAA

mRNA sequence

ATGGAAACAGAAGGAAGTCTTGAAAGCCTGCTGAGATGGGCGGCGGATCATGGAATTTCAGATTCTGTCGAGAAACAGAGTTCACATTCTTGTCTAGGTCGTTCATTATGTGTCTCTTTCTTCCCTGATGCTGGCGGGAGAGGTTTGGGGGCTGTTCGTAATCTTAACAAAGGAGATTTAGTACTAAGAGTTCCAAAATCTGTCTTGTTGACGACCCAGAGTTTGTTGTTGGAAGATGAGAAGCTCTCCATGGCTCTGAAGAGATACCCATCTCTTTCTTCAACTCAGAAGTTGACCTTCTGTTTACTCTATGAGATTGGTAAAGGAAGCAGTTCTTGGTGGTTCACTTACTTCAAGCACTTGCCCCACAGTTATGACATGCTGGCAACTTTTGGAGAATTTGAAAAGCAAGCGCTGCAGGTGGATTACGCCATCTGGGCAACAGAAAAGGCTGCTTTGAAGTCTCATACAGAGTGGGTAGGAGTTAAAGGACTAATGGAAGAATCTAATAGTAAAAGCCAAATCAAAACATTCAAGGCATGGCTTTGGGCCTCTGCAACTGGATGTTTCATCTTTTTCACTACATGCTTCTTGAGCGGAAACATGACTACTGATGAGTTACACGAAGAGCAACGAGATACTCAGTGGGCTTTGACCGATGGCGGATTTGAGGAAAATGTTTCTGCATACTGCTTCTATGCAAGGGAAAGTTATAAGAAGGGACAGCAGGTTCTTTTAAGCTATGGTACATACACAAACTTAGAGCTTCTTGAATATTATGGGTTTCTTCTACAGGAAAATCCAAATGACAAGTTTTTATTCCTTTGGAACATGTCATTTATACTACCAGTTCTTGGCCCAAGGAGTCTCTTTATATTCATCAAAATGGAAATCCATCTTTTGCTCTACTTTCAGCTTTGCGATTATGGGCAACCCGCCCGAACAAGCGTAGAGGCGTTGGACACCTTGCTTATGCCGGATCTGCAGATACCAAGGGAGCTTGGGAAGATACTATCGACTTATGGAGGCGAGTTTTCTGCTTTCTTAGAGACCAATGGTGCAGAAAGCAATTTCCTATCTTCTAGAATGGGCTATGATGTTTTGGCTCTCGGCCATCCATTGCCTCTACTCTTACCAGTACCACCAGGCTTCAACCCTTCAACCCTTTTAGACTTCTCTCTTTCAATGCACCCTTTTGGCAACTCCTTTTCACCTCCATTCGCTGAAACTTCAGTCACTAATGGTGCAAACACCCTCTCAATCTCAAGTCTTATACAGTCGTACAATTTGAAGAGTTTCAAATTTGAGCTGATCAAGGAATCAATGGTGTCCCATGAAATGATCCACAGTAAAAGGGCAGAATTCACCTCTTATCTTGAGAACTTAGTGTTAGAGAATCCTGAAACCAACTCTTCTACAAGCATCAATGGAACTCCTTCTTCACAGCTTCTGATGAGATTCAATAAGCAAAGCATTAAGCATAATATGAATGGCAGCTTCATATACAACACCATCGTAATGCATCCCATCCACTGTACAGCGAACCCCACAATTCCAACTCAAACTGCCCTCATTGTATCGGTCATCTTCTTCCTCTTCTCCTCAGTGTTTAACATGGAGTTTATCAGCGTTGGCATTCCGATCCAGAACAAGTGCGGGGTTCTAATGGAGACAGAGCCCGTTAGAGGGCCTTCAGAACCCAGCTCTGGAGTCAATGGAAGCAGTGAAACAACCGAAGTTCCGAGAGATTCTAAGGAAAATCGAAAGTCTGATGCATTGATTGGAAGCATAAGGAGCCCAAATGAAATCCAATTTCATCCCAGTTTCGCTGATCAGAATTTGATAATTGCTATGCCGCTTGAACAGATCGCCCCGGATGGCTTCCATTCGATCGGAATCCAACGTCAAATTCAGCATGTTGCTCCGCCGCAATCGAAGAGATCGACGCCCCTGTTCTACCGTCTGATTCGTCTTCCGATGCTCGACAACTCACATTCATC
TATGATCGTCGGTACACTCCGTGAAATCTGCGTCATTCCGGCGTACCTGTCGCCGACGGCACCGAGCAATTGGAGATACTCGACGACGAACACCATATTTATAGCTTTGGCGTGGGAAGTCAGTGGGAGAGAAGAAAAGGAGGGAAATCCGGGGAAGATTGGAGCTTTTAAAGAAAAAAAAAATTTTTTTTTTTCTTTTTCTTTTTTTAAAGCTTTTTAA

Coding sequence (CDS)

ATGGAAACAGAAGGAAGTCTTGAAAGCCTGCTGAGATGGGCGGCGGATCATGGAATTTCAGATTCTGTCGAGAAACAGAGTTCACATTCTTGTCTAGGTCGTTCATTATGTGTCTCTTTCTTCCCTGATGCTGGCGGGAGAGGTTTGGGGGCTGTTCGTAATCTTAACAAAGGAGATTTAGTACTAAGAGTTCCAAAATCTGTCTTGTTGACGACCCAGAGTTTGTTGTTGGAAGATGAGAAGCTCTCCATGGCTCTGAAGAGATACCCATCTCTTTCTTCAACTCAGAAGTTGACCTTCTGTTTACTCTATGAGATTGGTAAAGGAAGCAGTTCTTGGTGGTTCACTTACTTCAAGCACTTGCCCCACAGTTATGACATGCTGGCAACTTTTGGAGAATTTGAAAAGCAAGCGCTGCAGGTGGATTACGCCATCTGGGCAACAGAAAAGGCTGCTTTGAAGTCTCATACAGAGTGGGTAGGAGTTAAAGGACTAATGGAAGAATCTAATAGTAAAAGCCAAATCAAAACATTCAAGGCATGGCTTTGGGCCTCTGCAACTGGATGTTTCATCTTTTTCACTACATGCTTCTTGAGCGGAAACATGACTACTGATGAGTTACACGAAGAGCAACGAGATACTCAGTGGGCTTTGACCGATGGCGGATTTGAGGAAAATGTTTCTGCATACTGCTTCTATGCAAGGGAAAGTTATAAGAAGGGACAGCAGGTTCTTTTAAGCTATGGTACATACACAAACTTAGAGCTTCTTGAATATTATGGGTTTCTTCTACAGGAAAATCCAAATGACAAGTTTTTATTCCTTTGGAACATGTCATTTATACTACCAGTTCTTGGCCCAAGGAGTCTCTTTATATTCATCAAAATGGAAATCCATCTTTTGCTCTACTTTCAGCTTTGCGATTATGGGCAACCCGCCCGAACAAGCGTAGAGGCGTTGGACACCTTGCTTATGCCGGATCTGCAGATACCAAGGGAGCTTGGGAAGATACTATCGACTTATGGAGGCGAGTTTTCTGCTTTCTTAGAGACCAATGGTGCAGAAAGCAATTTCCTATCTTCTAGAATGGGCTATGATGTTTTGGCTCTCGGCCATCCATTGCCTCTACTCTTACCAGTACCACCAGGCTTCAACCCTTCAACCCTTTTAGACTTCTCTCTTTCAATGCACCCTTTTGGCAACTCCTTTTCACCTCCATTCGCTGAAACTTCAGTCACTAATGGTGCAAACACCCTCTCAATCTCAAGTCTTATACAGTCGTACAATTTGAAGAGTTTCAAATTTGAGCTGATCAAGGAATCAATGGTGTCCCATGAAATGATCCACAGTAAAAGGGCAGAATTCACCTCTTATCTTGAGAACTTAGTGTTAGAGAATCCTGAAACCAACTCTTCTACAAGCATCAATGGAACTCCTTCTTCACAGCTTCTGATGAGATTCAATAAGCAAAGCATTAAGCATAATATGAATGGCAGCTTCATATACAACACCATCGTAATGCATCCCATCCACTGTACAGCGAACCCCACAATTCCAACTCAAACTGCCCTCATTGTATCGGTCATCTTCTTCCTCTTCTCCTCAGTGTTTAACATGGAGTTTATCAGCGTTGGCATTCCGATCCAGAACAAGTGCGGGGTTCTAATGGAGACAGAGCCCGTTAGAGGGCCTTCAGAACCCAGCTCTGGAGTCAATGGAAGCAGTGAAACAACCGAAGTTCCGAGAGATTCTAAGGAAAATCGAAAGTCTGATGCATTGATTGGAAGCATAAGGAGCCCAAATGAAATCCAATTTCATCCCAGTTTCGCTGATCAGAATTTGATAATTGCTATGCCGCTTGAACAGATCGCCCCGGATGGCTTCCATTCGATCGGAATCCAACGTCAAATTCAGCATGTTGCTCCGCCGCAATCGAAGAGATCGACGCCCCTGTTCTACCGTCTGATTCGTCTTCCGATGCTCGACAACTCACATTCATC
TATGATCGTCGGTACACTCCGTGAAATCTGCGTCATTCCGGCGTACCTGTCGCCGACGGCACCGAGCAATTGGAGATACTCGACGACGAACACCATATTTATAGCTTTGGCGTGGGAAGTCAGTGGGAGAGAAGAAAAGGAGGGAAATCCGGGGAAGATTGGAGCTTTTAAAGAAAAAAAAAATTTTTTTTTTTCTTTTTCTTTTTTTAAAGCTTTTTAA

Protein sequence

METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASATGCFIFFTTCFLSGNMTTDELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLLQENPNDKFLFLWNMSFILPVLGPRSLFIFIKMEIHLLLYFQLCDYGQPARTSVEALDTLLMPDLQIPRELGKILSTYGGEFSAFLETNGAESNFLSSRMGYDVLALGHPLPLLLPVPPGFNPSTLLDFSLSMHPFGNSFSPPFAETSVTNGANTLSISSLIQSYNLKSFKFELIKESMVSHEMIHSKRAEFTSYLENLVLENPETNSSTSINGTPSSQLLMRFNKQSIKHNMNGSFIYNTIVMHPIHCTANPTIPTQTALIVSVIFFLFSSVFNMEFISVGIPIQNKCGVLMETEPVRGPSEPSSGVNGSSETTEVPRDSKENRKSDALIGSIRSPNEIQFHPSFADQNLIIAMPLEQIAPDGFHSIGIQRQIQHVAPPQSKRSTPLFYRLIRLPMLDNSHSSMIVGTLREICVIPAYLSPTAPSNWRYSTTNTIFIALAWEVSGREEKEGNPGKIGAFKEKKNFFFSFSFFKAF
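
The protein sequence corresponds to the CDS read codon by codon. The sketch below shows that relationship with a small translator. It assumes the standard nuclear genetic code, and the names `CODON_TABLE` and `translate` are illustrative rather than part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, to_stop=True):
    """Translate a CDS string in frame 0; unknown codons become 'X'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon ends the polypeptide
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGGAAACAGAAGGAAGT"))  # METEGS
```

Passing the full CDS, with its line breaks removed, should give a sequence that can be compared against the protein listed above. Any mismatch would point to a frame or annotation issue rather than to the translator itself.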
Homology
BLAST of Sgr021134 vs. NCBI nr
Match: XP_022143354.1 (protein SET DOMAIN GROUP 40 isoform X1 [Momordica charantia])

HSP 1 Score: 469.5 bits (1207), Expect = 5.2e-128
Identity = 269/442 (60.86%), Positives = 296/442 (66.97%), Query Frame = 0

Query: 1   METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDL 60
           METEGS E+LLRWAAD GISDSV+K+SS+SCLGRSLCVS FPDAGGRGLGAVRNLN G+L
Sbjct: 1   METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGEL 60

Query: 61  VLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFTYFKH 120
           VLRVPKSVL TTQSLLLE+EKLSMALKRYPSLSSTQKLTFCLLYEIG+GS+SWWF YFKH
Sbjct: 61  VLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTFCLLYEIGEGSNSWWFPYFKH 120

Query: 121 LPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKA 180
           LP SYD+LATFGEFEKQALQVDYAIWATEKAALKSHTEW  VKGLME+SN K Q++TFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKA 180

Query: 181 WLWASAT-------------GCFI-----------------------FFTTCFLSGNMTT 240
           WLWASAT             GC                         F       G+MTT
Sbjct: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTT 240

Query: 241 DELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLL 300
           DELHEEQ DTQ ALTDGGF+E VSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGF+L
Sbjct: 241 DELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFIL 300

Query: 301 QENPNDKF----------------------------------LFLW-----------NMS 354
           QENPND+                                   L LW           +++
Sbjct: 301 QENPNDRVFIPLEHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLA 360
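
Each HSP summary line reports identical and positive-scoring positions as a count over the alignment length, followed by a percentage. As a rough sketch, the line can be parsed and the percentages recomputed from the counts. The helper name `parse_hsp_stats` is hypothetical, and the pattern matches the second label loosely so that spelling variants of "Positives" are accepted.

```python
import re

# Matches "Identity = 269/442 (60.86%), <Pos...> = 296/442 (66.97%)".
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Pos\w*\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts plus recomputed and reported percentages for one HSP."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        # Percentages recomputed from the raw counts.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages as printed in the report, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats("Identity = 269/442 (60.86%), Positives = 296/442 (66.97%)")
print(stats["identity_pct"], stats["positives_pct"])
```

For the first hit above, 269/442 and 296/442 recompute to 60.86% and 66.97%, in agreement with the reported values.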

BLAST of Sgr021134 vs. NCBI nr
Match: XP_038896047.1 (protein SET DOMAIN GROUP 40 [Benincasa hispida])

HSP 1 Score: 465.3 bits (1196), Expect = 9.7e-127
Identity = 266/434 (61.29%), Positives = 293/434 (67.51%), Query Frame = 0

Query: 1   METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDL 60
           M TE S  SLLRWAADHGISDSV++Q+SHSCLG SLCV FFPDAGGRGLGAVR LNKG+L
Sbjct: 1   MGTEESFGSLLRWAADHGISDSVDQQTSHSCLGDSLCVCFFPDAGGRGLGAVRQLNKGEL 60

Query: 61  VLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFTYFKH 120
           VLRVPKSVL TTQSL LEDEKL+ ALKRYPSLSSTQKLTFCLLYEIGKG+SSWW  Y KH
Sbjct: 61  VLRVPKSVLFTTQSLSLEDEKLARALKRYPSLSSTQKLTFCLLYEIGKGTSSWWLPYLKH 120

Query: 121 LPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKA 180
           LP SYD+LATFGEFEKQALQVDY IWATEKAALKS  EW GVKGLMEE N K+Q++TFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYGIWATEKAALKSLMEWRGVKGLMEEFNIKNQLQTFKA 180

Query: 181 WLWASAT-------------GCF-----------------------IFFTTCFLSGNMTT 240
           WLWASAT             GC                         F     L+G++TT
Sbjct: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESIDGTDVSFFSPHASLNGDITT 240

Query: 241 DELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLL 300
           DELHEEQRDTQWALTDGGFEE+VSAYCFYARESYKKG+QVLLSYGTY+NLELLEYYGFLL
Sbjct: 241 DELHEEQRDTQWALTDGGFEEDVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLL 300

Query: 301 QENPNDKFLF-----LWNM------SFILPVLGPRSLFIFIKMEI------------HL- 354
           QENPNDK        ++N       S  +   G  S  +   + +            HL 
Sbjct: 301 QENPNDKVFIPLEHDIYNSSSWPKESLYIHQNGNPSFSLLSALRLWATHPNKRRGVGHLA 360

BLAST of Sgr021134 vs. NCBI nr
Match: XP_022983189.1 (protein SET DOMAIN GROUP 40 isoform X1 [Cucurbita maxima])

HSP 1 Score: 461.5 bits (1186), Expect = 1.4e-125
Identity = 262/434 (60.37%), Positives = 295/434 (67.97%), Query Frame = 0

Query: 1   METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDL 60
           M TEGS ESLLRWAADHGISDSV+KQSSHSCLGRSLCV FFPDAGGRGLGAVR+L KG+L
Sbjct: 1   MGTEGSFESLLRWAADHGISDSVDKQSSHSCLGRSLCVCFFPDAGGRGLGAVRHLTKGEL 60

Query: 61  VLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFTYFKH 120
           VL+VPKSVLLTTQSL L+DEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWF YFKH
Sbjct: 61  VLKVPKSVLLTTQSLSLQDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFPYFKH 120

Query: 121 LPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKA 180
           LP +Y+ LATFGEFEKQALQVDYA+W  EKAA KSHTEW GVKGLMEESN K+Q++TFKA
Sbjct: 121 LPTTYETLATFGEFEKQALQVDYALWEAEKAASKSHTEWRGVKGLMEESNIKNQLQTFKA 180

Query: 181 WLWASAT-------------GCFI-----------------------FFTTCFLSGNMTT 240
           WLWASAT             GC                         F     L+GN+TT
Sbjct: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDIMDVSSFSQHASLNGNITT 240

Query: 241 DELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLL 300
           D LH+E++DTQ ALTDGGFEENVSAYCFYARESYK+G+QVLLSYGTY+NLELL+YYGFLL
Sbjct: 241 DGLHKEEQDTQRALTDGGFEENVSAYCFYARESYKRGEQVLLSYGTYSNLELLQYYGFLL 300

Query: 301 QENPNDK-FLFLWNMSFILPVLGPRSLFIF------------------------------ 354
           QENPND+ F+ L +  +        SLFI                               
Sbjct: 301 QENPNDRVFIPLEHEIYSSSSWPKESLFIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360

BLAST of Sgr021134 vs. NCBI nr
Match: KAG6581196.1 (Protein SET DOMAIN GROUP 40, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 459.9 bits (1182), Expect = 4.1e-125
Identity = 261/434 (60.14%), Positives = 294/434 (67.74%), Query Frame = 0

Query: 1   METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDL 60
           M TE S ESLLRWAADHGISDSV+KQ SHSCLGRSLCV FFPDAGGRGLGAVR+L KG+L
Sbjct: 1   MGTEESFESLLRWAADHGISDSVDKQCSHSCLGRSLCVCFFPDAGGRGLGAVRHLTKGEL 60

Query: 61  VLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFTYFKH 120
           VL+VPKSVLLTTQSL L+DEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWF YFKH
Sbjct: 61  VLKVPKSVLLTTQSLSLQDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFPYFKH 120

Query: 121 LPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKA 180
           LP +Y+ LATFGEFEKQALQVDYA+W  EKAA KSH EW GVKGLMEES  K+Q++TFKA
Sbjct: 121 LPTTYETLATFGEFEKQALQVDYALWEAEKAASKSHAEWRGVKGLMEESIIKNQLQTFKA 180

Query: 181 WLWASAT-------------GCFI-----------------------FFTTCFLSGNMTT 240
           WLWASAT             GC                         F     L+GN+TT
Sbjct: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEAESFDIMDVSSFSQHASLNGNITT 240

Query: 241 DELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLL 300
           D LH+E++DTQ ALTDGGFEENVSAYCFYARESYK+G+QVLLSYGTYTNLELL+YYGFLL
Sbjct: 241 DGLHKEEQDTQRALTDGGFEENVSAYCFYARESYKRGEQVLLSYGTYTNLELLQYYGFLL 300

Query: 301 QENPNDK-FLFLWNMSFILPVLGPRSLFIF------------------------------ 354
           QENPND+ F+ L +  +        SLFI                               
Sbjct: 301 QENPNDRVFIPLEHDIYSSSSWPKESLFIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360

BLAST of Sgr021134 vs. NCBI nr
Match: KAG7017936.1 (Protein SET DOMAIN GROUP 40, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 456.8 bits (1174), Expect = 3.5e-124
Identity = 260/434 (59.91%), Positives = 293/434 (67.51%), Query Frame = 0

Query: 1   METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDL 60
           M TE S ESLLRWAADHGISDSV+KQ SHSCLGRSLCV FFPDAGGRGLGAVR+L KG+L
Sbjct: 1   MGTEESFESLLRWAADHGISDSVDKQCSHSCLGRSLCVCFFPDAGGRGLGAVRHLTKGEL 60

Query: 61  VLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFTYFKH 120
           VL+VPKSVLLTTQSL L+DEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWF YFKH
Sbjct: 61  VLKVPKSVLLTTQSLSLQDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFPYFKH 120

Query: 121 LPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKA 180
           LP +Y+ LATFGEFEKQALQVDYA+W  EKAA KS  EW GVKGLMEES  K+Q++TFKA
Sbjct: 121 LPTTYETLATFGEFEKQALQVDYALWEAEKAASKSRAEWRGVKGLMEESIIKNQLQTFKA 180

Query: 181 WLWASAT-------------GCFI-----------------------FFTTCFLSGNMTT 240
           WLWASAT             GC                         F     L+GN+TT
Sbjct: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEAESFDIMDVSSFSQHASLNGNITT 240

Query: 241 DELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLL 300
           D LH+E++DTQ ALTDGGFEENVSAYCFYARESYK+G+QVLLSYGTYTNLELL+YYGFLL
Sbjct: 241 DGLHKEEQDTQRALTDGGFEENVSAYCFYARESYKRGEQVLLSYGTYTNLELLQYYGFLL 300

Query: 301 QENPNDK-FLFLWNMSFILPVLGPRSLFIF------------------------------ 354
           QENPND+ F+ L +  +        SLFI                               
Sbjct: 301 QENPNDRVFIPLEHDIYSSSSWPKESLFIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360

BLAST of Sgr021134 vs. ExPASy Swiss-Prot
Match: Q6NQJ8 (Protein SET DOMAIN GROUP 40 OS=Arabidopsis thaliana OX=3702 GN=SDG40 PE=2 SV=1)

HSP 1 Score: 278.9 bits (712), Expect = 1.7e-73
Identity = 159/317 (50.16%), Positives = 204/317 (64.35%), Query Frame = 0

Query: 6   SLESLLRWAADHGISDSVE-KQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRV 65
           ++E+ LRWAA+ GISDS++  +   SCLG SL VS FPDAGGRGLGA R L KG+LVL+V
Sbjct: 7   TMETFLRWAAEIGISDSIDSSRFRDSCLGHSLSVSDFPDAGGRGLGAARELKKGELVLKV 66

Query: 66  PKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFTYFKHLPHS 125
           P+  L+TT+S++ +D KLS A+  + SLSSTQ L+ CLLYE+ K   S+W+ Y  H+P  
Sbjct: 67  PRKALMTTESIIAKDLKLSDAVNLHNSLSSTQILSVCLLYEMSKEKKSFWYPYLFHIPRD 126

Query: 126 YDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWA 185
           YD+LATFG FEKQALQV+ A+WATEKA  K  +EW     LM+E   K + ++F+AWLWA
Sbjct: 127 YDLLATFGNFEKQALQVEDAVWATEKATAKCQSEWKEAGSLMKELELKPKFRSFQAWLWA 186

Query: 186 SAT-------------GCFI----FFTTCFLSGNMTTDELHEEQRDTQWA---------- 245
           SAT             GC       F          T +  E   + + A          
Sbjct: 187 SATISSRTLHVPWDSAGCLCPVGDLFNYDAPGDYSNTPQGPESANNVEEAGLVVETHSER 246

Query: 246 LTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLLQENPNDK-FLFLW 293
           LTDGGFEE+V+AYC YAR +Y+ G+QVLL YGTYTNLELLE+YGF+L+EN NDK F+ L 
Sbjct: 247 LTDGGFEEDVNAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDKVFIPLE 306

BLAST of Sgr021134 vs. ExPASy Swiss-Prot
Match: B7ZUF3 (Actin-histidine N-methyltransferase OS=Xenopus tropicalis OX=8364 GN=setd3 PE=2 SV=1)

HSP 1 Score: 55.5 bits (132), Expect = 3.1e-06
Identity = 66/240 (27.50%), Positives = 102/240 (42.50%), Query Frame = 0

Query: 41  FPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTF 100
           FP+  G GL A R +   +L L VP+ +L+T +S          +  R         L F
Sbjct: 101 FPEE-GFGLKATREIKAEELFLWVPRKLLMTVESAKGSVLGPLYSQDRILQAMGNITLAF 160

Query: 101 CLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAI---WATEKAALKSHT 160
            LL E     +S+W  Y K LP+ YD    F E E Q LQ   AI   ++  K   + + 
Sbjct: 161 HLLCE-RADPNSFWLPYIKTLPNEYDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQYA 220

Query: 161 EWVGVKGLMEESNSKSQIK---TFKAWLWASATGCFIFFTTCFLSGNMTT---DELHEEQ 220
            +  V      +N K  +K   TF  + WA ++            G+  T     L +  
Sbjct: 221 YFYKVIQTHPNAN-KLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMC 280

Query: 221 RDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLLQENPNDK 272
             T   +T G   E+    C  A + +K G+Q+ + YGT +N E + + GF  + N +D+
Sbjct: 281 NHTNGLITTGYNLEDDRCEC-VALQDFKSGEQIYIFYGTRSNAEFVIHNGFFFENNLHDR 336

BLAST of Sgr021134 vs. ExPASy Swiss-Prot
Match: Q7SXS7 (Actin-histidine N-methyltransferase OS=Danio rerio OX=7955 GN=setd3 PE=2 SV=1)

HSP 1 Score: 53.5 bits (127), Expect = 1.2e-05
Identity = 67/277 (24.19%), Positives = 114/277 (41.16%), Query Frame = 0

Query: 4   EGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLR 63
           E     L+ WAA          +   SC G    +S F D  G GL A +++   +L L 
Sbjct: 76  EDFFSELMAWAA----------ECRASCDGFE--ISNFADE-GYGLKATKDIKAEELFLW 135

Query: 64  VPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGS-SSWWFTYFKHLP 123
           +P+ +L+T +S   ++  L     +   L +   +T  L     + + SS W  Y K LP
Sbjct: 136 IPRKMLMTVES--AKNSVLGPLYSQDRILQAMGNVTLALHLLCERANPSSPWLPYIKTLP 195

Query: 124 HSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIK-----T 183
             YD    F E E + L    AI         +  ++     ++    + S++      T
Sbjct: 196 SEYDTPLYFEEEEVRHLLATQAIQDVLSQYKNTARQYAYFYKVIHTHPNASKLPLKDAFT 255

Query: 184 FKAWLWASATGCFIFFTTCFLSGNMTT---DELHEEQRDTQWALTDGGFEENVSAYCFYA 243
           F  + WA ++            G+  T     L +    T   +T G   E+    C  A
Sbjct: 256 FDDYRWAVSSVMTRQNQIPTADGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCEC-VA 315

Query: 244 RESYKKGQQVLLSYGTYTNLELLEYYGFLLQENPNDK 272
            + YK+G+Q+ + YGT +N E + + GF  ++N +D+
Sbjct: 316 LKDYKEGEQIYIFYGTRSNAEFVIHNGFFFEDNAHDR 336

BLAST of Sgr021134 vs. ExPASy Swiss-Prot
Match: B5FW36 (Actin-histidine N-methyltransferase OS=Otolemur garnettii OX=30611 GN=SETD3 PE=3 SV=1)

HSP 1 Score: 50.8 bits (120), Expect = 7.5e-05
Identity = 70/277 (25.27%), Positives = 116/277 (41.88%), Query Frame = 0

Query: 4   EGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLR 63
           E     L++WA+++G   SVE             V+F  +  G GL A R++   +L L 
Sbjct: 76  ENYFPDLMKWASENGA--SVEGFE---------MVNFKEE--GFGLRATRDIKAEELFLW 135

Query: 64  VPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFTYFKHLPH 123
           VP+ +L+T +S          +  R         L F LL E     +S+W  Y + LP 
Sbjct: 136 VPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLCE-RASPNSFWQPYIQSLPS 195

Query: 124 SYDMLATFGEFEKQALQVDYAI---WATEKAALKSHTEWVGVKGLMEESNSKSQIK---T 183
            YD    F E E + LQ   AI   ++  K   + +  +  V      +N K  +K   T
Sbjct: 196 EYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHAN-KLPLKDSFT 255

Query: 184 FKAWLWASATGCFIFFTTCFLSGNMTT---DELHEEQRDTQWALTDGGFEENVSAYCFYA 243
           ++ + WA ++            G+  T     L +    T   +T G   E+    C  A
Sbjct: 256 YEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCEC-VA 315

Query: 244 RESYKKGQQVLLSYGTYTNLELLEYYGFLLQENPNDK 272
            + ++ G+Q+ + YGT +N E + + GF    N +D+
Sbjct: 316 LQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDR 336

BLAST of Sgr021134 vs. ExPASy Swiss-Prot
Match: Q5ZML9 (Actin-histidine N-methyltransferase OS=Gallus gallus OX=9031 GN=SETD3 PE=2 SV=1)

HSP 1 Score: 50.4 bits (119), Expect = 9.8e-05
Identity = 62/270 (22.96%), Positives = 108/270 (40.00%), Query Frame = 0

Query: 10  LLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVL 69
           L++WA ++G S    + ++              +  G GL A R +   +L L VP+ +L
Sbjct: 82  LIKWATENGASTEGFEIANF-------------EEEGFGLKATREIKAEELFLWVPRKLL 141

Query: 70  LTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFTYFKHLPHSYDMLA 129
           +T +S          +  R         L F LL E     +S+W  Y + LP  YD   
Sbjct: 142 MTVESAKNSVLGSLYSQDRILQAMGNITLAFHLLCE-RANPNSFWLPYIQTLPSEYDTPL 201

Query: 130 TFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIK-----TFKAWLWA 189
            F E E Q L+   AI         +  ++     +++   + S++      T+  + WA
Sbjct: 202 YFEEDEVQYLRSTQAIHDVFSQYKNTARQYAYFYKVIQTHPNASKLPLKDSFTYDDYRWA 261

Query: 190 SATGCFIFFTTCFLSGNMTT---DELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKG 249
            ++            G+  T     L +    T   +T G   E+    C  A + +K G
Sbjct: 262 VSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCEC-VALQDFKAG 321

Query: 250 QQVLLSYGTYTNLELLEYYGFLLQENPNDK 272
           +Q+ + YGT +N E + + GF    N +D+
Sbjct: 322 EQIYIFYGTRSNAEFVIHSGFFFDNNSHDR 336

BLAST of Sgr021134 vs. ExPASy TrEMBL
Match: A0A6J1CP24 (protein SET DOMAIN GROUP 40 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111013246 PE=4 SV=1)

HSP 1 Score: 469.5 bits (1207), Expect = 2.5e-128
Identity = 269/442 (60.86%), Positives = 296/442 (66.97%), Query Frame = 0

Query: 1   METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDL 60
           METEGS E+LLRWAAD GISDSV+K+SS+SCLGRSLCVS FPDAGGRGLGAVRNLN G+L
Sbjct: 1   METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGEL 60

Query: 61  VLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFTYFKH 120
           VLRVPKSVL TTQSLLLE+EKLSMALKRYPSLSSTQKLTFCLLYEIG+GS+SWWF YFKH
Sbjct: 61  VLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTFCLLYEIGEGSNSWWFPYFKH 120

Query: 121 LPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKA 180
           LP SYD+LATFGEFEKQALQVDYAIWATEKAALKSHTEW  VKGLME+SN K Q++TFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKA 180

Query: 181 WLWASAT-------------GCFI-----------------------FFTTCFLSGNMTT 240
           WLWASAT             GC                         F       G+MTT
Sbjct: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTT 240

Query: 241 DELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLL 300
           DELHEEQ DTQ ALTDGGF+E VSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGF+L
Sbjct: 241 DELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFIL 300

Query: 301 QENPNDKF----------------------------------LFLW-----------NMS 354
           QENPND+                                   L LW           +++
Sbjct: 301 QENPNDRVFIPLEHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLA 360

BLAST of Sgr021134 vs. ExPASy TrEMBL
Match: A0A6J1J6L6 (protein SET DOMAIN GROUP 40 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111481847 PE=4 SV=1)

HSP 1 Score: 461.5 bits (1186), Expect = 6.8e-126
Identity = 262/434 (60.37%), Positives = 295/434 (67.97%), Query Frame = 0

Query: 1   METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDL 60
           M TEGS ESLLRWAADHGISDSV+KQSSHSCLGRSLCV FFPDAGGRGLGAVR+L KG+L
Sbjct: 1   MGTEGSFESLLRWAADHGISDSVDKQSSHSCLGRSLCVCFFPDAGGRGLGAVRHLTKGEL 60

Query: 61  VLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFTYFKH 120
           VL+VPKSVLLTTQSL L+DEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWF YFKH
Sbjct: 61  VLKVPKSVLLTTQSLSLQDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFPYFKH 120

Query: 121 LPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKA 180
           LP +Y+ LATFGEFEKQALQVDYA+W  EKAA KSHTEW GVKGLMEESN K+Q++TFKA
Sbjct: 121 LPTTYETLATFGEFEKQALQVDYALWEAEKAASKSHTEWRGVKGLMEESNIKNQLQTFKA 180

Query: 181 WLWASAT-------------GCFI-----------------------FFTTCFLSGNMTT 240
           WLWASAT             GC                         F     L+GN+TT
Sbjct: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDIMDVSSFSQHASLNGNITT 240

Query: 241 DELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLL 300
           D LH+E++DTQ ALTDGGFEENVSAYCFYARESYK+G+QVLLSYGTY+NLELL+YYGFLL
Sbjct: 241 DGLHKEEQDTQRALTDGGFEENVSAYCFYARESYKRGEQVLLSYGTYSNLELLQYYGFLL 300

Query: 301 QENPNDK-FLFLWNMSFILPVLGPRSLFIF------------------------------ 354
           QENPND+ F+ L +  +        SLFI                               
Sbjct: 301 QENPNDRVFIPLEHEIYSSSSWPKESLFIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360

BLAST of Sgr021134 vs. ExPASy TrEMBL
Match: A0A6J1F4A7 (protein SET DOMAIN GROUP 40 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111442034 PE=4 SV=1)

HSP 1 Score: 455.3 bits (1170), Expect = 4.9e-124
Identity = 259/434 (59.68%), Positives = 292/434 (67.28%), Query Frame = 0

Query: 1   METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDL 60
           M  E S ESLLRWAADHGISDSV+KQ SHSCLGRSLCV FFPDAGGRGLGAVR+L KG+L
Sbjct: 1   MGNEESFESLLRWAADHGISDSVDKQCSHSCLGRSLCVCFFPDAGGRGLGAVRHLTKGEL 60

Query: 61  VLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFTYFKH 120
           VL+VPKSVLLTTQSL L+DEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWF YFKH
Sbjct: 61  VLKVPKSVLLTTQSLSLQDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFPYFKH 120

Query: 121 LPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKA 180
           LP +Y+ LATFGEFEKQALQVDYA+W  EKAA KS  EW GVKGLMEESN K+Q++TFKA
Sbjct: 121 LPTTYETLATFGEFEKQALQVDYALWEAEKAASKSRAEWRGVKGLMEESNIKNQLQTFKA 180

Query: 181 WLWASAT-------------GCFI-----------------------FFTTCFLSGNMTT 240
           WLWASAT             GC                         F     L+GN+TT
Sbjct: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEAESFDIMDVSSFSQHASLNGNITT 240

Query: 241 DELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLL 300
           D LH+E++DTQ ALTDGGFEENVSAYCFYARESYK+G+QVLLSYGTYTNLELL+YYGFLL
Sbjct: 241 DGLHKEEQDTQRALTDGGFEENVSAYCFYARESYKRGEQVLLSYGTYTNLELLQYYGFLL 300

Query: 301 QENPNDK-FLFLWNMSFILPVLGPRSLFIF------------------------------ 354
           QENPND+ F+ L +  +        SLFI                               
Sbjct: 301 QENPNDRVFIPLEHDIYSSSSWPKESLFIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360

BLAST of Sgr021134 vs. ExPASy TrEMBL
Match: A0A0A0L7L4 (SET domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G307670 PE=4 SV=1)

HSP 1 Score: 450.3 bits (1157), Expect = 1.6e-122
Identity = 260/431 (60.32%), Positives = 284/431 (65.89%), Query Frame = 0

Query: 1   METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDL 60
           METEGSL SLLRWAADHGISDSV++ +SHSCLG SLCVSFFPD GGRGL AVR L KG+L
Sbjct: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL 60

Query: 61  VLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFTYFKH 120
           VLR PKS+LLTTQSL LEDEKL MALKRYPSLSSTQKLTFCLLYEI KG SSWWF Y KH
Sbjct: 61  VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH 120

Query: 121 LPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKA 180
           LP SYD+LATFGEFEKQALQVDYAIWATEKAALKS T+W GV+GLM+ESN KSQ++TFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA 180

Query: 181 WLWASAT-------------GCF------------------IFFTTCFLSGNMTTDELH- 240
           WLWASAT             GC                         F S     DEL  
Sbjct: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELEL 240

Query: 241 -EEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLLQEN 300
            EEQRD+QWALTDGGFEEN SAYCFYARESY+KG+QVLLSYGTYTNLELLEYYGFLLQEN
Sbjct: 241 LEEQRDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLLQEN 300

Query: 301 PNDKFLF----------LW-----------NMSFIL-----------------PVLGPRS 354
           PNDK              W           N SF L                   L    
Sbjct: 301 PNDKVFIPIEHDIYGSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG 360

BLAST of Sgr021134 vs. ExPASy TrEMBL
Match: A0A5D3BQD3 (Protein SET DOMAIN GROUP 40 isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold451G001720 PE=4 SV=1)

HSP 1 Score: 431.0 bits (1107), Expect = 9.9e-117
Identity = 249/431 (57.77%), Positives = 283/431 (65.66%), Query Frame = 0

Query: 1   METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDL 60
           METEGS  SLLRWAADHGISDS+++ +S SCLGRSLCVSFFPD+GGRGL AVR LNKG+L
Sbjct: 1   METEGSFGSLLRWAADHGISDSIDQHTSRSCLGRSLCVSFFPDSGGRGLAAVRQLNKGEL 60

Query: 61  VLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFTYFKH 120
           +LR PKSVLLTTQSL LEDEKL+MALK +PSLSSTQKLTFCLL EI KG+SS WF Y KH
Sbjct: 61  ILRAPKSVLLTTQSLSLEDEKLAMALKIFPSLSSTQKLTFCLLNEISKGASSRWFPYLKH 120

Query: 121 LPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKA 180
           LP SYD+LATFGEFEKQALQVDYAIWATEKAALKS  +W GVKGLM+ESN K+Q++TFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRMDWRGVKGLMQESNIKNQLQTFKA 180

Query: 181 WLWASAT-------------GCF------------------IFFTTCFLSGNMTTDELH- 240
           WLWASAT             GC                         F S     DEL  
Sbjct: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAMDVLSFPSHASLNDELES 240

Query: 241 -EEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLLQEN 300
            EEQRD+QW LTDGGFEEN SAYCFYARESYKKG+QVLLSYGTYTN+ELLEYYGFLLQEN
Sbjct: 241 LEEQRDSQWDLTDGGFEENASAYCFYARESYKKGEQVLLSYGTYTNIELLEYYGFLLQEN 300

Query: 301 PNDK-FLFLWNMSFILPVLGPRSLFIF--------------------------------- 354
           PNDK F+ + +  ++       SL+I                                  
Sbjct: 301 PNDKVFIPIEHDIYVSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG 360

BLAST of Sgr021134 vs. TAIR 10
Match: AT5G17240.1 (SET domain group 40 )

HSP 1 Score: 278.9 bits (712), Expect = 1.2e-74
Identity = 159/317 (50.16%), Positives = 204/317 (64.35%), Query Frame = 0

Query: 6   SLESLLRWAADHGISDSVE-KQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRV 65
           ++E+ LRWAA+ GISDS++  +   SCLG SL VS FPDAGGRGLGA R L KG+LVL+V
Sbjct: 7   TMETFLRWAAEIGISDSIDSSRFRDSCLGHSLSVSDFPDAGGRGLGAARELKKGELVLKV 66

Query: 66  PKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFTYFKHLPHS 125
           P+  L+TT+S++ +D KLS A+  + SLSSTQ L+ CLLYE+ K   S+W+ Y  H+P  
Sbjct: 67  PRKALMTTESIIAKDLKLSDAVNLHNSLSSTQILSVCLLYEMSKEKKSFWYPYLFHIPRD 126

Query: 126 YDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWA 185
           YD+LATFG FEKQALQV+ A+WATEKA  K  +EW     LM+E   K + ++F+AWLWA
Sbjct: 127 YDLLATFGNFEKQALQVEDAVWATEKATAKCQSEWKEAGSLMKELELKPKFRSFQAWLWA 186

Query: 186 SAT-------------GCFI----FFTTCFLSGNMTTDELHEEQRDTQWA---------- 245
           SAT             GC       F          T +  E   + + A          
Sbjct: 187 SATISSRTLHVPWDSAGCLCPVGDLFNYDAPGDYSNTPQGPESANNVEEAGLVVETHSER 246

Query: 246 LTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLLQENPNDK-FLFLW 293
           LTDGGFEE+V+AYC YAR +Y+ G+QVLL YGTYTNLELLE+YGF+L+EN NDK F+ L 
Sbjct: 247 LTDGGFEEDVNAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDKVFIPLE 306

BLAST of Sgr021134 vs. TAIR 10
Match: AT3G07670.1 (Rubisco methyltransferase family protein )

HSP 1 Score: 49.3 bits (116), Expect = 1.6e-05
Identity = 63/245 (25.71%), Positives = 101/245 (41.22%), Query Frame = 0

Query: 43  DAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCL 102
           D G RGL A +NL KG+ +L VP S++++  S     E     +KRY  +     L   L
Sbjct: 97  DIGERGLVASQNLRKGEKLLFVPPSLVISADSEWTNAE-AGEVMKRY-DVPDWPLLATYL 156

Query: 103 LYEIGKGSSSWWFTYFKHLPHS-YDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVG 162
           + E     SS WF Y   LP   Y +L     + +  L +        + A++  T  VG
Sbjct: 157 ISEASLQKSSRWFNYISALPRQPYSLL----YWTRTELDMYLEASQIRERAIERITNVVG 216

Query: 163 VKGLMEESNSKSQIKTFKAWLWASATGCFIFFTTCFLSGNMTTDELHEEQRDTQWAL--- 222
               +         + F   ++   T  + F       G + +  +     D ++AL   
Sbjct: 217 TYEDLRSRIFSKHPQLFPKEVFNDETFKWSF-------GILFSRLVRLPSMDGRFALVPW 276

Query: 223 -----------TDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLLQE- 271
                      T   ++++     F     Y+ G+QV +SYG  +N ELL  YGF+ +E 
Sbjct: 277 ADMLNHNCEVETFLDYDKSSKGVVFTTDRPYQPGEQVFISYGNKSNGELLLSYGFVPREG 328

BLAST of Sgr021134 vs. TAIR 10
Match: AT5G14260.3 (Rubisco methyltransferase family protein )

HSP 1 Score: 45.4 bits (106), Expect = 2.2e-04
Identity = 58/253 (22.92%), Positives = 99/253 (39.13%), Query Frame = 0

Query: 49  LGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGK 108
           + A  +L KGD+   VP S+++T + +L    +    L     LS    L   L+YE  +
Sbjct: 115 VAASEDLQKGDVAFSVPDSLVVTLERVL--GNETIAELLTTNKLSELACLALYLMYEKKQ 174

Query: 109 GSSSWWFTYFKHLPHSYDMLATFGEFEKQA------LQVDYAIWATEKAALKSHTEWVGV 168
           G  S W+ Y + L    D     G+ + ++       ++DY   +  KA +    E    
Sbjct: 175 GKKSVWYPYIREL----DRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAE---- 234

Query: 169 KGLMEESNSKSQI-----KTFKAWLWASATGCFIF--FTTCFLSGNMTTDELHEEQRDTQ 228
            G+  E N    +       F+ + +   T  F F  F   F++       L       +
Sbjct: 235 -GIKREYNELDTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARR 294

Query: 229 WALTDGGFEENVSAYCFYAR---------------ESYKKGQQVLLSYGTYTNLELLEYY 274
           +AL   G    + AYC   +                 YK G  +++  G   N +LL  Y
Sbjct: 295 FALVPLG--PPLLAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNAKLLLNY 354

BLAST of Sgr021134 vs. TAIR 10
Match: AT5G14260.1 (Rubisco methyltransferase family protein )

HSP 1 Score: 45.4 bits (106), Expect = 2.2e-04
Identity = 58/253 (22.92%), Positives = 99/253 (39.13%), Query Frame = 0

Query: 49  LGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGK 108
           + A  +L KGD+   VP S+++T + +L    +    L     LS    L   L+YE  +
Sbjct: 115 VAASEDLQKGDVAFSVPDSLVVTLERVL--GNETIAELLTTNKLSELACLALYLMYEKKQ 174

Query: 109 GSSSWWFTYFKHLPHSYDMLATFGEFEKQA------LQVDYAIWATEKAALKSHTEWVGV 168
           G  S W+ Y + L    D     G+ + ++       ++DY   +  KA +    E    
Sbjct: 175 GKKSVWYPYIREL----DRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAE---- 234

Query: 169 KGLMEESNSKSQI-----KTFKAWLWASATGCFIF--FTTCFLSGNMTTDELHEEQRDTQ 228
            G+  E N    +       F+ + +   T  F F  F   F++       L       +
Sbjct: 235 -GIKREYNELDTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARR 294

Query: 229 WALTDGGFEENVSAYCFYAR---------------ESYKKGQQVLLSYGTYTNLELLEYY 274
           +AL   G    + AYC   +                 YK G  +++  G   N +LL  Y
Sbjct: 295 FALVPLG--PPLLAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNAKLLLNY 354

BLAST of Sgr021134 vs. TAIR 10
Match: AT5G14260.2 (Rubisco methyltransferase family protein )

HSP 1 Score: 45.4 bits (106), Expect = 2.2e-04
Identity = 58/253 (22.92%), Positives = 99/253 (39.13%), Query Frame = 0

Query: 49  LGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGK 108
           + A  +L KGD+   VP S+++T + +L    +    L     LS    L   L+YE  +
Sbjct: 115 VAASEDLQKGDVAFSVPDSLVVTLERVL--GNETIAELLTTNKLSELACLALYLMYEKKQ 174

Query: 109 GSSSWWFTYFKHLPHSYDMLATFGEFEKQA------LQVDYAIWATEKAALKSHTEWVGV 168
           G  S W+ Y + L    D     G+ + ++       ++DY   +  KA +    E    
Sbjct: 175 GKKSVWYPYIREL----DRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAE---- 234

Query: 169 KGLMEESNSKSQI-----KTFKAWLWASATGCFIF--FTTCFLSGNMTTDELHEEQRDTQ 228
            G+  E N    +       F+ + +   T  F F  F   F++       L       +
Sbjct: 235 -GIKREYNELDTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARR 294

Query: 229 WALTDGGFEENVSAYCFYAR---------------ESYKKGQQVLLSYGTYTNLELLEYY 274
           +AL   G    + AYC   +                 YK G  +++  G   N +LL  Y
Sbjct: 295 FALVPLG--PPLLAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNAKLLLNY 354
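
The percentages on each HSP statistics line above are the ratio of matching columns to alignment length, for example 159/317 = 50.16%. The following is a minimal Python sketch for re-deriving them from the text of this page. The function name `parse_hsp_stats` and the returned keys are hypothetical and are not part of any BLAST tool; the pattern assumes the line layout shown here.

```python
import re

# Matches the statistics line of an HSP as printed on this page.
# "Posi?tives" accepts the label with or without the second "i".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Recompute identity and positives percentages from one statistics line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return {
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        "alignment_length": int(ident_len),
    }
```

Applied to the AT5G17240.1 hit, this returns an identity of 50.16 and a positives value of 64.35 over 317 aligned columns, which agrees with the printed figures.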

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
XP_022143354.1	5.2e-128	60.86	protein SET DOMAIN GROUP 40 isoform X1 [Momordica charantia]
XP_038896047.1	9.7e-127	61.29	protein SET DOMAIN GROUP 40 [Benincasa hispida]
XP_022983189.1	1.4e-125	60.37	protein SET DOMAIN GROUP 40 isoform X1 [Cucurbita maxima]
KAG6581196.1	4.1e-125	60.14	Protein SET DOMAIN GROUP 40, partial [Cucurbita argyrosperma subsp. sororia]
KAG7017936.1	3.5e-124	59.91	Protein SET DOMAIN GROUP 40, partial [Cucurbita argyrosperma subsp. argyrosperma...

Match Name	E-value	Identity (%)	Description
Q6NQJ8	1.7e-73	50.16	Protein SET DOMAIN GROUP 40 OS=Arabidopsis thaliana OX=3702 GN=SDG40 PE=2 SV=1
B7ZUF3	3.1e-06	27.50	Actin-histidine N-methyltransferase OS=Xenopus tropicalis OX=8364 GN=setd3 PE=2 ...
Q7SXS7	1.2e-05	24.19	Actin-histidine N-methyltransferase OS=Danio rerio OX=7955 GN=setd3 PE=2 SV=1
B5FW36	7.5e-05	25.27	Actin-histidine N-methyltransferase OS=Otolemur garnettii OX=30611 GN=SETD3 PE=3...
Q5ZML9	9.8e-05	22.96	Actin-histidine N-methyltransferase OS=Gallus gallus OX=9031 GN=SETD3 PE=2 SV=1

Match Name	E-value	Identity (%)	Description
A0A6J1CP24	2.5e-128	60.86	protein SET DOMAIN GROUP 40 isoform X1 OS=Momordica charantia OX=3673 GN=LOC1110...
A0A6J1J6L6	6.8e-126	60.37	protein SET DOMAIN GROUP 40 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC1114818...
A0A6J1F4A7	4.9e-124	59.68	protein SET DOMAIN GROUP 40 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC11144...
A0A0A0L7L4	1.6e-122	60.32	SET domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G307670 PE=4 SV...
A0A5D3BQD3	9.9e-117	57.77	Protein SET DOMAIN GROUP 40 isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN...

Match Name	E-value	Identity (%)	Description
AT5G17240.1	1.2e-74	50.16	SET domain group 40
AT3G07670.1	1.6e-05	25.71	Rubisco methyltransferase family protein
AT5G14260.3	2.2e-04	22.92	Rubisco methyltransferase family protein
AT5G14260.1	2.2e-04	22.92	Rubisco methyltransferase family protein
AT5G14260.2	2.2e-04	22.92	Rubisco methyltransferase family protein

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	GENE3D	3.90.1410.10	set domain protein methyltransferase, domain 1	coord: 4..263; e-value: 2.2E-32; score: 114.4
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 559..590
None	No IPR available	PANTHER	PTHR13271:SF91	PROTEIN SET DOMAIN GROUP 40	coord: 4..272
None	No IPR available	PANTHER	PTHR13271	UNCHARACTERIZED PUTATIVE METHYLTRANSFERASE	coord: 4..272
None	No IPR available	CDD	cd10527	SET_LSMT	coord: 35..263; e-value: 7.08381E-32; score: 122.173
None	No IPR available	SUPERFAMILY	82199	SET domain	coord: 6..267
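
The GENE3D, PANTHER, CDD and SUPERFAMILY rows above report overlapping coordinate ranges near the N-terminus. As a rough sketch, assuming these rows all describe the same SET-domain region (an interpretation, not something the annotation states), the span supported by every source can be computed as an interval intersection. The names `SET_DOMAIN_HITS` and `common_region` are hypothetical, and the mobidb-lite disorder prediction at 559..590 is left out because it is a different kind of feature.

```python
# (source, start, end) coordinates copied from the InterPro rows on this page.
SET_DOMAIN_HITS = [
    ("GENE3D 3.90.1410.10", 4, 263),
    ("PANTHER PTHR13271:SF91", 4, 272),
    ("PANTHER PTHR13271", 4, 272),
    ("CDD cd10527", 35, 263),
    ("SUPERFAMILY 82199", 6, 267),
]


def common_region(hits):
    """Return the (start, end) range covered by every hit, or None if disjoint."""
    start = max(s for _, s, _ in hits)  # latest start among all hits
    end = min(e for _, _, e in hits)    # earliest end among all hits
    return (start, end) if start <= end else None


print(common_region(SET_DOMAIN_HITS))
```

With the coordinates listed here, the shared range is residues 35..263, bounded on both sides by the CDD hit.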

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Sgr021134.1	Sgr021134.1	mRNA