CsGy3G020670 (gene) Cucumber (Gy14) v2

NameCsGy3G020670
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v2)
Descriptionprotein SET DOMAIN GROUP 40 isoform X2
LocationChr3 : 17824325 .. 17835620 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
GCATTAGAGAACAAAAAAAATAAAATAGAGCAATGAGAAAAGTTATGATAAATGACGTGTGAAATTAAAAGGAAAAAAGAAGAAGAAAAAGATAAAGAAAAAGAAGGTGAATTGCGTCTGTTGAAAGGTTTTGATGGAGGGTTTATGAGATGGAAACCGAAGGTAGTTTGGGAAGCCTGCTGAGATGGGCAGCCGATCATGGAATTTCAGATTCTGTAGATCAACCCACTTCACATTCTTGTTTGGGTCATTCTTTGTGCGTCTCTTTCTTCCCTGATACCGGCGGGTATGCTTTGATTTCTCGTTTTCTAGTTACATCACTCTTTTTCTCTTCTTTCGCCGCTTGTAAGTTGTTCATTTTGACGGATTGTTCTTCAGGAGAGGTTTGGCCGCTGTTCGTCAACTTAAGAAAGGAGAGTTAGTGCTGAGAGCTCCAAAATCTATCTTGTTGACCACCCAAAGTTTGTCGTTGGAAGATGAGAAGCTCGACATGGCTCTAAAGAGATACCCATCTCTTTCATCGACACAGGTTATTCCCTTGCTAACCTAATTTGCGCGAACTTTTGTTAGGAAGTAGGAGAGCTAGGAAGAAATTTGGGTGTTTAGGGCGCTAGGTTTAGTTATGATGTAACGACTACGTGCATGCATTACTTGAAAATAGATCTTTTGCAAATTCTAAAACAAAACATTATTAAATTACAAACGAAGTCCTTGCTAACCCTTAACTAATTAATCTTTTTTACGAGGAAGTACAACTTTAGAAAAAAATGAAAGAATACAAGGGCATACAAATAAACCAAACCACAAATAACACCCCAACTAAAGGAAGGGGACTAACTGAAAAGGATATCACCTATGGAATAATTACAAAATGATCTTGAAATCGAAGTCTAAGGGGACACATGAAATCTAATCAAAGACCAAACTTCAATAAGATCCCTTTCCTTTCCCTACCACAAAACACACAATCATTCCTCTCCCTCCAAATGTCCCACAAGATAGCACGCACCCTAATGATCCATAGAAAACCTCCCTTGAGAAAGTGGATGGAGGTGAAACTCCTCGATGGTCATACGAACCCCTCTAAGATCAACATATCTCATGCCAAACTCCTAAAACAACATACTTCACAAAGCAAGCGCATAATGACAATCCTAAAAGAGGTGACCAAGATCTTTCTTTGCCCTTCGACAAAGCAAGCAACAAAAAGGCCGAACAAGAGAAGATCTCCTCCTCACAAGCCCATCCACAGTGTTAATCCGACTCAACAGAACTTGCTAGGTGAAGAACCTATCTTTCATAGGAACCTTGGTCCTCCAAACCACATTAAAAACAGACCCCCTATAGGGGGAGAGATCCAATAACAAACTAAAGAAAGATTTACATAAGAATCCCTAACTTGGATGAGGACTCCAAACACAAACATCCCTCCTTCCCTCCTTAAGACTGACCCTTTCAACCAGAGAAAGAAGAGAGACCACCTCCGTTGTTTTTCTATCGGTCAAATTACGACAGAACCAGAAAAAGAAGGATACATAATTATCGAACCTGACCAAAAAGTCCAATATTGTACAATTTTTTTGAATTGGATAAATGATAAAGATGCAGAAACAAGGAGCAAAGAGAAAGCATTGATCCTCCCAGAAAATCATGTCCTTATCCTCCCTCACAGAACAACGGATGAAATGGGAAAAAAGAACTCAAGACAAACATCTTTCCAAGAATTGCGTTTTGTGCGTATAACCCCTTTCGTCAACCACTCAAAAGGATGGGTACCGTACTTACTCTCAATAATCCTATGCCATAAGTAGTCGGACACGAGGGGAAAGCGCCAAATCCACTTCGGCAACAAGGCTCTGTTTCAGAGACTTAGATTACCAATCTCTAACCCCCCTGACCAACAGGGTGCCCCACTGCAATCCTATTAACCAGATGGGGGCCATGACCTTCCTCCACCCCTTCCAAGAAAATTTCTCATGTATTTTTCTAGGCTCTTGAACACTGACCTAGGAGCCCTGAAGAGAGACAAATAATAAACCAGAATCCCACTAAGCACAGACCTGATTAGGGTTAATCTACCAGCTCTTGAAAAGAACCCTCTTCCATACTGCTAGTCTCTTACGAATTTTATCATAAAGAGGATCTTAAAAAGTCAAAGCTTCCAGATTTCCACCAAGGGGAAGCCCAAGATACGTCGAAGGGAACAAGCCAACCTTACAATCAAACATCTCAGCCCAATTTAGCAACTTAGCTTGATCAGTTTTAATACCAAAAATGGAATATTTTCTTCTGATAATTTTTAACCTGAATAGATCTTCAAAGAAAGCCACAATATGGTTGATAATAAGGAAGGACTCCTCTTTTCCAGAACGAAGAAGCATAGTTATCATCCGCGAATTGAAGATGCGATAAGGCAACTTCATTCCTCCCACACTAAAAGGGTCAACAATGGTTCCTCCCACCCCTTCATAAATAATCCGACTTAAAACATCCACAACCAATAAAAACAGGGAGGGGATAGAGGATCCTTGTCTTAACTCTTTGGAGGCTTGAATAGGCCCTCTGGGGTAGCATTGATGAGAATATAATGCTTCACATTCCTCACAACCCCCCACATCCACATTATCCATTTAGAGCCAAACCCTTTTTTAATTAGAACTCTATCCAAAAATCCCAATCCACATGATCGTATGCCTTTTCAAAATCAAGTTTAAGCAACACACCTCTTTTCCTAGATCTGTACTCCTCGATAGCTTCATTTGCAACAAGGACTTGATCCGGAATCTGCCTTCCTGCCACCAAGGCACCTTGGGCCTCAGAAATAGTAGATGACATAACTTTTCTTAAATGATTACCAAGAACCTTAGTCAAAATCTTATAAACACTGGCAACAAACCTAATAGACCTAAAATCCTAACTCTGGCACACTATCCTTTTTAGGAATCAAACATACGAAGGTTTCAACCAGCGAACTATTTAAAATACCCCTTTCAAAGAACTCCTTAAACACTCCCTCTAAATCACCTTTGACCAGACTCCAACTGTCTTGATAGAGAGCCAAAGAAACGCCATCCGCCCATGGAGATTTGTTTAATCCTCATTCACCCTAGTTGAGATAGGGCTCCAATCAATACCTTCCACAAATGGTTTGACCCTAACTTCAGGGCTATAAAGATTTGAGAAGAACTTAATGATTTCCTCCTTAATAACCTTGCCATCTCTTGAAATCTCCCCATTGTCTAAACACAAAAGACCTATTTGGGTCCTGCATATAGTCAACTAATCATTTATGGCTATCTAATGAACAGTTTAACCTAACTTGACAAGCATTTGGTGGTTTGAAGGCACACAAAAAAAGTAGGGCACCTGTTGGCACTTGGTAACATAGTAGGAAACTCGTCGGTCTCCGATAACATACTAGGAAATTTGTTGGCCTCCAATAACATAGTAGTATAAGGAGGGTAAACCGTCTCCTCATAACTTATCTTAAAACATGCTTACTAAATACATCTCACAACATAAACCATATAAACATATATTAACAACCTCAAGCTCACAAGTGGCTCAACTCATAATAATTATCATGCTTCCCCGTATCAGTCAAACAGATCAACTAAATCATGTTGTCTAAATTGCCTCGCTCAAGGGAAGTTCCAGTAGTAGGGTTACTTATCTTGAAATTAGGTCCAAAAATATTATCACAACAATAACCTCCACCAAAAAAGTGTCGAATCAAACCTAGAACATGAATTTCAAACTTAGATTAAATATCTAGAGCCTAAGTCCAGTTGTGGATCTGCCAGAACCATCCAAACAATTTTTAACCAAAGTGTGCTTTAACCAAAACATGTGCTACCAATCTCCTAAGTCAACCACGAACCAGCAACACCCACAAAACCAACCTTTCACTCCGCTGGACAGCACATTAAATCCCTCCAAAATTCCAATTTAAGACCAAACTAAGAATGATAAAGTGAGAATTAGCCCACAAGCTTTTTAAGTATTAAAAAATTATCAATAATAACCCACAAACTTACCAAAACCTTGCTCGAACTAAAGGGACAGCAGTTCAATCTCTAACAAAAGCTACCAATTAACCTTCAAATATCATGGGAAACATGATTTTAAACCCACACATATAGTGGGTGTTCAAGAACCAACCGAGAATAAACAATAGCCAACACTTACTTCAAACCTCAAGATTCCGACGACTGGATGGAGGGAAACGTGGCTGAAGAGGATGTCGTTGATGACTGGCAGTTGAATGGCGCATCTGGGTAGCGGATCAATGGTGAAAACGAACTGCTGCACAAGAAGCTTCAGTTGATAGGTGCCTGGCACACACATTTTCCAATATGTGACTTAAAAGGATATAAATAATTACTTTTAAACCCTAATTTCTTTTAATAACGGAATAAATTTCCTCTTCCATGAACTAGATGTAAACATATTCATATACTTAAATCCAAATTTCTCTCTTAGGATGAGATAAATAAAGGAAATATACTACTTTCTTTTCCTTAGCCCATTTCGAAATAAAAAATAAATCCAACCTTCCAAATTAAATTAAATTTCTATTGTTTGAATTAATTCCATAACCTTTATAAAAAACCCAATATTCAAATCCTTTTCACTTAATCTCTCCAAAAAACCCTTTATTATACCTATTTCAAAATTGAATCTAATTCAATTTTAACTCCGACATTTAAAATATAACCAAATTTTGAAGGCAATGATTTTAAATTTTACCTAGGAAATTAGGGGATGTCACATATGAAGTCTGAAGTAAATATGTTGGAATTAGGAAGTTTATTTTTGGCTCCAACGCTTTTGTCTTGAACTTGATTCTTTATGGCATAGGATTATTGTAAGTAAGCATGACACCCATCCTTTTGAATGGACAGCGGTGGGGGTTAAAGGCACACACCAATGTCCTTGAAAGCATATGGCACAGACCGATATCCTTGAAAGCATATCTCCTTCGAGCTTCCTACTTTCTCTTGGTTTACTCGTTGCTTTGTGGGGGATGGTAAGGACACGTGCTTTTGGGAGGATCAGTGGGTGGGGGAAAATTCCATTTGTTCTTTATTTCCACATCTTTATTATTTATCTTCTTCCAAAAAAGGTATGATATTGGATCTTTTGGTTGGGTTCGAGAATCCCATGTTTATTTCTTTCGGGTTTGTCGTAATTTGACCATTAAAGAAACGATGGAGGTGACCTCTCTTCTTGCTTTGGTTGAGGGGTGTAGTTTTAGGGAGGGGAGAAGGGATGTTTGTGTTTTTTGGAATCGTAATCCAAGTTAGGGTTATCTGCATGGGTTTAGTCCTTTGTTAGATCCCTACCCCCCTAGGGAGTCGGTTTATGATGCGTTTGGAGGACTAAGGTTCCTAAGAAAGTTAGGTTATTTATCTGGCAAGTCTTGCTTGGTCGGGTTAACATTGTTGATAGGCTTCTTAGGAGTACTTTGCTTGTTAGACCTTTTTGTTGCATGCTTTGTCAGAAGGCAGAGGAAGATCTTGATCATCTTTTTTGGGATTGCCCTTATGTGGGCTGAGTGAATTTTCTTTTCAGGAGTTTGGTGATAGTTATGTCGGCCTTCAGAGTGTCAAAGCGACGATTGAGGAGTTCCTCCTCCATCCGTTCTTCAAAGACAAAAGGGTTTTTTTTTATGGCTGGCCGAGGTGTGTGCCTTTAACTAGGACATTTGGGGTAAGAGGAATGATTGGGTGTTTCTTGGTAGGGATTTGGTTGTTGGTTGGATTTCACGTGTTTCTTTGGGCTTTGATTTTGAAGACCTTTCGTAATTATTCAATTGGGCTTTGTTTTGGTGGGTTGGTTATTTTGTATGCTCTTGTATTCTTTCATTTGTTCTCAATTAAAGTTGTTTTCATAAAGAAAAACGAAAAAAGAGATTGAGTTCTTTGATAAAGATAGGTACGACCTGAGATGCACCTTAGTATTCTTCACTCTTGGGCCCCCACTGAAAGGAATGATAGTCTTAAATGGATTCCTAACATGAATGGCAATTTCACTACAAAATCTACTTTTCTTAACTTAACTAAGAGATCCCCCAACATTGCCGTCCCCTTGATTCGTCAGATTTGGAAGAATAAAATCCCGAAGAAGGTGAAGTTTTTCTTATGGTCGCTTGCTTACAGAAGCCTCAACACCCAGGAGAAACTACAAAAAAAAATCCATAACACATTGCTTAGCCCCTCGATGTGTTGCCTATGCGCTAAAGATGAGGAAACTTTGGATCACTTATTCCTACATTGTCCCTTCACAAGAAAAGCTTGGAACACGCTGTTTGGTATTTTCGACTTGGAGTTTTGCCTTCCTAGCAAGATTGATAGATGGATGATTGAAGGTCTTAATATTAGAGGTTACAGCCCTAAAGGAAACATCTTATGGAAATGTGCGTCGCGTTCCCTTTTGTGGAGCATCTGGAAAGAAAGGAATAGCAGAATTTTTTATGATAGATTTAATTCTTTTGATTCTTTTTGGACTGTGGTTCAACACACAGCCTCTTGGTGGAGTACGAATTACACCAAACACTTTTGTAATTATAGCCTTTCTATGATTTTCAACAATTGGAAGGCCATTATGTCCTAGCTCTTTAGCTTCTTCCGGGGAGGGCCCTCTTGTCCCTCGCCCTTAGGATGTTCTGTTTTGTTATATGAATATACTTGTCTCTTATAAAAAAAAAAGTATTTTCTCTTTAAGAATTACAAGCCTGCAGCCACCTAAATGGAAAAAGAGGAGATCTTTATACATGAATGACCATAGTTATGCTTAAAGATGTTTTATATTCTATTCTATGTTAATGTTTAAATGTTCTTAGTGCAATTGGGACAGATTATCTAAAACCAAATGGATCCTTTTCAGCACATTGGGATAACCCAATATCCTCTTGGTCCATAACTTTTAGAAGCCTTCTTAAAGATAACGAGATTCTGGATTTTCAAAATTTGATGGCTCAACTGTCAATTCAAACCTTGATAGAAGGCCATGGTCTTTGGAAGCTTCTGGCAGTTTCTCGGTCAAATCTCTCACATAACACCTCTCAGCTTTATCTCTTTTGGATAGAGACCTCTAAAAGCTTTGTGGAATCTCATATTACACTTCTCGGCTTTATCTCTTTTGGATAGAGGCCTCTAAAAGATTTTTGGAAATCTAAATGCCCGAGAAGAGTAAACATTCTTATAACCCATGTTGGACTTCAAAACGGCTCTTTAGTCATGCAAAGAAAGTTACCAAACAGCTGCCTTTCACCCTCAATATGCCCACTGTGCAAGCAAGAAGGAGAAGATTTGCAGCATTTGTTCTTTTCCTGTTTCTATTCTGCTAATTGTTGGTGGAAGTTCTTCTCAATATTTGAAGTTGCGTGGGTTTTTAGAGAATCATTTAGCCTAAATGTACAGAAAGTTTTATGGGGCCCATTTTAAAGAAAGGAATGGGACTAATTGGGCAAATTTATCAAAGGTCGTGTTGGCAGAAATTTGGTTTGAACGAAATCAAAGGAGAGATGTATTGGTTAGACCTCTTTGAGATAGCTAATAGGAATGCTGCAGCTTGGTGTACTTTACGGATCATTGGAAGCCGCTTCATGACTCATGTTTTTGCAGCCATCGTTGAAAGCTCTTTGTCTTCACATCTTCAGGCCATTCGTGCCTCACTTGTGCCGTTTTCTTGTTTCTTATTTTAGTTGCTTTACAGCCCCTACTTGTTGGATTCTTGTAAGATCTTTTTATGTATCTTAGGCTGTATTTTTGTTGGATATGACTAGGGCGCTAGGGGGATGTAAACCTAGTTGAGATGTTGGGCTACGCCAGCTGATTCTTTGGTCTCTTTTTGTTTCGCTCTTTGTACGAACCTTTTGTACTTTGAGCTTTATTCTTATCAATAAAGAGGTTGGTTTCCTTTTCAGAAAGAAAAAACATCACATTTTCTCTTGTTAGTTCTAAAGGTGTACTGATATTAATTTCACAATCCCTCCCCCTACTCAAATGCAGAAGTTGACCTTTTGTTTGCTCTATGAGATCAGTAAAGGACCCAGTTCTTGGTGGTTTCCTTACTTAAAGCATTTGCCCCAGAGCTATGACATACTGGCAACTTTTGGAGAATTTGAAAAGCAAGCCCTGCAGGTTCAATTTTCCGTTTGTGGCAATTTGATTTATTTCTTAGGATCATATTCCTATTCGTGAATCATATTCAATGAGGATATTTTACATGTGTTTTTGACATTTGTTTTATAATAAAATAGTTAAAGTCATTAGCAGGTTGATTATGCTATCTGGGCAACAGAGAAGGCTGCTTTGAAGTCTCGTACGGATTGGAGAGGAGTCGAAGGATTAATGCAAGAATCCAATATTAAAAGCCAACTCCAAACATTCAAGGCATGGCTTTGGGCCTCTGCAACTGTAAGCACACTGGATTCATGCTAGAAGATTGTATATTGCATGGTTAGATTGGACCTGGATTTTAAAATTTATTTTTATTTTTGGTAAGAAACCAAACTTTTATTGAGAAAAAAAATGAGTGAAAGAGGGCCCCCAACTAACTTTGAAAAGAGTTCCAATCTAAGAAATTCCAAATTCTTAATTACGATATAGGATTAATAGACATTTTCTTATTAGTTATTACCAAAAACCTGGGCTTAGGTTTCAATTTTTTTTTAACAATTAATTAATTTGTTTGTTTGGTGAAAAATTTATGCATCATTTAAAGTTGGTTTAGCTGGAGGTCTCGTATGGGTAAATTATCCAAGACCACTATCCAAGGTCACTGCAAATATGCTTTTCTTATCGAGGAAAGATTCCACTAATTGACACCCATTTGCTGATTATCTATCCTTCCATTACCCACTACCTTGTAATTGTAGTCCTGTACGCATTTCCCATTACCAGCCAGTAAGCATAATTTAGACTCCTATTCTATCTAATATGCTCTCTCTCCCATCCCACCTTGCTCTCCTCCTATAGTCCTATCCCCTTCACTTAAGGTTCCCCTTTAGGTATTCTTTGTCTCAAACTACTGTCACTGACCTTTCGTGATAGGACCTAATTATTGAAGTTGAGGGTGGATGGTTATAGATGAAGCAAACCAAAACTCTCTAGGATTGGAAAGGATAACTTTTCCTCCATCTCTCTCATCAATGGGGTCCTCTCGTGGATTAAAACTTGCTACATAGGCGTGCTCCTCCACTAGAACTCTATAGGGAGGATTATGTGTTTGCATTGAATAAATCTCGTATAGGAAAGAGACTACTGTTGAGATCATTAAGCTGAACTCAAAGACACATTTTATTCCATGTAGGTGAGAATTATTGAGGTCAGTTCTTCTCCTTGTGAAAGATTACCTTGTGCCACCCTGCCTCTAATAGTTCATTGTTTTCCTTGACTTGTGAAGTGGAAGTTGCTTGCACAGAGGCTTTTGGATCGTCCCATACAAGAATACTTGGATGCTCCTTCAAGCACGGTTGCATTTTGTACTAAAATCCCTTCAAAGTCTCAATAGCTTTTTGTTGCCATAGAATAGAACTATAGAATTCATCCTCTTTTTTATGAATGTGAATTGTTGAGACTCGAGAACTTGTTTTGTAATTTTCTCTTACCCTTCTTTTGGAGGATATATTATTGACCTTACTTGTATCTATTTTTCTTATGGAAAAACAAGTCTTTGTTTGTCTTATGCATTGTATGATGAGTCTAAATGTGAGCATGGTTTGGTCCGATGTAAACTTTTTCAATCTTAACTTCAGTTTGTTGAATTCTTTTTTTTTTTTTTTTTTTTGACGTTAGTAGCTAAAAACTAAAAGTATATGTTGCCCTTTGAGAGATATCATCTAGGACATTGTATGTACCATGGGATGAGGCCGGATGTTTATGTCCAGTTGGTGACCTGTTTAATTATGCTGCACCTGAAGGGGAATCCTTTAATGCTGTGGATGTTTTGTCCTTTCCATCACACGCTTCTTTGAATGATGAGTTAGAGTTACTTGAAGAGCAAAGAGATAGTCAATGGGCTTTGACAGATGGCGGATTTGAGGAAAATGCTTCTGCCTACTGCTTCTATGCTCGGGAAAATTACAGGAAGGGAGAGCAGGTATTTTATATTTTCTTAGTGTCATTTATATATTTTGTACTCATATGGAAAGAGGAGTTAATGCAAAATACAATAAACAGAAGGTTAACATTTTATTTATGGAGTTTGATTTCTATTTGAAGAAGGTTGAAATTTGCATTAACATTTTTTATTTGTTCTCTGATGCATTCCTGTTAATCTGCACCTTGAGATTTCCATAATAAAACTCGAGATTGTTTCTTCATGTCTCAAGGCCGAGATTGTTTCTTCATGTCTCAAGGCATGAATATCAATAAGTCCATTTTTAATTTGAAAAGAAAAATAAATGTTCTCTAAAAAATAATGCTTTGCACGAGTCACTAAAAAGGAATGTGTGTGTGGGTGAGTAACAATTGAAACCTTAAGAGGCCATGGGTTCAATCCATGGTAGGCATCTTTGATTTCCTACGAGTTTTCTTAACACCCAAATGTTGTAGGGTCAGACGGGTTGTCTTGTGAGATTAGTCGAGTTGCACGTAAGCTGACCCAGACACTCATGGATATCAGAAGAAAGTAACGATTAAAACATTTCTATTGTTTGCTGCAGGTTCTTTTAAGCTATGGTACATACACAAACTTAGAGCTTCTTGAATACTATGGGTTTCTTCTACAGGAAAATCCAAATGACAAAGTTTTTATTCCTATCGAACATGATATCTATGGTTCCAGTTCTTGGCCCGAGGAATCTCTTTATATTCATCAAAATGGAAACCCATCATTTGCTCTACTTTCTGCTCTGAGATTATGGGCAACCCATCCGAACAAGCGTAGAGGTGTCGGGCATCTTGCTTATGCCGGATCGCAACTCTCTGTCAAGAATGAAACATTAGTCATGCAGTGGTTATCCAAGAACTGCCATACTGTTTTAAACAATCTGCCAACATCAATTGAAGAAGACAATCAGCTTCTGTGCAATATCGCCAAAGTCCAAGACCTGCAGGTACCAAGGGAGCTCCAGAAGACATTGTTGACCTATGGAGGCGAGTTTTGTGCTTTCTTGGAAACCAATGGTGTGGTCAATAGAGATGAAGCAGAGTCACATTCATCCCAGAAACTAAAACGTTCTCTAGACAGATGGAAACTAGCAGTCCAGTGGAGGCTTTTGTACAAGAAGGCGTTGGTTGATTGCATAGGTTACTGCACCACAACTATTTGCTCTCTTTCTTCTTAA

mRNA sequence

GCATTAGAGAACAAAAAAAATAAAATAGAGCAATGAGAAAAGTTATGATAAATGACGTGTGAAATTAAAAGGAAAAAAGAAGAAGAAAAAGATAAAGAAAAAGAAGGTGAATTGCGTCTGTTGAAAGGTTTTGATGGAGGGTTTATGAGATGGAAACCGAAGGTAGTTTGGGAAGCCTGCTGAGATGGGCAGCCGATCATGGAATTTCAGATTCTGTAGATCAACCCACTTCACATTCTTGTTTGGGTCATTCTTTGTGCGTCTCTTTCTTCCCTGATACCGGCGGGAGAGGTTTGGCCGCTGTTCGTCAACTTAAGAAAGGAGAGTTAGTGCTGAGAGCTCCAAAATCTATCTTGTTGACCACCCAAAGTTTGTCGTTGGAAGATGAGAAGCTCGACATGGCTCTAAAGAGATACCCATCTCTTTCATCGACACAGAAGTTGACCTTTTGTTTGCTCTATGAGATCAGTAAAGGACCCAGTTCTTGGTGGTTTCCTTACTTAAAGCATTTGCCCCAGAGCTATGACATACTGGCAACTTTTGGAGAATTTGAAAAGCAAGCCCTGCAGGTTGATTATGCTATCTGGGCAACAGAGAAGGCTGCTTTGAAGTCTCGTACGGATTGGAGAGGAGTCGAAGGATTAATGCAAGAATCCAATATTAAAAGCCAACTCCAAACATTCAAGGCATGGCTTTGGGCCTCTGCAACTATATCATCTAGGACATTGTATGTACCATGGGATGAGGCCGGATGTTTATGTCCAGTTGGTGACCTGTTTAATTATGCTGCACCTGAAGGGGAATCCTTTAATGCTGTGGATGTTTTGTCCTTTCCATCACACGCTTCTTTGAATGATGAGTTAGAGTTACTTGAAGAGCAAAGAGATAGTCAATGGGCTTTGACAGATGGCGGATTTGAGGAAAATGCTTCTGCCTACTGCTTCTATGCTCGGGAAAATTACAGGAAGGGAGAGCAGGAAAATCCAAATGACAAAGTTTTTATTCCTATCGAACATGATATCTATGGTTCCAGTTCTTGGCCCGAGGAATCTCTTTATATTCATCAAAATGGAAACCCATCATTTGCTCTACTTTCTGCTCTGAGATTATGGGCAACCCATCCGAACAAGCGTAGAGGTGTCGGGCATCTTGCTTATGCCGGATCGCAACTCTCTGTCAAGAATGAAACATTAGTCATGCAGTGGTTATCCAAGAACTGCCATACTGTTTTAAACAATCTGCCAACATCAATTGAAGAAGACAATCAGCTTCTGTGCAATATCGCCAAAGTCCAAGACCTGCAGGTACCAAGGGAGCTCCAGAAGACATTGTTGACCTATGGAGGCGAGTTTTGTGCTTTCTTGGAAACCAATGGTGTGGTCAATAGAGATGAAGCAGAGTCACATTCATCCCAGAAACTAAAACGTTCTCTAGACAGATGGAAACTAGCAGTCCAGTGGAGGCTTTTGTACAAGAAGGCGTTGGTTGATTGCATAGGTTACTGCACCACAACTATTTGCTCTCTTTCTTCTTAA

Coding sequence (CDS)

ATGGAAACCGAAGGTAGTTTGGGAAGCCTGCTGAGATGGGCAGCCGATCATGGAATTTCAGATTCTGTAGATCAACCCACTTCACATTCTTGTTTGGGTCATTCTTTGTGCGTCTCTTTCTTCCCTGATACCGGCGGGAGAGGTTTGGCCGCTGTTCGTCAACTTAAGAAAGGAGAGTTAGTGCTGAGAGCTCCAAAATCTATCTTGTTGACCACCCAAAGTTTGTCGTTGGAAGATGAGAAGCTCGACATGGCTCTAAAGAGATACCCATCTCTTTCATCGACACAGAAGTTGACCTTTTGTTTGCTCTATGAGATCAGTAAAGGACCCAGTTCTTGGTGGTTTCCTTACTTAAAGCATTTGCCCCAGAGCTATGACATACTGGCAACTTTTGGAGAATTTGAAAAGCAAGCCCTGCAGGTTGATTATGCTATCTGGGCAACAGAGAAGGCTGCTTTGAAGTCTCGTACGGATTGGAGAGGAGTCGAAGGATTAATGCAAGAATCCAATATTAAAAGCCAACTCCAAACATTCAAGGCATGGCTTTGGGCCTCTGCAACTATATCATCTAGGACATTGTATGTACCATGGGATGAGGCCGGATGTTTATGTCCAGTTGGTGACCTGTTTAATTATGCTGCACCTGAAGGGGAATCCTTTAATGCTGTGGATGTTTTGTCCTTTCCATCACACGCTTCTTTGAATGATGAGTTAGAGTTACTTGAAGAGCAAAGAGATAGTCAATGGGCTTTGACAGATGGCGGATTTGAGGAAAATGCTTCTGCCTACTGCTTCTATGCTCGGGAAAATTACAGGAAGGGAGAGCAGGAAAATCCAAATGACAAAGTTTTTATTCCTATCGAACATGATATCTATGGTTCCAGTTCTTGGCCCGAGGAATCTCTTTATATTCATCAAAATGGAAACCCATCATTTGCTCTACTTTCTGCTCTGAGATTATGGGCAACCCATCCGAACAAGCGTAGAGGTGTCGGGCATCTTGCTTATGCCGGATCGCAACTCTCTGTCAAGAATGAAACATTAGTCATGCAGTGGTTATCCAAGAACTGCCATACTGTTTTAAACAATCTGCCAACATCAATTGAAGAAGACAATCAGCTTCTGTGCAATATCGCCAAAGTCCAAGACCTGCAGGTACCAAGGGAGCTCCAGAAGACATTGTTGACCTATGGAGGCGAGTTTTGTGCTTTCTTGGAAACCAATGGTGTGGTCAATAGAGATGAAGCAGAGTCACATTCATCCCAGAAACTAAAACGTTCTCTAGACAGATGGAAACTAGCAGTCCAGTGGAGGCTTTTGTACAAGAAGGCGTTGGTTGATTGCATAGGTTACTGCACCACAACTATTTGCTCTCTTTCTTCTTAA

Protein sequence

METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGELVLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKAWLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELELLEEQRDSQWALTDGGFEENASAYCFYARENYRKGEQENPNDKVFIPIEHDIYGSSSWPEESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYGGEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCTTTICSLSS
BLAST of CsGy3G020670 vs. NCBI nr
Match: XP_004145844.1 (PREDICTED: protein SET DOMAIN GROUP 40 [Cucumis sativus])

HSP 1 Score: 926.4 bits (2393), Expect = 3.8e-266
Identity = 458/483 (94.82%), Postives = 460/483 (95.24%), Query Frame = 0

Query: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL 60
           METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL
Sbjct: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL 60

Query: 61  VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH 120
           VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH
Sbjct: 61  VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH 120

Query: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA 180
           LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA 180

Query: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELEL 240
           WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELEL
Sbjct: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELEL 240

Query: 241 LEEQRDSQWALTDGGFEENASAYCFYARENYRKGE----------------------QEN 300
           LEEQRDSQWALTDGGFEENASAYCFYARE+YRKGE                      QEN
Sbjct: 241 LEEQRDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLLQEN 300

Query: 301 PNDKVFIPIEHDIYGSSSWPEESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG 360
           PNDKVFIPIEHDIYGSSSWP+ESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG
Sbjct: 301 PNDKVFIPIEHDIYGSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG 360

Query: 361 SQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYG 420
           SQLSVKNE LVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYG
Sbjct: 361 SQLSVKNEILVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYG 420

Query: 421 GEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCTTTICS 462
           GEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCTTTICS
Sbjct: 421 GEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCTTTICS 480

BLAST of CsGy3G020670 vs. NCBI nr
Match: KGN57798.1 (hypothetical protein Csa_3G307670 [Cucumis sativus])

HSP 1 Score: 904.4 bits (2336), Expect = 1.5e-259
Identity = 447/472 (94.70%), Postives = 449/472 (95.13%), Query Frame = 0

Query: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL 60
           METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL
Sbjct: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL 60

Query: 61  VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH 120
           VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH
Sbjct: 61  VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH 120

Query: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA 180
           LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA 180

Query: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELEL 240
           WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELEL
Sbjct: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELEL 240

Query: 241 LEEQRDSQWALTDGGFEENASAYCFYARENYRKGE----------------------QEN 300
           LEEQRDSQWALTDGGFEENASAYCFYARE+YRKGE                      QEN
Sbjct: 241 LEEQRDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLLQEN 300

Query: 301 PNDKVFIPIEHDIYGSSSWPEESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG 360
           PNDKVFIPIEHDIYGSSSWP+ESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG
Sbjct: 301 PNDKVFIPIEHDIYGSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG 360

Query: 361 SQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYG 420
           SQLSVKNE LVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYG
Sbjct: 361 SQLSVKNEILVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYG 420

Query: 421 GEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIG 451
           GEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIG
Sbjct: 421 GEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIG 472

BLAST of CsGy3G020670 vs. NCBI nr
Match: XP_008457030.1 (PREDICTED: protein SET DOMAIN GROUP 40 isoform X2 [Cucumis melo])

HSP 1 Score: 858.6 bits (2217), Expect = 9.7e-246
Identity = 427/483 (88.41%), Postives = 442/483 (91.51%), Query Frame = 0

Query: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL 60
           METEGS GSLLRWAADHGISDS+DQ TS SCLG SLCVSFFPD+GGRGLAAVRQL KGEL
Sbjct: 1   METEGSFGSLLRWAADHGISDSIDQHTSRSCLGRSLCVSFFPDSGGRGLAAVRQLNKGEL 60

Query: 61  VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH 120
           +LRAPKS+LLTTQSLSLEDEKL MALK +PSLSSTQKLTFCLL EISKG SS WFPYLKH
Sbjct: 61  ILRAPKSVLLTTQSLSLEDEKLAMALKIFPSLSSTQKLTFCLLNEISKGASSRWFPYLKH 120

Query: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA 180
           LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSR DWRGV+GLMQESNIK+QLQTFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRMDWRGVKGLMQESNIKNQLQTFKA 180

Query: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELEL 240
           WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNA+DVLSFPSHASLNDELE 
Sbjct: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAMDVLSFPSHASLNDELES 240

Query: 241 LEEQRDSQWALTDGGFEENASAYCFYARENYRKGE----------------------QEN 300
           LEEQRDSQW LTDGGFEENASAYCFYARE+Y+KGE                      QEN
Sbjct: 241 LEEQRDSQWDLTDGGFEENASAYCFYARESYKKGEQVLLSYGTYTNIELLEYYGFLLQEN 300

Query: 301 PNDKVFIPIEHDIYGSSSWPEESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG 360
           PNDKVFIPIEHDIY SSSWP+ESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG
Sbjct: 301 PNDKVFIPIEHDIYVSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG 360

Query: 361 SQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYG 420
           SQLSVKNETLVMQWLSKNCHTVLNNLPTSIEED+QLLCNIAKVQDLQV REL+K LLTYG
Sbjct: 361 SQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDDQLLCNIAKVQDLQVQRELRKMLLTYG 420

Query: 421 GEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCTTTICS 462
           GE CAFLETNGVVNRDEAESH S+KLKRSL+RWKLAVQWRLLYKKALVDCIGYCT TICS
Sbjct: 421 GECCAFLETNGVVNRDEAESHLSEKLKRSLERWKLAVQWRLLYKKALVDCIGYCTRTICS 480

BLAST of CsGy3G020670 vs. NCBI nr
Match: XP_008457029.1 (PREDICTED: protein SET DOMAIN GROUP 40 isoform X1 [Cucumis melo])

HSP 1 Score: 852.4 bits (2201), Expect = 7.0e-244
Identity = 427/488 (87.50%), Postives = 442/488 (90.57%), Query Frame = 0

Query: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGG-----RGLAAVRQL 60
           METEGS GSLLRWAADHGISDS+DQ TS SCLG SLCVSFFPD+GG     RGLAAVRQL
Sbjct: 1   METEGSFGSLLRWAADHGISDSIDQHTSRSCLGRSLCVSFFPDSGGKLCFRRGLAAVRQL 60

Query: 61  KKGELVLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWF 120
            KGEL+LRAPKS+LLTTQSLSLEDEKL MALK +PSLSSTQKLTFCLL EISKG SS WF
Sbjct: 61  NKGELILRAPKSVLLTTQSLSLEDEKLAMALKIFPSLSSTQKLTFCLLNEISKGASSRWF 120

Query: 121 PYLKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQL 180
           PYLKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSR DWRGV+GLMQESNIK+QL
Sbjct: 121 PYLKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRMDWRGVKGLMQESNIKNQL 180

Query: 181 QTFKAWLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLN 240
           QTFKAWLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNA+DVLSFPSHASLN
Sbjct: 181 QTFKAWLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAMDVLSFPSHASLN 240

Query: 241 DELELLEEQRDSQWALTDGGFEENASAYCFYARENYRKGE-------------------- 300
           DELE LEEQRDSQW LTDGGFEENASAYCFYARE+Y+KGE                    
Sbjct: 241 DELESLEEQRDSQWDLTDGGFEENASAYCFYARESYKKGEQVLLSYGTYTNIELLEYYGF 300

Query: 301 --QENPNDKVFIPIEHDIYGSSSWPEESLYIHQNGNPSFALLSALRLWATHPNKRRGVGH 360
             QENPNDKVFIPIEHDIY SSSWP+ESLYIHQNGNPSFALLSALRLWATHPNKRRGVGH
Sbjct: 301 LLQENPNDKVFIPIEHDIYVSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGH 360

Query: 361 LAYAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKT 420
           LAYAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEED+QLLCNIAKVQDLQV REL+K 
Sbjct: 361 LAYAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDDQLLCNIAKVQDLQVQRELRKM 420

Query: 421 LLTYGGEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCT 462
           LLTYGGE CAFLETNGVVNRDEAESH S+KLKRSL+RWKLAVQWRLLYKKALVDCIGYCT
Sbjct: 421 LLTYGGECCAFLETNGVVNRDEAESHLSEKLKRSLERWKLAVQWRLLYKKALVDCIGYCT 480

BLAST of CsGy3G020670 vs. NCBI nr
Match: XP_022983189.1 (protein SET DOMAIN GROUP 40 isoform X1 [Cucurbita maxima])

HSP 1 Score: 768.1 bits (1982), Expect = 1.7e-218
Identity = 375/486 (77.16%), Postives = 410/486 (84.36%), Query Frame = 0

Query: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL 60
           M TEGS  SLLRWAADHGISDSVD+ +SHSCLG SLCV FFPD GGRGL AVR L KGEL
Sbjct: 1   MGTEGSFESLLRWAADHGISDSVDKQSSHSCLGRSLCVCFFPDAGGRGLGAVRHLTKGEL 60

Query: 61  VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH 120
           VL+ PKS+LLTTQSLSL+DEKL MALKRYPSLSSTQKLTFCLLYEI KG SSWWFPY KH
Sbjct: 61  VLKVPKSVLLTTQSLSLQDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFPYFKH 120

Query: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA 180
           LP +Y+ LATFGEFEKQALQVDYA+W  EKAA KS T+WRGV+GLM+ESNIK+QLQTFKA
Sbjct: 121 LPTTYETLATFGEFEKQALQVDYALWEAEKAASKSHTEWRGVKGLMEESNIKNQLQTFKA 180

Query: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELE- 240
           WLWASATISSR LYVPWDEAGCLCPVGDLFNYAAPEGES + +DV SF  HASLN  +  
Sbjct: 181 WLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESLDIMDVSSFSQHASLNGNITT 240

Query: 241 --LLEEQRDSQWALTDGGFEENASAYCFYARENYRKGE---------------------- 300
             L +E++D+Q ALTDGGFEEN SAYCFYARE+Y++GE                      
Sbjct: 241 DGLHKEEQDTQRALTDGGFEENVSAYCFYARESYKRGEQVLLSYGTYSNLELLQYYGFLL 300

Query: 301 QENPNDKVFIPIEHDIYGSSSWPEESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360
           QENPND+VFIP+EH+IY SSSWP+ESL+IHQNGNPSFALLSALRLWATHPNKRRGVGHLA
Sbjct: 301 QENPNDRVFIPLEHEIYSSSSWPKESLFIHQNGNPSFALLSALRLWATHPNKRRGVGHLA 360

Query: 361 YAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLL 420
           YAGSQLSVKNE LVMQWLSKNCH VLNNLPTS+EEDNQLLCNI K+QDLQ P EL K LL
Sbjct: 361 YAGSQLSVKNEVLVMQWLSKNCHAVLNNLPTSVEEDNQLLCNICKIQDLQGPTELGKMLL 420

Query: 421 TYGGEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCTTT 462
           T GGEFCAFLET G+VNR+E E H + K+KRSL+RWKLAVQWR+LYKKALVDC  YCT T
Sbjct: 421 TVGGEFCAFLETYGLVNREETELHLTGKIKRSLERWKLAVQWRILYKKALVDCTSYCTRT 480

BLAST of CsGy3G020670 vs. TAIR10
Match: AT5G17240.1 (SET domain group 40)

HSP 1 Score: 459.9 bits (1182), Expect = 1.8e-129
Identity = 248/482 (51.45%), Postives = 320/482 (66.39%), Query Frame = 0

Query: 6   SLGSLLRWAADHGISDSVDQPT-SHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGELVLRA 65
           ++ + LRWAA+ GISDS+D      SCLGHSL VS FPD GGRGL A R+LKKGELVL+ 
Sbjct: 7   TMETFLRWAAEIGISDSIDSSRFRDSCLGHSLSVSDFPDAGGRGLGAARELKKGELVLKV 66

Query: 66  PKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKHLPQS 125
           P+  L+TT+S+  +D KL  A+  + SLSSTQ L+ CLLYE+SK   S+W+PYL H+P+ 
Sbjct: 67  PRKALMTTESIIAKDLKLSDAVNLHNSLSSTQILSVCLLYEMSKEKKSFWYPYLFHIPRD 126

Query: 126 YDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKAWLWA 185
           YD+LATFG FEKQALQV+ A+WATEKA  K +++W+    LM+E  +K + ++F+AWLWA
Sbjct: 127 YDLLATFGNFEKQALQVEDAVWATEKATAKCQSEWKEAGSLMKELELKPKFRSFQAWLWA 186

Query: 186 SATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELELLEEQ 245
           SATISSRTL+VPWD AGCLCPVGDLFNY AP   S    +    P  A+  +E  L+ E 
Sbjct: 187 SATISSRTLHVPWDSAGCLCPVGDLFNYDAPGDYS----NTPQGPESANNVEEAGLVVET 246

Query: 246 RDSQWALTDGGFEENASAYCFYARENYRKGEQ----------------------ENPNDK 305
              +  LTDGGFEE+ +AYC YAR NY+ GEQ                      EN NDK
Sbjct: 247 HSER--LTDGGFEEDVNAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDK 306

Query: 306 VFIPIEHDIYG-SSSWPEESLYIHQNGNPSFALLSALRLWATHPNKR-RGVGHLAYAGSQ 365
           VFIP+E  ++  +SSWP++SLYIHQ+G  SFAL+S LRLW    ++R + V  L YAGSQ
Sbjct: 307 VFIPLETSLFSLASSWPKDSLYIHQDGKLSFALISTLRLWLIPQSQRDKSVMRLVYAGSQ 366

Query: 366 LSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYGGE 425
           +SVKNE LVM+W+S+ C +VL +LPTS+ ED  LL NI K+QD ++  E QK    +G E
Sbjct: 367 ISVKNEILVMKWMSEKCGSVLRDLPTSVTEDTVLLHNIDKLQDPELRLE-QKETEAFGSE 426

Query: 426 FCAFLETN---GVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCTTTIC 460
             AFL+ N    V          S+K  R L +W+ +VQWRL YK+ L DCI YC   + 
Sbjct: 427 VRAFLDANCLWDVTVLSGKPIEFSRKTSRMLSKWRWSVQWRLSYKRTLADCISYCNEKMN 481

BLAST of CsGy3G020670 vs. TAIR10
Match: AT3G07670.1 (Rubisco methyltransferase family protein)

HSP 1 Score: 44.3 bits (103), Expect = 2.4e-04
Identity = 86/374 (22.99%), Postives = 150/374 (40.11%), Query Frame = 0

Query: 43  DTGGRGLAAVRQLKKGELVLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCL 102
           D G RGL A + L+KGE +L  P S++++  S    + +    +KRY  +     L   L
Sbjct: 97  DIGERGLVASQNLRKGEKLLFVPPSLVISADS-EWTNAEAGEVMKRY-DVPDWPLLATYL 156

Query: 103 LYEISKGPSSWWFPYLKHLP-QSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRG 162
           + E S   SS WF Y+  LP Q Y +L     + +  L +        + A++  T+  G
Sbjct: 157 ISEASLQKSSRWFNYISALPRQPYSLL----YWTRTELDMYLEASQIRERAIERITNVVG 216

Query: 163 VEGLMQESNIKSQLQTF-------KAWLWASATISSRTLYVP-WDEAGCLCPVGDLFNYA 222
               ++        Q F       + + W+   + SR + +P  D    L P  D+ N+ 
Sbjct: 217 TYEDLRSRIFSKHPQLFPKEVFNDETFKWSFGILFSRLVRLPSMDGRFALVPWADMLNHN 276

Query: 223 APEGESFNAVDVLSFPSHASLNDELELLEEQRDSQWALTDGGFEENASAYCFYARENYRK 282
             E E+F   D  S     + +   +  E+   S    ++G   E   +Y F  RE    
Sbjct: 277 C-EVETFLDYDKSSKGVVFTTDRPYQPGEQVFISYGNKSNG---ELLLSYGFVPREG--- 336

Query: 283 GEQENPNDKVFIPIEHDIYGSSSWPEE---SLYIHQNGNPS----------FALLSALRL 342
               NP+D V + +   +  +    EE   +L  H    P             L++   L
Sbjct: 337 ---TNPSDSVELAL--SLRKNDKCYEEKLDALKKHGLSTPQCFPVRITGWPMELMAYAYL 396

Query: 343 WATHPNKRRGVGHLAYAGS-QLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIA 393
             + P+ R     +A A S + S KN+    +        +L++  TSI + ++ L    
Sbjct: 397 VVSPPDMRNNFEEMAKAASNKTSTKNDLKYPEIEEDALQFILDSCETSISKYSRFLKESG 452

BLAST of CsGy3G020670 vs. Swiss-Prot
Match: sp|Q6NQJ8|SDG40_ARATH (Protein SET DOMAIN GROUP 40 OS=Arabidopsis thaliana OX=3702 GN=SDG40 PE=2 SV=1)

HSP 1 Score: 459.9 bits (1182), Expect = 3.3e-128
Identity = 248/482 (51.45%), Postives = 320/482 (66.39%), Query Frame = 0

Query: 6   SLGSLLRWAADHGISDSVDQPT-SHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGELVLRA 65
           ++ + LRWAA+ GISDS+D      SCLGHSL VS FPD GGRGL A R+LKKGELVL+ 
Sbjct: 7   TMETFLRWAAEIGISDSIDSSRFRDSCLGHSLSVSDFPDAGGRGLGAARELKKGELVLKV 66

Query: 66  PKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKHLPQS 125
           P+  L+TT+S+  +D KL  A+  + SLSSTQ L+ CLLYE+SK   S+W+PYL H+P+ 
Sbjct: 67  PRKALMTTESIIAKDLKLSDAVNLHNSLSSTQILSVCLLYEMSKEKKSFWYPYLFHIPRD 126

Query: 126 YDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKAWLWA 185
           YD+LATFG FEKQALQV+ A+WATEKA  K +++W+    LM+E  +K + ++F+AWLWA
Sbjct: 127 YDLLATFGNFEKQALQVEDAVWATEKATAKCQSEWKEAGSLMKELELKPKFRSFQAWLWA 186

Query: 186 SATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELELLEEQ 245
           SATISSRTL+VPWD AGCLCPVGDLFNY AP   S    +    P  A+  +E  L+ E 
Sbjct: 187 SATISSRTLHVPWDSAGCLCPVGDLFNYDAPGDYS----NTPQGPESANNVEEAGLVVET 246

Query: 246 RDSQWALTDGGFEENASAYCFYARENYRKGEQ----------------------ENPNDK 305
              +  LTDGGFEE+ +AYC YAR NY+ GEQ                      EN NDK
Sbjct: 247 HSER--LTDGGFEEDVNAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDK 306

Query: 306 VFIPIEHDIYG-SSSWPEESLYIHQNGNPSFALLSALRLWATHPNKR-RGVGHLAYAGSQ 365
           VFIP+E  ++  +SSWP++SLYIHQ+G  SFAL+S LRLW    ++R + V  L YAGSQ
Sbjct: 307 VFIPLETSLFSLASSWPKDSLYIHQDGKLSFALISTLRLWLIPQSQRDKSVMRLVYAGSQ 366

Query: 366 LSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYGGE 425
           +SVKNE LVM+W+S+ C +VL +LPTS+ ED  LL NI K+QD ++  E QK    +G E
Sbjct: 367 ISVKNEILVMKWMSEKCGSVLRDLPTSVTEDTVLLHNIDKLQDPELRLE-QKETEAFGSE 426

Query: 426 FCAFLETN---GVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCTTTIC 460
             AFL+ N    V          S+K  R L +W+ +VQWRL YK+ L DCI YC   + 
Sbjct: 427 VRAFLDANCLWDVTVLSGKPIEFSRKTSRMLSKWRWSVQWRLSYKRTLADCISYCNEKMN 481

BLAST of CsGy3G020670 vs. Swiss-Prot
Match: sp|B7ZUF3|SETD3_XENTR (Histone-lysine N-methyltransferase setd3 OS=Xenopus tropicalis OX=8364 GN=setd3 PE=2 SV=1)

HSP 1 Score: 65.5 bits (158), Expect = 1.8e-09
Identity = 84/365 (23.01%), Postives = 158/365 (43.29%), Query Frame = 0

Query: 41  FPDTGGRGLAAVRQLKKGELVLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTF 100
           FP+  G GL A R++K  EL L  P+ +L+T +S          +  R         L F
Sbjct: 101 FPEE-GFGLKATREIKAEELFLWVPRKLLMTVESAKGSVLGPLYSQDRILQAMGNITLAF 160

Query: 101 CLLYEISKGPSSWWFPYLKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWR 160
            LL E    P+S+W PY+K LP  YD    F E E Q LQ   AI         +   + 
Sbjct: 161 HLLCE-RADPNSFWLPYIKTLPNEYDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQYA 220

Query: 161 GVEGLMQESNIKSQLQ-----TFKAWLWASATISSRTLYVPWDEAG----CLCPVGDLFN 220
               ++Q     ++L      TF  + WA +++ +R   +P ++       L P+ D+ N
Sbjct: 221 YFYKVIQTHPNANKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCN 280

Query: 221 YAAP--------EGESFNAVDVLSFPSHASLNDELELLEEQR-DSQWALTDGGFEENASA 280
           +           E +    V +  F S     +++ +    R ++++ + +G F EN   
Sbjct: 281 HTNGLITTGYNLEDDRCECVALQDFKS----GEQIYIFYGTRSNAEFVIHNGFFFENN-- 340

Query: 281 YCFYARENYRKGEQENPNDKVFIPIEHDIYGSSSWPEESLY-IHQNGNP-SFALLSALRL 340
              + R   + G  +  +D+++  ++ ++   +  P  S++ +H    P S  LL+ LR+
Sbjct: 341 --LHDRVKIKLGVSK--SDRLY-AMKAEVLARAGIPTSSVFALHVTEPPISAQLLAFLRV 400

Query: 341 WATHPNKRRG----------VGHLAYAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEE 376
           +  + ++ +G          +  L  +   +S +NE  +  +L      +L    T++E+
Sbjct: 401 FCMNEDELKGHLIGDHAIDKIFTLGNSEFPVSWENEIKLWTFLEARASLLLKTYKTTVED 452

BLAST of CsGy3G020670 vs. Swiss-Prot
Match: sp|Q91WC0|SETD3_MOUSE (Histone-lysine N-methyltransferase setd3 OS=Mus musculus OX=10090 GN=Setd3 PE=1 SV=1)

HSP 1 Score: 60.8 bits (146), Expect = 4.5e-08
Identity = 98/444 (22.07%), Postives = 180/444 (40.54%), Query Frame = 0

Query: 10  LLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGELVLRAPKSIL 69
           L++WA+++G S            G  + V+F  +  G GL A R +K  EL L  P+ +L
Sbjct: 82  LMKWASENGASVE----------GFEM-VNFKEE--GFGLRATRDIKAEELFLWVPRKLL 141

Query: 70  LTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKHLPQSYDILA 129
           +T +S          +  R         L F LL E    P+S+W PY++ LP  YD   
Sbjct: 142 MTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLCE-RASPNSFWQPYIQTLPSEYDTPL 201

Query: 130 TFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQ-----TFKAWLWA 189
            F E E + LQ   AI         +   +     ++Q     ++L      T++ + WA
Sbjct: 202 YFEEEEVRCLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKESFTYEDYRWA 261

Query: 190 SATISSRTLYVPWDEAG----CLCPVGDLFNYAAP--------EGESFNAVDVLSFPSHA 249
            +++ +R   +P ++       L P+ D+ N+           E +    V +  F +  
Sbjct: 262 VSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQA-- 321

Query: 250 SLNDELELLEEQR-DSQWALTDGGFEENASAYCFYARENYRKGEQENPNDKVFIPIEHDI 309
              D++ +    R ++++ +  G F +N S    + R   + G  +  +D+++  ++ ++
Sbjct: 322 --GDQIYIFYGTRSNAEFVIHSGFFFDNNS----HDRVKIKLGVSK--SDRLY-AMKAEV 381

Query: 310 YGSSSWPEESLY-IHQNGNP-SFALLSALRLWATHPNKRR----------GVGHLAYAGS 369
              +  P  S++ +H    P S  LL+ LR++     + +           +  L  A  
Sbjct: 382 LARAGIPTSSVFALHSTEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNAEF 441

Query: 370 QLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYGG 424
            +S  NE  +  +L      +L    T+IEED  +L N     DL V   +   L     
Sbjct: 442 PVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKN----PDLSVRATMAIKLRLGEK 496

BLAST of CsGy3G020670 vs. Swiss-Prot
Match: sp|B2KI88|SETD3_RHIFE (Histone-lysine N-methyltransferase setd3 OS=Rhinolophus ferrumequinum OX=59479 GN=SETD3 PE=3 SV=1)

HSP 1 Score: 59.3 bits (142), Expect = 1.3e-07
Identity = 94/407 (23.10%), Postives = 158/407 (38.82%), Query Frame = 0

Query: 10  LLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGELVLRAPKSIL 69
           L++WA+++G S            G  + VSF  +  G GL A R +K  EL L  P+ +L
Sbjct: 82  LMKWASENGASVE----------GFEM-VSFKEE--GFGLRATRDIKAEELFLWVPRKLL 141

Query: 70  LTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKHLPQSYDILA 129
           +T +S          +  R         L F LL E    P+S+W PY++ LP  YD   
Sbjct: 142 MTVESAKNSVLGPLYSQDRILQAMGNITLAFHLLCE-RADPNSFWQPYIQTLPSEYDTPL 201

Query: 130 TFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQ-----TFKAWLWA 189
            FGE E + LQ   AI         +   +     ++Q     ++L      T++ + WA
Sbjct: 202 YFGEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDSFTYEDYRWA 261

Query: 190 SATISSRTLYVPWDEAG----CLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELEL 249
            +++ +R   +P ++       L P+ D+ N+        N +    +            
Sbjct: 262 VSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHT-------NGLITTGYN----------- 321

Query: 250 LEEQRDSQWALTDGGFEENASAYCFYARENYRKGE-------QENPNDKVFIPI------ 309
           LE+ R    AL D  F+     Y FY   +  +           N +D+V I +      
Sbjct: 322 LEDDRCECVALQD--FQAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSD 381

Query: 310 -----EHDIYGSSSWPEESLY-IHQNGNP-SFALLSALRLWA-------THPNKRRGVGH 369
                + ++   +  P  S++ +H    P S  LL+ LR++         H      +  
Sbjct: 382 RLYAMKAEVLARAGIPTSSVFALHFTEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDR 441

Query: 370 LAYAGSQ---LSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCN 378
           +   G+    +S  NE  +  +L      +L    T+IEED   L N
Sbjct: 442 IFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTNIEEDKSFLKN 454

BLAST of CsGy3G020670 vs. Swiss-Prot
Match: sp|B0VX69|SETD3_CALJA (Histone-lysine N-methyltransferase setd3 OS=Callithrix jacchus OX=9483 GN=SETD3 PE=3 SV=2)

HSP 1 Score: 58.2 bits (139), Expect = 2.9e-07
Identity = 97/444 (21.85%), Postives = 181/444 (40.77%), Query Frame = 0

Query: 10  LLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGELVLRAPKSIL 69
           L++WA+++G S            G  + V+F  +  G GL A R +K  EL L  P+ +L
Sbjct: 82  LMKWASENGASVE----------GFEM-VNFKEE--GFGLRATRDIKAEELFLWVPRKLL 141

Query: 70  LTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKHLPQSYDILA 129
           +T +S          +  R         L F LL E    P+S+W PY++ LP  YD   
Sbjct: 142 MTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLCE-RASPNSFWQPYIQTLPSEYDTPL 201

Query: 130 TFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQ-----TFKAWLWA 189
            F E E + LQ   A+         +   +     ++Q     ++L      T++ + WA
Sbjct: 202 YFEEEEVRYLQSTQAVHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDSFTYEDYRWA 261

Query: 190 SATISSRTLYVPWDEAG----CLCPVGDLFNYAAP--------EGESFNAVDVLSFPSHA 249
            +++ +R   +P ++       L P+ D+ N+           E +    V +  F +  
Sbjct: 262 VSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRA-- 321

Query: 250 SLNDELELLEEQR-DSQWALTDGGFEENASAYCFYARENYRKGEQENPNDKVFIPIEHDI 309
              +++ +    R ++++ +  G F +N S    + R   + G  +  +D+++  ++ ++
Sbjct: 322 --GEQIYIFYGTRSNAEFVIHSGFFFDNNS----HDRVKIKLGVSK--SDRLY-AMKAEV 381

Query: 310 YGSSSWPEESLY-IHQNGNP-SFALLSALRLWA-------THPNKRRGVGHLAYAGSQ-- 369
              +  P  S++ +H    P S  LL+ LR++         H      +  +   G+   
Sbjct: 382 LARAGIPTSSVFALHFTEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEF 441

Query: 370 -LSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYGG 424
            +S  NE  +  +L      +L    T+IEED  +L N    QDL V  ++   L     
Sbjct: 442 PVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN----QDLSVRAKMAIKLRLGEK 496

BLAST of CsGy3G020670 vs. TrEMBL
Match: tr|A0A0A0L7L4|A0A0A0L7L4_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G307670 PE=4 SV=1)

HSP 1 Score: 904.4 bits (2336), Expect = 1.0e-259
Identity = 447/472 (94.70%), Postives = 449/472 (95.13%), Query Frame = 0

Query: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL 60
           METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL
Sbjct: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL 60

Query: 61  VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH 120
           VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH
Sbjct: 61  VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH 120

Query: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA 180
           LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA 180

Query: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELEL 240
           WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELEL
Sbjct: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELEL 240

Query: 241 LEEQRDSQWALTDGGFEENASAYCFYARENYRKGE----------------------QEN 300
           LEEQRDSQWALTDGGFEENASAYCFYARE+YRKGE                      QEN
Sbjct: 241 LEEQRDSQWALTDGGFEENASAYCFYARESYRKGEQVLLSYGTYTNLELLEYYGFLLQEN 300

Query: 301 PNDKVFIPIEHDIYGSSSWPEESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG 360
           PNDKVFIPIEHDIYGSSSWP+ESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG
Sbjct: 301 PNDKVFIPIEHDIYGSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG 360

Query: 361 SQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYG 420
           SQLSVKNE LVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYG
Sbjct: 361 SQLSVKNEILVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYG 420

Query: 421 GEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIG 451
           GEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIG
Sbjct: 421 GEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIG 472

BLAST of CsGy3G020670 vs. TrEMBL
Match: tr|A0A1S3C4J5|A0A1S3C4J5_CUCME (protein SET DOMAIN GROUP 40 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103496809 PE=4 SV=1)

HSP 1 Score: 858.6 bits (2217), Expect = 6.4e-246
Identity = 427/483 (88.41%), Postives = 442/483 (91.51%), Query Frame = 0

Query: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLKKGEL 60
           METEGS GSLLRWAADHGISDS+DQ TS SCLG SLCVSFFPD+GGRGLAAVRQL KGEL
Sbjct: 1   METEGSFGSLLRWAADHGISDSIDQHTSRSCLGRSLCVSFFPDSGGRGLAAVRQLNKGEL 60

Query: 61  VLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFPYLKH 120
           +LRAPKS+LLTTQSLSLEDEKL MALK +PSLSSTQKLTFCLL EISKG SS WFPYLKH
Sbjct: 61  ILRAPKSVLLTTQSLSLEDEKLAMALKIFPSLSSTQKLTFCLLNEISKGASSRWFPYLKH 120

Query: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQTFKA 180
           LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSR DWRGV+GLMQESNIK+QLQTFKA
Sbjct: 121 LPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRMDWRGVKGLMQESNIKNQLQTFKA 180

Query: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLNDELEL 240
           WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNA+DVLSFPSHASLNDELE 
Sbjct: 181 WLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAMDVLSFPSHASLNDELES 240

Query: 241 LEEQRDSQWALTDGGFEENASAYCFYARENYRKGE----------------------QEN 300
           LEEQRDSQW LTDGGFEENASAYCFYARE+Y+KGE                      QEN
Sbjct: 241 LEEQRDSQWDLTDGGFEENASAYCFYARESYKKGEQVLLSYGTYTNIELLEYYGFLLQEN 300

Query: 301 PNDKVFIPIEHDIYGSSSWPEESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG 360
           PNDKVFIPIEHDIY SSSWP+ESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG
Sbjct: 301 PNDKVFIPIEHDIYVSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAG 360

Query: 361 SQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTLLTYG 420
           SQLSVKNETLVMQWLSKNCHTVLNNLPTSIEED+QLLCNIAKVQDLQV REL+K LLTYG
Sbjct: 361 SQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDDQLLCNIAKVQDLQVQRELRKMLLTYG 420

Query: 421 GEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCTTTICS 462
           GE CAFLETNGVVNRDEAESH S+KLKRSL+RWKLAVQWRLLYKKALVDCIGYCT TICS
Sbjct: 421 GECCAFLETNGVVNRDEAESHLSEKLKRSLERWKLAVQWRLLYKKALVDCIGYCTRTICS 480

BLAST of CsGy3G020670 vs. TrEMBL
Match: tr|A0A1S3C4N2|A0A1S3C4N2_CUCME (protein SET DOMAIN GROUP 40 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103496809 PE=4 SV=1)

HSP 1 Score: 852.4 bits (2201), Expect = 4.6e-244
Identity = 427/488 (87.50%), Postives = 442/488 (90.57%), Query Frame = 0

Query: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGG-----RGLAAVRQL 60
           METEGS GSLLRWAADHGISDS+DQ TS SCLG SLCVSFFPD+GG     RGLAAVRQL
Sbjct: 1   METEGSFGSLLRWAADHGISDSIDQHTSRSCLGRSLCVSFFPDSGGKLCFRRGLAAVRQL 60

Query: 61  KKGELVLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWF 120
            KGEL+LRAPKS+LLTTQSLSLEDEKL MALK +PSLSSTQKLTFCLL EISKG SS WF
Sbjct: 61  NKGELILRAPKSVLLTTQSLSLEDEKLAMALKIFPSLSSTQKLTFCLLNEISKGASSRWF 120

Query: 121 PYLKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQL 180
           PYLKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSR DWRGV+GLMQESNIK+QL
Sbjct: 121 PYLKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRMDWRGVKGLMQESNIKNQL 180

Query: 181 QTFKAWLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLN 240
           QTFKAWLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNA+DVLSFPSHASLN
Sbjct: 181 QTFKAWLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAMDVLSFPSHASLN 240

Query: 241 DELELLEEQRDSQWALTDGGFEENASAYCFYARENYRKGE-------------------- 300
           DELE LEEQRDSQW LTDGGFEENASAYCFYARE+Y+KGE                    
Sbjct: 241 DELESLEEQRDSQWDLTDGGFEENASAYCFYARESYKKGEQVLLSYGTYTNIELLEYYGF 300

Query: 301 --QENPNDKVFIPIEHDIYGSSSWPEESLYIHQNGNPSFALLSALRLWATHPNKRRGVGH 360
             QENPNDKVFIPIEHDIY SSSWP+ESLYIHQNGNPSFALLSALRLWATHPNKRRGVGH
Sbjct: 301 LLQENPNDKVFIPIEHDIYVSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGH 360

Query: 361 LAYAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKT 420
           LAYAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEED+QLLCNIAKVQDLQV REL+K 
Sbjct: 361 LAYAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDDQLLCNIAKVQDLQVQRELRKM 420

Query: 421 LLTYGGEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCT 462
           LLTYGGE CAFLETNGVVNRDEAESH S+KLKRSL+RWKLAVQWRLLYKKALVDCIGYCT
Sbjct: 421 LLTYGGECCAFLETNGVVNRDEAESHLSEKLKRSLERWKLAVQWRLLYKKALVDCIGYCT 480

BLAST of CsGy3G020670 vs. TrEMBL
Match: tr|A0A1S3C590|A0A1S3C590_CUCME (protein SET DOMAIN GROUP 40 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103496809 PE=4 SV=1)

HSP 1 Score: 749.2 bits (1933), Expect = 5.5e-213
Identity = 386/488 (79.10%), Postives = 401/488 (82.17%), Query Frame = 0

Query: 1   METEGSLGSLLRWAADHGISDSVDQPTSHSCLGHSLCVSFFPDTGG-----RGLAAVRQL 60
           METEGS GSLLRWAADHGISDS+DQ TS SCLG SLCVSFFPD+GG     RGLAAVRQL
Sbjct: 1   METEGSFGSLLRWAADHGISDSIDQHTSRSCLGRSLCVSFFPDSGGKLCFRRGLAAVRQL 60

Query: 61  KKGELVLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWF 120
            KGEL+LRAPKS+LLTTQSLSLEDEKL MALK +PSLSST                    
Sbjct: 61  NKGELILRAPKSVLLTTQSLSLEDEKLAMALKIFPSLSST-------------------- 120

Query: 121 PYLKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQL 180
                                   QVDYAIWATEKAALKSR DWRGV+GLMQESNIK+QL
Sbjct: 121 ------------------------QVDYAIWATEKAALKSRMDWRGVKGLMQESNIKNQL 180

Query: 181 QTFKAWLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLN 240
           QTFKAWLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNA+DVLSFPSHASLN
Sbjct: 181 QTFKAWLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAMDVLSFPSHASLN 240

Query: 241 DELELLEEQRDSQWALTDGGFEENASAYCFYARENYRKGE-------------------- 300
           DELE LEEQRDSQW LTDGGFEENASAYCFYARE+Y+KGE                    
Sbjct: 241 DELESLEEQRDSQWDLTDGGFEENASAYCFYARESYKKGEQVLLSYGTYTNIELLEYYGF 300

Query: 301 --QENPNDKVFIPIEHDIYGSSSWPEESLYIHQNGNPSFALLSALRLWATHPNKRRGVGH 360
             QENPNDKVFIPIEHDIY SSSWP+ESLYIHQNGNPSFALLSALRLWATHPNKRRGVGH
Sbjct: 301 LLQENPNDKVFIPIEHDIYVSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGH 360

Query: 361 LAYAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKT 420
           LAYAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEED+QLLCNIAKVQDLQV REL+K 
Sbjct: 361 LAYAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDDQLLCNIAKVQDLQVQRELRKM 420

Query: 421 LLTYGGEFCAFLETNGVVNRDEAESHSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCT 462
           LLTYGGE CAFLETNGVVNRDEAESH S+KLKRSL+RWKLAVQWRLLYKKALVDCIGYCT
Sbjct: 421 LLTYGGECCAFLETNGVVNRDEAESHLSEKLKRSLERWKLAVQWRLLYKKALVDCIGYCT 444

BLAST of CsGy3G020670 vs. TrEMBL
Match: tr|A0A1Q3B175|A0A1Q3B175_CEPFO (SET domain-containing protein/Rubis-subs-bind domain-containing protein OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_05277 PE=4 SV=1)

HSP 1 Score: 543.1 bits (1398), Expect = 6.0e-151
Identity = 279/481 (58.00%), Postives = 347/481 (72.14%), Query Frame = 0

Query: 2   ETEGSLGSLLRWAADHGISDSV-----DQPTSHSCLGHSLCVSFFPDTGGRGLAAVRQLK 61
           E E  L S L+WAA+ GI+DS         TSHSCLGHSL VS FPD GGRGL AVR L+
Sbjct: 4   EEERRLESFLKWAAELGITDSTKNQQSQNATSHSCLGHSLKVSNFPDAGGRGLGAVRDLR 63

Query: 62  KGELVLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCLLYEISKGPSSWWFP 121
           KGE++LR PKS L+T+++LS  D KL +AL R+PSLSSTQ+LT CLLYE+ KG SSWW+P
Sbjct: 64  KGEMILRVPKSALITSKTLSFNDHKLYLALNRHPSLSSTQRLTVCLLYEMGKGASSWWYP 123

Query: 122 YLKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSRTDWRGVEGLMQESNIKSQLQ 181
           YL H P+SY ILATFGEFEKQALQVD AIW TEKA  K+  +W+    LM+E  +K QL 
Sbjct: 124 YLMHFPRSYHILATFGEFEKQALQVDDAIWTTEKAIAKAELEWKEANMLMKELKLKRQLL 183

Query: 182 TFKAWLWASATISSRTLYVPWDEAGCLCPVGDLFNYAAPEGESFNAVDVLSFPSHASLND 241
           +F AWLWASA ISSRTL++ WDEAGCLCPVGDLFNY AP+ E+  ++ V S  +  S+ D
Sbjct: 184 SFTAWLWASAAISSRTLHIHWDEAGCLCPVGDLFNYDAPD-EATPSLQVSSLRNGESM-D 243

Query: 242 ELELLEEQRDSQWALTDGGFEENASAYCFYARENYRKGEQ-------------------- 301
            L+  ++   SQ  LTDGGFEE+ +AYCFYAR++Y++GEQ                    
Sbjct: 244 ALDSEDQLAQSQ-RLTDGGFEEDVAAYCFYARKSYQEGEQVLLSYGTYTNLELLEHYGFF 303

Query: 302 --ENPNDKVFIPIEHDIYGSSSWPEESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHL 361
             +NPNDKVFIP+E  +Y SSSWP+ESLYIHQ+G PSFALLS LRLWAT  ++RR VGHL
Sbjct: 304 LNKNPNDKVFIPLEPKMYCSSSWPKESLYIHQDGKPSFALLSTLRLWATPQSQRRSVGHL 363

Query: 362 AYAGSQLSVKNETLVMQWLSKNCHTVLNNLPTSIEEDNQLLCNIAKVQDLQVPRELQKTL 421
           AY+GSQLS+ NE  VM+W+SKNCH +L N P+SI+ED+ LL  I ++ +     EL+  +
Sbjct: 364 AYSGSQLSMDNEISVMRWISKNCHLILKNFPSSIKEDSFLLSAIDEIPNSCTALELRNMM 423

Query: 422 LTYGGEFCAFLETNGVVNRDEAES-HSSQKLKRSLDRWKLAVQWRLLYKKALVDCIGYCT 455
            T GGE C FL   G++NR+ A + H S+K + S++RWKLAVQWRL YKK+LVDCI YC 
Sbjct: 424 STLGGEGCNFLRAIGMLNRESAANLHLSKKARSSIERWKLAVQWRLWYKKSLVDCIDYCG 481

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004145844.13.8e-26694.82PREDICTED: protein SET DOMAIN GROUP 40 [Cucumis sativus][more]
KGN57798.11.5e-25994.70hypothetical protein Csa_3G307670 [Cucumis sativus][more]
XP_008457030.19.7e-24688.41PREDICTED: protein SET DOMAIN GROUP 40 isoform X2 [Cucumis melo][more]
XP_008457029.17.0e-24487.50PREDICTED: protein SET DOMAIN GROUP 40 isoform X1 [Cucumis melo][more]
XP_022983189.11.7e-21877.16protein SET DOMAIN GROUP 40 isoform X1 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
AT5G17240.11.8e-12951.45SET domain group 40[more]
AT3G07670.12.4e-0422.99Rubisco methyltransferase family protein[more]
Match NameE-valueIdentityDescription
sp|Q6NQJ8|SDG40_ARATH3.3e-12851.45Protein SET DOMAIN GROUP 40 OS=Arabidopsis thaliana OX=3702 GN=SDG40 PE=2 SV=1[more]
sp|B7ZUF3|SETD3_XENTR1.8e-0923.01Histone-lysine N-methyltransferase setd3 OS=Xenopus tropicalis OX=8364 GN=setd3 ... [more]
sp|Q91WC0|SETD3_MOUSE4.5e-0822.07Histone-lysine N-methyltransferase setd3 OS=Mus musculus OX=10090 GN=Setd3 PE=1 ... [more]
sp|B2KI88|SETD3_RHIFE1.3e-0723.10Histone-lysine N-methyltransferase setd3 OS=Rhinolophus ferrumequinum OX=59479 G... [more]
sp|B0VX69|SETD3_CALJA2.9e-0721.85Histone-lysine N-methyltransferase setd3 OS=Callithrix jacchus OX=9483 GN=SETD3 ... [more]
Match NameE-valueIdentityDescription
tr|A0A0A0L7L4|A0A0A0L7L4_CUCSA1.0e-25994.70Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G307670 PE=4 SV=1[more]
tr|A0A1S3C4J5|A0A1S3C4J5_CUCME6.4e-24688.41protein SET DOMAIN GROUP 40 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103496809 P... [more]
tr|A0A1S3C4N2|A0A1S3C4N2_CUCME4.6e-24487.50protein SET DOMAIN GROUP 40 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103496809 P... [more]
tr|A0A1S3C590|A0A1S3C590_CUCME5.5e-21379.10protein SET DOMAIN GROUP 40 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103496809 P... [more]
tr|A0A1Q3B175|A0A1Q3B175_CEPFO6.0e-15158.00SET domain-containing protein/Rubis-subs-bind domain-containing protein OS=Cepha... [more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR036464Rubisco_LSMT_subst-bd_sf
IPR015353Rubisco_LSMT_subst-bd
IPR001214SET_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsGy3G020670.1CsGy3G020670.1mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001214SET domainPFAMPF00856SETcoord: 46..276
e-value: 3.3E-7
score: 30.8
NoneNo IPR availableGENE3DG3DSA:3.90.1410.10coord: 1..278
e-value: 6.0E-32
score: 113.0
NoneNo IPR availablePANTHERPTHR13271:SF8SET DOMAIN-CONTAINING PROTEIN 4coord: 6..445
NoneNo IPR availablePANTHERPTHR13271UNCHARACTERIZED PUTATIVE METHYLTRANSFERASEcoord: 6..445
NoneNo IPR availableSUPERFAMILYSSF82199SET domaincoord: 7..213
IPR015353Rubisco LSMT, substrate-binding domainPFAMPF09273Rubis-subs-bindcoord: 311..421
e-value: 2.2E-6
score: 28.4
IPR036464Rubisco LSMT, substrate-binding domain superfamilySUPERFAMILYSSF81822RuBisCo LSMT C-terminal, substrate-binding domaincoord: 305..377