CsaV3_1G033850 (gene) Cucumber (Chinese Long) v3

Overview
NameCsaV3_1G033850
Typegene
OrganismCucumis sativus L. var. sativus cv. Chinese Long (Cucumber (Chinese Long) v3)
Descriptionphenolic glucoside malonyltransferase 1-like
Locationchr1: 20989365 .. 21018446 (-)
RNA-Seq ExpressionCsaV3_1G033850
SyntenyCsaV3_1G033850
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTAACTTTCCTTCTCTGAAACTAATTGAGGTCTGCAAAGTCTCCCCTCCTCCGACAGCGGCAGCGCCCTCATCTCTTCCCCTCACTTTCTTTGATTTGATGTGGCTTAGGTTTCATCCAATCCAACGTCTTTTTTTCTACGAATTTCCATCGAATGAGGTCTCGTTTCACGATGTTATTGTTCCGAAGCTCAAGAACTCTCTCTCTCTTACTCTTCGTCACTATCTTCCTTTGGCGGGCAACCTCGTTTGGCCATCAGAATCTGATGTACCCTTCATTGAATTTGCAGAAGGCGATGGAGTTTCCATGACGGTGGCGGAGTCCGACGACAACTTTTATCATCTTTCTGGTAACGGATTTCGGGAAGTTTCAGAGTTTCATCCTCTTGTTCCTCAATTGCCAGTCTCTCATAACCGTGCTGCAGTCATTGCTATTCAGGTTTTTAACCATTAATTCTTAAGTAGGTGATAATATGATATTAGATTTGCATTTGTTTGATAAATATTTCTAAAACATAAATATGTTTGCTTATTTGAAAATAATAAATTATTTATTTGTAAATAAACAAGTTTTTATTAAGGTTTTTAATGATAAAACGTTAGATATTGGTGGAAAAATTATAAAAACTAAAAATTAAAAAAAAAAAGTTCAATTAGTAAAAATTCTTTTCAAATAAGTTAACAAATTTATGTTTAAACTATATTTATATTACTGACATTTTGATACTTATTTTTTGAATTTCATGAACTTGTATATGGCTACAGCTAAGTTCCTTTTATATTAATTAGTACATTGGTAGAAAACATTTAAAAGAAACAAAAAGTCATTAAAAAAGGAAAGTTTCTAATTTCTTCTCCCTACTTGTTTTTACGAAATTGGTTGAAATTTCTACTTTTTGTTTGCTTAGATTGCCTTTTTCTTTTCTTCTTAGAAATTTTATTGACATTATTATTCACTGACCATCATAGACTAAATTGAGAGAAAAAAGAAAGAAAGAAAAAAAAACTAAATTGAGACTTTTTAAAGAACTAAATTGAAAGAAGAAAAAATTGGTAAAATATACTTTTGGTATTAAGATTTGAATTGGATGTCTGTTTCGTTCTAAAGTTTTAAAATGAACTCTCAACAATCGAATATATTACAATATTATTTGTCTATAGTGTGACACCTCTCCCCTCCACTCTGTTTTATCCATAGTCCATAATGTCAATATTCAACCTTTTCTTTATTTATTTTTTATCTAATTTTCTTTCTTTACTCCCAAATTAATCTCAACTTCCATAGCTGTCTATCTTCTTCTTTATCGTGAAATTTTTATTTTTCTTTCATCTTATCCCAAGCTATACTCCTATGGTTTTTTTTTTCTCACAATAAATTTGCAAAATGAAAAGAAATATTACCCCAAACTATAAAATGAAAGATTTGGTCAGTTAAGATTTCTTCTGTGAAACTTATTATGATTTTAAAACATATCGAACAAAAACAAAATAATGAATTTGGATGGAGAAAATGGAGAGATTATTACTCCAAATGCAAGTAAAAGTTGCAGAATTTGATGCTAACAAAGAAGTTTGGGGCTAGAAGTATGGAAATTTCTTTAACGAAGTGGGAGGTTGAGATTTGGAAAATAAAGAAACAATTAAGATTTGGAGAAAGAAAGGGGGCAAATAGCTAGAGATAAAAAAAGGAAACAAGATTGGCTAGATATTCTAGGTGATGATGAGTGGAATGGCTTCAAAAACTTAACAAAAAAAAAAAAAGTGCAGTACTATGAAGGAAGGTTGAAGGAGAAGAAGGGAGAGAATGAAAGAAGAGAAGAATCTCAAATCATAAGAAAATAATATTATAATATATTTTTGCAAAACTAATCATATTTAAGGGTCACATCATTTTTTCATTAGTCCACTAACTATGTAAGTAACTTTATGGGGCATTTAAAACTGTTCGTCGAATTTCGAAGACTAAAGTGTTCAATTTTAAAACTTGAAAGACTAAATAAACATTAACCTTACATCACATGAACTAAAAATACATTTTTTGTCAAAAAGAATAAATATGTAATGCAATAAAAACTTAAATGCATGGTTAAGTACTAAACCATTAATGTAGTATTATTTAGTTATGGAGTAAATAGAATTAGAATTATTACAATGAATATAACTTTTATGGCAAAATAAGCAAGTATATAACAACAATATCTTAAAAACAAATTACAGTTACAAAAAAATATATTCATTGGCTATATATGCTGTCATAATAGATTTCTCTATTAGTGGCACATTTTTCATTGATAAAATATTATTAGTGATAGACTTGTATTAGTGATATGGTAGATCTAGAAAGCCAAATATAAACTTTCGAATACTTGAAACTAGGTTTCACATCATTTATATTGTAATATTAGATGGGTTATTCCAATGTTGTGCTTTATAATAACTTATTATATGTGTGAATACATTAAATCTAATTTACTGTTGCATGCATTTAATTAATTATTTATATTTTCAATATAGGTAACAAAATTTCAAAACAAAGGTTTTTCCATTGGAATATCCAATCACCATGGAATCTTAGATGGAAGGAGCTCAACTTCATTTATCAAATCATGGGCTCAAATTTGCATTGAAGAATCCTTTATACCAACCCCCAAGCAAATGCCATTATATGATAGGTCAGTTATAAATGATCCAAAAGATCTTGCAAAAATCTATGCAAAAGCATGGAAAGATGTGGAAGGACCCAACAACAAAAGCCTAAACCTTAAATTCCCCCAAACAAAACATGGTTTAGTTAGAAGCACTTTAGAATTCACACACCAAAACATTCAAAAGCTAAAGGAATGGATCTTGAACAAGAAGATCAAAAACGAAAATTTTGATTCTTCTTCTCACATATCTTCATTTGCAATAGCAACAGCTTATCTTTGTGTTTGTACAGCCAAATTAGAAGGTTTAAAAGAAGGAAAATTGTGGTTTGGGTTTGCAGCTGATGCAAGAACTCGTTTAAAGCCACAAGTGCCATTGAATTACTTTGGGAATTGTTTGGTTGGTGGGTATACTTCTCTCGAGAGGTTTGAACTTTTGAGTGAAAATGGAATAATTTTGGCTTGTGATGAGATTTCAAAAGCTATTAGAAAGTTGGATGATGGAGCTTTAAATGGGTCTGAGAATTGGGGGTCAATGATGAGCCAAGCGACAAATGATTATTCAAAAATACAAGCCATTTCTCTAGCTGGTTCACCAAGATTTGGAGTTTATAACGCTGATTTTGGATTTGGAAAACCAAAAAAAGTGGAGATTGTGTCCGCTGAATCACCATACGTTTTTCCTTTGACTGATAGTCAAAATAGTGATGTGGTAATGGAGATTGGTGTTGTTAGGGAAAGGGATGAAATGGAAGCTTTTGTTACTATATTTAATCAAGGCTTTGAATCTTTTTTTAATGATCAACATTGGGATCATTGTTAGTTTCCAAATAATTTGTTGCTCTTTTGAGTTTTGATACATTTTTAATAATGATGTTTAATTTAATTAATAGTTTAAAATACTATTACTTTATGCATGTCTGATTTAATAAATTTAATTTGTTGTAAATTTATTGTTAATTTGCTTGTAGGTAAGGGCTATAAACATCTTAATTTGACATTATTTTGCAGTGAGAAGATTCATTATCTTATATTATCTATTTTTAGGAATGTTTTAAACAAGTTAAAAAAATATATAATTTTCATATAGAATTTAGGTGAAAATTTAAATGTTCATGTGAGAAAAATTAGATGAAAATCATGGTAAGAAAAACAATAAATCAAATGCAATGATCTAAATAGTCCGCCCCTTTTTTAGATTAAGTTTGTTGTTTGGGTTGTTCCCAAAATTTTGTAATCATAACGAAATTTGCATACTTTTTCTCGATTTTCATTTAGTTCAAGTTTATGTACCAAATAAACAATGTTTTTTTTTATTTGATTGTTTAGACTGTGGAACAAATATAAAAGATCGTAAAAAAAAATTATTGAAATATTATATAACTAAATCTAAACAATCGTGTATAGCTAAATTTAAATATCAAAGAATCTTTTAAAAAATTCGTTTAGATTTGGCTTTCCAAATTTAAAGAGCCAAATATAAACGATCTGTACCAAATTTAATTTAAATGATAGTTTACCAAAACAATCTTTAAAAAAACGAATTTAAACAATTATGTACTAAATTTAAACGCTCGTGTACCAATAATCGTTAATAAAATCGTTTTGTACTAAATATAAACGATCATTTCTTCAAAAAATTGAAAAAAATGATTGAGATTTGGCAACCCAAATTTAAAAAATTGTGTGAAATAAAAATTATTAAAAAATTATTAGATTTATATTTGATTTAAACGATCGTTTTTCAAATACTGAACCATGATTTCCAAAATATATTATATCGATGTCAATTAATTATGGACATTTTTTATATTTTCTAATGTGGATTTGTGAATTTTTTTCGTTTTTGAAATTGTTTTATACGTTGTAAATATTTTGTCACTTAGTTCTATTTTTTTTATTAAAAAAAAACCTATTTAAAATAAATATTTAAAAATATAATTTATAAAAATCAGCAAAATATTTATATTTTATAGTAAATATGAGATGTAGTAGAGAAAATGATTGTAAAGAATAAAACTCTTAAAAATAATTATAAAATACAGTAAAACTTTATTATATTTTATAAATGTTTGGATGTATTATACAAATGATACTACCGATATTTTTGCAATTCTGTAGGATTTTCCTTTCCAACGATTAAAAAGCAAAGACAAAAGCCAATTCGTCGTCATCTGTATTTTCTTTCAGTCGTTTTTGTACCTTTAAAGCACCACAAAGTTTTGAAAAAATCTAAACAATTATCCTCACTTTCTTTTTATTTCCGATGGCTAACTTTCCTTCTCTAAAATTAATTGACGTCTGCAAAGTCTCTCCTCCTCCGGCGGCCGCGGTGGTGCCGTCATCTCTTCCCCTCACTTTCTTTGATTTATTGTGGCTAAGGTTTCTTCCAACCCAACGTCTTTTTTTCTATGAATTTCCATCGAATGAGATCTCGTTTCACGATGTCATTGTTCCGAAGCTCAAGAGCTCTTTGTCTCTTACTCTTCAACACTATCTTCCTCTGGCTGGCAACCTCGTTTGGCCATCGCAATCGGATGTACCCGTCATTGAATTTCACGAGGGCGATGGGGTTTCCATGACGGTGGCGGAGTCCGACGGTGACTTTTATCATCTTTCTAGTAACGGGTTTCGAAAAGTTTCCGAATTTCATCATCTTGTTCCTGAATTACCTGTCTTTCATGATCGTGCTGCAGTCATTGCTATTCAGGTACTTAACCACAAATTCTTAAAGTAGGTAATAATTAATTGGTATCAAATTTGCATCTACATGTAGATGTTTTCAATGGATGAATCAATATCTCTTTTATATAAGCATCAAAATATCACGAATCTAAAACATACGTATGTGTACTTATTTGAAAATAATAAATTACTTATTTTATTTACTAATTTAATTTTTTTATAAGTTTTTCAAAAATGTTTGTAAAAATTTATGTATGCTTGGAAAAAAAATGGATGAGAATAACAAAAACTATATTTGTACTTAATTTATGAAATTAAGTATAAAATTTTTGAAGAAAATTATTGCGAATGAAAAAGCTGCCGACTTATCTACGAATATAAAAGAAATATCAAATTCTATTAAACATATGTTTGGTAAATATTTTGATTGATTTTTTTTTCTTGTATTTGAAAACAATTTAGGGTGTCGAAAAAAATTAAGCAATTATAATTATGTTTAATTATTGTAATTCATAAAATAGAATAAATTATTATATTAGCATCATTAGTAATAAAAATGAATACAAATGTCAATTATAGATGGCTTAGTTTAGAAATATTTATAAAAAATGTTTGAATTAAATAATAAGAAGAAAGAAGGAAAAATAAATAAAATGAAAAAGATAAGAAGTAAGAGAGAGAGAAATCATCCTATTTGGAAGGTTATGAAAAATCTGCATGGAAAGAGAATATGAAATGCCTTTGGAATAAGATGCTATCATTCCCAAGATTGAAAACTTACACCAAATATGGTGAAAAGATGTTATGCCTCATTCCCACAAAATAATATAAAACTTGTCCATCTCTATAGTTTTTTTTAAGTAAATCATTAAAATTCTTTTATGTAATATTTACTATTGTTAACATTATAGTTCAAACCAGACTAAAATTATAATCTACACCTAGGTTGTAAACTATTATAATTGAACTATCATAATTTAAGATTATAATAACAAACTCCCTGATCTAAACGTCTTTTAAAAGTTCTCGATTTCATTCATTGTGTTTAATGTATTTTAATTTTAGAGAATATACTTTCGAGAAATTACGTGTATATGAATAAATAGGTGCGATTAATTGAATACTCTCACTCTTTTTTACACTCATCGATCATTTTATTTTTTTAAAAAAATGTCACGTGCCACATTTTTATTGGGTGCTTGGATATGTGTGAAAAAGTTATAAAAACCTACCACTCAGAAACACCAAAATTAGTAAACATTATTTTCAAATAAGTTGAAAAAAACTATTATTTAAATTATATAAATATTAGTGACATTATTTATTAGAACATTGGTACAAAGGCATTTAAAAGAAAAAAAAAAGTCATTAAAAAGGGAAAGTTTCTAAATTCTTCTCCCTATTTGATTCTACGAAATTAGTTGAAATTACTACTTTTTCTTTTCTTTTTAAAAGTTTTCTTGACATTATTCACTAATCATCATAGACTAAATTGAGAAACAAACAAAAAGAAAAATACTAAATTGAAACTTTTTTAAAGGACTGAATTGAAAGGAAAAATAACGTACTGCATTAAAGACTTCAATAATGCATGGTTAAGTACTAACCCATTAAGGTATTATTTAATTACGGGTAATTATAAGAATTATTATAATAAATATCTTTTATGGCCAAATAAGTTAACCATGGTTAAAGAGTTAGAAGTACAGCAAAATATTCATACCTGATAGACACTATTGTAATAGAATTTTTTTATTAACGATATATCTTGAAATACTTGTAACTCTACCAATTACTTAATCATTTATATTTTCAATATTAGTGATAGACTATTATTAATGATTGAATATTCCAATATTTTGTTATATATATAAAACTTATTTTGCATTGTATTGTATTATATATATAACTATTTAAAGGTGATTGCTATATTTGCATATATACAATATATAAATTATTTATATTCTTAATATAGGTAACTAAATTTCAAAACGAAGGTTTTTCCATTGGAATAACCCATCACCATGCAATCTTAGATGGAAGAAGCTCAACTTCATTTATCAAATCATGGGCTCAAATGTGCATTGAAGAATCCTCTGTACCAATCTCGGAGCAAATGCCATTATATGATAGATCAATTATAAATGATCCAAAAGATTTTGCAAAACTCTATGCAAAGGCATGGCAAGACATAGAAGGACCCAACAACAGAAGCTTAAACCTTAAATTCCCCAAAACAATACCTGGTTTAATTAGAAGCACTTTAGAATTCACACACCAAAACATTCAAAAGCTAAATGAATGGATCTTGAACAAGAAGATCAAAAACGAAAATTTTGATTCTTCTTCTTCTGACATATCTTCATTTGCAATAGCAACAGCTTATCTTTGTGTTTGTACAGCCAAATTAGAAGGTTTAAAAGAAGGAGAATTATGGTTTGTGTTTGCAGCTGATGCCAGAATTCGTTTAAAGCCACAAGTGCCATTGAATTACTTTGGGAATTGTTTGGTTGCTGGATTTGTTCGTCTTGAAAGGTTTGAGCTTTTGAGGGAAAATGGAATAATTTTCGCTTGTGATGAGATTTCAAAAGCTATTAGAAATTTGGATGGTGGAGCTTTAAATGGGTGTGAGAATTGGGGGTCAATAATGAGCCAACTGACATATGATTATTCAGAAACACAAGCCATTTCTCTAGCTGGTTCACCAAGATTTGGAGTTTATAACGCTGATTTTGGATTTGGAAAGCCAAAGAAAGTGGAGATTGTGTCAGCTGAATCACCATATTTTTTTTCTTTGACTGATAGTAGAAATAGTGATGCGGTAATGGAGATTGGTGTTGTTAAGGAAAGGGATGAAATGGAAGCTTTTGTTGCTATATTTAATCAAGGCTTTGAATCTATTTGAAATGATCATCATACGGATAATTGTTTCCAAAGGTTTTTACTATTCCAATTTTATCCATGTGTAGTTTATATGACTTTTTTCATTTTAGCTATATGGGACTATATGGGACTTTTATGTATGTCTTATTTAATAAATTTTAACATCAGTTGTTTATAGTTAATTTGTGTGTAGTCATGGACTATAAACCTTTTAATTTGGTTTTAGAACTTAAGTTTGGTAAACGCTATTTTTGTTAAGAAGATTTATTTGTTTTATTATCCATGTTTCAAACACTCGAGTTAACTAAAAAAAGTAGTTAAAAAAATATAGAATTTAACTAAAAATTTAAATATTTATATGAAAAAAAAAGATAGTTATAGTATGAAAAACAATAGAATCATGTAAATGCAGCTTTTTTACGATTAAGTTTATTGTCTGGGCTGCATATGTTGTGTGATCGTAATGAAATTTGATTTCACGTTCACATTTTATGTAAATCATTTTTTTACATAATGGAATAAATGAAATAATAATGATACGAAACACATAATGTAATAAAAAGAAAACTCCTCTAACGTCAACAACTTTAAAACATTTTGTAAGGTTGTGGGCATTGTCACGATCCACCACGAGGAGTGTGTGGTATACTTCTAACAAGGGACTACCTATCATACCATGACTTTATTGGAGTTGTCTTATCATATATTGAATCAAAGGCATTAAGGAAGAGAAAACTCAATAAGTTAGAACAATAAATGCTTAATTGGAAGGAGTGGCTTACAATAATAATTTTAAAGGAAACATATTTTGTCCTTCATTTTCTTTAGATTCATTTATTTACTTTAATTTATTTTATTATATTTTGTTAAAAGAAATATGAATTTGAATTATTTTAAAATGAAAAAAGAAAAAATATAGGAAAAATCTCACGTTACCTTCTTCCTATTTTTCCTTAACCACTCTCAATATGCACTACTCTCCATTTTCTCTCGATCTTTTTTTCCATCAACACTTCCTCTGCCTTTGACTCACTTCCTTCTCTCTAAAAAACTATCACCTTATCCCCTATAAAAAATAGAGGGGAAGATTTTGTTGCGAGAGGTGCAAAAGAAAATTGTTTCCATTAGTTGTCAATTGGGAGGGTTGAGAAGATAGGAAGAAACTTTGTGCTATTTTTCACTTCTTCTCCCAATTAAATGATTCATACATTACTACCTACTTTTCATATTAATTTTATTTGGGCATTGTGATTTTAAATTTATTTATGCCAGACTTATTACGTGGTTCAACGGTTTTTTAGATTTAGGTCAATTTAAGTATCATGAGTTCAAATTTATTGTTTGGCATCGTGCATTATTTATTTGGCATCATGCATTATTTATTTGTTTGCTTGTGTTCTTTATTTGCGGTTATTTATCTCACTATACTATTTTTTGTCACAACATGTCTGCATATAATTATTTTATTTCACACCGAGATTCCCTATATATTATTTTATTTCGCAACCTCCTTTCATATTATTTTATTTCTCTCCCTACATATTATTTTATTTCACAGACTCACTACATATTATTTTAAATTAGTGAAGAAGCAATATTTTTTATGGGTTTTATAACTTAAATTTCATGATTAAGTTTCTTCATTTGAAATTTAATTAATAAATTATCTTTAAAATCATAAATAATTTTATATTTTAAATGAGAATTTTAAATGAGAACTGAGTAGTGTTTTTTATGAATTTATTAATTCTTAGATACATGGAATTTCTCATTAGACACCTATAATGGATAATAAAATATCAAAGTTTGATATTTTAATATTCTTGAGGTCAATAAATGAAAAAATAATCTTTTAAAATAAAAAAATAAAATTTATTTTTCTAATGACTCTTATGTCACCCATAATTATCAATTAAGGTTTAGGTTTTATTTTCTTTCACGTGAATTGAGAACAATGAGACATTTAAAAAAACAAAATAAAAATTTTTGTTTTGTGAACCTTTTTAATTTAGTTTAAAAGATAAATTTTGGTAATCATTTTAATGGGTGTTGTAGGGTGCTAACACATTTCCTACACTTAACTGACTCTCGAAGTTACATCTCATGAATTTGGGTAGACCATTTTTATTTTTGGGTGACCAATCACACCTTAGTTGGTGGTGGCTCCAAACCTATTTATTTATTAAAATTAAATAAACGAGAACATTGGCGCATTGCAACCAAGTCGCAACAACTTGACGATTCCATTGTGGACTTAAAAAGTGTTTAAGAGAGACAAATCGTTATATTTTTATCCTTTATTCTTAGTCTATTATGATTTGTTGGTTCACTTAATTATTTGTTTTTTGATTTGATTCATATGGAATATTAATTTGTACTATGTTATTGGTCTGTTTTTTTATATATATATCATCATGCATTTTACACATTCACAAGCCCTCTACTCGGGCTACAACCTTATAGGACTAGAATTGAGGAGGGAGTAGTGTTGACCTTCGAGGGGTTTCGACTTTTATACAATCACATGAAAGATCCTCGATCAGATTAATTCTTTTGTTATTTCTTAAATAACATTATAATAAGTTTGAGGGCATTTCCCTTCTAGAAGTTTAAATGATACCAACCCTTTTTAAATAACATTATAATAAGGGAATAACATTATAATATTTACCTTGTAGAGGTACATGAAAACCTTTACATTTTTTTTCAAATTAAATAAATATTTTTAAAAAACAATAAGGAGAGTAGCGTCTACCTTGAACAGGTGCATGAAAACTTTTCTTTTTGTAAAATTAAATGACTTTTAAAAAAAAAAAAAAATGAGGAAAAGAGTGTTTACATGGTAGAGGTGCATGAAAACTTTTCATTTTCTAAAATTAAATAATTTTTTAAAAAAATAATAAGAAGAGTAGTGCTTCTAAGTGCATGAAAACATTTACATTTTCTAAAATTAAATATTTGTTTAAAAAATAATAAGAAGAGTAGTGTTTACAAGTGCATCAAAACCTTATTTACTCTTTTTTTTTCTTTTTTTCTTTTTCTTTTTATTTTTTACTTTTTTGTGTGAAGTGAATTGACTATATGTTTTTATTTTTGTCTTTTTTTATCATAATTGTGCAAAAAAGTAATAGTATTGAAATGCCAAGTTGTCTAATATGTCGATGCATTCACCTTTTGAATCTAAGTTTGATGAGTCAAGCAATGTTCTGAAATGGACTAAAGAGATGCAACAAAAATTTAGGATAGCATAAACAGTTCTTCTCAAGTATTAGTACTATCCATGTGTCAGCTCTCTTCTACACAAAATGATTTAGCGAAACTAAAGATGATCTGGGAGGCATTGACACCTCAACGAAGATTCATGTTTTCAAAGAAGTATGGACATACAGTAGAGTTGATGTACATACCGGTGAATTATTTTGCTTTAAGAGTCATAATTAATATTTGAATCCAGATACGGTTGTTTCACGTTTGGGTTGTGTGATTTATTGTCAACTATAGAAGAATATCACGCTATGCTTAGCATGCCTGAAAAAAAAAGGGAAATTATTATTAATTTCTTTAACCCTAAGCAAATAACAAATAGGACTTTATCAAAGTTTTTAGAAACCGTCCATGCCACAGAAATTCAAAAGCATGTAAAAGTTCAGGGTTTGGAGGAGAATGTGTCGTTTGACTATTTAATAAAGATGGCACAAACTTACATTGATCAAGACAAATGTTTTACTCTCTTAGCGTTGTGCATTTATGGAGCGGTGATTTTTCCTAAAGTATAGGGTTATGTTTATGAGAAAGTGATCAGATTATTTTTTAAAATAGAACGAGGAGTTGATCCTATTATACCTATATTGGCTTAAACTTTTCAATCTCTTACCTATTGTAGAAATAAATGGGAAGAAAAATTGAATTGTTGTGTCCTTTTATTATATATTTGGATACACAACCATATTAAATTTCTAGCAGAGTTTAGGCGTCCAAGGATAGATTTTAGTAGATCATGAAATTTATTGTGGAACGTTGTTAGTGAGTTTGGTATGGCAGTTTGGGACCCAACATATCTAAGGATGGAGACTTGGGTTGGGTGTCTTTCTTTTCAACAATGAGTTTTGAAAATGTCATATGGAAGGCTCAATGGATGCCTTTGAAGGCGATGATTTATAGATGTGGAGATTTTCATAGTGTGCCTTTGTTGGGACCACAAGAAGGTGTTAACTCTACACCATTGTTAGTTTTGTGTCAAGTGTGGCTCAAACAGTTTATACCACCAACTCATAATCTACAGGACTCTGACTTTTCATACGATCTTGAAGATTGTCAAGAAAAAAATATCGAGTAGTATGTGCATGGAAGTCTGTGAGGAAGATAAAAGGAAAATGATACTATGAAGGAGTTACCAGTGAATACGAGGCATGGCAAGCAAACAAAAGGAAAAATGTAATAGATACCTTAAGGGATGTAGTTGATGTTGAAATTTATGTCCCAAAACTCGTGGTTTATAATCATATTCTATTCAATAAGTCGTTATTGATACTATAATCTTAAAGCCAATAAACTAAGGTCTCAATGCTATAATAGAGTAAACTTGAACCTTATGTAGAGACATAAATGTGGACCAAGTTTGAGTATATAGCCTAAACCGTCTATAGTATACGAATACGATTGGGCACCTTATTTTGAGGACACTATGGATGCGGCCTACTTTGTAGTTAGTATAAACGATATGATCCTGAATCGTTCATGTAGAGACATGAAAGTTGGGGCATCCTATGTAAAGAGTTTACATAAGATTGGAACCATGAAATAGTCATTTTTAAGTTATAAAACCGTTAACTTTGTAAAACTGACTATTTCGTTTATAGATATCCAATGTAACTTAATCTTAATCCTAAGCTAACTATAAAATTACGTTCACACGAGATTATTCATAGATCTGCATAGGTGAGGGCAACTCAATAGTGCTGGCCCAATAAGCCTCTCATTTCAAGGGTAACATTGGGTGGATAGCTAAGGACATAGGGTGCAAGATGGAATTCACTCCTACCCACTTTAGGGTTAGTAGATAGGTTGTTTCCTTAAGAACTAAATCTAGGTCTTGAACAAGGGGTCCCACCCTCTCATTGGCCCGAGAGGGATTCGGTTTAAAGGTTGGACCTTAAACCAATTGTTCAATAGTGGATCAGTGGGACTTAAGGAGAAAATATATAATATCAGGGGTAAAATGGTACTTTGACCCAGTCGAGATTATGAACAACCTATGAAGGATTAACTTACTTATCATGGTCATATCACATGGACACAAATATATCTATAGTCAGGGGAGTGCAACTACGGGATTTTAGTATAATGACCCGTTAGTTAACGAAGATTGATTAACTCAGTCTAAAGAGTTTAGTCAGTTAATCTCTGAACGTTGGAGTCTATGATCTATAGGTTCATTAGGTTCCCCCGCTAGTTCATAAGGAATCAACTTAGAACATTATGATGGAATAATTTGAATTGTTTGAATTAGGTAAATAGAGAGAAACCGACAAATATATTTGATACAGTTATAAGTTATAACTTAAGATAGAGCTTCATGTTTAAATGTGATTTAAATATGAATATGGATTCATATTTGGAAGCTTAGAATTGATGAAAAAGGTCAAAGATGTAAAAAGTTAAAATGTTGACTTTTGACTTTGAAAAATCAAAATTTGACTAACTTTATATTCAAATGTGATTTGAATTTTGGAGAAATGAATACGGATTCATGCTTGGGAGGTTGAAATTAGTCAAGACGGATAAAATGTTGATTTTGGACTTGAAAAAGTCAAACTTTGAATTTGATTTAAATGTTGAATTACCATATTGACCTTAAACTAATTTAAATAATAACATTGGTTGAGATTTGACAAGCTTATCACTTGCATGTGGTATGCAAGTGGCTTATTGCATGTAAGACACCTACCCCACTAAGAATGTCAAGTGGTGAGATACGGAAGCTCTTGCATGTAGTTTGTTTGTAAACAAGTTTTATGCAAATGTGAAAGGACTCTTGGATTTGAAATTGAAAATTAATTTTGTAATTGAGAATTACAAAATAAGACCAAGAACTCTTTTCATTTGAGATTTTTTCATCAAAAGGATATCATCAATCTCTCTCTATTTCCTCCACCATTTTTTAGTCCCACAACTCGGTTCCAAGTCCGAAGAATAGGAGGTAAACTCTAGTGGTTGTGCTGACTTCAAATTCGTGAAGAGAATACAAGAGAAGAAAAACTTCAAAGGTAAGTTTTTCTTTTAACCCTATCTATGTTATTTTAGGGTTTTATTTTTATGCATGTTAATTACTAAACTTGTTTAGTTGCAATTAGAGTAAAATTCGATCTTTATTTTGCTGCGCATGTCTATCGTTTCCATCAATTGAAAGGGTATAAGAGACAAGCTTTGAACAATCAAACTAGTGGGCTGAGAAGAGCATAAAATTATAAGAGAAAAATCAACTGTTAGAGCAAGAGAATGAGAAACTTCGTAAAGAGACAAGTCAATTGAGTGATCATGTGACTTTTTTGTAGAAAGAGCTCGAAAAGACCAAGAGTTTCTTAAGAAATCAAGATAAGTTAAAAAAGAATTTTGAGGTGTTAGATAAAGAGATTAGGCAAATGAGTAAAGCAAATAGAAGCTTGAAAAATGAAAAGACAACATTACAGGCAACAATAGAATCGCAAGATGAATATATTAAAGATTTAGAAATTGAGAAGGATATTTTCTCGAGTTGTCAATGATTTGAATACATCATTTGGAAAACAGGAAGCACATATAGTAGATTTGGAAGCACACAATCACTCTATACATCAAACAAATGATAACCCACATGTGAAGATAGTTGAGTACTCTGAAAAGTATAAAATACTGAAAAATTATGTTGATTCCTTACACTATCAACTTATTGCATTTCAAAATTCAAGTGAGGGGATAGTACAAGAATACGAATTATTAAAGACAGATTACATGCAAGTGAAGGTTGATTGTGATTTTGCAAAGAAAAGATTTCCAGGTGTTAGTTGAATGTGTAGATTAGACGATTAAATTTCTCAAAATAATGTCTAGAAGAGCAAATAGTTTTACAGAATGGGCAGCTAATTTAAGGATTAATTTTTTCTCAATGCAACCTCATGCAGATGATCTGAATAGGATTTTGAAGATGATATGTAGAGGACTTGGGCATTTTGATCGTTTTCATTAATTGCTTATTTTGCAATTAATGACAAATGAGAGTGTTCATTACATTTTGTTTGTTTTTTGTTTTAGTTTGATTTTTATTCTGTCAGGTTTATATTTTGTTTGGCTTTAATGTATTTGAATTCATTATTTAATAAATTTTCTCTCTCTCTTTCTCCCTTTCTCTTTGGTTTTTAAAGTTTTCAAAATCATCAAATGAAAACATCACTCGTCCTACTTCAATTCAACATCCTTACCATATGAGGCATAAAAATCGAATCATGGAAGAACAAGATAAAGACATGGATAAAATGAGATAAGAGATTAATAAGCTTGAGGAACAAGTTTCAAAAATACTGGAATTGCTTTCAATAAAAAAAAAGAGAAAGTCGTTGTAGATACAACACAATCAAGCAATCGAGTTCAAGACACCGATGATCCCATTTATCCTCCAAGATTTACCCATGCAATACTATATTGATATGAATTCGCTTTTTGCTATTCCACCTCCTATTCCAGAAATATAACAATTGGAAGCTCAGACTAAAATTCAAGACATGGGACAAAATGAGAACACTCCGGCTAAGCAAAAACTAGATGTTTTGGAAGAAAGGTTGCAAGCAATTGAAATGTTGAAGGGATTGACATTTATGAGAATATAGATGCAACACAATTGTGTTTGGTGCCAGATTTGATAATTCCAGCAAAGTTTAAAGTTCCTAAATTTAACAAATATGATGGATCATAGTGTCCAAGGAGTCATCTTATAATGTACTATAGAAAGATGGCAGCGCATATTGGTAATGATAAATTATTGATCCATTGTTTTCAGGACAATTTGATTGGTCCAATGACTCGATGGTATATTCAATTAGATAATGCACACATTCATGTTTGAAAGGACTTAGCTTATGCATTTCTAAAGCAATACAAACATGATATGAATATGGCACCAGATCGTTTAGACCTACAATGGATGGAGAAGAAGAGTTTAGAAAGTTCTAAAGAATATGCTCAATGATGGAGAGATATGGTTACAGAGGTTCAACCACCATTAACGGACAAAGAAATGACATCTATGTTTATGAGTACTTTGTGGGCTCTGTTTTATGATCAAAGGATTGGTAATACTACAACCAATTTTTCTGACATTATTGTTATTGGTGAAAGAATTGAATATGAGATAAAGCATAGGAGGTTAGTAGAGGCTTCAATTGAGTATGAGGATTTAAAGAAAGGAACAACATTTAAGAAGAAAGAAGGAGAGGTTCATGCAATTGGGTTTTCTAATTTAGGGAACCACAAATCGATTTTTGGCCAAAGAAAACATGACCAAAATTTCTCTTCATATATAAGCAATGTTTCTCATATCCCTTATAACAACTATGTACCAGCTCACTCTTTCTTTGGAACCCCAAAACCTGTTAACTCAAACTCTCATCGACCATTTGTATAAGGTCATGGTAGGAAGACCAACTCTGATACATGGCGATTTTGATTCAATTTCCATGACTTATACAGAGTTTTTACCCTAGCTAATTCAAAATCCATAGTTAGCTCTTATTCCAATGAATCATATACAACCTCCTTATCCAAAATAGTATGATTCAAATGCTAGATGTGATTATCATGCTGGTGGAGCATGACACTCAACTGAAAATTGTTTGGCCTTTAAAAGAAAGGTGCAATCTTTAATTAATGTTAGATGGTTAAGCTTCAAAAAAGTTGGTTGAAGCCGGATGTCAACAACAATCCACTGCCTAATCATGAAAATTCAAAGTGAACGTTGTAGATTGCCTTGTTGAAAAGTGTAAAAATAAAGTTCATGAGATAGTGACGTCTATGAAAACACTTTTTGAAGCAGGATATGTTAGTCAGGAATATTTAGACCCCAACATAAGAAATGAAGGATACGATGGAAACATACATTGTATATTTCATCAAAGAGTTGCAAGTCACGTTGTCTAACTGTGTTGTAAATTCAGATCTAAAGTACAACAACTTATGGATTCAAATATACTCATACAGAGGACAAGGAAATGATGAGATAAAAGACAATAAAATATGTGCATCAATGGATGAAGTTGCAAAAAAGGAAAAATCCTTTTTACCAAGGCCTTTGACGGTCTTTTATCAAGAAAGTCGTAATGAGTCAACTTCCTACAATCCTAAATAACCCACGATCCAGTACCGAGTCCTTTCAAGTTTAAGAATGTGAAAGCAGTGTCATGACGATATGATTTTCAAGTTGTCATTAATACAGGTCCTTCAGTTTATAATCTTACAGAAATCATCGAGATAACTCAAAGTGGAAGATGTTACAAACTAGATAATTTAATAGCTCCTTCAAGTAGTTTTATAATGGGGCAATAGAGGAAAAATTAGAGAAGAAATGTGAATGAACATTGCAAAGAGCAAGATGTAGAGATATCTATCAATAGCAAAAGATGTAGAATACAAGAAGCCTATTACGGATGAGGAAGCAAATAAATTCTTGAAAATAGTAAAACAAAGTATATAAGTATAAGATCATAGAGGAAATGCATCATACTCCAGCTCGAATTTCTTTATTATCTTTGTTTTTTAATTCAGAGTCTCATCGCAAAGTGCTATTAGATATTTTGAACAAGACACAAGTTGGACATGACATTTGAATGAAAAAGTTTAACAACATTATTGGAAACATTACGTCTTCAAATTTCATAGTCTTCACGGATGATGACTTTCCTCTTGAAGGTTTAAGTCATACAAAAGCATTGCATATTCAAGTTGAAGGGATAAAAGTCCATGACAACGGAGGATTTTACAAAATCATTCCTCAATTTAATTTGGAAATTCTAAAATATCAGTACATTGGCAAAATGATATAGAAACAAAATTTTTTATTAAGACTTAAAGGCTAAGTATGTTTTCTAATTATAGATTAGAGAGAGAAAAATTTACTTGTTTGACGTCTCTGAATCTACAAAATAAATCTCCAAATTGGGACACTATCCGTCGGTGACCTTCTTATCCTCTAGGATAAGATTGGTTTGTGAGAACCAATTGGTATGTGAAAAAATTTAGAGATTGTTTAGAGAAACTCTTTTACCAAAAAACAATTTTCCTTCGCATATTTATAACTCTCTTTTCTTCAGGGACTAGGAGATAATCTTGTGATTCTATTAATTTAAAAATTTTAAATCAATAAAAATTTTAAATCTATTAACTAATTAACCCTTAATCAATTAGTTAATAATTAATTATATCATATTTAATTAATATTTTATTTAAATCTTATTTAATGAATATCTCTCCGATATCTTATAGTTTTATATTATTTTAATTAAATAAAATAAAAATAACTATTAATTAATTAATAGCTAAATTAAATTTGAATCATATTCAAATATAAATTTATATCATAACAGATAATTTAATATGAATTCAATTCATATTAAATTAATATTTGAACTCATTCAAATATTTTATCATAAACTAATTTTGAATCATATCCAAAATTAAATTTATATAAAAAAATTTGACAACCTTTATATTATAATGTATCACTATACATTAAACTAATTTTCAAAGTAATTTTGAACATTTCAAATTACAACCAACATAAATAAATCTCATTACCCTTTAGAGGCTAGGGGATCTAATGAACCTATAGATCAGAAGCTACAACAATCTGAGATTAATTGGCTAAACTCATTAACTACATTAATCAATATTCGTTAACTGTGTGTACATTCCACCAAAGACTCATAGCTGAACTCTTCTCACTGTAGATATATTTCTGTGTCCACGGATATCGACCAATAACAGTAAGTTAGTTATTCACGAGTGTTTGTAATACCAGCTAGATCAACTTACCATTTGATCCTTGGGTTACCTCTAATATTTAAGTACATGTGCTTCTCTAATGAACAACATGTTTATGGTCCAACCAATAAACATAAATTTCTCTTGTATCATAGAGAGGGTAAGGTCATTTGTTCAAGTCCCAAAAACACCATTTAAGGGACCATCTACGTACCCTAAAGTAGAAAATGAGTGAATTTCATCTTGCGTAAATTATGTTTCCAACTCTCCACTCGATCTTGTCCCCAAAATAATAAGATTATTGAGTCGGAAATTTGGTCACTCTCATTTGTACAAAACAAAGAACAATCTCTCGCGAACAAGAGTTCATAATACACTCATGATTAAGACTAAGTTAACTAGGTCATCCTAATGAAATAGAAACCTAACTAGTAAACGGAGTTATAACTAGTGGTTACTATTTTATGGTCTGGTCTTATGCAAATCCGTTGCAAGATACCTTCACTTGCATATCAACTACACAAATGAAATCATTGCATTTGTATCAAATACAAAGTGAGTCGTATCCATAGTGTTACCAAGATAAGGTACCCAATCTTATCCTTATACTATAGACCCTTTAAGCTGATCTCGAACATTGATTTCTATATGTCTCTACATACTGTTCAAGACTCATCAAACATCTTAGGATGTTAGTTTATTGGATTTGGGTTATTAAGACAAAACTATTAATATAATCAATAATATTTATTGCAATTATAAAAATAACACTTTCTTAATAACGGTCAATGAGTTATATGTACAATCTACGAGTTTTAGGATATAAATTCCAACACAAGTAAAGTGCAAAGACTATGCAATAGCAAGAGTTTTAGTGGATAGTGGTTCATCTCTTAATGCAATGCCTAAATCTACACTATTCAAGCTTGCAATGGACATGTCACACATAAAATCAAGCACTATGGTTGTGAGAGCTTTTGATGGGTCACGTAGAGATGTAATGGGTGACATTGAATTACCAATCAAGTTTGACTTATGTACTTTCAATATAGTCTTTCAAGTAATGGAAATAACACATACTTACAGTTTTTTATTAGGACGTCCTTGGATTCACTCTGTAGGGGTGATGTCATCCACATTGCATCAAAATTGAAATTTATTGTTGAGTAGGTGATTTGTTTGATGGGAGAGGAAAATTTCTGATAACAAAATCGGTATCAACCCCATATGTTGAGGCGACAGAGGAAGCATTAGAATACTATTTTTGCTCTTTTAAAATTGTTCATGCAACTATAATGAAAGCAGCTATAAATGAAGTAATAAAGTCACATGGGCCTAAAGTGGAAGTTATGATCACTAGGGTGATGAGAAGTGGAAGATATTGTTTGAATCAAACTTAGAAACACTTCTAAACACATCGAGCAATGATGAGATGTTTGATTTGAGCTAAATGTCATTCGTATATGATAAGATTAGGCTTGAAGAAGAAAATAAGAAAAAGCGTTCGACAAAGCTAGAGATGAGGGAGTTTGATCCCAAGTCTAAAATTTATACCAACATTATATGATACTTTCAAGAGTGCTGGTATAAGTCACTCATCACATGATTCTGATTCAAAGTACTGTTTGTTAACAAAGATGGAGAGTTTCTCAATTACAACAATGACATAAGAAGCATCATTTGAAGGGGACACAGTCTATGCATGTCCACTTGATTTCGAGCTTAACAATTGAGATGTTGTAGATTTACCTACATTTTCAAGAGAATTGCAAGAGTAATGATTAAACATTTATTTGTCAAATTTCTTTAATTTTCGAGTGACTCTACCCAAGGTCATAGTACGATTTGGGGTGTGACAACCGTCTCTCTGTATATTATTTTATTTCGCAGCCTTCGTGCATATTATTTTATTTCACAGTTTTCGTGCAACTATTTCACACCTATATTATTTTACAACTATTTCACAACCTGTATTTCTCAATTTCACATTATATTGCTACTATTATTATTATTTATTTATTTATTTTCTAATATGAATATAAAAATTTAAATATGTAATGTTTTTTATGGGTTTTATAACTTAAATATCAAGAATATATGAAATTTCTATTTCTCGGGTTGAAATTTAATTAATAAGTTATCTTTAAAATTATAAACAATTTCATATTTTAAATGAGGCTAAGTATTGTTTTCTATGAATTTATTAATTTTATGTACATGAAATTTCTCATTAAACACTTATTATGAAGATTAAAATATTAAAGTTTGATATTTTAATATTCTTAAGGTGAATAATAATAAAATAAAATCATATTTTTACTTATTATGAAGATTAAAATATTAAAGTTTGATATTTTAATATTTTTAAGGTGAATAATAATAAAATAAAATCATATTTTGAAACAAAAAATAAACTTCATTTTTCTAATTGCTCTTATGACACTCATAATTATTAATTATTGCTTAGGTTTTATTTTTTTCGGTGTGAATTGAGAACCATGAGACATTTAAAAAGATAAATAAAAAAACCTTGTTTTGTGAACCATTTTAATTTAGTTTAAGAGATAAATTTTGGTAATCATTTTTATGGGTGCTGTAGGGTGCTAACACGTTCCTAAAGACTCCCGAACTTAAATCTCATGAATTTTAGACAATTTTTTTAAAAAAATTTTTAGATGACCAATCACACTTTAATATAGTTAGTGACGAATTCAAACCTATTTAATTATTAAAATTAAATAAATGAACATGTTAGCTACTCGACATTAAGACTACATCGCAACAAACAGTTTGAAAGAATTGTTAGAATCATTCTTCCAACAAAAATAATTAGAGAAGAAACACATTTTAGTATTAATTAGAGAGGAATTCCTTTTAATTAATTTATGTTTATCAAGGTGAAAGCACAATTAGGCTGCAAATATCCATAATCAAATAAATTAAGTTTATTATTGCAATTGATTTTGGTTACTTTTATTTGGAAGGTCTCTTTCACAAAGCTTGAAGATTTAAGTGGTGAATTTTTCTATCAAGGTTGAGAGGATTTTTCGATCTCTATTTCGATAACTTTATAAGCTAAGATAATGAAATTATTAATTTGTTGGCAAAAATTGTTAATTTAATTTGGTTACATAGGTGAAATTGTGGTTTTATCTGCCGTAGGAAACTTGTCATAACCAAATATAAGGGTTCTACTATTGGATCCTCAATTGAGGGTTAGATTATTGTATGATAATAGTAATACAATTATATGCGTAGTATGTGATATGTTTGTGATGGTCAATTATTATATAGCCTATCGTTACTAAAGCATACCAATTGGAATGGGCTAAAGTTGTAAAGAATAATTAATTGGATATGTTTAATGTGGGTTATATATATATAAGTTAAAGCTTTTGATAAATTAATTTTTGTATTAAAATTTGGATTCTTTTTCTTCTAGAAGTTTTTGAAAGATTAATTTGCGAAAATTTTCCCATTAGTCATTAAAGTTAAATTAAAGATAAATTAATTTCAAATCTAATTGGTAGATTTTTTTTAAGGCTTAATTGGAAACTTTTGTAGGAATTAATTTGTGAAAATTTTCCATCAACCTTAAATTGATTCAATCTTACTTAAGTTATTGGATAAGTTAAGTTGAAAAATTTTAGATGCTAGCTAGCCATTGGTAAATTTTATTAATTAAGGATATGAGAAAAAAAATTTACTATTTAGGATCTGTTTTAATATAATTGAAAAAAATTACTATCAGCAAAGGAATACTTTTAGCTAATATCAGGTAAGAGATTCTACTATTGAACCTTCGAACTAAATTTAAGACTCTGCACGTATTTTATGATTATGTATTGATAATAACTACAAATACATAACGAGATTTTGTTGTTGATAGTTGGTGTATTATGGTGATTATCGATGTTTTATGAATTGCATGATTATGAATTGCTATGACTGATGATTCATGTTGATTGTCACATGTTTTTGGACTATGATGTTTGTTAATTTTGGTTGTTAGGCTTTTACCCCACATGTATTGTGTTTCTACCTTTGGGGTCACTCACACGACTAGAGGCGTTCCTACGGGATCACTTGCACAACGTGTACCTAAATATCATATGCACGGTTAAGTATCAGTTGTATTGGGTTCAAATGGAGTCATTCGCACGACTAGAGGTGTGTTTTAACAGAATCACTTGCATGCCTAAGGGTGTTTTGACGAAATCACTCGCACAATTAGAGGTGTACCTATGGATCACTCGCACGACTAAAAGTGTTCCTACGGTACCACTTGCTCACGTGTGCTCCATGGGACAAACGATATTATTATTACCTTATGGGATCTCCTAATGAGAAGTTAATAGATCACTTAGAGGGGCCAATAGGTGGATCCGTTGCTGGCTATATTTTTATATACTCGCCATTTTTCTATTTTTAATATTTCAGGCAATGGTAGAGGATCTGAAAAGTTGGTCAACGACAGTAAGGGTCCATGACATGCCATATGGAAGGATTCAATTTTGCTTTCGCTTTTCATTTATTAATTTTTAGTCTTTTCGATTGTAAACATTTTATTTCTTTATTTATTTTCTTTAATTAGTTAAAATTAGCTCGAGTTTGTTTTCATGGAATTTTATAGATTTCGCTAAATTTTGTTTTATGAACAGTTGTTTGAAAATTATTTTAATAAAGATTTAGTTTTTCTCTTTACAAGAAAGTGCCAAAATTGATATTTATCTTTTAGGTAAGAACCTCAGCTTAATTTAAGAGTTTGAGTCATTACAAAATCCTTCTCTATGAGTTTTAAATACATAAATATCCTGAAGAAACAAATTAAGGTTCGATTTTGATAGATTTAATAAAAAAAAAATACAAACTATTGAGAATGATGAGGGAGCTAAAAGGGTATCGACCTCGCCTACCAACCATACAAGTATCATTTTCTAAAAAAAAATACAAACAATTTACACTGTGGCAAAAAAATCACTAAAAAACCCAAACTCATTACACTTTTGTCACAAAGTGTAAATAGTTTGTCCATTTTACTTTTCTTTTACAATTTCTCTTATTTAAACTCATTTGATAAAAGACCATTAATAATACACTTCAAAAGAGATGACTTTTAATATTACATTTGAAATGAATATTTAAAAGTATAATTTATAAAAGTTGTCAAAATATTTATATTTTATGACAAAAATCAGATGTAGTAGACATTTATATTGGAGAAATTATTTTAAATGACAAAATAGTTGAAAATACTTACAAAATATAGTAAAATTTAGATTATATCAATGATAGACACTAATAGACTTTTTTTGTCTTCAATACCAATAGTGATACATCTAAAATTCTGTTATATTTTATAAATATTTTGATTTATGTTGGTGTATTTTACAAATGATGATATTTTTGAAATTCTAGAAGATTTTCCTTCCAACTAGTGAAAAACAAAGATTAGCCAATTATTTCCCTAATAGTAGATTAGCCAATTATTTAACAAAAGTTTTATCCATTACCATTTTTTATATATAAATTAAAGCTTAAATTACAAATTTAACCCTGCAACTTGAAAAGGAAAAATGATAAGACATGAAGATTTTAATTATACCAAGATAGAGTAAATTTGGAATTTATACCGAATTGAGATTGACTTTGGTATATTTAATGCATTTCTCACCATGTGAATATCATTCTTTAAATGGTTTTACTTTTTCAAACAATAATTGAATGCATTTCTCTTATATTTAATGAATAATCGTACACTAGTTAATATGTGATATATAAAGACAATTAAATATGAATTGAAATTGAAAGGACAGGTATAATATTACATTTGAAATACGTATTTTAAAAGTATAATTTATAAAAATCGACTAAATATTTATATTTTATAATAAAAATCAGATGTAATAGACATTTACGTTAGACAAAACTCTTGAAAATATTTACAAAATTCAGTAAAATTTTAAATTTTATCGTTTGATTTCGATGGATTTAAAATTTTGCTATATTTTTATAAAAGTTTTAATTTGTATTCTATATTTGAAAATACACCTATATGTTATAAAAAATTATTTTAAATAATAAAAATTCTAAAAAAATTTAGAAAAATTAGAATCTATTAGTATTTGTACTAGAAAGAGTAAAAATATTTTTGGTTTATTCGTTCATAATGATATATACAGGCTACAGGCTACAGGCTACAGGCTAGAGCGATATTTTTGAAATTCTAGAAGACTTTCCATTTCATAATTTCAAGGAGTAAAAAAGACGAAAGCCAATACGTCATGATTTGTTTGTTGTTTTCAACGTTTTTGGACCTTTAAAAGACCCCAAAGATTTGAAAGGATCCAACCCATTATCTTCTCACTTTTCTATTATTTCCGATGGCTAACTTTCCTTCTCTGAAAGTAATTGAGGTCTCCAAAGTCTCCCCTCCTCCCACAGCGGCAGCGCCCATATCTCTTCCCCTCACTTTCTTTGATTTGTTCTGGTTAAGGTTTTATCCAATTCAACGTCTTTTTTTCTACGAATTTCCATCCAATCAGATCTCGTCTTACGATGTCATTGTTTCGAAGCTCAAGAGCTCTCTTTCTCTCGCTCTTTGTCACCATCTTCCTTTGGCTGGCAACCTCGTTTGGCCATCGCAATCCGATGTACCTGTCATTGAATTTGTAGAAGGTAATGGGGTTTCCATGACGGTGGCGGAGTCCGACGATGATTTTGATCATCTTTCTGGTAACGGATTTCGAGAAGTTTCGGATTTTCATCCTCTTGTTCCTGTATTGACAGTCTCTCATGATCGTGCTGCAGTTATTGCTATTCAGGTTCTTAACCACTAATTCTTAAGTAGGCGATAATAGAGTATCAGTTTGCATCCGTGTAGATATTTCCAATATGTGAACTAGTGGCGGATCTATAAATTTTATACAGGGAGCACCGAGCACAGTAAAATAAACACGGTTTAAAAGATATCTCAACGTTTCAGAAATTGTAGTAATAAAGAAAGACTGGAAATAGGTTAAGATTTTTTTCGAGATCAGTTTTTTTCTCTTTTTAAAATATTATCTCTCTTTTAAATCAAATTATTTTCCTTCTAAGAAATCAAATTATCTCCCTTTTAGAAAATCATTCAAATTTAGAAATAAATCGGATATTTGAGGAAATAAAATTATTGATGATTGAGAAAAGAAAATTAGTGATGTTTTGAATAAAATTTATTTTATGAAAATCTAGAAAGAAATTTAATTTTTGAGGAAAGAAAATTATTGATATTTGGAATAAAATCTATTTTGTAAAAATAATTGTAACAGACAAGATTTAGGGGCACAACCATTATAGCATAAGCAGAATTTAGGTCAAATAAAGTTAATATATATTGTAATCTAATCCTTCAAAATTAATATTAAAATAAAAAATCGATAATGTGAGGGGCAGGTGCCTATAATAAATTCGTTCTTAATGTGAATTATATCCTTCTAAAATTTAGATATGTTTAGTTATTCGAAAATAATAGATTACTTATTTTATTTAATAATTTATTATTTTTATATTTTGGGTTGTTTTTATAAGTTTTTAAAAAATGTTTGTAGAAATTTATGTATACCTTGTGAATAAGGTATCTCTAAGAACAAAATGGATGAGAATAAGTCCTTCGGAAATTAAAGATGTTTGAAAATTTATGAAATTAATCATAAAAAGTCTTAGAAAAATGATGGTAAATGACATAATCTAAATAAAATTTACTGTATTTTATAAAAGCTTTTGTTTTTTTGCCTTTTATCTGAAAACAATAAATCATGCATATTGCCAAATAAATAAAGAACAATTATACTTACGTTTAGGTTAATTTTGGTAAGAAAACACCAAAAGATTTTAAGATGCAGTAAAATATTTTTAAAATTTTGAGATTTTGTGTTCAATATTGCAGATAATTATCACTTTATTTCTTTCTTTATTTTCTCTTTCTTTCTTCTGCAACAAGAAATTGTTGGAATTAATAATAAGAACAATCAATCGTAGACAATCACTATATATTCAATTTAGATAGGTTGATATAGCGGAATATCACAATATTTAAGTTTTTCTATTGCCTTTTTCTTATTTTGTTTTCCCACCTTATATTTTCTTCTTCATCACTTTCATTTGATGTTTTATTATCTTTGTTTTTCATCTTCAGCTAGTTCTTCTTCTTGTTCATCTGTTTCAATTTTAACAAAAAGAAAAAAAGGTTTCTAATTTCTTCCCCCTGTTTGTTTCTACGAAGTTAGTTGAAATTTTTACTTTTTGTTTGCTTATAAGATAGCTTTTTTTTCTTTTCTTTCTAAAAAATTTCTTGACATTATTCACTAACCAAAAATGCTAAATTAAAAATTTTTAAAGGACTAAATTGAAAGGAAAAAACATACGTACTGCATTTAAGTCACTTAAATGCATGGTTAAGAACCAAACCTTATTTAATTAAGATTAGTTATAACAATTATTATAAAGAATATAACTTTTATAGCTAAATAAGTAAATGTATAACAACCATGCGTTAAAGTTGAATTATACAGGTATTGTAAAATATTTATCAATGATAGACATTCTCGTTATAAATTTCTCCATTAGTAATACATTATTTCTTTCATTGATAGAATATTACTAGTAATAGACTTGTAATATCTCTAAAATATCACTAATAGCTAGATAGATTATAAAATCCAAATATAGATCTTGGAATACTTGCAATTGTACCAATCACTTAATTATTTATGTTTCCAATATTAGTAATAGACTATTATTAATAGACGACTATTATTAATTGATTAGTGATTGGTTATTTCAACGTTTTGCTATATATATAACTTTGGATTGTATTATATTTGTAAACACTTTAAATTTGATTGCTATAGTTGGAAGGTAACAATTAATTAATTATTTATATTCTCAATATATAGGTAACCAAATTTCAAAACAAAGGTTTTTCGATTGGAATAACCAATCACCATGCAATCTTCGATGGAAGAAGCTCAACTTCATTTATCAAATCATGGGCTCAAATTTGCATGGAAGAATCCTCTGTACCAACCCCCAAACAAGTGCCATTATATAATAGGTCAGTTATAAATGATCCAAAAGATCTTGCAAAAATCTATGCAAAAGCATGGAAAGATGCGGAAGGACCCAACAACAAAAGCCTAAACCTTAAATTCCCCCAAACAAAACATGGTTTAGTTAGAAGCACTTTAGAATTCACACACCAAAACATTCAAAAGCTAAAGGAATGGATCTTGAACAAGAAGATCGAAAATGAAAAGTTTGATTCTTCTTCTCACATATCTACATTTGCAATAGCAACAGCTTATCTTTGTGTTTGTACAGCCAAATTAGAAGGTTTAAAAGAAGGGAAATTATGGTTTGTGTTTGCAGCTGATGCAAGAACTCGTTTAAAGCCACAAGTGCCATTGAATTACTTTGGAAATTGTGTGGTTGCTGGGTTTGTTGGTCTTGAAAGGTCTGAACTTTTGAGTGAAAATGGAATAATTTTGGCTTGTGATGAGATTTCAAAAGCTATTAGAAATTTGGATGATGGACCTTTAAATGGGTGTGAGAGTTGGGGGTCAAGGATGAGCCAAGAAGTGACAAATGATTATTTAAAAATGCAAGGCATATCTCTAGCTGGTTCACCAAGATTTGGAGTTTATAATATTGATTTTGGATTAGGAAAGCCAAAGAAAGTGGAGATTGTGTCAGCAGAATCGCCATATGTTTTTTCTTTGACTGAGAGTAGAAATAGTGATGTGGTAATGGAGATTGGTGTTGTTAAGAAAAGGGATGAAATAGAAGCTTTTGTTGGTATATTTAATCAAGGCTTTGAATAA

mRNA sequence

ATGGCTAACTTTCCTTCTCTGAAACTAATTGAGGTCTGCAAAGTCTCCCCTCCTCCGACAGCGGCAGCGCCCTCATCTCTTCCCCTCACTTTCTTTGATTTGATGTGGCTTAGGTTTCATCCAATCCAACGTCTTTTTTTCTACGAATTTCCATCGAATGAGGTCTCGTTTCACGATGTTATTGTTCCGAAGCTCAAGAACTCTCTCTCTCTTACTCTTCGTCACTATCTTCCTTTGGCGGGCAACCTCGTTTGGCCATCAGAATCTGATGTACCCTTCATTGAATTTGCAGAAGGCGATGGAGTTTCCATGACGGTGGCGGAGTCCGACGACAACTTTTATCATCTTTCTGGTAACGGATTTCGGGAAGTTTCAGAGTTTCATCCTCTTGTTCCTCAATTGCCAGTCTCTCATAACCGTGCTGCAGTCATTGCTATTCAGGTAACCAAATTTCAAAACAAAGGTTTTTCGATTGGAATAACCAATCACCATGCAATCTTCGATGGAAGAAGCTCAACTTCATTTATCAAATCATGGGCTCAAATTTGCATGGAAGAATCCTCTGTACCAACCCCCAAACAAGTGCCATTATATAATAGGTCAGTTATAAATGATCCAAAAGATCTTGCAAAAATCTATGCAAAAGCATGGAAAGATGCGGAAGGACCCAACAACAAAAGCCTAAACCTTAAATTCCCCCAAACAAAACATGGTTTAGTTAGAAGCACTTTAGAATTCACACACCAAAACATTCAAAAGCTAAAGGAATGGATCTTGAACAAGAAGATCGAAAATGAAAAGTTTGATTCTTCTTCTCACATATCTACATTTGCAATAGCAACAGCTTATCTTTGTGTTTGTACAGCCAAATTAGAAGGTTTAAAAGAAGGGAAATTATGGTTTGTGTTTGCAGCTGATGCAAGAACTCGTTTAAAGCCACAAGTGCCATTGAATTACTTTGGAAATTGTGTGGTTGCTGGGTTTGTTGGTCTTGAAAGGTCTGAACTTTTGAGTGAAAATGGAATAATTTTGGCTTGTGATGAGATTTCAAAAGCTATTAGAAATTTGGATGATGGACCTTTAAATGGGTGTGAGAGTTGGGGGTCAAGGATGAGCCAAGAAGTGACAAATGATTATTTAAAAATGCAAGGCATATCTCTAGCTGGTTCACCAAGATTTGGAGTTTATAATATTGATTTTGGATTAGGAAAGCCAAAGAAAGTGGAGATTGTGTCAGCAGAATCGCCATATGTTTTTTCTTTGACTGAGAGTAGAAATAGTGATGTGGTAATGGAGATTGGTGTTGTTAAGAAAAGGGATGAAATAGAAGCTTTTGTTGGTATATTTAATCAAGGCTTTGAATAA

Coding sequence (CDS)

ATGGCTAACTTTCCTTCTCTGAAACTAATTGAGGTCTGCAAAGTCTCCCCTCCTCCGACAGCGGCAGCGCCCTCATCTCTTCCCCTCACTTTCTTTGATTTGATGTGGCTTAGGTTTCATCCAATCCAACGTCTTTTTTTCTACGAATTTCCATCGAATGAGGTCTCGTTTCACGATGTTATTGTTCCGAAGCTCAAGAACTCTCTCTCTCTTACTCTTCGTCACTATCTTCCTTTGGCGGGCAACCTCGTTTGGCCATCAGAATCTGATGTACCCTTCATTGAATTTGCAGAAGGCGATGGAGTTTCCATGACGGTGGCGGAGTCCGACGACAACTTTTATCATCTTTCTGGTAACGGATTTCGGGAAGTTTCAGAGTTTCATCCTCTTGTTCCTCAATTGCCAGTCTCTCATAACCGTGCTGCAGTCATTGCTATTCAGGTAACCAAATTTCAAAACAAAGGTTTTTCGATTGGAATAACCAATCACCATGCAATCTTCGATGGAAGAAGCTCAACTTCATTTATCAAATCATGGGCTCAAATTTGCATGGAAGAATCCTCTGTACCAACCCCCAAACAAGTGCCATTATATAATAGGTCAGTTATAAATGATCCAAAAGATCTTGCAAAAATCTATGCAAAAGCATGGAAAGATGCGGAAGGACCCAACAACAAAAGCCTAAACCTTAAATTCCCCCAAACAAAACATGGTTTAGTTAGAAGCACTTTAGAATTCACACACCAAAACATTCAAAAGCTAAAGGAATGGATCTTGAACAAGAAGATCGAAAATGAAAAGTTTGATTCTTCTTCTCACATATCTACATTTGCAATAGCAACAGCTTATCTTTGTGTTTGTACAGCCAAATTAGAAGGTTTAAAAGAAGGGAAATTATGGTTTGTGTTTGCAGCTGATGCAAGAACTCGTTTAAAGCCACAAGTGCCATTGAATTACTTTGGAAATTGTGTGGTTGCTGGGTTTGTTGGTCTTGAAAGGTCTGAACTTTTGAGTGAAAATGGAATAATTTTGGCTTGTGATGAGATTTCAAAAGCTATTAGAAATTTGGATGATGGACCTTTAAATGGGTGTGAGAGTTGGGGGTCAAGGATGAGCCAAGAAGTGACAAATGATTATTTAAAAATGCAAGGCATATCTCTAGCTGGTTCACCAAGATTTGGAGTTTATAATATTGATTTTGGATTAGGAAAGCCAAAGAAAGTGGAGATTGTGTCAGCAGAATCGCCATATGTTTTTTCTTTGACTGAGAGTAGAAATAGTGATGTGGTAATGGAGATTGGTGTTGTTAAGAAAAGGGATGAAATAGAAGCTTTTGTTGGTATATTTAATCAAGGCTTTGAATAA

Protein sequence

MANFPSLKLIEVCKVSPPPTAAAPSSLPLTFFDLMWLRFHPIQRLFFYEFPSNEVSFHDVIVPKLKNSLSLTLRHYLPLAGNLVWPSESDVPFIEFAEGDGVSMTVAESDDNFYHLSGNGFREVSEFHPLVPQLPVSHNRAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIKSWAQICMEESSVPTPKQVPLYNRSVINDPKDLAKIYAKAWKDAEGPNNKSLNLKFPQTKHGLVRSTLEFTHQNIQKLKEWILNKKIENEKFDSSSHISTFAIATAYLCVCTAKLEGLKEGKLWFVFAADARTRLKPQVPLNYFGNCVVAGFVGLERSELLSENGIILACDEISKAIRNLDDGPLNGCESWGSRMSQEVTNDYLKMQGISLAGSPRFGVYNIDFGLGKPKKVEIVSAESPYVFSLTESRNSDVVMEIGVVKKRDEIEAFVGIFNQGFE*
Homology
BLAST of CsaV3_1G033850 vs. NCBI nr
Match: KAE8653234.1 (hypothetical protein Csa_019805 [Cucumis sativus])

HSP 1 Score: 924.1 bits (2387), Expect = 4.7e-265
Identity = 454/454 (100.00%), Postives = 454/454 (100.00%), Query Frame = 0

Query: 1   MANFPSLKLIEVCKVSPPPTAAAPSSLPLTFFDLMWLRFHPIQRLFFYEFPSNEVSFHDV 60
           MANFPSLKLIEVCKVSPPPTAAAPSSLPLTFFDLMWLRFHPIQRLFFYEFPSNEVSFHDV
Sbjct: 1   MANFPSLKLIEVCKVSPPPTAAAPSSLPLTFFDLMWLRFHPIQRLFFYEFPSNEVSFHDV 60

Query: 61  IVPKLKNSLSLTLRHYLPLAGNLVWPSESDVPFIEFAEGDGVSMTVAESDDNFYHLSGNG 120
           IVPKLKNSLSLTLRHYLPLAGNLVWPSESDVPFIEFAEGDGVSMTVAESDDNFYHLSGNG
Sbjct: 61  IVPKLKNSLSLTLRHYLPLAGNLVWPSESDVPFIEFAEGDGVSMTVAESDDNFYHLSGNG 120

Query: 121 FREVSEFHPLVPQLPVSHNRAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIKSWA 180
           FREVSEFHPLVPQLPVSHNRAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIKSWA
Sbjct: 121 FREVSEFHPLVPQLPVSHNRAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIKSWA 180

Query: 181 QICMEESSVPTPKQVPLYNRSVINDPKDLAKIYAKAWKDAEGPNNKSLNLKFPQTKHGLV 240
           QICMEESSVPTPKQVPLYNRSVINDPKDLAKIYAKAWKDAEGPNNKSLNLKFPQTKHGLV
Sbjct: 181 QICMEESSVPTPKQVPLYNRSVINDPKDLAKIYAKAWKDAEGPNNKSLNLKFPQTKHGLV 240

Query: 241 RSTLEFTHQNIQKLKEWILNKKIENEKFDSSSHISTFAIATAYLCVCTAKLEGLKEGKLW 300
           RSTLEFTHQNIQKLKEWILNKKIENEKFDSSSHISTFAIATAYLCVCTAKLEGLKEGKLW
Sbjct: 241 RSTLEFTHQNIQKLKEWILNKKIENEKFDSSSHISTFAIATAYLCVCTAKLEGLKEGKLW 300

Query: 301 FVFAADARTRLKPQVPLNYFGNCVVAGFVGLERSELLSENGIILACDEISKAIRNLDDGP 360
           FVFAADARTRLKPQVPLNYFGNCVVAGFVGLERSELLSENGIILACDEISKAIRNLDDGP
Sbjct: 301 FVFAADARTRLKPQVPLNYFGNCVVAGFVGLERSELLSENGIILACDEISKAIRNLDDGP 360

Query: 361 LNGCESWGSRMSQEVTNDYLKMQGISLAGSPRFGVYNIDFGLGKPKKVEIVSAESPYVFS 420
           LNGCESWGSRMSQEVTNDYLKMQGISLAGSPRFGVYNIDFGLGKPKKVEIVSAESPYVFS
Sbjct: 361 LNGCESWGSRMSQEVTNDYLKMQGISLAGSPRFGVYNIDFGLGKPKKVEIVSAESPYVFS 420

Query: 421 LTESRNSDVVMEIGVVKKRDEIEAFVGIFNQGFE 455
           LTESRNSDVVMEIGVVKKRDEIEAFVGIFNQGFE
Sbjct: 421 LTESRNSDVVMEIGVVKKRDEIEAFVGIFNQGFE 454

BLAST of CsaV3_1G033850 vs. NCBI nr
Match: XP_011658008.2 (malonyl-CoA:anthocyanidin 5-O-glucoside-6''-O-malonyltransferase [Cucumis sativus])

HSP 1 Score: 871.7 bits (2251), Expect = 2.8e-249
Identity = 430/454 (94.71%), Postives = 442/454 (97.36%), Query Frame = 0

Query: 1   MANFPSLKLIEVCKVSPPPTAAAPSSLPLTFFDLMWLRFHPIQRLFFYEFPSNEVSFHDV 60
           MANFPSLK+IEV KVSPPPTAAAP SLPLTFFDL WLRF+PIQRLFFYEFPSN++S +DV
Sbjct: 1   MANFPSLKVIEVSKVSPPPTAAAPISLPLTFFDLFWLRFYPIQRLFFYEFPSNQISSYDV 60

Query: 61  IVPKLKNSLSLTLRHYLPLAGNLVWPSESDVPFIEFAEGDGVSMTVAESDDNFYHLSGNG 120
           IV KLK+SLSL L H+LPLAGNLVWPS+SDVP IEF EG+GVSMTVAESDD+F HLSGNG
Sbjct: 61  IVSKLKSSLSLALCHHLPLAGNLVWPSQSDVPVIEFVEGNGVSMTVAESDDDFDHLSGNG 120

Query: 121 FREVSEFHPLVPQLPVSHNRAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIKSWA 180
           FREVS+FHPLVP L VSH+RAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIKSWA
Sbjct: 121 FREVSDFHPLVPVLTVSHDRAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIKSWA 180

Query: 181 QICMEESSVPTPKQVPLYNRSVINDPKDLAKIYAKAWKDAEGPNNKSLNLKFPQTKHGLV 240
           QICMEESSVPTPKQVPLYNRSVINDPKDLAKIYAKAWKDAEGPNNKSLNLKFPQTKHGLV
Sbjct: 181 QICMEESSVPTPKQVPLYNRSVINDPKDLAKIYAKAWKDAEGPNNKSLNLKFPQTKHGLV 240

Query: 241 RSTLEFTHQNIQKLKEWILNKKIENEKFDSSSHISTFAIATAYLCVCTAKLEGLKEGKLW 300
           RSTLEFTHQNIQKLKEWILNKKIENEKFDSSSHISTFAIATAYLCVCTAKLEGLKEGKLW
Sbjct: 241 RSTLEFTHQNIQKLKEWILNKKIENEKFDSSSHISTFAIATAYLCVCTAKLEGLKEGKLW 300

Query: 301 FVFAADARTRLKPQVPLNYFGNCVVAGFVGLERSELLSENGIILACDEISKAIRNLDDGP 360
           FVFAADARTRLKPQVPLNYFGNCVVAGFVGLERSELLSENGIILACDEISKAIRNLDDGP
Sbjct: 301 FVFAADARTRLKPQVPLNYFGNCVVAGFVGLERSELLSENGIILACDEISKAIRNLDDGP 360

Query: 361 LNGCESWGSRMSQEVTNDYLKMQGISLAGSPRFGVYNIDFGLGKPKKVEIVSAESPYVFS 420
           LNGCESWGSRMSQEVTNDYLKMQGISLAGSPRFGVYNIDFGLGKPKKVEIVSAESPYVFS
Sbjct: 361 LNGCESWGSRMSQEVTNDYLKMQGISLAGSPRFGVYNIDFGLGKPKKVEIVSAESPYVFS 420

Query: 421 LTESRNSDVVMEIGVVKKRDEIEAFVGIFNQGFE 455
           LTESRNSDVVMEIGVVKKRDEIEAFVGIFNQGFE
Sbjct: 421 LTESRNSDVVMEIGVVKKRDEIEAFVGIFNQGFE 454

BLAST of CsaV3_1G033850 vs. NCBI nr
Match: XP_031741446.1 (phenolic glucoside malonyltransferase 2 isoform X1 [Cucumis sativus])

HSP 1 Score: 850.1 bits (2195), Expect = 8.7e-243
Identity = 416/454 (91.63%), Postives = 432/454 (95.15%), Query Frame = 0

Query: 1   MANFPSLKLIEVCKVSPPPTAAAPSSLPLTFFDLMWLRFHPIQRLFFYEFPSNEVSFHDV 60
           MANFPSLKLIEVCKVSPPPTAAAPSSLPLTFFDLMWLRFHPIQRLFFYEFPSNEVSFHDV
Sbjct: 1   MANFPSLKLIEVCKVSPPPTAAAPSSLPLTFFDLMWLRFHPIQRLFFYEFPSNEVSFHDV 60

Query: 61  IVPKLKNSLSLTLRHYLPLAGNLVWPSESDVPFIEFAEGDGVSMTVAESDDNFYHLSGNG 120
           IVPKLKNSLSLTLRHYLPLAGNLVWPSESDVPFIEFAEGDGVSMTVAESDDNFYHLSGNG
Sbjct: 61  IVPKLKNSLSLTLRHYLPLAGNLVWPSESDVPFIEFAEGDGVSMTVAESDDNFYHLSGNG 120

Query: 121 FREVSEFHPLVPQLPVSHNRAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIKSWA 180
           FREVSEFHPLVPQLPVSHNRAAVIAIQVTKFQNKGFSIGI+NHH I DGRSSTSFIKSWA
Sbjct: 121 FREVSEFHPLVPQLPVSHNRAAVIAIQVTKFQNKGFSIGISNHHGILDGRSSTSFIKSWA 180

Query: 181 QICMEESSVPTPKQVPLYNRSVINDPKDLAKIYAKAWKDAEGPNNKSLNLKFPQTKHGLV 240
           QIC+EES +PTPKQ+PLY+RSVINDPKDLAKIYAKAWKD EGPNNKSLNLKFPQTKHGLV
Sbjct: 181 QICIEESFIPTPKQMPLYDRSVINDPKDLAKIYAKAWKDVEGPNNKSLNLKFPQTKHGLV 240

Query: 241 RSTLEFTHQNIQKLKEWILNKKIENEKFDSSSHISTFAIATAYLCVCTAKLEGLKEGKLW 300
           RSTLEFTHQNIQKLKEWILNKKI+NE FDSSSHIS+FAIATAYLCVCTAKLEGLKEGKLW
Sbjct: 241 RSTLEFTHQNIQKLKEWILNKKIKNENFDSSSHISSFAIATAYLCVCTAKLEGLKEGKLW 300

Query: 301 FVFAADARTRLKPQVPLNYFGNCVVAGFVGLERSELLSENGIILACDEISKAIRNLDDGP 360
           F FAADARTRLKPQVPLNYFGNC+V G+  LER ELLSENGIILACDEISKAIR LDDG 
Sbjct: 301 FGFAADARTRLKPQVPLNYFGNCLVGGYTSLERFELLSENGIILACDEISKAIRKLDDGA 360

Query: 361 LNGCESWGSRMSQEVTNDYLKMQGISLAGSPRFGVYNIDFGLGKPKKVEIVSAESPYVFS 420
           LNG E+WGS MSQ  TNDY K+Q ISLAGSPRFGVYN DFG GKPKKVEIVSAESPYVF 
Sbjct: 361 LNGSENWGSMMSQ-ATNDYSKIQAISLAGSPRFGVYNADFGFGKPKKVEIVSAESPYVFP 420

Query: 421 LTESRNSDVVMEIGVVKKRDEIEAFVGIFNQGFE 455
           LT+S+NSDVVMEIGVV++RDE+EAFV IFNQGFE
Sbjct: 421 LTDSQNSDVVMEIGVVRERDEMEAFVTIFNQGFE 453

BLAST of CsaV3_1G033850 vs. NCBI nr
Match: XP_008457511.1 (PREDICTED: phenolic glucoside malonyltransferase 1-like isoform X1 [Cucumis melo])

HSP 1 Score: 812.4 bits (2097), Expect = 2.0e-231
Identity = 394/455 (86.59%), Postives = 426/455 (93.63%), Query Frame = 0

Query: 1   MANFPSLKLIEVCKVSPPPTAAAPSSLPLTFFDLMWLRFHPIQRLFFYEFPSNEVSFHDV 60
           MANFPSLKLI+VCKVSPPPTAAAPSSLPLTFFDL+WLRFHPIQRLFFYEF SNE+SFHDV
Sbjct: 1   MANFPSLKLIDVCKVSPPPTAAAPSSLPLTFFDLIWLRFHPIQRLFFYEFSSNEISFHDV 60

Query: 61  IVPKLKNSLSLTLRHYLPLAGNLVWPSESDVPFIEFAEGDGVSMTVAESDDNFYHLSGNG 120
           IVPKLK++LSLTLRHYLPLAGNLVWPS+SD+P +EF EGDGVSMTVAE D +FYHLSGNG
Sbjct: 61  IVPKLKSTLSLTLRHYLPLAGNLVWPSQSDIPVVEFVEGDGVSMTVAEFDGDFYHLSGNG 120

Query: 121 FREVSEFHPLVPQLPVSHNRAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIKSWA 180
           F+EV EFHPLVP+LPVSH+RAAVIAIQVTKFQNKGFSIGITNHHAI DGRSSTSFIKSWA
Sbjct: 121 FQEVLEFHPLVPELPVSHDRAAVIAIQVTKFQNKGFSIGITNHHAIIDGRSSTSFIKSWA 180

Query: 181 QICMEESSVPTPKQVPLYNRSVINDPKDLAKIYAKAWKDAEGPNNKSLNLKFPQTKHGLV 240
           QICME+SS+PTPK++PLY+RSVINDPKDLAK+YAK W+D EGPNNKSLNLK P+TKHGL+
Sbjct: 181 QICMEDSSIPTPKEMPLYDRSVINDPKDLAKLYAKVWQDLEGPNNKSLNLKIPETKHGLI 240

Query: 241 RSTLEFTHQNIQKLKEWILNKKIENEKFDSSS-HISTFAIATAYLCVCTAKLEGLKEGKL 300
           RSTLEFTHQNIQKLKEWILN+KI+NE FDSSS  IS+FAIATAYLC+CTAKLEGLKEGKL
Sbjct: 241 RSTLEFTHQNIQKLKEWILNEKIKNENFDSSSCRISSFAIATAYLCICTAKLEGLKEGKL 300

Query: 301 WFVFAADARTRLKPQVPLNYFGNCVVAGFVGLERSELLSENGIILACDEISKAIRNLDDG 360
           WF FAAD RTRLKPQVPLNYFGNC+V   V LER ELLSENGIILACDEISKAIRNLDDG
Sbjct: 301 WFAFAADGRTRLKPQVPLNYFGNCLVGAVVCLERFELLSENGIILACDEISKAIRNLDDG 360

Query: 361 PLNGCESWGSRMSQEVTNDYLKMQGISLAGSPRFGVYNIDFGLGKPKKVEIVSAESPYVF 420
            LNGCE+WGS+MSQ  TN Y K+Q ISL+GSPRFGVYN DFG GKPKKVEIVSAESPYVF
Sbjct: 361 ALNGCENWGSQMSQLTTN-YSKVQAISLSGSPRFGVYNADFGFGKPKKVEIVSAESPYVF 420

Query: 421 SLTESRNSDVVMEIGVVKKRDEIEAFVGIFNQGFE 455
           SLT+SRNSDVVMEIGVVK+RDEIEAFV IFNQGFE
Sbjct: 421 SLTDSRNSDVVMEIGVVKERDEIEAFVDIFNQGFE 454

BLAST of CsaV3_1G033850 vs. NCBI nr
Match: XP_011648510.2 (phenolic glucoside malonyltransferase 1-like [Cucumis sativus])

HSP 1 Score: 805.8 bits (2080), Expect = 1.9e-229
Identity = 387/454 (85.24%), Postives = 420/454 (92.51%), Query Frame = 0

Query: 1   MANFPSLKLIEVCKVSPPPTAAAPSSLPLTFFDLMWLRFHPIQRLFFYEFPSNEVSFHDV 60
           MANFPSLKLI+VCKV P P AAAPSSLPLTFFD++WLR HPIQRLFFYEF SNE+SF+D+
Sbjct: 1   MANFPSLKLIDVCKVPPSPAAAAPSSLPLTFFDVLWLRVHPIQRLFFYEFSSNEISFYDI 60

Query: 61  IVPKLKNSLSLTLRHYLPLAGNLVWPSESDVPFIEFAEGDGVSMTVAESDDNFYHLSGNG 120
           IVPKLK+SLSLTL HYLPLAGNL+WPS+SD P IEF  GDGVSMTVAESDD+FYHLSGNG
Sbjct: 61  IVPKLKSSLSLTLCHYLPLAGNLIWPSQSDTPVIEFVNGDGVSMTVAESDDDFYHLSGNG 120

Query: 121 FREVSEFHPLVPQLPVSHNRAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIKSWA 180
           FR+VS+FHPLVPQL  SH+RAA++AIQVTKFQNKGFSIGITNHHAI DGRSSTSFIKSWA
Sbjct: 121 FRKVSKFHPLVPQLSTSHDRAAIVAIQVTKFQNKGFSIGITNHHAILDGRSSTSFIKSWA 180

Query: 181 QICMEESSVPTPKQVPLYNRSVINDPKDLAKIYAKAWKDAEGPNNKSLNLKFPQTKHGLV 240
           QIC+EES +PTPKQ+PLY+RSVINDPKDLAKIYAKAWKD EGPNNKSLNLKFPQTKH LV
Sbjct: 181 QICIEESFIPTPKQMPLYDRSVINDPKDLAKIYAKAWKDVEGPNNKSLNLKFPQTKHDLV 240

Query: 241 RSTLEFTHQNIQKLKEWILNKKIENEKFDSSSHISTFAIATAYLCVCTAKLEGLKEGKLW 300
           RSTLEFTHQNIQKLKEWILNKKI+NE FDSSSHIS+FAI TAYLCVCTAKLEGLKEGKLW
Sbjct: 241 RSTLEFTHQNIQKLKEWILNKKIKNENFDSSSHISSFAIVTAYLCVCTAKLEGLKEGKLW 300

Query: 301 FVFAADARTRLKPQVPLNYFGNCVVAGFVGLERSELLSENGIILACDEISKAIRNLDDGP 360
           F F ADARTRLKPQVPLNYFGNC+VAGFV  ER ELLSENGII ACDEISK IRNLDDGP
Sbjct: 301 FGFVADARTRLKPQVPLNYFGNCLVAGFVVNERFELLSENGIIFACDEISKTIRNLDDGP 360

Query: 361 LNGCESWGSRMSQEVTNDYLKMQGISLAGSPRFGVYNIDFGLGKPKKVEIVSAESPYVFS 420
           LNGCE+WG  MSQE+TNDY K+Q  S+AGSPRFGVYN+DFG GKPKKVEIVSAESPY+FS
Sbjct: 361 LNGCENWG-MMSQEMTNDYSKLQINSIAGSPRFGVYNVDFGFGKPKKVEIVSAESPYIFS 420

Query: 421 LTESRNSDVVMEIGVVKKRDEIEAFVGIFNQGFE 455
           LT++RNSD VMEIG+V +RDE+EAFV IFNQGFE
Sbjct: 421 LTDTRNSDAVMEIGIVMERDEMEAFVAIFNQGFE 453

BLAST of CsaV3_1G033850 vs. ExPASy Swiss-Prot
Match: Q9LRQ8 (Phenolic glucoside malonyltransferase 2 OS=Arabidopsis thaliana OX=3702 GN=PMAT2 PE=1 SV=1)

HSP 1 Score: 304.3 bits (778), Expect = 2.3e-81
Identity = 178/457 (38.95%), Postives = 256/457 (56.02%), Query Frame = 0

Query: 6   SLKLIEVCKVSPPPTAAAPSS----LPLTFFDLMWLRFHPIQRLFFYEF-PSNEVSFHDV 65
           +L +IE  +V+P   +   S+    LPLTFFDL WL F P++R+FFYE   S    FH +
Sbjct: 2   TLHVIETARVTPTDYSVINSANLHKLPLTFFDLPWLLFQPVKRVFFYELTESTRDHFHSI 61

Query: 66  IVPKLKNSLSLTLRHYLPLAGNLVWPSESDVPFIEFAEGDGVSMTVAESDDNFYHLSGNG 125
           I+PKLK+SLSL LR+YLPL G++ W      P I  +E   V +T+AESD +F HLSG G
Sbjct: 62  ILPKLKDSLSLILRNYLPLTGHITWEPNEPKPSIIVSENGVVLVTIAESDADFSHLSGYG 121

Query: 126 FREVSEFHPLVPQLPVSHNRAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIKSWA 185
            R +SE H LVP+LPVS + A   +IQ+T F N+GFSIG+  HHA+ DG++S++FIK+WA
Sbjct: 122 QRPLSELHALVPKLPVSDDSATAFSIQITLFPNQGFSIGVAAHHAVLDGKTSSTFIKAWA 181

Query: 186 QICMEESSVPTPKQVPLYNRSVINDPKDL-AKIYAKAWKDAEGPNNKSLNLKFPQTKHG- 245
           QIC +E         P Y+RS+I  P  L  K+        E   N       P +K G 
Sbjct: 182 QICKQELQSMPENLTPSYDRSLIKYPTYLDEKMIELVRSLKEDQTNIRSLTSLPSSKLGD 241

Query: 246 -LVRSTLEFTHQNIQKLKEWILNKKIENEKFDSSSHISTFAIATAYLCVCTAKLE-GLKE 305
            +V +TL  +  +I++L+E + N          S H+STF IA AY   C  K   G K+
Sbjct: 242 DVVLATLVLSRADIERLREQVKN-------VSPSLHLSTFVIAYAYAWTCFVKARGGNKD 301

Query: 306 GKLWFVFAADARTRLKPQVPLNYFGNCVV-AGFVGLERSELLSENGIILACDEISKAIRN 365
             +  +F  D R RL P++P  YFGNC++  G    + +E + E G + A + IS  ++ 
Sbjct: 302 RSVSLLFVGDFRDRLDPKLPGTYFGNCMIPVGCYNRKAAEFMEEKGFVTAAEIISDLVKG 361

Query: 366 LDDGPLNGCESWGSRMSQEVTNDYLKMQGISLAGSPRFGVYNIDFGLGKPKKVEIVSAES 425
           L    +   E+      +  +      Q  ++AGS R GVY  DFG G+P KV+IVS + 
Sbjct: 362 LSSRKI---ETIADTFVEGFSFQSWSTQFGTIAGSTRLGVYEADFGWGRPVKVDIVSIDQ 421

Query: 426 PYVFSLTESRNSDVVMEIGVVKKRDEIEAFVGIFNQG 453
               ++ E R+    +EIG+  K+ E+++ V  FN G
Sbjct: 422 GEAIAMAERRDESGGVEIGMCLKKTEMDSVVSFFNNG 448

BLAST of CsaV3_1G033850 vs. ExPASy Swiss-Prot
Match: Q940Z5 (Phenolic glucoside malonyltransferase 1 OS=Arabidopsis thaliana OX=3702 GN=PMAT1 PE=1 SV=1)

HSP 1 Score: 301.2 bits (770), Expect = 2.0e-80
Identity = 179/469 (38.17%), Postives = 262/469 (55.86%), Query Frame = 0

Query: 6   SLKLIEVCKVSPPPTAAAPS-SLPLTFFDLMWLRFHPIQRLFFYEFPSNEVSFHD-VIVP 65
           SLK+I+V +V+P  + ++ S +LPLTFFDL+W + H ++R+ FY+       F D VIVP
Sbjct: 9   SLKVIDVARVTPSNSDSSESLTLPLTFFDLLWYKLHAVERVIFYKLTDASRPFFDSVIVP 68

Query: 66  KLKNSLSLTLRHYLPLAGNLVWPSESDVPFIEFAEGDGVSMTVAESDDNFYHLSGNGFRE 125
            LK SLS +L HYLPLAG LVW      P I +   D VS TVAES+ +F  L+G     
Sbjct: 69  NLKTSLSSSLSHYLPLAGKLVWEPLDPKPKIVYTPNDAVSFTVAESNADFSRLTGKEPFP 128

Query: 126 VSEFHPLVPQLPVSHNRAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIKSWAQIC 185
            +E +PLVP+L VS + A+ ++ QVT F N+GF I +  HHA+ DG+++T+F+KSWA+ C
Sbjct: 129 TTELYPLVPELHVSDDSASAVSFQVTLFPNQGFCISVNAHHAVLDGKTTTNFLKSWARTC 188

Query: 186 MEESSVPTPKQVPLYNRSVINDPKDLAKIYAKAWK--------DAEGPNNKSLNLKF-PQ 245
             + S      +P+Y+R+VI DP DL      AW           E  N KSL L + P+
Sbjct: 189 KNQDSFLPQDLIPVYDRTVIKDPMDLDTKILNAWHRVAKVFTGGKEPENPKSLKLLWSPE 248

Query: 246 TKHGLVRSTLEFTHQNIQKLKEWILNKKIENEKFDSSS---HISTFAIATAYLCVCTAKL 305
               + R TL  T ++IQKL+E  L K+  +    SS     +STF I  +Y   C  K 
Sbjct: 249 IGPDVFRYTLNLTREDIQKLRE-RLKKESSSSSVSSSPKELRLSTFVIVYSYALTCLIKA 308

Query: 306 EGLKEGK-LWFVFAADARTRLKPQVPLNYFGNCVVAGF-VGLERSELLSENGIILACDEI 365
            G    + + + FA D R+ + P VP +YFGNCV A F + L     +SE G + A   +
Sbjct: 309 RGGDPSRPVGYGFAVDCRSLMVPPVPSSYFGNCVSACFKMSLTAETFMSEEGFLAAARMV 368

Query: 366 SKAIRNLDDGPLNGCESWGSRMSQEVTNDYLKM----QGISLAGSPRFGVYNIDFGLGKP 425
           S ++  LD+          +    E+   +  +    Q +S+AGS RFGVY +DFG G+P
Sbjct: 369 SDSVEALDENV--------ALKIPEILEGFTTLSPGTQVLSVAGSTRFGVYGLDFGWGRP 428

Query: 426 KKVEIVSAESPYVFSLTESRNSDVVMEIGVVKKRDEIEAFVGIFNQGFE 455
           +KV +VS +     S  ESR+    +E+G   K+ E++  V + ++G E
Sbjct: 429 EKVVVVSIDQGEAISFAESRDGSGGVELGFSLKKHEMDVLVDLLHKGLE 468

BLAST of CsaV3_1G033850 vs. ExPASy Swiss-Prot
Match: Q9LJB4 (Malonyl-CoA:anthocyanidin 5-O-glucoside-6''-O-malonyltransferase OS=Arabidopsis thaliana OX=3702 GN=5MAT PE=1 SV=1)

HSP 1 Score: 297.0 bits (759), Expect = 3.7e-79
Identity = 184/469 (39.23%), Postives = 264/469 (56.29%), Query Frame = 0

Query: 1   MANFPS-LKLIEVCKVSPPPTAAAPSSLPLTFFDLMWLRFHPIQRLFFYEFPSNEVSFHD 60
           M NF S + ++EV +VSPP + +   +LPLT+FDL WL+ HP+ R+ FY  P    S   
Sbjct: 1   MVNFNSAVNILEVVQVSPPSSNSL--TLPLTYFDLGWLKLHPVDRVLFYHVPELTRS--- 60

Query: 61  VIVPKLKNSLSLTLRHYLPLAGNLVWPSESDVPFIEFA--EGDGVSMTVAESDDNFYHLS 120
            ++ KLK+SLS TL HYLPLAG LVW S    P I ++  + D V +TVAES+ +  HLS
Sbjct: 61  SLISKLKSSLSATLLHYLPLAGRLVWDSIKTKPSIVYSPDDKDAVYLTVAESNGDLSHLS 120

Query: 121 GNGFREVSEFHPLVPQLPVSHNRAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIK 180
           G+  R  +EFH LVP+LPVS   A V+A+QVT F N+GFS+G+T HHA+ DG+++  F+K
Sbjct: 121 GDEPRPATEFHSLVPELPVSDESARVLAVQVTFFPNQGFSLGVTAHHAVLDGKTTAMFLK 180

Query: 181 SWAQICMEESSVPTPKQVPLYNRSVINDPKDLAKIYAKAWKDAEGPNNKSLNLKFPQTKH 240
           +WA  C +E        VP  +R ++ DP  L       W  A   NNK     FP    
Sbjct: 181 AWAHNCKQEQEALPHDLVPSLDRIIVQDPTGLETKLLNRWISAS--NNKPSLKLFPSKII 240

Query: 241 G--LVRSTLEFTHQNIQKLKEWILNKKIENEKFDSSSHISTFAIATAYLCVCTAKLEGLK 300
           G  ++R T   T ++I+KL+E     ++E E       +STF I  AY+  C  K+ G  
Sbjct: 241 GSDILRVTYRLTREDIKKLRE-----RVETESHAKQLRLSTFVITYAYVITCMVKMRGGD 300

Query: 301 EGKLWFV-FAADARTRLKPQVPLNYFGNCVV-AGFVGLERSELLSE---NGIILACDEIS 360
             +   V FA+D R+RL P +P  +FGNC+V +G   ++   +L E    G I A + ++
Sbjct: 301 PTRFVCVGFASDFRSRLNPPLPPTFFGNCIVGSGDFDVKAEPILEEGEGKGFITAVETLT 360

Query: 361 KAI-----RNLDDGPLNGCESWGSRMSQEVTNDYLKMQGISLAGSPRFGVYNIDFGLGKP 420
             +      N++   L   E++  RM           Q IS+AGS R G+Y  DFG GKP
Sbjct: 361 GWVNGLCPENIEKNMLLPFEAF-KRMEP-------GRQMISVAGSTRLGIYGSDFGWGKP 420

Query: 421 KKVEIVSAESPYVFSLTESRNSDVVMEIGVVKKRDEIEAFVGIFNQGFE 455
            KVEIV+ +     SL+ES +    +E+GV  K+D++E F  +F+ G E
Sbjct: 421 VKVEIVTIDKDASVSLSESGDGSGGVEVGVCLKKDDVERFGSLFSIGLE 449

BLAST of CsaV3_1G033850 vs. ExPASy Swiss-Prot
Match: Q9LRQ7 (BAHD acyltransferase At3g29680 OS=Arabidopsis thaliana OX=3702 GN=At3g29680 PE=2 SV=1)

HSP 1 Score: 287.0 bits (733), Expect = 3.8e-76
Identity = 167/456 (36.62%), Postives = 259/456 (56.80%), Query Frame = 0

Query: 6   SLKLIEVCKVSPPPTAAAPSSLPLTFFDLMWLRFHPIQRLFFYEF-PSNEVSFHDVIVPK 65
           +L +I++ +VS    +  P  LPLTFFDL+WL+ +PI+R+ FY+   S+  SF   I+PK
Sbjct: 2   ALNVIKISRVSLVTNSVEPLVLPLTFFDLLWLKLNPIERVTFYKLTESSRDSFFSSILPK 61

Query: 66  LKNSLSLTLRHYLPLAGNLVWPSESDVPFIEFAEGDGVSMTVAESDDNFYHLSGNGFREV 125
           L+ SLSL L H+LPL+G+L W  +   P I     D VS+TV ES+ +F ++S    R  
Sbjct: 62  LEQSLSLVLSHFLPLSGHLKWNPQDPKPHIVIFPKDTVSLTVVESEADFSYISSKELRLE 121

Query: 126 SEFHPLVPQLPVSHNRAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIKSWAQICM 185
           +E  PLVP+L VS + A+++++Q+T F N+GFSIG T HH + DG++++ F KSWA IC 
Sbjct: 122 TELRPLVPELQVSSDSASLLSLQITLFPNQGFSIGTTVHHVVMDGKTASKFHKSWAHIC- 181

Query: 186 EESSVPTPKQVP-LYNRSVINDPKDLAKIYAKAWKDAEGPNNKSLNLKFPQTK---HGLV 245
           +  + P    +P + +R+VIN P  L +   +         + +  L  P  K   + +V
Sbjct: 182 KHGTTPQDFDLPTVLDRTVINVPAGLEQKIFQLSSYISEEKDYARTLTLPPAKEIDNDVV 241

Query: 246 RSTLEFTHQNIQKLKEWILNKKIENEKFDSSSHISTFAIATAYLCVCTAKLEGLKEGK-L 305
           R TLE T  +I+KLKE     + +NE   S  H+STF ++ AY+  C  K  G    + +
Sbjct: 242 RVTLELTEVDIEKLKE-----RAKNESTRSDLHLSTFVVSYAYVLTCMVKSCGGDANRPV 301

Query: 306 WFVFAADARTRLKPQVPLNYFGNCVV-AGFVGLERSELLSENGIILACDEISKAIRNLDD 365
            F++AAD R RL P VPL YFGNCV+   F G + +  L ++G +   + +S ++R L  
Sbjct: 302 RFMYAADFRNRLDPPVPLTYFGNCVLPIDFNGYKATTFLGKDGYVNGVEILSDSVRGLGS 361

Query: 366 GPLNGCESWGSRMSQEVTNDYLKMQGISLAGSPRFGVYNIDFGLGKPKKVEIVSAESPYV 425
                 ES          N  L  Q +++ GS +FG+Y  DFG G+P K +++S      
Sbjct: 362 ---RNIESIWEVYEDGTKNMKLDTQNVTVTGSNQFGIYGSDFGWGRPVKTDVMSLYKNNE 421

Query: 426 FSLTESRNSDVVMEIGVVKKRDEIEAFVGIFNQGFE 455
           FS++  R+    +EIG+  K+ E+  F+ +F   F+
Sbjct: 422 FSMSARRDEIGGLEIGISLKKCEMNVFLSLFTSDFD 448

BLAST of CsaV3_1G033850 vs. ExPASy Swiss-Prot
Match: Q9MBC1 (Anthocyanidin 3-O-glucoside 6''-O-acyltransferase (Fragment) OS=Perilla frutescens OX=48386 PE=1 SV=1)

HSP 1 Score: 271.9 bits (694), Expect = 1.3e-71
Identity = 167/466 (35.84%), Postives = 241/466 (51.72%), Query Frame = 0

Query: 9   LIEVCKVSPPPTAAAPSSLPLTFFDLMWLRFHPIQRLFFYEFPSNEVSFHDVIVPKLKNS 68
           +IE C+V PPP + A  S+PLTFFD+ WL FHP+ +L FYEFP ++  F + IVPKLK S
Sbjct: 1   VIETCRVGPPPDSVAEQSVPLTFFDMTWLHFHPMLQLLFYEFPCSKQHFSESIVPKLKQS 60

Query: 69  LSLTLRHYLPLAGNLVWPSESD-VPFIEFAEGDGVSMTVAESDDNFYHLSGNGFREVSEF 128
           LS TL H+ PL+ NL++PS  + +P   +  GD VS T+AES D+F  L GN        
Sbjct: 61  LSKTLIHFFPLSCNLIYPSSPEKMPEFRYLSGDSVSFTIAESSDDFDDLVGNRPESPVRL 120

Query: 129 HPLVPQLP-----VSHNRAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIKSWAQI 188
           +  VP+LP            V A+QVT F  +G  IGI  HH + D  S  +FI +W+ +
Sbjct: 121 YNFVPKLPPIVEESDRKLFQVFAVQVTLFPGRGVGIGIATHHTVSDAPSFLAFITAWSSM 180

Query: 189 CM---EESSVPTPKQVPLYNRSVINDPKDLAKIYAKAWKDAEGPNNKSLNLKFP-QTKH- 248
                 E      K +P+++RSVI  P     IY   W++A         LKFP Q++H 
Sbjct: 181 SKHIENEDEDEEFKSLPVFDRSVIKYPTKFDSIY---WRNA---------LKFPLQSRHP 240

Query: 249 ----GLVRSTLEFTHQNIQKLKEWILNKKIENEKFDSSSHISTFAIATAYLCVCTAKL-- 308
                 +R+T  FT   I+KLK WI        +  S  H+S+F    AY+     K   
Sbjct: 241 SLPTDRIRTTFVFTQSKIKKLKGWI------QSRVPSLVHLSSFVAIAAYMWAGITKSFT 300

Query: 309 --EGLKEGKLWFVFAADARTRLKPQVPLNYFGNCVVAGFVGLERSELLSENGIILACDEI 368
             E       +F+   D R RL P VP NYFGNC+      + R EL+ E G+ LA + I
Sbjct: 301 ADEDQDNEDAFFLIPVDLRPRLDPPVPENYFGNCLSYALPRMRRRELVGEKGVFLAAEVI 360

Query: 369 SKAIRNL--DDGPLNGCESWGSRMSQEVTNDYLKMQGISLAGSPRFGVYNIDFGLGKPKK 428
           +  I+    D   L   E W   + + +   Y      S+AGS +  +Y  DFG GK +K
Sbjct: 361 AAEIKKRINDKRILETVEKWSPEIRKALQKSY-----FSVAGSSKLDLYGADFGWGKARK 420

Query: 429 VEIVSAE-SPYVFSLTESRNSDVVMEIGVVKKRDEIEAFVGIFNQG 453
            EI+S +   Y  +L ++R+ +  +E+ +   +D+++AF   F+ G
Sbjct: 421 QEILSIDGEKYAMTLCKARDFEGGLEVCLSLPKDKMDAFAAYFSLG 443

BLAST of CsaV3_1G033850 vs. ExPASy TrEMBL
Match: A0A0A0LXC1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G498810 PE=3 SV=1)

HSP 1 Score: 870.2 bits (2247), Expect = 3.9e-249
Identity = 429/454 (94.49%), Postives = 441/454 (97.14%), Query Frame = 0

Query: 1   MANFPSLKLIEVCKVSPPPTAAAPSSLPLTFFDLMWLRFHPIQRLFFYEFPSNEVSFHDV 60
           MANFPSLK+IEV KVSPPPTAAAP SLPLTFFDL WLRF+PIQRLFFYEFPSN++S +DV
Sbjct: 1   MANFPSLKVIEVSKVSPPPTAAAPISLPLTFFDLFWLRFYPIQRLFFYEFPSNQISSYDV 60

Query: 61  IVPKLKNSLSLTLRHYLPLAGNLVWPSESDVPFIEFAEGDGVSMTVAESDDNFYHLSGNG 120
           IV KLK+SLSL L H+LPLAGNLVWPS+SDVP IEF EG+GVSMTVAESDD+F HLSGNG
Sbjct: 61  IVSKLKSSLSLALCHHLPLAGNLVWPSQSDVPVIEFVEGNGVSMTVAESDDDFDHLSGNG 120

Query: 121 FREVSEFHPLVPQLPVSHNRAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIKSWA 180
           FREVS+FHPLVP L VSH+RAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIKSWA
Sbjct: 121 FREVSDFHPLVPVLTVSHDRAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIKSWA 180

Query: 181 QICMEESSVPTPKQVPLYNRSVINDPKDLAKIYAKAWKDAEGPNNKSLNLKFPQTKHGLV 240
           QICMEESSVPTPKQVPLYNRSVINDPKDLAKIYAKAWKD EGPNNKSLNLKFPQTKHGLV
Sbjct: 181 QICMEESSVPTPKQVPLYNRSVINDPKDLAKIYAKAWKDVEGPNNKSLNLKFPQTKHGLV 240

Query: 241 RSTLEFTHQNIQKLKEWILNKKIENEKFDSSSHISTFAIATAYLCVCTAKLEGLKEGKLW 300
           RSTLEFTHQNIQKLKEWILNKKIENEKFDSSSHISTFAIATAYLCVCTAKLEGLKEGKLW
Sbjct: 241 RSTLEFTHQNIQKLKEWILNKKIENEKFDSSSHISTFAIATAYLCVCTAKLEGLKEGKLW 300

Query: 301 FVFAADARTRLKPQVPLNYFGNCVVAGFVGLERSELLSENGIILACDEISKAIRNLDDGP 360
           FVFAADARTRLKPQVPLNYFGNCVVAGFVGLERSELLSENGIILACDEISKAIRNLDDGP
Sbjct: 301 FVFAADARTRLKPQVPLNYFGNCVVAGFVGLERSELLSENGIILACDEISKAIRNLDDGP 360

Query: 361 LNGCESWGSRMSQEVTNDYLKMQGISLAGSPRFGVYNIDFGLGKPKKVEIVSAESPYVFS 420
           LNGCESWGSRMSQEVTNDYLKMQGISLAGSPRFGVYNIDFGLGKPKKVEIVSAESPYVFS
Sbjct: 361 LNGCESWGSRMSQEVTNDYLKMQGISLAGSPRFGVYNIDFGLGKPKKVEIVSAESPYVFS 420

Query: 421 LTESRNSDVVMEIGVVKKRDEIEAFVGIFNQGFE 455
           LTESRNSDVVMEIGVVKKRDEIEAFVGIFNQGFE
Sbjct: 421 LTESRNSDVVMEIGVVKKRDEIEAFVGIFNQGFE 454

BLAST of CsaV3_1G033850 vs. ExPASy TrEMBL
Match: A0A1S3C6Z8 (phenolic glucoside malonyltransferase 1-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103497183 PE=3 SV=1)

HSP 1 Score: 812.4 bits (2097), Expect = 9.7e-232
Identity = 394/455 (86.59%), Postives = 426/455 (93.63%), Query Frame = 0

Query: 1   MANFPSLKLIEVCKVSPPPTAAAPSSLPLTFFDLMWLRFHPIQRLFFYEFPSNEVSFHDV 60
           MANFPSLKLI+VCKVSPPPTAAAPSSLPLTFFDL+WLRFHPIQRLFFYEF SNE+SFHDV
Sbjct: 1   MANFPSLKLIDVCKVSPPPTAAAPSSLPLTFFDLIWLRFHPIQRLFFYEFSSNEISFHDV 60

Query: 61  IVPKLKNSLSLTLRHYLPLAGNLVWPSESDVPFIEFAEGDGVSMTVAESDDNFYHLSGNG 120
           IVPKLK++LSLTLRHYLPLAGNLVWPS+SD+P +EF EGDGVSMTVAE D +FYHLSGNG
Sbjct: 61  IVPKLKSTLSLTLRHYLPLAGNLVWPSQSDIPVVEFVEGDGVSMTVAEFDGDFYHLSGNG 120

Query: 121 FREVSEFHPLVPQLPVSHNRAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIKSWA 180
           F+EV EFHPLVP+LPVSH+RAAVIAIQVTKFQNKGFSIGITNHHAI DGRSSTSFIKSWA
Sbjct: 121 FQEVLEFHPLVPELPVSHDRAAVIAIQVTKFQNKGFSIGITNHHAIIDGRSSTSFIKSWA 180

Query: 181 QICMEESSVPTPKQVPLYNRSVINDPKDLAKIYAKAWKDAEGPNNKSLNLKFPQTKHGLV 240
           QICME+SS+PTPK++PLY+RSVINDPKDLAK+YAK W+D EGPNNKSLNLK P+TKHGL+
Sbjct: 181 QICMEDSSIPTPKEMPLYDRSVINDPKDLAKLYAKVWQDLEGPNNKSLNLKIPETKHGLI 240

Query: 241 RSTLEFTHQNIQKLKEWILNKKIENEKFDSSS-HISTFAIATAYLCVCTAKLEGLKEGKL 300
           RSTLEFTHQNIQKLKEWILN+KI+NE FDSSS  IS+FAIATAYLC+CTAKLEGLKEGKL
Sbjct: 241 RSTLEFTHQNIQKLKEWILNEKIKNENFDSSSCRISSFAIATAYLCICTAKLEGLKEGKL 300

Query: 301 WFVFAADARTRLKPQVPLNYFGNCVVAGFVGLERSELLSENGIILACDEISKAIRNLDDG 360
           WF FAAD RTRLKPQVPLNYFGNC+V   V LER ELLSENGIILACDEISKAIRNLDDG
Sbjct: 301 WFAFAADGRTRLKPQVPLNYFGNCLVGAVVCLERFELLSENGIILACDEISKAIRNLDDG 360

Query: 361 PLNGCESWGSRMSQEVTNDYLKMQGISLAGSPRFGVYNIDFGLGKPKKVEIVSAESPYVF 420
            LNGCE+WGS+MSQ  TN Y K+Q ISL+GSPRFGVYN DFG GKPKKVEIVSAESPYVF
Sbjct: 361 ALNGCENWGSQMSQLTTN-YSKVQAISLSGSPRFGVYNADFGFGKPKKVEIVSAESPYVF 420

Query: 421 SLTESRNSDVVMEIGVVKKRDEIEAFVGIFNQGFE 455
           SLT+SRNSDVVMEIGVVK+RDEIEAFV IFNQGFE
Sbjct: 421 SLTDSRNSDVVMEIGVVKERDEIEAFVDIFNQGFE 454

BLAST of CsaV3_1G033850 vs. ExPASy TrEMBL
Match: A0A0A0LXY8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G499330 PE=3 SV=1)

HSP 1 Score: 808.9 bits (2088), Expect = 1.1e-230
Identity = 389/454 (85.68%), Postives = 422/454 (92.95%), Query Frame = 0

Query: 1   MANFPSLKLIEVCKVSPPPTAAAPSSLPLTFFDLMWLRFHPIQRLFFYEFPSNEVSFHDV 60
           MANFPSLKLI+VCKV P P AAAPSSLPLTFFD++WLR HPIQRLFFYEF SNE+SF+D+
Sbjct: 1   MANFPSLKLIDVCKVPPSPAAAAPSSLPLTFFDVLWLRVHPIQRLFFYEFSSNEISFYDI 60

Query: 61  IVPKLKNSLSLTLRHYLPLAGNLVWPSESDVPFIEFAEGDGVSMTVAESDDNFYHLSGNG 120
           IVPKLK+SLSLTL HYLPLAGNL+WPS+SD P IEF  GDGVSMTVAESDD+FYHLSGNG
Sbjct: 61  IVPKLKSSLSLTLCHYLPLAGNLIWPSQSDTPVIEFVNGDGVSMTVAESDDDFYHLSGNG 120

Query: 121 FREVSEFHPLVPQLPVSHNRAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIKSWA 180
           FR+VS+FHPLVPQL  SH+RAA++AIQVTKFQNKGFSIGITNHHAI DGRSSTSFIKSWA
Sbjct: 121 FRKVSKFHPLVPQLSTSHDRAAIVAIQVTKFQNKGFSIGITNHHAILDGRSSTSFIKSWA 180

Query: 181 QICMEESSVPTPKQVPLYNRSVINDPKDLAKIYAKAWKDAEGPNNKSLNLKFPQTKHGLV 240
           QIC+EES +PTPKQ+PLY+RSVINDPKDLAKIYAKAWKD EGPNNKSLNLKFPQTKH LV
Sbjct: 181 QICIEESFIPTPKQMPLYDRSVINDPKDLAKIYAKAWKDVEGPNNKSLNLKFPQTKHDLV 240

Query: 241 RSTLEFTHQNIQKLKEWILNKKIENEKFDSSSHISTFAIATAYLCVCTAKLEGLKEGKLW 300
           RSTLEFTHQNIQKLKEWILNKKI+NE FDSSSHIS+FAIATAYLCVCTAKLEGLKEGKLW
Sbjct: 241 RSTLEFTHQNIQKLKEWILNKKIKNENFDSSSHISSFAIATAYLCVCTAKLEGLKEGKLW 300

Query: 301 FVFAADARTRLKPQVPLNYFGNCVVAGFVGLERSELLSENGIILACDEISKAIRNLDDGP 360
           F FAADARTRLKPQVPLNYFGNC+VAGFV  ER ELLSENGII ACDEISK IRNLDDGP
Sbjct: 301 FGFAADARTRLKPQVPLNYFGNCLVAGFVVNERFELLSENGIIFACDEISKTIRNLDDGP 360

Query: 361 LNGCESWGSRMSQEVTNDYLKMQGISLAGSPRFGVYNIDFGLGKPKKVEIVSAESPYVFS 420
           LNGCE+WG  MSQE+TNDY K+Q  S+AGSPRFGVYN+DFG GKPKKVEIVSAESPY+FS
Sbjct: 361 LNGCENWG-MMSQEMTNDYSKLQINSIAGSPRFGVYNVDFGFGKPKKVEIVSAESPYIFS 420

Query: 421 LTESRNSDVVMEIGVVKKRDEIEAFVGIFNQGFE 455
           LT++RNSD VMEIG+V +RDE+EAFV IFNQGFE
Sbjct: 421 LTDTRNSDAVMEIGIVMERDEMEAFVAIFNQGFE 453

BLAST of CsaV3_1G033850 vs. ExPASy TrEMBL
Match: A0A1S3C598 (phenolic glucoside malonyltransferase 1-like OS=Cucumis melo OX=3656 GN=LOC103497184 PE=3 SV=1)

HSP 1 Score: 800.8 bits (2067), Expect = 2.9e-228
Identity = 393/455 (86.37%), Postives = 422/455 (92.75%), Query Frame = 0

Query: 1   MANFPSLKLIEVCKVSPPPTAAAPSSLPLTFFDLMWLRFHPIQRLFFYEFPSNEVSFHDV 60
           MANF SLKLI+VCKVSP PTAAAPSSLPLTFFDLMWLRFHPIQRLFFYEFPSNE+SFHD 
Sbjct: 1   MANFSSLKLIDVCKVSPSPTAAAPSSLPLTFFDLMWLRFHPIQRLFFYEFPSNEISFHDT 60

Query: 61  IVPKLKNSLSLTLRHYLPLAGNLVWPSESDVPFIEFAEGDGVSMTVAESDDNFYHLSGNG 120
           IVPKLK+SLSLTL HYLPLAGNLVWPS+SDVP +EF EGDGVSMTVAE D +F HLSGNG
Sbjct: 61  IVPKLKSSLSLTLCHYLPLAGNLVWPSQSDVPVVEFVEGDGVSMTVAEFDGDFNHLSGNG 120

Query: 121 FREVSEFHPLVPQLPVSHNRAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIKSWA 180
           FREVSEFHPLVP+LPVS++RAAVIAIQVTKFQNKGFSIGITNHHAI DGRSSTSFIKSWA
Sbjct: 121 FREVSEFHPLVPELPVSYDRAAVIAIQVTKFQNKGFSIGITNHHAIIDGRSSTSFIKSWA 180

Query: 181 QICMEESSVPTPKQVPLYNRSVINDPKDLAKIYAKAWKDAEGPNNKSLNLKFPQTKHGLV 240
           QICMEESS+PTPK++PLY+RSVINDPK LAK+YAKAW+D EGPNNKSLNLK P+TKHGL+
Sbjct: 181 QICMEESSIPTPKEMPLYDRSVINDPKGLAKLYAKAWQDLEGPNNKSLNLKMPETKHGLI 240

Query: 241 RSTLEFTHQNIQKLKEWILNKKIENEKFD-SSSHISTFAIATAYLCVCTAKLEGLKEGKL 300
           RSTLEFTHQNIQKLKEWILNKKI+NE FD SSSHIS+FAIATAYLC+CTAKLEGLKEGKL
Sbjct: 241 RSTLEFTHQNIQKLKEWILNKKIKNENFDSSSSHISSFAIATAYLCICTAKLEGLKEGKL 300

Query: 301 WFVFAADARTRLKPQVPLNYFGNCVVAGFVGLERSELLSENGIILACDEISKAIRNLDDG 360
           WF FAAD RTRLKPQVPLNYFGNC+V  FV LER ELLSENGIILACDEISKAIRNLDDG
Sbjct: 301 WFAFAADGRTRLKPQVPLNYFGNCLVGAFVCLERFELLSENGIILACDEISKAIRNLDDG 360

Query: 361 PLNGCESWGSRMSQEVTNDYLKMQGISLAGSPRFGVYNIDFGLGKPKKVEIVSAESPYVF 420
            LNG E+WGS+MSQ +T +Y K+Q ISL+GSPRFGVYN DFG GKPKKVEIVSAESPYVF
Sbjct: 361 ALNGSENWGSQMSQ-MTANYSKVQVISLSGSPRFGVYNADFGFGKPKKVEIVSAESPYVF 420

Query: 421 SLTESRNSDVVMEIGVVKKRDEIEAFVGIFNQGFE 455
           SL +SRNSDVVMEIGVVK+RDEIEAFV  FNQ FE
Sbjct: 421 SLADSRNSDVVMEIGVVKERDEIEAFVATFNQDFE 454

BLAST of CsaV3_1G033850 vs. ExPASy TrEMBL
Match: A0A1S4E1L1 (phenolic glucoside malonyltransferase 1-like isoform X3 OS=Cucumis melo OX=3656 GN=LOC103497183 PE=3 SV=1)

HSP 1 Score: 791.6 bits (2043), Expect = 1.8e-225
Identity = 387/455 (85.05%), Postives = 420/455 (92.31%), Query Frame = 0

Query: 1   MANFPSLKLIEVCKVSPPPTAAAPSSLPLTFFDLMWLRFHPIQRLFFYEFPSNEVSFHDV 60
           MANFPSLKLI+VCKVSPPPTAAAPSSLPLTFFDL+WLRFHPIQRLFFYEF SNE+SFHDV
Sbjct: 1   MANFPSLKLIDVCKVSPPPTAAAPSSLPLTFFDLIWLRFHPIQRLFFYEFSSNEISFHDV 60

Query: 61  IVPKLKNSLSLTLRHYLPLAGNLVWPSESDVPFIEFAEGDGVSMTVAESDDNFYHLSGNG 120
           IVPKLK++LSLTLRHYLPLAGNLVWPS+SD+P +EF EGDGVSMTVAE D +FYHLSGNG
Sbjct: 61  IVPKLKSTLSLTLRHYLPLAGNLVWPSQSDIPVVEFVEGDGVSMTVAEFDGDFYHLSGNG 120

Query: 121 FREVSEFHPLVPQLPVSHNRAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIKSWA 180
           F+EV EFHPLVP+LPVSH+RAAVIAIQVTKFQNKGFSIGITNHHAI DGRSSTSFIKSWA
Sbjct: 121 FQEVLEFHPLVPELPVSHDRAAVIAIQVTKFQNKGFSIGITNHHAIIDGRSSTSFIKSWA 180

Query: 181 QICMEESSVPTPKQVPLYNRSVINDPKDLAKIYAKAWKDAEGPNNKSLNLKFPQTKHGLV 240
           QICMEESS+PTPKQ+ LY+RSVI DPKD AKIYAKAWKD EGPNNKSLNLKFP+TKHGL+
Sbjct: 181 QICMEESSIPTPKQMSLYDRSVITDPKDFAKIYAKAWKDLEGPNNKSLNLKFPKTKHGLI 240

Query: 241 RSTLEFTHQNIQKLKEWILNKKIENEKFD-SSSHISTFAIATAYLCVCTAKLEGLKEGKL 300
           RSTLEFTHQ++QKLKEWIL +K +NE FD SSSHIS+F IATAYLCVCTAKLEGLKEGKL
Sbjct: 241 RSTLEFTHQSMQKLKEWILKEKSKNENFDSSSSHISSFVIATAYLCVCTAKLEGLKEGKL 300

Query: 301 WFVFAADARTRLKPQVPLNYFGNCVVAGFVGLERSELLSENGIILACDEISKAIRNLDDG 360
            F FAADARTRLKPQVPLNYFGNC+V  F+ LER ELLSENGIILACDEISKAIRNLDDG
Sbjct: 301 CFAFAADARTRLKPQVPLNYFGNCLVGVFLRLERFELLSENGIILACDEISKAIRNLDDG 360

Query: 361 PLNGCESWGSRMSQEVTNDYLKMQGISLAGSPRFGVYNIDFGLGKPKKVEIVSAESPYVF 420
            LNGCE+WG RMSQ VT DY K Q ISL+GSPRFGVYN+DFG GKPKKVEIVS+E P VF
Sbjct: 361 VLNGCENWGLRMSQ-VTIDYSKSQVISLSGSPRFGVYNVDFGFGKPKKVEIVSSELPNVF 420

Query: 421 SLTESRNSDVVMEIGVVKKRDEIEAFVGIFNQGFE 455
           SL +SRNSDVVMEIGVVK++DE+EAF+ IFN GFE
Sbjct: 421 SLIDSRNSDVVMEIGVVKEKDEMEAFIAIFNHGFE 454

BLAST of CsaV3_1G033850 vs. TAIR 10
Match: AT3G29670.1 (HXXXD-type acyl-transferase family protein )

HSP 1 Score: 304.3 bits (778), Expect = 1.6e-82
Identity = 178/457 (38.95%), Postives = 256/457 (56.02%), Query Frame = 0

Query: 6   SLKLIEVCKVSPPPTAAAPSS----LPLTFFDLMWLRFHPIQRLFFYEF-PSNEVSFHDV 65
           +L +IE  +V+P   +   S+    LPLTFFDL WL F P++R+FFYE   S    FH +
Sbjct: 2   TLHVIETARVTPTDYSVINSANLHKLPLTFFDLPWLLFQPVKRVFFYELTESTRDHFHSI 61

Query: 66  IVPKLKNSLSLTLRHYLPLAGNLVWPSESDVPFIEFAEGDGVSMTVAESDDNFYHLSGNG 125
           I+PKLK+SLSL LR+YLPL G++ W      P I  +E   V +T+AESD +F HLSG G
Sbjct: 62  ILPKLKDSLSLILRNYLPLTGHITWEPNEPKPSIIVSENGVVLVTIAESDADFSHLSGYG 121

Query: 126 FREVSEFHPLVPQLPVSHNRAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIKSWA 185
            R +SE H LVP+LPVS + A   +IQ+T F N+GFSIG+  HHA+ DG++S++FIK+WA
Sbjct: 122 QRPLSELHALVPKLPVSDDSATAFSIQITLFPNQGFSIGVAAHHAVLDGKTSSTFIKAWA 181

Query: 186 QICMEESSVPTPKQVPLYNRSVINDPKDL-AKIYAKAWKDAEGPNNKSLNLKFPQTKHG- 245
           QIC +E         P Y+RS+I  P  L  K+        E   N       P +K G 
Sbjct: 182 QICKQELQSMPENLTPSYDRSLIKYPTYLDEKMIELVRSLKEDQTNIRSLTSLPSSKLGD 241

Query: 246 -LVRSTLEFTHQNIQKLKEWILNKKIENEKFDSSSHISTFAIATAYLCVCTAKLE-GLKE 305
            +V +TL  +  +I++L+E + N          S H+STF IA AY   C  K   G K+
Sbjct: 242 DVVLATLVLSRADIERLREQVKN-------VSPSLHLSTFVIAYAYAWTCFVKARGGNKD 301

Query: 306 GKLWFVFAADARTRLKPQVPLNYFGNCVV-AGFVGLERSELLSENGIILACDEISKAIRN 365
             +  +F  D R RL P++P  YFGNC++  G    + +E + E G + A + IS  ++ 
Sbjct: 302 RSVSLLFVGDFRDRLDPKLPGTYFGNCMIPVGCYNRKAAEFMEEKGFVTAAEIISDLVKG 361

Query: 366 LDDGPLNGCESWGSRMSQEVTNDYLKMQGISLAGSPRFGVYNIDFGLGKPKKVEIVSAES 425
           L    +   E+      +  +      Q  ++AGS R GVY  DFG G+P KV+IVS + 
Sbjct: 362 LSSRKI---ETIADTFVEGFSFQSWSTQFGTIAGSTRLGVYEADFGWGRPVKVDIVSIDQ 421

Query: 426 PYVFSLTESRNSDVVMEIGVVKKRDEIEAFVGIFNQG 453
               ++ E R+    +EIG+  K+ E+++ V  FN G
Sbjct: 422 GEAIAMAERRDESGGVEIGMCLKKTEMDSVVSFFNNG 448

BLAST of CsaV3_1G033850 vs. TAIR 10
Match: AT3G29635.1 (HXXXD-type acyl-transferase family protein )

HSP 1 Score: 303.1 bits (775), Expect = 3.7e-82
Identity = 191/467 (40.90%), Postives = 272/467 (58.24%), Query Frame = 0

Query: 6   SLKLIEVCKVSPPPTAAAPSS----LPLTFFDLMWLRFHPIQRLFFYEF--PSNEVSFHD 65
           +LK+ ++ +VSP   ++  S+    LPLTFFDL WL+FHP +R+ FY+    S+  SF  
Sbjct: 2   ALKVTKISQVSPASNSSNDSANSMVLPLTFFDLRWLQFHPTERVIFYKLIKDSSLESFLS 61

Query: 66  VIVPKLKNSLSLTLRHYLPLAGNLVWPSESDVPFIEFAEGDGVSMTVAESDDNFYHLSGN 125
           VI+PKL+ SLS+ LRHYLPLAG L W S+   P I  +  D VS+TVAESD +F  +SG 
Sbjct: 62  VILPKLELSLSIVLRHYLPLAGRLTWSSQDPKPSIIVSPNDYVSLTVAESDADFSRISGK 121

Query: 126 GFREVSEFHPLVPQLPVSHNRAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIKSW 185
           G R  SE   LVP+L +S +  +V+++QVT F N+GF IGI +HH++ DG++   FIKSW
Sbjct: 122 GIRPESEIRSLVPELSLSCDSPSVLSLQVTLFPNQGFCIGIASHHSVMDGKTVVRFIKSW 181

Query: 186 AQICMEESSVPTPKQVPLYNRSVINDPKDL-AKI-----YAKAWKDAEGPNNKSLNLKFP 245
           A IC   +        P+ +R+VIN P  L AKI     Y    KD    + +SL L  P
Sbjct: 182 AHICKHGAMDLPEDLTPVLDRTVINVPASLDAKIIELLSYFSEVKD----SFRSLKLLPP 241

Query: 246 -QTKHGLVRSTLEFTHQNIQKLKEWILNKKIENEKFDSSSHISTFAIATAYLCVCTAKLE 305
            +    LVR +LE T +NI+KL+E     K E+ +     H+STF +A AYL  C  K  
Sbjct: 242 KEISPDLVRISLELTRENIEKLRE---QAKRESARSHHELHLSTFVVANAYLWTCLVKTR 301

Query: 306 GLKEGK-LWFVFAADARTRLKPQVPLNYFGNCVV-AGFVGLERSELLSENGIILACDEIS 365
           G  E + + F++AAD R RL P VP  YFGNCV   G  G + +  L E+G +   + +S
Sbjct: 302 GGDENRPVRFMYAADFRNRLDPPVPEMYFGNCVFPIGCFGYKANVFLGEDGFVNMVEILS 361

Query: 366 KAIRNLDDGPLNG-CESW--GSRMSQEVTNDYLKMQGISLAGSPRFGVYNIDFGLGKPKK 425
            ++R++    L   CE +  G++  +  T      Q  S+AGS +FG+Y  DFG GKP  
Sbjct: 362 DSVRSIGLRKLETICELYINGTKSVKPGT------QIGSIAGSNQFGLYGSDFGWGKPCN 421

Query: 426 VEIVSAESPYVFSLTESRNSDVVMEIGVVKKRDEIEAFVGIFNQGFE 455
            EI S +    FS++E R+    +EIG+  K+ E++ F+ +F  G E
Sbjct: 422 SEIASIDRNEAFSMSERRDEPGGVEIGLCLKKCEMDIFIYLFQNGLE 455

BLAST of CsaV3_1G033850 vs. TAIR 10
Match: AT5G39050.1 (HXXXD-type acyl-transferase family protein )

HSP 1 Score: 301.2 bits (770), Expect = 1.4e-81
Identity = 179/469 (38.17%), Postives = 262/469 (55.86%), Query Frame = 0

Query: 6   SLKLIEVCKVSPPPTAAAPS-SLPLTFFDLMWLRFHPIQRLFFYEFPSNEVSFHD-VIVP 65
           SLK+I+V +V+P  + ++ S +LPLTFFDL+W + H ++R+ FY+       F D VIVP
Sbjct: 9   SLKVIDVARVTPSNSDSSESLTLPLTFFDLLWYKLHAVERVIFYKLTDASRPFFDSVIVP 68

Query: 66  KLKNSLSLTLRHYLPLAGNLVWPSESDVPFIEFAEGDGVSMTVAESDDNFYHLSGNGFRE 125
            LK SLS +L HYLPLAG LVW      P I +   D VS TVAES+ +F  L+G     
Sbjct: 69  NLKTSLSSSLSHYLPLAGKLVWEPLDPKPKIVYTPNDAVSFTVAESNADFSRLTGKEPFP 128

Query: 126 VSEFHPLVPQLPVSHNRAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIKSWAQIC 185
            +E +PLVP+L VS + A+ ++ QVT F N+GF I +  HHA+ DG+++T+F+KSWA+ C
Sbjct: 129 TTELYPLVPELHVSDDSASAVSFQVTLFPNQGFCISVNAHHAVLDGKTTTNFLKSWARTC 188

Query: 186 MEESSVPTPKQVPLYNRSVINDPKDLAKIYAKAWK--------DAEGPNNKSLNLKF-PQ 245
             + S      +P+Y+R+VI DP DL      AW           E  N KSL L + P+
Sbjct: 189 KNQDSFLPQDLIPVYDRTVIKDPMDLDTKILNAWHRVAKVFTGGKEPENPKSLKLLWSPE 248

Query: 246 TKHGLVRSTLEFTHQNIQKLKEWILNKKIENEKFDSSS---HISTFAIATAYLCVCTAKL 305
               + R TL  T ++IQKL+E  L K+  +    SS     +STF I  +Y   C  K 
Sbjct: 249 IGPDVFRYTLNLTREDIQKLRE-RLKKESSSSSVSSSPKELRLSTFVIVYSYALTCLIKA 308

Query: 306 EGLKEGK-LWFVFAADARTRLKPQVPLNYFGNCVVAGF-VGLERSELLSENGIILACDEI 365
            G    + + + FA D R+ + P VP +YFGNCV A F + L     +SE G + A   +
Sbjct: 309 RGGDPSRPVGYGFAVDCRSLMVPPVPSSYFGNCVSACFKMSLTAETFMSEEGFLAAARMV 368

Query: 366 SKAIRNLDDGPLNGCESWGSRMSQEVTNDYLKM----QGISLAGSPRFGVYNIDFGLGKP 425
           S ++  LD+          +    E+   +  +    Q +S+AGS RFGVY +DFG G+P
Sbjct: 369 SDSVEALDENV--------ALKIPEILEGFTTLSPGTQVLSVAGSTRFGVYGLDFGWGRP 428

Query: 426 KKVEIVSAESPYVFSLTESRNSDVVMEIGVVKKRDEIEAFVGIFNQGFE 455
           +KV +VS +     S  ESR+    +E+G   K+ E++  V + ++G E
Sbjct: 429 EKVVVVSIDQGEAISFAESRDGSGGVELGFSLKKHEMDVLVDLLHKGLE 468

BLAST of CsaV3_1G033850 vs. TAIR 10
Match: AT5G39090.1 (HXXXD-type acyl-transferase family protein )

HSP 1 Score: 298.9 bits (764), Expect = 6.9e-81
Identity = 175/459 (38.13%), Postives = 259/459 (56.43%), Query Frame = 0

Query: 5   PSLKLIEVCKVSPP-PTAAAPSSLPLTFFDLMWLRFHPIQRLFFYEFPS-NEVSFHDVIV 64
           PSL  I V +V+P    ++A  +LPLTFFDL+WL+   ++R+ FY+    N   F  VIV
Sbjct: 3   PSLNFIHVSRVTPSNSNSSASLTLPLTFFDLLWLKHKAVERVIFYKLTDVNRSLFDSVIV 62

Query: 65  PKLKNSLSLTLRHYLPLAGNLVWPSESDVPFIEFAEGDGVSMTVAESDDNFYHLSGNGFR 124
           P LK+SLS +L HYLPLAG+++W      P I + + D VS TVAES+ +F  L+G    
Sbjct: 63  PNLKSSLSSSLSHYLPLAGHIIWEPHDPKPKIVYTQNDAVSFTVAESNSDFSLLTGKEPF 122

Query: 125 EVSEFHPLVPQLPVSHNRAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIKSWAQI 184
             +E HPLVP+L  S + AAV++ QVT F N+GF IG+T HHA+ DG+++T+F+KSWA +
Sbjct: 123 SSTELHPLVPELQNSDDSAAVVSFQVTLFPNQGFCIGVTTHHAVSDGKTTTTFLKSWAHL 182

Query: 185 CMEESSVPTPKQVPLYNRSVINDPKDLAKIYAKAWKDAEGPNNKSLNLKFPQTKHGLVRS 244
           C  + S      +P Y+R+VI  P ++     K W     P +  L L  P+ +  +VR 
Sbjct: 183 CKHQDSSLPDDLIPFYDRTVIKGPPEIDTKVLKIWHSIHKPKSLKL-LPRPEIESDVVRY 242

Query: 245 TLEFTHQNIQKLKEWILNKKIENEKFDSSSHISTFAIATAYLCVCTAKLEGLKEGK-LWF 304
           T E T +NI+KL++ +   K E+  F SS  +STF I  +Y+  C     G    + + +
Sbjct: 243 TFELTRENIEKLRDKL---KRESSSF-SSVRLSTFVITFSYVFTCLIGSGGDDPNRPVGY 302

Query: 305 VFAADARTRL-KPQVPLNYFGNCVVAGF-VGLERSELLSENGIILACDEISKAIRNLDDG 364
            FA D R  +  P +PL YFGNCV +   + L+    L E G ++A   IS ++  LD  
Sbjct: 303 RFAVDCRRLIDDPPIPLTYFGNCVYSAVKIPLDAGMFLGEQGFVVAARLISDSVEELDSN 362

Query: 365 PLNGCESWGSRMSQEVTNDYLK----MQGISLAGSPRFGVYNIDFGLGKPKKVEIVSAES 424
                 +W      E+   Y K     Q +S+AGS RFG+Y +DFG GKP K  +VS + 
Sbjct: 363 -----VAW---KIPELLETYEKAPVDSQFVSVAGSTRFGIYGLDFGWGKPFKSLLVSIDQ 422

Query: 425 PYVFSLTESRNSDVVMEIGVVKKRDEIEAFVGIFNQGFE 455
               S+ ESR+    +EIG   K+ E+   + + ++G +
Sbjct: 423 RGKISIAESRDGSGGVEIGFSLKKQEMNVLIDLLHKGIK 448

BLAST of CsaV3_1G033850 vs. TAIR 10
Match: AT3G29590.1 (HXXXD-type acyl-transferase family protein )

HSP 1 Score: 297.0 bits (759), Expect = 2.6e-80
Identity = 184/469 (39.23%), Postives = 264/469 (56.29%), Query Frame = 0

Query: 1   MANFPS-LKLIEVCKVSPPPTAAAPSSLPLTFFDLMWLRFHPIQRLFFYEFPSNEVSFHD 60
           M NF S + ++EV +VSPP + +   +LPLT+FDL WL+ HP+ R+ FY  P    S   
Sbjct: 1   MVNFNSAVNILEVVQVSPPSSNSL--TLPLTYFDLGWLKLHPVDRVLFYHVPELTRS--- 60

Query: 61  VIVPKLKNSLSLTLRHYLPLAGNLVWPSESDVPFIEFA--EGDGVSMTVAESDDNFYHLS 120
            ++ KLK+SLS TL HYLPLAG LVW S    P I ++  + D V +TVAES+ +  HLS
Sbjct: 61  SLISKLKSSLSATLLHYLPLAGRLVWDSIKTKPSIVYSPDDKDAVYLTVAESNGDLSHLS 120

Query: 121 GNGFREVSEFHPLVPQLPVSHNRAAVIAIQVTKFQNKGFSIGITNHHAIFDGRSSTSFIK 180
           G+  R  +EFH LVP+LPVS   A V+A+QVT F N+GFS+G+T HHA+ DG+++  F+K
Sbjct: 121 GDEPRPATEFHSLVPELPVSDESARVLAVQVTFFPNQGFSLGVTAHHAVLDGKTTAMFLK 180

Query: 181 SWAQICMEESSVPTPKQVPLYNRSVINDPKDLAKIYAKAWKDAEGPNNKSLNLKFPQTKH 240
           +WA  C +E        VP  +R ++ DP  L       W  A   NNK     FP    
Sbjct: 181 AWAHNCKQEQEALPHDLVPSLDRIIVQDPTGLETKLLNRWISAS--NNKPSLKLFPSKII 240

Query: 241 G--LVRSTLEFTHQNIQKLKEWILNKKIENEKFDSSSHISTFAIATAYLCVCTAKLEGLK 300
           G  ++R T   T ++I+KL+E     ++E E       +STF I  AY+  C  K+ G  
Sbjct: 241 GSDILRVTYRLTREDIKKLRE-----RVETESHAKQLRLSTFVITYAYVITCMVKMRGGD 300

Query: 301 EGKLWFV-FAADARTRLKPQVPLNYFGNCVV-AGFVGLERSELLSE---NGIILACDEIS 360
             +   V FA+D R+RL P +P  +FGNC+V +G   ++   +L E    G I A + ++
Sbjct: 301 PTRFVCVGFASDFRSRLNPPLPPTFFGNCIVGSGDFDVKAEPILEEGEGKGFITAVETLT 360

Query: 361 KAI-----RNLDDGPLNGCESWGSRMSQEVTNDYLKMQGISLAGSPRFGVYNIDFGLGKP 420
             +      N++   L   E++  RM           Q IS+AGS R G+Y  DFG GKP
Sbjct: 361 GWVNGLCPENIEKNMLLPFEAF-KRMEP-------GRQMISVAGSTRLGIYGSDFGWGKP 420

Query: 421 KKVEIVSAESPYVFSLTESRNSDVVMEIGVVKKRDEIEAFVGIFNQGFE 455
            KVEIV+ +     SL+ES +    +E+GV  K+D++E F  +F+ G E
Sbjct: 421 VKVEIVTIDKDASVSLSESGDGSGGVEVGVCLKKDDVERFGSLFSIGLE 449

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAE8653234.14.7e-265100.00hypothetical protein Csa_019805 [Cucumis sativus][more]
XP_011658008.22.8e-24994.71malonyl-CoA:anthocyanidin 5-O-glucoside-6''-O-malonyltransferase [Cucumis sativu... [more]
XP_031741446.18.7e-24391.63phenolic glucoside malonyltransferase 2 isoform X1 [Cucumis sativus][more]
XP_008457511.12.0e-23186.59PREDICTED: phenolic glucoside malonyltransferase 1-like isoform X1 [Cucumis melo... [more]
XP_011648510.21.9e-22985.24phenolic glucoside malonyltransferase 1-like [Cucumis sativus][more]
Match NameE-valueIdentityDescription
Q9LRQ82.3e-8138.95Phenolic glucoside malonyltransferase 2 OS=Arabidopsis thaliana OX=3702 GN=PMAT2... [more]
Q940Z52.0e-8038.17Phenolic glucoside malonyltransferase 1 OS=Arabidopsis thaliana OX=3702 GN=PMAT1... [more]
Q9LJB43.7e-7939.23Malonyl-CoA:anthocyanidin 5-O-glucoside-6''-O-malonyltransferase OS=Arabidopsis ... [more]
Q9LRQ73.8e-7636.62BAHD acyltransferase At3g29680 OS=Arabidopsis thaliana OX=3702 GN=At3g29680 PE=2... [more]
Q9MBC11.3e-7135.84Anthocyanidin 3-O-glucoside 6''-O-acyltransferase (Fragment) OS=Perilla frutesce... [more]
Match NameE-valueIdentityDescription
A0A0A0LXC13.9e-24994.49Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G498810 PE=3 SV=1[more]
A0A1S3C6Z89.7e-23286.59phenolic glucoside malonyltransferase 1-like isoform X1 OS=Cucumis melo OX=3656 ... [more]
A0A0A0LXY81.1e-23085.68Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G499330 PE=3 SV=1[more]
A0A1S3C5982.9e-22886.37phenolic glucoside malonyltransferase 1-like OS=Cucumis melo OX=3656 GN=LOC10349... [more]
A0A1S4E1L11.8e-22585.05phenolic glucoside malonyltransferase 1-like isoform X3 OS=Cucumis melo OX=3656 ... [more]
Match NameE-valueIdentityDescription
AT3G29670.11.6e-8238.95HXXXD-type acyl-transferase family protein [more]
AT3G29635.13.7e-8240.90HXXXD-type acyl-transferase family protein [more]
AT5G39050.11.4e-8138.17HXXXD-type acyl-transferase family protein [more]
AT5G39090.16.9e-8138.13HXXXD-type acyl-transferase family protein [more]
AT3G29590.12.6e-8039.23HXXXD-type acyl-transferase family protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (Chinese Long) v3
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR023213Chloramphenicol acetyltransferase-like domain superfamilyGENE3D3.30.559.10coord: 237..454
e-value: 1.2E-39
score: 137.7
IPR023213Chloramphenicol acetyltransferase-like domain superfamilyGENE3D3.30.559.10coord: 7..228
e-value: 5.9E-74
score: 250.4
IPR003480TransferasePFAMPF02458Transferasecoord: 25..445
e-value: 1.1E-46
score: 159.4
NoneNo IPR availablePANTHERPTHR31625FAMILY NOT NAMEDcoord: 3..454
NoneNo IPR availablePANTHERPTHR31625:SF50MALONYL-COA:ISOFLAVONE 7-O-GLUCOSIDE MALONYLTRANSFERASEcoord: 3..454

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_1G033850.1CsaV3_1G033850.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0016747 acyltransferase activity, transferring groups other than amino-acyl groups