CsaV3_3G020720 (gene) Cucumber (Chinese Long) v3

Name: CsaV3_3G020720
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v3)
Description: Phenolic glucoside malonyltransferase
Location: chr3 : 16919481 .. 16945633 (-)
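
The location field packs chromosome, start, end and strand into one string. The sketch below shows one way to split it and derive the genomic span. It assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which this page does not state; `parse_location` is a hypothetical helper, not part of any database API.

```python
import re

def parse_location(text: str):
    """Split a 'chrom : start .. end (strand)' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span is
    end - start + 1.
    """
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    return chrom, start, end, strand, end - start + 1

print(parse_location("chr3 : 16919481 .. 16945633 (-)"))
# ('chr3', 16919481, 16945633, '-', 26153)
```

The "-" strand means the feature is annotated on the reverse strand of chr3.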
The following sequences are available for this feature:

Gene sequence (with intron)

ATTTCTTGGCCCCTAATACTTAAATTTACAATTATTCGTTCAAAGTATAGAAGTGTTTTTTTCTTTGGAAAACAAAGTGGTTTGTTTTATTCAAATTCACGACTCACCATATTTCTTCCCAACCCTTCAATCACATGTTATATAGAGACAAAGAAGCTTAGCAATCAACTGCCACAACATTTCATGGCGACAAATCATGGGTCCACCGCCATTAAGGTCCTCGAGGTCTGCACGGTAGCTCCTCCGCCCGGATCCACCGTGCCGGTCACTCTTCCCCTCACCTTCTTCGACATCCTCTGGTTTCGCTTCCCTCCCGTCGAACGCCTCTTCTTCTACAAATCTCCGGTGCCATTCTACGTCATCGTTTCAAACCTCAAAAAATCTCTCTCTCTCGTCCTCCAACACTACCTACCCCTCGCCGGAGCAATTGTATGGCCTGAAAATTCCCCCAAACCGGCCGTCGAAACCGCCGTCCGCGACGGCATTGTGCTTACAGTCGCGGAGTCTGAGGACGACTTCGACCATCTCATCGGCGACGGGTTGCGTAAAGAGGCAAAACTTCGGCCGTTGGTGGCGGAGCTTGCAGCAGAGGAGGATCGGGCGGCAGTGGTGGCTGTGCAGGTAACCTGGTTTGGGAATGGTGGATTTAGCATCGGAATAACTTCACATCACGCAGTTCTGGACGGAAGGTCGTCGACTTCTTTCATGAAATCGTGGGCCGGATTGTGTAAGAATTTGGTTGGGGGCGGTGAGATTTTTTGCCCGGCAGCTGAGACGATGCCGTTTTATGATCGAAGCGTGGTGACAGATAAGATGGGACTTGAAGCCATTTATTTGAAATGCTTGTTGGCCCATGAAGGGCCCAACAATAGAAGCTTGAAATTTTGGGACTTCAAAACTCCGCCAGATTCATTCCGAGGTATTTCCAATAATAGTCTTTTATAATCAGTTATTTAAATGTCTACGATAAAATCTAAACGGAAAGAAAAGGGGAGAAAGCCAATTGCAAGTAAAGACTCTGACTTCCTTATTATATTTGATATTGGATACAAATGATCAAATAATGATATTCTAGAATATGTACTAAGTGATACAATGGACCAAGAAAACAAGTAACAGGGCTTGTAAACAATATGACTAATGGGCTATCTTAACACCCCCACAGATCTATATCTGTGGTACCAAAAACAGTCCATTTTTGCTTAGGTTTTCTTAAAGTTGGTAGCAGATAAAGGTTTGGTTAACACATCTACAATTTCGAAGGTGTGAACAAGAAGCCCCTGATTTTGTATAAGGTCCCGAACAAAGTAAATGTCAAGTTCAATTGTACTGCACTTAAATGATTTTGTAAAGCTTAGTTTTAGAGTGCAGTATAGAATTAGCACTCAAGTGTACTGCACTTAAATTGTCATACCACATAACAAGAGGATAAGATATTCTAATGCATAAATATGTAAAAGAGACTGAATTCATACCAACTATGTTGCAAAAAGAGCTAAACATATATGCATTCATTTGTACCTGATGTTGATATGATTATTTGCTTTTTCCGAATCCCAGTTAACTAGATTAACACCAAAATAGACACAGTATCCTAATGTAGATTTTTTATCATCCGGGTCAAAAGCCCAATCAGCATTAGCAAAGCCTACCAAGTTTAATGTAATGACCCAACTCAAGTCATTACTAATAAAACAAATAAAAACTTTTTATTTAAAATAAAAGAAAATACTAAATTTAAAGAAAACCTTTCGAATATTCATTTAAAATCAACAATAAATAAAATGTGTGAAGAAAATAATAAATAAAAATATTATTTAAAATACTAGTTCAGGCCCTCTTTAAAATAAAAGAAATATCTAAAAGTAAATATAAGTTTCTAAGATATTTTTTTGAAAATTGAGTACCGAAAACAATCAATGCTGAATAAAACATATATTGCCCCTCTATGGTGAACCACGGTTTCTTCTCGTCATTTGCAAACTTGTCATTACCTCTAC
CTTAGCCTGTAAAATTAAATATAGAGAAATTGTGAGTATAATTTACACTCAGTAAAGAACCCACTATTAGTCCTTCTAGATGTTTGTTAAATTTATGTTAGAGTCACGTAAGAAGGTACTGCCTAACTATACAGGTTAGACGATGTGTTATATCATACTTGCTCGTCTTTTAGGATGCAAATATACAGGTTAGACGATGTGTTATATCTAATCATTGTGAATGTAGTAATAGTAACTAATCCTCAATTAAGATACACACCTCTCAGTGCTTTTTGCCATACACATTAACCTCTCAGTAATGTGTGACGAGAAGCACTGTTGCATTCTCTTTGCCACTTGCTTAGCTAAAAGTGGTTCTCACGTATTTGGTGATTATGATCTCTACTTTTGCTCTTCTTTTGAATGAACAAGGTTAATATACTCATATTTAAATGAACAAAACTCTTACTACACACAATATAACTTTTTAAGTATCAAATACACCATTCATTACTGAGTAGCTATTCAATCATACATATATGGAAGAGATAAGGAAGATGGAAGAATGGAAGAAGAAATGCATTGCTTTCATACATAATATTTTAGACCTTTCGGCCATAACGTCCTGTTTACAAAGCAACACAATATAAAAACAAGAAAATGGAAATACAATAGAGGTGTTATGCTGCAGAGTTTGATTTTTTTAATTCTCTTTGGCTTTGATTTTTGCCTATATGTGAGGGGGATTAACGGGAGTGATGAATCTCTCTCTCTTTGCTTTAAGCTCTCACCTAAAACTCAAATTAATGGAGAGAAAATGGAGAATGTGAAGAACTTACTTTTTCAGCTAAAGTGTGCACACCTCCTTCATGAGTGAGAAACTTTATTTATAAACACGGAGTGAAACTGACAAACTAGCACTTGCCATTAAGATTCAAAAACGTTGTAGCTACTCTGCGAAGTCTGGCCTGCTTGTCCTTTTCTAACTTTCTCACCAACCTCTTTTGTCTGCTAGCAACTTGATCCCCTAACATCTACTTCCCCGTGCTAAAGAGGTTGGTTGGATTACTTCACGTACCATGTTCTATTCACTGAGCTTCACTCTTCTTTGCTTAATATTATACAATAAGAGTACTAAAACATAATAAAATAAGCATGAAGTTCTTTTGCGGTAAGCTTTTTGCAAAGTAATCAAATCACCTACCTTTTGTCTTTATTTTTTTCTATTTTCATGCGGAAGTCGTCATTATTTTACATAAACGACTATAATAACTTGTATTTCTAAAAGTTATCAATACCTTAGTATTATTTTGATGTAAAGCTTTAAGTTCATATTCCATAGCTTTTTTCTAGTGAGAGTGTTTTAAGGCTTATTTCTCACTATTTGGTTCATGTTGAGTGTAATCAACTAAAAGAACTTTGAGTTTAAAGATTCCACTATTATCTTGGGTCATCATTGGATGAATGGGAATAATTGATGTGGTCTATAGCAGTAGGAATAACTTGATGTCCATTGGTTGGCCAATAATGAAAGTGGGCTTCATCTTGATTCTCTTCAATTTCAGCTTCAGTGGATGAGTTACCTTGAGTCCCCATCTCTAAAGGATGCACAATAGTAGGATTTAGATGATCGTTAGCATAAAAAATGTAGCCTCTCTACACTATATGGCTCATATTTGGATTTTGAAGAGAATGGTGTGGATGAGAGGGCAGTTGACTACTAATTGAGTTCGTGGTTTGGTATGTTTGAGAAATGAGGCATAAGGAAAGGAGTTTTCATTAAAAAAATAAATGTCTGGATATATAGATACTACTTTCAGGTGTAGACATTTATAGCCCTTATGTGATGTGCTATACCTAAGAAAGATACATGGTTGAGATATGAGAGAAAGTTTGTGAGATTGATAGGGTCTCAGATATGGGTAGCATTTGCATTCAAATATTATTAGTGAAGGATAATTAGGTTTTTACTAAATAGCTTCTCCAAGGGACTGAGATTTTGTAAGACAGGGGTAGTAGAA
AAGGCTTCATCCCAAAAATTTAAAGTGTGGCTTGGGATAAGAGAGTAAGTCTCATATCCATGACATGTCTATGTTTTCGCTCCACTATGCTATTTCGTTGTGAGGTATGGGGATAAGTTATCCTATGTTCAATGCCTTTTTGTTGAAGGAAGGGAACAAATGGTTTGAATTCACTACCTCCATTTGTTTGTAGAGATAGTATAGGTCTATTTAAGGATTTCTCTACTAGTGTATTAAACTATTAAAAGGCAAAAAAATGAATCATATTTCAAGTTTAAGAAATAAATGTATGTATATATGCTATAGGCATCTACAAAACTAATGTAATATCTAAACCCATTTCTAGATGTACTATAGGCAGGTCCCCACAAATCACAAACAATAAGTACTAAATGTACTAGATGTCTTCCCATTTTGTATTCATTGAAATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATAACATGTTTATATCTTATCAATCCATCATGTGTTTTCATTCTTCATCAGTCAATATGTGATTTGTGCTTTAGGTTAGTGATAAGTTCTTAATAAGAAATTTTTTATTGTAAGTCTTATCGTGTTAGTTGAGCTATTCTCATCTCTTGGCCTAGCGTAAATCGTCAACTACCCGAGAGGGTAAGTAGCAAGAACATGGCCAACGCGCGCAAGGAAGAGTTTTCTCTTTCTTTGCGAAATAACTTCGATCACTCGTATCCTGAAGGTGAACAAGTGTGGGCACTAGGCGTCGAGAGGTTATTGTCCTTAGTTGTGAAGTCCTATACGATTTATAAGTGAACACTTCTAGTTATATCTTGCCTAATCATATTTAGTCATATGGATCCCGAAAGGTGACCATGTGGGACATGTAATGACTTTGAGAGAAGCACACCTTAATCATAAGCATGGTTGACTAAGTTGTATCGCATCAAGGCTATCATATTACTTAATCGTGTTATAATGCATTGATGTTCCTTCAATAGCAACATTACCGAATTGAACTGGGTCGGGTCGGGTCAAATAGGTTTGGAAAAAGAGGAAGAAAAGTTCGATTTCCACCTTCATCTACTCCGTTGGTTCCGTCTTCCTTCTTCTGTTCAGAAAAGTCTTCCCCAAATCGTTCGCCCAATCTCCAATACTCCTTCTTCCTTCAACAAATCGTTCGCCTAATCTTCGATCCTCCTTATTCCTCCTTATTCCTTCTTCTATTCACGTGGATCTGAGTTCGATTTCGACGATTCTCTGTTCATAATTATGTTTCATGGGTTTTTTTTTTAAAAAAAAATTGAATCTGGGTAGCTGACATATTGAAATGCCCAAATTTTGTTCAAAACCTAGATCTGTGAAACTTCAATTGTTCAAACACTAGTATAATCTTTATGATAAATCAATTGTTGTGGTGATATTGATTGGACAAAGAAAAGAAAACAGAGGAGAAGGAAAAGAAATAGGGAAAAAGATGAAAGAGAAGGATGAAGAAAAAAAAAGAGAGTAACGTGACACTTAAGAAAGAAGGAAGGAACCAAGGGAGGAAGGAAGAGAAGAAGGAAGGAAGAATGAAAGAAGGAAAAAGGAAAAAGGAAAAAGGAAGAAAGAACGAAAGAAGGGAAGAAAGGAAGGAAAGAAAGAAAGGGAAAAAAATAATTTAAGAAAAACAAAGAAAAAGATCTGACTCGACCTGAATTTGGGTCGGGTTGGATCGGGTCATAGCATTGTGGAAACCGTGCACCTCGGGTTGGGTCGGGTTTGGATGAAAATCGACCAGACCCGACCCAGCCCAATTACACCCGTAATAAATTATTTCACTTACCGGAGATACTCACCCTACCATATAACTCTTGAAATTGAAACTTCTGTATGCATATTAGTATTCATATTTTAATCACGTTTTATTATTATATTTTTAGTCTAATATAACTTAAATTTGTTGTGCTAGCTTCTATTTTTGTTCTTCTTACAAACAAACG
ATTAGTGTGATTTTTTCTTATGGTAAAGGTAGGAGTTAGAAGAAATTATATTTTGGGTTAAATTACACGTATTTAGTGTGCGTGCATTTGAAATATATTTTCAGGGATTTAGTTTTAAAAGTAACGTCAATTCTAAAACAAAAATTAGAATGTGTTGGTAGCCAAACAAAACGAGTATATTTATAAAAAAAGATTAATAAAAATGAGTTTTTTTTTAAAAAAAAAATTCTTAAATGAATTCAATTGGACCATTCATCTTTGATAGGTTTAGGCATATGTTTCCTTGATTTCTAAACTTGGAAGATTGAATTAAATGCAATATTTTTACTTCCTTACCTATAGTCTATTTATTTGTCTATTATATATAAATATGTGTATGTTTAAACTTTAAACTTCCAATAATTTTGTAATTTGTTTTACAAAAACTTTTAAAAACAGATTGTTTTACAAAACTTTTAAAAATTGAAAGTTAAAAACTAAATTTGTAAAGAAAATTTAGGAACTAAATACTATGGACTAAATTTTTCATGTAACTACTTTTGTAGTGCTAACTCAAATATGGTGGAACTTGGAATAATAATTCAAACTTTAGAACCCTTAGTGTATAACCAATACTGTCCACAGCCCGCACACAACCTTAACCACTTCTAAACAATAAATAGCACAACCATAACCAAAGATAAGTAACGGAAAAACAACACGGAACTTCACTTTAATTCACTTAAGCTTAAGAGTTCTACAACAATTCAAGATACAAATTCTTAAAGACAGGTAAATATAAGAAATTCAGTCAAAAGCCTCCAAAGTTTTCAATCTTCAACGATCGTTTAGTTTCTAGTTTGATTGTTTAGCTCCGTAAACCATTGCTTACCTTTTTGCGTGGTCATTTTGGAATCGCTTAGCTTCCTAAACGATCGCTTACCTTTCATCACAATCGCTTACCCTTTTAGCTTATTGCTTAGCTTCGTACAACTTGGAAAGGAAAAACTAAAAACGTAAGTCAAAAGACTTAGTGAGTGAAGTTTTAAGAAAACCATTTGGAACACAAATCATAACCCAATCGTTTCATTAAAACAAACTTAATACCTCGTTTTACAAATATATCAGATCGCATAAATATTCATTAGTCTCACATGTTCTTTCGCCTAGAACATAAATCATTCCCTATACCCAAACACAACATCTCTTGTATTTAGACTACTGTGATATCTTTTTCATCAACAGTATCCAAGAATTTTCTAGTTCATTCCTTATTGCTCAGATCGCCAAATACAATATGCTCCTTTTACGCATTAGTCAGCTTTATTCCATAATGTAGTCCTTTTCTACATGATTGTCTTTACTTCTTATGTGTTCATAATAATTAGCTCATTGATAGGAATAAGACTAAGAGTTTGAACACAATTTCACATTTATTTGAAACACATAAAACTATCTTTAAACATATCATATGCTTTCGTTCGAAATTTCTTTATAAACATTAATGTATGAAAATATCATTACAAATCAGCATGAATTTAGACAAAAGTTTTAAAATCTGTTAACACTTCTGGAAAGACACTCACAAACTTAAGTCTCATTTATTTTCAGAACGTAGCTCCTCTCAAGTTCCTCTTCTTGTCTGAACAACTTCTTTGACAGCAAAACCCTCAATTTCCTTCTCTTTGTAAATTCTCATAATTGTTCGTTTCCTTTTGTCTTACCTTCTTATTTAAATTAATCTTAAAATAGATTAGTCACTTCTTAAACTCTTATTAATGTTGTCAGCTCCTACTTAGCCACAAAAATGCATGCAATATGCTTAAGATACCATGAATCCAACTCCACTAACCTGATAATTATAAGCTCAAAAGCGTCTTTAATCTTTCTACACGATCGTTCATCCTTCTAAGTAATTCTTCAATCTTTTACACGATCGTTTAGCCTATTATACGATCATTCAATCCTCTTACAAGATCTTTTATCCTTTTAATGATTGTTTCCTTATTATACGATCATTC
AGCTTTCTTACACGATCGTTTAACTTACTACATGATCATTTACTATCTTATATGATCGTTAGACAATTTTTTTTTAGAACCTTCTATTCAGGCACGAGTCTTACAAGTACCTTGAGTTATTTTGATATGAACATTTTTTTTCATTTCATCCTCTATAAATAGAAACTAAGGTCTCAAAGCTTTTTGACATACAAATATTGTAAGATCAAGGGAATTGTTTCACAAAATTAGTTGAAGTGTGCCTAAGTTAAGCTAAAAACTCATGGATTAGAAAATGTGTATTTCTAGAAAGTATATATTCTAAAATGAAAAAGAAAATACATTTAAAACAAATTAAATAAACACAAAAAAAGTTATAAGAGAGTTGGGAAGAAAATTCATTTAATTAGGATAAAATAGGCCCAAGATTTTTAATTGGGAAGCAAAATAAGATTCTTTTTTCCTTTTAGTTGTAGTAGAGTAAGATTCAACGTCAACTTTAAAGAACAGAAGTTATGCCATTATCACTGGATAACACTCACTTCTATCGGTAAATTAATGAAGAGTAATAATAATAATAAGGAAAAATTGCATAAACAACCCCTAAGCTATGGGCTTGATTGTCATACCCCACCCCGAGCACTTGCTTGCTTGGCCTGAAATGTGTCGTGAAGCCAATTGATATCATCTCCTTCAAGAACGGCACCAACAGGCCCATAACCTAAACTCGTAAACAGTAAACAATAGTGGAAAGTAATACATAACTTATAAACATAACTGAAAGAAACTTCTTTGGTTAAAACAAACTTTATATAAGTTTACACAGACATAACCTTAATTACACAACAAAACCCAATAATAGATTTTCCAAATCCGAAGTCTCACATTACAAGGACGAATTGAACAAAACATGTACGCTAACAACTAAACCCTTCACCAGAAAGATAGCAGCGATTCAGACTCCGAGGACACTGTCTCTACCTGGAAAGTGGGAAAACATTTTGAAGAGTATGAGCTAAATAAGCCCAATGAATGGTAATTTTTCTAAATACTTTTCATAAATCTTTGTAACAGTAATATGTATATCCTTTAAACACTAAAAGATAAATTATAACATAAAAGCTTTAAACCTTAAAGCAATTAGAAAACAATAACATAATCCTGGTTTTTCCTACTCGAAACTCAAGTATCTACTTCTGAGAGGTAAGCGTACTTACCTCAAGTCATTAGCGCCCTTGCCTCATAATGGGTAATTTTAATCGTGCCTTGTTAAGGGCAAATTTTTAAGCTTGCCTAGTTATAAGTAATCTTGTTCCGTTATGAGTACTTTTTTAGATAGCTTGGTTTCGATAAAACATCCTTGCGTGAGCCTTAACAATATTTCTTTGATAAACATCATTTCTAGTAAACCTTAAACATTCCTCAATAAACAGTATGCTTGAATATATACCTAAACAATAACATGCTTGAAAGACTTTGCATAACATAACTTAAACATTTAATCATAAATCATTCACAAATCATGGCTTAAAAAATGCTTCAAATCACTTTAAATAAAACATTTTTCACTCATAGATACAAGCTTAAATCCTTAGCTTATAAATTTGTCCAATCTCCTCTTGTTCTAAAATAATGGTAAAATATTCCAATTAATTCTCTTTTACCAAGAAAATATCAAAAGTCTACTCAGAAAACCCTAAAACCACTAAGACAGCTCAAATTGCCATATGACATTGTCTGACGCACATGGGTCGGCGACGGGGCCTTTGCCTCGCGCTGAGTTGCCCACATGTTAACCTCACAAGAGATTTGCCACGCACGCTGGGCATGTTCGCCTGCTAACCTCACGCACAGACACATCGCGTGCCTCACCTGGTCGCGTGCTATCTTGGTTGCATTCGCCTTCCGCACAAGTCGTTCATCCAATGCCAACCTCCACGCACACTAACCTCTTTGAACATCAAACTGCCTTTCCATCTTGCCTAGAATAACCCATCACTCGGTTAGCCACCAAGTTGCCC
AAAAACTAAATTTTCATGCAGAAAACATGACACGACTTTGAAAATCCAGAACACTTTTAAAGAAACTGATAAACATAAAATTTGAAACTTATAAACTTAAAATTTAGAACGCTTTTGAGGAAACCTCCTTCTATAAAAACACTTATAAACGTCTTAATTGTCTTACCGTAAAATTTTAACTTTAGATTCCAAAAGATGAGCTCTCCAGCCCTCTGGAAGTCCACATTGCACAAATTCACCCAAAACCTGACCTTCCTCTCTTTCTTCAACTCAAAATCAACCCTTGCTCAAAATCATATGGCAGCCCTATTCTAAAAACTTGACTCAATTTATGATCTTTTGAGAATAATTTCAGCCAAAACACACAAGTGGGGAAAATCTTCATATGAAGTAACCTATTTATATTGGAGCATGCATGAACCTTTATTAACGCCTCCTACTAGCAGAAATCTAAGTTAAGCATGCAAGCACACTTGTAAAGCAGTTCACCGCTTCCACCTCTTCATCATCTCGCCTCATGTCCCTTGACACAAAACGCCTAGTGTTAATGCATTGCTAACCTCTTTGCCCTTCTGGCATGCCAAGCTTCTCGCCCAACAGTCTTCTTATATAACCTCTAAACGCCCACAAAACCTCTAAATTACAGCATTTCTCATCCTTTGACACTCATTTAGAGGTTCTTTTTCACGTAGTGCATAGCCAACATGCCAAATAACCTCTAAAACACCTACTACACAACCCTTGGCTCCCAACCTTAAACGCATACTAACCTTTTTAGAGGTTATTCCTTCTCCTCAGTCGCCCAATGCACCGTCAAGTTAACCTCCAAAGCACCTAGTAACACTCATGTAGAGGTTATTTCCATCCTTTTGGCCGTTTTTACTACACAAAGGTAGCCTCTAAACTCTTTGCATTTAAGTCTCAACACTTGCCAAATTTTCCTCTTTTTTCACTAACCATGACACCACTTTTTATAACACTTTTATCCTTAACTCTCTTCAACACTTAAACTTCTATCACTTCAAAAAGCTTATGTTTTCTTCCAAGGTTCGAGGTTTACTTTGATTGCATTCATACCCTTCAACTTTCAATTTGATCAATTACGCTCCGTAGTTTGTTAATTTGCAATTAGCCTCTTTCTTAAATGAATCTTTATTACCTTAATTGTAATCGAACATTTTAAAATTAAAAAGATAGAAATAAAAATAATCTCTCATCACACATCTTTTCCTCTCTCCTCTCTTTTCTCCTTCCATCTACATCTTCATGATCAAAGAAATCATGAAATGACACTACAAAGCAACTACCAAAATATTAGTATCCAAATTTATCTCTCTTTTCAGGCTCACTTTTCTCCCTTTCTAGAGCTCAACAACCATTGTAGAAACCACTACCACTACTGTCATCATCGCCATTACTTTCTTTGTTGCAGAGATCGTTGTTGCCTCTTCAAAATTTCACCTTTACCCTCTTCCTCAACCCTTTTCCAAACTCCATCTTCATCTTTTTTCTTATGCAGCCGACAACTACTTGGATTTGAAATTTGGATTTAGTGTTGTGGATTTAGTTTTAGGAGTTCAGTGTGTGTTCTAGTATATATATATATATATTATATATATAAATTATCCATGTGGTGAGTCATGGTCATTGCCTGTGAGATGCTTAATATGTCTCTAAATTCGCGTGTAGTTACTAAATCATCGTTGAATATAAGTTTTTCCATTGCTTATCCAAGTGTAAGCGAAACAATAGGTTTTCTGGATAAGCAGGTTGAACATAGGAGGAATTGATATCAAAGCTTAGTTCTAAGGGTACTTCTTCATCTAGGTGGTATAAGATAACTATACAGTATAAAATAAATGTGTGTTCAAGCATAGTCTACTATCTACACTAGAGATTCTATAAAAGGTGGTAAGCATGGTGAAAAGATGGGGGAATGAACTAACTTGTGTTAAAGAAACATGAAGGAAGGTCTCTGCAATGTAAGTGCTTAAATCACG
CAATAAGCTGTGATGCGATAAAGATCAATTAATTAGCTTCTACTTAAAGTGTGCACCACTCAGTGTCTTAAATACCACACTTGTTCGCCTCTCAGGATGCAAATGACAACGTTTCATTAAGTGATGACTATCTAGTAGTTGTTACTTATAAGATTTATAACACTTCTCTTTTAAAGGTGGAAATCTCTTAGCGGCAAGTATCCACACTTGTTCTCCTTTCGGGACACAATTGATGGAAATTTACTAGCCATGGTAGAGTTTGTCTCCAAAATCCTCCTTGCGCGCATTAGCTATATTCTTACTACGCTTACTCTCTCGAGTAGTTGGTGACAAGCACTAGTCAAGTAATGCATTATAGCTTAGCTAACGCAATAGACCTTATAAGTTAGACTACTTATCAAGAACTTATCACAAACCTCAACTACCTATAACACTTTGACAAATGAAGAAGTAAAGTGATGATATATTTAAAGTGGATAAATATGCCATAGGCCACTATATATATGAACACTTTACATTACAATGAAACTTAAAACAAACTTTATATAATAATAATTAAGTATAGCCCATCTAAACAACAAAGTACAACAAGTTTGTTATAGACGAATACAAAATAGAGTTCATAAAAGACCAAGATTGTCACTTACAACCCACAACTAACAAACTATAACTCAACTCATGTACAGACCGTAAGTACTTTTAGAACAAGGCGTACACTAATGAAACTAAAAAATACACTAGCATATAAGAGAATAAAGTCAGGCATCGGGTCCACGATCTCTACCTGGAATGTAAGAAATCTTTTTTGGAAAGAGTGAGCTAAGGCTCAGTGGTTGACTAGCTTTTAAATCAAACATATTTAGAAAATCAATGAGTAGTAAACAACTTATCAATCACACTTTAAGCAAAATAAATGCATAGTGCTCAAACTTCAACAACTATGTTCAAATATTTCTCTTTCAGTTGCTATACCTTGAATCTTTGTACTCTAAAGACGAAACTCTAAGCATCAATAGTGCCCTATCTCATGATGTGGCTCAAGTATGAGATGAAAAATAGGCATCCGTAGAGGTCCTAATCTCTACAAAGCATAATGGTGGAAACTCTAGGTATCTATTGAATAGACTCATCATTAGACCTCTGTGCATAGAAGAAACTCTCCTATGGGCTTGCTACAAATCTCTAGATATCTCTCAGGTATGGACTAAAACATACTCAAACAATCGCTTGCTTAAAACGTTATCTCAAACTACAATGACATGTTATACAAATCTCTTGCATTCACTAAGTCCCAAATTTATAAAATAAGCTTTGTTTAAACAACATTTAACCTAAATACTTTCAAAAACTCAACATTCATATCCTTGCTTATAAATCATGTATAAAGAAATATTCTCAGTAAACTTATTAGTTAAACTCATGCTTTTCAAACTTTTATAAACTTGAAACTCATGCTTTGGAAATCAACTATATAAATCAAGTATAGTTCTCAAATATGTTAACTCAAAAGATGCTTTAAACATAAACTTGGTAAATCATGTTTAGGAATCAATTTAAAATTCATTTTTTTCACTTACAGATGGTAGCTAATTCTGTGGCCTATAAATTTGTCCAATCTTTTCTTGGCTTGAAATTAGAGAATGTAAACTCATTTTACACCTCTAGAATTAGAAAACATCAAAATTTAGCCCAAAAATCTCAAAAACACCAAAATTGTCCAAAAATGGTGTTATGGCACGACTAGCTGCTAAGCATGGACGACAGGTAGCATGCACGCACACACAAGCCCATCCTAGGTGTCTGGTCACGTGCTAGCCTCGCGCACTAGACATGTTGCGTGATAACTTGTATATACAAGTTATTGTGACCTTTTATGAAAAATAATGAAGCCTTATCACATAAGAATGAAAGAAAATATGAAAAAAAAAGTAAAGAATTTGATTAACTTGTGTATCACATCTTACAAAAGAACTTTATCCCTGTTTTATAGTGTTTTAAAT
GTCTTATCATAGAATATTAGGCAAAAAGAAGTAAAGCTCGATGATAGAATGTCACGTGGAGAGACAGGTTCAAGTAACAAGGCGACGAGCAATCAAGCTAGGCGGTCTAATGCACTAGCAGACAAATTTGGTTGGTGTGAATCCTAAAAAAAGGACGAAGGGACGTTTGCACGGCGTCGGTGCAACTTTAGTCAGAACCATTTATGATGTGTGCAATGTCCTGTCACAACGCATTCCGCCTACAAATAAAAGACTCTACTTCATAACAAAGTGTTCGATTTTTGGATTGGAGAGCAAGCTCACTTTCTTCTATTCTTTCTTTCTTGAGTTTTAGGTGAGAGCTTAGAACAAAAGGAGTGAGATTTTTTCTCCAACCATTTCCCCTCTCAAATAGGTAAAAAATCAAAGCCAAAGAGTATGAGAAATCCTACTCTGAGGCTTGATTCAGTTCTTTGTATACCATTTTATACTTTTATTGCATTACATTGTAACATGTACTTTATGGCTAAGAGGCTTAAAGTTATTTTTATAATAGTAATGCACTTCCTCTATCATTCTCTAATCTCTCTATTAACATCTACTTGTTGTGAAGTCAAACTAACTCAACAACACCAAGAACTTGACTTAGTTGGTTAACTCCACGGACTTAAGGAATAGTGTTGTTGCTTATGTTAGTGTTTACTTCTTAGTAAGAATCTCTTCATCATGTAACTGCTAAGTCCTCGTTGTGTACAAGTTTTCCTACTATTTGCCCAAATATAAATGAAACAATAGGTTTTTCTGAGTGATCCAGGGTCGAACATAGAGAATTGATATCAAAGCTTACTTCTAGGGTTTACTCATCATTCAGGCAGTATAAAATAATAAACTATACAGTATAAAAAAGTGTAAGTTTAAACTCTATCTACTTATGTACGCTAAATAGTGTAAAGGTGGAAGTATTGAAGATGGTTGCAAGAATGAAATGCAAGAGTGAACTAACTTGTATTAAAGAAGGGGTGAAGGAATGTCTTAATGCTACCAAGTGAGTAAGCTATGAAAGTCTTAATGCGATATGACTTACTTAACTAGTTTATAATTAAGGCAAGCATCTCTCGGTGTCTTTAAATGCACACCTTGCACACCTTTCAGGATTTAAATGACTAAACCTAGCTAAACAAGGTATATTTATAGGGTGAAAACCTAAGGTGTTAAAGTTATAGAGTTTCACGTTTAAGGGTGAAAACCTCTTAGTGCCCAATACCCACACTTGTTCTCCTTTCGGGACACAAAGATGGAAGTTATCTCGCCAAAGTGAAGTTTGTCTCCAAACTCCTCCTTGTGCGTGTTAGTTGTATTCTTGTTACTTACCCTTTTGAGTAGTTGGCGATATCACTTGCCAAGTGATCAAAATAGCTTAGCTAACACGATAAGAATTATAAGCTAAGCATCTTATTAAGAATCTATCATAGATCCTAAACTACAAATAATGCACGAACATATGAACAATAAAGAGATTGATGAATTGCAAAGGCATAAATATATGTTGTATTAAAGAATAAAGGCCTTCAGCCACTATACAATATGAGTAAATACAATGTAAATAAATACAAAAATAAAAAGAATAGAAATGACACTACAAGTTAAAGGGAGATGGAAAATGGGACTTCACTCAGACCGCAAGCTTTCACTCATTGACTAATTGAAGAAGATGAAGAAAAACTCTACAAACGGTGAAAAGGCTATTCCTTTCCACCTATCTCTAGTGGCTGGTCCATTCAGATGTCAAGTTCCCTAGAGCTTGAATTGGCTCATACAAGCTTGCTTTTCTTGGTAGGGATGAAGTTCCTATTATAGACAGATTTAAGTTGGAAATCTATCCTTCTGAGCATGTCAGCGCCAAAAGCTACTTCCATCATCAAGTCATATGAACTCCATCTATCACTTTTGTCTTCTAGTGAACCTTTTTTCGCATTATTACGTGGTCTTCTTGCCTAGTCATTGTGGGAGTTCCTCTATGA
TTCGCTAGCTCATTCTTTTGTTCTTTAGTCTGTGTTAAGGCACTAAAAATAGGGTGAAACAGGCATGGTTGCTTACGTAAAATGTATTTCTAAATATTTATCCATTTACAACATAAGCTAGGCGTTTTAACATGTTTGACATGCGTTTTGGTCCCATTAGTCTATCTAAAAGACCATAAAACTTGTATTTCTACAAGTTATCAATATCCATGCTAAATCCATTTAATTAATTTAGGACCAAAGTAATGTTTTATTAATATTAGAGAATATGAATACGTCTTATGAGTTTTGTTTGTCTAAACACAGATGCCAAAGCATTGTTTATTCAAGAGAGACAATCGTAGAGATAACAATCGCCTGACAAGCGAGAGCACACATTTAGTTAAATAAGCGGCAAAGAGAAGGTGGCAACACACTCTCATTGCCAAGTTCATCCTTTGCATAAGTTGAAGAAATGTTAACGTGTGCGGCAAAAGCATCGAAAGGTTGCTTGCCTTAAATTAGGATAAATTATTCATAATACAAACAAACTGATTAGATGCAAATCATAGTTTAACTTTAAGTATTTTGATCCCAAGAGGCAAGTAAGTATAATGAAGGCACCAAAATGATGCTCACCTTAATTCTAAAGTTAATAGATGATTTGCATCGCCTCAAAATGTTAATACATATTACATTGCAATTAATTAATTAAATATTCCTTCATCTCATGCCTGTAACTTGAGTTCATTTTCTTTCATCTCATGCCTTTAACTTATTATACATAATTAGAGTAAAATAAATGTTCATATTCACACTGTATAGATTATACCGCATAGGTTTAGCTGCGCGATAGGTTCAATAGGAACATTAATTCCCTGTGTTCAACCTTGGACTTACCAGAAAACCTTTCTTCACTTATATTTGGGCAATAGAAAGAAAAACTTTTATTGCACGCATATCATGAGAAGAAAAACAACGCATAGATAGAATTTTCTTCTTTTATCGCATCATCCTGGTATAGTCGCCTAACAGATCGCTCCATTAGAGCTTAAGATATGACATATCTATGAACAATTGTACGCACATAAAACAAACCAAACATCGCACGCCTCATGGTCGCATGACATCGTACTGCGTTGTCACGCTGCTTGAAATATTCCATGCATAAGTTTCTATCATAGATCTATCATGTTGAGCTCTAAGTGAGCGACTATTAGGTGGCTGAATAAGGTTATGCGTTAAAAGGTACATTCATATTCATGTGTTGCTAACTTTTTACAAATATGCATGTCGTGCAAGTTTTTCTCTCTATCGCCCAAGTGTAAGTGAAGAGAGATTTTTGGGTACGTCCAAGATCGAACACAAGAAATTTAAGCTCCTAATTCTCTTACCGCGCAACTAAATCATGCAGTATAAACTATACAGTAAAGTTATGCACAATATACTTTTGTCCTAACTATCTACACTAAGGTGTTCGGTGGTGCAAGGTGAGCATGGCGAGATGATGGCAATGGAAATTGAACTCATGTATAGGCCTGGTGAGGAAGGAAAAGTTTAATTATCTAATTGTAAGGAATTGTGTTTTAATAATTCAAGGCGATATAAAGCATTTATTAACTTCGAATTAGGTTGAATGACCTCTTGGCACCATTGGCATACTTGCTCACCTCTTGGGATCTAAATACACAAAGTTAAGCTATGATTTATATCTAATCATCTTGGATACATTATTAGTACTTTCTCCTTGATTAAAGCATAAAACCTCTCGATTCTTTCGCCACACACATTGGCATCTCTGCAATGTATGCAAAGGATAAACTTGGTGACAAGATTGTATTACTACAATCTTATCGCCACTTGCTTAGCTAAATGCGTGTTTCCACTTGTCAGGTGATTACATACAACAACATTGTTCTTCTCTTGAATAAACAATGTTTTAACCTCGATGTTTAACTCAACAGAACTCATAACAATGACACTATGATTCTTCAGGTATTAAAAACACATTTATTTGTTGA
CAAGTTAACCAAATAGATTGAACATGAAAAGTAATAGACAAATGAGAGAATAAAAGAATGAATGCATTGCTTTCATAAAAATAACTTTAGGCCTCTCGGCCATAATGTACATCTTACAAAACAATACAAGAAAAATGTAAAATGGAGTACAAGAAAAGGTGTAACACCTCAAAATATAATTTATCATACTCTTTGGCTTTGGTTGCCTATTTATTAGGGAAAAGATTAAGGGGATAAACTCTCTCTCCCTCTTTTGTTCTAAGCTCTAACCTAAAACTCAAGGAAAATGAGAAGATGGACAATGTAGGAGCTTGCTCTCTAGCCGAAATGTGCACACACTGTTCTGAAGTGAGAGGCTCTATTTATCCATAAGGGGTGATGTTCATAGGGCATCCCACACATCATTAATTCTCTAAAAAGTTGTACCAGAGTGCCTTGGAGATGTTCTTTTGTCATTTTCTGGTTTTCGCGCCAACCAAATTTGTATGCTAGTGCTTCATATCGCCTAGCTTGTTTGCTTGGCGGCGCATTGGCTGGATGTGTATCCTCGCGTAAATAGTATATCGCTGAGCTTCACTTCTTTTACCTAAAATCATGTTGTAAGAACACTAAAACGCAACAAAGCAGGGTTAAGGTTCTTTTGTAAAATGTTATTCATAGATTAATCAAATCACCAATTTTTTTCCTTGTTTTTTTCTATTCTCATGCGATTACACTTCATTATTTTACATAAATGCTACAATAACTTGTATCTCTACAAGTTATCACACCTCTAAATTTAAAATAATGCTTGTCCTCAAGCGTCATCTTATGATTTTATACATGATTATTTCTTTTGTGCACGTCGCTAAACCTTTCATAAGCTTTTCTAATTTATCTCGCATTCTAGCGTCTCTTCTTTTCTTCGGCTACTCTAAACTTCAAAATATTTCTAGATTCTAAACGTGGCAAGCTTAAGTATCAAAATACCCTTGTTCACTTTCAAAAAAAATTATTTCCCCTTTTTGCGAAATGGCTGTTGTTTCAAAAGTTCTTAGTTCCTTTATTTTTCTTGTGAAAACTTTATTAGTTAAACTATTTATTTGAATTAGGCGTTGAGTGAGAAATCATAGGTACTCTATGAGTTTGTTCCTCACCACAACTATAATTATTATCGACACTTATTTGATGAGGCTTTGCTTCTTATTAACTTTTGGTGAGAGGTGGCATCATAGGAGATCTTATTTCAATCAAATTCAGAATAAAAGAATAAGTTTGTTTTGTTTTGTTTTTTGGAAAAGCTTGAAAATGAAGGGTTCTTTGAGTTTTTTAAAACTTTAGCTCAGACCTCGCACACCCCCAAATTTAGAGTCTCGCAAGGTCCTTATTGCGAAAAAGTGAATGATAAGATGCACTTTGAAAATTTTACAAAAGAGGTTTGAGGTAAAGCTAGCTCGCCTAAAGATTACCAGAAATATTTACTAAGTTCATTTCGCTCAGGAAAGCATCATCGCCAGGTACAAGTTAAATTAAAGTTAGCATAAATTTCATCATGAAAGTCATCCTTGCGAGGAACAAAAGAAAGCATGCAAATAATTCATAAATGAAGTAAACATCAAGAAAAGTAAATCGAATAAAAAGTTAAGAGACATGGACTTAAGATGACTTTGATTAAGCAACTAAATATTTATAATACAAAAGAACGAAAATGACAAAGAAATTTACAAAAACATTCAAAAGATAACACCCTCAAATTTAGTTCACAAGGGGAGGGTCAAGTTCACCGAGAGTGGGATCTGTTGGCTGAATGGAGGGAATGTCGCGATCAAATCTGGCGTGATAACATATTGAGGCATACGTCGTACAAACAATGCATGTGTATATTCCATTTGAGCCACAAATTGTCTGTCATGTCGCTCCAAAATTTGGCGCAACTTATTGGAATCAAAGAGCATTGGATTGGTCAAGTTGTTGAGGGCAAAGGATATTTACGCAACCATTGAGTTTAATTGGAAGACTGT
GGCTTCTAAACCATCCACTTTGGCATTCAACCTCATGATAGTGTTACTGAGGCTGGCAATTTGCTTCTTCATCGCCATAGGGAGAAGGTTCTTTTGTACAATTTTATTCACATATTAATCAAATTGCCTACTTTCTCCCTTATTTTCTTCTATTCTCATGCGATTAGACTTTATTCTGTTACATCAAAGACTACAATAATTTGTATTTCTACAAGTTATCACACGCCCCGTTCGGCTAGCCACTTCGTGCGATCAGAAGGAGGCGTGTCGCCTAGACACCAAACACATACCTCTCTGTGAGATGGTGAGTTGAACGTCTAGTAAGTAACTTCTTTTACCGTCTCTATCGTTGGCCTCCAACTTCGAACTCAGGAGATCTTTACAACTGAACACAATGAAAAATACTAGTACAACCAACTTAGCTTAATCGCCTAGCCTTCAACACTTGAGCTTAAACTACTAAGCGATAAGCATAACATGGATATACAAACAACACAGCGGAAAGATAGAAAAACATAACTTTAATTTTATTGCTTGATTCAACACTTTACAAGTAAATATTTTTCCTCGCAAGCCACTTAAATAAAACCTAAGTTCTAATCTTTGGGTGTTCCTTCCTCCGCTAAAATGTTACTCTTCACGTCAAGGTGTTTAGGGGTATAAGAACCCCTTTCTTATCCTTCTCCTTGATCACCTAGCAACCTATCTTAAAGATAAGACTCATCCCACACACAAAACTAGGATGTTGCCCTCTTCGGACGTAATAACTTTAAACCATCAGGAGATGAATAATTTGATCAGCTTTACCTTTTCTTACTTGGCCTCAACTCCTGATTACCTGAAGAAGAGAAACCTAAACATGTAAGTCAAATACTTAGAGAGTGTAGATGTTAGAAAACCTTTTTGGGAAAACAAGTCAAGCAAAAAACCATTTCTTTTCTCAATTGCATCTTTCAACGCCTCATTTCATAAGTCTCATATTAGCTTTTCTTATTCCTTTCTACTAAGAACATGAATCTTTCCCCCCGCCAAGCTTACTACGATTTACTCTCGCCAGTATCTTGCGATTGAGCTACTGTGATGTTCTCTCCATCCATAGCATTTGGGAAGTTTCCTTATTCATTCTTTTCTTGTTGCATCAGCCATCCACGACGTCATTCACATTATACACACACCACATAAGCATTGCTCAATTGATAGAAATCAGACTAATGGTTTGCACGCATTCATCACAGTCAGCACATAGAAAGCAGTAGTTTTCTCTTTTTAAACCATAACAACAACAATAATAACACAGAGAGATACAGAACGCATTATGAATAACTCAGACTCAGAAATAATCTATTAAGAAAACGTTTACAAAACTTTAGCTTACAAATCACCCATGCAGAAGTGTTTAAGAAAGTCACTCACGTAGGAGTGTAGAGTTTAAGAAAGTCACTCACACAAGCATTTGGTTCTTTTCTCCTCTAAAACCCTCTCCTCGCCTAGAACTTTCTCAACAGAAGCTCAACTACTTACACTGTCTCAACGAAACCTCCAAATAGCAACACTTTTCATAGGTCTTCACTTCTATTTATACTAATCCTTTAGACATGTTACTCTCTTTGTCTTCCAGATTTAAATTATAACTTTACACGTGTCACCTTACTGTCTTGTTGTACTATGCAGCCAAACCACACTCAACATGTATTCCCTCGAGATGCAAACCTTATTATCTCGGGAAACCAAACTCTTCCTCACCTTATCTTCTTGAGAAGCTAAATTCATTTTTCTCGCCTTCTTATTCCTTGCGATAGACCATAACTGCTTACTTCTTTTCTAACTTCTTTTCACGTTACTTCACCCAATTGCCTAACTTTTCTTGCAATAGAAATCCTTCATCGTGCTAACTTTTCCTCCATCACCTTTTGTTAACGTGACTTCCTTTCCTTTGAATTTTATCTTTGGAATGATCAATCTTTTCATTTTAATAGGATCGTCCAACTCCTCGACGACTT
TCTTCTTCAACAAGACTTCATTTTTCGGATATGGGTCTCACACGTGTCCCACTCCGCGTGCGCATCTAAGCACCCATGCACACACATAGCCTAAGTTATGCCCTACGCCATTTGACTATACATTGGCCATGTTGCGTACATGCTTCTAAGACCAAATAGCCTCTTGCCTTCAAAACACGTGATGTCCATACTTGTTCATGCAACATTCATTCATGATTAATTCATGTTAGGCTATAAGTTAGCAACTTAATAGGTGATTAGAACAGGATGATGCGATACGAGAGGTATTTCCTATGTATTCATTGTTGATTTTCCAATGATATGCATGTAGTACAAGTTTTCCTCTCTATTGTCCAAATGTAAGCGAAGCAAGGTTTCCTGGTATGTCCAAGGTCGAACGCAGAGAATTCATGTTCTTAATTGAACTTATCGCGTAGCTAATCCTTATGCGATATACTGTATAGTGTAATTATGCACAATATATTTATCCTAACTATATACACTAAGGTGTAATTGACAGTGCAAGATGATCATGGCGATGAAGTATGTGAACTCATGTAAAAGTCATGGTGTGAAAGAATATTTAACTAATTAATCGCAATGTATTATGTGTTAATATTTTGAGGAGATGCAAGCATCTATTAATTTCAAAATTAAGGCAAGCGTCCTTTCGGCGCCTTTGCCATACTTGCTCGTCTCTCATGATCCAAATACTTTAAGTTAAGCTATTATTTGTATCTAATTAGTTTGTGTGTGTTACAAGCAACTTGTCCTAATTTAAGGCGAGCATCCTTTCGATGCTTTCGCTACACACGTTAACATTTATGTAGTCTCTTCTGTGATGAGATTGTATAGCTACAATCTCTTCATCATCCGCTTAATTAAATACAAATGTTCACTTGTCAGGTGATTACTATCAACACAATTTACTTTTCTCTTGAAAAAAACAAAGCTTTGATGTTTGCGTTTAGCTAAACAAAACATACATCCAAAAGAAATGCATTATTATTCTAAACAGAACTTTAGGAACTAATTAAATACATTGAAAATGGATAGCAAAAGAGAAATGGGAGAGTGGTAGAAGAAATTCATTAATATTAAAAATAGAACTTTAGGCCTCTCGGCCATGTAGTAAATATTACAACTTAATACATGAAAAATATAAAATGGAATACATAAAATTAATCAAGCCTTCATAATATGATTTCTCATATTCCTTAACTTTGAATTTTGCCTGTGACGAGGAGAATGGTAGGGAAAAACTCTCTTTTTCTCCTTTTAAGCTCTCACTTAAAATGTCACACATCAAGGATGGAGGAAAAAGTAAGCTTGCTCTTTGAGATTCAATTTCACACACTTTGTTCTGAAGTAGATGCCTCCATTTATTGGTCGAGAGTGATGTTGACGGGACATTCCACTCCCCCTACTGAGTTGGTTTTGTCCTTTTTTTCTGACATTTTCCCCTACTTCAGATCACCTAGATTATTGTGATCGTTGCCTTGTTGGTTGCATGGGTCTCCTCTCGACGTTCAATCACCAAGCTTTACTTCTTTTTTCTAAAATCATGCAAAAAGACAAGTAAATCGTGATAAAAATAGGGCTAAAATTCTTTTGCAGTCTGCGATACACAAGTTAATCAATTATTCTACTTTTCCAGCATTTTCTTTCATTCTTATGCGATAAGGCTTCATTATTTTACATAAAAGGCTACAATAACTTGTATTTCTACAAGTTATTAGCCCTAAGCGTCACAACTTGATGCCTCGACCACCTTCAACCTTTACCTAGAATCTTGTTTGAATCCAGAAACTTCAATCAATATCCAAAGCTTATAACTTTTAATCTAGCCAAAAAATGGAAAAGTTACTTCTTCAAACTTTCTTAGAATTGTGTTAAATAACTTACCTTAAAATCCAGCTTCAAATTCGAAGTGATATGTTATGTAGTCTTCAAAGAGTCTTCACTACTTAGAATCCTTTGAAAACTGACCTTTCCCTCTT
CCTTACACTTGAACAACATCTTCATCGTGTCCCTTTTGTTTGGGAGAGCCTCACCTTGCCTACTAGACTCCTTTAGCAAACTTCTACTATTAGTTTGAGAGACAACACATGAAGGAATTGAGAGCATCTTCTATTTGAAAAATTATTATTTATAGAGTTCTATGTGTAATCTAGAATTCAGATAAGTATATTTCTTATTTATTGGGAGTCAAAGGAATGTTACATATTTATATAGAGAATAAACTAAACCTTAGAGACTATGTACAATTACAATAAAGGACATATGATATAATTATAAATATATATATATATATATATCATAACACTATGCATGGTCAACTTTGCCACCTTCTACTTGCCAAAATTTGAAGTCTTGCATATTGTAGACCCTTGTTCTCCACTCAACCGACTGCCTTGAATGTCTAAAGGTCTTTCAAAATGCCTGACAACTTAAATCTTCTCCCAAAGCCTCGTAGCAAGGCCATTTGCGTAGCTAACCACATAAAGCCTTTTGCTCTTGGCTGCATGGATCAACATGCATGCACTTCCCTTGGCTTGCCATAACGCCTAGCGTTGACACATGGAGGTTCTCTTACATAACTTATCTTAGCTTGTCTCAACGCCTTGCAATATGGTCACCCTAAACCTTCTCGCTTAGGTGTGCCTCGACCACCTAGGACATACCTAAAGAGGTTCTCATGTGCAACATAAACAAATCCTTGTCTTGACTCCTAGATATGGTCTTAGAGGTTCTCAACACCTCTTGGCCACCTAGTTGTGTCCAAAAACATCTGTTTTTCAACACTTGGTCAAAATTCAATTAGTCTTCCTTTCTTTGTCTTAGCCAATTAACACTCGGGGAGGGGATGGAGAGATTTGAGTTTATAAGATTAGGAAAAAAAAACAAAATTAATTTATTTGATAAATATTTGAGTAGGGTTTTGTGTTGACATAAAAATTTATAGAATAAATTATAAGATTTCTTGGTTATTTATCCATTTAAGTAATATGATAGTGAATCTAGAGCCTACATCCAAATATAAAAGTCACACCATCATTCCATTAAATAACAAAATCCTAACAACAATTAAAAGTAGGTTACTTTTAAAAGTTTTGGAACTAAAACAAACATATTTTACAATATAGAGAACCAAAATAGAAGAAACAAAAACAAAATCCGTAGAGCAGCAGGTAGTCCGTCACATTATTTGGAAACCCCGTTATATGCAAGGTCCATAGAGCACCACCGATATATAAAATGTTAGGAATTTTGAAGTGTGGGTTTGAATGATTGTGGTAATGTAATGTAATATTAAAGAAAATAAATTGTAAATTTGTTGTTGAAAGGCACATTCAAGTTAAGCCCCCAAAATATCCAAAAACTGAAGCAACATGTACTGGAACACCGGAACCCGGCCCAACCGCTGCTACACATCTCCACCTACACGGTGGCGATGGGGTACACGTGGGTGTGCGCCTCCGCCGTGGCTGACGAGGATATTACCATCGCAGTGACGGTGGACGCCCGAGGGAGATTGGACCCACCTCTACCGGCAACATACTTTGGAAACTACGTGGTCGGGCGGTCAACCGCCTTGAAGAGGGGGAAACTGTTTGGGGAAAACGGAGTAATCGCCGCGGTGGAGACGATATCAGAGATGATTAAAAGCTTGAAAGAAGAGGGACCTCTGAAGGGTGCAGAAAACTGGGTTTTGTTGATGACGCAAACTGTTGTAAATAGCGATTACAAGCTGATTTCCACGACTGGGTCGCCGAGATTTGAGGTGTATAGCGTGGATTTCGGTTGGGGGAAACCGGAGAAGGTGGAAGTTGTGTCGATTAACCGAACCGGAGCGGTTTGTATCTCGGAAAGCCGAGACGGCGGCGGAGTGGAACTTGGGTGGACGGCGAAGAGGGATGTTATGGAGAATTTCGCTAAGCTTTTTGCTGGAAGGTCTTCAACAACTTTGAGTTAACGTTGCTTTCTTTAATACCGGAAA
ATAAGCCATAACTTTATCTGTTTTACTTTTTTTTTTTTAATGAAACTTTATGGTTTACTTTTTATCACGTATTTCAATTTTTTAAAAAAGAAAAAATTAAGCAATTATCTTTACTATTTTAATTAACGGTGGCTTAGTTAGAGAATATTTCAT

mRNA sequence

ATGGCGACAAATCATGGGTCCACCGCCATTAAGGTCCTCGAGGTCTGCACGGTAGCTCCTCCGCCCGGATCCACCGTGCCGGTCACTCTTCCCCTCACCTTCTTCGACATCCTCTGGTTTCGCTTCCCTCCCGTCGAACGCCTCTTCTTCTACAAATCTCCGGTGCCATTCTACGTCATCGTTTCAAACCTCAAAAAATCTCTCTCTCTCGTCCTCCAACACTACCTACCCCTCGCCGGAGCAATTGTATGGCCTGAAAATTCCCCCAAACCGGCCGTCGAAACCGCCGTCCGCGACGGCATTGTGCTTACAGTCGCGGAGTCTGAGGACGACTTCGACCATCTCATCGGCGACGGGTTGCGTAAAGAGGCAAAACTTCGGCCGTTGGTGGCGGAGCTTGCAGCAGAGGAGGATCGGGCGGCAGTGGTGGCTGTGCAGGTAACCTGGTTTGGGAATGGTGGATTTAGCATCGGAATAACTTCACATCACGCAGTTCTGGACGGAAGGTCGTCGACTTCTTTCATGAAATCGTGGGCCGGATTGTGTAAGAATTTGGTTGGGGGCGGTGAGATTTTTTGCCCGGCAGCTGAGACGATGCCGTTTTATGATCGAAGCGTGGTGACAGATAAGATGGGACTTGAAGCCATTTATTTGAAATGCTTGTTGGCCCATGAAGGGCCCAACAATAGAAGCTTGAAATTTTGGGACTTCAAAACTCCGCCAGATTCATTCCGAGGCACATTCAAGTTAAGCCCCCAAAATATCCAAAAACTGAAGCAACATGTACTGGAACACCGGAACCCGGCCCAACCGCTGCTACACATCTCCACCTACACGGTGGCGATGGGGTACACGTGGGTGTGCGCCTCCGCCGTGGCTGACGAGGATATTACCATCGCAGTGACGGTGGACGCCCGAGGGAGATTGGACCCACCTCTACCGGCAACATACTTTGGAAACTACGTGGTCGGGCGGTCAACCGCCTTGAAGAGGGGGAAACTGTTTGGGGAAAACGGAGTAATCGCCGCGGTGGAGACGATATCAGAGATGATTAAAAGCTTGAAAGAAGAGGGACCTCTGAAGGGTGCAGAAAACTGGGTTTTGTTGATGACGCAAACTGTTGTAAATAGCGATTACAAGCTGATTTCCACGACTGGGTCGCCGAGATTTGAGGTGTATAGCGTGGATTTCGGTTGGGGGAAACCGGAGAAGGTGGAAGTTGTGTCGATTAACCGAACCGGAGCGGTTTGTATCTCGGAAAGCCGAGACGGCGGCGGAGTGGAACTTGGGTGGACGGCGAAGAGGGATGTTATGGAGAATTTCGCTAAGCTTTTTGCTGGAAGGTCTTCAACAACTTTGAGTTAA

Coding sequence (CDS)

ATGGCGACAAATCATGGGTCCACCGCCATTAAGGTCCTCGAGGTCTGCACGGTAGCTCCTCCGCCCGGATCCACCGTGCCGGTCACTCTTCCCCTCACCTTCTTCGACATCCTCTGGTTTCGCTTCCCTCCCGTCGAACGCCTCTTCTTCTACAAATCTCCGGTGCCATTCTACGTCATCGTTTCAAACCTCAAAAAATCTCTCTCTCTCGTCCTCCAACACTACCTACCCCTCGCCGGAGCAATTGTATGGCCTGAAAATTCCCCCAAACCGGCCGTCGAAACCGCCGTCCGCGACGGCATTGTGCTTACAGTCGCGGAGTCTGAGGACGACTTCGACCATCTCATCGGCGACGGGTTGCGTAAAGAGGCAAAACTTCGGCCGTTGGTGGCGGAGCTTGCAGCAGAGGAGGATCGGGCGGCAGTGGTGGCTGTGCAGGTAACCTGGTTTGGGAATGGTGGATTTAGCATCGGAATAACTTCACATCACGCAGTTCTGGACGGAAGGTCGTCGACTTCTTTCATGAAATCGTGGGCCGGATTGTGTAAGAATTTGGTTGGGGGCGGTGAGATTTTTTGCCCGGCAGCTGAGACGATGCCGTTTTATGATCGAAGCGTGGTGACAGATAAGATGGGACTTGAAGCCATTTATTTGAAATGCTTGTTGGCCCATGAAGGGCCCAACAATAGAAGCTTGAAATTTTGGGACTTCAAAACTCCGCCAGATTCATTCCGAGGCACATTCAAGTTAAGCCCCCAAAATATCCAAAAACTGAAGCAACATGTACTGGAACACCGGAACCCGGCCCAACCGCTGCTACACATCTCCACCTACACGGTGGCGATGGGGTACACGTGGGTGTGCGCCTCCGCCGTGGCTGACGAGGATATTACCATCGCAGTGACGGTGGACGCCCGAGGGAGATTGGACCCACCTCTACCGGCAACATACTTTGGAAACTACGTGGTCGGGCGGTCAACCGCCTTGAAGAGGGGGAAACTGTTTGGGGAAAACGGAGTAATCGCCGCGGTGGAGACGATATCAGAGATGATTAAAAGCTTGAAAGAAGAGGGACCTCTGAAGGGTGCAGAAAACTGGGTTTTGTTGATGACGCAAACTGTTGTAAATAGCGATTACAAGCTGATTTCCACGACTGGGTCGCCGAGATTTGAGGTGTATAGCGTGGATTTCGGTTGGGGGAAACCGGAGAAGGTGGAAGTTGTGTCGATTAACCGAACCGGAGCGGTTTGTATCTCGGAAAGCCGAGACGGCGGCGGAGTGGAACTTGGGTGGACGGCGAAGAGGGATGTTATGGAGAATTTCGCTAAGCTTTTTGCTGGAAGGTCTTCAACAACTTTGAGTTAA

Protein sequence

MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVIVSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGLRKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAGLCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFKTPPDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVADEDITIAVTVDARGRLDPPLPATYFGNYVVGRSTALKRGKLFGENGVIAAVETISEMIKSLKEEGPLKGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINRTGAVCISESRDGGGVELGWTAKRDVMENFAKLFAGRSSTTLS
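
The protein sequence above can be checked against the CDS by translating it codon by codon. The following is a minimal sketch assuming the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper written for illustration, and only short fragments copied from the start and end of the CDS are used here.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCGACAAATCATGGGTCCACCGCCATT"))  # first 30 nt of the CDS -> MATNHGSTAI
print(translate("TCTTCAACAACTTTGAGTTAA"))           # last 21 nt of the CDS  -> SSTTLS
```

Both fragments agree with the first ten and last six residues of the protein sequence listed above.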
BLAST of CsaV3_3G020720 vs. NCBI nr
Match: XP_004145869.2 (PREDICTED: phenolic glucoside malonyltransferase 1-like [Cucumis sativus])

HSP 1 Score: 920.2 bits (2377), Expect = 2.7e-264
Identity = 454/454 (100.00%), Positives = 454/454 (100.00%), Query Frame = 0

Query: 1   MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVI 60
           MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVI
Sbjct: 1   MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVI 60

Query: 61  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL 120
           VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL
Sbjct: 61  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL 120

Query: 121 RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG 180
           RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG
Sbjct: 121 RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG 180

Query: 181 LCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFKTP 240
           LCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFKTP
Sbjct: 181 LCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFKTP 240

Query: 241 PDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVADEDITIA 300
           PDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVADEDITIA
Sbjct: 241 PDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVADEDITIA 300

Query: 301 VTVDARGRLDPPLPATYFGNYVVGRSTALKRGKLFGENGVIAAVETISEMIKSLKEEGPL 360
           VTVDARGRLDPPLPATYFGNYVVGRSTALKRGKLFGENGVIAAVETISEMIKSLKEEGPL
Sbjct: 301 VTVDARGRLDPPLPATYFGNYVVGRSTALKRGKLFGENGVIAAVETISEMIKSLKEEGPL 360

Query: 361 KGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINRTGAVCISE 420
           KGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINRTGAVCISE
Sbjct: 361 KGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINRTGAVCISE 420

Query: 421 SRDGGGVELGWTAKRDVMENFAKLFAGRSSTTLS 454
           SRDGGGVELGWTAKRDVMENFAKLFAGRSSTTLS
Sbjct: 421 SRDGGGVELGWTAKRDVMENFAKLFAGRSSTTLS 454
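
The percentages in each HSP summary line are the matched count divided by the alignment length. The sketch below recomputes them from the "m/n" pairs in such a line; `recompute_percentages` is a hypothetical helper, and the regular expression is an assumption about the line layout shown on this page rather than a general BLAST parser.

```python
import re

def recompute_percentages(stats_line: str) -> dict:
    """Return {label: 'xx.xx'} recomputed from every 'label = m/n' pair in the line."""
    result = {}
    for label, matched, total in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        result[label] = f"{100 * int(matched) / int(total):.2f}"
    return result

line = "Identity = 406/446 (91.03%), Positives = 428/446 (95.96%), Query Frame = 0"
print(recompute_percentages(line))
# {'Identity': '91.03', 'Positives': '95.96'}
```

Fields without a fraction, such as the query frame, are ignored.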

BLAST of CsaV3_3G020720 vs. NCBI nr
Match: XP_008465393.1 (PREDICTED: phenolic glucoside malonyltransferase 1-like isoform X1 [Cucumis melo])

HSP 1 Score: 831.6 bits (2147), Expect = 1.3e-237
Identity = 406/446 (91.03%), Positives = 428/446 (95.96%), Query Frame = 0

Query: 1   MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVI 60
           MAT+HGSTAIKVLE+CTVAPPPGSTVP TLPLTFFDILWFRFPPVERLFFYKSPVPF+VI
Sbjct: 37  MATDHGSTAIKVLEICTVAPPPGSTVPATLPLTFFDILWFRFPPVERLFFYKSPVPFHVI 96

Query: 61  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL 120
           VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETA  DGIVLTVAES+DDFDHL+GDGL
Sbjct: 97  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAAGDGIVLTVAESDDDFDHLVGDGL 156

Query: 121 RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG 180
           R+EAKL+PLVAELAAEE+RAAVVAVQVTWFGNG FSIGITSHHA+LDGRSSTSFMKSWAG
Sbjct: 157 REEAKLQPLVAELAAEEERAAVVAVQVTWFGNGRFSIGITSHHAILDGRSSTSFMKSWAG 216

Query: 181 LCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFKTP 240
           LCKNLVGGG+IF PAAETMPFYDRSVVTD +GLEAIYL+C LAHEGPNNRSLKFWD KTP
Sbjct: 217 LCKNLVGGGDIFFPAAETMPFYDRSVVTDNVGLEAIYLECWLAHEGPNNRSLKFWDVKTP 276

Query: 241 PDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVADEDITIA 300
           PD FRGTFKLSPQ+IQKLKQHVL+HRNP QP  HISTYTVAMGYTWVCASAVADE+I+I 
Sbjct: 277 PDLFRGTFKLSPQDIQKLKQHVLKHRNPVQPPPHISTYTVAMGYTWVCASAVADEEISIG 336

Query: 301 VTVDARGRLDPPLPATYFGNYVVGRSTALKRGKLFGENGVIAAVETISEMIKSLKEEGPL 360
           VT+DARGR+ PPLPATYFGN VVGRSTAL+RGKL GENGVIAAVETISEMIKSLKEEGPL
Sbjct: 337 VTMDARGRVYPPLPATYFGNCVVGRSTALERGKLLGENGVIAAVETISEMIKSLKEEGPL 396

Query: 361 KGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINRTGAVCISE 420
           KGAENWVLLMTQTVVN+DYKLIST GSPRFEVYSVDFGWGKPEKVEVVSINRTGAVCISE
Sbjct: 397 KGAENWVLLMTQTVVNNDYKLISTAGSPRFEVYSVDFGWGKPEKVEVVSINRTGAVCISE 456

Query: 421 SRDGGGVELGWTAKRDVMENFAKLFA 447
           SR+GGGVE GWTA+RDVMENFAKLFA
Sbjct: 457 SRNGGGVEHGWTARRDVMENFAKLFA 482
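
The percentages in each HSP summary line are the matched count divided by the alignment length (for the hit above, 406/446 for identity and 428/446 for positives). The following is a minimal Python sketch for re-deriving them, assuming the summary-line layout used in this report. The names `HSP_STATS` and `parse_hsp_stats` are hypothetical helpers introduced here for illustration and are not part of any BLAST tool.

```python
import re

# Pattern for the HSP summary line as it appears in this report.
# "Posi?tives" also accepts the "Postives" spelling that some report
# generators emit (an assumption about input variability).
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one summary line and recompute both percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    ident, ident_len, pos, pos_len = map(int, (ident, ident_len, pos, pos_len))
    return {
        "identity": (ident, ident_len),
        "positives": (pos, pos_len),
        # Recomputed values, rounded to two decimals like the report.
        "identity_pct": round(100.0 * ident / ident_len, 2),
        "positives_pct": round(100.0 * pos / pos_len, 2),
        # Values as printed in the report, kept for comparison.
        "reported": (float(ident_pct), float(pos_pct)),
    }


stats = parse_hsp_stats(
    "Identity = 406/446 (91.03%), Positives = 428/446 (95.96%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed values against the `reported` pair is a quick sanity check when summary lines are copied out of a report by hand.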

BLAST of CsaV3_3G020720 vs. NCBI nr
Match: XP_022948949.1 (phenolic glucoside malonyltransferase 2-like [Cucurbita moschata])

HSP 1 Score: 565.8 bits (1457), Expect = 1.3e-157
Identity = 291/454 (64.10%), Positives = 345/454 (75.99%), Query Frame = 0

Query: 1   MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVI 60
           MA  H S  + VL +CTV PP GS VP +LPLTFFDILW RFPPV+R+FFYKS  PF V+
Sbjct: 12  MAEIHPS--VNVLHLCTVPPPHGSLVPFSLPLTFFDILWLRFPPVQRIFFYKSSAPFDVV 71

Query: 61  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL 120
           VS LK SLS  LQHY PLAGA+VWPENSPKPAV+T + DGI+LT+A+S+  F HL+ DGL
Sbjct: 72  VSTLKNSLSAALQHYPPLAGAVVWPENSPKPAVQTVLGDGILLTLAKSDSKFSHLVSDGL 131

Query: 121 RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG 180
           R+ A+   LV  L A +DRAAV+A+QVT FG  GF IGITSHHA+LDGR+STSF+K WA 
Sbjct: 132 REAAEFHTLVPRLPAADDRAAVMALQVTSFGTDGFCIGITSHHAILDGRTSTSFVKLWAR 191

Query: 181 LCKNLVGGGEIFCP---AAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDF 240
           LCKNLV GGE   P   AAETMPFYDRSV+ D  GLE I+L+  LAH G +N+SLKFW  
Sbjct: 192 LCKNLVAGGESAEPVSTAAETMPFYDRSVIVDPRGLEGIFLRDWLAHGGSDNKSLKFWSP 251

Query: 241 KTPPDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVA---- 300
             P   FRGTFKL+PQNIQKLKQ VL  RNP  P +HIST+TVAM YTWVC +AVA    
Sbjct: 252 SIPQGLFRGTFKLNPQNIQKLKQLVLNRRNPVHPPVHISTFTVAMAYTWVC-TAVADGSP 311

Query: 301 -DEDITIAVTVDARGRLDPPLPATYFGNYVVGRSTALKRGKLFGENGVIAAVETISEMIK 360
            D +I+  ++VDAR  LDPP+PA YFGN +VGR+T  +R KL GENG++ AVE IS+ IK
Sbjct: 312 NDGEISFGLSVDARRWLDPPVPANYFGNCLVGRTTDQERAKLVGENGLVTAVEGISKAIK 371

Query: 361 SLKEEGPLKGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINR 420
           SL+E+G L GAE WV L+TQ V  S+ K+++T GSPRFE+YSVDFGWG P KVEVVSI+ 
Sbjct: 372 SLEEKGALDGAEQWVSLLTQ-VSGSNRKMLTTAGSPRFELYSVDFGWGTPAKVEVVSIDE 431

Query: 421 TGAVCISESRDGGGVELGWTAKRDVMENFAKLFA 447
           TGAV + + RD GGVELGW AK+DVME FA  FA
Sbjct: 432 TGAVSVCDGRD-GGVELGWVAKKDVMEAFAAAFA 460

BLAST of CsaV3_3G020720 vs. NCBI nr
Match: XP_023523638.1 (phenolic glucoside malonyltransferase 1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 562.4 bits (1448), Expect = 1.4e-156
Identity = 289/454 (63.66%), Positives = 344/454 (75.77%), Query Frame = 0

Query: 1   MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVI 60
           MA  H S  + VL +CTV PP GS VP +LPLTFFDILW RFPPV+R+FFYKS  PF V+
Sbjct: 12  MAEIHPS--VDVLHLCTVTPPHGSLVPFSLPLTFFDILWLRFPPVQRIFFYKSSAPFDVV 71

Query: 61  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL 120
           VS LK SLS  LQHY PLAGA+VWPENSPKPAV+T + DGI+LT+A+S  +F HL+ DGL
Sbjct: 72  VSTLKNSLSAALQHYPPLAGAVVWPENSPKPAVQTVLGDGILLTLAKSSSNFSHLVSDGL 131

Query: 121 RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG 180
           R+ A+   LV  L A +DRAAV+A+QVT FG  GF IGITSHHA+LDGR+STSF+K WA 
Sbjct: 132 REAAEFHTLVPRLPAADDRAAVMALQVTSFGTDGFCIGITSHHAILDGRTSTSFVKLWAR 191

Query: 181 LCKNLVGGGEIFCP---AAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDF 240
           LCKNLV GGE   P   AAETMPFYDRSV+ D  GLE I+L+  LAH G +N+SLKFW  
Sbjct: 192 LCKNLVAGGESAEPVSSAAETMPFYDRSVIVDPRGLEGIFLRDWLAHGGSDNKSLKFWSP 251

Query: 241 KTPPDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVA---- 300
            TP   FRGTFK++PQNIQKLKQ VL  RNP  P +HIST+TVAM YTWVC +AVA    
Sbjct: 252 STPQGLFRGTFKINPQNIQKLKQLVLNRRNPVHPPVHISTFTVAMAYTWVC-TAVADGSP 311

Query: 301 -DEDITIAVTVDARGRLDPPLPATYFGNYVVGRSTALKRGKLFGENGVIAAVETISEMIK 360
            D +I+  ++VDAR  LDPP+ A YFGN +VGR+T  +R KL GENG++ AVE IS+ IK
Sbjct: 312 HDGEISFGLSVDARRWLDPPVAANYFGNCLVGRATDQERAKLVGENGLVTAVEGISKAIK 371

Query: 361 SLKEEGPLKGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINR 420
           SL+E G L GAE WV L+TQ V  S+ K+++T GSPRFE+YSVDFGWG P KVEVVSI+ 
Sbjct: 372 SLEENGALDGAEQWVSLLTQ-VSGSNRKMLTTAGSPRFELYSVDFGWGTPAKVEVVSIDE 431

Query: 421 TGAVCISESRDGGGVELGWTAKRDVMENFAKLFA 447
           TGAV + + RD GGVELGW A++DVME FA  FA
Sbjct: 432 TGAVSVCDGRD-GGVELGWVAQKDVMEAFAAAFA 460

BLAST of CsaV3_3G020720 vs. NCBI nr
Match: XP_022997666.1 (phenolic glucoside malonyltransferase 1-like [Cucurbita maxima])

HSP 1 Score: 544.7 bits (1402), Expect = 3.1e-151
Identity = 283/455 (62.20%), Positives = 340/455 (74.73%), Query Frame = 0

Query: 1   MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVI 60
           MA  H S  + VL +CTV  P GS +P +LPLTFFDILW RFPPV+R+FFYKS  PF V+
Sbjct: 1   MAEMHPS--VNVLHLCTVPLPHGSLLPFSLPLTFFDILWLRFPPVQRIFFYKSSAPFDVV 60

Query: 61  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL 120
           VS LK SLS  LQHY PLAGA+VWPENSPKPAV+T + DGI+LT+A+S+  F HL+ D L
Sbjct: 61  VSTLKNSLSAALQHYPPLAGAVVWPENSPKPAVQTVLGDGILLTLAKSDSKFSHLVSDEL 120

Query: 121 RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG 180
           R+ A+   LV  L A +DRAAV+A+QVT FG  GF IGITSHHA+LDGR+STSF+K WA 
Sbjct: 121 REAAEFHTLVPRLPAADDRAAVMALQVTSFGTEGFCIGITSHHAILDGRTSTSFVKLWAR 180

Query: 181 LCKNLVGGGEIFCP---AAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDF 240
           LCKNLV GGE   P   A+ETMPFYDRSV+ D  GLE I+L+  LAH G +N+SLKFW  
Sbjct: 181 LCKNLVAGGESAKPGSTASETMPFYDRSVIVDPKGLEGIFLRDWLAHGGSDNKSLKFWPP 240

Query: 241 KTPPDSFRGTFKLSPQNIQKLKQHVLEHRNPAQ-PLLHISTYTVAMGYTWVCASAVA--- 300
               D FRGTFK +PQNIQKLKQ VL   NP   P +HIST+TVAM YTWVC +AVA   
Sbjct: 241 SIQQDLFRGTFKFNPQNIQKLKQLVLNRWNPVHPPPVHISTFTVAMAYTWVC-TAVADGS 300

Query: 301 --DEDITIAVTVDARGRLDPPLPATYFGNYVVGRSTALKRGKLFGENGVIAAVETISEMI 360
             D +I+  ++VDAR  LDPP+PA YFGN +VGR+T  ++ KL GENG++ AVE IS+ I
Sbjct: 301 PHDGEISFGLSVDARRWLDPPVPANYFGNCLVGRTTDQEKAKLVGENGLVTAVEGISKAI 360

Query: 361 KSLKEEGPLKGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSIN 420
           KSL+E G L GAE WV ++TQ V  S+ K+++T GSPRFE+YSVDFGWG P KVEVVSI+
Sbjct: 361 KSLEENGALDGAEQWVSMLTQ-VAGSNRKMLTTAGSPRFELYSVDFGWGTPAKVEVVSID 420

Query: 421 RTGAVCISESRDGGGVELGWTAKRDVMENFAKLFA 447
            TGAV + + RD GGVELGW AK+DVME FA  FA
Sbjct: 421 GTGAVSVCDGRD-GGVELGWVAKKDVMEAFAAAFA 450

BLAST of CsaV3_3G020720 vs. TAIR10
Match: AT5G39050.1 (HXXXD-type acyl-transferase family protein)

HSP 1 Score: 291.6 bits (745), Expect = 8.5e-79
Identity = 182/465 (39.14%), Positives = 258/465 (55.48%), Query Frame = 0

Query: 1   MATNHGSTAIKVLEVCTVAPP-PGSTVPVTLPLTFFDILWFRFPPVERLFFYK---SPVP 60
           M      +++KV++V  V P    S+  +TLPLTFFD+LW++   VER+ FYK   +  P
Sbjct: 1   MVNEEMESSLKVIDVARVTPSNSDSSESLTLPLTFFDLLWYKLHAVERVIFYKLTDASRP 60

Query: 61  FY--VIVSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDH 120
           F+  VIV NLK SLS  L HYLPLAG +VW    PKP +     D +  TVAES  DF  
Sbjct: 61  FFDSVIVPNLKTSLSSSLSHYLPLAGKLVWEPLDPKPKIVYTPNDAVSFTVAESNADFSR 120

Query: 121 LIGDGLRKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSF 180
           L G       +L PLV EL   +D A+ V+ QVT F N GF I + +HHAVLDG+++T+F
Sbjct: 121 LTGKEPFPTTELYPLVPELHVSDDSASAVSFQVTLFPNQGFCISVNAHHAVLDGKTTTNF 180

Query: 181 MKSWAGLCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYL-------KCLLAHEGP 240
           +KSWA  CKN     + F P  + +P YDR+V+ D M L+   L       K     + P
Sbjct: 181 LKSWARTCKN----QDSFLP-QDLIPVYDRTVIKDPMDLDTKILNAWHRVAKVFTGGKEP 240

Query: 241 NN-RSLK-FWDFKTPPDSFRGTFKLSPQNIQKLKQHVLEHRNPAQ-----PLLHISTYTV 300
            N +SLK  W  +  PD FR T  L+ ++IQKL++ + +  + +        L +ST+ +
Sbjct: 241 ENPKSLKLLWSPEIGPDVFRYTLNLTREDIQKLRERLKKESSSSSVSSSPKELRLSTFVI 300

Query: 301 AMGYTWVCASAVADED----ITIAVTVDARGRLDPPLPATYFGNYVVG-RSTALKRGKLF 360
              Y   C       D    +     VD R  + PP+P++YFGN V      +L      
Sbjct: 301 VYSYALTCLIKARGGDPSRPVGYGFAVDCRSLMVPPVPSSYFGNCVSACFKMSLTAETFM 360

Query: 361 GENGVIAAVETISEMIKSLKEEGPLKGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSV 420
            E G +AA   +S+ +++L E   LK  E   +L   T ++   +++S  GS RF VY +
Sbjct: 361 SEEGFLAAARMVSDSVEALDENVALKIPE---ILEGFTTLSPGTQVLSVAGSTRFGVYGL 420

Query: 421 DFGWGKPEKVEVVSINRTGAVCISESRDG-GGVELGWTAKRDVME 440
           DFGWG+PEKV VVSI++  A+  +ESRDG GGVELG++ K+  M+
Sbjct: 421 DFGWGRPEKVVVVSIDQGEAISFAESRDGSGGVELGFSLKKHEMD 457

BLAST of CsaV3_3G020720 vs. TAIR10
Match: AT3G29590.1 (HXXXD-type acyl-transferase family protein)

HSP 1 Score: 280.0 bits (715), Expect = 2.6e-75
Identity = 179/455 (39.34%), Positives = 256/455 (56.26%), Query Frame = 0

Query: 7   STAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSP-VPFYVIVSNLK 66
           ++A+ +LEV  V+PP  S+  +TLPLT+FD+ W +  PV+R+ FY  P +    ++S LK
Sbjct: 5   NSAVNILEVVQVSPP--SSNSLTLPLTYFDLGWLKLHPVDRVLFYHVPELTRSSLISKLK 64

Query: 67  KSLSLVLQHYLPLAGAIVWPENSPKPAVETAV--RDGIVLTVAESEDDFDHLIGDGLRKE 126
            SLS  L HYLPLAG +VW     KP++  +   +D + LTVAES  D  HL GD  R  
Sbjct: 65  SSLSATLLHYLPLAGRLVWDSIKTKPSIVYSPDDKDAVYLTVAESNGDLSHLSGDEPRPA 124

Query: 127 AKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAGLCK 186
            +   LV EL   ++ A V+AVQVT+F N GFS+G+T+HHAVLDG+++  F+K+WA  CK
Sbjct: 125 TEFHSLVPELPVSDESARVLAVQVTFFPNQGFSLGVTAHHAVLDGKTTAMFLKAWAHNCK 184

Query: 187 NLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFK-TPPD 246
                 E      + +P  DR +V D  GLE   L   ++    N  SLK +  K    D
Sbjct: 185 Q-----EQEALPHDLVPSLDRIIVQDPTGLETKLLNRWISASN-NKPSLKLFPSKIIGSD 244

Query: 247 SFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVADEDITIAVT 306
             R T++L+ ++I+KL++ V    +  Q  L +ST+ +   Y   C   +   D T  V 
Sbjct: 245 ILRVTYRLTREDIKKLRERVETESHAKQ--LRLSTFVITYAYVITCMVKMRGGDPTRFVC 304

Query: 307 V----DARGRLDPPLPATYFGNYVVGRST-ALKRGKLFGE---NGVIAAVETISEMIKSL 366
           V    D R RL+PPLP T+FGN +VG     +K   +  E    G I AVET++  +  L
Sbjct: 305 VGFASDFRSRLNPPLPPTFFGNCIVGSGDFDVKAEPILEEGEGKGFITAVETLTGWVNGL 364

Query: 367 KEEGPLKGAENWVLLMTQTV--VNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINR 426
             E      E  +LL  +    +    ++IS  GS R  +Y  DFGWGKP KVE+V+I++
Sbjct: 365 CPE----NIEKNMLLPFEAFKRMEPGRQMISVAGSTRLGIYGSDFGWGKPVKVEIVTIDK 424

Query: 427 TGAVCISESRDG-GGVELGWTAKRDVMENFAKLFA 447
             +V +SES DG GGVE+G   K+D +E F  LF+
Sbjct: 425 DASVSLSESGDGSGGVEVGVCLKKDDVERFGSLFS 445

BLAST of CsaV3_3G020720 vs. TAIR10
Match: AT5G39090.1 (HXXXD-type acyl-transferase family protein)

HSP 1 Score: 277.7 bits (709), Expect = 1.3e-74
Identity = 174/444 (39.19%), Positives = 242/444 (54.50%), Query Frame = 0

Query: 9   AIKVLEVCTVAPP-PGSTVPVTLPLTFFDILWFRFPPVERLFFYK-----SPVPFYVIVS 68
           ++  + V  V P    S+  +TLPLTFFD+LW +   VER+ FYK       +   VIV 
Sbjct: 4   SLNFIHVSRVTPSNSNSSASLTLPLTFFDLLWLKHKAVERVIFYKLTDVNRSLFDSVIVP 63

Query: 69  NLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGLRK 128
           NLK SLS  L HYLPLAG I+W  + PKP +     D +  TVAES  DF  L G     
Sbjct: 64  NLKSSLSSSLSHYLPLAGHIIWEPHDPKPKIVYTQNDAVSFTVAESNSDFSLLTGKEPFS 123

Query: 129 EAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAGLC 188
             +L PLV EL   +D AAVV+ QVT F N GF IG+T+HHAV DG+++T+F+KSWA LC
Sbjct: 124 STELHPLVPELQNSDDSAAVVSFQVTLFPNQGFCIGVTTHHAVSDGKTTTTFLKSWAHLC 183

Query: 189 KNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDF-KTPP 248
           K+     +      + +PFYDR+V+     ++   LK  + H     +SLK     +   
Sbjct: 184 KH-----QDSSLPDDLIPFYDRTVIKGPPEIDTKVLK--IWHSIHKPKSLKLLPRPEIES 243

Query: 249 DSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVADED----I 308
           D  R TF+L+ +NI+KL+   L+  + +   + +ST+ +   Y + C      +D    +
Sbjct: 244 DVVRYTFELTRENIEKLRDK-LKRESSSFSSVRLSTFVITFSYVFTCLIGSGGDDPNRPV 303

Query: 309 TIAVTVDARGRL-DPPLPATYFGNYVVGR-STALKRGKLFGENGVIAAVETISEMIKSLK 368
                VD R  + DPP+P TYFGN V       L  G   GE G + A   IS+ ++ L 
Sbjct: 304 GYRFAVDCRRLIDDPPIPLTYFGNCVYSAVKIPLDAGMFLGEQGFVVAARLISDSVEELD 363

Query: 369 EEGPLKGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINRTGA 428
                K  E   LL T      D + +S  GS RF +Y +DFGWGKP K  +VSI++ G 
Sbjct: 364 SNVAWKIPE---LLETYEKAPVDSQFVSVAGSTRFGIYGLDFGWGKPFKSLLVSIDQRGK 423

Query: 429 VCISESRDG-GGVELGWTAKRDVM 439
           + I+ESRDG GGVE+G++ K+  M
Sbjct: 424 ISIAESRDGSGGVEIGFSLKKQEM 436

BLAST of CsaV3_3G020720 vs. TAIR10
Match: AT3G29670.1 (HXXXD-type acyl-transferase family protein)

HSP 1 Score: 266.9 bits (681), Expect = 2.2e-71
Identity = 164/454 (36.12%), Positives = 242/454 (53.30%), Query Frame = 0

Query: 10  IKVLEVCTVAPPPGSTVPVT----LPLTFFDILWFRFPPVERLFFYKSPVP-----FYVI 69
           + V+E   V P   S +       LPLTFFD+ W  F PV+R+FFY+           +I
Sbjct: 3   LHVIETARVTPTDYSVINSANLHKLPLTFFDLPWLLFQPVKRVFFYELTESTRDHFHSII 62

Query: 70  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL 129
           +  LK SLSL+L++YLPL G I W  N PKP++  +    +++T+AES+ DF HL G G 
Sbjct: 63  LPKLKDSLSLILRNYLPLTGHITWEPNEPKPSIIVSENGVVLVTIAESDADFSHLSGYGQ 122

Query: 130 RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG 189
           R  ++L  LV +L   +D A   ++Q+T F N GFSIG+ +HHAVLDG++S++F+K+WA 
Sbjct: 123 RPLSELHALVPKLPVSDDSATAFSIQITLFPNQGFSIGVAAHHAVLDGKTSSTFIKAWAQ 182

Query: 190 LCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLE--AIYLKCLLAHEGPNNRSL-KFWDF 249
           +CK      E+        P YDRS++     L+   I L   L  +  N RSL      
Sbjct: 183 ICKQ-----ELQSMPENLTPSYDRSLIKYPTYLDEKMIELVRSLKEDQTNIRSLTSLPSS 242

Query: 250 KTPPDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVC----ASAVA 309
           K   D    T  LS  +I++L++ V        P LH+ST+ +A  Y W C         
Sbjct: 243 KLGDDVVLATLVLSRADIERLREQV----KNVSPSLHLSTFVIAYAYAWTCFVKARGGNK 302

Query: 310 DEDITIAVTVDARGRLDPPLPATYFGNYVVGRST-ALKRGKLFGENGVIAAVETISEMIK 369
           D  +++    D R RLDP LP TYFGN ++       K  +   E G + A E IS+++K
Sbjct: 303 DRSVSLLFVGDFRDRLDPKLPGTYFGNCMIPVGCYNRKAAEFMEEKGFVTAAEIISDLVK 362

Query: 370 SLKEEGPLKGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINR 429
            L        A+ +V   +    ++ +  I+  GS R  VY  DFGWG+P KV++VSI++
Sbjct: 363 GLSSRKIETIADTFVEGFSFQSWSTQFGTIA--GSTRLGVYEADFGWGRPVKVDIVSIDQ 422

Query: 430 TGAVCISESRD-GGGVELGWTAKRDVMENFAKLF 446
             A+ ++E RD  GGVE+G   K+  M++    F
Sbjct: 423 GEAIAMAERRDESGGVEIGMCLKKTEMDSVVSFF 445

BLAST of CsaV3_3G020720 vs. TAIR10
Match: AT3G29635.1 (HXXXD-type acyl-transferase family protein)

HSP 1 Score: 265.4 bits (677), Expect = 6.5e-71
Identity = 169/435 (38.85%), Positives = 240/435 (55.17%), Query Frame = 0

Query: 28  VTLPLTFFDILWFRFPPVERLFFYK----SPVPFY--VIVSNLKKSLSLVLQHYLPLAGA 87
           + LPLTFFD+ W +F P ER+ FYK    S +  +  VI+  L+ SLS+VL+HYLPLAG 
Sbjct: 25  MVLPLTFFDLRWLQFHPTERVIFYKLIKDSSLESFLSVILPKLELSLSIVLRHYLPLAGR 84

Query: 88  IVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGLRKEAKLRPLVAELAAEEDRAA 147
           + W    PKP++  +  D + LTVAES+ DF  + G G+R E+++R LV EL+   D  +
Sbjct: 85  LTWSSQDPKPSIIVSPNDYVSLTVAESDADFSRISGKGIRPESEIRSLVPELSLSCDSPS 144

Query: 148 VVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAGLCKNLVGGGEIFCPAAETMPF 207
           V+++QVT F N GF IGI SHH+V+DG++   F+KSWA +CK+    G +  P  +  P 
Sbjct: 145 VLSLQVTLFPNQGFCIGIASHHSVMDGKTVVRFIKSWAHICKH----GAMDLP-EDLTPV 204

Query: 208 YDRSVVTDKMGLEA--IYLKCLLAHEGPNNRSLKFWDFK-TPPDSFRGTFKLSPQNIQKL 267
            DR+V+     L+A  I L    +    + RSLK    K   PD  R + +L+ +NI+KL
Sbjct: 205 LDRTVINVPASLDAKIIELLSYFSEVKDSFRSLKLLPPKEISPDLVRISLELTRENIEKL 264

Query: 268 KQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVADED----ITIAVTVDARGRLDPPLP 327
           ++        +   LH+ST+ VA  Y W C       D    +      D R RLDPP+P
Sbjct: 265 REQAKRESARSHHELHLSTFVVANAYLWTCLVKTRGGDENRPVRFMYAADFRNRLDPPVP 324

Query: 328 ATYFGNYV--VGRSTALKRGKLFGENGVIAAVETISEMIKSLKEEGPLKGAENWVLLMTQ 387
             YFGN V  +G     K     GE+G +  VE +S+ ++S+   G  K      L +  
Sbjct: 325 EMYFGNCVFPIG-CFGYKANVFLGEDGFVNMVEILSDSVRSI---GLRKLETICELYING 384

Query: 388 T-VVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINRTGAVCISESRD-GGGVELG 446
           T  V    ++ S  GS +F +Y  DFGWGKP   E+ SI+R  A  +SE RD  GGVE+G
Sbjct: 385 TKSVKPGTQIGSIAGSNQFGLYGSDFGWGKPCNSEIASIDRNEAFSMSERRDEPGGVEIG 444

BLAST of CsaV3_3G020720 vs. Swiss-Prot
Match: sp|Q940Z5|PMAT1_ARATH (Phenolic glucoside malonyltransferase 1 OS=Arabidopsis thaliana OX=3702 GN=PMAT1 PE=1 SV=1)

HSP 1 Score: 291.6 bits (745), Expect = 1.5e-77
Identity = 182/465 (39.14%), Positives = 258/465 (55.48%), Query Frame = 0

Query: 1   MATNHGSTAIKVLEVCTVAPP-PGSTVPVTLPLTFFDILWFRFPPVERLFFYK---SPVP 60
           M      +++KV++V  V P    S+  +TLPLTFFD+LW++   VER+ FYK   +  P
Sbjct: 1   MVNEEMESSLKVIDVARVTPSNSDSSESLTLPLTFFDLLWYKLHAVERVIFYKLTDASRP 60

Query: 61  FY--VIVSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDH 120
           F+  VIV NLK SLS  L HYLPLAG +VW    PKP +     D +  TVAES  DF  
Sbjct: 61  FFDSVIVPNLKTSLSSSLSHYLPLAGKLVWEPLDPKPKIVYTPNDAVSFTVAESNADFSR 120

Query: 121 LIGDGLRKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSF 180
           L G       +L PLV EL   +D A+ V+ QVT F N GF I + +HHAVLDG+++T+F
Sbjct: 121 LTGKEPFPTTELYPLVPELHVSDDSASAVSFQVTLFPNQGFCISVNAHHAVLDGKTTTNF 180

Query: 181 MKSWAGLCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYL-------KCLLAHEGP 240
           +KSWA  CKN     + F P  + +P YDR+V+ D M L+   L       K     + P
Sbjct: 181 LKSWARTCKN----QDSFLP-QDLIPVYDRTVIKDPMDLDTKILNAWHRVAKVFTGGKEP 240

Query: 241 NN-RSLK-FWDFKTPPDSFRGTFKLSPQNIQKLKQHVLEHRNPAQ-----PLLHISTYTV 300
            N +SLK  W  +  PD FR T  L+ ++IQKL++ + +  + +        L +ST+ +
Sbjct: 241 ENPKSLKLLWSPEIGPDVFRYTLNLTREDIQKLRERLKKESSSSSVSSSPKELRLSTFVI 300

Query: 301 AMGYTWVCASAVADED----ITIAVTVDARGRLDPPLPATYFGNYVVG-RSTALKRGKLF 360
              Y   C       D    +     VD R  + PP+P++YFGN V      +L      
Sbjct: 301 VYSYALTCLIKARGGDPSRPVGYGFAVDCRSLMVPPVPSSYFGNCVSACFKMSLTAETFM 360

Query: 361 GENGVIAAVETISEMIKSLKEEGPLKGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSV 420
            E G +AA   +S+ +++L E   LK  E   +L   T ++   +++S  GS RF VY +
Sbjct: 361 SEEGFLAAARMVSDSVEALDENVALKIPE---ILEGFTTLSPGTQVLSVAGSTRFGVYGL 420

Query: 421 DFGWGKPEKVEVVSINRTGAVCISESRDG-GGVELGWTAKRDVME 440
           DFGWG+PEKV VVSI++  A+  +ESRDG GGVELG++ K+  M+
Sbjct: 421 DFGWGRPEKVVVVSIDQGEAISFAESRDGSGGVELGFSLKKHEMD 457

BLAST of CsaV3_3G020720 vs. Swiss-Prot
Match: sp|Q9LJB4|5MAT_ARATH (Malonyl-CoA:anthocyanidin 5-O-glucoside-6''-O-malonyltransferase OS=Arabidopsis thaliana OX=3702 GN=5MAT PE=1 SV=1)

HSP 1 Score: 280.0 bits (715), Expect = 4.6e-74
Identity = 179/455 (39.34%), Positives = 256/455 (56.26%), Query Frame = 0

Query: 7   STAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSP-VPFYVIVSNLK 66
           ++A+ +LEV  V+PP  S+  +TLPLT+FD+ W +  PV+R+ FY  P +    ++S LK
Sbjct: 5   NSAVNILEVVQVSPP--SSNSLTLPLTYFDLGWLKLHPVDRVLFYHVPELTRSSLISKLK 64

Query: 67  KSLSLVLQHYLPLAGAIVWPENSPKPAVETAV--RDGIVLTVAESEDDFDHLIGDGLRKE 126
            SLS  L HYLPLAG +VW     KP++  +   +D + LTVAES  D  HL GD  R  
Sbjct: 65  SSLSATLLHYLPLAGRLVWDSIKTKPSIVYSPDDKDAVYLTVAESNGDLSHLSGDEPRPA 124

Query: 127 AKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAGLCK 186
            +   LV EL   ++ A V+AVQVT+F N GFS+G+T+HHAVLDG+++  F+K+WA  CK
Sbjct: 125 TEFHSLVPELPVSDESARVLAVQVTFFPNQGFSLGVTAHHAVLDGKTTAMFLKAWAHNCK 184

Query: 187 NLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFK-TPPD 246
                 E      + +P  DR +V D  GLE   L   ++    N  SLK +  K    D
Sbjct: 185 Q-----EQEALPHDLVPSLDRIIVQDPTGLETKLLNRWISASN-NKPSLKLFPSKIIGSD 244

Query: 247 SFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVADEDITIAVT 306
             R T++L+ ++I+KL++ V    +  Q  L +ST+ +   Y   C   +   D T  V 
Sbjct: 245 ILRVTYRLTREDIKKLRERVETESHAKQ--LRLSTFVITYAYVITCMVKMRGGDPTRFVC 304

Query: 307 V----DARGRLDPPLPATYFGNYVVGRST-ALKRGKLFGE---NGVIAAVETISEMIKSL 366
           V    D R RL+PPLP T+FGN +VG     +K   +  E    G I AVET++  +  L
Sbjct: 305 VGFASDFRSRLNPPLPPTFFGNCIVGSGDFDVKAEPILEEGEGKGFITAVETLTGWVNGL 364

Query: 367 KEEGPLKGAENWVLLMTQTV--VNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINR 426
             E      E  +LL  +    +    ++IS  GS R  +Y  DFGWGKP KVE+V+I++
Sbjct: 365 CPE----NIEKNMLLPFEAFKRMEPGRQMISVAGSTRLGIYGSDFGWGKPVKVEIVTIDK 424

Query: 427 TGAVCISESRDG-GGVELGWTAKRDVMENFAKLFA 447
             +V +SES DG GGVE+G   K+D +E F  LF+
Sbjct: 425 DASVSLSESGDGSGGVEVGVCLKKDDVERFGSLFS 445

BLAST of CsaV3_3G020720 vs. Swiss-Prot
Match: sp|Q9LRQ8|PMAT2_ARATH (Phenolic glucoside malonyltransferase 2 OS=Arabidopsis thaliana OX=3702 GN=PMAT2 PE=1 SV=1)

HSP 1 Score: 266.9 bits (681), Expect = 4.0e-70
Identity = 164/454 (36.12%), Positives = 242/454 (53.30%), Query Frame = 0

Query: 10  IKVLEVCTVAPPPGSTVPVT----LPLTFFDILWFRFPPVERLFFYKSPVP-----FYVI 69
           + V+E   V P   S +       LPLTFFD+ W  F PV+R+FFY+           +I
Sbjct: 3   LHVIETARVTPTDYSVINSANLHKLPLTFFDLPWLLFQPVKRVFFYELTESTRDHFHSII 62

Query: 70  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL 129
           +  LK SLSL+L++YLPL G I W  N PKP++  +    +++T+AES+ DF HL G G 
Sbjct: 63  LPKLKDSLSLILRNYLPLTGHITWEPNEPKPSIIVSENGVVLVTIAESDADFSHLSGYGQ 122

Query: 130 RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG 189
           R  ++L  LV +L   +D A   ++Q+T F N GFSIG+ +HHAVLDG++S++F+K+WA 
Sbjct: 123 RPLSELHALVPKLPVSDDSATAFSIQITLFPNQGFSIGVAAHHAVLDGKTSSTFIKAWAQ 182

Query: 190 LCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLE--AIYLKCLLAHEGPNNRSL-KFWDF 249
           +CK      E+        P YDRS++     L+   I L   L  +  N RSL      
Sbjct: 183 ICKQ-----ELQSMPENLTPSYDRSLIKYPTYLDEKMIELVRSLKEDQTNIRSLTSLPSS 242

Query: 250 KTPPDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVC----ASAVA 309
           K   D    T  LS  +I++L++ V        P LH+ST+ +A  Y W C         
Sbjct: 243 KLGDDVVLATLVLSRADIERLREQV----KNVSPSLHLSTFVIAYAYAWTCFVKARGGNK 302

Query: 310 DEDITIAVTVDARGRLDPPLPATYFGNYVVGRST-ALKRGKLFGENGVIAAVETISEMIK 369
           D  +++    D R RLDP LP TYFGN ++       K  +   E G + A E IS+++K
Sbjct: 303 DRSVSLLFVGDFRDRLDPKLPGTYFGNCMIPVGCYNRKAAEFMEEKGFVTAAEIISDLVK 362

Query: 370 SLKEEGPLKGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINR 429
            L        A+ +V   +    ++ +  I+  GS R  VY  DFGWG+P KV++VSI++
Sbjct: 363 GLSSRKIETIADTFVEGFSFQSWSTQFGTIA--GSTRLGVYEADFGWGRPVKVDIVSIDQ 422

Query: 430 TGAVCISESRD-GGGVELGWTAKRDVMENFAKLF 446
             A+ ++E RD  GGVE+G   K+  M++    F
Sbjct: 423 GEAIAMAERRDESGGVEIGMCLKKTEMDSVVSFF 445

BLAST of CsaV3_3G020720 vs. Swiss-Prot
Match: sp|Q9FNP9|AGCT_ARATH (Agmatine coumaroyltransferase OS=Arabidopsis thaliana OX=3702 GN=ACT PE=1 SV=1)

HSP 1 Score: 263.8 bits (673), Expect = 3.4e-69
Identity = 164/458 (35.81%), Positives = 242/458 (52.84%), Query Frame = 0

Query: 9   AIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYK-------SPVPFYVIV 68
           A+KV+++  V+P   S  P+ +PL+FFD+ W +  P E++FFYK         V +  I+
Sbjct: 2   ALKVIKISRVSPATASVDPLIVPLSFFDLQWLKLNPTEQVFFYKLTESSSSRDVFYSSIL 61

Query: 69  SNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGLR 128
             L++SLSL+L H+    G + W    PKP +     D + LTVAE++ DF  + G GLR
Sbjct: 62  PKLERSLSLILTHFRLFTGHLKWDSQDPKPHLVVLSGDTLSLTVAETDADFSRISGRGLR 121

Query: 129 KEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAGL 188
            E +LRPL+ EL    D  AVV++QVT F   GF IG T+HH VLDG+++  F K+WA  
Sbjct: 122 PELELRPLIPELPIYSDSGAVVSLQVTLFPKQGFCIGTTAHHVVLDGKTAEKFNKAWAHT 181

Query: 189 CKNLVGGGEIFCPAAETMP-FYDRSVVTDKMGLEAIYLKC---LLAHEGPNNRSLKF--- 248
           CK+    G I     + +P   DRSVV    GLE   L+    L   +  N R+LK    
Sbjct: 182 CKH----GTI----PKILPTVLDRSVVNVPAGLEQKMLELLPYLTEDDKENGRTLKLPPV 241

Query: 249 WDFKTPPDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVC----AS 308
            +     +  R T ++SP+NI+KLK+   +    A+  LH+ST+ V   + W C     S
Sbjct: 242 KEINAKDNVLRITIEISPENIEKLKERAKKESTRAE--LHLSTFVVTFAHVWTCMVKARS 301

Query: 309 AVADEDITIAVTVDARGRLDPPLPATYFGNYVVGRSTALKRGKLF-GENGVIAAVETISE 368
              +  +      D R RL+PP+P TYFG  V+       + K F GE+G +  VE +S+
Sbjct: 302 GDPNRPVRFMYAADFRNRLEPPVPVTYFGTCVLAMDFYKYKAKEFMGEDGFVNTVEILSD 361

Query: 369 MIKSLKEEGPLKGAENWVLLMTQT-VVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVV 428
            +K L  +G       W +    T  +    +L+   GS +  +Y  DFGWG+P   E +
Sbjct: 362 SVKRLASQGV---ESTWKVYEEGTKTMKWGTQLLVVNGSNQIGMYETDFGWGRPIHTETM 421

Query: 429 SINRTGAVCISESRDG-GGVELGWTAKRDVMENFAKLF 446
           SI +     +S+ RDG GGVE+G + K+  M+ F  LF
Sbjct: 422 SIYKNDEFSMSKRRDGIGGVEIGISLKKLEMDTFLSLF 446

BLAST of CsaV3_3G020720 vs. Swiss-Prot
Match: sp|Q9LRQ7|BAHD2_ARATH (BAHD acyltransferase At3g29680 OS=Arabidopsis thaliana OX=3702 GN=At3g29680 PE=2 SV=1)

HSP 1 Score: 257.7 bits (657), Expect = 2.4e-67
Identity = 160/452 (35.40%), Positives = 238/452 (52.65%), Query Frame = 0

Query: 9   AIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVP-----FYVIVSN 68
           A+ V+++  V+    S  P+ LPLTFFD+LW +  P+ER+ FYK         F  I+  
Sbjct: 2   ALNVIKISRVSLVTNSVEPLVLPLTFFDLLWLKLNPIERVTFYKLTESSRDSFFSSILPK 61

Query: 69  LKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGLRKE 128
           L++SLSLVL H+LPL+G + W    PKP +    +D + LTV ESE DF ++    LR E
Sbjct: 62  LEQSLSLVLSHFLPLSGHLKWNPQDPKPHIVIFPKDTVSLTVVESEADFSYISSKELRLE 121

Query: 129 AKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAGLCK 188
            +LRPLV EL    D A+++++Q+T F N GFSIG T HH V+DG++++ F KSWA +CK
Sbjct: 122 TELRPLVPELQVSSDSASLLSLQITLFPNQGFSIGTTVHHVVMDGKTASKFHKSWAHICK 181

Query: 189 NLVGGGEIFCPAAETMPFYDRSVVTDKMGLE--AIYLKCLLAHEGPNNRSLKFWDFK-TP 248
           +     +   P        DR+V+    GLE     L   ++ E    R+L     K   
Sbjct: 182 HGTTPQDFDLPTV-----LDRTVINVPAGLEQKIFQLSSYISEEKDYARTLTLPPAKEID 241

Query: 249 PDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVC----ASAVADED 308
            D  R T +L+  +I+KLK+        +   LH+ST+ V+  Y   C        A+  
Sbjct: 242 NDVVRVTLELTEVDIEKLKERAKNESTRSD--LHLSTFVVSYAYVLTCMVKSCGGDANRP 301

Query: 309 ITIAVTVDARGRLDPPLPATYFGNYVVGRS-TALKRGKLFGENGVIAAVETISEMIKSLK 368
           +      D R RLDPP+P TYFGN V+       K     G++G +  VE +S+ ++ L 
Sbjct: 302 VRFMYAADFRNRLDPPVPLTYFGNCVLPIDFNGYKATTFLGKDGYVNGVEILSDSVRGL- 361

Query: 369 EEGPLKGAENWVLLMTQTV-VNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINRTG 428
             G       W +    T  +  D + ++ TGS +F +Y  DFGWG+P K +V+S+ +  
Sbjct: 362 --GSRNIESIWEVYEDGTKNMKLDTQNVTVTGSNQFGIYGSDFGWGRPVKTDVMSLYKNN 421

Query: 429 AVCISESRDG-GGVELGWTAKRDVMENFAKLF 446
              +S  RD  GG+E+G + K+  M  F  LF
Sbjct: 422 EFSMSARRDEIGGLEIGISLKKCEMNVFLSLF 443

BLAST of CsaV3_3G020720 vs. TrEMBL
Match: tr|A0A1S3CNS2|A0A1S3CNS2_CUCME (phenolic glucoside malonyltransferase 1-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103503024 PE=4 SV=1)

HSP 1 Score: 831.6 bits (2147), Expect = 8.3e-238
Identity = 406/446 (91.03%), Positives = 428/446 (95.96%), Query Frame = 0

Query: 1   MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVI 60
           MAT+HGSTAIKVLE+CTVAPPPGSTVP TLPLTFFDILWFRFPPVERLFFYKSPVPF+VI
Sbjct: 37  MATDHGSTAIKVLEICTVAPPPGSTVPATLPLTFFDILWFRFPPVERLFFYKSPVPFHVI 96

Query: 61  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL 120
           VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETA  DGIVLTVAES+DDFDHL+GDGL
Sbjct: 97  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAAGDGIVLTVAESDDDFDHLVGDGL 156

Query: 121 RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG 180
           R+EAKL+PLVAELAAEE+RAAVVAVQVTWFGNG FSIGITSHHA+LDGRSSTSFMKSWAG
Sbjct: 157 REEAKLQPLVAELAAEEERAAVVAVQVTWFGNGRFSIGITSHHAILDGRSSTSFMKSWAG 216

Query: 181 LCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFKTP 240
           LCKNLVGGG+IF PAAETMPFYDRSVVTD +GLEAIYL+C LAHEGPNNRSLKFWD KTP
Sbjct: 217 LCKNLVGGGDIFFPAAETMPFYDRSVVTDNVGLEAIYLECWLAHEGPNNRSLKFWDVKTP 276

Query: 241 PDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVADEDITIA 300
           PD FRGTFKLSPQ+IQKLKQHVL+HRNP QP  HISTYTVAMGYTWVCASAVADE+I+I 
Sbjct: 277 PDLFRGTFKLSPQDIQKLKQHVLKHRNPVQPPPHISTYTVAMGYTWVCASAVADEEISIG 336

Query: 301 VTVDARGRLDPPLPATYFGNYVVGRSTALKRGKLFGENGVIAAVETISEMIKSLKEEGPL 360
           VT+DARGR+ PPLPATYFGN VVGRSTAL+RGKL GENGVIAAVETISEMIKSLKEEGPL
Sbjct: 337 VTMDARGRVYPPLPATYFGNCVVGRSTALERGKLLGENGVIAAVETISEMIKSLKEEGPL 396

Query: 361 KGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINRTGAVCISE 420
           KGAENWVLLMTQTVVN+DYKLIST GSPRFEVYSVDFGWGKPEKVEVVSINRTGAVCISE
Sbjct: 397 KGAENWVLLMTQTVVNNDYKLISTAGSPRFEVYSVDFGWGKPEKVEVVSINRTGAVCISE 456

Query: 421 SRDGGGVELGWTAKRDVMENFAKLFA 447
           SR+GGGVE GWTA+RDVMENFAKLFA
Sbjct: 457 SRNGGGVEHGWTARRDVMENFAKLFA 482

BLAST of CsaV3_3G020720 vs. TrEMBL
Match: tr|A0A0A0L9V3|A0A0A0L9V3_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G280980 PE=4 SV=1)

HSP 1 Score: 504.6 bits (1298), Expect = 2.3e-139
Identity = 246/246 (100.00%), Positives = 246/246 (100.00%), Query Frame = 0

Query: 1   MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVI 60
           MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVI
Sbjct: 1   MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVI 60

Query: 61  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL 120
           VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL
Sbjct: 61  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL 120

Query: 121 RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG 180
           RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG
Sbjct: 121 RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG 180

Query: 181 LCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFKTP 240
           LCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFKTP
Sbjct: 181 LCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFKTP 240

Query: 241 PDSFRG 247
           PDSFRG
Sbjct: 241 PDSFRG 246

BLAST of CsaV3_3G020720 vs. TrEMBL
Match: tr|A0A1S3CP69|A0A1S3CP69_CUCME (phenolic glucoside malonyltransferase 1-like isoform X2 OS=Cucumis melo OX=3656 GN=LOC103503024 PE=4 SV=1)

HSP 1 Score: 472.2 bits (1214), Expect = 1.3e-129
Identity = 235/293 (80.20%), Positives = 255/293 (87.03%), Query Frame = 0

Query: 1   MATNHGSTAIKVLEVCTVAPPPGSTVPVTLPLTFFDILWFRFPPVERLFFYKSPVPFYVI 60
           MAT+HGSTAIKVLE+CTVAPPPGSTVP TLPLTFFDILWFRFPPVERLFFYKSPVPF+VI
Sbjct: 37  MATDHGSTAIKVLEICTVAPPPGSTVPATLPLTFFDILWFRFPPVERLFFYKSPVPFHVI 96

Query: 61  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGL 120
           VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETA  DGIVLTVAES+DDFDHL+GDGL
Sbjct: 97  VSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAAGDGIVLTVAESDDDFDHLVGDGL 156

Query: 121 RKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAG 180
           R+EAKL+PLVAELAAEE+RAAVVAVQVTWFGNG FSIGITSHHA+LDGRSSTSFMKSWAG
Sbjct: 157 REEAKLQPLVAELAAEEERAAVVAVQVTWFGNGRFSIGITSHHAILDGRSSTSFMKSWAG 216

Query: 181 LCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFKTP 240
           LCKNLVGGG+IF PAAETMPFYDRSVVTD +GLEAIYL+C LAHEGPNNRSLKFWD KTP
Sbjct: 217 LCKNLVGGGDIFFPAAETMPFYDRSVVTDNVGLEAIYLECWLAHEGPNNRSLKFWDVKTP 276

Query: 241 PDSFRGTFKLSPQNIQKLKQHVLEHRNPAQPLLHISTYTVAMGYTWVCASAVA 294
           PD FRG   +  Q +  + +               +T TVA+  T+V A A+A
Sbjct: 277 PDLFRGALFVPSQGLTTIVE---------------TTVTVAIIATFVAAEAIA 314

BLAST of CsaV3_3G020720 vs. TrEMBL
Match: tr|A0A2I4HM45|A0A2I4HM45_9ROSI (phenolic glucoside malonyltransferase 1-like OS=Juglans regia OX=51240 GN=LOC109019354 PE=4 SV=1)

HSP 1 Score: 419.9 bits (1078), Expect = 7.5e-114
Identity = 227/461 (49.24%), Positives = 299/461 (64.86%), Query Frame = 0

Query: 1   MATNHGSTAIKVLEVCTVAPP---PGSTVPVTLPLTFFDILWFRFPPVERLFFYK---SP 60
           MAT +   ++ +LEVC V+PP   P S  P TLPLTFFD+LW RF PV+R+FFY+   + 
Sbjct: 1   MATKN---SVTILEVCRVSPPQKTPVSATPTTLPLTFFDVLWLRFAPVQRVFFYETSHTD 60

Query: 61  VPFY-VIVSNLKKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFD 120
            PF   I+  LK SL+L LQ YLPLAG + WP++S KP +     D I L+VAES  +F 
Sbjct: 61  SPFLDTILPKLKDSLALTLQQYLPLAGTLKWPKDSHKPIINYTEGDAISLSVAESNANFY 120

Query: 121 HLIGDGLRKEAKLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTS 180
           HL GD   +  +  PLV +LA   +  +V+A+Q T F N GF IGIT+HHAVLDGRSST 
Sbjct: 121 HLSGDNFVEAVEYHPLVPDLATSPEACSVIALQATTFSNCGFCIGITAHHAVLDGRSSTM 180

Query: 181 FMKSWAGLCKNLVGGGEIFCPAAETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLK 240
           FMKSWA +CK  +GG +      E  P YDR V+ D +GLEAIYL   +   GPNN+SL 
Sbjct: 181 FMKSWARICK--LGGTDALSLVPELTPCYDRMVIKDPIGLEAIYLNQWMNENGPNNKSLM 240

Query: 241 FWDFKTPPDSFRGTFKLSPQNIQKLKQ----HVLEHRNPAQPLLHISTYTVAMGYTWVC- 300
            W+ KTPPD+ RGTFKL+  N++KL+Q     +LE +N  +  LH+ST+T+   YT+VC 
Sbjct: 241 VWELKTPPDTVRGTFKLTRANLEKLRQLIMTSMLEDKNKQKQPLHLSTFTLTCAYTFVCL 300

Query: 301 --ASAVADEDITIAVTVDARGRLDPPLPATYFGNYVVGRSTALKRGKLFGENGVIAAVET 360
             A    D       +VDAR RL+ P+P TYFGN V GR   L+  +L GE G+  AV+ 
Sbjct: 301 AKAEQPRDTKYLTGFSVDARHRLEDPIPGTYFGNCVAGRIATLEGNELSGEEGMAVAVKA 360

Query: 361 ISEMIKSLKEEGPLKGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVE 420
           ISE I+SL  +G L GAE W+ ++       + ++I   GSPRFE+YS DFGWGKP KV+
Sbjct: 361 ISEAIRSL-GDGVLSGAEKWIEIV---FTPKNARVIGVAGSPRFELYSTDFGWGKPMKVD 420

Query: 421 VVSINRTGAVCISESRDGGG-VELGWTAKRDVMENFAKLFA 447
           ++SI++TGA+ +SE+RDG G VE+G   K+  ME FA LFA
Sbjct: 421 MISIDKTGAISLSETRDGAGEVEVGLVLKKPEMEVFASLFA 452

BLAST of CsaV3_3G020720 vs. TrEMBL
Match: tr|G7L664|G7L664_MEDTR (Anthocyanin 5-aromatic acyltransferase OS=Medicago truncatula OX=3880 GN=11430311 PE=4 SV=2)

HSP 1 Score: 407.9 bits (1047), Expect = 3.0e-110
Identity = 214/452 (47.35%), Positives = 297/452 (65.71%), Query Frame = 0

Query: 11  KVLEVCTVAPPPGSTVP--VTLPLTFFDILWFRFPPVERLFFYKSP----VPFYVIVSNL 70
           K++E+  VAP     +P   +LPLTFFDILW R PPV+R+FFY+ P    +    ++  L
Sbjct: 3   KIIEIFNVAPSSQVELPSETSLPLTFFDILWLRLPPVQRIFFYEFPHQTSLFLNTLLPKL 62

Query: 71  KKSLSLVLQHYLPLAGAIVWPENSPKPAVETAVRDGIVLTVAESEDDFDHLIGDGLRKEA 130
           K+SLSL L H+ PL G ++WP +S KP ++    + + LT+AES  DF+HL G  L +  
Sbjct: 63  KQSLSLTLSHFYPLLGHLIWPNDSHKPIIKFIRGNTLSLTIAESHADFNHLSGKNLSEAT 122

Query: 131 KLRPLVAELAAEEDRAAVVAVQVTWFGNGGFSIGITSHHAVLDGRSSTSFMKSWAGLCKN 190
           ++  L+  L    D+A+V+A+QVT F N GFSIGITSHHAVLDG++STSF+KSWA LC+ 
Sbjct: 123 QIHDLLPNLNISHDQASVLALQVTIFPNYGFSIGITSHHAVLDGKTSTSFIKSWAYLCRK 182

Query: 191 LVGG-GEIFCPAA---ETMPFYDRSVVTDKMGLEAIYLKCLLAHEGPNNRSLKFWDFKTP 250
           L     E+  P     E  PFYDR V+ D   LEA YL   L   G NNRSL  WD + P
Sbjct: 183 LENEVSELVSPLCLPHEFCPFYDRKVIKDPNELEAKYLSDWLKQGGTNNRSLMVWDLQVP 242

Query: 251 PDSFRGTFKLSPQNIQKLKQHVLE----HRNPAQPLLHISTYTVAMGYTWVC---ASAVA 310
            DSFRG F+LS  +I+KLK+ V+     +RN  +  LH+ST+ V++ Y WVC   A  + 
Sbjct: 243 EDSFRGLFQLSRLDIEKLKEFVVSKQKGNRNEKKN-LHLSTFVVSIAYAWVCRVKAEEIE 302

Query: 311 DEDITIAVTVDARGRLDPPLPATYFGNYVVGRSTALKRGKLFGENGVIAAVETISEMIKS 370
           +++  + + +D R RLD P+PATYFGN +  R   +K  +L GE+G+I AVE +SE +++
Sbjct: 303 NKNAMMVLNIDCRNRLDQPIPATYFGNCIGARLAIVKTNELLGEDGLIVAVEVLSEALET 362

Query: 371 LKEEGPLKGAENWVLLMTQTVVNSDYKLISTTGSPRFEVYSVDFGWGKPEKVEVVSINRT 430
           +K +G L GAENW  L+ + +  +D K+I   GSP+FEVYS DFG GKP+KVE+VSI+RT
Sbjct: 363 IK-DGVLNGAENWSSLLLEGLAMTDVKMIGAAGSPKFEVYSTDFGCGKPKKVEMVSIDRT 422

Query: 431 GAVCISESRDGGGVELGWTAKRDVMENFAKLF 446
           GA C+S+ R G GVE+G+ + +  ME+FA LF
Sbjct: 423 GAFCLSDCRKGDGVEIGFVSNKKAMESFASLF 452

The following BLAST results are available for this feature:
Match Name                      E-value   Identity  Description
XP_004145869.2                  2.7e-264  100.00    PREDICTED: phenolic glucoside malonyltransferase 1-like [Cucumis sativus]
XP_008465393.1                  1.3e-237  91.03     PREDICTED: phenolic glucoside malonyltransferase 1-like isoform X1 [Cucumis melo...
XP_022948949.1                  1.3e-157  64.10     phenolic glucoside malonyltransferase 2-like [Cucurbita moschata]
XP_023523638.1                  1.4e-156  63.66     phenolic glucoside malonyltransferase 1-like [Cucurbita pepo subsp. pepo]
XP_022997666.1                  3.1e-151  62.20     phenolic glucoside malonyltransferase 1-like [Cucurbita maxima]

Match Name                      E-value   Identity  Description
AT5G39050.1                     8.5e-79   39.14     HXXXD-type acyl-transferase family protein
AT3G29590.1                     2.6e-75   39.34     HXXXD-type acyl-transferase family protein
AT5G39090.1                     1.3e-74   39.19     HXXXD-type acyl-transferase family protein
AT3G29670.1                     2.2e-71   36.12     HXXXD-type acyl-transferase family protein
AT3G29635.1                     6.5e-71   38.85     HXXXD-type acyl-transferase family protein

Match Name                      E-value   Identity  Description
sp|Q940Z5|PMAT1_ARATH           1.5e-77   39.14     Phenolic glucoside malonyltransferase 1 OS=Arabidopsis thaliana OX=3702 GN=PMAT1...
sp|Q9LJB4|5MAT_ARATH            4.6e-74   39.34     Malonyl-CoA:anthocyanidin 5-O-glucoside-6''-O-malonyltransferase OS=Arabidopsis ...
sp|Q9LRQ8|PMAT2_ARATH           4.0e-70   36.12     Phenolic glucoside malonyltransferase 2 OS=Arabidopsis thaliana OX=3702 GN=PMAT2...
sp|Q9FNP9|AGCT_ARATH            3.4e-69   35.81     Agmatine coumaroyltransferase OS=Arabidopsis thaliana OX=3702 GN=ACT PE=1 SV=1
sp|Q9LRQ7|BAHD2_ARATH           2.4e-67   35.40     BAHD acyltransferase At3g29680 OS=Arabidopsis thaliana OX=3702 GN=At3g29680 PE=2...

Match Name                      E-value   Identity  Description
tr|A0A1S3CNS2|A0A1S3CNS2_CUCME  8.3e-238  91.03     phenolic glucoside malonyltransferase 1-like isoform X1 OS=Cucumis melo OX=3656 ...
tr|A0A0A0L9V3|A0A0A0L9V3_CUCSA  2.3e-139  100.00    Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G280980 PE=4 SV=1
tr|A0A1S3CP69|A0A1S3CP69_CUCME  1.3e-129  80.20     phenolic glucoside malonyltransferase 1-like isoform X2 OS=Cucumis melo OX=3656 ...
tr|A0A2I4HM45|A0A2I4HM45_9ROSI  7.5e-114  49.24     phenolic glucoside malonyltransferase 1-like OS=Juglans regia OX=51240 GN=LOC109...
tr|G7L664|G7L664_MEDTR          3.0e-110  47.35     Anthocyanin 5-aromatic acyltransferase OS=Medicago truncatula OX=3880 GN=1143031...

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0016747  transferase activity, transferring acyl groups other than amino-acyl groups
Vocabulary: INTERPRO
Term        Definition
IPR023213   CAT-like_dom_sf
IPR003480   Transferase
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0008152      metabolic process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0016747      transferase activity, transferring acyl groups other than amino-acyl groups

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CsaV3_3G020720.1  CsaV3_3G020720.1  mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR Term   IPR Description                                           Source       Source Term        Source Description              Alignment
IPR003480  Transferase                                               PFAM         PF02458            Transferase                     coord: 25..443; e-value: 3.3E-49; score: 167.6
IPR023213  Chloramphenicol acetyltransferase-like domain superfamily GENE3D       G3DSA:3.30.559.10  -                               coord: 239..448; e-value: 2.1E-44; score: 153.1
IPR023213  Chloramphenicol acetyltransferase-like domain superfamily GENE3D       G3DSA:3.30.559.10  -                               coord: 9..229; e-value: 4.3E-61; score: 208.2
None       No IPR available                                          PANTHER      PTHR31625          FAMILY NOT NAMED                coord: 1..446
None       No IPR available                                          SUPERFAMILY  SSF52777           CoA-dependent acyltransferases  coord: 63..205
None       No IPR available                                          SUPERFAMILY  SSF52777           CoA-dependent acyltransferases  coord: 273..445
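
Each alignment entry above gives a `coord: start..end` range on the protein. As a hedged illustration, the Python sketch below extracts that range and reports the span length. It assumes the coordinates are 1-based and inclusive, which the page does not state explicitly; the helper name `span_length` is hypothetical.

```python
import re

# Matches the "coord: start..end" notation used in the InterPro alignment column.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(text: str) -> int:
    """Return the number of residues covered, assuming 1-based inclusive coordinates."""
    match = COORD.search(text)
    if match is None:
        raise ValueError(f"no coord range found in {text!r}")
    start, end = int(match.group(1)), int(match.group(2))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return end - start + 1


# Example using the PFAM PF02458 entry from the table above.
print(span_length("coord: 25..443"))
```

Under the inclusive-coordinate assumption, the PFAM Transferase domain covers 419 residues.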