CsaV3_7G006280 (gene) Cucumber (Chinese Long) v3

NameCsaV3_7G006280
Typegene
OrganismCucumis sativus (Cucumber (Chinese Long) v3)
DescriptionGlycosyltransferase
Locationchr7 : 3856800 .. 3880933 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGTTCTCTTACCAAAGTAGACCAAGGAAAACAGCAACCTCATGCAGTGCTCTTCCCATACCCATCTCAAGGCCACATAAGTCCCATGCTAAAGCTTGCAAAACTCTTTCACCACAAAGGCTTTCACATAACGTTTGTCAATACAGAATACAACCATAGACGTCTCCTCAGATCCCGAGGTCCCAACTCTTTGGATGGCTTGCCTGACTTTCATTTCAGAGCCATTCCCGACGGCCTCCCACCGTCCGACGGCAACTCCACCCAACATATTCCTTCACTTTGCTATTCCGCTTCCCGCAATTGTTTGGCTCCACTCTGTAGTCTCATTTCGGAGATCAACTCGAGTGGCACCGTTCCACCTGTTTCCTGTATTATTGGCGATGGGGTCATGACTTTCACTGTTTTTGCTGCCCAGAAATTTGGTATTCCCATTGCTTCGTTTTGGACGGCTAGTGCTTGTGGGTGCCTTGGATATATGCAGTATGCTAAGCTTGTTGAACAGGGCTTGGTTCCTTTCAAAGGTATAAATTAACTTACAATCTACAGTTCTTGTTCGTTATCGTTTTTATTATTTTCTATCTGTCCTTTCTTTTCGTCCACTGATTATTTCTGCTGTAGTGATTAGCGCTGTGCCGCAAACATATCCTATCAATGCTTTTGAAATTTTGACATATAATTCCTACAATTACATCCGTTCTTGGGTAGATATCTATTAATTTTAATTTTGTTAAAAATATTTTTAAAATCTTACAATTTCATCTTCAAGATTTTAATTTTGTTTTAATTTAGTTCTTTGTTTCCAAGATTTAAATTTTGAACTTTAACTTTTAACTGAATACACATTTTCATTACGGGGTGTTAAATATTTATTAATTAATATTAAAAGTAATTATGAAGTGAAAAATTAAAATTAAATTTAGTAGTGATGAAGAAATTGTATATTATTAATTAATTGTAATTTTTTTAATTAAGAGACAAGAAGTGAGTATTAAGTTAGAAGTGGAAGTTAAGAATGAAGAGATTGAATTGAAATCAAGCTAGCTAGTTTAAAATGTAAAGTATTATTTTAAATGCAGAGAACCAAAACTATATTTTTCCTCACATTAATTTATTTGGAGTACAAAGACCCAAAACAAAACTAGCCTTTGAAGTTCATGTATAAAAATATTCAAAATTGTATTTTAATAAGATTTCACTAGAGTACGTATCTTCTTGGCTTTGCTGTTTTATCATTTGTATTGATTCATTATTTTAATATAAAGTTAAGGAATACAAAAAACAAAAAAAATAGGGAACTTTATCAATCAATATAAAATAATTTGGTTGATTTGGTTGTTATCTAATCTCCATTATTTTCTTCTAATTTAAAATTGATTTGTCCTCAAAATTTGATAACAATTAATCATTCAAATTTGGTGTAATTGCATATTAAATCATTAATTGATTTAAAAGTATAAGTTTATTGGTGAAAATAAATTTAATATTATATTATCTATTAATTCATCAATAAACTCAATACCATATAGTAAAAGATTTAAACACATAACCTTTTTGGACTATCGGCTCTATATCATTTTTAATCGTCAATCAATTCAAAAATTTTAGTTGGATAAATGACAAATTTAATGTAATATTTAACGATCTAAAATCTTCTACTCATAGATTATAATTTGTATTTCTTTTGTATTGCAGATGAAAACTTCACGACTAATGGGGATCTAGAAGAAACAATAGAATGGATTCCACCAATGGAGAAAATTCGTTTGAGAGACATTCCAAGCTTTATTAGAACGACAGATAAAGACGACATAATGCTCAACTTTTTCCTCGAACAATTAGAAACACTTCCCAAAGCCAATGCAATAATCATGAACACATTTGATTCCTTAGAGCACCATGTGTTGGAAGCACTTTCCTCAAAACTTCCTCCCATTTACCCAATTGGTCCAATCAACTCATTAGTTGCTGAGTTAATCAAAGACGAAAAAGTCAAGGACATCCGTTCAAATCTATGGGACGAGCAGTCAGAATGCATGAAATGGCTTGATTCACAACAACCCAACTCAGTAGTTTACGTCAACTTTGGAAGCATAACTGTGATGTCTCCTGAACACTTGGTGGAATTTGCATGGGGACTTGCCAATAGTGAGAAGCCCTTCTTGTGGATCGTAAGGCCAGATCTAGTGGAAGGGGAGACGGCATTGTTGCCGGCGGAGTTTTTGGCAGAAACGAAAGAAAGAGGGATGTTGGGGGATTGGTGTAACCAAGAAGAGGTTTTGAAGCATCCTTCGGTTGGAGGGTTTTTGACTCACAGTGGATGGAACTCGACGATGGAGAGTATTGCGGGAGGAGTGGCTATGATATCGTGGCCGTTCTTTGCAGAACAACAAACCAATTGTCGTTATTGTAAGACGGAGTGGGGAAATGGGTTGGAGATTGATAGCAATGTGAGGAGGGAGGACGTGGAGAAGCTTGTGAGGGAGTTGATGGAAGGAGAAAAGGGTGAAGACATGAAGAGAAATGCGGAAGAGTGGAAGAGGAAGGCTGAAGAGGCTTGTAAGAATGGTGGGTCATCCCTAACAAATTTAGATCGAGTCATTTCTGAAATTCTTTCCAGCAAAGAGAAATCAAAGCTTGAATGTCAAAATTAACGATATAATTTCATGTAAGCATGGCTAGTAACAACGCTCATGCCGTCTGGTCAAATAGTTAAGCATATTAATTTTCATAAATCTAAACGAAGAAACTTTTGTCTTCTTTGGCTTTTCCTATTACTGATGTGTTTGATTTATATTGGTATTGTTATGGTTTCTTTAATATAAATAAGTGGTGTTCATTGGGATGTGTGCCAATGTGTTGTTGAATAATATTGTAGATCTCAAAGAAGGTTGATGTCCAATCTTAATCTGTTAAAGTCAATGAACTATTTGAAAAAAGATACAATTAAGTAAATTCTAACAAAGTGTCATACAAATTTGTATATAGACATATAAATAAATATTTTGTCATTGTTGCTGCATTTAAATTATCACTTTGCTCAAATTTGATTCTATATATCTTTTTTTCATTTCGTTAATCACATATGCATGTCTCACTTTTATAAATGTAAATTTTTTTTTAGTAAATTTTCTGATTTCGAGGATAGTAAGTTGTGCAAGGTTTATTACCAAATTAGTAAATATTTGGTTTAAAAAAGAATACAAATTTTGCATTTAAAAAATTAAATTTAAAATCAGATCAACAACCTCCTCTGAAATTAGTTTTCAAAAATAGATCAACTCAATCCATAAACTTTTCAACATGAAAAAAATAAAACATAACCAAGATTAGAATAAGATCACATACACGTGGTGTGATCGATCCTATTCATCGGCGTTATTTAAAAATAAAAAGAAAAAAGTAGTAATCAAATCAAATTTCTAAACTAATGAAATACTCACAACACAAAAGATCAGTGACATGTGGTATACTCACAACACAAAAGATCAGTGACATGTGGTATCCATCACTTCTGCTTTCTCTAACCTATGAAAAGTTTATGAAAGTAAAGACAACACAATTTATATACAACTCCAATTTCCCTTTCTCTATCTAACTTTATTTCCCTTTCCCTATCTAACTTTTTCTTTATTTTTCTTTTTAACGGTATCTGTCTAGTTGGTAGGTGCATCCAAGCATTTTTACAAGATGGTATCGTTTTAGCACCTTCATTATTTTCTATAATGATTTTATTCATTGAAGCACAAGGTACATGATGATAAAACAAAAGGACTAGCGATAAGCCAAAAGAAAGAAAGATTCCTTTCTCTACCAAACTTTCCATGAATTGACAAGTCAAATTTTAATAAGTTTTTTTTCAATTTTCGTGGTCAATATATTAATATATCAAAAATTTTACGATAAAATATTTTGTTATATTAAAAAATAATGTGCTTAATTTGTGGTGGTTACATATCTAAGAATTAATTTTTTGCAGAGTTTTTTTTTTTTTTTATATATATTCAAATGTTGTGGATCAAACGGGTTGTCTTGTAAGATTAGTAAAGATGCACGTAAGCTGACTCGGATTAATTAAGTGAACAAAAAAAAAATGTCTCTATATAATTTATTATCAACAATAGCTACCACACTCTTACTCAATCACGATTCATATTATATAAGAGAAAAATGATGTTGTTAGGGAAAGAAGTACTCCATGAAATGTTTAAATACTATTTTTCATCTCTCAATTTTGGTCCATCAACTCTATTCATTTTCTGAATTTCAAAAATTATAAGAAGACTATTGTTTTTATTATTTATAGAGGATTTGATCTAAAATTTAACTTAGCTTTCTGAAATTAGGATGTGAATTTTTCCTAATTTCAAACTAGCTAGATCTTATCTATTTTAATTCTTAGCTTTATTCTATGGAGCCAAAATTCACCCATAACATTAACTTCTACACCAAAGTATTATTTCAAAGATGATACCTATCCTAGTCTCGTGCAGTGATTCAAATTTAAAAGTATTGTTCTAGCCTTTTCGTTATACGACGTATACAAACTTTATCAAACTAGCAGTAACTAAATCGAACTTGGCTTCTCCAACATCATTTTCAGATCTTTGGCCATATCTCTTCAATATCTAATCAAGTCCTTTTTTCTTCTTTTTGAAGACCCAGTTAGGTAATTGTCTCTTGATGCTCATAATTTATGTTATATAACCTTTTTTATTGCAACAGATTAAACATATGAGCTTAGATCTTTTATAGTTTAGGGGTAGCCCCTAGAGCCCCTGATAATTATGAACTCATAATGCATATGAAACACACATGTTATCAAACAATTGTAAGGCTGTTAAAGCAACATTAAACCTCAAACCCCTTCCACATATTATGTTTGTAGAATAATAATGAAAAGAGACGTACTGCTATATCTAATTAACTTCAATAATGTTAACAGTTTGTGGAAAGAAAAGTATAGGTTGAAAGAAAAAGACACATATTTTTCTATTCTCAGAATTTAAATTCAAAAATGCTATATCCAATTATATATATATTTGCAAAGCATGGTAAAAAATTATCAGAAATGTGTTTAATTAAAAAGAACAGAAAACAAAATGTAAACACTTGTTCCAACTTGTCACATTTTTCTTCTACTTTATTATTGCTTTCTACTCTAGACCTAAGCAAAGTTTGAAAGCTAAAACAAATAGTTTTAAAAATTTGTTTTGTGTTTTCAAATTTGGTGTAAAAGTCCAACTATCTTACTTAAAGAAATATGAAAACATGATATGAATTTAGAATAAACAAACATAATTTTTTAAATAAAAACTAAATAGTTATTAAACAATTTTAAAATTGAAATAGTTAGTTTAAAATTATTATTTAAATTAACAATAAAGATTCCTAAAATTATTGCTAGATTATTTAACTCGACTCATCACTTACTTTGTGAAAGAAAATATGAGTTTAACAACTCATTATAAAAAAATGTTTCTATTGTATTGAAATGGTTATAAATTTTTTTATTGATGGTCGAAATTGGCAAGTAGCGATTGATAATAGTGTCAAAAGTGGCAATAATCAGAGATCATCGAAGATGGTGGTAGAAAGTTGGCTACTTCTCGATGACACACAATTGTCAGAGGTGGGTGGCCAAGATCATCAAAGATTCGTTAACAATAATTGACCCATTGGTAGTGACTAGAGTTGTTGCTCATATAAATTAGAGAAAAAGATAATCACTAAACACTCAAAATAATTTTTTAAAAGGTTGATTTCAAATATTAGTCCTAAACCAAGTTTGAGTGCATCAACTTAACCAAAATATTTTTTTTTATAAATTTTCATTTATTTCAAAATTCATATTAAATGGTTCTCAAACACATGGATTTATCTCAAAAACATCTTTTAAAACAAGTTAATTAAAGGTATTTCAAACATAGAGAAATTCAATCAATTTCAATAGATCAAACAAACATGTACAACTTTACCAATCAAATCAATGGAAGAGTCCATGAGCATCTTATCAAACATATTATAAGATTAGACTTAATCGTAGAAAATAAAGCTTAATCTTTAATGATTAGAAAAGTCCAATATATTGAATTCAATAGAATTTGATACATAGAGTTATGGGTCCGTTCGATTTACTTGAAAAAAAAAAGAACTTTTTAAAGAGTTCTTTTTGTTAAAACACAACTAAAATAAAGTAGCAAACTTACCTCTTTCAATTAATGTATAGAAACCATATACTAAAAAATATTTTACTAATTTTCATTATCGTAGAAGACCTACTACAAAATATTATGTTTAGGAACAAATTTATTAAGGGGTTAGCGTCAATAGCAGCGTACGTGTCTCTTTACATTATTGTTTTTTGTATTTTAATATTAATTTTGAAGTGTCATATTTTAAATGTTCTGGTGTAGTGGTTGTGCACCACTTTATATATACATAGAGGGTTCAAATTTTGCACAACTACTTTTACAAAATACATTTTATTCTAAACATAAACAATTTTTCTTTCCCCTCCAAAGACTCCAAAATGTCAATTTATTGACAGAATTGAATAGCGTCATAAAAAATAATTTAAAAAAAAAAAGGGAATGAAAACCTAACTTATTTTCTAACCTCCTTTGTTATTTAACAAATATTTTTTAAAAAATGTATTAAACTTCTATTCTCTCAACAACATTCGATGAAAGAGATATTTTATTTTAGGTAATTTTTAAATTTATATTAACATTATAAAAAATCATGTGAAAGTTGTACAAAATTGTACAGTTTTTTCTTTATTATTACAAATTGAGTGAAGGTTGATCATCAGTGAAATGTGAAATTTTTTTAGTGTATTTATTATATAGTGTCGAGTGTCCTTGTATAAAATTTCTTGATCCGTAAGGATATACTAATAGTTAGTCTTCTACGATGGTTGCTGTCAGAATTGGTGGTGGTTTCTAGATATGGTTGTTGATGGTAGCCGGTGTTGGCTGACAGAGAGCAGTATAGCCACAAAAACAAAGTAAAACTCAAAGGAATTTCCTGCTGGAGTTTATTATTATGACAAACTGGTCAATTTAGCATTATTATCTTAGCATTCCTGTTGAGGATAGCCTTAGTGATTGATTGGTATGCATTTCATCTTCTCATTTCAAACTATCAATCAATTAATTAACATTTTCTAATTAAGTTGGCTAAGCATGAGATACAAGTAAAATCAAAATAGTTACTAGCAAGAAGGAACTCCAACAAGAACCAACCCAGATTCTCCTTTATTCTCACATCGTTTACATTTACATCTAATGCAAATTTTATTTTTCATGCCATTTTAGTTCCTTACAAGCTTCAATAAACCCCCCCTCAGGTTATCTTGGAGCATGAATCGAGAATTTGAATTGAGAAAAAAAAAATTATGTGGATTCGATCTGTACTACCACTACTACAAAAGCAAAAAGGCAAGGAAAATAATACTAAAAATGTACAAACAATTGTAAAAGACAATCATCATCTTGGACCAGACAACCACATCAGGAGCAAAATGAGAAATGGAAACAATAAACTGGGGAGATCTACTCCGGTGGTAAGAATGAACCATATTATTGAAGTTTTGGCGTGAGCTCTCTTCTTTCTGCCATTCCTGCCATCTTTCAGCTAAACCATCGCCTTCCAGCATTTGCAACACCTCCGACATTTTCGGTAGTTCCACCGGGGAGCTTTGCGTGCAAAGCACAGCAGCAACTTGAATTACCTGCTCCAGTTCTTCCTCTATGTAATTACCCAGGAGATGGGGATCGTCAACCAACGTTGCCAGTTTCTTATGATTTAGAAGTCCTTTTACCTTTAGTTTTGAGTTCATATCACAGCCAACTTGTATTATGCAAGTGTTTGTTTATTACTAGGAAGATCAAAATACCACGGATCAACAAGAAAAAACATGTAAAAATGTACTGAGTAACACTAAAACTCTATCAGAGATACTGATGAGAATACAAGAGGATCAATTGAAAGTAAGCTTGTGGGAGAAATTGCAAAACAAGTCCTCAAGAATAAGAGATATTACAATCTCATTGACAAAACACACTCTCAGCAAACTGTAAAGTTCGGGAGATTTTAGAATCTTGTTGTTTCAATAGAATATCGCCATGAGGCCTTTTACACATCATCCATGTCTTTGATTGAATGAGAAAAGGCTGAGTAAGGCAGCCATATAACACCCAAATTCACAGTTATTCTAAACGTATTTGTGAGAATCCTATCTAGTTAGAGGTATATAGGTAAGATTAGTATATGATATTAATAAAATGTATTGTGGGTCATTAGGTATTCATCTGTTGGATATAAATAGTGGGAGTTGTGGACCATAGAACATGTAGATGATTTTGGTGAAAGTTTCCAATGGGAGACTTCGGGAGAGAGATATCCCTCTCGAAAGGCTACTAGTATATTGCCTTTTTTTTTTAATATTGCAATATATTGCTATCTTTTGTATTCTTTTTGTAGAGTTTTGTCGTTAGGTTGGTATTTTATTCTAAAAAGAGATTTGATTCTGTATATTAATGTAAATAAGGTTTAGTTATTTTGGGTAATGATATTAATGTAAATAAAGTAGGTTTAGTTTATTTGGGTAACTAACCTTGTATAAATACACTCCCTTTGATCAATCAAATAGATAAAGAGAATATTCTTCAGAATATACTTTCACATGGTATCAGAGCTAGGGTTTTTCATCTTAGCCGCCGCCGCCGCCGTCGCCGCACACCCGTGAAAGTCTCCGTCTCCGTCTCCGTCGCCGTCGCCGCCCACTCGCAGCCGTTCCGTCATCGTCATCCTCATCGCGCCGCAGACAGCTTCGATCTGTCTCCAGCCCTTCCGCGCGCCTTGCAGACAGCTCCGATCTGTCTCCAGCCACAACATCGGCGTCGCGCCGATCAGCAGCAGCCTCGGCATCGCAGCAGCCTCGGCATCGCAACATCGGCCGTTTCCGCTCCGATTCATCTCCATCCGCGAGGCAGCCGCATCGTTTCCGCTCCGATTCGTCTCCATCCGCGAGGCAGCCGTACGCGTCACCTTCCGATTCGTCTCCATCCGCGAGGCAGCCGTACGCGTCACCTTGCGGCCGCCGTCCAGATCTGTACCGCAAAACCTTCCGCCGCTGCCGCCATTGCTCTTCCGCCGCTGTCCAGCCGTCTCAGCCGGATCTGCCTCCGTTGGAAACCCTATCTTTCCAAGTTTCTTTGATTTCATTTTTCGCAGATCTAGCTCCGTCTTCCACTCTGTCGATGTTTCGAAGTTCGTAGATCGATAGATCTTTTGTAGATCTACACGCTGATCGTGGTTTCTTGTCGATTCGGTCTCTGGTGACAGATCTGCACGGTTCAACACCAGATCTGACGAATCCTTCAGCCGTTTGTCTTCAAGTTTTTTTTGTTGTTTGTCTCCGTTCCCAGCCGTGCACCCCTGAATCCAGTCACGGCCGAAAATCCGCATTGTTAACCCCAAATCAAAATGGAGAGAGACCATATCTTTAGACCCATTTCTACCATACTTGATGGTACAAATTATATCACATGGGCAAATCAAATGAAAAGTTTTCTTATTGGAAGAAAACTATGGCGCATTGTAACTGGCGACATCACCAAACCAACTGCAAAGGAAGGGGATAACACATTCATTGAACGTCTCGAAGATTGGGACAGTAAAAATCATCAAATTATCACCTGGCTTGGTAACACCTCTATTCCCGCTATACATGCACAGTTTGATGCCTTTCATGATGCTAAAATATTATGGGATTTTTTGTCTACACGCTTCAAGTCTATTGGTTTAGCCCATTATTACCAACTGCACAGTACACTTGTAAATCTAAATCAAGATTCTGGTCAGTCTGTTAATGAATATTTGGCAGTTCTTCAACCCATTTGGACTCAACTTGACCAAGCGAACATCAGCAAAGATCATCTTCGCCTTATTAAAGTCCTTATGGGATTACGTCCAGAATATGAATCTGTTAGAGCTGCTTTACTACACCGGAATCCCTTACCCTCATTAGATGCAGCTATTCAAGAAATTCTGTTTGAAGAAAAGCGTCTTGGCATCAACTCTACTAAACAATCTGATGTTGTCCTTGCTAGCACATACACTCCCAACAGAGCCGCAAATATGTTTTGTAAGAATTGTAAGCTCTCTGGTCACAAATTTAGTAACTGTCCTAAAATAGAGTGCAGGTACTGCCATAAACATGGCCACATTCTGGATAACTGCCCTACCAGACCACCCCGACCTCCTAGCACTTCCACAAAAGAGAAAATTTTTACCAAACATGGTTCCTCATCTGTTGTTGCTGCGACCTCGGATGATTCATCCCCATTCAGATAAGTGATCTTCAGAGCTTATTGAATCAACTAATTTCATCATCATCCGCTCTGGCTGTTTCATCAGGTAATCGATGGCTTCTTGATTCTGCCTGTTGTAATCATATGACCTCTGACTCTTCTCTTATGTCTACGTCTAGCCCTACAAAATCTTTACCTCCTATTTATGCTGCTGATGGTAATTGTATGAACATCTCTCATACTGGTACCATTGATACTCCCAGTGTACATCTTCCCATACTTACTGTGTTCCTAACCTGACCTTTAATCTAGTGTCTGTTGGTCAATTATGTGATCTGGGCTTAAATGTTTCATTTTCTCCCAATGGTTGTCAGGTTCAGGATCCGCAGACGGGACAGACGATTGGAACGGGTCGCAAAGTGGGAAGATTGTTTGAGCTCACATCACTTCGGATTTCATCTCCTTCTTCCATCTCTGCTTCGGTCACTGATTCTGACACATATCAGTGGCATCTTCGTCTTGGTCATGCTTCCTCTGAAAAACTTCGTCATTTAATTTCTGTTAACAATTTGACTAATCTTACTAAATTTGTTCCTTTTAATTGTTTGAATTGCAAACTTGCTAAACAACCTGCCTTATCTTTTTCTCAATCCATCTCTAATTGTGATAAACCTTTTGATTAGTGCATTCTGATATTTGGGGTCCTGCCCCAATTACTACTGTTCATGGTTATCGCTACTATGTTTTATTCATTGATGACTACTCTCGATTTACATGGATTTACTTTCTAAAACATCGTTCTGAATTATCTCGCACATATATTGAGTTTGCTAACATGATTCGCACTCAATTTTCCTCTCCCATCAAAATTCTTCGCACTGATAATGCTTTGGAATATAAAGATTCCATCCTTCTTTCTTTTCTTTCCCAACAGGGCACTATTGTTCAGCGCTCTTGCCCTCATACCTCTCAACAAAATGGACGTGCTGAGCGCAAACATCGTCACATTCTTGACTCAGTACGTGCCCTCCTTCTTTCTGCCTCTTGTCCAGAAAAATTCTGGGGTGAAGCTGCCCTTACATCAGTATATACAATCAATCGTCTCCCTTCCTCTGTCCTTCAAAATACCTCTCCGTTCGAAAAACTATATGGTATTTCTCCCGACTATTCTAAACTCAAAGTTTTTGGTAGTGCCTGCTTCGTTCTGTTACATCCTCATGAACACAATAAACTTGAACCACGTGCCCGTCTCTGTTGCTTCCTTGGCTATGGCACCGAACACAAAGGATTTCGTTGTTGGGACCCTCTTTCCAACCGACTCCGGATATCTCGGCATGTCACTTTTTGGGAACACACTATGTTCTCTCGTTTGTCCTCCTTCCACACCTCCTTCTCTAGTCCTCAATCTTTCTTTACAAACACATCTGTTGACCTTTTTCCTCTCTCTGAACCCACCTTGGATACTGAGCTTGCACAATCTTCACCTGCTACTGCAAATCTGGATCCACCGTCTGTCTCCGATGATGTTCCTGAATCGCCACCTGCTACTCCTCTTCGTCGCTCTACCAGGGTAAGAGAACCTCCCCCTCATCTCACTGATTACCATTGTTTTTCTACCATTGTTTCCCTTGTTGAACCCACCTCTTATCAAGAGGCCAGTATTAACCCAGTATGGCAGAAAGCAATGGATGAAGAATTACAGGCTCTTGAAAAGACGCACACTTGGGACTATGTTGATTTACCTCCCGGTAAAAGACCCATTGGTTGCAAATGGATTTACAAAATCAAAACTCACTCTGATGGAACTATTGAACGTTATAAAGCTCGGCTTGTTGCAAAAGGATACTCACAAGAATATGGGATAGACTATGAAGAAACATTTGCCCCTGTTGCCCGGATGACATCTGTTCGCAGCTTGTTAGCTGTTGCTGCTGCCAAACAGTGGCCTCTTTTTCAGATGGATGTCAAAAATGCATTTCTTCACGGCAACCTATCTGAAGAAGTGTATATGAAGCCACCTCAGGGAACTTCTCCTCCTCCCAACAAGGTGTGTCTCCTTCGTCGCGCTCTATACGGTCTAAAACAGGCTCCACGAGCTTGGTTTGCCACGTTTAGCTCCACCATTACTCAACTTGGATTTACCTCCAGCTCTCACGACAATGCCCTTTTTACACGACAGACAACTCATGGTATTGTTCTTCTCCTTCTTTATGTTGATGATATGATTATTACTGGTAATGATCAACAGGCCATATCCGACCTACAACAATATCTTGGTCAACATTTTGAGATGAAAGACCTTGGATCTCTCAATTACTTTCTCGGTCTTGAAGTCTCTCACCGTTCAGATGGTTATCTGTTATCTCAAGCGAAATATGCATCTGATCTAATAGCACGCTCAGGAATTACAGACTCCACCACATCTTCAACACCGTTAGATCCTCATGTCCATCTAACTCCGTTTGATGGTGTTCCTCTTGACGATGCAAGCTTGTATCGGCAACTTGTTGGCAGTCTTATATACCTAACAGTAACTCGCCCAGATATTGCATATGCTGTTCATATTGTCAGTCAATTTATGGCTGCTCCTCGAACAATTCATTTCACTGCTGTTCTACGCATACTTCGCTATGTCAAAGGCACCTTGGGACATGGTCTTCAATTCTCATCTCAGTCTTCCCTTGTGTTGTCGGGATATTCTGATGCTGATTGGGCGGGGGATCCTACTGATCGACGATCCACTACAGGATACTGTTTTTACTTAGGTGATTCTCTCATCTCATGGCGTAGTAAGAAAAAAGTGTTATATCTCGTTCCAGTACGGAATCTGAATATCGTGCTCTGGCTGATGCTACAGCTGAACTTATATGGCTTCGGTGGCTCCTTGCCGATATGAGTGTCCCTCAACAGGGTCCTACCCTCCTCCATTGTGACAATCGTAGTGCCATTCAGATTGCTCACAATGATGTGTTTCATGAACGTACAAAACACATTGAAAATGACTGTCACTTTGTTCGTCACCACCTCTTAAGCAACACCCTCCTCTTACGTTCTGTTTCTACTATTGAACAACCTGCGGATATCTTCACCAAAGCCTTGCCATCTAATCGATTTTGTCACTTACTTACCAAACTCAAGTTGATCGCTACTCTACCACCTTGAGTTTGAGGGAGGGTATTAATGTAAATAAGGTTTAGTTATTTTGGGTAATGATATTAATGTAAATAAAGTAGGTTTAGTTTATTTGGGTAACTAACCTTGTATAAATACACTCCCTTTGATCAATCAAATAGATAAAGAGAATATTCTTCAGAATATACTTTCACACTGTAATTAGTTGTCTTATTATGTAAGATGAATTCAATGGCCAGAGAACCATCCACTCCAATTCCCCACTCTAAGCCTGTGTCAATAAAGAGATTTGATTATGTAATTAGTTGTGTTATTAATATTGCCTTGAATGGAAGATATGAACTCTTGTTGTAACCTTCACAAATTTATTCTTTTCATCACAAATTTATTCTTTCCCTGTATGTAACTTATTCACATTTGTATATAAATATTCATTAAGGTTTGTGTATAAATATTCATTAACTTATTTTCTCCTTTGATTAAGGTTTTAGTTTGATCTCTACGGTATCAGAGCTACTCTTGTGTGCTAATAGAAAAACTTGTCTCACAAAGCTGAAAATTTTTTCTATGGCCATGGAAGAAAAAGCTCCACAATTCAATCTTTTCATCGGACAACTAGTCTCATTTCAATCAAACTCAACGATGAAAATTATCCCTTATGGAAATCCCAAGTTGCTACCATTAATTAAGAGTATGGGTCTTGAACACCATATCTCACAAAGGAAAAGAAACATGACGATGAAGCTGCCAGCGAACAAGGAAGAAAGCTCCAAAGATGATCATATACTCCCACAGTTTGGATCAGAAATGATGAGCTCCTTATCTCTTGGCTTGCTGGGGATAATCTCAGAAGATGTTTTGAACATGAAAACAACTCAACCCGCCCCTTCCCCCAATTAATTACCTCAGCTTAAAAATAATTTACTTAAGAGATTTTCGTGCTCCTTGAGTTCGACCGAGACTTTATTACCACTAGAACTACCTCTTGGTGTGTAATTAGGTAAATTATTTGATTCGGATTTGTTACGCTGATGAAATCTGCTCGATCACACCACATATATGTGGCCTTATTCTAATCTTGATTATGTTTTATCTTTTTCATGTTGAAAAATTCATGGATTGAGTTCATCTGTCTTTGAAAACTGATTTTAGAATATGTTGTTCATCCATTTTGATTTTGAATTTAATTTTTTAAACGTGGAATTGATATTCAAGTTTATAAGAATAGACTTATTTGATAATAAAATTTATAATTTTACCTTTTTAGTCTCTATGTTTTTAAAAGTAAGTTTAACCGGTCATAAAAGTATTTTTTATTTGATTTTTCAAAAATAATTATTTGATATTTAAAAATAGAAAAAAATTGTTTTAAAGAGAAAGAGCATTTTAGAAAATAACAAAATAAATTAAAATATTTACAAACTATAACAAGATTTTTGAATTCTATCAATGATAGTCCACAAATAGAAGAATATCACTGGCGATTAAATGTTTGTTATTGATAAAATATGAAGTTTTGCTATAGCTTTTAAATATTTTGTCAATTTTGTTATTCTTTACAATTCTCCTTTGGAGAATATGATTTTTAGAAATTATTCTTCTAATCTACTCATTTTTTAAAACTCAAAAACCAAATGTGTCTGCTCTTTAAAACTTGGGACTTAAAATATACTTTTTCCAAATAAATATGATTGAATGCATTGTGTTGTTTTTAAATATTGCAAAATAAATCAAAATATAAAAAATTCATTGTCTAGAGCAAACTAAGCTCTTGTTGGCTTTACTATAATATTATAAAATAAGGTTATAATATATTTAAAATTTTGACGTTAAACGTGTACGTATAGATTAATTAATAATTGGTTATAGGTAAAATAAATAAACAAATTATATGGTACTTGAACTTTGAAGAAAGTAACAATTAATTTGTGATTGAAATTTAATAAGTAAACAATTTAGTCTTTATATTTTCAAATTTGCAACACTTTGGTCATTGTACTTCATAATTAACCTAGTTTATATTCAACTTAATATTTTAATTGATACTACAACTACTTAAATATACTATCAAACACATTTAAGATACCAAGAAGAACTTACCCCAACCGTAATTAATTGATATGATCTCATCCATTTCTACGATTATTGTATTAAAAAAAATACACTGAAATGAATAGTAAAGAATAGAATTTAATAATGGAAACATGTCCCAAACACACACACTAACCCTACCAAATCTCTTACCAATTGACCACAAATAAAGGCATATATATTTACATGTTTATATACATTTGATTTTGGTCATAAATTTCAATTTTGCATTTTGTTCATTTATTAAATTTTGCAATGACCACGAACACTTATAATGCATGTAAGTTAAAGCGACACTTGCAAAAATGACAAAATAAATTATAATAATTAAGTTCATAGTACACACTTTATTATTTGTGAAAATGACAAAAACGAAATCTAGCTAACCCACATATCAATAGATAGAATGTCACAAGTGTCATCGTTACTAATATACGTTTGATACACCTTATACACGTCTAATACACTTAATACACCATTTGATAAACCTGATACACTTGATGCACCTGATACTTTTTTATACACTCGATACATTTTGATACACATTTGAAACATTACTAATATATGTTTGATATACATGATACACTGCATATACATTTAATATTCTTCACTTATACAATTGATTGATTAAATGCACCTGATATGCTTGGAGTCCTACTTATATCCTTAATATACTATTGATATACACTTGTAAATTTGGTTCATATACTTGATAATGATTCACTTTATTGATATGCTTTAACAATATATTGTTGTTATACTTGATAATGATACACAAAGTTACACAATTCATATACTTAATAATATTATAATGAACTGCATAAAATTTAAAAAAACAAAAAATTACGTAATATTATAATGAACTAATACATATAATATAAGTTCAAACTAAGACAAAGTAAAACAAAAAAAAATTAATATAAAAAAATAAAACAAAATAATTTAGAAAGAATAAATAATTTTTCTAGTATAAGAATAAACAAATAATTATGGCCTTAAAAAATGAAAGAATAAATAATTAAGGAGTAGAGCATTAAAAAAAAGAAGATAAATAAGTTATTGAATGGTAGTTTAGGAATAATAACAAATTTACGATGGAAAAGTGAGAATTTTGTCATATTTACAAAATTTTAAAATAATGTGCTATAATTGCTATATCATATTCTCAAACTGCTATCCTAATTTAAAAGACCAAAAACTTACGTAATTGATACATTTAACATATATATATATATATAATTAAAAAATAAGTTAATAAACAAAATTTAAAAATATTTATTATTAACACTATCTTCACGTTAATGAATACAAAATGATTCATTTTCTAGGATTGAATATTTTATTAAATACCGATTTACGCATGTTAATTAGATTCAAAGGCTTCCCAATATCTAGTTAATTAATGTTTGTAAAAGTTTCATTTTCTTATTAAAAAATAAATTAATTAATACAACTAAATTTGATTCATTTTTATAAAAATAATAAAACACCCAAATATTTACGATTCATGTAACAAAATAAAAAAGTCCATTAGATTGATTAATTTTTTTTAATTTTCTAGGTTTGTCCTTAGTGTTTTTTAATTTCTTCATTATTCCTAATTTCCAATTTTTTTCTTCTCATCTTTTCCAATTTTTCTTTGATATTTTTTTAATTCATTATCTTTGTTTTCAATCCATAATCAATCTTGTAATTTTTTTTAAAATTGTTATTTGGTTAAAGATGGTGTACGTAATTTCTTGAACAAAAATTGTTGAGATCAAATCAATTAGTTTAGATTTGGGTACACGATCAAGTATCAATTCTATAAAATTGTAAAAGATCTTGAAAAAAAATCTTTGGGAGGATGGTACAAAAATGGTTGGGATATACCAATATAAAAAATCTTGTTTCACGATCTTCAACCAAGACCGTTTAGATTTTAGGTATCATAATCGTTTAGATTTAGATCTATACAGTATATTTGGCATTTTTGTATTTTCATTACAAACCTATGGACTTTTTTTCGTTTTCAAAATTATTTTATACACCATAAATATTGGATCAATTTCTTATATTCTTAAAAAAAAATCCCTCCAAATTATTTATTTCTAAACTTCAAAGTGTTTGGGATATGTGCCACTAACCTTCAAATTTTAAAAAATAAAAATGAATTTTTTTTATTAAAATAATTGGTTATTTTTTATAATAATTGGAATTAAACATATGATCAATTTTCTTTTTTTTAAAAGAAAAATCTACCCTCTCTCTTCCACTTTTTTTTTCTTTTTGTTTTCTTCTCCAACACATGTGGATCTATAAATAAATAAATAAATAAATAAATAAAGCAATGAAGGAGAAGATCTTGCTATTCACAATAATAAATAATATTAATAGAATATTAAAATATAGCTTCATTATTCATATAAGGCCACAAAGAAAGAAGTCATAATTAAATGTCACATGCTATCCTATGGTAAATTTCGAAGCATTTTCATTTTTTTTACTATTATTGCATAAAAATAAAAATAAAATTGAAACGAAAATGCCAACAAACTATTCTTGCCATCACGTTACATATATATAAATATACACACATACATGTCGACTTTAGATCTAGTTTCCATTTTAGTCTCAGTTTCAAGTTTGGTTTCAATTTAGTTACTATATTTATATAATTATTTTATTTATATCATCAGTGTAAAAACATCTAAATTTTTAAAATTTTAAAAAGTGAAAAAAGCGTTCAAAGATACATTTATAGTGGTTAGTTTTGAGAAAAATCGTTAAACTGTAATTTCTCTAAACTGCAGTGCTTTTACTTGACTTTACTTTTTTGTTCGTTGTTGTTTTTTGGACTTATCTCTAGACTTTCTTTTGGATCATCATGTTAATTTCTTGCACTTCAACAAATGAATATATTTGATAGGAATGATGAGTGTGCTAAAAGCCTCTTCATCTAATGTGAATGATTGGGTGCACCAAACTGACCCAAGTTATATATTTTATTTTTAAAAAACCCTAAACTAGTGAATAATCTCATAATGATCCATGAACTAAAAACGGTAAAAAAATATTCTCATTGATTAAAAAAATTTCACTAATTTTCTCACCAACAAAACTATTCAAGTTATAACATAAAATTCAAAAAAAAAAAAATCTTACAAAATTCTAAAATAAAATTTTTATTTAAACATACTAAAACAATAACTAACTAGTTAAGCAAAATCAATGTATTTAATTATTAAAACACTCTACTTTTCAAAATACATTTAAAGTTTCAAAAGTTTCAAAAATTATCTTAAACTTAAAGAAAAAAAGTTTAAAAAAATACATTTTATGTTATACAACACAAATTGTTAATATTGTCTTACTTTTCTATTTTTTATGCATTAAAAATAAAGAGAATCAATGAAAACATCAAGAGATTTTAGAACGTGAATCTTTCTACTTGACCAACGTCTAAGTATAATATGGTAAAAATTTTAGTGTAAAAATGAATTGACGAGAGTATTCTTTCTCACTTCTTTTTAGCGCAAGGATTAATTATAAAACTTTTTAAAAATTCACAATTATTTTTAAAATAAAAGTTTAATAATTTTCCTTCACAATTCACCCTAAAAGTATTTTTGAATTCTCTCCCAAAGTTAGGAGTCGGGGTGGGTGTGGGTAATGTGTTTTGTAATGTAATGCCTAGAATTTTGAACCCAGTGGGGAGTGGGTGTTGATATTAATTCTTTCTGTCTTGTCCTCTCTCAACTTCGATGTGGATTGACAAGTCTTCTGCTACACAATGTTCTGTCATTTATATAAAAAGTCTTCATTCTAATTCCCATGTTTACATATTTCACCTCAAACTCCCTTTCCAAACACTTGTTTGGAGCTATATATATCAACTTCATTGCTATAAACATCATCATCAAGATGTTTGAAGGAAGGATAATTATAGTATTGTAATAGTTTTAGTATTATGATAGTATGCATTTGAAAGATTATGATAGTTAATATATAATGTACATCAAACCCATAAATTTATTTTATTAAATAGTTTTATTAATTGTATTCGCTCAATAAATTTATGAGTCGTAATGTTTCATGTTTTTTTTCATGTGTATATATTGCACATCGAATTAAAGAAGCTTCATGTTCATATCAATCATTTAAGTTGTGATTTTGTGTTTGCATTAATCTATTTTCTTTCATGCATACATATTTGTCATTAAATACAATATTTATTCCATAAATTTGTTATTAAATCATTATTTTTCTAAGCATATTTTTAGAGTGATTTTCAAAAGCGTTTGTCTAATTGCTTCTAATCTAAATATTGTTGTCACATGATCACAAAAAATGAGTACAAAAATCTATATTACATAAGTAAATATTTTCAAACTTTGAAAAGAGTTTAATGAGTCAACTTAAAAAAAATTAAACAAAATACAGGAAGTACTAATTTTAATTAAAAAACAATAAAATTCAAGATAATTTTCTATAATTTAGTTAGAAAAGAAAACATTTATTAGCTGAAATTAAAATATTTGGATAAGATAAGTCAAAATAATTTAATTTAAAAAAACTACTTAATTAAACATATTCAAAATAGATATTTGTTGAAGGTAAATAAAACAATATGTCATTATAATTTTTTTGTGCAAAAAAAATCTTATTATAAATTAATTTAATTGAAAAAGAAGAAAAAAAAGTGATGAATGAAAGGCTTTTTCAATTTTTTTATTTCTATGAAAAAGAGTTGGGGGATTATGGAAAAAAGTTTGTATGGGTAAAGGATTAGGGGAATTAAGATAGGAGTTTGTTCGGGGGATTATGGGTAAAATTGGGTGTTACAGAAACCCTAAAACTACAGTTTCCATAACATGAACTCCAAATAAGAAGGAGTTATCATAATCTACTCCAAATTTCGCAGATTGTGTTTATATTTTGTGAGAAAAACATGGTGTGCACGTTACCTCAATAATAATTTTCATATCATTCCAAATTCAACATCTTTACAATTATTGAGAGAATGGCTTAATTTAGTGACCATGTTTGGCTAGTGAATGGAACATAACCATGATAGGAAAATTGACAAAATATTAAAGAAAATGAGGGTAATATATAAATGTTATATGAATAATAATATGGAAAAGAATTTGAGTCCAATTAATTTTACTTTAGTGCTAATTGTCTTCCAAGATAGTACATATTCACTTTCTCATAAAAGCAGTACCTCATCAACAAGATTTAACAAACTATACTCTATGTGGAAATAAATCGAACTTGAACAAAAGTTACAAAAGGGAAAAATGTCTATAAGAGAAAGTCATTATGAATTTTTTTTTTTTTTAAAGTTTGAATAAGATGAATGTTTATAGAAACAATTCATGGTCATTTGGATAATCGTGTCACTTCTTTAAATTGGTTATTTTTTTAAAAGAGATGTGCATTCACGAAACAATGCAATTTTATGCAAATGGTCATGTACCATTTCTTTCAATAATAAATTTTGTCTTTTAGTGATTCTTTTCTATCAACAAAAATATTTTTTTTTATTTTTGAGAAAACCCATGGTTATAAATATATAAACTATGATTTTTTTAATGCATTTTAGTGTGTATTTAATCATTCTTATAAATTAACGAGTGATATTTGAGTGTATTTTGATATGCATATCTATCTTTGTACAACTTCTTCTAATCACCCACATATCACATATCAAAAAAGTAATGATTGGACAATTTGAGATGCTATCAAATAAGTGTTTGTAAATTTGATTTAAAAAAATCAAAATCTAATAGTAAACCAAAGTAGGCATGAGATTTAAGTCTAAAATAAACAATATTATGGGATTATAAAAACAGGTAAAATAAACCATTTGTTTTCCCAATGAGTTTGAGCCTATAAAAGATAACCTATACCAAACCCAAAACCCCATCCCTCATTTCTCTTCGAAACAAATCATTTCATAGAAATTAAGAAATGGGCTCTCTTACCAAAGTAAACCAAGGAAAACACCAACCACATGCAGTGTTCGTCCCATACCCATCTCAAGGCCACATAAGTCCCATGCTAAAGCTTGCAAAACTCTTTCACCACAAAGGCTTTCACATAACTTTTGTCAACACAGAATACAACCATAGACGTCTCCTTAGATCCTGAGGTCCCAACTCTTTGGATGGCTTGCCTGACTTTCATTTCAGAGCCATTCCCGACGGCCTCCCACCGTCCAACGGCAACGCCACCCAACATGTTCCTTCACTCTGCTATTCCACTTCCCGCAATTGTTTGGCTCCATTCTGTAGCCTCATTTCGGAGATCAACTCGAGTGGAACTGTTCCACCCGTTTCCTGTATTATTGGCGATGGGATCATGACTTTCACTGTTTTTGCTGCCCAGGAATTTGGTATTCCCACTGCTGCGTTTTGGACGGCTAGTGCTTGTGGGTGCCTTGGATATATGCAGTATGCTAAGCTTGTTGAACAGGGCTTGGTTCCTTTTAAAGGTATAAATTAACTTACAATCTACAGCTCTTGTTCGTTATCGTTTTTATTATTTTCTATCTGTCCTTTTTTTTCGTCCACTGATTATTTATGCTGTAGTGATTAGCACTGTGCCGCAAACATATCCTATCAATGCTTTTGAAATTTTGACATATAATTCCTATAATTACATCCGTAGTAGCTTTGAAGATATTTTTGGGTAGATATCTATTGATGTAATAATAATCACTGATGAACATTAATTTTATGTTTTTGAATTGTTTGATCTAAAAAGGAGCATATAGCTTAGCTTGGATTGGCTTTTATCTCGTTTTTAGATTTAAAAAATGATGCAATTTTACCTTATAAGTTCTAGAATTAATTTTGATTTAGTTAAAATATTTTTAAATCTTACAATTCCATCTCTCAGATTTTAATTTTGTTCTTGGTTTTCAAGATTTAAACTTCAAACTTTGGTTTTTGACTGTATATTAATGAGGTGTTAAATGTTTATTAATTAGTATTAAAAGTAATTGTGAGGTGAAAATTTTAAATTGATTTGTTCTCCAGATTTGATAATAATATTTTTTATTATATAATAATTAAAAAAATTCCATGATACATGGATTGATTTAAAAGTATAAGTTATTGATAAAAATAAATTTAATATTGTATTATAATAGAGACTTAAACACACAACCTCTATTCACCAGCGTGCCTCTGATATTATCTTTAATCGCCTGGATATAACATCTTCTGCTCACAAATTAATTTGTATTTCTTTTGTAGATGAAAACTTCATGACTAACGGGGATCTAGAAGAAACAATAGAATGGATCCCGCCAATGGAGAAAATTAGCTTGAGAGACATTCCAAGCTTTATCAGAACAACAGACAAAGACGACATAATGCTCAACTTTTTCATCGAACAATTTGAAACATTCCCCAAAGCCAATGCAATAATCATAAATACATTTGATTCCTTAGAGCACCACGTGTTGGAAGCACTTTCCTCAAAACTCCCTCCCATTTACCCAATTGGTCCAATCAACTCATTAGTTGCCGAGTTAATCAAAGACGACAAAGTCAAGGACATCCGTTCAAATCTATGGGACGAGCAATCAGAATGCATGAAATGGCTTGATTCACAACAACCCAACTCAGTAGTTTACGTCAACTTTGGAAGCGTAACTGTGATGTCTCCTCAACACTTGGTGGAATTTGCATGGGGACTTGCCAATAGTGAGAAGCCCTTCTTGTGGATCGTAAGGCCGGATCTAGTGGAAGGGGAGACGGCATTGTTGCCGGCGGAGTTTTTGGTAGAAACGAAAGAAAGAGGGATGTTGGCGGATTGGTGTAACCAAGAAGAGGTTTTGAAACATTCTTCGGTTGGAGGGTTTTTGACGCATAGTGGATGGAACTCGACGATGGAGAGTATTGTGGGAGGAGTGGCTATGATATCGTGGCCGTTCTTTGCAGAACAACAAACCAATTGTCGTTATTGTAAGACGGAGTGGGGAAATGGGTTGGAGATTGATAGCAATGTGAGGAGGGAGGATGTGGAGAAGCTTGTGAGGGAGTTGATGGAAGGAGAAAAGGGTGAAGACATGAAGAGAAATGCCAAAGAGTGGAAGAGGAAGGCTGAAGAGGCTTGTAAGATTGGTGGGTCATCGCCAACAAATTTAGATCGAGTCATTTCTGAAATTCTTTCCAGCAAAGAAAAATCAAACCTCAACTCTCAAAATTAA

mRNA sequence

ATGGGTTCTCTTACCAAAGTAGACCAAGGAAAACAGCAACCTCATGCAGTGCTCTTCCCATACCCATCTCAAGGCCACATAAGTCCCATGCTAAAGCTTGCAAAACTCTTTCACCACAAAGGCTTTCACATAACGTTTGTCAATACAGAATACAACCATAGACGTCTCCTCAGATCCCGAGGTCCCAACTCTTTGGATGGCTTGCCTGACTTTCATTTCAGAGCCATTCCCGACGGCCTCCCACCGTCCGACGGCAACTCCACCCAACATATTCCTTCACTTTGCTATTCCGCTTCCCGCAATTGTTTGGCTCCACTCTGTAGTCTCATTTCGGAGATCAACTCGAGTGGCACCGTTCCACCTGTTTCCTGTATTATTGGCGATGGGGTCATGACTTTCACTGTTTTTGCTGCCCAGAAATTTGGTATTCCCATTGCTTCGTTTTGGACGGCTAGTGCTTGTGGGTGCCTTGGATATATGCAGTATGCTAAGCTTGTTGAACAGGGCTTGGTTCCTTTCAAAGATGAAAACTTCATGACTAACGGGGATCTAGAAGAAACAATAGAATGGATCCCGCCAATGGAGAAAATTAGCTTGAGAGACATTCCAAGCTTTATCAGAACAACAGACAAAGACGACATAATGCTCAACTTTTTCATCGAACAATTTGAAACATTCCCCAAAGCCAATGCAATAATCATAAATACATTTGATTCCTTAGAGCACCACGTGTTGGAAGCACTTTCCTCAAAACTCCCTCCCATTTACCCAATTGGTCCAATCAACTCATTAGTTGCCGAGTTAATCAAAGACGACAAAGTCAAGGACATCCGTTCAAATCTATGGGACGAGCAATCAGAATGCATGAAATGGCTTGATTCACAACAACCCAACTCAGTAGTTTACGTCAACTTTGGAAGCGTAACTGTGATGTCTCCTCAACACTTGGTGGAATTTGCATGGGGACTTGCCAATAGTGAGAAGCCCTTCTTGTGGATCGTAAGGCCGGATCTAGTGGAAGGGGAGACGGCATTGTTGCCGGCGGAGTTTTTGGTAGAAACGAAAGAAAGAGGGATGTTGGCGGATTGGTGTAACCAAGAAGAGGTTTTGAAACATTCTTCGGTTGGAGGGTTTTTGACGCATAGTGGATGGAACTCGACGATGGAGAGTATTGTGGGAGGAGTGGCTATGATATCGTGGCCGTTCTTTGCAGAACAACAAACCAATTGTCGTTATTGTAAGACGGAGTGGGGAAATGGGTTGGAGATTGATAGCAATGTGAGGAGGGAGGATGTGGAGAAGCTTGTGAGGGAGTTGATGGAAGGAGAAAAGGGTGAAGACATGAAGAGAAATGCCAAAGAGTGGAAGAGGAAGGCTGAAGAGGCTTGTAAGATTGGTGGGTCATCGCCAACAAATTTAGATCGAGTCATTTCTGAAATTCTTTCCAGCAAAGAAAAATCAAACCTCAACTCTCAAAATTAA

Coding sequence (CDS)

ATGGGTTCTCTTACCAAAGTAGACCAAGGAAAACAGCAACCTCATGCAGTGCTCTTCCCATACCCATCTCAAGGCCACATAAGTCCCATGCTAAAGCTTGCAAAACTCTTTCACCACAAAGGCTTTCACATAACGTTTGTCAATACAGAATACAACCATAGACGTCTCCTCAGATCCCGAGGTCCCAACTCTTTGGATGGCTTGCCTGACTTTCATTTCAGAGCCATTCCCGACGGCCTCCCACCGTCCGACGGCAACTCCACCCAACATATTCCTTCACTTTGCTATTCCGCTTCCCGCAATTGTTTGGCTCCACTCTGTAGTCTCATTTCGGAGATCAACTCGAGTGGCACCGTTCCACCTGTTTCCTGTATTATTGGCGATGGGGTCATGACTTTCACTGTTTTTGCTGCCCAGAAATTTGGTATTCCCATTGCTTCGTTTTGGACGGCTAGTGCTTGTGGGTGCCTTGGATATATGCAGTATGCTAAGCTTGTTGAACAGGGCTTGGTTCCTTTCAAAGATGAAAACTTCATGACTAACGGGGATCTAGAAGAAACAATAGAATGGATCCCGCCAATGGAGAAAATTAGCTTGAGAGACATTCCAAGCTTTATCAGAACAACAGACAAAGACGACATAATGCTCAACTTTTTCATCGAACAATTTGAAACATTCCCCAAAGCCAATGCAATAATCATAAATACATTTGATTCCTTAGAGCACCACGTGTTGGAAGCACTTTCCTCAAAACTCCCTCCCATTTACCCAATTGGTCCAATCAACTCATTAGTTGCCGAGTTAATCAAAGACGACAAAGTCAAGGACATCCGTTCAAATCTATGGGACGAGCAATCAGAATGCATGAAATGGCTTGATTCACAACAACCCAACTCAGTAGTTTACGTCAACTTTGGAAGCGTAACTGTGATGTCTCCTCAACACTTGGTGGAATTTGCATGGGGACTTGCCAATAGTGAGAAGCCCTTCTTGTGGATCGTAAGGCCGGATCTAGTGGAAGGGGAGACGGCATTGTTGCCGGCGGAGTTTTTGGTAGAAACGAAAGAAAGAGGGATGTTGGCGGATTGGTGTAACCAAGAAGAGGTTTTGAAACATTCTTCGGTTGGAGGGTTTTTGACGCATAGTGGATGGAACTCGACGATGGAGAGTATTGTGGGAGGAGTGGCTATGATATCGTGGCCGTTCTTTGCAGAACAACAAACCAATTGTCGTTATTGTAAGACGGAGTGGGGAAATGGGTTGGAGATTGATAGCAATGTGAGGAGGGAGGATGTGGAGAAGCTTGTGAGGGAGTTGATGGAAGGAGAAAAGGGTGAAGACATGAAGAGAAATGCCAAAGAGTGGAAGAGGAAGGCTGAAGAGGCTTGTAAGATTGGTGGGTCATCGCCAACAAATTTAGATCGAGTCATTTCTGAAATTCTTTCCAGCAAAGAAAAATCAAACCTCAACTCTCAAAATTAA

Protein sequence

MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQHIPSLCYSASRNCLAPLCSLISEINSSGTVPPVSCIIGDGVMTFTVFAAQKFGIPIASFWTASACGCLGYMQYAKLVEQGLVPFKDENFMTNGDLEETIEWIPPMEKISLRDIPSFIRTTDKDDIMLNFFIEQFETFPKANAIIINTFDSLEHHVLEALSSKLPPIYPIGPINSLVAELIKDDKVKDIRSNLWDEQSECMKWLDSQQPNSVVYVNFGSVTVMSPQHLVEFAWGLANSEKPFLWIVRPDLVEGETALLPAEFLVETKERGMLADWCNQEEVLKHSSVGGFLTHSGWNSTMESIVGGVAMISWPFFAEQQTNCRYCKTEWGNGLEIDSNVRREDVEKLVRELMEGEKGEDMKRNAKEWKRKAEEACKIGGSSPTNLDRVISEILSSKEKSNLNSQN
BLAST of CsaV3_7G006280 vs. NCBI nr
Match: XP_011658777.1 (PREDICTED: 7-deoxyloganetin glucosyltransferase-like [Cucumis sativus] >KGN43695.1 hypothetical protein Csa_7G059660 [Cucumis sativus])

HSP 1 Score: 995.0 bits (2571), Expect = 9.3e-287
Identity = 481/493 (97.57%), Postives = 487/493 (98.78%), Query Frame = 0

Query: 1   MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSR 60
           MGSLTKV+QGK QPHAV  PYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSR
Sbjct: 1   MGSLTKVNQGKHQPHAVFVPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSR 60

Query: 61  GPNSLDGLPDFHFRAIPDGLPPSDGNSTQHIPSLCYSASRNCLAPLCSLISEINSSGTVP 120
           GPNSLDGLPDFHFRAIPDGLPPSDGNSTQH+PSLCYS SRNCLAP CSLISEINSSGTVP
Sbjct: 61  GPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSTSRNCLAPFCSLISEINSSGTVP 120

Query: 121 PVSCIIGDGVMTFTVFAAQKFGIPIASFWTASACGCLGYMQYAKLVEQGLVPFKDENFMT 180
           PVSCIIGDG+MTFTVFAAQ+FGIP A+FWTASACGCLGYMQYAKLVEQGLVPFKDENFMT
Sbjct: 121 PVSCIIGDGIMTFTVFAAQEFGIPTAAFWTASACGCLGYMQYAKLVEQGLVPFKDENFMT 180

Query: 181 NGDLEETIEWIPPMEKISLRDIPSFIRTTDKDDIMLNFFIEQFETFPKANAIIINTFDSL 240
           NGDLEETIEWIPPMEKISLRDIPSFIRTTDKDDIMLNFFIEQFETFPKANAIIINTFDSL
Sbjct: 181 NGDLEETIEWIPPMEKISLRDIPSFIRTTDKDDIMLNFFIEQFETFPKANAIIINTFDSL 240

Query: 241 EHHVLEALSSKLPPIYPIGPINSLVAELIKDDKVKDIRSNLWDEQSECMKWLDSQQPNSV 300
           EHHVLEALSSKLPPIYPIGPINSLVAELIKDDKVKDIRSNLWDEQSECMKWLDSQQPNSV
Sbjct: 241 EHHVLEALSSKLPPIYPIGPINSLVAELIKDDKVKDIRSNLWDEQSECMKWLDSQQPNSV 300

Query: 301 VYVNFGSVTVMSPQHLVEFAWGLANSEKPFLWIVRPDLVEGETALLPAEFLVETKERGML 360
           VYVNFGSVTVMSPQHLVEFAWGLANSEKPFLWIVRPDLVEGETALLPAEFLVETKERGML
Sbjct: 301 VYVNFGSVTVMSPQHLVEFAWGLANSEKPFLWIVRPDLVEGETALLPAEFLVETKERGML 360

Query: 361 ADWCNQEEVLKHSSVGGFLTHSGWNSTMESIVGGVAMISWPFFAEQQTNCRYCKTEWGNG 420
           ADWCNQEEVLKHSSVGGFLTHSGWNSTMESIVGGVAMISWPFFAEQQTNCRYCKTEWGNG
Sbjct: 361 ADWCNQEEVLKHSSVGGFLTHSGWNSTMESIVGGVAMISWPFFAEQQTNCRYCKTEWGNG 420

Query: 421 LEIDSNVRREDVEKLVRELMEGEKGEDMKRNAKEWKRKAEEACKIGGSSPTNLDRVISEI 480
           LEIDSNVRREDVEK+VRELMEGEKGEDMKRNAKEWKRKAEEACKIGGSSPTNLDRVISEI
Sbjct: 421 LEIDSNVRREDVEKVVRELMEGEKGEDMKRNAKEWKRKAEEACKIGGSSPTNLDRVISEI 480

Query: 481 LSSKEKSNLNSQN 494
           LSSKEKSNLNSQN
Sbjct: 481 LSSKEKSNLNSQN 493

BLAST of CsaV3_7G006280 vs. NCBI nr
Match: XP_008455233.1 (PREDICTED: 7-deoxyloganetin glucosyltransferase-like [Cucumis melo])

HSP 1 Score: 953.7 bits (2464), Expect = 2.4e-274
Identity = 462/494 (93.52%), Postives = 474/494 (95.95%), Query Frame = 0

Query: 1   MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSR 60
           MGSL+KVDQ  QQPHAV  PYPSQGHISPMLKLAKLFHHKGFH+TFVNTEYNHRRLLRSR
Sbjct: 1   MGSLSKVDQENQQPHAVFVPYPSQGHISPMLKLAKLFHHKGFHVTFVNTEYNHRRLLRSR 60

Query: 61  GPNSLDGLPDFHFRAIPDGLPPSDGNSTQHIPSLCYSASRNCLAPLCSLISEINSSG-TV 120
           GPNSLDGLPDF FRAIPDGLPPSDGN+TQH+PSLCYS SRNCLAPLCSLISEINSSG TV
Sbjct: 61  GPNSLDGLPDFQFRAIPDGLPPSDGNATQHVPSLCYSTSRNCLAPLCSLISEINSSGSTV 120

Query: 121 PPVSCIIGDGVMTFTVFAAQKFGIPIASFWTASACGCLGYMQYAKLVEQGLVPFKDENFM 180
           PPVSCIIGDGVMTFTVFAAQKFGIPIASFWTASACGCLGYMQYAKLVEQ ++PFKDENFM
Sbjct: 121 PPVSCIIGDGVMTFTVFAAQKFGIPIASFWTASACGCLGYMQYAKLVEQQMIPFKDENFM 180

Query: 181 TNGDLEETIEWIPPMEKISLRDIPSFIRTTDKDDIMLNFFIEQFETFPKANAIIINTFDS 240
           +NGDLEETIEWIPPMEKI LRDIPSFIRTTDKDDIMLNFFI+Q ET PKANAIIINTFDS
Sbjct: 181 SNGDLEETIEWIPPMEKIRLRDIPSFIRTTDKDDIMLNFFIQQLETLPKANAIIINTFDS 240

Query: 241 LEHHVLEALSSKLPPIYPIGPINSLVAELIKDDKVKDIRSNLWDEQSECMKWLDSQQPNS 300
           LEHHVLEALSSKLPPIYPIGPINSLV ELIKDDKVK IRSNLWDEQSECMKWLDSQQPNS
Sbjct: 241 LEHHVLEALSSKLPPIYPIGPINSLVTELIKDDKVKGIRSNLWDEQSECMKWLDSQQPNS 300

Query: 301 VVYVNFGSVTVMSPQHLVEFAWGLANSEKPFLWIVRPDLVEGETALLPAEFLVETKERGM 360
           VVYVNFGS+TVMSPQHLVEFAWGLANSEKPFLWIVRPDLVEGETALLPAEF+ ETKERGM
Sbjct: 301 VVYVNFGSITVMSPQHLVEFAWGLANSEKPFLWIVRPDLVEGETALLPAEFVAETKERGM 360

Query: 361 LADWCNQEEVLKHSSVGGFLTHSGWNSTMESIVGGVAMISWPFFAEQQTNCRYCKTEWGN 420
           L DWCNQEEVLKH SVGGFLTHSGWNSTMESI GGVAMISWPFFAEQQTNCRYCKTEWGN
Sbjct: 361 LGDWCNQEEVLKHPSVGGFLTHSGWNSTMESIAGGVAMISWPFFAEQQTNCRYCKTEWGN 420

Query: 421 GLEIDSNVRREDVEKLVRELMEGEKGEDMKRNAKEWKRKAEEACKIGGSSPTNLDRVISE 480
           GLEIDSNVRREDVEKLVRELMEGEKGEDMKRNAKEWKRKAEEACKIGGSS TNLD+VISE
Sbjct: 421 GLEIDSNVRREDVEKLVRELMEGEKGEDMKRNAKEWKRKAEEACKIGGSSLTNLDQVISE 480

Query: 481 ILSSKEKSNLNSQN 494
           IL SK+KSNL SQN
Sbjct: 481 ILLSKDKSNLKSQN 494

BLAST of CsaV3_7G006280 vs. NCBI nr
Match: XP_023529848.1 (7-deoxyloganetin glucosyltransferase-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 815.1 bits (2104), Expect = 1.3e-232
Identity = 388/490 (79.18%), Postives = 444/490 (90.61%), Query Frame = 0

Query: 1   MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSR 60
           MGSL K     ++PHA+  PYP+QGHISPMLKLAKL H+KGFHITFVNTEYNH+R L++R
Sbjct: 1   MGSLYKA----EKPHAIFIPYPAQGHISPMLKLAKLLHNKGFHITFVNTEYNHKRHLKAR 60

Query: 61  GPNSLDGLPDFHFRAIPDGLPPSD-GNSTQHIPSLCYSASRNCLAPLCSLISEINSSGTV 120
           GPNSLDGLPDF FR IPDGLP S+  ++TQ +PSLCYS SRNCLAPLCSLISEINSS  +
Sbjct: 61  GPNSLDGLPDFQFRTIPDGLPTSEVKDATQDVPSLCYSTSRNCLAPLCSLISEINSS-HM 120

Query: 121 PPVSCIIGDGVMTFTVFAAQKFGIPIASFWTASACGCLGYMQYAKLVEQGLVPFKDENFM 180
           PPVSC++GDGVMTFTV AA++FGIPIA FWTASACGCLGYMQY KLVEQG+VP KD+NFM
Sbjct: 121 PPVSCVVGDGVMTFTVLAARQFGIPIAIFWTASACGCLGYMQYRKLVEQGIVPLKDKNFM 180

Query: 181 TNGDLEETIEWIPPMEKISLRDIPSFIRTTDKDDIMLNFFIEQFETFPKANAIIINTFDS 240
            NGDLE TIEWIPPME+I LRDIPSFIRTTD++DIMLNFFIEQ +  PKA+AIIINTFDS
Sbjct: 181 INGDLETTIEWIPPMERIRLRDIPSFIRTTDENDIMLNFFIEQCDALPKADAIIINTFDS 240

Query: 241 LEHHVLEALSSKLPPIYPIGPINSLVAELIKDDKVKDIRSNLWDEQSECMKWLDSQQPNS 300
           LEHHVLE+LSSKLPPIYPIGPINSLV+ELI+D+KVKDIRSNLW+EQSEC++WL+S++PNS
Sbjct: 241 LEHHVLESLSSKLPPIYPIGPINSLVSELIEDEKVKDIRSNLWNEQSECVEWLNSKEPNS 300

Query: 301 VVYVNFGSVTVMSPQHLVEFAWGLANSEKPFLWIVRPDLVEGETALLPAEFLVETKERGM 360
           V+YVNFGS+TVM+ + L+EFAWGLANS+KPFLWI RPDLVEG++A+LP EFL ET+ERGM
Sbjct: 301 VLYVNFGSITVMNEEQLIEFAWGLANSKKPFLWITRPDLVEGKSAMLPLEFLEETRERGM 360

Query: 361 LADWCNQEEVLKHSSVGGFLTHSGWNSTMESIVGGVAMISWPFFAEQQTNCRYCKTEWGN 420
           LA WCNQEEVLKH S+G FLTHSGWNSTMESIV GV MISWPFFAEQQTNCRYC TEWGN
Sbjct: 361 LAHWCNQEEVLKHGSIGAFLTHSGWNSTMESIVSGVPMISWPFFAEQQTNCRYCNTEWGN 420

Query: 421 GLEIDSNVRREDVEKLVRELMEGEKGEDMKRNAKEWKRKAEEACKIGGSSPTNLDRVISE 480
           G+EID+NV+R++VEKLVRELMEGEKGE MK+NAKEWKRKAEEACKIGGSS T+LDRVI+E
Sbjct: 421 GMEIDNNVKRDEVEKLVRELMEGEKGETMKKNAKEWKRKAEEACKIGGSSCTHLDRVINE 480

Query: 481 ILSSKEKSNL 490
           +L S +  N+
Sbjct: 481 VLLSNKMINV 485

BLAST of CsaV3_7G006280 vs. NCBI nr
Match: XP_022927659.1 (7-deoxyloganetin glucosyltransferase-like [Cucurbita moschata])

HSP 1 Score: 814.3 bits (2102), Expect = 2.3e-232
Identity = 388/489 (79.35%), Postives = 442/489 (90.39%), Query Frame = 0

Query: 1   MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSR 60
           MGSL K     ++PHA+  PYP+QGHISPMLKLAKL H+KGFHITFVNTEYNH+R L++R
Sbjct: 1   MGSLYKA----EKPHAIFIPYPAQGHISPMLKLAKLLHNKGFHITFVNTEYNHKRHLKAR 60

Query: 61  GPNSLDGLPDFHFRAIPDGLPPSD-GNSTQHIPSLCYSASRNCLAPLCSLISEINSSGTV 120
           GPNSLDGLPDF FR IPDGLPPS+  ++TQ +PSLCYS SRNCLAPLCSLISEINSS  +
Sbjct: 61  GPNSLDGLPDFQFRTIPDGLPPSEVKDATQDVPSLCYSTSRNCLAPLCSLISEINSS-HM 120

Query: 121 PPVSCIIGDGVMTFTVFAAQKFGIPIASFWTASACGCLGYMQYAKLVEQGLVPFKDENFM 180
           PPVSC++GDGVMTFTV AA++FGIPIA FWTASACGCLGYMQY KLVEQG+VP KD+NFM
Sbjct: 121 PPVSCVVGDGVMTFTVLAARQFGIPIAIFWTASACGCLGYMQYRKLVEQGIVPLKDKNFM 180

Query: 181 TNGDLEETIEWIPPMEKISLRDIPSFIRTTDKDDIMLNFFIEQFETFPKANAIIINTFDS 240
           TNGDLE TIEWIPPME+I LRDIPSFIRTTD++DIMLNFFIEQ +  PKA+AIIINTFDS
Sbjct: 181 TNGDLETTIEWIPPMERIRLRDIPSFIRTTDENDIMLNFFIEQCDALPKADAIIINTFDS 240

Query: 241 LEHHVLEALSSKLPPIYPIGPINSLVAELIKDDKVKDIRSNLWDEQSECMKWLDSQQPNS 300
           LEHHVLE+LSSKLPPI+PIGPINSLV ELI+D+KVKDIRSNLW+EQSEC++WL+S++PNS
Sbjct: 241 LEHHVLESLSSKLPPIFPIGPINSLVFELIEDEKVKDIRSNLWNEQSECVEWLNSKEPNS 300

Query: 301 VVYVNFGSVTVMSPQHLVEFAWGLANSEKPFLWIVRPDLVEGETALLPAEFLVETKERGM 360
           V+YVNFGS+TVM+ + L+EFAWGLANS+KPFLWI RPDLVEG++A+LP EFL ET+ERGM
Sbjct: 301 VLYVNFGSITVMNEEQLIEFAWGLANSKKPFLWITRPDLVEGKSAMLPIEFLEETRERGM 360

Query: 361 LADWCNQEEVLKHSSVGGFLTHSGWNSTMESIVGGVAMISWPFFAEQQTNCRYCKTEWGN 420
           LA+WCNQEEVLKH S+G FLTHSGWNSTMESIV GV MISWPFFAEQQTNCRYC TEWGN
Sbjct: 361 LANWCNQEEVLKHGSIGAFLTHSGWNSTMESIVSGVPMISWPFFAEQQTNCRYCNTEWGN 420

Query: 421 GLEIDSNVRREDVEKLVRELMEGEKGEDMKRNAKEWKRKAEEACKIGGSSPTNLDRVISE 480
           G+EID+NV+R++VEKLVRELMEGEKGE MK NA EWKRKAEEACKIGGSS TNL RVI+E
Sbjct: 421 GMEIDNNVKRDEVEKLVRELMEGEKGETMKNNATEWKRKAEEACKIGGSSCTNLGRVINE 480

Query: 481 ILSSKEKSN 489
           +L S +  N
Sbjct: 481 VLLSNKMIN 484

BLAST of CsaV3_7G006280 vs. NCBI nr
Match: XP_022988757.1 (7-deoxyloganetin glucosyltransferase-like [Cucurbita maxima])

HSP 1 Score: 811.6 bits (2095), Expect = 1.5e-231
Identity = 382/473 (80.76%), Postives = 437/473 (92.39%), Query Frame = 0

Query: 12  QQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDF 71
           ++PHA+  PYP+QGHISPMLKLAKL H+KGFHITFVNTEYNH+R L++RGPNSLDGLPDF
Sbjct: 8   EKPHAIFIPYPAQGHISPMLKLAKLLHNKGFHITFVNTEYNHKRHLKARGPNSLDGLPDF 67

Query: 72  HFRAIPDGLPPSD-GNSTQHIPSLCYSASRNCLAPLCSLISEINSSGTVPPVSCIIGDGV 131
            FR IPDGLP S+  ++TQ +PSLCYS SRNCLAPLCSLISEINSS  +PPVSC++GDGV
Sbjct: 68  QFRTIPDGLPTSEVKDATQDVPSLCYSTSRNCLAPLCSLISEINSS-HMPPVSCVVGDGV 127

Query: 132 MTFTVFAAQKFGIPIASFWTASACGCLGYMQYAKLVEQGLVPFKDENFMTNGDLEETIEW 191
           MTFT+ AA++FGIPIA FWTASACGCLGYMQY KLVEQG+VPFKD+NFMTNGDLE TIEW
Sbjct: 128 MTFTILAARQFGIPIAIFWTASACGCLGYMQYRKLVEQGVVPFKDKNFMTNGDLETTIEW 187

Query: 192 IPPMEKISLRDIPSFIRTTDKDDIMLNFFIEQFETFPKANAIIINTFDSLEHHVLEALSS 251
           IPPME+I LRDIPSFIRTTD++DIMLNFFI+Q +  PKA+AIIINTFDSLEHHVLE+LSS
Sbjct: 188 IPPMERIRLRDIPSFIRTTDENDIMLNFFIQQCDALPKASAIIINTFDSLEHHVLESLSS 247

Query: 252 KLPPIYPIGPINSLVAELIKDDKVKDIRSNLWDEQSECMKWLDSQQPNSVVYVNFGSVTV 311
           KLPPIYPIGPINSLV+ELI D KVK+IRSNLW+EQSEC++WL+S++PNSV+YVNFGS+TV
Sbjct: 248 KLPPIYPIGPINSLVSELIDDKKVKEIRSNLWNEQSECVEWLNSKEPNSVLYVNFGSITV 307

Query: 312 MSPQHLVEFAWGLANSEKPFLWIVRPDLVEGETALLPAEFLVETKERGMLADWCNQEEVL 371
           M+ + L+EFAWGLANS+KPFLWI RPDLVEG++A+LP EFL ET+ERGMLA+WCNQEEVL
Sbjct: 308 MNEEQLIEFAWGLANSKKPFLWITRPDLVEGKSAMLPIEFLEETRERGMLANWCNQEEVL 367

Query: 372 KHSSVGGFLTHSGWNSTMESIVGGVAMISWPFFAEQQTNCRYCKTEWGNGLEIDSNVRRE 431
           KH S+G FLTHSGWNSTMESIV GV MISWPFFAEQQTNCRYC TEWGNG+EID+NV+R+
Sbjct: 368 KHGSIGAFLTHSGWNSTMESIVSGVPMISWPFFAEQQTNCRYCNTEWGNGMEIDNNVKRD 427

Query: 432 DVEKLVRELMEGEKGEDMKRNAKEWKRKAEEACKIGGSSPTNLDRVISEILSS 484
           +VEKLVRELMEGEKGE MK+NAKEWKRKAEEACKIGGSS TNLDRVI+E+L S
Sbjct: 428 EVEKLVRELMEGEKGEIMKKNAKEWKRKAEEACKIGGSSCTNLDRVINEVLLS 479

BLAST of CsaV3_7G006280 vs. TAIR10
Match: AT1G22370.2 (UDP-glucosyl transferase 85A5)

HSP 1 Score: 577.8 bits (1488), Expect = 6.4e-165
Identity = 267/472 (56.57%), Postives = 357/472 (75.64%), Query Frame = 0

Query: 12  QQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDF 71
           Q+PH V  P+P+QGHI+PMLK+AKL + +GFH+TFVNT YNH RL+RSRGPNSLDGLP F
Sbjct: 10  QKPHVVCIPFPAQGHINPMLKVAKLLYARGFHVTFVNTNYNHNRLIRSRGPNSLDGLPSF 69

Query: 72  HFRAIPDGLPPSDGNSTQHIPSLCYSASRNCLAPLCSLISEINSSGTVPPVSCIIGDGVM 131
            F +IPDGLP  + +  Q +P+LC S  +NCLAP   L+  IN++  VPPVSCI+ DGVM
Sbjct: 70  RFESIPDGLPEENKDVMQDVPTLCESTMKNCLAPFKELLRRINTTKDVPPVSCIVSDGVM 129

Query: 132 TFTVFAAQKFGIPIASFWTASACGCLGYMQYAKLVEQGLVPFKDENFMTNGDLEETIEWI 191
           +FT+ AA++ G+P   FWT SACG L Y+ + + +E+GL P KDE+      L+  I WI
Sbjct: 130 SFTLDAAEELGVPDVLFWTPSACGFLAYLHFYRFIEKGLSPIKDES-----SLDTKINWI 189

Query: 192 PPMEKISLRDIPSFIRTTDKDDIMLNFFIEQFETFPKANAIIINTFDSLEHHVLEALSSK 251
           P M+ + L+DIPSFIR T+ +DIMLNFF+ + +   +A+AII+NTFDSLEH V+ ++ S 
Sbjct: 190 PSMKNLGLKDIPSFIRATNTEDIMLNFFVHEADRAKRASAIILNTFDSLEHDVVRSIQSI 249

Query: 252 LPPIYPIGPINSLV-AELIKDDKVKDIRSNLWDEQSECMKWLDSQQPNSVVYVNFGSVTV 311
           +P +Y IGP++  V  ++ ++  +  I +N+W E+ EC+ WLD++ PNSVVYVNFGS+TV
Sbjct: 250 IPQVYTIGPLHLFVNRDIDEESDIGQIGTNMWREEMECLDWLDTKSPNSVVYVNFGSITV 309

Query: 312 MSPQHLVEFAWGLANSEKPFLWIVRPDLVEGETALLPAEFLVETKERGMLADWCNQEEVL 371
           MS + LVEFAWGLA ++K FLW++RPDLV G+  +LP +FL+ET  R MLA WC QE+VL
Sbjct: 310 MSAKQLVEFAWGLAATKKDFLWVIRPDLVAGDVPMLPPDFLIETANRRMLASWCPQEKVL 369

Query: 372 KHSSVGGFLTHSGWNSTMESIVGGVAMISWPFFAEQQTNCRYCKTEWGNGLEIDSNVRRE 431
            H +VGGFLTHSGWNST+ES+ GGV M+ WPFFAEQQTNC+YC  EW  G+EI  +VRRE
Sbjct: 370 SHPAVGGFLTHSGWNSTLESLSGGVPMVCWPFFAEQQTNCKYCCDEWEVGMEIGGDVRRE 429

Query: 432 DVEKLVRELMEGEKGEDMKRNAKEWKRKAEEACK-IGGSSPTNLDRVISEIL 482
           +VE+LVRELM+G+KG+ M++ A+EW+R AEEA K I GSS  N   V+ ++L
Sbjct: 430 EVEELVRELMDGDKGKKMRQKAEEWQRLAEEATKPIYGSSELNFQMVVDKVL 476

BLAST of CsaV3_7G006280 vs. TAIR10
Match: AT1G22340.1 (UDP-glucosyl transferase 85A7)

HSP 1 Score: 571.2 bits (1471), Expect = 6.0e-163
Identity = 265/472 (56.14%), Postives = 354/472 (75.00%), Query Frame = 0

Query: 12  QQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDF 71
           Q+PH V  PYP+QGHI+PMLK+AKL + KGFH+TFVNT YNH RLLRSRGPN+LDG P F
Sbjct: 10  QKPHVVCVPYPAQGHINPMLKVAKLLYAKGFHVTFVNTLYNHNRLLRSRGPNALDGFPSF 69

Query: 72  HFRAIPDGLPPSDGNSTQHIPSLCYSASRNCLAPLCSLISEINSSGTVPPVSCIIGDGVM 131
            F +IPDGLP +DG+ TQH P++C S  +NCLAP   ++  IN    VPPVSCI+ DGVM
Sbjct: 70  RFESIPDGLPETDGDRTQHTPTVCMSIEKNCLAPFKEILRRINDKDDVPPVSCIVSDGVM 129

Query: 132 TFTVFAAQKFGIPIASFWTASACGCLGYMQYAKLVEQGLVPFKDENFMTNGDLEETIEWI 191
           +FT+ AA++ G+P   FWT SACG +  + +   +E+GL PFKDE++M+   L+  I+WI
Sbjct: 130 SFTLDAAEELGVPEVIFWTNSACGFMTILHFYLFIEKGLSPFKDESYMSKEHLDTVIDWI 189

Query: 192 PPMEKISLRDIPSFIRTTDKDDIMLNFFIEQFETFPKANAIIINTFDSLEHHVLEALSSK 251
           P M+ + L+DIPS+IRTT+ D+IMLNF I + E   +A+AII+NTFD LEH V++++ S 
Sbjct: 190 PSMKNLRLKDIPSYIRTTNPDNIMLNFLIREVERSKRASAIILNTFDELEHDVIQSMQSI 249

Query: 252 LPPIYPIGPINSLVAELIKD-DKVKDIRSNLWDEQSECMKWLDSQQPNSVVYVNFGSVTV 311
           LPP+Y IGP++ LV E I +  ++  +  NLW E+ EC+ WLD++ PNSV++VNFG +TV
Sbjct: 250 LPPVYSIGPLHLLVKEEINEASEIGQMGLNLWREEMECLDWLDTKTPNSVLFVNFGCITV 309

Query: 312 MSPQHLVEFAWGLANSEKPFLWIVRPDLVEGET-ALLPAEFLVETKERGMLADWCNQEEV 371
           MS + L EFAWGLA S K FLW++RP+LV GE   +LP EFL ET +R MLA WC QE+V
Sbjct: 310 MSAKQLEEFAWGLAASRKEFLWVIRPNLVVGEAMVVLPQEFLAETIDRRMLASWCPQEKV 369

Query: 372 LKHSSVGGFLTHSGWNSTMESIVGGVAMISWPFFAEQQTNCRYCKTEWGNGLEIDSNVRR 431
           L H ++GGFLTH GWNST+ES+ GGV MI WP F+EQ TNC++C  EWG G+EI  +V+R
Sbjct: 370 LSHPAIGGFLTHCGWNSTLESLAGGVPMICWPCFSEQPTNCKFCCDEWGVGIEIGKDVKR 429

Query: 432 EDVEKLVRELMEGEKGEDMKRNAKEWKRKAEEACKI-GGSSPTNLDRVISEI 481
           E+VE +VRELM+GEKG+ ++  A+EW+R AEEA +   GSS  NL+ +I ++
Sbjct: 430 EEVETVVRELMDGEKGKKLREKAEEWRRLAEEATRYKHGSSVMNLETLIHKV 481

BLAST of CsaV3_7G006280 vs. TAIR10
Match: AT1G22360.1 (UDP-glucosyl transferase 85A2)

HSP 1 Score: 563.1 bits (1450), Expect = 1.6e-160
Identity = 261/472 (55.30%), Postives = 356/472 (75.42%), Query Frame = 0

Query: 12  QQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDF 71
           Q+ H V  PYP+QGHI+PM+K+AKL + KGFHITFVNT YNH RLLRSRGPN++DGLP F
Sbjct: 7   QKQHVVCVPYPAQGHINPMMKVAKLLYAKGFHITFVNTVYNHNRLLRSRGPNAVDGLPSF 66

Query: 72  HFRAIPDGLPPSDGNSTQHIPSLCYSASRNCLAPLCSLISEINSSGTVPPVSCIIGDGVM 131
            F +IPDGLP +D + TQ IP+LC S  ++CLAP   L+ +IN+   VPPVSCI+ DG M
Sbjct: 67  RFESIPDGLPETDVDVTQDIPTLCESTMKHCLAPFKELLRQINARDDVPPVSCIVSDGCM 126

Query: 132 TFTVFAAQKFGIPIASFWTASACGCLGYMQYAKLVEQGLVPFKDENFMTNGDLEETIEWI 191
           +FT+ AA++ G+P   FWT SACG L Y+ Y + +E+GL P KDE+++T   L+  I+WI
Sbjct: 127 SFTLDAAEELGVPEVLFWTTSACGFLAYLYYYRFIEKGLSPIKDESYLTKEHLDTKIDWI 186

Query: 192 PPMEKISLRDIPSFIRTTDKDDIMLNFFIEQFETFPKANAIIINTFDSLEHHVLEALSSK 251
           P M+ + L+DIPSFIRTT+ DDIMLNF I + +   +A+AII+NTFD LEH V++++ S 
Sbjct: 187 PSMKNLRLKDIPSFIRTTNPDDIMLNFIIREADRAKRASAIILNTFDDLEHDVIQSMKSI 246

Query: 252 LPPIYPIGPINSL-VAELIKDDKVKDIRSNLWDEQSECMKWLDSQQPNSVVYVNFGSVTV 311
           +PP+Y IGP++ L   E  +  ++    SNLW E++EC+ WL+++  NSVVYVNFGS+TV
Sbjct: 247 VPPVYSIGPLHLLEKQESGEYSEIGRTGSNLWREETECLDWLNTKARNSVVYVNFGSITV 306

Query: 312 MSPQHLVEFAWGLANSEKPFLWIVRPDLVEGETALLPAEFLVETKERGMLADWCNQEEVL 371
           +S + LVEFAWGLA + K FLW++RPDLV G+ A++P EFL  T +R MLA WC QE+VL
Sbjct: 307 LSAKQLVEFAWGLAATGKEFLWVIRPDLVAGDEAMVPPEFLTATADRRMLASWCPQEKVL 366

Query: 372 KHSSVGGFLTHSGWNSTMESIVGGVAMISWPFFAEQQTNCRYCKTEWGNGLEIDSNVRRE 431
            H ++GGFLTH GWNST+ES+ GGV M+ WPFFAEQQTNC++ + EW  G+EI  +V+RE
Sbjct: 367 SHPAIGGFLTHCGWNSTLESLCGGVPMVCWPFFAEQQTNCKFSRDEWEVGIEIGGDVKRE 426

Query: 432 DVEKLVRELMEGEKGEDMKRNAKEWKRKAEEACK-IGGSSPTNLDRVISEIL 482
           +VE +VRELM+ EKG++M+  A+EW+R A EA +   GSS  N + +++++L
Sbjct: 427 EVEAVVRELMDEEKGKNMREKAEEWRRLANEATEHKHGSSKLNFEMLVNKVL 478

BLAST of CsaV3_7G006280 vs. TAIR10
Match: AT1G22400.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 551.2 bits (1419), Expect = 6.5e-157
Identity = 257/491 (52.34%), Postives = 358/491 (72.91%), Query Frame = 0

Query: 1   MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSR 60
           MGS  ++    Q+PH V  PYP+QGHI+PM+++AKL H +GF++TFVNT YNH R LRSR
Sbjct: 1   MGS--QIIHNSQKPHVVCVPYPAQGHINPMMRVAKLLHARGFYVTFVNTVYNHNRFLRSR 60

Query: 61  GPNSLDGLPDFHFRAIPDGLPPSDGNSTQHIPSLCYSASRNCLAPLCSLISEINSSGTVP 120
           G N+LDGLP F F +I DGLP +D ++TQ I +LC S  +NCLAP   L+  IN+   VP
Sbjct: 61  GSNALDGLPSFRFESIADGLPETDMDATQDITALCESTMKNCLAPFRELLQRINAGDNVP 120

Query: 121 PVSCIIGDGVMTFTVFAAQKFGIPIASFWTASACGCLGYMQYAKLVEQGLVPFKDENFMT 180
           PVSCI+ DG M+FT+  A++ G+P   FWT S C  L Y+ +   +E+GL P KDE+++T
Sbjct: 121 PVSCIVSDGCMSFTLDVAEELGVPEVLFWTTSGCAFLAYLHFYLFIEKGLCPLKDESYLT 180

Query: 181 NGDLEET-IEWIPPMEKISLRDIPSFIRTTDKDDIMLNFFIEQFETFPKANAIIINTFDS 240
              LE+T I++IP M+ + L+DIPSFIRTT+ DD+M++F + + E   +A+AII+NTFD 
Sbjct: 181 KEYLEDTVIDFIPTMKNVKLKDIPSFIRTTNPDDVMISFALRETERAKRASAIILNTFDD 240

Query: 241 LEHHVLEALSSKLPPIYPIGPINSLV-AELIKDDKVKDIRSNLWDEQSECMKWLDSQQPN 300
           LEH V+ A+ S LPP+Y +GP++ L   E+ +  ++  + SNLW E+ EC+ WLD++  N
Sbjct: 241 LEHDVVHAMQSILPPVYSVGPLHLLANREIEEGSEIGMMSSNLWKEEMECLDWLDTKTQN 300

Query: 301 SVVYVNFGSVTVMSPQHLVEFAWGLANSEKPFLWIVRPDLVEGETALLPAEFLVETKERG 360
           SV+Y+NFGS+TV+S + LVEFAWGLA S K FLW++RPDLV GE A++P +FL+ETK+R 
Sbjct: 301 SVIYINFGSITVLSVKQLVEFAWGLAGSGKEFLWVIRPDLVAGEEAMVPPDFLMETKDRS 360

Query: 361 MLADWCNQEEVLKHSSVGGFLTHSGWNSTMESIVGGVAMISWPFFAEQQTNCRYCKTEWG 420
           MLA WC QE+VL H ++GGFLTH GWNS +ES+  GV M+ WPFFA+QQ NC++C  EW 
Sbjct: 361 MLASWCPQEKVLSHPAIGGFLTHCGWNSILESLSCGVPMVCWPFFADQQMNCKFCCDEWD 420

Query: 421 NGLEIDSNVRREDVEKLVRELMEGEKGEDMKRNAKEWKRKAEEACKIG-GSSPTNLDRVI 480
            G+EI  +V+RE+VE +VRELM+GEKG+ M+  A EW+R AE+A +   GSS  N + V+
Sbjct: 421 VGIEIGGDVKREEVEAVVRELMDGEKGKKMREKAVEWQRLAEKATEHKLGSSVMNFETVV 480

Query: 481 SEILSSKEKSN 489
           S+ L  ++  +
Sbjct: 481 SKFLLGQKSQD 489

BLAST of CsaV3_7G006280 vs. TAIR10
Match: AT1G22380.1 (UDP-glucosyl transferase 85A3)

HSP 1 Score: 550.4 bits (1417), Expect = 1.1e-156
Identity = 256/476 (53.78%), Postives = 350/476 (73.53%), Query Frame = 0

Query: 11  KQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPD 70
           +Q+PH V  PYP+QGHI+PM+K+AKL H KGFH+TFVNT YNH RLLRSRG N+LDGLP 
Sbjct: 9   EQKPHVVCVPYPAQGHINPMMKVAKLLHVKGFHVTFVNTVYNHNRLLRSRGANALDGLPS 68

Query: 71  FHFRAIPDGLPPSDGNSTQHIPSLCYSASRNCLAPLCSLISEINSSGTVPPVSCIIGDGV 130
           F F +IPDGLP +  ++TQ IP+L  S ++NCL P   L+  I +   VPPVSCI+ DG 
Sbjct: 69  FQFESIPDGLPETGVDATQDIPALSESTTKNCLVPFKKLLQRIVTREDVPPVSCIVSDGS 128

Query: 131 MTFTVFAAQKFGIPIASFWTASACGCLGYMQYAKLVEQGLVPFKDENFMTNGDLEETIEW 190
           M+FT+  A++ G+P   FWT SACG + Y+ +   +E+GL P KD + +T   L+  I+W
Sbjct: 129 MSFTLDVAEELGVPEIHFWTTSACGFMAYLHFYLFIEKGLCPVKDASCLTKEYLDTVIDW 188

Query: 191 IPPMEKISLRDIPSFIRTTDKDDIMLNFFIEQFETFPKANAIIINTFDSLEHHVLEALSS 250
           IP M  + L+DIPSFIRTT+ +DIMLNF + +     +A+AII+NTFD LEH +++++ S
Sbjct: 189 IPSMNNVKLKDIPSFIRTTNPNDIMLNFVVREACRTKRASAIILNTFDDLEHDIIQSMQS 248

Query: 251 KLPPIYPIGPINSLV-AELIKDDKVKDIRSNLWDEQSECMKWLDSQQPNSVVYVNFGSVT 310
            LPP+YPIGP++ LV  E+ +D ++  + SNLW E++EC+ WL+++  NSVVYVNFGS+T
Sbjct: 249 ILPPVYPIGPLHLLVNREIEEDSEIGRMGSNLWKEETECLGWLNTKSRNSVVYVNFGSIT 308

Query: 311 VMSPQHLVEFAWGLANSEKPFLWIVRPDLVEGETALLPAEFLVETKERGMLADWCNQEEV 370
           +M+   L+EFAWGLA + K FLW++RPD V GE A++P EFL ET +R ML  WC QE+V
Sbjct: 309 IMTTAQLLEFAWGLAATGKEFLWVMRPDSVAGEEAVIPKEFLAETADRRMLTSWCPQEKV 368

Query: 371 LKHSSVGGFLTHSGWNSTMESIVGGVAMISWPFFAEQQTNCRYCKTEWGNGLEIDSNVRR 430
           L H +VGGFLTH GWNST+ES+  GV M+ WPFFAEQQTNC++   EW  G+EI  +V+R
Sbjct: 369 LSHPAVGGFLTHCGWNSTLESLSCGVPMVCWPFFAEQQTNCKFSCDEWEVGIEIGGDVKR 428

Query: 431 EDVEKLVRELMEGEKGEDMKRNAKEWKRKAEEACKIG-GSSPTNLDRVISEILSSK 485
            +VE +VRELM+GEKG+ M+  A EW+R AE+A K+  GSS  N + +++++L  K
Sbjct: 429 GEVEAVVRELMDGEKGKKMREKAVEWRRLAEKATKLPCGSSVINFETIVNKVLLGK 484

BLAST of CsaV3_7G006280 vs. Swiss-Prot
Match: sp|F8WKW1|UGT2_GARJA (7-deoxyloganetin glucosyltransferase OS=Gardenia jasminoides OX=114476 GN=UGT85A24 PE=1 SV=1)

HSP 1 Score: 635.6 bits (1638), Expect = 4.7e-181
Identity = 299/474 (63.08%), Postives = 381/474 (80.38%), Query Frame = 0

Query: 12  QQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDF 71
           ++ HAV  PYP+QGHI+PMLKLAK+ HHKGFHITFVNTE+NH+RLL+SRGP++L+GLPDF
Sbjct: 8   EKHHAVCIPYPAQGHINPMLKLAKILHHKGFHITFVNTEFNHKRLLKSRGPDALNGLPDF 67

Query: 72  HFRAIPDGLPPSDGNSTQHIPSLCYSASRNCLAPLCSLISEIN--SSGTVPPVSCIIGDG 131
            F+ IPDGLPPSD ++TQ IPSLC S +  CL P  +L++E+N  SS  VPPVSCI+ DG
Sbjct: 68  QFKTIPDGLPPSDVDATQDIPSLCESTTTRCLDPFRNLLAELNGPSSSQVPPVSCIVSDG 127

Query: 132 VMTFTVFAAQKFGIPIASFWTASACGCLGYMQYAKLVEQGLVPFKDENFMTNGDLEETIE 191
           VM+FT+ AA + G+P   FWT SACG LGYM YAKL+E+GL P KD ++++NG LE++++
Sbjct: 128 VMSFTLEAAAELGVPEILFWTTSACGFLGYMHYAKLIEKGLTPLKDASYLSNGYLEQSLD 187

Query: 192 WIPPMEKISLRDIPSFIRTTDKDDIMLNFFIEQFETFPKANAIIINTFDSLEHHVLEALS 251
           WIP M+ I L+D+PSF+RTT+ DD M+ F +++ E   KA+AII+NTF  LE  V+ ALS
Sbjct: 188 WIPGMKDIRLKDLPSFLRTTNPDDYMVKFVLQETERAKKASAIILNTFQELEDDVINALS 247

Query: 252 SKLPPIYPIGPINSLVAELIKDDKVKDIRSNLWDEQSECMKWLDSQQPNSVVYVNFGSVT 311
           + LPPIY IGP+  L  E +KD+++  + SNLW E+ EC+ WLDS+ PNSVVYVNFGS+T
Sbjct: 248 AILPPIYTIGPLQFLQKE-VKDERLSVLGSNLWKEEPECLDWLDSKDPNSVVYVNFGSIT 307

Query: 312 VMSPQHLVEFAWGLANSEKPFLWIVRPDLVEGETALLPAEFLVETKERGMLADWCNQEEV 371
           VM+P  LVEFAWGLANS++ FLWI+RPDLV G++A+LP EFL ETK+RG+LA WC QE+V
Sbjct: 308 VMTPGQLVEFAWGLANSKQTFLWIIRPDLVSGDSAILPPEFLEETKDRGLLASWCPQEQV 367

Query: 372 LKHSSVGGFLTHSGWNSTMESIVGGVAMISWPFFAEQQTNCRYCKTEWGNGLEIDSNVRR 431
           L H ++GGFLTHSGWNST+ESI  GV MI WPFFAEQQTNC +C T+W NGLEID+NV+R
Sbjct: 368 LSHPAIGGFLTHSGWNSTLESICSGVPMICWPFFAEQQTNCWFCCTKWYNGLEIDNNVKR 427

Query: 432 EDVEKLVRELMEGEKGEDMKRNAKEWKRKAEEACK-IGGSSPTNLDRVISEILS 483
           ++VE LV ELM GEKG DMK+ A EWK KAEEA K  GGSS +NL++V+  +LS
Sbjct: 428 DEVESLVTELMVGEKGMDMKKKALEWKNKAEEAAKSSGGSSYSNLEKVVQVLLS 480

BLAST of CsaV3_7G006280 vs. Swiss-Prot
Match: sp|F8WLS6|UGT6_CATRO (7-deoxyloganetin glucosyltransferase OS=Catharanthus roseus OX=4058 GN=UGT85A23 PE=1 SV=1)

HSP 1 Score: 604.7 bits (1558), Expect = 8.9e-172
Identity = 282/487 (57.91%), Postives = 381/487 (78.23%), Query Frame = 0

Query: 1   MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSR 60
           MGSL+  D  K +PHAV  PYP+QGHI+PMLKLAKL H+KGFHITFVNTE+NH+RLL+SR
Sbjct: 1   MGSLSSSDYSK-KPHAVCIPYPAQGHINPMLKLAKLLHYKGFHITFVNTEFNHKRLLKSR 60

Query: 61  GPNSLDGLPDFHFRAIPDGLPPSDGNSTQHIPSLCYSASRNCLAPLCSLISEIN--SSGT 120
           G +SL GL  F F+ IPDGLPPSD ++TQ IPSLC S + +CL P   L+ ++N  SS  
Sbjct: 61  GSDSLKGLHSFQFKTIPDGLPPSDVDATQDIPSLCESTTTHCLVPFKQLLQKLNDTSSSE 120

Query: 121 VPPVSCIIGDGVMTFTVFAAQKFGIPIASFWTASACGCLGYMQYAKLVEQGLVPFKDENF 180
           VPPVSC++ D VM+FT+ AAQ+  IP   FWT SACG LGYM YA+L+++GL P KD ++
Sbjct: 121 VPPVSCVVSDAVMSFTISAAQELDIPEVLFWTPSACGVLGYMHYAQLIDKGLTPLKDASY 180

Query: 181 MTNGDLEETIEWIPPMEKISLRDIPSFIRTTDKDDIMLNFFIEQFETFPKANAIIINTFD 240
            +NG L++ ++WIP ME I LRD+P+F+RTT+ D+ M+ F +++ E   KA+AI++NTF 
Sbjct: 181 FSNGFLDQVLDWIPGMEGIRLRDLPTFLRTTNPDEYMIKFILQETERSKKASAIVLNTFQ 240

Query: 241 SLEHHVLEALSSKLPPIYPIGPINSLVAELIKDDKVKDIRSNLWDEQSECMKWLDSQQPN 300
            LE  V+++LS+ LPPIYPIGP+  ++   + D+ +K + SNLW E+ EC++WLD++ PN
Sbjct: 241 ELESEVIDSLSTLLPPIYPIGPL-QILQNQVDDESLKVLGSNLWKEEPECLEWLDTKDPN 300

Query: 301 SVVYVNFGSVTVMSPQHLVEFAWGLANSEKPFLWIVRPDLVEGETALLPAEFLVETKERG 360
           SVVYVNFGS+TVM+   L+EFAWGLANS++ FLWI+RPDL+ GE+++L  EF+ ETKERG
Sbjct: 301 SVVYVNFGSITVMTNDQLIEFAWGLANSKQNFLWIIRPDLISGESSILGEEFVEETKERG 360

Query: 361 MLADWCNQEEVLKHSSVGGFLTHSGWNSTMESIVGGVAMISWPFFAEQQTNCRYCKTEWG 420
           ++A WC+QE+V+ H ++GGFLTH+GWNST+ESI  GV MI WPFFAEQQTNCR+C  +WG
Sbjct: 361 LIASWCHQEQVINHPAIGGFLTHNGWNSTIESISSGVPMICWPFFAEQQTNCRFCCNKWG 420

Query: 421 NGLEIDSNVRREDVEKLVRELMEGEKGEDMKRNAKEWKRKAE-EACKIGGSSPTNLDRVI 480
            G+EI+S+V+R++VE LV+ELM GEKG++MK+ A EWK  AE    K  GSS +NL+++I
Sbjct: 421 IGMEINSDVKRDEVESLVKELMVGEKGKEMKKKALEWKNIAEVTTTKPDGSSYSNLEKLI 480

Query: 481 SEILSSK 485
            ++L SK
Sbjct: 481 -KVLKSK 484

BLAST of CsaV3_7G006280 vs. Swiss-Prot
Match: sp|Q6VAB3|U85A8_STERE (UDP-glycosyltransferase 85A8 OS=Stevia rebaudiana OX=55670 GN=UGT85A8 PE=2 SV=1)

HSP 1 Score: 599.4 bits (1544), Expect = 3.7e-170
Identity = 280/473 (59.20%), Postives = 360/473 (76.11%), Query Frame = 0

Query: 12  QQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDF 71
           Q+PHA+  PYP+QGHI+PM++ AKL H KGFHI+FVN  YNH+RL RSRG ++L+GLPDF
Sbjct: 8   QKPHAICIPYPAQGHINPMMQFAKLLHFKGFHISFVNNHYNHKRLQRSRGLSALEGLPDF 67

Query: 72  HFRAIPDGLPPSDGNSTQHIPSLCYSASRNCLAPLCSLISEINSSGTVPPVSCIIGDGVM 131
           HF +IPDGLPPS+  +TQ IP LC S  ++ L P C LI+ +N S  VPPVSCII DGVM
Sbjct: 68  HFYSIPDGLPPSNAEATQSIPGLCESIPKHSLEPFCDLIATLNGS-DVPPVSCIISDGVM 127

Query: 132 TFTVFAAQKFGIPIASFWTASACGCLGYMQYAKLVEQGLVPFKDENFMTNGDLEETIEWI 191
           +FT+ AA++FG+P   FWT SACG L Y  Y  LV++  +P KD N +TNG LE +++WI
Sbjct: 128 SFTLQAAERFGLPEVLFWTPSACGFLAYTHYRDLVDKEYIPLKDTNDLTNGYLETSLDWI 187

Query: 192 PPMEKISLRDIPSFIRTTDKDDIMLNFFIEQFETFPKANAIIINTFDSLEHHVLEALSSK 251
           P M+ I L+D PSFIRTTD +DIMLN+F+ + E  PK  AII+NTFD+LE   +  + + 
Sbjct: 188 PGMKNIRLKDFPSFIRTTDINDIMLNYFLIETEAIPKGVAIILNTFDALEKDSITPVLAL 247

Query: 252 LPPIYPIGPINSLVAELIKDDKVKDIRSNLWDEQSECMKWLDSQQPNSVVYVNFGSVTVM 311
            P IY IGP++ +   +  D+++K I SNLW E   C+ WLD+++PNSVVYVNFGS+TVM
Sbjct: 248 NPQIYTIGPLHMMQQYVDHDERLKHIGSNLWKEDVSCINWLDTKKPNSVVYVNFGSITVM 307

Query: 312 SPQHLVEFAWGLANSEKPFLWIVRPDLVEGETALLPAEFLVETKERGMLADWCNQEEVLK 371
           + + L+EF WGLANS+K FLWI RPD+V G  A++PAEF+ ETKERGM+  WC+QEEVLK
Sbjct: 308 TKEQLIEFGWGLANSKKDFLWITRPDIVGGNEAMIPAEFIEETKERGMVTSWCSQEEVLK 367

Query: 372 HSSVGGFLTHSGWNSTMESIVGGVAMISWPFFAEQQTNCRYCKTEWGNGLEIDSNVRRED 431
           H S+G FLTHSGWNST+ESI  GV MI WPFFAEQQTNCRYC  EW  GLEID++V+RE+
Sbjct: 368 HPSIGVFLTHSGWNSTIESISNGVPMICWPFFAEQQTNCRYCCVEWEIGLEIDTDVKREE 427

Query: 432 VEKLVRELMEGEKGEDMKRNAKEWKRKAEEACKIGGSSPTNLDRVISEILSSK 485
           VE  VRE+M+G KG+ MK  A EWK+KAEEA  IGGSS  N +++++++L  K
Sbjct: 428 VEAQVREMMDGSKGKMMKNKALEWKKKAEEAVSIGGSSYLNFEKLVTDVLLRK 479

BLAST of CsaV3_7G006280 vs. Swiss-Prot
Match: sp|Q9LMF0|U85A5_ARATH (UDP-glycosyltransferase 85A5 OS=Arabidopsis thaliana OX=3702 GN=UGT85A5 PE=2 SV=1)

HSP 1 Score: 577.8 bits (1488), Expect = 1.2e-163
Identity = 267/472 (56.57%), Postives = 357/472 (75.64%), Query Frame = 0

Query: 12  QQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDF 71
           Q+PH V  P+P+QGHI+PMLK+AKL + +GFH+TFVNT YNH RL+RSRGPNSLDGLP F
Sbjct: 10  QKPHVVCIPFPAQGHINPMLKVAKLLYARGFHVTFVNTNYNHNRLIRSRGPNSLDGLPSF 69

Query: 72  HFRAIPDGLPPSDGNSTQHIPSLCYSASRNCLAPLCSLISEINSSGTVPPVSCIIGDGVM 131
            F +IPDGLP  + +  Q +P+LC S  +NCLAP   L+  IN++  VPPVSCI+ DGVM
Sbjct: 70  RFESIPDGLPEENKDVMQDVPTLCESTMKNCLAPFKELLRRINTTKDVPPVSCIVSDGVM 129

Query: 132 TFTVFAAQKFGIPIASFWTASACGCLGYMQYAKLVEQGLVPFKDENFMTNGDLEETIEWI 191
           +FT+ AA++ G+P   FWT SACG L Y+ + + +E+GL P KDE+      L+  I WI
Sbjct: 130 SFTLDAAEELGVPDVLFWTPSACGFLAYLHFYRFIEKGLSPIKDES-----SLDTKINWI 189

Query: 192 PPMEKISLRDIPSFIRTTDKDDIMLNFFIEQFETFPKANAIIINTFDSLEHHVLEALSSK 251
           P M+ + L+DIPSFIR T+ +DIMLNFF+ + +   +A+AII+NTFDSLEH V+ ++ S 
Sbjct: 190 PSMKNLGLKDIPSFIRATNTEDIMLNFFVHEADRAKRASAIILNTFDSLEHDVVRSIQSI 249

Query: 252 LPPIYPIGPINSLV-AELIKDDKVKDIRSNLWDEQSECMKWLDSQQPNSVVYVNFGSVTV 311
           +P +Y IGP++  V  ++ ++  +  I +N+W E+ EC+ WLD++ PNSVVYVNFGS+TV
Sbjct: 250 IPQVYTIGPLHLFVNRDIDEESDIGQIGTNMWREEMECLDWLDTKSPNSVVYVNFGSITV 309

Query: 312 MSPQHLVEFAWGLANSEKPFLWIVRPDLVEGETALLPAEFLVETKERGMLADWCNQEEVL 371
           MS + LVEFAWGLA ++K FLW++RPDLV G+  +LP +FL+ET  R MLA WC QE+VL
Sbjct: 310 MSAKQLVEFAWGLAATKKDFLWVIRPDLVAGDVPMLPPDFLIETANRRMLASWCPQEKVL 369

Query: 372 KHSSVGGFLTHSGWNSTMESIVGGVAMISWPFFAEQQTNCRYCKTEWGNGLEIDSNVRRE 431
            H +VGGFLTHSGWNST+ES+ GGV M+ WPFFAEQQTNC+YC  EW  G+EI  +VRRE
Sbjct: 370 SHPAVGGFLTHSGWNSTLESLSGGVPMVCWPFFAEQQTNCKYCCDEWEVGMEIGGDVRRE 429

Query: 432 DVEKLVRELMEGEKGEDMKRNAKEWKRKAEEACK-IGGSSPTNLDRVISEIL 482
           +VE+LVRELM+G+KG+ M++ A+EW+R AEEA K I GSS  N   V+ ++L
Sbjct: 430 EVEELVRELMDGDKGKKMRQKAEEWQRLAEEATKPIYGSSELNFQMVVDKVL 476

BLAST of CsaV3_7G006280 vs. Swiss-Prot
Match: sp|Q9LME8|U85A7_ARATH (UDP-glycosyltransferase 85A7 OS=Arabidopsis thaliana OX=3702 GN=UGT85A7 PE=2 SV=1)

HSP 1 Score: 571.2 bits (1471), Expect = 1.1e-161
Identity = 265/472 (56.14%), Postives = 354/472 (75.00%), Query Frame = 0

Query: 12  QQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDF 71
           Q+PH V  PYP+QGHI+PMLK+AKL + KGFH+TFVNT YNH RLLRSRGPN+LDG P F
Sbjct: 10  QKPHVVCVPYPAQGHINPMLKVAKLLYAKGFHVTFVNTLYNHNRLLRSRGPNALDGFPSF 69

Query: 72  HFRAIPDGLPPSDGNSTQHIPSLCYSASRNCLAPLCSLISEINSSGTVPPVSCIIGDGVM 131
            F +IPDGLP +DG+ TQH P++C S  +NCLAP   ++  IN    VPPVSCI+ DGVM
Sbjct: 70  RFESIPDGLPETDGDRTQHTPTVCMSIEKNCLAPFKEILRRINDKDDVPPVSCIVSDGVM 129

Query: 132 TFTVFAAQKFGIPIASFWTASACGCLGYMQYAKLVEQGLVPFKDENFMTNGDLEETIEWI 191
           +FT+ AA++ G+P   FWT SACG +  + +   +E+GL PFKDE++M+   L+  I+WI
Sbjct: 130 SFTLDAAEELGVPEVIFWTNSACGFMTILHFYLFIEKGLSPFKDESYMSKEHLDTVIDWI 189

Query: 192 PPMEKISLRDIPSFIRTTDKDDIMLNFFIEQFETFPKANAIIINTFDSLEHHVLEALSSK 251
           P M+ + L+DIPS+IRTT+ D+IMLNF I + E   +A+AII+NTFD LEH V++++ S 
Sbjct: 190 PSMKNLRLKDIPSYIRTTNPDNIMLNFLIREVERSKRASAIILNTFDELEHDVIQSMQSI 249

Query: 252 LPPIYPIGPINSLVAELIKD-DKVKDIRSNLWDEQSECMKWLDSQQPNSVVYVNFGSVTV 311
           LPP+Y IGP++ LV E I +  ++  +  NLW E+ EC+ WLD++ PNSV++VNFG +TV
Sbjct: 250 LPPVYSIGPLHLLVKEEINEASEIGQMGLNLWREEMECLDWLDTKTPNSVLFVNFGCITV 309

Query: 312 MSPQHLVEFAWGLANSEKPFLWIVRPDLVEGET-ALLPAEFLVETKERGMLADWCNQEEV 371
           MS + L EFAWGLA S K FLW++RP+LV GE   +LP EFL ET +R MLA WC QE+V
Sbjct: 310 MSAKQLEEFAWGLAASRKEFLWVIRPNLVVGEAMVVLPQEFLAETIDRRMLASWCPQEKV 369

Query: 372 LKHSSVGGFLTHSGWNSTMESIVGGVAMISWPFFAEQQTNCRYCKTEWGNGLEIDSNVRR 431
           L H ++GGFLTH GWNST+ES+ GGV MI WP F+EQ TNC++C  EWG G+EI  +V+R
Sbjct: 370 LSHPAIGGFLTHCGWNSTLESLAGGVPMICWPCFSEQPTNCKFCCDEWGVGIEIGKDVKR 429

Query: 432 EDVEKLVRELMEGEKGEDMKRNAKEWKRKAEEACKI-GGSSPTNLDRVISEI 481
           E+VE +VRELM+GEKG+ ++  A+EW+R AEEA +   GSS  NL+ +I ++
Sbjct: 430 EEVETVVRELMDGEKGKKLREKAEEWRRLAEEATRYKHGSSVMNLETLIHKV 481

BLAST of CsaV3_7G006280 vs. TrEMBL
Match: tr|A0A0A0K1Z5|A0A0A0K1Z5_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G059660 PE=4 SV=1)

HSP 1 Score: 995.0 bits (2571), Expect = 6.2e-287
Identity = 481/493 (97.57%), Postives = 487/493 (98.78%), Query Frame = 0

Query: 1   MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSR 60
           MGSLTKV+QGK QPHAV  PYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSR
Sbjct: 1   MGSLTKVNQGKHQPHAVFVPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSR 60

Query: 61  GPNSLDGLPDFHFRAIPDGLPPSDGNSTQHIPSLCYSASRNCLAPLCSLISEINSSGTVP 120
           GPNSLDGLPDFHFRAIPDGLPPSDGNSTQH+PSLCYS SRNCLAP CSLISEINSSGTVP
Sbjct: 61  GPNSLDGLPDFHFRAIPDGLPPSDGNSTQHVPSLCYSTSRNCLAPFCSLISEINSSGTVP 120

Query: 121 PVSCIIGDGVMTFTVFAAQKFGIPIASFWTASACGCLGYMQYAKLVEQGLVPFKDENFMT 180
           PVSCIIGDG+MTFTVFAAQ+FGIP A+FWTASACGCLGYMQYAKLVEQGLVPFKDENFMT
Sbjct: 121 PVSCIIGDGIMTFTVFAAQEFGIPTAAFWTASACGCLGYMQYAKLVEQGLVPFKDENFMT 180

Query: 181 NGDLEETIEWIPPMEKISLRDIPSFIRTTDKDDIMLNFFIEQFETFPKANAIIINTFDSL 240
           NGDLEETIEWIPPMEKISLRDIPSFIRTTDKDDIMLNFFIEQFETFPKANAIIINTFDSL
Sbjct: 181 NGDLEETIEWIPPMEKISLRDIPSFIRTTDKDDIMLNFFIEQFETFPKANAIIINTFDSL 240

Query: 241 EHHVLEALSSKLPPIYPIGPINSLVAELIKDDKVKDIRSNLWDEQSECMKWLDSQQPNSV 300
           EHHVLEALSSKLPPIYPIGPINSLVAELIKDDKVKDIRSNLWDEQSECMKWLDSQQPNSV
Sbjct: 241 EHHVLEALSSKLPPIYPIGPINSLVAELIKDDKVKDIRSNLWDEQSECMKWLDSQQPNSV 300

Query: 301 VYVNFGSVTVMSPQHLVEFAWGLANSEKPFLWIVRPDLVEGETALLPAEFLVETKERGML 360
           VYVNFGSVTVMSPQHLVEFAWGLANSEKPFLWIVRPDLVEGETALLPAEFLVETKERGML
Sbjct: 301 VYVNFGSVTVMSPQHLVEFAWGLANSEKPFLWIVRPDLVEGETALLPAEFLVETKERGML 360

Query: 361 ADWCNQEEVLKHSSVGGFLTHSGWNSTMESIVGGVAMISWPFFAEQQTNCRYCKTEWGNG 420
           ADWCNQEEVLKHSSVGGFLTHSGWNSTMESIVGGVAMISWPFFAEQQTNCRYCKTEWGNG
Sbjct: 361 ADWCNQEEVLKHSSVGGFLTHSGWNSTMESIVGGVAMISWPFFAEQQTNCRYCKTEWGNG 420

Query: 421 LEIDSNVRREDVEKLVRELMEGEKGEDMKRNAKEWKRKAEEACKIGGSSPTNLDRVISEI 480
           LEIDSNVRREDVEK+VRELMEGEKGEDMKRNAKEWKRKAEEACKIGGSSPTNLDRVISEI
Sbjct: 421 LEIDSNVRREDVEKVVRELMEGEKGEDMKRNAKEWKRKAEEACKIGGSSPTNLDRVISEI 480

Query: 481 LSSKEKSNLNSQN 494
           LSSKEKSNLNSQN
Sbjct: 481 LSSKEKSNLNSQN 493

BLAST of CsaV3_7G006280 vs. TrEMBL
Match: tr|A0A1S3C1P0|A0A1S3C1P0_CUCME (7-deoxyloganetin glucosyltransferase-like OS=Cucumis melo OX=3656 GN=LOC103495446 PE=4 SV=1)

HSP 1 Score: 953.7 bits (2464), Expect = 1.6e-274
Identity = 462/494 (93.52%), Postives = 474/494 (95.95%), Query Frame = 0

Query: 1   MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSR 60
           MGSL+KVDQ  QQPHAV  PYPSQGHISPMLKLAKLFHHKGFH+TFVNTEYNHRRLLRSR
Sbjct: 1   MGSLSKVDQENQQPHAVFVPYPSQGHISPMLKLAKLFHHKGFHVTFVNTEYNHRRLLRSR 60

Query: 61  GPNSLDGLPDFHFRAIPDGLPPSDGNSTQHIPSLCYSASRNCLAPLCSLISEINSSG-TV 120
           GPNSLDGLPDF FRAIPDGLPPSDGN+TQH+PSLCYS SRNCLAPLCSLISEINSSG TV
Sbjct: 61  GPNSLDGLPDFQFRAIPDGLPPSDGNATQHVPSLCYSTSRNCLAPLCSLISEINSSGSTV 120

Query: 121 PPVSCIIGDGVMTFTVFAAQKFGIPIASFWTASACGCLGYMQYAKLVEQGLVPFKDENFM 180
           PPVSCIIGDGVMTFTVFAAQKFGIPIASFWTASACGCLGYMQYAKLVEQ ++PFKDENFM
Sbjct: 121 PPVSCIIGDGVMTFTVFAAQKFGIPIASFWTASACGCLGYMQYAKLVEQQMIPFKDENFM 180

Query: 181 TNGDLEETIEWIPPMEKISLRDIPSFIRTTDKDDIMLNFFIEQFETFPKANAIIINTFDS 240
           +NGDLEETIEWIPPMEKI LRDIPSFIRTTDKDDIMLNFFI+Q ET PKANAIIINTFDS
Sbjct: 181 SNGDLEETIEWIPPMEKIRLRDIPSFIRTTDKDDIMLNFFIQQLETLPKANAIIINTFDS 240

Query: 241 LEHHVLEALSSKLPPIYPIGPINSLVAELIKDDKVKDIRSNLWDEQSECMKWLDSQQPNS 300
           LEHHVLEALSSKLPPIYPIGPINSLV ELIKDDKVK IRSNLWDEQSECMKWLDSQQPNS
Sbjct: 241 LEHHVLEALSSKLPPIYPIGPINSLVTELIKDDKVKGIRSNLWDEQSECMKWLDSQQPNS 300

Query: 301 VVYVNFGSVTVMSPQHLVEFAWGLANSEKPFLWIVRPDLVEGETALLPAEFLVETKERGM 360
           VVYVNFGS+TVMSPQHLVEFAWGLANSEKPFLWIVRPDLVEGETALLPAEF+ ETKERGM
Sbjct: 301 VVYVNFGSITVMSPQHLVEFAWGLANSEKPFLWIVRPDLVEGETALLPAEFVAETKERGM 360

Query: 361 LADWCNQEEVLKHSSVGGFLTHSGWNSTMESIVGGVAMISWPFFAEQQTNCRYCKTEWGN 420
           L DWCNQEEVLKH SVGGFLTHSGWNSTMESI GGVAMISWPFFAEQQTNCRYCKTEWGN
Sbjct: 361 LGDWCNQEEVLKHPSVGGFLTHSGWNSTMESIAGGVAMISWPFFAEQQTNCRYCKTEWGN 420

Query: 421 GLEIDSNVRREDVEKLVRELMEGEKGEDMKRNAKEWKRKAEEACKIGGSSPTNLDRVISE 480
           GLEIDSNVRREDVEKLVRELMEGEKGEDMKRNAKEWKRKAEEACKIGGSS TNLD+VISE
Sbjct: 421 GLEIDSNVRREDVEKLVRELMEGEKGEDMKRNAKEWKRKAEEACKIGGSSLTNLDQVISE 480

Query: 481 ILSSKEKSNLNSQN 494
           IL SK+KSNL SQN
Sbjct: 481 ILLSKDKSNLKSQN 494

BLAST of CsaV3_7G006280 vs. TrEMBL
Match: tr|A0A0A0K793|A0A0A0K793_CUCSA (UDP-glucuronosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_7G058650 PE=4 SV=1)

HSP 1 Score: 751.9 bits (1940), Expect = 9.1e-214
Identity = 370/457 (80.96%), Postives = 407/457 (89.06%), Query Frame = 0

Query: 30  MLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFHFRAIPDGLPPSDGNSTQ 89
           ML LAKLFHHKGFHITFVNTEYNHR LLR+RGPNSLDGLPDFHFRAIPDGLP  D  +TQ
Sbjct: 1   MLNLAKLFHHKGFHITFVNTEYNHRCLLRTRGPNSLDGLPDFHFRAIPDGLPSCDSKATQ 60

Query: 90  --HIPSLCYSASRNCLAPLCSLISEINSS-GTVPPVSCIIGDGVMTFTVFAAQKFGIPIA 149
             +IPSL YS SRNCLAPLCS+ISEINSS G +PPVSCI+GDG+MTFTVFAA++FGIPIA
Sbjct: 61  GRNIPSLFYSTSRNCLAPLCSVISEINSSGGVIPPVSCIVGDGLMTFTVFAARQFGIPIA 120

Query: 150 SFWTASACGCLGYMQYAKLVEQGLVPFKDENFMTNGDLEETIEWIPPMEK-ISLRDIPSF 209
           +FWTASACGCLGYMQYAKLVE GL+PFKD+NFMTNGDLE TIEWIPP++K ISL+ IPS 
Sbjct: 121 TFWTASACGCLGYMQYAKLVEHGLLPFKDDNFMTNGDLETTIEWIPPVQKEISLKYIPSS 180

Query: 210 IRTTDKDDIMLNFFIEQFETFPKANAIIINTFDSLEHHVLEALSSKLPPIYPIGPINSLV 269
           IRTTDK++ +LNFFI+QFE  PKAN II+NTFDSLEH VLEALSS+LPPIYPIGPIN LV
Sbjct: 181 IRTTDKNNPVLNFFIQQFEILPKANVIIMNTFDSLEHQVLEALSSRLPPIYPIGPINLLV 240

Query: 270 AELIKDDKVKDIRSNLWDEQSECMKWLDSQQPNSVVYVNFGSVTVMSPQHLVEFAWGLAN 329
           AE I+D K+KDI  +LWD QSECMKWLDSQQPNSVVYV+FGSV VMSPQHLVEFAWGLAN
Sbjct: 241 AESIQDAKLKDIGLDLWDVQSECMKWLDSQQPNSVVYVSFGSVNVMSPQHLVEFAWGLAN 300

Query: 330 SEKPFLWIVRPDLVEGETALLPAEFLVETKERGMLADWCNQEEVLKHSSVGGFLTHSGWN 389
           SEKPFLWIVRP+LV GET  LPAEF  ET+ RGM   WCNQEEVLKH SVGGFLTHSGWN
Sbjct: 301 SEKPFLWIVRPNLVGGETISLPAEFAAETEGRGMFVKWCNQEEVLKHPSVGGFLTHSGWN 360

Query: 390 STMESIVGGVAMISWPFFAEQQTNCRYCKTEWGNGLEIDSNVRREDVEKLVRELM-EGEK 449
           STMESIVGGVAM+SWP FAEQ  N  YCKTEWGNGLEI S+V+RE+VEKLVR LM +GEK
Sbjct: 361 STMESIVGGVAMLSWPCFAEQPMNSYYCKTEWGNGLEIGSDVKREEVEKLVRVLMGDGEK 420

Query: 450 GEDMKRNAKEWKRKAEEACKIGGSSPTNLDRVISEIL 482
           GED++RNA+EWK KAEEACK GGSS TNLDR+ISE+L
Sbjct: 421 GEDIRRNAREWKTKAEEACKFGGSSSTNLDRLISEML 457

BLAST of CsaV3_7G006280 vs. TrEMBL
Match: tr|A0A2N9HN87|A0A2N9HN87_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS41317 PE=3 SV=1)

HSP 1 Score: 666.8 bits (1719), Expect = 3.8e-188
Identity = 314/471 (66.67%), Postives = 384/471 (81.53%), Query Frame = 0

Query: 13  QPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSRGPNSLDGLPDFH 72
           +PH V  PYP QGHI+PM KLAK+ HHKGFHITFVNTEYNH+RLLRSRGPNSLDGLPDF 
Sbjct: 9   RPHVVCVPYPLQGHINPMFKLAKILHHKGFHITFVNTEYNHKRLLRSRGPNSLDGLPDFQ 68

Query: 73  FRAIPDGLPPSDGNSTQHIPSLCYSASRNCLAPLCSLISEIN--SSGTVPPVSCIIGDGV 132
           F+ I DGLPPSD N +Q +PSLC S S++ L PLC+LIS++N  SS  VP V+CI+ DG 
Sbjct: 69  FKTISDGLPPSDENVSQDVPSLCDSTSKHSLVPLCNLISKLNDTSSSNVPLVTCIVSDGC 128

Query: 133 MTFTVFAAQKFGIPIASFWTASACGCLGYMQYAKLVEQGLVPFKDENFMTNGDLEETIEW 192
           M+FT+ AA KFGIP   FWT S+CG L YM Y  LVE+GLVP KD   +TNG LE  I+W
Sbjct: 129 MSFTLDAADKFGIPNVLFWTPSSCGFLSYMHYRHLVERGLVPLKDATHLTNGYLESAIDW 188

Query: 193 IPPMEKISLRDIPSFIRTTDKDDIMLNFFIEQFETFPKANAIIINTFDSLEHHVLEALSS 252
           IP M+ I L+D+PSFIRTTD+ DIM+NF I + E  P+A+A+I+NTFD  E  VL+ALSS
Sbjct: 189 IPGMKNIRLKDLPSFIRTTDEKDIMVNFLIRETERAPRASAVILNTFDPFEQDVLDALSS 248

Query: 253 KLPPIYPIGPINSLVAELIKDDKVKDIRSNLWDEQSECMKWLDSQQPNSVVYVNFGSVTV 312
            LP IY IGP+  L+A+ IKDD +K I SNLW E+ EC++WL+S++PNSVVYVN+GS+TV
Sbjct: 249 MLPRIYTIGPL-VLLADQIKDDNLKSIGSNLWKEEPECVEWLNSKEPNSVVYVNYGSITV 308

Query: 313 MSPQHLVEFAWGLANSEKPFLWIVRPDLVEGETALLPAEFLVETKERGMLADWCNQEEVL 372
           M+PQ L+EFAWGLANSEKPFLWI+RPDLV G++A+LP EF+ +TK+RGMLA WC QE++L
Sbjct: 309 MTPQQLIEFAWGLANSEKPFLWIIRPDLVVGDSAVLPPEFVTKTKDRGMLASWCLQEQIL 368

Query: 373 KHSSVGGFLTHSGWNSTMESIVGGVAMISWPFFAEQQTNCRYCKTEWGNGLEIDSNVRRE 432
           KH S+GGFLTHSGWNST+ES+  GV MISWPFFAEQQTNCRYC TEWG G+EID+NV+R+
Sbjct: 369 KHPSIGGFLTHSGWNSTLESVCNGVPMISWPFFAEQQTNCRYCCTEWGIGMEIDNNVQRD 428

Query: 433 DVEKLVRELMEGEKGEDMKRNAKEWKRKAEEACKIGGSSPTNLDRVISEIL 482
           +VEKLVRELM GEKG++MK+   EWK KAEEA K GGSS  NL+++I ++L
Sbjct: 429 EVEKLVRELMNGEKGKNMKKKVIEWKTKAEEATKPGGSSYQNLEKLIVQVL 478

BLAST of CsaV3_7G006280 vs. TrEMBL
Match: tr|A0A2N9GQT2|A0A2N9GQT2_FAGSY (Glycosyltransferase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS29690 PE=3 SV=1)

HSP 1 Score: 660.6 bits (1703), Expect = 2.8e-186
Identity = 309/483 (63.98%), Postives = 389/483 (80.54%), Query Frame = 0

Query: 1   MGSLTKVDQGKQQPHAVLFPYPSQGHISPMLKLAKLFHHKGFHITFVNTEYNHRRLLRSR 60
           MGS +    G  +PH V  P P QGHI+PMLKLAKL HHKGFH+TFVNTEYNH+RLLRSR
Sbjct: 1   MGSFS----GDIRPHVVCIPVPLQGHINPMLKLAKLLHHKGFHVTFVNTEYNHKRLLRSR 60

Query: 61  GPNSLDGLPDFHFRAIPDGLPPSDGNSTQHIPSLCYSASRNCLAPLCSLISEIN--SSGT 120
           GPNSLDGLP+FHF  IPDGLPPSD + +Q IPSLC S  + CL P C+L+S++N  SS  
Sbjct: 61  GPNSLDGLPNFHFETIPDGLPPSDADVSQDIPSLCESTVKTCLVPFCNLLSKLNDTSSSN 120

Query: 121 VPPVSCIIGDGVMTFTVFAAQKFGIPIASFWTASACGCLGYMQYAKLVEQGLVPFKDENF 180
           VPPV+CI+ DG M+FT+ AA KFGIPIA FWT S+CG L YM +  LVE+GL+P KD ++
Sbjct: 121 VPPVTCIVSDGNMSFTLDAAHKFGIPIALFWTPSSCGFLCYMHFRHLVERGLIPLKDASY 180

Query: 181 MTNGDLEETIEWIPPMEKISLRDIPSFIRTTDKDDIMLNFFIEQFETFPKANAIIINTFD 240
           +TNG LE  I+WIP M+ I L+D PSF+RTTD  DI+ NF + + +  PKA AII+NTFD
Sbjct: 181 LTNGFLESAIDWIPGMKNIRLKDFPSFVRTTDVKDIVFNFMLREADRAPKALAIILNTFD 240

Query: 241 SLEHHVLEALSSKLPPIYPIGPINSLVAELIKDDKVKDIRSNLWDEQSECMKWLDSQQPN 300
             E  VL+ALSS LP IY IGP+  L+A+ I+DD +K + SNLW E++ C++WLDS++PN
Sbjct: 241 PFEQDVLDALSSMLPRIYTIGPL-LLLADQIEDDYLKSLGSNLWKEETGCVEWLDSKEPN 300

Query: 301 SVVYVNFGSVTVMSPQHLVEFAWGLANSEKPFLWIVRPDLVEGETALLPAEFLVETKERG 360
           SVVYVNFGS+T+M+PQ L+EFAWGLANSEKPFLWI+RPDLVEG++A++P EFL++TK+RG
Sbjct: 301 SVVYVNFGSITIMTPQQLIEFAWGLANSEKPFLWIIRPDLVEGDSAIVPPEFLIKTKDRG 360

Query: 361 MLADWCNQEEVLKHSSVGGFLTHSGWNSTMESIVGGVAMISWPFFAEQQTNCRYCKTEWG 420
           M A WC QE++LKH S+GGFLTHSGWNST+ES+  GV MISWPFFAEQQTNCRY   EWG
Sbjct: 361 MFASWCPQEQILKHPSIGGFLTHSGWNSTLESVCNGVPMISWPFFAEQQTNCRYSCIEWG 420

Query: 421 NGLEIDSNVRREDVEKLVRELMEGEKGEDMKRNAKEWKRKAEEACKIGGSSPTNLDRVIS 480
             +EID+NV+R+++EKLVREL++GEKG++MK+   EWK KAEEA K GGSS  NL+++I+
Sbjct: 421 IAMEIDNNVQRDEMEKLVRELVDGEKGKEMKKKVIEWKIKAEEATKPGGSSYQNLEKLIA 478

Query: 481 EIL 482
           ++L
Sbjct: 481 QLL 478

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_011658777.19.3e-28797.57PREDICTED: 7-deoxyloganetin glucosyltransferase-like [Cucumis sativus] >KGN43695... [more]
XP_008455233.12.4e-27493.52PREDICTED: 7-deoxyloganetin glucosyltransferase-like [Cucumis melo][more]
XP_023529848.11.3e-23279.187-deoxyloganetin glucosyltransferase-like [Cucurbita pepo subsp. pepo][more]
XP_022927659.12.3e-23279.357-deoxyloganetin glucosyltransferase-like [Cucurbita moschata][more]
XP_022988757.11.5e-23180.767-deoxyloganetin glucosyltransferase-like [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
AT1G22370.26.4e-16556.57UDP-glucosyl transferase 85A5[more]
AT1G22340.16.0e-16356.14UDP-glucosyl transferase 85A7[more]
AT1G22360.11.6e-16055.30UDP-glucosyl transferase 85A2[more]
AT1G22400.16.5e-15752.34UDP-Glycosyltransferase superfamily protein[more]
AT1G22380.11.1e-15653.78UDP-glucosyl transferase 85A3[more]
Match NameE-valueIdentityDescription
sp|F8WKW1|UGT2_GARJA4.7e-18163.087-deoxyloganetin glucosyltransferase OS=Gardenia jasminoides OX=114476 GN=UGT85A... [more]
sp|F8WLS6|UGT6_CATRO8.9e-17257.917-deoxyloganetin glucosyltransferase OS=Catharanthus roseus OX=4058 GN=UGT85A23 ... [more]
sp|Q6VAB3|U85A8_STERE3.7e-17059.20UDP-glycosyltransferase 85A8 OS=Stevia rebaudiana OX=55670 GN=UGT85A8 PE=2 SV=1[more]
sp|Q9LMF0|U85A5_ARATH1.2e-16356.57UDP-glycosyltransferase 85A5 OS=Arabidopsis thaliana OX=3702 GN=UGT85A5 PE=2 SV=... [more]
sp|Q9LME8|U85A7_ARATH1.1e-16156.14UDP-glycosyltransferase 85A7 OS=Arabidopsis thaliana OX=3702 GN=UGT85A7 PE=2 SV=... [more]
Match NameE-valueIdentityDescription
tr|A0A0A0K1Z5|A0A0A0K1Z5_CUCSA6.2e-28797.57Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G059660 PE=4 SV=1[more]
tr|A0A1S3C1P0|A0A1S3C1P0_CUCME1.6e-27493.527-deoxyloganetin glucosyltransferase-like OS=Cucumis melo OX=3656 GN=LOC10349544... [more]
tr|A0A0A0K793|A0A0A0K793_CUCSA9.1e-21480.96UDP-glucuronosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_7G058650 PE=4 SV=1[more]
tr|A0A2N9HN87|A0A2N9HN87_FAGSY3.8e-18866.67Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS41317 PE=3 SV=1[more]
tr|A0A2N9GQT2|A0A2N9GQT2_FAGSY2.8e-18663.98Glycosyltransferase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS29690 PE=3 SV=1[more]
The following terms have been associated with this gene:
Vocabulary: Biological Process
TermDefinition
GO:0008152metabolic process
Vocabulary: Molecular Function
TermDefinition
GO:0016758transferase activity, transferring hexosyl groups
Vocabulary: INTERPRO
TermDefinition
IPR002213UDP_glucos_trans
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016758 transferase activity, transferring hexosyl groups

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_7G006280.1CsaV3_7G006280.1mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3DG3DSA:3.40.50.2000coord: 462..467
e-value: 2.3E-152
score: 510.0
coord: 15..276
e-value: 2.3E-152
score: 510.0
NoneNo IPR availableGENE3DG3DSA:3.40.50.2000coord: 277..461
e-value: 2.3E-152
score: 510.0
NoneNo IPR availablePANTHERPTHR11926:SF462UDP-GLYCOSYLTRANSFERASE 85A1-RELATEDcoord: 11..481
NoneNo IPR availablePANTHERPTHR11926GLUCOSYL/GLUCURONOSYL TRANSFERASEScoord: 11..481
NoneNo IPR availableSUPERFAMILYSSF53756UDP-Glycosyltransferase/glycogen phosphorylasecoord: 14..477
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePFAMPF00201UDPGTcoord: 287..443
e-value: 8.8E-24
score: 84.0

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None