Tan0010472 (gene) Snake gourd v1

Overview
NameTan0010472
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGlycosyltransferase
LocationLG11: 29193365 .. 29206116 (+)
RNA-Seq ExpressionTan0010472
SyntenyTan0010472
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GGAAAAGAAAGAGATGACAGGCAGAAAAAGGAAAATGGAAATGGGAGAAGGCAAAGTTCACATTCTTGTGATTCCGTTCCCAGAAGGACAAGGTCACATAAACCCCATCCTCCAATTCTCAAAACGGCTGGCATTGAAAGGGCTAAAGGTCACAGTCCTCAACGTCCTGAAAACTCATGAAATCAACAATAATATTATTAATGACCTGGATCAGCTCGGGGGCTGCGGCGGGTCCATTATTACGGTGGAGCACAAGCCACGAGTGGCGTACAAAGGCAGAGAAGCAGAGTCAATGGAGTCCCACATGCACCGTCTTCAAACATCCACATGCTTTCACTTAACAAACCTCATCACTCAACATCAAACCTCCGACGCCCCAATTGCCTGCGTTCTCTATGATTCCCTCACGCCTTGGGTTCTGGATGTTGCTAGAGCTTTTGGCCTTCCTGGGGCTTCCTTTTTAACCGACTCCTGCGCTGTTAATGCCATTTTTTACCACATTAACCGTGGCTCCTTCAAGATTAATCCCATTGAGTTTGAGGATGAAATGACTGTTGTGTCCTTGCCTTGCCTCCCGCTTCTTCACGTCTATGATCTTCCTTCCCTCATTCCTAATCCCAACCAATATCCAGTATTTCTCAGGTTCATGATCGACCAGTTCTGTAACCAGCCTGATTGGATGTTCATCAACACGTTCCTTGCCTTGGAGCCACAAGTAATCTATCTATCCAACTCTCCTACACTTTTTTCTTCTTTTTTTTTTTTTCAGGGGGTTAAGTTATATGCTTCTTTCGGAAATAAAAAAAAAAAATCATGCCTTAGGAACCACAAGTAATCTATCCATATCCAACACTCTCCATGCCTTTGGAACCACAATTAATATACCCATTTAGTTTTTACTTTTCTATTTTTAAAAATTATACTAATTTCTGTTTTCTTTTTTTGAAAACATATAAATATATAAAAAAAAGTATGTATAGGTTTAATTTTACAAATATGAAAATTAAAAACTAAGGTTAGGTTAGATAACTTTAAGTTTATAAATATTACTTACATCTCTAAGTTTTCTTATTTGTTATCTAATTTTTAGTTATGTTTTAAAAAATCAAGCTAAAATTTGAAACTTAAAAGGAACATTTTCAAAACTTTATTTTTTTTTGTAAAATTGGACTAAAAATTCAAATATTTTTTTTTGCTAAAAAAATATGAAACTAATTTCTCTATAATGAGAAAATAATACAATTTTTAAAAATCAAAAATCAAAAATCAAAATGGTTATTTAGACTAAAATTTGACTAAATTTTTTAAAATCTTTTAAAAACAGGTAATAAAATTTTTTAAAAAACTATATTTGCTAAGTAAGGTTATCGAATAGAGTTTCTCATTCACTATATAGTAACAATTATCTTTCACGAGAAGTCTATATGTTTTCTTTTTTTTAAAACAAAAGAAAAAAATTGCATTCAAAATTTTAATGTTGGTTAAACATATTTGGAAAAAGAGTTTTATTGGGCTTTTTGGGAATAATTCACCATGATTAAAATTTTGGCAGCTCTTTGTTTTTGTAAAAAAGGAAAAAAGTTTTATCTTTCTCCTAAATCCTAATTATATAAAACATGACGATGATCCCGAAATAATATCATTTGCGGTATGAAGTTCGCAACAAAAAAAAAAAAAAAAGTACTTGGATGTTATTGGACTCCATCTATTTATTTTACTTTCTACTTGATTTTTTACACAATTTTTTTTTTTTACTTTTTAATTTTTTTAAGAAAATTTGTAGTTGGTGCTAGGACCCAAGTTGTGCCCACAACAAAATTGAATTAAACGCGGCAGTAAATTTTTTAGCAGACCAATAAGATAATGACAGCTAGCTTTTCTCTTTTTCTTTTTTGACATCCACAAGGATTCTCTGTCCAAGAATGGGAGGCTAGGCCACCTTTTGTTGTTTGGCTTTATCCTTTAAATTCATTTCATCACCAACAATGATTTTTTCTTTGTTTGCCTTTATCGGTTAAATTCCTTCCATGGCCAACAATGGTGATTTTTTCTTTGAATTCTCACAAACAATGAATTAAGATCGTAAGTAAAATCTTTTGCTATCAAACTACTAGAAAATTTTAATTTTAAACGACAAATTGTTGAATGAAGTTGTCTCTTAATAATAATAGTTAGATAGATACAAACGTATTTCTATATAATTCTGATTTTCATACATTTAATTGTATAAAGATAGCAATTATATTATTGTCGCAGTTTTCTCAAATAATTCTTCACTTAATCAGTGGTGGATCTTTAAAGTTGTCCATACTAGAGCTTGAACCGAGACTTGATAGGAGGACTAGAAACATACTGACCACTAAGTAAAAAGACTCTTTTGTGTTTTAATTTAATTGTAGAATATTTGAGAGAGTCTCGAGTCCTCTTTGAGCCTAAATTAAATTCGCGCCCATACATACCTTTCCCTCCCTTTAAATTTGGTGGTCTTCTTTTTGTCTTTTTTTTTTTAAATTAAATTCTATCCAGATTTTACTTTATTTATTTATTTTTATATTTGGATTATCGAATTTTTTTTATATATCAATTTTAAATTGTTTAAAGACCGTTTGATGATTTTTTTTTAATATTGTGATTGACTAATAATTCTTCATAATTTTAAACTAATATTCTTTTTATTTTTATTTCAAACAGTTTATTGTCATATTTCTGAAAAAAATTATGTAAATTCTAATCTATAATTACAATATTTAAAGTTCAAACTTTTAACTGGGGAGAAAATTTAGAATACAACTTTATATTAACCAATTAAACTATACTATGCTTATGTCCAATCCGTTGTAATAAAGTCTTTAGTTTTAAAAACGTTAAAAGAACATTTTTTTTTCTGTACTTTTGAGTTTGATTCTATTTTACTCCCTATATTTTTAAATGTATAATTTTAGTTTATGTACTTTCAATAAATCTTGAAATTAGTGTATATACTTAGTTTATTGTTAAATTTTTCAAAACATACTAAGTCTCTATTAGTTTTTCCTTTCTAAATTTTGGAAACATTTTCATAAGAAATCCTTTATTACAGGAAAATTATTATTATTATTTAACGAAAAAAAACTTTGAATGATTACTTTTAAGATAATATTGAAAATTTAAAGATTAAATTAGACATTTTGAAAGTACACAAACTTAAATAGAATGAAATAAAAAGTACGTCAATTTATCAAAATATGAATTTGAGTCGATATTTTTAATTTCTTTTTTGGCATAAAATTTAATTTGAATTTAGTAGATTTTTATGCTCTACTCTAATTTAACACAAATGTGGGATAAAGTTAAAAAACTTTATCCAAAAATTTTAAAATTGGGTTAGGACTTGGAAACTTCTAATAAAATTTGACATCTTTCCTCCAACCTCGTCCTAGGATAAAATTTAGATTTCAACCTTGACTTAAAATAAAAATTGGATGATGTCCAAATCTTGATTTGAAGAAAGTTCTTACCCAATATAATAAATTTTGGATTATGCCCAAATTTTGTCTTGAGGGAAAATTATAATTTGGAATTAAAATAGAATTTGAATTTGAATTTGAGTTTCAATTCAGTTTTGATTTGGGATGAAATTTGGATTATTCCCAAATTTTTGTTTGTGGAAAATTTCTACCAAATTATAATTTGGAATTGGAATTGGAATAGAAATAGAAATAGAAATTGAATTTCAATTTTAATCAAATTTTGATTTGGGATAAAATTTGGATTATCCCAAATTTTGGTTGGAGGAAAATTTCTACTCAATTATAATTTGGAATTGGAATTGGAATAGAAATAGAAATTGAATTTTAATTTCAATCAAATTTTGATTTGGGATAAAATTTGGATTATCCCCAAATTTTTGTTGGAGGAAAATTTCTACCCAATTATAATTTGGAATTGAAGTATAATTTAGATTTGGATTTGAATTTCTTTCCAATTTGATTTGAAGGAATTTTGACTAAATTTAGAATAGAATATTATTTCTATTCAAATTAGAAATTGACTCAATCTCGTTGAGCTTTTTTAGAGATTCATTTGGATTTGAAACCAAGTTGGCTGTGATCCTGATTTCATATTTCTCCGGAATTGAGTCTTCCATTCATTCATAACAAGAAAGAGTGGGTGGAACAAAGTTTGATTCTTACTCCAATCTTCATATTGAAGAGTTTTCTTTTCGTGTGCATACCAAACACACTTTTTTTTTCTTTTTTTTTTTACTTTTTTTTAATTTTTTTCGATTCTTTTTATTTAACTTTTTTTTCTTTCCAACATTTTTTAAGACTTTTTTTTCCTAACTTTTTATTTCTTTCTTTTATTACTTTTTTCTTTTTTTTTCAAACTTTTTTTTTGCTACTTTTTTTTTCAATTCAATTATTTTCTCTCAGATTAAAATCACCCTACTAATATTCCTTTGTAAAAGGGAATACATCCATCCACACTCCAAAAAGAGGTAAGACAAGTAAATGATTTTTTTCAGGTAAATGATTTTTTCCCATGCATTTTTCTCTCTTATTTTTATATTTTTTTTGTCAGGACGAACCTCAAGCATAGTGACGCTTGAGTATGGTATCTGCCCTCAAGTGTTCACTCCTCACTTCAACCAGACGAATAATTTATCTGCAATGTGACACTTTAGTTATAGCATCCGCCCTCACGTGTTCATCCTCTATGAAGTGACACTTGAGTTATGGCATCCACCATCAAGTGTTCACTCCTCACTTCAATAAAACGAAGCATTTATCTGCAATGTGACACTTAAGTTATGGCATCCACCCTCAAGTGTTCACCCTCTATGAAGTGACACTTGAGTTATGGCATCCACCCTCAATTGTTCACCCTCTATGAAGTGACACTTGAGTTATGACATCTGCCCTCAAGTGTTCACCCTCTATGAAGTGACACTTGAGTTATGACATCCGCCCTCAAGTGTTCACCCTCTATGAAGTGACACTTGAGTTATGACATCCACCCTCAAGTGTTCACCCTCTATGAAGTGACACTTGAGTTATGGCATCCACCCCCAGTATTCACCCTCTATGAAGTGACACTTGAGTTATGACATCCTCGCTCAGTGTTCACCCTTTATGAAGTGATGTTTGGGTTATGGCATCCGCCCTATAGTGTTCACTCTCTATGAAGTGATGCTTGGGTTATGGCATTCACCCTTAAGTGTTTACTCTCTATGAAGTGGCGCTTGGGTTATGACATCCACCCTCAAGTTTCACCCTCTATGAAGTGACACTTGAGTTATGACATCCACCCTTAAGTGTTCACTTGAGTTATGGCATCTGCCCTCAAGTGTCCATCCTCTATAAAGTTACACTTGAGTATGGCATTTGCCCCCCCAAGTGTTCACCCTCTATGAAGTGACACTTGAGTAGATTCTCAAGTGTTTAGTCTGTAGCAACTACGAGGTGACGCTTGAGTAAATCCTCAAGTGTTCACTCTGAAGCAACTACGAGGTGGTGCTTGAGTAGATCCTCAAGTGTCCACTCTGAAGCACATACGAGGTGACGCTTGAGCATGGAGACTTGAAGACGAAGTGCCTCCAAAAGTCAGACTTTGAAGACGAAGTACCGCCAAAAGTCAGACTTTGAAGACGAAGTGCCGCCAAAAGTCAGACTCTGAAGACAAAATACCGCCAAAAGTTAGACATTGAAGACGAAGCGCCAAAAGTCAGACCTTGAAGACGTAGCGCCGAAGCCGAACCTTGAAGATGAAGTCACCACAACCAGACCTTGAAGATGAAGGCGCCGAAAGAAGGACCTTGAAGATGAAGCCGTCGGAAGAAGGCTTTGAAGATGAAACCCTTTGAACCTGGGGGAAGAGGCGACAAAGTCCCTCGAACTTGGGGAGGAAGCAACGAAGTCTCTCTCAAACTTGGAGAGGAAGCAACGAAGTCTCTCTCGAACTTGGAGAGGAAGCAATGAAGCCCCTCGAATTTGGAGAGGAAGCAAGGAAACCCCTCAAATTTTTAGAGGAAGCGATAAAATCCCTCAAACTTGGAGAGAAAGCGACGAAGTCCCTCGAACTTAGAGAGCAACCTTTAATTTGTGTTATCCTCACAAGGAATTTGCAGTGTTGTTATCTACAACAGGAATAGGCATCTTCAATTTGTGGTATTGCCCCCAACAAATAGTTAATCTTCAACTTCAATATCGAGTCTAAGTAGTGATGACAATTGTTTTTTTTATTGTGAGAGATTGAACTTAGAGTTTGTCTTTAAGTTGCCTACGTACCCTTTGAATGTAAGGGATCAAGTCATAACATAGTTCAAAGGATTTTTTTTTTCGTTCACCTTTTAAGCTCGTGCGGGCCATGAGCGTCGTGTAGTTTAGGCTCTTGCGACAAAGAGGTCTTCGATTTTAGCTGATTTTTCTCTCATCATGATCTTGACCAGATTCTTCATTTGTTGGATTTGTTATAATGACGAGATTCTTCTTCACCTTCAAGGAGCCCTTTGTATTTATGAGAATTGAGAACTTCCTTTTCATACGTGAATGAATGATACTATGAATCTTGTTGTGGTTCACTTTATCGAACAATTCTTCCTTCAAGAATTTCATCTTTTTTTCTTGTTGATCGCTTGTTACTTCGAGACAATAAAAAACAGATGTTGAAGGTCGATATTTCTTTGATGTGGAGTCATTTAACCTTTGGAAGGCTGAAGGTCGAGTTAAGATAGGAGTTGGGTCTTGGTCTTCTTCTTCTGTCGCGACCATACTCATCGTTTGGAAGACTGAAGGTCGAGTAGTTGAAGGCTTGATGTGATCGGAGACGGAGGTCCTTTGCTTGACTTCTTTTTCGTCTTCAGCTTCTTTTGCTTCAATATAGTAAACATTTGTTGAGTTTTCTTTATCTTCTTTCATGTGAAGAAAACTTCTAGAGAAGAATTCTTTTAAGGTCACCGATTTTTCAGGTTGAGAATATTTTTTGTTTCTCTTCGTTTCAGACTTGGGTTTCCTTGAATCCTTCTTTCTTTGACGTCTACCCTCATGTTTGTGGTTTCTAGATTGAATTAAACACACTCCTTGGTTGTGAAGTAGACTAGAGTGTATTTGACTACATGTGTATGTCATCATTGCAACATGATTTGTTTGAGCTAATTCATCAAGGTGTAGCTCAATTTTTCCTTCTTTAACTAATCTTAGAATTAGCTCTTTTAGGGCGAAGCATTTTTTCACGGGATGACTGATAACTCGATGATACTTGCAATACTTGGGATCATCGACTTTTTCCATCTCTTCGGGGCGCTTACATTCTGGAAGTTCGATCAGTTATGCGTCCAGTAGTTGTTCCAACATATCAGACATGTTAGAATCAGGAAAGGGATAGATTTTGTTTTGTGTTTCCTTTAAACTTAAGCGACGCTTTCTATTTTCTTGTCGCTTTTCAAATTTCTTGTATTTTTGTTTTGAAGACAACTTGAGAGGAGTTATACTGACAACTATAGACTCTTATGAAGTTTCCTCGATGTACTCTCCTTCCTTCTTCATGCTAGAGATTGGAAAGTCTTGATCTATTTGGCTAGCAATACTCAACTCCATATCGTGGGCGCGAGTTGCTAGCTCTTCAAAAGTACGAGGCTTTATTCCTTGGAAGATGTACAGGAGTCCCAAGTGCATGCCTTGAATGCACATTTCGACAACATATAATTCAGTGAGACGATCTTTGCAATCAAGACTTGTGGCTCTCCAACGATTTATGTAGTTGACAACGGATTCACCTTTTCGTTGTCTGGTGTTGGTGAGCTCGAACATGCTGACGACTCGTCTTATGTTGTAGAAGCGATTTAGAAACTCCTTTTCAAACTGTTCCCAACTGTCTATTGACTCAGGCTCCAGATCAGTGTACCAGTCAAAATCATTTCCTTTTAATGTTCGTACGAACTGCTTGACTAGTAGATCTCCTCGCGTGCTGACATTTTCACATGTCTCGATTAAATGCGCAACATGTTGCTTGGGGTTGCCTTTTCCATCGAATTGTTTGAACTTTGGTGGATGATATCCAGTTGGCATTCTCAAATTTTCGATCCTCTTAGTATACGGTTTGGAATACAAAAGGGAACCTTGAGAAACTCCACCATACTGCGCTCTGATGGAGTTCGTTATCATGTCCTGAAGTTGTTGGACAGATAATTCAGCGATTGAAGTGGATGGTTGTGGTTGATCTTGTTGCACAACATTCTTGGAAGGTTGTGGTTGATCTTCTTGCACAACATTCTTTCCTTTACTTTGATATCTAACAACAGGAGTTTGATTTGATTCAACGACATCTTGATTCTCAATTCAACGACATCTTGATTCTCAATCTGACTCTTTAAGTATGCGATTTGAGAATCTCTTTCTTTAATCGCCTTCATTAAGTCGTTGATTTGTTCCTGCATCCTCGCCATTGTTTCAGAATTAGAATCAAACAAAGGGTTAAACTTGATCAAGAACTTGTTGTTAGTTGATGCCCTTAACAGTGTCATTACTTCATTTTCTTGCGACCTCAACTCATTGGAGCAGCTGTGGGTAATAGAACCCATGTATGAGTTGCTCATAAGAGAGATCATTGAAGAAGTCTGCCTGAATGTCATCACTTCACTTCTGCTTTGAGATGAGAGAAAGAGATGAGAGGTAGAGATGGTCCCACTGGGCGTGCCAATTTGTTCACACGAGAATTCAATCAACGAATGGGAATAAGACACGTGTAATGAGCTTGAATTTGCATTTATTTGAATATAGTGGATTACAATCTCTAATATTTCTCCTATGGTTATTTCACTCTCATATACAACTGTTTCTAGGAATTTGCAGTTTGTTGAAGGTCTTCAGCTTATAGCTTGTTGAAGGTCTTCAGCTTATAACTTGTTGAAGGTCTTCGGCTTATAAGCTTGTTGAAGATCGTCGGCTTGTAGCTTGTTGAAGGTCTTCGGCTTGTAAGCTTGTTGAAGATCTTCGACTTGTAGCTTGTTGAAGGTTTTCGGCTTGTAAGCTTGTTGAAAATCTTCGACTTGTAATCTTTTCTCGAGGGCTGAAGAACTTGTTCTCAAGGTTGAAACTTCACGGGATGCTTGGAGTGTTCTTGATCTTCGTAAAGCGTTCTTTGTTCAGGAAAGTATTTGATCTTCAGGAGTACAAAGAGTGTTTTAAAATTTATTGAGAATTCTCTCTGGATTTCTCTAGAGTCTTCCACTTCTGAACCTTTGTATGTTTTACTCGAGGGCTACAAAGCTTGTTCTCGAGATTGAAGTTGCAATGGGTTCTTTAGACTCGAAGAGATGTTCCTCAGATTTGAAGAATGTTCTTGATCTTCGAAGCAATGTTCTTCGTTCAAAAAATGTTTGATCTTTAAGAATACAAAGAGTCTTCTAGTTTGTGAATTCTCTCAGGAGTCTCAAATTTCTGAACCTTCGTATGTTTTACTCGAGGGCTACAAAGCTTGTTCTCAAGATTAAAGTTGCAATGGGTTCTTTAGACTCGAAGAGATGTTCTTCAGATTCGAAGAGATGTTCTTCAGATTCGAAGAGATGTTCCTCAGATTTGAAGAATATTATTGATCTTCGAAACAGTGTTCTTTGTTCAGAAAATGTTTGATCTTTAAGAATACAAAGAGTCTTCTGGTTTGTGAATTCTCTCTAGAGTCTCAAATTTTTGAACTTTCGTAAATGAAAGAGTAAGGTCTCTATTTATAGAGTTCCTCTGGGCCTCTAGGTGGGCTTTGACCCATTAGACTGGGTTTGCTTTTTGTTGGGTCAACCCTTTGAACCGGGCTAGGTTGGGTTTGGGCCAACCATTTTTGGTTAGGTCAACCATTTTGGGCTTGACCGGTTTGAGCTTTTTTTTGTCCTAGATTAGGTCTGATTGATTTTGCGCCTACGTCTCTAGACTAGGCTTGGACCTTGTTAACTGGGCCTTTAATTTGGGCTTTTGTCTGCCAAATTTTGAGAAACCAAATTATGTATCTGAGTTTGCCATAATTTAATTTAGGGATTCCTGCAAATTTCATCAATCCGTAATTAAATATAATTTCGCCATCAATGTTAGCCGATGACGTGGCAAGTTTGAATTTGTCAAAATTTCTATTCCAACCGTTTTTATCCAGTAAAAAGTTGGAGATATTAGTTAATAAAACTAATTTGTAAATAATTTAAAAAATAATCAATTAATCTTTAACAATAATAAAATTGATGCTTCAAACATTTGTAATAAAAAAATGTGTAGATTGACTAAGGTAGAGATTGTAGATTTGAATTTCCGTTGAAAATAAGAAAAATCGACTAGTGAAGACAATTTTACTAATAAAAACCAAATGTGAAACGAAAAAAAAAAAAAACAAATAAACCCAACTGTTATTACCTTTTTCTACCCACTTTATTTATTACAATTGTTGTTGTTGTTTTTTTTAATTTCTTTAACTTGTACTTATTTTGATTTCCTTAAATAAATAAATGTAAGAAAAATATATAAAATGAAAAATATATGGTATTGTAGAGAAGTTAATTCTTTTTTATTGATATTTTTATTTCATCATTTATTTTCTTAAAATTGAAAGATTTAAATGTAATTACTTATTAGCAAAGTATATGTATAACTCAATAGTAATTGGTATGTACCTCAAGACGTTGGATGTTTGAATCTTCCACTCTTAAATGTTGTTGAATTAAAAAAAATGTAATTACACATTTCTTTAAATTTTCCAATGGATTCAAATAAAAAATTTATGCAAGCATGGTGGTTATCTTCTAGTGTAGCGAGATAAACAATAACTCTAAGACTTATTTGATTTTTTATTTTTGAAAATTAAGTTTATAAATACTACTTTGAAGTTTTATCTTGGATTTCTTGTTCTTGTATCAATTTTTTGCATTTGTTTTAGAAAATCAAGCTAAGTTTTGAAAAACTAAAAAATTTAGAAGTTTTTTTAATTTTCTATTTTAAATAAGTAGCCTAAAGGAATGAAACTTATGATTTTTCATTTTGTATCTTATTTAATATCTCTTTCAAAATCTTAGCAATGGGATGCTTTCTAATTTTCTATTAAAAAAAGTAGCTTAAAAGAAAGAAACTTGTGACTTTTTCTTTTGCTTCTTTATTTAATATTTCTTTCTAAATTTAGCAGTATAATACATATTTTAATTAATAGATTCCTTTTCTAAAAAAAAACTAGAAATTTTAATTGAATTTAGGTTAAATTAAAAATTTACTTATTTAACTTTTAAGTTTGTGTCTATTTCTTTGTGAACTTTAAAAGTGTATAATAAGTGTCTATTTCTTTGTGAACTTTAAAAGTGTATAATAAGTGTCTAAATATTAAATTTGTACATATTTAATCCCATTGGATAACTATTTGGTTTTTGAAAATTGTGTTGTTTTCTCACAAATTTTCTACAAAGATCTTCATTTTCTCTACAATTATTTTCATTTTTCTTAAGAAAACACTTGAATTTCTTAGCCAATTAAATTTAAAAAACAAAAATAAGTTTTTGAAAACTACTTTTTTTTTCTAGTTTTCAAAACTTGACTTGGTTTTTTAAAACACTTTTTGATAAGAAAACATGATAAATCGTATGTAGAAGTAGTATTTATAGGCATAATTTTAAAAAAAAACCAAATAGTTATCAAACGAGCTTATTTTTTTTAGTTTTCAAAACATGACTTATTTTTTAAAATACTTGTAAAAAGTAGATATCGAAGTAAGGAAACTCTTAGATGAAAGTAACATTTATAGACTTAATTTTTAAAAACCAAATAATTATCAAACGGAATTTTATAACTTTCAATTTTGTATCCAATAGAACCAAACTCATATTTGAAACTAACATTTCTCATTGAAAGAAAAGCTTCAAAATCCATGGATTAATAACCGCTTCTTCTCATACGAATACAGGTAATACAGTGGATGCAAAGCCACACACCACTGAAGACAGTTGGACCAACAGTTCCATCCATACTCATCGACAAGAGGATGAAGGACGACTATCATTACGGAACGAATCTAATCAAATCAACCGAAGACGACAACGAAATCATCGAGTGGTTGGACTCTAAAGACAGCAACTCGGTCATTTACGTGTCGTTCGGGAGCGTATCGGAGCTTGGAGAAGAGCAAATGAAGGAGTTAGCATGGGGCCTCAAAGCAACCAACACAAACTTCGTTTGGGTCATTAAGGAAATCGAAACCCCAAAGCTTCCAAACAACGTTTTTGAAGAGATGAAAGAGATGGGGATGGTGGTGAAGTGGTGCTCCCAAGTGCAAGTTTTAGCTCACAAATCAGTGGGGTGTTTTGTTACACACTGTGGTTGGAACTCAGTTTTAGAAGCCCTTAGCTCTGGAGTTCCAATGGTGGCAATGCCTCAGTGGACGGACCAAATGACAAATGCAAAGTTTGTGGAGGATGTGTGGAAGGTTGGTGTGAGGGTTAGTACAAAGGAAAATGGGATAGTTGGAAGAGAAGAGATAGAATTGTGTATTAGGAGAGTGATGGAGGGAGAGATAAGCCTTGAGATCAGACAAAATGCAATCATGTGGATGAACTTGGCGAAGGAAGCTGTGACTGAAGATGGAACCTCTGAAAAGAACATTGATGAGTTTGTCGCACAACTCAAGGGTCTTAAAATATGATTCTGCTTTTTGTGTTTTAAACTTGGATATTAAGATGGTGAATAGAATAGTGTTTCTAAATATTAAGCTAGTGTTTCTAAATATTAATCTAATGTAATGTTATGTTATTGATATAA

mRNA sequence

GGAAAAGAAAGAGATGACAGGCAGAAAAAGGAAAATGGAAATGGGAGAAGGCAAAGTTCACATTCTTGTGATTCCGTTCCCAGAAGGACAAGGTCACATAAACCCCATCCTCCAATTCTCAAAACGGCTGGCATTGAAAGGGCTAAAGGTCACAGTCCTCAACGTCCTGAAAACTCATGAAATCAACAATAATATTATTAATGACCTGGATCAGCTCGGGGGCTGCGGCGGGTCCATTATTACGGTGGAGCACAAGCCACGAGTGGCGTACAAAGGCAGAGAAGCAGAGTCAATGGAGTCCCACATGCACCGTCTTCAAACATCCACATGCTTTCACTTAACAAACCTCATCACTCAACATCAAACCTCCGACGCCCCAATTGCCTGCGTTCTCTATGATTCCCTCACGCCTTGGGTTCTGGATGTTGCTAGAGCTTTTGGCCTTCCTGGGGCTTCCTTTTTAACCGACTCCTGCGCTGTTAATGCCATTTTTTACCACATTAACCGTGGCTCCTTCAAGATTAATCCCATTGAGTTTGAGGATGAAATGACTGTTGTGTCCTTGCCTTGCCTCCCGCTTCTTCACGTCTATGATCTTCCTTCCCTCATTCCTAATCCCAACCAATATCCAGTATTTCTCAGGTTCATGATCGACCAGTTCTGTAACCAGCCTGATTGGATGTTCATCAACACGTTCCTTGCCTTGGAGCCACAAGTAATACAGTGGATGCAAAGCCACACACCACTGAAGACAGTTGGACCAACAGTTCCATCCATACTCATCGACAAGAGGATGAAGGACGACTATCATTACGGAACGAATCTAATCAAATCAACCGAAGACGACAACGAAATCATCGAGTGGTTGGACTCTAAAGACAGCAACTCGGTCATTTACGTGTCGTTCGGGAGCGTATCGGAGCTTGGAGAAGAGCAAATGAAGGAGTTAGCATGGGGCCTCAAAGCAACCAACACAAACTTCGTTTGGGTCATTAAGGAAATCGAAACCCCAAAGCTTCCAAACAACGTTTTTGAAGAGATGAAAGAGATGGGGATGGTGGTGAAGTGGTGCTCCCAAGTGCAAGTTTTAGCTCACAAATCAGTGGGGTGTTTTGTTACACACTGTGGTTGGAACTCAGTTTTAGAAGCCCTTAGCTCTGGAGTTCCAATGGTGGCAATGCCTCAGTGGACGGACCAAATGACAAATGCAAAGTTTGTGGAGGATGTGTGGAAGGTTGGTGTGAGGGTTAGTACAAAGGAAAATGGGATAGTTGGAAGAGAAGAGATAGAATTGTGTATTAGGAGAGTGATGGAGGGAGAGATAAGCCTTGAGATCAGACAAAATGCAATCATGTGGATGAACTTGGCGAAGGAAGCTGTGACTGAAGATGGAACCTCTGAAAAGAACATTGATGAGTTTGTCGCACAACTCAAGGGTCTTAAAATATGATTCTGCTTTTTGTGTTTTAAACTTGGATATTAAGATGGTGAATAGAATAGTGTTTCTAAATATTAAGCTAGTGTTTCTAAATATTAATCTAATGTAATGTTATGTTATTGATATAA

Coding sequence (CDS)

ATGACAGGCAGAAAAAGGAAAATGGAAATGGGAGAAGGCAAAGTTCACATTCTTGTGATTCCGTTCCCAGAAGGACAAGGTCACATAAACCCCATCCTCCAATTCTCAAAACGGCTGGCATTGAAAGGGCTAAAGGTCACAGTCCTCAACGTCCTGAAAACTCATGAAATCAACAATAATATTATTAATGACCTGGATCAGCTCGGGGGCTGCGGCGGGTCCATTATTACGGTGGAGCACAAGCCACGAGTGGCGTACAAAGGCAGAGAAGCAGAGTCAATGGAGTCCCACATGCACCGTCTTCAAACATCCACATGCTTTCACTTAACAAACCTCATCACTCAACATCAAACCTCCGACGCCCCAATTGCCTGCGTTCTCTATGATTCCCTCACGCCTTGGGTTCTGGATGTTGCTAGAGCTTTTGGCCTTCCTGGGGCTTCCTTTTTAACCGACTCCTGCGCTGTTAATGCCATTTTTTACCACATTAACCGTGGCTCCTTCAAGATTAATCCCATTGAGTTTGAGGATGAAATGACTGTTGTGTCCTTGCCTTGCCTCCCGCTTCTTCACGTCTATGATCTTCCTTCCCTCATTCCTAATCCCAACCAATATCCAGTATTTCTCAGGTTCATGATCGACCAGTTCTGTAACCAGCCTGATTGGATGTTCATCAACACGTTCCTTGCCTTGGAGCCACAAGTAATACAGTGGATGCAAAGCCACACACCACTGAAGACAGTTGGACCAACAGTTCCATCCATACTCATCGACAAGAGGATGAAGGACGACTATCATTACGGAACGAATCTAATCAAATCAACCGAAGACGACAACGAAATCATCGAGTGGTTGGACTCTAAAGACAGCAACTCGGTCATTTACGTGTCGTTCGGGAGCGTATCGGAGCTTGGAGAAGAGCAAATGAAGGAGTTAGCATGGGGCCTCAAAGCAACCAACACAAACTTCGTTTGGGTCATTAAGGAAATCGAAACCCCAAAGCTTCCAAACAACGTTTTTGAAGAGATGAAAGAGATGGGGATGGTGGTGAAGTGGTGCTCCCAAGTGCAAGTTTTAGCTCACAAATCAGTGGGGTGTTTTGTTACACACTGTGGTTGGAACTCAGTTTTAGAAGCCCTTAGCTCTGGAGTTCCAATGGTGGCAATGCCTCAGTGGACGGACCAAATGACAAATGCAAAGTTTGTGGAGGATGTGTGGAAGGTTGGTGTGAGGGTTAGTACAAAGGAAAATGGGATAGTTGGAAGAGAAGAGATAGAATTGTGTATTAGGAGAGTGATGGAGGGAGAGATAAGCCTTGAGATCAGACAAAATGCAATCATGTGGATGAACTTGGCGAAGGAAGCTGTGACTGAAGATGGAACCTCTGAAAAGAACATTGATGAGTTTGTCGCACAACTCAAGGGTCTTAAAATATGA

Protein sequence

MTGRKRKMEMGEGKVHILVIPFPEGQGHINPILQFSKRLALKGLKVTVLNVLKTHEINNNIINDLDQLGGCGGSIITVEHKPRVAYKGREAESMESHMHRLQTSTCFHLTNLITQHQTSDAPIACVLYDSLTPWVLDVARAFGLPGASFLTDSCAVNAIFYHINRGSFKINPIEFEDEMTVVSLPCLPLLHVYDLPSLIPNPNQYPVFLRFMIDQFCNQPDWMFINTFLALEPQVIQWMQSHTPLKTVGPTVPSILIDKRMKDDYHYGTNLIKSTEDDNEIIEWLDSKDSNSVIYVSFGSVSELGEEQMKELAWGLKATNTNFVWVIKEIETPKLPNNVFEEMKEMGMVVKWCSQVQVLAHKSVGCFVTHCGWNSVLEALSSGVPMVAMPQWTDQMTNAKFVEDVWKVGVRVSTKENGIVGREEIELCIRRVMEGEISLEIRQNAIMWMNLAKEAVTEDGTSEKNIDEFVAQLKGLKI
Homology
BLAST of Tan0010472 vs. ExPASy Swiss-Prot
Match: K7NBW3 (Mogroside IE synthase OS=Siraitia grosvenorii OX=190515 GN=UGT74AC1 PE=1 SV=1)

HSP 1 Score: 424.1 bits (1089), Expect = 2.1e-117
Identity = 224/468 (47.86%), Postives = 300/468 (64.10%), Query Frame = 0

Query: 10  MGEGKVHILVIPFPEGQGHINPILQFSKRLALKGLKVTVLNVLKTHEINNNIINDLDQLG 69
           M +G  HILV PFP  QGHINP+LQ SKRL  KG+KV+++  L    ++N++     QL 
Sbjct: 1   MEKGDTHILVFPFP-SQGHINPLLQLSKRLIAKGIKVSLVTTL---HVSNHL-----QLQ 60

Query: 70  GCGGSIITVEHKPRVAYKGREAESMESHMHRLQTSTCFHLTNLITQHQTSDAPIACVLYD 129
           G   + + +E     +    E ++M   + R +     +L + + +   S  P   +LYD
Sbjct: 61  GAYSNSVKIEVISDGSEDRLETDTMRQTLDRFRQKMTKNLEDFLQKAMVSSNPPKFILYD 120

Query: 130 SLTPWVLDVARAFGLPGASFLTDSCAVNAIFYHINRGSFKINPIEFEDEMTVVSLPCLPL 189
           S  PWVL+VA+ FGL  A F T SCA+N+I YH+  G  K+ P     E   +SLP +PL
Sbjct: 121 STMPWVLEVAKEFGLDRAPFYTQSCALNSINYHVLHGQLKLPP-----ETPTISLPSMPL 180

Query: 190 LHVYDLPSLIPNPNQYPVFLRFMIDQFCNQPD--WMFINTFLALEPQVIQWMQS-HTPLK 249
           L   DLP+   +P      +  +  Q+ N  D   +F NTF  LE ++IQWM++   P+K
Sbjct: 181 LRPSDLPAYDFDPASTDTIIDLLTSQYSNIQDANLLFCNTFDKLEGEIIQWMETLGRPVK 240

Query: 250 TVGPTVPSILIDKRMKDDYHYGTNLIKSTEDDNEIIEWLDSKDSNSVIYVSFGSVSELGE 309
           TVGPTVPS  +DKR+++D HYG +L K  ED    ++WLDSK S SV+YVS+GS+ E+GE
Sbjct: 241 TVGPTVPSAYLDKRVENDKHYGLSLFKPNED--VCLKWLDSKPSGSVLYVSYGSLVEMGE 300

Query: 310 EQMKELAWGLKATNTNFVWVIKEIETPKLPNNVFEEMKEMGMVVKWCSQVQVLAHKSVGC 369
           EQ+KELA G+K T   F+WV+++ E  KLP N  E + E G+VV WCSQ++VLAH SVGC
Sbjct: 301 EQLKELALGIKETGKFFLWVVRDTEAEKLPPNFVESVAEKGLVVSWCSQLEVLAHPSVGC 360

Query: 370 FVTHCGWNSVLEALSSGVPMVAMPQWTDQMTNAKFVEDVWKVGVRVSTKENGIVGREEIE 429
           F THCGWNS LEAL  GVP+VA PQW DQ+TNAKF+EDVWKVG RV   E  +  +EE+ 
Sbjct: 361 FFTHCGWNSTLEALCLGVPVVAFPQWADQVTNAKFLEDVWKVGKRVKRNEQRLASKEEVR 420

Query: 430 LCIRRVMEGEISLEIRQNAIMWMNLAKEAVTEDGTSEKNIDEFVAQLK 475
            CI  VMEGE + E + N++ W   AKEAV E G+S+KNI+EFVA LK
Sbjct: 421 SCIWEVMEGERASEFKSNSMEWKKWAKEAVDEGGSSDKNIEEFVAMLK 452

BLAST of Tan0010472 vs. ExPASy Swiss-Prot
Match: Q9SYK9 (UDP-glycosyltransferase 74E2 OS=Arabidopsis thaliana OX=3702 GN=UGT74E2 PE=1 SV=1)

HSP 1 Score: 402.5 bits (1033), Expect = 6.6e-111
Identity = 218/465 (46.88%), Postives = 301/465 (64.73%), Query Frame = 0

Query: 10  MGEGKVHILVIPFPEGQGHINPILQFSKRLALKGLKVTVLNVLKTHEINNNIINDLDQLG 69
           M EG  H++V+PFP GQGHI P+ QF KRLA KGLK+T+  VL + + +     + D   
Sbjct: 1   MREGS-HLIVLPFP-GQGHITPMSQFCKRLASKGLKLTL--VLVSDKPSPPYKTEHDS-- 60

Query: 70  GCGGSIITVEHKPRVAYKGRE-AESMESHMHRLQTSTCFHLTNLITQHQTSDAPIACVLY 129
                 ITV        +G E  + ++ +M R++TS    L  L+   + S  P   ++Y
Sbjct: 61  ------ITVFPISNGFQEGEEPLQDLDDYMERVETSIKNTLPKLVEDMKLSGNPPRAIVY 120

Query: 130 DSLTPWVLDVARAFGLPGASFLTDSCAVNAIFYHINRGSFKINPIEFEDEMTVVSLPCLP 189
           DS  PW+LDVA ++GL GA F T    V AI+YH+ +GSF +   ++    T+ S P  P
Sbjct: 121 DSTMPWLLDVAHSYGLSGAVFFTQPWLVTAIYYHVFKGSFSVPSTKY-GHSTLASFPSFP 180

Query: 190 LLHVYDLPSLIPNPNQYPVFLRFMIDQFCN--QPDWMFINTFLALEPQVIQWMQSHTPLK 249
           +L   DLPS +   + YP  LR ++DQ  N  + D +  NTF  LE ++++W+QS  P+ 
Sbjct: 181 MLTANDLPSFLCESSSYPNILRIVVDQLSNIDRVDIVLCNTFDKLEEKLLKWVQSLWPVL 240

Query: 250 TVGPTVPSILIDKRMKDDYHYGTNLIKSTEDDNEIIEWLDSKDSNSVIYVSFGSVSELGE 309
            +GPTVPS+ +DKR+ +D +YG +L  +     E +EWL+SK+ NSV+Y+SFGS+  L E
Sbjct: 241 NIGPTVPSMYLDKRLSEDKNYGFSLFNAKV--AECMEWLNSKEPNSVVYLSFGSLVILKE 300

Query: 310 EQMKELAWGLKATNTNFVWVIKEIETPKLPNNVFEEMKEMGMVVKWCSQVQVLAHKSVGC 369
           +QM ELA GLK +   F+WV++E ET KLP N  EE+ E G++V W  Q+ VLAHKS+GC
Sbjct: 301 DQMLELAAGLKQSGRFFLWVVRETETHKLPRNYVEEIGEKGLIVSWSPQLDVLAHKSIGC 360

Query: 370 FVTHCGWNSVLEALSSGVPMVAMPQWTDQMTNAKFVEDVWKVGVRVSTKENGIVGREEIE 429
           F+THCGWNS LE LS GVPM+ MP WTDQ TNAKF++DVWKVGVRV  + +G V REEI 
Sbjct: 361 FLTHCGWNSTLEGLSLGVPMIGMPHWTDQPTNAKFMQDVWKVGVRVKAEGDGFVRREEIM 420

Query: 430 LCIRRVMEGEISLEIRQNAIMWMNLAKEAVTEDGTSEKNIDEFVA 472
             +  VMEGE   EIR+NA  W  LA+EAV+E G+S+K+I+EFV+
Sbjct: 421 RSVEEVMEGEKGKEIRKNAEKWKVLAQEAVSEGGSSDKSINEFVS 450

BLAST of Tan0010472 vs. ExPASy Swiss-Prot
Match: P0C7P7 (UDP-glycosyltransferase 74E1 OS=Arabidopsis thaliana OX=3702 GN=UGT74E1 PE=3 SV=1)

HSP 1 Score: 389.0 bits (998), Expect = 7.5e-107
Identity = 213/465 (45.81%), Postives = 294/465 (63.23%), Query Frame = 0

Query: 10  MGEGKVHILVIPFPEGQGHINPILQFSKRLALKGLKVTVLNVLKTHEINNNIINDLDQLG 69
           M EG  H++V+PFP  QGHI P+ QF KRLA K LK+T++ V           +D     
Sbjct: 1   MREGS-HVIVLPFP-AQGHITPMSQFCKRLASKSLKITLVLVSDKPSPPYKTEHD----- 60

Query: 70  GCGGSIITVEHKPRVAYKGRE-AESMESHMHRLQTSTCFHLTNLITQHQTSDAPIACVLY 129
                 ITV        +G+E +E ++ +M R+++S    L  LI   + S  P   ++Y
Sbjct: 61  -----TITVVPISNGFQEGQERSEDLDEYMERVESSIKNRLPKLIEDMKLSGNPPRALVY 120

Query: 130 DSLTPWVLDVARAFGLPGASFLTDSCAVNAIFYHINRGSFKINPIEFEDEMTVVSLPCLP 189
           DS  PW+LDVA ++GL GA F T    V+AI+YH+ +GSF +   ++    T+ S P LP
Sbjct: 121 DSTMPWLLDVAHSYGLSGAVFFTQPWLVSAIYYHVFKGSFSVPSTKY-GHSTLASFPSLP 180

Query: 190 LLHVYDLPSLIPNPNQYPVFLRFMIDQFCN--QPDWMFINTFLALEPQVIQWMQSHTPLK 249
           +L+  DLPS +   + YP  LR +IDQ  N  + D +  NTF  LE ++++W++S  P+ 
Sbjct: 181 ILNANDLPSFLCESSSYPYILRTVIDQLSNIDRVDIVLCNTFDKLEEKLLKWIKSVWPVL 240

Query: 250 TVGPTVPSILIDKRMKDDYHYGTNLIKSTEDDNEIIEWLDSKDSNSVIYVSFGSVSELGE 309
            +GPTVPS+ +DKR+ +D +YG +L  +     E +EWL+SK  +SV+YVSFGS+  L +
Sbjct: 241 NIGPTVPSMYLDKRLAEDKNYGFSLFGA--KIAECMEWLNSKQPSSVVYVSFGSLVVLKK 300

Query: 310 EQMKELAWGLKATNTNFVWVIKEIETPKLPNNVFEEMKEMGMVVKWCSQVQVLAHKSVGC 369
           +Q+ ELA GLK +   F+WV++E E  KLP N  EE+ E G+ V W  Q++VL HKS+GC
Sbjct: 301 DQLIELAAGLKQSGHFFLWVVRETERRKLPENYIEEIGEKGLTVSWSPQLEVLTHKSIGC 360

Query: 370 FVTHCGWNSVLEALSSGVPMVAMPQWTDQMTNAKFVEDVWKVGVRVSTKENGIVGREEIE 429
           FVTHCGWNS LE LS GVPM+ MP W DQ TNAKF+EDVWKVGVRV    +G V REE  
Sbjct: 361 FVTHCGWNSTLEGLSLGVPMIGMPHWADQPTNAKFMEDVWKVGVRVKADSDGFVRREEFV 420

Query: 430 LCIRRVMEGEISLEIRQNAIMWMNLAKEAVTEDGTSEKNIDEFVA 472
             +  VME E   EIR+NA  W  LA+EAV+E G+S+KNI+EFV+
Sbjct: 421 RRVEEVMEAEQGKEIRKNAEKWKVLAQEAVSEGGSSDKNINEFVS 450

BLAST of Tan0010472 vs. ExPASy Swiss-Prot
Match: W8JMV4 (UDP glycosyltransferase 9 OS=Catharanthus roseus OX=4058 GN=UGT9 PE=2 SV=1)

HSP 1 Score: 371.3 bits (952), Expect = 1.6e-101
Identity = 208/467 (44.54%), Postives = 293/467 (62.74%), Query Frame = 0

Query: 15  VHILVIPFPEGQGHINPILQFSKRLALKGLKVTVLNVLKT-HEINNNIINDLDQLGGCGG 74
           +HIL  PFP  +GHINP+L    RLA KG K+T++  + T   +  +  N +D      G
Sbjct: 13  IHILAFPFP-AKGHINPLLHLCNRLASKGFKITLITTVSTLKSVKTSKANGIDIESIPDG 72

Query: 75  SIITVEHKPRVAYKGREAESMESHMHRLQTSTCFHLTNLITQHQTSDAPIACVLYDSLTP 134
                 H+     +     +ME +  + + S   + T LI + +T + P   ++YDS  P
Sbjct: 73  IPQEQNHQIITVME----MNMELYFKQFKASAIENTTKLIQKLKTKNPPPKVLIYDSSMP 132

Query: 135 WVLDVARAFGLPGASFLTDSCAVNAIFYHINRGSFKINPIEFEDEMTVVSLPCLPLLHVY 194
           W+L+VA   GL GASF T  C+V+AI+YH+ +G+ K+ P+E   E  +VSLP LPLL   
Sbjct: 133 WILEVAHEQGLLGASFFTQPCSVSAIYYHMLQGTIKL-PLE-NSENGMVSLPYLPLLEKK 192

Query: 195 DLPSLIPNPNQYPVFLRFMIDQFCN--QPDWMFINTFLALEPQVIQWMQSHTPLKTVGPT 254
           DLP +    +        + DQF N    D++  NTF ALE +V+ WM S  P+ TVGPT
Sbjct: 193 DLPGVQQFEDNSEALAELLADQFSNIDDVDYVLFNTFDALEIEVVNWMGSKWPILTVGPT 252

Query: 255 VPS--ILIDKRMKDDYHYGTNLIKSTEDDNEI-IEWLDSKDSNSVIYVSFGSVSELGEEQ 314
            P+   L+DK+ K +Y  G ++    E + E+ ++WLD ++ ++VIYVSFGS++ L EEQ
Sbjct: 253 APTSMFLLDKKQK-NYEDGRSINYLFETNTEVCMKWLDQREIDTVIYVSFGSLASLTEEQ 312

Query: 315 MKELAWGLKATNTNFVWVIKEIETPKLPNNVFE-EMKEMGMVVKWCSQVQVLAHKSVGCF 374
           M++++  L  +N  F+WV++E E  KLP +  E   K+ G+V+ WC Q+ VLAHKSV CF
Sbjct: 313 MEQVSQALIRSNCYFLWVVREEEENKLPKDFKETTSKKKGLVINWCPQLDVLAHKSVACF 372

Query: 375 VTHCGWNSVLEALSSGVPMVAMPQWTDQMTNAKFVEDVWKVGVRVS-TKENGIVGREEIE 434
           +THCGWNS LEAL SGVPM+ MPQW DQ TNAK +E VWK+GV V+ + ENGIV RE+IE
Sbjct: 373 MTHCGWNSTLEALCSGVPMICMPQWADQTTNAKLIEHVWKIGVGVNKSDENGIVKREDIE 432

Query: 435 LCIRRVMEGEISLEIRQNAIMWMNLAKEAVTEDGTSEKNIDEFVAQL 474
            CIR+V+E E   E+++NAI W  LAKEAV+E G+S  NI EF + L
Sbjct: 433 DCIRQVIESERGKELKRNAIKWKELAKEAVSEGGSSYNNIQEFSSSL 471

BLAST of Tan0010472 vs. ExPASy Swiss-Prot
Match: O22822 (UDP-glycosyltransferase 74F2 OS=Arabidopsis thaliana OX=3702 GN=UGT74F2 PE=1 SV=1)

HSP 1 Score: 370.2 bits (949), Expect = 3.6e-101
Identity = 204/466 (43.78%), Postives = 293/466 (62.88%), Query Frame = 0

Query: 16  HILVIPFPEGQGHINPILQFSKRLALKGLKVTVLNVLKTHEINNNIINDLDQLGGCGGSI 75
           H+L +P+P  QGHI P  QF KRL  KGLK T   +  T  + N+I  DL       G I
Sbjct: 7   HVLAVPYPT-QGHITPFRQFCKRLHFKGLKTT---LALTTFVFNSINPDL------SGPI 66

Query: 76  ITVEHKPRVAYKGRE-AESMESHMHRLQTSTCFHLTNLITQHQTSDAPIACVLYDSLTPW 135
                     + G E A+S++ ++   +TS    + ++I +HQTSD PI C++YD+  PW
Sbjct: 67  SIATISDGYDHGGFETADSIDDYLKDFKTSGSKTIADIIQKHQTSDNPITCIVYDAFLPW 126

Query: 136 VLDVARAFGLPGASFLTDSCAVNAIFY--HINRGSFKINPIEFEDEMTVVSLPCLPLLHV 195
            LDVAR FGL    F T  CAVN ++Y  +IN GS ++ PIE            LP L +
Sbjct: 127 ALDVAREFGLVATPFFTQPCAVNYVYYLSYINNGSLQL-PIEE-----------LPFLEL 186

Query: 196 YDLPSLIPNPNQYPVFLRFMIDQFCN--QPDWMFINTFLALEPQVIQWMQSHTPLKTVGP 255
            DLPS       YP +   ++ QF N  + D++ +N+F  LE    +      P+ T+GP
Sbjct: 187 QDLPSFFSVSGSYPAYFEMVLQQFINFEKADFVLVNSFQELELHENELWSKACPVLTIGP 246

Query: 256 TVPSILIDKRMKDDYHYGTNLIKSTEDDNEIIEWLDSKDSNSVIYVSFGSVSELGEEQMK 315
           T+PSI +D+R+K D  Y  NL +S +DD+  I WLD++   SV+YV+FGS+++L   QM+
Sbjct: 247 TIPSIYLDQRIKSDTGYDLNLFES-KDDSFCINWLDTRPQGSVVYVAFGSMAQLTNVQME 306

Query: 316 ELAWGLKATNTNFVWVIKEIETPKLPNNVFEEM-KEMGMVVKWCSQVQVLAHKSVGCFVT 375
           ELA  +  +N +F+WV++  E  KLP+   E + KE  +V+KW  Q+QVL++K++GCF+T
Sbjct: 307 ELASAV--SNFSFLWVVRSSEEEKLPSGFLETVNKEKSLVLKWSPQLQVLSNKAIGCFLT 366

Query: 376 HCGWNSVLEALSSGVPMVAMPQWTDQMTNAKFVEDVWKVGVRVST-KENGIVGREEIELC 435
           HCGWNS +EAL+ GVPMVAMPQWTDQ  NAK+++DVWK GVRV T KE+GI  REEIE  
Sbjct: 367 HCGWNSTMEALTFGVPMVAMPQWTDQPMNAKYIQDVWKAGVRVKTEKESGIAKREEIEFS 426

Query: 436 IRRVMEGEISLEIRQNAIMWMNLAKEAVTEDGTSEKNIDEFVAQLK 475
           I+ VMEGE S E+++N   W +LA +++ E G+++ NID FV++++
Sbjct: 427 IKEVMEGERSKEMKKNVKKWRDLAVKSLNEGGSTDTNIDTFVSRVQ 447

BLAST of Tan0010472 vs. NCBI nr
Match: XP_023538720.1 (UDP-glycosyltransferase 74E2-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 732.6 bits (1890), Expect = 2.1e-207
Identity = 366/472 (77.54%), Postives = 405/472 (85.81%), Query Frame = 0

Query: 8   MEMGEGKVHILVIPFPEGQGHINPILQFSKRLALKGLKVTVLNVLKTHEINNNIINDLDQ 67
           ME+GEGKVHILVIPFP+GQGH+NPILQFSKRL LKGLKVTVLN   THEINNN I  L+Q
Sbjct: 1   MEVGEGKVHILVIPFPDGQGHVNPILQFSKRLVLKGLKVTVLN---THEINNNAI--LNQ 60

Query: 68  LGGCGGSIITVEHKPRVAYKGREAESMESHMHRLQTSTCFHLTNLITQHQTSDAPIACVL 127
           +GG GGS I VE+KPR  YKGR+ E++E + HRLQTSTCFHL  LIT HQTS+APIACV+
Sbjct: 61  VGGWGGS-INVENKPREPYKGRDPETVEFYFHRLQTSTCFHLVKLITHHQTSNAPIACVV 120

Query: 128 YDSLTPWVLDVARAFGLPGASFLTDSCAVNAIFYHINRGSFKINPIEFEDEMTVVSLPCL 187
           YDSLTPWVLDVAR FGLPGA F T+SCAVNA+FYHI  GS KI       +   VSLP L
Sbjct: 121 YDSLTPWVLDVARGFGLPGAPFFTESCAVNALFYHIYCGSLKI-----PSDKKSVSLPAL 180

Query: 188 PLLHVYDLPSLIPNPNQYPVFLRFMIDQFCNQPDWMFINTFLALEPQVIQWMQSHTPLKT 247
           PLL   DLPSLI NP+QYPVFLR M +QFCNQPDWMFINTF ALEPQV+QWMQSH PLKT
Sbjct: 181 PLLQDTDLPSLISNPHQYPVFLRMMTEQFCNQPDWMFINTFHALEPQVLQWMQSHMPLKT 240

Query: 248 VGPTVPSILIDKRMKDDYHYGTNLIKSTEDDNEIIEWLDSKDSNSVIYVSFGSVSELGEE 307
           VGPTVPSILIDK + DD +YG NLIKSTEDD++ IEWLDSKDS S+IYVSFGSVSELGEE
Sbjct: 241 VGPTVPSILIDKGLMDDNNYGMNLIKSTEDDSKTIEWLDSKDSESIIYVSFGSVSELGEE 300

Query: 308 QMKELAWGLKATNTNFVWVIKEIETPKLPNNVFEEMKEM-GMVVKWCSQVQVLAHKSVGC 367
           QMKE+AWGLKA+N NF+WVIKE+ET +LPN   EEMKEM G VVKWCSQVQVL HKSVGC
Sbjct: 301 QMKEIAWGLKASNKNFLWVIKEMETGELPNKFVEEMKEMKGKVVKWCSQVQVLGHKSVGC 360

Query: 368 FVTHCGWNSVLEALSSGVPMVAMPQWTDQMTNAKFVEDVWKVGVRVSTKENGIVGREEIE 427
           FVTHCGWNSVLE LSSGVPMVAMPQWTDQ+TNAKFVEDVWKVGVRVS+ +NG+VGREEIE
Sbjct: 361 FVTHCGWNSVLEGLSSGVPMVAMPQWTDQITNAKFVEDVWKVGVRVSSNQNGLVGREEIE 420

Query: 428 LCIRRVMEGEISLEIRQNAIMWMNLAKEAVTEDGTSEKNIDEFVAQLKGLKI 479
           LCIR+VMEGE  +E+RQNA  WM LAKEA+TEDG+S KNIDEFVAQ++  KI
Sbjct: 421 LCIRKVMEGEKRIEMRQNASKWMKLAKEAMTEDGSSNKNIDEFVAQVQERKI 461

BLAST of Tan0010472 vs. NCBI nr
Match: XP_022953232.1 (UDP-glycosyltransferase 74E2-like [Cucurbita moschata])

HSP 1 Score: 713.0 bits (1839), Expect = 1.7e-201
Identity = 355/472 (75.21%), Postives = 395/472 (83.69%), Query Frame = 0

Query: 8   MEMGEGKVHILVIPFPEGQGHINPILQFSKRLALKGLKVTVLNVLKTHEINNNIINDLDQ 67
           M +GEGKVHILVIPFP+GQGH+NPILQFSKRL LKGLKVT   VL THEI NN   +L+ 
Sbjct: 1   MAVGEGKVHILVIPFPDGQGHVNPILQFSKRLVLKGLKVT---VLITHEIINNA--NLNH 60

Query: 68  LGGCGGSIITVEHKPRVAYKGREAESMESHMHRLQTSTCFHLTNLITQHQTSDAPIACVL 127
           +GG  G  I VE+KPRV YKG + E +ES++HRLQ ST FHL  LIT HQTS++PIACV+
Sbjct: 61  VGGGWGGSINVENKPRVPYKGTDPEPLESYIHRLQISTSFHLVKLITHHQTSNSPIACVV 120

Query: 128 YDSLTPWVLDVARAFGLPGASFLTDSCAVNAIFYHINRGSFKINPIEFEDEMTVVSLPCL 187
           YDSLTPWVLDVAR FGLPGA F T+SCAVNA+FYHI  GS KI       +   VSLP L
Sbjct: 121 YDSLTPWVLDVARGFGLPGAPFFTESCAVNAVFYHIYSGSLKI-----PSDKKSVSLPAL 180

Query: 188 PLLHVYDLPSLIPNPNQYPVFLRFMIDQFCNQPDWMFINTFLALEPQVIQWMQSHTPLKT 247
           PLL   DLPSLI NP+QYPVFLR M  QFCNQPDWMFINTF ALEPQV+QWMQ+HTPLK 
Sbjct: 181 PLLQDTDLPSLISNPHQYPVFLRMMTHQFCNQPDWMFINTFHALEPQVLQWMQTHTPLKA 240

Query: 248 VGPTVPSILIDKRMKDDYHYGTNLIKSTEDDNEIIEWLDSKDSNSVIYVSFGSVSELGEE 307
           VGPTVPSILIDK + DD +YG NLIKSTEDD++ IEWLDSKDS SVIYVSFGSVSELGEE
Sbjct: 241 VGPTVPSILIDKGLMDDNNYGMNLIKSTEDDSKTIEWLDSKDSESVIYVSFGSVSELGEE 300

Query: 308 QMKELAWGLKATNTNFVWVIKEIETPKLPNNVFEEMKEM-GMVVKWCSQVQVLAHKSVGC 367
           QMKE+AWGLKA+N NF+WVIKE+ET +LPN   EEMKEM G VVKWCSQVQVL HKSVGC
Sbjct: 301 QMKEIAWGLKASNKNFLWVIKEMETGELPNKFVEEMKEMKGKVVKWCSQVQVLGHKSVGC 360

Query: 368 FVTHCGWNSVLEALSSGVPMVAMPQWTDQMTNAKFVEDVWKVGVRVSTKENGIVGREEIE 427
           F+THCGWNSVLE LSSGVPMVAMPQWTDQ+TNAKFVEDVWK+GVRVS  +NG+VGREEIE
Sbjct: 361 FITHCGWNSVLEGLSSGVPMVAMPQWTDQITNAKFVEDVWKIGVRVSPNQNGLVGREEIE 420

Query: 428 LCIRRVMEGEISLEIRQNAIMWMNLAKEAVTEDGTSEKNIDEFVAQLKGLKI 479
           LCIR+VMEGE   E+RQN  MWM LAKEA+TEDG+S KNIDEFVAQ++  KI
Sbjct: 421 LCIRKVMEGEKRFEMRQNTSMWMKLAKEAMTEDGSSNKNIDEFVAQIQERKI 462

BLAST of Tan0010472 vs. NCBI nr
Match: XP_023538707.1 (UDP-glycosyltransferase 74E2-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 707.2 bits (1824), Expect = 9.5e-200
Identity = 353/472 (74.79%), Postives = 396/472 (83.90%), Query Frame = 0

Query: 8   MEMGEGKVHILVIPFPEGQGHINPILQFSKRLALKGLKVTVLNVLKTHEINNNIINDLDQ 67
           M +GEGKVHILVIPFP+GQGH+NPILQFSKRL LKGLKVTVLN   THEI NN   +L+Q
Sbjct: 1   MAVGEGKVHILVIPFPDGQGHVNPILQFSKRLVLKGLKVTVLN---THEIINNA--NLNQ 60

Query: 68  LGGCGGSIITVEHKPRVAYKGREAESMESHMHRLQTSTCFHLTNLITQHQTSDAPIACVL 127
           +GG  G  I VE+KPRV YKG   E  ES++ RLQ ST FHL  LIT HQTS++PIACV+
Sbjct: 61  VGGGWGGSINVENKPRVPYKGTHPEPFESYILRLQISTSFHLVKLITHHQTSNSPIACVV 120

Query: 128 YDSLTPWVLDVARAFGLPGASFLTDSCAVNAIFYHINRGSFKINPIEFEDEMTVVSLPCL 187
           YDSLTPWVLDVAR FGLPGA F T+SCAVNA+FYHI  GS KI       +   VSLP L
Sbjct: 121 YDSLTPWVLDVARGFGLPGAPFFTESCAVNAVFYHIYSGSLKI-----PSDKKSVSLPAL 180

Query: 188 PLLHVYDLPSLIPNPNQYPVFLRFMIDQFCNQPDWMFINTFLALEPQVIQWMQSHTPLKT 247
           PLL   DLPSLI NP+QYPVFLR M +QFCNQPDWMFINTF ALEPQV+QWMQ+HTPLKT
Sbjct: 181 PLLQDTDLPSLISNPHQYPVFLRMMTEQFCNQPDWMFINTFHALEPQVLQWMQTHTPLKT 240

Query: 248 VGPTVPSILIDKRMKDDYHYGTNLIKSTEDDNEIIEWLDSKDSNSVIYVSFGSVSELGEE 307
           VGPTVPSIL+DK + DD +YG +LIKST++D++IIEWLDSKDS SVIYVSFGSVS LGEE
Sbjct: 241 VGPTVPSILLDKGLMDDNNYGMSLIKSTKEDSKIIEWLDSKDSESVIYVSFGSVSMLGEE 300

Query: 308 QMKELAWGLKATNTNFVWVIKEIETPKLPNNVFEEMKEM-GMVVKWCSQVQVLAHKSVGC 367
           QMKE+AWGLKA+N NF+WVIKE+ET ++PN   EEMKEM G VVKWCSQVQVL HKSVGC
Sbjct: 301 QMKEIAWGLKASNKNFLWVIKEMETGEIPNKFVEEMKEMKGKVVKWCSQVQVLGHKSVGC 360

Query: 368 FVTHCGWNSVLEALSSGVPMVAMPQWTDQMTNAKFVEDVWKVGVRVSTKENGIVGREEIE 427
           FVTHCGWNSVLE LS GVPMVAMPQWTDQ+TNAKFVEDVWKVGVRVS  +NG+VGREEIE
Sbjct: 361 FVTHCGWNSVLEGLSGGVPMVAMPQWTDQITNAKFVEDVWKVGVRVSPNQNGLVGREEIE 420

Query: 428 LCIRRVMEGEISLEIRQNAIMWMNLAKEAVTEDGTSEKNIDEFVAQLKGLKI 479
           LCIR+VMEGE  +E+RQNA  WM LAKEA+TEDG+S KNIDEFVAQ++  KI
Sbjct: 421 LCIRKVMEGEKRVEMRQNASKWMKLAKEAMTEDGSSNKNIDEFVAQIQERKI 462

BLAST of Tan0010472 vs. NCBI nr
Match: XP_038888325.1 (UDP-glycosyltransferase 74E2-like [Benincasa hispida] >XP_038888326.1 UDP-glycosyltransferase 74E2-like [Benincasa hispida] >XP_038888327.1 UDP-glycosyltransferase 74E2-like [Benincasa hispida])

HSP 1 Score: 625.2 bits (1611), Expect = 4.8e-175
Identity = 321/466 (68.88%), Postives = 367/466 (78.76%), Query Frame = 0

Query: 12  EGKVHILVIPFPEGQGHINPILQFSKRLALKGLKVTVLNVLKTHEINNNIINDLDQLGGC 71
           EGKVHILVIPFP+ QGHINPILQFSKRLA KGLKVT+LNVL  HE N     ++    GC
Sbjct: 7   EGKVHILVIPFPDAQGHINPILQFSKRLAFKGLKVTLLNVL--HESNPTYELNVGGGDGC 66

Query: 72  GGSIITVEHKPRVAYKGREAESMESHMHRLQTSTCFHLTNLITQHQTSDAPIACVLYDSL 131
              II VE +PR  Y GRE ES+ES+MHRL+TS CFHLT+L+TQ Q+S++P   V+YDSL
Sbjct: 67  SNFIINVEERPRAPYNGREPESIESYMHRLKTSICFHLTSLVTQQQSSNSPFVYVVYDSL 126

Query: 132 TPWVLDVARAFGLPGASFLTDSCAVNAIFYHINRGSFKINPIEFEDEMTVVSLPCLPLLH 191
            PW+LDVA AFGL GA F T S AVNAIFYHIN GSFK+   E     T V LP LPLLH
Sbjct: 127 MPWILDVATAFGLRGAPFFTQSSAVNAIFYHINHGSFKLPVAE-----TGVLLPGLPLLH 186

Query: 192 VYDLPS-LIPNPNQYPVFLRFMIDQFCNQPDWMFINTFLALEPQVIQWMQSHTPLKTVGP 251
             DLPS LIPNP   P FL+ MIDQ  + PDWMFIN+F ALE Q I+WMQ H PLKTVGP
Sbjct: 187 ASDLPSLLIPNPQHNPFFLKLMIDQLHDLPDWMFINSFHALETQAIEWMQRHIPLKTVGP 246

Query: 252 TVPSILIDKRMK-DDYHYGTNLIKSTEDDN-EIIEWLDSKDSNSVIYVSFGSVSELGEEQ 311
           T+PSI+IDK +K DD++Y  NL KSTE+DN +I+EWLDSK  NSVIYVS G+ S L EEQ
Sbjct: 247 TIPSIMIDKELKIDDHNYRMNLTKSTENDNSKIMEWLDSKVHNSVIYVSLGTTSNLREEQ 306

Query: 312 MKELAWGLKATNTNFVWVIKEIETP-KLPNNVFEEMKEMGMVVKWCSQVQVLAHKSVGCF 371
           M+ELAWGLKATN  F+WVIKE ETP KLP+N  EE+K MGMVVKWCSQV VLAHKS+GCF
Sbjct: 307 MEELAWGLKATNKTFLWVIKEAETPNKLPHNFVEELKGMGMVVKWCSQVHVLAHKSIGCF 366

Query: 372 VTHCGWNSVLEALSSGVPMVAMPQWTDQMTNAKFVEDVWKVGVRVSTKENGIVGREEIEL 431
           VTHCGWNSVLEA++ GVPMV+MPQWTDQMTNAKFVEDVWK+GVRV+ K+NGIV R+EIEL
Sbjct: 367 VTHCGWNSVLEAIACGVPMVSMPQWTDQMTNAKFVEDVWKIGVRVNPKQNGIVRRQEIEL 426

Query: 432 CIRRVMEGEISLEIRQNAIMWMNLAKEAVTEDGTSEKNIDEFVAQL 474
           CIR+VMEG+ SLEIRQNA  WM L      +D TS+ NID+FV QL
Sbjct: 427 CIRKVMEGKKSLEIRQNATKWMKL----TAQDQTSDDNIDDFVTQL 461

BLAST of Tan0010472 vs. NCBI nr
Match: XP_022157642.1 (UDP-glycosyltransferase 74E2-like isoform X1 [Momordica charantia])

HSP 1 Score: 617.1 bits (1590), Expect = 1.3e-172
Identity = 304/469 (64.82%), Postives = 375/469 (79.96%), Query Frame = 0

Query: 5   KRKMEMGEGKVHILVIPFPEGQGHINPILQFSKRLALKGLKVTVLNVLKTHEINNNIIND 64
           +R+ E G  K+H+LV+P  +GQGHINPILQFSKRLA KGL VT+LN+L+ +  NNN    
Sbjct: 3   EREGEEG-NKIHVLVVPLSDGQGHINPILQFSKRLAFKGLTVTLLNILRHN--NNN---- 62

Query: 65  LDQLGGCGGSIITVEHKPRVAYKGREAESMESHMHRLQTSTCFHLTNLITQHQTSDAPIA 124
            +Q      S I VEH+PR+ Y+G + ESM+SHM RL+ S  FH+T+L+ +H+TS AP+ 
Sbjct: 63  -EQHQHHSHSSIHVEHRPRLPYQGPQPESMDSHMARLRASISFHITDLVARHRTSPAPVR 122

Query: 125 CVLYDSLTPWVLDVARAFGLPGASFLTDSCAVNAIFYHINRGSFKINPIEFEDEMTVVSL 184
           C++YDS+ PWVLDVA+  G+ GA F T+SCAVNAIFYH++ GSF I P+   D    ++L
Sbjct: 123 CLIYDSIMPWVLDVAKGLGVFGAXFFTESCAVNAIFYHLSCGSFTI-PV---DPSFALAL 182

Query: 185 PCLPLLHVYDLPSLIPNPNQYPVFLRFMIDQFCNQPDWMFINTFLALEPQVIQWMQSHTP 244
           P LP L V DLPSL+ +P++Y  FL FM+DQF NQPDWMFINTF +LEPQVI+WMQSHT 
Sbjct: 183 PALPPLRVSDLPSLVSSPDRYSGFLDFMVDQFSNQPDWMFINTFNSLEPQVIEWMQSHTS 242

Query: 245 LKTVGPTVPSILIDKRMKDDYHYGTNLIKSTEDDNEIIEWLDSKDSNSVIYVSFGSVSEL 304
           LKTVGPTVPS + DKR+ +D+ YG +L KS+EDD++I+EWLDSKD NSVIY+SFGSV++L
Sbjct: 243 LKTVGPTVPSTITDKRLTEDHEYGISLFKSSEDDSKIMEWLDSKDRNSVIYMSFGSVTKL 302

Query: 305 GEEQMKELAWGLKATNTNFVWVIKEIETPKLPNNVFEEMKEMGMVVKWCSQVQVLAHKSV 364
           G EQ++ELA GLKAT   F+WV+++ E PKLP N  EEM+E G VV WC Q++VL H+SV
Sbjct: 303 GGEQLEELALGLKATKATFLWVLRDSEIPKLPTNFLEEMEEKGRVVNWCPQLRVLGHESV 362

Query: 365 GCFVTHCGWNSVLEALSSGVPMVAMPQWTDQMTNAKFVEDVWKVGVRVSTKENGIVGREE 424
           GCFVTHCGWNSVLEALS GVPMVAMPQW DQ TNAKFVEDVWKVGVRVS  +NG+VGREE
Sbjct: 363 GCFVTHCGWNSVLEALSLGVPMVAMPQWADQTTNAKFVEDVWKVGVRVSPMKNGVVGREE 422

Query: 425 IELCIRRVMEGEISLEIRQNAIMWMNLAKEAVTEDGTSEKNIDEFVAQL 474
           IELCI+ VMEGE S+E+R+NA  WM LA+EAV EDGTS+KNIDEFVAQL
Sbjct: 423 IELCIKGVMEGERSVEMRENANKWMKLAREAVDEDGTSDKNIDEFVAQL 459

BLAST of Tan0010472 vs. ExPASy TrEMBL
Match: A0A6J1GMP0 (Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111455839 PE=3 SV=1)

HSP 1 Score: 713.0 bits (1839), Expect = 8.4e-202
Identity = 355/472 (75.21%), Postives = 395/472 (83.69%), Query Frame = 0

Query: 8   MEMGEGKVHILVIPFPEGQGHINPILQFSKRLALKGLKVTVLNVLKTHEINNNIINDLDQ 67
           M +GEGKVHILVIPFP+GQGH+NPILQFSKRL LKGLKVT   VL THEI NN   +L+ 
Sbjct: 1   MAVGEGKVHILVIPFPDGQGHVNPILQFSKRLVLKGLKVT---VLITHEIINNA--NLNH 60

Query: 68  LGGCGGSIITVEHKPRVAYKGREAESMESHMHRLQTSTCFHLTNLITQHQTSDAPIACVL 127
           +GG  G  I VE+KPRV YKG + E +ES++HRLQ ST FHL  LIT HQTS++PIACV+
Sbjct: 61  VGGGWGGSINVENKPRVPYKGTDPEPLESYIHRLQISTSFHLVKLITHHQTSNSPIACVV 120

Query: 128 YDSLTPWVLDVARAFGLPGASFLTDSCAVNAIFYHINRGSFKINPIEFEDEMTVVSLPCL 187
           YDSLTPWVLDVAR FGLPGA F T+SCAVNA+FYHI  GS KI       +   VSLP L
Sbjct: 121 YDSLTPWVLDVARGFGLPGAPFFTESCAVNAVFYHIYSGSLKI-----PSDKKSVSLPAL 180

Query: 188 PLLHVYDLPSLIPNPNQYPVFLRFMIDQFCNQPDWMFINTFLALEPQVIQWMQSHTPLKT 247
           PLL   DLPSLI NP+QYPVFLR M  QFCNQPDWMFINTF ALEPQV+QWMQ+HTPLK 
Sbjct: 181 PLLQDTDLPSLISNPHQYPVFLRMMTHQFCNQPDWMFINTFHALEPQVLQWMQTHTPLKA 240

Query: 248 VGPTVPSILIDKRMKDDYHYGTNLIKSTEDDNEIIEWLDSKDSNSVIYVSFGSVSELGEE 307
           VGPTVPSILIDK + DD +YG NLIKSTEDD++ IEWLDSKDS SVIYVSFGSVSELGEE
Sbjct: 241 VGPTVPSILIDKGLMDDNNYGMNLIKSTEDDSKTIEWLDSKDSESVIYVSFGSVSELGEE 300

Query: 308 QMKELAWGLKATNTNFVWVIKEIETPKLPNNVFEEMKEM-GMVVKWCSQVQVLAHKSVGC 367
           QMKE+AWGLKA+N NF+WVIKE+ET +LPN   EEMKEM G VVKWCSQVQVL HKSVGC
Sbjct: 301 QMKEIAWGLKASNKNFLWVIKEMETGELPNKFVEEMKEMKGKVVKWCSQVQVLGHKSVGC 360

Query: 368 FVTHCGWNSVLEALSSGVPMVAMPQWTDQMTNAKFVEDVWKVGVRVSTKENGIVGREEIE 427
           F+THCGWNSVLE LSSGVPMVAMPQWTDQ+TNAKFVEDVWK+GVRVS  +NG+VGREEIE
Sbjct: 361 FITHCGWNSVLEGLSSGVPMVAMPQWTDQITNAKFVEDVWKIGVRVSPNQNGLVGREEIE 420

Query: 428 LCIRRVMEGEISLEIRQNAIMWMNLAKEAVTEDGTSEKNIDEFVAQLKGLKI 479
           LCIR+VMEGE   E+RQN  MWM LAKEA+TEDG+S KNIDEFVAQ++  KI
Sbjct: 421 LCIRKVMEGEKRFEMRQNTSMWMKLAKEAMTEDGSSNKNIDEFVAQIQERKI 462

BLAST of Tan0010472 vs. ExPASy TrEMBL
Match: A0A6J1DTN7 (Glycosyltransferase OS=Momordica charantia OX=3673 GN=LOC111024304 PE=3 SV=1)

HSP 1 Score: 617.1 bits (1590), Expect = 6.3e-173
Identity = 304/469 (64.82%), Postives = 375/469 (79.96%), Query Frame = 0

Query: 5   KRKMEMGEGKVHILVIPFPEGQGHINPILQFSKRLALKGLKVTVLNVLKTHEINNNIIND 64
           +R+ E G  K+H+LV+P  +GQGHINPILQFSKRLA KGL VT+LN+L+ +  NNN    
Sbjct: 3   EREGEEG-NKIHVLVVPLSDGQGHINPILQFSKRLAFKGLTVTLLNILRHN--NNN---- 62

Query: 65  LDQLGGCGGSIITVEHKPRVAYKGREAESMESHMHRLQTSTCFHLTNLITQHQTSDAPIA 124
            +Q      S I VEH+PR+ Y+G + ESM+SHM RL+ S  FH+T+L+ +H+TS AP+ 
Sbjct: 63  -EQHQHHSHSSIHVEHRPRLPYQGPQPESMDSHMARLRASISFHITDLVARHRTSPAPVR 122

Query: 125 CVLYDSLTPWVLDVARAFGLPGASFLTDSCAVNAIFYHINRGSFKINPIEFEDEMTVVSL 184
           C++YDS+ PWVLDVA+  G+ GA F T+SCAVNAIFYH++ GSF I P+   D    ++L
Sbjct: 123 CLIYDSIMPWVLDVAKGLGVFGAXFFTESCAVNAIFYHLSCGSFTI-PV---DPSFALAL 182

Query: 185 PCLPLLHVYDLPSLIPNPNQYPVFLRFMIDQFCNQPDWMFINTFLALEPQVIQWMQSHTP 244
           P LP L V DLPSL+ +P++Y  FL FM+DQF NQPDWMFINTF +LEPQVI+WMQSHT 
Sbjct: 183 PALPPLRVSDLPSLVSSPDRYSGFLDFMVDQFSNQPDWMFINTFNSLEPQVIEWMQSHTS 242

Query: 245 LKTVGPTVPSILIDKRMKDDYHYGTNLIKSTEDDNEIIEWLDSKDSNSVIYVSFGSVSEL 304
           LKTVGPTVPS + DKR+ +D+ YG +L KS+EDD++I+EWLDSKD NSVIY+SFGSV++L
Sbjct: 243 LKTVGPTVPSTITDKRLTEDHEYGISLFKSSEDDSKIMEWLDSKDRNSVIYMSFGSVTKL 302

Query: 305 GEEQMKELAWGLKATNTNFVWVIKEIETPKLPNNVFEEMKEMGMVVKWCSQVQVLAHKSV 364
           G EQ++ELA GLKAT   F+WV+++ E PKLP N  EEM+E G VV WC Q++VL H+SV
Sbjct: 303 GGEQLEELALGLKATKATFLWVLRDSEIPKLPTNFLEEMEEKGRVVNWCPQLRVLGHESV 362

Query: 365 GCFVTHCGWNSVLEALSSGVPMVAMPQWTDQMTNAKFVEDVWKVGVRVSTKENGIVGREE 424
           GCFVTHCGWNSVLEALS GVPMVAMPQW DQ TNAKFVEDVWKVGVRVS  +NG+VGREE
Sbjct: 363 GCFVTHCGWNSVLEALSLGVPMVAMPQWADQTTNAKFVEDVWKVGVRVSPMKNGVVGREE 422

Query: 425 IELCIRRVMEGEISLEIRQNAIMWMNLAKEAVTEDGTSEKNIDEFVAQL 474
           IELCI+ VMEGE S+E+R+NA  WM LA+EAV EDGTS+KNIDEFVAQL
Sbjct: 423 IELCIKGVMEGERSVEMRENANKWMKLAREAVDEDGTSDKNIDEFVAQL 459

BLAST of Tan0010472 vs. ExPASy TrEMBL
Match: A0A6J1DV08 (Glycosyltransferase OS=Momordica charantia OX=3673 GN=LOC111024304 PE=3 SV=1)

HSP 1 Score: 617.1 bits (1590), Expect = 6.3e-173
Identity = 304/469 (64.82%), Postives = 375/469 (79.96%), Query Frame = 0

Query: 5   KRKMEMGEGKVHILVIPFPEGQGHINPILQFSKRLALKGLKVTVLNVLKTHEINNNIIND 64
           +R+ E G  K+H+LV+P  +GQGHINPILQFSKRLA KGL VT+LN+L+ +  NNN    
Sbjct: 3   EREGEEG-NKIHVLVVPLSDGQGHINPILQFSKRLAFKGLTVTLLNILRHN--NNN---- 62

Query: 65  LDQLGGCGGSIITVEHKPRVAYKGREAESMESHMHRLQTSTCFHLTNLITQHQTSDAPIA 124
            +Q      S I VEH+PR+ Y+G + ESM+SHM RL+ S  FH+T+L+ +H+TS AP+ 
Sbjct: 63  -EQHQHHSHSSIHVEHRPRLPYQGPQPESMDSHMARLRASISFHITDLVARHRTSPAPVR 122

Query: 125 CVLYDSLTPWVLDVARAFGLPGASFLTDSCAVNAIFYHINRGSFKINPIEFEDEMTVVSL 184
           C++YDS+ PWVLDVA+  G+ GA F T+SCAVNAIFYH++ GSF I P+   D    ++L
Sbjct: 123 CLIYDSIMPWVLDVAKGLGVFGAXFFTESCAVNAIFYHLSCGSFTI-PV---DPSFALAL 182

Query: 185 PCLPLLHVYDLPSLIPNPNQYPVFLRFMIDQFCNQPDWMFINTFLALEPQVIQWMQSHTP 244
           P LP L V DLPSL+ +P++Y  FL FM+DQF NQPDWMFINTF +LEPQVI+WMQSHT 
Sbjct: 183 PALPPLRVSDLPSLVSSPDRYSGFLDFMVDQFSNQPDWMFINTFNSLEPQVIEWMQSHTS 242

Query: 245 LKTVGPTVPSILIDKRMKDDYHYGTNLIKSTEDDNEIIEWLDSKDSNSVIYVSFGSVSEL 304
           LKTVGPTVPS + DKR+ +D+ YG +L KS+EDD++I+EWLDSKD NSVIY+SFGSV++L
Sbjct: 243 LKTVGPTVPSTITDKRLTEDHEYGISLFKSSEDDSKIMEWLDSKDRNSVIYMSFGSVTKL 302

Query: 305 GEEQMKELAWGLKATNTNFVWVIKEIETPKLPNNVFEEMKEMGMVVKWCSQVQVLAHKSV 364
           G EQ++ELA GLKAT   F+WV+++ E PKLP N  EEM+E G VV WC Q++VL H+SV
Sbjct: 303 GGEQLEELALGLKATKATFLWVLRDSEIPKLPTNFLEEMEEKGRVVNWCPQLRVLGHESV 362

Query: 365 GCFVTHCGWNSVLEALSSGVPMVAMPQWTDQMTNAKFVEDVWKVGVRVSTKENGIVGREE 424
           GCFVTHCGWNSVLEALS GVPMVAMPQW DQ TNAKFVEDVWKVGVRVS  +NG+VGREE
Sbjct: 363 GCFVTHCGWNSVLEALSLGVPMVAMPQWADQTTNAKFVEDVWKVGVRVSPMKNGVVGREE 422

Query: 425 IELCIRRVMEGEISLEIRQNAIMWMNLAKEAVTEDGTSEKNIDEFVAQL 474
           IELCI+ VMEGE S+E+R+NA  WM LA+EAV EDGTS+KNIDEFVAQL
Sbjct: 423 IELCIKGVMEGERSVEMRENANKWMKLAREAVDEDGTSDKNIDEFVAQL 459

BLAST of Tan0010472 vs. ExPASy TrEMBL
Match: A0A6J1DTW4 (Glycosyltransferase OS=Momordica charantia OX=3673 GN=LOC111024304 PE=3 SV=1)

HSP 1 Score: 617.1 bits (1590), Expect = 6.3e-173
Identity = 304/469 (64.82%), Postives = 375/469 (79.96%), Query Frame = 0

Query: 5   KRKMEMGEGKVHILVIPFPEGQGHINPILQFSKRLALKGLKVTVLNVLKTHEINNNIIND 64
           +R+ E G  K+H+LV+P  +GQGHINPILQFSKRLA KGL VT+LN+L+ +  NNN    
Sbjct: 3   EREGEEG-NKIHVLVVPLSDGQGHINPILQFSKRLAFKGLTVTLLNILRHN--NNN---- 62

Query: 65  LDQLGGCGGSIITVEHKPRVAYKGREAESMESHMHRLQTSTCFHLTNLITQHQTSDAPIA 124
            +Q      S I VEH+PR+ Y+G + ESM+SHM RL+ S  FH+T+L+ +H+TS AP+ 
Sbjct: 63  -EQHQHHSHSSIHVEHRPRLPYQGPQPESMDSHMARLRASISFHITDLVARHRTSPAPVR 122

Query: 125 CVLYDSLTPWVLDVARAFGLPGASFLTDSCAVNAIFYHINRGSFKINPIEFEDEMTVVSL 184
           C++YDS+ PWVLDVA+  G+ GA F T+SCAVNAIFYH++ GSF I P+   D    ++L
Sbjct: 123 CLIYDSIMPWVLDVAKGLGVFGAXFFTESCAVNAIFYHLSCGSFTI-PV---DPSFALAL 182

Query: 185 PCLPLLHVYDLPSLIPNPNQYPVFLRFMIDQFCNQPDWMFINTFLALEPQVIQWMQSHTP 244
           P LP L V DLPSL+ +P++Y  FL FM+DQF NQPDWMFINTF +LEPQVI+WMQSHT 
Sbjct: 183 PALPPLRVSDLPSLVSSPDRYSGFLDFMVDQFSNQPDWMFINTFNSLEPQVIEWMQSHTS 242

Query: 245 LKTVGPTVPSILIDKRMKDDYHYGTNLIKSTEDDNEIIEWLDSKDSNSVIYVSFGSVSEL 304
           LKTVGPTVPS + DKR+ +D+ YG +L KS+EDD++I+EWLDSKD NSVIY+SFGSV++L
Sbjct: 243 LKTVGPTVPSTITDKRLTEDHEYGISLFKSSEDDSKIMEWLDSKDRNSVIYMSFGSVTKL 302

Query: 305 GEEQMKELAWGLKATNTNFVWVIKEIETPKLPNNVFEEMKEMGMVVKWCSQVQVLAHKSV 364
           G EQ++ELA GLKAT   F+WV+++ E PKLP N  EEM+E G VV WC Q++VL H+SV
Sbjct: 303 GGEQLEELALGLKATKATFLWVLRDSEIPKLPTNFLEEMEEKGRVVNWCPQLRVLGHESV 362

Query: 365 GCFVTHCGWNSVLEALSSGVPMVAMPQWTDQMTNAKFVEDVWKVGVRVSTKENGIVGREE 424
           GCFVTHCGWNSVLEALS GVPMVAMPQW DQ TNAKFVEDVWKVGVRVS  +NG+VGREE
Sbjct: 363 GCFVTHCGWNSVLEALSLGVPMVAMPQWADQTTNAKFVEDVWKVGVRVSPMKNGVVGREE 422

Query: 425 IELCIRRVMEGEISLEIRQNAIMWMNLAKEAVTEDGTSEKNIDEFVAQL 474
           IELCI+ VMEGE S+E+R+NA  WM LA+EAV EDGTS+KNIDEFVAQL
Sbjct: 423 IELCIKGVMEGERSVEMRENANKWMKLAREAVDEDGTSDKNIDEFVAQL 459

BLAST of Tan0010472 vs. ExPASy TrEMBL
Match: A0A0A0K2F3 (Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_7G051380 PE=3 SV=1)

HSP 1 Score: 563.1 bits (1450), Expect = 1.1e-156
Identity = 316/483 (65.42%), Postives = 363/483 (75.16%), Query Frame = 0

Query: 12  EGKVHILVIPFPEGQGHINPILQFSKRLALKGLKVTVLNVLKTHEINNNIINDLDQLGGC 71
           EGKVHILVIPFP+ QGHINPILQFSKRLA KGLKVT+LN+L  HE N        QL  C
Sbjct: 7   EGKVHILVIPFPDEQGHINPILQFSKRLAFKGLKVTLLNLL--HEKNTTTY----QLSCC 66

Query: 72  G--GSIITVEHKPRVAYKGREAESMESHMHRLQTSTCFHLTNLITQHQTSDAPIACVLYD 131
               S I V  +PR  Y   E ES+ES+MHRL+TS CFHL NL+TQ+Q S+ P + V+YD
Sbjct: 67  SSLNSTINVLERPRAPYNSTEPESIESYMHRLKTSICFHLINLVTQYQNSNFPFSFVVYD 126

Query: 132 SLTPWVLDVARAFGLPGASFLTDSCAVNAIFYHINRGSFKINPIEFEDEMTVVS--LPCL 191
           SL PWVLD+ARAFGL GA F T SCAV AIFYHI  GSFKI P    D+ T VS  LP L
Sbjct: 127 SLMPWVLDLARAFGLRGAPFFTQSCAVIAIFYHIIHGSFKIIP-PVADQTTCVSSLLPGL 186

Query: 192 PL-LHVYDLPSLI------PNPNQYPVFLRFMIDQFCNQPDWMFINTFLALEPQVIQWMQ 251
           PL LH  DLPSL+      P  N  P FL+ MIDQ  + P+ MF+N+F ALE QVI+++Q
Sbjct: 187 PLDLHASDLPSLLLPDNNNPQQNNNPFFLKLMIDQLHDLPELMFVNSFHALETQVIEYLQ 246

Query: 252 SHTPLKTVGPTVPSILIDKRMKDDYH-YGTNLIKSTEDDN-EIIEWLDSKDSNSVIYVSF 311
           S  PLK VGPTVPSILI+K + DD H YG NLI STEDDN +I+ WL+SK  NSVIYVS 
Sbjct: 247 SQMPLKMVGPTVPSILINKELMDDDHDYGMNLINSTEDDNKKIMGWLNSKARNSVIYVSL 306

Query: 312 GS-VSELGEEQMKELAWGLKATNTNFVWVIKEIETPKLPNNVFE-EMKEM-GMVVKWCSQ 371
           G+ +S LGEEQM+ELAWGLKATN  F+WVIKE   P+ PN+ FE E+KEM GMVVKWC Q
Sbjct: 307 GTRISNLGEEQMEELAWGLKATNKPFLWVIKE---PEFPNSFFEKEVKEMHGMVVKWCCQ 366

Query: 372 VQVLAHKSVGCFVTHCGWNSVLEALSSGVPMVAMPQWTDQMTNAKFVEDVWKVGVRVST- 431
           V VL H+SVGCF+THCGWNSVLEA++ GVPMVAMPQW +QMTNAKFVEDVW VGVRVST 
Sbjct: 367 VLVLGHESVGCFMTHCGWNSVLEAITCGVPMVAMPQWGEQMTNAKFVEDVWNVGVRVSTS 426

Query: 432 KENG--IVGREEIELCIRRVMEGEISLEIRQNAIMWMNLAKEAV--TEDGTSEKNIDEFV 474
           KENG  IV REEIELC+R+VMEGE S ++RQN   WM LAKEAV   E+GTS+KNI +FV
Sbjct: 427 KENGMIIVRREEIELCVRKVMEGEKSHKLRQNGRRWMKLAKEAVMINENGTSDKNIHDFV 479

BLAST of Tan0010472 vs. TAIR 10
Match: AT1G05680.1 (Uridine diphosphate glycosyltransferase 74E2 )

HSP 1 Score: 402.5 bits (1033), Expect = 4.7e-112
Identity = 218/465 (46.88%), Postives = 301/465 (64.73%), Query Frame = 0

Query: 10  MGEGKVHILVIPFPEGQGHINPILQFSKRLALKGLKVTVLNVLKTHEINNNIINDLDQLG 69
           M EG  H++V+PFP GQGHI P+ QF KRLA KGLK+T+  VL + + +     + D   
Sbjct: 1   MREGS-HLIVLPFP-GQGHITPMSQFCKRLASKGLKLTL--VLVSDKPSPPYKTEHDS-- 60

Query: 70  GCGGSIITVEHKPRVAYKGRE-AESMESHMHRLQTSTCFHLTNLITQHQTSDAPIACVLY 129
                 ITV        +G E  + ++ +M R++TS    L  L+   + S  P   ++Y
Sbjct: 61  ------ITVFPISNGFQEGEEPLQDLDDYMERVETSIKNTLPKLVEDMKLSGNPPRAIVY 120

Query: 130 DSLTPWVLDVARAFGLPGASFLTDSCAVNAIFYHINRGSFKINPIEFEDEMTVVSLPCLP 189
           DS  PW+LDVA ++GL GA F T    V AI+YH+ +GSF +   ++    T+ S P  P
Sbjct: 121 DSTMPWLLDVAHSYGLSGAVFFTQPWLVTAIYYHVFKGSFSVPSTKY-GHSTLASFPSFP 180

Query: 190 LLHVYDLPSLIPNPNQYPVFLRFMIDQFCN--QPDWMFINTFLALEPQVIQWMQSHTPLK 249
           +L   DLPS +   + YP  LR ++DQ  N  + D +  NTF  LE ++++W+QS  P+ 
Sbjct: 181 MLTANDLPSFLCESSSYPNILRIVVDQLSNIDRVDIVLCNTFDKLEEKLLKWVQSLWPVL 240

Query: 250 TVGPTVPSILIDKRMKDDYHYGTNLIKSTEDDNEIIEWLDSKDSNSVIYVSFGSVSELGE 309
            +GPTVPS+ +DKR+ +D +YG +L  +     E +EWL+SK+ NSV+Y+SFGS+  L E
Sbjct: 241 NIGPTVPSMYLDKRLSEDKNYGFSLFNAKV--AECMEWLNSKEPNSVVYLSFGSLVILKE 300

Query: 310 EQMKELAWGLKATNTNFVWVIKEIETPKLPNNVFEEMKEMGMVVKWCSQVQVLAHKSVGC 369
           +QM ELA GLK +   F+WV++E ET KLP N  EE+ E G++V W  Q+ VLAHKS+GC
Sbjct: 301 DQMLELAAGLKQSGRFFLWVVRETETHKLPRNYVEEIGEKGLIVSWSPQLDVLAHKSIGC 360

Query: 370 FVTHCGWNSVLEALSSGVPMVAMPQWTDQMTNAKFVEDVWKVGVRVSTKENGIVGREEIE 429
           F+THCGWNS LE LS GVPM+ MP WTDQ TNAKF++DVWKVGVRV  + +G V REEI 
Sbjct: 361 FLTHCGWNSTLEGLSLGVPMIGMPHWTDQPTNAKFMQDVWKVGVRVKAEGDGFVRREEIM 420

Query: 430 LCIRRVMEGEISLEIRQNAIMWMNLAKEAVTEDGTSEKNIDEFVA 472
             +  VMEGE   EIR+NA  W  LA+EAV+E G+S+K+I+EFV+
Sbjct: 421 RSVEEVMEGEKGKEIRKNAEKWKVLAQEAVSEGGSSDKSINEFVS 450

BLAST of Tan0010472 vs. TAIR 10
Match: AT1G05675.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 389.0 bits (998), Expect = 5.3e-108
Identity = 213/465 (45.81%), Postives = 294/465 (63.23%), Query Frame = 0

Query: 10  MGEGKVHILVIPFPEGQGHINPILQFSKRLALKGLKVTVLNVLKTHEINNNIINDLDQLG 69
           M EG  H++V+PFP  QGHI P+ QF KRLA K LK+T++ V           +D     
Sbjct: 1   MREGS-HVIVLPFP-AQGHITPMSQFCKRLASKSLKITLVLVSDKPSPPYKTEHD----- 60

Query: 70  GCGGSIITVEHKPRVAYKGRE-AESMESHMHRLQTSTCFHLTNLITQHQTSDAPIACVLY 129
                 ITV        +G+E +E ++ +M R+++S    L  LI   + S  P   ++Y
Sbjct: 61  -----TITVVPISNGFQEGQERSEDLDEYMERVESSIKNRLPKLIEDMKLSGNPPRALVY 120

Query: 130 DSLTPWVLDVARAFGLPGASFLTDSCAVNAIFYHINRGSFKINPIEFEDEMTVVSLPCLP 189
           DS  PW+LDVA ++GL GA F T    V+AI+YH+ +GSF +   ++    T+ S P LP
Sbjct: 121 DSTMPWLLDVAHSYGLSGAVFFTQPWLVSAIYYHVFKGSFSVPSTKY-GHSTLASFPSLP 180

Query: 190 LLHVYDLPSLIPNPNQYPVFLRFMIDQFCN--QPDWMFINTFLALEPQVIQWMQSHTPLK 249
           +L+  DLPS +   + YP  LR +IDQ  N  + D +  NTF  LE ++++W++S  P+ 
Sbjct: 181 ILNANDLPSFLCESSSYPYILRTVIDQLSNIDRVDIVLCNTFDKLEEKLLKWIKSVWPVL 240

Query: 250 TVGPTVPSILIDKRMKDDYHYGTNLIKSTEDDNEIIEWLDSKDSNSVIYVSFGSVSELGE 309
            +GPTVPS+ +DKR+ +D +YG +L  +     E +EWL+SK  +SV+YVSFGS+  L +
Sbjct: 241 NIGPTVPSMYLDKRLAEDKNYGFSLFGA--KIAECMEWLNSKQPSSVVYVSFGSLVVLKK 300

Query: 310 EQMKELAWGLKATNTNFVWVIKEIETPKLPNNVFEEMKEMGMVVKWCSQVQVLAHKSVGC 369
           +Q+ ELA GLK +   F+WV++E E  KLP N  EE+ E G+ V W  Q++VL HKS+GC
Sbjct: 301 DQLIELAAGLKQSGHFFLWVVRETERRKLPENYIEEIGEKGLTVSWSPQLEVLTHKSIGC 360

Query: 370 FVTHCGWNSVLEALSSGVPMVAMPQWTDQMTNAKFVEDVWKVGVRVSTKENGIVGREEIE 429
           FVTHCGWNS LE LS GVPM+ MP W DQ TNAKF+EDVWKVGVRV    +G V REE  
Sbjct: 361 FVTHCGWNSTLEGLSLGVPMIGMPHWADQPTNAKFMEDVWKVGVRVKADSDGFVRREEFV 420

Query: 430 LCIRRVMEGEISLEIRQNAIMWMNLAKEAVTEDGTSEKNIDEFVA 472
             +  VME E   EIR+NA  W  LA+EAV+E G+S+KNI+EFV+
Sbjct: 421 RRVEEVMEAEQGKEIRKNAEKWKVLAQEAVSEGGSSDKNINEFVS 450

BLAST of Tan0010472 vs. TAIR 10
Match: AT2G43820.1 (UDP-glucosyltransferase 74F2 )

HSP 1 Score: 370.2 bits (949), Expect = 2.6e-102
Identity = 204/466 (43.78%), Postives = 293/466 (62.88%), Query Frame = 0

Query: 16  HILVIPFPEGQGHINPILQFSKRLALKGLKVTVLNVLKTHEINNNIINDLDQLGGCGGSI 75
           H+L +P+P  QGHI P  QF KRL  KGLK T   +  T  + N+I  DL       G I
Sbjct: 7   HVLAVPYPT-QGHITPFRQFCKRLHFKGLKTT---LALTTFVFNSINPDL------SGPI 66

Query: 76  ITVEHKPRVAYKGRE-AESMESHMHRLQTSTCFHLTNLITQHQTSDAPIACVLYDSLTPW 135
                     + G E A+S++ ++   +TS    + ++I +HQTSD PI C++YD+  PW
Sbjct: 67  SIATISDGYDHGGFETADSIDDYLKDFKTSGSKTIADIIQKHQTSDNPITCIVYDAFLPW 126

Query: 136 VLDVARAFGLPGASFLTDSCAVNAIFY--HINRGSFKINPIEFEDEMTVVSLPCLPLLHV 195
            LDVAR FGL    F T  CAVN ++Y  +IN GS ++ PIE            LP L +
Sbjct: 127 ALDVAREFGLVATPFFTQPCAVNYVYYLSYINNGSLQL-PIEE-----------LPFLEL 186

Query: 196 YDLPSLIPNPNQYPVFLRFMIDQFCN--QPDWMFINTFLALEPQVIQWMQSHTPLKTVGP 255
            DLPS       YP +   ++ QF N  + D++ +N+F  LE    +      P+ T+GP
Sbjct: 187 QDLPSFFSVSGSYPAYFEMVLQQFINFEKADFVLVNSFQELELHENELWSKACPVLTIGP 246

Query: 256 TVPSILIDKRMKDDYHYGTNLIKSTEDDNEIIEWLDSKDSNSVIYVSFGSVSELGEEQMK 315
           T+PSI +D+R+K D  Y  NL +S +DD+  I WLD++   SV+YV+FGS+++L   QM+
Sbjct: 247 TIPSIYLDQRIKSDTGYDLNLFES-KDDSFCINWLDTRPQGSVVYVAFGSMAQLTNVQME 306

Query: 316 ELAWGLKATNTNFVWVIKEIETPKLPNNVFEEM-KEMGMVVKWCSQVQVLAHKSVGCFVT 375
           ELA  +  +N +F+WV++  E  KLP+   E + KE  +V+KW  Q+QVL++K++GCF+T
Sbjct: 307 ELASAV--SNFSFLWVVRSSEEEKLPSGFLETVNKEKSLVLKWSPQLQVLSNKAIGCFLT 366

Query: 376 HCGWNSVLEALSSGVPMVAMPQWTDQMTNAKFVEDVWKVGVRVST-KENGIVGREEIELC 435
           HCGWNS +EAL+ GVPMVAMPQWTDQ  NAK+++DVWK GVRV T KE+GI  REEIE  
Sbjct: 367 HCGWNSTMEALTFGVPMVAMPQWTDQPMNAKYIQDVWKAGVRVKTEKESGIAKREEIEFS 426

Query: 436 IRRVMEGEISLEIRQNAIMWMNLAKEAVTEDGTSEKNIDEFVAQLK 475
           I+ VMEGE S E+++N   W +LA +++ E G+++ NID FV++++
Sbjct: 427 IKEVMEGERSKEMKKNVKKWRDLAVKSLNEGGSTDTNIDTFVSRVQ 447

BLAST of Tan0010472 vs. TAIR 10
Match: AT2G31750.1 (UDP-glucosyl transferase 74D1 )

HSP 1 Score: 367.5 bits (942), Expect = 1.7e-101
Identity = 211/470 (44.89%), Postives = 286/470 (60.85%), Query Frame = 0

Query: 10  MGE-GKVHILVIPFPEGQGHINPILQFSKRLALKGLKVTVLNVLKTHEINNNIINDLDQL 69
           MGE  K ++LV  FP  QGHINP+LQFSKRL  K + VT L    TH   N+I+      
Sbjct: 1   MGEKAKANVLVFSFPI-QGHINPLLQFSKRLLSKNVNVTFLTTSSTH---NSILRRAITG 60

Query: 70  GGCGGSIITVEHKPRVAYKGREAESMESHMHRLQTSTCFHLTNLITQHQTSDAPIACVLY 129
           G     +  V             ++   +  + Q +    L+ LI+   + D     V+Y
Sbjct: 61  GATALPLSFVPIDDGFEEDHPSTDTSPDYFAKFQENVSRSLSELIS---SMDPKPNAVVY 120

Query: 130 DSLTPWVLDVARAF-GLPGASFLTDSCAVNAIFYHINRGSFKINPIEFEDEMTVVSLPCL 189
           DS  P+VLDV R   G+  ASF T S  VNA + H  RG FK    EF+++   V LP +
Sbjct: 121 DSCLPYVLDVCRKHPGVAAASFFTQSSTVNATYIHFLRGEFK----EFQND---VVLPAM 180

Query: 190 PLLHVYDLPSLIPNPNQYPVFLRFMIDQFCN--QPDWMFINTFLALEPQVIQWMQSHTPL 249
           P L   DLP  + + N        +  QF N    D+  +N+F  LE +V+QWM++  P+
Sbjct: 181 PPLKGNDLPVFLYDNNLCRPLFELISSQFVNVDDIDFFLVNSFDELEVEVLQWMKNQWPV 240

Query: 250 KTVGPTVPSILIDKRMKDDYHYGTNLIKSTEDDNEIIEWLDSKDSNSVIYVSFGSVSELG 309
           K +GP +PS+ +DKR+  D  YG NL  +    NE ++WLDSK   SVIYVSFGS++ L 
Sbjct: 241 KNIGPMIPSMYLDKRLAGDKDYGINLFNA--QVNECLDWLDSKPPGSVIYVSFGSLAVLK 300

Query: 310 EEQMKELAWGLKATNTNFVWVIKEIETPKLPNNVFEEMKEMGMVVKWCSQVQVLAHKSVG 369
           ++QM E+A GLK T  NF+WV++E ET KLP+N  E++ + G++V W  Q+QVLAHKS+G
Sbjct: 301 DDQMIEVAAGLKQTGHNFLWVVRETETKKLPSNYIEDICDKGLIVNWSPQLQVLAHKSIG 360

Query: 370 CFVTHCGWNSVLEALSSGVPMVAMPQWTDQMTNAKFVEDVWKVGVRVSTKENGIVGREEI 429
           CF+THCGWNS LEALS GV ++ MP ++DQ TNAKF+EDVWKVGVRV   +NG V +EEI
Sbjct: 361 CFMTHCGWNSTLEALSLGVALIGMPAYSDQPTNAKFIEDVWKVGVRVKADQNGFVPKEEI 420

Query: 430 ELCIRRVME--GEISLEIRQNAIMWMNLAKEAVTEDGTSEKNIDEFVAQL 474
             C+  VME   E   EIR+NA   M  A+EA+++ G S+KNIDEFVA++
Sbjct: 421 VRCVGEVMEDMSEKGKEIRKNARRLMEFAREALSDGGNSDKNIDEFVAKI 454

BLAST of Tan0010472 vs. TAIR 10
Match: AT2G31790.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 363.2 bits (931), Expect = 3.1e-100
Identity = 195/464 (42.03%), Postives = 276/464 (59.48%), Query Frame = 0

Query: 14  KVHILVIPFPEGQGHINPILQFSKRLALKGLKVTVLNVLKTHEINNNIINDLDQLGGCGG 73
           K H+L  P+P  QGHINP++Q +KRL+ KG+  T++   K H       +D         
Sbjct: 6   KGHVLFFPYPL-QGHINPMIQLAKRLSKKGITSTLIIASKDHR-EPYTSDDYS------- 65

Query: 74  SIITVEHKPRVAYKGREAESMESHMHRLQTSTCFHLTNLITQHQTSDAPIACVLYDSLTP 133
             ITV       +      +    + R   ST   LT+ I+  + SD P   ++YD   P
Sbjct: 66  --ITVHTIHDGFFPHEHPHAKFVDLDRFHNSTSRSLTDFISSAKLSDNPPKALIYDPFMP 125

Query: 134 WVLDVARAFGLPGASFLTDSCAVNAIFYHINRGSFKINPIEFEDEMTVVSLPCLPLLHVY 193
           + LD+A+   L   ++ T     + ++YHIN G++ + P++  +  T+ S P  PLL   
Sbjct: 126 FALDIAKDLDLYVVAYFTQPWLASLVYYHINEGTYDV-PVDRHENPTLASFPGFPLLSQD 185

Query: 194 DLPSLIPNPNQYPVFLRFMIDQFCN--QPDWMFINTFLALEPQVIQWMQSHTPLKTVGPT 253
           DLPS       YP+   F++ QF N  Q D +  NTF  LEP+V++WM    P+K +GP 
Sbjct: 186 DLPSFACEKGSYPLLHEFVVRQFSNLLQADCILCNTFDQLEPKVVKWMNDQWPVKNIGPV 245

Query: 254 VPSILIDKRMKDDYHYGTNLIKSTEDDNEIIEWLDSKDSNSVIYVSFGSVSELGEEQMKE 313
           VPS  +D R+ +D  Y     K TE D  +++WL ++ + SV+YV+FG++  L E+QMKE
Sbjct: 246 VPSKFLDNRLPEDKDYELENSK-TEPDESVLKWLGNRPAKSVVYVAFGTLVALSEKQMKE 305

Query: 314 LAWGLKATNTNFVWVIKEIETPKLPNNVFEEM--KEMGMVVKWCSQVQVLAHKSVGCFVT 373
           +A  +  T  +F+W ++E E  KLP+   EE   K+ G+V KW  Q++VLAH+S+GCFV+
Sbjct: 306 IAMAISQTGYHFLWSVRESERSKLPSGFIEEAEEKDSGLVAKWVPQLEVLAHESIGCFVS 365

Query: 374 HCGWNSVLEALSSGVPMVAMPQWTDQMTNAKFVEDVWKVGVRVSTKENGIVGREEIELCI 433
           HCGWNS LEAL  GVPMV +PQWTDQ TNAKF+EDVWK+GVRV T   G+  +EEI  CI
Sbjct: 366 HCGWNSTLEALCLGVPMVGVPQWTDQPTNAKFIEDVWKIGVRVRTDGEGLSSKEEIARCI 425

Query: 434 RRVMEGEISLEIRQNAIMWMNLAKEAVTEDGTSEKNIDEFVAQL 474
             VMEGE   EIR+N      LA+EA++E G+S+K IDEFVA L
Sbjct: 426 VEVMEGERGKEIRKNVEKLKVLAREAISEGGSSDKKIDEFVALL 456

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
K7NBW32.1e-11747.86Mogroside IE synthase OS=Siraitia grosvenorii OX=190515 GN=UGT74AC1 PE=1 SV=1[more]
Q9SYK96.6e-11146.88UDP-glycosyltransferase 74E2 OS=Arabidopsis thaliana OX=3702 GN=UGT74E2 PE=1 SV=... [more]
P0C7P77.5e-10745.81UDP-glycosyltransferase 74E1 OS=Arabidopsis thaliana OX=3702 GN=UGT74E1 PE=3 SV=... [more]
W8JMV41.6e-10144.54UDP glycosyltransferase 9 OS=Catharanthus roseus OX=4058 GN=UGT9 PE=2 SV=1[more]
O228223.6e-10143.78UDP-glycosyltransferase 74F2 OS=Arabidopsis thaliana OX=3702 GN=UGT74F2 PE=1 SV=... [more]
Match NameE-valueIdentityDescription
XP_023538720.12.1e-20777.54UDP-glycosyltransferase 74E2-like [Cucurbita pepo subsp. pepo][more]
XP_022953232.11.7e-20175.21UDP-glycosyltransferase 74E2-like [Cucurbita moschata][more]
XP_023538707.19.5e-20074.79UDP-glycosyltransferase 74E2-like [Cucurbita pepo subsp. pepo][more]
XP_038888325.14.8e-17568.88UDP-glycosyltransferase 74E2-like [Benincasa hispida] >XP_038888326.1 UDP-glycos... [more]
XP_022157642.11.3e-17264.82UDP-glycosyltransferase 74E2-like isoform X1 [Momordica charantia][more]
Match NameE-valueIdentityDescription
A0A6J1GMP08.4e-20275.21Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111455839 PE=3 SV=1[more]
A0A6J1DTN76.3e-17364.82Glycosyltransferase OS=Momordica charantia OX=3673 GN=LOC111024304 PE=3 SV=1[more]
A0A6J1DV086.3e-17364.82Glycosyltransferase OS=Momordica charantia OX=3673 GN=LOC111024304 PE=3 SV=1[more]
A0A6J1DTW46.3e-17364.82Glycosyltransferase OS=Momordica charantia OX=3673 GN=LOC111024304 PE=3 SV=1[more]
A0A0A0K2F31.1e-15665.42Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_7G051380 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT1G05680.14.7e-11246.88Uridine diphosphate glycosyltransferase 74E2 [more]
AT1G05675.15.3e-10845.81UDP-Glycosyltransferase superfamily protein [more]
AT2G43820.12.6e-10243.78UDP-glucosyltransferase 74F2 [more]
AT2G31750.11.7e-10144.89UDP-glucosyl transferase 74D1 [more]
AT2G31790.13.1e-10042.03UDP-Glycosyltransferase superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePFAMPF00201UDPGTcoord: 268..436
e-value: 3.3E-31
score: 108.7
IPR002213UDP-glucuronosyl/UDP-glucosyltransferaseCDDcd03784GT1_Gtf-likecoord: 16..455
e-value: 8.1508E-88
score: 272.888
NoneNo IPR availableGENE3D3.40.50.2000Glycogen Phosphorylase B;coord: 19..466
e-value: 1.9E-144
score: 484.2
NoneNo IPR availableGENE3D3.40.50.2000Glycogen Phosphorylase B;coord: 266..454
e-value: 1.9E-144
score: 484.2
NoneNo IPR availablePANTHERPTHR11926:SF1147UDP-GLYCOSYLTRANSFERASE 74E1-RELATEDcoord: 15..473
NoneNo IPR availablePANTHERPTHR11926GLUCOSYL/GLUCURONOSYL TRANSFERASEScoord: 15..473
NoneNo IPR availableSUPERFAMILY53756UDP-Glycosyltransferase/glycogen phosphorylasecoord: 15..474
IPR035595UDP-glycosyltransferase family, conserved sitePROSITEPS00375UDPGTcoord: 352..395

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0010472.1Tan0010472.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0080043 quercetin 3-O-glucosyltransferase activity
molecular_function GO:0080044 quercetin 7-O-glucosyltransferase activity
molecular_function GO:0008194 UDP-glycosyltransferase activity