CsaV3_1G021870 (gene) Cucumber (Chinese Long) v3

NameCsaV3_1G021870
Typegene
OrganismCucumis sativus (Cucumber (Chinese Long) v3)
DescriptionGlycosyltransferase
Locationchr1 : 13006862 .. 13027882 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
CAGATAACCTCCATCGCAAACCAATTTTCTCCAGAATTTTCTTTTTCCGCCTCAGTGCACGCTGGTCTAACCTCTGAACCCTAATACTGATTTTCCTTTTCCTTTTCCTTTTCTTCTTCTCATTATCAGACGTTTGTGAATTCTACGGCCTTTGTGATTCAGGTGATGCGGCTTTCTAAATTCTGGGCACTATCTTTCTACATCGTTTTTGCTTCGTGTCTTTTTTTTTTTTTTACTGTTTTTTCGCTAAATGTCAAATTTAGGGCTCGTTTGACAAGCTTTTGATGCGTATTGTTTCCTAATCACTTTTGATTACATAGTATTGGCAATTGATTCAGATTAAATGTTTGTTTCTTGAATTCGAAGTGTTCAATTTTTATAATTTCTGCAGCGTTTTTATCTAGATGATTTTTTTTTAATAAATGATATTTACTGTTAATTTCTGGCGAATTTAAACGAGATAATGAGTAGAGGTATTATATTCTAGTCTCTTTTAGTGACAAAGGATTGGGAGTTGCTTCAGATTTAATGTTTTTTCCTTGAATTCGAGATCCTGGAATTGTTTTCAAATACACTTACCGACTTACGTTTATTTTAATTAGACAGTAACTGGTGTGTACCAGGATCGGTTTTCTATTTCAATTCTTCTCTGAGAGTTGGGATTTACTATCGTTTTATGACGAAGCTTCTCATATATTTTAAACTTTGATTACTACTTTTTCAGGTGAGGAAATTTTTTGTCTGTATGGACATCCTAATGAAACATGGGAAATGGTTCTTCCAGTAGGCAGAGGTTCCACCGGAGCTTCTAGAGCCTGCACTTGGAATTAATTTTGCAACAGATGGCATGGACAGAAAATATTGGCGACCTCTTGTTGCTGTACATGGTGATTCCTGTTTGCTCTCTGTGGCTTTCTATTTTGGAGCAAGACTTGACCGAAGCAAAAGGTACAACTGTTCCACTTTTGAATTTTTGCTTGTGAGCTTTAAGTTAGGTTCTGTTAAATGCAATTCTTTTTAATCGTGCATTTATGTGAACTTTTATAGGTAATGTCATTTTACTTTTCTAATGTTTCTTTTGGCATCATTACCAAATACTCTTTCTGACATGTAGGTGATAAGTTCATAACCAACTTACACAATCACACAACTTCCTTGTGTTTCCCAGTTTATTAAGGTTGATTTACCTTCAATAACATTGTTACTGTCTGAGTAATTGTGGTTTTAGGAGGATGAAGAACTACATAACTGTTTATTCGATATGATTACATAATCTTCTTTTATGTGAAGGTGTGCACTAACTTTATCCATCTTACTAATCATATAATGACAGAACACTTTTCTAAAAGGTCTTTCCAGCAGATTTTTTCTTCCTCTTTTCTTCTAGAAACTCTTTCCAGAACATATGTATATACACATATGTATATATATATATATATTTGCAAAATCTCTTGTTTGAAAGGTCTATTCAAAAGAATTTTCTAAAAGTTTTCTCGATTCATATTCTGTTTTCGAACACTTCCCAAAATATTTCTTGAACTTTTGTCTTTCTTTTTGTTTTTGTTTTTTTGTTCTAAAAAAGAAAACATTTCACAAAACTTTCAGAAAACCGCATGAGCAAGGGTGTTGGCCAGAACACACAGGGCAATGAGCAACAAGGCAACATGCACACGGTGCGTGGAAGATGTTGTCTTGGTGCCATCTTTGCACATGTGGAAGCAAACTTTCAAATGTCATTTACAAAACATTTAGCAAACTTTTTTTTTCTTCCTCTTTTCTTCTAGAAACTCTTTTCAGAACATATGTATATATGTGTGTTTATATACATAGATATTTGCAAAATCTCTTGTTTGAAAGGTCTATTCAAAAGAATTTTCTAAAAGTTTTCTCGATTCATATTCTGTTTTCAAACTCTTTCCAAAATATTTCGTGAACTTTTGTCTTCCTTTTTGTTTTTGTTTTTTGTTCTAAAAAAGAAAACATTTCACAAAACTTTCAAAAAACCGCATGAGCAAGGGTGTTGGCCACAACACACATGGCAATGAGCAACAAGGCAACATGCACACGGTGCGCGGAAGATGTTGTCTTGGTGCCATCTTTGCACATGTGGAAGCAAACTTTCAAATGTCATTTACAAAACATCATTGCAAACTTTTTTTTTCTTCCTCTTTTCTTCTAGAAACTCTTTTCAGAACATATGTATATATGTGTGTTTATATACATAGATATTTGCAAAATTTCTTGTTTGAAAGGTCTATTCAAAAGAATTTTCTAAAAGTTTTCTCGATTCAAATTCTGTTTTCGAACTCTTTCCAAAATATTTCTTGAACTTTTGTCTTCCTTTTTGTTTTTGTTTTTTTTGTTCTAAAAAAGAAAACATTTCACAAAACTTTCAGAAAACCGCATGAGCAAGGGTGTTGGCCACAACACACAGGGCAATGAGCAACAAGGCAACATGCACACGGTGCGCGGAAGATGTTGTCTTGGTGCCATCTTTGCACATGTGGAAGCAAACTTTCAAATGTCATTTACAAAACATTTAGCAAACTTTTTTTCTTCCTCTTTTCTTCTAGAAACTCTTTTCAGAACATATGTATATATGTGTGTTTATATACATAGATATTTGCAAAATCTCTTGTTTGAAAGGTCTATTCAAAAGAATTTTCTAAAAGTTTTCTCGATTCATATTCTGTTTTCGAACTCTTTCCAAAATATTTCGTGAACTTTTGTCTTCCTTTTTGTTTTTGTTTTTTGTTCTAAAAAAGAAAACATTTCACAAAACTTTCAAAAAACCGCATGAGCAAGGGAGTTGGCCACAACACACAGGGCAATGAGCAACAAGGCAACATGCACACGGTGCGCGGAAGATGTTGTCTTGGTGCCATCTTTGCACATGTGGAAGCAAACTTTCAAATGTCATTTACAAAACATCATTGCAAACTTTTTTTTTCTTCCTCTTTTCTTCTAGAAACTCTTTTCAGAACATATGTATATATGTGTGTTTATATACATAGATATTTGCAAAATTTCTTTTTTGAAAGGTCTAATCAAAAGAATTTTCTAAAAAGTTTTCTCGATTCAAATTCTGTTTTCGAACTCTTTCCAAAATATTTCTTGAACTTTTGTCTTCCTTTTTGTTTTTGTTTTTTTGTTCTAAAAAAGAAAACATTTCACAAAACTTTCAGAAAACCGCATGAGCAAGGGTGTTGGCCACAACACACAGGGCAATGAGCAACAAGGCAACATGCACACGGTGCGCGGAAGATGTTGTCTTGGTGCCATCTTTGCACATGTGGAAGCAAACTTTCAAATGTCATTTACAAAACATTTAGCAAACTTTTTTTCTTCCTCTTTTCTTCTAGAAACTCTTTTCAGAACATATGTATATATGTGTGTTTATATACATAGATATTTGCAAAATCTCTTGTTTGAAAGGTCTATTCAAAAGAATTTTCTAAAAGTTTTCTCGATTCATATTCTGTTTTCGAACTCTTTCCAAAATATTTCGTGAACTTTTGTCTTCCTTTTTGTTTTTGTTTTTTGTTCCAAAAAAAGAAAACATTTCACAAAACTTTCAAAAAACCGCATGAGCAAGGGTGTTGGCCACAACACACAGGGCAATGAGCAACAAGGCAACATGCACACGGTGCGCGGAAGATGTTGTCTTGGTGCCATCTTTGCACATGTGGAAGCAAACTTTCAAATGTCATTTACAAAACATCATTGCAAATTTTTTTTTTCTTCCTCTTTTCTTCTAGAAACTCTTTTCAGAACATATGTATATATGTGTGTTTATATACATAGGTATTTGCAAAATTTCTTGTTTGAATGGTCTATTCAAAAGAATTTTCTAAAAGTTTTCTCGATTCAAATTATGTTTTCGAACTCTTTCCAAAATATTTCTTGAACTTTTGTCTTCCTTTTTGTTTTTGTTTTTTTGTTCTAAAAAAGAAAACATTTCACAAAACTTTCAAAAAACCGCATGAGCAAGGGTGTTGGCCACAACACACAGGGCAATGAGCAACAAGGCAACATGCACACGGTGCGCGGAAGATGTTGTCTTGGTGCCATCTTTGCACATGTGGAAGCAAACTTTCAAATGTCATTTACAAAACATTTAGCAAACTTTTTTTCTTCCTCTTTTCTTCTAGAAACTCTTTTCAGAACATATGTATATATGTGTGTTTATATACATAGATATTTGCAAAATCTCTTGTTTGAAAGGTCTATTCAAAAGAATTTTCTAAAAGTTTTCTCGATTCATATTCTGTTTTCGAACTCTTTCCAAAATATTTCGTGAACTTTTGTCTTCCTTTTTGTTTTTGTTTTTTGTTCTAAAAAAGAAAACATTTCACAAAACTTTCAAAAAACCGCATGAGCAAGGGTGTTGGCCACAACACACAGGGCAATGAGCAACAAGGCAACATGCACACGGTGCGCGGAAGATGTTGTCTTGGTGCCATCTTTGCACATGTGGAAGCAAACTTTCAAGTGTCATTTACAAAACATCATTGCAAACTTTTTTTTTCTTCCTCTTTTCTTCTAGAAACTCTTTTCAAAACATATGTATATATGTGTGTTTATATACATAGATATTTGCAAAATTTCTTTTTTGAAAGGTCTAATCAAAAGAATTTTCTAAAAAGTTTTCTCGATTCAAATTCTGTTTTCGAACTCTTTCCAAAATATTTCTTGAACTTTTGTCTTCCTTTTTGTTTTTTTGTTCTAAAAAAGAAAACATTTCACAAAACTTTCCGAAAACCGCATGAGCAAGGGTGTTGGCCACAACACACAGGGCAATGAGCAACAAGGCAACATGCACACGGTGCGCGGAAGATGTTGTCTTGGTGCCATCTTTGCACATGTGGAAGCAAACTTTCAAATGTCATTTACAAAACATTTAGCAAACTTTTTTTCTTCCTCTTTTCTTCTAGAAACTCTTTTCAGAACATATGTATATATGTGTGTTTATATACATAGATATTTGCAAAATCTCTTGTTTAAAGGTCTATCCAAAAGCATTTTCTAAAAGTTTTCTTACTTCGAATCCTGTTCACAATCTCTTTCCAAAATATTTCTTGAACTTTTGTTTTCCTTTTTTTCTTTCAAAAAGAATACATTTCATAAAACTTTCGAAAAATTGAAAGAGCAAGGGTGTTGGTCACTCACAGGGCAAAATTTTAGTTTAAAAGGCCTATTCAATAACATTTTCTAAAAGTTTTCTCACTTCAAATTTTGTTTGTAAACTCGTTTCAGAATATTTTCTTGAACTTTTGTCGTCAACTTTTCTTTTAGAATGAAAAAAGGCATTTCATTGAACTGTCAGTATATTCCACGAGCAAGACAGCATACAATCGATGCATGGAAGACACAGCCTTGGTGCCATTGTGGCACAGGCGGTCAAACAAACTTTAAAAGGCCCCTTACATAATATTGTCTTAATTTTCTTCTCTTTTTTCTCTTTAAAAAACTCTTCCTGAAACACATTTTCAATTTATAAAAAACACATTTTCTCTCCAATGTAGATTTTGCATAATAAATTCCCTGTTTAAATAATATTTAAGAGTGAATTTTCACTTTTGTGTATTATTCTCTATCTTACATGTATCTCTTCATAATTATTGGATCTGTTCTATAACTTTTCTTGTAAATTTCTATATAAATTGCAATCGTTGTATAACATTTATCAATTTAGAAAAATCTTAGGTGATTAAAGGAAATCTGATTGTCCGAAAATCACTTTTTTCTATTTCATCATCACCCGACCCAAATCCCCCTATGGTTGCCGTTTGGCTTTTCTATTTTTTTTTTCTTGAAATTAAGTCTATAAACACTCATTCTATCTTCAGTTATATGTTCCAGAATTATATTATATTTTAGTTTGACTTCTTTGTATATTATTATTGGTACATTTCTATGTTTTATTTGCACAAAATTATATGTTTTTTTTATAACGATGGATAACTTCTTTGTTACAATGTCCAGTAATTATTTGACTATCATTAATTTCTAATTCTGATTGATTTTTCTATGTTAGCACTGGACGCTTTACATGTATTTTTATTTTACTTCCTTTGATGGATTTATTTTTCTTTTAGTTTTTCTTTCAACATTTTCTTCTTTCTTGAATTTTTCTACTTTTGTAGAGTAGATTGTCAAATTTTATATCTTTGTTATCTTGGTAGAAGTTATTTATTTTTGCCGTATGGGTGATTTTTTGTTTTTTTGCCCATTCTATTCTCTTGGTTCTCTTTTCTTTTTCGGTTCATATATTTATTAGTTTAGGGTTTGAGTTGATACAACCCTCAAAATTGAAGAATATAAATTGATATTTAGCCAAAGAATTTCTTTAGATGTTTTGAGGCCATGCGTGACAGTACTTGACCCTGGATTTTAAACAGCATCCAAGGTTTGTTGACTTTTGCCCTACCTATATCTGAAAATTAAAAAACAATCAATTGTGTAATATTGGCTGAAAGATTCTTCATATGCAAATAATGGACAATTTTTCGGTTTCTACATTGCCTCAAGGATGCTAACATCTTACTACAGTGGGTCTTGGACCGGACCTTTTCTGCAGAGCTCCTTCCATTGCCTGCCATGAACTTTTGCATTTTTGACTTATAGGCACTCGTTATTTCTTTCTTGTGTTGTATTCAATAAAAATAAATACCATGTTCCTTGACTATAATTTTGTGTATACAAGGGTAACTGAAACATTCAAAGCCTTGCTTTCGGCTTCCCTAGCTAAGCCACAATGATGTTTTTCTATCCTTTGGGCACTGAGAAAAGGCAGATGCTGCAAATGAAGGGGGTACACCTTACAAGACAATGTTTGATGCTTTTGCAGGTAAAATTTGTATTAGTTGTTATGTCTGGAAGTGATTTTATATTATTATAGTTGGTTATTAGTTGTTATAAGTAATTCTGAAGTGTGGCAATGATTATTTAGAGTGAAATCTGGGCTCTGACATACATGATTTATGTTCTTTGTAATTCTGTGCTCATTATCTATGTGAGTATGCGTTCTTGTCCTCAATCTTTTTGCAACCTTTTCATGTCTTTCATATTTTGAACTATACGCATTTGTGGTTTTCGTAGAACTTTAACAGACACTCCTATACTTCTACTGCGAAACCACTTTTAATTATCTATTTGCAATATGCTGAATGTTCTATCATTAGAAAGCTACTTTCAGGAGTATTATAGATGATGAAATAAACATTAAAATCCTAACATTGAAAAATCAAGCAAAAGGAGGAGAAAAAGTTTGCCGAGGTGTAGTTTATAGGAATCATCATTTCCTCAATTTGACGCAGGTATATGAGTGATTTTTTACTTTTGTATTCATTTCTCTCTTTACCATTATGGGATCTGTTTTATGACCTTTTTCGTGAATTTCTATGTGAATTGCAATTGTTGTATAACATCCAATATGATCTTAGAAAGATATGAGGTGATTAAAGAAAACTTGATGGTTGAAAAATCAATTTTTCTTGTTACATCTCATGTCGATGCTCCAAATTCGAGAACCATTTAGTTTTTGTTCTCTTAGTTAAACCTTTAAATTACATTATATTCTTTAGTTTGACTTCTTGTATATTATTATTTTGGTACATTTCGTAGTTTTATTTGCACCAAATACTTTCTTTCTTTCCTTGTTTGTTGCATGGGGCGCTTTCAAGGTATTTTTCCTTTACATTATGGATTTATTTTTTAGTTTACTTTTTCTTCGTACATTTTATTTTTCTTCTCATCATTTGCTACTTTATTTTAAAGTAGATAATTAAATTTTATTTATTTACTTTTGGTGGATGATTTTTTTCTTTTTCCGTTTATGATCTCTTGGTGGTGGTTCAAAATTTTTTTCTTTTTTTTTTTACTTGTGTAATTTATTTTTCTTGGATGGTTCAAAATATTTTCACCATTTTGCCGTCTGTTTAGTTTCTTTTTATTACATTCTTATTTATTATTTTTCTAGCTTTGTTGTTTGGTTTGTCTAGTTTTTGATGTAAGGTCACTTTTAAAAATAGCAAAACAAATTAAAATATTTATAATAATATGGCAAAATTTTAGATTTTCTCAAGATAAAATTTGATAGACTTATATCATAATGTCTATTAGAAGGGTAAGAATTGTTGGAATTTTTTGTCCTAAAACTCGTAGTTTGATTATATTCTGTTTCAATAAAGTTGTTATTGAGAAAAATATTATAATCCTAAATCCAATAAATCAAGTTCCAAGGCTATTTTTTGAATAAACTTTGAACTTTATGTGTAAACATAAACGTAGATCAAGTTTTGACATATAGCCTAAATAGTCTATAGTATATGAATAAAGGTTGGACACCTTATTTAGAGAAACTATGGATGCGACCCACTTTGTAGTACAAACGATATTTTGAATTTAAGGAGCAAGATGTAATATTGGGGGTACATATAGCCTAAATAAGGTTTTAGGATTTAAGGCTTTTAGCGCTTAGCGCTTTTAGGGCTTGGGCTTTGAACAGCTTGTTGACTACCCAGTTTCAGTTTTAACTGAGAAATATGGAACACATTGTGGATACTGGCTTCCGGCGGTAACATCAATCTGTACGCAACTGCTCCTATTTCTTCAGTAATTTTGTAGGGTCCGTAATATTTAGGTGCTAACTTTTCACACTTCTTCCGAGCCAAGGAGTGTTGCCGGTAAGGTCTCAATTTCAAGTAGACTTCATCTCCTACCTTAAATGTTAACTCTCATCTATTTCGATCAGCCATTTTCTTCATCCTGTTCTGGGCTACACACAGGTGTTCTTTTAAGGCATTGATGGCTAAGTCACGTTCCTTCAGCAGCGTGTCAACCTCATTATTAGGAGTTTTCTTGTAACCATAGGATAACAGGGGAGGAGGCGGTCTGCGGTATACCGTCTGGAAAGGGGTCATTTTGGTGGATGCATGGAAGGTGGTATTATACCACAATTCTGCCCATAGAATCAGTTTATCCCGTTTTTTTTGGTTGTTCATTACAGAAACATCGCAGATAAGTTTCGAGACATCTATTTACCCTTTTTGTCTGCCCATCAGTTTGTGACTGAAACGCGGTACTCCTCTTTAGAATAGTTCCCATAGTTGCAAACAGTTCCTTCCAAAAATTACTAAGAAAGATTTTATCCCGATCTGTGATAATCGACAAAATTACTTAGGATAATCGCCTAGTAGGTCTTTAGAATAGTTCCCATCAGCCGTGCTTGCTTACTATTTTGTCGATGAATATGGAAGCAACTTGCTTGGCTGAAAACGGATGTCTCAATGATATAAAGTAAGAATACTTACTCAGCCGATCAACTACCACCATGATCACATTAACCCCTCCAGCTAGTGGCAACCCCTCAATAAAGTCCATGGTCCAATCTTCTAAGATTCTGTCAGGAATTGGAATTGGTTGAAGTACCCCAGCTGGTTTGGTTGCTTCGTATTTGTTTCTCTGGCAGATCTCGCATTGTTCCACATATTTCTTTATTTCAGCCTTCATACCCCTCCAATATAGTTCTCCACTGACCCTCTTGTAAGTTCTTAAAAATCCAGAATGTCCCCCGAGGATAGAATCGTGGAAAGTATGTAGGATCTTGGGTATAAGTGTAGAAGTTTGAGATACTACCATTCTCCCCTTATAGAGCAATCTTCCTTTTTCCACTTGAAACTTGCTTCCTTCTGCATTTTTCTTCAGATTTTCCATAATTTTCTTAAGTTCAGCATCATTCTCAACTTCCTGCTCGATTAGTTTCACATCAATGATTCCAGTAGTTGTCAAGCTGTGCAGCTCCACCGGTTGCTCCATTCTAGATAGGGCGTCCGCTGCCTTGTTCTGAAGTCTGGGTTGATATAAAATCTCGAAATCATATCCAAGGAGCTTTGTCAACCATTTCTGGAATTGGGGTTGTACCTCTCTCTGTTCAAGCAAGAATTTAAGAGCCTTTTGATCTGAAAGTATGGTGAACTTCCTGCCCAGCAAATAATGCCTCCATTTCTGAACTGATAGAACTACAGCCATCAACTCTCTTTCATAGATGGATTTGTTTTGAGCCCTTGTGGATAACTTCTGACTGAAAAAGGCAATCGGGTGGCCATTTTGAGAGATCACCGCTCCTAACCCTACGCCAGATGCATCAGTTTCGATAATAAAGGGAATTTTCCAATCTGGTAATGCCAACACTGGTAGAGTAGTCATTGCTAGCTTCAAATTTTCGAAGGCAGTATTTGCTTCTTCATTCCACTTAAAAGCATTCTTCTGTAATAACTTGGTTAGCGGTGCAGCGATCTCTCCATAACCCTTGACAAACCTCCTATAATATCCTGTTAGTCCTAAGAAGCCCCGCAGACCAGATATATTCGTTGGTGGTGGCCAATTAATCATCTTCTTGATCTTATCCTCATCTGCCTCAACTCCTCTTTTGGATATCATATGTCCCAAGTACTGTATTTGAGAATGGGCTATAACGCTTTTTTTTTTTTTTGTTAGCAAACAGCTGCTTATCCTTAAAAACATTGAATACCATGCCCAAATCCTTTGCGTGTTCATCTATATCCGCACGATAAACCAGTATATCATCAAAGAACACCAAAACACAGCGCCGAAGGAAAGGTTTAAATACCGGGTTCATCAAAGATTAGAAGGTTGCCGGAGCATTCGTGAGACCAAACGGCATTACTAGAAACTCATAGTGCCCTTCATGTGTGCAGAACGCAGTTTTCTCCACATCTTCCTCTTTCATTCTAATCTGGTGATAACCAGATTTCAGATCTAATTTCGAGAAAATCGTTGCCCCATTCAGCTCATCTAATAATTCCTCTATCACAGGAATTGGGAAGTTATCCGATACTGTCGCCTGATTTAACTTGCGGTAATCGACACAAAATCGCAATCCACCATCTTTTTTCTTCACTAACAACACCGGGCTGGAGTAAGGGCTATGACTAGGGCGAATAACCCCGGCTTGAAGCATCTCGACCACCAGTTTTTCTATTTCTTCTTTCTGCACATGACCGTATTTGTAAGGTCTTACGTTGATAGGTTTATGGTCAGGTAGGGTTAATATTCGATGATCAATAGCTCTTTTTGGAGGTAATTTCTTAGGTGTCTCAAATATCTCCGCATAATACTTGAGCAGGGATCGGATCATAGGTATTTCCTCCTCATCTCCTTTTAATCTTGTCCCCATTTCATATTCATTCTCTACTTCCATCTCATAATTCTGGAATTCTAAGAGGAAGCCTTGATCTTCTATATCCCACGTTTTTTCCAATGTCTTCAAAGAACATTCAGGTTTCACTAAGGATGGATCTCCTCTTAACACGATTTGCTTGTCTTTTGCCCAAAAGGTCATGGTTAATGATGGCCAATGTACCTTCATTGTGCCGGTGGTATCTAGCCATTGCATCCCTAAAATCACGTCAATATTTCCTAGTTCCACCACTAGAAAATCAGTTATAACTGTCGTTTCTTTCAATCTCAGTTCCACCTTTCTGCAAATACCTTTCCCTTTACAGCGGGTCCCGTCTCCAATAGTCACTCCAAATTGTGTACCGGGTTCAATGTTCAACCCTCCTACCATTGCCACTGCTTCATGGATGAAGTTGTTCGTAGCTCCACTGTCGATGAGAATCACCACCTCCCTTCCTTTTACATGCCCCTTCAGTTTCATTGTCCCTTTGGATGTTAACCCCGTGAGTGTCTTCAATTCAATTTCGGTGGTTTCAGCAACAACTAACTGGTTCAGCTCCATTACTTCTTCGGTCTCTTCCATTTTGACTTCCTCTCCGCCACTCTCTTCTTCGTTAAGTATAAATAACATTAATTCCCTTTTCTCTCTCATCTTGCATCGGTGTCCCGGTGAGTACTTCTCATTGCATCTAAAACACAAACCCTTGTCCAATCTTGCTCTGAATTCCGTGTCCGAGAGTCTTTTAACTGGAGGTTCATTTTTCTGGTAGTTTCCTTTAATGGGAAAAGTGACTTGCTTCATGGGGAACTCAGTCTTCTTTGGTGAACTTTTATCTTGACTGCTTTGTCCCTTGCTTGAGCCTCCTCCTTCCCTAGTCACACTGTTTACTCATTCTGCCTTGGCTAATTTCAAGGCTAGGTTTCTGTCACTTACCAGTTGTGCGTCCTTCATACATTCCTCTAAGGTTTGAGGATGTCTACTAATAACTTCTGCTTGAAGAGCCGATTCTAAACCTGTCACGAATGCATCTAAGAGCACGCTTTCAGCCATGTGCGGTAGTGGTGCAGAATATTGTACAAATTTCTTGACATAATCATTATAGGATCCATCTTGTTCTATTCGAATCAGCCTTGCACCTAAGCTCTTCTGCCCAGAATCTCGAAAAAATTCAAACATCCTTACCTTCAAATCTTCCCATCCTTCCACCTTCTTTCTATGGTTACTCCAACGGTACCAATCGATTTCATCTTGTCCAAAACTAACCACTGCTACTTTCACTTTTTCTGCTTCCGGTAGGCTATTAATTTCGAAAAAATGCTCAGCTCTAAACACCCATGACTCAGGATTTTCCCGGTGAAAATCGGCATTTCCAGTTTCTTATATTTGCTTCGGTCGACAGTTACTACATTGTTTTCCCCTGGATTGTCGGCTTCTTCCCATTTTCCTTTTAACTTCATGACTGATCCATCAGAAGTTCCGGATTCATCTCGCCTTTTGTAGCTGCTGTTTTCTTTCATTTCATCAGCCAGTCTATCCATGGATTTCTTCATTTCCAGAATCATTTCCTTCAATCCCAGTATTTCTTTCTCTGTTCCAGCTGTTCTTTCTTCTATTTGCCTTTGCGCCATCAGTCGCGTCACCACCCCCAACGTTTGGGCTCTGATACCAGTTGTAGAGACTCCACAAATAAAATACTAAATTACCCAAACAGTTGGCAAACAGAGACAGCACTCTGTAGATCTTTATTTATATATAAATGCCAAAAGATACAGTAGAAGGAAATATAAAAGAAAGCAATAAAGATGATAACATCATGTCAAACCTCCCCACGAAAAGATAGTAGTCCTTCCTAGACCTAGCAGCCTCTATAGACCAAATGAAGACAAGAAGTTAACCTAGCCCCCAATCCCGATAAGGCCTTATTTATAACTCTTTTCTCATTCTTCCCCGTGGGACCCACTCTCATGTTCTTTTCCTAATATTCTCCCCTTTTCTGCTGCCTGAGCTTTTACGTTTTTACCCCTCCTTTTGTATGTATGGATAATAGGGGGCCTTACAAATTTCTTCCAACACTATGAAACATTTATGTTATAGTTTTATTATTGTTAAACATCATTGTAATTTCCAAACACAACTCTGCCAAAGTTCCCAAGATAGTTCTATAGCAATGTTCCACTGCTATCTTGCAAGTGATGTGGAACTCCTTTGCTTTTTTGAATAGTAAGCTGGCCTTCCTCTTTTGTTATGCTTTGTTTTATTCTTAATACTTGCTTTTATCTTCGCCTATTTCATTCCTAGTTGATATATTACATAGTCCTCTTAAATCTACATTCTTTTTCTGTTTTATGATTCACTTGAGTTTAATGACTAACTTTGGCGAAATAATGTTTAACTATATTAGATGTTTTGCATCTAAACCTCAATTGAAAAGGTTTGTTACCTGTATCATTATAAGGTTTCCATGATTTGTTACCTGTATCATTAAAAATATTGGGTCGGGTGCATTTGCTTACACACCATTCCATGGCCAGCAGCAGAGGTTGTACTCTACGATTCAATTCCTTTCCTAATCACAATTTTGATAGATTTGTTTCTGAAAAACAATATTTAACTTCCTATTTTTTTGTTTTTTGATAAATCTATTAGTATATTTGATGTAAACTATAGCTAGTTAGTTAGTAAAAATCCCTAACTAATTAGAAATACAGCTGTAACTAACTTCCTAAAACAGTCTTGTAACTAACTAAATAAAACAACCTTGTAACTAACTTGTAACTAACCTTGAGATATCAATATATAACAGTCTCCTAGCCTGAAATATGGCAGAGAATTCTTTTGAAAAATCAATCCAACTTGAATTACATCAAATTGGTATCAGAGCAGCCCCGATTCTTGGACGGAATGGCTGGCCGAAGAGTTAACAACCCTGCCACGGGGGAAAACCGTGTGCAAGAAGCGGCGGAGGAAATCACCGTCCTCTCTCCAAAGACATCAACAGTTCGCTTGCTGGCTGTTGAAGAATCTTTGGGAGATCTCCACAACAAATTTGATAGGTTGATGGAAAGTGTCGAACTGTTAAACCGAAGAGGAGAATTCCCACAGCCACCACCACGGAATGAAATTAACTTCCAAAACGACCAACGTTTTGGTGAAGCAAGAGGCCGGCGAGCAAGGGGATACATTAGAAACATGAACAACCCACGAGGACTTCAAAGAAGGAGACCCGGTTACGCTATACAGCAACAACTTGACGAAGACTTTCAAGAAGATCAAGAAGCTTGGCAAGAAACCCAAGAAGATGATTCTTCAAGCGGGGATGAACAAGGAAACATGTGGAACTTCAATGATGAAGCACGAGCAGGAAGAAATAACCGAAGAATTGAGGCCAGAAGAGGAGAATACCACGATTATAAGATGAAGATTGACCTTCCCATGTATGATGGCAAACGAAACATAGAAGCATTCTCGGACTGGATAAAGAGTACTGAAAACTTCTTCAACTACATGGATACGCCTGAACGCAAGAAAGTCCACCTGGTAGCCTTAAAATTAAGGGCTGGTGCATCAGCTTGGTAGGATCAGTTAGAAATCAATAGACAAAGATGCGGGAAACAACCGGTTCGCTCATGGGAAAAGATGAAAAAATTGCTGAAGGCAAGATTCTTACCACCAAACTACGAACAAACTTTATACAATCAGTACCAAAACTGCCGCCAAGGTGTCCGGACAGTAGCTGAGTACATTGAAGAATTCCACCGCCTGAGTGCAAGAACAAACCTAAGCGAAAATGAACAGCACCAGGTTGCAAGATTTGTGGGAGGCCTTCGATTCGACATCAAGGAAAAGGTCAGACTACAACCATTCCGTTTCCTATCTGAAGCAATATCATTTGCAGAAACGGTAGAGGAGATGATTGCAATCCGATCCAAGAATCTAAATAGAAGATCAGCATGGGAAACAAATTCGACAAAAAGCAAAACAAACGACCAACCTTCAACCTCAACAAAAGCAAAAGGGAAAGAGATTGATAATCAAGAAGTAGCCGTTGAAAGAAAGAAAGAACAGACGTTCAAGCCTAGTGGTCAGAACAGCTACTCCCGCCCGTCATTAGGAAAATGCTTCCGATGTGGCCAAACTGGCCACCTGTCCAACAACTGTCCGCAAAGAAAAACCATTGCAATAGCCGAAGAAGGAGGACAGACAAGTGAAGACAGTATAGAAGCAGAGGAAGAAACTGAACTGATTGAAGCAGATGATGGGGAAAGGGTCTCTTGTGTTATCCAACGGTTACTAATTACGCCCAAGGAAGAAAAGAACCTGCAACGCCACTGTCTTTTCAAGACAAGATGCACCATAAACGGAAGAGTATGTGAGGTAATCATAGACAGCGGCAGCAGCGAAAACTTCGTAGCAAAGAAATTAGTAACAGTCTTGAATCTAAAGGCTGAAGCACATATGAGTTTCAATGGATGGGAAGAAAGGTAGTCTTACTCCCTATAACGAAAAAGATTAACGAAGGGTTAAGAGGTGAGAAACAGTTATTCATCACTGTTAGTGGGAAAAACATGCTTAAAGAAAGGGAAGAAGACATCCTAGGACTGGTTGTTATTGAAAAAACCAAAGAAAAACAGGTTGAAGACATAGAACCCAAATTACAGCAGCTCCTTCATGAGTTCCCTCATATAAAGGAAGAACCGAAGGGACTCCCACCCCTTCGAGATATACAGTACCACGTAGACTTGATTCCGGGAGCATCACTACCAAATTTGGCTCACTATAGGATGAGCCCCCAGGAGTACAAAATACTTCATGATCATATTGAGGAATTACTAAAGAAAGGGCACATCAAACCTAGCCTCAGCCCGTGTGCAGTACCAGCACTTCTCACGCCAAAGAAAGATGGAAGCTGGAGAATGTGCGTTGACAGCAGAGCCATCAACCGCATCACGGTAAAGTATAGATTTCCCATTCCAAGGATTAGTGACCTGCTAGATCAACTCGGTAAAGCCAGTATCTTTTCGAAAATTGACTTGAAAAGTGGCTATCATCAAATACGGGTAAGACCTGGCGACGAATGGAAGACAGCCCTCAAGACAAACGAAGGATTATTTGAATGGATGGTCATGCCATTCGGCCTTTCTAATGCACCCAGCACCTTCATGAGATTGATGAACCAGGTACTTCACCCATTTCTCAACAAATTCATAGTAGTTTACTTCGATGACATACTTGTTTACAGCACAAACAATGATGAGCATTTACTACACCTAAGGAAGCTGTTTCAAGTCTTAACAGAGGCGGAACTCTACATAAATACTAAGAAAAGCATGTTTATGAAAAAAGAAATTGCATTCCTCGGCTTTATAATCAAACAAGGAAGCATAAGCATGGAACCAAAGAAAATCGAAGCCATCCATACATGGCCGACTCCTGCCTCCATTAAAGAAATACAAGCCTTCCTCGGCCTGGCTTCGTTTTACAGGAAATTCATCAGAAATTTCAGCTCTTTAGCCGCACCACTAACTGACTGTCTAAAGAAAGGAAACTTCAAATGGACCCCATTACAACAAGAAAGCTTTGAAGATATCAAAAAGAAGCTCACATCCAGCCCTATCCTTAAATTACCAGATTTCTCTTCACCTTTTGAAGTAGCAGTTGATGCATGCTGTACAGGGATTGGAGCTGTCCTAGTACAACAAGGACATCCTATCGAATACTTCAGTGAAAAACTCAGCACCGCAAGACAGACCTGGAGCACATACGAACAAGAGCTGTATGCCCTCGTCCGAGCACTAGAACAATGGGAACACTACCTGATCTCTAAAGAATTTGTACTCCTAACTGACCATTTCTCACTAAAATACTTTCAAGCTCAAAAGAATATCAGTAGGATGCAGCACGCTGGATATCCTTCCTCCAAAGGTTTGACTTCGTGATCAAACACCAATCAGGCAAAGAGAACAAGGTGGCCGATGCTCTAAGCAGAAAAGGCTCCCTACTCACAATACTGTCCTCGGAAATCATAGCATTCAAACATTTACCCGACTTATACGAAGGTGATACTGACTTCAAGGATATCTGGTACAAATGCTCCAACTTCTTAGACGCTGATGACTACCACATTGTTGAAGGATATCTATTTAAAGGAGAACAATTATGCATCCCGCACACCTCACTACGTGAAGCCTTAATAAAGGAAGCACATTCTGGAGGGCTAGCTGGACATTTCGGACAGAACAAGACATTGGAGATCACTTCCAAACGATACTACTGGCCGCAAATAAGAAAAGACTCCAATAATTTCGTAAAAAGATGCCCCATCTGCCAAAGAACCAAAGGCTCCAGCACGAATCCAGGATTATACTCGCCACTACCCATCCCGACCTCAATTTGGGAAGATTTATCAATTGACTTCGTGATTGGATTACCAAAAACACAAAGACAATTTGACTCAATAATGGTCATAGTGGACAGATTCAGCAAAATGACACATTTCGTAGCATGCAAAAAGACAAATGATGCAATCTACATAGCCAACCTCTTCTTTAAAGAAGTAGTACGACTACATGGAGTACCTAAAAGCATAGTATCAGACAGAGATGTCAAGTTCCTGAGTCACTTTTGGCGAACACTGTGGAAGAAGTTTGACACAACACTGAAATTCAGCACCACAGCCCACCCACAGACAGATGGACAAACTGAAGTAACAAACAGGACCCTCGGTAATCTGCTACGCTGCCTTAGCGGGTCAAAACCAAAACAATGGGATCTAGCATTGGCTCAAGCTGAATTCGCCTTCAATAATATGAAGAACAGATCAACAGGAAAGTCCCCCTTCGAAGTAGTTTATACCAAACTACCACGATTAACCTTTGATCTCACTACACTCCCCACAACCGTGGATCTCAACAACGAAGCAGAATGCATGGCAGAAAATATCAAAAAACTACCCAAGGAAGTCCATGATCATCTTATACAGACAACAGACTCCTACAAAAAGGCAGCAGATAAAAAAGAAGACAAGCCCACTTCAATAAAGGAGACCTAGTAATGGTACACCTGAAAAAGAGCAGATTTCCTACTGGCACCTACAACAAGCTGAAAGACAGACAAATTGGGTCATTCCCTATATTAGAGAAATACGGAGATAATGCCTTCAAGATCGATCTACCACAAGACATACACATACACCCAGTCTTCAATGTTGCTGATCTAAAGCCATACCATGCACCAGATCGTTTCAGGCTTGCTGACTGATGGACCCTGGGACAAGTCCATCCTTAGGGGGGTGGAATGATGTAAACTATAGCTAGTTAGTTAGTAAAAATCCCTAACTAATTAGAAATACAGCTGTAACTAACTTTCTAAAACAGTCTTGTAACTAACTAAATAAAACAACCTTTTAACTAACTAAATAAAACAACCTTGTAACTAACCTTGAGATATCAATATATAACAGTCTCCTAGCCTGAAATATGGCAGAGAATTCTTTTGAAAAATCAATCCAACTTGAATTACATCAATATTGGTAAAATGGTCTATCGATAATGAAAATCTATAACTAATACACAAGTGATGGACTAACAATTTAAAATATCATCGTATTTATAAATCACGTGGTTTTGTGACATATTTGCTAATGCCATAGAAACCTAAATTATATATACATTTTAGGTTAGTCACGATTTTGAGTCTAGAGTTTAGACTTTAGAGAAGCGTCGTAAATGAACTCTTTAATCTTTCAGGCCTTTGCGTCTGGATCTATAATACAATTGTTAATTTCCTCTATTCGTGAACAAATTTAACTATATGTTAGTGAATAATCGCATAAATCTCATTGTCGTTTCTTACCGTTCACATTTTTACTTTTGAAAATACTTTTGTTATCAACTTATCTATGACAATTATCTTTAAACGAAAGATTCTTTTTAAGTCGTCATGTATGAAACGAAGTGATCTTTTCGTTCATATTCTTAACAGGTCGGTCTCTCTCCTGCTATTCAAAATCTTAGTACTGCCACATCGATGAAGATGACCAAGGTTTTCCAAGCGTTTCTCATGCTCCAACCCCAACTCGTAGACCTTATCCACGAGATGCAGCCAGATTGTATCGTTTCTGACGTCTTTTACCCATGGACGAGTGATGTGGCAGCCGAGCTCAGAATCCCACGGCTTGCCTTCAACGGTTCAAGCTTCTTCAGTTATTGTGCGGAACAATGTATAAAGGAGCATAAGCCACATCTTGAAGTTGAATCCAACAATGAAAAGTTCAAACTTCCGGGGTTGCCTGATGTTATTGAAATGGTGAGATCTGAGTTGCCAAGTTGGATCACTAGACATAAACCAGATGGTTTCTCACAATTGCTTGATGTCATTAGAGAATCAGAGAAACGATGCTATGGAATGTTGATGAATCGCTTTCATGAATTGGAAGCCTCATATGAAGAACACTTGAATAAGGTGAGGTTTACCTCAAACCATTGTATATAACTACACATTACTCGTTTCTTAACGTAATCTTTTGGTTTTGAAATTTTGTACTTGATTTATGATCACGATTTCGTTAGAATTATATTATTCATATTTTTTTAAAAAAAGAGAATATTTGAGAACAAAAATTACATCTTGAAGTATATATTTGTTTAGAAGCTTGATTTTTAGAAGTTGTTAATTAAGCTTATTCACATATTTTTAAAACACACTTGAATTCTAAGTTAAAGTTTTTCAAGAAAACTACAAATTTTACTTTAAGAACGTGTTTTTAAAAGGGTTGTTGTCAAATAATACAACATGAGACAAATTATTTACTTATGTAGCATGAAATTTTATTGTCTAATAAACACTAATAATATATAGAAATAGACATGTGATAAATAAAAAAATTATCATTACGTTGGCCTGTTGCACCCAGCCTTGTATATCTCCTTCGGTCGCAAAGAAGAAAAAAATATTCAAATTATCGTTTTTAAAAAAGAGAAGTTTGTCACGTGGTAACGTGTAATCTGGACATTGAATAAGCTATACAAAATAGATGGAGCTCAAGATGCACCCAAATATTTTTGAATTGATTCGAGACTTCGAGGATTTGTTTTTATTATTATTATTATTGTTTGATAGATTATAGGGATCAAAACATGGAGCATAGGGCCAGTATCATTGTTGGCAAACAATGAAATTGAAGATAAAGAATCAAGAGGTGGAAACCCCAACATTCAAACCACCAACTTACTCCAATGGCTCAATGAGAAAGAACCAAATTCAGTTCTCTACATCAACTTCGGAAGCTTGATTCAGATGAGTCGTAACCAAATAACCGAAATAGCACATGCGATTCAAGAATCCAGTCAAAGTTTCATTTGGGTCATAAAAAAGAACGACGAAGACAACGATGACGACATAGTAAACAAAGGGTTACAAAAAGGGTTTGAGGAGAGAATGAGTAGAACCAAAAAGGGTTTGATAATAAAGGGATGGGCTCCACAATTGATGATATTAGAGCACAAAAGTGTTGGAGGGTTTTTGACTCATTGTGGTTGGAACTCAATTCTTGAAGGAATAAGTTCAGGCTTACCAATGATCACATGGCCATTGTTTGCTGAACAGTTTTACAATGAGAAGTTGTTGATTGAAGTGGTGAAGATTGGAGTGGGAGTTGGGTCAAAGAAATGGTGGCATTTGGGAGAAGAACCAGAAATTATTAAGAGGGAAGAGATTGGTAAGGCCATAGCTTTTTTTGATGGGTGAAAGTGTTGAAGCTTTGGAAATGAGAGAGCTAAAGAAATGGGAGAGGCTGCAAAAACAAGTGTGAATTGTGGTGG

mRNA sequence

ATGATGTTTTTCTATCCTTTGGGCACTGAGAAAAGGCAGATGCTGCAAATGAAGGGGGTACACCTTACAAGACAATGTTTGATGCTTTTGCAGGTCGGTCTCTCTCCTGCTATTCAAAATCTTAGTACTGCCACATCGATGAAGATGACCAAGGTTTTCCAAGCGTTTCTCATGCTCCAACCCCAACTCGTAGACCTTATCCACGAGATGCAGCCAGATTGTATCGTTTCTGACGTCTTTTACCCATGGACGAGTGATGTGGCAGCCGAGCTCAGAATCCCACGGCTTGCCTTCAACGGTTCAAGCTTCTTCAGTTATTGTGCGGAACAATGTATAAAGGAGCATAAGCCACATCTTGAAGTTGAATCCAACAATGAAAAGTTCAAACTTCCGGGGTTGCCTGATGTTATTGAAATGGTGAGATCTGAGTTGCCAAGTTGGATCACTAGACATAAACCAGATGGTTTCTCACAATTGCTTGATGTCATTAGAGAATCAGAGAAACGATGCTATGGAATGTTGATGAATCGCTTTCATGAATTGGAAGCCTCATATGAAGAACACTTGAATAAGATTATAGGGATCAAAACATGGAGCATAGGGCCAGTATCATTGTTGGCAAACAATGAAATTGAAGATAAAGAATCAAGAGGTGGAAACCCCAACATTCAAACCACCAACTTACTCCAATGGCTCAATGAGAAAGAACCAAATTCAGTTCTCTACATCAACTTCGGAAGCTTGATTCAGATGAGTCGTAACCAAATAACCGAAATAGCACATGCGATTCAAGAATCCAGTCAAAGTTTCATTTGGGTCATAAAAAAGAACGACGAAGACAACGATGACGACATAGTAAACAAAGGGTTACAAAAAGGGTTTGAGGAGAGAATGAGTAGAACCAAAAAGGGTTTGATAATAAAGGGATGGGCTCCACAATTGATGATATTAGAGCACAAAAGTGTTGGAGGGTTTTTGACTCATTGTGGTTGGAACTCAATTCTTGAAGGAATAAGTTCAGGCTTACCAATGATCACATGGCCATTGTTTGCTGAACAGTTTTACAATGAGAAGTTGTTGATTGAAGTGGTGAAGATTGGAGTGGGAGTTGGGTCAAAGAAATGGTGGCATTTGGGAGAAGAACCAGAAATTATTAAGAGGGAAGAGATTGGTAAGGCCATAGCTTTTTTTGATGGGTGA

Coding sequence (CDS)

ATGATGTTTTTCTATCCTTTGGGCACTGAGAAAAGGCAGATGCTGCAAATGAAGGGGGTACACCTTACAAGACAATGTTTGATGCTTTTGCAGGTCGGTCTCTCTCCTGCTATTCAAAATCTTAGTACTGCCACATCGATGAAGATGACCAAGGTTTTCCAAGCGTTTCTCATGCTCCAACCCCAACTCGTAGACCTTATCCACGAGATGCAGCCAGATTGTATCGTTTCTGACGTCTTTTACCCATGGACGAGTGATGTGGCAGCCGAGCTCAGAATCCCACGGCTTGCCTTCAACGGTTCAAGCTTCTTCAGTTATTGTGCGGAACAATGTATAAAGGAGCATAAGCCACATCTTGAAGTTGAATCCAACAATGAAAAGTTCAAACTTCCGGGGTTGCCTGATGTTATTGAAATGGTGAGATCTGAGTTGCCAAGTTGGATCACTAGACATAAACCAGATGGTTTCTCACAATTGCTTGATGTCATTAGAGAATCAGAGAAACGATGCTATGGAATGTTGATGAATCGCTTTCATGAATTGGAAGCCTCATATGAAGAACACTTGAATAAGATTATAGGGATCAAAACATGGAGCATAGGGCCAGTATCATTGTTGGCAAACAATGAAATTGAAGATAAAGAATCAAGAGGTGGAAACCCCAACATTCAAACCACCAACTTACTCCAATGGCTCAATGAGAAAGAACCAAATTCAGTTCTCTACATCAACTTCGGAAGCTTGATTCAGATGAGTCGTAACCAAATAACCGAAATAGCACATGCGATTCAAGAATCCAGTCAAAGTTTCATTTGGGTCATAAAAAAGAACGACGAAGACAACGATGACGACATAGTAAACAAAGGGTTACAAAAAGGGTTTGAGGAGAGAATGAGTAGAACCAAAAAGGGTTTGATAATAAAGGGATGGGCTCCACAATTGATGATATTAGAGCACAAAAGTGTTGGAGGGTTTTTGACTCATTGTGGTTGGAACTCAATTCTTGAAGGAATAAGTTCAGGCTTACCAATGATCACATGGCCATTGTTTGCTGAACAGTTTTACAATGAGAAGTTGTTGATTGAAGTGGTGAAGATTGGAGTGGGAGTTGGGTCAAAGAAATGGTGGCATTTGGGAGAAGAACCAGAAATTATTAAGAGGGAAGAGATTGGTAAGGCCATAGCTTTTTTTGATGGGTGA

Protein sequence

MMFFYPLGTEKRQMLQMKGVHLTRQCLMLLQVGLSPAIQNLSTATSMKMTKVFQAFLMLQPQLVDLIHEMQPDCIVSDVFYPWTSDVAAELRIPRLAFNGSSFFSYCAEQCIKEHKPHLEVESNNEKFKLPGLPDVIEMVRSELPSWITRHKPDGFSQLLDVIRESEKRCYGMLMNRFHELEASYEEHLNKIIGIKTWSIGPVSLLANNEIEDKESRGGNPNIQTTNLLQWLNEKEPNSVLYINFGSLIQMSRNQITEIAHAIQESSQSFIWVIKKNDEDNDDDIVNKGLQKGFEERMSRTKKGLIIKGWAPQLMILEHKSVGGFLTHCGWNSILEGISSGLPMITWPLFAEQFYNEKLLIEVVKIGVGVGSKKWWHLGEEPEIIKREEIGKAIAFFDG
BLAST of CsaV3_1G021870 vs. NCBI nr
Match: XP_023006827.1 (LOW QUALITY PROTEIN: soyasapogenol B glucuronide galactosyltransferase-like [Cucurbita maxima])

HSP 1 Score: 524.6 bits (1350), Expect = 2.9e-145
Identity = 258/391 (65.98%), Postives = 305/391 (78.01%), Query Frame = 0

Query: 17  MKGVHL---TRQCLMLLQVGLSPAIQNLSTATSMKMTKVFQAFLMLQPQLVDLIHEMQPD 76
           M G H+   T  C   L+VGLSPAIQNLSTA S    KV +AFLMLQPQ+  L+ +M+PD
Sbjct: 56  MSGCHIRIHTIPC-PALEVGLSPAIQNLSTARSSVFWKVMKAFLMLQPQIEGLVRDMRPD 115

Query: 77  CIVSDVFYPWTSDVAAELRIPRLAFNGSSFFSYCAEQCIKEHKPHLEVESNNEKFKLPGL 136
           CI++D FYPWT++VAAEL IPRLAFNGS  F+YC EQC+ E KPHLEV+S+N+KF+LPGL
Sbjct: 116 CIIADTFYPWTTEVAAELGIPRLAFNGSGLFAYCGEQCVXEXKPHLEVQSDNDKFQLPGL 175

Query: 137 PDVIEMVRSELPSWITRHKPDGFSQLLDVIRESEKRCYGMLMNRFHELEASYEEHLNKII 196
           PD +EM RS+LPSWIT   PD FS  L   +ESE RCYGMLMN    LE  YE+H N +I
Sbjct: 176 PDFVEMRRSQLPSWIT--NPDAFSLFLGGCKESETRCYGMLMN--SSLENPYEQHFNNVI 235

Query: 197 GIKTWSIGPVSLLANNEIEDKESRGGNPNIQTTNLLQWLNEKEPNSVLYINFGSLIQMSR 256
           G+KTWSIGPVSLLAN E EDKESRGGNPNI+TTNLLQWLN K+PNSVLYI FGSLI++  
Sbjct: 236 GLKTWSIGPVSLLANKEAEDKESRGGNPNIETTNLLQWLNTKQPNSVLYIRFGSLIRVKP 295

Query: 257 NQITEIAHAIQESSQSFIWVI---KKNDEDNDDDIVNKGLQKGFEERMSRTKKGLIIKGW 316
            QI EIAHAI+ES+ +FIWV+               N GL KGFEE M +TK+GLII+GW
Sbjct: 296 QQIAEIAHAIEESAHNFIWVMXXXXXXXXXXXXXXXNSGLPKGFEENMIKTKRGLIIRGW 355

Query: 317 APQLMILEHKSVGGFLTHCGWNSILEGISSGLPMITWPLFAEQFYNEKLLIEVVKIGVGV 376
           AP LMILEH+ VGGF+THCGWNSILEG+SSGLPMITWP+ AEQFYNEKLLI+V+KIGVGV
Sbjct: 356 APLLMILEHEGVGGFVTHCGWNSILEGVSSGLPMITWPMSAEQFYNEKLLIDVLKIGVGV 415

Query: 377 GSKKWWHLGE--EPEIIKREEIGKAIAFFDG 400
           G +KWW+LGE    E++ RE+IGKA++   G
Sbjct: 416 GFRKWWNLGEHRSQEMVTREDIGKAVSLLMG 441

BLAST of CsaV3_1G021870 vs. NCBI nr
Match: XP_022149481.1 (soyasapogenol B glucuronide galactosyltransferase-like [Momordica charantia])

HSP 1 Score: 519.2 bits (1336), Expect = 1.2e-143
Identity = 241/353 (68.27%), Postives = 299/353 (84.70%), Query Frame = 0

Query: 49  MTKVFQAFLMLQPQLVDLIHEMQPDCIVSDVFYPWTSDVAAELRIPRLAFNGSSFFSYCA 108
           ++K+FQAF MLQP L  L+HE++PDCIV+D+FYPWT+  AAEL IPRL F+GSSFF+YC 
Sbjct: 4   LSKIFQAFFMLQPHLQQLLHELRPDCIVADMFYPWTTAAAAELGIPRLGFHGSSFFAYCV 63

Query: 109 EQCIKEHKPHLEVESNNEKFKLPGLPDVIEMVRSELPSWITRHKPDGFSQLLDVIRESEK 168
           E  IKE++PHL+V+S++E+F++PGLPD +EM +S+LP+WI  H  D  SQL D IRESEK
Sbjct: 64  EHSIKEYEPHLQVQSDDEQFQVPGLPDFVEMTKSQLPNWIINH--DELSQLFDGIRESEK 123

Query: 169 RCYGMLMNRFHELEASYEEHLNKIIGIKTWSIGPVSLLANNEIEDKESRGGNPNIQTTNL 228
           R YGMLMN F+E+E  YE+HLNK IGI+TWSIGPVSL AN    DK  RG NP+I+T+ L
Sbjct: 124 RSYGMLMNSFYEMEDPYEQHLNKGIGIRTWSIGPVSLAANKGALDKTHRGENPSIETSKL 183

Query: 229 LQWLNEKEPNSVLYINFGSLIQMSRNQITEIAHAIQESSQSFIWVIKKNDEDNDDDIVNK 288
           LQWL+EKEPNSVLYI+FGSL++M  NQI+EIAHAI+ S+Q+FIWVIKK+D + ++  ++ 
Sbjct: 184 LQWLDEKEPNSVLYISFGSLVRMKPNQISEIAHAIESSNQNFIWVIKKSDGEEEEAAIDG 243

Query: 289 GLQKGFEERMSRTKKGLIIKGWAPQLMILEHKSVGGFLTHCGWNSILEGISSGLPMITWP 348
           GL KGFEE M +TKKGL+I+GWAPQLMILEH++VGGF+THCGWNSILEG+SSGLPMITWP
Sbjct: 244 GLPKGFEENMKKTKKGLVIRGWAPQLMILEHEAVGGFITHCGWNSILEGVSSGLPMITWP 303

Query: 349 LFAEQFYNEKLLIEVVKIGVG-VGSKKWWHLGEE-PEIIKREEIGKAIAFFDG 400
           LFAEQFYNEKLLI+V+KIGVG +GSKKWW LGEE  E+IKRE+IG A+ F  G
Sbjct: 304 LFAEQFYNEKLLIDVLKIGVGIIGSKKWWDLGEERGEVIKREDIGNAVRFLMG 354

BLAST of CsaV3_1G021870 vs. NCBI nr
Match: XP_008457740.2 (PREDICTED: soyasapogenol B glucuronide galactosyltransferase-like [Cucumis melo])

HSP 1 Score: 439.5 bits (1129), Expect = 1.2e-119
Identity = 213/246 (86.59%), Postives = 229/246 (93.09%), Query Frame = 0

Query: 31  QVGLSPAIQNLSTATSMKMTKVFQAFLMLQPQLVDLIHEMQPDCIVSDVFYPWTSDVAAE 90
           +VGLSPAIQNLSTAT M M+KVFQ FLMLQPQL  LIHEM+PDCI+SDVFYPWTSDVAAE
Sbjct: 65  EVGLSPAIQNLSTATPMTMSKVFQVFLMLQPQLRGLIHEMRPDCIISDVFYPWTSDVAAE 124

Query: 91  LRIPRLAFNGSSFFSYCAEQCIKEHKPHLEVESNNEKFKLPGLPDVIEMVRSELPSWITR 150
           L IPRLAFNGSS+F YCAEQC+KEHKPHLEVESNNEKFKLPGLPDV+EM+RSELPSWI R
Sbjct: 125 LGIPRLAFNGSSYFGYCAEQCMKEHKPHLEVESNNEKFKLPGLPDVVEMMRSELPSWIAR 184

Query: 151 HKPDGFSQLLDVIRESEKRCYGMLMNRFHELEASYEEHLNKIIGIKTWSIGPVSLLANNE 210
              D FS+LLDVIRESEKRCYGMLMN F+ELE SYEEH NKIIGIKTWSIGPVSLLAN E
Sbjct: 185 E--DDFSRLLDVIRESEKRCYGMLMNSFYELEGSYEEHSNKIIGIKTWSIGPVSLLANKE 244

Query: 211 IEDKESRGGNPNIQTTNLLQWLNEKEPNSVLYINFGSLIQMSRNQITEIAHAIQESSQSF 270
           IEDKESRGGNPNIQTTNLLQWLNEKEPNSVLYINFGSL+QM+ NQ+TEIAHAIQ+SSQ+F
Sbjct: 245 IEDKESRGGNPNIQTTNLLQWLNEKEPNSVLYINFGSLVQMNPNQLTEIAHAIQKSSQNF 304

Query: 271 IWVIKK 277
           IWVIK+
Sbjct: 305 IWVIKR 308

BLAST of CsaV3_1G021870 vs. NCBI nr
Match: XP_022947843.1 (LOW QUALITY PROTEIN: soyasapogenol B glucuronide galactosyltransferase-like [Cucurbita moschata])

HSP 1 Score: 425.6 bits (1093), Expect = 1.8e-115
Identity = 209/321 (65.11%), Postives = 249/321 (77.57%), Query Frame = 0

Query: 82  PWTS-DVAAELRIPRLAFNGSSFFSYCAEQCIKEHKPHLEVESNNEKFKLPGLPDVIEMV 141
           PW + +VAA+L I RLAFN S F +YC EQC+KEHKPHLEV+S+++KF+LPGLPD +EM 
Sbjct: 73  PWPALEVAAQLGIRRLAFNXSGFLAYCGEQCVKEHKPHLEVQSDDDKFQLPGLPDFVEMR 132

Query: 142 RSELPSWITRHKPDGFSQLLDVIRESEKRCYGMLMNRFHELEASYEEHLNKIIGIKTWSI 201
           RS+LPSWIT   P+ FS  L V +ESEKRCYGMLMN F+ELE  YE+H NK+IGIKTWSI
Sbjct: 133 RSQLPSWIT--NPNEFSLFLGVCKESEKRCYGMLMNSFYELENPYEQHFNKVIGIKTWSI 192

Query: 202 GPVSLLANNEIEDKESRGGNPNIQTTNLLQWLNEKEPNSVLYINFGSLIQMSRNQITEIA 261
           GPVSLLAN E EDKESRGGNPNI+TTNLLQWLN K+PNSVLYI+FGSLI++   QI+EIA
Sbjct: 193 GPVSLLANKEAEDKESRGGNPNIETTNLLQWLNTKQPNSVLYISFGSLIRVKPQQISEIA 252

Query: 262 HAIQESSQSFIWVIKKNDEDNDDDIVNKGLQKGFEERMSRTKKGLIIKGWAPQLMILEHK 321
           HA++ES+ +FIWVI K              Q                   APQLMILEH+
Sbjct: 253 HAVEESAHNFIWVIIKXXXXXXXXXXXXXXQ-------------------APQLMILEHE 312

Query: 322 SVGGFLTHCGWNSILEGISSGLPMITWPLFAEQFYNEKLLIEVVKIGVGVGSKKWWHLGE 381
           +VGGF+THCGWNSILEG+SSGLPMITWP+ AEQFYNEK LI+V+KIGVGVGS+KWW+LGE
Sbjct: 313 AVGGFVTHCGWNSILEGVSSGLPMITWPMSAEQFYNEK-LIDVLKIGVGVGSRKWWNLGE 371

Query: 382 E--PEIIKREEIGKAIAFFDG 400
           E   E++ RE+IGKA+A   G
Sbjct: 373 ERSQEMVTREDIGKAVALLMG 371

BLAST of CsaV3_1G021870 vs. NCBI nr
Match: XP_021903518.1 (scopoletin glucosyltransferase-like [Carica papaya])

HSP 1 Score: 385.2 bits (988), Expect = 2.7e-103
Identity = 191/372 (51.34%), Postives = 259/372 (69.62%), Query Frame = 0

Query: 31  QVGLSPAIQNLSTATSMKMT-KVFQAFLMLQPQLVDLIHEMQPDCIVSDVFYPWTSDVAA 90
           + GL    +NL+ A + KMT K+F A  MLQPQ+  +  E +PDCIVSD  +PWT DVA 
Sbjct: 74  EFGLPEGCENLTAAPTPKMTVKLFHAIDMLQPQVEQVFRECRPDCIVSDYLFPWTVDVAI 133

Query: 91  ELRIPRLAFNGSSFFSYCAEQCIKEHKPHLEVESNNEKFKLPGLPDVIEMVRSELPSWIT 150
           EL IPRLAFNGS FF+ C    ++ +KPH+ V S  E F +PGLPD I   RS+LP  I 
Sbjct: 134 ELGIPRLAFNGSGFFNLCVAHNLEHYKPHMNVVSETESFVVPGLPDQITFTRSQLPD-IV 193

Query: 151 RHKPDGFSQLLDVIRESEKRCYGMLMNRFHELEASYEEHLNKIIGIKTWSIGPVSLLANN 210
           + K   FS L D ++E EK+ +G+L+N F+ELE +Y +H  K+IG+K W IGP SLL N+
Sbjct: 194 KTK-TRFSALFDKLKEVEKKSFGVLVNSFYELEPAYIDHSRKVIGLKVWPIGPASLLNND 253

Query: 211 EIEDKESRGGNPNIQTTNLLQWLNEKEPNSVLYINFGSLIQMSRNQITEIAHAIQESSQS 270
           E   K  RG  P+I   + L WL+ KEP SV+YI FGSLI+ +++QITE+A A++ES  S
Sbjct: 254 E---KAERGDKPSIPKHSCLSWLDSKEPTSVVYICFGSLIRFTKSQITEMASALEESGHS 313

Query: 271 FIWVIKK-----NDEDNDDDIVNKGLQKGFEERMSRTKKGLIIKGWAPQLMILEHKSVGG 330
           F+W I K     +  +ND    N  L +GFEE+++++ +GLII+GWAPQ++IL H ++GG
Sbjct: 314 FVWTIGKVLTVDDTHNNDQQQQNSWLPEGFEEKVTQSGQGLIIRGWAPQVLILTHPAIGG 373

Query: 331 FLTHCGWNSILEGISSGLPMITWPLFAEQFYNEKLLIEVVKIGVGVGSKKW--WHLGEEP 390
           FLTHCGWNSI+EGIS+G+PM+TWP+FAEQFYNEKL+ + ++ GV VG+  W  W   E P
Sbjct: 374 FLTHCGWNSIMEGISAGIPMVTWPIFAEQFYNEKLVTQALRFGVSVGNDTWKLWATEESP 433

Query: 391 EIIKREEIGKAI 395
            +I +E+I KAI
Sbjct: 434 -LINKEKIKKAI 439

BLAST of CsaV3_1G021870 vs. TAIR10
Match: AT2G15480.1 (UDP-glucosyl transferase 73B5)

HSP 1 Score: 302.8 bits (774), Expect = 3.2e-82
Identity = 151/372 (40.59%), Postives = 238/372 (63.98%), Query Frame = 0

Query: 30  LQVGLSPAIQNLSTATSMKMTKVFQAFL-------MLQPQLVDLIHEMQPDCIVSDVFYP 89
           +++GL    +N     S + +     FL        ++ QL   I   +P  +V+D+F+P
Sbjct: 77  VELGLPEGCENADFINSYQKSDSGDLFLKFLFSTKYMKQQLESFIETTKPSALVADMFFP 136

Query: 90  WTSDVAAELRIPRLAFNGSSFFSYCAEQCIKEHKPHLEVESNNEKFKLPGLPDVIEMVRS 149
           W ++ A +L +PRL F+G+SFFS C    ++ HKPH +V +++  F +PGLP  I ++  
Sbjct: 137 WATESAEKLGVPRLVFHGTSFFSLCCSYNMRIHKPHKKVATSSTPFVIPGLPGDI-VITE 196

Query: 150 ELPSWITRHKPDGFSQLLDVIRESEKRCYGMLMNRFHELEASYEEHLNKIIGIKTWSIGP 209
           +  +      P G  + +  +RESE   +G+L+N F+ELE++Y +     +  + W IGP
Sbjct: 197 DQANVAKEETPMG--KFMKEVRESETNSFGVLVNSFYELESAYADFYRSFVAKRAWHIGP 256

Query: 210 VSLLANNEIEDKESRGGNPNIQTTNLLQWLNEKEPNSVLYINFGSLIQMSRNQITEIAHA 269
           +S L+N E+ +K  RG   NI     L+WL+ K P SV+Y++FGS    + +Q+ EIA  
Sbjct: 257 LS-LSNRELGEKARRGKKANIDEQECLKWLDSKTPGSVVYLSFGSGTNFTNDQLLEIAFG 316

Query: 270 IQESSQSFIWVIKKNDEDNDDDIVNKGLQKGFEERMSRTKKGLIIKGWAPQLMILEHKSV 329
           ++ S QSFIWV++KN+   D++   + L +GF+ER   T KGLII GWAPQ++IL+HK++
Sbjct: 317 LEGSGQSFIWVVRKNENQGDNE---EWLPEGFKER--TTGKGLIIPGWAPQVLILDHKAI 376

Query: 330 GGFLTHCGWNSILEGISSGLPMITWPLFAEQFYNEKLLIEVVKIGVGVGSKKWWHLGEEP 389
           GGF+THCGWNS +EGI++GLPM+TWP+ AEQFYNEKLL +V++IGV VG+ +   L ++ 
Sbjct: 377 GGFVTHCGWNSAIEGIAAGLPMVTWPMGAEQFYNEKLLTKVLRIGVNVGATE---LVKKG 436

Query: 390 EIIKREEIGKAI 395
           ++I R ++ KA+
Sbjct: 437 KLISRAQVEKAV 436

BLAST of CsaV3_1G021870 vs. TAIR10
Match: AT2G15490.1 (UDP-glycosyltransferase 73B4)

HSP 1 Score: 300.1 bits (767), Expect = 2.1e-81
Identity = 153/372 (41.13%), Postives = 233/372 (62.63%), Query Frame = 0

Query: 30  LQVGLSPAIQNLSTATSMKMTKVFQAFL-------MLQPQLVDLIHEMQPDCIVSDVFYP 89
           +++GL    +N     S + +  F  FL        ++ QL   I   +P  +V+D+F+P
Sbjct: 74  VELGLPEGCENRDFINSYQKSDSFDLFLKFLFSTKYMKQQLESFIETTKPSALVADMFFP 133

Query: 90  WTSDVAAELRIPRLAFNGSSFFSYCAEQCIKEHKPHLEVESNNEKFKLPGLPDVIEMVRS 149
           W ++ A ++ +PRL F+G+S F+ C    ++ HKPH +V S++  F +PGLP   ++V +
Sbjct: 134 WATESAEKIGVPRLVFHGTSSFALCCSYNMRIHKPHKKVASSSTPFVIPGLPG--DIVIT 193

Query: 150 ELPSWITRHKPDGFSQLLDVIRESEKRCYGMLMNRFHELEASYEEHLNKIIGIKTWSIGP 209
           E  + +T  +   F +    +RESE   +G+L+N F+ELE+SY +     +  K W IGP
Sbjct: 194 EDQANVTNEETP-FGKFWKEVRESETSSFGVLVNSFYELESSYADFYRSFVAKKAWHIGP 253

Query: 210 VSLLANNEIEDKESRGGNPNIQTTNLLQWLNEKEPNSVLYINFGSLIQMSRNQITEIAHA 269
           +S L+N  I +K  RG   NI     L+WL+ K P SV+Y++FGS   +   Q+ EIA  
Sbjct: 254 LS-LSNRGIAEKAGRGKKANIDEQECLKWLDSKTPGSVVYLSFGSGTGLPNEQLLEIAFG 313

Query: 270 IQESSQSFIWVIKKNDEDNDDDIVNKGLQKGFEERMSRTKKGLIIKGWAPQLMILEHKSV 329
           ++ S Q+FIWV+ KN+           L KGFEER     KGLII+GWAPQ++IL+HK++
Sbjct: 314 LEGSGQNFIWVVSKNENQVGTGENEDWLPKGFEER--NKGKGLIIRGWAPQVLILDHKAI 373

Query: 330 GGFLTHCGWNSILEGISSGLPMITWPLFAEQFYNEKLLIEVVKIGVGVGSKKWWHLGEEP 389
           GGF+THCGWNS LEGI++GLPM+TWP+ AEQFYNEKLL +V++IGV VG+ +   L ++ 
Sbjct: 374 GGFVTHCGWNSTLEGIAAGLPMVTWPMGAEQFYNEKLLTKVLRIGVNVGATE---LVKKG 433

Query: 390 EIIKREEIGKAI 395
           ++I R ++ KA+
Sbjct: 434 KLISRAQVEKAV 436

BLAST of CsaV3_1G021870 vs. TAIR10
Match: AT4G34135.1 (UDP-glucosyltransferase 73B2)

HSP 1 Score: 297.4 bits (760), Expect = 1.4e-80
Identity = 139/333 (41.74%), Postives = 214/333 (64.26%), Query Frame = 0

Query: 62  QLVDLIHEMQPDCIVSDVFYPWTSDVAAELRIPRLAFNGSSFFSYCAEQCIKEHKPHLEV 121
           QL  L+   +PDC+++D+F+PW ++ A +  +PRL F+G+ +FS CA  CI  HKP   V
Sbjct: 117 QLEKLLGTTRPDCLIADMFFPWATEAAGKFNVPRLVFHGTGYFSLCAGYCIGVHKPQKRV 176

Query: 122 ESNNEKFKLPGLPDVIEMVRSELPSWITRHKPDGFSQLLDVIRESEKRCYGMLMNRFHEL 181
            S++E F +P LP  I +   ++   I         + +  +RESE +  G+++N F+EL
Sbjct: 177 ASSSEPFVIPELPGNIVITEEQI---IDGDGESDMGKFMTEVRESEVKSSGVVLNSFYEL 236

Query: 182 EASYEEHLNKIIGIKTWSIGPVSLLANNEIEDKESRGGNPNIQTTNLLQWLNEKEPNSVL 241
           E  Y +     +  + W IGP+S+  N   E+K  RG   NI     L+WL+ K+PNSV+
Sbjct: 237 EHDYADFYKSCVQKRAWHIGPLSVY-NRGFEEKAERGKKANIDEAECLKWLDSKKPNSVI 296

Query: 242 YINFGSLIQMSRNQITEIAHAIQESSQSFIWVIKKNDEDNDDDIVNKGLQKGFEERMSRT 301
           Y++FGS+      Q+ EIA  ++ S  SFIWV++K  +D ++      L +GFEER+   
Sbjct: 297 YVSFGSVAFFKNEQLFEIAAGLEASGTSFIWVVRKTKDDREE-----WLPEGFEERVK-- 356

Query: 302 KKGLIIKGWAPQLMILEHKSVGGFLTHCGWNSILEGISSGLPMITWPLFAEQFYNEKLLI 361
            KG+II+GWAPQ++IL+H++ GGF+THCGWNS+LEG+++GLPM+TWP+ AEQFYNEKL+ 
Sbjct: 357 GKGMIIRGWAPQVLILDHQATGGFVTHCGWNSLLEGVAAGLPMVTWPVGAEQFYNEKLVT 416

Query: 362 EVVKIGVGVGSKKWWHLGEEPEIIKREEIGKAI 395
           +V++ GV VG+ K   +    + I RE++ KA+
Sbjct: 417 QVLRTGVSVGASKHMKV-MMGDFISREKVDKAV 437

BLAST of CsaV3_1G021870 vs. TAIR10
Match: AT4G34131.1 (UDP-glucosyl transferase 73B3)

HSP 1 Score: 289.3 bits (739), Expect = 3.7e-78
Identity = 141/372 (37.90%), Postives = 230/372 (61.83%), Query Frame = 0

Query: 30  LQVGLSPAIQNLSTATSMK-------MTKVFQAFLMLQPQLVDLIHEMQPDCIVSDVFYP 89
           + +GL    +N+   TS           K F++    + QL  L+   +PDC+++D+F+P
Sbjct: 77  VDLGLPEGCENVDFFTSNNNDDRQYLTLKFFKSTRFFKDQLEKLLETTRPDCLIADMFFP 136

Query: 90  WTSDVAAELRIPRLAFNGSSFFSYCAEQCIKEHKPHLEVESNNEKFKLPGLPDVIEMVRS 149
           W ++ A +  +PRL F+G+ +FS C+E CI+ H P   V S  E F +P LP  I + + 
Sbjct: 137 WATEAAEKFNVPRLVFHGTGYFSLCSEYCIRVHNPQNIVASRYEPFVIPDLPGNIVITQE 196

Query: 150 ELPSWITRHKPDGFSQLLDVIRESEKRCYGMLMNRFHELEASYEEHLNKIIGIKTWSIGP 209
           ++     R +     + +  ++ES+ +  G+++N F+ELE  Y +    ++  + W IGP
Sbjct: 197 QIAD---RDEESEMGKFMIEVKESDVKSSGVIVNSFYELEPDYADFYKSVVLKRAWHIGP 256

Query: 210 VSLLANNEIEDKESRGGNPNIQTTNLLQWLNEKEPNSVLYINFGSLIQMSRNQITEIAHA 269
           +S+  N   E+K  RG   +I     L+WL+ K+P+SV+YI+FGS+      Q+ EIA  
Sbjct: 257 LSVY-NRGFEEKAERGKKASINEVECLKWLDSKKPDSVIYISFGSVACFKNEQLFEIAAG 316

Query: 270 IQESSQSFIWVIKKNDEDNDDDIVNKGLQKGFEERMSRTKKGLIIKGWAPQLMILEHKSV 329
           ++ S  +FIWV++KN     ++     L +GFEER+    KG+II+GWAPQ++IL+H++ 
Sbjct: 317 LETSGANFIWVVRKNIGIEKEE----WLPEGFEERVK--GKGMIIRGWAPQVLILDHQAT 376

Query: 330 GGFLTHCGWNSILEGISSGLPMITWPLFAEQFYNEKLLIEVVKIGVGVGSKKWWHLGEEP 389
            GF+THCGWNS+LEG+++GLPM+TWP+ AEQFYNEKL+ +V++ GV VG+KK  ++    
Sbjct: 377 CGFVTHCGWNSLLEGVAAGLPMVTWPVAAEQFYNEKLVTQVLRTGVSVGAKK--NVRTTG 436

Query: 390 EIIKREEIGKAI 395
           + I RE++ KA+
Sbjct: 437 DFISREKVVKAV 436

BLAST of CsaV3_1G021870 vs. TAIR10
Match: AT2G36790.1 (UDP-glucosyl transferase 73C6)

HSP 1 Score: 280.0 bits (715), Expect = 2.2e-75
Identity = 143/371 (38.54%), Postives = 235/371 (63.34%), Query Frame = 0

Query: 31  QVGLSPAIQNLSTATSM-KMTKVFQAFLMLQPQLVDLIHEM--QPDCIVSDVFYPWTSDV 90
           + GL    +N+   T+M ++T  F+A  +L+  + +LI EM  +P C++SD+   +TS++
Sbjct: 79  EAGLQEGQENMDLLTTMEQITSFFKAVNLLKEPVQNLIEEMSPRPSCLISDMCLSYTSEI 138

Query: 91  AAELRIPRLAFNGSSFFSYCAEQCIKEHKPHLE-VESNNEKFKLPGLPDVIEMVRSELPS 150
           A + +IP++ F+G   F       +++++  L+ ++S+ E F +P  PD +E  R ++P 
Sbjct: 139 AKKFKIPKILFHGMGCFCLLCVNVLRKNREILDNLKSDKEYFIVPYFPDRVEFTRPQVP- 198

Query: 151 WITRHKPDGFSQLLDVIRESEKRCYGMLMNRFHELEASYEEHLNKIIGIKTWSIGPVSLL 210
            +  + P G+ ++L+ + E++K  YG+++N F ELE +Y +   +    K W+IGPVS L
Sbjct: 199 -VETYVPAGWKEILEDMVEADKTSYGVIVNSFQELEPAYAKDFKEARSGKAWTIGPVS-L 258

Query: 211 ANNEIEDKESRGGNPNIQTTNLLQWLNEKEPNSVLYINFGSLIQMSRNQITEIAHAIQES 270
            N    DK  RG   +I     L+WL+ KEP SVLY+  GS+  +  +Q+ E+   ++ES
Sbjct: 259 CNKVGVDKAERGNKSDIDQDECLEWLDSKEPGSVLYVCLGSICNLPLSQLLELGLGLEES 318

Query: 271 SQSFIWVIKKNDEDNDDDIVNKGLQKGFEERMSRTKKGLIIKGWAPQLMILEHKSVGGFL 330
            + FIWVI+    +   ++V    + GFE+R+    +GL+IKGW+PQ++IL H SVGGFL
Sbjct: 319 QRPFIWVIR--GWEKYKELVEWFSESGFEDRIQ--DRGLLIKGWSPQMLILSHPSVGGFL 378

Query: 331 THCGWNSILEGISSGLPMITWPLFAEQFYNEKLLIEVVKIGVGVGSKKWWHLGEEPEI-- 390
           THCGWNS LEGI++GLPM+TWPLFA+QF NEKL+++++K+GV    K+    GEE +I  
Sbjct: 379 THCGWNSTLEGITAGLPMLTWPLFADQFCNEKLVVQILKVGVSAEVKEVMKWGEEEKIGV 438

Query: 391 -IKREEIGKAI 395
            + +E + KA+
Sbjct: 439 LVDKEGVKKAV 442

BLAST of CsaV3_1G021870 vs. Swiss-Prot
Match: sp|D4Q9Z4|SGT2_SOYBN (Soyasapogenol B glucuronide galactosyltransferase OS=Glycine max OX=3847 GN=GmSGT2 PE=1 SV=1)

HSP 1 Score: 364.4 bits (934), Expect = 1.6e-99
Identity = 169/368 (45.92%), Postives = 256/368 (69.57%), Query Frame = 0

Query: 31  QVGLSPAIQNLSTATSMKMT-KVFQAFLMLQPQLVDLIHEMQPDCIVSDVFYPWTSDVAA 90
           QVGL   I+  +  T  +MT +++    +LQ     L H++QPD IV+D+F+PW+ D AA
Sbjct: 75  QVGLPVGIEAFNVDTPREMTPRIYMGLSLLQQVFEKLFHDLQPDFIVTDMFHPWSVDAAA 134

Query: 91  ELRIPRLAFNGSSFFSYCAEQCIKEHKPHLEVESNNEKFKLPGLPDVIEMVRSELPSWIT 150
           +L IPR+ F+G+S+ +  A   ++++ PHLE + + +KF LPGLPD +EM R +LP W+ 
Sbjct: 135 KLGIPRIMFHGASYLARSAAHSVEQYAPHLEAKFDTDKFVLPGLPDNLEMTRLQLPDWL- 194

Query: 151 RHKPDGFSQLLDVIRESEKRCYGMLMNRFHELEASYEEHLNKIIGIKTWSIGPVSLLANN 210
              P+ +++L+  I++SEK+ YG L N F++LE++Y EH   I+G K+W IGPVSL AN 
Sbjct: 195 -RSPNQYTELMRTIKQSEKKSYGSLFNSFYDLESAYYEHYKSIMGTKSWGIGPVSLWANQ 254

Query: 211 EIEDKESRG-GNPNIQTTNLLQWLNEKEPNSVLYINFGSLIQMSRNQITEIAHAIQESSQ 270
           + +DK +RG      +    L+WLN K  +SVLY++FGS+ +   +Q+ EIA A+++S  
Sbjct: 255 DAQDKAARGYAKEEEEKEGWLKWLNSKAESSVLYVSFGSINKFPYSQLVEIARALEDSGH 314

Query: 271 SFIWVIKKNDEDNDDDIVNKGLQKGFEERMSRTKKGLIIKGWAPQLMILEHKSVGGFLTH 330
            FIWV++KND    D+ + +     FE+RM  + KG +I GWAPQL+ILE+ ++GG +TH
Sbjct: 315 DFIWVVRKNDGGEGDNFLEE-----FEKRMKESNKGYLIWGWAPQLLILENPAIGGLVTH 374

Query: 331 CGWNSILEGISSGLPMITWPLFAEQFYNEKLLIEVVKIGVGVGSKKWWHLGE-EPEIIKR 390
           CGWN+++E +++GLPM TWPLFAE F+NEKL+++V+KIGV VG+K+W +  E   E++KR
Sbjct: 375 CGWNTVVESVNAGLPMATWPLFAEHFFNEKLVVDVLKIGVPVGAKEWRNWNEFGSEVVKR 434

Query: 391 EEIGKAIA 396
           EEIG AIA
Sbjct: 435 EEIGNAIA 435

BLAST of CsaV3_1G021870 vs. Swiss-Prot
Match: sp|Q9AT54|SCGT_TOBAC (Scopoletin glucosyltransferase OS=Nicotiana tabacum OX=4097 GN=TOGT1 PE=1 SV=1)

HSP 1 Score: 342.0 bits (876), Expect = 8.7e-93
Identity = 160/347 (46.11%), Postives = 237/347 (68.30%), Query Frame = 0

Query: 48  KMTKVFQAFLMLQPQLVDLIHEMQPDCIVSDVFYPWTSDVAAELRIPRLAFNGSSFFSYC 107
           K+   F+A  M+Q  L  LI E +PDC++SD+F PWT+D AA+  IPR+ F+G+SFF+ C
Sbjct: 89  KLPNFFKAVAMMQEPLEQLIEECRPDCLISDMFLPWTTDTAAKFNIPRIVFHGTSFFALC 148

Query: 108 AEQCIKEHKPHLEVESNNEKFKLPGLPDVIEMVRSELPSWITRHKPDGFSQLLDVIRESE 167
            E  ++ +KP   V S++E F +P LP  I++ R+++  +    +    ++++  +RES+
Sbjct: 149 VENSVRLNKPFKNVSSDSETFVVPDLPHEIKLTRTQVSPFERSGEETAMTRMIKTVRESD 208

Query: 168 KRCYGMLMNRFHELEASYEEHLNKIIGIKTWSIGPVSLLANNEIEDKESRGGNPNIQTTN 227
            + YG++ N F+ELE  Y EH  K++G + W+IGP+S + N +IEDK  RG   +I    
Sbjct: 209 SKSYGVVFNSFYELETDYVEHYTKVLGRRAWAIGPLS-MCNRDIEDKAERGKKSSIDKHE 268

Query: 228 LLQWLNEKEPNSVLYINFGSLIQMSRNQITEIAHAIQESSQSFIWVIKKNDEDNDDDIVN 287
            L+WL+ K+P+SV+Y+ FGS+   + +Q+ E+A  I+ S Q FIWV+ + + DN+D    
Sbjct: 269 CLKWLDSKKPSSVVYVCFGSVANFTASQLHELAMGIEASGQEFIWVV-RTELDNED---- 328

Query: 288 KGLQKGFEERMSRTKKGLIIKGWAPQLMILEHKSVGGFLTHCGWNSILEGISSGLPMITW 347
             L +GFEER    +KGLII+GWAPQ++IL+H+SVG F+THCGWNS LEG+S G+PM+TW
Sbjct: 329 -WLPEGFEERTK--EKGLIIRGWAPQVLILDHESVGAFVTHCGWNSTLEGVSGGVPMVTW 388

Query: 348 PLFAEQFYNEKLLIEVVKIGVGVGSKKWWHLGEEPEIIKREEIGKAI 395
           P+FAEQF+NEKL+ EV+K G GVGS +W     E   +KRE I KAI
Sbjct: 389 PVFAEQFFNEKLVTEVLKTGAGVGSIQWKRSASEG--VKREAIAKAI 424

BLAST of CsaV3_1G021870 vs. Swiss-Prot
Match: sp|Q2V6J9|UFOG7_FRAAN (UDP-glucose flavonoid 3-O-glucosyltransferase 7 OS=Fragaria ananassa OX=3747 GN=GT7 PE=1 SV=1)

HSP 1 Score: 322.4 bits (825), Expect = 7.1e-87
Identity = 157/350 (44.86%), Postives = 232/350 (66.29%), Query Frame = 0

Query: 31  QVGLSPAIQNLSTATSMKMT-KVFQAFLMLQPQLVDLIHEMQPDCIVSDVFYPWTSDVAA 90
           + GL    ++    T+  M  K  +A  +++P    ++ E +P C+V+D F+ W +DVAA
Sbjct: 73  EAGLPQDCESADLITTQDMLGKFVKATFLIEPHFEKILDEHRPHCLVADAFFTWATDVAA 132

Query: 91  ELRIPRLAFNGSSFFSYCAEQCIKEHKPHLEVESNNEKFKLPGLPDVIEMVRSELPSWIT 150
           + RIPRL F+G+ FF+ CA   +  ++PH  + S++E F +P LPD I+M RS+LP +  
Sbjct: 133 KFRIPRLYFHGTGFFALCASLSVMMYQPHSNLSSDSESFVIPNLPDEIKMTRSQLPVF-- 192

Query: 151 RHKPD--GFSQLLDVIRESEKRCYGMLMNRFHELEASYEEHLNKIIGIKTWSIGPVSLLA 210
              PD   F ++L    E E+R YG+++N F+ELE +Y  H  K+ G K W IGPVS   
Sbjct: 193 ---PDESEFMKMLKASIEIEERSYGVIVNSFYELEPAYANHYRKVFGRKAWHIGPVS-FC 252

Query: 211 NNEIEDKESRGG--NPNIQTTNLLQWLNEKEPNSVLYINFGSLIQMSRNQITEIAHAIQE 270
           N  IEDK  RG   +   +    L+WL+ K+P SV+Y++FGS+++ + +Q+ EIA  ++ 
Sbjct: 253 NKAIEDKAERGSIKSSTAEKHECLKWLDSKKPRSVVYVSFGSMVRFADSQLLEIATGLEA 312

Query: 271 SSQSFIWVIKKNDEDNDDDIVNKGLQKGFEERMSRTKKGLIIKGWAPQLMILEHKSVGGF 330
           S Q FIWV+KK  ++     V + L +GFE+RM    KGLII+ WAPQ++ILEH+++G F
Sbjct: 313 SGQDFIWVVKKEKKE-----VEEWLPEGFEKRME--GKGLIIRDWAPQVLILEHEAIGAF 372

Query: 331 LTHCGWNSILEGISSGLPMITWPLFAEQFYNEKLLIEVVKIGVGVGSKKW 376
           +THCGWNSILE +S+G+PMITWP+F EQFYNEKL+ E+ +IGV VGS+KW
Sbjct: 373 VTHCGWNSILEAVSAGVPMITWPVFGEQFYNEKLVTEIHRIGVPVGSEKW 409

BLAST of CsaV3_1G021870 vs. Swiss-Prot
Match: sp|Q8H0F2|ANGT_GENTR (Anthocyanin 3'-O-beta-glucosyltransferase OS=Gentiana triflora OX=55190 PE=1 SV=1)

HSP 1 Score: 319.3 bits (817), Expect = 6.0e-86
Identity = 152/366 (41.53%), Postives = 236/366 (64.48%), Query Frame = 0

Query: 31  QVGLSPAIQNLSTATSM-KMTKVFQAFLMLQPQLVDLIHEMQPDCIVSDVFYPWTSDVAA 90
           + GL    +    A S+  M + F+A ++LQ  L +L+ E +P  +V+D+F+ W +D AA
Sbjct: 71  EFGLPEGYETADQARSIDMMDEFFRACILLQEPLEELLKEHRPQALVADLFFYWANDAAA 130

Query: 91  ELRIPRLAFNGSSFFSYCAEQCIKEHKPHLEVESNNEKFKLPGLPDVIEMVRSELPS-WI 150
           +  IPRL F+GSS F+  A + ++ +KP+  + S+++ F +P +PD I + +S++P+   
Sbjct: 131 KFGIPRLLFHGSSSFAMIAAESVRRNKPYKNLSSDSDPFVVPDIPDKIILTKSQVPTPDE 190

Query: 151 TRHKPDGFSQLLDVIRESEKRCYGMLMNRFHELEASYEEHLNKIIGIKTWSIGPVSLLAN 210
           T       +++   I ESE  CYG+++N F+ELE  Y ++   ++G + W IGP+S L N
Sbjct: 191 TEENNTHITEMWKNISESENDCYGVIVNSFYELEPDYVDYCKNVLGRRAWHIGPLS-LCN 250

Query: 211 NEIEDKESRGGNPNIQTTNLLQWLNEKEPNSVLYINFGSLIQMSRNQITEIAHAIQESSQ 270
           NE ED   RG   +I     L WL+ K P+SV+Y+ FGS+   +  Q+ E+A  ++ES Q
Sbjct: 251 NEGEDVAERGKKSDIDAHECLNWLDSKNPDSVVYVCFGSMANFNAAQLHELAMGLEESGQ 310

Query: 271 SFIWVIKKNDEDNDDDIVNKGLQKGFEERMSRTKKGLIIKGWAPQLMILEHKSVGGFLTH 330
            FIWV++   ++ D+   +K    GFE+R+    KGLIIKGWAPQ++ILEH++VG F++H
Sbjct: 311 EFIWVVRTCVDEEDE---SKWFPDGFEKRVQENNKGLIIKGWAPQVLILEHEAVGAFVSH 370

Query: 331 CGWNSILEGISSGLPMITWPLFAEQFYNEKLLIEVVKIGVGVGSKKWWHLGEEPEIIKRE 390
           CGWNS LEGI  G+ M+TWPLFAEQFYNEKL+ ++++ GV VGS +W  +     ++KRE
Sbjct: 371 CGWNSTLEGICGGVAMVTWPLFAEQFYNEKLMTDILRTGVSVGSLQWSRVTTSAVVVKRE 430

Query: 391 EIGKAI 395
            I KA+
Sbjct: 431 SISKAV 432

BLAST of CsaV3_1G021870 vs. Swiss-Prot
Match: sp|Q8W3P8|AOG_PHAAN (Abscisate beta-glucosyltransferase OS=Phaseolus angularis OX=3914 GN=AOG PE=1 SV=1)

HSP 1 Score: 307.8 bits (787), Expect = 1.8e-82
Identity = 152/341 (44.57%), Postives = 221/341 (64.81%), Query Frame = 0

Query: 58  MLQPQLVDLIHEMQPDCIVSDVFYPWTSDVAAELRIPRLAFNGSSFFSYCAEQCIKEHKP 117
           +L+P L  L+ + +P CIV D+F+ W+ DV  EL IPR  FNG   F+ C ++ ++ H  
Sbjct: 89  LLEP-LRQLLLQRRPHCIVVDMFHRWSGDVVYELGIPRTLFNGIGCFALCVQENLR-HVA 148

Query: 118 HLEVESNNEKFKLPGLPDVIEMVRSELPSWITRHKPDGFSQLLDVIRESEKRCYGMLMNR 177
              V +++E F +P +PD IEM  S+LP ++    P G  +    +++ E++ +G L+N 
Sbjct: 149 FKSVSTDSEPFLVPNIPDRIEMTMSQLPPFL--RNPSGIPERWRGMKQLEEKSFGTLINS 208

Query: 178 FHELEASYEEHLNKIIGIKTWSIGPVSLLANNEIEDKESRGGNPNIQTTNLLQWLNEKEP 237
           F++LE +Y + +    G K W +GPVS   N   EDK  RG  P I   N L WLN K+P
Sbjct: 209 FYDLEPAYADLIKSKWGNKAWIVGPVS-FCNRSKEDKTERGKPPTIDEQNCLNWLNSKKP 268

Query: 238 NSVLYINFGSLIQMSRNQITEIAHAIQESSQSFIWV---IKKNDEDNDDDIVNKGLQKGF 297
           +SVLY +FGSL ++   Q+ EIA+ ++ S QSFIWV   I  N  +N ++     L +GF
Sbjct: 269 SSVLYASFGSLARLPPEQLKEIAYGLEASEQSFIWVVGNILHNPSENKENGSGNWLPEGF 328

Query: 298 EERMSRTKKGLIIKGWAPQLMILEHKSVGGFLTHCGWNSILEGISSGLPMITWPLFAEQF 357
           E+RM  T KGL+++GWAPQL+ILEH ++ GF+THCGWNS LEG+S+G+PMITWPL AEQF
Sbjct: 329 EQRMKETGKGLVLRGWAPQLLILEHAAIKGFMTHCGWNSTLEGVSAGVPMITWPLTAEQF 388

Query: 358 YNEKLLIEVVKIGVGVGSKKWWHLGEE-PEIIKREEIGKAI 395
            NEKL+ EV+K GV VG+++WW    E   ++ RE++  A+
Sbjct: 389 SNEKLITEVLKTGVQVGNREWWPWNAEWKGLVGREKVEVAV 424

BLAST of CsaV3_1G021870 vs. TrEMBL
Match: tr|A0A1S3C6V9|A0A1S3C6V9_CUCME (soyasapogenol B glucuronide galactosyltransferase-like OS=Cucumis melo OX=3656 GN=LOC103497361 PE=4 SV=1)

HSP 1 Score: 439.5 bits (1129), Expect = 8.1e-120
Identity = 213/246 (86.59%), Postives = 229/246 (93.09%), Query Frame = 0

Query: 31  QVGLSPAIQNLSTATSMKMTKVFQAFLMLQPQLVDLIHEMQPDCIVSDVFYPWTSDVAAE 90
           +VGLSPAIQNLSTAT M M+KVFQ FLMLQPQL  LIHEM+PDCI+SDVFYPWTSDVAAE
Sbjct: 65  EVGLSPAIQNLSTATPMTMSKVFQVFLMLQPQLRGLIHEMRPDCIISDVFYPWTSDVAAE 124

Query: 91  LRIPRLAFNGSSFFSYCAEQCIKEHKPHLEVESNNEKFKLPGLPDVIEMVRSELPSWITR 150
           L IPRLAFNGSS+F YCAEQC+KEHKPHLEVESNNEKFKLPGLPDV+EM+RSELPSWI R
Sbjct: 125 LGIPRLAFNGSSYFGYCAEQCMKEHKPHLEVESNNEKFKLPGLPDVVEMMRSELPSWIAR 184

Query: 151 HKPDGFSQLLDVIRESEKRCYGMLMNRFHELEASYEEHLNKIIGIKTWSIGPVSLLANNE 210
              D FS+LLDVIRESEKRCYGMLMN F+ELE SYEEH NKIIGIKTWSIGPVSLLAN E
Sbjct: 185 E--DDFSRLLDVIRESEKRCYGMLMNSFYELEGSYEEHSNKIIGIKTWSIGPVSLLANKE 244

Query: 211 IEDKESRGGNPNIQTTNLLQWLNEKEPNSVLYINFGSLIQMSRNQITEIAHAIQESSQSF 270
           IEDKESRGGNPNIQTTNLLQWLNEKEPNSVLYINFGSL+QM+ NQ+TEIAHAIQ+SSQ+F
Sbjct: 245 IEDKESRGGNPNIQTTNLLQWLNEKEPNSVLYINFGSLVQMNPNQLTEIAHAIQKSSQNF 304

Query: 271 IWVIKK 277
           IWVIK+
Sbjct: 305 IWVIKR 308

BLAST of CsaV3_1G021870 vs. TrEMBL
Match: tr|A0A2N9FF75|A0A2N9FF75_FAGSY (Glycosyltransferase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS13306 PE=3 SV=1)

HSP 1 Score: 409.8 bits (1052), Expect = 6.9e-111
Identity = 194/371 (52.29%), Postives = 265/371 (71.43%), Query Frame = 0

Query: 31  QVGLSPAIQNLSTATSMKMT-KVFQAFLMLQPQLVDLIHEMQPDCIVSDVFYPWTSDVAA 90
           QV L   I+N +  TS  M+ K++ A  +LQ  +  L  +M+PDCIV+D+FYPWT D A 
Sbjct: 75  QVSLPKGIENFNMITSPDMSHKLYYAVSLLQKPIEQLFQDMRPDCIVTDMFYPWTVDSAN 134

Query: 91  ELRIPRLAFNGSSFFSYCAEQCIKEHKPHLEVESNNEKFKLPGLPDVIEMVRSELPSWIT 150
           +L IPRL F+G+S+FS CA  CIK++ PH  V+SN + F LPGLP+ IEM  S+LP W+ 
Sbjct: 135 KLGIPRLVFHGTSYFSLCAASCIKQYAPHQSVKSNTDTFLLPGLPNKIEMTTSQLPRWV- 194

Query: 151 RHKPDGFSQLLDVIRESEKRCYGMLMNRFHELEASYEEHLNKIIGIKTWSIGPVSLLANN 210
              P+ ++QL+D I+ESE+R YG +MN FHELE++YEEH   ++GIK WS+GP+SL AN+
Sbjct: 195 -RTPEAYTQLMDKIKESEQRSYGAVMNSFHELESAYEEHYKSVMGIKAWSVGPISLWANS 254

Query: 211 EIEDKESRGGNPNIQTTNLLQWLNEKEPNSVLYINFGSLIQMSRNQITEIAHAIQESSQS 270
           +  DK  R GN        L WLN KE NSVLY++FGSL + S +Q+ E+AH ++ S+  
Sbjct: 255 DATDKVER-GNKATTENEWLNWLNSKECNSVLYVSFGSLNKFSTSQLIELAHGLEASNHQ 314

Query: 271 FIWVIKKNDEDNDDDIVNKGLQKGFEERMSRTKKGLIIKGWAPQLMILEHKSVGGFLTHC 330
           FIWV++  ++D D+     G  + FE+R+  + +GLII  WAPQL+ILEH ++GG +THC
Sbjct: 315 FIWVVRLKNKDEDE-----GWLRDFEKRIKESNRGLIIWDWAPQLLILEHPAIGGLVTHC 374

Query: 331 GWNSILEGISSGLPMITWPLFAEQFYNEKLLIEVVKIGVGVGSKKWWHLGEE-PEIIKRE 390
           GWNSILEG+++GLPMITWPL+AEQFYNEKL+ +V+KIGV VG K+W  + EE  E +KRE
Sbjct: 375 GWNSILEGVTAGLPMITWPLYAEQFYNEKLVTDVIKIGVAVGVKEWRKMDEEAKETVKRE 434

Query: 391 EIGKAIAFFDG 400
           EI KA+ F  G
Sbjct: 435 EIEKAVTFLMG 437

BLAST of CsaV3_1G021870 vs. TrEMBL
Match: tr|A0A2N9IJV5|A0A2N9IJV5_FAGSY (Glycosyltransferase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS52113 PE=3 SV=1)

HSP 1 Score: 404.4 bits (1038), Expect = 2.9e-109
Identity = 192/365 (52.60%), Postives = 262/365 (71.78%), Query Frame = 0

Query: 31  QVGLSPAIQNLSTATSMKMT-KVFQAFLMLQPQLVDLIHEMQPDCIVSDVFYPWTSDVAA 90
           QV L   I+N +  TS  M+ K++ A  +LQ  +  L  +M+PDCIV+D+FYPWT D A 
Sbjct: 75  QVSLPKGIENFNMITSPDMSHKLYYAVSLLQKPIEQLFQDMRPDCIVTDMFYPWTVDSAN 134

Query: 91  ELRIPRLAFNGSSFFSYCAEQCIKEHKPHLEVESNNEKFKLPGLPDVIEMVRSELPSWIT 150
           +L IPRL F+G+S+FS CA  CIK++ PH  V+SN + F LPGLP+ IEM  S+LP W+ 
Sbjct: 135 KLGIPRLVFHGTSYFSLCAASCIKQYAPHQSVKSNTDTFLLPGLPNKIEMTTSQLPRWV- 194

Query: 151 RHKPDGFSQLLDVIRESEKRCYGMLMNRFHELEASYEEHLNKIIGIKTWSIGPVSLLANN 210
              P+ ++QL+D I+ESE+R YG +MN FHELE++YEEH   ++GIK WS+GP+SL AN+
Sbjct: 195 -RTPEAYTQLMDKIKESEQRSYGAVMNSFHELESAYEEHYKSVMGIKAWSVGPISLWANS 254

Query: 211 EIEDKESRGGNPNIQTTNLLQWLNEKEPNSVLYINFGSLIQMSRNQITEIAHAIQESSQS 270
           +  DK  R GN        L WLN KE NSVLY++FGSL + S +Q+ E+AH ++ S+  
Sbjct: 255 DATDKVER-GNKATTENEWLNWLNSKECNSVLYVSFGSLNKFSTSQLIELAHGLEASNHQ 314

Query: 271 FIWVIKKNDEDNDDDIVNKGLQKGFEERMSRTKKGLIIKGWAPQLMILEHKSVGGFLTHC 330
           FIWV++  ++D D+     G  + FE+R+  + +GLII  WAPQL+ILEH ++GG +THC
Sbjct: 315 FIWVVRLKNKDEDE-----GWLRDFEKRIKESNRGLIIWDWAPQLLILEHPAIGGLVTHC 374

Query: 331 GWNSILEGISSGLPMITWPLFAEQFYNEKLLIEVVKIGVGVGSKKWWHLGEE-PEIIKRE 390
           GWNSILEG+++GLPMITWPL+AEQFYNEKL+ +V+KIGV VG K+W  + EE  E +KRE
Sbjct: 375 GWNSILEGVTAGLPMITWPLYAEQFYNEKLVTDVIKIGVAVGVKEWRKMDEEAKETVKRE 431

Query: 391 EIGKA 394
           EI KA
Sbjct: 435 EIEKA 431

BLAST of CsaV3_1G021870 vs. TrEMBL
Match: tr|A0A2N9I9F6|A0A2N9I9F6_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS48797 PE=4 SV=1)

HSP 1 Score: 392.5 bits (1007), Expect = 1.1e-105
Identity = 191/374 (51.07%), Postives = 261/374 (69.79%), Query Frame = 0

Query: 31  QVGLSPAIQNLSTATSMKM-TKVFQAFLMLQPQLVDLIHEMQPDCIVSDVFYPWTSDVAA 90
           QVGL   I+N +T TS    +K+     +L+  +  L  +M+PDCIVSD+FYPWT + AA
Sbjct: 76  QVGLPEGIENYNTMTSNDTNSKLLHGLSLLRKPIEQLFQDMRPDCIVSDMFYPWTVESAA 135

Query: 91  ELRIPRLAFNGSSFFSYCAEQCIKEHKPHLEVESNNEKFKLPGLPDVIEMVRSELPSWIT 150
            L IPRL F+ +S+FS+CAE CI+++KPH  V S+ E F +PGLP+ IEM R +LP W+ 
Sbjct: 136 RLGIPRLVFHVTSYFSFCAETCIEQYKPHQSVNSDTEPFLIPGLPNKIEMTRLKLPDWVK 195

Query: 151 RHKPDGFSQLLDVIRESEKRCYGMLMNRFHELEASYEEHLNKIIGIKTWSIGPVSLLANN 210
               D F+QLL++I+ESE+R YG ++N F+ELE  YEE   + +GIK WS+GPVSL  N 
Sbjct: 196 TQ--DRFTQLLNIIKESERRSYGAIVNSFYELEGGYEELHKRNMGIKAWSVGPVSLWVNK 255

Query: 211 EIEDKESRGGN-PNIQTTNLLQWLNEKEPNSVLYINFGSLIQMSRNQITEIAHAIQESSQ 270
           ++ DK  RG      +    L+WLN KE NSVLY++FGS+ +    Q+ E+AH ++ S  
Sbjct: 256 DVADKVERGNKVAPPEEQEWLKWLNAKECNSVLYVSFGSMTKFPTPQLIEMAHGLEASGH 315

Query: 271 SFIWVIKKNDEDNDDDIVNKGLQKGFEERMSRTK-KGLIIKGWAPQLMILEHKSVGGFLT 330
            FIWV+ K D+D D+     G  + F++RM  +K +GLII+GWAPQL+ILEH ++GG +T
Sbjct: 316 QFIWVVPKKDKDQDE-----GWLEDFQKRMKESKHRGLIIRGWAPQLLILEHPAIGGQVT 375

Query: 331 HCGWNSILEGISSGLPMITWPLFAEQFYNEKLLIEVVKIGVGVGSKKW--WHLGEEPEII 390
           HCGWNS LEG+++GLPMITWPLFAEQFY+EKL+ EV+KIGV VG K+W  W   E  E++
Sbjct: 376 HCGWNSFLEGVTAGLPMITWPLFAEQFYHEKLVTEVLKIGVAVGKKEWSRW-ANEAKEVV 435

Query: 391 KREEIGKAIAFFDG 400
           KRE+I KA+ F  G
Sbjct: 436 KREDIEKAVKFLMG 441

BLAST of CsaV3_1G021870 vs. TrEMBL
Match: tr|U5GL99|U5GL99_POPTR (Glycosyltransferase OS=Populus trichocarpa OX=3694 GN=POPTR_002G123700v3 PE=3 SV=1)

HSP 1 Score: 379.4 bits (973), Expect = 9.9e-102
Identity = 185/378 (48.94%), Postives = 260/378 (68.78%), Query Frame = 0

Query: 30  LQVGLSPAIQNLSTATSMKMT-KVFQAFLMLQPQLVDLIHEMQPDCIVSDVFYPWTSDVA 89
           ++ GL    +NLS+ T+ +MT K+F    +LQP +  ++ + +PDCI SD  +PWT DVA
Sbjct: 74  IEAGLPEGCENLSSTTTPEMTLKLFHGIELLQPHIKMILQKHRPDCIASDYLFPWTVDVA 133

Query: 90  AELRIPRLAFNGSSFFSYCAEQCIKEHKPHLEVESNNEKFKLPGLPDVIEMVRSELPSWI 149
            EL IPRLAFNGS FF+ C    I  H+PH  V S  E+F +PGLPD + + RS+LP  I
Sbjct: 134 IELGIPRLAFNGSGFFNLCVANSIDCHQPHNNVTSETEQFVIPGLPDKVTITRSQLPD-I 193

Query: 150 TRHKPDGFSQLLDVIRESEKRCYGMLMNRFHELEASYEEHLNKIIGIKTWSIGPVSLLAN 209
            + + + FS L D ++E+E++ +G+LMN F+ELE +Y +H  K+ GIK W +GPVSL  N
Sbjct: 194 VKGENEVFSALFDKLKEAERKSFGVLMNSFYELEPAYADHFRKVTGIKAWHLGPVSLF-N 253

Query: 210 NEIEDKESRGGNPNIQTTNLLQWLNEKEPNSVLYINFGSLIQMSRNQITEIAHAIQESSQ 269
              +D+  RGG  +I+  + L WL  K+P SVLYI FGSL + S+ QI+EIA A++ES  
Sbjct: 254 RNADDRLERGGKTSIRKHSCLDWLESKKPKSVLYICFGSLTRFSKIQISEIASALEESRH 313

Query: 270 SFIWVIKK-----NDEDNDDDIVNKGLQKGFEERMSRTKKGLIIKGWAPQLMILEHKSVG 329
           SFIW + K     N+++N D   +  L + +E+R+  + KGLIIKGWAPQL+ILEH ++G
Sbjct: 314 SFIWAVGKILKSDNEDNNLDRQQDWWLPEEYEDRLKNSGKGLIIKGWAPQLLILEHPAIG 373

Query: 330 GFLTHCGWNSILEGISSGLPMITWPLFAEQFYNEKLLIEVVKIGVGVGSKKW--WHLGEE 389
           GFLTHCGWNSILEG+ +G PM+TWP+FAEQFYNEKL+ +V+K+GV VG++ W  W   E+
Sbjct: 374 GFLTHCGWNSILEGVCAGQPMVTWPIFAEQFYNEKLITQVLKLGVPVGNETWKVW-ANED 433

Query: 390 PEIIKREEIGKAIAFFDG 400
             +I R++I KA+    G
Sbjct: 434 SPLINRDKIEKAVRIVMG 448

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_023006827.12.9e-14565.98LOW QUALITY PROTEIN: soyasapogenol B glucuronide galactosyltransferase-like [Cuc... [more]
XP_022149481.11.2e-14368.27soyasapogenol B glucuronide galactosyltransferase-like [Momordica charantia][more]
XP_008457740.21.2e-11986.59PREDICTED: soyasapogenol B glucuronide galactosyltransferase-like [Cucumis melo][more]
XP_022947843.11.8e-11565.11LOW QUALITY PROTEIN: soyasapogenol B glucuronide galactosyltransferase-like [Cuc... [more]
XP_021903518.12.7e-10351.34scopoletin glucosyltransferase-like [Carica papaya][more]
Match NameE-valueIdentityDescription
AT2G15480.13.2e-8240.59UDP-glucosyl transferase 73B5[more]
AT2G15490.12.1e-8141.13UDP-glycosyltransferase 73B4[more]
AT4G34135.11.4e-8041.74UDP-glucosyltransferase 73B2[more]
AT4G34131.13.7e-7837.90UDP-glucosyl transferase 73B3[more]
AT2G36790.12.2e-7538.54UDP-glucosyl transferase 73C6[more]
Match NameE-valueIdentityDescription
sp|D4Q9Z4|SGT2_SOYBN1.6e-9945.92Soyasapogenol B glucuronide galactosyltransferase OS=Glycine max OX=3847 GN=GmSG... [more]
sp|Q9AT54|SCGT_TOBAC8.7e-9346.11Scopoletin glucosyltransferase OS=Nicotiana tabacum OX=4097 GN=TOGT1 PE=1 SV=1[more]
sp|Q2V6J9|UFOG7_FRAAN7.1e-8744.86UDP-glucose flavonoid 3-O-glucosyltransferase 7 OS=Fragaria ananassa OX=3747 GN=... [more]
sp|Q8H0F2|ANGT_GENTR6.0e-8641.53Anthocyanin 3'-O-beta-glucosyltransferase OS=Gentiana triflora OX=55190 PE=1 SV=... [more]
sp|Q8W3P8|AOG_PHAAN1.8e-8244.57Abscisate beta-glucosyltransferase OS=Phaseolus angularis OX=3914 GN=AOG PE=1 SV... [more]
Match NameE-valueIdentityDescription
tr|A0A1S3C6V9|A0A1S3C6V9_CUCME8.1e-12086.59soyasapogenol B glucuronide galactosyltransferase-like OS=Cucumis melo OX=3656 G... [more]
tr|A0A2N9FF75|A0A2N9FF75_FAGSY6.9e-11152.29Glycosyltransferase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS13306 PE=3 SV=1[more]
tr|A0A2N9IJV5|A0A2N9IJV5_FAGSY2.9e-10952.60Glycosyltransferase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS52113 PE=3 SV=1[more]
tr|A0A2N9I9F6|A0A2N9I9F6_FAGSY1.1e-10551.07Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS48797 PE=4 SV=1[more]
tr|U5GL99|U5GL99_POPTR9.9e-10248.94Glycosyltransferase OS=Populus trichocarpa OX=3694 GN=POPTR_002G123700v3 PE=3 SV... [more]
The following terms have been associated with this gene:
Vocabulary: Biological Process
TermDefinition
GO:0008152metabolic process
Vocabulary: Molecular Function
TermDefinition
GO:0016758transferase activity, transferring hexosyl groups
Vocabulary: INTERPRO
TermDefinition
IPR035595UDP_glycos_trans_CS
IPR002213UDP_glucos_trans
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0016758 transferase activity, transferring hexosyl groups
molecular_function GO:0080043 quercetin 3-O-glucosyltransferase activity
molecular_function GO:0080044 quercetin 7-O-glucosyltransferase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_1G021870.1CsaV3_1G021870.1mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3DG3DSA:3.40.50.2000coord: 213..394
e-value: 1.4E-97
score: 329.6
NoneNo IPR availableGENE3DG3DSA:3.40.50.2000coord: 33..212
e-value: 1.4E-97
score: 329.6
NoneNo IPR availablePANTHERPTHR11926GLUCOSYL/GLUCURONOSYL TRANSFERASEScoord: 22..396
NoneNo IPR availablePANTHERPTHR11926:SF384SUBFAMILY NOT NAMEDcoord: 22..396
NoneNo IPR availableSUPERFAMILYSSF53756UDP-Glycosyltransferase/glycogen phosphorylasecoord: 49..394
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePFAMPF00201UDPGTcoord: 58..359
e-value: 4.0E-22
score: 78.6
IPR035595UDP-glycosyltransferase family, conserved sitePROSITEPS00375UDPGTcoord: 310..353

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
CsaV3_1G021870CsGy1G016230Cucumber (Gy14) v2cgybcucB001
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
CsaV3_1G021870Cucurbita maxima (Rimu)cmacucB0189
CsaV3_1G021870Cucurbita moschata (Rifu)cmocucB0173