Cucsa.356970 (gene) Cucumber (Gy14) v1

NameCucsa.356970
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v1)
DescriptionSterol 3-beta-glucosyltransferase
Locationscaffold03577 : 1917980 .. 1958920 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTCTCATATATCTCTCTTTTATTTTGGAAAAAAAAATGCTTCACGTATCTCTCTCTTTCGCTCTTCTGTTTCTCACTCTAGTAACCAAATTGTAGTCATCCCATTTTGCAAATCTTCAATTTCCTTGTTTTGAATTTCAAAGTACAACTTTAAACGTTTCAGTTTCATTCTCTTCCTATACGAACCTCTCTCAAATTCCATGGCCCACTTGGAGATCCACCATTCTCCCCATTCTTCTTCCGGTCGTTCTAGTGATTTCTCTGTATCCATGGACTCCGATGGCGATGGCGACAGCGACGGCGATCGGGTTGTCCCTTCTTCTGGTTTGTTTTTATTGTCCTTTTCTTATTTTAGGAGGTTGACGTCTGTTGTTTTTTTAATTGGCTATTTGCTATTTTTCTATGTAAGTTGAGAATTGGATGTTTTTTATGTCCTATGTGATGGATTAAATTATCGTATATAAGAAAGTTTTTAATACTTAGTTTTAATTTTTCAGGATTTAGGATTTTGTAATTCGTCGATTTACTTAGGATATAGTTCTACTCCATGCCAGAATCATCCCAACATCTCTTGATCAAGAATTTCTTGTTAAGGCCCAATTGTAAAATCTTGGAAATAGAGTTTCTAGAGATTCATTGAATACCCAAGTTCCCTCTTGAATTTTTAATTCAATCACCACTCCCTACCTGAAAATCTGAGAATGACATCTCCTGTGGTGATAGCGGTTGGTGATAATGAACCTCCTAAATTTCTCCTTACCGTACTTGTTCCTATCTCATGCCTATTTCTGACGAGAGTTTTGTTCTTCTAAGGAAATGTCCAATTGATTTGTACGAAAGGTTATTAAAGTTTTTAGGAAGGTGATGATGACACCACAATTCTCATTTAATTAAGTCCATGTTTGTTTTCTTGAAAACGTAATTCATCACTGGTAATTTGAAAAGTGGATTCCATTGAATTATGTTTGTTGTTGTTTACTTGTGTTGTGTAACGATAATAAAATGTAAATTCCGAAACCAAATCAGTAACCAATAAAAGCAATCCGATTGCACAATTGAGGGTGATCAATTCGATTGCATTTAAATATTATGTGAGATCCATTAGCCGATACTTTTATCGTTACTGCTACGATAGGAGAAATATGGATTTGTCAGAATTGCTGATACCATTAAAGATACTGGTACCAATTGACCCTAACCAGTAAAGATAACAAAAGTCAATAACGGTAATGGTAAGAAAATCTTATTACTTTTGCAAAATGCCTAAGTAAACATGATTAAAAGCTATCTAATTACATTTGTGATTGAGACCTAGTACGATTGACGATTGATGATTGATTCATCAATGGAAGCTCATTACAATTACACTAATTTTTATTACGGACTAGTAAACATGACCTAAATGTGAGGCTGTATCAAGGCTGTTGGATCTTGCGGGTTTGTGAATAGGTCAATTGAAATTGTTCAATGGGGACTTGCTGTTTTTCCCTTTGGCATAGAGTTATTATGAGGGTAAATAAGGTCCTAAGCCTTTTGAGGAGGCCTTGAGTTTTAGGTCCAAGGACACATCCAGAAATCTTGGGGAAACTACCTCTGTTGGTCCTCCTTTCTCTCTTAGTTTTAAACTGCTGCGTGACTGTGGAGAACTCAAATAGCTATTTTTGGGATAATTGTTCGTTATGCAGGAGTTCTTTTTGTACTATGCTCCCTTGCTCGGACTATTCATCTATGAGATGTGCTGAGGTTTATGTCTTACCTTCTTTAGGTAATTTTTCCTCATTCTTTGTTGGATTCTGTGGGCCTTTGTTGAATAGGGATAGGAGAAGAGATTTTTTATAGCACACATCTCAGAACCCACCCACTCTTAGTTCTTAATTTGGAGATTTCGACCCTTCCATCCTCATCTTCTTCTTTCTTCTAGAAATAATCTTTGTCAGAAAACCGATTCATTTCTTTTACCGATTCAATGAACCACCTAAGTTGAAAGATTGATAATGACATAATAATATTAGCTTCAACATCTTCTATGTGAAAAGCTTCATCTTCATGCCAAATGCTGAAAAATAAATTGTCGGTTTTGCACTTCTATCCCTATAATTTACAAATGATTGCTAGGTGGGTAAAAATTTTTCTAGAAAAATGAAAAGGATAGCCTGGATAAATCTTGACGTATAGTAGTGTACCGGAGAGGAGAAGGGGCCGGTGATAATGTTGAAGGGAAGAAGGAGGAGAGAGAAAACAGGAGAAAGGAGAGAGAGAAAGGGAGAGAGAACTTTCTTATTTTTTAGTGGATCTCTGTTATTAAAAAATTCTGTCACATAAAAAAATCTTTCTTATTTTCTGTTAAAAAAAATCTGCCAAATAACCCATTCCAGAAGCATCTTCCACCAATTGCACTAGCTGTATACTGGTCAGATGGAAGCACCTATAAAATAAATGTGGAACTTAGAATTTTGTGTAAAGCTATGTGTTTTCCTATATCTGAATATGGTTTCAGGAAATACTGACAGGAATTCATCTGGAGATAGTACTCAAGATGGATCATCAGTAGGTAGGGAATTAGTTTCTTGTTCAACTAAGCCTACCAAGTTGAGGAAGTCAAGACAGAGTCATGCATTGCATCATTTACTGCCAAATATTTTTGATGAAAAAGTTTCTTCAAGGAAGAAGGTATGTCATATTTTCTTTTGAGTTGGCTTCAATTTTATTATTCTAGATTTCAGAAGGCACTACTGAATGTTTTGTCAGGAAATGTACGTCTTTTTTACAAATCAACTAAAAGCTCGTGACCTCACTTAATTGTGATGCAACAACTAGATGATTATATTAAACAAAGCATTTTGAGAAATATGCTAAATGATATTCACAACGATTTAGACAGTAGTTTCTGTATAATGTTTACAAATCAGGGAGAGAGGAATAATTTTCAGATTCTTCAAGAGATTTGAGCCTCTCCTTTACACTCGGTATTTTACGGACTCTTTCCCATTCGATCGTGTATGGACTTTCTCTTACTATCCAACATGATGAATCTGCACATCCTCTATATGTTTTTCATCTCCCATACTGATATTCTCTTGACAATTAGTAACTTATCAAGCTAATTCTCTTGCATCATCCTTCTTGCACATATGAACTTAACTGGAGGTAGGTTAACTCTACCAATTCCATCTTTGATTTTCACCTTATACTTTCAAGTTCTTTGAACTTTACAATAGTATAAAGTCTTGTCATCGTCACAGGTTATATAAAAGTTCCCTTCGACGTGTGTAAAATCTTCTCTGCAGTCTGCTCTGATTATAATTTTATGATTTCCAAAGTTCTTAGTCAAAAGTGTGGATGCAGCTCAGGTGGTTGAAAAGGGTTGCCACAGTCAAGCATGATGGAACTGTCCAAATGGAAGTTCTCGAAGGGATTCAACCAGAAAATTTACATTTTGAAACTGGTGTAGATGATGAAGCTGTAGATGATGAACCCCTAGACACAGCCAATGTTCCATTCATCCCCCCATTACAAATTGTGATGCTCATAGTTGGCACAAGAGGAGACGTTCAACCTTTTGTTTCCATTGGGAAGCGTTTACAGGTTCTATTTACTTTTGCTGTTCCTTTTATCATTTCCTTATTGATTTTTCTTTTGATGATCTTTCTTTCTATTTTCTTTTGTATTATTTTTTTTATTGTTGGTTATTAAACTTCTCATAGTTTTTTGGATAAGATTTTTATCACCATTTTTCTGGGGTTGGTTTGTTGGATGCTCTTGTACTCTCATTTTTTTTCCTCAATGAAAGTTTGTTTCTTCATTAAAAAGAATAGTCAAAATAAATAATTTTATCACCATTGGCAGGCAACGTGTGCTTTTATTCAGATATATTCTTGGATGAGATATACTGTGCATAAACTTGTGGAGCTATTCCTTATAACATCGAATTGTTGTGAAGTTCCTGATAACTCATTACATGCCACACCTTAGCATTTTAACTTTCAAGTAATAGAAAAAAATATCATTGCTGGAAGATTTTAGGATATTTTCTTGAAACTAAAGTTGAACCATTTATACAAATGCTAGAATATTTTGAGAAAGAGAGAATATCATTGTGGAAAAGAAAATGACACAGTCGTACTGGCACATCACTAAACTACTATTTTGTTTATTTTTTTTAGGGAGAAAATAGTGTGATTAGAGAGTTCATTCCCACATATTTCTAAATAACAAGAACTAGAGGTCGGTTATGGTTTATAGAACAAAATAGAAGCAAATTCTAACTACAGTGATCAAAATTGTATTTTTTTCCCAAATGATTTATAAAAATTCATTTTAGAATATAACATAAAAGAAGATTATTGGAATAACAATGATAGATAATTAAATCCATAATTGGATGCAACACTTTTAGTTCTTGAGATTTGAGATTGATGTTTATTTGGTCTTTGAAATATCAAAAACTACACTTCAGTATTCGAAGTTTGAAAAAATGATTATATATTTTTTAAAACGGAGATGACGACAACAACAAAAAAAAAAATAATGAGACTAACAATCAAAGTACAAGAGTATTATACTAAGAGAAAAAAGAGGAAAAGACAAGAACATTCAAACAAACTAAACAAATAAGAACTCGGGCAATTCAAAGGGTAAAATGACCAACAAATAATCAAAAGGACAAGCTAATAAAGTGAAATCCAACAGAAAAATTAATCTAAGTAGAACCAAAACAAAGAGAAACTGGATACAAAGCATAACAAAATATCTCTCTCATAAGGAAGCCATTCGAATCTAGAGAGTTGCAAGATGATGCAACTGGCAACCACGAATCTATCAGAAGAGGCTTGAAAAATCAGAAGAGGCTTGAAAAATCAGAAGTAGTCCAAAAATCGCATAAAAGATTCCTTCTTGGACTCTTGAATCATGACTACATCCGGGCTACAATTTTTGATGAATTTTTGAGGGCGACATGTTTGGAGGAATCATTCAAACCCCTAGTGTTCCAGGAGACAATCTTCATGATAGCTGGGAGAAAAAATGGGATGGCTGCACTTCTAAATTAAGCCAAAATAATTCCGCAATCATTGACTATAGACAACAAATGTTTTGGCAGATCCAGGGGAAATCTATTTGTGGGGAAGCTCTCATCATTAAACAAAACATTAAGGTTCATCCCTTCAATTTCTGAGTCTAATTTAAGGTCAACCTTGCTGGTGGTTTCGGCTGAATTTTCACTACTAACAGTGATAGGAGAGGAAATTTCAAGCTGGTTCTTGAGAGAAATATCATCTTTGCAGGGTGAACCTCCCAGAAACGATGCCTTTGAGTTAGAGATGGAGGATTTAGAAAAATTGACACTTTGATGAAGAAACAACGTTTACTAACCTAATGAAGCATTCAGGTACCTGAACTTTGGAGCTGTACACTTCCATAAGATCCGGGTTATTGGCAGAAACAAAGTGTTTGTGTAAGGCCTCCCATCATTCATACATACGTGAGAAAGGGTAAAAAGGGGAATGCAATTATTATTAATGAGCAAGGAAAAGAGCTGGAGTAAAAAAGTGAAAAATCCGTATAGAGTAGGGCCCACGGGATAGAATTTTGGGGAAGTCTTTAAATAGGGAAAGAGAGGGAGGGGGTAGCTAGGTTTTCTTTTATGAAAGTTATGGGGCTAGGCTGTTGTAGCCGCCAGGAGTCTCTTTTTCTGTAAAAGGGAGGTTGGTATGTTCTTTGTGTTTTTCCTTTATCTTTATTAGCTTTATATCGTGATCTGTGAACATTTTGACATTTATATATAAATAATACTCCAGAGTACCGTCTCTGTTTGCCATTGGGTGTTTGTTATTTGATATTTACGGTTGGTAAGGGTGTCTTATTTGATATTTTGAGGTTGGTAACCTTACAAGTGGTATCAAAGCTCGAAGAAACTGGGGGTGGTTACACGTCTCATGGTGCAAAGGCAGATTGAGGAAAGAGTGGAAGGGACTGAGAAAGAAATATTGGGAATGAAAGAAATGTTACTAGAAATAAAAAAAACGATGGACAGAATGGCGGAGGAACTGAGGGAGAATAATAGTTATAAAAGAAAGGATGAGTCAGGAACATCGGATGGATCAATCATGATATTGAAAGGAAAAATAGAGGAAACTGAAGTCACCACAGAGATGAACGGGAACAATACAGACCGTAGTAAGTATAAAAAGCTGGAAATGCCCATGTTTTTGGGCGAAAATCCAGAATCCTGGGTGTATAGGGCTGAACACTTCTTTGAGATCAACAATCTGCCAGAAACTGAGAAAGTGAAAGTGGCTGTGGTAAGCTTTGGGTAGGACGAAGTTGATTTGTATCGATGGACACATAATAAAAAGAGAGTAGAGTCATGGGAGGATTTGAAAGGAAGGATGTTCGAATATTTTAAAGATTCAGGACAGAAAAGCTTGGTGGCTCGTCTGATTCTAATACAACAAGAAGGGTCCTATAATGATTACGTAAAGAATTTTGTCACTTATTTAGCACCTCTACCACATATGGCAGAGAGTGTTTTGCGGGATGCATTTTTAACAGCGTTAGAACACGTACTCTAAGATGAAGTAGTTAGTAGATATCCCCAAACATTGGAGGAATGCATGAGAGAGGCTCAACTAGTTAATGACCGAAACTTAGCCTTAAAATTATCTAAGGCAGATTGGGGGGGGGGGGGGTGGGGGGGGTAGGGAACGAAAAACGAAAAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTGGGGAGAACAAAAGAAAAGAGAAGGACAGACCAACAAGGGTCAAGAAGGAGCAGAAAAGGGAGTGTTAAAGAAAACGGATTTCCCATTGAAGCAGGTATCCATACCAATAAAGGGAAATTACCAAAGGAATGAACCTCCAGTGAAACGGCTGTCTGATGCCAAATTTAGAGCCAGACTCGATAAATGCCTATGTTTTAAATGTAATGAGAGATACTCACCTGGGCATAAATGCAAAATGAAAAAAAAGAGGGAATTGATGTTATTCATCATGAATGAAGAGGAGAGTGTCGAAGAGGAGAACCAAAGGGAGGAGAATTCAGAAGAGGTGGTGGAGTTGAAACAGCTAGACCTCACAGAAGAAACAAAGATAGAATTGAGAGTAGTCACCAGGTTGACAAATAGAGGAATGATGAAGCTCAAGGGAGAGATAAGAGGGAAGGAAGTGGTAGTACTTATCGACAGTGGAGCCACCCACAATTTTTTACACATTAAAATAGTAGAGGAGCAACAAATAACCGTCGAGGACGGAACACCCTTTGGAGTAACGATTGGTAATGGCACGAAGTGCAGAGAAATAGGAGTATGTAGAAAAGTGGAGATGAAATTGAAGGGGCTCACTGTTGTGACCAACTTCTTAGCCATTGATTTGGGCAGTGTGGATGTCGTTTTAGGGATGCAGTGGCTAGGTACCATTAAAACTATGAAAATCAATTGGCCTTCCTTGATTATGAAGTTTTGGGTTGGAACAAGGCAACTTACTCTTAAGGGGGATCCTTCCCTGGTGAGATCAGAATGTTCCTTTAGAACGATTGAGAAGACATGGGAAAAGGAAGATCAATGTTTCCTTTTAGAACTACAAAACTATGATACAGAAATGAATGAAGACATGGGGGAAAATCAAGAAATTAAGGGAGACGAAGAAGATACTCCGATGTTAAGGTTTCTGTTACAACAATATGTGGATATTTTCGAAGATCCAAAAGGGTTACCTCCTAAAAGAGAAGTTGATCATTGAATCTTGATGATGCCGAGCAGAAACCAATTAACGTGAGACCTTGCAAGAATGGACACGTTCAAAAGGAAGAGATTGAGAAGTTAGTAGAGGAGATGCTCCAAGCTGGGGTAATAAGGCCGAGTCACAACCCCGATTCTAGCCCCATTTTATTGGTGAAGAAGAAGGATGGAGGGTGGAGATTTTGTGTAGACTACCGAAAGCTCAACCAAGCCACTACAGCTGACAAATTCCCCATTCCTGTAATAGAAGAATTACTAGATGAACTGCACGAGGCTACAGTGTTCTCAAAGTTAGATTTGAAGTCCGGTTATCACCAAATAAGGATGAAGGAAGAAGACATAGAGAAGACAGCCTTCTGGACTCATGAAGGCCATTATGAATTCTTGGTCATGCCGTTTGGCCTCACCAACGCTCCTGCAACCTTCCAATCATTAGTGAACCAGGTATTTAAACCGTTCTTAAGACGCTGTGTTTTGGTTTTTTTTATGACATTCTGGTGTATAGCTTGGATATCATCGAACATGAGAAACACTTAGGCATGGTGTTCGCTATATTGAGGGATCATCAACCGTTTGCCAATAAAAGGAAGTGCGTTATAGCTCATTCCCAAATCCAGTACTTGGGCCATTTAATTTCCAGAAGAGGGGTAGAAGCTGATGAAGAAAAAATACAAGATTTGGTTAATTGGCCACAACCAAGGAATGTCACTGGATTGAGGGGATTCTTGGGGTTAACCGGCTATTATAGAAGGTTTGTCAAAGGCTATGGCGAAATCGCAGCTCCTCTCACTAAGTTGCTGCAGAAGAATTCTTTTTTATGGAATGAAGAAGCCACAATTGCTTTTAATAAGCTGAAGATAGCAAGGACAACGATACCCGTTTTAGCACTTCCTGAATGGAACCTGCCATTCATTTTGGAAACAGATGCATCAGGAATAGGATTGGGAGCTGTGTTGTCTCAGAATGGTCACCCTATCGCGTTCTTTAGCCAAAAACTATCTCCCAAGGCACAAGCTAAGTCAGTATATGAGCGAGAGTTAATGGCCGTGGCGCTCTCTGTGCAGAAATGGAGGCATTATTTATTGGGAAAGAAGTTCACCATCATATCCGACCAGAAGGCTTTAAAGTTCCTTTTAGAACAGAGAGAGGTTCAACCTCAATTCCAAAAATGGCTAACTAAGCTATTAGGATATGATTTTGAGATTCTTTATCAACTCGGATCTAAGAACAAGGCTGCTGACGCACTGTCTCGAGTGGAACAACCACTGGAAATTAACATTATGACTACTACGGGTATTGTGAACATGGAGATAGTCAATGAAGAAGTGCAGCAAGATGATGAACTTAAGAGGATTATAGAGGGGCTAAAACAAAGGGAAGATGAGACAAGCAAACATCGTTAGGAGAATGACAAACTGTGGTATAAAAACCGAATAGTGTTATCGAAACAGTCTTCATTGATACCGAATCTGTTGCACACATTTCATAACTCTGTTCTAGGAAGCCATTCCGGATTTCTAAGAACATATAAGAGGATGAGAGGGAAATTACATTGGAAAGGAATGAAAACCGATGTCAAGAGATATGTGGTGGAGCAATGTGAGATTTGCCAGAGAAATAAGTACGAGGCAACTAAACCAGCAGGGGTTCTACAACCCATTCCAATTTCGGAGAAAATCCTTGAAGAATGATCCATGGATTTCATTGAGGAGCTGCCCAAAGCAGGAGGAATGAATGTAATCATGGTAGTAGTTGATAGACTCAGCAAGTACTTTTACTTCATTACTTTGAAACATCCATTCACAGCCAAAAGAGTAGCAGAAGTGTTCATTGATTGAATCATCAGTAAACCTGGCATTCCTAAATCGATTATTTCCGACAGAGATAAGATCTTCATCAGCCATTTTTGGAAAGAATTATTCGCCACTATGGGGACAGTATTGAAGAGGAGTACTGCTTTTCACCCCCAAACCGATGGGCAGACCGAAAGAGTCAACCAGTGCTTACAAACATATCCGAGATGTTTCTGAAATGACAACCCCATAGGTGGGATAAGTTCATTCCGTGGGCAGAATTGTGGTATAACACCACCTTCCATGTTTCTACGAGAAGCAATACATTTGAGATAGTGTATGGAAGACACCCACCTCCTCTAATTACCTATGGTAGTAAGAAGACACCGAACGATGAGGTGGAAACCATGTTGAAAGAAAGAGACTTAGCACTGGATGCGTTGAAGGAGAATTTGAGTGTAGCCCAAAATAGAATGAAGAAAATGGGGGATACAAAGAGAAGAGAACCCAACAAAGACGACTTTTTGAATTCCAGTGCAACTAAGGCAGTTTTGACTGCGTTCTTGCTTCTTTCGTCTTCTCCGGCGTCGATGAGGGATCCTGAAGGTGTTCCAGAATAGCAAATGGATTTCTTGCTATGTGAAGATGTGCGAAAGTTGATTTCGGGACATTTAAAACCGGAGAATAAAAAAGAGAGGTTTGAACCATCATCTTGCAAGGCCTCCCTAATTCTTAACAAGTCAGTGGAATTTTTAAAATCACCGACCATAATAGTACCACCTGTAGGAAAGGGAGGATTCAAACATTCGATATCACCAAAGTTTAGAAAAATGTTTCCCCTTTTAGAGTAATCTCAGTCGTGGAAGGTACAATTCTGCAAAGATTCTGCTTAGCTTGAATCCTAGCTTCACTGCAGTTGGTAAGATTAAAAGTGTCCATGGCAATACTTTCAAGTCCCCCAAATGATCCCCGATAACCTCGAATTCTCCTACTCTAGTAATCCAAAGGGAGGTTCTTTATTTCGGGCCATCCTCCATAGCCTTTCATGGTGAGCGGCCGACTGTGTTTGTGTTTATCTCATCTTTCGAATTTTAGGTGAAAAGAGCCCAACGCCTGCCATTTCCCTTCATTACATATGACATCCTTTATTGAGCCATGGTCAAAACCAATGAGAGCATTCACACCAAATAAAGGGTTAATGACTATCTTTTTTTGGAAGTAGTTTTTTAACATTTTTCAAATTTTCCTCCAATCGTGGAAGGCAACGAGCTTACTAACAATCTAGAGGTTTTCAAAATTGGTTGGGATAACGTCAAAATTCCTAATTACTTGTTGGCTTCTGCTTATCAGTTTTCAAATCTGAGGAAGGACAAGGGGAAATCTCTGCTATTTTTATATCTGCCAGCGCTACAAGACTACTGCAATCTCCTGAAGCTTTCTTCGTGACAGATGCAATGCACTTTTTTGTTCGACTTTATCTGCATAACTCAAGTTGCTTTCCATAACAGTGGGAATGTGCTTCAAGTGGTTTCTAGAAAACCATTTGACATATTCCTCCCTAGAACCGAAATTGTCCAGCATATTTAGAAAAGATCTCCATCCCTGTTTGTTTTCCCCAGAGCACACCCCTGTACTAGAATGCCCTGCTGATGAAGGCCAAAAGTCACAGCTTAAAACCCATCCAGTCTTTGATTTAAATTTTGAAACTTTCAATCCACCTCTATCCAGTTCACCCAGAACTTTTCCTCTGGCCTTCTTAATAGTTCTGAGACAGCCTTCAACGGTCATTCTAGATGAAATCCGGAGACCGACAAGATTCCACAAACTTCCACATCTTCAACATGAATCTGATTTCCCTCTTTCCAGGTACAGTAATGAGACTCAAGGGCTTTGCAGCTGACTACCTCCATATTTGGATGAAAACGCCGGCAGTCGGAAAGAAAGAGGAGGGAGGAGGGAGGAGGGAGGAGGGAGAGAACTTACCTATAGCACTATTAGATTGGAAGGGAGAGCTTCATCACCATTTTCTCTTGAAAAACCTATAGCCTCTCTCTGTCTAAATTCTCTTCCTTTTGGGTACAGGAGCAGCAACTATAATACTCTTTCCTTTTTCATTTTTCAACACCAACAACAACGACTAAAAATAGCCATGGTGAATGAACTCAGCACCTCCAGATCCCACTTCTCTTGTCCAGGGGTACCTTGTATCAGATGCAAAGCGGTGAGGTTTAGGCTACATGCTGGGAGCAACTATAGGTTTCTCAGCTGACTACGATGAATTTTCCCTTGGCAATTTCTTCATGGGGGTGTAGCACCCTTTACAAGTAGATCTATGATGCCCATACTGTGTCAACTCTTGATTTCTTGCTTCATTTCAAATCTAAAAGTTGCACAGTGATAAGGCCACTCGAGAAGGAGAAGGAATATAGATAAAGGGAAGAGAAGAAATGAAAAGGAAATGCTTCATCACATTTCCAATCATCAAAAAGTTAATTTGTAGTTGATATACTCTGCCAAATCTTCAAGCCAAACAAACAAGGAACTGCCAGGTTATCTTTTTGTTAGTCAATGAATAGTCAATATGATCTTAGAAAATGATGCAAACAGTAACATACTAATATCTATTAATTATAGCTTTAATAGTTTGTTATTATATATTTATAACATAATCTTTCAGCTTCTAAAATATATTATCAGACCTAAAAAATATATGATCCTGATTTGTTCTGCTGGCAGTGCTCATCATCATTTATGAAACTAGAAATTTTGCTTGCCATAGTATTAACATTTAGAATAGTGTAGGGTTTATGAAATAAATTAATATATTATTTAGTGACATGCTAGAATATGAGTGATAGGAATACGTACTGCTGTAGTATTTGGAAAGTTGGCCATGTATTTTTTTTCTCTTTTTATTGTTGCAACCTAGCAATTGGTTTTCTAAATCAAATCTCTTGGCTTTCCTAATAAAATATTTTGACTTATTGAATAACATTCGACATGAGAGGACGAAATAATTTGTAATCTTGATGTATTATTATTATTGTTGGGACTTGCTTTGACTCTTATTTTACAAAATTATGCATTAAAATACAGGCTCTCTTCAGTAAACTTTGATAGATCAAGTTAGTTTTAAGTTGACAGAGATGATGCATTACTTTCAGGAGCATGGCCATAGAGTTAGATTGGCAACTCATGCAAATTTTAAAGATTTTGTACTCTCTACTGGACTGGAGTTCTTTCCTTTAGGAGGAGATGCTAAAGTTCTTGCCGACTGTAAGTGTTCTGATTATTTTCTATTTTTTTTTGCATCACATTCACAAACTAAGGTAATAAACACATCAATAGTAGTCAAGAACGAGTGTTAAAACTATTGTGTGAGGGATATGCCCATTTGTTTTTTAAAATATGCTGAGATAGATAATTTTGGAGCATTTTTATTTGAGTAAATAAAAAAAAGTGAGCTAGAACTAAAATAAATTATTGAACTTGGAGAGATTAAGAGAAATTTTAATAATTCATATATGCAACATAGGACCTCTTATAAAGAAGAGACTCATTACAAATAAGGAAAGAGGAAAACAGCACAAAAGAAAAATGTTTTTTGAGAAGGATACGAGTCTCACTATTATTAATATAAATAAAGAGACAAAGCTCAATGTACATGAGGGTTATATAAAGAAGAGTCTCATTACAAATAAGGAAAGAGGAAAACAGCACAAAAGGAAAATGTTTTTTGAGAAGGATACGAGTCTCACTATTATTAATATAAATAAAGAGACAAAGCTCAATGTTCATGAGGGTTATACAAAGAGCAATAGGGGGAGGGAGGATCAACAGGCGCATCTGGGCATCTCAACTAGGTTGACATCCCCTTAGCGCCCACAACACATCCAAAAAGCGACAAAATAAGATCAAACAAAAAGCCACAAAATATTGCTAGCTTTGCCCCCCGATGTCTATACATAAATATCGGGGTGGGGGCTACATTCAACAACTGAAGATCCAAACTAAAATAAGTCCATAATTGACGACTGTTGACCTTGACTTTAGATGATCTGATATTTCACAAGACTGGGGAAGAACGCATGTAGCAGAGAGCAGTTTTTCAACTTTCGGTTTTTCCTTGCATTGGAACAGGTCAATAAGATCTGTTTCTAAAGACTCAAAATGTCTTTGCCTTTCAAGTTTGGCTGCTTATCAAAATGCTCCATTTCTCACTACTAACGCTGAAAGGAGAGTAAAAGATAGACTCACCCTTCTTTCCAGAGGGATTGAGAGGGGTTTTGCTTGGAGAGCGCCTAAAAAATTTCACCTTCGAATTGGACAAATAAAACTCAGAGTACTTTAACTGCGATGGATGGGAAGAATGAGATTTTGACTTAGTAGAATTGTCTGGAGCTTGTATTTGGGTGCAACTATTTCCAACAAGTCTAGATTAGCCTCTAAAAACATCTTGGAACTGCTGAAATGACTGAGTTTTGAGAACAGATGCGTTTTTCCTTTAACAAGTCAAGAACTTCATCTTCAATTATATGGGCCCTTAAGGATCGTTTAACCGAAATTTTCTTTGTTTTTGAGACAATGTGGGAAGACCTTTGAAGAAATTTTCTTTATTTTTGAGACAATGTGGGAAGACCTTTGAAGACTTCTTTCTTTTCAATTTTTATCTTGACCTTAATCCTGTGGAGGACTAGAGAGTAGAAGAGGAGAAAGAACCTGAAGATGAAGAGTTAATGGAGGCTTGATCAAAGGAAACTCTGAAGGGCTTTGGTGCACCCAGTGTGAAGGGAACTAAATGCTTTGATGGCACAATGGACAAGCTGTCGAATTTTTGAAGAAAGTCTAAACGTCTAACATATGACATTTTGCCATTTGGCCTTCCTCTATTGAAGTATCAATTCAATTAATTTGGGAGGAATTTAAATGTTTGTTTAAAGGCAGTGTGACTCTTGGATTCTCATTGCGCAAAGCAAATGTCATGGCCACAATTTTCTCTCATTTGTGAAAGCAACTCTTCGTCTTCTTCTTTCATGTTCCCTCCAAATTTGAATCTCTTTCAATTTTAATGCGACAGGAGGAGGAATCTAGAGGGATGAGGAACATATTTGGTATTGAAGTTAGCATAAGGATTTCTTCAGAAGATTGCCTTTTTTTTTTTTTGGTGACAAACTTCTTTATTAATAATATAAACTCAAAGTACAAGAGATATATATTGAGTGTAATAAAGAAGCTTAAAGAGCAAAAAGCAGCGGGATCAAAAGGTGCACCTGGACATCTCAACTAGATTGACACCCCCTTAGCACCAAAACATCATATCCCAATTAAAACACTAAGCAAAATGAATGCTGTACTAGATAATAAACAATTGTGTCTAGAATAAATACATAGGGCTGAGATAGAGAGAATGCGAAAAATAGAAGCAATATAGAAACAACACCAGTGACGAAAAAACTGAGCCTAACAAAGTAACTACTGAGAAATCCAAAAGCAAAATAGAAAACTAAGCAAGCAATCACCGTTCTGAGCCATCTATTTCAGAATGTAGTCCAGACTTTTTGGAGGTTTACAACCCCAGAATTCAGACCTCTAATTGCCTCATCAGGTGAGTTTCTCCTACCAAGAAATTCAATTATTCGAAACTTCCCTTGATAGGGTGGATTGGAACNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTCCAAAGTATTCAGTCTTCATCAATGGTAAGCCAAGAGGCCGAATAACAGCATCTAGAGGTATCCGCCAAGGAGACCCCCTCTCCCCCCTTTTATTCCTTCTAGTAAGTGAGATTTTAGAAGAAATCATAAACAAGCTTCATATTAACAGCCAGTTTGAAGGTTTTCTTGTTGGCAAAGACTTTTTGGCCATCAAAGTAATGAGAAGCAAGAAGGGTCTTTTTATGTAACAGAGAAGTATGTGTTTGATTTGCTATCTGAGACAAGAAACTTGGGAGCAAAGCTATGTAGTACTTCGATGATGCTTGGTCTACAACTGTTAAAAGGTGGAGAACCACTTTGAGGTTTCTGAGAGATATAGAAGACTGGTTAGATGGTAACTACCTTAGAGTAACACTTCCAGACATAGCTTATCCGGTAAGCATTATGAGTCAGTTTATGACCTCAGCCATAGTGGAACATTGGGACGCAGTACATCAAATTCTATGTTATCTTAAGGCTGCATTTTGGATGTTGATGCATTTATATGCTTAAAAGATTTACTTTGATATATTCCACACTCACAAGTTTTTATTATTGTCATACCTGTCTTTTTCGTCTACTCAACAAGATTATGCATGAACAAGTGTTTGATAGGTGTCCACCACGTGTTGAAGTTTCCGACACGTGTCAAACATGGATACGCTAGTGAAACTAAAGCGTTTGTGCTTTTTAGTCATCAGGAAGGTAAAAAATTATAATGGACCCTGAACCTAAATGAACAAACCGTAAATTGTTAGTACCTCAACAACCTAATGACATAAAAAAAAATCGGTTTAAATTTTTTTTATAGAGCAGTAAATGGTACTATGATCTTTTAGAATTAGTGAAGTAAGCATCCGAAATTGAATAATCCAAGATGCCATCAATACTGCTGATCGTTTGTAGTGTCGTTTGCCTTATATCTGCCCTTCTCCTTCTTGGTATCTCATGTGTCGTCAACATGCTAAATCTTCTATTCATTTGTTTTTGCATTGCCTTTTTGTGGCCAATTTTGGACTACTGTTCTTCCTTCTTTTGGATTGTCCTTTACTTGCCCTAATTCTGTCTTTGATGCTTTTGCTTGTTTATTAGTGGGTCATTTTGTTGGTGCAACTAGGAAGATGGTTTGTTTAGCTATTCTGCGGGCTTTCTTATTGTGTTTATGGAGTGAATGAAACAGACGGCTCTTTAGTGATTCCTTTTCTTCTTTTAACCGTTTTATGGAATTATTTTTATCTATGGCATTTTCCTGTTGCAGAACAAAACACCCTTTTAGACATTACAACTTATTGTTTTTAGTCAGTAATTGGTGCTCTTTGATATAATCACCTTTTGGTGTTTGGGGTTCTCTCCTTTCATTGTTAATGAATTGTTTCTTTTTCACCTATAAAGAAGAATCCCAGATGCCACTTTTCACCAAAAAGCCACAAAATTAATGAACATAATCTGTATAACACTTACCTTTTGCTGACTCTTTGAAAAAGCTAGCTTTTATACTAACTCTCTTCTGGATCTTCAGAGCATATTCATGATTGTATGTTTTGCTTATTTTCTACAGATATGGTAAAGAATAAAGGATTCCTTCCATCAGGACCTTCGGAGATACATGCCCAACGAAATCATTTGAAGGATATTATTTTCTCATTACTTCCTGCGTGCCAGGATGATGATCCAGAATCAAAAATTCCATTCAAGGCAGATGCGATAATTGCTAATCCCCCTGCATATGGTTAGAGTCAAATTTCTAATTTTATGTTATCTTTTCATGGAATTTTGAGTTTTTAAGTTTGCTCATAATCTTCTCTCCTTGCTTTCATTTGTACCCTACCTTTTAAGCACTGGTCATCTGCTTACAACCATTTTTCCATAGCTTTCTATTTTAGGTCATTAGAAGAAAAGTTGCTAAACTTTATGTATTCATAATTAGTTTGTGCATATGCATTGGAATGTTTATGGTCCATAAATCATTTTTGAAAAGATTGTACAGAAGATGTACAAGACTTGAAGCTTAGGAATGCTTGTCAATTAGTCAACTCAATTAAGATAAAGTACTCGCAAATATTTTTTGATTCAAGAGTCTAAGGTTGACTAAGTGGACTACATCCTCCCAAAGCTTGTCCATGGTCTATCTATTTGGAATAAGATGCACTTGTTGTGCTCTAGCCAGAGGAGGACCTAGTGTATGTTGTGCGCTAACAATACAACCATAGATTCAAAGATGGTTTTTCATTTCTGTTGGAAAGATACAACCCTAGATTTAATGATGTTGTGAGAGTGAACATGAAGTTGACCACTGCTGATTCACATTTGTGTGTGTGTGGGGGAGGGGGGGAGTAGAAAGGGAAATTCACCCACTTCGTTAGTTAGTGAAAGGATATGTATCTTTTGTCTTATTATATTTTGGGGATTCCATGCTTTTATGTTGTAATCTCTTGGTGTAACCAAGTTGGAAGAGTGCTAGTGTACTTTTGTGATCTTACAATGTATCGTCTTATTATTCCTATGAGAGGTATCTTTACATTGGTATCAGATCAGTAATTTAGTAGATCTTAGACTCAATTTTATTTCTCAACTACACGTGGTCTAGGGAAGGGGGAACTCGAACTAGTTTCTAAATCTCTTATCTATAGATTTCTGGTTTCTAGCAAGTGGGAGGAAGCATTTGAGAATTCAAGAGTAGAAAGGCAAATTTCCGAACACTTCCCTATTTTATTGGAGGCTGGTTCTTTTGAATGGGGACCAAGCCAATTTCGGTTTTGTAACTGTTGGTTGAAAGATAAGGATTGTTGTAGATTGATTGAAAGATTCTTTTGCATGAGATAGACAGCAAGGTCGGGCCAGATTTGTAATTTCCTCCAAGTTCAGAAAGATAAAAACAGATTTAAAAAAGTGGCTTGCTGATTTTGAAAAACAGAAGTGCAGGGAAGAGTTTCTCTTGAAAGACATAGAAAGAAGAGATCAAATGGCATAATTTGAAGAAGTTTCCTCCTTTGAAGAAGACTTAAGGGTCGCATTAAAAGCAGAATTGTTGGCAAGCTAACCAAGTGGAAGAAAGGAATCATATGCAGAAAAGTAAACTGAATTGGTTGGATTTGGGGGATGAAGGCACTGCTTTTTTACATCGATTTCTTGCAGCAAAGAAGAGGAAGAATTTGATATCTGAATTGATTAGTGATCAAGGTAATTCAATTAAATCTTTTCTTGAACTTGGTAACAAGGTATAAGGAGTTGTTTGACGCTGAAAGAGTTGCCTCCCTGAAGAGCAGTAGACCATTCAATCCTGATAGTAGGACAGAAACCCATCAACGTCAGATCCTACAAGTATGGTTACATTCAAAAGGAAGAAATCGAAAAATTAGTCTCTGAAATGGTGCAAGCGGGAATCATCAAACCCAGCCGGAGCACCTATTCAAGCCTCATACTATTAGTGGAAAAGAGGGATGGAGAGTGGCGATTTTGTGTAGACTACCGCAAACTGAATCAAGTGACAATTGCAGACAAGTTTCCCATCCCCGCGATCGAGGAGCTCCTTGATGAGCTAAATGGTGCGGTAGTAAACTGGATTTGTGCTTAGGGTATTACCAAATTTGGATGAGGGAGGATGGCACTGTGAAGACAGCTTTTCGCACTGGCAAAGGACATTGCAAATTCTTTTTTTATGCCCTTTGGGTTAACCAATGTGCCGACAACATTTCGGTCACTAATGAATCAAGTATTCAGACCCTTCTTGAGGCATTTTGTGCTGCTGTTTTCTGACGATATCTTAGTCTATAGCACCGACATCATTGAACACGAAAGACAGTTGGCAGTGGTGTTCAGTATTTTGAGAGACAACAAATTGTTTGCAAATGAGAAGAAATGTGTCATTGGGCATTCCCAAATTCAGTATCTGGGACATTGGATCTCTAGTAAAGGAGTAGAAGCTTCTGGAGAAAAAGTAAAAGCAATGGTGAATTGGCCGCAACCAAAAGACATATCTGAGTTGAGGGGATTTCTGCATCTTATGGGCTATTATAGGAGGTTCGTGAAAGTGTATGGGGATATTGCCACATTTCTTACCAAATTGTTGCAGAAAAATGGGTTTAGGGACGAAGGGGCTACTGAATCATTTGAACAGTTAAAGCAAGTAATGATCTCGATCTCGATCTCGGTCCCAGTCTTAGTTTTTGCAGATTTCTCTCTACCTTTCATTATTGAAACAGACGCATCTGGTGTAGGGTTGGGAGCAGTCTTGTCTCAACATGCACAGCCCATTTCTTATTTTAGTCAGAAGTTATCTCAGTGTGCTCAAGCCAAGTTTATAACAAGAGAGAACTGATGGCAGTAGTGATGGCAATGCAGAAATAGAGGCATTATTTGTTGGGGAGAAGATTTACTGTACTGTCAGATCAGAAGGCATTGAAGTTTTTGATCAAACAAAGGGAAGTACATCCCTAATTTTGGAGGTGGTTGACCAAGCTACTTGGCTATGATTTTGAGATTCTATATCAACCTGGATTGCAAAACAAAGCCGCAGATGTCTTATCTCGAATGAATCGCATGGTGGAACTAAATAACTTCGCAACCCGAGGGTTGATATGGTTTTGAGAGAAGTCGAGAACGATGATGACCTTAAAAAATCATCAGTATTATGAAAGAGGACCTTGAGGGGAAAACAAATTATCAATGGGTGGCTGACAGGTTGTTGTATAAGGCGAGGTTAGTACTGTAATAAACTTCAACATTAATTCCAGCCTAACTCCACACCTTCCTTCCATGACTCAATTTTAGGACATTATGGATTCCTCAAAATCTACAATAGGATGAAGGGGTAGATGCATTGGGTGGGCATGAAGAATGATGTGAAGCGATATGTTGAATAGTGTGAAGTGTGTTAAAGGAACAAAACGAATGTCCTCTCTTTGGTAGGACTTCTACAACCCTTTACCTCTACCAAACTTGATCCTTGACGACTGGACAATGGATTTAATAGAAGGGCTACCAAAAGTAGGGGGATTTGATGCCATCATGGTTGTTATGGACCGATTGCGCAAGATGTCCCATTTCATCACCCTTACGCACCCATTCATGGCTAAGCTAAGCAGGTTGCGGAGAAATTCATTGAAGAAGTCGTCAGCAAGCATGGGATCCCGAATTTGATCATTATTTGAGAGAGCTGTTTTTGGCCATGGTGACATCACTAAAAAGAAGTATTAAGTTCCATCCCTAACCTGACGGCCTGACCAAGAGGGTGAATTGTTGCCTTGAGACATATCTACAATGTTTTTGCAACGAGCAACCAACCAAGAGAAACAAGTGTATGACGTGAACTAAGTTGTGGTATGATACAACTTTTCACTCCTATGCGAGGATGACACTCTTTCAAGAAGTCTATGGAAGATCCCCACCATCGCTGGCTTCATAAGGAGATAGGAAAATGGCCAAGAACAGTGCATAACAATTGCTCATGGAGAGAGATCTAGTGATTAGGGGTTTGAAAGCGAATTTGGTAGTGGCTTAGAATCGAATGAAAAAACAAACTGATTTACACGGTTGAGAGCTGAAATTAAAAACCGGAGATGAAGTCTATTTGAAGTTAAGGCCATATAAACAGAGGTGCTTAGCCAAGAAGCGGAGTGAGAAGCTAGCACCAAAATTCTACGACACATACGGTATAATTGAAGAGATCGGGGAAGTGGCTTATCAGTTGGACCTCCCTTCCGAGGTGGTGATTCATAATTTGTTTCATATCTCCTAACTGGGCCAAGCTCAACATGTGCAACACCTCCCACCAACATTGATTGAGGAATTCGAATTGCAAGTCAAGCTCGGGGTTGTTTTGGGGATCCAATGGAATAGTGAGATAGAGGATAATGAATGGCTGGTTAAATGGAAAGGGTTACATGACAACAAAGCAACGTGGGAATTTGTTTACTCCATGAACCAACAATATCCTGCATCCCAACTTGTTGACAAGGTGAATGTCGATCCCAATGGTATTGTTAGGCTCCCTATTACACATACTTACTGACGCAGGAGTAAAAGGGGAAATGGTCAGGTGACAGCATAGCAGTGATGATTGGTGCGGAGAATAACTGTTGAGGGACCCACCTGGGGGAATAGTAGTATTTAAAGAATGTTTTGGGACTACGGAAGGGTAGGTTATATTTTGAGAACAGCGGCTGAAGGCTTTTTGGGAGAAGAATCAGCCTTCTCGAAGTTGCTGGGGTTACCTTTATATTTAGGGTTGAGGAGAATAACTGCTGAGGAGAATAACAGTTGGGGGACCCACCTGGGGGAATAGGAGTATTTAAAGAATGTTTTGGGATTGCGGAAGGGTAGGTTATATTTTGGGAAAAGCTGCTGAATCTGCTTTTCGGGAGAAGAATCAGTCTTCTCGAAGTTGCTGGGGTTACCTTTATATTTTTTCTTTGCTGTCTGTTTATTGTTCTTATTTTCTCTGTTCTTGAATTAAGTATTGTGTAAGACTTGCTATCTTTTCCATTCAATATGTTATAACAGAGATTCAATATATTATAACAGAGTACCGTCTCTGTTTTTGCAGCTCTGTTAGGATTTTCTGGGTTATAGGTGAATGAGTCCTAACACTTTCTTCCTTGTGAACTTGAGAAAAAAAAACATCATTAGTAGTTGGAAAAGTAGTTATCCCAAGGATTCTACCTCTAACCTTGTCAAACTCAACATTGAGGCTAGTGTGAGATTTGTAAATATGGTTATCCTTTGTAATTTTCTTGTAGTGCTTTTGATCCTCTATAGACTTCTTCCATTCACATGTATTAAAGAGGCCGAGGTCTTGCCAAATCTTTTTTAAGGAGCGAAATTATTGACCAACTGATCGTATATCACCCAATTTAAGGTTTAGCTCAAACACCTGTGACTAATTGCCTAAATCTGAATACATTTGAGACACACTATCCCATAATTTCTTAGTAGTAGAGTAACATGTGTAGTTACAACTTGTGTTTCTAACCATGGAATTTACTCACCAAGTCATAACCATGGAGTTTTGAGCATCCCAGCCAGCAAAGCACGAGTCGATGGGAGTTGTTGGGGTAGTTTTCTCTCCGAAAGTGTAGGCAATTTTTCCTTGTTCACGAGTATTCAATCAAACACTTTAGGACCAACAAAGAAACTTATCTCGATTAAGTTGGATTGTGGGTATTTGGACGAGGGGTCTATTGGATTGGATGCGATTATCAAGGGTTTTTGCATTAGATGACTTAGTTTTTGACATAACTGAAGCAATACATATCCAAAAATAATTTGAGAAAAAAACCCACAACATATGGCGACGAAGATCGGGGTTCGACAGTTTGAACTGAGACGACCGACAATGCCCACTTGCAAACAACTAGTGACAACAAAGAACTATTGACTTACAACCAAAAAACCTAAAAAAATGCGAGTCTTTGGTGGGAAGAACTTTTGGCAATATATGGTTGACAGCAGCGATCGATTGGAGAAGTGGCAACCGGCCACGATGATGCATGAGTCAGTTGTATGGATTTTAAGGTTTTTTATTTTAATGGAAATGGGCTGCTTTATTAATTATAAAGAAAAAGGCTAGCTCTCAAATACAAGAGTATTATACTAAGAGCAGGAGCTGCTTTGATACCCGATTCAAAAAATGTATTTTCTGTCTTATTTTCTTTAGAAAGGCAGGGTTTCTTAAATAGAAGAAATACTCAACAATATCATTCAATTAAGTTTAGAAGAAATACTCAACAATATCATTCAATTAAGTTTGACAAATGTACTGTCATTCAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCCCCCCCCCCCCATGTTTCCCCACTGCATTCCTGTACTCTTAACTAACGAACTTAAATTGCTTCACTGACTGTCCAGGTCATACTCAAGTTGCTGAGGCGTTGAAATTACCCCTTCACGTGTTTTTTACAATGCCATGGACGTAAGTCTTTTTTCGTGAGAAAATCTAGTGCCTAAAGCGTTTGATGTATATTGTTAAAGTGCATTTACACTGATTGGATTATTTGATCTAAGTATTGATGTAATTAAGTTTATCTTATCGTAACCCATCTACTCAACTTTTTAGGTCAATTGGTGATTTGACATAGTATTAGAGTAGGAGGTTGTTGATTATTTCCTTTCTTGTATTATATTTTACTGTATTGGTTGTATTATATTTGCCCAAATCATCTTTGTAATAGGTTCTTCTTCTATTTAAGAAAACTTCTTCTCTCCTTGTGAAATATACAAGAAATATATTATCTAGCATGGTATCAGAGAAGGTTTTTTGAAAACCATAAAAACCGTAACTTTTTGAAAAACCTAAAAAACCCTAAATCCGTTACAAGCCACTGCCGCCACGTACCTTAGCCACCTCAAGGGTTGCCGTCGTCGGACTTTTTCGATAGCCGTTTTTTCTTGCATCTCTTGCTGTAAATTGGTTGGTTTCTGTGTCTTTTTTGGTCTTCATCGGTCGCCTTTGTTTGAGATGTCTGCCGTTGTCTGCCACTGGTGACGAATTTGGTTTGTTTTTTCAGATTTGTTCTATCTGTTTTCTGAACTCTGTTGCTTCAACCATGTCAAATACTAAGTCACCTATGGCTAAAGTTGACAATTGTATCCATTCCTATAGCCCCACTATCCAAATAACCACGATCTGACTTAATGGAGACAACTTTTCTTCTTTGGTCACAAAGTGTTAGGGTATGTTATTGTGGACAAGGAAAAATTGGCTATATTACAGGAGAGAAGCTCTCCCAAAACTAGACGATCCTTTGTTTGCTATATGGGACGTTGAAAACTTTATGGTTATGACTTGGCTTGTAAACTCCATGGTTGAAGGCATTAGCTGTAATTATATGTGCTAATCTACGGCAAAAGAATTGTGGGATAGTGTGACGCAGATGTACTTTGATTTGGGCAATCAATCATAGGTATTTGAGTTGAATCTCAAATTAGGAGATATACGCCAAGGAGGTTACTCGGTGACAATACTTTCACTCTCGAAAATGGATTTGGCAACACCTGGATCTCTTTGATTCATACAAATGGAAGTCTACAGATGATCAAAAGCATTACAGGAAAACTGTAGAAGACAGTCGTATATTCAAGTTTCTGCTGGCCTCTATGTTGAATTTTTTATAAGGTTAGAGGTCGAATTCTTTAGAAAACCTCCTTCCACCTATTAATGATGTGTTTTCTGGAGTTCACAGGGAAGATGATGTAATTCAAAGTTGGATATGTTTTCCAAATGAAATCTCTGCCTTCAAAGGCTAAGAGATGATGTATTAATAATGCTTTGTAAGTTTGTTACAAAGTAGTTTACAGAAAAAACTGTTTAGGAAACAGTTAAGAATCTAACTGTTTTAACTAACTATAGTTGTTTCCATCCTCTAACTAACTATAGCTTTCTCTACCTTTATTCTTATTACATCATTCTACCCCCTAAGAATGGACTCGTCCCAGAGTACATCAATTAGCAAGCTTGAGTTGTCTGTTGCATGATACGGCTTCAAATTAGCATCATTGAAAACTGGGTGTATATGAATGATCGATGGCAGGTCGATTTTAAAGGCATTATCTCCATACTTCTCTAGTGTAGGAAATGGACCGGTCTGTCTGTCTTTTAGTTTGTTATAGGTGTCGATAGGAAATCTGTTCTTCTTTAAGTGTTCCATCACCAAGTCTCCTTTAGAAAAATCAGCTTGTCTTCTCTTTTTATCTGCTGACTTTTTGTAGGAGTTTGTAGTTTGGACAAGGTGATCATAAACTTCCTTATGCAATTTTTTAATCTTTTCTATCATGCTCTCTGCTTCTTTATTAAGGTCCACGGCTGTAGGGAGTGAAGTAAGGTCAAAAGTTAACTGTGGTAGCTTAGTGAATACTACTTCAAAGGGGCATTTTCCTATTGAAGGATTCTTCATATTGTTGAAAGCAAACACAGCTTGAGCAAGTGCTAGATCCCACTGCTTAGGTTTTGTTCCACTAAGGCAACGTAATAAATTGCCTAGGGTTCTGTTGGTTACTTCAGTTTGTCCATTTGTCTGTGGATGGGCTGTGGTGTTGAACTTCAATGTTGTGTCAAACTTCTTCCATAGGATTCGTCAAAAATGGCTTAGAAACTTAACATCTCTGTTGGACACAATAGTTTTAGGTACCCCATGTAGTCGTACCACCTCTTTAAGGAAGAGGTTGGCTATATAGATTGCATCATTTGGCTTCTTGCAAGCTTCAATGAAATGTTCAGACTATTTCGGGGAGTTGATTGTTTTTTTCATTTACTATGACTGAATCTGTTTAAAATGCTAGTTTTTGGAGAAGTTTAGACTGCATCATCTGACTAAGGTGCATCATCTGACTGAGGTTCTTTGAGGGGAGATCGTTGTGCTTCTATAGAGAATTCAAAGGGTTGGCAGCATTAATTCTAGAACTAAACTCATAATTACCCATAATATTAATTAAAGGTACCTTTCTGTTTTCATTCTCTCTCTGACCCTTCAAATATCAGTTTTTTAAAAGGGAAATTTGACTTGGCGTGGCCCACAAAACCAATCGGACAACTTTTTTTAACAGGATTTACAAAACCCAATAAAGATGACTTTTCAGTTTCCGGTGCAATTAAGTCAGTTTCGGCCTGACTTCTTGCTTTTTTCATCTTCTCCGGTGCCAATGAGGGTTCTTGAAGGCGTTCCAAAACTGCAAATGGATTTCTTGCTATGTGTAGATGTGTGGAAGTTGATTTTGGGACGTTTAAAACTGGGAAAAAAAAGGAGAGGTCCAAACCCTTCATCTTGCAAAGCCTTTCTAATTCTTTAAGAGTCAATGTAATTTTTAAAATCATTAACCATAAGAGTCCACCTGTAGGAAAGGGAGGATTCAAACATTCGAAATCAACAAAGTTTAAAAAGATCTTTCTCCTTCTAGAATCAGTAATCTCAATTGTGGAGGGTACAAATCCACAAAGACTCTGCTTAACTTGAATCCTAGCTTCTCTGCAGTTGGTAAGATTTAAAGTGTCCATGGAATGCTTTCAAGTCCCTCAAAATTATCCTCGATAACCTCAAAGGTTCCCATGCTTCAGTAATCCAAAGGAAGGTTCTTGATTTTGAACCATCCTCCATAGCCTTTCATGGTGAGCGGCCAATTGTGTTTATGTTTATCCTATCTTCGAATTTTAGGTGAAATAAACCCATTGCCTGCCATTTCCCTTTATTATTGATGGGTTGGTGTCCACAGGATATGTGAAGATCGGAGAACAATTGGGAGATATTCTAACTAAAGCTTTAAATGGAGCAAAGATAAGCTATCTGTGCAACAAGCTAGGCATGATCGACATATTTGCTCCTGCTTGAGGGGGAGTGGTATGATATATATATATATATATTTATATATATCTCAAATGTCCTTTATTGTAATTTTACATAGTCTCTAGAGTTTAGTTTATTCTCTATATAAATATGTAACATTGCTTTGACTCAATATGTATAAGAAATATACTTCTCTGAACTCTAGATTACAGTAGGAATGTGTTTCACGTGATTTCTAGAAAACCATTGGACATATTCCTCCCTAGAACATAAATTATCCAGCATATTAGAAAGGAACTTCATCCCTGTTTGTTTTACCTTAAGCACACCTTTATATTAGAATGCCCTCCTGATGAAGGCCAAAAGTCACAACTTAACACCCAACCAGTCTCTGATTTAAATTTCGAAACTTTCAATCTATCTCTGTCCACTTCACCTAACTTTTTAAAGAACTTGCCCTCTGGCCATCTTAATAATTCTGAGACCGCCGTCAACAACCATTCTAGATGAAATCCGGAGACTGAAAAGATCTGCGAGCTTCCACATCTTCAATGTGAATTTGGTTTCTCTTTCCAAGTACAATAATGAGACTCAAGGACTTTGAAGCTGACTACCTCCATATTTGGATGAAATCGCTAGAGGTCGGAAGGAAAGAGGATAGAAGAGAGAGGATGGAGGATATAGGAGAGAGGATGGAGGATATAGGAGGGAGGAGGGAGGAGGGAGGAGGGAGGAGTGAGGGTAGAGAGAACTTACCTTTTTTTTTAGTGATGGAGTTTTGTAGCCCATTTCCTACCCAATTAGTAATGCTTCCATTTGTTGAGTCTTCTAAAAATTTTGAGACAAACAAGTGATGAAAAAGTGTTTGGTGTTGATATAATTGAATTTTGTTTTAAGTGTTGATATAATTGAATTTACCATAACCTATTGGCTTATGCACATTCTCGAGGAGGATTAACACCAAAAACAGAATTTTAGTGCCATCATTTTGGCATCCTTCTACCCACCCCTTTAAAATCAACATTGAGGGAGTGCGTATGTTTTGCATGTGCATGATTACATCTATTCTATTCTAAAGAACAAAGTTGTGATTTATATTTAATTAAGATTAAATTACAGAAAATGCCCCTGAACTGTGTAGTTTATGTCAAATATATCTCTAAAGTTTCAAGAGGATGGAAGCCATTAAACTTTGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCCCCTGGATTATTGAAAAGTTTCATAAATACTCTTGAGCCTAAAAAAAATACTCATTTATCCAAATTTTATACCAACTTTCTCACCAACCAAAATTTTAAGTTAAACATAATGACAAATTACGAACACAACTAATCTCAATAATTGTTGTTGTAATTAAAATACAATTAATTATTAAATTTAAAAAATGCATTTTACTACTAACACACCATAATAGTGTGAGATTTTTTTTATTTGACTAATTATTGTTTAGTATGTTTTTAAGAGATTTTATTTTGAAATTTTGTGAGATTTAGTGGAATTTTATGTGTTAAACGAGGTTTAAAATATCAATGTTGATGGATATTTCTGGAAAAATTATAAAAACAAAAAAAGTTCAAAAATTAAATTTAAAGTAGTGAATAAATATTTTATTATTTTCAAATAAGCAAACATGTCTATTATTTATATTACATTTATATTATTATCATTTTAATGTTTATTTTTTGTGATTTGTGGATTTTTCTACGATGTAATGAAAATATAGATTCACTTCTTCTATCAATATCGAAAAAGTGGAGATGTGAAAATATCGACAGAAACATCGATGTATCGAAGTAAATTTAAACTATGGTTTAACTTTAAAATTTTGGTTTTTATTTTGGGTGCTAAGAAGTTTGATTGTAAAAATTCAATTAATGTAAAAATTGAATTGATGTAAAAAGTTTAAGGGTATTTACGAAACTTTTCAATAGTTCATGGGTATTTTTGAGACAAAACTTTAATGGTTTCCATCTAAAACTAACAGTAAGGTATTTTTAAACCATTTTCAAAAGTTTAAGGGCATTTTTGAAACTTTAGAAAGTTTAGGGATATTTTTTACACAAAGTAGAGGGGCATTTTTTATAATTTAGCCTTTAATTAATATAAGTAATTTGGAAATAATGTATGTTCTATTGGATTATTCAGTAACTATCTACTTTTTGTTCAGGCCCACTAGTGATTTTCCTCATCCTCTAGCCCATGTGAAGCATCAAATGGGATATAGGGTAAGTAGTAATGATGGTTCTCATCTCACTTCCGGGGCTATTTTTTTAATACATGTAAATATTTGGTCTTTTGGCAGTTATCATATAACATAGTCGATACTTTAATTTGGCTTGGAATTCGTGACATTATAAATAGCTTCAGGAAGAAGAAGCTGAAACTGAGACGAATATCATATCTCAGTGGGCACTACAGCAGTCTGCCGGAGGTGCCTTATGGGTATATATGGAGTCCCCACCTTATTCCAAAGCCAAAAGGTGATACTTTATTCTGCTTTTGGTATTCATGATGGGTATAGCGTAATACAAAAAACATATTACTACTACTATGCCCCCCACCCTAACTGATGTAAAACTAACACTCAAGAATACTGAAATTTCTTTATCATTTTTATTAATTCTGTAAAGAAGATCTAGAGACCTATTCATACACGAGGGAAGTTGGTTGTGACAAACCTTTAACAGAAAGTAAAACTAACTAACAACCAAATAACCAACTAGAGGTTGCTTAAAAATCAAGAACTTAAACAAAGATTAAACCAACTGATTGATAACTAACAGTTATAACTCCAACATCAACTTCACCCAACCAATAGAAACAAACTCGACCCTGAGTTTGAATAGGAAAGTTGAAACTCATTTGTTGGGAGATAGGGATAGTTGTCAGCAACGTTGAAGGTGGAACTAATCTTCATAGTTGGAGGAAGATCTATCTGGTAGGCATTCGGTCCCAACTTGGCTAAAATAGGAAAAGGGCCAAGTTTTTTGTTAGTTAGTTTTGAATGTTGTCCAGAAGGCAAGCGTTGCTTTTTTAGATGGATCGTTACAAGTTCTCTGACTTCAAAAGTGGTTTGTCTACAATGTATGTCCGCTTTAAGTTTGTAGGAAGCATGGAAGCTTCTAAGGGTTGGCGGACTTCTTCATGGATTTTTTTGATTCGGTTAGCCATGAGAGTTGCTTCAGCACTAAGATCAAAAGGAACAGGCAAGTGTGCAACATCAAGAGTGCGGCGAGTGACTGGACAATTTCAAAAGGGGATTTGTATGTGGATTGGTTGGTCATGTTGTTGAAGGAAAATTCTGCTTGAGCAAGGTTGAGATCTCCTTGGTGGGGTTTGTCTCCACCTAGGTAGTGAAGGATGTTGCCAAGTGTTCGGTTGGTCACTTCAATTTGGCCGTCTGTTTGGGGGTGGCTAGTAGAGCTAAATAGGAGGCTAGTGCTGAATCTTTTCCATAGGGAGCACCAAAAGTAGCTCATGAATTTGACATCCCGATCCGAAACAATGCTCTTAGGTATCTTGTGTAAGCGAACAATTTCCCTAAAAAAAAGATTAGCAACATTCAAAGCATCATTAGTCTTTTTACAAGCTAAAAAGTGCACCATTTTCCTAAATCGATCAACGACTACCAAAATAGGATCATAACCACATTGGGTTCTAGGTACGCCAAGAACAAAATCCATAAAGAGGTCTTTCCAAATGTTAGGGGGAATAGGTAGTGGACAATAAAGGCCCAGGTTTGAACTAGATCCATTAGAGCTTTGGCATATATGAAGCATCTTTTAACAAAATTAGCAATATCTTTCTTTAGTTGTGGCCAAAATAATTTTTGAGTGAATAGGGTGAAGGTTTTGTCTCTACCAGAGTGGCCGGACAAACCACTACTATAAGCTTCTTTTAGAAGGGATTCTCGAAGGGAAGTGTGTGCGATGCATAAGGAGTTATTTTTGAAAAGTTAACCATAAAAAATATGAAAATCATTAGCCCGGAGATGGTTGATACATTTAGACCAAATATCTTTAAAGTCGATATCTGCTGGATGTAGGTCTTTAATATGAGAAAAACCAATGATTTTATCTTTAAGTAAAGGTAGAAGAGTACCTTTTCAGCTTAAAGCATTTGCCACTTTATTAACAATATCTAGGGTGTGCTTGATGACGAAATCAAAACGTTGTAGGAATTGAAGCCATTGAGCATGCATATGACTAATGGTCTTTTGAGATTGGAGAAACTTTAAAGAAAAATGATCGGTAAGTAAAATAAATTCTTTTCCAAGGAGATAGTGTTCCCATTATTTTAATACACAAACTAGGGAATACAATTCTTGTTCGTAAGTGGACCATTTCGGCTTAGATGGACTTAGTTTTTCACCAAAATATTCAATAGGATGTTTATCTTGGCTGAGAACAGCCCCAATACCAATTCTTGAGGCATCTACAGTAACTTCCAAGGGTTGTTGGAAATCTGGAATTCCTAAAACAGAGGTAGAGCTTAAAAATGTTTTTAATTGGTTAAAACTTGTCGTTTGAGCATCTCCCCAAGAGAAGGATTGTTTAGATTTTAGACAAGATGTTAGTGGGGTAGCAATGGTACTAAAGTCACGAATGAATTTTCTATAAAAGGAAGCTAAACTTAAAAAGCTTTAGATTTCTTTTAGAGATTTAGGTTGTGGCCAGTCAACTATGGCTTCTATTTTGTGAGGATCTACACTAATGCCTTTAGAGCTTATGATCAAGCCTAGGAAGTATAATTGGTGTTGGAAAAAGGAGCATTTTTTGATGTTGATATACAAGGCGTTTTCTTGTAAAGTGGTAATTATAGTGATCAAGTGTTCAACATGAGACTCATGGTTTTGGCTATAGACAAGAATATCGTCGAAGTAAACAACAACAAAAGGTAATGATATTTGGTTCATGAGGCACATGATGAAGGTACTAGGGGCGTTGGATAAACCAAAGTGGCCTTACTAACCATTTGAAAGGGCCCTCATTGGTTTAAAGGCTGTTTTCCATTCGTCCCCAGGCCGAATTCAAATTTAGTGATAGTCACTTCTAAAATCTATTTTGGAAAATATTGAGGCTTTCCCTAACTGGTCTAAAAGGTCCAGTAGACTAGGAGTTGGGAAACAATATTTAATGGTAATTTATTGATGGCTCGGCTGTCCACACAGTCTCCAAGAACCATCTTTTTAGGGTGCAAGTAAACTGGGACTGCACAAGGACTAAGGCTAGGTTGTACGTGGCCTTTGTCTAAAAGGTTGTGGATTTCTTTATGGAGAGCTTTGTATTCATTAGGGTTCATTCTGTAGTGGGGTAGGTTTGGTAAAGTACTACGTGAGATAAGGTCCATTTGATGTTGGATGTTTCGAAGTGGAGGTAAGGCAATAGATTCATCTAAAAGGGATGGGAAATTTTGTAAAATTTGGGAAACTTAATGGTGTAGATCTTGGTTGAGATGTCAGGTGGAAAAATTTTCTTTGGTAACAAAGGCTACAATGTAAAGAGGTTTAGGAAGAAAGAAATCTTTATCTTGACATAACAGAAAATGTGGTTTAGGAGGGAAACGTGTACCTTTTGTGTAGTAATGGACGTGCCAAGGGGAAGAAGAACAACTTTTTTGCCCATCCAAGTAAATTCATAACATTGTGGTCATATTGCCAAGGGCAGCCGAGAAGTAAGTGACAAACATCCATTTCTAGGACATCACAAGCAAGTTCTTGATAGAGATTGCCAATGGAGAGAGGAACGGTGCAAATTTCTTTAACGGTAGTTTCTCCTTTCTTTGTTATCCAATTGACTTTATAAGGGTTGGAGTGTGCGTCGGCTGTGAGTTTGAGGGCATTTATCAATTTCTTGGAAACAATGTTTTCCATGCTGCCACTGTCAATAACCACTTAACAAACTTTCCCATTGATTGTGCACTTAATGCAGAAAAGGGAATATCTTTGAAAATGAGTTTCAGTCTTTGGGGTGATAAGAACACGTTGAAGTACACAAGAAAGCTGGTCAACATCATCTGGTTCCAAAAAGTCAGGTTCATCTTGATAAGGGTTGTCATCAGATTCATCCTCAACATTAGTTTCTTGAAGAACAATGGTTTTTCATTGAGGGCATTCATTGGAAAGGTATCTTTCATGTCCACAACGGTAGCATTTGCCAATCATCGTCCTTTGATATGCGTTAGTAGTTCTCTTGATTGGGACTGTCATTAGTTTTAGGAATGATTGGTGTTTTGGTACCATTGACTGTCTTTGATTGATGAAGGTCAGGAGTAGTTTGATGGACTTGCAAGCACCGGCAGATTTAGTATAGACCCGGGGGGGCGAGCCCCCCGTCAACTTTATTGTCTTTATATAATATGTATAAACAAAACCAATTATAGCTTAATATTTATAGACAATTTAGTGATAACTATGCTTGTCAGTCCCCTTCCAACCAGAGTTCAAGTCTTGTATAGTTTCTTTTCTAGATTTTTATTTTTTCTCTAAGTTACAACTCGAAGCCCCCTCTCAATAAATTTTCTGGATCCGCCACTGCTTGCAAGGGAGGTATCCGTGGAGTTCTTTCGAAAAGGTTGTGGTTGACAATCCCAAGGCTGTCTTCTTTGTTAGGTTGACATTTTTTGGGGCAGTAGCTTCTTTGATTGTAGTTGCTAAAGAGATTGCATCAGAAAGGTAGGCTACAGGGTGTAACATGACCCCTTCTTTAATCTCATCTCGAAGCACTAGCACAAATCTTGAAACTAGTTGCTGTTTGGATTCTGGCAAGTTGTTTAGAGCCCCTAAGCGATGGAATTCTTCTGTATAACCAAGGATGGAACGATCCCCTTGTTTACAAAGTTGGTAATGATTATAGAGCCGCTGTTGATAGTTGATGGGAAGGAATTGTTTCTTCATTAATTTCAGAAGTTTTGGCCAAGATCGTATGGGAGTAGCCATAGTATCGACGATTGGTCTCTAATTGGTCCCACCAAGCTGAAGCACCTCTTTGAAATTTGAAGGCCACTTCTTTGCCTTCTTATCATCCGAAATATTGGCATATTAAAAAAAAAATTCTACCTTTGAATCCCAATCTAGAAGTTCTTTATCCATCTGACCATCAAAACTCGGGTAAGTCCATCTTCATTTTGTATTCTGTGTTTTCAGGTTGAGGGTCGTGTCTCCTTTTATTTAAACCCCTTCGACTTTGGGCAGGATAGTAATAATTGTCACCCTCACTTGAAGAGTTGAACTCCAGCATTGAAAAACCACCCCGGTGGTGTTCTTGAGGTCAGAATCTTGCTTGATATAGGGTTCTTGGATTGTGTTCTATGTTCATTTGTAATTCTTGATAAGGATCTTCGGCAATCTTGTGGTTTCCTAAGTTTCTTGCAAGTTTTCTACCTTCATTTAGAGGAAAATTGATGTTGGCAGCCATGTCATTGCCTAAATGTTGCCTTTCTTGGATTTGATTTCTGTTATTTGGAGCTATTTTAGCAAGCTCTTTGTGGGTTTCAGAAGTGATGGTTAGTTTCTGTAAAATTTCTCCTAACATCAAACGGATTTCTCCTAATTCTCCTACAAATTTCTCTACACTAATTAGTTTGGTTTCCATAGCTTTTGATGAAGAGGAAGGTTCTGTGTCCATGAAATAAATTGATCTGAGAAAGAATCTGCAATGGATCTTCGCTCTGATACCAATTGATGTAAAACTAACACTCAAGAATACTAAAATTTCTTTTTCTTTTTTATTAATTCTGTAAAGAAGGTCTAGAGACTATTTGTACATGAGGGAAGTTGGTTGTAACAAACCTTTGACAGAAAGTAAAACTAACAAACAACAAAATAACCAACTAGAGGTTGGTTAAACATCAAGAACTTAAACAAAGATTAAACTAACTAATTGATAACTAACAATTAGTTATAACTACTACATCACTAACTCCATAAACCACAGGTAAAATAAGAAGAAAAATATATATTATATGCATGTATCTATATTATTCTTGCTATATTCTCCATATTTAAGTGGCATTTTCATGTTACAGGATTTAATTCACATGGTCGATCCAGTCTTCCTTGCATTTGTTTTACCAAAGTCTGTTTATTATAATCATCTTCTGTTTATGCATATTGTTAGGTATTTTCATGGCTAATATCATTAATTATAAGTTTTCTATGTCTGTCTGTTAGAGGATTTTGAACATTAAGGCCTGTTTGGTAACCATTTTGTTTTTGTTTTTGAAAATTGAATCTACGGACTCCTTCTATTCTACCACTAAATTTCTTCCTTTGTCATCTACTTGGCCAATGGTTTTAAAAACAAACCAAAATTTGAAAACTAAAAAATAGTTTTTAAAAACTTGATTTTGATTTAGGAATTTTACGACGACTTCGATCATTGTACTTAATAAATATGCAAATCATTTCAAGAGGTCGACAGAAAATAGGCTTCACTTCATAAACCCAGAGCAAAAAACGAAATGATTACCAAATGGGCCATAAAATTTCAGTTTTTCAAAATGTATTTAGTTTTAGATTGGAAGGTTGCTTGGACTTTTTCTATTTCACCATCTTTGATTATGGGCAGTTTGCTTAGTTTACTGGTTTTAAAATAGACGACGTTAGGTTATGTTAAGCAATATGCTAAACTTTTTCTCCAACAAGCTTGATATGTGCTCAACACACTGGTAGACTACTCAAATTTTGGAAGGAACTTCTGACAATGAACTTAAATTGGAAATTATTGCTCTATTAACACATTGTGGTTAACTACGTAGTGTCAACTTTTTTATTCGTGCCCATTTTAATTGTTATGCCCAAGTGTTTGATATCCAGTTCTTCTTTTCTCTCTTTCACAGGATGAGTTAAAAATTCTTTCTGTTTCATTTCCTGTTCTTTGGGATTTCACGTACAACCAGTACTTGGTGATCCCACTCTTACTGGTTACCCTATTCTCATTGTTTCAGATTGGGGATCTAAAATTGATGTGGTTGGATTTTGTTTCCTTGACCTTGCTTCAAACTATCAACCGCCAAATTCTCTTGTGGAATGGATAGAGGCTGGCGAAAGACCAATTTATATTGGATTTGGCAGCCTTGTGAGCTACATTTTTGCCCTTTGCTTTTCTAGTATTTCAGTACTTGTCTTGCATTAAACATGATTTAGTCTTCCAATTTGCAAATAGTTCGAATTATATAAATAATTATTTTAATTGGAATCTTTTAAATGTAATGAAGGAAGTAGTTGATAAAAATAAGAAAATTGCTTGACCCAACTAAAGATCCAAGTCGTATGAGTGAACTGACATTGAAGAACAATCCTGAATAAAGCTCTGAACGTTAAATCCTCCAGCCAGGATTTCTTTTCTTCATTGCCTCATAGGCTATCAAACTGTTAAGCTTCAGTCATGCTATAAAACAAAAAAGACTGATATATTTTTTTTCATTTCGTGTTTGATGCAGCCTGTTGAACAACCACAAGAAATGACGCAGATTATAGTTGAAGCTCTGGAAATTACAGGAAGGAGAGGAATAATAAATAAAGGCTGGGGTGGTCTTGGAAGCTGTTAGTCTAATTTTTCTATCCATTTTTTCCTCCTTTTTCTGGTTCATATATGAAATACAAAAATAAATATATCCGAATGCTTGACTCATGAAGAAATATGGATCTCTATATCTAGTGGCAGAGCCAAAAGACTTTGTGTACGTGCTGGATAATTGTCCCCATGATTGGCTATTTCCGAGATGCATGGCTGTGGTAAAGTATTATTAGTGTTGTATTTAATTTCTTGCAAAGTTGTGGCGCAAAGCTTTTTGTTCATCACAGTGTGGCTTTGATCCTCTTCCCCTGGTCCCCACCTTTCATTCTATTTTCATACGTTAGAATGTTGAACAATATATATAAACTTTTAAGACCTGGAAAGTATACTGACTATACGTATGATGGTAATAATTAACAAATGTGCAGGGTAGAGGACTATGAAAAGTGAAACTATATTAAGGCAAAGTACCTCTCTTTGGTTCTTAAGTTTTGAGTATAGTTTGCAGTTCTATTTGATCCCTAAGTTGTAAGATGTTGAACCTTATGCTCGAGTTTGGAGTGTATTGTTTCCATTTGGTCCAAAAGTTTCAAAATATTACATTTTTACCCTTGAAACTAAATGCTGGATTTCAATACCTAGGTACTAAATGGAACCTATATTTCAAACTTCACCTCTGAAAAGCTTTTCTTTTCCATGAACTGCATAGCATCATTGTGGGACGCACTGCCCTTCAAGCATAACATGTTTATGTTTCAATGGATATATGAATTGTTAATTATATGATATTTTGAAACCATACGATAAGTGTTTGAGTTGTAAAATGTGTAACTTTAGAGCAGTCCATGCGTGATAATATAAATTCTAGTTACTTTCTTGTCTTTGATCCCCATCTCTTTCTCTATGTTATGTTTATAGAACAATCTAAATATACAGTGTATCGTTCAGGTACACCACGGGGGTGCAGGAACAACTGCAGCTGGACTCAAAGCAGCTGTAAATCTTTATTTATCTGCTGTTTGTTTTGCAAATTACTTATAACAATTGATTTTAGCCATCTTTTTGTTTCTTCAGTGTCCAACAACAATCGTGCCAATCTTCGGTGATCAGCAATTTTGGGGCGAGCGTGTGCATGCACGAGGGCTTGGTCCCCCGCCTATCCCAATTGCTGAATTTTCACTTGAAAAGTTGATTGATGCCATAAACTTTATGCTTGATCCCAAGGTATCTCCAGTATTTATGCAAACTTTTGGTGACTTTTCTAAAACTCTATATTGCCATTGCCACCAGTTTTGCATAAAATTGTCCGATGTTATGTGCATCATGAACAATTTTTTTATTAGAAGTTACAAAAGTGAAAGAAAAACTATTTATTTTGGTGCTACTCTTAAGAATTGTTTCATTTTGGTCCTCATGTTTACAAATATTCAAATTTAATCTATACGGTTGATGTATTTCCAATAAATGTAACTTTGTCCCAACTGTTGGTTTTTGTATAGTATTTGCTAAAAATAATTTAGTCCTTGTGAGCTTTTACTCTAGGAAATTGATACATTGTGTAACTATATTATTTTGCCTAGTTGAACAAAATTTAACAAAAGTGAGGTCCAATCTTAAGATCTATGGAAAATATAGAGGATAAAATTTCACCCTAAACTCTTAATCCCATATGGTATTTTAATCAAAAAGAAAACTTTTCATTTATCATATGTGAAAATGTGCATATTGCTTTGAAAGAAACAATGGTATGAAGTTTCAGCTAACAAATTTAATTCTTCTCAATTTTGTCATTGATTCTGTTAGACTGTTTATCTACACTCCTGGTACACATCACATGCATTCGAACCAATGAGTTCTTAGAGGAAGTAAAAATGTCTTAACCACCGAGTTATGCTTTGTTGATGAACTATGTATTTCTTTCATTGCATACTTTGCTGTATTCTTAATGACATATTTTCTCGGCGTTGGTGAGCTTGCAGGTAAAAGAGAGGACTTTGGAAGTGTCAAAGGCCATAGAATCTGAGGATGGAGTAGGCGGTGCAGTGAATGCTTTCCACAAGCATTTCCATAGAAATAGAACTCTGGCTAAACCTGAGGCTCCTAAACGTGGCTTTTCAGTTCGACGGCTTCTCCATATATCTTAATGTTTTTATTCTTTTAGATCCGTACAGTATCTACAGAAAACTACCTCTCTTAACAAAAGCCATTGTTGTTGTATCGTTCCAGTAATCTAACCTATAAGAGTCTACCTTCATTTTTGTTAAGAGCATCTGACTCGTATTTTGTGTTTTGTAAAACATATTATGGTTAAATTCAC

mRNA sequence

TTCTCATATATCTCTCTTTTATTTTGGAAAAAAAAATGCTTCACGTATCTCTCTCTTTCGCTCTTCTGTTTCTCACTCTAGTAACCAAATTGTAGTCATCCCATTTTGCAAATCTTCAATTTCCTTGTTTTGAATTTCAAAGTACAACTTTAAACGTTTCAGTTTCATTCTCTTCCTATACGAACCTCTCTCAAATTCCATGGCCCACTTGGAGATCCACCATTCTCCCCATTCTTCTTCCGGTCGTTCTAGTGATTTCTCTGTATCCATGGACTCCGATGGCGATGGCGACAGCGACGGCGATCGGGTTGTCCCTTCTTCTGGAAATACTGACAGGAATTCATCTGGAGATAGTACTCAAGATGGATCATCAGTAGGTAGGGAATTAGTTTCTTGTTCAACTAAGCCTACCAAGTTGAGGAAGTCAAGACAGAGTCATGCATTGCATCATTTACTGCCAAATATTTTTGATGAAAAAGTTTCTTCAAGGAAGAAGCTCAGGTGGTTGAAAAGGGTTGCCACAGTCAAGCATGATGGAACTGTCCAAATGGAAGTTCTCGAAGGGATTCAACCAGAAAATTTACATTTTGAAACTGGTGTAGATGATGAAGCTGTAGATGATGAACCCCTAGACACAGCCAATGTTCCATTCATCCCCCCATTACAAATTGTGATGCTCATAGTTGGCACAAGAGGAGACGTTCAACCTTTTGTTTCCATTGGGAAGCGTTTACAGGAGCATGGCCATAGAGTTAGATTGGCAACTCATGCAAATTTTAAAGATTTTGTACTCTCTACTGGACTGGAGTTCTTTCCTTTAGGAGGAGATGCTAAAGTTCTTGCCGACTATATGGTAAAGAATAAAGGATTCCTTCCATCAGGACCTTCGGAGATACATGCCCAACGAAATCATTTGAAGGATATTATTTTCTCATTACTTCCTGCGTGCCAGGATGATGATCCAGAATCAAAAATTCCATTCAAGGCAGATGCGATAATTGCTAATCCCCCTGCATATGGTCATACTCAAGTTGCTGAGGCGTTGAAATTACCCCTTCACGTGTTTTTTACAATGCCATGGACGCCCACTAGTGATTTTCCTCATCCTCTAGCCCATGTGAAGCATCAAATGGGATATAGGTTATCATATAACATAGTCGATACTTTAATTTGGCTTGGAATTCGTGACATTATAAATAGCTTCAGGAAGAAGAAGCTGAAACTGAGACGAATATCATATCTCAGTGGGCACTACAGCAGTCTGCCGGAGGTGCCTTATGGGTATATATGGAGTCCCCACCTTATTCCAAAGCCAAAAGATTGGGGATCTAAAATTGATGTGGTTGGATTTTGTTTCCTTGACCTTGCTTCAAACTATCAACCGCCAAATTCTCTTGTGGAATGGATAGAGGCTGGCGAAAGACCAATTTATATTGGATTTGGCAGCCTTCCTGTTGAACAACCACAAGAAATGACGCAGATTATAGTTGAAGCTCTGGAAATTACAGGAAGGAGAGGAATAATAAATAAAGGCTGGGGTGGTCTTGGAAGCTTGGCAGAGCCAAAAGACTTTGTGTACGTGCTGGATAATTGTCCCCATGATTGGCTATTTCCGAGATGCATGGCTGTGGTACACCACGGGGGTGCAGGAACAACTGCAGCTGGACTCAAAGCAGCTTGTCCAACAACAATCGTGCCAATCTTCGGTGATCAGCAATTTTGGGGCGAGCGTGTGCATGCACGAGGGCTTGGTCCCCCGCCTATCCCAATTGCTGAATTTTCACTTGAAAAGTTGATTGATGCCATAAACTTTATGCTTGATCCCAAGGTAAAAGAGAGGACTTTGGAAGTGTCAAAGGCCATAGAATCTGAGGATGGAGTAGGCGGTGCAGTGAATGCTTTCCACAAGCATTTCCATAGAAATAGAACTCTGGCTAAACCTGAGGCTCCTAAACGTGGCTTTTCAGTTCGACGGCTTCTCCATATATCTTAATGTTTTTATTCTTTTAGATCCGTACAGTATCTACAGAAAACTACCTCTCTTAACAAAAGCCATTGTTGTTGTATCGTTCCAGTAATCTAACCTATAAGAGTCTACCTTCATTTTTGTTAAGAGCATCTGACTCGTATTTTGTGTTTTGTAAAACATATTATGGTTAAATTCAC

Coding sequence (CDS)

ATGGCCCACTTGGAGATCCACCATTCTCCCCATTCTTCTTCCGGTCGTTCTAGTGATTTCTCTGTATCCATGGACTCCGATGGCGATGGCGACAGCGACGGCGATCGGGTTGTCCCTTCTTCTGGAAATACTGACAGGAATTCATCTGGAGATAGTACTCAAGATGGATCATCAGTAGGTAGGGAATTAGTTTCTTGTTCAACTAAGCCTACCAAGTTGAGGAAGTCAAGACAGAGTCATGCATTGCATCATTTACTGCCAAATATTTTTGATGAAAAAGTTTCTTCAAGGAAGAAGCTCAGGTGGTTGAAAAGGGTTGCCACAGTCAAGCATGATGGAACTGTCCAAATGGAAGTTCTCGAAGGGATTCAACCAGAAAATTTACATTTTGAAACTGGTGTAGATGATGAAGCTGTAGATGATGAACCCCTAGACACAGCCAATGTTCCATTCATCCCCCCATTACAAATTGTGATGCTCATAGTTGGCACAAGAGGAGACGTTCAACCTTTTGTTTCCATTGGGAAGCGTTTACAGGAGCATGGCCATAGAGTTAGATTGGCAACTCATGCAAATTTTAAAGATTTTGTACTCTCTACTGGACTGGAGTTCTTTCCTTTAGGAGGAGATGCTAAAGTTCTTGCCGACTATATGGTAAAGAATAAAGGATTCCTTCCATCAGGACCTTCGGAGATACATGCCCAACGAAATCATTTGAAGGATATTATTTTCTCATTACTTCCTGCGTGCCAGGATGATGATCCAGAATCAAAAATTCCATTCAAGGCAGATGCGATAATTGCTAATCCCCCTGCATATGGTCATACTCAAGTTGCTGAGGCGTTGAAATTACCCCTTCACGTGTTTTTTACAATGCCATGGACGCCCACTAGTGATTTTCCTCATCCTCTAGCCCATGTGAAGCATCAAATGGGATATAGGTTATCATATAACATAGTCGATACTTTAATTTGGCTTGGAATTCGTGACATTATAAATAGCTTCAGGAAGAAGAAGCTGAAACTGAGACGAATATCATATCTCAGTGGGCACTACAGCAGTCTGCCGGAGGTGCCTTATGGGTATATATGGAGTCCCCACCTTATTCCAAAGCCAAAAGATTGGGGATCTAAAATTGATGTGGTTGGATTTTGTTTCCTTGACCTTGCTTCAAACTATCAACCGCCAAATTCTCTTGTGGAATGGATAGAGGCTGGCGAAAGACCAATTTATATTGGATTTGGCAGCCTTCCTGTTGAACAACCACAAGAAATGACGCAGATTATAGTTGAAGCTCTGGAAATTACAGGAAGGAGAGGAATAATAAATAAAGGCTGGGGTGGTCTTGGAAGCTTGGCAGAGCCAAAAGACTTTGTGTACGTGCTGGATAATTGTCCCCATGATTGGCTATTTCCGAGATGCATGGCTGTGGTACACCACGGGGGTGCAGGAACAACTGCAGCTGGACTCAAAGCAGCTTGTCCAACAACAATCGTGCCAATCTTCGGTGATCAGCAATTTTGGGGCGAGCGTGTGCATGCACGAGGGCTTGGTCCCCCGCCTATCCCAATTGCTGAATTTTCACTTGAAAAGTTGATTGATGCCATAAACTTTATGCTTGATCCCAAGGTAAAAGAGAGGACTTTGGAAGTGTCAAAGGCCATAGAATCTGAGGATGGAGTAGGCGGTGCAGTGAATGCTTTCCACAAGCATTTCCATAGAAATAGAACTCTGGCTAAACCTGAGGCTCCTAAACGTGGCTTTTCAGTTCGACGGCTTCTCCATATATCTTAA

Protein sequence

MAHLEIHHSPHSSSGRSSDFSVSMDSDGDGDSDGDRVVPSSGNTDRNSSGDSTQDGSSVGRELVSCSTKPTKLRKSRQSHALHHLLPNIFDEKVSSRKKLRWLKRVATVKHDGTVQMEVLEGIQPENLHFETGVDDEAVDDEPLDTANVPFIPPLQIVMLIVGTRGDVQPFVSIGKRLQEHGHRVRLATHANFKDFVLSTGLEFFPLGGDAKVLADYMVKNKGFLPSGPSEIHAQRNHLKDIIFSLLPACQDDDPESKIPFKADAIIANPPAYGHTQVAEALKLPLHVFFTMPWTPTSDFPHPLAHVKHQMGYRLSYNIVDTLIWLGIRDIINSFRKKKLKLRRISYLSGHYSSLPEVPYGYIWSPHLIPKPKDWGSKIDVVGFCFLDLASNYQPPNSLVEWIEAGERPIYIGFGSLPVEQPQEMTQIIVEALEITGRRGIINKGWGGLGSLAEPKDFVYVLDNCPHDWLFPRCMAVVHHGGAGTTAAGLKAACPTTIVPIFGDQQFWGERVHARGLGPPPIPIAEFSLEKLIDAINFMLDPKVKERTLEVSKAIESEDGVGGAVNAFHKHFHRNRTLAKPEAPKRGFSVRRLLHIS*
BLAST of Cucsa.356970 vs. Swiss-Prot
Match: U80A2_ARATH (Sterol 3-beta-glucosyltransferase UGT80A2 OS=Arabidopsis thaliana GN=UGT80A2 PE=1 SV=1)

HSP 1 Score: 755.7 bits (1950), Expect = 3.7e-217
Identity = 374/588 (63.61%), Postives = 453/588 (77.04%), Query Frame = 1

Query: 5   EIHHSPHSSSGRSSDFSVSMDSDGDGDSDGDRV--VPSSGNTDRNSSGDSTQDGSSVGRE 64
           E+  +P +     +D +V+ +S G G+    RV  +P  G+    SS D  +  S+    
Sbjct: 49  ELETNPKTVVASIADETVA-ESSGTGNKSFSRVWTMPLEGS----SSSDKAESSSTNQPR 108

Query: 65  LVSCSTKPTKLRKSRQSHALHHLLPNIFDEKVSSRKKLRWLKRVATVKHDGTVQMEVLEG 124
           L    T+    R+ + +H L      IFD+K+S+ KKL+ L R+ATVKHDGTV+ EV   
Sbjct: 109 LDKSKTE----RQQKVTHILAEDAAKIFDDKISAGKKLKLLNRIATVKHDGTVEFEVPAD 168

Query: 125 IQPENLHFETGVDDEAV-DDEPLDTANVPFIPPLQIVMLIVGTRGDVQPFVSIGKRLQEH 184
             P+ +  + G     V  DE +D  ++ +IPP+QIVMLIVGTRGDVQPFV+I KRLQ++
Sbjct: 169 AIPQPIVVDRGESKNGVCADESIDGVDLQYIPPMQIVMLIVGTRGDVQPFVAIAKRLQDY 228

Query: 185 GHRVRLATHANFKDFVLSTGLEFFPLGGDAKVLADYMVKNKGFLPSGPSEIHAQRNHLKD 244
           GHRVRLATHANFK+FVL+ GLEF+PLGGD KVLA YMVKNKGFLPSGPSEI  QRN +KD
Sbjct: 229 GHRVRLATHANFKEFVLTAGLEFYPLGGDPKVLAGYMVKNKGFLPSGPSEIPIQRNQMKD 288

Query: 245 IIFSLLPACQDDDPESKIPFKADAIIANPPAYGHTQVAEALKLPLHVFFTMPWTPTSDFP 304
           II+SLLPAC++ DP+S I FKADAIIANPPAYGHT VAEALK+P+HVFFTMPWTPTS+FP
Sbjct: 289 IIYSLLPACKEPDPDSGISFKADAIIANPPAYGHTHVAEALKIPIHVFFTMPWTPTSEFP 348

Query: 305 HPLAHVKHQMGYRLSYNIVDTLIWLGIRDIINSFRKKKLKLRRISYLSGHYSSLPEVPYG 364
           HPL+ VK   GYRLSY IVD+LIWLGIRD++N  RKKKLKLR ++YLSG   S   +P+G
Sbjct: 349 HPLSRVKQPAGYRLSYQIVDSLIWLGIRDMVNDLRKKKLKLRPVTYLSGTQGSGSNIPHG 408

Query: 365 YIWSPHLIPKPKDWGSKIDVVGFCFLDLASNYQPPNSLVEWIEAGERPIYIGFGSLPVEQ 424
           Y+WSPHL+PKPKDWG +IDVVGFC+LDLASNY+PP  LVEW+EAG++PIYIGFGSLPV++
Sbjct: 409 YMWSPHLVPKPKDWGPQIDVVGFCYLDLASNYEPPAELVEWLEAGDKPIYIGFGSLPVQE 468

Query: 425 PQEMTQIIVEALEITGRRGIINKGWGGLGSLAEPKDFVYVLDNCPHDWLFPRCMAVVHHG 484
           P++MT+IIVEAL+ T +RGIINKGWGGLG+L EPKDFVY+LDN PHDWLFPRC AVVHHG
Sbjct: 469 PEKMTEIIVEALQRTKQRGIINKGWGGLGNLKEPKDFVYLLDNVPHDWLFPRCKAVVHHG 528

Query: 485 GAGTTAAGLKAACPTTIVPIFGDQQFWGERVHARGLGPPPIPIAEFSLEKLIDAINFMLD 544
           GAGTTAAGLKA+CPTTIVP FGDQ FWGERVHARG+GP PIP+ EFSL KL DAINFMLD
Sbjct: 529 GAGTTAAGLKASCPTTIVPFFGDQPFWGERVHARGVGPSPIPVDEFSLHKLEDAINFMLD 588

Query: 545 PKVKERTLEVSKAIESEDGVGGAVNAFHKHF-HRNRTLAKPEAPKRGF 589
            KVK     ++KA++ EDGV GAV AF KH     + ++ P     GF
Sbjct: 589 DKVKSSAETLAKAMKDEDGVAGAVKAFFKHLPSAKQNISDPIPEPSGF 627

BLAST of Cucsa.356970 vs. Swiss-Prot
Match: U80B1_ARATH (Sterol 3-beta-glucosyltransferase UGT80B1 OS=Arabidopsis thaliana GN=UGT80B1 PE=2 SV=1)

HSP 1 Score: 581.6 bits (1498), Expect = 9.6e-165
Identity = 291/581 (50.09%), Postives = 394/581 (67.81%), Query Frame = 1

Query: 27  DGDGDSDGDRVVPSSGNTDRN--SSGDSTQDGSSVGRELVSCSTKP-----------TKL 86
           D    S+   ++ +SG+ D     SG  + DG    R L  C T P           +++
Sbjct: 17  DNGVKSEKASLLETSGSVDTTPEDSGHRSSDGH---RGLDHCETAPVGLYGDMLINDSEI 76

Query: 87  RKSRQ-----SHALHHLLPNIFDEKVSSRKKLRWLKRVATVKHDGTVQMEVLEGIQP--E 146
           + SR      S A+H    N+  +++S ++K + +  +  +++DGTV  EV++   P  E
Sbjct: 77  QYSRSLTEKGSPAIH----NLKLDRLSEQEKQKLIVELVRIQNDGTV--EVIDNGTPVSE 136

Query: 147 NLHFETGVDDEAVDDEPLDTANVPFIPPLQIVMLIVGTRGDVQPFVSIGKRLQEHGHRVR 206
              FE       +  E   T +   IP L+I +L+VGTRGDVQPF+++ KRLQE GHRVR
Sbjct: 137 LWEFEPTKGQSTITYEKSLTESFRSIPRLKIAILVVGTRGDVQPFLAMAKRLQEFGHRVR 196

Query: 207 LATHANFKDFVLSTGLEFFPLGGDAKVLADYMVKNKGFLPSGPSEIHAQRNHLKDIIFSL 266
           LATHANF+ FV + G+EF+PLGGD + LA YM +NKG +PSGPSEI  QR  LK II SL
Sbjct: 197 LATHANFRSFVRAAGVEFYPLGGDPRELAAYMARNKGLIPSGPSEISKQRKQLKAIIESL 256

Query: 267 LPACQDDDPESKIPFKADAIIANPPAYGHTQVAEALKLPLHVFFTMPWTPTSDFPHPLAH 326
           LPAC + D E+   F+A AIIANPPAYGH  VAEAL +P+H+FFTMPWTPT++FPHPLA 
Sbjct: 257 LPACIEPDLETATSFRAQAIIANPPAYGHVHVAEALGVPIHIFFTMPWTPTNEFPHPLAR 316

Query: 327 VKHQMGYRLSYNIVDTLIWLGIRDIINSFRKKKLKLRRISYLSGHYSSLPEVPYGYIWSP 386
           V     Y LSY +VD ++W  IR  IN FRK+KL L  I+Y S ++ S+  +P GY+WSP
Sbjct: 317 VPQSAAYWLSYIVVDLMVWWSIRTYINDFRKRKLNLAPIAYFSTYHGSISHLPTGYMWSP 376

Query: 387 HLIPKPKDWGSKIDVVGFCFLDLASNYQPPNSLVEWIEAGERPIYIGFGSLPVEQPQEMT 446
           H++PKP DWG  +DVVG+CFL+L S YQP    + WIE G  P+YIGFGS+P++ P++  
Sbjct: 377 HVVPKPSDWGPLVDVVGYCFLNLGSKYQPREEFLHWIERGSPPVYIGFGSMPLDDPKQTM 436

Query: 447 QIIVEALEITGRRGIINKGWGGLGSLA-EPKDFVYVLDNCPHDWLFPRCMAVVHHGGAGT 506
            II+E L+ T +RGI+++GWGGLG+LA E  + V+++++CPHDWLFP+C AVVHHGGAGT
Sbjct: 437 DIILETLKDTEQRGIVDRGWGGLGNLATEVPENVFLVEDCPHDWLFPQCSAVVHHGGAGT 496

Query: 507 TAAGLKAACPTTIVPIFGDQQFWGERVHARGLGPPPIPIAEFSLEKLIDAINFMLDPKVK 566
           TA GLKA CPTTIVP FGDQ FWG+R++ +GLGP PIPIA+ S+E L  +I FML P+VK
Sbjct: 497 TATGLKAGCPTTIVPFFGDQFFWGDRIYEKGLGPAPIPIAQLSVENLSSSIRFMLQPEVK 556

Query: 567 ERTLEVSKAIESEDGVGGAVNAFHKHFHRNRTLAKPEAPKR 587
            + +E++K +E+EDGV  AV+AFH+H      L +  + K+
Sbjct: 557 SQVMELAKVLENEDGVAAAVDAFHRHLPPELPLPESSSEKK 588

BLAST of Cucsa.356970 vs. Swiss-Prot
Match: ATG26_YARLI (Sterol 3-beta-glucosyltransferase OS=Yarrowia lipolytica (strain CLIB 122 / E 150) GN=ATG26 PE=3 SV=3)

HSP 1 Score: 260.4 bits (664), Expect = 4.9e-68
Identity = 162/451 (35.92%), Postives = 246/451 (54.55%), Query Frame = 1

Query: 159  MLIVGTRGDVQPFVSIGKRLQEHGHRVRLATHANFKDFVLSTGLEFFPLGGDA----KVL 218
            +L +G+RGDVQP++S+GK L E GHRVR+ATH+ FKD++   G+EF  + GD     K++
Sbjct: 999  LLTIGSRGDVQPYISLGKALIEEGHRVRIATHSEFKDWIEGYGIEFKEVAGDPSELMKIM 1058

Query: 219  ADYMVKNKGFLPSGPSEIHAQRNHLKDIIFSLLPACQDDDPESKIPFKADAIIANPPAYG 278
             D+ V +  FL    S+    R  + +++ S   ACQ           +D +I +P A  
Sbjct: 1059 VDHGVFSVSFLRDAASKF---RGWINELLASSWEACQG----------SDVLIESPSAMA 1118

Query: 279  HTQVAEALKLPLHVFFTMPWTPTSDFPHPLAHVKHQMGYR---LSYNIVDTLIWLGIRDI 338
               +AEAL++P    FTMPW+ T  +PH       +MG     L+Y + D + W GI   
Sbjct: 1119 GIHIAEALQIPYFRAFTMPWSRTRAYPHAFIVPDQKMGGSYNYLTYVMFDNVFWKGISGQ 1178

Query: 339  INSFRKKKLKLRRISYLSGHYSSLPEVPYGYIWSPHLIPKPKDWGSKIDVVGFCFLDLAS 398
            +N +RKK L L R +     +    +VP+ Y  SP ++P P D+   I + G+ FLD  S
Sbjct: 1179 VNRWRKKTLHLPRTNL---DHMEQNKVPFLYNVSPAVLPPPVDFPDWIKITGYWFLDEGS 1238

Query: 399  -NYQPPNSLVEWIEA----GERPIYIGFGSLPVEQPQEMTQIIVEALEITGRRGIINKGW 458
             +Y P + L  ++E     G++ +YIGFGS+ V  P  +T+ +VE++     R I+NKGW
Sbjct: 1239 KDYTPDDKLCRFMEKARNDGKKLVYIGFGSIVVSDPTALTKSVVESVLKADVRCILNKGW 1298

Query: 459  G---GLGSLAEPK----DFVYVLDNCPHDWLFPRCMAVVHHGGAGTTAAGLKAACPTTIV 518
                G     EP+    + V  + NCPHDWLFP+  A VHHGG+GTT AGL+A  PT I 
Sbjct: 1299 SDRLGKKDAKEPEIPLPEEVLQITNCPHDWLFPQIDACVHHGGSGTTGAGLRAGLPTIIK 1358

Query: 519  PIFGDQQFWGERVHARGLGPPPIPIAEFSLEKLIDAI-NFMLDPKVKERTLEVSKAIESE 578
            P FGDQ F+  RV   G G   I + + ++ +   A+     + ++  +   V + I SE
Sbjct: 1359 PFFGDQFFYANRVEDLGAG---IHLRKLNVSQFSKALWEATHNERIIAKAAAVGRQIRSE 1418

Query: 579  DGVGGAVNAFHKHFHRNRTLAKPEAPKRGFS 590
            +GV  A+ A ++     R+L +    KRG++
Sbjct: 1419 NGVISAIQAIYRDLDYARSLVQ---KKRGYT 1427

BLAST of Cucsa.356970 vs. Swiss-Prot
Match: UGT52_DICDI (UDP-sugar-dependent glycosyltransferase 52 OS=Dictyostelium discoideum GN=ugt52 PE=2 SV=1)

HSP 1 Score: 257.3 bits (656), Expect = 4.1e-67
Identity = 156/464 (33.62%), Postives = 258/464 (55.60%), Query Frame = 1

Query: 148  NVPFIP--PLQIVMLIVGTRGDVQPFVSIGKRLQEHGHRVRLATHANFKDFVLST-GLEF 207
            N P +P  PL+I +L +G+RGD+QPF+++   L+E+GH V LATH  ++D +    GL +
Sbjct: 1155 NSPILPVKPLRITILTIGSRGDIQPFIALSLGLKEYGHNVTLATHELYRDLISKEFGLNY 1214

Query: 208  FPLGGDAKVLADYMVKNKGFLPSGPSEIHAQ-RNHLKDIIFSLLPACQDDDPESKIPFKA 267
             PLGGD + L D  V+N  F P    E  ++ R+ + D++ +   A Q+ + +       
Sbjct: 1215 QPLGGDPRELMDLCVRNGIFTPKFIKEALSRFRSFIDDLLLTCWKAVQNSNTQ------- 1274

Query: 268  DAIIANPPAYGHTQVAEALKLPLHVFFTMPWTPTSDFPHPLA-HVKHQMG--YRLSYNIV 327
              +IA P  +    + E L++P    FTMP+T T  +P+P A    HQMG  + L+ +++
Sbjct: 1275 -VLIATPGCFAGPHIGEVLQIPFFNAFTMPFTRTRTYPNPFAPFASHQMGGVFNLATHVM 1334

Query: 328  -DTLIWLGIRDIINSFRKKKLKLRRISYLSGHYSSLPEVPYGYIWSPHLIPKPKDWGSKI 387
             + ++W  I   IN +R + LK+   +  S   +    +PY Y +S +L+PKP DW  +I
Sbjct: 1335 MEKVLWQPISGQINQWRTETLKIPPWNS-SVSINETYRMPYLYCFSKYLVPKPPDWSGEI 1394

Query: 388  DVVGFCFL-DLASNYQPPNSLVEWI------EAGERPIYIGFGSLPVEQPQEMTQIIVEA 447
             + G+  L + A++  PP+ L++++      E  + PIYIGFGS+ ++ P  ++ +++EA
Sbjct: 1395 AITGYWTLKNQANSDSPPDDLIQFLNEESSTENDDIPIYIGFGSIVIDNPTALSLLLIEA 1454

Query: 448  LEITGRRGIINKGWGGLG--------------SLAEPKDF---------VYVLDN-CPHD 507
            ++++G+R II++GWGGL               +  E  D          +Y+L     H 
Sbjct: 1455 IKLSGKRAIISQGWGGLSIDEHNNNNNNNNNNNNGENSDSNKSSLQSNRIYLLKKPVDHS 1514

Query: 508  WLFPRCMAVVHHGGAGTTAAGLKAACPTTIVPIFGDQQFWGERVHARGLGPPPIPIAEFS 567
            WLF +   V+ HGGAGT AA L AA PT +VP FGDQ FWGER+   G+G   IP    +
Sbjct: 1515 WLFEKVSLVISHGGAGTVAASLLAAKPTIVVPFFGDQFFWGERIKQTGIG-TSIPFDILT 1574

Query: 568  LEKLID-AINFMLDPKVKERTLEVSKAIESEDGVGGAVNAFHKH 572
             + L    I+ + +P V+ +  ++S  ++ EDGV  A++  H++
Sbjct: 1575 AKSLSSHIISILNEPSVRAKVNKMSHLLKREDGVKTAIDFIHRY 1608

BLAST of Cucsa.356970 vs. Swiss-Prot
Match: ATG26_CRYNB (Sterol 3-beta-glucosyltransferase OS=Cryptococcus neoformans var. neoformans serotype D (strain B-3501A) GN=ATG26 PE=3 SV=1)

HSP 1 Score: 255.8 bits (652), Expect = 1.2e-66
Identity = 165/466 (35.41%), Postives = 241/466 (51.72%), Query Frame = 1

Query: 146  TANVPFIP-PLQIVMLIVGTRGDVQPFVSIGKRLQEHGHRVRLATHANFKDFVLSTGLEF 205
            T+ + F P P++I  L +G+RGDVQP++++ K LQ  GH  ++ATH  +K +V   G+ F
Sbjct: 1024 TSFLEFKPEPMKITCLTIGSRGDVQPYIALCKGLQAEGHITKIATHGEYKAWVEGHGIAF 1083

Query: 206  FPLGGDAKVLADYMVKNKGFLPSGPSE-IHAQRNHLKDIIFSLLPACQDDDPESKIPFKA 265
              +GGD   L    V N  F  S   E +   R  L D++ S   ACQ           +
Sbjct: 1084 ESVGGDPAELMQMCVDNGMFTVSFLKEGLQKFRGWLDDLLNSSWEACQG----------S 1143

Query: 266  DAIIANPPAYGHTQVAEALKLPLHVFFTMPWTPTSDFPHPLAHVKHQMG--YR-LSYNIV 325
            D +I +P A     VAEAL++P +  FTMPWT T  +PH  A  +H  G  Y  ++Y + 
Sbjct: 1144 DLLIESPSAMSGIHVAEALRIPYYRAFTMPWTRTRAYPHAFAVPEHGRGGPYNYMTYTMF 1203

Query: 326  DTLIWLGIRDIINSFRKKKLKLRRISY--LSGHYSSLPEVPYGYIWSPHLIPKPKDWGSK 385
            D + W  I   +N +R+  L L   ++  +  H     +VP+ Y +SP ++P P DW   
Sbjct: 1204 DQVFWRAISGQVNRWRRNVLGLDATTFDKMEQH-----KVPFLYNFSPTVVPPPLDWTEW 1263

Query: 386  IDVVGFCFLDLASNYQ------PPNSLVEWIEAG----ERPIYIGFGSLPVEQPQEMTQI 445
            I V G+ FLD A   Q      PP  LV++I+      ++ +YIGFGS+ V  P+EMT+ 
Sbjct: 1264 IHVTGYWFLDKADEKQGEKSWTPPQGLVDFIDKAHGEEKKVVYIGFGSIVVSDPEEMTRC 1323

Query: 446  IVEALEITGRRGIINKGWGGLGSL-AEPKDF------------VYVLDNCPHDWLFPRCM 505
            +VEA+  +G   I++KGW   GS   EPK              ++ +D+  H WLFPR  
Sbjct: 1324 VVEAVVNSGVCAILSKGWSDRGSKKGEPKGDSEGADGVKYPPEIFAIDSIDHGWLFPRID 1383

Query: 506  AVVHHGGAGTTAAGLKAACPTTIVPIFGDQQFWGERVHARGLGPPPIPIAEFSLEKLIDA 565
            A  HHGGAGTT A L+A  PT I P FGDQ FW ERV +  +G     I   +  +L  A
Sbjct: 1384 AACHHGGAGTTGASLRAGIPTIIKPFFGDQAFWAERVESLNVGS---SIRRLTSHQLASA 1443

Query: 566  -INFMLDPKVKERTLEVSKAIESEDGVGGAVNAFHKHFHRNRTLAK 581
             I    D K   +   V + I  E+G+  A+ A ++     +++ K
Sbjct: 1444 LIKATTDEKQISKARVVGEMIRKENGITRAIEAIYRDLEYAKSIIK 1471

BLAST of Cucsa.356970 vs. TrEMBL
Match: B9RFT9_RICCO (Transferase, transferring glycosyl groups, putative OS=Ricinus communis GN=RCOM_1437250 PE=4 SV=1)

HSP 1 Score: 816.2 bits (2107), Expect = 2.6e-233
Identity = 392/579 (67.70%), Postives = 469/579 (81.00%), Query Frame = 1

Query: 7   HHSPHSSSGRSSDFSVSMDSDGDGDSDGDRVVPSSGNTDRNSSGDSTQDGSSVGRELVSC 66
           HH+P SSSG S D  V + +  D  S+      SSG T  +SS  S    SS   +  S 
Sbjct: 8   HHNPQSSSGDSVDLDVEIVAGNDIASN------SSGFTGSSSSKHSMSIESSATSKSDSG 67

Query: 67  STKPTKLRKSRQSHALHHLLPNIFDEKVSSRKKLRWLKRVATVKHDGTVQMEVLEGIQPE 126
           S++P K  K  +S+ L  L   +FD++V  RKKL+   R A +K DGT+Q+EV E I+P+
Sbjct: 68  SSQPRKFEKHGESYKLRILATKLFDDRVPFRKKLKLFHRFANLKDDGTIQLEVPEDIKPQ 127

Query: 127 NLHFETG-VDDEAVDDEPLDTANVPFIPPLQIVMLIVGTRGDVQPFVSIGKRLQEHGHRV 186
           +L    G V  E +D+EP DTA +  IPPLQIV+LIVGTRGDVQPF++IGKRLQE GHRV
Sbjct: 128 SLDIIPGAVHTECIDEEPFDTAELRDIPPLQIVILIVGTRGDVQPFIAIGKRLQEDGHRV 187

Query: 187 RLATHANFKDFVLSTGLEFFPLGGDAKVLADYMVKNKGFLPSGPSEIHAQRNHLKDIIFS 246
           RLATH+NFKDFVL+ GLEFFPLGGD KVLA YMVKNKGFLPS PSEI  QR  ++DI+FS
Sbjct: 188 RLATHSNFKDFVLTAGLEFFPLGGDPKVLAGYMVKNKGFLPSVPSEIPTQRQQIRDIVFS 247

Query: 247 LLPACQDDDPESKIPFKADAIIANPPAYGHTQVAEALKLPLHVFFTMPWTPTSDFPHPLA 306
           LLPAC+D DP++ +PFK DAIIANPPAYGHT VAEALK+P+H+FFTMPWTPTS+FPHPL+
Sbjct: 248 LLPACKDPDPDTNVPFKVDAIIANPPAYGHTHVAEALKVPIHIFFTMPWTPTSEFPHPLS 307

Query: 307 HVKHQMGYRLSYNIVDTLIWLGIRDIINSFRKKKLKLRRISYLSGHYSSLPEVPYGYIWS 366
            VK  + Y+LSY IVD++IWLGIRDI+N FRKKKL+LR ++YLSG+YSS P++PYGYIWS
Sbjct: 308 RVKQPIAYKLSYQIVDSMIWLGIRDIVNEFRKKKLQLRPVTYLSGNYSSPPDLPYGYIWS 367

Query: 367 PHLIPKPKDWGSKIDVVGFCFLDLASNYQPPNSLVEWIEAGERPIYIGFGSLPVEQPQEM 426
           PHL+PKPKDWG KIDVVGFCFL+LASNY+PP+ LV+W+E G+ PIYIGFGSLP+++P++M
Sbjct: 368 PHLVPKPKDWGPKIDVVGFCFLNLASNYEPPDLLVKWLEGGDPPIYIGFGSLPLQEPEKM 427

Query: 427 TQIIVEALEITGRRGIINKGWGGLGSLAEPKDFVYVLDNCPHDWLFPRCMAVVHHGGAGT 486
           TQIIV ALEITG+RGIINKGWGGLG LAEPKDFVY+LDNCPHDWLF RC AVVHHGGAGT
Sbjct: 428 TQIIVRALEITGQRGIINKGWGGLGDLAEPKDFVYILDNCPHDWLFSRCKAVVHHGGAGT 487

Query: 487 TAAGLKAACPTTIVPIFGDQQFWGERVHARGLGPPPIPIAEFSLEKLIDAINFMLDPKVK 546
           TAAGLKAACPTTI+P FGDQ FWGE+VHARGLGP PIP+ EFSL+KL+ AI FMLDPKVK
Sbjct: 488 TAAGLKAACPTTIIPFFGDQPFWGEQVHARGLGPAPIPVEEFSLDKLVGAIRFMLDPKVK 547

Query: 547 ERTLEVSKAIESEDGVGGAVNAFHKHFHRNRTLAKPEAP 585
           E  +E+SKA+E EDGV GAVNAF+KHF   R  ++P +P
Sbjct: 548 ELAVELSKAMEEEDGVKGAVNAFYKHFPGKRLESEPWSP 580

BLAST of Cucsa.356970 vs. TrEMBL
Match: M5WHU1_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa019814mg PE=4 SV=1)

HSP 1 Score: 795.8 bits (2054), Expect = 3.6e-227
Identity = 387/545 (71.01%), Postives = 455/545 (83.49%), Query Frame = 1

Query: 57  SSVGRELVSCSTKPTKLRKSRQSH--ALHHLLPNIFDEKVSSRKKLRWLKRVATVKHDGT 116
           S+VG    S S+       +R +H  AL  +L  +FD +   RKK +++ R+A V+ DGT
Sbjct: 57  SAVGTSSHSSSSSSQLTASARLTHNPALAAILAKLFDHRTPFRKKRKYINRLARVQDDGT 116

Query: 117 VQMEVLEGIQPENLHFETGV-DDEAVDDEPLD---TANVPFIPPLQIVMLIVGTRGDVQP 176
           VQ +V   I+P+ L F TGV   E  D+ P      A V  I PLQIVMLIVGTRGDVQP
Sbjct: 117 VQFDVPGDIKPQQLDFGTGVVHGEPCDEIPSSGETEAEVLDIRPLQIVMLIVGTRGDVQP 176

Query: 177 FVSIGKRLQEHGHRVRLATHANFKDFVLSTGLEFFPLGGDAKVLADYMVKNKGFLPSGPS 236
           FV+IGK LQE+GHRVRLATHANFKDFVL+ GLEFFPLGGD KVLA YMVKNKGFLPSGPS
Sbjct: 177 FVAIGKSLQEYGHRVRLATHANFKDFVLTAGLEFFPLGGDPKVLAGYMVKNKGFLPSGPS 236

Query: 237 EIHAQRNHLKDIIFSLLPACQDDDPESKIPFKADAIIANPPAYGHTQVAEALKLPLHVFF 296
           EI  QRN +K+IIFSLLPAC++ DP+S++PFKADAIIANPPAYGH+ VAEALK+PLH+FF
Sbjct: 237 EISIQRNQIKEIIFSLLPACKEPDPDSEVPFKADAIIANPPAYGHSDVAEALKVPLHIFF 296

Query: 297 TMPWTPTSDFPHPLAHVKHQMGYRLSYNIVDTLIWLGIRDIINSFRKKKLKLRRISYLSG 356
           TMPWTPTS+FPHPL+ VK  +GYRLSY IVD LIWLGIRD+IN FRKK LKLR I+YLSG
Sbjct: 297 TMPWTPTSEFPHPLSRVKQPIGYRLSYQIVDALIWLGIRDMINEFRKKMLKLRPITYLSG 356

Query: 357 HYSSLPEVPYGYIWSPHLIPKPKDWGSKIDVVGFCFLDLASNYQPPNSLVEWIEAGERPI 416
           +YSS P+VPYGYIWSPHL+PKPKDWG K+DVVGFCFLDLASNY+PP+SLVEW+EAGE+P+
Sbjct: 357 YYSSPPDVPYGYIWSPHLVPKPKDWGPKVDVVGFCFLDLASNYEPPDSLVEWLEAGEQPV 416

Query: 417 YIGFGSLPVEQPQEMTQIIVEALEITGRRGIINKGWGGLGSLAEPKDFVYVLDNCPHDWL 476
           YIGFGSLP+E+P++MT II++ALEITG+RGIIN+GWGGLG+LAEP D VY++DNCPHDWL
Sbjct: 417 YIGFGSLPLEEPEKMTNIILQALEITGQRGIINRGWGGLGNLAEPSDSVYLVDNCPHDWL 476

Query: 477 FPRCMAVVHHGGAGTTAAGLKAACPTTIVPIFGDQQFWGERVHARGLGPPPIPIAEFSLE 536
           F RC AVVHHGGAGTTAAGLKAACPTTIVP FGDQ FWGERVHARG+GP PIP  EFSLE
Sbjct: 477 FQRCSAVVHHGGAGTTAAGLKAACPTTIVPFFGDQPFWGERVHARGVGPAPIPADEFSLE 536

Query: 537 KLIDAINFMLDPKVKERTLEVSKAIESEDGVGGAVNAFHKHFHRNRTLAKPEA--PKRG- 593
           KL+DAI+FMLDPKVKER +E++KA++ EDGV GAVNAFH+HF  N++  KPE+   +RG 
Sbjct: 537 KLVDAIHFMLDPKVKERAVEIAKAMDGEDGVTGAVNAFHRHFPHNKSEDKPESLPARRGL 596

BLAST of Cucsa.356970 vs. TrEMBL
Match: M5VNV0_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002976mg PE=4 SV=1)

HSP 1 Score: 791.2 bits (2042), Expect = 8.9e-226
Identity = 393/601 (65.39%), Postives = 473/601 (78.70%), Query Frame = 1

Query: 12  SSSGRSSDFSVSMDSDGDGDSDGDRVVPSS-------GNTDRNSSGDSTQDGSSVGRELV 71
           SSSG S +  V ++ +  G S G+ V   S       G++    S +S   G+++  E+ 
Sbjct: 10  SSSGISGEGPVRIEEETGGGSSGNSVELESTVARGHGGSSSTGISSESLSKGNTLPVEIS 69

Query: 72  SCSTK------PTKLRKSRQSHALH-HLLPN----IFDEKVSSRKKLRWLKRVATVKHDG 131
           S + K      P KL +S+     H ++LP     IF++K+S  +KL+ L R+ATVK DG
Sbjct: 70  SKTEKVESNSGPPKLERSKTDSTRHQNILPKDAARIFNDKISVHQKLKLLNRIATVKDDG 129

Query: 132 TVQMEVLEGIQPENLHFE-TGVDDEAVDDEPLDTANVPFIPPLQIVMLIVGTRGDVQPFV 191
           TV+ EV   ++P++L         EA DDEPLD AN+ +IPP+QIVMLIVGTRGDVQPF+
Sbjct: 130 TVEFEVPGDVEPQSLGGGYKAAPTEAADDEPLDEANLEYIPPMQIVMLIVGTRGDVQPFI 189

Query: 192 SIGKRLQEHGHRVRLATHANFKDFVLSTGLEFFPLGGDAKVLADYMVKNKGFLPSGPSEI 251
           +IGKRLQ++GHRVRLATH+NFK+FVL+ GLEF+PLGGD KVLA YMVKNKGFLPSGPSEI
Sbjct: 190 AIGKRLQDYGHRVRLATHSNFKEFVLTAGLEFYPLGGDPKVLAGYMVKNKGFLPSGPSEI 249

Query: 252 HAQRNHLKDIIFSLLPACQDDDPESKIPFKADAIIANPPAYGHTQVAEALKLPLHVFFTM 311
             QRN +K+II+SLLPAC++ D +S IPFKADAIIANPPAYGHT VAEALK+PLH+FFTM
Sbjct: 250 PIQRNQIKEIIYSLLPACKEPDMDSGIPFKADAIIANPPAYGHTHVAEALKIPLHIFFTM 309

Query: 312 PWTPTSDFPHPLAHVKHQMGYRLSYNIVDTLIWLGIRDIINSFRKKKLKLRRISYLSGHY 371
           PWTPTS+FPHPL+ VK   GYRLSY IVD+LIWLGIRD+IN  RKKKLKLR ++YLSG  
Sbjct: 310 PWTPTSEFPHPLSRVKQSTGYRLSYQIVDSLIWLGIRDMINDVRKKKLKLRPVTYLSGSQ 369

Query: 372 SSLPEVPYGYIWSPHLIPKPKDWGSKIDVVGFCFLDLASNYQPPNSLVEWIEAGERPIYI 431
            S  +VP+GYIWSPHL+PKPKDWG K+DVVGFCFLDLASNY+PP  LV+W+EAG+RPIYI
Sbjct: 370 GSDSDVPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDLASNYEPPELLVKWLEAGDRPIYI 429

Query: 432 GFGSLPVEQPQEMTQIIVEALEITGRRGIINKGWGGLGSLAEPKDFVYVLDNCPHDWLFP 491
           GFGSLPV++P++MTQIIVEALE TG+RGIINKGWGGLG+LAEPKDF+Y+LDNCPHDWLF 
Sbjct: 430 GFGSLPVQEPEKMTQIIVEALEKTGQRGIINKGWGGLGNLAEPKDFIYLLDNCPHDWLFL 489

Query: 492 RCMAVVHHGGAGTTAAGLKAACPTTIVPIFGDQQFWGERVHARGLGPPPIPIAEFSLEKL 551
           +C AVVHHGGAGTTAAGLKAACPTTIVP FGDQ FWGERVHARG+GP PI + EFSL KL
Sbjct: 490 QCKAVVHHGGAGTTAAGLKAACPTTIVPFFGDQPFWGERVHARGVGPAPIAVDEFSLPKL 549

Query: 552 IDAINFMLDPKVKERTLEVSKAIESEDGVGGAVNAFHKHFHRNRTLAKPE-APKRGFSVR 593
           +DAI FMLDPKVKER +E++K +E+EDGV GAV AF KH    +   +PE  P   FSV 
Sbjct: 550 VDAIKFMLDPKVKERAVELAKDMENEDGVTGAVKAFFKHLPCRKPDPEPEPGPSSLFSVS 609

BLAST of Cucsa.356970 vs. TrEMBL
Match: V4SS72_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10011454mg PE=4 SV=1)

HSP 1 Score: 786.9 bits (2031), Expect = 1.7e-224
Identity = 374/499 (74.95%), Postives = 432/499 (86.57%), Query Frame = 1

Query: 99  KLRWLKRVATVKHDGTVQMEVLEGIQPENLHFETGV--DDEAVDDEPLDTANVPFIPPLQ 158
           +L+WL R+ATVK DGTVQ EV   I+P+NL F TGV   D++ D EP++ A+V  IPPL 
Sbjct: 31  QLKWLNRLATVKDDGTVQFEVPADIKPQNLDFGTGVVYTDDSTDQEPIEAADVHGIPPLH 90

Query: 159 IVMLIVGTRGDVQPFVSIGKRLQEHGHRVRLATHANFKDFVLSTGLEFFPLGGDAKVLAD 218
           IVMLIVGTRGDVQPFV+IGKRLQE GHRVRLATHANFKDFVL  GLEFFPLGGD K+LA 
Sbjct: 91  IVMLIVGTRGDVQPFVAIGKRLQEDGHRVRLATHANFKDFVLGAGLEFFPLGGDPKILAG 150

Query: 219 YMVKNKGFLPSGPSEIHAQRNHLKDIIFSLLPACQDDDPESKIPFKADAIIANPPAYGHT 278
           YMVKNKGFLPSGPSEI  QRN LK+II+SLLPAC+D DP++ +PFK DAIIANPPAYGHT
Sbjct: 151 YMVKNKGFLPSGPSEIPIQRNQLKEIIYSLLPACKDPDPDTMVPFKPDAIIANPPAYGHT 210

Query: 279 QVAEALKLPLHVFFTMPWTPTSDFPHPLAHVKHQMGYRLSYNIVDTLIWLGIRDIINSFR 338
            VAE+LK+PLH+ FTMPWTPTS+FPHPL+ VK  + YRLSY IVD LIWLGIRD+IN FR
Sbjct: 211 HVAESLKVPLHIIFTMPWTPTSEFPHPLSRVKQPVAYRLSYQIVDALIWLGIRDMINDFR 270

Query: 339 KKKLKLRRISYLSGHYSSLPEVPYGYIWSPHLIPKPKDWGSKIDVVGFCFLDLASNYQPP 398
           KK+L LRR++YLSG YSS  +VPY YIWSPHL+PKPKDWG KIDVVGFCFLDLAS Y+PP
Sbjct: 271 KKRLNLRRVTYLSGSYSSPLDVPYAYIWSPHLVPKPKDWGPKIDVVGFCFLDLASTYEPP 330

Query: 399 NSLVEWIEAGERPIYIGFGSLPVEQPQEMTQIIVEALEITGRRGIINKGWGGLGSLAEPK 458
           +SLV+W+E GE+PIYIGFGSLPVE+P++MT+IIV+ALEITG RGIINKGWGGLG+LAE K
Sbjct: 331 DSLVKWLEDGEKPIYIGFGSLPVEEPEKMTEIIVKALEITGHRGIINKGWGGLGNLAESK 390

Query: 459 DFVYVLDNCPHDWLFPRCMAVVHHGGAGTTAAGLKAACPTTIVPIFGDQQFWGERVHARG 518
           DF+Y+LDNCPHDWLF RC+AVVHHGGAGTTAAGLKAACPTTIVP FGDQ FWGERVHARG
Sbjct: 391 DFLYLLDNCPHDWLFSRCLAVVHHGGAGTTAAGLKAACPTTIVPFFGDQPFWGERVHARG 450

Query: 519 LGPPPIPIAEFSLEKLIDAINFMLDPKVKERTLEVSKAIESEDGVGGAVNAFHKHFHRNR 578
           LGP PIP+ EFSL+KL+DAI FML PKVKER +E++KA+E+EDGV GAV AF+KHF   +
Sbjct: 451 LGPAPIPVDEFSLDKLVDAIRFMLHPKVKERAVELAKAMENEDGVTGAVKAFYKHFPGKK 510

Query: 579 TLAKPEAP--KRG-FSVRR 593
           + ++PE P   RG  S+RR
Sbjct: 511 SESEPELPHSHRGLLSIRR 529

BLAST of Cucsa.356970 vs. TrEMBL
Match: A0A059D2A3_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_B01378 PE=4 SV=1)

HSP 1 Score: 786.2 bits (2029), Expect = 2.8e-224
Identity = 392/579 (67.70%), Postives = 457/579 (78.93%), Query Frame = 1

Query: 14  SGRSSDFSVSMDSDGDGD------SDGDRVVPSSGNTDRNSSGDSTQDGSSVGRELVSCS 73
           S   S+ SV +  +GDGD       DGD    S+G+TD +S  D+    +S   +  S S
Sbjct: 13  SSFGSENSVDV-GNGDGDVTQANGHDGDG--HSTGSTDTSSPIDNMPGDASTSGKSESDS 72

Query: 74  TKPTKLRKSRQSHALHHLLPNIFDEKVSSRKKLRWLKRVATVKHDGTVQMEVLEGIQPEN 133
              + + + RQ+H L  L    FDEKV  +KKL+WLKR ATV  DGTVQ EV   + P++
Sbjct: 73  AHLSTIGRWRQNHKLGLLTAKFFDEKVPLKKKLKWLKRAATVDSDGTVQFEVPGDVAPQS 132

Query: 134 LHFE----TGVDDEAVDDEPLDTANVPFIPPLQIVMLIVGTRGDVQPFVSIGKRLQEHGH 193
           L+       G  +E+V  + LD      +PPLQIVMLIVGTRGDVQPFV+IGKRLQE GH
Sbjct: 133 LNLTGVAYNGATEESVPGDVLD------LPPLQIVMLIVGTRGDVQPFVAIGKRLQEGGH 192

Query: 194 RVRLATHANFKDFVLSTGLEFFPLGGDAKVLADYMVKNKGFLPSGPSEIHAQRNHLKDII 253
           RVRLATH NFK+FVL+ GLEF+PLGGD KVLA YMVKNKGFLPSGPSEIH QRN +KDII
Sbjct: 193 RVRLATHMNFKEFVLTAGLEFYPLGGDPKVLAGYMVKNKGFLPSGPSEIHIQRNQIKDII 252

Query: 254 FSLLPACQDDDPESKIPFKADAIIANPPAYGHTQVAEALKLPLHVFFTMPWTPTSDFPHP 313
           FSLLPAC++ DPE+ +PF A+AIIANPPAYGHT VAEAL LPLHV FTMPWTPTS+FPHP
Sbjct: 253 FSLLPACREPDPETHVPFNANAIIANPPAYGHTHVAEALNLPLHVVFTMPWTPTSEFPHP 312

Query: 314 LAHVKHQMGYRLSYNIVDTLIWLGIRDIINSFRKKKLKLRRISYLSGHYSSLPEVPYGYI 373
           L+ VK  +GYRLSY IVD LIWLGIRD+IN FRKKKL LR I+YL G YSS P+VPY YI
Sbjct: 313 LSRVKQPVGYRLSYQIVDALIWLGIRDVINEFRKKKLGLRPITYLRGPYSSPPDVPYAYI 372

Query: 374 WSPHLIPKPKDWGSKIDVVGFCFLDLASNYQPPNSLVEWIEAGERPIYIGFGSLPVEQPQ 433
           WSPHL+PKPKDWG KIDVVGFCFLDLAS+YQPP SLV+W+E GE+PIYIGFGSLPV++P+
Sbjct: 373 WSPHLVPKPKDWGPKIDVVGFCFLDLASSYQPPESLVKWLEEGEKPIYIGFGSLPVQEPE 432

Query: 434 EMTQIIVEALEITGRRGIINKGWGGLGSLAEPKDFVYVLDNCPHDWLFPRCMAVVHHGGA 493
           +MT+IIV+ALEIT +RGIINKGWGGLG+ AE KDFVY+LDNCPHDWLF RC AVVHHGGA
Sbjct: 433 KMTEIIVKALEITCQRGIINKGWGGLGNSAEKKDFVYLLDNCPHDWLFLRCSAVVHHGGA 492

Query: 494 GTTAAGLKAACPTTIVPIFGDQQFWGERVHARGLGPPPIPIAEFSLEKLIDAINFMLDPK 553
           GTTAAGLKAACPTT+VP FGDQ FWGERVHARG+GP PIP+ EFSLEKL+DAI FMLDPK
Sbjct: 493 GTTAAGLKAACPTTVVPFFGDQPFWGERVHARGVGPVPIPVDEFSLEKLVDAIRFMLDPK 552

Query: 554 VKERTLEVSKAIESEDGVGGAVNAFHKHFHRNRTLAKPE 583
           VK+   E++K +E EDGV GAV AF+KHF R +  + P+
Sbjct: 553 VKQCAEELAKDMEHEDGVEGAVKAFYKHFPREKLESDPD 582

BLAST of Cucsa.356970 vs. TAIR10
Match: AT3G07020.2 (AT3G07020.2 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 755.7 bits (1950), Expect = 2.1e-218
Identity = 374/588 (63.61%), Postives = 453/588 (77.04%), Query Frame = 1

Query: 5   EIHHSPHSSSGRSSDFSVSMDSDGDGDSDGDRV--VPSSGNTDRNSSGDSTQDGSSVGRE 64
           E+  +P +     +D +V+ +S G G+    RV  +P  G+    SS D  +  S+    
Sbjct: 49  ELETNPKTVVASIADETVA-ESSGTGNKSFSRVWTMPLEGS----SSSDKAESSSTNQPR 108

Query: 65  LVSCSTKPTKLRKSRQSHALHHLLPNIFDEKVSSRKKLRWLKRVATVKHDGTVQMEVLEG 124
           L    T+    R+ + +H L      IFD+K+S+ KKL+ L R+ATVKHDGTV+ EV   
Sbjct: 109 LDKSKTE----RQQKVTHILAEDAAKIFDDKISAGKKLKLLNRIATVKHDGTVEFEVPAD 168

Query: 125 IQPENLHFETGVDDEAV-DDEPLDTANVPFIPPLQIVMLIVGTRGDVQPFVSIGKRLQEH 184
             P+ +  + G     V  DE +D  ++ +IPP+QIVMLIVGTRGDVQPFV+I KRLQ++
Sbjct: 169 AIPQPIVVDRGESKNGVCADESIDGVDLQYIPPMQIVMLIVGTRGDVQPFVAIAKRLQDY 228

Query: 185 GHRVRLATHANFKDFVLSTGLEFFPLGGDAKVLADYMVKNKGFLPSGPSEIHAQRNHLKD 244
           GHRVRLATHANFK+FVL+ GLEF+PLGGD KVLA YMVKNKGFLPSGPSEI  QRN +KD
Sbjct: 229 GHRVRLATHANFKEFVLTAGLEFYPLGGDPKVLAGYMVKNKGFLPSGPSEIPIQRNQMKD 288

Query: 245 IIFSLLPACQDDDPESKIPFKADAIIANPPAYGHTQVAEALKLPLHVFFTMPWTPTSDFP 304
           II+SLLPAC++ DP+S I FKADAIIANPPAYGHT VAEALK+P+HVFFTMPWTPTS+FP
Sbjct: 289 IIYSLLPACKEPDPDSGISFKADAIIANPPAYGHTHVAEALKIPIHVFFTMPWTPTSEFP 348

Query: 305 HPLAHVKHQMGYRLSYNIVDTLIWLGIRDIINSFRKKKLKLRRISYLSGHYSSLPEVPYG 364
           HPL+ VK   GYRLSY IVD+LIWLGIRD++N  RKKKLKLR ++YLSG   S   +P+G
Sbjct: 349 HPLSRVKQPAGYRLSYQIVDSLIWLGIRDMVNDLRKKKLKLRPVTYLSGTQGSGSNIPHG 408

Query: 365 YIWSPHLIPKPKDWGSKIDVVGFCFLDLASNYQPPNSLVEWIEAGERPIYIGFGSLPVEQ 424
           Y+WSPHL+PKPKDWG +IDVVGFC+LDLASNY+PP  LVEW+EAG++PIYIGFGSLPV++
Sbjct: 409 YMWSPHLVPKPKDWGPQIDVVGFCYLDLASNYEPPAELVEWLEAGDKPIYIGFGSLPVQE 468

Query: 425 PQEMTQIIVEALEITGRRGIINKGWGGLGSLAEPKDFVYVLDNCPHDWLFPRCMAVVHHG 484
           P++MT+IIVEAL+ T +RGIINKGWGGLG+L EPKDFVY+LDN PHDWLFPRC AVVHHG
Sbjct: 469 PEKMTEIIVEALQRTKQRGIINKGWGGLGNLKEPKDFVYLLDNVPHDWLFPRCKAVVHHG 528

Query: 485 GAGTTAAGLKAACPTTIVPIFGDQQFWGERVHARGLGPPPIPIAEFSLEKLIDAINFMLD 544
           GAGTTAAGLKA+CPTTIVP FGDQ FWGERVHARG+GP PIP+ EFSL KL DAINFMLD
Sbjct: 529 GAGTTAAGLKASCPTTIVPFFGDQPFWGERVHARGVGPSPIPVDEFSLHKLEDAINFMLD 588

Query: 545 PKVKERTLEVSKAIESEDGVGGAVNAFHKHF-HRNRTLAKPEAPKRGF 589
            KVK     ++KA++ EDGV GAV AF KH     + ++ P     GF
Sbjct: 589 DKVKSSAETLAKAMKDEDGVAGAVKAFFKHLPSAKQNISDPIPEPSGF 627

BLAST of Cucsa.356970 vs. TAIR10
Match: AT1G43620.1 (AT1G43620.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 581.6 bits (1498), Expect = 5.4e-166
Identity = 291/581 (50.09%), Postives = 394/581 (67.81%), Query Frame = 1

Query: 27  DGDGDSDGDRVVPSSGNTDRN--SSGDSTQDGSSVGRELVSCSTKP-----------TKL 86
           D    S+   ++ +SG+ D     SG  + DG    R L  C T P           +++
Sbjct: 17  DNGVKSEKASLLETSGSVDTTPEDSGHRSSDGH---RGLDHCETAPVGLYGDMLINDSEI 76

Query: 87  RKSRQ-----SHALHHLLPNIFDEKVSSRKKLRWLKRVATVKHDGTVQMEVLEGIQP--E 146
           + SR      S A+H    N+  +++S ++K + +  +  +++DGTV  EV++   P  E
Sbjct: 77  QYSRSLTEKGSPAIH----NLKLDRLSEQEKQKLIVELVRIQNDGTV--EVIDNGTPVSE 136

Query: 147 NLHFETGVDDEAVDDEPLDTANVPFIPPLQIVMLIVGTRGDVQPFVSIGKRLQEHGHRVR 206
              FE       +  E   T +   IP L+I +L+VGTRGDVQPF+++ KRLQE GHRVR
Sbjct: 137 LWEFEPTKGQSTITYEKSLTESFRSIPRLKIAILVVGTRGDVQPFLAMAKRLQEFGHRVR 196

Query: 207 LATHANFKDFVLSTGLEFFPLGGDAKVLADYMVKNKGFLPSGPSEIHAQRNHLKDIIFSL 266
           LATHANF+ FV + G+EF+PLGGD + LA YM +NKG +PSGPSEI  QR  LK II SL
Sbjct: 197 LATHANFRSFVRAAGVEFYPLGGDPRELAAYMARNKGLIPSGPSEISKQRKQLKAIIESL 256

Query: 267 LPACQDDDPESKIPFKADAIIANPPAYGHTQVAEALKLPLHVFFTMPWTPTSDFPHPLAH 326
           LPAC + D E+   F+A AIIANPPAYGH  VAEAL +P+H+FFTMPWTPT++FPHPLA 
Sbjct: 257 LPACIEPDLETATSFRAQAIIANPPAYGHVHVAEALGVPIHIFFTMPWTPTNEFPHPLAR 316

Query: 327 VKHQMGYRLSYNIVDTLIWLGIRDIINSFRKKKLKLRRISYLSGHYSSLPEVPYGYIWSP 386
           V     Y LSY +VD ++W  IR  IN FRK+KL L  I+Y S ++ S+  +P GY+WSP
Sbjct: 317 VPQSAAYWLSYIVVDLMVWWSIRTYINDFRKRKLNLAPIAYFSTYHGSISHLPTGYMWSP 376

Query: 387 HLIPKPKDWGSKIDVVGFCFLDLASNYQPPNSLVEWIEAGERPIYIGFGSLPVEQPQEMT 446
           H++PKP DWG  +DVVG+CFL+L S YQP    + WIE G  P+YIGFGS+P++ P++  
Sbjct: 377 HVVPKPSDWGPLVDVVGYCFLNLGSKYQPREEFLHWIERGSPPVYIGFGSMPLDDPKQTM 436

Query: 447 QIIVEALEITGRRGIINKGWGGLGSLA-EPKDFVYVLDNCPHDWLFPRCMAVVHHGGAGT 506
            II+E L+ T +RGI+++GWGGLG+LA E  + V+++++CPHDWLFP+C AVVHHGGAGT
Sbjct: 437 DIILETLKDTEQRGIVDRGWGGLGNLATEVPENVFLVEDCPHDWLFPQCSAVVHHGGAGT 496

Query: 507 TAAGLKAACPTTIVPIFGDQQFWGERVHARGLGPPPIPIAEFSLEKLIDAINFMLDPKVK 566
           TA GLKA CPTTIVP FGDQ FWG+R++ +GLGP PIPIA+ S+E L  +I FML P+VK
Sbjct: 497 TATGLKAGCPTTIVPFFGDQFFWGDRIYEKGLGPAPIPIAQLSVENLSSSIRFMLQPEVK 556

Query: 567 ERTLEVSKAIESEDGVGGAVNAFHKHFHRNRTLAKPEAPKR 587
            + +E++K +E+EDGV  AV+AFH+H      L +  + K+
Sbjct: 557 SQVMELAKVLENEDGVAAAVDAFHRHLPPELPLPESSSEKK 588

BLAST of Cucsa.356970 vs. NCBI nr
Match: gi|778664181|ref|XP_011660238.1| (PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2-like isoform X1 [Cucumis sativus])

HSP 1 Score: 1226.5 bits (3172), Expect = 0.0e+00
Identity = 596/597 (99.83%), Postives = 596/597 (99.83%), Query Frame = 1

Query: 1   MAHLEIHHSPHSSSGRSSDFSVSMDSDGDGDSDGDRVVPSSGNTDRNSSGDSTQDGSSVG 60
           MAHLEIHHSPHSSSGRSSDFSVSMDSDGD DSDGDRVVPSSGNTDRNSSGDSTQDGSSVG
Sbjct: 1   MAHLEIHHSPHSSSGRSSDFSVSMDSDGDDDSDGDRVVPSSGNTDRNSSGDSTQDGSSVG 60

Query: 61  RELVSCSTKPTKLRKSRQSHALHHLLPNIFDEKVSSRKKLRWLKRVATVKHDGTVQMEVL 120
           RELVSCSTKPTKLRKSRQSHALHHLLPNIFDEKVSSRKKLRWLKRVATVKHDGTVQMEVL
Sbjct: 61  RELVSCSTKPTKLRKSRQSHALHHLLPNIFDEKVSSRKKLRWLKRVATVKHDGTVQMEVL 120

Query: 121 EGIQPENLHFETGVDDEAVDDEPLDTANVPFIPPLQIVMLIVGTRGDVQPFVSIGKRLQE 180
           EGIQPENLHFETGVDDEAVDDEPLDTANVPFIPPLQIVMLIVGTRGDVQPFVSIGKRLQE
Sbjct: 121 EGIQPENLHFETGVDDEAVDDEPLDTANVPFIPPLQIVMLIVGTRGDVQPFVSIGKRLQE 180

Query: 181 HGHRVRLATHANFKDFVLSTGLEFFPLGGDAKVLADYMVKNKGFLPSGPSEIHAQRNHLK 240
           HGHRVRLATHANFKDFVLSTGLEFFPLGGDAKVLADYMVKNKGFLPSGPSEIHAQRNHLK
Sbjct: 181 HGHRVRLATHANFKDFVLSTGLEFFPLGGDAKVLADYMVKNKGFLPSGPSEIHAQRNHLK 240

Query: 241 DIIFSLLPACQDDDPESKIPFKADAIIANPPAYGHTQVAEALKLPLHVFFTMPWTPTSDF 300
           DIIFSLLPACQDDDPESKIPFKADAIIANPPAYGHTQVAEALKLPLHVFFTMPWTPTSDF
Sbjct: 241 DIIFSLLPACQDDDPESKIPFKADAIIANPPAYGHTQVAEALKLPLHVFFTMPWTPTSDF 300

Query: 301 PHPLAHVKHQMGYRLSYNIVDTLIWLGIRDIINSFRKKKLKLRRISYLSGHYSSLPEVPY 360
           PHPLAHVKHQMGYRLSYNIVDTLIWLGIRDIINSFRKKKLKLRRISYLSGHYSSLPEVPY
Sbjct: 301 PHPLAHVKHQMGYRLSYNIVDTLIWLGIRDIINSFRKKKLKLRRISYLSGHYSSLPEVPY 360

Query: 361 GYIWSPHLIPKPKDWGSKIDVVGFCFLDLASNYQPPNSLVEWIEAGERPIYIGFGSLPVE 420
           GYIWSPHLIPKPKDWGSKIDVVGFCFLDLASNYQPPNSLVEWIEAGERPIYIGFGSLPVE
Sbjct: 361 GYIWSPHLIPKPKDWGSKIDVVGFCFLDLASNYQPPNSLVEWIEAGERPIYIGFGSLPVE 420

Query: 421 QPQEMTQIIVEALEITGRRGIINKGWGGLGSLAEPKDFVYVLDNCPHDWLFPRCMAVVHH 480
           QPQEMTQIIVEALEITGRRGIINKGWGGLGSLAEPKDFVYVLDNCPHDWLFPRCMAVVHH
Sbjct: 421 QPQEMTQIIVEALEITGRRGIINKGWGGLGSLAEPKDFVYVLDNCPHDWLFPRCMAVVHH 480

Query: 481 GGAGTTAAGLKAACPTTIVPIFGDQQFWGERVHARGLGPPPIPIAEFSLEKLIDAINFML 540
           GGAGTTAAGLKAACPTTIVPIFGDQQFWGERVHARGLGPPPIPIAEFSLEKLIDAINFML
Sbjct: 481 GGAGTTAAGLKAACPTTIVPIFGDQQFWGERVHARGLGPPPIPIAEFSLEKLIDAINFML 540

Query: 541 DPKVKERTLEVSKAIESEDGVGGAVNAFHKHFHRNRTLAKPEAPKRGFSVRRLLHIS 598
           DPKVKERTLEVSKAIESEDGVGGAVNAFHKHFHRNRTLAKPEAPKRGFSVRRLLHIS
Sbjct: 541 DPKVKERTLEVSKAIESEDGVGGAVNAFHKHFHRNRTLAKPEAPKRGFSVRRLLHIS 597

BLAST of Cucsa.356970 vs. NCBI nr
Match: gi|659086031|ref|XP_008443730.1| (PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2-like isoform X1 [Cucumis melo])

HSP 1 Score: 1160.2 bits (3000), Expect = 0.0e+00
Identity = 567/598 (94.82%), Postives = 578/598 (96.66%), Query Frame = 1

Query: 1   MAHLEIHHSPHSSSGRSSDFSVSMDSDGDGDSDGDRVVPSSGNTDRNSSGDSTQDGSSVG 60
           M  L+I H  HSSSGRSSDFSVSMDSDGDGDSDGDRVVPSSGNTDRNSSGDSTQDGSSVG
Sbjct: 1   MDQLQIDHLSHSSSGRSSDFSVSMDSDGDGDSDGDRVVPSSGNTDRNSSGDSTQDGSSVG 60

Query: 61  RELVSCSTKPTKLRKSRQSHALHHLLPNIFDEKVSSRKKLRWLKRVATVKHDGTVQMEVL 120
           RELVSCSTKPT+L KSRQSHALHHLLPNIF+EKVSSRKKLRWLKRVATVK DGTVQMEV 
Sbjct: 61  RELVSCSTKPTELGKSRQSHALHHLLPNIFNEKVSSRKKLRWLKRVATVKDDGTVQMEVP 120

Query: 121 EGIQPENLHFETGV-DDEAVDDEPLDTANVPFIPPLQIVMLIVGTRGDVQPFVSIGKRLQ 180
           EGIQPE+ HFETGV DDEAVDDEPLDTANV FIPPLQIVMLIVGTRGDVQPFV+IGKRLQ
Sbjct: 121 EGIQPESFHFETGVLDDEAVDDEPLDTANVSFIPPLQIVMLIVGTRGDVQPFVAIGKRLQ 180

Query: 181 EHGHRVRLATHANFKDFVLSTGLEFFPLGGDAKVLADYMVKNKGFLPSGPSEIHAQRNHL 240
           EHGHRVRLATHANFKDFVLSTGLEFFPLGGDAKVLADYMVKNKGFLPSGPSEIHAQRNHL
Sbjct: 181 EHGHRVRLATHANFKDFVLSTGLEFFPLGGDAKVLADYMVKNKGFLPSGPSEIHAQRNHL 240

Query: 241 KDIIFSLLPACQDDDPESKIPFKADAIIANPPAYGHTQVAEALKLPLHVFFTMPWTPTSD 300
           KDIIFSLLPACQDDDPES IPFKADAIIANPPAYGHTQVAEALKLPLHVFFTMPWTPTSD
Sbjct: 241 KDIIFSLLPACQDDDPESNIPFKADAIIANPPAYGHTQVAEALKLPLHVFFTMPWTPTSD 300

Query: 301 FPHPLAHVKHQMGYRLSYNIVDTLIWLGIRDIINSFRKKKLKLRRISYLSGHYSSLPEVP 360
           FPHPLAHVKHQ+GYRLSYNIVDTLIWLGIRDIINSFRKKKLKLRRISYLSGHY+SL EVP
Sbjct: 301 FPHPLAHVKHQIGYRLSYNIVDTLIWLGIRDIINSFRKKKLKLRRISYLSGHYNSLSEVP 360

Query: 361 YGYIWSPHLIPKPKDWGSKIDVVGFCFLDLASNYQPPNSLVEWIEAGERPIYIGFGSLPV 420
           YGYIWSPHLIPKPKDWGSKIDVVGFCFLDLASNYQPP+SLVEWIEAGERPIYIGFGSLPV
Sbjct: 361 YGYIWSPHLIPKPKDWGSKIDVVGFCFLDLASNYQPPSSLVEWIEAGERPIYIGFGSLPV 420

Query: 421 EQPQEMTQIIVEALEITGRRGIINKGWGGLGSLAEPKDFVYVLDNCPHDWLFPRCMAVVH 480
           EQP+EMTQIIVEALEITGRRGIINKGWGGLGSLAEPKDFVYVLDNCPHDWLFPRCMAVVH
Sbjct: 421 EQPEEMTQIIVEALEITGRRGIINKGWGGLGSLAEPKDFVYVLDNCPHDWLFPRCMAVVH 480

Query: 481 HGGAGTTAAGLKAACPTTIVPIFGDQQFWGERVHARGLGPPPIPIAEFSLEKLIDAINFM 540
           HGGAGTTAAGLKAACPTTIVPIFGDQQFWGERVHARGLGPPPIPIAEFSLEKLIDAINFM
Sbjct: 481 HGGAGTTAAGLKAACPTTIVPIFGDQQFWGERVHARGLGPPPIPIAEFSLEKLIDAINFM 540

Query: 541 LDPKVKERTLEVSKAIESEDGVGGAVNAFHKHFHRNRTLAKPEAPKRGFSVRRLLHIS 598
           LDPKVKER L+VSKAIE+EDG  GAVNAFHKHFHRNRTL KPE  K GFSVRRLLHIS
Sbjct: 541 LDPKVKERALDVSKAIETEDGAVGAVNAFHKHFHRNRTLTKPETLKHGFSVRRLLHIS 598

BLAST of Cucsa.356970 vs. NCBI nr
Match: gi|778664184|ref|XP_011660239.1| (PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2-like isoform X2 [Cucumis sativus])

HSP 1 Score: 1030.0 bits (2662), Expect = 1.6e-297
Identity = 497/497 (100.00%), Postives = 497/497 (100.00%), Query Frame = 1

Query: 101 RWLKRVATVKHDGTVQMEVLEGIQPENLHFETGVDDEAVDDEPLDTANVPFIPPLQIVML 160
           RWLKRVATVKHDGTVQMEVLEGIQPENLHFETGVDDEAVDDEPLDTANVPFIPPLQIVML
Sbjct: 20  RWLKRVATVKHDGTVQMEVLEGIQPENLHFETGVDDEAVDDEPLDTANVPFIPPLQIVML 79

Query: 161 IVGTRGDVQPFVSIGKRLQEHGHRVRLATHANFKDFVLSTGLEFFPLGGDAKVLADYMVK 220
           IVGTRGDVQPFVSIGKRLQEHGHRVRLATHANFKDFVLSTGLEFFPLGGDAKVLADYMVK
Sbjct: 80  IVGTRGDVQPFVSIGKRLQEHGHRVRLATHANFKDFVLSTGLEFFPLGGDAKVLADYMVK 139

Query: 221 NKGFLPSGPSEIHAQRNHLKDIIFSLLPACQDDDPESKIPFKADAIIANPPAYGHTQVAE 280
           NKGFLPSGPSEIHAQRNHLKDIIFSLLPACQDDDPESKIPFKADAIIANPPAYGHTQVAE
Sbjct: 140 NKGFLPSGPSEIHAQRNHLKDIIFSLLPACQDDDPESKIPFKADAIIANPPAYGHTQVAE 199

Query: 281 ALKLPLHVFFTMPWTPTSDFPHPLAHVKHQMGYRLSYNIVDTLIWLGIRDIINSFRKKKL 340
           ALKLPLHVFFTMPWTPTSDFPHPLAHVKHQMGYRLSYNIVDTLIWLGIRDIINSFRKKKL
Sbjct: 200 ALKLPLHVFFTMPWTPTSDFPHPLAHVKHQMGYRLSYNIVDTLIWLGIRDIINSFRKKKL 259

Query: 341 KLRRISYLSGHYSSLPEVPYGYIWSPHLIPKPKDWGSKIDVVGFCFLDLASNYQPPNSLV 400
           KLRRISYLSGHYSSLPEVPYGYIWSPHLIPKPKDWGSKIDVVGFCFLDLASNYQPPNSLV
Sbjct: 260 KLRRISYLSGHYSSLPEVPYGYIWSPHLIPKPKDWGSKIDVVGFCFLDLASNYQPPNSLV 319

Query: 401 EWIEAGERPIYIGFGSLPVEQPQEMTQIIVEALEITGRRGIINKGWGGLGSLAEPKDFVY 460
           EWIEAGERPIYIGFGSLPVEQPQEMTQIIVEALEITGRRGIINKGWGGLGSLAEPKDFVY
Sbjct: 320 EWIEAGERPIYIGFGSLPVEQPQEMTQIIVEALEITGRRGIINKGWGGLGSLAEPKDFVY 379

Query: 461 VLDNCPHDWLFPRCMAVVHHGGAGTTAAGLKAACPTTIVPIFGDQQFWGERVHARGLGPP 520
           VLDNCPHDWLFPRCMAVVHHGGAGTTAAGLKAACPTTIVPIFGDQQFWGERVHARGLGPP
Sbjct: 380 VLDNCPHDWLFPRCMAVVHHGGAGTTAAGLKAACPTTIVPIFGDQQFWGERVHARGLGPP 439

Query: 521 PIPIAEFSLEKLIDAINFMLDPKVKERTLEVSKAIESEDGVGGAVNAFHKHFHRNRTLAK 580
           PIPIAEFSLEKLIDAINFMLDPKVKERTLEVSKAIESEDGVGGAVNAFHKHFHRNRTLAK
Sbjct: 440 PIPIAEFSLEKLIDAINFMLDPKVKERTLEVSKAIESEDGVGGAVNAFHKHFHRNRTLAK 499

Query: 581 PEAPKRGFSVRRLLHIS 598
           PEAPKRGFSVRRLLHIS
Sbjct: 500 PEAPKRGFSVRRLLHIS 516

BLAST of Cucsa.356970 vs. NCBI nr
Match: gi|659086033|ref|XP_008443731.1| (PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2-like isoform X2 [Cucumis melo])

HSP 1 Score: 982.6 bits (2539), Expect = 3.0e-283
Identity = 476/498 (95.58%), Postives = 484/498 (97.19%), Query Frame = 1

Query: 101 RWLKRVATVKHDGTVQMEVLEGIQPENLHFETGV-DDEAVDDEPLDTANVPFIPPLQIVM 160
           RWLKRVATVK DGTVQMEV EGIQPE+ HFETGV DDEAVDDEPLDTANV FIPPLQIVM
Sbjct: 20  RWLKRVATVKDDGTVQMEVPEGIQPESFHFETGVLDDEAVDDEPLDTANVSFIPPLQIVM 79

Query: 161 LIVGTRGDVQPFVSIGKRLQEHGHRVRLATHANFKDFVLSTGLEFFPLGGDAKVLADYMV 220
           LIVGTRGDVQPFV+IGKRLQEHGHRVRLATHANFKDFVLSTGLEFFPLGGDAKVLADYMV
Sbjct: 80  LIVGTRGDVQPFVAIGKRLQEHGHRVRLATHANFKDFVLSTGLEFFPLGGDAKVLADYMV 139

Query: 221 KNKGFLPSGPSEIHAQRNHLKDIIFSLLPACQDDDPESKIPFKADAIIANPPAYGHTQVA 280
           KNKGFLPSGPSEIHAQRNHLKDIIFSLLPACQDDDPES IPFKADAIIANPPAYGHTQVA
Sbjct: 140 KNKGFLPSGPSEIHAQRNHLKDIIFSLLPACQDDDPESNIPFKADAIIANPPAYGHTQVA 199

Query: 281 EALKLPLHVFFTMPWTPTSDFPHPLAHVKHQMGYRLSYNIVDTLIWLGIRDIINSFRKKK 340
           EALKLPLHVFFTMPWTPTSDFPHPLAHVKHQ+GYRLSYNIVDTLIWLGIRDIINSFRKKK
Sbjct: 200 EALKLPLHVFFTMPWTPTSDFPHPLAHVKHQIGYRLSYNIVDTLIWLGIRDIINSFRKKK 259

Query: 341 LKLRRISYLSGHYSSLPEVPYGYIWSPHLIPKPKDWGSKIDVVGFCFLDLASNYQPPNSL 400
           LKLRRISYLSGHY+SL EVPYGYIWSPHLIPKPKDWGSKIDVVGFCFLDLASNYQPP+SL
Sbjct: 260 LKLRRISYLSGHYNSLSEVPYGYIWSPHLIPKPKDWGSKIDVVGFCFLDLASNYQPPSSL 319

Query: 401 VEWIEAGERPIYIGFGSLPVEQPQEMTQIIVEALEITGRRGIINKGWGGLGSLAEPKDFV 460
           VEWIEAGERPIYIGFGSLPVEQP+EMTQIIVEALEITGRRGIINKGWGGLGSLAEPKDFV
Sbjct: 320 VEWIEAGERPIYIGFGSLPVEQPEEMTQIIVEALEITGRRGIINKGWGGLGSLAEPKDFV 379

Query: 461 YVLDNCPHDWLFPRCMAVVHHGGAGTTAAGLKAACPTTIVPIFGDQQFWGERVHARGLGP 520
           YVLDNCPHDWLFPRCMAVVHHGGAGTTAAGLKAACPTTIVPIFGDQQFWGERVHARGLGP
Sbjct: 380 YVLDNCPHDWLFPRCMAVVHHGGAGTTAAGLKAACPTTIVPIFGDQQFWGERVHARGLGP 439

Query: 521 PPIPIAEFSLEKLIDAINFMLDPKVKERTLEVSKAIESEDGVGGAVNAFHKHFHRNRTLA 580
           PPIPIAEFSLEKLIDAINFMLDPKVKER L+VSKAIE+EDG  GAVNAFHKHFHRNRTL 
Sbjct: 440 PPIPIAEFSLEKLIDAINFMLDPKVKERALDVSKAIETEDGAVGAVNAFHKHFHRNRTLT 499

Query: 581 KPEAPKRGFSVRRLLHIS 598
           KPE  K GFSVRRLLHIS
Sbjct: 500 KPETLKHGFSVRRLLHIS 517

BLAST of Cucsa.356970 vs. NCBI nr
Match: gi|568877509|ref|XP_006491778.1| (PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2-like isoform X2 [Citrus sinensis])

HSP 1 Score: 818.1 bits (2112), Expect = 9.7e-234
Identity = 398/582 (68.38%), Postives = 474/582 (81.44%), Query Frame = 1

Query: 16  RSSDFSVSMDSDGDGDSDGDRVVPSSGNTDRNSSGDSTQDGSSVGRELVSCSTKPTKLRK 75
           R SD S  +  D D + + +     +G  D +   ++ +D +S   ++ SCS+  ++  K
Sbjct: 7   RCSDASEDVSVDYDAEIESENC---NGRGDSSGGQNAARDDASTSSDVRSCSS--SERGK 66

Query: 76  SRQSHALHHLLPNIFDEKVSSRKKLRWLKRVATVKHDGTVQMEVLEGIQPENLHFETGV- 135
              +H+L  L   +FDE+V  RKKL+WL R+ATVK DGTVQ EV   I+P+NL F TGV 
Sbjct: 67  VVHNHSLGILSARLFDERVPFRKKLKWLNRLATVKDDGTVQFEVPADIKPQNLDFGTGVV 126

Query: 136 -DDEAVDDEPLDTANVPFIPPLQIVMLIVGTRGDVQPFVSIGKRLQEHGHRVRLATHANF 195
             D++ D EP++ A+V  IPPL IVMLIVGTRGDVQPFV+IGKRLQE GHRVRLATHANF
Sbjct: 127 YTDDSTDQEPIEAADVHGIPPLHIVMLIVGTRGDVQPFVAIGKRLQEDGHRVRLATHANF 186

Query: 196 KDFVLSTGLEFFPLGGDAKVLADYMVKNKGFLPSGPSEIHAQRNHLKDIIFSLLPACQDD 255
           KDFVL  GLEFFPLGGD K+LA YMVKNKGFLPSGPSEI  QRN LK+II+SLLPAC+D 
Sbjct: 187 KDFVLGAGLEFFPLGGDPKILAGYMVKNKGFLPSGPSEIPIQRNQLKEIIYSLLPACKDP 246

Query: 256 DPESKIPFKADAIIANPPAYGHTQVAEALKLPLHVFFTMPWTPTSDFPHPLAHVKHQMGY 315
           DP++ +PFK DAIIANPPAYGHT VAE+LK+PLH+ FTMPWTPTS+FPHPL+ VK  + Y
Sbjct: 247 DPDTMVPFKPDAIIANPPAYGHTHVAESLKVPLHIIFTMPWTPTSEFPHPLSRVKQPVAY 306

Query: 316 RLSYNIVDTLIWLGIRDIINSFRKKKLKLRRISYLSGHYSSLPEVPYGYIWSPHLIPKPK 375
           RLSY IVD LIWLGIRD+IN FRKK+L LRR++YLSG YSS  +VPY YIWSPHL+PKPK
Sbjct: 307 RLSYQIVDALIWLGIRDMINDFRKKRLNLRRVTYLSGSYSSPLDVPYAYIWSPHLVPKPK 366

Query: 376 DWGSKIDVVGFCFLDLASNYQPPNSLVEWIEAGERPIYIGFGSLPVEQPQEMTQIIVEAL 435
           DWG KIDVVGFCFLDLAS Y+PP+SLV+W+E GE+PIYIGFGSLPVE+P++MT+IIV+AL
Sbjct: 367 DWGPKIDVVGFCFLDLASTYEPPDSLVKWLEDGEKPIYIGFGSLPVEEPEKMTEIIVKAL 426

Query: 436 EITGRRGIINKGWGGLGSLAEPKDFVYVLDNCPHDWLFPRCMAVVHHGGAGTTAAGLKAA 495
           EITG RGIINKGWGGLG+LAE KDF+Y+LDNCPHDWLF RC+AVVHHGGAGTTAAGLKAA
Sbjct: 427 EITGHRGIINKGWGGLGNLAESKDFLYLLDNCPHDWLFSRCLAVVHHGGAGTTAAGLKAA 486

Query: 496 CPTTIVPIFGDQQFWGERVHARGLGPPPIPIAEFSLEKLIDAINFMLDPKVKERTLEVSK 555
           CPTTIVP FGDQ FWGERVHARGLGP PIP+ EFSL+KL+DAI FML PKVKER +E++K
Sbjct: 487 CPTTIVPFFGDQPFWGERVHARGLGPAPIPVDEFSLDKLVDAIRFMLHPKVKERAVELAK 546

Query: 556 AIESEDGVGGAVNAFHKHFHRNRTLAKPEAP--KRG-FSVRR 593
           A+E+EDGV GAV AF+KHF   ++ ++PE P   RG  S+RR
Sbjct: 547 AMENEDGVTGAVKAFYKHFPGKKSESEPELPHSHRGLLSIRR 583

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
U80A2_ARATH3.7e-21763.61Sterol 3-beta-glucosyltransferase UGT80A2 OS=Arabidopsis thaliana GN=UGT80A2 PE=... [more]
U80B1_ARATH9.6e-16550.09Sterol 3-beta-glucosyltransferase UGT80B1 OS=Arabidopsis thaliana GN=UGT80B1 PE=... [more]
ATG26_YARLI4.9e-6835.92Sterol 3-beta-glucosyltransferase OS=Yarrowia lipolytica (strain CLIB 122 / E 15... [more]
UGT52_DICDI4.1e-6733.62UDP-sugar-dependent glycosyltransferase 52 OS=Dictyostelium discoideum GN=ugt52 ... [more]
ATG26_CRYNB1.2e-6635.41Sterol 3-beta-glucosyltransferase OS=Cryptococcus neoformans var. neoformans ser... [more]
Match NameE-valueIdentityDescription
B9RFT9_RICCO2.6e-23367.70Transferase, transferring glycosyl groups, putative OS=Ricinus communis GN=RCOM_... [more]
M5WHU1_PRUPE3.6e-22771.01Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa019814mg PE=4 SV=1[more]
M5VNV0_PRUPE8.9e-22665.39Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002976mg PE=4 SV=1[more]
V4SS72_9ROSI1.7e-22474.95Uncharacterized protein OS=Citrus clementina GN=CICLE_v10011454mg PE=4 SV=1[more]
A0A059D2A3_EUCGR2.8e-22467.70Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_B01378 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT3G07020.22.1e-21863.61 UDP-Glycosyltransferase superfamily protein[more]
AT1G43620.15.4e-16650.09 UDP-Glycosyltransferase superfamily protein[more]
Match NameE-valueIdentityDescription
gi|778664181|ref|XP_011660238.1|0.0e+0099.83PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2-like isoform X1 [Cucumis sa... [more]
gi|659086031|ref|XP_008443730.1|0.0e+0094.82PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2-like isoform X1 [Cucumis me... [more]
gi|778664184|ref|XP_011660239.1|1.6e-297100.00PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2-like isoform X2 [Cucumis sa... [more]
gi|659086033|ref|XP_008443731.1|3.0e-28395.58PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2-like isoform X2 [Cucumis me... [more]
gi|568877509|ref|XP_006491778.1|9.7e-23468.38PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2-like isoform X2 [Citrus sin... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002213UDP_glucos_trans
IPR004276GlycoTrans_28_N
Vocabulary: Molecular Function
TermDefinition
GO:0016758transferase activity, transferring hexosyl groups
Vocabulary: Biological Process
TermDefinition
GO:0008152metabolic process
GO:0005975carbohydrate metabolic process
GO:0030259lipid glycosylation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0030259 lipid glycosylation
biological_process GO:0048316 seed development
biological_process GO:0016125 sterol metabolic process
biological_process GO:0005975 carbohydrate metabolic process
biological_process GO:0008152 metabolic process
cellular_component GO:0005886 plasma membrane
molecular_function GO:0051507 beta-sitosterol UDP-glucosyltransferase activity
molecular_function GO:0016758 transferase activity, transferring hexosyl groups

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cucsa.356970.2Cucsa.356970.2mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePANTHERPTHR11926GLUCOSYL/GLUCURONOSYL TRANSFERASEScoord: 152..582
score: 1.4E
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePFAMPF00201UDPGTcoord: 398..553
score: 1.
IPR004276Glycosyltransferase family 28, N-terminal domainPFAMPF03033Glyco_transf_28coord: 157..298
score: 8.2
NoneNo IPR availableGENE3DG3DSA:3.40.50.2000coord: 438..543
score: 1.5E-4coord: 156..426
score: 5.8
NoneNo IPR availablePANTHERPTHR11926:SF308STEROL 3-BETA-GLUCOSYLTRANSFERASE UGT80A2coord: 152..582
score: 1.4E
NoneNo IPR availableunknownSSF53756UDP-Glycosyltransferase/glycogen phosphorylasecoord: 157..571
score: 3.93