HG10019535 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10019535
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionExostosin domain-containing protein
LocationChr04: 22889867 .. 22906183 (+)
RNA-Seq ExpressionHG10019535
SyntenyHG10019535
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAATAAACCTTGGAGACATTCCTTTCGACTTGGACTTCCATCCATCCGACCAATTGGTTGCAGCTGGTGTGATCGGGGGCAATCTCCACTTGTACTTTTTCTTTCATGTTTTGTATTTTATTTGATTTCATTTTGATTTTCGTCAATCACCTATCAATTTCTACGTTCGTTTTTGCAGGTACCGTTATGATGCAAATGCTTTACCCCAAAGGTACGATAACCGTTGGTCTTCTTCCCCCCATATACACTTCGTACAAATTTATTTGAACTCCTGTAACCTACAGTATAAGTAGTATATTTTTTTTTTTGAACAAGAGAACTTTTCATCATGCACAAATTTATTTGAGCTCCTTTAACCTACCTACAGTATAAGTAGTATATTCATGCTCGCTTAAACTTTTCTGTCTATCTCAGGCTCTTTAAAGTTCGTGCGCATGTCAAATCTTGCAGAGCTGTTCGATTCATCAATGACGGACGTGGTATTGCCTTTTTTTTTTTTTTGCTTGCTTTTATTTTCCTTTTTATTTTTTAATTCCTTGGCAGGAAATGATTGGCATTTTCATCCTTCAAATGATTGGTTTATTGCCTGTATATCAGCAATTTTGACAGGTTCTTCAGACCATTCCATTCTCTCTACGGATGTGGAGACTGGTTCTGTTATTGCTCGTCTTGAAGATGCACATGAGTATGTTCCTTTCCTTCTGTTTTCAGTTTGTCCATCTTCATGCATATTCAATGAAATATAAGATTGATGAGAAATGTGATAATCCACTTTCTTCTACAGTGAAGCAGTCAGTAAATTGATCAACATAACCTCGGAAACCATTGCTTCAGGAGATGACAATGGGCGCATCAAGGTCCTTTTCTTGTCATTATCTTCTCATATTATAAAAAACGCAATAGTGACCTTACTTGAACATGATTTGTTTTATAGGTTATACAATTGTGTCCTTGAGCTCTAATTATAGAATGTTATAATTCGTGAAGGATCATCATTTTTTTTAGGATGAATGGTTGAACTTTTCATTGAGAAGAAATGAAAGAATACAAGAGGTTCATAAAAACCAACCTGCATTAAAAGGAGCCGAACCAAAACAAAAATGGACTCCAGTAAAGTAAAATGAGACCTAAAGGGTATTTATAAAAAGACTTAACATATTTATAAAAAGACACCAAAGCCCAAAAGAGACATTGATCCTAACAAGTGCTCACATATCCTCCTTGGTTCTCTTTCATCCTCTAAACACTATGCTATTCTTTTCTAACCTAATATCCCATAACATCACACAAACCTCTGCAATCCACATAAAATTGACCCTTCTCATGGAAGGATGTGTGAAAAAGGAACTACTTAATGGCCTTCTAGGGGGGGGGGGGGGAGATGTATCGATTCATACCCCTAAACTTTGGAGTTGTATCAATTAATTAGTTGCAACGATAAATTATTTCACTCAACACGTGAGCAGGTAGCCCTACTACGGTGGTTTTCTGTTGAGGTACACTAAGATAATAACAGTTTTTTTGTTGGATAGGAAACTTAATAACAGTTGGATATCTGATGAAGAGTTTTCTCACGCTGATATTTTGTTTTATACTTTTCCCATCTCATGTTTTGCGTGTTCTTTGGCGTTCCAATGATTTTTTTTTACTGACTGATGTATCTTTCAAGGTATGGGATACCAGACAACGATCTTGCTGCAGTTCTTTCAAAGCTCATAAAGATTATATTTCAGATATGACCTATTCATCTGATTCCATGAAGCTTTTGGCAACAAGGTAATCCTTGCCGTAACTATTATCTCACTTCAGTTTTATTATCTTAAATTCTAGGAAGATTGCTGATGTTTGTTGAACCTGAACCAACCATAGACTGAAGTATGATTAGTGTCGAGGCACACTCCTAGGCGCTTTCCTTGTGTCTTGCTTCAAAGAAGTGAGTCTCCCTTAATGAGGCAGAAGCATGCTTGAGGCGTGGGCCTTAGCACCTCAATCCTTACAATCTGAGCAAGTCATGAATGTTCTTAAAAAATATTCACTTTTCCTTACAATCTGGAATAAAAGCTCTAATCTTAAAATTTTTCATATCTTTTATTCGTAGTACTATTTTTCTTCATGAAAAAGAATTATTTACATTTTCTATTGTGTGCCCCATAAGTCATGTTTTTTTAGTGCCTCAGACTTAGGCCTTTGTGGTTTAGTGCACCTCTAAGTTTAAAAACTGCTGGGTATATAACAATGATTTTGTAATCTCTTTTTCAAATTTCTACTTTCTAATTTACATAGGTTTTAGATCACTTAATTGGGAACAAAAAATGCTGTTCTTATCTACAAAGTTCTTTTAAAGTAAATTTATGAAAATTCAAGAAACATCTTTATGAATTCCTATTTGTCATCTTCTTGATTCTACCCTCTTTTCCATTTGAAGGATTTAAATGAGAAATTGACTGCATCGTGGATTATATGAATAAATTAGGCCAATAATGTTGTCAAAACTTGCTTCTTTGATTTTATTGCCTCTTTTAACGTTTTTTTTCTCTTTCTTCTCTTGTTGGGTTTTAAAGTGGAGATGGGTCTCTATCTGTATGCAATCTTAGGAGAAACAAGGTTAGCCATTCCAAGTTGTTATGACGTAATTTGTGTGACTACAACGTGTTGATGATCTTTTACTGTTTTCAATTTTCATTTTATGTGCTGTTTTGATGCGGATTTAATCTTGCTTTGACTTTTCTGAAGTTGATTAAAACAATCTCTCTTGCTTCTCACGTTGATTTGCCCTATTTAAATGCAGATCCATGCTCGATCTGAGTTTTCAGAAGTAGAGCTGCTATCTGTTGTTATAATGAAGGTAGATTTACATATGATGGAGTATCAATTATATTCTAGACACACGTTTACATACTTGCTTCTGAAAGCTGTTATTTTCATTGGGATGGTGTAAATTGTTTAAGCTTTATTATTCCTATAGCTTTGGTTCTCTTTTGTTCAATTGAAGAAGTCGTTTGTACATGTAATTTCATTTTTTTTTTTTTAAAAAAAAAGAAATGATTGGGGCTTGGAGATATCTTATTTCTCACCCCATCTTTTGTAAATTTTTTAATGACTTTTTCTAATTCAAGATTGAAATCTGAAGAACCAATCTGTTATTGATTATACTTTGTTATTTTTTGAAATCTTTTATGGGCCTCAATCATTAGGCAATTTTATTTCTCTTACTCAGAGATTAAAAGCAAGCAAAGAATTCTTGCTCAAAATATACTTTTTATATTTTTTATATTTTTTATCTTATAATTTGAAGTTGCCATTTTCTCGCTTCCTCATTGCCCCTCCACATTTTCCCCAAGGAGATTACAGAGAGGGAAGAAGTAAATTTTAGAAAATTCTTTTAAACTTCTTGTGTTGGGGGGAAGTATCTGAACTGCCACAAAGAATTTGAGTGCGAATAAACATTTATTCTCCATTTCATAAATGGCATGGAATTTTAACTTTCTCACTCTTTCGTATATCTTTTATGGTTTTCTTTTCTTGTGTGGTCAGAATGGACGTAAAGTTATCTGTGGATCACAAACTGGGACTCTATTACTGTATTCATGGGGTTTCTTCCAGGACTGCAGGTGTACATATATGGTTTTTCTTGAATATGTATGATGCATTGTTCTGATAATTTTAATGAAACCAGGAATTCATTCTTGTGATTGGATGTATTATAATGGTACCAATCTTCTTTTTCTTTATTGATTCATTCCTCATTTCTTTTGGTTTAACTTTATTCTTCTTCAGTGATCGCTTTGTTGATGTCTCTCAAAATCCTGTGAATGCATTGCTAAAGGTAAGAACTTGTGTAACGTCAAGTAGTATTGTCATTTTTTAATATCTTCAAGGAATCTTTCAATTACAGCCTCGGTTCTTTCCTTCCTTTTAGCTTGACGAAGACAGAGTCATTGCTGGATCTGAGAGTGGACTCATCAGGTGAGAACTTTCATGAATAGATGCAACCCTATTTCATTCCACATATCAGTAGAGCATGTATTTTTATTTTCCCTTTATTATAGTTCACTTATTCATGTGTTATTTTCATTAGTCTGGTAGGCATATTGCCCAATAGAGTAATTCAACCAATTGCGGAACACTCTGACTACCCTGTGGAGCGGCTTGGTAAGATATTTTTCCGTTTTGGCTGATATTAAGTATCCTGTATTCATTTTATCAATGAATTAAAGAACTGAGTTTCTTTGACTGTTAATTCAGCATATCATTTGAGCTTTTTTGGTCTGTCAACACTCAATAAACTACTTCTCTTTAGTTTGTTTTTCTATTCATGTTTAGCATAACATTTTAACAATTTTTTCTTGTCTTGTAGCTTTCTCCCATGACAGAAAGTTTCTCGGCAGTATTTCACATGATTACATGGTAAAGGTAAGCCACACACTCTCTCTCTCCCTCTCTCTCTCCCCAATTGTTTTGGTCCGCTGATTTTATGCTCAATCCCGACTGAACTATCAGTACGTGGCTGAAGGGAAATACTAACCAGAACGAAACTTTTGTCGGCCTTGTTTTTCAATTAATTTTAATGGAAGTATGGTTTCTCATATGTGTGTGTGTGTGTTTAAAGAAAGAAAGAATCATAAACGAATAAACAAGACGTGTAGTGGGGGCCACAAAGGAATTTTGTTACAGTGGCAGAGAGGGTAGAGTATATAGCTGAAGGAGTTGTGTGAGGGTTATATTTTTTTGGCTGAGTTCTGTTGAGAGAGTTCTCTTAGGAGAATTGGGAGAGGTGAGAAGCTTTCAAAACATCCTTATTTCTATGCTAATTGAGTAAAATAAATTGTGGGCAAGGCCTATCATTATCCCTATCTAGATTCTAATGTCAAATATTGTGGTTGTCCTAATTTTTCTTTCAAGACGTGTAAGTTGGTCGAATCATCTATGTTTACAATTATTATTGTTGACATTGTTGATGTTTTAGTTTCCAAAATGTTTTAATAGAAAAAAGTTGGGGATGCTTTATCCCCTTGTCCTTTAGGTTGTTTTGTTGAGGCCATGGGCTGTTTTCAATGATATCCTTTGTTTCTCATTAAAAAAAAAGGGTCTTCTCCCTTGTTAGTATTGTAAGAAGTTGACATCCTTAAGTACATGGCATCGGATTGGCTGAGGCTGCCTATGGGAACACGTTGTTGCTTTCTAGTTAAAAGTTATATTTAAAAATGACATGAAACTTTAATAGTTACGTGATGATGAACTGCACTGGCCATCATTTCTGTCGTGCCTTCCTAACAAGGGGTAAAGTAAGAGTTTATGAATTCTTTTACTTTTACACTTCTATACAATGTCAGATACATACATACATATACATATACATACACATACATTTCTTTTTTAATTTGAAATTTCCAGTAATTGGGAGGCCTAAGAAACATGTTGGAACGGTTTTTCCATATTTTTGTTGTCTACTTGTGGTACATGTGGTTTCACATGTTGCCAAACCTTAATTGTAACTGTGTTCTTCCTCTTTGTTTTTTCGCTTTAATTTTTACGACTTATAAGACTTCTATCTCTGTGTTATTGTTTGCGCAGCTGTGGGATATGGATTTATTGCAAAGTTCTGGAAACGCTTTAAATGGTCGGGCCACAGTAGATGCAAGTGCCAGTAACAATATATTTACTTTCACGTAAGCCCCACCTTTGGATTTGGATTTAATTTGAAACAACAGTTTGGCTTAAAAAGATTTTGCTTGGTTTAGGTGGTAGCTTCAATTTGCTAATAGAAGAAATTAAGACTTAATTCTAGTGAAATGAACCAAGGAAACTGATAGTGATTGGGTTTTTTTCCTTTTGAAAGAAAGAAAGAGGATGAAACTATCCTTATCGAAGTTTAAATTAGGTGATATAATTAATGATATGGTGAAGCCGCAAAGTGGGGTCCTCCCTTCAACTAGAGAAGCCAGTATAGCTATAGAGTTGATGGGGTGAGATGAGATGTTGGCTTCGTGATTTTGTTACATCTAAAATGATCTTATAGGACCAATCTAAACGAGGGAAGAATAATAAGTGATGATTCCCTACCTTGTACATAAGAAAAATATCAGTTTATACCTTTAACCCTGGAGTTGTATTGATTTATTAAAATCCTGAACTAATAGTTATATCTATTTGAACTTGAACTTTTGCAAGTGTCACTGTACACCCTCCATAATGTTTCATTTGGAGAAACAGTATGTGAAGCTTATAATTGTATCAATAAAACTTCTAAACTTTCACATGTGAATCAATTTAGACTCTAATTTTCTTAATTTTCTTTGAGAATTGTTCATGCATCAACTCTAATAGCCTATTTTGTAAAACTGACATATGAAGAAGGCTACATATGTGAGAATTAACGCATGGATGGTTTTCAAAGGTAATAATAATGAATGGTCTAAATTGATTTGGTTATAAAAATTTAGGAGTTTAATTGATACAAATTAACAGTTTCGCACGATATCTGTAGAACGAAATGATACAAGTACAAAAGTTCAGTGTTTAAATTGATATGATTATTAGTTTAAAATTTAAATTGATTTAACCCCAAAGTTTAGGGGTGTAATTGATATTTTCTCAATTTCTCAAAAACAAAAAATGTGATGATATAGTTTGCAGTTATATCTACTTGTCGTTTCGGGTTATGTTTTCACGTTCTTTGCATATCTTGAACATTCTGTTCTATTTTTCAAAACACCCTGGTAACCATGAGGCCACGCTCAGCAACATAAGTTTGATATGGCCATGTATTTTTTTCACCTACAATCTTCTACTCTTTGCAGGGACTAGAAGAAAGAAGATCTAAAGAAAGGGACCAATCAAATTGTCAATTTATACTATATATTAAGGTTGATTTTGGCTTTACCCTCTTCACATGTTAAACTCTTTAAGACTTTATTACAAAGTTCTGGTCCATTTTCATTCTCATTTTGGATAGTATCAATTTATGTGATCTGAAGGGCATAATTTTTTTTTTGATATGGTGGCAACATATTCTCATTCTTGAGTTGGGTTGTGTAGTGTAGTTAACTGACTTTTTCTTATTGAGGTGGATCCATATGAATAATAGTGAGTAATTAAAGGCAAGAAACAGAAAAGTGTGTGTTTGTTTTTTTAAAACTGAAATATATTAAAAACAGAAAACCAATACAACCATGAAACTTGACTCTTCTCGAGAACGAAGGAACAACAAAGGGAGGAAACTAAGACCTGAAAGATATTGAAGAACACTAATACAGGAAGATTGTGGGTTTTTGATCTACTCTCTAAAAGTTTTATAAAAAAAAATATTAAAAAAAAACTTAGAAATTGAATTCATCTTTCCATAAGTTGCCTTTCATGTATTTGATATATTTATAGTCCTAAACTTTATAAATTGTATCAATTAAAATTAAAAACTAATTATTGCATACTTTTAACCTCAATCTTTTATCAAAATAATAAAAAAAAAAATACAAATCTTTTGGTGAATCAGTTTAAAGCCTTCTTGACCTAAAACGTGTTATTATTTTTTTCTCTGATAGATACAAGCTTTATTAATTATTTGATAAGGGCAACAAAATCTACATAATTTCTACGATAATATAATATTGTTTAATTTGTGTCTAGCTTTGGTTTTAGTTTAAGAATTGAGGCATAATAGTTAAAAGCTGTCCAACTATTATAAATAAATAAATAAAATAAAATAAAATATAGCCTACTAAAGTTAAAATAATATTTTGATTTCTATACTTTGAAGTTTATTTAATTTTAGTTTTTATATTTTTAATTACAGTTTTAGTTCATACACTTTCAATAAATCTTAAATTTAGTATAGTTTATTGTTGATTTTTATATAAATTTTTTGTTATCTATTTATTATAAATTTTGAAAGCATATTCACATATTTTATTTCTTCTTTGAAAATTATTATTATTTTTATTTAATCAATTTTGATAAAAATTAATTTTGAAAATTTTAATTTAAGATTTATTAAAAATATCGAAACTAAAACTGAACAAAGTTTATAATACGAGAACTAAAATAATATTTTAATCTAAATATAATTTAGAGAGAAACAATTGGAAAATGGCTTTAAAAAATGGAAAATGAGGGAGGAGCCTCCAATTCAAAAAACAGATATCCAATTATTGGTGGAGTCAACACAAAAGAAAAGTCTCCGTTGTCACCTAATTTCAAATTATTTATTTATTTATGTACAATAAAAAATTGATTTCAAATTATTTATTTATTTATGTACAATAAAAAATTAATTATAGTGTCACATGACAGCGACTAGAATATTAACAATCCGTAAATATTAAAATATAACCTAAGATGTTGACTTTATTCTATTTAGATCAGCCCTTTTCAATTATCATACTTTTTTTTTTTATCTAATTTTTAAATATCATACCTAAAATTATATATAAAATTTCATGATGGTACTTTAAATACTTTTCATTCCACATGATATCTTTTTTTAACGAAAGTCATTGTAATTGATATGAAATTTCTCTTTAAAAGTGAGAGGTTCGATCTCTAATCTTTGCAATTGTTGTACTAAAAAATTGTGTTGTTTCACTAAAAAAAATGATCCTAAGAATTTATAAATGATAAGTTTCTATTTAATGAATTATATTTTTTTAAAAAAAAAATCATTATTTTAACAAAAATGTTTATTTATGTATCATTAATTATGTTACTCGTCTATTTGTTTATATATATATATTTTATTTCATTTAACCTAAGAAAAAACTAAATTATATCTCTAAATTTTGTGGGCCTGGTCGGGTTGAGTTTTCTTAAAAATCGACCCGACCCAACTTTTTTTTTTTTTTTTTTTTAAAAAATTAAAATTTATTTTAACAATTTTATTGGACTATTGACACCAAAATCAATAATATCACTTGAAAAAGTTGAAATCAAGCATATAGAAAATAAAATTAAACACACTACTAAATTTAGAGAAATTCAAAACTCACTACTTTAACTTTGTTAAACAATCACTAAACTTAAAAAGTTTAACTTAATAACATGATAAAAAAGTAAAATAGAAAAACAAAATATCATAATAAAATAAAACCAACTTAGCATTTGTTGAATATTGAGACATAATAGATAGCCAAGCAGAAATCCTAATCAGCACATTCCAAATCAGGTAAAATTTGATGACAAGAGGTAAAATTTGATGACAAGATTGAAAATAATTTGTTCAGGGAAGCCATTAAATATTTAAAAACAACCAAATGTCATAAAAATCTTCCAAAATAACAACTAGATGAACAAATTCTCAAATCATTTAATAGAAACCAAATTTATAAGGAAGAAAACATCACAAAAATTCAAAAAAAATCCATTTAAAAAAAAATATACAATGATATGATAAACAAAGTGAAGATACTCCAAAACCAACACGATGATATGATATACTCCAACATTCATATACGCATAGGGCAAATAAGATCTAATTGATTAAATTCTATATGTTTAATTGAGATGAAGGGTACCAATCAGTAGTAGTTGTCTCACTTTCTTATGCTAAACCAGAAAGTGAGAAAACAAAAGGTGAAAAGAATCCAAACTCAAGTTCGATGGCGAGCTGTGAATAGGAGAACGAACGACGAGAGAGACAAGGAATGATGAGGCGAGTGGTAATTGAGAACTAAGAACACAACCTAAATGTCGTGTGCATTTGAAACCTAATCTGAGATTAGGTAAACTAACAAAGTGAGTTCGTTTAGAATTATAGTTGAAATCTAATCCAAACCAAAACAGTTGGTTCGGTTTCTTACGTCCCTACATATCAATTTTCGCTACAGACTAGTTTATGTTAATTCACCTGATTAAAAAAAAATCGTAAGAATTTCCCTAAACCTACTTTTTTTTAGAATTAAACCTAACTTTAATAATATCCAATTTTACACATTTTTAATCAAATTTGATTGATATACAAAATATTAATTGAACAACTTAGACCAAACTCACAAAATGTAGCAAAATTTAACTCTAATTTTATTTTTCTTGCTTATATTTCTGTACCATATATAGGTTTTATAATTCATAAGCATAAGAGGATGCTTTTTTTTAATGAATATTCAAATTACTTGAGTAAAATTTAGAATACTTTTAAAGTTTAAAGTTAAGATTTTAGTATGTATTTATCTAAAATTTCAGATGTATCTAACGAAATTGAGAGTTCAATGTATAAATTGATAACACTGATAAATTTAGGGGTAAAAATCAATTTTTAACCCAAAAAAAAAAAAGTTAGGTCATTTATTATTATAAATATATGCTCACTATAAAGTTGGTTGCTCCACTCATAATTTGGATCGAGTCCAACTCAAATAATCATGGTCAATCTATACAACTCAAAAGGTAATACGAACATTATTTAAATTTTAAAAGTTGAAAGTTGAAACTCAAACTCCAAGTTTATAATTTTTATATTATAAAAAAAAATAAACTACTATTAATAATAAAGTTAAATAAGTGACTAATTATTAAATATTAAAAAATGTTTCAAATTAAATGTTATTTTCAAATATAGAAAATGAGCCAATTTCTTTATAAATATAGTAAAATATCATTATCTATATGTAATAGACACTCATAGATGTATCAATGTCTAACAATGACATTTGATATATTTGAAAATATTTTTAGTAGTTTTGTCCTATAAAATAAATACCCTAAAATTAATGGGGGTTTTTAAATAAATATAAATATAACCATTATTTACCTATTGATTTTAATTATGATACATGTGATTAAAATTTAATAATGATATTTTTTAATAATACAAGTGATACAGAGTTTTGAACTTTTGGTATTTTGGTTATGAATACATGTTTTATATCTAACATAATTTCTTTATTTATTTATCTATTTAAAATAGTTTGTACAAATTTCTATTGGGAGGTTAATGCTTCATTTGTAGGTTATATTTCACATCTAATCACTTAACTTTAAAAATTATATCTATTTTGTCATGAAATTTAAAAGGTTAGTGTAATTCAACTAACATACCATATGTATTAAGGACCAAAAGATATGTCATTGGAATTATTTATTCAAATACTTGTTTTAAAAAAAAAAAGTAAAATGTTAATATGTAAACTAATTTTCAAATATTAAATAAAATTTGCAACTTAATTTTAATTTCTATATAAAAAATTACATTTCACGAAGCAAGATAGAGAACTGTTGAGAATAAATACTCTAAATACAATCATGAGATTAAAAATATCAAAATAAATATACGTATATATAATTTGTATCTTAAATAAAAAAATACATTTTAAAGTTAATAATAAATTGAGTGGGTTCGAGAAGCTAAGTGATCTCATGTATAGTACTAAGTCATGTAAATTTATCATTGACCTATACATTTAAACTTTTTAAAATTAACAAATCATGGGCTAAATATACATTTAATTTCTAATACTTGAACATTTTTCAATTTAGCGCAGAGTTTTAATGTAGTCTCTCATAGTAGGTTAATATACATTTATTTAAGGGAAATTTTTAAAGATTGAAAAAATGTCAAACTATTTACAGAAAATAGCAAAAATAATATTGATAGACATTGATAGACTTCTATCAGCATTTATCAGACTTCTACCATTTTTATCACTGATAGACCCTGATAGATTTCTATTAACATTTATCACAACTATCTAAAAATTTTGCTATTTTGTGTAAATAGTTTCTCTTATTTTTCTATTTTTAAAAACTCTTCTTAAATATATGTATTTTGATGTAGAATACTAATACAATCAGAATTAAATTGAAGTTTTTTTTTTAATATTTTCTTAAAAAAACCAAAGATTAAATACTAAAATATATATTTAGTCTGAAATTAAGTTAAAACAATCAAAACAAAATTGTTAATATATCATTTTTATCATTATATTTTTTTGTTTCATTTTATTTTAGTTCGTTTAATTTTAAAGGTACAATTTTAGTCTATATATTCTCAATATATCATAAATTTAGTCTTTTAATTTTCATAAAATAAAAGATACTATGAATATATTTTTATAATTTATAAAGAAAATGATAATAGAAACTAGTTTTTTTTTTTTTTTTTTCTTTTTCTTTTGCGAAAATCAAATATAAAGTAGATTAATTTAAGATTTATTGAAAATACTGATATTAAGATTGAACATTTGAAAGTATATAGGCTAAAATATAATAAATTTTAGGTATAAATAATGGTTTTTTAAAAAATAAATAATGATATTTTAACTCAATAATAATAGGTTGATTTTGGATCCATGCTGTACGGTGACTACAAAAACCTCCGAATCTTCGCCCAAAGAGAGAGAAAATAATTTATTTTTTGGATTTTTCCATTTCCATGAGCTCGGCATTTAGATTTTTAAAAATAAAAATAATAATAAAAAATAGAGAGAAATACGGGTTTGAGTGTAATTTCTATTTGGTATTTTAAATTGTTATATTTTTAGTCACTAAATTTTAAATTTAATTTCAATTTGGACTCTATGTTTCAAAATATTACACTTTTAATTTTTAAATTTTGAGTGTTATTTCAATTTAGTCCTTAGTTCAAAATTTATATTTTGGGACAGTTGTAAATATAACAATCAAGTACAAAGTATTAGAAGAAGATATAGCACAATGCAAAAAAATTGCAAATATAACAAAATTTATATTGTTAATTAGTTATAGACTACATCACTCGTAATCAGTGATAACCATATTGCTAGTACGAGTCTATTGGCGATAAACCATAACTTTACTATATTTGTAATTTTTTAAAAATGTTGCATACACTTAATTATTATCTCTAAAAATGATACTAATTGCAATTACCCTCTATATTTTTAACTTCGATTCTTCACTAAATATTCCCTTTTAATTTTGGCGTTAATGTCTATTAATTAATTCAAAATCATTATTAAGTGAAATTTTAAAATTAATTTTAATAATGATGAAAAGAGTAGTGAACTTAATTAATTGTAATTCTTTCAAATTAATTAATAGACATCAATACTAACGATTGAAAGTAAGTTTTTAGTGAAAAATTGATGTTAAAAGTATTAATATTGAAACTTAAGGTCCAAATTAAAACAAAATTCAAACCTCAAAAGTAAAATTGTAATATTTTGATATTTAAGGACTAAATTGAAATCAAACTCAAAATTTAAGGACTAAAAGTGTAACATTTTAGTGTTTATTATTTTCTATCTATCTTGACTCTCTACATTAGAATGTGTATTATAGGTTTAAATAATATTTTGGTCCTTCTAGTTTTGGGTTTGGTTCATTTTGGTCCTGGTACTTTCAAAATGTTCATTTTGGTCCTTAACTTTCAAGTTTGGTTCATTTTAGTCCTTGAACTTTCAAGAAGTGACCATTTTGATGCCTTAAAACTGAAGTAAAAAGGAAAGTGAAGAGGCCAAAATGGTCACTTTTTAAAAGTACAAAGATCAAAATGAACATTTAAGTACAAGGAGCAAAATGAACCAAAGTTGAAATAGAGACCAAACTAAACATTTTGAAAATATAGGAACACCAAAAGTACAGTAACAAAAATAATATTTAAATCTTTATTATAATTCTATCTACTATCTACACTAAAATAGTATCCACTAAAAAAAATAATAGTTAATAATTAATAACTATAATAATCAATTCAATGGGCAAACATATCTCAAAGTGGAAAAAAAAAAAAAGAGAGTAGTTTTTTCAATAATAAAAACCCAGTAGAAACGGCAGCATGAGTTTGGACAGAACGAGATTCCAACTGTTCCTCTCTCATCATCCCAATGGAATTGAAAATTCCACAAACCCTCATCTTTGACCTTTCTTCAGTTAGAAATTTTGGAATTTGAAGAAATGGGTTTCGTCTTTGCTACCAAATTTGCCCATACATTACAACATACCTTCTCCTAATTTCTTCTTCCTCTGTTTCCTCCATGGCTCAATCTCCAATTTCATGGAGGTCCTTCTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCCTTTGCCAACTGGTTTCGAGGCCTTTTCTTCATCCCCACTTGTTTAGCTCTCAATACTTTCCTCTTCATCCTCTTCTACATCTCTTCCACTTCCACTCCTAATTCCTTCCCTTCCCAAATCCCCACTCGTTTCTCCGACTCCTCTTCTAGCCTCGTTTCTTCTCTCAATCTTTCCATTACAGCTCTTCGGGTTTCTCAGAAGGTCAGTTTACAGGACGGCAATGGCCGCCCCCCTTTGCCTCTGCCTCTGCCTCAACCCTCTTCTCCTCTTCCCTCATTTGGTAAAGCCCTTGTTGTTTCTTTTATTTTTTCTGTTTTCTTCTTCTTCTTCTTTTCAGTGGAGGATAGGAAATTATCTGTAGCTTCGGAACTGATTGGTTGATGAATGAATGTTGTAGTTTTTGGATTTTGATGTGGAGTTTGTTGTATGAACTTGTTGGAAAATGAGTTTAGAGCTGTGATTTTCTTCTTGTCTGTTTGGTTTCCTAGAACCATAGGAGGAAGAACATAGGTAGGTTAATCACTGCTAAATCTAAAAGTTTAGGTTCTTTAGCTAACTATTTGGTTTTTTATTTTTAATTTTCGAAAATTAAATTTATAAACACCCTTCTTTCCGTTTTAAAGTTGTTACTTTTTATCTACTTTTGAGTCTTTGACCAATATTTTCAAAATTAAGGCAATTTTTGAAAACTAAAAAAAAAAATAGTTTTAAAAGTTTGATTTTGTTTTTGGAATTTTGCTAAGAATACAACTCTCTTAAGAAAAATGCAAATCATGGTAAGCAATAGAGAAAAAATAGACTTAGTTTTCAAAATAAAAAACAAACAACGAAATAGTTACTCAACATAGCTTTAATGAGTTTTTTGCGATTCAATAATTGATGTTGTTTATTGATGAGGGAGAAAAATGGCCTTACAAGGTTTCGAATACAAGACATAGTAGTAGAATTGCTTAAGTTAATAGTTTAATTTATATTTAAACTTTTATTCGTATTATTAGTGGTTTTGAGTTGAGCTTTCTCTGTTTTCTCCTCCTTCTAAGGTATAATCACTTTCCTATATCTCTAATGCAAGCTCAACTTGCCTTAATGGATGCCCTGCTTGTGTGTGTTTTAGGAAAGGGAAGCCATGAACAAGCCGAGGGAGTGTTCCATGATGAAGAGTTATTTCTTGAAGACTATAAAGAAATGAACAAGAGCTTCAAGATCTTCGTTTATCCTCACAAACGAAGTGATCCCTTTGCACGTTCTTTGTTGCCAGAGGACTTTGAGCCCCACGGCAACTATGCCAGTGAGAGTTACTTCAAGAAATCTCTCATCAAAAGCCATTTCATCACAAACAATCCCAAGGAGGCCGACTTCTTCTTTCTGCCATTCTCGATCACTGGCCTTCGCAACGATCGTCGGGTCAGCGTTAGCGGTATCCCCAACTTCATTCGAGATTACATCTTCGATGTTAGCCACAAGTATCCTTATTGGAATCGAACAGGTGGGGCGGACCATTTCTATGTTGCGTGCCATTCGGTTGGACGATCCGCCATGGATAAATCGAGTGAAGCTAAGTTGAGCATTGTCCAAGTTGTTTGCTCTTCTAGCTACTTCTTGACAGGCTACATTTCACATAAGGATGCAGCTCTGCCCCAAATCTGGCCTAGGAAACAAGACCCCCCAAATCTTGCTTCTTCAAAGAGGTGAGTTTTTCGCTGATGCAAAATGATGGATCTTCTGTTTTCCTCCTGCCTTTAAGCTTATTAGTTAGTCTGTTGATGTGAATGTAATGTGACATGTTGATTTATTGTTCCTTCTTTTAGGACGAGGCTGGCGTTTTTCGCAGGAGCGATGAACTCCCCGACTCGTCAAGAGCTCGTTCGAGTATGGGGTACGGACTCGGAGATCTTTGCTCATTCGGGCCGTCTTAAGACACCTTATGCTGATGAACTACTCAAGAGCAAATTTTGCCTTCATGTCAAAGGCTTTGAAGTAAACACTGCGCGAGTTGGAGACTCGATCTTTTACGGATGCGTTCCTGTGATCATTGCCAACTACTACGACCTCCCGTTTGGTGATATCTTGAATTGGAAGAGCTTTTCGGTCGTTGTAACGACATCAGATATCCCGAGACTGAAGGAAATCCTCAAGGGAATCAATGATGAGGAATATGCAATACTGCAAAGCAATGTGTTGAAAGTACGCAGACACTTCAAATGGCATTCTTCGCCCGTTGATTATGATACTTTTCACATGGTTATGTATCAGTTGTGGCTTCGAAGAACGTCAGTTCGACTTCCATTAACGGTCTAG

mRNA sequence

ATGGAAATAAACCTTGGAGACATTCCTTTCGACTTGGACTTCCATCCATCCGACCAATTGGTTGCAGCTGGTGTGATCGGGGGCAATCTCCACTTGTACCGTTATGATGCAAATGCTTTACCCCAAAGGCTCTTTAAAGTTCGTGCGCATGTCAAATCTTGCAGAGCTGTTCGATTCATCAATGACGGACGTGGAAATGATTGGCATTTTCATCCTTCAAATGATTGGTTTATTGCCTGTATATCAGCAATTTTGACAGGTTCTTCAGACCATTCCATTCTCTCTACGGATGTGGAGACTGGTTCTGTTATTGCTCGTCTTGAAGATGCACATGATGAAGCAGTCAGTAAATTGATCAACATAACCTCGGAAACCATTGCTTCAGGAGATGACAATGGGCGCATCAAGGTATGGGATACCAGACAACGATCTTGCTGCAGTTCTTTCAAAGCTCATAAAGATTATATTTCAGATATGACCTATTCATCTGATTCCATGAAGCTTTTGGCAACAAGTGGAGATGGGTCTCTATCTGTATGCAATCTTAGGAGAAACAAGATCCATGCTCGATCTGAGTTTTCAGAAGTAGAGCTGCTATCTGTTGTTATAATGAAGAATGGACGTAAAGTTATCTGTGGATCACAAACTGGGACTCTATTACTGTATTCATGGGGTTTCTTCCAGGACTGCAGTGATCGCTTTGTTGATGTCTCTCAAAATCCTGTGAATGCATTGCTAAAGCTTGACGAAGACAGAGTCATTGCTGGATCTGAGAGTGGACTCATCAGTCTGGTAGGCATATTGCCCAATAGAGTAATTCAACCAATTGCGGAACACTCTGACTACCCTGTGGAGCGGCTTGCTTTCTCCCATGACAGAAAGTTTCTCGGCAGTATTTCACATGATTACATGGTAAAGCTGTGGGATATGGATTTATTGCAAAGTTCTGGAAACGCTTTAAATGGTCGGGCCACAGTAGATGCAAGTGCCAGTAACAATATATTTACTTTCACAAATGGGTTTCGTCTTTGCTACCAAATTTGCCCATACATTACAACATACCTTCTCCTAATTTCTTCTTCCTCTGTTTCCTCCATGGCTCAATCTCCAATTTCATGGAGGTCCTTCTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCCTTTGCCAACTGGTTTCGAGGCCTTTTCTTCATCCCCACTTGTTTAGCTCTCAATACTTTCCTCTTCATCCTCTTCTACATCTCTTCCACTTCCACTCCTAATTCCTTCCCTTCCCAAATCCCCACTCGTTTCTCCGACTCCTCTTCTAGCCTCGTTTCTTCTCTCAATCTTTCCATTACAGCTCTTCGGGTTTCTCAGAAGGTCAGTTTACAGGACGGCAATGGCCGCCCCCCTTTGCCTCTGCCTCTGCCTCAACCCTCTTCTCCTCTTCCCTCATTTGGAAAGGGAAGCCATGAACAAGCCGAGGGAGTGTTCCATGATGAAGAGTTATTTCTTGAAGACTATAAAGAAATGAACAAGAGCTTCAAGATCTTCGTTTATCCTCACAAACGAAGTGATCCCTTTGCACGTTCTTTGTTGCCAGAGGACTTTGAGCCCCACGGCAACTATGCCAGTGAGAGTTACTTCAAGAAATCTCTCATCAAAAGCCATTTCATCACAAACAATCCCAAGGAGGCCGACTTCTTCTTTCTGCCATTCTCGATCACTGGCCTTCGCAACGATCGTCGGGTCAGCGTTAGCGGTATCCCCAACTTCATTCGAGATTACATCTTCGATGTTAGCCACAAGTATCCTTATTGGAATCGAACAGGTGGGGCGGACCATTTCTATGTTGCGTGCCATTCGGTTGGACGATCCGCCATGGATAAATCGAGTGAAGCTAAGTTGAGCATTGTCCAAGTTGTTTGCTCTTCTAGCTACTTCTTGACAGGCTACATTTCACATAAGGATGCAGCTCTGCCCCAAATCTGGCCTAGGAAACAAGACCCCCCAAATCTTGCTTCTTCAAAGAGGACGAGGCTGGCGTTTTTCGCAGGAGCGATGAACTCCCCGACTCGTCAAGAGCTCGTTCGAGTATGGGGTACGGACTCGGAGATCTTTGCTCATTCGGGCCGTCTTAAGACACCTTATGCTGATGAACTACTCAAGAGCAAATTTTGCCTTCATGTCAAAGGCTTTGAAGTAAACACTGCGCGAGTTGGAGACTCGATCTTTTACGGATGCGTTCCTGTGATCATTGCCAACTACTACGACCTCCCGTTTGGTGATATCTTGAATTGGAAGAGCTTTTCGGTCGTTGTAACGACATCAGATATCCCGAGACTGAAGGAAATCCTCAAGGGAATCAATGATGAGGAATATGCAATACTGCAAAGCAATGTGTTGAAAGTACGCAGACACTTCAAATGGCATTCTTCGCCCGTTGATTATGATACTTTTCACATGGTTATGTATCAGTTGTGGCTTCGAAGAACGTCAGTTCGACTTCCATTAACGGTCTAG

Coding sequence (CDS)

ATGGAAATAAACCTTGGAGACATTCCTTTCGACTTGGACTTCCATCCATCCGACCAATTGGTTGCAGCTGGTGTGATCGGGGGCAATCTCCACTTGTACCGTTATGATGCAAATGCTTTACCCCAAAGGCTCTTTAAAGTTCGTGCGCATGTCAAATCTTGCAGAGCTGTTCGATTCATCAATGACGGACGTGGAAATGATTGGCATTTTCATCCTTCAAATGATTGGTTTATTGCCTGTATATCAGCAATTTTGACAGGTTCTTCAGACCATTCCATTCTCTCTACGGATGTGGAGACTGGTTCTGTTATTGCTCGTCTTGAAGATGCACATGATGAAGCAGTCAGTAAATTGATCAACATAACCTCGGAAACCATTGCTTCAGGAGATGACAATGGGCGCATCAAGGTATGGGATACCAGACAACGATCTTGCTGCAGTTCTTTCAAAGCTCATAAAGATTATATTTCAGATATGACCTATTCATCTGATTCCATGAAGCTTTTGGCAACAAGTGGAGATGGGTCTCTATCTGTATGCAATCTTAGGAGAAACAAGATCCATGCTCGATCTGAGTTTTCAGAAGTAGAGCTGCTATCTGTTGTTATAATGAAGAATGGACGTAAAGTTATCTGTGGATCACAAACTGGGACTCTATTACTGTATTCATGGGGTTTCTTCCAGGACTGCAGTGATCGCTTTGTTGATGTCTCTCAAAATCCTGTGAATGCATTGCTAAAGCTTGACGAAGACAGAGTCATTGCTGGATCTGAGAGTGGACTCATCAGTCTGGTAGGCATATTGCCCAATAGAGTAATTCAACCAATTGCGGAACACTCTGACTACCCTGTGGAGCGGCTTGCTTTCTCCCATGACAGAAAGTTTCTCGGCAGTATTTCACATGATTACATGGTAAAGCTGTGGGATATGGATTTATTGCAAAGTTCTGGAAACGCTTTAAATGGTCGGGCCACAGTAGATGCAAGTGCCAGTAACAATATATTTACTTTCACAAATGGGTTTCGTCTTTGCTACCAAATTTGCCCATACATTACAACATACCTTCTCCTAATTTCTTCTTCCTCTGTTTCCTCCATGGCTCAATCTCCAATTTCATGGAGGTCCTTCTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCCTTTGCCAACTGGTTTCGAGGCCTTTTCTTCATCCCCACTTGTTTAGCTCTCAATACTTTCCTCTTCATCCTCTTCTACATCTCTTCCACTTCCACTCCTAATTCCTTCCCTTCCCAAATCCCCACTCGTTTCTCCGACTCCTCTTCTAGCCTCGTTTCTTCTCTCAATCTTTCCATTACAGCTCTTCGGGTTTCTCAGAAGGTCAGTTTACAGGACGGCAATGGCCGCCCCCCTTTGCCTCTGCCTCTGCCTCAACCCTCTTCTCCTCTTCCCTCATTTGGAAAGGGAAGCCATGAACAAGCCGAGGGAGTGTTCCATGATGAAGAGTTATTTCTTGAAGACTATAAAGAAATGAACAAGAGCTTCAAGATCTTCGTTTATCCTCACAAACGAAGTGATCCCTTTGCACGTTCTTTGTTGCCAGAGGACTTTGAGCCCCACGGCAACTATGCCAGTGAGAGTTACTTCAAGAAATCTCTCATCAAAAGCCATTTCATCACAAACAATCCCAAGGAGGCCGACTTCTTCTTTCTGCCATTCTCGATCACTGGCCTTCGCAACGATCGTCGGGTCAGCGTTAGCGGTATCCCCAACTTCATTCGAGATTACATCTTCGATGTTAGCCACAAGTATCCTTATTGGAATCGAACAGGTGGGGCGGACCATTTCTATGTTGCGTGCCATTCGGTTGGACGATCCGCCATGGATAAATCGAGTGAAGCTAAGTTGAGCATTGTCCAAGTTGTTTGCTCTTCTAGCTACTTCTTGACAGGCTACATTTCACATAAGGATGCAGCTCTGCCCCAAATCTGGCCTAGGAAACAAGACCCCCCAAATCTTGCTTCTTCAAAGAGGACGAGGCTGGCGTTTTTCGCAGGAGCGATGAACTCCCCGACTCGTCAAGAGCTCGTTCGAGTATGGGGTACGGACTCGGAGATCTTTGCTCATTCGGGCCGTCTTAAGACACCTTATGCTGATGAACTACTCAAGAGCAAATTTTGCCTTCATGTCAAAGGCTTTGAAGTAAACACTGCGCGAGTTGGAGACTCGATCTTTTACGGATGCGTTCCTGTGATCATTGCCAACTACTACGACCTCCCGTTTGGTGATATCTTGAATTGGAAGAGCTTTTCGGTCGTTGTAACGACATCAGATATCCCGAGACTGAAGGAAATCCTCAAGGGAATCAATGATGAGGAATATGCAATACTGCAAAGCAATGTGTTGAAAGTACGCAGACACTTCAAATGGCATTCTTCGCCCGTTGATTATGATACTTTTCACATGGTTATGTATCAGTTGTGGCTTCGAAGAACGTCAGTTCGACTTCCATTAACGGTCTAG

Protein sequence

MEINLGDIPFDLDFHPSDQLVAAGVIGGNLHLYRYDANALPQRLFKVRAHVKSCRAVRFINDGRGNDWHFHPSNDWFIACISAILTGSSDHSILSTDVETGSVIARLEDAHDEAVSKLINITSETIASGDDNGRIKVWDTRQRSCCSSFKAHKDYISDMTYSSDSMKLLATSGDGSLSVCNLRRNKIHARSEFSEVELLSVVIMKNGRKVICGSQTGTLLLYSWGFFQDCSDRFVDVSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRVIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMVKLWDMDLLQSSGNALNGRATVDASASNNIFTFTNGFRLCYQICPYITTYLLLISSSSVSSMAQSPISWRSFSSSSSSSSSSSSSFANWFRGLFFIPTCLALNTFLFILFYISSTSTPNSFPSQIPTRFSDSSSSLVSSLNLSITALRVSQKVSLQDGNGRPPLPLPLPQPSSPLPSFGKGSHEQAEGVFHDEELFLEDYKEMNKSFKIFVYPHKRSDPFARSLLPEDFEPHGNYASESYFKKSLIKSHFITNNPKEADFFFLPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYPYWNRTGGADHFYVACHSVGRSAMDKSSEAKLSIVQVVCSSSYFLTGYISHKDAALPQIWPRKQDPPNLASSKRTRLAFFAGAMNSPTRQELVRVWGTDSEIFAHSGRLKTPYADELLKSKFCLHVKGFEVNTARVGDSIFYGCVPVIIANYYDLPFGDILNWKSFSVVVTTSDIPRLKEILKGINDEEYAILQSNVLKVRRHFKWHSSPVDYDTFHMVMYQLWLRRTSVRLPLTV
Homology
BLAST of HG10019535 vs. NCBI nr
Match: XP_038903113.1 (probable glycosyltransferase At5g03795 [Benincasa hispida])

HSP 1 Score: 871.7 bits (2251), Expect = 5.1e-249
Identity = 433/473 (91.54%), Postives = 445/473 (94.08%), Query Frame = 0

Query: 366 MAQSPISWRSFSSSSSSSSSSSSSFANWFRGLFFIPTCLALNTFLFILFYISSTSTPNSF 425
           MAQSP  WRSFSSSSSSS        NWFRGLFFIPTCLALNTFLFILFYISSTSTPN F
Sbjct: 1   MAQSPFPWRSFSSSSSSS-------GNWFRGLFFIPTCLALNTFLFILFYISSTSTPNPF 60

Query: 426 PSQIPTRFSDSSSSLVSSLNLSITALRVSQKVSLQDGNGRPPLPLPLPQ----PSSPLPS 485
           PSQIPT F+DSS  LVSSLN SIT LRVSQKVSLQDGNG PPLP    Q    PSSPLPS
Sbjct: 61  PSQIPTHFADSSPRLVSSLNFSITTLRVSQKVSLQDGNGSPPLPQTQTQTQTLPSSPLPS 120

Query: 486 FGKGSHEQAEGVFHDEELFLEDYKEMNKSFKIFVYPHKRSDPFARSLLPEDFEPHGNYAS 545
           FG+GSHE+AEGVFHDEELFLEDYKEMNKSFKI+VYPHKRSDPFARSLLPEDFEPHGNYAS
Sbjct: 121 FGRGSHEKAEGVFHDEELFLEDYKEMNKSFKIYVYPHKRSDPFARSLLPEDFEPHGNYAS 180

Query: 546 ESYFKKSLIKSHFITNNPKEADFFFLPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYP 605
           ESYFKKSLIKSHFITN+PKEADFFFLPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYP
Sbjct: 181 ESYFKKSLIKSHFITNDPKEADFFFLPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYP 240

Query: 606 YWNRTGGADHFYVACHSVGRSAMDKSSEAKLSIVQVVCSSSYFLTGYISHKDAALPQIWP 665
           YWNRTGGADHFYVACHSVGRSAMDKSSEAK SI+QVVCSSSYFLTGYISHKDAALPQIWP
Sbjct: 241 YWNRTGGADHFYVACHSVGRSAMDKSSEAKSSIIQVVCSSSYFLTGYISHKDAALPQIWP 300

Query: 666 RKQDPPNLASSKRTRLAFFAGAMNSPTRQELVRVWGTDSEIFAHSGRLKTPYADELLKSK 725
           RK+DPPNLASSKRTRLAFFAGAMNSPTRQEL+RVWG D EIFAHSGRLKTPYADELLKSK
Sbjct: 301 RKEDPPNLASSKRTRLAFFAGAMNSPTRQELIRVWGKDLEIFAHSGRLKTPYADELLKSK 360

Query: 726 FCLHVKGFEVNTARVGDSIFYGCVPVIIANYYDLPFGDILNWKSFSVVVTTSDIPRLKEI 785
           FCLHVKGFEVNTARVGDSIFYGCVPVIIANYYDLPFGDILNWK+FS+VVTTSDIPRLKEI
Sbjct: 361 FCLHVKGFEVNTARVGDSIFYGCVPVIIANYYDLPFGDILNWKNFSIVVTTSDIPRLKEI 420

Query: 786 LKGINDEEYAILQSNVLKVRRHFKWHSSPVDYDTFHMVMYQLWLRRTSVRLPL 835
           LKGIND+EYAILQSNVLKVR+HFKWHSSPVDYDTFHMVMYQLWLRRTSVRLPL
Sbjct: 421 LKGINDKEYAILQSNVLKVRKHFKWHSSPVDYDTFHMVMYQLWLRRTSVRLPL 466

BLAST of HG10019535 vs. NCBI nr
Match: XP_008464966.1 (PREDICTED: probable glycosyltransferase At5g03795 [Cucumis melo])

HSP 1 Score: 857.8 bits (2215), Expect = 7.6e-245
Identity = 426/469 (90.83%), Postives = 442/469 (94.24%), Query Frame = 0

Query: 366 MAQSPISWRSFSSSSSSSSSSSSSFANWFRGLFFIPTCLALNTFLFILFYISSTSTPNSF 425
           M QSP+ WRSFSSS+SSS+SSS    NWFRGLFFIPTCLALN+ +FILFYISSTSTPN F
Sbjct: 1   MPQSPLPWRSFSSSTSSSTSSS----NWFRGLFFIPTCLALNSSIFILFYISSTSTPNHF 60

Query: 426 PSQIPTRFSDSSSSLVSSLNLSITALRVSQKVSLQDGNGRPPLPLPLPQPSSPLPSFGKG 485
           PSQIP+ F DSSS  VSSLNLSIT LRVSQK+SLQD NG PPLP    QP SPLPSFG+G
Sbjct: 61  PSQIPSHFPDSSSRPVSSLNLSITTLRVSQKISLQDDNGGPPLPQIQTQPFSPLPSFGRG 120

Query: 486 SHEQAEGVFHDEELFLEDYKEMNKSFKIFVYPHKRSDPFARSLLPEDFEPHGNYASESYF 545
           SHEQ EGVFHDEELFLEDYKEMNKSFKI+VYPHKRSDPFARSLLPEDFEPHGNYASESYF
Sbjct: 121 SHEQTEGVFHDEELFLEDYKEMNKSFKIYVYPHKRSDPFARSLLPEDFEPHGNYASESYF 180

Query: 546 KKSLIKSHFITNNPKEADFFFLPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYPYWNR 605
           KKSL KSHFITN+PKEADFFFLPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYPYWNR
Sbjct: 181 KKSLFKSHFITNDPKEADFFFLPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYPYWNR 240

Query: 606 TGGADHFYVACHSVGRSAMDKSSEAKLSIVQVVCSSSYFLTGYISHKDAALPQIWPRKQD 665
           TGGADHFYVACHSVGRSAMDKSSEAK SIVQVVCSSSYFLTGYISHKDAALPQIWPRK +
Sbjct: 241 TGGADHFYVACHSVGRSAMDKSSEAKSSIVQVVCSSSYFLTGYISHKDAALPQIWPRKDE 300

Query: 666 PPNLASSKRTRLAFFAGAMNSPTRQELVRVWGTDSEIFAHSGRLKTPYADELLKSKFCLH 725
           PPNLASSKRTRLAFFAGAMNSPTRQ L++VWG DSEIFA+SGRLKTPYADELL+SKFCLH
Sbjct: 301 PPNLASSKRTRLAFFAGAMNSPTRQALIQVWGKDSEIFAYSGRLKTPYADELLRSKFCLH 360

Query: 726 VKGFEVNTARVGDSIFYGCVPVIIANYYDLPFGDILNWKSFSVVVTTSDIPRLKEILKGI 785
           VKGFEVNTARVGDSIFYGCVPVIIANYYDLPFGDILNWKSFSVVVTTSDIPRLKEILKGI
Sbjct: 361 VKGFEVNTARVGDSIFYGCVPVIIANYYDLPFGDILNWKSFSVVVTTSDIPRLKEILKGI 420

Query: 786 NDEEYAILQSNVLKVRRHFKWHSSPVDYDTFHMVMYQLWLRRTSVRLPL 835
           NDE+YA LQSNVLKVRRHFKWHSSPVDYDTFHMVMYQLWLRRTSVRLPL
Sbjct: 421 NDEQYARLQSNVLKVRRHFKWHSSPVDYDTFHMVMYQLWLRRTSVRLPL 465

BLAST of HG10019535 vs. NCBI nr
Match: XP_004141132.2 (probable glycosyltransferase At5g03795 [Cucumis sativus] >KGN59748.1 hypothetical protein Csa_001809 [Cucumis sativus])

HSP 1 Score: 849.7 bits (2194), Expect = 2.1e-242
Identity = 425/469 (90.62%), Postives = 439/469 (93.60%), Query Frame = 0

Query: 366 MAQSPISWRSFSSSSSSSSSSSSSFANWFRGLFFIPTCLALNTFLFILFYISSTSTPNSF 425
           M QSP  WRS SSSS+SSSS+     NWFRGLFFIPTCLALN+FLFILFYISSTSTPN F
Sbjct: 1   MPQSPFPWRSSSSSSTSSSST-----NWFRGLFFIPTCLALNSFLFILFYISSTSTPNPF 60

Query: 426 PSQIPTRFSDSSSSLVSSLNLSITALRVSQKVSLQDGNGRPPLPLPLPQPSSPLPSFGKG 485
           PSQIP+ FSDSSS  VSSLNLSIT LRV+QKVSL D NG PPLP    QP  PLPSFGK 
Sbjct: 61  PSQIPSHFSDSSSRYVSSLNLSITTLRVAQKVSLPDDNGGPPLPQFQTQPFPPLPSFGKR 120

Query: 486 SHEQAEGVFHDEELFLEDYKEMNKSFKIFVYPHKRSDPFARSLLPEDFEPHGNYASESYF 545
            HEQAEGVFHDEELFLEDYKEMNKSFKI+VYPHKRSDPFARSLLPE+FEPHGNYASESYF
Sbjct: 121 IHEQAEGVFHDEELFLEDYKEMNKSFKIYVYPHKRSDPFARSLLPENFEPHGNYASESYF 180

Query: 546 KKSLIKSHFITNNPKEADFFFLPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYPYWNR 605
           KKSLIKSHFITN+PKEADFFFLPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYPYWNR
Sbjct: 181 KKSLIKSHFITNDPKEADFFFLPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYPYWNR 240

Query: 606 TGGADHFYVACHSVGRSAMDKSSEAKLSIVQVVCSSSYFLTGYISHKDAALPQIWPRKQD 665
           TGGADHFYVACHSVGRSAMDKSSEAK SIVQVVCSSSYFLTGYISHKDAALPQIWPRK+D
Sbjct: 241 TGGADHFYVACHSVGRSAMDKSSEAKSSIVQVVCSSSYFLTGYISHKDAALPQIWPRKED 300

Query: 666 PPNLASSKRTRLAFFAGAMNSPTRQELVRVWGTDSEIFAHSGRLKTPYADELLKSKFCLH 725
           P NLASSKRTRLAFFAGAMNSPTRQ LV+VWG DSEIFA+SGRLKTPYADELL+SKFCLH
Sbjct: 301 PSNLASSKRTRLAFFAGAMNSPTRQALVQVWGKDSEIFAYSGRLKTPYADELLRSKFCLH 360

Query: 726 VKGFEVNTARVGDSIFYGCVPVIIANYYDLPFGDILNWKSFSVVVTTSDIPRLKEILKGI 785
           VKGFEVNTARVGDSIFYGCVPVIIANYYDLPFGDILNWKSFS+VVTTSDIPRLKEILKGI
Sbjct: 361 VKGFEVNTARVGDSIFYGCVPVIIANYYDLPFGDILNWKSFSIVVTTSDIPRLKEILKGI 420

Query: 786 NDEEYAILQSNVLKVRRHFKWHSSPVDYDTFHMVMYQLWLRRTSVRLPL 835
           NDEEYA LQSNVLKVR+HFKWHSSPVDYDTFHMVMYQLWLRRTSVRLPL
Sbjct: 421 NDEEYARLQSNVLKVRKHFKWHSSPVDYDTFHMVMYQLWLRRTSVRLPL 464

BLAST of HG10019535 vs. NCBI nr
Match: XP_022932772.1 (probable glycosyltransferase At5g03795 [Cucurbita moschata])

HSP 1 Score: 748.8 bits (1932), Expect = 5.0e-212
Identity = 374/459 (81.48%), Postives = 400/459 (87.15%), Query Frame = 0

Query: 394 FRGLFFIPTCLALNTFLFILFYISSTSTPNSFPS------------------QIPTRFSD 453
           FR LFFIPTCLALN FLFILFY SS+S+ +S  S                  QIPT+F D
Sbjct: 10  FRTLFFIPTCLALNAFLFILFYTSSSSSSSSSSSSSSSSTHTFFFSPPSQLRQIPTQFHD 69

Query: 454 SSSSLVSSLNLSITALRVSQKVSLQDGNGRPPLPLPLPQPSSPLPSFGKGSHEQAEGVFH 513
           SS  L+SS          +Q +SL+D NG P   LP  QPSS LPSFG+  HEQ +GVFH
Sbjct: 70  SSIPLLSS----------TQNLSLEDANGGP--ALPHTQPSSSLPSFGRRMHEQTKGVFH 129

Query: 514 DEELFLEDYKEMNKSFKIFVYPHKRSDPFARSLLPEDFEPHGNYASESYFKKSLIKSHFI 573
           DE+LFLEDYKEMNKSFKI+VYPHKR+DPFARSLLPEDFEPHGNYASESYFK+SL KSHFI
Sbjct: 130 DEDLFLEDYKEMNKSFKIYVYPHKRNDPFARSLLPEDFEPHGNYASESYFKQSLFKSHFI 189

Query: 574 TNNPKEADFFFLPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYPYWNRTGGADHFYVA 633
            N+PK+ADFFFLPFSITGLRNDRRVSVSGIP+FIRDYIF V+HKYPYWNRTGGADHFYVA
Sbjct: 190 VNDPKDADFFFLPFSITGLRNDRRVSVSGIPDFIRDYIFSVTHKYPYWNRTGGADHFYVA 249

Query: 634 CHSVGRSAMDKSSEAKLSIVQVVCSSSYFLTGYISHKDAALPQIWPRKQDPPNLASSKRT 693
           CHSVGRSAMDKSSEAK S+VQVVCSSSYFL GYISHKDAALPQIWPRKQDPPNL SSKRT
Sbjct: 250 CHSVGRSAMDKSSEAKSSVVQVVCSSSYFLPGYISHKDAALPQIWPRKQDPPNLTSSKRT 309

Query: 694 RLAFFAGAMNSPTRQELVRVWGTDSEIFAHSGRLKTPYADELLKSKFCLHVKGFEVNTAR 753
           RLAFFAGAMNSPTRQELVRVWG D+EIFAHSGRLKTPYADELL+SKFCLHVKGFEVNTAR
Sbjct: 310 RLAFFAGAMNSPTRQELVRVWGKDAEIFAHSGRLKTPYADELLRSKFCLHVKGFEVNTAR 369

Query: 754 VGDSIFYGCVPVIIANYYDLPFGDILNWKSFSVVVTTSDIPRLKEILKGINDEEYAILQS 813
           VGDSIFYGCVPVII+NYYDLPFGDILNWKSFSVVV TSDIP LK+ILKGI+DEEYAILQS
Sbjct: 370 VGDSIFYGCVPVIISNYYDLPFGDILNWKSFSVVVATSDIPSLKKILKGISDEEYAILQS 429

Query: 814 NVLKVRRHFKWHSSPVDYDTFHMVMYQLWLRRTSVRLPL 835
           NVLKVR+HFKWHSSPVD+DTFHMVMYQLWLRRTSVRL L
Sbjct: 430 NVLKVRKHFKWHSSPVDFDTFHMVMYQLWLRRTSVRLRL 456

BLAST of HG10019535 vs. NCBI nr
Match: XP_023540773.1 (probable glycosyltransferase At5g03795 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 746.5 bits (1926), Expect = 2.5e-211
Identity = 371/455 (81.54%), Postives = 398/455 (87.47%), Query Frame = 0

Query: 394 FRGLFFIPTCLALNTFLFILFYISSTS--------------TPNSFPSQIPTRFSDSSSS 453
           FR LFFIPTCLALNTFL ILFY SS+S              +P S P Q PT+F DSS  
Sbjct: 10  FRTLFFIPTCLALNTFLLILFYTSSSSSSSSSSSSTHTFFFSPPSHPRQTPTQFHDSSIP 69

Query: 454 LVSSLNLSITALRVSQKVSLQDGNGRPPLPLPLPQPSSPLPSFGKGSHEQAEGVFHDEEL 513
           L+SS          +Q +SL+D N  P   LP  QPSS LP FG+  H+QA+GVFHDE+L
Sbjct: 70  LLSS----------AQNLSLEDANAGP--ALPHTQPSSSLPPFGRRMHDQAKGVFHDEDL 129

Query: 514 FLEDYKEMNKSFKIFVYPHKRSDPFARSLLPEDFEPHGNYASESYFKKSLIKSHFITNNP 573
           FLEDYKEMNKSFKI+VYPHKR+DPFARSLLPEDFEPHGNYASESYFK+SL KSHFI N+P
Sbjct: 130 FLEDYKEMNKSFKIYVYPHKRNDPFARSLLPEDFEPHGNYASESYFKQSLFKSHFIVNDP 189

Query: 574 KEADFFFLPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYPYWNRTGGADHFYVACHSV 633
           K+ADFFFLPFSITGLRNDRRVSVSGIP+FIRDYIF V+HKYPYWNRTGGADHFYVACHSV
Sbjct: 190 KDADFFFLPFSITGLRNDRRVSVSGIPDFIRDYIFSVTHKYPYWNRTGGADHFYVACHSV 249

Query: 634 GRSAMDKSSEAKLSIVQVVCSSSYFLTGYISHKDAALPQIWPRKQDPPNLASSKRTRLAF 693
           GRSAMDKSSEAK S+VQVVCSSSYFL GYISHKDAALPQIWPRKQDPPNL SSKRTRLAF
Sbjct: 250 GRSAMDKSSEAKSSVVQVVCSSSYFLPGYISHKDAALPQIWPRKQDPPNLTSSKRTRLAF 309

Query: 694 FAGAMNSPTRQELVRVWGTDSEIFAHSGRLKTPYADELLKSKFCLHVKGFEVNTARVGDS 753
           FAGAMNSPTRQELVRVWG D+EIFAHSGRLKTPYADELL+SKFCLHVKGFEVNTARVGDS
Sbjct: 310 FAGAMNSPTRQELVRVWGKDAEIFAHSGRLKTPYADELLRSKFCLHVKGFEVNTARVGDS 369

Query: 754 IFYGCVPVIIANYYDLPFGDILNWKSFSVVVTTSDIPRLKEILKGINDEEYAILQSNVLK 813
           IFYGCVPVII+NYYDLPFGDILNWKSFSVVV TSDIP LK+ILKGI+DEEYA+LQSNVLK
Sbjct: 370 IFYGCVPVIISNYYDLPFGDILNWKSFSVVVATSDIPSLKKILKGISDEEYAMLQSNVLK 429

Query: 814 VRRHFKWHSSPVDYDTFHMVMYQLWLRRTSVRLPL 835
           VR+HFKWHSSPVD+DTFHMVMYQLWLRRTSVRL L
Sbjct: 430 VRKHFKWHSSPVDFDTFHMVMYQLWLRRTSVRLRL 452

BLAST of HG10019535 vs. ExPASy Swiss-Prot
Match: O80775 (WD repeat-containing protein 55 OS=Arabidopsis thaliana OX=3702 GN=WDR55 PE=1 SV=2)

HSP 1 Score: 412.9 bits (1060), Expect = 8.5e-114
Identity = 206/333 (61.86%), Postives = 259/333 (77.78%), Query Frame = 0

Query: 1   MEINLGDIPFDLDFHPSDQLVAAGVIGGNLHLYRYDANALPQRLFKVRAHVKSCRAVRFI 60
           MEI+LG   F +DFHPS  LVAAG+I G+LHLYRYD+++   R  KVRAH +SCRAVRFI
Sbjct: 1   MEIDLGANAFGIDFHPSTNLVAAGLIDGHLHLYRYDSDSSLVRERKVRAHKESCRAVRFI 60

Query: 61  NDGRGNDWHFHPSNDWFIACISAILTGSSDHSILSTDVETGSVIARLEDAHDEAVSKLIN 120
           +DG+                   I+T S+D SIL+TDVETG+ +A LE+AH++AV+ LIN
Sbjct: 61  DDGQ------------------RIVTASADCSILATDVETGAQVAHLENAHEDAVNTLIN 120

Query: 121 ITSETIASGDDNGRIKVWDTRQRSCCSSFKAHKDYISDMTYSSDSMKLLATSGDGSLSVC 180
           +T  TIASGDD G +K+WDTRQRSC   F AH+DYIS MT++SDSMKL+ TSGDG+LSVC
Sbjct: 121 VTETTIASGDDKGCVKIWDTRQRSCSHEFNAHEDYISGMTFASDSMKLVVTSGDGTLSVC 180

Query: 181 NLRRNKIHARSEFSEVELLSVVIMKNGRKVICGSQTGTLLLYSWGFFQDCSDRFVDVSQN 240
           NLR +K+ ++SEFSE ELLSVVIMKNGRKVICG+Q GTLLLYSWGFF+DCSDRFVD++ N
Sbjct: 181 NLRTSKVQSQSEFSEDELLSVVIMKNGRKVICGTQNGTLLLYSWGFFKDCSDRFVDLAPN 240

Query: 241 PVNALLKLDEDRVIAGSESGLISLVGILPNRVIQPIAEHSDYPVERLAFSHDRKFLGSIS 300
            V+ALLKLDEDR+I G ++G+ISLVGILPNR+IQPI  H DYP+E LA SHD+KFLGS +
Sbjct: 241 SVDALLKLDEDRLITGCDNGIISLVGILPNRIIQPIGSH-DYPIEDLALSHDKKFLGSTA 300

Query: 301 HDYMVKLWDMDLLQSSGNALNGRATVDASASNN 334
           HD M+KLW+++ +    N  +G A+  A  S++
Sbjct: 301 HDSMLKLWNLEEILEGSNVNSGNASGAAEDSDS 314

BLAST of HG10019535 vs. ExPASy Swiss-Prot
Match: Q9FFN2 (Probable glycosyltransferase At5g03795 OS=Arabidopsis thaliana OX=3702 GN=At5g03795 PE=3 SV=2)

HSP 1 Score: 224.6 bits (571), Expect = 4.3e-57
Identity = 131/353 (37.11%), Postives = 200/353 (56.66%), Query Frame = 0

Query: 493 VFHDEELFLEDYKEMNKSFKIFVYPHKRSDPFARSLLPEDFEPHGNYASE-SYFKKSLIK 552
           ++ + ++F   Y EM K FKI+VY  K  +P     L  D      Y+ E S+  +    
Sbjct: 172 MYWNAKVFHRSYLEMEKQFKIYVY--KEGEP----PLFHDGPCKSIYSMEGSFIYEIETD 231

Query: 553 SHFITNNPKEADFFFLPFSITGL------RNDRRVSVSGIPNFIRDYIFDVSHKYPYWNR 612
           + F TNNP +A  F+LPFS+  +      RN R    S I N ++DYI  V  KYPYWNR
Sbjct: 232 TRFRTNNPDKAHVFYLPFSVVKMVRYVYERNSR--DFSPIRNTVKDYINLVGDKYPYWNR 291

Query: 613 TGGADHFYVACHSVGRSAMDKSSEAKLSIVQVVCSSSYFLTGYISHKDAALPQIWPRKQD 672
           + GADHF ++CH  G  A         + ++ +C+++     +   KD ++P+I  R   
Sbjct: 292 SIGADHFILSCHDWGPEASFSHPHLGHNSIRALCNAN-TSERFKPRKDVSIPEINLRTGS 351

Query: 673 PPNL----ASSKRTRLAFFAGAMNSPTRQELVRVW-GTDSEIFAHSGRLK-TPYADELLK 732
              L    + S R  LAFFAG ++ P R  L++ W   D++I  H    + T Y+D +  
Sbjct: 352 LTGLVGGPSPSSRPILAFFAGGVHGPVRPVLLQHWENKDNDIRVHKYLPRGTSYSDMMRN 411

Query: 733 SKFCLHVKGFEVNTARVGDSIFYGCVPVIIANYYDLPFGDILNWKSFSVVVTTSDIPRLK 792
           SKFC+   G+EV + R+ ++++ GCVPV+I + Y  PF D+LNW+SFSV+V+  DIP LK
Sbjct: 412 SKFCICPSGYEVASPRIVEALYSGCVPVLINSGYVPPFSDVLNWRSFSVIVSVEDIPNLK 471

Query: 793 EILKGINDEEYAILQSNVLKVRRHFKWHSSPVDYDTFHMVMYQLWLRRTSVRL 833
            IL  I+  +Y  +   VLKVRRHF+ +S    +D FHM+++ +W+RR +V++
Sbjct: 472 TILTSISPRQYLRMYRRVLKVRRHFEVNSPAKRFDVFHMILHSIWVRRLNVKI 515

BLAST of HG10019535 vs. ExPASy Swiss-Prot
Match: Q9LFP3 (Probable glycosyltransferase At5g11130 OS=Arabidopsis thaliana OX=3702 GN=At5g11130/At5g11120 PE=3 SV=2)

HSP 1 Score: 221.9 bits (564), Expect = 2.8e-56
Identity = 127/353 (35.98%), Postives = 194/353 (54.96%), Query Frame = 0

Query: 500 FLEDYKEMNKSFKIFVYPHKRSDPFARSLLPEDFEPHGNYASESYFKKSLI--KSHFITN 559
           F + +KEM K FKI+ Y    +  F +  L      +  YA E  F   +    S F   
Sbjct: 138 FHQSHKEMEKRFKIWTYREGEAPLFHKGPL------NNIYAIEGQFMDEIENGNSRFKAA 197

Query: 560 NPKEADFFFLPFSITGL-----RNDRRVSVSGIPNFIRDYIFDVSHKYPYWNRTGGADHF 619
           +P+EA  F++P  I  +     R     +   + N ++DYI  +S++YPYWNR+ GADHF
Sbjct: 198 SPEEATVFYIPVGIVNIIRFVYRPYTSYARDRLQNIVKDYISLISNRYPYWNRSRGADHF 257

Query: 620 YVACHSVGRSAMDKSSEAKLSIVQVVCSSSYFLTGYISHKDAALPQI-WPRKQ------- 679
           +++CH           E     ++ +C+++    G+   +D +LP+I  P  Q       
Sbjct: 258 FLSCHDWAPDVSAVDPELYKHFIRALCNAN-SSEGFTPMRDVSLPEINIPHSQLGFVHTG 317

Query: 680 DPPNLASSKRTRLAFFAGAMNSPTRQELVRVW-GTDSEIFAHSGRLKT-PYADELLKSKF 739
           +PP      R  LAFFAG  +   R+ L + W   D ++  +    KT  Y   + K+KF
Sbjct: 318 EPP----QNRKLLAFFAGGSHGDVRKILFQHWKEKDKDVLVYENLPKTMNYTKMMDKAKF 377

Query: 740 CLHVKGFEVNTARVGDSIFYGCVPVIIANYYDLPFGDILNWKSFSVVVTTSDIPRLKEIL 799
           CL   G+EV + R+ +S++ GCVPVIIA+YY LPF D+LNWK+FSV +  S +P +K+IL
Sbjct: 378 CLCPSGWEVASPRIVESLYSGCVPVIIADYYVLPFSDVLNWKTFSVHIPISKMPDIKKIL 437

Query: 800 KGINDEEYAILQSNVLKVRRHFKWHSSPVDYDTFHMVMYQLWLRRTSVRLPLT 836
           + I +EEY  +Q  VL+VR+HF  +     YD  HM+M+ +WLRR +VR+PL+
Sbjct: 438 EAITEEEYLNMQRRVLEVRKHFVINRPSKPYDMLHMIMHSIWLRRLNVRIPLS 479

BLAST of HG10019535 vs. ExPASy Swiss-Prot
Match: Q3E7Q9 (Probable glycosyltransferase At5g25310 OS=Arabidopsis thaliana OX=3702 GN=At5g25310 PE=3 SV=2)

HSP 1 Score: 209.1 bits (531), Expect = 1.9e-52
Identity = 122/355 (34.37%), Postives = 194/355 (54.65%), Query Frame = 0

Query: 493 VFHDEELFLEDYKEMNKSFKIFVYPHKRSDPFARSLLPEDFEPHGNYASESYFKKSLIK- 552
           ++ +       Y EM K FK++VY  +  +P     L  D      YA E  F   + K 
Sbjct: 133 IYRNPSALYRSYLEMEKRFKVYVY--EEGEP----PLVHDGPCKSVYAVEGRFITEMEKR 192

Query: 553 -SHFITNNPKEADFFFLPFSITGLRN---DRRVSVSGIPNFIRDYIFDVSHKYPYWNRTG 612
            + F T +P +A  +FLPFS+T L     +       +  F+ DYI  VS  +P+WNRT 
Sbjct: 193 RTKFRTYDPNQAYVYFLPFSVTWLVRYLYEGNSDAKPLKTFVSDYIRLVSTNHPFWNRTN 252

Query: 613 GADHFYVACHSVGRSAMDKSSEAKLSIVQVVCSSSYFLTGYISHKDAALPQIW------- 672
           GADHF + CH  G      + +   + ++V+C+++    G+   KD  LP+I        
Sbjct: 253 GADHFMLTCHDWGPLTSQANRDLFNTSIRVMCNAN-SSEGFNPTKDVTLPEIKLYGGEVD 312

Query: 673 PRKQDPPNLASSKRTRLAFFAGAMNSPTRQELVRVW---GTDSEIFAHSGRLKTPYADEL 732
            + +    L++S R  L FFAG ++ P R  L++ W     D  ++ +  +    Y D +
Sbjct: 313 HKLRLSKTLSASPRPYLGFFAGGVHGPVRPILLKHWKQRDLDMPVYEYLPK-HLNYYDFM 372

Query: 733 LKSKFCLHVKGFEVNTARVGDSIFYGCVPVIIANYYDLPFGDILNWKSFSVVVTTSDIPR 792
             SKFC    G+EV + RV ++I+  C+PVI++  + LPF D+L W++FSV+V  S+IPR
Sbjct: 373 RSSKFCFCPSGYEVASPRVIEAIYSECIPVILSVNFVLPFTDVLRWETFSVLVDVSEIPR 432

Query: 793 LKEILKGINDEEYAILQSNVLKVRRHFKWHSSPVDYDTFHMVMYQLWLRRTSVRL 833
           LKEIL  I++E+Y  L+SN+  VRRHF+ +  P  +D FH+ ++ +WLRR +++L
Sbjct: 433 LKEILMSISNEKYEWLKSNLRYVRRHFELNDPPQRFDAFHLTLHSIWLRRLNLKL 479

BLAST of HG10019535 vs. ExPASy Swiss-Prot
Match: Q54SA5 (WD repeat-containing protein 55 homolog OS=Dictyostelium discoideum OX=44689 GN=wdr55 PE=3 SV=1)

HSP 1 Score: 208.8 bits (530), Expect = 2.4e-52
Identity = 111/309 (35.92%), Postives = 180/309 (58.25%), Query Frame = 0

Query: 2   EINLGDIPFDLDFHPSDQLVAAGVIGGNLHLYRYDANALPQRLFKVRAHVKSCRAVRFIN 61
           +I+   IPF L+FHP++ L+      G L L++Y  +    +L  +R H   CR   F +
Sbjct: 15  DISFNTIPFSLNFHPTEDLLVVSDAEGRLKLFKYSLDENEVKL-SLRPHQSGCRQANFSS 74

Query: 62  DGRGNDWHFHPSNDWFIACISAILTGSSDHSILSTDVETGSVIARLEDAHDEAVSKLINI 121
           DG+                   I T SSD S+   D+ TGS++   E+AHD  ++ L++ 
Sbjct: 75  DGK------------------YIFTASSDCSMKVIDINTGSILYTREEAHDYPINCLVS- 134

Query: 122 TSETIASGDDNGRIKVWDTRQRSCCSSFKAHKDYISDMTYSSDSMKLLATSGDGSLSVCN 181
               + +GDD G IKVWD RQ++    F+ H D+ISD+T + D   + ATSGDG +S+ N
Sbjct: 135 KEFMVFTGDDEGTIKVWDMRQQNIVCEFQEHGDFISDIT-TIDDRHIAATSGDGGVSIYN 194

Query: 182 LRRNKIHARSEFSEVELLSVVIMKNGRKVICGSQTGTLLLYSWGFFQDCSDRFVDVSQNP 241
             R  +   SE S+ ELLS + + NG+K++CGSQ G++L+Y     ++   +F    Q+ 
Sbjct: 195 FVRKSMDDISEKSDNELLSCLSLDNGQKLVCGSQDGSILIYDRNNLENVK-KFAGHPQS- 254

Query: 242 VNALLKLDEDRVIAGSESGLISLVGILPNRVIQPIAEHSDYPVERLAFSHDRKFLGSISH 301
           V+AL+K++ +   +GS  G+I  +G+ P +++  + EHS +P+ER+A S D ++LGSISH
Sbjct: 255 VDALVKVNNNTFFSGSSDGIIRFIGLRPKKLLGVVGEHSTFPIERMAISRDNRYLGSISH 300

Query: 302 DYMVKLWDM 311
           D+ +K W++
Sbjct: 315 DFSLKFWNV 300

BLAST of HG10019535 vs. ExPASy TrEMBL
Match: A0A1S3CMU3 (probable glycosyltransferase At5g03795 OS=Cucumis melo OX=3656 GN=LOC103502710 PE=3 SV=1)

HSP 1 Score: 857.8 bits (2215), Expect = 3.7e-245
Identity = 426/469 (90.83%), Postives = 442/469 (94.24%), Query Frame = 0

Query: 366 MAQSPISWRSFSSSSSSSSSSSSSFANWFRGLFFIPTCLALNTFLFILFYISSTSTPNSF 425
           M QSP+ WRSFSSS+SSS+SSS    NWFRGLFFIPTCLALN+ +FILFYISSTSTPN F
Sbjct: 1   MPQSPLPWRSFSSSTSSSTSSS----NWFRGLFFIPTCLALNSSIFILFYISSTSTPNHF 60

Query: 426 PSQIPTRFSDSSSSLVSSLNLSITALRVSQKVSLQDGNGRPPLPLPLPQPSSPLPSFGKG 485
           PSQIP+ F DSSS  VSSLNLSIT LRVSQK+SLQD NG PPLP    QP SPLPSFG+G
Sbjct: 61  PSQIPSHFPDSSSRPVSSLNLSITTLRVSQKISLQDDNGGPPLPQIQTQPFSPLPSFGRG 120

Query: 486 SHEQAEGVFHDEELFLEDYKEMNKSFKIFVYPHKRSDPFARSLLPEDFEPHGNYASESYF 545
           SHEQ EGVFHDEELFLEDYKEMNKSFKI+VYPHKRSDPFARSLLPEDFEPHGNYASESYF
Sbjct: 121 SHEQTEGVFHDEELFLEDYKEMNKSFKIYVYPHKRSDPFARSLLPEDFEPHGNYASESYF 180

Query: 546 KKSLIKSHFITNNPKEADFFFLPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYPYWNR 605
           KKSL KSHFITN+PKEADFFFLPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYPYWNR
Sbjct: 181 KKSLFKSHFITNDPKEADFFFLPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYPYWNR 240

Query: 606 TGGADHFYVACHSVGRSAMDKSSEAKLSIVQVVCSSSYFLTGYISHKDAALPQIWPRKQD 665
           TGGADHFYVACHSVGRSAMDKSSEAK SIVQVVCSSSYFLTGYISHKDAALPQIWPRK +
Sbjct: 241 TGGADHFYVACHSVGRSAMDKSSEAKSSIVQVVCSSSYFLTGYISHKDAALPQIWPRKDE 300

Query: 666 PPNLASSKRTRLAFFAGAMNSPTRQELVRVWGTDSEIFAHSGRLKTPYADELLKSKFCLH 725
           PPNLASSKRTRLAFFAGAMNSPTRQ L++VWG DSEIFA+SGRLKTPYADELL+SKFCLH
Sbjct: 301 PPNLASSKRTRLAFFAGAMNSPTRQALIQVWGKDSEIFAYSGRLKTPYADELLRSKFCLH 360

Query: 726 VKGFEVNTARVGDSIFYGCVPVIIANYYDLPFGDILNWKSFSVVVTTSDIPRLKEILKGI 785
           VKGFEVNTARVGDSIFYGCVPVIIANYYDLPFGDILNWKSFSVVVTTSDIPRLKEILKGI
Sbjct: 361 VKGFEVNTARVGDSIFYGCVPVIIANYYDLPFGDILNWKSFSVVVTTSDIPRLKEILKGI 420

Query: 786 NDEEYAILQSNVLKVRRHFKWHSSPVDYDTFHMVMYQLWLRRTSVRLPL 835
           NDE+YA LQSNVLKVRRHFKWHSSPVDYDTFHMVMYQLWLRRTSVRLPL
Sbjct: 421 NDEQYARLQSNVLKVRRHFKWHSSPVDYDTFHMVMYQLWLRRTSVRLPL 465

BLAST of HG10019535 vs. ExPASy TrEMBL
Match: A0A0A0LD73 (Exostosin domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G842690 PE=3 SV=1)

HSP 1 Score: 849.7 bits (2194), Expect = 1.0e-242
Identity = 425/469 (90.62%), Postives = 439/469 (93.60%), Query Frame = 0

Query: 366 MAQSPISWRSFSSSSSSSSSSSSSFANWFRGLFFIPTCLALNTFLFILFYISSTSTPNSF 425
           M QSP  WRS SSSS+SSSS+     NWFRGLFFIPTCLALN+FLFILFYISSTSTPN F
Sbjct: 1   MPQSPFPWRSSSSSSTSSSST-----NWFRGLFFIPTCLALNSFLFILFYISSTSTPNPF 60

Query: 426 PSQIPTRFSDSSSSLVSSLNLSITALRVSQKVSLQDGNGRPPLPLPLPQPSSPLPSFGKG 485
           PSQIP+ FSDSSS  VSSLNLSIT LRV+QKVSL D NG PPLP    QP  PLPSFGK 
Sbjct: 61  PSQIPSHFSDSSSRYVSSLNLSITTLRVAQKVSLPDDNGGPPLPQFQTQPFPPLPSFGKR 120

Query: 486 SHEQAEGVFHDEELFLEDYKEMNKSFKIFVYPHKRSDPFARSLLPEDFEPHGNYASESYF 545
            HEQAEGVFHDEELFLEDYKEMNKSFKI+VYPHKRSDPFARSLLPE+FEPHGNYASESYF
Sbjct: 121 IHEQAEGVFHDEELFLEDYKEMNKSFKIYVYPHKRSDPFARSLLPENFEPHGNYASESYF 180

Query: 546 KKSLIKSHFITNNPKEADFFFLPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYPYWNR 605
           KKSLIKSHFITN+PKEADFFFLPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYPYWNR
Sbjct: 181 KKSLIKSHFITNDPKEADFFFLPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYPYWNR 240

Query: 606 TGGADHFYVACHSVGRSAMDKSSEAKLSIVQVVCSSSYFLTGYISHKDAALPQIWPRKQD 665
           TGGADHFYVACHSVGRSAMDKSSEAK SIVQVVCSSSYFLTGYISHKDAALPQIWPRK+D
Sbjct: 241 TGGADHFYVACHSVGRSAMDKSSEAKSSIVQVVCSSSYFLTGYISHKDAALPQIWPRKED 300

Query: 666 PPNLASSKRTRLAFFAGAMNSPTRQELVRVWGTDSEIFAHSGRLKTPYADELLKSKFCLH 725
           P NLASSKRTRLAFFAGAMNSPTRQ LV+VWG DSEIFA+SGRLKTPYADELL+SKFCLH
Sbjct: 301 PSNLASSKRTRLAFFAGAMNSPTRQALVQVWGKDSEIFAYSGRLKTPYADELLRSKFCLH 360

Query: 726 VKGFEVNTARVGDSIFYGCVPVIIANYYDLPFGDILNWKSFSVVVTTSDIPRLKEILKGI 785
           VKGFEVNTARVGDSIFYGCVPVIIANYYDLPFGDILNWKSFS+VVTTSDIPRLKEILKGI
Sbjct: 361 VKGFEVNTARVGDSIFYGCVPVIIANYYDLPFGDILNWKSFSIVVTTSDIPRLKEILKGI 420

Query: 786 NDEEYAILQSNVLKVRRHFKWHSSPVDYDTFHMVMYQLWLRRTSVRLPL 835
           NDEEYA LQSNVLKVR+HFKWHSSPVDYDTFHMVMYQLWLRRTSVRLPL
Sbjct: 421 NDEEYARLQSNVLKVRKHFKWHSSPVDYDTFHMVMYQLWLRRTSVRLPL 464

BLAST of HG10019535 vs. ExPASy TrEMBL
Match: A0A6J1EXP8 (probable glycosyltransferase At5g03795 OS=Cucurbita moschata OX=3662 GN=LOC111439219 PE=3 SV=1)

HSP 1 Score: 748.8 bits (1932), Expect = 2.4e-212
Identity = 374/459 (81.48%), Postives = 400/459 (87.15%), Query Frame = 0

Query: 394 FRGLFFIPTCLALNTFLFILFYISSTSTPNSFPS------------------QIPTRFSD 453
           FR LFFIPTCLALN FLFILFY SS+S+ +S  S                  QIPT+F D
Sbjct: 10  FRTLFFIPTCLALNAFLFILFYTSSSSSSSSSSSSSSSSTHTFFFSPPSQLRQIPTQFHD 69

Query: 454 SSSSLVSSLNLSITALRVSQKVSLQDGNGRPPLPLPLPQPSSPLPSFGKGSHEQAEGVFH 513
           SS  L+SS          +Q +SL+D NG P   LP  QPSS LPSFG+  HEQ +GVFH
Sbjct: 70  SSIPLLSS----------TQNLSLEDANGGP--ALPHTQPSSSLPSFGRRMHEQTKGVFH 129

Query: 514 DEELFLEDYKEMNKSFKIFVYPHKRSDPFARSLLPEDFEPHGNYASESYFKKSLIKSHFI 573
           DE+LFLEDYKEMNKSFKI+VYPHKR+DPFARSLLPEDFEPHGNYASESYFK+SL KSHFI
Sbjct: 130 DEDLFLEDYKEMNKSFKIYVYPHKRNDPFARSLLPEDFEPHGNYASESYFKQSLFKSHFI 189

Query: 574 TNNPKEADFFFLPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYPYWNRTGGADHFYVA 633
            N+PK+ADFFFLPFSITGLRNDRRVSVSGIP+FIRDYIF V+HKYPYWNRTGGADHFYVA
Sbjct: 190 VNDPKDADFFFLPFSITGLRNDRRVSVSGIPDFIRDYIFSVTHKYPYWNRTGGADHFYVA 249

Query: 634 CHSVGRSAMDKSSEAKLSIVQVVCSSSYFLTGYISHKDAALPQIWPRKQDPPNLASSKRT 693
           CHSVGRSAMDKSSEAK S+VQVVCSSSYFL GYISHKDAALPQIWPRKQDPPNL SSKRT
Sbjct: 250 CHSVGRSAMDKSSEAKSSVVQVVCSSSYFLPGYISHKDAALPQIWPRKQDPPNLTSSKRT 309

Query: 694 RLAFFAGAMNSPTRQELVRVWGTDSEIFAHSGRLKTPYADELLKSKFCLHVKGFEVNTAR 753
           RLAFFAGAMNSPTRQELVRVWG D+EIFAHSGRLKTPYADELL+SKFCLHVKGFEVNTAR
Sbjct: 310 RLAFFAGAMNSPTRQELVRVWGKDAEIFAHSGRLKTPYADELLRSKFCLHVKGFEVNTAR 369

Query: 754 VGDSIFYGCVPVIIANYYDLPFGDILNWKSFSVVVTTSDIPRLKEILKGINDEEYAILQS 813
           VGDSIFYGCVPVII+NYYDLPFGDILNWKSFSVVV TSDIP LK+ILKGI+DEEYAILQS
Sbjct: 370 VGDSIFYGCVPVIISNYYDLPFGDILNWKSFSVVVATSDIPSLKKILKGISDEEYAILQS 429

Query: 814 NVLKVRRHFKWHSSPVDYDTFHMVMYQLWLRRTSVRLPL 835
           NVLKVR+HFKWHSSPVD+DTFHMVMYQLWLRRTSVRL L
Sbjct: 430 NVLKVRKHFKWHSSPVDFDTFHMVMYQLWLRRTSVRLRL 456

BLAST of HG10019535 vs. ExPASy TrEMBL
Match: A0A6J1I5L6 (probable glycosyltransferase At5g03795 OS=Cucurbita maxima OX=3661 GN=LOC111470880 PE=3 SV=1)

HSP 1 Score: 739.2 bits (1907), Expect = 1.9e-209
Identity = 372/453 (82.12%), Postives = 398/453 (87.86%), Query Frame = 0

Query: 394 FRGLFFIPTCLALNTFLFILFYISSTS------------TPNSFPSQIPTRFSDSSSSLV 453
           FR LFFIPTCLALNTFL ILFY SS+S            +P S   QIPT+F  SS  L+
Sbjct: 10  FRTLFFIPTCLALNTFLLILFYTSSSSSSSSSSTHTFFFSPPSQLRQIPTQFHHSSIPLL 69

Query: 454 SSLNLSITALRVSQKVSLQDGNGRPPLPLPLPQPSSPLPSFGKGSHEQAEGVFHDEELFL 513
           SS          +Q +SL+D NG P   LP  QPSS LPSFG+   EQA+GVFHDE+LFL
Sbjct: 70  SS----------AQNLSLEDTNGGP--ALPHTQPSSSLPSFGR-RIEQAKGVFHDEDLFL 129

Query: 514 EDYKEMNKSFKIFVYPHKRSDPFARSLLPEDFEPHGNYASESYFKKSLIKSHFITNNPKE 573
           EDYKEMNKSFKI+VYPHKR+DPFARSLLPEDFEPHGNYASESYFK+SL KSHFI N+PK+
Sbjct: 130 EDYKEMNKSFKIYVYPHKRNDPFARSLLPEDFEPHGNYASESYFKQSLFKSHFIVNDPKD 189

Query: 574 ADFFFLPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYPYWNRTGGADHFYVACHSVGR 633
           ADFFFLPFSITGLRNDRRVSVSGIPNFIRDYIF V+HKYPYWNRTGGADHFYVACHSVGR
Sbjct: 190 ADFFFLPFSITGLRNDRRVSVSGIPNFIRDYIFSVTHKYPYWNRTGGADHFYVACHSVGR 249

Query: 634 SAMDKSSEAKLSIVQVVCSSSYFLTGYISHKDAALPQIWPRKQDPPNLASSKRTRLAFFA 693
           SAMDKSSEAK S+VQVVCSSSYFL GYISHKDAALPQIWPRKQDPPNL SSKRTRLAFFA
Sbjct: 250 SAMDKSSEAKSSVVQVVCSSSYFLPGYISHKDAALPQIWPRKQDPPNLTSSKRTRLAFFA 309

Query: 694 GAMNSPTRQELVRVWGTDSEIFAHSGRLKTPYADELLKSKFCLHVKGFEVNTARVGDSIF 753
           GAMNSPTRQELVRVWG D+EIFAHSGRLKTPYADELL+SKFCLHVKGFEVNTARVGDSIF
Sbjct: 310 GAMNSPTRQELVRVWGKDAEIFAHSGRLKTPYADELLRSKFCLHVKGFEVNTARVGDSIF 369

Query: 754 YGCVPVIIANYYDLPFGDILNWKSFSVVVTTSDIPRLKEILKGINDEEYAILQSNVLKVR 813
           YGCVPVII+NYYDLPFGDILNWKSFSVVV TSDIP LK+ILKGI+DEEYA+LQSNVLKVR
Sbjct: 370 YGCVPVIISNYYDLPFGDILNWKSFSVVVATSDIPSLKKILKGISDEEYAMLQSNVLKVR 429

Query: 814 RHFKWHSSPVDYDTFHMVMYQLWLRRTSVRLPL 835
           +HFKWHSSPVD+DTFHMVMYQLWLRRT+VRL L
Sbjct: 430 KHFKWHSSPVDFDTFHMVMYQLWLRRTTVRLRL 449

BLAST of HG10019535 vs. ExPASy TrEMBL
Match: A0A6J1DSE3 (probable glycosyltransferase At5g03795 OS=Momordica charantia OX=3673 GN=LOC111023961 PE=3 SV=1)

HSP 1 Score: 729.6 bits (1882), Expect = 1.5e-206
Identity = 365/452 (80.75%), Postives = 394/452 (87.17%), Query Frame = 0

Query: 384 SSSSSSFANWFRGLFFIPTCLALNTFLFILFYISSTSTPN-SFPSQIPTRFSDSSSSLVS 443
           + + S + + FRGL  IPTCLALN  +FI+FY SS S PN +  S+I        S+   
Sbjct: 2   AQTQSPWMSSFRGLLLIPTCLALNALIFIVFYNSSGSAPNFASESEIIQTHQFQYSASDH 61

Query: 444 SLNLSITALRVSQKVSLQDGNGRPPLPLPLPQPSSPLPSFGKGSHEQAEGVFHDEELFLE 503
            LN+SI+ L++  KV+L+  N  PP          PLPSFG+     A+GVFHDEELF+E
Sbjct: 62  PLNVSISTLQIRHKVTLEHRN--PP----------PLPSFGR-----AKGVFHDEELFIE 121

Query: 504 DYKEMNKSFKIFVYPHKRSDPFARSLLPEDFEPHGNYASESYFKKSLIKSHFITNNPKEA 563
           DYKEMN SFKI+VYPHKRSDPFARSLLPE+FEPHGNYASESYFKKSLIKSHFIT+NPKEA
Sbjct: 122 DYKEMNNSFKIYVYPHKRSDPFARSLLPEEFEPHGNYASESYFKKSLIKSHFITDNPKEA 181

Query: 564 DFFFLPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYPYWNRTGGADHFYVACHSVGRS 623
           DFFFLPFSITGLRNDRRVSVSGIPNFIRDYI+ +SH+YPYWNRTGG DHFYVACHSVGRS
Sbjct: 182 DFFFLPFSITGLRNDRRVSVSGIPNFIRDYIYTISHRYPYWNRTGGTDHFYVACHSVGRS 241

Query: 624 AMDKSSEAKLSIVQVVCSSSYFLTGYISHKDAALPQIWPRKQDPPNLASSKRTRLAFFAG 683
           AMDKS EAK S+VQVVCSSSYFLTGYISHKDAALPQIWPRKQDPPNLASS RTRLAFFAG
Sbjct: 242 AMDKSGEAKSSVVQVVCSSSYFLTGYISHKDAALPQIWPRKQDPPNLASSNRTRLAFFAG 301

Query: 684 AMNSPTRQELVRVWGTDSEIFAHSGRLKTPYADELLKSKFCLHVKGFEVNTARVGDSIFY 743
           AMNSPTRQELVRVWG DSEIFAHSGRL+TPYADELLKSKFCLHVKGFEVNTARVGDSIFY
Sbjct: 302 AMNSPTRQELVRVWGKDSEIFAHSGRLRTPYADELLKSKFCLHVKGFEVNTARVGDSIFY 361

Query: 744 GCVPVIIANYYDLPFGDILNWKSFSVVVTTSDIPRLKEILKGINDEEYAILQSNVLKVRR 803
           GCVPVIIANYYDLPFGDILNWKSFSVVVTTSDIPRLK+ILKGI+DEEY +LQSNVLKVRR
Sbjct: 362 GCVPVIIANYYDLPFGDILNWKSFSVVVTTSDIPRLKDILKGISDEEYQVLQSNVLKVRR 421

Query: 804 HFKWHSSPVDYDTFHMVMYQLWLRRTSVRLPL 835
           HFKWH SPVDYDTFHMVMYQLWLRRTSVRLPL
Sbjct: 422 HFKWHPSPVDYDTFHMVMYQLWLRRTSVRLPL 436

BLAST of HG10019535 vs. TAIR 10
Match: AT2G34260.1 (transducin family protein / WD-40 repeat family protein )

HSP 1 Score: 412.9 bits (1060), Expect = 6.1e-115
Identity = 206/333 (61.86%), Postives = 259/333 (77.78%), Query Frame = 0

Query: 1   MEINLGDIPFDLDFHPSDQLVAAGVIGGNLHLYRYDANALPQRLFKVRAHVKSCRAVRFI 60
           MEI+LG   F +DFHPS  LVAAG+I G+LHLYRYD+++   R  KVRAH +SCRAVRFI
Sbjct: 1   MEIDLGANAFGIDFHPSTNLVAAGLIDGHLHLYRYDSDSSLVRERKVRAHKESCRAVRFI 60

Query: 61  NDGRGNDWHFHPSNDWFIACISAILTGSSDHSILSTDVETGSVIARLEDAHDEAVSKLIN 120
           +DG+                   I+T S+D SIL+TDVETG+ +A LE+AH++AV+ LIN
Sbjct: 61  DDGQ------------------RIVTASADCSILATDVETGAQVAHLENAHEDAVNTLIN 120

Query: 121 ITSETIASGDDNGRIKVWDTRQRSCCSSFKAHKDYISDMTYSSDSMKLLATSGDGSLSVC 180
           +T  TIASGDD G +K+WDTRQRSC   F AH+DYIS MT++SDSMKL+ TSGDG+LSVC
Sbjct: 121 VTETTIASGDDKGCVKIWDTRQRSCSHEFNAHEDYISGMTFASDSMKLVVTSGDGTLSVC 180

Query: 181 NLRRNKIHARSEFSEVELLSVVIMKNGRKVICGSQTGTLLLYSWGFFQDCSDRFVDVSQN 240
           NLR +K+ ++SEFSE ELLSVVIMKNGRKVICG+Q GTLLLYSWGFF+DCSDRFVD++ N
Sbjct: 181 NLRTSKVQSQSEFSEDELLSVVIMKNGRKVICGTQNGTLLLYSWGFFKDCSDRFVDLAPN 240

Query: 241 PVNALLKLDEDRVIAGSESGLISLVGILPNRVIQPIAEHSDYPVERLAFSHDRKFLGSIS 300
            V+ALLKLDEDR+I G ++G+ISLVGILPNR+IQPI  H DYP+E LA SHD+KFLGS +
Sbjct: 241 SVDALLKLDEDRLITGCDNGIISLVGILPNRIIQPIGSH-DYPIEDLALSHDKKFLGSTA 300

Query: 301 HDYMVKLWDMDLLQSSGNALNGRATVDASASNN 334
           HD M+KLW+++ +    N  +G A+  A  S++
Sbjct: 301 HDSMLKLWNLEEILEGSNVNSGNASGAAEDSDS 314

BLAST of HG10019535 vs. TAIR 10
Match: AT2G34260.2 (transducin family protein / WD-40 repeat family protein )

HSP 1 Score: 344.4 bits (882), Expect = 2.6e-94
Identity = 166/257 (64.59%), Postives = 210/257 (81.71%), Query Frame = 0

Query: 77  FIACISAILTGSSDHSILSTDVETGSVIARLEDAHDEAVSKLINITSETIASGDDNGRIK 136
           F   +  I+T S+D SIL+TDVETG+ +A LE+AH++AV+ LIN+T  TIASGDD G +K
Sbjct: 2   FFCWVLGIVTASADCSILATDVETGAQVAHLENAHEDAVNTLINVTETTIASGDDKGCVK 61

Query: 137 VWDTRQRSCCSSFKAHKDYISDMTYSSDSMKLLATSGDGSLSVCNLRRNKIHARSEFSEV 196
           +WDTRQRSC   F AH+DYIS MT++SDSMKL+ TSGDG+LSVCNLR +K+ ++SEFSE 
Sbjct: 62  IWDTRQRSCSHEFNAHEDYISGMTFASDSMKLVVTSGDGTLSVCNLRTSKVQSQSEFSED 121

Query: 197 ELLSVVIMKNGRKVICGSQTGTLLLYSWGFFQDCSDRFVDVSQNPVNALLKLDEDRVIAG 256
           ELLSVVIMKNGRKVICG+Q GTLLLYSWGFF+DCSDRFVD++ N V+ALLKLDEDR+I G
Sbjct: 122 ELLSVVIMKNGRKVICGTQNGTLLLYSWGFFKDCSDRFVDLAPNSVDALLKLDEDRLITG 181

Query: 257 SESGLISLVGILPNRVIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMVKLWDMDLLQSS 316
            ++G+ISLVGILPNR+IQPI  H DYP+E LA SHD+KFLGS +HD M+KLW+++ +   
Sbjct: 182 CDNGIISLVGILPNRIIQPIGSH-DYPIEDLALSHDKKFLGSTAHDSMLKLWNLEEILEG 241

Query: 317 GNALNGRATVDASASNN 334
            N  +G A+  A  S++
Sbjct: 242 SNVNSGNASGAAEDSDS 257

BLAST of HG10019535 vs. TAIR 10
Match: AT4G38040.1 (Exostosin family protein )

HSP 1 Score: 284.3 bits (726), Expect = 3.2e-76
Identity = 166/427 (38.88%), Postives = 237/427 (55.50%), Query Frame = 0

Query: 426 PSQIPTRFSDSSSSLVSSLNLSITALRVSQKV-----SLQDGNGRPPLPL----PLPQPS 485
           PSQ    FS   SS + SL  S+  + +   +     SL      PP P+    P+  P 
Sbjct: 6   PSQF--SFSGGGSSPLCSLKSSLLTVAILTFISLFYLSLNSLRTSPPSPVIVVTPIHVPH 65

Query: 486 SPLPSFGKGS------HEQAEGVFHDEELFLEDYKEMNKSFKIFVYPHKRSDPFARSLLP 545
           + +  +   +       E    V+H  E F  +Y EM K FK+++YP    DP      P
Sbjct: 66  TFVNEYKTDNETPTMEEETYSDVYHSPEAFRLNYAEMEKRFKVYIYPD--GDPNTFYQTP 125

Query: 546 EDFEPHGNYASESYFKKSLIKSHFITNNPKEADFFFLPFSITGLRNDRRVSVSGIPNFIR 605
              +  G YASE YF +++ +S F T +P EAD FF+P S   +R  +  S   +   ++
Sbjct: 126 R--KVTGKYASEGYFFQNIRESRFRTLDPDEADLFFIPISCHKMRG-KGTSYENMTVIVQ 185

Query: 606 DYIFDVSHKYPYWNRTGGADHFYVACHSVGRSAMDKSSEAKLSIVQVVCSSSYFLTGYIS 665
           +Y+  +  KYPYWNRT GADHF+V CH VG  A + S     + ++VVCS SY + G+I 
Sbjct: 186 NYVDGLIAKYPYWNRTLGADHFFVTCHDVGVRAFEGSPLLIKNTIRVVCSPSYNV-GFIP 245

Query: 666 HKDAALPQI-WPRKQDPPNLASSKRTRLAFFAGAMNSPTRQELVRVWGTDSEIFAHSGRL 725
           HKD ALPQ+  P            RT L F+AG  NS  R  L  VW  D+E+   + R+
Sbjct: 246 HKDVALPQVLQPFALPAGGNDVENRTTLGFWAGHRNSKIRVILAHVWENDTELDISNNRI 305

Query: 726 KTP-----YADELLKSKFCLHVKGFEVNTARVGDSIFYGCVPVIIANYYDLPFGDILNWK 785
                   Y     ++KFC+   G +VN+AR+ DSI YGC+PVI+++YYDLPF DILNW+
Sbjct: 306 NRATGHLVYQKRFYRTKFCICPGGSQVNSARITDSIHYGCIPVILSDYYDLPFNDILNWR 365

Query: 786 SFSVVVTTSDIPRLKEILKGINDEEYAILQSNVLKVRRHFKWHSSPVDYDTFHMVMYQLW 832
            F+VV+   D+  LK+ILK I   E+  L +N++KV++HF+W+S PV +D FHM+MY+LW
Sbjct: 366 KFAVVLREQDVYNLKQILKNIPHSEFVSLHNNLVKVQKHFQWNSPPVKFDAFHMIMYELW 424

BLAST of HG10019535 vs. TAIR 10
Match: AT4G16745.1 (Exostosin family protein )

HSP 1 Score: 236.5 bits (602), Expect = 7.8e-62
Identity = 129/351 (36.75%), Postives = 198/351 (56.41%), Query Frame = 0

Query: 493 VFHDEELFLEDYKEMNKSFKIFVYPHKRSDPFARSLLPEDFEPH--GNYASESYFKKSL- 552
           +F +  +F   Y+ M    K+++YP      F         EPH  G YASE +F K + 
Sbjct: 182 LFRNLSVFKRSYELMELILKVYIYPDGDKPIF--------HEPHLNGIYASEGWFMKLME 241

Query: 553 IKSHFITNNPKEADFFFLPFSITGLRNDRRV----SVSGIPNFIRDYIFDVSHKYPYWNR 612
               F+T NP+ A  F++P+S+  L+    V    ++  +  F+RDY+  +S KYP+WNR
Sbjct: 242 SNKQFVTKNPERAHLFYMPYSVKQLQKSIFVPGSHNIKPLSIFLRDYVNMLSIKYPFWNR 301

Query: 613 TGGADHFYVACHSVGRSAMDKSSEAKLSIVQVVCSSSYFLTGYISHKDAALPQIWPRKQD 672
           T G+DHF VACH  G   +++  E K + ++ +C++      ++  KD +LP+   R   
Sbjct: 302 THGSDHFLVACHDWGPYTVNEHPELKRNAIKALCNADLSDGIFVPGKDVSLPETSIRNAG 361

Query: 673 PP--NLAS----SKRTRLAFFAGAMNSPTRQELVRVW---GTDSEIFA---HSGRLKTPY 732
            P  N+ +    S+R  LAFFAG ++   R +L++ W     D +I+    H+   K  Y
Sbjct: 362 RPLRNIGNGNRVSQRPILAFFAGNLHGRVRPKLLKHWRNKDEDMKIYGPLPHNVARKMTY 421

Query: 733 ADELLKSKFCLHVKGFEVNTARVGDSIFYGCVPVIIANYYDLPFGDILNWKSFSVVVTTS 792
              +  SK+CL   G+EVN+ R+ ++I+Y CVPV+IA+ + LPF D+L+W +FSVVV   
Sbjct: 422 VQHMKSSKYCLCPMGYEVNSPRIVEAIYYECVPVVIADNFMLPFSDVLDWSAFSVVVPEK 481

Query: 793 DIPRLKEILKGINDEEYAILQSNVLKVRRHFKWHSSPVDYDTFHMVMYQLW 825
           +IPRLKEIL  I    Y  +QSNV  V+RHF W   P  YD FHM+++ +W
Sbjct: 482 EIPRLKEILLEIPMRRYLKMQSNVKMVQRHFLWSPKPRKYDVFHMILHSIW 524

BLAST of HG10019535 vs. TAIR 10
Match: AT5G03795.1 (Exostosin family protein )

HSP 1 Score: 224.6 bits (571), Expect = 3.1e-58
Identity = 131/353 (37.11%), Postives = 200/353 (56.66%), Query Frame = 0

Query: 493 VFHDEELFLEDYKEMNKSFKIFVYPHKRSDPFARSLLPEDFEPHGNYASE-SYFKKSLIK 552
           ++ + ++F   Y EM K FKI+VY  K  +P     L  D      Y+ E S+  +    
Sbjct: 172 MYWNAKVFHRSYLEMEKQFKIYVY--KEGEP----PLFHDGPCKSIYSMEGSFIYEIETD 231

Query: 553 SHFITNNPKEADFFFLPFSITGL------RNDRRVSVSGIPNFIRDYIFDVSHKYPYWNR 612
           + F TNNP +A  F+LPFS+  +      RN R    S I N ++DYI  V  KYPYWNR
Sbjct: 232 TRFRTNNPDKAHVFYLPFSVVKMVRYVYERNSR--DFSPIRNTVKDYINLVGDKYPYWNR 291

Query: 613 TGGADHFYVACHSVGRSAMDKSSEAKLSIVQVVCSSSYFLTGYISHKDAALPQIWPRKQD 672
           + GADHF ++CH  G  A         + ++ +C+++     +   KD ++P+I  R   
Sbjct: 292 SIGADHFILSCHDWGPEASFSHPHLGHNSIRALCNAN-TSERFKPRKDVSIPEINLRTGS 351

Query: 673 PPNL----ASSKRTRLAFFAGAMNSPTRQELVRVW-GTDSEIFAHSGRLK-TPYADELLK 732
              L    + S R  LAFFAG ++ P R  L++ W   D++I  H    + T Y+D +  
Sbjct: 352 LTGLVGGPSPSSRPILAFFAGGVHGPVRPVLLQHWENKDNDIRVHKYLPRGTSYSDMMRN 411

Query: 733 SKFCLHVKGFEVNTARVGDSIFYGCVPVIIANYYDLPFGDILNWKSFSVVVTTSDIPRLK 792
           SKFC+   G+EV + R+ ++++ GCVPV+I + Y  PF D+LNW+SFSV+V+  DIP LK
Sbjct: 412 SKFCICPSGYEVASPRIVEALYSGCVPVLINSGYVPPFSDVLNWRSFSVIVSVEDIPNLK 471

Query: 793 EILKGINDEEYAILQSNVLKVRRHFKWHSSPVDYDTFHMVMYQLWLRRTSVRL 833
            IL  I+  +Y  +   VLKVRRHF+ +S    +D FHM+++ +W+RR +V++
Sbjct: 472 TILTSISPRQYLRMYRRVLKVRRHFEVNSPAKRFDVFHMILHSIWVRRLNVKI 515

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038903113.15.1e-24991.54probable glycosyltransferase At5g03795 [Benincasa hispida][more]
XP_008464966.17.6e-24590.83PREDICTED: probable glycosyltransferase At5g03795 [Cucumis melo][more]
XP_004141132.22.1e-24290.62probable glycosyltransferase At5g03795 [Cucumis sativus] >KGN59748.1 hypothetica... [more]
XP_022932772.15.0e-21281.48probable glycosyltransferase At5g03795 [Cucurbita moschata][more]
XP_023540773.12.5e-21181.54probable glycosyltransferase At5g03795 [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
O807758.5e-11461.86WD repeat-containing protein 55 OS=Arabidopsis thaliana OX=3702 GN=WDR55 PE=1 SV... [more]
Q9FFN24.3e-5737.11Probable glycosyltransferase At5g03795 OS=Arabidopsis thaliana OX=3702 GN=At5g03... [more]
Q9LFP32.8e-5635.98Probable glycosyltransferase At5g11130 OS=Arabidopsis thaliana OX=3702 GN=At5g11... [more]
Q3E7Q91.9e-5234.37Probable glycosyltransferase At5g25310 OS=Arabidopsis thaliana OX=3702 GN=At5g25... [more]
Q54SA52.4e-5235.92WD repeat-containing protein 55 homolog OS=Dictyostelium discoideum OX=44689 GN=... [more]
Match NameE-valueIdentityDescription
A0A1S3CMU33.7e-24590.83probable glycosyltransferase At5g03795 OS=Cucumis melo OX=3656 GN=LOC103502710 P... [more]
A0A0A0LD731.0e-24290.62Exostosin domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G842690 P... [more]
A0A6J1EXP82.4e-21281.48probable glycosyltransferase At5g03795 OS=Cucurbita moschata OX=3662 GN=LOC11143... [more]
A0A6J1I5L61.9e-20982.12probable glycosyltransferase At5g03795 OS=Cucurbita maxima OX=3661 GN=LOC1114708... [more]
A0A6J1DSE31.5e-20680.75probable glycosyltransferase At5g03795 OS=Momordica charantia OX=3673 GN=LOC1110... [more]
Match NameE-valueIdentityDescription
AT2G34260.16.1e-11561.86transducin family protein / WD-40 repeat family protein [more]
AT2G34260.22.6e-9464.59transducin family protein / WD-40 repeat family protein [more]
AT4G38040.13.2e-7638.88Exostosin family protein [more]
AT4G16745.17.8e-6236.75Exostosin family protein [more]
AT5G03795.13.1e-5837.11Exostosin family protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001680WD40 repeatSMARTSM00320WD40_4coord: 40..97
e-value: 330.0
score: 1.1
coord: 100..139
e-value: 0.054
score: 22.6
coord: 142..181
e-value: 2.1E-4
score: 30.6
coord: 269..309
e-value: 0.0011
score: 28.3
coord: 184..223
e-value: 39.0
score: 6.9
IPR001680WD40 repeatPFAMPF00400WD40coord: 278..309
e-value: 0.1
score: 13.5
IPR015943WD40/YVTN repeat-like-containing domain superfamilyGENE3D2.130.10.10coord: 9..244
e-value: 8.7E-28
score: 99.2
IPR015943WD40/YVTN repeat-like-containing domain superfamilyGENE3D2.130.10.10coord: 245..388
e-value: 1.8E-6
score: 29.3
IPR040911Exostosin, GT47 domainPFAMPF03016Exostosincoord: 508..783
e-value: 3.2E-58
score: 197.3
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 458..486
NoneNo IPR availablePANTHERPTHR11062:SF323EXOSTOSIN-LIKE PROTEIN-RELATEDcoord: 484..832
IPR004263Exostosin-likePANTHERPTHR11062EXOSTOSIN HEPARAN SULFATE GLYCOSYLTRANSFERASE -RELATEDcoord: 484..832
IPR019775WD40 repeat, conserved sitePROSITEPS00678WD_REPEATS_1coord: 126..140
IPR019775WD40 repeat, conserved sitePROSITEPS00678WD_REPEATS_1coord: 296..310
IPR036322WD40-repeat-containing domain superfamilySUPERFAMILY50978WD40 repeat-likecoord: 7..310

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10019535.1HG10019535.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006486 protein glycosylation
cellular_component GO:0000139 Golgi membrane
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0016757 glycosyltransferase activity
molecular_function GO:0005515 protein binding