Cmc06g0169551 (gene) Melon (Charmono) v1.1

Overview
Name: Cmc06g0169551
Type: gene
Organism: Cucumis melo L. var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Exostosin domain-containing protein
Location: CMiso1.1chr06: 24124000 .. 24142054 (+)
RNA-Seq Expression: Cmc06g0169551
Synteny: Cmc06g0169551
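
The Location field above packs the sequence ID, start, end and strand into one string. A minimal sketch of pulling those parts out and computing the gene span follows; the function names are hypothetical, and the length calculation assumes 1-based, inclusive coordinates (the GFF3 convention), which the page itself does not state.

```python
import re

def parse_location(text):
    """Split a 'seqid: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, as in GFF3.
    return end - start + 1

seqid, start, end, strand = parse_location("CMiso1.1chr06: 24124000 .. 24142054 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 18055 bp on the plus strand of CMiso1.1chr06.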
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TAAAGTTAAAAATCGAACTACCAAACCGATCACATGTTGATGATTGATTATGGACAATATGCATAAAAAAGGAAATCCGAATGCCTTCTTCAATCTCCCATACCGAAACCGATACCGAAGCTGGAACCTGATTCGCTTTCTCCACCTTCTACAGTGCCGTCGACCACACATTCTCGTCGCCGGTTTCCTCCACGCCCTCACTCTTCCACTGCGTTCTCTCTTTCTCCGGTAATGTCAGCTCTCAAGCACCCATTTCAACTGACCTCCTCCTCTTCATCTCCTCTTTGTTCTCTACGCGCCTCTCTCCTTACTCTCGCCGTTCTCACTCTCCTCTCCTTCACTTATCTCTCCTTCACTTCCCTTCACTCATCCCCTCCCTCCTCTCCCTCTCAGGTACGCCATTTTCTTCTTTCCACATTCTTCACTTCTTCTTTTCCCTCTGTTTCATGCTCATGGATTTGGGGTTTTCATTCCGAAGCTTCCCGTCAAATTGGGAGCCCTTAATGATGCTGCGGATGAGGAGATTTCGGATGTCTACCACTCTCCACAAGTTTTTCGCTTGAATTATGAGGAAATGGATAGCAAATTCAAGGTTTATATATACCCAGATGGCGATCCCAATACTTTTTACCAGACTCCCAGAAAGCTCACTGGCAAGTATGCAAGTGAGGGTTACTTTTTCCAGAATATTAGAGAGAGTCGCTTCCGCACTGAAGATCCGGATCAGGCACACCTCTTTTTCATTCCGATATCATGTCATAAGATGCGAGGCAAGGTATGGTTCTTATCCAATCCTTTTTGGGTTTTCTTTCTTGTTCTGTTTTTTTTAACAACTAGCAATTAGGTAACTAGTGGTGCGTGTGGATTTGTAGCTTCTTGCTTTGTTACTTGTGTGTGTAGCTAGAAGCATTGGTTTTGGGGGAATACGAGTTGCTTAATTTCTTCTTTGTCTTATAAATCATGATGTTTATGGATCCAATGAAGGGTACATCCTACGAGAATATGACGATAATTGTCCAAAATTATGTCGAAGGCTTGATATCCAAGTATCCTTACTGGAACAGAACCTTGGGTGCGGATCACTTTTTTGTCACTTGTCATGATGTTGGTGTGAGAGCTTCCGAAGGCTTGCCTTTTCTTATAAAGAATGCAATTCGAGTTGTGTGCTCTCCTAGCTATGATGTCGGATTCATTCCTCACAAGGATGTTGCTCTTCCCCAAGTGCTGCAGCCATTTGCTCTTCCAGCTGGAGGAAACGATACAGAAAATAGGTAAAGGAATTGATATTTGTGAGTTATACTTTCCATTTCTTTCACCTCAAGAAGTGACTAGTGACAGACAACAGTATCTCTAGTTGGCAATTACAACCTAAATCTGGTTTTAGTTATTCTCCTTGTTTGAACTTTGATCTTCTCCTCCATGTATGGATCTCGGCTATATTTTTGGTTTATACTGCTTTTCAGAATCACAAGTAGGATGGAAGAAAAAATTATGACATTTCTTTGTCTAACTTTGCGATAAAGAGATAGATCCCAACAATTTATCCATAAAAACGAATTGTATGCAACATTTAATCACAAAATTTTCTCTACCCATGTGTACCACTGCTTTTGCTTGGCTCCTTAGTGTTCCAATCATACAAAAGGATGTGAGAAGAAACCCTACATTCTAATTCCTATTCTAAATTTAAATTTGGCTGTTTATTAGAACGTCTTTCCCTCTAAATGAAACACCTTCCATATAATGAGTTAAATATTTCAGGTTCCACATAGTAGCCAAGAGTAATTTCTTCAAAATTTGAGATATTAATTCACATTTTGTCTTATTGTTAGGGTGATGTTAGGCGAACTTAAATATTAAATTAGAATGGTTATCGCGAGTATAAAGTATTAACTTCAGTAGAGGAGAGATGTCCTTTGCTTAATCCTAGATTAAGTAAGTTCAGTTACCACCGACTCTTCTTGTAACCAATAGAACTGGATCTTGAACCATTTGTGTGA
TCTGTCAAGTTATAAGTCCTGAAAGACAGTTATTTAGATTAACTACTAGGTAGTAATTTAAGGGAAGCAGATTGGCGGTAAAGTGCTGTTCAAGGTTCAGAAATGGAAGAAAGCACTTTAATAGAAACTGGCAGGCTAACTCTCAAACAATCTGTTCTTTCACTTGTACCCTTTCATTTTATTTTGTTTAATTTGATTTTGATTTTGATTATTATTTTTTTCATTACACCTATTCTTGGGAAGCTTGTTGAGTTTACTGAGGAGTTTCGGACTGTTTTATTTGGGAGGAATAAAGACTCAGATAGGGATGTGGAGTCATCTTGTTAACTGGAAGGTCATATTAACCCTAGGGATGGGGGGCTTAGTATTAGAAAGTTCTGAAGCCAAAGCTTTTATTCTTCTAAATGGCTGGGGTAATTCGTAGAGAAGGGGATGCAGGGATGCTTTATATCATAAGGTGGATGTGAGAATTTATGGGGTAGATTAGGCTACCTCCTTGTATGGTAAACTTGGAGAGTTGGTGTTTTAGGAGTCCTTGGACTCTGGGATTAGGCAATGAAGGGCCTCTTAGGAGGTATTTGGTTTAAGATGGAGATGTGGTAAAAACGAGTTTGTCAAGGGGATCTGTCATGTGTGGCTTAGAGCACATTGAAAGTAAACTTGTAAACTTTATTTATTCATTAACATAGTGGATAGTCAAGGTCGTAAGTGACATGTTGGACTAAAAGCAGCTTGGGTGACGTCTAGAGTTCAGTAGGCGTTTCATGGTTATAGGTAGAGGATTTGTTAGAGAAGAAGATAATTTTTCAGAGTTTCCCCTGTAGATAGACTAATAGCTGATGGTGGAAAGGTTGGATATATTTCCTTGTCGATCCCTTTACCAATTTTATTTTAATTGTTTTTCTTAAATATGTGTGGTTTGTTCTATGTTCCTTTATCATGGGAAGATTAATATTTGAGATTTCATTCAAAGAAACTTCATATCAGTGACCTGTCCTAATTAGTGCTGGTTAAGTTAAGACTTTGAAGAAGTTCAGGAGCATTGGGGCTCAAATGGTATGCTCTTCTGAATAAGGATTGTCTTGAGCTTCTTTAATTGTTCGGTGTGAGCTGGTACTTCCCATGACAATTTCTTCCACAGTTGCTGCCCTTGCTTCATGATTCAGAGAGGAGTTTGCAGTTGAATGCTTTCTGTGCATTACTTTGAAGCTTATTATTAGATAGAAATTTGAGAAACTTTTATTGTATTTGAGAGAGTTTTTGGATTTCAAATCTCGTGTTTCTTTTAGATTTTAGAGGTTCATTGGACAAAACTTGTCCTTTATTATTAATACTATTATTATTTTATTTGTTATGCACATGGAAAGTTTCTTTATGATTTCTGCTGAAGGAGACTTTTGTCTTCTCCATTTTGTATACTTTCTTCTGTTAGTTAAATTTTTTTTTTGGCAATGAAGATCTGCCATGAACATTGGAGGGAGCAGGCCTCATGATAGCCAACCTCATAAGAAATCTTCACATCAATAGGTAGGTAGACTAGAATACTGATAGGTCTGGGCTAACTAAGTGTATTCCCGAAGAAAAAAGAAAGGCCTGGGTACTGGGAAAAAGAGATAGAGAAAGAGAATGTGGCAAATCTTGTCCTTTATTATTAATACTTTTAAAGGGACGAGGGGTTGTGTATGATTTTATCATCTTTTAATCTTTCTTGTGTAGTTCTGTGAGCACTGTGGAGAGGGGGGACCTCTCAAAACTCCCTTGCCTTTGCCGAATTAATAATATATAATTTGAGATCTATCAATTACTTGATGAGGAAAGGTGTTTTAACACAAGAGTTAGCCTTCTAACCTGATCTTCTTGAATGTTATTCCTACTTTAGTGTGTTGTTGATATACTACTGATTGCCTCAACGAGACCACTTTTTTTCTCCCAACTACAAGCCCTGAGAAGACTTTTAGACTGTTGCCCATATTTACCACCTTATTTTAAAAGTTCCCTTA
TCAAAAAAAGTTTTGAATGCAAGTTTTATTTCTTGTATCGATCAAATTAATATATATTCTATTCAGATGATTTACTTATCTTATCTCCCAGTTGTATAATCTTTTTCTCTAAAAAAATTGCTGTCACATAAAAAATACAGATTAACTGCTGACGGAGATGTTACTATTATCTGTAGGACAACCCTAGGTTTCTGGGCAGGGCATCGGAACTCTAAAATTAGAGTCATACTAGCTCGTGTGTGGGAGAATGATACAGAACTAGACATTTCAAACAACAGGATAAGCAGGGCCACTGGACATTTATTGTACCAGAAAAGATTTTACAAGACAAAATTCTGCATATGCCCAGGAGGTTCACAAGTCAACAGTGCTCGAATAGCTGACTCAATCCACTATGGATGTGTTCCGGGTGAGAACTCTCTTCTTTTGGTTATTTAGCTATAGAAAATTCATTTTCTATTTTTATTTTTTTAACGAAAGATTTAGGTAATCTAATTTACTCTCTTCATTTTCTTGCGAATGGAAGGAAGGTGATTGTCCTTTAATTTTTCAAAATCTAGTTTATTGAAATGTAAGGGATAGTAATCTAAAGTTCCAATTGAATTTCAAAAGAACGAAATGGAAGGCTGATGGAAGTTCAAAGGCGCTGAGGGTTGTAGGAGAGAATGACTTAGTCATTCATTTTACGTTTCCAATTTGACTTTCTGTTTCTTTGTGAAACTATTTGTTTCATGATCTTGATTCAATTTCCCTTCAGTCTTCTTTTGTGGGGAATGTATTAGGAGTTGATACGTCCAAACTGTGATGTTGACTGCAAGCACATAAGGCATACTATAGTATAGAAATAGTCGGAGATGGACTATCATCCTCTGGAGTTTGAAGTGTTTGAGTATAAAATTAAAGCAAGTAATGTAAATAACCGAGAAATTCGGTCACACGAGAATTCTAGGGTATGACTTTTTGTTTAATGGACTTAAGAGTGCGTTTGGATTATTTATGGAAAAAATGTTTTTCAAGAATACATTTTCTTTTAAACACTTTCTTTAAAATTCAAACACTTACATGTTTGGTTAGATCTTTTTATAAGTGTTTATATACATAATATTCTCAAAATTTGGTTTTGAGTGTTCTCTTTAAAATGAATTATTTTTCAATCTTGTTTTTTAAAATGCATTTTCAAAAGTTACCAAACACCTTAAATTTTTTAAAATGATTTTTTTTTCAAATTTAGACACTTTAAAAGGTAATCCAAACACCCCGTAATTGCCCTTGACCAATTTAACACCAATTTTACAGTTATAGAAGTCTCATTTTCAAGTGTCTTCTCTTGATGTCACATTTCTTTATTAGTAAATGTAATCAGTTGAAATATGGTTACTTAGTTAGCATTTCAGTTTATCAGTTAGTATATGGATATTACTTTTCTGTCCTTGATATTTCACTGTATAAATACCCCTTTCATTAAATTAATACAACAAGGTAAAATACCTTTATTTACCCAATATGGTATCAGAGCAATTTTTGCCCCAATTCCCGCCTCCTTTTGCATCTCCACCTTCTCTCTTCGTCTTCGATCAATTTCCATGGCAGATACGGTTGAAATTGAAGAAACCGTTTCAATGGAGAATCAAACCAGCAACAATGGAAGTTGATCAGATCTAATTGCTGCTGCGAACTCTCAGATTGATGCTCAACTCAATCCCTACTTCATCCATCACTCACTTGGCCCGACTACTGCAATAGTTATGCAACCTCTGGCTCTAACCGGAGCCATCAATTGCACTTCATGCAGTCGTGCGATGCTAATGGCAATTTCAGGGCGAAATAAGGCCTGTTTCATCACCAGAAAGATAAGAAAACCTTTGGGCAAAGTCCTACTTGATGCCTGGATTTGCAACTACGACATTATTGCCTCATGGATTCTCAACTCTGTCTCAAAAGAGATTGCGGCAAGCATTGTTTACACAGGATCAGTCAAGGAGATATGGAATGAATTATGT
CAAATATTCAAACAACCAAATGGCCCTAGCATCTACCAACTTTGAAAAGAATTTGTCACCTTGCGGCGAGGAAACTTGACAATCGGAACGTACTACACAAAACTCAAAACCATATGGCACGACTTGAATGAATATCACCCAACAAATGGATGCACTTGTGGAGGTTTGAAACCTTTCATTACTCATCTTGAATCCGAATACATCATGACCTTCCTAATAGGACTGAACAACTCCTATGATGCTGTAAGTGCACAAATCCTTCTTATGAAACCTCTGCCATCAATTACCACTGTGTTCTCTCTATTAATTCAAGAAAAACAAAGATCCCCTGGCATTCTGATTCCCCCTAATGCTTCAACAATGGCCATATCAAGTGATCGAACTTGCAGAAAAAGAGCGTCCTACTTGCTCTTACTGTGGAAGCAGGGGACATGTGGCTGACAAGTGTTATAAGAAACATGGATATCCTCCAAACTACAAACCAAAAAACTCAAACTTCAATTTCCCTGCTTCAGATACCTCAAAGATAGTGAGTAAAGTTGCCAACACCAATTCAACTGCTACCAATCCTTCTCCAATTTTTTTCTCAAGCCTAAGCTCAGAACACTACAGACAATTAATGACTTTGCTTAACACTCATCTTCAAGCAGCCAATACAGTCAGGCATCTTCTCACACTCTAGGCATCTTCTCCCTAACTTCACATAATGTTTGGTCACATGATGATTGGATTATAGACTCAGGAGCATCTAGACACATGCCACAATAAATCTCTTTTCAGAAGTTGGAGTCTCATAAATAACATATTTGTAATGCTACCAAATGGCCATAAATAACAAGATGTCTCGTTCGTTCCTCAGTTTACATATAACCTCATCTCCGCCAGTTGCTTATTGACCTCAAAGAATATCTCAGTTGATTTTCACAGTACTTGTTGCATCATACAAGATCTTTCTCTATCAGTGACTATTGGCAAGGCTAGTTGTCAAAATGGACTATATGTGCTCAGCAAAGATGTTGATGCTTCTGTAGCTGCTGGAGTTAATATCAATGTTGTCTCAGTACCTACTTGGCATCGATGTTTAGGTCACTTGTCACCTAAATGCCTTTCTTATTTGTCTTCAACCTTATGTTTGTCTAGTCATTCCGTACATAATTCATCATGTCATATATGCCCATTAGCAACAAAAAGAGGTTATCATTTCATTCCAATAATAATATTGCTTCTTCTCCTTTTGATCTAATCCATGCTGATATATGGGGGCCTTTTAAAACACCATCTTATGGTGGATATAAATACTTTTTGACATTGATTGATCATTGCTTATGCTTCACATGGGTCTACATGCTAAGGCAGAAATCCGACTTCCAACTCATCGAAACTCAATTCTCAATAATTATCAATCTAACAATGCCCATGAACTAAAATTCACTGAATTCTTTGCTACAAAAGGAACAACCCACCAATTCTCTTGTGTTGAATGACCTCAGCAAAATTCAGGAGTATAGTGAAAACATAACACCTCCTCAACGTTGCCCAAACTCTTTTCTTCCAATCACGAGTTCCAATCTGTTTTTGGGGAGATCGTGTTCTAACAACAGCATATATCATAAACAGAACTCCAATGACACTTTTGAAAAATAAGTCTCCATTTACTGTTCTCCATGGCAAACCTGTAGATTACACTCACATGAGAGTTTTTGGTTGTCTCTGCTAGGTTTGATCCCAGGGCCTCATTTTGTATTTTCATTGGCTACCCATCAAGCATGAAAGGATATCGTTTATATGATATTGAAAAGAAAACTATCTTTGTATCATGAGATGTGACATTCTTTGAGCAACACTTCCATTCCATTCAATATCTACCAATGAACATTCAGAACTTGAAAATGTCTTTCATGACCTTGTCTTACCCATGCCAATACTTGACAACAAAATCACTACCTCAAACATACCTCAAGCTACAGTTGACCCTATCATTGACAATATTGTTCCCAATG
ATAACTTGAATTCTAACTTTGTTGCTCATGAAAATACACAGGATGCACAAGATGTTCATAGCCCTAAAACTCAAATTAAACAAGATGCTGCAAATGAAGGGAACACTTCCGATACAACCAACACAACACTGGAACAATCCATAGCTCCATGTAATCCACCCATTTCTCGAAAATCCAGAAGACAGCACAAGCCTCCAAGCTTTCTAAAAGACTGCCACTGCCACCTACTGACTCATGGCTCTCCTCCAAAAGACACCACTTCATATCATCCTATAGACAAAGTCCTGTCTTATGAGAAGCTGAATCAGACACACAGTACCTTCTTATGCAACATGTCCACAAAATATGAACCATCATTCTTCCATCAGGCTGTAAACTCAAATGCTATGGATGCAGAGATCTCTGCCATGGAAAGAACAAACACCTGGAGCATAGTTCCCCTTACCTATGGACATCATGTTGTAGGATGCAAATGGGTTTACACTTTACCGAGTCAAATATCATGCTGATGGCACAATTGTTAAGCTTAAAGCCTGATTGGTTGCAAAGGGATATAGTCAACAAGAAGGTATTGACTTCCTTGAAACCTTTTCCCTTGTGGCTAAAATAGTTACCGTCAAAGTACTTCTATCCCTAACAACTTTCTTTGGCTGGTCCCTTTACCAAATGGATGTCAACAATGCATTCCTCAACGGCACCTATTTGAAGAAGTATACATGACATTGCCAATGGGCTTTTACTCCAAGGAGCAACAATCCTCATACCCTCCACTTGTCTACAAACTGCATAGTCAATATATGAACTAAAGCAAGCATCCAGACAATGGCTCTCGAAATTCTCTGGAGAATTACTCACTAATGGACAAATTGGACTCTCCCCCATAAAGGCACGGTTCTCAAATTGGATCTTGAGAAGACTTTTGATACTGTTGATTGAGAGTTTCTGGATGTTGTTCTTTAGGCTAAGGGCTTTGGTTTTCTAAGGCGATCTTGGATCAAGGGTTCTATCTGATCTAGCAAATTATTCGATCATTATTAATGGCAAACTGCTAGGGAATAACGTCCCTTCCTGTGACATTCTACATTGCGATTCTTTCTCACCTTTCTTATTGATTTTGGTTGTTGATTGCCTTAATTGTTTCTTGGCTCACCATGCTTGTTTAGGTTATATTTCCGCTCATCTTATCATGCTTACTCTTTTTGTTTGAACCTTCTGCTGTTCATATATTTGGTAGGGCCACTGGTTTAGAAATTATTCTTTACAAGGGCAAGTTGCTTGGGATTCATATTGCTGACACTGAGATGGAGTGGTGGTTTCAAATAGGGACATAATTTACTTTTTTGGGTGGTAACTCAAAAACAACCATTTGGCAACCTGTTATTGAGAGAATTCTGCACAAGCTCCATAATTGGAAGTATGCTTTCATTTCGAAGGGTGGTCGCGATACACTTGTTTAGTCCTCTCTTGCTAGTTTGCCAATTTATTATCTTTGTTTTGCGCACTAAGGGTAGCAAGGACTTTGGATAAGCTGATTCACAATTTTTTCTAGGAAGGTTCTCATGGCGATGGTGGTATGTGTAATGTCAAGTGGGAGGCTGCTCGACTTCCAAAATCAAAGTGGCTTTGGAATTGGTAATTTTTCAGCATCATAATTTGCTTTATTGGCTAATTGGAATGGGAGAATTCTTACTAAATAGTTGATTTTATGATGGAGGCCCATTGATGCTAACTATTATGAGCCTGATCGGGAGAGGTTGGTATTCCAAAAAAAACAAATTGTGAGTGGTTGGCCAAGAACTATTCGATTAAGTTTATTGAATTCCTTGGAGATTTATCTGTCAAACTAGTTATTTGGTTGCTAGCCGTGTGCAACGTTCCCTTGGTGATGAGACTTCTACTTTCTGGACTGGCTCATGGCTTAGTTGTGGCCCTATTATGATTGCGTCTCCTATGCATTATCATCTTGCCTTTTCTTTTCATGATGCTATTGTGGCTGTTCT
GTACGTTGAGACAACCGATGCGTGGGACTTGCGCCTTTGATGCAACCTTAACGATGTGGAAACTAATAAATGAGCTAGTTTTTTCATCTTTTGGCTCCATTTGGTATTCAAAATTCTCTTTATCATTGGACTTGACCTCTTGATTCTTCTTTCCACTATTTTGTTAAATCACTCATGGTGGATTTGTTTGTGATTGTGAACCGTCTTTTTTTAAAAGATCTTTATTTGGTAATTGGGAGAATCATTATCTGAAGAAAATTAAAATCTTTCTTTGGGAGCTAGGAACTTAGTCTTTGAGCCATTAGTACATCAATGCAATCTTTCTTTGGGAGTTAGGAGCTTAGTCTTGGAGCTATTAGTACGTCATGCACTTGTCCCCTTCTTGGTGCATTATGTGTGAATATGCTAGTGTCAAAGTTTCCTAGTCACCACTTCATGTATCATATTTGCATCCAGTTTTTATATTGTTGTTTATAAAGCTTCTGGATGGTCTTTGCCTTTTCTAACAACGTTTATGATTCTGAGAAGATTTTATAGTTGGCTTTCACTCGTGCGTTTTTATTTGGCATCTTTGAGCGAAAGGAATGCTCATATTTTTACGGATGTTTCCTTTTCTTTTGATTGTTTTATGGATATGGTTCTTTCAACTTTCTTTTGGTGTGAAAATAAGCACCCTTTTGCCCTCTAGAACTATATTTTTTAACTCCCAATTGGCGATTATTTTGTAACTCACCAGTATGGTGTTTAGAGTTTTACTTATTTCATTAATGAATGATTTTTCTTCTTCTAAAATAAAAAAGAGAACGGTGAAAAATGCTCAATTTCACGAGTATAAGTGCAATAAAGTGATAGACCCTAATAGTAAGTGTACAAGGGTAAAAAAGTGAGGGAGGAAGGCCTTATCGTTCACTGTGTGCACAAATGATAGTGTGACCTAGTGTGTTGAAAATTAAACATGTTTTAAATAAATAAGTTGTGAGTTAATTTATGTAAGTTAGAGAATGATGTGAGTTAAAAATGTTAGAGTTAATTTAGTAAGAGTTTCACCTTAGTTAAATGTAGGATTTAATTTGTTGTATTTAATTGTGATATAAGTTATTCTTATGACAATCCTATAAATAGGATTGAAATGTTGTAATCTAAATCATCCAAGTGTGAGTTGAATACACACAATAAGTCTTCCAAAAAGGTTTTTGTTTTCTATCAAATATTCTATTTGGTGATTATTTATTTGGGGTGTTCATCAAACACTTCCAACAAAGTGGTATCAGAGCTTGAGTTTGGAGTAAAAAAATTGGTTTTGCAAATGACAAACAACAATTTAGTTCCCTTCCAAGTACCTCGACTTACGAAAGAAAGTTATAGTAGTTGGTGTATTCGAATGAAAGCTCTACTTGGTTCACAAGATGTGCGGGACATTGTTAGTAATGGCTATGAAGAACCAGAAAGTGATGCAGCTTTGAATCAAGCTCAACGAGAAACTTTACAAAATACAAGAAAAAAAGATCAAAAGGCTCTCACCATCATTCATCAAGCCATTGATGATAACAATTTTGAGAAAATTTCTGGAGCAACTACTGCATATCAAACATGGCAAATTTTGGAGAATACGTATAAAGGAGTAGATCGAGTCAAGAAGGTTCGCCTTAAAAAATTGAGAGGTGATTATGAATCACTACATATGAAGGAGTCTGAATCAGTTCCAGATTATACTTAAGATTGCTAGCAGTAGTAAATGAAATGAAAAGATTTGGTGAGACAATAAGCGATGAGCAAGTAGTAGAAAAGATACTTCGCTCATTGGATTAAAAATTTAATTTCATCGTTGTAGCTACTGAAGAATCAAAGGATTTGAGTACAATGTCCATTGATCAACTTATGGGTTCTTTACAAGCCCATGAAGAGAAGCTTCTTAAGAAGAACAAGTAGATGACTGAGGAACTTTTTCAGTCAAAGTTGAAATTAAAAGACAAGGAAGGCAGCCTAGAAAAAGGAAATCG
AGGTCGAGGACGTGGTGGTAATCGTGGACGTGGTGATTTCAAAGATCGAGGTCGAGGAAGCTACGGTCAAAGAAAATTTGATGAGAGTAATTCAAACTCAAATTTATCAAGAGGCAGAGGAAGACAACATTATTCGAGGTCAAGTGGGGAAAGATCAAATAATGATAGGAGGTATGACAAAAGACAGGTTGAATGTTATAATTGTCATAAATTCGGCCATTATTCTTGGGAATGCGGAAATAGAGTTGAAGAAAATGCAATTATGTTGAGAAAGATGAAGAAAGCGGTGATTCCTCATTGTTTCTAGCATGCAAAGGTGCGGAAACATGTGAAAACAGTGCATGGTATCTTGATAGTGGTGCAAGCAATCACATGTGTGGAAGTAAATCAATGTTCGTTGAACTTGATGAATCTGTTGGTGGCGATATCGTATTTGGTGATGCCACAAAAATTCTGGTTGAAGGAAAAGGTAAAATTTTGATCAATTTGAAGAATGAGAAGCATGAGTTTATCTCTAATGTTTATTATGTGCCTAATATGAAGAACAACATTTTGAGTTTGGGACAACTCTTAGATAAAGGCTATAATATTTTGATGAAGGATTATAGTCTTTTGATAAGAGATAATCATGACAAAATTATTGCTAAAGTGCAAATGACGAAAAATAGAATGTTTTTATTAAACATTTAAACTGATGTTGCTAAATGTTTAAAGTCATGTTTGAAAGATCGCAATTGGATTTGGCACTTGAGATTTGGGCATTTGAACTTTGATGGCTTAAGATTATTAGCCAGGAAGAACATGGTGAAAGGGTTGCCATATGTCAAACATCCAAATCAACTTTGTGAAGGCTGTCTTCATGGCAAACAATCAAGGAAGAGTTTTCCACAAGAATCATCTTCGAGAGCAAGGAGACCACTGTAGTTAGTTCACACGGATCTTTGTGGATCGATCAAACCAAGTTCTTTTGGTAAGAAAAATTATTTCTTATTATTTATTGATGATTTCAGCCGGAAAACTTGGGTTTACTTTGTCAAAGAGAAATCAGAAGTATTTGGCATATTCAAGAGATTTAAAGCTCTTGTTGAAAAAGAAAGTGGTTATTACATTAAAGCTTTGAGATCAGACAGGGAAGGTGAATTCACTTCAAATGAATTCAAAACTTTTTGCGCAGAAAATGGAATCCGTCGATCTATGACAGTTCCATTTACTCCTCAACAAAATGGTGTTGTTGAAAGGAAGAACCGAACAATACTTAACATGGCTCGAAGCATGTTGAAGTGCAAGAAGATGCCAAAAGAATTTTGGGCACAAGCTGTTGAGTGTGCAGTGTACTTGTCAAATCGTTCCCCTACTAGAAGCTTATGGAACAAAACTCCTCAACGAGCATGGACAGGAAGAAAACCATCCATTGGTCATTTGAGAGTATTCGGATGCATGGCTTATGCGCATATACCTGATCAAAAGTGTAGTAAGCTTGATGATAAAAGTGAGAAATATGTGTTTGTTGGCTATGATGCAAGCTCAAAAGGCTACAAGCTTTATAATCCTCTTACAAAGAAGACGATCGTAAGTAGAGATGTTGTGTTTGATGAAGAAGCATCATGGAATTGGAATGACGAACCAGAAGATTACAAATTTTTGTTTTTTCCCGACGAATGTGATGAGCCTAGTGACATTGCTTCTCCACCAACATCGCCAATCACTCCACAACAAAGCACATCTTCATCATCTGCAAGTTCAAGTGAAGGGCCTCGTGGCATGAGAAGCTTACAAGACATATATGATGAAACTGAAGAGTTAAGTCAAAGTTTTAATAACCTTGCTCTTTTTTGTTTATTTGGTGACAATGAACCTTTGAATTTTGAAGAAGCTTTGCAAAATGACAAATGGAAGATTGCTATGGATGAAGAGATAAAAGCCATAAAAAAGAATGATACGTGGGAACTTTCTACTCTTCCAAATGGAAAGAAAGCAGTAGGTGTCAAATGGGTGTTCAA
GATAAAAAGAAATGAAAAGGGGGGAAGTGGAGAGATACAAAGCAAGATTAGTTGCAAAAGGATATTCTCAAAGAAAAAGCATTGATTATGATGAAGTATTTGCTCCCATTGCTCGTTTGGAAACTATAAGGTTGTTAATTGCTCTTGCTGCACAAAATAATTGGGAAATTTTTCAGATGGATGTCAAATCGGCATTTTTGAATGGATATCTAGAAGAAGAAGTCTACTTGGAACAATCTCCTAGTTATTCTTCCAAGAGGATAAAGTTCTAAAATTGAAGAAGGCATTATACGGATTGAAAAAAGCACCAAGAATGTGGAATAGCAGAATCAACAAATATTTCCTTGATAATGGGTATTTGAGGTGTCCTTATGAACATTCTCTTTATATTAAGGTTAATGGTCATGGAGATATTTTGGTAGTTTGTTTGTACGTGGATGACTTAATTTTTACAGGAAATTGTGCAAGTATGTTTGAACATCTCAAGAAGGTGATGACCCAAGAATTTGAAATGACAGATATAGGGCTAATGTCATATTATCTTGGCATTGAAGTGAAGCAGTTAGAGGAAGGAATTTTCATTTCTCAAGAACGATATATTAGAGAAATTCTAGAGAAGTTTAATATGATGAATTCTAAGCCTGTTGCAACTCTGATTGAAACTGGGACCAAACTGTCCAAACATGAAGAAGGAGATGATGTTGATCCTTCATATTTCAAAATTTTGGTTGGGAGTTTGAGATATTTGACTTGCACACGACTAGATATTCTTTTCAGCGTTGGATTGATGAGTCGATTTATGGAATCTCCTACAACTACTCATTTGAAAGTGGCAAAGAGAATTCTTCGTTACCTCTGAGGTACGCTTGACTATGAGTTGTTTTATTCTTCATCTAAAGAATTCAAGCTTGAAGGCTATTGTGATAGTGACTGGGCTGGAGATACTAATGATCGAAAGAGCACTAGTGGATATGTTTTCTTCATTGGCAATACTGCATTTACATGGAGTTCTAAGAAGCAACCTATTGTGACATTATTCACTTGTGAGGCAGAATACATTGCTGCAGCTTCATGTGTTTGTCATACAGTTTGGTTAAGAAATTTGTTAAAGACAGTTGGAATTTTGCAAGATGATCCAACTGTGATTCATGTAGACAATGAATAAGTCAACAATTGCTTTAGCAAAGAATCCTGTGTTCCATGATCGTAGCAAACACATTGATACAAGATTTCACTTCATCAGAGATTGCATTTCAAGGAAGTTCAAGTTGAATATGTGAAGACTGAAGATCAAATTGCAGATATTTTTACGAAGCCACTCGAAGTTAATGTATTTAACAACTTAAGAACTTTGCTTGGAGTTTTTTTTTTTTTTTAAAAAAACATGTTTAAGGGAGGATGTTGAAAATTAAACATGTTTTAAATAAATAAGTTATATGTGAGTTAATTTATGTAAGTTAGAGAAGGATGTGAGTTAAAAATGTTAGAGTTAATTTAGTAAGAGTTTCATCTTAGTTAAATGTAGGATTTAATTTGTTGTATTTAATTGTGATATAAGTTATTCTTATGACAATCCTATAAATAGAATTAAAATGTTGTAATCTAAATCATCCAAGTTTGAGTTGAATACACACAAGTCTTCCAAAAAGGTTTTTGTTTTCTATCAAATATTCTATTTGGTGATTATTTATTTGGGGTGTTCATCAAACGCTTCCAACATAGTGTAACTAGGTCGTTAGACTAGCGGAAAATGAGAATGAAGTATTTAAAATTGAATACTAAGATATAATGTCATTGTGCACCTTAACTGTGTACCTGTATATGAAATAAACAAAAAGATTCTCTCGCAAAGTTCAACTTAGGAAAGATGGTTTAGTTGTGAGGCTGTCTTTACTTGGGCATATTTGGGCACGCTCCACAGTTATAATCCTAGGGTTTTTTTGATGGTTTAGCTAAGACTTCTAGAAAGCTTAATTGTTTGAAGTAAACTCTAGTCCAGT
TTTGGGGTGAGAATAATTTTTTGTCTTTTTAAATACATATATATACGAGAAAACTTGACTCATTAAGAAAATTATAAAATAATACACAAACATATAAAAAGATTAGTTCACAAAAGGAACCAAACACGATAAAGAGCACCATTCAAGAGAAAAAACATAATGAATAGTTACAGAAGGGCTTGGATAGTAGCCCAGAGAGAAACATAAGAATCTAATAAAGGACCAAACAATTACTTAAAAGAACTTTCATCTATTCTGAAAATTCTAACGTTTCTCTCCCATCACAAGCCCACAAAATAGCACATACTTCAGCATGCCATCAACACCACTTTTTACAAATTTTCAAAATTATGGGATCACTGAAGGAGATACTGTGAAGGATAATGGAAATATTTGGTGTAGGATGTGTTTCAAAGGGGAAATGGTCGTTTAGTTTATTGGAAAAATTGATTTTGAAACTGAGTGATTGAATGCTTAGATTGTGAAACTGTTGTACATTTATGTAAATAGTTGTAGTGATTTCCCAGCAAACACCTATCCGTTAGGTTATAGCGAGCCAGTACTGGATGCACTCCAACAGATGAGATTCAATGTGTGGTTCTGTTGGTAGGCTTAAGAACGTTTGAAAGGTGACAACAGAGTAATATCTCAATCCATTTCTAAAATTAAATGTGGTAATCTTCAATCTAAACCACTTACTACATATATGCAGATTTTTTTGGGTCAAAATTCTGTAAAAGAGGAAATCATGTCACTGTCAAGTATTGTTATGAGCTGTGACATGCTTGAAGAAAAAAGGAAGGAAAATTTCAAATTATGCTAATGAGATAATGCGAAAAAAATGTAATGCAGTTTCCTTGCCGTCTTAAAACACGAGAGCCTGAGTTGTAAATTTCCTTGTCGTCTTAAAACGCGAGAGGCTGAGTACTTCCCTTATACATACAACATTTTTTTCCCTTTTCCAACAACATACCTAATTCTTTCAAACCCCATTTGCGTTTATTGGAAAAAGAAGGAAACAGCTTGTATTGCATCAAATCAGTGACGAAAGGAGGAAAATGAGGAAAATAATTTTTATTATGATTGTGCTCAACATAAAAAGAAGAGCAAAGTTTATTCAGCGAACTCCTCAAATCTCTGTACATCATACATAATGCCAAACGAATATAGGTGGAGCAATAACATAACAACAATACGTATAAGCTAGTATTAACTATATATTATATTGAGTATTAACATGTTCCTAAGGACAATATTAAAGTTCTAAGTTTGCAATCCTCTTCATCGATTTTGAAAATGAACTAGATCATCGAACATATTCCTTTTTTTGTCAATGTATGAATATTTTTCTTTTTTATATTCTGTGAAAATCTCTTTGTTTGGGGTTTATTTTGAGTTTATTTGTATGGATTTCTTGCAGTGATATTGTCAGATTATTATGACCTTCCATTCAATGACATTCTTGATTGGAGAAAATTTTCGGTGATAGTTAAGGAACGTGATGTGTACCAATTGAAACAAATACTCAAGGACATATCTGATATGGAGTTCATTAAACTCCACAAAAACTTAATACAGGTACAATGTTTCTTCTTCCTACACTATCATTTGTTAATACTAAACATAGTCGATCTTTTTTTGAAATTAAAGAAAAAATACTTTCTGACACTCTAAAATCACGTTGAAGGTGTACAAACTTAATATGTTTGGTGTATATTGCTCGTTTATTAGGTTCAAAAGCATTTTCAGTGGAATTCTCCCCCAATCAAATATGATGCATTCCATATGGTGATGTATGACTTATGGTTACGACACCATGTCATCAAATATTAATATGCTATAGTTACACATGATTTCTTCTACAAAATTTTGTTTGTTAATTTCTCATTGTAAGAAAAAAAATATTTGTAACCAAACTATGTACATTCTCAATATTCTCAATTATCTCTTTTGACGATATTTTCCCCTTATTTCTTTTACATTTGAAAGATATATATTTC
TTTTGTTCATTTAGACTTTTATTACATGCTTATGAATTGCATTTCTTTTGTTTCA

mRNA sequence

TAAAGTTAAAAATCGAACTACCAAACCGATCACATGTTGATGATTGATTATGGACAATATGCATAAAAAAGGAAATCCGAATGCCTTCTTCAATCTCCCATACCGAAACCGATACCGAAGCTGGAACCTGATTCGCTTTCTCCACCTTCTACAGTGCCGTCGACCACACATTCTCGTCGCCGGTTTCCTCCACGCCCTCACTCTTCCACTGCGTTCTCTCTTTCTCCGGTAATGTCAGCTCTCAAGCACCCATTTCAACTGACCTCCTCCTCTTCATCTCCTCTTTGTTCTCTACGCGCCTCTCTCCTTACTCTCGCCGTTCTCACTCTCCTCTCCTTCACTTATCTCTCCTTCACTTCCCTTCACTCATCCCCTCCCTCCTCTCCCTCTCAGCTTCCCGTCAAATTGGGAGCCCTTAATGATGCTGCGGATGAGGAGATTTCGGATGTCTACCACTCTCCACAAGTTTTTCGCTTGAATTATGAGGAAATGGATAGCAAATTCAAGGTTTATATATACCCAGATGGCGATCCCAATACTTTTTACCAGACTCCCAGAAAGCTCACTGGCAAGTATGCAAGTGAGGGTTACTTTTTCCAGAATATTAGAGAGAGTCGCTTCCGCACTGAAGATCCGGATCAGGCACACCTCTTTTTCATTCCGATATCATGTCATAAGATGCGAGGCAAGGGTACATCCTACGAGAATATGACGATAATTGTCCAAAATTATGTCGAAGGCTTGATATCCAAGTATCCTTACTGGAACAGAACCTTGGGTGCGGATCACTTTTTTGTCACTTGTCATGATGTTGGTGTGAGAGCTTCCGAAGGCTTGCCTTTTCTTATAAAGAATGCAATTCGAGTTGTGTGCTCTCCTAGCTATGATGTCGGATTCATTCCTCACAAGGATGTTGCTCTTCCCCAAGTGCTGCAGCCATTTGCTCTTCCAGCTGGAGGAAACGATACAGAAAATAGGACAACCCTAGGTTTCTGGGCAGGGCATCGGAACTCTAAAATTAGAGTCATACTAGCTCGTGTGTGGGAGAATGATACAGAACTAGACATTTCAAACAACAGGATAAGCAGGGCCACTGGACATTTATTGTACCAGAAAAGATTTTACAAGACAAAATTCTGCATATGCCCAGGAGGTTCACAAGTCAACAGTGCTCGAATAGCTGACTCAATCCACTATGGATGTGTTCCGGTGATATTGTCAGATTATTATGACCTTCCATTCAATGACATTCTTGATTGGAGAAAATTTTCGGTGATAGTTAAGGAACGTGATGTGTACCAATTGAAACAAATACTCAAGGACATATCTGATATGGAGTTCATTAAACTCCACAAAAACTTAATACAGGTTCAAAAGCATTTTCAGTGGAATTCTCCCCCAATCAAATATGATGCATTCCATATGGTGATGTATGACTTATGGTTACGACACCATGTCATCAAATATTAATATGCTATAGTTACACATGATTTCTTCTACAAAATTTTGTTTGTTAATTTCTCATTGTAAGAAAAAAAATATTTGTAACCAAACTATGTACATTCTCAATATTCTCAATTATCTCTTTTGACGATATTTTCCCCTTATTTCTTTTACATTTGAAAGATATATATTTCTTTTGTTCATTTAGACTTTTATTACATGCTTATGAATTGCATTTCTTTTGTTTCA

Coding sequence (CDS)

ATGTCAGCTCTCAAGCACCCATTTCAACTGACCTCCTCCTCTTCATCTCCTCTTTGTTCTCTACGCGCCTCTCTCCTTACTCTCGCCGTTCTCACTCTCCTCTCCTTCACTTATCTCTCCTTCACTTCCCTTCACTCATCCCCTCCCTCCTCTCCCTCTCAGCTTCCCGTCAAATTGGGAGCCCTTAATGATGCTGCGGATGAGGAGATTTCGGATGTCTACCACTCTCCACAAGTTTTTCGCTTGAATTATGAGGAAATGGATAGCAAATTCAAGGTTTATATATACCCAGATGGCGATCCCAATACTTTTTACCAGACTCCCAGAAAGCTCACTGGCAAGTATGCAAGTGAGGGTTACTTTTTCCAGAATATTAGAGAGAGTCGCTTCCGCACTGAAGATCCGGATCAGGCACACCTCTTTTTCATTCCGATATCATGTCATAAGATGCGAGGCAAGGGTACATCCTACGAGAATATGACGATAATTGTCCAAAATTATGTCGAAGGCTTGATATCCAAGTATCCTTACTGGAACAGAACCTTGGGTGCGGATCACTTTTTTGTCACTTGTCATGATGTTGGTGTGAGAGCTTCCGAAGGCTTGCCTTTTCTTATAAAGAATGCAATTCGAGTTGTGTGCTCTCCTAGCTATGATGTCGGATTCATTCCTCACAAGGATGTTGCTCTTCCCCAAGTGCTGCAGCCATTTGCTCTTCCAGCTGGAGGAAACGATACAGAAAATAGGACAACCCTAGGTTTCTGGGCAGGGCATCGGAACTCTAAAATTAGAGTCATACTAGCTCGTGTGTGGGAGAATGATACAGAACTAGACATTTCAAACAACAGGATAAGCAGGGCCACTGGACATTTATTGTACCAGAAAAGATTTTACAAGACAAAATTCTGCATATGCCCAGGAGGTTCACAAGTCAACAGTGCTCGAATAGCTGACTCAATCCACTATGGATGTGTTCCGGTGATATTGTCAGATTATTATGACCTTCCATTCAATGACATTCTTGATTGGAGAAAATTTTCGGTGATAGTTAAGGAACGTGATGTGTACCAATTGAAACAAATACTCAAGGACATATCTGATATGGAGTTCATTAAACTCCACAAAAACTTAATACAGGTTCAAAAGCATTTTCAGTGGAATTCTCCCCCAATCAAATATGATGCATTCCATATGGTGATGTATGACTTATGGTTACGACACCATGTCATCAAATATTAA

Protein sequence

MSALKHPFQLTSSSSSPLCSLRASLLTLAVLTLLSFTYLSFTSLHSSPPSSPSQLPVKLGALNDAADEEISDVYHSPQVFRLNYEEMDSKFKVYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFRTEDPDQAHLFFIPISCHKMRGKGTSYENMTIIVQNYVEGLISKYPYWNRTLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALPAGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRISRATGHLLYQKRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQILKDISDMEFIKLHKNLIQVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIKY
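
The protein sequence above should correspond to the coding sequence read codon by codon. A minimal sketch of that translation is shown below, assuming the standard nuclear genetic code (NCBI translation table 1); the `translate` helper is hypothetical, not a tool offered by the page, and only short fragments copied from the start and end of the CDS are used here.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T (e.g. N).
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons and last five codons of the CDS listed above.
print(translate("ATGTCAGCTCTCAAGCACCCA"))
print(translate("GTCATCAAATATTAA"))
```

The two fragments translate to MSALKHP and VIKY, which match the first and last residues of the protein sequence listed above.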
Homology
BLAST of Cmc06g0169551 vs. NCBI nr
Match: XP_008450187.1 (PREDICTED: probable glycosyltransferase At5g03795 [Cucumis melo])

HSP 1 Score: 852.4 bits (2201), Expect = 1.6e-243
Identity = 412/412 (100.00%), Positives = 412/412 (100.00%), Query Frame = 0

Query: 1   MSALKHPFQLTSSSSSPLCSLRASLLTLAVLTLLSFTYLSFTSLHSSPPSSPSQLPVKLG 60
           MSALKHPFQLTSSSSSPLCSLRASLLTLAVLTLLSFTYLSFTSLHSSPPSSPSQLPVKLG
Sbjct: 1   MSALKHPFQLTSSSSSPLCSLRASLLTLAVLTLLSFTYLSFTSLHSSPPSSPSQLPVKLG 60

Query: 61  ALNDAADEEISDVYHSPQVFRLNYEEMDSKFKVYIYPDGDPNTFYQTPRKLTGKYASEGY 120
           ALNDAADEEISDVYHSPQVFRLNYEEMDSKFKVYIYPDGDPNTFYQTPRKLTGKYASEGY
Sbjct: 61  ALNDAADEEISDVYHSPQVFRLNYEEMDSKFKVYIYPDGDPNTFYQTPRKLTGKYASEGY 120

Query: 121 FFQNIRESRFRTEDPDQAHLFFIPISCHKMRGKGTSYENMTIIVQNYVEGLISKYPYWNR 180
           FFQNIRESRFRTEDPDQAHLFFIPISCHKMRGKGTSYENMTIIVQNYVEGLISKYPYWNR
Sbjct: 121 FFQNIRESRFRTEDPDQAHLFFIPISCHKMRGKGTSYENMTIIVQNYVEGLISKYPYWNR 180

Query: 181 TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP 240
           TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP
Sbjct: 181 TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP 240

Query: 241 AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRISRATGHLLYQKRFYKT 300
           AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRISRATGHLLYQKRFYKT
Sbjct: 241 AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRISRATGHLLYQKRFYKT 300

Query: 301 KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQ 360
           KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQ
Sbjct: 301 KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQ 360

Query: 361 ILKDISDMEFIKLHKNLIQVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIKY 413
           ILKDISDMEFIKLHKNLIQVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIKY
Sbjct: 361 ILKDISDMEFIKLHKNLIQVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIKY 412

BLAST of Cmc06g0169551 vs. NCBI nr
Match: XP_004133750.1 (probable glycosyltransferase At5g03795 [Cucumis sativus] >KGN56331.2 hypothetical protein Csa_009905 [Cucumis sativus])

HSP 1 Score: 837.0 bits (2161), Expect = 6.9e-239
Identity = 403/412 (97.82%), Positives = 408/412 (99.03%), Query Frame = 0

Query: 1   MSALKHPFQLTSSSSSPLCSLRASLLTLAVLTLLSFTYLSFTSLHSSPPSSPSQLPVKLG 60
           MSALKHPFQL SSSSSPLCSLRASLLTLAVLTLLSFTYLSFTSLHSSPPSS SQLPVKLG
Sbjct: 1   MSALKHPFQLASSSSSPLCSLRASLLTLAVLTLLSFTYLSFTSLHSSPPSSSSQLPVKLG 60

Query: 61  ALNDAADEEISDVYHSPQVFRLNYEEMDSKFKVYIYPDGDPNTFYQTPRKLTGKYASEGY 120
           ALNDAAD EISDVYHSPQVFRLNY +M+SKFKVYIYPDGDPNTFYQTPRKLTGKYASEGY
Sbjct: 61  ALNDAADAEISDVYHSPQVFRLNYADMESKFKVYIYPDGDPNTFYQTPRKLTGKYASEGY 120

Query: 121 FFQNIRESRFRTEDPDQAHLFFIPISCHKMRGKGTSYENMTIIVQNYVEGLISKYPYWNR 180
           FFQNIRESRFRTEDPDQAHLFFIPISCHKMRGKGTSYENMT+IVQNYVEGLISKYPYWNR
Sbjct: 121 FFQNIRESRFRTEDPDQAHLFFIPISCHKMRGKGTSYENMTVIVQNYVEGLISKYPYWNR 180

Query: 181 TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP 240
           TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP
Sbjct: 181 TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP 240

Query: 241 AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRISRATGHLLYQKRFYKT 300
           AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRISRATGHLLYQKRFYKT
Sbjct: 241 AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRISRATGHLLYQKRFYKT 300

Query: 301 KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQ 360
           KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQ
Sbjct: 301 KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQ 360

Query: 361 ILKDISDMEFIKLHKNLIQVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIKY 413
           ILKDISD+EFIKLHKNL+QVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIKY
Sbjct: 361 ILKDISDIEFIKLHKNLMQVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIKY 412
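
The Identity and Positives figures reported for each hit can be read off the middle line of the alignment: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A minimal sketch of that counting, with hypothetical helper names, using a seven-residue fragment of the XP_004133750.1 alignment above (query YEEMDSK, subject YADMESK):

```python
def midline_stats(midline):
    """Count identities and positives from a BLAST alignment midline."""
    identities = sum(ch.isalpha() for ch in midline)   # letters = identical residues
    positives = sum(ch != " " for ch in midline)       # letters and '+' = positives
    return identities, positives, len(midline)

def percent(n, total):
    """Percentage rounded to two decimals, as printed in the HSP summary."""
    return round(100.0 * n / total, 2)

# Midline for query 'YEEMDSK' vs subject 'YADMESK'.
print(midline_stats("Y +M+SK"))

# Whole-HSP figures reported for XP_004133750.1.
print(percent(403, 412), percent(408, 412))
```

The fragment gives 4 identities and 6 positives over 7 aligned positions, and the whole-HSP counts reproduce the reported 97.82% and 99.03%.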

BLAST of Cmc06g0169551 vs. NCBI nr
Match: XP_038905298.1 (probable glycosyltransferase At5g03795 [Benincasa hispida])

HSP 1 Score: 832.8 bits (2150), Expect = 1.3e-237
Identity = 396/412 (96.12%), Positives = 407/412 (98.79%), Query Frame = 0

Query: 1   MSALKHPFQLTSSSSSPLCSLRASLLTLAVLTLLSFTYLSFTSLHSSPPSSPSQLPVKLG 60
           MSA+KHPFQLTSSSSSPLCSLRASLLTLAVLTLLSFTYLSFTSLHSSPPSSPSQLP+KLG
Sbjct: 59  MSAIKHPFQLTSSSSSPLCSLRASLLTLAVLTLLSFTYLSFTSLHSSPPSSPSQLPIKLG 118

Query: 61  ALNDAADEEISDVYHSPQVFRLNYEEMDSKFKVYIYPDGDPNTFYQTPRKLTGKYASEGY 120
            LNDAADEEISDVYHSPQVFRLNY EM+SKFKVYIYPDGDPNTFYQTPRKLTGKYASEGY
Sbjct: 119 GLNDAADEEISDVYHSPQVFRLNYAEMESKFKVYIYPDGDPNTFYQTPRKLTGKYASEGY 178

Query: 121 FFQNIRESRFRTEDPDQAHLFFIPISCHKMRGKGTSYENMTIIVQNYVEGLISKYPYWNR 180
           FFQNIRESRFRTEDPDQAHLFFIPISCHKMRGKGTSYENMTIIVQNYVEGLISKYPYWNR
Sbjct: 179 FFQNIRESRFRTEDPDQAHLFFIPISCHKMRGKGTSYENMTIIVQNYVEGLISKYPYWNR 238

Query: 181 TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP 240
           TLGADHFFVTCHDVGVRA EG PFLIKN IRVVCSPSYDVGF+PHKD+ALPQVLQPFALP
Sbjct: 239 TLGADHFFVTCHDVGVRAFEGFPFLIKNTIRVVCSPSYDVGFVPHKDIALPQVLQPFALP 298

Query: 241 AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRISRATGHLLYQKRFYKT 300
           AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRISRATGHL+YQKRFYKT
Sbjct: 299 AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRISRATGHLVYQKRFYKT 358

Query: 301 KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQ 360
           KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQ
Sbjct: 359 KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQ 418

Query: 361 ILKDISDMEFIKLHKNLIQVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIKY 413
           ILK+ISD+EF+KLHKNL+QVQKHFQWNSPP+KYDAFHMVMYDLWLRHHVIKY
Sbjct: 419 ILKNISDLEFMKLHKNLVQVQKHFQWNSPPMKYDAFHMVMYDLWLRHHVIKY 470

BLAST of Cmc06g0169551 vs. NCBI nr
Match: XP_022929527.1 (probable glycosyltransferase At5g03795 isoform X1 [Cucurbita moschata])

HSP 1 Score: 815.8 bits (2106), Expect = 1.6e-232
Identity = 390/412 (94.66%), Positives = 402/412 (97.57%), Query Frame = 0

Query: 1   MSALKHPFQLTSSSSSPLCSLRASLLTLAVLTLLSFTYLSFTSLHSSPPSSPSQLPVKLG 60
           MSA+KHPFQLTSSSSS LCSLRASLLTLAVLT LS TYLSFTSLHSS  SSPSQLPVKLG
Sbjct: 68  MSAIKHPFQLTSSSSSRLCSLRASLLTLAVLTFLSLTYLSFTSLHSSASSSPSQLPVKLG 127

Query: 61  ALNDAADEEISDVYHSPQVFRLNYEEMDSKFKVYIYPDGDPNTFYQTPRKLTGKYASEGY 120
            + DAAD+EISDVYHSP+VFRLNY EM+ KFKVYIYPDGDPNTFYQTPRKLTGKYASEGY
Sbjct: 128 GIEDAADDEISDVYHSPEVFRLNYAEMERKFKVYIYPDGDPNTFYQTPRKLTGKYASEGY 187

Query: 121 FFQNIRESRFRTEDPDQAHLFFIPISCHKMRGKGTSYENMTIIVQNYVEGLISKYPYWNR 180
           FFQNIRESRFRTEDPDQAHLFFIPISCHKMRGKGTSYENMTIIVQNYVE LISKYPYWNR
Sbjct: 188 FFQNIRESRFRTEDPDQAHLFFIPISCHKMRGKGTSYENMTIIVQNYVESLISKYPYWNR 247

Query: 181 TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP 240
           TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP
Sbjct: 248 TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP 307

Query: 241 AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRISRATGHLLYQKRFYKT 300
           AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRI+RATGHL+YQKRFY+T
Sbjct: 308 AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRINRATGHLVYQKRFYRT 367

Query: 301 KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQ 360
           KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDI+DW+KFSVIVKERDVYQLKQ
Sbjct: 368 KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDIIDWKKFSVIVKERDVYQLKQ 427

Query: 361 ILKDISDMEFIKLHKNLIQVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIKY 413
           ILKDISD+EFIKLHKNL+QVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIKY
Sbjct: 428 ILKDISDIEFIKLHKNLVQVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIKY 479

BLAST of Cmc06g0169551 vs. NCBI nr
Match: XP_022985045.1 (probable glycosyltransferase At5g03795 isoform X1 [Cucurbita maxima])

HSP 1 Score: 812.8 bits (2098), Expect = 1.4e-231
Identity = 389/412 (94.42%), Positives = 400/412 (97.09%), Query Frame = 0

Query: 1   MSALKHPFQLTSSSSSPLCSLRASLLTLAVLTLLSFTYLSFTSLHSSPPSSPSQLPVKLG 60
           MS +KHPFQL SSSSS LCSLRASLLTLAVLTLLS TYLSFTSLHSS  SSPSQLPVKLG
Sbjct: 65  MSVIKHPFQLASSSSSRLCSLRASLLTLAVLTLLSLTYLSFTSLHSSASSSPSQLPVKLG 124

Query: 61  ALNDAADEEISDVYHSPQVFRLNYEEMDSKFKVYIYPDGDPNTFYQTPRKLTGKYASEGY 120
            L DAAD+EISDVYHSP+VFRLNY EM+ KFKVYIYPDGDPNTFYQTPRKLTGKYASEGY
Sbjct: 125 GLEDAADDEISDVYHSPEVFRLNYAEMERKFKVYIYPDGDPNTFYQTPRKLTGKYASEGY 184

Query: 121 FFQNIRESRFRTEDPDQAHLFFIPISCHKMRGKGTSYENMTIIVQNYVEGLISKYPYWNR 180
           FFQNIRESRFRTEDPDQAH FFIPISCHKMRGKGTSYENMTIIVQNYVE LISKYPYWNR
Sbjct: 185 FFQNIRESRFRTEDPDQAHFFFIPISCHKMRGKGTSYENMTIIVQNYVESLISKYPYWNR 244

Query: 181 TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP 240
           TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP
Sbjct: 245 TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP 304

Query: 241 AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRISRATGHLLYQKRFYKT 300
           AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRI+RATGHL+YQKRFY+T
Sbjct: 305 AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRINRATGHLVYQKRFYRT 364

Query: 301 KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQ 360
           KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDI+DW+KFSVIVKERDVYQLKQ
Sbjct: 365 KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDIIDWKKFSVIVKERDVYQLKQ 424

Query: 361 ILKDISDMEFIKLHKNLIQVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIKY 413
           ILKDISDMEFIKLHKNL+QVQKHFQWNSPPIK+DAFHMVMYDLWLRHHVIKY
Sbjct: 425 ILKDISDMEFIKLHKNLVQVQKHFQWNSPPIKHDAFHMVMYDLWLRHHVIKY 476

BLAST of Cmc06g0169551 vs. ExPASy Swiss-Prot
Match: Q9FFN2 (Probable glycosyltransferase At5g03795 OS=Arabidopsis thaliana OX=3702 GN=At5g03795 PE=3 SV=2)

HSP 1 Score: 268.9 bits (686), Expect = 9.8e-71
Identity = 134/369 (36.31%), Positives = 224/369 (60.70%), Query Frame = 0

Query: 57  VKLGALNDAADE----EISDVYHSPQVFRLNYEEMDSKFKVYIYPDGDPNTFYQTPRKLT 116
           +K  +++D  D+     +  +Y + +VF  +Y EM+ +FK+Y+Y +G+P  F+  P K  
Sbjct: 152 IKAASMDDPVDDPDYVPLGPMYWNAKVFHRSYLEMEKQFKIYVYKEGEPPLFHDGPCK-- 211

Query: 117 GKYASEGYFFQNIR-ESRFRTEDPDQAHLFFIPISCHKM-----RGKGTSYENMTIIVQN 176
             Y+ EG F   I  ++RFRT +PD+AH+F++P S  KM           +  +   V++
Sbjct: 212 SIYSMEGSFIYEIETDTRFRTNNPDKAHVFYLPFSVVKMVRYVYERNSRDFSPIRNTVKD 271

Query: 177 YVEGLISKYPYWNRTLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHK 236
           Y+  +  KYPYWNR++GADHF ++CHD G  AS   P L  N+IR +C+ +    F P K
Sbjct: 272 YINLVGDKYPYWNRSIGADHFILSCHDWGPEASFSHPHLGHNSIRALCNANTSERFKPRK 331

Query: 237 DVALPQV-LQPFALP--AGGNDTENRTTLGFWAGHRNSKIRVILARVWEN-DTELDISNN 296
           DV++P++ L+  +L    GG    +R  L F+AG  +  +R +L + WEN D ++ + + 
Sbjct: 332 DVSIPEINLRTGSLTGLVGGPSPSSRPILAFFAGGVHGPVRPVLLQHWENKDNDIRV-HK 391

Query: 297 RISRATGHLLYQKRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILD 356
            + R T    Y      +KFCICP G +V S RI ++++ GCVPV+++  Y  PF+D+L+
Sbjct: 392 YLPRGTS---YSDMMRNSKFCICPSGYEVASPRIVEALYSGCVPVLINSGYVPPFSDVLN 451

Query: 357 WRKFSVIVKERDVYQLKQILKDISDMEFIKLHKNLIQVQKHFQWNSPPIKYDAFHMVMYD 412
           WR FSVIV   D+  LK IL  IS  +++++++ +++V++HF+ NSP  ++D FHM+++ 
Sbjct: 452 WRSFSVIVSVEDIPNLKTILTSISPRQYLRMYRRVLKVRRHFEVNSPAKRFDVFHMILHS 511
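
The percentages in an HSP summary follow directly from the counts printed next to them: identical (or positive) columns divided by alignment length. A minimal sketch of that arithmetic, assuming the two summary lines keep the layout shown for this hit; `parse_hsp` and the regular expressions are hypothetical helpers written for illustration and are not part of any BLAST tool:

```python
import re

# Patterns for the two HSP summary lines. "Pos\w*" tolerates spelling
# variants of the "Positives" label.
SCORE_RE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
IDENT_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Pos\w* = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp(score_line, identity_line):
    """Return scores from the summary lines and recompute both percentages."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("not an HSP summary line")
    ident, length, pos, pos_length = (int(g) for g in i.groups())
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        # Percentages are recomputed from the counts, not read from the text.
        "identity_pct": round(100 * ident / length, 2),
        "positives_pct": round(100 * pos / pos_length, 2),
    }


hsp = parse_hsp(
    "HSP 1 Score: 268.9 bits (686), Expect = 9.8e-71",
    "Identity = 134/369 (36.31%), Positives = 224/369 (60.70%), Query Frame = 0",
)
print(hsp["identity_pct"], hsp["positives_pct"])
```

Recomputing the values from the counts gives a quick consistency check against the percentages printed in the report.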

BLAST of Cmc06g0169551 vs. ExPASy Swiss-Prot
Match: Q9SSE8 (Probable glycosyltransferase At3g07620 OS=Arabidopsis thaliana OX=3702 GN=At3g07620 PE=3 SV=1)

HSP 1 Score: 246.9 bits (629), Expect = 4.0e-64
Identity = 139/402 (34.58%), Positives = 221/402 (54.98%), Query Frame = 0

Query: 21  LRASLLTLAVLTLLSFTYLSFTSLHSSPPSSPSQLPVKLGALNDAADEEISDVYHSPQVF 80
           + A L T  V  L+    L+++S  SSP      +P               D+Y +P  F
Sbjct: 88  VEAELATARV--LIREAQLNYSSTTSSPLGDEDYVP-------------HGDIYRNPYAF 147

Query: 81  RLNYEEMDSKFKVYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRES--RFRTEDPDQA 140
             +Y  M+  FK+Y+Y +GDP  F+    K    Y+ EG F   +     ++RT DPD+A
Sbjct: 148 HRSYLLMEKMFKIYVYEEGDPPIFHYGLCK--DIYSMEGLFLNFMENDVLKYRTRDPDKA 207

Query: 141 HLFFIPISC-----HKMRGKGTSYENMTIIVQNYVEGLISKYPYWNRTLGADHFFVTCHD 200
           H++F+P S      H           +  ++ +YV+ +  KYPYWN + G DHF ++CHD
Sbjct: 208 HVYFLPFSVVMILHHLFDPVVRDKAVLERVIADYVQIISKKYPYWNTSDGFDHFMLSCHD 267

Query: 201 VGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQV---LQPFALPAGGNDTENRT 260
            G RA+  +  L  N+IRV+C+ +    F P KD   P++           GG D  +RT
Sbjct: 268 WGHRATWYVKKLFFNSIRVLCNANISEYFNPEKDAPFPEINLLTGDINNLTGGLDPISRT 327

Query: 261 TLGFWAGHRNSKIRVILARVW-ENDTELDISNNRISRATGHLLYQKRFYKTKFCICPGGS 320
           TL F+AG  + KIR +L   W E D ++ +  N        L Y +   K++FCICP G 
Sbjct: 328 TLAFFAGKSHGKIRPVLLNHWKEKDKDILVYEN----LPDGLDYTEMMRKSRFCICPSGH 387

Query: 321 QVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQILKDISDME 380
           +V S R+ ++I+ GCVPV++S+ Y LPF+D+L+W KFSV V  +++ +LK+IL DI +  
Sbjct: 388 EVASPRVPEAIYSGCVPVLISENYVLPFSDVLNWEKFSVSVSVKEIPELKRILMDIPEER 447

Query: 381 FIKLHKNLIQVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIK 412
           +++L++ + +V++H   N PP +YD F+M+++ +WLR   +K
Sbjct: 448 YMRLYEGVKKVKRHILVNDPPKRYDVFNMIIHSIWLRRLNVK 468

BLAST of Cmc06g0169551 vs. ExPASy Swiss-Prot
Match: Q9LFP3 (Probable glycosyltransferase At5g11130 OS=Arabidopsis thaliana OX=3702 GN=At5g11130/At5g11120 PE=3 SV=2)

HSP 1 Score: 245.4 bits (625), Expect = 1.2e-63
Identity = 127/346 (36.71%), Positives = 203/346 (58.67%), Query Frame = 0

Query: 73  VYHSPQVFRLNYEEMDSKFKVYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR--ESRF 132
           VY +   F  +++EM+ +FK++ Y +G+   F++ P  L   YA EG F   I    SRF
Sbjct: 131 VYLNAFTFHQSHKEMEKRFKIWTYREGEAPLFHKGP--LNNIYAIEGQFMDEIENGNSRF 190

Query: 133 RTEDPDQAHLFFIPIS----CHKMRGKGTSY--ENMTIIVQNYVEGLISKYPYWNRTLGA 192
           +   P++A +F+IP+        +    TSY  + +  IV++Y+  + ++YPYWNR+ GA
Sbjct: 191 KAASPEEATVFYIPVGIVNIIRFVYRPYTSYARDRLQNIVKDYISLISNRYPYWNRSRGA 250

Query: 193 DHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQP---FALPA 252
           DHFF++CHD     S   P L K+ IR +C+ +   GF P +DV+LP++  P        
Sbjct: 251 DHFFLSCHDWAPDVSAVDPELYKHFIRALCNANSSEGFTPMRDVSLPEINIPHSQLGFVH 310

Query: 253 GGNDTENRTTLGFWAGHRNSKIRVILARVW-ENDTELDISNNRISRATGHLLYQKRFYKT 312
            G   +NR  L F+AG  +  +R IL + W E D ++ +  N        + Y K   K 
Sbjct: 311 TGEPPQNRKLLAFFAGGSHGDVRKILFQHWKEKDKDVLVYEN----LPKTMNYTKMMDKA 370

Query: 313 KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQ 372
           KFC+CP G +V S RI +S++ GCVPVI++DYY LPF+D+L+W+ FSV +    +  +K+
Sbjct: 371 KFCLCPSGWEVASPRIVESLYSGCVPVIIADYYVLPFSDVLNWKTFSVHIPISKMPDIKK 430

Query: 373 ILKDISDMEFIKLHKNLIQVQKHFQWNSPPIKYDAFHMVMYDLWLR 407
           IL+ I++ E++ + + +++V+KHF  N P   YD  HM+M+ +WLR
Sbjct: 431 ILEAITEEEYLNMQRRVLEVRKHFVINRPSKPYDMLHMIMHSIWLR 470

BLAST of Cmc06g0169551 vs. ExPASy Swiss-Prot
Match: Q94AA9 (Xylogalacturonan beta-1,3-xylosyltransferase OS=Arabidopsis thaliana OX=3702 GN=XGD1 PE=1 SV=2)

HSP 1 Score: 240.4 bits (612), Expect = 3.7e-62
Identity = 129/378 (34.13%), Positives = 208/378 (55.03%), Query Frame = 0

Query: 53  SQLPVKLGALNDAADEE--ISDVYHSPQVFRLNYEEMDSKFKVYIYPDGDPNTFYQTPRK 112
           S L     A+  AA  +  +S +Y +P  F  ++ EM ++FKV+ Y +G+   F+  P  
Sbjct: 124 SDLAKARAAIKKAASTQNYVSSLYKNPAAFHQSHTEMMNRFKVWTYTEGEVPLFHDGP-- 183

Query: 113 LTGKYASEGYFFQNI------RESRFRTEDPDQAHLFFIPISCHKM---------RGKGT 172
           +   Y  EG F   +        SRFR + P+ AH+FFIP S  K+           +G 
Sbjct: 184 VNDIYGIEGQFMDEMCVDGPKSRSRFRADRPENAHVFFIPFSVAKVIHFVYKPITSVEGF 243

Query: 173 SYENMTIIVQNYVEGLISKYPYWNRTLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCS 232
           S   +  ++++YV+ + +K+PYWNR+ G DHF V+CHD      +G P L +  IR +C+
Sbjct: 244 SRARLHRLIEDYVDVVATKHPYWNRSQGGDHFMVSCHDWAPDVIDGNPKLFEKFIRGLCN 303

Query: 233 PSYDVGFIPHKDVALPQVLQPFALPAG-------GNDTENRTTLGFWAGHRNSKIRVILA 292
            +   GF P+ DV++P++     LP G       G     R+ L F+AG  + +IR IL 
Sbjct: 304 ANTSEGFRPNVDVSIPEIY----LPKGKLGPSFLGKSPRVRSILAFFAGRSHGEIRKILF 363

Query: 293 RVWENDTELDISNNRISRATGHLLYQKRFYKTKFCICPGGSQVNSARIADSIHYGCVPVI 352
           + W+   E+D       R      Y K    +KFC+CP G +V S R  ++I+ GCVPVI
Sbjct: 364 QHWK---EMDNEVQVYDRLPPGKDYTKTMGMSKFCLCPSGWEVASPREVEAIYAGCVPVI 423

Query: 353 LSDYYDLPFNDILDWRKFSVIVKERDVYQLKQILKDISDMEFIKLHKNLIQVQKHFQWNS 407
           +SD Y LPF+D+L+W  FS+ +    + ++K IL+ +S + ++K++K +++V++HF  N 
Sbjct: 424 ISDNYSLPFSDVLNWDSFSIQIPVSRIKEIKTILQSVSLVRYLKMYKRVLEVKQHFVLNR 483

BLAST of Cmc06g0169551 vs. ExPASy Swiss-Prot
Match: Q3E7Q9 (Probable glycosyltransferase At5g25310 OS=Arabidopsis thaliana OX=3702 GN=At5g25310 PE=3 SV=2)

HSP 1 Score: 238.4 bits (607), Expect = 1.4e-61
Identity = 128/357 (35.85%), Positives = 200/357 (56.02%), Query Frame = 0

Query: 71  SDVYHSPQVFRLNYEEMDSKFKVYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNI--RES 130
           S++Y +P     +Y EM+ +FKVY+Y +G+P   +  P K    YA EG F   +  R +
Sbjct: 131 SEIYRNPSALYRSYLEMEKRFKVYVYEEGEPPLVHDGPCK--SVYAVEGRFITEMEKRRT 190

Query: 131 RFRTEDPDQAHLFFIPIS----CHKMRGKGTSYENMTIIVQNYVEGLISKYPYWNRTLGA 190
           +FRT DP+QA+++F+P S       +    +  + +   V +Y+  + + +P+WNRT GA
Sbjct: 191 KFRTYDPNQAYVYFLPFSVTWLVRYLYEGNSDAKPLKTFVSDYIRLVSTNHPFWNRTNGA 250

Query: 191 DHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALPAGGN 250
           DHF +TCHD G   S+    L   +IRV+C+ +   GF P KDV LP++     L  G  
Sbjct: 251 DHFMLTCHDWGPLTSQANRDLFNTSIRVMCNANSSEGFNPTKDVTLPEI----KLYGGEV 310

Query: 251 DTENRTT----------LGFWAGHRNSKIRVILARVWENDTELDISNNRISRATGHLLYQ 310
           D + R +          LGF+AG  +  +R IL + W+   + D+          HL Y 
Sbjct: 311 DHKLRLSKTLSASPRPYLGFFAGGVHGPVRPILLKHWK---QRDLDMPVYEYLPKHLNYY 370

Query: 311 KRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERD 370
                +KFC CP G +V S R+ ++I+  C+PVILS  + LPF D+L W  FSV+V   +
Sbjct: 371 DFMRSSKFCFCPSGYEVASPRVIEAIYSECIPVILSVNFVLPFTDVLRWETFSVLVDVSE 430

Query: 371 VYQLKQILKDISDMEFIKLHKNLIQVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIK 412
           + +LK+IL  IS+ ++  L  NL  V++HF+ N PP ++DAFH+ ++ +WLR   +K
Sbjct: 431 IPRLKEILMSISNEKYEWLKSNLRYVRRHFELNDPPQRFDAFHLTLHSIWLRRLNLK 478

BLAST of Cmc06g0169551 vs. ExPASy TrEMBL
Match: A0A1S3BN33 (probable glycosyltransferase At5g03795 OS=Cucumis melo OX=3656 GN=LOC103491850 PE=3 SV=1)

HSP 1 Score: 852.4 bits (2201), Expect = 7.7e-244
Identity = 412/412 (100.00%), Positives = 412/412 (100.00%), Query Frame = 0

Query: 1   MSALKHPFQLTSSSSSPLCSLRASLLTLAVLTLLSFTYLSFTSLHSSPPSSPSQLPVKLG 60
           MSALKHPFQLTSSSSSPLCSLRASLLTLAVLTLLSFTYLSFTSLHSSPPSSPSQLPVKLG
Sbjct: 1   MSALKHPFQLTSSSSSPLCSLRASLLTLAVLTLLSFTYLSFTSLHSSPPSSPSQLPVKLG 60

Query: 61  ALNDAADEEISDVYHSPQVFRLNYEEMDSKFKVYIYPDGDPNTFYQTPRKLTGKYASEGY 120
           ALNDAADEEISDVYHSPQVFRLNYEEMDSKFKVYIYPDGDPNTFYQTPRKLTGKYASEGY
Sbjct: 61  ALNDAADEEISDVYHSPQVFRLNYEEMDSKFKVYIYPDGDPNTFYQTPRKLTGKYASEGY 120

Query: 121 FFQNIRESRFRTEDPDQAHLFFIPISCHKMRGKGTSYENMTIIVQNYVEGLISKYPYWNR 180
           FFQNIRESRFRTEDPDQAHLFFIPISCHKMRGKGTSYENMTIIVQNYVEGLISKYPYWNR
Sbjct: 121 FFQNIRESRFRTEDPDQAHLFFIPISCHKMRGKGTSYENMTIIVQNYVEGLISKYPYWNR 180

Query: 181 TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP 240
           TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP
Sbjct: 181 TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP 240

Query: 241 AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRISRATGHLLYQKRFYKT 300
           AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRISRATGHLLYQKRFYKT
Sbjct: 241 AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRISRATGHLLYQKRFYKT 300

Query: 301 KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQ 360
           KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQ
Sbjct: 301 KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQ 360

Query: 361 ILKDISDMEFIKLHKNLIQVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIKY 413
           ILKDISDMEFIKLHKNLIQVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIKY
Sbjct: 361 ILKDISDMEFIKLHKNLIQVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIKY 412

BLAST of Cmc06g0169551 vs. ExPASy TrEMBL
Match: A0A6J1EN04 (probable glycosyltransferase At5g03795 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111436068 PE=3 SV=1)

HSP 1 Score: 815.8 bits (2106), Expect = 7.9e-233
Identity = 390/412 (94.66%), Positives = 402/412 (97.57%), Query Frame = 0

Query: 1   MSALKHPFQLTSSSSSPLCSLRASLLTLAVLTLLSFTYLSFTSLHSSPPSSPSQLPVKLG 60
           MSA+KHPFQLTSSSSS LCSLRASLLTLAVLT LS TYLSFTSLHSS  SSPSQLPVKLG
Sbjct: 68  MSAIKHPFQLTSSSSSRLCSLRASLLTLAVLTFLSLTYLSFTSLHSSASSSPSQLPVKLG 127

Query: 61  ALNDAADEEISDVYHSPQVFRLNYEEMDSKFKVYIYPDGDPNTFYQTPRKLTGKYASEGY 120
            + DAAD+EISDVYHSP+VFRLNY EM+ KFKVYIYPDGDPNTFYQTPRKLTGKYASEGY
Sbjct: 128 GIEDAADDEISDVYHSPEVFRLNYAEMERKFKVYIYPDGDPNTFYQTPRKLTGKYASEGY 187

Query: 121 FFQNIRESRFRTEDPDQAHLFFIPISCHKMRGKGTSYENMTIIVQNYVEGLISKYPYWNR 180
           FFQNIRESRFRTEDPDQAHLFFIPISCHKMRGKGTSYENMTIIVQNYVE LISKYPYWNR
Sbjct: 188 FFQNIRESRFRTEDPDQAHLFFIPISCHKMRGKGTSYENMTIIVQNYVESLISKYPYWNR 247

Query: 181 TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP 240
           TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP
Sbjct: 248 TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP 307

Query: 241 AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRISRATGHLLYQKRFYKT 300
           AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRI+RATGHL+YQKRFY+T
Sbjct: 308 AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRINRATGHLVYQKRFYRT 367

Query: 301 KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQ 360
           KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDI+DW+KFSVIVKERDVYQLKQ
Sbjct: 368 KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDIIDWKKFSVIVKERDVYQLKQ 427

Query: 361 ILKDISDMEFIKLHKNLIQVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIKY 413
           ILKDISD+EFIKLHKNL+QVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIKY
Sbjct: 428 ILKDISDIEFIKLHKNLVQVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIKY 479

BLAST of Cmc06g0169551 vs. ExPASy TrEMBL
Match: A0A6J1JA93 (probable glycosyltransferase At5g03795 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111483136 PE=3 SV=1)

HSP 1 Score: 812.8 bits (2098), Expect = 6.7e-232
Identity = 389/412 (94.42%), Positives = 400/412 (97.09%), Query Frame = 0

Query: 1   MSALKHPFQLTSSSSSPLCSLRASLLTLAVLTLLSFTYLSFTSLHSSPPSSPSQLPVKLG 60
           MS +KHPFQL SSSSS LCSLRASLLTLAVLTLLS TYLSFTSLHSS  SSPSQLPVKLG
Sbjct: 65  MSVIKHPFQLASSSSSRLCSLRASLLTLAVLTLLSLTYLSFTSLHSSASSSPSQLPVKLG 124

Query: 61  ALNDAADEEISDVYHSPQVFRLNYEEMDSKFKVYIYPDGDPNTFYQTPRKLTGKYASEGY 120
            L DAAD+EISDVYHSP+VFRLNY EM+ KFKVYIYPDGDPNTFYQTPRKLTGKYASEGY
Sbjct: 125 GLEDAADDEISDVYHSPEVFRLNYAEMERKFKVYIYPDGDPNTFYQTPRKLTGKYASEGY 184

Query: 121 FFQNIRESRFRTEDPDQAHLFFIPISCHKMRGKGTSYENMTIIVQNYVEGLISKYPYWNR 180
           FFQNIRESRFRTEDPDQAH FFIPISCHKMRGKGTSYENMTIIVQNYVE LISKYPYWNR
Sbjct: 185 FFQNIRESRFRTEDPDQAHFFFIPISCHKMRGKGTSYENMTIIVQNYVESLISKYPYWNR 244

Query: 181 TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP 240
           TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP
Sbjct: 245 TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP 304

Query: 241 AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRISRATGHLLYQKRFYKT 300
           AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRI+RATGHL+YQKRFY+T
Sbjct: 305 AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRINRATGHLVYQKRFYRT 364

Query: 301 KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQ 360
           KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDI+DW+KFSVIVKERDVYQLKQ
Sbjct: 365 KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDIIDWKKFSVIVKERDVYQLKQ 424

Query: 361 ILKDISDMEFIKLHKNLIQVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIKY 413
           ILKDISDMEFIKLHKNL+QVQKHFQWNSPPIK+DAFHMVMYDLWLRHHVIKY
Sbjct: 425 ILKDISDMEFIKLHKNLVQVQKHFQWNSPPIKHDAFHMVMYDLWLRHHVIKY 476

BLAST of Cmc06g0169551 vs. ExPASy TrEMBL
Match: A0A6J1ESD6 (probable glycosyltransferase At5g03795 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111436068 PE=3 SV=1)

HSP 1 Score: 802.4 bits (2071), Expect = 9.1e-229
Identity = 386/412 (93.69%), Positives = 398/412 (96.60%), Query Frame = 0

Query: 1   MSALKHPFQLTSSSSSPLCSLRASLLTLAVLTLLSFTYLSFTSLHSSPPSSPSQLPVKLG 60
           MSA+KHPFQLTSSSSS LCSLRASLLTLAVLT LS TYLSFTSLHSS  SSPSQ    LG
Sbjct: 68  MSAIKHPFQLTSSSSSRLCSLRASLLTLAVLTFLSLTYLSFTSLHSSASSSPSQ----LG 127

Query: 61  ALNDAADEEISDVYHSPQVFRLNYEEMDSKFKVYIYPDGDPNTFYQTPRKLTGKYASEGY 120
            + DAAD+EISDVYHSP+VFRLNY EM+ KFKVYIYPDGDPNTFYQTPRKLTGKYASEGY
Sbjct: 128 GIEDAADDEISDVYHSPEVFRLNYAEMERKFKVYIYPDGDPNTFYQTPRKLTGKYASEGY 187

Query: 121 FFQNIRESRFRTEDPDQAHLFFIPISCHKMRGKGTSYENMTIIVQNYVEGLISKYPYWNR 180
           FFQNIRESRFRTEDPDQAHLFFIPISCHKMRGKGTSYENMTIIVQNYVE LISKYPYWNR
Sbjct: 188 FFQNIRESRFRTEDPDQAHLFFIPISCHKMRGKGTSYENMTIIVQNYVESLISKYPYWNR 247

Query: 181 TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP 240
           TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP
Sbjct: 248 TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP 307

Query: 241 AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRISRATGHLLYQKRFYKT 300
           AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRI+RATGHL+YQKRFY+T
Sbjct: 308 AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRINRATGHLVYQKRFYRT 367

Query: 301 KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQ 360
           KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDI+DW+KFSVIVKERDVYQLKQ
Sbjct: 368 KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDIIDWKKFSVIVKERDVYQLKQ 427

Query: 361 ILKDISDMEFIKLHKNLIQVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIKY 413
           ILKDISD+EFIKLHKNL+QVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIKY
Sbjct: 428 ILKDISDIEFIKLHKNLVQVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIKY 475

BLAST of Cmc06g0169551 vs. ExPASy TrEMBL
Match: A0A6J1J700 (probable glycosyltransferase At5g03795 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111483136 PE=3 SV=1)

HSP 1 Score: 799.3 bits (2063), Expect = 7.7e-228
Identity = 385/412 (93.45%), Positives = 396/412 (96.12%), Query Frame = 0

Query: 1   MSALKHPFQLTSSSSSPLCSLRASLLTLAVLTLLSFTYLSFTSLHSSPPSSPSQLPVKLG 60
           MS +KHPFQL SSSSS LCSLRASLLTLAVLTLLS TYLSFTSLHSS  SSPSQ    LG
Sbjct: 65  MSVIKHPFQLASSSSSRLCSLRASLLTLAVLTLLSLTYLSFTSLHSSASSSPSQ----LG 124

Query: 61  ALNDAADEEISDVYHSPQVFRLNYEEMDSKFKVYIYPDGDPNTFYQTPRKLTGKYASEGY 120
            L DAAD+EISDVYHSP+VFRLNY EM+ KFKVYIYPDGDPNTFYQTPRKLTGKYASEGY
Sbjct: 125 GLEDAADDEISDVYHSPEVFRLNYAEMERKFKVYIYPDGDPNTFYQTPRKLTGKYASEGY 184

Query: 121 FFQNIRESRFRTEDPDQAHLFFIPISCHKMRGKGTSYENMTIIVQNYVEGLISKYPYWNR 180
           FFQNIRESRFRTEDPDQAH FFIPISCHKMRGKGTSYENMTIIVQNYVE LISKYPYWNR
Sbjct: 185 FFQNIRESRFRTEDPDQAHFFFIPISCHKMRGKGTSYENMTIIVQNYVESLISKYPYWNR 244

Query: 181 TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP 240
           TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP
Sbjct: 245 TLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALP 304

Query: 241 AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRISRATGHLLYQKRFYKT 300
           AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRI+RATGHL+YQKRFY+T
Sbjct: 305 AGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRINRATGHLVYQKRFYRT 364

Query: 301 KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQ 360
           KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDI+DW+KFSVIVKERDVYQLKQ
Sbjct: 365 KFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDIIDWKKFSVIVKERDVYQLKQ 424

Query: 361 ILKDISDMEFIKLHKNLIQVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIKY 413
           ILKDISDMEFIKLHKNL+QVQKHFQWNSPPIK+DAFHMVMYDLWLRHHVIKY
Sbjct: 425 ILKDISDMEFIKLHKNLVQVQKHFQWNSPPIKHDAFHMVMYDLWLRHHVIKY 472

BLAST of Cmc06g0169551 vs. TAIR 10
Match: AT4G38040.1 (Exostosin family protein)

HSP 1 Score: 692.6 bits (1786), Expect = 1.9e-199
Identity = 326/417 (78.18%), Positives = 364/417 (87.29%), Query Frame = 0

Query: 8   FQLTSSSSSPLCSLRASLLTLAVLTLLSFTYLSFTSLHSSPPS-----SPSQLP------ 67
           F  +   SSPLCSL++SLLT+A+LT +S  YLS  SL +SPPS     +P  +P      
Sbjct: 9   FSFSGGGSSPLCSLKSSLLTVAILTFISLFYLSLNSLRTSPPSPVIVVTPIHVPHTFVNE 68

Query: 68  VKLGALNDAADEE-ISDVYHSPQVFRLNYEEMDSKFKVYIYPDGDPNTFYQTPRKLTGKY 127
            K        +EE  SDVYHSP+ FRLNY EM+ +FKVYIYPDGDPNTFYQTPRK+TGKY
Sbjct: 69  YKTDNETPTMEEETYSDVYHSPEAFRLNYAEMEKRFKVYIYPDGDPNTFYQTPRKVTGKY 128

Query: 128 ASEGYFFQNIRESRFRTEDPDQAHLFFIPISCHKMRGKGTSYENMTIIVQNYVEGLISKY 187
           ASEGYFFQNIRESRFRT DPD+A LFFIPISCHKMRGKGTSYENMT+IVQNYV+GLI+KY
Sbjct: 129 ASEGYFFQNIRESRFRTLDPDEADLFFIPISCHKMRGKGTSYENMTVIVQNYVDGLIAKY 188

Query: 188 PYWNRTLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQ 247
           PYWNRTLGADHFFVTCHDVGVRA EG P LIKN IRVVCSPSY+VGFIPHKDVALPQVLQ
Sbjct: 189 PYWNRTLGADHFFVTCHDVGVRAFEGSPLLIKNTIRVVCSPSYNVGFIPHKDVALPQVLQ 248

Query: 248 PFALPAGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRISRATGHLLYQK 307
           PFALPAGGND ENRTTLGFWAGHRNSKIRVILA VWENDTELDISNNRI+RATGHL+YQK
Sbjct: 249 PFALPAGGNDVENRTTLGFWAGHRNSKIRVILAHVWENDTELDISNNRINRATGHLVYQK 308

Query: 308 RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDV 367
           RFY+TKFCICPGGSQVNSARI DSIHYGC+PVILSDYYDLPFNDIL+WRKF+V+++E+DV
Sbjct: 309 RFYRTKFCICPGGSQVNSARITDSIHYGCIPVILSDYYDLPFNDILNWRKFAVVLREQDV 368

Query: 368 YQLKQILKDISDMEFIKLHKNLIQVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIKY 413
           Y LKQILK+I   EF+ LH NL++VQKHFQWNSPP+K+DAFHM+MY+LWLRHHV+KY
Sbjct: 369 YNLKQILKNIPHSEFVSLHNNLVKVQKHFQWNSPPVKFDAFHMIMYELWLRHHVVKY 425

BLAST of Cmc06g0169551 vs. TAIR 10
Match: AT5G03795.1 (Exostosin family protein)

HSP 1 Score: 268.9 bits (686), Expect = 7.0e-72
Identity = 134/369 (36.31%), Positives = 224/369 (60.70%), Query Frame = 0

Query: 57  VKLGALNDAADE----EISDVYHSPQVFRLNYEEMDSKFKVYIYPDGDPNTFYQTPRKLT 116
           +K  +++D  D+     +  +Y + +VF  +Y EM+ +FK+Y+Y +G+P  F+  P K  
Sbjct: 152 IKAASMDDPVDDPDYVPLGPMYWNAKVFHRSYLEMEKQFKIYVYKEGEPPLFHDGPCK-- 211

Query: 117 GKYASEGYFFQNIR-ESRFRTEDPDQAHLFFIPISCHKM-----RGKGTSYENMTIIVQN 176
             Y+ EG F   I  ++RFRT +PD+AH+F++P S  KM           +  +   V++
Sbjct: 212 SIYSMEGSFIYEIETDTRFRTNNPDKAHVFYLPFSVVKMVRYVYERNSRDFSPIRNTVKD 271

Query: 177 YVEGLISKYPYWNRTLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHK 236
           Y+  +  KYPYWNR++GADHF ++CHD G  AS   P L  N+IR +C+ +    F P K
Sbjct: 272 YINLVGDKYPYWNRSIGADHFILSCHDWGPEASFSHPHLGHNSIRALCNANTSERFKPRK 331

Query: 237 DVALPQV-LQPFALP--AGGNDTENRTTLGFWAGHRNSKIRVILARVWEN-DTELDISNN 296
           DV++P++ L+  +L    GG    +R  L F+AG  +  +R +L + WEN D ++ + + 
Sbjct: 332 DVSIPEINLRTGSLTGLVGGPSPSSRPILAFFAGGVHGPVRPVLLQHWENKDNDIRV-HK 391

Query: 297 RISRATGHLLYQKRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILD 356
            + R T    Y      +KFCICP G +V S RI ++++ GCVPV+++  Y  PF+D+L+
Sbjct: 392 YLPRGTS---YSDMMRNSKFCICPSGYEVASPRIVEALYSGCVPVLINSGYVPPFSDVLN 451

Query: 357 WRKFSVIVKERDVYQLKQILKDISDMEFIKLHKNLIQVQKHFQWNSPPIKYDAFHMVMYD 412
           WR FSVIV   D+  LK IL  IS  +++++++ +++V++HF+ NSP  ++D FHM+++ 
Sbjct: 452 WRSFSVIVSVEDIPNLKTILTSISPRQYLRMYRRVLKVRRHFEVNSPAKRFDVFHMILHS 511

BLAST of Cmc06g0169551 vs. TAIR 10
Match: AT4G16745.1 (Exostosin family protein)

HSP 1 Score: 265.8 bits (678), Expect = 5.9e-71
Identity = 130/345 (37.68%), Positives = 209/345 (60.58%), Query Frame = 0

Query: 73  VYHSPQVFRLNYEEMDSKFKVYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESR-FR 132
           ++ +  VF+ +YE M+   KVYIYPDGD   F++    L G YASEG+F + +  ++ F 
Sbjct: 182 LFRNLSVFKRSYELMELILKVYIYPDGDKPIFHEP--HLNGIYASEGWFMKLMESNKQFV 241

Query: 133 TEDPDQAHLFFIPISCHKMRGK-----GTSYENMTIIVQNYVEGLISKYPYWNRTLGADH 192
           T++P++AHLF++P S  +++         + + ++I +++YV  L  KYP+WNRT G+DH
Sbjct: 242 TKNPERAHLFYMPYSVKQLQKSIFVPGSHNIKPLSIFLRDYVNMLSIKYPFWNRTHGSDH 301

Query: 193 FFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVG-FIPHKDVALPQVL-----QPFALP 252
           F V CHD G       P L +NAI+ +C+     G F+P KDV+LP+       +P    
Sbjct: 302 FLVACHDWGPYTVNEHPELKRNAIKALCNADLSDGIFVPGKDVSLPETSIRNAGRPLRNI 361

Query: 253 AGGNDTENRTTLGFWAGHRNSKIRVILARVWEN-DTELDISNNRISRATGHLLYQKRFYK 312
             GN    R  L F+AG+ + ++R  L + W N D ++ I           + Y +    
Sbjct: 362 GNGNRVSQRPILAFFAGNLHGRVRPKLLKHWRNKDEDMKIYGPLPHNVARKMTYVQHMKS 421

Query: 313 TKFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLK 372
           +K+C+CP G +VNS RI ++I+Y CVPV+++D + LPF+D+LDW  FSV+V E+++ +LK
Sbjct: 422 SKYCLCPMGYEVNSPRIVEAIYYECVPVVIADNFMLPFSDVLDWSAFSVVVPEKEIPRLK 481

Query: 373 QILKDISDMEFIKLHKNLIQVQKHFQWNSPPIKYDAFHMVMYDLW 405
           +IL +I    ++K+  N+  VQ+HF W+  P KYD FHM+++ +W
Sbjct: 482 EILLEIPMRRYLKMQSNVKMVQRHFLWSPKPRKYDVFHMILHSIW 524

BLAST of Cmc06g0169551 vs. TAIR 10
Match: AT5G11610.1 (Exostosin family protein)

HSP 1 Score: 265.0 bits (676), Expect = 1.0e-70
Identity = 135/359 (37.60%), Positives = 209/359 (58.22%), Query Frame = 0

Query: 57  VKLGALNDAADEEISDVYHSPQVFRLNYEEMDSKFKVYIYPDGDPNTFYQTPRKLTGKYA 116
           +K  AL    D   + +YH+  +F+ +YE M+   KVY+Y +GD   F+Q    + G YA
Sbjct: 186 IKKAALVKKDDTLYAPLYHNISIFKRSYELMEQTLKVYVYSEGDRPIFHQPEAIMEGIYA 245

Query: 117 SEGYFFQNIRES-RFRTEDPDQAHLFFIPISCHKMRGK-----GTSYENMTIIVQNYVEG 176
           SEG+F + +  S RF T+DP +AHLF+IP S   ++ K       S  N+   + NY++ 
Sbjct: 246 SEGWFMKLMESSHRFLTKDPTKAHLFYIPFSSRILQQKLYVHDSHSRNNLVKYLGNYIDL 305

Query: 177 LISKYPYWNRTLGADHFFVTCHDVGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVAL 236
           + S YP WNRT G+DHFF  CHD     + G P++  N IR +C+    + F+  KDV+L
Sbjct: 306 IASNYPSWNRTCGSDHFFTACHDWAPTETRG-PYI--NCIRALCNADVGIDFVVGKDVSL 365

Query: 237 PQV----LQPFALPAGGNDTENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRISR 296
           P+     LQ      GG+    RT L F+AG  +  +R IL   W +  E D+   +I  
Sbjct: 366 PETKVSSLQNPNGKIGGSRPSKRTILAFFAGSLHGYVRPILLNQWSSRPEQDM---KIFN 425

Query: 297 ATGHLLYQKRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKF 356
              H  Y +   +++FC+C  G +VNS R+ +SI YGCVPVI+SD +  PF +IL+W  F
Sbjct: 426 RIDHKSYIRYMKRSRFCVCAKGYEVNSPRVVESILYGCVPVIISDNFVPPFLEILNWESF 485

Query: 357 SVIVKERDVYQLKQILKDISDMEFIKLHKNLIQVQKHFQW-NSPPIKYDAFHMVMYDLW 405
           +V V E+++  L++IL  I    ++++ K +++VQKHF W +  P++YD FHM+++ +W
Sbjct: 486 AVFVPEKEIPNLRKILISIPVRRYVEMQKRVLKVQKHFMWHDGEPVRYDIFHMILHSVW 538

BLAST of Cmc06g0169551 vs. TAIR 10
Match: AT3G07620.1 (Exostosin family protein)

HSP 1 Score: 246.9 bits (629), Expect = 2.8e-65
Identity = 139/402 (34.58%), Positives = 221/402 (54.98%), Query Frame = 0

Query: 21  LRASLLTLAVLTLLSFTYLSFTSLHSSPPSSPSQLPVKLGALNDAADEEISDVYHSPQVF 80
           + A L T  V  L+    L+++S  SSP      +P               D+Y +P  F
Sbjct: 88  VEAELATARV--LIREAQLNYSSTTSSPLGDEDYVP-------------HGDIYRNPYAF 147

Query: 81  RLNYEEMDSKFKVYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRES--RFRTEDPDQA 140
             +Y  M+  FK+Y+Y +GDP  F+    K    Y+ EG F   +     ++RT DPD+A
Sbjct: 148 HRSYLLMEKMFKIYVYEEGDPPIFHYGLCK--DIYSMEGLFLNFMENDVLKYRTRDPDKA 207

Query: 141 HLFFIPISC-----HKMRGKGTSYENMTIIVQNYVEGLISKYPYWNRTLGADHFFVTCHD 200
           H++F+P S      H           +  ++ +YV+ +  KYPYWN + G DHF ++CHD
Sbjct: 208 HVYFLPFSVVMILHHLFDPVVRDKAVLERVIADYVQIISKKYPYWNTSDGFDHFMLSCHD 267

Query: 201 VGVRASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQV---LQPFALPAGGNDTENRT 260
            G RA+  +  L  N+IRV+C+ +    F P KD   P++           GG D  +RT
Sbjct: 268 WGHRATWYVKKLFFNSIRVLCNANISEYFNPEKDAPFPEINLLTGDINNLTGGLDPISRT 327

Query: 261 TLGFWAGHRNSKIRVILARVW-ENDTELDISNNRISRATGHLLYQKRFYKTKFCICPGGS 320
           TL F+AG  + KIR +L   W E D ++ +  N        L Y +   K++FCICP G 
Sbjct: 328 TLAFFAGKSHGKIRPVLLNHWKEKDKDILVYEN----LPDGLDYTEMMRKSRFCICPSGH 387

Query: 321 QVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQILKDISDME 380
           +V S R+ ++I+ GCVPV++S+ Y LPF+D+L+W KFSV V  +++ +LK+IL DI +  
Sbjct: 388 EVASPRVPEAIYSGCVPVLISENYVLPFSDVLNWEKFSVSVSVKEIPELKRILMDIPEER 447

Query: 381 FIKLHKNLIQVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIK 412
           +++L++ + +V++H   N PP +YD F+M+++ +WLR   +K
Sbjct: 448 YMRLYEGVKKVKRHILVNDPPKRYDVFNMIIHSIWLRRLNVK 468

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_008450187.1  1.6e-243  100.00    PREDICTED: probable glycosyltransferase At5g03795 [Cucumis melo]
XP_004133750.1  6.9e-239  97.82     probable glycosyltransferase At5g03795 [Cucumis sativus] >KGN56331.2 hypothetica...
XP_038905298.1  1.3e-237  96.12     probable glycosyltransferase At5g03795 [Benincasa hispida]
XP_022929527.1  1.6e-232  94.66     probable glycosyltransferase At5g03795 isoform X1 [Cucurbita moschata]
XP_022985045.1  1.4e-231  94.42     probable glycosyltransferase At5g03795 isoform X1 [Cucurbita maxima]
Match Name  E-value  Identity  Description
Q9FFN2      9.8e-71  36.31     Probable glycosyltransferase At5g03795 OS=Arabidopsis thaliana OX=3702 GN=At5g03...
Q9SSE8      4.0e-64  34.58     Probable glycosyltransferase At3g07620 OS=Arabidopsis thaliana OX=3702 GN=At3g07...
Q9LFP3      1.2e-63  36.71     Probable glycosyltransferase At5g11130 OS=Arabidopsis thaliana OX=3702 GN=At5g11...
Q94AA9      3.7e-62  34.13     Xylogalacturonan beta-1,3-xylosyltransferase OS=Arabidopsis thaliana OX=3702 GN=...
Q3E7Q9      1.4e-61  35.85     Probable glycosyltransferase At5g25310 OS=Arabidopsis thaliana OX=3702 GN=At5g25...
Match Name  E-value   Identity  Description
A0A1S3BN33  7.7e-244  100.00    probable glycosyltransferase At5g03795 OS=Cucumis melo OX=3656 GN=LOC103491850 P...
A0A6J1EN04  7.9e-233  94.66     probable glycosyltransferase At5g03795 isoform X1 OS=Cucurbita moschata OX=3662 ...
A0A6J1JA93  6.7e-232  94.42     probable glycosyltransferase At5g03795 isoform X1 OS=Cucurbita maxima OX=3661 GN...
A0A6J1ESD6  9.1e-229  93.69     probable glycosyltransferase At5g03795 isoform X2 OS=Cucurbita moschata OX=3662 ...
A0A6J1J700  7.7e-228  93.45     probable glycosyltransferase At5g03795 isoform X2 OS=Cucurbita maxima OX=3661 GN...
Match Name   E-value   Identity  Description
AT4G38040.1  1.9e-199  78.18     Exostosin family protein
AT5G03795.1  7.0e-72   36.31     Exostosin family protein
AT4G16745.1  5.9e-71   37.68     Exostosin family protein
AT5G11610.1  1.0e-70   37.60     Exostosin family protein
AT3G07620.1  2.8e-65   34.58     Exostosin family protein
InterPro
Analysis Name: InterPro Annotations of Melon (Charmono) v1.1
Date Performed: 2022-10-13
IPR Term   IPR Description         Source   Source Term     Source Description                                      Alignment
IPR040911  Exostosin, GT47 domain  PFAM     PF03016         Exostosin                                               coord: 88..364; e-value: 7.0E-58; score: 196.2
IPR004263  Exostosin-like          PANTHER  PTHR11062       EXOSTOSIN HEPARAN SULFATE GLYCOSYLTRANSFERASE -RELATED  coord: 62..411
None       No IPR available        PANTHER  PTHR11062:SF43  EXOSTOSIN FAMILY PROTEIN                                coord: 62..411
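
The `coord:` values above give the start and end residue of each match on the protein. A minimal sketch for turning such a range into a span length, assuming the coordinates are 1-based and inclusive (an assumption, not stated on this page); `span_length` is a hypothetical helper:

```python
import re


def span_length(coord_text):
    """Length of a 'coord: START..END' range, assuming 1-based inclusive ends."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", coord_text)
    if m is None:
        raise ValueError("no 'coord: START..END' range found")
    start, end = int(m.group(1)), int(m.group(2))
    return end - start + 1


pfam_span = span_length("coord: 88..364")     # PF03016 match
panther_span = span_length("coord: 62..411")  # PTHR11062 match
print(pfam_span, panther_span)
```

Under that assumption the PF03016 match covers 277 residues and the PANTHER matches cover 350.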

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
Cmc06g0169551.1  Cmc06g0169551.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006486      protein glycosylation
cellular_component  GO:0000139      Golgi membrane
molecular_function  GO:0016757      glycosyltransferase activity