CsGy5G011850 (gene) Cucumber (Gy14) v2.1

Overview
Name: CsGy5G011850
Type: gene
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: UDP-3-O-acyl-N-acetylglucosamine deacetylase
Location: Gy14Chr5: 13312504 .. 13334105 (+)
RNA-Seq Expression: CsGy5G011850
Synteny: CsGy5G011850
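
The location field above can be turned into a chromosome, a coordinate pair, and a strand. The following is a minimal sketch, not part of the database's own tooling. It assumes the coordinates are 1-based and inclusive (common for GFF-style annotation, but not stated on this page), and the helper names `parse_location` and `span_length` are hypothetical.

```python
import re

def parse_location(text):
    """Split a 'Chrom: start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

chrom, start, end, strand = parse_location("Gy14Chr5: 13312504 .. 13334105 (+)")
print(chrom, strand, span_length(start, end))  # Gy14Chr5 + 21602
```

Under that assumption the gene spans 21,602 bp on the forward strand; with 0-based half-open coordinates the result would differ by one.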
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AGAAAGAATAGAGAAGGATTATGCGGGAAAGTGAGAAAGGATTAGAGAGGGTGTTAGGGGAGTGAGAAGGATTAAGAAACTCCCCTGAATTTTAAATAAGGAACCCACTCCATTCTTGCTGGGCTCAAACAGCCCCAAAATGTTCTAAAATAAACGGACCCTATTTTCCATTTGATTTGAAAGGGGGCGAAACAATAACGTGGGGAACTCCCCTAGCTACCACGTCAACACATTTCTCCGCCCTTCTTCCTATTTCCCCTTCCTGTTTGACCGTTACCATCGCCATCGCCACCGGCCAATGCTCGTTCCTACTGCCTTCAACGCCCTCAAATCCTCAAGGTCAATATCTTGGTCGCCGGTATTCTCTCTCTCTCTTTCCATAGAAATGGTGTTTCTCTGAGAACAAAAGAACTGCTGAAAACAATTGAATGGAAGGTAGTTTTCATACTCACTGTGTGTGATCATTCTGGGTTGCTACTTTGCTTCTTTTTGTTGTGTAGACGGGCAGGCTTCAGCAAACCTTGGCGGGATGTCTGGAGCTGAGTGGGATATCTCTGCATTCTGGGAAAGTTGCGAAAGTGAAGTTGTGTCCCGAGTTTGCTGGCAGAGGAAGGTATTTTGATTTCAAATCCAACTTCATTCCTGCATCGATCGATTATGCCGAAGACTCGCCTCTTTGCACCACGCTTTCTAAGGATGGGTTTAAAATTCGAACAGTTGAGCATTTGCTTTCTGCCATGGAGGCCATGGGGGTCGACAATTGTAGGATCGTGATAACGAATGAAGATGCCAAAGACAGCGAAGTCGAGGTACGTGTGACTAAGGCAGTGAGTTCATTTTATCATCATATTTGCGAAAATCAAGCTTGGCCCATATTGATTGATTTCCCTGGATGCCGGAAACATGATAACCTGTTCAGTTCTACTGTCATTTGGCTATATATATATATTTATACGTACTTATGTTATGAATATTTAAAGAACCAGCTTTCACCAAGAAAAGCACCTTCTCCATTATAAACCAACAGTTCGGTTTTGAGCCAATTGAGCATAAAAAATTCGAAAAAGAAGAAGGAAAAAAGAAAAAATCCCAAACCACACTGGCAAACTGTTAATTACAAAGGATGTGCTCTATGTCCTGTGATGCACCTTTGCAAAGAATGCACCATTGCAGTTCCAACAATATGGAAGAGTGCCTAAGAATATAAGCCACAATGTCAACTTTCCCATGCAAAACCTGCTTTATTAATTTTTTTTAAACTTAAACATTTCATAATGTGGAAAACAAAGGGGAATTGGGAGAGCAAGGATCACACAAAAGAAGGAAGTATGAGCAACATAAATCCACAAAAGAACGATGGTGCGAGACTTAATGATCCATTCTTCCTTAAGGAACATGGATAAACTCCAACAAAGAAAATGAAGAAGAGACATCCATCGACTCTCTATCAAACGAAGGGTGATAGACCCCAAAATTAAATGAGGAAGGACCCAAGAAGAAACAAGTGTGGCTAGATAACTACCCTTCAAGGAAGAGAGGTGGTATAATCAAAAGAACAAAGAGGTATTACCCAATCAACCATCCCCTCAAAAGTAATTGCTCAAACCATCTTGCATATGCCTTTTGACTCTTGAATAGAAACCCACAAACGACAAGGAAGGTTGTACTTGCTGGCAATAACTCTATGCCATAAGTCATAAAGCTCCAATTGTAAGCACCATAGTTCACAACCACTTGGCTAATAAGGTCTTGTTGTGAATCTTAGATTTCCAATACCTAAGCTCCCAAGGTTAATTGGTCTAAAAACGACCTCCCACTTGATGAAGTGGGAAGCCCCACCCTCCATACTTTTCCATAGGAAATCTCTCCTAAGCTTACCAAAAATGGGTATGTAGATTCCATCTTGAAAAAGACATTGTTCAGGCAATAGTTCTTTTATTAATGACTGAGCTCAGGTCTGCTATGCCATTGTATAGGCTTTGGAAACACATCTTCAAGGC
AAAAGTTCTGTCCAATTATAATTTATAGCTCTTGTTTGTATATTTTATTTCCTTATTTGTACTTAGCTCTTACTTGGTAATTGTTTCGTTGTATATTTTTTCCTTCCAAATTTGTTTTTCCTTTAATCTCTATTGAGAGATTAGTCTTGTACCCTATTTTAGTTAATTAGAAGAAAAGTCATCTTCTTGAATATTGAGATGGTATCAGAGCCTTGAGACTTTTTTTTCTGTCTAGTTGGCCTTCATTGTCGATCAGGTTTTGTAACAACCCAACTCTTTATAAGGCCATTACCATTTAAAAACCAATAAAGACTTTTTATTTAAATCAAAGAAAATACTAAATTAAAAAAAAAAACACTTCAAATATTTGTTCATAAAATATAAGTAACGTAGGAAAATAAAATCATAAATAAAATATTAGATAAAATACTAGTTCGGGCCCTATTTGAAATAAAAAAAAAAAATCTAAAAGTAAACATGAGTTCTAAAATAAATTTCTGATAATTAAATACTGAAAACCTAAATGCGGAAGAAAACATAAACTGTCCCCTATGGTGAACCACGTCACTTCTTTTCATTCGCCAACTTGTCTTTACCTCTACCTTTGCCTGAAAATTTAAACATAGAAAAGAGTGAATATAAATTTATACTCAGTAAAGGACCCACTATTAGTCCCACTACGTGTATGTTAACTTCCCATTAGACCCACTATTACCTCTACCTTTGTGGGGTGATACTTGGTTCAGGGACAATAGTAAAAGGCATAGGGGTTTGTAAAAATGTTGAACGGATGATTGGAGATTGGAGGGTGATAGATAACTTCTTACCTTTGGAGTTAGAGGGGTAGATATGATTCTCGAGATGCAATGGCTCCATTCATTAAGGGTTATGGAAGTGGATTGAAGGACCATGATATGATAAGGTAATCCTAATCCTCAATTCAAAGATCCAAGCTATTTTCGTAGCAAGAACCCTTGAAATAAGAAAATAACACATCCTTATACTAATTTAGATCAACCAAGAAGAAGTTAGGATTATATTACCTCTTAGATCAAAGATCTAGAAGCAAGATTTGGAAATCTTGTTCTAAATTGAGTCACTTCACAATCAAACTTGATCAAGGTTAGATTGAATGACTCGCATGCATTTTATCACAAGAATTGCAAATTAAGGGAAAACTCTCAAAATGAGATTTCAAATTGCATTCATCAAAAGGTCTTGTTCTTACAATTGGAAGGATAAAGACATTATATAGCTTTGTAGAGATTCTATCCTAATTAATGAAAATACCAAAAAGTCAAAATAAAACATTAATCAAGCTTTCTATCTTCCAACTACCATCTTTAATACTAACCTATTAATGATAGTTACCCTAACTTGTCAAATTAAAACATATGGAAATGCAATAAAAATGTAATATTTTTTGAATTGGTATAGAAAACATCTCCAGACCTCCTCACACCTTCCTTCTGAGTCACTAGGGGCAAGTTATGCTAAAATGGTGGAGCTTAATGGATCCTCTATTTAGGAGTTGGTAAAACAGAGCATTTCTTCCCTTGTTCTAACCTCAGATTTCAAAGTGTAACTCAACATTGGTTGATAATCTATTGATTATTTCAAGATTATTTGCGTTTGATGATTGGAAGGAAATTAGATAAATCTTGGAGGATCTTTTTCAATAAAAAATAATTGTTAATCCATTGTTTTCAAAGAACACTTTGATTAAATTGGGTGAAGGATCTCTAGAAGAGTTCGTTGGAAAAGAAGAGTTATGGCAAGTAATGGGTCATTTTCTTCTGAGGTTTGAGAAATGGAGCAAAAGTAAACATAGTAGACCATCGGTTATTGAAGGTTATGGAGGATGGATTAAAATAAAACTCCCTCTTGATTATTGGAGCAACCAGATGTTCAAAGTCATAGGAGATCATTTTGGAGGTCTAGAAACTATCACTACTGAAGCTCAATCTTTTTTTAAGCAAAGATTCAAGTAAAAATGAAAC
CGTTCAGGGAATTTTTTTTTACACCTTGGTGATTTTGAAACAATCAAGCCTCCATTAATCAAGGATCTTGTCAATGGAGGATTTTTCAAACCCATTTGACTTGTAGAGAGTAAACCAGGCTTTGAAAGATGGAGGAATCGTTGAAGATTCATTCTTCTCGGAGTGGGTGGAATATCTAAGATCAAGCCAGAAAATTCCTTCATGTACAAGAAACTCCCTTTCATCCTTCCTTTGCCTAGTCGGTACCCAAATGCCAAATTGAAAAAAAAGGTCCTTTCTATTAAATTAGAGTCCTTCAGAAGGGCCTCTTGCCTCTAGCATAAAAAAGGTCGACTGGGAATAAACTTTCCACTTCTTCAACTTTCTCGGCATATCCTTTTATTCATTCTATGCTAAATCTCACATCGGAAACCAGGCTTCCTTGTCAGAATGTTTACCACCAAGCGGGAAACCCAAGTAGTGGAAGAGCTTGGAAAGTTGAAATATTTCCTTGGAATCGAAGTTGCCTATTCTAAACAGGGCATTTTCTTATCACAATCGAAATACATAATTGACTTACTTTTTGAAACAGGAAGGCTTAGTAGCAAACCAATGGGAACACCAATTGATGAAAATCATAGATTATGTGCAGCCGATGAGAGTCCTTCAGTAAATAAGGAAACATAACAAAGGTTAGTTGGCAAACTAATATATCTCTTTCGTACAAGACCAGATTTTGCTTATGCTATTGGAGTGGTAAGTCAATTCATGCATAATCCAAAGGAAGTTCACCTTGAAGCAGTGTACTGGATACTTCTTGAAGAACCCCATTGGGAGAGGATTGTTACTTAAGAAAAACAAGAAACTCAATTTAGAAGTTTATACTAATGTGGATTATGCGGGTTTTATTGATGATAGAAAATCGACTTCTGGCTATTGTACTTTCTTGGGTGTGAATTTGGTTACATGGAGGAGTAAAAAGAAAAATGTAGTTGTGAGATCAAGTTCAGTAGCAGACTTTTGAGCAATGGCCTTGGGCATTTGTCTTGGATAAAAATTATCCTGGAAGATTTAAAGATTCCATGGGAAGGAACTATGAAATTTATTGTGATAACAAGTCTGCTATTAGTATCGCTCCTAAATCCAGTTCAACATGATAGAACCAAGTATGTCGAAGTGGATAAACACTTTATAAAAGAGGAGCTGGACAATGGTTTGATTTGTACTCCTTTTGTGTCGACAAATAATCAAATATCAGATATCCTCTCCAAAGGATTGAGCAACAAATTTTTTGAGCATTTACTTATCAAAATGGTAATCGAAAACATTCATTCACCAAATATGAATGGAAAATATCCTCTCCAAAGTAATGTCAAATTATAATTTACACCTGTCTTTTGTATATTTTATTTCCTTATTTGTAATTAGCTCTTAATTGGTAATTGTTTCCTTGTATATTTTTTCCTTCCAAATTTGTTTTTCCTTGTAATCTCTATTAAGATTAGTCTTGTACCCTATTCCAATTAATTAAAAGAAGAATCGTCTTCTTGAATATTGAGAAGTTCTTTTATTAAATTGAATCCAAGATGGTGAGAAACATTTGAGCAACTGCTTTTTGAATGACTTTTAGTGAGGAACGAGGTTCTAGACTGATTTTCTGGAGCTCAGTTACAATATATAATTCTTGGAAAACCCCTCTTTCATTTGGTATCGTAGACTTGGGAAGATCGGGAAGATTATGAATGGAACTATTCCCAAGGAAGTTTTAAGTAGGTTAGGCTCCATTTGTTGGGCCTTCAGCTATAAATTATAATTTTGTTGCTCTCCCTCTCTCATATGATTAATTTTGCTGGCTTGAAAGTGGAACACTTCTTAATTTTCAAGCCTTGGAAGGTTTAATCCCTAGGCATTGGTTGTTTCTTGTACATTCCTTTTTGGATTGATAGTTTAGTACTTGGTTTGTTATCCAAAAGTTGGCACGTTTTCCCCCTAATTTAAATACCAATTTTCTAATTTTTGTG
TGCAAAGTTAGTCTCCATGTATAATAAACTTAAGCAACTACTTTTTAGCTTGAATATCAATTCCGTGACATAACCTTTAGTTGAAGAATGATTTAATACTTGGTTTTTTATCCAAAAACTGGCATGCATTCCACCCTAATTTAAATACCAATTTTCTAATTTCTGTGTGGGAAGTTTGTCTTCATGTACAATAAACTCGAGCAATTATTTTTTTAGCTTGGATATCAATTTTGTGACATATGCTTACTTGAAGAATGTTGCTGAAATATATATTCAGGTTCCAATTTTTGATGGATCTGCGGGTAAATGGGTGGATGCGATAGAGGAGATTGGTCTTAAGTTGGCTATAGACCAGTGTGGCAACTTCTGTGAAAAAATGGCACCACATGTAAATCAGCCTGTCCATGTGTGGAGGAATGACTGTTTTCTTATTGCTTTTCCAGCCACAGAAGTTCGTATCACTTATGGAATCGACTTTCCTCAGGTACAGATATATTTTTAAAATGGTTATGAATTGAAAATGTGGCTCCTTTTTGTTTATTTATTTTTGTGACAATTTCTCTGTTCTGTTAGCATCATATGAGAAGAGTCGTTCTCCAGGAAACTTATTTCTGCACTATGTTTGATTCTTTGACATAGCCACTAAGAAGACTGAGATTAAGAAAGATCATTAAAAAATGATGAAGTTTAGAAATTATTACTATTTTGCACATGGTTTTCCATTTATTAAAAGTAAGTCAGAGATTGGGGTTACTGTTATTCTTTTTGCAAAATGTTTTTGGCCACAAGTTTACTGATGATAAATTAAAATAAATAGATGGAACTAGAATTTGATAATTACTTTCATGATGTCCTTTAAATCGCTTACTATCCATTGAATTGAGTGATTTATGAATATTTTCTTACGTGCAAGGTGCCTGAAATTGGCTGCCAATGGTTTTTTACTGCCCCTTTGGACAACAAGTTCTATGCTGAGCAAATAGCCCCATCAAGAACCTTTTGCATTTATGAGGAGGTTGGTGTAAGTTTGTTATTAAATAATTTTGTTTGGTGAATCTGTCGATATTTCATCGCAAGAGAAGTGGTTGTTAGCTAAATGATTGTTGGCCTGTAAGCTGTTGCAGACATGAATGTTTTACCATTTGTCGTCCTATTTGAGTCCACGTTGATGGGATCTATAGTCTCTTCCAATTGGTCACACTCACTCTCATTCTTGAGTATCTCTCTGAAAAATCTATGGATGTTTTACCATTTGTGAATTTTCTATATTATTGAATAAATCATAATAATTCAGAGTAGATAATGATCTTTTACATAGGGAATATATAGACCTTAAAGAAATCACTAAAAGAAAATTCATAAATGGAAAGTACCAAAAAGGAAAATACTAATGGAAAATACTAATAATCCATAGAACTCTAATTTTCCTTTAACACCATTCAATATTAAATGAGAGTTAGGAGATGTGTAATATTATACAAGAAACTTATTCAGACACATTTTATTGACCACTGGGACAATGCAATCCATTGTGTGAAATCCTTATTTGGATCAAAATGAGAAAATGGTGAAAAAAGTTCGAAGAAAATGTTGTTAGGTAATCGATGTAGGGAGAATTATATTATTCTTTGGGTAAACCCAGGTTAATTCTTTTTACTCCCTATTTGTGTAATATTTCCCTATAAATAGGGAGTCCACCCCTTCATGTATGGAACATCTTACATTTTAAAAATGATTATGAGTTAGATTCTTGGAGAATTTTCTTTTTTAGTCTGCAAGCTACACCAATTTGGTATCAAAGTTCCAGCTGGGCCAAAGCAGGTTGCTACCGGTGGAACCTCAAAGTCTAGGGACCAAATAGACACAAATTTCAAAGTTTAGGGACTGAGGTTGTAATTTAACCTCCTCTCCCCAAAAAAGAGGAAAAGGAATAGGAAAGAAAGAAAGAAGAAATGAAGAAAAAGGAAAGTCAGGATTATCTATTCTTGCTGAATTGCATT
AAGTATTGAAACAATATTTTCAACTTTGGAGATGCACTGGAGCAAGTTGCCTAGTCTCAAATACTTTCTGGGTCTTTAATACGGAGTATTTCTCTAGAAAATTTATTGCATACATACTGCATAAACAAGGGCGTTATTTGTTTCATTGCAGGTAGAGCAAATGCGTAATATGGGACTTATAAAAGGAGGCTCAATGGAAAATGCTTTAGTTTGCAGGTTTGTTGTTTTGTTCTTACATCTTCGATTGGGAAGTTGTTCTGTTTTTTGACTGTTGGTTTGTTGTTTTGTTGCCTGTCGTGTCATTTTGTTAATTTGTTTTGTCTCTTTTTCTTATAAAAAAAAATTTGTTTTGTCTCTTTGGTTTGCTTCCTGCACTACAATGATGGATATTTTGTATTAGGATGATGATGAGGTGTTGTAGGGGGTGTCAACATAGTTGGATAGGCGGTTGTACCTACTGATCTCAAATATTTTGTTTTTGGCTCTGGGGACTTAATGCTTCCATTTTTATTTTCTTTGTACCTTGAAAATCAGTCTGATTCAATATTATAAAAGTTAATTAAAGAAGAAAGAATAACGTGTATACGTGGAAAATCTTAGTTTAGAGAGAAAAAACCACGATATAAGTCTTTTGTATATTTTTCTTATTATAAAAGAATACAAGGGATGGAATATTTATAGGACATAATGGGCCGTAGATGATAATTATTTTAAAATCTCCCTCTCAAGTTAGGCCGTAGATGATAATTATACCCAACTTGCTAATACATGAGTTGAAATTCTGGTTGAGTAACACTTTAGTAAGATCATCAACAATTTGTTGATTTGATGGGGTGTACGGAATATAGATACTGTCATTGTCCAATTTCTCCTTGATGAAGTGTCTATTAATCTCCATATGTTTAGTGTTGTCATGTTGAACTGGGTTATTAACTATGCTAATACAACTTTGTTATCAAAAAATAGCTTCATAGGTAACTCACCATCCCAGGGAAGGTCAAATAGGACCTTTTGTAACCAAAAACTCTTCACAAATTTCCAAACACATAGCTCGGTATTTAGCTTCTTACTCCTAGCAACAACCCCCTGCTTCTTAGTCCTCCAAGTTACAAGGTTGTCCTATACAAAGGTACGGTATCTCGAAGTAGACTTTCTGTCATTCACTGATCCTGTTCAATCTGAGTCGGTATAGGCCTCAATGCATCTTTTGCTGGTCTTCCTAAACATCAAACCTCTTCTTAAAGTAGTTTTTAGATATCTCATAATACAGTTCACAACTTCGATGTGTTCTTCGTACGGAGATTGCACAGGAAATACTGGTCCAATAGGGATAGATAAATCAACTTTCTCACTAATCGTTGATACTTTTCTTTATGAATAAAACTCCCGTGCATCCAATCATACATGTTTCCTTCAATAGGTCAAACGTATATTTTCTTTACAAAACCGAAATTCTTTCCCTTGATCGTGCAACATCCATTTTAAGGTTGTACTTTAGATTCCCTAGGTCTTTAATCTCAAACTCGTCACCCATCTTTCTTTTTAACCTGGTAATCTCAGCAATATCATCGCTGATAAGACAATATCATCGATGTAAATAATCAGCACAATAATTTTCCCAGATTCAAATCTCTTCTTAAACAGTATATGGTCAAAGTTTCCCTAAGAGATTGTTTCAAGCCATAGAGAGATTTCCAAAGTTCACACACTTGATGGTTAAATTGGAACTCAAACTCTGGAGGAGAGCTCACATACACTTCTTCCAAATCACCATTTAGGGAGGGATTTTTTACATCAAGTTGGTAATGGCCACTATTTATTGACTGCAATGAGAGAAGAACCCTGATTGTGCTTAACTTTGCCAGAGGTGAAAACACCTTTGACTAGTCTATTCCATACGTCTAAGTGAACCCTTTTGCAACTAGCCTGGCCTTGTATCGATCTAACGCCCCATTTGATTTATACTTCAGAGTGGATACCCATTTGCATCCCACTGTCTG
GTGCCCTTTGGGAAGATTACATAGATCCCATGTTTTGTTTTTCTCAAGAGCTCCCATTTCTTTCATGATAGCTGCCTTCCACTTAGGACTTTCCATAACCATGATATGTTCTTTGGTATTGTTGCAGCATCTTATAGTGAAAGCCTTGAACTTAGGTGATAGATTGCTGTAAGATAAGTAGTTACACATAGAGTGTTTCATTCACGACCTCATGCCCTTTCTCAAAGCAATGGGTATGTCAAGAGAGGGATCGTAACTTCCTGGTTCATCTGAGTCATCCGAACTAGTCTTACCCAGTTGTGTTTCGGCTCTATTTTCAGAATCAGTGAGGAGAAAGACCCAAGTGGGACGGCTGGACGGGTATGATAAAGGCAATAAACCAATGTTTCCCCAAGGCAGTAGTGACCTGGGATATCCCCAAACATTGCTAGGGATACGGGTAAATGGTTGAGTAGGTTTGTAGAGTTGAGAAGGAAACGAGATCTGAGGTTGTTTTGCACGAATACAAACATCACAAGACAATGATGAAACATCCACTCTTGAAACAGGTGAGGAAAAGATATTTTATTTATTGAAGGATGGCCAAGACGAAAATGCCACAACAATGGAAAAATAAGAAGACAATAAACTAGTCCTTTGAGAAAGCATCATCATCAAGAAAGTAGAGTCCCCTACTATGCCGGGCAGTGCCAATCATCCTCCCCAAGCTCAAGTGATGAAATGAAACAACATCAAGTGAGAAGATAGCTTGACAATGTCTGTCCATGGTAATCTTACTAATAGATAGTAAATTGTAGGATTTTTGGGCACATACAATACATTCTGTAAGGTAAAACCATCAAACGGTGAAATGTGACCATTGTCTGCAATGGGAGCAAAGGATTCATCACCAATCCTAATTTTCTCATTTCAATCACACAGTAAGTACGATATAAAACTTTTAGATGAGCCAGTAAGATGATCAGTAGCACCTGAATTGAGGATCCAGGGCTTCCTACTATCGATGCTGATAGACTGAATGATAGGGAGTTTCCTGATTGTGCAATGGCTCCAAATGAGGATGTATTGGGATCACTGCGAATCCTTACAGGGATTGAGGTGGGTGGAGAGTTGACAGAATCAATCTTCAAGGCCTTACTAGAGTTGTGCTGGTCATTCGGCGAGCGTCTCTTACTGTTTGGTGGTCGACCATGTAACTTCCAACAGTTTTTCTATGTATGTCACGGTTTTTTACAATGTTCACAAACCAAGATAGGTTTCCCATTCTGCTTGTCATTGTTAGAGCTGGACCTTACACTGAAGGTAGTGGAATCAATAGACAGAACAGAGATGATGTTCGTTGCACTTATACGGTCTTCCTCCAAAAAATCTTTTAAACACACCTCCATTAAAAAGCAGATAGGTCTTTGACCTAATATTTGACCACGAACCAAGTCGAACTTGGACTTCAAACCAGCAAGGAAATCATATACACAGTCAACCTCATTCATTATGGAGTATTGGACACCATCTCAAGGGCACTCTCAGGTAATTTCTCTGCATAGATCATTTCCTACCGGGTTAGAGAAATTTGTCGGTTTGGTCGGGATGAATGAGTGATGGCTATGATGGGCAGAAGTTGGAATCACAGCTTCCGAAGGATATGTAATTGACAGAAGGCCTGCCGGTGAGAAGTGGGTTGCCGGAGAAGGGCAGGTCATCAACAGAAAAGGGAATCTGCTTGAATGGGGTTTGGTTGTCGGTGGAACTAGGCTGATCGTTGATAGAAACAACGCCCAAGATACTCATCAACGACCGCTTGAATGGAGGTGGCAGCAGTTTCGATAGACAGTCCAGCAATGGCAACAATTGTGGTATTAGGGTTTCCTACTTGTTCTTCTTCTTCCATGATGGCTAAGGTTTTTTTTTTCCACGGCTCTAATACCATATAAAATTAATTAAAAAAGAAGAAGAAAGAATAATGCTGGATATACTTGGAAAACCGTAGTTTAGGAAGAAAA
AACCATGATATGAGTCTTTTGTATATTTTTTTTATTATTAAAAGAATACAAGGGATGGAACATTTATAGGCCATAATGGGCCTTGACAAAAAAGGAAATAGAAAAGTGTGTAAAGATTACACAAATACCCTTAATACAATTATTTTCAACAATTATATCAATAAAGAGACTTGTTTCCATAAAGAAAAGATATCAATTAAATTCCTATTGAAAACTTCTTTCAAAAGCATTTGAAATTCTTTAGTTGGTTGATGATAGAAGGAATTCAGTTGAAGTTTTGCAATCAGCAACAGTTAGGTACCAATTTCCTATTGAAGATTCTTTTTCAAAAGCATTTGAAATTCATTAGTTGGTTGATGATAGAAGGAATTCAGCTGAAGTTTTTGATGTAGCCTACAGAGATAAAAGGAGAAATTTCTCCAAGAATCAAATCAAACTTTTTATTATTAATCAAAAGTTTTTTACAAGGGAAGAATCCTCTATTTATAGAGAATTAAATGTAAGATAGGATCCTAGAAGGATATAACTAATAGGAAACTAAAATGTAATTAATCAAGGATTAACCAAGATTAATTGAATTTACTAAATTATCCCAATTCTATTACATCAGTTTTGCAATCAACGACATTAAGAGTTTGGTACCTTACAATTGTTACCTACTGTCGTTCTTTAATTGAGGTTGGTGAACTACAGATGAAAGAAATCATGCCTCAGTCGCTCACCAATTAGGATCAAATTGTGTTTCACTAGTCTGTAATGATTCTCTCACATTGTATAGTTCTTTTTGGTACTTTCGAGGATTGGGGCTTGTGATCTTTTTCGGTTTCACTATGAAGCACAAGCATAGATACAAATACGAGATACGGGTACAATAGGATACATCGATACGTCAATTTTTAAAACACTAAGATAGTGATACGTTTGAAGATACGTTTTTTTTCTTTAAAAATCTAAGATATAGGTAAATTTAACATAAAGATGCTAAATGTAACTCACTAAATCCACTAAATTTTTTTAAAAAAATGTCAAATTAATGACAATGGTACGAAATAAAATACAAACACTAAAGTACAATATAAAAAACATCAAAATTATGGAAGTACAATAATCAATAAAATGCAATAGAAAAGAGATAAATTTATCAAAGAAGAGGTAGAAAAATATGAAGGTAAACGAAGAACTTCATTAAATTGTTTAAGATATTCAAAAGGGTTATATTTTACTCAACTTTGTGGCTCAAATATGATGATGACAATCATATAATTTTACGATTTGGTCTTTTATTTATTTTTGTTCTTTCTAGTTTTGGATGAGGTAATGAGCATTTTAAATAGGTTGTTTTGTTTTGAGTTGGGAAAACATTAAATGAGTTGCTTATCTTTTTTCTTTAGAAAGAGGTCAGTATTGAAAGCAATAAACACGAAACCAAAGCTTACGCGCAAACACGAGAACTGGGAGAAAAACCATGATGTTTTTAGTTTTATTATTTTTCTGATAAATAATACAACTACAAGGGAGAATAAATAGAATATACAAGTGAATAAAAAAGAACAAGATTTAGGAATTAGGAAAAATATTCCCATAATCTTTCCATAAATATTCTAACACTCCTCCTTAAGTTGGGACGTAAATATCAATGAGGCCCAACTTGCTAACACAAAAGTCAAATTTTGGTATGAGAAACCCCTTGGTAAGAACATTAGCAACTTGTTGACTCGAAGCGATGTATGGAATGCATATGCTCCCACTACTAAGTCTTTCTTTGATGAAATGCCGATCAATCTCAACATGTTTAGTTCTATCATATTGGATAGGGTTGTTAGCAATACTAATAGCGGCTTTATTATCACAAAAAAGCTTAAATGGTATCTCACATTCCTGATGAAGATCTGACAAGACTTTCTAGAGCCAAATTTTTTCACATTCCCAAACTCATAGCTTTGTATTAGGGTTAGACCTGTCATTCTCACTTACTGTATTATTAGTACAAGGTTCAGTGG
GGTTTTCCATACCTTGATCTCGAGGAGGTTTAGAGTCTTGGACTGGAGCCAGCGGCTAACTAGTAAGGGAAACGACTTCCTTTTTTAGATTCCTTTGTAATACGTTTTCCAGGGAACTTGATTTGTGGGTAGGACTATGAGATGAGGATCAATGTCATATATGACACTAGGAGTAGGTTCGATAAATTCAACGGTGTTGTTAGACTCTTCACTCACACTCTTCCCCTGAAGATGGCTAACGGAAAAGTAAGGTTGATAAAGGGGATATCCAAGAAACACACATGCCTGAGCCCGAGGGGTAAATTTGGTCTTATTAGGGCCAAAATTATGGACATAAGCGGTACACCCAAACACACCAAGAGGAACCTCAGAAACAAGATGAGTAGATGGGTAGGACTCCTTAAGACAATCTAAGGGAGTCTGAAGGTGGAGAATACGAGAAGACATTCTATTGGTTGAATGAGCTGCTGTAAAAATAGCATCTCCCCACAAGTATGAAGGAAGGGAAGTGGAAAGCATAAGGGAACAAGCTACTTCCAGAAGGTGACGGTTTTTTTGCTCGGCTACTCCATTTTGTTGAGGAGTGTAGGCGCACGAATTTTGATGAACAATCCCCTTGGAAGCAAGAAATTCAGAGGACCCTGTCAAATGATCTGTGGCACTAGAATCCAGAATCCATGGGTTCTTCCCATCAACACAAATAAGACCGAAGGAGGGAGGTATACCTAATTGGACAATGGCATCTAAAGTAGCAGGACTGAGATCGGTTTGGTTCCCGTGTGGATTAGCTTCAAAGACTGTTGATCAATATGGACCGAAGGATAGAGTCCTCCGCCTTCCAGTATCATTCCTATGGGTCGTTCGGTGGGGAGCGAAGTATTTCTCCTGTCAGAAAGCTAAATTTATGTCGTCCTTCAAGGACCATCTTCACTGACTGAGACCATGAGAAATAGTTATTGTCGTTCAACTTGTCTTTTGAAAGGTGATACACTGAAGTCTGAGTCACTGTATTAGTCACATAAAGAGAGGATAAATTAGGAAACGAGTTTATTGGATTCTCAAAATACAACAGTACAGAAATATTGGATGTCGTCCCTAAGGAGCCTCAAGTGCTGCGATCTGTTGTCGAAGCTCTTCCAGTTGCTGTTGAGTTATTCTCGAGGAAGCTTGCACGTTAGGGTTGGAATCTGTCGAAGATTCACCAACTCAAATGTTGAGTGAATTTGAGGATTCTTAACATCGGGATGATCACTAGGGTTAGTGGGTGGCATGACATACAGATTTGACGGTAGAAGCGGTGGTGGCCAGTCGGAATTACGTGGCAGAACATAAATCGGGGCCTGGGGAGCAATATAGGCTGCAAATGAAGAAAATGGTTGGACAGATGGGATGGTCGGCGCTGTCTAAGGAGGAAAAACCAGCACGTGGAGGTCCGATGGTGGCGCGTGAGGCTTGTTCGGCGGTAAAATATGCACTTCCTCGGGAATGAAAAAAAATCGATCCACTTTCAGTGCGGACAACTGCAGAACTGACGGGAATTTATTTTTCGGGCGTTTTTTATAGGCGGCTGAACCATTCATCCATGGAAACACTCATTCGAGCATCGACGGCGGTGGCAACAGCGGTATTAAAACTGGTGGCTGTCCCTTCTATTTGATTTCTATTATTGGTTACATTTTCTAGGGTTTCTAGGGTGTGTTCATTGTCCCGCTGTGATACCATATTGAAAGGAGAGACCAAGAGCACACACCAATTTACGTGGAAACCGAGTATTGGGAGAAAACACAATTGTTGGTTATTATTTTCTAATGAATAATACAATAGGTACAAGGGAGAATAAATAGAATATACAAGGGAATAAAGAAGAAAAGGATTTAAGAAATAAGGAAAATATTCCCATAATCTTTCTAGGATTCTAACGAGGAAAATATTTAGAAAATAAGTAAAAGATTCGATCAGGTTCTATTTTGTATGGTTGGAGTCCATTCTTTCTTCTTTGT
TTCAAGTAAAGATCATTCTATGGTTCTTTACAAGTTGAAGTGATCACACCCCAAAAGATGCTTGTGTTTGAAGTCTTATCTTCATCAGTCAAAGTATCTTTTCGTTTTAAGCAAGGATTTTTGCTCTCATTCCTACATGATTTTGGAGTTACATTAGTGAGCCTGTTTTATTCAGACAAGTTTCATGGTTCATTTTCTCTCATTAGGCTTTTTGTTGACAGATTTCCTCTTTTCTTAGAAGTTTTTGGATCAAAATTTATGAAGTTTGGAGGATTGTCGATTTTCATGCTCTCTCTATATTAAGTGTTCGACAGCAGTAAACTAGAGGTTTTGTTAAGTCATTGTTAGCATCTTTGGGAGTTTTTTATGGGTCTTGCATGGAAATTGTTGGAATTCTTTCCTTTTTTCTAGATATTTTCCTTGTTAGAATCCTAGAATAATTATGGAAAGATTATGGGAATGTTTTCCTTATTTTCCTAAATCTTTTCCTTTTTTATTCCATTGTATACTCTATTTATTCTCCCTTGTACCTATTGTTTTATTCAATAAAAAATAATAACAGCAAACAATCGTGGTTTTTCTCCCGGTACTTGAGTTTCCACGTAATTGGTGTGTATTCTTTGTCTCTCCTTTCAATAGAAATAAATCTATGTTTCTTCTTTGGCCAGATAATTATTATTTCTGCTTGATTAAAGGCCTCCTCTTTTGGTGCTCTTCTCAAGCCTTCTGACCTTTTCAGGATACATATTGAATTTAGGATACCTTGATTCTTTTTTTTTTTTGCTTTCTCTGGTAGTTTGTATCCATTGGGCAATAGTTAAGTTTCATTTTTTGTTCAAGAAAATAGTCATTTCATTAAGAACATATTTTCTTGTCAACTGTTAAATGTGAGCGCTATGATGGAAAAGTATTAGAGAGTTGCTAAGTTGTTTGTGAAAAGACGACGGCAACTTTTACTAATTTGGTTTTTGTTTGAATTGGCTTGCCTTTCCGCTCCCAGATCGTTAGGAGAAGTTCCCCAGTCCTTGCCTCTTCCCTTCAGTTATTTTTTGAGTCTGAGTTCACACTTGTCCCAAGCCTTTGTTTCGAGTTTTGTACTCTACGCTGACTGTTTTGTTGATTGTTATGCTGTTTTTTTTTTTCTGTATGTTTTGATTTGTTGACTATGTTGTGTTGGTGTGCCTCCTGATCCTTTGATCTCTTTATTTTGCTCTCGTAGAATTTTCTTGTACTCTGAGTTTTACTATTATTAATAGAGGCTGGTTTCTTTGCCAAAAAAAGAAATTGACTGGTTCGGTTTGAATCGTGTATTAGTTTCATGTAAGTTACGTTTTAAGTGTAGTGTATTTTAGAAATATATATGACCTTACATGAACTTGATTTGTAATATAGATTGTAACACCGTGTACTTGAGTTCTAAGGAAATATACATGTGTGTTTATTTTAATAATAGAAAAATCTTTCTAAATTAAAAGAAATCGAAGTTAATCTGGTTCCCATTTTCCTATCAACTTCAGTGTCAGCAAAGGTTGGATCAATCCACCGTTGCGTTTCCATGATGAACCTTGTCGTCACAAGGTTTTAGATTTAATTGGCGATCTGTCGCTTTTTGCGCAGTTGGGCAGTCAAGGGATTCCGTTGGCACACTTAGTGGTTTACAAGGTAAGTTCTCTGTGATCTACTTTTCTAGCATGTGGTTATTTGATATAAGATCACAATTCCAATCAGATTATAAGAATTATTTGGAGCACAATCAATCATAATGGGTTGAACTAGTAGTAAAAATGGAGACATTGTCTCAATAACTAAGAGGTAATGAATTCAATCCATGGTAGCCACCTACTTAGAAATTAATTTCTTACGAGTTTTCTTGACACCTAAAAGTTGTAGGTTCAGGCAGGTTATCCGTGAGATTAGTTGAGGTATGCACAAGTTGGTCTAGACATCCATGGGTATAAAAAAGAATTATTTGGAGCACAATCTAAACTTGTTTTGTTGTT
TCTTAAGTAGATGTCTTTTATTTATTCCTAATGCATGTCGATGCCTGCAATTTAAGAGGAAGGTATTGATGATTATTTGAGTTGAGAGAGAGAGCAAGCCATTTATTAAGGGTGTAAATTGACTTTGGTTTGAAGGTATTCATGATTATCAAAGCCATGAATTACCATTACATTTACTAGTTTTGATTTTAAATTGATATAACCCCAAATATTAGGGGTGTAAATTGAAATTTTCTCCAAGTTATTATGTGTTCTATGCTCTTTTGGTTCATCATCCCTTTTGAAAGAAGTTATACTCTGGCTGTCTTGATTGGGTGTGTATGCTTGGTAAGCTCAAGAAAGAAGAAATTTTGGTTTGTTAATTAATTCTTTGAAGGGTTTCTTTTCTCTTCTTCTTTTCTTAAGGATATAGATAAGATTGCTGGATTTATCTTGATGCAGGGTGGTCATGCTATGCACGCCAATTTTCTTCGCCGCTTATTGGAAAGGATTTAAGTGGACAAGCAGCTGTTTTCCCTTGCAAAAGTGTAAAAGTGTTCTGCAATTGAGGTAATGAAGTCTGACATTTCTCTTCCAGTTGCTTGACGGATGAACTTTTAAGATTTATTTCTGAAATAGAGGCAACATTCTGCTGTAATTTGGGAAAGAAGTTGTGTAAAGTACTAAAGTCTGTATGTACGATCAAAGGGTAGCCTTTCTCACAATAGCCTTTCACATGTTCAGAAAGGTTATATTGTTATCGTCCTACTCTCAATGGGTATCTCGGATCTCCTGACGTTACATTTTTCAAAACTCTGCCCTTTAGTCAACCATTGCCAAGTCCGAGTGAAGGGGAGAATGACGATCTTTTTATCTATGAGATTACCTCTCTCACATCATTTTCTCCTCCACCTTCCTCTGTCTCTTCCTTCCGACCTATCTGTTCTTCAAGTCTATTCCAAGCATCCTCTACAGTGACCTTGAAACCCATGTCCTCCAACACTAACTTCTTTGTCACATGATTCAGAACCAAGTGACAATCTTCCCATTGCCCTTCAAAAAGGTAAATGCACTTGTACTTACCCTATTTGTTCGTTTGTTTCGTATAACCAATTGTCTTCACCCACATATTCCTTTATTACCTCTTGATTCCACCTCTATCCCTAACTCAATCAATGAAGCTTTATCTCAACCTAGGTGGCGTAGTGCAATGATTGAGGAGATGACTGTCCTAGATGATAATGGTACTTGTGATCTACTTTCTCGTCCGGTCAGAAAGAAGGCTATTGGGTGTAAATAGGTGTTTGCTATGAAGGTCAATTTCGATGGAACAGTGGCTCGATTGAAAGCTCGTCTCATTGCCAAAGGTTATGTCCAAATCAATTGGATTGACTATTCTGATACATTCTTTCCTGTTGCCAAAATAACTTCCATTCAACTATTTCTTTACATGGCTGCTACTCACAGTTGGCCTTTGTATCAACTCGACGTTAAGAATGTTTTTCTTCTTGGTGATCTTCAAGAAGTGTAGAGTAACCACTTGAGTTCGTTGCTTAGGTGAGAGTGATAAGGTGCGTTGCCTTCAAAAGTTTGTATATGGACCGAAACAAAGTCCTCGTGCATGGTTTGGTAAGTTTAGTTAAGCACTCAAATGTTTTGGTATGAAGAAAAGTACATCTGATCGTTCTGTATTTTATCAACAATCTGACAATGGTTTTCCTTTAAGGTTTGTTGTCTATGTTGGTGATATCGTTATCACTGGAAATGATGTGTCAAGTATATCTCTCAAAACATTCCTCCAATGTCAATTTCATACTAAAGATTTGGAGCAATTAAAGTATTTCTTGGGTTTTTAAAGTAATGAGAAGCATGAAAGGTAATTTACCTGTCCTAATGAAAGTATGTGCTTGTTTTGTTTTTTGAGACAAGTAAACTAGGAACCAAACCATGTAGTACTCTCATGATACCAAATTTGCAACTTGCCAAGGAAGTTGAATTGTTTAAAGACGTGAGAGATATAG
AAGATTAGTTGGAAAGTTGAACTATTTAAGTGACTTGACCTGATATTGCCTATTATGTGAATATAGTGATTCAGTTTATATCATCCTCGACTGTGGATGATCGTAGCATAAATCCTATGTTATCTAAAAGTTGCTTCTGGTTGTGGGATCTTATTTAAAGCTAAAACATGTTGCTCTTGGATGCATTCATACAAGGGTTGAATGATTTTCAGATGTTGATTGGGCTAGATCTCGAGAAGATAGGAGATTGACCTCTAAATATTATTTTTGTTGGAGGAAACTTGGTGTCGTGGAAAAGTAAGAAATATATAATGTAGTTTCACGTTCGAGTGTCGAGTGAGAGTATAGAGCTATACCACAATCTATATGAAATGGTACGGACACACCCACTTTTAACTAAGATCGACGCCAGTGTTATTGTGCCAGCTAAAGTACTATGGTATATGATAATCATATTGTTCTTCACATTGCATCCAACTCAGTATTTCATAAACAAACTAAATATGTTGAAGTTGATTGTCATTTCATTCACGTGAAAATACAAGAGTTGATATCTACGAGATATGTGATGTGAAGACTAGAGAAAATTGGGAGATATCCTTACCAAAGCTGTAAATGGGGCAAGAATAAGCTATATGTACAACAAGTTGGGCATGATTGACATATTTGTTCCAGCATGAGAGGGAGTGCTATAATATGTGTTTATATTGTAATTATGTATAGTCATCCTTTGTTATAATTTTACATTCCCTAGATTAGGGTTTTCTTAGTTGCTTATATATATGTATCTTCAGCCTGATTAAATAATAAGATCATTATATTCTCCAAAAGGACAGAGTACAACAGTTATTTAATTTAATGTAGTGGTTGCAGTATCTCATATCTTGCTGATACATTTTATTACTGAATTCTCTGCAGGAGTCCACAATACTTTCTTAGTTCCTTCCTTCATCAGCTGACACCCAAATGCATGCTCATTGAGGAGGATATCCTGAGCAAGTGAAGAACGGTACAGCTCTAGATAGTGTTATTGATACAATTTATTTGACAAAGTCAATGCAAATAGTTAGAGAATGACAGTTAAATTGCCAAGGATTGGTGCCTCTTTTAGATGATAGATGTTAAAACTTCACTCCAGGTTATTAGTGTAGATGCAAATTGTATTGTAGGTTGTCACTAGGAAGGATTCATGGGAGTTTTCTCGAACTTTGTTAATCATCACCATAATCATACAAATTGACAATTGATGTTTTAAACATTTTTCCACTTAGTAACATCTTTTGATCTCTAAACTTCTATAGATCAGCTTCTCAACTTTTAGATATTTTGTTTGTTGTCCTTAAACTTTGCTTTAAAGTTTAGTTGATTATATCATCAATTCAACCCAAAAGCCTTATAGGTGAAGACAAATTTAATATACTATCTTACCAAATTAATATTATTGAAAACAAGAGAAGCCAAATAAAAGACACATATATTACAAATAATGCTTGCTGCAGTTGTAACAAAAAGAGCGTTGTCTTAAAAAGAAGGGAAATCTTTATCTTTTCCTACAAGAAGTCTTTGGTTTCTTTTAATTTAGGTCCATGAAAGCAAGCTTCT

mRNA sequence

AGAAAGAATAGAGAAGGATTATGCGGGAAAGTGAGAAAGGATTAGAGAGGGTGTTAGGGGAGTGAGAAGGATTAAGAAACTCCCCTGAATTTTAAATAAGGAACCCACTCCATTCTTGCTGGGCTCAAACAGCCCCAAAATGTTCTAAAATAAACGGACCCTATTTTCCATTTGATTTGAAAGGGGGCGAAACAATAACGTGGGGAACTCCCCTAGCTACCACGTCAACACATTTCTCCGCCCTTCTTCCTATTTCCCCTTCCTGTTTGACCGTTACCATCGCCATCGCCACCGGCCAATGCTCGTTCCTACTGCCTTCAACGCCCTCAAATCCTCAAGGTCAATATCTTGGTCGCCGACGGGCAGGCTTCAGCAAACCTTGGCGGGATGTCTGGAGCTGAGTGGGATATCTCTGCATTCTGGGAAAGTTGCGAAAGTGAAGTTGTGTCCCGAGTTTGCTGGCAGAGGAAGGTATTTTGATTTCAAATCCAACTTCATTCCTGCATCGATCGATTATGCCGAAGACTCGCCTCTTTGCACCACGCTTTCTAAGGATGGGTTTAAAATTCGAACAGTTGAGCATTTGCTTTCTGCCATGGAGGCCATGGGGGTCGACAATTGTAGGATCGTGATAACGAATGAAGATGCCAAAGACAGCGAAGTCGAGGTTCCAATTTTTGATGGATCTGCGGGTAAATGGGTGGATGCGATAGAGGAGATTGGTCTTAAGTTGGCTATAGACCAGTGTGGCAACTTCTGTGAAAAAATGGCACCACATGTAAATCAGCCTGTCCATGTGTGGAGGAATGACTGTTTTCTTATTGCTTTTCCAGCCACAGAAGTTCGTATCACTTATGGAATCGACTTTCCTCAGGTGCCTGAAATTGGCTGCCAATGGTTTTTTACTGCCCCTTTGGACAACAAGTTCTATGCTGAGCAAATAGCCCCATCAAGAACCTTTTGCATTTATGAGGAGGTAGAGCAAATGCGTAATATGGGACTTATAAAAGGAGGCTCAATGGAAAATGCTTTAGTTTGCAGTGTCAGCAAAGGTTGGATCAATCCACCGTTGCGTTTCCATGATGAACCTTGTCGTCACAAGGTTTTAGATTTAATTGGCGATCTGTCGCTTTTTGCGCAGTTGGGCAGTCAAGGGATTCCGTTGGCACACTTAGTGGTTTACAAGGATATAGATAAGATTGCTGGATTTATCTTGATGCAGGGTGGTCATGCTATGCACGCCAATTTTCTTCGCCGCTTATTGGAAAGGATTTAAGTGGACAAGCAGCTGTTTTCCCTTGCAAAAGTGTAAAAGTGTTCTGCAATTGAGGTAATGAAGTCTGACATTTCTCTTCCAGTTGCTTGACGGATGAACTTTTAAGATTTATTTCTGAAATAGAGGCAACATTCTGCTGTAATTTGGGAAAGAAGTTGTGTAAAGTACTAAAGTCTGTATGTACGATCAAAGGGTAGCCTTTCTCACAATAGCCTTTCACATGTTCAGAAAGGTTATATTGTTATCGTCCTACTCTCAATGGGTATCTCGGATCTCCTGACGTTACATTTTTCAAAACTCTGCCCTTTAGTCAACCATTGCCAAGTCCGAGTGAAGGGGAGAATGACGATCTTTTTATCTATGAGATTACCTCTCTCACATCATTTTCTCCTCCACCTTCCTCTGTCTCTTCCTTCCGACCTATCTGTTCTTCAAGTCTATTCCAAGCATCCTCTACAGTGACCTTGAAACCCATGTCCTCCAACACTAACTTCTTTGTCACATGATTCAGAACCAAGTGACAATCTTCCCATTGCCCTTCAAAAAGGTAAATGCACTTGTACTTACCCTATTTGTTCGTTTGTTTCGTATAACCAATTGTCTTCACCCACATATTCCTTTATTACCTCTTGATTCCACCTCTATCCCTAACTCAATCAATGAAGCTTTATCTCAACCTAGGTGGCGTAGTGCAATGATTGAGGAGATGACTGTCCTAGATGAT
AATGGTACTTGTGATCTACTTTCTCGTCCGGTCAGAAAGAAGGCTATTGGGTGTAAATAGGTGTTTGCTATGAAGGTCAATTTCGATGGAACAGTGGCTCGATTGAAAGCTCGTCTCATTGCCAAAGGTTATGTCCAAATCAATTGGATTGACTATTCTGATACATTCTTTCCTGTTGCCAAAATAACTTCCATTCAACTATTTCTTTACATGGCTGCTACTCACAGTTGGCCTTTGTATCAACTCGACGTTAAGAATGTTTTTCTTCTTGGTGATCTTCAAGAAGTGTAGAGTAACCACTTGAGTTCGTTGCTTAGGTGAGAGTGATAAGGTGCGTTGCCTTCAAAAGTTTGTATATGGACCGAAACAAAGTCCTCGTGCATGGTTTGGTAAGTTTAGTTAAGCACTCAAATGTTTTGGTATGAAGAAAAGTACATCTGATCGTTCTGTATTTTATCAACAATCTGACAATGGTTTTCCTTTAAGGTTTGTTGTCTATGTTGGTGATATCGTTATCACTGGAAATGATGTGTCAAGTATATCTCTCAAAACATTCCTCCAATGTCAATTTCATACTAAAGATTTGGAGCAATTAAAGTATTTCTTGGGTTTTTAAAGTAATGAGAAGCATGAAAGGTAATTTACCTGTCCTAATGAAAGTATGTGCTTGTTTTGTTTTTTGAGACAAGTAAACTAGGAACCAAACCATGTAGTACTCTCATGATACCAAATTTGCAACTTGCCAAGGAAGTTGAATTGTTTAAAGACGTGAGAGATATAGAAGATTAGTTGGAAAGTTGAACTATTTAAGTGACTTGACCTGATATTGCCTATTATGTGAATATAGTGATTCAGTTTATATCATCCTCGACTGTGGATGATCGTAGCATAAATCCTATGTTATCTAAAAGTTGCTTCTGGTTGTGGGATCTTATTTAAAGCTAAAACATGTTGCTCTTGGATGCATTCATACAAGGGTTGAATGATTTTCAGATGTTGATTGGGCTAGATCTCGAGAAGATAGGAGATTGACCTCTAAATATTATTTTTGTTGGAGGAAACTTGGTGTCGTGGAAAAGTAAGAAATATATAATGTAGTTTCACGTTCGAGTGTCGAGTGAGAGTATAGAGCTATACCACAATCTATATGAAATGGTACGGACACACCCACTTTTAACTAAGATCGACGCCAGTGTTATTGTGCCAGCTAAAGTACTATGGTATATGATAATCATATTGTTCTTCACATTGCATCCAACTCAGTATTTCATAAACAAACTAAATATGTTGAAGTTGATTGTCATTTCATTCACGTGAAAATACAAGAGTTGATATCTACGAGATATGTGATGTGAAGACTAGAGAAAATTGGGAGATATCCTTACCAAAGCTGTAAATGGGGCAAGAATAAGCTATATGTACAACAAGTTGGGCATGATTGACATATTTGTTCCAGCATGAGAGGGAGTGCTATAATATGTGTTTATATTGTAATTATGTATAGTCATCCTTTGTTATAATTTTACATTCCCTAGATTAGGGTTTTCTTAGTTGCTTATATATATGTATCTTCAGCCTGATTAAATAATAAGATCATTATATTCTCCAAAAGGACAGAGTACAACAGTTATTTAATTTAATGTAGTGGTTGCAGTATCTCATATCTTGCTGATACATTTTATTACTGAATTCTCTGCAGGAGTCCACAATACTTTCTTAGTTCCTTCCTTCATCAGCTGACACCCAAATGCATGCTCATTGAGGAGGATATCCTGAGCAAGTGAAGAACGGTACAGCTCTAGATAGTGTTATTGATACAATTTATTTGACAAAGTCAATGCAAATAGTTAGAGAATGACAGTTAAATTGCCAAGGATTGGTGCCTCTTTTAGATGATAGATGTTAAAACTTCACTCCAGGTTATTAGTGTAGATGCAAATTGTATTGTAGGTTGTCACTAGGAAGGATTCATGGGAGTTTTCTCGAACTTTGTTAATCAT
CACCATAATCATACAAATTGACAATTGATGTTTTAAACATTTTTCCACTTAGTAACATCTTTTGATCTCTAAACTTCTATAGATCAGCTTCTCAACTTTTAGATATTTTGTTTGTTGTCCTTAAACTTTGCTTTAAAGTTTAGTTGATTATATCATCAATTCAACCCAAAAGCCTTATAGGTGAAGACAAATTTAATATACTATCTTACCAAATTAATATTATTGAAAACAAGAGAAGCCAAATAAAAGACACATATATTACAAATAATGCTTGCTGCAGTTGTAACAAAAAGAGCGTTGTCTTAAAAAGAAGGGAAATCTTTATCTTTTCCTACAAGAAGTCTTTGGTTTCTTTTAATTTAGGTCCATGAAAGCAAGCTTCT

Coding sequence (CDS)

ATGCTCGTTCCTACTGCCTTCAACGCCCTCAAATCCTCAAGGTCAATATCTTGGTCGCCGACGGGCAGGCTTCAGCAAACCTTGGCGGGATGTCTGGAGCTGAGTGGGATATCTCTGCATTCTGGGAAAGTTGCGAAAGTGAAGTTGTGTCCCGAGTTTGCTGGCAGAGGAAGGTATTTTGATTTCAAATCCAACTTCATTCCTGCATCGATCGATTATGCCGAAGACTCGCCTCTTTGCACCACGCTTTCTAAGGATGGGTTTAAAATTCGAACAGTTGAGCATTTGCTTTCTGCCATGGAGGCCATGGGGGTCGACAATTGTAGGATCGTGATAACGAATGAAGATGCCAAAGACAGCGAAGTCGAGGTTCCAATTTTTGATGGATCTGCGGGTAAATGGGTGGATGCGATAGAGGAGATTGGTCTTAAGTTGGCTATAGACCAGTGTGGCAACTTCTGTGAAAAAATGGCACCACATGTAAATCAGCCTGTCCATGTGTGGAGGAATGACTGTTTTCTTATTGCTTTTCCAGCCACAGAAGTTCGTATCACTTATGGAATCGACTTTCCTCAGGTGCCTGAAATTGGCTGCCAATGGTTTTTTACTGCCCCTTTGGACAACAAGTTCTATGCTGAGCAAATAGCCCCATCAAGAACCTTTTGCATTTATGAGGAGGTAGAGCAAATGCGTAATATGGGACTTATAAAAGGAGGCTCAATGGAAAATGCTTTAGTTTGCAGTGTCAGCAAAGGTTGGATCAATCCACCGTTGCGTTTCCATGATGAACCTTGTCGTCACAAGGTTTTAGATTTAATTGGCGATCTGTCGCTTTTTGCGCAGTTGGGCAGTCAAGGGATTCCGTTGGCACACTTAGTGGTTTACAAGGATATAGATAAGATTGCTGGATTTATCTTGATGCAGGGTGGTCATGCTATGCACGCCAATTTTCTTCGCCGCTTATTGGAAAGGATTTAA

Protein sequence

MLVPTAFNALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVAKVKLCPEFAGRGRYFDFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIVITNEDAKDSEVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPATEVRITYGIDFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVEQMRNMGLIKGGSMENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFAQLGSQGIPLAHLVVYKDIDKIAGFILMQGGHAMHANFLRRLLERI*
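
The protein sequence above corresponds to a codon-by-codon translation of the CDS. The sketch below illustrates this with the standard genetic code (NCBI translation table 1). Using that table for this nuclear plant gene is an assumption, and the names `CODON_TABLE` and `translate` are illustrative only.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each of the 64 codons to its one-letter amino acid ('*' marks a stop).
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence in frame 0, keeping '*' for stop codons."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First six and last four codons of the CDS listed above.
print(translate("ATGCTCGTTCCTACTGCC"))  # MLVPTA
print(translate("GAAAGGATTTAA"))        # ERI*
```

Passing the full CDS string to `translate` should give a sequence that can be compared directly against the protein record, including the terminal `*`.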
Homology
BLAST of CsGy5G011850 vs. ExPASy Swiss-Prot
Match: F4IAT8 (Probable UDP-3-O-acyl-N-acetylglucosamine deacetylase 1, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=LPXC1 PE=1 SV=2)

HSP 1 Score: 404.1 bits (1037), Expect = 1.5e-111
Identity = 190/315 (60.32%), Positives = 237/315 (75.24%), Query Frame = 0

Query: 7   FNALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVAKVKLCPEFAGRGRYFDFKSNF 66
           +++  SS ++S +P+GRLQQTLAG +E+ G SLHSGK + VKL PE AG GR+F+F+S F
Sbjct: 21  YSSAASSPTVSLNPSGRLQQTLAGSVEVKGKSLHSGKFSTVKLNPEIAGAGRFFEFRSRF 80

Query: 67  IPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIVITNEDAKDSEVEVPI 126
           IPASI++A++SPLCTTL KD  KIRTVEHLLSA+EA GVDNCRI I +E + D EVEVPI
Sbjct: 81  IPASIEFAQESPLCTTLLKDELKIRTVEHLLSALEAKGVDNCRIQIESESSDDREVEVPI 140

Query: 127 FDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPATEVRITY 186
           FDGSA +WVDAI+ +G+  A +  G   EKM  HVN+PV+V +ND F+ AFPA E RIT 
Sbjct: 141 FDGSAKEWVDAIQGVGINAAQNHDGESVEKMVAHVNKPVYVCKNDTFVAAFPALETRITC 200

Query: 187 GIDFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVEQMRNMGLIKGGSMENALV 246
           GIDFPQVP IGCQWF   P+    +A+ IA SRTFC+YEEVE+MR  GLIKGGS++NA+V
Sbjct: 201 GIDFPQVPAIGCQWFSWRPIHESSFAKDIASSRTFCVYEEVERMREAGLIKGGSLDNAIV 260

Query: 247 CSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFAQLGSQGIPLAHLVVYKDIDKIAGFIL 306
           CS   GW+NPPLRF DE CRHK+LDLIGDLSL ++ G+ G+P+AH+V YK          
Sbjct: 261 CSAEHGWMNPPLRFDDEACRHKILDLIGDLSLVSRGGNGGLPVAHIVAYK---------- 320

Query: 307 MQGGHAMHANFLRRL 322
              GHA+H +  R L
Sbjct: 321 --AGHALHTDLARHL 323
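
The identity and positives percentages in the HSP summary are the match counts divided by the alignment length (315 columns, gaps included). A small sketch that reproduces the reported figures, where the function name `percent` is hypothetical:

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of alignment columns, to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / aligned_length, 2)

# Counts taken from the HSP summary line above.
print(percent(190, 315))  # 60.32 (identical residues)
print(percent(237, 315))  # 75.24 (identical plus conservative substitutions)
```

Because the denominator is the alignment length rather than the full query length (330 residues), these percentages describe only the aligned region, here query positions 7 to 322.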

BLAST of CsGy5G011850 vs. ExPASy Swiss-Prot
Match: P0DKB7 (Probable UDP-3-O-acyl-N-acetylglucosamine deacetylase 2, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=LPXC2 PE=1 SV=1)

HSP 1 Score: 404.1 bits (1037), Expect = 1.5e-111
Identity = 190/315 (60.32%), Positives = 237/315 (75.24%), Query Frame = 0

Query: 7   FNALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVAKVKLCPEFAGRGRYFDFKSNF 66
           +++  SS ++S +P+GRLQQTLAG +E+ G SLHSGK + VKL PE AG GR+F+F+S F
Sbjct: 21  YSSAASSPTVSLNPSGRLQQTLAGSVEVKGKSLHSGKFSTVKLNPEIAGAGRFFEFRSRF 80

Query: 67  IPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIVITNEDAKDSEVEVPI 126
           IPASI++A++SPLCTTL KD  KIRTVEHLLSA+EA GVDNCRI I +E + D EVEVPI
Sbjct: 81  IPASIEFAQESPLCTTLLKDELKIRTVEHLLSALEAKGVDNCRIQIESESSDDREVEVPI 140

Query: 127 FDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPATEVRITY 186
           FDGSA +WVDAI+ +G+  A +  G   EKM  HVN+PV+V +ND F+ AFPA E RIT 
Sbjct: 141 FDGSAKEWVDAIQGVGINAAQNHDGESVEKMVAHVNKPVYVCKNDTFVAAFPALETRITC 200

Query: 187 GIDFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVEQMRNMGLIKGGSMENALV 246
           GIDFPQVP IGCQWF   P+    +A+ IA SRTFC+YEEVE+MR  GLIKGGS++NA+V
Sbjct: 201 GIDFPQVPAIGCQWFSWRPIHESSFAKDIASSRTFCVYEEVERMREAGLIKGGSLDNAIV 260

Query: 247 CSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFAQLGSQGIPLAHLVVYKDIDKIAGFIL 306
           CS   GW+NPPLRF DE CRHK+LDLIGDLSL ++ G+ G+P+AH+V YK          
Sbjct: 261 CSAEHGWMNPPLRFDDEACRHKILDLIGDLSLVSRGGNGGLPVAHIVAYK---------- 320

Query: 307 MQGGHAMHANFLRRL 322
              GHA+H +  R L
Sbjct: 321 --AGHALHTDLARHL 323

BLAST of CsGy5G011850 vs. ExPASy Swiss-Prot
Match: P0DKB8 (Probable UDP-3-O-acyl-N-acetylglucosamine deacetylase 3, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=LPXC3 PE=1 SV=1)

HSP 1 Score: 404.1 bits (1037), Expect = 1.5e-111
Identity = 190/315 (60.32%), Positives = 237/315 (75.24%), Query Frame = 0

Query: 7   FNALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVAKVKLCPEFAGRGRYFDFKSNF 66
           +++  SS ++S +P+GRLQQTLAG +E+ G SLHSGK + VKL PE AG GR+F+F+S F
Sbjct: 21  YSSAASSPTVSLNPSGRLQQTLAGSVEVKGKSLHSGKFSTVKLNPEIAGAGRFFEFRSRF 80

Query: 67  IPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIVITNEDAKDSEVEVPI 126
           IPASI++A++SPLCTTL KD  KIRTVEHLLSA+EA GVDNCRI I +E + D EVEVPI
Sbjct: 81  IPASIEFAQESPLCTTLLKDELKIRTVEHLLSALEAKGVDNCRIQIESESSDDREVEVPI 140

Query: 127 FDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPATEVRITY 186
           FDGSA +WVDAI+ +G+  A +  G   EKM  HVN+PV+V +ND F+ AFPA E RIT 
Sbjct: 141 FDGSAKEWVDAIQGVGINAAQNHDGESVEKMVAHVNKPVYVCKNDTFVAAFPALETRITC 200

Query: 187 GIDFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVEQMRNMGLIKGGSMENALV 246
           GIDFPQVP IGCQWF   P+    +A+ IA SRTFC+YEEVE+MR  GLIKGGS++NA+V
Sbjct: 201 GIDFPQVPAIGCQWFSWRPIHESSFAKDIASSRTFCVYEEVERMREAGLIKGGSLDNAIV 260

Query: 247 CSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFAQLGSQGIPLAHLVVYKDIDKIAGFIL 306
           CS   GW+NPPLRF DE CRHK+LDLIGDLSL ++ G+ G+P+AH+V YK          
Sbjct: 261 CSAEHGWMNPPLRFDDEACRHKILDLIGDLSLVSRGGNGGLPVAHIVAYK---------- 320

Query: 307 MQGGHAMHANFLRRL 322
              GHA+H +  R L
Sbjct: 321 --AGHALHTDLARHL 323

BLAST of CsGy5G011850 vs. ExPASy Swiss-Prot
Match: P0DKB9 (Probable UDP-3-O-acyl-N-acetylglucosamine deacetylase 4, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=LPXC4 PE=1 SV=1)

HSP 1 Score: 404.1 bits (1037), Expect = 1.5e-111
Identity = 190/315 (60.32%), Positives = 237/315 (75.24%), Query Frame = 0

Query: 7   FNALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVAKVKLCPEFAGRGRYFDFKSNF 66
           +++  SS ++S +P+GRLQQTLAG +E+ G SLHSGK + VKL PE AG GR+F+F+S F
Sbjct: 21  YSSAASSPTVSLNPSGRLQQTLAGSVEVKGKSLHSGKFSTVKLNPEIAGAGRFFEFRSRF 80

Query: 67  IPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIVITNEDAKDSEVEVPI 126
           IPASI++A++SPLCTTL KD  KIRTVEHLLSA+EA GVDNCRI I +E + D EVEVPI
Sbjct: 81  IPASIEFAQESPLCTTLLKDELKIRTVEHLLSALEAKGVDNCRIQIESESSDDREVEVPI 140

Query: 127 FDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPATEVRITY 186
           FDGSA +WVDAI+ +G+  A +  G   EKM  HVN+PV+V +ND F+ AFPA E RIT 
Sbjct: 141 FDGSAKEWVDAIQGVGINAAQNHDGESVEKMVAHVNKPVYVCKNDTFVAAFPALETRITC 200

Query: 187 GIDFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVEQMRNMGLIKGGSMENALV 246
           GIDFPQVP IGCQWF   P+    +A+ IA SRTFC+YEEVE+MR  GLIKGGS++NA+V
Sbjct: 201 GIDFPQVPAIGCQWFSWRPIHESSFAKDIASSRTFCVYEEVERMREAGLIKGGSLDNAIV 260

Query: 247 CSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFAQLGSQGIPLAHLVVYKDIDKIAGFIL 306
           CS   GW+NPPLRF DE CRHK+LDLIGDLSL ++ G+ G+P+AH+V YK          
Sbjct: 261 CSAEHGWMNPPLRFDDEACRHKILDLIGDLSLVSRGGNGGLPVAHIVAYK---------- 320

Query: 307 MQGGHAMHANFLRRL 322
              GHA+H +  R L
Sbjct: 321 --AGHALHTDLARHL 323

BLAST of CsGy5G011850 vs. ExPASy Swiss-Prot
Match: F4IAW1 (Probable UDP-3-O-acyl-N-acetylglucosamine deacetylase 5, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=LPXC5 PE=1 SV=2)

HSP 1 Score: 404.1 bits (1037), Expect = 1.5e-111
Identity = 190/315 (60.32%), Positives = 237/315 (75.24%), Query Frame = 0

Query: 7   FNALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVAKVKLCPEFAGRGRYFDFKSNF 66
           +++  SS ++S +P+GRLQQTLAG +E+ G SLHSGK + VKL PE AG GR+F+F+S F
Sbjct: 21  YSSAASSPTVSLNPSGRLQQTLAGSVEVKGKSLHSGKFSTVKLNPEIAGAGRFFEFRSRF 80

Query: 67  IPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIVITNEDAKDSEVEVPI 126
           IPASI++A++SPLCTTL KD  KIRTVEHLLSA+EA GVDNCRI I +E + D EVEVPI
Sbjct: 81  IPASIEFAQESPLCTTLLKDELKIRTVEHLLSALEAKGVDNCRIQIESESSDDREVEVPI 140

Query: 127 FDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPATEVRITY 186
           FDGSA +WVDAI+ +G+  A +  G   EKM  HVN+PV+V +ND F+ AFPA E RIT 
Sbjct: 141 FDGSAKEWVDAIQGVGINAAQNHDGESVEKMVAHVNKPVYVCKNDTFVAAFPALETRITC 200

Query: 187 GIDFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVEQMRNMGLIKGGSMENALV 246
           GIDFPQVP IGCQWF   P+    +A+ IA SRTFC+YEEVE+MR  GLIKGGS++NA+V
Sbjct: 201 GIDFPQVPAIGCQWFSWRPIHESSFAKDIASSRTFCVYEEVERMREAGLIKGGSLDNAIV 260

Query: 247 CSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFAQLGSQGIPLAHLVVYKDIDKIAGFIL 306
           CS   GW+NPPLRF DE CRHK+LDLIGDLSL ++ G+ G+P+AH+V YK          
Sbjct: 261 CSAEHGWMNPPLRFDDEACRHKILDLIGDLSLVSRGGNGGLPVAHIVAYK---------- 320

Query: 307 MQGGHAMHANFLRRL 322
              GHA+H +  R L
Sbjct: 321 --AGHALHTDLARHL 323

BLAST of CsGy5G011850 vs. NCBI nr
Match: XP_008452402.1 (PREDICTED: probable UDP-3-O-acyl-N-acetylglucosamine deacetylase 2 isoform X2 [Cucumis melo])

HSP 1 Score: 647 bits (1668), Expect = 5.65e-234
Identity = 309/325 (95.08%), Positives = 319/325 (98.15%), Query Frame = 0

Query: 1   MLVPTAFNALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVAKVKLCPEFAGRGRYF 60
           M++PTAF+ALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKV+KVKLCPEFAGRGRYF
Sbjct: 1   MIIPTAFHALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVSKVKLCPEFAGRGRYF 60

Query: 61  DFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIVITNEDAKDS 120
           DFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRI ITNEDAKDS
Sbjct: 61  DFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIEITNEDAKDS 120

Query: 121 EVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPAT 180
           EVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPAT
Sbjct: 121 EVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPAT 180

Query: 181 EVRITYGIDFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVEQMRNMGLIKGGS 240
            V ITYGI+FPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVEQMRNMGLIKGGS
Sbjct: 181 AVCITYGINFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVEQMRNMGLIKGGS 240

Query: 241 MENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFAQLGSQGIPLAHLVVYKDIDK 300
           +ENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFA+LGSQGIP+AHLVVYKDID+
Sbjct: 241 IENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFARLGSQGIPVAHLVVYKDIDQ 300

Query: 301 IAGFILMQGGHAMHANFLRRLLERI 325
           IAGFILMQGGHA HANF+R LLE I
Sbjct: 301 IAGFILMQGGHATHANFVRCLLENI 325

BLAST of CsGy5G011850 vs. NCBI nr
Match: XP_031741866.1 (probable UDP-3-O-acyl-N-acetylglucosamine deacetylase 2, mitochondrial isoform X2 [Cucumis sativus] >KAE8648248.1 hypothetical protein Csa_018507 [Cucumis sativus])

HSP 1 Score: 642 bits (1656), Expect = 2.43e-232
Identity = 313/325 (96.31%), Positives = 313/325 (96.31%), Query Frame = 0

Query: 1   MLVPTAFNALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVAKVKLCPEFAGRGRYF 60
           MLVPTAFNALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVAKVKLCPEFAGRGRYF
Sbjct: 1   MLVPTAFNALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVAKVKLCPEFAGRGRYF 60

Query: 61  DFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIVITNEDAKDS 120
           DFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIVITNEDAKDS
Sbjct: 61  DFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIVITNEDAKDS 120

Query: 121 EVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPAT 180
           EVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPAT
Sbjct: 121 EVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPAT 180

Query: 181 EVRITYGIDFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVEQMRNMGLIKGGS 240
           EVRITYGIDFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVEQMRNMGLIKGGS
Sbjct: 181 EVRITYGIDFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVEQMRNMGLIKGGS 240

Query: 241 MENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFAQLGSQGIPLAHLVVYKDIDK 300
           MENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFAQLGSQGIPLAHLVVYK    
Sbjct: 241 MENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFAQLGSQGIPLAHLVVYK---- 300

Query: 301 IAGFILMQGGHAMHANFLRRLLERI 325
                   GGHAMHANFLRRLLERI
Sbjct: 301 --------GGHAMHANFLRRLLERI 313

BLAST of CsGy5G011850 vs. NCBI nr
Match: XP_008452401.1 (PREDICTED: probable UDP-3-O-acyl-N-acetylglucosamine deacetylase 2 isoform X1 [Cucumis melo])

HSP 1 Score: 642 bits (1655), Expect = 5.84e-232
Identity = 309/327 (94.50%), Positives = 319/327 (97.55%), Query Frame = 0

Query: 1   MLVPTAFNALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVAKVKLCPEFAGRGRYF 60
           M++PTAF+ALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKV+KVKLCPEFAGRGRYF
Sbjct: 1   MIIPTAFHALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVSKVKLCPEFAGRGRYF 60

Query: 61  DFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIVITNEDAKDS 120
           DFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRI ITNEDAKDS
Sbjct: 61  DFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIEITNEDAKDS 120

Query: 121 EVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPAT 180
           EVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPAT
Sbjct: 121 EVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPAT 180

Query: 181 EVRITYGIDFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEV--EQMRNMGLIKG 240
            V ITYGI+FPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEV  EQMRNMGLIKG
Sbjct: 181 AVCITYGINFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVGVEQMRNMGLIKG 240

Query: 241 GSMENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFAQLGSQGIPLAHLVVYKDI 300
           GS+ENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFA+LGSQGIP+AHLVVYKDI
Sbjct: 241 GSIENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFARLGSQGIPVAHLVVYKDI 300

Query: 301 DKIAGFILMQGGHAMHANFLRRLLERI 325
           D+IAGFILMQGGHA HANF+R LLE I
Sbjct: 301 DQIAGFILMQGGHATHANFVRCLLENI 327

BLAST of CsGy5G011850 vs. NCBI nr
Match: XP_031741865.1 (probable UDP-3-O-acyl-N-acetylglucosamine deacetylase 2, mitochondrial isoform X1 [Cucumis sativus])

HSP 1 Score: 637 bits (1643), Expect = 2.51e-230
Identity = 313/327 (95.72%), Positives = 313/327 (95.72%), Query Frame = 0

Query: 1   MLVPTAFNALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVAKVKLCPEFAGRGRYF 60
           MLVPTAFNALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVAKVKLCPEFAGRGRYF
Sbjct: 1   MLVPTAFNALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVAKVKLCPEFAGRGRYF 60

Query: 61  DFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIVITNEDAKDS 120
           DFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIVITNEDAKDS
Sbjct: 61  DFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIVITNEDAKDS 120

Query: 121 EVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPAT 180
           EVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPAT
Sbjct: 121 EVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPAT 180

Query: 181 EVRITYGIDFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEV--EQMRNMGLIKG 240
           EVRITYGIDFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEV  EQMRNMGLIKG
Sbjct: 181 EVRITYGIDFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVGVEQMRNMGLIKG 240

Query: 241 GSMENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFAQLGSQGIPLAHLVVYKDI 300
           GSMENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFAQLGSQGIPLAHLVVYK  
Sbjct: 241 GSMENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFAQLGSQGIPLAHLVVYK-- 300

Query: 301 DKIAGFILMQGGHAMHANFLRRLLERI 325
                     GGHAMHANFLRRLLERI
Sbjct: 301 ----------GGHAMHANFLRRLLERI 315

BLAST of CsGy5G011850 vs. NCBI nr
Match: XP_008452403.1 (PREDICTED: probable UDP-3-O-acyl-N-acetylglucosamine deacetylase 2 isoform X3 [Cucumis melo])

HSP 1 Score: 610 bits (1574), Expect = 8.29e-220
Identity = 298/327 (91.13%), Positives = 307/327 (93.88%), Query Frame = 0

Query: 1   MLVPTAFNALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVAKVKLCPEFAGRGRYF 60
           M++PTAF+ALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKV+KVKLCPEFAGRGRYF
Sbjct: 1   MIIPTAFHALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVSKVKLCPEFAGRGRYF 60

Query: 61  DFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIVITNEDAKDS 120
           DFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRI ITNEDAKDS
Sbjct: 61  DFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIEITNEDAKDS 120

Query: 121 EVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPAT 180
           EVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPAT
Sbjct: 121 EVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPAT 180

Query: 181 EVRITYGIDFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEV--EQMRNMGLIKG 240
            V ITYGI+FPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEV  EQMRNMGLIKG
Sbjct: 181 AVCITYGINFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVGVEQMRNMGLIKG 240

Query: 241 GSMENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFAQLGSQGIPLAHLVVYKDI 300
           GS+ENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFA+LGSQGIP+AHLVVYK  
Sbjct: 241 GSIENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFARLGSQGIPVAHLVVYK-- 300

Query: 301 DKIAGFILMQGGHAMHANFLRRLLERI 325
                     GGHA HANF+R LLE I
Sbjct: 301 ----------GGHATHANFVRCLLENI 315

BLAST of CsGy5G011850 vs. ExPASy TrEMBL
Match: A0A1S3BUW4 (UDP-3-O-acyl-N-acetylglucosamine deacetylase OS=Cucumis melo OX=3656 GN=LOC103493448 PE=3 SV=1)

HSP 1 Score: 647 bits (1668), Expect = 2.73e-234
Identity = 309/325 (95.08%), Positives = 319/325 (98.15%), Query Frame = 0

Query: 1   MLVPTAFNALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVAKVKLCPEFAGRGRYF 60
           M++PTAF+ALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKV+KVKLCPEFAGRGRYF
Sbjct: 1   MIIPTAFHALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVSKVKLCPEFAGRGRYF 60

Query: 61  DFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIVITNEDAKDS 120
           DFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRI ITNEDAKDS
Sbjct: 61  DFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIEITNEDAKDS 120

Query: 121 EVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPAT 180
           EVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPAT
Sbjct: 121 EVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPAT 180

Query: 181 EVRITYGIDFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVEQMRNMGLIKGGS 240
            V ITYGI+FPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVEQMRNMGLIKGGS
Sbjct: 181 AVCITYGINFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVEQMRNMGLIKGGS 240

Query: 241 MENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFAQLGSQGIPLAHLVVYKDIDK 300
           +ENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFA+LGSQGIP+AHLVVYKDID+
Sbjct: 241 IENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFARLGSQGIPVAHLVVYKDIDQ 300

Query: 301 IAGFILMQGGHAMHANFLRRLLERI 325
           IAGFILMQGGHA HANF+R LLE I
Sbjct: 301 IAGFILMQGGHATHANFVRCLLENI 325

BLAST of CsGy5G011850 vs. ExPASy TrEMBL
Match: A0A1S3BUJ5 (UDP-3-O-acyl-N-acetylglucosamine deacetylase OS=Cucumis melo OX=3656 GN=LOC103493448 PE=3 SV=1)

HSP 1 Score: 642 bits (1655), Expect = 2.83e-232
Identity = 309/327 (94.50%), Positives = 319/327 (97.55%), Query Frame = 0

Query: 1   MLVPTAFNALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVAKVKLCPEFAGRGRYF 60
           M++PTAF+ALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKV+KVKLCPEFAGRGRYF
Sbjct: 1   MIIPTAFHALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVSKVKLCPEFAGRGRYF 60

Query: 61  DFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIVITNEDAKDS 120
           DFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRI ITNEDAKDS
Sbjct: 61  DFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIEITNEDAKDS 120

Query: 121 EVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPAT 180
           EVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPAT
Sbjct: 121 EVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPAT 180

Query: 181 EVRITYGIDFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEV--EQMRNMGLIKG 240
            V ITYGI+FPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEV  EQMRNMGLIKG
Sbjct: 181 AVCITYGINFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVGVEQMRNMGLIKG 240

Query: 241 GSMENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFAQLGSQGIPLAHLVVYKDI 300
           GS+ENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFA+LGSQGIP+AHLVVYKDI
Sbjct: 241 GSIENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFARLGSQGIPVAHLVVYKDI 300

Query: 301 DKIAGFILMQGGHAMHANFLRRLLERI 325
           D+IAGFILMQGGHA HANF+R LLE I
Sbjct: 301 DQIAGFILMQGGHATHANFVRCLLENI 327

BLAST of CsGy5G011850 vs. ExPASy TrEMBL
Match: A0A1S3BT52 (UDP-3-O-acyl-N-acetylglucosamine deacetylase OS=Cucumis melo OX=3656 GN=LOC103493448 PE=3 SV=1)

HSP 1 Score: 610 bits (1574), Expect = 4.01e-220
Identity = 298/327 (91.13%), Positives = 307/327 (93.88%), Query Frame = 0

Query: 1   MLVPTAFNALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVAKVKLCPEFAGRGRYF 60
           M++PTAF+ALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKV+KVKLCPEFAGRGRYF
Sbjct: 1   MIIPTAFHALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVSKVKLCPEFAGRGRYF 60

Query: 61  DFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIVITNEDAKDS 120
           DFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRI ITNEDAKDS
Sbjct: 61  DFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIEITNEDAKDS 120

Query: 121 EVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPAT 180
           EVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPAT
Sbjct: 121 EVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPAT 180

Query: 181 EVRITYGIDFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEV--EQMRNMGLIKG 240
            V ITYGI+FPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEV  EQMRNMGLIKG
Sbjct: 181 AVCITYGINFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVGVEQMRNMGLIKG 240

Query: 241 GSMENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFAQLGSQGIPLAHLVVYKDI 300
           GS+ENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFA+LGSQGIP+AHLVVYK  
Sbjct: 241 GSIENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFARLGSQGIPVAHLVVYK-- 300

Query: 301 DKIAGFILMQGGHAMHANFLRRLLERI 325
                     GGHA HANF+R LLE I
Sbjct: 301 ----------GGHATHANFVRCLLENI 315

BLAST of CsGy5G011850 vs. ExPASy TrEMBL
Match: A0A6J1FQ92 (UDP-3-O-acyl-N-acetylglucosamine deacetylase OS=Cucurbita moschata OX=3662 GN=LOC111447257 PE=3 SV=1)

HSP 1 Score: 580 bits (1494), Expect = 5.80e-208
Identity = 279/325 (85.85%), Positives = 296/325 (91.08%), Query Frame = 0

Query: 1   MLVPTAFNALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVAKVKLCPEFAGRGRYF 60
           ML+PTA NALKSSRSISWSPTGR QQTLAGCLELSGISLHSGKV+KVKLCPEFAGRGRYF
Sbjct: 1   MLIPTALNALKSSRSISWSPTGRFQQTLAGCLELSGISLHSGKVSKVKLCPEFAGRGRYF 60

Query: 61  DFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIVITNEDAKDS 120
           DFKSNFIPASIDYAE+S LCTTLSKDGFKIRTVEHLLSA+EAMGVDNCRI I NEDAKDS
Sbjct: 61  DFKSNFIPASIDYAEESSLCTTLSKDGFKIRTVEHLLSALEAMGVDNCRIEIANEDAKDS 120

Query: 121 EVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPAT 180
           EVEVPIFDGSA KWVDAIEE+GLKLAIDQ GN CEKMAP+VNQPV+ WRNDC+L+AFPA 
Sbjct: 121 EVEVPIFDGSACKWVDAIEEVGLKLAIDQRGNCCEKMAPYVNQPVYAWRNDCYLVAFPAK 180

Query: 181 EVRITYGIDFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVEQMRNMGLIKGGS 240
            VRITYGIDFPQVP IGCQWF TAPLDN FYAEQIAPSRTFCIYEEVEQMRNMGLI+GGS
Sbjct: 181 AVRITYGIDFPQVPGIGCQWFSTAPLDNMFYAEQIAPSRTFCIYEEVEQMRNMGLIRGGS 240

Query: 241 MENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFAQLGSQGIPLAHLVVYKDIDK 300
           +ENALVCSVSKGWINPPLRFHDEPCRHKVLD IGD+SLFA+LGSQG+P+AHLVVYK    
Sbjct: 241 IENALVCSVSKGWINPPLRFHDEPCRHKVLDFIGDISLFARLGSQGLPVAHLVVYK---- 300

Query: 301 IAGFILMQGGHAMHANFLRRLLERI 325
                   GGH+MHANF+RRL E I
Sbjct: 301 --------GGHSMHANFVRRLSENI 313

BLAST of CsGy5G011850 vs. ExPASy TrEMBL
Match: A0A6J1JIU6 (UDP-3-O-acyl-N-acetylglucosamine deacetylase OS=Cucurbita maxima OX=3661 GN=LOC111487321 PE=3 SV=1)

HSP 1 Score: 579 bits (1492), Expect = 1.17e-207
Identity = 277/325 (85.23%), Positives = 297/325 (91.38%), Query Frame = 0

Query: 1   MLVPTAFNALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVAKVKLCPEFAGRGRYF 60
           ML PTA NALKSSRSISWSPTG+ QQTLAGCLELSGISLHSGKV+KVKLCPEFAGRGRYF
Sbjct: 1   MLTPTALNALKSSRSISWSPTGKFQQTLAGCLELSGISLHSGKVSKVKLCPEFAGRGRYF 60

Query: 61  DFKSNFIPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIVITNEDAKDS 120
           DFKS FIPASIDYAE+S LCTTLSKDGFKIRTVEHLLSA+EAMGVDNCRI I NEDAKDS
Sbjct: 61  DFKSKFIPASIDYAEESSLCTTLSKDGFKIRTVEHLLSALEAMGVDNCRIEIANEDAKDS 120

Query: 121 EVEVPIFDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPAT 180
           EVEVPIFDGSA KWVDAIEE+GLKLAIDQ GN CEKMAP+VNQPV+ WRNDC+L+AFPA+
Sbjct: 121 EVEVPIFDGSACKWVDAIEEVGLKLAIDQRGNCCEKMAPYVNQPVYAWRNDCYLVAFPAS 180

Query: 181 EVRITYGIDFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVEQMRNMGLIKGGS 240
           +VRITYGIDFPQVP IGCQWF TAPLDN FYAEQIAPSRTFCIYEEVEQMRNMGLI+GGS
Sbjct: 181 KVRITYGIDFPQVPSIGCQWFSTAPLDNTFYAEQIAPSRTFCIYEEVEQMRNMGLIRGGS 240

Query: 241 MENALVCSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFAQLGSQGIPLAHLVVYKDIDK 300
           +ENALVCSVSKGWINPPLRFHDEPCRHKVLD IGD+SLFA+LGSQG+P+AH+VVYK    
Sbjct: 241 IENALVCSVSKGWINPPLRFHDEPCRHKVLDFIGDISLFARLGSQGLPVAHMVVYK---- 300

Query: 301 IAGFILMQGGHAMHANFLRRLLERI 325
                   GGH+MHANF+RRLLE I
Sbjct: 301 --------GGHSMHANFVRRLLENI 313

BLAST of CsGy5G011850 vs. TAIR 10
Match: AT1G25210.1 (UDP-3-O-acyl N-acetylglycosamine deacetylase family protein )

HSP 1 Score: 404.1 bits (1037), Expect = 1.1e-112
Identity = 190/315 (60.32%), Positives = 237/315 (75.24%), Query Frame = 0

Query: 7   FNALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVAKVKLCPEFAGRGRYFDFKSNF 66
           +++  SS ++S +P+GRLQQTLAG +E+ G SLHSGK + VKL PE AG GR+F+F+S F
Sbjct: 66  YSSAASSPTVSLNPSGRLQQTLAGSVEVKGKSLHSGKFSTVKLNPEIAGAGRFFEFRSRF 125

Query: 67  IPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIVITNEDAKDSEVEVPI 126
           IPASI++A++SPLCTTL KD  KIRTVEHLLSA+EA GVDNCRI I +E + D EVEVPI
Sbjct: 126 IPASIEFAQESPLCTTLLKDELKIRTVEHLLSALEAKGVDNCRIQIESESSDDREVEVPI 185

Query: 127 FDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPATEVRITY 186
           FDGSA +WVDAI+ +G+  A +  G   EKM  HVN+PV+V +ND F+ AFPA E RIT 
Sbjct: 186 FDGSAKEWVDAIQGVGINAAQNHDGESVEKMVAHVNKPVYVCKNDTFVAAFPALETRITC 245

Query: 187 GIDFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVEQMRNMGLIKGGSMENALV 246
           GIDFPQVP IGCQWF   P+    +A+ IA SRTFC+YEEVE+MR  GLIKGGS++NA+V
Sbjct: 246 GIDFPQVPAIGCQWFSWRPIHESSFAKDIASSRTFCVYEEVERMREAGLIKGGSLDNAIV 305

Query: 247 CSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFAQLGSQGIPLAHLVVYKDIDKIAGFIL 306
           CS   GW+NPPLRF DE CRHK+LDLIGDLSL ++ G+ G+P+AH+V YK          
Sbjct: 306 CSAEHGWMNPPLRFDDEACRHKILDLIGDLSLVSRGGNGGLPVAHIVAYK---------- 365

Query: 307 MQGGHAMHANFLRRL 322
              GHA+H +  R L
Sbjct: 366 --AGHALHTDLARHL 368

BLAST of CsGy5G011850 vs. TAIR 10
Match: AT1G24793.1 (UDP-3-O-acyl N-acetylglycosamine deacetylase family protein )

HSP 1 Score: 404.1 bits (1037), Expect = 1.1e-112
Identity = 190/315 (60.32%), Positives = 237/315 (75.24%), Query Frame = 0

Query: 7   FNALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVAKVKLCPEFAGRGRYFDFKSNF 66
           +++  SS ++S +P+GRLQQTLAG +E+ G SLHSGK + VKL PE AG GR+F+F+S F
Sbjct: 21  YSSAASSPTVSLNPSGRLQQTLAGSVEVKGKSLHSGKFSTVKLNPEIAGAGRFFEFRSRF 80

Query: 67  IPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIVITNEDAKDSEVEVPI 126
           IPASI++A++SPLCTTL KD  KIRTVEHLLSA+EA GVDNCRI I +E + D EVEVPI
Sbjct: 81  IPASIEFAQESPLCTTLLKDELKIRTVEHLLSALEAKGVDNCRIQIESESSDDREVEVPI 140

Query: 127 FDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPATEVRITY 186
           FDGSA +WVDAI+ +G+  A +  G   EKM  HVN+PV+V +ND F+ AFPA E RIT 
Sbjct: 141 FDGSAKEWVDAIQGVGINAAQNHDGESVEKMVAHVNKPVYVCKNDTFVAAFPALETRITC 200

Query: 187 GIDFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVEQMRNMGLIKGGSMENALV 246
           GIDFPQVP IGCQWF   P+    +A+ IA SRTFC+YEEVE+MR  GLIKGGS++NA+V
Sbjct: 201 GIDFPQVPAIGCQWFSWRPIHESSFAKDIASSRTFCVYEEVERMREAGLIKGGSLDNAIV 260

Query: 247 CSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFAQLGSQGIPLAHLVVYKDIDKIAGFIL 306
           CS   GW+NPPLRF DE CRHK+LDLIGDLSL ++ G+ G+P+AH+V YK          
Sbjct: 261 CSAEHGWMNPPLRFDDEACRHKILDLIGDLSLVSRGGNGGLPVAHIVAYK---------- 320

Query: 307 MQGGHAMHANFLRRL 322
              GHA+H +  R L
Sbjct: 321 --AGHALHTDLARHL 323

BLAST of CsGy5G011850 vs. TAIR 10
Match: AT1G25054.1 (UDP-3-O-acyl N-acetylglycosamine deacetylase family protein )

HSP 1 Score: 404.1 bits (1037), Expect = 1.1e-112
Identity = 190/315 (60.32%), Positives = 237/315 (75.24%), Query Frame = 0

Query: 7   FNALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVAKVKLCPEFAGRGRYFDFKSNF 66
           +++  SS ++S +P+GRLQQTLAG +E+ G SLHSGK + VKL PE AG GR+F+F+S F
Sbjct: 64  YSSAASSPTVSLNPSGRLQQTLAGSVEVKGKSLHSGKFSTVKLNPEIAGAGRFFEFRSRF 123

Query: 67  IPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIVITNEDAKDSEVEVPI 126
           IPASI++A++SPLCTTL KD  KIRTVEHLLSA+EA GVDNCRI I +E + D EVEVPI
Sbjct: 124 IPASIEFAQESPLCTTLLKDELKIRTVEHLLSALEAKGVDNCRIQIESESSDDREVEVPI 183

Query: 127 FDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPATEVRITY 186
           FDGSA +WVDAI+ +G+  A +  G   EKM  HVN+PV+V +ND F+ AFPA E RIT 
Sbjct: 184 FDGSAKEWVDAIQGVGINAAQNHDGESVEKMVAHVNKPVYVCKNDTFVAAFPALETRITC 243

Query: 187 GIDFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVEQMRNMGLIKGGSMENALV 246
           GIDFPQVP IGCQWF   P+    +A+ IA SRTFC+YEEVE+MR  GLIKGGS++NA+V
Sbjct: 244 GIDFPQVPAIGCQWFSWRPIHESSFAKDIASSRTFCVYEEVERMREAGLIKGGSLDNAIV 303

Query: 247 CSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFAQLGSQGIPLAHLVVYKDIDKIAGFIL 306
           CS   GW+NPPLRF DE CRHK+LDLIGDLSL ++ G+ G+P+AH+V YK          
Sbjct: 304 CSAEHGWMNPPLRFDDEACRHKILDLIGDLSLVSRGGNGGLPVAHIVAYK---------- 363

Query: 307 MQGGHAMHANFLRRL 322
              GHA+H +  R L
Sbjct: 364 --AGHALHTDLARHL 366

BLAST of CsGy5G011850 vs. TAIR 10
Match: AT1G24880.1 (UDP-3-O-acyl N-acetylglycosamine deacetylase family protein )

HSP 1 Score: 404.1 bits (1037), Expect = 1.1e-112
Identity = 190/315 (60.32%), Positives = 237/315 (75.24%), Query Frame = 0

Query: 7   FNALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVAKVKLCPEFAGRGRYFDFKSNF 66
           +++  SS ++S +P+GRLQQTLAG +E+ G SLHSGK + VKL PE AG GR+F+F+S F
Sbjct: 64  YSSAASSPTVSLNPSGRLQQTLAGSVEVKGKSLHSGKFSTVKLNPEIAGAGRFFEFRSRF 123

Query: 67  IPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIVITNEDAKDSEVEVPI 126
           IPASI++A++SPLCTTL KD  KIRTVEHLLSA+EA GVDNCRI I +E + D EVEVPI
Sbjct: 124 IPASIEFAQESPLCTTLLKDELKIRTVEHLLSALEAKGVDNCRIQIESESSDDREVEVPI 183

Query: 127 FDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPATEVRITY 186
           FDGSA +WVDAI+ +G+  A +  G   EKM  HVN+PV+V +ND F+ AFPA E RIT 
Sbjct: 184 FDGSAKEWVDAIQGVGINAAQNHDGESVEKMVAHVNKPVYVCKNDTFVAAFPALETRITC 243

Query: 187 GIDFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVEQMRNMGLIKGGSMENALV 246
           GIDFPQVP IGCQWF   P+    +A+ IA SRTFC+YEEVE+MR  GLIKGGS++NA+V
Sbjct: 244 GIDFPQVPAIGCQWFSWRPIHESSFAKDIASSRTFCVYEEVERMREAGLIKGGSLDNAIV 303

Query: 247 CSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFAQLGSQGIPLAHLVVYKDIDKIAGFIL 306
           CS   GW+NPPLRF DE CRHK+LDLIGDLSL ++ G+ G+P+AH+V YK          
Sbjct: 304 CSAEHGWMNPPLRFDDEACRHKILDLIGDLSLVSRGGNGGLPVAHIVAYK---------- 363

Query: 307 MQGGHAMHANFLRRL 322
              GHA+H +  R L
Sbjct: 364 --AGHALHTDLARHL 366

BLAST of CsGy5G011850 vs. TAIR 10
Match: AT1G25054.2 (UDP-3-O-acyl N-acetylglycosamine deacetylase family protein )

HSP 1 Score: 404.1 bits (1037), Expect = 1.1e-112
Identity = 190/315 (60.32%), Positives = 237/315 (75.24%), Query Frame = 0

Query: 7   FNALKSSRSISWSPTGRLQQTLAGCLELSGISLHSGKVAKVKLCPEFAGRGRYFDFKSNF 66
           +++  SS ++S +P+GRLQQTLAG +E+ G SLHSGK + VKL PE AG GR+F+F+S F
Sbjct: 21  YSSAASSPTVSLNPSGRLQQTLAGSVEVKGKSLHSGKFSTVKLNPEIAGAGRFFEFRSRF 80

Query: 67  IPASIDYAEDSPLCTTLSKDGFKIRTVEHLLSAMEAMGVDNCRIVITNEDAKDSEVEVPI 126
           IPASI++A++SPLCTTL KD  KIRTVEHLLSA+EA GVDNCRI I +E + D EVEVPI
Sbjct: 81  IPASIEFAQESPLCTTLLKDELKIRTVEHLLSALEAKGVDNCRIQIESESSDDREVEVPI 140

Query: 127 FDGSAGKWVDAIEEIGLKLAIDQCGNFCEKMAPHVNQPVHVWRNDCFLIAFPATEVRITY 186
           FDGSA +WVDAI+ +G+  A +  G   EKM  HVN+PV+V +ND F+ AFPA E RIT 
Sbjct: 141 FDGSAKEWVDAIQGVGINAAQNHDGESVEKMVAHVNKPVYVCKNDTFVAAFPALETRITC 200

Query: 187 GIDFPQVPEIGCQWFFTAPLDNKFYAEQIAPSRTFCIYEEVEQMRNMGLIKGGSMENALV 246
           GIDFPQVP IGCQWF   P+    +A+ IA SRTFC+YEEVE+MR  GLIKGGS++NA+V
Sbjct: 201 GIDFPQVPAIGCQWFSWRPIHESSFAKDIASSRTFCVYEEVERMREAGLIKGGSLDNAIV 260

Query: 247 CSVSKGWINPPLRFHDEPCRHKVLDLIGDLSLFAQLGSQGIPLAHLVVYKDIDKIAGFIL 306
           CS   GW+NPPLRF DE CRHK+LDLIGDLSL ++ G+ G+P+AH+V YK          
Sbjct: 261 CSAEHGWMNPPLRFDDEACRHKILDLIGDLSLVSRGGNGGLPVAHIVAYK---------- 320

Query: 307 MQGGHAMHANFLRRL 322
              GHA+H +  R L
Sbjct: 321 --AGHALHTDLARHL 323
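
The Identity and Positives figures in each HSP header are a count of matching (or conservatively substituted) positions divided by the alignment length. The sketch below is a minimal illustration of that arithmetic, not part of the database's own tooling: the names `HSP_STATS`, `parse_hsp_stats` and `percent` are hypothetical, and rounding to two decimals is an assumption that happens to reproduce the values shown in this report.

```python
import re

# Matches the statistics line of an HSP header. The optional "i" tolerates
# the "Postives" misspelling that some report exports contain.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract the counts, alignment length and reported percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, length, identity_pct, positives, _, positives_pct = match.groups()
    return {
        "identical": int(identical),
        "positives": int(positives),
        "length": int(length),
        "identity_pct": float(identity_pct),
        "positives_pct": float(positives_pct),
    }


def percent(count: int, length: int) -> float:
    """Recompute a percentage, rounded to two decimals (an assumption)."""
    return round(100.0 * count / length, 2)


stats = parse_hsp_stats(
    "Identity = 190/315 (60.32%), Positives = 237/315 (75.24%), Query Frame = 0"
)
print(percent(stats["identical"], stats["length"]))
print(percent(stats["positives"], stats["length"]))
```

Recomputing the percentages from the counts is a cheap consistency check when these lines are copied between reports.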

The following BLAST results are available for this feature:

CsGy5G011850 vs. ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
F4IAT8 | 1.5e-111 | 60.32 | Probable UDP-3-O-acyl-N-acetylglucosamine deacetylase 1, mitochondrial OS=Arabid...
P0DKB7 | 1.5e-111 | 60.32 | Probable UDP-3-O-acyl-N-acetylglucosamine deacetylase 2, mitochondrial OS=Arabid...
P0DKB8 | 1.5e-111 | 60.32 | Probable UDP-3-O-acyl-N-acetylglucosamine deacetylase 3, mitochondrial OS=Arabid...
P0DKB9 | 1.5e-111 | 60.32 | Probable UDP-3-O-acyl-N-acetylglucosamine deacetylase 4, mitochondrial OS=Arabid...
F4IAW1 | 1.5e-111 | 60.32 | Probable UDP-3-O-acyl-N-acetylglucosamine deacetylase 5, mitochondrial OS=Arabid...

CsGy5G011850 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
XP_008452402.1 | 5.65e-234 | 95.08 | PREDICTED: probable UDP-3-O-acyl-N-acetylglucosamine deacetylase 2 isoform X2 [C...
XP_031741866.1 | 2.43e-232 | 96.31 | probable UDP-3-O-acyl-N-acetylglucosamine deacetylase 2, mitochondrial isoform X...
XP_008452401.1 | 5.84e-232 | 94.50 | PREDICTED: probable UDP-3-O-acyl-N-acetylglucosamine deacetylase 2 isoform X1 [C...
XP_031741865.1 | 2.51e-230 | 95.72 | probable UDP-3-O-acyl-N-acetylglucosamine deacetylase 2, mitochondrial isoform X...
XP_008452403.1 | 8.29e-220 | 91.13 | PREDICTED: probable UDP-3-O-acyl-N-acetylglucosamine deacetylase 2 isoform X3 [C...

CsGy5G011850 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A1S3BUW4 | 2.73e-234 | 95.08 | UDP-3-O-acyl-N-acetylglucosamine deacetylase OS=Cucumis melo OX=3656 GN=LOC10349...
A0A1S3BUJ5 | 2.83e-232 | 94.50 | UDP-3-O-acyl-N-acetylglucosamine deacetylase OS=Cucumis melo OX=3656 GN=LOC10349...
A0A1S3BT52 | 4.01e-220 | 91.13 | UDP-3-O-acyl-N-acetylglucosamine deacetylase OS=Cucumis melo OX=3656 GN=LOC10349...
A0A6J1FQ92 | 5.80e-208 | 85.85 | UDP-3-O-acyl-N-acetylglucosamine deacetylase OS=Cucurbita moschata OX=3662 GN=LO...
A0A6J1JIU6 | 1.17e-207 | 85.23 | UDP-3-O-acyl-N-acetylglucosamine deacetylase OS=Cucurbita maxima OX=3661 GN=LOC1...

CsGy5G011850 vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G25210.1 | 1.1e-112 | 60.32 | UDP-3-O-acyl N-acetylglycosamine deacetylase family protein
AT1G24793.1 | 1.1e-112 | 60.32 | UDP-3-O-acyl N-acetylglycosamine deacetylase family protein
AT1G25054.1 | 1.1e-112 | 60.32 | UDP-3-O-acyl N-acetylglycosamine deacetylase family protein
AT1G24880.1 | 1.1e-112 | 60.32 | UDP-3-O-acyl N-acetylglycosamine deacetylase family protein
AT1G25054.2 | 1.1e-112 | 60.32 | UDP-3-O-acyl N-acetylglycosamine deacetylase family protein

InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR015870 | UDP-3-O-acyl N-acetylglucosamine deacetylase, N-terminal | GENE3D | 3.30.230.20 | lpxc deacetylase, domain 1 | coord: 23..148; e-value: 3.2E-40; score: 138.8
IPR004463 | UDP-3-O-acyl N-acetylglucosamine deacetylase | TIGRFAM | TIGR00325 | TIGR00325 | coord: 25..296; e-value: 3.2E-62; score: 208.5
IPR004463 | UDP-3-O-acyl N-acetylglucosamine deacetylase | PFAM | PF03331 | LpxC | coord: 25..322; e-value: 1.6E-79; score: 267.3
IPR004463 | UDP-3-O-acyl N-acetylglucosamine deacetylase | PANTHER | PTHR33694 | UDP-3-O-ACYL-N-ACETYLGLUCOSAMINE DEACETYLASE 1, MITOCHONDRIAL-RELATED | coord: 1..323
IPR011334 | UDP-3-O-acyl N-acetylglucosamine deacetylase, C-terminal | GENE3D | 3.30.1700.10 | lpxc deacetylase, domain 2 | coord: 160..325; e-value: 8.3E-46; score: 157.3
IPR020568 | Ribosomal protein S5 domain 2-type fold | SUPERFAMILY | 54211 | Ribosomal protein S5 domain 2-like | coord: 24..143
IPR020568 | Ribosomal protein S5 domain 2-type fold | SUPERFAMILY | 54211 | Ribosomal protein S5 domain 2-like | coord: 163..323
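
Each InterPro row gives the matched region as `coord: start..end`. As a rough sketch, the span of a match can be read off as follows. It assumes the coordinates are 1-based, inclusive residue positions on the protein, which is not stated on this page; `COORD` and `coord_span` are hypothetical names used only for illustration.

```python
import re

# Finds a "coord: start..end" field anywhere in a row of the InterPro table.
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def coord_span(text: str) -> tuple:
    """Return (start, end, length), treating both ends as inclusive (assumed)."""
    match = COORD.search(text)
    if match is None:
        raise ValueError(f"no coord field in: {text!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return start, end, end - start + 1


# N-terminal and C-terminal GENE3D matches from the table above.
n_term = coord_span("lpxc deacetylase, domain 1 coord: 23..148")
c_term = coord_span("lpxc deacetylase, domain 2 coord: 160..325")
print(n_term, c_term)
```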

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CsGy5G011850.2 | CsGy5G011850.2 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0009245 | lipid A biosynthetic process
biological_process | GO:2001289 | lipid X metabolic process
cellular_component | GO:0005739 | mitochondrion
molecular_function | GO:0046872 | metal ion binding
molecular_function | GO:0008759 | UDP-3-O-[3-hydroxymyristoyl] N-acetylglucosamine deacetylase activity
molecular_function | GO:0103117 | UDP-3-O-acyl-N-acetylglucosamine deacetylase activity