HG10016573 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10016573
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: O-fucosyltransferase family protein
Location: Chr03: 6084107 .. 6105771 (+)
RNA-Seq Expression: HG10016573
Synteny: HG10016573
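
The location field above can be read programmatically. The following is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(text):
    """Split a location string like 'Chr03: 6084107 .. 6105771 (+)'
    into (chromosome, start, end, strand)."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    chrom, start, end, strand = parse_location("Chr03: 6084107 .. 6105771 (+)")
    print(chrom, strand, span_length(start, end))
```

Under that assumption the span covers 21665 bases on the plus strand of Chr03, introns included.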
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAACATCAACCTAAATTGCAAGTACCATGTTTTGAGGTAAGTTTGAAATACCTTATAATTTAATCACTTCTTTTTTCTTTTTCTTTTAATAAAAAAGTGCTTTTACTTTTTATTTTTTTAAGGAAAAAAATTACTTTTAATCGTCGTAATTTTAGAGTGTTGATCATACTGCATTCAAAAGGTTGTTTTGCTGTTTACAAATATATATACTATTGTTTCCTTGTTATGTGCTTCAGAGGATTTTTTTCATCAATATTTAAAGTGTATTTGGATTAATTCTTTAAGTATTTAATTTTAAAAATAAATTATTTTGGAAGAGATTGAAGTGTTTAACAATCATTCAAGATAATTTTTTAAATGTATTTTAACTAGATTCTATCAAAAGAGTTTAACTTTAAAATAAATTTTTTGAATTTTTTTTTTTCAAATCAATCCAAACAACTCATTAATGACTTTAAAGTCATGTTTTGTATTGTAATTTCTCGGGTATTTTTAAAAGATAATTTTAAATTTAATTGGATTAAAATAAAATAAATTTTGTTCGGCTTATGAAAAAAAAATGTCTTTAACTTGTCATTTATTTAAAAAGTACTTGGTTGTTCCCATTTTTTCAAAGTTTTAATATTACTAGAAAAAAGAGTTGATGTGGTGATGACGGTAATATTTATCTTGATCAAGTATGAAAAAAGGATGTTCTATCGTATCAAATTTTTATTAAATATTTTAATGAATTTTTACTTCAGGAATAGTTTTCAAATGTGGTTTGAAATTAAGGTCAGACTAATGTTGCAACTTTTGAAATAAATTTGTTATTTTCAAATAAAAGGCAAATTTCATAAGTTTTTTCCCTTTTTCCGTTCGTTTTTTTAGTAGGCAAAGTTCTCGCGTCTTTTTTTTTTTTTTGGTAGAATATGACAATTCAAATAAAATTTATACATAGAAAAGTTTGAAGGAAATTAAATCAACAACATAAGTTTTACACTCTAATTCCCTAAAATCGTGGTAAGTTTGTAAATATAAATTAGTCTTTTATTTATTATAAATATAAATTTTCTAAACATTACGATTTAAGCATCTAGAGTCACTTGAAAAATTCTCTACATCAGTAGCAACTCTTCTAAGCGTAGAAAAAAGTGAAGCCATAGTCCCAAACACATATTTAAGAAGATATATGCACGTTAAACTAGAGAACAAAAAAATAATATATGCAAAACGACTATGATAAATAATCATTAACAAAAACTAAATACCAAAACAAAGTATCGCAAAAAATTTCTAATGTACAAGTTTAATCCTTCGACTGTTATGTTTGTATTTAATATCAAGGACTTTAATGCTTATGTCTATTTGATCATTGGACTATAAAAATGACTAAGTACTAAATTTTCAACCATCTATATCTAACAGGCTTATAAAATTCATGAAGTGACTAATAGGTGTTGTGGGAATTACAAATAACAGAGGAAGATAGAGTAAATATTGAGAAACAGACACGTAAACAATTAGGTTGAGGCGCGGATATCTCGCTCTCTTTAAGGAGATTCAAGCCCTCTGCGGTGTATATTGAAATTCTTTAATATCAACCACCGGTGCAACAGATATTCTCTGACCTGTCCCCTCCAGGATACAACAACCCAACTGAAACGTTGTGTAGCTCAAACCACTCTGCACTGAAAGGTTGAGCTCAAAGCTCCACAACTAAAAACACCCTTCTTTTGGCCAATTGCTCTTAAAAGAGCTAAGGATATAAGGATAGAAATGAGAGAAAGCAAGAGAGAAAGAAGTATTTTTTTAGTTGGTGTTTTTTTTTTTTTTTGGAGCAAAGCAAAGATCTATTTATAGTAAAATTAGATCTAAACGTTATTATGTAATTTATTCAACCAATACGTTTTTTTCAAATGTGAAAACTCTCTTATTAAATTAAAAAATGAAACTTCATAAACTTATTAAATATAAAAAAAAAAAAAAAAAATGTAACAACAATAGGTCTTCCAAGTT
TCAATTGTATATCCGTTTCAATTGTATATCCGACACATTCACCGTTTCTTTCAATTTCATATCTCATTTACCTATCATCAGACAGTTTCTAAAAAAAATTACAAATTAATTATATAGTGGAATATTTGTGTGGTGTTGCTAAGTAAGTTGTTGTACATTATCTGTAATGTATAATTATTATTTGGTATGGTCTTCAAAATTAAATGGCTACAATTTGTATTGGAAAATGTTGGGCTAGTGTTTTGTGGCAAATTTAAAATTTGTTGCGAATTTATTAAAGTATGGAACTAAAATTGAACTAATTTCAAACTAAAATAATTAAATATTATTTTTATGTTTAAAAACAATTTAATAAGATGATAGAAATTAGATATCAGAGACTATATATATATATATATATCATCTCAGAGGCTTATTAAACATTTTAAAATTCTTATGACTTGCTAAATGCACACACACAAATATGAATGAGGTTTAAAATACATATATCAGTGATTTTTTAAAAATAATTATAAAAAAATAAAATCAAATCAAATGAACATGTTTTTTATCTATACTATATGTAGACAAAAATTTAAATATTTTTAAAATTTTATCAATAGAAATATCATGTGATTTTTTTTTTTTTTAACAGAGGTGAGGGATGAAGAGTATATATCGATAAAATTAGAAAAATTTTTACAAATAAAAAAAAGTCAAACTATTTACATAAAATAGTAAAAAAAATACTACGTTGATAGAAGTCTTCTATCATTTCTATCACTGATAGATTTGAATAGACTTCTATCAATGTCTATTACAACTATTTAAAAATTTTGTTATTTTATGTAAATAATTTATCTTATTTTTCTATTTTTAAAACTTTCCCATAAAATTATACTAACGTTGGCGTGAAATTTATTATCATGAGCAAAGACGACCAAATTTGAAATATTTTAACTTTAAATTAGCACTTTTATTTATCTATTTATTATTATTATTATTTTGTGGGAAATATAGGAACGTCAATACAGAACAGAGCAGAAGCGAAAGCCCTAAACACAGTCATGGAGGGGAAGGGAAGATGACAGAGAGGAAAGCAAAAGCTGAGACGAACAGTACAAAACCGTGCGTGTCGTTCTGTGAAGCGCGTCAGTCATGACCGGCGGGTGAAGTTAGGATAGGAGAAGAGGAGCCGAGTAAAGAAAGGGGGCGTGGGTATCCAAGCCTAAACAGTCCTACAGATTCTCAATTACGGACATTTCAGGAAGATATACGGCCGTAGTCATGACAGGGAATGGTCCTCCCTGACACACTCGCCTAGTACTATTTCCCGTAAACAATAACAATAAATGAATAAATTTTGTAAATATGAATTTGTAGCCATGTCCCTCTACTCCCGTCAATTTATTATTGTGCCCTTAACAATTGCCTTTTTGTTTTTAATTTTGCCCATTTAAAATAGCTAATATAAATTGGGAATAGTTGCAAATATAACAATTAGATTCAAAGTATTAGCATACAAAATTTTAAATATAAATTCTAAATTTTGTTATATTTGTAATTTTTAAAAAATATTATAATACACTTAATTATTATCTTTAAAATTGCTACTCATTACAATTAATCTATAATTAGACTAGTTTATTGTTAATTTTCTAACAATAATCCTTCAATATTATATATGTATGTCACCATTTGATATATATATATTTTTTAAATATTTTGCTATCTACAAATATAATCCACTAAAACATATATTATCAAAGTCTTGCTTTAAAATAAGTTAAAATGCTTCCAATCTAAATGAAACTAATATTTCATGTAATTTATTTTTGTAAGAAAATTGAAATAATACAAATGAATTTGTTTGGAAATAATGTAAAATTGACATTAATTGAATTCATAACAGTAAGTGAATAAAATATTTAATTATTAAAGACCCTAAAGTGTATTATAAATTTAATTATATTAAATTGATATATAAAATACATGAAAATAAGATGAAGTTATTTAAATCA
TAACATTTTCATTCATAAATCCTCCAAAACCAACGTATTTTTAGACATTCCTACAATATTTTTTTTTTCTAAAAAATATGGTTATCTAAAGATATCGGACATCCACACTTTCACTCGTCTAAAAATTCCAGCTATATTGATTCCTAAAAAACAAGAGAAACTTATGGAGTGTTTGGATTCGATGGGTTAGATACCCTAAGTCCAATCTTGATCTAACTTCTAAAAATCCTTGGTTTCCTACACATTGTTCTTGGTGACTGCCATTGACCAACCTTAAAAAAATATTAATAAAAACATTCTTAGTATATAACAAACCAAACAAACTCGTATATACAAATATTTTTAATAAATAATTGTCGTGTTTTTATTTGAAGTAGTTTTTATAAACAAAAATTAAATAAAACTAAATTTTTTAAAAACATTTCTTCATAAGTCGTTATAAACATGTCATTAGACTGCAATTGTGTAATTAGGTATATTTATTTAGGTTATATGTGTAGAAAGCTCTAACTTAATTTAATAACATTTACATTTTTTTTCCGTTAGTCAACTTAACTATCTTGAAGACTTGTACTCATATTTTAAACGTAGAGGATTAAAATGTTGAAAACGACTTAGCGGCCAAATAAAAATTAAAACTCAAAATTTATGATTTATTTATATCTCAAATATAATTTGATAGAAAAAAATTATAGTTTTTCAATAATTGAATTTAATATTAAAATACTATTTTATTTTAATTTTAAGCTAGCATGTTGTGGTAAACAAAATTAATTTTTTTTTTTCAAATTAGTGACAAAAAAACATATAAAATTTTATTTCAACATAATAAAATTAAAATATCAAATAATAAATTAAATAAAATTTTAATTTTAATCAAAATCTCAATTTCAAAATATTATATATATATATAATTTAACCGCTGTTTAAATACAAAATTATATTATTCTAAATTGTTTAGTTTATTTGAAACTTAATTATCTTAAAATAATTGCAATGATAGATTTAAACTAATATAAAATCTAACCACCGATTAAATACAAACAAAAATATTTCACATACTCAATATCTATAAATATGCACTTTATTTCCAAGTTTTCTAAACCTACACTCCAAATATAGGCTTCTTAAACACAAACTTTTAAAACTTGCACTCCAAACATAGGTTTTTTAAATTCAGGCTATCTAATCCTCCCAATCTAACCCTTAAGATTTATAGTTTTTCAATCACTTAACTGTTTATATATTTTTATGTATGGTATACTTAATTGTCTAGTTGAGAGCCTCCAAGAAAAAAAATGTCAAAGTTAGTATGACTCAAGATGATTGACATATATTCATTGCCTCGAAGTTGAGAGTTTTAAATCTTTTACCTCACTTATTATATTAAAAAAAACAAAAAGCATATTAGTGTACAATTCAATGAATGTATTTCTTACTTGGATTTGATCTGTCCCTTGCCACTTGGTTGATAATATTATTGTTTATTTATTAGATTGTACTTTGTTATTTTTTTCTTCTTTCAAGTGGTCTCATACACTTACCAGCATTATTATTATTATCGTCAAAGTAAGCATATATCAGTCGTAATTAACATGTATTATCTCTCACGAGGTGAAGAGTTTGAGTCAATCTCATCTCATTCCTTATTAATTGCAGTGAAAAAGATTAATTATCATATTTAAAATTATAAAAATCATATGTTACTGATAGGTCTACATTTGCCCACTCATGCCTCTTTTGTTTTGATATTAATGTGATCCAAGATGGAAACAAGGACTCGTGTAGAGTTACATCATAGATGGAAGATAGAATTTATATTAGAATACATTGAATGGAATAAATTATTTTATTGGGTAATTGCGTTGGGTGGCACTTTTAGGAATAATAATTAAGTATATAGCAACATTTAAAAAAAATGCAAATATAGAAAATCTATTATGATAGACTCTATCACTCATAGATTCCTACCAGCGATATAGTCTATCACTGATAGACTC
TTACTAGTAATATGGTCTATCACTGATAGATTTTTAAAATATAATTTAAATTTTGCTATATCGTACTATTTCTATAAATATTTTCAATTTAACGCTTATTATATTTGCAATGTGACCCTATCTTATTTTCCCATTTTTCAAAACAGATACGATAGAGATCGTTGTTAAGGTCCACAAATTACAAATTGAAGTGACATAGTGGAGAGGGAGACAAGCCCAATTAATGGGAAATATAACAACGTCATGCGGTTTGCAAATATAACAATTTTAACAAATCGTCCCTTGCCTGGCTTAAATATTTTGAGGTAAATTGTGTTTTTATTCTCGCACTGGTCGAATCCTGTAATTGATAGACATTTGTCACTAATATATAGATTACGTTTATCACCTATTTGCTATAAATTATATCATCGATTTATTGTATTACAATAGATTAAAAATAACAAAATCTATACTTTCAAATTTTCAGTTTGTAATTGTTTACTTGCTCAGTTGCTATTGGGTATGTCATTGGTACCATATTACTGCAAGGGAAAAAAGTTACAAGTAATTTCTTTTTTGTTCATCATTAATGTTTGGTATTTAAAAAATTACTATCAACACAAAAAATATCAAATTATTTACAAATATAAAAATTTTTCACTCTCTCACTGATAGACAATAAACTTTTTTTTCTATATAAGTAAATAGTTTAGTTCATTTTTCTATATTTTTAAAAAGTCCTTAATTTTTTCTATCAATAATTGTGCATAAGGTTATTATGTTGAGAATGACATTGACATGTTATATCAATATAATAAAAATGGGTATGCAAGTAATATTTAATTTAATTACTTTTGTTGTATAAATATAGTAATTTATTTGTCTATCAAGAAGTTTTGTGTTTATCACTAATTTACACTTATAAGGGTGTATTTGGCCAATGAGTTAGAAAGTAAAAGTTGTGAACTTCACTCCTTATTTGGCTCAAGGAGCTGGTGGGTCCCACTACTAGAAACATATCGAGTTTATATCTTATAAACTCCTTATATTATAGACTTCAGAAGTTCACAACTCTCTAGACTTTATAAATATTTACTGCTCGTTATTCTACTTCTTACCCCAAACATCGCCAAAGTGGTATATAATATAATCTTAGATGCTCCATTTTGTGAAATTATCTTATATTACTCGAATCATTTATGAGAAGATAGTTAATAATGGTATAATAAATAAAAATAAAATAAAAGCACAAGATTTATATAACTTTTAGGAAAATTACTATAAACGGACAAAATATCAAACTATTTATAAATATTGAAAAATTTCGCTGTCTATTAGAGAAAACTGTGATAGACTTCTATCACTAGGAAATCGCTGACAGACAGTGAAATTTTTTTATATTTATAAATATTTTGACTCATTTTTCTATATTTGAAAACAACAAATGATATGAGTGACCAAAAATATATTATGTATTAGTTGAGTTATGTTTGTTTTGACTATGTGATATGTATGTTGATATTCAACAAATTAATTATTTAAAAAATGTAGAATGTATCTAATTTCTCAAATTAGTGAATAAAGTGAGGGCAAAGGAGAAATTTTCATAAAATTAAACGCAACAGGGAGAAAATTATTTAAAGAAGTCGTTCCAATGTAAAAAACAATAATCTCATATAATGCATCTAATAGATAGAGAAGATTTGTGACATGTATTGTAACAATAATCTCATATAATGCATATAATAGTTAGAGAAGATTTGCGACATGTACTTCAATAATTTGCCCTATGCTATTATTTTCATGCATAGCATTTCTAATTCGCATCCATGAACTTGCATTATTATCATTTATTGAAAAAGGACAATTAAAATAACAAAAATTAATTGGGTTTGAATTCATTGCACGGCCTTTGTAACAAAAGAAGAAAAAACAAACAAAATCACGAGGGAAATAAGAGGCCGGCTTCCACTGTAATAACCAGTTCCGATAACAATCCCACTATTATTATTATTCTT
ATGAAAACCTTGAAAAGAAAAAAAATATTATTAAAAACGAACAAAAATCATATTTAATTGGTATTGATATTTTACAGTCTCATCCCCAACCTTTTACGGCAAAGTTTGGCTGATGAGTTGGAATTTTGCTTCTCCAGAAATGGTGACTTTCACTTCCTTTCAACTGTAGCCTCATTCTCTTTCCGCTACTCATTGGTGGCTTTCCATCTTTCTGAAAAACCAACCCTAACCGGTAAGTCTATGCTTTAAGTTTTACTTTCATTTTCGTTTCACTTCCAAACATCTTTCACGCATTTTCATTGTGCGCGGATTTTCCTTTTCCCTTTTTTTTGATCACTTCCCCTTCTTCTTTTGGATCTGCGACTAAGGCTCTTTCTCCCTTGTTCTTACGCGCTCCAACACCAGAGTCTTGCCCAAATATGGCAAAAATGTTTCTCTTCCATCTCTGCAACTCCGATTCTTGAAATCCTCCCTGTTCTGCGTCCTTCTTTTATTATTTTCTATTTCTGGCTCGAATCCTTCAATTGTTCACTGGTTAGTTGGCCTTTTCACTGTTTGTTTTCACGAAACTAGAAATTTCTCTTACTGGAACTGTTTTATCTGAAGTTTTTCTTGTTCTCTTTCCATTTTCTCAGTGTTTCCCTATCAGAAAGCAAGGCTAGTTATAGAGGTTGAAATGGCGAAAGAAAAAATCCAGATCAGGAAGATCGATAACGCCACGGCGAGGCAGGTCACTTTCTCCAAGCGTCGGAGGGGACTTTTCAAGAAAGCCAAAGAGCTTTCCGTTTTATGCGATGCCGATGTTGCTCTCATTATCTTCTCCGCCACCGGAAAGCTGTTTGAGTACTCCAGCTCGAGGTTTCTCTCTGCTTCTCTCTTTTTTTATATTTTTTTTTCCAGAAATAAAAACTTTAATCGGTTTCTCACGATTTCTGAGTTCTGCGTTTTCTGAACCTACTTTAATTCCATTTATAAATGACACCGTCTTACTAAAGATGGCGAGAACCTTGACAGAAATAAAGAGCAAAGCGCAGAAACCACATGTATTTTGCTTACAACAATCTTACGGAAAAAGATGAGAATAAATGAGAAAATATTAACTAATTCGATAATGGATATAAGAGGAAATTCTTGTAAAGTAGTGCCTCATAAAATTTGTCATAAACAAACAGAAAAGGCTAAATGAAAACTTGCCTCGAGCCTCGCCCTCTCATCAACAACGAGGTTTATGGTAATAGCGATGACTGAACTTTTTAGTCACTTAATTCTAACTTATTGCGGTTATTATAATGATTTATGCTTGACTATAGTTGATTTCGTTGATCTTTCTAAATTATATAGATTGGAGTGACAAACATGGATTCACTATCATTATCTTTTTCTTGTTTAGTCATTTCTTCCACTGATTATGTTACAATCTGCATTTGAATTTCTGAAGCATGAAGGGAATCATTGAAAGACATAATTTGCACTCTAAGAACCTTCAGAAATTGGAACAACCATCCCTTGAACTACAGGTTTGACGTTTTTTTTCCAGGCATTTCCTCTAAAAATGGTCATCAATTTTTTCCCCCACTGTTATCAAAAGATAAAAAAAATAATAATATTATTATCGAAGCCGAAAACATGGTTCATAAATTTGTCATTTTAAACGGGAAATATTTTATTTGTTTTAGGGAACTCAAATTACTGTGGAAATTAAGTAAGAGAAAAACCCTTAAAAAACAGTTATTCTTACATTGATTTTTATTATATAAAATAGAAAACAGCTTTAACATTTTTCTAAACGAAAATTATTTTTCTCCTCCTGAAGTCTACTTTATCTCTGTTAAACTAGTAGCTGTCTCATACGTAATACATTATTCCTCCAGTACGCCATTTTTTCTGGAATGCTATTATCTCTTCAGAATTTTAATTTTGTTTTAAAGAACTCAACTTATTTTGATATCACTTATAACATTTTCATTCTCTATTTCTTAAATTTTGAAAAACTACTTCAA
ATATCTAAAATTTGTATTTTAGATTCCAGAAAGCATATGGTCAAGAGAAGGGTAAAATATTTATATTTGACTCAAACTTATTTTTATAAGAGAATAAAATAATACACTTCCCTTTCGTCGTAAACGGACAAAAATCAATATTAAAAAAAGAGATCTTATATAGTTGATTATATTTGTATCTTGAGGACTTGATTATTTCAACTCAATGGTGGTCACCTTTCATTAGTTTACTGAAACATTGTTTACAACTGTCGTTGATTTCAGCTGGTTGAAAACAGCAATTATACCCGATTAAACAAGGAAATTGCTGACAAAACTCATCAGCTAAGGTATCTCTATTTTTACCCGAGATGTTATGTTTTCTTTTCCTTTTTTACATTTACTACCATAGAAAGAGGTGAAATTTGAATTTTCGACCTCAAGGAATTAGTATTAGCAATATTTTCCGGGCTACTCCCATTTTTTCACCCTACCCACCCACACGGTAGAAAGATGTTAAAATAATTATTAAAAAGAAAAAATAAAAGAAAAGGAAAAATAGTTGGGTCACTGAGCTGGGCGGCAAAAAAGAAAAAAATGGGTGTAGCGTAGCTAGCTAGTAAGTATTTTTCAAATAGTGCATGTTCCCCGGATTAAACTAGAAAAAAATGATTCCGATTTTGTATTTGAATACATGTTGTAATTTGATTCTTGCACCTAATAAATAGTGTAGTTAAGTACGCCTTCAATTTGTCAAAAAAAAAAAAAATCTAAAAAGATTTATAGGCAGTGGATGAGACTATTTTCATCAGTATTGGTTATCTGATAGCGTTGGATACTTTTAATTAGTTGGATTTGGATATTTGTTGGTTGCTAAATTTGAACCTTTTTTTTAGCATACTCGATTTTGTAGTTGGGTTTGTGTATTTTCATGATTAGGTTTGAACATCCCTTTGTCTGTTGGTTTTGTGTGGTTAGGTTTGAGTATTTTCTTAATTATAATTAGTCATGTCTTGAACATGATTTCTAATTTTGAGGAGGTATTCATTATAACATTTTATATAGAAGTTCAAAGATTTAAATGGAAGCAATTAAGAGATTAACTAAAATCAACTCACTCACCATAATAAAGTGTATACTTTGATTTTTTTTCTGTTGTACTTACTAGATGCACAAGAAATTGGAATTCGTAGATATTTTGTTGTCACATTTCAGCGATAGCCAGACTATAATGGTCTATTTAGACAAAATTCTCGCACACAGTACTGCAATAAATCAGAACACATACACGAACACGATTAAATTTGTTGACAATTTAGGCAAATGAGAGGAGAAGAACTCCAAACATTGAATATAGAGGAATTGCAGCAGCTAGAGAAGTCACTGGAGTCTGGATTGAGTCGTGTGATGGAGAAAAAGGTATTTTCCCTTTCTTCACTGTTCGTAATAAAATCCCATGTTCATCTCATGCATGGTTCCGTTTTTATGTCCCAGGGTGAACGGATCATGAAAGAGATCACTGACCTTCAAAGAAAGGTTAAGTACCATCTTCAAATAAATAAGGCTACTGCCCAAAGACCCGTATTGGGAGAATCTAAGTACTTGATAATAGTATTATGTATGTAATTTAGTTATACTCATAAACATATGATACACAAAATGTTATAATATCTTATCATCGGATGTAAATAGCTTGTGCTATGTATTTGAAAAATATGCACGAACTTTTAGCAATATAACTCGAATTGCATTCATTTCTTTAAATGCGTCTATTATTCGTAAATATGAATGAATATGCTATATAATTACTTAATTATCAAACGAGGTGAAGCTAATTTCAAAGTTTTATGTGCATGCAGTCGGCCGAGTTGATGGATGAGAATAAGCGACTGAAACAACAAGTAAGAAGAACCCTTTTCCACAGCATACTAATATAAATTGTGTCCTTTAAATCCCTTTGTAACTTTCGGCGGCGGTGGTTTGCAGGCGGAGAAAATGAACGCCGTGAGGCACCTGGGCGT
TGAGCCCGAGATTTTGGTCGTGGAAGATGGCCAGTCGTCGAACTCCGTCACTGAAGCTTGTGTCTCCAATTCCAATGGCCCGCCTCAAGATCTGGAAAGCTCAGACACATCTCTGAAACTAGGGTATCTCTTCATTTCATGAAATTGATTTTAAATTATTTTCTAGTAATGTTCATGTAGATTGGTCAAGTTTTGTATGATCGGGTGAATCCAATTGGTTGCAGGCTGCCGTATTCCGGGTGATGGTGATGGTGATGGAACAAGATGATCAGGTACTGTAGCTAGCAAAAAAGGTAGTATGTAAAAATAATTATACCCTAAAATTCGGGTTAAGTTGAGTAATTGGAATCTAAATCTAACCCCGTCTCATCTTCATCATACAATTCACATGACTCCATTGAAATCTGAAAATAGGGTCGCATTTTTAAGAATACAATGCACTGTTTTAACGAATCAAATTAGACTGCAAGTGGGCTTTTTATGAAGCCTTGGTCACTTGGTATATGCGAGCAGTCGTAGTTTTGGGGTTAAGGACTTAACGACGGCGTTTTCCTGGTGACATCGCCCTTGTTTCGTTAAACAGGATCGAGTCATACTTTTAGTTTAAACTAATAGCGGAATTTTGAATGGAAAAGTGGGAATACCATTGTTTTGCCCATTTTTTTTTTAGCAATTTGATGGACGTATTAGGGAAATAATTATAAGAAAACACTTGGGTGGATCTTAGTTATAACTATTTTTTTTTTCGTCCCATCATAATTTGTTACGGCCATCATTCTTTTTTTAAAAAAAAAGTTTGATTTTTTATTTATTCTTATTTTTAGGAGTACAATGCAAAAAATAAATATATTTGAAAAGTTGGAATGTACTCAAAGTATTAGTATTATCTTAATTATAAAATCAAATTTGGACTAGATGCAATCATACTTACACTTGGTAGTGAAATCATCTTTTTTTTAACAAATTTGATAACATATAAAGGTGGGAGATTTAAATTTATATATATATATATACATATATATGCTTTATTGGCGATTAACTTATATTGATGTTTGATTATTAAACCTAAAATTCTTAAATGTACATAATTTAATAGTTCATAGGTCTGGATTAAAACTTAAAATGAGAAGGATGCATACTTTTCTATTTTAGTTTGGTACTGCTTTTATAAATTATTTTTGGACATTGGGTTAAACAAGCAATTGTTTGGATCGGTTTTTTTCTTCAAACCTTTTAGTTAAGGACTCGTGCCTTCTAAACTAATTGGCTATACTTTGCCCTTGCAATCCCACTATTACTATGAAATGATATTAGCTTCATTTTAGTCATTGGTGTTTCAAAATTTTTCTTTTGTGAACTCTAACCTTTAAAAATGTATTTTTTTTTTTTTAATTATTGAACTTTCAAAACTAATGAGATTTGTTGTTTACTTTTACTTTTTTTTTAAAAAAAATTATTTTTTAGATTAATGATACTTTTGAGCATGCATATACTAACTTGTAAATGTGAGTCTGATTTTTCAAATCATACAAGTTAGAGACAAGTATTTGCAAGAATCAAAATGGTTATTTTTAAACAACTTTAGGAACTATAATAGACATTGTTGAGAATTTAAATAGAACTAACTTGAAAGTTGGAAGATCAAATGATATCTAAACTTTGTTAAAGAACTTTTAAGGCAGGGAACAACTTAATAGGCACGTAAGACAATAATGGCTGCAAATGCTTTAATTTGTTTCTATTGGATTGTTAAACTTTTTTTTTTTTTTTTATATCCGTGAGTGTCTGGGCCAACTTATGCGCATCTCGACTGATCTCACGGGACAACTACTACTACTAGACCAACTCATGATGGTTAATTGTTAAACCTTTTACTTTGTGTTGGTGAATGAAGTTTAAAAATATCCCATCTTTTCTATACAATCTTAATTTTTTTAGTACAATGGAATGGGAGATTTGAACAATGAACTTCATGGTCACTAATACATCTCGGGTACCAA
TTGAGATATACTCATTTTGACCTATACAATTAAGATAACTTGTAATGTTTATGTAATATTTAGATTTTATGTAATTCATATAGTGTTAAAATGAATTTAGCTTAGTGATAATTGACATGTGTTATTTCTCATCTTGCAATTGTTGTAGTAAAAAAGAAAAAAAATACATAATTGAGTTTTGCCATTTTTTATAAGGCACTATTTAATTTAGCATAAAATAAGTTTGTGAACTTTATTTTTTTATTTTTTAATATGATTATATTCTTAGACATAAAATTGAAAGCCTAGAGATATTATCAAAATGATTTTAGCTTGACATAACTATGTATTGACGATCAAGACATGTAATCATCCCTCGCAGTTTATTGATTTAAAAAAAAAACGGTTCAGAGACATTAAATATCTTTTTAAAACTTTAAATACTTGTTGAATACTAAATTGGAGGTTAAAGTAACTATAGCAAACACTTTTTAAGGTTTCATACCTTATTAAACGCAATCCAAACTCGTAACTTAACCTAAATTGAACTAAGAACTCGTTTTCTTATTCATGGCCAAAAAGAAAATAAAAACATAAAGCCGCCATGATTGACAGGCGTTGTAATTATATGTGAAACATCATCAAATCTTTACTTTAATGCCAAGCCATTTAATTGCTGGAACAAAAAAAAAAAAAGAGGCCTAAGGCAATTCCTAGCGATTGATGAGAAATTCACGTCTTTCTAATAATTAATGATTATAAATAAATGGGAATCTCTCTGTTTTAAAATTAAGTCAGGCTTCGTAATTGGGGCCTCATCAATTTATTCATAACTTTACTTTGCATATGCACTTATAATTTCTTGGATGCTTCAAAACTTTACTGCAGCTGCACGCAACTCTTGTGCAGATCTACCCAACGCTCCTGCATACACGTGTCTCCTCATATTTTATTTTCATTAATAGAAATTAAATTTATTTACTAACAATCTTCTTGCATTATGATTTTAAAATTTTAACTAAAAACTCAAATTAATATCTATATATATTAATAGAAGTAAAAAATGAACATAACTCTTATACAAAGCATATACAACCAATGCCAGAGATTTGATGTTCCCTTTTCATATTGTCAACAAAAGAAAAAAAAATGTCGCCAATTTTGCTACTTTATTTTTTAATTAATTTATTGTATGTTTATTTTTGCCAAACTAAACTCTTATATAGACACACACATTATATTATACCAGTTTCATCTATTAAAAATGCAAAATCCTTTGTAATCTTGTCTCGTCTTCGATATATCATTTTAATTAAACGCGATAGATGTTATAAAATCTAACTATTACAAACCTTTTAATTACATATTGTTATTTGCCTTATTTCGTCATCTATTCACAATACGGTTCAATTCTAACAATATATATGGAAGTGAAAATTTGAAAATTTAACTTTTTATTTTATGATAAGTCAAATTATACTTATGTTGGCATTTACACGATACTTTAATTTTCAATTGATGAAATAATTGTAATAAAAACTGTATTTTTTTTTTAGGATTTTGTAATGTAAAACTTGTAAGTCTAATAACTATTTGATTTTTACTATACGTACTCTCTCAAATGTCTTTTTTTTAATTATCTTGTTTTTTCTTTAACTTATAATGCGTATTTTTTAACGCTGAGAATAAAAACTTTATTAAATTAAACAACAATATACCCTATTATTTTCAATTGAATAATTTTAAACATTATCATGAAATTACTTTCCAACTCGAAAAATCACGTGCGATGCATGTGAGTGCTGCCCACTACTTTTAAAAAGTGAAAATCATAGCCAAAAAATTGATAATAAACTATCATTTTCTTTTTCCTGAAAAGAAGTCTTGCATATTTTAAAAGAATAGAATAATATTATGAATAAATATACCACCTCCAAGTGACATGACATCATATATATGTTGAAGAGACACTTTCAGTAAAAGACATAATAATAATAATAATAATAATAATAATAATAATA
ATAATAATAACAACAACAATAATGATAATAGTGAAGGGTAGTTCCAATTCCAAATAGCATTCAACCTTTGAAACATTGGGATGGGACAACGAACACAATGCCCCACTCCATCACATTCATAATAGTCATGGGATCTCCTATTGAGTATTTTTTTTATTTGATTTGTGGGGAGTTTTTGTAAATGTAGCATTCAAGAGTGTTTGAATCTACTTGAGCACCGTCATATTGGTAAGCCCAATTTAACAAGTAAATATACTCGTTTCGTTTCGTCTATGTGTTGTGTCATGAATTGTACAAAATCTAATTGACTCTGAAACTTAAAGAGATTGTAACCCTTTGAAATGGGTATTATTTGGCTCTTAACATAAAAAATTCATCATTTGTGAATTGAAAACATAATGGTTCATTATTTGAAAACTTTTTGAAGGAAGGGACTAGATCTAATTCCATGTCGATTACTTCCATACCGACTTTTAGGAAGGGCACATTCAAGTTTTTCTTTTTGCTTCGAGATGTGATGTAACACAGTAACTCACACTCTTGCTAATCGTGCTCAAATGACAAACTTCGCTGAAATATAGTTTGAAGACCACCCATCATAAATGAATGATGTTATTTTAAAGGATTCTTCTTTTTCTTCTTGTTTGCCAAATAGTTTGCAATACTATCCTTGTTTAATGCATTTACATAGACACTATTTATAATGCCAATGTACTTAAAAATATGGGTTGTTTTCAAATATAAAAAATAAGCCAACTTATTTATAAATATTGAAATTTTTTACTGTCTATCAGAGATAGTGAACGATAGACTTTTATCGCGATCTATGACGGAATATTTTATAAACCATCTAATATTTTTATATTATTTTATTTGTCACACCAAATGAAATGGAGTATTTAACTTCTAACATCTTTATAAAAAAAATATACTAAACCAATGAACCTATACATAAAAGTTGATTTTTTTTTCTATTTTCATAAAGTTTATAAGTATTAGATAGCATGAATTTTCATCTCAAAACTAATTGACAATGATAGAAATAATTCATCTATCTTACAAAAAATGTGAGATCCTTGATTTTTCCAATGTTAAATCCTTAACAATAAGTATTGGTTCCACTCGTTTCTTTTCTTTGTTTTACTATCTATTGTATAACCATGTTTCAAAAAATCAAATGAAAAATAAATATTTTTAAAATATTGCTTTTCGTTTTTAAAGATTAACCAAAAATTCAAATCCTGTTTTGGTAAAAACCATAGTACAAAAAAAAAAAAACATTGATATTAAGCTATCAATATGGTTTTCAAGATATTAGCATCCAGATTTTCTTATAACAAGTGGAACCGACAAGCAACCTTCATCATATACTTCATTTATGTAGCTTTAAGATCTAAACTATTGCTCATCCCAAGACAGCAAAGTATCGTCCTAATCAAAGATTTAACGAAATTTCATAGAAGGAGAAACGAAGTACAGAATGGAATGGGGAGTGGAACTGTGTGAATAATTGCAACCATTTGTTGAAAGCTAATTGTCGGTGCTCACTGCTCATAGTAGCTACCGATAAATGAATCCAATAAAGTGGACCTTCAATCAGAGGTCCGCAAAGGCAACAACAGGTCAAATCTCTACTCGAAAACACGCCTTCGTCCTTGTATAGTTAAAGTGGAACGAACAGAGGCAAAAGTTATTTTCACTACGTGATTCTCTCTTTACCTTTTAATCGCCATACCCACCTATAATTTGCTTGGACCCTCTCTGAAAACGACACCCTTTCTCTCCAATCAATTCAGCTGCAAGCTGTACAGATGTCCATGTGTGTGAATCAGATGCCTTTACTCTCATATTTGATATAACCCACTATCACACTGCAATAAACCTCTCATCTCCTTCATTACAATCCCTTTTTGCTTCTGGGGTTTGGGGTTTCGCATTCGATTGGCACAACCCATCTCCAAAATCCATGGTTATGGGACTCAGTTATCAGTCGAACAGAG
CTGGAGCTCTGGCTGGTGTTTTCGTGCTGCTTTTCCCTGTTTTTCTACCGGGTTTGTTTAGCCCTTTCGGTCATGCTTCGCCTTCCACATTCTCGGTATCTCTTTGCCACTTTGATATTCTCTCTTATCTGTTTTGTTCCTTGAAGCTTTTGAACTTTTGATCTTCTTAGTATTGTTATGTGAAATTATTTAGGAAATGTCGGTGTTTTGATTTTGTTGAAGGATTGCCGTAAAGAGAAAAGTTTATCTTAGTGGAGAAGTGGGAAAATCTATTCCTGTAATAGTTGCGGCTCTCTTTTACTTCCTTTTCCTGTGCAATTTTTTTTGGAAATGTAGAGACACAACTGTGGTTCTCTTCATTTATAGTATGTCGGTATGATATTTGATTAAGATTTTTGTAATAGAAAATAAAAATTGGAACGTTCGGTTCATGATTAGTTTAGTTTTAGGGGAGTGGTACTCCAGAGGGCCATTATAAGCTAACTCTGGTTAAATTTAACCTGATTGACAAGGGTTTGAACGTATGGGAATGGTTATTTTTCATCTCTTTTTGTTCAACTTTTCTTCAAATTATAAAGTTTTTCTCTCCAGTGGAAATATTATTTGTTATGGATTGTAAAATTTGACTGGTGGGTGCACCATTATTAGGACTTTTACGAGTTCTTAAGATTGTGATTGATGATTTAATTGGGTAACCTCTTGTAGGAATGGAACGCTCCAAAGCCCAGGCACTTGCGTCTGCTGAAAAGTGCTTTACAACGCCAAAGTGTGGGTAAAAGTTGATTTATTATATAATTGAAGTTTGTTTTAATCTCCAACTACATTCTTTTATGAATGAGAATCTCATAGTGCTAAAATCCTATTTCTATGACAGTCAAAACCAGATCAGTCTGATTTATGGGCTCCTTTAGCTGATGAAGGGTGGAGGCCTTGTGTTGCTTCTTCAAAAGCTTCTTGTAAGTTTTGAACCATGGAGTTGGAGGTCTCAGAAGCTTCCCATGATCATTTGTATTTAAACATATCACGTACTGAATTACTAATCTTTTTCATGCCTGGTTTATGATCCATATGTTCTGCAGCACTACCAGGGAAATCTGAAGGATATATCCAGGTGTTTCTTGATGGAGGGCTGAACCAGCAAAGAATGGGAGTACGTGTTGGATTAGAACTTTAACATACTTTTTTGTCATTAAACGTTTATCGGCACAATTTTGTTGATGATTTAGTCTTCAATTTAAAATACTTACTTGATCCACATGCAGATATGTGATGCAGTTGCTGTTGCAAAAATTCTAAATGCAACCCTTGTGATTCCCCACCTTGAAATTAATCCTGTCTGGAAAGATTCAAGGTACCAAGAAAATTTGCTCTGCGTGATATCGTTCTATTCCTATATGTATGAAAAATTATCATGAGATTTCTTTTACCTTATCCTTTACTTTTTGGAATCTGCAGTTCTTTCGTCGATATATTCGATGTGGATCACTTTATTAACGTATTGAAGGATGACATTTCTATAGTTAAAGAGCTGCCAGCTGATTTTTCTTGGAGTACTAGGGAATATTATGCTACAGCCATTCGGGCTACTAGAGTCAAAACAGCGCCTGTTCATGCATCAGCCAACTGGTATCTGGACAATGTTTTACCTGTATTACAGAGGTTGGTATTGAAGCCTATCCGTTCGTTCTGACCTTCTAGGACTGCAAGAGTGGTTGCTTAGTTTCATATCTGCATATACACTTTATCACTTATAAATATAAAAGATGAAGCAGTTACACAAAACTGATATATGAAATTTGGAACATACAATAAGACAATAATGATATGGAAGTGGGGGAGAAGATGCATTGCCAAGGATTAGCCATATGATTTAAAAAGATTATATAACATGGTATACCTCATACTTGGATATATGGATGCTGAAAATTTTGATCGATTTAAAATCTTTCTAATGTTTGTATGTGCTATGAAAATTACTTATTGTTCTCTGGAATTTGTTA
CTTTCTCGGTTTTACATATTTTTTCTATATTAATGATTCAGCTACGGTATTGCTGCCATTGCACCCTTCTCGCACCGTCTAGCTTTTGAGAACTTGCCCGATGAGATACAAAGGTTGCGATGTAAGGTCAACTTTCAAGCATTAACTTTTGTTCCCCATATCCGAGTACTAGGAGACGCCCTCATCAGTCGACTGCAGTATCCTTTGAATAAGAAGGAATCAAAGGTGGCTAACTACTTAAGTATGACCACTGATGCAAATGAACAAGGTCCATTAAAGTTTGTTGTCCTACACCTCCGATTTGACAAGGTATAGCCACCCTTCCTTTCTAGGTGCCCGATTAGATTAAATTGAAGTTCCCCTGTTACTATCTATTGGACAATGGACATTGCCTTTCTACTTTATCTTGCTGATATTATAGACCATTAATTTAAGTAATCCCTTGCATGCAAACAATCGAAGGCCTCGTTGCTTAGACTTTATCTCATTAGTTCTTAGGACCGACAAGTTTGTTCATGAGCTACCACCATACTCTGCTCATGTAAGCAATTGGTCGAATTTAAAAGTCCCCATTGCAACTTCACCGTATTTAGTGTTTTTATGCAACCAGTTTATCCTAGGTTTTAATTATCTAAGAACAGGGATACTGGCCATTACGGAATCAAATTGAATGACTGGAACTTTGGATTGTGCAATCATATATTCTACATTCTTTTATTTAAATTAATTTCGTATAAAGTTCCACAGCTGCTGATTTCTTTATGTTCGACATGAAGGACATGGCAGCTCATTCAGCCTGCGATTTTGGTGGGGGAAAGGCTGAAAAACTTGCTTTGGCCAAATATCGTCAAGTGCTCTGGCAGGGAAGGGTCCTTAACTCTCAGTTCACAGATGAAGAGTTGCGGAGTCAGGGTCGTTGCCCTTTGACACCCGAAGAGATCGGTTTGCTACTGGCTGCTTTCGGTTTTGACAACAATACCCGTCTATATTTGGCCTCCCACAAGGTAGTTCCTTCTCTATATACAAATAAAGCAACCAAACTTGCTTTAAATCTTTTGGTTTGAAAATGAAATTACTACCATTACTTGGAAGTTGGAACTCTAAAGTGCATGGTTCTTTCAGGTATATGGTGGGGAAGCTAGGATTTCGACTTTGCGGAGTCTTTTCCCATTAATGGAAGACAAGAAGAGTCTCACCTCTGGAAATGAACTAGCCCAAATCAAAGGAAAGGCTTCTTTGCTAGCTGCGGTTGACTACTACGTAAGCATGCATAGTGACATTTTTATCTCGGCTTCTCCTGGAAATATGCATAATGCAATGGTAAGCAATAAACTTCTTGCTGATTCTTATTTTGTCAAATCTTGCTAAGCAAAACCTGTGCTTTTTGTACGAACATTTCCTGCAAGTGATATATTCATTATGATCATTCTTTTACTCTTTCTTGATATTTTACAGGTGGGACATCGCACGTACGAGAACTTGAAGACCATAAGACCAAACATGGCATTGTTGGGACAGCTTTTCATGAACAAAAGCATCATTTGGTCAGACTTTCAGGAGGCCACTGTAGAAGGCCACAAAAACAGACAAGGACAAATAAGGTTGAGAAAGCCAAAGCAATCGATATACACATATCCAGCTCCTGATTGTGTTTGCCACGCTTGA

mRNA sequence

ATGAACATCAACCTAAATTGCAAGTACCATGTTTTGAGTCTCATCCCCAACCTTTTACGGCAAAGTTTGGCTGATGAGTTGGAATTTTGCTTCTCCAGAAATGGTGACTTTCACTTCCTTTCAACTGTAGCCTCATTCTCTTTCCGCTACTCATTGGTGGCTTTCCATCTTTCTGAAAAACCAACCCTAACCGTGTTTCCCTATCAGAAAGCAAGGCTAGTTATAGAGGTTGAAATGGCGAAAGAAAAAATCCAGATCAGGAAGATCGATAACGCCACGGCGAGGCAGGTCACTTTCTCCAAGCGTCGGAGGGGACTTTTCAAGAAAGCCAAAGAGCTTTCCGTTTTATGCGATGCCGATGTTGCTCTCATTATCTTCTCCGCCACCGGAAAGCTGTTTGAGTACTCCAGCTCGAGCATGAAGGGAATCATTGAAAGACATAATTTGCACTCTAAGAACCTTCAGAAATTGGAACAACCATCCCTTGAACTACAGCTGGTTGAAAACAGCAATTATACCCGATTAAACAAGGAAATTGCTGACAAAACTCATCAGCTAAGGCAAATGAGAGGAGAAGAACTCCAAACATTGAATATAGAGGAATTGCAGCAGCTAGAGAAGTCACTGGAGTCTGGATTGAGTCGTGTGATGGAGAAAAAGGGTGAACGGATCATGAAAGAGATCACTGACCTTCAAAGAAAGTCGGCCGAGTTGATGGATGAGAATAAGCGACTGAAACAACAAGCGGAGAAAATGAACGCCGTGAGGCACCTGGGCGTTGAGCCCGAGATTTTGGTCGTGGAAGATGGCCAGTCGTCGAACTCCGTCACTGAAGCTTGTGTCTCCAATTCCAATGGCCCGCCTCAAGATCTGGAAAGCTCAGACACATCTCTGAAACTAGGCATTCAAGAGTGTTTGAATCTACTTGAGCACCGTCATATTGATGCCTTTACTCTCATATTTGATATAACCCACTATCACACTGCAATAAACCTCTCATCTCCTTCATTACAATCCCTTTTTGCTTCTGGGGTTTGGGGTTTCGCATTCGATTGGCACAACCCATCTCCAAAATCCATGGTTATGGGACTCAGTTATCAGTCGAACAGAGCTGGAGCTCTGGCTGGTGTTTTCGTGCTGCTTTTCCCTGTTTTTCTACCGGGTTTGTTTAGCCCTTTCGGTCATGCTTCGCCTTCCACATTCTCGGAATGGAACGCTCCAAAGCCCAGGCACTTGCGTCTGCTGAAAAGTGCTTTACAACGCCAAAGTTCAAAACCAGATCAGTCTGATTTATGGGCTCCTTTAGCTGATGAAGGGTGGAGGCCTTGTGTTGCTTCTTCAAAAGCTTCTTCACTACCAGGGAAATCTGAAGGATATATCCAGGTGTTTCTTGATGGAGGGCTGAACCAGCAAAGAATGGGAATATGTGATGCAGTTGCTGTTGCAAAAATTCTAAATGCAACCCTTGTGATTCCCCACCTTGAAATTAATCCTGTCTGGAAAGATTCAAGTTCTTTCGTCGATATATTCGATGTGGATCACTTTATTAACGTATTGAAGGATGACATTTCTATAGTTAAAGAGCTGCCAGCTGATTTTTCTTGGAGTACTAGGGAATATTATGCTACAGCCATTCGGGCTACTAGAGTCAAAACAGCGCCTGTTCATGCATCAGCCAACTGGTATCTGGACAATGTTTTACCTGTATTACAGAGCTACGGTATTGCTGCCATTGCACCCTTCTCGCACCGTCTAGCTTTTGAGAACTTGCCCGATGAGATACAAAGGTTGCGATGTAAGGTCAACTTTCAAGCATTAACTTTTGTTCCCCATATCCGAGTACTAGGAGACGCCCTCATCAGTCGACTGCAGTATCCTTTGAATAAGAAGGAATCAAAGGTGGCTAACTACTTAAGTATGACCACTGATGCAAATGAACAAGGTCCATTAAAGTTTGTTGTCCTACACCTCCGATTTGACAAGGACATGGCAGCTCATTC
AGCCTGCGATTTTGGTGGGGGAAAGGCTGAAAAACTTGCTTTGGCCAAATATCGTCAAGTGCTCTGGCAGGGAAGGGTCCTTAACTCTCAGTTCACAGATGAAGAGTTGCGGAGTCAGGGTCGTTGCCCTTTGACACCCGAAGAGATCGGTTTGCTACTGGCTGCTTTCGGTTTTGACAACAATACCCGTCTATATTTGGCCTCCCACAAGGTATATGGTGGGGAAGCTAGGATTTCGACTTTGCGGAGTCTTTTCCCATTAATGGAAGACAAGAAGAGTCTCACCTCTGGAAATGAACTAGCCCAAATCAAAGGAAAGGCTTCTTTGCTAGCTGCGGTTGACTACTACGTAAGCATGCATAGTGACATTTTTATCTCGGCTTCTCCTGGAAATATGCATAATGCAATGGTGGGACATCGCACGTACGAGAACTTGAAGACCATAAGACCAAACATGGCATTGTTGGGACAGCTTTTCATGAACAAAAGCATCATTTGGTCAGACTTTCAGGAGGCCACTGTAGAAGGCCACAAAAACAGACAAGGACAAATAAGGTTGAGAAAGCCAAAGCAATCGATATACACATATCCAGCTCCTGATTGTGTTTGCCACGCTTGA

Coding sequence (CDS)

ATGAACATCAACCTAAATTGCAAGTACCATGTTTTGAGTCTCATCCCCAACCTTTTACGGCAAAGTTTGGCTGATGAGTTGGAATTTTGCTTCTCCAGAAATGGTGACTTTCACTTCCTTTCAACTGTAGCCTCATTCTCTTTCCGCTACTCATTGGTGGCTTTCCATCTTTCTGAAAAACCAACCCTAACCGTGTTTCCCTATCAGAAAGCAAGGCTAGTTATAGAGGTTGAAATGGCGAAAGAAAAAATCCAGATCAGGAAGATCGATAACGCCACGGCGAGGCAGGTCACTTTCTCCAAGCGTCGGAGGGGACTTTTCAAGAAAGCCAAAGAGCTTTCCGTTTTATGCGATGCCGATGTTGCTCTCATTATCTTCTCCGCCACCGGAAAGCTGTTTGAGTACTCCAGCTCGAGCATGAAGGGAATCATTGAAAGACATAATTTGCACTCTAAGAACCTTCAGAAATTGGAACAACCATCCCTTGAACTACAGCTGGTTGAAAACAGCAATTATACCCGATTAAACAAGGAAATTGCTGACAAAACTCATCAGCTAAGGCAAATGAGAGGAGAAGAACTCCAAACATTGAATATAGAGGAATTGCAGCAGCTAGAGAAGTCACTGGAGTCTGGATTGAGTCGTGTGATGGAGAAAAAGGGTGAACGGATCATGAAAGAGATCACTGACCTTCAAAGAAAGTCGGCCGAGTTGATGGATGAGAATAAGCGACTGAAACAACAAGCGGAGAAAATGAACGCCGTGAGGCACCTGGGCGTTGAGCCCGAGATTTTGGTCGTGGAAGATGGCCAGTCGTCGAACTCCGTCACTGAAGCTTGTGTCTCCAATTCCAATGGCCCGCCTCAAGATCTGGAAAGCTCAGACACATCTCTGAAACTAGGCATTCAAGAGTGTTTGAATCTACTTGAGCACCGTCATATTGATGCCTTTACTCTCATATTTGATATAACCCACTATCACACTGCAATAAACCTCTCATCTCCTTCATTACAATCCCTTTTTGCTTCTGGGGTTTGGGGTTTCGCATTCGATTGGCACAACCCATCTCCAAAATCCATGGTTATGGGACTCAGTTATCAGTCGAACAGAGCTGGAGCTCTGGCTGGTGTTTTCGTGCTGCTTTTCCCTGTTTTTCTACCGGGTTTGTTTAGCCCTTTCGGTCATGCTTCGCCTTCCACATTCTCGGAATGGAACGCTCCAAAGCCCAGGCACTTGCGTCTGCTGAAAAGTGCTTTACAACGCCAAAGTTCAAAACCAGATCAGTCTGATTTATGGGCTCCTTTAGCTGATGAAGGGTGGAGGCCTTGTGTTGCTTCTTCAAAAGCTTCTTCACTACCAGGGAAATCTGAAGGATATATCCAGGTGTTTCTTGATGGAGGGCTGAACCAGCAAAGAATGGGAATATGTGATGCAGTTGCTGTTGCAAAAATTCTAAATGCAACCCTTGTGATTCCCCACCTTGAAATTAATCCTGTCTGGAAAGATTCAAGTTCTTTCGTCGATATATTCGATGTGGATCACTTTATTAACGTATTGAAGGATGACATTTCTATAGTTAAAGAGCTGCCAGCTGATTTTTCTTGGAGTACTAGGGAATATTATGCTACAGCCATTCGGGCTACTAGAGTCAAAACAGCGCCTGTTCATGCATCAGCCAACTGGTATCTGGACAATGTTTTACCTGTATTACAGAGCTACGGTATTGCTGCCATTGCACCCTTCTCGCACCGTCTAGCTTTTGAGAACTTGCCCGATGAGATACAAAGGTTGCGATGTAAGGTCAACTTTCAAGCATTAACTTTTGTTCCCCATATCCGAGTACTAGGAGACGCCCTCATCAGTCGACTGCAGTATCCTTTGAATAAGAAGGAATCAAAGGTGGCTAACTACTTAAGTATGACCACTGATGCAAATGAACAAGGTCCATTAAAGTTTGTTGTCCTACACCTCCGATTTGACAAGGACATGGCAGCTCATTC
AGCCTGCGATTTTGGTGGGGGAAAGGCTGAAAAACTTGCTTTGGCCAAATATCGTCAAGTGCTCTGGCAGGGAAGGGTCCTTAACTCTCAGTTCACAGATGAAGAGTTGCGGAGTCAGGGTCGTTGCCCTTTGACACCCGAAGAGATCGGTTTGCTACTGGCTGCTTTCGGTTTTGACAACAATACCCGTCTATATTTGGCCTCCCACAAGGTATATGGTGGGGAAGCTAGGATTTCGACTTTGCGGAGTCTTTTCCCATTAATGGAAGACAAGAAGAGTCTCACCTCTGGAAATGAACTAGCCCAAATCAAAGGAAAGGCTTCTTTGCTAGCTGCGGTTGACTACTACGTAAGCATGCATAGTGACATTTTTATCTCGGCTTCTCCTGGAAATATGCATAATGCAATGGTGGGACATCGCACGTACGAGAACTTGAAGACCATAAGACCAAACATGGCATTGTTGGGACAGCTTTTCATGAACAAAAGCATCATTTGGTCAGACTTTCAGGAGGCCACTGTAGAAGGCCACAAAAACAGACAAGGACAAATAAGGTTGAGAAAGCCAAAGCAATCGATATACACATATCCAGCTCCTGATTGTGTTTGCCACGCTTGA

Protein sequence

MNINLNCKYHVLSLIPNLLRQSLADELEFCFSRNGDFHFLSTVASFSFRYSLVAFHLSEKPTLTVFPYQKARLVIEVEMAKEKIQIRKIDNATARQVTFSKRRRGLFKKAKELSVLCDADVALIIFSATGKLFEYSSSSMKGIIERHNLHSKNLQKLEQPSLELQLVENSNYTRLNKEIADKTHQLRQMRGEELQTLNIEELQQLEKSLESGLSRVMEKKGERIMKEITDLQRKSAELMDENKRLKQQAEKMNAVRHLGVEPEILVVEDGQSSNSVTEACVSNSNGPPQDLESSDTSLKLGIQECLNLLEHRHIDAFTLIFDITHYHTAINLSSPSLQSLFASGVWGFAFDWHNPSPKSMVMGLSYQSNRAGALAGVFVLLFPVFLPGLFSPFGHASPSTFSEWNAPKPRHLRLLKSALQRQSSKPDQSDLWAPLADEGWRPCVASSKASSLPGKSEGYIQVFLDGGLNQQRMGICDAVAVAKILNATLVIPHLEINPVWKDSSSFVDIFDVDHFINVLKDDISIVKELPADFSWSTREYYATAIRATRVKTAPVHASANWYLDNVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRCKVNFQALTFVPHIRVLGDALISRLQYPLNKKESKVANYLSMTTDANEQGPLKFVVLHLRFDKDMAAHSACDFGGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLLLAAFGFDNNTRLYLASHKVYGGEARISTLRSLFPLMEDKKSLTSGNELAQIKGKASLLAAVDYYVSMHSDIFISASPGNMHNAMVGHRTYENLKTIRPNMALLGQLFMNKSIIWSDFQEATVEGHKNRQGQIRLRKPKQSIYTYPAPDCVCHA
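
The protein sequence above corresponds to the CDS read codon by codon. The sketch below illustrates this with the standard genetic code (NCBI translation table 1); that this table applies to the annotation is an assumption, and the helper name `translate` is hypothetical. Only short excerpts of the CDS are used.

```python
# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the polypeptide
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 36 bases of the CDS listed above.
    print(translate("ATGAACATCAACCTAAATTGCAAGTACCATGTTTTG"))
    # Last 21 bases of the CDS, including the TGA stop codon.
    print(translate("GATTGTGTTTGCCACGCTTGA"))
```

The two excerpts translate to the first twelve and the last six residues of the listed protein, respectively.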
Homology
BLAST of HG10016573 vs. NCBI nr
Match: KAG6604063.1 (O-fucosyltransferase 39, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1334.3 bits (3452), Expect = 0.0e+00
Identity = 690/801 (86.14%), Positives = 716/801 (89.39%), Query Frame = 0

Query: 79  MAKEKIQIRKIDNATARQVTFSKRRRGLFKKAKELSVLCDADVALIIFSATGKLFEYSSS 138
           MAKE+IQIRKIDNATARQVTFSKRRRGLFKKAKELSVLCDADVALI+FSATGKLFE+SSS
Sbjct: 1   MAKERIQIRKIDNATARQVTFSKRRRGLFKKAKELSVLCDADVALIVFSATGKLFEFSSS 60

Query: 139 SMKGIIERHNLHSKNLQKLEQPSLELQLVENSNYTRLNKEIADKTHQLRQMRGEELQTLN 198
           SMKGIIERHNLHSKNLQKLEQPSLELQLVENSNYTRLNKEIA+KT QLRQMRGEELQTLN
Sbjct: 61  SMKGIIERHNLHSKNLQKLEQPSLELQLVENSNYTRLNKEIAEKTQQLRQMRGEELQTLN 120

Query: 199 IEELQQLEKSLESGLSRVMEKKGERIMKEITDLQRKSAELMDENKRLKQQAEKMNAVRHL 258
           IEELQQLEKSLE GLSRVMEKKGERIM EI+DLQRKS ELM+EN+RLK QA+KMN VR+L
Sbjct: 121 IEELQQLEKSLECGLSRVMEKKGERIMTEISDLQRKSTELMEENRRLK-QAQKMNCVRNL 180

Query: 259 GVEPEILVVEDGQSSNSVTEACVSNSNGPPQDLESSDTSLKLGIQECLNLLEHRHIDAFT 318
           GVEPE LVVEDGQSSNSVTEACVSNSNGPPQDL+SSDTSLKLG+                
Sbjct: 181 GVEPENLVVEDGQSSNSVTEACVSNSNGPPQDLDSSDTSLKLGL---------------- 240

Query: 319 LIFDITHYHTAINLSSPSLQSLFASGVWGFAFDWHNPSPKSM-------VMGLSYQSNRA 378
                                          +  HNPSP+S+       VMGL YQS+RA
Sbjct: 241 ------------------------------PYSGHNPSPRSLRFRGLFFVMGLGYQSSRA 300

Query: 379 GALAGVFVLLFPVFLPGLFSPFGHASPSTFSEWNAPKPRHLRLLKSALQRQSSKPDQSDL 438
           GAL G F+LLF VFLPGLFS  GHASP TFSEWN PKPRH RLLKSALQRQS KPDQSDL
Sbjct: 301 GALVGGFMLLFSVFLPGLFSHLGHASPFTFSEWNTPKPRHSRLLKSALQRQSPKPDQSDL 360

Query: 439 WAPLADEGWRPCVASSKASSLPGKSEGYIQVFLDGGLNQQRMGICDAVAVAKILNATLVI 498
           WAPL DEGWRPCV S K SSLPG+SEGYIQVFLDGGLNQQRMGICDAVAVAK+LNATLVI
Sbjct: 361 WAPLTDEGWRPCVDSLKDSSLPGESEGYIQVFLDGGLNQQRMGICDAVAVAKLLNATLVI 420

Query: 499 PHLEINPVWKDSSSFVDIFDVDHFINVLKDDISIVKELPADFSWSTREYYATAIRATRVK 558
           PHLEINPVWKDSSSFVDIFDVDHFINVLKDDISIVKELPA+FSWSTREYYATAIRATRVK
Sbjct: 421 PHLEINPVWKDSSSFVDIFDVDHFINVLKDDISIVKELPAEFSWSTREYYATAIRATRVK 480

Query: 559 TAPVHASANWYLDNVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRCKVNFQALTFVPH 618
           TAPVHASA WYLDNVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRCKVNFQALTFVPH
Sbjct: 481 TAPVHASAKWYLDNVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRCKVNFQALTFVPH 540

Query: 619 IRVLGDALISRLQYPLNKKESKVANYLSMTTDANEQGPLKFVVLHLRFDKDMAAHSACDF 678
           IR LGDALISRL+YP N+K+SK ANYLS+TTDAN QGP+KFVVLHLRFDKDMAAHSACDF
Sbjct: 541 IRALGDALISRLRYPSNRKDSKEANYLSLTTDANVQGPMKFVVLHLRFDKDMAAHSACDF 600

Query: 679 GGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLLLAAFGFDNNTRL 738
           GGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLLLAA GFDNNTRL
Sbjct: 601 GGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLLLAALGFDNNTRL 660

Query: 739 YLASHKVYGGEARISTLRSLFPLMEDKKSLTSGNELAQIKGKASLLAAVDYYVSMHSDIF 798
           YLASHKVYGG ARISTLRSLFPLMEDKKSLTSGNELAQIKGKASLLAAVDYYVSMHSDIF
Sbjct: 661 YLASHKVYGGVARISTLRSLFPLMEDKKSLTSGNELAQIKGKASLLAAVDYYVSMHSDIF 720

Query: 799 ISASPGNMHNAMVGHRTYENLKTIRPNMALLGQLFMNKSIIWSDFQEATVEGHKNRQGQI 858
           ISASPGNMHNA+VGHRTYENLKTIRPNMALLGQLFMNKSIIWSDFQ+ATVEGH NRQGQI
Sbjct: 721 ISASPGNMHNAIVGHRTYENLKTIRPNMALLGQLFMNKSIIWSDFQQATVEGHLNRQGQI 754

Query: 859 RLRKPKQSIYTYPAPDCVCHA 873
           RLRKPKQSIYTYPAPDCVCHA
Sbjct: 781 RLRKPKQSIYTYPAPDCVCHA 754
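
The percentages in the HSP header for this match follow directly from the reported counts over the alignment length. A minimal sketch of that arithmetic (the helper name `percent` is hypothetical, and rounding to two decimals is assumed from the displayed values):

```python
def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage string with two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return f"{100.0 * matches / aligned_length:.2f}"


if __name__ == "__main__":
    print("Identity:", percent(690, 801))   # counts from the HSP header
    print("Positives:", percent(716, 801))
```

The alignment length (801) counts gap columns as well, which is why it differs from the number of query residues aligned.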

BLAST of HG10016573 vs. NCBI nr
Match: KAG6594905.1 (O-fucosyltransferase 31, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1312.7 bits (3396), Expect = 0.0e+00
Identity = 683/796 (85.80%), Positives = 710/796 (89.20%), Query Frame = 0

Query: 79  MAKEKIQIRKIDNATARQVTFSKRRRGLFKKAKELSVLCDADVALIIFSATGKLFEYSSS 138
           MAKEKIQIRKIDNATARQVTFSKRRRGLFKKAKELSVLCDADVALIIFS TGKLFEYSSS
Sbjct: 1   MAKEKIQIRKIDNATARQVTFSKRRRGLFKKAKELSVLCDADVALIIFSTTGKLFEYSSS 60

Query: 139 SMKGIIERHNLHSKNLQKLEQPSLELQLVENSNYTRLNKEIADKTHQLRQMRGEELQTLN 198
           SMKGIIERHNLHSKNLQKLEQPSLELQLVENSNYTRLNKEIA+KTHQLRQMRGEELQTLN
Sbjct: 61  SMKGIIERHNLHSKNLQKLEQPSLELQLVENSNYTRLNKEIAEKTHQLRQMRGEELQTLN 120

Query: 199 IEELQQLEKSLESGLSRVMEKKGERIMKEITDLQRKSAELMDENKRLKQQAEKMNAVRHL 258
           I+ELQQLEKSLE GLSRVMEKKGE+IMKEITDLQRKSAEL++ENKRLK+QAEKM+ VR+ 
Sbjct: 121 IDELQQLEKSLEFGLSRVMEKKGEKIMKEITDLQRKSAELVEENKRLKKQAEKMDGVRNF 180

Query: 259 GVEPEILVVEDGQSSNSVTEACVSNSNGPPQDLESSDTSLKLGIQ---ECLNLLEHRHID 318
           GVEPEILVVEDGQSSNSVTEACV+NSNGP QDLESSDTSLKLG+     C   +   H  
Sbjct: 181 GVEPEILVVEDGQSSNSVTEACVTNSNGPAQDLESSDTSLKLGLAYSGHCRKPIVGAHSS 240

Query: 319 AFTLIFDITHYHTAINLSSPSLQSLFASGVWGFAFDWHNPSPKSMVMGLSYQSNRAGALA 378
                 ++ H   A                                     +SNRAGALA
Sbjct: 241 DIDESNEVDHQAEA-------------------------------------ESNRAGALA 300

Query: 379 GVFVLLFPVFLPGLFSPFGHASPSTFSEWNAPKPRHLRLLKSALQRQSSKPDQSDLWAPL 438
           GVFVLLFPVFLPGLFSP GHASPSTFSEWN PKPRH RLLKSALQR+SSK DQSDLWAPL
Sbjct: 301 GVFVLLFPVFLPGLFSPLGHASPSTFSEWNTPKPRHSRLLKSALQRRSSKLDQSDLWAPL 360

Query: 439 ADEGWRPCVASSKASSLPGKSEGYIQVFLDGGLNQQRMGICDAVAVAKILNATLVIPHLE 498
           ADEGW+PCVA SKASS+P KSEGYIQVFLDGGLNQQRMGICDAVAVAKILNATLVIP+LE
Sbjct: 361 ADEGWKPCVA-SKASSVPWKSEGYIQVFLDGGLNQQRMGICDAVAVAKILNATLVIPYLE 420

Query: 499 INPVWKDSSSFVDIFDVDHFINVLKDDISIVKELPADFSWSTREYYATAIRATRVKTAPV 558
            NPVWKDSSSF+DIFDVDHFINVLKDDISIVKELPA+FSWSTREYYATAIRATRVKTAPV
Sbjct: 421 TNPVWKDSSSFMDIFDVDHFINVLKDDISIVKELPAEFSWSTREYYATAIRATRVKTAPV 480

Query: 559 HASANWYLDNVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRCKVNFQALTFVPHIRVL 618
           HASANWYLDNVLPVLQSYGIAAIAPFSHRL FENLPDEIQRLRCKVNFQAL FVPHIR L
Sbjct: 481 HASANWYLDNVLPVLQSYGIAAIAPFSHRLTFENLPDEIQRLRCKVNFQALFFVPHIRAL 540

Query: 619 GDALISRLQYPLNKKESKVANYLSMTTDANEQGPLKFVVLHLRFDKDMAAHSACDFGGGK 678
           GDALI RL+YP NK  +K AN+ SMTT AN+QGPLKFVVLHLRFDKDMAAHSACDFGGGK
Sbjct: 541 GDALIGRLRYPSNKSGAKEANHPSMTTVANDQGPLKFVVLHLRFDKDMAAHSACDFGGGK 600

Query: 679 AEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLLLAAFGFDNNTRLYLAS 738
           AEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLLLAA GFDNNTRLYLAS
Sbjct: 601 AEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLLLAASGFDNNTRLYLAS 660

Query: 739 HKVYGGEARISTLRSLFPLMEDKKSLTSGNELAQIKGKASLLAAVDYYVSMHSDIFISAS 798
           HKVYGGEARISTLRSLFPLMEDKKSLTSG+ELAQIKGKASLLAAVDYYVS+HSDIFISAS
Sbjct: 661 HKVYGGEARISTLRSLFPLMEDKKSLTSGSELAQIKGKASLLAAVDYYVSLHSDIFISAS 720

Query: 799 PGNMHNAMVGHRTYENLKTIRPNMALLGQLFMNKSIIWSDFQEATVEGHKNRQGQIRLRK 858
           PGNMHNAMVGHRTYENLKTIRPNMAL GQLFMNKSIIWSDFQ+A VEGHKNR GQIRLRK
Sbjct: 721 PGNMHNAMVGHRTYENLKTIRPNMALSGQLFMNKSIIWSDFQQAIVEGHKNRLGQIRLRK 758

Query: 859 PKQSIYTYPAPDCVCH 872
           PKQSIYTYPAPDCVCH
Sbjct: 781 PKQSIYTYPAPDCVCH 758

BLAST of HG10016573 vs. NCBI nr
Match: XP_038881929.1 (O-fucosyltransferase 39 [Benincasa hispida])

HSP 1 Score: 1005.7 bits (2599), Expect = 2.4e-289
Identity = 494/513 (96.30%), Positives = 501/513 (97.66%), Query Frame = 0

Query: 360 MVMGLSYQSNRAGALAGVFVLLFPVFLPGLFSPFGHASPSTFSEWNAPKPRHLRLLKSAL 419
           MVMGL Y SNR GA+AGVFVLLFPVFLPGLFSP GHASPSTFSEWN PKPRHLRLLKSAL
Sbjct: 1   MVMGLGYHSNRVGAVAGVFVLLFPVFLPGLFSPLGHASPSTFSEWNTPKPRHLRLLKSAL 60

Query: 420 QRQSSKPDQSDLWAPLADEGWRPCVASSKASSLPGKSEGYIQVFLDGGLNQQRMGICDAV 479
           QRQSSKPDQSDLWAPLADEGWRPCV SSKASSLPGKSEGYIQVFLDGGLNQQRMGICDAV
Sbjct: 61  QRQSSKPDQSDLWAPLADEGWRPCVDSSKASSLPGKSEGYIQVFLDGGLNQQRMGICDAV 120

Query: 480 AVAKILNATLVIPHLEINPVWKDSSSFVDIFDVDHFINVLKDDISIVKELPADFSWSTRE 539
           AVAKILNATLVIPHLEINPVWKDSSSF+DIFDVDHFI+VLK+DISIVKELPA+FSWSTRE
Sbjct: 121 AVAKILNATLVIPHLEINPVWKDSSSFIDIFDVDHFIDVLKNDISIVKELPAEFSWSTRE 180

Query: 540 YYATAIRATRVKTAPVHASANWYLDNVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRC 599
           YYATAIRATRVKTAPVHASANWYLDNVLPVLQSYGIAAIAPFSHRLAFENL DEIQRLRC
Sbjct: 181 YYATAIRATRVKTAPVHASANWYLDNVLPVLQSYGIAAIAPFSHRLAFENLSDEIQRLRC 240

Query: 600 KVNFQALTFVPHIRVLGDALISRLQYPLNKKESKVANYLSMTTDANEQGPLKFVVLHLRF 659
           KVNFQALTFVPHIRVLGD LI+RLQYPLNKKESK  NYLSMTTDANEQGPLKFVVLHLRF
Sbjct: 241 KVNFQALTFVPHIRVLGDTLINRLQYPLNKKESKEDNYLSMTTDANEQGPLKFVVLHLRF 300

Query: 660 DKDMAAHSACDFGGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLL 719
           DKDMAAHSACDFGGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLL
Sbjct: 301 DKDMAAHSACDFGGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLL 360

Query: 720 LAAFGFDNNTRLYLASHKVYGGEARISTLRSLFPLMEDKKSLTSGNELAQIKGKASLLAA 779
           LAA GFDNNTRLYLASHKVYGGEARISTLRSLFPLMEDKKSLTSGNELAQIKGKASLLAA
Sbjct: 361 LAALGFDNNTRLYLASHKVYGGEARISTLRSLFPLMEDKKSLTSGNELAQIKGKASLLAA 420

Query: 780 VDYYVSMHSDIFISASPGNMHNAMVGHRTYENLKTIRPNMALLGQLFMNKSIIWSDFQEA 839
           VDYYVSMHSDIFISASPGNMHNAMVGHRTY NLKTIRPNMALLGQLFMNKSIIWSDFQ+A
Sbjct: 421 VDYYVSMHSDIFISASPGNMHNAMVGHRTYGNLKTIRPNMALLGQLFMNKSIIWSDFQQA 480

Query: 840 TVEGHKNRQGQIRLRKPKQSIYTYPAPDCVCHA 873
           TVEGHKNRQGQIRLRKPKQSIYTYPAPDCVCHA
Sbjct: 481 TVEGHKNRQGQIRLRKPKQSIYTYPAPDCVCHA 513

BLAST of HG10016573 vs. NCBI nr
Match: XP_004143441.1 (O-fucosyltransferase 31 [Cucumis sativus] >KGN48703.1 hypothetical protein Csa_003489 [Cucumis sativus])

HSP 1 Score: 1001.9 bits (2589), Expect = 3.4e-288
Identity = 492/513 (95.91%), Positives = 503/513 (98.05%), Query Frame = 0

Query: 360 MVMGLSYQSNRAGALAGVFVLLFPVFLPGLFSPFGHASPSTFSEWNAPKPRHLRLLKSAL 419
           MVMGLSYQSNRAGALAGVFVLLFPVFLPGLFSPFGHASPSTFSEWN PKPRHLRLLKSAL
Sbjct: 1   MVMGLSYQSNRAGALAGVFVLLFPVFLPGLFSPFGHASPSTFSEWNTPKPRHLRLLKSAL 60

Query: 420 QRQSSKPDQSDLWAPLADEGWRPCVASSKASSLPGKSEGYIQVFLDGGLNQQRMGICDAV 479
           QRQSSKPDQSDLWAPLADEGWRPCV SSKASSLP KSEGYIQVFLDGGLNQQRMGICDAV
Sbjct: 61  QRQSSKPDQSDLWAPLADEGWRPCVDSSKASSLPEKSEGYIQVFLDGGLNQQRMGICDAV 120

Query: 480 AVAKILNATLVIPHLEINPVWKDSSSFVDIFDVDHFINVLKDDISIVKELPADFSWSTRE 539
           AVAKILNATLVIPHLE+NPVWKDSSSFVDIFDVDHFINVLKDDISIVKELPA+FSWSTRE
Sbjct: 121 AVAKILNATLVIPHLEVNPVWKDSSSFVDIFDVDHFINVLKDDISIVKELPAEFSWSTRE 180

Query: 540 YYATAIRATRVKTAPVHASANWYLDNVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRC 599
           YYATAIRATRVKTAPVHASANWYLDNVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRC
Sbjct: 181 YYATAIRATRVKTAPVHASANWYLDNVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRC 240

Query: 600 KVNFQALTFVPHIRVLGDALISRLQYPLNKKESKVANYLSMTTDANEQGPLKFVVLHLRF 659
           KVNFQALTFVPHI+ LG+ALI+RL+YPLNKKES   NYLS+TTDANEQ PLKFVVLHLRF
Sbjct: 241 KVNFQALTFVPHIQELGEALINRLRYPLNKKESVGGNYLSLTTDANEQRPLKFVVLHLRF 300

Query: 660 DKDMAAHSACDFGGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLL 719
           DKDMAAHSACDFGGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLL
Sbjct: 301 DKDMAAHSACDFGGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLL 360

Query: 720 LAAFGFDNNTRLYLASHKVYGGEARISTLRSLFPLMEDKKSLTSGNELAQIKGKASLLAA 779
           +AA GFDNNTRLYLASHKVYGGEARISTLRSLFPLMEDKKSLTSG+ELAQIKGKASLLAA
Sbjct: 361 MAALGFDNNTRLYLASHKVYGGEARISTLRSLFPLMEDKKSLTSGSELAQIKGKASLLAA 420

Query: 780 VDYYVSMHSDIFISASPGNMHNAMVGHRTYENLKTIRPNMALLGQLFMNKSIIWSDFQEA 839
           VDYYVSM+SDIFISASPGNMHNAMVGHRTYENLKTIRPNMALLGQLFMNKSIIWSDF +A
Sbjct: 421 VDYYVSMYSDIFISASPGNMHNAMVGHRTYENLKTIRPNMALLGQLFMNKSIIWSDFHQA 480

Query: 840 TVEGHKNRQGQIRLRKPKQSIYTYPAPDCVCHA 873
           TVEGHKNRQGQIRLRKPKQSIYTYPAPDCVCHA
Sbjct: 481 TVEGHKNRQGQIRLRKPKQSIYTYPAPDCVCHA 513

BLAST of HG10016573 vs. NCBI nr
Match: XP_008440542.1 (PREDICTED: uncharacterized protein At1g04910 [Cucumis melo] >KAA0036344.1 O-fucosyltransferase family protein isoform 1 [Cucumis melo var. makuwa] >TYK12738.1 O-fucosyltransferase family protein isoform 1 [Cucumis melo var. makuwa])

HSP 1 Score: 995.3 bits (2572), Expect = 3.2e-286
Identity = 488/513 (95.13%), Positives = 499/513 (97.27%), Query Frame = 0

Query: 360 MVMGLSYQSNRAGALAGVFVLLFPVFLPGLFSPFGHASPSTFSEWNAPKPRHLRLLKSAL 419
           MVMGLSY SNR GALAGVFVLLFPVFLPGLFSPFGHASPSTFSEWN PKPRHLRLLKSAL
Sbjct: 1   MVMGLSYHSNRVGALAGVFVLLFPVFLPGLFSPFGHASPSTFSEWNTPKPRHLRLLKSAL 60

Query: 420 QRQSSKPDQSDLWAPLADEGWRPCVASSKASSLPGKSEGYIQVFLDGGLNQQRMGICDAV 479
           QR+SSKPDQSDLWAPLADEGWRPCV SSKASSLP KSEGYIQVFLDGGLNQQRMGICDAV
Sbjct: 61  QRRSSKPDQSDLWAPLADEGWRPCVDSSKASSLPEKSEGYIQVFLDGGLNQQRMGICDAV 120

Query: 480 AVAKILNATLVIPHLEINPVWKDSSSFVDIFDVDHFINVLKDDISIVKELPADFSWSTRE 539
           AVAKILNATLVIPHLE+NPVWKDSSSFVDIFDVDHFINVLKDDISIV+ELPA+FSWSTRE
Sbjct: 121 AVAKILNATLVIPHLEVNPVWKDSSSFVDIFDVDHFINVLKDDISIVQELPAEFSWSTRE 180

Query: 540 YYATAIRATRVKTAPVHASANWYLDNVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRC 599
           YYATAIRATRVKTAPVHASA WYL+NVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRC
Sbjct: 181 YYATAIRATRVKTAPVHASAKWYLENVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRC 240

Query: 600 KVNFQALTFVPHIRVLGDALISRLQYPLNKKESKVANYLSMTTDANEQGPLKFVVLHLRF 659
           KVNFQAL FVPHI+ LGDALI+RL+YPLNKKE    NYLSMTTDANEQ PLKFVVLHLRF
Sbjct: 241 KVNFQALNFVPHIQELGDALINRLRYPLNKKEPTEGNYLSMTTDANEQRPLKFVVLHLRF 300

Query: 660 DKDMAAHSACDFGGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLL 719
           DKDMAAHSACDFGGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLL
Sbjct: 301 DKDMAAHSACDFGGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLL 360

Query: 720 LAAFGFDNNTRLYLASHKVYGGEARISTLRSLFPLMEDKKSLTSGNELAQIKGKASLLAA 779
           +AA GFDNNTRLYLASHKVYGGEARISTLRSLFPLMEDKKSLTSG+ELAQIKGKASLLAA
Sbjct: 361 MAALGFDNNTRLYLASHKVYGGEARISTLRSLFPLMEDKKSLTSGSELAQIKGKASLLAA 420

Query: 780 VDYYVSMHSDIFISASPGNMHNAMVGHRTYENLKTIRPNMALLGQLFMNKSIIWSDFQEA 839
           VDYYVSMHSDIFISASPGNMHNAMVGHRTYENLKTIRPNMALLGQLFMNKSIIWSDFQ+A
Sbjct: 421 VDYYVSMHSDIFISASPGNMHNAMVGHRTYENLKTIRPNMALLGQLFMNKSIIWSDFQQA 480

Query: 840 TVEGHKNRQGQIRLRKPKQSIYTYPAPDCVCHA 873
           TVEGHKNRQGQIRLRKPKQSIYTYPAPDCVCHA
Sbjct: 481 TVEGHKNRQGQIRLRKPKQSIYTYPAPDCVCHA 513

BLAST of HG10016573 vs. ExPASy Swiss-Prot
Match: Q7Y030 (O-fucosyltransferase 31 OS=Arabidopsis thaliana OX=3702 GN=OFUT31 PE=2 SV=1)

HSP 1 Score: 773.9 bits (1997), Expect = 2.0e-222
Identity = 372/501 (74.25%), Positives = 427/501 (85.23%), Query Frame = 0

Query: 373 ALAGVFVLLFPVFLPGLFSPFGHASPSTFSEWNAPKPRHLRLLKSALQRQSSKPDQSDLW 432
           ALAGVFVLLFP+  P LFSP G ASPS FSEWNAP+PRHL LL+ AL RQ S   Q +LW
Sbjct: 18  ALAGVFVLLFPILYPNLFSPLGRASPSLFSEWNAPRPRHLSLLQGALDRQISIRQQVELW 77

Query: 433 APLADEGWRPCVASSKASSLPGKSEGYIQVFLDGGLNQQRMGICDAVAVAKILNATLVIP 492
           +PLAD+GW+PC  S + +SLP KSEG++QVFLDGGLNQQRMGICDAVAVAKI+N TLVIP
Sbjct: 78  SPLADQGWKPCTESYRGASLPEKSEGFLQVFLDGGLNQQRMGICDAVAVAKIMNVTLVIP 137

Query: 493 HLEINPVWKDSSSFVDIFDVDHFINVLKDDISIVKELPADFSWSTREYYATAIRATRVKT 552
            LE+N VW+DSSSF DIFD+DHFI+VLKD++ IV+ELP  ++WSTR+YYAT IRATR+KT
Sbjct: 138 RLEVNTVWQDSSSFTDIFDLDHFISVLKDEVRIVRELPIQYAWSTRDYYATGIRATRIKT 197

Query: 553 APVHASANWYLDNVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRCKVNFQALTFVPHI 612
           APVHASA WYL+NVLP++QSYGIAA+APFSHRLAF+NLP+ IQRLRCKVNF+AL FVPHI
Sbjct: 198 APVHASAEWYLENVLPIIQSYGIAAVAPFSHRLAFDNLPESIQRLRCKVNFEALNFVPHI 257

Query: 613 RVLGDALISRLQYPLNKKESKVANYLSMTTDAN---EQGPLKFVVLHLRFDKDMAAHSAC 672
           R LGDAL+ RL+ P     S+ +  +  T   N   + G  KF VLHLRFDKDMAAHS C
Sbjct: 258 RELGDALVHRLRNP--PSSSQTSGTMDPTDRINTIVKAGAGKFAVLHLRFDKDMAAHSGC 317

Query: 673 DFGGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLLLAAFGFDNNT 732
           DF GGKAEKLALAKYRQV+WQGRVLNSQFTDEELR++GRCPLTPEEIGLLL+A GF NNT
Sbjct: 318 DFEGGKAEKLALAKYRQVIWQGRVLNSQFTDEELRNKGRCPLTPEEIGLLLSALGFSNNT 377

Query: 733 RLYLASHKVYGGEARISTLRSLFPLMEDKKSLTSGNELAQIKGKASLLAAVDYYVSMHSD 792
           RLYLASH+VYGGEARISTLR LFP +E+KKSL S  ELA ++GKASL+AAVDYYVSM SD
Sbjct: 378 RLYLASHQVYGGEARISTLRKLFPGIENKKSLASAEELADVQGKASLMAAVDYYVSMKSD 437

Query: 793 IFISASPGNMHNAMVGHRTYENLKTIRPNMALLGQLFMNKSIIWSDFQEATVEGHKNRQG 852
           IFISASPGNMHNA+  HR Y NLKTIRPNM LLGQ+F+NKS+ WS+F+ A + GHKNRQG
Sbjct: 438 IFISASPGNMHNALQAHRAYLNLKTIRPNMILLGQVFVNKSLDWSEFEGAVMNGHKNRQG 497

Query: 853 QIRLRKPKQSIYTYPAPDCVC 871
           Q+RLRK KQSIYTYPAPDC+C
Sbjct: 498 QLRLRKQKQSIYTYPAPDCMC 516

BLAST of HG10016573 vs. ExPASy Swiss-Prot
Match: Q0WUZ5 (O-fucosyltransferase 39 OS=Arabidopsis thaliana OX=3702 GN=OFUT39 PE=2 SV=1)

HSP 1 Score: 770.0 bits (1987), Expect = 2.9e-221
Identity = 375/479 (78.29%), Positives = 418/479 (87.27%), Query Frame = 0

Query: 396 ASPSTFSEWNAPKPRHLRLLKSALQRQSSKPDQSDLWAPLADEGWRPCVASSKASSLPGK 455
           ++ S+ SE    KPRHL LLKSALQR S   +QSDLW PL D+GW PC+    + SLP K
Sbjct: 27  STSSSSSEVITIKPRHLSLLKSALQRSSG--EQSDLWRPLTDQGWSPCIDLGNSPSLPDK 86

Query: 456 SEGYIQVFLDGGLNQQRMGICDAVAVAKILNATLVIPHLEINPVWKDSSSFVDIFDVDHF 515
           + GY+QVFLDGGLNQQRMGICDAVAVAKILNATLVIP+LE+NPVW+DSSSFVDIFDVDHF
Sbjct: 87  TAGYVQVFLDGGLNQQRMGICDAVAVAKILNATLVIPYLEVNPVWQDSSSFVDIFDVDHF 146

Query: 516 INVLKDDISIVKELPADFSWSTREYYATAIRATRVKTAPVHASANWYLDNVLPVLQSYGI 575
           I+ LKDDI +V+ELP ++SWSTREYY TA+R TRVKTAPVHASANWY++NV PVLQSYGI
Sbjct: 147 IDSLKDDIRVVRELPDEYSWSTREYYGTAVRETRVKTAPVHASANWYIENVSPVLQSYGI 206

Query: 576 AAIAPFSHRLAFENLPDEIQRLRCKVNFQALTFVPHIRVLGDALISRLQYP---LNKKES 635
           AAI+PFSHRL+F++LP EIQRLRCKVNFQAL FVPHI  LGDAL+SRL+ P    NK++ 
Sbjct: 207 AAISPFSHRLSFDHLPAEIQRLRCKVNFQALRFVPHITSLGDALVSRLRNPSWRSNKEQK 266

Query: 636 KVANYLSMTTDANEQGPLKFVVLHLRFDKDMAAHSACDFGGGKAEKLALAKYRQVLWQGR 695
            V +   MT     Q P KF VLHLRFDKDMAAHSACDFGGGKAEKL+LAKYRQ++WQGR
Sbjct: 267 NVDHLGDMTNPHRRQEPGKFAVLHLRFDKDMAAHSACDFGGGKAEKLSLAKYRQMIWQGR 326

Query: 696 VLNSQFTDEELRSQGRCPLTPEEIGLLLAAFGFDNNTRLYLASHKVYGGEARISTLRSLF 755
           VLNSQFTDEELRSQGRCPLTPEE+GLLLAAFGFDNNTRLYLASHKVYGGEARISTLR +F
Sbjct: 327 VLNSQFTDEELRSQGRCPLTPEEMGLLLAAFGFDNNTRLYLASHKVYGGEARISTLRQVF 386

Query: 756 PLMEDKKSLTSGNELAQIKGKASLLAAVDYYVSMHSDIFISASPGNMHNAMVGHRTYENL 815
           P MEDK+SL S  E A+IKGKASLLAA+DYYVSMHSDIFISASPGNMHNA+VGHRT+ENL
Sbjct: 387 PRMEDKRSLASSEERARIKGKASLLAALDYYVSMHSDIFISASPGNMHNALVGHRTFENL 446

Query: 816 KTIRPNMALLGQLFMNKSIIWSDFQEATVEGHKNRQGQIRLRKPKQSIYTYPAPDCVCH 872
           KTIRPNMAL+GQLF+NKSI W DFQ+A  EGH NRQGQIRLRKPKQSIYTYPAPDC+CH
Sbjct: 447 KTIRPNMALIGQLFLNKSITWVDFQQALGEGHVNRQGQIRLRKPKQSIYTYPAPDCMCH 503

BLAST of HG10016573 vs. ExPASy Swiss-Prot
Match: Q9LIN9 (Protein PECTIC ARABINOGALACTAN SYNTHESIS-RELATED OS=Arabidopsis thaliana OX=3702 GN=PAGR PE=2 SV=1)

HSP 1 Score: 325.9 bits (834), Expect = 1.4e-87
Identity = 175/448 (39.06%), Positives = 263/448 (58.71%), Query Frame = 0

Query: 436 ADEGWRPCVAS--SKASSLPGKSE--GYIQVFLDGGLNQQRMGICDAVAVAKILNATLVI 495
           A   W+PC        S LP ++E  GY+ +  +GGLNQQR+ IC+AVAVAKI+NATL++
Sbjct: 134 ATTSWKPCAERRIGGISDLPPENETNGYVFIHAEGGLNQQRIAICNAVAVAKIMNATLIL 193

Query: 496 PHLEINPVWKDSSSFVDIFDVDHFINVLKDDISIVKELPADFSWST-REYYATAIRATRV 555
           P L+ + +WKD++ F DIFDVDHFI+ LKDD+ IV+++P    W T +    ++IR T V
Sbjct: 194 PVLKQDQIWKDTTKFEDIFDVDHFIDYLKDDVRIVRDIP---DWFTDKAELFSSIRRT-V 253

Query: 556 KTAPVHASANWYLDNVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRCKVNFQALTFVP 615
           K  P +A+A +Y+DNVLP ++   I A+ PF  RL ++N+P EI RLRC+VN+ AL F+P
Sbjct: 254 KNIPKYAAAQFYIDNVLPRIKEKKIMALKPFVDRLGYDNVPQEINRLRCRVNYHALKFLP 313

Query: 616 HIRVLGDALISRLQYPLNKKESKVANYLSMTTDANEQG-PLKFVVLHLRFDKDMAAHSAC 675
            I  + D+L+SR++                    N  G P  ++ LHLRF+K M   S C
Sbjct: 314 EIEQMADSLVSRMR--------------------NRTGNPNPYMALHLRFEKGMVGLSFC 373

Query: 676 DFGGGKAEKLALAKYRQVLWQGRVLNSQFTDE---ELRSQGRCPLTPEEIGLLLAAFGFD 735
           DF G + EK+ +A+YRQ  W  R  N     +   + R +GRCPL P E+ ++L A G+ 
Sbjct: 374 DFVGTREEKVKMAEYRQKEWPRRFKNGSHLWQLALQKRKEGRCPLEPGEVAVILRAMGYP 433

Query: 736 NNTRLYLASHKVYGGEARISTLRSLFPLMEDKKSLTSGNELAQIKGKASLLAAVDYYVSM 795
             T++Y+AS +VYGG+ R++ LR++FP +  K+ L    EL   +   + LAA+D+ V +
Sbjct: 434 KETQIYVASGQVYGGQNRMAPLRNMFPNLVTKEDLAGKEELTTFRKHVTSLAALDFLVCL 493

Query: 796 HSDIFISASPGNMHNAMVGHRTY--ENLKTIRPNMALLGQLFMNKSIIWSDFQEATVEGH 855
            SD+F+    GN    ++G R Y     K+I+P+  L+ + F +  + W+ F E  V  H
Sbjct: 494 KSDVFVMTHGGNFAKLIIGARRYMGHRQKSIKPDKGLMSKSFGDPYMGWATFVEDVVVTH 553

Query: 856 KNRQGQIRLRKPKQSIYTYPAPDCVCHA 873
           + R G      P   ++  P   C+C A
Sbjct: 554 QTRTGLPEETFPNYDLWENPLTPCMCKA 557

BLAST of HG10016573 vs. ExPASy Swiss-Prot
Match: Q8H1E6 (O-fucosyltransferase 9 OS=Arabidopsis thaliana OX=3702 GN=OFUT9 PE=2 SV=1)

HSP 1 Score: 323.6 bits (828), Expect = 7.1e-87
Identity = 182/448 (40.62%), Positives = 260/448 (58.04%), Query Frame = 0

Query: 430 DLWAPLADEGWRPCVASSKASSLPGKSEGYIQVFLDGGLNQQRMGICDAVAVAKILNATL 489
           + W P     W+PC+ S+  S+    S GY  +  +GGLNQQR+ ICDAVAVA +LNATL
Sbjct: 134 EAWKPRVKSVWKPCI-STNVSAAGSNSNGYFIIEANGGLNQQRLSICDAVAVAGLLNATL 193

Query: 490 VIPHLEINPVWKDSSSFVDIFDVDHFINVLKDDISIVKELPADFSWSTREYYATAIRATR 549
           VIP   +N VW+DSS F DIFD D FI  L  ++++VKELP D       Y  ++I   R
Sbjct: 194 VIPIFHLNSVWRDSSKFGDIFDEDFFIYALSKNVNVVKELPKDV-LERYNYNISSIVNLR 253

Query: 550 VKTAPVHASANWYLDNVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRCKVNFQALTFV 609
           +K     +S  +YL  VLP L   G   +APFS+RLA   +P  IQ LRC  NF+AL F 
Sbjct: 254 LK---AWSSPAYYLQKVLPQLLRLGAVRVAPFSNRLA-HAVPAHIQGLRCLANFEALRFA 313

Query: 610 PHIRVLGDALISRLQYPLNKKESKVANYLSMTTDANEQGPLKFVVLHLRFDKDMAAHSAC 669
             IR+L + ++ R                 M T + E G  K+V +HLRF+ DM A S C
Sbjct: 314 EPIRLLAEKMVDR-----------------MVTKSVESGG-KYVSVHLRFEMDMVAFSCC 373

Query: 670 DFGGGKAEKLALAKYRQVLWQG--RVLNSQFTDEELRSQGRCPLTPEEIGLLLAAFGFDN 729
           ++  G+AEKL +   R+  W+G  R           R  G+CPLTP E+G++L   GF+N
Sbjct: 374 EYDFGQAEKLEMDMARERGWKGKFRRRGRVIRPGANRIDGKCPLTPLEVGMMLRGMGFNN 433

Query: 730 NTRLYLASHKVYGGEARISTLRSLFPLMEDKKSLTSGNELAQIKGKASLLAAVDYYVSMH 789
           +T +Y+A+  +Y  +  ++ LR +FPL++ K +L +  ELA  KG +S LAA+DY V +H
Sbjct: 434 STLVYVAAGNIYKADKYMAPLRQMFPLLQTKDTLATPEELAPFKGHSSRLAALDYTVCLH 493

Query: 790 SDIFISASPGNMHNAMVGHRTY---ENLKTIRPNMALLGQLFMNKSIIWSDFQEATVE-- 849
           S++F+S   GN  + ++GHR Y    + +TI+P+   L QL    SI W  F++   +  
Sbjct: 494 SEVFVSTQGGNFPHFLIGHRRYLYKGHAETIKPDKRKLVQLLDKPSIRWDYFKKQMQDML 553

Query: 850 GHKNRQGQIRLRKPKQSIYTYPAPDCVC 871
            H + +G + LRKP  S+YT+P PDC+C
Sbjct: 554 RHNDAKG-VELRKPAASLYTFPMPDCMC 556

BLAST of HG10016573 vs. ExPASy Swiss-Prot
Match: F4HZX7 (O-fucosyltransferase 8 OS=Arabidopsis thaliana OX=3702 GN=OFUT8 PE=2 SV=1)

HSP 1 Score: 311.6 bits (797), Expect = 2.8e-83
Identity = 183/482 (37.97%), Positives = 278/482 (57.68%), Query Frame = 0

Query: 410 RHLRLLKSALQRQSSKPDQSDLWAPLADEG--WRPCVASSKASSLPGK----SEGYIQVF 469
           R L L   +L +   KPD  +     + +   W+PC  ++KA+    +    S GYI V 
Sbjct: 134 RLLNLASDSLAKNEFKPDTPNFREERSSKSSQWKPCADNNKAAVALERSRELSNGYIMVS 193

Query: 470 LDGGLNQQRMGICDAVAVAKILNATLVIPHLEINPVWKDSSSFVDIFDVDHFINVLKDDI 529
            +GGLNQQR+ IC+AVAVA +LNATLV+P    + VWKD S F DI+  DHFI  LKD++
Sbjct: 194 ANGGLNQQRVAICNAVAVAALLNATLVLPRFLYSNVWKDPSQFGDIYQEDHFIEYLKDEV 253

Query: 530 SIVKELPADFSWSTREYYATAIRATRVKTA-PVHASANWYLDNVLPVLQSYGIAAIAPFS 589
           +IVK LP     +  +  +       VK A PV      Y+++VLP+L+ YG+  +  + 
Sbjct: 254 NIVKNLPQHLKSTDNKNLSLVTDTELVKEATPVD-----YIEHVLPLLKKYGMVHLFGYG 313

Query: 590 HRLAFENLPDEIQRLRCKVNFQALTFVPHIRVLGDALISRL-QYPLNK---KESKVANYL 649
           +RL F+ LP ++QRLRCK NF AL F P I+  G  L+ R+ ++  ++   +E+ +   +
Sbjct: 314 NRLGFDPLPFDVQRLRCKCNFHALKFAPKIQEAGSLLVKRIRRFKTSRSRLEEALLGESM 373

Query: 650 SMTTDANEQGPLKFVVLHLRFDKDMAAHSACDFGGGKAEKLALAKYRQ----VLWQGRVL 709
             +T   E+ PLK++ LHLRF++DM A+S CDFGGG+AE+  L  YR+    +L +    
Sbjct: 374 VKSTVKGEEEPLKYLALHLRFEEDMVAYSLCDFGGGEAERKELQAYREDHFPLLLKRLKK 433

Query: 710 NSQFTDEELRSQGRCPLTPEEIGLLLAAFGFDNNTRLYLASHKVYGGEARISTLRSLFPL 769
           +   + EELR  G+CPLTPEE  L+LA  GF   T +YLA  ++YGG +R+  L  L+P 
Sbjct: 434 SKPVSPEELRKTGKCPLTPEEATLVLAGLGFKRKTYIYLAGSQIYGGSSRMLPLTRLYPN 493

Query: 770 MEDKKSLTSGNELAQIKGKASLLAAVDYYVSMHSDIFISASPGNMHNAMV-GHRTY---E 829
           +  K++L +  ELA  K  +S LAA+D+   + SD+F     G+  +++V G R Y    
Sbjct: 494 IATKETLLTPQELAPFKNFSSQLAALDFIACIASDVFAMTDSGSQLSSLVSGFRNYYGNG 553

Query: 830 NLKTIRPNMALLGQLFM-NKSIIWSDFQEATVEGHKNRQGQIRLRKPKQSIYTYP-APDC 871
              T+RPN   L  +   +++I W  F++   +  +  Q ++R R   +SIY  P  P+C
Sbjct: 554 QAPTLRPNKKRLAAILSDSETIKWKIFEDRVRKMVEEGQ-KLRTRPYGRSIYRQPRCPEC 609

BLAST of HG10016573 vs. ExPASy TrEMBL
Match: A0A0A0KGN2 (O-fucosyltransferase family protein OS=Cucumis sativus OX=3659 GN=Csa_6G498990 PE=3 SV=1)

HSP 1 Score: 1001.9 bits (2589), Expect = 1.7e-288
Identity = 492/513 (95.91%), Positives = 503/513 (98.05%), Query Frame = 0

Query: 360 MVMGLSYQSNRAGALAGVFVLLFPVFLPGLFSPFGHASPSTFSEWNAPKPRHLRLLKSAL 419
           MVMGLSYQSNRAGALAGVFVLLFPVFLPGLFSPFGHASPSTFSEWN PKPRHLRLLKSAL
Sbjct: 1   MVMGLSYQSNRAGALAGVFVLLFPVFLPGLFSPFGHASPSTFSEWNTPKPRHLRLLKSAL 60

Query: 420 QRQSSKPDQSDLWAPLADEGWRPCVASSKASSLPGKSEGYIQVFLDGGLNQQRMGICDAV 479
           QRQSSKPDQSDLWAPLADEGWRPCV SSKASSLP KSEGYIQVFLDGGLNQQRMGICDAV
Sbjct: 61  QRQSSKPDQSDLWAPLADEGWRPCVDSSKASSLPEKSEGYIQVFLDGGLNQQRMGICDAV 120

Query: 480 AVAKILNATLVIPHLEINPVWKDSSSFVDIFDVDHFINVLKDDISIVKELPADFSWSTRE 539
           AVAKILNATLVIPHLE+NPVWKDSSSFVDIFDVDHFINVLKDDISIVKELPA+FSWSTRE
Sbjct: 121 AVAKILNATLVIPHLEVNPVWKDSSSFVDIFDVDHFINVLKDDISIVKELPAEFSWSTRE 180

Query: 540 YYATAIRATRVKTAPVHASANWYLDNVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRC 599
           YYATAIRATRVKTAPVHASANWYLDNVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRC
Sbjct: 181 YYATAIRATRVKTAPVHASANWYLDNVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRC 240

Query: 600 KVNFQALTFVPHIRVLGDALISRLQYPLNKKESKVANYLSMTTDANEQGPLKFVVLHLRF 659
           KVNFQALTFVPHI+ LG+ALI+RL+YPLNKKES   NYLS+TTDANEQ PLKFVVLHLRF
Sbjct: 241 KVNFQALTFVPHIQELGEALINRLRYPLNKKESVGGNYLSLTTDANEQRPLKFVVLHLRF 300

Query: 660 DKDMAAHSACDFGGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLL 719
           DKDMAAHSACDFGGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLL
Sbjct: 301 DKDMAAHSACDFGGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLL 360

Query: 720 LAAFGFDNNTRLYLASHKVYGGEARISTLRSLFPLMEDKKSLTSGNELAQIKGKASLLAA 779
           +AA GFDNNTRLYLASHKVYGGEARISTLRSLFPLMEDKKSLTSG+ELAQIKGKASLLAA
Sbjct: 361 MAALGFDNNTRLYLASHKVYGGEARISTLRSLFPLMEDKKSLTSGSELAQIKGKASLLAA 420

Query: 780 VDYYVSMHSDIFISASPGNMHNAMVGHRTYENLKTIRPNMALLGQLFMNKSIIWSDFQEA 839
           VDYYVSM+SDIFISASPGNMHNAMVGHRTYENLKTIRPNMALLGQLFMNKSIIWSDF +A
Sbjct: 421 VDYYVSMYSDIFISASPGNMHNAMVGHRTYENLKTIRPNMALLGQLFMNKSIIWSDFHQA 480

Query: 840 TVEGHKNRQGQIRLRKPKQSIYTYPAPDCVCHA 873
           TVEGHKNRQGQIRLRKPKQSIYTYPAPDCVCHA
Sbjct: 481 TVEGHKNRQGQIRLRKPKQSIYTYPAPDCVCHA 513

BLAST of HG10016573 vs. ExPASy TrEMBL
Match: A0A5A7SYI4 (O-fucosyltransferase family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G003260 PE=3 SV=1)

HSP 1 Score: 995.3 bits (2572), Expect = 1.5e-286
Identity = 488/513 (95.13%), Positives = 499/513 (97.27%), Query Frame = 0

Query: 360 MVMGLSYQSNRAGALAGVFVLLFPVFLPGLFSPFGHASPSTFSEWNAPKPRHLRLLKSAL 419
           MVMGLSY SNR GALAGVFVLLFPVFLPGLFSPFGHASPSTFSEWN PKPRHLRLLKSAL
Sbjct: 1   MVMGLSYHSNRVGALAGVFVLLFPVFLPGLFSPFGHASPSTFSEWNTPKPRHLRLLKSAL 60

Query: 420 QRQSSKPDQSDLWAPLADEGWRPCVASSKASSLPGKSEGYIQVFLDGGLNQQRMGICDAV 479
           QR+SSKPDQSDLWAPLADEGWRPCV SSKASSLP KSEGYIQVFLDGGLNQQRMGICDAV
Sbjct: 61  QRRSSKPDQSDLWAPLADEGWRPCVDSSKASSLPEKSEGYIQVFLDGGLNQQRMGICDAV 120

Query: 480 AVAKILNATLVIPHLEINPVWKDSSSFVDIFDVDHFINVLKDDISIVKELPADFSWSTRE 539
           AVAKILNATLVIPHLE+NPVWKDSSSFVDIFDVDHFINVLKDDISIV+ELPA+FSWSTRE
Sbjct: 121 AVAKILNATLVIPHLEVNPVWKDSSSFVDIFDVDHFINVLKDDISIVQELPAEFSWSTRE 180

Query: 540 YYATAIRATRVKTAPVHASANWYLDNVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRC 599
           YYATAIRATRVKTAPVHASA WYL+NVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRC
Sbjct: 181 YYATAIRATRVKTAPVHASAKWYLENVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRC 240

Query: 600 KVNFQALTFVPHIRVLGDALISRLQYPLNKKESKVANYLSMTTDANEQGPLKFVVLHLRF 659
           KVNFQAL FVPHI+ LGDALI+RL+YPLNKKE    NYLSMTTDANEQ PLKFVVLHLRF
Sbjct: 241 KVNFQALNFVPHIQELGDALINRLRYPLNKKEPTEGNYLSMTTDANEQRPLKFVVLHLRF 300

Query: 660 DKDMAAHSACDFGGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLL 719
           DKDMAAHSACDFGGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLL
Sbjct: 301 DKDMAAHSACDFGGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLL 360

Query: 720 LAAFGFDNNTRLYLASHKVYGGEARISTLRSLFPLMEDKKSLTSGNELAQIKGKASLLAA 779
           +AA GFDNNTRLYLASHKVYGGEARISTLRSLFPLMEDKKSLTSG+ELAQIKGKASLLAA
Sbjct: 361 MAALGFDNNTRLYLASHKVYGGEARISTLRSLFPLMEDKKSLTSGSELAQIKGKASLLAA 420

Query: 780 VDYYVSMHSDIFISASPGNMHNAMVGHRTYENLKTIRPNMALLGQLFMNKSIIWSDFQEA 839
           VDYYVSMHSDIFISASPGNMHNAMVGHRTYENLKTIRPNMALLGQLFMNKSIIWSDFQ+A
Sbjct: 421 VDYYVSMHSDIFISASPGNMHNAMVGHRTYENLKTIRPNMALLGQLFMNKSIIWSDFQQA 480

Query: 840 TVEGHKNRQGQIRLRKPKQSIYTYPAPDCVCHA 873
           TVEGHKNRQGQIRLRKPKQSIYTYPAPDCVCHA
Sbjct: 481 TVEGHKNRQGQIRLRKPKQSIYTYPAPDCVCHA 513

BLAST of HG10016573 vs. ExPASy TrEMBL
Match: A0A1S3B0Y1 (O-fucosyltransferase family protein OS=Cucumis melo OX=3656 GN=LOC103484932 PE=3 SV=1)

HSP 1 Score: 995.3 bits (2572), Expect = 1.5e-286
Identity = 488/513 (95.13%), Positives = 499/513 (97.27%), Query Frame = 0

Query: 360 MVMGLSYQSNRAGALAGVFVLLFPVFLPGLFSPFGHASPSTFSEWNAPKPRHLRLLKSAL 419
           MVMGLSY SNR GALAGVFVLLFPVFLPGLFSPFGHASPSTFSEWN PKPRHLRLLKSAL
Sbjct: 1   MVMGLSYHSNRVGALAGVFVLLFPVFLPGLFSPFGHASPSTFSEWNTPKPRHLRLLKSAL 60

Query: 420 QRQSSKPDQSDLWAPLADEGWRPCVASSKASSLPGKSEGYIQVFLDGGLNQQRMGICDAV 479
           QR+SSKPDQSDLWAPLADEGWRPCV SSKASSLP KSEGYIQVFLDGGLNQQRMGICDAV
Sbjct: 61  QRRSSKPDQSDLWAPLADEGWRPCVDSSKASSLPEKSEGYIQVFLDGGLNQQRMGICDAV 120

Query: 480 AVAKILNATLVIPHLEINPVWKDSSSFVDIFDVDHFINVLKDDISIVKELPADFSWSTRE 539
           AVAKILNATLVIPHLE+NPVWKDSSSFVDIFDVDHFINVLKDDISIV+ELPA+FSWSTRE
Sbjct: 121 AVAKILNATLVIPHLEVNPVWKDSSSFVDIFDVDHFINVLKDDISIVQELPAEFSWSTRE 180

Query: 540 YYATAIRATRVKTAPVHASANWYLDNVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRC 599
           YYATAIRATRVKTAPVHASA WYL+NVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRC
Sbjct: 181 YYATAIRATRVKTAPVHASAKWYLENVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRC 240

Query: 600 KVNFQALTFVPHIRVLGDALISRLQYPLNKKESKVANYLSMTTDANEQGPLKFVVLHLRF 659
           KVNFQAL FVPHI+ LGDALI+RL+YPLNKKE    NYLSMTTDANEQ PLKFVVLHLRF
Sbjct: 241 KVNFQALNFVPHIQELGDALINRLRYPLNKKEPTEGNYLSMTTDANEQRPLKFVVLHLRF 300

Query: 660 DKDMAAHSACDFGGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLL 719
           DKDMAAHSACDFGGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLL
Sbjct: 301 DKDMAAHSACDFGGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLL 360

Query: 720 LAAFGFDNNTRLYLASHKVYGGEARISTLRSLFPLMEDKKSLTSGNELAQIKGKASLLAA 779
           +AA GFDNNTRLYLASHKVYGGEARISTLRSLFPLMEDKKSLTSG+ELAQIKGKASLLAA
Sbjct: 361 MAALGFDNNTRLYLASHKVYGGEARISTLRSLFPLMEDKKSLTSGSELAQIKGKASLLAA 420

Query: 780 VDYYVSMHSDIFISASPGNMHNAMVGHRTYENLKTIRPNMALLGQLFMNKSIIWSDFQEA 839
           VDYYVSMHSDIFISASPGNMHNAMVGHRTYENLKTIRPNMALLGQLFMNKSIIWSDFQ+A
Sbjct: 421 VDYYVSMHSDIFISASPGNMHNAMVGHRTYENLKTIRPNMALLGQLFMNKSIIWSDFQQA 480

Query: 840 TVEGHKNRQGQIRLRKPKQSIYTYPAPDCVCHA 873
           TVEGHKNRQGQIRLRKPKQSIYTYPAPDCVCHA
Sbjct: 481 TVEGHKNRQGQIRLRKPKQSIYTYPAPDCVCHA 513

BLAST of HG10016573 vs. ExPASy TrEMBL
Match: M4DBF2 (O-fucosyltransferase family protein OS=Brassica rapa subsp. pekinensis OX=51351 PE=3 SV=1)

HSP 1 Score: 974.5 bits (2518), Expect = 2.8e-280
Identity = 504/802 (62.84%), Positives = 613/802 (76.43%), Query Frame = 0

Query: 79  MAKEKIQIRKIDNATARQVTFSKRRRGLFKKAKELSVLCDADVALIIFSATGKLFEYSSS 138
           MA+EKI+I+KIDN TARQVTFSKRRRG+ KKA ELS+LCDADVALIIFSATGKLFE+SSS
Sbjct: 1   MAREKIRIKKIDNLTARQVTFSKRRRGIIKKANELSILCDADVALIIFSATGKLFEFSSS 60

Query: 139 SMKGIIERHNLHSKNLQKLEQPSLELQLVENSNYTRLNKEIADKTHQLRQMRGEELQTLN 198
           SM+ I+ R+NLH+ N+ K+  P      ++N N +RL+KE+ DKT QLRQMRG +L+ LN
Sbjct: 61  SMRDILGRYNLHASNINKMMGPPSPYHQLDNCNLSRLSKEVEDKTKQLRQMRGGDLEGLN 120

Query: 199 IEELQQLEKSLESGLSRVMEKKGERIMKEITDLQRKSAELMDENKRLKQQAEKMNAVRHL 258
           +EELQ+LEKSLESGLSRV EKKGE +M +I+ L+++ +EL+DEN+RL++Q   +   + +
Sbjct: 121 LEELQRLEKSLESGLSRVSEKKGECVMSQISSLEKRGSELVDENRRLREQLVTLEMAKTM 180

Query: 259 GVEPEILVVEDGQSSNSVTEACVSNSNGPPQDLESSDTSLKLGIQECLNLLEHRHIDA-- 318
            ++  +      ++ ++ T     +S  P +D + SDTSLKLG +     +EH  +    
Sbjct: 181 ALKEAV------ETESATTNVSSYDSAAPIED-DFSDTSLKLGKRLKTPEIEHTCVCVYL 240

Query: 319 ---FTLIFDITHYHTAI--NLSSPSLQSLFASGVWGFAFDWHNPSPKSMVMGLSYQSNRA 378
              F   + +   HT    ++  P L   F   +     D    S   M    S   ++ 
Sbjct: 241 HLRFAYKYQLLCLHTLRFGSIFPPILSPEFRIKL----LDVIVESILQMKQLQSLNHSQR 300

Query: 379 GALAGVFVLLFPVFLPGLFSPFGHASPSTFSEWNAPKPRHLRLLKSALQRQSSKPDQSDL 438
            ALAGV VLLFP+F P LF P G ASPS FSEWNAP+PRHLRLL+ AL RQ S   Q +L
Sbjct: 301 IALAGVLVLLFPIFSPNLFRPLGRASPSLFSEWNAPRPRHLRLLEGALHRQISIRQQVEL 360

Query: 439 WAPLADEGWRPCVASSKASSLPGKSEGYIQVFLDGGLNQQRMGICDAVAVAKILNATLVI 498
           W+PL D+ W+PC  S   S LP KS+G++QVFLDGGLNQQRMGICDAVAVAKILN TLVI
Sbjct: 361 WSPLPDQSWKPCTQSFTGSPLPEKSQGFLQVFLDGGLNQQRMGICDAVAVAKILNVTLVI 420

Query: 499 PHLEINPVWKDSSSFVDIFDVDHFINVLKDDISIVKELPADFSWSTREYYATAIRATRVK 558
           P LE+NPVW+DSSSF DIFDVDHFI VLKD++ IV+ELP  ++WSTR+YYAT IRATR+K
Sbjct: 421 PRLEVNPVWQDSSSFADIFDVDHFITVLKDEVRIVRELPTQYAWSTRDYYATGIRATRIK 480

Query: 559 TAPVHASANWYLDNVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRCKVNFQALTFVPH 618
           TAP HASA WY++NVLPV+QSYGIAA+APFSHRLAF+N+P+ IQRLRCKVNF+AL FVP 
Sbjct: 481 TAPTHASAEWYVENVLPVIQSYGIAAVAPFSHRLAFDNVPESIQRLRCKVNFEALNFVPR 540

Query: 619 IRVLGDALISRLQYPLNKK-ESKVANYLSMTTDANEQGPLKFVVLHLRFDKDMAAHSACD 678
           IR LGDA++ RL+ P +    S   +         + G  KFVVLHLRFDKDMAAHS CD
Sbjct: 541 IRELGDAVVHRLRNPPSSSITSGATDPTERVNTIAKSGAGKFVVLHLRFDKDMAAHSGCD 600

Query: 679 FGGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLLLAAFGFDNNTR 738
           FGGGKAEKLALAKYRQV+WQGRVLNSQFTDEELR++GRCPLTPEEIGLLL+A GF NNTR
Sbjct: 601 FGGGKAEKLALAKYRQVIWQGRVLNSQFTDEELRNKGRCPLTPEEIGLLLSALGFTNNTR 660

Query: 739 LYLASHKVYGGEARISTLRSLFPLMEDKKSLTSGNELAQIKGKASLLAAVDYYVSMHSDI 798
           LYLASH+VYGGEARISTLR LFP++E+KKSL S  ELA+++GKASL+AAVDYYVSM SDI
Sbjct: 661 LYLASHQVYGGEARISTLRKLFPVLENKKSLASAEELAEVEGKASLMAAVDYYVSMKSDI 720

Query: 799 FISASPGNMHNAMVGHRTYENLKTIRPNMALLGQLFMNKSIIWSDFQEATVEGHKNRQGQ 858
           FISASPGNMHNA++ HR Y NLKTI PNM LLGQ+ +NKS+ WS+F+ A V GHKNRQGQ
Sbjct: 721 FISASPGNMHNALLAHRAYLNLKTINPNMILLGQVLVNKSLGWSEFEGAVVNGHKNRQGQ 780

Query: 859 IRLRKPKQSIYTYPAPDCVCHA 873
           +RLRK KQSIYTYPAPDC+C A
Sbjct: 781 LRLRKQKQSIYTYPAPDCMCKA 791

BLAST of HG10016573 vs. ExPASy TrEMBL
Match: A0A6J1IP37 (O-fucosyltransferase family protein OS=Cucurbita maxima OX=3661 GN=LOC111478666 PE=3 SV=1)

HSP 1 Score: 973.8 bits (2516), Expect = 4.8e-280
Identity = 479/511 (93.74%), Positives = 490/511 (95.89%), Query Frame = 0

Query: 362 MGLSYQSNRAGALAGVFVLLFPVFLPGLFSPFGHASPSTFSEWNAPKPRHLRLLKSALQR 421
           MGLSYQS+RAGAL G F+LLF VFLPGLFS  GHASPSTFSEWN PKPRH RLLKSALQR
Sbjct: 1   MGLSYQSSRAGALVGGFMLLFSVFLPGLFSHLGHASPSTFSEWNTPKPRHSRLLKSALQR 60

Query: 422 QSSKPDQSDLWAPLADEGWRPCVASSKASSLPGKSEGYIQVFLDGGLNQQRMGICDAVAV 481
           QS  PDQSDLWAPL DEGWRPCV SSK SSLPG+SEGYIQVFLDGGLNQQRMGICDAVAV
Sbjct: 61  QSPIPDQSDLWAPLTDEGWRPCVDSSKDSSLPGESEGYIQVFLDGGLNQQRMGICDAVAV 120

Query: 482 AKILNATLVIPHLEINPVWKDSSSFVDIFDVDHFINVLKDDISIVKELPADFSWSTREYY 541
           AK LNATLVIPHLEINPVWKDSSSFVDIFDVDHFINVLKDDISIVKELPA+FSWSTREYY
Sbjct: 121 AKFLNATLVIPHLEINPVWKDSSSFVDIFDVDHFINVLKDDISIVKELPAEFSWSTREYY 180

Query: 542 ATAIRATRVKTAPVHASANWYLDNVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRCKV 601
           ATAIRATRVKTAPVHASA WYLDNVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRCKV
Sbjct: 181 ATAIRATRVKTAPVHASAKWYLDNVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRCKV 240

Query: 602 NFQALTFVPHIRVLGDALISRLQYPLNKKESKVANYLSMTTDANEQGPLKFVVLHLRFDK 661
           NFQALTFVPHIR LGDALISRL+YP N+K+SK ANYLS+TTDAN QGP+KFVVLHLRFDK
Sbjct: 241 NFQALTFVPHIRALGDALISRLRYPSNRKDSKEANYLSLTTDANVQGPMKFVVLHLRFDK 300

Query: 662 DMAAHSACDFGGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLLLA 721
           DMAAHSACDFGGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLLLA
Sbjct: 301 DMAAHSACDFGGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLLLA 360

Query: 722 AFGFDNNTRLYLASHKVYGGEARISTLRSLFPLMEDKKSLTSGNELAQIKGKASLLAAVD 781
           A GFDNNTRLYLASHKVYGG ARISTLRSLFPLMEDKKSLTSGNELAQIKGKASLLAAVD
Sbjct: 361 ALGFDNNTRLYLASHKVYGGVARISTLRSLFPLMEDKKSLTSGNELAQIKGKASLLAAVD 420

Query: 782 YYVSMHSDIFISASPGNMHNAMVGHRTYENLKTIRPNMALLGQLFMNKSIIWSDFQEATV 841
           YYVSMHSDIFISASPGNMHNA+VGHRTYENLKTIRPNMALLGQLFMNKSIIWSDFQ+ATV
Sbjct: 421 YYVSMHSDIFISASPGNMHNAIVGHRTYENLKTIRPNMALLGQLFMNKSIIWSDFQQATV 480

Query: 842 EGHKNRQGQIRLRKPKQSIYTYPAPDCVCHA 873
           EGH NRQGQIRLRKPKQSIYTYPAPDCVCHA
Sbjct: 481 EGHLNRQGQIRLRKPKQSIYTYPAPDCVCHA 511

BLAST of HG10016573 vs. TAIR 10
Match: AT4G24530.1 (O-fucosyltransferase family protein)

HSP 1 Score: 773.9 bits (1997), Expect = 1.4e-223
Identity = 372/501 (74.25%), Positives = 427/501 (85.23%), Query Frame = 0

Query: 373 ALAGVFVLLFPVFLPGLFSPFGHASPSTFSEWNAPKPRHLRLLKSALQRQSSKPDQSDLW 432
           ALAGVFVLLFP+  P LFSP G ASPS FSEWNAP+PRHL LL+ AL RQ S   Q +LW
Sbjct: 18  ALAGVFVLLFPILYPNLFSPLGRASPSLFSEWNAPRPRHLSLLQGALDRQISIRQQVELW 77

Query: 433 APLADEGWRPCVASSKASSLPGKSEGYIQVFLDGGLNQQRMGICDAVAVAKILNATLVIP 492
           +PLAD+GW+PC  S + +SLP KSEG++QVFLDGGLNQQRMGICDAVAVAKI+N TLVIP
Sbjct: 78  SPLADQGWKPCTESYRGASLPEKSEGFLQVFLDGGLNQQRMGICDAVAVAKIMNVTLVIP 137

Query: 493 HLEINPVWKDSSSFVDIFDVDHFINVLKDDISIVKELPADFSWSTREYYATAIRATRVKT 552
            LE+N VW+DSSSF DIFD+DHFI+VLKD++ IV+ELP  ++WSTR+YYAT IRATR+KT
Sbjct: 138 RLEVNTVWQDSSSFTDIFDLDHFISVLKDEVRIVRELPIQYAWSTRDYYATGIRATRIKT 197

Query: 553 APVHASANWYLDNVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRCKVNFQALTFVPHI 612
           APVHASA WYL+NVLP++QSYGIAA+APFSHRLAF+NLP+ IQRLRCKVNF+AL FVPHI
Sbjct: 198 APVHASAEWYLENVLPIIQSYGIAAVAPFSHRLAFDNLPESIQRLRCKVNFEALNFVPHI 257

Query: 613 RVLGDALISRLQYPLNKKESKVANYLSMTTDAN---EQGPLKFVVLHLRFDKDMAAHSAC 672
           R LGDAL+ RL+ P     S+ +  +  T   N   + G  KF VLHLRFDKDMAAHS C
Sbjct: 258 RELGDALVHRLRNP--PSSSQTSGTMDPTDRINTIVKAGAGKFAVLHLRFDKDMAAHSGC 317

Query: 673 DFGGGKAEKLALAKYRQVLWQGRVLNSQFTDEELRSQGRCPLTPEEIGLLLAAFGFDNNT 732
           DF GGKAEKLALAKYRQV+WQGRVLNSQFTDEELR++GRCPLTPEEIGLLL+A GF NNT
Sbjct: 318 DFEGGKAEKLALAKYRQVIWQGRVLNSQFTDEELRNKGRCPLTPEEIGLLLSALGFSNNT 377

Query: 733 RLYLASHKVYGGEARISTLRSLFPLMEDKKSLTSGNELAQIKGKASLLAAVDYYVSMHSD 792
           RLYLASH+VYGGEARISTLR LFP +E+KKSL S  ELA ++GKASL+AAVDYYVSM SD
Sbjct: 378 RLYLASHQVYGGEARISTLRKLFPGIENKKSLASAEELADVQGKASLMAAVDYYVSMKSD 437

Query: 793 IFISASPGNMHNAMVGHRTYENLKTIRPNMALLGQLFMNKSIIWSDFQEATVEGHKNRQG 852
           IFISASPGNMHNA+  HR Y NLKTIRPNM LLGQ+F+NKS+ WS+F+ A + GHKNRQG
Sbjct: 438 IFISASPGNMHNALQAHRAYLNLKTIRPNMILLGQVFVNKSLDWSEFEGAVMNGHKNRQG 497

Query: 853 QIRLRKPKQSIYTYPAPDCVC 871
           Q+RLRK KQSIYTYPAPDC+C
Sbjct: 498 QLRLRKQKQSIYTYPAPDCMC 516

BLAST of HG10016573 vs. TAIR 10
Match: AT5G65470.1 (O-fucosyltransferase family protein)

HSP 1 Score: 770.0 bits (1987), Expect = 2.0e-222
Identity = 375/479 (78.29%), Positives = 418/479 (87.27%), Query Frame = 0

Query: 396 ASPSTFSEWNAPKPRHLRLLKSALQRQSSKPDQSDLWAPLADEGWRPCVASSKASSLPGK 455
           ++ S+ SE    KPRHL LLKSALQR S   +QSDLW PL D+GW PC+    + SLP K
Sbjct: 27  STSSSSSEVITIKPRHLSLLKSALQRSSG--EQSDLWRPLTDQGWSPCIDLGNSPSLPDK 86

Query: 456 SEGYIQVFLDGGLNQQRMGICDAVAVAKILNATLVIPHLEINPVWKDSSSFVDIFDVDHF 515
           + GY+QVFLDGGLNQQRMGICDAVAVAKILNATLVIP+LE+NPVW+DSSSFVDIFDVDHF
Sbjct: 87  TAGYVQVFLDGGLNQQRMGICDAVAVAKILNATLVIPYLEVNPVWQDSSSFVDIFDVDHF 146

Query: 516 INVLKDDISIVKELPADFSWSTREYYATAIRATRVKTAPVHASANWYLDNVLPVLQSYGI 575
           I+ LKDDI +V+ELP ++SWSTREYY TA+R TRVKTAPVHASANWY++NV PVLQSYGI
Sbjct: 147 IDSLKDDIRVVRELPDEYSWSTREYYGTAVRETRVKTAPVHASANWYIENVSPVLQSYGI 206

Query: 576 AAIAPFSHRLAFENLPDEIQRLRCKVNFQALTFVPHIRVLGDALISRLQYP---LNKKES 635
           AAI+PFSHRL+F++LP EIQRLRCKVNFQAL FVPHI  LGDAL+SRL+ P    NK++ 
Sbjct: 207 AAISPFSHRLSFDHLPAEIQRLRCKVNFQALRFVPHITSLGDALVSRLRNPSWRSNKEQK 266

Query: 636 KVANYLSMTTDANEQGPLKFVVLHLRFDKDMAAHSACDFGGGKAEKLALAKYRQVLWQGR 695
            V +   MT     Q P KF VLHLRFDKDMAAHSACDFGGGKAEKL+LAKYRQ++WQGR
Sbjct: 267 NVDHLGDMTNPHRRQEPGKFAVLHLRFDKDMAAHSACDFGGGKAEKLSLAKYRQMIWQGR 326

Query: 696 VLNSQFTDEELRSQGRCPLTPEEIGLLLAAFGFDNNTRLYLASHKVYGGEARISTLRSLF 755
           VLNSQFTDEELRSQGRCPLTPEE+GLLLAAFGFDNNTRLYLASHKVYGGEARISTLR +F
Sbjct: 327 VLNSQFTDEELRSQGRCPLTPEEMGLLLAAFGFDNNTRLYLASHKVYGGEARISTLRQVF 386

Query: 756 PLMEDKKSLTSGNELAQIKGKASLLAAVDYYVSMHSDIFISASPGNMHNAMVGHRTYENL 815
           P MEDK+SL S  E A+IKGKASLLAA+DYYVSMHSDIFISASPGNMHNA+VGHRT+ENL
Sbjct: 387 PRMEDKRSLASSEERARIKGKASLLAALDYYVSMHSDIFISASPGNMHNALVGHRTFENL 446

Query: 816 KTIRPNMALLGQLFMNKSIIWSDFQEATVEGHKNRQGQIRLRKPKQSIYTYPAPDCVCH 872
           KTIRPNMAL+GQLF+NKSI W DFQ+A  EGH NRQGQIRLRKPKQSIYTYPAPDC+CH
Sbjct: 447 KTIRPNMALIGQLFLNKSITWVDFQQALGEGHVNRQGQIRLRKPKQSIYTYPAPDCMCH 503

BLAST of HG10016573 vs. TAIR 10
Match: AT3G26370.1 (O-fucosyltransferase family protein)

HSP 1 Score: 325.9 bits (834), Expect = 1.0e-88
Identity = 175/448 (39.06%), Positives = 263/448 (58.71%), Query Frame = 0

Query: 436 ADEGWRPCVAS--SKASSLPGKSE--GYIQVFLDGGLNQQRMGICDAVAVAKILNATLVI 495
           A   W+PC        S LP ++E  GY+ +  +GGLNQQR+ IC+AVAVAKI+NATL++
Sbjct: 134 ATTSWKPCAERRIGGISDLPPENETNGYVFIHAEGGLNQQRIAICNAVAVAKIMNATLIL 193

Query: 496 PHLEINPVWKDSSSFVDIFDVDHFINVLKDDISIVKELPADFSWST-REYYATAIRATRV 555
           P L+ + +WKD++ F DIFDVDHFI+ LKDD+ IV+++P    W T +    ++IR T V
Sbjct: 194 PVLKQDQIWKDTTKFEDIFDVDHFIDYLKDDVRIVRDIP---DWFTDKAELFSSIRRT-V 253

Query: 556 KTAPVHASANWYLDNVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRCKVNFQALTFVP 615
           K  P +A+A +Y+DNVLP ++   I A+ PF  RL ++N+P EI RLRC+VN+ AL F+P
Sbjct: 254 KNIPKYAAAQFYIDNVLPRIKEKKIMALKPFVDRLGYDNVPQEINRLRCRVNYHALKFLP 313

Query: 616 HIRVLGDALISRLQYPLNKKESKVANYLSMTTDANEQG-PLKFVVLHLRFDKDMAAHSAC 675
            I  + D+L+SR++                    N  G P  ++ LHLRF+K M   S C
Sbjct: 314 EIEQMADSLVSRMR--------------------NRTGNPNPYMALHLRFEKGMVGLSFC 373

Query: 676 DFGGGKAEKLALAKYRQVLWQGRVLNSQFTDE---ELRSQGRCPLTPEEIGLLLAAFGFD 735
           DF G + EK+ +A+YRQ  W  R  N     +   + R +GRCPL P E+ ++L A G+ 
Sbjct: 374 DFVGTREEKVKMAEYRQKEWPRRFKNGSHLWQLALQKRKEGRCPLEPGEVAVILRAMGYP 433

Query: 736 NNTRLYLASHKVYGGEARISTLRSLFPLMEDKKSLTSGNELAQIKGKASLLAAVDYYVSM 795
             T++Y+AS +VYGG+ R++ LR++FP +  K+ L    EL   +   + LAA+D+ V +
Sbjct: 434 KETQIYVASGQVYGGQNRMAPLRNMFPNLVTKEDLAGKEELTTFRKHVTSLAALDFLVCL 493

Query: 796 HSDIFISASPGNMHNAMVGHRTY--ENLKTIRPNMALLGQLFMNKSIIWSDFQEATVEGH 855
            SD+F+    GN    ++G R Y     K+I+P+  L+ + F +  + W+ F E  V  H
Sbjct: 494 KSDVFVMTHGGNFAKLIIGARRYMGHRQKSIKPDKGLMSKSFGDPYMGWATFVEDVVVTH 553

Query: 856 KNRQGQIRLRKPKQSIYTYPAPDCVCHA 873
           + R G      P   ++  P   C+C A
Sbjct: 554 QTRTGLPEETFPNYDLWENPLTPCMCKA 557

BLAST of HG10016573 vs. TAIR 10
Match: AT1G35510.1 (O-fucosyltransferase family protein)

HSP 1 Score: 323.6 bits (828), Expect = 5.0e-88
Identity = 182/448 (40.62%), Positives = 260/448 (58.04%), Query Frame = 0

Query: 430 DLWAPLADEGWRPCVASSKASSLPGKSEGYIQVFLDGGLNQQRMGICDAVAVAKILNATL 489
           + W P     W+PC+ S+  S+    S GY  +  +GGLNQQR+ ICDAVAVA +LNATL
Sbjct: 134 EAWKPRVKSVWKPCI-STNVSAAGSNSNGYFIIEANGGLNQQRLSICDAVAVAGLLNATL 193

Query: 490 VIPHLEINPVWKDSSSFVDIFDVDHFINVLKDDISIVKELPADFSWSTREYYATAIRATR 549
           VIP   +N VW+DSS F DIFD D FI  L  ++++VKELP D       Y  ++I   R
Sbjct: 194 VIPIFHLNSVWRDSSKFGDIFDEDFFIYALSKNVNVVKELPKDV-LERYNYNISSIVNLR 253

Query: 550 VKTAPVHASANWYLDNVLPVLQSYGIAAIAPFSHRLAFENLPDEIQRLRCKVNFQALTFV 609
           +K     +S  +YL  VLP L   G   +APFS+RLA   +P  IQ LRC  NF+AL F 
Sbjct: 254 LK---AWSSPAYYLQKVLPQLLRLGAVRVAPFSNRLA-HAVPAHIQGLRCLANFEALRFA 313

Query: 610 PHIRVLGDALISRLQYPLNKKESKVANYLSMTTDANEQGPLKFVVLHLRFDKDMAAHSAC 669
             IR+L + ++ R                 M T + E G  K+V +HLRF+ DM A S C
Sbjct: 314 EPIRLLAEKMVDR-----------------MVTKSVESGG-KYVSVHLRFEMDMVAFSCC 373

Query: 670 DFGGGKAEKLALAKYRQVLWQG--RVLNSQFTDEELRSQGRCPLTPEEIGLLLAAFGFDN 729
           ++  G+AEKL +   R+  W+G  R           R  G+CPLTP E+G++L   GF+N
Sbjct: 374 EYDFGQAEKLEMDMARERGWKGKFRRRGRVIRPGANRIDGKCPLTPLEVGMMLRGMGFNN 433

Query: 730 NTRLYLASHKVYGGEARISTLRSLFPLMEDKKSLTSGNELAQIKGKASLLAAVDYYVSMH 789
           +T +Y+A+  +Y  +  ++ LR +FPL++ K +L +  ELA  KG +S LAA+DY V +H
Sbjct: 434 STLVYVAAGNIYKADKYMAPLRQMFPLLQTKDTLATPEELAPFKGHSSRLAALDYTVCLH 493

Query: 790 SDIFISASPGNMHNAMVGHRTY---ENLKTIRPNMALLGQLFMNKSIIWSDFQEATVE-- 849
           S++F+S   GN  + ++GHR Y    + +TI+P+   L QL    SI W  F++   +  
Sbjct: 494 SEVFVSTQGGNFPHFLIGHRRYLYKGHAETIKPDKRKLVQLLDKPSIRWDYFKKQMQDML 553

Query: 850 GHKNRQGQIRLRKPKQSIYTYPAPDCVC 871
            H + +G + LRKP  S+YT+P PDC+C
Sbjct: 554 RHNDAKG-VELRKPAASLYTFPMPDCMC 556

BLAST of HG10016573 vs. TAIR 10
Match: AT1G29200.1 (O-fucosyltransferase family protein)

HSP 1 Score: 311.6 bits (797), Expect = 2.0e-84
Identity = 183/482 (37.97%), Positives = 278/482 (57.68%), Query Frame = 0

Query: 410 RHLRLLKSALQRQSSKPDQSDLWAPLADEG--WRPCVASSKASSLPGK----SEGYIQVF 469
           R L L   +L +   KPD  +     + +   W+PC  ++KA+    +    S GYI V 
Sbjct: 18  RLLNLASDSLAKNEFKPDTPNFREERSSKSSQWKPCADNNKAAVALERSRELSNGYIMVS 77

Query: 470 LDGGLNQQRMGICDAVAVAKILNATLVIPHLEINPVWKDSSSFVDIFDVDHFINVLKDDI 529
            +GGLNQQR+ IC+AVAVA +LNATLV+P    + VWKD S F DI+  DHFI  LKD++
Sbjct: 78  ANGGLNQQRVAICNAVAVAALLNATLVLPRFLYSNVWKDPSQFGDIYQEDHFIEYLKDEV 137

Query: 530 SIVKELPADFSWSTREYYATAIRATRVKTA-PVHASANWYLDNVLPVLQSYGIAAIAPFS 589
           +IVK LP     +  +  +       VK A PV      Y+++VLP+L+ YG+  +  + 
Sbjct: 138 NIVKNLPQHLKSTDNKNLSLVTDTELVKEATPVD-----YIEHVLPLLKKYGMVHLFGYG 197

Query: 590 HRLAFENLPDEIQRLRCKVNFQALTFVPHIRVLGDALISRL-QYPLNK---KESKVANYL 649
           +RL F+ LP ++QRLRCK NF AL F P I+  G  L+ R+ ++  ++   +E+ +   +
Sbjct: 198 NRLGFDPLPFDVQRLRCKCNFHALKFAPKIQEAGSLLVKRIRRFKTSRSRLEEALLGESM 257

Query: 650 SMTTDANEQGPLKFVVLHLRFDKDMAAHSACDFGGGKAEKLALAKYRQ----VLWQGRVL 709
             +T   E+ PLK++ LHLRF++DM A+S CDFGGG+AE+  L  YR+    +L +    
Sbjct: 258 VKSTVKGEEEPLKYLALHLRFEEDMVAYSLCDFGGGEAERKELQAYREDHFPLLLKRLKK 317

Query: 710 NSQFTDEELRSQGRCPLTPEEIGLLLAAFGFDNNTRLYLASHKVYGGEARISTLRSLFPL 769
           +   + EELR  G+CPLTPEE  L+LA  GF   T +YLA  ++YGG +R+  L  L+P 
Sbjct: 318 SKPVSPEELRKTGKCPLTPEEATLVLAGLGFKRKTYIYLAGSQIYGGSSRMLPLTRLYPN 377

Query: 770 MEDKKSLTSGNELAQIKGKASLLAAVDYYVSMHSDIFISASPGNMHNAMV-GHRTY---E 829
           +  K++L +  ELA  K  +S LAA+D+   + SD+F     G+  +++V G R Y    
Sbjct: 378 IATKETLLTPQELAPFKNFSSQLAALDFIACIASDVFAMTDSGSQLSSLVSGFRNYYGNG 437

Query: 830 NLKTIRPNMALLGQLFM-NKSIIWSDFQEATVEGHKNRQGQIRLRKPKQSIYTYP-APDC 871
              T+RPN   L  +   +++I W  F++   +  +  Q ++R R   +SIY  P  P+C
Sbjct: 438 QAPTLRPNKKRLAAILSDSETIKWKIFEDRVRKMVEEGQ-KLRTRPYGRSIYRQPRCPEC 493
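
The identity and positives percentages in the HSP summary lines above are the matched counts divided by the alignment length. The following is a minimal Python sketch for recomputing them, assuming the summary-line layout shown on this page. The helper name `parse_hsp_summary` and the regular expression are hypothetical and are not part of any BLAST tooling.

```python
import re

# Hypothetical pattern for the HSP summary lines shown on this page.
# "Posi?tives" also accepts the "Postives" misspelling seen in some reports.
HSP_SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_summary(line):
    """Return identity/positives counts and percentages recomputed from them."""
    match = HSP_SUMMARY.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: " + repr(line))
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return {
        "identical": identical,
        "positive": positive,
        "aligned_length": ident_len,
        "identity_pct": 100.0 * identical / ident_len,
        "positive_pct": 100.0 * positive / pos_len,
    }


# Values copied from the AT4G24530.1 hit above.
hit = parse_hsp_summary(
    "Identity = 372/501 (74.25%), Positives = 427/501 (85.23%), Query Frame = 0"
)
print(f"{hit['identity_pct']:.2f}% identity over {hit['aligned_length']} columns")
```

Recomputing from the raw counts avoids relying on the rounded percentage printed in the report, which can differ in the last digit between the alignment header and the summary table.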

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
KAG6604063.1    0.0e+00   86.14     O-fucosyltransferase 39, partial [Cucurbita argyrosperma subsp. sororia]
KAG6594905.1    0.0e+00   85.80     O-fucosyltransferase 31, partial [Cucurbita argyrosperma subsp. sororia]
XP_038881929.1  2.4e-289  96.30     O-fucosyltransferase 39 [Benincasa hispida]
XP_004143441.1  3.4e-288  95.91     O-fucosyltransferase 31 [Cucumis sativus] >KGN48703.1 hypothetical protein Csa_0...
XP_008440542.1  3.2e-286  95.13     PREDICTED: uncharacterized protein At1g04910 [Cucumis melo] >KAA0036344.1 O-fuco...

Match Name      E-value   Identity  Description
Q7Y030          2.0e-222  74.25     O-fucosyltransferase 31 OS=Arabidopsis thaliana OX=3702 GN=OFUT31 PE=2 SV=1
Q0WUZ5          2.9e-221  78.29     O-fucosyltransferase 39 OS=Arabidopsis thaliana OX=3702 GN=OFUT39 PE=2 SV=1
Q9LIN9          1.4e-87   39.06     Protein PECTIC ARABINOGALACTAN SYNTHESIS-RELATED OS=Arabidopsis thaliana OX=3702...
Q8H1E6          7.1e-87   40.63     O-fucosyltransferase 9 OS=Arabidopsis thaliana OX=3702 GN=OFUT9 PE=2 SV=1
F4HZX7          2.8e-83   37.97     O-fucosyltransferase 8 OS=Arabidopsis thaliana OX=3702 GN=OFUT8 PE=2 SV=1

Match Name      E-value   Identity  Description
A0A0A0KGN2      1.7e-288  95.91     O-fucosyltransferase family protein OS=Cucumis sativus OX=3659 GN=Csa_6G498990 P...
A0A5A7SYI4      1.5e-286  95.13     O-fucosyltransferase family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5...
A0A1S3B0Y1      1.5e-286  95.13     O-fucosyltransferase family protein OS=Cucumis melo OX=3656 GN=LOC103484932 PE=3...
M4DBF2          2.8e-280  62.84     O-fucosyltransferase family protein OS=Brassica rapa subsp. pekinensis OX=51351 ...
A0A6J1IP37      4.8e-280  93.74     O-fucosyltransferase family protein OS=Cucurbita maxima OX=3661 GN=LOC111478666 ...

Match Name      E-value   Identity  Description
AT4G24530.1     1.4e-223  74.25     O-fucosyltransferase family protein
AT5G65470.1     2.0e-222  78.29     O-fucosyltransferase family protein
AT3G26370.1     1.0e-88   39.06     O-fucosyltransferase family protein
AT1G35510.1     5.0e-88   40.63     O-fucosyltransferase family protein
AT1G29200.1     2.0e-84   37.97     O-fucosyltransferase family protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                             Source       Source Term    Source Description              Alignment
None       No IPR available                            COILS        Coil           Coil                            coord: 199..219
None       No IPR available                            COILS        Coil           Coil                            coord: 221..255
None       No IPR available                            COILS        Coil           Coil                            coord: 172..192
None       No IPR available                            PANTHER      PTHR31933:SF5  O-FUCOSYLTRANSFERASE 39         coord: 366..871
None       No IPR available                            PANTHER      PTHR31933      O-FUCOSYLTRANSFERASE 2-RELATED  coord: 366..871
IPR002100  Transcription factor, MADS-box              PRINTS       PR00404        MADSDOMAIN                      coord: 116..137, score: 62.84; coord: 81..101, score: 52.46; coord: 101..116, score: 77.22
IPR002100  Transcription factor, MADS-box              SMART        SM00432        madsneu2                        coord: 79..138, e-value: 3.0E-38, score: 143.1
IPR002100  Transcription factor, MADS-box              PFAM         PF00319        SRF-TF                          coord: 88..135, e-value: 2.0E-24, score: 84.8
IPR002100  Transcription factor, MADS-box              PROSITE      PS50066        MADS_BOX_2                      coord: 79..139, score: 29.828545
IPR002487  Transcription factor, K-box                 PFAM         PF01486        K-box                           coord: 170..249, e-value: 1.0E-15, score: 57.7
IPR002487  Transcription factor, K-box                 PROSITE      PS51297        K_BOX                           coord: 165..255, score: 13.252406
IPR019378  GDP-fucose protein O-fucosyltransferase     PFAM         PF10250        O-FucT                          coord: 459..809, e-value: 3.6E-61, score: 207.7
IPR036879  Transcription factor, MADS-box superfamily  GENE3D       3.40.1810.10   -                               coord: 91..160, e-value: 8.0E-28, score: 98.1
IPR036879  Transcription factor, MADS-box superfamily  SUPERFAMILY  55455          SRF-like                        coord: 81..161
IPR033896  MADS MEF2-like                              CDD          cd00265        MADS_MEF2_like                  coord: 80..155, e-value: 4.25868E-40, score: 139.995
IPR024709  Putative O-fucosyltransferase, plant        CDD          cd11299        O-FucT_plant                    coord: 459..815, e-value: 1.69136E-144, score: 426.981

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10016573.1  HG10016573.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006004      fucose metabolic process
biological_process  GO:0045944      positive regulation of transcription by RNA polymerase II
biological_process  GO:0006355      regulation of transcription, DNA-templated
cellular_component  GO:0005634      nucleus
molecular_function  GO:0003700      DNA-binding transcription factor activity
molecular_function  GO:0016757      glycosyltransferase activity
molecular_function  GO:0046983      protein dimerization activity
molecular_function  GO:0000977      RNA polymerase II transcription regulatory region sequence-specific DNA binding
molecular_function  GO:0003677      DNA binding