Cp4.1LG02g17670 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG02g17670
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionO-fucosyltransferase family protein
LocationCp4.1LG02: 14297314 .. 14306403 (-)
RNA-Seq ExpressionCp4.1LG02g17670
SyntenyCp4.1LG02g17670
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGGTTGTTGATCTTTCCCTCTTTTGCTTTAGATGCAGAGGAGCTTCCTGTTCGTTCTTCAATTTTGATGCCATTTCTTCTGATCCCTAAATTTGATGTGTCTGAATTCTCTGTTGCTTCTTGCCTTCTACCTTTCCTACCTAGTATTGCTTTGTTTGGTTGGCTGTCTTTTGTTCTTTCAACACATCTGCTTCTCCTGATTTTGAGATGGACGATCCTTGTTTTGATGACTTTGATCCTCGCACCAACTTTTCCAAGGTGATTCAGTAATGTTCTTCTTTTTCATTTTTCGCCTCTTAGATTCATCTTCTACTTCTTGTTTTAATGCAATGTCTGTCAATTTACCAGTTCTTGGAAGAAGCAAAACATCATGCCAAAGAAGAAACAGGAAGGAAATGGTTGGCACAACAGAAAAAGAGCAAGAAATCATGGAAAAACACACTCTTCTCCTGGTTGAATAGTGACAAAACGAGCAAATCTCTTCCCAAACTAGAAAGAAATTCTCACACATCCAATAAAAGACGTGTTCATGTTTCGGGTCCGATTTACACTGGAGCTACGACCGTCGATGGCAGACCCCGACACCGTCCGACGTCTGGACCAATTGCGAGTCTCTTCAATCCAAGTATGCGAACCGAGATGGAGATCCCTTATATGTGTCTCCACCAGCTCACTAGCCCCATTCCTAATCACTATTATGGTCCAATTTATCTTGTTACATAGTCCTACATTGCCTCAACTCATCTTGGAGCCAGTGCAGCTTTGAAATGTTCTGTCTTGCTTTGTTTTGTATGTGTGATGAATATTATGAATATTATGAATGTTATGAGCCAAGTTTGATTCTTGGCCCTCTTCGAAATTTGGTGATCATGGTTACAACTTACAACTCCACGACTACTTTTGTTTGATAATATTCATGTTTCTCGTTTCTGTTTGTTAAAGACAAGAAACGTTTAACTGTTACTGTTTCTCATTTATCGATCAGTCAATACCACATGGCATACTAAACAGATCGAAATTGATTCAAAGGATTGAAAATTTCTTAATATAATAATATTCTAACAAAATGGAAGAGCTTGTTGGGTGAAGAACATTCATTTCAGAGATATTCTTGAACTAGGCGTGGATTACAAGGACTCACTCTTCCCTGCCTACTCAAAAGCCTTTTAAATGTCCTCAATCTTGTGGGATCTGGTAATTTTCTCTCCCCATTAGGCTCCAATGGCGGAAACTGAAGATCAATCTGTATATCATCAATTGCAAATTCGGCATCCGGGTGACTCGCCGATGAAGACGCTGCCGGAAATTGAAGATCGCTCGTGAAAATCATCCAAACCGTGGCTACCTCTGGTGGGCTTCGGGCTGGAATAATGGGCTGCTTTCAATTGAAGTTAATGGGCCTGTTGGCTACGGTGCTTGTTATCCAACGCCTCCGTCGAATCGGACAAAGGAGAGAGGTTCGTTGTGTGGCTCAGGCTCTCAGTCTCAGACTGGCTTCGTCTCTGCTCCCTAATCCTTTGAGCGACGCTGCCCATCTTCCATCAATTCACAATATCCTTGCACAGTCGATTATAATGCAAATTCCCCAACTGGTTTGCGTCCATTGCTTCTTCAACTGAATCGTAGTGGTTGATTTTGGCTGAATGAGCTCCCATGCCTTTGATTGGTATGCTTCTCTTGTTTGCTTTCAGTGGCTTCTGTTCATTTCGAATGTTATCGTTATCATTTTCTCAATGAGGGTTTCTTTTTTCTTGGTTTTGCTCGTGTTTGTAGCTGAGATTTCTACTGCTCAGCCTGTTTTCCACAGTTTCCCTCGTACCTCCTATGCCCAATCATGTCTTCATGTTAAAAAGTCACTTGCGTCACTTACTTCTGGGAATGAAAAGGAAAATTATCCTTCCTGGAAATCTCTGATTGTACCGAAACGGGTGAGTTTCACTGTATGCCAGCCTCAAATTTGTAGGCGGTGGTTCTTGATTAGAGCAGTTGCTACTCTTGAACCTAAGCGTGTGGTTCATGATGGGAAGGGGGACGTTTCTATGGGGGGAGGGGCTGAATTTAAGAATTCTCAAATGGGTGCTGCCCCGAGTACTTCAGATGTTCAGCTTTCCTCGTCCAGTGAAGATACGGAAGAAATGGATGCGCGAGAGAGGTTGAGGCGAGAGAGGATTTCCAAAGCGAATAAAGGAAACACGCCGTGGAACAAAGGGAGAAAGCACAGTGCAGGTAACACACCGACATTTCATCTATACACGTTGAGGATTTACCGTTTTCTTTTAATAAGCAAGTCATTTGCAGAGACCCTTAGACGAATCAAGGAGCGAACAAGGCTTGCAATGCAGAATCCTAAGGTGAAGTCATGTAAGAATGGTTTTATGTATTCTTGTGGATATAAAAATGGTGATACACATTCTTGCGGAATGAATGACCATAATGTCTTGTAGCCTTGACTCTGAACCATGAGAAGTGTTACACGGCCTAGGATTCAACAGTAGGTCATGAACAGAATATTTATCTAGGAATATTTGATATCCCACGACCTTTTAATACTATACTCTCCTCGTAATGTATCCTCTTTTCTGTGCAGATAAAAATGAAGTTAGTTAATCTTGGCCGTTCTCAGAGGTAGGTGTCATCCCAGTCTGTTTACTTGGATGGTTTATGATCCGTGATTCTGCAGTTGATTCTGTCTAACTATGCGATCCCTGAGCCATTGAATTTCAATTTCCCCTTGTTTTTCATGCCACAGTGAAGAGACGAGGATGAGAATTGGCGTTGGAGTGCGAATGGGGTGGCAAAGACGCCGTAAGAAGCTGAAATTACAGGAAACTTGCTACTTACAGTGGAAAGATTTAATTGCTGAAGCATCAAGACAAGGCGGTCTGGGTGAGGAAGAGTTGCAGTGGGACTCCTACCAAATCATGAATGAACAACTTAAAAAGGAGTGGCAGGAGAGTGTTGAGCAACGGAAGACAATGCCCAGGCCGGTTGGCGGCAGGAGGGCACCAAAGTCAGCTGAGCAGAGGAAGAAGATATCCGAATCCATCTCTGCCAAATGGGCTGATTCTGTATGTCTTGTTTTATATCCTTCATTAGTTACATTGAATGGAACCTTTTTAAGTTGTCGAGAAATGTTTGGATTTAATTCTAGAGCTCATTCACACCCTCTCCGGATAGACTTGGATCATTGTTTGGCATATACAAACCATTAGCCTATAACTAACCTTCAAACTACCATAGTCTTATCTTTGGTGGTTAATGCTGATAGATTTGATTTTATGCAGCTTATCACGATCCTACCAAATGCTCTTAACAGCCACTATCCCTTCTGCATGTTTACTTGTTTATTTTCTGTCAGGAATATCGTGCTCGAGTTTTCTCTGGCCTGGCTAAATATCATGGCACACCAATTGGAGTCAACAGAAGGCCAAGGAGGAAGCGTAGTGAAAGTACAGAGACCACAAGAAAGAAAGAAAAGAGTGGTGTTAAATCTCCTGTTGCAGGTGGGTCTAAGATTGAAAGCCAACGATTGAGACTCAGGAAAAGCAAAGCACCGCGTTTTAAAGACCCCTTAGCGAGCTCTAAGCTGGAAATGATAAAGAGTATCAGGGCAGAGAGAGCAATTGCAGAAACTCAAAAAACGGAAGCCATTGAACGAGCCAGGTCTGTTCTTTTCCTTTACTTGCAGTTTCTTAAATAAATTAATGGATGGTAGGCCTAGCCATAGACATAAAACTCGTTGAGCATGTATGTGAATAACATTGTTTGAAATCATAGAACTCTTATCGAAAGATAATTCGAGGGAAATCGAACCTGTTGATTGTATCTGTAGCCTTTGAGTATTGGTAGCTTAATCAAGAATGTGAGACGGCTTGATTTAACATTGAACAGTTAACGAGATTCCAGTTTGTTCTTTGCAGACTCTTGATTGCTGAAGCTGAGAAAGCCGCCAAGGCCCTTGAGGTGGCTGCTACTAGAAGTTCCATTGCTCGAGCTTCCCTCTTGGAAACAAGAAATCTTATAGCTGAAGCCAAACAATCAATTGAATCCGTAGAAATAGAGCGAATGGCATCCCCGCAGAGCGAAGAACGGAATGCAGCAGCCTCCTACACCTACGAAGTGGGGGGTACCTCAAATGAGGAGGGAGACTCAGTTGGCGGAAAAGGGAACCAAAATGGAGTGGTTCAAACAATGGCAAATGGAACCCAGTTGTTTCCATCGAGCATAGATAAGGATTTTGATTTTAGCAAGTTAAGTTTACAGGATATACTTGGCGGAGAGAAGGAAGTTCCAGCAAGCTCCAATGGGCATGGCGCATGTCATTCAAGCTTTTCAAGTCTGAGAAACCACCCTAATGGGAACAAGCCATCTGACCATAAACCTTCCTTGAACGGAACAAAACTTCACCACCTGGAAGAGAAACCGGATTCCCAAGTGATTAGTGTCACGAAGAAATGGGTTCGTGGGAGGCTGTTTGAAGTAGGTGATGGAGGTTGTTAGGCTTGATGTTTAATGAATATGAAGCGCATTTTATTTATCACATCTCTGTATTACAAACTAGCTGATTCTGTTGAGTTTTTGGAGCATTCATCCATCATACATTATTAGAGCAGCCTTCTGGGTGTTGTTCGATTTCTTATGTAAACAATCTCAACTTTGTTGGCTATGAAAATTCTCTTGCTCCCTTCACACCCATTAACACCATCTGACTTAGCTTACATAAACACTGTTGGCCTTGAACCATGCATGGCGGCCCGGTTGGAGGCCGCTTTTGCCTCTTAACTGCGTCACTGACATGCATGCATGTTCTTGACGTCAGATATTCGAATCTCAACTTTGAATTAAAAGTAAAAAATATGTTTTTTATTTTTTATTTTTGAATTATTGTAATATGATAAGTTTATCGATATTTTAAAAAATTTTAGTATGTCATGTAAAAAGAGGAGAGAAATTTCACCAGTCCGGGCGTATGTATTAAGCAGAGGGGGAGACGATATGGTGGGGGCGGCCCAATGTTAAAGAGTTTGGTGCGTATGTATTAAGCAGACGGGGAGATGATATGGTGGGGGCGGCCCAATGTTAAAGAGTTTGGTGCGTATGTATTAAGCAGACGGGGAGATGATATGGTGGGGGCGGCCCAATGTTAAAGAGTTTGGTGAGGGACGATGGCGTGTCATTTTCTTTCCTTCATCCTTCGCCCTTGCTTTCTTTTCGCATTAAACTTCCATCTTCCTCGCTACTCAACTGTGTTTCTCGCGACAGTCCAACAACAACACTGCGGATTTCCACTCCATTTTTTCCAACCATACTCTGCTGAAGAAGAGGAAGAAACAACAAGACCTGAAGAAGGTCATGGATGACGAATCAGATGATTGCCGAAACCTTATTGACCAGAATTCCCCCAAGCGCGTCCCTTCTACTTTCGACATCGACGACGATCCCCATTTCAGGCCTCCCATTCAGACTTTCCGTTTCTCCATTCCTAAATTTGCACTTGACAAGAGGTACTACTACATTTTAGCGGCCGCCCTCCCTCTATGCATTGTTGTTGTATTTTTCTCTGCCGACATCCAAACTCTCTTCTCTACTAATCTCTCTTCCCCACTGAAAAGTTCCGATTCCCTCAGTGACCGCATGAGGGAAGCGGAATTAAGAGCTTTGTATTTGTTAAGGCAGCAACAACTGGGTTTTTCCGATCTTTGGAATCACTCCTTGCTCGTTCAATCTAATTCAAGTTTCAACTCCACCTCTTCTAATAATTTGAGTTCCAATTCAGCCTCAGGAACCCCATCTACAGAAGATCTCAAATCTGCTATATTGAAGCAGATTTCTTTGAACAAAGAGATCCAAAACGTTCTTTTATCCCCCCATAGCTCTGGGAACATACCAGAGGAAGTTGGTGATGCTCATTCCATGGGCAGCTTCGCCCTTGATAGATGTAGAAAGATGGACCAGAAACTTTCCGATAGAAGAACTATTGAGTGGAAGCCAAAATCGAACAAGTTTTTGTTCGCTATATGCACTTCGGGGCAAATGTCGAACCATTTGATCTGTTTGGAGAAGCATATGTTCTTTGCTGCTATCCTCAACAGAATTCTTGTTATTCCTAGTCACAAAGTTGATTTTCAGTTCAGTAGAGTAATTGACATTGATCATATTAATTCATGTTTGGGAAGAAAGGTCGTCATTTCTTTTGAGGAGTTTTCTGAGATTAAGAAGCACCACTTGCACATTGATCGGTTCTTCTGTTACTTTTCAAAGCCAGATCCTTGTTATGTGGATGACGAACATATTAAAAAGTTGAAGACCTTGGGGGTCTCTATGGGCAAGCTCGAATCTGCCTGGAATGAAGATACTAAGAAGCCCACTAGAAAGACAGTTTCGGACATTGAATCCAAGTTCTCCTCTAATGACGATGTTGTAGCTGTAGGAGATATTTTCTTTGCTAATGTAGAGCAAGAGTGGGTGAATCAACCAGGCGGTCCCATCGCTCATAAATGTCAGACTTTGATAGAACCAAGTCGTCTTATCAAGCTGACGGCCCAGCGATTTATTCAAACCTTCTTAGGAAAGAATTATATGGCCTTCCATTTCCGACGACATGGTTTTTTAAAGTTCTGGTAATACATTTACGATCAATGTGTTCATACAAATTCTATTTTACGGCTGCATTATTAGTTTCTTTAGAAACGAATGGTTAAGATTTTGTTTGTTTTCTGGTTGGCCTTTTGACGAACAATGTTAACAAATGTGATCATTGTTTCTTGTTGTTTGAAGTACTTGCAAGTTGGGTTTTCCATAGGGAGATTTCATATCAAAGTGGGTAATGAAATTCTGTGTTTCAGTAATGCAAAGCAGCCAAGTTGCTTTTACCCCATTCCCCAAGCTGCCGACTGCTTAATCCGAGTGGTTGAAAGGGCAAACGTTCCAGTCATTTATCTTTCCACTGATGCAGCAGAGAGCGAATATGGATTGCTGCAGTCACTTCTTGTGTTGAATGGGAAGCCCATACCACTTGTTAAGAGGCCTCCACGTAATTCAGCTGAAAAATGGGATGCCTTATTATATAGGCATGGGATCGAGGGAGATTCTCAGGTCAGCCTAAGAAACTCAACTTTTCTTTTACACACTTCTTGGTTTTGTAGGGAAAAGGTTATCTTAACAGTCAAACCCTTCTTTGTTTCTATCAATGTTTTATGCATAATGACTAGCTGAGGTTAGAGTTAGGGTGTCAATGATGCTCCAGTACTTTCAGCATTTTGAGTGTTTGATTTTAGCTGTGTACAAAGTTCCACATGCGCACAACATATGTTGATTGAATAGTCCGAATCATTTATAATTACGGCCTGTTCATGATCAAGTTTAGATGGGGGTAGCTATAGTGGGCAGTCGTCTGGTATTCTTAAGAAAACTGTTAAAAGGTTGCTGGTTTGTTTTATTTGTTTAACTTTATTCAGGTTTGTTAGATACTTCGAAGTTCACCATGTTGCTTACAAAGCTCATTGTTTTTTCTTTGCATGCTTTTGGTATCATAATTTTTCAGCCGTGGGAGGACATGATCTAATCCTAGACTTGATACAAATGTTGAATCAGAGTCCTTCCCAACTGTACTGCTCACGATGTGACAAGCAATTTTGATTGATTCATCCAAACCAATAAAATCTAACATTGACTTGTTTGTTTTGGTCATGTGAATTGAGTTCTATGTGAAATTATAAAATCTTGAATTTCCTTAGCATCCTCCCACTCAGATTTTTATCGTGTAGGTTGAAGCGATGCTGGACAAGACAATTGGTGCTATGGCTAGCACATTTATTGGTGCATCTGGGTCTACATTCACTGAGGACATTTTGCGGCTAAGGAAGGACTGGGGCTCTGCATCTACTTGTGATGAGTACCTTTGCCAAGGCGAGGAACCAAATTTCATTGCAGAAAATGAATGATATGTAAAGATTCCGTGAGGTATAGAAGCTGTCACTCTGATCCATTTTTAGGCTTGATCTAGAATCTAGTTTACTGTATGATATGGCACCTGATGAAGTCTGCACAGATTCTTTTCCGTGTATTAAGTACTCTTAATAAGATATTTACGGCTGTAACCCATGTATATAGAGAAATACTTGAGACACTGAACTCCGCATTGTCACTGTATAATCCACAATACTTCCCATAGGGGAAGGCTTTGATTGAGTAAATAAGATCTTTGCAATATGAAGCTTCTATTAAGCAAAATAGGTTTCAAAATAACTTCTCAGTTTTCATTGATGCACGACTAGGTTCACTCCGTATGAACTTTTCAGCAAGCATAAAGCGTGAAAAGCAATGTGAACGTAAACATAAAACATTACCGTGCTTCTGTCTATGAAGGCGGGTATGAACGAAATCCTAATATTAACGTGACATCGATGTGAATGATTAAGCATTCCTTGTAGATTGCTTAAAGAAGCCAGTTAAGATGTCTCATGTGCATCAGGTTCATCAGTTCTAAAAGATGATCCGTGATCACTTCCAAATGTTCTTCTGCTCATGTTGGATCTCTGCACAAGCCGAACCAATGCTTCCACCACCTCTGACATAGGAGGTCTGAATTCCGGCTCGGGCTGCCATGGAAAACCCAAATGAACCTTCATTTAGCTGATTGGGAGAGACTGAATGCACAGTATTTTGCAAGATTTTTGCTTCACAACTAGATGTTTGGAGTACAAAAATTTGACTAAGGCCGAGTACCTGGACGCAGAGAGCCACAACATCTGCAAATCTTGAGAGAGACTTGACTGGGTAAAGACCTTTAAGTGCAGGATCAACCATTTTCGTCAAGGCATCAATGTCATGGAGCTGAGGCGTTGCCCATCGAACCAAGGATTGCTCAGCTCTTGGCCTAGAACTACAACGAAGTCACAAAATTATCATTAAACAAACCAATTAGTCTTCTCACTCCAGCTTTGGATTTGCCTTTATTTACCTATCAAACGGCTTACGTCCACATAGAAGTTCTAACATCACCACTCCAAAGCTGTATATGTCGCTTTTTAATGTATATTGACCAGACATGGTAACCTCCGGGGCACTATATCCAGATCCTGCTTG

mRNA sequence

ATGGCGTATTGCTTTGTTTGGTTGGCTGTCTTTTGTTCTTTCAACACATCTGCTTCTCCTGATTTTGAGATGGACGATCCTTGTTTTGATGACTTTGATCCTCGCACCAACTTTTCCAAGTTCTTGGAAGAAGCAAAACATCATGCCAAAGAAGAAACAGGAAGGAAATGGTTGGCACAACAGAAAAAGAGCAAGAAATCATGGAAAAACACACTCTTCTCCTGGTTGAATAGTGACAAAACGAGCAAATCTCTTCCCAAACTAGAAAGAAATTCTCACACATCCAATAAAAGACGTGTTCATGTTTCGGGTCCGATTTACACTGGAGCTACGACCGTCGATGGCAGACCCCGACACCGTCCGACGTCTGGACCAATTGCGAGTCTCTTCAATCCAAGTATGCGAACCGAGATGGAGATCCCTTATATGTGTCTCCACCAGCTCACTAGCCCCATTCCTAATCACTATTATGGTCCAATTTATCTTGCTCCAATGGCGGAAACTGAAGATCAATCTGTATATCATCAATTGCAAATTCGGCATCCGGTTAATGGGCCTGTTGGCTACGGTGCTTGTTATCCAACGCCTCCGTCGAATCGGACAAAGGAGAGAGCTGAGATTTCTACTGCTCAGCCTGTTTTCCACAGTTTCCCTCGTACCTCCTATGCCCAATCATGTCTTCATGTTAAAAAGTCACTTGCGTCACTTACTTCTGGGAATGAAAAGGAAAATTATCCTTCCTGGAAATCTCTGATTGTACCGAAACGGGTGAGTTTCACTGTATGCCAGCCTCAAATTTGTAGGCGGTGGTTCTTGATTAGAGCAGTTGCTACTCTTGAACCTAAGCGTGTGGTTCATGATGGGAAGGGGGACGTTTCTATGGGGGGAGGGGCTGAATTTAAGAATTCTCAAATGGGTGCTGCCCCGAGTACTTCAGATGTTCAGCTTTCCTCGTCCAGTGAAGATACGGAAGAAATGGATGCGCGAGAGAGGTTGAGGCGAGAGAGGATTTCCAAAGCGAATAAAGGAAACACGCCGTGGAACAAAGGGAGAAAGCACAGTGCAGAGACCCTTAGACGAATCAAGGAGCGAACAAGGCTTGCAATGCAGAATCCTAAGATAAAAATGAAGTTAGTTAATCTTGGCCGTTCTCAGAGTGAAGAGACGAGGATGAGAATTGGCGTTGGAGTGCGAATGGGGTGGCAAAGACGCCGTAAGAAGCTGAAATTACAGGAAACTTGCTACTTACAGTGGAAAGATTTAATTGCTGAAGCATCAAGACAAGGCGGTCTGGGTGAGGAAGAGTTGCAGTGGGACTCCTACCAAATCATGAATGAACAACTTAAAAAGGAGTGGCAGGAGAGTGTTGAGCAACGGAAGACAATGCCCAGGCCGGTTGGCGGCAGGAGGGCACCAAAGTCAGCTGAGCAGAGGAAGAAGATATCCGAATCCATCTCTGCCAAATGGGCTGATTCTGAATATCGTGCTCGAGTTTTCTCTGGCCTGGCTAAATATCATGGCACACCAATTGGAGTCAACAGAAGGCCAAGGAGGAAGCGTAGTGAAAGTACAGAGACCACAAGAAAGAAAGAAAAGAGTGGTGTTAAATCTCCTGTTGCAGGTGGGTCTAAGATTGAAAGCCAACGATTGAGACTCAGGAAAAGCAAAGCACCGCGTTTTAAAGACCCCTTAGCGAGCTCTAAGCTGGAAATGATAAAGAGTATCAGGGCAGAGAGAGCAATTGCAGAAACTCAAAAAACGGAAGCCATTGAACGAGCCAGACTCTTGATTGCTGAAGCTGAGAAAGCCGCCAAGGCCCTTGAGGTGGCTGCTACTAGAAGTTCCATTGCTCGAGCTTCCCTCTTGGAAACAAGAAATCTTATAGCTGAAGCCAAACAATCAATTGAATCCGTAGAAATAGAGCGAATGGCATCCCCGCAGAGCGAAGAACGGAATGCAGCAGCCTCCTACACCTACGAAGTGGGGGGTACCTCAAATGAGGAGGGAGACTCAGTTGGCGGAAAAGGGAACCAAAATGGAGTGGTTCAAACAATGGCAAATGGAACCCAGTTGTTTCCATCGAGCATAGATAAGGATTTTGATTTTAGCAAGTTAAGTTTACAGGATATACTTGGCGGAGAGAAGGAAGTTCCAGCAAGCTCCAATGGGCATGGCGCATGTCATTCAAGCTTTTCAAGTCTGAGAAACCACCCTAATGGGAACAAGCCATCTGACCATAAACCTTCCTTGAACGGAACAAAACTTCACCACCTGGAAGAGAAACCGGATTCCCAAGTGATTAGTGTCACGAAGAAATGGGTTCGTGGGAGGCTGTTTGAAGTAGGTGATGGAGACGGGGAGATGATATGGTGGGGGCGGCCCAATGTTAAAGAGTTTGGTGCGTATTCCAACAACAACACTGCGGATTTCCACTCCATTTTTTCCAACCATACTCTGCTGAAGAAGAGGAAGAAACAACAAGACCTGAAGAAGGTCATGGATGACGAATCAGATGATTGCCGAAACCTTATTGACCAGAATTCCCCCAAGCGCGTCCCTTCTACTTTCGACATCGACGACGATCCCCATTTCAGGCCTCCCATTCAGACTTTCCGTTTCTCCATTCCTAAATTTGCACTTGACAAGAGGTACTACTACATTTTAGCGGCCGCCCTCCCTCTATGCATTGTTGTTGTATTTTTCTCTGCCGACATCCAAACTCTCTTCTCTACTAATCTCTCTTCCCCACTGAAAAGTTCCGATTCCCTCAGTGACCGCATGAGGGAAGCGGAATTAAGAGCTTTGTATTTGTTAAGGCAGCAACAACTGGGTTTTTCCGATCTTTGGAATCACTCCTTGCTCGTTCAATCTAATTCAAGTTTCAACTCCACCTCTTCTAATAATTTGAGTTCCAATTCAGCCTCAGGAACCCCATCTACAGAAGATCTCAAATCTGCTATATTGAAGCAGATTTCTTTGAACAAAGAGATCCAAAACGTTCTTTTATCCCCCCATAGCTCTGGGAACATACCAGAGGAAGTTGGTGATGCTCATTCCATGGGCAGCTTCGCCCTTGATAGATGTAGAAAGATGGACCAGAAACTTTCCGATAGAAGAACTATTGAGTGGAAGCCAAAATCGAACAAGTTTTTGTTCGCTATATGCACTTCGGGGCAAATGTCGAACCATTTGATCTGTTTGGAGAAGCATATGTTCTTTGCTGCTATCCTCAACAGAATTCTTGTTATTCCTAGTCACAAAGTTGATTTTCAGTTCAGTAGAGTAATTGACATTGATCATATTAATTCATGTTTGGGAAGAAAGGTCGTCATTTCTTTTGAGGAGTTTTCTGAGATTAAGAAGCACCACTTGCACATTGATCGGTTCTTCTGTTACTTTTCAAAGCCAGATCCTTGTTATGTGGATGACGAACATATTAAAAAGTTGAAGACCTTGGGGGTCTCTATGGGCAAGCTCGAATCTGCCTGGAATGAAGATACTAAGAAGCCCACTAGAAAGACAGTTTCGGACATTGAATCCAAGTTCTCCTCTAATGACGATGTTGTAGCTGTAGGAGATATTTTCTTTGCTAATGTAGAGCAAGAGTGGGTGAATCAACCAGGCGGTCCCATCGCTCATAAATGTCAGACTTTGATAGAACCAAGTCGTCTTATCAAGCTGACGGCCCAGCGATTTATTCAAACCTTCTTAGGAAAGAATTATATGGCCTTCCATTTCCGACGACATGGTTTTTTAAATAATGCAAAGCAGCCAAGTTGCTTTTACCCCATTCCCCAAGCTGCCGACTGCTTAATCCGAGTGGTTGAAAGGGCAAACGTTCCAGTCATTTATCTTTCCACTGATGCAGCAGAGAGCGAATATGGATTGCTGCAGTCACTTCTTGTGTTGAATGGGAAGCCCATACCACTTGTTAAGAGGCCTCCACGTAATTCAGCTGAAAAATGGGATGCCTTATTATATAGGCATGGGATCGAGGGAGATTCTCAGGTTGAAGCGATGCTGGACAAGACAATTGGTGCTATGGCTAGCACATTTATTGGTGCATCTGGGTCTACATTCACTGAGGACATTTTGCGGCTAAGGAAGGACTGGGGCTCTGCATCTACTTGTGATGAGTACCTTTGCCAAGGCGAGGAACCAAATTTCATTGCAGAAAATGAATGATATGTAAAGATTCCGTGAGGTATAGAAGCTGTCACTCTGATCCATTTTTAGGCTTGATCTAGAATCTAGTTTACTGTATGATATGGCACCTGATGAAGTCTGCACAGATTCTTTTCCGTGTATTAAGTACTCTTAATAAGATATTTACGGCTGTAACCCATGTATATAGAGAAATACTTGAGACACTGAACTCCGCATTGTCACTGTATAATCCACAATACTTCCCATAGGGGAAGGCTTTGATTGAGTAAATAAGATCTTTGCAATATGAAGCTTCTATTAAGCAAAATAGGTTTCAAAATAACTTCTCAGTTTTCATTGATGCACGACTAGGTTCACTCCGTATGAACTTTTCAGCAAGCATAAAGCGTGAAAAGCAATGTGAACGTAAACATAAAACATTACCGTGCTTCTGTCTATGAAGGCGGGTATGAACGAAATCCTAATATTAACGTGACATCGATGTGAATGATTAAGCATTCCTTGTAGATTGCTTAAAGAAGCCAGTTAAGATGTCTCATGTGCATCAGGTTCATCAGTTCTAAAAGATGATCCGTGATCACTTCCAAATGTTCTTCTGCTCATGTTGGATCTCTGCACAAGCCGAACCAATGCTTCCACCACCTCTGACATAGGAGGTCTGAATTCCGGCTCGGGCTGCCATGGAAAACCCAAATGAACCTTCATTTAGCTGATTGGGAGAGACTGAATGCACAGTATTTTGCAAGATTTTTGCTTCACAACTAGATGTTTGGAGTACAAAAATTTGACTAAGGCCGAGTACCTGGACGCAGAGAGCCACAACATCTGCAAATCTTGAGAGAGACTTGACTGGGTAAAGACCTTTAAGTGCAGGATCAACCATTTTCGTCAAGGCATCAATGTCATGGAGCTGAGGCGTTGCCCATCGAACCAAGGATTGCTCAGCTCTTGGCCTAGAACTACAACGAAGTCACAAAATTATCATTAAACAAACCAATTAGTCTTCTCACTCCAGCTTTGGATTTGCCTTTATTTACCTATCAAACGGCTTACGTCCACATAGAAGTTCTAACATCACCACTCCAAAGCTGTATATGTCGCTTTTTAATGTATATTGACCAGACATGGTAACCTCCGGGGCACTATATCCAGATCCTGCTTG

Coding sequence (CDS)

ATGGCGTATTGCTTTGTTTGGTTGGCTGTCTTTTGTTCTTTCAACACATCTGCTTCTCCTGATTTTGAGATGGACGATCCTTGTTTTGATGACTTTGATCCTCGCACCAACTTTTCCAAGTTCTTGGAAGAAGCAAAACATCATGCCAAAGAAGAAACAGGAAGGAAATGGTTGGCACAACAGAAAAAGAGCAAGAAATCATGGAAAAACACACTCTTCTCCTGGTTGAATAGTGACAAAACGAGCAAATCTCTTCCCAAACTAGAAAGAAATTCTCACACATCCAATAAAAGACGTGTTCATGTTTCGGGTCCGATTTACACTGGAGCTACGACCGTCGATGGCAGACCCCGACACCGTCCGACGTCTGGACCAATTGCGAGTCTCTTCAATCCAAGTATGCGAACCGAGATGGAGATCCCTTATATGTGTCTCCACCAGCTCACTAGCCCCATTCCTAATCACTATTATGGTCCAATTTATCTTGCTCCAATGGCGGAAACTGAAGATCAATCTGTATATCATCAATTGCAAATTCGGCATCCGGTTAATGGGCCTGTTGGCTACGGTGCTTGTTATCCAACGCCTCCGTCGAATCGGACAAAGGAGAGAGCTGAGATTTCTACTGCTCAGCCTGTTTTCCACAGTTTCCCTCGTACCTCCTATGCCCAATCATGTCTTCATGTTAAAAAGTCACTTGCGTCACTTACTTCTGGGAATGAAAAGGAAAATTATCCTTCCTGGAAATCTCTGATTGTACCGAAACGGGTGAGTTTCACTGTATGCCAGCCTCAAATTTGTAGGCGGTGGTTCTTGATTAGAGCAGTTGCTACTCTTGAACCTAAGCGTGTGGTTCATGATGGGAAGGGGGACGTTTCTATGGGGGGAGGGGCTGAATTTAAGAATTCTCAAATGGGTGCTGCCCCGAGTACTTCAGATGTTCAGCTTTCCTCGTCCAGTGAAGATACGGAAGAAATGGATGCGCGAGAGAGGTTGAGGCGAGAGAGGATTTCCAAAGCGAATAAAGGAAACACGCCGTGGAACAAAGGGAGAAAGCACAGTGCAGAGACCCTTAGACGAATCAAGGAGCGAACAAGGCTTGCAATGCAGAATCCTAAGATAAAAATGAAGTTAGTTAATCTTGGCCGTTCTCAGAGTGAAGAGACGAGGATGAGAATTGGCGTTGGAGTGCGAATGGGGTGGCAAAGACGCCGTAAGAAGCTGAAATTACAGGAAACTTGCTACTTACAGTGGAAAGATTTAATTGCTGAAGCATCAAGACAAGGCGGTCTGGGTGAGGAAGAGTTGCAGTGGGACTCCTACCAAATCATGAATGAACAACTTAAAAAGGAGTGGCAGGAGAGTGTTGAGCAACGGAAGACAATGCCCAGGCCGGTTGGCGGCAGGAGGGCACCAAAGTCAGCTGAGCAGAGGAAGAAGATATCCGAATCCATCTCTGCCAAATGGGCTGATTCTGAATATCGTGCTCGAGTTTTCTCTGGCCTGGCTAAATATCATGGCACACCAATTGGAGTCAACAGAAGGCCAAGGAGGAAGCGTAGTGAAAGTACAGAGACCACAAGAAAGAAAGAAAAGAGTGGTGTTAAATCTCCTGTTGCAGGTGGGTCTAAGATTGAAAGCCAACGATTGAGACTCAGGAAAAGCAAAGCACCGCGTTTTAAAGACCCCTTAGCGAGCTCTAAGCTGGAAATGATAAAGAGTATCAGGGCAGAGAGAGCAATTGCAGAAACTCAAAAAACGGAAGCCATTGAACGAGCCAGACTCTTGATTGCTGAAGCTGAGAAAGCCGCCAAGGCCCTTGAGGTGGCTGCTACTAGAAGTTCCATTGCTCGAGCTTCCCTCTTGGAAACAAGAAATCTTATAGCTGAAGCCAAACAATCAATTGAATCCGTAGAAATAGAGCGAATGGCATCCCCGCAGAGCGAAGAACGGAATGCAGCAGCCTCCTACACCTACGAAGTGGGGGGTACCTCAAATGAGGAGGGAGACTCAGTTGGCGGAAAAGGGAACCAAAATGGAGTGGTTCAAACAATGGCAAATGGAACCCAGTTGTTTCCATCGAGCATAGATAAGGATTTTGATTTTAGCAAGTTAAGTTTACAGGATATACTTGGCGGAGAGAAGGAAGTTCCAGCAAGCTCCAATGGGCATGGCGCATGTCATTCAAGCTTTTCAAGTCTGAGAAACCACCCTAATGGGAACAAGCCATCTGACCATAAACCTTCCTTGAACGGAACAAAACTTCACCACCTGGAAGAGAAACCGGATTCCCAAGTGATTAGTGTCACGAAGAAATGGGTTCGTGGGAGGCTGTTTGAAGTAGGTGATGGAGACGGGGAGATGATATGGTGGGGGCGGCCCAATGTTAAAGAGTTTGGTGCGTATTCCAACAACAACACTGCGGATTTCCACTCCATTTTTTCCAACCATACTCTGCTGAAGAAGAGGAAGAAACAACAAGACCTGAAGAAGGTCATGGATGACGAATCAGATGATTGCCGAAACCTTATTGACCAGAATTCCCCCAAGCGCGTCCCTTCTACTTTCGACATCGACGACGATCCCCATTTCAGGCCTCCCATTCAGACTTTCCGTTTCTCCATTCCTAAATTTGCACTTGACAAGAGGTACTACTACATTTTAGCGGCCGCCCTCCCTCTATGCATTGTTGTTGTATTTTTCTCTGCCGACATCCAAACTCTCTTCTCTACTAATCTCTCTTCCCCACTGAAAAGTTCCGATTCCCTCAGTGACCGCATGAGGGAAGCGGAATTAAGAGCTTTGTATTTGTTAAGGCAGCAACAACTGGGTTTTTCCGATCTTTGGAATCACTCCTTGCTCGTTCAATCTAATTCAAGTTTCAACTCCACCTCTTCTAATAATTTGAGTTCCAATTCAGCCTCAGGAACCCCATCTACAGAAGATCTCAAATCTGCTATATTGAAGCAGATTTCTTTGAACAAAGAGATCCAAAACGTTCTTTTATCCCCCCATAGCTCTGGGAACATACCAGAGGAAGTTGGTGATGCTCATTCCATGGGCAGCTTCGCCCTTGATAGATGTAGAAAGATGGACCAGAAACTTTCCGATAGAAGAACTATTGAGTGGAAGCCAAAATCGAACAAGTTTTTGTTCGCTATATGCACTTCGGGGCAAATGTCGAACCATTTGATCTGTTTGGAGAAGCATATGTTCTTTGCTGCTATCCTCAACAGAATTCTTGTTATTCCTAGTCACAAAGTTGATTTTCAGTTCAGTAGAGTAATTGACATTGATCATATTAATTCATGTTTGGGAAGAAAGGTCGTCATTTCTTTTGAGGAGTTTTCTGAGATTAAGAAGCACCACTTGCACATTGATCGGTTCTTCTGTTACTTTTCAAAGCCAGATCCTTGTTATGTGGATGACGAACATATTAAAAAGTTGAAGACCTTGGGGGTCTCTATGGGCAAGCTCGAATCTGCCTGGAATGAAGATACTAAGAAGCCCACTAGAAAGACAGTTTCGGACATTGAATCCAAGTTCTCCTCTAATGACGATGTTGTAGCTGTAGGAGATATTTTCTTTGCTAATGTAGAGCAAGAGTGGGTGAATCAACCAGGCGGTCCCATCGCTCATAAATGTCAGACTTTGATAGAACCAAGTCGTCTTATCAAGCTGACGGCCCAGCGATTTATTCAAACCTTCTTAGGAAAGAATTATATGGCCTTCCATTTCCGACGACATGGTTTTTTAAATAATGCAAAGCAGCCAAGTTGCTTTTACCCCATTCCCCAAGCTGCCGACTGCTTAATCCGAGTGGTTGAAAGGGCAAACGTTCCAGTCATTTATCTTTCCACTGATGCAGCAGAGAGCGAATATGGATTGCTGCAGTCACTTCTTGTGTTGAATGGGAAGCCCATACCACTTGTTAAGAGGCCTCCACGTAATTCAGCTGAAAAATGGGATGCCTTATTATATAGGCATGGGATCGAGGGAGATTCTCAGGTTGAAGCGATGCTGGACAAGACAATTGGTGCTATGGCTAGCACATTTATTGGTGCATCTGGGTCTACATTCACTGAGGACATTTTGCGGCTAAGGAAGGACTGGGGCTCTGCATCTACTTGTGATGAGTACCTTTGCCAAGGCGAGGAACCAAATTTCATTGCAGAAAATGAATGA

Protein sequence

MAYCFVWLAVFCSFNTSASPDFEMDDPCFDDFDPRTNFSKFLEEAKHHAKEETGRKWLAQQKKSKKSWKNTLFSWLNSDKTSKSLPKLERNSHTSNKRRVHVSGPIYTGATTVDGRPRHRPTSGPIASLFNPSMRTEMEIPYMCLHQLTSPIPNHYYGPIYLAPMAETEDQSVYHQLQIRHPVNGPVGYGACYPTPPSNRTKERAEISTAQPVFHSFPRTSYAQSCLHVKKSLASLTSGNEKENYPSWKSLIVPKRVSFTVCQPQICRRWFLIRAVATLEPKRVVHDGKGDVSMGGGAEFKNSQMGAAPSTSDVQLSSSSEDTEEMDARERLRRERISKANKGNTPWNKGRKHSAETLRRIKERTRLAMQNPKIKMKLVNLGRSQSEETRMRIGVGVRMGWQRRRKKLKLQETCYLQWKDLIAEASRQGGLGEEELQWDSYQIMNEQLKKEWQESVEQRKTMPRPVGGRRAPKSAEQRKKISESISAKWADSEYRARVFSGLAKYHGTPIGVNRRPRRKRSESTETTRKKEKSGVKSPVAGGSKIESQRLRLRKSKAPRFKDPLASSKLEMIKSIRAERAIAETQKTEAIERARLLIAEAEKAAKALEVAATRSSIARASLLETRNLIAEAKQSIESVEIERMASPQSEERNAAASYTYEVGGTSNEEGDSVGGKGNQNGVVQTMANGTQLFPSSIDKDFDFSKLSLQDILGGEKEVPASSNGHGACHSSFSSLRNHPNGNKPSDHKPSLNGTKLHHLEEKPDSQVISVTKKWVRGRLFEVGDGDGEMIWWGRPNVKEFGAYSNNNTADFHSIFSNHTLLKKRKKQQDLKKVMDDESDDCRNLIDQNSPKRVPSTFDIDDDPHFRPPIQTFRFSIPKFALDKRYYYILAAALPLCIVVVFFSADIQTLFSTNLSSPLKSSDSLSDRMREAELRALYLLRQQQLGFSDLWNHSLLVQSNSSFNSTSSNNLSSNSASGTPSTEDLKSAILKQISLNKEIQNVLLSPHSSGNIPEEVGDAHSMGSFALDRCRKMDQKLSDRRTIEWKPKSNKFLFAICTSGQMSNHLICLEKHMFFAAILNRILVIPSHKVDFQFSRVIDIDHINSCLGRKVVISFEEFSEIKKHHLHIDRFFCYFSKPDPCYVDDEHIKKLKTLGVSMGKLESAWNEDTKKPTRKTVSDIESKFSSNDDVVAVGDIFFANVEQEWVNQPGGPIAHKCQTLIEPSRLIKLTAQRFIQTFLGKNYMAFHFRRHGFLNNAKQPSCFYPIPQAADCLIRVVERANVPVIYLSTDAAESEYGLLQSLLVLNGKPIPLVKRPPRNSAEKWDALLYRHGIEGDSQVEAMLDKTIGAMASTFIGASGSTFTEDILRLRKDWGSASTCDEYLCQGEEPNFIAENE
Homology
BLAST of Cp4.1LG02g17670 vs. ExPASy Swiss-Prot
Match: Q9FK30 (O-fucosyltransferase 36 OS=Arabidopsis thaliana OX=3702 GN=OFUT36 PE=2 SV=1)

HSP 1 Score: 677.9 bits (1748), Expect = 2.4e-193
Identity = 360/587 (61.33%), Postives = 445/587 (75.81%), Query Frame = 0

Query: 829  LKKVMDDESDDCRNLIDQNSPK-----------------RVPSTFDIDDDPHFRPPIQTF 888
            +++   D+ +D ++LI QN  +                    S F IDD  H    +Q  
Sbjct: 1    MERNSSDDEEDHQHLIPQNDTRIRHREDSVSSNATTIGGNQRSAFQIDDILH---RVQ-- 60

Query: 889  RFSIPKFALDKRYYYILAAALPLCIVVVFFSADIQTLFSTNLSSPLKSSDSLSDRMREAE 948
                 K +L+KR Y I+  +L + I ++F   D + LF+ N SS     D LS+R++E+E
Sbjct: 61   --HRGKISLNKR-YVIVFVSLIISIGLLFLLTDPRELFAANFSS--FKLDPLSNRVKESE 120

Query: 949  LRALYLLRQQQLGFSDLWNHSLLVQSNSSFNSTSSNNLSSNSASGTPSTEDLKSAILKQI 1008
            LRALYLLRQQQL    LWN +L+           S N S N+   +   ED+KSA+ KQI
Sbjct: 121  LRALYLLRQQQLALLSLWNGTLV---------NPSLNQSENALGSSVLFEDVKSAVSKQI 180

Query: 1009 SLNKEIQNVLLSPHSSGNIPEEVGDAHSMGSFALDRCRKMDQKLSDRRTIEWKPKSNKFL 1068
            SLNKEIQ VLLSPH S N      D  S+ +F+ +RCRK+DQKLSDR+T+EWKP+S+KFL
Sbjct: 181  SLNKEIQEVLLSPHRSSNYSGGT-DVDSV-NFSYNRCRKVDQKLSDRKTVEWKPRSDKFL 240

Query: 1069 FAICTSGQMSNHLICLEKHMFFAAILNRILVIPSHKVDFQFSRVIDIDHINSCLGRKVVI 1128
            FAIC SGQMSNHLICLEKHMFFAA+L+R+LVIPS K D+Q+ RVIDI+ IN+CLGR VV+
Sbjct: 241  FAICLSGQMSNHLICLEKHMFFAALLDRVLVIPSSKFDYQYDRVIDIERINTCLGRNVVV 300

Query: 1129 SFEEFSE-IKKHHLHIDRFFCYFSKPDPCYVDDEHIKKLKTLGVSM-GKLESAWNEDTKK 1188
            +F++F E  KK+H  IDRF CYFS P  CYVD+EHIKKLK LG+S+ GKLE+ W+ED KK
Sbjct: 301  AFDQFKEKAKKNHFRIDRFICYFSSPQLCYVDEEHIKKLKGLGISIDGKLEAPWSEDIKK 360

Query: 1189 PTRKTVSDIESKFSSNDDVVAVGDIFFANVEQEWVNQPGGPIAHKCQTLIEPSRLIKLTA 1248
            P+++TV D++ KF S+DDV+A+GD+F+A++EQ+WV QPGGPI HKC+TLIEPS+LI LTA
Sbjct: 361  PSKRTVQDVQMKFKSDDDVIAIGDVFYADMEQDWVMQPGGPINHKCKTLIEPSKLILLTA 420

Query: 1249 QRFIQTFLGKNYMAFHFRRHGFLN--NAKQPSCFYPIPQAADCLIRVVERANVPVIYLST 1308
            QRFIQTFLGKN++A HFRRHGFL   NAK PSCFYPIPQAA+C+ R+VER+N  VIYLST
Sbjct: 421  QRFIQTFLGKNFIALHFRRHGFLKFCNAKSPSCFYPIPQAAECIARIVERSNGAVIYLST 480

Query: 1309 DAAESEYGLLQSLLVLNGKPIPLVKRPPRNSAEKWDALLYRHGIEGDSQVEAMLDKTIGA 1368
            DAAESE  LLQSL+V++GK +PLVKRPPRNSAEKWDALLYRHGIE DSQV+AMLDKTI A
Sbjct: 481  DAAESETSLLQSLVVVDGKIVPLVKRPPRNSAEKWDALLYRHGIEDDSQVDAMLDKTICA 540

Query: 1369 MASTFIGASGSTFTEDILRLRKDWGSASTCDEYLCQGEEPNFIAENE 1395
            M+S FIGASGSTFTEDILRLRKDWG++STCDEYLC+GEEPNFIAE+E
Sbjct: 541  MSSVFIGASGSTFTEDILRLRKDWGTSSTCDEYLCRGEEPNFIAEDE 566

BLAST of Cp4.1LG02g17670 vs. ExPASy Swiss-Prot
Match: Q501D6 (O-fucosyltransferase 14 OS=Arabidopsis thaliana OX=3702 GN=OFUT14 PE=2 SV=1)

HSP 1 Score: 662.5 bits (1708), Expect = 1.0e-188
Identity = 332/572 (58.04%), Postives = 433/572 (75.70%), Query Frame = 0

Query: 835  DESDDCRNLIDQNSPKRVPSTFDIDDDPHFRPP---IQTFRFSIPK-FALDKRYYYILAA 894
            DE  D +NL++++  +     F I D+   + P   +++ R  + + F L+      +  
Sbjct: 18   DEESDLQNLLEESDSQ--IDQFRISDEAAEQRPTFDVESLRSRLRRSFKLNLTKKQSIFI 77

Query: 895  ALPLCIVVVFFSADIQTLFSTNLSSPLKSSDSLSDRMREAELRALYLLRQQQLGFSDLWN 954
             LP+ I++++ S D    FS  + +    S++L+ R+ E++L+ALYLLR+Q+     +WN
Sbjct: 78   FLPIVIILIYLSTDFSNYFSVKVPNSAFRSNTLTGRVHESDLQALYLLRKQESDLFSIWN 137

Query: 955  HSLLVQSNSSFNSTSSNNLSSNSASGTPSTEDLKSAILKQISLNKEIQNVLLSPHSSGNI 1014
            H++             +NLS        + +D+KSA+ +QISLN++IQN LLSPH +GN+
Sbjct: 138  HTV-------------SNLS--------TIDDVKSAVFRQISLNRQIQNALLSPHKTGNV 197

Query: 1015 PEEVGDAHSMGSFALDRCRKMDQKLSDRRTIEWKPKSNKFLFAICTSGQMSNHLICLEKH 1074
              ++G + S G FA   CRK+DQKL+ R+TI+WKP+ +KFLFAIC SGQMSNHLICLEKH
Sbjct: 198  --DIGGS-SDGYFAGGSCRKVDQKLNGRKTIQWKPRPDKFLFAICLSGQMSNHLICLEKH 257

Query: 1075 MFFAAILNRILVIPSHKVDFQFSRVIDIDHINSCLGRKVVISFEEF-----SEIKKHHLH 1134
            MFFAA+L R+LVIPSH+ D+ +SR+IDID IN+CLGR VV+SFEEF     +  K HH+H
Sbjct: 258  MFFAALLKRVLVIPSHRFDYHYSRIIDIDRINTCLGRTVVVSFEEFWKKDKNRKKHHHVH 317

Query: 1135 IDRFFCYFSKPDPCYVDDEHIKKLKTLGVSM-GKLESAWNEDTKKPTRKTVSDIESKFSS 1194
            I+RF CYFSKP+PCYVD EHI KLK LG+++ GKL++ W ED  +P+ KT  ++E+ F S
Sbjct: 318  INRFICYFSKPEPCYVDKEHITKLKALGITVGGKLDTPWEEDIARPSNKTAEEVEANFRS 377

Query: 1195 NDDVVAVGDIFFANVEQEWVNQPGGPIAHKCQTLIEPSRLIKLTAQRFIQTFLGKNYMAF 1254
            +DDV+A+GD+F+ANVE+EWV QPGGP+AHKC+TLIEP+RLI LTAQRFIQTFLGKNY+A 
Sbjct: 378  DDDVIAIGDVFYANVEREWVMQPGGPVAHKCRTLIEPNRLILLTAQRFIQTFLGKNYIAL 437

Query: 1255 HFRRHGFLN--NAKQPSCFYPIPQAADCLIRVVERANVPVIYLSTDAAESEYGLLQSLLV 1314
            HFRRHGFL   NAK PSCF+PIPQAA C+ R++E+   PV+YLSTDAAESE GLLQSLL+
Sbjct: 438  HFRRHGFLKFCNAKNPSCFFPIPQAASCITRLIEKVEAPVLYLSTDAAESETGLLQSLLI 497

Query: 1315 LNGKPIPLVKRPPRNSAEKWDALLYRHGIEGDSQVEAMLDKTIGAMASTFIGASGSTFTE 1374
            LNGK +PLVKRP R+SAEKWDALLYRHG+EGDSQVEAMLDKTI A++S FIGASGSTFTE
Sbjct: 498  LNGKTVPLVKRPARDSAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGASGSTFTE 557

Query: 1375 DILRLRKDWGSASTCDEYLCQGEEPNFIAENE 1395
            DILRLRKDWG+AS CDEYLC  E+PNFIA++E
Sbjct: 558  DILRLRKDWGTASECDEYLCANEQPNFIADHE 563

BLAST of Cp4.1LG02g17670 vs. ExPASy Swiss-Prot
Match: Q84WU0 (O-fucosyltransferase 5 OS=Arabidopsis thaliana OX=3702 GN=OFUT5 PE=2 SV=1)

HSP 1 Score: 656.0 bits (1691), Expect = 9.6e-187
Identity = 349/586 (59.56%), Postives = 433/586 (73.89%), Query Frame = 0

Query: 835  DESDDCRNLIDQNSPKRVPSTFDIDDDPHFRPPIQTFRFS-----IPKFALD-------- 894
            DE +D RNLI QN  +        D+D + RP  +T   +      P+ AL         
Sbjct: 7    DEEEDHRNLIPQNDTR--------DNDLNLRPDARTVNMANGGGRSPRSALQIDEILSRA 66

Query: 895  --------KRYYYILAAALPLCIVVVFFSADIQTLFSTNLSSPLKSSDSLSDRMREAELR 954
                     + Y + A +L L + ++F   D +T FS+         D +S R++E+EL+
Sbjct: 67   RNRWKISVNKRYVVAAVSLTLFVGLLFLFTDTRTFFSS------FKLDPMSSRVKESELQ 126

Query: 955  ALYLLRQQQLGFSDLWNHSLLVQSNSSFNSTSSNNLSSNSASGTPSTEDLKSAILKQISL 1014
            AL LLRQQQL    L N        ++FN       SSN+ S +   +++K+A+LKQIS+
Sbjct: 127  ALNLLRQQQLALVSLLN-------RTNFN-------SSNAISSSVVIDNVKAALLKQISV 186

Query: 1015 NKEIQNVLLSPHSSGNIPEEVGDAHSM-GSFALDRCRKMDQKLSDRRTIEWKPKSNKFLF 1074
            NKEI+ VLLSPH +GN       + S  GS+  D CRK+DQKL DR+TIEWKP+ +KFLF
Sbjct: 187  NKEIEEVLLSPHRTGNYSITASGSDSFTGSYNADICRKVDQKLLDRKTIEWKPRPDKFLF 246

Query: 1075 AICTSGQMSNHLICLEKHMFFAAILNRILVIPSHKVDFQFSRVIDIDHINSCLGRKVVIS 1134
            AIC SGQMSNHLICLEKHMFFAA+L+R+LVIPS K D+Q+ +VIDI+ IN+CLGR VVIS
Sbjct: 247  AICLSGQMSNHLICLEKHMFFAALLDRVLVIPSSKFDYQYDKVIDIERINTCLGRTVVIS 306

Query: 1135 FEEFSEI-KKHHLHIDRFFCYFSKPDPCYVDDEHIKKLKTLGVSM-GKLESAWNEDTKKP 1194
            F++F EI KK++ HIDRF CY S P PCYVD++HIKKLK LGVS+ GKLE+ W+ED KKP
Sbjct: 307  FDQFKEIDKKNNAHIDRFICYVSSPQPCYVDEDHIKKLKGLGVSIGGKLEAPWSEDIKKP 366

Query: 1195 TRKTVSDIESKFSSNDDVVAVGDIFFANVEQEWVNQPGGPIAHKCQTLIEPSRLIKLTAQ 1254
            T++T  ++  KF S+D V+A+GD+F+A++EQ+ V QPGGPI HKC+TLIEPSRLI +TAQ
Sbjct: 367  TKRTSQEVVEKFKSDDGVIAIGDVFYADMEQDLVMQPGGPINHKCKTLIEPSRLILVTAQ 426

Query: 1255 RFIQTFLGKNYMAFHFRRHGFLN--NAKQPSCFYPIPQAADCLIRVVERANVPVIYLSTD 1314
            RFIQTFLGKN+++ H RRHGFL   NAK PSCFYPIPQAADC+ R+VERAN PVIYLSTD
Sbjct: 427  RFIQTFLGKNFISLHLRRHGFLKFCNAKSPSCFYPIPQAADCISRMVERANAPVIYLSTD 486

Query: 1315 AAESEYGLLQSLLVLNGKPIPLVKRPPRNSAEKWDALLYRHGIEGDSQVEAMLDKTIGAM 1374
            AAESE GLLQSL+V++GK +PLVKRPP+NSAEKWD+LLYRHGIE DSQV AMLDKTI AM
Sbjct: 487  AAESETGLLQSLVVVDGKVVPLVKRPPQNSAEKWDSLLYRHGIEDDSQVYAMLDKTICAM 546

Query: 1375 ASTFIGASGSTFTEDILRLRKDWGSASTCDEYLCQGEEPNFIAENE 1395
            +S FIGASGSTFTEDILRLRKDWG++S CDEYLC+GEEPNFIAENE
Sbjct: 547  SSVFIGASGSTFTEDILRLRKDWGTSSMCDEYLCRGEEPNFIAENE 564

BLAST of Cp4.1LG02g17670 vs. NCBI nr
Match: KAG6606680.1 (O-fucosyltransferase 5, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 2129 bits (5516), Expect = 0.0
Identity = 1123/1192 (94.21%), Postives = 1128/1192 (94.63%), Query Frame = 0

Query: 205  AEISTAQPVFHSFPRTSYAQSCLHVKKSLASLTSGNEKENYPSWKSLIVPKRVSFTVCQP 264
            AEISTAQPVFHSFPRTSYAQSCLHVKKSLASLTSGNEKENYPSWKSLIVPKR        
Sbjct: 5    AEISTAQPVFHSFPRTSYAQSCLHVKKSLASLTSGNEKENYPSWKSLIVPKR-------- 64

Query: 265  QICRRWFLIRAVATLEPKRVVHDGKGDVSMGGGAEFKNSQMGAAPSTSDVQLSSSSEDTE 324
                                     GDVSMGGGAEFKNSQMGAAPSTSDVQLSSSSEDTE
Sbjct: 65   -------------------------GDVSMGGGAEFKNSQMGAAPSTSDVQLSSSSEDTE 124

Query: 325  EMDARERLRRERISKANKGNTPWNKGRKHSAETLRRIKERTRLAMQNPKIKMKLVNLGRS 384
            EMDARERLRRERISKANKGNTPWNKGRKHSAETLRRIKERTRLAMQNPKIKMKLVNLGRS
Sbjct: 125  EMDARERLRRERISKANKGNTPWNKGRKHSAETLRRIKERTRLAMQNPKIKMKLVNLGRS 184

Query: 385  QSEETRMRIGVGVRMGWQRRRKKLKLQETCYLQWKDLIAEASRQGGLGEEELQWDSYQIM 444
            QSEETRMRIGVGVRMGWQRRRKKLKLQETCYLQWKDLIAEASRQGGLGEEELQWDSYQIM
Sbjct: 185  QSEETRMRIGVGVRMGWQRRRKKLKLQETCYLQWKDLIAEASRQGGLGEEELQWDSYQIM 244

Query: 445  NEQLKKEWQESVEQRKTMPRPVGGRRAPKSAEQRKKISESISAKWADSEYRARVFSGLAK 504
            NEQLKKEWQESVEQRKTMPRPVGGRRAPKSAEQRKKISESISAKWADSEYRARVFSGLAK
Sbjct: 245  NEQLKKEWQESVEQRKTMPRPVGGRRAPKSAEQRKKISESISAKWADSEYRARVFSGLAK 304

Query: 505  YHGTPIGVNRRPRRKRSESTETTRKKEKSGVKSPVAGGSKIESQRLRLRKSKAPRFKDPL 564
            YHGTPIGVNRRPRRKRSESTETTRKKEKSGVKSPVAGGSKIESQRLRLRKSKAPRFKDPL
Sbjct: 305  YHGTPIGVNRRPRRKRSESTETTRKKEKSGVKSPVAGGSKIESQRLRLRKSKAPRFKDPL 364

Query: 565  ASSKLEMIKSIRAERAIAETQKTEAIERARLLIAEAEKAAKALEVAATRSSIARASLLET 624
             SSKLEMIKSIRAERAIAETQKTEAIERARLLIAEAEKAAKALEVAATRSSIARASLLET
Sbjct: 365  VSSKLEMIKSIRAERAIAETQKTEAIERARLLIAEAEKAAKALEVAATRSSIARASLLET 424

Query: 625  RNLIAEAKQSIESVEIERMASPQSEERNAAASYTYEVGGTSNEEGDSVGGKGNQNGVVQT 684
            RNLIAEAKQSIESVEIERMASPQSEERNAAASYTYEVGGTSNEEGDSV GKGNQNGVVQT
Sbjct: 425  RNLIAEAKQSIESVEIERMASPQSEERNAAASYTYEVGGTSNEEGDSVAGKGNQNGVVQT 484

Query: 685  MANGTQLFPSSIDKDFDFSKLSLQDILGGEKEVPASSNGHGACHSSFSSLRNHPNGNKPS 744
            MANGTQLFPSS+DKDFDFSKLSLQDILGGEKEVPASSNGHGACHSSFSSL NHPNGNKPS
Sbjct: 485  MANGTQLFPSSMDKDFDFSKLSLQDILGGEKEVPASSNGHGACHSSFSSLTNHPNGNKPS 544

Query: 745  DHKPSLNGTKLHHLEEKPDSQVISVTKKWVRGRLFEVGDGDGEMIWWGRPNVKEFGAYSN 804
            DHKPSLNGTKLHHLEEKPDSQVISVTKKWVRGRLFE                      SN
Sbjct: 545  DHKPSLNGTKLHHLEEKPDSQVISVTKKWVRGRLFE----------------------SN 604

Query: 805  NNTADFHSIFSNHTLLKKRKKQQDLKKVMDDESDDCRNLIDQNSPKRVPSTFDIDDDPHF 864
            +NTADFHSIFSNHTLLKKRKKQQDLKKVMDDESDDCRNLIDQNSPKRVPSTFDIDDDPHF
Sbjct: 605  SNTADFHSIFSNHTLLKKRKKQQDLKKVMDDESDDCRNLIDQNSPKRVPSTFDIDDDPHF 664

Query: 865  RPPIQTFRFSIPKFALDKRYYYILAAALPLCIVVVFFSADIQTLFSTNLSSPLKSSDSLS 924
            RPPIQTFRFSIPKFA DKRYYYILAAALPLCIVVVFFSADIQTLFSTNLSSPLKSSDSLS
Sbjct: 665  RPPIQTFRFSIPKFAFDKRYYYILAAALPLCIVVVFFSADIQTLFSTNLSSPLKSSDSLS 724

Query: 925  DRMREAELRALYLLRQQQLGFSDLWNHSLLVQSNSSFNSTSSNNLSSNSASGTPSTEDLK 984
            DRMREAELRALYLLRQQQLGFSDLWNHSLLVQSNSSFNSTSSNNLSSNSASGTPSTEDLK
Sbjct: 725  DRMREAELRALYLLRQQQLGFSDLWNHSLLVQSNSSFNSTSSNNLSSNSASGTPSTEDLK 784

Query: 985  SAILKQISLNKEIQNVLLSPHSSGNIPEEVGDAHSMGSFALDRCRKMDQKLSDRRTIEWK 1044
            SAILKQISLNKEIQNVLLSPHSSGNI EEVGDAHSMGSFALDRCRKMDQKLSDRRTIEWK
Sbjct: 785  SAILKQISLNKEIQNVLLSPHSSGNISEEVGDAHSMGSFALDRCRKMDQKLSDRRTIEWK 844

Query: 1045 PKSNKFLFAICTSGQMSNHLICLEKHMFFAAILNRILVIPSHKVDFQFSRVIDIDHINSC 1104
            PKSNKFLFAICTSGQMSNHLICLEKHMFFAAILNRILVIPSHKVDFQFSRVIDIDHINSC
Sbjct: 845  PKSNKFLFAICTSGQMSNHLICLEKHMFFAAILNRILVIPSHKVDFQFSRVIDIDHINSC 904

Query: 1105 LGRKVVISFEEFSEIKKHHLHIDRFFCYFSKPDPCYVDDEHIKKLKTLGVSMGKLESAWN 1164
            LGRKVVISFEEFSEIKKHHLHIDRFFCYFS+PDPCYVDDEHIKKLKTLGVSMGKLESAWN
Sbjct: 905  LGRKVVISFEEFSEIKKHHLHIDRFFCYFSRPDPCYVDDEHIKKLKTLGVSMGKLESAWN 964

Query: 1165 EDTKKPTRKTVSDIESKFSSNDDVVAVGDIFFANVEQEWVNQPGGPIAHKCQTLIEPSRL 1224
            EDTKKPTRKTVSDIES+FSSNDDVVAVGDIFFANVEQEWVNQPGGPIAHKCQTLIEPSRL
Sbjct: 965  EDTKKPTRKTVSDIESEFSSNDDVVAVGDIFFANVEQEWVNQPGGPIAHKCQTLIEPSRL 1024

Query: 1225 IKLTAQRFIQTFLGKNYMAFHFRRHGFLN--NAKQPSCFYPIPQAADCLIRVVERANVPV 1284
            IKLTAQRFIQTFLGKNYMAFHFRRHGFL   NAKQ SCFYPIPQAADCLIRVVERANVPV
Sbjct: 1025 IKLTAQRFIQTFLGKNYMAFHFRRHGFLKFCNAKQSSCFYPIPQAADCLIRVVERANVPV 1084

Query: 1285 IYLSTDAAESEYGLLQSLLVLNGKPIPLVKRPPRNSAEKWDALLYRHGIEGDSQVEAMLD 1344
            IYLSTDAAESEYGLLQSLLVLNGKPIPLVKRPPR+SAEKWDALLYRHGIEGDSQVEAMLD
Sbjct: 1085 IYLSTDAAESEYGLLQSLLVLNGKPIPLVKRPPRSSAEKWDALLYRHGIEGDSQVEAMLD 1141

Query: 1345 KTIGAMASTFIGASGSTFTEDILRLRKDWGSASTCDEYLCQGEEPNFIAENE 1394
            KTIGAMASTFIGASGSTFTEDILRLRKDWGSASTCDEYLCQGEEPNFIAENE
Sbjct: 1145 KTIGAMASTFIGASGSTFTEDILRLRKDWGSASTCDEYLCQGEEPNFIAENE 1141

BLAST of Cp4.1LG02g17670 vs. NCBI nr
Match: KAG7036399.1 (hypothetical protein SDJN02_00016, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1166 bits (3016), Expect = 0.0
Identity = 616/620 (99.35%), Postives = 617/620 (99.52%), Query Frame = 0

Query: 165 MAETEDQSVYHQLQIRHPVNGPVGYGACYPTPPSNRTKERAEISTAQPVFHSFPRTSYAQ 224
           MAETEDQSVYHQLQIRHPVNGPVGYGACYPTPPSNRTKERAEISTAQPVFHSFPRTSYAQ
Sbjct: 1   MAETEDQSVYHQLQIRHPVNGPVGYGACYPTPPSNRTKERAEISTAQPVFHSFPRTSYAQ 60

Query: 225 SCLHVKKSLASLTSGNEKENYPSWKSLIVPKRVSFTVCQPQICRRWFLIRAVATLEPKRV 284
           SCLHVKKSLASLTSGNEKENYPSWKSLIVPKRVSFTVCQPQICRR FLIRAVATLEPKRV
Sbjct: 61  SCLHVKKSLASLTSGNEKENYPSWKSLIVPKRVSFTVCQPQICRRGFLIRAVATLEPKRV 120

Query: 285 VHDGKGDVSMGGGAEFKNSQMGAAPSTSDVQLSSSSEDTEEMDARERLRRERISKANKGN 344
           VHDGKGDVSMGGGAEFKNSQMGAAPSTSDVQLSSSSEDTEEMDARERLRRERISKANKGN
Sbjct: 121 VHDGKGDVSMGGGAEFKNSQMGAAPSTSDVQLSSSSEDTEEMDARERLRRERISKANKGN 180

Query: 345 TPWNKGRKHSAETLRRIKERTRLAMQNPKIKMKLVNLGRSQSEETRMRIGVGVRMGWQRR 404
           TPWNKGRKHSAETLRRIKERTRLAMQNPKIKMKLVNLGRSQSEETRMRIGVGVRMGWQRR
Sbjct: 181 TPWNKGRKHSAETLRRIKERTRLAMQNPKIKMKLVNLGRSQSEETRMRIGVGVRMGWQRR 240

Query: 405 RKKLKLQETCYLQWKDLIAEASRQGGLGEEELQWDSYQIMNEQLKKEWQESVEQRKTMPR 464
           RKKLKLQETCYLQWKDLIAEASRQGGLGEEELQWDSYQIMNEQLKKEWQESVEQRKTMPR
Sbjct: 241 RKKLKLQETCYLQWKDLIAEASRQGGLGEEELQWDSYQIMNEQLKKEWQESVEQRKTMPR 300

Query: 465 PVGGRRAPKSAEQRKKISESISAKWADSEYRARVFSGLAKYHGTPIGVNRRPRRKRSEST 524
           PVGGRRAPKSAEQRKKISESISAKWADSEYRARVFSGLAKYHGTPIGVNRRPRRKRSEST
Sbjct: 301 PVGGRRAPKSAEQRKKISESISAKWADSEYRARVFSGLAKYHGTPIGVNRRPRRKRSEST 360

Query: 525 ETTRKKEKSGVKSPVAGGSKIESQRLRLRKSKAPRFKDPLASSKLEMIKSIRAERAIAET 584
           ETTRKKEKSGVKSPVAGGSKIESQRLRLRKSKAPRFKDPLASSKLEMIKSIRAERAIAET
Sbjct: 361 ETTRKKEKSGVKSPVAGGSKIESQRLRLRKSKAPRFKDPLASSKLEMIKSIRAERAIAET 420

Query: 585 QKTEAIERARLLIAEAEKAAKALEVAATRSSIARASLLETRNLIAEAKQSIESVEIERMA 644
           QKTEAIERARLLIAEAEKAAKALEVAATRSSIARASLLETRNLIAEAKQSIESVEIERMA
Sbjct: 421 QKTEAIERARLLIAEAEKAAKALEVAATRSSIARASLLETRNLIAEAKQSIESVEIERMA 480

Query: 645 SPQSEERNAAASYTYEVGGTSNEEGDSVGGKGNQNGVVQTMANGTQLFPSSIDKDFDFSK 704
           SPQSEERNAAASYTYEVGGTSNEEGDSV GKGNQNGVVQTMANGTQLFPSS+DKDFDFSK
Sbjct: 481 SPQSEERNAAASYTYEVGGTSNEEGDSVAGKGNQNGVVQTMANGTQLFPSSMDKDFDFSK 540

Query: 705 LSLQDILGGEKEVPASSNGHGACHSSFSSLRNHPNGNKPSDHKPSLNGTKLHHLEEKPDS 764
           LSLQDILGGEKEVPASSNGHGACHSSFSSL NHPNGNKPSDHKPSLNGTKLHHLEEKPDS
Sbjct: 541 LSLQDILGGEKEVPASSNGHGACHSSFSSLTNHPNGNKPSDHKPSLNGTKLHHLEEKPDS 600

Query: 765 QVISVTKKWVRGRLFEVGDG 784
           QVISVTKKWVRGRLFEVGDG
Sbjct: 601 QVISVTKKWVRGRLFEVGDG 620

BLAST of Cp4.1LG02g17670 vs. NCBI nr
Match: XP_023521813.1 (O-fucosyltransferase 36-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1110 bits (2872), Expect = 0.0
Identity = 561/564 (99.47%), Postives = 561/564 (99.47%), Query Frame = 0

Query: 833  MDDESDDCRNLIDQNSPKRVPSTFDIDDDPHFRPPIQTFRFSIPKFALDKRYYYILAAAL 892
            MDDESDDCRNLIDQNSPKRVPSTFDIDDDPHFRPPIQTFRFSIPKFALDKRYYYILAAAL
Sbjct: 1    MDDESDDCRNLIDQNSPKRVPSTFDIDDDPHFRPPIQTFRFSIPKFALDKRYYYILAAAL 60

Query: 893  PLCIVVVFFSADIQTLFSTNLSSPLKSSDSLSDRMREAELRALYLLRQQQLGFSDLWNHS 952
            PLCIVVVFFSADIQTLFSTNLSSPLKSSDSLSDRMREAELRALYLLRQQQLGFSDLWNHS
Sbjct: 61   PLCIVVVFFSADIQTLFSTNLSSPLKSSDSLSDRMREAELRALYLLRQQQLGFSDLWNHS 120

Query: 953  LLVQSNSSFNSTSSNNLSSNSASGTPSTEDLKSAILKQISLNKEIQNVLLSPHSSGNIPE 1012
            LLVQSNSSFNSTSSNNLSSNSASGTPSTEDLKSAILKQISLNKEIQNVLLSPHSSGNIPE
Sbjct: 121  LLVQSNSSFNSTSSNNLSSNSASGTPSTEDLKSAILKQISLNKEIQNVLLSPHSSGNIPE 180

Query: 1013 EVGDAHSMGSFALDRCRKMDQKLSDRRTIEWKPKSNKFLFAICTSGQMSNHLICLEKHMF 1072
            EVGDAHSMGSFALDRCRKMDQKLSDRRTIEWKPKSNKFLFAICTSGQMSNHLICLEKHMF
Sbjct: 181  EVGDAHSMGSFALDRCRKMDQKLSDRRTIEWKPKSNKFLFAICTSGQMSNHLICLEKHMF 240

Query: 1073 FAAILNRILVIPSHKVDFQFSRVIDIDHINSCLGRKVVISFEEFSEIKKHHLHIDRFFCY 1132
            FAAILNRILVIPSHKVDFQFSRVIDIDHINSCLGRKVVISFEEFSEIKKHHLHIDRFFCY
Sbjct: 241  FAAILNRILVIPSHKVDFQFSRVIDIDHINSCLGRKVVISFEEFSEIKKHHLHIDRFFCY 300

Query: 1133 FSKPDPCYVDDEHIKKLKTLGVSMGKLESAWNEDTKKPTRKTVSDIESKFSSNDDVVAVG 1192
            FSKPDPCYVDDEHIKKLKTLGVSMGKLESAWNEDTKKPTRKTVSDIESKFSSNDDVVAVG
Sbjct: 301  FSKPDPCYVDDEHIKKLKTLGVSMGKLESAWNEDTKKPTRKTVSDIESKFSSNDDVVAVG 360

Query: 1193 DIFFANVEQEWVNQPGGPIAHKCQTLIEPSRLIKLTAQRFIQTFLGKNYMAFHFRRHGFL 1252
            DIFFANVEQEWVNQPGGPIAHKCQTLIEPSRLIKLTAQRFIQTFLGKNYMAFHFRRHGFL
Sbjct: 361  DIFFANVEQEWVNQPGGPIAHKCQTLIEPSRLIKLTAQRFIQTFLGKNYMAFHFRRHGFL 420

Query: 1253 N--NAKQPSCFYPIPQAADCLIRVVERANVPVIYLSTDAAESEYGLLQSLLVLNGKPIPL 1312
               NAKQPSCFYPIPQAADCLIRVVERANVPVIYLSTDAAESEYGLLQSLLVLNGKPIPL
Sbjct: 421  KFCNAKQPSCFYPIPQAADCLIRVVERANVPVIYLSTDAAESEYGLLQSLLVLNGKPIPL 480

Query: 1313 VKRPPRNSAEKWDALLYRHGIEGDSQVEAMLDKTIGAMASTFIGASGSTFTEDILRLRKD 1372
            VKRPPRNSAEKWDALLYRHGIEGDSQVEAMLDKTIGAMASTFIGASGSTFTEDILRLRKD
Sbjct: 481  VKRPPRNSAEKWDALLYRHGIEGDSQVEAMLDKTIGAMASTFIGASGSTFTEDILRLRKD 540

Query: 1373 WGSASTCDEYLCQGEEPNFIAENE 1394
            WGSASTCDEYLCQGEEPNFIAENE
Sbjct: 541  WGSASTCDEYLCQGEEPNFIAENE 564

BLAST of Cp4.1LG02g17670 vs. NCBI nr
Match: XP_022949564.1 (O-fucosyltransferase 36-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 1100 bits (2846), Expect = 0.0
Identity = 556/564 (98.58%), Postives = 557/564 (98.76%), Query Frame = 0

Query: 833  MDDESDDCRNLIDQNSPKRVPSTFDIDDDPHFRPPIQTFRFSIPKFALDKRYYYILAAAL 892
            MDDESDDCRNLIDQNSPKRVPSTFDIDDDPHFRPPIQTFRFSIPKFA DKRYYYILAAAL
Sbjct: 1    MDDESDDCRNLIDQNSPKRVPSTFDIDDDPHFRPPIQTFRFSIPKFAFDKRYYYILAAAL 60

Query: 893  PLCIVVVFFSADIQTLFSTNLSSPLKSSDSLSDRMREAELRALYLLRQQQLGFSDLWNHS 952
            PLCIVVVFFSADIQTLFSTNLSSPLKSSDSLSDRMREAELRALYLLRQQQLGFSDLWNHS
Sbjct: 61   PLCIVVVFFSADIQTLFSTNLSSPLKSSDSLSDRMREAELRALYLLRQQQLGFSDLWNHS 120

Query: 953  LLVQSNSSFNSTSSNNLSSNSASGTPSTEDLKSAILKQISLNKEIQNVLLSPHSSGNIPE 1012
            LLVQSNSSFNSTSSNNLSSNSASGTPSTEDLKSAILKQISLNKEIQNVLLSPHSSGNI E
Sbjct: 121  LLVQSNSSFNSTSSNNLSSNSASGTPSTEDLKSAILKQISLNKEIQNVLLSPHSSGNISE 180

Query: 1013 EVGDAHSMGSFALDRCRKMDQKLSDRRTIEWKPKSNKFLFAICTSGQMSNHLICLEKHMF 1072
            EVGDAHSMGSFALDRCRKMDQKLSDRRTIEWKPKSNKFLFAICTSGQMSNHLICLEKHMF
Sbjct: 181  EVGDAHSMGSFALDRCRKMDQKLSDRRTIEWKPKSNKFLFAICTSGQMSNHLICLEKHMF 240

Query: 1073 FAAILNRILVIPSHKVDFQFSRVIDIDHINSCLGRKVVISFEEFSEIKKHHLHIDRFFCY 1132
            FAAILNRILVIPSHKVDFQFSRVIDIDHINSCLGRKVVISFEEFSEIKKHHLHIDRFFCY
Sbjct: 241  FAAILNRILVIPSHKVDFQFSRVIDIDHINSCLGRKVVISFEEFSEIKKHHLHIDRFFCY 300

Query: 1133 FSKPDPCYVDDEHIKKLKTLGVSMGKLESAWNEDTKKPTRKTVSDIESKFSSNDDVVAVG 1192
            FS PDPCYVDDEHIKKLKTLGVSMGKLESAWNEDTKKPTRKTVSDIESKFSSNDDVVAVG
Sbjct: 301  FSNPDPCYVDDEHIKKLKTLGVSMGKLESAWNEDTKKPTRKTVSDIESKFSSNDDVVAVG 360

Query: 1193 DIFFANVEQEWVNQPGGPIAHKCQTLIEPSRLIKLTAQRFIQTFLGKNYMAFHFRRHGFL 1252
            DIFFANVEQEWVNQPGGPIAHKCQTLIEPSRLIKLTAQRFIQTFLGKNY+AFHFRRHGFL
Sbjct: 361  DIFFANVEQEWVNQPGGPIAHKCQTLIEPSRLIKLTAQRFIQTFLGKNYIAFHFRRHGFL 420

Query: 1253 N--NAKQPSCFYPIPQAADCLIRVVERANVPVIYLSTDAAESEYGLLQSLLVLNGKPIPL 1312
               NAKQPSCFYPIPQAADCLIRVVERANVPVIYLSTDAAESEYGLLQSLLVLNGKPIPL
Sbjct: 421  KFCNAKQPSCFYPIPQAADCLIRVVERANVPVIYLSTDAAESEYGLLQSLLVLNGKPIPL 480

Query: 1313 VKRPPRNSAEKWDALLYRHGIEGDSQVEAMLDKTIGAMASTFIGASGSTFTEDILRLRKD 1372
             KRPPRNSAEKWDALLYRHGIEGDSQVEAMLDKTIGAMASTFIGASGSTFTEDILRLRKD
Sbjct: 481  FKRPPRNSAEKWDALLYRHGIEGDSQVEAMLDKTIGAMASTFIGASGSTFTEDILRLRKD 540

Query: 1373 WGSASTCDEYLCQGEEPNFIAENE 1394
            WGSASTCDEYLCQGEEPNFIAENE
Sbjct: 541  WGSASTCDEYLCQGEEPNFIAENE 564

BLAST of Cp4.1LG02g17670 vs. NCBI nr
Match: XP_023525182.1 (uncharacterized protein LOC111788857 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1093 bits (2827), Expect = 0.0
Identity = 580/580 (100.00%), Postives = 580/580 (100.00%), Query Frame = 0

Query: 205 AEISTAQPVFHSFPRTSYAQSCLHVKKSLASLTSGNEKENYPSWKSLIVPKRVSFTVCQP 264
           AEISTAQPVFHSFPRTSYAQSCLHVKKSLASLTSGNEKENYPSWKSLIVPKRVSFTVCQP
Sbjct: 5   AEISTAQPVFHSFPRTSYAQSCLHVKKSLASLTSGNEKENYPSWKSLIVPKRVSFTVCQP 64

Query: 265 QICRRWFLIRAVATLEPKRVVHDGKGDVSMGGGAEFKNSQMGAAPSTSDVQLSSSSEDTE 324
           QICRRWFLIRAVATLEPKRVVHDGKGDVSMGGGAEFKNSQMGAAPSTSDVQLSSSSEDTE
Sbjct: 65  QICRRWFLIRAVATLEPKRVVHDGKGDVSMGGGAEFKNSQMGAAPSTSDVQLSSSSEDTE 124

Query: 325 EMDARERLRRERISKANKGNTPWNKGRKHSAETLRRIKERTRLAMQNPKIKMKLVNLGRS 384
           EMDARERLRRERISKANKGNTPWNKGRKHSAETLRRIKERTRLAMQNPKIKMKLVNLGRS
Sbjct: 125 EMDARERLRRERISKANKGNTPWNKGRKHSAETLRRIKERTRLAMQNPKIKMKLVNLGRS 184

Query: 385 QSEETRMRIGVGVRMGWQRRRKKLKLQETCYLQWKDLIAEASRQGGLGEEELQWDSYQIM 444
           QSEETRMRIGVGVRMGWQRRRKKLKLQETCYLQWKDLIAEASRQGGLGEEELQWDSYQIM
Sbjct: 185 QSEETRMRIGVGVRMGWQRRRKKLKLQETCYLQWKDLIAEASRQGGLGEEELQWDSYQIM 244

Query: 445 NEQLKKEWQESVEQRKTMPRPVGGRRAPKSAEQRKKISESISAKWADSEYRARVFSGLAK 504
           NEQLKKEWQESVEQRKTMPRPVGGRRAPKSAEQRKKISESISAKWADSEYRARVFSGLAK
Sbjct: 245 NEQLKKEWQESVEQRKTMPRPVGGRRAPKSAEQRKKISESISAKWADSEYRARVFSGLAK 304

Query: 505 YHGTPIGVNRRPRRKRSESTETTRKKEKSGVKSPVAGGSKIESQRLRLRKSKAPRFKDPL 564
           YHGTPIGVNRRPRRKRSESTETTRKKEKSGVKSPVAGGSKIESQRLRLRKSKAPRFKDPL
Sbjct: 305 YHGTPIGVNRRPRRKRSESTETTRKKEKSGVKSPVAGGSKIESQRLRLRKSKAPRFKDPL 364

Query: 565 ASSKLEMIKSIRAERAIAETQKTEAIERARLLIAEAEKAAKALEVAATRSSIARASLLET 624
           ASSKLEMIKSIRAERAIAETQKTEAIERARLLIAEAEKAAKALEVAATRSSIARASLLET
Sbjct: 365 ASSKLEMIKSIRAERAIAETQKTEAIERARLLIAEAEKAAKALEVAATRSSIARASLLET 424

Query: 625 RNLIAEAKQSIESVEIERMASPQSEERNAAASYTYEVGGTSNEEGDSVGGKGNQNGVVQT 684
           RNLIAEAKQSIESVEIERMASPQSEERNAAASYTYEVGGTSNEEGDSVGGKGNQNGVVQT
Sbjct: 425 RNLIAEAKQSIESVEIERMASPQSEERNAAASYTYEVGGTSNEEGDSVGGKGNQNGVVQT 484

Query: 685 MANGTQLFPSSIDKDFDFSKLSLQDILGGEKEVPASSNGHGACHSSFSSLRNHPNGNKPS 744
           MANGTQLFPSSIDKDFDFSKLSLQDILGGEKEVPASSNGHGACHSSFSSLRNHPNGNKPS
Sbjct: 485 MANGTQLFPSSIDKDFDFSKLSLQDILGGEKEVPASSNGHGACHSSFSSLRNHPNGNKPS 544

Query: 745 DHKPSLNGTKLHHLEEKPDSQVISVTKKWVRGRLFEVGDG 784
           DHKPSLNGTKLHHLEEKPDSQVISVTKKWVRGRLFEVGDG
Sbjct: 545 DHKPSLNGTKLHHLEEKPDSQVISVTKKWVRGRLFEVGDG 584

BLAST of Cp4.1LG02g17670 vs. ExPASy TrEMBL
Match: A0A6J1GD69 (O-fucosyltransferase family protein OS=Cucurbita moschata OX=3662 GN=LOC111452878 PE=3 SV=1)

HSP 1 Score: 1100 bits (2846), Expect = 0.0
Identity = 556/564 (98.58%), Postives = 557/564 (98.76%), Query Frame = 0

Query: 833  MDDESDDCRNLIDQNSPKRVPSTFDIDDDPHFRPPIQTFRFSIPKFALDKRYYYILAAAL 892
            MDDESDDCRNLIDQNSPKRVPSTFDIDDDPHFRPPIQTFRFSIPKFA DKRYYYILAAAL
Sbjct: 1    MDDESDDCRNLIDQNSPKRVPSTFDIDDDPHFRPPIQTFRFSIPKFAFDKRYYYILAAAL 60

Query: 893  PLCIVVVFFSADIQTLFSTNLSSPLKSSDSLSDRMREAELRALYLLRQQQLGFSDLWNHS 952
            PLCIVVVFFSADIQTLFSTNLSSPLKSSDSLSDRMREAELRALYLLRQQQLGFSDLWNHS
Sbjct: 61   PLCIVVVFFSADIQTLFSTNLSSPLKSSDSLSDRMREAELRALYLLRQQQLGFSDLWNHS 120

Query: 953  LLVQSNSSFNSTSSNNLSSNSASGTPSTEDLKSAILKQISLNKEIQNVLLSPHSSGNIPE 1012
            LLVQSNSSFNSTSSNNLSSNSASGTPSTEDLKSAILKQISLNKEIQNVLLSPHSSGNI E
Sbjct: 121  LLVQSNSSFNSTSSNNLSSNSASGTPSTEDLKSAILKQISLNKEIQNVLLSPHSSGNISE 180

Query: 1013 EVGDAHSMGSFALDRCRKMDQKLSDRRTIEWKPKSNKFLFAICTSGQMSNHLICLEKHMF 1072
            EVGDAHSMGSFALDRCRKMDQKLSDRRTIEWKPKSNKFLFAICTSGQMSNHLICLEKHMF
Sbjct: 181  EVGDAHSMGSFALDRCRKMDQKLSDRRTIEWKPKSNKFLFAICTSGQMSNHLICLEKHMF 240

Query: 1073 FAAILNRILVIPSHKVDFQFSRVIDIDHINSCLGRKVVISFEEFSEIKKHHLHIDRFFCY 1132
            FAAILNRILVIPSHKVDFQFSRVIDIDHINSCLGRKVVISFEEFSEIKKHHLHIDRFFCY
Sbjct: 241  FAAILNRILVIPSHKVDFQFSRVIDIDHINSCLGRKVVISFEEFSEIKKHHLHIDRFFCY 300

Query: 1133 FSKPDPCYVDDEHIKKLKTLGVSMGKLESAWNEDTKKPTRKTVSDIESKFSSNDDVVAVG 1192
            FS PDPCYVDDEHIKKLKTLGVSMGKLESAWNEDTKKPTRKTVSDIESKFSSNDDVVAVG
Sbjct: 301  FSNPDPCYVDDEHIKKLKTLGVSMGKLESAWNEDTKKPTRKTVSDIESKFSSNDDVVAVG 360

Query: 1193 DIFFANVEQEWVNQPGGPIAHKCQTLIEPSRLIKLTAQRFIQTFLGKNYMAFHFRRHGFL 1252
            DIFFANVEQEWVNQPGGPIAHKCQTLIEPSRLIKLTAQRFIQTFLGKNY+AFHFRRHGFL
Sbjct: 361  DIFFANVEQEWVNQPGGPIAHKCQTLIEPSRLIKLTAQRFIQTFLGKNYIAFHFRRHGFL 420

Query: 1253 N--NAKQPSCFYPIPQAADCLIRVVERANVPVIYLSTDAAESEYGLLQSLLVLNGKPIPL 1312
               NAKQPSCFYPIPQAADCLIRVVERANVPVIYLSTDAAESEYGLLQSLLVLNGKPIPL
Sbjct: 421  KFCNAKQPSCFYPIPQAADCLIRVVERANVPVIYLSTDAAESEYGLLQSLLVLNGKPIPL 480

Query: 1313 VKRPPRNSAEKWDALLYRHGIEGDSQVEAMLDKTIGAMASTFIGASGSTFTEDILRLRKD 1372
             KRPPRNSAEKWDALLYRHGIEGDSQVEAMLDKTIGAMASTFIGASGSTFTEDILRLRKD
Sbjct: 481  FKRPPRNSAEKWDALLYRHGIEGDSQVEAMLDKTIGAMASTFIGASGSTFTEDILRLRKD 540

Query: 1373 WGSASTCDEYLCQGEEPNFIAENE 1394
            WGSASTCDEYLCQGEEPNFIAENE
Sbjct: 541  WGSASTCDEYLCQGEEPNFIAENE 564

BLAST of Cp4.1LG02g17670 vs. ExPASy TrEMBL
Match: A0A6J1KB61 (O-fucosyltransferase family protein OS=Cucurbita maxima OX=3661 GN=LOC111493346 PE=3 SV=1)

HSP 1 Score: 1085 bits (2806), Expect = 0.0
Identity = 548/564 (97.16%), Postives = 553/564 (98.05%), Query Frame = 0

Query: 833  MDDESDDCRNLIDQNSPKRVPSTFDIDDDPHFRPPIQTFRFSIPKFALDKRYYYILAAAL 892
            MDDESDDCRNLIDQNSPKR+PSTFDIDDDPHFRPPIQTFRFSIPKFA DKRYYYILAAAL
Sbjct: 1    MDDESDDCRNLIDQNSPKRIPSTFDIDDDPHFRPPIQTFRFSIPKFAFDKRYYYILAAAL 60

Query: 893  PLCIVVVFFSADIQTLFSTNLSSPLKSSDSLSDRMREAELRALYLLRQQQLGFSDLWNHS 952
            PLCIVVVFFSADIQTLFSTNLSSPLKSSDSLSDRMR+AELRALYLLRQQQLGFSDLWN S
Sbjct: 61   PLCIVVVFFSADIQTLFSTNLSSPLKSSDSLSDRMRQAELRALYLLRQQQLGFSDLWNRS 120

Query: 953  LLVQSNSSFNSTSSNNLSSNSASGTPSTEDLKSAILKQISLNKEIQNVLLSPHSSGNIPE 1012
            LLVQSNSSFNSTSSNNLSSNSASGTPSTEDL SAILKQISLNKEIQNVLLS HSSGNI E
Sbjct: 121  LLVQSNSSFNSTSSNNLSSNSASGTPSTEDLNSAILKQISLNKEIQNVLLSAHSSGNISE 180

Query: 1013 EVGDAHSMGSFALDRCRKMDQKLSDRRTIEWKPKSNKFLFAICTSGQMSNHLICLEKHMF 1072
            EVGDAHSMGSFALDRCRKMDQKLSDRRTIEWKPKSNKFLFAICTSGQMSNHLICLEKHMF
Sbjct: 181  EVGDAHSMGSFALDRCRKMDQKLSDRRTIEWKPKSNKFLFAICTSGQMSNHLICLEKHMF 240

Query: 1073 FAAILNRILVIPSHKVDFQFSRVIDIDHINSCLGRKVVISFEEFSEIKKHHLHIDRFFCY 1132
            FAAILNRILVIPSHKVDFQFSRVIDIDHINSCL RKVVISFEEFSEIKKHHLHIDRFFCY
Sbjct: 241  FAAILNRILVIPSHKVDFQFSRVIDIDHINSCLERKVVISFEEFSEIKKHHLHIDRFFCY 300

Query: 1133 FSKPDPCYVDDEHIKKLKTLGVSMGKLESAWNEDTKKPTRKTVSDIESKFSSNDDVVAVG 1192
            FSKPDPCYVDDEHIKKLK LGVSMGKLESAWNEDTKKPTRKTVSDIESKFSSNDDVVAVG
Sbjct: 301  FSKPDPCYVDDEHIKKLKNLGVSMGKLESAWNEDTKKPTRKTVSDIESKFSSNDDVVAVG 360

Query: 1193 DIFFANVEQEWVNQPGGPIAHKCQTLIEPSRLIKLTAQRFIQTFLGKNYMAFHFRRHGFL 1252
            DIF ANVEQEWVNQPGGPIAHKCQTLIEPSRLIKLTAQRFIQTFLGKNY+AFHFRRHGFL
Sbjct: 361  DIFLANVEQEWVNQPGGPIAHKCQTLIEPSRLIKLTAQRFIQTFLGKNYIAFHFRRHGFL 420

Query: 1253 N--NAKQPSCFYPIPQAADCLIRVVERANVPVIYLSTDAAESEYGLLQSLLVLNGKPIPL 1312
               NAKQPSCFYPIPQAADCLIRVVERANVPVIYLSTDAAESEYGLLQSLLVLNGKPIPL
Sbjct: 421  KFCNAKQPSCFYPIPQAADCLIRVVERANVPVIYLSTDAAESEYGLLQSLLVLNGKPIPL 480

Query: 1313 VKRPPRNSAEKWDALLYRHGIEGDSQVEAMLDKTIGAMASTFIGASGSTFTEDILRLRKD 1372
            VKRPPRNSAEKWDALLYRHGIEGDSQVEAMLDKTIGAM+STFIGASGSTFTEDILRLRKD
Sbjct: 481  VKRPPRNSAEKWDALLYRHGIEGDSQVEAMLDKTIGAMSSTFIGASGSTFTEDILRLRKD 540

Query: 1373 WGSASTCDEYLCQGEEPNFIAENE 1394
            WGSASTCDEYLCQGEEPNFI+ENE
Sbjct: 541  WGSASTCDEYLCQGEEPNFISENE 564

BLAST of Cp4.1LG02g17670 vs. ExPASy TrEMBL
Match: A0A6J1GBF8 (uncharacterized protein LOC111452640 OS=Cucurbita moschata OX=3662 GN=LOC111452640 PE=4 SV=1)

HSP 1 Score: 1077 bits (2784), Expect = 0.0
Identity = 573/580 (98.79%), Postives = 574/580 (98.97%), Query Frame = 0

Query: 205 AEISTAQPVFHSFPRTSYAQSCLHVKKSLASLTSGNEKENYPSWKSLIVPKRVSFTVCQP 264
           AEISTAQPVFHSFPRTSYAQSCLHVKKSLASLTSGNEKENYPSWKSLIVPKRVSFTVCQP
Sbjct: 5   AEISTAQPVFHSFPRTSYAQSCLHVKKSLASLTSGNEKENYPSWKSLIVPKRVSFTVCQP 64

Query: 265 QICRRWFLIRAVATLEPKRVVHDGKGDVSMGGGAEFKNSQMGAAPSTSDVQLSSSSEDTE 324
           QICRR FLIRAVATLEPKRV HDGKGDVSMGGGAEFKNSQMGAAPSTSDVQLSSSSEDTE
Sbjct: 65  QICRRGFLIRAVATLEPKRVAHDGKGDVSMGGGAEFKNSQMGAAPSTSDVQLSSSSEDTE 124

Query: 325 EMDARERLRRERISKANKGNTPWNKGRKHSAETLRRIKERTRLAMQNPKIKMKLVNLGRS 384
           EMDARERLRRERISKANKGNTPWNKGRKHSAETLRRIKERTRLAMQNPKIKMKLVNLGRS
Sbjct: 125 EMDARERLRRERISKANKGNTPWNKGRKHSAETLRRIKERTRLAMQNPKIKMKLVNLGRS 184

Query: 385 QSEETRMRIGVGVRMGWQRRRKKLKLQETCYLQWKDLIAEASRQGGLGEEELQWDSYQIM 444
           QSEETRMRIGVGVRMGWQRRRKKLKLQETCYLQWKDLIAEASRQGGLGEEELQW+SYQIM
Sbjct: 185 QSEETRMRIGVGVRMGWQRRRKKLKLQETCYLQWKDLIAEASRQGGLGEEELQWESYQIM 244

Query: 445 NEQLKKEWQESVEQRKTMPRPVGGRRAPKSAEQRKKISESISAKWADSEYRARVFSGLAK 504
           NEQLKKEWQESVEQRKTMPRPVGGRRAPKSAEQRKKISESISAKWADSEYRARVFSGLAK
Sbjct: 245 NEQLKKEWQESVEQRKTMPRPVGGRRAPKSAEQRKKISESISAKWADSEYRARVFSGLAK 304

Query: 505 YHGTPIGVNRRPRRKRSESTETTRKKEKSGVKSPVAGGSKIESQRLRLRKSKAPRFKDPL 564
           YHGTPIGVNRRPRRKRSESTETTRKKEKSGVKSPVAGGSKIESQRLRLRKSKAPRFKDPL
Sbjct: 305 YHGTPIGVNRRPRRKRSESTETTRKKEKSGVKSPVAGGSKIESQRLRLRKSKAPRFKDPL 364

Query: 565 ASSKLEMIKSIRAERAIAETQKTEAIERARLLIAEAEKAAKALEVAATRSSIARASLLET 624
           ASSKLEMIKSIRAERAIAETQKTEAIERARLLIAEAEKAAKALEVAATRSSIARASLLET
Sbjct: 365 ASSKLEMIKSIRAERAIAETQKTEAIERARLLIAEAEKAAKALEVAATRSSIARASLLET 424

Query: 625 RNLIAEAKQSIESVEIERMASPQSEERNAAASYTYEVGGTSNEEGDSVGGKGNQNGVVQT 684
           RNLIAEAKQSIES EIERMASPQSEERNAAASYTYEVGGTSNEEGDSV GKGNQNGVVQT
Sbjct: 425 RNLIAEAKQSIESAEIERMASPQSEERNAAASYTYEVGGTSNEEGDSVAGKGNQNGVVQT 484

Query: 685 MANGTQLFPSSIDKDFDFSKLSLQDILGGEKEVPASSNGHGACHSSFSSLRNHPNGNKPS 744
           MANGTQLFPSSIDKDFDFSKLSLQDILGGEKEVPASSNGHGACHSSFSSL NHPNGNKPS
Sbjct: 485 MANGTQLFPSSIDKDFDFSKLSLQDILGGEKEVPASSNGHGACHSSFSSLTNHPNGNKPS 544

Query: 745 DHKPSLNGTKLHHLEEKPDSQVISVTKKWVRGRLFEVGDG 784
           DHKPSLNGTKLHHLEEKPDSQVISVTKKWVRGRLFEV DG
Sbjct: 545 DHKPSLNGTKLHHLEEKPDSQVISVTKKWVRGRLFEVADG 584

BLAST of Cp4.1LG02g17670 vs. ExPASy TrEMBL
Match: A0A6J1KDK7 (uncharacterized protein LOC111492886 OS=Cucurbita maxima OX=3661 GN=LOC111492886 PE=4 SV=1)

HSP 1 Score: 1069 bits (2765), Expect = 0.0
Identity = 569/580 (98.10%), Postives = 573/580 (98.79%), Query Frame = 0

Query: 205 AEISTAQPVFHSFPRTSYAQSCLHVKKSLASLTSGNEKENYPSWKSLIVPKRVSFTVCQP 264
           AEISTAQPVFHSFPRTSYAQSCLHVKKS ASLTSGNEKENYPSWKSLIVPKRVSFTVCQP
Sbjct: 5   AEISTAQPVFHSFPRTSYAQSCLHVKKSPASLTSGNEKENYPSWKSLIVPKRVSFTVCQP 64

Query: 265 QICRRWFLIRAVATLEPKRVVHDGKGDVSMGGGAEFKNSQMGAAPSTSDVQLSSSSEDTE 324
           QICRR FLIRAVATLEPKRVVHDG GDVSMGGGAEFKNSQMGAAP+TSDVQLSSS+EDTE
Sbjct: 65  QICRRGFLIRAVATLEPKRVVHDGNGDVSMGGGAEFKNSQMGAAPNTSDVQLSSSNEDTE 124

Query: 325 EMDARERLRRERISKANKGNTPWNKGRKHSAETLRRIKERTRLAMQNPKIKMKLVNLGRS 384
           EMDARERLRRERISKANKGNTPWNKGRKHSAETLRRIKERTRLAMQNPKIKMKLVNLGRS
Sbjct: 125 EMDARERLRRERISKANKGNTPWNKGRKHSAETLRRIKERTRLAMQNPKIKMKLVNLGRS 184

Query: 385 QSEETRMRIGVGVRMGWQRRRKKLKLQETCYLQWKDLIAEASRQGGLGEEELQWDSYQIM 444
           QSEETRMRIGVGVRMGWQRRRKKLKLQETCYLQWKDLIAEASRQGGLGEEELQWDSYQIM
Sbjct: 185 QSEETRMRIGVGVRMGWQRRRKKLKLQETCYLQWKDLIAEASRQGGLGEEELQWDSYQIM 244

Query: 445 NEQLKKEWQESVEQRKTMPRPVGGRRAPKSAEQRKKISESISAKWADSEYRARVFSGLAK 504
           NEQLKKEWQESVEQRKTMPRPVGGRRAPKSAEQRKKISESISAKWADSEYRARVFSGLAK
Sbjct: 245 NEQLKKEWQESVEQRKTMPRPVGGRRAPKSAEQRKKISESISAKWADSEYRARVFSGLAK 304

Query: 505 YHGTPIGVNRRPRRKRSESTETTRKKEKSGVKSPVAGGSKIESQRLRLRKSKAPRFKDPL 564
           YHGTPIGVNRRPRRKRSEST+TTRKKEKSGVKSPVAGG KIESQRLRLRKSKAPRFKDPL
Sbjct: 305 YHGTPIGVNRRPRRKRSESTDTTRKKEKSGVKSPVAGGYKIESQRLRLRKSKAPRFKDPL 364

Query: 565 ASSKLEMIKSIRAERAIAETQKTEAIERARLLIAEAEKAAKALEVAATRSSIARASLLET 624
           ASSKLEMIKSIRAERAIAETQKTEAIERARLLIAEAEKAAKALEVAATRSSIARASLLET
Sbjct: 365 ASSKLEMIKSIRAERAIAETQKTEAIERARLLIAEAEKAAKALEVAATRSSIARASLLET 424

Query: 625 RNLIAEAKQSIESVEIERMASPQSEERNAAASYTYEVGGTSNEEGDSVGGKGNQNGVVQT 684
           RNLIAEAKQSIESVEIERMASPQSEER+AAASYTYEVGGTSNEEGDSV GKGNQNGVVQT
Sbjct: 425 RNLIAEAKQSIESVEIERMASPQSEERSAAASYTYEVGGTSNEEGDSVAGKGNQNGVVQT 484

Query: 685 MANGTQLFPSSIDKDFDFSKLSLQDILGGEKEVPASSNGHGACHSSFSSLRNHPNGNKPS 744
           MANGTQLFPSSIDKDFDF KLSLQDILGGEKEVPASSNGHGACHSSFSSL NHPNGNKPS
Sbjct: 485 MANGTQLFPSSIDKDFDFCKLSLQDILGGEKEVPASSNGHGACHSSFSSLTNHPNGNKPS 544

Query: 745 DHKPSLNGTKLHHLEEKPDSQVISVTKKWVRGRLFEVGDG 784
           DHKPSLNGTKLHHLEEKPDSQVISVTKKWVRGRLFEVGDG
Sbjct: 545 DHKPSLNGTKLHHLEEKPDSQVISVTKKWVRGRLFEVGDG 584

BLAST of Cp4.1LG02g17670 vs. ExPASy TrEMBL
Match: A0A6J1DGJ5 (O-fucosyltransferase family protein OS=Momordica charantia OX=3673 GN=LOC111020294 PE=3 SV=1)

HSP 1 Score: 976 bits (2524), Expect = 0.0
Identity = 493/568 (86.80%), Postives = 524/568 (92.25%), Query Frame = 0

Query: 835  DESDDCRNLIDQNSPKRVPS------TFDIDDDPHFRPPIQTFRFSIPKFALDKRYYYIL 894
            DE DD RNLI++N  KRVPS       F IDDD   RPPIQ FRFS+PKFA DKRYYY+L
Sbjct: 9    DEEDDRRNLIEENGTKRVPSPRSRSIAFQIDDDRDVRPPIQRFRFSVPKFAFDKRYYYLL 68

Query: 895  AAALPLCIVVVFFSADIQTLFSTNLSSPLKSSDSLSDRMREAELRALYLLRQQQLGFSDL 954
            AAA+PL I+VVFFSADI++LFSTN SS LKSSDSL DRMRE+ELRALYLLRQQQLGF DL
Sbjct: 69   AAAMPLFILVVFFSADIRSLFSTNFSSKLKSSDSLGDRMRESELRALYLLRQQQLGFFDL 128

Query: 955  WNHSLLVQSNSSFNSTSSNNLSSNSASGTPSTEDLKSAILKQISLNKEIQNVLLSPHSSG 1014
            WNHSLLVQSNSSFNST +NNL SNSASGTP TEDLKSAILKQISLNKEIQ VLLSPH SG
Sbjct: 129  WNHSLLVQSNSSFNSTPTNNLGSNSASGTPFTEDLKSAILKQISLNKEIQKVLLSPHRSG 188

Query: 1015 NIPEEVGDAHSMGSFALDRCRKMDQKLSDRRTIEWKPKSNKFLFAICTSGQMSNHLICLE 1074
            N+  EVGDAH+MGSFALDRCRKMDQK SDRRTI+WKPKSNKFLFAICTSGQMSNHLICLE
Sbjct: 189  NLSMEVGDAHTMGSFALDRCRKMDQKFSDRRTIDWKPKSNKFLFAICTSGQMSNHLICLE 248

Query: 1075 KHMFFAAILNRILVIPSHKVDFQFSRVIDIDHINSCLGRKVVISFEEFSEIKKHHLHIDR 1134
            KHMFFAAILNRILVIPSHKVD+QFSRVIDIDHIN+CLGRKVV+SFEEFSEIKKHHLHIDR
Sbjct: 249  KHMFFAAILNRILVIPSHKVDYQFSRVIDIDHINACLGRKVVVSFEEFSEIKKHHLHIDR 308

Query: 1135 FFCYFSKPDPCYVDDEHIKKLKTLGVSMGKLESAWNEDTKKPTRKTVSDIESKFSSNDDV 1194
            F CYFSKPDPC++D+EHIKKLK LGVSMGKLESAWNEDTKKP+R+TVSDIESKFSSNDDV
Sbjct: 309  FICYFSKPDPCFMDEEHIKKLKNLGVSMGKLESAWNEDTKKPSRRTVSDIESKFSSNDDV 368

Query: 1195 VAVGDIFFANVEQEWVNQPGGPIAHKCQTLIEPSRLIKLTAQRFIQTFLGKNYMAFHFRR 1254
            VAVGDIFFA+VEQEWVNQPGGPIAH+CQTLIEPSRLIKLTAQRFIQTFLGKNY+A HFRR
Sbjct: 369  VAVGDIFFASVEQEWVNQPGGPIAHQCQTLIEPSRLIKLTAQRFIQTFLGKNYIALHFRR 428

Query: 1255 HGFLN--NAKQPSCFYPIPQAADCLIRVVERANVPVIYLSTDAAESEYGLLQSLLVLNGK 1314
            +GFL   NAK PSCFYPIPQAA+C+IRVVER NVPVIYLSTDAAESEYGLLQSLL+LNGK
Sbjct: 429  YGFLKFCNAKLPSCFYPIPQAAECIIRVVERVNVPVIYLSTDAAESEYGLLQSLLMLNGK 488

Query: 1315 PIPLVKRPPRNSAEKWDALLYRHGIEGDSQVEAMLDKTIGAMASTFIGASGSTFTEDILR 1374
             +PLV+RP RNSAEKWDALLYRHGIEGD QVEAMLDKTI AM+STFIGASGSTFTEDILR
Sbjct: 489  NVPLVRRPHRNSAEKWDALLYRHGIEGDPQVEAMLDKTICAMSSTFIGASGSTFTEDILR 548

Query: 1375 LRKDWGSASTCDEYLCQGEEPNFIAENE 1394
            LRKDWGSAS CDEYLCQGEEPNFIAE E
Sbjct: 549  LRKDWGSASLCDEYLCQGEEPNFIAEKE 576

BLAST of Cp4.1LG02g17670 vs. TAIR 10
Match: AT5G50420.1 (O-fucosyltransferase family protein )

HSP 1 Score: 677.9 bits (1748), Expect = 1.7e-194
Identity = 360/587 (61.33%), Postives = 445/587 (75.81%), Query Frame = 0

Query: 829  LKKVMDDESDDCRNLIDQNSPK-----------------RVPSTFDIDDDPHFRPPIQTF 888
            +++   D+ +D ++LI QN  +                    S F IDD  H    +Q  
Sbjct: 1    MERNSSDDEEDHQHLIPQNDTRIRHREDSVSSNATTIGGNQRSAFQIDDILH---RVQ-- 60

Query: 889  RFSIPKFALDKRYYYILAAALPLCIVVVFFSADIQTLFSTNLSSPLKSSDSLSDRMREAE 948
                 K +L+KR Y I+  +L + I ++F   D + LF+ N SS     D LS+R++E+E
Sbjct: 61   --HRGKISLNKR-YVIVFVSLIISIGLLFLLTDPRELFAANFSS--FKLDPLSNRVKESE 120

Query: 949  LRALYLLRQQQLGFSDLWNHSLLVQSNSSFNSTSSNNLSSNSASGTPSTEDLKSAILKQI 1008
            LRALYLLRQQQL    LWN +L+           S N S N+   +   ED+KSA+ KQI
Sbjct: 121  LRALYLLRQQQLALLSLWNGTLV---------NPSLNQSENALGSSVLFEDVKSAVSKQI 180

Query: 1009 SLNKEIQNVLLSPHSSGNIPEEVGDAHSMGSFALDRCRKMDQKLSDRRTIEWKPKSNKFL 1068
            SLNKEIQ VLLSPH S N      D  S+ +F+ +RCRK+DQKLSDR+T+EWKP+S+KFL
Sbjct: 181  SLNKEIQEVLLSPHRSSNYSGGT-DVDSV-NFSYNRCRKVDQKLSDRKTVEWKPRSDKFL 240

Query: 1069 FAICTSGQMSNHLICLEKHMFFAAILNRILVIPSHKVDFQFSRVIDIDHINSCLGRKVVI 1128
            FAIC SGQMSNHLICLEKHMFFAA+L+R+LVIPS K D+Q+ RVIDI+ IN+CLGR VV+
Sbjct: 241  FAICLSGQMSNHLICLEKHMFFAALLDRVLVIPSSKFDYQYDRVIDIERINTCLGRNVVV 300

Query: 1129 SFEEFSE-IKKHHLHIDRFFCYFSKPDPCYVDDEHIKKLKTLGVSM-GKLESAWNEDTKK 1188
            +F++F E  KK+H  IDRF CYFS P  CYVD+EHIKKLK LG+S+ GKLE+ W+ED KK
Sbjct: 301  AFDQFKEKAKKNHFRIDRFICYFSSPQLCYVDEEHIKKLKGLGISIDGKLEAPWSEDIKK 360

Query: 1189 PTRKTVSDIESKFSSNDDVVAVGDIFFANVEQEWVNQPGGPIAHKCQTLIEPSRLIKLTA 1248
            P+++TV D++ KF S+DDV+A+GD+F+A++EQ+WV QPGGPI HKC+TLIEPS+LI LTA
Sbjct: 361  PSKRTVQDVQMKFKSDDDVIAIGDVFYADMEQDWVMQPGGPINHKCKTLIEPSKLILLTA 420

Query: 1249 QRFIQTFLGKNYMAFHFRRHGFLN--NAKQPSCFYPIPQAADCLIRVVERANVPVIYLST 1308
            QRFIQTFLGKN++A HFRRHGFL   NAK PSCFYPIPQAA+C+ R+VER+N  VIYLST
Sbjct: 421  QRFIQTFLGKNFIALHFRRHGFLKFCNAKSPSCFYPIPQAAECIARIVERSNGAVIYLST 480

Query: 1309 DAAESEYGLLQSLLVLNGKPIPLVKRPPRNSAEKWDALLYRHGIEGDSQVEAMLDKTIGA 1368
            DAAESE  LLQSL+V++GK +PLVKRPPRNSAEKWDALLYRHGIE DSQV+AMLDKTI A
Sbjct: 481  DAAESETSLLQSLVVVDGKIVPLVKRPPRNSAEKWDALLYRHGIEDDSQVDAMLDKTICA 540

Query: 1369 MASTFIGASGSTFTEDILRLRKDWGSASTCDEYLCQGEEPNFIAENE 1395
            M+S FIGASGSTFTEDILRLRKDWG++STCDEYLC+GEEPNFIAE+E
Sbjct: 541  MSSVFIGASGSTFTEDILRLRKDWGTSSTCDEYLCRGEEPNFIAEDE 566

BLAST of Cp4.1LG02g17670 vs. TAIR 10
Match: AT1G53770.1 (O-fucosyltransferase family protein )

HSP 1 Score: 662.5 bits (1708), Expect = 7.3e-190
Identity = 332/572 (58.04%), Postives = 433/572 (75.70%), Query Frame = 0

Query: 835  DESDDCRNLIDQNSPKRVPSTFDIDDDPHFRPP---IQTFRFSIPK-FALDKRYYYILAA 894
            DE  D +NL++++  +     F I D+   + P   +++ R  + + F L+      +  
Sbjct: 18   DEESDLQNLLEESDSQ--IDQFRISDEAAEQRPTFDVESLRSRLRRSFKLNLTKKQSIFI 77

Query: 895  ALPLCIVVVFFSADIQTLFSTNLSSPLKSSDSLSDRMREAELRALYLLRQQQLGFSDLWN 954
             LP+ I++++ S D    FS  + +    S++L+ R+ E++L+ALYLLR+Q+     +WN
Sbjct: 78   FLPIVIILIYLSTDFSNYFSVKVPNSAFRSNTLTGRVHESDLQALYLLRKQESDLFSIWN 137

Query: 955  HSLLVQSNSSFNSTSSNNLSSNSASGTPSTEDLKSAILKQISLNKEIQNVLLSPHSSGNI 1014
            H++             +NLS        + +D+KSA+ +QISLN++IQN LLSPH +GN+
Sbjct: 138  HTV-------------SNLS--------TIDDVKSAVFRQISLNRQIQNALLSPHKTGNV 197

Query: 1015 PEEVGDAHSMGSFALDRCRKMDQKLSDRRTIEWKPKSNKFLFAICTSGQMSNHLICLEKH 1074
              ++G + S G FA   CRK+DQKL+ R+TI+WKP+ +KFLFAIC SGQMSNHLICLEKH
Sbjct: 198  --DIGGS-SDGYFAGGSCRKVDQKLNGRKTIQWKPRPDKFLFAICLSGQMSNHLICLEKH 257

Query: 1075 MFFAAILNRILVIPSHKVDFQFSRVIDIDHINSCLGRKVVISFEEF-----SEIKKHHLH 1134
            MFFAA+L R+LVIPSH+ D+ +SR+IDID IN+CLGR VV+SFEEF     +  K HH+H
Sbjct: 258  MFFAALLKRVLVIPSHRFDYHYSRIIDIDRINTCLGRTVVVSFEEFWKKDKNRKKHHHVH 317

Query: 1135 IDRFFCYFSKPDPCYVDDEHIKKLKTLGVSM-GKLESAWNEDTKKPTRKTVSDIESKFSS 1194
            I+RF CYFSKP+PCYVD EHI KLK LG+++ GKL++ W ED  +P+ KT  ++E+ F S
Sbjct: 318  INRFICYFSKPEPCYVDKEHITKLKALGITVGGKLDTPWEEDIARPSNKTAEEVEANFRS 377

Query: 1195 NDDVVAVGDIFFANVEQEWVNQPGGPIAHKCQTLIEPSRLIKLTAQRFIQTFLGKNYMAF 1254
            +DDV+A+GD+F+ANVE+EWV QPGGP+AHKC+TLIEP+RLI LTAQRFIQTFLGKNY+A 
Sbjct: 378  DDDVIAIGDVFYANVEREWVMQPGGPVAHKCRTLIEPNRLILLTAQRFIQTFLGKNYIAL 437

Query: 1255 HFRRHGFLN--NAKQPSCFYPIPQAADCLIRVVERANVPVIYLSTDAAESEYGLLQSLLV 1314
            HFRRHGFL   NAK PSCF+PIPQAA C+ R++E+   PV+YLSTDAAESE GLLQSLL+
Sbjct: 438  HFRRHGFLKFCNAKNPSCFFPIPQAASCITRLIEKVEAPVLYLSTDAAESETGLLQSLLI 497

Query: 1315 LNGKPIPLVKRPPRNSAEKWDALLYRHGIEGDSQVEAMLDKTIGAMASTFIGASGSTFTE 1374
            LNGK +PLVKRP R+SAEKWDALLYRHG+EGDSQVEAMLDKTI A++S FIGASGSTFTE
Sbjct: 498  LNGKTVPLVKRPARDSAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGASGSTFTE 557

Query: 1375 DILRLRKDWGSASTCDEYLCQGEEPNFIAENE 1395
            DILRLRKDWG+AS CDEYLC  E+PNFIA++E
Sbjct: 558  DILRLRKDWGTASECDEYLCANEQPNFIADHE 563

BLAST of Cp4.1LG02g17670 vs. TAIR 10
Match: AT1G17270.1 (O-fucosyltransferase family protein )

HSP 1 Score: 656.0 bits (1691), Expect = 6.8e-188
Identity = 349/586 (59.56%), Postives = 433/586 (73.89%), Query Frame = 0

Query: 835  DESDDCRNLIDQNSPKRVPSTFDIDDDPHFRPPIQTFRFS-----IPKFALD-------- 894
            DE +D RNLI QN  +        D+D + RP  +T   +      P+ AL         
Sbjct: 7    DEEEDHRNLIPQNDTR--------DNDLNLRPDARTVNMANGGGRSPRSALQIDEILSRA 66

Query: 895  --------KRYYYILAAALPLCIVVVFFSADIQTLFSTNLSSPLKSSDSLSDRMREAELR 954
                     + Y + A +L L + ++F   D +T FS+         D +S R++E+EL+
Sbjct: 67   RNRWKISVNKRYVVAAVSLTLFVGLLFLFTDTRTFFSS------FKLDPMSSRVKESELQ 126

Query: 955  ALYLLRQQQLGFSDLWNHSLLVQSNSSFNSTSSNNLSSNSASGTPSTEDLKSAILKQISL 1014
            AL LLRQQQL    L N        ++FN       SSN+ S +   +++K+A+LKQIS+
Sbjct: 127  ALNLLRQQQLALVSLLN-------RTNFN-------SSNAISSSVVIDNVKAALLKQISV 186

Query: 1015 NKEIQNVLLSPHSSGNIPEEVGDAHSM-GSFALDRCRKMDQKLSDRRTIEWKPKSNKFLF 1074
            NKEI+ VLLSPH +GN       + S  GS+  D CRK+DQKL DR+TIEWKP+ +KFLF
Sbjct: 187  NKEIEEVLLSPHRTGNYSITASGSDSFTGSYNADICRKVDQKLLDRKTIEWKPRPDKFLF 246

Query: 1075 AICTSGQMSNHLICLEKHMFFAAILNRILVIPSHKVDFQFSRVIDIDHINSCLGRKVVIS 1134
            AIC SGQMSNHLICLEKHMFFAA+L+R+LVIPS K D+Q+ +VIDI+ IN+CLGR VVIS
Sbjct: 247  AICLSGQMSNHLICLEKHMFFAALLDRVLVIPSSKFDYQYDKVIDIERINTCLGRTVVIS 306

Query: 1135 FEEFSEI-KKHHLHIDRFFCYFSKPDPCYVDDEHIKKLKTLGVSM-GKLESAWNEDTKKP 1194
            F++F EI KK++ HIDRF CY S P PCYVD++HIKKLK LGVS+ GKLE+ W+ED KKP
Sbjct: 307  FDQFKEIDKKNNAHIDRFICYVSSPQPCYVDEDHIKKLKGLGVSIGGKLEAPWSEDIKKP 366

Query: 1195 TRKTVSDIESKFSSNDDVVAVGDIFFANVEQEWVNQPGGPIAHKCQTLIEPSRLIKLTAQ 1254
            T++T  ++  KF S+D V+A+GD+F+A++EQ+ V QPGGPI HKC+TLIEPSRLI +TAQ
Sbjct: 367  TKRTSQEVVEKFKSDDGVIAIGDVFYADMEQDLVMQPGGPINHKCKTLIEPSRLILVTAQ 426

Query: 1255 RFIQTFLGKNYMAFHFRRHGFLN--NAKQPSCFYPIPQAADCLIRVVERANVPVIYLSTD 1314
            RFIQTFLGKN+++ H RRHGFL   NAK PSCFYPIPQAADC+ R+VERAN PVIYLSTD
Sbjct: 427  RFIQTFLGKNFISLHLRRHGFLKFCNAKSPSCFYPIPQAADCISRMVERANAPVIYLSTD 486

Query: 1315 AAESEYGLLQSLLVLNGKPIPLVKRPPRNSAEKWDALLYRHGIEGDSQVEAMLDKTIGAM 1374
            AAESE GLLQSL+V++GK +PLVKRPP+NSAEKWD+LLYRHGIE DSQV AMLDKTI AM
Sbjct: 487  AAESETGLLQSLVVVDGKVVPLVKRPPQNSAEKWDSLLYRHGIEDDSQVYAMLDKTICAM 546

Query: 1375 ASTFIGASGSTFTEDILRLRKDWGSASTCDEYLCQGEEPNFIAENE 1395
            +S FIGASGSTFTEDILRLRKDWG++S CDEYLC+GEEPNFIAENE
Sbjct: 547  SSVFIGASGSTFTEDILRLRKDWGTSSMCDEYLCRGEEPNFIAENE 564

BLAST of Cp4.1LG02g17670 vs. TAIR 10
Match: AT1G53770.2 (O-fucosyltransferase family protein )

HSP 1 Score: 648.3 bits (1671), Expect = 1.4e-185
Identity = 332/610 (54.43%), Postives = 434/610 (71.15%), Query Frame = 0

Query: 835  DESDDCRNLIDQNSPKRVPSTFDIDDDPHFRPP---IQTFRFSIPK-FALDKRYYYILAA 894
            DE  D +NL++++  +     F I D+   + P   +++ R  + + F L+      +  
Sbjct: 18   DEESDLQNLLEESDSQ--IDQFRISDEAAEQRPTFDVESLRSRLRRSFKLNLTKKQSIFI 77

Query: 895  ALPLCIVVVFFSADIQTLFSTNLSSPLKSSDSLSDRMREAELRALYLLRQQQLGFSDLWN 954
             LP+ I++++ S D    FS  + +    S++L+ R+ E++L+ALYLLR+Q+     +WN
Sbjct: 78   FLPIVIILIYLSTDFSNYFSVKVPNSAFRSNTLTGRVHESDLQALYLLRKQESDLFSIWN 137

Query: 955  HSLLVQSNSSFNSTSSNNLSSNSASGTPSTEDLKSAILKQISLNKEIQNVLLSPHSSGNI 1014
            H++             +NLS        + +D+KSA+ +QISLN++IQN LLSPH +GN+
Sbjct: 138  HTV-------------SNLS--------TIDDVKSAVFRQISLNRQIQNALLSPHKTGNV 197

Query: 1015 PEEVGDAHSMGSFALDRCRKMDQKLSDRRTIEWKPKSNKFLFAICTSGQMSNHLICLEKH 1074
              ++G + S G FA   CRK+DQKL+ R+TI+WKP+ +KFLFAIC SGQMSNHLICLEKH
Sbjct: 198  --DIGGS-SDGYFAGGSCRKVDQKLNGRKTIQWKPRPDKFLFAICLSGQMSNHLICLEKH 257

Query: 1075 MFFAAILNRILVIPSHKVDFQFSRVIDIDHINSCLGRKVVISFEEF-----SEIKKHHLH 1134
            MFFAA+L R+LVIPSH+ D+ +SR+IDID IN+CLGR VV+SFEEF     +  K HH+H
Sbjct: 258  MFFAALLKRVLVIPSHRFDYHYSRIIDIDRINTCLGRTVVVSFEEFWKKDKNRKKHHHVH 317

Query: 1135 IDRFFCYFSKPDPCYVDDEHIKKLKTLGVSM-GKLESAWNEDTKKPTRKTVSDIESKFSS 1194
            I+RF CYFSKP+PCYVD EHI KLK LG+++ GKL++ W ED  +P+ KT  ++E+ F S
Sbjct: 318  INRFICYFSKPEPCYVDKEHITKLKALGITVGGKLDTPWEEDIARPSNKTAEEVEANFRS 377

Query: 1195 NDDVVAVGDIFFANVEQEWVNQPGGPIAHKCQTLIEPSRLIKLTAQRFIQTFLGKNYMAF 1254
            +DDV+A+GD+F+ANVE+EWV QPGGP+AHKC+TLIEP+RLI LTAQRFIQTFLGKNY+A 
Sbjct: 378  DDDVIAIGDVFYANVEREWVMQPGGPVAHKCRTLIEPNRLILLTAQRFIQTFLGKNYIAL 437

Query: 1255 HFRRHGFL----------------------------------------NNAKQPSCFYPI 1314
            HFRRHGFL                                        +NAK PSCF+PI
Sbjct: 438  HFRRHGFLKFWYDLFSTSLLHSTFDDNVVHFITKIKSGTLAKKFSWCASNAKNPSCFFPI 497

Query: 1315 PQAADCLIRVVERANVPVIYLSTDAAESEYGLLQSLLVLNGKPIPLVKRPPRNSAEKWDA 1374
            PQAA C+ R++E+   PV+YLSTDAAESE GLLQSLL+LNGK +PLVKRP R+SAEKWDA
Sbjct: 498  PQAASCITRLIEKVEAPVLYLSTDAAESETGLLQSLLILNGKTVPLVKRPARDSAEKWDA 557

Query: 1375 LLYRHGIEGDSQVEAMLDKTIGAMASTFIGASGSTFTEDILRLRKDWGSASTCDEYLCQG 1395
            LLYRHG+EGDSQVEAMLDKTI A++S FIGASGSTFTEDILRLRKDWG+AS CDEYLC  
Sbjct: 558  LLYRHGLEGDSQVEAMLDKTICALSSVFIGASGSTFTEDILRLRKDWGTASECDEYLCAN 601

BLAST of Cp4.1LG02g17670 vs. TAIR 10
Match: AT1G53800.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G53250.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 379.4 bits (973), Expect = 1.2e-104
Identity = 273/604 (45.20%), Postives = 369/604 (61.09%), Query Frame = 0

Query: 205 AEISTAQPVFHSFPRTSYAQSCLHVKKSLASLTSGNEKENYPSWKSLIVPKRVSFTVCQP 264
           ++I+T QP F +      AQS +H K    SL +         W+     K + F     
Sbjct: 8   SDIATIQPSFQAHLVPLGAQSIIHAK----SLPN--------PWRQSCFSKNLKFYTGHS 67

Query: 265 QICRRWFLIRAVATLEPKRVVHDGKGDVSMGGGAEFKNSQMGAAPSTSDVQLSSSSEDTE 324
            + R   LI AVATLE K               A+ +N +  +  S S    + S++D E
Sbjct: 68  HVRRGKVLITAVATLETKY-------------PAQKENERSSSLSSASSKSSNGSADDGE 127

Query: 325 E-MDARERLRRERISKANKGNTPWNKGRKHSAETLRRIKERTRLAMQNPKIKMKLVNLGR 384
           E +D RE+LRR RISKAN+GNTPWNKGRKHS ETL++I+ERT++AMQ+PKIKMKL NLG 
Sbjct: 128 EQVDDREKLRRMRISKANRGNTPWNKGRKHSPETLQKIRERTKIAMQDPKIKMKLANLGH 187

Query: 385 SQSEETRMRIGVGVRMGWQRRRKKLKLQETCYLQWKDLIAEASRQGGLGEEELQWDSYQI 444
           +Q++ETRM+IG GVRM W RR+++ K+QETC+ +W++L+AEA++QG   EEELQWDSY I
Sbjct: 188 AQNKETRMKIGEGVRMRWARRKERRKVQETCHFEWQNLLAEAAKQGYTDEEELQWDSYNI 247

Query: 445 MNEQLKKEWQESVEQRKTMPRPVGGRRAPKSAEQRKKISESISAKWADSEYRARVFSGLA 504
           +++Q + EW ESVEQRK +      RRAPKS EQR++I+E+I+AKWAD  YR RV SGLA
Sbjct: 248 LDQQNQLEWLESVEQRKAIKGAKSNRRAPKSPEQRRRIAEAIAAKWADPSYRERVCSGLA 307

Query: 505 KYHGTPIGVNRRPRRKRSESTETTRKK---EKSGVKSPVAGGSKIESQRLRLRKSKAPRF 564
           KYHG P+GV RR RR RS++    RKK   +KS   S     S++  Q +++RK K P +
Sbjct: 308 KYHGIPVGVERRRRRPRSDA--EPRKKTPTKKSTRDSEFERQSQV--QVVKVRKRKTPAY 367

Query: 565 KDPLASSKLEMIKSIRAERAIAETQKTEAIERARLLIAEAEKAAKALEVAATRSSIARAS 624
           KDPLASSKLEMIKSIRA+R   E++K +A+ERARLLI+EAEKAAK LE+AA +S +A+AS
Sbjct: 368 KDPLASSKLEMIKSIRAKRVAEESKKMDAVERARLLISEAEKAAKVLEIAALKSPVAQAS 427

Query: 625 LLETRNLIAEAKQSIESVEIERMASPQ--------SEERNAAASYTY---------EVGG 684
           LLE++ LIAEA Q I+S+E+ ++AS +        S + N + S T          E+ G
Sbjct: 428 LLESKKLIAEATQLIKSLEMRQIASDEDGTYPFLLSPQPNDSESETKDTNDQERPGEING 487

Query: 685 TSNEE--GDSVGGKGNQNGVVQTMANG-TQLFPSSIDKDFDFSKLSLQDILGGEKEVPAS 744
           T   +  G+S+      N +   +  G T  F S  D + + S+   +DI  G    P  
Sbjct: 488 THTLQINGESLHMNMRSNDLPTFVIEGTTNQFVS--DMESNTSQGGREDIKLGIVGQPNG 547

Query: 745 SNGHGACHS--SFSSLRNHPNGNKPSDHKPSLNGTKLHHLEEKPDS-QVISVTKKWVRGR 782
           +  H    S  + S   NHP  N              H ++EK  S +  +VTKKWVRGR
Sbjct: 548 TRVHPPAESNGAISLAENHPLPN------------GYHGIDEKAASLESGNVTKKWVRGR 568

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9FK302.4e-19361.33O-fucosyltransferase 36 OS=Arabidopsis thaliana OX=3702 GN=OFUT36 PE=2 SV=1[more]
Q501D61.0e-18858.04O-fucosyltransferase 14 OS=Arabidopsis thaliana OX=3702 GN=OFUT14 PE=2 SV=1[more]
Q84WU09.6e-18759.56O-fucosyltransferase 5 OS=Arabidopsis thaliana OX=3702 GN=OFUT5 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
KAG6606680.10.094.21O-fucosyltransferase 5, partial [Cucurbita argyrosperma subsp. sororia][more]
KAG7036399.10.099.35hypothetical protein SDJN02_00016, partial [Cucurbita argyrosperma subsp. argyro... [more]
XP_023521813.10.099.47O-fucosyltransferase 36-like isoform X1 [Cucurbita pepo subsp. pepo][more]
XP_022949564.10.098.58O-fucosyltransferase 36-like isoform X1 [Cucurbita moschata][more]
XP_023525182.10.0100.00uncharacterized protein LOC111788857 [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
A0A6J1GD690.098.58O-fucosyltransferase family protein OS=Cucurbita moschata OX=3662 GN=LOC11145287... [more]
A0A6J1KB610.097.16O-fucosyltransferase family protein OS=Cucurbita maxima OX=3661 GN=LOC111493346 ... [more]
A0A6J1GBF80.098.79uncharacterized protein LOC111452640 OS=Cucurbita moschata OX=3662 GN=LOC1114526... [more]
A0A6J1KDK70.098.10uncharacterized protein LOC111492886 OS=Cucurbita maxima OX=3661 GN=LOC111492886... [more]
A0A6J1DGJ50.086.80O-fucosyltransferase family protein OS=Momordica charantia OX=3673 GN=LOC1110202... [more]
Match NameE-valueIdentityDescription
AT5G50420.11.7e-19461.33O-fucosyltransferase family protein [more]
AT1G53770.17.3e-19058.04O-fucosyltransferase family protein [more]
AT1G17270.16.8e-18859.56O-fucosyltransferase family protein [more]
AT1G53770.21.4e-18554.43O-fucosyltransferase family protein [more]
AT1G53800.21.2e-10445.20unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 621..641
NoneNo IPR availableCOILSCoilCoilcoord: 572..610
NoneNo IPR availableGENE3D3.40.50.11350coord: 1213..1382
e-value: 6.9E-24
score: 86.5
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 515..535
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 716..760
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 723..748
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 304..321
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 507..546
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 295..326
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 454..480
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 107..130
NoneNo IPR availablePANTHERPTHR13398:SF4SUBFAMILY NOT NAMEDcoord: 834..1394
NoneNo IPR availableCDDcd11296O-FucT_likecoord: 1212..1368
e-value: 8.18161E-17
score: 78.6132
IPR003611Nuclease associated modular domain 3PFAMPF07460NUMOD3coord: 334..361
e-value: 8.8E-8
score: 31.9
IPR045130GDP-fucose protein O-fucosyltransferase 2-likePANTHERPTHR13398GDP-FUCOSE PROTEIN O-FUCOSYLTRANSFERASE 2coord: 834..1394

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG02g17670.1Cp4.1LG02g17670.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006004 fucose metabolic process
biological_process GO:0036066 protein O-linked fucosylation
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0003677 DNA binding
molecular_function GO:0046922 peptide-O-fucosyltransferase activity