CsGy5G004440 (gene) Cucumber (Gy14) v2

NameCsGy5G004440
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v2)
DescriptionO-fucosyltransferase family protein
LocationChr5 : 2929700 .. 2933193 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATTGTTCTCTCTTAAAATTACTGTTCATTACCATACTTTTTTAAAAAAAGAATGTTATACATTTACAGCAACTATCCCATTATTGAGTTGTTACTCACTATACTTACCCGATGAATCCATCACCATTTATATCTAATGGGCCCATTCCAAACAGGCCCACTGTGGATCAATTATGAGCCCATTTAAAAAACGCCCACTACCGGAGTAAGGCGACACCGCCATCTCCTTTCATTTTGCTTCGTCTTCGTATTCTCACCGGAGTGGCACAGAAAGCTTTAAACAAAGAGAAAGGCCATGAATGCTTTCGGTTCGACCAGATCTTCGATCAATCGATGGAACTCGAAGAAACCCAATTTACAGCTCCCGCGTATTTCTTTGTCTGTATGTGCTCTTCTCTTCTGTTTCCTCTTTCTTCTCTACCTGTCCTCCTCCTTCTCCTCCTCTTCCTTCATGTCCTCGACTGCATTTTCGACTTCAAATTCGCGCCAATGCAATACCCAAATCTTGGCTTTGGGTGAGAAATTTCTTTTTTACGCGCCTCACAGCGGGTTCAGCAACCAGCTTTCTGAGTTCAAGAATGCTATTTTGATGGCCGGGATTCTCAACCGGACTCTTGTTGTTCCGCCGATTTTGGATCACCATGCGGTTGCTCTTGGGAGTTGTCCCAAATTTAGAGTTCCGGATCCTGGTGAGATTCGATTCTCGGTTTGGGAGCATATGCTTCAGCTTCTTCGAAATGGAAGGTAATCATTTCCACCTTCGTTTGTGTATATCTATATATGTAATTGTATTTTCAATGGCTTTATTTTTCTTGAACTGATGAGTGTTCTGGTTGTTGGGTTGCTGGATTGTAATTTAAGTTGCTTCTAAATCAGATAATTGAACCGTTTTGTTAAACTCGTAGTGATTCTTTTCTTTTGTGACTGATTAAAGAAGGAGCTGAAAAGAACAAGGCTTAATTATAAAAAAATACCCCTAAATTATGTCATTTGCTAAAGTGTACATTGCAATATTACCCTCGAAGGTTCGTAAATGTAGAAATTCTACGAATGCTAGGTGAAAAATGGGACTTGGAACAGTGTGTTGGTAAACTGAAACATAAATTGGGATCGACACTCCGTTCTAAGTCACCACAACATTTCAAGTTGCAATTTTTATACAATATTCTTACGAGCTTCTACTTCGAGGGCAGTTTTAAAACCTTTATGAAAGGCCCATGGTAATATTATAACTTTTGAAAGAATAAGGACATTAACTTCAGGAGTACTATTTTGTAATTTAGCCCCCCCAAAAATATAATTCATTGGTAATATTTGATGACATTAGCTATGCATTTCAATTTTGTGGCAGGTTTGTTTCGATGGCGGATATTGTAGATATTTCATCATTAACTTCTTACTCTTCTGTTAAAGCCATAGATTTTAGGACCTTTGCATATTTATGGTGTGGAGTGCGTCTGGAAAGTGTTTGTGCAAATGAATACGACAACCTAAAGCAATGTGGTCGTCTACTAGCAGGGCTTGATGGGAATGTAGACAAATGTTTACATGCTGTAGATGAAGATTGCAAAACTACAGTTTGGACTTACCAAAATAATGAAGTTGATGGAGCATTGGACCTGTTTCAGCCTAATGAACAGCTTAAGAAGAAAAAGAAGGTGTCATATGTCAGACGCCGTCGAGATGTATATAGAACACTTGGACGTGATTCGAAAGCTGGATCAGCTACCGTTTTGGCATTTGGAAGTCTTTTTACTGCTCCATACAGAGGTTCAGAGTTGTATATTGATATCCATGGAGTTAGTAAAGATCAAAGAATTAGTTCTCTGATGAAGAACATTGAGTATCTTCCATTTGTCCCAGAAATCTTGAGTGCAGGAAAAGAGTATATTGACAAGATCATAAAAGCTCCATTCCTTTGTGCTCAACTGAGATTGTTAGATGGGCAGTTTAAAAATCACTGGAAGGCTACTTTTTTGGCCCTCCAACAGAAATTAGACTCTATATTAGAGAATGCTAATGAACCTATTCATGTTTTTGTGATGACTGATCTTCCTAAATCTAATTGGACTGGAAGCTACTTAGGGGATTTGGATAGTGATTCAAATCACTTCAAACTCTTTTTCCTCGAAGAAAGCGACGAATTGGTCCTACGAGCATCCAAAAAGGTGATGGCTGTAGGACATGGCTTGAGATGGACATCCAATGCTTTTGGTCCTGGAAGAATTCGTGATATGAAGAAAAAATGTGCTTCTGAGAAATTACCGGATGTTCTCTTATATATAGAGGAAACTGTTTGTAGTTGTGCTTCACTTGGTTTTGTCGGTACTGCTGGATCCACAATTGCAGAAAGCATAGAGCTGATGAGAAAATATGGAGTATGTTGAAACCAAACTTGAACCAAATCTTGACATTTGCCTTCAAACTCAAAATTCTTTACCATCTCCACTGAAGTTATTCTCTGAGGATGGAGTCAGTGCCGATGTACTTTGATTGAATTACATGACCATCTGATACTGCTTTTCCTGAAATTCTCTCGTCTATTCTTGTGAGCGATGGTGATTTGCCGCATCATGGAATCATGCTTCAAATTTAGCATCAACTAATATGGTGGATTGTACCAAAATGGGTCATCGTATCAGTTTTCAAGGTTGCACTTGGATGTAAGTATACATACAGTTTCATATTTACCATGAATAGGAATGCATATTTACTTAGTTCTTTGCACAATGGTGCAGCAACCGATAGTTTAGGAGATTGATATAGTTCATTGAAGCTCATGTGAGCTTAAGTCAATGATAGTTGACACGATCTTCCTTCGTAGAAGTGGAAGGTTTGATCCCCGACCTTGTAACCGTGCTAAAAGAAATATAGTTCCTTAAAGGAAATCCCCACGGTTGTTTTCTTAGAAAGATTGAATTTCTGTACTTATGGTTTTAGAAATAGAAAATACATGGAATCCATGGTGGACTTCTCCTCATCCCTCACTGTATTTAAAATGTAGCTTCCGTTCTCAGAATTTAACCAAGGATTCAAAAGTACATGAAAAGATCATCATTCTTTTCATAGAATAGAAACTAAAAACAAAATAGAAACTAAAAACAAAATCATTCTTGGGGCTTGGACATAAGAAACTAACTCATAGTCCTTTTTTTTGTTCTCTCTTCCAGATGTAAGTCATTTTACATGGAATGAGATGTCTGGATGTGGCAGCTAAGGACACACAATGCAAAATGAAGTAAGATTCATTTCAAACAATTTGTCTCCTTAGACATTGTAATTGTATCACATCACAATATTCTATTGACTCGAACCGAGCTGGTTTCAATTCATTAGGAATTTTGTTTATCGGCTTTATTAAAAGAATTGGTGAAAATGAATTTTGACTGTTAATTTTGTACCTCACATATTCATTATCAATCTAATTAAATAAATCATACTGTTGTCAACAGACTATCATTCATCAGTCTTTACAAAAATTAATAAAAATATAGG

mRNA sequence

ATTGTTCTCTCTTAAAATTACTGTTCATTACCATACTTTTTTAAAAAAAGAATGTTATACATTTACAGCAACTATCCCATTATTGAGTTGTTACTCACTATACTTACCCGATGAATCCATCACCATTTATATCTAATGGGCCCATTCCAAACAGGCCCACTGTGGATCAATTATGAGCCCATTTAAAAAACGCCCACTACCGGAGTAAGGCGACACCGCCATCTCCTTTCATTTTGCTTCGTCTTCGTATTCTCACCGGAGTGGCACAGAAAGCTTTAAACAAAGAGAAAGGCCATGAATGCTTTCGGTTCGACCAGATCTTCGATCAATCGATGGAACTCGAAGAAACCCAATTTACAGCTCCCGCGTATTTCTTTGTCTGTATGTGCTCTTCTCTTCTGTTTCCTCTTTCTTCTCTACCTGTCCTCCTCCTTCTCCTCCTCTTCCTTCATGTCCTCGACTGCATTTTCGACTTCAAATTCGCGCCAATGCAATACCCAAATCTTGGCTTTGGGTGAGAAATTTCTTTTTTACGCGCCTCACAGCGGGTTCAGCAACCAGCTTTCTGAGTTCAAGAATGCTATTTTGATGGCCGGGATTCTCAACCGGACTCTTGTTGTTCCGCCGATTTTGGATCACCATGCGGTTGCTCTTGGGAGTTGTCCCAAATTTAGAGTTCCGGATCCTGGTGAGATTCGATTCTCGGTTTGGGAGCATATGCTTCAGCTTCTTCGAAATGGAAGGTTTGTTTCGATGGCGGATATTGTAGATATTTCATCATTAACTTCTTACTCTTCTGTTAAAGCCATAGATTTTAGGACCTTTGCATATTTATGGTGTGGAGTGCGTCTGGAAAGTGTTTGTGCAAATGAATACGACAACCTAAAGCAATGTGGTCGTCTACTAGCAGGGCTTGATGGGAATGTAGACAAATGTTTACATGCTGTAGATGAAGATTGCAAAACTACAGTTTGGACTTACCAAAATAATGAAGTTGATGGAGCATTGGACCTGTTTCAGCCTAATGAACAGCTTAAGAAGAAAAAGAAGGTGTCATATGTCAGACGCCGTCGAGATGTATATAGAACACTTGGACGTGATTCGAAAGCTGGATCAGCTACCGTTTTGGCATTTGGAAGTCTTTTTACTGCTCCATACAGAGGTTCAGAGTTGTATATTGATATCCATGGAGTTAGTAAAGATCAAAGAATTAGTTCTCTGATGAAGAACATTGAGTATCTTCCATTTGTCCCAGAAATCTTGAGTGCAGGAAAAGAGTATATTGACAAGATCATAAAAGCTCCATTCCTTTGTGCTCAACTGAGATTGTTAGATGGGCAGTTTAAAAATCACTGGAAGGCTACTTTTTTGGCCCTCCAACAGAAATTAGACTCTATATTAGAGAATGCTAATGAACCTATTCATGTTTTTGTGATGACTGATCTTCCTAAATCTAATTGGACTGGAAGCTACTTAGGGGATTTGGATAGTGATTCAAATCACTTCAAACTCTTTTTCCTCGAAGAAAGCGACGAATTGGTCCTACGAGCATCCAAAAAGGTGATGGCTGTAGGACATGGCTTGAGATGGACATCCAATGCTTTTGGTCCTGGAAGAATTCGTGATATGAAGAAAAAATGTGCTTCTGAGAAATTACCGGATGTTCTCTTATATATAGAGGAAACTGTTTGTAGTTGTGCTTCACTTGGTTTTGTCGGTACTGCTGGATCCACAATTGCAGAAAGCATAGAGCTGATGAGAAAATATGGAGTATGTTGAAACCAAACTTGAACCAAATCTTGACATTTGCCTTCAAACTCAAAATTCTTTACCATCTCCACTGAAGTTATTCTCTGAGGATGGAGTCAGTGCCGATGTACTTTGATTGAATTACATGACCATCTGATACTGCTTTTCCTGAAATTCTCTCGTCTATTCTTGTGAGCGATGGTGATTTGCCGCATCATGGAATCATGCTTCAAATTTAGCATCAACTAATATGGTGGATTGTACCAAAATGGGTCATCGTATCAGTTTTCAAGGTTGCACTTGGATATGTAAGTCATTTTACATGGAATGAGATGTCTGGATGTGGCAGCTAAGGACACACAATGCAAAATGAAGTAAGATTCATTTCAAACAATTTGTCTCCTTAGACATTGTAATTGTATCACATCACAATATTCTATTGACTCGAACCGAGCTGGTTTCAATTCATTAGGAATTTTGTTTATCGGCTTTATTAAAAGAATTGGTGAAAATGAATTTTGACTGTTAATTTTGTACCTCACATATTCATTATCAATCTAATTAAATAAATCATACTGTTGTCAACAGACTATCATTCATCAGTCTTTACAAAAATTAATAAAAATATAGG

Coding sequence (CDS)

ATGAATGCTTTCGGTTCGACCAGATCTTCGATCAATCGATGGAACTCGAAGAAACCCAATTTACAGCTCCCGCGTATTTCTTTGTCTGTATGTGCTCTTCTCTTCTGTTTCCTCTTTCTTCTCTACCTGTCCTCCTCCTTCTCCTCCTCTTCCTTCATGTCCTCGACTGCATTTTCGACTTCAAATTCGCGCCAATGCAATACCCAAATCTTGGCTTTGGGTGAGAAATTTCTTTTTTACGCGCCTCACAGCGGGTTCAGCAACCAGCTTTCTGAGTTCAAGAATGCTATTTTGATGGCCGGGATTCTCAACCGGACTCTTGTTGTTCCGCCGATTTTGGATCACCATGCGGTTGCTCTTGGGAGTTGTCCCAAATTTAGAGTTCCGGATCCTGGTGAGATTCGATTCTCGGTTTGGGAGCATATGCTTCAGCTTCTTCGAAATGGAAGGTTTGTTTCGATGGCGGATATTGTAGATATTTCATCATTAACTTCTTACTCTTCTGTTAAAGCCATAGATTTTAGGACCTTTGCATATTTATGGTGTGGAGTGCGTCTGGAAAGTGTTTGTGCAAATGAATACGACAACCTAAAGCAATGTGGTCGTCTACTAGCAGGGCTTGATGGGAATGTAGACAAATGTTTACATGCTGTAGATGAAGATTGCAAAACTACAGTTTGGACTTACCAAAATAATGAAGTTGATGGAGCATTGGACCTGTTTCAGCCTAATGAACAGCTTAAGAAGAAAAAGAAGGTGTCATATGTCAGACGCCGTCGAGATGTATATAGAACACTTGGACGTGATTCGAAAGCTGGATCAGCTACCGTTTTGGCATTTGGAAGTCTTTTTACTGCTCCATACAGAGGTTCAGAGTTGTATATTGATATCCATGGAGTTAGTAAAGATCAAAGAATTAGTTCTCTGATGAAGAACATTGAGTATCTTCCATTTGTCCCAGAAATCTTGAGTGCAGGAAAAGAGTATATTGACAAGATCATAAAAGCTCCATTCCTTTGTGCTCAACTGAGATTGTTAGATGGGCAGTTTAAAAATCACTGGAAGGCTACTTTTTTGGCCCTCCAACAGAAATTAGACTCTATATTAGAGAATGCTAATGAACCTATTCATGTTTTTGTGATGACTGATCTTCCTAAATCTAATTGGACTGGAAGCTACTTAGGGGATTTGGATAGTGATTCAAATCACTTCAAACTCTTTTTCCTCGAAGAAAGCGACGAATTGGTCCTACGAGCATCCAAAAAGGTGATGGCTGTAGGACATGGCTTGAGATGGACATCCAATGCTTTTGGTCCTGGAAGAATTCGTGATATGAAGAAAAAATGTGCTTCTGAGAAATTACCGGATGTTCTCTTATATATAGAGGAAACTGTTTGTAGTTGTGCTTCACTTGGTTTTGTCGGTACTGCTGGATCCACAATTGCAGAAAGCATAGAGCTGATGAGAAAATATGGAGTATGTTGA

Protein sequence

MNAFGSTRSSINRWNSKKPNLQLPRISLSVCALLFCFLFLLYLSSSFSSSSFMSSTAFSTSNSRQCNTQILALGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVALGSCPKFRVPDPGEIRFSVWEHMLQLLRNGRFVSMADIVDISSLTSYSSVKAIDFRTFAYLWCGVRLESVCANEYDNLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQNNEVDGALDLFQPNEQLKKKKKVSYVRRRRDVYRTLGRDSKAGSATVLAFGSLFTAPYRGSELYIDIHGVSKDQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLALQQKLDSILENANEPIHVFVMTDLPKSNWTGSYLGDLDSDSNHFKLFFLEESDELVLRASKKVMAVGHGLRWTSNAFGPGRIRDMKKKCASEKLPDVLLYIEETVCSCASLGFVGTAGSTIAESIELMRKYGVC
BLAST of CsGy5G004440 vs. NCBI nr
Match: XP_004152423.1 (PREDICTED: uncharacterized protein LOC101209896 [Cucumis sativus] >KGN50277.1 hypothetical protein Csa_5G165220 [Cucumis sativus])

HSP 1 Score: 895.6 bits (2313), Expect = 7.7e-257
Identity = 491/494 (99.39%), Postives = 493/494 (99.80%), Query Frame = 0

Query: 1   MNAFGSTRSSINRWNSKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60
           MNAFGSTRSSINRWNSKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 1   MNAFGSTRSSINRWNSKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60

Query: 61  XXSRQCNTQILALGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL 120
           XXSRQCNTQILALGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL
Sbjct: 61  XXSRQCNTQILALGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL 120

Query: 121 GSCPKFRVPDPGEIRFSVWEHMLQLLRNGRFVSMADIVDISSLTSYSSVKAIDFRTFAYL 180
           GSCPKFRVPDPGEIRFSVWEHMLQLLRNGR+VSMADIVDISSLTSYSSVKAIDFRTFAYL
Sbjct: 121 GSCPKFRVPDPGEIRFSVWEHMLQLLRNGRYVSMADIVDISSLTSYSSVKAIDFRTFAYL 180

Query: 181 WCGVRLESVCANEYDNLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQNNEVDGALDL 240
           WCGVRLESVCANEY+NLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQNNEVDGALDL
Sbjct: 181 WCGVRLESVCANEYNNLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQNNEVDGALDL 240

Query: 241 FQPNEQLKKKKKVSYVRRRRDVYRTLGRDSKAGSATVLAFGSLFTAPYRGSELYIDIHGV 300
           FQPNEQLKKKKKVSYVRRRRDVYRTLGRDSKAGSATVLAFGSLFTAPYRGSELYIDIHGV
Sbjct: 241 FQPNEQLKKKKKVSYVRRRRDVYRTLGRDSKAGSATVLAFGSLFTAPYRGSELYIDIHGV 300

Query: 301 SKDQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLA 360
           SKDQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLA
Sbjct: 301 SKDQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLA 360

Query: 361 LQQKLDSILENANEPIHVFVMTDLPKSNWTGSYLGDLDSDSNHFKLFFLEESDELVLRAS 420
           LQQKLDSILENANEPIHVFVMTDLPKSNWTGSYLGDLDSDSNHFKLFFLEESDELVLRAS
Sbjct: 361 LQQKLDSILENANEPIHVFVMTDLPKSNWTGSYLGDLDSDSNHFKLFFLEESDELVLRAS 420

Query: 421 KKVMAVGHGLRWTSNAFGPGRIRDMKKKCASEKLPDVLLYIEETVCSCASLGFVGTAGST 480
           KKVMAVGHGLRWTSNAFGPG IRDMKKKCASEKLPDVLLYIEETVCSCASLGFVGTAGST
Sbjct: 421 KKVMAVGHGLRWTSNAFGPGSIRDMKKKCASEKLPDVLLYIEETVCSCASLGFVGTAGST 480

Query: 481 IAESIELMRKYGVC 495
           IAESIELMRKYGVC
Sbjct: 481 IAESIELMRKYGVC 494

BLAST of CsGy5G004440 vs. NCBI nr
Match: XP_008437048.1 (PREDICTED: uncharacterized protein LOC103482591 [Cucumis melo])

HSP 1 Score: 867.8 bits (2241), Expect = 1.7e-248
Identity = 453/494 (91.70%), Postives = 464/494 (93.93%), Query Frame = 0

Query: 1   MNAFGSTRSSINRWNSKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60
           MN FGSTRSSINRWNSK                    XXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 1   MNVFGSTRSSINRWNSKKPNLQLRRFSLSVFVLLFCFXXXXXXXXXXXXXXXXXXXXXXX 60

Query: 61  XXSRQCNTQILALGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL 120
             SRQCNTQIL LGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL
Sbjct: 61  SNSRQCNTQILGLGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL 120

Query: 121 GSCPKFRVPDPGEIRFSVWEHMLQLLRNGRFVSMADIVDISSLTSYSSVKAIDFRTFAYL 180
           GSCPKFRVPDPGEIRFSVWEHMLQLLRNGR+VSM DIVDISSLTSYSSVKAIDFRTFAYL
Sbjct: 121 GSCPKFRVPDPGEIRFSVWEHMLQLLRNGRYVSMTDIVDISSLTSYSSVKAIDFRTFAYL 180

Query: 181 WCGVRLESVCANEYDNLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQNNEVDGALDL 240
           WCGV LESVC+NEY+NLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQ+NEVDGALDL
Sbjct: 181 WCGVHLESVCSNEYNNLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQSNEVDGALDL 240

Query: 241 FQPNEQLKKKKKVSYVRRRRDVYRTLGRDSKAGSATVLAFGSLFTAPYRGSELYIDIHGV 300
           FQPNEQLKKKKKVSYVRRRRDVYRTLG DSKAGSATVLAFGSLFTAPY+GSELYIDIHGV
Sbjct: 241 FQPNEQLKKKKKVSYVRRRRDVYRTLGPDSKAGSATVLAFGSLFTAPYKGSELYIDIHGV 300

Query: 301 SKDQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLA 360
           SKDQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLA
Sbjct: 301 SKDQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLA 360

Query: 361 LQQKLDSILENANEPIHVFVMTDLPKSNWTGSYLGDLDSDSNHFKLFFLEESDELVLRAS 420
           LQQKL+SILENANEPI VFVMTDLP+SNWTGSYLGDLDSDSNHFKLFFL+E DELVLRAS
Sbjct: 361 LQQKLNSILENANEPIRVFVMTDLPESNWTGSYLGDLDSDSNHFKLFFLKEHDELVLRAS 420

Query: 421 KKVMAVGHGLRWTSNAFGPGRIRDMKKKCASEKLPDVLLYIEETVCSCASLGFVGTAGST 480
           KKVMAVGHGLRWTSNAFGPGRIR+MKK+CA E+LPDVLLYIEETVCSCASLGFVGTAGST
Sbjct: 421 KKVMAVGHGLRWTSNAFGPGRIRNMKKECAPERLPDVLLYIEETVCSCASLGFVGTAGST 480

Query: 481 IAESIELMRKYGVC 495
           IAESIELMRKYGVC
Sbjct: 481 IAESIELMRKYGVC 494

BLAST of CsGy5G004440 vs. NCBI nr
Match: XP_022958125.1 (O-fucosyltransferase 30-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 787.3 bits (2032), Expect = 2.9e-224
Identity = 383/433 (88.45%), Postives = 412/433 (95.15%), Query Frame = 0

Query: 63  SRQCNTQILALGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVALGS 122
           SRQCN++IL LGE+FLFYAPHSGF+NQLSEFKNAILMAGILNRTLV+PPILDHHAVALGS
Sbjct: 108 SRQCNSRILDLGERFLFYAPHSGFNNQLSEFKNAILMAGILNRTLVIPPILDHHAVALGS 167

Query: 123 CPKFRVPDPGEIRFSVWEHMLQLLRNGRFVSMADIVDISSLTSYSSVKAIDFRTFAYLWC 182
           CPKFRV DPGEIRFSVWEHM +LLR+GR+VSMADIVDISSL SY+S+KAIDFRTFAYLWC
Sbjct: 168 CPKFRVLDPGEIRFSVWEHMFELLRDGRYVSMADIVDISSLASYTSIKAIDFRTFAYLWC 227

Query: 183 GVRLESVCANEYDNLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQNNEVDGALDLFQ 242
           GV LESVC+NE+ NLKQCGRLLAGLDGNVDKCLHAVDEDC+TTVWTYQN EVDG LDLFQ
Sbjct: 228 GVDLESVCSNEF-NLKQCGRLLAGLDGNVDKCLHAVDEDCRTTVWTYQNGEVDGVLDLFQ 287

Query: 243 PNEQLKKKKKVSYVRRRRDVYRTLGRDSKAGSATVLAFGSLFTAPYRGSELYIDIHGVSK 302
           PNEQLKKKKKV+YVRRRRDVYRTLG DS+A SATVLAFGSLFTAPY+ SELYIDIH V  
Sbjct: 288 PNEQLKKKKKVTYVRRRRDVYRTLGPDSEAESATVLAFGSLFTAPYKSSELYIDIHEVRG 347

Query: 303 DQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLALQ 362
           DQRISSLMKNIE+LPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLAL+
Sbjct: 348 DQRISSLMKNIEHLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLALK 407

Query: 363 QKLDSILENANE-PIHVFVMTDLPKSNWTGSYLGDLDSDSNHFKLFFLEESDELVLRASK 422
           QKLDSIL+NANE PIH+FVMTDLP+SNWTGSYLGDL SDSN FKLFFL+E DELV+RAS+
Sbjct: 408 QKLDSILQNANEQPIHIFVMTDLPESNWTGSYLGDLVSDSNRFKLFFLKEHDELVVRASE 467

Query: 423 KVMAVGHGLRWTSNAFGPGRIRDMKKKCASEKLPDVLLYIEETVCSCASLGFVGTAGSTI 482
           KVMAVGHGLR  S+AFGPGRIRDMKKKCA+EKLPD+LLYIEETVCSCASLGF+GTAGSTI
Sbjct: 468 KVMAVGHGLRLASSAFGPGRIRDMKKKCAAEKLPDILLYIEETVCSCASLGFIGTAGSTI 527

Query: 483 AESIELMRKYGVC 495
           AESIELM KYG+C
Sbjct: 528 AESIELMSKYGLC 539

BLAST of CsGy5G004440 vs. NCBI nr
Match: XP_022958126.1 (O-fucosyltransferase 30-like isoform X2 [Cucurbita moschata])

HSP 1 Score: 787.3 bits (2032), Expect = 2.9e-224
Identity = 383/433 (88.45%), Postives = 412/433 (95.15%), Query Frame = 0

Query: 63  SRQCNTQILALGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVALGS 122
           SRQCN++IL LGE+FLFYAPHSGF+NQLSEFKNAILMAGILNRTLV+PPILDHHAVALGS
Sbjct: 108 SRQCNSRILDLGERFLFYAPHSGFNNQLSEFKNAILMAGILNRTLVIPPILDHHAVALGS 167

Query: 123 CPKFRVPDPGEIRFSVWEHMLQLLRNGRFVSMADIVDISSLTSYSSVKAIDFRTFAYLWC 182
           CPKFRV DPGEIRFSVWEHM +LLR+GR+VSMADIVDISSL SY+S+KAIDFRTFAYLWC
Sbjct: 168 CPKFRVLDPGEIRFSVWEHMFELLRDGRYVSMADIVDISSLASYTSIKAIDFRTFAYLWC 227

Query: 183 GVRLESVCANEYDNLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQNNEVDGALDLFQ 242
           GV LESVC+NE+ NLKQCGRLLAGLDGNVDKCLHAVDEDC+TTVWTYQN EVDG LDLFQ
Sbjct: 228 GVDLESVCSNEF-NLKQCGRLLAGLDGNVDKCLHAVDEDCRTTVWTYQNGEVDGVLDLFQ 287

Query: 243 PNEQLKKKKKVSYVRRRRDVYRTLGRDSKAGSATVLAFGSLFTAPYRGSELYIDIHGVSK 302
           PNEQLKKKKKV+YVRRRRDVYRTLG DS+A SATVLAFGSLFTAPY+ SELYIDIH V  
Sbjct: 288 PNEQLKKKKKVTYVRRRRDVYRTLGPDSEAESATVLAFGSLFTAPYKSSELYIDIHEVRG 347

Query: 303 DQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLALQ 362
           DQRISSLMKNIE+LPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLAL+
Sbjct: 348 DQRISSLMKNIEHLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLALK 407

Query: 363 QKLDSILENANE-PIHVFVMTDLPKSNWTGSYLGDLDSDSNHFKLFFLEESDELVLRASK 422
           QKLDSIL+NANE PIH+FVMTDLP+SNWTGSYLGDL SDSN FKLFFL+E DELV+RAS+
Sbjct: 408 QKLDSILQNANEQPIHIFVMTDLPESNWTGSYLGDLVSDSNRFKLFFLKEHDELVVRASE 467

Query: 423 KVMAVGHGLRWTSNAFGPGRIRDMKKKCASEKLPDVLLYIEETVCSCASLGFVGTAGSTI 482
           KVMAVGHGLR  S+AFGPGRIRDMKKKCA+EKLPD+LLYIEETVCSCASLGF+GTAGSTI
Sbjct: 468 KVMAVGHGLRLASSAFGPGRIRDMKKKCAAEKLPDILLYIEETVCSCASLGFIGTAGSTI 527

Query: 483 AESIELMRKYGVC 495
           AESIELM KYG+C
Sbjct: 528 AESIELMSKYGLC 539

BLAST of CsGy5G004440 vs. NCBI nr
Match: XP_023549514.1 (O-fucosyltransferase 30-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 780.0 bits (2013), Expect = 4.7e-222
Identity = 391/494 (79.15%), Postives = 417/494 (84.41%), Query Frame = 0

Query: 1   MNAFGSTRSSINRWNSKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60
           MN FGS+RS IN WNSK                                           
Sbjct: 1   MNIFGSSRSPINGWNSK-KSNFLHRRFSLPVLALLFFCSFFLLYLFSSYSSFMPSTAFST 60

Query: 61  XXSRQCNTQILALGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL 120
             SRQC+++IL LGE+FLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL
Sbjct: 61  SNSRQCSSRILDLGERFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL 120

Query: 121 GSCPKFRVPDPGEIRFSVWEHMLQLLRNGRFVSMADIVDISSLTSYSSVKAIDFRTFAYL 180
           GSCPKFRV DPGEIRFSVWEHML+LLRNGR+VSMADIVDISSL SYSS+KAIDFRTFAYL
Sbjct: 121 GSCPKFRVTDPGEIRFSVWEHMLELLRNGRYVSMADIVDISSLASYSSIKAIDFRTFAYL 180

Query: 181 WCGVRLESVCANEYDNLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQNNEVDGALDL 240
           WCGV LESVC+NEY NLKQCGRLLAGLDGNVDKCLHAVDEDC+TTVWTY+N EVD ALD+
Sbjct: 181 WCGVDLESVCSNEY-NLKQCGRLLAGLDGNVDKCLHAVDEDCRTTVWTYKNGEVDVALDV 240

Query: 241 FQPNEQLKKKKKVSYVRRRRDVYRTLGRDSKAGSATVLAFGSLFTAPYRGSELYIDIHGV 300
           FQPNEQLKKKK VSYVRRRRDVYR LG DSKA  A VL+FGSLFTAPY+GSELYIDIH V
Sbjct: 241 FQPNEQLKKKKNVSYVRRRRDVYRALGPDSKAELAAVLSFGSLFTAPYKGSELYIDIHEV 300

Query: 301 SKDQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLA 360
           S DQRISSL+K+IEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHW ATFLA
Sbjct: 301 SGDQRISSLIKSIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWNATFLA 360

Query: 361 LQQKLDSILENANEPIHVFVMTDLPKSNWTGSYLGDLDSDSNHFKLFFLEESDELVLRAS 420
           L+QKLDSIL++ NEP+HVFVMTDLP+SNWTGSYL  L  DSNHFKLF L+E DELV RAS
Sbjct: 361 LEQKLDSILQDGNEPVHVFVMTDLPESNWTGSYLRHLARDSNHFKLFLLKEHDELVQRAS 420

Query: 421 KKVMAVGHGLRWTSNAFGPGRIRDMKKKCASEKLPDVLLYIEETVCSCASLGFVGTAGST 480
           KKVMAVGHGLR TS+AFGP RI DMK KC SE+LPD+LLYIEETVCSCASLGFVGTAGST
Sbjct: 421 KKVMAVGHGLRSTSSAFGPSRIHDMKNKCTSERLPDILLYIEETVCSCASLGFVGTAGST 480

Query: 481 IAESIELMRKYGVC 495
           IAESIELMRKYG+C
Sbjct: 481 IAESIELMRKYGLC 492

BLAST of CsGy5G004440 vs. TAIR10
Match: AT4G17430.1 (O-fucosyltransferase family protein)

HSP 1 Score: 558.5 bits (1438), Expect = 4.0e-159
Identity = 275/440 (62.50%), Postives = 340/440 (77.27%), Query Frame = 0

Query: 65  QCNTQILA---LGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVALG 124
           QC ++IL    LG+KFL+YAPHSGFSNQLSEFKNA+LMAGILNRTL++PPILDHHAVALG
Sbjct: 68  QCRSEILTRTLLGQKFLWYAPHSGFSNQLSEFKNALLMAGILNRTLIIPPILDHHAVALG 127

Query: 125 SCPKFRVPDPGEIRFSVWEHMLQLLRNGRFVSMADIVDISSLTSYSSVKAIDFRTFAYLW 184
           SCPKFRV  P EIR SVW H ++LL+  R+VSMADIVDISSL S S+V+ IDFR FA L 
Sbjct: 128 SCPKFRVLSPSEIRISVWNHSIELLKTDRYVSMADIVDISSLVSSSAVRVIDFRYFASLQ 187

Query: 185 CGVRLESVCANE-------YDNLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQNNEV 244
           CGV LE++C ++       Y++LKQCG LL+G+ GNVDKCL+AVDEDC+TTVWTY+N E 
Sbjct: 188 CGVDLETLCTDDLAEQSQAYESLKQCGYLLSGVRGNVDKCLYAVDEDCRTTVWTYKNGEA 247

Query: 245 DGALDLFQPNEQLKKKKKVSYVRRRRDVYRTLGRDSKAGSATVLAFGSLFTAPYRGSELY 304
           DG LD FQP+E+LKKKKK+S VRRRRDVY+TLG  ++A SA +LAFGSLFTAPY+GSELY
Sbjct: 248 DGRLDSFQPDEKLKKKKKLSNVRRRRDVYKTLGHGTEAESAAILAFGSLFTAPYKGSELY 307

Query: 305 IDIHGVSKDQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHW 364
           IDIH   K  +I SL++ +++LPFV EI+ AGK++  + IKAPFLCAQLRLLDGQFKNH 
Sbjct: 308 IDIH---KSPKIKSLVEKVDFLPFVREIMIAGKKFASETIKAPFLCAQLRLLDGQFKNHR 367

Query: 365 KATFLALQQKLDSILENANEPIHVFVMTDLPKSNWTGSYLGDLDSDSNHFKLFFLEESDE 424
           ++TF  L QKL+++       I+VFVMTDLP+ NWTG+YLGDL  +S +FKL F+ E DE
Sbjct: 368 ESTFTGLYQKLEALSVKNPGLINVFVMTDLPEFNWTGTYLGDLSKNSTNFKLHFIGEQDE 427

Query: 425 LVLRASKKVMAVGHGLRWTSNAFGPGRIRDMKKKCASEKLPDVLLYIEETVCSCASLGFV 484
            + R   ++ +  HG ++ S       I+ M+  C      +V LYIEE VCSCASLGFV
Sbjct: 428 FLARTEHELDSASHGQKFGSIPMSLDSIKKMQTHCYPHGGSNVQLYIEEAVCSCASLGFV 487

Query: 485 GTAGSTIAESIELMRKYGVC 495
           GT GSTIA+S+E+MRKY  C
Sbjct: 488 GTPGSTIADSVEMMRKYNAC 504

BLAST of CsGy5G004440 vs. Swiss-Prot
Match: sp|Q1JPM5|OFT30_ARATH (O-fucosyltransferase 30 OS=Arabidopsis thaliana OX=3702 GN=OFUT30 PE=2 SV=1)

HSP 1 Score: 558.5 bits (1438), Expect = 7.3e-158
Identity = 275/440 (62.50%), Postives = 340/440 (77.27%), Query Frame = 0

Query: 65  QCNTQILA---LGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVALG 124
           QC ++IL    LG+KFL+YAPHSGFSNQLSEFKNA+LMAGILNRTL++PPILDHHAVALG
Sbjct: 68  QCRSEILTRTLLGQKFLWYAPHSGFSNQLSEFKNALLMAGILNRTLIIPPILDHHAVALG 127

Query: 125 SCPKFRVPDPGEIRFSVWEHMLQLLRNGRFVSMADIVDISSLTSYSSVKAIDFRTFAYLW 184
           SCPKFRV  P EIR SVW H ++LL+  R+VSMADIVDISSL S S+V+ IDFR FA L 
Sbjct: 128 SCPKFRVLSPSEIRISVWNHSIELLKTDRYVSMADIVDISSLVSSSAVRVIDFRYFASLQ 187

Query: 185 CGVRLESVCANE-------YDNLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQNNEV 244
           CGV LE++C ++       Y++LKQCG LL+G+ GNVDKCL+AVDEDC+TTVWTY+N E 
Sbjct: 188 CGVDLETLCTDDLAEQSQAYESLKQCGYLLSGVRGNVDKCLYAVDEDCRTTVWTYKNGEA 247

Query: 245 DGALDLFQPNEQLKKKKKVSYVRRRRDVYRTLGRDSKAGSATVLAFGSLFTAPYRGSELY 304
           DG LD FQP+E+LKKKKK+S VRRRRDVY+TLG  ++A SA +LAFGSLFTAPY+GSELY
Sbjct: 248 DGRLDSFQPDEKLKKKKKLSNVRRRRDVYKTLGHGTEAESAAILAFGSLFTAPYKGSELY 307

Query: 305 IDIHGVSKDQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHW 364
           IDIH   K  +I SL++ +++LPFV EI+ AGK++  + IKAPFLCAQLRLLDGQFKNH 
Sbjct: 308 IDIH---KSPKIKSLVEKVDFLPFVREIMIAGKKFASETIKAPFLCAQLRLLDGQFKNHR 367

Query: 365 KATFLALQQKLDSILENANEPIHVFVMTDLPKSNWTGSYLGDLDSDSNHFKLFFLEESDE 424
           ++TF  L QKL+++       I+VFVMTDLP+ NWTG+YLGDL  +S +FKL F+ E DE
Sbjct: 368 ESTFTGLYQKLEALSVKNPGLINVFVMTDLPEFNWTGTYLGDLSKNSTNFKLHFIGEQDE 427

Query: 425 LVLRASKKVMAVGHGLRWTSNAFGPGRIRDMKKKCASEKLPDVLLYIEETVCSCASLGFV 484
            + R   ++ +  HG ++ S       I+ M+  C      +V LYIEE VCSCASLGFV
Sbjct: 428 FLARTEHELDSASHGQKFGSIPMSLDSIKKMQTHCYPHGGSNVQLYIEEAVCSCASLGFV 487

Query: 485 GTAGSTIAESIELMRKYGVC 495
           GT GSTIA+S+E+MRKY  C
Sbjct: 488 GTPGSTIADSVEMMRKYNAC 504

BLAST of CsGy5G004440 vs. TrEMBL
Match: tr|A0A0A0KPB8|A0A0A0KPB8_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G165220 PE=4 SV=1)

HSP 1 Score: 895.6 bits (2313), Expect = 5.1e-257
Identity = 491/494 (99.39%), Postives = 493/494 (99.80%), Query Frame = 0

Query: 1   MNAFGSTRSSINRWNSKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60
           MNAFGSTRSSINRWNSKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 1   MNAFGSTRSSINRWNSKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60

Query: 61  XXSRQCNTQILALGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL 120
           XXSRQCNTQILALGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL
Sbjct: 61  XXSRQCNTQILALGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL 120

Query: 121 GSCPKFRVPDPGEIRFSVWEHMLQLLRNGRFVSMADIVDISSLTSYSSVKAIDFRTFAYL 180
           GSCPKFRVPDPGEIRFSVWEHMLQLLRNGR+VSMADIVDISSLTSYSSVKAIDFRTFAYL
Sbjct: 121 GSCPKFRVPDPGEIRFSVWEHMLQLLRNGRYVSMADIVDISSLTSYSSVKAIDFRTFAYL 180

Query: 181 WCGVRLESVCANEYDNLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQNNEVDGALDL 240
           WCGVRLESVCANEY+NLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQNNEVDGALDL
Sbjct: 181 WCGVRLESVCANEYNNLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQNNEVDGALDL 240

Query: 241 FQPNEQLKKKKKVSYVRRRRDVYRTLGRDSKAGSATVLAFGSLFTAPYRGSELYIDIHGV 300
           FQPNEQLKKKKKVSYVRRRRDVYRTLGRDSKAGSATVLAFGSLFTAPYRGSELYIDIHGV
Sbjct: 241 FQPNEQLKKKKKVSYVRRRRDVYRTLGRDSKAGSATVLAFGSLFTAPYRGSELYIDIHGV 300

Query: 301 SKDQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLA 360
           SKDQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLA
Sbjct: 301 SKDQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLA 360

Query: 361 LQQKLDSILENANEPIHVFVMTDLPKSNWTGSYLGDLDSDSNHFKLFFLEESDELVLRAS 420
           LQQKLDSILENANEPIHVFVMTDLPKSNWTGSYLGDLDSDSNHFKLFFLEESDELVLRAS
Sbjct: 361 LQQKLDSILENANEPIHVFVMTDLPKSNWTGSYLGDLDSDSNHFKLFFLEESDELVLRAS 420

Query: 421 KKVMAVGHGLRWTSNAFGPGRIRDMKKKCASEKLPDVLLYIEETVCSCASLGFVGTAGST 480
           KKVMAVGHGLRWTSNAFGPG IRDMKKKCASEKLPDVLLYIEETVCSCASLGFVGTAGST
Sbjct: 421 KKVMAVGHGLRWTSNAFGPGSIRDMKKKCASEKLPDVLLYIEETVCSCASLGFVGTAGST 480

Query: 481 IAESIELMRKYGVC 495
           IAESIELMRKYGVC
Sbjct: 481 IAESIELMRKYGVC 494

BLAST of CsGy5G004440 vs. TrEMBL
Match: tr|A0A1S3AT28|A0A1S3AT28_CUCME (uncharacterized protein LOC103482591 OS=Cucumis melo OX=3656 GN=LOC103482591 PE=4 SV=1)

HSP 1 Score: 867.8 bits (2241), Expect = 1.1e-248
Identity = 453/494 (91.70%), Postives = 464/494 (93.93%), Query Frame = 0

Query: 1   MNAFGSTRSSINRWNSKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60
           MN FGSTRSSINRWNSK                    XXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 1   MNVFGSTRSSINRWNSKKPNLQLRRFSLSVFVLLFCFXXXXXXXXXXXXXXXXXXXXXXX 60

Query: 61  XXSRQCNTQILALGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL 120
             SRQCNTQIL LGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL
Sbjct: 61  SNSRQCNTQILGLGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL 120

Query: 121 GSCPKFRVPDPGEIRFSVWEHMLQLLRNGRFVSMADIVDISSLTSYSSVKAIDFRTFAYL 180
           GSCPKFRVPDPGEIRFSVWEHMLQLLRNGR+VSM DIVDISSLTSYSSVKAIDFRTFAYL
Sbjct: 121 GSCPKFRVPDPGEIRFSVWEHMLQLLRNGRYVSMTDIVDISSLTSYSSVKAIDFRTFAYL 180

Query: 181 WCGVRLESVCANEYDNLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQNNEVDGALDL 240
           WCGV LESVC+NEY+NLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQ+NEVDGALDL
Sbjct: 181 WCGVHLESVCSNEYNNLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQSNEVDGALDL 240

Query: 241 FQPNEQLKKKKKVSYVRRRRDVYRTLGRDSKAGSATVLAFGSLFTAPYRGSELYIDIHGV 300
           FQPNEQLKKKKKVSYVRRRRDVYRTLG DSKAGSATVLAFGSLFTAPY+GSELYIDIHGV
Sbjct: 241 FQPNEQLKKKKKVSYVRRRRDVYRTLGPDSKAGSATVLAFGSLFTAPYKGSELYIDIHGV 300

Query: 301 SKDQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLA 360
           SKDQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLA
Sbjct: 301 SKDQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLA 360

Query: 361 LQQKLDSILENANEPIHVFVMTDLPKSNWTGSYLGDLDSDSNHFKLFFLEESDELVLRAS 420
           LQQKL+SILENANEPI VFVMTDLP+SNWTGSYLGDLDSDSNHFKLFFL+E DELVLRAS
Sbjct: 361 LQQKLNSILENANEPIRVFVMTDLPESNWTGSYLGDLDSDSNHFKLFFLKEHDELVLRAS 420

Query: 421 KKVMAVGHGLRWTSNAFGPGRIRDMKKKCASEKLPDVLLYIEETVCSCASLGFVGTAGST 480
           KKVMAVGHGLRWTSNAFGPGRIR+MKK+CA E+LPDVLLYIEETVCSCASLGFVGTAGST
Sbjct: 421 KKVMAVGHGLRWTSNAFGPGRIRNMKKECAPERLPDVLLYIEETVCSCASLGFVGTAGST 480

Query: 481 IAESIELMRKYGVC 495
           IAESIELMRKYGVC
Sbjct: 481 IAESIELMRKYGVC 494

BLAST of CsGy5G004440 vs. TrEMBL
Match: tr|A0A2N9F0Q0|A0A2N9F0Q0_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS8472 PE=4 SV=1)

HSP 1 Score: 632.5 bits (1630), Expect = 8.0e-178
Identity = 312/438 (71.23%), Postives = 365/438 (83.33%), Query Frame = 0

Query: 65  QCNT-QILALGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVALGSC 124
           QC T Q L LGEKFL+YAPHSGFSNQLSEFKNAIL+AGILNRTL+VPPILDHHA+ALGSC
Sbjct: 64  QCPTSQSLTLGEKFLWYAPHSGFSNQLSEFKNAILLAGILNRTLIVPPILDHHAIALGSC 123

Query: 125 PKFRVPDPGEIRFSVWEHMLQLLRNGRFVSMADIVDISSLTSYSSVKAIDFRTFAYLWCG 184
           PKFRV  P +IR +VW H+L+LLR GR+VSMADI+DISSL S S V+AIDFR FA LWCG
Sbjct: 124 PKFRVSAPNDIRVAVWNHVLELLRTGRYVSMADIIDISSLVSSSVVQAIDFRVFASLWCG 183

Query: 185 VRLESVCANE-------YDNLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQNNEVDG 244
           V+ +  C NE        D LKQCG LL+GL+GNVDKC++A+ EDC+TTVWTYQN + DG
Sbjct: 184 VK-DFDCFNESNEQLALLDRLKQCGSLLSGLNGNVDKCIYAIGEDCRTTVWTYQNGDKDG 243

Query: 245 ALDLFQPNEQLKKKKKVSYVRRRRDVYRTLGRDSKAGSATVLAFGSLFTAPYRGSELYID 304
            LD FQP+EQLK+KKKVS+VRRRRDVY+TLG  S A SAT+LAFGSLFT+PYRGSELYID
Sbjct: 244 VLDSFQPDEQLKRKKKVSFVRRRRDVYQTLGPGSAAESATLLAFGSLFTSPYRGSELYID 303

Query: 305 IHGVSKDQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKA 364
           IH   +DQRI SL+  IE+LPFVPEI+SAGKE+ DK I APFLCAQLRLLDGQFKNHWKA
Sbjct: 304 IHEAPRDQRIQSLIGKIEFLPFVPEIMSAGKEFADKNINAPFLCAQLRLLDGQFKNHWKA 363

Query: 365 TFLALQQKLDSILENANEPIHVFVMTDLPKSNWTGSYLGDLDSDSNHFKLFFLEESDELV 424
           TFL+L+QK++S L  ++ PIH+F+MTDLP+ NWTGSYLGDL  DS+HFKL FL   DELV
Sbjct: 364 TFLSLKQKVES-LGQSSLPIHIFMMTDLPEGNWTGSYLGDLARDSHHFKLHFLRGEDELV 423

Query: 425 LRASKKVMAVGHGLRWTSNAFGPGRIRDMKKKCASEKLPDVLLYIEETVCSCASLGFVGT 484
           ++ +KKV+A  HGLR+   AF P  +  +KK C+SE+LPDVLL+IEE VCSCASLGFVGT
Sbjct: 424 IKTAKKVVAASHGLRF---AFVPESMGGLKKHCSSERLPDVLLFIEEAVCSCASLGFVGT 483

Query: 485 AGSTIAESIELMRKYGVC 495
           AGSTIAES+ELMRK+  C
Sbjct: 484 AGSTIAESVELMRKFRTC 496

BLAST of CsGy5G004440 vs. TrEMBL
Match: tr|A0A1R3GNI3|A0A1R3GNI3_COCAP (Uncharacterized protein OS=Corchorus capsularis OX=210143 GN=CCACVL1_24722 PE=4 SV=1)

HSP 1 Score: 616.7 bits (1589), Expect = 4.6e-173
Identity = 294/437 (67.28%), Postives = 360/437 (82.38%), Query Frame = 0

Query: 65  QCNTQILALGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVALGSCP 124
           +C T+I   GEKFL+++PHSGFSNQLSEFKNAI+MAGILNRTL+VPPILDHHAVALGSCP
Sbjct: 73  RCRTRI--PGEKFLWFSPHSGFSNQLSEFKNAIVMAGILNRTLIVPPILDHHAVALGSCP 132

Query: 125 KFRVPDPGEIRFSVWEHMLQLLRNGRFVSMADIVDISSLTSYSSVKAIDFRTFAYLWCGV 184
           KFRV  P EIR SVW+H+++L+R+GR+VSMADI+DISSL S S V+AIDFR F  LWCG+
Sbjct: 133 KFRVQSPKEIRLSVWDHVIELIRSGRYVSMADIIDISSLLSSSLVRAIDFRVFVSLWCGL 192

Query: 185 RLESVCANE-------YDNLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQNNEVDGA 244
            +   C+NE        D+LKQCG LL+GLDGN+D+CL AVDEDC+TTVW YQN+EVDGA
Sbjct: 193 DMTLACSNELDANQSMVDSLKQCGSLLSGLDGNIDRCLFAVDEDCRTTVWMYQNDEVDGA 252

Query: 245 LDLFQPNEQLKKKKKVSYVRRRRDVYRTLGRDSKAGSATVLAFGSLFTAPYRGSELYIDI 304
           LD FQP+EQLKKKKK+S+VR R+DVY+TLG  S+A +ATVLAFGSLFTAPY+GSELYIDI
Sbjct: 253 LDSFQPDEQLKKKKKISFVRTRKDVYKTLGPGSEADTATVLAFGSLFTAPYKGSELYIDI 312

Query: 305 HGVSKDQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKAT 364
                D RI SL++ IE+LPFVPEI+++GK++  + IKAPFLCAQLRLLDGQFKNHWKAT
Sbjct: 313 QKAPGDPRIKSLLEKIEFLPFVPEIINSGKQFSVQTIKAPFLCAQLRLLDGQFKNHWKAT 372

Query: 365 FLALQQKLDSILENANEPIHVFVMTDLPKSNWTGSYLGDLDSDSNHFKLFFLEESDELVL 424
           F +L+QKLDS+ + ++ PIH+FVMTDLP+ NWTG+YLGDL  DS +FKL+FL E D LV 
Sbjct: 373 FSSLKQKLDSLRQASSLPIHIFVMTDLPQGNWTGTYLGDLAKDSTNFKLYFLREEDLLVK 432

Query: 425 RASKKVMAVGHGLRWTSNAFGPGRIRDMKKKCASEKLPDVLLYIEETVCSCASLGFVGTA 484
              KK+   GHGLR+ S       + +++K CA  KLPDVLL++EE VCSCAS+GFVGTA
Sbjct: 433 ETEKKLALAGHGLRFGSLPGSKDAVANLEKHCAPNKLPDVLLFLEEIVCSCASIGFVGTA 492

Query: 485 GSTIAESIELMRKYGVC 495
           GSTIAE+IE++RK+G C
Sbjct: 493 GSTIAETIEVIRKFGSC 507

BLAST of CsGy5G004440 vs. TrEMBL
Match: tr|A0A061G4F7|A0A061G4F7_THECC (O-fucosyltransferase family protein, putative isoform 2 OS=Theobroma cacao OX=3641 GN=TCM_015927 PE=4 SV=1)

HSP 1 Score: 615.9 bits (1587), Expect = 7.8e-173
Identity = 299/436 (68.58%), Postives = 358/436 (82.11%), Query Frame = 0

Query: 66  CNTQILALGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVALGSCPK 125
           C TQI   GEKFL+YAPHSGFSNQLSEFKNAILMAGILNRTL+VPPILDHHAV LGSCPK
Sbjct: 74  CTTQI--PGEKFLWYAPHSGFSNQLSEFKNAILMAGILNRTLIVPPILDHHAVVLGSCPK 133

Query: 126 FRVPDPGEIRFSVWEHMLQLLRNGRFVSMADIVDISSLTSYSSVKAIDFRTFAYLWCGVR 185
           FRV    EIR SVW+H+ +L+R+ R+VSMADI+DISSL S S V+AIDFR F  LWCG+ 
Sbjct: 134 FRVQSAKEIRLSVWDHINELIRSERYVSMADIIDISSLLSSSLVRAIDFRVFVSLWCGLN 193

Query: 186 LESVCANEYD-------NLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQNNEVDGAL 245
           ++ VC+NE +       +L+QCG LL+G+DGN+D+CL AVDEDC+TTVWTYQN+EVDG L
Sbjct: 194 MDLVCSNELNAQQSMVGSLRQCGSLLSGIDGNIDRCLFAVDEDCRTTVWTYQNDEVDGVL 253

Query: 246 DLFQPNEQLKKKKKVSYVRRRRDVYRTLGRDSKAGSATVLAFGSLFTAPYRGSELYIDIH 305
           D FQP+EQLK KKK+SYVRRRR+VY+TLG  S+A SATVLAFGSLFTAPY+GS+LYIDI 
Sbjct: 254 DSFQPDEQLKNKKKISYVRRRRNVYKTLGPGSEAESATVLAFGSLFTAPYKGSDLYIDIQ 313

Query: 306 GVSKDQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATF 365
               D +I SL+K IE+LPFVPEI+S+GK++  + IKAPFLCAQLRLLDGQFKNHWKATF
Sbjct: 314 KAPGDLKIKSLIKKIEFLPFVPEIISSGKQFAMQSIKAPFLCAQLRLLDGQFKNHWKATF 373

Query: 366 LALQQKLDSILENANEPIHVFVMTDLPKSNWTGSYLGDLDSDSNHFKLFFLEESDELVLR 425
           L L+QKLDS+ +  + PIH+FVMTDLP+ NWTGSYLGDL  DS +FKL+FL E D  V++
Sbjct: 374 LGLKQKLDSLRQAGSRPIHIFVMTDLPQGNWTGSYLGDLARDSANFKLYFLRE-DLFVMK 433

Query: 426 ASKKVMAVGHGLRWTSNAFGPGRIRDMKKKCASEKLPDVLLYIEETVCSCASLGFVGTAG 485
            +KK+   GHGLR+ S       +  ++K C+ + +PDVLLYIEETVCSCASLGFVGTAG
Sbjct: 434 TAKKLALAGHGLRFESVPASLDAVAKLEKHCSPDIVPDVLLYIEETVCSCASLGFVGTAG 493

Query: 486 STIAESIELMRKYGVC 495
           STIAE+IE+MRKYG C
Sbjct: 494 STIAETIEVMRKYGSC 506

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004152423.17.7e-25799.39PREDICTED: uncharacterized protein LOC101209896 [Cucumis sativus] >KGN50277.1 hy... [more]
XP_008437048.11.7e-24891.70PREDICTED: uncharacterized protein LOC103482591 [Cucumis melo][more]
XP_022958125.12.9e-22488.45O-fucosyltransferase 30-like isoform X1 [Cucurbita moschata][more]
XP_022958126.12.9e-22488.45O-fucosyltransferase 30-like isoform X2 [Cucurbita moschata][more]
XP_023549514.14.7e-22279.15O-fucosyltransferase 30-like [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
AT4G17430.14.0e-15962.50O-fucosyltransferase family protein[more]
Match NameE-valueIdentityDescription
sp|Q1JPM5|OFT30_ARATH7.3e-15862.50O-fucosyltransferase 30 OS=Arabidopsis thaliana OX=3702 GN=OFUT30 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
tr|A0A0A0KPB8|A0A0A0KPB8_CUCSA5.1e-25799.39Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G165220 PE=4 SV=1[more]
tr|A0A1S3AT28|A0A1S3AT28_CUCME1.1e-24891.70uncharacterized protein LOC103482591 OS=Cucumis melo OX=3656 GN=LOC103482591 PE=... [more]
tr|A0A2N9F0Q0|A0A2N9F0Q0_FAGSY8.0e-17871.23Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS8472 PE=4 SV=1[more]
tr|A0A1R3GNI3|A0A1R3GNI3_COCAP4.6e-17367.28Uncharacterized protein OS=Corchorus capsularis OX=210143 GN=CCACVL1_24722 PE=4 ... [more]
tr|A0A061G4F7|A0A061G4F7_THECC7.8e-17368.58O-fucosyltransferase family protein, putative isoform 2 OS=Theobroma cacao OX=36... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR019378GDP-Fuc_O-FucTrfase
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006004 fucose metabolic process
biological_process GO:0008150 biological_process
cellular_component GO:0005768 endosome
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005802 trans-Golgi network
cellular_component GO:0005575 cellular_component
molecular_function GO:0016757 transferase activity, transferring glycosyl groups
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsGy5G004440.1CsGy5G004440.1mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR019378GDP-fucose protein O-fucosyltransferasePFAMPF10250O-FucTcoord: 77..350
e-value: 1.8E-11
score: 44.3
NoneNo IPR availablePANTHERPTHR36050FAMILY NOT NAMEDcoord: 12..494

The following gene(s) are paralogous to this gene:

None