Cla97C01G004420 (gene) Watermelon (97103) v2

NameCla97C01G004420
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionO-fucosyltransferase family protein
LocationCla97Chr01 : 4286869 .. 4289012 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAATATTTCCGGTTCGAGCAGATCTTCGATCAATCGATGGAACTCAAAGAAATCCAATTTACAGCTCCGGCGTTTTTCTTTGTCTGTCATCGTTCTACTCTTCTGTTCCCTCTTTCTCCTCTACCTGTCCTCCTCCTCCTTCATGTCCTCAACTGCATTCTCCACTTCAAATTCACGTCAATGCAATCCCCAGATCTTAGATTTGGGTGAGAAATTTCTGTTTTACGCGCCTCACAGTGGGTTTAGCAACCAGCTTTCCGAGTTCAAGAATGCTATTCTAATGGCCGGGATTCTCAACCGGACTCTTGTTGTTCCGCCGATTCTGGATCACCATGCGGTAGCTCTCGGGAGTTGTCCGAAATTCCGAGTTCCGGATCCTGGTGAGATTCGATTTTCGGTTTGGGAGCACATGCTTGAGCTTCTTCGGATTGGAAGGTAATCATTTTCCCCTTGGTTTGTAATTTTTTTTTTTTTTTTAATTGTATTTTCAATGTCTTTATTTTTCTTGAACTGATGAGTGTTCGGGTTGTCGGGTTGCTGGATTGTTATGTAAGTTGCTTCTAAATCAAATAATTGAACCGTTTTGGTAAACTCGTAGCAATTCTAGCTTACTGTTTACTTTTCCATTCTTATCTTTTGTGATTGATTTAACAAGGAGATGAAAAGAACAAGGCTGAATTAGAAAAAAAAAAAGAAAAAAACCCTAAACCTTGTCATTTTTTTAAAAAATGCCCCCTTTTTTAAAATATTGCAATATTACCCTCCATGTTTCATTAACGTTGAAACTACCTTGATGTAAATGCATTAGAATGCTGGATGGAAAATGGGACTTGGAACAGTGTGGTGGTAAACTAAAACAGAAATTGGGACCGTCACACTGTTCTAAATCACCATCATAACATATCAAGTCTCAAATTTTATCCAATACTCTAACGAGTTTCTACTCCGTGGGTAGTTTTGAAACGGTAATTTTGTAACTTTTGAATGAACGAAGGACATTCGTTTAAGTTTAGGACTATTATTTTGTAATTTAGCCAAGAAAATAATTCTGTGGTAACGTTTTATGACATTAGCTATGCATTTCAATTTTATGGCAGGTACGTTTCCATGGCAGATATTGTAGATATTTCATCATTAACTTCTTACTCTTCTGTTAAAGCCATAGATTTTAGGACCTTTGCATACTTATGGTGTGGAGTGCATCTGGAAAGTGTTTGTTCAAATGAATATAACCTAAAGCATTGTGGTCGTCTACTAGCAGGGCTTGATGGGAATGTAGACAAATGTTTACATGCTGTAGATGAAGATTGCAGAACTACAGTTTGGAGTTACCAAAATGGTGAAGTTGATGGAGCATTAGACTTGTTTCAGCCTAACGAACAGCTTAAGAAGAAAAAGAAAGTGTCCTATGTCAGACGCCGTCGAGATGTATATAGAACCCTCGGACCCAATTCGAAAGCTGAATCAGCTACTGTTTTGGCATTTGGAAGTCTATTTACTGCTCCATACAAAGGTTCAGAGCTGTATATTGATATCCATGAAGTTAGTGGAGATCAAAGAATCAGTTCTTTGATGAAAAACATTGAGTATCTACCATTTGTCCCAGAAATCTTGAGTGCAGGAAAAGAGTATGTTGACAAGATCATAAAAGCTCCATTCCTTTGTGCTCAATTGAGATTGTTAGATGGGCAGTTCAAACATCACTGGAAGGCGACTTTTCAGGGCCTGAAACAGAAATTAGACTCTATATTAGAGAATGCTAATGAACCTATTCATGTTTTTGTGATGACTGATCTTCCTGAATCTAATTGGACTGGAAGCTACTTAGGGGATTTGGCTAGTGATTCAAATCACTTCAAACTCTTTTTTCTCAAAGAACACGATGAATTGGTTCTAAGAGCGTCTAAAAAGGTGATGGCTGTAGGACATGGCTTGAGATGGACATCTAGTGCATTTGGTCCTAGCAGAATTCGTGATATGAAGAAGAAATGTGCTTCAGAAAGATTACCGGATGTTCTCTTATATGTAGAGGAAACTGTTTGCAGTTGTGCTTCACTTGGTTTTGTTGGTACTGCTGGTTCCACAATTGCTGAAAGCATTGAGCTGATGAGAAAATATAGACTACGTTCAGGTCAAAATTGA

mRNA sequence

ATGAATATTTCCGGTTCGAGCAGATCTTCGATCAATCGATGGAACTCAAAGAAATCCAATTTACAGCTCCGGCGTTTTTCTTTGTCTGTCATCGTTCTACTCTTCTGTTCCCTCTTTCTCCTCTACCTGTCCTCCTCCTCCTTCATGTCCTCAACTGCATTCTCCACTTCAAATTCACGTCAATGCAATCCCCAGATCTTAGATTTGGGTGAGAAATTTCTGTTTTACGCGCCTCACAGTGGGTTTAGCAACCAGCTTTCCGAGTTCAAGAATGCTATTCTAATGGCCGGGATTCTCAACCGGACTCTTGTTGTTCCGCCGATTCTGGATCACCATGCGGTAGCTCTCGGGAGTTGTCCGAAATTCCGAGTTCCGGATCCTGGTGAGATTCGATTTTCGGTTTGGGAGCACATGCTTGAGCTTCTTCGGATTGGAAGGTACGTTTCCATGGCAGATATTGTAGATATTTCATCATTAACTTCTTACTCTTCTGTTAAAGCCATAGATTTTAGGACCTTTGCATACTTATGGTGTGGAGTGCATCTGGAAAGTGTTTGTTCAAATGAATATAACCTAAAGCATTGTGGTCGTCTACTAGCAGGGCTTGATGGGAATGTAGACAAATGTTTACATGCTGTAGATGAAGATTGCAGAACTACAGTTTGGAGTTACCAAAATGGTGAAGTTGATGGAGCATTAGACTTGTTTCAGCCTAACGAACAGCTTAAGAAGAAAAAGAAAGTGTCCTATGTCAGACGCCGTCGAGATGTATATAGAACCCTCGGACCCAATTCGAAAGCTGAATCAGCTACTGTTTTGGCATTTGGAAGTCTATTTACTGCTCCATACAAAGGTTCAGAGCTGTATATTGATATCCATGAAGTTAGTGGAGATCAAAGAATCAGTTCTTTGATGAAAAACATTGAGTATCTACCATTTGTCCCAGAAATCTTGAGTGCAGGAAAAGAGTATGTTGACAAGATCATAAAAGCTCCATTCCTTTGTGCTCAATTGAGATTGTTAGATGGGCAGTTCAAACATCACTGGAAGGCGACTTTTCAGGGCCTGAAACAGAAATTAGACTCTATATTAGAGAATGCTAATGAACCTATTCATGTTTTTGTGATGACTGATCTTCCTGAATCTAATTGGACTGGAAGCTACTTAGGGGATTTGGCTAGTGATTCAAATCACTTCAAACTCTTTTTTCTCAAAGAACACGATGAATTGGTTCTAAGAGCGTCTAAAAAGGTGATGGCTGTAGGACATGGCTTGAGATGGACATCTAGTGCATTTGGTCCTAGCAGAATTCGTGATATGAAGAAGAAATGTGCTTCAGAAAGATTACCGGATGTTCTCTTATATGTAGAGGAAACTGTTTGCAGTTGTGCTTCACTTGGTTTTGTTGGTACTGCTGGTTCCACAATTGCTGAAAGCATTGAGCTGATGAGAAAATATAGACTACGTTCAGGTCAAAATTGA

Coding sequence (CDS)

ATGAATATTTCCGGTTCGAGCAGATCTTCGATCAATCGATGGAACTCAAAGAAATCCAATTTACAGCTCCGGCGTTTTTCTTTGTCTGTCATCGTTCTACTCTTCTGTTCCCTCTTTCTCCTCTACCTGTCCTCCTCCTCCTTCATGTCCTCAACTGCATTCTCCACTTCAAATTCACGTCAATGCAATCCCCAGATCTTAGATTTGGGTGAGAAATTTCTGTTTTACGCGCCTCACAGTGGGTTTAGCAACCAGCTTTCCGAGTTCAAGAATGCTATTCTAATGGCCGGGATTCTCAACCGGACTCTTGTTGTTCCGCCGATTCTGGATCACCATGCGGTAGCTCTCGGGAGTTGTCCGAAATTCCGAGTTCCGGATCCTGGTGAGATTCGATTTTCGGTTTGGGAGCACATGCTTGAGCTTCTTCGGATTGGAAGGTACGTTTCCATGGCAGATATTGTAGATATTTCATCATTAACTTCTTACTCTTCTGTTAAAGCCATAGATTTTAGGACCTTTGCATACTTATGGTGTGGAGTGCATCTGGAAAGTGTTTGTTCAAATGAATATAACCTAAAGCATTGTGGTCGTCTACTAGCAGGGCTTGATGGGAATGTAGACAAATGTTTACATGCTGTAGATGAAGATTGCAGAACTACAGTTTGGAGTTACCAAAATGGTGAAGTTGATGGAGCATTAGACTTGTTTCAGCCTAACGAACAGCTTAAGAAGAAAAAGAAAGTGTCCTATGTCAGACGCCGTCGAGATGTATATAGAACCCTCGGACCCAATTCGAAAGCTGAATCAGCTACTGTTTTGGCATTTGGAAGTCTATTTACTGCTCCATACAAAGGTTCAGAGCTGTATATTGATATCCATGAAGTTAGTGGAGATCAAAGAATCAGTTCTTTGATGAAAAACATTGAGTATCTACCATTTGTCCCAGAAATCTTGAGTGCAGGAAAAGAGTATGTTGACAAGATCATAAAAGCTCCATTCCTTTGTGCTCAATTGAGATTGTTAGATGGGCAGTTCAAACATCACTGGAAGGCGACTTTTCAGGGCCTGAAACAGAAATTAGACTCTATATTAGAGAATGCTAATGAACCTATTCATGTTTTTGTGATGACTGATCTTCCTGAATCTAATTGGACTGGAAGCTACTTAGGGGATTTGGCTAGTGATTCAAATCACTTCAAACTCTTTTTTCTCAAAGAACACGATGAATTGGTTCTAAGAGCGTCTAAAAAGGTGATGGCTGTAGGACATGGCTTGAGATGGACATCTAGTGCATTTGGTCCTAGCAGAATTCGTGATATGAAGAAGAAATGTGCTTCAGAAAGATTACCGGATGTTCTCTTATATGTAGAGGAAACTGTTTGCAGTTGTGCTTCACTTGGTTTTGTTGGTACTGCTGGTTCCACAATTGCTGAAAGCATTGAGCTGATGAGAAAATATAGACTACGTTCAGGTCAAAATTGA

Protein sequence

MNISGSSRSSINRWNSKKSNLQLRRFSLSVIVLLFCSLFLLYLSSSSFMSSTAFSTSNSRQCNPQILDLGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVALGSCPKFRVPDPGEIRFSVWEHMLELLRIGRYVSMADIVDISSLTSYSSVKAIDFRTFAYLWCGVHLESVCSNEYNLKHCGRLLAGLDGNVDKCLHAVDEDCRTTVWSYQNGEVDGALDLFQPNEQLKKKKKVSYVRRRRDVYRTLGPNSKAESATVLAFGSLFTAPYKGSELYIDIHEVSGDQRISSLMKNIEYLPFVPEILSAGKEYVDKIIKAPFLCAQLRLLDGQFKHHWKATFQGLKQKLDSILENANEPIHVFVMTDLPESNWTGSYLGDLASDSNHFKLFFLKEHDELVLRASKKVMAVGHGLRWTSSAFGPSRIRDMKKKCASERLPDVLLYVEETVCSCASLGFVGTAGSTIAESIELMRKYRLRSGQN
BLAST of Cla97C01G004420 vs. NCBI nr
Match: XP_023549514.1 (O-fucosyltransferase 30-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 864.4 bits (2232), Expect = 1.9e-247
Identity = 435/496 (87.70%), Postives = 455/496 (91.73%), Query Frame = 0

Query: 1   MNISGSSRSSINRWNSKKSNLQLRRFSLSVIVLL-FCSLFLLYL--SSSSFMSSTAFSTS 60
           MNI GSSRS IN WNSKKSN   RRFSL V+ LL FCS FLLYL  S SSFM STAFSTS
Sbjct: 1   MNIFGSSRSPINGWNSKKSNFLHRRFSLPVLALLFFCSFFLLYLFSSYSSFMPSTAFSTS 60

Query: 61  NSRQCNPQILDLGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVALG 120
           NSRQC+ +ILDLGE+FLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVALG
Sbjct: 61  NSRQCSSRILDLGERFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVALG 120

Query: 121 SCPKFRVPDPGEIRFSVWEHMLELLRIGRYVSMADIVDISSLTSYSSVKAIDFRTFAYLW 180
           SCPKFRV DPGEIRFSVWEHMLELLR GRYVSMADIVDISSL SYSS+KAIDFRTFAYLW
Sbjct: 121 SCPKFRVTDPGEIRFSVWEHMLELLRNGRYVSMADIVDISSLASYSSIKAIDFRTFAYLW 180

Query: 181 CGVHLESVCSNEYNLKHCGRLLAGLDGNVDKCLHAVDEDCRTTVWSYQNGEVDGALDLFQ 240
           CGV LESVCSNEYNLK CGRLLAGLDGNVDKCLHAVDEDCRTTVW+Y+NGEVD ALD+FQ
Sbjct: 181 CGVDLESVCSNEYNLKQCGRLLAGLDGNVDKCLHAVDEDCRTTVWTYKNGEVDVALDVFQ 240

Query: 241 PNEQLKKKKKVSYVRRRRDVYRTLGPNSKAESATVLAFGSLFTAPYKGSELYIDIHEVSG 300
           PNEQLKKKK VSYVRRRRDVYR LGP+SKAE A VL+FGSLFTAPYKGSELYIDIHEVSG
Sbjct: 241 PNEQLKKKKNVSYVRRRRDVYRALGPDSKAELAAVLSFGSLFTAPYKGSELYIDIHEVSG 300

Query: 301 DQRISSLMKNIEYLPFVPEILSAGKEYVDKIIKAPFLCAQLRLLDGQFKHHWKATFQGLK 360
           DQRISSL+K+IEYLPFVPEILSAGKEY+DKIIKAPFLCAQLRLLDGQFK+HW ATF  L+
Sbjct: 301 DQRISSLIKSIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWNATFLALE 360

Query: 361 QKLDSILENANEPIHVFVMTDLPESNWTGSYLGDLASDSNHFKLFFLKEHDELVLRASKK 420
           QKLDSIL++ NEP+HVFVMTDLPESNWTGSYL  LA DSNHFKLF LKEHDELV RASKK
Sbjct: 361 QKLDSILQDGNEPVHVFVMTDLPESNWTGSYLRHLARDSNHFKLFLLKEHDELVQRASKK 420

Query: 421 VMAVGHGLRWTSSAFGPSRIRDMKKKCASERLPDVLLYVEETVCSCASLGFVGTAGSTIA 480
           VMAVGHGLR TSSAFGPSRI DMK KC SERLPD+LLY+EETVCSCASLGFVGTAGSTIA
Sbjct: 421 VMAVGHGLRSTSSAFGPSRIHDMKNKCTSERLPDILLYIEETVCSCASLGFVGTAGSTIA 480

Query: 481 ESIELMRKYRLRSGQN 494
           ESIELMRKY L S QN
Sbjct: 481 ESIELMRKYGLCSDQN 496

BLAST of Cla97C01G004420 vs. NCBI nr
Match: XP_008437048.1 (PREDICTED: uncharacterized protein LOC103482591 [Cucumis melo])

HSP 1 Score: 860.5 bits (2222), Expect = 2.7e-246
Identity = 433/491 (88.19%), Postives = 448/491 (91.24%), Query Frame = 0

Query: 1   MNISGSSRSSINRWNSKKSNLQLRRFSLSVIVLLFC----SLFLLYLSSSSFMSSTAFST 60
           MN+ GS+RSSINRWNSKK NLQLRRFSLSV VLLFC                        
Sbjct: 1   MNVFGSTRSSINRWNSKKPNLQLRRFSLSVFVLLFCFXXXXXXXXXXXXXXXXXXXXXXX 60

Query: 61  SNSRQCNPQILDLGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL 120
           SNSRQCN QIL LGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL
Sbjct: 61  SNSRQCNTQILGLGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL 120

Query: 121 GSCPKFRVPDPGEIRFSVWEHMLELLRIGRYVSMADIVDISSLTSYSSVKAIDFRTFAYL 180
           GSCPKFRVPDPGEIRFSVWEHML+LLR GRYVSM DIVDISSLTSYSSVKAIDFRTFAYL
Sbjct: 121 GSCPKFRVPDPGEIRFSVWEHMLQLLRNGRYVSMTDIVDISSLTSYSSVKAIDFRTFAYL 180

Query: 181 WCGVHLESVCSNEY-NLKHCGRLLAGLDGNVDKCLHAVDEDCRTTVWSYQNGEVDGALDL 240
           WCGVHLESVCSNEY NLK CGRLLAGLDGNVDKCLHAVDEDC+TTVW+YQ+ EVDGALDL
Sbjct: 181 WCGVHLESVCSNEYNNLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQSNEVDGALDL 240

Query: 241 FQPNEQLKKKKKVSYVRRRRDVYRTLGPNSKAESATVLAFGSLFTAPYKGSELYIDIHEV 300
           FQPNEQLKKKKKVSYVRRRRDVYRTLGP+SKA SATVLAFGSLFTAPYKGSELYIDIH V
Sbjct: 241 FQPNEQLKKKKKVSYVRRRRDVYRTLGPDSKAGSATVLAFGSLFTAPYKGSELYIDIHGV 300

Query: 301 SGDQRISSLMKNIEYLPFVPEILSAGKEYVDKIIKAPFLCAQLRLLDGQFKHHWKATFQG 360
           S DQRISSLMKNIEYLPFVPEILSAGKEY+DKIIKAPFLCAQLRLLDGQFK+HWKATF  
Sbjct: 301 SKDQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLA 360

Query: 361 LKQKLDSILENANEPIHVFVMTDLPESNWTGSYLGDLASDSNHFKLFFLKEHDELVLRAS 420
           L+QKL+SILENANEPI VFVMTDLPESNWTGSYLGDL SDSNHFKLFFLKEHDELVLRAS
Sbjct: 361 LQQKLNSILENANEPIRVFVMTDLPESNWTGSYLGDLDSDSNHFKLFFLKEHDELVLRAS 420

Query: 421 KKVMAVGHGLRWTSSAFGPSRIRDMKKKCASERLPDVLLYVEETVCSCASLGFVGTAGST 480
           KKVMAVGHGLRWTS+AFGP RIR+MKK+CA ERLPDVLLY+EETVCSCASLGFVGTAGST
Sbjct: 421 KKVMAVGHGLRWTSNAFGPGRIRNMKKECAPERLPDVLLYIEETVCSCASLGFVGTAGST 480

Query: 481 IAESIELMRKY 487
           IAESIELMRKY
Sbjct: 481 IAESIELMRKY 491

BLAST of Cla97C01G004420 vs. NCBI nr
Match: XP_022972968.1 (O-fucosyltransferase 30 [Cucurbita maxima] >XP_022972969.1 O-fucosyltransferase 30 [Cucurbita maxima])

HSP 1 Score: 859.8 bits (2220), Expect = 4.7e-246
Identity = 430/495 (86.87%), Postives = 453/495 (91.52%), Query Frame = 0

Query: 1   MNISGSSRSSINRWNSKKSNLQLRRFSLSVIVLLFCSLFLLYL--SSSSFMSSTAFSTSN 60
           M I GSSRS IN WNSKKSN   RRFSL V+ LLFCS FLLYL  S SSFM STAFSTSN
Sbjct: 1   MKIFGSSRSPINGWNSKKSNFLHRRFSLPVLALLFCSFFLLYLFSSYSSFMPSTAFSTSN 60

Query: 61  SRQCNPQILDLGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVALGS 120
           S QC+ +ILDLGE+FLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVALGS
Sbjct: 61  SCQCSSRILDLGERFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVALGS 120

Query: 121 CPKFRVPDPGEIRFSVWEHMLELLRIGRYVSMADIVDISSLTSYSSVKAIDFRTFAYLWC 180
           CPKFRV DPGEIRFSVWEHMLELLR GRYVSMADIVDISSL SYSS+KAIDFRTFAYLWC
Sbjct: 121 CPKFRVTDPGEIRFSVWEHMLELLRNGRYVSMADIVDISSLASYSSIKAIDFRTFAYLWC 180

Query: 181 GVHLESVCSNEYNLKHCGRLLAGLDGNVDKCLHAVDEDCRTTVWSYQNGEVDGALDLFQP 240
           GV LESVCSNEYNLK CGRLLAGLDGNVDKCLHAVDEDCRTTVW+Y+N EVDG LD+FQP
Sbjct: 181 GVDLESVCSNEYNLKQCGRLLAGLDGNVDKCLHAVDEDCRTTVWTYKNREVDGVLDVFQP 240

Query: 241 NEQLKKKKKVSYVRRRRDVYRTLGPNSKAESATVLAFGSLFTAPYKGSELYIDIHEVSGD 300
           +EQLKKKK VSYVRRRRDVYR+LGP+SKAE A VL+FGSLFTAPYKGSELYIDIHEVSGD
Sbjct: 241 SEQLKKKKNVSYVRRRRDVYRSLGPDSKAELAAVLSFGSLFTAPYKGSELYIDIHEVSGD 300

Query: 301 QRISSLMKNIEYLPFVPEILSAGKEYVDKIIKAPFLCAQLRLLDGQFKHHWKATFQGLKQ 360
           QRISSL+K+IEYLPFVPEILSAGKEY+DKIIKAPFLCAQLRLLDGQFK+HW ATF  L+Q
Sbjct: 301 QRISSLIKSIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWNATFMALEQ 360

Query: 361 KLDSILENANEPIHVFVMTDLPESNWTGSYLGDLASDSNHFKLFFLKEHDELVLRASKKV 420
           KLDSIL++ N+P+HVFVMTDLPESNWTGSYL  LA DSNHFKLF LKEHDELV RASKKV
Sbjct: 361 KLDSILQDGNKPVHVFVMTDLPESNWTGSYLRHLAMDSNHFKLFLLKEHDELVQRASKKV 420

Query: 421 MAVGHGLRWTSSAFGPSRIRDMKKKCASERLPDVLLYVEETVCSCASLGFVGTAGSTIAE 480
           MAVGHGLR TSSAFGPSRI DMK KC SERLPD+LLY+EETVCSCASLGFVGTAGSTIAE
Sbjct: 421 MAVGHGLRSTSSAFGPSRIHDMKDKCTSERLPDILLYIEETVCSCASLGFVGTAGSTIAE 480

Query: 481 SIELMRKYRLRSGQN 494
           SIELMRKY L S QN
Sbjct: 481 SIELMRKYGLCSDQN 495

BLAST of Cla97C01G004420 vs. NCBI nr
Match: XP_022922425.1 (O-fucosyltransferase 30-like [Cucurbita moschata] >XP_022922426.1 O-fucosyltransferase 30-like [Cucurbita moschata])

HSP 1 Score: 852.0 bits (2200), Expect = 9.7e-244
Identity = 428/495 (86.46%), Postives = 449/495 (90.71%), Query Frame = 0

Query: 1   MNISGSSRSSINRWNSKKSNLQLRRFSLSVIVLLFCSLFLLYL--SSSSFMSSTAFSTSN 60
           MNI GSSRS IN WNSKKS    RRFSL V  LL  S FLLYL  S SSFM STAFSTSN
Sbjct: 1   MNIFGSSRSPINGWNSKKSKFLHRRFSLPVFALLSGSFFLLYLFSSYSSFMPSTAFSTSN 60

Query: 61  SRQCNPQILDLGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVALGS 120
           SR+C+ +ILDLGE+FLFYAPHSGFSNQLSEFKNAILMA ILNRTLVVPPILDHHAVALGS
Sbjct: 61  SRKCSSRILDLGERFLFYAPHSGFSNQLSEFKNAILMAWILNRTLVVPPILDHHAVALGS 120

Query: 121 CPKFRVPDPGEIRFSVWEHMLELLRIGRYVSMADIVDISSLTSYSSVKAIDFRTFAYLWC 180
           CPKFRV DPGEIRFSVWEHMLELLR GRYVSMADIVDISSL SYSS+KAIDFRTFAYLWC
Sbjct: 121 CPKFRVTDPGEIRFSVWEHMLELLRNGRYVSMADIVDISSLASYSSIKAIDFRTFAYLWC 180

Query: 181 GVHLESVCSNEYNLKHCGRLLAGLDGNVDKCLHAVDEDCRTTVWSYQNGEVDGALDLFQP 240
           GV LESVCSNEYNLK CGRLLAGLDGNVDKCLHAVDEDCRTTVW+Y+NGEVDGALD+FQP
Sbjct: 181 GVDLESVCSNEYNLKQCGRLLAGLDGNVDKCLHAVDEDCRTTVWTYKNGEVDGALDVFQP 240

Query: 241 NEQLKKKKKVSYVRRRRDVYRTLGPNSKAESATVLAFGSLFTAPYKGSELYIDIHEVSGD 300
           NEQLKKKK VSYVRRRRDVYR LGP+SKAE A VL+FGSLFTAPYKGSELYIDIHEVSGD
Sbjct: 241 NEQLKKKKNVSYVRRRRDVYRALGPDSKAELAAVLSFGSLFTAPYKGSELYIDIHEVSGD 300

Query: 301 QRISSLMKNIEYLPFVPEILSAGKEYVDKIIKAPFLCAQLRLLDGQFKHHWKATFQGLKQ 360
           QRISSL+K+IEYLPFVPEILSAGKEY+DKIIKAPFLCAQLRLLDGQFK+HW ATF  L+Q
Sbjct: 301 QRISSLIKSIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWNATFLALEQ 360

Query: 361 KLDSILENANEPIHVFVMTDLPESNWTGSYLGDLASDSNHFKLFFLKEHDELVLRASKKV 420
           KLD IL++ NEP+HVFVMTDLPESNWTGSYL  LA DSNHFKLF LKEHDELV RASKKV
Sbjct: 361 KLDLILQDGNEPVHVFVMTDLPESNWTGSYLRHLARDSNHFKLFLLKEHDELVQRASKKV 420

Query: 421 MAVGHGLRWTSSAFGPSRIRDMKKKCASERLPDVLLYVEETVCSCASLGFVGTAGSTIAE 480
           M VGHGLR TSSAFGPSRI DMK KC SERLPD+LLY+EETVCSCASLGF+GTAGSTIAE
Sbjct: 421 MVVGHGLRSTSSAFGPSRIHDMKNKCTSERLPDILLYIEETVCSCASLGFLGTAGSTIAE 480

Query: 481 SIELMRKYRLRSGQN 494
           SIELMRKY L S QN
Sbjct: 481 SIELMRKYGLCSDQN 495

BLAST of Cla97C01G004420 vs. NCBI nr
Match: XP_022958126.1 (O-fucosyltransferase 30-like isoform X2 [Cucurbita moschata])

HSP 1 Score: 830.5 bits (2144), Expect = 3.0e-237
Identity = 416/499 (83.37%), Postives = 445/499 (89.18%), Query Frame = 0

Query: 1   MNISGSSRSSINRWNSKKSNLQLR--RF---SLSVIVLLFCSLFLLYLSSSSFMSSTAFS 60
           MN+ GS+RS I +W +KKSN Q R  RF                        F+SSTAFS
Sbjct: 45  MNVFGSNRSKIKQWYAKKSNSQFRFSRFXXXXXXXXXXXXXXXXXXXXXXXXFISSTAFS 104

Query: 61  TSNSRQCNPQILDLGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVA 120
           T +SRQCN +ILDLGE+FLFYAPHSGF+NQLSEFKNAILMAGILNRTLV+PPILDHHAVA
Sbjct: 105 TYDSRQCNSRILDLGERFLFYAPHSGFNNQLSEFKNAILMAGILNRTLVIPPILDHHAVA 164

Query: 121 LGSCPKFRVPDPGEIRFSVWEHMLELLRIGRYVSMADIVDISSLTSYSSVKAIDFRTFAY 180
           LGSCPKFRV DPGEIRFSVWEHM ELLR GRYVSMADIVDISSL SY+S+KAIDFRTFAY
Sbjct: 165 LGSCPKFRVLDPGEIRFSVWEHMFELLRDGRYVSMADIVDISSLASYTSIKAIDFRTFAY 224

Query: 181 LWCGVHLESVCSNEYNLKHCGRLLAGLDGNVDKCLHAVDEDCRTTVWSYQNGEVDGALDL 240
           LWCGV LESVCSNE+NLK CGRLLAGLDGNVDKCLHAVDEDCRTTVW+YQNGEVDG LDL
Sbjct: 225 LWCGVDLESVCSNEFNLKQCGRLLAGLDGNVDKCLHAVDEDCRTTVWTYQNGEVDGVLDL 284

Query: 241 FQPNEQLKKKKKVSYVRRRRDVYRTLGPNSKAESATVLAFGSLFTAPYKGSELYIDIHEV 300
           FQPNEQLKKKKKV+YVRRRRDVYRTLGP+S+AESATVLAFGSLFTAPYK SELYIDIHEV
Sbjct: 285 FQPNEQLKKKKKVTYVRRRRDVYRTLGPDSEAESATVLAFGSLFTAPYKSSELYIDIHEV 344

Query: 301 SGDQRISSLMKNIEYLPFVPEILSAGKEYVDKIIKAPFLCAQLRLLDGQFKHHWKATFQG 360
            GDQRISSLMKNIE+LPFVPEILSAGKEY+DKIIKAPFLCAQLRLLDGQFK+HWKATF  
Sbjct: 345 RGDQRISSLMKNIEHLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLA 404

Query: 361 LKQKLDSILENANE-PIHVFVMTDLPESNWTGSYLGDLASDSNHFKLFFLKEHDELVLRA 420
           LKQKLDSIL+NANE PIH+FVMTDLPESNWTGSYLGDL SDSN FKLFFLKEHDELV+RA
Sbjct: 405 LKQKLDSILQNANEQPIHIFVMTDLPESNWTGSYLGDLVSDSNRFKLFFLKEHDELVVRA 464

Query: 421 SKKVMAVGHGLRWTSSAFGPSRIRDMKKKCASERLPDVLLYVEETVCSCASLGFVGTAGS 480
           S+KVMAVGHGLR  SSAFGP RIRDMKKKCA+E+LPD+LLY+EETVCSCASLGF+GTAGS
Sbjct: 465 SEKVMAVGHGLRLASSAFGPGRIRDMKKKCAAEKLPDILLYIEETVCSCASLGFIGTAGS 524

Query: 481 TIAESIELMRKYRLRSGQN 494
           TIAESIELM KY L SGQN
Sbjct: 525 TIAESIELMSKYGLCSGQN 543

BLAST of Cla97C01G004420 vs. TrEMBL
Match: tr|A0A1S3AT28|A0A1S3AT28_CUCME (uncharacterized protein LOC103482591 OS=Cucumis melo OX=3656 GN=LOC103482591 PE=4 SV=1)

HSP 1 Score: 860.5 bits (2222), Expect = 1.8e-246
Identity = 433/491 (88.19%), Postives = 448/491 (91.24%), Query Frame = 0

Query: 1   MNISGSSRSSINRWNSKKSNLQLRRFSLSVIVLLFC----SLFLLYLSSSSFMSSTAFST 60
           MN+ GS+RSSINRWNSKK NLQLRRFSLSV VLLFC                        
Sbjct: 1   MNVFGSTRSSINRWNSKKPNLQLRRFSLSVFVLLFCFXXXXXXXXXXXXXXXXXXXXXXX 60

Query: 61  SNSRQCNPQILDLGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL 120
           SNSRQCN QIL LGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL
Sbjct: 61  SNSRQCNTQILGLGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL 120

Query: 121 GSCPKFRVPDPGEIRFSVWEHMLELLRIGRYVSMADIVDISSLTSYSSVKAIDFRTFAYL 180
           GSCPKFRVPDPGEIRFSVWEHML+LLR GRYVSM DIVDISSLTSYSSVKAIDFRTFAYL
Sbjct: 121 GSCPKFRVPDPGEIRFSVWEHMLQLLRNGRYVSMTDIVDISSLTSYSSVKAIDFRTFAYL 180

Query: 181 WCGVHLESVCSNEY-NLKHCGRLLAGLDGNVDKCLHAVDEDCRTTVWSYQNGEVDGALDL 240
           WCGVHLESVCSNEY NLK CGRLLAGLDGNVDKCLHAVDEDC+TTVW+YQ+ EVDGALDL
Sbjct: 181 WCGVHLESVCSNEYNNLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQSNEVDGALDL 240

Query: 241 FQPNEQLKKKKKVSYVRRRRDVYRTLGPNSKAESATVLAFGSLFTAPYKGSELYIDIHEV 300
           FQPNEQLKKKKKVSYVRRRRDVYRTLGP+SKA SATVLAFGSLFTAPYKGSELYIDIH V
Sbjct: 241 FQPNEQLKKKKKVSYVRRRRDVYRTLGPDSKAGSATVLAFGSLFTAPYKGSELYIDIHGV 300

Query: 301 SGDQRISSLMKNIEYLPFVPEILSAGKEYVDKIIKAPFLCAQLRLLDGQFKHHWKATFQG 360
           S DQRISSLMKNIEYLPFVPEILSAGKEY+DKIIKAPFLCAQLRLLDGQFK+HWKATF  
Sbjct: 301 SKDQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLA 360

Query: 361 LKQKLDSILENANEPIHVFVMTDLPESNWTGSYLGDLASDSNHFKLFFLKEHDELVLRAS 420
           L+QKL+SILENANEPI VFVMTDLPESNWTGSYLGDL SDSNHFKLFFLKEHDELVLRAS
Sbjct: 361 LQQKLNSILENANEPIRVFVMTDLPESNWTGSYLGDLDSDSNHFKLFFLKEHDELVLRAS 420

Query: 421 KKVMAVGHGLRWTSSAFGPSRIRDMKKKCASERLPDVLLYVEETVCSCASLGFVGTAGST 480
           KKVMAVGHGLRWTS+AFGP RIR+MKK+CA ERLPDVLLY+EETVCSCASLGFVGTAGST
Sbjct: 421 KKVMAVGHGLRWTSNAFGPGRIRNMKKECAPERLPDVLLYIEETVCSCASLGFVGTAGST 480

Query: 481 IAESIELMRKY 487
           IAESIELMRKY
Sbjct: 481 IAESIELMRKY 491

BLAST of Cla97C01G004420 vs. TrEMBL
Match: tr|A0A0A0KPB8|A0A0A0KPB8_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G165220 PE=4 SV=1)

HSP 1 Score: 810.1 bits (2091), Expect = 2.8e-231
Identity = 412/491 (83.91%), Postives = 427/491 (86.97%), Query Frame = 0

Query: 1   MNISGSSRSSINRWNSK----KSNLQLRRFSLSVIVLLFCSLFLLYLSSSSFMSSTAFST 60
           MN  GS+RSSINRWNSK                                           
Sbjct: 1   MNAFGSTRSSINRWNSKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 60

Query: 61  SNSRQCNPQILDLGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL 120
             SRQCN QIL LGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL
Sbjct: 61  XXSRQCNTQILALGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL 120

Query: 121 GSCPKFRVPDPGEIRFSVWEHMLELLRIGRYVSMADIVDISSLTSYSSVKAIDFRTFAYL 180
           GSCPKFRVPDPGEIRFSVWEHML+LLR GRYVSMADIVDISSLTSYSSVKAIDFRTFAYL
Sbjct: 121 GSCPKFRVPDPGEIRFSVWEHMLQLLRNGRYVSMADIVDISSLTSYSSVKAIDFRTFAYL 180

Query: 181 WCGVHLESVCSNEY-NLKHCGRLLAGLDGNVDKCLHAVDEDCRTTVWSYQNGEVDGALDL 240
           WCGV LESVC+NEY NLK CGRLLAGLDGNVDKCLHAVDEDC+TTVW+YQN EVDGALDL
Sbjct: 181 WCGVRLESVCANEYNNLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQNNEVDGALDL 240

Query: 241 FQPNEQLKKKKKVSYVRRRRDVYRTLGPNSKAESATVLAFGSLFTAPYKGSELYIDIHEV 300
           FQPNEQLKKKKKVSYVRRRRDVYRTLG +SKA SATVLAFGSLFTAPY+GSELYIDIH V
Sbjct: 241 FQPNEQLKKKKKVSYVRRRRDVYRTLGRDSKAGSATVLAFGSLFTAPYRGSELYIDIHGV 300

Query: 301 SGDQRISSLMKNIEYLPFVPEILSAGKEYVDKIIKAPFLCAQLRLLDGQFKHHWKATFQG 360
           S DQRISSLMKNIEYLPFVPEILSAGKEY+DKIIKAPFLCAQLRLLDGQFK+HWKATF  
Sbjct: 301 SKDQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLA 360

Query: 361 LKQKLDSILENANEPIHVFVMTDLPESNWTGSYLGDLASDSNHFKLFFLKEHDELVLRAS 420
           L+QKLDSILENANEPIHVFVMTDLP+SNWTGSYLGDL SDSNHFKLFFL+E DELVLRAS
Sbjct: 361 LQQKLDSILENANEPIHVFVMTDLPKSNWTGSYLGDLDSDSNHFKLFFLEESDELVLRAS 420

Query: 421 KKVMAVGHGLRWTSSAFGPSRIRDMKKKCASERLPDVLLYVEETVCSCASLGFVGTAGST 480
           KKVMAVGHGLRWTS+AFGP  IRDMKKKCASE+LPDVLLY+EETVCSCASLGFVGTAGST
Sbjct: 421 KKVMAVGHGLRWTSNAFGPGSIRDMKKKCASEKLPDVLLYIEETVCSCASLGFVGTAGST 480

Query: 481 IAESIELMRKY 487
           IAESIELMRKY
Sbjct: 481 IAESIELMRKY 491

BLAST of Cla97C01G004420 vs. TrEMBL
Match: tr|A0A2N9F0Q0|A0A2N9F0Q0_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS8472 PE=4 SV=1)

HSP 1 Score: 651.7 bits (1680), Expect = 1.3e-183
Identity = 335/499 (67.13%), Postives = 395/499 (79.16%), Query Frame = 0

Query: 6   SSRSSINRWNSKKSNLQLRRFSLSVIVLLFCSL--FLLYLSSSSFMSSTAFSTSNSRQC- 65
           S+R +   W +KK++ +   FSLS++ LL  S+  FL Y S S          S   QC 
Sbjct: 7   SNRPTPKPWPNKKASHRSPLFSLSLLTLLIFSIIFFLSYYSLSPISPFKQTLNSQFPQCP 66

Query: 66  NPQILDLGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVALGSCPKF 125
             Q L LGEKFL+YAPHSGFSNQLSEFKNAIL+AGILNRTL+VPPILDHHA+ALGSCPKF
Sbjct: 67  TSQSLTLGEKFLWYAPHSGFSNQLSEFKNAILLAGILNRTLIVPPILDHHAIALGSCPKF 126

Query: 126 RVPDPGEIRFSVWEHMLELLRIGRYVSMADIVDISSLTSYSSVKAIDFRTFAYLWCGVHL 185
           RV  P +IR +VW H+LELLR GRYVSMADI+DISSL S S V+AIDFR FA LWCGV  
Sbjct: 127 RVSAPNDIRVAVWNHVLELLRTGRYVSMADIIDISSLVSSSVVQAIDFRVFASLWCGVK- 186

Query: 186 ESVCSNEYN--------LKHCGRLLAGLDGNVDKCLHAVDEDCRTTVWSYQNGEVDGALD 245
           +  C NE N        LK CG LL+GL+GNVDKC++A+ EDCRTTVW+YQNG+ DG LD
Sbjct: 187 DFDCFNESNEQLALLDRLKQCGSLLSGLNGNVDKCIYAIGEDCRTTVWTYQNGDKDGVLD 246

Query: 246 LFQPNEQLKKKKKVSYVRRRRDVYRTLGPNSKAESATVLAFGSLFTAPYKGSELYIDIHE 305
            FQP+EQLK+KKKVS+VRRRRDVY+TLGP S AESAT+LAFGSLFT+PY+GSELYIDIHE
Sbjct: 247 SFQPDEQLKRKKKVSFVRRRRDVYQTLGPGSAAESATLLAFGSLFTSPYRGSELYIDIHE 306

Query: 306 VSGDQRISSLMKNIEYLPFVPEILSAGKEYVDKIIKAPFLCAQLRLLDGQFKHHWKATFQ 365
              DQRI SL+  IE+LPFVPEI+SAGKE+ DK I APFLCAQLRLLDGQFK+HWKATF 
Sbjct: 307 APRDQRIQSLIGKIEFLPFVPEIMSAGKEFADKNINAPFLCAQLRLLDGQFKNHWKATFL 366

Query: 366 GLKQKLDSILENANEPIHVFVMTDLPESNWTGSYLGDLASDSNHFKLFFLKEHDELVLRA 425
            LKQK++S L  ++ PIH+F+MTDLPE NWTGSYLGDLA DS+HFKL FL+  DELV++ 
Sbjct: 367 SLKQKVES-LGQSSLPIHIFMMTDLPEGNWTGSYLGDLARDSHHFKLHFLRGEDELVIKT 426

Query: 426 SKKVMAVGHGLRWTSSAFGPSRIRDMKKKCASERLPDVLLYVEETVCSCASLGFVGTAGS 485
           +KKV+A  HGLR+   AF P  +  +KK C+SERLPDVLL++EE VCSCASLGFVGTAGS
Sbjct: 427 AKKVVAASHGLRF---AFVPESMGGLKKHCSSERLPDVLLFIEEAVCSCASLGFVGTAGS 486

Query: 486 TIAESIELMRKYRLRSGQN 494
           TIAES+ELMRK+R  + Q+
Sbjct: 487 TIAESVELMRKFRTCASQS 500

BLAST of Cla97C01G004420 vs. TrEMBL
Match: tr|A0A061G4F7|A0A061G4F7_THECC (O-fucosyltransferase family protein, putative isoform 2 OS=Theobroma cacao OX=3641 GN=TCM_015927 PE=4 SV=1)

HSP 1 Score: 627.5 bits (1617), Expect = 2.6e-176
Identity = 310/461 (67.25%), Postives = 369/461 (80.04%), Query Frame = 0

Query: 40  LLYLSSSSFMSSTAFSTSNSR------QCNPQILDLGEKFLFYAPHSGFSNQLSEFKNAI 99
           L Y+S    + ST+  T N+        C  QI   GEKFL+YAPHSGFSNQLSEFKNAI
Sbjct: 46  LTYISIPKSLFSTSSKTVNAALSPQYPHCTTQI--PGEKFLWYAPHSGFSNQLSEFKNAI 105

Query: 100 LMAGILNRTLVVPPILDHHAVALGSCPKFRVPDPGEIRFSVWEHMLELLRIGRYVSMADI 159
           LMAGILNRTL+VPPILDHHAV LGSCPKFRV    EIR SVW+H+ EL+R  RYVSMADI
Sbjct: 106 LMAGILNRTLIVPPILDHHAVVLGSCPKFRVQSAKEIRLSVWDHINELIRSERYVSMADI 165

Query: 160 VDISSLTSYSSVKAIDFRTFAYLWCGVHLESVCSNEYN--------LKHCGRLLAGLDGN 219
           +DISSL S S V+AIDFR F  LWCG++++ VCSNE N        L+ CG LL+G+DGN
Sbjct: 166 IDISSLLSSSLVRAIDFRVFVSLWCGLNMDLVCSNELNAQQSMVGSLRQCGSLLSGIDGN 225

Query: 220 VDKCLHAVDEDCRTTVWSYQNGEVDGALDLFQPNEQLKKKKKVSYVRRRRDVYRTLGPNS 279
           +D+CL AVDEDCRTTVW+YQN EVDG LD FQP+EQLK KKK+SYVRRRR+VY+TLGP S
Sbjct: 226 IDRCLFAVDEDCRTTVWTYQNDEVDGVLDSFQPDEQLKNKKKISYVRRRRNVYKTLGPGS 285

Query: 280 KAESATVLAFGSLFTAPYKGSELYIDIHEVSGDQRISSLMKNIEYLPFVPEILSAGKEYV 339
           +AESATVLAFGSLFTAPYKGS+LYIDI +  GD +I SL+K IE+LPFVPEI+S+GK++ 
Sbjct: 286 EAESATVLAFGSLFTAPYKGSDLYIDIQKAPGDLKIKSLIKKIEFLPFVPEIISSGKQFA 345

Query: 340 DKIIKAPFLCAQLRLLDGQFKHHWKATFQGLKQKLDSILENANEPIHVFVMTDLPESNWT 399
            + IKAPFLCAQLRLLDGQFK+HWKATF GLKQKLDS+ +  + PIH+FVMTDLP+ NWT
Sbjct: 346 MQSIKAPFLCAQLRLLDGQFKNHWKATFLGLKQKLDSLRQAGSRPIHIFVMTDLPQGNWT 405

Query: 400 GSYLGDLASDSNHFKLFFLKEHDELVLRASKKVMAVGHGLRWTSSAFGPSRIRDMKKKCA 459
           GSYLGDLA DS +FKL+FL+E D  V++ +KK+   GHGLR+ S       +  ++K C+
Sbjct: 406 GSYLGDLARDSANFKLYFLRE-DLFVMKTAKKLALAGHGLRFESVPASLDAVAKLEKHCS 465

Query: 460 SERLPDVLLYVEETVCSCASLGFVGTAGSTIAESIELMRKY 487
            + +PDVLLY+EETVCSCASLGFVGTAGSTIAE+IE+MRKY
Sbjct: 466 PDIVPDVLLYIEETVCSCASLGFVGTAGSTIAETIEVMRKY 503

BLAST of Cla97C01G004420 vs. TrEMBL
Match: tr|A0A1R3GNI3|A0A1R3GNI3_COCAP (Uncharacterized protein OS=Corchorus capsularis OX=210143 GN=CCACVL1_24722 PE=4 SV=1)

HSP 1 Score: 624.4 bits (1609), Expect = 2.2e-175
Identity = 305/457 (66.74%), Postives = 369/457 (80.74%), Query Frame = 0

Query: 44  SSSSFMSSTAFSTSNSRQCNPQILDLGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTL 103
           S+SS   + A ST + R C  +I   GEKFL+++PHSGFSNQLSEFKNAI+MAGILNRTL
Sbjct: 57  STSSQTVNVAISTIHPR-CRTRI--PGEKFLWFSPHSGFSNQLSEFKNAIVMAGILNRTL 116

Query: 104 VVPPILDHHAVALGSCPKFRVPDPGEIRFSVWEHMLELLRIGRYVSMADIVDISSLTSYS 163
           +VPPILDHHAVALGSCPKFRV  P EIR SVW+H++EL+R GRYVSMADI+DISSL S S
Sbjct: 117 IVPPILDHHAVALGSCPKFRVQSPKEIRLSVWDHVIELIRSGRYVSMADIIDISSLLSSS 176

Query: 164 SVKAIDFRTFAYLWCGVHLESVCSNEY--------NLKHCGRLLAGLDGNVDKCLHAVDE 223
            V+AIDFR F  LWCG+ +   CSNE         +LK CG LL+GLDGN+D+CL AVDE
Sbjct: 177 LVRAIDFRVFVSLWCGLDMTLACSNELDANQSMVDSLKQCGSLLSGLDGNIDRCLFAVDE 236

Query: 224 DCRTTVWSYQNGEVDGALDLFQPNEQLKKKKKVSYVRRRRDVYRTLGPNSKAESATVLAF 283
           DCRTTVW YQN EVDGALD FQP+EQLKKKKK+S+VR R+DVY+TLGP S+A++ATVLAF
Sbjct: 237 DCRTTVWMYQNDEVDGALDSFQPDEQLKKKKKISFVRTRKDVYKTLGPGSEADTATVLAF 296

Query: 284 GSLFTAPYKGSELYIDIHEVSGDQRISSLMKNIEYLPFVPEILSAGKEYVDKIIKAPFLC 343
           GSLFTAPYKGSELYIDI +  GD RI SL++ IE+LPFVPEI+++GK++  + IKAPFLC
Sbjct: 297 GSLFTAPYKGSELYIDIQKAPGDPRIKSLLEKIEFLPFVPEIINSGKQFSVQTIKAPFLC 356

Query: 344 AQLRLLDGQFKHHWKATFQGLKQKLDSILENANEPIHVFVMTDLPESNWTGSYLGDLASD 403
           AQLRLLDGQFK+HWKATF  LKQKLDS+ + ++ PIH+FVMTDLP+ NWTG+YLGDLA D
Sbjct: 357 AQLRLLDGQFKNHWKATFSSLKQKLDSLRQASSLPIHIFVMTDLPQGNWTGTYLGDLAKD 416

Query: 404 SNHFKLFFLKEHDELVLRASKKVMAVGHGLRWTSSAFGPSRIRDMKKKCASERLPDVLLY 463
           S +FKL+FL+E D LV    KK+   GHGLR+ S       + +++K CA  +LPDVLL+
Sbjct: 417 STNFKLYFLREEDLLVKETEKKLALAGHGLRFGSLPGSKDAVANLEKHCAPNKLPDVLLF 476

Query: 464 VEETVCSCASLGFVGTAGSTIAESIELMRKYRLRSGQ 493
           +EE VCSCAS+GFVGTAGSTIAE+IE++RK+   S Q
Sbjct: 477 LEEIVCSCASIGFVGTAGSTIAETIEVIRKFGSCSSQ 510

BLAST of Cla97C01G004420 vs. Swiss-Prot
Match: sp|Q1JPM5|OFT30_ARATH (O-fucosyltransferase 30 OS=Arabidopsis thaliana OX=3702 GN=OFUT30 PE=2 SV=1)

HSP 1 Score: 555.1 bits (1429), Expect = 8.0e-157
Identity = 295/502 (58.76%), Postives = 363/502 (72.31%), Query Frame = 0

Query: 2   NISGSSRSSINRW-NSKKSNLQLRRFSLSVIVLLFCSLFLLYLSSS----SFMSSTAFST 61
           N    SR S   W N KK   +   F  SV +L+           S    S  S +AFS 
Sbjct: 3   NFFNPSRPSPRPWPNRKKQTDKSAIFLCSVSILVXXXXXXXXXXYSEMPKSLFSISAFSG 62

Query: 62  S-NSRQCNPQILD---LGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHH 121
           S    QC  +IL    LG+KFL+YAPHSGFSNQLSEFKNA+LMAGILNRTL++PPILDHH
Sbjct: 63  SVQFPQCRSEILTRTLLGQKFLWYAPHSGFSNQLSEFKNALLMAGILNRTLIIPPILDHH 122

Query: 122 AVALGSCPKFRVPDPGEIRFSVWEHMLELLRIGRYVSMADIVDISSLTSYSSVKAIDFRT 181
           AVALGSCPKFRV  P EIR SVW H +ELL+  RYVSMADIVDISSL S S+V+ IDFR 
Sbjct: 123 AVALGSCPKFRVLSPSEIRISVWNHSIELLKTDRYVSMADIVDISSLVSSSAVRVIDFRY 182

Query: 182 FAYLWCGVHLESVCSNEY--------NLKHCGRLLAGLDGNVDKCLHAVDEDCRTTVWSY 241
           FA L CGV LE++C+++         +LK CG LL+G+ GNVDKCL+AVDEDCRTTVW+Y
Sbjct: 183 FASLQCGVDLETLCTDDLAEQSQAYESLKQCGYLLSGVRGNVDKCLYAVDEDCRTTVWTY 242

Query: 242 QNGEVDGALDLFQPNEQLKKKKKVSYVRRRRDVYRTLGPNSKAESATVLAFGSLFTAPYK 301
           +NGE DG LD FQP+E+LKKKKK+S VRRRRDVY+TLG  ++AESA +LAFGSLFTAPYK
Sbjct: 243 KNGEADGRLDSFQPDEKLKKKKKLSNVRRRRDVYKTLGHGTEAESAAILAFGSLFTAPYK 302

Query: 302 GSELYIDIHEVSGDQRISSLMKNIEYLPFVPEILSAGKEYVDKIIKAPFLCAQLRLLDGQ 361
           GSELYIDIH+     +I SL++ +++LPFV EI+ AGK++  + IKAPFLCAQLRLLDGQ
Sbjct: 303 GSELYIDIHK---SPKIKSLVEKVDFLPFVREIMIAGKKFASETIKAPFLCAQLRLLDGQ 362

Query: 362 FKHHWKATFQGLKQKLDSILENANEPIHVFVMTDLPESNWTGSYLGDLASDSNHFKLFFL 421
           FK+H ++TF GL QKL+++       I+VFVMTDLPE NWTG+YLGDL+ +S +FKL F+
Sbjct: 363 FKNHRESTFTGLYQKLEALSVKNPGLINVFVMTDLPEFNWTGTYLGDLSKNSTNFKLHFI 422

Query: 422 KEHDELVLRASKKVMAVGHGLRWTSSAFGPSRIRDMKKKCASERLPDVLLYVEETVCSCA 481
            E DE + R   ++ +  HG ++ S       I+ M+  C      +V LY+EE VCSCA
Sbjct: 423 GEQDEFLARTEHELDSASHGQKFGSIPMSLDSIKKMQTHCYPHGGSNVQLYIEEAVCSCA 482

Query: 482 SLGFVGTAGSTIAESIELMRKY 487
           SLGFVGT GSTIA+S+E+MRKY
Sbjct: 483 SLGFVGTPGSTIADSVEMMRKY 501

BLAST of Cla97C01G004420 vs. TAIR10
Match: AT4G17430.1 (O-fucosyltransferase family protein)

HSP 1 Score: 555.1 bits (1429), Expect = 4.5e-158
Identity = 295/502 (58.76%), Postives = 363/502 (72.31%), Query Frame = 0

Query: 2   NISGSSRSSINRW-NSKKSNLQLRRFSLSVIVLLFCSLFLLYLSSS----SFMSSTAFST 61
           N    SR S   W N KK   +   F  SV +L+           S    S  S +AFS 
Sbjct: 3   NFFNPSRPSPRPWPNRKKQTDKSAIFLCSVSILVXXXXXXXXXXYSEMPKSLFSISAFSG 62

Query: 62  S-NSRQCNPQILD---LGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHH 121
           S    QC  +IL    LG+KFL+YAPHSGFSNQLSEFKNA+LMAGILNRTL++PPILDHH
Sbjct: 63  SVQFPQCRSEILTRTLLGQKFLWYAPHSGFSNQLSEFKNALLMAGILNRTLIIPPILDHH 122

Query: 122 AVALGSCPKFRVPDPGEIRFSVWEHMLELLRIGRYVSMADIVDISSLTSYSSVKAIDFRT 181
           AVALGSCPKFRV  P EIR SVW H +ELL+  RYVSMADIVDISSL S S+V+ IDFR 
Sbjct: 123 AVALGSCPKFRVLSPSEIRISVWNHSIELLKTDRYVSMADIVDISSLVSSSAVRVIDFRY 182

Query: 182 FAYLWCGVHLESVCSNEY--------NLKHCGRLLAGLDGNVDKCLHAVDEDCRTTVWSY 241
           FA L CGV LE++C+++         +LK CG LL+G+ GNVDKCL+AVDEDCRTTVW+Y
Sbjct: 183 FASLQCGVDLETLCTDDLAEQSQAYESLKQCGYLLSGVRGNVDKCLYAVDEDCRTTVWTY 242

Query: 242 QNGEVDGALDLFQPNEQLKKKKKVSYVRRRRDVYRTLGPNSKAESATVLAFGSLFTAPYK 301
           +NGE DG LD FQP+E+LKKKKK+S VRRRRDVY+TLG  ++AESA +LAFGSLFTAPYK
Sbjct: 243 KNGEADGRLDSFQPDEKLKKKKKLSNVRRRRDVYKTLGHGTEAESAAILAFGSLFTAPYK 302

Query: 302 GSELYIDIHEVSGDQRISSLMKNIEYLPFVPEILSAGKEYVDKIIKAPFLCAQLRLLDGQ 361
           GSELYIDIH+     +I SL++ +++LPFV EI+ AGK++  + IKAPFLCAQLRLLDGQ
Sbjct: 303 GSELYIDIHK---SPKIKSLVEKVDFLPFVREIMIAGKKFASETIKAPFLCAQLRLLDGQ 362

Query: 362 FKHHWKATFQGLKQKLDSILENANEPIHVFVMTDLPESNWTGSYLGDLASDSNHFKLFFL 421
           FK+H ++TF GL QKL+++       I+VFVMTDLPE NWTG+YLGDL+ +S +FKL F+
Sbjct: 363 FKNHRESTFTGLYQKLEALSVKNPGLINVFVMTDLPEFNWTGTYLGDLSKNSTNFKLHFI 422

Query: 422 KEHDELVLRASKKVMAVGHGLRWTSSAFGPSRIRDMKKKCASERLPDVLLYVEETVCSCA 481
            E DE + R   ++ +  HG ++ S       I+ M+  C      +V LY+EE VCSCA
Sbjct: 423 GEQDEFLARTEHELDSASHGQKFGSIPMSLDSIKKMQTHCYPHGGSNVQLYIEEAVCSCA 482

Query: 482 SLGFVGTAGSTIAESIELMRKY 487
           SLGFVGT GSTIA+S+E+MRKY
Sbjct: 483 SLGFVGTPGSTIADSVEMMRKY 501

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_023549514.11.9e-24787.70O-fucosyltransferase 30-like [Cucurbita pepo subsp. pepo][more]
XP_008437048.12.7e-24688.19PREDICTED: uncharacterized protein LOC103482591 [Cucumis melo][more]
XP_022972968.14.7e-24686.87O-fucosyltransferase 30 [Cucurbita maxima] >XP_022972969.1 O-fucosyltransferase ... [more]
XP_022922425.19.7e-24486.46O-fucosyltransferase 30-like [Cucurbita moschata] >XP_022922426.1 O-fucosyltrans... [more]
XP_022958126.13.0e-23783.37O-fucosyltransferase 30-like isoform X2 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
tr|A0A1S3AT28|A0A1S3AT28_CUCME1.8e-24688.19uncharacterized protein LOC103482591 OS=Cucumis melo OX=3656 GN=LOC103482591 PE=... [more]
tr|A0A0A0KPB8|A0A0A0KPB8_CUCSA2.8e-23183.91Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G165220 PE=4 SV=1[more]
tr|A0A2N9F0Q0|A0A2N9F0Q0_FAGSY1.3e-18367.13Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS8472 PE=4 SV=1[more]
tr|A0A061G4F7|A0A061G4F7_THECC2.6e-17667.25O-fucosyltransferase family protein, putative isoform 2 OS=Theobroma cacao OX=36... [more]
tr|A0A1R3GNI3|A0A1R3GNI3_COCAP2.2e-17566.74Uncharacterized protein OS=Corchorus capsularis OX=210143 GN=CCACVL1_24722 PE=4 ... [more]
Match NameE-valueIdentityDescription
sp|Q1JPM5|OFT30_ARATH8.0e-15758.76O-fucosyltransferase 30 OS=Arabidopsis thaliana OX=3702 GN=OFUT30 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
AT4G17430.14.5e-15858.76O-fucosyltransferase family protein[more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR019378GDP-Fuc_O-FucTrfase
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
biological_process GO:0006004 fucose metabolic process
biological_process GO:0008152 metabolic process
biological_process GO:0008150 biological_process
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0016020 membrane
cellular_component GO:0005768 endosome
cellular_component GO:0005802 trans-Golgi network
cellular_component GO:0005794 Golgi apparatus
cellular_component GO:0044424 intracellular part
cellular_component GO:0005575 cellular_component
molecular_function GO:0016740 transferase activity
molecular_function GO:0016757 transferase activity, transferring glycosyl groups
molecular_function GO:0016874 ligase activity
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C01G004420.1Cla97C01G004420.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR019378GDP-fucose protein O-fucosyltransferasePFAMPF10250O-FucTcoord: 73..345
e-value: 3.1E-10
score: 40.2
NoneNo IPR availablePANTHERPTHR36050FAMILY NOT NAMEDcoord: 12..489

The following gene(s) are paralogous to this gene:

None