ClCG01G004250 (gene) Watermelon (Charleston Gray)

Name: ClCG01G004250
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: O-fucosyltransferase family protein LENGTH=507
Location: CG_Chr01 : 4559172 .. 4561330 (-)
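The location field can be read programmatically. The following is a minimal sketch, not part of the original record: `parse_location` is a hypothetical helper, and the span calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("CG_Chr01 : 4559172 .. 4561330 (-)")

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)
```

The strand symbol `(-)` indicates that the feature is annotated on the reverse strand of CG_Chr01.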
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS
ACAAAGAGTAACGCCATGAATATTTCCGGTTCGAGCAGATCTTCGATCAATCGATGGAACTCAAAGAAATCCAATTTACAGCTCCGGCGTTTTTCTTTGTCTGTCATCGTTCTACTCTTCTGTTCCCTCTTTCTCCTCTACCTGTCCTCCTCCTCCTTCATGTCCTCAACTGCATTCTCCACTTCAAATTCACGTCAATGCAATCCCCAGATCTTAGATTTGGGTGAGAAATTTCTGTTTTACGCGCCTCACAGTGGGTTTAGCAACCAGCTTTCCGAGTTCAAGAATGCTATTCTAATGGCCGGGATTCTCAACCGGACTCTTGTTGTTCCGCCGATTCTGGATCACCATGCGGTAGCTCTCGGGAGTTGTCCGAAATTCCGAGTTCCGGATCCTGGTGAGATTCGATTTTCGGTTTGGGAGCACATGCTTGAGCTTCTTCGGATTGGAAGGTAATCATTTTCCCCTTGGTTTGTAATTTTTTTTTTTTTTTTAATTGTATTTTCAATGTCTTTATTTTTCTTGAACTGATGAGTGTTCGGGTTGTCGGGTTGCTGGATTGTTATGTAAGTTGCTTCTAAATCAAATAATTGAACCGTTTTGGTAAACTCGTAGCAATTCTAGCTTACTGTTTACTTTTCCATTCTTATCTTTTGTGATTGATTTAACAAGGAGATGAAAAGAACAAGGCTGAATTAGAAAAAAAAAAAGAAAAAAACCCTAAACCTTGTCATTTTTTTAAAAAATGCCCCCTTTTTTAAAATATTGCAATATTACCCTCCATGTTTCATTAACGTTGAAACTACCTTGATGTAAATGCATTAGAATGCTGGATGGAAAATGGGACTTGGAACAGTGTGGTGGTAAACTAAAACAGAAATTGGGACCGTCACACTGTTCTAAATCACCATCATAACATATCAAGTCTCAAATTTTATCCAATACTCTAACGAGTTTCTACTCCGTGGGTAGTTTTGAAACGGTAATTTTGTAACTTTTGAATGAACGAAGGACATTCGTTTAAGTTTAGGACTATTATTTTGTAATTTAGCCAAGAAAATAATTCTGTGGTAACGTTTTATGACATTAGCTATGCATTTCAATTTTATGGCAGGTACGTTTCCATGGCAGATATTGTAGATATTTCATCATTAACTTCTTACTCTTCTGTTAAAGCCATAGATTTTAGGACCTTTGCATACTTATGGTGTGGAGTGCATCTGGAAAGTGTTTGTTCAAATGAATATAACCTAAAGCATTGTGGTCGTCTACTAGCAGGGCTTGATGGGAATGTAGACAAATGTTTACATGCTGTAGATGAAGATTGCAGAACTACAGTTTGGAGTTACCAAAATGGTGAAGTTGATGGAGCATTAGACTTGTTTCAGCCTAACGAACAGCTTAAGAAGAAAAAGAAAGTGTCCTATGTCAGACGCCGTCGAGATGTATATAGAACCCTCGGACCCAATTCGAAAGCTGAATCAGCTACTGTTTTGGCATTTGGAAGTCTATTTACTGCTCCATACAAAGGTTCAGAGCTGTATATTGATATCCATGAAGTTAGTGGAGATCAAAGAATCAGTTCTTTGATGAAAAACATTGAGTATCTACCATTTGTCCCAGAAATCTTGAGTGCAGGAAAAGAGTATGTTGACAAGATCATAAAAGCTCCATTCCTTTGTGCTCAATTGAGATTGTTAGATGGGCAGTTCAAACATCACTGGAAGGCGACTTTTCAGGGCCTGAAACAGAAATTAGACTCTATATTAGAGAATGCTAATGAACCTATTCATGTTTTTGTGATGACTGATCTTCCTGAATCTAATTGGACTGGAAGCTACTTAGGGGATTTGGCTAGTGATTCAAATCACTTCAAACTCTTTTTTCTCAAAGAACACGATGAATTGGTTCTAAGAGCGTCTAAAAAGGTGATGGCTGTAGGACATGGCTTGAGATGGACATCTAGTGCATTTGGTCCTAGCAGAATTCGTGATATGAAG
AAGAAATGTGCTTCAGAAAGATTACCGGATGTTCTCTTATATGTAGAGGAAACTGTTTGCAGTTGTGCTTCACTTGGTTTTGTTGGTACTGCTGGTTCCACAATTGCTGAAAGCATTGAGCTGATGAGAAAATATAGACTACGTTCAGGTCAAAATTGA

mRNA sequence

ACAAAGAGTAACGCCATGAATATTTCCGGTTCGAGCAGATCTTCGATCAATCGATGGAACTCAAAGAAATCCAATTTACAGCTCCGGCGTTTTTCTTTGTCTGTCATCGTTCTACTCTTCTGTTCCCTCTTTCTCCTCTACCTGTCCTCCTCCTCCTTCATGTCCTCAACTGCATTCTCCACTTCAAATTCACGTCAATGCAATCCCCAGATCTTAGATTTGGGTGAGAAATTTCTGTTTTACGCGCCTCACAGTGGGTTTAGCAACCAGCTTTCCGAGTTCAAGAATGCTATTCTAATGGCCGGGATTCTCAACCGGACTCTTGTTGTTCCGCCGATTCTGGATCACCATGCGGTAGCTCTCGGGAGTTGTCCGAAATTCCGAGTTCCGGATCCTGGTGAGATTCGATTTTCGGTTTGGGAGCACATGCTTGAGCTTCTTCGGATTGGAAGGACCTTTGCATACTTATGGTGTGGAGTGCATCTGGAAAGTGTTTGTTCAAATGAATATAACCTAAAGCATTGTGGTCGTCTACTAGCAGGGCTTGATGGGAATGTAGACAAATGTTTACATGCTGTAGATGAAGATTGCAGAACTACAGTTTGGAGTTACCAAAATGGTGAAGTTGATGGAGCATTAGACTTGTTTCAGCCTAACGAACAGCTTAAGAAGAAAAAGAAAGTGTCCTATGTCAGACGCCGTCGAGATGTATATAGAACCCTCGGACCCAATTCGAAAGCTGAATCAGCTACTGTTTTGGCATTTGGAAGTCTATTTACTGCTCCATACAAAGGTTCAGAGCTGTATATTGATATCCATGAAGTTAGTGGAGATCAAAGAATCAGTTCTTTGATGAAAAACATTGAGTATCTACCATTTGTCCCAGAAATCTTGAGTGCAGGAAAAGAGTATGTTGACAAGATCATAAAAGCTCCATTCCTTTGTGCTCAATTGAGATTGTTAGATGGGCAGTTCAAACATCACTGGAAGGCGACTTTTCAGGGCCTGAAACAGAAATTAGACTCTATATTAGAGAATGCTAATGAACCTATTCATGTTTTTGTGATGACTGATCTTCCTGAATCTAATTGGACTGGAAGCTACTTAGGGGATTTGGCTAGTGATTCAAATCACTTCAAACTCTTTTTTCTCAAAGAACACGATGAATTGGTTCTAAGAGCGTCTAAAAAGGTGATGGCTGTAGGACATGGCTTGAGATGGACATCTAGTGCATTTGGTCCTAGCAGAATTCGTGATATGAAGAAGAAATGTGCTTCAGAAAGATTACCGGATGTTCTCTTATATGTAGAGGAAACTGTTTGCAGTTGTGCTTCACTTGGTTTTGTTGGTACTGCTGGTTCCACAATTGCTGAAAGCATTGAGCTGATGAGAAAATATAGACTACGTTCAGGTCAAAATTGA

Coding sequence (CDS)

ATGAATATTTCCGGTTCGAGCAGATCTTCGATCAATCGATGGAACTCAAAGAAATCCAATTTACAGCTCCGGCGTTTTTCTTTGTCTGTCATCGTTCTACTCTTCTGTTCCCTCTTTCTCCTCTACCTGTCCTCCTCCTCCTTCATGTCCTCAACTGCATTCTCCACTTCAAATTCACGTCAATGCAATCCCCAGATCTTAGATTTGGGTGAGAAATTTCTGTTTTACGCGCCTCACAGTGGGTTTAGCAACCAGCTTTCCGAGTTCAAGAATGCTATTCTAATGGCCGGGATTCTCAACCGGACTCTTGTTGTTCCGCCGATTCTGGATCACCATGCGGTAGCTCTCGGGAGTTGTCCGAAATTCCGAGTTCCGGATCCTGGTGAGATTCGATTTTCGGTTTGGGAGCACATGCTTGAGCTTCTTCGGATTGGAAGGACCTTTGCATACTTATGGTGTGGAGTGCATCTGGAAAGTGTTTGTTCAAATGAATATAACCTAAAGCATTGTGGTCGTCTACTAGCAGGGCTTGATGGGAATGTAGACAAATGTTTACATGCTGTAGATGAAGATTGCAGAACTACAGTTTGGAGTTACCAAAATGGTGAAGTTGATGGAGCATTAGACTTGTTTCAGCCTAACGAACAGCTTAAGAAGAAAAAGAAAGTGTCCTATGTCAGACGCCGTCGAGATGTATATAGAACCCTCGGACCCAATTCGAAAGCTGAATCAGCTACTGTTTTGGCATTTGGAAGTCTATTTACTGCTCCATACAAAGGTTCAGAGCTGTATATTGATATCCATGAAGTTAGTGGAGATCAAAGAATCAGTTCTTTGATGAAAAACATTGAGTATCTACCATTTGTCCCAGAAATCTTGAGTGCAGGAAAAGAGTATGTTGACAAGATCATAAAAGCTCCATTCCTTTGTGCTCAATTGAGATTGTTAGATGGGCAGTTCAAACATCACTGGAAGGCGACTTTTCAGGGCCTGAAACAGAAATTAGACTCTATATTAGAGAATGCTAATGAACCTATTCATGTTTTTGTGATGACTGATCTTCCTGAATCTAATTGGACTGGAAGCTACTTAGGGGATTTGGCTAGTGATTCAAATCACTTCAAACTCTTTTTTCTCAAAGAACACGATGAATTGGTTCTAAGAGCGTCTAAAAAGGTGATGGCTGTAGGACATGGCTTGAGATGGACATCTAGTGCATTTGGTCCTAGCAGAATTCGTGATATGAAGAAGAAATGTGCTTCAGAAAGATTACCGGATGTTCTCTTATATGTAGAGGAAACTGTTTGCAGTTGTGCTTCACTTGGTTTTGTTGGTACTGCTGGTTCCACAATTGCTGAAAGCATTGAGCTGATGAGAAAATATAGACTACGTTCAGGTCAAAATTGA

Protein sequence

MNISGSSRSSINRWNSKKSNLQLRRFSLSVIVLLFCSLFLLYLSSSSFMSSTAFSTSNSRQCNPQILDLGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVALGSCPKFRVPDPGEIRFSVWEHMLELLRIGRTFAYLWCGVHLESVCSNEYNLKHCGRLLAGLDGNVDKCLHAVDEDCRTTVWSYQNGEVDGALDLFQPNEQLKKKKKVSYVRRRRDVYRTLGPNSKAESATVLAFGSLFTAPYKGSELYIDIHEVSGDQRISSLMKNIEYLPFVPEILSAGKEYVDKIIKAPFLCAQLRLLDGQFKHHWKATFQGLKQKLDSILENANEPIHVFVMTDLPESNWTGSYLGDLASDSNHFKLFFLKEHDELVLRASKKVMAVGHGLRWTSSAFGPSRIRDMKKKCASERLPDVLLYVEETVCSCASLGFVGTAGSTIAESIELMRKYRLRSGQN
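The relationship between the mRNA, the CDS, and the protein sequence above can be checked with a short script. This is an illustrative sketch only: it uses just the first 45 bases of the mRNA, assumes the standard genetic code, and the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the database.

```python
# Standard genetic code, built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at a stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 45 bases of the mRNA sequence listed above.
mrna_prefix = "ACAAAGAGTAACGCCATGAATATTTCCGGTTCGAGCAGATCTTCG"

# The CDS starts at the first ATG; everything before it is the five_prime_UTR.
cds_start = mrna_prefix.index("ATG")
utr = mrna_prefix[:cds_start]
peptide = translate(mrna_prefix[cds_start:])
print(len(utr), peptide)
```

Running the same function on the full CDS should reproduce the protein sequence shown above, provided the standard genetic code applies.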
BLAST of ClCG01G004250 vs. TrEMBL
Match: A0A0A0KPB8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G165220 PE=4 SV=1)

HSP 1 Score: 827.8 bits (2137), Expect = 6.7e-237
Identity = 422/491 (85.95%), Positives = 437/491 (89.00%), Query Frame = 1

Query: 1   MNISGSSRSSINRWNSKKSNLQLRRFSLSVIVLLFCSLFLLYLSSS----SFMSSTAFST 60
           MN  GS+RSSINRWNSKK NLQL R SLSV  LLFC LFLLYLSSS    SFMSSTAFST
Sbjct: 1   MNAFGSTRSSINRWNSKKPNLQLPRISLSVCALLFCFLFLLYLSSSFSSSSFMSSTAFST 60

Query: 61  SNSRQCNPQILDLGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL 120
           SNSRQCN QIL LGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL
Sbjct: 61  SNSRQCNTQILALGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL 120

Query: 121 GSCPKFRVPDPGEIRFSVWEHMLELLRIGR-------------------------TFAYL 180
           GSCPKFRVPDPGEIRFSVWEHML+LLR GR                         TFAYL
Sbjct: 121 GSCPKFRVPDPGEIRFSVWEHMLQLLRNGRYVSMADIVDISSLTSYSSVKAIDFRTFAYL 180

Query: 181 WCGVHLESVCSNEYN-LKHCGRLLAGLDGNVDKCLHAVDEDCRTTVWSYQNGEVDGALDL 240
           WCGV LESVC+NEYN LK CGRLLAGLDGNVDKCLHAVDEDC+TTVW+YQN EVDGALDL
Sbjct: 181 WCGVRLESVCANEYNNLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQNNEVDGALDL 240

Query: 241 FQPNEQLKKKKKVSYVRRRRDVYRTLGPNSKAESATVLAFGSLFTAPYKGSELYIDIHEV 300
           FQPNEQLKKKKKVSYVRRRRDVYRTLG +SKA SATVLAFGSLFTAPY+GSELYIDIH V
Sbjct: 241 FQPNEQLKKKKKVSYVRRRRDVYRTLGRDSKAGSATVLAFGSLFTAPYRGSELYIDIHGV 300

Query: 301 SGDQRISSLMKNIEYLPFVPEILSAGKEYVDKIIKAPFLCAQLRLLDGQFKHHWKATFQG 360
           S DQRISSLMKNIEYLPFVPEILSAGKEY+DKIIKAPFLCAQLRLLDGQFK+HWKATF  
Sbjct: 301 SKDQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLA 360

Query: 361 LKQKLDSILENANEPIHVFVMTDLPESNWTGSYLGDLASDSNHFKLFFLKEHDELVLRAS 420
           L+QKLDSILENANEPIHVFVMTDLP+SNWTGSYLGDL SDSNHFKLFFL+E DELVLRAS
Sbjct: 361 LQQKLDSILENANEPIHVFVMTDLPKSNWTGSYLGDLDSDSNHFKLFFLEESDELVLRAS 420

Query: 421 KKVMAVGHGLRWTSSAFGPSRIRDMKKKCASERLPDVLLYVEETVCSCASLGFVGTAGST 462
           KKVMAVGHGLRWTS+AFGP  IRDMKKKCASE+LPDVLLY+EETVCSCASLGFVGTAGST
Sbjct: 421 KKVMAVGHGLRWTSNAFGPGSIRDMKKKCASEKLPDVLLYIEETVCSCASLGFVGTAGST 480

BLAST of ClCG01G004250 vs. TrEMBL
Match: A0A061G4F7_THECC (O-fucosyltransferase family protein, putative isoform 2 OS=Theobroma cacao GN=TCM_015927 PE=4 SV=1)

HSP 1 Score: 581.3 bits (1497), Expect = 1.1e-162
Identity = 304/493 (61.66%), Positives = 363/493 (73.63%), Query Frame = 1

Query: 14  WNSKKS--NLQLRR---FSLSVIVL-LFCSLFLLYLSSSSFMSSTAFSTSNSR------Q 73
           WN KKS    Q RR   F LS+ +L L    FL Y+S    + ST+  T N+        
Sbjct: 14  WNKKKSLQQQQPRRSPFFFLSLSLLSLLVFFFLTYISIPKSLFSTSSKTVNAALSPQYPH 73

Query: 74  CNPQILDLGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVALGSCPK 133
           C  QI   GEKFL+YAPHSGFSNQLSEFKNAILMAGILNRTL+VPPILDHHAV LGSCPK
Sbjct: 74  CTTQIP--GEKFLWYAPHSGFSNQLSEFKNAILMAGILNRTLIVPPILDHHAVVLGSCPK 133

Query: 134 FRVPDPGEIRFSVWEHMLELLRIGR-------------------------TFAYLWCGVH 193
           FRV    EIR SVW+H+ EL+R  R                          F  LWCG++
Sbjct: 134 FRVQSAKEIRLSVWDHINELIRSERYVSMADIIDISSLLSSSLVRAIDFRVFVSLWCGLN 193

Query: 194 LESVCSNEYN--------LKHCGRLLAGLDGNVDKCLHAVDEDCRTTVWSYQNGEVDGAL 253
           ++ VCSNE N        L+ CG LL+G+DGN+D+CL AVDEDCRTTVW+YQN EVDG L
Sbjct: 194 MDLVCSNELNAQQSMVGSLRQCGSLLSGIDGNIDRCLFAVDEDCRTTVWTYQNDEVDGVL 253

Query: 254 DLFQPNEQLKKKKKVSYVRRRRDVYRTLGPNSKAESATVLAFGSLFTAPYKGSELYIDIH 313
           D FQP+EQLK KKK+SYVRRRR+VY+TLGP S+AESATVLAFGSLFTAPYKGS+LYIDI 
Sbjct: 254 DSFQPDEQLKNKKKISYVRRRRNVYKTLGPGSEAESATVLAFGSLFTAPYKGSDLYIDIQ 313

Query: 314 EVSGDQRISSLMKNIEYLPFVPEILSAGKEYVDKIIKAPFLCAQLRLLDGQFKHHWKATF 373
           +  GD +I SL+K IE+LPFVPEI+S+GK++  + IKAPFLCAQLRLLDGQFK+HWKATF
Sbjct: 314 KAPGDLKIKSLIKKIEFLPFVPEIISSGKQFAMQSIKAPFLCAQLRLLDGQFKNHWKATF 373

Query: 374 QGLKQKLDSILENANEPIHVFVMTDLPESNWTGSYLGDLASDSNHFKLFFLKEHDELVLR 433
            GLKQKLDS+ +  + PIH+FVMTDLP+ NWTGSYLGDLA DS +FKL+FL+E D  V++
Sbjct: 374 LGLKQKLDSLRQAGSRPIHIFVMTDLPQGNWTGSYLGDLARDSANFKLYFLRE-DLFVMK 433

Query: 434 ASKKVMAVGHGLRWTSSAFGPSRIRDMKKKCASERLPDVLLYVEETVCSCASLGFVGTAG 462
            +KK+   GHGLR+ S       +  ++K C+ + +PDVLLY+EETVCSCASLGFVGTAG
Sbjct: 434 TAKKLALAGHGLRFESVPASLDAVAKLEKHCSPDIVPDVLLYIEETVCSCASLGFVGTAG 493

BLAST of ClCG01G004250 vs. TrEMBL
Match: B9GYL4_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0003s07710g PE=4 SV=2)

HSP 1 Score: 577.4 bits (1487), Expect = 1.6e-161
Identity = 295/485 (60.82%), Positives = 358/485 (73.81%), Query Frame = 1

Query: 14  WNSKKSNLQLRR-FSLSVIVLLFCSLFLLY----LSSSSFMSSTAFSTSNSRQCNP-QIL 73
           W  KK    L+   SL +I+ LF  +FL      ++ +S  S T  +     QC   Q L
Sbjct: 18  WIKKKQTQPLKSPLSLLLILSLFLFIFLFISFFKITPNSLFSKTITNNPLISQCTKFQTL 77

Query: 74  DLGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVALGSCPKFRVPDP 133
            LGEKFL+YAPHSGFSNQLSEFKN ILMAGILNRTL+VPP+LDHHAVALGSCPKFRV  P
Sbjct: 78  ALGEKFLWYAPHSGFSNQLSEFKNGILMAGILNRTLIVPPVLDHHAVALGSCPKFRVLGP 137

Query: 134 GEIRFSVWEHMLELLRIGR------------------------TFAYLWCGVHLESVCSN 193
            EIR SVW+H+L+L++ GR                         FA  WC V ++  CSN
Sbjct: 138 KEIRVSVWDHVLDLVKTGRYVSMADIIDISSLVPSSIQAIDFRVFASQWCNVKMDFTCSN 197

Query: 194 EYN--------LKHCGRLLAGLDGNVDKCLHAVDEDCRTTVWSYQNGEVDGALDLFQPNE 253
           + N        L  CG +L+G+DGNVDKCL+AVDEDCRTTVW+Y+NG+ D   D FQP+E
Sbjct: 198 DLNAQSSLFDSLNLCGSILSGIDGNVDKCLYAVDEDCRTTVWTYKNGDEDRVFDSFQPDE 257

Query: 254 QLKKKKKVSYVRRRRDVYRTLGPNSKAESATVLAFGSLFTAPYKGSELYIDIHEVSGDQR 313
           QLKKKKK+SYVRRR+DVY++LGP S+A SATVLAFGSLFTAPYKGSEL+IDIHE   DQR
Sbjct: 258 QLKKKKKISYVRRRQDVYKSLGPGSEAGSATVLAFGSLFTAPYKGSELHIDIHEARRDQR 317

Query: 314 ISSLMKNIEYLPFVPEILSAGKEYVDKIIKAPFLCAQLRLLDGQFKHHWKATFQGLKQKL 373
           I SL+ N E+LPFVPEIL+AGK++  + IKAPFLCAQLRLLDGQFK+HWKATFQGLKQKL
Sbjct: 318 IQSLIDNSEFLPFVPEILNAGKKFALETIKAPFLCAQLRLLDGQFKNHWKATFQGLKQKL 377

Query: 374 DSILENANEPIHVFVMTDLPESNWTGSYLGDLASDSNHFKLFFLKEHDELVLRASKKVMA 433
           + + ++ ++PIH+FVMTDLP+ NWTGS+LGD+AS+ NHFKL+FL+E DELV + +K +  
Sbjct: 378 EVLKQSGSKPIHIFVMTDLPQGNWTGSFLGDMASEVNHFKLYFLREEDELVKKTAKNLAV 437

Query: 434 VGHGLRWTSSAFGPSRIRDMKKKCASERLPDVLLYVEETVCSCASLGFVGTAGSTIAESI 461
            GHGLR+ S     +    MK  C  +RL D+LLY+E++VCSCASLGFVGTAGSTIAESI
Sbjct: 438 AGHGLRFGSVPRSHNGESKMKMNCPHQRLIDILLYIEKSVCSCASLGFVGTAGSTIAESI 497

BLAST of ClCG01G004250 vs. TrEMBL
Match: A0A0D2QV13_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_003G175200 PE=4 SV=1)

HSP 1 Score: 575.5 bits (1482), Expect = 6.0e-161
Identity = 298/498 (59.84%), Positives = 361/498 (72.49%), Query Frame = 1

Query: 14  WNSKKSNLQLRR---FSLSVIVLLFCSLFLLYLSS---SSFMSSTAFSTSNSRQC-NPQI 73
           W  KKS  Q RR   F LS+ +L    LF L  +S   S F SS++ +T+ S Q  + +I
Sbjct: 14  WGKKKS--QQRRSPFFFLSLFLLSLIFLFFLTFTSIPKSLFSSSSSKTTALSLQFPHCEI 73

Query: 74  LDLGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVALGSCPKFRVPD 133
              GEKFL+YAPHSGFSNQLSEFKNA+LMAGILNRTL++PPIL HHA+ALGSCPKFRV  
Sbjct: 74  RISGEKFLWYAPHSGFSNQLSEFKNALLMAGILNRTLIIPPILSHHAIALGSCPKFRVQS 133

Query: 134 PGEIRFSVWEHMLELLRIGR-------------------------TFAYLWCGVHLESVC 193
           P EIR SVW+H++EL+  GR                          F   WCG+ L+  C
Sbjct: 134 PKEIRVSVWDHVIELITSGRYVSMADIIDISSVLSSSHVRAIDFRVFVSSWCGLDLDLAC 193

Query: 194 SNEYN---------LKHCGRLLAGLDGNVDKCLHAVDEDCRTTVWSYQNGEVDGALDLFQ 253
           S E N         LK CG LL+G+DGN+D+CL AVD+DCRTTVW+Y N E DGALD FQ
Sbjct: 194 SKEPNTQPTYLVDSLKQCGSLLSGVDGNIDRCLFAVDDDCRTTVWTYGNYEADGALDSFQ 253

Query: 254 PNEQLKKKKKVSYVRRRRDVYRTLGPNSKAESATVLAFGSLFTAPYKGSELYIDIHEVSG 313
           PNEQLKKKKK+SYVRRRRDVY+TLGP SKA+SATVLAFG+LFTAPYKGSELYIDI +   
Sbjct: 254 PNEQLKKKKKISYVRRRRDVYKTLGPGSKADSATVLAFGTLFTAPYKGSELYIDIQKAPR 313

Query: 314 DQRISSLMKNIEYLPFVPEILSAGKEYVDKIIKAPFLCAQLRLLDGQFKHHWKATFQGLK 373
           D  I SL+K I++LPFVPEI+SAGK++  +I+KAPFLCAQLRLLDGQFK+HW+ATF GLK
Sbjct: 314 DSNIQSLIKKIKFLPFVPEIISAGKQFAVQIVKAPFLCAQLRLLDGQFKNHWEATFSGLK 373

Query: 374 QKLDSILENANEPIHVFVMTDLPESNWTGSYLGDLASDSNHFKLFFLKEHDELVLRASKK 433
           QKLDS+ +  + PIHVFVMTDLP  NWTG+YLGDLA DS +FKL+F+ E D LV+  +KK
Sbjct: 374 QKLDSLSQTVSRPIHVFVMTDLPRGNWTGNYLGDLAKDSTNFKLYFMNEEDSLVMETAKK 433

Query: 434 VMAVGHGLRWTSSAFG---PSRIRDMKKKCASERLPDVLLYVEETVCSCASLGFVGTAGS 468
           +   GHGLR+ SS  G      +  ++K CA   LPD+LL++EET+CSC SLGF GTAGS
Sbjct: 434 LALAGHGLRFGSSLGGIESTDTVAKLQKHCAPHILPDILLFIEETICSCGSLGFFGTAGS 493

BLAST of ClCG01G004250 vs. TrEMBL
Match: B9S5G7_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0975840 PE=4 SV=1)

HSP 1 Score: 564.7 bits (1454), Expect = 1.1e-157
Identity = 290/500 (58.00%), Positives = 357/500 (71.40%), Query Frame = 1

Query: 2   NISGSSRSSINRWNSKKSNLQLRRFSLSVIVLLFCSLFLLYLSSSSFMSSTAFST----- 61
           NI   S+++   W  KK++L  R    S + LL  S+F L++    F S T  S      
Sbjct: 3   NILLLSKNTTKSWTKKKTSLPYR----SPLFLLLISVFTLFIFLVFFTSYTKTSKPILQN 62

Query: 62  ---SNSRQCNP-QILDLGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHH 121
              S   QC+  Q L  GEKFL+YAPHSGFSNQLSEFKNAILMAGILNRTL+VPPILDHH
Sbjct: 63  TLDSQISQCSRFQSLTGGEKFLWYAPHSGFSNQLSEFKNAILMAGILNRTLIVPPILDHH 122

Query: 122 AVALGSCPKFRVPDPGEIRFSVWEHMLELLRIGR------------------------TF 181
           AVALGSCPK RV  P +IR SVW H +EL++ GR                         F
Sbjct: 123 AVALGSCPKLRVLGPKDIRISVWNHAIELVKTGRYVSMVDIIDISSLVPSSIRAIDFRVF 182

Query: 182 AYLWCGVHLESVCSNEYN--------LKHCGRLLAGLDGNVDKCLHAVDEDCRTTVWSYQ 241
           A LWCGV+ + +C+N  N        L  CG +L+G  GN+ KCL+AV EDCRTTVW+Y+
Sbjct: 183 ASLWCGVNKDFICTNNLNAESSLFDSLGQCGSVLSGFTGNIGKCLYAVVEDCRTTVWTYK 242

Query: 242 NGEVDGALDLFQPNEQLKKKKKVSYVRRRRDVYRTLGPNSKAESATVLAFGSLFTAPYKG 301
           NGE DG LD FQP+EQLKKKK +SY+RR +DVY+ LG  S++ESA+VLAFGSLFTAPYKG
Sbjct: 243 NGEKDGVLDSFQPDEQLKKKKNISYIRRHQDVYKVLGTGSESESASVLAFGSLFTAPYKG 302

Query: 302 SELYIDIHEVSGDQRISSLMKNIEYLPFVPEILSAGKEYVDKIIKAPFLCAQLRLLDGQF 361
           SELYIDIHE   DQRI SL+K  ++LPFVPE+L+AG+++  + IKAPFLCAQLRLLDGQF
Sbjct: 303 SELYIDIHEAQRDQRIQSLIKKSQFLPFVPELLNAGRKFALETIKAPFLCAQLRLLDGQF 362

Query: 362 KHHWKATFQGLKQKLDSILENANEPIHVFVMTDLPESNWTGSYLGDLASDSNHFKLFFLK 421
           K+HWK TF GLKQKL+++ ++  +PIH+FVMTDLP+ NWTGSYLGDLA D+ HFKL FL+
Sbjct: 363 KNHWKTTFLGLKQKLETLKQSGPQPIHIFVMTDLPQGNWTGSYLGDLADDTKHFKLHFLR 422

Query: 422 EHDELVLRASKKVMAVGHGLRWTSSAFGPSRIRDMKKKCASERLPDVLLYVEETVCSCAS 461
           E D+LV++ +KK+    HGLR  S     + +  MK  C+ ++LPD+LLYVEE+VC+CAS
Sbjct: 423 EDDDLVIQTAKKLATAEHGLRLGSLPISLNGVSKMKMHCSHQKLPDILLYVEESVCACAS 482

BLAST of ClCG01G004250 vs. TAIR10
Match: AT4G17430.1 (AT4G17430.1 O-fucosyltransferase family protein)

HSP 1 Score: 511.9 bits (1317), Expect = 4.1e-145
Identity = 274/502 (54.58%), Positives = 343/502 (68.33%), Query Frame = 1

Query: 2   NISGSSRSSINRWNSKKSNLQLRRF---SLSVIVLLFCSLFLLYLSS--SSFMSSTAFST 61
           N    SR S   W ++K           S+S++V+ F  +F +  S    S  S +AFS 
Sbjct: 3   NFFNPSRPSPRPWPNRKKQTDKSAIFLCSVSILVVFFIVVFFITYSEMPKSLFSISAFSG 62

Query: 62  S-NSRQCNPQILD---LGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHH 121
           S    QC  +IL    LG+KFL+YAPHSGFSNQLSEFKNA+LMAGILNRTL++PPILDHH
Sbjct: 63  SVQFPQCRSEILTRTLLGQKFLWYAPHSGFSNQLSEFKNALLMAGILNRTLIIPPILDHH 122

Query: 122 AVALGSCPKFRVPDPGEIRFSVWEHMLELLRIGRT------------------------- 181
           AVALGSCPKFRV  P EIR SVW H +ELL+  R                          
Sbjct: 123 AVALGSCPKFRVLSPSEIRISVWNHSIELLKTDRYVSMADIVDISSLVSSSAVRVIDFRY 182

Query: 182 FAYLWCGVHLESVCSNEY--------NLKHCGRLLAGLDGNVDKCLHAVDEDCRTTVWSY 241
           FA L CGV LE++C+++         +LK CG LL+G+ GNVDKCL+AVDEDCRTTVW+Y
Sbjct: 183 FASLQCGVDLETLCTDDLAEQSQAYESLKQCGYLLSGVRGNVDKCLYAVDEDCRTTVWTY 242

Query: 242 QNGEVDGALDLFQPNEQLKKKKKVSYVRRRRDVYRTLGPNSKAESATVLAFGSLFTAPYK 301
           +NGE DG LD FQP+E+LKKKKK+S VRRRRDVY+TLG  ++AESA +LAFGSLFTAPYK
Sbjct: 243 KNGEADGRLDSFQPDEKLKKKKKLSNVRRRRDVYKTLGHGTEAESAAILAFGSLFTAPYK 302

Query: 302 GSELYIDIHEVSGDQRISSLMKNIEYLPFVPEILSAGKEYVDKIIKAPFLCAQLRLLDGQ 361
           GSELYIDIH+     +I SL++ +++LPFV EI+ AGK++  + IKAPFLCAQLRLLDGQ
Sbjct: 303 GSELYIDIHK---SPKIKSLVEKVDFLPFVREIMIAGKKFASETIKAPFLCAQLRLLDGQ 362

Query: 362 FKHHWKATFQGLKQKLDSILENANEPIHVFVMTDLPESNWTGSYLGDLASDSNHFKLFFL 421
           FK+H ++TF GL QKL+++       I+VFVMTDLPE NWTG+YLGDL+ +S +FKL F+
Sbjct: 363 FKNHRESTFTGLYQKLEALSVKNPGLINVFVMTDLPEFNWTGTYLGDLSKNSTNFKLHFI 422

Query: 422 KEHDELVLRASKKVMAVGHGLRWTSSAFGPSRIRDMKKKCASERLPDVLLYVEETVCSCA 462
            E DE + R   ++ +  HG ++ S       I+ M+  C      +V LY+EE VCSCA
Sbjct: 423 GEQDEFLARTEHELDSASHGQKFGSIPMSLDSIKKMQTHCYPHGGSNVQLYIEEAVCSCA 482

BLAST of ClCG01G004250 vs. NCBI nr
Match: gi|659073413|ref|XP_008437048.1| (PREDICTED: uncharacterized protein LOC103482591 isoform X1 [Cucumis melo])

HSP 1 Score: 839.0 bits (2166), Expect = 4.2e-240
Identity = 425/491 (86.56%), Positives = 442/491 (90.02%), Query Frame = 1

Query: 1   MNISGSSRSSINRWNSKKSNLQLRRFSLSVIVLLFCSLFLLYLSSS----SFMSSTAFST 60
           MN+ GS+RSSINRWNSKK NLQLRRFSLSV VLLFC  FLLYLSSS    +F+SSTAFST
Sbjct: 1   MNVFGSTRSSINRWNSKKPNLQLRRFSLSVFVLLFCFFFLLYLSSSFSSSTFISSTAFST 60

Query: 61  SNSRQCNPQILDLGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL 120
           SNSRQCN QIL LGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL
Sbjct: 61  SNSRQCNTQILGLGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL 120

Query: 121 GSCPKFRVPDPGEIRFSVWEHMLELLRIGR-------------------------TFAYL 180
           GSCPKFRVPDPGEIRFSVWEHML+LLR GR                         TFAYL
Sbjct: 121 GSCPKFRVPDPGEIRFSVWEHMLQLLRNGRYVSMTDIVDISSLTSYSSVKAIDFRTFAYL 180

Query: 181 WCGVHLESVCSNEYN-LKHCGRLLAGLDGNVDKCLHAVDEDCRTTVWSYQNGEVDGALDL 240
           WCGVHLESVCSNEYN LK CGRLLAGLDGNVDKCLHAVDEDC+TTVW+YQ+ EVDGALDL
Sbjct: 181 WCGVHLESVCSNEYNNLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQSNEVDGALDL 240

Query: 241 FQPNEQLKKKKKVSYVRRRRDVYRTLGPNSKAESATVLAFGSLFTAPYKGSELYIDIHEV 300
           FQPNEQLKKKKKVSYVRRRRDVYRTLGP+SKA SATVLAFGSLFTAPYKGSELYIDIH V
Sbjct: 241 FQPNEQLKKKKKVSYVRRRRDVYRTLGPDSKAGSATVLAFGSLFTAPYKGSELYIDIHGV 300

Query: 301 SGDQRISSLMKNIEYLPFVPEILSAGKEYVDKIIKAPFLCAQLRLLDGQFKHHWKATFQG 360
           S DQRISSLMKNIEYLPFVPEILSAGKEY+DKIIKAPFLCAQLRLLDGQFK+HWKATF  
Sbjct: 301 SKDQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLA 360

Query: 361 LKQKLDSILENANEPIHVFVMTDLPESNWTGSYLGDLASDSNHFKLFFLKEHDELVLRAS 420
           L+QKL+SILENANEPI VFVMTDLPESNWTGSYLGDL SDSNHFKLFFLKEHDELVLRAS
Sbjct: 361 LQQKLNSILENANEPIRVFVMTDLPESNWTGSYLGDLDSDSNHFKLFFLKEHDELVLRAS 420

Query: 421 KKVMAVGHGLRWTSSAFGPSRIRDMKKKCASERLPDVLLYVEETVCSCASLGFVGTAGST 462
           KKVMAVGHGLRWTS+AFGP RIR+MKK+CA ERLPDVLLY+EETVCSCASLGFVGTAGST
Sbjct: 421 KKVMAVGHGLRWTSNAFGPGRIRNMKKECAPERLPDVLLYIEETVCSCASLGFVGTAGST 480

BLAST of ClCG01G004250 vs. NCBI nr
Match: gi|449469430|ref|XP_004152423.1| (PREDICTED: uncharacterized protein LOC101209896 [Cucumis sativus])

HSP 1 Score: 827.8 bits (2137), Expect = 9.6e-237
Identity = 422/491 (85.95%), Positives = 437/491 (89.00%), Query Frame = 1

Query: 1   MNISGSSRSSINRWNSKKSNLQLRRFSLSVIVLLFCSLFLLYLSSS----SFMSSTAFST 60
           MN  GS+RSSINRWNSKK NLQL R SLSV  LLFC LFLLYLSSS    SFMSSTAFST
Sbjct: 1   MNAFGSTRSSINRWNSKKPNLQLPRISLSVCALLFCFLFLLYLSSSFSSSSFMSSTAFST 60

Query: 61  SNSRQCNPQILDLGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL 120
           SNSRQCN QIL LGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL
Sbjct: 61  SNSRQCNTQILALGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVAL 120

Query: 121 GSCPKFRVPDPGEIRFSVWEHMLELLRIGR-------------------------TFAYL 180
           GSCPKFRVPDPGEIRFSVWEHML+LLR GR                         TFAYL
Sbjct: 121 GSCPKFRVPDPGEIRFSVWEHMLQLLRNGRYVSMADIVDISSLTSYSSVKAIDFRTFAYL 180

Query: 181 WCGVHLESVCSNEYN-LKHCGRLLAGLDGNVDKCLHAVDEDCRTTVWSYQNGEVDGALDL 240
           WCGV LESVC+NEYN LK CGRLLAGLDGNVDKCLHAVDEDC+TTVW+YQN EVDGALDL
Sbjct: 181 WCGVRLESVCANEYNNLKQCGRLLAGLDGNVDKCLHAVDEDCKTTVWTYQNNEVDGALDL 240

Query: 241 FQPNEQLKKKKKVSYVRRRRDVYRTLGPNSKAESATVLAFGSLFTAPYKGSELYIDIHEV 300
           FQPNEQLKKKKKVSYVRRRRDVYRTLG +SKA SATVLAFGSLFTAPY+GSELYIDIH V
Sbjct: 241 FQPNEQLKKKKKVSYVRRRRDVYRTLGRDSKAGSATVLAFGSLFTAPYRGSELYIDIHGV 300

Query: 301 SGDQRISSLMKNIEYLPFVPEILSAGKEYVDKIIKAPFLCAQLRLLDGQFKHHWKATFQG 360
           S DQRISSLMKNIEYLPFVPEILSAGKEY+DKIIKAPFLCAQLRLLDGQFK+HWKATF  
Sbjct: 301 SKDQRISSLMKNIEYLPFVPEILSAGKEYIDKIIKAPFLCAQLRLLDGQFKNHWKATFLA 360

Query: 361 LKQKLDSILENANEPIHVFVMTDLPESNWTGSYLGDLASDSNHFKLFFLKEHDELVLRAS 420
           L+QKLDSILENANEPIHVFVMTDLP+SNWTGSYLGDL SDSNHFKLFFL+E DELVLRAS
Sbjct: 361 LQQKLDSILENANEPIHVFVMTDLPKSNWTGSYLGDLDSDSNHFKLFFLEESDELVLRAS 420

Query: 421 KKVMAVGHGLRWTSSAFGPSRIRDMKKKCASERLPDVLLYVEETVCSCASLGFVGTAGST 462
           KKVMAVGHGLRWTS+AFGP  IRDMKKKCASE+LPDVLLY+EETVCSCASLGFVGTAGST
Sbjct: 421 KKVMAVGHGLRWTSNAFGPGSIRDMKKKCASEKLPDVLLYIEETVCSCASLGFVGTAGST 480

BLAST of ClCG01G004250 vs. NCBI nr
Match: gi|743864454|ref|XP_011031926.1| (PREDICTED: uncharacterized protein LOC105130896 [Populus euphratica])

HSP 1 Score: 582.0 bits (1499), Expect = 9.2e-163
Identity = 296/485 (61.03%), Positives = 360/485 (74.23%), Query Frame = 1

Query: 14  WNSKKSNLQLRR-FSLSVIVLLFCSLFLLY----LSSSSFMSSTAFSTSNSRQCNP-QIL 73
           W  KK   +L+   SL +I+ LF  +FL      ++ +S  S T        QC   Q L
Sbjct: 18  WIKKKQTQRLKSPLSLLLILSLFLFIFLFISFFKITPNSLFSKTIRDNPLISQCTRFQTL 77

Query: 74  DLGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVALGSCPKFRVPDP 133
            LGEKFL+YAPHSGFSNQLSEFKN ILMAGILNRTL+VPP+LDHHAVALGSCPKFRV  P
Sbjct: 78  ALGEKFLWYAPHSGFSNQLSEFKNGILMAGILNRTLIVPPVLDHHAVALGSCPKFRVLGP 137

Query: 134 GEIRFSVWEHMLELLRIGR------------------------TFAYLWCGVHLESVCSN 193
            EIR SVW+H+L+L++ GR                         FA LWC V+++  CSN
Sbjct: 138 KEIRISVWDHVLDLVKTGRYVSMADIIDISSLVPSSIQAIDFRVFASLWCNVNMDFTCSN 197

Query: 194 EYN--------LKHCGRLLAGLDGNVDKCLHAVDEDCRTTVWSYQNGEVDGALDLFQPNE 253
           + N        L  CG +L+G+DGNVDKCL+AVDEDCRTTVW+Y+NG+ D   D FQP+E
Sbjct: 198 DLNSQSSLFDSLNLCGSILSGIDGNVDKCLYAVDEDCRTTVWTYKNGDEDRVFDSFQPDE 257

Query: 254 QLKKKKKVSYVRRRRDVYRTLGPNSKAESATVLAFGSLFTAPYKGSELYIDIHEVSGDQR 313
           QLKKKKK+SYVRRR+DVY++LGP S+A SATVLAFGSLFTAPYKGSELYIDIHE   DQR
Sbjct: 258 QLKKKKKISYVRRRQDVYKSLGPGSEAGSATVLAFGSLFTAPYKGSELYIDIHEARRDQR 317

Query: 314 ISSLMKNIEYLPFVPEILSAGKEYVDKIIKAPFLCAQLRLLDGQFKHHWKATFQGLKQKL 373
           I SL+ N E+LPFVPEIL+AGK++  + IKAPFLCAQLRLLDGQFK+HWKATFQGLKQKL
Sbjct: 318 IQSLIGNSEFLPFVPEILNAGKKFALETIKAPFLCAQLRLLDGQFKNHWKATFQGLKQKL 377

Query: 374 DSILENANEPIHVFVMTDLPESNWTGSYLGDLASDSNHFKLFFLKEHDELVLRASKKVMA 433
           + + ++ ++P+H+FVMTDLP+ NWTGS+LGD+AS+ NHFKL FL+E DELV + +K +  
Sbjct: 378 EVLKQSGSKPVHIFVMTDLPQGNWTGSFLGDMASEVNHFKLHFLREEDELVKKTAKNLAV 437

Query: 434 VGHGLRWTSSAFGPSRIRDMKKKCASERLPDVLLYVEETVCSCASLGFVGTAGSTIAESI 461
            GHGLR+       +    MK  C+ +RLPD+LLY+E++VCSCASLGFVGTAGSTIAESI
Sbjct: 438 AGHGLRFGPVPRSLNGESKMKMNCSYQRLPDILLYIEKSVCSCASLGFVGTAGSTIAESI 497

BLAST of ClCG01G004250 vs. NCBI nr
Match: gi|590676583|ref|XP_007039777.1| (O-fucosyltransferase family protein, putative isoform 2 [Theobroma cacao])

HSP 1 Score: 581.3 bits (1497), Expect = 1.6e-162
Identity = 304/493 (61.66%), Positives = 363/493 (73.63%), Query Frame = 1

Query: 14  WNSKKS--NLQLRR---FSLSVIVL-LFCSLFLLYLSSSSFMSSTAFSTSNSR------Q 73
           WN KKS    Q RR   F LS+ +L L    FL Y+S    + ST+  T N+        
Sbjct: 14  WNKKKSLQQQQPRRSPFFFLSLSLLSLLVFFFLTYISIPKSLFSTSSKTVNAALSPQYPH 73

Query: 74  CNPQILDLGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVALGSCPK 133
           C  QI   GEKFL+YAPHSGFSNQLSEFKNAILMAGILNRTL+VPPILDHHAV LGSCPK
Sbjct: 74  CTTQIP--GEKFLWYAPHSGFSNQLSEFKNAILMAGILNRTLIVPPILDHHAVVLGSCPK 133

Query: 134 FRVPDPGEIRFSVWEHMLELLRIGR-------------------------TFAYLWCGVH 193
           FRV    EIR SVW+H+ EL+R  R                          F  LWCG++
Sbjct: 134 FRVQSAKEIRLSVWDHINELIRSERYVSMADIIDISSLLSSSLVRAIDFRVFVSLWCGLN 193

Query: 194 LESVCSNEYN--------LKHCGRLLAGLDGNVDKCLHAVDEDCRTTVWSYQNGEVDGAL 253
           ++ VCSNE N        L+ CG LL+G+DGN+D+CL AVDEDCRTTVW+YQN EVDG L
Sbjct: 194 MDLVCSNELNAQQSMVGSLRQCGSLLSGIDGNIDRCLFAVDEDCRTTVWTYQNDEVDGVL 253

Query: 254 DLFQPNEQLKKKKKVSYVRRRRDVYRTLGPNSKAESATVLAFGSLFTAPYKGSELYIDIH 313
           D FQP+EQLK KKK+SYVRRRR+VY+TLGP S+AESATVLAFGSLFTAPYKGS+LYIDI 
Sbjct: 254 DSFQPDEQLKNKKKISYVRRRRNVYKTLGPGSEAESATVLAFGSLFTAPYKGSDLYIDIQ 313

Query: 314 EVSGDQRISSLMKNIEYLPFVPEILSAGKEYVDKIIKAPFLCAQLRLLDGQFKHHWKATF 373
           +  GD +I SL+K IE+LPFVPEI+S+GK++  + IKAPFLCAQLRLLDGQFK+HWKATF
Sbjct: 314 KAPGDLKIKSLIKKIEFLPFVPEIISSGKQFAMQSIKAPFLCAQLRLLDGQFKNHWKATF 373

Query: 374 QGLKQKLDSILENANEPIHVFVMTDLPESNWTGSYLGDLASDSNHFKLFFLKEHDELVLR 433
            GLKQKLDS+ +  + PIH+FVMTDLP+ NWTGSYLGDLA DS +FKL+FL+E D  V++
Sbjct: 374 LGLKQKLDSLRQAGSRPIHIFVMTDLPQGNWTGSYLGDLARDSANFKLYFLRE-DLFVMK 433

Query: 434 ASKKVMAVGHGLRWTSSAFGPSRIRDMKKKCASERLPDVLLYVEETVCSCASLGFVGTAG 462
            +KK+   GHGLR+ S       +  ++K C+ + +PDVLLY+EETVCSCASLGFVGTAG
Sbjct: 434 TAKKLALAGHGLRFESVPASLDAVAKLEKHCSPDIVPDVLLYIEETVCSCASLGFVGTAG 493

BLAST of ClCG01G004250 vs. NCBI nr
Match: gi|566161435|ref|XP_002304290.2| (hypothetical protein POPTR_0003s07710g [Populus trichocarpa])

HSP 1 Score: 577.4 bits (1487), Expect = 2.3e-161
Identity = 295/485 (60.82%), Positives = 358/485 (73.81%), Query Frame = 1

Query: 14  WNSKKSNLQLRR-FSLSVIVLLFCSLFLLY----LSSSSFMSSTAFSTSNSRQCNP-QIL 73
           W  KK    L+   SL +I+ LF  +FL      ++ +S  S T  +     QC   Q L
Sbjct: 18  WIKKKQTQPLKSPLSLLLILSLFLFIFLFISFFKITPNSLFSKTITNNPLISQCTKFQTL 77

Query: 74  DLGEKFLFYAPHSGFSNQLSEFKNAILMAGILNRTLVVPPILDHHAVALGSCPKFRVPDP 133
            LGEKFL+YAPHSGFSNQLSEFKN ILMAGILNRTL+VPP+LDHHAVALGSCPKFRV  P
Sbjct: 78  ALGEKFLWYAPHSGFSNQLSEFKNGILMAGILNRTLIVPPVLDHHAVALGSCPKFRVLGP 137

Query: 134 GEIRFSVWEHMLELLRIGR------------------------TFAYLWCGVHLESVCSN 193
            EIR SVW+H+L+L++ GR                         FA  WC V ++  CSN
Sbjct: 138 KEIRVSVWDHVLDLVKTGRYVSMADIIDISSLVPSSIQAIDFRVFASQWCNVKMDFTCSN 197

Query: 194 EYN--------LKHCGRLLAGLDGNVDKCLHAVDEDCRTTVWSYQNGEVDGALDLFQPNE 253
           + N        L  CG +L+G+DGNVDKCL+AVDEDCRTTVW+Y+NG+ D   D FQP+E
Sbjct: 198 DLNAQSSLFDSLNLCGSILSGIDGNVDKCLYAVDEDCRTTVWTYKNGDEDRVFDSFQPDE 257

Query: 254 QLKKKKKVSYVRRRRDVYRTLGPNSKAESATVLAFGSLFTAPYKGSELYIDIHEVSGDQR 313
           QLKKKKK+SYVRRR+DVY++LGP S+A SATVLAFGSLFTAPYKGSEL+IDIHE   DQR
Sbjct: 258 QLKKKKKISYVRRRQDVYKSLGPGSEAGSATVLAFGSLFTAPYKGSELHIDIHEARRDQR 317

Query: 314 ISSLMKNIEYLPFVPEILSAGKEYVDKIIKAPFLCAQLRLLDGQFKHHWKATFQGLKQKL 373
           I SL+ N E+LPFVPEIL+AGK++  + IKAPFLCAQLRLLDGQFK+HWKATFQGLKQKL
Sbjct: 318 IQSLIDNSEFLPFVPEILNAGKKFALETIKAPFLCAQLRLLDGQFKNHWKATFQGLKQKL 377

Query: 374 DSILENANEPIHVFVMTDLPESNWTGSYLGDLASDSNHFKLFFLKEHDELVLRASKKVMA 433
           + + ++ ++PIH+FVMTDLP+ NWTGS+LGD+AS+ NHFKL+FL+E DELV + +K +  
Sbjct: 378 EVLKQSGSKPIHIFVMTDLPQGNWTGSFLGDMASEVNHFKLYFLREEDELVKKTAKNLAV 437

Query: 434 VGHGLRWTSSAFGPSRIRDMKKKCASERLPDVLLYVEETVCSCASLGFVGTAGSTIAESI 461
            GHGLR+ S     +    MK  C  +RL D+LLY+E++VCSCASLGFVGTAGSTIAESI
Sbjct: 438 AGHGLRFGSVPRSHNGESKMKMNCPHQRLIDILLYIEKSVCSCASLGFVGTAGSTIAESI 497

The following BLAST results are available for this feature:
Match Name        E-value   Identity (%)  Description
A0A0A0KPB8_CUCSA  6.7e-237  85.95         Uncharacterized protein OS=Cucumis sativus GN=Csa_5G165220 PE=4 SV=1
A0A061G4F7_THECC  1.1e-162  61.66         O-fucosyltransferase family protein, putative isoform 2 OS=Theobroma cacao GN=TC...
B9GYL4_POPTR      1.6e-161  60.82         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0003s07710g PE=4 SV=2
A0A0D2QV13_GOSRA  6.0e-161  59.84         Uncharacterized protein OS=Gossypium raimondii GN=B456_003G175200 PE=4 SV=1
B9S5G7_RICCO      1.1e-157  58.00         Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0975840 PE=4 SV=1

Match Name   E-value   Identity (%)  Description
AT4G17430.1  4.1e-145  54.58         O-fucosyltransferase family protein

Match Name                        E-value   Identity (%)  Description
gi|659073413|ref|XP_008437048.1|  4.2e-240  86.56         PREDICTED: uncharacterized protein LOC103482591 isoform X1 [Cucumis melo]
gi|449469430|ref|XP_004152423.1|  9.6e-237  85.95         PREDICTED: uncharacterized protein LOC101209896 [Cucumis sativus]
gi|743864454|ref|XP_011031926.1|  9.2e-163  61.03         PREDICTED: uncharacterized protein LOC105130896 [Populus euphratica]
gi|590676583|ref|XP_007039777.1|  1.6e-162  61.66         O-fucosyltransferase family protein, putative isoform 2 [Theobroma cacao]
gi|566161435|ref|XP_002304290.2|  2.3e-161  60.82         hypothetical protein POPTR_0003s07710g [Populus trichocarpa]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR019378  GDP-Fuc_O-FucTrfase
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0008152 metabolic process
biological_process GO:0006004 fucose metabolic process
cellular_component GO:0005794 Golgi apparatus
cellular_component GO:0005768 endosome
cellular_component GO:0005802 trans-Golgi network
cellular_component GO:0044424 intracellular part
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0016020 membrane
molecular_function GO:0016740 transferase activity
molecular_function GO:0016874 ligase activity
molecular_function GO:0016757 transferase activity, transferring glycosyl groups
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG01G004250.1  ClCG01G004250.1  mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term   IPR Description                          Source   Source Term    Source Description   Alignment
IPR019378  GDP-fucose protein O-fucosyltransferase  PFAM     PF10250        O-FucT               coord: 73..320; score: 3.
None       No IPR available                         PANTHER  PTHR36050      FAMILY NOT NAMED     coord: 40..468; score: 4.3E
None       No IPR available                         PANTHER  PTHR36050:SF1  SUBFAMILY NOT NAMED  coord: 40..468; score: 4.3E

The following gene(s) are paralogous to this gene:

None