Cla021857 (gene) Watermelon (97103) v1

Name: Cla021857
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: UDP-glycosyltransferase 76E11 (AHRD V1 ***- U7E11_ARATH); contains InterPro domain(s) IPR002213 UDP-glucuronosyl/UDP-glucosyltransferase
Location: Chr5 : 6949694 .. 6951898 (+)
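The location field packs chromosome, start, end, and strand into one string. As a minimal sketch, the snippet below parses that string and computes the genomic span. It assumes the coordinates are 1-based and inclusive, which this record does not state; `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string like 'Chr5 : 6949694 .. 6951898 (+)'.

    Returns (chromosome, start, end, strand). Hypothetical helper for illustration.
    """
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr5 : 6949694 .. 6951898 (+)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the feature spans 2205 bp on the plus strand of Chr5.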
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGCATGGGAATTTCCTTCTAGTTTCCCAAAGCCCAAAGAGCCATCTCAACCCTACTCTCCATTTGGCCTCCACCCTCCTCTACCTTGGCTCAAAAGTCACTCTTCTTCTCACCAACTATGCCCTCAAAAACATTTCAAAACACCAACTCCCCTATGGCTTATCTCTCTCCACTTTCTCCGATGGCTTTGACGACGGCTTTACCTCCTCCGACTTCCCACGCTGGCGGGTTGAATTTGAACGCCTCGGTCGCCTTGCCCTCGTCGACCTCCTCTCCTCACAACAACAAAGCCTTCTTTCCTTCACTTGCATTGTCCACACCCTCCTCATCCCTTGGGTCACTCGAGTTGCGCTTGAGCTTCACGTGCCGACTGCGATTTTGTGGATTCAATCGGTTGCTGCCTTTGATGTGTATTACTATTACTTCAATGGCTATAGTGATGTGATTCGGAATGGTTATAAAGATGATGGCTCTAATTCTTTGTTATTTAACATTTGGCTTCCGGGTTTGCCATTGATGAATGTTTTGGACCTTCCAAGCTTTGTGGTTTCTGATGATTATCATGGGCTTATTCTCAAGTCATTTGAAGAGAAGATGCAGGTTCTTGAGGAGGAGGAGAATGTGTCAATCCTTGTTAACTCGTTCGATGCATTGGAACATGATGCCTTTACAGCAATCGGGAAGTTTAACTTGATCCCAATTGGACCTTTGGTTTCACTTCCACCTAGATTTGAAGTTTCAACCAAACAAAGAACCACTTCATATCTTCAAGGTGGTATGAAATATACAGTCTTGTTTAGGCTTCTTCTTTTAAAGGTTTTTGTTTTATTTTTGTACCAGAATTTCAAGTTTTTTTAATTTTGATGGTTAAACTTTCAAAACATCATATTTAAGTCACTAAACGTTAAGTTTCACCTCAAGTTAGTTCATGAACTTTTAAAAATTTACTTGTCCTTGAATGAAAAAGAAAAAATCGTTTAGGTCTTACCATAAAAATGAATTATGCTTTGAGAGTATGAATAATAATGATTAACTTGAACATGTACTGAGAAGCCACAATACATGTCAATTGTCAACTTACAAAATGAAATATTCTAGTAGTTGATATATACCTACTTATGTTCACACATCACACTAAATTTTTTATTGACATAATTTTTAAATGTTGTTGAACACTCCATCATTCTAAAACAATGCAACATAGCTTTGTAATGAATGATAGTAGTACAAGAAATTTGTTCTACTAGAGTGATTATTTGCAGAAAAATTGGAGACAAAATAATAGTGATGACTATAAGATGGTTTACTGATATATAAGTTCAATGGCTAAGACATTTTAAATGTCCATATATCTAAACAGTTATACTTTTCCCACCCAAAAAAAAGTTGGTAAAGGTATCCTTGTAAGTTTGGACATTAAAATAAACATATTACCTGTTAATATATCAAATTTAAATAATCTAGAGAGTTCAAAAGCTAAAATGGCTCAATTTTAAGCCTTTTCTAATTTTACCAGAGACAATGGTTGTCTTTACTTATACATGATGATCATTTCATTCTTGTTCGCAAGGTCAACAGGCTCAAGAGGATTATATCAAATGGCTTAACTCCAAATTTGATTCGTCTGTGGTCTACATAGCATTTGGGAGCATTTCAAAGCTGTCAAATAAACAAACAAAAGAGATCGTTGGTGCATTATTAGAATGCAGTTACCCATTCTTGTGGGTCCTAAGTATGGATGACATCCAAGATGAGAATTTAAGCTTATATTTTGACGATGAACTACAAGCTCAAGGGAAGATAGTGCCATGGTGCTCACAAGTAGAGGTCTTGAGCCACCGCTCCGTGGGTTGCTTTGTAACACATTGCGGATGGAACTCAACGATCGAGAGCGTGACAGCTGGAGTGCCAACGGTGGCATGGCCATTGTGGGCGGACCAAGCCACCAACGCCAAGTTGATGCAGGATGTATGGGAGCTTGGTGTGAGAGTGAAGAA
GAGTAGTGATGGTGAAGGATTGGTGGAAGGGAAGGAGATTGCAAGGTGCTTGAGAATGGTTATGGATATGGAAGACCACGGCAGAGGAAAGCAACTGAGAATTAATGCTAGGAAGTGGCAGCTCTTAGCAATGGAGGCTGCAAATGGTTCTTCTTATATGAATATTAAGGCTTTTGTAAATAAAGTTTGTTATCAAGCAAAGTGA

mRNA sequence

ATGGAGCATGGGAATTTCCTTCTAGTTTCCCAAAGCCCAAAGAGCCATCTCAACCCTACTCTCCATTTGGCCTCCACCCTCCTCTACCTTGGCTCAAAAGTCACTCTTCTTCTCACCAACTATGCCCTCAAAAACATTTCAAAACACCAACTCCCCTATGGCTTATCTCTCTCCACTTTCTCCGATGGCTTTGACGACGGCTTTACCTCCTCCGACTTCCCACGCTGGCGGGTTGAATTTGAACGCCTCGGTCGCCTTGCCCTCGTCGACCTCCTCTCCTCACAACAACAAAGCCTTCTTTCCTTCACTTGCATTGTCCACACCCTCCTCATCCCTTGGGTCACTCGAGTTGCGCTTGAGCTTCACGTGCCGACTGCGATTTTGTGGATTCAATCGGTTGCTGCCTTTGATGTGTATTACTATTACTTCAATGGCTATAGTGATGTGATTCGGAATGGTTATAAAGATGATGGCTCTAATTCTTTGTTATTTAACATTTGGCTTCCGGGTTTGCCATTGATGAATGTTTTGGACCTTCCAAGCTTTGTGGTTTCTGATGATTATCATGGGCTTATTCTCAAGTCATTTGAAGAGAAGATGCAGGTTCTTGAGGAGGAGGAGAATGTGTCAATCCTTGTTAACTCGTTCGATGCATTGGAACATGATGCCTTTACAGCAATCGGGAAGTTTAACTTGATCCCAATTGGACCTTTGGTTTCACTTCCACCTAGATTTGAAGTTTCAACCAAACAAAGAACCACTTCATATCTTCAAGGTGGTCAACAGGCTCAAGAGGATTATATCAAATGGCTTAACTCCAAATTTGATTCGTCTGTGGTCTACATAGCATTTGGGAGCATTTCAAAGCTGTCAAATAAACAAACAAAAGAGATCGTTGGTGCATTATTAGAATGCAGTTACCCATTCTTGTGGGTCCTAAGTATGGATGACATCCAAGATGAGAATTTAAGCTTATATTTTGACGATGAACTACAAGCTCAAGGGAAGATAGTGCCATGGTGCTCACAAGTAGAGGTCTTGAGCCACCGCTCCGTGGGTTGCTTTGTAACACATTGCGGATGGAACTCAACGATCGAGAGCGTGACAGCTGGAGTGCCAACGGTGGCATGGCCATTGTGGGCGGACCAAGCCACCAACGCCAAGTTGATGCAGGATGTATGGGAGCTTGGTGTGAGAGTGAAGAAGAGTAGTGATGGTGAAGGATTGGTGGAAGGGAAGGAGATTGCAAGGTGCTTGAGAATGGTTATGGATATGGAAGACCACGGCAGAGGAAAGCAACTGAGAATTAATGCTAGGAAGTGGCAGCTCTTAGCAATGGAGGCTGCAAATGGTTCTTCTTATATGAATATTAAGGCTTTTGTAAATAAAGTTTGTTATCAAGCAAAGTGA

Coding sequence (CDS)

ATGGAGCATGGGAATTTCCTTCTAGTTTCCCAAAGCCCAAAGAGCCATCTCAACCCTACTCTCCATTTGGCCTCCACCCTCCTCTACCTTGGCTCAAAAGTCACTCTTCTTCTCACCAACTATGCCCTCAAAAACATTTCAAAACACCAACTCCCCTATGGCTTATCTCTCTCCACTTTCTCCGATGGCTTTGACGACGGCTTTACCTCCTCCGACTTCCCACGCTGGCGGGTTGAATTTGAACGCCTCGGTCGCCTTGCCCTCGTCGACCTCCTCTCCTCACAACAACAAAGCCTTCTTTCCTTCACTTGCATTGTCCACACCCTCCTCATCCCTTGGGTCACTCGAGTTGCGCTTGAGCTTCACGTGCCGACTGCGATTTTGTGGATTCAATCGGTTGCTGCCTTTGATGTGTATTACTATTACTTCAATGGCTATAGTGATGTGATTCGGAATGGTTATAAAGATGATGGCTCTAATTCTTTGTTATTTAACATTTGGCTTCCGGGTTTGCCATTGATGAATGTTTTGGACCTTCCAAGCTTTGTGGTTTCTGATGATTATCATGGGCTTATTCTCAAGTCATTTGAAGAGAAGATGCAGGTTCTTGAGGAGGAGGAGAATGTGTCAATCCTTGTTAACTCGTTCGATGCATTGGAACATGATGCCTTTACAGCAATCGGGAAGTTTAACTTGATCCCAATTGGACCTTTGGTTTCACTTCCACCTAGATTTGAAGTTTCAACCAAACAAAGAACCACTTCATATCTTCAAGGTGGTCAACAGGCTCAAGAGGATTATATCAAATGGCTTAACTCCAAATTTGATTCGTCTGTGGTCTACATAGCATTTGGGAGCATTTCAAAGCTGTCAAATAAACAAACAAAAGAGATCGTTGGTGCATTATTAGAATGCAGTTACCCATTCTTGTGGGTCCTAAGTATGGATGACATCCAAGATGAGAATTTAAGCTTATATTTTGACGATGAACTACAAGCTCAAGGGAAGATAGTGCCATGGTGCTCACAAGTAGAGGTCTTGAGCCACCGCTCCGTGGGTTGCTTTGTAACACATTGCGGATGGAACTCAACGATCGAGAGCGTGACAGCTGGAGTGCCAACGGTGGCATGGCCATTGTGGGCGGACCAAGCCACCAACGCCAAGTTGATGCAGGATGTATGGGAGCTTGGTGTGAGAGTGAAGAAGAGTAGTGATGGTGAAGGATTGGTGGAAGGGAAGGAGATTGCAAGGTGCTTGAGAATGGTTATGGATATGGAAGACCACGGCAGAGGAAAGCAACTGAGAATTAATGCTAGGAAGTGGCAGCTCTTAGCAATGGAGGCTGCAAATGGTTCTTCTTATATGAATATTAAGGCTTTTGTAAATAAAGTTTGTTATCAAGCAAAGTGA

Protein sequence

MEHGNFLLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISKHQLPYGLSLSTFSDGFDDGFTSSDFPRWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLIPWVTRVALELHVPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMNVLDLPSFVVSDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPIGPLVSLPPRFEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQTKEIVGALLECSYPFLWVLSMDDIQDENLSLYFDDELQAQGKIVPWCSQVEVLSHRSVGCFVTHCGWNSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSDGEGLVEGKEIARCLRMVMDMEDHGRGKQLRINARKWQLLAMEAANGSSYMNIKAFVNKVCYQAK
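The protein sequence above is the conceptual translation of the CDS. As a rough sketch of how that relationship can be checked, the snippet below translates the first 30 nucleotides of the CDS with the standard genetic code. It assumes the standard nuclear code applies and that the CDS starts in frame at its first base; `translate` and `CODON_TABLE` are illustrative names, not taken from the source.

```python
# Standard genetic code, laid out in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS listed above.
cds_prefix = "ATGGAGCATGGGAATTTCCTTCTAGTTTCC"
peptide = translate(cds_prefix)
print(peptide)
```

The result can be compared with the first ten residues of the protein sequence; applying `translate` to the full CDS string would extend the same check to the whole protein.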
BLAST of Cla021857 vs. Swiss-Prot
Match: UGT1_GARJA (Crocetin glucosyltransferase, chloroplastic OS=Gardenia jasminoides GN=UGT75L6 PE=1 SV=1)

HSP 1 Score: 391.0 bits (1003), Expect = 1.9e-107
Identity = 213/472 (45.13%), Positives = 290/472 (61.44%), Query Frame = 1

Query: 1   MEHGNFLLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISKHQ--LPYGLSLS 60
           ++  + LL++   + H+NP L  A  LL +G +VTL  + YAL  + K     P GL+ +
Sbjct: 2   VQQRHVLLITYPAQGHINPALQFAQRLLRMGIQVTLATSVYALSRMKKSSGSTPKGLTFA 61

Query: 61  TFSDGFDDGFTSS--DFPRWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLIPWVTR 120
           TFSDG+DDGF     D   +     + G   L +++++        TC+V+TLL+PW   
Sbjct: 62  TFSDGYDDGFRPKGVDHTEYMSSLAKQGSNTLRNVINTSADQGCPVTCLVYTLLLPWAAT 121

Query: 121 VALELHVPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMNV 180
           VA E H+P+A+LWIQ VA  D+YYYYF GY D ++N      SN   ++I  PGLP M  
Sbjct: 122 VARECHIPSALLWIQPVAVMDIYYYYFRGYEDDVKNN-----SNDPTWSIQFPGLPSMKA 181

Query: 181 LDLPSFVV--SDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIP 240
            DLPSF++  SD+ +   L +F+++++ L+EEE   +LVN+FDALE  A  AI  +NLI 
Sbjct: 182 KDLPSFILPSSDNIYSFALPTFKKQLETLDEEERPKVLVNTFDALEPQALKAIESYNLIA 241

Query: 241 IGPLVSLPPRFEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQ 300
           IGPL   P  F        TS+     Q  +DY +WLNS+   SVVY++FGS+  L  +Q
Sbjct: 242 IGPLT--PSAFLDGKDPSETSFSGDLFQKSKDYKEWLNSRPAGSVVYVSFGSLLTLPKQQ 301

Query: 301 TKEIVGALLECSYPFLWVLSMDDIQDENLS---LYFDDELQAQGKIVPWCSQVEVLSHRS 360
            +EI   LL+   PFLWV+   +  +E      L   +EL+ QG IVPWCSQ+EVL+H S
Sbjct: 302 MEEIARGLLKSGRPFLWVIRAKENGEEEKEEDRLICMEELEEQGMIVPWCSQIEVLTHPS 361

Query: 361 VGCFVTHCGWNSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSDGEGLVE 420
           +GCFVTHCGWNST+E++  GVP VA+P W DQ TNAKL++DVWE GVRV  + D  G VE
Sbjct: 362 LGCFVTHCGWNSTLETLVCGVPVVAFPHWTDQGTNAKLIEDVWETGVRVVPNED--GTVE 421

Query: 421 GKEIARCLRMVMDMEDHGRGKQLRINARKWQLLAMEA--ANGSSYMNIKAFV 462
             EI RC+  VMD  D  +G +L+ NA+KW+ LA EA   +GSS  N+KAFV
Sbjct: 422 SDEIKRCIETVMD--DGEKGVELKRNAKKWKELAREAMQEDGSSDKNLKAFV 462

BLAST of Cla021857 vs. Swiss-Prot
Match: U75D1_ARATH (UDP-glycosyltransferase 75D1 OS=Arabidopsis thaliana GN=UGT75D1 PE=2 SV=2)

HSP 1 Score: 351.3 bits (900), Expect = 1.7e-95
Identity = 191/485 (39.38%), Positives = 293/485 (60.41%), Query Frame = 1

Query: 5   NFLLVSQSPKSHLNPTLHLASTLL--YLGSKVTLL--LTNYALKNISKHQLPYGLSLSTF 64
           +FL V+   + H+NP+L LA  L     G++VT    ++ Y  +  S   +P  L  +T+
Sbjct: 13  HFLFVTFPAQGHINPSLELAKRLAGTISGARVTFAASISAYNRRMFSTENVPETLIFATY 72

Query: 65  SDGFDDGFTSSDFP---------RWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLI 124
           SDG DDGF SS +           +  E  R G+  L +L+   ++    FTC+V+T+L+
Sbjct: 73  SDGHDDGFKSSAYSDKSRQDATGNFMSEMRRRGKETLTELIEDNRKQNRPFTCVVYTILL 132

Query: 125 PWVTRVALELHVPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGL 184
            WV  +A E H+P+A+LW+Q V  F ++Y+YFNGY D I      + +N+   +I LP L
Sbjct: 133 TWVAELAREFHLPSALLWVQPVTVFSIFYHYFNGYEDAI-----SEMANTPSSSIKLPSL 192

Query: 185 PLMNVLDLPSFVVSDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAI-GKF 244
           PL+ V D+PSF+VS + +  +L +F E++  L+EE N  IL+N+F  LE +A +++   F
Sbjct: 193 PLLTVRDIPSFIVSSNVYAFLLPAFREQIDSLKEEINPKILINTFQELEPEAMSSVPDNF 252

Query: 245 NLIPIGPLVSLPPRFEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKL 304
            ++P+GPL++L   F                 ++ +YI+WL++K DSSV+Y++FG+++ L
Sbjct: 253 KIVPVGPLLTLRTDFS----------------SRGEYIEWLDTKADSSVLYVSFGTLAVL 312

Query: 305 SNKQTKEIVGALLECSYPFLWVLS------MDDIQD--ENLSLYFDDELQAQGKIVPWCS 364
           S KQ  E+  AL++   PFLWV++       +D Q+  E+    F +EL   G +V WC 
Sbjct: 313 SKKQLVELCKALIQSRRPFLWVITDKSYRNKEDEQEKEEDCISSFREELDEIGMVVSWCD 372

Query: 365 QVEVLSHRSVGCFVTHCGWNSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRV-- 424
           Q  VL+HRS+GCFVTHCGWNST+ES+ +GVP VA+P W DQ  NAKL++D W+ GVRV  
Sbjct: 373 QFRVLNHRSIGCFVTHCGWNSTLESLVSGVPVVAFPQWNDQMMNAKLLEDCWKTGVRVME 432

Query: 425 KKSSDGEGLVEGKEIARCLRMVMDMEDHGRGKQLRINARKWQLLAMEAA--NGSSYMNIK 464
           KK  +G  +V+ +EI RC+  VM+     + ++ R NA +W+ LA EA    GSS+ ++K
Sbjct: 433 KKEEEGVVVVDSEEIRRCIEEVME----DKAEEFRGNATRWKDLAAEAVREGGSSFNHLK 472

BLAST of Cla021857 vs. Swiss-Prot
Match: 5GT_VERHY (Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase OS=Verbena hybrida GN=HGT8 PE=2 SV=1)

HSP 1 Score: 326.6 bits (836), Expect = 4.4e-88
Identity = 195/477 (40.88%), Positives = 282/477 (59.12%), Query Frame = 1

Query: 1   MEHGNFLLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISKHQLPYG--LSLS 60
           M   + LL +   + H+NP L  A  L     +VT   + YA + +S+        ++  
Sbjct: 1   MSRAHVLLATFPAQGHINPALQFAKRLANADIQVTFFTSVYAWRRMSRTAAGSNGLINFV 60

Query: 61  TFSDGFDDGFTSSDFPR-WRVEFERLGRLALVDLLSSQQ--QSLLSFTCIVHTLLIPWVT 120
           +FSDG+DDG    D  + +  E +  G  AL D L++    Q     T +V++ L  W  
Sbjct: 61  SFSDGYDDGLQPGDDGKNYMSEMKSRGIKALSDTLAANNVDQKSSKITFVVYSHLFAWAA 120

Query: 121 RVALELHVPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPG-LPLM 180
           +VA E H+ +A+LWI+     D++Y+YFNGYSD I     D GS++    I LPG LP++
Sbjct: 121 KVAREFHLRSALLWIEPATVLDIFYFYFNGYSDEI-----DAGSDA----IHLPGGLPVL 180

Query: 181 NVLDLPSFVVSDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIP 240
              DLPSF++    H       +EK++ LE EE   +LVNSFDALE DA  AI K+ +I 
Sbjct: 181 AQRDLPSFLLPST-HERFRSLMKEKLETLEGEEKPKVLVNSFDALEPDALKAIDKYEMIA 240

Query: 241 IGPLVSLPPRFEVSTKQRTTSYLQGGQQAQ-----EDYIKWLNSKFDSSVVYIAFGSISK 300
           IGPL+  P  F         S+  GG   +     +D ++WL++   SSVVY++FGS   
Sbjct: 241 IGPLI--PSAFLDGKDPSDRSF--GGDLFEKGSNDDDCLEWLSTNPRSSVVYVSFGSFVN 300

Query: 301 LSNKQTKEIVGALLECSYPFLWVLSMDDIQDENLSLYFDDELQAQGKIVPWCSQVEVLSH 360
            +  Q +EI   LL+C  PFLWV+ +++ ++  +S    +EL+  GKIV WCSQ+EVL+H
Sbjct: 301 TTKSQMEEIARGLLDCGRPFLWVVRVNEGEEVLISCM--EELKRVGKIVSWCSQLEVLTH 360

Query: 361 RSVGCFVTHCGWNSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSDGEGL 420
            S+GCFVTHCGWNST+ES++ GVP VA+P W DQ TNAKLM+DVW  GVRV+ + +G  +
Sbjct: 361 PSLGCFVTHCGWNSTLESISFGVPMVAFPQWFDQGTNAKLMEDVWRTGVRVRANEEG-SV 420

Query: 421 VEGKEIARCLRMVMDMEDHGRGKQLRINARKWQLLAMEA--ANGSSYMNIKAFVNKV 465
           V+G EI RC+  VMD  +  + ++LR +A KW+ LA +A   +GSS  N+K F+++V
Sbjct: 421 VDGDEIRRCIEEVMDGGE--KSRKLRESAGKWKDLARKAMEEDGSSVNNLKVFLDEV 458

BLAST of Cla021857 vs. Swiss-Prot
Match: U75B2_ARATH (UDP-glycosyltransferase 75B2 OS=Arabidopsis thaliana GN=UGT75B2 PE=2 SV=1)

HSP 1 Score: 324.7 bits (831), Expect = 1.7e-87
Identity = 191/482 (39.63%), Positives = 274/482 (56.85%), Query Frame = 1

Query: 1   MEHGNFLLVSQSPKSHLNPTLHLASTLLYL-GSKVTLL--LTNYALKNISKHQLPYGLSL 60
           M   +FLLV+   + H+NP+L  A  L+   G++VT    L+      I  H     LS 
Sbjct: 1   MAQPHFLLVTFPAQGHVNPSLRFARRLIKTTGARVTFATCLSVIHRSMIPNHNNVENLSF 60

Query: 61  STFSDGFDDGFTSS--DFPRWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLIPWVT 120
            TFSDGFDDG  S+  D     V FER G  AL D + + Q      +C+++T+L  WV 
Sbjct: 61  LTFSDGFDDGVISNTDDVQNRLVHFERNGDKALSDFIEANQNGDSPVSCLIYTILPNWVP 120

Query: 121 RVALELHVPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMN 180
           +VA   H+P+  LWIQ   AFD+YY Y  G + V                   P LP + 
Sbjct: 121 KVARRFHLPSVHLWIQPAFAFDIYYNYSTGNNSVFE----------------FPNLPSLE 180

Query: 181 VLDLPSFVVSDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPI 240
           + DLPSF+   + +      ++E M  L+EE N  ILVN+FD+LE +  TAI    ++ +
Sbjct: 181 IRDLPSFLSPSNTNKAAQAVYQELMDFLKEESNPKILVNTFDSLEPEFLTAIPNIEMVAV 240

Query: 241 GPLVSLPPRFEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQT 300
           GPL+      E+ T   +   L    Q+   Y  WL+SK +SSV+Y++FG++ +LS KQ 
Sbjct: 241 GPLLPA----EIFTGSESGKDLSRDHQSSS-YTLWLDSKTESSVIYVSFGTMVELSKKQI 300

Query: 301 KEIVGALLECSYPFLWVLS-----------MDDIQDENLSLYFDDELQAQGKIVPWCSQV 360
           +E+  AL+E   PFLWV++            ++ + E ++  F  EL+  G IV WCSQ+
Sbjct: 301 EELARALIEGGRPFLWVITDKLNREAKIEGEEETEIEKIA-GFRHELEEVGMIVSWCSQI 360

Query: 361 EVLSHRSVGCFVTHCGWNSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSS 420
           EVL HR++GCF+THCGW+S++ES+  GVP VA+P+W+DQ  NAKL++++W+ GVRV+++S
Sbjct: 361 EVLRHRAIGCFLTHCGWSSSLESLVLGVPVVAFPMWSDQPANAKLLEEIWKTGVRVRENS 420

Query: 421 DGEGLVEGKEIARCLRMVMDMEDHGRGKQLRINARKWQLLAMEAA--NGSSYMNIKAFVN 465
             EGLVE  EI RCL  VM+     +  +LR NA KW+ LA EA    GSS  N++AFV 
Sbjct: 421 --EGLVERGEIMRCLEAVME----AKSVELRENAEKWKRLATEAGREGGSSDKNVEAFVK 454

BLAST of Cla021857 vs. Swiss-Prot
Match: 5GT1_PERFR (Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 1 OS=Perilla frutescens GN=PF3R4 PE=1 SV=1)

HSP 1 Score: 323.6 bits (828), Expect = 3.7e-87
Identity = 196/473 (41.44%), Positives = 271/473 (57.29%), Query Frame = 1

Query: 7   LLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISKHQL-----PYGLSLSTFS 66
           LL +   + H+NP L  A  LL  G+ VT   + YA + ++         P GL    FS
Sbjct: 7   LLATFPAQGHINPALQFAKRLLKAGTDVTFFTSVYAWRRMANTASAAAGNPPGLDFVAFS 66

Query: 67  DGFDDGFTS-SDFPRWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLIPWVTRVALE 126
           DG+DDG     D  R+  E +  G  AL +LL +        T +V++ L  W   VA E
Sbjct: 67  DGYDDGLKPCGDGKRYMSEMKARGSEALRNLLLNNHD----VTFVVYSHLFAWAAEVARE 126

Query: 127 LHVPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMNVLDLP 186
             VP+A+LW++      +YY+YFNGY+D I     D GS+     I LP LP +    LP
Sbjct: 127 SQVPSALLWVEPATVLCIYYFYFNGYADEI-----DAGSDE----IQLPRLPPLEQRSLP 186

Query: 187 SFVVSDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPIGPLVS 246
           +F++ +      L   +EK++ L+ EE   +LVN+FDALE DA TAI ++ LI IGPL+ 
Sbjct: 187 TFLLPETPERFRLM-MKEKLETLDGEEKAKVLVNTFDALEPDALTAIDRYELIGIGPLI- 246

Query: 247 LPPRFEVSTKQRTTSYLQGG----QQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQTK 306
            P  F        TSY  GG    +  + + ++WL++K  SSVVY++FGS+ +    Q +
Sbjct: 247 -PSAFLDGGDPSETSY--GGDLFEKSEENNCVEWLDTKPKSSVVYVSFGSVLRFPKAQME 306

Query: 307 EIVGALLECSYPFLWVL---SMDDIQDENLSLYFDDELQAQGKIVPWCSQVEVLSHRSVG 366
           EI   LL C  PFLW++     DD ++E   L    EL+  GKIV WCSQ+EVL+H ++G
Sbjct: 307 EIGKGLLACGRPFLWMIREQKNDDGEEEEEELSCIGELKKMGKIVSWCSQLEVLAHPALG 366

Query: 367 CFVTHCGWNSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSDGEGLVEGK 426
           CFVTHCGWNS +ES++ GVP VA P W DQ TNAKL++D W  GVRV+ +  G   V+G 
Sbjct: 367 CFVTHCGWNSAVESLSCGVPVVAVPQWFDQTTNAKLIEDAWGTGVRVRMNEGGG--VDGS 426

Query: 427 EIARCLRMVMDMEDHGRGKQLRINARKWQLLAMEA--ANGSSYMNIKAFVNKV 465
           EI RC+ MVMD  +  + K +R NA KW+ LA EA   +GSS  N+ AF+++V
Sbjct: 427 EIERCVEMVMDGGE--KSKLVRENAIKWKTLAREAMGEDGSSLKNLNAFLHQV 457

BLAST of Cla021857 vs. TrEMBL
Match: A0A0A0L890_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G172390 PE=4 SV=1)

HSP 1 Score: 517.7 bits (1332), Expect = 1.5e-143
Identity = 272/367 (74.11%), Positives = 292/367 (79.56%), Query Frame = 1

Query: 1   MEHGNFLLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISKHQLPYGLSLSTF 60
           M+HGNFLLVSQSP SHLNPTLH ASTLL LGSKVTLLLTN+ALKNIS+ QLP GLSLSTF
Sbjct: 1   MKHGNFLLVSQSPTSHLNPTLHFASTLLSLGSKVTLLLTNHALKNISEDQLPSGLSLSTF 60

Query: 61  SDGFDDGFTSSDFPRWRVEFERLGRLALVDLLSSQ-QQSLLSFTCIVHTLLIPWVTRVAL 120
           SDGFD+GFT SD   W VEFERLGR ALV+LLSS  +Q LL  TCIV+TLLIPWV +VA 
Sbjct: 61  SDGFDNGFTYSDLQLWFVEFERLGRAALVNLLSSSSKQGLLPITCIVNTLLIPWVAQVAR 120

Query: 121 ELHVPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMNVLDL 180
           E HV TAILW QSVA FDVYYYYFNGYS VIRNGYK+D SNSL FNI LPGLPLMNVLDL
Sbjct: 121 EFHVSTAILWTQSVAVFDVYYYYFNGYSGVIRNGYKEDDSNSLSFNISLPGLPLMNVLDL 180

Query: 181 PSFVVSDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPIGPLV 240
           PSF+VSDD+HGLI+KSFEEK+Q+L+EE+NV ILVNSFDALEHDA +AIG FNLIPIGP V
Sbjct: 181 PSFMVSDDHHGLIIKSFEEKIQILKEEDNVPILVNSFDALEHDALSAIGTFNLIPIGPSV 240

Query: 241 SLPPRFEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQTKEIV 300
            LP   E   KQR  SY Q GQQAQEDYIKWLNSK DSSVVYIAFGS SKLS +QTKE+V
Sbjct: 241 LLPLGCE---KQRNISYFQDGQQAQEDYIKWLNSKPDSSVVYIAFGSFSKLSKEQTKEMV 300

Query: 301 GALLECSYPFLWVLSMDDIQDENLSLYFDDELQAQGKIVPWCSQVEVLSHRSVGCFVTHC 360
           GALLECSY                               PWCSQVEVLSHR+VGCFVTHC
Sbjct: 301 GALLECSY-------------------------------PWCSQVEVLSHRAVGCFVTHC 333

Query: 361 GWNSTIE 367
           GWNSTIE
Sbjct: 361 GWNSTIE 333

BLAST of Cla021857 vs. TrEMBL
Match: B9HA74_POPTR (Glycosyltransferase OS=Populus trichocarpa GN=POPTR_0006s05450g PE=3 SV=1)

HSP 1 Score: 401.0 bits (1029), Expect = 2.0e-108
Identity = 217/465 (46.67%), Positives = 301/465 (64.73%), Query Frame = 1

Query: 5   NFLLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISKH-QLPYGLSLSTFSDG 64
           + LLV+   + H+NP L  A  L+ +G+ VT   +  A + +SK    P GLS + F DG
Sbjct: 9   HILLVTFPAQGHINPALQFAKRLVAIGAHVTFSTSMGAARRMSKTGTYPKGLSFAAFDDG 68

Query: 65  FDDGFT-SSDFPRWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLIPWVTRVALELH 124
            + GF  S D   +  E   +G  +L +L+++  ++   FTC+V++ L+PWV +VA EL+
Sbjct: 69  SEHGFRPSDDIDHYFTELRLVGSKSLAELIAASSKNGRPFTCVVYSNLVPWVAKVARELN 128

Query: 125 VPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMNVLDLPSF 184
           +P+ +LW QS A  D++YYYFNGY D I     D       F++ LPGLP +   DLPSF
Sbjct: 129 LPSTLLWNQSPALLDIFYYYFNGYGDTISENINDP-----TFSLKLPGLPPLGSRDLPSF 188

Query: 185 VVSDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPIGPLVSLP 244
               + H   +    E ++VL+EE N  +LVN+FDALE +A  +IGKF L+ +GPL+  P
Sbjct: 189 FNPRNTHAFAIPVNREHIEVLDEETNPKVLVNTFDALECEALNSIGKFKLVGVGPLI--P 248

Query: 245 PRFEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQTKEIVGAL 304
             F        TS+     Q  +D+I+WLNSK + SV+YIAFGSIS LS  Q +E+  AL
Sbjct: 249 SAFLDGEDPTDTSFGGDLFQGSKDHIEWLNSKPELSVIYIAFGSISALSKPQKEEMARAL 308

Query: 305 LECSYPFLWVLSMDDIQD-ENLSLYFDDELQAQGKIVPWCSQVEVLSHRSVGCFVTHCGW 364
           LE   PFLWV+  D  ++ E   L   +EL+ QGKIVPWCSQVEVLSH S+GCFVTHCGW
Sbjct: 309 LETGRPFLWVIRADRGEEKEEDKLSCKEELEKQGKIVPWCSQVEVLSHPSIGCFVTHCGW 368

Query: 365 NSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSDGEGLVEGKEIARCLRM 424
           NST ES+ +GVP VA+P W DQ TNAK+++DVW+ GVRV  SS+ EG+VEG+EI RCL +
Sbjct: 369 NSTFESLASGVPMVAFPQWTDQLTNAKMVEDVWKTGVRV-TSSNKEGVVEGEEIERCLEV 428

Query: 425 VMDMEDHGRGKQLRINARKWQLLAMEAA--NGSSYMNIKAFVNKV 465
           VM   +  RG ++R NA+KW+ LA +++   GSSY N+KAFV+++
Sbjct: 429 VMGGGE--RGNEMRKNAKKWKELARQSSKEGGSSYNNLKAFVDEI 463

BLAST of Cla021857 vs. TrEMBL
Match: F6I4F8_VITVI (Glycosyltransferase OS=Vitis vinifera GN=VIT_05s0062g00710 PE=3 SV=1)

HSP 1 Score: 399.8 bits (1026), Expect = 4.5e-108
Identity = 219/465 (47.10%), Positives = 303/465 (65.16%), Query Frame = 1

Query: 7   LLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISKHQLPYGLSLSTFSDGFDD 66
           LLV+   + H+NP+L LA  L+  G+ VT + ++ A   +SK     GL   TFSDG+D 
Sbjct: 6   LLVTYPAQGHINPSLQLAKLLIRAGAHVTFVTSSSAGTRMSKSPTLDGLEFVTFSDGYDH 65

Query: 67  GFTSSD-FPRWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLIPWVTRVALELHVPT 126
           GF   D    +  E ERLG  AL  L+ ++      FTC+++ +LIPWV  VA  LH+P+
Sbjct: 66  GFDHGDGLQNFMSELERLGSPALTKLIMARANEGRPFTCLLYGMLIPWVAEVARSLHLPS 125

Query: 127 AILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMNVLDLPSFVVS 186
           A++W Q  A FD+YYYYFNGY ++I N  K +GS+S   +I LPGLPL++  DLPSF+V 
Sbjct: 126 ALVWSQPAAVFDIYYYYFNGYGELIGN--KGNGSSS---SIELPGLPLISSSDLPSFLVP 185

Query: 187 DDY--HGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPIGPLVSLPP 246
                H  +LK  +++++ L  E N  +LVNSFDALE +A  AI KF L+ IGPL  LP 
Sbjct: 186 SKVSAHNFVLKLHQKQLEQLNRESNPRVLVNSFDALESEALRAINKFKLMGIGPL--LPS 245

Query: 247 RFEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQTKEIVGALL 306
            F        TS+     +  +DYI+WLNS  +SSV+Y++FGS+S LS +Q++EI   LL
Sbjct: 246 AFLDGKDPSDTSFGGDLFRGSKDYIQWLNSNAESSVIYVSFGSLSVLSKQQSEEIARGLL 305

Query: 307 ECSYPFLWVLSMDDIQDENLS--LYFDDELQAQGKIVPWCSQVEVLSHRSVGCFVTHCGW 366
           +   PFLWV+   + ++E     L   +EL+  G IVPWCSQVEVLSH S+GCFV+HCGW
Sbjct: 306 DSGRPFLWVIRAKENEEEEKEDKLSCVEELEQLGMIVPWCSQVEVLSHPSLGCFVSHCGW 365

Query: 367 NSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSDGEGLVEGKEIARCLRM 426
           NST+ES+ +GVP VA+P W DQ TNAKL++DVW+ G+RV  +   EG+VEG EI +CL +
Sbjct: 366 NSTLESLASGVPVVAFPQWTDQTTNAKLIEDVWKTGLRVMVNQ--EGIVEGGEIKKCLEL 425

Query: 427 VMDMEDHGRGKQLRINARKWQLLAMEAA--NGSSYMNIKAFVNKV 465
           VM   +  RG+++R NA+KW+ LA EA    GSS  N+K FV+++
Sbjct: 426 VMGGGE--RGQEVRSNAKKWKDLAREAVKDGGSSDKNLKNFVDEI 459

BLAST of Cla021857 vs. TrEMBL
Match: Q94IP3_SOLSG (Glycosyltransferase OS=Solanum sogarandinum GN=Ssci17 PE=2 SV=1)

HSP 1 Score: 398.3 bits (1022), Expect = 1.3e-107
Identity = 218/481 (45.32%), Positives = 309/481 (64.24%), Query Frame = 1

Query: 1   MEHGNFLLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISK---HQLPYGLSL 60
           M   + LLV+   + H+NP+L  A  L+ +G +VT   + +A + ++K      P GL+L
Sbjct: 1   MVQPHVLLVTFPTQGHINPSLQFAKKLIKMGIEVTFTTSVFAHRRMAKTATSTAPKGLNL 60

Query: 61  STFSDGFDDGFTSS--DFPRWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLIPWVT 120
           + FSDGFDDGF S+  D  R+  E    G   L D++          T +V+TLL+PW  
Sbjct: 61  AAFSDGFDDGFKSNVDDSKRYMSEIRSRGSQTLRDIILKSSDEGRPVTSLVYTLLLPWAA 120

Query: 121 RVALELHVPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMN 180
            VA ELH+P+A+LWIQ     D+YYYYFNGY D ++       SN   ++I LP LPL+ 
Sbjct: 121 EVARELHIPSALLWIQPATVLDIYYYYFNGYEDEMKCS-----SNDPNWSIQLPRLPLLK 180

Query: 181 VLDLPSFVVS----DDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFN 240
             DLPSF+VS    DD +   L +F+E++  L+ EEN  +LVN+FDALE +   AIGK+N
Sbjct: 181 SQDLPSFLVSSSSKDDKYSFALPTFKEQLDTLDGEENPKVLVNTFDALELEPLKAIGKYN 240

Query: 241 LIPIGPLVSLPPRFEVSTKQRTTSYLQGG--QQAQEDYIKWLNSKFDSSVVYIAFGSISK 300
           LI IGPL+   P   +  K    S   G   Q++ +DY++WLN+K  SS+VYI+FGS+  
Sbjct: 241 LIGIGPLI---PSSFLGGKDSLESRFGGDLFQKSNDDYMEWLNTKPKSSIVYISFGSLLN 300

Query: 301 LSNKQTKEIVGALLECSYPFLWVL----SMDDIQDENLSLYFDDELQAQGKIVPWCSQVE 360
           LS  Q +EI   L+E   PFLWV+    ++ +++ E   L    EL+ QGKIVPWCSQ+E
Sbjct: 301 LSRNQKEEIAKGLIEIKRPFLWVIRDQENIKEVEKEEEKLSCMMELEKQGKIVPWCSQLE 360

Query: 361 VLSHRSVGCFVTHCGWNSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSD 420
           VL+H S+GCFV+HCGWNST+ES+++GVP VA+P W DQ TNAK ++DVW+ GVR++ + D
Sbjct: 361 VLTHPSLGCFVSHCGWNSTLESLSSGVPVVAFPHWTDQGTNAKWIEDVWKTGVRMRVNED 420

Query: 421 GEGLVEGKEIARCLRMVMDMEDHGRGKQLRINARKWQLLAMEAA--NGSSYMNIKAFVNK 465
             G+VE +EI RC+ +VMD  +  +G+++R NA+KW+ LA EA    GSS +N+KAFV +
Sbjct: 421 --GVVESEEIKRCIEIVMDGGE--KGEEMRKNAQKWKELAREAVKEGGSSEVNLKAFVQE 469

BLAST of Cla021857 vs. TrEMBL
Match: F6I4D5_VITVI (Glycosyltransferase OS=Vitis vinifera GN=VIT_05s0062g00350 PE=3 SV=1)

HSP 1 Score: 397.1 bits (1019), Expect = 2.9e-107
Identity = 214/469 (45.63%), Positives = 306/469 (65.25%), Query Frame = 1

Query: 3   HGNFLLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISKHQLPYGLSLSTFSD 62
           H + L+V+   + H+NPTL LA  L+  G+ VT   +  A   +SK     GL  +TFSD
Sbjct: 2   HPHILIVTLPSQGHINPTLQLAKLLIRAGAHVTFFTSTSAGTRMSKSPNLDGLEFATFSD 61

Query: 63  GFDDGFTSSD-FPRWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLIPWVTRVALEL 122
           G+D G    D   ++  + ERLG  AL++L+ +       F C+++ + IPWV  VA  L
Sbjct: 62  GYDHGLKQGDDVEKFMSQIERLGSQALIELIMASANEGRPFACLLYGVQIPWVAEVAHSL 121

Query: 123 HVPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMNVLDLPS 182
           H+P+A++W Q  A FD+YYYYFNGY ++I+N  K D  +S    I LPGLPL+N  DLPS
Sbjct: 122 HIPSALVWTQPAAVFDIYYYYFNGYGELIQN--KGDHPSS---TIELPGLPLLNNSDLPS 181

Query: 183 FVV--SDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPIGPLV 242
           F++    + +   L  F++ +++L  E N  +L+NSFDALE +A  AI KFNL+ IGPL+
Sbjct: 182 FLIPPKGNTYKFALPGFQKHLEMLNCESNPKVLINSFDALESEALGAINKFNLMGIGPLI 241

Query: 243 SLPPRFEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQTKEIV 302
             P  F        TS+     ++ +DYI+WLNSK  SSV+Y++FGS+  LS +Q++EI 
Sbjct: 242 --PSAFLDGKDPSDTSFGGDLFRSSKDYIQWLNSKPKSSVIYVSFGSLFVLSKQQSEEIA 301

Query: 303 GALLECSYPFLWVLSMDDIQDENLSLYFDDELQAQGKIVPWCSQVEVLSHRSVGCFVTHC 362
             LL+   PFLWV+ +++ ++E  +L   +EL+ QG +VPWCSQVEVLSH S+GCFVTH 
Sbjct: 302 RGLLDGGRPFLWVIRLEENEEEK-TLSCHEELERQGMMVPWCSQVEVLSHPSMGCFVTHS 361

Query: 363 GWNSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSDGEGLVEGKEIARCL 422
           GWNST+ES+T+GVP VA+P W+DQATNAKL++ VW+ G+R   +   EG+VE  EI RCL
Sbjct: 362 GWNSTLESLTSGVPVVAFPQWSDQATNAKLIEVVWKTGLRAMVNQ--EGIVEADEIKRCL 421

Query: 423 RMVMDMEDHGRGKQLRINARKWQLLAMEAA--NGSSYMNIKAFVNKVCY 467
            +VM   +  RG+++R NA KW++LA EA    GSS  N+K F+N+V +
Sbjct: 422 ELVMGSGE--RGEEMRRNATKWKVLAREAVKEGGSSDKNLKNFMNEVMH 458

BLAST of Cla021857 vs. NCBI nr
Match: gi|449459876|ref|XP_004147672.1| (PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Cucumis sativus])

HSP 1 Score: 517.7 bits (1332), Expect = 2.1e-143
Identity = 272/367 (74.11%), Positives = 292/367 (79.56%), Query Frame = 1

Query: 1   MEHGNFLLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISKHQLPYGLSLSTF 60
           M+HGNFLLVSQSP SHLNPTLH ASTLL LGSKVTLLLTN+ALKNIS+ QLP GLSLSTF
Sbjct: 1   MKHGNFLLVSQSPTSHLNPTLHFASTLLSLGSKVTLLLTNHALKNISEDQLPSGLSLSTF 60

Query: 61  SDGFDDGFTSSDFPRWRVEFERLGRLALVDLLSSQ-QQSLLSFTCIVHTLLIPWVTRVAL 120
           SDGFD+GFT SD   W VEFERLGR ALV+LLSS  +Q LL  TCIV+TLLIPWV +VA 
Sbjct: 61  SDGFDNGFTYSDLQLWFVEFERLGRAALVNLLSSSSKQGLLPITCIVNTLLIPWVAQVAR 120

Query: 121 ELHVPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMNVLDL 180
           E HV TAILW QSVA FDVYYYYFNGYS VIRNGYK+D SNSL FNI LPGLPLMNVLDL
Sbjct: 121 EFHVSTAILWTQSVAVFDVYYYYFNGYSGVIRNGYKEDDSNSLSFNISLPGLPLMNVLDL 180

Query: 181 PSFVVSDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPIGPLV 240
           PSF+VSDD+HGLI+KSFEEK+Q+L+EE+NV ILVNSFDALEHDA +AIG FNLIPIGP V
Sbjct: 181 PSFMVSDDHHGLIIKSFEEKIQILKEEDNVPILVNSFDALEHDALSAIGTFNLIPIGPSV 240

Query: 241 SLPPRFEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQTKEIV 300
            LP   E   KQR  SY Q GQQAQEDYIKWLNSK DSSVVYIAFGS SKLS +QTKE+V
Sbjct: 241 LLPLGCE---KQRNISYFQDGQQAQEDYIKWLNSKPDSSVVYIAFGSFSKLSKEQTKEMV 300

Query: 301 GALLECSYPFLWVLSMDDIQDENLSLYFDDELQAQGKIVPWCSQVEVLSHRSVGCFVTHC 360
           GALLECSY                               PWCSQVEVLSHR+VGCFVTHC
Sbjct: 301 GALLECSY-------------------------------PWCSQVEVLSHRAVGCFVTHC 333

Query: 361 GWNSTIE 367
           GWNSTIE
Sbjct: 361 GWNSTIE 333

BLAST of Cla021857 vs. NCBI nr
Match: gi|224090320|ref|XP_002308970.1| (putative glucosyltransferase family protein [Populus trichocarpa])

HSP 1 Score: 401.0 bits (1029), Expect = 2.9e-108
Identity = 217/465 (46.67%), Positives = 301/465 (64.73%), Query Frame = 1

Query: 5   NFLLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISKH-QLPYGLSLSTFSDG 64
           + LLV+   + H+NP L  A  L+ +G+ VT   +  A + +SK    P GLS + F DG
Sbjct: 9   HILLVTFPAQGHINPALQFAKRLVAIGAHVTFSTSMGAARRMSKTGTYPKGLSFAAFDDG 68

Query: 65  FDDGFT-SSDFPRWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLIPWVTRVALELH 124
            + GF  S D   +  E   +G  +L +L+++  ++   FTC+V++ L+PWV +VA EL+
Sbjct: 69  SEHGFRPSDDIDHYFTELRLVGSKSLAELIAASSKNGRPFTCVVYSNLVPWVAKVARELN 128

Query: 125 VPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMNVLDLPSF 184
           +P+ +LW QS A  D++YYYFNGY D I     D       F++ LPGLP +   DLPSF
Sbjct: 129 LPSTLLWNQSPALLDIFYYYFNGYGDTISENINDP-----TFSLKLPGLPPLGSRDLPSF 188

Query: 185 VVSDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPIGPLVSLP 244
               + H   +    E ++VL+EE N  +LVN+FDALE +A  +IGKF L+ +GPL+  P
Sbjct: 189 FNPRNTHAFAIPVNREHIEVLDEETNPKVLVNTFDALECEALNSIGKFKLVGVGPLI--P 248

Query: 245 PRFEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQTKEIVGAL 304
             F        TS+     Q  +D+I+WLNSK + SV+YIAFGSIS LS  Q +E+  AL
Sbjct: 249 SAFLDGEDPTDTSFGGDLFQGSKDHIEWLNSKPELSVIYIAFGSISALSKPQKEEMARAL 308

Query: 305 LECSYPFLWVLSMDDIQD-ENLSLYFDDELQAQGKIVPWCSQVEVLSHRSVGCFVTHCGW 364
           LE   PFLWV+  D  ++ E   L   +EL+ QGKIVPWCSQVEVLSH S+GCFVTHCGW
Sbjct: 309 LETGRPFLWVIRADRGEEKEEDKLSCKEELEKQGKIVPWCSQVEVLSHPSIGCFVTHCGW 368

Query: 365 NSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSDGEGLVEGKEIARCLRM 424
           NST ES+ +GVP VA+P W DQ TNAK+++DVW+ GVRV  SS+ EG+VEG+EI RCL +
Sbjct: 369 NSTFESLASGVPMVAFPQWTDQLTNAKMVEDVWKTGVRV-TSSNKEGVVEGEEIERCLEV 428

Query: 425 VMDMEDHGRGKQLRINARKWQLLAMEAA--NGSSYMNIKAFVNKV 465
           VM   +  RG ++R NA+KW+ LA +++   GSSY N+KAFV+++
Sbjct: 429 VMGGGE--RGNEMRKNAKKWKELARQSSKEGGSSYNNLKAFVDEI 463

BLAST of Cla021857 vs. NCBI nr
Match: gi|970053780|ref|XP_015088519.1| (PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Solanum pennellii])

HSP 1 Score: 399.8 bits (1026), Expect = 6.5e-108
Identity = 220/478 (46.03%), Positives = 307/478 (64.23%), Query Frame = 1

Query: 1   MEHGNFLLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISK---HQLPYGLSL 60
           M   + LLV+   + H+NP+L  A  L+ +G +VT   + +A + ++K      P GL+L
Sbjct: 1   MVQPHVLLVTFPAQGHINPSLQFAKRLIEMGIEVTFTTSVFAHRRMAKTAASTAPKGLNL 60

Query: 61  STFSDGFDDGFTSS--DFPRWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLIPWVT 120
           + FSDGFDDGF S+  D  R+  E    G   L D++          T +V+TLL+PW  
Sbjct: 61  AAFSDGFDDGFKSNKDDSKRYMSEIRSRGSQTLRDIILKSSDEGRPVTSLVYTLLLPWAA 120

Query: 121 RVALELHVPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMN 180
            VA ELH+P+A+LWIQ     D+YYYYFNGY D +    K   SN   ++I LP LPL+ 
Sbjct: 121 EVARELHIPSALLWIQPATVLDIYYYYFNGYEDEM----KCSSSNDPNWSIQLPRLPLLK 180

Query: 181 VLDLPSFVVS----DDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFN 240
             DLPSF+VS    DD +   L +F+E++  L+ EEN  +LVN+FDALE +   AI K+N
Sbjct: 181 SQDLPSFLVSSSSKDDKYSFALPTFKEQLDTLDGEENPKVLVNTFDALELEPLKAIEKYN 240

Query: 241 LIPIGPLVSLPPRFEVSTKQRTTSYLQGG---QQAQEDYIKWLNSKFDSSVVYIAFGSIS 300
           LI IGPL+  P  F        +S+  GG   Q++ +DY++WLN+K  SS+VYI+FGS+ 
Sbjct: 241 LIGIGPLI--PSSFLGGKDSLESSF--GGDLFQKSNDDYMEWLNTKPKSSIVYISFGSLL 300

Query: 301 KLSNKQTKEIVGALLECSYPFLWVLSMDDIQDENLSLYFDDELQAQGKIVPWCSQVEVLS 360
            LS  Q +EI   L+E   PFLWV+   + + E   L    EL+ QGKIVPWCSQ+EVL+
Sbjct: 301 NLSRNQKEEIAKGLIEIQRPFLWVIRDQEEEKEEEKLSCMMELEKQGKIVPWCSQLEVLT 360

Query: 361 HRSVGCFVTHCGWNSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSDGEG 420
           H S+GCFV+HCGWNST+ES+++GVP VA+P W DQ TNAKL++DVW+ GVR++ + D  G
Sbjct: 361 HPSLGCFVSHCGWNSTLESLSSGVPVVAFPHWTDQGTNAKLIEDVWKTGVRMRVNED--G 420

Query: 421 LVEGKEIARCLRMVMDMEDHGRGKQLRINARKWQLLAMEAA--NGSSYMNIKAFVNKV 465
           +VE  EI RC+ +VMD  +  +G+++R NA+KW+ LA EA    GSS +N+KAFV +V
Sbjct: 421 VVESDEIKRCIEIVMDGGE--KGEEMRKNAQKWKELAREAVKEGGSSEVNLKAFVQQV 466

BLAST of Cla021857 vs. NCBI nr
Match: gi|225433626|ref|XP_002263975.1| (PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Vitis vinifera])

HSP 1 Score: 399.8 bits (1026), Expect = 6.5e-108
Identity = 219/465 (47.10%), Positives = 303/465 (65.16%), Query Frame = 1

Query: 7   LLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISKHQLPYGLSLSTFSDGFDD 66
           LLV+   + H+NP+L LA  L+  G+ VT + ++ A   +SK     GL   TFSDG+D 
Sbjct: 6   LLVTYPAQGHINPSLQLAKLLIRAGAHVTFVTSSSAGTRMSKSPTLDGLEFVTFSDGYDH 65

Query: 67  GFTSSD-FPRWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLIPWVTRVALELHVPT 126
           GF   D    +  E ERLG  AL  L+ ++      FTC+++ +LIPWV  VA  LH+P+
Sbjct: 66  GFDHGDGLQNFMSELERLGSPALTKLIMARANEGRPFTCLLYGMLIPWVAEVARSLHLPS 125

Query: 127 AILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMNVLDLPSFVVS 186
           A++W Q  A FD+YYYYFNGY ++I N  K +GS+S   +I LPGLPL++  DLPSF+V 
Sbjct: 126 ALVWSQPAAVFDIYYYYFNGYGELIGN--KGNGSSS---SIELPGLPLISSSDLPSFLVP 185

Query: 187 DDY--HGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPIGPLVSLPP 246
                H  +LK  +++++ L  E N  +LVNSFDALE +A  AI KF L+ IGPL  LP 
Sbjct: 186 SKVSAHNFVLKLHQKQLEQLNRESNPRVLVNSFDALESEALRAINKFKLMGIGPL--LPS 245

Query: 247 RFEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQTKEIVGALL 306
            F        TS+     +  +DYI+WLNS  +SSV+Y++FGS+S LS +Q++EI   LL
Sbjct: 246 AFLDGKDPSDTSFGGDLFRGSKDYIQWLNSNAESSVIYVSFGSLSVLSKQQSEEIARGLL 305

Query: 307 ECSYPFLWVLSMDDIQDENLS--LYFDDELQAQGKIVPWCSQVEVLSHRSVGCFVTHCGW 366
           +   PFLWV+   + ++E     L   +EL+  G IVPWCSQVEVLSH S+GCFV+HCGW
Sbjct: 306 DSGRPFLWVIRAKENEEEEKEDKLSCVEELEQLGMIVPWCSQVEVLSHPSLGCFVSHCGW 365

Query: 367 NSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSDGEGLVEGKEIARCLRM 426
           NST+ES+ +GVP VA+P W DQ TNAKL++DVW+ G+RV  +   EG+VEG EI +CL +
Sbjct: 366 NSTLESLASGVPVVAFPQWTDQTTNAKLIEDVWKTGLRVMVNQ--EGIVEGGEIKKCLEL 425

Query: 427 VMDMEDHGRGKQLRINARKWQLLAMEAA--NGSSYMNIKAFVNKV 465
           VM   +  RG+++R NA+KW+ LA EA    GSS  N+K FV+++
Sbjct: 426 VMGGGE--RGQEVRSNAKKWKDLAREAVKDGGSSDKNLKNFVDEI 459
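The percentages on an HSP summary line follow directly from the counts printed next to them (matches divided by alignment length). A minimal sketch that re-derives them is given here; the function name `hsp_percentages` and the regular expression are illustrative assumptions and not part of any BLAST tool.

```python
import re

# Pattern for the HSP summary line as printed on this page.
# "Posi?tives" accepts both "Positives" and the misspelling "Postives".
HSP_LINE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_LINE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, pos, pos_len = map(int, m.groups())
    # Each percentage is count / alignment length, rounded to two decimals
    return (round(100 * ident / ident_len, 2),
            round(100 * pos / pos_len, 2))

line = "Identity = 219/465 (47.10%), Positives = 303/465 (65.16%), Query Frame = 1"
print(hsp_percentages(line))
```

For the Vitis vinifera hit this gives 47.1 and 65.16, which agrees with the values reported on the summary line.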

BLAST of Cla021857 vs. NCBI nr
Match: gi|743817028|ref|XP_011020320.1| (PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Populus euphratica])

HSP 1 Score: 399.4 bits (1025), Expect = 8.4e-108
Identity = 218/465 (46.88%), Positives = 302/465 (64.95%), Query Frame = 1

Query: 5   NFLLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISKH-QLPYGLSLSTFSDG 64
           + LLV+   + H+NP L  A  L+ +G+ VT   +  A + +SK    P GLS + F DG
Sbjct: 9   HILLVTFPAQGHINPALQFAKRLVAIGAHVTFSTSMGAARRMSKTGTYPEGLSFAAFDDG 68

Query: 65  FDDGFT-SSDFPRWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLIPWVTRVALELH 124
            + GF  S D   +  E   +G  +L +L+ +  ++   FT +V++ LIPWV +VA EL 
Sbjct: 69  SEHGFRPSDDINHYFTELRLVGSKSLAELIVASSKNGRPFTRVVYSNLIPWVAKVARELK 128

Query: 125 VPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMNVLDLPSF 184
           +P+ +LW QS A  D++YYYFNGY D IR    D       F++ LPGLP +   DLPSF
Sbjct: 129 LPSTLLWNQSPALLDIFYYYFNGYGDTIRENINDP-----TFSLKLPGLPPLGSRDLPSF 188

Query: 185 VVSDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPIGPLVSLP 244
               + H   +    E ++VL+EE N  +LVN+FDALE +A  +IGK+ L+ +GPL+  P
Sbjct: 189 FNPRNTHAFAIPVNREHIEVLDEETNPKVLVNTFDALECEALNSIGKYKLVGVGPLI--P 248

Query: 245 PRFEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQTKEIVGAL 304
             F        TS+     Q  +D+I+WLNSK +SSV+YIAFGSIS LS  Q +E+  AL
Sbjct: 249 SAFLDGEDPTDTSFGGDLFQGSKDHIEWLNSKPESSVIYIAFGSISALSKPQKEEMARAL 308

Query: 305 LECSYPFLWVLSMDDIQD-ENLSLYFDDELQAQGKIVPWCSQVEVLSHRSVGCFVTHCGW 364
           LE   PFLWV+  D  ++ E   L  ++EL+ QGKIVPWCSQVEVLSH S+GCFVTHCGW
Sbjct: 309 LETGRPFLWVIRADRGEEKEEGKLSCNEELEKQGKIVPWCSQVEVLSHPSIGCFVTHCGW 368

Query: 365 NSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSDGEGLVEGKEIARCLRM 424
           NST ES+++GVP VA+P W DQ TNAK+++DVW+ GVRV  SS+ EG+VEG+EI RCL +
Sbjct: 369 NSTFESLSSGVPMVAFPQWTDQLTNAKMVEDVWKTGVRV-TSSNKEGVVEGEEIERCLEL 428

Query: 425 VMDMEDHGRGKQLRINARKWQLLAMEAA--NGSSYMNIKAFVNKV 465
           VM   +  RG ++R NA+KW+ LA +++   GSSY N+KAFV+++
Sbjct: 429 VMGGGE--RGSEIRKNAKKWKELARQSSKEGGSSYNNLKAFVDEI 463

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
UGT1_GARJA	1.9e-107	45.13	Crocetin glucosyltransferase, chloroplastic OS=Gardenia jasminoides GN=UGT75L6 P...
U75D1_ARATH	1.7e-95	39.38	UDP-glycosyltransferase 75D1 OS=Arabidopsis thaliana GN=UGT75D1 PE=2 SV=2
5GT_VERHY	4.4e-88	40.88	Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase OS=Verbena hybrida GN=HGT8 P...
U75B2_ARATH	1.7e-87	39.63	UDP-glycosyltransferase 75B2 OS=Arabidopsis thaliana GN=UGT75B2 PE=2 SV=1
5GT1_PERFR	3.7e-87	41.44	Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 1 OS=Perilla frutescens GN=P...

Match Name	E-value	Identity (%)	Description
A0A0A0L890_CUCSA	1.5e-143	74.11	Uncharacterized protein OS=Cucumis sativus GN=Csa_3G172390 PE=4 SV=1
B9HA74_POPTR	2.0e-108	46.67	Glycosyltransferase OS=Populus trichocarpa GN=POPTR_0006s05450g PE=3 SV=1
F6I4F8_VITVI	4.5e-108	47.10	Glycosyltransferase OS=Vitis vinifera GN=VIT_05s0062g00710 PE=3 SV=1
Q94IP3_SOLSG	1.3e-107	45.32	Glycosyltransferase OS=Solanum sogarandinum GN=Ssci17 PE=2 SV=1
F6I4D5_VITVI	2.9e-107	45.63	Glycosyltransferase OS=Vitis vinifera GN=VIT_05s0062g00350 PE=3 SV=1

Match Name	E-value	Identity (%)	Description
gi|449459876|ref|XP_004147672.1|	2.1e-143	74.11	PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Cucumis sativus]
gi|224090320|ref|XP_002308970.1|	2.9e-108	46.67	putative glucosyltransferase family protein [Populus trichocarpa]
gi|970053780|ref|XP_015088519.1|	6.5e-108	46.03	PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Solanum pennellii]
gi|225433626|ref|XP_002263975.1|	6.5e-108	47.10	PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Vitis vinifera]
gi|743817028|ref|XP_011020320.1|	8.4e-108	46.88	PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Populus euphratica]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR002213	UDP_glucos_trans
Vocabulary: Molecular Function
Term	Definition
GO:0016758	transferase activity, transferring hexosyl groups
Vocabulary: Biological Process
Term	Definition
GO:0008152	metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
biological_process GO:0009813 flavonoid biosynthetic process
biological_process GO:0052696 flavonoid glucuronidation
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0016758 transferase activity, transferring hexosyl groups
molecular_function GO:0080043 quercetin 3-O-glucosyltransferase activity
molecular_function GO:0080044 quercetin 7-O-glucosyltransferase activity
molecular_function GO:0016740 transferase activity
molecular_function GO:0003674 molecular_function
molecular_function GO:0035251 UDP-glucosyltransferase activity

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cla021857	Cla021857.1	mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR002213	UDP-glucuronosyl/UDP-glucosyltransferase	PANTHER	PTHR11926	GLUCOSYL/GLUCURONOSYL TRANSFERASES	coord: 1..465 score: 1.0E
IPR002213	UDP-glucuronosyl/UDP-glucosyltransferase	PFAM	PF00201	UDPGT	coord: 268..395 score: 1.4
IPR002213	UDP-glucuronosyl/UDP-glucosyltransferase	PROSITE	PS00375	UDPGT	coord: 340..383 scor
None	No IPR available	GENE3D	G3DSA:3.40.50.2000		coord: 262..393 score: 4.8
None	No IPR available	PANTHER	PTHR11926:SF224	SUBFAMILY NOT NAMED	coord: 1..465 score: 1.0E
None	No IPR available	unknown	SSF53756	UDP-Glycosyltransferase/glycogen phosphorylase	coord: 2..464 score: 1.33
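
The `coord: start..end` entries in the Alignment column give the residue range covered by each signature. A small sketch for turning such an entry into a span length follows. It assumes the coordinates are 1-based and inclusive, which this page does not state explicitly; the helper name `span_length` is hypothetical.

```python
import re

# Pattern for the "coord: start..end" fragment used in the Alignment column
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def span_length(text):
    """Return the number of residues covered by a 'coord: a..b' entry.

    Assumption: coordinates are 1-based and inclusive, so the length
    is end - start + 1.
    """
    m = COORD.search(text)
    if m is None:
        raise ValueError("no coord entry found in %r" % text)
    start, end = int(m.group(1)), int(m.group(2))
    if end < start:
        raise ValueError("end precedes start in %r" % text)
    return end - start + 1

print(span_length("coord: 268..395"))  # PFAM PF00201 match
print(span_length("coord: 1..465"))    # PANTHER PTHR11926 match
```

Under that assumption the PFAM match covers 128 residues and the PANTHER match covers 465.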