Cp4.1LG17g03860 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG17g03860
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Glycosyltransferase
Location: Cp4.1LG17 : 2796726 .. 2798090 (+)
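
As a quick consistency check, the span implied by the location coordinates can be computed directly. The sketch below assumes the coordinates are 1-based and inclusive, which is the usual convention for this style of location string but is not stated on the page; the variable names are illustrative only.

```python
# Coordinates copied from the Location field.
# Assumption: both ends are 1-based and inclusive.
start, end = 2796726, 2798090
strand = "+"

span_bp = end - start + 1               # inclusive span on the chromosome
codons, remainder = divmod(span_bp, 3)  # whole codons and leftover bases

print(f"span: {span_bp} bp on the {strand} strand")
print(f"whole codons: {codons}, leftover bases: {remainder}")
```

Under that assumption the span is 1365 bp, which divides evenly into 455 codons; if the entire span is coding, that corresponds to 454 residues plus a stop codon.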
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCACAATCACACCCACATGTGCTCTTCGTAGCATTAGCCACCCACGGCCACTTCAATCCAGGCCTTCACTTAGCCAACACCCTCTCTCACGGCGGCGTTAATGTAACCTTCGCCACCTCCTCCTCCGTCCGCCACCGTCTCCCCAAACTTCCCTCCTCCCCAAACCTCTACTTCTCCTTCTTCTCCGACGGCCAAGACGACGGCTTTAAGCTCGGTAACGACGTCGTTCCATTCCTCTCACAATTCGAGCGCCAAGCCTCCATAGCCATCCGACAAACCATCCTCAAATCCAAAGCTAATGGAAAGCCCTTCTCTTTTGTTGTTTACTCTTTGTTAACTCCATGGATGGCGGAAGTGGCTCGTTCATTTGATCTCCCTTCCGCTCTCTTTTGGAACCAATCCGCCGCCGTTTTTGCAATTTATTATCACTTCTTCAATGGCTATAAAGACATTATTGGAAACGCTTTCTCCCATCCTTCTATTAAAATTCACTTACCCGGGCTTCCTTTGTTGGACTCTAAACAACTTCCCTCTCTATGCAACCCGGCAAACTCCAACTCTTTTGTCCTTAAACTCTACGAAACACACTTCCATGTCATCAAACAAGAACCCCATTTGAAGATTTTAATCAACACGTTTGATGATTTGGAGCACGATGTTTTGAGAGCGATTAGTGTTGAATTGATCCCAATTGGGCCTGTGTTGCCTTCTCTTTCTTTCTTTCACCCTACCAATTCTATGGATTTGAAACCCTACATTGGGTGGCTGAATTCCAAACCCAAATCCTCGGTTGTGTACGTGTCGTTCGGGAGCATTGCGGCGGTGTCGACGGCTCAATTGGAGGAGATAGCTAGAGGGTTATTGGATTCAAAGCTGCCATTTTTATGGGTGATGAGGAAAAGGAGCGATGGTGATGAAGGGGATTTAGTGAGCTGCCGAGAAGAGCTGGTGGCTAAAGGGAAGATAGTTACGTGGTGTTCGCAACTTGAGGTGTTGTTGCATCCATCGGTTGGGTGTTTCTTGACGCATTGTGGTTGGAATTCTTCGTTGGAGAGCATTGCTTGTGGTGTGTCGGTGGTGGCGTTTCCGCAATGGACGGATCAATCCACCAATGCGAGGATCATTGAGGAGTCGTCTAAGAGTGGAGTGAGGTTGAGAATGAATGATGATGGGATTGTTGAGAGGGGAGAGATTAAGAAATGCTTGGACCTTGTCATGGGCGATGGTGTTGAGGGGGAGAGTTTGAGAAGGAATGCTTTAAAATGGAAGGACTTGGCCAACCATGCCACCACTAAGGGAGGTTCTTCCAATGCCAACGTTAGCTCGTTTCTTGATTTTGTGTGTAATGGTGGCAACTGTTAA

mRNA sequence

ATGGCACAATCACACCCACATGTGCTCTTCGTAGCATTAGCCACCCACGGCCACTTCAATCCAGGCCTTCACTTAGCCAACACCCTCTCTCACGGCGGCGTTAATGTAACCTTCGCCACCTCCTCCTCCGTCCGCCACCGTCTCCCCAAACTTCCCTCCTCCCCAAACCTCTACTTCTCCTTCTTCTCCGACGGCCAAGACGACGGCTTTAAGCTCGGTAACGACGTCGTTCCATTCCTCTCACAATTCGAGCGCCAAGCCTCCATAGCCATCCGACAAACCATCCTCAAATCCAAAGCTAATGGAAAGCCCTTCTCTTTTGTTGTTTACTCTTTGTTAACTCCATGGATGGCGGAAGTGGCTCGTTCATTTGATCTCCCTTCCGCTCTCTTTTGGAACCAATCCGCCGCCGTTTTTGCAATTTATTATCACTTCTTCAATGGCTATAAAGACATTATTGGAAACGCTTTCTCCCATCCTTCTATTAAAATTCACTTACCCGGGCTTCCTTTGTTGGACTCTAAACAACTTCCCTCTCTATGCAACCCGGCAAACTCCAACTCTTTTGTCCTTAAACTCTACGAAACACACTTCCATGTCATCAAACAAGAACCCCATTTGAAGATTTTAATCAACACGTTTGATGATTTGGAGCACGATGTTTTGAGAGCGATTAGTGTTGAATTGATCCCAATTGGGCCTGTGTTGCCTTCTCTTTCTTTCTTTCACCCTACCAATTCTATGGATTTGAAACCCTACATTGGGTGGCTGAATTCCAAACCCAAATCCTCGGTTGTGTACGTGTCGTTCGGGAGCATTGCGGCGGTGTCGACGGCTCAATTGGAGGAGATAGCTAGAGGGTTATTGGATTCAAAGCTGCCATTTTTATGGGTGATGAGGAAAAGGAGCGATGGTGATGAAGGGGATTTAGTGAGCTGCCGAGAAGAGCTGGTGGCTAAAGGGAAGATAGTTACGTGGTGTTCGCAACTTGAGGTGTTGTTGCATCCATCGGTTGGGTGTTTCTTGACGCATTGTGGTTGGAATTCTTCGTTGGAGAGCATTGCTTGTGGTGTGTCGGTGGTGGCGTTTCCGCAATGGACGGATCAATCCACCAATGCGAGGATCATTGAGGAGTCGTCTAAGAGTGGAGTGAGGTTGAGAATGAATGATGATGGGATTGTTGAGAGGGGAGAGATTAAGAAATGCTTGGACCTTGTCATGGGCGATGGTGTTGAGGGGGAGAGTTTGAGAAGGAATGCTTTAAAATGGAAGGACTTGGCCAACCATGCCACCACTAAGGGAGGTTCTTCCAATGCCAACGTTAGCTCGTTTCTTGATTTTGTGTGTAATGGTGGCAACTGTTAA

Coding sequence (CDS)

ATGGCACAATCACACCCACATGTGCTCTTCGTAGCATTAGCCACCCACGGCCACTTCAATCCAGGCCTTCACTTAGCCAACACCCTCTCTCACGGCGGCGTTAATGTAACCTTCGCCACCTCCTCCTCCGTCCGCCACCGTCTCCCCAAACTTCCCTCCTCCCCAAACCTCTACTTCTCCTTCTTCTCCGACGGCCAAGACGACGGCTTTAAGCTCGGTAACGACGTCGTTCCATTCCTCTCACAATTCGAGCGCCAAGCCTCCATAGCCATCCGACAAACCATCCTCAAATCCAAAGCTAATGGAAAGCCCTTCTCTTTTGTTGTTTACTCTTTGTTAACTCCATGGATGGCGGAAGTGGCTCGTTCATTTGATCTCCCTTCCGCTCTCTTTTGGAACCAATCCGCCGCCGTTTTTGCAATTTATTATCACTTCTTCAATGGCTATAAAGACATTATTGGAAACGCTTTCTCCCATCCTTCTATTAAAATTCACTTACCCGGGCTTCCTTTGTTGGACTCTAAACAACTTCCCTCTCTATGCAACCCGGCAAACTCCAACTCTTTTGTCCTTAAACTCTACGAAACACACTTCCATGTCATCAAACAAGAACCCCATTTGAAGATTTTAATCAACACGTTTGATGATTTGGAGCACGATGTTTTGAGAGCGATTAGTGTTGAATTGATCCCAATTGGGCCTGTGTTGCCTTCTCTTTCTTTCTTTCACCCTACCAATTCTATGGATTTGAAACCCTACATTGGGTGGCTGAATTCCAAACCCAAATCCTCGGTTGTGTACGTGTCGTTCGGGAGCATTGCGGCGGTGTCGACGGCTCAATTGGAGGAGATAGCTAGAGGGTTATTGGATTCAAAGCTGCCATTTTTATGGGTGATGAGGAAAAGGAGCGATGGTGATGAAGGGGATTTAGTGAGCTGCCGAGAAGAGCTGGTGGCTAAAGGGAAGATAGTTACGTGGTGTTCGCAACTTGAGGTGTTGTTGCATCCATCGGTTGGGTGTTTCTTGACGCATTGTGGTTGGAATTCTTCGTTGGAGAGCATTGCTTGTGGTGTGTCGGTGGTGGCGTTTCCGCAATGGACGGATCAATCCACCAATGCGAGGATCATTGAGGAGTCGTCTAAGAGTGGAGTGAGGTTGAGAATGAATGATGATGGGATTGTTGAGAGGGGAGAGATTAAGAAATGCTTGGACCTTGTCATGGGCGATGGTGTTGAGGGGGAGAGTTTGAGAAGGAATGCTTTAAAATGGAAGGACTTGGCCAACCATGCCACCACTAAGGGAGGTTCTTCCAATGCCAACGTTAGCTCGTTTCTTGATTTTGTGTGTAATGGTGGCAACTGTTAA

Protein sequence

MAQSHPHVLFVALATHGHFNPGLHLANTLSHGGVNVTFATSSSVRHRLPKLPSSPNLYFSFFSDGQDDGFKLGNDVVPFLSQFERQASIAIRQTILKSKANGKPFSFVVYSLLTPWMAEVARSFDLPSALFWNQSAAVFAIYYHFFNGYKDIIGNAFSHPSIKIHLPGLPLLDSKQLPSLCNPANSNSFVLKLYETHFHVIKQEPHLKILINTFDDLEHDVLRAISVELIPIGPVLPSLSFFHPTNSMDLKPYIGWLNSKPKSSVVYVSFGSIAAVSTAQLEEIARGLLDSKLPFLWVMRKRSDGDEGDLVSCREELVAKGKIVTWCSQLEVLLHPSVGCFLTHCGWNSSLESIACGVSVVAFPQWTDQSTNARIIEESSKSGVRLRMNDDGIVERGEIKKCLDLVMGDGVEGESLRRNALKWKDLANHATTKGGSSNANVSSFLDFVCNGGNC
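
The relationship between the coding sequence and the protein sequence above can be spot-checked by translating the first and last few codons. This is a minimal sketch assuming the standard genetic code (NCBI translation table 1); the page does not state which table was used, and the helper name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 30 and last 18 bases of the CDS listed above.
cds_head = "ATGGCACAATCACACCCACATGTGCTCTTC"
cds_tail = "AATGGTGGCAACTGTTAA"

print(translate(cds_head))
print(translate(cds_tail))
```

The head translates to the first ten residues of the protein (MAQSHPHVLF) and the tail to its last five residues followed by a stop (NGGNC*). Passing the full CDS string to `translate` would allow the same comparison over the whole sequence.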
BLAST of Cp4.1LG17g03860 vs. Swiss-Prot
Match: UGT1_GARJA (Crocetin glucosyltransferase, chloroplastic OS=Gardenia jasminoides GN=UGT75L6 PE=1 SV=1)

HSP 1 Score: 400.6 bits (1028), Expect = 2.3e-110
Identity = 218/458 (47.60%), Positives = 285/458 (62.23%), Query Frame = 1

Query: 7   HVLFVALATHGHFNPGLHLANTLSHGGVNVTFATSSSVRHRLPKLPSSP--NLYFSFFSD 66
           HVL +     GH NP L  A  L   G+ VT ATS     R+ K   S    L F+ FSD
Sbjct: 6   HVLLITYPAQGHINPALQFAQRLLRMGIQVTLATSVYALSRMKKSSGSTPKGLTFATFSD 65

Query: 67  GQDDGFK-LGNDVVPFLSQFERQASIAIRQTILKSKANGKPFSFVVYSLLTPWMAEVARS 126
           G DDGF+  G D   ++S   +Q S  +R  I  S   G P + +VY+LL PW A VAR 
Sbjct: 66  GYDDGFRPKGVDHTEYMSSLAKQGSNTLRNVINTSADQGCPVTCLVYTLLLPWAATVARE 125

Query: 127 FDLPSALFWNQSAAVFAIYYHFFNGYKDIIGNAFSHPSIKIHLPGLPLLDSKQLPSLCNP 186
             +PSAL W Q  AV  IYY++F GY+D + N  + P+  I  PGLP + +K LPS   P
Sbjct: 126 CHIPSALLWIQPVAVMDIYYYYFRGYEDDVKNNSNDPTWSIQFPGLPSMKAKDLPSFILP 185

Query: 187 ANSN--SFVLKLYETHFHVIKQEPHLKILINTFDDLEHDVLRAI-SVELIPIGPVLPS-- 246
           ++ N  SF L  ++     + +E   K+L+NTFD LE   L+AI S  LI IGP+ PS  
Sbjct: 186 SSDNIYSFALPTFKKQLETLDEEERPKVLVNTFDALEPQALKAIESYNLIAIGPLTPSAF 245

Query: 247 LSFFHPTN---SMDL----KPYIGWLNSKPKSSVVYVSFGSIAAVSTAQLEEIARGLLDS 306
           L    P+    S DL    K Y  WLNS+P  SVVYVSFGS+  +   Q+EEIARGLL S
Sbjct: 246 LDGKDPSETSFSGDLFQKSKDYKEWLNSRPAGSVVYVSFGSLLTLPKQQMEEIARGLLKS 305

Query: 307 KLPFLWVMRKRSDGDEG---DLVSCREELVAKGKIVTWCSQLEVLLHPSVGCFLTHCGWN 366
             PFLWV+R + +G+E    D + C EEL  +G IV WCSQ+EVL HPS+GCF+THCGWN
Sbjct: 306 GRPFLWVIRAKENGEEEKEEDRLICMEELEEQGMIVPWCSQIEVLTHPSLGCFVTHCGWN 365

Query: 367 SSLESIACGVSVVAFPQWTDQSTNARIIEESSKSGVRLRMNDDGIVERGEIKKCLDLVMG 426
           S+LE++ CGV VVAFP WTDQ TNA++IE+  ++GVR+  N+DG VE  EIK+C++ VM 
Sbjct: 366 STLETLVCGVPVVAFPHWTDQGTNAKLIEDVWETGVRVVPNEDGTVESDEIKRCIETVMD 425

Query: 427 DGVEGESLRRNALKWKDLANHATTKGGSSNANVSSFLD 447
           DG +G  L+RNA KWK+LA  A  + GSS+ N+ +F++
Sbjct: 426 DGEKGVELKRNAKKWKELAREAMQEDGSSDKNLKAFVE 463
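
Each HSP summary line gives identity and positives as a fraction plus a percentage. The sketch below shows one way to pull those values out of such a line and recompute the percentages; the function name `parse_hsp_stats` is hypothetical, and the pattern also tolerates the misspelling "Postives" that appears in some exports.

```python
import re

# Matches "Identity = 218/458" and "Positives = 285/458" (or "Postives").
STAT_RE = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")

def parse_hsp_stats(line: str) -> dict:
    """Return {'identity': (matches, length, percent), 'positives': (...)}."""
    stats = {}
    for label, matched, length in STAT_RE.findall(line):
        key = "identity" if label == "Identity" else "positives"
        matched, length = int(matched), int(length)
        stats[key] = (matched, length, 100.0 * matched / length)
    return stats

# Summary line for the UGT1_GARJA hit above.
line = "Identity = 218/458 (47.60%), Positives = 285/458 (62.23%), Query Frame = 1"
stats = parse_hsp_stats(line)
for key, (matched, length, pct) in stats.items():
    print(f"{key}: {matched}/{length} = {pct:.2f}%")
```

The denominator is the alignment length including gap columns, so it can exceed the number of query residues covered by the HSP.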

BLAST of Cp4.1LG17g03860 vs. Swiss-Prot
Match: 5GT1_PERFR (Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 1 OS=Perilla frutescens GN=PF3R4 PE=1 SV=1)

HSP 1 Score: 357.1 bits (915), Expect = 2.9e-97
Identity = 206/458 (44.98%), Positives = 275/458 (60.04%), Query Frame = 1

Query: 8   VLFVALATHGHFNPGLHLANTLSHGGVNVTFATSSSVRHRLPKLPSS-----PNLYFSFF 67
           VL       GH NP L  A  L   G +VTF TS     R+    S+     P L F  F
Sbjct: 6   VLLATFPAQGHINPALQFAKRLLKAGTDVTFFTSVYAWRRMANTASAAAGNPPGLDFVAF 65

Query: 68  SDGQDDGFKLGNDVVPFLSQFERQASIAIRQTILKSKANGKPFSFVVYSLLTPWMAEVAR 127
           SDG DDG K   D   ++S+ + + S A+R  +L    N    +FVVYS L  W AEVAR
Sbjct: 66  SDGYDDGLKPCGDGKRYMSEMKARGSEALRNLLL----NNHDVTFVVYSHLFAWAAEVAR 125

Query: 128 SFDLPSALFWNQSAAVFAIYYHFFNGYKDIIGNAFSHPSIKIHLPGLPLLDSKQLPSLCN 187
              +PSAL W + A V  IYY +FNGY D I       S +I LP LP L+ + LP+   
Sbjct: 126 ESQVPSALLWVEPATVLCIYYFYFNGYADEIDAG----SDEIQLPRLPPLEQRSLPTFLL 185

Query: 188 PANSNSFVLKLYETHFHVIKQEPHLKILINTFDDLEHDVLRAIS-VELIPIGPVLPS--L 247
           P     F L + E     +  E   K+L+NTFD LE D L AI   ELI IGP++PS  L
Sbjct: 186 PETPERFRLMMKEK-LETLDGEEKAKVLVNTFDALEPDALTAIDRYELIGIGPLIPSAFL 245

Query: 248 SFFHPTNSM---DL------KPYIGWLNSKPKSSVVYVSFGSIAAVSTAQLEEIARGLLD 307
               P+ +    DL         + WL++KPKSSVVYVSFGS+     AQ+EEI +GLL 
Sbjct: 246 DGGDPSETSYGGDLFEKSEENNCVEWLDTKPKSSVVYVSFGSVLRFPKAQMEEIGKGLLA 305

Query: 308 SKLPFLWVMRKRSDGD---EGDLVSCREELVAKGKIVTWCSQLEVLLHPSVGCFLTHCGW 367
              PFLW++R++ + D   E + +SC  EL   GKIV+WCSQLEVL HP++GCF+THCGW
Sbjct: 306 CGRPFLWMIREQKNDDGEEEEEELSCIGELKKMGKIVSWCSQLEVLAHPALGCFVTHCGW 365

Query: 368 NSSLESIACGVSVVAFPQWTDQSTNARIIEESSKSGVRLRMNDDGIVERGEIKKCLDLVM 427
           NS++ES++CGV VVA PQW DQ+TNA++IE++  +GVR+RMN+ G V+  EI++C+++VM
Sbjct: 366 NSAVESLSCGVPVVAVPQWFDQTTNAKLIEDAWGTGVRVRMNEGGGVDGSEIERCVEMVM 425

Query: 428 GDGVEGESLRRNALKWKDLANHATTKGGSSNANVSSFL 446
             G + + +R NA+KWK LA  A  + GSS  N+++FL
Sbjct: 426 DGGEKSKLVRENAIKWKTLAREAMGEDGSSLKNLNAFL 454
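
The HSP score lines report a bit score followed by the raw score in parentheses. The page does not state the scoring system, so the following is only a sketch under an assumption: BLASTP defaults (BLOSUM62, gap open 11, gap extend 1), for which the commonly tabulated Karlin-Altschul parameters are lambda ≈ 0.267 and K ≈ 0.041. With those values, the bit score is S' = (lambda * S - ln K) / ln 2.

```python
import math

# Assumed Karlin-Altschul parameters for gapped BLOSUM62 (11/1).
# These are not given on the page; treat them as an assumption.
LAMBDA = 0.267
K = 0.041

def bit_score(raw_score: int) -> float:
    """Convert a raw alignment score to a bit score."""
    return (LAMBDA * raw_score - math.log(K)) / math.log(2)

# (raw score, bit score as reported) for the first two Swiss-Prot hits above.
reported = [(1028, 400.6), (915, 357.1)]
for raw, bits in reported:
    print(f"raw {raw}: computed {bit_score(raw):.1f} bits, reported {bits} bits")
```

Under these assumed parameters the computed values agree with the reported bit scores to within rounding, which suggests, but does not prove, that default scoring was used.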

BLAST of Cp4.1LG17g03860 vs. Swiss-Prot
Match: 5GT_VERHY (Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase OS=Verbena hybrida GN=HGT8 PE=2 SV=1)

HSP 1 Score: 356.3 bits (913), Expect = 5.0e-97
Identity = 206/465 (44.30%), Positives = 281/465 (60.43%), Query Frame = 1

Query: 4   SHPHVLFVALATHGHFNPGLHLANTLSHGGVNVTFATSSSVRHRLPKLPSSPNLYFSF-- 63
           S  HVL       GH NP L  A  L++  + VTF TS     R+ +  +  N   +F  
Sbjct: 2   SRAHVLLATFPAQGHINPALQFAKRLANADIQVTFFTSVYAWRRMSRTAAGSNGLINFVS 61

Query: 64  FSDGQDDGFKLGNDVVPFLSQFERQASIAIRQTILKSKANGKP--FSFVVYSLLTPWMAE 123
           FSDG DDG + G+D   ++S+ + +   A+  T+  +  + K    +FVVYS L  W A+
Sbjct: 62  FSDGYDDGLQPGDDGKNYMSEMKSRGIKALSDTLAANNVDQKSSKITFVVYSHLFAWAAK 121

Query: 124 VARSFDLPSALFWNQSAAVFAIYYHFFNGYKDIIGNAFSHPSIKIHLPG-LPLLDSKQLP 183
           VAR F L SAL W + A V  I+Y +FNGY D I       S  IHLPG LP+L  + LP
Sbjct: 122 VAREFHLRSALLWIEPATVLDIFYFYFNGYSDEIDAG----SDAIHLPGGLPVLAQRDLP 181

Query: 184 SLCNPANSNSFVLKLYETHFHVIKQEPHLKILINTFDDLEHDVLRAIS-VELIPIGPVLP 243
           S   P+    F   L +     ++ E   K+L+N+FD LE D L+AI   E+I IGP++P
Sbjct: 182 SFLLPSTHERF-RSLMKEKLETLEGEEKPKVLVNSFDALEPDALKAIDKYEMIAIGPLIP 241

Query: 244 SLSFFHPTNSMDLK-------------PYIGWLNSKPKSSVVYVSFGSIAAVSTAQLEEI 303
           S +F    +  D                 + WL++ P+SSVVYVSFGS    + +Q+EEI
Sbjct: 242 S-AFLDGKDPSDRSFGGDLFEKGSNDDDCLEWLSTNPRSSVVYVSFGSFVNTTKSQMEEI 301

Query: 304 ARGLLDSKLPFLWVMRKRSDGDEGDLVSCREELVAKGKIVTWCSQLEVLLHPSVGCFLTH 363
           ARGLLD   PFLWV+R  ++G+E  L+SC EEL   GKIV+WCSQLEVL HPS+GCF+TH
Sbjct: 302 ARGLLDCGRPFLWVVRV-NEGEEV-LISCMEELKRVGKIVSWCSQLEVLTHPSLGCFVTH 361

Query: 364 CGWNSSLESIACGVSVVAFPQWTDQSTNARIIEESSKSGVRLRMNDDG-IVERGEIKKCL 423
           CGWNS+LESI+ GV +VAFPQW DQ TNA+++E+  ++GVR+R N++G +V+  EI++C+
Sbjct: 362 CGWNSTLESISFGVPMVAFPQWFDQGTNAKLMEDVWRTGVRVRANEEGSVVDGDEIRRCI 421

Query: 424 DLVMGDGVEGESLRRNALKWKDLANHATTKGGSSNANVSSFLDFV 449
           + VM  G +   LR +A KWKDLA  A  + GSS  N+  FLD V
Sbjct: 422 EEVMDGGEKSRKLRESAGKWKDLARKAMEEDGSSVNNLKVFLDEV 458

BLAST of Cp4.1LG17g03860 vs. Swiss-Prot
Match: U75D1_ARATH (UDP-glycosyltransferase 75D1 OS=Arabidopsis thaliana GN=UGT75D1 PE=2 SV=2)

HSP 1 Score: 349.7 bits (896), Expect = 4.6e-95
Identity = 193/469 (41.15%), Positives = 283/469 (60.34%), Query Frame = 1

Query: 6   PHVLFVALATHGHFNPGLHLANTLSH--GGVNVTFATSSSVRHRLPKLPSSPN----LYF 65
           PH LFV     GH NP L LA  L+    G  VTFA S S  +R  ++ S+ N    L F
Sbjct: 12  PHFLFVTFPAQGHINPSLELAKRLAGTISGARVTFAASISAYNR--RMFSTENVPETLIF 71

Query: 66  SFFSDGQDDGFKLG--------NDVVPFLSQFERQASIAIRQTILKSKANGKPFSFVVYS 125
           + +SDG DDGFK          +    F+S+  R+    + + I  ++   +PF+ VVY+
Sbjct: 72  ATYSDGHDDGFKSSAYSDKSRQDATGNFMSEMRRRGKETLTELIEDNRKQNRPFTCVVYT 131

Query: 126 LLTPWMAEVARSFDLPSALFWNQSAAVFAIYYHFFNGYKDIIGNAFSHPSIKIHLPGLPL 185
           +L  W+AE+AR F LPSAL W Q   VF+I+YH+FNGY+D I    + PS  I LP LPL
Sbjct: 132 ILLTWVAELAREFHLPSALLWVQPVTVFSIFYHYFNGYEDAISEMANTPSSSIKLPSLPL 191

Query: 186 LDSKQLPSLCNPANSNSFVLKLYETHFHVIKQEPHLKILINTFDDLEHDVLRAI--SVEL 245
           L  + +PS    +N  +F+L  +      +K+E + KILINTF +LE + + ++  + ++
Sbjct: 192 LTVRDIPSFIVSSNVYAFLLPAFREQIDSLKEEINPKILINTFQELEPEAMSSVPDNFKI 251

Query: 246 IPIGPVLPSLSFFHPTNSMDLKPYIGWLNSKPKSSVVYVSFGSIAAVSTAQLEEIARGLL 305
           +P+GP+L   + F          YI WL++K  SSV+YVSFG++A +S  QL E+ + L+
Sbjct: 252 VPVGPLLTLRTDFSSRGE-----YIEWLDTKADSSVLYVSFGTLAVLSKKQLVELCKALI 311

Query: 306 DSKLPFLWVMRKRS--------DGDEGDLVSCREELVAKGKIVTWCSQLEVLLHPSVGCF 365
            S+ PFLWV+  +S        + +E  + S REEL   G +V+WC Q  VL H S+GCF
Sbjct: 312 QSRRPFLWVITDKSYRNKEDEQEKEEDCISSFREELDEIGMVVSWCDQFRVLNHRSIGCF 371

Query: 366 LTHCGWNSSLESIACGVSVVAFPQWTDQSTNARIIEESSKSGVRL--RMNDDG--IVERG 425
           +THCGWNS+LES+  GV VVAFPQW DQ  NA+++E+  K+GVR+  +  ++G  +V+  
Sbjct: 372 VTHCGWNSTLESLVSGVPVVAFPQWNDQMMNAKLLEDCWKTGVRVMEKKEEEGVVVVDSE 431

Query: 426 EIKKCLDLVMGDGVEGESLRRNALKWKDLANHATTKGGSSNANVSSFLD 447
           EI++C++ VM D  + E  R NA +WKDLA  A  +GGSS  ++ +F+D
Sbjct: 432 EIRRCIEEVMED--KAEEFRGNATRWKDLAAEAVREGGSSFNHLKAFVD 471

BLAST of Cp4.1LG17g03860 vs. Swiss-Prot
Match: 5GT2_PERFR (Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 2 OS=Perilla frutescens GN=PF3R6 PE=2 SV=1)

HSP 1 Score: 349.0 bits (894), Expect = 7.9e-95
Identity = 201/445 (45.17%), Positives = 267/445 (60.00%), Query Frame = 1

Query: 8   VLFVALATHGHFNPGLHLANTLSHGGVNVTFATSSSVRHRLPKLPSS-----PNLYFSFF 67
           VL       GH NP L  A  L   G +VTF TS     R+    S+     P L F  F
Sbjct: 6   VLLATFPAQGHINPALQFAKRLLKAGTDVTFFTSVYAWRRMANTASAAAGNPPGLDFVAF 65

Query: 68  SDGQDDGFKLGNDVVPFLSQFERQASIAIRQTILKSKANGKPFSFVVYSLLTPWMAEVAR 127
           SDG DDG K G D   ++S+ + + S A+R  +L    N    +FVVYS L  W AEVAR
Sbjct: 66  SDGYDDGLKPGGDGKRYMSEMKARGSEALRNLLL----NNDDVTFVVYSHLFAWAAEVAR 125

Query: 128 SFDLPSALFWNQSAAVFAIYYHFFNGYKDIIGNAFSHPSIKIHLPGLPLLDSKQLPSLCN 187
              +P+AL W + A V  IY+ +FNGY D I       S +I LP LP L+ + LP+   
Sbjct: 126 LSHVPTALLWVEPATVLCIYHFYFNGYADEIDAG----SNEIQLPRLPSLEQRSLPTFLL 185

Query: 188 PANSNSFVLKLYETHFHVIKQEPHLKILINTFDDLEHDVLRAIS-VELIPIGPVLPS--L 247
           PA    F L + E     +  E   K+L+NTFD LE D L AI   ELI IGP++PS  L
Sbjct: 186 PATPERFRLMMKEK-LETLDGEEKAKVLVNTFDALEPDALTAIDRYELIGIGPLIPSAFL 245

Query: 248 SFFHPTNSM---DL------KPYIGWLNSKPKSSVVYVSFGSIAAVSTAQLEEIARGLLD 307
               P+ +    DL         + WLNSKPKSSVVYVSFGS+     AQ+EEI +GLL 
Sbjct: 246 DGEDPSETSYGGDLFEKSEENNCVEWLNSKPKSSVVYVSFGSVLRFPKAQMEEIGKGLLA 305

Query: 308 SKLPFLWVMRKRSDGD-----EGDLVSCREELVAKGKIVTWCSQLEVLLHPSVGCFLTHC 367
              PFLW++R++ + D     E + +SC  EL   GKIV+WCSQLEVL HP++GCF+THC
Sbjct: 306 CGRPFLWMIREQKNDDGEEEEEEEELSCIGELKKMGKIVSWCSQLEVLAHPALGCFVTHC 365

Query: 368 GWNSSLESIACGVSVVAFPQWTDQSTNARIIEESSKSGVRLRMNDDGIVERGEIKKCLDL 427
           GWNS++ES++CG+ VVA PQW DQ+TNA++IE++  +GVR+RMN+ G V+  EI++C+++
Sbjct: 366 GWNSAVESLSCGIPVVAVPQWFDQTTNAKLIEDAWGTGVRVRMNEGGGVDGCEIERCVEM 425

Query: 428 VMGDGVEGESLRRNALKWKDLANHA 431
           VM  G + + +R NA+KWK LA  A
Sbjct: 426 VMDGGDKTKLVRENAIKWKTLARQA 441

BLAST of Cp4.1LG17g03860 vs. TrEMBL
Match: A0A0A0KD49_CUCSA (Glycosyltransferase OS=Cucumis sativus GN=Csa_6G007450 PE=3 SV=1)

HSP 1 Score: 663.7 bits (1711), Expect = 1.6e-187
Identity = 328/460 (71.30%), Positives = 375/460 (81.52%), Query Frame = 1

Query: 6   PHVLFVALATHGHFNPGLHLANTLSHGGVNVTFATSSSVRHRLPKLPSSPNLYFSFFSDG 65
           PHVLFVALATHGHFNPGLH AN LSHGG++VTFATSSSV  R+PKLPSSP L F+FFSDG
Sbjct: 4   PHVLFVALATHGHFNPGLHFANILSHGGLHVTFATSSSVFRRVPKLPSSPRLSFAFFSDG 63

Query: 66  QDDGFKLGNDVVPFLSQFERQASIAIRQTILKSKANGKPFSFVVYSLLTPWMAEVARSFD 125
           QDDGFK G+DVVPFLSQFE QAS AI   ILKSKA+GKP +FV+YSLLTPWMA VARSFD
Sbjct: 64  QDDGFKPGDDVVPFLSQFELQASRAIHDIILKSKASGKPITFVLYSLLTPWMANVARSFD 123

Query: 126 LPSALFWNQSAAVFAIYYHFFNGYKDIIGNAFSHPSIKIHLPGLPLLDSKQLPSLCNPAN 185
           LP+ALFWNQSAAVFAIYYHFFNGY+++I N FSHP I I+LPGL  L+SKQLPSLCNP N
Sbjct: 124 LPTALFWNQSAAVFAIYYHFFNGYREVIQNCFSHPCININLPGLTSLNSKQLPSLCNPVN 183

Query: 186 SNSFVLKLYETHFHVIKQEPHLKILINTFDDLEHDVLRAISV-ELIPIGPVLPSLSF--- 245
           SNSF+LKL+E+HF V+KQEPHLKILIN+FD+LEHDV RA ++  LIPIGPVLP       
Sbjct: 184 SNSFILKLFESHFQVLKQEPHLKILINSFDELEHDVFRANNMGNLIPIGPVLPIKCIEQM 243

Query: 246 ---------------FHPTNSMDLKPYIGWLNSKPKSSVVYVSFGSIAAVSTAQLEEIAR 305
                          F   NS D   Y  WLNSKP+SSVVY+SFGSIAAVS AQLEEI R
Sbjct: 244 NNEIFLDAFRVAPISFSLHNSQDESKYHSWLNSKPRSSVVYLSFGSIAAVSKAQLEEIGR 303

Query: 306 GLLDSKLPFLWVMRKRSDGDEGDLVSCREELVAKGKIVTWCSQLEVLLHPSVGCFLTHCG 365
           GLLD    FLWVMRK S G+E D++SC +EL AKGK+V WCSQLEVL +P++GCFLTHCG
Sbjct: 304 GLLDYGGEFLWVMRKMSHGNERDMLSCLDELEAKGKVVAWCSQLEVLSNPAIGCFLTHCG 363

Query: 366 WNSSLESIACGVSVVAFPQWTDQSTNARIIEESSKSGVRLRMNDDGIVERGEIKKCLDLV 425
           WNSS+ES+ CGV VVAFPQWTDQ TNA+IIE+ SKSGV+LR+N++GIVERGEIKKCL++V
Sbjct: 364 WNSSMESLVCGVPVVAFPQWTDQGTNAKIIEDLSKSGVKLRVNENGIVERGEIKKCLEMV 423

Query: 426 MGDGVEGESLRRNALKWKDLANHATTKGGSSNANVSSFLD 447
           MG G EGE  RRN  KWK+LA  A TKGGSS+ N+ +F+D
Sbjct: 424 MGKGDEGEGFRRNGKKWKELAKKAITKGGSSHLNIRNFID 463

BLAST of Cp4.1LG17g03860 vs. TrEMBL
Match: F6I4F7_VITVI (Glycosyltransferase OS=Vitis vinifera GN=VIT_05s0062g00700 PE=3 SV=1)

HSP 1 Score: 459.5 bits (1181), Expect = 4.6e-126
Identity = 235/460 (51.09%), Positives = 321/460 (69.78%), Query Frame = 1

Query: 8   VLFVALATHGHFNPGLHLANTLSHGGVNVTFATSSSVRHRLPKLPSSPNLYFSFFSDGQD 67
           +L V     GH NP L LA  L+  G +VTF TSSS   R+ K P+   L F  FSDG D
Sbjct: 5   ILLVTYPAQGHINPSLQLAKLLTRAGAHVTFVTSSSASTRMSKPPTLEGLEFVTFSDGYD 64

Query: 68  DGFKLGNDVVPFLSQFERQASIAIRQTILKSKANGKPFSFVVYSLLTPWMAEVARSFDLP 127
            GFK G+D+  F+S+ +R  S A+ + I+     G+PF+ ++Y ++ PW+AEVA+SF LP
Sbjct: 65  HGFKHGDDLQNFMSELDRLGSQALTELIVARANEGRPFTCLLYGIIIPWVAEVAQSFHLP 124

Query: 128 SALFWNQSAAVFAIYYHFFNGYKDIIGNAFSHPSIKIHLPGLPLLDSKQLPSLCNPANSN 187
           SAL W+Q+A VF IYY++FNGY ++IGN  +  S  I LPGLPLL S  LPS   P+ + 
Sbjct: 125 SALVWSQAATVFDIYYYYFNGYGELIGNKGNGSSSSIELPGLPLLSSSDLPSFLEPSKAI 184

Query: 188 SF--VLKLYETHFHVIKQEPHLKILINTFDDLEHDVLRAIS-VELIPIGPVLPSLSFFH- 247
           +F  VLK  +     + +E + ++L+N+FD LE + LRA++  +L+ IGP+LP L+F   
Sbjct: 185 AFNFVLKSLQKQLEQLNRESNPRVLVNSFDALESEALRALNKFKLMGIGPLLP-LAFLDG 244

Query: 248 --PTNSM-------DLKPYIGWLNSKPKSSVVYVSFGSIAAVSTAQLEEIARGLLDSKLP 307
             P+++        D K YI WLNSKP+SSV+YVSFGS++ +S  Q EEIARGLL S  P
Sbjct: 245 KDPSDTSFGGDLFRDSKDYIQWLNSKPESSVIYVSFGSLSVLSKQQSEEIARGLLASGRP 304

Query: 308 FLWVMRKRSDGDE---GDLVSCREELVAKGKIVTWCSQLEVLLHPSVGCFLTHCGWNSSL 367
           FLWV+R + +G+E    D +SC EEL  +G IV WCSQ+EVL HPS+GCF++HCGWNS+L
Sbjct: 305 FLWVIRAKENGEEEKEDDKLSCVEELEQQGMIVPWCSQVEVLSHPSLGCFVSHCGWNSTL 364

Query: 368 ESIACGVSVVAFPQWTDQSTNARIIEESSKSGVRLRMNDDGIVERGEIKKCLDLVMGDGV 427
           ES+ACGV VVAFPQWTDQ+TNA++IE+  K+G+R+ +N +GIVE GEIKKCL+LVMG G 
Sbjct: 365 ESLACGVPVVAFPQWTDQTTNAKLIEDVWKTGLRVMVNQEGIVEGGEIKKCLELVMGCGE 424

Query: 428 EGESLRRNALKWKDLANHATTKGGSSNANVSSFLDFVCNG 452
           +G+ +RRNA KWKDLA  A  +GGSS+ N+ +F++ +  G
Sbjct: 425 KGQEVRRNAKKWKDLAREAVKEGGSSDKNLKNFVNEIIQG 463

BLAST of Cp4.1LG17g03860 vs. TrEMBL
Match: F6I4F4_VITVI (Glycosyltransferase OS=Vitis vinifera GN=VIT_05s0062g00640 PE=3 SV=1)

HSP 1 Score: 452.2 bits (1162), Expect = 7.4e-124
Identity = 229/459 (49.89%), Positives = 305/459 (66.45%), Query Frame = 1

Query: 6   PHVLFVALATHGHFNPGLHLANTLSHGGVNVTFATSSSVRHRLPKLPSSPNLYFSFFSDG 65
           PH L V     GH NP L  A  +   G  V+FATS S   R+ K  +   L F  FSDG
Sbjct: 4   PHFLLVTFPAQGHINPALQFAKRIIRTGAQVSFATSVSAHRRMAKRSTPEGLNFVPFSDG 63

Query: 66  QDDGFKLGNDVVPFLSQFERQASIAIRQTILKSKANGKPFSFVVYSLLTPWMAEVARSFD 125
            DDGFK  +DV  ++S+ +R+ S  +R+ ++++   G+PF+ +VY+LL PW AEVAR   
Sbjct: 64  YDDGFKPTDDVQHYMSEIKRRGSETLREIVVRNADEGQPFTCIVYTLLLPWAAEVARGLG 123

Query: 126 LPSALFWNQSAAVFAIYYHFFNGYKDIIGNAFSHPSIKIHLPGLPLLDSKQLPSLCNPAN 185
           +PSAL W Q A V  IYY++FNGY D+  N  + PS  + LPGLPLL S+ LPS    +N
Sbjct: 124 VPSALLWIQPATVLDIYYYYFNGYGDVFRNISNEPSCSVELPGLPLLSSRDLPSFLVKSN 183

Query: 186 SNSFVLKLYETHFHVIKQEPHLKILINTFDDLEHDVLRAI-SVELIPIGPVLPS--LSFF 245
           + +FVL  ++     + QE   K+L+NTFD LE + LRA+  + LI IGP++PS  L   
Sbjct: 184 AYTFVLPTFQEQLEALSQETSPKVLVNTFDALEPEPLRAVDKLHLIGIGPLVPSAYLDGK 243

Query: 246 HPTNS-------MDLKPYIGWLNSKPKSSVVYVSFGSIAAVSTAQLEEIARGLLDSKLPF 305
            P+++            Y+ WLNSKPKSSVVYVSFGSI+ +S  Q E+IAR LLD   PF
Sbjct: 244 DPSDTSFGGDMFQGSDDYMEWLNSKPKSSVVYVSFGSISVLSKTQKEDIARALLDCGHPF 303

Query: 306 LWVMRKRSDGD---EGDLVSCREELVAKGKIVTWCSQLEVLLHPSVGCFLTHCGWNSSLE 365
           LWV+R   +G+   E D +SCREEL  KG IV+WCSQ+EVL HPS+GCF++HCGWNS+LE
Sbjct: 304 LWVIRAPENGEEVKEQDKLSCREELEQKGMIVSWCSQIEVLTHPSLGCFVSHCGWNSTLE 363

Query: 366 SIACGVSVVAFPQWTDQSTNARIIEESSKSGVRLRMNDDGIVERGEIKKCLDLVMGDGVE 425
           S+  GV VVAFPQWTDQ TNA++IE+  K G+R+ +N++GIVE  E K+CL++VMG G +
Sbjct: 364 SLVSGVPVVAFPQWTDQGTNAKLIEDMWKIGIRVTVNEEGIVESDEFKRCLEIVMGGGEK 423

Query: 426 GESLRRNALKWKDLANHATTKGGSSNANVSSFLDFVCNG 452
           GE +RRNA KWK+LA  A   GGSS+ N+  F+D V +G
Sbjct: 424 GEEMRRNAEKWKNLAREAVKDGGSSDKNLKGFVDEVGHG 462

BLAST of Cp4.1LG17g03860 vs. TrEMBL
Match: F6I4F8_VITVI (Glycosyltransferase OS=Vitis vinifera GN=VIT_05s0062g00710 PE=3 SV=1)

HSP 1 Score: 451.1 bits (1159), Expect = 1.6e-123
Identity = 236/460 (51.30%), Positives = 315/460 (68.48%), Query Frame = 1

Query: 6   PHVLFVALATHGHFNPGLHLANTLSHGGVNVTFATSSSVRHRLPKLPSSPNLYFSFFSDG 65
           P +L V     GH NP L LA  L   G +VTF TSSS   R+ K P+   L F  FSDG
Sbjct: 3   PQILLVTYPAQGHINPSLQLAKLLIRAGAHVTFVTSSSAGTRMSKSPTLDGLEFVTFSDG 62

Query: 66  QDDGFKLGNDVVPFLSQFERQASIAIRQTILKSKANGKPFSFVVYSLLTPWMAEVARSFD 125
            D GF  G+ +  F+S+ ER  S A+ + I+     G+PF+ ++Y +L PW+AEVARS  
Sbjct: 63  YDHGFDHGDGLQNFMSELERLGSPALTKLIMARANEGRPFTCLLYGMLIPWVAEVARSLH 122

Query: 126 LPSALFWNQSAAVFAIYYHFFNGYKDIIGNAFSHPSIKIHLPGLPLLDSKQLPSLCNPA- 185
           LPSAL W+Q AAVF IYY++FNGY ++IGN  +  S  I LPGLPL+ S  LPS   P+ 
Sbjct: 123 LPSALVWSQPAAVFDIYYYYFNGYGELIGNKGNGSSSSIELPGLPLISSSDLPSFLVPSK 182

Query: 186 -NSNSFVLKLYETHFHVIKQEPHLKILINTFDDLEHDVLRAIS-VELIPIGPVLPS--LS 245
            ++++FVLKL++     + +E + ++L+N+FD LE + LRAI+  +L+ IGP+LPS  L 
Sbjct: 183 VSAHNFVLKLHQKQLEQLNRESNPRVLVNSFDALESEALRAINKFKLMGIGPLLPSAFLD 242

Query: 246 FFHPTNSM---DL----KPYIGWLNSKPKSSVVYVSFGSIAAVSTAQLEEIARGLLDSKL 305
              P+++    DL    K YI WLNS  +SSV+YVSFGS++ +S  Q EEIARGLLDS  
Sbjct: 243 GKDPSDTSFGGDLFRGSKDYIQWLNSNAESSVIYVSFGSLSVLSKQQSEEIARGLLDSGR 302

Query: 306 PFLWVMRKRSDGDEG--DLVSCREELVAKGKIVTWCSQLEVLLHPSVGCFLTHCGWNSSL 365
           PFLWV+R + + +E   D +SC EEL   G IV WCSQ+EVL HPS+GCF++HCGWNS+L
Sbjct: 303 PFLWVIRAKENEEEEKEDKLSCVEELEQLGMIVPWCSQVEVLSHPSLGCFVSHCGWNSTL 362

Query: 366 ESIACGVSVVAFPQWTDQSTNARIIEESSKSGVRLRMNDDGIVERGEIKKCLDLVMGDGV 425
           ES+A GV VVAFPQWTDQ+TNA++IE+  K+G+R+ +N +GIVE GEIKKCL+LVMG G 
Sbjct: 363 ESLASGVPVVAFPQWTDQTTNAKLIEDVWKTGLRVMVNQEGIVEGGEIKKCLELVMGGGE 422

Query: 426 EGESLRRNALKWKDLANHATTKGGSSNANVSSFLDFVCNG 452
            G+ +R NA KWKDLA  A   GGSS+ N+ +F+D +  G
Sbjct: 423 RGQEVRSNAKKWKDLAREAVKDGGSSDKNLKNFVDEIIQG 462

BLAST of Cp4.1LG17g03860 vs. TrEMBL
Match: F6I4D5_VITVI (Glycosyltransferase OS=Vitis vinifera GN=VIT_05s0062g00350 PE=3 SV=1)

HSP 1 Score: 448.4 bits (1152), Expect = 1.1e-122
Identity = 236/456 (51.75%), Positives = 307/456 (67.32%), Query Frame = 1

Query: 5   HPHVLFVALATHGHFNPGLHLANTLSHGGVNVTFATSSSVRHRLPKLPSSPNLYFSFFSD 64
           HPH+L V L + GH NP L LA  L   G +VTF TS+S   R+ K P+   L F+ FSD
Sbjct: 2   HPHILIVTLPSQGHINPTLQLAKLLIRAGAHVTFFTSTSAGTRMSKSPNLDGLEFATFSD 61

Query: 65  GQDDGFKLGNDVVPFLSQFERQASIAIRQTILKSKANGKPFSFVVYSLLTPWMAEVARSF 124
           G D G K G+DV  F+SQ ER  S A+ + I+ S   G+PF+ ++Y +  PW+AEVA S 
Sbjct: 62  GYDHGLKQGDDVEKFMSQIERLGSQALIELIMASANEGRPFACLLYGVQIPWVAEVAHSL 121

Query: 125 DLPSALFWNQSAAVFAIYYHFFNGYKDIIGNAFSHPSIKIHLPGLPLLDSKQLPSLCNPA 184
            +PSAL W Q AAVF IYY++FNGY ++I N   HPS  I LPGLPLL++  LPS   P 
Sbjct: 122 HIPSALVWTQPAAVFDIYYYYFNGYGELIQNKGDHPSSTIELPGLPLLNNSDLPSFLIPP 181

Query: 185 NSNS--FVLKLYETHFHVIKQEPHLKILINTFDDLEHDVLRAIS-VELIPIGPVLPS--L 244
             N+  F L  ++ H  ++  E + K+LIN+FD LE + L AI+   L+ IGP++PS  L
Sbjct: 182 KGNTYKFALPGFQKHLEMLNCESNPKVLINSFDALESEALGAINKFNLMGIGPLIPSAFL 241

Query: 245 SFFHPTNSM---DL----KPYIGWLNSKPKSSVVYVSFGSIAAVSTAQLEEIARGLLDSK 304
               P+++    DL    K YI WLNSKPKSSV+YVSFGS+  +S  Q EEIARGLLD  
Sbjct: 242 DGKDPSDTSFGGDLFRSSKDYIQWLNSKPKSSVIYVSFGSLFVLSKQQSEEIARGLLDGG 301

Query: 305 LPFLWVMRKRSDGDEGDLVSCREELVAKGKIVTWCSQLEVLLHPSVGCFLTHCGWNSSLE 364
            PFLWV+R   + +E  L SC EEL  +G +V WCSQ+EVL HPS+GCF+TH GWNS+LE
Sbjct: 302 RPFLWVIRLEENEEEKTL-SCHEELERQGMMVPWCSQVEVLSHPSMGCFVTHSGWNSTLE 361

Query: 365 SIACGVSVVAFPQWTDQSTNARIIEESSKSGVRLRMNDDGIVERGEIKKCLDLVMGDGVE 424
           S+  GV VVAFPQW+DQ+TNA++IE   K+G+R  +N +GIVE  EIK+CL+LVMG G  
Sbjct: 362 SLTSGVPVVAFPQWSDQATNAKLIEVVWKTGLRAMVNQEGIVEADEIKRCLELVMGSGER 421

Query: 425 GESLRRNALKWKDLANHATTKGGSSNANVSSFLDFV 449
           GE +RRNA KWK LA  A  +GGSS+ N+ +F++ V
Sbjct: 422 GEEMRRNATKWKVLAREAVKEGGSSDKNLKNFMNEV 456

BLAST of Cp4.1LG17g03860 vs. TAIR10
Match: AT4G15550.1 (AT4G15550.1 indole-3-acetate beta-D-glucosyltransferase)

HSP 1 Score: 349.7 bits (896), Expect = 2.6e-96
Identity = 193/469 (41.15%), Positives = 283/469 (60.34%), Query Frame = 1

Query: 6   PHVLFVALATHGHFNPGLHLANTLSH--GGVNVTFATSSSVRHRLPKLPSSPN----LYF 65
           PH LFV     GH NP L LA  L+    G  VTFA S S  +R  ++ S+ N    L F
Sbjct: 12  PHFLFVTFPAQGHINPSLELAKRLAGTISGARVTFAASISAYNR--RMFSTENVPETLIF 71

Query: 66  SFFSDGQDDGFKLG--------NDVVPFLSQFERQASIAIRQTILKSKANGKPFSFVVYS 125
           + +SDG DDGFK          +    F+S+  R+    + + I  ++   +PF+ VVY+
Sbjct: 72  ATYSDGHDDGFKSSAYSDKSRQDATGNFMSEMRRRGKETLTELIEDNRKQNRPFTCVVYT 131

Query: 126 LLTPWMAEVARSFDLPSALFWNQSAAVFAIYYHFFNGYKDIIGNAFSHPSIKIHLPGLPL 185
           +L  W+AE+AR F LPSAL W Q   VF+I+YH+FNGY+D I    + PS  I LP LPL
Sbjct: 132 ILLTWVAELAREFHLPSALLWVQPVTVFSIFYHYFNGYEDAISEMANTPSSSIKLPSLPL 191

Query: 186 LDSKQLPSLCNPANSNSFVLKLYETHFHVIKQEPHLKILINTFDDLEHDVLRAI--SVEL 245
           L  + +PS    +N  +F+L  +      +K+E + KILINTF +LE + + ++  + ++
Sbjct: 192 LTVRDIPSFIVSSNVYAFLLPAFREQIDSLKEEINPKILINTFQELEPEAMSSVPDNFKI 251

Query: 246 IPIGPVLPSLSFFHPTNSMDLKPYIGWLNSKPKSSVVYVSFGSIAAVSTAQLEEIARGLL 305
           +P+GP+L   + F          YI WL++K  SSV+YVSFG++A +S  QL E+ + L+
Sbjct: 252 VPVGPLLTLRTDFSSRGE-----YIEWLDTKADSSVLYVSFGTLAVLSKKQLVELCKALI 311

Query: 306 DSKLPFLWVMRKRS--------DGDEGDLVSCREELVAKGKIVTWCSQLEVLLHPSVGCF 365
            S+ PFLWV+  +S        + +E  + S REEL   G +V+WC Q  VL H S+GCF
Sbjct: 312 QSRRPFLWVITDKSYRNKEDEQEKEEDCISSFREELDEIGMVVSWCDQFRVLNHRSIGCF 371

Query: 366 LTHCGWNSSLESIACGVSVVAFPQWTDQSTNARIIEESSKSGVRL--RMNDDG--IVERG 425
           +THCGWNS+LES+  GV VVAFPQW DQ  NA+++E+  K+GVR+  +  ++G  +V+  
Sbjct: 372 VTHCGWNSTLESLVSGVPVVAFPQWNDQMMNAKLLEDCWKTGVRVMEKKEEEGVVVVDSE 431

Query: 426 EIKKCLDLVMGDGVEGESLRRNALKWKDLANHATTKGGSSNANVSSFLD 447
           EI++C++ VM D  + E  R NA +WKDLA  A  +GGSS  ++ +F+D
Sbjct: 432 EIRRCIEEVMED--KAEEFRGNATRWKDLAAEAVREGGSSFNHLKAFVD 471

BLAST of Cp4.1LG17g03860 vs. TAIR10
Match: AT1G05530.1 (AT1G05530.1 UDP-glucosyl transferase 75B2)

HSP 1 Score: 342.8 bits (878), Expect = 3.2e-94
Identity = 196/463 (42.33%), Positives = 269/463 (58.10%), Query Frame = 1

Query: 4   SHPHVLFVALATHGHFNPGLHLANTL-SHGGVNVTFATSSSVRHR--LPKLPSSPNLYFS 63
           + PH L V     GH NP L  A  L    G  VTFAT  SV HR  +P   +  NL F 
Sbjct: 2   AQPHFLLVTFPAQGHVNPSLRFARRLIKTTGARVTFATCLSVIHRSMIPNHNNVENLSFL 61

Query: 64  FFSDGQDDG-FKLGNDVVPFLSQFERQASIAIRQTILKSKANGKPFSFVVYSLLTPWMAE 123
            FSDG DDG     +DV   L  FER    A+   I  ++    P S ++Y++L  W+ +
Sbjct: 62  TFSDGFDDGVISNTDDVQNRLVHFERNGDKALSDFIEANQNGDSPVSCLIYTILPNWVPK 121

Query: 124 VARSFDLPSALFWNQSAAVFAIYYHFFNGYKDIIGNAFSHPSIKIHLPGLPLLDSKQLPS 183
           VAR F LPS   W Q A  F IYY++  G   +              P LP L+ + LPS
Sbjct: 122 VARRFHLPSVHLWIQPAFAFDIYYNYSTGNNSVF-----------EFPNLPSLEIRDLPS 181

Query: 184 LCNPANSNSFVLKLYETHFHVIKQEPHLKILINTFDDLEHDVLRAI-SVELIPIGPVLPS 243
             +P+N+N     +Y+     +K+E + KIL+NTFD LE + L AI ++E++ +GP+LP+
Sbjct: 182 FLSPSNTNKAAQAVYQELMDFLKEESNPKILVNTFDSLEPEFLTAIPNIEMVAVGPLLPA 241

Query: 244 LSFFHPTNSMDLK------PYIGWLNSKPKSSVVYVSFGSIAAVSTAQLEEIARGLLDSK 303
             F    +  DL        Y  WL+SK +SSV+YVSFG++  +S  Q+EE+AR L++  
Sbjct: 242 EIFTGSESGKDLSRDHQSSSYTLWLDSKTESSVIYVSFGTMVELSKKQIEELARALIEGG 301

Query: 304 LPFLWVMRKRSDGD---EGD-------LVSCREELVAKGKIVTWCSQLEVLLHPSVGCFL 363
            PFLWV+  + + +   EG+       +   R EL   G IV+WCSQ+EVL H ++GCFL
Sbjct: 302 RPFLWVITDKLNREAKIEGEEETEIEKIAGFRHELEEVGMIVSWCSQIEVLRHRAIGCFL 361

Query: 364 THCGWNSSLESIACGVSVVAFPQWTDQSTNARIIEESSKSGVRLRMNDDGIVERGEIKKC 423
           THCGW+SSLES+  GV VVAFP W+DQ  NA+++EE  K+GVR+R N +G+VERGEI +C
Sbjct: 362 THCGWSSSLESLVLGVPVVAFPMWSDQPANAKLLEEIWKTGVRVRENSEGLVERGEIMRC 421

Query: 424 LDLVMGDGVEGESLRRNALKWKDLANHATTKGGSSNANVSSFL 446
           L+ VM    +   LR NA KWK LA  A  +GGSS+ NV +F+
Sbjct: 422 LEAVM--EAKSVELRENAEKWKRLATEAGREGGSSDKNVEAFV 451

BLAST of Cp4.1LG17g03860 vs. TAIR10
Match: AT1G05560.1 (AT1G05560.1 UDP-glucosyltransferase 75B1)

HSP 1 Score: 337.0 bits (863), Expect = 1.8e-92
Identity = 194/467 (41.54%), Positives = 269/467 (57.60%), Query Frame = 1

Query: 6   PHVLFVALATHGHFNPGLHLANTL-SHGGVNVTFATSSSVRHR--LPKLPSSPNLYFSFF 65
           PH L V     GH NP L  A  L    G  VTF T  SV H   +       NL F  F
Sbjct: 4   PHFLLVTFPAQGHVNPSLRFARRLIKRTGARVTFVTCVSVFHNSMIANHNKVENLSFLTF 63

Query: 66  SDGQDDGFKLGNDVVPFLSQFERQASI------AIRQTILKSKANGKPFSFVVYSLLTPW 125
           SDG DDG      +  +  + +R  ++      A+   I  +K    P + ++Y++L  W
Sbjct: 64  SDGFDDG-----GISTYEDRQKRSVNLKVNGDKALSDFIEATKNGDSPVTCLIYTILLNW 123

Query: 126 MAEVARSFDLPSALFWNQSAAVFAIYYHFFNGYKDIIGNAFSHPSIKIHLPGLPLLDSKQ 185
             +VAR F LPSAL W Q A VF IYY  F G K +             LP L  L+ + 
Sbjct: 124 APKVARRFQLPSALLWIQPALVFNIYYTHFMGNKSVF-----------ELPNLSSLEIRD 183

Query: 186 LPSLCNPANSNSFVLKLYETHFHVIKQEPHLKILINTFDDLEHDVLRAI-SVELIPIGPV 245
           LPS   P+N+N      ++     + +E   KILINTFD LE + L A  +++++ +GP+
Sbjct: 184 LPSFLTPSNTNKGAYDAFQEMMEFLIKETKPKILINTFDSLEPEALTAFPNIDMVAVGPL 243

Query: 246 LPSLSFFHPTNSM---DLKPYIGWLNSKPKSSVVYVSFGSIAAVSTAQLEEIARGLLDSK 305
           LP+  F   TN         Y  WL+SK +SSV+YVSFG++  +S  Q+EE+AR L++ K
Sbjct: 244 LPTEIFSGSTNKSVKDQSSSYTLWLDSKTESSVIYVSFGTMVELSKKQIEELARALIEGK 303

Query: 306 LPFLWVMRKRSDGD---EGD-------LVSCREELVAKGKIVTWCSQLEVLLHPSVGCFL 365
            PFLWV+  +S+ +   EG+       +   R EL   G IV+WCSQ+EVL H +VGCF+
Sbjct: 304 RPFLWVITDKSNRETKTEGEEETEIEKIAGFRHELEEVGMIVSWCSQIEVLSHRAVGCFV 363

Query: 366 THCGWNSSLESIACGVSVVAFPQWTDQSTNARIIEESSKSGVRLRMNDDGIVERGEIKKC 425
           THCGW+S+LES+  GV VVAFP W+DQ TNA+++EES K+GVR+R N DG+VERGEI++C
Sbjct: 364 THCGWSSTLESLVLGVPVVAFPMWSDQPTNAKLLEESWKTGVRVRENKDGLVERGEIRRC 423

Query: 426 LDLVMGDGVEGESLRRNALKWKDLANHATTKGGSSNANVSSFLDFVC 450
           L+ VM +  +   LR NA KWK LA  A  +GGSS+ N+ +F++ +C
Sbjct: 424 LEAVMEE--KSVELRENAKKWKRLAMEAGREGGSSDKNMEAFVEDIC 452

BLAST of Cp4.1LG17g03860 vs. TAIR10
Match: AT4G14090.1 (AT4G14090.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 328.2 bits (840), Expect = 8.2e-90
Identity = 179/450 (39.78%), Positives = 269/450 (59.78%), Query Frame = 1

Query: 6   PHVLFVALATHGHFNPGLHLANTLSHGGVNVTFATSSSVRHRLPKLPSSPNLYFSFFSDG 65
           PH L V     GH NP L LAN L H G  VT++T+ S   R+ + PS+  L F++F+DG
Sbjct: 12  PHYLLVTFPAQGHINPALQLANRLIHHGATVTYSTAVSAHRRMGEPPSTKGLSFAWFTDG 71

Query: 66  QDDGFKLGNDVVPFLSQFERQASIAIRQTI---LKSKANGKPFSFVVYSLLTPWMAEVAR 125
            DDG K   D   ++S+ +R  S A+R  I   L +    +P + V+YS+L PW++ VAR
Sbjct: 72  FDDGLKSFEDQKIYMSELKRCGSNALRDIIKANLDATTETEPITGVIYSVLVPWVSTVAR 131

Query: 126 SFDLPSALFWNQSAAVFAIYYHFFN-GYKDIIGNAFSHPSIKIHLPGLPLLDSKQLPSLC 185
            F LP+ L W + A V  IYY++FN  YK +    F    IK  LP LPL+ +  LPS  
Sbjct: 132 EFHLPTTLLWIEPATVLDIYYYYFNTSYKHL----FDVEPIK--LPKLPLITTGDLPSFL 191

Query: 186 NPANSNSFVLKLYETHFHVIKQEPHLKILINTFDDLEHDVLRAIS-VELIPIGPVLPSLS 245
            P+ +    L     H   ++ E + KIL+NTF  LEHD L ++  +++IPIGP++ S  
Sbjct: 192 QPSKALPSALVTLREHIEALETESNPKILVNTFSALEHDALTSVEKLKMIPIGPLVSSSE 251

Query: 246 FFHPTNSMDLKPYIGWLNSKPKSSVVYVSFGSIAA-VSTAQLEEIARGLLDSKLPFLWVM 305
                     + Y  WL+SK + SV+Y+S G+ A  +    +E +  G+L +  PFLW++
Sbjct: 252 GKTDLFKSSDEDYTKWLDSKLERSVIYISLGTHADDLPEKHMEALTHGVLATNRPFLWIV 311

Query: 306 RKRSDGDEGDLVSCREELVA---KGKIVTWCSQLEVLLHPSVGCFLTHCGWNSSLESIAC 365
           R+++  ++    +   EL+    +G +V WCSQ  VL H +VGCF+THCGWNS+LES+  
Sbjct: 312 REKNPEEKKK--NRFLELIRGSDRGLVVGWCSQTAVLAHCAVGCFVTHCGWNSTLESLES 371

Query: 366 GVSVVAFPQWTDQSTNARIIEESSKSGVRLRMNDDGIVERGEIKKCLDLVMGDGVEGESL 425
           GV VVAFPQ+ DQ T A+++E++ + GV++++ ++G V+  EI++CL+ VM  G E E +
Sbjct: 372 GVPVVAFPQFADQCTTAKLVEDTWRIGVKVKVGEEGDVDGEEIRRCLEKVMSGGEEAEEM 431

Query: 426 RRNALKWKDLANHATTKGGSSNANVSSFLD 447
           R NA KWK +A  A  +GG S+ N+  F+D
Sbjct: 432 RENAEKWKAMAVDAAAEGGPSDLNLKGFVD 453

BLAST of Cp4.1LG17g03860 vs. TAIR10
Match: AT3G21560.1 (AT3G21560.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 255.4 bits (651), Expect = 6.7e-68
Identity = 161/472 (34.11%), Positives = 253/472 (53.60%), Query Frame = 1

Query: 6   PHVLFVALATHGHFNPGLHLANTLSHGGVNVTFATSSS----------VRHRLPKLPSSP 65
           PHV+ V+    GH NP L L   L+  G+ +TF T+ S          ++ R+ K     
Sbjct: 11  PHVMLVSFPGQGHVNPLLRLGKLLASKGLLITFVTTESWGKKMRISNKIQDRVLKPVGKG 70

Query: 66  NLYFSFFSDG--QDDGFKLGNDVVPFLSQFERQASIAIRQTILKSKANGK-PFSFVVYSL 125
            L + FF DG  +DD     N  +      E      I+  + + K   K P + ++ + 
Sbjct: 71  YLRYDFFDDGLPEDDEASRTNLTI-LRPHLELVGKREIKNLVKRYKEVTKQPVTCLINNP 130

Query: 126 LTPWMAEVARSFDLPSALFWNQSAAVFAIYYHFFNGYKDIIGNAFSHPSIKIHLPGLPLL 185
              W+ +VA    +P A+ W QS A  A YY++ +   D      + P I + + G+PLL
Sbjct: 131 FVSWVCDVAEDLQIPCAVLWVQSCACLAAYYYYHHNLVDFPTK--TEPEIDVQISGMPLL 190

Query: 186 DSKQLPSLCNPANSNSFVLKLYETHFHVIKQ-EPHLKILINTFDDLEHDVLRAISVELIP 245
              ++PS  +P++ +S    L E     IK+      I I+TF+ LE D++  +S   +P
Sbjct: 191 KHDEIPSFIHPSSPHS---ALREVIIDQIKRLHKTFSIFIDTFNSLEKDIIDHMSTLSLP 250

Query: 246 -----IGPV----------LPSLSFFHPTNSMDLKPYIGWLNSKPKSSVVYVSFGSIAAV 305
                +GP+          +  ++   PT+     P + WL+S+P SSVVY+SFG++A +
Sbjct: 251 GVIRPLGPLYKMAKTVAYDVVKVNISEPTD-----PCMEWLDSQPVSSVVYISFGTVAYL 310

Query: 306 STAQLEEIARGLLDSKLPFLWVMRKRSDGDEGDLVSCREELVAKGKIVTWCSQLEVLLHP 365
              Q++EIA G+L++ + FLWV+R++  G   +     EE+  KGKIV WCSQ +VL HP
Sbjct: 311 KQEQIDEIAYGVLNADVTFLWVIRQQELGFNKEKHVLPEEVKGKGKIVEWCSQEKVLSHP 370

Query: 366 SVGCFLTHCGWNSSLESIACGVSVVAFPQWTDQSTNARIIEESSKSGVRLRMN--DDGIV 425
           SV CF+THCGWNS++E+++ GV  V FPQW DQ T+A  + +  K+GVRL     ++ +V
Sbjct: 371 SVACFVTHCGWNSTMEAVSSGVPTVCFPQWGDQVTDAVYMIDVWKTGVRLSRGEAEERLV 430

Query: 426 ERGEIKKCLDLVMGDGVEGESLRRNALKWKDLANHATTKGGSSNANVSSFLD 447
            R E+ + L  V   G +   L++NALKWK+ A  A  +GGSS+ N+  F++
Sbjct: 431 PREEVAERLREVT-KGEKAIELKKNALKWKEEAEAAVARGGSSDRNLEKFVE 470

BLAST of Cp4.1LG17g03860 vs. NCBI nr
Match: gi|659116590|ref|XP_008458151.1| (PREDICTED: anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 2-like [Cucumis melo])

HSP 1 Score: 674.9 bits (1740), Expect = 1.0e-190
Identity = 334/462 (72.29%), Positives = 382/462 (82.68%), Query Frame = 1

Query: 4   SHPHVLFVALATHGHFNPGLHLANTLSHGGVNVTFATSSSVRHRLPKLPSSPNLYFSFFS 63
           + PHVL VALATHGHFNPGLHLAN LSHGG++VTFATSSSV  RLPKLPSSP L F+FFS
Sbjct: 2   TEPHVLLVALATHGHFNPGLHLANILSHGGLHVTFATSSSVLCRLPKLPSSPRLSFAFFS 61

Query: 64  DGQDDGFKLGNDVVPFLSQFERQASIAIRQTILKSKANGKPFSFVVYSLLTPWMAEVARS 123
           DGQ+DGFK GNDVVPFLSQFE QAS AI   ILKSKA+GKP SFV+YSLLTPWMA VARS
Sbjct: 62  DGQEDGFKPGNDVVPFLSQFELQASRAIHDIILKSKASGKPISFVLYSLLTPWMANVARS 121

Query: 124 FDLPSALFWNQSAAVFAIYYHFFNGYKDIIGNAFSHPSIKIHLPGLPLLDSKQLPSLCNP 183
           FDLP+ALFWNQSAAVFAIYYHFFNGY+++I N FSHP I I+LPGLP L+SKQLPSLCNP
Sbjct: 122 FDLPAALFWNQSAAVFAIYYHFFNGYREVIQNCFSHPCININLPGLPSLNSKQLPSLCNP 181

Query: 184 ANSNSFVLKLYETHFHVIKQEPHLKILINTFDDLEHDVLRAISV-ELIPIGPVLPSLSF- 243
           ANSNSFVLKL+E+HF V+KQEPHLKILIN+F++LEHDV RA ++  LIPIGPVLP+ +  
Sbjct: 182 ANSNSFVLKLFESHFQVLKQEPHLKILINSFEELEHDVFRANNMANLIPIGPVLPTKTIE 241

Query: 244 -----------------FHPTNSMDLKPYIGWLNSKPKSSVVYVSFGSIAAVSTAQLEEI 303
                              P NS D   Y  WLNSKPKSSVVYVSFGSIAAVS AQLEEI
Sbjct: 242 QLNNEKNLGAFKVTPISCSPPNSRDESKYYTWLNSKPKSSVVYVSFGSIAAVSKAQLEEI 301

Query: 304 ARGLLDSKLPFLWVMRKRSDGDEGDLVSCREELVAKGKIVTWCSQLEVLLHPSVGCFLTH 363
            R LLD    FLWVMRK + G+E D++SC +EL AKGK+V WCSQLEVL +P++GCFLTH
Sbjct: 302 GRALLDYGGEFLWVMRKMAHGNEKDMLSCLDELEAKGKVVAWCSQLEVLSNPAIGCFLTH 361

Query: 364 CGWNSSLESIACGVSVVAFPQWTDQSTNARIIEESSKSGVRLRMNDDGIVERGEIKKCLD 423
           CGWNSS+ES+ CGV VVAFPQWTDQ TNA+IIE+ SKSGV+LR+N++GIVERGEIKKCL+
Sbjct: 362 CGWNSSMESLVCGVPVVAFPQWTDQGTNAKIIEDLSKSGVKLRVNENGIVERGEIKKCLE 421

Query: 424 LVMGDGVEGESLRRNALKWKDLANHATTKGGSSNANVSSFLD 447
           +VMG+G +GE  RRNA KWK+LAN ATTKGGSS  N+ +F+D
Sbjct: 422 MVMGEGDKGEGFRRNAKKWKELANKATTKGGSSYVNIRNFID 463

BLAST of Cp4.1LG17g03860 vs. NCBI nr
Match: gi|778709244|ref|XP_011656370.1| (PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Cucumis sativus])

HSP 1 Score: 663.7 bits (1711), Expect = 2.3e-187
Identity = 328/460 (71.30%), Positives = 375/460 (81.52%), Query Frame = 1

Query: 6   PHVLFVALATHGHFNPGLHLANTLSHGGVNVTFATSSSVRHRLPKLPSSPNLYFSFFSDG 65
           PHVLFVALATHGHFNPGLH AN LSHGG++VTFATSSSV  R+PKLPSSP L F+FFSDG
Sbjct: 4   PHVLFVALATHGHFNPGLHFANILSHGGLHVTFATSSSVFRRVPKLPSSPRLSFAFFSDG 63

Query: 66  QDDGFKLGNDVVPFLSQFERQASIAIRQTILKSKANGKPFSFVVYSLLTPWMAEVARSFD 125
           QDDGFK G+DVVPFLSQFE QAS AI   ILKSKA+GKP +FV+YSLLTPWMA VARSFD
Sbjct: 64  QDDGFKPGDDVVPFLSQFELQASRAIHDIILKSKASGKPITFVLYSLLTPWMANVARSFD 123

Query: 126 LPSALFWNQSAAVFAIYYHFFNGYKDIIGNAFSHPSIKIHLPGLPLLDSKQLPSLCNPAN 185
           LP+ALFWNQSAAVFAIYYHFFNGY+++I N FSHP I I+LPGL  L+SKQLPSLCNP N
Sbjct: 124 LPTALFWNQSAAVFAIYYHFFNGYREVIQNCFSHPCININLPGLTSLNSKQLPSLCNPVN 183

Query: 186 SNSFVLKLYETHFHVIKQEPHLKILINTFDDLEHDVLRAISV-ELIPIGPVLPSLSF--- 245
           SNSF+LKL+E+HF V+KQEPHLKILIN+FD+LEHDV RA ++  LIPIGPVLP       
Sbjct: 184 SNSFILKLFESHFQVLKQEPHLKILINSFDELEHDVFRANNMGNLIPIGPVLPIKCIEQM 243

Query: 246 ---------------FHPTNSMDLKPYIGWLNSKPKSSVVYVSFGSIAAVSTAQLEEIAR 305
                          F   NS D   Y  WLNSKP+SSVVY+SFGSIAAVS AQLEEI R
Sbjct: 244 NNEIFLDAFRVAPISFSLHNSQDESKYHSWLNSKPRSSVVYLSFGSIAAVSKAQLEEIGR 303

Query: 306 GLLDSKLPFLWVMRKRSDGDEGDLVSCREELVAKGKIVTWCSQLEVLLHPSVGCFLTHCG 365
           GLLD    FLWVMRK S G+E D++SC +EL AKGK+V WCSQLEVL +P++GCFLTHCG
Sbjct: 304 GLLDYGGEFLWVMRKMSHGNERDMLSCLDELEAKGKVVAWCSQLEVLSNPAIGCFLTHCG 363

Query: 366 WNSSLESIACGVSVVAFPQWTDQSTNARIIEESSKSGVRLRMNDDGIVERGEIKKCLDLV 425
           WNSS+ES+ CGV VVAFPQWTDQ TNA+IIE+ SKSGV+LR+N++GIVERGEIKKCL++V
Sbjct: 364 WNSSMESLVCGVPVVAFPQWTDQGTNAKIIEDLSKSGVKLRVNENGIVERGEIKKCLEMV 423

Query: 426 MGDGVEGESLRRNALKWKDLANHATTKGGSSNANVSSFLD 447
           MG G EGE  RRN  KWK+LA  A TKGGSS+ N+ +F+D
Sbjct: 424 MGKGDEGEGFRRNGKKWKELAKKAITKGGSSHLNIRNFID 463

BLAST of Cp4.1LG17g03860 vs. NCBI nr
Match: gi|225433624|ref|XP_002263301.1| (PREDICTED: crocetin glucosyltransferase, chloroplastic [Vitis vinifera])

HSP 1 Score: 459.5 bits (1181), Expect = 6.7e-126
Identity = 235/460 (51.09%), Positives = 321/460 (69.78%), Query Frame = 1

Query: 8   VLFVALATHGHFNPGLHLANTLSHGGVNVTFATSSSVRHRLPKLPSSPNLYFSFFSDGQD 67
           +L V     GH NP L LA  L+  G +VTF TSSS   R+ K P+   L F  FSDG D
Sbjct: 5   ILLVTYPAQGHINPSLQLAKLLTRAGAHVTFVTSSSASTRMSKPPTLEGLEFVTFSDGYD 64

Query: 68  DGFKLGNDVVPFLSQFERQASIAIRQTILKSKANGKPFSFVVYSLLTPWMAEVARSFDLP 127
            GFK G+D+  F+S+ +R  S A+ + I+     G+PF+ ++Y ++ PW+AEVA+SF LP
Sbjct: 65  HGFKHGDDLQNFMSELDRLGSQALTELIVARANEGRPFTCLLYGIIIPWVAEVAQSFHLP 124

Query: 128 SALFWNQSAAVFAIYYHFFNGYKDIIGNAFSHPSIKIHLPGLPLLDSKQLPSLCNPANSN 187
           SAL W+Q+A VF IYY++FNGY ++IGN  +  S  I LPGLPLL S  LPS   P+ + 
Sbjct: 125 SALVWSQAATVFDIYYYYFNGYGELIGNKGNGSSSSIELPGLPLLSSSDLPSFLEPSKAI 184

Query: 188 SF--VLKLYETHFHVIKQEPHLKILINTFDDLEHDVLRAIS-VELIPIGPVLPSLSFFH- 247
           +F  VLK  +     + +E + ++L+N+FD LE + LRA++  +L+ IGP+LP L+F   
Sbjct: 185 AFNFVLKSLQKQLEQLNRESNPRVLVNSFDALESEALRALNKFKLMGIGPLLP-LAFLDG 244

Query: 248 --PTNSM-------DLKPYIGWLNSKPKSSVVYVSFGSIAAVSTAQLEEIARGLLDSKLP 307
             P+++        D K YI WLNSKP+SSV+YVSFGS++ +S  Q EEIARGLL S  P
Sbjct: 245 KDPSDTSFGGDLFRDSKDYIQWLNSKPESSVIYVSFGSLSVLSKQQSEEIARGLLASGRP 304

Query: 308 FLWVMRKRSDGDE---GDLVSCREELVAKGKIVTWCSQLEVLLHPSVGCFLTHCGWNSSL 367
           FLWV+R + +G+E    D +SC EEL  +G IV WCSQ+EVL HPS+GCF++HCGWNS+L
Sbjct: 305 FLWVIRAKENGEEEKEDDKLSCVEELEQQGMIVPWCSQVEVLSHPSLGCFVSHCGWNSTL 364

Query: 368 ESIACGVSVVAFPQWTDQSTNARIIEESSKSGVRLRMNDDGIVERGEIKKCLDLVMGDGV 427
           ES+ACGV VVAFPQWTDQ+TNA++IE+  K+G+R+ +N +GIVE GEIKKCL+LVMG G 
Sbjct: 365 ESLACGVPVVAFPQWTDQTTNAKLIEDVWKTGLRVMVNQEGIVEGGEIKKCLELVMGCGE 424

Query: 428 EGESLRRNALKWKDLANHATTKGGSSNANVSSFLDFVCNG 452
           +G+ +RRNA KWKDLA  A  +GGSS+ N+ +F++ +  G
Sbjct: 425 KGQEVRRNAKKWKDLAREAVKEGGSSDKNLKNFVNEIIQG 463

BLAST of Cp4.1LG17g03860 vs. NCBI nr
Match: gi|225433620|ref|XP_002263700.1| (PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Vitis vinifera])

HSP 1 Score: 452.2 bits (1162), Expect = 1.1e-123
Identity = 229/459 (49.89%), Positives = 305/459 (66.45%), Query Frame = 1

Query: 6   PHVLFVALATHGHFNPGLHLANTLSHGGVNVTFATSSSVRHRLPKLPSSPNLYFSFFSDG 65
           PH L V     GH NP L  A  +   G  V+FATS S   R+ K  +   L F  FSDG
Sbjct: 4   PHFLLVTFPAQGHINPALQFAKRIIRTGAQVSFATSVSAHRRMAKRSTPEGLNFVPFSDG 63

Query: 66  QDDGFKLGNDVVPFLSQFERQASIAIRQTILKSKANGKPFSFVVYSLLTPWMAEVARSFD 125
            DDGFK  +DV  ++S+ +R+ S  +R+ ++++   G+PF+ +VY+LL PW AEVAR   
Sbjct: 64  YDDGFKPTDDVQHYMSEIKRRGSETLREIVVRNADEGQPFTCIVYTLLLPWAAEVARGLG 123

Query: 126 LPSALFWNQSAAVFAIYYHFFNGYKDIIGNAFSHPSIKIHLPGLPLLDSKQLPSLCNPAN 185
           +PSAL W Q A V  IYY++FNGY D+  N  + PS  + LPGLPLL S+ LPS    +N
Sbjct: 124 VPSALLWIQPATVLDIYYYYFNGYGDVFRNISNEPSCSVELPGLPLLSSRDLPSFLVKSN 183

Query: 186 SNSFVLKLYETHFHVIKQEPHLKILINTFDDLEHDVLRAI-SVELIPIGPVLPS--LSFF 245
           + +FVL  ++     + QE   K+L+NTFD LE + LRA+  + LI IGP++PS  L   
Sbjct: 184 AYTFVLPTFQEQLEALSQETSPKVLVNTFDALEPEPLRAVDKLHLIGIGPLVPSAYLDGK 243

Query: 246 HPTNS-------MDLKPYIGWLNSKPKSSVVYVSFGSIAAVSTAQLEEIARGLLDSKLPF 305
            P+++            Y+ WLNSKPKSSVVYVSFGSI+ +S  Q E+IAR LLD   PF
Sbjct: 244 DPSDTSFGGDMFQGSDDYMEWLNSKPKSSVVYVSFGSISVLSKTQKEDIARALLDCGHPF 303

Query: 306 LWVMRKRSDGD---EGDLVSCREELVAKGKIVTWCSQLEVLLHPSVGCFLTHCGWNSSLE 365
           LWV+R   +G+   E D +SCREEL  KG IV+WCSQ+EVL HPS+GCF++HCGWNS+LE
Sbjct: 304 LWVIRAPENGEEVKEQDKLSCREELEQKGMIVSWCSQIEVLTHPSLGCFVSHCGWNSTLE 363

Query: 366 SIACGVSVVAFPQWTDQSTNARIIEESSKSGVRLRMNDDGIVERGEIKKCLDLVMGDGVE 425
           S+  GV VVAFPQWTDQ TNA++IE+  K G+R+ +N++GIVE  E K+CL++VMG G +
Sbjct: 364 SLVSGVPVVAFPQWTDQGTNAKLIEDMWKIGIRVTVNEEGIVESDEFKRCLEIVMGGGEK 423

Query: 426 GESLRRNALKWKDLANHATTKGGSSNANVSSFLDFVCNG 452
           GE +RRNA KWK+LA  A   GGSS+ N+  F+D V +G
Sbjct: 424 GEEMRRNAEKWKNLAREAVKDGGSSDKNLKGFVDEVGHG 462

BLAST of Cp4.1LG17g03860 vs. NCBI nr
Match: gi|225433626|ref|XP_002263975.1| (PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Vitis vinifera])

HSP 1 Score: 451.1 bits (1159), Expect = 2.4e-123
Identity = 236/460 (51.30%), Positives = 315/460 (68.48%), Query Frame = 1

Query: 6   PHVLFVALATHGHFNPGLHLANTLSHGGVNVTFATSSSVRHRLPKLPSSPNLYFSFFSDG 65
           P +L V     GH NP L LA  L   G +VTF TSSS   R+ K P+   L F  FSDG
Sbjct: 3   PQILLVTYPAQGHINPSLQLAKLLIRAGAHVTFVTSSSAGTRMSKSPTLDGLEFVTFSDG 62

Query: 66  QDDGFKLGNDVVPFLSQFERQASIAIRQTILKSKANGKPFSFVVYSLLTPWMAEVARSFD 125
            D GF  G+ +  F+S+ ER  S A+ + I+     G+PF+ ++Y +L PW+AEVARS  
Sbjct: 63  YDHGFDHGDGLQNFMSELERLGSPALTKLIMARANEGRPFTCLLYGMLIPWVAEVARSLH 122

Query: 126 LPSALFWNQSAAVFAIYYHFFNGYKDIIGNAFSHPSIKIHLPGLPLLDSKQLPSLCNPA- 185
           LPSAL W+Q AAVF IYY++FNGY ++IGN  +  S  I LPGLPL+ S  LPS   P+ 
Sbjct: 123 LPSALVWSQPAAVFDIYYYYFNGYGELIGNKGNGSSSSIELPGLPLISSSDLPSFLVPSK 182

Query: 186 -NSNSFVLKLYETHFHVIKQEPHLKILINTFDDLEHDVLRAIS-VELIPIGPVLPS--LS 245
            ++++FVLKL++     + +E + ++L+N+FD LE + LRAI+  +L+ IGP+LPS  L 
Sbjct: 183 VSAHNFVLKLHQKQLEQLNRESNPRVLVNSFDALESEALRAINKFKLMGIGPLLPSAFLD 242

Query: 246 FFHPTNSM---DL----KPYIGWLNSKPKSSVVYVSFGSIAAVSTAQLEEIARGLLDSKL 305
              P+++    DL    K YI WLNS  +SSV+YVSFGS++ +S  Q EEIARGLLDS  
Sbjct: 243 GKDPSDTSFGGDLFRGSKDYIQWLNSNAESSVIYVSFGSLSVLSKQQSEEIARGLLDSGR 302

Query: 306 PFLWVMRKRSDGDEG--DLVSCREELVAKGKIVTWCSQLEVLLHPSVGCFLTHCGWNSSL 365
           PFLWV+R + + +E   D +SC EEL   G IV WCSQ+EVL HPS+GCF++HCGWNS+L
Sbjct: 303 PFLWVIRAKENEEEEKEDKLSCVEELEQLGMIVPWCSQVEVLSHPSLGCFVSHCGWNSTL 362

Query: 366 ESIACGVSVVAFPQWTDQSTNARIIEESSKSGVRLRMNDDGIVERGEIKKCLDLVMGDGV 425
           ES+A GV VVAFPQWTDQ+TNA++IE+  K+G+R+ +N +GIVE GEIKKCL+LVMG G 
Sbjct: 363 ESLASGVPVVAFPQWTDQTTNAKLIEDVWKTGLRVMVNQEGIVEGGEIKKCLELVMGGGE 422

Query: 426 EGESLRRNALKWKDLANHATTKGGSSNANVSSFLDFVCNG 452
            G+ +R NA KWKDLA  A   GGSS+ N+ +F+D +  G
Sbjct: 423 RGQEVRSNAKKWKDLAREAVKDGGSSDKNLKNFVDEIIQG 462
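
The percentages in each HSP summary line follow directly from the counts printed beside them (for example, 236 identical positions over an alignment length of 460). The sketch below is one possible way to recompute them from a summary line; the function name `parse_hsp_summary` and the returned dictionary keys are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Matches the HSP summary line format used in the alignments above.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling that
# appears in some raw BLAST dumps.
HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\)"
)


def parse_hsp_summary(line):
    """Return counts and recomputed percentages for one HSP summary line.

    Hypothetical helper for illustration only.
    """
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, ident_len, positive, pos_len = (int(g) for g in m.groups())
    return {
        "identical": identical,
        "positive": positive,
        "alignment_length": ident_len,
        # Percentages are count / alignment length, rounded to two decimals.
        "pct_identity": round(100.0 * identical / ident_len, 2),
        "pct_positive": round(100.0 * positive / pos_len, 2),
    }


example = "Identity = 236/460 (51.30%), Positives = 315/460 (68.48%), Query Frame = 1"
result = parse_hsp_summary(example)
print(result["pct_identity"], result["pct_positive"])
```

Applied to every hit, the recomputed values could be compared with the printed ones as a quick consistency check on a downloaded report.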

The following BLAST results are available for this feature:

Match Name   E-value   Identity (%)  Description
UGT1_GARJA   2.3e-110  47.60         Crocetin glucosyltransferase, chloroplastic OS=Gardenia jasminoides GN=UGT75L6 P...
5GT1_PERFR   2.9e-97   44.98         Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 1 OS=Perilla frutescens GN=P...
5GT_VERHY    5.0e-97   44.30         Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase OS=Verbena hybrida GN=HGT8 P...
U75D1_ARATH  4.6e-95   41.15         UDP-glycosyltransferase 75D1 OS=Arabidopsis thaliana GN=UGT75D1 PE=2 SV=2
5GT2_PERFR   7.9e-95   45.17         Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 2 OS=Perilla frutescens GN=P...

Match Name        E-value   Identity (%)  Description
A0A0A0KD49_CUCSA  1.6e-187  71.30         Glycosyltransferase OS=Cucumis sativus GN=Csa_6G007450 PE=3 SV=1
F6I4F7_VITVI      4.6e-126  51.09         Glycosyltransferase OS=Vitis vinifera GN=VIT_05s0062g00700 PE=3 SV=1
F6I4F4_VITVI      7.4e-124  49.89         Glycosyltransferase OS=Vitis vinifera GN=VIT_05s0062g00640 PE=3 SV=1
F6I4F8_VITVI      1.6e-123  51.30         Glycosyltransferase OS=Vitis vinifera GN=VIT_05s0062g00710 PE=3 SV=1
F6I4D5_VITVI      1.1e-122  51.75         Glycosyltransferase OS=Vitis vinifera GN=VIT_05s0062g00350 PE=3 SV=1

Match Name   E-value   Identity (%)  Description
AT4G15550.1  2.6e-96   41.15         indole-3-acetate beta-D-glucosyltransferase
AT1G05530.1  3.2e-94   42.33         UDP-glucosyl transferase 75B2
AT1G05560.1  1.8e-92   41.54         UDP-glucosyltransferase 75B1
AT4G14090.1  8.2e-90   39.78         UDP-Glycosyltransferase superfamily protein
AT3G21560.1  6.7e-68   34.11         UDP-Glycosyltransferase superfamily protein

Match Name                        E-value   Identity (%)  Description
gi|659116590|ref|XP_008458151.1|  1.0e-190  72.29         PREDICTED: anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 2-like [Cucumis m...
gi|778709244|ref|XP_011656370.1|  2.3e-187  71.30         PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Cucumis sativus]
gi|225433624|ref|XP_002263301.1|  6.7e-126  51.09         PREDICTED: crocetin glucosyltransferase, chloroplastic [Vitis vinifera]
gi|225433620|ref|XP_002263700.1|  1.1e-123  49.89         PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Vitis vinifera]
gi|225433626|ref|XP_002263975.1|  2.4e-123  51.30         PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Vitis vinifera]

The following terms have been associated with this gene:

Vocabulary: Biological Process
Term        Definition
GO:0008152  metabolic process

Vocabulary: Molecular Function
Term        Definition
GO:0016758  transferase activity, transferring hexosyl groups

Vocabulary: INTERPRO
Term       Definition
IPR002213  UDP_glucos_trans

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008152      metabolic process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0043231      intracellular membrane-bounded organelle
molecular_function  GO:0016758      transferase activity, transferring hexosyl groups
molecular_function  GO:0080043      quercetin 3-O-glucosyltransferase activity
molecular_function  GO:0080044      quercetin 7-O-glucosyltransferase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG17g03860.1  Cp4.1LG17g03860.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                           Source   Source Term         Source Description                               Alignment
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PANTHER  PTHR11926           GLUCOSYL/GLUCURONOSYL TRANSFERASES               coord: 1..450; score: 2.3E
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PFAM     PF00201             UDPGT                                            coord: 206..386; score: 4.2
None       No IPR available                          GENE3D   G3DSA:3.40.50.2000  -                                                coord: 261..379; score: 6.3
None       No IPR available                          PANTHER  PTHR11926:SF98      UDP-GLYCOSYLTRANSFERASE 75B1-RELATED             coord: 1..450; score: 2.3E
None       No IPR available                          unknown  SSF53756            UDP-Glycosyltransferase/glycogen phosphorylase   coord: 4..449; score: 2.09E
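
Each alignment entry above gives a `coord: start..end` range on the protein. As a rough sketch, the length of each matched region can be derived from those ranges, assuming the coordinates are 1-based and inclusive (the usual InterProScan convention, but an assumption here). The helper name `span_length` and the `coords` dictionary are illustrative only.

```python
import re

# Matches "coord: 206..386" style ranges as they appear in the annotation table.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(text):
    """Return (start, end, length) for a 'coord: start..end' string.

    Assumes 1-based, inclusive coordinates, so length = end - start + 1.
    """
    m = COORD_RE.search(text)
    if m is None:
        raise ValueError("no coord range found in %r" % text)
    start, end = int(m.group(1)), int(m.group(2))
    return start, end, end - start + 1


# Ranges copied from the table above, keyed by source term.
coords = {
    "PTHR11926": "coord: 1..450",
    "PF00201": "coord: 206..386",
    "G3DSA:3.40.50.2000": "coord: 261..379",
    "SSF53756": "coord: 4..449",
}

lengths = {term: span_length(text)[2] for term, text in coords.items()}
for term, n in lengths.items():
    print(term, n)
```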

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene             Organism                   Block
Cp4.1LG17g03860  Cucurbita pepo (Zucchini)  cpecpeB159
Cp4.1LG17g03860  Cucurbita maxima (Rimu)    cmacpeB378
Cp4.1LG17g03860  Cucurbita maxima (Rimu)    cmacpeB916
Cp4.1LG17g03860  Cucurbita moschata (Rifu)  cmocpeB341
Cp4.1LG17g03860  Cucurbita moschata (Rifu)  cmocpeB853
Cp4.1LG17g03860  Silver-seed gourd          carcpeB0876
Cp4.1LG17g03860  Silver-seed gourd          carcpeB0851