Cp4.1LG17g03950 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG17g03950
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionGlycosyltransferase
LocationCp4.1LG17 : 2856181 .. 2857906 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGACAACACAGCACCCCACCGCCACCGTGTGCTGTTGATAACATACTCTGCTCAAGGACACATCAACCCCGCCCTCGAATTCGCCAAACGCCTAACGCGCCGCCGCATTGATGTCACCTTCGTCACCTCTCTCTCCGCCTATCGCCGCATGGGCAAAACCCCAACGCTGCCTCACGTGTCATTCGCCTCCTTCTCCGATGGCTATGACGACGGTTTCAAACAAGGGGACGACATCAATCATTTCATGTCGGAGCTGGAGCGACGTGGGTCTCAAGCTATTAAGGATATGATTGTGGCGGGTGTTGAACAAGGCCAACCCTTCACTTGCATTGTTTATTCTATACTCCTCCCATGGGTGGCTATAGTGGCGCGTTCCCTCCACCTTCCGGCGATTCTTCTTTGGATTCAACCGGCTATTGTTTTTGCTTTGTACTACTATTACAACTATGGTTATCATGACATAATTCAAAGCGTATTCGATGATCCGTTGGCAACTATTCAACTACCTGGCCTTCCATTGTTAACGGCTCGTGATCTTCCTTCGTTCTTTGGTTCTTCAGATGCTTATGAATTTGCTCTCCCAATATTTCGGAGGCAATTTGAATTACTCGAGCAAGAGACCAATCCAATGGTTGTAATCAACACATTCGACGAATTGGAGCACGACGCGCTTAGAGCGATTAGCAAGTTCCATTTGATACCCATCGGACCATTGATTCCATCAGAAGCCTCCTCCCGATGTGATCTGTTTCAATCTACTACCAGGTATGATATTGTCCATTTTGAACATAAACTCGATACTAATACAGTTGTATTCCTTACCTGTAGTTTTAAGTTTATGGGTTAGGGTTGAATTTAAATTTTGTTTTGTTCGTATTCTTTTAACAGTTATATCGACTGGTTGAACTCAAAACCTAAAGGGTCTGTAATTTACGTATCGTCGGGAAGTATATCAACGTTATCGAAGCACCAAAAAGAGGAGATAGCAAGAGGGTTATTAAGTTGTGGACGACCATTTTTGTGGGTGATTCGAGACATTGAAGAAGTAAATACTTTGAGTTGTAGGGAGGAATTGGAAGGTTTAGGAAAGATAGTGTCATGGTGTTCACAAATAGAGGTTCTATCAAGGCCAGCGACAGGATGTTTTCTAACGCATTGTGGATGGAATTCGACGTTGGAGAGTTTGGTATGCGGGGTACCGGTGGTGGTGTTTCCACAGTGGTCGGATCAGGGGACGAATGCGAAGATCATTCAAGACATGTCGGAGACGGGAGTGAGGTTGGAGGTGGGGATGGATGGCGTGGTTAAGCGAGAGGAGATAAAAAGATGCTTGGAGTTGGTGATGGGAGATTCAAAGAAAGGAGAAGAGATAAGGAAGAATGTGGTGAAATGGAAGGAGTTGGCTAAGGGAGCCACCGCCCACGGCGGTTCTTCATACTCAAACTTCAAGGCTTTTGTGGACCAAGTTTGTCCTTAATTTCTATAAGCTTTGAGAGTGATAAATTCCTAGGAGAGGGGAAGAGAAGAGGTGAAAGTTTATGCCACGTGAAAATTATGAGAAAATACAATCCACCCCTTGTTGAATAGTAGATGGTTTTTTTCATTTTTTCGGTAAGAGTATCTCATAAGTTTCTAAATAATTAGTTTGGTGTATAACAGGTTAATCTTATTGATACGACCCAATATTTTACTACAAACCATGACTCAAAACGACGATGAAATGC

mRNA sequence

ATGGACAACACAGCACCCCACCGCCACCGTGTGCTGTTGATAACATACTCTGCTCAAGGACACATCAACCCCGCCCTCGAATTCGCCAAACGCCTAACGCGCCGCCGCATTGATGTCACCTTCGTCACCTCTCTCTCCGCCTATCGCCGCATGGGCAAAACCCCAACGCTGCCTCACGTGTCATTCGCCTCCTTCTCCGATGGCTATGACGACGGTTTCAAACAAGGGGACGACATCAATCATTTCATGTCGGAGCTGGAGCGACGTGGGTCTCAAGCTATTAAGGATATGATTGTGGCGGGTGTTGAACAAGGCCAACCCTTCACTTGCATTGTTTATTCTATACTCCTCCCATGGGTGGCTATAGTGGCGCGTTCCCTCCACCTTCCGGCGATTCTTCTTTGGATTCAACCGGCTATTGTTTTTGCTTTGTACTACTATTACAACTATGGTTATCATGACATAATTCAAAGCGTATTCGATGATCCGTTGGCAACTATTCAACTACCTGGCCTTCCATTGTTAACGGCTCGTGATCTTCCTTCGTTCTTTGGTTCTTCAGATGCTTATGAATTTGCTCTCCCAATATTTCGGAGGCAATTTGAATTACTCGAGCAAGAGACCAATCCAATGGTTGTAATCAACACATTCGACGAATTGGAGCACGACGCGCTTAGAGCGATTAGCAATTATATCGACTGGTTGAACTCAAAACCTAAAGGGTCTGTAATTTACGTATCGTCGGGAAGTATATCAACGTTATCGAAGCACCAAAAAGAGGAGATAGCAAGAGGGTTATTAAGTTGTGGACGACCATTTTTGTGGGTGATTCGAGACATTGAAGAAGTAAATACTTTGAGTTGTAGGGAGGAATTGGAAGGTTTAGGAAAGATAGTGTCATGGTGTTCACAAATAGAGGTTCTATCAAGGCCAGCGACAGGATGTTTTCTAACGCATTGTGGATGGAATTCGACGTTGGAGAGTTTGGTATGCGGGGTACCGGTGGTGGTGTTTCCACAGTGGTCGGATCAGGGGACGAATGCGAAGATCATTCAAGACATGTCGGAGACGGGAGTGAGGTTGGAGGTGGGGATGGATGGCGTGGTTAAGCGAGAGGAGATAAAAAGATGCTTGGAGTTGGTGATGGGAGATTCAAAGAAAGGAGAAGAGATAAGGAAGAATGTGGTGAAATGGAAGGAGTTGGCTAAGGGAGCCACCGCCCACGGCGGTTCTTCATACTCAAACTTCAAGGCTTTTGTGGACCAAGTTTGTCCTTAATTTCTATAAGCTTTGAGAGTGATAAATTCCTAGGAGAGGGGAAGAGAAGAGGTGAAAGTTTATGCCACGTGAAAATTATGAGAAAATACAATCCACCCCTTGTTGAATAGTAGATGGTTTTTTTCATTTTTTCGGTAAGAGTATCTCATAAGTTTCTAAATAATTAGTTTGGTGTATAACAGGTTAATCTTATTGATACGACCCAATATTTTACTACAAACCATGACTCAAAACGACGATGAAATGC

Coding sequence (CDS)

ATGGACAACACAGCACCCCACCGCCACCGTGTGCTGTTGATAACATACTCTGCTCAAGGACACATCAACCCCGCCCTCGAATTCGCCAAACGCCTAACGCGCCGCCGCATTGATGTCACCTTCGTCACCTCTCTCTCCGCCTATCGCCGCATGGGCAAAACCCCAACGCTGCCTCACGTGTCATTCGCCTCCTTCTCCGATGGCTATGACGACGGTTTCAAACAAGGGGACGACATCAATCATTTCATGTCGGAGCTGGAGCGACGTGGGTCTCAAGCTATTAAGGATATGATTGTGGCGGGTGTTGAACAAGGCCAACCCTTCACTTGCATTGTTTATTCTATACTCCTCCCATGGGTGGCTATAGTGGCGCGTTCCCTCCACCTTCCGGCGATTCTTCTTTGGATTCAACCGGCTATTGTTTTTGCTTTGTACTACTATTACAACTATGGTTATCATGACATAATTCAAAGCGTATTCGATGATCCGTTGGCAACTATTCAACTACCTGGCCTTCCATTGTTAACGGCTCGTGATCTTCCTTCGTTCTTTGGTTCTTCAGATGCTTATGAATTTGCTCTCCCAATATTTCGGAGGCAATTTGAATTACTCGAGCAAGAGACCAATCCAATGGTTGTAATCAACACATTCGACGAATTGGAGCACGACGCGCTTAGAGCGATTAGCAATTATATCGACTGGTTGAACTCAAAACCTAAAGGGTCTGTAATTTACGTATCGTCGGGAAGTATATCAACGTTATCGAAGCACCAAAAAGAGGAGATAGCAAGAGGGTTATTAAGTTGTGGACGACCATTTTTGTGGGTGATTCGAGACATTGAAGAAGTAAATACTTTGAGTTGTAGGGAGGAATTGGAAGGTTTAGGAAAGATAGTGTCATGGTGTTCACAAATAGAGGTTCTATCAAGGCCAGCGACAGGATGTTTTCTAACGCATTGTGGATGGAATTCGACGTTGGAGAGTTTGGTATGCGGGGTACCGGTGGTGGTGTTTCCACAGTGGTCGGATCAGGGGACGAATGCGAAGATCATTCAAGACATGTCGGAGACGGGAGTGAGGTTGGAGGTGGGGATGGATGGCGTGGTTAAGCGAGAGGAGATAAAAAGATGCTTGGAGTTGGTGATGGGAGATTCAAAGAAAGGAGAAGAGATAAGGAAGAATGTGGTGAAATGGAAGGAGTTGGCTAAGGGAGCCACCGCCCACGGCGGTTCTTCATACTCAAACTTCAAGGCTTTTGTGGACCAAGTTTGTCCTTAA

Protein sequence

MDNTAPHRHRVLLITYSAQGHINPALEFAKRLTRRRIDVTFVTSLSAYRRMGKTPTLPHVSFASFSDGYDDGFKQGDDINHFMSELERRGSQAIKDMIVAGVEQGQPFTCIVYSILLPWVAIVARSLHLPAILLWIQPAIVFALYYYYNYGYHDIIQSVFDDPLATIQLPGLPLLTARDLPSFFGSSDAYEFALPIFRRQFELLEQETNPMVVINTFDELEHDALRAISNYIDWLNSKPKGSVIYVSSGSISTLSKHQKEEIARGLLSCGRPFLWVIRDIEEVNTLSCREELEGLGKIVSWCSQIEVLSRPATGCFLTHCGWNSTLESLVCGVPVVVFPQWSDQGTNAKIIQDMSETGVRLEVGMDGVVKREEIKRCLELVMGDSKKGEEIRKNVVKWKELAKGATAHGGSSYSNFKAFVDQVCP
BLAST of Cp4.1LG17g03950 vs. Swiss-Prot
Match: UGT1_GARJA (Crocetin glucosyltransferase, chloroplastic OS=Gardenia jasminoides GN=UGT75L6 PE=1 SV=1)

HSP 1 Score: 462.2 bits (1188), Expect = 6.0e-129
Identity = 243/460 (52.83%), Postives = 305/460 (66.30%), Query Frame = 1

Query: 8   RHRVLLITYSAQGHINPALEFAKRLTRRRIDVTFVTSLSAYRRMGKTP--TLPHVSFASF 67
           RH VLLITY AQGHINPAL+FA+RL R  I VT  TS+ A  RM K+   T   ++FA+F
Sbjct: 5   RH-VLLITYPAQGHINPALQFAQRLLRMGIQVTLATSVYALSRMKKSSGSTPKGLTFATF 64

Query: 68  SDGYDDGFK-QGDDINHFMSELERRGSQAIKDMIVAGVEQGQPFTCIVYSILLPWVAIVA 127
           SDGYDDGF+ +G D   +MS L ++GS  ++++I    +QG P TC+VY++LLPW A VA
Sbjct: 65  SDGYDDGFRPKGVDHTEYMSSLAKQGSNTLRNVINTSADQGCPVTCLVYTLLLPWAATVA 124

Query: 128 RSLHLPAILLWIQPAIVFALYYYYNYGYHDIIQSVFDDPLATIQLPGLPLLTARDLPSFF 187
           R  H+P+ LLWIQP  V  +YYYY  GY D +++  +DP  +IQ PGLP + A+DLPSF 
Sbjct: 125 RECHIPSALLWIQPVAVMDIYYYYFRGYEDDVKNNSNDPTWSIQFPGLPSMKAKDLPSFI 184

Query: 188 --GSSDAYEFALPIFRRQFELLEQETNPMVVINTFDELEHDALRAISNYI---------- 247
              S + Y FALP F++Q E L++E  P V++NTFD LE  AL+AI +Y           
Sbjct: 185 LPSSDNIYSFALPTFKKQLETLDEEERPKVLVNTFDALEPQALKAIESYNLIAIGPLTPS 244

Query: 248 -------------------------DWLNSKPKGSVIYVSSGSISTLSKHQKEEIARGLL 307
                                    +WLNS+P GSV+YVS GS+ TL K Q EEIARGLL
Sbjct: 245 AFLDGKDPSETSFSGDLFQKSKDYKEWLNSRPAGSVVYVSFGSLLTLPKQQMEEIARGLL 304

Query: 308 SCGRPFLWVIRDIE------EVNTLSCREELEGLGKIVSWCSQIEVLSRPATGCFLTHCG 367
             GRPFLWVIR  E      E + L C EELE  G IV WCSQIEVL+ P+ GCF+THCG
Sbjct: 305 KSGRPFLWVIRAKENGEEEKEEDRLICMEELEEQGMIVPWCSQIEVLTHPSLGCFVTHCG 364

Query: 368 WNSTLESLVCGVPVVVFPQWSDQGTNAKIIQDMSETGVRLEVGMDGVVKREEIKRCLELV 422
           WNSTLE+LVCGVPVV FP W+DQGTNAK+I+D+ ETGVR+    DG V+ +EIKRC+E V
Sbjct: 365 WNSTLETLVCGVPVVAFPHWTDQGTNAKLIEDVWETGVRVVPNEDGTVESDEIKRCIETV 424

BLAST of Cp4.1LG17g03950 vs. Swiss-Prot
Match: U75D1_ARATH (UDP-glycosyltransferase 75D1 OS=Arabidopsis thaliana GN=UGT75D1 PE=2 SV=2)

HSP 1 Score: 408.3 bits (1048), Expect = 1.0e-112
Identity = 223/469 (47.55%), Postives = 296/469 (63.11%), Query Frame = 1

Query: 4   TAPHRHRVLLITYSAQGHINPALEFAKRL--TRRRIDVTFVTSLSAY-RRMGKTPTLPH- 63
           T PH    L +T+ AQGHINP+LE AKRL  T     VTF  S+SAY RRM  T  +P  
Sbjct: 10  TGPH---FLFVTFPAQGHINPSLELAKRLAGTISGARVTFAASISAYNRRMFSTENVPET 69

Query: 64  VSFASFSDGYDDGFKQG--------DDINHFMSELERRGSQAIKDMIVAGVEQGQPFTCI 123
           + FA++SDG+DDGFK          D   +FMSE+ RRG + + ++I    +Q +PFTC+
Sbjct: 70  LIFATYSDGHDDGFKSSAYSDKSRQDATGNFMSEMRRRGKETLTELIEDNRKQNRPFTCV 129

Query: 124 VYSILLPWVAIVARSLHLPAILLWIQPAIVFALYYYYNYGYHDIIQSVFDDPLATIQLPG 183
           VY+ILL WVA +AR  HLP+ LLW+QP  VF+++Y+Y  GY D I  + + P ++I+LP 
Sbjct: 130 VYTILLTWVAELAREFHLPSALLWVQPVTVFSIFYHYFNGYEDAISEMANTPSSSIKLPS 189

Query: 184 LPLLTARDLPSFFGSSDAYEFALPIFRRQFELLEQETNPMVVINTFDELEHDALRAI--- 243
           LPLLT RD+PSF  SS+ Y F LP FR Q + L++E NP ++INTF ELE +A+ ++   
Sbjct: 190 LPLLTVRDIPSFIVSSNVYAFLLPAFREQIDSLKEEINPKILINTFQELEPEAMSSVPDN 249

Query: 244 -------------------SNYIDWLNSKPKGSVIYVSSGSISTLSKHQKEEIARGLLSC 303
                                YI+WL++K   SV+YVS G+++ LSK Q  E+ + L+  
Sbjct: 250 FKIVPVGPLLTLRTDFSSRGEYIEWLDTKADSSVLYVSFGTLAVLSKKQLVELCKALIQS 309

Query: 304 GRPFLWVIRD-----------IEEVNTLSCREELEGLGKIVSWCSQIEVLSRPATGCFLT 363
            RPFLWVI D            EE    S REEL+ +G +VSWC Q  VL+  + GCF+T
Sbjct: 310 RRPFLWVITDKSYRNKEDEQEKEEDCISSFREELDEIGMVVSWCDQFRVLNHRSIGCFVT 369

Query: 364 HCGWNSTLESLVCGVPVVVFPQWSDQGTNAKIIQDMSETGVRL-----EVGMDGVVKREE 423
           HCGWNSTLESLV GVPVV FPQW+DQ  NAK+++D  +TGVR+     E G+  VV  EE
Sbjct: 370 HCGWNSTLESLVSGVPVVAFPQWNDQMMNAKLLEDCWKTGVRVMEKKEEEGV-VVVDSEE 429

BLAST of Cp4.1LG17g03950 vs. Swiss-Prot
Match: 5GT1_PERFR (Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 1 OS=Perilla frutescens GN=PF3R4 PE=1 SV=1)

HSP 1 Score: 390.6 bits (1002), Expect = 2.2e-107
Identity = 214/464 (46.12%), Postives = 278/464 (59.91%), Query Frame = 1

Query: 8   RHRVLLITYSAQGHINPALEFAKRLTRRRIDVTFVTSLSAYRRMGKTPTL-----PHVSF 67
           R RVLL T+ AQGHINPAL+FAKRL +   DVTF TS+ A+RRM  T +      P + F
Sbjct: 3   RRRVLLATFPAQGHINPALQFAKRLLKAGTDVTFFTSVYAWRRMANTASAAAGNPPGLDF 62

Query: 68  ASFSDGYDDGFKQGDDINHFMSELERRGSQAIKDMIVAGVEQGQPFTCIVYSILLPWVAI 127
            +FSDGYDDG K   D   +MSE++ RGS+A++++++         T +VYS L  W A 
Sbjct: 63  VAFSDGYDDGLKPCGDGKRYMSEMKARGSEALRNLLL----NNHDVTFVVYSHLFAWAAE 122

Query: 128 VARSLHLPAILLWIQPAIVFALYYYYNYGYHDIIQSVFDDPLATIQLPGLPLLTARDLPS 187
           VAR   +P+ LLW++PA V  +YY+Y  GY D I +  D+    IQLP LP L  R LP+
Sbjct: 123 VARESQVPSALLWVEPATVLCIYYFYFNGYADEIDAGSDE----IQLPRLPPLEQRSLPT 182

Query: 188 FFGSSDAYEFALPIFRRQFELLEQETNPMVVINTFDELEHDALRAISNY----------- 247
           F        F L + + + E L+ E    V++NTFD LE DAL AI  Y           
Sbjct: 183 FLLPETPERFRL-MMKEKLETLDGEEKAKVLVNTFDALEPDALTAIDRYELIGIGPLIPS 242

Query: 248 --------------------------IDWLNSKPKGSVIYVSSGSISTLSKHQKEEIARG 307
                                     ++WL++KPK SV+YVS GS+    K Q EEI +G
Sbjct: 243 AFLDGGDPSETSYGGDLFEKSEENNCVEWLDTKPKSSVVYVSFGSVLRFPKAQMEEIGKG 302

Query: 308 LLSCGRPFLWVIRD------IEEVNTLSCREELEGLGKIVSWCSQIEVLSRPATGCFLTH 367
           LL+CGRPFLW+IR+       EE   LSC  EL+ +GKIVSWCSQ+EVL+ PA GCF+TH
Sbjct: 303 LLACGRPFLWMIREQKNDDGEEEEEELSCIGELKKMGKIVSWCSQLEVLAHPALGCFVTH 362

Query: 368 CGWNSTLESLVCGVPVVVFPQWSDQGTNAKIIQDMSETGVRLEVGMDGVVKREEIKRCLE 424
           CGWNS +ESL CGVPVV  PQW DQ TNAK+I+D   TGVR+ +   G V   EI+RC+E
Sbjct: 363 CGWNSAVESLSCGVPVVAVPQWFDQTTNAKLIEDAWGTGVRVRMNEGGGVDGSEIERCVE 422

BLAST of Cp4.1LG17g03950 vs. Swiss-Prot
Match: 5GT_VERHY (Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase OS=Verbena hybrida GN=HGT8 PE=2 SV=1)

HSP 1 Score: 390.2 bits (1001), Expect = 2.9e-107
Identity = 218/461 (47.29%), Postives = 286/461 (62.04%), Query Frame = 1

Query: 8   RHRVLLITYSAQGHINPALEFAKRLTRRRIDVTFVTSLSAYRRMGKTPTLPH--VSFASF 67
           R  VLL T+ AQGHINPAL+FAKRL    I VTF TS+ A+RRM +T    +  ++F SF
Sbjct: 3   RAHVLLATFPAQGHINPALQFAKRLANADIQVTFFTSVYAWRRMSRTAAGSNGLINFVSF 62

Query: 68  SDGYDDGFKQGDDINHFMSELERRGSQAIKDMIVAGV--EQGQPFTCIVYSILLPWVAIV 127
           SDGYDDG + GDD  ++MSE++ RG +A+ D + A    ++    T +VYS L  W A V
Sbjct: 63  SDGYDDGLQPGDDGKNYMSEMKSRGIKALSDTLAANNVDQKSSKITFVVYSHLFAWAAKV 122

Query: 128 ARSLHLPAILLWIQPAIVFALYYYYNYGYHDIIQSVFDDPLATIQLPG-LPLLTARDLPS 187
           AR  HL + LLWI+PA V  ++Y+Y  GY D I +  D     I LPG LP+L  RDLPS
Sbjct: 123 AREFHLRSALLWIEPATVLDIFYFYFNGYSDEIDAGSD----AIHLPGGLPVLAQRDLPS 182

Query: 188 FFGSSDAYEFALPIFRRQFELLEQETNPMVVINTFDELEHDALRAISNY----------- 247
           F   S    F   + + + E LE E  P V++N+FD LE DAL+AI  Y           
Sbjct: 183 FLLPSTHERFR-SLMKEKLETLEGEEKPKVLVNSFDALEPDALKAIDKYEMIAIGPLIPS 242

Query: 248 --IDW-------------------------LNSKPKGSVIYVSSGSISTLSKHQKEEIAR 307
             +D                          L++ P+ SV+YVS GS    +K Q EEIAR
Sbjct: 243 AFLDGKDPSDRSFGGDLFEKGSNDDDCLEWLSTNPRSSVVYVSFGSFVNTTKSQMEEIAR 302

Query: 308 GLLSCGRPFLWVIRDIE-EVNTLSCREELEGLGKIVSWCSQIEVLSRPATGCFLTHCGWN 367
           GLL CGRPFLWV+R  E E   +SC EEL+ +GKIVSWCSQ+EVL+ P+ GCF+THCGWN
Sbjct: 303 GLLDCGRPFLWVVRVNEGEEVLISCMEELKRVGKIVSWCSQLEVLTHPSLGCFVTHCGWN 362

Query: 368 STLESLVCGVPVVVFPQWSDQGTNAKIIQDMSETGVRLEVGMDG-VVKREEIKRCLELVM 424
           STLES+  GVP+V FPQW DQGTNAK+++D+  TGVR+    +G VV  +EI+RC+E VM
Sbjct: 363 STLESISFGVPMVAFPQWFDQGTNAKLMEDVWRTGVRVRANEEGSVVDGDEIRRCIEEVM 422

BLAST of Cp4.1LG17g03950 vs. Swiss-Prot
Match: 5GT2_PERFR (Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 2 OS=Perilla frutescens GN=PF3R6 PE=2 SV=1)

HSP 1 Score: 377.1 bits (967), Expect = 2.5e-103
Identity = 207/448 (46.21%), Postives = 270/448 (60.27%), Query Frame = 1

Query: 8   RHRVLLITYSAQGHINPALEFAKRLTRRRIDVTFVTSLSAYRRMGKTPTL-----PHVSF 67
           R RVLL T+ AQGHINPAL+FAKRL +   DVTF TS+ A+RRM  T +      P + F
Sbjct: 3   RRRVLLATFPAQGHINPALQFAKRLLKAGTDVTFFTSVYAWRRMANTASAAAGNPPGLDF 62

Query: 68  ASFSDGYDDGFKQGDDINHFMSELERRGSQAIKDMIVAGVEQGQPFTCIVYSILLPWVAI 127
            +FSDGYDDG K G D   +MSE++ RGS+A++++++         T +VYS L  W A 
Sbjct: 63  VAFSDGYDDGLKPGGDGKRYMSEMKARGSEALRNLLL----NNDDVTFVVYSHLFAWAAE 122

Query: 128 VARSLHLPAILLWIQPAIVFALYYYYNYGYHDIIQSVFDDPLATIQLPGLPLLTARDLPS 187
           VAR  H+P  LLW++PA V  +Y++Y  GY D I +  ++    IQLP LP L  R LP+
Sbjct: 123 VARLSHVPTALLWVEPATVLCIYHFYFNGYADEIDAGSNE----IQLPRLPSLEQRSLPT 182

Query: 188 FFGSSDAYEFALPIFRRQFELLEQETNPMVVINTFDELEHDALRAISNY----------- 247
           F   +    F L + + + E L+ E    V++NTFD LE DAL AI  Y           
Sbjct: 183 FLLPATPERFRL-MMKEKLETLDGEEKAKVLVNTFDALEPDALTAIDRYELIGIGPLIPS 242

Query: 248 --------------------------IDWLNSKPKGSVIYVSSGSISTLSKHQKEEIARG 307
                                     ++WLNSKPK SV+YVS GS+    K Q EEI +G
Sbjct: 243 AFLDGEDPSETSYGGDLFEKSEENNCVEWLNSKPKSSVVYVSFGSVLRFPKAQMEEIGKG 302

Query: 308 LLSCGRPFLWVIR--------DIEEVNTLSCREELEGLGKIVSWCSQIEVLSRPATGCFL 367
           LL+CGRPFLW+IR        + EE   LSC  EL+ +GKIVSWCSQ+EVL+ PA GCF+
Sbjct: 303 LLACGRPFLWMIREQKNDDGEEEEEEEELSCIGELKKMGKIVSWCSQLEVLAHPALGCFV 362

Query: 368 THCGWNSTLESLVCGVPVVVFPQWSDQGTNAKIIQDMSETGVRLEVGMDGVVKREEIKRC 406
           THCGWNS +ESL CG+PVV  PQW DQ TNAK+I+D   TGVR+ +   G V   EI+RC
Sbjct: 363 THCGWNSAVESLSCGIPVVAVPQWFDQTTNAKLIEDAWGTGVRVRMNEGGGVDGCEIERC 422

BLAST of Cp4.1LG17g03950 vs. TrEMBL
Match: A0A0A0KA46_CUCSA (UDP-glucose:flavonoid 7-O-glucosyltransferase OS=Cucumis sativus GN=Csa_6G109750 PE=4 SV=1)

HSP 1 Score: 590.1 bits (1520), Expect = 2.1e-165
Identity = 306/467 (65.52%), Postives = 362/467 (77.52%), Query Frame = 1

Query: 1   MDNTAPH---RHRVLLITYSAQGHINPALEFAKRLTRRR-IDVTFVTSLSAYRRMGKTPT 60
           M+NT P+   RH VLL+T+ AQGHINP L+ AKRLTR   + VTF+ SLSAYRRMG TPT
Sbjct: 1   MNNTTPNPNPRH-VLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPT 60

Query: 61  LPHVSFASFSDGYDDGFKQGDDINHFMSELERRGSQAIKDMIVAGVEQGQPFTCIVYSIL 120
           LPH++FASFSDGYDDGFK  DDI  ++SELERRGS A+K++I     +GQPFTCIVYSIL
Sbjct: 61  LPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNIIQESRNKGQPFTCIVYSIL 120

Query: 121 LPWVAIVARSLHLPAILLWIQPAIVFALYYYYNYGYHDIIQSVF--DDPLAT-IQLPGLP 180
           +PWVA VARSL + ++ LWIQPA+VFALYYYYN GY+D IQ +   DDP +T I+LPGLP
Sbjct: 121 IPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLP 180

Query: 181 LLTARDLPSFFGSSDAYEFALPIFRRQFELLEQETNPMVVINTFDELEHDALRAISNY-- 240
           LL+ARDLPSFFG+SD Y FALP+FR+QFELLE+E+NP ++INTF+ELE DA++AI  +  
Sbjct: 181 LLSARDLPSFFGASDGYSFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHL 240

Query: 241 -----------ID----------------------WLNSKPKGSVIYVSSGSISTLSKHQ 300
                      +D                      WLNSKPK SV+YVS GSIST+SK Q
Sbjct: 241 MPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQ 300

Query: 301 KEEIARGLLSCGRPFLWVIRDI-EEVNTLSCREELEGLGKIVSWCSQIEVLSRPATGCFL 360
           KEEIARGL    RPFLWVIR+I EE + LS +E+LE  GKIVSWC+Q+EVLS PATGCFL
Sbjct: 301 KEEIARGLSLTKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFL 360

Query: 361 THCGWNSTLESLVCGVPVVVFPQWSDQGTNAKIIQDMSETGVRLEVGMDGVVKREEIKRC 420
           THCGWNS LESL CGVP V FPQWSDQ TN+KII+D+SETGVRLEV  +GVVK EEI+RC
Sbjct: 361 THCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRLEVEEEGVVKGEEIERC 420

Query: 421 LELVMGDSKKGEEIRKNVVKWKELAKGATAHGGSSYSNFKAFVDQVC 425
           LELVMGDSKKGEEIR+N +KWK+LAK A + GGSS++N KAFVD VC
Sbjct: 421 LELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVC 466

BLAST of Cp4.1LG17g03950 vs. TrEMBL
Match: F6I4F4_VITVI (Glycosyltransferase OS=Vitis vinifera GN=VIT_05s0062g00640 PE=3 SV=1)

HSP 1 Score: 519.6 bits (1337), Expect = 3.5e-144
Identity = 253/460 (55.00%), Postives = 327/460 (71.09%), Query Frame = 1

Query: 5   APHRHRVLLITYSAQGHINPALEFAKRLTRRRIDVTFVTSLSAYRRMGKTPTLPHVSFAS 64
           +PH    LL+T+ AQGHINPAL+FAKR+ R    V+F TS+SA+RRM K  T   ++F  
Sbjct: 3   SPH---FLLVTFPAQGHINPALQFAKRIIRTGAQVSFATSVSAHRRMAKRSTPEGLNFVP 62

Query: 65  FSDGYDDGFKQGDDINHFMSELERRGSQAIKDMIVAGVEQGQPFTCIVYSILLPWVAIVA 124
           FSDGYDDGFK  DD+ H+MSE++RRGS+ +++++V   ++GQPFTCIVY++LLPW A VA
Sbjct: 63  FSDGYDDGFKPTDDVQHYMSEIKRRGSETLREIVVRNADEGQPFTCIVYTLLLPWAAEVA 122

Query: 125 RSLHLPAILLWIQPAIVFALYYYYNYGYHDIIQSVFDDPLATIQLPGLPLLTARDLPSFF 184
           R L +P+ LLWIQPA V  +YYYY  GY D+ +++ ++P  +++LPGLPLL++RDLPSF 
Sbjct: 123 RGLGVPSALLWIQPATVLDIYYYYFNGYGDVFRNISNEPSCSVELPGLPLLSSRDLPSFL 182

Query: 185 GSSDAYEFALPIFRRQFELLEQETNPMVVINTFDELEHDALRAIS--------------- 244
             S+AY F LP F+ Q E L QET+P V++NTFD LE + LRA+                
Sbjct: 183 VKSNAYTFVLPTFQEQLEALSQETSPKVLVNTFDALEPEPLRAVDKLHLIGIGPLVPSAY 242

Query: 245 --------------------NYIDWLNSKPKGSVIYVSSGSISTLSKHQKEEIARGLLSC 304
                               +Y++WLNSKPK SV+YVS GSIS LSK QKE+IAR LL C
Sbjct: 243 LDGKDPSDTSFGGDMFQGSDDYMEWLNSKPKSSVVYVSFGSISVLSKTQKEDIARALLDC 302

Query: 305 GRPFLWVIR------DIEEVNTLSCREELEGLGKIVSWCSQIEVLSRPATGCFLTHCGWN 364
           G PFLWVIR      +++E + LSCREELE  G IVSWCSQIEVL+ P+ GCF++HCGWN
Sbjct: 303 GHPFLWVIRAPENGEEVKEQDKLSCREELEQKGMIVSWCSQIEVLTHPSLGCFVSHCGWN 362

Query: 365 STLESLVCGVPVVVFPQWSDQGTNAKIIQDMSETGVRLEVGMDGVVKREEIKRCLELVMG 424
           STLESLV GVPVV FPQW+DQGTNAK+I+DM + G+R+ V  +G+V+ +E KRCLE+VMG
Sbjct: 363 STLESLVSGVPVVAFPQWTDQGTNAKLIEDMWKIGIRVTVNEEGIVESDEFKRCLEIVMG 422

BLAST of Cp4.1LG17g03950 vs. TrEMBL
Match: A7MAV1_PYRCO (Glycosyltransferase OS=Pyrus communis PE=2 SV=1)

HSP 1 Score: 506.9 bits (1304), Expect = 2.4e-140
Identity = 258/471 (54.78%), Postives = 327/471 (69.43%), Query Frame = 1

Query: 8   RHRVLLITYSAQGHINPALEFAKRLTRRR-IDVTFVTSLSAYRRMGKTPTLPHVSFASFS 67
           +HR LL+TY AQGHINP+L+FAKRLT      VT+VTSLSA+RR+G       +++A FS
Sbjct: 3   QHRFLLVTYPAQGHINPSLQFAKRLTNTTGAHVTYVTSLSAHRRIGNGSIPDGLTYAPFS 62

Query: 68  DGYDDGFKQGDDINHFMSELERRGSQAIKDMIVAGVEQGQPFTCIVYSILLPWVAIVARS 127
           DGYDDGFK GD+I+ +MSEL  RG+QAI D++VA   +G P+TC+VYS+++PW A VA  
Sbjct: 63  DGYDDGFKPGDNIDDYMSELRHRGAQAITDLVVASANEGHPYTCLVYSLIVPWSAGVAHE 122

Query: 128 LHLPAILLWIQPAIVFALYYYYNYGYHDIIQSVFDDPL-----ATIQLPGLPL-LTARDL 187
           LHLP++LLWIQPA VF +YYYY  GY D+I+             +I+LPGLPL  T+RDL
Sbjct: 123 LHLPSVLLWIQPATVFDIYYYYFNGYKDLIRDNTSSGTNNVLPCSIELPGLPLSFTSRDL 182

Query: 188 PSFFGSSDAYEFALPIFRRQFELLEQETNPMVVINTFDELEHDALRAI------------ 247
           PSF   ++ Y FALP+F+ Q ELLE+ETNP +++NTFD LE +AL+AI            
Sbjct: 183 PSFMVDTNPYNFALPLFQEQMELLERETNPTILVNTFDALEPEALKAIDKYNLIGVGPLI 242

Query: 248 -------------------------SNYIDWLNSKPKGSVIYVSSGSISTLSKHQKEEIA 307
                                    S+Y++WLNSKP+GSVIYVS GSIS L K Q EEIA
Sbjct: 243 PSAFLDGKDPSDKSFGGDLVQKSRDSSYLEWLNSKPEGSVIYVSFGSISVLGKAQMEEIA 302

Query: 308 RGLLSCGRPFLWVIRD-----------IEEVNTLSCREELEGLGKIVSWCSQIEVLSRPA 367
           +GLL CG PFLWVIRD            +E   LSCR ELE LG+IV WCSQ+EVLS P+
Sbjct: 303 KGLLDCGLPFLWVIRDKVDKKGDDNEAKQEEAMLSCRVELEELGRIVPWCSQVEVLSSPS 362

Query: 368 TGCFLTHCGWNSTLESLVCGVPVVVFPQWSDQGTNAKIIQDMSETGVRLEVGMDGVVKRE 424
            GCF+THCGWNS+LESLV GVPVV FPQW+DQGTNAK+I+D  +TGVR+   ++G+V  E
Sbjct: 363 LGCFVTHCGWNSSLESLVSGVPVVAFPQWTDQGTNAKLIEDFWKTGVRVTPNVEGIVTGE 422

BLAST of Cp4.1LG17g03950 vs. TrEMBL
Match: M5X8U4_PRUPE (Glycosyltransferase OS=Prunus persica GN=PRUPE_ppa016890mg PE=3 SV=1)

HSP 1 Score: 501.5 bits (1290), Expect = 1.0e-138
Identity = 256/470 (54.47%), Postives = 324/470 (68.94%), Query Frame = 1

Query: 8   RHRVLLITYSAQGHINPALEFAKRLTRRR-IDVTFVTSLSAYRRMGKTPTLPHVSFASFS 67
           +HR L +TY AQGHINPAL+ AKRL R     VT+VTSL AYRR+    T   +++A +S
Sbjct: 3   QHRFLFLTYPAQGHINPALQLAKRLIRNTGAQVTYVTSLYAYRRIVNGSTPNGLTYAPYS 62

Query: 68  DGYDDGFKQGDDINHFMSELERRGSQAIKDMIVAGVEQGQPFTCIVYSILLPWVAIVARS 127
           DGYDDGFK  DD++H+MSEL R GSQ I D++ +  ++G P+TC+VY+ILLPW A +AR 
Sbjct: 63  DGYDDGFKFSDDVDHYMSELRRAGSQVITDLVASSAKEGHPYTCLVYTILLPWAADLARE 122

Query: 128 LHLPAILLWIQPAIVFALYYYYNYGYHDIIQSVF----DDPLATIQLPGLPL-LTARDLP 187
           LHLP++L WIQ A +F +YYYY  GY D+I+  F    +DP  +IQLPGLPL L +RDLP
Sbjct: 123 LHLPSVLAWIQAATLFDVYYYYLSGYKDLIRESFGTDTNDPSCSIQLPGLPLDLASRDLP 182

Query: 188 SFFGSSDAYEFALPIFRRQFELLEQETNPMVVINTFDELEHDALRAISNY---------- 247
           SF  + ++Y FALP+F +QFELLE+ET P++++NTFD LE +AL+AI  Y          
Sbjct: 183 SFMVAENSYNFALPLFEKQFELLERETKPIILVNTFDALEPEALKAIDKYNLIGIGPLIP 242

Query: 248 ---------------------------IDWLNSKPKGSVIYVSSGSISTLSKHQKEEIAR 307
                                      I+WLNSKP+GSVIYVS GS+S LSK Q EEIA+
Sbjct: 243 SAFLDGKDPSDTSFGGDLFQKSMDSSCIEWLNSKPEGSVIYVSFGSVSALSKDQMEEIAK 302

Query: 308 GLLSCGRPFLWVIRDIEEVN-----------TLSCREELEGLGKIVSWCSQIEVLSRPAT 367
           GLL  GRPFLWVIR+ EE N             SCREEL+ LGKIV WCSQ+EVLS P+ 
Sbjct: 303 GLLDYGRPFLWVIREKEERNGQDNETEKEEEKFSCREELKELGKIVLWCSQLEVLSNPSL 362

Query: 368 GCFLTHCGWNSTLESLVCGVPVVVFPQWSDQGTNAKIIQDMSETGVRLEVGMDGVVKREE 424
           GCF+THCGWNS++ESLV GVPVV FP W+DQ TNAK+I+D  +TGVR+    +G+V  EE
Sbjct: 363 GCFVTHCGWNSSMESLVSGVPVVAFPLWTDQRTNAKLIEDTWKTGVRVAPNEEGIVVGEE 422

BLAST of Cp4.1LG17g03950 vs. TrEMBL
Match: A7MAS5_MALDO (Glycosyltransferase OS=Malus domestica PE=2 SV=1)

HSP 1 Score: 498.8 bits (1283), Expect = 6.5e-138
Identity = 256/471 (54.35%), Postives = 328/471 (69.64%), Query Frame = 1

Query: 8   RHRVLLITYSAQGHINPALEFAKRLTRRR-IDVTFVTSLSAYRRMGKTPTLPHVSFASFS 67
           +HR LL+T+ AQGHINP+L+FAKRL       VT+VTSLSA+RR+G       +++A FS
Sbjct: 3   QHRFLLVTFPAQGHINPSLQFAKRLINTTGAHVTYVTSLSAHRRIGNGSIPDGLTYAPFS 62

Query: 68  DGYDDGFKQGDDINHFMSELERRGSQAIKDMIVAGVEQGQPFTCIVYSILLPWVAIVARS 127
           DGYDDGFK GD+++ +MSEL RRG QAI D++VA   +G P+TC+VYS+LLPW A +A  
Sbjct: 63  DGYDDGFKPGDNVDDYMSELRRRGVQAITDLVVASANEGHPYTCLVYSLLLPWSAGMAHE 122

Query: 128 LHLPAILLWIQPAIVFALYYYYNYGYHDIIQ----SVFDDPL-ATIQLPGLPL-LTARDL 187
           LHLP++LLWIQPA VF +YYYY  GY D+I+    S  ++ L  +I+LPGLPL  T+RDL
Sbjct: 123 LHLPSVLLWIQPATVFDIYYYYFNGYKDLIRDNTSSGTNNVLPCSIELPGLPLSFTSRDL 182

Query: 188 PSFFGSSDAYEFALPIFRRQFELLEQETNPMVVINTFDELEHDALRAI------------ 247
           PSF   ++ Y FALP+F+ Q ELLE+ETNP +++NTFD LE +AL+AI            
Sbjct: 183 PSFMVDTNPYNFALPLFQEQMELLERETNPTILVNTFDALEPEALKAIDKYNLIGVGPLI 242

Query: 248 -------------------------SNYIDWLNSKPKGSVIYVSSGSISTLSKHQKEEIA 307
                                    S+Y++WLNSKP+GSVIYVS GSIS L K Q EEIA
Sbjct: 243 PSAFLDGKDPSDKSFGGDLFQKSKDSSYLEWLNSKPEGSVIYVSFGSISVLGKAQMEEIA 302

Query: 308 RGLLSCGRPFLWVIRD-----------IEEVNTLSCREELEGLGKIVSWCSQIEVLSRPA 367
           +GLL CG PFLWVIRD            +E   L CREELE LG IV WCSQ+EVLS P+
Sbjct: 303 KGLLDCGLPFLWVIRDKVGKKGDDNEAKKEEEMLRCREELEELGMIVPWCSQVEVLSSPS 362

Query: 368 TGCFLTHCGWNSTLESLVCGVPVVVFPQWSDQGTNAKIIQDMSETGVRLEVGMDGVVKRE 424
            GCF+THCGWNS+LESLV GVPVV FPQW+DQGTNAK+I+D  +TGVR+    +G+V  E
Sbjct: 363 LGCFVTHCGWNSSLESLVSGVPVVAFPQWTDQGTNAKLIEDYWKTGVRVTPNEEGIVTGE 422

BLAST of Cp4.1LG17g03950 vs. TAIR10
Match: AT4G15550.1 (AT4G15550.1 indole-3-acetate beta-D-glucosyltransferase)

HSP 1 Score: 408.3 bits (1048), Expect = 5.8e-114
Identity = 223/469 (47.55%), Postives = 296/469 (63.11%), Query Frame = 1

Query: 4   TAPHRHRVLLITYSAQGHINPALEFAKRL--TRRRIDVTFVTSLSAY-RRMGKTPTLPH- 63
           T PH    L +T+ AQGHINP+LE AKRL  T     VTF  S+SAY RRM  T  +P  
Sbjct: 10  TGPH---FLFVTFPAQGHINPSLELAKRLAGTISGARVTFAASISAYNRRMFSTENVPET 69

Query: 64  VSFASFSDGYDDGFKQG--------DDINHFMSELERRGSQAIKDMIVAGVEQGQPFTCI 123
           + FA++SDG+DDGFK          D   +FMSE+ RRG + + ++I    +Q +PFTC+
Sbjct: 70  LIFATYSDGHDDGFKSSAYSDKSRQDATGNFMSEMRRRGKETLTELIEDNRKQNRPFTCV 129

Query: 124 VYSILLPWVAIVARSLHLPAILLWIQPAIVFALYYYYNYGYHDIIQSVFDDPLATIQLPG 183
           VY+ILL WVA +AR  HLP+ LLW+QP  VF+++Y+Y  GY D I  + + P ++I+LP 
Sbjct: 130 VYTILLTWVAELAREFHLPSALLWVQPVTVFSIFYHYFNGYEDAISEMANTPSSSIKLPS 189

Query: 184 LPLLTARDLPSFFGSSDAYEFALPIFRRQFELLEQETNPMVVINTFDELEHDALRAI--- 243
           LPLLT RD+PSF  SS+ Y F LP FR Q + L++E NP ++INTF ELE +A+ ++   
Sbjct: 190 LPLLTVRDIPSFIVSSNVYAFLLPAFREQIDSLKEEINPKILINTFQELEPEAMSSVPDN 249

Query: 244 -------------------SNYIDWLNSKPKGSVIYVSSGSISTLSKHQKEEIARGLLSC 303
                                YI+WL++K   SV+YVS G+++ LSK Q  E+ + L+  
Sbjct: 250 FKIVPVGPLLTLRTDFSSRGEYIEWLDTKADSSVLYVSFGTLAVLSKKQLVELCKALIQS 309

Query: 304 GRPFLWVIRD-----------IEEVNTLSCREELEGLGKIVSWCSQIEVLSRPATGCFLT 363
            RPFLWVI D            EE    S REEL+ +G +VSWC Q  VL+  + GCF+T
Sbjct: 310 RRPFLWVITDKSYRNKEDEQEKEEDCISSFREELDEIGMVVSWCDQFRVLNHRSIGCFVT 369

Query: 364 HCGWNSTLESLVCGVPVVVFPQWSDQGTNAKIIQDMSETGVRL-----EVGMDGVVKREE 423
           HCGWNSTLESLV GVPVV FPQW+DQ  NAK+++D  +TGVR+     E G+  VV  EE
Sbjct: 370 HCGWNSTLESLVSGVPVVAFPQWNDQMMNAKLLEDCWKTGVRVMEKKEEEGV-VVVDSEE 429

BLAST of Cp4.1LG17g03950 vs. TAIR10
Match: AT4G14090.1 (AT4G14090.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 374.8 bits (961), Expect = 7.1e-104
Identity = 204/454 (44.93%), Postives = 279/454 (61.45%), Query Frame = 1

Query: 3   NTAPHRHRVLLITYSAQGHINPALEFAKRLTRRRIDVTFVTSLSAYRRMGKTPTLPHVSF 62
           N +  R   LL+T+ AQGHINPAL+ A RL      VT+ T++SA+RRMG+ P+   +SF
Sbjct: 6   NGSHRRPHYLLVTFPAQGHINPALQLANRLIHHGATVTYSTAVSAHRRMGEPPSTKGLSF 65

Query: 63  ASFSDGYDDGFKQGDDINHFMSELERRGSQAIKDMIVAGVE---QGQPFTCIVYSILLPW 122
           A F+DG+DDG K  +D   +MSEL+R GS A++D+I A ++   + +P T ++YS+L+PW
Sbjct: 66  AWFTDGFDDGLKSFEDQKIYMSELKRCGSNALRDIIKANLDATTETEPITGVIYSVLVPW 125

Query: 123 VAIVARSLHLPAILLWIQPAIVFALYYYYNYGYHDIIQSVFDDPLATIQLPGLPLLTARD 182
           V+ VAR  HLP  LLWI+PA V  +YYYY   ++   + +FD  +  I+LP LPL+T  D
Sbjct: 126 VSTVAREFHLPTTLLWIEPATVLDIYYYY---FNTSYKHLFD--VEPIKLPKLPLITTGD 185

Query: 183 LPSFFGSSDAYEFALPIFRRQFELLEQETNPMVVINTFDELEHDALRAIS---------- 242
           LPSF   S A   AL   R   E LE E+NP +++NTF  LEHDAL ++           
Sbjct: 186 LPSFLQPSKALPSALVTLREHIEALETESNPKILVNTFSALEHDALTSVEKLKMIPIGPL 245

Query: 243 ----------------NYIDWLNSKPKGSVIYVSSGS-ISTLSKHQKEEIARGLLSCGRP 302
                           +Y  WL+SK + SVIY+S G+    L +   E +  G+L+  RP
Sbjct: 246 VSSSEGKTDLFKSSDEDYTKWLDSKLERSVIYISLGTHADDLPEKHMEALTHGVLATNRP 305

Query: 303 FLWVIRDI--EEVNTLSCREELEGL--GKIVSWCSQIEVLSRPATGCFLTHCGWNSTLES 362
           FLW++R+   EE       E + G   G +V WCSQ  VL+  A GCF+THCGWNSTLES
Sbjct: 306 FLWIVREKNPEEKKKNRFLELIRGSDRGLVVGWCSQTAVLAHCAVGCFVTHCGWNSTLES 365

Query: 363 LVCGVPVVVFPQWSDQGTNAKIIQDMSETGVRLEVGMDGVVKREEIKRCLELVMGDSKKG 422
           L  GVPVV FPQ++DQ T AK+++D    GV+++VG +G V  EEI+RCLE VM   ++ 
Sbjct: 366 LESGVPVVAFPQFADQCTTAKLVEDTWRIGVKVKVGEEGDVDGEEIRRCLEKVMSGGEEA 425

BLAST of Cp4.1LG17g03950 vs. TAIR10
Match: AT1G05560.1 (AT1G05560.1 UDP-glucosyltransferase 75B1)

HSP 1 Score: 351.7 bits (901), Expect = 6.5e-97
Identity = 207/459 (45.10%), Postives = 266/459 (57.95%), Query Frame = 1

Query: 12  LLITYSAQGHINPALEFAKRLTRRR-IDVTFVTSLSAYRR--MGKTPTLPHVSFASFSDG 71
           LL+T+ AQGH+NP+L FA+RL +R    VTFVT +S +    +     + ++SF +FSDG
Sbjct: 7   LLVTFPAQGHVNPSLRFARRLIKRTGARVTFVTCVSVFHNSMIANHNKVENLSFLTFSDG 66

Query: 72  YDDG-FKQGDDINHFMSELERRGSQAIKDMIVAGVEQGQPFTCIVYSILLPWVAIVARSL 131
           +DDG     +D       L+  G +A+ D I A      P TC++Y+ILL W   VAR  
Sbjct: 67  FDDGGISTYEDRQKRSVNLKVNGDKALSDFIEATKNGDSPVTCLIYTILLNWAPKVARRF 126

Query: 132 HLPAILLWIQPAIVFALYYYYNYGYHDIIQSVFDDPLATIQLPGLPLLTARDLPSFFGSS 191
            LP+ LLWIQPA+VF +YY +  G     +SVF+       LP L  L  RDLPSF   S
Sbjct: 127 QLPSALLWIQPALVFNIYYTHFMGN----KSVFE-------LPNLSSLEIRDLPSFLTPS 186

Query: 192 DAYEFALPIFRRQFELLEQETNPMVVINTFDELEHDALRAISN----------------- 251
           +  + A   F+   E L +ET P ++INTFD LE +AL A  N                 
Sbjct: 187 NTNKGAYDAFQEMMEFLIKETKPKILINTFDSLEPEALTAFPNIDMVAVGPLLPTEIFSG 246

Query: 252 ------------YIDWLNSKPKGSVIYVSSGSISTLSKHQKEEIARGLLSCGRPFLWVIR 311
                       Y  WL+SK + SVIYVS G++  LSK Q EE+AR L+   RPFLWVI 
Sbjct: 247 STNKSVKDQSSSYTLWLDSKTESSVIYVSFGTMVELSKKQIEELARALIEGKRPFLWVIT 306

Query: 312 DIEEVNTLS-------------CREELEGLGKIVSWCSQIEVLSRPATGCFLTHCGWNST 371
           D     T +              R ELE +G IVSWCSQIEVLS  A GCF+THCGW+ST
Sbjct: 307 DKSNRETKTEGEEETEIEKIAGFRHELEEVGMIVSWCSQIEVLSHRAVGCFVTHCGWSST 366

Query: 372 LESLVCGVPVVVFPQWSDQGTNAKIIQDMSETGVRLEVGMDGVVKREEIKRCLELVMGDS 425
           LESLV GVPVV FP WSDQ TNAK++++  +TGVR+    DG+V+R EI+RCLE VM   
Sbjct: 367 LESLVLGVPVVAFPMWSDQPTNAKLLEESWKTGVRVRENKDGLVERGEIRRCLEAVM--E 426

BLAST of Cp4.1LG17g03950 vs. TAIR10
Match: AT1G05530.1 (AT1G05530.1 UDP-glucosyl transferase 75B2)

HSP 1 Score: 349.7 bits (896), Expect = 2.5e-96
Identity = 199/458 (43.45%), Postives = 262/458 (57.21%), Query Frame = 1

Query: 12  LLITYSAQGHINPALEFAKRLTRRR-IDVTFVTSLSAYRR--MGKTPTLPHVSFASFSDG 71
           LL+T+ AQGH+NP+L FA+RL +     VTF T LS   R  +     + ++SF +FSDG
Sbjct: 7   LLVTFPAQGHVNPSLRFARRLIKTTGARVTFATCLSVIHRSMIPNHNNVENLSFLTFSDG 66

Query: 72  YDDG-FKQGDDINHFMSELERRGSQAIKDMIVAGVEQGQPFTCIVYSILLPWVAIVARSL 131
           +DDG     DD+ + +   ER G +A+ D I A      P +C++Y+IL  WV  VAR  
Sbjct: 67  FDDGVISNTDDVQNRLVHFERNGDKALSDFIEANQNGDSPVSCLIYTILPNWVPKVARRF 126

Query: 132 HLPAILLWIQPAIVFALYYYYNYGYHDIIQSVFDDPLATIQLPGLPLLTARDLPSFFGSS 191
           HLP++ LWIQPA  F +YY Y+ G + + +            P LP L  RDLPSF   S
Sbjct: 127 HLPSVHLWIQPAFAFDIYYNYSTGNNSVFE-----------FPNLPSLEIRDLPSFLSPS 186

Query: 192 DAYEFALPIFRRQFELLEQETNPMVVINTFDELEHDALRAISN----------------- 251
           +  + A  +++   + L++E+NP +++NTFD LE + L AI N                 
Sbjct: 187 NTNKAAQAVYQELMDFLKEESNPKILVNTFDSLEPEFLTAIPNIEMVAVGPLLPAEIFTG 246

Query: 252 ---------------YIDWLNSKPKGSVIYVSSGSISTLSKHQKEEIARGLLSCGRPFLW 311
                          Y  WL+SK + SVIYVS G++  LSK Q EE+AR L+  GRPFLW
Sbjct: 247 SESGKDLSRDHQSSSYTLWLDSKTESSVIYVSFGTMVELSKKQIEELARALIEGGRPFLW 306

Query: 312 VIRD-------------IEEVNTLSCREELEGLGKIVSWCSQIEVLSRPATGCFLTHCGW 371
           VI D              E       R ELE +G IVSWCSQIEVL   A GCFLTHCGW
Sbjct: 307 VITDKLNREAKIEGEEETEIEKIAGFRHELEEVGMIVSWCSQIEVLRHRAIGCFLTHCGW 366

Query: 372 NSTLESLVCGVPVVVFPQWSDQGTNAKIIQDMSETGVRLEVGMDGVVKREEIKRCLELVM 421
           +S+LESLV GVPVV FP WSDQ  NAK+++++ +TGVR+    +G+V+R EI RCLE VM
Sbjct: 367 SSSLESLVLGVPVVAFPMWSDQPANAKLLEEIWKTGVRVRENSEGLVERGEIMRCLEAVM 426

BLAST of Cp4.1LG17g03950 vs. TAIR10
Match: AT4G15490.1 (AT4G15490.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 276.2 bits (705), Expect = 3.4e-74
Identity = 172/471 (36.52%), Postives = 251/471 (53.29%), Query Frame = 1

Query: 6   PHRH-RVLLITYSAQGHINPALEFAKRLTRRRIDVTFVTSLSAYRR-----------MGK 65
           P RH  V+L+++  QGH+NP L   K +  + + VTFVT+   + +           + K
Sbjct: 3   PSRHTHVMLVSFPGQGHVNPLLRLGKLIASKGLLVTFVTTEKPWGKKMRQANKIQDGVLK 62

Query: 66  TPTLPHVSFASFSDGYDDGFKQGDDINHFMSELERRGSQAIKDMIVAGVEQGQPFTCIVY 125
              L  + F  FSDG+ D  ++  D + F   LE  G Q IK+++       +P TC++ 
Sbjct: 63  PVGLGFIRFEFFSDGFADDDEKRFDFDAFRPHLEAVGKQEIKNLVKR--YNKEPVTCLIN 122

Query: 126 SILLPWVAIVARSLHLPAILLWIQPAIVFALYYYYNYGYHDIIQ-SVFDDPLATIQLPGL 185
           +  +PWV  VA  LH+P+ +LW+Q       YYYY   +H +++     +P  ++++P L
Sbjct: 123 NAFVPWVCDVAEELHIPSAVLWVQSCACLTAYYYY---HHRLVKFPTKTEPDISVEIPCL 182

Query: 186 PLLTARDLPSFFGSSDAYEFALPIFRRQFELLEQETNPMVVINTFDELEHDAL------- 245
           PLL   ++PSF   S  Y     I   Q +  E   +  + I+TF ELE D +       
Sbjct: 183 PLLKHDEIPSFLHPSSPYTAFGDIILDQLKRFENHKSFYLFIDTFRELEKDIMDHMSQLC 242

Query: 246 -RAI--------------------------SNYIDWLNSKPKGSVIYVSSGSISTLSKHQ 305
            +AI                          S+ ++WL+S+   SV+Y+S G+I+ L + Q
Sbjct: 243 PQAIISPVGPLFKMAQTLSSDVKGDISEPASDCMEWLDSREPSSVVYISFGTIANLKQEQ 302

Query: 306 KEEIARGLLSCGRPFLWVIRDIEE---VNTLSCREELEGLGKIVSWCSQIEVLSRPATGC 365
            EEIA G+LS G   LWV+R   E   V       ELE  GKIV WC Q  VL+ PA  C
Sbjct: 303 MEEIAHGVLSSGLSVLWVVRPPMEGTFVEPHVLPRELEEKGKIVEWCPQERVLAHPAIAC 362

Query: 366 FLTHCGWNSTLESLVCGVPVVVFPQWSDQGTNAKIIQDMSETGVRLEVGM--DGVVKREE 424
           FL+HCGWNST+E+L  GVPVV FPQW DQ T+A  + D+ +TGVRL  G   + +V RE 
Sbjct: 363 FLSHCGWNSTMEALTAGVPVVCFPQWGDQVTDAVYLADVFKTGVRLGRGAAEEMIVSREV 422

BLAST of Cp4.1LG17g03950 vs. NCBI nr
Match: gi|659116578|ref|XP_008458144.1| (PREDICTED: anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 1-like [Cucumis melo])

HSP 1 Score: 607.4 bits (1565), Expect = 1.9e-170
Identity = 314/465 (67.53%), Postives = 364/465 (78.28%), Query Frame = 1

Query: 2   DNTAPHRHRVLLITYSAQGHINPALEFAKRLTRRR-IDVTFVTSLSAYRRMGKTPTLPHV 61
           + T P+  RVLLITYSAQGHINP L+ AKRL R   + VTF+TSLSAYRRMG+TPTLPH+
Sbjct: 3   NTTPPNPRRVLLITYSAQGHINPTLQLAKRLIRHGDLHVTFLTSLSAYRRMGQTPTLPHL 62

Query: 62  SFASFSDGYDDGFKQGDDINHFMSELERRGSQAIKDMIVAGVEQGQPFTCIVYSILLPWV 121
           SFASFSDGYDDGFK GDDI+H++SELER GS A+K++I     QGQPFTCIVYSILLPWV
Sbjct: 63  SFASFSDGYDDGFKPGDDIDHYVSELERCGSDALKNIIQESRNQGQPFTCIVYSILLPWV 122

Query: 122 AIVARSLHLPAILLWIQPAIVFALYYYYNYGYHDIIQSVF--DDPLAT--IQLPGLPLLT 181
           A VARSL + ++LLWIQPA+VFALYYYY  GY+D IQ +   DDP ++  I+LPGLPLL+
Sbjct: 123 ATVARSLDVASVLLWIQPAVVFALYYYYFNGYYDEIQRIISGDDPGSSMSIKLPGLPLLS 182

Query: 182 ARDLPSFFGSSDAYEFALPIFRRQFELL-EQETNPMVVINTFDELEHDALRAI------- 241
           ARDLPSFFG SD Y FAL IFR+QFELL E+E+NP ++INTF+ELE DA++AI       
Sbjct: 183 ARDLPSFFGGSDVYAFALIIFRKQFELLEEEESNPNILINTFEELEKDAVKAIKKFHLMP 242

Query: 242 ----------------------------SNYIDWLNSKPKGSVIYVSSGSISTLSKHQKE 301
                                       S+YIDWLNSKPK SV+YVSSGSI+ LS  QKE
Sbjct: 243 IGPLIPSVFFDGTDPSEASSGCDLYRSTSSYIDWLNSKPKASVVYVSSGSITKLSNQQKE 302

Query: 302 EIARGLLSCGRPFLWVIRDIE-EVNTLSCREELEGLGKIVSWCSQIEVLSRPATGCFLTH 361
           E+ARGLLS  RPFLWVIRD E E ++LS +E+LE  GKIV WCSQ+EVLS PATGCFLTH
Sbjct: 303 EMARGLLSTKRPFLWVIRDTEAEEDSLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTH 362

Query: 362 CGWNSTLESLVCGVPVVVFPQWSDQGTNAKIIQDMSETGVRLEVGMDGVVKREEIKRCLE 421
           CGWNS LESL CGVP V FPQWSDQ TN+KIIQD+SETGVRLE G DGVVK EEI+RCL 
Sbjct: 363 CGWNSCLESLACGVPTVAFPQWSDQATNSKIIQDLSETGVRLEAGEDGVVKGEEIERCLT 422

Query: 422 LVMGDSKKGEEIRKNVVKWKELAKGATAHGGSSYSNFKAFVDQVC 425
           LVMGDSKKGE+IR+N +KWK+LAK A + GGSS++NFKAFVDQVC
Sbjct: 423 LVMGDSKKGEDIRRNALKWKKLAKEAASEGGSSFANFKAFVDQVC 467

BLAST of Cp4.1LG17g03950 vs. NCBI nr
Match: gi|449445445|ref|XP_004140483.1| (PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Cucumis sativus])

HSP 1 Score: 590.1 bits (1520), Expect = 3.1e-165
Identity = 306/467 (65.52%), Postives = 362/467 (77.52%), Query Frame = 1

Query: 1   MDNTAPH---RHRVLLITYSAQGHINPALEFAKRLTRRR-IDVTFVTSLSAYRRMGKTPT 60
           M+NT P+   RH VLL+T+ AQGHINP L+ AKRLTR   + VTF+ SLSAYRRMG TPT
Sbjct: 1   MNNTTPNPNPRH-VLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPT 60

Query: 61  LPHVSFASFSDGYDDGFKQGDDINHFMSELERRGSQAIKDMIVAGVEQGQPFTCIVYSIL 120
           LPH++FASFSDGYDDGFK  DDI  ++SELERRGS A+K++I     +GQPFTCIVYSIL
Sbjct: 61  LPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNIIQESRNKGQPFTCIVYSIL 120

Query: 121 LPWVAIVARSLHLPAILLWIQPAIVFALYYYYNYGYHDIIQSVF--DDPLAT-IQLPGLP 180
           +PWVA VARSL + ++ LWIQPA+VFALYYYYN GY+D IQ +   DDP +T I+LPGLP
Sbjct: 121 IPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLP 180

Query: 181 LLTARDLPSFFGSSDAYEFALPIFRRQFELLEQETNPMVVINTFDELEHDALRAISNY-- 240
           LL+ARDLPSFFG+SD Y FALP+FR+QFELLE+E+NP ++INTF+ELE DA++AI  +  
Sbjct: 181 LLSARDLPSFFGASDGYSFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHL 240

Query: 241 -----------ID----------------------WLNSKPKGSVIYVSSGSISTLSKHQ 300
                      +D                      WLNSKPK SV+YVS GSIST+SK Q
Sbjct: 241 MPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQ 300

Query: 301 KEEIARGLLSCGRPFLWVIRDI-EEVNTLSCREELEGLGKIVSWCSQIEVLSRPATGCFL 360
           KEEIARGL    RPFLWVIR+I EE + LS +E+LE  GKIVSWC+Q+EVLS PATGCFL
Sbjct: 301 KEEIARGLSLTKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFL 360

Query: 361 THCGWNSTLESLVCGVPVVVFPQWSDQGTNAKIIQDMSETGVRLEVGMDGVVKREEIKRC 420
           THCGWNS LESL CGVP V FPQWSDQ TN+KII+D+SETGVRLEV  +GVVK EEI+RC
Sbjct: 361 THCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRLEVEEEGVVKGEEIERC 420

Query: 421 LELVMGDSKKGEEIRKNVVKWKELAKGATAHGGSSYSNFKAFVDQVC 425
           LELVMGDSKKGEEIR+N +KWK+LAK A + GGSS++N KAFVD VC
Sbjct: 421 LELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVC 466

BLAST of Cp4.1LG17g03950 vs. NCBI nr
Match: gi|225433620|ref|XP_002263700.1| (PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Vitis vinifera])

HSP 1 Score: 519.6 bits (1337), Expect = 5.1e-144
Identity = 253/460 (55.00%), Postives = 327/460 (71.09%), Query Frame = 1

Query: 5   APHRHRVLLITYSAQGHINPALEFAKRLTRRRIDVTFVTSLSAYRRMGKTPTLPHVSFAS 64
           +PH    LL+T+ AQGHINPAL+FAKR+ R    V+F TS+SA+RRM K  T   ++F  
Sbjct: 3   SPH---FLLVTFPAQGHINPALQFAKRIIRTGAQVSFATSVSAHRRMAKRSTPEGLNFVP 62

Query: 65  FSDGYDDGFKQGDDINHFMSELERRGSQAIKDMIVAGVEQGQPFTCIVYSILLPWVAIVA 124
           FSDGYDDGFK  DD+ H+MSE++RRGS+ +++++V   ++GQPFTCIVY++LLPW A VA
Sbjct: 63  FSDGYDDGFKPTDDVQHYMSEIKRRGSETLREIVVRNADEGQPFTCIVYTLLLPWAAEVA 122

Query: 125 RSLHLPAILLWIQPAIVFALYYYYNYGYHDIIQSVFDDPLATIQLPGLPLLTARDLPSFF 184
           R L +P+ LLWIQPA V  +YYYY  GY D+ +++ ++P  +++LPGLPLL++RDLPSF 
Sbjct: 123 RGLGVPSALLWIQPATVLDIYYYYFNGYGDVFRNISNEPSCSVELPGLPLLSSRDLPSFL 182

Query: 185 GSSDAYEFALPIFRRQFELLEQETNPMVVINTFDELEHDALRAIS--------------- 244
             S+AY F LP F+ Q E L QET+P V++NTFD LE + LRA+                
Sbjct: 183 VKSNAYTFVLPTFQEQLEALSQETSPKVLVNTFDALEPEPLRAVDKLHLIGIGPLVPSAY 242

Query: 245 --------------------NYIDWLNSKPKGSVIYVSSGSISTLSKHQKEEIARGLLSC 304
                               +Y++WLNSKPK SV+YVS GSIS LSK QKE+IAR LL C
Sbjct: 243 LDGKDPSDTSFGGDMFQGSDDYMEWLNSKPKSSVVYVSFGSISVLSKTQKEDIARALLDC 302

Query: 305 GRPFLWVIR------DIEEVNTLSCREELEGLGKIVSWCSQIEVLSRPATGCFLTHCGWN 364
           G PFLWVIR      +++E + LSCREELE  G IVSWCSQIEVL+ P+ GCF++HCGWN
Sbjct: 303 GHPFLWVIRAPENGEEVKEQDKLSCREELEQKGMIVSWCSQIEVLTHPSLGCFVSHCGWN 362

Query: 365 STLESLVCGVPVVVFPQWSDQGTNAKIIQDMSETGVRLEVGMDGVVKREEIKRCLELVMG 424
           STLESLV GVPVV FPQW+DQGTNAK+I+DM + G+R+ V  +G+V+ +E KRCLE+VMG
Sbjct: 363 STLESLVSGVPVVAFPQWTDQGTNAKLIEDMWKIGIRVTVNEEGIVESDEFKRCLEIVMG 422

BLAST of Cp4.1LG17g03950 vs. NCBI nr
Match: gi|63028446|gb|AAY27090.1| (UDP-glucose:flavonoid 7-O-glucosyltransferase [Pyrus communis])

HSP 1 Score: 506.9 bits (1304), Expect = 3.4e-140
Identity = 258/471 (54.78%), Postives = 327/471 (69.43%), Query Frame = 1

Query: 8   RHRVLLITYSAQGHINPALEFAKRLTRRR-IDVTFVTSLSAYRRMGKTPTLPHVSFASFS 67
           +HR LL+TY AQGHINP+L+FAKRLT      VT+VTSLSA+RR+G       +++A FS
Sbjct: 3   QHRFLLVTYPAQGHINPSLQFAKRLTNTTGAHVTYVTSLSAHRRIGNGSIPDGLTYAPFS 62

Query: 68  DGYDDGFKQGDDINHFMSELERRGSQAIKDMIVAGVEQGQPFTCIVYSILLPWVAIVARS 127
           DGYDDGFK GD+I+ +MSEL  RG+QAI D++VA   +G P+TC+VYS+++PW A VA  
Sbjct: 63  DGYDDGFKPGDNIDDYMSELRHRGAQAITDLVVASANEGHPYTCLVYSLIVPWSAGVAHE 122

Query: 128 LHLPAILLWIQPAIVFALYYYYNYGYHDIIQSVFDDPL-----ATIQLPGLPL-LTARDL 187
           LHLP++LLWIQPA VF +YYYY  GY D+I+             +I+LPGLPL  T+RDL
Sbjct: 123 LHLPSVLLWIQPATVFDIYYYYFNGYKDLIRDNTSSGTNNVLPCSIELPGLPLSFTSRDL 182

Query: 188 PSFFGSSDAYEFALPIFRRQFELLEQETNPMVVINTFDELEHDALRAI------------ 247
           PSF   ++ Y FALP+F+ Q ELLE+ETNP +++NTFD LE +AL+AI            
Sbjct: 183 PSFMVDTNPYNFALPLFQEQMELLERETNPTILVNTFDALEPEALKAIDKYNLIGVGPLI 242

Query: 248 -------------------------SNYIDWLNSKPKGSVIYVSSGSISTLSKHQKEEIA 307
                                    S+Y++WLNSKP+GSVIYVS GSIS L K Q EEIA
Sbjct: 243 PSAFLDGKDPSDKSFGGDLVQKSRDSSYLEWLNSKPEGSVIYVSFGSISVLGKAQMEEIA 302

Query: 308 RGLLSCGRPFLWVIRD-----------IEEVNTLSCREELEGLGKIVSWCSQIEVLSRPA 367
           +GLL CG PFLWVIRD            +E   LSCR ELE LG+IV WCSQ+EVLS P+
Sbjct: 303 KGLLDCGLPFLWVIRDKVDKKGDDNEAKQEEAMLSCRVELEELGRIVPWCSQVEVLSSPS 362

Query: 368 TGCFLTHCGWNSTLESLVCGVPVVVFPQWSDQGTNAKIIQDMSETGVRLEVGMDGVVKRE 424
            GCF+THCGWNS+LESLV GVPVV FPQW+DQGTNAK+I+D  +TGVR+   ++G+V  E
Sbjct: 363 LGCFVTHCGWNSSLESLVSGVPVVAFPQWTDQGTNAKLIEDFWKTGVRVTPNVEGIVTGE 422

BLAST of Cp4.1LG17g03950 vs. NCBI nr
Match: gi|596085520|ref|XP_007221288.1| (hypothetical protein PRUPE_ppa016890mg [Prunus persica])

HSP 1 Score: 501.5 bits (1290), Expect = 1.4e-138
Identity = 256/470 (54.47%), Postives = 324/470 (68.94%), Query Frame = 1

Query: 8   RHRVLLITYSAQGHINPALEFAKRLTRRR-IDVTFVTSLSAYRRMGKTPTLPHVSFASFS 67
           +HR L +TY AQGHINPAL+ AKRL R     VT+VTSL AYRR+    T   +++A +S
Sbjct: 3   QHRFLFLTYPAQGHINPALQLAKRLIRNTGAQVTYVTSLYAYRRIVNGSTPNGLTYAPYS 62

Query: 68  DGYDDGFKQGDDINHFMSELERRGSQAIKDMIVAGVEQGQPFTCIVYSILLPWVAIVARS 127
           DGYDDGFK  DD++H+MSEL R GSQ I D++ +  ++G P+TC+VY+ILLPW A +AR 
Sbjct: 63  DGYDDGFKFSDDVDHYMSELRRAGSQVITDLVASSAKEGHPYTCLVYTILLPWAADLARE 122

Query: 128 LHLPAILLWIQPAIVFALYYYYNYGYHDIIQSVF----DDPLATIQLPGLPL-LTARDLP 187
           LHLP++L WIQ A +F +YYYY  GY D+I+  F    +DP  +IQLPGLPL L +RDLP
Sbjct: 123 LHLPSVLAWIQAATLFDVYYYYLSGYKDLIRESFGTDTNDPSCSIQLPGLPLDLASRDLP 182

Query: 188 SFFGSSDAYEFALPIFRRQFELLEQETNPMVVINTFDELEHDALRAISNY---------- 247
           SF  + ++Y FALP+F +QFELLE+ET P++++NTFD LE +AL+AI  Y          
Sbjct: 183 SFMVAENSYNFALPLFEKQFELLERETKPIILVNTFDALEPEALKAIDKYNLIGIGPLIP 242

Query: 248 ---------------------------IDWLNSKPKGSVIYVSSGSISTLSKHQKEEIAR 307
                                      I+WLNSKP+GSVIYVS GS+S LSK Q EEIA+
Sbjct: 243 SAFLDGKDPSDTSFGGDLFQKSMDSSCIEWLNSKPEGSVIYVSFGSVSALSKDQMEEIAK 302

Query: 308 GLLSCGRPFLWVIRDIEEVN-----------TLSCREELEGLGKIVSWCSQIEVLSRPAT 367
           GLL  GRPFLWVIR+ EE N             SCREEL+ LGKIV WCSQ+EVLS P+ 
Sbjct: 303 GLLDYGRPFLWVIREKEERNGQDNETEKEEEKFSCREELKELGKIVLWCSQLEVLSNPSL 362

Query: 368 GCFLTHCGWNSTLESLVCGVPVVVFPQWSDQGTNAKIIQDMSETGVRLEVGMDGVVKREE 424
           GCF+THCGWNS++ESLV GVPVV FP W+DQ TNAK+I+D  +TGVR+    +G+V  EE
Sbjct: 363 GCFVTHCGWNSSMESLVSGVPVVAFPLWTDQRTNAKLIEDTWKTGVRVAPNEEGIVVGEE 422

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
UGT1_GARJA6.0e-12952.83Crocetin glucosyltransferase, chloroplastic OS=Gardenia jasminoides GN=UGT75L6 P... [more]
U75D1_ARATH1.0e-11247.55UDP-glycosyltransferase 75D1 OS=Arabidopsis thaliana GN=UGT75D1 PE=2 SV=2[more]
5GT1_PERFR2.2e-10746.12Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 1 OS=Perilla frutescens GN=P... [more]
5GT_VERHY2.9e-10747.29Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase OS=Verbena hybrida GN=HGT8 P... [more]
5GT2_PERFR2.5e-10346.21Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 2 OS=Perilla frutescens GN=P... [more]
Match NameE-valueIdentityDescription
A0A0A0KA46_CUCSA2.1e-16565.52UDP-glucose:flavonoid 7-O-glucosyltransferase OS=Cucumis sativus GN=Csa_6G109750... [more]
F6I4F4_VITVI3.5e-14455.00Glycosyltransferase OS=Vitis vinifera GN=VIT_05s0062g00640 PE=3 SV=1[more]
A7MAV1_PYRCO2.4e-14054.78Glycosyltransferase OS=Pyrus communis PE=2 SV=1[more]
M5X8U4_PRUPE1.0e-13854.47Glycosyltransferase OS=Prunus persica GN=PRUPE_ppa016890mg PE=3 SV=1[more]
A7MAS5_MALDO6.5e-13854.35Glycosyltransferase OS=Malus domestica PE=2 SV=1[more]
Match NameE-valueIdentityDescription
AT4G15550.15.8e-11447.55 indole-3-acetate beta-D-glucosyltransferase[more]
AT4G14090.17.1e-10444.93 UDP-Glycosyltransferase superfamily protein[more]
AT1G05560.16.5e-9745.10 UDP-glucosyltransferase 75B1[more]
AT1G05530.12.5e-9643.45 UDP-glucosyl transferase 75B2[more]
AT4G15490.13.4e-7436.52 UDP-Glycosyltransferase superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659116578|ref|XP_008458144.1|1.9e-17067.53PREDICTED: anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 1-like [Cucumis m... [more]
gi|449445445|ref|XP_004140483.1|3.1e-16565.52PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Cucumis sativus][more]
gi|225433620|ref|XP_002263700.1|5.1e-14455.00PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Vitis vinifera][more]
gi|63028446|gb|AAY27090.1|3.4e-14054.78UDP-glucose:flavonoid 7-O-glucosyltransferase [Pyrus communis][more]
gi|596085520|ref|XP_007221288.1|1.4e-13854.47hypothetical protein PRUPE_ppa016890mg [Prunus persica][more]
The following terms have been associated with this gene:
Vocabulary: Biological Process
TermDefinition
GO:0008152metabolic process
Vocabulary: Molecular Function
TermDefinition
GO:0016758transferase activity, transferring hexosyl groups
Vocabulary: INTERPRO
TermDefinition
IPR002213UDP_glucos_trans
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016758 transferase activity, transferring hexosyl groups

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG17g03950.1Cp4.1LG17g03950.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePANTHERPTHR11926GLUCOSYL/GLUCURONOSYL TRANSFERASEScoord: 6..424
score: 1.4E
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePFAMPF00201UDPGTcoord: 234..386
score: 4.8
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePROSITEPS00375UDPGTcoord: 301..344
scor
NoneNo IPR availableGENE3DG3DSA:3.40.50.2000coord: 243..354
score: 4.8E-11coord: 10..242
score: 9.
NoneNo IPR availablePANTHERPTHR11926:SF98UDP-GLYCOSYLTRANSFERASE 75B1-RELATEDcoord: 6..424
score: 1.4E
NoneNo IPR availableunknownSSF53756UDP-Glycosyltransferase/glycogen phosphorylasecoord: 10..424
score: 8.64E

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cp4.1LG17g03950Silver-seed gourdcarcpeB0794
Cp4.1LG17g03950Silver-seed gourdcarcpeB0851
Cp4.1LG17g03950Cucumber (Chinese Long) v3cpecucB0410
Cp4.1LG17g03950Wax gourdcpewgoB0381
Cp4.1LG17g03950Cucurbita pepo (Zucchini)cpecpeB159
Cp4.1LG17g03950Bottle gourd (USVL1VR-Ls)cpelsiB248
Cp4.1LG17g03950Watermelon (Charleston Gray)cpewcgB299
Cp4.1LG17g03950Watermelon (97103) v1cpewmB335
Cp4.1LG17g03950Melon (DHL92) v3.5.1cpemeB285
Cp4.1LG17g03950Cucumber (Gy14) v2cgybcpeB790
Cp4.1LG17g03950Melon (DHL92) v3.6.1cpemedB333