CSPI06G08930 (gene) Wild cucumber (PI 183967)

NameCSPI06G08930
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionUDP-glucose:flavonoid 7-O-glucosyltransferase
LocationChr6 : 7480903 .. 7483168 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TATTTAAATACATAAATACTATAGGTTTGCTGTGGCCTATGAGGCCACGTCCTTGAAAATCATGACCACAATAGGACCTTTCAAATAGAAAAATTGAATCAAAACTACAACTCCCAAATGAAATGCGTCACATAAACATGTATGAATTTCGTCATAGCAAATGTCATTCCTAATTCCCAAACTATCCAATCAACTTTCCTATTAAATACGAACCTCACACACCTTACATTTTCTATCCTAATCCACAGTATGAATAACACAACACCCAATCCCAATCCCCGTCATGTCCTTTTAGTTACACATTGTGCTCAAGGCCACATCAATCCCACCCTCCAACTCGCCAAGCGCCTCACCCGCCATGGGGATCTCCATGTCACCTTCCTCATCTCTCTATCCGCCTACCGCCGTATGGGTCATACCCCAACTCTCCCACATATCACTTTTGCCTCCTTCTCCGATGGCTACGATGATGGTTTCAAACCCAGTGACGACATTAAGCTTTATATATCCGAGCTTGAGCGTCGTGGATCTGATGCTTTGAAGAATATAATCCAGGAGAGTAGAAACAAAGGTCAACCCTTCACTTGTATCGTGTATTCCATACTCATCCCTTGGGTGGCTACGGTGGCACGTTCCCTCGATGTTGCGTCGGTTCATCTTTGGATTCAACCGGCTGTCGTTTTCGCATTGTATTACTATTACAACAACGGATATTACGACGAAATTCAAAGGATTGCCTCTGGGGATGATCCTAGTTCAACGAGTATTAAATTACCTGGGCTTCCATTGTTGAGTGCTCGTGATCTTCCATCATTTTTTGGCGCTTCAGATGGTTATTCTTTTGCACTCCCAATGTTTAGGAAGCAATTTGAATTACTAGAGGAAGAGAGTAATCCAAAGATTTTAATCAACACATTTGAAGAGCTGGAGAAAGATGCCGTGAAAGCAATTAAGAAATTTCATTTGATGCCTATCGGACCATTGATTCCATCTGTTCTTGTTGATGGAAATGACCCATCAGAAGCTTCTTCTGGATGTGACCTATTTCGATCTACGAGCAGGTATATGATTTATCGTCCAACTACAACCAATCCATGTTTTCATTTTGATATGAATTGGTTGGATAACTATTTCCGTTTTGATTTTTTCTTCTTGAAAAGTTAGTGTCGTCTAAACATACATTCCATCTTGTTTATTATTTATATTTTATAGATGTTTTCAATATTAAAATCTAAAAATATCATTTTTTTAAAAAAAACTAGTTGTCTTTTATGAATTTAACTGTTGTAAAAATTTAAGAACAATTATGTTCGATTTTCAAATCTCAAAAACAAAAAACATAACAATTGTCAAAATATTTTCAACACAGTTATATGGAGTGGTTGAACTCGAAACCTAAAGCATCAGTTGTCTACGTATCGATGGGAAGCATTTCAACAGTATCAAAGCAACAAAAAGAGGAGATAGCGAGAGGATTATCAATAACAAAACGACCATTTTTATGGGTTATCCGAAACATTGAAGAAGAAGAAGATTTTTTAAGCTTTAAAGAAAAACTAGAAACTCAAGGGAAAATAGTTTCATGGTGTGCTCAACTTGAGGTTCTCTCAAGTCCAGCCACAGGCTGCTTTCTCACACATTGTGGTTGGAATTCTTGTTTGGAGAGTCTAGCTTGCGGTGTCCCAAACGTGGCATTTCCACAATGGTCCGATCAAGCGACCAACAGTAAGATCATTGAGGACTTGTCAGAGACCGGGGTGAGGTTAGAGGTGGAGGAAGAGGGCGTGGTTAAGGGAGAGGAGATAGAAAGGTGCTTGGAGTTGGTAATGGGAGATTCAAAGAAAGGAGAAGAAATAAGGAGGAATGCTTTGAAATGGAAGAAATTGGCTAAGGAAGCTGCTAGTGAGGGTGGTTCATCGTTTGCCAATTTGAAGGCTTTTGTGGATCACGTATGTTCTTAGTGGTTGAGTTCAAGATGCAACCCATCGTCAAAATTAATTACATACAATTTCTAGTACCGACGTTACTATTTCGTTTTTAAAATAGTGTCATGTGGGGTTCAACCACATTATTATCATGCATGTCTATTTAATATTTATGAGAACATATTTCTAAAAAAATATTTGACACGTGCATATATTACTTTACAAATTTAGTGTGGTAAATTCTTTTTCTTGGTAGAAATAGATTTTGTCCGCAAAACAAATGTCCTAAAAAACCTATTTTTATCGAAGAGAGAATGATCGCATGTAAAAGATATGTGTGC

mRNA sequence

ATGAATTTCGTCATAGCAAATGTCATTCCTAATTCCCAAACTATCCAATCAACTTTCCTATTAAATACGAACCTCACACACCTTACATTTTCTATCCTAATCCACAGTATGAATAACACAACACCCAATCCCAATCCCCGTCATGTCCTTTTAGTTACACATTGTGCTCAAGGCCACATCAATCCCACCCTCCAACTCGCCAAGCGCCTCACCCGCCATGGGGATCTCCATGTCACCTTCCTCATCTCTCTATCCGCCTACCGCCGTATGGGTCATACCCCAACTCTCCCACATATCACTTTTGCCTCCTTCTCCGATGGCTACGATGATGGTTTCAAACCCAGTGACGACATTAAGCTTTATATATCCGAGCTTGAGCGTCGTGGATCTGATGCTTTGAAGAATATAATCCAGGAGAGTAGAAACAAAGGTCAACCCTTCACTTGTATCGTGTATTCCATACTCATCCCTTGGGTGGCTACGGTGGCACGTTCCCTCGATGTTGCGTCGGTTCATCTTTGGATTCAACCGGCTGTCGTTTTCGCATTGTATTACTATTACAACAACGGATATTACGACGAAATTCAAAGGATTGCCTCTGGGGATGATCCTAGTTCAACGAGTATTAAATTACCTGGGCTTCCATTGTTGAGTGCTCGTGATCTTCCATCATTTTTTGGCGCTTCAGATGGTTATTCTTTTGCACTCCCAATGTTTAGGAAGCAATTTGAATTACTAGAGGAAGAGAGTAATCCAAAGATTTTAATCAACACATTTGAAGAGCTGGAGAAAGATGCCGTGAAAGCAATTAAGAAATTTCATTTGATGCCTATCGGACCATTGATTCCATCTGTTCTTGTTGATGGAAATGACCCATCAGAAGCTTCTTCTGGATGTGACCTATTTCGATCTACGAGCAGTTATATGGAGTGGTTGAACTCGAAACCTAAAGCATCAGTTGTCTACGTATCGATGGGAAGCATTTCAACAGTATCAAAGCAACAAAAAGAGGAGATAGCGAGAGGATTATCAATAACAAAACGACCATTTTTATGGGTTATCCGAAACATTGAAGAAGAAGAAGATTTTTTAAGCTTTAAAGAAAAACTAGAAACTCAAGGGAAAATAGTTTCATGGTGTGCTCAACTTGAGGTTCTCTCAAGTCCAGCCACAGGCTGCTTTCTCACACATTGTGGTTGGAATTCTTGTTTGGAGAGTCTAGCTTGCGGTGTCCCAAACGTGGCATTTCCACAATGGTCCGATCAAGCGACCAACAGTAAGATCATTGAGGACTTGTCAGAGACCGGGGTGAGGTTAGAGGTGGAGGAAGAGGGCGTGGTTAAGGGAGAGGAGATAGAAAGGTGCTTGGAGTTGGTAATGGGAGATTCAAAGAAAGGAGAAGAAATAAGGAGGAATGCTTTGAAATGGAAGAAATTGGCTAAGGAAGCTGCTAGTGAGGGTGGTTCATCGTTTGCCAATTTGAAGGCTTTTGTGGATCACGTATGTTCTTAG

Coding sequence (CDS)

ATGAATTTCGTCATAGCAAATGTCATTCCTAATTCCCAAACTATCCAATCAACTTTCCTATTAAATACGAACCTCACACACCTTACATTTTCTATCCTAATCCACAGTATGAATAACACAACACCCAATCCCAATCCCCGTCATGTCCTTTTAGTTACACATTGTGCTCAAGGCCACATCAATCCCACCCTCCAACTCGCCAAGCGCCTCACCCGCCATGGGGATCTCCATGTCACCTTCCTCATCTCTCTATCCGCCTACCGCCGTATGGGTCATACCCCAACTCTCCCACATATCACTTTTGCCTCCTTCTCCGATGGCTACGATGATGGTTTCAAACCCAGTGACGACATTAAGCTTTATATATCCGAGCTTGAGCGTCGTGGATCTGATGCTTTGAAGAATATAATCCAGGAGAGTAGAAACAAAGGTCAACCCTTCACTTGTATCGTGTATTCCATACTCATCCCTTGGGTGGCTACGGTGGCACGTTCCCTCGATGTTGCGTCGGTTCATCTTTGGATTCAACCGGCTGTCGTTTTCGCATTGTATTACTATTACAACAACGGATATTACGACGAAATTCAAAGGATTGCCTCTGGGGATGATCCTAGTTCAACGAGTATTAAATTACCTGGGCTTCCATTGTTGAGTGCTCGTGATCTTCCATCATTTTTTGGCGCTTCAGATGGTTATTCTTTTGCACTCCCAATGTTTAGGAAGCAATTTGAATTACTAGAGGAAGAGAGTAATCCAAAGATTTTAATCAACACATTTGAAGAGCTGGAGAAAGATGCCGTGAAAGCAATTAAGAAATTTCATTTGATGCCTATCGGACCATTGATTCCATCTGTTCTTGTTGATGGAAATGACCCATCAGAAGCTTCTTCTGGATGTGACCTATTTCGATCTACGAGCAGTTATATGGAGTGGTTGAACTCGAAACCTAAAGCATCAGTTGTCTACGTATCGATGGGAAGCATTTCAACAGTATCAAAGCAACAAAAAGAGGAGATAGCGAGAGGATTATCAATAACAAAACGACCATTTTTATGGGTTATCCGAAACATTGAAGAAGAAGAAGATTTTTTAAGCTTTAAAGAAAAACTAGAAACTCAAGGGAAAATAGTTTCATGGTGTGCTCAACTTGAGGTTCTCTCAAGTCCAGCCACAGGCTGCTTTCTCACACATTGTGGTTGGAATTCTTGTTTGGAGAGTCTAGCTTGCGGTGTCCCAAACGTGGCATTTCCACAATGGTCCGATCAAGCGACCAACAGTAAGATCATTGAGGACTTGTCAGAGACCGGGGTGAGGTTAGAGGTGGAGGAAGAGGGCGTGGTTAAGGGAGAGGAGATAGAAAGGTGCTTGGAGTTGGTAATGGGAGATTCAAAGAAAGGAGAAGAAATAAGGAGGAATGCTTTGAAATGGAAGAAATTGGCTAAGGAAGCTGCTAGTGAGGGTGGTTCATCGTTTGCCAATTTGAAGGCTTTTGTGGATCACGTATGTTCTTAG
BLAST of CSPI06G08930 vs. Swiss-Prot
Match: UGT1_GARJA (Crocetin glucosyltransferase, chloroplastic OS=Gardenia jasminoides GN=UGT75L6 PE=1 SV=1)

HSP 1 Score: 496.9 bits (1278), Expect = 2.6e-139
Identity = 259/465 (55.70%), Postives = 331/465 (71.18%), Query Frame = 1

Query: 47  RHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRM----GHTPTLPHITFA 106
           RHVLL+T+ AQGHINP LQ A+RL R G + VT   S+ A  RM    G TP    +TFA
Sbjct: 5   RHVLLITYPAQGHINPALQFAQRLLRMG-IQVTLATSVYALSRMKKSSGSTPK--GLTFA 64

Query: 107 SFSDGYDDGFKPSD-DIKLYISELERRGSDALKNIIQESRNKGQPFTCIVYSILIPWVAT 166
           +FSDGYDDGF+P   D   Y+S L ++GS+ L+N+I  S ++G P TC+VY++L+PW AT
Sbjct: 65  TFSDGYDDGFRPKGVDHTEYMSSLAKQGSNTLRNVINTSADQGCPVTCLVYTLLLPWAAT 124

Query: 167 VARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARD 226
           VAR   + S  LWIQP  V  +YYYY  GY D+++   + +DP+  SI+ PGLP + A+D
Sbjct: 125 VARECHIPSALLWIQPVAVMDIYYYYFRGYEDDVKN--NSNDPT-WSIQFPGLPSMKAKD 184

Query: 227 LPSFFGASDG--YSFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIG 286
           LPSF   S    YSFALP F+KQ E L+EE  PK+L+NTF+ LE  A+KAI+ ++L+ IG
Sbjct: 185 LPSFILPSSDNIYSFALPTFKKQLETLDEEERPKVLVNTFDALEPQALKAIESYNLIAIG 244

Query: 287 PLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEI 346
           PL PS  +DG DPSE S   DLF+ +  Y EWLNS+P  SVVYVS GS+ T+ KQQ EEI
Sbjct: 245 PLTPSAFLDGKDPSETSFSGDLFQKSKDYKEWLNSRPAGSVVYVSFGSLLTLPKQQMEEI 304

Query: 347 ARGLSITKRPFLWVIR-----NIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCF 406
           ARGL  + RPFLWVIR       E+EED L   E+LE QG IV WC+Q+EVL+ P+ GCF
Sbjct: 305 ARGLLKSGRPFLWVIRAKENGEEEKEEDRLICMEELEEQGMIVPWCSQIEVLTHPSLGCF 364

Query: 407 LTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRLEVEEEGVVKGEEIER 466
           +THCGWNS LE+L CGVP VAFP W+DQ TN+K+IED+ ETGVR+   E+G V+ +EI+R
Sbjct: 365 VTHCGWNSTLETLVCGVPVVAFPHWTDQGTNAKLIEDVWETGVRVVPNEDGTVESDEIKR 424

Query: 467 CLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVD 500
           C+E VM D +KG E++RNA KWK+LA+EA  E GSS  NLKAFV+
Sbjct: 425 CIETVMDDGEKGVELKRNAKKWKELAREAMQEDGSSDKNLKAFVE 463

BLAST of CSPI06G08930 vs. Swiss-Prot
Match: 5GT1_PERFR (Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 1 OS=Perilla frutescens GN=PF3R4 PE=1 SV=1)

HSP 1 Score: 437.2 bits (1123), Expect = 2.5e-121
Identity = 240/467 (51.39%), Postives = 305/467 (65.31%), Query Frame = 1

Query: 47  RHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTL-----PHITF 106
           R VLL T  AQGHINP LQ AKRL + G   VTF  S+ A+RRM +T +      P + F
Sbjct: 4   RRVLLATFPAQGHINPALQFAKRLLKAGT-DVTFFTSVYAWRRMANTASAAAGNPPGLDF 63

Query: 107 ASFSDGYDDGFKPSDDIKLYISELERRGSDALKNIIQESRNKGQPFTCIVYSILIPWVAT 166
            +FSDGYDDG KP  D K Y+SE++ RGS+AL+N++  + +     T +VYS L  W A 
Sbjct: 64  VAFSDGYDDGLKPCGDGKRYMSEMKARGSEALRNLLLNNHD----VTFVVYSHLFAWAAE 123

Query: 167 VARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARD 226
           VAR   V S  LW++PA V  +YY+Y NGY DEI       D  S  I+LP LP L  R 
Sbjct: 124 VARESQVPSALLWVEPATVLCIYYFYFNGYADEI-------DAGSDEIQLPRLPPLEQRS 183

Query: 227 LPSFFGASDGYSFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPL 286
           LP+F        F L M +++ E L+ E   K+L+NTF+ LE DA+ AI ++ L+ IGPL
Sbjct: 184 LPTFLLPETPERFRL-MMKEKLETLDGEEKAKVLVNTFDALEPDALTAIDRYELIGIGPL 243

Query: 287 IPSVLVDGNDPSEASSGCDLFRST--SSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEI 346
           IPS  +DG DPSE S G DLF  +  ++ +EWL++KPK+SVVYVS GS+    K Q EEI
Sbjct: 244 IPSAFLDGGDPSETSYGGDLFEKSEENNCVEWLDTKPKSSVVYVSFGSVLRFPKAQMEEI 303

Query: 347 ARGLSITKRPFLWVIRNI-----EEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCF 406
            +GL    RPFLW+IR       EEEE+ LS   +L+  GKIVSWC+QLEVL+ PA GCF
Sbjct: 304 GKGLLACGRPFLWMIREQKNDDGEEEEEELSCIGELKKMGKIVSWCSQLEVLAHPALGCF 363

Query: 407 LTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRLEVEEEGVVKGEEIER 466
           +THCGWNS +ESL+CGVP VA PQW DQ TN+K+IED   TGVR+ + E G V G EIER
Sbjct: 364 VTHCGWNSAVESLSCGVPVVAVPQWFDQTTNAKLIEDAWGTGVRVRMNEGGGVDGSEIER 423

Query: 467 CLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHV 502
           C+E+VM   +K + +R NA+KWK LA+EA  E GSS  NL AF+  V
Sbjct: 424 CVEMVMDGGEKSKLVRENAIKWKTLAREAMGEDGSSLKNLNAFLHQV 457

BLAST of CSPI06G08930 vs. Swiss-Prot
Match: 5GT_VERHY (Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase OS=Verbena hybrida GN=HGT8 PE=2 SV=1)

HSP 1 Score: 437.2 bits (1123), Expect = 2.5e-121
Identity = 240/463 (51.84%), Postives = 306/463 (66.09%), Query Frame = 1

Query: 48  HVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPH--ITFASFS 107
           HVLL T  AQGHINP LQ AKRL  + D+ VTF  S+ A+RRM  T    +  I F SFS
Sbjct: 5   HVLLATFPAQGHINPALQFAKRLA-NADIQVTFFTSVYAWRRMSRTAAGSNGLINFVSFS 64

Query: 108 DGYDDGFKPSDDIKLYISELERRGSDALKNIIQESR--NKGQPFTCIVYSILIPWVATVA 167
           DGYDDG +P DD K Y+SE++ RG  AL + +  +    K    T +VYS L  W A VA
Sbjct: 65  DGYDDGLQPGDDGKNYMSEMKSRGIKALSDTLAANNVDQKSSKITFVVYSHLFAWAAKVA 124

Query: 168 RSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPG-LPLLSARDL 227
           R   + S  LWI+PA V  ++Y+Y NGY DEI       D  S +I LPG LP+L+ RDL
Sbjct: 125 REFHLRSALLWIEPATVLDIFYFYFNGYSDEI-------DAGSDAIHLPGGLPVLAQRDL 184

Query: 228 PSFFGASDGYSFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLI 287
           PSF   S    F   + +++ E LE E  PK+L+N+F+ LE DA+KAI K+ ++ IGPLI
Sbjct: 185 PSFLLPSTHERFR-SLMKEKLETLEGEEKPKVLVNSFDALEPDALKAIDKYEMIAIGPLI 244

Query: 288 PSVLVDGNDPSEASSGCDLFRSTSS---YMEWLNSKPKASVVYVSMGSISTVSKQQKEEI 347
           PS  +DG DPS+ S G DLF   S+    +EWL++ P++SVVYVS GS    +K Q EEI
Sbjct: 245 PSAFLDGKDPSDRSFGGDLFEKGSNDDDCLEWLSTNPRSSVVYVSFGSFVNTTKSQMEEI 304

Query: 348 ARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCG 407
           ARGL    RPFLWV+R  E EE  +S  E+L+  GKIVSWC+QLEVL+ P+ GCF+THCG
Sbjct: 305 ARGLLDCGRPFLWVVRVNEGEEVLISCMEELKRVGKIVSWCSQLEVLTHPSLGCFVTHCG 364

Query: 408 WNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRLEVEEEG-VVKGEEIERCLEL 467
           WNS LES++ GVP VAFPQW DQ TN+K++ED+  TGVR+   EEG VV G+EI RC+E 
Sbjct: 365 WNSTLESISFGVPMVAFPQWFDQGTNAKLMEDVWRTGVRVRANEEGSVVDGDEIRRCIEE 424

Query: 468 VMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHV 502
           VM   +K  ++R +A KWK LA++A  E GSS  NLK F+D V
Sbjct: 425 VMDGGEKSRKLRESAGKWKDLARKAMEEDGSSVNNLKVFLDEV 458

BLAST of CSPI06G08930 vs. Swiss-Prot
Match: U75D1_ARATH (UDP-glycosyltransferase 75D1 OS=Arabidopsis thaliana GN=UGT75D1 PE=2 SV=2)

HSP 1 Score: 424.1 bits (1089), Expect = 2.2e-117
Identity = 238/488 (48.77%), Postives = 324/488 (66.39%), Query Frame = 1

Query: 38  NNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRH-GDLHVTFLISLSAY-RRMGHTPT 97
           NN + +P   H L VT  AQGHINP+L+LAKRL        VTF  S+SAY RRM  T  
Sbjct: 3   NNNSNSPTGPHFLFVTFPAQGHINPSLELAKRLAGTISGARVTFAASISAYNRRMFSTEN 62

Query: 98  LPH-ITFASFSDGYDDGFKPS--------DDIKLYISELERRGSDALKNIIQESRNKGQP 157
           +P  + FA++SDG+DDGFK S        D    ++SE+ RRG + L  +I+++R + +P
Sbjct: 63  VPETLIFATYSDGHDDGFKSSAYSDKSRQDATGNFMSEMRRRGKETLTELIEDNRKQNRP 122

Query: 158 FTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSS 217
           FTC+VY+IL+ WVA +AR   + S  LW+QP  VF+++Y+Y NGY D I  +A   +  S
Sbjct: 123 FTCVVYTILLTWVAELAREFHLPSALLWVQPVTVFSIFYHYFNGYEDAISEMA---NTPS 182

Query: 218 TSIKLPGLPLLSARDLPSFFGASDGYSFALPMFRKQFELLEEESNPKILINTFEELEKDA 277
           +SIKLP LPLL+ RD+PSF  +S+ Y+F LP FR+Q + L+EE NPKILINTF+ELE +A
Sbjct: 183 SSIKLPSLPLLTVRDIPSFIVSSNVYAFLLPAFREQIDSLKEEINPKILINTFQELEPEA 242

Query: 278 VKAI-KKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSM 337
           + ++   F ++P+GPL+ ++  D             F S   Y+EWL++K  +SV+YVS 
Sbjct: 243 MSSVPDNFKIVPVGPLL-TLRTD-------------FSSRGEYIEWLDTKADSSVLYVSF 302

Query: 338 GSISTVSKQQKEEIARGLSITKRPFLWVI-----RNIEEEED-----FLSFKEKLETQGK 397
           G+++ +SK+Q  E+ + L  ++RPFLWVI     RN E+E++       SF+E+L+  G 
Sbjct: 303 GTLAVLSKKQLVELCKALIQSRRPFLWVITDKSYRNKEDEQEKEEDCISSFREELDEIGM 362

Query: 398 IVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSET 457
           +VSWC Q  VL+  + GCF+THCGWNS LESL  GVP VAFPQW+DQ  N+K++ED  +T
Sbjct: 363 VVSWCDQFRVLNHRSIGCFVTHCGWNSTLESLVSGVPVVAFPQWNDQMMNAKLLEDCWKT 422

Query: 458 GVRL--EVEEEG--VVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSF 500
           GVR+  + EEEG  VV  EEI RC+E VM D  K EE R NA +WK LA EA  EGGSSF
Sbjct: 423 GVRVMEKKEEEGVVVVDSEEIRRCIEEVMED--KAEEFRGNATRWKDLAAEAVREGGSSF 471

BLAST of CSPI06G08930 vs. Swiss-Prot
Match: 5GT2_PERFR (Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 2 OS=Perilla frutescens GN=PF3R6 PE=2 SV=1)

HSP 1 Score: 415.2 bits (1066), Expect = 1.0e-114
Identity = 229/451 (50.78%), Postives = 295/451 (65.41%), Query Frame = 1

Query: 47  RHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTL-----PHITF 106
           R VLL T  AQGHINP LQ AKRL + G   VTF  S+ A+RRM +T +      P + F
Sbjct: 4   RRVLLATFPAQGHINPALQFAKRLLKAGT-DVTFFTSVYAWRRMANTASAAAGNPPGLDF 63

Query: 107 ASFSDGYDDGFKPSDDIKLYISELERRGSDALKNIIQESRNKGQPFTCIVYSILIPWVAT 166
            +FSDGYDDG KP  D K Y+SE++ RGS+AL+N++  + +     T +VYS L  W A 
Sbjct: 64  VAFSDGYDDGLKPGGDGKRYMSEMKARGSEALRNLLLNNDD----VTFVVYSHLFAWAAE 123

Query: 167 VARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARD 226
           VAR   V +  LW++PA V  +Y++Y NGY DEI       D  S  I+LP LP L  R 
Sbjct: 124 VARLSHVPTALLWVEPATVLCIYHFYFNGYADEI-------DAGSNEIQLPRLPSLEQRS 183

Query: 227 LPSFFGASDGYSFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPL 286
           LP+F   +    F L M +++ E L+ E   K+L+NTF+ LE DA+ AI ++ L+ IGPL
Sbjct: 184 LPTFLLPATPERFRL-MMKEKLETLDGEEKAKVLVNTFDALEPDALTAIDRYELIGIGPL 243

Query: 287 IPSVLVDGNDPSEASSGCDLFRST--SSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEI 346
           IPS  +DG DPSE S G DLF  +  ++ +EWLNSKPK+SVVYVS GS+    K Q EEI
Sbjct: 244 IPSAFLDGEDPSETSYGGDLFEKSEENNCVEWLNSKPKSSVVYVSFGSVLRFPKAQMEEI 303

Query: 347 ARGLSITKRPFLWVIRNI-------EEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATG 406
            +GL    RPFLW+IR         EEEE+ LS   +L+  GKIVSWC+QLEVL+ PA G
Sbjct: 304 GKGLLACGRPFLWMIREQKNDDGEEEEEEEELSCIGELKKMGKIVSWCSQLEVLAHPALG 363

Query: 407 CFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRLEVEEEGVVKGEEI 466
           CF+THCGWNS +ESL+CG+P VA PQW DQ TN+K+IED   TGVR+ + E G V G EI
Sbjct: 364 CFVTHCGWNSAVESLSCGIPVVAVPQWFDQTTNAKLIEDAWGTGVRVRMNEGGGVDGCEI 423

Query: 467 ERCLELVMGDSKKGEEIRRNALKWKKLAKEA 484
           ERC+E+VM    K + +R NA+KWK LA++A
Sbjct: 424 ERCVEMVMDGGDKTKLVRENAIKWKTLARQA 441

BLAST of CSPI06G08930 vs. TrEMBL
Match: A0A0A0KA46_CUCSA (UDP-glucose:flavonoid 7-O-glucosyltransferase OS=Cucumis sativus GN=Csa_6G109750 PE=4 SV=1)

HSP 1 Score: 937.6 bits (2422), Expect = 6.5e-270
Identity = 466/467 (99.79%), Postives = 467/467 (100.00%), Query Frame = 1

Query: 37  MNNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTL 96
           MNNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTL
Sbjct: 1   MNNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTL 60

Query: 97  PHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNIIQESRNKGQPFTCIVYSILI 156
           PHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNIIQESRNKGQPFTCIVYSILI
Sbjct: 61  PHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNIIQESRNKGQPFTCIVYSILI 120

Query: 157 PWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPL 216
           PWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPL
Sbjct: 121 PWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPL 180

Query: 217 LSARDLPSFFGASDGYSFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLM 276
           LSARDLPSFFGASDGYSFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLM
Sbjct: 181 LSARDLPSFFGASDGYSFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLM 240

Query: 277 PIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQK 336
           PIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQK
Sbjct: 241 PIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQK 300

Query: 337 EEIARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLT 396
           EEIARGLS+TKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLT
Sbjct: 301 EEIARGLSLTKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLT 360

Query: 397 HCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRLEVEEEGVVKGEEIERCL 456
           HCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRLEVEEEGVVKGEEIERCL
Sbjct: 361 HCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRLEVEEEGVVKGEEIERCL 420

Query: 457 ELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVCS 504
           ELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVCS
Sbjct: 421 ELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVCS 467

BLAST of CSPI06G08930 vs. TrEMBL
Match: F6I4F4_VITVI (Glycosyltransferase OS=Vitis vinifera GN=VIT_05s0062g00640 PE=3 SV=1)

HSP 1 Score: 538.1 bits (1385), Expect = 1.1e-149
Identity = 266/459 (57.95%), Postives = 345/459 (75.16%), Query Frame = 1

Query: 48  HVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDG 107
           H LLVT  AQGHINP LQ AKR+ R G   V+F  S+SA+RRM    T   + F  FSDG
Sbjct: 5   HFLLVTFPAQGHINPALQFAKRIIRTG-AQVSFATSVSAHRRMAKRSTPEGLNFVPFSDG 64

Query: 108 YDDGFKPSDDIKLYISELERRGSDALKNIIQESRNKGQPFTCIVYSILIPWVATVARSLD 167
           YDDGFKP+DD++ Y+SE++RRGS+ L+ I+  + ++GQPFTCIVY++L+PW A VAR L 
Sbjct: 65  YDDGFKPTDDVQHYMSEIKRRGSETLREIVVRNADEGQPFTCIVYTLLLPWAAEVARGLG 124

Query: 168 VASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFG 227
           V S  LWIQPA V  +YYYY NGY D  + I++  +PS  S++LPGLPLLS+RDLPSF  
Sbjct: 125 VPSALLWIQPATVLDIYYYYFNGYGDVFRNISN--EPSC-SVELPGLPLLSSRDLPSFLV 184

Query: 228 ASDGYSFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLV 287
            S+ Y+F LP F++Q E L +E++PK+L+NTF+ LE + ++A+ K HL+ IGPL+PS  +
Sbjct: 185 KSNAYTFVLPTFQEQLEALSQETSPKVLVNTFDALEPEPLRAVDKLHLIGIGPLVPSAYL 244

Query: 288 DGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEIARGLSITK 347
           DG DPS+ S G D+F+ +  YMEWLNSKPK+SVVYVS GSIS +SK QKE+IAR L    
Sbjct: 245 DGKDPSDTSFGGDMFQGSDDYMEWLNSKPKSSVVYVSFGSISVLSKTQKEDIARALLDCG 304

Query: 348 RPFLWVIRNIE-----EEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNS 407
            PFLWVIR  E     +E+D LS +E+LE +G IVSWC+Q+EVL+ P+ GCF++HCGWNS
Sbjct: 305 HPFLWVIRAPENGEEVKEQDKLSCREELEQKGMIVSWCSQIEVLTHPSLGCFVSHCGWNS 364

Query: 408 CLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRLEVEEEGVVKGEEIERCLELVMGD 467
            LESL  GVP VAFPQW+DQ TN+K+IED+ + G+R+ V EEG+V+ +E +RCLE+VMG 
Sbjct: 365 TLESLVSGVPVVAFPQWTDQGTNAKLIEDMWKIGIRVTVNEEGIVESDEFKRCLEIVMGG 424

Query: 468 SKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHV 502
            +KGEE+RRNA KWK LA+EA  +GGSS  NLK FVD V
Sbjct: 425 GEKGEEMRRNAEKWKNLAREAVKDGGSSDKNLKGFVDEV 459

BLAST of CSPI06G08930 vs. TrEMBL
Match: F6I4D5_VITVI (Glycosyltransferase OS=Vitis vinifera GN=VIT_05s0062g00350 PE=3 SV=1)

HSP 1 Score: 524.2 bits (1349), Expect = 1.7e-145
Identity = 274/457 (59.96%), Postives = 337/457 (73.74%), Query Frame = 1

Query: 48  HVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDG 107
           H+L+VT  +QGHINPTLQLAK L R G  HVTF  S SA  RM  +P L  + FA+FSDG
Sbjct: 4   HILIVTLPSQGHINPTLQLAKLLIRAG-AHVTFFTSTSAGTRMSKSPNLDGLEFATFSDG 63

Query: 108 YDDGFKPSDDIKLYISELERRGSDALKNIIQESRNKGQPFTCIVYSILIPWVATVARSLD 167
           YD G K  DD++ ++S++ER GS AL  +I  S N+G+PF C++Y + IPWVA VA SL 
Sbjct: 64  YDHGLKQGDDVEKFMSQIERLGSQALIELIMASANEGRPFACLLYGVQIPWVAEVAHSLH 123

Query: 168 VASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFG 227
           + S  +W QPA VF +YYYY NGY + IQ    GD PSST I+LPGLPLL+  DLPSF  
Sbjct: 124 IPSALVWTQPAAVFDIYYYYFNGYGELIQN--KGDHPSST-IELPGLPLLNNSDLPSFLI 183

Query: 228 ASDG--YSFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSV 287
              G  Y FALP F+K  E+L  ESNPK+LIN+F+ LE +A+ AI KF+LM IGPLIPS 
Sbjct: 184 PPKGNTYKFALPGFQKHLEMLNCESNPKVLINSFDALESEALGAINKFNLMGIGPLIPSA 243

Query: 288 LVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEIARGLSI 347
            +DG DPS+ S G DLFRS+  Y++WLNSKPK+SV+YVS GS+  +SKQQ EEIARGL  
Sbjct: 244 FLDGKDPSDTSFGGDLFRSSKDYIQWLNSKPKSSVIYVSFGSLFVLSKQQSEEIARGLLD 303

Query: 348 TKRPFLWVIRNIE-EEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCL 407
             RPFLWVIR  E EEE  LS  E+LE QG +V WC+Q+EVLS P+ GCF+TH GWNS L
Sbjct: 304 GGRPFLWVIRLEENEEEKTLSCHEELERQGMMVPWCSQVEVLSHPSMGCFVTHSGWNSTL 363

Query: 408 ESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRLEVEEEGVVKGEEIERCLELVMGDSK 467
           ESL  GVP VAFPQWSDQATN+K+IE + +TG+R  V +EG+V+ +EI+RCLELVMG  +
Sbjct: 364 ESLTSGVPVVAFPQWSDQATNAKLIEVVWKTGLRAMVNQEGIVEADEIKRCLELVMGSGE 423

Query: 468 KGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHV 502
           +GEE+RRNA KWK LA+EA  EGGSS  NLK F++ V
Sbjct: 424 RGEEMRRNATKWKVLAREAVKEGGSSDKNLKNFMNEV 456

BLAST of CSPI06G08930 vs. TrEMBL
Match: F6I4F7_VITVI (Glycosyltransferase OS=Vitis vinifera GN=VIT_05s0062g00700 PE=3 SV=1)

HSP 1 Score: 523.9 bits (1348), Expect = 2.2e-145
Identity = 266/460 (57.83%), Postives = 342/460 (74.35%), Query Frame = 1

Query: 49  VLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGY 108
           +LLVT+ AQGHINP+LQLAK LTR G  HVTF+ S SA  RM   PTL  + F +FSDGY
Sbjct: 5   ILLVTYPAQGHINPSLQLAKLLTRAG-AHVTFVTSSSASTRMSKPPTLEGLEFVTFSDGY 64

Query: 109 DDGFKPSDDIKLYISELERRGSDALKNIIQESRNKGQPFTCIVYSILIPWVATVARSLDV 168
           D GFK  DD++ ++SEL+R GS AL  +I    N+G+PFTC++Y I+IPWVA VA+S  +
Sbjct: 65  DHGFKHGDDLQNFMSELDRLGSQALTELIVARANEGRPFTCLLYGIIIPWVAEVAQSFHL 124

Query: 169 ASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGA 228
            S  +W Q A VF +YYYY NGY + I    +G   SS+SI+LPGLPLLS+ DLPSF   
Sbjct: 125 PSALVWSQAATVFDIYYYYFNGYGELIGNKGNG---SSSSIELPGLPLLSSSDLPSFLEP 184

Query: 229 SDG--YSFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVL 288
           S    ++F L   +KQ E L  ESNP++L+N+F+ LE +A++A+ KF LM IGPL+P   
Sbjct: 185 SKAIAFNFVLKSLQKQLEQLNRESNPRVLVNSFDALESEALRALNKFKLMGIGPLLPLAF 244

Query: 289 VDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEIARGLSIT 348
           +DG DPS+ S G DLFR +  Y++WLNSKP++SV+YVS GS+S +SKQQ EEIARGL  +
Sbjct: 245 LDGKDPSDTSFGGDLFRDSKDYIQWLNSKPESSVIYVSFGSLSVLSKQQSEEIARGLLAS 304

Query: 349 KRPFLWVIR-----NIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWN 408
            RPFLWVIR       E+E+D LS  E+LE QG IV WC+Q+EVLS P+ GCF++HCGWN
Sbjct: 305 GRPFLWVIRAKENGEEEKEDDKLSCVEELEQQGMIVPWCSQVEVLSHPSLGCFVSHCGWN 364

Query: 409 SCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRLEVEEEGVVKGEEIERCLELVMG 468
           S LESLACGVP VAFPQW+DQ TN+K+IED+ +TG+R+ V +EG+V+G EI++CLELVMG
Sbjct: 365 STLESLACGVPVVAFPQWTDQTTNAKLIEDVWKTGLRVMVNQEGIVEGGEIKKCLELVMG 424

Query: 469 DSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHV 502
             +KG+E+RRNA KWK LA+EA  EGGSS  NLK FV+ +
Sbjct: 425 CGEKGQEVRRNAKKWKDLAREAVKEGGSSDKNLKNFVNEI 460

BLAST of CSPI06G08930 vs. TrEMBL
Match: M5XTU9_PRUPE (Glycosyltransferase OS=Prunus persica GN=PRUPE_ppa005161mg PE=3 SV=1)

HSP 1 Score: 521.2 bits (1341), Expect = 1.4e-144
Identity = 269/467 (57.60%), Postives = 344/467 (73.66%), Query Frame = 1

Query: 50  LLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYD 109
           LLVT  AQGHINP+LQ AK L R    HVT++ SLSA  R+G+  T   +T++ +SDGYD
Sbjct: 7   LLVTFPAQGHINPSLQFAKHLVRTTGAHVTYVTSLSAQSRIGNGSTPHGLTYSLYSDGYD 66

Query: 110 DGFKPSDDIKLYISELERRGSDALKNIIQESRNKGQPFTCIVYSILIPWVATVARSLDVA 169
           +GFK  DDI  Y+SEL R G+ A+ ++I  S  +G+P+TC++Y+IL+PW A  AR L + 
Sbjct: 67  NGFKDGDDIDHYMSELRRCGAQAITDLIVSSAKEGRPYTCLIYTILLPWAAEAARELHLP 126

Query: 170 SVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSST--SIKLPGLPL-LSARDLPSFF 229
           SV +WIQPA VF +YYYY +GY D I++      P+    SI+LPGLPL L++RDLPSF 
Sbjct: 127 SVLVWIQPATVFDIYYYYFSGYKDLIRKNTCTTHPNGALCSIELPGLPLSLASRDLPSFM 186

Query: 230 GASDGYSFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVL 289
             S+ Y FALP+F +QFELLE E+ P IL+NTF+ LE +A+KAI K++L+ IGPLIPS  
Sbjct: 187 VGSNPYGFALPLFEEQFELLERETKPIILVNTFDALEPEALKAIDKYNLIGIGPLIPSAF 246

Query: 290 VDGNDPSEASSGCDLFRST--SSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEIARGLS 349
           +DG DPS+ S   D F+ +  SSY+EWLNS+P+ SVVYVS GSIS +SK Q EEIA+GL 
Sbjct: 247 LDGKDPSDKSFPGDRFQKSEDSSYIEWLNSRPEGSVVYVSFGSISVLSKPQMEEIAKGLL 306

Query: 350 ITKRPFLWVIRN----------IEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCF 409
            + RPFLWVIR            E+EE+ LS +E+LE  GKIV WC+Q+EVLSSP+ GCF
Sbjct: 307 DSGRPFLWVIREKEGSNGRDKEAEKEEEKLSCREELEELGKIVPWCSQVEVLSSPSLGCF 366

Query: 410 LTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRLEVEEEGVVKGEEIER 469
           +THCGWNS LESL  GVP VAFPQW+DQ TN+K+IED  +TGVR+   +EG+V GEE++R
Sbjct: 367 VTHCGWNSSLESLVSGVPVVAFPQWTDQGTNAKLIEDTWKTGVRVTPNDEGIVAGEELKR 426

Query: 470 CLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHV 502
           CLELVMG  + GEE+RRNA KWK LA+EA SEGGSS  NLKAF+D +
Sbjct: 427 CLELVMGSGEIGEELRRNAKKWKGLAREAVSEGGSSDRNLKAFLDQI 473

BLAST of CSPI06G08930 vs. TAIR10
Match: AT4G15550.1 (AT4G15550.1 indole-3-acetate beta-D-glucosyltransferase)

HSP 1 Score: 424.1 bits (1089), Expect = 1.2e-118
Identity = 238/488 (48.77%), Postives = 324/488 (66.39%), Query Frame = 1

Query: 38  NNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRH-GDLHVTFLISLSAY-RRMGHTPT 97
           NN + +P   H L VT  AQGHINP+L+LAKRL        VTF  S+SAY RRM  T  
Sbjct: 3   NNNSNSPTGPHFLFVTFPAQGHINPSLELAKRLAGTISGARVTFAASISAYNRRMFSTEN 62

Query: 98  LPH-ITFASFSDGYDDGFKPS--------DDIKLYISELERRGSDALKNIIQESRNKGQP 157
           +P  + FA++SDG+DDGFK S        D    ++SE+ RRG + L  +I+++R + +P
Sbjct: 63  VPETLIFATYSDGHDDGFKSSAYSDKSRQDATGNFMSEMRRRGKETLTELIEDNRKQNRP 122

Query: 158 FTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSS 217
           FTC+VY+IL+ WVA +AR   + S  LW+QP  VF+++Y+Y NGY D I  +A   +  S
Sbjct: 123 FTCVVYTILLTWVAELAREFHLPSALLWVQPVTVFSIFYHYFNGYEDAISEMA---NTPS 182

Query: 218 TSIKLPGLPLLSARDLPSFFGASDGYSFALPMFRKQFELLEEESNPKILINTFEELEKDA 277
           +SIKLP LPLL+ RD+PSF  +S+ Y+F LP FR+Q + L+EE NPKILINTF+ELE +A
Sbjct: 183 SSIKLPSLPLLTVRDIPSFIVSSNVYAFLLPAFREQIDSLKEEINPKILINTFQELEPEA 242

Query: 278 VKAI-KKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSM 337
           + ++   F ++P+GPL+ ++  D             F S   Y+EWL++K  +SV+YVS 
Sbjct: 243 MSSVPDNFKIVPVGPLL-TLRTD-------------FSSRGEYIEWLDTKADSSVLYVSF 302

Query: 338 GSISTVSKQQKEEIARGLSITKRPFLWVI-----RNIEEEED-----FLSFKEKLETQGK 397
           G+++ +SK+Q  E+ + L  ++RPFLWVI     RN E+E++       SF+E+L+  G 
Sbjct: 303 GTLAVLSKKQLVELCKALIQSRRPFLWVITDKSYRNKEDEQEKEEDCISSFREELDEIGM 362

Query: 398 IVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSET 457
           +VSWC Q  VL+  + GCF+THCGWNS LESL  GVP VAFPQW+DQ  N+K++ED  +T
Sbjct: 363 VVSWCDQFRVLNHRSIGCFVTHCGWNSTLESLVSGVPVVAFPQWNDQMMNAKLLEDCWKT 422

Query: 458 GVRL--EVEEEG--VVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSF 500
           GVR+  + EEEG  VV  EEI RC+E VM D  K EE R NA +WK LA EA  EGGSSF
Sbjct: 423 GVRVMEKKEEEGVVVVDSEEIRRCIEEVMED--KAEEFRGNATRWKDLAAEAVREGGSSF 471

BLAST of CSPI06G08930 vs. TAIR10
Match: AT4G14090.1 (AT4G14090.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 396.4 bits (1017), Expect = 2.7e-110
Identity = 221/472 (46.82%), Postives = 304/472 (64.41%), Query Frame = 1

Query: 36  SMNNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPT 95
           S+N +   P   H LLVT  AQGHINP LQLA RL  HG   VT+  ++SA+RRMG  P+
Sbjct: 4   SVNGSHRRP---HYLLVTFPAQGHINPALQLANRLIHHGAT-VTYSTAVSAHRRMGEPPS 63

Query: 96  LPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNIIQ---ESRNKGQPFTCIVY 155
              ++FA F+DG+DDG K  +D K+Y+SEL+R GS+AL++II+   ++  + +P T ++Y
Sbjct: 64  TKGLSFAWFTDGFDDGLKSFEDQKIYMSELKRCGSNALRDIIKANLDATTETEPITGVIY 123

Query: 156 SILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLP 215
           S+L+PWV+TVAR   + +  LWI+PA V  +YYYY N  Y  +  +          IKLP
Sbjct: 124 SVLVPWVSTVAREFHLPTTLLWIEPATVLDIYYYYFNTSYKHLFDVEP--------IKLP 183

Query: 216 GLPLLSARDLPSFFGASDGYSFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKK 275
            LPL++  DLPSF   S     AL   R+  E LE ESNPKIL+NTF  LE DA+ +++K
Sbjct: 184 KLPLITTGDLPSFLQPSKALPSALVTLREHIEALETESNPKILVNTFSALEHDALTSVEK 243

Query: 276 FHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSS-YMEWLNSKPKASVVYVSMGS-IST 335
             ++PIGPL+ S        SE  +  DLF+S+   Y +WL+SK + SV+Y+S+G+    
Sbjct: 244 LKMIPIGPLVSS--------SEGKT--DLFKSSDEDYTKWLDSKLERSVIYISLGTHADD 303

Query: 336 VSKQQKEEIARGLSITKRPFLWVIRNIEEEEDFLS-FKEKLE--TQGKIVSWCAQLEVLS 395
           + ++  E +  G+  T RPFLW++R    EE   + F E +    +G +V WC+Q  VL+
Sbjct: 304 LPEKHMEALTHGVLATNRPFLWIVREKNPEEKKKNRFLELIRGSDRGLVVGWCSQTAVLA 363

Query: 396 SPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRLEVEEEGVV 455
             A GCF+THCGWNS LESL  GVP VAFPQ++DQ T +K++ED    GV+++V EEG V
Sbjct: 364 HCAVGCFVTHCGWNSTLESLESGVPVVAFPQFADQCTTAKLVEDTWRIGVKVKVGEEGDV 423

Query: 456 KGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVD 500
            GEEI RCLE VM   ++ EE+R NA KWK +A +AA+EGG S  NLK FVD
Sbjct: 424 DGEEIRRCLEKVMSGGEEAEEMRENAEKWKAMAVDAAAEGGPSDLNLKGFVD 453

BLAST of CSPI06G08930 vs. TAIR10
Match: AT1G05530.1 (AT1G05530.1 UDP-glucosyl transferase 75B2)

HSP 1 Score: 373.2 bits (957), Expect = 2.5e-103
Identity = 210/468 (44.87%), Postives = 291/468 (62.18%), Query Frame = 1

Query: 48  HVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRR--MGHTPTLPHITFASFS 107
           H LLVT  AQGH+NP+L+ A+RL +     VTF   LS   R  + +   + +++F +FS
Sbjct: 5   HFLLVTFPAQGHVNPSLRFARRLIKTTGARVTFATCLSVIHRSMIPNHNNVENLSFLTFS 64

Query: 108 DGYDDG-FKPSDDIKLYISELERRGSDALKNIIQESRNKGQPFTCIVYSILIPWVATVAR 167
           DG+DDG    +DD++  +   ER G  AL + I+ ++N   P +C++Y+IL  WV  VAR
Sbjct: 65  DGFDDGVISNTDDVQNRLVHFERNGDKALSDFIEANQNGDSPVSCLIYTILPNWVPKVAR 124

Query: 168 SLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPS 227
              + SVHLWIQPA  F +YY Y+ G              +++  + P LP L  RDLPS
Sbjct: 125 RFHLPSVHLWIQPAFAFDIYYNYSTG--------------NNSVFEFPNLPSLEIRDLPS 184

Query: 228 FFGASDGYSFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPS 287
           F   S+    A  ++++  + L+EESNPKIL+NTF+ LE + + AI    ++ +GPL+P+
Sbjct: 185 FLSPSNTNKAAQAVYQELMDFLKEESNPKILVNTFDSLEPEFLTAIPNIEMVAVGPLLPA 244

Query: 288 VLVDGNDPSEASSGCDLFRS--TSSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEIARG 347
            +  G++     SG DL R   +SSY  WL+SK ++SV+YVS G++  +SK+Q EE+AR 
Sbjct: 245 EIFTGSE-----SGKDLSRDHQSSSYTLWLDSKTESSVIYVSFGTMVELSKKQIEELARA 304

Query: 348 LSITKRPFLWVIRN-------IEEEED-----FLSFKEKLETQGKIVSWCAQLEVLSSPA 407
           L    RPFLWVI +       IE EE+        F+ +LE  G IVSWC+Q+EVL   A
Sbjct: 305 LIEGGRPFLWVITDKLNREAKIEGEEETEIEKIAGFRHELEEVGMIVSWCSQIEVLRHRA 364

Query: 408 TGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRLEVEEEGVVKGE 467
            GCFLTHCGW+S LESL  GVP VAFP WSDQ  N+K++E++ +TGVR+    EG+V+  
Sbjct: 365 IGCFLTHCGWSSSLESLVLGVPVVAFPMWSDQPANAKLLEEIWKTGVRVRENSEGLVERG 424

Query: 468 EIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFV 499
           EI RCLE VM    K  E+R NA KWK+LA EA  EGGSS  N++AFV
Sbjct: 425 EIMRCLEAVM--EAKSVELRENAEKWKRLATEAGREGGSSDKNVEAFV 451

BLAST of CSPI06G08930 vs. TAIR10
Match: AT1G05560.1 (AT1G05560.1 UDP-glucosyltransferase 75B1)

HSP 1 Score: 360.9 bits (925), Expect = 1.3e-99
Identity = 202/472 (42.80%), Postives = 285/472 (60.38%), Query Frame = 1

Query: 46  PRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRR--MGHTPTLPHITFAS 105
           P H LLVT  AQGH+NP+L+ A+RL +     VTF+  +S +    + +   + +++F +
Sbjct: 3   PPHFLLVTFPAQGHVNPSLRFARRLIKRTGARVTFVTCVSVFHNSMIANHNKVENLSFLT 62

Query: 106 FSDGYDDG-FKPSDDIKLYISELERRGSDALKNIIQESRNKGQPFTCIVYSILIPWVATV 165
           FSDG+DDG     +D +     L+  G  AL + I+ ++N   P TC++Y+IL+ W   V
Sbjct: 63  FSDGFDDGGISTYEDRQKRSVNLKVNGDKALSDFIEATKNGDSPVTCLIYTILLNWAPKV 122

Query: 166 ARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDL 225
           AR   + S  LWIQPA+VF +YY +  G              + +  +LP L  L  RDL
Sbjct: 123 ARRFQLPSALLWIQPALVFNIYYTHFMG--------------NKSVFELPNLSSLEIRDL 182

Query: 226 PSFFGASDGYSFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLI 285
           PSF   S+    A   F++  E L +E+ PKILINTF+ LE +A+ A     ++ +GPL+
Sbjct: 183 PSFLTPSNTNKGAYDAFQEMMEFLIKETKPKILINTFDSLEPEALTAFPNIDMVAVGPLL 242

Query: 286 PSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEIARG 345
           P+ +  G+              +SSY  WL+SK ++SV+YVS G++  +SK+Q EE+AR 
Sbjct: 243 PTEIFSGSTNKSVKD------QSSSYTLWLDSKTESSVIYVSFGTMVELSKKQIEELARA 302

Query: 346 LSITKRPFLWVIRNI---------EEE---EDFLSFKEKLETQGKIVSWCAQLEVLSSPA 405
           L   KRPFLWVI +          EEE   E    F+ +LE  G IVSWC+Q+EVLS  A
Sbjct: 303 LIEGKRPFLWVITDKSNRETKTEGEEETEIEKIAGFRHELEEVGMIVSWCSQIEVLSHRA 362

Query: 406 TGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRLEVEEEGVVKGE 465
            GCF+THCGW+S LESL  GVP VAFP WSDQ TN+K++E+  +TGVR+   ++G+V+  
Sbjct: 363 VGCFVTHCGWSSTLESLVLGVPVVAFPMWSDQPTNAKLLEESWKTGVRVRENKDGLVERG 422

Query: 466 EIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVC 503
           EI RCLE VM   +K  E+R NA KWK+LA EA  EGGSS  N++AFV+ +C
Sbjct: 423 EIRRCLEAVM--EEKSVELRENAKKWKRLAMEAGREGGSSDKNMEAFVEDIC 452

BLAST of CSPI06G08930 vs. TAIR10
Match: AT1G05675.1 (AT1G05675.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 292.4 bits (747), Expect = 5.5e-79
Identity = 174/463 (37.58%), Postives = 263/463 (56.80%), Query Frame = 1

Query: 48  HVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPH------ITF 107
           HV+++   AQGHI P  Q  KRL     L +T ++       +   P+ P+      IT 
Sbjct: 6   HVIVLPFPAQGHITPMSQFCKRLASKS-LKITLVL-------VSDKPSPPYKTEHDTITV 65

Query: 108 ASFSDGYDDGFKPSDDIKLYISELERRGSDALKNIIQESRNKGQPFTCIVYSILIPWVAT 167
              S+G+ +G + S+D+  Y+  +E    + L  +I++ +  G P   +VY   +PW+  
Sbjct: 66  VPISNGFQEGQERSEDLDEYMERVESSIKNRLPKLIEDMKLSGNPPRALVYDSTMPWLLD 125

Query: 168 VARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARD 227
           VA S  ++    + QP +V A+YY+   G +     + S     ST    P LP+L+A D
Sbjct: 126 VAHSYGLSGAVFFTQPWLVSAIYYHVFKGSFS----VPSTKYGHSTLASFPSLPILNAND 185

Query: 228 LPSFFGASDGYSFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKK-FHLMPIGP 287
           LPSF   S  Y + L     Q   ++      +L NTF++LE+  +K IK  + ++ IGP
Sbjct: 186 LPSFLCESSSYPYILRTVIDQLSNIDRVDI--VLCNTFDKLEEKLLKWIKSVWPVLNIGP 245

Query: 288 LIPSVLVDGNDPSEASSGCDLFRST-SSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEI 347
            +PS+ +D     + + G  LF +  +  MEWLNSK  +SVVYVS GS+  + K Q  E+
Sbjct: 246 TVPSMYLDKRLAEDKNYGFSLFGAKIAECMEWLNSKQPSSVVYVSFGSLVVLKKDQLIEL 305

Query: 348 ARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCG 407
           A GL  +   FLWV+R  E  +   ++ E++  +G  VSW  QLEVL+  + GCF+THCG
Sbjct: 306 AAGLKQSGHFFLWVVRETERRKLPENYIEEIGEKGLTVSWSPQLEVLTHKSIGCFVTHCG 365

Query: 408 WNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRLEVEEEGVVKGEEIERCLELV 467
           WNS LE L+ GVP +  P W+DQ TN+K +ED+ + GVR++ + +G V+ EE  R +E V
Sbjct: 366 WNSTLEGLSLGVPMIGMPHWADQPTNAKFMEDVWKVGVRVKADSDGFVRREEFVRRVEEV 425

Query: 468 MGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVC 503
           M ++++G+EIR+NA KWK LA+EA SEGGSS  N+  FV   C
Sbjct: 426 M-EAEQGKEIRKNAEKWKVLAQEAVSEGGSSDKNINEFVSMFC 453

BLAST of CSPI06G08930 vs. NCBI nr
Match: gi|449445445|ref|XP_004140483.1| (PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Cucumis sativus])

HSP 1 Score: 937.6 bits (2422), Expect = 9.3e-270
Identity = 466/467 (99.79%), Postives = 467/467 (100.00%), Query Frame = 1

Query: 37  MNNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTL 96
           MNNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTL
Sbjct: 1   MNNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTL 60

Query: 97  PHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNIIQESRNKGQPFTCIVYSILI 156
           PHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNIIQESRNKGQPFTCIVYSILI
Sbjct: 61  PHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNIIQESRNKGQPFTCIVYSILI 120

Query: 157 PWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPL 216
           PWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPL
Sbjct: 121 PWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPL 180

Query: 217 LSARDLPSFFGASDGYSFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLM 276
           LSARDLPSFFGASDGYSFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLM
Sbjct: 181 LSARDLPSFFGASDGYSFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLM 240

Query: 277 PIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQK 336
           PIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQK
Sbjct: 241 PIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQK 300

Query: 337 EEIARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLT 396
           EEIARGLS+TKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLT
Sbjct: 301 EEIARGLSLTKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLT 360

Query: 397 HCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRLEVEEEGVVKGEEIERCL 456
           HCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRLEVEEEGVVKGEEIERCL
Sbjct: 361 HCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRLEVEEEGVVKGEEIERCL 420

Query: 457 ELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVCS 504
           ELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVCS
Sbjct: 421 ELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVCS 467

BLAST of CSPI06G08930 vs. NCBI nr
Match: gi|659116578|ref|XP_008458144.1| (PREDICTED: anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 1-like [Cucumis melo])

HSP 1 Score: 812.0 bits (2096), Expect = 5.9e-232
Identity = 411/469 (87.63%), Postives = 431/469 (91.90%), Query Frame = 1

Query: 37  MNNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTL 96
           MNNTTP PNPR VLL+T+ AQGHINPTLQLAKRL RHGDLHVTFL SLSAYRRMG TPTL
Sbjct: 1   MNNTTP-PNPRRVLLITYSAQGHINPTLQLAKRLIRHGDLHVTFLTSLSAYRRMGQTPTL 60

Query: 97  PHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNIIQESRNKGQPFTCIVYSILI 156
           PH++FASFSDGYDDGFKP DDI  Y+SELER GSDALKNIIQESRN+GQPFTCIVYSIL+
Sbjct: 61  PHLSFASFSDGYDDGFKPGDDIDHYVSELERCGSDALKNIIQESRNQGQPFTCIVYSILL 120

Query: 157 PWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDP-SSTSIKLPGLP 216
           PWVATVARSLDVASV LWIQPAVVFALYYYY NGYYDEIQRI SGDDP SS SIKLPGLP
Sbjct: 121 PWVATVARSLDVASVLLWIQPAVVFALYYYYFNGYYDEIQRIISGDDPGSSMSIKLPGLP 180

Query: 217 LLSARDLPSFFGASDGYSFALPMFRKQFELLEEE-SNPKILINTFEELEKDAVKAIKKFH 276
           LLSARDLPSFFG SD Y+FAL +FRKQFELLEEE SNP ILINTFEELEKDAVKAIKKFH
Sbjct: 181 LLSARDLPSFFGGSDVYAFALIIFRKQFELLEEEESNPNILINTFEELEKDAVKAIKKFH 240

Query: 277 LMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQ 336
           LMPIGPLIPSV  DG DPSEASSGCDL+RSTSSY++WLNSKPKASVVYVS GSI+ +S Q
Sbjct: 241 LMPIGPLIPSVFFDGTDPSEASSGCDLYRSTSSYIDWLNSKPKASVVYVSSGSITKLSNQ 300

Query: 337 QKEEIARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCF 396
           QKEE+ARGL  TKRPFLWVIR+ E EED LSFKEKLETQGKIV WC+QLEVLSSPATGCF
Sbjct: 301 QKEEMARGLLSTKRPFLWVIRDTEAEEDSLSFKEKLETQGKIVPWCSQLEVLSSPATGCF 360

Query: 397 LTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRLEVEEEGVVKGEEIER 456
           LTHCGWNSCLESLACGVP VAFPQWSDQATNSKII+DLSETGVRLE  E+GVVKGEEIER
Sbjct: 361 LTHCGWNSCLESLACGVPTVAFPQWSDQATNSKIIQDLSETGVRLEAGEDGVVKGEEIER 420

Query: 457 CLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVCS 504
           CL LVMGDSKKGE+IRRNALKWKKLAKEAASEGGSSFAN KAFVD VCS
Sbjct: 421 CLTLVMGDSKKGEDIRRNALKWKKLAKEAASEGGSSFANFKAFVDQVCS 468

BLAST of CSPI06G08930 vs. NCBI nr
Match: gi|225433620|ref|XP_002263700.1| (PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Vitis vinifera])

HSP 1 Score: 538.1 bits (1385), Expect = 1.6e-149
Identity = 266/459 (57.95%), Postives = 345/459 (75.16%), Query Frame = 1

Query: 48  HVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDG 107
           H LLVT  AQGHINP LQ AKR+ R G   V+F  S+SA+RRM    T   + F  FSDG
Sbjct: 5   HFLLVTFPAQGHINPALQFAKRIIRTG-AQVSFATSVSAHRRMAKRSTPEGLNFVPFSDG 64

Query: 108 YDDGFKPSDDIKLYISELERRGSDALKNIIQESRNKGQPFTCIVYSILIPWVATVARSLD 167
           YDDGFKP+DD++ Y+SE++RRGS+ L+ I+  + ++GQPFTCIVY++L+PW A VAR L 
Sbjct: 65  YDDGFKPTDDVQHYMSEIKRRGSETLREIVVRNADEGQPFTCIVYTLLLPWAAEVARGLG 124

Query: 168 VASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFG 227
           V S  LWIQPA V  +YYYY NGY D  + I++  +PS  S++LPGLPLLS+RDLPSF  
Sbjct: 125 VPSALLWIQPATVLDIYYYYFNGYGDVFRNISN--EPSC-SVELPGLPLLSSRDLPSFLV 184

Query: 228 ASDGYSFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLV 287
            S+ Y+F LP F++Q E L +E++PK+L+NTF+ LE + ++A+ K HL+ IGPL+PS  +
Sbjct: 185 KSNAYTFVLPTFQEQLEALSQETSPKVLVNTFDALEPEPLRAVDKLHLIGIGPLVPSAYL 244

Query: 288 DGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEIARGLSITK 347
           DG DPS+ S G D+F+ +  YMEWLNSKPK+SVVYVS GSIS +SK QKE+IAR L    
Sbjct: 245 DGKDPSDTSFGGDMFQGSDDYMEWLNSKPKSSVVYVSFGSISVLSKTQKEDIARALLDCG 304

Query: 348 RPFLWVIRNIE-----EEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNS 407
            PFLWVIR  E     +E+D LS +E+LE +G IVSWC+Q+EVL+ P+ GCF++HCGWNS
Sbjct: 305 HPFLWVIRAPENGEEVKEQDKLSCREELEQKGMIVSWCSQIEVLTHPSLGCFVSHCGWNS 364

Query: 408 CLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRLEVEEEGVVKGEEIERCLELVMGD 467
            LESL  GVP VAFPQW+DQ TN+K+IED+ + G+R+ V EEG+V+ +E +RCLE+VMG 
Sbjct: 365 TLESLVSGVPVVAFPQWTDQGTNAKLIEDMWKIGIRVTVNEEGIVESDEFKRCLEIVMGG 424

Query: 468 SKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHV 502
            +KGEE+RRNA KWK LA+EA  +GGSS  NLK FVD V
Sbjct: 425 GEKGEEMRRNAEKWKNLAREAVKDGGSSDKNLKGFVDEV 459

BLAST of CSPI06G08930 vs. NCBI nr
Match: gi|225463309|ref|XP_002267526.1| (PREDICTED: crocetin glucosyltransferase, chloroplastic [Vitis vinifera])

HSP 1 Score: 524.2 bits (1349), Expect = 2.4e-145
Identity = 274/457 (59.96%), Postives = 337/457 (73.74%), Query Frame = 1

Query: 48  HVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDG 107
           H+L+VT  +QGHINPTLQLAK L R G  HVTF  S SA  RM  +P L  + FA+FSDG
Sbjct: 4   HILIVTLPSQGHINPTLQLAKLLIRAG-AHVTFFTSTSAGTRMSKSPNLDGLEFATFSDG 63

Query: 108 YDDGFKPSDDIKLYISELERRGSDALKNIIQESRNKGQPFTCIVYSILIPWVATVARSLD 167
           YD G K  DD++ ++S++ER GS AL  +I  S N+G+PF C++Y + IPWVA VA SL 
Sbjct: 64  YDHGLKQGDDVEKFMSQIERLGSQALIELIMASANEGRPFACLLYGVQIPWVAEVAHSLH 123

Query: 168 VASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFG 227
           + S  +W QPA VF +YYYY NGY + IQ    GD PSST I+LPGLPLL+  DLPSF  
Sbjct: 124 IPSALVWTQPAAVFDIYYYYFNGYGELIQN--KGDHPSST-IELPGLPLLNNSDLPSFLI 183

Query: 228 ASDG--YSFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSV 287
              G  Y FALP F+K  E+L  ESNPK+LIN+F+ LE +A+ AI KF+LM IGPLIPS 
Sbjct: 184 PPKGNTYKFALPGFQKHLEMLNCESNPKVLINSFDALESEALGAINKFNLMGIGPLIPSA 243

Query: 288 LVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEIARGLSI 347
            +DG DPS+ S G DLFRS+  Y++WLNSKPK+SV+YVS GS+  +SKQQ EEIARGL  
Sbjct: 244 FLDGKDPSDTSFGGDLFRSSKDYIQWLNSKPKSSVIYVSFGSLFVLSKQQSEEIARGLLD 303

Query: 348 TKRPFLWVIRNIE-EEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCL 407
             RPFLWVIR  E EEE  LS  E+LE QG +V WC+Q+EVLS P+ GCF+TH GWNS L
Sbjct: 304 GGRPFLWVIRLEENEEEKTLSCHEELERQGMMVPWCSQVEVLSHPSMGCFVTHSGWNSTL 363

Query: 408 ESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRLEVEEEGVVKGEEIERCLELVMGDSK 467
           ESL  GVP VAFPQWSDQATN+K+IE + +TG+R  V +EG+V+ +EI+RCLELVMG  +
Sbjct: 364 ESLTSGVPVVAFPQWSDQATNAKLIEVVWKTGLRAMVNQEGIVEADEIKRCLELVMGSGE 423

Query: 468 KGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHV 502
           +GEE+RRNA KWK LA+EA  EGGSS  NLK F++ V
Sbjct: 424 RGEEMRRNATKWKVLAREAVKEGGSSDKNLKNFMNEV 456

BLAST of CSPI06G08930 vs. NCBI nr
Match: gi|225433624|ref|XP_002263301.1| (PREDICTED: crocetin glucosyltransferase, chloroplastic [Vitis vinifera])

HSP 1 Score: 523.9 bits (1348), Expect = 3.2e-145
Identity = 266/460 (57.83%), Postives = 342/460 (74.35%), Query Frame = 1

Query: 49  VLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGY 108
           +LLVT+ AQGHINP+LQLAK LTR G  HVTF+ S SA  RM   PTL  + F +FSDGY
Sbjct: 5   ILLVTYPAQGHINPSLQLAKLLTRAG-AHVTFVTSSSASTRMSKPPTLEGLEFVTFSDGY 64

Query: 109 DDGFKPSDDIKLYISELERRGSDALKNIIQESRNKGQPFTCIVYSILIPWVATVARSLDV 168
           D GFK  DD++ ++SEL+R GS AL  +I    N+G+PFTC++Y I+IPWVA VA+S  +
Sbjct: 65  DHGFKHGDDLQNFMSELDRLGSQALTELIVARANEGRPFTCLLYGIIIPWVAEVAQSFHL 124

Query: 169 ASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGA 228
            S  +W Q A VF +YYYY NGY + I    +G   SS+SI+LPGLPLLS+ DLPSF   
Sbjct: 125 PSALVWSQAATVFDIYYYYFNGYGELIGNKGNG---SSSSIELPGLPLLSSSDLPSFLEP 184

Query: 229 SDG--YSFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVL 288
           S    ++F L   +KQ E L  ESNP++L+N+F+ LE +A++A+ KF LM IGPL+P   
Sbjct: 185 SKAIAFNFVLKSLQKQLEQLNRESNPRVLVNSFDALESEALRALNKFKLMGIGPLLPLAF 244

Query: 289 VDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEIARGLSIT 348
           +DG DPS+ S G DLFR +  Y++WLNSKP++SV+YVS GS+S +SKQQ EEIARGL  +
Sbjct: 245 LDGKDPSDTSFGGDLFRDSKDYIQWLNSKPESSVIYVSFGSLSVLSKQQSEEIARGLLAS 304

Query: 349 KRPFLWVIR-----NIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWN 408
            RPFLWVIR       E+E+D LS  E+LE QG IV WC+Q+EVLS P+ GCF++HCGWN
Sbjct: 305 GRPFLWVIRAKENGEEEKEDDKLSCVEELEQQGMIVPWCSQVEVLSHPSLGCFVSHCGWN 364

Query: 409 SCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRLEVEEEGVVKGEEIERCLELVMG 468
           S LESLACGVP VAFPQW+DQ TN+K+IED+ +TG+R+ V +EG+V+G EI++CLELVMG
Sbjct: 365 STLESLACGVPVVAFPQWTDQTTNAKLIEDVWKTGLRVMVNQEGIVEGGEIKKCLELVMG 424

Query: 469 DSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHV 502
             +KG+E+RRNA KWK LA+EA  EGGSS  NLK FV+ +
Sbjct: 425 CGEKGQEVRRNAKKWKDLAREAVKEGGSSDKNLKNFVNEI 460

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
UGT1_GARJA2.6e-13955.70Crocetin glucosyltransferase, chloroplastic OS=Gardenia jasminoides GN=UGT75L6 P... [more]
5GT1_PERFR2.5e-12151.39Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 1 OS=Perilla frutescens GN=P... [more]
5GT_VERHY2.5e-12151.84Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase OS=Verbena hybrida GN=HGT8 P... [more]
U75D1_ARATH2.2e-11748.77UDP-glycosyltransferase 75D1 OS=Arabidopsis thaliana GN=UGT75D1 PE=2 SV=2[more]
5GT2_PERFR1.0e-11450.78Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 2 OS=Perilla frutescens GN=P... [more]
Match NameE-valueIdentityDescription
A0A0A0KA46_CUCSA6.5e-27099.79UDP-glucose:flavonoid 7-O-glucosyltransferase OS=Cucumis sativus GN=Csa_6G109750... [more]
F6I4F4_VITVI1.1e-14957.95Glycosyltransferase OS=Vitis vinifera GN=VIT_05s0062g00640 PE=3 SV=1[more]
F6I4D5_VITVI1.7e-14559.96Glycosyltransferase OS=Vitis vinifera GN=VIT_05s0062g00350 PE=3 SV=1[more]
F6I4F7_VITVI2.2e-14557.83Glycosyltransferase OS=Vitis vinifera GN=VIT_05s0062g00700 PE=3 SV=1[more]
M5XTU9_PRUPE1.4e-14457.60Glycosyltransferase OS=Prunus persica GN=PRUPE_ppa005161mg PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT4G15550.11.2e-11848.77 indole-3-acetate beta-D-glucosyltransferase[more]
AT4G14090.12.7e-11046.82 UDP-Glycosyltransferase superfamily protein[more]
AT1G05530.12.5e-10344.87 UDP-glucosyl transferase 75B2[more]
AT1G05560.11.3e-9942.80 UDP-glucosyltransferase 75B1[more]
AT1G05675.15.5e-7937.58 UDP-Glycosyltransferase superfamily protein[more]
Match NameE-valueIdentityDescription
gi|449445445|ref|XP_004140483.1|9.3e-27099.79PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Cucumis sativus][more]
gi|659116578|ref|XP_008458144.1|5.9e-23287.63PREDICTED: anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 1-like [Cucumis m... [more]
gi|225433620|ref|XP_002263700.1|1.6e-14957.95PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Vitis vinifera][more]
gi|225463309|ref|XP_002267526.1|2.4e-14559.96PREDICTED: crocetin glucosyltransferase, chloroplastic [Vitis vinifera][more]
gi|225433624|ref|XP_002263301.1|3.2e-14557.83PREDICTED: crocetin glucosyltransferase, chloroplastic [Vitis vinifera][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002213UDP_glucos_trans
Vocabulary: Molecular Function
TermDefinition
GO:0016758transferase activity, transferring hexosyl groups
Vocabulary: Biological Process
TermDefinition
GO:0008152metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016758 transferase activity, transferring hexosyl groups

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI06G08930.1CSPI06G08930.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePANTHERPTHR11926GLUCOSYL/GLUCURONOSYL TRANSFERASEScoord: 44..502
score: 1.6E
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePFAMPF00201UDPGTcoord: 309..465
score: 1.8
NoneNo IPR availableGENE3DG3DSA:3.40.50.2000coord: 309..440
score: 1.3
NoneNo IPR availablePANTHERPTHR11926:SF98UDP-GLYCOSYLTRANSFERASE 75B1-RELATEDcoord: 44..502
score: 1.6E
NoneNo IPR availableunknownSSF53756UDP-Glycosyltransferase/glycogen phosphorylasecoord: 44..501
score: 1.86E

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
CSPI06G08930Cp4.1LG17g03940Cucurbita pepo (Zucchini)cpecpiB329
The following gene(s) are paralogous to this gene:

None