CSPI04G04890 (gene) Cucumber (PI 183967) v1

Overview
NameCSPI04G04890
Typegene
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionGlycosyltransferase
LocationChr4: 3262188 .. 3264582 (+)
RNA-Seq ExpressionCSPI04G04890
SyntenyCSPI04G04890
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CACATTCATATACAACAAAGTCACACCCATAACGTCTTGTTTCTAATTATTACTCATTTCTCAGTTCTTCCTCTGTCCATGGAATCCGCAGCTCATGTTGCTCTTATTTCCAGCCCCGGAATGGGCCATCTTTTCCCCGCACTCGAGCTCGCCACGCGCCTCTCCACGCGCCACCGTCTCACCGTCACTGTTTTCATCGTCCCCTCCCACTCCTCCTCCGCCGAAAACAAAGTCATCGCCGCCGCCCAAGCAGCTGGTCTCTTCACCGTTGTCGAACTTCCTCCCGCCGACATGTCTGATGTCACTGAATCCTCCGTTGTCGGCCGCCTCGCCATCACCATGCGCCGCCATGTCCCGATACTCCGCTCCGCGGTCTCCGCCATGACCTCCCCTCCCTCCGTCCTCATTGCTGACATCTTCTCCATCGAATCCTTCGCCGTCGCGGACGAGTTCGACATGAAGAAATACGCGTTCGTTGCCTCCAATGCATGGTTCTTAGCCGTCATGGTTTACGCTCAGGTGTGGGACAGGGAGATCGTTGGGCAGTACGTGGACCAGAAAGAACCGCTTCAAATCCCAGGATGCGAATCGGTTCGGCCATGCGATGTTATCGACCCACTTCTGGACCGGACCGAACAGCAATATTTCGAAATCTTGAAATTGGGGATGGGGATAGCATCGAGTGACGGCGTTTTGGTTAACACGTGGGATGAGTTGCAAGATCGCACGCTCGCATCTTTAAACGACCGGAATCTGTTGGGTAAAATCTCACCGCCGGTTTACTCTATTGGACCAATCGTGAGGCAGCCCGGTTCGAAGAAAGGCGGTTCGAGTGAGTTGTTTAATTGGTTGAGTAAGCAACCCAGTGAGTCGGTGATCTACGTGTCGTTTGGGAGCGGTGGAACGCTGTCGTTTGAGCAAATGACAGAAGTGGCTCACGGCTTGGAGATGAGTCGGCAGAGATTTGTTTGGGTGGTACGCGCCCCAAAGGTAAAATGTAATTAAGAAACTAATCATGTTTTATCACGGGGTTTTATTTTTTTTTAAAAATAGTTCTTTATAAAATATCATTTTGAAAGACACATAATAATTTGTTCCATATAAATTGTATTTGGCTAAGCTTTGTTGTTTAAAACAATTATTTGTACAAATTAATTAAAGAAAAATATTTGAATAAAATATATTTTTTTACAGTGAAGAATAATGTGTAAATATTATTATTCTTAAAAAAATACTACTCTTTATAAAAGCAAAATTTATGGAATATCAATTTGTGGTTGAATATTATCTGTCAACTGCACAGATATGGACAAACTCGAAGAAAAGTACTTAGTGTTACATCTATTTTGTGCAACTTATGTAGTTGTTCAAGTAAAATTTATCACGTAACGAACCATTGTTTTGTTTCAAGAAAAGATTTAATAGTTAGTTTGAAAGAAGTGAAAAGGTTGGATCAAAATTCTCGTTTTGTTTTATCTTTCCATGCGGTAAGAAAGTAGGCATACGAATTGGTGTGTGCCTAATTCTTGTTTTGTTTTATCTTTCCATGTGGTAAAAGAGTGGGCATATGAACTGATGCCTATTTTTTATTCTCATTTCCACGTGATGGGTTTGAATTAATTAACCACATACATCATTAAAAAAAAAAACTATTTTTGATCATTAGAGTTTATTTAAATAAAATTTAATCTATATACTTTCAAATTAATATATAAATTTAACATTTTGTAATCTTATAGGTAAGATCGGACGGTGCATTTTTTACGACGGGAGACGAGAGTGAGGAGCAATCGTTGGCGAAGTTTTTGCCAGAGGGATTTTTGGAACGCACGAGCGAGGTGGGGTTCGTAGTATCAATGTGGGCGGACCAGACAGCGGTGCTAGGGAGTCCAGCGGTGGGAGGTTTTTTCTCTCACAGCGGATGGAACTCGGCGTTGGAAAGCATTACAAATGGAGTACCAATGGTGGTGTGGCCATTGTATGCAGAGCAACGAATGAATGCCACAATGCTAACTGAGGAGATCGGAGTGGGTGTCCGATCAAAGGAGCTACCAACGAATGCATTGATTGAAAGAGAGGAGATCGCAGCTATGGTAAGGAAGATAATGGTGGAGGAAGATGATGAAGGGAAAGCCATTAGGGCAAAGGCTAAGGAACTTCAAAGGAGTGCTGCGAAAGCGTTGGGAGAAGGTGGTTCATCGCACCACAACTTTGCTCGTGTTGTCAAATTGTTTGGCTGTTAAGTATATATATATATATATATATATATATATTTGTCTCTCGAATTTCACTGTTTGATTGCTAGTAAAAAAGTATATGTGGTTGTCACTAGTGAGAGTAATGTTTTAGAGTGGTTTTTCTTCTAATTGACTCTTTTGGTTTAATAACGTCGGGT

mRNA sequence

CACATTCATATACAACAAAGTCACACCCATAACGTCTTGTTTCTAATTATTACTCATTTCTCAGTTCTTCCTCTGTCCATGGAATCCGCAGCTCATGTTGCTCTTATTTCCAGCCCCGGAATGGGCCATCTTTTCCCCGCACTCGAGCTCGCCACGCGCCTCTCCACGCGCCACCGTCTCACCGTCACTGTTTTCATCGTCCCCTCCCACTCCTCCTCCGCCGAAAACAAAGTCATCGCCGCCGCCCAAGCAGCTGGTCTCTTCACCGTTGTCGAACTTCCTCCCGCCGACATGTCTGATGTCACTGAATCCTCCGTTGTCGGCCGCCTCGCCATCACCATGCGCCGCCATGTCCCGATACTCCGCTCCGCGGTCTCCGCCATGACCTCCCCTCCCTCCGTCCTCATTGCTGACATCTTCTCCATCGAATCCTTCGCCGTCGCGGACGAGTTCGACATGAAGAAATACGCGTTCGTTGCCTCCAATGCATGGTTCTTAGCCGTCATGGTTTACGCTCAGGTGTGGGACAGGGAGATCGTTGGGCAGTACGTGGACCAGAAAGAACCGCTTCAAATCCCAGGATGCGAATCGGTTCGGCCATGCGATGTTATCGACCCACTTCTGGACCGGACCGAACAGCAATATTTCGAAATCTTGAAATTGGGGATGGGGATAGCATCGAGTGACGGCGTTTTGGTTAACACGTGGGATGAGTTGCAAGATCGCACGCTCGCATCTTTAAACGACCGGAATCTGTTGGGTAAAATCTCACCGCCGGTTTACTCTATTGGACCAATCGTGAGGCAGCCCGGTTCGAAGAAAGGCGGTTCGAGTGAGTTGTTTAATTGGTTGAGTAAGCAACCCAGTGAGTCGGTGATCTACGTGTCGTTTGGGAGCGGTGGAACGCTGTCGTTTGAGCAAATGACAGAAGTGGCTCACGGCTTGGAGATGAGTCGGCAGAGATTTGTTTGGGTGGTACGCGCCCCAAAGGTAAGATCGGACGGTGCATTTTTTACGACGGGAGACGAGAGTGAGGAGCAATCGTTGGCGAAGTTTTTGCCAGAGGGATTTTTGGAACGCACGAGCGAGGTGGGGTTCGTAGTATCAATGTGGGCGGACCAGACAGCGGTGCTAGGGAGTCCAGCGGTGGGAGGTTTTTTCTCTCACAGCGGATGGAACTCGGCGTTGGAAAGCATTACAAATGGAGTACCAATGGTGGTGTGGCCATTGTATGCAGAGCAACGAATGAATGCCACAATGCTAACTGAGGAGATCGGAGTGGGTGTCCGATCAAAGGAGCTACCAACGAATGCATTGATTGAAAGAGAGGAGATCGCAGCTATGGTAAGGAAGATAATGGTGGAGGAAGATGATGAAGGGAAAGCCATTAGGGCAAAGGCTAAGGAACTTCAAAGGAGTGCTGCGAAAGCGTTGGGAGAAGGTGGTTCATCGCACCACAACTTTGCTCGTGTTGTCAAATTGTTTGGCTGTTAAGTATATATATATATATATATATATATATATTTGTCTCTCGAATTTCACTGTTTGATTGCTAGTAAAAAAGTATATGTGGTTGTCACTAGTGAGAGTAATGTTTTAGAGTGGTTTTTCTTCTAATTGACTCTTTTGGTTTAATAACGTCGGGT

Coding sequence (CDS)

ATGGAATCCGCAGCTCATGTTGCTCTTATTTCCAGCCCCGGAATGGGCCATCTTTTCCCCGCACTCGAGCTCGCCACGCGCCTCTCCACGCGCCACCGTCTCACCGTCACTGTTTTCATCGTCCCCTCCCACTCCTCCTCCGCCGAAAACAAAGTCATCGCCGCCGCCCAAGCAGCTGGTCTCTTCACCGTTGTCGAACTTCCTCCCGCCGACATGTCTGATGTCACTGAATCCTCCGTTGTCGGCCGCCTCGCCATCACCATGCGCCGCCATGTCCCGATACTCCGCTCCGCGGTCTCCGCCATGACCTCCCCTCCCTCCGTCCTCATTGCTGACATCTTCTCCATCGAATCCTTCGCCGTCGCGGACGAGTTCGACATGAAGAAATACGCGTTCGTTGCCTCCAATGCATGGTTCTTAGCCGTCATGGTTTACGCTCAGGTGTGGGACAGGGAGATCGTTGGGCAGTACGTGGACCAGAAAGAACCGCTTCAAATCCCAGGATGCGAATCGGTTCGGCCATGCGATGTTATCGACCCACTTCTGGACCGGACCGAACAGCAATATTTCGAAATCTTGAAATTGGGGATGGGGATAGCATCGAGTGACGGCGTTTTGGTTAACACGTGGGATGAGTTGCAAGATCGCACGCTCGCATCTTTAAACGACCGGAATCTGTTGGGTAAAATCTCACCGCCGGTTTACTCTATTGGACCAATCGTGAGGCAGCCCGGTTCGAAGAAAGGCGGTTCGAGTGAGTTGTTTAATTGGTTGAGTAAGCAACCCAGTGAGTCGGTGATCTACGTGTCGTTTGGGAGCGGTGGAACGCTGTCGTTTGAGCAAATGACAGAAGTGGCTCACGGCTTGGAGATGAGTCGGCAGAGATTTGTTTGGGTGGTACGCGCCCCAAAGGTAAGATCGGACGGTGCATTTTTTACGACGGGAGACGAGAGTGAGGAGCAATCGTTGGCGAAGTTTTTGCCAGAGGGATTTTTGGAACGCACGAGCGAGGTGGGGTTCGTAGTATCAATGTGGGCGGACCAGACAGCGGTGCTAGGGAGTCCAGCGGTGGGAGGTTTTTTCTCTCACAGCGGATGGAACTCGGCGTTGGAAAGCATTACAAATGGAGTACCAATGGTGGTGTGGCCATTGTATGCAGAGCAACGAATGAATGCCACAATGCTAACTGAGGAGATCGGAGTGGGTGTCCGATCAAAGGAGCTACCAACGAATGCATTGATTGAAAGAGAGGAGATCGCAGCTATGGTAAGGAAGATAATGGTGGAGGAAGATGATGAAGGGAAAGCCATTAGGGCAAAGGCTAAGGAACTTCAAAGGAGTGCTGCGAAAGCGTTGGGAGAAGGTGGTTCATCGCACCACAACTTTGCTCGTGTTGTCAAATTGTTTGGCTGTTAA

Protein sequence

MESAAHVALISSPGMGHLFPALELATRLSTRHRLTVTVFIVPSHSSSAENKVIAAAQAAGLFTVVELPPADMSDVTESSVVGRLAITMRRHVPILRSAVSAMTSPPSVLIADIFSIESFAVADEFDMKKYAFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCESVRPCDVIDPLLDRTEQQYFEILKLGMGIASSDGVLVNTWDELQDRTLASLNDRNLLGKISPPVYSIGPIVRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWVVRAPKVRSDGAFFTTGDESEEQSLAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGGFFSHSGWNSALESITNGVPMVVWPLYAEQRMNATMLTEEIGVGVRSKELPTNALIEREEIAAMVRKIMVEEDDEGKAIRAKAKELQRSAAKALGEGGSSHHNFARVVKLFGC*
Homology
BLAST of CSPI04G04890 vs. ExPASy Swiss-Prot
Match: Q40287 (Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta OX=3983 GN=GT5 PE=2 SV=1)

HSP 1 Score: 458.4 bits (1178), Expect = 1.0e-127
Identity = 235/462 (50.87%), Postives = 321/462 (69.48%), Query Frame = 0

Query: 1   MESAAHVALISSPGMGHLFPALELATRLSTRHRLTVTVFIVPSHSSSAENKVIAAAQAAG 60
           + S  H+ L+SSPG+GHL P LEL  R+ T     VT+F+V S +S+AE +V+ +A    
Sbjct: 6   LNSKPHIVLLSSPGLGHLIPVLELGKRIVTLCNFDVTIFMVGSDTSAAEPQVLRSAMTPK 65

Query: 61  LFTVVELPPADMSDV--TESSVVGRLAITMRRHVPILRSAVSAMTSPPSVLIADIFSIES 120
           L  +++LPP ++S +   E++V  RL + MR   P  R+AVSA+   P+ +I D+F  ES
Sbjct: 66  LCEIIQLPPPNISCLIDPEATVCTRLFVLMREIRPAFRAAVSALKFRPAAIIVDLFGTES 125

Query: 121 FAVADEFDMKKYAFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCESVRPCDVI 180
             VA E  + KY ++ASNAWFLA+ +Y  + D+E+ G++V QKEP++IPGC  VR  +V+
Sbjct: 126 LEVAKELGIAKYVYIASNAWFLALTIYVPILDKEVEGEFVLQKEPMKIPGCRPVRTEEVV 185

Query: 181 DPLLDRTEQQYFEILKLGMGIASSDGVLVNTWDELQDRTLASLNDRNLLGKISP-PVYSI 240
           DP+LDRT QQY E  +LG+ I ++DG+L+NTW+ L+  T  +L D   LG+++  PV+ I
Sbjct: 186 DPMLDRTNQQYSEYFRLGIEIPTADGILMNTWEALEPTTFGALRDVKFLGRVAKVPVFPI 245

Query: 241 GPIVRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFV 300
           GP+ RQ G   G + EL +WL +QP ESV+YVSFGSGGTLS EQM E+A GLE S+QRF+
Sbjct: 246 GPLRRQAG-PCGSNCELLDWLDQQPKESVVYVSFGSGGTLSLEQMIELAWGLERSQQRFI 305

Query: 301 WVVRAPKVRS-DGAFFTTGDESEEQSLAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPA 360
           WVVR P V++ D AFFT GD +++ S   + PEGFL R   VG VV  W+ Q  ++  P+
Sbjct: 306 WVVRQPTVKTGDAAFFTQGDGADDMS--GYFPEGFLTRIQNVGLVVPQWSPQIHIMSHPS 365

Query: 361 VGGFFSHSGWNSALESITNGVPMVVWPLYAEQRMNATMLTEEIGVGVRSKELPTNALIER 420
           VG F SH GWNS LESIT GVP++ WP+YAEQRMNAT+LTEE+GV VR K LP   +++R
Sbjct: 366 VGVFLSHCGWNSVLESITAGVPIIAWPIYAEQRMNATLLTEELGVAVRPKNLPAKEVVKR 425

Query: 421 EEIAAMVRKIMVEEDDEGKAIRAKAKELQRSAAKALGEGGSS 459
           EEI  M+R+IMV  D+EG  IR + +EL+ S  KAL EGGSS
Sbjct: 426 EEIERMIRRIMV--DEEGSEIRKRVRELKDSGEKALNEGGSS 462

BLAST of CSPI04G04890 vs. ExPASy Swiss-Prot
Match: Q9ZU72 (UDP-glycosyltransferase 72D1 OS=Arabidopsis thaliana OX=3702 GN=UGT72D1 PE=2 SV=1)

HSP 1 Score: 416.0 bits (1068), Expect = 5.7e-115
Identity = 222/461 (48.16%), Postives = 310/461 (67.25%), Query Frame = 0

Query: 6   HVALISSPGMGHLFPALELATRLSTRHRLTVTVFIVPSHSSS-AENKVIAAAQAAGLFTV 65
           H  L++SPG+GHL P LEL  RLS+   + VT+  V S SSS  E + I AA A  +  +
Sbjct: 5   HALLVASPGLGHLIPILELGNRLSSVLNIHVTILAVTSGSSSPTETEAIHAAAARTICQI 64

Query: 66  VELPPADMSDVTE--SSVVGRLAITMRRHVPILRSAVSAMTSPPSVLIADIFSIESFAVA 125
            E+P  D+ ++ E  +++  ++ + MR   P +R AV  M   P+V+I D    E  +VA
Sbjct: 65  TEIPSVDVDNLVEPDATIFTKMVVKMRAMKPAVRDAVKLMKRKPTVMIVDFLGTELMSVA 124

Query: 126 DEFDM-KKYAFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCESVRPCDVIDPL 185
           D+  M  KY +V ++AWFLAVMVY  V D  + G+YVD KEPL+IPGC+ V P ++++ +
Sbjct: 125 DDVGMTAKYVYVPTHAWFLAVMVYLPVLDTVVEGEYVDIKEPLKIPGCKPVGPKELMETM 184

Query: 186 LDRTEQQYFEILKLGMGIASSDGVLVNTWDELQDRTLASLNDRNLLGKI-SPPVYSIGPI 245
           LDR+ QQY E ++ G+ +  SDGVLVNTW+ELQ  TLA+L +   L ++   PVY IGPI
Sbjct: 185 LDRSGQQYKECVRAGLEVPMSDGVLVNTWEELQGNTLAALREDEELSRVMKVPVYPIGPI 244

Query: 246 VRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWVV 305
           VR         + +F WL +Q   SV++V  GSGGTL+FEQ  E+A GLE+S QRFVWV+
Sbjct: 245 VR-TNQHVDKPNSIFEWLDEQRERSVVFVCLGSGGTLTFEQTVELALGLELSGQRFVWVL 304

Query: 306 RAPKVRSDGAFFTTGDESEEQSLAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGGF 365
           R P      A +     S+++ ++  LPEGFL+RT  VG VV+ WA Q  +L   ++GGF
Sbjct: 305 RRP------ASYLGAISSDDEQVSASLPEGFLDRTRGVGIVVTQWAPQVEILSHRSIGGF 364

Query: 366 FSHSGWNSALESITNGVPMVVWPLYAEQRMNATMLTEEIGVGVRSKELPTNALIEREEIA 425
            SH GW+SALES+T GVP++ WPLYAEQ MNAT+LTEEIGV VR+ ELP+  +I REE+A
Sbjct: 365 LSHCGWSSALESLTKGVPIIAWPLYAEQWMNATLLTEEIGVAVRTSELPSERVIGREEVA 424

Query: 426 AMVRKIMVEEDDEGKAIRAKAKELQRSAAKALGEGGSSHHN 462
           ++VRKIM EED+EG+ IRAKA+E++ S+ +A  + GSS+++
Sbjct: 425 SLVRKIMAEEDEEGQKIRAKAEEVRVSSERAWSKDGSSYNS 458

BLAST of CSPI04G04890 vs. ExPASy Swiss-Prot
Match: Q94A84 (UDP-glycosyltransferase 72E1 OS=Arabidopsis thaliana OX=3702 GN=UGT72E1 PE=1 SV=1)

HSP 1 Score: 399.4 bits (1025), Expect = 5.5e-110
Identity = 210/466 (45.06%), Postives = 306/466 (65.67%), Query Frame = 0

Query: 6   HVALISSPGMGHLFPALELATRLSTRHRLTVTVFIVPSHSSSAENKVIAAAQA-AGLFTV 65
           HVA+ +SPGMGH+ P +EL  RL+  H   VT+F++ + ++SA+++ + +    A L  +
Sbjct: 7   HVAMFASPGMGHIIPVIELGKRLAGSHGFDVTIFVLETDAASAQSQFLNSPGCDAALVDI 66

Query: 66  VELPPADMSDVTESSVVG--RLAITMRRHVPILRSAVSAMTSPPSVLIADIFSIESFAVA 125
           V LP  D+S + + S     +L + MR  +P +RS +  M   P+ LI D+F +++  + 
Sbjct: 67  VGLPTPDISGLVDPSAFFGIKLLVMMRETIPTIRSKIEEMQHKPTALIVDLFGLDAIPLG 126

Query: 126 DEFDMKKYAFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCESVRPCDVIDPLL 185
            EF+M  Y F+ASNA FLAV ++    D+++  +++ +K+P+ +PGCE VR  D ++  L
Sbjct: 127 GEFNMLTYIFIASNARFLAVALFFPTLDKDMEEEHIIKKQPMVMPGCEPVRFEDTLETFL 186

Query: 186 DRTEQQYFEILKLGMGIASSDGVLVNTWDELQDRTLASLNDRNLLGKIS-PPVYSIGPIV 245
           D   Q Y E +  G    + DG++VNTWD+++ +TL SL D  LLG+I+  PVY IGP+ 
Sbjct: 187 DPNSQLYREFVPFGSVFPTCDGIIVNTWDDMEPKTLKSLQDPKLLGRIAGVPVYPIGPLS 246

Query: 246 RQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWVVR 305
           R P      +  + +WL+KQP ESV+Y+SFGSGG+LS +Q+TE+A GLEMS+QRFVWVVR
Sbjct: 247 R-PVDPSKTNHPVLDWLNKQPDESVLYISFGSGGSLSAKQLTELAWGLEMSQQRFVWVVR 306

Query: 306 APKVRSD-GAFFTTGDESEEQSLAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGGF 365
            P   S   A+ +            +LPEGF+ RT E GF+VS WA Q  +L   AVGGF
Sbjct: 307 PPVDGSACSAYLSANSGKIRDGTPDYLPEGFVSRTHERGFMVSSWAPQAEILAHQAVGGF 366

Query: 366 FSHSGWNSALESITNGVPMVVWPLYAEQRMNATMLTEEIGVGVRSKELPTNALIEREEIA 425
            +H GWNS LES+  GVPM+ WPL+AEQ MNAT+L EE+GV VRSK+LP+  +I R EI 
Sbjct: 367 LTHCGWNSILESVVGGVPMIAWPLFAEQMMNATLLNEELGVAVRSKKLPSEGVITRAEIE 426

Query: 426 AMVRKIMVEEDDEGKAIRAKAKELQRSAAKALG-EGGSSHHNFARV 466
           A+VRKIMVEE  EG  +R K K+L+ +AA++L  +GG +H + +R+
Sbjct: 427 ALVRKIMVEE--EGAEMRKKIKKLKETAAESLSCDGGVAHESLSRI 469

BLAST of CSPI04G04890 vs. ExPASy Swiss-Prot
Match: O81498 (UDP-glycosyltransferase 72E3 OS=Arabidopsis thaliana OX=3702 GN=UGT72E3 PE=1 SV=1)

HSP 1 Score: 373.2 bits (957), Expect = 4.2e-102
Identity = 202/468 (43.16%), Postives = 302/468 (64.53%), Query Frame = 0

Query: 6   HVALISSPGMGHLFPALELATRLSTRHRLTVTVFIVPSHSSSAENKVIAAAQAAGLFTVV 65
           H A+ SSPGMGH+ P +ELA RLS  H   VTVF++ + ++S ++K++    + G+  +V
Sbjct: 7   HAAMFSSPGMGHVLPVIELAKRLSANHGFHVTVFVLETDAASVQSKLL---NSTGV-DIV 66

Query: 66  ELPPADMSDVTE--SSVVGRLAITMRRHVPILRSAVSAMTSPPSVLIADIFSIESFAVAD 125
            LP  D+S + +  + VV ++ + MR  VP LRS + AM   P+ LI D+F  ++  +A 
Sbjct: 67  NLPSPDISGLVDPNAHVVTKIGVIMREAVPTLRSKIVAMHQNPTALIIDLFGTDALCLAA 126

Query: 126 EFDMKKYAFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCESVRPCDVIDPLLD 185
           E +M  Y F+ASNA +L V +Y    D  I  ++  Q++PL IPGCE VR  D++D  L 
Sbjct: 127 ELNMLTYVFIASNARYLGVSIYYPTLDEVIKEEHTVQRKPLTIPGCEPVRFEDIMDAYLV 186

Query: 186 RTEQQYFEILKLGMGIASSDGVLVNTWDELQDRTLASLNDRNLLGKIS-PPVYSIGPIVR 245
             E  Y ++++  +    +DG+LVNTW+E++ ++L SL D  LLG+++  PVY +GP+ R
Sbjct: 187 PDEPVYHDLVRHCLAYPKADGILVNTWEEMEPKSLKSLQDPKLLGRVARVPVYPVGPLCR 246

Query: 246 QPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWVVRA 305
            P         +F+WL+KQP+ESV+Y+SFGSGG+L+ +Q+TE+A GLE S+QRF+WVVR 
Sbjct: 247 -PIQSSTTDHPVFDWLNKQPNESVLYISFGSGGSLTAQQLTELAWGLEESQQRFIWVVRP 306

Query: 306 PKVRSD-GAFFTTGDESEEQSLAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGGFF 365
           P   S    +F+      + +  ++LPEGF+ RT + GF++  WA Q  +L   AVGGF 
Sbjct: 307 PVDGSSCSDYFSAKGGVTKDNTPEYLPEGFVTRTCDRGFMIPSWAPQAEILAHQAVGGFL 366

Query: 366 SHSGWNSALESITNGVPMVVWPLYAEQRMNATMLTEEIGVGVRSKELPTNALIEREEIAA 425
           +H GW+S LES+  GVPM+ WPL+AEQ MNA +L++E+G+ VR  + P  A I R +I A
Sbjct: 367 THCGWSSTLESVLCGVPMIAWPLFAEQNMNAALLSDELGISVRVDD-PKEA-ISRSKIEA 426

Query: 426 MVRKIMVEEDDEGKAIRAKAKELQRSAAKALG--EGGSSHHNFARVVK 468
           MVRK+M E  DEG+ +R K K+L+ +A  +L    GGS+H +  RV K
Sbjct: 427 MVRKVMAE--DEGEEMRRKVKKLRDTAEMSLSIHGGGSAHESLCRVTK 465

BLAST of CSPI04G04890 vs. ExPASy Swiss-Prot
Match: Q9LVR1 (UDP-glycosyltransferase 72E2 OS=Arabidopsis thaliana OX=3702 GN=UGT72E2 PE=1 SV=1)

HSP 1 Score: 366.7 bits (940), Expect = 4.0e-100
Identity = 202/471 (42.89%), Postives = 300/471 (63.69%), Query Frame = 0

Query: 6   HVALISSPGMGHLFPALELATRLSTRHRLTVTVFIVPSHSSSAENKVIAAAQAAGLFTVV 65
           H A+ SSPGMGH+ P +EL  RLS  +   VTVF++ + ++SA++K +    + G+  +V
Sbjct: 7   HAAMFSSPGMGHVIPVIELGKRLSANNGFHVTVFVLETDAASAQSKFL---NSTGV-DIV 66

Query: 66  ELPPADMSDVT--ESSVVGRLAITMRRHVPILRSAVSAMTSPPSVLIADIFSIESFAVAD 125
           +LP  D+  +   +  VV ++ + MR  VP LRS ++AM   P+ LI D+F  ++  +A 
Sbjct: 67  KLPSPDIYGLVDPDDHVVTKIGVIMRAAVPALRSKIAAMHQKPTALIVDLFGTDALCLAK 126

Query: 126 EFDMKKYAFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCESVRPCDVIDPLLD 185
           EF+M  Y F+ +NA FL V +Y    D++I  ++  Q+ PL IPGCE VR  D +D  L 
Sbjct: 127 EFNMLSYVFIPTNARFLGVSIYYPNLDKDIKEEHTVQRNPLAIPGCEPVRFEDTLDAYLV 186

Query: 186 RTEQQYFEILKLGMGIASSDGVLVNTWDELQDRTLASLNDRNLLGKIS-PPVYSIGPIVR 245
             E  Y + ++ G+    +DG+LVNTW+E++ ++L SL +  LLG+++  PVY IGP+ R
Sbjct: 187 PDEPVYRDFVRHGLAYPKADGILVNTWEEMEPKSLKSLLNPKLLGRVARVPVYPIGPLCR 246

Query: 246 QPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWVVRA 305
            P         + +WL++QP+ESV+Y+SFGSGG LS +Q+TE+A GLE S+QRFVWVVR 
Sbjct: 247 -PIQSSETDHPVLDWLNEQPNESVLYISFGSGGCLSAKQLTELAWGLEQSQQRFVWVVRP 306

Query: 306 PKVRSDGA----FFTTGDESEEQSLAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVG 365
           P    DG+    + +      E +  ++LPEGF+ RTS+ GFVV  WA Q  +L   AVG
Sbjct: 307 P---VDGSCCSEYVSANGGGTEDNTPEYLPEGFVSRTSDRGFVVPSWAPQAEILSHRAVG 366

Query: 366 GFFSHSGWNSALESITNGVPMVVWPLYAEQRMNATMLTEEIGVGVRSKELPTNALIEREE 425
           GF +H GW+S LES+  GVPM+ WPL+AEQ MNA +L++E+G+ VR  +   +  I R +
Sbjct: 367 GFLTHCGWSSTLESVVGGVPMIAWPLFAEQNMNAALLSDELGIAVRLDDPKED--ISRWK 426

Query: 426 IAAMVRKIMVEEDDEGKAIRAKAKELQRSAAKALG--EGGSSHHNFARVVK 468
           I A+VRK+M E+  EG+A+R K K+L+ SA  +L    GG +H +  RV K
Sbjct: 427 IEALVRKVMTEK--EGEAMRRKVKKLRDSAEMSLSIDGGGLAHESLCRVTK 465

BLAST of CSPI04G04890 vs. ExPASy TrEMBL
Match: A0A0A0KWI6 (Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_4G038620 PE=3 SV=1)

HSP 1 Score: 913.7 bits (2360), Expect = 3.2e-262
Identity = 469/471 (99.58%), Postives = 469/471 (99.58%), Query Frame = 0

Query: 1   MESAAHVALISSPGMGHLFPALELATRLSTRHRLTVTVFIVPSHSSSAENKVIAAAQAAG 60
           MESAAHVALISSPGMGHLFPALE ATRLSTRHRLTVTVFIVPS SSSAENKVIAAAQAAG
Sbjct: 1   MESAAHVALISSPGMGHLFPALEFATRLSTRHRLTVTVFIVPSRSSSAENKVIAAAQAAG 60

Query: 61  LFTVVELPPADMSDVTESSVVGRLAITMRRHVPILRSAVSAMTSPPSVLIADIFSIESFA 120
           LFTVVELPPADMSDVTESSVVGRLAITMRRHVPILRSAVSAMTSPPSVLIADIFSIESFA
Sbjct: 61  LFTVVELPPADMSDVTESSVVGRLAITMRRHVPILRSAVSAMTSPPSVLIADIFSIESFA 120

Query: 121 VADEFDMKKYAFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCESVRPCDVIDP 180
           VADEFDMKKYAFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCESVRPCDVIDP
Sbjct: 121 VADEFDMKKYAFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCESVRPCDVIDP 180

Query: 181 LLDRTEQQYFEILKLGMGIASSDGVLVNTWDELQDRTLASLNDRNLLGKISPPVYSIGPI 240
           LLDRTEQQYFEILKLGMGIASSDGVLVNTWDELQDRTLASLNDRNLLGKISPPVYSIGPI
Sbjct: 181 LLDRTEQQYFEILKLGMGIASSDGVLVNTWDELQDRTLASLNDRNLLGKISPPVYSIGPI 240

Query: 241 VRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWVV 300
           VRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWVV
Sbjct: 241 VRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWVV 300

Query: 301 RAPKVRSDGAFFTTGDESEEQSLAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGGF 360
           RAPKVRSDGAFFTTGDESEEQSLAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGGF
Sbjct: 301 RAPKVRSDGAFFTTGDESEEQSLAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGGF 360

Query: 361 FSHSGWNSALESITNGVPMVVWPLYAEQRMNATMLTEEIGVGVRSKELPTNALIEREEIA 420
           FSHSGWNSALESITNGVPMVVWPLYAEQRMNATMLTEEIGVGVRSKELPTNALIEREEIA
Sbjct: 361 FSHSGWNSALESITNGVPMVVWPLYAEQRMNATMLTEEIGVGVRSKELPTNALIEREEIA 420

Query: 421 AMVRKIMVEEDDEGKAIRAKAKELQRSAAKALGEGGSSHHNFARVVKLFGC 472
           AMVRKIMVEEDDEGKAIRAKAKELQRSAAKALGEGGSSHHNFARVVKLFGC
Sbjct: 421 AMVRKIMVEEDDEGKAIRAKAKELQRSAAKALGEGGSSHHNFARVVKLFGC 471

BLAST of CSPI04G04890 vs. ExPASy TrEMBL
Match: A0A5D3CXP2 (Glycosyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G005580 PE=3 SV=1)

HSP 1 Score: 869.8 bits (2246), Expect = 5.3e-249
Identity = 445/471 (94.48%), Postives = 453/471 (96.18%), Query Frame = 0

Query: 1   MESAAHVALISSPGMGHLFPALELATRLSTRHRLTVTVFIVPSHSSSAENKVIAAAQAAG 60
           MESAAHVALISSPGMGHLFPALELATRLST HRLTVTVFIVPSHSSSAENKVIA AQAAG
Sbjct: 1   MESAAHVALISSPGMGHLFPALELATRLSTHHRLTVTVFIVPSHSSSAENKVIATAQAAG 60

Query: 61  LFTVVELPPADMSDVTESSVVGRLAITMRRHVPILRSAVSAMTSPPSVLIADIFSIESFA 120
           LFTVVELPPADMSDVTESS+VGRLAITMRRHVPI RSAVSAMTSPPSVLIADIF++ESFA
Sbjct: 61  LFTVVELPPADMSDVTESSIVGRLAITMRRHVPIFRSAVSAMTSPPSVLIADIFAVESFA 120

Query: 121 VADEFDMKKYAFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCESVRPCDVIDP 180
           VADEFDM KY FVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCE VRPCDVIDP
Sbjct: 121 VADEFDMAKYTFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCEPVRPCDVIDP 180

Query: 181 LLDRTEQQYFEILKLGMGIASSDGVLVNTWDELQDRTLASLNDRNLLGKISPPVYSIGPI 240
           LLDRTE QY EILKLGMGIASSDGVLVNTWDELQ RTLASLNDR LLGKISPPVYSIGPI
Sbjct: 181 LLDRTELQYSEILKLGMGIASSDGVLVNTWDELQHRTLASLNDRYLLGKISPPVYSIGPI 240

Query: 241 VRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWVV 300
           VRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMS+QRFVWVV
Sbjct: 241 VRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSKQRFVWVV 300

Query: 301 RAPKVRSDGAFFTTGDESEEQSLAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGGF 360
           RAPKVRSDGA+FTTGD SEEQS AKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGGF
Sbjct: 301 RAPKVRSDGAYFTTGDGSEEQSSAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGGF 360

Query: 361 FSHSGWNSALESITNGVPMVVWPLYAEQRMNATMLTEEIGVGVRSKELPTNALIEREEIA 420
           F+HSGWNSALE ITNGVPMVVWPLYAEQR+NATML EEIGV VRSKELPT ALIEREEIA
Sbjct: 361 FTHSGWNSALEGITNGVPMVVWPLYAEQRLNATMLAEEIGVAVRSKELPTKALIEREEIA 420

Query: 421 AMVRKIMVEEDDEGKAIRAKAKELQRSAAKALGEGGSSHHNFARVVKLFGC 472
           AMVRKIMVEEDDEGKAIRAKAKELQRSA KAL EGGSS+HNFARVVKLFGC
Sbjct: 421 AMVRKIMVEEDDEGKAIRAKAKELQRSAEKALAEGGSSYHNFARVVKLFGC 471

BLAST of CSPI04G04890 vs. ExPASy TrEMBL
Match: A0A1S3BX03 (Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103494390 PE=3 SV=1)

HSP 1 Score: 860.9 bits (2223), Expect = 2.5e-246
Identity = 441/468 (94.23%), Postives = 450/468 (96.15%), Query Frame = 0

Query: 1   MESAAHVALISSPGMGHLFPALELATRLSTRHRLTVTVFIVPSHSSSAENKVIAAAQAAG 60
           MESAAHVALISSPGMGHLFPALELATRLST HRLTVTVFIVPSHSSSAENKVIA AQAAG
Sbjct: 1   MESAAHVALISSPGMGHLFPALELATRLSTHHRLTVTVFIVPSHSSSAENKVIATAQAAG 60

Query: 61  LFTVVELPPADMSDVTESSVVGRLAITMRRHVPILRSAVSAMTSPPSVLIADIFSIESFA 120
           LFTVVELPPADMSDVTESS+VGRLAITMRRHVPI RSAVSAMTSPPSVLIADIF++ESFA
Sbjct: 61  LFTVVELPPADMSDVTESSIVGRLAITMRRHVPIFRSAVSAMTSPPSVLIADIFAVESFA 120

Query: 121 VADEFDMKKYAFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCESVRPCDVIDP 180
           VADEFDM KY FVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCE VRPCDVIDP
Sbjct: 121 VADEFDMAKYTFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCEPVRPCDVIDP 180

Query: 181 LLDRTEQQYFEILKLGMGIASSDGVLVNTWDELQDRTLASLNDRNLLGKISPPVYSIGPI 240
           LLDRTE QY EILKLGMGIASSDGVLVNTWDELQ RTLASLNDR LLGKISPPVYSIGPI
Sbjct: 181 LLDRTELQYSEILKLGMGIASSDGVLVNTWDELQHRTLASLNDRYLLGKISPPVYSIGPI 240

Query: 241 VRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWVV 300
           VRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMS+QRFVWVV
Sbjct: 241 VRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSKQRFVWVV 300

Query: 301 RAPKVRSDGAFFTTGDESEEQSLAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGGF 360
           RAPKVRSDGA+FTTGD SEEQS AKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGGF
Sbjct: 301 RAPKVRSDGAYFTTGDGSEEQSSAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGGF 360

Query: 361 FSHSGWNSALESITNGVPMVVWPLYAEQRMNATMLTEEIGVGVRSKELPTNALIEREEIA 420
           F+HSGWNSALE ITNGVPMVVWPLYAEQR+NATML EEIGV VRSKELPT ALIEREEIA
Sbjct: 361 FTHSGWNSALEGITNGVPMVVWPLYAEQRLNATMLAEEIGVAVRSKELPTKALIEREEIA 420

Query: 421 AMVRKIMVEEDDEGKAIRAKAKELQRSAAKALGEGGSSHHNFARVVKL 469
           AMVRKIMVEEDDEGKAIRAKAKELQRSA KAL EGGSS+HNFARVVK+
Sbjct: 421 AMVRKIMVEEDDEGKAIRAKAKELQRSAEKALAEGGSSYHNFARVVKI 468

BLAST of CSPI04G04890 vs. ExPASy TrEMBL
Match: A0A6J1EP22 (Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111436081 PE=3 SV=1)

HSP 1 Score: 760.0 bits (1961), Expect = 5.9e-216
Identity = 391/468 (83.55%), Postives = 415/468 (88.68%), Query Frame = 0

Query: 1   MESAAHVALISSPGMGHLFPALELATRLSTRHRLTVTVFIVPSHSSSAENKVIAAAQAAG 60
           MES  HVAL+SSPGMGHLFP+LELATRLS RH L+VTVFIVPS SSSAENKVIAAAQAAG
Sbjct: 1   MESQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSAENKVIAAAQAAG 60

Query: 61  LFTVVELPPADMSDVTESSVVGRLAITMRRHVPILRSAVSAMTSPPSVLIADIFSIESFA 120
           LFTV+ELPPADMSDVTES+VVGRL ITMRRHVP LRSAVS +T+ PSVLIADIF+ ESFA
Sbjct: 61  LFTVIELPPADMSDVTESAVVGRLCITMRRHVPALRSAVSTLTTLPSVLIADIFATESFA 120

Query: 121 VADEFDMKKYAFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCESVRPCDVIDP 180
           VADEF M KY FVASNAWFLA  +Y  V D++I GQYVDQKEPL IPGCE VRPCDVIDP
Sbjct: 121 VADEFHMAKYVFVASNAWFLAFTIYVPVLDKQITGQYVDQKEPLYIPGCEPVRPCDVIDP 180

Query: 181 LLDRTEQQYFEILKLGMGIASSDGVLVNTWDELQDRTLASLNDRNLLGKI-SPPVYSIGP 240
           LLDRTE QYFE +++GM I SSDGVLVNTWD+LQ RTLAS  DRNLLG+I + PVYSIGP
Sbjct: 181 LLDRTEPQYFEYVRIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMNSPVYSIGP 240

Query: 241 IVRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWV 300
           IVRQ G KKGGSSELFNWLSKQP ESVIYVSFGSGGTLS EQMTEVAHGLEMS QRFVWV
Sbjct: 241 IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 300

Query: 301 VRAPKVRSDGAFFTTGDESEEQSLAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360
           VRAPKVRSD  FFTTGD SE+QS AKFLP+GFLERTSEVGFVVSMWADQTAVLGSPAVGG
Sbjct: 301 VRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360

Query: 361 FFSHSGWNSALESITNGVPMVVWPLYAEQRMNATMLTEEIGVGVRSKELPTNALIEREEI 420
           FF+H GWNSALE ITNGVPMVVWPLYAEQRMNATML EE+ V VR KELPT A+I REEI
Sbjct: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI 420

Query: 421 AAMVRKIMVEEDDEGKAIRAKAKELQRSAAKALGEGGSSHHNFARVVK 468
           AAMVRKIM EED+EGKAIRAKAKELQRSA K+  EGGSS  NFARVVK
Sbjct: 421 AAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVK 468

BLAST of CSPI04G04890 vs. ExPASy TrEMBL
Match: A0A6J1J726 (Glycosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111483150 PE=3 SV=1)

HSP 1 Score: 748.0 bits (1930), Expect = 2.3e-212
Identity = 385/470 (81.91%), Postives = 414/470 (88.09%), Query Frame = 0

Query: 1   MESAAHVALISSPGMGHLFPALELATRLSTRHRLTVTVFIVPSHSSSAENKVIAAAQAAG 60
           MES  HVALISSPGMGHLFP+LELATRLS RH L+VTVFIVPS SSSAE KVIAAAQAAG
Sbjct: 1   MESQPHVALISSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSAEYKVIAAAQAAG 60

Query: 61  LFTVVELPPADMSDVTESSVVGRLAITMRRHVPILRSAVSAMTSPPSVLIADIFSIESFA 120
           LFTV+ELPPADMSDVT+S+VVGRL+ITMRRHVP LRSAVSA+TS PSVLIADIF+ ESFA
Sbjct: 61  LFTVIELPPADMSDVTDSTVVGRLSITMRRHVPALRSAVSALTSLPSVLIADIFATESFA 120

Query: 121 VADEFDMKKYAFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCESVRPCDVIDP 180
           VADEF M KY FVASNAWF A+ +Y  V D++I GQYVDQKEP  IPGCE VRPCDV+DP
Sbjct: 121 VADEFHMAKYVFVASNAWFFALTIYVPVLDKQINGQYVDQKEPFHIPGCEPVRPCDVMDP 180

Query: 181 LLDRTEQQYFEILKLGMGIASSDGVLVNTWDELQDRTLASLNDRNLLGKI-SPPVYSIGP 240
           LLDRTE QYFE +++G  I SSDGVLVNTWD+L+ RTLAS  D NLLG+I   PVYSIGP
Sbjct: 181 LLDRTEPQYFEYVRIGTEIPSSDGVLVNTWDDLEGRTLASFRDWNLLGRIMKSPVYSIGP 240

Query: 241 IVRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWV 300
           IVRQ G KKGG+SELFNWLSKQP ESVIYVSFGSGGTLS EQMTEVAHGLEMS QRFVWV
Sbjct: 241 IVRQTGGKKGGASELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 300

Query: 301 VRAPKVRSDGAFFTTGDESEEQSLAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360
           VRAPKVRSD  FFTTGD +E+QS AKFLP+GFLERTSEVGFVVSMWADQTAVLGSPAVGG
Sbjct: 301 VRAPKVRSDATFFTTGDGTEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360

Query: 361 FFSHSGWNSALESITNGVPMVVWPLYAEQRMNATMLTEEIGVGVRSKELPTNALIEREEI 420
           FF+H GWNSALE ITNGVPMVVWPLYAEQRMNATML EE+ V VR KELPT A+I REEI
Sbjct: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI 420

Query: 421 AAMVRKIMVEEDDEGKAIRAKAKELQRSAAKALGEGGSSHHNFARVVKLF 470
           AAMVRKIM EED+EGKAIRAKAKELQRSA K+  EGGSS  NFARVVKL+
Sbjct: 421 AAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVKLW 470

BLAST of CSPI04G04890 vs. NCBI nr
Match: XP_004149165.1 (anthocyanidin 3-O-glucosyltransferase 5 [Cucumis sativus] >KAE8637578.1 hypothetical protein CSA_017855 [Cucumis sativus])

HSP 1 Score: 913.7 bits (2360), Expect = 6.6e-262
Identity = 469/471 (99.58%), Postives = 469/471 (99.58%), Query Frame = 0

Query: 1   MESAAHVALISSPGMGHLFPALELATRLSTRHRLTVTVFIVPSHSSSAENKVIAAAQAAG 60
           MESAAHVALISSPGMGHLFPALE ATRLSTRHRLTVTVFIVPS SSSAENKVIAAAQAAG
Sbjct: 1   MESAAHVALISSPGMGHLFPALEFATRLSTRHRLTVTVFIVPSRSSSAENKVIAAAQAAG 60

Query: 61  LFTVVELPPADMSDVTESSVVGRLAITMRRHVPILRSAVSAMTSPPSVLIADIFSIESFA 120
           LFTVVELPPADMSDVTESSVVGRLAITMRRHVPILRSAVSAMTSPPSVLIADIFSIESFA
Sbjct: 61  LFTVVELPPADMSDVTESSVVGRLAITMRRHVPILRSAVSAMTSPPSVLIADIFSIESFA 120

Query: 121 VADEFDMKKYAFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCESVRPCDVIDP 180
           VADEFDMKKYAFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCESVRPCDVIDP
Sbjct: 121 VADEFDMKKYAFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCESVRPCDVIDP 180

Query: 181 LLDRTEQQYFEILKLGMGIASSDGVLVNTWDELQDRTLASLNDRNLLGKISPPVYSIGPI 240
           LLDRTEQQYFEILKLGMGIASSDGVLVNTWDELQDRTLASLNDRNLLGKISPPVYSIGPI
Sbjct: 181 LLDRTEQQYFEILKLGMGIASSDGVLVNTWDELQDRTLASLNDRNLLGKISPPVYSIGPI 240

Query: 241 VRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWVV 300
           VRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWVV
Sbjct: 241 VRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWVV 300

Query: 301 RAPKVRSDGAFFTTGDESEEQSLAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGGF 360
           RAPKVRSDGAFFTTGDESEEQSLAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGGF
Sbjct: 301 RAPKVRSDGAFFTTGDESEEQSLAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGGF 360

Query: 361 FSHSGWNSALESITNGVPMVVWPLYAEQRMNATMLTEEIGVGVRSKELPTNALIEREEIA 420
           FSHSGWNSALESITNGVPMVVWPLYAEQRMNATMLTEEIGVGVRSKELPTNALIEREEIA
Sbjct: 361 FSHSGWNSALESITNGVPMVVWPLYAEQRMNATMLTEEIGVGVRSKELPTNALIEREEIA 420

Query: 421 AMVRKIMVEEDDEGKAIRAKAKELQRSAAKALGEGGSSHHNFARVVKLFGC 472
           AMVRKIMVEEDDEGKAIRAKAKELQRSAAKALGEGGSSHHNFARVVKLFGC
Sbjct: 421 AMVRKIMVEEDDEGKAIRAKAKELQRSAAKALGEGGSSHHNFARVVKLFGC 471

BLAST of CSPI04G04890 vs. NCBI nr
Match: TYK16721.1 (anthocyanidin 3-O-glucosyltransferase 5 [Cucumis melo var. makuwa])

HSP 1 Score: 869.8 bits (2246), Expect = 1.1e-248
Identity = 445/471 (94.48%), Postives = 453/471 (96.18%), Query Frame = 0

Query: 1   MESAAHVALISSPGMGHLFPALELATRLSTRHRLTVTVFIVPSHSSSAENKVIAAAQAAG 60
           MESAAHVALISSPGMGHLFPALELATRLST HRLTVTVFIVPSHSSSAENKVIA AQAAG
Sbjct: 1   MESAAHVALISSPGMGHLFPALELATRLSTHHRLTVTVFIVPSHSSSAENKVIATAQAAG 60

Query: 61  LFTVVELPPADMSDVTESSVVGRLAITMRRHVPILRSAVSAMTSPPSVLIADIFSIESFA 120
           LFTVVELPPADMSDVTESS+VGRLAITMRRHVPI RSAVSAMTSPPSVLIADIF++ESFA
Sbjct: 61  LFTVVELPPADMSDVTESSIVGRLAITMRRHVPIFRSAVSAMTSPPSVLIADIFAVESFA 120

Query: 121 VADEFDMKKYAFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCESVRPCDVIDP 180
           VADEFDM KY FVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCE VRPCDVIDP
Sbjct: 121 VADEFDMAKYTFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCEPVRPCDVIDP 180

Query: 181 LLDRTEQQYFEILKLGMGIASSDGVLVNTWDELQDRTLASLNDRNLLGKISPPVYSIGPI 240
           LLDRTE QY EILKLGMGIASSDGVLVNTWDELQ RTLASLNDR LLGKISPPVYSIGPI
Sbjct: 181 LLDRTELQYSEILKLGMGIASSDGVLVNTWDELQHRTLASLNDRYLLGKISPPVYSIGPI 240

Query: 241 VRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWVV 300
           VRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMS+QRFVWVV
Sbjct: 241 VRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSKQRFVWVV 300

Query: 301 RAPKVRSDGAFFTTGDESEEQSLAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGGF 360
           RAPKVRSDGA+FTTGD SEEQS AKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGGF
Sbjct: 301 RAPKVRSDGAYFTTGDGSEEQSSAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGGF 360

Query: 361 FSHSGWNSALESITNGVPMVVWPLYAEQRMNATMLTEEIGVGVRSKELPTNALIEREEIA 420
           F+HSGWNSALE ITNGVPMVVWPLYAEQR+NATML EEIGV VRSKELPT ALIEREEIA
Sbjct: 361 FTHSGWNSALEGITNGVPMVVWPLYAEQRLNATMLAEEIGVAVRSKELPTKALIEREEIA 420

Query: 421 AMVRKIMVEEDDEGKAIRAKAKELQRSAAKALGEGGSSHHNFARVVKLFGC 472
           AMVRKIMVEEDDEGKAIRAKAKELQRSA KAL EGGSS+HNFARVVKLFGC
Sbjct: 421 AMVRKIMVEEDDEGKAIRAKAKELQRSAEKALAEGGSSYHNFARVVKLFGC 471

BLAST of CSPI04G04890 vs. NCBI nr
Match: XP_008453746.1 (PREDICTED: anthocyanidin 3-O-glucosyltransferase 5, partial [Cucumis melo])

HSP 1 Score: 860.9 bits (2223), Expect = 5.1e-246
Identity = 441/468 (94.23%), Postives = 450/468 (96.15%), Query Frame = 0

Query: 1   MESAAHVALISSPGMGHLFPALELATRLSTRHRLTVTVFIVPSHSSSAENKVIAAAQAAG 60
           MESAAHVALISSPGMGHLFPALELATRLST HRLTVTVFIVPSHSSSAENKVIA AQAAG
Sbjct: 1   MESAAHVALISSPGMGHLFPALELATRLSTHHRLTVTVFIVPSHSSSAENKVIATAQAAG 60

Query: 61  LFTVVELPPADMSDVTESSVVGRLAITMRRHVPILRSAVSAMTSPPSVLIADIFSIESFA 120
           LFTVVELPPADMSDVTESS+VGRLAITMRRHVPI RSAVSAMTSPPSVLIADIF++ESFA
Sbjct: 61  LFTVVELPPADMSDVTESSIVGRLAITMRRHVPIFRSAVSAMTSPPSVLIADIFAVESFA 120

Query: 121 VADEFDMKKYAFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCESVRPCDVIDP 180
           VADEFDM KY FVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCE VRPCDVIDP
Sbjct: 121 VADEFDMAKYTFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCEPVRPCDVIDP 180

Query: 181 LLDRTEQQYFEILKLGMGIASSDGVLVNTWDELQDRTLASLNDRNLLGKISPPVYSIGPI 240
           LLDRTE QY EILKLGMGIASSDGVLVNTWDELQ RTLASLNDR LLGKISPPVYSIGPI
Sbjct: 181 LLDRTELQYSEILKLGMGIASSDGVLVNTWDELQHRTLASLNDRYLLGKISPPVYSIGPI 240

Query: 241 VRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWVV 300
           VRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMS+QRFVWVV
Sbjct: 241 VRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSKQRFVWVV 300

Query: 301 RAPKVRSDGAFFTTGDESEEQSLAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGGF 360
           RAPKVRSDGA+FTTGD SEEQS AKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGGF
Sbjct: 301 RAPKVRSDGAYFTTGDGSEEQSSAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGGF 360

Query: 361 FSHSGWNSALESITNGVPMVVWPLYAEQRMNATMLTEEIGVGVRSKELPTNALIEREEIA 420
           F+HSGWNSALE ITNGVPMVVWPLYAEQR+NATML EEIGV VRSKELPT ALIEREEIA
Sbjct: 361 FTHSGWNSALEGITNGVPMVVWPLYAEQRLNATMLAEEIGVAVRSKELPTKALIEREEIA 420

Query: 421 AMVRKIMVEEDDEGKAIRAKAKELQRSAAKALGEGGSSHHNFARVVKL 469
           AMVRKIMVEEDDEGKAIRAKAKELQRSA KAL EGGSS+HNFARVVK+
Sbjct: 421 AMVRKIMVEEDDEGKAIRAKAKELQRSAEKALAEGGSSYHNFARVVKI 468

BLAST of CSPI04G04890 vs. NCBI nr
Match: XP_038880693.1 (anthocyanidin 3-O-glucosyltransferase 5-like [Benincasa hispida])

HSP 1 Score: 807.4 bits (2084), Expect = 6.7e-230
Identity = 413/471 (87.69%), Postives = 435/471 (92.36%), Query Frame = 0

Query: 1   MESAAHVALISSPGMGHLFPALELATRLSTRHRLTVTVFIVPSHSSSAENKVIAAAQAAG 60
           M+S  HVALISSPGMGHLFP+LELATRLSTRH LTVTVFIVPSHSS+AENKVIAAA+AAG
Sbjct: 1   MDSQTHVALISSPGMGHLFPSLELATRLSTRHHLTVTVFIVPSHSSNAENKVIAAAEAAG 60

Query: 61  LFTVVELPPADMSDVTESSVVGRLAITMRRHVPILRSAVSAMTSPPSVLIADIFSIESFA 120
           LFTVVELPPADMSDVT+SSVVGRLAITMRRHVPILRSAVSA+TS PSVLIADIF+ ESFA
Sbjct: 61  LFTVVELPPADMSDVTDSSVVGRLAITMRRHVPILRSAVSALTSLPSVLIADIFATESFA 120

Query: 121 VADEFDMKKYAFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCESVRPCDVIDP 180
           VADEF M KY FVASNAWFLA+ VYAQVWD++IVGQYVDQKEPLQIPGCE VRPCDVIDP
Sbjct: 121 VADEFHMAKYVFVASNAWFLALTVYAQVWDKQIVGQYVDQKEPLQIPGCEPVRPCDVIDP 180

Query: 181 LLDRTEQQYFEILKLGMGIASSDGVLVNTWDELQDRTLASLNDRNLLGKI-SPPVYSIGP 240
           LLDRT+ QYFEI+K+GMGIAS DGVLVNTWD+LQ RTLAS  DRNLLGKI  PPVYSIGP
Sbjct: 181 LLDRTQPQYFEIVKVGMGIASCDGVLVNTWDDLQGRTLASFRDRNLLGKIMKPPVYSIGP 240

Query: 241 IVRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWV 300
           IVRQ GSKKGGSSELFNWLSKQP+ESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWV
Sbjct: 241 IVRQSGSKKGGSSELFNWLSKQPTESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWV 300

Query: 301 VRAPKVRSDGAFFTTGDESEEQSLAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360
           VRAPKVRSDGA+FTTGD SEEQS  KFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGG
Sbjct: 301 VRAPKVRSDGAYFTTGDGSEEQSAGKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360

Query: 361 FFSHSGWNSALESITNGVPMVVWPLYAEQRMNATMLTEEIGVGVRSKELPTNALIEREEI 420
           FF+H GWNSALE ITNGVPMVVWPLYAEQR+NATML EE+ V VR KELPT A+I REEI
Sbjct: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRLNATMLAEEVRVAVRPKELPTKAVIGREEI 420

Query: 421 AAMVRKIMVEEDDEGKAIRAKAKELQRSAAKALGEGGSSHHNFARVVKLFG 471
           AAMVRKIM EED+EGKAIRAKAKELQRSA  A  E GSS+ NFARVVKLFG
Sbjct: 421 AAMVRKIMAEEDEEGKAIRAKAKELQRSAENASAEDGSSYENFARVVKLFG 471

BLAST of CSPI04G04890 vs. NCBI nr
Match: XP_022929544.1 (anthocyanidin 3-O-glucosyltransferase 5-like [Cucurbita moschata])

HSP 1 Score: 760.0 bits (1961), Expect = 1.2e-215
Identity = 391/468 (83.55%), Postives = 415/468 (88.68%), Query Frame = 0

Query: 1   MESAAHVALISSPGMGHLFPALELATRLSTRHRLTVTVFIVPSHSSSAENKVIAAAQAAG 60
           MES  HVAL+SSPGMGHLFP+LELATRLS RH L+VTVFIVPS SSSAENKVIAAAQAAG
Sbjct: 1   MESQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSAENKVIAAAQAAG 60

Query: 61  LFTVVELPPADMSDVTESSVVGRLAITMRRHVPILRSAVSAMTSPPSVLIADIFSIESFA 120
           LFTV+ELPPADMSDVTES+VVGRL ITMRRHVP LRSAVS +T+ PSVLIADIF+ ESFA
Sbjct: 61  LFTVIELPPADMSDVTESAVVGRLCITMRRHVPALRSAVSTLTTLPSVLIADIFATESFA 120

Query: 121 VADEFDMKKYAFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCESVRPCDVIDP 180
           VADEF M KY FVASNAWFLA  +Y  V D++I GQYVDQKEPL IPGCE VRPCDVIDP
Sbjct: 121 VADEFHMAKYVFVASNAWFLAFTIYVPVLDKQITGQYVDQKEPLYIPGCEPVRPCDVIDP 180

Query: 181 LLDRTEQQYFEILKLGMGIASSDGVLVNTWDELQDRTLASLNDRNLLGKI-SPPVYSIGP 240
           LLDRTE QYFE +++GM I SSDGVLVNTWD+LQ RTLAS  DRNLLG+I + PVYSIGP
Sbjct: 181 LLDRTEPQYFEYVRIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMNSPVYSIGP 240

Query: 241 IVRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWV 300
           IVRQ G KKGGSSELFNWLSKQP ESVIYVSFGSGGTLS EQMTEVAHGLEMS QRFVWV
Sbjct: 241 IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 300

Query: 301 VRAPKVRSDGAFFTTGDESEEQSLAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360
           VRAPKVRSD  FFTTGD SE+QS AKFLP+GFLERTSEVGFVVSMWADQTAVLGSPAVGG
Sbjct: 301 VRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360

Query: 361 FFSHSGWNSALESITNGVPMVVWPLYAEQRMNATMLTEEIGVGVRSKELPTNALIEREEI 420
           FF+H GWNSALE ITNGVPMVVWPLYAEQRMNATML EE+ V VR KELPT A+I REEI
Sbjct: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI 420

Query: 421 AAMVRKIMVEEDDEGKAIRAKAKELQRSAAKALGEGGSSHHNFARVVK 468
           AAMVRKIM EED+EGKAIRAKAKELQRSA K+  EGGSS  NFARVVK
Sbjct: 421 AAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVK 468

BLAST of CSPI04G04890 vs. TAIR 10
Match: AT2G18570.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 416.0 bits (1068), Expect = 4.0e-116
Identity = 222/461 (48.16%), Postives = 310/461 (67.25%), Query Frame = 0

Query: 6   HVALISSPGMGHLFPALELATRLSTRHRLTVTVFIVPSHSSS-AENKVIAAAQAAGLFTV 65
           H  L++SPG+GHL P LEL  RLS+   + VT+  V S SSS  E + I AA A  +  +
Sbjct: 5   HALLVASPGLGHLIPILELGNRLSSVLNIHVTILAVTSGSSSPTETEAIHAAAARTICQI 64

Query: 66  VELPPADMSDVTE--SSVVGRLAITMRRHVPILRSAVSAMTSPPSVLIADIFSIESFAVA 125
            E+P  D+ ++ E  +++  ++ + MR   P +R AV  M   P+V+I D    E  +VA
Sbjct: 65  TEIPSVDVDNLVEPDATIFTKMVVKMRAMKPAVRDAVKLMKRKPTVMIVDFLGTELMSVA 124

Query: 126 DEFDM-KKYAFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCESVRPCDVIDPL 185
           D+  M  KY +V ++AWFLAVMVY  V D  + G+YVD KEPL+IPGC+ V P ++++ +
Sbjct: 125 DDVGMTAKYVYVPTHAWFLAVMVYLPVLDTVVEGEYVDIKEPLKIPGCKPVGPKELMETM 184

Query: 186 LDRTEQQYFEILKLGMGIASSDGVLVNTWDELQDRTLASLNDRNLLGKI-SPPVYSIGPI 245
           LDR+ QQY E ++ G+ +  SDGVLVNTW+ELQ  TLA+L +   L ++   PVY IGPI
Sbjct: 185 LDRSGQQYKECVRAGLEVPMSDGVLVNTWEELQGNTLAALREDEELSRVMKVPVYPIGPI 244

Query: 246 VRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWVV 305
           VR         + +F WL +Q   SV++V  GSGGTL+FEQ  E+A GLE+S QRFVWV+
Sbjct: 245 VR-TNQHVDKPNSIFEWLDEQRERSVVFVCLGSGGTLTFEQTVELALGLELSGQRFVWVL 304

Query: 306 RAPKVRSDGAFFTTGDESEEQSLAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGGF 365
           R P      A +     S+++ ++  LPEGFL+RT  VG VV+ WA Q  +L   ++GGF
Sbjct: 305 RRP------ASYLGAISSDDEQVSASLPEGFLDRTRGVGIVVTQWAPQVEILSHRSIGGF 364

Query: 366 FSHSGWNSALESITNGVPMVVWPLYAEQRMNATMLTEEIGVGVRSKELPTNALIEREEIA 425
            SH GW+SALES+T GVP++ WPLYAEQ MNAT+LTEEIGV VR+ ELP+  +I REE+A
Sbjct: 365 LSHCGWSSALESLTKGVPIIAWPLYAEQWMNATLLTEEIGVAVRTSELPSERVIGREEVA 424

Query: 426 AMVRKIMVEEDDEGKAIRAKAKELQRSAAKALGEGGSSHHN 462
           ++VRKIM EED+EG+ IRAKA+E++ S+ +A  + GSS+++
Sbjct: 425 SLVRKIMAEEDEEGQKIRAKAEEVRVSSERAWSKDGSSYNS 458

BLAST of CSPI04G04890 vs. TAIR 10
Match: AT3G50740.1 (UDP-glucosyl transferase 72E1 )

HSP 1 Score: 399.4 bits (1025), Expect = 3.9e-111
Identity = 210/466 (45.06%), Postives = 306/466 (65.67%), Query Frame = 0

Query: 6   HVALISSPGMGHLFPALELATRLSTRHRLTVTVFIVPSHSSSAENKVIAAAQA-AGLFTV 65
           HVA+ +SPGMGH+ P +EL  RL+  H   VT+F++ + ++SA+++ + +    A L  +
Sbjct: 7   HVAMFASPGMGHIIPVIELGKRLAGSHGFDVTIFVLETDAASAQSQFLNSPGCDAALVDI 66

Query: 66  VELPPADMSDVTESSVVG--RLAITMRRHVPILRSAVSAMTSPPSVLIADIFSIESFAVA 125
           V LP  D+S + + S     +L + MR  +P +RS +  M   P+ LI D+F +++  + 
Sbjct: 67  VGLPTPDISGLVDPSAFFGIKLLVMMRETIPTIRSKIEEMQHKPTALIVDLFGLDAIPLG 126

Query: 126 DEFDMKKYAFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCESVRPCDVIDPLL 185
            EF+M  Y F+ASNA FLAV ++    D+++  +++ +K+P+ +PGCE VR  D ++  L
Sbjct: 127 GEFNMLTYIFIASNARFLAVALFFPTLDKDMEEEHIIKKQPMVMPGCEPVRFEDTLETFL 186

Query: 186 DRTEQQYFEILKLGMGIASSDGVLVNTWDELQDRTLASLNDRNLLGKIS-PPVYSIGPIV 245
           D   Q Y E +  G    + DG++VNTWD+++ +TL SL D  LLG+I+  PVY IGP+ 
Sbjct: 187 DPNSQLYREFVPFGSVFPTCDGIIVNTWDDMEPKTLKSLQDPKLLGRIAGVPVYPIGPLS 246

Query: 246 RQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWVVR 305
           R P      +  + +WL+KQP ESV+Y+SFGSGG+LS +Q+TE+A GLEMS+QRFVWVVR
Sbjct: 247 R-PVDPSKTNHPVLDWLNKQPDESVLYISFGSGGSLSAKQLTELAWGLEMSQQRFVWVVR 306

Query: 306 APKVRSD-GAFFTTGDESEEQSLAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGGF 365
            P   S   A+ +            +LPEGF+ RT E GF+VS WA Q  +L   AVGGF
Sbjct: 307 PPVDGSACSAYLSANSGKIRDGTPDYLPEGFVSRTHERGFMVSSWAPQAEILAHQAVGGF 366

Query: 366 FSHSGWNSALESITNGVPMVVWPLYAEQRMNATMLTEEIGVGVRSKELPTNALIEREEIA 425
            +H GWNS LES+  GVPM+ WPL+AEQ MNAT+L EE+GV VRSK+LP+  +I R EI 
Sbjct: 367 LTHCGWNSILESVVGGVPMIAWPLFAEQMMNATLLNEELGVAVRSKKLPSEGVITRAEIE 426

Query: 426 AMVRKIMVEEDDEGKAIRAKAKELQRSAAKALG-EGGSSHHNFARV 466
           A+VRKIMVEE  EG  +R K K+L+ +AA++L  +GG +H + +R+
Sbjct: 427 ALVRKIMVEE--EGAEMRKKIKKLKETAAESLSCDGGVAHESLSRI 469

BLAST of CSPI04G04890 vs. TAIR 10
Match: AT5G26310.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 373.2 bits (957), Expect = 3.0e-103
Identity = 202/468 (43.16%), Postives = 302/468 (64.53%), Query Frame = 0

Query: 6   HVALISSPGMGHLFPALELATRLSTRHRLTVTVFIVPSHSSSAENKVIAAAQAAGLFTVV 65
           H A+ SSPGMGH+ P +ELA RLS  H   VTVF++ + ++S ++K++    + G+  +V
Sbjct: 7   HAAMFSSPGMGHVLPVIELAKRLSANHGFHVTVFVLETDAASVQSKLL---NSTGV-DIV 66

Query: 66  ELPPADMSDVTE--SSVVGRLAITMRRHVPILRSAVSAMTSPPSVLIADIFSIESFAVAD 125
            LP  D+S + +  + VV ++ + MR  VP LRS + AM   P+ LI D+F  ++  +A 
Sbjct: 67  NLPSPDISGLVDPNAHVVTKIGVIMREAVPTLRSKIVAMHQNPTALIIDLFGTDALCLAA 126

Query: 126 EFDMKKYAFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCESVRPCDVIDPLLD 185
           E +M  Y F+ASNA +L V +Y    D  I  ++  Q++PL IPGCE VR  D++D  L 
Sbjct: 127 ELNMLTYVFIASNARYLGVSIYYPTLDEVIKEEHTVQRKPLTIPGCEPVRFEDIMDAYLV 186

Query: 186 RTEQQYFEILKLGMGIASSDGVLVNTWDELQDRTLASLNDRNLLGKIS-PPVYSIGPIVR 245
             E  Y ++++  +    +DG+LVNTW+E++ ++L SL D  LLG+++  PVY +GP+ R
Sbjct: 187 PDEPVYHDLVRHCLAYPKADGILVNTWEEMEPKSLKSLQDPKLLGRVARVPVYPVGPLCR 246

Query: 246 QPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWVVRA 305
            P         +F+WL+KQP+ESV+Y+SFGSGG+L+ +Q+TE+A GLE S+QRF+WVVR 
Sbjct: 247 -PIQSSTTDHPVFDWLNKQPNESVLYISFGSGGSLTAQQLTELAWGLEESQQRFIWVVRP 306

Query: 306 PKVRSD-GAFFTTGDESEEQSLAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGGFF 365
           P   S    +F+      + +  ++LPEGF+ RT + GF++  WA Q  +L   AVGGF 
Sbjct: 307 PVDGSSCSDYFSAKGGVTKDNTPEYLPEGFVTRTCDRGFMIPSWAPQAEILAHQAVGGFL 366

Query: 366 SHSGWNSALESITNGVPMVVWPLYAEQRMNATMLTEEIGVGVRSKELPTNALIEREEIAA 425
           +H GW+S LES+  GVPM+ WPL+AEQ MNA +L++E+G+ VR  + P  A I R +I A
Sbjct: 367 THCGWSSTLESVLCGVPMIAWPLFAEQNMNAALLSDELGISVRVDD-PKEA-ISRSKIEA 426

Query: 426 MVRKIMVEEDDEGKAIRAKAKELQRSAAKALG--EGGSSHHNFARVVK 468
           MVRK+M E  DEG+ +R K K+L+ +A  +L    GGS+H +  RV K
Sbjct: 427 MVRKVMAE--DEGEEMRRKVKKLRDTAEMSLSIHGGGSAHESLCRVTK 465

BLAST of CSPI04G04890 vs. TAIR 10
Match: AT5G66690.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 366.7 bits (940), Expect = 2.8e-101
Identity = 202/471 (42.89%), Postives = 300/471 (63.69%), Query Frame = 0

Query: 6   HVALISSPGMGHLFPALELATRLSTRHRLTVTVFIVPSHSSSAENKVIAAAQAAGLFTVV 65
           H A+ SSPGMGH+ P +EL  RLS  +   VTVF++ + ++SA++K +    + G+  +V
Sbjct: 7   HAAMFSSPGMGHVIPVIELGKRLSANNGFHVTVFVLETDAASAQSKFL---NSTGV-DIV 66

Query: 66  ELPPADMSDVT--ESSVVGRLAITMRRHVPILRSAVSAMTSPPSVLIADIFSIESFAVAD 125
           +LP  D+  +   +  VV ++ + MR  VP LRS ++AM   P+ LI D+F  ++  +A 
Sbjct: 67  KLPSPDIYGLVDPDDHVVTKIGVIMRAAVPALRSKIAAMHQKPTALIVDLFGTDALCLAK 126

Query: 126 EFDMKKYAFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCESVRPCDVIDPLLD 185
           EF+M  Y F+ +NA FL V +Y    D++I  ++  Q+ PL IPGCE VR  D +D  L 
Sbjct: 127 EFNMLSYVFIPTNARFLGVSIYYPNLDKDIKEEHTVQRNPLAIPGCEPVRFEDTLDAYLV 186

Query: 186 RTEQQYFEILKLGMGIASSDGVLVNTWDELQDRTLASLNDRNLLGKIS-PPVYSIGPIVR 245
             E  Y + ++ G+    +DG+LVNTW+E++ ++L SL +  LLG+++  PVY IGP+ R
Sbjct: 187 PDEPVYRDFVRHGLAYPKADGILVNTWEEMEPKSLKSLLNPKLLGRVARVPVYPIGPLCR 246

Query: 246 QPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWVVRA 305
            P         + +WL++QP+ESV+Y+SFGSGG LS +Q+TE+A GLE S+QRFVWVVR 
Sbjct: 247 -PIQSSETDHPVLDWLNEQPNESVLYISFGSGGCLSAKQLTELAWGLEQSQQRFVWVVRP 306

Query: 306 PKVRSDGA----FFTTGDESEEQSLAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVG 365
           P    DG+    + +      E +  ++LPEGF+ RTS+ GFVV  WA Q  +L   AVG
Sbjct: 307 P---VDGSCCSEYVSANGGGTEDNTPEYLPEGFVSRTSDRGFVVPSWAPQAEILSHRAVG 366

Query: 366 GFFSHSGWNSALESITNGVPMVVWPLYAEQRMNATMLTEEIGVGVRSKELPTNALIEREE 425
           GF +H GW+S LES+  GVPM+ WPL+AEQ MNA +L++E+G+ VR  +   +  I R +
Sbjct: 367 GFLTHCGWSSTLESVVGGVPMIAWPLFAEQNMNAALLSDELGIAVRLDDPKED--ISRWK 426

Query: 426 IAAMVRKIMVEEDDEGKAIRAKAKELQRSAAKALG--EGGSSHHNFARVVK 468
           I A+VRK+M E+  EG+A+R K K+L+ SA  +L    GG +H +  RV K
Sbjct: 427 IEALVRKVMTEK--EGEAMRRKVKKLRDSAEMSLSIDGGGLAHESLCRVTK 465

BLAST of CSPI04G04890 vs. TAIR 10
Match: AT2G18560.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 355.5 bits (911), Expect = 6.5e-98
Identity = 184/384 (47.92%), Postives = 255/384 (66.41%), Query Frame = 0

Query: 88  MRRHVPILRSAVSAMTSPPSVLIADIFSIESFAVADEFDMKKYAFVASNAWFLAVMVYAQ 147
           MR     +R AV +M   P+V+I D F     ++ D     KY ++ S+AWFLA++VY  
Sbjct: 1   MREMKSTVRDAVKSMKQKPTVMIVDFFGTALLSITDVGVTSKYVYIPSHAWFLALIVYLP 60

Query: 148 VWDREIVGQYVDQKEPLQIPGCESVRPCDVIDPLLDRTEQQYFEILKLGMGIASSDGVLV 207
           V D+ + G+YVD KEP++IPGC+ V P +++D +LDR++QQY + +++G+ I  SDGVLV
Sbjct: 61  VLDKVMEGEYVDIKEPMKIPGCKPVGPKELLDTMLDRSDQQYRDCVQIGLEIPMSDGVLV 120

Query: 208 NTWDELQDRTLASL-NDRNLLGKISPPVYSIGPIVRQPGSKKGGSSELFNWLSKQPSESV 267
           NTW ELQ +TLA+L  D +L   I  PVY IGPIVR     +  +S  F WL KQ   SV
Sbjct: 121 NTWGELQGKTLAALREDIDLNRVIKVPVYPIGPIVRTNVLIEKPNS-TFEWLDKQEERSV 180

Query: 268 IYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWVVRAPKVRSDGAFFTTGDESEEQSLAKF 327
           +YV  GSGGTLSFEQ  E+A GLE+S Q F+WV+R P        +      ++  ++  
Sbjct: 181 VYVCLGSGGTLSFEQTMELAWGLELSCQSFLWVLRKP------PSYLGASSKDDDQVSDG 240

Query: 328 LPEGFLERTSEVGFVVSMWADQTAVLGSPAVGGFFSHSGWNSALESITNGVPMVVWPLYA 387
           LPEGFL+RT  VG VV+ WA Q  +L   ++GGF SH GW+S LES+T GVP++ WPLYA
Sbjct: 241 LPEGFLDRTRGVGLVVTQWAPQVEILSHRSIGGFLSHCGWSSVLESLTKGVPIIAWPLYA 300

Query: 388 EQRMNATMLTEEIGVGVRSKELPTNALIEREEIAAMVRKIMVEEDDEGKAIRAKAKELQR 447
           EQ MNAT+LTEEIG+ +R+ ELP+  +I REE+A++V+KI+ EED EG+ I+ KA+E++ 
Sbjct: 301 EQWMNATLLTEEIGMAIRTSELPSKKVISREEVASLVKKIVAEEDKEGRKIKTKAEEVRV 360

Query: 448 SAAKALGEGGSSHHNFARVVKLFG 471
           S+ +A   GGSSH +     K  G
Sbjct: 361 SSERAWTHGGSSHSSLFEWAKRCG 377

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q402871.0e-12750.87Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta OX=3983 GN=GT5 PE=2... [more]
Q9ZU725.7e-11548.16UDP-glycosyltransferase 72D1 OS=Arabidopsis thaliana OX=3702 GN=UGT72D1 PE=2 SV=... [more]
Q94A845.5e-11045.06UDP-glycosyltransferase 72E1 OS=Arabidopsis thaliana OX=3702 GN=UGT72E1 PE=1 SV=... [more]
O814984.2e-10243.16UDP-glycosyltransferase 72E3 OS=Arabidopsis thaliana OX=3702 GN=UGT72E3 PE=1 SV=... [more]
Q9LVR14.0e-10042.89UDP-glycosyltransferase 72E2 OS=Arabidopsis thaliana OX=3702 GN=UGT72E2 PE=1 SV=... [more]
Match NameE-valueIdentityDescription
A0A0A0KWI63.2e-26299.58Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_4G038620 PE=3 SV=1[more]
A0A5D3CXP25.3e-24994.48Glycosyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G0... [more]
A0A1S3BX032.5e-24694.23Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103494390 PE=3 SV=1[more]
A0A6J1EP225.9e-21683.55Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111436081 PE=3 SV=1[more]
A0A6J1J7262.3e-21281.91Glycosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111483150 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
XP_004149165.16.6e-26299.58anthocyanidin 3-O-glucosyltransferase 5 [Cucumis sativus] >KAE8637578.1 hypothet... [more]
TYK16721.11.1e-24894.48anthocyanidin 3-O-glucosyltransferase 5 [Cucumis melo var. makuwa][more]
XP_008453746.15.1e-24694.23PREDICTED: anthocyanidin 3-O-glucosyltransferase 5, partial [Cucumis melo][more]
XP_038880693.16.7e-23087.69anthocyanidin 3-O-glucosyltransferase 5-like [Benincasa hispida][more]
XP_022929544.11.2e-21583.55anthocyanidin 3-O-glucosyltransferase 5-like [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
AT2G18570.14.0e-11648.16UDP-Glycosyltransferase superfamily protein [more]
AT3G50740.13.9e-11145.06UDP-glucosyl transferase 72E1 [more]
AT5G26310.13.0e-10343.16UDP-Glycosyltransferase superfamily protein [more]
AT5G66690.12.8e-10142.89UDP-Glycosyltransferase superfamily protein [more]
AT2G18560.16.5e-9847.92UDP-Glycosyltransferase superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3D3.40.50.2000Glycogen Phosphorylase B;coord: 252..453
e-value: 5.7E-135
score: 452.7
NoneNo IPR availableGENE3D3.40.50.2000Glycogen Phosphorylase B;coord: 6..458
e-value: 5.7E-135
score: 452.7
NoneNo IPR availablePANTHERPTHR48049GLYCOSYLTRANSFERASEcoord: 2..468
NoneNo IPR availablePANTHERPTHR48049:SF63UDP-GLYCOSYLTRANSFERASE 72C1coord: 2..468
NoneNo IPR availableSUPERFAMILY53756UDP-Glycosyltransferase/glycogen phosphorylasecoord: 5..467
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePFAMPF00201UDPGTcoord: 253..403
e-value: 7.8E-20
score: 71.1
IPR002213UDP-glucuronosyl/UDP-glucosyltransferaseCDDcd03784GT1_Gtf-likecoord: 5..463
e-value: 1.9425E-63
score: 209.33
IPR035595UDP-glycosyltransferase family, conserved sitePROSITEPS00375UDPGTcoord: 345..388

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI04G04890.1CSPI04G04890.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0008194 UDP-glycosyltransferase activity