CSPI04G21930 (gene) Wild cucumber (PI 183967)

Name: CSPI04G21930
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Glycosyltransferase
Location: Chr4 : 20409456 .. 20411864 (-)
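
The location field above can be turned into structured coordinates with a short script. The following is a minimal sketch, not part of the database's own tooling: the function name `parse_location` and the returned keys are illustrative, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

# Matches strings such as "Chr4 : 20409456 .. 20411864 (-)".
# Whitespace around ":" and ".." is optional.
LOCATION = re.compile(r"(\S+?)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Split a location string into sequence ID, start, end, strand and length."""
    match = LOCATION.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }


loc = parse_location("Chr4 : 20409456 .. 20411864 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the span works out to 2409 bp on the minus strand of Chr4.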
The following sequences are available for this feature:

Gene sequence (with intron)

CCCAAGTCTATATATATTGACATTTTGATGGAGTTCTAATTGTGCTTCACAAAAATAAATCACCACAAAATTTGGATAGAGATACCCAAAGGGTGAGACATGAAATATGCCTTAAAAAAGCCAAAGGTCATATTGGTTCCATATCCAGCTCAAGGTCATGTCACTCCTATGCTCATGCTTGCCGCTGTCTTTCATCGTCGTGGTTTTCTTCCTATCTTTCTCACTCCTAGTTACATCCATTGTCATATTTCATCGCAAGTTTCGTCTTCCGATGGAATTTTCTTTGTTTCCATGTCGGATGGCCTCGACGATAACATGCCACGTGACTTCTTCACTATTGAGGCTGCCATAGAAACCACCATGCCAGTTTGCCTTAGACAAGTTTTGAGTGAGCATAATTCTAAAGAAAGCAGTGGTGGTACTGGAGTTGTTTGTATGGTGGTCGACTTACTTGCTTCTTCGGCTATTGAAGTTGGAAATGAATTTGGAGTAACTGTGGTCGGGTTTTGGCCAGCCATGTTTGCCACCTATAAATTAATGTCAACTATTCCCGAAATGATTCAAAACAACTTCATTTCTTCTGATACAGGTTAACACTCTTTTTGTAGTTTTGTTCTGTTTTTGAAATCGAGCCAATTTTAATGAAATGAATATATATATAGCATCTTATTTTCTTAGTACAGTACATAGTATATTTTTATTATAGATTGAAAATCACTATTCTTCTCTCATTTTTTTTATAATTATAATTAAAGTTTAATCATGATAGATTTGCAATCTCTCTCTTTAAAAAAATCTCTTAGATTTTAACTTTTAATCAAATAACTTGTAAGGAATACTTTATTTACTAAAAACAAATACACTTTTGAAAGTTTCATTCCTATTTTATTCTAAAAATGTAGTCAAACAAGATGATTTTAAATATGAAAATATATATAATATAAATTGATGTGATAAATAATATCTTAAATTAGAAAAAAAAACATATTTAGTACTGTCTAACAATAAACTTCAAAAAAAATTATTAAAAAAAGCAAAAAATAAATGCTTAGTCAAACTAAGTATAGAATTTTTCTTTCTTTCATACAAAAAAAAAATATATATAATCAATTAGTCACTAACAACATATACTTATTTAAAATATTTTAACTAGATAAAGCTTATATCATCCATCATCCATCATCATTAAATGCATGATTTTCTCACGAACATTAAAAATTAAAATTGTAAGATAATTTGAAGGTTAAAGGGAATGAAATAAAAATATTACAAAAATTAAAGTAAAAAAAGTTTGGAAATGGATATGACAGGATGTCCAGAAGAAGGATCAAAACGGTGCGTTCCGAGCCAGCCGCTGTTGTCGGCGGAAGAACTACCGTGGCTGGTCGGAACGTCGTCGGCAATAAAAGGCAGGTTCAAATTCTGGAAAAGAACCATGGCTAGAGCCAGATCCGTTCATTGCCTTTTGGTTAATTCCTTCCCAGAAGAACTACTCCCTCTTCAGAAATTGATCACTAAAAGCTCCGCCGCCTCCGTGTTTCTCGTCGGACCTCTGAGCCGGCACTCGAATCCTGCAAAAACGCCGACATTCTGGGAAGAAGACGACGGGTGTGTGAAGTGGCTGGAGAAACAGAGGCCCAATTCAGTAATTTACATCTCGTTTGGGAGTTGGGTCAGCCCCATTAACGAATCGAAAGTGAGGAGCTTAGCCATGACGCTTTTGGGCTTAAAGAACCCATTCATTTGGGTTCTCAAAAACAACTGGCGAGATGGTTTGCCAATTGGATTCCAGCAAAAGGTACCATTATATTTCCTGATCATATTGAAGGTACAAGCAAGCATATAAATTAATTTTAATACATTTAATAATAATAATATCATCATATATGTAGATTCAAAGTTACGGGAGGTTGGTTTCATGGGCTCCTCAAATAGAGATTTTGAAGCACAGAGCAGTTGGTTGTTATCTAACTCACTGCGGGTGGAATTCGATCATGGAA
GCAATTCAATATGGAAAGCGACTGCTTTGTTTTCCCGTGGCCGGTGACCAATTTCTGAACTGTGGGTATGTAGTTAAAGTGTGGAGAATTGGGGTTAGGTTGAATGGGTTTGGAGAGAAAGAGGTTGAAGAAGGTATGAGGAAAGTGATGGAGGATGGAGAGATGAAGGGGAGATTTATGAAACTTCATGAGAGAATAATGGGAGAAGAGGCCAATTGTAGAGTCAACTCTAATTTCACAACCTTCATTAATGAGATCAATCTTACAAACATCTAAAGTTGGACATTCTGTCCTATAAATTATTGTAACTTTGTAAGTTGGTTTATATATATATATATATATTAAAAAAGAAAGGAACAAATTTGTTTGATTATATTAATAGTGAGAATAAATTCGAGCTAACGGTCAA

mRNA sequence

ATGAAATATGCCTTAAAAAAGCCAAAGGTCATATTGGTTCCATATCCAGCTCAAGGTCATGTCACTCCTATGCTCATGCTTGCCGCTGTCTTTCATCGTCGTGGTTTTCTTCCTATCTTTCTCACTCCTAGTTACATCCATTGTCATATTTCATCGCAAGTTTCGTCTTCCGATGGAATTTTCTTTGTTTCCATGTCGGATGGCCTCGACGATAACATGCCACGTGACTTCTTCACTATTGAGGCTGCCATAGAAACCACCATGCCAGTTTGCCTTAGACAAGTTTTGAGTGAGCATAATTCTAAAGAAAGCAGTGGTGGTACTGGAGTTGTTTGTATGGTGGTCGACTTACTTGCTTCTTCGGCTATTGAAGTTGGAAATGAATTTGGAGTAACTGTGGTCGGGTTTTGGCCAGCCATGTTTGCCACCTATAAATTAATGTCAACTATTCCCGAAATGATTCAAAACAACTTCATTTCTTCTGATACAGGATGTCCAGAAGAAGGATCAAAACGGTGCGTTCCGAGCCAGCCGCTGTTGTCGGCGGAAGAACTACCGTGGCTGGTCGGAACGTCGTCGGCAATAAAAGGCAGGTTCAAATTCTGGAAAAGAACCATGGCTAGAGCCAGATCCGTTCATTGCCTTTTGGTTAATTCCTTCCCAGAAGAACTACTCCCTCTTCAGAAATTGATCACTAAAAGCTCCGCCGCCTCCGTGTTTCTCGTCGGACCTCTGAGCCGGCACTCGAATCCTGCAAAAACGCCGACATTCTGGGAAGAAGACGACGGGTGTGTGAAGTGGCTGGAGAAACAGAGGCCCAATTCAGTAATTTACATCTCGTTTGGGAGTTGGGTCAGCCCCATTAACGAATCGAAAGTGAGGAGCTTAGCCATGACGCTTTTGGGCTTAAAGAACCCATTCATTTGGGTTCTCAAAAACAACTGGCGAGATGGTTTGCCAATTGGATTCCAGCAAAAGATTCAAAGTTACGGGAGGTTGGTTTCATGGGCTCCTCAAATAGAGATTTTGAAGCACAGAGCAGTTGGTTGTTATCTAACTCACTGCGGGTGGAATTCGATCATGGAAGCAATTCAATATGGAAAGCGACTGCTTTGTTTTCCCGTGGCCGGTGACCAATTTCTGAACTGTGGGTATGTAGTTAAAGTGTGGAGAATTGGGGTTAGGTTGAATGGGTTTGGAGAGAAAGAGGTTGAAGAAGGTATGAGGAAAGTGATGGAGGATGGAGAGATGAAGGGGAGATTTATGAAACTTCATGAGAGAATAATGGGAGAAGAGGCCAATTGTAGAGTCAACTCTAATTTCACAACCTTCATTAATGAGATCAATCTTACAAACATCTAA

Coding sequence (CDS)

ATGAAATATGCCTTAAAAAAGCCAAAGGTCATATTGGTTCCATATCCAGCTCAAGGTCATGTCACTCCTATGCTCATGCTTGCCGCTGTCTTTCATCGTCGTGGTTTTCTTCCTATCTTTCTCACTCCTAGTTACATCCATTGTCATATTTCATCGCAAGTTTCGTCTTCCGATGGAATTTTCTTTGTTTCCATGTCGGATGGCCTCGACGATAACATGCCACGTGACTTCTTCACTATTGAGGCTGCCATAGAAACCACCATGCCAGTTTGCCTTAGACAAGTTTTGAGTGAGCATAATTCTAAAGAAAGCAGTGGTGGTACTGGAGTTGTTTGTATGGTGGTCGACTTACTTGCTTCTTCGGCTATTGAAGTTGGAAATGAATTTGGAGTAACTGTGGTCGGGTTTTGGCCAGCCATGTTTGCCACCTATAAATTAATGTCAACTATTCCCGAAATGATTCAAAACAACTTCATTTCTTCTGATACAGGATGTCCAGAAGAAGGATCAAAACGGTGCGTTCCGAGCCAGCCGCTGTTGTCGGCGGAAGAACTACCGTGGCTGGTCGGAACGTCGTCGGCAATAAAAGGCAGGTTCAAATTCTGGAAAAGAACCATGGCTAGAGCCAGATCCGTTCATTGCCTTTTGGTTAATTCCTTCCCAGAAGAACTACTCCCTCTTCAGAAATTGATCACTAAAAGCTCCGCCGCCTCCGTGTTTCTCGTCGGACCTCTGAGCCGGCACTCGAATCCTGCAAAAACGCCGACATTCTGGGAAGAAGACGACGGGTGTGTGAAGTGGCTGGAGAAACAGAGGCCCAATTCAGTAATTTACATCTCGTTTGGGAGTTGGGTCAGCCCCATTAACGAATCGAAAGTGAGGAGCTTAGCCATGACGCTTTTGGGCTTAAAGAACCCATTCATTTGGGTTCTCAAAAACAACTGGCGAGATGGTTTGCCAATTGGATTCCAGCAAAAGATTCAAAGTTACGGGAGGTTGGTTTCATGGGCTCCTCAAATAGAGATTTTGAAGCACAGAGCAGTTGGTTGTTATCTAACTCACTGCGGGTGGAATTCGATCATGGAAGCAATTCAATATGGAAAGCGACTGCTTTGTTTTCCCGTGGCCGGTGACCAATTTCTGAACTGTGGGTATGTAGTTAAAGTGTGGAGAATTGGGGTTAGGTTGAATGGGTTTGGAGAGAAAGAGGTTGAAGAAGGTATGAGGAAAGTGATGGAGGATGGAGAGATGAAGGGGAGATTTATGAAACTTCATGAGAGAATAATGGGAGAAGAGGCCAATTGTAGAGTCAACTCTAATTTCACAACCTTCATTAATGAGATCAATCTTACAAACATCTAA
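
As a quick sanity check, the coding sequence above can be translated and compared with the query protein shown in the BLAST alignments. The sketch below assumes the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative, and only the first ten and last eight codons of the CDS are copied in to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First ten and last eight codons of the CDS listed above,
# written codon by codon for readability.
cds_start = "ATG AAA TAT GCC TTA AAA AAG CCA AAG GTC".replace(" ", "")
cds_end = "GAG ATC AAT CTT ACA AAC ATC TAA".replace(" ", "")

print(translate(cds_start))  # MKYALKKPKV
print(translate(cds_end))    # EINLTNI*
```

The translated start and end agree with the first and last query residues in the alignments (MKYALKKPKV... and ...EINLTNI), and the CDS closes with a TAA stop codon.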
BLAST of CSPI04G21930 vs. Swiss-Prot
Match: U82A1_ARATH (UDP-glycosyltransferase 82A1 OS=Arabidopsis thaliana GN=UGT82A1 PE=2 SV=1)

HSP 1 Score: 455.3 bits (1170), Expect = 7.9e-127
Identity = 236/466 (50.64%), Positives = 308/466 (66.09%), Query Frame = 1

Query: 6   KKPKVILVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIHCHISSQVSSSDGIFFVSM 65
           +KPK+I +PYPAQGHVTPML LA+ F  RGF P+ +TP  IH  IS+  +   GI F+++
Sbjct: 5   QKPKIIFIPYPAQGHVTPMLHLASAFLSRGFSPVVMTPESIHRRISA-TNEDLGITFLAL 64

Query: 66  SDGLD--DNMPRDFFTIEAAIETTMPVCLRQVLSEHNSKESSGGTGVVCMVVDLLASSAI 125
           SDG D  D  P DFF+IE ++E  MP  L ++L E +         V C+VVDLLAS AI
Sbjct: 65  SDGQDRPDAPPSDFFSIENSMENIMPPQLERLLLEED-------LDVACVVVDLLASWAI 124

Query: 126 EVGNEFGVTVVGFWPAMFATYKLMSTIPEMIQNNFISSDTGCPEEGSKRCV-PSQPLLSA 185
            V +  GV V GFWP MFA Y+L+  IPE+++   +S   GCP +  K  V P QPLLSA
Sbjct: 125 GVADRCGVPVAGFWPVMFAAYRLIQAIPELVRTGLVSQK-GCPRQLEKTIVQPEQPLLSA 184

Query: 186 EELPWLVGTSSAIKGRFKFWKRTMARARSVHCLLVNSFPEELLPLQ--KLITKSSA---- 245
           E+LPWL+GT  A K RFKFW+RT+ R +S+  +L +SF +E   +   K   K S     
Sbjct: 185 EDLPWLIGTPKAQKKRFKFWQRTLERTKSLRWILTSSFKDEYEDVDNHKASYKKSNDLNK 244

Query: 246 ------ASVFLVGPLSRH---SNPAKTPT-FWEEDDGCVKWLEKQRPNSVIYISFGSWVS 305
                   +  +GPL      +N   T T FWEED  C+ WL++Q PNSVIYISFGSWVS
Sbjct: 245 ENNGQNPQILHLGPLHNQEATNNITITKTSFWEEDMSCLGWLQEQNPNSVIYISFGSWVS 304

Query: 306 PINESKVRSLAMTLLGLKNPFIWVLKNNWRDGLPIGFQQKI---QSYGRLVSWAPQIEIL 365
           PI ES +++LA+ L     PF+W L   W++GLP GF  ++   ++ GR+VSWAPQ+E+L
Sbjct: 305 PIGESNIQTLALALEASGRPFLWALNRVWQEGLPPGFVHRVTITKNQGRIVSWAPQLEVL 364

Query: 366 KHRAVGCYLTHCGWNSIMEAIQYGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNGFGEKE 425
           ++ +VGCY+THCGWNS MEA+   +RLLC+PVAGDQF+NC Y+V VW+IGVRL+GFGEKE
Sbjct: 365 RNDSVGCYVTHCGWNSTMEAVASSRRLLCYPVAGDQFVNCKYIVDVWKIGVRLSGFGEKE 424

Query: 426 VEEGMRKVMEDGEMKGRFMKLHERIMGEEANCRVNSNFTTFINEIN 450
           VE+G+RKVMED +M  R  KL +R MG EA      NFT   NE+N
Sbjct: 425 VEDGLRKVMEDQDMGERLRKLRDRAMGNEARLSSEMNFTFLKNELN 461
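
The percentages in each HSP summary line follow directly from the residue counts, so they can be recomputed when parsing these reports. The sketch below is an illustration only: the function name `parse_hsp_stats` and the dictionary keys are hypothetical, and the pattern accepts both "Positives" and the "Postives" spelling that appears in some report output.

```python
import re

# Matches e.g. "Identity = 236/466 (50.64%), Positives = 308/466 (66.09%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract identity/positive counts and recompute their percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        # Recomputed from the counts, rounded to two decimals like the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 236/466 (50.64%), Positives = 308/466 (66.09%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For the U82A1_ARATH hit, 236 identical residues over 466 aligned columns gives 50.64% identity, and 308 positives gives 66.09%, matching the reported values.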

BLAST of CSPI04G21930 vs. Swiss-Prot
Match: U83A1_ARATH (UDP-glycosyltransferase 83A1 OS=Arabidopsis thaliana GN=UGT83A1 PE=2 SV=1)

HSP 1 Score: 203.0 bits (515), Expect = 7.0e-51
Identity = 141/469 (30.06%), Positives = 225/469 (47.97%), Query Frame = 1

Query: 7   KPKVILVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIHCHISSQVSSS-------DG 66
           +P V+++PYPAQGHV P++  +    ++G    F+   + H  I S + +S       D 
Sbjct: 11  RPHVVVIPYPAQGHVLPLISFSRYLAKQGIQITFINTEFNHNRIISSLPNSPHEDYVGDQ 70

Query: 67  IFFVSMSDGLDD-----NMPRDFFTIEAAIETTMPVCLRQVLSEHNSKESSGGTGVVCMV 126
           I  VS+ DGL+D     N+P     +  ++   MP  + +++ E    E+SGGT + C+V
Sbjct: 71  INLVSIPDGLEDSPEERNIPGK---LSESVLRFMPKKVEELI-ERMMAETSGGTIISCVV 130

Query: 127 VDLLASSAIEVGNEFGVTVVGFWPAMFATYKLMSTIPEMIQNNFISSDTGCPEEGSKRCV 186
            D     AIEV  +FG+    F PA  A+  L  +I ++I +  I SD       + +  
Sbjct: 131 ADQSLGWAIEVAAKFGIRRTAFCPAAAASMVLGFSIQKLIDDGLIDSDGTVRVNKTIQLS 190

Query: 187 PSQPLLSAEELPWL-VGTSSAIKGRFKFWKRTMARARSVHCLLVNSFPEELLPLQKLITK 246
           P  P +  ++  W+ +    + K  F+   +      S   LL NS  E      +    
Sbjct: 191 PGMPKMETDKFVWVCLKNKESQKNIFQLMLQNNNSIESTDWLLCNSVHE-----LETAAF 250

Query: 247 SSAASVFLVGPL----SRHSNPAKTPTFWEEDDGCVKWLEKQRPNSVIYISFGSWVSPIN 306
               ++  +GP+    S         +F   D  C+ WL++Q P SVIY++FGS+   + 
Sbjct: 251 GLGPNIVPIGPIGWAHSLEEGSTSLGSFLPHDRDCLDWLDRQIPGSVIYVAFGSF-GVMG 310

Query: 307 ESKVRSLAMTLLGLKNPFIWVLKNNWRDGLPIGFQQKIQSYG---RLVSWAPQIEILKHR 366
             ++  LA+ L   K P +WV           G QQ I+      ++V WAPQ E+L   
Sbjct: 311 NPQLEELAIGLELTKRPVLWVT----------GDQQPIKLGSDRVKVVRWAPQREVLSSG 370

Query: 367 AVGCYLTHCGWNSIMEAIQYGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNG-----FGE 426
           A+GC+++HCGWNS +E  Q G   LC P   DQF+N  Y+  VW+IG+ L          
Sbjct: 371 AIGCFVSHCGWNSTLEGAQNGIPFLCIPYFADQFINKAYICDVWKIGLGLERDARGVVPR 430

Query: 427 KEVEEGMRKVMED-GEMKGRFMKLHERIMGEEANCRVN-SNFTTFINEI 449
            EV++ + ++M D GE + R MK+ E +M   A   ++  N   F+N I
Sbjct: 431 LEVKKKIDEIMRDGGEYEERAMKVKEIVMKSVAKDGISCENLNKFVNWI 459

BLAST of CSPI04G21930 vs. Swiss-Prot
Match: U76C1_ARATH (UDP-glycosyltransferase 76C1 OS=Arabidopsis thaliana GN=UGT76C1 PE=1 SV=1)

HSP 1 Score: 199.9 bits (507), Expect = 6.0e-50
Identity = 122/436 (27.98%), Positives = 211/436 (48.39%), Query Frame = 1

Query: 9   KVILVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIHCHISSQVSSSDGIF-FVSMSD 68
           +VIL P P QG + PML LA + + RGF     + + IH   ++  SS   +F F+ + D
Sbjct: 8   QVILFPLPLQGCINPMLQLAKILYSRGF-----SITIIHTRFNAPKSSDHPLFTFLQIRD 67

Query: 69  GLDDNMP--RDFFTIEAAIETTMPVCLRQVLSEHNSKESSGGTG---VVCMVVDLLASSA 128
           GL ++    RD       +     +  R+ L++     S  GT    + C++ D      
Sbjct: 68  GLSESQTQSRDLLLQLTLLNNNCQIPFRECLAKLIKPSSDSGTEDRKISCVIDDSGWVFT 127

Query: 129 IEVGNEFGVTVVGFWPAMFATYKLMSTIPEMIQNNFISSDTGCPEEGSKRCVPSQPLLSA 188
             V   F +         F+ +     +P++ +  F+      P+  +   VP  P L  
Sbjct: 128 QSVAESFNLPRFVLCAYKFSFFLGHFLVPQIRREGFLP----VPDSEADDLVPEFPPLRK 187

Query: 189 EELPWLVGTSSAIKGRFKFWKRTMARARSVHCLLVNSFPEELLPLQKLITKSSAASVFLV 248
           ++L  ++GTS+  K    +  + +   +    ++V S  E          K  +  +F +
Sbjct: 188 KDLSRIMGTSAQSKPLDAYLLKILDATKPASGIIVMSCKELDHDSLAESNKVFSIPIFPI 247

Query: 249 GPLSRHSNPAKTPTFWEEDDGCVKWLEKQRPNSVIYISFGSWVSPINESKVRSLAMTLLG 308
           GP   H  PA + +  E D  C+ WL+ +   SV+Y+S GS ++ +NES    +A  L  
Sbjct: 248 GPFHIHDVPASSSSLLEPDQSCIPWLDMRETRSVVYVSLGS-IASLNESDFLEIACGLRN 307

Query: 309 LKNPFIWVLK------NNWRDGLPIGFQQKIQSYGRLVSWAPQIEILKHRAVGCYLTHCG 368
               F+WV++       +W + LP GF + +   G++V WAPQ+++L HRA G +LTH G
Sbjct: 308 TNQSFLWVVRPGSVHGRDWIESLPSGFMESLDGKGKIVRWAPQLDVLAHRATGGFLTHNG 367

Query: 369 WNSIMEAIQYGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNG-FGEKEVEEGMRKVMEDG 428
           WNS +E+I  G  ++C P   DQF+N  ++ +VWR+G+ L G    +E+E  + ++M + 
Sbjct: 368 WNSTLESICEGVPMICLPCKWDQFVNARFISEVWRVGIHLEGRIERREIERAVIRLMVES 427

BLAST of CSPI04G21930 vs. Swiss-Prot
Match: U76E2_ARATH (UDP-glycosyltransferase 76E2 OS=Arabidopsis thaliana GN=UGT76E2 PE=2 SV=1)

HSP 1 Score: 196.4 bits (498), Expect = 6.6e-49
Identity = 141/444 (31.76%), Positives = 222/444 (50.00%), Query Frame = 1

Query: 5   LKKPKVILVPYPAQGHVTPMLMLAAVFHRRGF-LPIFLTPSYIHCHISSQVSSSDGIFFV 64
           +K+ +++LVP PAQGHVTPM+ L    H +GF + + LT S     +SS    SD  F  
Sbjct: 6   VKETRIVLVPVPAQGHVTPMMQLGKALHSKGFSITVVLTQSN---RVSSSKDFSDFHFLT 65

Query: 65  ---SMSDGLDDNM-PRDF-FTIEAAIETTMPVCLRQVLSEHNSKESSGGTGVVCMVVDLL 124
              S+++    N+ P+ F   +    E +   C+ Q+L E  + +      + C+V D  
Sbjct: 66  IPGSLTESDLQNLGPQKFVLKLNQICEASFKQCIGQLLHEQCNND------IACVVYDEY 125

Query: 125 ASSAIEVGNEFGVTVVGFWPAMFATYKLMSTIPEMIQNNFISSDTGCPEEGSKRCVPSQP 184
              +     EF +  V F       +   S +  +   +F+  D   PE   K   P   
Sbjct: 126 MYFSHAAVKEFQLPSVVFSTTSATAFVCRSVLSRVNAESFLI-DMKDPETQDK-VFPGLH 185

Query: 185 LLSAEELPWLVGTSSAIKGRFKFWKRTMARARSVHCLLVNSFP----EELLPLQKLITKS 244
            L  ++LP  V     I+   K +  T+   R+   +++NS        L  LQ+ +   
Sbjct: 186 PLRYKDLPTSV--FGPIESTLKVYSETV-NTRTASAVIINSASCLESSSLARLQQQLQ-- 245

Query: 245 SAASVFLVGPLSRHSNPAKTPTFWEEDDGCVKWLEKQRPNSVIYISFGSWVSPINESKVR 304
               V+ +GPL  H   +   +  EED  CV+WL KQ+ NSVIYIS GS ++ ++   + 
Sbjct: 246 --VPVYPIGPL--HITASAPSSLLEEDRSCVEWLNKQKSNSVIYISLGS-LALMDTKDML 305

Query: 305 SLAMTLLGLKNPFIWVLK------NNWRDGLPIGFQQKIQSYGRLVSWAPQIEILKHRAV 364
            +A  L     PF+WV++      + W + LP  F + +   G +V WAPQ+E+L+H AV
Sbjct: 306 EMAWGLSNSNQPFLWVVRPGSIPGSEWTESLPEEFNRLVSERGYIVKWAPQMEVLRHPAV 365

Query: 365 GCYLTHCGWNSIMEAIQYGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNGFGEKE-VEEG 424
           G + +HCGWNS +E+I  G  ++C P  GDQ +N  Y+ +VWRIGV+L G  +KE VE  
Sbjct: 366 GGFWSHCGWNSTVESIGEGVPMICRPFTGDQKVNARYLERVWRIGVQLEGDLDKETVERA 425

Query: 425 MRKVM---EDGEMKGRFMKLHERI 429
           +  ++   E  EM+ R + L E+I
Sbjct: 426 VEWLLVDEEGAEMRKRAIDLKEKI 428

BLAST of CSPI04G21930 vs. Swiss-Prot
Match: U85A1_ARATH (UDP-glycosyltransferase 85A1 OS=Arabidopsis thaliana GN=UGT85A1 PE=2 SV=1)

HSP 1 Score: 194.9 bits (494), Expect = 1.9e-48
Identity = 131/445 (29.44%), Positives = 219/445 (49.21%), Query Frame = 1

Query: 6   KKPKVILVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIHCHISSQVSSS--DGI--- 65
           +KP V+ VPYPAQGH+ PM+ +A + H RGF   F+   Y H        S+  DG+   
Sbjct: 10  QKPHVVCVPYPAQGHINPMMRVAKLLHARGFYVTFVNTVYNHNRFLRSRGSNALDGLPSF 69

Query: 66  FFVSMSDGLDDNMPRDFFTIEAAIETTMPVCL---RQVLSEHNSKESSGGTGVVCMVVDL 125
            F S++DGL +        I A  E+TM  CL   R++L   N+ ++     V C+V D 
Sbjct: 70  RFESIADGLPETDMDATQDITALCESTMKNCLAPFRELLQRINAGDNV--PPVSCIVSDG 129

Query: 126 LASSAIEVGNEFGVTVVGFWPAMFATYKLMSTIPEMIQNNF--ISSDTGCPEEGSKRCV- 185
             S  ++V  E GV  V FW      +         I+     +  ++   +E  +  V 
Sbjct: 130 CMSFTLDVAEELGVPEVLFWTTSGCAFLAYLHFYLFIEKGLCPLKDESYLTKEYLEDTVI 189

Query: 186 ---PSQPLLSAEELPWLVGTSSAIKGRFKFWKRTMARARSVHCLLVNSFPEELLPLQKLI 245
              P+   +  +++P  + T++       F  R   RA+    +++N+F ++L       
Sbjct: 190 DFIPTMKNVKLKDIPSFIRTTNPDDVMISFALRETERAKRASAIILNTF-DDLEHDVVHA 249

Query: 246 TKSSAASVFLVGPLSRHSNPA---------KTPTFWEEDDGCVKWLEKQRPNSVIYISFG 305
            +S    V+ VGPL   +N            +   W+E+  C+ WL+ +  NSVIYI+FG
Sbjct: 250 MQSILPPVYSVGPLHLLANREIEEGSEIGMMSSNLWKEEMECLDWLDTKTQNSVIYINFG 309

Query: 306 SWVSPINESKVRSLAMTLLGLKNPFIWVLKNNWRDG----LPIGFQQKIQSYGRLVSWAP 365
           S ++ ++  ++   A  L G    F+WV++ +   G    +P  F  + +    L SW P
Sbjct: 310 S-ITVLSVKQLVEFAWGLAGSGKEFLWVIRPDLVAGEEAMVPPDFLMETKDRSMLASWCP 369

Query: 366 QIEILKHRAVGCYLTHCGWNSIMEAIQYGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNG 424
           Q ++L H A+G +LTHCGWNSI+E++  G  ++C+P   DQ +NC +    W +G+ + G
Sbjct: 370 QEKVLSHPAIGGFLTHCGWNSILESLSCGVPMVCWPFFADQQMNCKFCCDEWDVGIEIGG 429

BLAST of CSPI04G21930 vs. TrEMBL
Match: A0A0A0L1R0_CUCSA (Glycosyltransferase OS=Cucumis sativus GN=Csa_4G617410 PE=3 SV=1)

HSP 1 Score: 932.6 bits (2409), Expect = 1.9e-268
Identity = 452/453 (99.78%), Positives = 452/453 (99.78%), Query Frame = 1

Query: 1   MKYALKKPKVILVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIHCHISSQVSSSDGI 60
           MKYALKKPKVILVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIHCHISSQVSSSDGI
Sbjct: 1   MKYALKKPKVILVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIHCHISSQVSSSDGI 60

Query: 61  FFVSMSDGLDDNMPRDFFTIEAAIETTMPVCLRQVLSEHNSKESSGGTGVVCMVVDLLAS 120
            FVSMSDGLDDNMPRDFFTIEAAIETTMPVCLRQVLSEHNSKESSGGTGVVCMVVDLLAS
Sbjct: 61  IFVSMSDGLDDNMPRDFFTIEAAIETTMPVCLRQVLSEHNSKESSGGTGVVCMVVDLLAS 120

Query: 121 SAIEVGNEFGVTVVGFWPAMFATYKLMSTIPEMIQNNFISSDTGCPEEGSKRCVPSQPLL 180
           SAIEVGNEFGVTVVGFWPAMFATYKLMSTIPEMIQNNFISSDTGCPEEGSKRCVPSQPLL
Sbjct: 121 SAIEVGNEFGVTVVGFWPAMFATYKLMSTIPEMIQNNFISSDTGCPEEGSKRCVPSQPLL 180

Query: 181 SAEELPWLVGTSSAIKGRFKFWKRTMARARSVHCLLVNSFPEELLPLQKLITKSSAASVF 240
           SAEELPWLVGTSSAIKGRFKFWKRTMARARSVHCLLVNSFPEELLPLQKLITKSSAASVF
Sbjct: 181 SAEELPWLVGTSSAIKGRFKFWKRTMARARSVHCLLVNSFPEELLPLQKLITKSSAASVF 240

Query: 241 LVGPLSRHSNPAKTPTFWEEDDGCVKWLEKQRPNSVIYISFGSWVSPINESKVRSLAMTL 300
           LVGPLSRHSNPAKTPTFWEEDDGCVKWLEKQRPNSVIYISFGSWVSPINESKVRSLAMTL
Sbjct: 241 LVGPLSRHSNPAKTPTFWEEDDGCVKWLEKQRPNSVIYISFGSWVSPINESKVRSLAMTL 300

Query: 301 LGLKNPFIWVLKNNWRDGLPIGFQQKIQSYGRLVSWAPQIEILKHRAVGCYLTHCGWNSI 360
           LGLKNPFIWVLKNNWRDGLPIGFQQKIQSYGRLVSWAPQIEILKHRAVGCYLTHCGWNSI
Sbjct: 301 LGLKNPFIWVLKNNWRDGLPIGFQQKIQSYGRLVSWAPQIEILKHRAVGCYLTHCGWNSI 360

Query: 361 MEAIQYGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNGFGEKEVEEGMRKVMEDGEMKGR 420
           MEAIQYGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNGFGEKEVEEGMRKVMEDGEMKGR
Sbjct: 361 MEAIQYGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNGFGEKEVEEGMRKVMEDGEMKGR 420

Query: 421 FMKLHERIMGEEANCRVNSNFTTFINEINLTNI 454
           FMKLHERIMGEEANCRVNSNFTTFINEINLTNI
Sbjct: 421 FMKLHERIMGEEANCRVNSNFTTFINEINLTNI 453

BLAST of CSPI04G21930 vs. TrEMBL
Match: F6HGE7_VITVI (Glycosyltransferase OS=Vitis vinifera GN=VIT_01s0010g00530 PE=3 SV=1)

HSP 1 Score: 525.8 bits (1353), Expect = 5.3e-146
Identity = 251/453 (55.41%), Positives = 334/453 (73.73%), Query Frame = 1

Query: 1   MKYALKKPKVILVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIHCHISSQVSSSDGI 60
           MKY +K+P ++LVPYPAQGHVTP+L LA+    +GF+P+ +TP +IH  I+ +V + DGI
Sbjct: 1   MKY-MKRPMILLVPYPAQGHVTPLLKLASCLVTQGFMPVMITPEFIHRQIAPRVDAKDGI 60

Query: 61  FFVSMSDGLDDNMPRDFFTIEAAIETTMPVCLRQVLSEHNSKESSGGTGVVCMVVDLLAS 120
             +S+ DG+D+++PRDFFTIE  +E TMPV L +++ + +         VVCMVVDLLAS
Sbjct: 61  LCMSIPDGVDEDLPRDFFTIEMTMENTMPVYLERLIRKLDEDGR-----VVCMVVDLLAS 120

Query: 121 SAIEVGNEFGVTVVGFWPAMFATYKLMSTIPEMIQNNFISSDTGCPEEGSKRC-VPSQPL 180
            AI+V +  GV   GFWPAM ATY L+S IPE+I+   IS +TG PEE  K C +P QP 
Sbjct: 121 WAIKVADHCGVPAAGFWPAMLATYGLISAIPELIRTGLIS-ETGIPEEQRKICFLPCQPE 180

Query: 181 LSAEELPWLVGTSSAIKGRFKFWKRTMARARSVHCLLVNSFPEEL----LPLQKLITKSS 240
           LS E+LPWL+GT +A + RF+FW RT ARA+++  +LVNSFPEE     L  Q + +   
Sbjct: 181 LSTEDLPWLIGTFTAKRARFEFWTRTFARAKTLPWILVNSFPEECSDGKLQNQLIYSPGD 240

Query: 241 AASVFLVGPLSRHSNPAKTPTFWEEDDGCVKWLEKQRPNSVIYISFGSWVSPINESKVRS 300
              +  +GPL RH+   +TP+ WEED  C+ WLE+Q+P +V+YISFGSWVSPI E +VR 
Sbjct: 241 GPRLLQIGPLIRHA-AIRTPSLWEEDFNCLDWLEQQKPCTVVYISFGSWVSPIGEPRVRD 300

Query: 301 LAMTLLGLKNPFIWVLKNNWRDGLPIGFQQKIQSYGRLVSWAPQIEILKHRAVGCYLTHC 360
           LA+ L     PFIWVL+ NWR+GLP+G+ +++   G++VSWAPQ+E+L+H AVGCYLTHC
Sbjct: 301 LALALEASGRPFIWVLRPNWREGLPVGYLERVSKQGKVVSWAPQMELLQHEAVGCYLTHC 360

Query: 361 GWNSIMEAIQYGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNGFGEKEVEEGMRKVMEDG 420
           GWNS +EAIQ  KRLLC+PVAGDQF+NC Y+V VW+IGVR++GFG++++EEGMRKVMED 
Sbjct: 361 GWNSTLEAIQCQKRLLCYPVAGDQFVNCAYIVNVWQIGVRIHGFGQRDLEEGMRKVMEDS 420

Query: 421 EMKGRFMKLHERIMGEEANCRVNSNFTTFINEI 449
           EM  R  KL+ERIMGEEA  RV +N TTF + +
Sbjct: 421 EMNKRLSKLNERIMGEEAGLRVMTNITTFTDNL 445

BLAST of CSPI04G21930 vs. TrEMBL
Match: M5WYR7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa016262mg PE=4 SV=1)

HSP 1 Score: 510.4 bits (1313), Expect = 2.3e-141
Identity = 246/448 (54.91%), Positives = 320/448 (71.43%), Query Frame = 1

Query: 10  VILVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIHCHISSQVSSSDGIFFVSMSDGL 69
           +ILVPYPAQGHVTPML LA+ F   GF  + +TP +IH  I  +V  +D I  + + DGL
Sbjct: 14  IILVPYPAQGHVTPMLKLASAFLSHGFKSVLVTPDHIHNQIVPKVEQNDKILCMPIPDGL 73

Query: 70  DDNMPRDFFTIEAAIETTMPVCLRQVLSEHNSKESSGGTGVVCMVVDLLASSAIEVGNEF 129
           D + PRDFF IE A+E TMP  L  ++  H       G  VVC+V DLLAS AI+V N  
Sbjct: 74  DKDAPRDFFAIEKAMENTMPGHLESLV--HQLDHHQDGDQVVCIVADLLASWAIDVANRC 133

Query: 130 GVTVVGFWPAMFATYKLMSTIPEMIQNNFISSDTGCPEEGSKRCVPSQPLLSAEELPWLV 189
           GV   GFWPAM ATY+L++ IP+M++   IS DTG P++    C+P+QP+LS+E+LPWL+
Sbjct: 134 GVPSAGFWPAMLATYRLITAIPDMVRTGLIS-DTGFPKQLGGVCLPNQPMLSSEDLPWLI 193

Query: 190 GTSSAIKGRFKFWKRTMARARSVHCLLVNSFPEELLP----------LQKLITKSSAASV 249
           GT ++ K RFKFWKRT+ R++++  LLVNSFP E             L K+ T++    V
Sbjct: 194 GTPASRKARFKFWKRTLDRSKTLPWLLVNSFPNEYCTNGEQQLDHHQLVKMNTQAQQPLV 253

Query: 250 FLVGPLSRHSNPAKTPTFWEEDDGCVKWLEKQRPNSVIYISFGSWVSPINESKVRSLAMT 309
           F +GPLS+H+   K P+FWEED  C+ WL+KQ PNSVIYISFGSWVSPI E+KVRSLA+ 
Sbjct: 254 FPIGPLSKHTT-IKNPSFWEEDTSCLTWLDKQNPNSVIYISFGSWVSPIGEAKVRSLALA 313

Query: 310 LLGLKNPFIWVLKNNWRDGLPIGFQQKIQSYGRLVSWAPQIEILKHRAVGCYLTHCGWNS 369
           L  L  PF+WVL ++W  GLP G+ +++   G++VSWAPQ+E+L+H+AVG YL HCGWNS
Sbjct: 314 LEALGKPFLWVLGSSWLGGLPNGYLERVSRQGKVVSWAPQLEVLQHKAVGFYLAHCGWNS 373

Query: 370 IMEAIQYGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNGFGEKEVEEGMRKVMEDGEMKG 429
            MEAIQ  K LLC+PVAGDQF+NC Y+VKVWRIGV+L GFG+K+VEEG++KV ED EM  
Sbjct: 374 TMEAIQCQKPLLCYPVAGDQFVNCAYIVKVWRIGVKLIGFGQKDVEEGLKKVAEDAEMSN 433

Query: 430 RFMKLHERIMGEEANCRVNSNFTTFINE 448
           R  KL+ER MG+EAN R  +N + FI++
Sbjct: 434 RLRKLNERTMGDEANLRAVANLSAFIDD 457

BLAST of CSPI04G21930 vs. TrEMBL
Match: A0A061DMA7_THECC (Glycosyltransferase OS=Theobroma cacao GN=TCM_002013 PE=3 SV=1)

HSP 1 Score: 509.2 bits (1310), Expect = 5.1e-141
Identity = 241/448 (53.79%), Positives = 321/448 (71.65%), Query Frame = 1

Query: 2   KYALKKPKVILVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIHCHISSQVSSSDGIF 61
           K ALK PK+ILVPYPAQGHVTPML L + F  +GF PI +TP +IH  I++ +   D I 
Sbjct: 4   KCALKMPKIILVPYPAQGHVTPMLKLGSAFLGQGFQPIIVTPEFIHHRITANMDPIDEIR 63

Query: 62  FVSMSDGLDDNMPRDFFTIEAAIETTMPVCLRQVLSEHNSKESSGGTGVVCMVVDLLASS 121
           F+S+ DGL +  P DFF IE A+E TMP  L  ++   + +E  G   V C+V+DLLAS 
Sbjct: 64  FLSIPDGLSEEGPHDFFAIEKAMENTMPTHLEGLIHRVDEEEEDGR--VACVVIDLLASW 123

Query: 122 AIEVGNEFGVTVVGFWPAMFATYKLMSTIPEMIQNNFISSD---TGCPE-EGSKRCVPSQ 181
           AI+V     +   GFWP M  TY+L++ IP+M++++ IS      GCP+ +G+  C+P Q
Sbjct: 124 AIQVAYRCRIPAAGFWPNMQITYRLITAIPDMLRSSLISKTGHVAGCPQRQGTVCCLPGQ 183

Query: 182 PLLSAEELPWLVGTSSAIKGRFKFWKRTMARARSVHCLLVNSFPEELLPLQKLITKSSAA 241
           P+LS E+LPWL+GT +A   RFKFW RT+ R+RS+  LLVNSFP E        T     
Sbjct: 184 PMLSTEDLPWLIGTQAARNARFKFWTRTLERSRSLRWLLVNSFPHEFTGDDHNSTDHDNP 243

Query: 242 SVFLVGPLSRHSNPAKTPTFWEEDDGCVKWLEKQRPNSVIYISFGSWVSPINESKVRSLA 301
            VF VGPLS+ +   K P+FWEED  C+ WL+K++PNSV+YISFGSWVSPI ++K+++LA
Sbjct: 244 IVFPVGPLSKPAI-VKNPSFWEEDSSCIDWLDKRKPNSVLYISFGSWVSPIGDAKIKTLA 303

Query: 302 MTLLGLKNPFIWVLKNNWRDGLPIGFQQKIQSYGRLVSWAPQIEILKHRAVGCYLTHCGW 361
           +TL  L+ PFIWVL + WR GLP  + +++   G++VSWAPQ+++L+H+AVG YLTHCGW
Sbjct: 304 LTLEALRRPFIWVLAHAWRQGLPNRYLERVSKQGKVVSWAPQLQVLQHKAVGLYLTHCGW 363

Query: 362 NSIMEAIQYGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNGFGEKEVEEGMRKVMEDGEM 421
           NS +EAIQ  KRLLCFP+AGDQF+NC Y+VKVW+IGV++NGFG+K+VE+ +RKV EDGEM
Sbjct: 364 NSTVEAIQCQKRLLCFPIAGDQFVNCKYIVKVWKIGVKINGFGQKDVEDALRKVTEDGEM 423

Query: 422 KGRFMKLHERIMGEEANCRVNSNFTTFI 446
           K R MKL+ER MGEEA  R  +N   F+
Sbjct: 424 KERLMKLYERTMGEEATSRAVANLKAFL 448

BLAST of CSPI04G21930 vs. TrEMBL
Match: A0A067FLM8_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g011687mg PE=4 SV=1)

HSP 1 Score: 502.3 bits (1292), Expect = 6.2e-139
Identity = 244/455 (53.63%), Positives = 329/455 (72.31%), Query Frame = 1

Query: 6   KKPKVILVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIHCHISSQVSSSDGIFFVSM 65
           KK K+++VPYPAQGHVTPM  LA++   RGF PI +TP +IH  I+S +     I  +S+
Sbjct: 9   KKNKILMVPYPAQGHVTPMHKLASILTSRGFEPIVITPEFIHNQITSSMDPRSEISCMSI 68

Query: 66  SDGLDDNMPRDFFTIEAAIETTMPVCLRQVLSEHNSKESSGGTGVVCMVVDLLASSAIEV 125
            DGL+ N P+DFF IE  IE  MP+ L +++++ N         V C+VVDLLASSAI V
Sbjct: 69  PDGLEKNEPKDFFAIEKVIENIMPIHLERLINKINEDGR-----VACVVVDLLASSAIGV 128

Query: 126 GNEFGVTVVGFWPAMFATYKLMSTIPEMIQNNFISSDTGCPE--EGSKRCVPSQPLLSAE 185
               GV   GFWPAM ATY L+  IPEMI++ +IS DTG P+  E + R +P+QP+LS E
Sbjct: 129 ACRCGVPAAGFWPAMLATYCLIDAIPEMIKSGYIS-DTGSPQHLESTARFLPNQPMLSTE 188

Query: 186 ELPWLVGTSSAIKGRFKFWKRTMARARSVHCLLVNSFPEELLP-LQKLITKSSAAS---- 245
           +LPWL+GT +A K RFKFW RT+ R+R++  LLVNSFPEE +  +++    S  A+    
Sbjct: 189 DLPWLIGTPAARKSRFKFWSRTLERSRNLKWLLVNSFPEEYMDDIKQQYHHSKGATLCRP 248

Query: 246 -VFLVGPLSRHSNPAKTPTFWEEDDGCVKWLEKQRPNSVIYISFGSWVSPINESKVRSLA 305
            V LVGPLS+H+  AK P+ WEED  C+ WL+ Q+PNSVIYISFGSWVSPI E KV++LA
Sbjct: 249 KVLLVGPLSKHATIAKNPSLWEEDKSCIDWLDNQKPNSVIYISFGSWVSPIGEEKVKTLA 308

Query: 306 MTLLGLKNPFIWVLKNNWRDGLPIGFQQKIQS--YGRLVSWAPQIEILKHRAVGCYLTHC 365
           +TL  L  PFIWVL   WR+GLP G+  ++ +   G++V WAPQ+++L+H AVG YLTHC
Sbjct: 309 LTLEALGLPFIWVLGFAWREGLPDGYLDRVSNSRQGKVVPWAPQLKVLQHNAVGFYLTHC 368

Query: 366 GWNSIMEAIQYGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNGFGEKEVEEGMRKVMEDG 425
           GWNS MEAIQ GKRLLC+PVAGDQF+NC Y+VK+W+IG+R+NGFG++++E+G++K+ ED 
Sbjct: 369 GWNSTMEAIQSGKRLLCYPVAGDQFINCAYIVKMWKIGIRVNGFGKRDIEDGLKKLKEDS 428

Query: 426 EMKGRFMKLHERIMGEE-ANCRVNSNFTTFINEIN 450
           EMK R M L+ R MG++ A  RV +N T F+++++
Sbjct: 429 EMKHRLMNLYMRTMGDDGARARVMNNLTGFVDDLS 457

BLAST of CSPI04G21930 vs. TAIR10
Match: AT3G22250.1 (AT3G22250.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 455.3 bits (1170), Expect = 4.4e-128
Identity = 236/466 (50.64%), Positives = 308/466 (66.09%), Query Frame = 1

Query: 6   KKPKVILVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIHCHISSQVSSSDGIFFVSM 65
           +KPK+I +PYPAQGHVTPML LA+ F  RGF P+ +TP  IH  IS+  +   GI F+++
Sbjct: 5   QKPKIIFIPYPAQGHVTPMLHLASAFLSRGFSPVVMTPESIHRRISA-TNEDLGITFLAL 64

Query: 66  SDGLD--DNMPRDFFTIEAAIETTMPVCLRQVLSEHNSKESSGGTGVVCMVVDLLASSAI 125
           SDG D  D  P DFF+IE ++E  MP  L ++L E +         V C+VVDLLAS AI
Sbjct: 65  SDGQDRPDAPPSDFFSIENSMENIMPPQLERLLLEED-------LDVACVVVDLLASWAI 124

Query: 126 EVGNEFGVTVVGFWPAMFATYKLMSTIPEMIQNNFISSDTGCPEEGSKRCV-PSQPLLSA 185
            V +  GV V GFWP MFA Y+L+  IPE+++   +S   GCP +  K  V P QPLLSA
Sbjct: 125 GVADRCGVPVAGFWPVMFAAYRLIQAIPELVRTGLVSQK-GCPRQLEKTIVQPEQPLLSA 184

Query: 186 EELPWLVGTSSAIKGRFKFWKRTMARARSVHCLLVNSFPEELLPLQ--KLITKSSA---- 245
           E+LPWL+GT  A K RFKFW+RT+ R +S+  +L +SF +E   +   K   K S     
Sbjct: 185 EDLPWLIGTPKAQKKRFKFWQRTLERTKSLRWILTSSFKDEYEDVDNHKASYKKSNDLNK 244

Query: 246 ------ASVFLVGPLSRH---SNPAKTPT-FWEEDDGCVKWLEKQRPNSVIYISFGSWVS 305
                   +  +GPL      +N   T T FWEED  C+ WL++Q PNSVIYISFGSWVS
Sbjct: 245 ENNGQNPQILHLGPLHNQEATNNITITKTSFWEEDMSCLGWLQEQNPNSVIYISFGSWVS 304

Query: 306 PINESKVRSLAMTLLGLKNPFIWVLKNNWRDGLPIGFQQKI---QSYGRLVSWAPQIEIL 365
           PI ES +++LA+ L     PF+W L   W++GLP GF  ++   ++ GR+VSWAPQ+E+L
Sbjct: 305 PIGESNIQTLALALEASGRPFLWALNRVWQEGLPPGFVHRVTITKNQGRIVSWAPQLEVL 364

Query: 366 KHRAVGCYLTHCGWNSIMEAIQYGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNGFGEKE 425
           ++ +VGCY+THCGWNS MEA+   +RLLC+PVAGDQF+NC Y+V VW+IGVRL+GFGEKE
Sbjct: 365 RNDSVGCYVTHCGWNSTMEAVASSRRLLCYPVAGDQFVNCKYIVDVWKIGVRLSGFGEKE 424

Query: 426 VEEGMRKVMEDGEMKGRFMKLHERIMGEEANCRVNSNFTTFINEIN 450
           VE+G+RKVMED +M  R  KL +R MG EA      NFT   NE+N
Sbjct: 425 VEDGLRKVMEDQDMGERLRKLRDRAMGNEARLSSEMNFTFLKNELN 461

BLAST of CSPI04G21930 vs. TAIR10
Match: AT3G02100.1 (AT3G02100.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 203.0 bits (515), Expect = 4.0e-52
Identity = 141/469 (30.06%), Positives = 225/469 (47.97%), Query Frame = 1

Query: 7   KPKVILVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIHCHISSQVSSS-------DG 66
           +P V+++PYPAQGHV P++  +    ++G    F+   + H  I S + +S       D 
Sbjct: 11  RPHVVVIPYPAQGHVLPLISFSRYLAKQGIQITFINTEFNHNRIISSLPNSPHEDYVGDQ 70

Query: 67  IFFVSMSDGLDD-----NMPRDFFTIEAAIETTMPVCLRQVLSEHNSKESSGGTGVVCMV 126
           I  VS+ DGL+D     N+P     +  ++   MP  + +++ E    E+SGGT + C+V
Sbjct: 71  INLVSIPDGLEDSPEERNIPGK---LSESVLRFMPKKVEELI-ERMMAETSGGTIISCVV 130

Query: 127 VDLLASSAIEVGNEFGVTVVGFWPAMFATYKLMSTIPEMIQNNFISSDTGCPEEGSKRCV 186
            D     AIEV  +FG+    F PA  A+  L  +I ++I +  I SD       + +  
Sbjct: 131 ADQSLGWAIEVAAKFGIRRTAFCPAAAASMVLGFSIQKLIDDGLIDSDGTVRVNKTIQLS 190

Query: 187 PSQPLLSAEELPWL-VGTSSAIKGRFKFWKRTMARARSVHCLLVNSFPEELLPLQKLITK 246
           P  P +  ++  W+ +    + K  F+   +      S   LL NS  E      +    
Sbjct: 191 PGMPKMETDKFVWVCLKNKESQKNIFQLMLQNNNSIESTDWLLCNSVHE-----LETAAF 250

Query: 247 SSAASVFLVGPL----SRHSNPAKTPTFWEEDDGCVKWLEKQRPNSVIYISFGSWVSPIN 306
               ++  +GP+    S         +F   D  C+ WL++Q P SVIY++FGS+   + 
Sbjct: 251 GLGPNIVPIGPIGWAHSLEEGSTSLGSFLPHDRDCLDWLDRQIPGSVIYVAFGSF-GVMG 310

Query: 307 ESKVRSLAMTLLGLKNPFIWVLKNNWRDGLPIGFQQKIQSYG---RLVSWAPQIEILKHR 366
             ++  LA+ L   K P +WV           G QQ I+      ++V WAPQ E+L   
Sbjct: 311 NPQLEELAIGLELTKRPVLWVT----------GDQQPIKLGSDRVKVVRWAPQREVLSSG 370

Query: 367 AVGCYLTHCGWNSIMEAIQYGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNG-----FGE 426
           A+GC+++HCGWNS +E  Q G   LC P   DQF+N  Y+  VW+IG+ L          
Sbjct: 371 AIGCFVSHCGWNSTLEGAQNGIPFLCIPYFADQFINKAYICDVWKIGLGLERDARGVVPR 430

Query: 427 KEVEEGMRKVMED-GEMKGRFMKLHERIMGEEANCRVN-SNFTTFINEI 449
            EV++ + ++M D GE + R MK+ E +M   A   ++  N   F+N I
Sbjct: 431 LEVKKKIDEIMRDGGEYEERAMKVKEIVMKSVAKDGISCENLNKFVNWI 459

BLAST of CSPI04G21930 vs. TAIR10
Match: AT5G05870.1 (AT5G05870.1 UDP-glucosyl transferase 76C1)

HSP 1 Score: 199.9 bits (507), Expect = 3.4e-51
Identity = 122/436 (27.98%), Positives = 211/436 (48.39%), Query Frame = 1

Query: 9   KVILVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIHCHISSQVSSSDGIF-FVSMSD 68
           +VIL P P QG + PML LA + + RGF     + + IH   ++  SS   +F F+ + D
Sbjct: 8   QVILFPLPLQGCINPMLQLAKILYSRGF-----SITIIHTRFNAPKSSDHPLFTFLQIRD 67

Query: 69  GLDDNMP--RDFFTIEAAIETTMPVCLRQVLSEHNSKESSGGTG---VVCMVVDLLASSA 128
           GL ++    RD       +     +  R+ L++     S  GT    + C++ D      
Sbjct: 68  GLSESQTQSRDLLLQLTLLNNNCQIPFRECLAKLIKPSSDSGTEDRKISCVIDDSGWVFT 127

Query: 129 IEVGNEFGVTVVGFWPAMFATYKLMSTIPEMIQNNFISSDTGCPEEGSKRCVPSQPLLSA 188
             V   F +         F+ +     +P++ +  F+      P+  +   VP  P L  
Sbjct: 128 QSVAESFNLPRFVLCAYKFSFFLGHFLVPQIRREGFLP----VPDSEADDLVPEFPPLRK 187

Query: 189 EELPWLVGTSSAIKGRFKFWKRTMARARSVHCLLVNSFPEELLPLQKLITKSSAASVFLV 248
           ++L  ++GTS+  K    +  + +   +    ++V S  E          K  +  +F +
Sbjct: 188 KDLSRIMGTSAQSKPLDAYLLKILDATKPASGIIVMSCKELDHDSLAESNKVFSIPIFPI 247

Query: 249 GPLSRHSNPAKTPTFWEEDDGCVKWLEKQRPNSVIYISFGSWVSPINESKVRSLAMTLLG 308
           GP   H  PA + +  E D  C+ WL+ +   SV+Y+S GS ++ +NES    +A  L  
Sbjct: 248 GPFHIHDVPASSSSLLEPDQSCIPWLDMRETRSVVYVSLGS-IASLNESDFLEIACGLRN 307

Query: 309 LKNPFIWVLK------NNWRDGLPIGFQQKIQSYGRLVSWAPQIEILKHRAVGCYLTHCG 368
               F+WV++       +W + LP GF + +   G++V WAPQ+++L HRA G +LTH G
Sbjct: 308 TNQSFLWVVRPGSVHGRDWIESLPSGFMESLDGKGKIVRWAPQLDVLAHRATGGFLTHNG 367

Query: 369 WNSIMEAIQYGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNG-FGEKEVEEGMRKVMEDG 428
           WNS +E+I  G  ++C P   DQF+N  ++ +VWR+G+ L G    +E+E  + ++M + 
Sbjct: 368 WNSTLESICEGVPMICLPCKWDQFVNARFISEVWRVGIHLEGRIERREIERAVIRLMVES 427

BLAST of CSPI04G21930 vs. TAIR10
Match: AT5G59590.1 (AT5G59590.1 UDP-glucosyl transferase 76E2)

HSP 1 Score: 196.4 bits (498), Expect = 3.7e-50
Identity = 141/444 (31.76%), Positives = 222/444 (50.00%), Query Frame = 1

Query: 5   LKKPKVILVPYPAQGHVTPMLMLAAVFHRRGF-LPIFLTPSYIHCHISSQVSSSDGIFFV 64
           +K+ +++LVP PAQGHVTPM+ L    H +GF + + LT S     +SS    SD  F  
Sbjct: 6   VKETRIVLVPVPAQGHVTPMMQLGKALHSKGFSITVVLTQSN---RVSSSKDFSDFHFLT 65

Query: 65  ---SMSDGLDDNM-PRDF-FTIEAAIETTMPVCLRQVLSEHNSKESSGGTGVVCMVVDLL 124
              S+++    N+ P+ F   +    E +   C+ Q+L E  + +      + C+V D  
Sbjct: 66  IPGSLTESDLQNLGPQKFVLKLNQICEASFKQCIGQLLHEQCNND------IACVVYDEY 125

Query: 125 ASSAIEVGNEFGVTVVGFWPAMFATYKLMSTIPEMIQNNFISSDTGCPEEGSKRCVPSQP 184
              +     EF +  V F       +   S +  +   +F+  D   PE   K   P   
Sbjct: 126 MYFSHAAVKEFQLPSVVFSTTSATAFVCRSVLSRVNAESFLI-DMKDPETQDK-VFPGLH 185

Query: 185 LLSAEELPWLVGTSSAIKGRFKFWKRTMARARSVHCLLVNSFP----EELLPLQKLITKS 244
            L  ++LP  V     I+   K +  T+   R+   +++NS        L  LQ+ +   
Sbjct: 186 PLRYKDLPTSV--FGPIESTLKVYSETV-NTRTASAVIINSASCLESSSLARLQQQLQ-- 245

Query: 245 SAASVFLVGPLSRHSNPAKTPTFWEEDDGCVKWLEKQRPNSVIYISFGSWVSPINESKVR 304
               V+ +GPL  H   +   +  EED  CV+WL KQ+ NSVIYIS GS ++ ++   + 
Sbjct: 246 --VPVYPIGPL--HITASAPSSLLEEDRSCVEWLNKQKSNSVIYISLGS-LALMDTKDML 305

Query: 305 SLAMTLLGLKNPFIWVLK------NNWRDGLPIGFQQKIQSYGRLVSWAPQIEILKHRAV 364
            +A  L     PF+WV++      + W + LP  F + +   G +V WAPQ+E+L+H AV
Sbjct: 306 EMAWGLSNSNQPFLWVVRPGSIPGSEWTESLPEEFNRLVSERGYIVKWAPQMEVLRHPAV 365

Query: 365 GCYLTHCGWNSIMEAIQYGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNGFGEKE-VEEG 424
           G + +HCGWNS +E+I  G  ++C P  GDQ +N  Y+ +VWRIGV+L G  +KE VE  
Sbjct: 366 GGFWSHCGWNSTVESIGEGVPMICRPFTGDQKVNARYLERVWRIGVQLEGDLDKETVERA 425

Query: 425 MRKVM---EDGEMKGRFMKLHERI 429
           +  ++   E  EM+ R + L E+I
Sbjct: 426 VEWLLVDEEGAEMRKRAIDLKEKI 428
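
The HSP header lines in these reports follow a fixed layout, so the score, E-value, and percentages can be recovered mechanically. The following is a minimal Python sketch under that assumption; `parse_hsp` and the dictionary keys are illustrative names, not part of any BLAST tool, and the regex tolerates either spelling of the positives label.

```python
import re

# Matches e.g. "HSP 1 Score: 196.4 bits (498), Expect = 3.7e-50"
SCORE_RE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
# Matches the identity/positives counts; "Posi?tives" accepts both
# "Positives" and the "Postives" spelling seen in some reports.
IDENT_RE = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")


def parse_hsp(score_line, identity_line):
    """Return score, E-value and recomputed percentages for one HSP header."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("unrecognised HSP header")
    ident, length, pos, pos_len = (int(x) for x in i.groups())
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        # Percentages are recomputed from the counts rather than read
        # from the text in parentheses.
        "identity_pct": round(100 * ident / length, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
    }


hsp = parse_hsp(
    "HSP 1 Score: 196.4 bits (498), Expect = 3.7e-50",
    "Identity = 141/444 (31.76%), Postives = 222/444 (50.00%), Query Frame = 1",
)
print(hsp)
```

Recomputing the percentages from the raw counts gives a cheap consistency check against the values printed in the report.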

BLAST of CSPI04G21930 vs. TAIR10
Match: AT1G22400.1 (AT1G22400.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 194.9 bits (494), Expect = 1.1e-49
Identity = 131/445 (29.44%), Positives = 219/445 (49.21%), Query Frame = 1

Query: 6   KKPKVILVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIHCHISSQVSSS--DGI--- 65
           +KP V+ VPYPAQGH+ PM+ +A + H RGF   F+   Y H        S+  DG+   
Sbjct: 10  QKPHVVCVPYPAQGHINPMMRVAKLLHARGFYVTFVNTVYNHNRFLRSRGSNALDGLPSF 69

Query: 66  FFVSMSDGLDDNMPRDFFTIEAAIETTMPVCL---RQVLSEHNSKESSGGTGVVCMVVDL 125
            F S++DGL +        I A  E+TM  CL   R++L   N+ ++     V C+V D 
Sbjct: 70  RFESIADGLPETDMDATQDITALCESTMKNCLAPFRELLQRINAGDNV--PPVSCIVSDG 129

Query: 126 LASSAIEVGNEFGVTVVGFWPAMFATYKLMSTIPEMIQNNF--ISSDTGCPEEGSKRCV- 185
             S  ++V  E GV  V FW      +         I+     +  ++   +E  +  V 
Sbjct: 130 CMSFTLDVAEELGVPEVLFWTTSGCAFLAYLHFYLFIEKGLCPLKDESYLTKEYLEDTVI 189

Query: 186 ---PSQPLLSAEELPWLVGTSSAIKGRFKFWKRTMARARSVHCLLVNSFPEELLPLQKLI 245
              P+   +  +++P  + T++       F  R   RA+    +++N+F ++L       
Sbjct: 190 DFIPTMKNVKLKDIPSFIRTTNPDDVMISFALRETERAKRASAIILNTF-DDLEHDVVHA 249

Query: 246 TKSSAASVFLVGPLSRHSNPA---------KTPTFWEEDDGCVKWLEKQRPNSVIYISFG 305
            +S    V+ VGPL   +N            +   W+E+  C+ WL+ +  NSVIYI+FG
Sbjct: 250 MQSILPPVYSVGPLHLLANREIEEGSEIGMMSSNLWKEEMECLDWLDTKTQNSVIYINFG 309

Query: 306 SWVSPINESKVRSLAMTLLGLKNPFIWVLKNNWRDG----LPIGFQQKIQSYGRLVSWAP 365
           S ++ ++  ++   A  L G    F+WV++ +   G    +P  F  + +    L SW P
Sbjct: 310 S-ITVLSVKQLVEFAWGLAGSGKEFLWVIRPDLVAGEEAMVPPDFLMETKDRSMLASWCP 369

Query: 366 QIEILKHRAVGCYLTHCGWNSIMEAIQYGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNG 424
           Q ++L H A+G +LTHCGWNSI+E++  G  ++C+P   DQ +NC +    W +G+ + G
Sbjct: 370 QEKVLSHPAIGGFLTHCGWNSILESLSCGVPMVCWPFFADQQMNCKFCCDEWDVGIEIGG 429

BLAST of CSPI04G21930 vs. NCBI nr
Match: gi|449463617|ref|XP_004149528.1| (PREDICTED: UDP-glycosyltransferase 82A1 [Cucumis sativus])

HSP 1 Score: 932.6 bits (2409), Expect = 2.7e-268
Identity = 452/453 (99.78%), Positives = 452/453 (99.78%), Query Frame = 1

Query: 1   MKYALKKPKVILVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIHCHISSQVSSSDGI 60
           MKYALKKPKVILVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIHCHISSQVSSSDGI
Sbjct: 1   MKYALKKPKVILVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIHCHISSQVSSSDGI 60

Query: 61  FFVSMSDGLDDNMPRDFFTIEAAIETTMPVCLRQVLSEHNSKESSGGTGVVCMVVDLLAS 120
            FVSMSDGLDDNMPRDFFTIEAAIETTMPVCLRQVLSEHNSKESSGGTGVVCMVVDLLAS
Sbjct: 61  IFVSMSDGLDDNMPRDFFTIEAAIETTMPVCLRQVLSEHNSKESSGGTGVVCMVVDLLAS 120

Query: 121 SAIEVGNEFGVTVVGFWPAMFATYKLMSTIPEMIQNNFISSDTGCPEEGSKRCVPSQPLL 180
           SAIEVGNEFGVTVVGFWPAMFATYKLMSTIPEMIQNNFISSDTGCPEEGSKRCVPSQPLL
Sbjct: 121 SAIEVGNEFGVTVVGFWPAMFATYKLMSTIPEMIQNNFISSDTGCPEEGSKRCVPSQPLL 180

Query: 181 SAEELPWLVGTSSAIKGRFKFWKRTMARARSVHCLLVNSFPEELLPLQKLITKSSAASVF 240
           SAEELPWLVGTSSAIKGRFKFWKRTMARARSVHCLLVNSFPEELLPLQKLITKSSAASVF
Sbjct: 181 SAEELPWLVGTSSAIKGRFKFWKRTMARARSVHCLLVNSFPEELLPLQKLITKSSAASVF 240

Query: 241 LVGPLSRHSNPAKTPTFWEEDDGCVKWLEKQRPNSVIYISFGSWVSPINESKVRSLAMTL 300
           LVGPLSRHSNPAKTPTFWEEDDGCVKWLEKQRPNSVIYISFGSWVSPINESKVRSLAMTL
Sbjct: 241 LVGPLSRHSNPAKTPTFWEEDDGCVKWLEKQRPNSVIYISFGSWVSPINESKVRSLAMTL 300

Query: 301 LGLKNPFIWVLKNNWRDGLPIGFQQKIQSYGRLVSWAPQIEILKHRAVGCYLTHCGWNSI 360
           LGLKNPFIWVLKNNWRDGLPIGFQQKIQSYGRLVSWAPQIEILKHRAVGCYLTHCGWNSI
Sbjct: 301 LGLKNPFIWVLKNNWRDGLPIGFQQKIQSYGRLVSWAPQIEILKHRAVGCYLTHCGWNSI 360

Query: 361 MEAIQYGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNGFGEKEVEEGMRKVMEDGEMKGR 420
           MEAIQYGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNGFGEKEVEEGMRKVMEDGEMKGR
Sbjct: 361 MEAIQYGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNGFGEKEVEEGMRKVMEDGEMKGR 420

Query: 421 FMKLHERIMGEEANCRVNSNFTTFINEINLTNI 454
           FMKLHERIMGEEANCRVNSNFTTFINEINLTNI
Sbjct: 421 FMKLHERIMGEEANCRVNSNFTTFINEINLTNI 453
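
An alignment with 452 of 453 identical residues differs at a single position, which can be located by walking the aligned Query and Sbjct rows together. This is a minimal Python sketch; `mismatches` is a hypothetical helper, it reports positions in query coordinates, and it assumes `-` is the gap character.

```python
def mismatches(query, sbjct, query_start):
    """List (query_position, query_residue, subject_residue) differences.

    Columns where the query has a gap are skipped, because they have no
    query coordinate; a gap in the subject is reported as "-".
    """
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have equal length")
    found = []
    pos = query_start
    for q, s in zip(query, sbjct):
        if q == "-":
            continue  # insertion relative to the query: no position to report
        if q != s:
            found.append((pos, q, s))
        pos += 1
    return found


# Second block of the Cucumis sativus hit above (query residues 61..120).
query = "FFVSMSDGLDDNMPRDFFTIEAAIETTMPVCLRQVLSEHNSKESSGGTGVVCMVVDLLAS"
sbjct = "IFVSMSDGLDDNMPRDFFTIEAAIETTMPVCLRQVLSEHNSKESSGGTGVVCMVVDLLAS"
print(mismatches(query, sbjct, 61))
```

Applied block by block, the same helper can be used on the lower-identity hits, where gaps appear in either row.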

BLAST of CSPI04G21930 vs. NCBI nr
Match: gi|659129416|ref|XP_008464676.1| (PREDICTED: UDP-glycosyltransferase 82A1 [Cucumis melo])

HSP 1 Score: 846.3 bits (2185), Expect = 2.5e-242
Identity = 410/452 (90.71%), Positives = 426/452 (94.25%), Query Frame = 1

Query: 1   MKYALKKPKVILVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIHCHISSQVSSSDGI 60
           MKY  KKPKV+LVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIH HISSQ+SSSD I
Sbjct: 1   MKYTSKKPKVLLVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIHRHISSQISSSDEI 60

Query: 61  FFVSMSDGLDDNMPRDFFTIEAAIETTMPVCLRQVLSEHNSKESSGGTG-VVCMVVDLLA 120
            FVSMSDGLDDNMPRDFFT+EA +ETTMP+ LRQVLSEHNSKESS  +G VVCMVVDLLA
Sbjct: 61  LFVSMSDGLDDNMPRDFFTVEAVMETTMPIYLRQVLSEHNSKESSDSSGSVVCMVVDLLA 120

Query: 121 SSAIEVGNEFGVTVVGFWPAMFATYKLMSTIPEMIQNNFISSDTGCPEEGSKRCVPSQPL 180
           SSAIEVG EFGVTVVGFWPAM ATYKL+STIPEM+Q+N ISSDTGCPEEGSKRCVPSQPL
Sbjct: 121 SSAIEVGKEFGVTVVGFWPAMLATYKLISTIPEMVQSNLISSDTGCPEEGSKRCVPSQPL 180

Query: 181 LSAEELPWLVGTSSAIKGRFKFWKRTMARARSVHCLLVNSFPEELLPLQKLITKSSAASV 240
           LS EELPWL+GT SA KGRFKFWKRTMARA+SV CLLVNSFPEELLPLQK   KSSAASV
Sbjct: 181 LSTEELPWLIGTPSARKGRFKFWKRTMARAKSVQCLLVNSFPEELLPLQKPTPKSSAASV 240

Query: 241 FLVGPLSRHSNPAKTPTFWEEDDGCVKWLEKQRPNSVIYISFGSWVSPINESKVRSLAMT 300
           FLVGPL++HSNPAKTPTFWEEDDGCVKWLEKQRPNSVIYISFGSWVSPINESKVRSLAM 
Sbjct: 241 FLVGPLTQHSNPAKTPTFWEEDDGCVKWLEKQRPNSVIYISFGSWVSPINESKVRSLAMA 300

Query: 301 LLGLKNPFIWVLKNNWRDGLPIGFQQKIQSYGRLVSWAPQIEILKHRAVGCYLTHCGWNS 360
           LLGLKNPFIWVLKNNWRDGLPIGFQQK QSYGRLVSWAPQIEILKH+AVGCYLTHCGWNS
Sbjct: 301 LLGLKNPFIWVLKNNWRDGLPIGFQQKSQSYGRLVSWAPQIEILKHKAVGCYLTHCGWNS 360

Query: 361 IMEAIQYGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNGFGEKEVEEGMRKVMEDGEMKG 420
           IMEAIQ GKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNGFGEKEVEEGM+KVMEDGEMKG
Sbjct: 361 IMEAIQCGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNGFGEKEVEEGMKKVMEDGEMKG 420

Query: 421 RFMKLHERIMGEEANCRVNSNFTTFINEINLT 452
           R MKLHERIMGEEAN RVNSNFT FINEINL+
Sbjct: 421 RLMKLHERIMGEEANYRVNSNFTAFINEINLS 452

BLAST of CSPI04G21930 vs. NCBI nr
Match: gi|225424981|ref|XP_002266304.1| (PREDICTED: UDP-glycosyltransferase 82A1 [Vitis vinifera])

HSP 1 Score: 525.8 bits (1353), Expect = 7.6e-146
Identity = 251/453 (55.41%), Positives = 334/453 (73.73%), Query Frame = 1

Query: 1   MKYALKKPKVILVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIHCHISSQVSSSDGI 60
           MKY +K+P ++LVPYPAQGHVTP+L LA+    +GF+P+ +TP +IH  I+ +V + DGI
Sbjct: 1   MKY-MKRPMILLVPYPAQGHVTPLLKLASCLVTQGFMPVMITPEFIHRQIAPRVDAKDGI 60

Query: 61  FFVSMSDGLDDNMPRDFFTIEAAIETTMPVCLRQVLSEHNSKESSGGTGVVCMVVDLLAS 120
             +S+ DG+D+++PRDFFTIE  +E TMPV L +++ + +         VVCMVVDLLAS
Sbjct: 61  LCMSIPDGVDEDLPRDFFTIEMTMENTMPVYLERLIRKLDEDGR-----VVCMVVDLLAS 120

Query: 121 SAIEVGNEFGVTVVGFWPAMFATYKLMSTIPEMIQNNFISSDTGCPEEGSKRC-VPSQPL 180
            AI+V +  GV   GFWPAM ATY L+S IPE+I+   IS +TG PEE  K C +P QP 
Sbjct: 121 WAIKVADHCGVPAAGFWPAMLATYGLISAIPELIRTGLIS-ETGIPEEQRKICFLPCQPE 180

Query: 181 LSAEELPWLVGTSSAIKGRFKFWKRTMARARSVHCLLVNSFPEEL----LPLQKLITKSS 240
           LS E+LPWL+GT +A + RF+FW RT ARA+++  +LVNSFPEE     L  Q + +   
Sbjct: 181 LSTEDLPWLIGTFTAKRARFEFWTRTFARAKTLPWILVNSFPEECSDGKLQNQLIYSPGD 240

Query: 241 AASVFLVGPLSRHSNPAKTPTFWEEDDGCVKWLEKQRPNSVIYISFGSWVSPINESKVRS 300
              +  +GPL RH+   +TP+ WEED  C+ WLE+Q+P +V+YISFGSWVSPI E +VR 
Sbjct: 241 GPRLLQIGPLIRHA-AIRTPSLWEEDFNCLDWLEQQKPCTVVYISFGSWVSPIGEPRVRD 300

Query: 301 LAMTLLGLKNPFIWVLKNNWRDGLPIGFQQKIQSYGRLVSWAPQIEILKHRAVGCYLTHC 360
           LA+ L     PFIWVL+ NWR+GLP+G+ +++   G++VSWAPQ+E+L+H AVGCYLTHC
Sbjct: 301 LALALEASGRPFIWVLRPNWREGLPVGYLERVSKQGKVVSWAPQMELLQHEAVGCYLTHC 360

Query: 361 GWNSIMEAIQYGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNGFGEKEVEEGMRKVMEDG 420
           GWNS +EAIQ  KRLLC+PVAGDQF+NC Y+V VW+IGVR++GFG++++EEGMRKVMED 
Sbjct: 361 GWNSTLEAIQCQKRLLCYPVAGDQFVNCAYIVNVWQIGVRIHGFGQRDLEEGMRKVMEDS 420

Query: 421 EMKGRFMKLHERIMGEEANCRVNSNFTTFINEI 449
           EM  R  KL+ERIMGEEA  RV +N TTF + +
Sbjct: 421 EMNKRLSKLNERIMGEEAGLRVMTNITTFTDNL 445

BLAST of CSPI04G21930 vs. NCBI nr
Match: gi|694406288|ref|XP_009377960.1| (PREDICTED: UDP-glycosyltransferase 82A1-like [Pyrus x bretschneideri])

HSP 1 Score: 519.2 bits (1336), Expect = 7.1e-144
Identity = 245/450 (54.44%), Positives = 327/450 (72.67%), Query Frame = 1

Query: 10  VILVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIHCHISSQVSSSDGIFFVSMSDGL 69
           +ILVPYPAQGHVTPML LA+ F  +GF P+ +TP YIH  I  +V   + I  + +SDGL
Sbjct: 14  IILVPYPAQGHVTPMLKLASAFLTQGFKPVMVTPDYIHHQIVRKVEPKEKILCMPISDGL 73

Query: 70  DDNMPRDFFTIEAAIETTMPVCLRQVLSEHNSKESSGGTGVVCMVVDLLASSAIEVGNEF 129
           D + PRDFF +E A+E  MP  L  ++ + +      G  VVC+VVDLLAS AI+V N  
Sbjct: 74  DKDTPRDFFAVEKAMEDNMPSHLESLVHQLDKD----GDEVVCVVVDLLASWAIDVANRC 133

Query: 130 GVTVVGFWPAMFATYKLMSTIPEMIQNNFISSDTGCPEEGSKRCVPSQPLLSAEELPWLV 189
           GV   GFWPAM ATY+L++ IP+MI+   I +DTG P++    C+P+ P+L  EELPWL+
Sbjct: 134 GVACAGFWPAMHATYRLITAIPDMIRTGLICADTGFPKQLGGICLPNLPVLFTEELPWLI 193

Query: 190 GTSSAIKGRFKFWKRTMARARSVHCLLVNSFPEE--LLPLQKLI-------TKSSAASVF 249
           GT +A KGRFKFW RT+ R++++  +LVNSFP E  +   Q+L+       TK+    VF
Sbjct: 194 GTPAARKGRFKFWTRTLERSKTLQRILVNSFPNEYSINDEQQLLGDQLVKSTKTQQPLVF 253

Query: 250 LVGPLSRHSNPAKTPTFWEEDDGCVKWLEKQRPNSVIYISFGSWVSPINESKVRSLAMTL 309
            +GPLS+H+   K P+FWEED  C+ WL+KQ PN+V+YISFGSWVSPI E KVRSLA+ L
Sbjct: 254 PIGPLSKHTT-TKNPSFWEEDTSCLNWLDKQNPNTVVYISFGSWVSPIGEGKVRSLALAL 313

Query: 310 LGLKNPFIWVLKNNWRDGLPIGFQQKIQSYGRLVSWAPQIEILKHRAVGCYLTHCGWNSI 369
             L+ PF+WVL ++W  GLPIG+ +++   GR+VSWAPQ+++L+H+AVGCYLTHCGWNS 
Sbjct: 314 EALRKPFLWVLGSSWLGGLPIGYLERVAKQGRVVSWAPQMDVLQHKAVGCYLTHCGWNST 373

Query: 370 MEAIQYGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNGFGEKEVEEGMRKVMEDGEMKGR 429
           MEAIQ  K LLC+PVAGDQF+NC Y+V VWRIGV+L+GFG+++VEEG+R+VME+ EM  R
Sbjct: 374 MEAIQCEKPLLCYPVAGDQFVNCSYIVNVWRIGVKLSGFGQRDVEEGLRRVMEEDEMSNR 433

Query: 430 FMKLHERIMGEEANCRVNSNFTTFINEINL 451
             KL+ER MG++AN RV SN   F +++ +
Sbjct: 434 MRKLNERSMGDDANLRVVSNLIAFTDQVKV 458

BLAST of CSPI04G21930 vs. NCBI nr
Match: gi|657973141|ref|XP_008378366.1| (PREDICTED: UDP-glycosyltransferase 82A1-like [Malus domestica])

HSP 1 Score: 518.1 bits (1333), Expect = 1.6e-143
Identity = 247/457 (54.05%), Positives = 329/457 (71.99%), Query Frame = 1

Query: 1   MKYALKKPKVILVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIHCHISSQVSSSDGI 60
           +K +   P +ILVPYPAQGHVTPM  LA+ F  +GF P+ +TP YIH  I  +V   D I
Sbjct: 4   IKRSNSNPIIILVPYPAQGHVTPMFKLASAFLSQGFKPVMVTPDYIHHQIVRKVEPKDKI 63

Query: 61  FFVSMSDGLDDNMPRDFFTIEAAIETTMPVCLRQVLSEHNSKESSGGTGVVCMVVDLLAS 120
             + + DGLD + PRDFF IE A+E  M   L +++ + + K+   G  VVC+VVDLLAS
Sbjct: 64  LCMPIPDGLDKDTPRDFFAIEKAMENNMANPLERLIHQLDDKD---GDEVVCVVVDLLAS 123

Query: 121 SAIEVGNEFGVTVVGFWPAMFATYKLMSTIPEMIQNNFISSDTGCPEEGSKRCVPSQPLL 180
            AI+V N  GV   GFWPAM ATY+L++ IP+M++   IS+DTG P++ S  C+P+QP+L
Sbjct: 124 WAIDVANRCGVACAGFWPAMHATYRLITAIPDMLRTGLISADTGFPKQLSGICLPNQPVL 183

Query: 181 SAEELPWLVGTSSAIKGRFKFWKRTMARARSVHCLLVNSFPEELL-------PLQKLITK 240
           S EELPWL+GT +A K RF+FW RT+ R++++  +LV+SFP E          L   + K
Sbjct: 184 STEELPWLIGTPAARKARFRFWTRTLERSKTLQWILVHSFPNEYTISDEQHQQLGDQLFK 243

Query: 241 SSAAS---VFLVGPLSRHSNPAKTPTFWEEDDGCVKWLEKQRPNSVIYISFGSWVSPINE 300
           S+      VF +GPLS+H+   K P+FWEED  C+ WL+KQ PN+V YISFGSWVSPI E
Sbjct: 244 STTTQQPLVFPIGPLSKHTT-TKNPSFWEEDTSCLNWLDKQNPNTVAYISFGSWVSPIGE 303

Query: 301 SKVRSLAMTLLGLKNPFIWVLKNNWRDGLPIGFQQKIQSYGRLVSWAPQIEILKHRAVGC 360
           +KVRSLA+ L  L  PF+WVL ++W  GLPIG+ +++   G++VSWAPQ+++L+H+AVGC
Sbjct: 304 AKVRSLALALEALGKPFLWVLGSSWLGGLPIGYLERVAKQGKVVSWAPQMDVLQHKAVGC 363

Query: 361 YLTHCGWNSIMEAIQYGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNGFGEKEVEEGMRK 420
           YLTHCGWNS MEAIQ  K LLC+PVAGDQF+NC Y+VKVWRIGVRL+GFG+++VEEG+R+
Sbjct: 364 YLTHCGWNSTMEAIQCQKPLLCYPVAGDQFVNCAYIVKVWRIGVRLSGFGQRDVEEGLRR 423

Query: 421 VMEDGEMKGRFMKLHERIMGEEANCRVNSNFTTFINE 448
           +ME+ EM  R  KL+ER MG+EAN R  SN T F ++
Sbjct: 424 MMEEDEMSKRMRKLNERTMGDEANLRAVSNLTAFTDQ 456

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
U82A1_ARATH  7.9e-127  50.64     UDP-glycosyltransferase 82A1 OS=Arabidopsis thaliana GN=UGT82A1 PE=2 SV=1
U83A1_ARATH  7.0e-51   30.06     UDP-glycosyltransferase 83A1 OS=Arabidopsis thaliana GN=UGT83A1 PE=2 SV=1
U76C1_ARATH  6.0e-50   27.98     UDP-glycosyltransferase 76C1 OS=Arabidopsis thaliana GN=UGT76C1 PE=1 SV=1
U76E2_ARATH  6.6e-49   31.76     UDP-glycosyltransferase 76E2 OS=Arabidopsis thaliana GN=UGT76E2 PE=2 SV=1
U85A1_ARATH  1.9e-48   29.44     UDP-glycosyltransferase 85A1 OS=Arabidopsis thaliana GN=UGT85A1 PE=2 SV=1

Match Name        E-value   Identity  Description
A0A0A0L1R0_CUCSA  1.9e-268  99.78     Glycosyltransferase OS=Cucumis sativus GN=Csa_4G617410 PE=3 SV=1
F6HGE7_VITVI      5.3e-146  55.41     Glycosyltransferase OS=Vitis vinifera GN=VIT_01s0010g00530 PE=3 SV=1
M5WYR7_PRUPE      2.3e-141  54.91     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa016262mg PE=4 SV=1
A0A061DMA7_THECC  5.1e-141  53.79     Glycosyltransferase OS=Theobroma cacao GN=TCM_002013 PE=3 SV=1
A0A067FLM8_CITSI  6.2e-139  53.63     Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g011687mg PE=4 SV=1

Match Name   E-value   Identity  Description
AT3G22250.1  4.4e-128  50.64     UDP-Glycosyltransferase superfamily protein
AT3G02100.1  4.0e-52   30.06     UDP-Glycosyltransferase superfamily protein
AT5G05870.1  3.4e-51   27.98     UDP-glucosyl transferase 76C1
AT5G59590.1  3.7e-50   31.76     UDP-glucosyl transferase 76E2
AT1G22400.1  1.1e-49   29.44     UDP-Glycosyltransferase superfamily protein

Match Name                        E-value   Identity  Description
gi|449463617|ref|XP_004149528.1|  2.7e-268  99.78     PREDICTED: UDP-glycosyltransferase 82A1 [Cucumis sativus]
gi|659129416|ref|XP_008464676.1|  2.5e-242  90.71     PREDICTED: UDP-glycosyltransferase 82A1 [Cucumis melo]
gi|225424981|ref|XP_002266304.1|  7.6e-146  55.41     PREDICTED: UDP-glycosyltransferase 82A1 [Vitis vinifera]
gi|694406288|ref|XP_009377960.1|  7.1e-144  54.44     PREDICTED: UDP-glycosyltransferase 82A1-like [Pyrus x bretschneideri]
gi|657973141|ref|XP_008378366.1|  1.6e-143  54.05     PREDICTED: UDP-glycosyltransferase 82A1-like [Malus domestica]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002213  UDP_glucos_trans
Vocabulary: Molecular Function
Term        Definition
GO:0016758  transferase activity, transferring hexosyl groups
Vocabulary: Biological Process
Term        Definition
GO:0008152  metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008152      metabolic process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0043231      intracellular membrane-bounded organelle
molecular_function  GO:0016758      transferase activity, transferring hexosyl groups
molecular_function  GO:0080043      quercetin 3-O-glucosyltransferase activity
molecular_function  GO:0080044      quercetin 7-O-glucosyltransferase activity

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI04G21930.1  CSPI04G21930.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                           Source   Source Term         Source Description                              Alignment
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PANTHER  PTHR11926           GLUCOSYL/GLUCURONOSYL TRANSFERASES              coord: 5..448 score: 1.0E
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PFAM     PF00201             UDPGT                                           coord: 274..425 score: 3.4
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PROSITE  PS00375             UDPGT                                           coord: 336..379 scor
None       No IPR available                          GENE3D   G3DSA:3.40.50.2000                                                  coord: 258..428 score: 2.3
None       No IPR available                          PANTHER  PTHR11926:SF149     UDP-GLYCOSYLTRANSFERASE 82A1                    coord: 5..448 score: 1.0E
None       No IPR available                          unknown  SSF53756            UDP-Glycosyltransferase/glycogen phosphorylase  coord: 5..429 score: 4.28
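
The `coord:` fields in the InterPro annotations give inclusive residue ranges on the protein, so domain lengths and nesting can be checked with a few lines of code. This is a minimal Python sketch; `parse_coord`, `span_length`, and `contains` are hypothetical helper names, and the `coord: start..end` layout is assumed from the rows above.

```python
import re

COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Extract (start, end) from a string such as 'coord: 274..425'."""
    m = COORD_RE.search(text)
    if m is None:
        raise ValueError("no coord field found")
    return int(m.group(1)), int(m.group(2))


def span_length(span):
    """Number of residues in an inclusive (start, end) range."""
    start, end = span
    return end - start + 1


def contains(outer, inner):
    """True if the inner range lies entirely within the outer range."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


panther = parse_coord("coord: 5..448")    # PTHR11926
pfam = parse_coord("coord: 274..425")     # PF00201
prosite = parse_coord("coord: 336..379")  # PS00375

print(span_length(pfam))
print(contains(panther, pfam), contains(pfam, prosite))
```

With the coordinates listed here, the PROSITE signature falls inside the PFAM domain, which in turn falls inside the PANTHER family match.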