CmoCh03G008050 (gene) Cucurbita moschata (Rifu)

NameCmoCh03G008050
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionUDP-Glycosyltransferase superfamily protein
LocationCmo_Chr03 : 6526951 .. 6528378 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTGTCCCAGAACCCAAAACCCACGTGGCATTGCTGGTCAGCCCCGGAATGGGTCATCTAATTCCCTTCCTCGAGCTGGCCGACCGTCTGGTCCTCCACCACAACCTCCAAACCACCCTCTTTGTCGTTAACACTGACTCCTCCTCCGCCGCCGATGCCCTTCTCCTCCAAAAATCCTCCGCCGTCAACATCGTCCCTATTCGTCACTCCTCACCCAGCCTAGACCCAACCGTTCCCGTCGTCGACAGAATCAAGGCCATGATGACTGCCTCTTACCCCCACGTTCGGTCCTACATCGCCGCCGTTGAGCCCCGTCCGGTGGCGCTGATCGTTGACCTTTTCGGAACTGAAGCTATAGCCATGGCCCACGAACTTGGCATGTTGGGCTTTGTTTTCATGCCCACCAGTGCATGGTTCCTCTCGCTTTTTTTCTTTTTCCCATGTATGGACAAACAAATGCTTGACGCCCACGCGTACAACCACGAACCGCTGAGAATCCCGGGTTGCAGCTCGGTCCGGTTCGAGGACACTATCGATGCATTGGAGGTGAGTCTGGAGGAAGTCCATGAGGAATTACGACGTGTCGCACGGGAGCTTGGAATGGCGGATGGGATCTTGGCAAACACGTGGCAGGATCTGGAGCCCACAACGCTAAAAGCATTGAACGAAGCTGGGACCCTCAGTAATGGGAAAGTCAATAAGGTACCGATTTATCCAATTGGGCCGTTGGCAAGACGTGGCGAGCCCAATTTGGAGAGCGAGGTGTTGAGTTGGCTCGACCGGCAACCGGATGAGTCGGTGATACACATCTCGTTTGGGAGTGGAGGGACGTTATGTGCGGAGCAAATCACAGAATTGGCATGGGGGTTGGAGCTGAGTCAGCAGCGGTTTGTTTGGGTGATACGCCCACCAGCAGGGACAGATCCCATGGGAGCATTTTTCACGGTGGAAACGGAATCGACGGAGAAGACGCCGACGGAATACCTGCCGGAAGGGTTTTTAAAAAGGACGGAAGAGGTGGGTTTGGTGGTTCCGATGTGGGGTCCACAAGCGGAGATTTTGAGGCATAGATCGGTGAGGGGATTTGTGACGCACTGTGGGTGGAATTCGTCGATAGAAAGCATGGTGAATGGAGTGGCGATGGTGACGTGGCCTCTGTACGCGGAGCAGAAGATGAACGCGGTGATGCTGACGGAGGAGGTGGGGGTGGCAGTTAGGGGACGGGCGGAGGGGGTGGTGGGGAGGGCGGAGATAGAGAGGTGGGTGAGGCGGATAATGGTGGATGAAGAAGGGCGGGGAATCAGAGAGAGGGTTAAAGCCGTTAAAAATAGTGGGGAAAAGGCGGTGTGTAAGGGTGGGTCGTCGTACAATTCGTTGGCTCACGTGGCATCCGAATGCCATCGTCGGAGAGCGGAGGGTGTGTAG

mRNA sequence

ATGGCTGTCCCAGAACCCAAAACCCACGTGGCATTGCTGGTCAGCCCCGGAATGGGTCATCTAATTCCCTTCCTCGAGCTGGCCGACCGTCTGGTCCTCCACCACAACCTCCAAACCACCCTCTTTGTCGTTAACACTGACTCCTCCTCCGCCGCCGATGCCCTTCTCCTCCAAAAATCCTCCGCCGTCAACATCGTCCCTATTCGTCACTCCTCACCCAGCCTAGACCCAACCGTTCCCGTCGTCGACAGAATCAAGGCCATGATGACTGCCTCTTACCCCCACGTTCGGTCCTACATCGCCGCCGTTGAGCCCCGTCCGGTGGCGCTGATCGTTGACCTTTTCGGAACTGAAGCTATAGCCATGGCCCACGAACTTGGCATGTTGGGCTTTGTTTTCATGCCCACCAGTGCATGGTTCCTCTCGCTTTTTTTCTTTTTCCCATGTATGGACAAACAAATGCTTGACGCCCACGCGTACAACCACGAACCGCTGAGAATCCCGGGTTGCAGCTCGGTCCGGTTCGAGGACACTATCGATGCATTGGAGGTGAGTCTGGAGGAAGTCCATGAGGAATTACGACGTGTCGCACGGGAGCTTGGAATGGCGGATGGGATCTTGGCAAACACGTGGCAGGATCTGGAGCCCACAACGCTAAAAGCATTGAACGAAGCTGGGACCCTCAGTAATGGGAAAGTCAATAAGGTACCGATTTATCCAATTGGGCCGTTGGCAAGACGTGGCGAGCCCAATTTGGAGAGCGAGGTGTTGAGTTGGCTCGACCGGCAACCGGATGAGTCGGTGATACACATCTCGTTTGGGAGTGGAGGGACGTTATGTGCGGAGCAAATCACAGAATTGGCATGGGGGTTGGAGCTGAGTCAGCAGCGGTTTGTTTGGGTGATACGCCCACCAGCAGGGACAGATCCCATGGGAGCATTTTTCACGGTGGAAACGGAATCGACGGAGAAGACGCCGACGGAATACCTGCCGGAAGGGTTTTTAAAAAGGACGGAAGAGGTGGGTTTGGTGGTTCCGATGTGGGGTCCACAAGCGGAGATTTTGAGGCATAGATCGGTGAGGGGATTTGTGACGCACTGTGGGTGGAATTCGTCGATAGAAAGCATGGTGAATGGAGTGGCGATGGTGACGTGGCCTCTGTACGCGGAGCAGAAGATGAACGCGGTGATGCTGACGGAGGAGGTGGGGGTGGCAGTTAGGGGACGGGCGGAGGGGGTGGTGGGGAGGGCGGAGATAGAGAGGTGGGTGAGGCGGATAATGGTGGATGAAGAAGGGCGGGGAATCAGAGAGAGGGTTAAAGCCGTTAAAAATAGTGGGGAAAAGGCGGTGTGTAAGGGTGGGTCGTCGTACAATTCGTTGGCTCACGTGGCATCCGAATGCCATCGTCGGAGAGCGGAGGGTGTGTAG

Coding sequence (CDS)

ATGGCTGTCCCAGAACCCAAAACCCACGTGGCATTGCTGGTCAGCCCCGGAATGGGTCATCTAATTCCCTTCCTCGAGCTGGCCGACCGTCTGGTCCTCCACCACAACCTCCAAACCACCCTCTTTGTCGTTAACACTGACTCCTCCTCCGCCGCCGATGCCCTTCTCCTCCAAAAATCCTCCGCCGTCAACATCGTCCCTATTCGTCACTCCTCACCCAGCCTAGACCCAACCGTTCCCGTCGTCGACAGAATCAAGGCCATGATGACTGCCTCTTACCCCCACGTTCGGTCCTACATCGCCGCCGTTGAGCCCCGTCCGGTGGCGCTGATCGTTGACCTTTTCGGAACTGAAGCTATAGCCATGGCCCACGAACTTGGCATGTTGGGCTTTGTTTTCATGCCCACCAGTGCATGGTTCCTCTCGCTTTTTTTCTTTTTCCCATGTATGGACAAACAAATGCTTGACGCCCACGCGTACAACCACGAACCGCTGAGAATCCCGGGTTGCAGCTCGGTCCGGTTCGAGGACACTATCGATGCATTGGAGGTGAGTCTGGAGGAAGTCCATGAGGAATTACGACGTGTCGCACGGGAGCTTGGAATGGCGGATGGGATCTTGGCAAACACGTGGCAGGATCTGGAGCCCACAACGCTAAAAGCATTGAACGAAGCTGGGACCCTCAGTAATGGGAAAGTCAATAAGGTACCGATTTATCCAATTGGGCCGTTGGCAAGACGTGGCGAGCCCAATTTGGAGAGCGAGGTGTTGAGTTGGCTCGACCGGCAACCGGATGAGTCGGTGATACACATCTCGTTTGGGAGTGGAGGGACGTTATGTGCGGAGCAAATCACAGAATTGGCATGGGGGTTGGAGCTGAGTCAGCAGCGGTTTGTTTGGGTGATACGCCCACCAGCAGGGACAGATCCCATGGGAGCATTTTTCACGGTGGAAACGGAATCGACGGAGAAGACGCCGACGGAATACCTGCCGGAAGGGTTTTTAAAAAGGACGGAAGAGGTGGGTTTGGTGGTTCCGATGTGGGGTCCACAAGCGGAGATTTTGAGGCATAGATCGGTGAGGGGATTTGTGACGCACTGTGGGTGGAATTCGTCGATAGAAAGCATGGTGAATGGAGTGGCGATGGTGACGTGGCCTCTGTACGCGGAGCAGAAGATGAACGCGGTGATGCTGACGGAGGAGGTGGGGGTGGCAGTTAGGGGACGGGCGGAGGGGGTGGTGGGGAGGGCGGAGATAGAGAGGTGGGTGAGGCGGATAATGGTGGATGAAGAAGGGCGGGGAATCAGAGAGAGGGTTAAAGCCGTTAAAAATAGTGGGGAAAAGGCGGTGTGTAAGGGTGGGTCGTCGTACAATTCGTTGGCTCACGTGGCATCCGAATGCCATCGTCGGAGAGCGGAGGGTGTGTAG
BLAST of CmoCh03G008050 vs. Swiss-Prot
Match: U72E2_ARATH (UDP-glycosyltransferase 72E2 OS=Arabidopsis thaliana GN=UGT72E2 PE=1 SV=1)

HSP 1 Score: 422.9 bits (1086), Expect = 4.5e-117
Identity = 224/471 (47.56%), Postives = 309/471 (65.61%), Query Frame = 1

Query: 7   KTHVALLVSPGMGHLIPFLELADRLVLHHNLQTTLFVVNTDSSSAADALLLQKSSAVNIV 66
           K H A+  SPGMGH+IP +EL  RL  ++    T+FV+ TD++SA    L   S+ V+IV
Sbjct: 5   KPHAAMFSSPGMGHVIPVIELGKRLSANNGFHVTVFVLETDAASAQSKFL--NSTGVDIV 64

Query: 67  PIRHSSPSL----DPTVPVVDRIKAMMTASYPHVRSYIAAVEPRPVALIVDLFGTEAIAM 126
            +   SP +    DP   VV +I  +M A+ P +RS IAA+  +P ALIVDLFGT+A+ +
Sbjct: 65  KL--PSPDIYGLVDPDDHVVTKIGVIMRAAVPALRSKIAAMHQKPTALIVDLFGTDALCL 124

Query: 127 AHELGMLGFVFMPTSAWFLSLFFFFPCMDKQMLDAHAYNHEPLRIPGCSSVRFEDTIDAL 186
           A E  ML +VF+PT+A FL +  ++P +DK + + H     PL IPGC  VRFEDT+DA 
Sbjct: 125 AKEFNMLSYVFIPTNARFLGVSIYYPNLDKDIKEEHTVQRNPLAIPGCEPVRFEDTLDAY 184

Query: 187 EVSLEEVHEELRRVARELGMADGILANTWQDLEPTTLKALNEAGTLSNGKVNKVPIYPIG 246
            V  E V+ +  R       ADGIL NTW+++EP +LK+L     L  G+V +VP+YPIG
Sbjct: 185 LVPDEPVYRDFVRHGLAYPKADGILVNTWEEMEPKSLKSLLNPKLL--GRVARVPVYPIG 244

Query: 247 PLARRGEPN-LESEVLSWLDRQPDESVIHISFGSGGTLCAEQITELAWGLELSQQRFVWV 306
           PL R  + +  +  VL WL+ QP+ESV++ISFGSGG L A+Q+TELAWGLE SQQRFVWV
Sbjct: 245 PLCRPIQSSETDHPVLDWLNEQPNESVLYISFGSGGCLSAKQLTELAWGLEQSQQRFVWV 304

Query: 307 IRPPAGTDPMGAFFTVETESTEKTPTEYLPEGFLKRTEEVGLVVPMWGPQAEILRHRSVR 366
           +RPP        + +     TE    EYLPEGF+ RT + G VVP W PQAEIL HR+V 
Sbjct: 305 VRPPVDGSCCSEYVSANGGGTEDNTPEYLPEGFVSRTSDRGFVVPSWAPQAEILSHRAVG 364

Query: 367 GFVTHCGWNSSIESMVNGVAMVTWPLYAEQKMNAVMLTEEVGVAVR-GRAEGVVGRAEIE 426
           GF+THCGW+S++ES+V GV M+ WPL+AEQ MNA +L++E+G+AVR    +  + R +IE
Sbjct: 365 GFLTHCGWSSTLESVVGGVPMIAWPLFAEQNMNAALLSDELGIAVRLDDPKEDISRWKIE 424

Query: 427 RWVRRIMVDEEGRGIRERVKAVKNSGEK--AVCKGGSSYNSLAHVASECHR 470
             VR++M ++EG  +R +VK +++S E   ++  GG ++ SL  V  EC R
Sbjct: 425 ALVRKVMTEKEGEAMRRKVKKLRDSAEMSLSIDGGGLAHESLCRVTKECQR 469

BLAST of CmoCh03G008050 vs. Swiss-Prot
Match: UFOG5_MANES (Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta GN=GT5 PE=2 SV=1)

HSP 1 Score: 420.6 bits (1080), Expect = 2.2e-116
Identity = 223/468 (47.65%), Postives = 310/468 (66.24%), Query Frame = 1

Query: 7   KTHVALLVSPGMGHLIPFLELADRLVLHHNLQTTLFVVNTDSSSAADALLLQ----KSSA 66
           K H+ LL SPG+GHLIP LEL  R+V   N   T+F+V +D+S+A   +L      K   
Sbjct: 9   KPHIVLLSSPGLGHLIPVLELGKRIVTLCNFDVTIFMVGSDTSAAEPQVLRSAMTPKLCE 68

Query: 67  VNIVPIRHSSPSLDPTVPVVDRIKAMMTASYPHVRSYIAAVEPRPVALIVDLFGTEAIAM 126
           +  +P  + S  +DP   V  R+  +M    P  R+ ++A++ RP A+IVDLFGTE++ +
Sbjct: 69  IIQLPPPNISCLIDPEATVCTRLFVLMREIRPAFRAAVSALKFRPAAIIVDLFGTESLEV 128

Query: 127 AHELGMLGFVFMPTSAWFLSLFFFFPCMDKQMLDAHAYNHEPLRIPGCSSVRFEDTIDAL 186
           A ELG+  +V++ ++AWFL+L  + P +DK++        EP++IPGC  VR E+ +D +
Sbjct: 129 AKELGIAKYVYIASNAWFLALTIYVPILDKEVEGEFVLQKEPMKIPGCRPVRTEEVVDPM 188

Query: 187 EVSLEEVHEELRRVARELGMADGILANTWQDLEPTTLKALNEAGTLSNGKVNKVPIYPIG 246
                + + E  R+  E+  ADGIL NTW+ LEPTT  AL +   L  G+V KVP++PIG
Sbjct: 189 LDRTNQQYSEYFRLGIEIPTADGILMNTWEALEPTTFGALRDVKFL--GRVAKVPVFPIG 248

Query: 247 PLARRGEP-NLESEVLSWLDRQPDESVIHISFGSGGTLCAEQITELAWGLELSQQRFVWV 306
           PL R+  P     E+L WLD+QP ESV+++SFGSGGTL  EQ+ ELAWGLE SQQRF+WV
Sbjct: 249 PLRRQAGPCGSNCELLDWLDQQPKESVVYVSFGSGGTLSLEQMIELAWGLERSQQRFIWV 308

Query: 307 IRPPAGTDPMGAFFTVETESTEKTPTEYLPEGFLKRTEEVGLVVPMWGPQAEILRHRSVR 366
           +R P       AFFT    + +   + Y PEGFL R + VGLVVP W PQ  I+ H SV 
Sbjct: 309 VRQPTVKTGDAAFFTQGDGADDM--SGYFPEGFLTRIQNVGLVVPQWSPQIHIMSHPSVG 368

Query: 367 GFVTHCGWNSSIESMVNGVAMVTWPLYAEQKMNAVMLTEEVGVAVRGR---AEGVVGRAE 426
            F++HCGWNS +ES+  GV ++ WP+YAEQ+MNA +LTEE+GVAVR +   A+ VV R E
Sbjct: 369 VFLSHCGWNSVLESITAGVPIIAWPIYAEQRMNATLLTEELGVAVRPKNLPAKEVVKREE 428

Query: 427 IERWVRRIMVDEEGRGIRERVKAVKNSGEKAVCKGGSSYNSLAHVASE 467
           IER +RRIMVDEEG  IR+RV+ +K+SGEKA+ +GGSS+N ++ + +E
Sbjct: 429 IERMIRRIMVDEEGSEIRKRVRELKDSGEKALNEGGSSFNYMSALGNE 472

BLAST of CmoCh03G008050 vs. Swiss-Prot
Match: U72E1_ARATH (UDP-glycosyltransferase 72E1 OS=Arabidopsis thaliana GN=UGT72E1 PE=1 SV=1)

HSP 1 Score: 420.2 bits (1079), Expect = 2.9e-116
Identity = 218/470 (46.38%), Postives = 306/470 (65.11%), Query Frame = 1

Query: 7   KTHVALLVSPGMGHLIPFLELADRLVLHHNLQTTLFVVNTDSSSAADALLLQK---SSAV 66
           K HVA+  SPGMGH+IP +EL  RL   H    T+FV+ TD++SA    L      ++ V
Sbjct: 5   KPHVAMFASPGMGHIIPVIELGKRLAGSHGFDVTIFVLETDAASAQSQFLNSPGCDAALV 64

Query: 67  NIV--PIRHSSPSLDPTVPVVDRIKAMMTASYPHVRSYIAAVEPRPVALIVDLFGTEAIA 126
           +IV  P    S  +DP+     ++  MM  + P +RS I  ++ +P ALIVDLFG +AI 
Sbjct: 65  DIVGLPTPDISGLVDPSAFFGIKLLVMMRETIPTIRSKIEEMQHKPTALIVDLFGLDAIP 124

Query: 127 MAHELGMLGFVFMPTSAWFLSLFFFFPCMDKQMLDAHAYNHEPLRIPGCSSVRFEDTIDA 186
           +  E  ML ++F+ ++A FL++  FFP +DK M + H    +P+ +PGC  VRFEDT++ 
Sbjct: 125 LGGEFNMLTYIFIASNARFLAVALFFPTLDKDMEEEHIIKKQPMVMPGCEPVRFEDTLET 184

Query: 187 LEVSLEEVHEELRRVARELGMADGILANTWQDLEPTTLKALNEAGTLSNGKVNKVPIYPI 246
                 +++ E           DGI+ NTW D+EP TLK+L +   L  G++  VP+YPI
Sbjct: 185 FLDPNSQLYREFVPFGSVFPTCDGIIVNTWDDMEPKTLKSLQDPKLL--GRIAGVPVYPI 244

Query: 247 GPLARRGEPNLESE-VLSWLDRQPDESVIHISFGSGGTLCAEQITELAWGLELSQQRFVW 306
           GPL+R  +P+  +  VL WL++QPDESV++ISFGSGG+L A+Q+TELAWGLE+SQQRFVW
Sbjct: 245 GPLSRPVDPSKTNHPVLDWLNKQPDESVLYISFGSGGSLSAKQLTELAWGLEMSQQRFVW 304

Query: 307 VIRPPAGTDPMGAFFTVETESTEKTPTEYLPEGFLKRTEEVGLVVPMWGPQAEILRHRSV 366
           V+RPP       A+ +  +        +YLPEGF+ RT E G +V  W PQAEIL H++V
Sbjct: 305 VVRPPVDGSACSAYLSANSGKIRDGTPDYLPEGFVSRTHERGFMVSSWAPQAEILAHQAV 364

Query: 367 RGFVTHCGWNSSIESMVNGVAMVTWPLYAEQKMNAVMLTEEVGVAVRGR---AEGVVGRA 426
            GF+THCGWNS +ES+V GV M+ WPL+AEQ MNA +L EE+GVAVR +   +EGV+ RA
Sbjct: 365 GGFLTHCGWNSILESVVGGVPMIAWPLFAEQMMNATLLNEELGVAVRSKKLPSEGVITRA 424

Query: 427 EIERWVRRIMVDEEGRGIRERVKAVK-NSGEKAVCKGGSSYNSLAHVASE 467
           EIE  VR+IMV+EEG  +R+++K +K  + E   C GG ++ SL+ +A E
Sbjct: 425 EIEALVRKIMVEEEGAEMRKKIKKLKETAAESLSCDGGVAHESLSRIADE 472

BLAST of CmoCh03G008050 vs. Swiss-Prot
Match: U72E3_ARATH (UDP-glycosyltransferase 72E3 OS=Arabidopsis thaliana GN=UGT72E3 PE=1 SV=1)

HSP 1 Score: 414.5 bits (1064), Expect = 1.6e-114
Identity = 211/469 (44.99%), Postives = 313/469 (66.74%), Query Frame = 1

Query: 7   KTHVALLVSPGMGHLIPFLELADRLVLHHNLQTTLFVVNTDSSSAADALLLQKSSAVNIV 66
           K H A+  SPGMGH++P +ELA RL  +H    T+FV+ TD++S    LL   S+ V+IV
Sbjct: 5   KPHAAMFSSPGMGHVLPVIELAKRLSANHGFHVTVFVLETDAASVQSKLL--NSTGVDIV 64

Query: 67  --PIRHSSPSLDPTVPVVDRIKAMMTASYPHVRSYIAAVEPRPVALIVDLFGTEAIAMAH 126
             P    S  +DP   VV +I  +M  + P +RS I A+   P ALI+DLFGT+A+ +A 
Sbjct: 65  NLPSPDISGLVDPNAHVVTKIGVIMREAVPTLRSKIVAMHQNPTALIIDLFGTDALCLAA 124

Query: 127 ELGMLGFVFMPTSAWFLSLFFFFPCMDKQMLDAHAYNHEPLRIPGCSSVRFEDTIDALEV 186
           EL ML +VF+ ++A +L +  ++P +D+ + + H    +PL IPGC  VRFED +DA  V
Sbjct: 125 ELNMLTYVFIASNARYLGVSIYYPTLDEVIKEEHTVQRKPLTIPGCEPVRFEDIMDAYLV 184

Query: 187 SLEEVHEELRRVARELGMADGILANTWQDLEPTTLKALNEAGTLSNGKVNKVPIYPIGPL 246
             E V+ +L R       ADGIL NTW+++EP +LK+L +   L  G+V +VP+YP+GPL
Sbjct: 185 PDEPVYHDLVRHCLAYPKADGILVNTWEEMEPKSLKSLQDPKLL--GRVARVPVYPVGPL 244

Query: 247 ARRGEPNL-ESEVLSWLDRQPDESVIHISFGSGGTLCAEQITELAWGLELSQQRFVWVIR 306
            R  + +  +  V  WL++QP+ESV++ISFGSGG+L A+Q+TELAWGLE SQQRF+WV+R
Sbjct: 245 CRPIQSSTTDHPVFDWLNKQPNESVLYISFGSGGSLTAQQLTELAWGLEESQQRFIWVVR 304

Query: 307 PPAGTDPMGAFFTVETESTEKTPTEYLPEGFLKRTEEVGLVVPMWGPQAEILRHRSVRGF 366
           PP        +F+ +   T+    EYLPEGF+ RT + G ++P W PQAEIL H++V GF
Sbjct: 305 PPVDGSSCSDYFSAKGGVTKDNTPEYLPEGFVTRTCDRGFMIPSWAPQAEILAHQAVGGF 364

Query: 367 VTHCGWNSSIESMVNGVAMVTWPLYAEQKMNAVMLTEEVGVAVR-GRAEGVVGRAEIERW 426
           +THCGW+S++ES++ GV M+ WPL+AEQ MNA +L++E+G++VR    +  + R++IE  
Sbjct: 365 LTHCGWSSTLESVLCGVPMIAWPLFAEQNMNAALLSDELGISVRVDDPKEAISRSKIEAM 424

Query: 427 VRRIMVDEEGRGIRERVKAVKNSGEK--AVCKGGSSYNSLAHVASECHR 470
           VR++M ++EG  +R +VK ++++ E   ++  GGS++ SL  V  EC R
Sbjct: 425 VRKVMAEDEGEEMRRKVKKLRDTAEMSLSIHGGGSAHESLCRVTKECQR 469

BLAST of CmoCh03G008050 vs. Swiss-Prot
Match: U72C1_ARATH (UDP-glycosyltransferase 72C1 OS=Arabidopsis thaliana GN=UGT72C1 PE=2 SV=3)

HSP 1 Score: 361.3 bits (926), Expect = 1.6e-98
Identity = 200/451 (44.35%), Postives = 281/451 (62.31%), Query Frame = 1

Query: 9   HVALLVSPGMGHLIPFLELADRLVLHHNL-QTTLFVVNTD---SSSAADALLLQKSS--A 68
           H AL+ SPGMGH +P LEL   L+ HH   + T+F+V  D   S S     L+++     
Sbjct: 4   HGALVASPGMGHAVPILELGKHLLNHHGFDRVTVFLVTDDVSRSKSLIGKTLMEEDPKFV 63

Query: 69  VNIVPIRHSSPSLDPTVPVVDRIKAMMTASYPHVRSYIAAVEPRPVALIVDLFGTEAIAM 128
           +  +P+  S   L  ++  + ++  MM  + P ++S +  +EPRP   +VDL GTEA+ +
Sbjct: 64  IRFIPLDVSGQDLSGSL--LTKLAEMMRKALPEIKSSVMELEPRPRVFVVDLLGTEALEV 123

Query: 129 AHELGMLG-FVFMPTSAWFLSLFFFFPCMDKQMLDAHAYNHEPLRIPGCSSVRFEDTIDA 188
           A ELG++   V + TSAWFL+   +   +DKQ L     +   L IPGCS V+FE   D 
Sbjct: 124 AKELGIMRKHVLVTTSAWFLAFTVYMASLDKQELYKQLSSIGALLIPGCSPVKFERAQDP 183

Query: 189 LEVSLEEVHEELRRVARELGMADGILANTWQDLEPTTLKALNEAGTLSNGKVNK-VPIYP 248
            +   E    E +R+  E+  ADG+  NTW  LE  T+ +  +   L  G+V + VP+YP
Sbjct: 184 RKYIRELA--ESQRIGDEVITADGVFVNTWHSLEQVTIGSFLDPENL--GRVMRGVPVYP 243

Query: 249 IGPLARRGEPNLESEVLSWLDRQPDESVIHISFGSGGTLCAEQITELAWGLELSQQRFVW 308
           +GPL R  EP L+  VL WLD QP ESV+++SFGSGG L  EQ  ELA+GLEL+  RFVW
Sbjct: 244 VGPLVRPAEPGLKHGVLDWLDLQPKESVVYVSFGSGGALTFEQTNELAYGLELTGHRFVW 303

Query: 309 VIRPPAGTDPMGAFFTVETESTEKTPTEYLPEGFLKRTEEVGLVVPMWGPQAEILRHRSV 368
           V+RPPA  DP  + F      TE  P ++LP GFL RT+++GLVV  W PQ EIL H+S 
Sbjct: 304 VVRPPAEDDPSASMFDKTKNETE--PLDFLPNGFLDRTKDIGLVVRTWAPQEEILAHKST 363

Query: 369 RGFVTHCGWNSSIESMVNGVAMVTWPLYAEQKMNAVMLTEEVGVAVR-GRAEGVVGRAEI 428
            GFVTHCGWNS +ES+VNGV MV WPLY+EQKMNA M++ E+ +A++   A+G+V +  I
Sbjct: 364 GGFVTHCGWNSVLESIVNGVPMVAWPLYSEQKMNARMVSGELKIALQINVADGIVKKEVI 423

Query: 429 ERWVRRIMVDEEGRGIRERVKAVKNSGEKAV 451
              V+R+M +EEG+ +R+ VK +K + E+A+
Sbjct: 424 AEMVKRVMDEEEGKEMRKNVKELKKTAEEAL 446

BLAST of CmoCh03G008050 vs. TrEMBL
Match: A0A0A0KGE4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G501940 PE=4 SV=1)

HSP 1 Score: 680.2 bits (1754), Expect = 1.7e-192
Identity = 350/475 (73.68%), Postives = 394/475 (82.95%), Query Frame = 1

Query: 5   EPKTHVALLVSPGMGHLIPFLELADRLVLHHNLQTTLFVVNTDSSSAADALLLQKSSAVN 64
           E KTHVALLVSPGMGHLIPFLELA+RLVLHHNLQ TLFVV T SSS A++ LLQK S VN
Sbjct: 5   ESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSS-AESTLLQKPSLVN 64

Query: 65  IVPIRHSSPSLDPTVPVVDRIKAMMTASYPHVRSYIAAVEPRPVALIVDLFGTEAIAMAH 124
           IV + HS  SLDP  P+ D I +MMTAS+P +RS IAAV PRP ALIVDLFGT A+++AH
Sbjct: 65  IVSLPHSLSSLDPNAPICDIIISMMTASFPFLRSSIAAVNPRPAALIVDLFGTPALSIAH 124

Query: 125 ELGMLGFVFMPTSAWFLSLFFFFPCMDKQMLDAHAYNHEPLRIPGCSSVRFEDTIDALEV 184
           ELGMLG VFM T+AW+LS+ + +P  +K M+DAH YNH+PL IPGC+ VRFEDTI+  E+
Sbjct: 125 ELGMLGLVFMTTNAWYLSVSYLYPSFEKSMVDAHVYNHDPLVIPGCTPVRFEDTIEVFEL 184

Query: 185 SLEEVHEELRRVARELGMADGILANTWQDLEPTTLKALNEAGTLSNGKVNKVPIYPIGPL 244
           + EEV+    R ARELG ADGIL+NTWQDLEPTTLKAL+EAGTL  GKVN+VPIYPIGPL
Sbjct: 185 NQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLGYGKVNEVPIYPIGPL 244

Query: 245 ARRGEPNLESEVLSWLDRQPDESVIHISFGSGGTLCAEQITELAWGLELSQQRFVWVIRP 304
            R GEP LESEVL WLDRQPDESVI++SFGSGGTLC EQITELAWGLELSQQRFVWVIRP
Sbjct: 245 TRNGEPTLESEVLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQRFVWVIRP 304

Query: 305 PAGTDPMGAFFTV-ETESTEKTPTEYLPEGFLKRTEEVGLVVPMWGPQAEILRHRSVRGF 364
           P GT+  GAFFT     S +   ++YLPEGF+KRT+EVGLV+PMWGPQAEIL HRSVRGF
Sbjct: 305 PEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEILSHRSVRGF 364

Query: 365 VTHCGWNSSIESMVNGVAMVTWPLYAEQKMNAVMLTEEVGVAVRGRAE--GVVGRAEIER 424
           VTHCGWNSS+ES+VNGVAMVTWPLYAEQKMNA +LTEE+GVAVR RAE  GVV R EIE+
Sbjct: 365 VTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVRLRAEGQGVVERKEIEK 424

Query: 425 WVRRIMVDEEGRGIRERVKAVKNSGEKAVCKGGSSYNSLAHVASEC--HRRRAEG 475
            VR IM  +EG GIRERVK +K SG KAV KGGSSYNSLA VASEC   RRR +G
Sbjct: 425 KVRMIMEGKEGEGIRERVKELKISGGKAVTKGGSSYNSLARVASECDIFRRRRDG 478

BLAST of CmoCh03G008050 vs. TrEMBL
Match: B9HEN9_POPTR (Glycosyltransferase OS=Populus trichocarpa GN=POPTR_0007s12400g PE=3 SV=1)

HSP 1 Score: 499.6 bits (1285), Expect = 4.2e-138
Identity = 263/468 (56.20%), Postives = 338/468 (72.22%), Query Frame = 1

Query: 7   KTHVALLVSPGMGHLIPFLELADRLVLHHNLQTTLFVVNTDSSSAADALLLQKSSAVNIV 66
           K H ALL SPGMGHLIP LEL  RLV +H    TLFVV TD+S+   + L +    +NI+
Sbjct: 5   KPHAALLASPGMGHLIPVLELGKRLVTYHGFHVTLFVVATDASTT-QSRLKEPYPNINII 64

Query: 67  --PIRHSSPSLDPTVPVVDRIKAMMTASYPHVRSYIAAVEPRPVALIVDLFGTEAIAMAH 126
             P+   S  +DP   VV ++  MM  + P +RS I A++  P ALIVDLFGTEA A+A 
Sbjct: 65  TLPLVDISGLIDPAATVVTKLAVMMRETLPSLRSAILALKSPPTALIVDLFGTEAFAVAE 124

Query: 127 ELGMLGFVFMPTSAWFLSLFFFFPCMDKQMLDAHAYNHEPLRIPGCSSVRFEDTIDALEV 186
           E  ML +VF  ++AWF ++  +FP +D+ + D H    +PLRIPGC SVRFEDT+ A   
Sbjct: 125 EFNMLKYVFDTSNAWFFAITIYFPTIDRNLEDKHVIQKQPLRIPGCKSVRFEDTLGAYLD 184

Query: 187 SLEEVHEELRRVARELGMADGILANTWQDLEPTTLKALNEAGTLSNGKVNKVPIYPIGPL 246
             ++++ E +R+  E+ MADGIL NTW+DLEPTTL AL +   L  G+V K P+YPIGPL
Sbjct: 185 RNDQMYIEYKRIGIEMPMADGILMNTWEDLEPTTLGALRDFQML--GRVAKAPVYPIGPL 244

Query: 247 ARRGEPNL-ESEVLSWLDRQPDESVIHISFGSGGTLCAEQITELAWGLELSQQRFVWVIR 306
           AR   P++  ++VL+WLD QP+ESVI++SFGSGGTL  EQ+ ELAWGLELS+QRFVWV+R
Sbjct: 245 ARPVGPSVPRNQVLNWLDNQPNESVIYVSFGSGGTLSTEQMAELAWGLELSKQRFVWVVR 304

Query: 307 PPAGTDPMGAFFTVETESTEKTPTEYLPEGFLKRTEEVGLVVPMWGPQAEILRHRSVRGF 366
           PP   D  GAFF ++ + +E  P+ +LPEGFL RT EVGLVVP+W PQ EIL H SV GF
Sbjct: 305 PPIDNDAAGAFFNLD-DGSEGIPS-FLPEGFLARTREVGLVVPLWAPQVEILAHPSVGGF 364

Query: 367 VTHCGWNSSIESMVNGVAMVTWPLYAEQKMNAVMLTEEVGVAVRGR---AEGVVGRAEIE 426
           ++HCGWNS++ES+ NGV M+ WPLYAEQKMNA +LTEE+GVAV+ +   +E VV RAEIE
Sbjct: 365 LSHCGWNSTLESITNGVPMIAWPLYAEQKMNATILTEELGVAVQPKTLASERVVVRAEIE 424

Query: 427 RWVRRIMVDEEGRGIRERVKAVKNSGEKAV-CKGGSSYNSLAHVASEC 468
             VR+IM DEEG GIR+RV  +K+SGEKA+  KGGSSYNSL+ +A +C
Sbjct: 425 MMVRKIMEDEEGFGIRKRVNELKHSGEKALSSKGGSSYNSLSQIAKQC 467

BLAST of CmoCh03G008050 vs. TrEMBL
Match: E1ANG8_POPTO (Glycosyltransferase OS=Populus tomentosa PE=3 SV=1)

HSP 1 Score: 485.7 bits (1249), Expect = 6.3e-134
Identity = 258/468 (55.13%), Postives = 332/468 (70.94%), Query Frame = 1

Query: 7   KTHVALLVSPGMGHLIPFLELADRLVLHHNLQTTLFVVNTDSSSAADALLLQKSSAVNIV 66
           K H ALL SPGMGHLIP LEL  RLV +H    T FVV TD+S+   +LL +    +NI+
Sbjct: 5   KPHAALLASPGMGHLIPVLELCKRLVTYHGFHVTFFVVATDASTT-QSLLKEPYPNINII 64

Query: 67  --PIRHSSPSLDPTVPVVDRIKAMMTASYPHVRSYIAAVEPRPVALIVDLFGTEAIAMAH 126
             P+   S  +DP   VV ++  MM  + P +RS I A++  P ALIVDLFGTEA A+A 
Sbjct: 65  TLPLVDISGLIDPAATVVTKLAVMMRETLPSLRSAILALKSPPTALIVDLFGTEAFAVAE 124

Query: 127 ELGMLGFVFMPTSAWFLSLFFFFPCMDKQMLDAHAYNHEPLRIPGCSSVRFEDTIDALEV 186
           E  ML +VF  ++AWF ++  + P +D+ + D H    +PLRIPGC SVRFEDT+ A   
Sbjct: 125 EFNMLKYVFDTSNAWFFAITIYVPTIDRNLEDRHIIQKQPLRIPGCKSVRFEDTLQAYLD 184

Query: 187 SLEEVHEELRRVARELGMADGILANTWQDLEPTTLKALNEAGTLSNGKVNKVPIYPIGPL 246
             ++ + E +R+  E+ MADGIL NTW+DLEPTTL AL +   L  G+V + P+YPIGPL
Sbjct: 185 RNDQTYIEYKRIGIEMPMADGILMNTWEDLEPTTLGALRDFQML--GRVAQSPVYPIGPL 244

Query: 247 ARRGEPNL-ESEVLSWLDRQPDESVIHISFGSGGTLCAEQITELAWGLELSQQRFVWVIR 306
           AR   P +  ++VL WLD QP ESVI++SFGSGGTL +EQ+ ELAWGLELS+QRFVWV+R
Sbjct: 245 ARPVGPLIPRNQVLKWLDNQPYESVIYVSFGSGGTLSSEQMAELAWGLELSKQRFVWVVR 304

Query: 307 PPAGTDPMGAFFTVETESTEKTPTEYLPEGFLKRTEEVGLVVPMWGPQAEILRHRSVRGF 366
           P    D  GAFF ++ + +E  P+ +LPEGFL RT E+GL VPMW PQ EIL H SV GF
Sbjct: 305 PSIDNDADGAFFNLD-DGSEGIPS-FLPEGFLDRTREMGLAVPMWAPQVEILAHPSVGGF 364

Query: 367 VTHCGWNSSIESMVNGVAMVTWPLYAEQKMNAVMLTEEVGVAVRGR---AEGVVGRAEIE 426
           ++HCGWNS++ES+ NGV ++ WPLYAEQKMNA +LTEE+GVAV+ +   +E VV RAEIE
Sbjct: 365 LSHCGWNSTLESITNGVPLIAWPLYAEQKMNATILTEELGVAVQPKTLASERVVVRAEIE 424

Query: 427 RWVRRIMVDEEGRGIRERVKAVKNSGEKAV-CKGGSSYNSLAHVASEC 468
             VR+IM DEEG GIR+RV  +K+SGEKA+  KGGSSYNSL+ +A +C
Sbjct: 425 MMVRKIMEDEEGFGIRKRVNELKHSGEKALSSKGGSSYNSLSQIAKQC 467

BLAST of CmoCh03G008050 vs. TrEMBL
Match: A0A061DGH3_THECC (Glycosyltransferase OS=Theobroma cacao GN=TCM_000444 PE=3 SV=1)

HSP 1 Score: 474.6 bits (1220), Expect = 1.5e-130
Identity = 250/467 (53.53%), Postives = 335/467 (71.73%), Query Frame = 1

Query: 7   KTHVALLVSPGMGHLIPFLELADRLVLHHNLQTTLFVVNTDSSSAADALLLQKSSAV-NI 66
           K HVALL SPG+GHLIP LEL  RLV HHN + T+FV+ +++S+A + LL   +  V NI
Sbjct: 5   KPHVALLASPGLGHLIPVLELGKRLVTHHNFRITIFVLASEASTAQNQLLESSNMDVLNI 64

Query: 67  V--PIRHSSPSLDPTVPVVDRIKAMMTASYPHVRSYIAAVEPRPVALIVDLFGTEAIAMA 126
           V  P    S  +DP   +V +I  +M  S P +RS IAA++ RP ALIVDLFGTEA+ +A
Sbjct: 65  VSLPSAEISTKVDPGAHIVTKIVVIMRESLPGLRSAIAAMKSRPSALIVDLFGTEALPVA 124

Query: 127 HELGMLGFVFMPTSAWFLSLFFFFPCMDKQMLDAHAYNHEPLRIPGCSSVRFEDTIDALE 186
            E  ML +VF+ ++AWFL +  + P ++K + + H    +PL+IPGC SVRFEDT++A  
Sbjct: 125 DEFKMLKYVFIASNAWFLGITVYAPTVEKIVDEEHVKQQKPLKIPGCKSVRFEDTLEAYL 184

Query: 187 VSLEEVHEELRRVARELGMADGILANTWQDLEPTTLKALNEAGTLSNGKVNKVPIYPIGP 246
              ++++ E  RV  E+  ADGIL NT++DLEP TL++L +A  L  G+V KVP+YPIGP
Sbjct: 185 NRNDQLYGEYARVGLEIPEADGILVNTFEDLEPATLRSLTDAELL--GRVAKVPVYPIGP 244

Query: 247 LARR-GEPNLESEVLSWLDRQPDESVIHISFGSGGTLCAEQITELAWGLELSQQRFVWVI 306
           + R  G   L   VL WLD+QP +SVI++SFGSGGTL A+Q+TE+AWGLE SQQRF+WV+
Sbjct: 245 VVRTLGPLVLADPVLDWLDKQPSQSVIYVSFGSGGTLSAKQMTEIAWGLEQSQQRFIWVV 304

Query: 307 RPPAGTDPMGAFFTVETESTEKTPTEYLPEGFLKRTEEVGLVVPMWGPQAEILRHRSVRG 366
           RPP   D  G FFTV  +S + TP +YLP+GFL RT + GLV+PMW PQ +IL H SV G
Sbjct: 305 RPPVENDASGTFFTVGNDS-DGTP-DYLPDGFLTRTRDRGLVLPMWAPQTDILAHPSVGG 364

Query: 367 FVTHCGWNSSIESMVNGVAMVTWPLYAEQKMNAVMLTEEVGVAVRGR---AEGVVGRAEI 426
           FV+HCGWNS++ES++NGV ++ WPLYAEQKMNA MLTEE+G+AVR +   +  +V R E+
Sbjct: 365 FVSHCGWNSTMESLLNGVPLIAWPLYAEQKMNATMLTEELGLAVRPKMSTSSRIVERKEL 424

Query: 427 ERWVRRIMVDEEGRGIRERVKAVKNSGEKAVCKGGSSYNSLAHVASE 467
           E  VR+IMVD++G+ IR+R K +K+  +KA+ KGGSS  SL+ VA E
Sbjct: 425 EMVVRKIMVDKDGQEIRDRAKELKHIAQKALSKGGSSCTSLSQVAKE 467

BLAST of CmoCh03G008050 vs. TrEMBL
Match: K7LND6_SOYBN (Glycosyltransferase OS=Glycine max GN=GLYMA_11G064400 PE=3 SV=1)

HSP 1 Score: 473.0 bits (1216), Expect = 4.2e-130
Identity = 247/466 (53.00%), Postives = 317/466 (68.03%), Query Frame = 1

Query: 7   KTHVALLVSPGMGHLIPFLELADRLVLHHNLQTTLFVVNTDSSSAADALLLQKSSA-VNI 66
           K H AL+ SPGMGHLIP LEL  RL+ HH+   T+F+V TDS++    +L Q S+  + +
Sbjct: 5   KAHAALVASPGMGHLIPMLELGKRLLTHHSFHVTIFIVTTDSATTTSHILQQTSNLNIVL 64

Query: 67  VPIRHSSPSLDPTVPVVDRIKAMMTASYPHVRSYIAAVE-PRPVALIVDLFGTEAIAMAH 126
           VP    S  L P  P+  RI   M  S P +RS I +   P P ALIVD+FG  A  +A 
Sbjct: 65  VPPIDVSHKLPPNPPLAARIMLTMIDSIPFLRSSILSTNLPPPSALIVDMFGLAAFPIAR 124

Query: 127 ELGMLGFVFMPTSAWFLSLFFFFPCMDKQMLDAHAYNHEPLRIPGCSSVRFEDTIDALEV 186
           +LGML +V+  TSAWF ++  + P MDK+M++ HA +HEPL IPGC +VRFEDT++    
Sbjct: 125 DLGMLTYVYFATSAWFSAVSVYVPAMDKKMIERHAEHHEPLVIPGCEAVRFEDTLEPFLS 184

Query: 187 SLEEVHEELRRVARELGMADGILANTWQDLEPTTLKALNEAGTLSNGKVNKVPIYPIGPL 246
            + E++E     A+E+  ADGIL NTWQDLEP   KA+ E G L  G+  K  +YP+GPL
Sbjct: 185 PIGEMYEGYLAAAKEIVTADGILMNTWQDLEPAATKAVREDGIL--GRFTKGAVYPVGPL 244

Query: 247 ARRGEPNLESEVLSWLDRQPDESVIHISFGSGGTLCAEQITELAWGLELSQQRFVWVIRP 306
            R  E   E  VLSW+D QP E+V+++SFGSGGT+   Q+ E+A GLELSQQRFVWV+RP
Sbjct: 245 VRTVEKKAEDAVLSWMDVQPAETVVYVSFGSGGTMSEVQMREVALGLELSQQRFVWVVRP 304

Query: 307 PAGTDPMGAFFTVETESTEKTPTEYLPEGFLKRTEEVGLVVPMWGPQAEILRHRSVRGFV 366
           P   D  G+FF V    +     +YLP+GF+KRTE VG+VVPMW PQAEIL H +   FV
Sbjct: 305 PCEGDTSGSFFEVSKNGSGDVVLDYLPKGFVKRTEGVGVVVPMWAPQAEILGHPATGCFV 364

Query: 367 THCGWNSSIESMVNGVAMVTWPLYAEQKMNAVMLTEEVGVAVRGRAE---GVVGRAEIER 426
           THCGWNS +ES++NGV MV WPLYAEQKMNA ML+EE+GVAVR   E   GVVGR EI  
Sbjct: 365 THCGWNSVLESVLNGVPMVAWPLYAEQKMNAFMLSEELGVAVRVAGEGGGGVVGREEIAE 424

Query: 427 WVRRIMVDEEGRGIRERVKAVKNSGEKAVCKGGSSYNSLAHVASEC 468
            VRR+MVD+EG G+R++VK +K SGEKA+ K GSS++ L  +  +C
Sbjct: 425 LVRRVMVDKEGVGMRKKVKELKVSGEKALSKFGSSHHWLCQMNKDC 468

BLAST of CmoCh03G008050 vs. TAIR10
Match: AT5G66690.1 (AT5G66690.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 422.9 bits (1086), Expect = 2.5e-118
Identity = 224/471 (47.56%), Postives = 309/471 (65.61%), Query Frame = 1

Query: 7   KTHVALLVSPGMGHLIPFLELADRLVLHHNLQTTLFVVNTDSSSAADALLLQKSSAVNIV 66
           K H A+  SPGMGH+IP +EL  RL  ++    T+FV+ TD++SA    L   S+ V+IV
Sbjct: 5   KPHAAMFSSPGMGHVIPVIELGKRLSANNGFHVTVFVLETDAASAQSKFL--NSTGVDIV 64

Query: 67  PIRHSSPSL----DPTVPVVDRIKAMMTASYPHVRSYIAAVEPRPVALIVDLFGTEAIAM 126
            +   SP +    DP   VV +I  +M A+ P +RS IAA+  +P ALIVDLFGT+A+ +
Sbjct: 65  KL--PSPDIYGLVDPDDHVVTKIGVIMRAAVPALRSKIAAMHQKPTALIVDLFGTDALCL 124

Query: 127 AHELGMLGFVFMPTSAWFLSLFFFFPCMDKQMLDAHAYNHEPLRIPGCSSVRFEDTIDAL 186
           A E  ML +VF+PT+A FL +  ++P +DK + + H     PL IPGC  VRFEDT+DA 
Sbjct: 125 AKEFNMLSYVFIPTNARFLGVSIYYPNLDKDIKEEHTVQRNPLAIPGCEPVRFEDTLDAY 184

Query: 187 EVSLEEVHEELRRVARELGMADGILANTWQDLEPTTLKALNEAGTLSNGKVNKVPIYPIG 246
            V  E V+ +  R       ADGIL NTW+++EP +LK+L     L  G+V +VP+YPIG
Sbjct: 185 LVPDEPVYRDFVRHGLAYPKADGILVNTWEEMEPKSLKSLLNPKLL--GRVARVPVYPIG 244

Query: 247 PLARRGEPN-LESEVLSWLDRQPDESVIHISFGSGGTLCAEQITELAWGLELSQQRFVWV 306
           PL R  + +  +  VL WL+ QP+ESV++ISFGSGG L A+Q+TELAWGLE SQQRFVWV
Sbjct: 245 PLCRPIQSSETDHPVLDWLNEQPNESVLYISFGSGGCLSAKQLTELAWGLEQSQQRFVWV 304

Query: 307 IRPPAGTDPMGAFFTVETESTEKTPTEYLPEGFLKRTEEVGLVVPMWGPQAEILRHRSVR 366
           +RPP        + +     TE    EYLPEGF+ RT + G VVP W PQAEIL HR+V 
Sbjct: 305 VRPPVDGSCCSEYVSANGGGTEDNTPEYLPEGFVSRTSDRGFVVPSWAPQAEILSHRAVG 364

Query: 367 GFVTHCGWNSSIESMVNGVAMVTWPLYAEQKMNAVMLTEEVGVAVR-GRAEGVVGRAEIE 426
           GF+THCGW+S++ES+V GV M+ WPL+AEQ MNA +L++E+G+AVR    +  + R +IE
Sbjct: 365 GFLTHCGWSSTLESVVGGVPMIAWPLFAEQNMNAALLSDELGIAVRLDDPKEDISRWKIE 424

Query: 427 RWVRRIMVDEEGRGIRERVKAVKNSGEK--AVCKGGSSYNSLAHVASECHR 470
             VR++M ++EG  +R +VK +++S E   ++  GG ++ SL  V  EC R
Sbjct: 425 ALVRKVMTEKEGEAMRRKVKKLRDSAEMSLSIDGGGLAHESLCRVTKECQR 469

BLAST of CmoCh03G008050 vs. TAIR10
Match: AT3G50740.1 (AT3G50740.1 UDP-glucosyl transferase 72E1)

HSP 1 Score: 420.2 bits (1079), Expect = 1.7e-117
Identity = 218/470 (46.38%), Postives = 306/470 (65.11%), Query Frame = 1

Query: 7   KTHVALLVSPGMGHLIPFLELADRLVLHHNLQTTLFVVNTDSSSAADALLLQK---SSAV 66
           K HVA+  SPGMGH+IP +EL  RL   H    T+FV+ TD++SA    L      ++ V
Sbjct: 5   KPHVAMFASPGMGHIIPVIELGKRLAGSHGFDVTIFVLETDAASAQSQFLNSPGCDAALV 64

Query: 67  NIV--PIRHSSPSLDPTVPVVDRIKAMMTASYPHVRSYIAAVEPRPVALIVDLFGTEAIA 126
           +IV  P    S  +DP+     ++  MM  + P +RS I  ++ +P ALIVDLFG +AI 
Sbjct: 65  DIVGLPTPDISGLVDPSAFFGIKLLVMMRETIPTIRSKIEEMQHKPTALIVDLFGLDAIP 124

Query: 127 MAHELGMLGFVFMPTSAWFLSLFFFFPCMDKQMLDAHAYNHEPLRIPGCSSVRFEDTIDA 186
           +  E  ML ++F+ ++A FL++  FFP +DK M + H    +P+ +PGC  VRFEDT++ 
Sbjct: 125 LGGEFNMLTYIFIASNARFLAVALFFPTLDKDMEEEHIIKKQPMVMPGCEPVRFEDTLET 184

Query: 187 LEVSLEEVHEELRRVARELGMADGILANTWQDLEPTTLKALNEAGTLSNGKVNKVPIYPI 246
                 +++ E           DGI+ NTW D+EP TLK+L +   L  G++  VP+YPI
Sbjct: 185 FLDPNSQLYREFVPFGSVFPTCDGIIVNTWDDMEPKTLKSLQDPKLL--GRIAGVPVYPI 244

Query: 247 GPLARRGEPNLESE-VLSWLDRQPDESVIHISFGSGGTLCAEQITELAWGLELSQQRFVW 306
           GPL+R  +P+  +  VL WL++QPDESV++ISFGSGG+L A+Q+TELAWGLE+SQQRFVW
Sbjct: 245 GPLSRPVDPSKTNHPVLDWLNKQPDESVLYISFGSGGSLSAKQLTELAWGLEMSQQRFVW 304

Query: 307 VIRPPAGTDPMGAFFTVETESTEKTPTEYLPEGFLKRTEEVGLVVPMWGPQAEILRHRSV 366
           V+RPP       A+ +  +        +YLPEGF+ RT E G +V  W PQAEIL H++V
Sbjct: 305 VVRPPVDGSACSAYLSANSGKIRDGTPDYLPEGFVSRTHERGFMVSSWAPQAEILAHQAV 364

Query: 367 RGFVTHCGWNSSIESMVNGVAMVTWPLYAEQKMNAVMLTEEVGVAVRGR---AEGVVGRA 426
            GF+THCGWNS +ES+V GV M+ WPL+AEQ MNA +L EE+GVAVR +   +EGV+ RA
Sbjct: 365 GGFLTHCGWNSILESVVGGVPMIAWPLFAEQMMNATLLNEELGVAVRSKKLPSEGVITRA 424

Query: 427 EIERWVRRIMVDEEGRGIRERVKAVK-NSGEKAVCKGGSSYNSLAHVASE 467
           EIE  VR+IMV+EEG  +R+++K +K  + E   C GG ++ SL+ +A E
Sbjct: 425 EIEALVRKIMVEEEGAEMRKKIKKLKETAAESLSCDGGVAHESLSRIADE 472

BLAST of CmoCh03G008050 vs. TAIR10
Match: AT5G26310.1 (AT5G26310.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 414.5 bits (1064), Expect = 9.1e-116
Identity = 211/469 (44.99%), Postives = 313/469 (66.74%), Query Frame = 1

Query: 7   KTHVALLVSPGMGHLIPFLELADRLVLHHNLQTTLFVVNTDSSSAADALLLQKSSAVNIV 66
           K H A+  SPGMGH++P +ELA RL  +H    T+FV+ TD++S    LL   S+ V+IV
Sbjct: 5   KPHAAMFSSPGMGHVLPVIELAKRLSANHGFHVTVFVLETDAASVQSKLL--NSTGVDIV 64

Query: 67  --PIRHSSPSLDPTVPVVDRIKAMMTASYPHVRSYIAAVEPRPVALIVDLFGTEAIAMAH 126
             P    S  +DP   VV +I  +M  + P +RS I A+   P ALI+DLFGT+A+ +A 
Sbjct: 65  NLPSPDISGLVDPNAHVVTKIGVIMREAVPTLRSKIVAMHQNPTALIIDLFGTDALCLAA 124

Query: 127 ELGMLGFVFMPTSAWFLSLFFFFPCMDKQMLDAHAYNHEPLRIPGCSSVRFEDTIDALEV 186
           EL ML +VF+ ++A +L +  ++P +D+ + + H    +PL IPGC  VRFED +DA  V
Sbjct: 125 ELNMLTYVFIASNARYLGVSIYYPTLDEVIKEEHTVQRKPLTIPGCEPVRFEDIMDAYLV 184

Query: 187 SLEEVHEELRRVARELGMADGILANTWQDLEPTTLKALNEAGTLSNGKVNKVPIYPIGPL 246
             E V+ +L R       ADGIL NTW+++EP +LK+L +   L  G+V +VP+YP+GPL
Sbjct: 185 PDEPVYHDLVRHCLAYPKADGILVNTWEEMEPKSLKSLQDPKLL--GRVARVPVYPVGPL 244

Query: 247 ARRGEPNL-ESEVLSWLDRQPDESVIHISFGSGGTLCAEQITELAWGLELSQQRFVWVIR 306
            R  + +  +  V  WL++QP+ESV++ISFGSGG+L A+Q+TELAWGLE SQQRF+WV+R
Sbjct: 245 CRPIQSSTTDHPVFDWLNKQPNESVLYISFGSGGSLTAQQLTELAWGLEESQQRFIWVVR 304

Query: 307 PPAGTDPMGAFFTVETESTEKTPTEYLPEGFLKRTEEVGLVVPMWGPQAEILRHRSVRGF 366
           PP        +F+ +   T+    EYLPEGF+ RT + G ++P W PQAEIL H++V GF
Sbjct: 305 PPVDGSSCSDYFSAKGGVTKDNTPEYLPEGFVTRTCDRGFMIPSWAPQAEILAHQAVGGF 364

Query: 367 VTHCGWNSSIESMVNGVAMVTWPLYAEQKMNAVMLTEEVGVAVR-GRAEGVVGRAEIERW 426
           +THCGW+S++ES++ GV M+ WPL+AEQ MNA +L++E+G++VR    +  + R++IE  
Sbjct: 365 LTHCGWSSTLESVLCGVPMIAWPLFAEQNMNAALLSDELGISVRVDDPKEAISRSKIEAM 424

Query: 427 VRRIMVDEEGRGIRERVKAVKNSGEK--AVCKGGSSYNSLAHVASECHR 470
           VR++M ++EG  +R +VK ++++ E   ++  GGS++ SL  V  EC R
Sbjct: 425 VRKVMAEDEGEEMRRKVKKLRDTAEMSLSIHGGGSAHESLCRVTKECQR 469

BLAST of CmoCh03G008050 vs. TAIR10
Match: AT4G36770.1 (AT4G36770.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 361.3 bits (926), Expect = 9.1e-100
Identity = 200/451 (44.35%), Postives = 281/451 (62.31%), Query Frame = 1

Query: 9   HVALLVSPGMGHLIPFLELADRLVLHHNL-QTTLFVVNTD---SSSAADALLLQKSS--A 68
           H AL+ SPGMGH +P LEL   L+ HH   + T+F+V  D   S S     L+++     
Sbjct: 4   HGALVASPGMGHAVPILELGKHLLNHHGFDRVTVFLVTDDVSRSKSLIGKTLMEEDPKFV 63

Query: 69  VNIVPIRHSSPSLDPTVPVVDRIKAMMTASYPHVRSYIAAVEPRPVALIVDLFGTEAIAM 128
           +  +P+  S   L  ++  + ++  MM  + P ++S +  +EPRP   +VDL GTEA+ +
Sbjct: 64  IRFIPLDVSGQDLSGSL--LTKLAEMMRKALPEIKSSVMELEPRPRVFVVDLLGTEALEV 123

Query: 129 AHELGMLG-FVFMPTSAWFLSLFFFFPCMDKQMLDAHAYNHEPLRIPGCSSVRFEDTIDA 188
           A ELG++   V + TSAWFL+   +   +DKQ L     +   L IPGCS V+FE   D 
Sbjct: 124 AKELGIMRKHVLVTTSAWFLAFTVYMASLDKQELYKQLSSIGALLIPGCSPVKFERAQDP 183

Query: 189 LEVSLEEVHEELRRVARELGMADGILANTWQDLEPTTLKALNEAGTLSNGKVNK-VPIYP 248
            +   E    E +R+  E+  ADG+  NTW  LE  T+ +  +   L  G+V + VP+YP
Sbjct: 184 RKYIRELA--ESQRIGDEVITADGVFVNTWHSLEQVTIGSFLDPENL--GRVMRGVPVYP 243

Query: 249 IGPLARRGEPNLESEVLSWLDRQPDESVIHISFGSGGTLCAEQITELAWGLELSQQRFVW 308
           +GPL R  EP L+  VL WLD QP ESV+++SFGSGG L  EQ  ELA+GLEL+  RFVW
Sbjct: 244 VGPLVRPAEPGLKHGVLDWLDLQPKESVVYVSFGSGGALTFEQTNELAYGLELTGHRFVW 303

Query: 309 VIRPPAGTDPMGAFFTVETESTEKTPTEYLPEGFLKRTEEVGLVVPMWGPQAEILRHRSV 368
           V+RPPA  DP  + F      TE  P ++LP GFL RT+++GLVV  W PQ EIL H+S 
Sbjct: 304 VVRPPAEDDPSASMFDKTKNETE--PLDFLPNGFLDRTKDIGLVVRTWAPQEEILAHKST 363

Query: 369 RGFVTHCGWNSSIESMVNGVAMVTWPLYAEQKMNAVMLTEEVGVAVR-GRAEGVVGRAEI 428
            GFVTHCGWNS +ES+VNGV MV WPLY+EQKMNA M++ E+ +A++   A+G+V +  I
Sbjct: 364 GGFVTHCGWNSVLESIVNGVPMVAWPLYSEQKMNARMVSGELKIALQINVADGIVKKEVI 423

Query: 429 ERWVRRIMVDEEGRGIRERVKAVKNSGEKAV 451
              V+R+M +EEG+ +R+ VK +K + E+A+
Sbjct: 424 AEMVKRVMDEEEGKEMRKNVKELKKTAEEAL 446

BLAST of CmoCh03G008050 vs. TAIR10
Match: AT2G18570.1 (AT2G18570.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 357.5 bits (916), Expect = 1.3e-98
Identity = 202/473 (42.71%), Postives = 287/473 (60.68%), Query Frame = 1

Query: 9   HVALLVSPGMGHLIPFLELADRLVLHHNLQTTLFVVNTDSSSAADALLLQKSSAVNIVPI 68
           H  L+ SPG+GHLIP LEL +RL    N+  T+  V + SSS  +   +  ++A  I  I
Sbjct: 5   HALLVASPGLGHLIPILELGNRLSSVLNIHVTILAVTSGSSSPTETEAIHAAAARTICQI 64

Query: 69  RHSSPSLD------PTVPVVDRIKAMMTASYPHVRSYIAAVEPRPVALIVDLFGTEAIAM 128
               PS+D      P   +  ++   M A  P VR  +  ++ +P  +IVD  GTE +++
Sbjct: 65  TEI-PSVDVDNLVEPDATIFTKMVVKMRAMKPAVRDAVKLMKRKPTVMIVDFLGTELMSV 124

Query: 129 AHELGMLG-FVFMPTSAWFLSLFFFFPCMDKQMLDAHAYNHEPLRIPGCSSVRFEDTIDA 188
           A ++GM   +V++PT AWFL++  + P +D  +   +    EPL+IPGC  V  ++ ++ 
Sbjct: 125 ADDVGMTAKYVYVPTHAWFLAVMVYLPVLDTVVEGEYVDIKEPLKIPGCKPVGPKELMET 184

Query: 189 LEVSLEEVHEELRRVARELGMADGILANTWQDLEPTTLKALNEAGTLSNGKVNKVPIYPI 248
           +     + ++E  R   E+ M+DG+L NTW++L+  TL AL E   LS  +V KVP+YPI
Sbjct: 185 MLDRSGQQYKECVRAGLEVPMSDGVLVNTWEELQGNTLAALREDEELS--RVMKVPVYPI 244

Query: 249 GPLARRGEP-NLESEVLSWLDRQPDESVIHISFGSGGTLCAEQITELAWGLELSQQRFVW 308
           GP+ R  +  +  + +  WLD Q + SV+ +  GSGGTL  EQ  ELA GLELS QRFVW
Sbjct: 245 GPIVRTNQHVDKPNSIFEWLDEQRERSVVFVCLGSGGTLTFEQTVELALGLELSGQRFVW 304

Query: 309 VIRPPAGTDPMGAFFTVETESTEKTPTEYLPEGFLKRTEEVGLVVPMWGPQAEILRHRSV 368
           V+R PA    +GA       S ++  +  LPEGFL RT  VG+VV  W PQ EIL HRS+
Sbjct: 305 VLRRPASY--LGAI-----SSDDEQVSASLPEGFLDRTRGVGIVVTQWAPQVEILSHRSI 364

Query: 369 RGFVTHCGWNSSIESMVNGVAMVTWPLYAEQKMNAVMLTEEVGVAVRGR---AEGVVGRA 428
            GF++HCGW+S++ES+  GV ++ WPLYAEQ MNA +LTEE+GVAVR     +E V+GR 
Sbjct: 365 GGFLSHCGWSSALESLTKGVPIIAWPLYAEQWMNATLLTEEIGVAVRTSELPSERVIGRE 424

Query: 429 EIERWVRRIMV--DEEGRGIRERVKAVKNSGEKAVCKGGSSYNSLAHVASECH 469
           E+   VR+IM   DEEG+ IR + + V+ S E+A  K GSSYNSL   A  C+
Sbjct: 425 EVASLVRKIMAEEDEEGQKIRAKAEEVRVSSERAWSKDGSSYNSLFEWAKRCY 467

BLAST of CmoCh03G008050 vs. NCBI nr
Match: gi|700193601|gb|KGN48805.1| (hypothetical protein Csa_6G501940 [Cucumis sativus])

HSP 1 Score: 680.2 bits (1754), Expect = 2.5e-192
Identity = 350/475 (73.68%), Postives = 394/475 (82.95%), Query Frame = 1

Query: 5   EPKTHVALLVSPGMGHLIPFLELADRLVLHHNLQTTLFVVNTDSSSAADALLLQKSSAVN 64
           E KTHVALLVSPGMGHLIPFLELA+RLVLHHNLQ TLFVV T SSS A++ LLQK S VN
Sbjct: 5   ESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSS-AESTLLQKPSLVN 64

Query: 65  IVPIRHSSPSLDPTVPVVDRIKAMMTASYPHVRSYIAAVEPRPVALIVDLFGTEAIAMAH 124
           IV + HS  SLDP  P+ D I +MMTAS+P +RS IAAV PRP ALIVDLFGT A+++AH
Sbjct: 65  IVSLPHSLSSLDPNAPICDIIISMMTASFPFLRSSIAAVNPRPAALIVDLFGTPALSIAH 124

Query: 125 ELGMLGFVFMPTSAWFLSLFFFFPCMDKQMLDAHAYNHEPLRIPGCSSVRFEDTIDALEV 184
           ELGMLG VFM T+AW+LS+ + +P  +K M+DAH YNH+PL IPGC+ VRFEDTI+  E+
Sbjct: 125 ELGMLGLVFMTTNAWYLSVSYLYPSFEKSMVDAHVYNHDPLVIPGCTPVRFEDTIEVFEL 184

Query: 185 SLEEVHEELRRVARELGMADGILANTWQDLEPTTLKALNEAGTLSNGKVNKVPIYPIGPL 244
           + EEV+    R ARELG ADGIL+NTWQDLEPTTLKAL+EAGTL  GKVN+VPIYPIGPL
Sbjct: 185 NQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLGYGKVNEVPIYPIGPL 244

Query: 245 ARRGEPNLESEVLSWLDRQPDESVIHISFGSGGTLCAEQITELAWGLELSQQRFVWVIRP 304
            R GEP LESEVL WLDRQPDESVI++SFGSGGTLC EQITELAWGLELSQQRFVWVIRP
Sbjct: 245 TRNGEPTLESEVLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQRFVWVIRP 304

Query: 305 PAGTDPMGAFFTV-ETESTEKTPTEYLPEGFLKRTEEVGLVVPMWGPQAEILRHRSVRGF 364
           P GT+  GAFFT     S +   ++YLPEGF+KRT+EVGLV+PMWGPQAEIL HRSVRGF
Sbjct: 305 PEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEILSHRSVRGF 364

Query: 365 VTHCGWNSSIESMVNGVAMVTWPLYAEQKMNAVMLTEEVGVAVRGRAE--GVVGRAEIER 424
           VTHCGWNSS+ES+VNGVAMVTWPLYAEQKMNA +LTEE+GVAVR RAE  GVV R EIE+
Sbjct: 365 VTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVRLRAEGQGVVERKEIEK 424

Query: 425 WVRRIMVDEEGRGIRERVKAVKNSGEKAVCKGGSSYNSLAHVASEC--HRRRAEG 475
            VR IM  +EG GIRERVK +K SG KAV KGGSSYNSLA VASEC   RRR +G
Sbjct: 425 KVRMIMEGKEGEGIRERVKELKISGGKAVTKGGSSYNSLARVASECDIFRRRRDG 478

BLAST of CmoCh03G008050 vs. NCBI nr
Match: gi|778722457|ref|XP_004143577.2| (PREDICTED: UDP-glycosyltransferase 72E1, partial [Cucumis sativus])

HSP 1 Score: 667.9 bits (1722), Expect = 1.3e-188
Identity = 341/460 (74.13%), Postives = 384/460 (83.48%), Query Frame = 1

Query: 5   EPKTHVALLVSPGMGHLIPFLELADRLVLHHNLQTTLFVVNTDSSSAADALLLQKSSAVN 64
           E KTHVALLVSPGMGHLIPFLELA+RLVLHHNLQ TLFVV T SSS A++ LLQK S VN
Sbjct: 5   ESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSS-AESTLLQKPSLVN 64

Query: 65  IVPIRHSSPSLDPTVPVVDRIKAMMTASYPHVRSYIAAVEPRPVALIVDLFGTEAIAMAH 124
           IV + HS  SLDP  P+ D I +MMTAS+P +RS IAAV PRP ALIVDLFGT A+++AH
Sbjct: 65  IVSLPHSLSSLDPNAPICDIIISMMTASFPFLRSSIAAVNPRPAALIVDLFGTPALSIAH 124

Query: 125 ELGMLGFVFMPTSAWFLSLFFFFPCMDKQMLDAHAYNHEPLRIPGCSSVRFEDTIDALEV 184
           ELGMLG VFM T+AW+LS+ + +P  +K M+DAH YNH+PL IPGC+ VRFEDTI+  E+
Sbjct: 125 ELGMLGLVFMTTNAWYLSVSYLYPSFEKSMVDAHVYNHDPLVIPGCTPVRFEDTIEVFEL 184

Query: 185 SLEEVHEELRRVARELGMADGILANTWQDLEPTTLKALNEAGTLSNGKVNKVPIYPIGPL 244
           + EEV+    R ARELG ADGIL+NTWQDLEPTTLKAL+EAGTL  GKVN+VPIYPIGPL
Sbjct: 185 NQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLGYGKVNEVPIYPIGPL 244

Query: 245 ARRGEPNLESEVLSWLDRQPDESVIHISFGSGGTLCAEQITELAWGLELSQQRFVWVIRP 304
            R GEP LESEVL WLDRQPDESVI++SFGSGGTLC EQITELAWGLELSQQRFVWVIRP
Sbjct: 245 TRNGEPTLESEVLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQRFVWVIRP 304

Query: 305 PAGTDPMGAFFTV-ETESTEKTPTEYLPEGFLKRTEEVGLVVPMWGPQAEILRHRSVRGF 364
           P GT+  GAFFT     S +   ++YLPEGF+KRT+EVGLV+PMWGPQAEIL HRSVRGF
Sbjct: 305 PEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEILSHRSVRGF 364

Query: 365 VTHCGWNSSIESMVNGVAMVTWPLYAEQKMNAVMLTEEVGVAVRGRAE--GVVGRAEIER 424
           VTHCGWNSS+ES+VNGVAMVTWPLYAEQKMNA +LTEE+GVAVR RAE  GVV R EIE+
Sbjct: 365 VTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVRLRAEGQGVVERKEIEK 424

Query: 425 WVRRIMVDEEGRGIRERVKAVKNSGEKAVCKGGSSYNSLA 462
            VR IM  +EG GIRERVK +K SG KAV KGGSSYNSLA
Sbjct: 425 KVRMIMEGKEGEGIRERVKELKISGGKAVTKGGSSYNSLA 463

BLAST of CmoCh03G008050 vs. NCBI nr
Match: gi|659081042|ref|XP_008441118.1| (PREDICTED: UDP-glycosyltransferase 72E1 [Cucumis melo])

HSP 1 Score: 648.7 bits (1672), Expect = 8.1e-183
Identity = 335/462 (72.51%), Postives = 379/462 (82.03%), Query Frame = 1

Query: 18  MGHLIPFLELADRLVLHHNLQTTLFVVNTDSSSAADALLLQKSSAVNIVPIRHSSPSLDP 77
           MGHLIPFLELA+RLVLHHNLQ TLFVV T SSSA  ALL QK S VNI+P+ H+S SLD 
Sbjct: 1   MGHLIPFLELANRLVLHHNLQATLFVVGTGSSSAEFALL-QKPSPVNIIPLPHASSSLDS 60

Query: 78  TVPVVDRIKAMMTASYPHVRSYIAAVEPRPVALIVDLFGTEAIAMAHELGMLGFVFMPTS 137
             P+ + I +MMTAS+P +RS IAA  PRP ALIVDLFGT A+++AHELGMLGFVFM TS
Sbjct: 61  NAPIFNIISSMMTASFPFLRSSIAAANPRPAALIVDLFGTPALSIAHELGMLGFVFMTTS 120

Query: 138 AWFLSLFFFFPCMDKQMLDAHAYNHEPLRIPGCSSVRFEDTIDALEVSLEEVHEELRRVA 197
           AWFLSLF F+P  DK M+DAH  NH+PL IPGC+ VRFE+TI+  E++ +EV+      A
Sbjct: 121 AWFLSLFVFYPSFDKSMVDAHVDNHDPLVIPGCTPVRFENTIEVFELNQKEVYVGFGSFA 180

Query: 198 RELGMADGILANTWQDLEPTTLKALNEAGTLSNGKVNKVPIYPIGPLARRGEPNLESEVL 257
            ELG ADGIL+NTWQDLEPTTLKAL+EAGTL  GKVN+VPI+PIGPL   G+P LESEVL
Sbjct: 181 SELGTADGILSNTWQDLEPTTLKALSEAGTLGKGKVNEVPIFPIGPLTSNGDPTLESEVL 240

Query: 258 SWLDRQPDESVIHISFGSGGTLCAEQITELAWGLELSQQRFVWVIRPPAGTDPMGAFFTV 317
            WLDRQPDESVI++SFGSGGTL  EQITELAWGLE+SQQRFVWVIRPPAGT+ MG FFT 
Sbjct: 241 KWLDRQPDESVIYVSFGSGGTLREEQITELAWGLEMSQQRFVWVIRPPAGTESMGTFFTA 300

Query: 318 -ETESTEKTPTEYLPEGFLKRTEEVGLVVPMWGPQAEILRHRSVRGFVTHCGWNSSIESM 377
               S +   +E+LPEGF+KRT+EVGLV+PMWGPQAEIL HRSVRGFVTHCGWNSS+ES+
Sbjct: 301 GRGSSGDDWASEFLPEGFIKRTKEVGLVIPMWGPQAEILSHRSVRGFVTHCGWNSSLESI 360

Query: 378 VNGVAMVTWPLYAEQKMNAVMLTEEVGVAVRGRAE--GVVGRAEIERWVRRIMVDEEGRG 437
           VNGVAMVTWPLYAEQKMNA +LTEEVGVAVR RAE  G+V R EIE  VR IM  +EG G
Sbjct: 361 VNGVAMVTWPLYAEQKMNAAVLTEEVGVAVRVRAEGDGLVKRKEIENKVRMIMEGKEGGG 420

Query: 438 IRERVKAVKNSGEKAVCKGGSSYNSLAHVASEC--HRRRAEG 475
           IRERVK +K SGEKAV KGGSSYNSLA VASEC   RRR +G
Sbjct: 421 IRERVKGLKISGEKAVTKGGSSYNSLARVASECDIFRRRRDG 461

BLAST of CmoCh03G008050 vs. NCBI nr
Match: gi|224094711|ref|XP_002310203.1| (UDP-glucoronosyl/UDP-glucosyl transferase family protein [Populus trichocarpa])

HSP 1 Score: 499.6 bits (1285), Expect = 6.1e-138
Identity = 263/468 (56.20%), Postives = 338/468 (72.22%), Query Frame = 1

Query: 7   KTHVALLVSPGMGHLIPFLELADRLVLHHNLQTTLFVVNTDSSSAADALLLQKSSAVNIV 66
           K H ALL SPGMGHLIP LEL  RLV +H    TLFVV TD+S+   + L +    +NI+
Sbjct: 5   KPHAALLASPGMGHLIPVLELGKRLVTYHGFHVTLFVVATDASTT-QSRLKEPYPNINII 64

Query: 67  --PIRHSSPSLDPTVPVVDRIKAMMTASYPHVRSYIAAVEPRPVALIVDLFGTEAIAMAH 126
             P+   S  +DP   VV ++  MM  + P +RS I A++  P ALIVDLFGTEA A+A 
Sbjct: 65  TLPLVDISGLIDPAATVVTKLAVMMRETLPSLRSAILALKSPPTALIVDLFGTEAFAVAE 124

Query: 127 ELGMLGFVFMPTSAWFLSLFFFFPCMDKQMLDAHAYNHEPLRIPGCSSVRFEDTIDALEV 186
           E  ML +VF  ++AWF ++  +FP +D+ + D H    +PLRIPGC SVRFEDT+ A   
Sbjct: 125 EFNMLKYVFDTSNAWFFAITIYFPTIDRNLEDKHVIQKQPLRIPGCKSVRFEDTLGAYLD 184

Query: 187 SLEEVHEELRRVARELGMADGILANTWQDLEPTTLKALNEAGTLSNGKVNKVPIYPIGPL 246
             ++++ E +R+  E+ MADGIL NTW+DLEPTTL AL +   L  G+V K P+YPIGPL
Sbjct: 185 RNDQMYIEYKRIGIEMPMADGILMNTWEDLEPTTLGALRDFQML--GRVAKAPVYPIGPL 244

Query: 247 ARRGEPNL-ESEVLSWLDRQPDESVIHISFGSGGTLCAEQITELAWGLELSQQRFVWVIR 306
           AR   P++  ++VL+WLD QP+ESVI++SFGSGGTL  EQ+ ELAWGLELS+QRFVWV+R
Sbjct: 245 ARPVGPSVPRNQVLNWLDNQPNESVIYVSFGSGGTLSTEQMAELAWGLELSKQRFVWVVR 304

Query: 307 PPAGTDPMGAFFTVETESTEKTPTEYLPEGFLKRTEEVGLVVPMWGPQAEILRHRSVRGF 366
           PP   D  GAFF ++ + +E  P+ +LPEGFL RT EVGLVVP+W PQ EIL H SV GF
Sbjct: 305 PPIDNDAAGAFFNLD-DGSEGIPS-FLPEGFLARTREVGLVVPLWAPQVEILAHPSVGGF 364

Query: 367 VTHCGWNSSIESMVNGVAMVTWPLYAEQKMNAVMLTEEVGVAVRGR---AEGVVGRAEIE 426
           ++HCGWNS++ES+ NGV M+ WPLYAEQKMNA +LTEE+GVAV+ +   +E VV RAEIE
Sbjct: 365 LSHCGWNSTLESITNGVPMIAWPLYAEQKMNATILTEELGVAVQPKTLASERVVVRAEIE 424

Query: 427 RWVRRIMVDEEGRGIRERVKAVKNSGEKAV-CKGGSSYNSLAHVASEC 468
             VR+IM DEEG GIR+RV  +K+SGEKA+  KGGSSYNSL+ +A +C
Sbjct: 425 MMVRKIMEDEEGFGIRKRVNELKHSGEKALSSKGGSSYNSLSQIAKQC 467

BLAST of CmoCh03G008050 vs. NCBI nr
Match: gi|302777000|gb|ADL67595.1| (glycosyltransferase 1 [Populus tomentosa])

HSP 1 Score: 485.7 bits (1249), Expect = 9.1e-134
Identity = 258/468 (55.13%), Postives = 332/468 (70.94%), Query Frame = 1

Query: 7   KTHVALLVSPGMGHLIPFLELADRLVLHHNLQTTLFVVNTDSSSAADALLLQKSSAVNIV 66
           K H ALL SPGMGHLIP LEL  RLV +H    T FVV TD+S+   +LL +    +NI+
Sbjct: 5   KPHAALLASPGMGHLIPVLELCKRLVTYHGFHVTFFVVATDASTT-QSLLKEPYPNINII 64

Query: 67  --PIRHSSPSLDPTVPVVDRIKAMMTASYPHVRSYIAAVEPRPVALIVDLFGTEAIAMAH 126
             P+   S  +DP   VV ++  MM  + P +RS I A++  P ALIVDLFGTEA A+A 
Sbjct: 65  TLPLVDISGLIDPAATVVTKLAVMMRETLPSLRSAILALKSPPTALIVDLFGTEAFAVAE 124

Query: 127 ELGMLGFVFMPTSAWFLSLFFFFPCMDKQMLDAHAYNHEPLRIPGCSSVRFEDTIDALEV 186
           E  ML +VF  ++AWF ++  + P +D+ + D H    +PLRIPGC SVRFEDT+ A   
Sbjct: 125 EFNMLKYVFDTSNAWFFAITIYVPTIDRNLEDRHIIQKQPLRIPGCKSVRFEDTLQAYLD 184

Query: 187 SLEEVHEELRRVARELGMADGILANTWQDLEPTTLKALNEAGTLSNGKVNKVPIYPIGPL 246
             ++ + E +R+  E+ MADGIL NTW+DLEPTTL AL +   L  G+V + P+YPIGPL
Sbjct: 185 RNDQTYIEYKRIGIEMPMADGILMNTWEDLEPTTLGALRDFQML--GRVAQSPVYPIGPL 244

Query: 247 ARRGEPNL-ESEVLSWLDRQPDESVIHISFGSGGTLCAEQITELAWGLELSQQRFVWVIR 306
           AR   P +  ++VL WLD QP ESVI++SFGSGGTL +EQ+ ELAWGLELS+QRFVWV+R
Sbjct: 245 ARPVGPLIPRNQVLKWLDNQPYESVIYVSFGSGGTLSSEQMAELAWGLELSKQRFVWVVR 304

Query: 307 PPAGTDPMGAFFTVETESTEKTPTEYLPEGFLKRTEEVGLVVPMWGPQAEILRHRSVRGF 366
           P    D  GAFF ++ + +E  P+ +LPEGFL RT E+GL VPMW PQ EIL H SV GF
Sbjct: 305 PSIDNDADGAFFNLD-DGSEGIPS-FLPEGFLDRTREMGLAVPMWAPQVEILAHPSVGGF 364

Query: 367 VTHCGWNSSIESMVNGVAMVTWPLYAEQKMNAVMLTEEVGVAVRGR---AEGVVGRAEIE 426
           ++HCGWNS++ES+ NGV ++ WPLYAEQKMNA +LTEE+GVAV+ +   +E VV RAEIE
Sbjct: 365 LSHCGWNSTLESITNGVPLIAWPLYAEQKMNATILTEELGVAVQPKTLASERVVVRAEIE 424

Query: 427 RWVRRIMVDEEGRGIRERVKAVKNSGEKAV-CKGGSSYNSLAHVASEC 468
             VR+IM DEEG GIR+RV  +K+SGEKA+  KGGSSYNSL+ +A +C
Sbjct: 425 MMVRKIMEDEEGFGIRKRVNELKHSGEKALSSKGGSSYNSLSQIAKQC 467

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
U72E2_ARATH4.5e-11747.56UDP-glycosyltransferase 72E2 OS=Arabidopsis thaliana GN=UGT72E2 PE=1 SV=1[more]
UFOG5_MANES2.2e-11647.65Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta GN=GT5 PE=2 SV=1[more]
U72E1_ARATH2.9e-11646.38UDP-glycosyltransferase 72E1 OS=Arabidopsis thaliana GN=UGT72E1 PE=1 SV=1[more]
U72E3_ARATH1.6e-11444.99UDP-glycosyltransferase 72E3 OS=Arabidopsis thaliana GN=UGT72E3 PE=1 SV=1[more]
U72C1_ARATH1.6e-9844.35UDP-glycosyltransferase 72C1 OS=Arabidopsis thaliana GN=UGT72C1 PE=2 SV=3[more]
Match NameE-valueIdentityDescription
A0A0A0KGE4_CUCSA1.7e-19273.68Uncharacterized protein OS=Cucumis sativus GN=Csa_6G501940 PE=4 SV=1[more]
B9HEN9_POPTR4.2e-13856.20Glycosyltransferase OS=Populus trichocarpa GN=POPTR_0007s12400g PE=3 SV=1[more]
E1ANG8_POPTO6.3e-13455.13Glycosyltransferase OS=Populus tomentosa PE=3 SV=1[more]
A0A061DGH3_THECC1.5e-13053.53Glycosyltransferase OS=Theobroma cacao GN=TCM_000444 PE=3 SV=1[more]
K7LND6_SOYBN4.2e-13053.00Glycosyltransferase OS=Glycine max GN=GLYMA_11G064400 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT5G66690.12.5e-11847.56 UDP-Glycosyltransferase superfamily protein[more]
AT3G50740.11.7e-11746.38 UDP-glucosyl transferase 72E1[more]
AT5G26310.19.1e-11644.99 UDP-Glycosyltransferase superfamily protein[more]
AT4G36770.19.1e-10044.35 UDP-Glycosyltransferase superfamily protein[more]
AT2G18570.11.3e-9842.71 UDP-Glycosyltransferase superfamily protein[more]
Match NameE-valueIdentityDescription
gi|700193601|gb|KGN48805.1|2.5e-19273.68hypothetical protein Csa_6G501940 [Cucumis sativus][more]
gi|778722457|ref|XP_004143577.2|1.3e-18874.13PREDICTED: UDP-glycosyltransferase 72E1, partial [Cucumis sativus][more]
gi|659081042|ref|XP_008441118.1|8.1e-18372.51PREDICTED: UDP-glycosyltransferase 72E1 [Cucumis melo][more]
gi|224094711|ref|XP_002310203.1|6.1e-13856.20UDP-glucoronosyl/UDP-glucosyl transferase family protein [Populus trichocarpa][more]
gi|302777000|gb|ADL67595.1|9.1e-13455.13glycosyltransferase 1 [Populus tomentosa][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002213UDP_glucos_trans
Vocabulary: Molecular Function
TermDefinition
GO:0016758transferase activity, transferring hexosyl groups
Vocabulary: Biological Process
TermDefinition
GO:0008152metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0016758 transferase activity, transferring hexosyl groups
molecular_function GO:0080043 quercetin 3-O-glucosyltransferase activity
molecular_function GO:0080044 quercetin 7-O-glucosyltransferase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh03G008050.1CmoCh03G008050.1mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePANTHERPTHR11926GLUCOSYL/GLUCURONOSYL TRANSFERASEScoord: 5..469
score: 1.7E
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePFAMPF00201UDPGTcoord: 255..406
score: 5.2
NoneNo IPR availableunknownCoilCoilcoord: 179..199
scor
NoneNo IPR availableGENE3DG3DSA:3.40.50.2000coord: 253..442
score: 3.
NoneNo IPR availablePANTHERPTHR11926:SF223UDP-GLYCOSYLTRANSFERASE 72E1-RELATEDcoord: 5..469
score: 1.7E
NoneNo IPR availableunknownSSF53756UDP-Glycosyltransferase/glycogen phosphorylasecoord: 8..466
score: 1.53E

The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
CmoCh03G008050CmoCh04G008450Cucurbita moschata (Rifu)cmocmoB451
CmoCh03G008050CmoCh07G005050Cucurbita moschata (Rifu)cmocmoB455