CmoCh14G021600 (gene) Cucurbita moschata (Rifu)

Name: CmoCh14G021600
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: UDP-Glycosyltransferase superfamily protein
Location: Cmo_Chr14 : 15526459 .. 15527880 (+)
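
The location above can be turned into a feature length with a few lines of Python. This is only a sketch: it assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), and the helper names `parse_location` and `feature_length` are made up for this illustration.

```python
import re

def parse_location(text):
    """Split a location such as 'Cmo_Chr14 : 15526459 .. 15527880 (+)' into parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def feature_length(start, end):
    # Assumption: both coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

seqid, start, end, strand = parse_location("Cmo_Chr14 : 15526459 .. 15527880 (+)")
length = feature_length(start, end)
print(seqid, strand, length, length % 3)
```

Under that assumption the feature spans 1422 bp, which is a multiple of three (474 codons including the stop codon).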
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAGCTCCACCGTCCATCCCTCATCTGGCTCTTTTACCAAGCCCTGGAATGGGCCATCTCATTCCACTCATCGAGTTCGCCAAACGCCTTCTCTCCCACCACCGTTTAACCATCACATTCGTAATCCCTTCCGATGGTCCGCCGTCCACGCCTCAAAAGGCCGTCCTCGATTCTCTCCCTGCCGGCATCGATTACCTCTTCCTCCCGCCGGTTAGCTTCGACGATCTTCCCGATGCCTCCAAAATTGAAACCATCATTACTCTCACCATTTCTCGCTCTCTCCCTTCCCTTCGGAACGTACTCAAATCCATGGTCGCTAATTCCAACCTCGTTGGCTTGGTCGTCGATCTTTTCGGCACCGACGCCTTCGATTTAGCCAAAGAATTCAACATTTCCTCTTATATGTTCTTCCCCTCCACCGCCATGTTTCTTTCCTTTGCTCTGTTTTTACCCAAACTCGATGAATCCGTCGCCGGCGAGTTCCGCGACCTCCCCGAGCCAATCAGAATCCCGGGATGTATTCCGATTCAAGGCAAAGATCTGTTGGACCCAGTTCAAGATCGGAAGAACGAAGCCTACAAATGGACGCTTCACAACGCGAAGAGGTATGCTTTAGCAGATGGGATTTTTCTCAACAGCTTCCTGGAATTGGAGCCCGGAGCAGTAAATTATCTGCAAGAAGAGGAAGCAGGGAAGCCCCCTGTTTACCCAATTGGACCATTGGTGAAAATCGACGCAAATGAGAAGGAAGAAAGGGCAGAGTGTCTGAAGTGGCTGGACGAACAGCCACATGGGTCTGTTCTGTTTGTGTCATTTGGAAGCGGTGGCACTCTGTCGAGCCATCAAATTAACGAATTGGCATCGGGATTGGAAACGAGTGGACAGAGATTCATATGGGTTGTTAGAAGTCCAAGCGATAAAGCCGCAAACGCCACCTATTTCAGCGTCCAAAGTCAGAGGGATCCATTGGATTTCTTGCCAGAAGGGTTTGTAGAGAGGACCAAAAACAGAGGCCTGGTGGTGCCGTCGTGGGCGCCGCAGGCTCAGATCCTCAGCCACAGCTCCACCGGCGGGTTTTTGACACACTGTGGATGGAATTCAACTTTAGAGAGCGTGGTTAATGGGGTTCCTCTGATTGCTTGGCCGCTGTACGCAGAACAGAGGATGAACGCAGTGATGCTAACAGAGGAGATTAAAGTGGCGTTGAGGCTGAAAACAAATGAGAAAAATGGGATTGTGGAGAAGGAAGAGATTTCGAAAGTGGTGAGATCGGTGATGGAAGGGGAAGAGGGGAAGAAACTGCGTCGGAAAATGAAAGATTTGAAGGAAGCAGCTGAAAGAGTTGTTGGAGAAGATGGATCATCCACCAAAATAGTGAGTGATGTGGTAAACACTTGGAAGGCCAGAATTTCCACGTAG

mRNA sequence

ATGGAAGCTCCACCGTCCATCCCTCATCTGGCTCTTTTACCAAGCCCTGGAATGGGCCATCTCATTCCACTCATCGAGTTCGCCAAACGCCTTCTCTCCCACCACCGTTTAACCATCACATTCGTAATCCCTTCCGATGGTCCGCCGTCCACGCCTCAAAAGGCCGTCCTCGATTCTCTCCCTGCCGGCATCGATTACCTCTTCCTCCCGCCGGTTAGCTTCGACGATCTTCCCGATGCCTCCAAAATTGAAACCATCATTACTCTCACCATTTCTCGCTCTCTCCCTTCCCTTCGGAACGTACTCAAATCCATGGTCGCTAATTCCAACCTCGTTGGCTTGGTCGTCGATCTTTTCGGCACCGACGCCTTCGATTTAGCCAAAGAATTCAACATTTCCTCTTATATGTTCTTCCCCTCCACCGCCATGTTTCTTTCCTTTGCTCTGTTTTTACCCAAACTCGATGAATCCGTCGCCGGCGAGTTCCGCGACCTCCCCGAGCCAATCAGAATCCCGGGATGTATTCCGATTCAAGGCAAAGATCTGTTGGACCCAGTTCAAGATCGGAAGAACGAAGCCTACAAATGGACGCTTCACAACGCGAAGAGGTATGCTTTAGCAGATGGGATTTTTCTCAACAGCTTCCTGGAATTGGAGCCCGGAGCAGTAAATTATCTGCAAGAAGAGGAAGCAGGGAAGCCCCCTGTTTACCCAATTGGACCATTGGTGAAAATCGACGCAAATGAGAAGGAAGAAAGGGCAGAGTGTCTGAAGTGGCTGGACGAACAGCCACATGGGTCTGTTCTGTTTGTGTCATTTGGAAGCGGTGGCACTCTGTCGAGCCATCAAATTAACGAATTGGCATCGGGATTGGAAACGAGTGGACAGAGATTCATATGGGTTGTTAGAAGTCCAAGCGATAAAGCCGCAAACGCCACCTATTTCAGCGTCCAAAGTCAGAGGGATCCATTGGATTTCTTGCCAGAAGGGTTTGTAGAGAGGACCAAAAACAGAGGCCTGGTGGTGCCGTCGTGGGCGCCGCAGGCTCAGATCCTCAGCCACAGCTCCACCGGCGGGTTTTTGACACACTGTGGATGGAATTCAACTTTAGAGAGCGTGGTTAATGGGGTTCCTCTGATTGCTTGGCCGCTGTACGCAGAACAGAGGATGAACGCAGTGATGCTAACAGAGGAGATTAAAGTGGCGTTGAGGCTGAAAACAAATGAGAAAAATGGGATTGTGGAGAAGGAAGAGATTTCGAAAGTGGTGAGATCGGTGATGGAAGGGGAAGAGGGGAAGAAACTGCGTCGGAAAATGAAAGATTTGAAGGAAGCAGCTGAAAGAGTTGTTGGAGAAGATGGATCATCCACCAAAATAGTGAGTGATGTGGTAAACACTTGGAAGGCCAGAATTTCCACGTAG

Coding sequence (CDS)

ATGGAAGCTCCACCGTCCATCCCTCATCTGGCTCTTTTACCAAGCCCTGGAATGGGCCATCTCATTCCACTCATCGAGTTCGCCAAACGCCTTCTCTCCCACCACCGTTTAACCATCACATTCGTAATCCCTTCCGATGGTCCGCCGTCCACGCCTCAAAAGGCCGTCCTCGATTCTCTCCCTGCCGGCATCGATTACCTCTTCCTCCCGCCGGTTAGCTTCGACGATCTTCCCGATGCCTCCAAAATTGAAACCATCATTACTCTCACCATTTCTCGCTCTCTCCCTTCCCTTCGGAACGTACTCAAATCCATGGTCGCTAATTCCAACCTCGTTGGCTTGGTCGTCGATCTTTTCGGCACCGACGCCTTCGATTTAGCCAAAGAATTCAACATTTCCTCTTATATGTTCTTCCCCTCCACCGCCATGTTTCTTTCCTTTGCTCTGTTTTTACCCAAACTCGATGAATCCGTCGCCGGCGAGTTCCGCGACCTCCCCGAGCCAATCAGAATCCCGGGATGTATTCCGATTCAAGGCAAAGATCTGTTGGACCCAGTTCAAGATCGGAAGAACGAAGCCTACAAATGGACGCTTCACAACGCGAAGAGGTATGCTTTAGCAGATGGGATTTTTCTCAACAGCTTCCTGGAATTGGAGCCCGGAGCAGTAAATTATCTGCAAGAAGAGGAAGCAGGGAAGCCCCCTGTTTACCCAATTGGACCATTGGTGAAAATCGACGCAAATGAGAAGGAAGAAAGGGCAGAGTGTCTGAAGTGGCTGGACGAACAGCCACATGGGTCTGTTCTGTTTGTGTCATTTGGAAGCGGTGGCACTCTGTCGAGCCATCAAATTAACGAATTGGCATCGGGATTGGAAACGAGTGGACAGAGATTCATATGGGTTGTTAGAAGTCCAAGCGATAAAGCCGCAAACGCCACCTATTTCAGCGTCCAAAGTCAGAGGGATCCATTGGATTTCTTGCCAGAAGGGTTTGTAGAGAGGACCAAAAACAGAGGCCTGGTGGTGCCGTCGTGGGCGCCGCAGGCTCAGATCCTCAGCCACAGCTCCACCGGCGGGTTTTTGACACACTGTGGATGGAATTCAACTTTAGAGAGCGTGGTTAATGGGGTTCCTCTGATTGCTTGGCCGCTGTACGCAGAACAGAGGATGAACGCAGTGATGCTAACAGAGGAGATTAAAGTGGCGTTGAGGCTGAAAACAAATGAGAAAAATGGGATTGTGGAGAAGGAAGAGATTTCGAAAGTGGTGAGATCGGTGATGGAAGGGGAAGAGGGGAAGAAACTGCGTCGGAAAATGAAAGATTTGAAGGAAGCAGCTGAAAGAGTTGTTGGAGAAGATGGATCATCCACCAAAATAGTGAGTGATGTGGTAAACACTTGGAAGGCCAGAATTTCCACGTAG
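
As a quick sanity check, the CDS can be translated with the standard genetic code and compared with the query residues shown in the alignments. The sketch below is an illustration only: the function name `translate` is hypothetical, the codon table is built from the standard code in TCAG order, and only the first 30 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, listed with bases in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a nucleotide string in frame 1; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )

# First 30 nucleotides of the CDS shown above.
prefix = "ATGGAAGCTCCACCGTCCATCCCTCATCTG"
print(translate(prefix))  # MEAPPSIPHL
```

The translated prefix agrees with the start of the query sequence in the full-length alignments (MEAPPSIPHL...).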
BLAST of CmoCh14G021600 vs. Swiss-Prot
Match: HQGT_RAUSE (Hydroquinone glucosyltransferase OS=Rauvolfia serpentina GN=AS PE=1 SV=1)

HSP 1 Score: 629.8 bits (1623), Expect = 2.4e-179
Identity = 302/466 (64.81%), Positives = 378/466 (81.12%), Query Frame = 1

Query: 8   PHLALLPSPGMGHLIPLIEFAKRLLSHHRLTITFVIPSDGPPSTPQKAVLDSLPAGIDYL 67
           PH+A++P+PGMGHLIPL+EFAKRL+  H   +TF+IP+DGP    QK+ LD+LPAG++Y+
Sbjct: 5   PHIAMVPTPGMGHLIPLVEFAKRLVLRHNFGVTFIIPTDGPLPKAQKSFLDALPAGVNYV 64

Query: 68  FLPPVSFDDLPDASKIETIITLTISRSLPSLRNVLKSMVANSNLVGLVVDLFGTDAFDLA 127
            LPPVSFDDLP   +IET I LTI+RSLP +R+ +K+++A + L  LVVDLFGTDAFD+A
Sbjct: 65  LLPPVSFDDLPADVRIETRICLTITRSLPFVRDAVKTLLATTKLAALVVDLFGTDAFDVA 124

Query: 128 KEFNISSYMFFPSTAMFLSFALFLPKLDESVAGEFRDLPEPIRIPGCIPIQGKDLLDPVQ 187
            EF +S Y+F+P+TAM LS    LPKLD+ V+ E+RD+PEP++IPGCIPI GKD LDP Q
Sbjct: 125 IEFKVSPYIFYPTTAMCLSLFFHLPKLDQMVSCEYRDVPEPLQIPGCIPIHGKDFLDPAQ 184

Query: 188 DRKNEAYKWTLHNAKRYALADGIFLNSFLELEPGAVNYLQEEEAGKPPVYPIGPLVKIDA 247
           DRKN+AYK  LH AKRY LA+GI +N+F +LEPG +  LQEE+ GKPPVYPIGPL++ D+
Sbjct: 185 DRKNDAYKCLLHQAKRYRLAEGIMVNTFNDLEPGPLKALQEEDQGKPPVYPIGPLIRADS 244

Query: 248 NEKEERAECLKWLDEQPHGSVLFVSFGSGGTLSSHQINELASGLETSGQRFIWVVRSPSD 307
           + K +  ECLKWLD+QP GSVLF+SFGSGG +S +Q  ELA GLE S QRF+WVVRSP+D
Sbjct: 245 SSKVDDCECLKWLDDQPRGSVLFISFGSGGAVSHNQFIELALGLEMSEQRFLWVVRSPND 304

Query: 308 KAANATYFSVQSQRDPLDFLPEGFVERTKNRGLVVPSWAPQAQILSHSSTGGFLTHCGWN 367
           K ANATYFS+Q+Q D L +LPEGF+ERTK R L+VPSWAPQ +ILSH STGGFLTHCGWN
Sbjct: 305 KIANATYFSIQNQNDALAYLPEGFLERTKGRCLLVPSWAPQTEILSHGSTGGFLTHCGWN 364

Query: 368 STLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVALRLKTNEKNGIVEKEEISKVVRSVM 427
           S LESVVNGVPLIAWPLYAEQ+MNAVMLTE +KVALR K  E NG++ + EI+  V+ +M
Sbjct: 365 SILESVVNGVPLIAWPLYAEQKMNAVMLTEGLKVALRPKAGE-NGLIGRVEIANAVKGLM 424

Query: 428 EGEEGKKLRRKMKDLKEAAERVVGEDGSSTKIVSDVVNTWKARIST 474
           EGEEGKK R  MKDLK+AA R + +DGSSTK ++++   W+ +IS+
Sbjct: 425 EGEEGKKFRSTMKDLKDAASRALSDDGSSTKALAELACKWENKISS 469
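
The percentages in an HSP summary line follow directly from the counts. The sketch below recomputes them for the hit above; the regular expression and the function name `recompute_percentages` are assumptions made for this illustration, and the pattern accepts both the "Positives" and "Postives" spellings of the label.

```python
import re

# Matches lines such as:
# "Identity = 302/466 (64.81%), Positives = 378/466 (81.12%), Query Frame = 1"
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def recompute_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, aligned, positive, aligned_again = map(int, match.groups())
    return (
        round(100 * identical / aligned, 2),
        round(100 * positive / aligned_again, 2),
    )

line = "Identity = 302/466 (64.81%), Positives = 378/466 (81.12%), Query Frame = 1"
print(recompute_percentages(line))  # (64.81, 81.12)
```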

BLAST of CmoCh14G021600 vs. Swiss-Prot
Match: U72B1_ARATH (UDP-glycosyltransferase 72B1 OS=Arabidopsis thaliana GN=UGT72B1 PE=1 SV=1)

HSP 1 Score: 591.3 bits (1523), Expect = 9.6e-168
Identity = 291/465 (62.58%), Positives = 364/465 (78.28%), Query Frame = 1

Query: 8   PHLALLPSPGMGHLIPLIEFAKRLLSHHRLTITFVIPSDGPPSTPQKAVLDSLPAGIDYL 67
           PH+A++PSPGMGHLIPL+EFAKRL+  H LT+TFVI  +GPPS  Q+ VLDSLP+ I  +
Sbjct: 7   PHVAIIPSPGMGHLIPLVEFAKRLVHLHGLTVTFVIAGEGPPSKAQRTVLDSLPSSISSV 66

Query: 68  FLPPVSFDDLPDASKIETIITLTISRSLPSLRNVLKSMVANSNL-VGLVVDLFGTDAFDL 127
           FLPPV   DL  +++IE+ I+LT++RS P LR V  S V    L   LVVDLFGTDAFD+
Sbjct: 67  FLPPVDLTDLSSSTRIESRISLTVTRSNPELRKVFDSFVEGGRLPTALVVDLFGTDAFDV 126

Query: 128 AKEFNISSYMFFPSTAMFLSFALFLPKLDESVAGEFRDLPEPIRIPGCIPIQGKDLLDPV 187
           A EF++  Y+F+P+TA  LSF L LPKLDE+V+ EFR+L EP+ +PGC+P+ GKD LDP 
Sbjct: 127 AVEFHVPPYIFYPTTANVLSFFLHLPKLDETVSCEFRELTEPLMLPGCVPVAGKDFLDPA 186

Query: 188 QDRKNEAYKWTLHNAKRYALADGIFLNSFLELEPGAVNYLQEEEAGKPPVYPIGPLVKID 247
           QDRK++AYKW LHN KRY  A+GI +N+F ELEP A+  LQE    KPPVYP+GPLV I 
Sbjct: 187 QDRKDDAYKWLLHNTKRYKEAEGILVNTFFELEPNAIKALQEPGLDKPPVYPVGPLVNIG 246

Query: 248 ANE--KEERAECLKWLDEQPHGSVLFVSFGSGGTLSSHQINELASGLETSGQRFIWVVRS 307
             E  + E +ECLKWLD QP GSVL+VSFGSGGTL+  Q+NELA GL  S QRF+WV+RS
Sbjct: 247 KQEAKQTEESECLKWLDNQPLGSVLYVSFGSGGTLTCEQLNELALGLADSEQRFLWVIRS 306

Query: 308 PSDKAANATYFSVQSQRDPLDFLPEGFVERTKNRGLVVPSWAPQAQILSHSSTGGFLTHC 367
           PS   AN++YF   SQ DPL FLP GF+ERTK RG V+P WAPQAQ+L+H STGGFLTHC
Sbjct: 307 PSG-IANSSYFDSHSQTDPLTFLPPGFLERTKKRGFVIPFWAPQAQVLAHPSTGGFLTHC 366

Query: 368 GWNSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVALRLKTNEKNGIVEKEEISKVVR 427
           GWNSTLESVV+G+PLIAWPLYAEQ+MNAV+L+E+I+ ALR +  + +G+V +EE+++VV+
Sbjct: 367 GWNSTLESVVSGIPLIAWPLYAEQKMNAVLLSEDIRAALRPRAGD-DGLVRREEVARVVK 426

Query: 428 SVMEGEEGKKLRRKMKDLKEAAERVVGEDGSSTKIVSDVVNTWKA 470
            +MEGEEGK +R KMK+LKEAA RV+ +DG+STK +S V   WKA
Sbjct: 427 GLMEGEEGKGVRNKMKELKEAACRVLKDDGTSTKALSLVALKWKA 469

BLAST of CmoCh14G021600 vs. Swiss-Prot
Match: U72B3_ARATH (UDP-glycosyltransferase 72B3 OS=Arabidopsis thaliana GN=UGT72B3 PE=2 SV=1)

HSP 1 Score: 551.6 bits (1420), Expect = 8.4e-156
Identity = 270/470 (57.45%), Positives = 362/470 (77.02%), Query Frame = 1

Query: 3   APPSIPHLALLPSPGMGHLIPLIEFAKRLLSHHRLTITFVIPSDGPPSTPQKAVLDSLPA 62
           A  + PH+A++PSPG+GHLIPL+E AKRLL +H  T+TF+IP D PPS  Q++VL+SLP+
Sbjct: 2   ADGNTPHVAIIPSPGIGHLIPLVELAKRLLDNHGFTVTFIIPGDSPPSKAQRSVLNSLPS 61

Query: 63  GIDYLFLPPVSFDDLPDASKIETIITLTISRSLPSLRNVLKSMVANSNLVG-LVVDLFGT 122
            I  +FLPP    D+P  ++IET I+LT++RS P+LR +  S+ A   L   LVVDLFGT
Sbjct: 62  SIASVFLPPADLSDVPSTARIETRISLTVTRSNPALRELFGSLSAEKRLPAVLVVDLFGT 121

Query: 123 DAFDLAKEFNISSYMFFPSTAMFLSFALFLPKLDESVAGEFRDLPEPIRIPGCIPIQGKD 182
           DAFD+A EF++S Y+F+ S A  L+F L LPKLDE+V+ EFR+L EP+ IPGC+PI GKD
Sbjct: 122 DAFDVAAEFHVSPYIFYASNANVLTFLLHLPKLDETVSCEFRELTEPVIIPGCVPITGKD 181

Query: 183 LLDPVQDRKNEAYKWTLHNAKRYALADGIFLNSFLELEPGAVNYLQEEEAGKPPVYPIGP 242
            +DP QDRK+E+YKW LHN KR+  A+GI +NSF++LEP  +  +QE    KPPVY IGP
Sbjct: 182 FVDPCQDRKDESYKWLLHNVKRFKEAEGILVNSFVDLEPNTIKIVQEPAPDKPPVYLIGP 241

Query: 243 LVKIDANEKE--ERAECLKWLDEQPHGSVLFVSFGSGGTLSSHQINELASGLETSGQRFI 302
           LV   +++ +  +  +CL WLD QP GSVL+VSFGSGGTL+  Q  ELA GL  SG+RF+
Sbjct: 242 LVNSGSHDADVNDEYKCLNWLDNQPFGSVLYVSFGSGGTLTFEQFIELALGLAESGKRFL 301

Query: 303 WVVRSPSDKAANATYFSVQSQRDPLDFLPEGFVERTKNRGLVVPSWAPQAQILSHSSTGG 362
           WV+RSPS   A+++YF+ QS+ DP  FLP+GF++RTK +GLVV SWAPQAQIL+H+S GG
Sbjct: 302 WVIRSPSG-IASSSYFNPQSRNDPFSFLPQGFLDRTKEKGLVVGSWAPQAQILTHTSIGG 361

Query: 363 FLTHCGWNSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVALRLKTNEKNGIVEKEEI 422
           FLTHCGWNS+LES+VNGVPLIAWPLYAEQ+MNA++L  ++  ALR +  E +G+V +EE+
Sbjct: 362 FLTHCGWNSSLESIVNGVPLIAWPLYAEQKMNALLLV-DVGAALRARLGE-DGVVGREEV 421

Query: 423 SKVVRSVMEGEEGKKLRRKMKDLKEAAERVVGEDGSSTKIVSDVVNTWKA 470
           ++VV+ ++EGEEG  +R+KMK+LKE + RV+ +DG STK +++V   WKA
Sbjct: 422 ARVVKGLIEGEEGNAVRKKMKELKEGSVRVLRDDGFSTKSLNEVSLKWKA 468

BLAST of CmoCh14G021600 vs. Swiss-Prot
Match: U72B2_ARATH (UDP-glycosyltransferase 72B2 OS=Arabidopsis thaliana GN=UGT72B2 PE=2 SV=1)

HSP 1 Score: 527.3 bits (1357), Expect = 1.7e-148
Identity = 261/469 (55.65%), Positives = 345/469 (73.56%), Query Frame = 1

Query: 3   APPSIPHLALLPSPGMGHLIPLIEFAKRLLSHHRLTITFVIPSDGPPSTPQKAVLDSLPA 62
           A  + PH+A++PSPGMGHLIP +E AKRL+ H   T+T +I  +  PS  Q++VL+SLP+
Sbjct: 2   AEANTPHIAIMPSPGMGHLIPFVELAKRLVQHDCFTVTMIISGETSPSKAQRSVLNSLPS 61

Query: 63  GIDYLFLPPVSFDDLPDASKIETIITLTISRSLPSLRNVLKSMVANSNLVG-LVVDLFGT 122
            I  +FLPP    D+P  ++IET   LT++RS P+LR +  S+    +L   LVVD+FG 
Sbjct: 62  SIASVFLPPADLSDVPSTARIETRAMLTMTRSNPALRELFGSLSTKKSLPAVLVVDMFGA 121

Query: 123 DAFDLAKEFNISSYMFFPSTAMFLSFALFLPKLDESVAGEFRDLPEPIRIPGCIPIQGKD 182
           DAFD+A +F++S Y+F+ S A  LSF L LPKLD++V+ EFR L EP++IPGC+PI GKD
Sbjct: 122 DAFDVAVDFHVSPYIFYASNANVLSFFLHLPKLDKTVSCEFRYLTEPLKIPGCVPITGKD 181

Query: 183 LLDPVQDRKNEAYKWTLHNAKRYALADGIFLNSFLELEPGAVNYLQEEEAGKPPVYPIGP 242
            LD VQDR ++AYK  LHN KRY  A GI +NSF++LE  A+  LQE    KP VYPIGP
Sbjct: 182 FLDTVQDRNDDAYKLLLHNTKRYKEAKGILVNSFVDLESNAIKALQEPAPDKPTVYPIGP 241

Query: 243 LVKIDAN--EKEERAECLKWLDEQPHGSVLFVSFGSGGTLSSHQINELASGLETSGQRFI 302
           LV   ++    E++  CL WLD QP GSVL++SFGSGGTL+  Q NELA GL  SG+RFI
Sbjct: 242 LVNTSSSNVNLEDKFGCLSWLDNQPFGSVLYISFGSGGTLTCEQFNELAIGLAESGKRFI 301

Query: 303 WVVRSPSDKAANATYFSVQSQRDPLDFLPEGFVERTKNRGLVVPSWAPQAQILSHSSTGG 362
           WV+RSPS+   +++YF+  S+ DP  FLP GF++RTK +GLVVPSWAPQ QIL+H ST G
Sbjct: 302 WVIRSPSE-IVSSSYFNPHSETDPFSFLPIGFLDRTKEKGLVVPSWAPQVQILAHPSTCG 361

Query: 363 FLTHCGWNSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVALRLKTNEKNGIVEKEEI 422
           FLTHCGWNSTLES+VNGVPLIAWPL+AEQ+MN ++L E++  ALR+   E +GIV +EE+
Sbjct: 362 FLTHCGWNSTLESIVNGVPLIAWPLFAEQKMNTLLLVEDVGAALRIHAGE-DGIVRREEV 421

Query: 423 SKVVRSVMEGEEGKKLRRKMKDLKEAAERVVGEDGSSTKIVSDVVNTWK 469
            +VV+++MEGEEGK +  K+K+LKE   RV+G+DG S+K   +V+  WK
Sbjct: 422 VRVVKALMEGEEGKAIGNKVKELKEGVVRVLGDDGLSSKSFGEVLLKWK 468

BLAST of CmoCh14G021600 vs. Swiss-Prot
Match: UFOG5_MANES (Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta GN=GT5 PE=2 SV=1)

HSP 1 Score: 369.0 bits (946), Expect = 7.7e-101
Identity = 197/470 (41.91%), Positives = 291/470 (61.91%), Query Frame = 1

Query: 6   SIPHLALLPSPGMGHLIPLIEFAKRLLSHHRLTIT-FVIPSDGPPSTPQKAVLDSLPAGI 65
           S PH+ LL SPG+GHLIP++E  KR+++     +T F++ SD   + PQ       P   
Sbjct: 8   SKPHIVLLSSPGLGHLIPVLELGKRIVTLCNFDVTIFMVGSDTSAAEPQVLRSAMTPKLC 67

Query: 66  DYLFLPPVSFDDLPDASKIETIITLTISRSL-PSLRNVLKSMVANSNLVGLVVDLFGTDA 125
           + + LPP +   L D           + R + P+ R  + ++        ++VDLFGT++
Sbjct: 68  EIIQLPPPNISCLIDPEATVCTRLFVLMREIRPAFRAAVSALKFRP--AAIIVDLFGTES 127

Query: 126 FDLAKEFNISSYMFFPSTAMFLSFALFLPKLDESVAGEFRDLPEPIRIPGCIPIQGKDLL 185
            ++AKE  I+ Y++  S A FL+  +++P LD+ V GEF    EP++IPGC P++ ++++
Sbjct: 128 LEVAKELGIAKYVYIASNAWFLALTIYVPILDKEVEGEFVLQKEPMKIPGCRPVRTEEVV 187

Query: 186 DPVQDRKNEAYKWTLHNAKRYALADGIFLNSFLELEPGAVNYLQEEE----AGKPPVYPI 245
           DP+ DR N+ Y            ADGI +N++  LEP     L++ +      K PV+PI
Sbjct: 188 DPMLDRTNQQYSEYFRLGIEIPTADGILMNTWEALEPTTFGALRDVKFLGRVAKVPVFPI 247

Query: 246 GPLVKIDANEKEERAECLKWLDEQPHGSVLFVSFGSGGTLSSHQINELASGLETSGQRFI 305
           GPL +  A       E L WLD+QP  SV++VSFGSGGTLS  Q+ ELA GLE S QRFI
Sbjct: 248 GPLRR-QAGPCGSNCELLDWLDQQPKESVVYVSFGSGGTLSLEQMIELAWGLERSQQRFI 307

Query: 306 WVVRSPSDKAANATYFSVQSQRDPLD-FLPEGFVERTKNRGLVVPSWAPQAQILSHSSTG 365
           WVVR P+ K  +A +F+     D +  + PEGF+ R +N GLVVP W+PQ  I+SH S G
Sbjct: 308 WVVRQPTVKTGDAAFFTQGDGADDMSGYFPEGFLTRIQNVGLVVPQWSPQIHIMSHPSVG 367

Query: 366 GFLTHCGWNSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVALRLKTNEKNGIVEKEE 425
            FL+HCGWNS LES+  GVP+IAWP+YAEQRMNA +LTEE+ VA+R K      +V++EE
Sbjct: 368 VFLSHCGWNSVLESITAGVPIIAWPIYAEQRMNATLLTEELGVAVRPKNLPAKEVVKREE 427

Query: 426 ISKVVRSVMEGEEGKKLRRKMKDLKEAAERVVGEDGSSTKIVSDVVNTWK 469
           I +++R +M  EEG ++R+++++LK++ E+ + E GSS   +S + N W+
Sbjct: 428 IERMIRRIMVDEEGSEIRKRVRELKDSGEKALNEGGSSFNYMSALGNEWE 474

BLAST of CmoCh14G021600 vs. TrEMBL
Match: A0A0A0L3X8_CUCSA (Glycosyltransferase OS=Cucumis sativus GN=Csa_3G119730 PE=3 SV=1)

HSP 1 Score: 826.2 bits (2133), Expect = 2.0e-236
Identity = 409/473 (86.47%), Positives = 445/473 (94.08%), Query Frame = 1

Query: 1   MEAPPSIPHLALLPSPGMGHLIPLIEFAKRLLSHHRLTITFVIPSDGPPSTPQKAVLDSL 60
           MEA PSIPHLA+LPSPGMGHLIPLIEFAKRLLSHHRLT TF+I SDGPPS PQ+A+L+SL
Sbjct: 1   MEAHPSIPHLAILPSPGMGHLIPLIEFAKRLLSHHRLTFTFIIASDGPPSQPQQALLNSL 60

Query: 61  PAGIDYLFLPPVSFDDLPDASKIETIITLTISRSLPSLRNVLKSMVANSNLVGLVVDLFG 120
           P+GI +LFLP V+FDDLP  SKIETIITLTISRSLPSLRNVLKSMV+ SNLVGLVVDLFG
Sbjct: 61  PSGIHHLFLPAVTFDDLPPNSKIETIITLTISRSLPSLRNVLKSMVSQSNLVGLVVDLFG 120

Query: 121 TDAFDLAKEFNISSYMFFPSTAMFLSFALFLPKLDESVAGEFRDLPEPIRIPGCIPIQGK 180
           TD FD+A+EF+ISSY+FFPSTAMFLSFALFLPKLDES+ GEFRD PEPI+IPGCIPIQGK
Sbjct: 121 TDGFDIAREFDISSYIFFPSTAMFLSFALFLPKLDESIVGEFRDHPEPIKIPGCIPIQGK 180

Query: 181 DLLDPVQDRKNEAYKWTLHNAKRYALADGIFLNSFLELEPGAVNYLQEEEAGKPPVYPIG 240
           DLLDPVQDRKNEAYKWTLHNA+RYALADGIFLNSF ELEPGA+ YLQEEEAGKP VYPIG
Sbjct: 181 DLLDPVQDRKNEAYKWTLHNARRYALADGIFLNSFPELEPGAIKYLQEEEAGKPLVYPIG 240

Query: 241 PLVKIDANEKEERAECLKWLDEQPHGSVLFVSFGSGGTLSSHQINELASGLETSGQRFIW 300
           PLVKIDA+EKEERAECLKWLDEQPHGSVLFVSFGSGGTLSS QI+ELA GLE SGQRFIW
Sbjct: 241 PLVKIDADEKEERAECLKWLDEQPHGSVLFVSFGSGGTLSSAQIDELALGLEMSGQRFIW 300

Query: 301 VVRSPSDKAANATYFSVQSQRDPLDFLPEGFVERTKNRGLVVPSWAPQAQILSHSSTGGF 360
           VVRSPSDKAA+ATYFSV SQ DPLDFLPEGFVERTKNRG+VVPSWAPQAQILSH STGGF
Sbjct: 301 VVRSPSDKAADATYFSVHSQSDPLDFLPEGFVERTKNRGMVVPSWAPQAQILSHGSTGGF 360

Query: 361 LTHCGWNSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVALRLKTNEKNGIVEKEEIS 420
           LTHCGWNSTLESVVNG+PLIAWPLYAEQRMNAV+LTEEI VAL+ K N+  GIVEKEEIS
Sbjct: 361 LTHCGWNSTLESVVNGIPLIAWPLYAEQRMNAVILTEEINVALKPKRNDNKGIVEKEEIS 420

Query: 421 KVVRSVMEGEEGKKLRRKMKDLKEAAERVVGEDGSSTKIVSDVVNTWKARIST 474
           KVV+S++EGEEGKKLRRKMK+L+EA+++ VGEDGSSTKIV+D+VN WKA+IST
Sbjct: 421 KVVKSLLEGEEGKKLRRKMKELEEASKKAVGEDGSSTKIVTDLVNNWKAKIST 473

BLAST of CmoCh14G021600 vs. TrEMBL
Match: E5GCH8_CUCME (Glycosyltransferase OS=Cucumis melo subsp. melo PE=3 SV=1)

HSP 1 Score: 824.3 bits (2128), Expect = 7.5e-236
Identity = 410/473 (86.68%), Positives = 442/473 (93.45%), Query Frame = 1

Query: 1   MEAPPSIPHLALLPSPGMGHLIPLIEFAKRLLSHHRLTITFVIPSDGPPSTPQKAVLDSL 60
           MEA P IPHLA+LPSPGMGHLIPLIEFAKRLLSHHRLT TF+I SDGPPS PQ+A+L+SL
Sbjct: 1   MEAHPPIPHLAILPSPGMGHLIPLIEFAKRLLSHHRLTFTFIIASDGPPSQPQQALLNSL 60

Query: 61  PAGIDYLFLPPVSFDDLPDASKIETIITLTISRSLPSLRNVLKSMVANSNLVGLVVDLFG 120
           P+GID+LFLPP+SFDDLP  SKIETIITLTISRSLPSLRNVLKSMV  SNLVGLVVDLFG
Sbjct: 61  PSGIDHLFLPPLSFDDLPPDSKIETIITLTISRSLPSLRNVLKSMVPQSNLVGLVVDLFG 120

Query: 121 TDAFDLAKEFNISSYMFFPSTAMFLSFALFLPKLDESVAGEFRDLPEPIRIPGCIPIQGK 180
           TDAFD+A+EFNISSY+FFPSTAM LSFALFLPKLDESV GEFRD PEPI+IPGCI I+GK
Sbjct: 121 TDAFDVAREFNISSYIFFPSTAMLLSFALFLPKLDESVVGEFRDHPEPIKIPGCIAIEGK 180

Query: 181 DLLDPVQDRKNEAYKWTLHNAKRYALADGIFLNSFLELEPGAVNYLQEEEAGKPPVYPIG 240
           DLLDPVQDRKNEAYKWTLHNAKRYALADGIFLNSF ELEPGA+ YL+EEE GKP VYPIG
Sbjct: 181 DLLDPVQDRKNEAYKWTLHNAKRYALADGIFLNSFPELEPGAIKYLREEEPGKPLVYPIG 240

Query: 241 PLVKIDANEKEERAECLKWLDEQPHGSVLFVSFGSGGTLSSHQINELASGLETSGQRFIW 300
           PLVKIDA+EKEERAECLKWLDEQPHGSVLFVSFGSGGTL S QI+ELA GLE SGQRFIW
Sbjct: 241 PLVKIDADEKEERAECLKWLDEQPHGSVLFVSFGSGGTLKSAQIDELALGLEMSGQRFIW 300

Query: 301 VVRSPSDKAANATYFSVQSQRDPLDFLPEGFVERTKNRGLVVPSWAPQAQILSHSSTGGF 360
           VVRSPSDKAA+ATYFSV SQ DPL FLPEGF+ERTKNRG+VVPSWAPQAQILSH STGGF
Sbjct: 301 VVRSPSDKAADATYFSVHSQSDPLGFLPEGFLERTKNRGMVVPSWAPQAQILSHGSTGGF 360

Query: 361 LTHCGWNSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVALRLKTNEKNGIVEKEEIS 420
           LTHCGWNSTLESVVNG+PLIAWPLYAEQRMNAVMLTEEI VAL+ K NEK GIVEKEEIS
Sbjct: 361 LTHCGWNSTLESVVNGIPLIAWPLYAEQRMNAVMLTEEINVALKPKRNEKTGIVEKEEIS 420

Query: 421 KVVRSVMEGEEGKKLRRKMKDLKEAAERVVGEDGSSTKIVSDVVNTWKARIST 474
           KVV+S++EGEEGKKLRRKMK+LKEA+E+ VGEDGSSTKIV+++VN WKA+IST
Sbjct: 421 KVVKSLLEGEEGKKLRRKMKELKEASEKAVGEDGSSTKIVTNLVNNWKAKIST 473

BLAST of CmoCh14G021600 vs. TrEMBL
Match: K7NBR5_SIRGR (Glycosyltransferase OS=Siraitia grosvenorii GN=UDPG3 PE=2 SV=1)

HSP 1 Score: 684.1 bits (1764), Expect = 1.2e-193
Identity = 337/465 (72.47%), Positives = 395/465 (84.95%), Query Frame = 1

Query: 8   PHLALLPSPGMGHLIPLIEFAKRLLSHHRLTITFVIPSDGPPSTPQKAVLDSLPAGIDYL 67
           PH+ +LPSPGMGHLIPL+EFAKRLL  HR T+TF IPS  PPS  Q ++L SLP+GIDY+
Sbjct: 15  PHVVMLPSPGMGHLIPLLEFAKRLLFLHRFTVTFAIPSGDPPSKAQISILSSLPSGIDYV 74

Query: 68  FLPPVSFDDLPDASKIETIITLTISRSLPSLRNVLKSMVANSNLVGLVVDLFGTDAFDLA 127
           FLPPV+F DLP  +K E  I L ++RSLPS R++ KSMVAN+NLV LVVD FGTDAFD+A
Sbjct: 75  FLPPVNFHDLPKDTKAEVFIVLAVARSLPSFRDLFKSMVANTNLVALVVDQFGTDAFDVA 134

Query: 128 KEFNISSYMFFPSTAMFLSFALFLPKLDESVAGEFRDLPEPIRIPGCIPIQGKDLLDPVQ 187
           +EFN+S Y+FFP  AM LSF L LP+ DE+VA E+R+LPEPIR+ GC PI GKDL DP  
Sbjct: 135 REFNVSPYIFFPCAAMTLSFLLRLPEFDETVAEEYRELPEPIRLSGCAPIPGKDLADPFH 194

Query: 188 DRKNEAYKWTLHNAKRYALADGIFLNSFLELEPGAVNYLQEEEAGKPPVYPIGPLVKIDA 247
           DR+N+AYK  LHNAKRYALADGIFLNSF ELEPGA+  L EEE+ KP V+P+GPLV+ID+
Sbjct: 195 DRENDAYKLFLHNAKRYALADGIFLNSFPELEPGAIKALLEEESRKPLVHPVGPLVQIDS 254

Query: 248 NEKEERAECLKWLDEQPHGSVLFVSFGSGGTLSSHQINELASGLETSGQRFIWVVRSPSD 307
           +  EE AECLKWL+EQPHGSVLFVSFGSGGTLSS QINELA GLE SG RFIWVVRSPSD
Sbjct: 255 SGSEEGAECLKWLEEQPHGSVLFVSFGSGGTLSSDQINELALGLEMSGHRFIWVVRSPSD 314

Query: 308 KAANATYFSVQSQRDPLDFLPEGFVERTKNRGLVVPSWAPQAQILSHSSTGGFLTHCGWN 367
           +AANA++FSV SQ DPL FLPEGF+E T+ R +VVPSWAPQAQILSHSSTGGFL+HCGWN
Sbjct: 315 EAANASFFSVHSQNDPLSFLPEGFLEGTRGRSVVVPSWAPQAQILSHSSTGGFLSHCGWN 374

Query: 368 STLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVALRLKTNEKNGIVEKEEISKVVRSVM 427
           STLESVV GVPLIAWPLYAEQ+MNA++LTE+IKVALR KTNEK GIVEKEEI++ V+++M
Sbjct: 375 STLESVVYGVPLIAWPLYAEQKMNAILLTEDIKVALRPKTNEKTGIVEKEEIAEAVKTLM 434

Query: 428 EGEEGKKLRRKMKDLKEAAERVVGEDGSSTKIVSDVVNTWKARIS 473
           EGE+GKKLR KMK L+ AAERV+ EDGSS+K +S +V  WK++IS
Sbjct: 435 EGEDGKKLRSKMKYLRNAAERVLEEDGSSSKALSQMVLKWKSKIS 479

BLAST of CmoCh14G021600 vs. TrEMBL
Match: A0A061DTQ3_THECC (Glycosyltransferase OS=Theobroma cacao GN=TCM_005182 PE=3 SV=1)

HSP 1 Score: 677.9 bits (1748), Expect = 8.6e-192
Identity = 324/469 (69.08%), Positives = 395/469 (84.22%), Query Frame = 1

Query: 3   APPSIPHLALLPSPGMGHLIPLIEFAKRLLSHHRLTITFVIPSDGPPSTPQKAVLDSLPA 62
           A    PH+A+LPSPGMGHLIPL+EFAKRL+  H  T+TFVIP+DG PS  QK+ LDSLP+
Sbjct: 2   ATAQTPHIAILPSPGMGHLIPLVEFAKRLVHQHNFTVTFVIPTDGSPSKAQKSTLDSLPS 61

Query: 63  GIDYLFLPPVSFDDLPDASKIETIITLTISRSLPSLRNVLKSMVANSNLVGLVVDLFGTD 122
            ID +FLPPV   DLP+ SKIET+I+LT++RSLP +R+ LKS+ A + LVGLVVDLFGTD
Sbjct: 62  SIDSVFLPPVDLSDLPEGSKIETVISLTVARSLPFIRDALKSLAARTKLVGLVVDLFGTD 121

Query: 123 AFDLAKEFNISSYMFFPSTAMFLSFALFLPKLDESVAGEFRDLPEPIRIPGCIPIQGKDL 182
           AFD+A+EFN+S Y+FFPSTAM LS  L+LPKLD+ V+ E+RDLPE +RIPGCIPI G  L
Sbjct: 122 AFDVAREFNVSPYIFFPSTAMTLSLFLYLPKLDQMVSCEYRDLPEMVRIPGCIPIYGNQL 181

Query: 183 LDPVQDRKNEAYKWTLHNAKRYALADGIFLNSFLELEPGAVNYLQEEEAGKPPVYPIGPL 242
           LDP QDRKN++YKW LH+ KRY LA+GI +NSF++LE GA+  LQ++E GKPP+YP+GPL
Sbjct: 182 LDPTQDRKNDSYKWLLHHTKRYRLAEGIMVNSFVDLEGGAIKALQDKEPGKPPIYPVGPL 241

Query: 243 VKIDANEKEERAECLKWLDEQPHGSVLFVSFGSGGTLSSHQINELASGLETSGQRFIWVV 302
           V +D++ K + + CLKWLD QPHGSVL+VSFGSGGTLS +QINELA GLE S QRF+WVV
Sbjct: 242 VNVDSSSKADGSGCLKWLDGQPHGSVLYVSFGSGGTLSYNQINELALGLEMSQQRFLWVV 301

Query: 303 RSPSDKAANATYFSVQSQRDPLDFLPEGFVERTKNRGLVVPSWAPQAQILSHSSTGGFLT 362
           RSP+D+ ANAT+FSVQSQ+DP DFLP+GF+ERTK RGLVVPSWAPQAQ+LSH STGGFLT
Sbjct: 302 RSPNDQVANATFFSVQSQQDPFDFLPKGFLERTKGRGLVVPSWAPQAQVLSHGSTGGFLT 361

Query: 363 HCGWNSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVALRLKTNEKNGIVEKEEISKV 422
           HCGWNS LESVVNGVPLIAWPLYAEQ+MNAVML E+IKVALR K NE NG+V ++EI+K 
Sbjct: 362 HCGWNSALESVVNGVPLIAWPLYAEQKMNAVMLAEDIKVALRAKPNE-NGLVCRDEIAKA 421

Query: 423 VRSVMEGEEGKKLRRKMKDLKEAAERVVGEDGSSTKIVSDVVNTWKARI 472
           V+ +MEGEEGK +R +MKDLKEAA +V+ E+GSS K +S+V   W+ +I
Sbjct: 422 VKGLMEGEEGKGVRNRMKDLKEAAAKVLSENGSSGKALSEVAQKWRNQI 469

BLAST of CmoCh14G021600 vs. TrEMBL
Match: K7NBX4_SIRGR (Glycosyltransferase OS=Siraitia grosvenorii GN=UDPG2 PE=2 SV=1)

HSP 1 Score: 669.5 bits (1726), Expect = 3.1e-189
Identity = 328/465 (70.54%), Positives = 392/465 (84.30%), Query Frame = 1

Query: 8   PHLALLPSPGMGHLIPLIEFAKRLLSHHRLTITFVIPSDGPPSTPQKAVLDSLPAGIDYL 67
           PH+ +LPSPGMGHLIPL+EFAKRLL  HR T+TF IPS  PPS  Q ++L SLP+GIDY+
Sbjct: 15  PHVVMLPSPGMGHLIPLLEFAKRLLFLHRFTVTFAIPSGDPPSKAQISILSSLPSGIDYV 74

Query: 68  FLPPVSFDDLPDASKIETIITLTISRSLPSLRNVLKSMVANSNLVGLVVDLFGTDAFDLA 127
           FLPPV+F DLP  +K    I L ++RSLPS R++ KSMVAN+NLV LVVD FGTDAFD+A
Sbjct: 75  FLPPVNFHDLPKDTKAGVFIVLAVARSLPSFRDLFKSMVANTNLVALVVDQFGTDAFDVA 134

Query: 128 KEFNISSYMFFPSTAMFLSFALFLPKLDESVAGEFRDLPEPIRIPGCIPIQGKDLLDPVQ 187
           +EFN+S Y+FFP  AM LSF L LP+ DE+VAGE+R+LPEPIR+ GC PI GKDL  P  
Sbjct: 135 REFNVSPYIFFPCAAMTLSFLLRLPEFDETVAGEYRELPEPIRLSGCAPIPGKDLAGPFH 194

Query: 188 DRKNEAYKWTLHNAKRYALADGIFLNSFLELEPGAVNYLQEEEAGKPPVYPIGPLVKIDA 247
           DR+N+AYK  LHNAKRYALADGIFLNSF ELEPGA+  L EEE+ KP V+P+GPLV+ID+
Sbjct: 195 DRENDAYKLFLHNAKRYALADGIFLNSFPELEPGAIKALLEEESRKPLVHPVGPLVQIDS 254

Query: 248 NEKEERAECLKWLDEQPHGSVLFVSFGSGGTLSSHQINELASGLETSGQRFIWVVRSPSD 307
           +  EE AECLKWL+EQPHGSVLFVSFGSGG LSS QINELA GLE SG RFIWVVRSPSD
Sbjct: 255 SGSEEGAECLKWLEEQPHGSVLFVSFGSGGALSSDQINELALGLEMSGHRFIWVVRSPSD 314

Query: 308 KAANATYFSVQSQRDPLDFLPEGFVERTKNRGLVVPSWAPQAQILSHSSTGGFLTHCGWN 367
           +AANA++FSV SQ DPL FLPEGF+E T+ R +VVPSWAPQAQILSHSSTGGFL+HCGWN
Sbjct: 315 EAANASFFSVHSQNDPLSFLPEGFLEGTRGRSVVVPSWAPQAQILSHSSTGGFLSHCGWN 374

Query: 368 STLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVALRLKTNEKNGIVEKEEISKVVRSVM 427
           STLESVV GVPLIAWPLYAEQ+MNA++LTE+IK ALR K NE++G++EKEEI++VV+ + 
Sbjct: 375 STLESVVYGVPLIAWPLYAEQKMNAILLTEDIKAALRPKINEESGLIEKEEIAEVVKELF 434

Query: 428 EGEEGKKLRRKMKDLKEAAERVVGEDGSSTKIVSDVVNTWKARIS 473
           EGE+GK++R KM++LK+AA RV+GEDGSS+ + S+VV  WK +IS
Sbjct: 435 EGEDGKRVRAKMEELKDAAVRVLGEDGSSSTL-SEVVQKWKRKIS 478

BLAST of CmoCh14G021600 vs. TAIR10
Match: AT4G01070.1 (AT4G01070.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 591.3 bits (1523), Expect = 5.4e-169
Identity = 291/465 (62.58%), Positives = 364/465 (78.28%), Query Frame = 1

Query: 8   PHLALLPSPGMGHLIPLIEFAKRLLSHHRLTITFVIPSDGPPSTPQKAVLDSLPAGIDYL 67
           PH+A++PSPGMGHLIPL+EFAKRL+  H LT+TFVI  +GPPS  Q+ VLDSLP+ I  +
Sbjct: 7   PHVAIIPSPGMGHLIPLVEFAKRLVHLHGLTVTFVIAGEGPPSKAQRTVLDSLPSSISSV 66

Query: 68  FLPPVSFDDLPDASKIETIITLTISRSLPSLRNVLKSMVANSNL-VGLVVDLFGTDAFDL 127
           FLPPV   DL  +++IE+ I+LT++RS P LR V  S V    L   LVVDLFGTDAFD+
Sbjct: 67  FLPPVDLTDLSSSTRIESRISLTVTRSNPELRKVFDSFVEGGRLPTALVVDLFGTDAFDV 126

Query: 128 AKEFNISSYMFFPSTAMFLSFALFLPKLDESVAGEFRDLPEPIRIPGCIPIQGKDLLDPV 187
           A EF++  Y+F+P+TA  LSF L LPKLDE+V+ EFR+L EP+ +PGC+P+ GKD LDP 
Sbjct: 127 AVEFHVPPYIFYPTTANVLSFFLHLPKLDETVSCEFRELTEPLMLPGCVPVAGKDFLDPA 186

Query: 188 QDRKNEAYKWTLHNAKRYALADGIFLNSFLELEPGAVNYLQEEEAGKPPVYPIGPLVKID 247
           QDRK++AYKW LHN KRY  A+GI +N+F ELEP A+  LQE    KPPVYP+GPLV I 
Sbjct: 187 QDRKDDAYKWLLHNTKRYKEAEGILVNTFFELEPNAIKALQEPGLDKPPVYPVGPLVNIG 246

Query: 248 ANE--KEERAECLKWLDEQPHGSVLFVSFGSGGTLSSHQINELASGLETSGQRFIWVVRS 307
             E  + E +ECLKWLD QP GSVL+VSFGSGGTL+  Q+NELA GL  S QRF+WV+RS
Sbjct: 247 KQEAKQTEESECLKWLDNQPLGSVLYVSFGSGGTLTCEQLNELALGLADSEQRFLWVIRS 306

Query: 308 PSDKAANATYFSVQSQRDPLDFLPEGFVERTKNRGLVVPSWAPQAQILSHSSTGGFLTHC 367
           PS   AN++YF   SQ DPL FLP GF+ERTK RG V+P WAPQAQ+L+H STGGFLTHC
Sbjct: 307 PSG-IANSSYFDSHSQTDPLTFLPPGFLERTKKRGFVIPFWAPQAQVLAHPSTGGFLTHC 366

Query: 368 GWNSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVALRLKTNEKNGIVEKEEISKVVR 427
           GWNSTLESVV+G+PLIAWPLYAEQ+MNAV+L+E+I+ ALR +  + +G+V +EE+++VV+
Sbjct: 367 GWNSTLESVVSGIPLIAWPLYAEQKMNAVLLSEDIRAALRPRAGD-DGLVRREEVARVVK 426

Query: 428 SVMEGEEGKKLRRKMKDLKEAAERVVGEDGSSTKIVSDVVNTWKA 470
            +MEGEEGK +R KMK+LKEAA RV+ +DG+STK +S V   WKA
Sbjct: 427 GLMEGEEGKGVRNKMKELKEAACRVLKDDGTSTKALSLVALKWKA 469

BLAST of CmoCh14G021600 vs. TAIR10
Match: AT1G01420.1 (AT1G01420.1 UDP-glucosyl transferase 72B3)

HSP 1 Score: 551.6 bits (1420), Expect = 4.7e-157
Identity = 270/470 (57.45%), Positives = 362/470 (77.02%), Query Frame = 1

Query: 3   APPSIPHLALLPSPGMGHLIPLIEFAKRLLSHHRLTITFVIPSDGPPSTPQKAVLDSLPA 62
           A  + PH+A++PSPG+GHLIPL+E AKRLL +H  T+TF+IP D PPS  Q++VL+SLP+
Sbjct: 2   ADGNTPHVAIIPSPGIGHLIPLVELAKRLLDNHGFTVTFIIPGDSPPSKAQRSVLNSLPS 61

Query: 63  GIDYLFLPPVSFDDLPDASKIETIITLTISRSLPSLRNVLKSMVANSNLVG-LVVDLFGT 122
            I  +FLPP    D+P  ++IET I+LT++RS P+LR +  S+ A   L   LVVDLFGT
Sbjct: 62  SIASVFLPPADLSDVPSTARIETRISLTVTRSNPALRELFGSLSAEKRLPAVLVVDLFGT 121

Query: 123 DAFDLAKEFNISSYMFFPSTAMFLSFALFLPKLDESVAGEFRDLPEPIRIPGCIPIQGKD 182
           DAFD+A EF++S Y+F+ S A  L+F L LPKLDE+V+ EFR+L EP+ IPGC+PI GKD
Sbjct: 122 DAFDVAAEFHVSPYIFYASNANVLTFLLHLPKLDETVSCEFRELTEPVIIPGCVPITGKD 181

Query: 183 LLDPVQDRKNEAYKWTLHNAKRYALADGIFLNSFLELEPGAVNYLQEEEAGKPPVYPIGP 242
            +DP QDRK+E+YKW LHN KR+  A+GI +NSF++LEP  +  +QE    KPPVY IGP
Sbjct: 182 FVDPCQDRKDESYKWLLHNVKRFKEAEGILVNSFVDLEPNTIKIVQEPAPDKPPVYLIGP 241

Query: 243 LVKIDANEKE--ERAECLKWLDEQPHGSVLFVSFGSGGTLSSHQINELASGLETSGQRFI 302
           LV   +++ +  +  +CL WLD QP GSVL+VSFGSGGTL+  Q  ELA GL  SG+RF+
Sbjct: 242 LVNSGSHDADVNDEYKCLNWLDNQPFGSVLYVSFGSGGTLTFEQFIELALGLAESGKRFL 301

Query: 303 WVVRSPSDKAANATYFSVQSQRDPLDFLPEGFVERTKNRGLVVPSWAPQAQILSHSSTGG 362
           WV+RSPS   A+++YF+ QS+ DP  FLP+GF++RTK +GLVV SWAPQAQIL+H+S GG
Sbjct: 302 WVIRSPSG-IASSSYFNPQSRNDPFSFLPQGFLDRTKEKGLVVGSWAPQAQILTHTSIGG 361

Query: 363 FLTHCGWNSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVALRLKTNEKNGIVEKEEI 422
           FLTHCGWNS+LES+VNGVPLIAWPLYAEQ+MNA++L  ++  ALR +  E +G+V +EE+
Sbjct: 362 FLTHCGWNSSLESIVNGVPLIAWPLYAEQKMNALLLV-DVGAALRARLGE-DGVVGREEV 421

Query: 423 SKVVRSVMEGEEGKKLRRKMKDLKEAAERVVGEDGSSTKIVSDVVNTWKA 470
           ++VV+ ++EGEEG  +R+KMK+LKE + RV+ +DG STK +++V   WKA
Sbjct: 422 ARVVKGLIEGEEGNAVRKKMKELKEGSVRVLRDDGFSTKSLNEVSLKWKA 468

BLAST of CmoCh14G021600 vs. TAIR10
Match: AT1G01390.1 (AT1G01390.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 527.3 bits (1357), Expect = 9.6e-150
Identity = 261/469 (55.65%), Positives = 345/469 (73.56%), Query Frame = 1

Query: 3   APPSIPHLALLPSPGMGHLIPLIEFAKRLLSHHRLTITFVIPSDGPPSTPQKAVLDSLPA 62
           A  + PH+A++PSPGMGHLIP +E AKRL+ H   T+T +I  +  PS  Q++VL+SLP+
Sbjct: 2   AEANTPHIAIMPSPGMGHLIPFVELAKRLVQHDCFTVTMIISGETSPSKAQRSVLNSLPS 61

Query: 63  GIDYLFLPPVSFDDLPDASKIETIITLTISRSLPSLRNVLKSMVANSNLVG-LVVDLFGT 122
            I  +FLPP    D+P  ++IET   LT++RS P+LR +  S+    +L   LVVD+FG 
Sbjct: 62  SIASVFLPPADLSDVPSTARIETRAMLTMTRSNPALRELFGSLSTKKSLPAVLVVDMFGA 121

Query: 123 DAFDLAKEFNISSYMFFPSTAMFLSFALFLPKLDESVAGEFRDLPEPIRIPGCIPIQGKD 182
           DAFD+A +F++S Y+F+ S A  LSF L LPKLD++V+ EFR L EP++IPGC+PI GKD
Sbjct: 122 DAFDVAVDFHVSPYIFYASNANVLSFFLHLPKLDKTVSCEFRYLTEPLKIPGCVPITGKD 181

Query: 183 LLDPVQDRKNEAYKWTLHNAKRYALADGIFLNSFLELEPGAVNYLQEEEAGKPPVYPIGP 242
            LD VQDR ++AYK  LHN KRY  A GI +NSF++LE  A+  LQE    KP VYPIGP
Sbjct: 182 FLDTVQDRNDDAYKLLLHNTKRYKEAKGILVNSFVDLESNAIKALQEPAPDKPTVYPIGP 241

Query: 243 LVKIDAN--EKEERAECLKWLDEQPHGSVLFVSFGSGGTLSSHQINELASGLETSGQRFI 302
           LV   ++    E++  CL WLD QP GSVL++SFGSGGTL+  Q NELA GL  SG+RFI
Sbjct: 242 LVNTSSSNVNLEDKFGCLSWLDNQPFGSVLYISFGSGGTLTCEQFNELAIGLAESGKRFI 301

Query: 303 WVVRSPSDKAANATYFSVQSQRDPLDFLPEGFVERTKNRGLVVPSWAPQAQILSHSSTGG 362
           WV+RSPS+   +++YF+  S+ DP  FLP GF++RTK +GLVVPSWAPQ QIL+H ST G
Sbjct: 302 WVIRSPSE-IVSSSYFNPHSETDPFSFLPIGFLDRTKEKGLVVPSWAPQVQILAHPSTCG 361

Query: 363 FLTHCGWNSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVALRLKTNEKNGIVEKEEI 422
           FLTHCGWNSTLES+VNGVPLIAWPL+AEQ+MN ++L E++  ALR+   E +GIV +EE+
Sbjct: 362 FLTHCGWNSTLESIVNGVPLIAWPLFAEQKMNTLLLVEDVGAALRIHAGE-DGIVRREEV 421

Query: 423 SKVVRSVMEGEEGKKLRRKMKDLKEAAERVVGEDGSSTKIVSDVVNTWK 469
            +VV+++MEGEEGK +  K+K+LKE   RV+G+DG S+K   +V+  WK
Sbjct: 422 VRVVKALMEGEEGKAIGNKVKELKEGVVRVLGDDGLSSKSFGEVLLKWK 468

BLAST of CmoCh14G021600 vs. TAIR10
Match: AT5G66690.1 (AT5G66690.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 358.2 bits (918), Expect = 7.7e-99
Identity = 201/457 (43.98%), Positives = 294/457 (64.33%), Query Frame = 1

Query: 8   PHLALLPSPGMGHLIPLIEFAKRLLSHHRLTIT-FVIPSDGPPSTPQKAVLDSLPAGIDY 67
           PH A+  SPGMGH+IP+IE  KRL +++   +T FV+ +D   ++ Q   L+S   G+D 
Sbjct: 6   PHAAMFSSPGMGHVIPVIELGKRLSANNGFHVTVFVLETDA--ASAQSKFLNS--TGVDI 65

Query: 68  LFLPPVSFDDLPDASK-IETIITLTISRSLPSLRNVLKSMVANSNLVGLVVDLFGTDAFD 127
           + LP      L D    + T I + +  ++P+LR+ + +M  +     L+VDLFGTDA  
Sbjct: 66  VKLPSPDIYGLVDPDDHVVTKIGVIMRAAVPALRSKIAAM--HQKPTALIVDLFGTDALC 125

Query: 128 LAKEFNISSYMFFPSTAMFLSFALFLPKLDESVAGEFRDLPEPIRIPGCIPIQGKDLLDP 187
           LAKEFN+ SY+F P+ A FL  +++ P LD+ +  E      P+ IPGC P++ +D LD 
Sbjct: 126 LAKEFNMLSYVFIPTNARFLGVSIYYPNLDKDIKEEHTVQRNPLAIPGCEPVRFEDTLDA 185

Query: 188 VQDRKNEAYKWTLHNAKRYALADGIFLNSFLELEPGAVNYLQEEE----AGKPPVYPIGP 247
                   Y+  + +   Y  ADGI +N++ E+EP ++  L   +      + PVYPIGP
Sbjct: 186 YLVPDEPVYRDFVRHGLAYPKADGILVNTWEEMEPKSLKSLLNPKLLGRVARVPVYPIGP 245

Query: 248 LVK-IDANEKEERAECLKWLDEQPHGSVLFVSFGSGGTLSSHQINELASGLETSGQRFIW 307
           L + I ++E +     L WL+EQP+ SVL++SFGSGG LS+ Q+ ELA GLE S QRF+W
Sbjct: 246 LCRPIQSSETDHPV--LDWLNEQPNESVLYISFGSGGCLSAKQLTELAWGLEQSQQRFVW 305

Query: 308 VVRSPSDKAANATYFSVQ---SQRDPLDFLPEGFVERTKNRGLVVPSWAPQAQILSHSST 367
           VVR P D +  + Y S     ++ +  ++LPEGFV RT +RG VVPSWAPQA+ILSH + 
Sbjct: 306 VVRPPVDGSCCSEYVSANGGGTEDNTPEYLPEGFVSRTSDRGFVVPSWAPQAEILSHRAV 365

Query: 368 GGFLTHCGWNSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVALRLKTNEKNGIVEKE 427
           GGFLTHCGW+STLESVV GVP+IAWPL+AEQ MNA +L++E+ +A+RL   +++  + + 
Sbjct: 366 GGFLTHCGWSSTLESVVGGVPMIAWPLFAEQNMNAALLSDELGIAVRLDDPKED--ISRW 425

Query: 428 EISKVVRSVMEGEEGKKLRRKMKDLKEAAERVVGEDG 455
           +I  +VR VM  +EG+ +RRK+K L+++AE  +  DG
Sbjct: 426 KIEALVRKVMTEKEGEAMRRKVKKLRDSAEMSLSIDG 452

BLAST of CmoCh14G021600 vs. TAIR10
Match: AT3G50740.1 (AT3G50740.1 UDP-glucosyl transferase 72E1)

HSP 1 Score: 353.2 bits (905), Expect = 2.5e-97
Identity = 200/459 (43.57%), Positives = 281/459 (61.22%), Query Frame = 1

Query: 8   PHLALLPSPGMGHLIPLIEFAKRLLSHHRLTIT-FVIPSDGPPSTPQKAVLDSL---PAG 67
           PH+A+  SPGMGH+IP+IE  KRL   H   +T FV+ +D   ++ Q   L+S     A 
Sbjct: 6   PHVAMFASPGMGHIIPVIELGKRLAGSHGFDVTIFVLETDA--ASAQSQFLNSPGCDAAL 65

Query: 68  IDYLFLPPVSFDDLPDASKIETIITLTISR-SLPSLRNVLKSMVANSNLVGLVVDLFGTD 127
           +D + LP      L D S    I  L + R ++P++R+ ++ M        L+VDLFG D
Sbjct: 66  VDIVGLPTPDISGLVDPSAFFGIKLLVMMRETIPTIRSKIEEM--QHKPTALIVDLFGLD 125

Query: 128 AFDLAKEFNISSYMFFPSTAMFLSFALFLPKLDESVAGEFRDLPEPIRIPGCIPIQGKDL 187
           A  L  EFN+ +Y+F  S A FL+ ALF P LD+ +  E     +P+ +PGC P++ +D 
Sbjct: 126 AIPLGGEFNMLTYIFIASNARFLAVALFFPTLDKDMEEEHIIKKQPMVMPGCEPVRFEDT 185

Query: 188 LDPVQDRKNEAYKWTLHNAKRYALADGIFLNSFLELEPGAVNYLQEEE----AGKPPVYP 247
           L+   D  ++ Y+  +     +   DGI +N++ ++EP  +  LQ+ +        PVYP
Sbjct: 186 LETFLDPNSQLYREFVPFGSVFPTCDGIIVNTWDDMEPKTLKSLQDPKLLGRIAGVPVYP 245

Query: 248 IGPLVKIDANEKEERAECLKWLDEQPHGSVLFVSFGSGGTLSSHQINELASGLETSGQRF 307
           IGPL +   +  +     L WL++QP  SVL++SFGSGG+LS+ Q+ ELA GLE S QRF
Sbjct: 246 IGPLSR-PVDPSKTNHPVLDWLNKQPDESVLYISFGSGGSLSAKQLTELAWGLEMSQQRF 305

Query: 308 IWVVRSPSDKAANATYFSVQSQR---DPLDFLPEGFVERTKNRGLVVPSWAPQAQILSHS 367
           +WVVR P D +A + Y S  S +      D+LPEGFV RT  RG +V SWAPQA+IL+H 
Sbjct: 306 VWVVRPPVDGSACSAYLSANSGKIRDGTPDYLPEGFVSRTHERGFMVSSWAPQAEILAHQ 365

Query: 368 STGGFLTHCGWNSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVALRLKTNEKNGIVE 427
           + GGFLTHCGWNS LESVV GVP+IAWPL+AEQ MNA +L EE+ VA+R K     G++ 
Sbjct: 366 AVGGFLTHCGWNSILESVVGGVPMIAWPLFAEQMMNATLLNEELGVAVRSKKLPSEGVIT 425

Query: 428 KEEISKVVRSVMEGEEGKKLRRKMKDLKEAAERVVGEDG 455
           + EI  +VR +M  EEG ++R+K+K LKE A   +  DG
Sbjct: 426 RAEIEALVRKIMVEEEGAEMRKKIKKLKETAAESLSCDG 459

BLAST of CmoCh14G021600 vs. NCBI nr
Match: gi|449432066|ref|XP_004133821.1| (PREDICTED: hydroquinone glucosyltransferase-like [Cucumis sativus])

HSP 1 Score: 826.2 bits (2133), Expect = 2.8e-236
Identity = 409/473 (86.47%), Positives = 445/473 (94.08%), Query Frame = 1

Query: 1   MEAPPSIPHLALLPSPGMGHLIPLIEFAKRLLSHHRLTITFVIPSDGPPSTPQKAVLDSL 60
           MEA PSIPHLA+LPSPGMGHLIPLIEFAKRLLSHHRLT TF+I SDGPPS PQ+A+L+SL
Sbjct: 1   MEAHPSIPHLAILPSPGMGHLIPLIEFAKRLLSHHRLTFTFIIASDGPPSQPQQALLNSL 60

Query: 61  PAGIDYLFLPPVSFDDLPDASKIETIITLTISRSLPSLRNVLKSMVANSNLVGLVVDLFG 120
           P+GI +LFLP V+FDDLP  SKIETIITLTISRSLPSLRNVLKSMV+ SNLVGLVVDLFG
Sbjct: 61  PSGIHHLFLPAVTFDDLPPNSKIETIITLTISRSLPSLRNVLKSMVSQSNLVGLVVDLFG 120

Query: 121 TDAFDLAKEFNISSYMFFPSTAMFLSFALFLPKLDESVAGEFRDLPEPIRIPGCIPIQGK 180
           TD FD+A+EF+ISSY+FFPSTAMFLSFALFLPKLDES+ GEFRD PEPI+IPGCIPIQGK
Sbjct: 121 TDGFDIAREFDISSYIFFPSTAMFLSFALFLPKLDESIVGEFRDHPEPIKIPGCIPIQGK 180

Query: 181 DLLDPVQDRKNEAYKWTLHNAKRYALADGIFLNSFLELEPGAVNYLQEEEAGKPPVYPIG 240
           DLLDPVQDRKNEAYKWTLHNA+RYALADGIFLNSF ELEPGA+ YLQEEEAGKP VYPIG
Sbjct: 181 DLLDPVQDRKNEAYKWTLHNARRYALADGIFLNSFPELEPGAIKYLQEEEAGKPLVYPIG 240

Query: 241 PLVKIDANEKEERAECLKWLDEQPHGSVLFVSFGSGGTLSSHQINELASGLETSGQRFIW 300
           PLVKIDA+EKEERAECLKWLDEQPHGSVLFVSFGSGGTLSS QI+ELA GLE SGQRFIW
Sbjct: 241 PLVKIDADEKEERAECLKWLDEQPHGSVLFVSFGSGGTLSSAQIDELALGLEMSGQRFIW 300

Query: 301 VVRSPSDKAANATYFSVQSQRDPLDFLPEGFVERTKNRGLVVPSWAPQAQILSHSSTGGF 360
           VVRSPSDKAA+ATYFSV SQ DPLDFLPEGFVERTKNRG+VVPSWAPQAQILSH STGGF
Sbjct: 301 VVRSPSDKAADATYFSVHSQSDPLDFLPEGFVERTKNRGMVVPSWAPQAQILSHGSTGGF 360

Query: 361 LTHCGWNSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVALRLKTNEKNGIVEKEEIS 420
           LTHCGWNSTLESVVNG+PLIAWPLYAEQRMNAV+LTEEI VAL+ K N+  GIVEKEEIS
Sbjct: 361 LTHCGWNSTLESVVNGIPLIAWPLYAEQRMNAVILTEEINVALKPKRNDNKGIVEKEEIS 420

Query: 421 KVVRSVMEGEEGKKLRRKMKDLKEAAERVVGEDGSSTKIVSDVVNTWKARIST 474
           KVV+S++EGEEGKKLRRKMK+L+EA+++ VGEDGSSTKIV+D+VN WKA+IST
Sbjct: 421 KVVKSLLEGEEGKKLRRKMKELEEASKKAVGEDGSSTKIVTDLVNNWKAKIST 473

BLAST of CmoCh14G021600 vs. NCBI nr
Match: gi|659075019|ref|XP_008437921.1| (PREDICTED: hydroquinone glucosyltransferase-like [Cucumis melo])

HSP 1 Score: 824.3 bits (2128), Expect = 1.1e-235
Identity = 410/473 (86.68%), Positives = 442/473 (93.45%), Query Frame = 1

Query: 1   MEAPPSIPHLALLPSPGMGHLIPLIEFAKRLLSHHRLTITFVIPSDGPPSTPQKAVLDSL 60
           MEA P IPHLA+LPSPGMGHLIPLIEFAKRLLSHHRLT TF+I SDGPPS PQ+A+L+SL
Sbjct: 1   MEAHPPIPHLAILPSPGMGHLIPLIEFAKRLLSHHRLTFTFIIASDGPPSQPQQALLNSL 60

Query: 61  PAGIDYLFLPPVSFDDLPDASKIETIITLTISRSLPSLRNVLKSMVANSNLVGLVVDLFG 120
           P+GID+LFLPP+SFDDLP  SKIETIITLTISRSLPSLRNVLKSMV  SNLVGLVVDLFG
Sbjct: 61  PSGIDHLFLPPLSFDDLPPDSKIETIITLTISRSLPSLRNVLKSMVPQSNLVGLVVDLFG 120

Query: 121 TDAFDLAKEFNISSYMFFPSTAMFLSFALFLPKLDESVAGEFRDLPEPIRIPGCIPIQGK 180
           TDAFD+A+EFNISSY+FFPSTAM LSFALFLPKLDESV GEFRD PEPI+IPGCI I+GK
Sbjct: 121 TDAFDVAREFNISSYIFFPSTAMLLSFALFLPKLDESVVGEFRDHPEPIKIPGCIAIEGK 180

Query: 181 DLLDPVQDRKNEAYKWTLHNAKRYALADGIFLNSFLELEPGAVNYLQEEEAGKPPVYPIG 240
           DLLDPVQDRKNEAYKWTLHNAKRYALADGIFLNSF ELEPGA+ YL+EEE GKP VYPIG
Sbjct: 181 DLLDPVQDRKNEAYKWTLHNAKRYALADGIFLNSFPELEPGAIKYLREEEPGKPLVYPIG 240

Query: 241 PLVKIDANEKEERAECLKWLDEQPHGSVLFVSFGSGGTLSSHQINELASGLETSGQRFIW 300
           PLVKIDA+EKEERAECLKWLDEQPHGSVLFVSFGSGGTL S QI+ELA GLE SGQRFIW
Sbjct: 241 PLVKIDADEKEERAECLKWLDEQPHGSVLFVSFGSGGTLKSAQIDELALGLEMSGQRFIW 300

Query: 301 VVRSPSDKAANATYFSVQSQRDPLDFLPEGFVERTKNRGLVVPSWAPQAQILSHSSTGGF 360
           VVRSPSDKAA+ATYFSV SQ DPL FLPEGF+ERTKNRG+VVPSWAPQAQILSH STGGF
Sbjct: 301 VVRSPSDKAADATYFSVHSQSDPLGFLPEGFLERTKNRGMVVPSWAPQAQILSHGSTGGF 360

Query: 361 LTHCGWNSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVALRLKTNEKNGIVEKEEIS 420
           LTHCGWNSTLESVVNG+PLIAWPLYAEQRMNAVMLTEEI VAL+ K NEK GIVEKEEIS
Sbjct: 361 LTHCGWNSTLESVVNGIPLIAWPLYAEQRMNAVMLTEEINVALKPKRNEKTGIVEKEEIS 420

Query: 421 KVVRSVMEGEEGKKLRRKMKDLKEAAERVVGEDGSSTKIVSDVVNTWKARIST 474
           KVV+S++EGEEGKKLRRKMK+LKEA+E+ VGEDGSSTKIV+++VN WKA+IST
Sbjct: 421 KVVKSLLEGEEGKKLRRKMKELKEASEKAVGEDGSSTKIVTNLVNNWKAKIST 473

BLAST of CmoCh14G021600 vs. NCBI nr
Match: gi|343466215|gb|AEM43001.1| (UDP-glucosyltransferase [Siraitia grosvenorii])

HSP 1 Score: 684.1 bits (1764), Expect = 1.7e-193
Identity = 337/465 (72.47%), Positives = 395/465 (84.95%), Query Frame = 1

Query: 8   PHLALLPSPGMGHLIPLIEFAKRLLSHHRLTITFVIPSDGPPSTPQKAVLDSLPAGIDYL 67
           PH+ +LPSPGMGHLIPL+EFAKRLL  HR T+TF IPS  PPS  Q ++L SLP+GIDY+
Sbjct: 15  PHVVMLPSPGMGHLIPLLEFAKRLLFLHRFTVTFAIPSGDPPSKAQISILSSLPSGIDYV 74

Query: 68  FLPPVSFDDLPDASKIETIITLTISRSLPSLRNVLKSMVANSNLVGLVVDLFGTDAFDLA 127
           FLPPV+F DLP  +K E  I L ++RSLPS R++ KSMVAN+NLV LVVD FGTDAFD+A
Sbjct: 75  FLPPVNFHDLPKDTKAEVFIVLAVARSLPSFRDLFKSMVANTNLVALVVDQFGTDAFDVA 134

Query: 128 KEFNISSYMFFPSTAMFLSFALFLPKLDESVAGEFRDLPEPIRIPGCIPIQGKDLLDPVQ 187
           +EFN+S Y+FFP  AM LSF L LP+ DE+VA E+R+LPEPIR+ GC PI GKDL DP  
Sbjct: 135 REFNVSPYIFFPCAAMTLSFLLRLPEFDETVAEEYRELPEPIRLSGCAPIPGKDLADPFH 194

Query: 188 DRKNEAYKWTLHNAKRYALADGIFLNSFLELEPGAVNYLQEEEAGKPPVYPIGPLVKIDA 247
           DR+N+AYK  LHNAKRYALADGIFLNSF ELEPGA+  L EEE+ KP V+P+GPLV+ID+
Sbjct: 195 DRENDAYKLFLHNAKRYALADGIFLNSFPELEPGAIKALLEEESRKPLVHPVGPLVQIDS 254

Query: 248 NEKEERAECLKWLDEQPHGSVLFVSFGSGGTLSSHQINELASGLETSGQRFIWVVRSPSD 307
           +  EE AECLKWL+EQPHGSVLFVSFGSGGTLSS QINELA GLE SG RFIWVVRSPSD
Sbjct: 255 SGSEEGAECLKWLEEQPHGSVLFVSFGSGGTLSSDQINELALGLEMSGHRFIWVVRSPSD 314

Query: 308 KAANATYFSVQSQRDPLDFLPEGFVERTKNRGLVVPSWAPQAQILSHSSTGGFLTHCGWN 367
           +AANA++FSV SQ DPL FLPEGF+E T+ R +VVPSWAPQAQILSHSSTGGFL+HCGWN
Sbjct: 315 EAANASFFSVHSQNDPLSFLPEGFLEGTRGRSVVVPSWAPQAQILSHSSTGGFLSHCGWN 374

Query: 368 STLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVALRLKTNEKNGIVEKEEISKVVRSVM 427
           STLESVV GVPLIAWPLYAEQ+MNA++LTE+IKVALR KTNEK GIVEKEEI++ V+++M
Sbjct: 375 STLESVVYGVPLIAWPLYAEQKMNAILLTEDIKVALRPKTNEKTGIVEKEEIAEAVKTLM 434

Query: 428 EGEEGKKLRRKMKDLKEAAERVVGEDGSSTKIVSDVVNTWKARIS 473
           EGE+GKKLR KMK L+ AAERV+ EDGSS+K +S +V  WK++IS
Sbjct: 435 EGEDGKKLRSKMKYLRNAAERVLEEDGSSSKALSQMVLKWKSKIS 479

BLAST of CmoCh14G021600 vs. NCBI nr
Match: gi|645254806|ref|XP_008233208.1| (PREDICTED: hydroquinone glucosyltransferase-like [Prunus mume])

HSP 1 Score: 678.3 bits (1749), Expect = 9.5e-192
Identity = 326/473 (68.92%), Positives = 401/473 (84.78%), Query Frame = 1

Query: 2   EAPPSIPHLALLPSPGMGHLIPLIEFAKRLLSHHRLTITFVIPSDGPPSTPQKAVLDSLP 61
           +A P  PH+A+LPSPGMGHLIPL EFAK+++ +H  T+TF++P DGPP+  QK+VLD+LP
Sbjct: 3   QAQPRPPHIAILPSPGMGHLIPLAEFAKQVVHYHNFTVTFIVPCDGPPTKAQKSVLDALP 62

Query: 62  AGIDYLFLPPVSFDDLPDASKIETIITLTISRSLPSLRNVLKSMVANSNLVGLVVDLFGT 121
             ID++FLP VSFDDLP  SKIET+I+LT+SRSL SL + +KS+++ +NLVGLVVDLFGT
Sbjct: 63  IAIDHVFLPSVSFDDLPQGSKIETLISLTVSRSLTSLHDAIKSLISRANLVGLVVDLFGT 122

Query: 122 DAFDLAKEFNISSYMFFPSTAMFLSFALFLPKLDESVAGEFRDLPEPIRIPGCIPIQGKD 181
           DAFD+A+EFN+S Y+FFPSTAM LS  L+LPKLDE+ + E+R+L EP+ IPGCIPI G+D
Sbjct: 123 DAFDVAEEFNLSKYIFFPSTAMALSLFLYLPKLDETTSCEYRELAEPVTIPGCIPIHGRD 182

Query: 182 LLDPVQDRKNEAYKWTLHNAKRYALADGIFLNSFLELEPGAVNYLQEEEAGKPPVYPIGP 241
           LLDPVQDRK+EAYKW LH++KRY LADGI +NSF ELEPGA+  LQ  E GKPPVYP+GP
Sbjct: 183 LLDPVQDRKDEAYKWVLHHSKRYRLADGIMVNSFAELEPGALRALQGSEPGKPPVYPVGP 242

Query: 242 LVKIDANE--KEERAECLKWLDEQPHGSVLFVSFGSGGTLSSHQINELASGLETSGQRFI 301
           LVK++ +    E+ ++CLKWLDEQP GSVL+VSFGSGGTLS  QINELA GLE S QRF+
Sbjct: 243 LVKMEFSNALDEQSSKCLKWLDEQPRGSVLYVSFGSGGTLSYDQINELALGLEMSEQRFL 302

Query: 302 WVVRSPSDKAANATYFSVQSQRDPLDFLPEGFVERTKNRGLVVPSWAPQAQILSHSSTGG 361
           WVVRSPSDKAANATYFSV SQ DPL+FLP+GF+ RT+ RGLVVP+WAPQAQIL H STGG
Sbjct: 303 WVVRSPSDKAANATYFSVHSQNDPLEFLPKGFLGRTQGRGLVVPNWAPQAQILGHMSTGG 362

Query: 362 FLTHCGWNSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVALRLKTNEKNGIVEKEEI 421
           FLTHCGWNS LESVVNGVPL+AWPLYAEQ+MNAVM TE+IKVALR K +E NG+V +EEI
Sbjct: 363 FLTHCGWNSALESVVNGVPLVAWPLYAEQKMNAVMFTEDIKVALRPKASE-NGLVGREEI 422

Query: 422 SKVVRSVMEGEEGKKLRRKMKDLKEAAERVVGEDGSSTKIVSDVVNTWKARIS 473
           + VV+++MEGE+GK+LR +MKDLK+AA + + EDG+ST+ ++ VV  WK + S
Sbjct: 423 ALVVQALMEGEDGKRLRNRMKDLKDAAAKALSEDGASTRALAHVVTKWKTQFS 474

BLAST of CmoCh14G021600 vs. NCBI nr
Match: gi|590721393|ref|XP_007051600.1| (UDP-Glycosyltransferase superfamily protein [Theobroma cacao])

HSP 1 Score: 677.9 bits (1748), Expect = 1.2e-191
Identity = 324/469 (69.08%), Positives = 395/469 (84.22%), Query Frame = 1

Query: 3   APPSIPHLALLPSPGMGHLIPLIEFAKRLLSHHRLTITFVIPSDGPPSTPQKAVLDSLPA 62
           A    PH+A+LPSPGMGHLIPL+EFAKRL+  H  T+TFVIP+DG PS  QK+ LDSLP+
Sbjct: 2   ATAQTPHIAILPSPGMGHLIPLVEFAKRLVHQHNFTVTFVIPTDGSPSKAQKSTLDSLPS 61

Query: 63  GIDYLFLPPVSFDDLPDASKIETIITLTISRSLPSLRNVLKSMVANSNLVGLVVDLFGTD 122
            ID +FLPPV   DLP+ SKIET+I+LT++RSLP +R+ LKS+ A + LVGLVVDLFGTD
Sbjct: 62  SIDSVFLPPVDLSDLPEGSKIETVISLTVARSLPFIRDALKSLAARTKLVGLVVDLFGTD 121

Query: 123 AFDLAKEFNISSYMFFPSTAMFLSFALFLPKLDESVAGEFRDLPEPIRIPGCIPIQGKDL 182
           AFD+A+EFN+S Y+FFPSTAM LS  L+LPKLD+ V+ E+RDLPE +RIPGCIPI G  L
Sbjct: 122 AFDVAREFNVSPYIFFPSTAMTLSLFLYLPKLDQMVSCEYRDLPEMVRIPGCIPIYGNQL 181

Query: 183 LDPVQDRKNEAYKWTLHNAKRYALADGIFLNSFLELEPGAVNYLQEEEAGKPPVYPIGPL 242
           LDP QDRKN++YKW LH+ KRY LA+GI +NSF++LE GA+  LQ++E GKPP+YP+GPL
Sbjct: 182 LDPTQDRKNDSYKWLLHHTKRYRLAEGIMVNSFVDLEGGAIKALQDKEPGKPPIYPVGPL 241

Query: 243 VKIDANEKEERAECLKWLDEQPHGSVLFVSFGSGGTLSSHQINELASGLETSGQRFIWVV 302
           V +D++ K + + CLKWLD QPHGSVL+VSFGSGGTLS +QINELA GLE S QRF+WVV
Sbjct: 242 VNVDSSSKADGSGCLKWLDGQPHGSVLYVSFGSGGTLSYNQINELALGLEMSQQRFLWVV 301

Query: 303 RSPSDKAANATYFSVQSQRDPLDFLPEGFVERTKNRGLVVPSWAPQAQILSHSSTGGFLT 362
           RSP+D+ ANAT+FSVQSQ+DP DFLP+GF+ERTK RGLVVPSWAPQAQ+LSH STGGFLT
Sbjct: 302 RSPNDQVANATFFSVQSQQDPFDFLPKGFLERTKGRGLVVPSWAPQAQVLSHGSTGGFLT 361

Query: 363 HCGWNSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVALRLKTNEKNGIVEKEEISKV 422
           HCGWNS LESVVNGVPLIAWPLYAEQ+MNAVML E+IKVALR K NE NG+V ++EI+K 
Sbjct: 362 HCGWNSALESVVNGVPLIAWPLYAEQKMNAVMLAEDIKVALRAKPNE-NGLVCRDEIAKA 421

Query: 423 VRSVMEGEEGKKLRRKMKDLKEAAERVVGEDGSSTKIVSDVVNTWKARI 472
           V+ +MEGEEGK +R +MKDLKEAA +V+ E+GSS K +S+V   W+ +I
Sbjct: 422 VKGLMEGEEGKGVRNRMKDLKEAAAKVLSENGSSGKALSEVAQKWRNQI 469
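
Each HSP header above reports identical and positive positions as a fraction of the alignment length plus a percentage. The sketch below shows one way to read those numbers back and recompute the percentages; `parse_hsp_stats` and `percent` are hypothetical helper names chosen for illustration, not part of the database or of any BLAST tool, and the regular expression assumes the header layout shown on this page.

```python
import re

# Hypothetical pattern for the HSP summary line as printed on this page.
# "Posi?tives" also accepts the misspelling "Postives" found in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, length):
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


def parse_hsp_stats(line):
    """Extract identity and positive counts from one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


# Example line copied from the AT5G66690.1 hit on this page.
stats = parse_hsp_stats(
    "Identity = 201/457 (43.98%), Positives = 294/457 (64.33%), Query Frame = 1"
)
recomputed = percent(stats["identities"], stats["alignment_length"])
print(recomputed == stats["reported_identity_pct"])
```

The percentage is computed over the aligned length (including gap columns), not over the full query length, which is why hits covering different query ranges have different denominators.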

The following BLAST results are available for this feature:
Match Name    E-value     Identity    Description
HQGT_RAUSE    2.4e-179    64.81       Hydroquinone glucosyltransferase OS=Rauvolfia serpentina GN=AS PE=1 SV=1
U72B1_ARATH   9.6e-168    62.58       UDP-glycosyltransferase 72B1 OS=Arabidopsis thaliana GN=UGT72B1 PE=1 SV=1
U72B3_ARATH   8.4e-156    57.45       UDP-glycosyltransferase 72B3 OS=Arabidopsis thaliana GN=UGT72B3 PE=2 SV=1
U72B2_ARATH   1.7e-148    55.65       UDP-glycosyltransferase 72B2 OS=Arabidopsis thaliana GN=UGT72B2 PE=2 SV=1
UFOG5_MANES   7.7e-101    41.91       Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta GN=GT5 PE=2 SV=1

Match Name          E-value     Identity    Description
A0A0A0L3X8_CUCSA    2.0e-236    86.47       Glycosyltransferase OS=Cucumis sativus GN=Csa_3G119730 PE=3 SV=1
E5GCH8_CUCME        7.5e-236    86.68       Glycosyltransferase OS=Cucumis melo subsp. melo PE=3 SV=1
K7NBR5_SIRGR        1.2e-193    72.47       Glycosyltransferase OS=Siraitia grosvenorii GN=UDPG3 PE=2 SV=1
A0A061DTQ3_THECC    8.6e-192    69.08       Glycosyltransferase OS=Theobroma cacao GN=TCM_005182 PE=3 SV=1
K7NBX4_SIRGR        3.1e-189    70.54       Glycosyltransferase OS=Siraitia grosvenorii GN=UDPG2 PE=2 SV=1

Match Name     E-value     Identity    Description
AT4G01070.1    5.4e-169    62.58       UDP-Glycosyltransferase superfamily protein
AT1G01420.1    4.7e-157    57.45       UDP-glucosyl transferase 72B3
AT1G01390.1    9.6e-150    55.65       UDP-Glycosyltransferase superfamily protein
AT5G66690.1    7.7e-99     43.98       UDP-Glycosyltransferase superfamily protein
AT3G50740.1    2.5e-97     43.57       UDP-glucosyl transferase 72E1

Match Name                          E-value     Identity    Description
gi|449432066|ref|XP_004133821.1|    2.8e-236    86.47       PREDICTED: hydroquinone glucosyltransferase-like [Cucumis sativus]
gi|659075019|ref|XP_008437921.1|    1.1e-235    86.68       PREDICTED: hydroquinone glucosyltransferase-like [Cucumis melo]
gi|343466215|gb|AEM43001.1|         1.7e-193    72.47       UDP-glucosyltransferase [Siraitia grosvenorii]
gi|645254806|ref|XP_008233208.1|    9.5e-192    68.92       PREDICTED: hydroquinone glucosyltransferase-like [Prunus mume]
gi|590721393|ref|XP_007051600.1|    1.2e-191    69.08       UDP-Glycosyltransferase superfamily protein [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term          Definition
IPR002213     UDP_glucos_trans
Vocabulary: Molecular Function
Term          Definition
GO:0016758    transferase activity, transferring hexosyl groups
Vocabulary: Biological Process
Term          Definition
GO:0008152    metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016758 transferase activity, transferring hexosyl groups

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
CmoCh14G021600.1    CmoCh14G021600.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                           Source   Source Term         Source Description                              Alignment
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PANTHER  PTHR11926           GLUCOSYL/GLUCURONOSYL TRANSFERASES              coord: 5..472; score: 1.2E
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PFAM     PF00201             UDPGT                                           coord: 265..407; score: 1.1
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PROSITE  PS00375             UDPGT                                           coord: 345..388; scor
None       No IPR available                          unknown  Coil                Coil                                            coord: 432..452; scor
None       No IPR available                          GENE3D   G3DSA:3.40.50.2000                                                  coord: 250..427; score: 7.5
None       No IPR available                          PANTHER  PTHR11926:SF189     UDP-GLYCOSYLTRANSFERASE 72B2-RELATED            coord: 5..472; score: 1.2E
None       No IPR available                          unknown  SSF53756            UDP-Glycosyltransferase/glycogen phosphorylase  coord: 8..467; score: 6.98E

The following gene(s) are paralogous to this gene:

None