CmoCh04G005890 (gene) Cucurbita moschata (Rifu)

Name: CmoCh04G005890
Type: gene
Organism: Cucurbita moschata (Rifu)
Description: UDP-glycosyltransferase
Location: Cmo_Chr04 : 2938658 .. 2941634 (+)
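
The span of the gene on the chromosome follows from the location coordinates. The sketch below assumes the coordinates are 1-based and inclusive, as is conventional for GFF-style annotation; the record itself does not state this, and the helper name `feature_length` is purely illustrative.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of this record.
gene_length = feature_length(2938658, 2941634)
print(gene_length)  # 2977
```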
The following sequences are available for this feature:

Gene sequence (with intron)

CGAACAAACAAAATGGCAGAAACCCAATCATCAAAGCTCCATATTGCCATGTTCCCATGGTTTGCCGCCGGCCACATGACTCCATTTCTTCATATCTCCAACGAGCTCGCCGCCAGAGGCCACAAAATCACCTTCCTTTTGCCCTCCAAAGCGCTCCCTCTTTTACAAAATCTAAATCTTCACCCAAATCTCATCTCCTTCCATTTCTTGACGGTTCCCCATGTCTCCGGCCTCCCTCCGGCGACGGAAACCGCCTCTGATATACCAATTTCTCTTACCCCCTTGCTCGCCTCTGCTTTCGACATGACTCGGCAGCAGGTGGCGGAGATCCTCTGTTCTGCCTGCCCTGATGTCGTTTTCTATGATTTTGCGTATTGGGTCCCTGAAATCGCTGCGCCGCTGCGGATCAAATCGGTTAGTTTTACTGTTGTCAGTGCTGCGTCGATTGCTGTTATTGCTTATCCGAGAAGAAGGGTGACCGTTGATGACCCGATTACGGAGGAGGAGCTTAGGAAGCCGCCGCCTGGTTACCCGTCGTCCACCGTCGTCCTCCGTGGCTGCCGTGAAGAGCGGTCGCTGCTCTTCTTGTCCATGCCGTTCGGCGAAGGTATTTGTTCCTAATTTTTAATGGGTAAAATTATATACCTTTGATCTTTTATTAAATTTAGTTTTAGTTTAATTCGACTTTTAAAATTTAAAATGGGTTTCATGGGTTTCAAATTTGGTTCCTTTGTACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGCTAGATCGACCTCTCTTGCCTCGAGGTGATCAACACTAAGAGATCTCTTGAACCTGTGCGGTGAAAAAATGCATCGATACTAGAGAGTTCCACCTGATAAGGAAGCACCGTGTGAGCCGTGATGGATATTGGATCTAGAAATTACGATCCAACATGAGCCAATAATTTATCAAAGCATTTATTACGATCCAATAATGAGCCAATAATTTATCAGAGCATTTATTATGATACAACCTGAGCCAATAATTTATCAGAGCATTTATATTCATCTACTTTCAGACTAATGTGTGATAGTACTTCTCTTAAATTTTAACAACTAGTCGATATAATATTAAATAAAAATTTAAGTACAAAATTGAAAGTTGAGTTATGCTTAAATTGAATGTTATATCGCGTGAAATTAAATTATCACAAAATTTAAAAA
CCATCTATTTATATTTCCAGGAGGTATAACGTTCCACGAGAGACTAATGACGTCATACAGAAACAGCGACGCAATAGCAATACGAACATGTGAAGAAATCGAAGGCAATTTCTGCAGCTACTTAGCAAAGCAATTCCAAAAGAAGCTATTACTAACCGGGCCACTCATGGCAATACCAAACAAGACAACGACGACGACAACAACCTCGTGTTTGGACGAAAAATGGGAGAAATGGCTCGACCAATTCGAACCAAAAACAGTAATTTTCTGTGCATTTGGAAGCCAATTAACCTTAGAAAAGGACCAACTCCAAGAACTCGTGTTGGGAATAGAACAAACTAGGTTGCCATTTTTGGTAGCTCTAAAGCCACCAACAGGATCAAGCTCCATTGAAGAAGCACTACCAGAAGGGTTCGAAGAAAGGGTGAGAGAAAGAGGAGCCGTTTATGGCGGTTGGGTTCAGCAGCCATTGATTCTAAAGCACCCATCGGTTGGTTGCTTTGTGAGCCATTGTGGGTTCGGTTCGATGTGGGAGTCATTGATGAGTGAGCCTCAAATTGTGCTGATTCCGAGCCTTGGCGACCAAATATTGAACGCAAGGCTGCTGGCTCAAGAGCTCCAAGTGGGCGTGGAAGTGAAGAAGAGGGAAGAAGATGGGAAGTTCACAAGCCAAAGTGTGAGGGAAGCCATTGAGTCAGTGATGCTTGTGGAAGCAGGTGGCGTTGGTGAAATGGTCAAGAAAAACCATAAAAAATGGAACCACATTTTGACTAACCCTGGCTTCATGGATGCTTATATTCACAATTTTGTTAATGATTTGCAAAATGGTTGGACCTAGATCGCACCCACACCCGCACCCATTGGGCTCGGATAAGAGTGTCGAGATGTCTGGATTTGTATAGTTTCGAGTTCATAGCTACCTAAGTTTTAGAGTTTTTTTTAGCCCTTTTTATTTTCATTCTTTATTTCTCTAGTAG

mRNA sequence

CGAACAAACAAAATGGCAGAAACCCAATCATCAAAGCTCCATATTGCCATGTTCCCATGGTTTGCCGCCGGCCACATGACTCCATTTCTTCATATCTCCAACGAGCTCGCCGCCAGAGGCCACAAAATCACCTTCCTTTTGCCCTCCAAAGCGCTCCCTCTTTTACAAAATCTAAATCTTCACCCAAATCTCATCTCCTTCCATTTCTTGACGGTTCCCCATGTCTCCGGCCTCCCTCCGGCGACGGAAACCGCCTCTGATATACCAATTTCTCTTACCCCCTTGCTCGCCTCTGCTTTCGACATGACTCGGCAGCAGGTGGCGGAGATCCTCTGTTCTGCCTGCCCTGATGTCGTTTTCTATGATTTTGCGTATTGGGTCCCTGAAATCGCTGCGCCGCTGCGGATCAAATCGGTTAGTTTTACTGTTGTCAGTGCTGCGTCGATTGCTGTTATTGCTTATCCGAGAAGAAGGGTGACCGTTGATGACCCGATTACGGAGGAGGAGCTTAGGAAGCCGCCGCCTGGTTACCCGTCGTCCACCGTCGTCCTCCGTGGCTGCCGTGAAGAGCGGTCGCTGCTCTTCTTGTCCATGCCGTTCGGCGAAGGAGGTATAACGTTCCACGAGAGACTAATGACGTCATACAGAAACAGCGACGCAATAGCAATACGAACATGTGAAGAAATCGAAGGCAATTTCTGCAGCTACTTAGCAAAGCAATTCCAAAAGAAGCTATTACTAACCGGGCCACTCATGGCAATACCAAACAAGACAACGACGACGACAACAACCTCGTGTTTGGACGAAAAATGGGAGAAATGGCTCGACCAATTCGAACCAAAAACAGTAATTTTCTGTGCATTTGGAAGCCAATTAACCTTAGAAAAGGACCAACTCCAAGAACTCGTGTTGGGAATAGAACAAACTAGGTTGCCATTTTTGGTAGCTCTAAAGCCACCAACAGGATCAAGCTCCATTGAAGAAGCACTACCAGAAGGGTTCGAAGAAAGGGTGAGAGAAAGAGGAGCCGTTTATGGCGGTTGGGTTCAGCAGCCATTGATTCTAAAGCACCCATCGGTTGGTTGCTTTGTGAGCCATTGTGGGTTCGGTTCGATGTGGGAGTCATTGATGAGTGAGCCTCAAATTGTGCTGATTCCGAGCCTTGGCGACCAAATATTGAACGCAAGGCTGCTGGCTCAAGAGCTCCAAGTGGGCGTGGAAGTGAAGAAGAGGGAAGAAGATGGGAAGTTCACAAGCCAAAGTGTGAGGGAAGCCATTGAGTCAGTGATGCTTGTGGAAGCAGGTGGCGTTGGTGAAATGGTCAAGAAAAACCATAAAAAATGGAACCACATTTTGACTAACCCTGGCTTCATGGATGCTTATATTCACAATTTTGTTAATGATTTGCAAAATGGTTGGACCTAGATCGCACCCACACCCGCACCCATTGGGCTCGGATAAGAGTGTCGAGATGTCTGGATTTGTATAGTTTCGAGTTCATAGCTACCTAAGTTTTAGAGTTTTTTTTAGCCCTTTTTATTTTCATTCTTTATTTCTCTAGTAG

Coding sequence (CDS)

ATGGCAGAAACCCAATCATCAAAGCTCCATATTGCCATGTTCCCATGGTTTGCCGCCGGCCACATGACTCCATTTCTTCATATCTCCAACGAGCTCGCCGCCAGAGGCCACAAAATCACCTTCCTTTTGCCCTCCAAAGCGCTCCCTCTTTTACAAAATCTAAATCTTCACCCAAATCTCATCTCCTTCCATTTCTTGACGGTTCCCCATGTCTCCGGCCTCCCTCCGGCGACGGAAACCGCCTCTGATATACCAATTTCTCTTACCCCCTTGCTCGCCTCTGCTTTCGACATGACTCGGCAGCAGGTGGCGGAGATCCTCTGTTCTGCCTGCCCTGATGTCGTTTTCTATGATTTTGCGTATTGGGTCCCTGAAATCGCTGCGCCGCTGCGGATCAAATCGGTTAGTTTTACTGTTGTCAGTGCTGCGTCGATTGCTGTTATTGCTTATCCGAGAAGAAGGGTGACCGTTGATGACCCGATTACGGAGGAGGAGCTTAGGAAGCCGCCGCCTGGTTACCCGTCGTCCACCGTCGTCCTCCGTGGCTGCCGTGAAGAGCGGTCGCTGCTCTTCTTGTCCATGCCGTTCGGCGAAGGAGGTATAACGTTCCACGAGAGACTAATGACGTCATACAGAAACAGCGACGCAATAGCAATACGAACATGTGAAGAAATCGAAGGCAATTTCTGCAGCTACTTAGCAAAGCAATTCCAAAAGAAGCTATTACTAACCGGGCCACTCATGGCAATACCAAACAAGACAACGACGACGACAACAACCTCGTGTTTGGACGAAAAATGGGAGAAATGGCTCGACCAATTCGAACCAAAAACAGTAATTTTCTGTGCATTTGGAAGCCAATTAACCTTAGAAAAGGACCAACTCCAAGAACTCGTGTTGGGAATAGAACAAACTAGGTTGCCATTTTTGGTAGCTCTAAAGCCACCAACAGGATCAAGCTCCATTGAAGAAGCACTACCAGAAGGGTTCGAAGAAAGGGTGAGAGAAAGAGGAGCCGTTTATGGCGGTTGGGTTCAGCAGCCATTGATTCTAAAGCACCCATCGGTTGGTTGCTTTGTGAGCCATTGTGGGTTCGGTTCGATGTGGGAGTCATTGATGAGTGAGCCTCAAATTGTGCTGATTCCGAGCCTTGGCGACCAAATATTGAACGCAAGGCTGCTGGCTCAAGAGCTCCAAGTGGGCGTGGAAGTGAAGAAGAGGGAAGAAGATGGGAAGTTCACAAGCCAAAGTGTGAGGGAAGCCATTGAGTCAGTGATGCTTGTGGAAGCAGGTGGCGTTGGTGAAATGGTCAAGAAAAACCATAAAAAATGGAACCACATTTTGACTAACCCTGGCTTCATGGATGCTTATATTCACAATTTTGTTAATGATTTGCAAAATGGTTGGACCTAG
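
As a quick consistency check, the start of the coding sequence can be translated and compared with the query residues shown in the alignments below. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the `translate` helper is illustrative and not part of any tool used to build this record. Only the first 30 and last 9 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS above.
print(translate("ATGGCAGAAACCCAATCATCAAAGCTCCAT"))  # MAETQSSKLH
# Last 9 nucleotides of the CDS above, ending in the TAG stop codon.
print(translate("TGGACCTAG"))  # WT
```

The first result agrees with the query residues 1 to 10 (MAETQSSKLH) that appear in the alignments starting at query position 1.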
BLAST of CmoCh04G005890 vs. Swiss-Prot
Match: U79B6_ARATH (UDP-glycosyltransferase 79B6 OS=Arabidopsis thaliana GN=UGT79B6 PE=2 SV=1)

HSP 1 Score: 501.9 bits (1291), Expect = 7.6e-141
Identity = 251/460 (54.57%), Positives = 329/460 (71.52%), Query Frame = 1

Query: 7   SKLHIAMFPWFAAGHMTPFLHISNELAARGHKITFLLPSKALPLLQNLNLHPNLISFHFL 66
           SK H  MFPWF  GHMT FLH++N+LA + HKITFLLP KA   L++LNL P+ I F  L
Sbjct: 3   SKFHAFMFPWFGFGHMTAFLHLANKLAEKDHKITFLLPKKARKQLESLNLFPDCIVFQTL 62

Query: 67  TVPHVSGLPPATETASDIPISLTPLLASAFDMTRQQVAEILCSACPDVVFYDFAYWVPEI 126
           T+P V GLP   ET SDIPISL   LASA D TR QV E +    PD++F+DFA+W+PEI
Sbjct: 63  TIPSVDGLPDGAETTSDIPISLGSFLASAMDRTRIQVKEAVSVGKPDLIFFDFAHWIPEI 122

Query: 127 AAPLRIKSVSFTVVSAASIAVIAYPRRRVTVDDPITEEELRKPPPGYPSSTVVLRGCREE 186
           A    +KSV+F  +SAA +A+   P R        ++++L   PPGYPSS V+LRG  E 
Sbjct: 123 AREYGVKSVNFITISAACVAISFVPGR--------SQDDLGSTPPGYPSSKVLLRG-HET 182

Query: 187 RSLLFLSMPFGEGGITFHERLMTSYRNSDAIAIRTCEEIEGNFCSYLAKQFQKKLLLTGP 246
            SL FLS PFG+G  +F+ER+M   +N D I+IRTC+E+EG FC ++  QFQ+K+LLTGP
Sbjct: 183 NSLSFLSYPFGDG-TSFYERIMIGLKNCDVISIRTCQEMEGKFCDFIENQFQRKVLLTGP 242

Query: 247 LMAIPNKTTTTTTTSCLDEKWEKWLDQFEPKTVIFCAFGSQLTLEKDQLQELVLGIEQTR 306
           ++  P+ +        L+++W +WL +F+P +VI+CA GSQ+ LEKDQ QEL LG+E T 
Sbjct: 243 MLPEPDNSKP------LEDQWRQWLSKFDPGSVIYCALGSQIILEKDQFQELCLGMELTG 302

Query: 307 LPFLVALKPPTGSSSIEEALPEGFEERVRERGAVYGGWVQQPLILKHPSVGCFVSHCGFG 366
           LPFLVA+KPP GSS+I+EALP+GFEERV+ RG V+GGWVQQPLIL HPS+GCFVSHCGFG
Sbjct: 303 LPFLVAVKPPKGSSTIQEALPKGFEERVKARGVVWGGWVQQPLILAHPSIGCFVSHCGFG 362

Query: 367 SMWESLMSEPQIVLIPSLGDQILNARLLAQELQVGVEVKKREEDGKFTSQSVREAIESVM 426
           SMWE+L+++ QIV IP LG+QILN RL+++EL+V VEV KREE G F+ +S+  A+ SVM
Sbjct: 363 SMWEALVNDCQIVFIPHLGEQILNTRLMSEELKVSVEV-KREETGWFSKESLSGAVRSVM 422

Query: 427 LVEAGGVGEMVKKNHKKWNHILTNPGFMDAYIHNFVNDLQ 467
             ++  +G   ++NH KW   L   G M  Y++ FV  L+
Sbjct: 423 DRDS-ELGNWARRNHVKWKESLLRHGLMSGYLNKFVEALE 444
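
The percentages in an HSP header are the match counts divided by the alignment length (which includes gap columns). A minimal sketch using the counts reported for the U79B6_ARATH hit above; the function name `percent` is illustrative only.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Share of alignment columns that match, as a percentage with 2 decimals."""
    return round(100.0 * matches / alignment_length, 2)


# Counts from the HSP header of the U79B6_ARATH hit: 251 identical and
# 329 positive-scoring columns out of 460 aligned columns.
print(percent(251, 460))  # 54.57
print(percent(329, 460))  # 71.52
```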

BLAST of CmoCh04G005890 vs. Swiss-Prot
Match: U79B2_ARATH (UDP-glycosyltransferase 79B2 OS=Arabidopsis thaliana GN=UGT79B2 PE=2 SV=1)

HSP 1 Score: 496.5 bits (1277), Expect = 3.2e-139
Identity = 252/460 (54.78%), Positives = 321/460 (69.78%), Query Frame = 1

Query: 8   KLHIAMFPWFAAGHMTPFLHISNELAARGHKITFLLPSKALPLLQNLNLHPNLISFHFLT 67
           K H+ M+PWFA GHMTPFL ++N+LA +GH +TFL+P KAL  L+NLNL P+ I F  +T
Sbjct: 5   KFHVLMYPWFATGHMTPFLFLANKLAEKGHTVTFLIPKKALKQLENLNLFPHNIVFRSVT 64

Query: 68  VPHVSGLPPATETASDIPISLTPLLASAFDMTRQQVAEILCSACPDVVFYDFAYWVPEIA 127
           VPHV GLP  TET S+IP++   LL SA D+TR QV  ++ +  PD++F+DFA+W+PE+A
Sbjct: 65  VPHVDGLPVGTETVSEIPVTSADLLMSAMDLTRDQVEGVVRAVEPDLIFFDFAHWIPEVA 124

Query: 128 APLRIKSVSFTVVSAASIAVIAYPRRRVTVDDPITEEELRKPPPGYPSSTVVLRGCREER 187
               +K+V + VVSA++IA +  P             EL  PPPGYPSS V+LR      
Sbjct: 125 RDFGLKTVKYVVVSASTIASMLVPGG-----------ELGVPPPGYPSSKVLLRKQDAYT 184

Query: 188 SLLFLSMPFGEGGITFHERLMTSYRNSDAIAIRTCEEIEGNFCSYLAKQFQKKLLLTGPL 247
                S      G    ER+ TS  NSD IAIRT  EIEGNFC Y+ K  +KK+LLTGP+
Sbjct: 185 MKNLESTNTINVGPNLLERVTTSLMNSDVIAIRTAREIEGNFCDYIEKHCRKKVLLTGPV 244

Query: 248 MAIPNKTTTTTTTSCLDEKWEKWLDQFEPKTVIFCAFGSQLTLEKDQLQELVLGIEQTRL 307
              P+KT        L+E+W KWL  +EP +V+FCA GSQ+ LEKDQ QEL LG+E T  
Sbjct: 245 FPEPDKTRE------LEERWVKWLSGYEPDSVVFCALGSQVILEKDQFQELCLGMELTGS 304

Query: 308 PFLVALKPPTGSSSIEEALPEGFEERVRERGAVYGGWVQQPLILKHPSVGCFVSHCGFGS 367
           PFLVA+KPP GSS+I+EALPEGFEERV+ RG V+G WVQQPL+L HPSVGCFVSHCGFGS
Sbjct: 305 PFLVAVKPPRGSSTIQEALPEGFEERVKGRGVVWGEWVQQPLLLSHPSVGCFVSHCGFGS 364

Query: 368 MWESLMSEPQIVLIPSLGDQILNARLLAQELQVGVEVKKREEDGKFTSQSVREAIESVML 427
           MWESL+S+ QIVL+P LGDQ+LN RLL+ EL+V VEV  REE G F+ +S+ +AI SVM 
Sbjct: 365 MWESLLSDCQIVLVPQLGDQVLNTRLLSDELKVSVEV-AREETGWFSKESLFDAINSVMK 424

Query: 428 VEAGGVGEMVKKNHKKWNHILTNPGFMDAYIHNFVNDLQN 468
            ++  +G +VKKNH KW   LT+PG +  Y+ NF+  LQ+
Sbjct: 425 RDS-EIGNLVKKNHTKWRETLTSPGLVTGYVDNFIESLQD 445

BLAST of CmoCh04G005890 vs. Swiss-Prot
Match: AXYLT_ARATH (Anthocyanidin 3-O-glucoside 2'''-O-xylosyltransferase OS=Arabidopsis thaliana GN=A3G2XYLT PE=1 SV=1)

HSP 1 Score: 495.4 bits (1274), Expect = 7.1e-139
Identity = 257/466 (55.15%), Positives = 331/466 (71.03%), Query Frame = 1

Query: 5   QSSKLHIAMFPWFAAGHMTPFLHISNELAARGHKITFLLPSKALPLLQNLNLHPNLISFH 64
           +SS + I M+PW A GHMTPFLH+SN+LA +GHKI FLLP KAL  L+ LNL+PNLI+FH
Sbjct: 8   ESSSMSIVMYPWLAFGHMTPFLHLSNKLAEKGHKIVFLLPKKALNQLEPLNLYPNLITFH 67

Query: 65  FLTVPHVSGLPPATETASDIPISLTPLLASAFDMTRQQVAEILCSACPDVVFYDFAYWVP 124
            +++P V GLPP  ET SD+P  LT LLA A D TR +V  I  +  PD+VFYD A+W+P
Sbjct: 68  TISIPQVKGLPPGAETNSDVPFFLTHLLAVAMDQTRPEVETIFRTIKPDLVFYDSAHWIP 127

Query: 125 EIAAPLRIKSVSFTVVSAASIAVIAYP--RRRVTVDDPITEEELRKPPPGYPSSTVVLRG 184
           EIA P+  K+V F +VSAASIA+   P   R V     ++ EEL K P GYPSS VVLR 
Sbjct: 128 EIAKPIGAKTVCFNIVSAASIALSLVPSAEREVIDGKEMSGEELAKTPLGYPSSKVVLRP 187

Query: 185 CREERSLLFLSMPFGEGGITFHERLMTSYRNSDAIAIRTCEEIEGNFCSYLAKQFQKKLL 244
             E +SL F+       G +F +  +T+ RN DAIAIRTC E EG FC Y+++Q+ K + 
Sbjct: 188 -HEAKSLSFVWRKHEAIG-SFFDGKVTAMRNCDAIAIRTCRETEGKFCDYISRQYSKPVY 247

Query: 245 LTGPLM--AIPNKTTTTTTTSCLDEKWEKWLDQFEPKTVIFCAFGSQLTLEK-DQLQELV 304
           LTGP++  + PN+ +       LD +W +WL +F   +V+FCAFGSQ  + K DQ QEL 
Sbjct: 248 LTGPVLPGSQPNQPS-------LDPQWAEWLAKFNHGSVVFCAFGSQPVVNKIDQFQELC 307

Query: 305 LGIEQTRLPFLVALKPPTGSSSIEEALPEGFEERVRERGAVYGGWVQQPLILKHPSVGCF 364
           LG+E T  PFLVA+KPP+G S++EEALPEGF+ERV+ RG V+GGW+QQPL+L HPSVGCF
Sbjct: 308 LGLESTGFPFLVAIKPPSGVSTVEEALPEGFKERVQGRGVVFGGWIQQPLVLNHPSVGCF 367

Query: 365 VSHCGFGSMWESLMSEPQIVLIPSLGDQILNARLLAQELQVGVEVKKREEDGKFTSQSVR 424
           VSHCGFGSMWESLMS+ QIVL+P  G+QILNARL+ +E++V VEV +RE+ G F+ QS+ 
Sbjct: 368 VSHCGFGSMWESLMSDCQIVLVPQHGEQILNARLMTEEMEVAVEV-EREKKGWFSRQSLE 427

Query: 425 EAIESVMLVEAGGVGEMVKKNHKKWNHILTNPGFMDAYIHNFVNDL 466
            A++SVM  E   +GE V+KNH KW  +LT+ GF D YI  F  +L
Sbjct: 428 NAVKSVM-EEGSEIGEKVRKNHDKWRCVLTDSGFSDGYIDKFEQNL 462

BLAST of CmoCh04G005890 vs. Swiss-Prot
Match: U79B3_ARATH (UDP-glycosyltransferase 79B3 OS=Arabidopsis thaliana GN=UGT79B3 PE=2 SV=1)

HSP 1 Score: 493.8 bits (1270), Expect = 2.1e-138
Identity = 251/462 (54.33%), Positives = 326/462 (70.56%), Query Frame = 1

Query: 8   KLHIAMFPWFAAGHMTPFLHISNELAARGHKITFLLPSKALPLLQNLNLHPNLISFHFLT 67
           K H+ M+PWFA GHMTPFL ++N+LA +GH +TFLLP K+L  L++ NL P+ I F  +T
Sbjct: 5   KFHVLMYPWFATGHMTPFLFLANKLAEKGHTVTFLLPKKSLKQLEHFNLFPHNIVFRSVT 64

Query: 68  VPHVSGLPPATETASDIPISLTPLLASAFDMTRQQVAEILCSACPDVVFYDFAYWVPEIA 127
           VPHV GLP  TETAS+IP++ T LL SA D+TR QV  ++ +  PD++F+DFA+W+PE+A
Sbjct: 65  VPHVDGLPVGTETASEIPVTSTDLLMSAMDLTRDQVEAVVRAVEPDLIFFDFAHWIPEVA 124

Query: 128 APLRIKSVSFTVVSAASIAVIAYPRRRVTVDDPITEEELRKPPPGYPSSTVVLRGCREER 187
               +K+V + VVSA++IA +  P             EL  PPPGYPSS V+LR  +++ 
Sbjct: 125 RDFGLKTVKYVVVSASTIASMLVPGG-----------ELGVPPPGYPSSKVLLR--KQDA 184

Query: 188 SLLFLSMPFG--EGGITFHERLMTSYRNSDAIAIRTCEEIEGNFCSYLAKQFQKKLLLTG 247
             +    P    + G    ER+ TS  NSD IAIRT  EIEGNFC Y+ K  +KK+LLTG
Sbjct: 185 YTMKKLEPTNTIDVGPNLLERVTTSLMNSDVIAIRTAREIEGNFCDYIEKHCRKKVLLTG 244

Query: 248 PLMAIPNKTTTTTTTSCLDEKWEKWLDQFEPKTVIFCAFGSQLTLEKDQLQELVLGIEQT 307
           P+   P+KT        L+E+W KWL  +EP +V+FCA GSQ+ LEKDQ QEL LG+E T
Sbjct: 245 PVFPEPDKTRE------LEERWVKWLSGYEPDSVVFCALGSQVILEKDQFQELCLGMELT 304

Query: 308 RLPFLVALKPPTGSSSIEEALPEGFEERVRERGAVYGGWVQQPLILKHPSVGCFVSHCGF 367
             PFLVA+KPP GSS+I+EALPEGFEERV+ RG V+GGWVQQPLIL HPSVGCFVSHCGF
Sbjct: 305 GSPFLVAVKPPRGSSTIQEALPEGFEERVKGRGLVWGGWVQQPLILSHPSVGCFVSHCGF 364

Query: 368 GSMWESLMSEPQIVLIPSLGDQILNARLLAQELQVGVEVKKREEDGKFTSQSVREAIESV 427
           GSMWESL+S+ QIVL+P LGDQ+LN RLL+ EL+V VEV  REE G F+ +S+ +A+ SV
Sbjct: 365 GSMWESLLSDCQIVLVPQLGDQVLNTRLLSDELKVSVEV-AREETGWFSKESLCDAVNSV 424

Query: 428 MLVEAGGVGEMVKKNHKKWNHILTNPGFMDAYIHNFVNDLQN 468
           M  ++  +G +V+KNH KW   + +PG M  Y+  FV  LQ+
Sbjct: 425 MKRDS-ELGNLVRKNHTKWRETVASPGLMTGYVDAFVESLQD 445

BLAST of CmoCh04G005890 vs. Swiss-Prot
Match: U79B9_ARATH (UDP-glycosyltransferase 79B9 OS=Arabidopsis thaliana GN=UGT79B9 PE=2 SV=1)

HSP 1 Score: 484.6 bits (1246), Expect = 1.3e-135
Identity = 247/458 (53.93%), Positives = 324/458 (70.74%), Query Frame = 1

Query: 10  HIAMFPWFAAGHMTPFLHISNELAARGHKITFLLPSKALPLLQNLNLHPNLISFHFLTVP 69
           H  MFPWFA GHMTP+LH++N+LAA+GH++TFLLP KA   L++ NL P+ I FH LT+P
Sbjct: 6   HAFMFPWFAFGHMTPYLHLANKLAAKGHRVTFLLPKKAQKQLEHHNLFPDRIIFHSLTIP 65

Query: 70  HVSGLPPATETASDIPISLTPLLASAFDMTRQQVAEILCSACPDVVFYDFAYWVPEIAAP 129
           HV GLP   ETASDIPISL   L +A D+TR QV   + +  PD++F+D AYWVPE+A  
Sbjct: 66  HVDGLPAGAETASDIPISLGKFLTAAMDLTRDQVEAAVRALRPDLIFFDTAYWVPEMAKE 125

Query: 130 LRIKSVSFTVVSAASIAVIAYPRRRVTVDDPITEEELRKPPPGYPSSTVVLRGCREERSL 189
            R+KSV + V+SA SIA            + +   EL  PPPGYPSS V+ RG  +  +L
Sbjct: 126 HRVKSVIYFVISANSIA-----------HELVPGGELGVPPPGYPSSKVLYRG-HDAHAL 185

Query: 190 LFLSMPFGEGGITFHERLMTSYRNSDAIAIRTCEEIEGNFCSYLAKQFQKKLLLTGPLMA 249
           L  S+ +       H R+ T  +N D I+IRTC+EIEG FC Y+ +Q+Q+K+LLTGP++ 
Sbjct: 186 LTFSIFYER----LHYRITTGLKNCDFISIRTCKEIEGKFCDYIERQYQRKVLLTGPMLP 245

Query: 250 IPNKTTTTTTTSCLDEKWEKWLDQFEPKTVIFCAFGSQLTLEKDQLQELVLGIEQTRLPF 309
            P+ +        L+++W  WL+QF+P +VI+CA GSQ+TLEKDQ QEL LG+E T LPF
Sbjct: 246 EPDNSRP------LEDRWNHWLNQFKPGSVIYCALGSQITLEKDQFQELCLGMELTGLPF 305

Query: 310 LVALKPPTGSSSIEEALPEGFEERVRERGAVYGGWVQQPLILKHPSVGCFVSHCGFGSMW 369
           LVA+KPP G+ +I+EALPEGFEERV+  G V+G WVQQPLIL HPSVGCFV+HCGFGSMW
Sbjct: 306 LVAVKPPKGAKTIQEALPEGFEERVKNHGVVWGEWVQQPLILAHPSVGCFVTHCGFGSMW 365

Query: 370 ESLMSEPQIVLIPSLGDQILNARLLAQELQVGVEVKKREEDGKFTSQSVREAIESVMLVE 429
           ESL+S+ QIVL+P L DQILN RL+++EL+V VEV KREE G F+ +S+  AI SVM  +
Sbjct: 366 ESLVSDCQIVLLPYLCDQILNTRLMSEELEVSVEV-KREETGWFSKESLSVAITSVMDKD 425

Query: 430 AGGVGEMVKKNHKKWNHILTNPGFMDAYIHNFVNDLQN 468
           +  +G +V++NH K   +L +PG +  Y   FV  LQN
Sbjct: 426 S-ELGNLVRRNHAKLKEVLVSPGLLTGYTDEFVETLQN 439

BLAST of CmoCh04G005890 vs. TrEMBL
Match: A0A0A0L6N9_CUCSA (Glycosyltransferase OS=Cucumis sativus GN=Csa_3G236030 PE=3 SV=1)

HSP 1 Score: 664.1 bits (1712), Expect = 1.3e-187
Identity = 325/470 (69.15%), Positives = 386/470 (82.13%), Query Frame = 1

Query: 5   QSSKLHIAMFPWFAAGHMTPFLHISNELAARGHKITFLLPSKALPLLQNLNLHPNLISFH 64
           ++  LHI MFPWFA GH+TPFLHISN LA++ H+ITFLLP+    L  +LNL+P+LISFH
Sbjct: 56  ETQNLHILMFPWFATGHITPFLHISNHLASKNHRITFLLPNNPSSLFSSLNLYPDLISFH 115

Query: 65  FLTVPHVSGLPPATETASDIPISLTPLLASAFDMTRQQVAEILCSACPDVVFYDFAYWVP 124
           FL++P V GLPP+  +ASDIP+SLTPLLASA D+TR QV  I+ S  PD VF+DFA+W+P
Sbjct: 116 FLSLPSVPGLPPSAHSASDIPLSLTPLLASALDLTRPQVDRIIHSLRPDFVFFDFAHWIP 175

Query: 125 EIAAPLRIKSVSFTVVSAASIAVIAYPRRRVTVDDPITEEELRKPPPGYPSSTVVLRGCR 184
           +I APL+I+S+ FTVVSAAS+AV  +P RRV++D P+T+E+ R+PP GYPSSTVV  G R
Sbjct: 176 DITAPLQIRSICFTVVSAASVAVTVFPGRRVSLDHPLTDEDFREPPVGYPSSTVVFHGSR 235

Query: 185 EERSLLFLSMPFGEGGITFHERLMTSYRNSDAIAIRTCEEIEGNFCSYLAKQFQKKLLLT 244
           E RSLLFLSMPFG+ GITFHER MTSY+ SDAIA+RTC+EIEG+FC +L+ QFQKK+LLT
Sbjct: 236 ESRSLLFLSMPFGQ-GITFHERFMTSYKKSDAIAMRTCQEIEGDFCDFLSNQFQKKILLT 295

Query: 245 GPLMAIPNKTTTTTTTSCLDEKWEKWLDQFEPKTVIFCAFGSQLTLEKDQLQELVLGIEQ 304
           GPLMA P+     TT   LD++WEKWL QF+ KTVIFCAFGSQ+ LEK QL+ELVLGIEQ
Sbjct: 296 GPLMAAPSSKIKATT---LDKEWEKWLGQFQQKTVIFCAFGSQVILEKQQLEELVLGIEQ 355

Query: 305 TRLPFLVALKPPTGSSSIEEALPEGFEERVRERGAVYGGWVQQPLILKHPSVGCFVSHCG 364
           T LPFLVALKPP G  S+EEALP+GFEERV+ERG VYGGWVQQPLIL H S+GCFVSHCG
Sbjct: 356 TGLPFLVALKPPMGYDSMEEALPKGFEERVKERGIVYGGWVQQPLILNHSSIGCFVSHCG 415

Query: 365 FGSMWESLMSEPQIVLIPSLGDQILNARLLAQELQVGVEVKKREEDGKFTSQSVREAIES 424
           FGSMWESLMS+ QIVLIP+LGDQILN RLLAQEL+VGVEV KREEDG FT QSVR+AIE 
Sbjct: 416 FGSMWESLMSDAQIVLIPTLGDQILNTRLLAQELKVGVEV-KREEDGSFTRQSVRQAIEL 475

Query: 425 VMLVE----AGGVGEMVKKNHKKWNHILTNPGFMDAYIHNFVNDLQNGWT 471
           VM+ +      GVGE+VKKNH KW  +LT PGF++ YI NFV  LQ  W+
Sbjct: 476 VMVDDKNNNRSGVGEIVKKNHAKWKDLLTKPGFLETYIDNFVKKLQEPWS 520

BLAST of CmoCh04G005890 vs. TrEMBL
Match: A0A059BUT6_EUCGR (Uncharacterized protein (Fragment) OS=Eucalyptus grandis GN=EUGRSUZ_F03293 PE=4 SV=1)

HSP 1 Score: 575.5 bits (1482), Expect = 6.0e-161
Identity = 287/459 (62.53%), Positives = 351/459 (76.47%), Query Frame = 1

Query: 1   MAETQSSKLHIAMFPWFAAGHMTPFLHISNELAARGHKITFLLPSKALPLLQNLNLHPNL 60
           M+E  +SK HIAMFPWFA GH+TPFLH+SNELA RGHKI+F LP KAL LL+NLNLHPNL
Sbjct: 1   MSEETNSKFHIAMFPWFAVGHVTPFLHLSNELAKRGHKISFFLPRKALILLENLNLHPNL 60

Query: 61  ISFHFLTVPHVSGLPPATETASDIPISLTPLLASAFDMTRQQVAEILCSACPDVVFYDFA 120
           I+FH LTVP V+ LPP TETASDIPIS  P LA A D+TR+Q+   L +  PD +FYD A
Sbjct: 61  ITFHPLTVPSVATLPPGTETASDIPISDAPSLAVAMDLTRRQLEVSLQAMRPDFIFYDTA 120

Query: 121 YWVPEIAAPLRIKSVSFTVVSAASIAVIAYPRRRVTVDDPITEEELRKPPPGYPSSTVVL 180
           +W+P++A PL I++V + VVSAA+IA++  P R VT   P+TEEEL KPP GYPS+TVVL
Sbjct: 121 HWIPQVARPLGIRTVCYNVVSAAAIAIVLVPAREVTPGKPLTEEELGKPPVGYPSNTVVL 180

Query: 181 RGCREERSLLFLSMPFGEGGITFHERLMTSYRNSDAIAIRTCEEIEGNFCSYLAKQFQKK 240
           RG  E RSL+F+S+PFG+  ITFH R  T+ R  DAIA+RTC EIEG+ C+Y++ Q+ K 
Sbjct: 181 RG-NEARSLIFISLPFGD-NITFHGRTTTAMRECDAIAMRTCREIEGDLCAYISNQYGKP 240

Query: 241 LLLTGPLMAIPNKTTTTTTTSCLDEKWEKWLDQFEPKTVIFCAFGSQLTLEKDQLQELVL 300
           + LTGP++  P   T       LDEKW +WL +FEP +V+FCAFGSQ  LEK Q QEL+ 
Sbjct: 241 VFLTGPVLPEPAMET-------LDEKWAEWLSRFEPGSVVFCAFGSQHVLEKGQFQELLR 300

Query: 301 GIEQTRLPFLVALKPPTGSSSIEEALPEGFEERVRERGAVYGGWVQQPLILKHPSVGCFV 360
           G E T LPFL+AL+PP G++S+EEA PEGFEERVR RG V+GGWVQQPLIL HPSVGCFV
Sbjct: 301 GFESTGLPFLIALRPPIGTNSVEEAFPEGFEERVRGRGVVHGGWVQQPLILSHPSVGCFV 360

Query: 361 SHCGFGSMWESLMSEPQIVLIPSLGDQILNARLLAQELQVGVEVKKREEDGKFTSQSVRE 420
           +HCGFGSMWESL+ + QIV++P L DQILN RLLA EL+VGVEV +REE G F+ +S+  
Sbjct: 361 NHCGFGSMWESLLGDCQIVMVPHLADQILNTRLLANELKVGVEV-EREESGWFSKESLCR 420

Query: 421 AIESVMLVEAGGVGEMVKKNHKKWNHILTNPGFMDAYIH 460
           AIESVM  E   VG +VKKNH KW  +L +P FM  YI+
Sbjct: 421 AIESVM-YEKSEVGLLVKKNHAKWREVLVSPNFMTGYIN 448

BLAST of CmoCh04G005890 vs. TrEMBL
Match: B9SB92_RICCO (UDP-glucosyltransferase, putative OS=Ricinus communis GN=RCOM_0649280 PE=4 SV=1)

HSP 1 Score: 562.0 bits (1447), Expect = 6.9e-157
Identity = 280/467 (59.96%), Positives = 352/467 (75.37%), Query Frame = 1

Query: 1   MAETQSSKLHIAMFPWFAAGHMTPFLHISNELAARGHKITFLLPSKALPLLQNLNLHPNL 60
           MA+ +SS  HI MFPWFA GHMTPFLH++N +A RG   TFLLP+KA   L++ N HP+L
Sbjct: 1   MAQPKSSNFHIVMFPWFAVGHMTPFLHLANRVAERGCSTTFLLPNKAKLQLEHFNTHPDL 60

Query: 61  ISFHFLTVPHVSGLPPATETASDIPISLTPLLASAFDMTRQQVAEILCSACPDVVFYDFA 120
           I+FH +TVPHV GLP  TETASDIPI LT  LA A D TR+QV +++    P +V +D A
Sbjct: 61  ITFHSITVPHVEGLPLGTETASDIPIHLTHFLAIALDRTRRQVEKVIVDTRPKLVIFDVA 120

Query: 121 YWVPEIAAPLRIKSVSFTVVSAASIAVIAYPRRRVTVDDPITEEELRKPPPGYPSSTVVL 180
           +W+P+I   L IK++++ VV AASIA+   P R VT D P+TE EL +PP GYPSS VVL
Sbjct: 121 HWIPKITKDLGIKAINYNVVCAASIAIALVPARNVTKDRPVTEAELLQPPAGYPSSNVVL 180

Query: 181 RGCREERSLLFLSMPFGEGGITFHERLMTSYRNSDAIAIRTCEEIEGNFCSYLAKQFQKK 240
           RG  E RSLLF+S+PFGEG ITF+ER+ T+ + SDAIAIRTC EIEG  C Y+A Q++K 
Sbjct: 181 RG-HEVRSLLFVSLPFGEG-ITFYERIYTAIKGSDAIAIRTCHEIEGKLCDYIASQYEKP 240

Query: 241 LLLTGPLMAIPNKTTTTTTTSCLDEKWEKWLDQFEPKTVIFCAFGSQLTLEKDQLQELVL 300
           + LTGP++  P+K         L+++W KWL  FE  +VIFCAFGSQ+ LEK+Q QELVL
Sbjct: 241 VFLTGPVLPEPSKAP-------LEDQWTKWLGGFEKDSVIFCAFGSQIKLEKNQFQELVL 300

Query: 301 GIEQTRLPFLVALKPPTGSSSIEEALPEGFEERVRERGAVYGGWVQQPLILKHPSVGCFV 360
           G+E T LPFL ALKPP G+S++EEALPEGFEERV  RG ++GGWVQQ LIL HPSVGCF+
Sbjct: 301 GLESTGLPFLAALKPPNGASTVEEALPEGFEERVNGRGVIWGGWVQQLLILDHPSVGCFL 360

Query: 361 SHCGFGSMWESLMSEPQIVLIPSLGDQILNARLLAQELQVGVEVKKREEDGKFTSQSVRE 420
           +HCGFGSMWESLMS+ QIVL+P LGDQILN R++A+EL+VGVEV  R+E G F+ +S+R+
Sbjct: 361 NHCGFGSMWESLMSDCQIVLVPHLGDQILNTRIMAEELKVGVEV-VRDESGWFSKESLRK 420

Query: 421 AIESVMLVEAGGVGEMVKKNHKKWNHILTNPGFMDAYIHNFVNDLQN 468
           AI SVM  +   VG MVK+NH+KW  IL   GFM +YI  FV ++Q+
Sbjct: 421 AITSVM-DKNSEVGSMVKENHRKWTEILGGEGFMTSYIDKFVQNMQD 456

BLAST of CmoCh04G005890 vs. TrEMBL
Match: W9R7I3_9ROSA (UDP-glycosyltransferase OS=Morus notabilis GN=L484_012162 PE=4 SV=1)

HSP 1 Score: 561.6 bits (1446), Expect = 9.0e-157
Identity = 272/456 (59.65%), Positives = 350/456 (76.75%), Query Frame = 1

Query: 11  IAMFPWFAAGHMTPFLHISNELAARGHKITFLLPSKALPLLQNLNLHPNLISFHFLTVPH 70
           +AMFPWFA GH+ PF+H++NELA RGH+I+ LLP KA  LLQ+LNLHPNLI+FH +TVPH
Sbjct: 6   VAMFPWFATGHIAPFIHVANELAVRGHRISILLPKKAQILLQHLNLHPNLITFHTITVPH 65

Query: 71  VSGLPPATETASDIPISLTPLLASAFDMTRQQVAEILCSACPDVVFYDFAYWVPEIAAPL 130
           V GLPP  ETASDI +S T LLA+A D+TR QV + L +A P +VFYDFA+WVP+I A +
Sbjct: 66  VDGLPPGVETASDIHLSKTHLLAAAMDLTRPQVHDFLSAAKPQIVFYDFAHWVPDITAHI 125

Query: 131 RIKSVSFTVVSAASIAVIAYPRRRVTVDDPITEEELRKPPPGYPSSTVVLRGCREERSLL 190
              SV ++VVSA+S+A+   P R V  D  +T E+++ PPPGYPSSTVVLRG  E RSLL
Sbjct: 126 GAISVCYSVVSASSLAIALVPARNVPCDRTVTVEDIKDPPPGYPSSTVVLRG-PEVRSLL 185

Query: 191 FLSMPFGEGGITFHERLMTSYRNSDAIAIRTCEEIEGNFCSYLAKQFQKKLLLTGPLMAI 250
           F+S+PFG+G ITF+ER  T+ RN+D ++IRTC E+EG  C Y+A Q++K LLLTGP+++ 
Sbjct: 186 FISLPFGDG-ITFYERTTTAMRNADVLSIRTCREVEGELCDYIASQYKKPLLLTGPVLSE 245

Query: 251 PNKTTTTTTTSCLDEKWEKWLDQFEPKTVIFCAFGSQLTLEKDQLQELVLGIEQTRLPFL 310
           P  TT       L+ +W +WL++F+P +V+FCAFGSQ  L+KDQ QEL+LG E T  PFL
Sbjct: 246 PTDTTP------LEVRWTEWLNRFKPGSVVFCAFGSQHILKKDQFQELLLGFESTDFPFL 305

Query: 311 VALKPPTGSSSIEEALPEGFEERVRERGAVYGGWVQQPLILKHPSVGCFVSHCGFGSMWE 370
           +ALKPP G  ++EEA P GF ERV+ RG VYGGWVQQPLIL HPSVGCFV+HCGFGSMWE
Sbjct: 306 IALKPPVGCETVEEAFPVGFAERVKGRGIVYGGWVQQPLILNHPSVGCFVNHCGFGSMWE 365

Query: 371 SLMSEPQIVLIPSLGDQILNARLLAQELQVGVEVKKREEDGKFTSQSVREAIESVMLVEA 430
           +L+S+ QIVL+P LGDQILN +LLA+E++V VEV+K  E G F+ +S+R+AI+SV + E 
Sbjct: 366 ALLSKNQIVLVPHLGDQILNTKLLAKEIKVAVEVEKELESGWFSKESLRKAIKSV-VDED 425

Query: 431 GGVGEMVKKNHKKWNHILTNPGFMDAYIHNFVNDLQ 467
             VG MV+KNH KW  +L  PGF+  YI  FV +L+
Sbjct: 426 SEVGIMVRKNHAKWRDLLGKPGFISGYIDEFVKNLE 452

BLAST of CmoCh04G005890 vs. TrEMBL
Match: A0A059BVI7_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_F03295 PE=4 SV=1)

HSP 1 Score: 547.4 bits (1409), Expect = 1.8e-152
Identity = 272/454 (59.91%), Positives = 341/454 (75.11%), Query Frame = 1

Query: 13  MFPWFAAGHMTPFLHISNELAARGHKITFLLPSKALPLLQNLNLHPNLISFHFLTVPHVS 72
           MFPWFA GH+TP+LH+SNELA RGHKI+F LP KAL LL+NLNLHPNLI+FH LTVP V+
Sbjct: 1   MFPWFAVGHITPYLHLSNELAKRGHKISFFLPRKALILLENLNLHPNLITFHPLTVPSVA 60

Query: 73  GLPPATETASDIPISLTPLLASAFDMTRQQVAEILCSACPDVVFYDFAYWVPEIAAPLRI 132
            LPP TETASDI  S +P LA+A D+TR Q+   L +  PD +FYD+A+W+P++A PL I
Sbjct: 61  TLPPGTETASDITFSDSPFLAAAMDLTRPQLEVSLQAMQPDFIFYDYAHWIPQVARPLGI 120

Query: 133 KSVSFTVVSAASIAVIAYPRRRVTVDDPITEEELRKPPPGYPSSTVVLRGCREERSLLFL 192
           ++V + V+SAA+IA++  P R VT   P TEEEL KPP GYPS TVVLRG  E R L+F+
Sbjct: 121 RTVCYNVISAAAIAMVLVPVREVTQGKPCTEEELEKPPNGYPSKTVVLRGS-EARPLIFI 180

Query: 193 SMPFGEGGITFHERLMTSYRNSDAIAIRTCEEIEGNFCSYLAKQFQKKLLLTGPLMAIPN 252
           S+  G+  ITF+ R  T+ R  DAIA+RTC+EIE +FC+Y++ Q+ K + LTGP++  P+
Sbjct: 181 SLLSGDN-ITFYGRGTTAMRECDAIALRTCQEIERDFCAYISSQYGKPVFLTGPVLPKPD 240

Query: 253 KTTTTTTTSCLDEKWEKWLDQFEPKTVIFCAFGSQLTLEKDQLQELVLGIEQTRLPFLVA 312
                     LDEKW +WL QF+P +V+FCAFGSQ  LEK Q QEL+ G E T LPF +A
Sbjct: 241 M-------KLLDEKWAEWLGQFKPGSVVFCAFGSQHVLEKGQFQELLRGFESTGLPFFIA 300

Query: 313 LKPPTGSSSIEEALPEGFEERVRERGAVYGGWVQQPLILKHPSVGCFVSHCGFGSMWESL 372
           L+PP G++S+EEA PEGFEERVR RG V+GGWVQQPLIL HPSVGCFV+HCGFGSMWESL
Sbjct: 301 LRPPIGTNSVEEAFPEGFEERVRGRGVVHGGWVQQPLILSHPSVGCFVNHCGFGSMWESL 360

Query: 373 MSEPQIVLIPSLGDQILNARLLAQELQVGVEVKKREEDGKFTSQSVREAIESVMLVEAGG 432
           + + QIV++P L DQILN RLLA EL+VG+EV +REE G F+ +S+  AIESVM  E   
Sbjct: 361 LGDCQIVMVPHLADQILNTRLLANELKVGIEV-EREESGWFSKESLCRAIESVM-DEKSE 420

Query: 433 VGEMVKKNHKKWNHILTNPGFMDAYIHNFVNDLQ 467
           VG +VKKNH KW  +L +P FM  YI  F+ +LQ
Sbjct: 421 VGLLVKKNHAKWREVLVSPNFMTGYIDRFIQNLQ 443

BLAST of CmoCh04G005890 vs. TAIR10
Match: AT5G54010.1 (AT5G54010.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 501.9 bits (1291), Expect = 4.3e-142
Identity = 251/460 (54.57%), Positives = 329/460 (71.52%), Query Frame = 1

Query: 7   SKLHIAMFPWFAAGHMTPFLHISNELAARGHKITFLLPSKALPLLQNLNLHPNLISFHFL 66
           SK H  MFPWF  GHMT FLH++N+LA + HKITFLLP KA   L++LNL P+ I F  L
Sbjct: 3   SKFHAFMFPWFGFGHMTAFLHLANKLAEKDHKITFLLPKKARKQLESLNLFPDCIVFQTL 62

Query: 67  TVPHVSGLPPATETASDIPISLTPLLASAFDMTRQQVAEILCSACPDVVFYDFAYWVPEI 126
           T+P V GLP   ET SDIPISL   LASA D TR QV E +    PD++F+DFA+W+PEI
Sbjct: 63  TIPSVDGLPDGAETTSDIPISLGSFLASAMDRTRIQVKEAVSVGKPDLIFFDFAHWIPEI 122

Query: 127 AAPLRIKSVSFTVVSAASIAVIAYPRRRVTVDDPITEEELRKPPPGYPSSTVVLRGCREE 186
           A    +KSV+F  +SAA +A+   P R        ++++L   PPGYPSS V+LRG  E 
Sbjct: 123 AREYGVKSVNFITISAACVAISFVPGR--------SQDDLGSTPPGYPSSKVLLRG-HET 182

Query: 187 RSLLFLSMPFGEGGITFHERLMTSYRNSDAIAIRTCEEIEGNFCSYLAKQFQKKLLLTGP 246
            SL FLS PFG+G  +F+ER+M   +N D I+IRTC+E+EG FC ++  QFQ+K+LLTGP
Sbjct: 183 NSLSFLSYPFGDG-TSFYERIMIGLKNCDVISIRTCQEMEGKFCDFIENQFQRKVLLTGP 242

Query: 247 LMAIPNKTTTTTTTSCLDEKWEKWLDQFEPKTVIFCAFGSQLTLEKDQLQELVLGIEQTR 306
           ++  P+ +        L+++W +WL +F+P +VI+CA GSQ+ LEKDQ QEL LG+E T 
Sbjct: 243 MLPEPDNSKP------LEDQWRQWLSKFDPGSVIYCALGSQIILEKDQFQELCLGMELTG 302

Query: 307 LPFLVALKPPTGSSSIEEALPEGFEERVRERGAVYGGWVQQPLILKHPSVGCFVSHCGFG 366
           LPFLVA+KPP GSS+I+EALP+GFEERV+ RG V+GGWVQQPLIL HPS+GCFVSHCGFG
Sbjct: 303 LPFLVAVKPPKGSSTIQEALPKGFEERVKARGVVWGGWVQQPLILAHPSIGCFVSHCGFG 362

Query: 367 SMWESLMSEPQIVLIPSLGDQILNARLLAQELQVGVEVKKREEDGKFTSQSVREAIESVM 426
           SMWE+L+++ QIV IP LG+QILN RL+++EL+V VEV KREE G F+ +S+  A+ SVM
Sbjct: 363 SMWEALVNDCQIVFIPHLGEQILNTRLMSEELKVSVEV-KREETGWFSKESLSGAVRSVM 422

Query: 427 LVEAGGVGEMVKKNHKKWNHILTNPGFMDAYIHNFVNDLQ 467
             ++  +G   ++NH KW   L   G M  Y++ FV  L+
Sbjct: 423 DRDS-ELGNWARRNHVKWKESLLRHGLMSGYLNKFVEALE 444

BLAST of CmoCh04G005890 vs. TAIR10
Match: AT4G27560.1 (AT4G27560.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 496.5 bits (1277), Expect = 1.8e-140
Identity = 252/460 (54.78%), Positives = 321/460 (69.78%), Query Frame = 1

Query: 8   KLHIAMFPWFAAGHMTPFLHISNELAARGHKITFLLPSKALPLLQNLNLHPNLISFHFLT 67
           K H+ M+PWFA GHMTPFL ++N+LA +GH +TFL+P KAL  L+NLNL P+ I F  +T
Sbjct: 5   KFHVLMYPWFATGHMTPFLFLANKLAEKGHTVTFLIPKKALKQLENLNLFPHNIVFRSVT 64

Query: 68  VPHVSGLPPATETASDIPISLTPLLASAFDMTRQQVAEILCSACPDVVFYDFAYWVPEIA 127
           VPHV GLP  TET S+IP++   LL SA D+TR QV  ++ +  PD++F+DFA+W+PE+A
Sbjct: 65  VPHVDGLPVGTETVSEIPVTSADLLMSAMDLTRDQVEGVVRAVEPDLIFFDFAHWIPEVA 124

Query: 128 APLRIKSVSFTVVSAASIAVIAYPRRRVTVDDPITEEELRKPPPGYPSSTVVLRGCREER 187
               +K+V + VVSA++IA +  P             EL  PPPGYPSS V+LR      
Sbjct: 125 RDFGLKTVKYVVVSASTIASMLVPGG-----------ELGVPPPGYPSSKVLLRKQDAYT 184

Query: 188 SLLFLSMPFGEGGITFHERLMTSYRNSDAIAIRTCEEIEGNFCSYLAKQFQKKLLLTGPL 247
                S      G    ER+ TS  NSD IAIRT  EIEGNFC Y+ K  +KK+LLTGP+
Sbjct: 185 MKNLESTNTINVGPNLLERVTTSLMNSDVIAIRTAREIEGNFCDYIEKHCRKKVLLTGPV 244

Query: 248 MAIPNKTTTTTTTSCLDEKWEKWLDQFEPKTVIFCAFGSQLTLEKDQLQELVLGIEQTRL 307
              P+KT        L+E+W KWL  +EP +V+FCA GSQ+ LEKDQ QEL LG+E T  
Sbjct: 245 FPEPDKTRE------LEERWVKWLSGYEPDSVVFCALGSQVILEKDQFQELCLGMELTGS 304

Query: 308 PFLVALKPPTGSSSIEEALPEGFEERVRERGAVYGGWVQQPLILKHPSVGCFVSHCGFGS 367
           PFLVA+KPP GSS+I+EALPEGFEERV+ RG V+G WVQQPL+L HPSVGCFVSHCGFGS
Sbjct: 305 PFLVAVKPPRGSSTIQEALPEGFEERVKGRGVVWGEWVQQPLLLSHPSVGCFVSHCGFGS 364

Query: 368 MWESLMSEPQIVLIPSLGDQILNARLLAQELQVGVEVKKREEDGKFTSQSVREAIESVML 427
           MWESL+S+ QIVL+P LGDQ+LN RLL+ EL+V VEV  REE G F+ +S+ +AI SVM 
Sbjct: 365 MWESLLSDCQIVLVPQLGDQVLNTRLLSDELKVSVEV-AREETGWFSKESLFDAINSVMK 424

Query: 428 VEAGGVGEMVKKNHKKWNHILTNPGFMDAYIHNFVNDLQN 468
            ++  +G +VKKNH KW   LT+PG +  Y+ NF+  LQ+
Sbjct: 425 RDS-EIGNLVKKNHTKWRETLTSPGLVTGYVDNFIESLQD 445

BLAST of CmoCh04G005890 vs. TAIR10
Match: AT5G54060.1 (AT5G54060.1 UDP-glucose:flavonoid 3-o-glucosyltransferase)

HSP 1 Score: 495.4 bits (1274), Expect = 4.0e-140
Identity = 257/466 (55.15%), Positives = 331/466 (71.03%), Query Frame = 1

Query: 5   QSSKLHIAMFPWFAAGHMTPFLHISNELAARGHKITFLLPSKALPLLQNLNLHPNLISFH 64
           +SS + I M+PW A GHMTPFLH+SN+LA +GHKI FLLP KAL  L+ LNL+PNLI+FH
Sbjct: 8   ESSSMSIVMYPWLAFGHMTPFLHLSNKLAEKGHKIVFLLPKKALNQLEPLNLYPNLITFH 67

Query: 65  FLTVPHVSGLPPATETASDIPISLTPLLASAFDMTRQQVAEILCSACPDVVFYDFAYWVP 124
            +++P V GLPP  ET SD+P  LT LLA A D TR +V  I  +  PD+VFYD A+W+P
Sbjct: 68  TISIPQVKGLPPGAETNSDVPFFLTHLLAVAMDQTRPEVETIFRTIKPDLVFYDSAHWIP 127

Query: 125 EIAAPLRIKSVSFTVVSAASIAVIAYP--RRRVTVDDPITEEELRKPPPGYPSSTVVLRG 184
           EIA P+  K+V F +VSAASIA+   P   R V     ++ EEL K P GYPSS VVLR 
Sbjct: 128 EIAKPIGAKTVCFNIVSAASIALSLVPSAEREVIDGKEMSGEELAKTPLGYPSSKVVLRP 187

Query: 185 CREERSLLFLSMPFGEGGITFHERLMTSYRNSDAIAIRTCEEIEGNFCSYLAKQFQKKLL 244
             E +SL F+       G +F +  +T+ RN DAIAIRTC E EG FC Y+++Q+ K + 
Sbjct: 188 -HEAKSLSFVWRKHEAIG-SFFDGKVTAMRNCDAIAIRTCRETEGKFCDYISRQYSKPVY 247

Query: 245 LTGPLM--AIPNKTTTTTTTSCLDEKWEKWLDQFEPKTVIFCAFGSQLTLEK-DQLQELV 304
           LTGP++  + PN+ +       LD +W +WL +F   +V+FCAFGSQ  + K DQ QEL 
Sbjct: 248 LTGPVLPGSQPNQPS-------LDPQWAEWLAKFNHGSVVFCAFGSQPVVNKIDQFQELC 307

Query: 305 LGIEQTRLPFLVALKPPTGSSSIEEALPEGFEERVRERGAVYGGWVQQPLILKHPSVGCF 364
           LG+E T  PFLVA+KPP+G S++EEALPEGF+ERV+ RG V+GGW+QQPL+L HPSVGCF
Sbjct: 308 LGLESTGFPFLVAIKPPSGVSTVEEALPEGFKERVQGRGVVFGGWIQQPLVLNHPSVGCF 367

Query: 365 VSHCGFGSMWESLMSEPQIVLIPSLGDQILNARLLAQELQVGVEVKKREEDGKFTSQSVR 424
           VSHCGFGSMWESLMS+ QIVL+P  G+QILNARL+ +E++V VEV +RE+ G F+ QS+ 
Sbjct: 368 VSHCGFGSMWESLMSDCQIVLVPQHGEQILNARLMTEEMEVAVEV-EREKKGWFSRQSLE 427

Query: 425 EAIESVMLVEAGGVGEMVKKNHKKWNHILTNPGFMDAYIHNFVNDL 466
            A++SVM  E   +GE V+KNH KW  +LT+ GF D YI  F  +L
Sbjct: 428 NAVKSVM-EEGSEIGEKVRKNHDKWRCVLTDSGFSDGYIDKFEQNL 462

BLAST of CmoCh04G005890 vs. TAIR10
Match: AT4G27570.1 (AT4G27570.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 493.8 bits (1270), Expect = 1.2e-139
Identity = 251/462 (54.33%), Positives = 326/462 (70.56%), Query Frame = 1

Query: 8   KLHIAMFPWFAAGHMTPFLHISNELAARGHKITFLLPSKALPLLQNLNLHPNLISFHFLT 67
           K H+ M+PWFA GHMTPFL ++N+LA +GH +TFLLP K+L  L++ NL P+ I F  +T
Sbjct: 5   KFHVLMYPWFATGHMTPFLFLANKLAEKGHTVTFLLPKKSLKQLEHFNLFPHNIVFRSVT 64

Query: 68  VPHVSGLPPATETASDIPISLTPLLASAFDMTRQQVAEILCSACPDVVFYDFAYWVPEIA 127
           VPHV GLP  TETAS+IP++ T LL SA D+TR QV  ++ +  PD++F+DFA+W+PE+A
Sbjct: 65  VPHVDGLPVGTETASEIPVTSTDLLMSAMDLTRDQVEAVVRAVEPDLIFFDFAHWIPEVA 124

Query: 128 APLRIKSVSFTVVSAASIAVIAYPRRRVTVDDPITEEELRKPPPGYPSSTVVLRGCREER 187
               +K+V + VVSA++IA +  P             EL  PPPGYPSS V+LR  +++ 
Sbjct: 125 RDFGLKTVKYVVVSASTIASMLVPGG-----------ELGVPPPGYPSSKVLLR--KQDA 184

Query: 188 SLLFLSMPFG--EGGITFHERLMTSYRNSDAIAIRTCEEIEGNFCSYLAKQFQKKLLLTG 247
             +    P    + G    ER+ TS  NSD IAIRT  EIEGNFC Y+ K  +KK+LLTG
Sbjct: 185 YTMKKLEPTNTIDVGPNLLERVTTSLMNSDVIAIRTAREIEGNFCDYIEKHCRKKVLLTG 244

Query: 248 PLMAIPNKTTTTTTTSCLDEKWEKWLDQFEPKTVIFCAFGSQLTLEKDQLQELVLGIEQT 307
           P+   P+KT        L+E+W KWL  +EP +V+FCA GSQ+ LEKDQ QEL LG+E T
Sbjct: 245 PVFPEPDKTRE------LEERWVKWLSGYEPDSVVFCALGSQVILEKDQFQELCLGMELT 304

Query: 308 RLPFLVALKPPTGSSSIEEALPEGFEERVRERGAVYGGWVQQPLILKHPSVGCFVSHCGF 367
             PFLVA+KPP GSS+I+EALPEGFEERV+ RG V+GGWVQQPLIL HPSVGCFVSHCGF
Sbjct: 305 GSPFLVAVKPPRGSSTIQEALPEGFEERVKGRGLVWGGWVQQPLILSHPSVGCFVSHCGF 364

Query: 368 GSMWESLMSEPQIVLIPSLGDQILNARLLAQELQVGVEVKKREEDGKFTSQSVREAIESV 427
           GSMWESL+S+ QIVL+P LGDQ+LN RLL+ EL+V VEV  REE G F+ +S+ +A+ SV
Sbjct: 365 GSMWESLLSDCQIVLVPQLGDQVLNTRLLSDELKVSVEV-AREETGWFSKESLCDAVNSV 424

Query: 428 MLVEAGGVGEMVKKNHKKWNHILTNPGFMDAYIHNFVNDLQN 468
           M  ++  +G +V+KNH KW   + +PG M  Y+  FV  LQ+
Sbjct: 425 MKRDS-ELGNLVRKNHTKWRETVASPGLMTGYVDAFVESLQD 445

BLAST of CmoCh04G005890 vs. TAIR10
Match: AT5G53990.1 (AT5G53990.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 484.6 bits (1246), Expect = 7.1e-137
Identity = 247/458 (53.93%), Positives = 324/458 (70.74%), Query Frame = 1

Query: 10  HIAMFPWFAAGHMTPFLHISNELAARGHKITFLLPSKALPLLQNLNLHPNLISFHFLTVP 69
           H  MFPWFA GHMTP+LH++N+LAA+GH++TFLLP KA   L++ NL P+ I FH LT+P
Sbjct: 6   HAFMFPWFAFGHMTPYLHLANKLAAKGHRVTFLLPKKAQKQLEHHNLFPDRIIFHSLTIP 65

Query: 70  HVSGLPPATETASDIPISLTPLLASAFDMTRQQVAEILCSACPDVVFYDFAYWVPEIAAP 129
           HV GLP   ETASDIPISL   L +A D+TR QV   + +  PD++F+D AYWVPE+A  
Sbjct: 66  HVDGLPAGAETASDIPISLGKFLTAAMDLTRDQVEAAVRALRPDLIFFDTAYWVPEMAKE 125

Query: 130 LRIKSVSFTVVSAASIAVIAYPRRRVTVDDPITEEELRKPPPGYPSSTVVLRGCREERSL 189
            R+KSV + V+SA SIA            + +   EL  PPPGYPSS V+ RG  +  +L
Sbjct: 126 HRVKSVIYFVISANSIA-----------HELVPGGELGVPPPGYPSSKVLYRG-HDAHAL 185

Query: 190 LFLSMPFGEGGITFHERLMTSYRNSDAIAIRTCEEIEGNFCSYLAKQFQKKLLLTGPLMA 249
           L  S+ +       H R+ T  +N D I+IRTC+EIEG FC Y+ +Q+Q+K+LLTGP++ 
Sbjct: 186 LTFSIFYER----LHYRITTGLKNCDFISIRTCKEIEGKFCDYIERQYQRKVLLTGPMLP 245

Query: 250 IPNKTTTTTTTSCLDEKWEKWLDQFEPKTVIFCAFGSQLTLEKDQLQELVLGIEQTRLPF 309
            P+ +        L+++W  WL+QF+P +VI+CA GSQ+TLEKDQ QEL LG+E T LPF
Sbjct: 246 EPDNSRP------LEDRWNHWLNQFKPGSVIYCALGSQITLEKDQFQELCLGMELTGLPF 305

Query: 310 LVALKPPTGSSSIEEALPEGFEERVRERGAVYGGWVQQPLILKHPSVGCFVSHCGFGSMW 369
           LVA+KPP G+ +I+EALPEGFEERV+  G V+G WVQQPLIL HPSVGCFV+HCGFGSMW
Sbjct: 306 LVAVKPPKGAKTIQEALPEGFEERVKNHGVVWGEWVQQPLILAHPSVGCFVTHCGFGSMW 365

Query: 370 ESLMSEPQIVLIPSLGDQILNARLLAQELQVGVEVKKREEDGKFTSQSVREAIESVMLVE 429
           ESL+S+ QIVL+P L DQILN RL+++EL+V VEV KREE G F+ +S+  AI SVM  +
Sbjct: 366 ESLVSDCQIVLLPYLCDQILNTRLMSEELEVSVEV-KREETGWFSKESLSVAITSVMDKD 425

Query: 430 AGGVGEMVKKNHKKWNHILTNPGFMDAYIHNFVNDLQN 468
           +  +G +V++NH K   +L +PG +  Y   FV  LQN
Sbjct: 426 S-ELGNLVRRNHAKLKEVLVSPGLLTGYTDEFVETLQN 439

BLAST of CmoCh04G005890 vs. NCBI nr
Match: gi|659112002|ref|XP_008456016.1| (PREDICTED: LOW QUALITY PROTEIN: UDP-glycosyltransferase 79B6-like [Cucumis melo])

HSP 1 Score: 671.8 bits (1732), Expect = 8.8e-190
Identity = 331/471 (70.28%), Positives = 389/471 (82.59%), Query Frame = 1

Query: 3   ETQSSKLHIAMFPWFAAGHMTPFLHISNELAARGHKITFLLPSKALPLLQNLNLHPNLIS 62
           ETQS  LHI MFPWFA GH+TPFLHISN LA++ H+ITFLLP+   P   +LNL+PNLIS
Sbjct: 7   ETQS--LHILMFPWFATGHITPFLHISNHLASKNHRITFLLPNNPSPXFHSLNLYPNLIS 66

Query: 63  FHFLTVPHVSGLPPATETASDIPISLTPLLASAFDMTRQQVAEILCSACPDVVFYDFAYW 122
           FHFL++P V GLPPA  +ASDIP+SLTPLLASAFD+TR QV  I+ S  PD VF+DFA+W
Sbjct: 67  FHFLSLPSVPGLPPAAHSASDIPLSLTPLLASAFDLTRPQVHRIIHSLRPDFVFFDFAHW 126

Query: 123 VPEIAAPLRIKSVSFTVVSAASIAVIAYPRRRVTVDDPITEEELRKPPPGYPSSTVVLRG 182
           +P+I APL+I+S+ FTVVSAAS+AV  +P RRV++D P+T+E+ R+PP GYPSSTVV   
Sbjct: 127 IPDITAPLQIRSICFTVVSAASVAVTVFPGRRVSLDHPLTDEDFREPPVGYPSSTVVFHD 186

Query: 183 CREERSLLFLSMPFGEGGITFHERLMTSYRNSDAIAIRTCEEIEGNFCSYLAKQFQKKLL 242
            RE RSLLFLSMPFG+ GITFHERLMTSY+ SDAIA+RTC+EIEG+FC +L+ Q QKK+L
Sbjct: 187 SRESRSLLFLSMPFGQ-GITFHERLMTSYKKSDAIAMRTCQEIEGDFCDFLSNQLQKKIL 246

Query: 243 LTGPLMAIPNKTTTTTTTSCLDEKWEKWLDQFEPKTVIFCAFGSQLTLEKDQLQELVLGI 302
           LTGPLMA P+     TT   LD++WEKWL QF+PKTVIFCAFGSQ+ LEK QL+ELVLGI
Sbjct: 247 LTGPLMAAPSSRIKATT---LDKEWEKWLGQFQPKTVIFCAFGSQVILEKQQLEELVLGI 306

Query: 303 EQTRLPFLVALKPPTGSSSIEEALPEGFEERVRERGAVYGGWVQQPLILKHPSVGCFVSH 362
           EQT LPFLVALKPP G  S++EALP+GFEERV+ERG VYGGWVQQPLIL H S+GCFVSH
Sbjct: 307 EQTGLPFLVALKPPMGYDSMKEALPKGFEERVKERGIVYGGWVQQPLILNHSSIGCFVSH 366

Query: 363 CGFGSMWESLMSEPQIVLIPSLGDQILNARLLAQELQVGVEVKKREEDGKFTSQSVREAI 422
           CGFGSMWESLMS+ QIVLIP+LGDQILN RLLAQEL+VGVEV KREEDG FT QSVR+AI
Sbjct: 367 CGFGSMWESLMSDAQIVLIPTLGDQILNTRLLAQELKVGVEV-KREEDGSFTRQSVRQAI 426

Query: 423 ESVMLVE----AGGVGEMVKKNHKKWNHILTNPGFMDAYIHNFVNDLQNGW 470
           E VM+ +      G+GEMVKKNH KW H+LT P F++ YI NFV +LQ  W
Sbjct: 427 ELVMVDDNNNNGSGIGEMVKKNHAKWKHLLTKPSFLETYIDNFVMNLQEPW 470

BLAST of CmoCh04G005890 vs. NCBI nr
Match: gi|449457075|ref|XP_004146274.1| (PREDICTED: UDP-glycosyltransferase 79B6-like [Cucumis sativus])

HSP 1 Score: 664.1 bits (1712), Expect = 1.8e-187
Identity = 325/470 (69.15%), Positives = 386/470 (82.13%), Query Frame = 1

Query: 5   QSSKLHIAMFPWFAAGHMTPFLHISNELAARGHKITFLLPSKALPLLQNLNLHPNLISFH 64
           ++  LHI MFPWFA GH+TPFLHISN LA++ H+ITFLLP+    L  +LNL+P+LISFH
Sbjct: 6   ETQNLHILMFPWFATGHITPFLHISNHLASKNHRITFLLPNNPSSLFSSLNLYPDLISFH 65

Query: 65  FLTVPHVSGLPPATETASDIPISLTPLLASAFDMTRQQVAEILCSACPDVVFYDFAYWVP 124
           FL++P V GLPP+  +ASDIP+SLTPLLASA D+TR QV  I+ S  PD VF+DFA+W+P
Sbjct: 66  FLSLPSVPGLPPSAHSASDIPLSLTPLLASALDLTRPQVDRIIHSLRPDFVFFDFAHWIP 125

Query: 125 EIAAPLRIKSVSFTVVSAASIAVIAYPRRRVTVDDPITEEELRKPPPGYPSSTVVLRGCR 184
           +I APL+I+S+ FTVVSAAS+AV  +P RRV++D P+T+E+ R+PP GYPSSTVV  G R
Sbjct: 126 DITAPLQIRSICFTVVSAASVAVTVFPGRRVSLDHPLTDEDFREPPVGYPSSTVVFHGSR 185

Query: 185 EERSLLFLSMPFGEGGITFHERLMTSYRNSDAIAIRTCEEIEGNFCSYLAKQFQKKLLLT 244
           E RSLLFLSMPFG+ GITFHER MTSY+ SDAIA+RTC+EIEG+FC +L+ QFQKK+LLT
Sbjct: 186 ESRSLLFLSMPFGQ-GITFHERFMTSYKKSDAIAMRTCQEIEGDFCDFLSNQFQKKILLT 245

Query: 245 GPLMAIPNKTTTTTTTSCLDEKWEKWLDQFEPKTVIFCAFGSQLTLEKDQLQELVLGIEQ 304
           GPLMA P+     TT   LD++WEKWL QF+ KTVIFCAFGSQ+ LEK QL+ELVLGIEQ
Sbjct: 246 GPLMAAPSSKIKATT---LDKEWEKWLGQFQQKTVIFCAFGSQVILEKQQLEELVLGIEQ 305

Query: 305 TRLPFLVALKPPTGSSSIEEALPEGFEERVRERGAVYGGWVQQPLILKHPSVGCFVSHCG 364
           T LPFLVALKPP G  S+EEALP+GFEERV+ERG VYGGWVQQPLIL H S+GCFVSHCG
Sbjct: 306 TGLPFLVALKPPMGYDSMEEALPKGFEERVKERGIVYGGWVQQPLILNHSSIGCFVSHCG 365

Query: 365 FGSMWESLMSEPQIVLIPSLGDQILNARLLAQELQVGVEVKKREEDGKFTSQSVREAIES 424
           FGSMWESLMS+ QIVLIP+LGDQILN RLLAQEL+VGVEV KREEDG FT QSVR+AIE 
Sbjct: 366 FGSMWESLMSDAQIVLIPTLGDQILNTRLLAQELKVGVEV-KREEDGSFTRQSVRQAIEL 425

Query: 425 VMLVE----AGGVGEMVKKNHKKWNHILTNPGFMDAYIHNFVNDLQNGWT 471
           VM+ +      GVGE+VKKNH KW  +LT PGF++ YI NFV  LQ  W+
Sbjct: 426 VMVDDKNNNRSGVGEIVKKNHAKWKDLLTKPGFLETYIDNFVKKLQEPWS 470

BLAST of CmoCh04G005890 vs. NCBI nr
Match: gi|700202502|gb|KGN57635.1| (hypothetical protein Csa_3G236030 [Cucumis sativus])

HSP 1 Score: 664.1 bits (1712), Expect = 1.8e-187
Identity = 325/470 (69.15%), Positives = 386/470 (82.13%), Query Frame = 1

Query: 5   QSSKLHIAMFPWFAAGHMTPFLHISNELAARGHKITFLLPSKALPLLQNLNLHPNLISFH 64
           ++  LHI MFPWFA GH+TPFLHISN LA++ H+ITFLLP+    L  +LNL+P+LISFH
Sbjct: 56  ETQNLHILMFPWFATGHITPFLHISNHLASKNHRITFLLPNNPSSLFSSLNLYPDLISFH 115

Query: 65  FLTVPHVSGLPPATETASDIPISLTPLLASAFDMTRQQVAEILCSACPDVVFYDFAYWVP 124
           FL++P V GLPP+  +ASDIP+SLTPLLASA D+TR QV  I+ S  PD VF+DFA+W+P
Sbjct: 116 FLSLPSVPGLPPSAHSASDIPLSLTPLLASALDLTRPQVDRIIHSLRPDFVFFDFAHWIP 175

Query: 125 EIAAPLRIKSVSFTVVSAASIAVIAYPRRRVTVDDPITEEELRKPPPGYPSSTVVLRGCR 184
           +I APL+I+S+ FTVVSAAS+AV  +P RRV++D P+T+E+ R+PP GYPSSTVV  G R
Sbjct: 176 DITAPLQIRSICFTVVSAASVAVTVFPGRRVSLDHPLTDEDFREPPVGYPSSTVVFHGSR 235

Query: 185 EERSLLFLSMPFGEGGITFHERLMTSYRNSDAIAIRTCEEIEGNFCSYLAKQFQKKLLLT 244
           E RSLLFLSMPFG+ GITFHER MTSY+ SDAIA+RTC+EIEG+FC +L+ QFQKK+LLT
Sbjct: 236 ESRSLLFLSMPFGQ-GITFHERFMTSYKKSDAIAMRTCQEIEGDFCDFLSNQFQKKILLT 295

Query: 245 GPLMAIPNKTTTTTTTSCLDEKWEKWLDQFEPKTVIFCAFGSQLTLEKDQLQELVLGIEQ 304
           GPLMA P+     TT   LD++WEKWL QF+ KTVIFCAFGSQ+ LEK QL+ELVLGIEQ
Sbjct: 296 GPLMAAPSSKIKATT---LDKEWEKWLGQFQQKTVIFCAFGSQVILEKQQLEELVLGIEQ 355

Query: 305 TRLPFLVALKPPTGSSSIEEALPEGFEERVRERGAVYGGWVQQPLILKHPSVGCFVSHCG 364
           T LPFLVALKPP G  S+EEALP+GFEERV+ERG VYGGWVQQPLIL H S+GCFVSHCG
Sbjct: 356 TGLPFLVALKPPMGYDSMEEALPKGFEERVKERGIVYGGWVQQPLILNHSSIGCFVSHCG 415

Query: 365 FGSMWESLMSEPQIVLIPSLGDQILNARLLAQELQVGVEVKKREEDGKFTSQSVREAIES 424
           FGSMWESLMS+ QIVLIP+LGDQILN RLLAQEL+VGVEV KREEDG FT QSVR+AIE 
Sbjct: 416 FGSMWESLMSDAQIVLIPTLGDQILNTRLLAQELKVGVEV-KREEDGSFTRQSVRQAIEL 475

Query: 425 VMLVE----AGGVGEMVKKNHKKWNHILTNPGFMDAYIHNFVNDLQNGWT 471
           VM+ +      GVGE+VKKNH KW  +LT PGF++ YI NFV  LQ  W+
Sbjct: 476 VMVDDKNNNRSGVGEIVKKNHAKWKDLLTKPGFLETYIDNFVKKLQEPWS 520

BLAST of CmoCh04G005890 vs. NCBI nr
Match: gi|702377601|ref|XP_010062847.1| (PREDICTED: anthocyanidin 3-O-glucoside 2'''-O-xylosyltransferase-like [Eucalyptus grandis])

HSP 1 Score: 583.2 bits (1502), Expect = 4.1e-163
Identity = 290/466 (62.23%), Positives = 356/466 (76.39%), Query Frame = 1

Query: 1   MAETQSSKLHIAMFPWFAAGHMTPFLHISNELAARGHKITFLLPSKALPLLQNLNLHPNL 60
           M+E  +SK HIAMFPWFA GH+TPFLH+SNELA RGHKI+F LP KAL LL+NLNLHPNL
Sbjct: 1   MSEETNSKFHIAMFPWFAVGHVTPFLHLSNELAKRGHKISFFLPRKALILLENLNLHPNL 60

Query: 61  ISFHFLTVPHVSGLPPATETASDIPISLTPLLASAFDMTRQQVAEILCSACPDVVFYDFA 120
           I+FH LTVP V+ LPP TETASDIPIS  P LA A D+TR+Q+   L +  PD +FYD A
Sbjct: 61  ITFHPLTVPSVATLPPGTETASDIPISDAPSLAVAMDLTRRQLEVSLQAMRPDFIFYDTA 120

Query: 121 YWVPEIAAPLRIKSVSFTVVSAASIAVIAYPRRRVTVDDPITEEELRKPPPGYPSSTVVL 180
           +W+P++A PL I++V + VVSAA+IA++  P R VT   P+TEEEL KPP GYPS+TVVL
Sbjct: 121 HWIPQVARPLGIRTVCYNVVSAAAIAIVLVPAREVTPGKPLTEEELGKPPVGYPSNTVVL 180

Query: 181 RGCREERSLLFLSMPFGEGGITFHERLMTSYRNSDAIAIRTCEEIEGNFCSYLAKQFQKK 240
           RG  E RSL+F+S+PFG+  ITFH R  T+ R  DAIA+RTC EIEG+ C+Y++ Q+ K 
Sbjct: 181 RG-NEARSLIFISLPFGDN-ITFHGRTTTAMRECDAIAMRTCREIEGDLCAYISNQYGKP 240

Query: 241 LLLTGPLMAIPNKTTTTTTTSCLDEKWEKWLDQFEPKTVIFCAFGSQLTLEKDQLQELVL 300
           + LTGP++  P   T       LDEKW +WL +FEP +V+FCAFGSQ  LEK Q QEL+ 
Sbjct: 241 VFLTGPVLPEPAMET-------LDEKWAEWLSRFEPGSVVFCAFGSQHVLEKGQFQELLR 300

Query: 301 GIEQTRLPFLVALKPPTGSSSIEEALPEGFEERVRERGAVYGGWVQQPLILKHPSVGCFV 360
           G E T LPFL+AL+PP G++S+EEA PEGFEERVR RG V+GGWVQQPLIL HPSVGCFV
Sbjct: 301 GFESTGLPFLIALRPPIGTNSVEEAFPEGFEERVRGRGVVHGGWVQQPLILSHPSVGCFV 360

Query: 361 SHCGFGSMWESLMSEPQIVLIPSLGDQILNARLLAQELQVGVEVKKREEDGKFTSQSVRE 420
           +HCGFGSMWESL+ + QIV++P L DQILN RLLA EL+VGVEV+ REE G F+ +S+  
Sbjct: 361 NHCGFGSMWESLLGDCQIVMVPHLADQILNTRLLANELKVGVEVE-REESGWFSKESLCR 420

Query: 421 AIESVMLVEAGGVGEMVKKNHKKWNHILTNPGFMDAYIHNFVNDLQ 467
           AIESVM  E   VG +VKKNH KW  +L +P FM  YI+ F+ +LQ
Sbjct: 421 AIESVMY-EKSEVGLLVKKNHAKWREVLVSPNFMTGYINRFIQNLQ 455

BLAST of CmoCh04G005890 vs. NCBI nr
Match: gi|629104514|gb|KCW69983.1| (hypothetical protein EUGRSUZ_F03293, partial [Eucalyptus grandis])

HSP 1 Score: 575.5 bits (1482), Expect = 8.6e-161
Identity = 287/459 (62.53%), Positives = 351/459 (76.47%), Query Frame = 1

Query: 1   MAETQSSKLHIAMFPWFAAGHMTPFLHISNELAARGHKITFLLPSKALPLLQNLNLHPNL 60
           M+E  +SK HIAMFPWFA GH+TPFLH+SNELA RGHKI+F LP KAL LL+NLNLHPNL
Sbjct: 1   MSEETNSKFHIAMFPWFAVGHVTPFLHLSNELAKRGHKISFFLPRKALILLENLNLHPNL 60

Query: 61  ISFHFLTVPHVSGLPPATETASDIPISLTPLLASAFDMTRQQVAEILCSACPDVVFYDFA 120
           I+FH LTVP V+ LPP TETASDIPIS  P LA A D+TR+Q+   L +  PD +FYD A
Sbjct: 61  ITFHPLTVPSVATLPPGTETASDIPISDAPSLAVAMDLTRRQLEVSLQAMRPDFIFYDTA 120

Query: 121 YWVPEIAAPLRIKSVSFTVVSAASIAVIAYPRRRVTVDDPITEEELRKPPPGYPSSTVVL 180
           +W+P++A PL I++V + VVSAA+IA++  P R VT   P+TEEEL KPP GYPS+TVVL
Sbjct: 121 HWIPQVARPLGIRTVCYNVVSAAAIAIVLVPAREVTPGKPLTEEELGKPPVGYPSNTVVL 180

Query: 181 RGCREERSLLFLSMPFGEGGITFHERLMTSYRNSDAIAIRTCEEIEGNFCSYLAKQFQKK 240
           RG  E RSL+F+S+PFG+  ITFH R  T+ R  DAIA+RTC EIEG+ C+Y++ Q+ K 
Sbjct: 181 RG-NEARSLIFISLPFGD-NITFHGRTTTAMRECDAIAMRTCREIEGDLCAYISNQYGKP 240

Query: 241 LLLTGPLMAIPNKTTTTTTTSCLDEKWEKWLDQFEPKTVIFCAFGSQLTLEKDQLQELVL 300
           + LTGP++  P   T       LDEKW +WL +FEP +V+FCAFGSQ  LEK Q QEL+ 
Sbjct: 241 VFLTGPVLPEPAMET-------LDEKWAEWLSRFEPGSVVFCAFGSQHVLEKGQFQELLR 300

Query: 301 GIEQTRLPFLVALKPPTGSSSIEEALPEGFEERVRERGAVYGGWVQQPLILKHPSVGCFV 360
           G E T LPFL+AL+PP G++S+EEA PEGFEERVR RG V+GGWVQQPLIL HPSVGCFV
Sbjct: 301 GFESTGLPFLIALRPPIGTNSVEEAFPEGFEERVRGRGVVHGGWVQQPLILSHPSVGCFV 360

Query: 361 SHCGFGSMWESLMSEPQIVLIPSLGDQILNARLLAQELQVGVEVKKREEDGKFTSQSVRE 420
           +HCGFGSMWESL+ + QIV++P L DQILN RLLA EL+VGVEV +REE G F+ +S+  
Sbjct: 361 NHCGFGSMWESLLGDCQIVMVPHLADQILNTRLLANELKVGVEV-EREESGWFSKESLCR 420

Query: 421 AIESVMLVEAGGVGEMVKKNHKKWNHILTNPGFMDAYIH 460
           AIESVM  E   VG +VKKNH KW  +L +P FM  YI+
Sbjct: 421 AIESVM-YEKSEVGLLVKKNHAKWREVLVSPNFMTGYIN 448
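
The percentages on each HSP statistics line are the ratio of identical (or positive-scoring) positions to the aligned length. The following is a minimal Python sketch, not part of the database's own tooling, that parses such a line and recomputes both percentages. The names `HSP_STATS` and `parse_hsp_stats` are hypothetical, and the optional `i` in the pattern lets it also accept the misspelled `Postives` label that some BLAST text reports emit.

```python
import re

# Matches the per-HSP statistics line of a BLAST text report.
# "Posi?tives" accepts both "Positives" and the misspelled "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, aligned, _, positives, pos_aligned, _ = match.groups()
    identical, aligned = int(identical), int(aligned)
    positives, pos_aligned = int(positives), int(pos_aligned)
    return {
        "identical": identical,
        "aligned": aligned,
        # Percent identity = identical positions / aligned length.
        "identity_pct": round(100 * identical / aligned, 2),
        "positives": positives,
        # Percent positives = positive-scoring positions / aligned length.
        "positives_pct": round(100 * positives / pos_aligned, 2),
    }


# Statistics line of the AT5G54060.1 (TAIR10) hit.
stats = parse_hsp_stats(
    "Identity = 257/466 (55.15%), Positives = 331/466 (71.03%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For the AT5G54060.1 hit, 257/466 and 331/466 recompute to the 55.15% and 71.03% printed in the report.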

The following BLAST results are available for this feature:

Match Name   E-value   Identity  Description
U79B6_ARATH  7.6e-141  54.57     UDP-glycosyltransferase 79B6 OS=Arabidopsis thaliana GN=UGT79B6 PE=2 SV=1
U79B2_ARATH  3.2e-139  54.78     UDP-glycosyltransferase 79B2 OS=Arabidopsis thaliana GN=UGT79B2 PE=2 SV=1
AXYLT_ARATH  7.1e-139  55.15     Anthocyanidin 3-O-glucoside 2'''-O-xylosyltransferase OS=Arabidopsis thaliana GN...
U79B3_ARATH  2.1e-138  54.33     UDP-glycosyltransferase 79B3 OS=Arabidopsis thaliana GN=UGT79B3 PE=2 SV=1
U79B9_ARATH  1.3e-135  53.93     UDP-glycosyltransferase 79B9 OS=Arabidopsis thaliana GN=UGT79B9 PE=2 SV=1

Match Name        E-value   Identity  Description
A0A0A0L6N9_CUCSA  1.3e-187  69.15     Glycosyltransferase OS=Cucumis sativus GN=Csa_3G236030 PE=3 SV=1
A0A059BUT6_EUCGR  6.0e-161  62.53     Uncharacterized protein (Fragment) OS=Eucalyptus grandis GN=EUGRSUZ_F03293 PE=4 ...
B9SB92_RICCO      6.9e-157  59.96     UDP-glucosyltransferase, putative OS=Ricinus communis GN=RCOM_0649280 PE=4 SV=1
W9R7I3_9ROSA      9.0e-157  59.65     UDP-glycosyltransferase OS=Morus notabilis GN=L484_012162 PE=4 SV=1
A0A059BVI7_EUCGR  1.8e-152  59.91     Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_F03295 PE=4 SV=1

Match Name   E-value   Identity  Description
AT5G54010.1  4.3e-142  54.57     UDP-Glycosyltransferase superfamily protein
AT4G27560.1  1.8e-140  54.78     UDP-Glycosyltransferase superfamily protein
AT5G54060.1  4.0e-140  55.15     UDP-glucose:flavonoid 3-o-glucosyltransferase
AT4G27570.1  1.2e-139  54.33     UDP-Glycosyltransferase superfamily protein
AT5G53990.1  7.1e-137  53.93     UDP-Glycosyltransferase superfamily protein

Match Name                        E-value   Identity  Description
gi|659112002|ref|XP_008456016.1|  8.8e-190  70.28     PREDICTED: LOW QUALITY PROTEIN: UDP-glycosyltransferase 79B6-like [Cucumis melo]
gi|449457075|ref|XP_004146274.1|  1.8e-187  69.15     PREDICTED: UDP-glycosyltransferase 79B6-like [Cucumis sativus]
gi|700202502|gb|KGN57635.1|       1.8e-187  69.15     hypothetical protein Csa_3G236030 [Cucumis sativus]
gi|702377601|ref|XP_010062847.1|  4.1e-163  62.23     PREDICTED: anthocyanidin 3-O-glucoside 2'''-O-xylosyltransferase-like [Eucalyptu...
gi|629104514|gb|KCW69983.1|       8.6e-161  62.53     hypothetical protein EUGRSUZ_F03293, partial [Eucalyptus grandis]

The following terms have been associated with this gene:

Vocabulary: INTERPRO
Term       Definition
IPR002213  UDP_glucos_trans

Vocabulary: Molecular Function
Term        Definition
GO:0016758  transferase activity, transferring hexosyl groups

Vocabulary: Biological Process
Term        Definition
GO:0008152  metabolic process

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016758 transferase activity, transferring hexosyl groups
molecular_function GO:0047213 anthocyanidin 3-O-glucosyltransferase activity

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh04G005890.1  CmoCh04G005890.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                           Source   Source Term         Source Description                                             Alignment
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PANTHER  PTHR11926           GLUCOSYL/GLUCURONOSYL TRANSFERASES                             coord: 6..467; score: 1.5E
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PFAM     PF00201             UDPGT                                                          coord: 343..426; score: 2.
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PROSITE  PS00375             UDPGT                                                          coord: 344..387; scor
None       No IPR available                          GENE3D   G3DSA:3.40.50.2000                                                                 coord: 263..427; score: 1.5
None       No IPR available                          PANTHER  PTHR11926:SF326     ANTHOCYANIDIN 3-O-GLUCOSIDE 2'''-O-XYLOSYLTRANSFERASE-RELATED  coord: 6..467; score: 1.5E
None       No IPR available                          unknown  SSF53756            UDP-Glycosyltransferase/glycogen phosphorylase                 coord: 6..463; score: 9.71