CmoCh04G014940 (gene) Cucurbita moschata (Rifu)

Name: CmoCh04G014940
Type: gene
Organism: Cucurbita moschata (Rifu)
Description: UDP-glycosyltransferase 1
Location: Cmo_Chr04 : 7653568 .. 7655622 (+)
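
The Location field can be turned into a span length with a small parser. The sketch below assumes the `chromosome : start .. end (strand)` layout shown in this record and assumes the coordinates are 1-based and inclusive (the usual GFF convention, which the page does not state explicitly). The function name `parse_location` is hypothetical.

```python
import re


def parse_location(text):
    """Split a 'Cmo_Chr04 : 7653568 .. 7655622 (+)' style string into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "chrom": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }


loc = parse_location("Cmo_Chr04 : 7653568 .. 7655622 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 2055 bases on the plus strand of Cmo_Chr04.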
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTACCCAGAAACAAGCCATCAAAATCCTCATGCTCCCGTGGCTAGCCCACGGTCACATAACCCCATTCTTCGAGCTAGCCAAAAGGCTCGCAAAACACCGAACCATCTTCCAAATTTACGTATGTTCCTCCCTCGTAAACCTCCAAGCCATAGACCCAAACCTCGCCCGAACCCACTCCATTGAACTCGTCGAACTCCACCTCCCATCCCTCCCCGACCTACCCCCTCACATGCACACCACCAAAGGCATCCCCTTGCACCTCGAGCCCACTTTAATGAAGGCCTTCGACATGGCGGCCAAAGATTTCGAGCTACTCTTGGACCGGCTCGAGCCAGACCTTCTCGTTTCCGACTTGTTCCAGCCCTGGGCGGTTCAGGCGGCGGCGGCGAGGAATATTCCCGTGGTTAACTTTGTAGTCACTGGCGTGGCGATTCTCACGCGTTTGGTGCACGCGTTTTGTAACTCCGGTCGGGAATTTCCATTTCCGGAGATCGATTTTAGTGGGCATTGGAGCTCGAAGAGGGGACGGAAGGTTTCTGATGAAGTGGGCCGCGATTGGGCCTTGCGGGTTTTGAAGTGTTTGAGAATGTCTTCCGATGTGGTTTTAGTTAATACTTCCCCCGAATTTGAAGGGAAATATATCGACTTTCTTGCTTCCTCCTTGAACAAAAAGGTTCGTTCATTCTCTTACTCATTTTTTCTTTCCTTTAGAAATCGCTCTCATGATTTTGTGAGGTGAAGGCCTAATTGAGGTCCGACTCTTTTGTCGGAGTCTTCGAACAAATTCAATATTTGAGTCACTTTTAACTACACTTTGGAAAACTCTATTGACATGACTAAGCTAAAGACATGATTCTAATTTCATGGTTTTGAGAGCCGGAGGTCTCTAATAAGCCCTATGGAACCCTTGAACAACCTCTTCTTAATCAAGGTTCAACGGAAGTCTTCGAACAAAATAGATCATTTGTTCTACACAGTCACTTTTGACTACACCTTCTAAAAACTCTATTTGTTAGAAGTCACGACTCTCTACAATAGTATGATATTGTCCCCTCAGCATATGCTCTCTTGGCTTTGCTTTGAACTCCCCAAAAGACCTGAACTCCCCAAAAGACCTCATCACAATGGAGATTTTATTTCTTATAAACTCATGATCTTCCATTAAATTAATCCATGTAGTACTCACTCCCAATAATCCTTAACACTATTGACGTGATTAAATTAAAGATATGATTATGATAACATGTTAGGAATTTTAAATATAAGCTTGCTATTGAATTTATATTCTTTACTTATAAATCTATAATTATTTTCTAAATTAGTCAACATGGACTCTCTTCCAACGATCCTTAAGAGCTTCGGATTCATATCTCCTTTACAGATTCTTCCAATTGCTCCCGTGGTTCCACAAATCAAGCCGAACGGCAAGAAACCCGAAATTCTCAAATGGCTCGACAAGAAGAGCCCAAAATCGACGGTCTACGTTTCGTTTGGAAGTGAGTACTACCTGATGAACCAAGATAGGGAAGAGCTAGCCCACGGTCTAGAGCAAAGCCACGTGAATTTCATATGGGTCATAAGGTTTCCAAAGGGTGAGCGCCTCACAATCGAAGAGGCGTTACCGGAAGGCTTTATGAAGCGAGTAGGAGACAAAGGGCTGATCATGGAGGGATGGGCGCCGCAATTGGAAATATTGAACCATTCGAGCATTGGTGCGTTCGTGTGCCATTGCGGGTGGAACTCCGTGGTCGAGAGCATGGTGTTCGGGGTGCCCATCGTGGCACTACCGATGCAACTCGACCAGCCTTGCCACGCCAAGGTGGCTAACTTGGCTGGCGTGTGCGTGGAGGCCGAAAGGGACGATGAAGGGAATGTTAAGAGAGAAGGAGTGGCGAAGGCTATTAAAGAAGTGGTGTCTGAGGAGAGTGGTGAGGCTTTGAGAGGGAAAGCGAGGGAGATTGGTGAGGCTTTGAGGAAGAGGGAAGAAGGGATTGT
TGATGAAGTGGTTCGTGAATTTTGTAGCCTCTCACAACCCAAGAATGATGACTAA

mRNA sequence

ATGGCTACCCAGAAACAAGCCATCAAAATCCTCATGCTCCCGTGGCTAGCCCACGGTCACATAACCCCATTCTTCGAGCTAGCCAAAAGGCTCGCAAAACACCGAACCATCTTCCAAATTTACGTATGTTCCTCCCTCGTAAACCTCCAAGCCATAGACCCAAACCTCGCCCGAACCCACTCCATTGAACTCGTCGAACTCCACCTCCCATCCCTCCCCGACCTACCCCCTCACATGCACACCACCAAAGGCATCCCCTTGCACCTCGAGCCCACTTTAATGAAGGCCTTCGACATGGCGGCCAAAGATTTCGAGCTACTCTTGGACCGGCTCGAGCCAGACCTTCTCGTTTCCGACTTGTTCCAGCCCTGGGCGGTTCAGGCGGCGGCGGCGAGGAATATTCCCGTGGTTAACTTTGTAGTCACTGGCGTGGCGATTCTCACGCGTTTGGTGCACGCGTTTTGTAACTCCGGTCGGGAATTTCCATTTCCGGAGATCGATTTTAGTGGGCATTGGAGCTCGAAGAGGGGACGGAAGGTTTCTGATGAAGTGGGCCGCGATTGGGCCTTGCGGGTTTTGAAGTGTTTGAGAATGTCTTCCGATGTGGTTTTAGTTAATACTTCCCCCGAATTTGAAGGGAAATATATCGACTTTCTTGCTTCCTCCTTGAACAAAAAGATTCTTCCAATTGCTCCCGTGGTTCCACAAATCAAGCCGAACGGCAAGAAACCCGAAATTCTCAAATGGCTCGACAAGAAGAGCCCAAAATCGACGGTCTACGTTTCGTTTGGAAGTGAGTACTACCTGATGAACCAAGATAGGGAAGAGCTAGCCCACGGTCTAGAGCAAAGCCACGTGAATTTCATATGGGTCATAAGGTTTCCAAAGGGTGAGCGCCTCACAATCGAAGAGGCGTTACCGGAAGGCTTTATGAAGCGAGTAGGAGACAAAGGGCTGATCATGGAGGGATGGGCGCCGCAATTGGAAATATTGAACCATTCGAGCATTGGTGCGTTCGTGTGCCATTGCGGGTGGAACTCCGTGGTCGAGAGCATGGTGTTCGGGGTGCCCATCGTGGCACTACCGATGCAACTCGACCAGCCTTGCCACGCCAAGGTGGCTAACTTGGCTGGCGTGTGCGTGGAGGCCGAAAGGGACGATGAAGGGAATGTTAAGAGAGAAGGAGTGGCGAAGGCTATTAAAGAAGTGGTGTCTGAGGAGAGTGGTGAGGCTTTGAGAGGGAAAGCGAGGGAGATTGGTGAGGCTTTGAGGAAGAGGGAAGAAGGGATTGTTGATGAAGTGGTTCGTGAATTTTGTAGCCTCTCACAACCCAAGAATGATGACTAA
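
The mRNA sequence equals the gene sequence with one internal segment removed: the gene reads `...TTGAACAAAAAG` followed by `GTTCGTTC...` where the mRNA continues directly with `ATTCTTCC...`. A minimal sketch of how such a single intron could be located by comparing the two sequences is given below. It assumes exactly one intron and no other differences. The function name `locate_single_intron` is hypothetical, and the short sequences in the example are a shortened stand-in built from the exon flanks in this record, not the full intron.

```python
def locate_single_intron(gene, mrna):
    """Return (start, end) of the one segment of `gene` that is absent from `mrna`.

    Coordinates are 0-based, end-exclusive. Assumes a single intron and
    otherwise identical sequences. If the intron's first bases happen to
    repeat the start of the next exon, the reported boundary may be shifted
    to the right by that repeat.
    """
    # Walk the shared prefix (the first exon).
    prefix = 0
    while prefix < len(mrna) and gene[prefix] == mrna[prefix]:
        prefix += 1
    # Whatever is left of the mRNA must be the suffix of the gene (second exon).
    suffix = len(mrna) - prefix
    start, end = prefix, len(gene) - suffix
    if gene[end:] != mrna[prefix:]:
        raise ValueError("sequences differ by more than one inserted segment")
    return start, end


# Illustrative stand-in: real exon flanks, artificially shortened intron.
exon1 = "TTGAACAAAAAG"
toy_intron = "GTTCGTTCCTTTACAG"
exon2 = "ATTCTTCCAATT"
start, end = locate_single_intron(exon1 + toy_intron + exon2, exon1 + exon2)
print(start, end, (exon1 + toy_intron + exon2)[start:end])
```

The recovered segment begins with GT and ends with AG, the canonical splice-site dinucleotides, which is also true of the intron in the gene sequence above.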

Coding sequence (CDS)

ATGGCTACCCAGAAACAAGCCATCAAAATCCTCATGCTCCCGTGGCTAGCCCACGGTCACATAACCCCATTCTTCGAGCTAGCCAAAAGGCTCGCAAAACACCGAACCATCTTCCAAATTTACGTATGTTCCTCCCTCGTAAACCTCCAAGCCATAGACCCAAACCTCGCCCGAACCCACTCCATTGAACTCGTCGAACTCCACCTCCCATCCCTCCCCGACCTACCCCCTCACATGCACACCACCAAAGGCATCCCCTTGCACCTCGAGCCCACTTTAATGAAGGCCTTCGACATGGCGGCCAAAGATTTCGAGCTACTCTTGGACCGGCTCGAGCCAGACCTTCTCGTTTCCGACTTGTTCCAGCCCTGGGCGGTTCAGGCGGCGGCGGCGAGGAATATTCCCGTGGTTAACTTTGTAGTCACTGGCGTGGCGATTCTCACGCGTTTGGTGCACGCGTTTTGTAACTCCGGTCGGGAATTTCCATTTCCGGAGATCGATTTTAGTGGGCATTGGAGCTCGAAGAGGGGACGGAAGGTTTCTGATGAAGTGGGCCGCGATTGGGCCTTGCGGGTTTTGAAGTGTTTGAGAATGTCTTCCGATGTGGTTTTAGTTAATACTTCCCCCGAATTTGAAGGGAAATATATCGACTTTCTTGCTTCCTCCTTGAACAAAAAGATTCTTCCAATTGCTCCCGTGGTTCCACAAATCAAGCCGAACGGCAAGAAACCCGAAATTCTCAAATGGCTCGACAAGAAGAGCCCAAAATCGACGGTCTACGTTTCGTTTGGAAGTGAGTACTACCTGATGAACCAAGATAGGGAAGAGCTAGCCCACGGTCTAGAGCAAAGCCACGTGAATTTCATATGGGTCATAAGGTTTCCAAAGGGTGAGCGCCTCACAATCGAAGAGGCGTTACCGGAAGGCTTTATGAAGCGAGTAGGAGACAAAGGGCTGATCATGGAGGGATGGGCGCCGCAATTGGAAATATTGAACCATTCGAGCATTGGTGCGTTCGTGTGCCATTGCGGGTGGAACTCCGTGGTCGAGAGCATGGTGTTCGGGGTGCCCATCGTGGCACTACCGATGCAACTCGACCAGCCTTGCCACGCCAAGGTGGCTAACTTGGCTGGCGTGTGCGTGGAGGCCGAAAGGGACGATGAAGGGAATGTTAAGAGAGAAGGAGTGGCGAAGGCTATTAAAGAAGTGGTGTCTGAGGAGAGTGGTGAGGCTTTGAGAGGGAAAGCGAGGGAGATTGGTGAGGCTTTGAGGAAGAGGGAAGAAGGGATTGTTGATGAAGTGGTTCGTGAATTTTGTAGCCTCTCACAACCCAAGAATGATGACTAA
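
The protein used as the BLAST query can be recovered by translating the coding sequence. The sketch below uses the standard genetic code and stops at the first stop codon. The helper name `translate` is hypothetical, and only the first 60 bases and the last 12 bases of the CDS above are used as inputs.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a full codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGCTACCCAGAAACAAGCCATCAAAATCCTCATGCTCCCGTGGCTAGCCCACGGTCAC"
cds_end = "AATGATGACTAA"
print(translate(cds_start))
print(translate(cds_end))
```

The first 20 residues, `MATQKQAIKILMLPWLAHGH`, agree with the Query lines of the alignments that start at position 1.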
BLAST of CmoCh04G014940 vs. Swiss-Prot
Match: UGT9_GARJA (Beta-D-glucosyl crocetin beta-1,6-glucosyltransferase OS=Gardenia jasminoides GN=UGT94E5 PE=1 SV=1)

HSP 1 Score: 367.5 bits (942), Expect = 2.1e-100
Identity = 192/431 (44.55%), Positives = 270/431 (62.65%), Query Frame = 1

Query: 12  MLPWLAHGHITPFFELAKRLAKHRTIFQIYVCSSLVNLQAIDPNLARTHS--IELVELHL 71
           M PWLA+GHI+P+ ELAKRL      F IY+CS+ +NL  I   +   +S  I+LVELHL
Sbjct: 1   MFPWLAYGHISPYLELAKRLTDRG--FAIYICSTPINLGFIKKRITGKYSVTIKLVELHL 60

Query: 72  PSLPDLPPHMHTTKGIPLHLEPTLMKAFDMAAKDFELLLDRLEPDLLVSDLFQPWAVQAA 131
           P  P+LPPH HTT G+P HL  TL +A + A  +   +L  L+PD ++ D  Q W     
Sbjct: 61  PDTPELPPHYHTTNGLPPHLMATLKRALNGAKPELSNILKTLKPDFVIYDATQTWTAALT 120

Query: 132 AARNIPVVNFVVTGVAILTRLVHAFCNSGREFPFPEIDFSGHWSSKRGRKVSDEVGRDWA 191
            A NIP V F+ + V++L    H F   G EFPFP I  S    +K  R  + +   D  
Sbjct: 121 VAHNIPAVKFLTSSVSMLAYFCHLFMKPGIEFPFPAIYLSDFEQAK-ARTAAQDARADAE 180

Query: 192 LRVLKCLRMSSD---VVLVNTSPEFEGKYIDFLASSLNKKILPIAPVVPQ-IKPN--GKK 251
                  R + D   + LV +S   EGKYID+L   +  K+LP+  +V + +K +     
Sbjct: 181 ENDPAAERPNRDCDSIFLVKSSRAIEGKYIDYLFDLMKLKMLPVGMLVEEPVKDDQGDNS 240

Query: 252 PEILKWLDKKSPKSTVYVSFGSEYYLMNQDREELAHGLEQSHVNFIWVIRFPKGERLTIE 311
            E+++WL  KS +STV VSFG+EY+L  ++ EE+AHGLE S VNFIWV+RF  G+++  +
Sbjct: 241 NELIQWLGTKSQRSTVLVSFGTEYFLTKEEMEEIAHGLELSEVNFIWVVRFAMGQKIRPD 300

Query: 312 EALPEGFMKRVGDKGLIMEGWAPQLEILNHSSIGAFVCHCGWNSVVESMVFGVPIVALPM 371
           EALPEGF++RVGD+G I+EGWAPQ E+L H S G F+CHCGWNSVVES+ FGVP++A+PM
Sbjct: 301 EALPEGFLERVGDRGRIVEGWAPQSEVLAHPSTGGFICHCGWNSVVESIEFGVPVIAMPM 360

Query: 372 QLDQPCHAKVANLAGVCVEAERDDEGNVKREGVAKAIKEVVSEESGEALRGKAREIGEAL 431
            LDQP +A++    G  +E  RD+ G   R+ +A+AIK+ + E++GE  R K  ++   +
Sbjct: 361 HLDQPLNARLVVEIGAGMEVVRDETGKFDRKEIARAIKDAMVEKTGENTRAKMLDVKGRV 420

Query: 432 RKREEGIVDEV 435
             +E+  +DEV
Sbjct: 421 ELKEKQELDEV 428
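
The percentages in each HSP summary line are the match counts divided by the alignment length. A small sketch that parses such a line and recomputes them follows. The function name `hsp_fractions` is hypothetical, and the regular expression is deliberately tolerant of the "Postives" spelling that some reports use for "Positives".

```python
import re

# Matches 'Identity = 192/431' and 'Positives = 270/431' (or 'Postives').
STAT = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")


def hsp_fractions(line):
    """Return {'identity': (n, length, pct), 'positives': (n, length, pct)}."""
    stats = {}
    for name, count, length in STAT.findall(line):
        key = "identity" if name == "Identity" else "positives"
        count, length = int(count), int(length)
        stats[key] = (count, length, round(100 * count / length, 2))
    return stats


line = "Identity = 192/431 (44.55%), Positives = 270/431 (62.65%), Query Frame = 1"
stats = hsp_fractions(line)
print(stats)
```

Recomputing the values this way is a quick consistency check when summary lines are copied between reports.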

BLAST of CmoCh04G014940 vs. Swiss-Prot
Match: UGAT_BELPE (Cyanidin-3-O-glucoside 2-O-glucuronosyltransferase OS=Bellis perennis GN=UGAT PE=1 SV=1)

HSP 1 Score: 344.7 bits (883), Expect = 1.5e-93
Identity = 184/439 (41.91%), Positives = 267/439 (60.82%), Query Frame = 1

Query: 6   QAIKILMLPWLAHGHITPFFELAKRLAKHRTIFQIYVCSSLVNLQAIDPNLARTHS--IE 65
           +  +++MLPWLA+ HI+ F   AKRL  H   F IY+CSS  N+Q +  NL   +S  I+
Sbjct: 8   KTFRVVMLPWLAYSHISRFLVFAKRLTNHN--FHIYICSSQTNMQYLKNNLTSQYSKSIQ 67

Query: 66  LVELHLPSLPDLPPHMHTTKGIPLHLEPTLMKAFDMAAKDFELLLDRLEPDLLVSDLFQP 125
           L+EL+LPS  +LP   HTT G+P HL  TL   +  +  DFE +L +L P L++ D  Q 
Sbjct: 68  LIELNLPSSSELPLQYHTTHGLPPHLTKTLSDDYQKSGPDFETILIKLNPHLVIYDFNQL 127

Query: 126 WAVQAAAARNIPVVNFVVTGVAILTRLVHAFCNSGRE----FPFPEIDFSGHWSSKRGRK 185
           WA + A+  +IP +  +   VA+     H +     E    FPFPEI        K G K
Sbjct: 128 WAPEVASTLHIPSIQLLSGCVALYALDAHLYTKPLDENLAKFPFPEIYPKNRDIPKGGSK 187

Query: 186 VSDEVGRDWALRVLKCLRMSSDVVLVNTSPEFEGKYIDFLASSLNKKILPIAPVVPQIKP 245
             +        R + C+R S +++LV ++ E EGKYID+L+ +L KK+LP+ P+V +   
Sbjct: 188 YIE--------RFVDCMRRSCEIILVRSTMELEGKYIDYLSKTLGKKVLPVGPLVQEASL 247

Query: 246 -NGKKPEILKWLDKKSPKSTVYVSFGSEYYLMNQDREELAHGLEQSHVNFIWVIRFPKGE 305
                  I+KWLDKK   S V+V FGSEY L + + E++A+GLE S V+F+W IR     
Sbjct: 248 LQDDHIWIMKWLDKKEESSVVFVCFGSEYILSDNEIEDIAYGLELSQVSFVWAIR----- 307

Query: 306 RLTIEEALPEGFMKRVGDKGLIMEGWAPQLEILNHSSIGAFVCHCGWNSVVESMVFGVPI 365
               + +   GF+ RVGDKGL+++ W PQ  IL+HSS G F+ HCGW+S +ES+ +GVPI
Sbjct: 308 ---AKTSALNGFIDRVGDKGLVIDKWVPQANILSHSSTGGFISHCGWSSTMESIRYGVPI 367

Query: 366 VALPMQLDQPCHAKVANLAGVCVEAERDDEGNVKREGVAKAIKEVVSEESGEALRGKARE 425
           +A+PMQ DQP +A++    G  +E  RD EG +KRE +A  +++VV E+SGE++R KA+E
Sbjct: 368 IAMPMQFDQPYNARLMETVGAGIEVGRDGEGRLKREEIAAVVRKVVVEDSGESIREKAKE 427

Query: 426 IGEALRKREEGIVDEVVRE 438
           +GE ++K  E  VD +V E
Sbjct: 428 LGEIMKKNMEAEVDGIVIE 428

BLAST of CmoCh04G014940 vs. Swiss-Prot
Match: FLRT_CITMA (Flavanone 7-O-glucoside 2''-O-beta-L-rhamnosyltransferase OS=Citrus maxima GN=C12RT1 PE=1 SV=2)

HSP 1 Score: 322.4 bits (825), Expect = 7.8e-87
Identity = 181/442 (40.95%), Positives = 259/442 (58.60%), Query Frame = 1

Query: 10  ILMLPWLAHGHITPFFELAKRLAKHRTIFQIYVCSSLVNLQAIDPNLAR--THSIELVEL 69
           ILMLPWLAHGHI P  ELAK+L++    F IY CS+  NLQ+   N+ +  + SI+L+EL
Sbjct: 11  ILMLPWLAHGHIAPHLELAKKLSQKN--FHIYFCSTPNNLQSFGRNVEKNFSSSIQLIEL 70

Query: 70  HLPS-LPDLPPHMHTTKGIPLHLEPTLMKAFDMAAKDFELLLDRLEPDLLVSDLFQPWAV 129
            LP+  P+LP    TTK +P HL  TL+ AF+ A   F  +L+ L+P L++ DLFQPWA 
Sbjct: 71  QLPNTFPELPSQNQTTKNLPPHLIYTLVGAFEDAKPAFCNILETLKPTLVMYDLFQPWAA 130

Query: 130 QAAAARNIPVVNFVVTGVAILTRLVHAFCNSGREFPFPEIDFSGHWSSKRGR----KVSD 189
           +AA   +I  + F+       + L+H   N   ++PF E D+    S           + 
Sbjct: 131 EAAYQYDIAAILFLPLSAVACSFLLHNIVNPSLKYPFFESDYQDRESKNINYFLHLTANG 190

Query: 190 EVGRDWALRVLKCLRMSSDVVLVNTSPEFEGKYIDFLASSLNKKILPIAPVVPQIKPNGK 249
            + +D   R LK   +S   V + TS E E KY+D+  S +  +I+P+ P++ +      
Sbjct: 191 TLNKD---RFLKAFELSCKFVFIKTSREIESKYLDYFPSLMGNEIIPVGPLIQEPTFKED 250

Query: 250 KPEILKWLDKKSPKSTVYVSFGSEYYLMNQDREELAHGLEQSHVNFIWVIRFPKGERLTI 309
             +I+ WL +K P+S VY SFGSEY+    +  E+A GL  S VNFIW  R    E++TI
Sbjct: 251 DTKIMDWLSQKEPRSVVYASFGSEYFPSKDEIHEIASGLLLSEVNFIWAFRLHPDEKMTI 310

Query: 310 EEALPEGFMKRV--GDKGLIMEGWAPQLEILNHSSIGAFVCHCGWNSVVESMVFGVPIVA 369
           EEALP+GF + +   +KG+I++GW PQ +IL H SIG F+ HCGW SVVE MVFGVPI+ 
Sbjct: 311 EEALPQGFAEEIERNNKGMIVQGWVPQAKILRHGSIGGFLSHCGWGSVVEGMVFGVPIIG 370

Query: 370 LPMQLDQPCHAKVANLAGVCVEAERDD-EGNVKREGVAKAIKEVVSEESGEALRGKAREI 429
           +PM  +QP +AKV    G+ +   RD     +  E VA+ IK VV +E  + +R KA EI
Sbjct: 371 VPMAYEQPSNAKVVVDNGMGMVVPRDKINQRLGGEEVARVIKHVVLQEEAKQIRRKANEI 430

Query: 430 GEALRKREEGIVDEVVREFCSL 442
            E+++K  +  +  VV +   L
Sbjct: 431 SESMKKIGDAEMSVVVEKLLQL 447

BLAST of CmoCh04G014940 vs. Swiss-Prot
Match: U91C1_ARATH (UDP-glycosyltransferase 91C1 OS=Arabidopsis thaliana GN=UGT91C1 PE=2 SV=1)

HSP 1 Score: 210.3 bits (534), Expect = 4.3e-53
Identity = 147/457 (32.17%), Positives = 232/457 (50.77%), Query Frame = 1

Query: 4   QKQAIKILMLPWLAHGHITPFFELAKRLAK--HRTIFQIYVCSSLVNLQAIDPNLARTHS 63
           +++ + + M PWLA GH+ PF  L+K LA+  H+  F I    ++  L  +  NLA   S
Sbjct: 5   REEVMHVAMFPWLAMGHLLPFLRLSKLLAQKGHKISF-ISTPRNIERLPKLQSNLAS--S 64

Query: 64  IELVELHLPSLPDLPPHMHTTKGIPLHLEPTLMKAFDMAAKDFELLLDRLEPDLLVSDLF 123
           I  V   LP +  LPP   ++  +P + + +L  AFD+     +  L R  PD ++ D  
Sbjct: 65  ITFVSFPLPPISGLPPSSESSMDVPYNKQQSLKAAFDLLQPPLKEFLRRSSPDWIIYDYA 124

Query: 124 QPWAVQAAAARNIPVVNFVVTGVAIL------TRLVHAFCNSGREF-------PFPE-ID 183
             W    AA   I    F +   A L      + L+    ++  +F       PF   I 
Sbjct: 125 SHWLPSIAAELGISKAFFSLFNAATLCFMGPSSSLIEEIRSTPEDFTVVPPWVPFKSNIV 184

Query: 184 FSGHWSSKRGRKVSDEV-GRDWALRVLKCLRMSSDVVLVNTSPEFEGKYIDFLASSLNKK 243
           F  H  ++   K  ++V G   ++R    +   SD V V + PEFE ++   L     K 
Sbjct: 185 FRYHEVTRYVEKTEEDVTGVSDSVRFGYSID-ESDAVFVRSCPEFEPEWFGLLKDLYRKP 244

Query: 244 ILPIAPVVPQIKPNGKKP----EILKWLDKKSPKSTVYVSFGSEYYLMNQDREELAHGLE 303
           + PI  + P I+ +         I KWLDK+   S VYVS G+E  L +++  ELA GLE
Sbjct: 245 VFPIGFLPPVIEDDDAVDTTWVRIKKWLDKQRLNSVVYVSLGTEASLRHEEVTELALGLE 304

Query: 304 QSHVNFIWVIRFPKGERLTIEEALPEGFMKRVGDKGLIMEGWAPQLEILNHSSIGAFVCH 363
           +S   F WV+R         E  +P+GF  RV  +G++  GW PQ++IL+H S+G F+ H
Sbjct: 305 KSETPFFWVLRN--------EPKIPDGFKTRVKGRGMVHVGWVPQVKILSHESVGGFLTH 364

Query: 364 CGWNSVVESMVFGVPIVALPMQLDQPCHAKVANLAGVCVEAERDD-EGNVKREGVAKAIK 423
           CGWNSVVE + FG   +  P+  +Q  + ++ +  G+ VE  RD+ +G+   + VA +I+
Sbjct: 365 CGWNSVVEGLGFGKVPIFFPVLNEQGLNTRLLHGKGLGVEVSRDERDGSFDSDSVADSIR 424

Query: 424 EVVSEESGEALRGKAREIGEALRKREEGI--VDEVVR 437
            V+ +++GE +R KA+ + +     +E I  VDE+VR
Sbjct: 425 LVMIDDAGEEIRAKAKVMKDLFGNMDENIRYVDELVR 449

BLAST of CmoCh04G014940 vs. Swiss-Prot
Match: SCGT_TOBAC (Scopoletin glucosyltransferase OS=Nicotiana tabacum GN=TOGT1 PE=1 SV=1)

HSP 1 Score: 200.3 bits (508), Expect = 4.5e-50
Identity = 157/458 (34.28%), Positives = 218/458 (47.60%), Query Frame = 1

Query: 8   IKILMLPWLAHGHITPFFELAKRLAKHRTIFQIYVC--SSLVNLQAIDPNLARTHSIELV 67
           +     P +AHGH+ P  ++AK  A       I     +  V  +AI  N      IE+ 
Sbjct: 4   LHFFFFPVMAHGHMIPTLDMAKLFASRGVKATIITTPLNEFVFSKAIQRNKHLGIEIEIR 63

Query: 68  ELHLPSLPD-LPPHMHTTKGIPLHLE-PTLMKAFDMAAKDFELLLDRLEPDLLVSDLFQP 127
            +  P++ + LP        IP   + P   KA  M  +  E L++   PD L+SD+F P
Sbjct: 64  LIKFPAVENGLPEECERLDQIPSDEKLPNFFKAVAMMQEPLEQLIEECRPDCLISDMFLP 123

Query: 128 WAVQAAAARNIPVVNFVVTGVAIL-----TRLVHAFCNSGRE---FPFPEIDFSGHWSSK 187
           W    AA  NIP + F  T    L      RL   F N   +   F  P++    H    
Sbjct: 124 WTTDTAAKFNIPRIVFHGTSFFALCVENSVRLNKPFKNVSSDSETFVVPDLP---HEIKL 183

Query: 188 RGRKVS--DEVGRDWAL-RVLKCLRMSSDV---VLVNTSPEFEGKYIDFLASSLNKKILP 247
              +VS  +  G + A+ R++K +R S      V+ N+  E E  Y++     L ++   
Sbjct: 184 TRTQVSPFERSGEETAMTRMIKTVRESDSKSYGVVFNSFYELETDYVEHYTKVLGRRAWA 243

Query: 248 IAPV------VPQIKPNGKKPEI-----LKWLDKKSPKSTVYVSFGSEYYLMNQDREELA 307
           I P+      +      GKK  I     LKWLD K P S VYV FGS          ELA
Sbjct: 244 IGPLSMCNRDIEDKAERGKKSSIDKHECLKWLDSKKPSSVVYVCFGSVANFTASQLHELA 303

Query: 308 HGLEQSHVNFIWVIRFPKGERLTIEEALPEGFMKRVGDKGLIMEGWAPQLEILNHSSIGA 367
            G+E S   FIWV+R      L  E+ LPEGF +R  +KGLI+ GWAPQ+ IL+H S+GA
Sbjct: 304 MGIEASGQEFIWVVR----TELDNEDWLPEGFEERTKEKGLIIRGWAPQVLILDHESVGA 363

Query: 368 FVCHCGWNSVVESMVFGVPIVALPMQLDQPCHAKVANL-----AGV-CVEAERDDEGNVK 427
           FV HCGWNS +E +  GVP+V  P+  +Q  + K+        AGV  ++ +R     VK
Sbjct: 364 FVTHCGWNSTLEGVSGGVPMVTWPVFAEQFFNEKLVTEVLKTGAGVGSIQWKRSASEGVK 423

Query: 428 REGVAKAIKEVVSEESGEALRGKAREIGEALRKR-EEG 430
           RE +AKAIK V+  E  +  R +A+   E  RK  EEG
Sbjct: 424 REAIAKAIKRVMVSEEADGFRNRAKAYKEMARKAIEEG 454

BLAST of CmoCh04G014940 vs. TrEMBL
Match: F6I5W2_VITVI (Glycosyltransferase OS=Vitis vinifera GN=VIT_13s0074g00610 PE=3 SV=1)

HSP 1 Score: 440.3 bits (1131), Expect = 2.9e-120
Identity = 225/447 (50.34%), Positives = 301/447 (67.34%), Query Frame = 1

Query: 4   QKQAIKILMLPWLAHGHITPFFELAKRLAKHRTIFQIYVCSSLVNLQAIDPNLAR--THS 63
           ++  IK+L+LPWLAHGHI+PF EL+K+L K +  F IY CSS VNL  I   L    +HS
Sbjct: 5   RQSRIKVLVLPWLAHGHISPFLELSKQLMKQK--FYIYFCSSPVNLSRIKGKLTGNYSHS 64

Query: 64  IELVELHLPSLPDLPPHMHTTKGIPLHLEPTLMKAFDMAAKDFELLLDRLEPDLLVSDLF 123
           I+LVELHLPSLP+LPPH HTT G+P HL PTL  A DMA+  F  +L  L PDLL+ D  
Sbjct: 65  IQLVELHLPSLPELPPHYHTTNGLPPHLMPTLKMALDMASPSFTNILKTLSPDLLIYDFI 124

Query: 124 QPWAVQAAAARNIPVVNFVVTGVAILTRLVHAFCNSGREFPFPEIDFSGHWSSKRGRKVS 183
           QPWA  AAA+  IP V F+  G A    ++H     G EFPFPEI    + +S   R V 
Sbjct: 125 QPWAPAAAASLGIPSVQFLSNGAAATAFMIHFVKKPGNEFPFPEIYLRDYETSGFNRFVE 184

Query: 184 DEVG-RDWALRVLKCLRMSSDVVLVNTSPEFEGKYIDFLASSLNKKILPIAPVVP-QIKP 243
                R    +  +CL  SS+V+L+ +  E E ++IDFL++   K ++P+ P++  Q+  
Sbjct: 185 SSANARKDKEKARQCLEQSSNVILIRSFKEIEERFIDFLSNLNAKTVVPVGPLLQDQLDE 244

Query: 244 NGKKPEILKWLDKKSPKSTVYVSFGSEYYLMNQDREELAHGLEQSHVNFIWVIRFPKGER 303
              + E+++WL KK P S+V+VSFGSEY+L  ++ EE+A+GLE S VNFIWV+RFP G++
Sbjct: 245 EDAETEMVEWLSKKDPASSVFVSFGSEYFLSKEELEEVAYGLELSKVNFIWVVRFPMGDK 304

Query: 304 LTIEEALPEGFMKRVGDKGLIMEGWAPQLEILNHSSIGAFVCHCGWNSVVESMVFGVPIV 363
             +EEALPEGF+ RVGDKG+++EGWAPQ +IL HSSIG FV HCGW SV+ESM FGVPIV
Sbjct: 305 TRVEEALPEGFLSRVGDKGMVVEGWAPQKKILRHSSIGGFVSHCGWGSVMESMNFGVPIV 364

Query: 364 ALPMQLDQPCHAKVANLAGVCVEAERDDEGNVKREGVAKAIKEVVSEESGEALRGKAREI 423
           A+PM LDQP +AK+    GV +E +RD+ G ++RE +AK IKEVV ++ GE +R KARE 
Sbjct: 365 AMPMHLDQPFNAKLVEAHGVGIEVKRDENGKLQREEIAKVIKEVVVKKCGEIVRQKAREF 424

Query: 424 GEALRKREEGIVDEVVREFCSLSQPKN 447
            E + K+ +  +  VV +   L   +N
Sbjct: 425 SENMSKKGDEEIVGVVEKLVQLCGKRN 449

BLAST of CmoCh04G014940 vs. TrEMBL
Match: F6HIX7_VITVI (Glycosyltransferase OS=Vitis vinifera GN=VIT_13s0047g01230 PE=3 SV=1)

HSP 1 Score: 433.7 bits (1114), Expect = 2.7e-118
Identity = 217/443 (48.98%), Positives = 297/443 (67.04%), Query Frame = 1

Query: 2   ATQKQAIKILMLPWLAHGHITPFFELAKRLAKHRTIFQIYVCSSLVNLQAIDPNLARTHS 61
           A Q   I +LM PWLAHGHI+PF +LAK+L+K    F IY CS+ VNL  I   L+ ++S
Sbjct: 3   ARQSDGISVLMFPWLAHGHISPFLQLAKKLSKRN--FSIYFCSTPVNLDPIKGKLSESYS 62

Query: 62  --IELVELHLPSLPDLPPHMHTTKGIPLHLEPTLMKAFDMAAKDFELLLDRLEPDLLVSD 121
             I+LV+LHLPSLP+LPP  HTT G+P HL PTL  AFDMA+ +F  +L  L PDLL+ D
Sbjct: 63  LSIQLVKLHLPSLPELPPQYHTTNGLPPHLMPTLKMAFDMASPNFSNILKTLHPDLLIYD 122

Query: 122 LFQPWAVQAAAARNIPVVNFVVTGVAILTRLVHAFCNSGREFPFPEIDFSGHWSSKRGRK 181
             QPWA  AA++ NIP V F+ TG  + + L H     G EFPF EI    +   +  R 
Sbjct: 123 FLQPWAPAAASSLNIPAVQFLSTGATLQSFLAHRHRKPGIEFPFQEIHLPDYEIGRLNRF 182

Query: 182 VSDEVGR-DWALRVLKCLRMSSDVVLVNTSPEFEGKYIDFLASSLNKKILPIAPVVPQIK 241
           +    GR     R  +CL  SS   L+ T  E E KY+D+++    KK++ + P++   +
Sbjct: 183 LEPSAGRISDRDRANQCLERSSRFSLIKTFREIEAKYLDYVSDLTKKKMVTVGPLLQDPE 242

Query: 242 PNGKKPEILKWLDKKSPKSTVYVSFGSEYYLMNQDREELAHGLEQSHVNFIWVIRFPKGE 301
              +  +I++WL+KK   S V+VSFGSEY++  ++ EE+AHGLE S+V+FIWV+RFP GE
Sbjct: 243 DEDEATDIVEWLNKKCEASAVFVSFGSEYFVSKEEMEEIAHGLELSNVDFIWVVRFPMGE 302

Query: 302 RLTIEEALPEGFMKRVGDKGLIMEGWAPQLEILNHSSIGAFVCHCGWNSVVESMVFGVPI 361
           ++ +E+ALP GF+ R+GD+G+++EGWAPQ +IL HSSIG FV HCGW+SV+E M FGVPI
Sbjct: 303 KIRLEDALPPGFLHRLGDRGMVVEGWAPQRKILGHSSIGGFVSHCGWSSVMEGMKFGVPI 362

Query: 362 VALPMQLDQPCHAKVANLAGVCVEAERDDEGNVKREGVAKAIKEVVSEESGEALRGKARE 421
           +A+PM LDQP +AK+    GV  E +RD+   ++RE +AK IKEVV E++GE +R KARE
Sbjct: 363 IAMPMHLDQPINAKLVEAVGVGREVKRDENRKLEREEIAKVIKEVVGEKNGENVRRKARE 422

Query: 422 IGEALRKREEGIVDEVVREFCSL 442
           + E LRK+ +  +D VV E   L
Sbjct: 423 LSETLRKKGDEEIDVVVEELKQL 443

BLAST of CmoCh04G014940 vs. TrEMBL
Match: K4D510_SOLLC (Glycosyltransferase OS=Solanum lycopersicum PE=3 SV=1)

HSP 1 Score: 431.8 bits (1109), Expect = 1.0e-117
Identity = 211/447 (47.20%), Positives = 299/447 (66.89%), Query Frame = 1

Query: 1   MATQKQAIKILMLPWLAHGHITPFFELAKRLAKHRTIFQIYVCSSLVNLQAIDPNLARTH 60
           M  +K  + +LMLPWLAHGHI PF ELAK+LA     F IY+CS+LVNL +I   +   +
Sbjct: 1   MEAKKHTMSVLMLPWLAHGHINPFLELAKKLASKN--FDIYLCSTLVNLLSIKKRVGEKY 60

Query: 61  S--IELVELHLPSLPDLPPHMHTTKGIPLHLEPTLMKAFDMAAKDFELLLDRLEPDLLVS 120
           S  IEL+ELHLPSLPDLPPH HTT G+P HL  TL  AF++A+ +F  +L  L PDL++ 
Sbjct: 61  SESIELIELHLPSLPDLPPHYHTTNGLPPHLMNTLKTAFELASPNFSKILQTLRPDLVIH 120

Query: 121 DLFQPWAVQAAAARNIPVVNFVVTGVAILTRLVHAFCNSGREFPFPEIDFSGHWSSKRGR 180
           D  QPW   +A++ NIP V F      ++   +H   N+  +FPFPEI    H      +
Sbjct: 121 DYNQPWVTDSASSMNIPAVQFPTFSATVVALSIHMSENTTEKFPFPEIYLREHEMISLKK 180

Query: 181 KVSDEVGRDWALRVLKCLRMSSDVVLVNTSPEFEGKYIDFLASSLNKKILPIAPVVPQIK 240
            +++   + +     + +R S D++LV T  +FEGKYID+L++  +KK++P+  +V +  
Sbjct: 181 DINEVPSKKFPYD--EAIRRSHDIILVKTCRDFEGKYIDYLSNLTSKKVVPVGSLVQETM 240

Query: 241 PNGKKPEILKWLDKKSPKSTVYVSFGSEYYLMNQDREELAHGLEQSHVNFIWVIRFPKGE 300
                 EI +WLDKK   STV+VSFGSEY+L  ++   +A GLE S VNFIWVIRFP+GE
Sbjct: 241 DQDDYKEIAQWLDKKEKSSTVFVSFGSEYFLSKEEILAVAQGLELSKVNFIWVIRFPQGE 300

Query: 301 RLTIEEALPEGFMKRVGDKGLIMEGWAPQLEILNHSSIGAFVCHCGWNSVVESMVFGVPI 360
           R+ I +ALP+ +++RVG++G+++EGWAPQ  IL H SIG FV HCGW+S +ESM FGVPI
Sbjct: 301 RMNIRDALPKEYLERVGERGMVIEGWAPQATILQHPSIGGFVSHCGWSSFMESMKFGVPI 360

Query: 361 VALPMQLDQPCHAKVANLAGVCVEAERDDEGNVKREGVAKAIKEVVSEESGEALRGKARE 420
           +A+PM +DQP +A++    GV VEA +D++G ++ E +AKAI+EVV+EESGE +R K +E
Sbjct: 361 IAMPMHIDQPMNARLVEYIGVGVEAAKDEDGKLQSEEIAKAIREVVAEESGEDVRKKVKE 420

Query: 421 IGEALRKREEGIVDEVVREFCSLSQPK 446
           + E +  +E+  +D V  E  +L   K
Sbjct: 421 VSEKMNAKEDEEIDGVAEELMALRTNK 443

BLAST of CmoCh04G014940 vs. TrEMBL
Match: M1BPV9_SOLTU (Glycosyltransferase OS=Solanum tuberosum GN=PGSC0003DMG400019483 PE=3 SV=1)

HSP 1 Score: 429.9 bits (1104), Expect = 3.9e-117
Identity = 218/451 (48.34%), Positives = 304/451 (67.41%), Query Frame = 1

Query: 1   MATQKQAIKILMLPWLAHGHITPFFELAKRLAKHRTIFQIYVCSSLVNLQAIDPNLARTH 60
           M  +K  I ILMLPWLAHGHI+PF ELAK+L      F IY+CS+ +NL +I  N+ + +
Sbjct: 1   MEAKKNTISILMLPWLAHGHISPFLELAKKLTNRN--FHIYLCSTPINLSSIKKNVTKKY 60

Query: 61  --SIELVELHLPSLPDLPPHMHTTKGIPLHLEPTLMKAFDMAAKDFELLLDRLEPDLLVS 120
             SIELVELHLPSLP+LPPH HTT G+P HL  TL KAF+ A+ +F  +L  L PDL++ 
Sbjct: 61  CESIELVELHLPSLPNLPPHYHTTNGLPPHLMNTLKKAFENASPNFSKILQTLNPDLVIY 120

Query: 121 DLFQPWAVQAAAARNIPVVNFVVTGVAILTRL-VHA-FCNSGREFPFPEIDFSGHWSSKR 180
           D  QPWA + A++ NIP + F+    AI+  L +H  F  SG +FPFPEI    H   + 
Sbjct: 121 DFNQPWAAEFASSMNIPAIQFLTFSAAIVALLALHIMFDKSGEKFPFPEIYLREHEMLQI 180

Query: 181 GRKVSDEVGRDWALRVLKCLRMSSDVVLVNTSPEFEGKYIDFLASSLNKKILPIAPVVPQ 240
            + + +    ++       LR+S D+VLV TS +FEGKYID+L+  ++KKI+P+  +V  
Sbjct: 181 KKSLEESKDENYKDPFNDALRLSRDIVLVKTSRDFEGKYIDYLSKLVSKKIVPVGSLVQD 240

Query: 241 IKPNGKKPE--ILKWLDKKSPKSTVYVSFGSEYYLMNQDREELAHGLEQSHVNFIWVIRF 300
                   E  I++WLDKK   S+V+VSFGSEY+L  ++  E+A GLE S VNFIWVIRF
Sbjct: 241 SIDQDHDHEEIIMQWLDKKEKCSSVFVSFGSEYFLSKEEMHEVAQGLEFSKVNFIWVIRF 300

Query: 301 PKGERLTIEEALPEGFMKRVGDKGLIMEGWAPQLEILNHSSIGAFVCHCGWNSVVESMVF 360
           P+GE+ +I++ LP+GF++RVG++G+++E WAPQ  IL H S G FV HCGW+SV+ESM F
Sbjct: 301 PQGEKNSIQDVLPQGFLERVGERGMVLEKWAPQAAILQHRSTGGFVSHCGWSSVMESMKF 360

Query: 361 GVPIVALPMQLDQPCHAKVANLAGVCVEAERDDEGNVKREGVAKAIKEVVSEESGEALRG 420
           GVPI+A+PM +DQP +A++    G+ VEA RD+ G ++ E +AK ++EVV EESGE +R 
Sbjct: 361 GVPIIAMPMHIDQPMNARIVEYIGMGVEALRDENGKLQSEEIAKVMREVVIEESGEGVRK 420

Query: 421 KAREIGEALRKREEGIVDEVVREFCSLSQPK 446
           K +E+ E +  + +  +D VV E  +L   K
Sbjct: 421 KTKELSEKMNMKGDEEIDGVVEELVALCNNK 449

BLAST of CmoCh04G014940 vs. TrEMBL
Match: F6I0D2_VITVI (Glycosyltransferase OS=Vitis vinifera GN=VIT_04s0044g01540 PE=3 SV=1)

HSP 1 Score: 429.1 bits (1102), Expect = 6.6e-117
Identity = 219/441 (49.66%), Positives = 302/441 (68.48%), Query Frame = 1

Query: 5   KQAIKILMLPWLAHGHITPFFELAKRLAKHRTIFQIYVCSSLVNLQAIDPNLARTHS--I 64
           + ++K+++LPWLAHGHI+PF ELAK+L++    F IY CS+ VNL +I   L    S  I
Sbjct: 4   RSSMKVVLLPWLAHGHISPFLELAKKLSRRN--FYIYFCSTPVNLSSIKGKLTEEDSLSI 63

Query: 65  ELVELHLPSLPDLPPHMHTTKGIPLHLEPTLMKAFDMAAKDFELLLDRLEPDLLVSDLFQ 124
           ELVE+HLPSLPDLPPH  TT G+P HL PTL KAFDMA+  F  +L  L PDL++ D+ Q
Sbjct: 64  ELVEIHLPSLPDLPPHYQTTNGLPPHLMPTLKKAFDMASPGFADILTTLNPDLIIYDILQ 123

Query: 125 PWAVQAAAARNIPVVNFVVTGVAILTRLVHAFCNSGREFPFPE-IDFSGHWSSKRGRKVS 184
           PWA  AA+++NIP V F+ TG  +L+ L+     +G      E I    H +     +++
Sbjct: 124 PWAPVAASSQNIPAVLFLSTGATLLSVLLQEQPITGIPLQDSERIKMLNHLADSSANEIT 183

Query: 185 DEVGRDWALRVLKCLRMSSDVVLVNTSPEFEGKYIDFLASSLNKKILPIAPVVPQIKPNG 244
           DE       R  +CL++SS+++L+ T  + EGK+ID  +    KK++P+ P+V       
Sbjct: 184 DEA------RAAQCLKLSSNIILMRTFRDLEGKHIDQASCLTQKKVVPVGPLVQHTTDEF 243

Query: 245 KKPEILKWLDKKSPKSTVYVSFGSEYYLMNQDREELAHGLEQSHVNFIWVIRFPKGERL- 304
           +K EI++WLDKK   STV VSFGSEY+L  ++ EE+AH LE S V+FIWV+RFP+ +++ 
Sbjct: 244 EKEEIIEWLDKKEESSTVLVSFGSEYFLSKEEMEEMAHALELSTVSFIWVLRFPQRDKIA 303

Query: 305 TIEEALPEGFMKRVGDKGLIMEGWAPQLEILNHSSIGAFVCHCGWNSVVESMVFGVPIVA 364
           ++EEALPEGF+ RVG++G +++ WAPQ EILNHSS G FV HCGW+SV+ES+ FGVPIVA
Sbjct: 304 SVEEALPEGFLSRVGERGKVVKDWAPQREILNHSSTGGFVSHCGWSSVMESLKFGVPIVA 363

Query: 365 LPMQLDQPCHAKVANLAGVCVEAERDDEGNVKREGVAKAIKEVVSEESGEALRGKAREIG 424
           +PM LDQP +AKV    GV VE +RD+ G + RE +AK IK+VV E+SGE +  K RE+ 
Sbjct: 364 IPMHLDQPLNAKVVESVGVGVEVKRDENGRLDREEIAKVIKQVVVEKSGENVSRKVREMS 423

Query: 425 EALRKREEGIVDEVVREFCSL 442
           E++RK+ E  + EVV E   L
Sbjct: 424 ESMRKQAEEEIAEVVEELVQL 436

BLAST of CmoCh04G014940 vs. TAIR10
Match: AT5G49690.1 (AT5G49690.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 210.3 bits (534), Expect = 2.4e-54
Identity = 147/457 (32.17%), Positives = 232/457 (50.77%), Query Frame = 1

Query: 4   QKQAIKILMLPWLAHGHITPFFELAKRLAK--HRTIFQIYVCSSLVNLQAIDPNLARTHS 63
           +++ + + M PWLA GH+ PF  L+K LA+  H+  F I    ++  L  +  NLA   S
Sbjct: 5   REEVMHVAMFPWLAMGHLLPFLRLSKLLAQKGHKISF-ISTPRNIERLPKLQSNLAS--S 64

Query: 64  IELVELHLPSLPDLPPHMHTTKGIPLHLEPTLMKAFDMAAKDFELLLDRLEPDLLVSDLF 123
           I  V   LP +  LPP   ++  +P + + +L  AFD+     +  L R  PD ++ D  
Sbjct: 65  ITFVSFPLPPISGLPPSSESSMDVPYNKQQSLKAAFDLLQPPLKEFLRRSSPDWIIYDYA 124

Query: 124 QPWAVQAAAARNIPVVNFVVTGVAIL------TRLVHAFCNSGREF-------PFPE-ID 183
             W    AA   I    F +   A L      + L+    ++  +F       PF   I 
Sbjct: 125 SHWLPSIAAELGISKAFFSLFNAATLCFMGPSSSLIEEIRSTPEDFTVVPPWVPFKSNIV 184

Query: 184 FSGHWSSKRGRKVSDEV-GRDWALRVLKCLRMSSDVVLVNTSPEFEGKYIDFLASSLNKK 243
           F  H  ++   K  ++V G   ++R    +   SD V V + PEFE ++   L     K 
Sbjct: 185 FRYHEVTRYVEKTEEDVTGVSDSVRFGYSID-ESDAVFVRSCPEFEPEWFGLLKDLYRKP 244

Query: 244 ILPIAPVVPQIKPNGKKP----EILKWLDKKSPKSTVYVSFGSEYYLMNQDREELAHGLE 303
           + PI  + P I+ +         I KWLDK+   S VYVS G+E  L +++  ELA GLE
Sbjct: 245 VFPIGFLPPVIEDDDAVDTTWVRIKKWLDKQRLNSVVYVSLGTEASLRHEEVTELALGLE 304

Query: 304 QSHVNFIWVIRFPKGERLTIEEALPEGFMKRVGDKGLIMEGWAPQLEILNHSSIGAFVCH 363
           +S   F WV+R         E  +P+GF  RV  +G++  GW PQ++IL+H S+G F+ H
Sbjct: 305 KSETPFFWVLRN--------EPKIPDGFKTRVKGRGMVHVGWVPQVKILSHESVGGFLTH 364

Query: 364 CGWNSVVESMVFGVPIVALPMQLDQPCHAKVANLAGVCVEAERDD-EGNVKREGVAKAIK 423
           CGWNSVVE + FG   +  P+  +Q  + ++ +  G+ VE  RD+ +G+   + VA +I+
Sbjct: 365 CGWNSVVEGLGFGKVPIFFPVLNEQGLNTRLLHGKGLGVEVSRDERDGSFDSDSVADSIR 424

Query: 424 EVVSEESGEALRGKAREIGEALRKREEGI--VDEVVR 437
            V+ +++GE +R KA+ + +     +E I  VDE+VR
Sbjct: 425 LVMIDDAGEEIRAKAKVMKDLFGNMDENIRYVDELVR 449
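
This TAIR10 hit has the same bit score (210.3) and the same alignment as the U91C1_ARATH hit in the Swiss-Prot search, yet its E-value is smaller (2.4e-54 versus 4.3e-53). That is consistent with how BLAST E-values behave: for a fixed bit score S, E is proportional to the effective search space (E = m·n·2^(−S)), so a smaller database gives a smaller E-value. The sketch below only divides the two reported values. Reading the result as a search-space ratio assumes both searches used the same scoring parameters, which this page does not state, and the function name is hypothetical.

```python
def search_space_ratio(e_value_a, e_value_b):
    """Ratio of two E-values reported for the same bit score.

    Under E = m * n * 2**(-S), this equals the ratio of the effective
    search spaces of the two searches (an assumption, see the text above).
    """
    return e_value_a / e_value_b


swissprot_e = 4.3e-53  # U91C1_ARATH hit in the Swiss-Prot search
tair10_e = 2.4e-54     # AT5G49690.1 hit in the TAIR10 search
ratio = search_space_ratio(swissprot_e, tair10_e)
print(round(ratio, 1))
```

Because the reported E-values are rounded to two significant figures, the ratio is only approximate.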

BLAST of CmoCh04G014940 vs. TAIR10
Match: AT4G34131.1 (AT4G34131.1 UDP-glucosyl transferase 73B3)

HSP 1 Score: 182.2 bits (461), Expect = 7.1e-46
Identity = 146/461 (31.67%), Positives = 222/461 (48.16%), Query Frame = 1

Query: 8   IKILMLPWLAHGHITPFFELAKRLAKHRTIFQIYVC--SSLVNLQAID--PNLARTHSIE 67
           + ++  P++A+GH+ P  ++AK  +       I     +S +  + I+   NL  +  I+
Sbjct: 9   LHVVFFPFMAYGHMIPTLDMAKLFSSRGAKSTILTTPLNSKIFQKPIERFKNLNPSFEID 68

Query: 68  L-------VELHLPS-LPDLPPHMHTTKGIPLHLEPTLMKAFDMAAKDFELLLDRLEPDL 127
           +       V+L LP    ++            +L     K+        E LL+   PD 
Sbjct: 69  IQIFDFPCVDLGLPEGCENVDFFTSNNNDDRQYLTLKFFKSTRFFKDQLEKLLETTRPDC 128

Query: 128 LVSDLFQPWAVQAAAARNIPVVNFVVTGVAILTR----LVHAFCN--SGREFPFPEIDFS 187
           L++D+F PWA +AA   N+P + F  TG   L       VH   N  + R  PF   D  
Sbjct: 129 LIADMFFPWATEAAEKFNVPRLVFHGTGYFSLCSEYCIRVHNPQNIVASRYEPFVIPDLP 188

Query: 188 GH----WSSKRGRKVSDEVGRDWALRVLKCLRMSSDVVLVNTSPEFEGKYIDFLASSLNK 247
           G+          R    E+G+   +  +K   + S  V+VN+  E E  Y DF  S + K
Sbjct: 189 GNIVITQEQIADRDEESEMGK--FMIEVKESDVKSSGVIVNSFYELEPDYADFYKSVVLK 248

Query: 248 KILPIAPV------VPQIKPNGKKPEI-----LKWLDKKSPKSTVYVSFGSEYYLMNQDR 307
           +   I P+        +    GKK  I     LKWLD K P S +Y+SFGS     N+  
Sbjct: 249 RAWHIGPLSVYNRGFEEKAERGKKASINEVECLKWLDSKKPDSVIYISFGSVACFKNEQL 308

Query: 308 EELAHGLEQSHVNFIWVIRFPKGERLTIEEALPEGFMKRVGDKGLIMEGWAPQLEILNHS 367
            E+A GLE S  NFIWV+R  K   +  EE LPEGF +RV  KG+I+ GWAPQ+ IL+H 
Sbjct: 309 FEIAAGLETSGANFIWVVR--KNIGIEKEEWLPEGFEERVKGKGMIIRGWAPQVLILDHQ 368

Query: 368 SIGAFVCHCGWNSVVESMVFGVPIVALPMQLDQPCHAKVAN---LAGVCVEAERDDEGN- 427
           +   FV HCGWNS++E +  G+P+V  P+  +Q  + K+       GV V A+++     
Sbjct: 369 ATCGFVTHCGWNSLLEGVAAGLPMVTWPVAAEQFYNEKLVTQVLRTGVSVGAKKNVRTTG 428

Query: 428 --VKREGVAKAIKEVVSEESGEALRGKAREIGEALRKREEG 430
             + RE V KA++EV+  E  +  R +A+++ E  +   EG
Sbjct: 429 DFISREKVVKAVREVLVGEEADERRERAKKLAEMAKAAVEG 465

BLAST of CmoCh04G014940 vs. TAIR10
Match: AT4G34135.1 (AT4G34135.1 UDP-glucosyltransferase 73B2)

HSP 1 Score: 182.2 bits (461), Expect = 7.1e-46
Identity = 144/461 (31.24%), Positives = 225/461 (48.81%), Query Frame = 1

Query: 8   IKILMLPWLAHGHITPFFELAKRLAKHRTIFQIYVCS--SLVNLQAIDP--NLARTHSIE 67
           + ++  P++A+GH+ P  ++AK  +       I   S  S +  + ID   NL     I+
Sbjct: 10  LHVMFFPFMAYGHMIPTLDMAKLFSSRGAKSTILTTSLNSKILQKPIDTFKNLNPGLEID 69

Query: 68  LVELHLP----SLPDLPPHMHTTKGIPLHLEPTLMKAFDMAAKDF----ELLLDRLEPDL 127
           +   + P     LP+   ++          +  ++  F  + + F    E LL    PD 
Sbjct: 70  IQIFNFPCVELGLPEGCENVDFFTSNNNDDKNEMIVKFFFSTRFFKDQLEKLLGTTRPDC 129

Query: 128 LVSDLFQPWAVQAAAARNIPVVNFVVTGVAILTRL----VHA----FCNSGREFPFPEID 187
           L++D+F PWA +AA   N+P + F  TG   L       VH       +S   F  PE+ 
Sbjct: 130 LIADMFFPWATEAAGKFNVPRLVFHGTGYFSLCAGYCIGVHKPQKRVASSSEPFVIPELP 189

Query: 188 FSGHWSSKRGRKVSDEVGRDWALRVLKCLRMSSDVVLVNTSPEFEGKYIDFLASSLNKKI 247
            +   + ++      E      +  ++   + S  V++N+  E E  Y DF  S + K+ 
Sbjct: 190 GNIVITEEQIIDGDGESDMGKFMTEVRESEVKSSGVVLNSFYELEHDYADFYKSCVQKRA 249

Query: 248 LPIAPV------VPQIKPNGKKPEI-----LKWLDKKSPKSTVYVSFGSEYYLMNQDREE 307
             I P+        +    GKK  I     LKWLD K P S +YVSFGS  +  N+   E
Sbjct: 250 WHIGPLSVYNRGFEEKAERGKKANIDEAECLKWLDSKKPNSVIYVSFGSVAFFKNEQLFE 309

Query: 308 LAHGLEQSHVNFIWVIRFPKGERLTIEEALPEGFMKRVGDKGLIMEGWAPQLEILNHSSI 367
           +A GLE S  +FIWV+R  K +R   EE LPEGF +RV  KG+I+ GWAPQ+ IL+H + 
Sbjct: 310 IAAGLEASGTSFIWVVRKTKDDR---EEWLPEGFEERVKGKGMIIRGWAPQVLILDHQAT 369

Query: 368 GAFVCHCGWNSVVESMVFGVPIVALPMQLDQPCHAKVAN---LAGVCVEAERDDE----G 427
           G FV HCGWNS++E +  G+P+V  P+  +Q  + K+       GV V A +  +     
Sbjct: 370 GGFVTHCGWNSLLEGVAAGLPMVTWPVGAEQFYNEKLVTQVLRTGVSVGASKHMKVMMGD 429

Query: 428 NVKREGVAKAIKEVVSEESGEALRGKAREIGEALRKR-EEG 430
            + RE V KA++EV++ E+ E  R +A+++    +   EEG
Sbjct: 430 FISREKVDKAVREVLAGEAAEERRRRAKKLAAMAKAAVEEG 467

BLAST of CmoCh04G014940 vs. TAIR10
Match: AT2G15490.1 (AT2G15490.1 UDP-glycosyltransferase 73B4)

HSP 1 Score: 181.8 bits (460), Expect = 9.3e-46
Identity = 149/466 (31.97%), Positives = 229/466 (49.14%), Query Frame = 1

Query: 5   KQAIKILMLPWLAHGHITPFFELAKRLAKH---RTIFQIYVCSSLVN-----LQAIDPNL 64
           ++ I IL  P++AHGH+ P  ++AK  A+     T+    + + ++       +  +P+L
Sbjct: 3   REQIHILFFPFMAHGHMIPLLDMAKLFARRGAKSTLLTTPINAKILEKPIEAFKVQNPDL 62

Query: 65  A---RTHSIELVELHLPSLPDLPPHMHT-TKGIPLHLEPTLMKAFDMAAKDFELLLDRLE 124
               +  +   VEL LP   +    +++  K     L    + +     +  E  ++  +
Sbjct: 63  EIGIKILNFPCVELGLPEGCENRDFINSYQKSDSFDLFLKFLFSTKYMKQQLESFIETTK 122

Query: 125 PDLLVSDLFQPWAVQAAAARNIPVVNFVVTGVAIL----TRLVHA----FCNSGREFPFP 184
           P  LV+D+F PWA ++A    +P + F  T    L       +H       +S   F  P
Sbjct: 123 PSALVADMFFPWATESAEKIGVPRLVFHGTSSFALCCSYNMRIHKPHKKVASSSTPFVIP 182

Query: 185 EIDFSGHWSSKRGRKVSDEV--GRDWALRVLKCLRMSSDVVLVNTSPEFEGKYIDFLASS 244
            +      +  +    ++E   G+ W  + ++    SS  VLVN+  E E  Y DF  S 
Sbjct: 183 GLPGDIVITEDQANVTNEETPFGKFW--KEVRESETSSFGVLVNSFYELESSYADFYRSF 242

Query: 245 LNKKILPIAPV------VPQIKPNGKKPEI-----LKWLDKKSPKSTVYVSFGSEYYLMN 304
           + KK   I P+      + +    GKK  I     LKWLD K+P S VY+SFGS   L N
Sbjct: 243 VAKKAWHIGPLSLSNRGIAEKAGRGKKANIDEQECLKWLDSKTPGSVVYLSFGSGTGLPN 302

Query: 305 QDREELAHGLEQSHVNFIWVIRFPKGERLT--IEEALPEGFMKRVGDKGLIMEGWAPQLE 364
           +   E+A GLE S  NFIWV+   + +  T   E+ LP+GF +R   KGLI+ GWAPQ+ 
Sbjct: 303 EQLLEIAFGLEGSGQNFIWVVSKNENQVGTGENEDWLPKGFEERNKGKGLIIRGWAPQVL 362

Query: 365 ILNHSSIGAFVCHCGWNSVVESMVFGVPIVALPMQLDQPCHAKVANLA---GVCVEA-ER 424
           IL+H +IG FV HCGWNS +E +  G+P+V  PM  +Q  + K+       GV V A E 
Sbjct: 363 ILDHKAIGGFVTHCGWNSTLEGIAAGLPMVTWPMGAEQFYNEKLLTKVLRIGVNVGATEL 422

Query: 425 DDEGN-VKREGVAKAIKEVVSEESGEALRGKAREIGEALRKR-EEG 430
             +G  + R  V KA++EV+  E  E  R +A+E+GE  +   EEG
Sbjct: 423 VKKGKLISRAQVEKAVREVIGGEKAEERRLRAKELGEMAKAAVEEG 466

BLAST of CmoCh04G014940 vs. TAIR10
Match: AT1G10400.1 (AT1G10400.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 173.3 bits (438), Expect = 3.3e-43
Identity = 138/473 (29.18%), Positives = 227/473 (47.99%), Query Frame = 1

Query: 6   QAIKILMLPWLAHGHITPFFELAKRLAKHRTI--FQIYVCSSLVNLQAIDPNLARTHSIE 65
           + + +++ P+L+ GH+ P  +LA+ L  H       + V ++ +N   I  +L+ T +  
Sbjct: 4   EKVHVVLFPYLSKGHMIPMLQLARLLLSHSFAGDISVTVFTTPLNRPFIVDSLSGTKAT- 63

Query: 66  LVELHLP-SLPDLPPHMHTTKGIPLHLEPTLMKAFDMAAK----DFELLLDRL-EPDLLV 125
           +V++  P ++P++PP +  T  +P  L  +L   F  A K    DFE  L  L     +V
Sbjct: 64  IVDVPFPDNVPEIPPGVECTDKLPA-LSSSLFVPFTRATKSMQADFERELMSLPRVSFMV 123

Query: 126 SDLFQPWAVQAAAARNIPVVNFVVTGVAILTRLVHAFCN-------------SGREFPFP 185
           SD F  W  ++A     P + F     A        F N             S  EFP+ 
Sbjct: 124 SDGFLWWTQESARKLGFPRLVFFGMNCASTVICDSVFQNQLLSNVKSETEPVSVPEFPWI 183

Query: 186 EIDFSGHWSSKRGRKVSDEVGRDWALRVLKCLRMSSDVVLVNTSPEFEGKYIDFLASSLN 245
           ++            K + + G    L  +  +  S  ++  NT  + E  +IDF      
Sbjct: 184 KVRKCDFVKDMFDPKTTTDPGFKLILDQVTSMNQSQGIIF-NTFDDLEPVFIDFYKRKRK 243

Query: 246 KKILPIAPV------VPQIKPNGKKPEILKWLDKKSPK--STVYVSFGSEYYLMNQDREE 305
            K+  + P+      +        KP  +KWLD+K  K  + +YV+FGS+  +  +  EE
Sbjct: 244 LKLWAVGPLCYVNNFLDDEVEEKVKPSWMKWLDEKRDKGCNVLYVAFGSQAEISREQLEE 303

Query: 306 LAHGLEQSHVNFIWVIRFPKGERLTIEEALPEGFMKRVGDKGLIM-EGWAPQLEILNHSS 365
           +A GLE+S VNF+WV+   KG  +       +GF +RVG++G+++ + W  Q +IL H S
Sbjct: 304 IALGLEESKVNFLWVV---KGNEIG------KGFEERVGERGMMVRDEWVDQRKILEHES 363

Query: 366 IGAFVCHCGWNSVVESMVFGVPIVALPMQLDQPCHA-KVANLAGVCVEAERDDEGNVKRE 425
           +  F+ HCGWNS+ ES+   VPI+A P+  +QP +A  V     V        EG V+RE
Sbjct: 364 VRGFLSHCGWNSLTESICSEVPILAFPLAAEQPLNAILVVEELRVAERVVAASEGVVRRE 423

Query: 426 GVAKAIKEVVSEESGEALRGKAREIGEALRKR-EEGI------VDEVVREFCS 441
            +A+ +KE++  E G+ LR      G+  +K  EEGI      +D ++ EFC+
Sbjct: 424 EIAEKVKELMEGEKGKELRRNVEAYGKMAKKALEEGIGSSRKNLDNLINEFCN 464

BLAST of CmoCh04G014940 vs. NCBI nr
Match: gi|1009137597|ref|XP_015886141.1| (PREDICTED: beta-D-glucosyl crocetin beta-1,6-glucosyltransferase-like isoform X1 [Ziziphus jujuba])

HSP 1 Score: 444.1 bits (1141), Expect = 2.9e-121
Identity = 218/448 (48.66%), Positives = 314/448 (70.09%), Query Frame = 1

Query: 1   MATQKQAIKILMLPWLAHGHITPFFELAKRLAKHRTIFQIYVCSSLVNLQAIDPNLARTH 60
           M  ++++IK+LM PWLAHGHI+PF ELAKRL      FQIY CS+ VNL ++ P L++ +
Sbjct: 1   MMERQRSIKVLMFPWLAHGHISPFLELAKRLTDRN--FQIYFCSTPVNLTSVKPKLSQKY 60

Query: 61  S--IELVELHLPSLPDLPPHMHTTKGIPLHLEPTLMKAFDMAAKDFELLLDRLEPDLLVS 120
           S  I+LVELHLPSLPDLPPH HTT G+ L+L PTL KAFDM++  F  +L  ++PDLL+ 
Sbjct: 61  SSSIKLVELHLPSLPDLPPHYHTTNGLALNLIPTLKKAFDMSSSSFSTILSTIKPDLLIY 120

Query: 121 DLFQPWAVQAAAARNIPVVNFVVTGVAILTRLVHAFCNSGR----EFPFPEIDFSGHWSS 180
           D  QPWA Q A+  NIP VNF+  G ++++ ++H+   +G     EF   E+  S    +
Sbjct: 121 DFLQPWAPQLASCMNIPAVNFLSAGASMVSFVLHSIKYNGDDHDDEFLTTELHLSDSMEA 180

Query: 181 KRGRKVSDEVGRDWALRVLKCLRMSSDVVLVNTSPEFEGKYIDFLASSLNKKILPIAPVV 240
           K   ++++    +   R + CL  S+ ++L+ +  E EGKY+D+L+ S  KK++PI P+V
Sbjct: 181 KFA-EMTESSPDEHIDRAVTCLERSNSLILIKSFRELEGKYLDYLSLSFAKKVVPIGPLV 240

Query: 241 PQ-IKPNGKKPEILKWLDKKSPKSTVYVSFGSEYYLMNQDREELAHGLEQSHVNFIWVIR 300
            Q   P     +I+ WLDKK   STV+VSFGSEYYL N++ EE+A+GLE S VNFIWV+R
Sbjct: 241 AQDTNPEDDSMDIINWLDKKEKSSTVFVSFGSEYYLTNEEMEEIAYGLELSKVNFIWVVR 300

Query: 301 FPKGERLTIEEALPEGFMKRVGDKGLIMEGWAPQLEILNHSSIGAFVCHCGWNSVVESMV 360
           FP G+++ +EEALP+GF++RVG+KG+++E WAPQ++IL HSSIG FV HCGW+S++ES+ 
Sbjct: 301 FPLGQKMAVEEALPKGFLERVGEKGMVVEDWAPQMKILGHSSIGGFVSHCGWSSLMESLK 360

Query: 361 FGVPIVALPMQLDQPCHAKVANLAGVCVEAERDDEGNVKREGVAKAIKEVVSEESGEALR 420
            GVPI+A+PMQLDQP +AK+   +GV +E +RD  G ++RE +AK I+E+V E++ + + 
Sbjct: 361 LGVPIIAMPMQLDQPINAKLVERSGVGLEVKRDKNGRIEREYLAKVIREIVVEKARQDIE 420

Query: 421 GKAREIGEALRKREEGIVDEVVREFCSL 442
            KARE+   + ++ E  +D VV E   L
Sbjct: 421 KKAREMSNIITEKGEEEIDNVVEELAKL 445

BLAST of CmoCh04G014940 vs. NCBI nr
Match: gi|1009169046|ref|XP_015902985.1| (PREDICTED: beta-D-glucosyl crocetin beta-1,6-glucosyltransferase-like [Ziziphus jujuba])

HSP 1 Score: 441.0 bits (1133), Expect = 2.4e-120
Identity = 223/448 (49.78%), Positives = 313/448 (69.87%), Query Frame = 1

Query: 2   ATQKQAIKILMLPWLAHGHITPFFELAKRLAKHRTIFQIYVCSSLVNLQAIDPNLAR--- 61
           ATQK AIK+LMLPWLAHGHITPF ELAK+L      F IY CSS +NL +I P L     
Sbjct: 4   ATQKTAIKVLMLPWLAHGHITPFLELAKKLILRN--FHIYFCSSPINLNSIKPKLLIDPN 63

Query: 62  -THSIELVELHLPSLPDLPPHMHTTKGIPLHLEPTLMKAFDMAAKDFELLLDRLEPDLLV 121
            ++SI+LVELHLPSLPDLPPH HT KG+P HL PTL KA DM   +   +L+ L+PDLL+
Sbjct: 64  FSNSIQLVELHLPSLPDLPPHYHTMKGLPPHLLPTLEKALDMTKPELSKILETLKPDLLI 123

Query: 122 SDLFQPWAVQAAAARNIPVVNFVVTGVAILTRLVHAFCNSGREFPFPEI--DFSG-HWSS 181
            D    W    A++ NI  ++F+  G A+ + L H    S R+FPFP++  DFS   ++ 
Sbjct: 124 YDRLPIWLPDLASSMNIQPISFITGGAAMTSFLYHCIKCSDRQFPFPKLYPDFSKIKFTQ 183

Query: 182 KRGRKVSDEVGRDWALRVLKCLRMSSDVVLVNTSPEFEGKYIDFLASSLNKKILPIAPVV 241
           +     S + GRD A+  ++ L  S  VVL+ TS E EGKY+D+L +SL KK++P+  +V
Sbjct: 184 ESAEYSSTDSGRDSAIGAVEMLGKSRGVVLIRTSRELEGKYMDYLYASLGKKVVPVGSLV 243

Query: 242 PQIKPNGKKP-EILKWLDKKSPKSTVYVSFGSEYYLMNQDREELAHGLEQSHVNFIWVIR 301
           P +  + ++  +I+ WLDKK   STV VSFG+E YL  ++ EE+AHGLE S++NFIWV+R
Sbjct: 244 PDVVLDDEEGMDIINWLDKKEKSSTVLVSFGTECYLSKENMEEMAHGLEISNMNFIWVVR 303

Query: 302 FPKGERLTIEEALPEGFMKRVGDKGLIMEGWAPQLEILNHSSIGAFVCHCGWNSVVESMV 361
           FPKG ++ +++ LPEGF+ RV ++G+++E WAPQ++IL+HSSIG FV HCGW SV+ES+ 
Sbjct: 304 FPKGGKMKLDDGLPEGFLGRVKERGIVVENWAPQIKILHHSSIGGFVSHCGWGSVMESIK 363

Query: 362 FGVPIVALPMQLDQPCHAKVANLAGVCVEAERDDEGNVKREGVAKAIKEVVSEESGEALR 421
           FGVPI+A+PMQ+DQP +A++ +  GV +E   D+ G +K E +AK IK+VV E++GE +R
Sbjct: 364 FGVPIIAMPMQVDQPWNARLVDECGVGLEVNMDNNGKLKGETLAKVIKQVVVEKTGEQIR 423

Query: 422 GKAREIGEALRKREEGIVDEVVREFCSL 442
            KA+E+ E + +++E  +D VV+E   L
Sbjct: 424 RKAKEMSEKIGRKDEEEIDGVVKELLEL 449

BLAST of CmoCh04G014940 vs. NCBI nr
Match: gi|731413412|ref|XP_010658725.1| (PREDICTED: crocetin glucoside glucosyltransferase-like isoform X1 [Vitis vinifera])

HSP 1 Score: 440.3 bits (1131), Expect = 4.1e-120
Identity = 225/447 (50.34%), Positives = 301/447 (67.34%), Query Frame = 1

Query: 4   QKQAIKILMLPWLAHGHITPFFELAKRLAKHRTIFQIYVCSSLVNLQAIDPNLAR--THS 63
           ++  IK+L+LPWLAHGHI+PF EL+K+L K +  F IY CSS VNL  I   L    +HS
Sbjct: 6   RQSRIKVLVLPWLAHGHISPFLELSKQLMKQK--FYIYFCSSPVNLSRIKGKLTGNYSHS 65

Query: 64  IELVELHLPSLPDLPPHMHTTKGIPLHLEPTLMKAFDMAAKDFELLLDRLEPDLLVSDLF 123
           I+LVELHLPSLP+LPPH HTT G+P HL PTL  A DMA+  F  +L  L PDLL+ D  
Sbjct: 66  IQLVELHLPSLPELPPHYHTTNGLPPHLMPTLKMALDMASPSFTNILKTLSPDLLIYDFI 125

Query: 124 QPWAVQAAAARNIPVVNFVVTGVAILTRLVHAFCNSGREFPFPEIDFSGHWSSKRGRKVS 183
           QPWA  AAA+  IP V F+  G A    ++H     G EFPFPEI    + +S   R V 
Sbjct: 126 QPWAPAAAASLGIPSVQFLSNGAAATAFMIHFVKKPGNEFPFPEIYLRDYETSGFNRFVE 185

Query: 184 DEVG-RDWALRVLKCLRMSSDVVLVNTSPEFEGKYIDFLASSLNKKILPIAPVVP-QIKP 243
                R    +  +CL  SS+V+L+ +  E E ++IDFL++   K ++P+ P++  Q+  
Sbjct: 186 SSANARKDKEKARQCLEQSSNVILIRSFKEIEERFIDFLSNLNAKTVVPVGPLLQDQLDE 245

Query: 244 NGKKPEILKWLDKKSPKSTVYVSFGSEYYLMNQDREELAHGLEQSHVNFIWVIRFPKGER 303
              + E+++WL KK P S+V+VSFGSEY+L  ++ EE+A+GLE S VNFIWV+RFP G++
Sbjct: 246 EDAETEMVEWLSKKDPASSVFVSFGSEYFLSKEELEEVAYGLELSKVNFIWVVRFPMGDK 305

Query: 304 LTIEEALPEGFMKRVGDKGLIMEGWAPQLEILNHSSIGAFVCHCGWNSVVESMVFGVPIV 363
             +EEALPEGF+ RVGDKG+++EGWAPQ +IL HSSIG FV HCGW SV+ESM FGVPIV
Sbjct: 306 TRVEEALPEGFLSRVGDKGMVVEGWAPQKKILRHSSIGGFVSHCGWGSVMESMNFGVPIV 365

Query: 364 ALPMQLDQPCHAKVANLAGVCVEAERDDEGNVKREGVAKAIKEVVSEESGEALRGKAREI 423
           A+PM LDQP +AK+    GV +E +RD+ G ++RE +AK IKEVV ++ GE +R KARE 
Sbjct: 366 AMPMHLDQPFNAKLVEAHGVGIEVKRDENGKLQREEIAKVIKEVVVKKCGEIVRQKAREF 425

Query: 424 GEALRKREEGIVDEVVREFCSLSQPKN 447
            E + K+ +  +  VV +   L   +N
Sbjct: 426 SENMSKKGDEEIVGVVEKLVQLCGKRN 450

BLAST of CmoCh04G014940 vs. NCBI nr
Match: gi|1009119473|ref|XP_015876401.1| (PREDICTED: beta-D-glucosyl crocetin beta-1,6-glucosyltransferase-like [Ziziphus jujuba])

HSP 1 Score: 439.9 bits (1130), Expect = 5.4e-120
Identity = 221/453 (48.79%), Positives = 310/453 (68.43%), Query Frame = 1

Query: 2   ATQKQAIKILMLPWLAHGHITPFFELAKRLA-KHRTIFQIYVCSSLVNLQAI------DP 61
           A QK AI++LMLPWLAHGHI+PF ELAK+L    +  F +Y+CSS VNL +I      DP
Sbjct: 3   AMQKTAIEVLMLPWLAHGHISPFLELAKKLIHSSQRNFHVYLCSSPVNLDSIRLKFSCDP 62

Query: 62  NLARTHSIELVELHLPSLPDLPPHMHTTKGIPLHLEPTLMKAFDMAAKDFELLLDRLEPD 121
            L+  +SIELVELHLPS P+LPPH HTTKG+P HL P LMKAF+M   DF  +L+  +PD
Sbjct: 63  KLS--NSIELVELHLPSTPELPPHHHTTKGLPPHLMPNLMKAFEMTRSDFTNILETQKPD 122

Query: 122 LLVSDLFQPWAVQAAAARNIPVVNFVVTGVAILTRLVHAFCNSGREFPFPEI---DFSGH 181
           L++ D   PW    A++ NIP + F+ +G +I+    H   N   EFPFPEI        
Sbjct: 123 LIIHDFLPPWVHDVASSMNIPNIAFITSGASIMNFSFHFTNNKIDEFPFPEICPDSLIKK 182

Query: 182 WSSKRGRKVSDEVGRDWALRVLKCLRMSSDVVLVNTSPEFEGKYIDFLASSLNKKILPIA 241
           ++  R   + D+VG     + L     S  ++L+ +  E EGKYID+L++S  KK++P+ 
Sbjct: 183 FNQLRETSLKDDVGG----KPLHFYETSCKIILIKSFRELEGKYIDYLSTSFGKKVVPVG 242

Query: 242 PVVPQIKPNGKKPEILKWLDKKSPKSTVYVSFGSEYYLMNQDREELAHGLEQSHVNFIWV 301
           P+VP    + +  +I+ WLDKK   ST+ VSFGSE YL  QD +E+AHGLE S VNFIWV
Sbjct: 243 PLVPDPVDDNEGMDIINWLDKKEKSSTILVSFGSECYLSKQDMKEIAHGLELSKVNFIWV 302

Query: 302 IRFPKGERLTIEEALPEGFMKRVGDKGLIMEGWAPQLEILNHSSIGAFVCHCGWNSVVES 361
           IRFP+GE   +E+ALPEG+++RV ++G+++E WAPQ++ILNH++ G FV HCGW S++ES
Sbjct: 303 IRFPEGEEEKLEDALPEGYLERVRERGMVVENWAPQVKILNHANTGGFVSHCGWGSLMES 362

Query: 362 MVFGVPIVALPMQLDQPCHAKVANLAGVCVEAERD-DEGNVKREGVAKAIKEVVSEESGE 421
           + FGVPI+A+PMQ DQP +A++A ++G+ +E + D D G ++RE VAK IK+VV EE+GE
Sbjct: 363 IKFGVPIIAMPMQFDQPMNARLAEVSGIGLEIKMDNDNGRIEREAVAKVIKQVVIEETGE 422

Query: 422 ALRGKAREIGEALRKREEGIVDEVVREFCSLSQ 444
            +R KARE+ + ++ + E  +D  V+E   LS+
Sbjct: 423 VIRKKAREMSDCIKIKGEEEIDGAVQELLKLSE 449

BLAST of CmoCh04G014940 vs. NCBI nr
Match: gi|565404508|ref|XP_006367679.1| (PREDICTED: beta-D-glucosyl crocetin beta-1,6-glucosyltransferase-like [Solanum tuberosum])

HSP 1 Score: 436.0 bits (1120), Expect = 7.8e-119
Identity = 217/447 (48.55%), Positives = 299/447 (66.89%), Query Frame = 1

Query: 1   MATQKQAIKILMLPWLAHGHITPFFELAKRLAKHRTIFQIYVCSSLVNLQAIDPNLARTH 60
           M  +K  I +LMLPWLAHGHI PF ELAK+LA     F IY+CS+ VNL +I   ++  +
Sbjct: 1   MEAKKHTISVLMLPWLAHGHINPFLELAKKLASKN--FHIYLCSTPVNLNSIKKRVSEKY 60

Query: 61  S--IELVELHLPSLPDLPPHMHTTKGIPLHLEPTLMKAFDMAAKDFELLLDRLEPDLLVS 120
           S  IEL+ELHL SLPDLPPH HTT G+P HL  TL  AF++A+ +F  +L  L PDL++ 
Sbjct: 61  SQSIELIELHLLSLPDLPPHYHTTNGLPPHLMNTLKTAFELASPNFSKILQTLHPDLVIH 120

Query: 121 DLFQPWAVQAAAARNIPVVNFVVTGVAILTRLVHAFCNSGREFPFPEIDFSGHWSSKRGR 180
           D  QPW   +A++ NIP V F      ++   +H   N+  +FPFPEI    H      +
Sbjct: 121 DYNQPWVTDSASSMNIPAVQFPTFSATVVALAIHMSDNTEEKFPFPEIYLREHEMISLKK 180

Query: 181 KVSDEVGRDWALRVLKCLRMSSDVVLVNTSPEFEGKYIDFLASSLNKKILPIAPVVPQIK 240
            + +   + +     + +R S D++LV T  +FEGKYID+L++  +KKI+P+  +V +  
Sbjct: 181 DIDEVPSKKFPYD--EAIRRSHDIILVKTCRDFEGKYIDYLSNLTSKKIVPVGSLVQESM 240

Query: 241 PNGKKPEILKWLDKKSPKSTVYVSFGSEYYLMNQDREELAHGLEQSHVNFIWVIRFPKGE 300
                 EI +WLDKK   STV+VSFGSEY+L  ++   +A GLE S VNFIWVIRFP+GE
Sbjct: 241 DQDDYEEIAQWLDKKEKSSTVFVSFGSEYFLSKEEILAVAQGLELSKVNFIWVIRFPQGE 300

Query: 301 RLTIEEALPEGFMKRVGDKGLIMEGWAPQLEILNHSSIGAFVCHCGWNSVVESMVFGVPI 360
           RL I +ALPEG+++RVG++G+IMEGWAPQ  IL H SIG FV HCGW+S +ESM FGVPI
Sbjct: 301 RLNIRDALPEGYLERVGERGMIMEGWAPQALILQHPSIGGFVSHCGWSSFMESMKFGVPI 360

Query: 361 VALPMQLDQPCHAKVANLAGVCVEAERDDEGNVKREGVAKAIKEVVSEESGEALRGKARE 420
           +A+PM +DQP +A++    GV VEA +D++G ++ E +AKAI+EV+ EESGEA+R KA+E
Sbjct: 361 IAMPMHIDQPMNARLVEYIGVGVEAAKDEDGKLQSEEIAKAIREVLVEESGEAVRKKAKE 420

Query: 421 IGEALRKREEGIVDEVVREFCSLSQPK 446
           + E +  +E+  +D V  E  +L   K
Sbjct: 421 LSEKMNAKEDEEIDGVAEELVALCSNK 443
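
Note: in the alignment blocks above, the line between Query and Sbjct follows the usual BLAST convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or a gap. A small Python sketch, assuming the middle line has already been extracted as a plain string (count_matches is a hypothetical helper, not a function of any BLAST tool):

```python
def count_matches(midline):
    """Count identities and positives in one BLAST midline string.

    Letters are identical residues; '+' is a conservative substitution.
    Positives include identities, as in the HSP header counts.
    """
    identities = sum(1 for c in midline if c.isalpha())
    positives = sum(1 for c in midline if c.isalpha() or c == "+")
    return identities, positives


if __name__ == "__main__":
    # Midline of the final block of one alignment, leading indent removed.
    identities, positives = count_matches("KARE+   + ++ E  +D VV E   L")
    print(identities, positives)
```

Summing these per-block counts over every block of an HSP should reproduce the Identity and Positives numerators in its header, provided the midline text was not damaged during extraction.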

The following BLAST results are available for this feature:
Match Name   E-value    Identity (%)  Description
UGT9_GARJA   2.1e-100   44.55         Beta-D-glucosyl crocetin beta-1,6-glucosyltransferase OS=Gardenia jasminoides GN...
UGAT_BELPE   1.5e-93    41.91         Cyanidin-3-O-glucoside 2-O-glucuronosyltransferase OS=Bellis perennis GN=UGAT PE...
FLRT_CITMA   7.8e-87    40.95         Flavanone 7-O-glucoside 2''-O-beta-L-rhamnosyltransferase OS=Citrus maxima GN=C1...
U91C1_ARATH  4.3e-53    32.17         UDP-glycosyltransferase 91C1 OS=Arabidopsis thaliana GN=UGT91C1 PE=2 SV=1
SCGT_TOBAC   4.5e-50    34.28         Scopoletin glucosyltransferase OS=Nicotiana tabacum GN=TOGT1 PE=1 SV=1

Match Name    E-value    Identity (%)  Description
F6I5W2_VITVI  2.9e-120   50.34         Glycosyltransferase OS=Vitis vinifera GN=VIT_13s0074g00610 PE=3 SV=1
F6HIX7_VITVI  2.7e-118   48.98         Glycosyltransferase OS=Vitis vinifera GN=VIT_13s0047g01230 PE=3 SV=1
K4D510_SOLLC  1.0e-117   47.20         Glycosyltransferase OS=Solanum lycopersicum PE=3 SV=1
M1BPV9_SOLTU  3.9e-117   48.34         Glycosyltransferase OS=Solanum tuberosum GN=PGSC0003DMG400019483 PE=3 SV=1
F6I0D2_VITVI  6.6e-117   49.66         Glycosyltransferase OS=Vitis vinifera GN=VIT_04s0044g01540 PE=3 SV=1

Match Name   E-value   Identity (%)  Description
AT5G49690.1  2.4e-54   32.17         UDP-Glycosyltransferase superfamily protein
AT4G34131.1  7.1e-46   31.67         UDP-glucosyl transferase 73B3
AT4G34135.1  7.1e-46   31.24         UDP-glucosyltransferase 73B2
AT2G15490.1  9.3e-46   31.97         UDP-glycosyltransferase 73B4
AT1G10400.1  3.3e-43   29.18         UDP-Glycosyltransferase superfamily protein

Match Name                         E-value    Identity (%)  Description
gi|1009137597|ref|XP_015886141.1|  2.9e-121   48.66         PREDICTED: beta-D-glucosyl crocetin beta-1,6-glucosyltransferase-like isoform X1...
gi|1009169046|ref|XP_015902985.1|  2.4e-120   49.78         PREDICTED: beta-D-glucosyl crocetin beta-1,6-glucosyltransferase-like [Ziziphus ...
gi|731413412|ref|XP_010658725.1|   4.1e-120   50.34         PREDICTED: crocetin glucoside glucosyltransferase-like isoform X1 [Vitis vinifer...
gi|1009119473|ref|XP_015876401.1|  5.4e-120   48.79         PREDICTED: beta-D-glucosyl crocetin beta-1,6-glucosyltransferase-like [Ziziphus ...
gi|565404508|ref|XP_006367679.1|   7.8e-119   48.55         PREDICTED: beta-D-glucosyl crocetin beta-1,6-glucosyltransferase-like [Solanum t...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR002213   UDP_glucos_trans
Vocabulary: Molecular Function
Term        Definition
GO:0016758  transferase activity, transferring hexosyl groups
Vocabulary: Biological Process
Term        Definition
GO:0008152  metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008152      metabolic process
biological_process  GO:0031047      gene silencing by RNA
biological_process  GO:0006468      protein phosphorylation
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0016758      transferase activity, transferring hexosyl groups
molecular_function  GO:0005524      ATP binding
molecular_function  GO:0004672      protein kinase activity

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh04G014940.1  CmoCh04G014940.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                           Source   Source Term         Source Description                              Alignment
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PANTHER  PTHR11926           GLUCOSYL/GLUCURONOSYL TRANSFERASES              coord: 5..438 score: 2.7E
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PFAM     PF00201             UDPGT                                           coord: 249..408 score: 4.0
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PROSITE  PS00375             UDPGT                                           coord: 324..367 scor
None       No IPR available                          GENE3D   G3DSA:3.40.50.2000  -                                               coord: 9..254 score: 1.1E-6; coord: 255..421 score: 4.2
None       No IPR available                          PANTHER  PTHR11926:SF342     SUBFAMILY NOT NAMED                             coord: 5..438 score: 2.7E
None       No IPR available                          unknown  SSF53756            UDP-Glycosyltransferase/glycogen phosphorylase  coord: 9..427 score: 1.65

The following gene(s) are paralogous to this gene:
Gene            Paralogue       Organism                   Block
CmoCh04G014940  CmoCh18G009370  Cucurbita moschata (Rifu)  cmocmoB336
CmoCh04G014940  CmoCh04G026470  Cucurbita moschata (Rifu)  cmocmoB468
The following block(s) are covering this gene:
Gene            Organism                    Block
CmoCh04G014940  Wild cucumber (PI 183967)   cmocpiB745
CmoCh04G014940  Cucumber (Chinese Long) v2  cmocuB738
CmoCh04G014940  Melon (DHL92) v3.5.1        cmomeB631
CmoCh04G014940  Bottle gourd (USVL1VR-Ls)   cmolsiB700
CmoCh04G014940  Cucumber (Gy14) v2          cgybcmoB650
CmoCh04G014940  Melon (DHL92) v3.6.1        cmomedB720
CmoCh04G014940  Cucumber (Chinese Long) v3  cmocucB0874
CmoCh04G014940  Cucumber (Gy14) v1          cgycmoB0848
CmoCh04G014940  Cucurbita maxima (Rimu)     cmacmoB420