CmoCh04G026470 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh04G026470
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: Glycosyltransferase
Location: Cmo_Chr04: 19251287 .. 19254285 (-)
RNA-Seq Expression: CmoCh04G026470
Synteny: CmoCh04G026470
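
The location field can be turned into a chromosome, coordinate pair, strand and span length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF-style annotation); the function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string such as 'Cmo_Chr04: 19251287 .. 19254285 (-)'."""
    m = re.match(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError("unrecognised location string: %r" % text)
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so both ends count.
        "length": end - start + 1,
    }

loc = parse_location("Cmo_Chr04: 19251287 .. 19254285 (-)")
print(loc["seqid"], loc["strand"], loc["length"])  # Cmo_Chr04 - 2999
```

Under that assumption the gene spans 2999 bp on the minus strand of Cmo_Chr04.
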
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAGGCAATAGACATGGCAAAACGAGCGTTTTGATGCTCCCATGGCTGGCTCACGGCCATGTCTCGCCGTTCTTCGAGCTAGCCAAATCCCTCCGCCGTAGAAACTTCCACATATACTTCTGTTCCACCTCCGTAATTATCAACTCAATCCAATCAAACCTCACTCGAGATCTCTCCTCCGATATTGAGCTCGTCGAGCTAAAGTTACCGACCTCATCCGACCTCCCGCCCTACCGCCACACTACCGCCGGCCTCCCACCCCATCTCATGTTCTCGCTTAAGCGAGCTTTCGACTCGGCCGCCGCCACCTTCAGCATCATCCTCCATAACTTGAACCCGGACTTGGTCATCTATGATTTCTTGCAACCATGGGCACCTACTGTGGCTCGATCATCTCATATTCCCGCCGTCATGTTCCAACCTACGGGTGAGTTCAGTCGTTAAATTTATTACCGTGACGTATTAGATATAAATTTGAAAGTTTTTACAGGCGCGCTTATGGCGGCTATGGTGAAGTACGAGTTGGAGTATCCGAGCTCGGATTTGAGCTCGATATTCCCAGATATTCGGCTCACCGAGTACGAGATTAAACAGGTCAAGAACTTGTTTAGATCATCAGTGAATGATGCGAGGGATGAAGAAAGGATTAAGGAATGTAATGAGAGATCGTGTGGTATGATTTTGGTGAAATCATTCAGAGAAATTGAAGGCAAATATATTGATTTTCTCTCTATTTTGCTGCGGAAGAAGGTTTGTAATTTCCATTAAATTTCACTGAATATTCTTATTTAAAATTTGTATTTTTTTTATATATTTTATATTTTTTTTAAGAAAATATTTGAATTTTTGGATAAATTTTAAGAAAAAATATATTTTTTAAATTTTTTTTTAAATTTGACTTAATTTATAAAAAAATAGAGAAGTATATTTAATTTTACATTATAAAAAAATAAAAAAATTAAAGCTCTATCGAATAATCATTTAATTTTTAAAAATTAAGTTTATAAATAAGTTTTTGTATTTTATTATCTACTTTTATATATATATATCAAATTTTAAAAAATAAAATAAAAAGATAAGTATCAATTTTAAAAATAAAAAATTAATTATTGATTTAGCTTTCAAATCAATTATATTAAATTTGTTTTTTTTTAATTTTTTAAAAAACTTTTTAAAAATTTTTATTTAGATTTTATTTTATATTACTATTTTTATATACATCAAATTATTTAATGAAAATAGTAATGAAAATAATAATGAAGTTTATTTGTCAATTAAATAATAAAAAATTAAAAAGTTAATTAAAAATCAAGAGATTAAATGAAAATAGTAATAAATTCACATGAAATTAAATTGTTTTTTTTTTTTAATTACGGGTAATTAGGTGGTTCCAGTGGGTCCATTAGTTCAAGAACCAGAAAACGACGTCGTATCAAGAAGAAGATTCGAAAAATGGCTAAACAAAAAACAGGACTCATCTTGCTTACTCGTGTCATTCGGTAGCGAGTTTTACTTATCCAAAGAAGACATGGAAGAAATCGCTTATGGGCTCGAGCTTAGCCACGTGGACTTCATATGGGTCGTTAGGTTTCCGGTAGCCGGTGGAGGAGAGAGAAAGAAGAACGTTGAAGAAGAACTCCCTAAAGGATTTATAGAGAGAGTTAGAGAGAGAGGAATGGTGGTGGAAGGGTGGGTCCCACAAGCTCAGATTTTGAAACACCGTACCACCGGCGGCTTCCTCAGCCATTGTGGGTGGAGCTCCGTCATGGAGAGCATCAAGTTTGGTGTCCCGATCATCGCCGCCCCAATGCAGCTCGACCAACCGTTGAATGCTAGGTTGGTCGAGTGGCTCGACGTCGGTGTCGTCATCGAAAGAGACAATGGTCGCCTCCGCCGACAAGAAGTGGCCAGAGTTGTCAAAGAGGTTAGTTGTTATTTATCTTTTTAACCTTGTGTCTGATAGATCAAAAATATTATTCAGCTATTTTCGTATGTTT
TATCATGAGACGTATGACCTACGAAGTTAGAAGCTCGAGAATCGATTAACCCCTTTTAAGGTTCAAGCCCGAGATACAAACATTCCAGTCAATTTAGCGTTTCATCTGGAAATTTAACCTAAATTTAATATCATGTTCATATTCATAACACTGTCACTACTAGCGGATATTGTCTTTCTTGAGCTTTTCCTTTCGAACTTTCCCTCAAAGTTTTAAAAACGGTATGCTAGGGAGAGGTTTCCACATACTTACAAAAAATGTTTCGTTTTCCTCCTCAACCGATGTGAGATCTTACAATCCACTCCTTTCAGGGTCCAAGTTTCCACACCCTTACAAAGAATGTTTCGTTCTCTCTCGGTCAGGGAGGAGAACAAAAACACCTTTTATAAGGGTGTGGAAACCATTCCCTAATAAATGCACAAGCGTTGAGGGGAAACCCGAAAGGGAAAGCCAAAAGAGGACAATATCTACTAGCGGCAGATCTGAGCCGTTACATTGGTTCTTTTGCATCGAGTGGGATCTCACAATCCATCCTCCTTCGAGGCCAATGTCCTCGTTGGAATTCATTTCTCTCTCTAATTAATGTGGGATCTCATACTCTCGAAGATTACCGATACTAATATCTTATTGGTTTTTCTTAATAAGGTAATGGTAGAGAAGATGGGAGAGAGAGTAAGGAAGAAGGTAAAGGAGTTTGCAGAGATGTTGAAGAAGAAAGGTGACGAAGAGATGGACATGGTGGTGGAAGAGCTAGTGAAGCTTTGCAAGAGCAACAAGGAAGATAATTTAGAGAGCCATTGGTGTAGACCCGCCATTGATAGCCATTTTTGTGAACCTCGATGATGATGCTACCACAACCGACACGGGACCTTTTTGCAAATACGACAATTAGCAAGGACTAGCTTAATAAAAGTATAAGTTCTTTTGCAACTGACACGAGAGCTTTTTGCAAATACAACAATTAGCAAGGACTAGCTTAATAAAAGTAGAAGTTCTTTT

mRNA sequence

ATGGAAGGCAATAGACATGGCAAAACGAGCGTTTTGATGCTCCCATGGCTGGCTCACGGCCATGTCTCGCCGTTCTTCGAGCTAGCCAAATCCCTCCGCCGTAGAAACTTCCACATATACTTCTGTTCCACCTCCGTAATTATCAACTCAATCCAATCAAACCTCACTCGAGATCTCTCCTCCGATATTGAGCTCGTCGAGCTAAAGTTACCGACCTCATCCGACCTCCCGCCCTACCGCCACACTACCGCCGGCCTCCCACCCCATCTCATGTTCTCGCTTAAGCGAGCTTTCGACTCGGCCGCCGCCACCTTCAGCATCATCCTCCATAACTTGAACCCGGACTTGGTCATCTATGATTTCTTGCAACCATGGGCACCTACTGTGGCTCGATCATCTCATATTCCCGCCGTCATGTTCCAACCTACGGGCGCGCTTATGGCGGCTATGGTGAAGTACGAGTTGGAGTATCCGAGCTCGGATTTGAGCTCGATATTCCCAGATATTCGGCTCACCGAGTACGAGATTAAACAGGTCAAGAACTTGTTTAGATCATCAGTGAATGATGCGAGGGATGAAGAAAGGATTAAGGAATGTAATGAGAGATCGTGTGGTATGATTTTGGTGAAATCATTCAGAGAAATTGAAGGCAAATATATTGATTTTCTCTCTATTTTGCTGCGGAAGAAGGTGGTTCCAGTGGGTCCATTAGTTCAAGAACCAGAAAACGACGTCGTATCAAGAAGAAGATTCGAAAAATGGCTAAACAAAAAACAGGACTCATCTTGCTTACTCGTGTCATTCGGTAGCGAGTTTTACTTATCCAAAGAAGACATGGAAGAAATCGCTTATGGGCTCGAGCTTAGCCACGTGGACTTCATATGGGTCGTTAGGTTTCCGGTAGCCGGTGGAGGAGAGAGAAAGAAGAACGTTGAAGAAGAACTCCCTAAAGGATTTATAGAGAGAGTTAGAGAGAGAGGAATGGTGGTGGAAGGGTGGGTCCCACAAGCTCAGATTTTGAAACACCGTACCACCGGCGGCTTCCTCAGCCATTGTGGGTGGAGCTCCGTCATGGAGAGCATCAAGTTTGGTGTCCCGATCATCGCCGCCCCAATGCAGCTCGACCAACCGTTGAATGCTAGGTTGGTCGAGTGGCTCGACGTCGGTGTCGTCATCGAAAGAGACAATGGTCGCCTCCGCCGACAAGAAGTGGCCAGAGTTGTCAAAGAGGTAATGGTAGAGAAGATGGGAGAGAGAGTAAGGAAGAAGGTAAAGGAGTTTGCAGAGATGTTGAAGAAGAAAGGTGACGAAGAGATGGACATGGTGGTGGAAGAGCTAGTGAAGCTTTGCAAGAGCAACAAGGAAGATAATTTAGAGAGCCATTGGTGTAGACCCGCCATTGATAGCCATTTTTGTGAACCTCGATGATGATGCTACCACAACCGACACGGGACCTTTTTGCAAATACGACAATTAGCAAGGACTAGCTTAATAAAAGTATAAGTTCTTTTGCAACTGACACGAGAGCTTTTTGCAAATACAACAATTAGCAAGGACTAGCTTAATAAAAGTAGAAGTTCTTTT

Coding sequence (CDS)

ATGGAAGGCAATAGACATGGCAAAACGAGCGTTTTGATGCTCCCATGGCTGGCTCACGGCCATGTCTCGCCGTTCTTCGAGCTAGCCAAATCCCTCCGCCGTAGAAACTTCCACATATACTTCTGTTCCACCTCCGTAATTATCAACTCAATCCAATCAAACCTCACTCGAGATCTCTCCTCCGATATTGAGCTCGTCGAGCTAAAGTTACCGACCTCATCCGACCTCCCGCCCTACCGCCACACTACCGCCGGCCTCCCACCCCATCTCATGTTCTCGCTTAAGCGAGCTTTCGACTCGGCCGCCGCCACCTTCAGCATCATCCTCCATAACTTGAACCCGGACTTGGTCATCTATGATTTCTTGCAACCATGGGCACCTACTGTGGCTCGATCATCTCATATTCCCGCCGTCATGTTCCAACCTACGGGCGCGCTTATGGCGGCTATGGTGAAGTACGAGTTGGAGTATCCGAGCTCGGATTTGAGCTCGATATTCCCAGATATTCGGCTCACCGAGTACGAGATTAAACAGGTCAAGAACTTGTTTAGATCATCAGTGAATGATGCGAGGGATGAAGAAAGGATTAAGGAATGTAATGAGAGATCGTGTGGTATGATTTTGGTGAAATCATTCAGAGAAATTGAAGGCAAATATATTGATTTTCTCTCTATTTTGCTGCGGAAGAAGGTGGTTCCAGTGGGTCCATTAGTTCAAGAACCAGAAAACGACGTCGTATCAAGAAGAAGATTCGAAAAATGGCTAAACAAAAAACAGGACTCATCTTGCTTACTCGTGTCATTCGGTAGCGAGTTTTACTTATCCAAAGAAGACATGGAAGAAATCGCTTATGGGCTCGAGCTTAGCCACGTGGACTTCATATGGGTCGTTAGGTTTCCGGTAGCCGGTGGAGGAGAGAGAAAGAAGAACGTTGAAGAAGAACTCCCTAAAGGATTTATAGAGAGAGTTAGAGAGAGAGGAATGGTGGTGGAAGGGTGGGTCCCACAAGCTCAGATTTTGAAACACCGTACCACCGGCGGCTTCCTCAGCCATTGTGGGTGGAGCTCCGTCATGGAGAGCATCAAGTTTGGTGTCCCGATCATCGCCGCCCCAATGCAGCTCGACCAACCGTTGAATGCTAGGTTGGTCGAGTGGCTCGACGTCGGTGTCGTCATCGAAAGAGACAATGGTCGCCTCCGCCGACAAGAAGTGGCCAGAGTTGTCAAAGAGGTAATGGTAGAGAAGATGGGAGAGAGAGTAAGGAAGAAGGTAAAGGAGTTTGCAGAGATGTTGAAGAAGAAAGGTGACGAAGAGATGGACATGGTGGTGGAAGAGCTAGTGAAGCTTTGCAAGAGCAACAAGGAAGATAATTTAGAGAGCCATTGGTGTAGACCCGCCATTGATAGCCATTTTTGTGAACCTCGATGA

Protein sequence

MEGNRHGKTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVIINSIQSNLTRDLSSDIELVELKLPTSSDLPPYRHTTAGLPPHLMFSLKRAFDSAAATFSIILHNLNPDLVIYDFLQPWAPTVARSSHIPAVMFQPTGALMAAMVKYELEYPSSDLSSIFPDIRLTEYEIKQVKNLFRSSVNDARDEERIKECNERSCGMILVKSFREIEGKYIDFLSILLRKKVVPVGPLVQEPENDVVSRRRFEKWLNKKQDSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFIWVVRFPVAGGGERKKNVEEELPKGFIERVRERGMVVEGWVPQAQILKHRTTGGFLSHCGWSSVMESIKFGVPIIAAPMQLDQPLNARLVEWLDVGVVIERDNGRLRRQEVARVVKEVMVEKMGERVRKKVKEFAEMLKKKGDEEMDMVVEELVKLCKSNKEDNLESHWCRPAIDSHFCEPR
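
The protein sequence above should follow from the CDS by reading codons in frame until the stop codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the helper name `translate` is hypothetical, and only the first and last codons of the CDS are used as a check.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGAAGGCAATAGACAT"))  # first six codons of the CDS: MEGNRH
print(translate("GAACCTCGATGA"))        # last four codons of the CDS: EPR
```

Both fragments agree with the start (MEGNRH) and end (EPR) of the protein sequence listed above.
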
Homology
BLAST of CmoCh04G026470 vs. ExPASy Swiss-Prot
Match: A0A0A6ZFY4 (UDP-glucosyltransferase 29 OS=Panax ginseng OX=4054 GN=UGT29 PE=1 SV=1)

HSP 1 Score: 424.9 bits (1091), Expect = 1.2e-117
Identity = 224/456 (49.12%), Positives = 317/456 (69.52%), Query Frame = 0

Query: 4   NRHGKTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVIINSIQSNLTRDLSSDI 63
           N++G+ S+ +LP+LAHGH+SPFFELAK L +RN +++ CST + ++SI+    +D S+ I
Sbjct: 3   NQNGRISIALLPFLAHGHISPFFELAKQLAKRNCNVFLCSTPINLSSIKD---KDSSASI 62

Query: 64  ELVELKLPTSSDLPPYRHTTAGLPPHLMFSLKRAFDSAAATFSIILHNLNPDLVIYDFLQ 123
           +LVEL LP+S DLPP+ HTT GLP HLM  L+ AF++A  TFS IL  LNPDL+IYDF  
Sbjct: 63  KLVELHLPSSPDLPPHYHTTNGLPSHLMLPLRNAFETAGPTFSEILKTLNPDLLIYDFNP 122

Query: 124 PWAPTVARSSHIPAVMFQPTGALMAAMVKYELEYPSSDLSSIFPDIRLTEYEIKQVKNLF 183
            WAP +A S +IPAV F  T A  +++  +  + P       FPD     Y+   +    
Sbjct: 123 SWAPEIASSHNIPAVYFLTTAAASSSIGLHAFKNPGEKYP--FPDF----YDNSNITPEP 182

Query: 184 RSSVNDARDEERIKECNERSCGMILVKSFREIEGKYIDFLSILLRKKVVPVGPLVQEP-- 243
            S+ N     + I  C ERSC +IL+KSFRE+EGKYID LS L  K +VPVGPLVQ+P  
Sbjct: 183 PSADNMKLLHDFI-ACFERSCDIILIKSFRELEGKYIDLLSTLSDKTLVPVGPLVQDPMG 242

Query: 244 ENDVVSRRRFEKWLNKKQDSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFIWVVRFPV 303
            N+     +   WL+K+ +S+ + V FGSE++LS E++EE+A GLE+S V+FIW VR  +
Sbjct: 243 HNEDPKTEQIINWLDKRAESTVVFVCFGSEYFLSNEELEEVAIGLEISTVNFIWAVRL-I 302

Query: 304 AGGGERKKNVEEELPKGFIERVRERGMVVEGWVPQAQILKHRTTGGFLSHCGWSSVMESI 363
            G    KK +   LP+GF++RV +RG+VVEGW PQA+IL H +TGGF+SHCGWSS+ ES+
Sbjct: 303 EG---EKKGI---LPEGFVQRVGDRGLVVEGWAPQARILGHSSTGGFVSHCGWSSIAESM 362

Query: 364 KFGVPIIAAPMQLDQPLNARLVEWLDVGVVIERD-NGRLRRQEVARVVKEVMVEKMGERV 423
           KFGVP+IA    LDQPLN +L   + VG+ + RD NG+ +R+ +A V+++V+VEK GE +
Sbjct: 363 KFGVPVIAMARHLDQPLNGKLAAEVGVGMEVVRDENGKYKREGIAEVIRKVVVEKSGEVI 422

Query: 424 RKKVKEFAEMLKKKGDEEMDMVVEELVKLCKSNKED 457
           R+K +E +E +K+KG++E+D  +EELV++CK  K++
Sbjct: 423 RRKARELSEKMKEKGEQEIDRALEELVQICKKKKDE 441
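
The percentages on an HSP statistics line are the match counts divided by the alignment length. The following is a minimal sketch that recomputes them from a line shaped like the one reported for this hit; the function name `hsp_fractions` is hypothetical, and the line is assumed to use the `label = matches/length` layout shown above.

```python
import re

def hsp_fractions(stats_line):
    """Return [(label, matches, aligned_length, percent), ...] for each
    'label = matches/length' field found on an HSP statistics line."""
    results = []
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        matches, length = int(num), int(den)
        results.append((label, matches, length, round(100.0 * matches / length, 2)))
    return results

line = "Identity = 224/456 (49.12%), Positives = 317/456 (69.52%), Query Frame = 0"
for label, matches, length, pct in hsp_fractions(line):
    print(label, matches, length, pct)
```

The `Query Frame = 0` field is skipped because it is not a fraction.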

BLAST of CmoCh04G026470 vs. ExPASy Swiss-Prot
Match: Q5NTH0 (Cyanidin-3-O-glucoside 2-O-glucuronosyltransferase OS=Bellis perennis OX=41492 GN=UGAT PE=1 SV=1)

HSP 1 Score: 382.9 bits (982), Expect = 5.4e-105
Identity = 210/447 (46.98%), Positives = 284/447 (63.53%), Query Frame = 0

Query: 11  VLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVIINSIQSNLTRDLSSDIELVELKL 70
           V+MLPWLA+ H+S F   AK L   NFHIY CS+   +  +++NLT   S  I+L+EL L
Sbjct: 12  VVMLPWLAYSHISRFLVFAKRLTNHNFHIYICSSQTNMQYLKNNLTSQYSKSIQLIELNL 71

Query: 71  PTSSDLPPYRHTTAGLPPHLMFSLKRAFDSAAATFSIILHNLNPDLVIYDFLQPWAPTVA 130
           P+SS+LP   HTT GLPPHL  +L   +  +   F  IL  LNP LVIYDF Q WAP VA
Sbjct: 72  PSSSELPLQYHTTHGLPPHLTKTLSDDYQKSGPDFETILIKLNPHLVIYDFNQLWAPEVA 131

Query: 131 RSSHIPAVMFQPTGALMAAMVKYELEYP-SSDLSSI-FPDIRLTEYEIKQVKNLFRSSVN 190
            + HIP++        + A+  +    P   +L+   FP+I     +I +          
Sbjct: 132 STLHIPSIQLLSGCVALYALDAHLYTKPLDENLAKFPFPEIYPKNRDIPK---------G 191

Query: 191 DARDEERIKECNERSCGMILVKSFREIEGKYIDFLSILLRKKVVPVGPLVQEPENDVVSR 250
            ++  ER  +C  RSC +ILV+S  E+EGKYID+LS  L KKV+PVGPLVQE        
Sbjct: 192 GSKYIERFVDCMRRSCEIILVRSTMELEGKYIDYLSKTLGKKVLPVGPLVQEASLLQDDH 251

Query: 251 RRFEKWLNKKQDSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFIWVVRFPVAGGGERK 310
               KWL+KK++SS + V FGSE+ LS  ++E+IAYGLELS V F+W +R          
Sbjct: 252 IWIMKWLDKKEESSVVFVCFGSEYILSDNEIEDIAYGLELSQVSFVWAIR---------- 311

Query: 311 KNVEEELPKGFIERVRERGMVVEGWVPQAQILKHRTTGGFLSHCGWSSVMESIKFGVPII 370
              +     GFI+RV ++G+V++ WVPQA IL H +TGGF+SHCGWSS MESI++GVPII
Sbjct: 312 --AKTSALNGFIDRVGDKGLVIDKWVPQANILSHSSTGGFISHCGWSSTMESIRYGVPII 371

Query: 371 AAPMQLDQPLNARLVEWLDVGVVIERD-NGRLRRQEVARVVKEVMVEKMGERVRKKVKEF 430
           A PMQ DQP NARL+E +  G+ + RD  GRL+R+E+A VV++V+VE  GE +R+K KE 
Sbjct: 372 AMPMQFDQPYNARLMETVGAGIEVGRDGEGRLKREEIAAVVRKVVVEDSGESIREKAKEL 431

Query: 431 AEMLKKKGDEEMD-MVVEELVKLCKSN 454
            E++KK  + E+D +V+E LVKLC+ N
Sbjct: 432 GEIMKKNMEAEVDGIVIENLVKLCEMN 437

BLAST of CmoCh04G026470 vs. ExPASy Swiss-Prot
Match: F8WKW8 (Beta-D-glucosyl crocetin beta-1,6-glucosyltransferase OS=Gardenia jasminoides OX=114476 GN=UGT94E5 PE=1 SV=1)

HSP 1 Score: 382.9 bits (982), Expect = 5.4e-105
Identity = 211/446 (47.31%), Positives = 286/446 (64.13%), Query Frame = 0

Query: 13  MLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVIINSIQSNLTRDLSSDIELVELKLPT 72
           M PWLA+GH+SP+ ELAK L  R F IY CST + +  I+  +T   S  I+LVEL LP 
Sbjct: 1   MFPWLAYGHISPYLELAKRLTDRGFAIYICSTPINLGFIKKRITGKYSVTIKLVELHLPD 60

Query: 73  SSDLPPYRHTTAGLPPHLMFSLKRAFDSAAATFSIILHNLNPDLVIYDFLQPWAPTVARS 132
           + +LPP+ HTT GLPPHLM +LKRA + A    S IL  L PD VIYD  Q W   +  +
Sbjct: 61  TPELPPHYHTTNGLPPHLMATLKRALNGAKPELSNILKTLKPDFVIYDATQTWTAALTVA 120

Query: 133 SHIPAVMFQPTGALMAA-----MVKYELEYPSSDLSSIFPDIRLTEYEIKQVKNLFRSSV 192
            +IPAV F  +   M A      +K  +E+P       FP I L+++E  + +   + + 
Sbjct: 121 HNIPAVKFLTSSVSMLAYFCHLFMKPGIEFP-------FPAIYLSDFEQAKARTAAQDAR 180

Query: 193 NDARDEERIKECNERSCGMI-LVKSFREIEGKYIDFLSILLRKKVVPVGPLVQEPENDVV 252
            DA + +   E   R C  I LVKS R IEGKYID+L  L++ K++PVG LV+EP  D  
Sbjct: 181 ADAEENDPAAERPNRDCDSIFLVKSSRAIEGKYIDYLFDLMKLKMLPVGMLVEEPVKDDQ 240

Query: 253 SRRRFE--KWLNKKQDSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFIWVVRFPVAGG 312
                E  +WL  K   S +LVSFG+E++L+KE+MEEIA+GLELS V+FIWVVRF +   
Sbjct: 241 GDNSNELIQWLGTKSQRSTVLVSFGTEYFLTKEEMEEIAHGLELSEVNFIWVVRFAMG-- 300

Query: 313 GERKKNVEEELPKGFIERVRERGMVVEGWVPQAQILKHRTTGGFLSHCGWSSVMESIKFG 372
             +K   +E LP+GF+ERV +RG +VEGW PQ+++L H +TGGF+ HCGW+SV+ESI+FG
Sbjct: 301 --QKIRPDEALPEGFLERVGDRGRIVEGWAPQSEVLAHPSTGGFICHCGWNSVVESIEFG 360

Query: 373 VPIIAAPMQLDQPLNARLVEWLDVGVVIERD-NGRLRRQEVARVVKEVMVEKMGERVRKK 432
           VP+IA PM LDQPLNARLV  +  G+ + RD  G+  R+E+AR +K+ MVEK GE  R K
Sbjct: 361 VPVIAMPMHLDQPLNARLVVEIGAGMEVVRDETGKFDRKEIARAIKDAMVEKTGENTRAK 420

Query: 433 VKEFAEMLKKKGDEEMDMVVEELVKL 450
           + +    ++ K  +E+D V E L +L
Sbjct: 421 MLDVKGRVELKEKQELDEVAELLTQL 435

BLAST of CmoCh04G026470 vs. ExPASy Swiss-Prot
Match: Q8GVE3 (Flavanone 7-O-glucoside 2''-O-beta-L-rhamnosyltransferase OS=Citrus maxima OX=37334 GN=C12RT1 PE=1 SV=2)

HSP 1 Score: 374.0 bits (959), Expect = 2.5e-102
Identity = 199/462 (43.07%), Positives = 291/462 (62.99%), Query Frame = 0

Query: 1   MEGNRHGKTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVIINSIQSNLTRDLS 60
           M+     K S+LMLPWLAHGH++P  ELAK L ++NFHIYFCST   + S   N+ ++ S
Sbjct: 1   MDTKHQDKPSILMLPWLAHGHIAPHLELAKKLSQKNFHIYFCSTPNNLQSFGRNVEKNFS 60

Query: 61  SDIELVELKLP-TSSDLPPYRHTTAGLPPHLMFSLKRAFDSAAATFSIILHNLNPDLVIY 120
           S I+L+EL+LP T  +LP    TT  LPPHL+++L  AF+ A   F  IL  L P LV+Y
Sbjct: 61  SSIQLIELQLPNTFPELPSQNQTTKNLPPHLIYTLVGAFEDAKPAFCNILETLKPTLVMY 120

Query: 121 DFLQPWAPTVARSSHIPAVMFQPTGALMAAMVKYELEYPSSDLSSIFPDIRLTEYEIKQV 180
           D  QPWA   A    I A++F P  A+  + + + +  PS  L   F +    + E K +
Sbjct: 121 DLFQPWAAEAAYQYDIAAILFLPLSAVACSFLLHNIVNPS--LKYPFFESDYQDRESKNI 180

Query: 181 KNLFRSSVNDARDEERIKECNERSCGMILVKSFREIEGKYIDFLSILLRKKVVPVGPLVQ 240
                 + N   +++R  +  E SC  + +K+ REIE KY+D+   L+  +++PVGPL+Q
Sbjct: 181 NYFLHLTANGTLNKDRFLKAFELSCKFVFIKTSREIESKYLDYFPSLMGNEIIPVGPLIQ 240

Query: 241 EP---ENDVVSRRRFEKWLNKKQDSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFIWV 300
           EP   E+D     +   WL++K+  S +  SFGSE++ SK+++ EIA GL LS V+FIW 
Sbjct: 241 EPTFKEDDT----KIMDWLSQKEPRSVVYASFGSEYFPSKDEIHEIASGLLLSEVNFIWA 300

Query: 301 VRFPVAGGGERKKNVEEELPKGFIERV--RERGMVVEGWVPQAQILKHRTTGGFLSHCGW 360
            R       + K  +EE LP+GF E +    +GM+V+GWVPQA+IL+H + GGFLSHCGW
Sbjct: 301 FRL----HPDEKMTIEEALPQGFAEEIERNNKGMIVQGWVPQAKILRHGSIGGFLSHCGW 360

Query: 361 SSVMESIKFGVPIIAAPMQLDQPLNARLVEWLDVGVVIERD--NGRLRRQEVARVVKEVM 420
            SV+E + FGVPII  PM  +QP NA++V    +G+V+ RD  N RL  +EVARV+K V+
Sbjct: 361 GSVVEGMVFGVPIIGVPMAYEQPSNAKVVVDNGMGMVVPRDKINQRLGGEEVARVIKHVV 420

Query: 421 VEKMGERVRKKVKEFAEMLKKKGDEEMDMVVEELVKLCKSNK 455
           +++  +++R+K  E +E +KK GD EM +VVE+L++L K ++
Sbjct: 421 LQEEAKQIRRKANEISESMKKIGDAEMSVVVEKLLQLVKKSE 452

BLAST of CmoCh04G026470 vs. ExPASy Swiss-Prot
Match: D4Q9Z5 (Soyasaponin III rhamnosyltransferase OS=Glycine max OX=3847 GN=GmSGT3 PE=1 SV=1)

HSP 1 Score: 214.5 bits (545), Expect = 2.5e-54
Identity = 146/449 (32.52%), Positives = 235/449 (52.34%), Query Frame = 0

Query: 11  VLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVIINSIQSNLTRDLSSDIELVELKL 70
           V MLPWLA GH+ P+FE+AK L ++   + F ++   I+ +     + L   I+LV+L L
Sbjct: 17  VAMLPWLAMGHIYPYFEVAKILAQKGHFVTFINSPKNIDRMPKT-PKHLEPFIKLVKLPL 76

Query: 71  PTSSDLPPYRHTTAGLPPHLMFSLKRAFDSAAATFSIILHNLNPDLVIYDFLQPWAPTVA 130
           P    LP    +T  +P      LK+A++      S +L   NPD V+YDF   W   +A
Sbjct: 77  PKIEHLPEGAESTMDIPSKKNCFLKKAYEGLQYAVSKLLKTSNPDWVLYDFAAAWVIPIA 136

Query: 131 RSSHIPAVMFQPTGALMAA--------MVKYELEY---PSSDLSSIFPDIRLTEYEIKQV 190
           +S +IP   +  T A            M  Y L     P + L      I +  YE  + 
Sbjct: 137 KSYNIPCAHYNITPAFNKVFFDPPKDKMKDYSLASICGPPTWL-PFTTTIHIRPYEFLRA 196

Query: 191 KNLFRSSVNDARDEERIKECNE--RSCGMILVKSFREIEGKYIDFLSILLRKKVVPVG-- 250
              +  + ++   E    + N+   SC + L+++ RE+EG ++D+L+   +  VVPVG  
Sbjct: 197 ---YEGTKDEETGERASFDLNKAYSSCDLFLLRTSRELEGDWLDYLAGNYKVPVVPVGLL 256

Query: 251 -PLVQ----EPENDVVSRRRFEKWLNKKQDSSCLLVSFGSEFYLSKEDMEEIAYGLELSH 310
            P +Q    E E++     R + WL+ ++ SS + + FGSE  LS+ED+ E+A+G+ELS+
Sbjct: 257 PPSMQIRDVEEEDNNPDWVRIKDWLDTQESSSVVYIGFGSELKLSQEDLTELAHGIELSN 316

Query: 311 VDFIWVVRFPVAGGGERKKNVEE---ELPKGFIERVRERGMVVEGWVPQAQILKHRTTGG 370
           + F W +           KN++E   ELP+GF ER +ERG+V + W PQ +IL H   GG
Sbjct: 317 LPFFWAL-----------KNLKEGVLELPEGFEERTKERGIVWKTWAPQLKILAHGAIGG 376

Query: 371 FLSHCGWSSVMESIKFGVPIIAAPMQLDQPLNARLVEWLDVGVVIERD--NGRLRRQEVA 430
            +SHCG  SV+E + FG  ++  P  LDQ L +R++E   V V + R   +G   R +VA
Sbjct: 377 CMSHCGSGSVIEKVHFGHVLVTLPYLLDQCLFSRVLEEKQVAVEVPRSEKDGSFTRVDVA 436

Query: 431 RVVKEVMVEKMGERVRKKVKEFAEMLKKK 435
           + ++  +V++ G  +R+  KE  ++   +
Sbjct: 437 KTLRFAIVDEEGSALRENAKEMGKVFSSE 449

BLAST of CmoCh04G026470 vs. ExPASy TrEMBL
Match: A0A6J1H6Y0 (Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111460675 PE=3 SV=1)

HSP 1 Score: 950.7 bits (2456), Expect = 2.4e-273
Identity = 475/475 (100.00%), Positives = 475/475 (100.00%), Query Frame = 0

Query: 1   MEGNRHGKTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVIINSIQSNLTRDLS 60
           MEGNRHGKTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVIINSIQSNLTRDLS
Sbjct: 4   MEGNRHGKTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVIINSIQSNLTRDLS 63

Query: 61  SDIELVELKLPTSSDLPPYRHTTAGLPPHLMFSLKRAFDSAAATFSIILHNLNPDLVIYD 120
           SDIELVELKLPTSSDLPPYRHTTAGLPPHLMFSLKRAFDSAAATFSIILHNLNPDLVIYD
Sbjct: 64  SDIELVELKLPTSSDLPPYRHTTAGLPPHLMFSLKRAFDSAAATFSIILHNLNPDLVIYD 123

Query: 121 FLQPWAPTVARSSHIPAVMFQPTGALMAAMVKYELEYPSSDLSSIFPDIRLTEYEIKQVK 180
           FLQPWAPTVARSSHIPAVMFQPTGALMAAMVKYELEYPSSDLSSIFPDIRLTEYEIKQVK
Sbjct: 124 FLQPWAPTVARSSHIPAVMFQPTGALMAAMVKYELEYPSSDLSSIFPDIRLTEYEIKQVK 183

Query: 181 NLFRSSVNDARDEERIKECNERSCGMILVKSFREIEGKYIDFLSILLRKKVVPVGPLVQE 240
           NLFRSSVNDARDEERIKECNERSCGMILVKSFREIEGKYIDFLSILLRKKVVPVGPLVQE
Sbjct: 184 NLFRSSVNDARDEERIKECNERSCGMILVKSFREIEGKYIDFLSILLRKKVVPVGPLVQE 243

Query: 241 PENDVVSRRRFEKWLNKKQDSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFIWVVRFP 300
           PENDVVSRRRFEKWLNKKQDSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFIWVVRFP
Sbjct: 244 PENDVVSRRRFEKWLNKKQDSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFIWVVRFP 303

Query: 301 VAGGGERKKNVEEELPKGFIERVRERGMVVEGWVPQAQILKHRTTGGFLSHCGWSSVMES 360
           VAGGGERKKNVEEELPKGFIERVRERGMVVEGWVPQAQILKHRTTGGFLSHCGWSSVMES
Sbjct: 304 VAGGGERKKNVEEELPKGFIERVRERGMVVEGWVPQAQILKHRTTGGFLSHCGWSSVMES 363

Query: 361 IKFGVPIIAAPMQLDQPLNARLVEWLDVGVVIERDNGRLRRQEVARVVKEVMVEKMGERV 420
           IKFGVPIIAAPMQLDQPLNARLVEWLDVGVVIERDNGRLRRQEVARVVKEVMVEKMGERV
Sbjct: 364 IKFGVPIIAAPMQLDQPLNARLVEWLDVGVVIERDNGRLRRQEVARVVKEVMVEKMGERV 423

Query: 421 RKKVKEFAEMLKKKGDEEMDMVVEELVKLCKSNKEDNLESHWCRPAIDSHFCEPR 476
           RKKVKEFAEMLKKKGDEEMDMVVEELVKLCKSNKEDNLESHWCRPAIDSHFCEPR
Sbjct: 424 RKKVKEFAEMLKKKGDEEMDMVVEELVKLCKSNKEDNLESHWCRPAIDSHFCEPR 478

BLAST of CmoCh04G026470 vs. ExPASy TrEMBL
Match: A0A6J1JJU7 (Glycosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111487574 PE=3 SV=1)

HSP 1 Score: 909.8 bits (2350), Expect = 4.7e-261
Identity = 452/475 (95.16%), Positives = 461/475 (97.05%), Query Frame = 0

Query: 1   MEGNRHGKTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVIINSIQSNLTRDLS 60
           ME NRHGKTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTS+I+NSIQ NLTRDL 
Sbjct: 1   MESNRHGKTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSIILNSIQPNLTRDLF 60

Query: 61  SDIELVELKLPTSSDLPPYRHTTAGLPPHLMFSLKRAFDSAAATFSIILHNLNPDLVIYD 120
           SDIELVELKLPTSSDLP Y HTTAGLPPHLMFSLK+AFDSAA+ FSIILHNLNPDLVIYD
Sbjct: 61  SDIELVELKLPTSSDLPQYCHTTAGLPPHLMFSLKQAFDSAASAFSIILHNLNPDLVIYD 120

Query: 121 FLQPWAPTVARSSHIPAVMFQPTGALMAAMVKYELEYPSSDLSSIFPDIRLTEYEIKQVK 180
           FLQPWAP VARSSHIPAVMFQPTGALMAAMVKYELEYP S+LSSIFP+IRLTEYEIKQVK
Sbjct: 121 FLQPWAPAVARSSHIPAVMFQPTGALMAAMVKYELEYPGSNLSSIFPEIRLTEYEIKQVK 180

Query: 181 NLFRSSVNDARDEERIKECNERSCGMILVKSFREIEGKYIDFLSILLRKKVVPVGPLVQE 240
           NLFRSSVNDARDEERIKECNERSCGMILVKSFREIEGKYIDFLSILLRKKVVPVGPLVQE
Sbjct: 181 NLFRSSVNDARDEERIKECNERSCGMILVKSFREIEGKYIDFLSILLRKKVVPVGPLVQE 240

Query: 241 PENDVVSRRRFEKWLNKKQDSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFIWVVRFP 300
           PENDVVS  RFEKWLNKK+DSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFIWVVRFP
Sbjct: 241 PENDVVSGSRFEKWLNKKEDSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFIWVVRFP 300

Query: 301 VAGGGERKKNVEEELPKGFIERVRERGMVVEGWVPQAQILKHRTTGGFLSHCGWSSVMES 360
           VAGGGERKKNVEEELPKGF+ERVRERGMVVEGWVPQAQILKHRTTGGFLSHCGWSSVMES
Sbjct: 301 VAGGGERKKNVEEELPKGFLERVRERGMVVEGWVPQAQILKHRTTGGFLSHCGWSSVMES 360

Query: 361 IKFGVPIIAAPMQLDQPLNARLVEWLDVGVVIERDNGRLRRQEVARVVKEVMVEKMGERV 420
           IKFGVPIIAAPMQLDQPLNARLVEWLD GVVIERDNGRL  QEVARVVKEVMVEKMGERV
Sbjct: 361 IKFGVPIIAAPMQLDQPLNARLVEWLDAGVVIERDNGRLHHQEVARVVKEVMVEKMGERV 420

Query: 421 RKKVKEFAEMLKKKGDEEMDMVVEELVKLCKSNKEDNLESHWCRPAIDSHFCEPR 476
           RKKVKEFAEMLKKKGDEEMDMVVEELVKLCK NKEDNL+SHWCRPAIDSHFCEPR
Sbjct: 421 RKKVKEFAEMLKKKGDEEMDMVVEELVKLCKRNKEDNLQSHWCRPAIDSHFCEPR 475

BLAST of CmoCh04G026470 vs. ExPASy TrEMBL
Match: A0A1S3C496 (Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103496497 PE=3 SV=1)

HSP 1 Score: 660.2 bits (1702), Expect = 6.4e-186
Identity = 338/483 (69.98%), Positives = 408/483 (84.47%), Query Frame = 0

Query: 1   MEGNRHGKTSV---LMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVIINSIQSNLTR 60
           ++G++  +T V   LMLPWLAHGHVSPF EL+K L  +NFHI+FCSTS+I++SIQS L +
Sbjct: 3   LDGHQRNETKVMKILMLPWLAHGHVSPFLELSKLLATKNFHIFFCSTSIILHSIQSKLPQ 62

Query: 61  DL--SSDIELVELKLPTSSDLPPYRHTTAGLPPHLMFSLKRAFDSAAATFSIILHNLNPD 120
           +L  SS+IELVEL LPTS+DLP  RHTTAGLPPHLMFSLKRAFDSAA+ F  I+ NL PD
Sbjct: 63  NLLSSSNIELVELTLPTSADLPRCRHTTAGLPPHLMFSLKRAFDSAASAFDSIVRNLRPD 122

Query: 121 LVIYDFLQPWAPTVARSSHIPAVMFQPTGALMAAMVKYELEYPSSDLSSIFPDIRLTEYE 180
           LVIYDFLQPWAP VA S+ IPAVMFQ TGALMAA+V   L++P+SD  S+FP+IRL+ +E
Sbjct: 123 LVIYDFLQPWAPAVALSADIPAVMFQCTGALMAALVTNMLKFPNSDFPSMFPEIRLSVFE 182

Query: 181 IKQVKNLFRSSVNDARDEERIKECNERSCGMILVKSFREIEGKYIDFLSILLRKKVVPVG 240
           IKQ+KNLFRSSVNDA+D++RI+EC ERSCG++L+KSFREIE KYIDFLS  L+ KV+PVG
Sbjct: 183 IKQLKNLFRSSVNDAKDKQRIQECYERSCGILLLKSFREIEAKYIDFLSTSLQIKVIPVG 242

Query: 241 PLVQEPENDV-VSRRRFEKWLNKKQDSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFI 300
           PLV+E + D+ V     EKWLNKK+  SC+LVSFGSEFYLSK DMEEIA+GLELSH++FI
Sbjct: 243 PLVEEQDEDIEVLAESIEKWLNKKEKKSCILVSFGSEFYLSKGDMEEIAHGLELSHLNFI 302

Query: 301 WVVRFPVAGGGERKK--NVEEELPKGFIERVRERGMVVEGWVPQAQILKHRTTGGFLSHC 360
           WVVRFP +G GERKK  NVEEELPKGF+ERV ERGMVVE WVPQAQILKHR+TGGFLSHC
Sbjct: 303 WVVRFPASGEGERKKRNNVEEELPKGFLERVGERGMVVEEWVPQAQILKHRSTGGFLSHC 362

Query: 361 GWSSVMESIKFGVPIIAAPMQLDQPLNARLVEWLDVGVVIERD-NGRLRRQEVARVVKEV 420
           GWSSV+ES+KFGVPIIAAPMQLDQPLNARLVE L VGVV+ER   GRL   EVAR V+EV
Sbjct: 363 GWSSVLESLKFGVPIIAAPMQLDQPLNARLVEHLGVGVVVERSCGGRLCWTEVARAVREV 422

Query: 421 MVEKMGERVRKKVKEFAEMLKKKGD-EEMDMVVEELVKLCKSNKEDNLESHWCRPAIDSH 474
           + E+ G+ VR+K+KEFA+++K+KGD +EM++V EE+ KLC+  K+  L+S+WCR ++DSH
Sbjct: 423 VAEESGKGVREKMKEFAKIMKEKGDKDEMEVVAEEITKLCR-RKKKGLQSNWCRTSMDSH 482

BLAST of CmoCh04G026470 vs. ExPASy TrEMBL
Match: A0A0A0KE59 (Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_6G045050 PE=3 SV=1)

HSP 1 Score: 650.6 bits (1677), Expect = 5.1e-183
Identity = 337/475 (70.95%), Positives = 400/475 (84.21%), Query Frame = 0

Query: 8   KTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVIINSIQSNLTRDL--SSDIEL 67
           K  +LMLPWLAHGHVSPF EL+K L  +NFHI+FCSTS+I++SI+S L + L  SS+I+L
Sbjct: 12  KMKILMLPWLAHGHVSPFLELSKLLATKNFHIFFCSTSIILHSIRSKLPQKLLSSSNIQL 71

Query: 68  VELKLPTSSDLPPYRHTTAGLPPHLMFSLKRAFDSAAATFSIILHNLNPDLVIYDFLQPW 127
           VEL LPTS+DLP +RHTTAGLP HLMFSLKRAFDSAA+ F  IL NL PDLVIYDFLQPW
Sbjct: 72  VELTLPTSADLPRWRHTTAGLPSHLMFSLKRAFDSAASAFDGILQNLKPDLVIYDFLQPW 131

Query: 128 APTVARSSHIPAVMFQPTGALMAAMVKYELEYPSSDLSSIFPDIRLTEYEIKQVKNLFRS 187
           AP VA S++IPAVMFQ TGALMAAMV   L++P+SD  S FP+I L+E+EIKQ+KNLF+S
Sbjct: 132 APAVALSANIPAVMFQCTGALMAAMVTNMLKFPNSDFLSTFPEIHLSEFEIKQLKNLFKS 191

Query: 188 SVNDARDEERIKECNERSCGMILVKSFREIEGKYIDFLSILLRKKVVPVGPLVQEPEND- 247
           SVNDA+D++RI+EC +RSCG++L+KS REIE KYIDF+S  L+ K +PVGPLV+E E D 
Sbjct: 192 SVNDAKDKQRIEECYKRSCGILLLKSLREIEAKYIDFVSTSLQIKAIPVGPLVEEQEEDI 251

Query: 248 VVSRRRFEKWLNKKQDSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFIWVVRFPVAG- 307
           VV    FEKWLNKK+  SC+LVSFGSEFYLSK DMEEIA+GLELSHV+FIWVVRFP +G 
Sbjct: 252 VVLAESFEKWLNKKEKRSCILVSFGSEFYLSKGDMEEIAHGLELSHVNFIWVVRFPGSGE 311

Query: 308 GGERKKN---VEEELPKGFIERVRERGMVVEGWVPQAQILKHRTTGGFLSHCGWSSVMES 367
            GERKK    VEEELPKGF+ERV ERGMVVE WVPQ QILKHR+TGGFLSHCGWSSV+ES
Sbjct: 312 QGERKKKKNVVEEELPKGFLERVGERGMVVEEWVPQVQILKHRSTGGFLSHCGWSSVLES 371

Query: 368 IKFGVPIIAAPMQLDQPLNARLVEWLDVGVVIER-DNGRLRRQEVARVVKEVMVEKMGER 427
           IK GVPIIAAPMQLDQPLNARLVE L VGVV+ER D GRL R+EVAR V+EV+ E+ G+R
Sbjct: 372 IKSGVPIIAAPMQLDQPLNARLVEHLGVGVVVERSDGGRLCRREVARAVREVVAEESGKR 431

Query: 428 VRKKVKEFAEMLKKKGDE-EMDMVVEELVKLCKSNKEDNLESHWCRPAIDSHFCE 474
           VR+KVKE A+++K+KGDE EM++VVEE+ KLC+  K   L+S+WCR ++DSH CE
Sbjct: 432 VREKVKEVAKIMKEKGDEGEMEVVVEEITKLCR-RKRKGLQSNWCRTSMDSHCCE 485

BLAST of CmoCh04G026470 vs. ExPASy TrEMBL
Match: A0A6J1BWM7 (Glycosyltransferase OS=Momordica charantia OX=3673 GN=LOC111006190 PE=3 SV=1)

HSP 1 Score: 627.9 bits (1618), Expect = 3.5e-176
Identity = 318/450 (70.67%), Positives = 380/450 (84.44%), Query Frame = 0

Query: 4   NRHG-KTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVIINSIQSNLTRDLSSD 63
           +R+G + S+LMLPWLAHGHVSPFFELAK L  +NFH++FCST+V + S+Q  LT +L   
Sbjct: 6   HRNGRRMSILMLPWLAHGHVSPFFELAKLLAAKNFHVFFCSTAVNLRSVQPKLTPNL--- 65

Query: 64  IELVELKLPTSSDLPPYRHTTAGLPPHLMFSLKRAFDSAAATFSIILHNLNPDLVIYDFL 123
            E VEL+LP S +LPP RHTTAGLPPHLMFSLK AFD+AA  F+ +L  L PDL+IYDFL
Sbjct: 66  -ETVELRLPASPELPPDRHTTAGLPPHLMFSLKGAFDAAAPAFAAVLRRLAPDLLIYDFL 125

Query: 124 QPWAPTVARSSHIPAVMFQPTGALMAAMVKYELEYPSSDLSSIFPDIRLTEYEIKQVKNL 183
           QPWAP  A ++ IPAVMF  T ALM A V + LE+ S++L S+FP+IR +EYEI+Q+KN 
Sbjct: 126 QPWAPAEAAAAGIPAVMFNNTSALMPATVLHILEFQSAELFSLFPEIRCSEYEIRQLKNF 185

Query: 184 FRSSVNDARDEERIKECNERSCGMILVKSFREIEGKYIDFLSILLRKKVVPVGPLVQEPE 243
           F SSVNDA+D+ER+  C ERSCG++LVKSFREIEGKYIDFLS LL KKV+PVGPLV+EPE
Sbjct: 186 FGSSVNDAKDKERVVGCWERSCGIVLVKSFREIEGKYIDFLSTLLHKKVIPVGPLVEEPE 245

Query: 244 NDVVSRRRFEKWLNKKQDSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFIWVVRFPVA 303
           +D VS   FE+WLNKK +SS +LVSFGSEFYLSK+DMEEIAYGLELSHV+FIWVVRF V 
Sbjct: 246 DDGVS-STFEEWLNKKDESSSILVSFGSEFYLSKQDMEEIAYGLELSHVNFIWVVRFAV- 305

Query: 304 GGGERKKNVEEELPKGFIERVRERGMVVEGWVPQAQILKHRTTGGFLSHCGWSSVMESIK 363
           GGGERKK +EEELPKGF+ERV++RGMVVEGW PQ QIL+HR+TGGFLSHCGWSSV+ESIK
Sbjct: 306 GGGERKKAMEEELPKGFLERVKDRGMVVEGWAPQTQILRHRSTGGFLSHCGWSSVLESIK 365

Query: 364 FGVPIIAAPMQLDQPLNARLVEWLDVGVVIER---DNGRLRRQEVARVVKEVMVEKMGER 423
           FGVPIIAAPM LDQPLNARL+E LDV +++ER   + G LRR EVAR +KEV+V+K GER
Sbjct: 366 FGVPIIAAPMHLDQPLNARLIECLDVSIIVERGGSNGGSLRRGEVARAIKEVVVDKSGER 425

Query: 424 VRKKVKEFAEMLKKKGDEEMDMVVEELVKL 450
           +RKK KE A+M+KKKG+EEM++VVEELVKL
Sbjct: 426 MRKKAKELAKMMKKKGNEEMEVVVEELVKL 449

BLAST of CmoCh04G026470 vs. NCBI nr
Match: XP_022959665.1 (cyanidin-3-O-glucoside 2-O-glucuronosyltransferase-like, partial [Cucurbita moschata])

HSP 1 Score: 950.7 bits (2456), Expect = 4.9e-273
Identity = 475/475 (100.00%), Positives = 475/475 (100.00%), Query Frame = 0

Query: 1   MEGNRHGKTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVIINSIQSNLTRDLS 60
           MEGNRHGKTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVIINSIQSNLTRDLS
Sbjct: 4   MEGNRHGKTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVIINSIQSNLTRDLS 63

Query: 61  SDIELVELKLPTSSDLPPYRHTTAGLPPHLMFSLKRAFDSAAATFSIILHNLNPDLVIYD 120
           SDIELVELKLPTSSDLPPYRHTTAGLPPHLMFSLKRAFDSAAATFSIILHNLNPDLVIYD
Sbjct: 64  SDIELVELKLPTSSDLPPYRHTTAGLPPHLMFSLKRAFDSAAATFSIILHNLNPDLVIYD 123

Query: 121 FLQPWAPTVARSSHIPAVMFQPTGALMAAMVKYELEYPSSDLSSIFPDIRLTEYEIKQVK 180
           FLQPWAPTVARSSHIPAVMFQPTGALMAAMVKYELEYPSSDLSSIFPDIRLTEYEIKQVK
Sbjct: 124 FLQPWAPTVARSSHIPAVMFQPTGALMAAMVKYELEYPSSDLSSIFPDIRLTEYEIKQVK 183

Query: 181 NLFRSSVNDARDEERIKECNERSCGMILVKSFREIEGKYIDFLSILLRKKVVPVGPLVQE 240
           NLFRSSVNDARDEERIKECNERSCGMILVKSFREIEGKYIDFLSILLRKKVVPVGPLVQE
Sbjct: 184 NLFRSSVNDARDEERIKECNERSCGMILVKSFREIEGKYIDFLSILLRKKVVPVGPLVQE 243

Query: 241 PENDVVSRRRFEKWLNKKQDSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFIWVVRFP 300
           PENDVVSRRRFEKWLNKKQDSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFIWVVRFP
Sbjct: 244 PENDVVSRRRFEKWLNKKQDSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFIWVVRFP 303

Query: 301 VAGGGERKKNVEEELPKGFIERVRERGMVVEGWVPQAQILKHRTTGGFLSHCGWSSVMES 360
           VAGGGERKKNVEEELPKGFIERVRERGMVVEGWVPQAQILKHRTTGGFLSHCGWSSVMES
Sbjct: 304 VAGGGERKKNVEEELPKGFIERVRERGMVVEGWVPQAQILKHRTTGGFLSHCGWSSVMES 363

Query: 361 IKFGVPIIAAPMQLDQPLNARLVEWLDVGVVIERDNGRLRRQEVARVVKEVMVEKMGERV 420
           IKFGVPIIAAPMQLDQPLNARLVEWLDVGVVIERDNGRLRRQEVARVVKEVMVEKMGERV
Sbjct: 364 IKFGVPIIAAPMQLDQPLNARLVEWLDVGVVIERDNGRLRRQEVARVVKEVMVEKMGERV 423

Query: 421 RKKVKEFAEMLKKKGDEEMDMVVEELVKLCKSNKEDNLESHWCRPAIDSHFCEPR 476
           RKKVKEFAEMLKKKGDEEMDMVVEELVKLCKSNKEDNLESHWCRPAIDSHFCEPR
Sbjct: 424 RKKVKEFAEMLKKKGDEEMDMVVEELVKLCKSNKEDNLESHWCRPAIDSHFCEPR 478

BLAST of CmoCh04G026470 vs. NCBI nr
Match: KAG7032946.1 (Beta-D-glucosyl crocetin beta-1,6-glucosyltransferase, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 917.1 bits (2369), Expect = 6.0e-263
Identity = 463/491 (94.30%), Positives = 468/491 (95.32%), Query Frame = 0

Query: 1   MEGNRHGKTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVIINSIQSNLTRDLS 60
           MEGNRHGKTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVI+NSIQ NLTRDLS
Sbjct: 1   MEGNRHGKTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVILNSIQPNLTRDLS 60

Query: 61  SDIELVELKLPTSSDLPPYRHTTAGLPPHLMFSLKRAFDSAAATFSIILHNLNPDLVIYD 120
           SDIELVELKLPTSSDLPP+RHTTAGLPPHLMFSLKRAFDSAAATFSIILHNLNPDLVIYD
Sbjct: 61  SDIELVELKLPTSSDLPPHRHTTAGLPPHLMFSLKRAFDSAAATFSIILHNLNPDLVIYD 120

Query: 121 FLQPWAPTVARSSHIPAVMFQPTGALMAAMVKYELEYPSSDLSSIFPDIRLTEYEIKQVK 180
           FLQPWAPTVARSSHIPA+MFQPTGALMAAMVKYELEYP SDLSSIFP+IRLTEYEIKQVK
Sbjct: 121 FLQPWAPTVARSSHIPAIMFQPTGALMAAMVKYELEYPGSDLSSIFPNIRLTEYEIKQVK 180

Query: 181 NLFRSSVNDARDEERIKECNERSCGMILVKSFREIEGKYIDFLSILLRKKVVPVGPLVQE 240
           NLFRSSVND RDEERIKECNERSCGMILVKSFREIEGKYIDFLSILLRKKVVPVGPLVQE
Sbjct: 181 NLFRSSVNDTRDEERIKECNERSCGMILVKSFREIEGKYIDFLSILLRKKVVPVGPLVQE 240

Query: 241 PENDVVSRRRFEKWLNKKQDSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFIWVVRFP 300
           P NDVVS  RFEKWLNKK+DSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFIWVVRFP
Sbjct: 241 PGNDVVSGSRFEKWLNKKEDSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFIWVVRFP 300

Query: 301 VAGGGERKKNVEEELPKGFIERVRERGMVVEGWVPQAQILKHRTTGGFLSHCGWSSVMES 360
           VAGGGERKKNVEEELPKGFIERVRERGMVVEGWVPQAQILKHRTTGGFLSHCGWSSVMES
Sbjct: 301 VAGGGERKKNVEEELPKGFIERVRERGMVVEGWVPQAQILKHRTTGGFLSHCGWSSVMES 360

Query: 361 IKFGVPIIAAPMQLDQPLNARLVEWLDVGVVIERDNGRLRRQEVARVVKE---------- 420
           IKFGVPIIAAPMQLDQPLNARLVEWLDVGVVIERDNGRLRRQEVARVVKE          
Sbjct: 361 IKFGVPIIAAPMQLDQPLNARLVEWLDVGVVIERDNGRLRRQEVARVVKEVIIGINFDIL 420

Query: 421 ------VMVEKMGERVRKKVKEFAEMLKKKGDEEMDMVVEELVKLCKSNKEDNLESHWCR 476
                 VMVEKMGERVRKKVKEFAEMLKKKGDEEMDMVVEELVKLCKSNKEDNLESHW R
Sbjct: 421 LVFLNKVMVEKMGERVRKKVKEFAEMLKKKGDEEMDMVVEELVKLCKSNKEDNLESHWYR 480

BLAST of CmoCh04G026470 vs. NCBI nr
Match: XP_023543159.1 (cyanidin-3-O-glucoside 2-O-glucuronosyltransferase-like, partial [Cucurbita pepo subsp. pepo])

HSP 1 Score: 911.4 bits (2354), Expect = 3.3e-261
Identity = 453/475 (95.37%), Positives = 464/475 (97.68%), Query Frame = 0

Query: 1   MEGNRHGKTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVIINSIQSNLTRDLS 60
           MEGNRHGKTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCST+ I+NSIQ NLTRDLS
Sbjct: 4   MEGNRHGKTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTTTILNSIQPNLTRDLS 63

Query: 61  SDIELVELKLPTSSDLPPYRHTTAGLPPHLMFSLKRAFDSAAATFSIILHNLNPDLVIYD 120
           SDIELVELKLPTSSDLPP+RHTTAGLPPHLMFSLKRAFDSAA TFSIIL NLNPDLVIYD
Sbjct: 64  SDIELVELKLPTSSDLPPHRHTTAGLPPHLMFSLKRAFDSAATTFSIILRNLNPDLVIYD 123

Query: 121 FLQPWAPTVARSSHIPAVMFQPTGALMAAMVKYELEYPSSDLSSIFPDIRLTEYEIKQVK 180
           FLQPWAPTVA+SS+IPAVMFQPTGALMAAMVKYELEYP SDLSSIFP+IRLTEYEIKQVK
Sbjct: 124 FLQPWAPTVAQSSYIPAVMFQPTGALMAAMVKYELEYPGSDLSSIFPEIRLTEYEIKQVK 183

Query: 181 NLFRSSVNDARDEERIKECNERSCGMILVKSFREIEGKYIDFLSILLRKKVVPVGPLVQE 240
           NLFRSSVNDARDEERIK CNERSCGMILVKSFREIEGKYIDFLSILLRKKVVPVGPLVQE
Sbjct: 184 NLFRSSVNDARDEERIKGCNERSCGMILVKSFREIEGKYIDFLSILLRKKVVPVGPLVQE 243

Query: 241 PENDVVSRRRFEKWLNKKQDSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFIWVVRFP 300
           P NDVVS  RFEKWLNKKQDSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFIWVVRFP
Sbjct: 244 PGNDVVSGSRFEKWLNKKQDSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFIWVVRFP 303

Query: 301 VAGGGERKKNVEEELPKGFIERVRERGMVVEGWVPQAQILKHRTTGGFLSHCGWSSVMES 360
           VAGGGERKKNVEEELPKGF+ERVRERGMVVEGWVPQAQILKHRTTGGFLSHCGWSSVMES
Sbjct: 304 VAGGGERKKNVEEELPKGFLERVRERGMVVEGWVPQAQILKHRTTGGFLSHCGWSSVMES 363

Query: 361 IKFGVPIIAAPMQLDQPLNARLVEWLDVGVVIERDNGRLRRQEVARVVKEVMVEKMGERV 420
           IKFGVPIIAAPMQLDQPLNARLVEWLDVGVV+ERDNGRLRRQEVARVVKEVMVEKMGERV
Sbjct: 364 IKFGVPIIAAPMQLDQPLNARLVEWLDVGVVVERDNGRLRRQEVARVVKEVMVEKMGERV 423

Query: 421 RKKVKEFAEMLKKKGDEEMDMVVEELVKLCKSNKEDNLESHWCRPAIDSHFCEPR 476
           RKKVKEFAEMLKKKG+EEMDMVVEELVKLCK NKED+L+SHWC PAIDSHFCEPR
Sbjct: 424 RKKVKEFAEMLKKKGEEEMDMVVEELVKLCKRNKEDHLQSHWCSPAIDSHFCEPR 478

BLAST of CmoCh04G026470 vs. NCBI nr
Match: XP_022990792.1 (cyanidin-3-O-glucoside 2-O-glucuronosyltransferase-like [Cucurbita maxima])

HSP 1 Score: 909.8 bits (2350), Expect = 9.6e-261
Identity = 452/475 (95.16%), Positives = 461/475 (97.05%), Query Frame = 0

Query: 1   MEGNRHGKTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVIINSIQSNLTRDLS 60
           ME NRHGKTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTS+I+NSIQ NLTRDL 
Sbjct: 1   MESNRHGKTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSIILNSIQPNLTRDLF 60

Query: 61  SDIELVELKLPTSSDLPPYRHTTAGLPPHLMFSLKRAFDSAAATFSIILHNLNPDLVIYD 120
           SDIELVELKLPTSSDLP Y HTTAGLPPHLMFSLK+AFDSAA+ FSIILHNLNPDLVIYD
Sbjct: 61  SDIELVELKLPTSSDLPQYCHTTAGLPPHLMFSLKQAFDSAASAFSIILHNLNPDLVIYD 120

Query: 121 FLQPWAPTVARSSHIPAVMFQPTGALMAAMVKYELEYPSSDLSSIFPDIRLTEYEIKQVK 180
           FLQPWAP VARSSHIPAVMFQPTGALMAAMVKYELEYP S+LSSIFP+IRLTEYEIKQVK
Sbjct: 121 FLQPWAPAVARSSHIPAVMFQPTGALMAAMVKYELEYPGSNLSSIFPEIRLTEYEIKQVK 180

Query: 181 NLFRSSVNDARDEERIKECNERSCGMILVKSFREIEGKYIDFLSILLRKKVVPVGPLVQE 240
           NLFRSSVNDARDEERIKECNERSCGMILVKSFREIEGKYIDFLSILLRKKVVPVGPLVQE
Sbjct: 181 NLFRSSVNDARDEERIKECNERSCGMILVKSFREIEGKYIDFLSILLRKKVVPVGPLVQE 240

Query: 241 PENDVVSRRRFEKWLNKKQDSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFIWVVRFP 300
           PENDVVS  RFEKWLNKK+DSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFIWVVRFP
Sbjct: 241 PENDVVSGSRFEKWLNKKEDSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFIWVVRFP 300

Query: 301 VAGGGERKKNVEEELPKGFIERVRERGMVVEGWVPQAQILKHRTTGGFLSHCGWSSVMES 360
           VAGGGERKKNVEEELPKGF+ERVRERGMVVEGWVPQAQILKHRTTGGFLSHCGWSSVMES
Sbjct: 301 VAGGGERKKNVEEELPKGFLERVRERGMVVEGWVPQAQILKHRTTGGFLSHCGWSSVMES 360

Query: 361 IKFGVPIIAAPMQLDQPLNARLVEWLDVGVVIERDNGRLRRQEVARVVKEVMVEKMGERV 420
           IKFGVPIIAAPMQLDQPLNARLVEWLD GVVIERDNGRL  QEVARVVKEVMVEKMGERV
Sbjct: 361 IKFGVPIIAAPMQLDQPLNARLVEWLDAGVVIERDNGRLHHQEVARVVKEVMVEKMGERV 420

Query: 421 RKKVKEFAEMLKKKGDEEMDMVVEELVKLCKSNKEDNLESHWCRPAIDSHFCEPR 476
           RKKVKEFAEMLKKKGDEEMDMVVEELVKLCK NKEDNL+SHWCRPAIDSHFCEPR
Sbjct: 421 RKKVKEFAEMLKKKGDEEMDMVVEELVKLCKRNKEDNLQSHWCRPAIDSHFCEPR 475

BLAST of CmoCh04G026470 vs. NCBI nr
Match: XP_038885902.1 (UDP-glucosyltransferase 29-like [Benincasa hispida])

HSP 1 Score: 688.7 bits (1776), Expect = 3.5e-194
Identity = 345/475 (72.63%), Positives = 408/475 (85.89%), Query Frame = 0

Query: 3   GNRHGKTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVIINSIQSNLTRDLSSD 62
           G  + K  +LMLPWLAHGHVSPF EL+K L  RNFHI FCSTSVI++SIQS L ++LSS+
Sbjct: 4   GYENEKIRILMLPWLAHGHVSPFLELSKLLATRNFHILFCSTSVILHSIQSKLPQNLSSN 63

Query: 63  IELVELKLPTSSDLPPYRHTTAGLPPHLMFSLKRAFDSAAATFSIILHNLNPDLVIYDFL 122
           IELVEL LPTS+DLPP+RHTT GLP HLMFSLKRAFDSAA+ F  I+ N+ PDL+IYDFL
Sbjct: 64  IELVELTLPTSADLPPHRHTTTGLPSHLMFSLKRAFDSAASAFDAIVRNVRPDLLIYDFL 123

Query: 123 QPWAPTVARSSHIPAVMFQPTGALMAAMVKYELEYPSSDLSSIFPDIRLTEYEIKQVKNL 182
           QPWAP VA S+ IPAVMFQ TGALMAAMV Y L++ +SD+ S FP+IR++E EIKQ+ NL
Sbjct: 124 QPWAPAVALSADIPAVMFQCTGALMAAMVTYGLKFGNSDILSKFPEIRVSELEIKQLNNL 183

Query: 183 FRSSVNDARDEERIKECNERSCGMILVKSFREIEGKYIDFLSILLRKKVVPVGPLVQEPE 242
           FR SVNDA+D++RI+ECNERSCG++ +KSFREIE KYID LS  L+KKV+PVGPLV+EPE
Sbjct: 184 FRCSVNDAKDKQRIEECNERSCGILFLKSFREIEAKYIDSLSTFLQKKVIPVGPLVEEPE 243

Query: 243 NDVVSRRRFEKWLNKKQDSSCLLVSFGSEFYLSKEDMEEIAYGLELSHVDFIWVVRFPVA 302
           NDVV    FEKWLN+K+  SC+LVSFGSEFYLSK DMEEIAYGLELS ++FIWVVRFP +
Sbjct: 244 NDVVLGGSFEKWLNQKERKSCILVSFGSEFYLSKFDMEEIAYGLELSRLNFIWVVRFPAS 303

Query: 303 GGGERKKNVEEELPKGFIERVRERGMVVEGWVPQAQILKHRTTGGFLSHCGWSSVMESIK 362
           GGGE KKNVEEELPKGF+ERV ERGMVVEGWVPQAQILKHR+ GGFLSHCGWSSV+ESIK
Sbjct: 304 GGGEGKKNVEEELPKGFLERVGERGMVVEGWVPQAQILKHRSIGGFLSHCGWSSVVESIK 363

Query: 363 FGVPIIAAPMQLDQPLNARLVEWLDVGVVIER-DNGRLRRQEVA---RVVKEVMVEKMGE 422
           FGVPIIAAPMQLDQPLNARLVE L VGVV+ER D GRL R+EVA   R V+EV+ E+ G+
Sbjct: 364 FGVPIIAAPMQLDQPLNARLVEHLGVGVVVERSDGGRLCRREVARAVRAVREVVAEESGK 423

Query: 423 RVRKKVKEFAEMLKKKGDEEMDMVVEELVKLCKSNKEDNLESHWCRPAIDSHFCE 474
           RVR+K KEFA+++K+KGDEEM++VVEE++KLC+  K+  L+S+WCR ++DSH CE
Sbjct: 424 RVREKAKEFAKIMKEKGDEEMEVVVEEIMKLCR-RKKKGLQSNWCRTSMDSHCCE 477
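
In these alignment blocks the middle line repeats the residue where query and subject are identical and shows `+` where the substitution scores as conservative. As a minimal sketch of how the Identity and Positives counts are built up, the Python below tallies the first 60-column row of the Benincasa hispida HSP above. The function name `tally` is illustrative, and treating `+` as "conservative" reflects the usual BLAST convention rather than anything stated on this page.

```python
# First alignment row of the XP_038885902.1 HSP (query residues 3-62).
QUERY = "GNRHGKTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVIINSIQSNLTRDLSSD"
MIDLINE = "G  + K  +LMLPWLAHGHVSPF EL+K L  RNFHI FCSTSVI++SIQS L ++LSS+"
SBJCT = "GYENEKIRILMLPWLAHGHVSPFLELSKLLATRNFHILFCSTSVILHSIQSKLPQNLSSN"


def tally(query, midline, sbjct):
    """Return (identical, conservative) column counts for one alignment row."""
    if not (len(query) == len(midline) == len(sbjct)):
        raise ValueError("alignment rows must have equal length")
    # Identical columns: same residue in both sequences, ignoring gap characters.
    identical = sum(q == s and q != "-" for q, s in zip(query, sbjct))
    # Conservative substitutions are marked '+' on the middle line.
    conservative = midline.count("+")
    return identical, conservative


identical, conservative = tally(QUERY, MIDLINE, SBJCT)
print(f"identical={identical} conservative={conservative} "
      f"positives={identical + conservative} of {len(QUERY)} columns")
```

Summing these per-row tallies over every row of an HSP should reproduce the counts on its statistics line; the Positives figure is the sum of identical and conservative columns.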

BLAST of CmoCh04G026470 vs. TAIR 10
Match: AT5G49690.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 214.5 bits (545), Expect = 1.8e-55
Identity = 151/469 (32.20%), Positives = 226/469 (48.19%), Query Frame = 0

Query: 1   MEGNRHGKTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVIINSIQSNLTRDLS 60
           M   R     V M PWLA GH+ PF  L+K L ++   I F ST   I  +   L  +L+
Sbjct: 1   MVDKREEVMHVAMFPWLAMGHLLPFLRLSKLLAQKGHKISFISTPRNIERL-PKLQSNLA 60

Query: 61  SDIELVELKLPTSSDLPPYRHTTAGLPPHLMFSLKRAFDSAAATFSIILHNLNPDLVIYD 120
           S I  V   LP  S LPP   ++  +P +   SLK AFD         L   +PD +IYD
Sbjct: 61  SSITFVSFPLPPISGLPPSSESSMDVPYNKQQSLKAAFDLLQPPLKEFLRRSSPDWIIYD 120

Query: 121 FLQPWAPTVARSSHIPAVMFQPTGALM------AAMVKYELEYPSSDLSSIFP------D 180
           +   W P++A    I    F    A        ++ +  E+     D + + P      +
Sbjct: 121 YASHWLPSIAAELGISKAFFSLFNAATLCFMGPSSSLIEEIRSTPEDFTVVPPWVPFKSN 180

Query: 181 IRLTEYEIKQVKNLFRSSVNDARDEERIKECNERSCGMILVKSFREIEGKYIDFLSILLR 240
           I    +E+ +        V    D  R     + S   + V+S  E E ++   L  L R
Sbjct: 181 IVFRYHEVTRYVEKTEEDVTGVSDSVRFGYSIDES-DAVFVRSCPEFEPEWFGLLKDLYR 240

Query: 241 KKVVPVG---PLVQEPENDVVSRRRFEKWLNKKQDSSCLLVSFGSEFYLSKEDMEEIAYG 300
           K V P+G   P++++ +    +  R +KWL+K++ +S + VS G+E  L  E++ E+A G
Sbjct: 241 KPVFPIGFLPPVIEDDDAVDTTWVRIKKWLDKQRLNSVVYVSLGTEASLRHEEVTELALG 300

Query: 301 LELSHVDFIWVVRFPVAGGGERKKNVEEELPKGFIERVRERGMVVEGWVPQAQILKHRTT 360
           LE S   F WV+R             E ++P GF  RV+ RGMV  GWVPQ +IL H + 
Sbjct: 301 LEKSETPFFWVLR------------NEPKIPDGFKTRVKGRGMVHVGWVPQVKILSHESV 360

Query: 361 GGFLSHCGWSSVMESIKFGVPIIAAPMQLDQPLNARLVEWLDVGVVIERD--NGRLRRQE 420
           GGFL+HCGW+SV+E + FG   I  P+  +Q LN RL+    +GV + RD  +G      
Sbjct: 361 GGFLTHCGWNSVVEGLGFGKVPIFFPVLNEQGLNTRLLHGKGLGVEVSRDERDGSFDSDS 420

Query: 421 VARVVKEVMVEKMGERVRKKVKEFAEMLKKKGDEEMDMVVEELVKLCKS 453
           VA  ++ VM++  GE +R K K   ++      +E    V+ELV+  +S
Sbjct: 421 VADSIRLVMIDDAGEEIRAKAKVMKDLFGNM--DENIRYVDELVRFMRS 453

BLAST of CmoCh04G026470 vs. TAIR 10
Match: AT5G65550.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 203.4 bits (516), Expect = 4.1e-52
Identity = 139/464 (29.96%), Positives = 228/464 (49.14%), Query Frame = 0

Query: 8   KTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVIINSIQSNLTRDLSSDIELVE 67
           K  V + PWLA GH+ P+ +L+K + R+   + F ST+  I+ +  N++ DLS  +  V 
Sbjct: 7   KLHVAVFPWLALGHMIPYLQLSKLIARKGHTVSFISTARNISRL-PNISSDLS--VNFVS 66

Query: 68  LKLPTSSD-LPPYRHTTAGLPPHLMFSLKRAFDSAAATFSIILHNLNPDLVIYDFLQPWA 127
           L L  + D LP     T  +P   +  LK+AFD  +  F+  L    P+ ++YD L  W 
Sbjct: 67  LPLSQTVDHLPENAEATTDVPETHIAYLKKAFDGLSEAFTEFLEASKPNWIVYDILHHWV 126

Query: 128 PTVARSSHIPAVMFQPTGALMAAMV---------KYELEYPSSDLSSIFPDIRLTE---Y 187
           P +A    +   +F    A    ++          ++    + DL    P +       Y
Sbjct: 127 PPIAEKLGVRRAIFCTFNAASIIIIGGPASVMIQGHDPRKTAEDLIVPPPWVPFETNIVY 186

Query: 188 EIKQVKNLFRSSVNDARDEERIKECN----ERSCGMILVKSFREIEGKYIDFLSILLRKK 247
            + + K +           E    C          +I+++S  E+E ++I  LS L  K 
Sbjct: 187 RLFEAKRIMEYPTAGVTGVELNDNCRLGLAYVGSEVIVIRSCMELEPEWIQLLSKLQGKP 246

Query: 248 VVPVGPLVQEPENDVVSRRRF---EKWLNKKQDSSCLLVSFGSEFYLSKEDMEEIAYGLE 307
           V+P+G L   P +D      +    +WL++ Q  S + V+ G+E  +S E+++ +A+GLE
Sbjct: 247 VIPIGLLPATPMDDADDEGTWLDIREWLDRHQAKSVVYVALGTEVTISNEEIQGLAHGLE 306

Query: 308 LSHVDFIWVVRFPVAGGGERKKNVEEELPKGFIERVRERGMVVEGWVPQAQILKHRTTGG 367
           L  + F W +R        ++      LP GF ERV+ERG++   WVPQ +IL H + GG
Sbjct: 307 LCRLPFFWTLR--------KRTRASMLLPDGFKERVKERGVIWTEWVPQTKILSHGSVGG 366

Query: 368 FLSHCGWSSVMESIKFGVPIIAAPMQLDQPLNARLVEWLDVGVVIERD--NGRLRRQEVA 427
           F++HCGW S +E + FGVP+I  P  LDQPL ARL+  +++G+ I R+  +G      VA
Sbjct: 367 FVTHCGWGSAVEGLSFGVPLIMFPCNLDQPLVARLLSGMNIGLEIPRNERDGLFTSASVA 426

Query: 428 RVVKEVMVEKMGERVRKKVKEFAEML---KKKGDEEMDMVVEEL 447
             ++ V+VE+ G+  R       + +   K+  D+  D  +E L
Sbjct: 427 ETIRHVVVEEEGKIYRNNAASQQKKIFGNKRLQDQYADGFIEFL 459

BLAST of CmoCh04G026470 vs. TAIR 10
Match: AT2G22590.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 203.0 bits (515), Expect = 5.4e-52
Identity = 151/463 (32.61%), Positives = 233/463 (50.32%), Query Frame = 0

Query: 8   KTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVIINSIQSNLTRDLSSDIELVE 67
           K  V+M PWLA GH+ P+ EL+K + ++   + F ST   I+ +   L  +LSS I  V+
Sbjct: 13  KLHVVMFPWLAFGHMVPYLELSKLIAQKGHKVSFISTPRNIDRLLPRLPENLSSVINFVK 72

Query: 68  LKLPTSSD-LPPYRHTTAGLPPHLMFSLKRAFDSAAATFSIILHNLNPDLVIYDFLQPWA 127
           L LP   + LP     T  +P  L+  LK A+D      +  L +  PD V+ DF   W 
Sbjct: 73  LSLPVGDNKLPEDGEATTDVPFELIPYLKIAYDGLKVPVTEFLESSKPDWVLQDFAGFWL 132

Query: 128 PTVARSSHIPAVMFQPTGALMAAMVK---YELEYPSSDLSSIFP--------DIRLTEYE 187
           P ++R   I    F         ++K   +E EY +S    + P         +    +E
Sbjct: 133 PPISRRLGIKTGFFSAFNGATLGILKPPGFE-EYRTSPADFMKPPKWVPFETSVAFKLFE 192

Query: 188 IKQVKNLFRSSVNDAR--DEERIKECNERSCGMILVKSFREIEGKYIDFLSILLRKKVVP 247
            + +   F +   +    D  R+    +  C +I V+S  E E +++     L RK V+P
Sbjct: 193 CRFIFKGFMAETTEGNVPDIHRVGGVID-GCDVIFVRSCYEYEAEWLGLTQELHRKPVIP 252

Query: 248 VGPLVQEPE---NDVVSRRRFEKWLNKKQDSSCLLVSFGSEFYLSKEDMEEIAYGLELSH 307
           VG L  +P+    D  +    +KWL+ ++  S + V+FGSE   S+ ++ EIA GLELS 
Sbjct: 253 VGVLPPKPDEKFEDTDTWLSVKKWLDSRKSKSIVYVAFGSEAKPSQTELNEIALGLELSG 312

Query: 308 VDFIWVVRFPVAGGGERKKNVEEELPKGFIERVRERGMVVEGWVPQAQILKHRTTGGFLS 367
           + F WV++     G    + V  ELP+GF ER  +RGMV  GWV Q + L H + G  L+
Sbjct: 313 LPFFWVLK--TRRGPWDTEPV--ELPEGFEERTADRGMVWRGWVEQLRTLSHDSIGLVLT 372

Query: 368 HCGWSSVMESIKFGVPIIAAPMQLDQPLNARLVEWLDVGVVIERD--NGRLRRQEVARVV 427
           H GW +++E+I+F  P+       DQ LNAR++E   +G +I RD   G   ++ VA  +
Sbjct: 373 HPGWGTIIEAIRFAKPMAMLVFVYDQGLNARVIEEKKIGYMIPRDETEGFFTKESVANSL 432

Query: 428 KEVMVEKMGERVRKKVKE----FAEMLKKKGDEEMDMVVEELV 448
           + VMVE+ G+  R+ VKE    F +M   + D  +D  +E LV
Sbjct: 433 RLVMVEEEGKVYRENVKEMKGVFGDM--DRQDRYVDSFLEYLV 467

BLAST of CmoCh04G026470 vs. TAIR 10
Match: AT5G54060.1 (UDP-glucose:flavonoid 3-o-glucosyltransferase)

HSP 1 Score: 183.0 bits (463), Expect = 5.8e-46
Identity = 147/481 (30.56%), Positives = 224/481 (46.57%), Query Frame = 0

Query: 4   NRHGKTSVLMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVIINSIQS-NLTRDLSSD 63
           N     S++M PWLA GH++PF  L+  L  +   I F      +N ++  NL  +L   
Sbjct: 7   NESSSMSIVMYPWLAFGHMTPFLHLSNKLAEKGHKIVFLLPKKALNQLEPLNLYPNL--- 66

Query: 64  IELVELKLPTSSDLPPYRHTTAGLPPHLMFSLKRAFDSAAATFSIILHNLNPDLVIYDFL 123
           I    + +P    LPP   T + +P  L   L  A D        I   + PDLV YD  
Sbjct: 67  ITFHTISIPQVKGLPPGAETNSDVPFFLTHLLAVAMDQTRPEVETIFRTIKPDLVFYDSA 126

Query: 124 QPWAPTVARSSHIPAVMFQPTGALMAA---------------------MVKYELEYPSSD 183
             W P +A+      V F    A   A                     + K  L YPSS 
Sbjct: 127 H-WIPEIAKPIGAKTVCFNIVSAASIALSLVPSAEREVIDGKEMSGEELAKTPLGYPSS- 186

Query: 184 LSSIFPDIRLTEYEIKQVKNLFR--SSVNDARDEERIKECNERSCGMILVKSFREIEGKY 243
                  + L  +E K +  ++R   ++    D    K    R+C  I +++ RE EGK+
Sbjct: 187 ------KVVLRPHEAKSLSFVWRKHEAIGSFFDG---KVTAMRNCDAIAIRTCRETEGKF 246

Query: 244 IDFLSILLRKKVVPVGPLVQEPE-NDVVSRRRFEKWLNKKQDSSCLLVSFGSEFYLSKED 303
            D++S    K V   GP++   + N      ++ +WL K    S +  +FGS+  ++K D
Sbjct: 247 CDYISRQYSKPVYLTGPVLPGSQPNQPSLDPQWAEWLAKFNHGSVVFCAFGSQPVVNKID 306

Query: 304 -MEEIAYGLELSHVDFIWVVRFPVAGGGERKKNVEEELPKGFIERVRERGMVVEGWVPQA 363
             +E+  GLE +   F+  ++ P          VEE LP+GF ERV+ RG+V  GW+ Q 
Sbjct: 307 QFQELCLGLESTGFPFLVAIKPP-----SGVSTVEEALPEGFKERVQGRGVVFGGWIQQP 366

Query: 364 QILKHRTTGGFLSHCGWSSVMESIKFGVPIIAAPMQLDQPLNARLV-EWLDVGVVIERD- 423
            +L H + G F+SHCG+ S+ ES+     I+  P   +Q LNARL+ E ++V V +ER+ 
Sbjct: 367 LVLNHPSVGCFVSHCGFGSMWESLMSDCQIVLVPQHGEQILNARLMTEEMEVAVEVEREK 426

Query: 424 NGRLRRQEVARVVKEVMVE--KMGERVRKKVKEFAEMLKKKG--DEEMDMVVEELVKLCK 453
            G   RQ +   VK VM E  ++GE+VRK   ++  +L   G  D  +D   + L++L K
Sbjct: 427 KGWFSRQSLENAVKSVMEEGSEIGEKVRKNHDKWRCVLTDSGFSDGYIDKFEQNLIELVK 468

BLAST of CmoCh04G026470 vs. TAIR 10
Match: AT5G14860.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 181.4 bits (459), Expect = 1.7e-45
Identity = 162/497 (32.60%), Positives = 247/497 (49.70%), Query Frame = 0

Query: 12  LMLPWLAHGHVSPFFELAKSLRRRNFHIYFCSTSVIIN----------SIQSNLTRDLSS 71
           ++ P+++ GH  P  + A+ L R    +        I+             SN   D++S
Sbjct: 10  VLFPYMSKGHTIPLLQFARLLLRHRRIVSVDDEEPTISVTVFTTPKNQPFVSNFLSDVAS 69

Query: 72  DIELVELKLPTS-SDLPPYRHTTAGLPP-HLMFSLKRAFDSAAATFSIILHNLNP-DLVI 131
            I+++ L  P + + +PP   +T  LP   L     RA  S    F   L NL     ++
Sbjct: 70  SIKVISLPFPENIAGIPPGVESTDMLPSISLYVPFTRATKSLQPFFEAELKNLEKVSFMV 129

Query: 132 YDFLQPWAPTVARSSHIPAVMF----QPTGALMAAMVKYEL----EYPSSDLSSI----F 191
            D    W    A    IP + F        A+ +A+  +EL    E   SD   +    F
Sbjct: 130 SDGFLWWTSESAAKFEIPRLAFYGMNSYASAMCSAISVHELFTKPESVKSDTEPVTVPDF 189

Query: 192 PDIRLTEYEIKQVKNLFRSSVNDARDEERIKE--CNERSCGMILVKSFREIEGKYIDFLS 251
           P I + + E   V  L     +D   E  I      ++S G ++V SF E+E  ++D+  
Sbjct: 190 PWICVKKCEFDPV--LTEPDQSDPAFELLIDHLMSTKKSRG-VIVNSFYELESTFVDY-- 249

Query: 252 ILLRKKVVP----VGPLV----QEPENDVVSRRRFEKWLNKKQDSSC--LLVSFGSEFYL 311
             LR    P    VGPL      +PE+D   +  +  WL++K +  C  + V+FG++  +
Sbjct: 250 -RLRDNDEPKPWCVGPLCLVNPPKPESD---KPDWIHWLDRKLEERCPVMYVAFGTQAEI 309

Query: 312 SKEDMEEIAYGLELSHVDFIWVVRFPVAGGGERKKNVEEEL-PKGFIERVRERGMVVEGW 371
           S E ++EIA GLE S V+F+WV R          K++EE     GF +RV+E GM+V  W
Sbjct: 310 SNEQLKEIALGLEDSKVNFLWVTR----------KDLEEVTGGLGFEKRVKEHGMIVRDW 369

Query: 372 VPQAQILKHRTTGGFLSHCGWSSVMESIKFGVPIIAAPMQLDQPLNARL-VEWLDVGVVI 431
           V Q +IL H++  GFLSHCGW+S  ESI  GVP++A PM  +QPLNA+L VE L +GV I
Sbjct: 370 VDQWEILSHKSVKGFLSHCGWNSAQESICAGVPLLAWPMMAEQPLNAKLVVEELKIGVRI 429

Query: 432 ERDN----GRLRRQEVARVVKEVMVEKMGERVRKKVKEFAEMLKK-------KGDEEMDM 459
           E ++    G + R+E++R VK++M  +MG+   K VKE+A+M KK          + +D 
Sbjct: 430 ETEDVSVKGFVTREELSRKVKQLMEGEMGKTTMKNVKEYAKMAKKAMAQGTGSSWKSLDS 484

The following BLAST results are available for this feature:
Match Name  E-value   Identity  Description
A0A0A6ZFY4  1.2e-117  49.12     UDP-glucosyltransferase 29 OS=Panax ginseng OX=4054 GN=UGT29 PE=1 SV=1
Q5NTH0      5.4e-105  46.98     Cyanidin-3-O-glucoside 2-O-glucuronosyltransferase OS=Bellis perennis OX=41492 G...
F8WKW8      5.4e-105  47.31     Beta-D-glucosyl crocetin beta-1,6-glucosyltransferase OS=Gardenia jasminoides OX...
Q8GVE3      2.5e-102  43.07     Flavanone 7-O-glucoside 2''-O-beta-L-rhamnosyltransferase OS=Citrus maxima OX=37...
D4Q9Z5      2.5e-54   32.52     Soyasaponin III rhamnosyltransferase OS=Glycine max OX=3847 GN=GmSGT3 PE=1 SV=1
Match Name  E-value   Identity  Description
A0A6J1H6Y0  2.4e-273  100.00    Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111460675 PE=3 SV=1
A0A6J1JJU7  4.7e-261  95.16     Glycosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111487574 PE=3 SV=1
A0A1S3C496  6.4e-186  69.98     Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103496497 PE=3 SV=1
A0A0A0KE59  5.1e-183  70.95     Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_6G045050 PE=3 SV=1
A0A6J1BWM7  3.5e-176  70.67     Glycosyltransferase OS=Momordica charantia OX=3673 GN=LOC111006190 PE=3 SV=1
Match Name      E-value   Identity  Description
XP_022959665.1  4.9e-273  100.00    cyanidin-3-O-glucoside 2-O-glucuronosyltransferase-like, partial [Cucurbita mosc...
KAG7032946.1    6.0e-263  94.30     Beta-D-glucosyl crocetin beta-1,6-glucosyltransferase, partial [Cucurbita argyro...
XP_023543159.1  3.3e-261  95.37     cyanidin-3-O-glucoside 2-O-glucuronosyltransferase-like, partial [Cucurbita pepo...
XP_022990792.1  9.6e-261  95.16     cyanidin-3-O-glucoside 2-O-glucuronosyltransferase-like [Cucurbita maxima]
XP_038885902.1  3.5e-194  72.63     UDP-glucosyltransferase 29-like [Benincasa hispida]
Match Name   E-value  Identity  Description
AT5G49690.1  1.8e-55  32.20     UDP-Glycosyltransferase superfamily protein
AT5G65550.1  4.1e-52  29.96     UDP-Glycosyltransferase superfamily protein
AT2G22590.1  5.4e-52  32.61     UDP-Glycosyltransferase superfamily protein
AT5G54060.1  5.8e-46  30.56     UDP-glucose:flavonoid 3-o-glucosyltransferase
AT5G14860.1  1.7e-45  32.60     UDP-Glycosyltransferase superfamily protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                                 Source       Source Term     Source Description                              Alignment
None       No IPR available                                GENE3D       3.40.50.2000    Glycogen Phosphorylase B;                       coord: 11..235, e-value: 1.1E-102, score: 346.4
None       No IPR available                                GENE3D       3.40.50.2000    Glycogen Phosphorylase B;                       coord: 249..434, e-value: 1.1E-102, score: 346.4
None       No IPR available                                PANTHER      PTHR48044:SF29  GLYCOSYLTRANSFERASE                             coord: 6..450
None       No IPR available                                PANTHER      PTHR48044       GLYCOSYLTRANSFERASE                             coord: 6..450
None       No IPR available                                SUPERFAMILY  53756           UDP-Glycosyltransferase/glycogen phosphorylase  coord: 11..440
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase        PFAM         PF00201         UDPGT                                           coord: 250..415, e-value: 8.5E-17, score: 61.1
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase        CDD          cd03784         GT1_Gtf-like                                    coord: 11..430, e-value: 1.61566E-71, score: 230.516
IPR035595  UDP-glycosyltransferase family, conserved site  PROSITE      PS00375         UDPGT                                           coord: 333..376
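
The `coord` ranges in the InterPro table can be compared with simple interval arithmetic. The sketch below is a hedged illustration in Python: it assumes the coordinates are 1-based, inclusive residue positions on the predicted protein (the page does not state this explicitly), and the dictionary keys and helper names are made up for the example.

```python
# Coordinates copied from the InterPro table above.
# Assumption: 1-based, inclusive residue positions on the protein.
FEATURES = {
    "GENE3D 3.40.50.2000 (first hit)": (11, 235),
    "GENE3D 3.40.50.2000 (second hit)": (249, 434),
    "PFAM PF00201": (250, 415),
    "PROSITE PS00375": (333, 376),
}


def span_length(span):
    """Number of residues covered by an inclusive (start, end) span."""
    start, end = span
    return end - start + 1


def contains(outer, inner):
    """True if the inner span lies entirely within the outer span."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


for name, span in FEATURES.items():
    print(f"{name}: {span[0]}..{span[1]} covers {span_length(span)} residues")

site_in_pfam = contains(FEATURES["PFAM PF00201"], FEATURES["PROSITE PS00375"])
print("PROSITE site inside PFAM domain:", site_in_pfam)
```

Under that assumption, the PROSITE conserved site (333..376) falls inside the PFAM UDPGT match (250..415), which in turn sits within the second GENE3D hit.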

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh04G026470.1  CmoCh04G026470.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0008194      UDP-glycosyltransferase activity