CmaCh00G002850 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh00G002850
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Glycosyltransferase
Location: Cma_Chr00: 22569709 .. 22573524 (-)
RNA-Seq Expression: CmaCh00G002850
Synteny: CmaCh00G002850
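
The Location field can be turned into a span length with a small amount of arithmetic. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which this page does not state explicitly.

```python
import re


def parse_location(text):
    """Split a location string such as 'Cma_Chr00: 22569709 .. 22573524 (-)'
    into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


seqid, start, end, strand = parse_location("Cma_Chr00: 22569709 .. 22573524 (-)")
print(seqid, strand, span_length(start, end))  # Cma_Chr00 - 3816
```

Under that assumption the annotated gene spans 3816 positions on the minus strand of Cma_Chr00.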
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAGCTCTGCACAAACCTTCCCAATTCCACATTGTTATGGTGCCAAGTCCAGGAATTGGCCATCTAATTCCCCTCCTCCAGTTCGCCAAACGCCTTGTCTCTCTTCCCGGCTTATCCGTCACCGTCGCCATTCCCTCCGACGCCCCTCCGACCAAACCCCAAAAAGCTCTCTTCACCAACCTCCCTTCCACCATCCAACCTCTCTTCCTCCCGCTCGTCTCCTTCCACGATCTCCCCAAACACACCAAAATCGAGACCATCATCGCTCTCTCTGTAACTCGCTCTCTTCCTTTCCTTCGCCACCTCTTCCAATCCCTCATCGGAAAAACCCATCTTGCTGCCCTCATCGTCGACCACTTCAGTACTGACGCCTTCGATGTCGCCATCGAATTCAACGTCCCTCGCTACCTTTTCTTCCCTCCCTCCGCCATGTCCCTTTCCTTCGCCTTTCAATTGCCCAGCCTCGACCAAATCGTCGCCGGCGAGTTCAGGGAGCATCCCGAGCTGATTCCGATTCCTGGGTGTATTCCGATTCATGGGAAAGATCTCTTGGAACCGGCTCAAGATAGGGAGGATGATGCGTACAAGCTATTACTCCATAACTGTAAGAGGTATAGATTGGCGGATGCTGTTTTTGTTAACAGCTTCCCTGAATTGGAGCCGGAAGCTATGAAAGCTCTGCTAGTGGAGGAAGCGGGGAAGCCCCCGGTTTATCCAGTGGGCCCGCTGGTGAGAAACGATTGCAGTGAAAACGGAAAGAGAGCGGAGTGTTTGAAATGGCTTGATGAGCAACCAAATGGGTCGGTTCTGTTTGTGTCGTTTGGGAGCGGTGGGACTCTGTCGAGTGCTCAAACCAACGAATTGGCGTTGGGATTGGAAATGAGCGAGCAGAGATTTCTATGGGTCGTAAGAAAGCCAAACGACGAGGCGGCTAACGCAACGTTTTTTGGCGACGAGAAGGAGAAGGAGAACGAAGCGTGGAGATTCTTGCCGGAGGGGTTTATGGAGAGGACTAAAAACAGGGGAATGGTGGTGTCATCGTGGGCGCCACAGGTTGAGGTGCTGAGGCATGAGTCCACCGGGGGGTTCTTGAGCCACTGCGGGTGGAACTCGACGCTGGAAGGCGTGGTGAATGGGGTTCCTCTGATTGCTTGGCCGCTGTATGCAGAACAGAGGATGAACGCCCATATGGTGACAGAGGACATCAAAGTTGGTTTGAGGCCGAAGAAGAAGGAGGGAAGTGGGATTGTGGAGAAGGAGGAGATTGCAGAAGTGGTGAAGTCGTTAATGGAAGGCGAAGAGGGGAAAAGGATTCGAGAGAAAATGAAGAATCTGAAAAATGCAGCGGAAAGAGGTGCGGGGGAAGATGGGTATTCATCCAAAGCAGTGTGTGAAATGGGTATGAAGTGGAAGAAGACGATGATGATGATCAGCAGTTCCCAGGAAGGGTATGAATTGGAAGAACATCAAGATCTGCAATAATAACAAGCTTTTGAATCTTGAATTGCGTCTGGGGGCATTGAATTTGTTCTCTTGAATCACAACTCCCTCACTTTCTACAATTTGAATAAATCAATAATGTGATGATAACAACAGAATTCTGTGTAATAAAATTATCTTTTCATTTTTTGAATTTTTCAAATCTTTTATTCGTTCTATAACGCTCTAATTTTAAAGATAATGTATTTTATAATCAAATTGAAAGGTATAAAATTGCACATCTAAATCGATTGACCATATTTCAATATTTATTAGCGGTGGATTTTGATGATTACAAATGGTATTAGAGTTAGACATCTGATCGTGTGCCAGTGAGGACGCTCAAGGCCTACAAGGGAGTGGATTGTGAGATTTTATATCAATCGTAGACGGAAACGAAATTTTCTTATAAGGGTGTGTAAATCTCTTCTTAGTAAAAGTGATTTTAAAACTATAAGGCTGACGACGAGTCTGTTCGCAGTGGGTTTCGACCACTACAAATGGTATCAGAGCCAGAC
ACCGGGCGATATGCCAACGAGAACACTAAACTCCAAAAGTGGTAAATTGTGAGATCCCACGTCGGTTGAAAAGAGGAGTGAAATATTTCTTATAAAGTGTATAAATATCTCGTTAATGGATGCATATTAAACTTGGAAGGGTACTAAAGAAGCTTGCTGTTACACATTCATTCATTATGATTTCATCTAAAAGAAAGTTTTTTTTTTCTTTTCTTTTCTTTTTAAGCTTAAACTTATCCATTGATAATTGATATTAATTATAAAGAATTTATTCTTTTAATAAATAGGTGCATTTNAACACATCGTACCGTAATTATTATCATACAATTCACATATTAGTAACATATACATAACATATAAGAAACATATTGATCCACAGACCATTCACACATATCAATATATATGATACGTGCAGAACCATGAGACACGTGTCAACATCAAATATAACCATGCTGATAGTAAAGTCACTTATCTGATTAGCTTAGGTTGGTGTCACCTATTTATCCTTAACAAAGTTAAGTCCTTTGCATACAAGCGAACCTATATGAACCATGCCATCAATTTCAATTTCAATTTTCTAAATTTACAATTGTCCATGTGTTGGACTTCATGCGAGCATTAGAGACTTGCGAGGAAAGTCTTGAAATGTTATGGGTATATATAATGTTATTTTCACTACCTATCTTATGTTCTTTTTTGACTGTTGCAGAGAATGTATTGTTACAGACAAAATTTAATGGAGTTTTTTTTGCCGATATTTAAAGTTTTATTTTTAAATTCCGTTCACCAAATCTTTCCTCAATTATTTACCTACTCAACTCTGTAATTATTTCCGTTCTCTAAACAGATATTATTCTTCCTTTGTATTTATTTTTAAACAGATATTATTAGCAATTTTGTTAATGATTCTTACTTCCAAAATTTCTCTCTTTCTTTCAATTTAGTAAATCGGAGTGGGAAGAATTGTCAATTACAATAAGAGATAGTAGGGTAGAAGGATTCAAATCTTTTATTCATTCTATAACGCTCTGATTTTAAAGATAATGTATTTTATAATCAAATTAAAAGTATAAAATTGCACATCTAAATCCATATAATCATTAGCGGTGGATTTTGATTGTTACAAATGGTATTAGAGTTAGACATCTGACCGTGTGCCAGTGAGGACGCTCAAGGCCTACAAGAGAGTGGATTGTGAGATTTTATATCGATCGTAGACGGAAACGAAATTTTCTTATAAGGGTGTGTAAATCTCTTCTTAGTAAAAGTGATTTTAAAACTGTAAGGCTGACGACGAGTCTGTTCGCAGTGGGTTTCAACCACTACAAATGGTATCAGAGCCAGACACCGGGCGATATGCCAACGATAACACTAAGCCCCAAAAGTGGTAGATTGTGAGATCCCACGTCGGTTGAAAAGAGGAGTGAAATATTTCTTATAAAGTGTATAAATATCTGGTTAATGGATGCATATTAAACTTCGGAAGGGTACTAAAGAAGCTTGCTGTTACACATTCATTCATTGTGATTTCATCTAAAAGAGAGTTTTTTTTTTCTTTTCTTTTCTTTTTTAGCTTAAGTAGGAGTTCTAAACTTATCCATTGATAATTGATATTAATTATAAAGAATTTATTCTTTTAATAAATAGGTGCATTTNTGTATAAATATCTCGTTAATGGATGCATATTAAACTTGGAAGGGTACTAAAGAAGCTTGCTGTTACACATTCATTCATTATGATTTCATCTAAAAGAAAGTTTTTTTTTTCTTTTCTTTTCTTTTTTAGCTTAAGTAGGAGTTCTAAACTTATCCATTGA

mRNA sequence

ATGGAAGCTCTGCACAAACCTTCCCAATTCCACATTGTTATGGTGCCAAGTCCAGGAATTGGCCATCTAATTCCCCTCCTCCAGTTCGCCAAACGCCTTGTCTCTCTTCCCGGCTTATCCGTCACCGTCGCCATTCCCTCCGACGCCCCTCCGACCAAACCCCAAAAAGCTCTCTTCACCAACCTCCCTTCCACCATCCAACCTCTCTTCCTCCCGCTCGTCTCCTTCCACGATCTCCCCAAACACACCAAAATCGAGACCATCATCGCTCTCTCTGTAACTCGCTCTCTTCCTTTCCTTCGCCACCTCTTCCAATCCCTCATCGGAAAAACCCATCTTGCTGCCCTCATCGTCGACCACTTCAGTACTGACGCCTTCGATGTCGCCATCGAATTCAACGTCCCTCGCTACCTTTTCTTCCCTCCCTCCGCCATGTCCCTTTCCTTCGCCTTTCAATTGCCCAGCCTCGACCAAATCGTCGCCGGCGAGTTCAGGGAGCATCCCGAGCTGATTCCGATTCCTGGGTGTATTCCGATTCATGGGAAAGATCTCTTGGAACCGGCTCAAGATAGGGAGGATGATGCGTACAAGCTATTACTCCATAACTGTAAGAGGTATAGATTGGCGGATGCTGTTTTTGTTAACAGCTTCCCTGAATTGGAGCCGGAAGCTATGAAAGCTCTGCTAGTGGAGGAAGCGGGGAAGCCCCCGGTTTATCCAGTGGGCCCGCTGGTGAGAAACGATTGCAGTGAAAACGGAAAGAGAGCGGAGTGTTTGAAATGGCTTGATGAGCAACCAAATGGGTCGGTTCTGTTTGTGTCGTTTGGGAGCGGTGGGACTCTGTCGAGTGCTCAAACCAACGAATTGGCGTTGGGATTGGAAATGAGCGAGCAGAGATTTCTATGGGTCGTAAGAAAGCCAAACGACGAGGCGGCTAACGCAACGTTTTTTGGCGACGAGAAGGAGAAGGAGAACGAAGCGTGGAGATTCTTGCCGGAGGGGTTTATGGAGAGGACTAAAAACAGGGGAATGGTGGTGTCATCGTGGGCGCCACAGGTTGAGGTGCTGAGGCATGAGTCCACCGGGGGGTTCTTGAGCCACTGCGGGTGGAACTCGACGCTGGAAGGCGTGGTGAATGGGGTTCCTCTGATTGCTTGGCCGCTGTATGCAGAACAGAGGATGAACGCCCATATGGTGACAGAGGACATCAAAGTTGGTTTGAGGCCGAAGAAGAAGGAGGGAAGTGGGATTGTGGAGAAGGAGGAGATTGCAGAAGTGGTGAAGTCGTTAATGGAAGGCGAAGAGGGGAAAAGGATTCGAGAGAAAATGAAGAATCTGAAAAATGCAGCGGAAAGAGGTGCGGGGGAAGATGGGTATTCATCCAAAGCAGTGTGTGAAATGGGTATGAAGTGGAAGAAGACGATGATGATGATCAGCAGTTCCCAGGAAGGCTTAAGTAGGAGTTCTAAACTTATCCATTGA

Coding sequence (CDS)

ATGGAAGCTCTGCACAAACCTTCCCAATTCCACATTGTTATGGTGCCAAGTCCAGGAATTGGCCATCTAATTCCCCTCCTCCAGTTCGCCAAACGCCTTGTCTCTCTTCCCGGCTTATCCGTCACCGTCGCCATTCCCTCCGACGCCCCTCCGACCAAACCCCAAAAAGCTCTCTTCACCAACCTCCCTTCCACCATCCAACCTCTCTTCCTCCCGCTCGTCTCCTTCCACGATCTCCCCAAACACACCAAAATCGAGACCATCATCGCTCTCTCTGTAACTCGCTCTCTTCCTTTCCTTCGCCACCTCTTCCAATCCCTCATCGGAAAAACCCATCTTGCTGCCCTCATCGTCGACCACTTCAGTACTGACGCCTTCGATGTCGCCATCGAATTCAACGTCCCTCGCTACCTTTTCTTCCCTCCCTCCGCCATGTCCCTTTCCTTCGCCTTTCAATTGCCCAGCCTCGACCAAATCGTCGCCGGCGAGTTCAGGGAGCATCCCGAGCTGATTCCGATTCCTGGGTGTATTCCGATTCATGGGAAAGATCTCTTGGAACCGGCTCAAGATAGGGAGGATGATGCGTACAAGCTATTACTCCATAACTGTAAGAGGTATAGATTGGCGGATGCTGTTTTTGTTAACAGCTTCCCTGAATTGGAGCCGGAAGCTATGAAAGCTCTGCTAGTGGAGGAAGCGGGGAAGCCCCCGGTTTATCCAGTGGGCCCGCTGGTGAGAAACGATTGCAGTGAAAACGGAAAGAGAGCGGAGTGTTTGAAATGGCTTGATGAGCAACCAAATGGGTCGGTTCTGTTTGTGTCGTTTGGGAGCGGTGGGACTCTGTCGAGTGCTCAAACCAACGAATTGGCGTTGGGATTGGAAATGAGCGAGCAGAGATTTCTATGGGTCGTAAGAAAGCCAAACGACGAGGCGGCTAACGCAACGTTTTTTGGCGACGAGAAGGAGAAGGAGAACGAAGCGTGGAGATTCTTGCCGGAGGGGTTTATGGAGAGGACTAAAAACAGGGGAATGGTGGTGTCATCGTGGGCGCCACAGGTTGAGGTGCTGAGGCATGAGTCCACCGGGGGGTTCTTGAGCCACTGCGGGTGGAACTCGACGCTGGAAGGCGTGGTGAATGGGGTTCCTCTGATTGCTTGGCCGCTGTATGCAGAACAGAGGATGAACGCCCATATGGTGACAGAGGACATCAAAGTTGGTTTGAGGCCGAAGAAGAAGGAGGGAAGTGGGATTGTGGAGAAGGAGGAGATTGCAGAAGTGGTGAAGTCGTTAATGGAAGGCGAAGAGGGGAAAAGGATTCGAGAGAAAATGAAGAATCTGAAAAATGCAGCGGAAAGAGGTGCGGGGGAAGATGGGTATTCATCCAAAGCAGTGTGTGAAATGGGTATGAAGTGGAAGAAGACGATGATGATGATCAGCAGTTCCCAGGAAGGCTTAAGTAGGAGTTCTAAACTTATCCATTGA

Protein sequence

MEALHKPSQFHIVMVPSPGIGHLIPLLQFAKRLVSLPGLSVTVAIPSDAPPTKPQKALFTNLPSTIQPLFLPLVSFHDLPKHTKIETIIALSVTRSLPFLRHLFQSLIGKTHLAALIVDHFSTDAFDVAIEFNVPRYLFFPPSAMSLSFAFQLPSLDQIVAGEFREHPELIPIPGCIPIHGKDLLEPAQDREDDAYKLLLHNCKRYRLADAVFVNSFPELEPEAMKALLVEEAGKPPVYPVGPLVRNDCSENGKRAECLKWLDEQPNGSVLFVSFGSGGTLSSAQTNELALGLEMSEQRFLWVVRKPNDEAANATFFGDEKEKENEAWRFLPEGFMERTKNRGMVVSSWAPQVEVLRHESTGGFLSHCGWNSTLEGVVNGVPLIAWPLYAEQRMNAHMVTEDIKVGLRPKKKEGSGIVEKEEIAEVVKSLMEGEEGKRIREKMKNLKNAAERGAGEDGYSSKAVCEMGMKWKKTMMMISSSQEGLSRSSKLIH
Homology
BLAST of CmaCh00G002850 vs. ExPASy Swiss-Prot
Match: Q9AR73 (Hydroquinone glucosyltransferase OS=Rauvolfia serpentina OX=4060 GN=AS PE=1 SV=1)

HSP 1 Score: 568.2 bits (1463), Expect = 9.3e-161
Identity = 283/462 (61.26%), Positives = 348/462 (75.32%), Query Frame = 0

Query: 11  HIVMVPSPGIGHLIPLLQFAKRLVSLPGLSVTVAIPSDAPPTKPQKALFTNLPSTIQPLF 70
           HI MVP+PG+GHLIPL++FAKRLV      VT  IP+D P  K QK+    LP+ +  + 
Sbjct: 6   HIAMVPTPGMGHLIPLVEFAKRLVLRHNFGVTFIIPTDGPLPKAQKSFLDALPAGVNYVL 65

Query: 71  LPLVSFHDLPKHTKIETIIALSVTRSLPFLRHLFQSLIGKTHLAALIVDHFSTDAFDVAI 130
           LP VSF DLP   +IET I L++TRSLPF+R   ++L+  T LAAL+VD F TDAFDVAI
Sbjct: 66  LPPVSFDDLPADVRIETRICLTITRSLPFVRDAVKTLLATTKLAALVVDLFGTDAFDVAI 125

Query: 131 EFNVPRYLFFPPSAMSLSFAFQLPSLDQIVAGEFREHPELIPIPGCIPIHGKDLLEPAQD 190
           EF V  Y+F+P +AM LS  F LP LDQ+V+ E+R+ PE + IPGCIPIHGKD L+PAQD
Sbjct: 126 EFKVSPYIFYPTTAMCLSLFFHLPKLDQMVSCEYRDVPEPLQIPGCIPIHGKDFLDPAQD 185

Query: 191 REDDAYKLLLHNCKRYRLADAVFVNSFPELEPEAMKALLVEEAGKPPVYPVGPLVRNDCS 250
           R++DAYK LLH  KRYRLA+ + VN+F +LEP  +KAL  E+ GKPPVYP+GPL+R D S
Sbjct: 186 RKNDAYKCLLHQAKRYRLAEGIMVNTFNDLEPGPLKALQEEDQGKPPVYPIGPLIRADSS 245

Query: 251 ENGKRAECLKWLDEQPNGSVLFVSFGSGGTLSSAQTNELALGLEMSEQRFLWVVRKPNDE 310
                 ECLKWLD+QP GSVLF+SFGSGG +S  Q  ELALGLEMSEQRFLWVVR PND+
Sbjct: 246 SKVDDCECLKWLDDQPRGSVLFISFGSGGAVSHNQFIELALGLEMSEQRFLWVVRSPNDK 305

Query: 311 AANATFFGDEKEKENEAWRFLPEGFMERTKNRGMVVSSWAPQVEVLRHESTGGFLSHCGW 370
            ANAT+F    + +N+A  +LPEGF+ERTK R ++V SWAPQ E+L H STGGFL+HCGW
Sbjct: 306 IANATYF--SIQNQNDALAYLPEGFLERTKGRCLLVPSWAPQTEILSHGSTGGFLTHCGW 365

Query: 371 NSTLEGVVNGVPLIAWPLYAEQRMNAHMVTEDIKVGLRPKKKEGSGIVEKEEIAEVVKSL 430
           NS LE VVNGVPLIAWPLYAEQ+MNA M+TE +KV LRPK  E +G++ + EIA  VK L
Sbjct: 366 NSILESVVNGVPLIAWPLYAEQKMNAVMLTEGLKVALRPKAGE-NGLIGRVEIANAVKGL 425

Query: 431 MEGEEGKRIREKMKNLKNAAERGAGEDGYSSKAVCEMGMKWK 473
           MEGEEGK+ R  MK+LK+AA R   +DG S+KA+ E+  KW+
Sbjct: 426 MEGEEGKKFRSTMKDLKDAASRALSDDGSSTKALAELACKWE 464

BLAST of CmaCh00G002850 vs. ExPASy Swiss-Prot
Match: Q9M156 (UDP-glycosyltransferase 72B1 OS=Arabidopsis thaliana OX=3702 GN=UGT72B1 PE=1 SV=1)

HSP 1 Score: 521.2 bits (1341), Expect = 1.3e-146
Identity = 266/465 (57.20%), Positives = 341/465 (73.33%), Query Frame = 0

Query: 11  HIVMVPSPGIGHLIPLLQFAKRLVSLPGLSVTVAIPSDAPPTKPQKALFTNLPSTIQPLF 70
           H+ ++PSPG+GHLIPL++FAKRLV L GL+VT  I  + PP+K Q+ +  +LPS+I  +F
Sbjct: 8   HVAIIPSPGMGHLIPLVEFAKRLVHLHGLTVTFVIAGEGPPSKAQRTVLDSLPSSISSVF 67

Query: 71  LPLVSFHDLPKHTKIETIIALSVTRSLPFLRHLFQSLIGKTHL-AALIVDHFSTDAFDVA 130
           LP V   DL   T+IE+ I+L+VTRS P LR +F S +    L  AL+VD F TDAFDVA
Sbjct: 68  LPPVDLTDLSSSTRIESRISLTVTRSNPELRKVFDSFVEGGRLPTALVVDLFGTDAFDVA 127

Query: 131 IEFNVPRYLFFPPSAMSLSFAFQLPSLDQIVAGEFREHPELIPIPGCIPIHGKDLLEPAQ 190
           +EF+VP Y+F+P +A  LSF   LP LD+ V+ EFRE  E + +PGC+P+ GKD L+PAQ
Sbjct: 128 VEFHVPPYIFYPTTANVLSFFLHLPKLDETVSCEFRELTEPLMLPGCVPVAGKDFLDPAQ 187

Query: 191 DREDDAYKLLLHNCKRYRLADAVFVNSFPELEPEAMKALLVEEAGKPPVYPVGPLVR--N 250
           DR+DDAYK LLHN KRY+ A+ + VN+F ELEP A+KAL      KPPVYPVGPLV    
Sbjct: 188 DRKDDAYKWLLHNTKRYKEAEGILVNTFFELEPNAIKALQEPGLDKPPVYPVGPLVNIGK 247

Query: 251 DCSENGKRAECLKWLDEQPNGSVLFVSFGSGGTLSSAQTNELALGLEMSEQRFLWVVRKP 310
             ++  + +ECLKWLD QP GSVL+VSFGSGGTL+  Q NELALGL  SEQRFLWV+R P
Sbjct: 248 QEAKQTEESECLKWLDNQPLGSVLYVSFGSGGTLTCEQLNELALGLADSEQRFLWVIRSP 307

Query: 311 NDEAANATFFGDEKEKENEAWRFLPEGFMERTKNRGMVVSSWAPQVEVLRHESTGGFLSH 370
           +   AN+++F  +   + +   FLP GF+ERTK RG V+  WAPQ +VL H STGGFL+H
Sbjct: 308 SG-IANSSYF--DSHSQTDPLTFLPPGFLERTKKRGFVIPFWAPQAQVLAHPSTGGFLTH 367

Query: 371 CGWNSTLEGVVNGVPLIAWPLYAEQRMNAHMVTEDIKVGLRPKKKEGSGIVEKEEIAEVV 430
           CGWNSTLE VV+G+PLIAWPLYAEQ+MNA +++EDI+  LRP+  +  G+V +EE+A VV
Sbjct: 368 CGWNSTLESVVSGIPLIAWPLYAEQKMNAVLLSEDIRAALRPRAGD-DGLVRREEVARVV 427

Query: 431 KSLMEGEEGKRIREKMKNLKNAAERGAGEDGYSSKAVCEMGMKWK 473
           K LMEGEEGK +R KMK LK AA R   +DG S+KA+  + +KWK
Sbjct: 428 KGLMEGEEGKGVRNKMKELKEAACRVLKDDGTSTKALSLVALKWK 468

BLAST of CmaCh00G002850 vs. ExPASy Swiss-Prot
Match: Q9LNI1 (UDP-glycosyltransferase 72B3 OS=Arabidopsis thaliana OX=3702 GN=UGT72B3 PE=2 SV=1)

HSP 1 Score: 480.7 bits (1236), Expect = 2.0e-134
Identity = 246/476 (51.68%), Positives = 331/476 (69.54%), Query Frame = 0

Query: 11  HIVMVPSPGIGHLIPLLQFAKRLVSLPGLSVTVAIPSDAPPTKPQKALFTNLPSTIQPLF 70
           H+ ++PSPGIGHLIPL++ AKRL+   G +VT  IP D+PP+K Q+++  +LPS+I  +F
Sbjct: 8   HVAIIPSPGIGHLIPLVELAKRLLDNHGFTVTFIIPGDSPPSKAQRSVLNSLPSSIASVF 67

Query: 71  LPLVSFHDLPKHTKIETIIALSVTRSLPFLRHLFQSLIGKTHL-AALIVDHFSTDAFDVA 130
           LP     D+P   +IET I+L+VTRS P LR LF SL  +  L A L+VD F TDAFDVA
Sbjct: 68  LPPADLSDVPSTARIETRISLTVTRSNPALRELFGSLSAEKRLPAVLVVDLFGTDAFDVA 127

Query: 131 IEFNVPRYLFFPPSAMSLSFAFQLPSLDQIVAGEFREHPELIPIPGCIPIHGKDLLEPAQ 190
            EF+V  Y+F+  +A  L+F   LP LD+ V+ EFRE  E + IPGC+PI GKD ++P Q
Sbjct: 128 AEFHVSPYIFYASNANVLTFLLHLPKLDETVSCEFRELTEPVIIPGCVPITGKDFVDPCQ 187

Query: 191 DREDDAYKLLLHNCKRYRLADAVFVNSFPELEPEAMKALLVEEAGKPPVYPVGPLVRNDC 250
           DR+D++YK LLHN KR++ A+ + VNSF +LEP  +K +      KPPVY +GPLV +  
Sbjct: 188 DRKDESYKWLLHNVKRFKEAEGILVNSFVDLEPNTIKIVQEPAPDKPPVYLIGPLVNSGS 247

Query: 251 SENGKRAE--CLKWLDEQPNGSVLFVSFGSGGTLSSAQTNELALGLEMSEQRFLWVVRKP 310
            +     E  CL WLD QP GSVL+VSFGSGGTL+  Q  ELALGL  S +RFLWV+R P
Sbjct: 248 HDADVNDEYKCLNWLDNQPFGSVLYVSFGSGGTLTFEQFIELALGLAESGKRFLWVIRSP 307

Query: 311 NDEAANATFFGDEKEKENEAWRFLPEGFMERTKNRGMVVSSWAPQVEVLRHESTGGFLSH 370
           +  A+++ F     +  N+ + FLP+GF++RTK +G+VV SWAPQ ++L H S GGFL+H
Sbjct: 308 SGIASSSYF---NPQSRNDPFSFLPQGFLDRTKEKGLVVGSWAPQAQILTHTSIGGFLTH 367

Query: 371 CGWNSTLEGVVNGVPLIAWPLYAEQRMNAHMVTEDIKVGLRPKKKEGSGIVEKEEIAEVV 430
           CGWNS+LE +VNGVPLIAWPLYAEQ+MNA ++  D+   LR +  E  G+V +EE+A VV
Sbjct: 368 CGWNSSLESIVNGVPLIAWPLYAEQKMNA-LLLVDVGAALRARLGE-DGVVGREEVARVV 427

Query: 431 KSLMEGEEGKRIREKMKNLKNAAERGAGEDGYSSKAVCEMGMKWKKTMMMISSSQE 484
           K L+EGEEG  +R+KMK LK  + R   +DG+S+K++ E+ +KWK     I   QE
Sbjct: 428 KGLIEGEEGNAVRKKMKELKEGSVRVLRDDGFSTKSLNEVSLKWKAHQRKIDQEQE 478

BLAST of CmaCh00G002850 vs. ExPASy Swiss-Prot
Match: Q8W4C2 (UDP-glycosyltransferase 72B2 OS=Arabidopsis thaliana OX=3702 GN=UGT72B2 PE=2 SV=1)

HSP 1 Score: 465.7 bits (1197), Expect = 6.5e-130
Identity = 243/465 (52.26%), Positives = 315/465 (67.74%), Query Frame = 0

Query: 11  HIVMVPSPGIGHLIPLLQFAKRLVSLPGLSVTVAIPSDAPPTKPQKALFTNLPSTIQPLF 70
           HI ++PSPG+GHLIP ++ AKRLV     +VT+ I  +  P+K Q+++  +LPS+I  +F
Sbjct: 8   HIAIMPSPGMGHLIPFVELAKRLVQHDCFTVTMIISGETSPSKAQRSVLNSLPSSIASVF 67

Query: 71  LPLVSFHDLPKHTKIETIIALSVTRSLPFLRHLFQSLIGKTHL-AALIVDHFSTDAFDVA 130
           LP     D+P   +IET   L++TRS P LR LF SL  K  L A L+VD F  DAFDVA
Sbjct: 68  LPPADLSDVPSTARIETRAMLTMTRSNPALRELFGSLSTKKSLPAVLVVDMFGADAFDVA 127

Query: 131 IEFNVPRYLFFPPSAMSLSFAFQLPSLDQIVAGEFREHPELIPIPGCIPIHGKDLLEPAQ 190
           ++F+V  Y+F+  +A  LSF   LP LD+ V+ EFR   E + IPGC+PI GKD L+  Q
Sbjct: 128 VDFHVSPYIFYASNANVLSFFLHLPKLDKTVSCEFRYLTEPLKIPGCVPITGKDFLDTVQ 187

Query: 191 DREDDAYKLLLHNCKRYRLADAVFVNSFPELEPEAMKALLVEEAGKPPVYPVGPLVRNDC 250
           DR DDAYKLLLHN KRY+ A  + VNSF +LE  A+KAL      KP VYP+GPLV    
Sbjct: 188 DRNDDAYKLLLHNTKRYKEAKGILVNSFVDLESNAIKALQEPAPDKPTVYPIGPLVNTSS 247

Query: 251 SENG--KRAECLKWLDEQPNGSVLFVSFGSGGTLSSAQTNELALGLEMSEQRFLWVVRKP 310
           S      +  CL WLD QP GSVL++SFGSGGTL+  Q NELA+GL  S +RF+WV+R P
Sbjct: 248 SNVNLEDKFGCLSWLDNQPFGSVLYISFGSGGTLTCEQFNELAIGLAESGKRFIWVIRSP 307

Query: 311 NDEAANATFFGDEKEKENEAWRFLPEGFMERTKNRGMVVSSWAPQVEVLRHESTGGFLSH 370
           ++  +++ F       E + + FLP GF++RTK +G+VV SWAPQV++L H ST GFL+H
Sbjct: 308 SEIVSSSYF---NPHSETDPFSFLPIGFLDRTKEKGLVVPSWAPQVQILAHPSTCGFLTH 367

Query: 371 CGWNSTLEGVVNGVPLIAWPLYAEQRMNAHMVTEDIKVGLRPKKKEGSGIVEKEEIAEVV 430
           CGWNSTLE +VNGVPLIAWPL+AEQ+MN  ++ ED+   LR    E  GIV +EE+  VV
Sbjct: 368 CGWNSTLESIVNGVPLIAWPLFAEQKMNTLLLVEDVGAALRIHAGE-DGIVRREEVVRVV 427

Query: 431 KSLMEGEEGKRIREKMKNLKNAAERGAGEDGYSSKAVCEMGMKWK 473
           K+LMEGEEGK I  K+K LK    R  G+DG SSK+  E+ +KWK
Sbjct: 428 KALMEGEEGKAIGNKVKELKEGVVRVLGDDGLSSKSFGEVLLKWK 468

BLAST of CmaCh00G002850 vs. ExPASy Swiss-Prot
Match: Q40287 (Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta OX=3983 GN=GT5 PE=2 SV=1)

HSP 1 Score: 334.3 bits (856), Expect = 2.3e-90
Identity = 185/474 (39.03%), Positives = 277/474 (58.44%), Query Frame = 0

Query: 8   SQFHIVMVPSPGIGHLIPLLQFAKRLVSLPGLSVTV-AIPSDAPPTKPQKALFTNLPSTI 67
           S+ HIV++ SPG+GHLIP+L+  KR+V+L    VT+  + SD    +PQ       P   
Sbjct: 8   SKPHIVLLSSPGLGHLIPVLELGKRIVTLCNFDVTIFMVGSDTSAAEPQVLRSAMTPKLC 67

Query: 68  QPLFLPLVSFHDL--PKHTKIETIIALSVTRSLPFLRHLFQSLIGKTHLAALIVDHFSTD 127
           + + LP  +   L  P+ T    +  L +    P  R    +L  K   AA+IVD F T+
Sbjct: 68  EIIQLPPPNISCLIDPEATVCTRLFVL-MREIRPAFRAAVSAL--KFRPAAIIVDLFGTE 127

Query: 128 AFDVAIEFNVPRYLFFPPSAMSLSFAFQLPSLDQIVAGEFREHPELIPIPGCIPIHGKDL 187
           + +VA E  + +Y++   +A  L+    +P LD+ V GEF    E + IPGC P+  +++
Sbjct: 128 SLEVAKELGIAKYVYIASNAWFLALTIYVPILDKEVEGEFVLQKEPMKIPGCRPVRTEEV 187

Query: 188 LEPAQDREDDAYKLLLHNCKRYRLADAVFVNSFPELEPEAMKAL----LVEEAGKPPVYP 247
           ++P  DR +  Y            AD + +N++  LEP    AL     +    K PV+P
Sbjct: 188 VDPMLDRTNQQYSEYFRLGIEIPTADGILMNTWEALEPTTFGALRDVKFLGRVAKVPVFP 247

Query: 248 VGPLVRNDCSENGKRAECLKWLDEQPNGSVLFVSFGSGGTLSSAQTNELALGLEMSEQRF 307
           +GPL R      G   E L WLD+QP  SV++VSFGSGGTLS  Q  ELA GLE S+QRF
Sbjct: 248 IGPL-RRQAGPCGSNCELLDWLDQQPKESVVYVSFGSGGTLSLEQMIELAWGLERSQQRF 307

Query: 308 LWVVRKPNDEAANATFFGDEKEKENEAWRFLPEGFMERTKNRGMVVSSWAPQVEVLRHES 367
           +WVVR+P  +  +A FF  + +  ++   + PEGF+ R +N G+VV  W+PQ+ ++ H S
Sbjct: 308 IWVVRQPTVKTGDAAFF-TQGDGADDMSGYFPEGFLTRIQNVGLVVPQWSPQIHIMSHPS 367

Query: 368 TGGFLSHCGWNSTLEGVVNGVPLIAWPLYAEQRMNAHMVTEDIKVGLRPKKKEGSGIVEK 427
            G FLSHCGWNS LE +  GVP+IAWP+YAEQRMNA ++TE++ V +RPK      +V++
Sbjct: 368 VGVFLSHCGWNSVLESITAGVPIIAWPIYAEQRMNATLLTEELGVAVRPKNLPAKEVVKR 427

Query: 428 EEIAEVVKSLMEGEEGKRIREKMKNLKNAAERGAGEDGYSSKAVCEMGMKWKKT 475
           EEI  +++ +M  EEG  IR++++ LK++ E+   E G S   +  +G +W+K+
Sbjct: 428 EEIERMIRRIMVDEEGSEIRKRVRELKDSGEKALNEGGSSFNYMSALGNEWEKS 476

BLAST of CmaCh00G002850 vs. TAIR 10
Match: AT4G01070.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 521.2 bits (1341), Expect = 9.3e-148
Identity = 266/465 (57.20%), Positives = 341/465 (73.33%), Query Frame = 0

Query: 11  HIVMVPSPGIGHLIPLLQFAKRLVSLPGLSVTVAIPSDAPPTKPQKALFTNLPSTIQPLF 70
           H+ ++PSPG+GHLIPL++FAKRLV L GL+VT  I  + PP+K Q+ +  +LPS+I  +F
Sbjct: 8   HVAIIPSPGMGHLIPLVEFAKRLVHLHGLTVTFVIAGEGPPSKAQRTVLDSLPSSISSVF 67

Query: 71  LPLVSFHDLPKHTKIETIIALSVTRSLPFLRHLFQSLIGKTHL-AALIVDHFSTDAFDVA 130
           LP V   DL   T+IE+ I+L+VTRS P LR +F S +    L  AL+VD F TDAFDVA
Sbjct: 68  LPPVDLTDLSSSTRIESRISLTVTRSNPELRKVFDSFVEGGRLPTALVVDLFGTDAFDVA 127

Query: 131 IEFNVPRYLFFPPSAMSLSFAFQLPSLDQIVAGEFREHPELIPIPGCIPIHGKDLLEPAQ 190
           +EF+VP Y+F+P +A  LSF   LP LD+ V+ EFRE  E + +PGC+P+ GKD L+PAQ
Sbjct: 128 VEFHVPPYIFYPTTANVLSFFLHLPKLDETVSCEFRELTEPLMLPGCVPVAGKDFLDPAQ 187

Query: 191 DREDDAYKLLLHNCKRYRLADAVFVNSFPELEPEAMKALLVEEAGKPPVYPVGPLVR--N 250
           DR+DDAYK LLHN KRY+ A+ + VN+F ELEP A+KAL      KPPVYPVGPLV    
Sbjct: 188 DRKDDAYKWLLHNTKRYKEAEGILVNTFFELEPNAIKALQEPGLDKPPVYPVGPLVNIGK 247

Query: 251 DCSENGKRAECLKWLDEQPNGSVLFVSFGSGGTLSSAQTNELALGLEMSEQRFLWVVRKP 310
             ++  + +ECLKWLD QP GSVL+VSFGSGGTL+  Q NELALGL  SEQRFLWV+R P
Sbjct: 248 QEAKQTEESECLKWLDNQPLGSVLYVSFGSGGTLTCEQLNELALGLADSEQRFLWVIRSP 307

Query: 311 NDEAANATFFGDEKEKENEAWRFLPEGFMERTKNRGMVVSSWAPQVEVLRHESTGGFLSH 370
           +   AN+++F  +   + +   FLP GF+ERTK RG V+  WAPQ +VL H STGGFL+H
Sbjct: 308 SG-IANSSYF--DSHSQTDPLTFLPPGFLERTKKRGFVIPFWAPQAQVLAHPSTGGFLTH 367

Query: 371 CGWNSTLEGVVNGVPLIAWPLYAEQRMNAHMVTEDIKVGLRPKKKEGSGIVEKEEIAEVV 430
           CGWNSTLE VV+G+PLIAWPLYAEQ+MNA +++EDI+  LRP+  +  G+V +EE+A VV
Sbjct: 368 CGWNSTLESVVSGIPLIAWPLYAEQKMNAVLLSEDIRAALRPRAGD-DGLVRREEVARVV 427

Query: 431 KSLMEGEEGKRIREKMKNLKNAAERGAGEDGYSSKAVCEMGMKWK 473
           K LMEGEEGK +R KMK LK AA R   +DG S+KA+  + +KWK
Sbjct: 428 KGLMEGEEGKGVRNKMKELKEAACRVLKDDGTSTKALSLVALKWK 468

BLAST of CmaCh00G002850 vs. TAIR 10
Match: AT1G01420.1 (UDP-glucosyl transferase 72B3)

HSP 1 Score: 480.7 bits (1236), Expect = 1.4e-135
Identity = 246/476 (51.68%), Positives = 331/476 (69.54%), Query Frame = 0

Query: 11  HIVMVPSPGIGHLIPLLQFAKRLVSLPGLSVTVAIPSDAPPTKPQKALFTNLPSTIQPLF 70
           H+ ++PSPGIGHLIPL++ AKRL+   G +VT  IP D+PP+K Q+++  +LPS+I  +F
Sbjct: 8   HVAIIPSPGIGHLIPLVELAKRLLDNHGFTVTFIIPGDSPPSKAQRSVLNSLPSSIASVF 67

Query: 71  LPLVSFHDLPKHTKIETIIALSVTRSLPFLRHLFQSLIGKTHL-AALIVDHFSTDAFDVA 130
           LP     D+P   +IET I+L+VTRS P LR LF SL  +  L A L+VD F TDAFDVA
Sbjct: 68  LPPADLSDVPSTARIETRISLTVTRSNPALRELFGSLSAEKRLPAVLVVDLFGTDAFDVA 127

Query: 131 IEFNVPRYLFFPPSAMSLSFAFQLPSLDQIVAGEFREHPELIPIPGCIPIHGKDLLEPAQ 190
            EF+V  Y+F+  +A  L+F   LP LD+ V+ EFRE  E + IPGC+PI GKD ++P Q
Sbjct: 128 AEFHVSPYIFYASNANVLTFLLHLPKLDETVSCEFRELTEPVIIPGCVPITGKDFVDPCQ 187

Query: 191 DREDDAYKLLLHNCKRYRLADAVFVNSFPELEPEAMKALLVEEAGKPPVYPVGPLVRNDC 250
           DR+D++YK LLHN KR++ A+ + VNSF +LEP  +K +      KPPVY +GPLV +  
Sbjct: 188 DRKDESYKWLLHNVKRFKEAEGILVNSFVDLEPNTIKIVQEPAPDKPPVYLIGPLVNSGS 247

Query: 251 SENGKRAE--CLKWLDEQPNGSVLFVSFGSGGTLSSAQTNELALGLEMSEQRFLWVVRKP 310
            +     E  CL WLD QP GSVL+VSFGSGGTL+  Q  ELALGL  S +RFLWV+R P
Sbjct: 248 HDADVNDEYKCLNWLDNQPFGSVLYVSFGSGGTLTFEQFIELALGLAESGKRFLWVIRSP 307

Query: 311 NDEAANATFFGDEKEKENEAWRFLPEGFMERTKNRGMVVSSWAPQVEVLRHESTGGFLSH 370
           +  A+++ F     +  N+ + FLP+GF++RTK +G+VV SWAPQ ++L H S GGFL+H
Sbjct: 308 SGIASSSYF---NPQSRNDPFSFLPQGFLDRTKEKGLVVGSWAPQAQILTHTSIGGFLTH 367

Query: 371 CGWNSTLEGVVNGVPLIAWPLYAEQRMNAHMVTEDIKVGLRPKKKEGSGIVEKEEIAEVV 430
           CGWNS+LE +VNGVPLIAWPLYAEQ+MNA ++  D+   LR +  E  G+V +EE+A VV
Sbjct: 368 CGWNSSLESIVNGVPLIAWPLYAEQKMNA-LLLVDVGAALRARLGE-DGVVGREEVARVV 427

Query: 431 KSLMEGEEGKRIREKMKNLKNAAERGAGEDGYSSKAVCEMGMKWKKTMMMISSSQE 484
           K L+EGEEG  +R+KMK LK  + R   +DG+S+K++ E+ +KWK     I   QE
Sbjct: 428 KGLIEGEEGNAVRKKMKELKEGSVRVLRDDGFSTKSLNEVSLKWKAHQRKIDQEQE 478

BLAST of CmaCh00G002850 vs. TAIR 10
Match: AT1G01390.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 465.7 bits (1197), Expect = 4.6e-131
Identity = 243/465 (52.26%), Positives = 315/465 (67.74%), Query Frame = 0

Query: 11  HIVMVPSPGIGHLIPLLQFAKRLVSLPGLSVTVAIPSDAPPTKPQKALFTNLPSTIQPLF 70
           HI ++PSPG+GHLIP ++ AKRLV     +VT+ I  +  P+K Q+++  +LPS+I  +F
Sbjct: 8   HIAIMPSPGMGHLIPFVELAKRLVQHDCFTVTMIISGETSPSKAQRSVLNSLPSSIASVF 67

Query: 71  LPLVSFHDLPKHTKIETIIALSVTRSLPFLRHLFQSLIGKTHL-AALIVDHFSTDAFDVA 130
           LP     D+P   +IET   L++TRS P LR LF SL  K  L A L+VD F  DAFDVA
Sbjct: 68  LPPADLSDVPSTARIETRAMLTMTRSNPALRELFGSLSTKKSLPAVLVVDMFGADAFDVA 127

Query: 131 IEFNVPRYLFFPPSAMSLSFAFQLPSLDQIVAGEFREHPELIPIPGCIPIHGKDLLEPAQ 190
           ++F+V  Y+F+  +A  LSF   LP LD+ V+ EFR   E + IPGC+PI GKD L+  Q
Sbjct: 128 VDFHVSPYIFYASNANVLSFFLHLPKLDKTVSCEFRYLTEPLKIPGCVPITGKDFLDTVQ 187

Query: 191 DREDDAYKLLLHNCKRYRLADAVFVNSFPELEPEAMKALLVEEAGKPPVYPVGPLVRNDC 250
           DR DDAYKLLLHN KRY+ A  + VNSF +LE  A+KAL      KP VYP+GPLV    
Sbjct: 188 DRNDDAYKLLLHNTKRYKEAKGILVNSFVDLESNAIKALQEPAPDKPTVYPIGPLVNTSS 247

Query: 251 SENG--KRAECLKWLDEQPNGSVLFVSFGSGGTLSSAQTNELALGLEMSEQRFLWVVRKP 310
           S      +  CL WLD QP GSVL++SFGSGGTL+  Q NELA+GL  S +RF+WV+R P
Sbjct: 248 SNVNLEDKFGCLSWLDNQPFGSVLYISFGSGGTLTCEQFNELAIGLAESGKRFIWVIRSP 307

Query: 311 NDEAANATFFGDEKEKENEAWRFLPEGFMERTKNRGMVVSSWAPQVEVLRHESTGGFLSH 370
           ++  +++ F       E + + FLP GF++RTK +G+VV SWAPQV++L H ST GFL+H
Sbjct: 308 SEIVSSSYF---NPHSETDPFSFLPIGFLDRTKEKGLVVPSWAPQVQILAHPSTCGFLTH 367

Query: 371 CGWNSTLEGVVNGVPLIAWPLYAEQRMNAHMVTEDIKVGLRPKKKEGSGIVEKEEIAEVV 430
           CGWNSTLE +VNGVPLIAWPL+AEQ+MN  ++ ED+   LR    E  GIV +EE+  VV
Sbjct: 368 CGWNSTLESIVNGVPLIAWPLFAEQKMNTLLLVEDVGAALRIHAGE-DGIVRREEVVRVV 427

Query: 431 KSLMEGEEGKRIREKMKNLKNAAERGAGEDGYSSKAVCEMGMKWK 473
           K+LMEGEEGK I  K+K LK    R  G+DG SSK+  E+ +KWK
Sbjct: 428 KALMEGEEGKAIGNKVKELKEGVVRVLGDDGLSSKSFGEVLLKWK 468

BLAST of CmaCh00G002850 vs. TAIR 10
Match: AT4G01070.2 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 362.8 bits (930), Expect = 4.2e-100
Identity = 189/344 (54.94%), Positives = 244/344 (70.93%), Query Frame = 0

Query: 11  HIVMVPSPGIGHLIPLLQFAKRLVSLPGLSVTVAIPSDAPPTKPQKALFTNLPSTIQPLF 70
           H+ ++PSPG+GHLIPL++FAKRLV L GL+VT  I  + PP+K Q+ +  +LPS+I  +F
Sbjct: 8   HVAIIPSPGMGHLIPLVEFAKRLVHLHGLTVTFVIAGEGPPSKAQRTVLDSLPSSISSVF 67

Query: 71  LPLVSFHDLPKHTKIETIIALSVTRSLPFLRHLFQSLIGKTHL-AALIVDHFSTDAFDVA 130
           LP V   DL   T+IE+ I+L+VTRS P LR +F S +    L  AL+VD F TDAFDVA
Sbjct: 68  LPPVDLTDLSSSTRIESRISLTVTRSNPELRKVFDSFVEGGRLPTALVVDLFGTDAFDVA 127

Query: 131 IEFNVPRYLFFPPSAMSLSFAFQLPSLDQIVAGEFREHPELIPIPGCIPIHGKDLLEPAQ 190
           +EF+VP Y+F+P +A  LSF   LP LD+ V+ EFRE  E + +PGC+P+ GKD L+PAQ
Sbjct: 128 VEFHVPPYIFYPTTANVLSFFLHLPKLDETVSCEFRELTEPLMLPGCVPVAGKDFLDPAQ 187

Query: 191 DREDDAYKLLLHNCKRYRLADAVFVNSFPELEPEAMKALLVEEAGKPPVYPVGPLVR--N 250
           DR+DDAYK LLHN KRY+ A+ + VN+F ELEP A+KAL      KPPVYPVGPLV    
Sbjct: 188 DRKDDAYKWLLHNTKRYKEAEGILVNTFFELEPNAIKALQEPGLDKPPVYPVGPLVNIGK 247

Query: 251 DCSENGKRAECLKWLDEQPNGSVLFVSFGSGGTLSSAQTNELALGLEMSEQRFLWVVRKP 310
             ++  + +ECLKWLD QP GSVL+VSFGSGGTL+  Q NELALGL  SEQRFLWV+R P
Sbjct: 248 QEAKQTEESECLKWLDNQPLGSVLYVSFGSGGTLTCEQLNELALGLADSEQRFLWVIRSP 307

Query: 311 NDEAANATFFGDEKEKENEAWRFLPEGFMERTKNRGMVVSSWAP 352
           +   AN+++F  +   + +   FLP GF+ERTK R  V + W P
Sbjct: 308 SG-IANSSYF--DSHSQTDPLTFLPPGFLERTKKR--VRAKWQP 346

BLAST of CmaCh00G002850 vs. TAIR 10
Match: AT3G50740.1 (UDP-glucosyl transferase 72E1)

HSP 1 Score: 307.4 bits (786), Expect = 2.1e-83
Identity = 179/459 (39.00%), Positives = 262/459 (57.08%), Query Frame = 0

Query: 11  HIVMVPSPGIGHLIPLLQFAKRLVSLPGLSVTV-AIPSDAPPTKPQKALFTNLP----ST 70
           H+ M  SPG+GH+IP+++  KRL    G  VT+  + +DA   + Q   F N P    + 
Sbjct: 7   HVAMFASPGMGHIIPVIELGKRLAGSHGFDVTIFVLETDAASAQSQ---FLNSPGCDAAL 66

Query: 71  IQPLFLPLVSFHDLPKHTKIETIIALSVTR-SLPFLRHLFQSLIGKTHLAALIVDHFSTD 130
           +  + LP      L   +    I  L + R ++P +R   + +  K    ALIVD F  D
Sbjct: 67  VDIVGLPTPDISGLVDPSAFFGIKLLVMMRETIPTIRSKIEEMQHKP--TALIVDLFGLD 126

Query: 131 AFDVAIEFNVPRYLFFPPSAMSLSFAFQLPSLDQIVAGEFREHPELIPIPGCIPIHGKDL 190
           A  +  EFN+  Y+F   +A  L+ A   P+LD+ +  E     + + +PGC P+  +D 
Sbjct: 127 AIPLGGEFNMLTYIFIASNARFLAVALFFPTLDKDMEEEHIIKKQPMVMPGCEPVRFEDT 186

Query: 191 LEPAQDREDDAYKLLLHNCKRYRLADAVFVNSFPELEPEAMKAL----LVEEAGKPPVYP 250
           LE   D     Y+  +     +   D + VN++ ++EP+ +K+L    L+      PVYP
Sbjct: 187 LETFLDPNSQLYREFVPFGSVFPTCDGIIVNTWDDMEPKTLKSLQDPKLLGRIAGVPVYP 246

Query: 251 VGPLVRNDCSENGKRAECLKWLDEQPNGSVLFVSFGSGGTLSSAQTNELALGLEMSEQRF 310
           +GPL R     +      L WL++QP+ SVL++SFGSGG+LS+ Q  ELA GLEMS+QRF
Sbjct: 247 IGPLSR-PVDPSKTNHPVLDWLNKQPDESVLYISFGSGGSLSAKQLTELAWGLEMSQQRF 306

Query: 311 LWVVRKPNDEAANATFFGDEKEKENEAW-RFLPEGFMERTKNRGMVVSSWAPQVEVLRHE 370
           +WVVR P D +A + +      K  +    +LPEGF+ RT  RG +VSSWAPQ E+L H+
Sbjct: 307 VWVVRPPVDGSACSAYLSANSGKIRDGTPDYLPEGFVSRTHERGFMVSSWAPQAEILAHQ 366

Query: 371 STGGFLSHCGWNSTLEGVVNGVPLIAWPLYAEQRMNAHMVTEDIKVGLRPKKKEGSGIVE 430
           + GGFL+HCGWNS LE VV GVP+IAWPL+AEQ MNA ++ E++ V +R KK    G++ 
Sbjct: 367 AVGGFLTHCGWNSILESVVGGVPMIAWPLFAEQMMNATLLNEELGVAVRSKKLPSEGVIT 426

Query: 431 KEEIAEVVKSLMEGEEGKRIREKMKNLKNAAERGAGEDG 459
           + EI  +V+ +M  EEG  +R+K+K LK  A      DG
Sbjct: 427 RAEIEALVRKIMVEEEGAEMRKKIKKLKETAAESLSCDG 459
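
The percentages in each HSP header are the match counts divided by the alignment length. As a quick consistency check, the sketch below re-derives them from the fractions; the `recompute` helper and its regular expression are illustrative assumptions, not part of BLAST itself.

```python
import re

# Matches fields of the form "<label> = <matches>/<length> (<percent>%)".
HSP_STAT = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def recompute(header_line):
    """Return {label: (recomputed_percent, reported_percent)} for each
    fraction found in an HSP header line."""
    stats = {}
    for label, matches, length, reported in HSP_STAT.findall(header_line):
        recomputed = round(100 * int(matches) / int(length), 2)
        stats[label] = (recomputed, float(reported))
    return stats


line = "Identity = 283/462 (61.26%), Positives = 348/462 (75.32%), Query Frame = 0"
for label, (recomputed, reported) in recompute(line).items():
    print(label, recomputed, reported)
# Identity 61.26 61.26
# Positives 75.32 75.32
```

The "Query Frame" field carries no fraction, so the pattern skips it.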

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
Q9AR73       9.3e-161  61.26         Hydroquinone glucosyltransferase OS=Rauvolfia serpentina OX=4060 GN=AS PE=1 SV=1
Q9M156       1.3e-146  57.20         UDP-glycosyltransferase 72B1 OS=Arabidopsis thaliana OX=3702 GN=UGT72B1 PE=1 SV=...
Q9LNI1       2.0e-134  51.68         UDP-glycosyltransferase 72B3 OS=Arabidopsis thaliana OX=3702 GN=UGT72B3 PE=2 SV=...
Q8W4C2       6.5e-130  52.26         UDP-glycosyltransferase 72B2 OS=Arabidopsis thaliana OX=3702 GN=UGT72B2 PE=2 SV=...
Q40287       2.3e-90   39.03         Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta OX=3983 GN=GT5 PE=2...

Match Name   E-value   Identity (%)  Description
AT4G01070.1  9.3e-148  57.20         UDP-Glycosyltransferase superfamily protein
AT1G01420.1  1.4e-135  51.68         UDP-glucosyl transferase 72B3
AT1G01390.1  4.6e-131  52.26         UDP-Glycosyltransferase superfamily protein
AT4G01070.2  4.2e-100  54.94         UDP-Glycosyltransferase superfamily protein
AT3G50740.1  2.1e-83   39.00         UDP-glucosyl transferase 72E1
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term   IPR Description                                 Source       Source Term     Source Description                              Alignment
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase        PFAM         PF00201         UDPGT                                           coord: 268..401; e-value: 3.8E-20; score: 72.2
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase        CDD          cd03784         GT1_Gtf-like                                    coord: 11..450; e-value: 4.74361E-81; score: 255.554
None       No IPR available                                GENE3D       3.40.50.2000    Glycogen Phosphorylase B;                       coord: 11..460; e-value: 1.8E-162; score: 543.4
None       No IPR available                                GENE3D       3.40.50.2000    Glycogen Phosphorylase B;                       coord: 256..455; e-value: 1.8E-162; score: 543.4
None       No IPR available                                PANTHER      PTHR48045:SF12  GLYCOSYLTRANSFERASE                             coord: 5..474
None       No IPR available                                PANTHER      PTHR48045       UDP-GLYCOSYLTRANSFERASE 72B1                    coord: 5..474
None       No IPR available                                SUPERFAMILY  53756           UDP-Glycosyltransferase/glycogen phosphorylase  coord: 11..464
IPR035595  UDP-glycosyltransferase family, conserved site  PROSITE      PS00375         UDPGT                                           coord: 349..392

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh00G002850.1  CmaCh00G002850.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0008194      UDP-glycosyltransferase activity