CmaCh00G002800 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh00G002800
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Glycosyltransferase
Location: Cma_Chr00: 22541929 .. 22543180 (+)
RNA-Seq Expression: CmaCh00G002800
Synteny: CmaCh00G002800
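
The Location field can be split into its parts programmatically. The sketch below is illustrative only: `parse_location` is a hypothetical helper, not part of any database API, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Parse 'seqid: start .. end (strand)' into (seqid, start, end, strand).

    Hypothetical helper; assumes the textual layout used in the Location field.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("Cma_Chr00: 22541929 .. 22543180 (+)")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)  # Cma_Chr00 + 1252
```

Under that assumption, the annotated locus spans 1252 bp on the forward strand.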
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATGCTCCGGTGCCTCATTTGGCCATTTTTCCCAGCCCCGGAATGGGCCATTTAATTCCACTTGTTGAATTCGCCAAACGCCTTGTTTTGCACCACCATATAACGGTCACTTTCATCGTCCCTTCCGACCATCCGCCGTCCAAGCCTCAAAAATCTCTCCTTGATTCCCTCCCTTCCGGCATCGATCACACCTTCCTCCCGCTGGTTAGCTTCCACGATCTTCCCCATGAATCCAAGATTGAAACCATCATTTCCCTCACTGTTTCTCGATCTCTCCCATCTCTTAAAAATGTACTGAAATCCATGGTGACTAATTCAAACCTTGTCGGCTTGGTCGTTGATCTTTTCGGCACCGACGCGTTCGATTTAGCTAGAGAATTCAACATTTCCCCTTATATTTTCTTTCCTTCCACCGCTATGGTTCTCTCATTTGCTCTGTTTCTTCCCAAACTCGATGAATCCGTCGCCGGCGAGTACGGTGACCTCCAGGAGCCGATCCAAATTCCGGGATGTATTCCGATTCACGGCAAGAATCTATTGGACCCAGTTCAAGATCGGAAGGACGAAGCCTATAAATGGACATTTCACAACATGAAGAGGTACGTTTTATCAGATGGGATTTTTCTGAACAGCTTCCCGGAATTGGAGCCCGGAGCTATCAAATTTCTCCGAGAAGAGAAACAGGGGAAGCCCCCTGTTTACCCAATTGGGCCATTGGTGAGAATCGATTCAAATGGAAGCAACGAAACAGGGGAAGCCCCCTGTTTACCCAATTGGGCCATTGGTGAGAATCGATTCAAATGGAAGCAACAAGAAGGCAGAGTGCCTGAAATGGTTGGACGAACAGCCAAATGGGTCTGTTCTATACGTGTCATTTGGAAGTGGGGGCACTCTGTCTAGCCATCAAATCAACGAATTAGCCATGGGATTGGAAATGAGTGGACAAAGATTCATATGGGTGGTAAGAAGTCCAAGCGATAAGGTGGCCAACGCTACCTATTTCAACGTCCACAGCCACGACGACCCTTTTGATTTTTTACCAGAAGGGTTCGTAGAAAGGATGAAAAACAGGGGACTGGTGGTGCCGTCATGGGCGCCGCAGACTCAGATACTTAGCCACAGCTCCACCGGCGCATTCCTGACTCATTGTGGATGGAACTCAACGTTAGAGGCGGTGGTTAATGGAGTTCCTCTGATCGCATGGCCGCTTTATGCAGAGCAGAAAATGAACGCTAGAGGCGGTGGTTAA

mRNA sequence

ATGGATGCTCCGGTGCCTCATTTGGCCATTTTTCCCAGCCCCGGAATGGGCCATTTAATTCCACTTGTTGAATTCGCCAAACGCCTTGTTTTGCACCACCATATAACGGTCACTTTCATCGTCCCTTCCGACCATCCGCCGTCCAAGCCTCAAAAATCTCTCCTTGATTCCCTCCCTTCCGGCATCGATCACACCTTCCTCCCGCTGGTTAGCTTCCACGATCTTCCCCATGAATCCAAGATTGAAACCATCATTTCCCTCACTGTTTCTCGATCTCTCCCATCTCTTAAAAATGTACTGAAATCCATGGTGACTAATTCAAACCTTGTCGGCTTGGTCGTTGATCTTTTCGGCACCGACGCGTTCGATTTAGCTAGAGAATTCAACATTTCCCCTTATATTTTCTTTCCTTCCACCGCTATGGTTCTCTCATTTGCTCTGTTTCTTCCCAAACTCGATGAATCCGTCGCCGGCGAGTACGGTGACCTCCAGGAGCCGATCCAAATTCCGGGATGTATTCCGATTCACGGCAAGAATCTATTGGACCCAGTTCAAGATCGGAAGGACGAAGCCTATAAATGGACATTTCACAACATGAAGAGGTACGTTTTATCAGATGGGATTTTTCTGAACAGCTTCCCGGAATTGGAGCCCGGAGCTATCAAATTTCTCCGAGAAGAGAAACAGGGGAAGCCCCCTGTTTACCCAATTGGGCCATTGGGGAAGCCCCCTGTTTACCCAATTGGGCCATTGGTGAGAATCGATTCAAATGGAAGCAACAAGAAGGCAGAGTGCCTGAAATGGTTGGACGAACAGCCAAATGGGTCTGTTCTATACGTTGGGGGCACTCTGTCTAGCCATCAAATCAACGAATTAGCCATGGGATTGGAAATGAGTGGACAAAGATTCATATGGGTGGTAAGAAGTCCAAGCGATAAGGTGGCCAACGCTACCTATTTCAACGTCCACAGCCACGACGACCCTTTTGATTTTTTACCAGAAGGGTTCGTAGAAAGGATGAAAAACAGGGGACTGGTGGTGCCGTCATGGGCGCCGCAGACTCAGATACTTAGCCACAGCTCCACCGGCGCATTCCTGACTCATTGTGGATGGAACTCAACGTTAGAGGCGGTGGTTAATGGAGTTCCTCTGATCGCATGGCCGCTTTATGCAGAGCAGAAAATGAACGCTAGAGGCGGTGGTTAA

Coding sequence (CDS)

ATGGATGCTCCGGTGCCTCATTTGGCCATTTTTCCCAGCCCCGGAATGGGCCATTTAATTCCACTTGTTGAATTCGCCAAACGCCTTGTTTTGCACCACCATATAACGGTCACTTTCATCGTCCCTTCCGACCATCCGCCGTCCAAGCCTCAAAAATCTCTCCTTGATTCCCTCCCTTCCGGCATCGATCACACCTTCCTCCCGCTGGTTAGCTTCCACGATCTTCCCCATGAATCCAAGATTGAAACCATCATTTCCCTCACTGTTTCTCGATCTCTCCCATCTCTTAAAAATGTACTGAAATCCATGGTGACTAATTCAAACCTTGTCGGCTTGGTCGTTGATCTTTTCGGCACCGACGCGTTCGATTTAGCTAGAGAATTCAACATTTCCCCTTATATTTTCTTTCCTTCCACCGCTATGGTTCTCTCATTTGCTCTGTTTCTTCCCAAACTCGATGAATCCGTCGCCGGCGAGTACGGTGACCTCCAGGAGCCGATCCAAATTCCGGGATGTATTCCGATTCACGGCAAGAATCTATTGGACCCAGTTCAAGATCGGAAGGACGAAGCCTATAAATGGACATTTCACAACATGAAGAGGTACGTTTTATCAGATGGGATTTTTCTGAACAGCTTCCCGGAATTGGAGCCCGGAGCTATCAAATTTCTCCGAGAAGAGAAACAGGGGAAGCCCCCTGTTTACCCAATTGGGCCATTGGGGAAGCCCCCTGTTTACCCAATTGGGCCATTGGTGAGAATCGATTCAAATGGAAGCAACAAGAAGGCAGAGTGCCTGAAATGGTTGGACGAACAGCCAAATGGGTCTGTTCTATACGTTGGGGGCACTCTGTCTAGCCATCAAATCAACGAATTAGCCATGGGATTGGAAATGAGTGGACAAAGATTCATATGGGTGGTAAGAAGTCCAAGCGATAAGGTGGCCAACGCTACCTATTTCAACGTCCACAGCCACGACGACCCTTTTGATTTTTTACCAGAAGGGTTCGTAGAAAGGATGAAAAACAGGGGACTGGTGGTGCCGTCATGGGCGCCGCAGACTCAGATACTTAGCCACAGCTCCACCGGCGCATTCCTGACTCATTGTGGATGGAACTCAACGTTAGAGGCGGTGGTTAATGGAGTTCCTCTGATCGCATGGCCGCTTTATGCAGAGCAGAAAATGAACGCTAGAGGCGGTGGTTAA

Protein sequence

MDAPVPHLAIFPSPGMGHLIPLVEFAKRLVLHHHITVTFIVPSDHPPSKPQKSLLDSLPSGIDHTFLPLVSFHDLPHESKIETIISLTVSRSLPSLKNVLKSMVTNSNLVGLVVDLFGTDAFDLAREFNISPYIFFPSTAMVLSFALFLPKLDESVAGEYGDLQEPIQIPGCIPIHGKNLLDPVQDRKDEAYKWTFHNMKRYVLSDGIFLNSFPELEPGAIKFLREEKQGKPPVYPIGPLGKPPVYPIGPLVRIDSNGSNKKAECLKWLDEQPNGSVLYVGGTLSSHQINELAMGLEMSGQRFIWVVRSPSDKVANATYFNVHSHDDPFDFLPEGFVERMKNRGLVVPSWAPQTQILSHSSTGAFLTHCGWNSTLEAVVNGVPLIAWPLYAEQKMNARGGG
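
The relationship between the coding sequence and the protein sequence can be checked with a few lines of code. This is a minimal sketch that assumes the standard genetic code and uses only the first 27 nucleotides of the CDS listed above. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_prefix = "ATGGATGCTCCGGTGCCTCATTTGGCC"  # first 27 nt of the CDS above
print(translate(cds_prefix))  # MDAPVPHLA
```

The output matches the first nine residues of the protein sequence shown above.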
Homology
BLAST of CmaCh00G002800 vs. ExPASy Swiss-Prot
Match: Q9AR73 (Hydroquinone glucosyltransferase OS=Rauvolfia serpentina OX=4060 GN=AS PE=1 SV=1)

HSP 1 Score: 523.1 bits (1346), Expect = 2.8e-147
Identity = 248/396 (62.63%), Positives = 309/396 (78.03%), Query Frame = 0

Query: 6   PHLAIFPSPGMGHLIPLVEFAKRLVLHHHITVTFIVPSDHPPSKPQKSLLDSLPSGIDHT 65
           PH+A+ P+PGMGHLIPLVEFAKRLVL H+  VTFI+P+D P  K QKS LD+LP+G+++ 
Sbjct: 5   PHIAMVPTPGMGHLIPLVEFAKRLVLRHNFGVTFIIPTDGPLPKAQKSFLDALPAGVNYV 64

Query: 66  FLPLVSFHDLPHESKIETIISLTVSRSLPSLKNVLKSMVTNSNLVGLVVDLFGTDAFDLA 125
            LP VSF DLP + +IET I LT++RSLP +++ +K+++  + L  LVVDLFGTDAFD+A
Sbjct: 65  LLPPVSFDDLPADVRIETRICLTITRSLPFVRDAVKTLLATTKLAALVVDLFGTDAFDVA 124

Query: 126 REFNISPYIFFPSTAMVLSFALFLPKLDESVAGEYGDLQEPIQIPGCIPIHGKNLLDPVQ 185
            EF +SPYIF+P+TAM LS    LPKLD+ V+ EY D+ EP+QIPGCIPIHGK+ LDP Q
Sbjct: 125 IEFKVSPYIFYPTTAMCLSLFFHLPKLDQMVSCEYRDVPEPLQIPGCIPIHGKDFLDPAQ 184

Query: 186 DRKDEAYKWTFHNMKRYVLSDGIFLNSFPELEPGAIKFLREEKQGKPPVYPIGPLGKPPV 245
           DRK++AYK   H  KRY L++GI +N+F +LEPG +K L+EE Q           GKPPV
Sbjct: 185 DRKNDAYKCLLHQAKRYRLAEGIMVNTFNDLEPGPLKALQEEDQ-----------GKPPV 244

Query: 246 YPIGPLVRIDSNGSNKKAECLKWLDEQPNGSVLYV----GGTLSSHQINELAMGLEMSGQ 305
           YPIGPL+R DS+      ECLKWLD+QP GSVL++    GG +S +Q  ELA+GLEMS Q
Sbjct: 245 YPIGPLIRADSSSKVDDCECLKWLDDQPRGSVLFISFGSGGAVSHNQFIELALGLEMSEQ 304

Query: 306 RFIWVVRSPSDKVANATYFNVHSHDDPFDFLPEGFVERMKNRGLVVPSWAPQTQILSHSS 365
           RF+WVVRSP+DK+ANATYF++ + +D   +LPEGF+ER K R L+VPSWAPQT+ILSH S
Sbjct: 305 RFLWVVRSPNDKIANATYFSIQNQNDALAYLPEGFLERTKGRCLLVPSWAPQTEILSHGS 364

Query: 366 TGAFLTHCGWNSTLEAVVNGVPLIAWPLYAEQKMNA 398
           TG FLTHCGWNS LE+VVNGVPLIAWPLYAEQKMNA
Sbjct: 365 TGGFLTHCGWNSILESVVNGVPLIAWPLYAEQKMNA 389
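
The percentages reported for this HSP follow directly from the counts on its statistics line: each is the number of matching positions divided by the aligned length, gaps included. The sketch below reproduces them. The function name `percent` is illustrative.

```python
def percent(count, aligned_length):
    """Return count as a percentage of the aligned length, rounded to two decimals."""
    return round(100.0 * count / aligned_length, 2)

# Counts taken from the HSP statistics line for Q9AR73 above.
identities, positives, aligned_length = 248, 309, 396
print(percent(identities, aligned_length))  # 62.63
print(percent(positives, aligned_length))   # 78.03
```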

BLAST of CmaCh00G002800 vs. ExPASy Swiss-Prot
Match: Q9M156 (UDP-glycosyltransferase 72B1 OS=Arabidopsis thaliana OX=3702 GN=UGT72B1 PE=1 SV=1)

HSP 1 Score: 491.5 bits (1264), Expect = 9.0e-138
Identity = 243/403 (60.30%), Positives = 299/403 (74.19%), Query Frame = 0

Query: 2   DAPVPHLAIFPSPGMGHLIPLVEFAKRLVLHHHITVTFIVPSDHPPSKPQKSLLDSLPSG 61
           ++  PH+AI PSPGMGHLIPLVEFAKRLV  H +TVTF++  + PPSK Q+++LDSLPS 
Sbjct: 3   ESKTPHVAIIPSPGMGHLIPLVEFAKRLVHLHGLTVTFVIAGEGPPSKAQRTVLDSLPSS 62

Query: 62  IDHTFLPLVSFHDLPHESKIETIISLTVSRSLPSLKNVLKSMVTNSNL-VGLVVDLFGTD 121
           I   FLP V   DL   ++IE+ ISLTV+RS P L+ V  S V    L   LVVDLFGTD
Sbjct: 63  ISSVFLPPVDLTDLSSSTRIESRISLTVTRSNPELRKVFDSFVEGGRLPTALVVDLFGTD 122

Query: 122 AFDLAREFNISPYIFFPSTAMVLSFALFLPKLDESVAGEYGDLQEPIQIPGCIPIHGKNL 181
           AFD+A EF++ PYIF+P+TA VLSF L LPKLDE+V+ E+ +L EP+ +PGC+P+ GK+ 
Sbjct: 123 AFDVAVEFHVPPYIFYPTTANVLSFFLHLPKLDETVSCEFRELTEPLMLPGCVPVAGKDF 182

Query: 182 LDPVQDRKDEAYKWTFHNMKRYVLSDGIFLNSFPELEPGAIKFLREEKQGKPPVYPIGPL 241
           LDP QDRKD+AYKW  HN KRY  ++GI +N+F ELEP AIK L+E             L
Sbjct: 183 LDPAQDRKDDAYKWLLHNTKRYKEAEGILVNTFFELEPNAIKALQEP-----------GL 242

Query: 242 GKPPVYPIGPLVRIDSNGS--NKKAECLKWLDEQPNGSVLYV----GGTLSSHQINELAM 301
            KPPVYP+GPLV I    +   +++ECLKWLD QP GSVLYV    GGTL+  Q+NELA+
Sbjct: 243 DKPPVYPVGPLVNIGKQEAKQTEESECLKWLDNQPLGSVLYVSFGSGGTLTCEQLNELAL 302

Query: 302 GLEMSGQRFIWVVRSPSDKVANATYFNVHSHDDPFDFLPEGFVERMKNRGLVVPSWAPQT 361
           GL  S QRF+WV+RSPS  +AN++YF+ HS  DP  FLP GF+ER K RG V+P WAPQ 
Sbjct: 303 GLADSEQRFLWVIRSPSG-IANSSYFDSHSQTDPLTFLPPGFLERTKKRGFVIPFWAPQA 362

Query: 362 QILSHSSTGAFLTHCGWNSTLEAVVNGVPLIAWPLYAEQKMNA 398
           Q+L+H STG FLTHCGWNSTLE+VV+G+PLIAWPLYAEQKMNA
Sbjct: 363 QVLAHPSTGGFLTHCGWNSTLESVVSGIPLIAWPLYAEQKMNA 393

BLAST of CmaCh00G002800 vs. ExPASy Swiss-Prot
Match: Q9LNI1 (UDP-glycosyltransferase 72B3 OS=Arabidopsis thaliana OX=3702 GN=UGT72B3 PE=2 SV=1)

HSP 1 Score: 474.2 bits (1219), Expect = 1.5e-132
Identity = 236/403 (58.56%), Positives = 297/403 (73.70%), Query Frame = 0

Query: 2   DAPVPHLAIFPSPGMGHLIPLVEFAKRLVLHHHITVTFIVPSDHPPSKPQKSLLDSLPSG 61
           D   PH+AI PSPG+GHLIPLVE AKRL+ +H  TVTFI+P D PPSK Q+S+L+SLPS 
Sbjct: 3   DGNTPHVAIIPSPGIGHLIPLVELAKRLLDNHGFTVTFIIPGDSPPSKAQRSVLNSLPSS 62

Query: 62  IDHTFLPLVSFHDLPHESKIETIISLTVSRSLPSLKNVLKSMVTNSNLVG-LVVDLFGTD 121
           I   FLP     D+P  ++IET ISLTV+RS P+L+ +  S+     L   LVVDLFGTD
Sbjct: 63  IASVFLPPADLSDVPSTARIETRISLTVTRSNPALRELFGSLSAEKRLPAVLVVDLFGTD 122

Query: 122 AFDLAREFNISPYIFFPSTAMVLSFALFLPKLDESVAGEYGDLQEPIQIPGCIPIHGKNL 181
           AFD+A EF++SPYIF+ S A VL+F L LPKLDE+V+ E+ +L EP+ IPGC+PI GK+ 
Sbjct: 123 AFDVAAEFHVSPYIFYASNANVLTFLLHLPKLDETVSCEFRELTEPVIIPGCVPITGKDF 182

Query: 182 LDPVQDRKDEAYKWTFHNMKRYVLSDGIFLNSFPELEPGAIKFLREEKQGKPPVYPIGPL 241
           +DP QDRKDE+YKW  HN+KR+  ++GI +NSF +LEP  IK ++E              
Sbjct: 183 VDPCQDRKDESYKWLLHNVKRFKEAEGILVNSFVDLEPNTIKIVQEPAP----------- 242

Query: 242 GKPPVYPIGPLVRIDSNGS--NKKAECLKWLDEQPNGSVLYV----GGTLSSHQINELAM 301
            KPPVY IGPLV   S+ +  N + +CL WLD QP GSVLYV    GGTL+  Q  ELA+
Sbjct: 243 DKPPVYLIGPLVNSGSHDADVNDEYKCLNWLDNQPFGSVLYVSFGSGGTLTFEQFIELAL 302

Query: 302 GLEMSGQRFIWVVRSPSDKVANATYFNVHSHDDPFDFLPEGFVERMKNRGLVVPSWAPQT 361
           GL  SG+RF+WV+RSPS  +A+++YFN  S +DPF FLP+GF++R K +GLVV SWAPQ 
Sbjct: 303 GLAESGKRFLWVIRSPSG-IASSSYFNPQSRNDPFSFLPQGFLDRTKEKGLVVGSWAPQA 362

Query: 362 QILSHSSTGAFLTHCGWNSTLEAVVNGVPLIAWPLYAEQKMNA 398
           QIL+H+S G FLTHCGWNS+LE++VNGVPLIAWPLYAEQKMNA
Sbjct: 363 QILTHTSIGGFLTHCGWNSSLESIVNGVPLIAWPLYAEQKMNA 393

BLAST of CmaCh00G002800 vs. ExPASy Swiss-Prot
Match: Q8W4C2 (UDP-glycosyltransferase 72B2 OS=Arabidopsis thaliana OX=3702 GN=UGT72B2 PE=2 SV=1)

HSP 1 Score: 453.0 bits (1164), Expect = 3.6e-126
Identity = 230/404 (56.93%), Positives = 287/404 (71.04%), Query Frame = 0

Query: 2   DAPVPHLAIFPSPGMGHLIPLVEFAKRLVLHHHITVTFIVPSDHPPSKPQKSLLDSLPSG 61
           +A  PH+AI PSPGMGHLIP VE AKRLV H   TVT I+  +  PSK Q+S+L+SLPS 
Sbjct: 3   EANTPHIAIMPSPGMGHLIPFVELAKRLVQHDCFTVTMIISGETSPSKAQRSVLNSLPSS 62

Query: 62  IDHTFLPLVSFHDLPHESKIETIISLTVSRSLPSLKNVLKSMVTNSNLVG-LVVDLFGTD 121
           I   FLP     D+P  ++IET   LT++RS P+L+ +  S+ T  +L   LVVD+FG D
Sbjct: 63  IASVFLPPADLSDVPSTARIETRAMLTMTRSNPALRELFGSLSTKKSLPAVLVVDMFGAD 122

Query: 122 AFDLAREFNISPYIFFPSTAMVLSFALFLPKLDESVAGEYGDLQEPIQIPGCIPIHGKNL 181
           AFD+A +F++SPYIF+ S A VLSF L LPKLD++V+ E+  L EP++IPGC+PI GK+ 
Sbjct: 123 AFDVAVDFHVSPYIFYASNANVLSFFLHLPKLDKTVSCEFRYLTEPLKIPGCVPITGKDF 182

Query: 182 LDPVQDRKDEAYKWTFHNMKRYVLSDGIFLNSFPELEPGAIKFLREEKQGKPPVYPIGPL 241
           LD VQDR D+AYK   HN KRY  + GI +NSF +LE  AIK L+E    KP VYPIGPL
Sbjct: 183 LDTVQDRNDDAYKLLLHNTKRYKEAKGILVNSFVDLESNAIKALQEPAPDKPTVYPIGPL 242

Query: 242 GKPPVYPIGPLVRIDSNGSNKKAE----CLKWLDEQPNGSVLYV----GGTLSSHQINEL 301
                        ++++ SN   E    CL WLD QP GSVLY+    GGTL+  Q NEL
Sbjct: 243 -------------VNTSSSNVNLEDKFGCLSWLDNQPFGSVLYISFGSGGTLTCEQFNEL 302

Query: 302 AMGLEMSGQRFIWVVRSPSDKVANATYFNVHSHDDPFDFLPEGFVERMKNRGLVVPSWAP 361
           A+GL  SG+RFIWV+RSPS+ + +++YFN HS  DPF FLP GF++R K +GLVVPSWAP
Sbjct: 303 AIGLAESGKRFIWVIRSPSE-IVSSSYFNPHSETDPFSFLPIGFLDRTKEKGLVVPSWAP 362

Query: 362 QTQILSHSSTGAFLTHCGWNSTLEAVVNGVPLIAWPLYAEQKMN 397
           Q QIL+H ST  FLTHCGWNSTLE++VNGVPLIAWPL+AEQKMN
Sbjct: 363 QVQILAHPSTCGFLTHCGWNSTLESIVNGVPLIAWPLFAEQKMN 392

BLAST of CmaCh00G002800 vs. ExPASy Swiss-Prot
Match: Q9LVR1 (UDP-glycosyltransferase 72E2 OS=Arabidopsis thaliana OX=3702 GN=UGT72E2 PE=1 SV=1)

HSP 1 Score: 312.4 bits (799), Expect = 7.5e-84
Identity = 171/402 (42.54%), Positives = 246/402 (61.19%), Query Frame = 0

Query: 6   PHLAIFPSPGMGHLIPLVEFAKRLVLHH--HITVTFIVPSDHPPSKPQKSLLDSLPSGID 65
           PH A+F SPGMGH+IP++E  KRL  ++  H+TV F++ +D   +  Q   L+S  +G+D
Sbjct: 6   PHAAMFSSPGMGHVIPVIELGKRLSANNGFHVTV-FVLETD--AASAQSKFLNS--TGVD 65

Query: 66  HTFLPLVSFHDL-PHESKIETIISLTVSRSLPSLKNVLKSMVTNSNLVGLVVDLFGTDAF 125
              LP    + L   +  + T I + +  ++P+L++ + +M  +     L+VDLFGTDA 
Sbjct: 66  IVKLPSPDIYGLVDPDDHVVTKIGVIMRAAVPALRSKIAAM--HQKPTALIVDLFGTDAL 125

Query: 126 DLAREFNISPYIFFPSTAMVLSFALFLPKLDESVAGEYGDLQEPIQIPGCIPIHGKNLLD 185
            LA+EFN+  Y+F P+ A  L  +++ P LD+ +  E+   + P+ IPGC P+  ++ LD
Sbjct: 126 CLAKEFNMLSYVFIPTNARFLGVSIYYPNLDKDIKEEHTVQRNPLAIPGCEPVRFEDTLD 185

Query: 186 PVQDRKDEAYKWTFHNMKRYVLSDGIFLNSFPELEPGAIKFLREEKQGKPPVYPIGPLGK 245
                 +  Y+    +   Y  +DGI +N++ E+EP ++K L   K        +G + +
Sbjct: 186 AYLVPDEPVYRDFVRHGLAYPKADGILVNTWEEMEPKSLKSLLNPKL-------LGRVAR 245

Query: 246 PPVYPIGPLVRIDSNGSNKKAECLKWLDEQPNGSVLYV----GGTLSSHQINELAMGLEM 305
            PVYPIGPL R     S      L WL+EQPN SVLY+    GG LS+ Q+ ELA GLE 
Sbjct: 246 VPVYPIGPLCR-PIQSSETDHPVLDWLNEQPNESVLYISFGSGGCLSAKQLTELAWGLEQ 305

Query: 306 SGQRFIWVVRSPSDKVANATYFNVH---SHDDPFDFLPEGFVERMKNRGLVVPSWAPQTQ 365
           S QRF+WVVR P D    + Y + +   + D+  ++LPEGFV R  +RG VVPSWAPQ +
Sbjct: 306 SQQRFVWVVRPPVDGSCCSEYVSANGGGTEDNTPEYLPEGFVSRTSDRGFVVPSWAPQAE 365

Query: 366 ILSHSSTGAFLTHCGWNSTLEAVVNGVPLIAWPLYAEQKMNA 398
           ILSH + G FLTHCGW+STLE+VV GVP+IAWPL+AEQ MNA
Sbjct: 366 ILSHRAVGGFLTHCGWSSTLESVVGGVPMIAWPLFAEQNMNA 392

BLAST of CmaCh00G002800 vs. TAIR 10
Match: AT4G01070.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 491.5 bits (1264), Expect = 6.4e-139
Identity = 243/403 (60.30%), Positives = 299/403 (74.19%), Query Frame = 0

Query: 2   DAPVPHLAIFPSPGMGHLIPLVEFAKRLVLHHHITVTFIVPSDHPPSKPQKSLLDSLPSG 61
           ++  PH+AI PSPGMGHLIPLVEFAKRLV  H +TVTF++  + PPSK Q+++LDSLPS 
Sbjct: 3   ESKTPHVAIIPSPGMGHLIPLVEFAKRLVHLHGLTVTFVIAGEGPPSKAQRTVLDSLPSS 62

Query: 62  IDHTFLPLVSFHDLPHESKIETIISLTVSRSLPSLKNVLKSMVTNSNL-VGLVVDLFGTD 121
           I   FLP V   DL   ++IE+ ISLTV+RS P L+ V  S V    L   LVVDLFGTD
Sbjct: 63  ISSVFLPPVDLTDLSSSTRIESRISLTVTRSNPELRKVFDSFVEGGRLPTALVVDLFGTD 122

Query: 122 AFDLAREFNISPYIFFPSTAMVLSFALFLPKLDESVAGEYGDLQEPIQIPGCIPIHGKNL 181
           AFD+A EF++ PYIF+P+TA VLSF L LPKLDE+V+ E+ +L EP+ +PGC+P+ GK+ 
Sbjct: 123 AFDVAVEFHVPPYIFYPTTANVLSFFLHLPKLDETVSCEFRELTEPLMLPGCVPVAGKDF 182

Query: 182 LDPVQDRKDEAYKWTFHNMKRYVLSDGIFLNSFPELEPGAIKFLREEKQGKPPVYPIGPL 241
           LDP QDRKD+AYKW  HN KRY  ++GI +N+F ELEP AIK L+E             L
Sbjct: 183 LDPAQDRKDDAYKWLLHNTKRYKEAEGILVNTFFELEPNAIKALQEP-----------GL 242

Query: 242 GKPPVYPIGPLVRIDSNGS--NKKAECLKWLDEQPNGSVLYV----GGTLSSHQINELAM 301
            KPPVYP+GPLV I    +   +++ECLKWLD QP GSVLYV    GGTL+  Q+NELA+
Sbjct: 243 DKPPVYPVGPLVNIGKQEAKQTEESECLKWLDNQPLGSVLYVSFGSGGTLTCEQLNELAL 302

Query: 302 GLEMSGQRFIWVVRSPSDKVANATYFNVHSHDDPFDFLPEGFVERMKNRGLVVPSWAPQT 361
           GL  S QRF+WV+RSPS  +AN++YF+ HS  DP  FLP GF+ER K RG V+P WAPQ 
Sbjct: 303 GLADSEQRFLWVIRSPSG-IANSSYFDSHSQTDPLTFLPPGFLERTKKRGFVIPFWAPQA 362

Query: 362 QILSHSSTGAFLTHCGWNSTLEAVVNGVPLIAWPLYAEQKMNA 398
           Q+L+H STG FLTHCGWNSTLE+VV+G+PLIAWPLYAEQKMNA
Sbjct: 363 QVLAHPSTGGFLTHCGWNSTLESVVSGIPLIAWPLYAEQKMNA 393

BLAST of CmaCh00G002800 vs. TAIR 10
Match: AT1G01420.1 (UDP-glucosyl transferase 72B3 )

HSP 1 Score: 474.2 bits (1219), Expect = 1.1e-133
Identity = 236/403 (58.56%), Positives = 297/403 (73.70%), Query Frame = 0

Query: 2   DAPVPHLAIFPSPGMGHLIPLVEFAKRLVLHHHITVTFIVPSDHPPSKPQKSLLDSLPSG 61
           D   PH+AI PSPG+GHLIPLVE AKRL+ +H  TVTFI+P D PPSK Q+S+L+SLPS 
Sbjct: 3   DGNTPHVAIIPSPGIGHLIPLVELAKRLLDNHGFTVTFIIPGDSPPSKAQRSVLNSLPSS 62

Query: 62  IDHTFLPLVSFHDLPHESKIETIISLTVSRSLPSLKNVLKSMVTNSNLVG-LVVDLFGTD 121
           I   FLP     D+P  ++IET ISLTV+RS P+L+ +  S+     L   LVVDLFGTD
Sbjct: 63  IASVFLPPADLSDVPSTARIETRISLTVTRSNPALRELFGSLSAEKRLPAVLVVDLFGTD 122

Query: 122 AFDLAREFNISPYIFFPSTAMVLSFALFLPKLDESVAGEYGDLQEPIQIPGCIPIHGKNL 181
           AFD+A EF++SPYIF+ S A VL+F L LPKLDE+V+ E+ +L EP+ IPGC+PI GK+ 
Sbjct: 123 AFDVAAEFHVSPYIFYASNANVLTFLLHLPKLDETVSCEFRELTEPVIIPGCVPITGKDF 182

Query: 182 LDPVQDRKDEAYKWTFHNMKRYVLSDGIFLNSFPELEPGAIKFLREEKQGKPPVYPIGPL 241
           +DP QDRKDE+YKW  HN+KR+  ++GI +NSF +LEP  IK ++E              
Sbjct: 183 VDPCQDRKDESYKWLLHNVKRFKEAEGILVNSFVDLEPNTIKIVQEPAP----------- 242

Query: 242 GKPPVYPIGPLVRIDSNGS--NKKAECLKWLDEQPNGSVLYV----GGTLSSHQINELAM 301
            KPPVY IGPLV   S+ +  N + +CL WLD QP GSVLYV    GGTL+  Q  ELA+
Sbjct: 243 DKPPVYLIGPLVNSGSHDADVNDEYKCLNWLDNQPFGSVLYVSFGSGGTLTFEQFIELAL 302

Query: 302 GLEMSGQRFIWVVRSPSDKVANATYFNVHSHDDPFDFLPEGFVERMKNRGLVVPSWAPQT 361
           GL  SG+RF+WV+RSPS  +A+++YFN  S +DPF FLP+GF++R K +GLVV SWAPQ 
Sbjct: 303 GLAESGKRFLWVIRSPSG-IASSSYFNPQSRNDPFSFLPQGFLDRTKEKGLVVGSWAPQA 362

Query: 362 QILSHSSTGAFLTHCGWNSTLEAVVNGVPLIAWPLYAEQKMNA 398
           QIL+H+S G FLTHCGWNS+LE++VNGVPLIAWPLYAEQKMNA
Sbjct: 363 QILTHTSIGGFLTHCGWNSSLESIVNGVPLIAWPLYAEQKMNA 393

BLAST of CmaCh00G002800 vs. TAIR 10
Match: AT1G01390.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 453.0 bits (1164), Expect = 2.5e-127
Identity = 230/404 (56.93%), Positives = 287/404 (71.04%), Query Frame = 0

Query: 2   DAPVPHLAIFPSPGMGHLIPLVEFAKRLVLHHHITVTFIVPSDHPPSKPQKSLLDSLPSG 61
           +A  PH+AI PSPGMGHLIP VE AKRLV H   TVT I+  +  PSK Q+S+L+SLPS 
Sbjct: 3   EANTPHIAIMPSPGMGHLIPFVELAKRLVQHDCFTVTMIISGETSPSKAQRSVLNSLPSS 62

Query: 62  IDHTFLPLVSFHDLPHESKIETIISLTVSRSLPSLKNVLKSMVTNSNLVG-LVVDLFGTD 121
           I   FLP     D+P  ++IET   LT++RS P+L+ +  S+ T  +L   LVVD+FG D
Sbjct: 63  IASVFLPPADLSDVPSTARIETRAMLTMTRSNPALRELFGSLSTKKSLPAVLVVDMFGAD 122

Query: 122 AFDLAREFNISPYIFFPSTAMVLSFALFLPKLDESVAGEYGDLQEPIQIPGCIPIHGKNL 181
           AFD+A +F++SPYIF+ S A VLSF L LPKLD++V+ E+  L EP++IPGC+PI GK+ 
Sbjct: 123 AFDVAVDFHVSPYIFYASNANVLSFFLHLPKLDKTVSCEFRYLTEPLKIPGCVPITGKDF 182

Query: 182 LDPVQDRKDEAYKWTFHNMKRYVLSDGIFLNSFPELEPGAIKFLREEKQGKPPVYPIGPL 241
           LD VQDR D+AYK   HN KRY  + GI +NSF +LE  AIK L+E    KP VYPIGPL
Sbjct: 183 LDTVQDRNDDAYKLLLHNTKRYKEAKGILVNSFVDLESNAIKALQEPAPDKPTVYPIGPL 242

Query: 242 GKPPVYPIGPLVRIDSNGSNKKAE----CLKWLDEQPNGSVLYV----GGTLSSHQINEL 301
                        ++++ SN   E    CL WLD QP GSVLY+    GGTL+  Q NEL
Sbjct: 243 -------------VNTSSSNVNLEDKFGCLSWLDNQPFGSVLYISFGSGGTLTCEQFNEL 302

Query: 302 AMGLEMSGQRFIWVVRSPSDKVANATYFNVHSHDDPFDFLPEGFVERMKNRGLVVPSWAP 361
           A+GL  SG+RFIWV+RSPS+ + +++YFN HS  DPF FLP GF++R K +GLVVPSWAP
Sbjct: 303 AIGLAESGKRFIWVIRSPSE-IVSSSYFNPHSETDPFSFLPIGFLDRTKEKGLVVPSWAP 362

Query: 362 QTQILSHSSTGAFLTHCGWNSTLEAVVNGVPLIAWPLYAEQKMN 397
           Q QIL+H ST  FLTHCGWNSTLE++VNGVPLIAWPL+AEQKMN
Sbjct: 363 QVQILAHPSTCGFLTHCGWNSTLESIVNGVPLIAWPLFAEQKMN 392

BLAST of CmaCh00G002800 vs. TAIR 10
Match: AT4G01070.2 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 393.3 bits (1009), Expect = 2.4e-109
Identity = 203/358 (56.70%), Positives = 253/358 (70.67%), Query Frame = 0

Query: 2   DAPVPHLAIFPSPGMGHLIPLVEFAKRLVLHHHITVTFIVPSDHPPSKPQKSLLDSLPSG 61
           ++  PH+AI PSPGMGHLIPLVEFAKRLV  H +TVTF++  + PPSK Q+++LDSLPS 
Sbjct: 3   ESKTPHVAIIPSPGMGHLIPLVEFAKRLVHLHGLTVTFVIAGEGPPSKAQRTVLDSLPSS 62

Query: 62  IDHTFLPLVSFHDLPHESKIETIISLTVSRSLPSLKNVLKSMVTNSNL-VGLVVDLFGTD 121
           I   FLP V   DL   ++IE+ ISLTV+RS P L+ V  S V    L   LVVDLFGTD
Sbjct: 63  ISSVFLPPVDLTDLSSSTRIESRISLTVTRSNPELRKVFDSFVEGGRLPTALVVDLFGTD 122

Query: 122 AFDLAREFNISPYIFFPSTAMVLSFALFLPKLDESVAGEYGDLQEPIQIPGCIPIHGKNL 181
           AFD+A EF++ PYIF+P+TA VLSF L LPKLDE+V+ E+ +L EP+ +PGC+P+ GK+ 
Sbjct: 123 AFDVAVEFHVPPYIFYPTTANVLSFFLHLPKLDETVSCEFRELTEPLMLPGCVPVAGKDF 182

Query: 182 LDPVQDRKDEAYKWTFHNMKRYVLSDGIFLNSFPELEPGAIKFLREEKQGKPPVYPIGPL 241
           LDP QDRKD+AYKW  HN KRY  ++GI +N+F ELEP AIK L+E             L
Sbjct: 183 LDPAQDRKDDAYKWLLHNTKRYKEAEGILVNTFFELEPNAIKALQEP-----------GL 242

Query: 242 GKPPVYPIGPLVRIDSNGS--NKKAECLKWLDEQPNGSVLYV----GGTLSSHQINELAM 301
            KPPVYP+GPLV I    +   +++ECLKWLD QP GSVLYV    GGTL+  Q+NELA+
Sbjct: 243 DKPPVYPVGPLVNIGKQEAKQTEESECLKWLDNQPLGSVLYVSFGSGGTLTCEQLNELAL 302

Query: 302 GLEMSGQRFIWVVRSPSDKVANATYFNVHSHDDPFDFLPEGFVERMKNRGLVVPSWAP 353
           GL  S QRF+WV+RSPS  +AN++YF+ HS  DP  FLP GF+ER K R  V   W P
Sbjct: 303 GLADSEQRFLWVIRSPSG-IANSSYFDSHSQTDPLTFLPPGFLERTKKR--VRAKWQP 346

BLAST of CmaCh00G002800 vs. TAIR 10
Match: AT5G66690.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 312.4 bits (799), Expect = 5.3e-85
Identity = 171/402 (42.54%), Positives = 246/402 (61.19%), Query Frame = 0

Query: 6   PHLAIFPSPGMGHLIPLVEFAKRLVLHH--HITVTFIVPSDHPPSKPQKSLLDSLPSGID 65
           PH A+F SPGMGH+IP++E  KRL  ++  H+TV F++ +D   +  Q   L+S  +G+D
Sbjct: 6   PHAAMFSSPGMGHVIPVIELGKRLSANNGFHVTV-FVLETD--AASAQSKFLNS--TGVD 65

Query: 66  HTFLPLVSFHDL-PHESKIETIISLTVSRSLPSLKNVLKSMVTNSNLVGLVVDLFGTDAF 125
              LP    + L   +  + T I + +  ++P+L++ + +M  +     L+VDLFGTDA 
Sbjct: 66  IVKLPSPDIYGLVDPDDHVVTKIGVIMRAAVPALRSKIAAM--HQKPTALIVDLFGTDAL 125

Query: 126 DLAREFNISPYIFFPSTAMVLSFALFLPKLDESVAGEYGDLQEPIQIPGCIPIHGKNLLD 185
            LA+EFN+  Y+F P+ A  L  +++ P LD+ +  E+   + P+ IPGC P+  ++ LD
Sbjct: 126 CLAKEFNMLSYVFIPTNARFLGVSIYYPNLDKDIKEEHTVQRNPLAIPGCEPVRFEDTLD 185

Query: 186 PVQDRKDEAYKWTFHNMKRYVLSDGIFLNSFPELEPGAIKFLREEKQGKPPVYPIGPLGK 245
                 +  Y+    +   Y  +DGI +N++ E+EP ++K L   K        +G + +
Sbjct: 186 AYLVPDEPVYRDFVRHGLAYPKADGILVNTWEEMEPKSLKSLLNPKL-------LGRVAR 245

Query: 246 PPVYPIGPLVRIDSNGSNKKAECLKWLDEQPNGSVLYV----GGTLSSHQINELAMGLEM 305
            PVYPIGPL R     S      L WL+EQPN SVLY+    GG LS+ Q+ ELA GLE 
Sbjct: 246 VPVYPIGPLCR-PIQSSETDHPVLDWLNEQPNESVLYISFGSGGCLSAKQLTELAWGLEQ 305

Query: 306 SGQRFIWVVRSPSDKVANATYFNVH---SHDDPFDFLPEGFVERMKNRGLVVPSWAPQTQ 365
           S QRF+WVVR P D    + Y + +   + D+  ++LPEGFV R  +RG VVPSWAPQ +
Sbjct: 306 SQQRFVWVVRPPVDGSCCSEYVSANGGGTEDNTPEYLPEGFVSRTSDRGFVVPSWAPQAE 365

Query: 366 ILSHSSTGAFLTHCGWNSTLEAVVNGVPLIAWPLYAEQKMNA 398
           ILSH + G FLTHCGW+STLE+VV GVP+IAWPL+AEQ MNA
Sbjct: 366 ILSHRAVGGFLTHCGWSSTLESVVGGVPMIAWPLFAEQNMNA 392

The following BLAST results are available for this feature:
ExPASy Swiss-Prot:
Match Name | E-value | Identity (%) | Description
Q9AR73 | 2.8e-147 | 62.63 | Hydroquinone glucosyltransferase OS=Rauvolfia serpentina OX=4060 GN=AS PE=1 SV=1
Q9M156 | 9.0e-138 | 60.30 | UDP-glycosyltransferase 72B1 OS=Arabidopsis thaliana OX=3702 GN=UGT72B1 PE=1 SV=...
Q9LNI1 | 1.5e-132 | 58.56 | UDP-glycosyltransferase 72B3 OS=Arabidopsis thaliana OX=3702 GN=UGT72B3 PE=2 SV=...
Q8W4C2 | 3.6e-126 | 56.93 | UDP-glycosyltransferase 72B2 OS=Arabidopsis thaliana OX=3702 GN=UGT72B2 PE=2 SV=...
Q9LVR1 | 7.5e-84 | 42.54 | UDP-glycosyltransferase 72E2 OS=Arabidopsis thaliana OX=3702 GN=UGT72E2 PE=1 SV=...

TAIR 10:
Match Name | E-value | Identity (%) | Description
AT4G01070.1 | 6.4e-139 | 60.30 | UDP-Glycosyltransferase superfamily protein
AT1G01420.1 | 1.1e-133 | 58.56 | UDP-glucosyl transferase 72B3
AT1G01390.1 | 2.5e-127 | 56.93 | UDP-Glycosyltransferase superfamily protein
AT4G01070.2 | 2.4e-109 | 56.70 | UDP-Glycosyltransferase superfamily protein
AT5G66690.1 | 5.3e-85 | 42.54 | UDP-Glycosyltransferase superfamily protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | GENE3D | 3.40.50.2000 | Glycogen Phosphorylase B; | coord: 263..398; e-value: 4.5E-137; score: 459.6
None | No IPR available | GENE3D | 3.40.50.2000 | Glycogen Phosphorylase B; | coord: 7..238; e-value: 4.5E-137; score: 459.6
None | No IPR available | PANTHER | PTHR48045:SF12 | GLYCOSYLTRANSFERASE | coord: 5..240; coord: 241..397
None | No IPR available | PANTHER | PTHR48045 | UDP-GLYCOSYLTRANSFERASE 72B1 | coord: 5..240; coord: 241..397
None | No IPR available | SUPERFAMILY | 53756 | UDP-Glycosyltransferase/glycogen phosphorylase | coord: 6..398
IPR002213 | UDP-glucuronosyl/UDP-glucosyltransferase | PFAM | PF00201 | UDPGT | coord: 285..398; e-value: 1.3E-19; score: 70.4
IPR002213 | UDP-glucuronosyl/UDP-glucosyltransferase | CDD | cd03784 | GT1_Gtf-like | coord: 6..398; e-value: 6.87586E-63; score: 205.478
IPR035595 | UDP-glycosyltransferase family, conserved site | PROSITE | PS00375 | UDPGT | coord: 350..393

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmaCh00G002800.1 | CmaCh00G002800.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0016021 | integral component of membrane
molecular_function | GO:0008194 | UDP-glycosyltransferase activity