Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAGCTCTGCACGTTGATTCTCAACCCCACCTTGTTATCGTCCCAAGTCCCGGCGTTGGCCATCTAATTCCCCTCGTCGAGTTCGCCAAACGCCTCGTCTCCCTCCACAATTTCTCCGTCACCATCGCCATTCCCTCCAACATCCCTCCGACCAAACCCCAAAGAGCTGTCTTAACCGACCTCCCTTCCACTATCCAACCCCTCTTCCTCCCCCCCATCTCCTTCAACGATCTCCCCGAAAACCCCAAAATCGAAACCATCATCATCCTTTCTGTAACTCGCTCTGTTCCATTCCTTCGCGACCTCTTCAAATCCCTCATCGGAAAAACCCATCTTGCTGGCCTTATCGTCGACCATTTCAGTACTGACGCCTTCGATGTCGCCATCGAATTCGACGTCCCTTGCTACCTTTTCTTCCCTCCTTCTGCCATGAACCTTTCCTTCGCATTACAAATGCCCAGCCTCGACCAAATCATCGCCGGCGAGTACAGGGACCATCCCGAGCTGATTCAGATTCCGGGGTGCATTCCGATTCATGGGAAAGAGCTTCAGGAACCGACTCAAGATAGGAGTGACGATGCCTACAAGCTATTGCTCCATAACTGTAAGAGGTATAGAATGGCGGATACCAGATAGGAGTGACGATGCCTACAAGCTATTGCTCCATAACTGTAAGAGGTATAGAATGGCGGATACCATTTTTCTCAACAGCTACCCTGAATTGGAGCCTGAAGCTATAAAAGCTCTGCTAGAGGAGGAACCAGGGAACCCCCCTGTTTATCCAATTGGTCCGCTGGTGAGGAAAGATTGCAGTGAAAAGGAAGAGAGAGCGGATTGTTTGAAATGGCTTGATGAACAGCCAAAGGAGTCTGTTCTGTTTGTGTCGTGAAGAGAGAGCGGATTGTTTGAAATGGCTTGATGAACAGCCAAAGGAGTCTGTTCTGTTTGTGTCGTTTGGGAGTCGGGGGGCTCTTTGGCGTGATCAAATCAACGAATTGGCGTTGGGATTGGAAATGAGTGGGCAGAGATTCATATGGGTCGTTAGAAAACCGAAAGACGAGACGGCTACTACGACGTTGTTTAACGACCAGAATGAAAAGGAGGTGTCGAGATTCCTGCCGGAGGGGTTTATAGAAAGGACTAAAAACAGGGGAATGGTGGTGCCATTGTGGGCGCCACAGGTTGAGGTGCTGAGGCATGAGTCCACCGGGGGGTTCTTGAGCCACTGCGGGTGGAACTCAACTCTGGAGGCTGTGGTGAACGGGGTGCCTCTGATTGCTTGGCCGGCGTATGCAGAACAGAGGATGAACGCCCATATGCTGACAGAGGGCATTAAAATTGCTTTGAGGCCGAAGAAGAAGGAGGAAAGAGGGATTGTGGAGAAGGAAGAGGTTGCAGAAGTGGTGAAGTCGTTAATGGAAGGTGAAGAGGGGAAAAGGGTTCGTGAGAAAGTGAAGTATCTGAAGAATGAAGCAGAAAGAGCTCTGGGAGAAGATGGATGTTCTTCCAAAGCACTCTCTGAAATAGCTCTGAAGTTGAAGAAGACGAAGATTGGGTATTAA
mRNA sequence
ATGGAAGCTCTGCACGTTGATTCTCAACCCCACCTTGTTATCGTCCCAAGTCCCGGCGTTGGCCATCTAATTCCCCTCGTCGAGTTCGCCAAACGCCTCGTCTCCCTCCACAATTTCTCCGTCACCATCGCCATTCCCTCCAACATCCCTCCGACCAAACCCCAAAGAGCTGTCTTAACCGACCTCCCTTCCACTATCCAACCCCTCTTCCTCCCCCCCATCTCCTTCAACGATCTCCCCGAAAACCCCAAAATCGAAACCATCATCATCCTTTCTGTAACTCGCTCTGTTCCATTCCTTCGCGACCTCTTCAAATCCCTCATCGGAAAAACCCATCTTGCTGGCCTTATCGTCGACCATTTCAGTACTGACGCCTTCGATGTCGCCATCGAATTCGACGTCCCTTGCTACCTTTTCTTCCCTCCTTCTGCCATGAACCTTTCCTTCGCATTACAAATGCCCAGCCTCGACCAAATCATCGCCGGCGAGTACAGGGACCATCCCGAGCTGATTCAGATTCCGGGGTGCATTCCGATTCATGGGAAAGAGCTTCAGGAACCGACTCAAGATAGGAGTGACGATGCCTACAAGCTATTGCTCCATAACTGTAAGAGGTATAGAATGGCGGATACCATTTTTCTCAACAGCTACCCTGAATTGGAGCCTGAAGCTATAAAAGCTCTGCTAGAGGAGGAACCAGGGAACCCCCCTGTTTATCCAATTGGTCCGCTGGTGAGGAAAGATTGCAAGAGAGCGGATTGTTTGAAATGGCTTGATGAACAGCCAAAGGAGTCTGTTCTGTTTGTGTCGTTTGGGAGTCGGGGGGCTCTTTGGCGTGATCAAATCAACGAATTGGCGTTGGGATTGGAAATGAGTGGGCAGAGATTCATATGGGTCGTTAGAAAACCGAAAGACGAGACGGCTACTACGACGTTGTTTAACGACCAGAATGAAAAGGAGGTGTCGAGATTCCTGCCGGAGGGGTTTATAGAAAGGACTAAAAACAGGGGAATGGTGGTGCCATTGTGGGCGCCACAGGTTGAGGTGCTGAGGCATGAGTCCACCGGGGGGTTCTTGAGCCACTGCGGGTGGAACTCAACTCTGGAGGCTGTGGTGAACGGGGTGCCTCTGATTGCTTGGCCGGCGTATGCAGAACAGAGGATGAACGCCCATATGCTGACAGAGGGCATTAAAATTGCTTTGAGGCCGAAGAAGAAGGAGGAAAGAGGGATTGTGGAGAAGGAAGAGGTTGCAGAAGTGGTGAAGTCGTTAATGGAAGGTGAAGAGGGGAAAAGGGTTCGTGAGAAAGTGAAGTATCTGAAGAATGAAGCAGAAAGAGCTCTGGGAGAAGATGGATGTTCTTCCAAAGCACTCTCTGAAATAGCTCTGAAGTTGAAGAAGACGAAGATTGGGTATTAA
Coding sequence (CDS)
ATGGAAGCTCTGCACGTTGATTCTCAACCCCACCTTGTTATCGTCCCAAGTCCCGGCGTTGGCCATCTAATTCCCCTCGTCGAGTTCGCCAAACGCCTCGTCTCCCTCCACAATTTCTCCGTCACCATCGCCATTCCCTCCAACATCCCTCCGACCAAACCCCAAAGAGCTGTCTTAACCGACCTCCCTTCCACTATCCAACCCCTCTTCCTCCCCCCCATCTCCTTCAACGATCTCCCCGAAAACCCCAAAATCGAAACCATCATCATCCTTTCTGTAACTCGCTCTGTTCCATTCCTTCGCGACCTCTTCAAATCCCTCATCGGAAAAACCCATCTTGCTGGCCTTATCGTCGACCATTTCAGTACTGACGCCTTCGATGTCGCCATCGAATTCGACGTCCCTTGCTACCTTTTCTTCCCTCCTTCTGCCATGAACCTTTCCTTCGCATTACAAATGCCCAGCCTCGACCAAATCATCGCCGGCGAGTACAGGGACCATCCCGAGCTGATTCAGATTCCGGGGTGCATTCCGATTCATGGGAAAGAGCTTCAGGAACCGACTCAAGATAGGAGTGACGATGCCTACAAGCTATTGCTCCATAACTGTAAGAGGTATAGAATGGCGGATACCATTTTTCTCAACAGCTACCCTGAATTGGAGCCTGAAGCTATAAAAGCTCTGCTAGAGGAGGAACCAGGGAACCCCCCTGTTTATCCAATTGGTCCGCTGGTGAGGAAAGATTGCAAGAGAGCGGATTGTTTGAAATGGCTTGATGAACAGCCAAAGGAGTCTGTTCTGTTTGTGTCGTTTGGGAGTCGGGGGGCTCTTTGGCGTGATCAAATCAACGAATTGGCGTTGGGATTGGAAATGAGTGGGCAGAGATTCATATGGGTCGTTAGAAAACCGAAAGACGAGACGGCTACTACGACGTTGTTTAACGACCAGAATGAAAAGGAGGTGTCGAGATTCCTGCCGGAGGGGTTTATAGAAAGGACTAAAAACAGGGGAATGGTGGTGCCATTGTGGGCGCCACAGGTTGAGGTGCTGAGGCATGAGTCCACCGGGGGGTTCTTGAGCCACTGCGGGTGGAACTCAACTCTGGAGGCTGTGGTGAACGGGGTGCCTCTGATTGCTTGGCCGGCGTATGCAGAACAGAGGATGAACGCCCATATGCTGACAGAGGGCATTAAAATTGCTTTGAGGCCGAAGAAGAAGGAGGAAAGAGGGATTGTGGAGAAGGAAGAGGTTGCAGAAGTGGTGAAGTCGTTAATGGAAGGTGAAGAGGGGAAAAGGGTTCGTGAGAAAGTGAAGTATCTGAAGAATGAAGCAGAAAGAGCTCTGGGAGAAGATGGATGTTCTTCCAAAGCACTCTCTGAAATAGCTCTGAAGTTGAAGAAGACGAAGATTGGGTATTAA
Protein sequence
MEALHVDSQPHLVIVPSPGVGHLIPLVEFAKRLVSLHNFSVTIAIPSNIPPTKPQRAVLTDLPSTIQPLFLPPISFNDLPENPKIETIIILSVTRSVPFLRDLFKSLIGKTHLAGLIVDHFSTDAFDVAIEFDVPCYLFFPPSAMNLSFALQMPSLDQIIAGEYRDHPELIQIPGCIPIHGKELQEPTQDRSDDAYKLLLHNCKRYRMADTIFLNSYPELEPEAIKALLEEEPGNPPVYPIGPLVRKDCKRADCLKWLDEQPKESVLFVSFGSRGALWRDQINELALGLEMSGQRFIWVVRKPKDETATTTLFNDQNEKEVSRFLPEGFIERTKNRGMVVPLWAPQVEVLRHESTGGFLSHCGWNSTLEAVVNGVPLIAWPAYAEQRMNAHMLTEGIKIALRPKKKEERGIVEKEEVAEVVKSLMEGEEGKRVREKVKYLKNEAERALGEDGCSSKALSEIALKLKKTKIGY
Homology
BLAST of CmaCh00G002840 vs. ExPASy Swiss-Prot
Match:
Q9AR73 (Hydroquinone glucosyltransferase OS=Rauvolfia serpentina OX=4060 GN=AS PE=1 SV=1)
HSP 1 Score: 562.4 bits (1448), Expect = 4.9e-159
Identity = 274/463 (59.18%), Postives = 350/463 (75.59%), Query Frame = 0
Query: 6 VDSQPHLVIVPSPGVGHLIPLVEFAKRLVSLHNFSVTIAIPSNIPPTKPQRAVLTDLPST 65
++ PH+ +VP+PG+GHLIPLVEFAKRLV HNF VT IP++ P K Q++ L LP+
Sbjct: 1 MEHTPHIAMVPTPGMGHLIPLVEFAKRLVLRHNFGVTFIIPTDGPLPKAQKSFLDALPAG 60
Query: 66 IQPLFLPPISFNDLPENPKIETIIILSVTRSVPFLRDLFKSLIGKTHLAGLIVDHFSTDA 125
+ + LPP+SF+DLP + +IET I L++TRS+PF+RD K+L+ T LA L+VD F TDA
Sbjct: 61 VNYVLLPPVSFDDLPADVRIETRICLTITRSLPFVRDAVKTLLATTKLAALVVDLFGTDA 120
Query: 126 FDVAIEFDVPCYLFFPPSAMNLSFALQMPSLDQIIAGEYRDHPELIQIPGCIPIHGKELQ 185
FDVAIEF V Y+F+P +AM LS +P LDQ+++ EYRD PE +QIPGCIPIHGK+
Sbjct: 121 FDVAIEFKVSPYIFYPTTAMCLSLFFHLPKLDQMVSCEYRDVPEPLQIPGCIPIHGKDFL 180
Query: 186 EPTQDRSDDAYKLLLHNCKRYRMADTIFLNSYPELEPEAIKALLEEEPGNPPVYPIGPLV 245
+P QDR +DAYK LLH KRYR+A+ I +N++ +LEP +KAL EE+ G PPVYPIGPL+
Sbjct: 181 DPAQDRKNDAYKCLLHQAKRYRLAEGIMVNTFNDLEPGPLKALQEEDQGKPPVYPIGPLI 240
Query: 246 RKDCKR----ADCLKWLDEQPKESVLFVSFGSRGALWRDQINELALGLEMSGQRFIWVVR 305
R D +CLKWLD+QP+ SVLF+SFGS GA+ +Q ELALGLEMS QRF+WVVR
Sbjct: 241 RADSSSKVDDCECLKWLDDQPRGSVLFISFGSGGAVSHNQFIELALGLEMSEQRFLWVVR 300
Query: 306 KPKDETATTTLFNDQNEKEVSRFLPEGFIERTKNRGMVVPLWAPQVEVLRHESTGGFLSH 365
P D+ A T F+ QN+ + +LPEGF+ERTK R ++VP WAPQ E+L H STGGFL+H
Sbjct: 301 SPNDKIANATYFSIQNQNDALAYLPEGFLERTKGRCLLVPSWAPQTEILSHGSTGGFLTH 360
Query: 366 CGWNSTLEAVVNGVPLIAWPAYAEQRMNAHMLTEGIKIALRPKKKEERGIVEKEEVAEVV 425
CGWNS LE+VVNGVPLIAWP YAEQ+MNA MLTEG+K+ALRP K E G++ + E+A V
Sbjct: 361 CGWNSILESVVNGVPLIAWPLYAEQKMNAVMLTEGLKVALRP-KAGENGLIGRVEIANAV 420
Query: 426 KSLMEGEEGKRVREKVKYLKNEAERALGEDGCSSKALSEIALK 465
K LMEGEEGK+ R +K LK+ A RAL +DG S+KAL+E+A K
Sbjct: 421 KGLMEGEEGKKFRSTMKDLKDAASRALSDDGSSTKALAELACK 462
BLAST of CmaCh00G002840 vs. ExPASy Swiss-Prot
Match:
Q9M156 (UDP-glycosyltransferase 72B1 OS=Arabidopsis thaliana OX=3702 GN=UGT72B1 PE=1 SV=1)
HSP 1 Score: 515.8 bits (1327), Expect = 5.3e-145
Identity = 263/467 (56.32%), Postives = 334/467 (71.52%), Query Frame = 0
Query: 10 PHLVIVPSPGVGHLIPLVEFAKRLVSLHNFSVTIAIPSNIPPTKPQRAVLTDLPSTIQPL 69
PH+ I+PSPG+GHLIPLVEFAKRLV LH +VT I PP+K QR VL LPS+I +
Sbjct: 7 PHVAIIPSPGMGHLIPLVEFAKRLVHLHGLTVTFVIAGEGPPSKAQRTVLDSLPSSISSV 66
Query: 70 FLPPISFNDLPENPKIETIIILSVTRSVPFLRDLFKSLIGKTHL-AGLIVDHFSTDAFDV 129
FLPP+ DL + +IE+ I L+VTRS P LR +F S + L L+VD F TDAFDV
Sbjct: 67 FLPPVDLTDLSSSTRIESRISLTVTRSNPELRKVFDSFVEGGRLPTALVVDLFGTDAFDV 126
Query: 130 AIEFDVPCYLFFPPSAMNLSFALQMPSLDQIIAGEYRDHPELIQIPGCIPIHGKELQEPT 189
A+EF VP Y+F+P +A LSF L +P LD+ ++ E+R+ E + +PGC+P+ GK+ +P
Sbjct: 127 AVEFHVPPYIFYPTTANVLSFFLHLPKLDETVSCEFRELTEPLMLPGCVPVAGKDFLDPA 186
Query: 190 QDRSDDAYKLLLHNCKRYRMADTIFLNSYPELEPEAIKALLEEEPGNPPVYPIGPLVR-- 249
QDR DDAYK LLHN KRY+ A+ I +N++ ELEP AIKAL E PPVYP+GPLV
Sbjct: 187 QDRKDDAYKWLLHNTKRYKEAEGILVNTFFELEPNAIKALQEPGLDKPPVYPVGPLVNIG 246
Query: 250 ----KDCKRADCLKWLDEQPKESVLFVSFGSRGALWRDQINELALGLEMSGQRFIWVVRK 309
K + ++CLKWLD QP SVL+VSFGS G L +Q+NELALGL S QRF+WV+R
Sbjct: 247 KQEAKQTEESECLKWLDNQPLGSVLYVSFGSGGTLTCEQLNELALGLADSEQRFLWVIRS 306
Query: 310 PKDETATTTLFNDQNEKEVSRFLPEGFIERTKNRGMVVPLWAPQVEVLRHESTGGFLSHC 369
P A ++ F+ ++ + FLP GF+ERTK RG V+P WAPQ +VL H STGGFL+HC
Sbjct: 307 PSG-IANSSYFDSHSQTDPLTFLPPGFLERTKKRGFVIPFWAPQAQVLAHPSTGGFLTHC 366
Query: 370 GWNSTLEAVVNGVPLIAWPAYAEQRMNAHMLTEGIKIALRPKKKEERGIVEKEEVAEVVK 429
GWNSTLE+VV+G+PLIAWP YAEQ+MNA +L+E I+ ALRP+ ++ G+V +EEVA VVK
Sbjct: 367 GWNSTLESVVSGIPLIAWPLYAEQKMNAVLLSEDIRAALRPRAGDD-GLVRREEVARVVK 426
Query: 430 SLMEGEEGKRVREKVKYLKNEAERALGEDGCSSKALSEIALKLKKTK 470
LMEGEEGK VR K+K LK A R L +DG S+KALS +ALK K K
Sbjct: 427 GLMEGEEGKGVRNKMKELKEAACRVLKDDGTSTKALSLVALKWKAHK 471
BLAST of CmaCh00G002840 vs. ExPASy Swiss-Prot
Match:
Q9LNI1 (UDP-glycosyltransferase 72B3 OS=Arabidopsis thaliana OX=3702 GN=UGT72B3 PE=2 SV=1)
HSP 1 Score: 483.8 bits (1244), Expect = 2.2e-135
Identity = 249/464 (53.66%), Postives = 328/464 (70.69%), Query Frame = 0
Query: 10 PHLVIVPSPGVGHLIPLVEFAKRLVSLHNFSVTIAIPSNIPPTKPQRAVLTDLPSTIQPL 69
PH+ I+PSPG+GHLIPLVE AKRL+ H F+VT IP + PP+K QR+VL LPS+I +
Sbjct: 7 PHVAIIPSPGIGHLIPLVELAKRLLDNHGFTVTFIIPGDSPPSKAQRSVLNSLPSSIASV 66
Query: 70 FLPPISFNDLPENPKIETIIILSVTRSVPFLRDLFKSLIGKTHL-AGLIVDHFSTDAFDV 129
FLPP +D+P +IET I L+VTRS P LR+LF SL + L A L+VD F TDAFDV
Sbjct: 67 FLPPADLSDVPSTARIETRISLTVTRSNPALRELFGSLSAEKRLPAVLVVDLFGTDAFDV 126
Query: 130 AIEFDVPCYLFFPPSAMNLSFALQMPSLDQIIAGEYRDHPELIQIPGCIPIHGKELQEPT 189
A EF V Y+F+ +A L+F L +P LD+ ++ E+R+ E + IPGC+PI GK+ +P
Sbjct: 127 AAEFHVSPYIFYASNANVLTFLLHLPKLDETVSCEFRELTEPVIIPGCVPITGKDFVDPC 186
Query: 190 QDRSDDAYKLLLHNCKRYRMADTIFLNSYPELEPEAIKALLEEEPGNPPVYPIGPLVRKD 249
QDR D++YK LLHN KR++ A+ I +NS+ +LEP IK + E P PPVY IGPLV
Sbjct: 187 QDRKDESYKWLLHNVKRFKEAEGILVNSFVDLEPNTIKIVQEPAPDKPPVYLIGPLVNSG 246
Query: 250 CKRAD------CLKWLDEQPKESVLFVSFGSRGALWRDQINELALGLEMSGQRFIWVVRK 309
AD CL WLD QP SVL+VSFGS G L +Q ELALGL SG+RF+WV+R
Sbjct: 247 SHDADVNDEYKCLNWLDNQPFGSVLYVSFGSGGTLTFEQFIELALGLAESGKRFLWVIRS 306
Query: 310 PKDETATTTLFNDQNEKEVSRFLPEGFIERTKNRGMVVPLWAPQVEVLRHESTGGFLSHC 369
P A+++ FN Q+ + FLP+GF++RTK +G+VV WAPQ ++L H S GGFL+HC
Sbjct: 307 PSG-IASSSYFNPQSRNDPFSFLPQGFLDRTKEKGLVVGSWAPQAQILTHTSIGGFLTHC 366
Query: 370 GWNSTLEAVVNGVPLIAWPAYAEQRMNAHMLTEGIKIALRPKKKEERGIVEKEEVAEVVK 429
GWNS+LE++VNGVPLIAWP YAEQ+MNA +L + + ALR + E+ G+V +EEVA VVK
Sbjct: 367 GWNSSLESIVNGVPLIAWPLYAEQKMNALLLVD-VGAALRARLGED-GVVGREEVARVVK 426
Query: 430 SLMEGEEGKRVREKVKYLKNEAERALGEDGCSSKALSEIALKLK 467
L+EGEEG VR+K+K LK + R L +DG S+K+L+E++LK K
Sbjct: 427 GLIEGEEGNAVRKKMKELKEGSVRVLRDDGFSTKSLNEVSLKWK 467
BLAST of CmaCh00G002840 vs. ExPASy Swiss-Prot
Match:
Q8W4C2 (UDP-glycosyltransferase 72B2 OS=Arabidopsis thaliana OX=3702 GN=UGT72B2 PE=2 SV=1)
HSP 1 Score: 482.3 bits (1240), Expect = 6.5e-135
Identity = 251/464 (54.09%), Postives = 320/464 (68.97%), Query Frame = 0
Query: 10 PHLVIVPSPGVGHLIPLVEFAKRLVSLHNFSVTIAIPSNIPPTKPQRAVLTDLPSTIQPL 69
PH+ I+PSPG+GHLIP VE AKRLV F+VT+ I P+K QR+VL LPS+I +
Sbjct: 7 PHIAIMPSPGMGHLIPFVELAKRLVQHDCFTVTMIISGETSPSKAQRSVLNSLPSSIASV 66
Query: 70 FLPPISFNDLPENPKIETIIILSVTRSVPFLRDLFKSLIGKTHL-AGLIVDHFSTDAFDV 129
FLPP +D+P +IET +L++TRS P LR+LF SL K L A L+VD F DAFDV
Sbjct: 67 FLPPADLSDVPSTARIETRAMLTMTRSNPALRELFGSLSTKKSLPAVLVVDMFGADAFDV 126
Query: 130 AIEFDVPCYLFFPPSAMNLSFALQMPSLDQIIAGEYRDHPELIQIPGCIPIHGKELQEPT 189
A++F V Y+F+ +A LSF L +P LD+ ++ E+R E ++IPGC+PI GK+ +
Sbjct: 127 AVDFHVSPYIFYASNANVLSFFLHLPKLDKTVSCEFRYLTEPLKIPGCVPITGKDFLDTV 186
Query: 190 QDRSDDAYKLLLHNCKRYRMADTIFLNSYPELEPEAIKALLEEEPGNPPVYPIGPLVRKD 249
QDR+DDAYKLLLHN KRY+ A I +NS+ +LE AIKAL E P P VYPIGPLV
Sbjct: 187 QDRNDDAYKLLLHNTKRYKEAKGILVNSFVDLESNAIKALQEPAPDKPTVYPIGPLVNTS 246
Query: 250 CKRAD------CLKWLDEQPKESVLFVSFGSRGALWRDQINELALGLEMSGQRFIWVVRK 309
+ CL WLD QP SVL++SFGS G L +Q NELA+GL SG+RFIWV+R
Sbjct: 247 SSNVNLEDKFGCLSWLDNQPFGSVLYISFGSGGTLTCEQFNELAIGLAESGKRFIWVIRS 306
Query: 310 PKDETATTTLFNDQNEKEVSRFLPEGFIERTKNRGMVVPLWAPQVEVLRHESTGGFLSHC 369
P E +++ FN +E + FLP GF++RTK +G+VVP WAPQV++L H ST GFL+HC
Sbjct: 307 P-SEIVSSSYFNPHSETDPFSFLPIGFLDRTKEKGLVVPSWAPQVQILAHPSTCGFLTHC 366
Query: 370 GWNSTLEAVVNGVPLIAWPAYAEQRMNAHMLTEGIKIALRPKKKEERGIVEKEEVAEVVK 429
GWNSTLE++VNGVPLIAWP +AEQ+MN +L E + ALR E+ GIV +EEV VVK
Sbjct: 367 GWNSTLESIVNGVPLIAWPLFAEQKMNTLLLVEDVGAALRIHAGED-GIVRREEVVRVVK 426
Query: 430 SLMEGEEGKRVREKVKYLKNEAERALGEDGCSSKALSEIALKLK 467
+LMEGEEGK + KVK LK R LG+DG SSK+ E+ LK K
Sbjct: 427 ALMEGEEGKAIGNKVKELKEGVVRVLGDDGLSSKSFGEVLLKWK 468
BLAST of CmaCh00G002840 vs. ExPASy Swiss-Prot
Match:
Q40287 (Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta OX=3983 GN=GT5 PE=2 SV=1)
HSP 1 Score: 337.8 bits (865), Expect = 2.0e-91
Identity = 188/481 (39.09%), Postives = 288/481 (59.88%), Query Frame = 0
Query: 1 MEALHVDSQPHLVIVPSPGVGHLIPLVEFAKRLVSLHNFSVTI-AIPSNIPPTKPQRAVL 60
M + ++S+PH+V++ SPG+GHLIP++E KR+V+L NF VTI + S+ +PQ
Sbjct: 1 MGSTDLNSKPHIVLLSSPGLGHLIPVLELGKRIVTLCNFDVTIFMVGSDTSAAEPQVLRS 60
Query: 61 TDLPSTIQPLFLPPISFNDL--PENPKIETIIILSVTRSVPFLRDLFKSLIG--KTHLAG 120
P + + LPP + + L PE + +L + +R F++ + K A
Sbjct: 61 AMTPKLCEIIQLPPPNISCLIDPEATVCTRLFVL-----MREIRPAFRAAVSALKFRPAA 120
Query: 121 LIVDHFSTDAFDVAIEFDVPCYLFFPPSAMNLSFALQMPSLDQIIAGEYRDHPELIQIPG 180
+IVD F T++ +VA E + Y++ +A L+ + +P LD+ + GE+ E ++IPG
Sbjct: 121 IIVDLFGTESLEVAKELGIAKYVYIASNAWFLALTIYVPILDKEVEGEFVLQKEPMKIPG 180
Query: 181 CIPIHGKELQEPTQDRSDDAYKLLLHNCKRYRMADTIFLNSYPELEPEAIKALLEEE--- 240
C P+ +E+ +P DR++ Y AD I +N++ LEP AL + +
Sbjct: 181 CRPVRTEEVVDPMLDRTNQQYSEYFRLGIEIPTADGILMNTWEALEPTTFGALRDVKFLG 240
Query: 241 -PGNPPVYPIGPLVRK--DC-KRADCLKWLDEQPKESVLFVSFGSRGALWRDQINELALG 300
PV+PIGPL R+ C + L WLD+QPKESV++VSFGS G L +Q+ ELA G
Sbjct: 241 RVAKVPVFPIGPLRRQAGPCGSNCELLDWLDQQPKESVVYVSFGSGGTLSLEQMIELAWG 300
Query: 301 LEMSGQRFIWVVRKPKDETATTTLFND-QNEKEVSRFLPEGFIERTKNRGMVVPLWAPQV 360
LE S QRFIWVVR+P +T F ++S + PEGF+ R +N G+VVP W+PQ+
Sbjct: 301 LERSQQRFIWVVRQPTVKTGDAAFFTQGDGADDMSGYFPEGFLTRIQNVGLVVPQWSPQI 360
Query: 361 EVLRHESTGGFLSHCGWNSTLEAVVNGVPLIAWPAYAEQRMNAHMLTEGIKIALRPKKKE 420
++ H S G FLSHCGWNS LE++ GVP+IAWP YAEQRMNA +LTE + +A+RPK
Sbjct: 361 HIMSHPSVGVFLSHCGWNSVLESITAGVPIIAWPIYAEQRMNATLLTEELGVAVRPKNLP 420
Query: 421 ERGIVEKEEVAEVVKSLMEGEEGKRVREKVKYLKNEAERALGEDGCSSKALSEIALKLKK 469
+ +V++EE+ +++ +M EEG +R++V+ LK+ E+AL E G S +S + + +K
Sbjct: 421 AKEVVKREEIERMIRRIMVDEEGSEIRKRVRELKDSGEKALNEGGSSFNYMSALGNEWEK 476
BLAST of CmaCh00G002840 vs. TAIR 10
Match:
AT4G01070.1 (UDP-Glycosyltransferase superfamily protein )
HSP 1 Score: 515.8 bits (1327), Expect = 3.7e-146
Identity = 263/467 (56.32%), Postives = 334/467 (71.52%), Query Frame = 0
Query: 10 PHLVIVPSPGVGHLIPLVEFAKRLVSLHNFSVTIAIPSNIPPTKPQRAVLTDLPSTIQPL 69
PH+ I+PSPG+GHLIPLVEFAKRLV LH +VT I PP+K QR VL LPS+I +
Sbjct: 7 PHVAIIPSPGMGHLIPLVEFAKRLVHLHGLTVTFVIAGEGPPSKAQRTVLDSLPSSISSV 66
Query: 70 FLPPISFNDLPENPKIETIIILSVTRSVPFLRDLFKSLIGKTHL-AGLIVDHFSTDAFDV 129
FLPP+ DL + +IE+ I L+VTRS P LR +F S + L L+VD F TDAFDV
Sbjct: 67 FLPPVDLTDLSSSTRIESRISLTVTRSNPELRKVFDSFVEGGRLPTALVVDLFGTDAFDV 126
Query: 130 AIEFDVPCYLFFPPSAMNLSFALQMPSLDQIIAGEYRDHPELIQIPGCIPIHGKELQEPT 189
A+EF VP Y+F+P +A LSF L +P LD+ ++ E+R+ E + +PGC+P+ GK+ +P
Sbjct: 127 AVEFHVPPYIFYPTTANVLSFFLHLPKLDETVSCEFRELTEPLMLPGCVPVAGKDFLDPA 186
Query: 190 QDRSDDAYKLLLHNCKRYRMADTIFLNSYPELEPEAIKALLEEEPGNPPVYPIGPLVR-- 249
QDR DDAYK LLHN KRY+ A+ I +N++ ELEP AIKAL E PPVYP+GPLV
Sbjct: 187 QDRKDDAYKWLLHNTKRYKEAEGILVNTFFELEPNAIKALQEPGLDKPPVYPVGPLVNIG 246
Query: 250 ----KDCKRADCLKWLDEQPKESVLFVSFGSRGALWRDQINELALGLEMSGQRFIWVVRK 309
K + ++CLKWLD QP SVL+VSFGS G L +Q+NELALGL S QRF+WV+R
Sbjct: 247 KQEAKQTEESECLKWLDNQPLGSVLYVSFGSGGTLTCEQLNELALGLADSEQRFLWVIRS 306
Query: 310 PKDETATTTLFNDQNEKEVSRFLPEGFIERTKNRGMVVPLWAPQVEVLRHESTGGFLSHC 369
P A ++ F+ ++ + FLP GF+ERTK RG V+P WAPQ +VL H STGGFL+HC
Sbjct: 307 PSG-IANSSYFDSHSQTDPLTFLPPGFLERTKKRGFVIPFWAPQAQVLAHPSTGGFLTHC 366
Query: 370 GWNSTLEAVVNGVPLIAWPAYAEQRMNAHMLTEGIKIALRPKKKEERGIVEKEEVAEVVK 429
GWNSTLE+VV+G+PLIAWP YAEQ+MNA +L+E I+ ALRP+ ++ G+V +EEVA VVK
Sbjct: 367 GWNSTLESVVSGIPLIAWPLYAEQKMNAVLLSEDIRAALRPRAGDD-GLVRREEVARVVK 426
Query: 430 SLMEGEEGKRVREKVKYLKNEAERALGEDGCSSKALSEIALKLKKTK 470
LMEGEEGK VR K+K LK A R L +DG S+KALS +ALK K K
Sbjct: 427 GLMEGEEGKGVRNKMKELKEAACRVLKDDGTSTKALSLVALKWKAHK 471
BLAST of CmaCh00G002840 vs. TAIR 10
Match:
AT1G01420.1 (UDP-glucosyl transferase 72B3 )
HSP 1 Score: 483.8 bits (1244), Expect = 1.6e-136
Identity = 249/464 (53.66%), Postives = 328/464 (70.69%), Query Frame = 0
Query: 10 PHLVIVPSPGVGHLIPLVEFAKRLVSLHNFSVTIAIPSNIPPTKPQRAVLTDLPSTIQPL 69
PH+ I+PSPG+GHLIPLVE AKRL+ H F+VT IP + PP+K QR+VL LPS+I +
Sbjct: 7 PHVAIIPSPGIGHLIPLVELAKRLLDNHGFTVTFIIPGDSPPSKAQRSVLNSLPSSIASV 66
Query: 70 FLPPISFNDLPENPKIETIIILSVTRSVPFLRDLFKSLIGKTHL-AGLIVDHFSTDAFDV 129
FLPP +D+P +IET I L+VTRS P LR+LF SL + L A L+VD F TDAFDV
Sbjct: 67 FLPPADLSDVPSTARIETRISLTVTRSNPALRELFGSLSAEKRLPAVLVVDLFGTDAFDV 126
Query: 130 AIEFDVPCYLFFPPSAMNLSFALQMPSLDQIIAGEYRDHPELIQIPGCIPIHGKELQEPT 189
A EF V Y+F+ +A L+F L +P LD+ ++ E+R+ E + IPGC+PI GK+ +P
Sbjct: 127 AAEFHVSPYIFYASNANVLTFLLHLPKLDETVSCEFRELTEPVIIPGCVPITGKDFVDPC 186
Query: 190 QDRSDDAYKLLLHNCKRYRMADTIFLNSYPELEPEAIKALLEEEPGNPPVYPIGPLVRKD 249
QDR D++YK LLHN KR++ A+ I +NS+ +LEP IK + E P PPVY IGPLV
Sbjct: 187 QDRKDESYKWLLHNVKRFKEAEGILVNSFVDLEPNTIKIVQEPAPDKPPVYLIGPLVNSG 246
Query: 250 CKRAD------CLKWLDEQPKESVLFVSFGSRGALWRDQINELALGLEMSGQRFIWVVRK 309
AD CL WLD QP SVL+VSFGS G L +Q ELALGL SG+RF+WV+R
Sbjct: 247 SHDADVNDEYKCLNWLDNQPFGSVLYVSFGSGGTLTFEQFIELALGLAESGKRFLWVIRS 306
Query: 310 PKDETATTTLFNDQNEKEVSRFLPEGFIERTKNRGMVVPLWAPQVEVLRHESTGGFLSHC 369
P A+++ FN Q+ + FLP+GF++RTK +G+VV WAPQ ++L H S GGFL+HC
Sbjct: 307 PSG-IASSSYFNPQSRNDPFSFLPQGFLDRTKEKGLVVGSWAPQAQILTHTSIGGFLTHC 366
Query: 370 GWNSTLEAVVNGVPLIAWPAYAEQRMNAHMLTEGIKIALRPKKKEERGIVEKEEVAEVVK 429
GWNS+LE++VNGVPLIAWP YAEQ+MNA +L + + ALR + E+ G+V +EEVA VVK
Sbjct: 367 GWNSSLESIVNGVPLIAWPLYAEQKMNALLLVD-VGAALRARLGED-GVVGREEVARVVK 426
Query: 430 SLMEGEEGKRVREKVKYLKNEAERALGEDGCSSKALSEIALKLK 467
L+EGEEG VR+K+K LK + R L +DG S+K+L+E++LK K
Sbjct: 427 GLIEGEEGNAVRKKMKELKEGSVRVLRDDGFSTKSLNEVSLKWK 467
BLAST of CmaCh00G002840 vs. TAIR 10
Match:
AT1G01390.1 (UDP-Glycosyltransferase superfamily protein )
HSP 1 Score: 482.3 bits (1240), Expect = 4.6e-136
Identity = 251/464 (54.09%), Postives = 320/464 (68.97%), Query Frame = 0
Query: 10 PHLVIVPSPGVGHLIPLVEFAKRLVSLHNFSVTIAIPSNIPPTKPQRAVLTDLPSTIQPL 69
PH+ I+PSPG+GHLIP VE AKRLV F+VT+ I P+K QR+VL LPS+I +
Sbjct: 7 PHIAIMPSPGMGHLIPFVELAKRLVQHDCFTVTMIISGETSPSKAQRSVLNSLPSSIASV 66
Query: 70 FLPPISFNDLPENPKIETIIILSVTRSVPFLRDLFKSLIGKTHL-AGLIVDHFSTDAFDV 129
FLPP +D+P +IET +L++TRS P LR+LF SL K L A L+VD F DAFDV
Sbjct: 67 FLPPADLSDVPSTARIETRAMLTMTRSNPALRELFGSLSTKKSLPAVLVVDMFGADAFDV 126
Query: 130 AIEFDVPCYLFFPPSAMNLSFALQMPSLDQIIAGEYRDHPELIQIPGCIPIHGKELQEPT 189
A++F V Y+F+ +A LSF L +P LD+ ++ E+R E ++IPGC+PI GK+ +
Sbjct: 127 AVDFHVSPYIFYASNANVLSFFLHLPKLDKTVSCEFRYLTEPLKIPGCVPITGKDFLDTV 186
Query: 190 QDRSDDAYKLLLHNCKRYRMADTIFLNSYPELEPEAIKALLEEEPGNPPVYPIGPLVRKD 249
QDR+DDAYKLLLHN KRY+ A I +NS+ +LE AIKAL E P P VYPIGPLV
Sbjct: 187 QDRNDDAYKLLLHNTKRYKEAKGILVNSFVDLESNAIKALQEPAPDKPTVYPIGPLVNTS 246
Query: 250 CKRAD------CLKWLDEQPKESVLFVSFGSRGALWRDQINELALGLEMSGQRFIWVVRK 309
+ CL WLD QP SVL++SFGS G L +Q NELA+GL SG+RFIWV+R
Sbjct: 247 SSNVNLEDKFGCLSWLDNQPFGSVLYISFGSGGTLTCEQFNELAIGLAESGKRFIWVIRS 306
Query: 310 PKDETATTTLFNDQNEKEVSRFLPEGFIERTKNRGMVVPLWAPQVEVLRHESTGGFLSHC 369
P E +++ FN +E + FLP GF++RTK +G+VVP WAPQV++L H ST GFL+HC
Sbjct: 307 P-SEIVSSSYFNPHSETDPFSFLPIGFLDRTKEKGLVVPSWAPQVQILAHPSTCGFLTHC 366
Query: 370 GWNSTLEAVVNGVPLIAWPAYAEQRMNAHMLTEGIKIALRPKKKEERGIVEKEEVAEVVK 429
GWNSTLE++VNGVPLIAWP +AEQ+MN +L E + ALR E+ GIV +EEV VVK
Sbjct: 367 GWNSTLESIVNGVPLIAWPLFAEQKMNTLLLVEDVGAALRIHAGED-GIVRREEVVRVVK 426
Query: 430 SLMEGEEGKRVREKVKYLKNEAERALGEDGCSSKALSEIALKLK 467
+LMEGEEGK + KVK LK R LG+DG SSK+ E+ LK K
Sbjct: 427 ALMEGEEGKAIGNKVKELKEGVVRVLGDDGLSSKSFGEVLLKWK 468
BLAST of CmaCh00G002840 vs. TAIR 10
Match:
AT4G01070.2 (UDP-Glycosyltransferase superfamily protein )
HSP 1 Score: 350.9 bits (899), Expect = 1.6e-96
Identity = 180/343 (52.48%), Postives = 232/343 (67.64%), Query Frame = 0
Query: 10 PHLVIVPSPGVGHLIPLVEFAKRLVSLHNFSVTIAIPSNIPPTKPQRAVLTDLPSTIQPL 69
PH+ I+PSPG+GHLIPLVEFAKRLV LH +VT I PP+K QR VL LPS+I +
Sbjct: 7 PHVAIIPSPGMGHLIPLVEFAKRLVHLHGLTVTFVIAGEGPPSKAQRTVLDSLPSSISSV 66
Query: 70 FLPPISFNDLPENPKIETIIILSVTRSVPFLRDLFKSLIGKTHL-AGLIVDHFSTDAFDV 129
FLPP+ DL + +IE+ I L+VTRS P LR +F S + L L+VD F TDAFDV
Sbjct: 67 FLPPVDLTDLSSSTRIESRISLTVTRSNPELRKVFDSFVEGGRLPTALVVDLFGTDAFDV 126
Query: 130 AIEFDVPCYLFFPPSAMNLSFALQMPSLDQIIAGEYRDHPELIQIPGCIPIHGKELQEPT 189
A+EF VP Y+F+P +A LSF L +P LD+ ++ E+R+ E + +PGC+P+ GK+ +P
Sbjct: 127 AVEFHVPPYIFYPTTANVLSFFLHLPKLDETVSCEFRELTEPLMLPGCVPVAGKDFLDPA 186
Query: 190 QDRSDDAYKLLLHNCKRYRMADTIFLNSYPELEPEAIKALLEEEPGNPPVYPIGPLVR-- 249
QDR DDAYK LLHN KRY+ A+ I +N++ ELEP AIKAL E PPVYP+GPLV
Sbjct: 187 QDRKDDAYKWLLHNTKRYKEAEGILVNTFFELEPNAIKALQEPGLDKPPVYPVGPLVNIG 246
Query: 250 ----KDCKRADCLKWLDEQPKESVLFVSFGSRGALWRDQINELALGLEMSGQRFIWVVRK 309
K + ++CLKWLD QP SVL+VSFGS G L +Q+NELALGL S QRF+WV+R
Sbjct: 247 KQEAKQTEESECLKWLDNQPLGSVLYVSFGSGGTLTCEQLNELALGLADSEQRFLWVIRS 306
Query: 310 PKDETATTTLFNDQNEKEVSRFLPEGFIERTKNRGMVVPLWAP 346
P A ++ F+ ++ + FLP GF+ERTK R V W P
Sbjct: 307 PSG-IANSSYFDSHSQTDPLTFLPPGFLERTKKR--VRAKWQP 346
BLAST of CmaCh00G002840 vs. TAIR 10
Match:
AT2G18570.1 (UDP-Glycosyltransferase superfamily protein )
HSP 1 Score: 322.8 bits (826), Expect = 4.6e-88
Identity = 190/467 (40.69%), Postives = 279/467 (59.74%), Query Frame = 0
Query: 9 QPHLVIVPSPGVGHLIPLVEFAKRLVSLHNFSVTI-AIPS-NIPPTKPQRAVLTDLPSTI 68
QPH ++V SPG+GHLIP++E RL S+ N VTI A+ S + PT+ + +
Sbjct: 3 QPHALLVASPGLGHLIPILELGNRLSSVLNIHVTILAVTSGSSSPTETEAIHAAAARTIC 62
Query: 69 QPLFLPPISFNDLPE-NPKIETIIILSVTRSVPFLRDLFKSLIGKTHLAGLIVDHFSTDA 128
Q +P + ++L E + I T +++ + P +RD K + K + +IVD T+
Sbjct: 63 QITEIPSVDVDNLVEPDATIFTKMVVKMRAMKPAVRDAVKLMKRKPTV--MIVDFLGTEL 122
Query: 129 FDVAIEFDVPC-YLFFPPSAMNLSFALQMPSLDQIIAGEYRDHPELIQIPGCIPIHGKEL 188
VA + + Y++ P A L+ + +P LD ++ GEY D E ++IPGC P+ KEL
Sbjct: 123 MSVADDVGMTAKYVYVPTHAWFLAVMVYLPVLDTVVEGEYVDIKEPLKIPGCKPVGPKEL 182
Query: 189 QEPTQDRSDDAYKLLLHNCKRYRMADTIFLNSYPELEPEAIKALLEEEP----GNPPVYP 248
E DRS YK + M+D + +N++ EL+ + AL E+E PVYP
Sbjct: 183 METMLDRSGQQYKECVRAGLEVPMSDGVLVNTWEELQGNTLAALREDEELSRVMKVPVYP 242
Query: 249 IGPLVRKD---CKRADCLKWLDEQPKESVLFVSFGSRGALWRDQINELALGLEMSGQRFI 308
IGP+VR + K +WLDEQ + SV+FV GS G L +Q ELALGLE+SGQRF+
Sbjct: 243 IGPIVRTNQHVDKPNSIFEWLDEQRERSVVFVCLGSGGTLTFEQTVELALGLELSGQRFV 302
Query: 309 WVVRKPKDETATTTLFNDQNEKEVSRFLPEGFIERTKNRGMVVPLWAPQVEVLRHESTGG 368
WV+R+P + ++++VS LPEGF++RT+ G+VV WAPQVE+L H S GG
Sbjct: 303 WVLRRPASYLGAIS----SDDEQVSASLPEGFLDRTRGVGIVVTQWAPQVEILSHRSIGG 362
Query: 369 FLSHCGWNSTLEAVVNGVPLIAWPAYAEQRMNAHMLTEGIKIALRPKKKEERGIVEKEEV 428
FLSHCGW+S LE++ GVP+IAWP YAEQ MNA +LTE I +A+R + ++ +EEV
Sbjct: 363 FLSHCGWSSALESLTKGVPIIAWPLYAEQWMNATLLTEEIGVAVRTSELPSERVIGREEV 422
Query: 429 AEVVKSLM--EGEEGKRVREKVKYLKNEAERALGEDGCSSKALSEIA 463
A +V+ +M E EEG+++R K + ++ +ERA +DG S +L E A
Sbjct: 423 ASLVRKIMAEEDEEGQKIRAKAEEVRVSSERAWSKDGSSYNSLFEWA 463
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q9AR73 | 4.9e-159 | 59.18 | Hydroquinone glucosyltransferase OS=Rauvolfia serpentina OX=4060 GN=AS PE=1 SV=1 | [more] |
Q9M156 | 5.3e-145 | 56.32 | UDP-glycosyltransferase 72B1 OS=Arabidopsis thaliana OX=3702 GN=UGT72B1 PE=1 SV=... | [more] |
Q9LNI1 | 2.2e-135 | 53.66 | UDP-glycosyltransferase 72B3 OS=Arabidopsis thaliana OX=3702 GN=UGT72B3 PE=2 SV=... | [more] |
Q8W4C2 | 6.5e-135 | 54.09 | UDP-glycosyltransferase 72B2 OS=Arabidopsis thaliana OX=3702 GN=UGT72B2 PE=2 SV=... | [more] |
Q40287 | 2.0e-91 | 39.09 | Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta OX=3983 GN=GT5 PE=2... | [more] |