Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGGCTCTGTAACAAACTCCGGTGAGTACCATATTCTCATGCTACCGTTCATGGCCCANGGCCATCTCATCCCCTTCCTCGAGCTCGCTAATTTCATCCACCGGAAATCCTCTGTTTTCACCATCACCATCGCCTGCACCCCCTCCAACATCCAATACCTCCGCTCCGCCGCCGCCGACTCCAAAATCCGCCTCGCTGAACTCTACTACTCGAGCTCCAACCATGGCCTTCCGCCAAACACCGAGAGTACTGAGAATCTCCCTTTAAATCAAATCGACACCCTGTTTCATTCCTCGACCGCCCTCGAACTTCCCCTTCGGGAACTGATCTCCGACCTCGTCCAGAAGGAAGGCAACCCGCCGCTGTGTATAATCTCCGACGTGTTCTTAGGTTGGTCGGTAGCCGTCGCCAGGAGCTTCAACATTCCAGTTTTCAGTTTCACTACATGCGGTGCTTATGGAACTCTGGCTTACGTCTCTCTCTGGTTGAATCTCCCCCACCGGTCATCCACCAGCGATGAGTTTTCTCTCCCGGGGTTCCCGGAAAATTGCCGTTTCCACCGCTCCCAGCTCCACCGGTTCTTACTCGCCGCCGACGGCACCGATTCCTGGTCCAGGTATTTCCGGCCGCAAATCTCCTATTCTTTGAGCTCAGATGGGTGGCTCTGTAACACCGTCGAAGAAGTCGAGTCATTCGGATTGAAACATTTGAGGGATTACATAAAATTGCCCGTCTGGGCAATTGGCCCGCTTCTCCTCCAAACCAGCTCCGGCGGCCGTCGCCGGTGGGGGAAAGAGAAAGATTCCGGCGTGGGTTTAGAAACTTATATGAATTGGTTGAATTCTCACCGGAAAAATTCGGTTCTGTACATTTCCTTCGGATCCCAGAACACAATTACCGAGAGCCAGATGATGGAACTGGCTTACGGATTGGAAGAAAGCGGGAGTGCATTCATATGGGTGGTGAGGCCGCCATCGGGACACGACATGAAAGCAGAGTTCAGAGCTCATCAATGGCTGCCTGAGCATTTCGAGGATCGAATGAAGGAGACGAATAGAGGATTGGTGATCAGAAACTGGGCACCCCAGTTGGAGATTCTAGCGCATGAATCGGTGGGAGCGTTTTTGAGCCATTGTGGGTGGAATTCGACGGTGGAAAGCTTGAGCCAGGGAGTGCCGGTGATCGGGTGGCCGATGGCAGCAGAGCAGTCGTACAATTCAAAGATGCTGGTGGAGGAAATGGGGATTGGAGTGGAGCTGACGAGGGGGAAAGAGAGTGAGATTAAGAGAGGAAGAGTGAAGGAGGTGATAGAAATGGTGATGGGGGAAGGTGGGGAAGGGGAACAGATGAGGAACAAGGCCGCCATTGTTAAGGACAAGATGAGAGCTGCAGTTATGGACGAACAAAAGGGTTCCTCTAACACCAACTTGGTGGACTTTCTTGAATTCATTCGAGCCAAACAGAAGAGTCTAAACAAAATCAAATAA
mRNA sequence
ATGGGCTCTGTAACAAACTCCGGTGAGTACCATATTCTCATGCTACCGTTCATGGCCCANGGCCATCTCATCCCCTTCCTCGAGCTCGCTAATTTCATCCACCGGAAATCCTCTGTTTTCACCATCACCATCGCCTGCACCCCCTCCAACATCCAATACCTCCGCTCCGCCGCCGCCGACTCCAAAATCCGCCTCGCTGAACTCTACTACTCGAGCTCCAACCATGGCCTTCCGCCAAACACCGAGAGTACTGAGAATCTCCCTTTAAATCAAATCGACACCCTGTTTCATTCCTCGACCGCCCTCGAACTTCCCCTTCGGGAACTGATCTCCGACCTCGTCCAGAAGGAAGGCAACCCGCCGCTGTGTATAATCTCCGACGTGTTCTTAGGTTGGTCGGTAGCCGTCGCCAGGAGCTTCAACATTCCAGTTTTCAGTTTCACTACATGCGGTGCTTATGGAACTCTGGCTTACGTCTCTCTCTGGTTGAATCTCCCCCACCGGTCATCCACCAGCGATGAGTTTTCTCTCCCGGGGTTCCCGGAAAATTGCCGTTTCCACCGCTCCCAGCTCCACCGGTTCTTACTCGCCGCCGACGGCACCGATTCCTGGTCCAGGTATTTCCGGCCGCAAATCTCCTATTCTTTGAGCTCAGATGGGTGGCTCTGTAACACCGTCGAAGAAGTCGAGTCATTCGGATTGAAACATTTGAGGGATTACATAAAATTGCCCGTCTGGGCAATTGGCCCGCTTCTCCTCCAAACCAGCTCCGGCGGCCGTCGCCGGTGGGGGAAAGAGAAAGATTCCGGCGTGGGTTTAGAAACTTATATGAATTGGTTGAATTCTCACCGGAAAAATTCGGTTCTGTACATTTCCTTCGGATCCCAGAACACAATTACCGAGAGCCAGATGATGGAACTGGCTTACGGATTGGAAGAAAGCGGGAGTGCATTCATATGGGTGGTGAGGCCGCCATCGGGACACGACATGAAAGCAGAGTTCAGAGCTCATCAATGGCTGCCTGAGCATTTCGAGGATCGAATGAAGGAGACGAATAGAGGATTGGTGATCAGAAACTGGGCACCCCAGTTGGAGATTCTAGCGCATGAATCGGTGGGAGCGTTTTTGAGCCATTGTGGGTGGAATTCGACGGTGGAAAGCTTGAGCCAGGGAGTGCCGGTGATCGGGTGGCCGATGGCAGCAGAGCAGTCGTACAATTCAAAGATGCTGGTGGAGGAAATGGGGATTGGAGTGGAGCTGACGAGGGGGAAAGAGAGTGAGATTAAGAGAGGAAGAGTGAAGGAGGTGATAGAAATGGTGATGGGGGAAGGTGGGGAAGGGGAACAGATGAGGAACAAGGCCGCCATTGTTAAGGACAAGATGAGAGCTGCAGTTATGGACGAACAAAAGGGTTCCTCTAACACCAACTTGGTGGACTTTCTTGAATTCATTCGAGCCAAACAGAAGAGTCTAAACAAAATCAAATAA
Coding sequence (CDS)
ATGGGCTCTGTAACAAACTCCGGTGAGTACCATATTCTCATGCTACCGTTCATGGCCCANGGCCATCTCATCCCCTTCCTCGAGCTCGCTAATTTCATCCACCGGAAATCCTCTGTTTTCACCATCACCATCGCCTGCACCCCCTCCAACATCCAATACCTCCGCTCCGCCGCCGCCGACTCCAAAATCCGCCTCGCTGAACTCTACTACTCGAGCTCCAACCATGGCCTTCCGCCAAACACCGAGAGTACTGAGAATCTCCCTTTAAATCAAATCGACACCCTGTTTCATTCCTCGACCGCCCTCGAACTTCCCCTTCGGGAACTGATCTCCGACCTCGTCCAGAAGGAAGGCAACCCGCCGCTGTGTATAATCTCCGACGTGTTCTTAGGTTGGTCGGTAGCCGTCGCCAGGAGCTTCAACATTCCAGTTTTCAGTTTCACTACATGCGGTGCTTATGGAACTCTGGCTTACGTCTCTCTCTGGTTGAATCTCCCCCACCGGTCATCCACCAGCGATGAGTTTTCTCTCCCGGGGTTCCCGGAAAATTGCCGTTTCCACCGCTCCCAGCTCCACCGGTTCTTACTCGCCGCCGACGGCACCGATTCCTGGTCCAGGTATTTCCGGCCGCAAATCTCCTATTCTTTGAGCTCAGATGGGTGGCTCTGTAACACCGTCGAAGAAGTCGAGTCATTCGGATTGAAACATTTGAGGGATTACATAAAATTGCCCGTCTGGGCAATTGGCCCGCTTCTCCTCCAAACCAGCTCCGGCGGCCGTCGCCGGTGGGGGAAAGAGAAAGATTCCGGCGTGGGTTTAGAAACTTATATGAATTGGTTGAATTCTCACCGGAAAAATTCGGTTCTGTACATTTCCTTCGGATCCCAGAACACAATTACCGAGAGCCAGATGATGGAACTGGCTTACGGATTGGAAGAAAGCGGGAGTGCATTCATATGGGTGGTGAGGCCGCCATCGGGACACGACATGAAAGCAGAGTTCAGAGCTCATCAATGGCTGCCTGAGCATTTCGAGGATCGAATGAAGGAGACGAATAGAGGATTGGTGATCAGAAACTGGGCACCCCAGTTGGAGATTCTAGCGCATGAATCGGTGGGAGCGTTTTTGAGCCATTGTGGGTGGAATTCGACGGTGGAAAGCTTGAGCCAGGGAGTGCCGGTGATCGGGTGGCCGATGGCAGCAGAGCAGTCGTACAATTCAAAGATGCTGGTGGAGGAAATGGGGATTGGAGTGGAGCTGACGAGGGGGAAAGAGAGTGAGATTAAGAGAGGAAGAGTGAAGGAGGTGATAGAAATGGTGATGGGGGAAGGTGGGGAAGGGGAACAGATGAGGAACAAGGCCGCCATTGTTAAGGACAAGATGAGAGCTGCAGTTATGGACGAACAAAAGGGTTCCTCTAACACCAACTTGGTGGACTTTCTTGAATTCATTCGAGCCAAACAGAAGAGTCTAAACAAAATCAAATAA
Protein sequence
MGSVTNSGEYHILMLPFMAXGHLIPFLELANFIHRKSSVFTITIACTPSNIQYLRSAAADSKIRLAELYYSSSNHGLPPNTESTENLPLNQIDTLFHSSTALELPLRELISDLVQKEGNPPLCIISDVFLGWSVAVARSFNIPVFSFTTCGAYGTLAYVSLWLNLPHRSSTSDEFSLPGFPENCRFHRSQLHRFLLAADGTDSWSRYFRPQISYSLSSDGWLCNTVEEVESFGLKHLRDYIKLPVWAIGPLLLQTSSGGRRRWGKEKDSGVGLETYMNWLNSHRKNSVLYISFGSQNTITESQMMELAYGLEESGSAFIWVVRPPSGHDMKAEFRAHQWLPEHFEDRMKETNRGLVIRNWAPQLEILAHESVGAFLSHCGWNSTVESLSQGVPVIGWPMAAEQSYNSKMLVEEMGIGVELTRGKESEIKRGRVKEVIEMVMGEGGEGEQMRNKAAIVKDKMRAAVMDEQKGSSNTNLVDFLEFIRAKQKSLNKIK
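The protein sequence above corresponds to a translation of the CDS. The following is a minimal sketch of that translation, assuming the standard genetic code; the `translate` helper and the `cds_start` variable are hypothetical names introduced here for illustration, not part of the database. It also shows why the protein contains an `X` at position 20: the codon `CAN` contains an ambiguous base and cannot be resolved to a single amino acid.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing an ambiguous base (such as N) are emitted as 'X'.
    """
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 63 bases of the CDS listed above (21 codons).
cds_start = "ATGGGCTCTGTAACAAACTCCGGTGAGTACCATATTCTCATGCTACCGTTCATGGCCCANGGC"
print(translate(cds_start))  # MGSVTNSGEYHILMLPFMAXG
```

Passing the full CDS string to `translate` in the same way should reproduce the complete protein sequence, ending before the terminal `TAA` stop codon.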
Homology
BLAST of CmaCh06G016240 vs. ExPASy Swiss-Prot
Match:
Q9LXV0 (UDP-glycosyltransferase 92A1 OS=Arabidopsis thaliana OX=3702 GN=UGT92A1 PE=2 SV=1)
HSP 1 Score: 385.6 bits (989), Expect = 8.6e-106
Identity = 203/487 (41.68%), Positives = 302/487 (62.01%), Query Frame = 0
Query: 12 ILMLPFMAXGHLIPFLELA-----NFIHRKSSVFTITIACTPSNIQYLRS-AAADSKIRL 71
I+M PFM GH+IPF+ LA I +++ TI++ TPSNI +RS +S I L
Sbjct: 11 IVMFPFMGQGHIIPFVALALRLEKIMIMNRANKTTISMINTPSNIPKIRSNLPPESSISL 70
Query: 72 AELYYSSSNHGLPPNTESTENLPLNQIDTLFHSSTALELPLRELISDLVQKEGNPPLCII 131
EL ++SS+HGLP + E+ ++LP + + +L +S +L P R+ ++ ++++EG + +I
Sbjct: 71 IELPFNSSDHGLPHDGENFDSLPYSLVISLLEASRSLREPFRDFMTKILKEEGQSSVIVI 130
Query: 132 SDVFLGWSVAVARSFNIPVFSFTTCGAYGTLAYVSLWLNLPHRSSTSDEFSLPGFPENCR 191
D FLGW V + + F+ GA+G Y S+WLNLPH+ + D+F L FPE
Sbjct: 131 GDFFLGWIGKVCKEVGVYSVIFSASGAFGLGCYRSIWLNLPHKETKQDQFLLDDFPEAGE 190
Query: 192 FHRSQLHRFLLAADGTDSWSRYFRPQISYSLSSDGWLCNTVEEVESFGLKHLRDYIKLPV 251
++QL+ F+L ADGTD WS + + I DG+L NTV E++ GL + R +PV
Sbjct: 191 IEKTQLNSFMLEADGTDDWSVFMKKIIPGWSDFDGFLFNTVAEIDQMGLSYFRRITGVPV 250
Query: 252 WAIGPLLLQTSSGGRRRWGKEKDSGVGL----ETYMNWLNSHRKNSVLYISFGSQNTITE 311
W +GP+L K D VG E +WL+S +SV+Y+ FGS N+I +
Sbjct: 251 WPVGPVL------------KSPDKKVGSRSTEEAVKSWLDSKPDHSVVYVCFGSMNSILQ 310
Query: 312 SQMMELAYGLEESGSAFIWVVRPPSGHDMKAEFRAHQWLPEHFEDRMKETNRGLVIRNWA 371
+ M+ELA LE S FIWVVRPP G ++K+EF +LPE FE+R+ + RGL+++ WA
Sbjct: 311 THMLELAMALESSEKNFIWVVRPPIGVEVKSEFDVKGYLPEGFEERITRSERGLLVKKWA 370
Query: 372 PQLEILAHESVGAFLSHCGWNSTVESLSQGVPVIGWPMAAEQSYNSKMLVEEMGIGVELT 431
PQ++IL+H++ FLSHCGWNS +ESLS GVP++GWPMAAEQ +NS ++ + +G+ VE+
Sbjct: 371 PQVDILSHKATCVFLSHCGWNSILESLSHGVPLLGWPMAAEQFFNSILMEKHIGVSVEVA 430
Query: 432 RGKESEIKRGRVKEVIEMVMGEGGEGEQMRNKAAIVKDKMRAAVMDEQKGSSNTNLVDFL 489
RGK EIK + I++VM E G+++R KA VK+ +R A++D KGSS L +FL
Sbjct: 431 RGKRCEIKCDDIVSKIKLVMEETEVGKEIRKKAREVKELVRRAMVDGVKGSSVIGLEEFL 485
BLAST of CmaCh06G016240 vs. ExPASy Swiss-Prot
Match:
Q6WFW1 (Crocetin glucosyltransferase 3 OS=Crocus sativus OX=82528 GN=GLT3 PE=1 SV=1)
HSP 1 Score: 357.1 bits (915), Expect = 3.3e-97
Identity = 207/455 (45.49%), Positives = 280/455 (61.54%), Query Frame = 0
Query: 11 HILMLPFMAXGHLIPFLELANFIHRKSSVFTITIACTPSNIQYLRSA-AADSKIRLAELY 70
HI++ PFM+ GH+IPFL LA I + +TIT+ TP NI L+S +S I L L
Sbjct: 5 HIVLFPFMSQGHIIPFLSLAKLISERHPTYTITLLNTPLNILNLQSTLPPNSNIHLKSLP 64
Query: 71 YSSSNHGLPPNTESTENLPLNQIDTLFHSSTALELPLRELISDLV-QKEGNPPLCIISDV 130
Y SS+ GLPP+ E+T++LP + + + S +L +SDL Q PPL I++DV
Sbjct: 65 YRSSDFGLPPDRENTDSLPFPLVLSFYQSGESLATHFTHFVSDLTRQNHDTPPLLIVADV 124
Query: 131 FLGWSVAVARSFNIPVFSFTTCGAYGTLAYVSLWLNLPHRSSTSDEFSLPGFPENCRFHR 190
F GW+ +A+ N V SF+TCGAYGT AY S+WL+LPH + +F+ PGFPE + R
Sbjct: 125 FFGWTAEIAKRLNTHV-SFSTCGAYGTAAYFSVWLHLPHAETDLPDFTAPGFPETFKLQR 184
Query: 191 SQLHRFLLAADGTDSWSRYFRPQISYSLSSDGWLCNTVEEVESFGLKHLRDYIKLPVWAI 250
+QL +L ADG+D WS++F+ QIS SL+SD +CNTVEE+E+ GL+ LR L VW+I
Sbjct: 185 NQLSTYLKKADGSDRWSKFFQRQISLSLTSDAMICNTVEEMEAEGLRLLRKNTGLRVWSI 244
Query: 251 GPLLLQ---TSSGGRRRWGKEKDSGVGLETYMNWLNSHRKNSVLYISFGSQNTITESQMM 310
GPLL SS GR + SG+ + M WL+SH SV+Y+SFGS + T +QM
Sbjct: 245 GPLLPSLPPNSSLGR----SGRKSGMEVSYIMKWLDSHPPGSVVYVSFGSIHD-TAAQMT 304
Query: 311 ELAYGLEESGSAFIWVVRPPSGHDMKAEFRAHQ-------WLPEHFEDRMKETNRGLVIR 370
LA GL + + GH + F ++ +P+ FE RM+ + RG++I
Sbjct: 305 SLAVGLA------VELATRSCGHSGR-RFGGNRNRNSNPNGVPDEFEARMRGSGRGILIH 364
Query: 371 NWAPQLEILAHESVGAFLSHCGWNSTVESLSQGVPVIGWPMAAEQSYNSKMLVE--EMGI 430
WAPQLEIL HES GAF+SHCGWNST+ESLS+GV +IGWP+AAEQ YNSKM+ E E G
Sbjct: 365 GWAPQLEILEHESTGAFVSHCGWNSTLESLSRGVCMIGWPLAAEQFYNSKMVEEDWEWGG 424
Query: 431 GVELTRG--KESEIKRGRVKEVIEMVMGEGGEGEQ 450
E + G + E++R V+ V E G E EQ
Sbjct: 425 TCEGSGGGVRSEEVER-LVRLVTEDEKGSDEENEQ 445
BLAST of CmaCh06G016240 vs. ExPASy Swiss-Prot
Match:
Q9AT54 (Scopoletin glucosyltransferase OS=Nicotiana tabacum OX=4097 GN=TOGT1 PE=1 SV=1)
HSP 1 Score: 255.4 bits (651), Expect = 1.3e-66
Identity = 171/492 (34.76%), Positives = 255/492 (51.83%), Query Frame = 0
Query: 8 GEYHILMLPFMAXGHLIPFLELANFIHRKSSVFTITIACTPSN-------IQYLRSAAAD 67
G+ H P MA GH+IP L++A S TI TP N IQ + +
Sbjct: 2 GQLHFFFFPVMAHGHMIPTLDMAKLF--ASRGVKATIITTPLNEFVFSKAIQRNKHLGIE 61
Query: 68 SKIRLAELYYSSSNHGLPPNTESTENLPLNQ-IDTLFHSSTALELPLRELISDLVQKEGN 127
+IRL + + + +GLP E + +P ++ + F + ++ PL +LI +
Sbjct: 62 IEIRL--IKFPAVENGLPEECERLDQIPSDEKLPNFFKAVAMMQEPLEQLIEEC------ 121
Query: 128 PPLCIISDVFLGWSVAVARSFNIPVFSFTTCGAYGTLAYVSLWLNLPHR--SSTSDEFSL 187
P C+ISD+FL W+ A FNIP F + S+ LN P + SS S+ F +
Sbjct: 122 RPDCLISDMFLPWTTDTAAKFNIPRIVFHGTSFFALCVENSVRLNKPFKNVSSDSETFVV 181
Query: 188 PGFPENCRFHRSQLHRFLLAADGTDSWSRYFRPQISYSLSSDGWLCNTVEEVESFGLKHL 247
P P + R+Q+ F + + T + +R + S G + N+ E+E+ ++H
Sbjct: 182 PDLPHEIKLTRTQVSPFERSGEET-AMTRMIKTVRESDSKSYGVVFNSFYELETDYVEHY 241
Query: 248 RDYIKLPVWAIGPLLLQTSSGGRRRWGKEKDSGVGLETYMNWLNSHRKNSVLYISFGSQN 307
+ WAIGPL + + + + K S + + WL+S + +SV+Y+ FGS
Sbjct: 242 TKVLGRRAWAIGPLSM-CNRDIEDKAERGKKSSIDKHECLKWLDSKKPSSVVYVCFGSVA 301
Query: 308 TITESQMMELAYGLEESGSAFIWVVRPPSGHDMKAEFRAHQWLPEHFEDRMKETNRGLVI 367
T SQ+ ELA G+E SG FIWVVR E WLPE FE+R KE +GL+I
Sbjct: 302 NFTASQLHELAMGIEASGQEFIWVVR--------TELDNEDWLPEGFEERTKE--KGLII 361
Query: 368 RNWAPQLEILAHESVGAFLSHCGWNSTVESLSQGVPVIGWPMAAEQSYNSKMLVEEM--- 427
R WAPQ+ IL HESVGAF++HCGWNST+E +S GVP++ WP+ AEQ +N K++ E +
Sbjct: 362 RGWAPQVLILDHESVGAFVTHCGWNSTLEGVSGGVPMVTWPVFAEQFFNEKLVTEVLKTG 421
Query: 428 -GIG-VELTRGKESEIKRGRVKEVIEMVMGEGGEGEQMRNKAAIVKDKMRAAVMDEQKGS 485
G+G ++ R +KR + + I+ VM E + RN+A K+ R A+ E+ GS
Sbjct: 422 AGVGSIQWKRSASEGVKREAIAKAIKRVM-VSEEADGFRNRAKAYKEMARKAI--EEGGS 468
BLAST of CmaCh06G016240 vs. ExPASy Swiss-Prot
Match:
Q2V6J9 (UDP-glucose flavonoid 3-O-glucosyltransferase 7 OS=Fragaria ananassa OX=3747 GN=GT7 PE=1 SV=1)
HSP 1 Score: 248.4 bits (633), Expect = 1.6e-64
Identity = 163/484 (33.68%), Positives = 239/484 (49.38%), Query Frame = 0
Query: 9 EYHILMLPFMAXGHLIPFLELANFIHRKSSVFTITIACTPSNIQYLRSAAADSKIRLAEL 68
+ HI LPFMA GH IP ++A S TI TP N A +I L +
Sbjct: 10 QLHIFFLPFMARGHSIPLTDIAKLF--SSHGARCTIVTTPLNAPLFSKATQRGEIELVLI 69
Query: 69 YYSSSNHGLPPNTESTENLPLNQIDTLFHSSTALELPLRELISDLVQKEGNPPLCIISDV 128
+ S+ GLP + ES + + + F +T L P E I D + P C+++D
Sbjct: 70 KFPSAEAGLPQDCESADLITTQDMLGKFVKATFLIEPHFEKILD-----EHRPHCLVADA 129
Query: 129 FLGWSVAVARSFNIPVFSFTTCGAYGTLAYVSLWLNLPHR--SSTSDEFSLPGFPENCRF 188
F W+ VA F IP F G + A +S+ + PH SS S+ F +P P+ +
Sbjct: 130 FFTWATDVAAKFRIPRLYFHGTGFFALCASLSVMMYQPHSNLSSDSESFVIPNLPDEIKM 189
Query: 189 HRSQLHRFLLAADGTDSWSRYFRPQISYSLSSDGWLCNTVEEVESFGLKHLRDYIKLPVW 248
RSQL F + + + I S G + N+ E+E H R W
Sbjct: 190 TRSQLPVF----PDESEFMKMLKASIEIEERSYGVIVNSFYELEPAYANHYRKVFGRKAW 249
Query: 249 AIGPL-LLQTSSGGRRRWGKEKDSGVGLETYMNWLNSHRKNSVLYISFGSQNTITESQMM 308
IGP+ + + G K S + WL+S + SV+Y+SFGS +SQ++
Sbjct: 250 HIGPVSFCNKAIEDKAERGSIKSSTAEKHECLKWLDSKKPRSVVYVSFGSMVRFADSQLL 309
Query: 309 ELAYGLEESGSAFIWVVRPPSGHDMKAEFRAHQWLPEHFEDRMKETNRGLVIRNWAPQLE 368
E+A GLE SG FIWVV+ K + +WLPE FE RM+ +GL+IR+WAPQ+
Sbjct: 310 EIATGLEASGQDFIWVVK-------KEKKEVEEWLPEGFEKRME--GKGLIIRDWAPQVL 369
Query: 369 ILAHESVGAFLSHCGWNSTVESLSQGVPVIGWPMAAEQSYNSKMLVEEMGIGVELTRGK- 428
IL HE++GAF++HCGWNS +E++S GVP+I WP+ EQ YN K++ E IGV + K
Sbjct: 370 ILEHEAIGAFVTHCGWNSILEAVSAGVPMITWPVFGEQFYNEKLVTEIHRIGVPVGSEKW 429
Query: 429 -----------ESEIKRGRVKEVIEMVMGEGGEGEQMRNKAAIVKDKMRAAVMDEQKGSS 478
E ++R ++E + +M G E + R++ + + R AV E+ GSS
Sbjct: 430 ALSFVDVNAETEGRVRREAIEEAVTRIM-VGDEAVETRSRVKELGENARRAV--EEGGSS 470
BLAST of CmaCh06G016240 vs. ExPASy Swiss-Prot
Match:
Q9ZQ96 (UDP-glycosyltransferase 73C3 OS=Arabidopsis thaliana OX=3702 GN=UGT73C3 PE=2 SV=1)
HSP 1 Score: 239.2 bits (609), Expect = 1.0e-61
Identity = 158/495 (31.92%), Positives = 262/495 (52.93%), Query Frame = 0
Query: 11 HILMLPFMAXGHLIPFLELANFIHRKSSVFTITIACTPSNIQYL-----RSAAADSKIRL 70
H ++ PFMA GH+IP +++A + ++ TITI TP N R+ + I +
Sbjct: 14 HFVLFPFMAQGHMIPMIDIARLLAQRG--VTITIVTTPHNAARFKNVLNRAIESGLAINI 73
Query: 71 AELYYSSSNHGLPPNTESTENLPLNQIDT-LFHSSTALELPLRELISDLVQKEGNPPLCI 130
+ + GLP E+ ++L ++ F + LE P+ +L+ ++ + P C+
Sbjct: 74 LHVKFPYQEFGLPEGKENIDSLDSTELMVPFFKAVNLLEDPVMKLMEEMKPR----PSCL 133
Query: 131 ISDVFLGWSVAVARSFNIPVFSFTTCGAYGTLAYVSLWLN---LPHRSSTSDEFSLPGFP 190
ISD L ++ +A++FNIP F G + L L N L + S + F +P FP
Sbjct: 134 ISDWCLPYTSIIAKNFNIPKIVFHGMGCFNLLCMHVLRRNLEILENVKSDEEYFLVPSFP 193
Query: 191 ENCRFHRSQLHRFLLAADGTDSWSRYFRPQISYSLSSDGWLCNTVEEVESFGLKHLRDYI 250
+ F + QL + A+ + W + +S G + NT +E+E +K ++ +
Sbjct: 194 DRVEFTKLQLP---VKANASGDWKEIMDEMVKAEYTSYGVIVNTFQELEPPYVKDYKEAM 253
Query: 251 KLPVWAIGPLLLQTSSGGRRRWGKEKDSGVGLETYMNWLNSHRKNSVLYISFGSQNTITE 310
VW+IGP+ L +G + K + + + + WL+S + SVLY+ GS +
Sbjct: 254 DGKVWSIGPVSLCNKAGADKAERGSK-AAIDQDECLQWLDSKEEGSVLYVCLGSICNLPL 313
Query: 311 SQMMELAYGLEESGSAFIWVVRPPSGHDMKAEFRAHQWLPEH-FEDRMKETNRGLVIRNW 370
SQ+ EL GLEES +FIWV+R G + E +W+ E FE+R+KE RGL+I+ W
Sbjct: 314 SQLKELGLGLEESRRSFIWVIR---GSEKYKEL--FEWMLESGFEERIKE--RGLLIKGW 373
Query: 371 APQLEILAHESVGAFLSHCGWNSTVESLSQGVPVIGWPMAAEQSYNSKMLVEEMGIGVE- 430
APQ+ IL+H SVG FL+HCGWNST+E ++ G+P+I WP+ +Q N K++V+ + GV
Sbjct: 374 APQVLILSHPSVGGFLTHCGWNSTLEGITSGIPLITWPLFGDQFCNQKLVVQVLKAGVSA 433
Query: 431 -----LTRGKESEI-----KRGRVKEVIEMVMGEGGEGEQMRNKAAIVKDKMRAAVMDEQ 485
+ G+E +I K G VK+ +E +MG+ + ++ R + + + AV E+
Sbjct: 434 GVEEVMKWGEEDKIGVLVDKEG-VKKAVEELMGDSDDAKERRRRVKELGELAHKAV--EK 488
BLAST of CmaCh06G016240 vs. TAIR 10
Match:
AT5G12890.1 (UDP-Glycosyltransferase superfamily protein)
HSP 1 Score: 385.6 bits (989), Expect = 6.1e-107
Identity = 203/487 (41.68%), Positives = 302/487 (62.01%), Query Frame = 0
Query: 12 ILMLPFMAXGHLIPFLELA-----NFIHRKSSVFTITIACTPSNIQYLRS-AAADSKIRL 71
I+M PFM GH+IPF+ LA I +++ TI++ TPSNI +RS +S I L
Sbjct: 11 IVMFPFMGQGHIIPFVALALRLEKIMIMNRANKTTISMINTPSNIPKIRSNLPPESSISL 70
Query: 72 AELYYSSSNHGLPPNTESTENLPLNQIDTLFHSSTALELPLRELISDLVQKEGNPPLCII 131
EL ++SS+HGLP + E+ ++LP + + +L +S +L P R+ ++ ++++EG + +I
Sbjct: 71 IELPFNSSDHGLPHDGENFDSLPYSLVISLLEASRSLREPFRDFMTKILKEEGQSSVIVI 130
Query: 132 SDVFLGWSVAVARSFNIPVFSFTTCGAYGTLAYVSLWLNLPHRSSTSDEFSLPGFPENCR 191
D FLGW V + + F+ GA+G Y S+WLNLPH+ + D+F L FPE
Sbjct: 131 GDFFLGWIGKVCKEVGVYSVIFSASGAFGLGCYRSIWLNLPHKETKQDQFLLDDFPEAGE 190
Query: 192 FHRSQLHRFLLAADGTDSWSRYFRPQISYSLSSDGWLCNTVEEVESFGLKHLRDYIKLPV 251
++QL+ F+L ADGTD WS + + I DG+L NTV E++ GL + R +PV
Sbjct: 191 IEKTQLNSFMLEADGTDDWSVFMKKIIPGWSDFDGFLFNTVAEIDQMGLSYFRRITGVPV 250
Query: 252 WAIGPLLLQTSSGGRRRWGKEKDSGVGL----ETYMNWLNSHRKNSVLYISFGSQNTITE 311
W +GP+L K D VG E +WL+S +SV+Y+ FGS N+I +
Sbjct: 251 WPVGPVL------------KSPDKKVGSRSTEEAVKSWLDSKPDHSVVYVCFGSMNSILQ 310
Query: 312 SQMMELAYGLEESGSAFIWVVRPPSGHDMKAEFRAHQWLPEHFEDRMKETNRGLVIRNWA 371
+ M+ELA LE S FIWVVRPP G ++K+EF +LPE FE+R+ + RGL+++ WA
Sbjct: 311 THMLELAMALESSEKNFIWVVRPPIGVEVKSEFDVKGYLPEGFEERITRSERGLLVKKWA 370
Query: 372 PQLEILAHESVGAFLSHCGWNSTVESLSQGVPVIGWPMAAEQSYNSKMLVEEMGIGVELT 431
PQ++IL+H++ FLSHCGWNS +ESLS GVP++GWPMAAEQ +NS ++ + +G+ VE+
Sbjct: 371 PQVDILSHKATCVFLSHCGWNSILESLSHGVPLLGWPMAAEQFFNSILMEKHIGVSVEVA 430
Query: 432 RGKESEIKRGRVKEVIEMVMGEGGEGEQMRNKAAIVKDKMRAAVMDEQKGSSNTNLVDFL 489
RGK EIK + I++VM E G+++R KA VK+ +R A++D KGSS L +FL
Sbjct: 431 RGKRCEIKCDDIVSKIKLVMEETEVGKEIRKKAREVKELVRRAMVDGVKGSSVIGLEEFL 485
BLAST of CmaCh06G016240 vs. TAIR 10
Match:
AT2G15490.3 (UDP-glycosyltransferase 73B4)
HSP 1 Score: 239.6 bits (610), Expect = 5.4e-63
Identity = 157/504 (31.15%), Positives = 261/504 (51.79%), Query Frame = 0
Query: 6 NSGEYHILMLPFMAXGHLIPFLELANFIHRKSSVFTITIACTPSN-------IQYLRSAA 65
N + HIL PFMA GH+IP L++A R+ + T+ TP N I+ +
Sbjct: 2 NREQIHILFFPFMAHGHMIPLLDMAKLFARRGA--KSTLLTTPINAKILEKPIEAFKVQN 61
Query: 66 ADSKIRLAELYYSSSNHGLPPNTESTENLPLNQIDTLFHSSTALELPLRELISDLVQK-- 125
D +I + L + GLP E+ + + + S + +L L+ L S K
Sbjct: 62 PDLEIGIKILNFPCVELGLPEGCENRDFI------NSYQKSDSFDLFLKFLFSTKYMKQQ 121
Query: 126 -----EGNPPLCIISDVFLGWSVAVARSFNIPVFSFTTCGAYGTLAYVSLWLNLPHR--S 185
E P +++D+F W+ A +P F ++ ++ ++ PH+ +
Sbjct: 122 LESFIETTKPSALVADMFFPWATESAEKIGVPRLVFHGTSSFALCCSYNMRIHKPHKKVA 181
Query: 186 STSDEFSLPGFPENCRFHRSQLHRFLLAADGTDSWSRYFRPQISYSLSSDGWLCNTVEEV 245
S+S F +PG P + Q + + + ++++ SS G L N+ E+
Sbjct: 182 SSSTPFVIPGLPGDIVITEDQAN----VTNEETPFGKFWKEVRESETSSFGVLVNSFYEL 241
Query: 246 ESFGLKHLRDYIKLPVWAIGPLLLQTSSGGRRRWGKEKDSGVGLETYMNWLNSHRKNSVL 305
ES R ++ W IGPL L ++ G + G+ K + + + + WL+S SV+
Sbjct: 242 ESSYADFYRSFVAKKAWHIGPLSL-SNRGIAEKAGRGKKANIDEQECLKWLDSKTPGSVV 301
Query: 306 YISFGSQNTITESQMMELAYGLEESGSAFIWVVRPPSGHDMKAEFRAHQWLPEHFEDRMK 365
Y+SFGS + Q++E+A+GLE SG FIWVV S ++ + E WLP+ FE+R K
Sbjct: 302 YLSFGSGTGLPNEQLLEIAFGLEGSGQNFIWVV---SKNENQGE--NEDWLPKGFEERNK 361
Query: 366 ETNRGLVIRNWAPQLEILAHESVGAFLSHCGWNSTVESLSQGVPVIGWPMAAEQSYNSKM 425
+GL+IR WAPQ+ IL H+++G F++HCGWNST+E ++ G+P++ WPM AEQ YN K+
Sbjct: 362 --GKGLIIRGWAPQVLILDHKAIGGFVTHCGWNSTLEGIAAGLPMVTWPMGAEQFYNEKL 421
Query: 426 LVEEMGIGV-----ELTRGKESEIKRGRVKEVIEMVMGEGGEGEQMRNKAAIVKDKMRAA 485
L + + IGV EL + K I R +V++ + V+G G + E+ R +A + + +AA
Sbjct: 422 LTKVLRIGVNVGATELVK-KGKLISRAQVEKAVREVIG-GEKAEERRLRAKELGEMAKAA 481
Query: 486 VMDEQKGSSNTNLVDFLEFIRAKQ 489
V E+ GSS ++ F+E + ++
Sbjct: 482 V--EEGGSSYNDVNKFMEELNGRK 481
BLAST of CmaCh06G016240 vs. TAIR 10
Match:
AT2G36780.1 (UDP-Glycosyltransferase superfamily protein)
HSP 1 Score: 239.2 bits (609), Expect = 7.1e-63
Identity = 158/495 (31.92%), Positives = 262/495 (52.93%), Query Frame = 0
Query: 11 HILMLPFMAXGHLIPFLELANFIHRKSSVFTITIACTPSNIQYL-----RSAAADSKIRL 70
H ++ PFMA GH+IP +++A + ++ TITI TP N R+ + I +
Sbjct: 14 HFVLFPFMAQGHMIPMIDIARLLAQRG--VTITIVTTPHNAARFKNVLNRAIESGLAINI 73
Query: 71 AELYYSSSNHGLPPNTESTENLPLNQIDT-LFHSSTALELPLRELISDLVQKEGNPPLCI 130
+ + GLP E+ ++L ++ F + LE P+ +L+ ++ + P C+
Sbjct: 74 LHVKFPYQEFGLPEGKENIDSLDSTELMVPFFKAVNLLEDPVMKLMEEMKPR----PSCL 133
Query: 131 ISDVFLGWSVAVARSFNIPVFSFTTCGAYGTLAYVSLWLN---LPHRSSTSDEFSLPGFP 190
ISD L ++ +A++FNIP F G + L L N L + S + F +P FP
Sbjct: 134 ISDWCLPYTSIIAKNFNIPKIVFHGMGCFNLLCMHVLRRNLEILENVKSDEEYFLVPSFP 193
Query: 191 ENCRFHRSQLHRFLLAADGTDSWSRYFRPQISYSLSSDGWLCNTVEEVESFGLKHLRDYI 250
+ F + QL + A+ + W + +S G + NT +E+E +K ++ +
Sbjct: 194 DRVEFTKLQLP---VKANASGDWKEIMDEMVKAEYTSYGVIVNTFQELEPPYVKDYKEAM 253
Query: 251 KLPVWAIGPLLLQTSSGGRRRWGKEKDSGVGLETYMNWLNSHRKNSVLYISFGSQNTITE 310
VW+IGP+ L +G + K + + + + WL+S + SVLY+ GS +
Sbjct: 254 DGKVWSIGPVSLCNKAGADKAERGSK-AAIDQDECLQWLDSKEEGSVLYVCLGSICNLPL 313
Query: 311 SQMMELAYGLEESGSAFIWVVRPPSGHDMKAEFRAHQWLPEH-FEDRMKETNRGLVIRNW 370
SQ+ EL GLEES +FIWV+R G + E +W+ E FE+R+KE RGL+I+ W
Sbjct: 314 SQLKELGLGLEESRRSFIWVIR---GSEKYKEL--FEWMLESGFEERIKE--RGLLIKGW 373
Query: 371 APQLEILAHESVGAFLSHCGWNSTVESLSQGVPVIGWPMAAEQSYNSKMLVEEMGIGVE- 430
APQ+ IL+H SVG FL+HCGWNST+E ++ G+P+I WP+ +Q N K++V+ + GV
Sbjct: 374 APQVLILSHPSVGGFLTHCGWNSTLEGITSGIPLITWPLFGDQFCNQKLVVQVLKAGVSA 433
Query: 431 -----LTRGKESEI-----KRGRVKEVIEMVMGEGGEGEQMRNKAAIVKDKMRAAVMDEQ 485
+ G+E +I K G VK+ +E +MG+ + ++ R + + + AV E+
Sbjct: 434 GVEEVMKWGEEDKIGVLVDKEG-VKKAVEELMGDSDDAKERRRRVKELGELAHKAV--EK 488
BLAST of CmaCh06G016240 vs. TAIR 10
Match:
AT2G15490.1 (UDP-glycosyltransferase 73B4)
HSP 1 Score: 239.2 bits (609), Expect = 7.1e-63
Identity = 155/504 (30.75%), Positives = 257/504 (50.99%), Query Frame = 0
Query: 6 NSGEYHILMLPFMAXGHLIPFLELANFIHRKSSVFTITIACTPSN-------IQYLRSAA 65
N + HIL PFMA GH+IP L++A R+ + T+ TP N I+ +
Sbjct: 2 NREQIHILFFPFMAHGHMIPLLDMAKLFARRGA--KSTLLTTPINAKILEKPIEAFKVQN 61
Query: 66 ADSKIRLAELYYSSSNHGLPPNTESTENLPLNQIDTLFHSSTALELPLRELISDLVQK-- 125
D +I + L + GLP E+ + + + S + +L L+ L S K
Sbjct: 62 PDLEIGIKILNFPCVELGLPEGCENRDFI------NSYQKSDSFDLFLKFLFSTKYMKQQ 121
Query: 126 -----EGNPPLCIISDVFLGWSVAVARSFNIPVFSFTTCGAYGTLAYVSLWLNLPHR--S 185
E P +++D+F W+ A +P F ++ ++ ++ PH+ +
Sbjct: 122 LESFIETTKPSALVADMFFPWATESAEKIGVPRLVFHGTSSFALCCSYNMRIHKPHKKVA 181
Query: 186 STSDEFSLPGFPENCRFHRSQLHRFLLAADGTDSWSRYFRPQISYSLSSDGWLCNTVEEV 245
S+S F +PG P + Q + + + ++++ SS G L N+ E+
Sbjct: 182 SSSTPFVIPGLPGDIVITEDQAN----VTNEETPFGKFWKEVRESETSSFGVLVNSFYEL 241
Query: 246 ESFGLKHLRDYIKLPVWAIGPLLLQTSSGGRRRWGKEKDSGVGLETYMNWLNSHRKNSVL 305
ES R ++ W IGPL L ++ G + G+ K + + + + WL+S SV+
Sbjct: 242 ESSYADFYRSFVAKKAWHIGPLSL-SNRGIAEKAGRGKKANIDEQECLKWLDSKTPGSVV 301
Query: 306 YISFGSQNTITESQMMELAYGLEESGSAFIWVVRPPSGHDMKAEFRAHQWLPEHFEDRMK 365
Y+SFGS + Q++E+A+GLE SG FIWVV + WLP+ FE+R K
Sbjct: 302 YLSFGSGTGLPNEQLLEIAFGLEGSGQNFIWVV--SKNENQVGTGENEDWLPKGFEERNK 361
Query: 366 ETNRGLVIRNWAPQLEILAHESVGAFLSHCGWNSTVESLSQGVPVIGWPMAAEQSYNSKM 425
+GL+IR WAPQ+ IL H+++G F++HCGWNST+E ++ G+P++ WPM AEQ YN K+
Sbjct: 362 --GKGLIIRGWAPQVLILDHKAIGGFVTHCGWNSTLEGIAAGLPMVTWPMGAEQFYNEKL 421
Query: 426 LVEEMGIGV-----ELTRGKESEIKRGRVKEVIEMVMGEGGEGEQMRNKAAIVKDKMRAA 485
L + + IGV EL + K I R +V++ + V+G G + E+ R +A + + +AA
Sbjct: 422 LTKVLRIGVNVGATELVK-KGKLISRAQVEKAVREVIG-GEKAEERRLRAKELGEMAKAA 481
Query: 486 VMDEQKGSSNTNLVDFLEFIRAKQ 489
V E+ GSS ++ F+E + ++
Sbjct: 482 V--EEGGSSYNDVNKFMEELNGRK 484
BLAST of CmaCh06G016240 vs. TAIR 10
Match:
AT2G36800.1 (don-glucosyltransferase 1)
HSP 1 Score: 237.7 bits (605), Expect = 2.1e-62
Identity = 165/508 (32.48%), Positives = 266/508 (52.36%), Query Frame = 0
Query: 1 MGSVTNSGEYHILMLPFMAXGHLIPFLELANFIHRKSSVFTITIACTPSNIQYL-----R 60
+ T S H ++ PFMA GH+IP +++A + ++ + ITI TP N R
Sbjct: 2 VSETTKSSPLHFVLFPFMAQGHMIPMVDIARLLAQRGVI--ITIVTTPHNAARFKNVLNR 61
Query: 61 SAAADSKIRLAELYYSSSNHGLPPNTESTENL-PLNQIDTLFHSSTALELPLRELISDLV 120
+ + I L ++ + GL E+ ++L + ++ F + LE P+++LI ++
Sbjct: 62 AIESGLPINLVQVKFPYLEAGLQEGQENIDSLDTMERMIPFFKAVNFLEEPVQKLIEEM- 121
Query: 121 QKEGNP-PLCIISDVFLGWSVAVARSFNIPVFSFTTCGAYGTLAYVSLWLN---LPHRSS 180
NP P C+ISD L ++ +A+ FNIP F G + L L N L + S
Sbjct: 122 ----NPRPSCLISDFCLPYTSKIAKKFNIPKILFHGMGCFCLLCMHVLRKNREILDNLKS 181
Query: 181 TSDEFSLPGFPENCRFHRSQ--LHRFLLAADGTDSWSRYFRPQISYSLSSDGWLCNTVEE 240
+ F++P FP+ F R+Q + ++ A D W F + + +S G + N+ +E
Sbjct: 182 DKELFTVPDFPDRVEFTRTQVPVETYVPAGD----WKDIFDGMVEANETSYGVIVNSFQE 241
Query: 241 VESFGLKHLRDYIKLPVWAIGPLLLQTSSGGRRRWGKEKDSGVGLETYMNWLNSHRKNSV 300
+E K ++ W IGP+ L G + K S + + + WL+S + SV
Sbjct: 242 LEPAYAKDYKEVRSGKAWTIGPVSLCNKVGADKAERGNK-SDIDQDECLKWLDSKKHGSV 301
Query: 301 LYISFGSQNTITESQMMELAYGLEESGSAFIWVVRPPSGHDMKAEFRAHQWLPEH-FEDR 360
LY+ GS + SQ+ EL GLEES FIWV+R G + E +W E FEDR
Sbjct: 302 LYVCLGSICNLPLSQLKELGLGLEESQRPFIWVIR---GWEKYKEL--VEWFSESGFEDR 361
Query: 361 MKETNRGLVIRNWAPQLEILAHESVGAFLSHCGWNSTVESLSQGVPVIGWPMAAEQSYNS 420
+++ RGL+I+ W+PQ+ IL+H SVG FL+HCGWNST+E ++ G+P++ WP+ A+Q N
Sbjct: 362 IQD--RGLLIKGWSPQMLILSHPSVGGFLTHCGWNSTLEGITAGLPLLTWPLFADQFCNE 421
Query: 421 KMLVEEMGIGVE------LTRGKESEI-----KRGRVKEVIEMVMGEGGEGEQMRNKAAI 480
K++VE + GV + G+E +I K G VK+ +E +MGE + ++ R +A
Sbjct: 422 KLVVEVLKAGVRSGVEQPMKWGEEEKIGVLVDKEG-VKKAVEELMGESDDAKERRRRAKE 481
Query: 481 VKDKMRAAVMDEQKGSSNTNLVDFLEFI 485
+ D AV E+ GSS++N+ L+ I
Sbjct: 482 LGDSAHKAV--EEGGSSHSNISFLLQDI 487
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q9LXV0 | 8.6e-106 | 41.68 | UDP-glycosyltransferase 92A1 OS=Arabidopsis thaliana OX=3702 GN=UGT92A1 PE=2 SV=...
Q6WFW1 | 3.3e-97 | 45.49 | Crocetin glucosyltransferase 3 OS=Crocus sativus OX=82528 GN=GLT3 PE=1 SV=1
Q9AT54 | 1.3e-66 | 34.76 | Scopoletin glucosyltransferase OS=Nicotiana tabacum OX=4097 GN=TOGT1 PE=1 SV=1
Q2V6J9 | 1.6e-64 | 33.68 | UDP-glucose flavonoid 3-O-glucosyltransferase 7 OS=Fragaria ananassa OX=3747 GN=...
Q9ZQ96 | 1.0e-61 | 31.92 | UDP-glycosyltransferase 73C3 OS=Arabidopsis thaliana OX=3702 GN=UGT73C3 PE=2 SV=...
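
The identity values in the summary table are the ratio of identical positions to alignment length reported in each HSP header (for example, 203 identical positions over 487 aligned columns for Q9LXV0). The following is a small sketch of how that percentage could be recomputed from a header line; the `identity_percent` function and the regular expression are illustrative assumptions, not part of any BLAST tooling.

```python
import re

# Matches the "Identity = <matches>/<alignment length> (<percent>%)" field
# as it appears in the HSP header lines on this page.
IDENTITY_FIELD = re.compile(r"Identity = (\d+)/(\d+) \(([\d.]+)%\)")


def identity_percent(header_line: str) -> float:
    """Recompute percent identity from the counts in an HSP header line."""
    match = IDENTITY_FIELD.search(header_line)
    if match is None:
        raise ValueError("no 'Identity = a/b (p%)' field found")
    identical = int(match.group(1))
    aligned_length = int(match.group(2))
    return round(100 * identical / aligned_length, 2)


line = "Identity = 203/487 (41.68%), Query Frame = 0"
print(identity_percent(line))  # 41.68
```

Note that the denominator is the alignment length including gaps, not the length of the query protein, so the percentage describes only the aligned region.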