Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AGAGTCTATATATATTCATATCCATGACTGAGCAAAATCTCAGCCCATAGCCTGTAGTTTCTGCACTTACTGACTTCACCTTCTTCAGCTCGCCATGGAATCCCAACCTCATGTCGCTCTTATTTCTAGCCCCGGAATGGGCCATCTCTTCCCCTCCCTCGAGCTCGCCACGCGCCTCTCCATGCGCCACCACCTCTCCGTCACCGTTTTCATCGTCCCTTCCCGCTCCTCCTCCGCCGAATACAAAGTCATCGCCGCCGCTCAAGCTGCCGGTCTCTTCACTGTCATTGAACTTCCTCCGGCTGACATGTCCGACGTCACTGACTCCACCGTCGTCGGTCGCCTCTCCATCACCATGCGTCGCCACGTCCCGGCTCTCCGCTCCGCCGTCTCCGCCCTTACCTCTCTCCCCTCCGTCCTCATCGCCGACATCTTCGCAACCGAGTCCTTCGCTGTCGCCGACGAGTTCCACATGGCCAAATACGTCTTCGTCGCCTCTAACGCATGGTTCTTTGCCTTAACCATCTACGTCCCGGTTCTCGATAAGCAAATCAACGGTCAGTACGTGGACCAGAAAGAACCGTTCCATATCCCTGGTTGCGAACCGGTTCGACCCTGCGACGTTATGGACCCACTTCTGGACCGGACCGAACCACAGTATTTCGAATACGTCAGAATCGGGACGGAGATACCATCGAGCGACGGCGTTTTGGTTAACACGTGGGATGACTTGGAAGGTCGCACGCTTGCATCTTTCAGAGATTGGAATTTGTTGGGCCGTATTATGAAGTCGCCGGTTTATTCTATTGGACCGATCGTACGGCAGACCGGTGGGAAGAAAGGCGGTGCGAGTGAGTTGTTCAACTGGCTGAGTAAGCAACCCGGTGAGTCAGTGATATACGTGTCGTTTGGGAGTGGTGGAACGCTGTCGTCTGAGCAAATGACGGAAGTGGCTCACGGATTGGAAATGAGCGGGCAGAGATTTGTTTGGGTGGTACGCGCTCCAAAGGTAATCATTTTTTAATTTTTAATTTAATTTTTTAATTTAAATATAAACTAATAATAATTCTACATTAATTTTACAGGTAAGATCGGACGCGACGTTTTTCACGACCGGCGATGGGACTGAGGACCAATCAGAGGCGAAGTTTCTGCCAGATGGGTTTTTGGAGCGGACGTCGGAGGTGGGGTTTGTGGTGTCGATGTGGGCGGATCAGACGGCGGTGTTGGGGAGTCCAGCGGTGGGGGGTTTTTTCACGCACGGCGGATGGAACTCAGCGTTGGAAGGGATTACGAACGGAGTTCCAATGGTTGTGTGGCCGTTATATGCGGAACAGCGGATGAACGCCACGATGCTGGCGGAAGAGGTGCGGGTGGCCGTCCGACCAAAGGAGCTGCCGACGAAGGCGGTGATCGGAAGGGAGGAGATTGCAGCGATGGTGAGGAAGATAATGGCGGAGGAGGATGAAGAAGGGAAAGCCATTAGAGCAAAGGCTAAGGAACTTCAACGGAGTGCAGAGAAGTCCACGGCGGAAGGTGGGTCGTCGTTCGAGAACTTCGCTCGAGTGGTGAAATTATGGCGTGAAAATCGAGTTTGAATTTCACTTCCTTCAACTAAATTTGTATTTTAGTAAATTCCTATTCTCAACCACTTTTTAATTTTGTAATATTAGAGTTATATTATAATATATCCCAATAATTA
mRNA sequence
AGAGTCTATATATATTCATATCCATGACTGAGCAAAATCTCAGCCCATAGCCTGTAGTTTCTGCACTTACTGACTTCACCTTCTTCAGCTCGCCATGGAATCCCAACCTCATGTCGCTCTTATTTCTAGCCCCGGAATGGGCCATCTCTTCCCCTCCCTCGAGCTCGCCACGCGCCTCTCCATGCGCCACCACCTCTCCGTCACCGTTTTCATCGTCCCTTCCCGCTCCTCCTCCGCCGAATACAAAGTCATCGCCGCCGCTCAAGCTGCCGGTCTCTTCACTGTCATTGAACTTCCTCCGGCTGACATGTCCGACGTCACTGACTCCACCGTCGTCGGTCGCCTCTCCATCACCATGCGTCGCCACGTCCCGGCTCTCCGCTCCGCCGTCTCCGCCCTTACCTCTCTCCCCTCCGTCCTCATCGCCGACATCTTCGCAACCGAGTCCTTCGCTGTCGCCGACGAGTTCCACATGGCCAAATACGTCTTCGTCGCCTCTAACGCATGGTTCTTTGCCTTAACCATCTACGTCCCGGTTCTCGATAAGCAAATCAACGGTCAGTACGTGGACCAGAAAGAACCGTTCCATATCCCTGGTTGCGAACCGGTTCGACCCTGCGACGTTATGGACCCACTTCTGGACCGGACCGAACCACAGTATTTCGAATACGTCAGAATCGGGACGGAGATACCATCGAGCGACGGCGTTTTGGTTAACACGTGGGATGACTTGGAAGGTCGCACGCTTGCATCTTTCAGAGATTGGAATTTGTTGGGCCGTATTATGAAGTCGCCGGTTTATTCTATTGGACCGATCGTACGGCAGACCGGTGGGAAGAAAGGCGGTGCGAGTGAGTTGTTCAACTGGCTGAGTAAGCAACCCGGTGAGTCAGTGATATACGTGTCGTTTGGGAGTGGTGGAACGCTGTCGTCTGAGCAAATGACGGAAGTGGCTCACGGATTGGAAATGAGCGGGCAGAGATTTGTTTGGGTGGTACGCGCTCCAAAGGTAAGATCGGACGCGACGTTTTTCACGACCGGCGATGGGACTGAGGACCAATCAGAGGCGAAGTTTCTGCCAGATGGGTTTTTGGAGCGGACGTCGGAGGTGGGGTTTGTGGTGTCGATGTGGGCGGATCAGACGGCGGTGTTGGGGAGTCCAGCGGTGGGGGGTTTTTTCACGCACGGCGGATGGAACTCAGCGTTGGAAGGGATTACGAACGGAGTTCCAATGGTTGTGTGGCCGTTATATGCGGAACAGCGGATGAACGCCACGATGCTGGCGGAAGAGGTGCGGGTGGCCGTCCGACCAAAGGAGCTGCCGACGAAGGCGGTGATCGGAAGGGAGGAGATTGCAGCGATGGTGAGGAAGATAATGGCGGAGGAGGATGAAGAAGGGAAAGCCATTAGAGCAAAGGCTAAGGAACTTCAACGGAGTGCAGAGAAGTCCACGGCGGAAGGTGGGTCGTCGTTCGAGAACTTCGCTCGAGTGGTGAAATTATGGCGTGAAAATCGAGTTTGAATTTCACTTCCTTCAACTAAATTTGTATTTTAGTAAATTCCTATTCTCAACCACTTTTTAATTTTGTAATATTAGAGTTATATTATAATATATCCCAATAATTA
Coding sequence (CDS)
ATGGAATCCCAACCTCATGTCGCTCTTATTTCTAGCCCCGGAATGGGCCATCTCTTCCCCTCCCTCGAGCTCGCCACGCGCCTCTCCATGCGCCACCACCTCTCCGTCACCGTTTTCATCGTCCCTTCCCGCTCCTCCTCCGCCGAATACAAAGTCATCGCCGCCGCTCAAGCTGCCGGTCTCTTCACTGTCATTGAACTTCCTCCGGCTGACATGTCCGACGTCACTGACTCCACCGTCGTCGGTCGCCTCTCCATCACCATGCGTCGCCACGTCCCGGCTCTCCGCTCCGCCGTCTCCGCCCTTACCTCTCTCCCCTCCGTCCTCATCGCCGACATCTTCGCAACCGAGTCCTTCGCTGTCGCCGACGAGTTCCACATGGCCAAATACGTCTTCGTCGCCTCTAACGCATGGTTCTTTGCCTTAACCATCTACGTCCCGGTTCTCGATAAGCAAATCAACGGTCAGTACGTGGACCAGAAAGAACCGTTCCATATCCCTGGTTGCGAACCGGTTCGACCCTGCGACGTTATGGACCCACTTCTGGACCGGACCGAACCACAGTATTTCGAATACGTCAGAATCGGGACGGAGATACCATCGAGCGACGGCGTTTTGGTTAACACGTGGGATGACTTGGAAGGTCGCACGCTTGCATCTTTCAGAGATTGGAATTTGTTGGGCCGTATTATGAAGTCGCCGGTTTATTCTATTGGACCGATCGTACGGCAGACCGGTGGGAAGAAAGGCGGTGCGAGTGAGTTGTTCAACTGGCTGAGTAAGCAACCCGGTGAGTCAGTGATATACGTGTCGTTTGGGAGTGGTGGAACGCTGTCGTCTGAGCAAATGACGGAAGTGGCTCACGGATTGGAAATGAGCGGGCAGAGATTTGTTTGGGTGGTACGCGCTCCAAAGGTAAGATCGGACGCGACGTTTTTCACGACCGGCGATGGGACTGAGGACCAATCAGAGGCGAAGTTTCTGCCAGATGGGTTTTTGGAGCGGACGTCGGAGGTGGGGTTTGTGGTGTCGATGTGGGCGGATCAGACGGCGGTGTTGGGGAGTCCAGCGGTGGGGGGTTTTTTCACGCACGGCGGATGGAACTCAGCGTTGGAAGGGATTACGAACGGAGTTCCAATGGTTGTGTGGCCGTTATATGCGGAACAGCGGATGAACGCCACGATGCTGGCGGAAGAGGTGCGGGTGGCCGTCCGACCAAAGGAGCTGCCGACGAAGGCGGTGATCGGAAGGGAGGAGATTGCAGCGATGGTGAGGAAGATAATGGCGGAGGAGGATGAAGAAGGGAAAGCCATTAGAGCAAAGGCTAAGGAACTTCAACGGAGTGCAGAGAAGTCCACGGCGGAAGGTGGGTCGTCGTTCGAGAACTTCGCTCGAGTGGTGAAATTATGGCGTGAAAATCGAGTTTGA
Protein sequence
MESQPHVALISSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSAEYKVIAAAQAAGLFTVIELPPADMSDVTDSTVVGRLSITMRRHVPALRSAVSALTSLPSVLIADIFATESFAVADEFHMAKYVFVASNAWFFALTIYVPVLDKQINGQYVDQKEPFHIPGCEPVRPCDVMDPLLDRTEPQYFEYVRIGTEIPSSDGVLVNTWDDLEGRTLASFRDWNLLGRIMKSPVYSIGPIVRQTGGKKGGASELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWVVRAPKVRSDATFFTTGDGTEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGGFFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEIAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVKLWRENRV
Homology
BLAST of CmaCh16G006640 vs. ExPASy Swiss-Prot
Match:
Q40287 (Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta OX=3983 GN=GT5 PE=2 SV=1)
HSP 1 Score: 491.1 bits (1263), Expect = 1.4e-137
Identity = 255/476 (53.57%), Postives = 332/476 (69.75%), Query Frame = 0
Query: 1 MESQPHVALISSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSAEYKVIAAAQAAG 60
+ S+PH+ L+SSPG+GHL P LEL R+ + VT+F+V S +S+AE +V+ +A
Sbjct: 6 LNSKPHIVLLSSPGLGHLIPVLELGKRIVTLCNFDVTIFMVGSDTSAAEPQVLRSAMTPK 65
Query: 61 LFTVIELPPADMSDVTD--STVVGRLSITMRRHVPALRSAVSALTSLPSVLIADIFATES 120
L +I+LPP ++S + D +TV RL + MR PA R+AVSAL P+ +I D+F TES
Sbjct: 66 LCEIIQLPPPNISCLIDPEATVCTRLFVLMREIRPAFRAAVSALKFRPAAIIVDLFGTES 125
Query: 121 FAVADEFHMAKYVFVASNAWFFALTIYVPVLDKQINGQYVDQKEPFHIPGCEPVRPCDVM 180
VA E +AKYV++ASNAWF ALTIYVP+LDK++ G++V QKEP IPGC PVR +V+
Sbjct: 126 LEVAKELGIAKYVYIASNAWFLALTIYVPILDKEVEGEFVLQKEPMKIPGCRPVRTEEVV 185
Query: 181 DPLLDRTEPQYFEYVRIGTEIPSSDGVLVNTWDDLEGRTLASFRDWNLLGRIMKSPVYSI 240
DP+LDRT QY EY R+G EIP++DG+L+NTW+ LE T + RD LGR+ K PV+ I
Sbjct: 186 DPMLDRTNQQYSEYFRLGIEIPTADGILMNTWEALEPTTFGALRDVKFLGRVAKVPVFPI 245
Query: 241 GPIVRQTGGKKGGASELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFV 300
GP+ RQ G G EL +WL +QP ESV+YVSFGSGGTLS EQM E+A GLE S QRF+
Sbjct: 246 GPLRRQ-AGPCGSNCELLDWLDQQPKESVVYVSFGSGGTLSLEQMIELAWGLERSQQRFI 305
Query: 301 WVVRAPKVRS-DATFFTTGDGTEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPA 360
WVVR P V++ DA FFT GDG +D S + P+GFL R VG VV W+ Q ++ P+
Sbjct: 306 WVVRQPTVKTGDAAFFTQGDGADDMS--GYFPEGFLTRIQNVGLVVPQWSPQIHIMSHPS 365
Query: 361 VGGFFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGR 420
VG F +H GWNS LE IT GVP++ WP+YAEQRMNAT+L EE+ VAVRPK LP K V+ R
Sbjct: 366 VGVFLSHCGWNSVLESITAGVPIIAWPIYAEQRMNATLLTEELGVAVRPKNLPAKEVVKR 425
Query: 421 EEIAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVKLWREN 474
EEI M+R+IM DEEG IR + +EL+ S EK+ EGGSSF + + W ++
Sbjct: 426 EEIERMIRRIMV--DEEGSEIRKRVRELKDSGEKALNEGGSSFNYMSALGNEWEKS 476
BLAST of CmaCh16G006640 vs. ExPASy Swiss-Prot
Match:
Q9ZU72 (UDP-glycosyltransferase 72D1 OS=Arabidopsis thaliana OX=3702 GN=UGT72D1 PE=2 SV=1)
HSP 1 Score: 438.3 bits (1126), Expect = 1.1e-121
Identity = 231/463 (49.89%), Postives = 319/463 (68.90%), Query Frame = 0
Query: 4 QPHVALISSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSS-AEYKVIAAAQAAGLF 63
QPH L++SPG+GHL P LEL RLS ++ VT+ V S SSS E + I AA A +
Sbjct: 3 QPHALLVASPGLGHLIPILELGNRLSSVLNIHVTILAVTSGSSSPTETEAIHAAAARTIC 62
Query: 64 TVIELPPADMSDVT--DSTVVGRLSITMRRHVPALRSAVSALTSLPSVLIADIFATESFA 123
+ E+P D+ ++ D+T+ ++ + MR PA+R AV + P+V+I D TE +
Sbjct: 63 QITEIPSVDVDNLVEPDATIFTKMVVKMRAMKPAVRDAVKLMKRKPTVMIVDFLGTELMS 122
Query: 124 VADEFHM-AKYVFVASNAWFFALTIYVPVLDKQINGQYVDQKEPFHIPGCEPVRPCDVMD 183
VAD+ M AKYV+V ++AWF A+ +Y+PVLD + G+YVD KEP IPGC+PV P ++M+
Sbjct: 123 VADDVGMTAKYVYVPTHAWFLAVMVYLPVLDTVVEGEYVDIKEPLKIPGCKPVGPKELME 182
Query: 184 PLLDRTEPQYFEYVRIGTEIPSSDGVLVNTWDDLEGRTLASFRDWNLLGRIMKSPVYSIG 243
+LDR+ QY E VR G E+P SDGVLVNTW++L+G TLA+ R+ L R+MK PVY IG
Sbjct: 183 TMLDRSGQQYKECVRAGLEVPMSDGVLVNTWEELQGNTLAALREDEELSRVMKVPVYPIG 242
Query: 244 PIVRQTGGKKGGASELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVW 303
PIVR T + +F WL +Q SV++V GSGGTL+ EQ E+A GLE+SGQRFVW
Sbjct: 243 PIVR-TNQHVDKPNSIFEWLDEQRERSVVFVCLGSGGTLTFEQTVELALGLELSGQRFVW 302
Query: 304 VVRAPKVRSDATFFTTGDGTEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVG 363
V+R P + ++D+ + LP+GFL+RT VG VV+ WA Q +L ++G
Sbjct: 303 VLRRP------ASYLGAISSDDEQVSASLPEGFLDRTRGVGIVVTQWAPQVEILSHRSIG 362
Query: 364 GFFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREE 423
GF +H GW+SALE +T GVP++ WPLYAEQ MNAT+L EE+ VAVR ELP++ VIGREE
Sbjct: 363 GFLSHCGWSSALESLTKGVPIIAWPLYAEQWMNATLLTEEIGVAVRTSELPSERVIGREE 422
Query: 424 IAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFEN 463
+A++VRKIMAEEDEEG+ IRAKA+E++ S+E++ ++ GSS+ +
Sbjct: 423 VASLVRKIMAEEDEEGQKIRAKAEEVRVSSERAWSKDGSSYNS 458
BLAST of CmaCh16G006640 vs. ExPASy Swiss-Prot
Match:
Q94A84 (UDP-glycosyltransferase 72E1 OS=Arabidopsis thaliana OX=3702 GN=UGT72E1 PE=1 SV=1)
HSP 1 Score: 402.1 bits (1032), Expect = 8.5e-111
Identity = 216/474 (45.57%), Postives = 310/474 (65.40%), Query Frame = 0
Query: 3 SQPHVALISSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSAEYKVIAAAQA-AGL 62
++PHVA+ +SPGMGH+ P +EL RL+ H VT+F++ + ++SA+ + + + A L
Sbjct: 4 TKPHVAMFASPGMGHIIPVIELGKRLAGSHGFDVTIFVLETDAASAQSQFLNSPGCDAAL 63
Query: 63 FTVIELPPADMSDVTD-STVVG-RLSITMRRHVPALRSAVSALTSLPSVLIADIFATESF 122
++ LP D+S + D S G +L + MR +P +RS + + P+ LI D+F ++
Sbjct: 64 VDIVGLPTPDISGLVDPSAFFGIKLLVMMRETIPTIRSKIEEMQHKPTALIVDLFGLDAI 123
Query: 123 AVADEFHMAKYVFVASNAWFFALTIYVPVLDKQINGQYVDQKEPFHIPGCEPVRPCDVMD 182
+ EF+M Y+F+ASNA F A+ ++ P LDK + +++ +K+P +PGCEPVR D ++
Sbjct: 124 PLGGEFNMLTYIFIASNARFLAVALFFPTLDKDMEEEHIIKKQPMVMPGCEPVRFEDTLE 183
Query: 183 PLLDRTEPQYFEYVRIGTEIPSSDGVLVNTWDDLEGRTLASFRDWNLLGRIMKSPVYSIG 242
LD Y E+V G+ P+ DG++VNTWDD+E +TL S +D LLGRI PVY IG
Sbjct: 184 TFLDPNSQLYREFVPFGSVFPTCDGIIVNTWDDMEPKTLKSLQDPKLLGRIAGVPVYPIG 243
Query: 243 PIVRQTGGKKGGASELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVW 302
P+ R K + +WL+KQP ESV+Y+SFGSGG+LS++Q+TE+A GLEMS QRFVW
Sbjct: 244 PLSRPVDPSKTN-HPVLDWLNKQPDESVLYISFGSGGSLSAKQLTELAWGLEMSQQRFVW 303
Query: 303 VVRAPKVRSDATFFTTG------DGTEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVL 362
VVR P S + + + DGT D +LP+GF+ RT E GF+VS WA Q +L
Sbjct: 304 VVRPPVDGSACSAYLSANSGKIRDGTPD-----YLPEGFVSRTHERGFMVSSWAPQAEIL 363
Query: 363 GSPAVGGFFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKA 422
AVGGF TH GWNS LE + GVPM+ WPL+AEQ MNAT+L EE+ VAVR K+LP++
Sbjct: 364 AHQAVGGFLTHCGWNSILESVVGGVPMIAWPLFAEQMMNATLLNEELGVAVRSKKLPSEG 423
Query: 423 VIGREEIAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKS-TAEGGSSFENFARV 467
VI R EI A+VRKIM E EEG +R K K+L+ +A +S + +GG + E+ +R+
Sbjct: 424 VITRAEIEALVRKIMVE--EEGAEMRKKIKKLKETAAESLSCDGGVAHESLSRI 469
BLAST of CmaCh16G006640 vs. ExPASy Swiss-Prot
Match:
O81498 (UDP-glycosyltransferase 72E3 OS=Arabidopsis thaliana OX=3702 GN=UGT72E3 PE=1 SV=1)
HSP 1 Score: 389.8 bits (1000), Expect = 4.4e-107
Identity = 208/471 (44.16%), Postives = 308/471 (65.39%), Query Frame = 0
Query: 3 SQPHVALISSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSAEYKVIAAAQAAGLF 62
++PH A+ SSPGMGH+ P +ELA RLS H VTVF++ + ++S + K++ + G+
Sbjct: 4 TKPHAAMFSSPGMGHVLPVIELAKRLSANHGFHVTVFVLETDAASVQSKLL---NSTGV- 63
Query: 63 TVIELPPADMSDVTD--STVVGRLSITMRRHVPALRSAVSALTSLPSVLIADIFATESFA 122
++ LP D+S + D + VV ++ + MR VP LRS + A+ P+ LI D+F T++
Sbjct: 64 DIVNLPSPDISGLVDPNAHVVTKIGVIMREAVPTLRSKIVAMHQNPTALIIDLFGTDALC 123
Query: 123 VADEFHMAKYVFVASNAWFFALTIYVPVLDKQINGQYVDQKEPFHIPGCEPVRPCDVMDP 182
+A E +M YVF+ASNA + ++IY P LD+ I ++ Q++P IPGCEPVR D+MD
Sbjct: 124 LAAELNMLTYVFIASNARYLGVSIYYPTLDEVIKEEHTVQRKPLTIPGCEPVRFEDIMDA 183
Query: 183 LLDRTEPQYFEYVRIGTEIPSSDGVLVNTWDDLEGRTLASFRDWNLLGRIMKSPVYSIGP 242
L EP Y + VR P +DG+LVNTW+++E ++L S +D LLGR+ + PVY +GP
Sbjct: 184 YLVPDEPVYHDLVRHCLAYPKADGILVNTWEEMEPKSLKSLQDPKLLGRVARVPVYPVGP 243
Query: 243 IVRQTGGKKGGASELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 302
+ R +F+WL+KQP ESV+Y+SFGSGG+L+++Q+TE+A GLE S QRF+WV
Sbjct: 244 LCRPIQSSTTD-HPVFDWLNKQPNESVLYISFGSGGSLTAQQLTELAWGLEESQQRFIWV 303
Query: 303 VRAPKVRSDAT-FFTTGDGTEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVG 362
VR P S + +F+ G + ++LP+GF+ RT + GF++ WA Q +L AVG
Sbjct: 304 VRPPVDGSSCSDYFSAKGGVTKDNTPEYLPEGFVTRTCDRGFMIPSWAPQAEILAHQAVG 363
Query: 363 GFFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREE 422
GF TH GW+S LE + GVPM+ WPL+AEQ MNA +L++E+ ++VR + K I R +
Sbjct: 364 GFLTHCGWSSTLESVLCGVPMIAWPLFAEQNMNAALLSDELGISVRVDD--PKEAISRSK 423
Query: 423 IAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTA--EGGSSFENFARVVK 469
I AMVRK+MAE +EG+ +R K K+L+ +AE S + GGS+ E+ RV K
Sbjct: 424 IEAMVRKVMAE--DEGEEMRRKVKKLRDTAEMSLSIHGGGSAHESLCRVTK 465
BLAST of CmaCh16G006640 vs. ExPASy Swiss-Prot
Match:
Q9LVR1 (UDP-glycosyltransferase 72E2 OS=Arabidopsis thaliana OX=3702 GN=UGT72E2 PE=1 SV=1)
HSP 1 Score: 388.3 bits (996), Expect = 1.3e-106
Identity = 214/472 (45.34%), Postives = 309/472 (65.47%), Query Frame = 0
Query: 3 SQPHVALISSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSAEYKVIAAAQAAGLF 62
++PH A+ SSPGMGH+ P +EL RLS + VTVF++ + ++SA+ K + + G+
Sbjct: 4 TKPHAAMFSSPGMGHVIPVIELGKRLSANNGFHVTVFVLETDAASAQSKFL---NSTGV- 63
Query: 63 TVIELPPADMSDVT--DSTVVGRLSITMRRHVPALRSAVSALTSLPSVLIADIFATESFA 122
+++LP D+ + D VV ++ + MR VPALRS ++A+ P+ LI D+F T++
Sbjct: 64 DIVKLPSPDIYGLVDPDDHVVTKIGVIMRAAVPALRSKIAAMHQKPTALIVDLFGTDALC 123
Query: 123 VADEFHMAKYVFVASNAWFFALTIYVPVLDKQINGQYVDQKEPFHIPGCEPVRPCDVMDP 182
+A EF+M YVF+ +NA F ++IY P LDK I ++ Q+ P IPGCEPVR D +D
Sbjct: 124 LAKEFNMLSYVFIPTNARFLGVSIYYPNLDKDIKEEHTVQRNPLAIPGCEPVRFEDTLDA 183
Query: 183 LLDRTEPQYFEYVRIGTEIPSSDGVLVNTWDDLEGRTLASFRDWNLLGRIMKSPVYSIGP 242
L EP Y ++VR G P +DG+LVNTW+++E ++L S + LLGR+ + PVY IGP
Sbjct: 184 YLVPDEPVYRDFVRHGLAYPKADGILVNTWEEMEPKSLKSLLNPKLLGRVARVPVYPIGP 243
Query: 243 IVRQTGGKKGGASELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 302
+ R + + +WL++QP ESV+Y+SFGSGG LS++Q+TE+A GLE S QRFVWV
Sbjct: 244 LCRPIQSSETD-HPVLDWLNEQPNESVLYISFGSGGCLSAKQLTELAWGLEQSQQRFVWV 303
Query: 303 VRAPKVRSDATFFTT--GDGTEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAV 362
VR P S + + + G GTED + ++LP+GF+ RTS+ GFVV WA Q +L AV
Sbjct: 304 VRPPVDGSCCSEYVSANGGGTEDNT-PEYLPEGFVSRTSDRGFVVPSWAPQAEILSHRAV 363
Query: 363 GGFFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGRE 422
GGF TH GW+S LE + GVPM+ WPL+AEQ MNA +L++E+ +AVR + K I R
Sbjct: 364 GGFLTHCGWSSTLESVVGGVPMIAWPLFAEQNMNAALLSDELGIAVRLDD--PKEDISRW 423
Query: 423 EIAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTA--EGGSSFENFARVVK 469
+I A+VRK+M E +EG+A+R K K+L+ SAE S + GG + E+ RV K
Sbjct: 424 KIEALVRKVMTE--KEGEAMRRKVKKLRDSAEMSLSIDGGGLAHESLCRVTK 465
BLAST of CmaCh16G006640 vs. TAIR 10
Match:
AT2G18570.1 (UDP-Glycosyltransferase superfamily protein )
HSP 1 Score: 438.3 bits (1126), Expect = 7.6e-123
Identity = 231/463 (49.89%), Postives = 319/463 (68.90%), Query Frame = 0
Query: 4 QPHVALISSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSS-AEYKVIAAAQAAGLF 63
QPH L++SPG+GHL P LEL RLS ++ VT+ V S SSS E + I AA A +
Sbjct: 3 QPHALLVASPGLGHLIPILELGNRLSSVLNIHVTILAVTSGSSSPTETEAIHAAAARTIC 62
Query: 64 TVIELPPADMSDVT--DSTVVGRLSITMRRHVPALRSAVSALTSLPSVLIADIFATESFA 123
+ E+P D+ ++ D+T+ ++ + MR PA+R AV + P+V+I D TE +
Sbjct: 63 QITEIPSVDVDNLVEPDATIFTKMVVKMRAMKPAVRDAVKLMKRKPTVMIVDFLGTELMS 122
Query: 124 VADEFHM-AKYVFVASNAWFFALTIYVPVLDKQINGQYVDQKEPFHIPGCEPVRPCDVMD 183
VAD+ M AKYV+V ++AWF A+ +Y+PVLD + G+YVD KEP IPGC+PV P ++M+
Sbjct: 123 VADDVGMTAKYVYVPTHAWFLAVMVYLPVLDTVVEGEYVDIKEPLKIPGCKPVGPKELME 182
Query: 184 PLLDRTEPQYFEYVRIGTEIPSSDGVLVNTWDDLEGRTLASFRDWNLLGRIMKSPVYSIG 243
+LDR+ QY E VR G E+P SDGVLVNTW++L+G TLA+ R+ L R+MK PVY IG
Sbjct: 183 TMLDRSGQQYKECVRAGLEVPMSDGVLVNTWEELQGNTLAALREDEELSRVMKVPVYPIG 242
Query: 244 PIVRQTGGKKGGASELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVW 303
PIVR T + +F WL +Q SV++V GSGGTL+ EQ E+A GLE+SGQRFVW
Sbjct: 243 PIVR-TNQHVDKPNSIFEWLDEQRERSVVFVCLGSGGTLTFEQTVELALGLELSGQRFVW 302
Query: 304 VVRAPKVRSDATFFTTGDGTEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVG 363
V+R P + ++D+ + LP+GFL+RT VG VV+ WA Q +L ++G
Sbjct: 303 VLRRP------ASYLGAISSDDEQVSASLPEGFLDRTRGVGIVVTQWAPQVEILSHRSIG 362
Query: 364 GFFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREE 423
GF +H GW+SALE +T GVP++ WPLYAEQ MNAT+L EE+ VAVR ELP++ VIGREE
Sbjct: 363 GFLSHCGWSSALESLTKGVPIIAWPLYAEQWMNATLLTEEIGVAVRTSELPSERVIGREE 422
Query: 424 IAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFEN 463
+A++VRKIMAEEDEEG+ IRAKA+E++ S+E++ ++ GSS+ +
Sbjct: 423 VASLVRKIMAEEDEEGQKIRAKAEEVRVSSERAWSKDGSSYNS 458
BLAST of CmaCh16G006640 vs. TAIR 10
Match:
AT3G50740.1 (UDP-glucosyl transferase 72E1 )
HSP 1 Score: 402.1 bits (1032), Expect = 6.1e-112
Identity = 216/474 (45.57%), Postives = 310/474 (65.40%), Query Frame = 0
Query: 3 SQPHVALISSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSAEYKVIAAAQA-AGL 62
++PHVA+ +SPGMGH+ P +EL RL+ H VT+F++ + ++SA+ + + + A L
Sbjct: 4 TKPHVAMFASPGMGHIIPVIELGKRLAGSHGFDVTIFVLETDAASAQSQFLNSPGCDAAL 63
Query: 63 FTVIELPPADMSDVTD-STVVG-RLSITMRRHVPALRSAVSALTSLPSVLIADIFATESF 122
++ LP D+S + D S G +L + MR +P +RS + + P+ LI D+F ++
Sbjct: 64 VDIVGLPTPDISGLVDPSAFFGIKLLVMMRETIPTIRSKIEEMQHKPTALIVDLFGLDAI 123
Query: 123 AVADEFHMAKYVFVASNAWFFALTIYVPVLDKQINGQYVDQKEPFHIPGCEPVRPCDVMD 182
+ EF+M Y+F+ASNA F A+ ++ P LDK + +++ +K+P +PGCEPVR D ++
Sbjct: 124 PLGGEFNMLTYIFIASNARFLAVALFFPTLDKDMEEEHIIKKQPMVMPGCEPVRFEDTLE 183
Query: 183 PLLDRTEPQYFEYVRIGTEIPSSDGVLVNTWDDLEGRTLASFRDWNLLGRIMKSPVYSIG 242
LD Y E+V G+ P+ DG++VNTWDD+E +TL S +D LLGRI PVY IG
Sbjct: 184 TFLDPNSQLYREFVPFGSVFPTCDGIIVNTWDDMEPKTLKSLQDPKLLGRIAGVPVYPIG 243
Query: 243 PIVRQTGGKKGGASELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVW 302
P+ R K + +WL+KQP ESV+Y+SFGSGG+LS++Q+TE+A GLEMS QRFVW
Sbjct: 244 PLSRPVDPSKTN-HPVLDWLNKQPDESVLYISFGSGGSLSAKQLTELAWGLEMSQQRFVW 303
Query: 303 VVRAPKVRSDATFFTTG------DGTEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVL 362
VVR P S + + + DGT D +LP+GF+ RT E GF+VS WA Q +L
Sbjct: 304 VVRPPVDGSACSAYLSANSGKIRDGTPD-----YLPEGFVSRTHERGFMVSSWAPQAEIL 363
Query: 363 GSPAVGGFFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKA 422
AVGGF TH GWNS LE + GVPM+ WPL+AEQ MNAT+L EE+ VAVR K+LP++
Sbjct: 364 AHQAVGGFLTHCGWNSILESVVGGVPMIAWPLFAEQMMNATLLNEELGVAVRSKKLPSEG 423
Query: 423 VIGREEIAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKS-TAEGGSSFENFARV 467
VI R EI A+VRKIM E EEG +R K K+L+ +A +S + +GG + E+ +R+
Sbjct: 424 VITRAEIEALVRKIMVE--EEGAEMRKKIKKLKETAAESLSCDGGVAHESLSRI 469
BLAST of CmaCh16G006640 vs. TAIR 10
Match:
AT5G26310.1 (UDP-Glycosyltransferase superfamily protein )
HSP 1 Score: 389.8 bits (1000), Expect = 3.1e-108
Identity = 208/471 (44.16%), Postives = 308/471 (65.39%), Query Frame = 0
Query: 3 SQPHVALISSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSAEYKVIAAAQAAGLF 62
++PH A+ SSPGMGH+ P +ELA RLS H VTVF++ + ++S + K++ + G+
Sbjct: 4 TKPHAAMFSSPGMGHVLPVIELAKRLSANHGFHVTVFVLETDAASVQSKLL---NSTGV- 63
Query: 63 TVIELPPADMSDVTD--STVVGRLSITMRRHVPALRSAVSALTSLPSVLIADIFATESFA 122
++ LP D+S + D + VV ++ + MR VP LRS + A+ P+ LI D+F T++
Sbjct: 64 DIVNLPSPDISGLVDPNAHVVTKIGVIMREAVPTLRSKIVAMHQNPTALIIDLFGTDALC 123
Query: 123 VADEFHMAKYVFVASNAWFFALTIYVPVLDKQINGQYVDQKEPFHIPGCEPVRPCDVMDP 182
+A E +M YVF+ASNA + ++IY P LD+ I ++ Q++P IPGCEPVR D+MD
Sbjct: 124 LAAELNMLTYVFIASNARYLGVSIYYPTLDEVIKEEHTVQRKPLTIPGCEPVRFEDIMDA 183
Query: 183 LLDRTEPQYFEYVRIGTEIPSSDGVLVNTWDDLEGRTLASFRDWNLLGRIMKSPVYSIGP 242
L EP Y + VR P +DG+LVNTW+++E ++L S +D LLGR+ + PVY +GP
Sbjct: 184 YLVPDEPVYHDLVRHCLAYPKADGILVNTWEEMEPKSLKSLQDPKLLGRVARVPVYPVGP 243
Query: 243 IVRQTGGKKGGASELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 302
+ R +F+WL+KQP ESV+Y+SFGSGG+L+++Q+TE+A GLE S QRF+WV
Sbjct: 244 LCRPIQSSTTD-HPVFDWLNKQPNESVLYISFGSGGSLTAQQLTELAWGLEESQQRFIWV 303
Query: 303 VRAPKVRSDAT-FFTTGDGTEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVG 362
VR P S + +F+ G + ++LP+GF+ RT + GF++ WA Q +L AVG
Sbjct: 304 VRPPVDGSSCSDYFSAKGGVTKDNTPEYLPEGFVTRTCDRGFMIPSWAPQAEILAHQAVG 363
Query: 363 GFFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREE 422
GF TH GW+S LE + GVPM+ WPL+AEQ MNA +L++E+ ++VR + K I R +
Sbjct: 364 GFLTHCGWSSTLESVLCGVPMIAWPLFAEQNMNAALLSDELGISVRVDD--PKEAISRSK 423
Query: 423 IAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTA--EGGSSFENFARVVK 469
I AMVRK+MAE +EG+ +R K K+L+ +AE S + GGS+ E+ RV K
Sbjct: 424 IEAMVRKVMAE--DEGEEMRRKVKKLRDTAEMSLSIHGGGSAHESLCRVTK 465
BLAST of CmaCh16G006640 vs. TAIR 10
Match:
AT5G66690.1 (UDP-Glycosyltransferase superfamily protein )
HSP 1 Score: 388.3 bits (996), Expect = 9.1e-108
Identity = 214/472 (45.34%), Postives = 309/472 (65.47%), Query Frame = 0
Query: 3 SQPHVALISSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSAEYKVIAAAQAAGLF 62
++PH A+ SSPGMGH+ P +EL RLS + VTVF++ + ++SA+ K + + G+
Sbjct: 4 TKPHAAMFSSPGMGHVIPVIELGKRLSANNGFHVTVFVLETDAASAQSKFL---NSTGV- 63
Query: 63 TVIELPPADMSDVT--DSTVVGRLSITMRRHVPALRSAVSALTSLPSVLIADIFATESFA 122
+++LP D+ + D VV ++ + MR VPALRS ++A+ P+ LI D+F T++
Sbjct: 64 DIVKLPSPDIYGLVDPDDHVVTKIGVIMRAAVPALRSKIAAMHQKPTALIVDLFGTDALC 123
Query: 123 VADEFHMAKYVFVASNAWFFALTIYVPVLDKQINGQYVDQKEPFHIPGCEPVRPCDVMDP 182
+A EF+M YVF+ +NA F ++IY P LDK I ++ Q+ P IPGCEPVR D +D
Sbjct: 124 LAKEFNMLSYVFIPTNARFLGVSIYYPNLDKDIKEEHTVQRNPLAIPGCEPVRFEDTLDA 183
Query: 183 LLDRTEPQYFEYVRIGTEIPSSDGVLVNTWDDLEGRTLASFRDWNLLGRIMKSPVYSIGP 242
L EP Y ++VR G P +DG+LVNTW+++E ++L S + LLGR+ + PVY IGP
Sbjct: 184 YLVPDEPVYRDFVRHGLAYPKADGILVNTWEEMEPKSLKSLLNPKLLGRVARVPVYPIGP 243
Query: 243 IVRQTGGKKGGASELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 302
+ R + + +WL++QP ESV+Y+SFGSGG LS++Q+TE+A GLE S QRFVWV
Sbjct: 244 LCRPIQSSETD-HPVLDWLNEQPNESVLYISFGSGGCLSAKQLTELAWGLEQSQQRFVWV 303
Query: 303 VRAPKVRSDATFFTT--GDGTEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAV 362
VR P S + + + G GTED + ++LP+GF+ RTS+ GFVV WA Q +L AV
Sbjct: 304 VRPPVDGSCCSEYVSANGGGTEDNT-PEYLPEGFVSRTSDRGFVVPSWAPQAEILSHRAV 363
Query: 363 GGFFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGRE 422
GGF TH GW+S LE + GVPM+ WPL+AEQ MNA +L++E+ +AVR + K I R
Sbjct: 364 GGFLTHCGWSSTLESVVGGVPMIAWPLFAEQNMNAALLSDELGIAVRLDD--PKEDISRW 423
Query: 423 EIAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTA--EGGSSFENFARVVK 469
+I A+VRK+M E +EG+A+R K K+L+ SAE S + GG + E+ RV K
Sbjct: 424 KIEALVRKVMTE--KEGEAMRRKVKKLRDSAEMSLSIDGGGLAHESLCRVTK 465
BLAST of CmaCh16G006640 vs. TAIR 10
Match:
AT2G18560.1 (UDP-Glycosyltransferase superfamily protein )
HSP 1 Score: 364.0 bits (933), Expect = 1.8e-100
Identity = 185/372 (49.73%), Postives = 255/372 (68.55%), Query Frame = 0
Query: 88 MRRHVPALRSAVSALTSLPSVLIADIFATESFAVADEFHMAKYVFVASNAWFFALTIYVP 147
MR +R AV ++ P+V+I D F T ++ D +KYV++ S+AWF AL +Y+P
Sbjct: 1 MREMKSTVRDAVKSMKQKPTVMIVDFFGTALLSITDVGVTSKYVYIPSHAWFLALIVYLP 60
Query: 148 VLDKQINGQYVDQKEPFHIPGCEPVRPCDVMDPLLDRTEPQYFEYVRIGTEIPSSDGVLV 207
VLDK + G+YVD KEP IPGC+PV P +++D +LDR++ QY + V+IG EIP SDGVLV
Sbjct: 61 VLDKVMEGEYVDIKEPMKIPGCKPVGPKELLDTMLDRSDQQYRDCVQIGLEIPMSDGVLV 120
Query: 208 NTWDDLEGRTLASFRDWNLLGRIMKSPVYSIGPIVRQTGGKKGGASELFNWLSKQPGESV 267
NTW +L+G+TLA+ R+ L R++K PVY IGPIVR T + F WL KQ SV
Sbjct: 121 NTWGELQGKTLAALREDIDLNRVIKVPVYPIGPIVR-TNVLIEKPNSTFEWLDKQEERSV 180
Query: 268 IYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWVVRAPKVRSDATFFTTGDGTEDQSEAKF 327
+YV GSGGTLS EQ E+A GLE+S Q F+WV+R P + +D +
Sbjct: 181 VYVCLGSGGTLSFEQTMELAWGLELSCQSFLWVLRKP------PSYLGASSKDDDQVSDG 240
Query: 328 LPDGFLERTSEVGFVVSMWADQTAVLGSPAVGGFFTHGGWNSALEGITNGVPMVVWPLYA 387
LP+GFL+RT VG VV+ WA Q +L ++GGF +H GW+S LE +T GVP++ WPLYA
Sbjct: 241 LPEGFLDRTRGVGLVVTQWAPQVEILSHRSIGGFLSHCGWSSVLESLTKGVPIIAWPLYA 300
Query: 388 EQRMNATMLAEEVRVAVRPKELPTKAVIGREEIAAMVRKIMAEEDEEGKAIRAKAKELQR 447
EQ MNAT+L EE+ +A+R ELP+K VI REE+A++V+KI+AEED+EG+ I+ KA+E++
Sbjct: 301 EQWMNATLLTEEIGMAIRTSELPSKKVISREEVASLVKKIVAEEDKEGRKIKTKAEEVRV 360
Query: 448 SAEKSTAEGGSS 460
S+E++ GGSS
Sbjct: 361 SSERAWTHGGSS 365
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q40287 | 1.4e-137 | 53.57 | Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta OX=3983 GN=GT5 PE=2... | [more] |
Q9ZU72 | 1.1e-121 | 49.89 | UDP-glycosyltransferase 72D1 OS=Arabidopsis thaliana OX=3702 GN=UGT72D1 PE=2 SV=... | [more] |
Q94A84 | 8.5e-111 | 45.57 | UDP-glycosyltransferase 72E1 OS=Arabidopsis thaliana OX=3702 GN=UGT72E1 PE=1 SV=... | [more] |
O81498 | 4.4e-107 | 44.16 | UDP-glycosyltransferase 72E3 OS=Arabidopsis thaliana OX=3702 GN=UGT72E3 PE=1 SV=... | [more] |
Q9LVR1 | 1.3e-106 | 45.34 | UDP-glycosyltransferase 72E2 OS=Arabidopsis thaliana OX=3702 GN=UGT72E2 PE=1 SV=... | [more] |