Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTGCTCCATCAAAACATTCATGTTCTAATGGCCTCAGCTGCCTTACAAGGCCATTTGAACCCCATGCTCAAGTTCGCCAAATGCCTCATCTCAAAGGGAATTCATGTCACCATTGCCACCACCGACCTTGCCCGCCACCGTATGCTCGAACACACTACCACCTCCGATCCCACCAACCCATCAATCCAATTCGAGTTCTTCTCCGATGGACTCGACCTTGATTTCGACAGGGACGCCAACTCTGACATTTTTATTGACTCTCTAAGAACCAAAGCTAGCAAAAACCTCTCAAATCTCATTGCCAAATTATCCCAAAGTATAAAATTCTCGTGCCTAATCCTCCACCAATTTGTGCCTTGGTTCATACCCATTGCTAAAGAAAATAACCTCCCTTGCGCCGTGCTTTGGATCCAGCCTTGCGCTCTCTACTCCATCTACTATCGCTTCTTCAACAAACTCAACGAGTTTGCCATTTTACAAAATAAGAACCAGCCTCTCCAATTACCCGGCCTGCCATTACTCAACTTCGAAGACCTCCCTACCTTCATCCACCTCAACGCTTATCTTTGCTTCCAAAAACTCCTCTCAGAGTTTTTCAGCTTCTTGGATGACGTGAATTGGGTTTTGGGGACTTCGTTCGATGAGCTGGAGGAGGAGGTTTTGAGGTCCATGAACGGTGGGGTTGTTCGGCCGATGATTTCGCCAATCGGACCATTAGTATCGCCATTTTTGCTTGGGAAGGAGGAGAAAGTGGGGGAGGGTGGGCTGAGCGCAGACATGTGGAAGGCGGATGATTCTTGCCTGCAATGGCTGGATGGGCGGGACATGGGGTCAGTTGTTTATGTGTCGTTTGGGAGTATTATTGTTTTGTCTCAAGAGCAAGTTGACAATATTGCTAATGGGTTGTTGATGAGTGGGAAGCCGTTTCTTTGGGTGTTCAAGCGACCATCGCCTGAGAAATCTACTGAGGGTTGCACTGTTCGGCTGCCGGATGGGTTTCTTGAGGCTGCTGGTGGGAGGGCACTCGTCGTGAATTGGTGCTCTCAAGAACAGGTTTGTTTTTTTGTTGTTTTTTTTTTTTTTTTTTTTTTTTCCCTCTTTGGAGAAGTTTAAAGAGCCTTGGACAATCTGTGCCACCCTGTCTCTCAATAGCATGTGGGAAATAGGGAACATAAATGATTAGCTACCAGCTTCTNCTAATTTAGAATAGGATCATGGGTTTATAAACATAATAGAGAGTTTATAAACAAAGAATACTATCTCTATTATTTGAGGCCTTTTGAAAAACTCCAAAATAAAGCCACAAGAGTTTATATTCAAAATGGACAATATCATATTATTGTGTGAAGTCGAAGTCGTGTTCATCTAACATGTTTAACATTTTGTTCTTGTCAGTGATCGGTAACCCCTACCAAATATAATTCCTTTTAATCAAAAAGAAGAAAAAAACATCAAATATTAGGACATGACTCATCAATTATCATATGGTCTCGCAAATATTTAATATTTAGTACTGGTATTCCTAACAAGGTTCTCAAGCATAAAGCAGTGGGTTGCTTCCTAACTCATTGTGGGTGGAACTCAACGCTTGAAACGGTGGTCGCCGGAGTTCCTGTGATAGCATTTCCGGAGTGGACGGATCAACCAACAAATGCAAAATTGTTAACGGACGTTTTCAAGATGGGTGTGAGGATGAGAAAGGGAGATAATGGAGTTGCTAGTTCAAAAGAAGTGGAGAGATGCATATGGGAGATGACCGATGGCCCTAAAGCAAAGGCAATGGCAAAAAGGGCAGTAGAGTTGATGGAGGCAGCCAAAAGAGCGGTGGAAGACGGTGGTTCTTCTCACCGGAATCTCGATCTATTTATTGCTGACATTTGTTGCAAAAAGGTGACAACTTAA
mRNA sequence
ATGGTGCTCCATCAAAACATTCATGTTCTAATGGCCTCAGCTGCCTTACAAGGCCATTTGAACCCCATGCTCAAGTTCGCCAAATGCCTCATCTCAAAGGGAATTCATGTCACCATTGCCACCACCGACCTTGCCCGCCACCGTATGCTCGAACACACTACCACCTCCGATCCCACCAACCCATCAATCCAATTCGAGTTCTTCTCCGATGGACTCGACCTTGATTTCGACAGGGACGCCAACTCTGACATTTTTATTGACTCTCTAAGAACCAAAGCTAGCAAAAACCTCTCAAATCTCATTGCCAAATTATCCCAAAGTATAAAATTCTCGTGCCTAATCCTCCACCAATTTGTGCCTTGGTTCATACCCATTGCTAAAGAAAATAACCTCCCTTGCGCCGTGCTTTGGATCCAGCCTTGCGCTCTCTACTCCATCTACTATCGCTTCTTCAACAAACTCAACGAGTTTGCCATTTTACAAAATAAGAACCAGCCTCTCCAATTACCCGGCCTGCCATTACTCAACTTCGAAGACCTCCCTACCTTCATCCACCTCAACGCTTATCTTTGCTTCCAAAAACTCCTCTCAGAGTTTTTCAGCTTCTTGGATGACGTGAATTGGGTTTTGGGGACTTCGTTCGATGAGCTGGAGGAGGAGGTTTTGAGGTCCATGAACGGTGGGGTTGTTCGGCCGATGATTTCGCCAATCGGACCATTAGTATCGCCATTTTTGCTTGGGAAGGAGGAGAAAGTGGGGGAGGGTGGGCTGAGCGCAGACATGTGGAAGGCGGATGATTCTTGCCTGCAATGGCTGGATGGGCGGGACATGGGGTCAGTTGTTTATGTGTCGTTTGGGAGTATTATTGTTTTGTCTCAAGAGCAAGTTGACAATATTGCTAATGGGTTGTTGATGAGTGGGAAGCCGTTTCTTTGGGTGTTCAAGCGACCATCGCCTGAGAAATCTACTGAGGGTTGCACTGTTCGGCTGCCGGATGGGTTTCTTGAGGCTGCTGGTGGGAGGGCACTCGTCGTGAATTGGTGCTCTCAAGAACAGGTTCTCAAGCATAAAGCAGTGGGTTGCTTCCTAACTCATTGTGGGTGGAACTCAACGCTTGAAACGGTGGTCGCCGGAGTTCCTGTGATAGCATTTCCGGAGTGGACGGATCAACCAACAAATGCAAAATTGTTAACGGACGTTTTCAAGATGGGTGTGAGGATGAGAAAGGGAGATAATGGAGTTGCTAGTTCAAAAGAAGTGGAGAGATGCATATGGGAGATGACCGATGGCCCTAAAGCAAAGGCAATGGCAAAAAGGGCAGTAGAGTTGATGGAGGCAGCCAAAAGAGCGGTGGAAGACGGTGGTTCTTCTCACCGGAATCTCGATCTATTTATTGCTGACATTTGTTGCAAAAAGGTGACAACTTAA
Coding sequence (CDS)
ATGGTGCTCCATCAAAACATTCATGTTCTAATGGCCTCAGCTGCCTTACAAGGCCATTTGAACCCCATGCTCAAGTTCGCCAAATGCCTCATCTCAAAGGGAATTCATGTCACCATTGCCACCACCGACCTTGCCCGCCACCGTATGCTCGAACACACTACCACCTCCGATCCCACCAACCCATCAATCCAATTCGAGTTCTTCTCCGATGGACTCGACCTTGATTTCGACAGGGACGCCAACTCTGACATTTTTATTGACTCTCTAAGAACCAAAGCTAGCAAAAACCTCTCAAATCTCATTGCCAAATTATCCCAAAGTATAAAATTCTCGTGCCTAATCCTCCACCAATTTGTGCCTTGGTTCATACCCATTGCTAAAGAAAATAACCTCCCTTGCGCCGTGCTTTGGATCCAGCCTTGCGCTCTCTACTCCATCTACTATCGCTTCTTCAACAAACTCAACGAGTTTGCCATTTTACAAAATAAGAACCAGCCTCTCCAATTACCCGGCCTGCCATTACTCAACTTCGAAGACCTCCCTACCTTCATCCACCTCAACGCTTATCTTTGCTTCCAAAAACTCCTCTCAGAGTTTTTCAGCTTCTTGGATGACGTGAATTGGGTTTTGGGGACTTCGTTCGATGAGCTGGAGGAGGAGGTTTTGAGGTCCATGAACGGTGGGGTTGTTCGGCCGATGATTTCGCCAATCGGACCATTAGTATCGCCATTTTTGCTTGGGAAGGAGGAGAAAGTGGGGGAGGGTGGGCTGAGCGCAGACATGTGGAAGGCGGATGATTCTTGCCTGCAATGGCTGGATGGGCGGGACATGGGGTCAGTTGTTTATGTGTCGTTTGGGAGTATTATTGTTTTGTCTCAAGAGCAAGTTGACAATATTGCTAATGGGTTGTTGATGAGTGGGAAGCCGTTTCTTTGGGTGTTCAAGCGACCATCGCCTGAGAAATCTACTGAGGGTTGCACTGTTCGGCTGCCGGATGGGTTTCTTGAGGCTGCTGGTGGGAGGGCACTCGTCGTGAATTGGTGCTCTCAAGAACAGGTTCTCAAGCATAAAGCAGTGGGTTGCTTCCTAACTCATTGTGGGTGGAACTCAACGCTTGAAACGGTGGTCGCCGGAGTTCCTGTGATAGCATTTCCGGAGTGGACGGATCAACCAACAAATGCAAAATTGTTAACGGACGTTTTCAAGATGGGTGTGAGGATGAGAAAGGGAGATAATGGAGTTGCTAGTTCAAAAGAAGTGGAGAGATGCATATGGGAGATGACCGATGGCCCTAAAGCAAAGGCAATGGCAAAAAGGGCAGTAGAGTTGATGGAGGCAGCCAAAAGAGCGGTGGAAGACGGTGGTTCTTCTCACCGGAATCTCGATCTATTTATTGCTGACATTTGTTGCAAAAAGGTGACAACTTAA
Protein sequence
MVLHQNIHVLMASAALQGHLNPMLKFAKCLISKGIHVTIATTDLARHRMLEHTTTSDPTNPSIQFEFFSDGLDLDFDRDANSDIFIDSLRTKASKNLSNLIAKLSQSIKFSCLILHQFVPWFIPIAKENNLPCAVLWIQPCALYSIYYRFFNKLNEFAILQNKNQPLQLPGLPLLNFEDLPTFIHLNAYLCFQKLLSEFFSFLDDVNWVLGTSFDELEEEVLRSMNGGVVRPMISPIGPLVSPFLLGKEEKVGEGGLSADMWKADDSCLQWLDGRDMGSVVYVSFGSIIVLSQEQVDNIANGLLMSGKPFLWVFKRPSPEKSTEGCTVRLPDGFLEAAGGRALVVNWCSQEQVLKHKAVGCFLTHCGWNSTLETVVAGVPVIAFPEWTDQPTNAKLLTDVFKMGVRMRKGDNGVASSKEVERCIWEMTDGPKAKAMAKRAVELMEAAKRAVEDGGSSHRNLDLFIADICCKKVTT
Homology
BLAST of CmaCh02G005000 vs. ExPASy Swiss-Prot
Match:
O22183 (UDP-glycosyltransferase 84B2 OS=Arabidopsis thaliana OX=3702 GN=UGT84B2 PE=3 SV=1)
HSP 1 Score: 397.1 bits (1019), Expect = 2.7e-109
Identity = 215/459 (46.84%), Postives = 289/459 (62.96%), Query Frame = 0
Query: 11 MASAALQGHLNPMLKFAKCLISKGIHVTIATTDLARHRMLEHTTTSDPTNPSIQFEFFSD 70
M + A QGHLNPMLKFAK L +H T+ATT+ AR + ++T+D + + FFSD
Sbjct: 1 MVALAFQGHLNPMLKFAKHLARTNLHFTLATTEQARDLL---SSTADEPHRPVDLAFFSD 60
Query: 71 GLDLDFDRDANSDIFIDSLRTKASKNLSNLIAKLSQSIKFSCLILHQFVPWFIPIAKENN 130
GL D RD D SL+ +KNLS +I + +F C+I F PW +A +N
Sbjct: 61 GLPKDDPRD--PDTLAKSLKKDGAKNLSKII----EEKRFDCIISVPFTPWVPAVAAAHN 120
Query: 131 LPCAVLWIQPCALYSIYYRFFNKLNEFAILQNKNQPLQLPGLPLLNFEDLPTFIHLNAYL 190
+PCA+LWIQ C +S+YYR++ K N F L++ NQ ++LP LPLL DLP+ + +
Sbjct: 121 IPCAILWIQACGAFSVYYRYYMKTNPFPDLEDLNQTVELPALPLLEVRDLPSLMLPSQGA 180
Query: 191 CFQKLLSEFFSFLDDVNWVLGTSFDELEEEVLRSMNGGVVRPMISPIGPLVSPFLLGKEE 250
L++EF L DV WVL SF ELE E++ SM+ ++P+I PIGPLVSPFLLG +E
Sbjct: 181 NVNTLMAEFADCLKDVKWVLVNSFYELESEIIESMSD--LKPII-PIGPLVSPFLLGNDE 240
Query: 251 KVGEGGLSADMWKADDSCLQWLDGRDMGSVVYVSFGSIIVLSQEQVDNIANGLLMSGKPF 310
+ + DMWK DD C++WLD + SVVY+SFGSI+ + QV+ IA L G PF
Sbjct: 241 E-----KTLDMWKVDDYCMEWLDKQARSSVVYISFGSILKSLENQVETIATALKNRGVPF 300
Query: 311 LWVFKRPSPEKSTEGCTVRLPDGFLEAAGGRALVVNWCSQEQVLKHKAVGCFLTHCGWNS 370
LWV + P++ E V L + E G+ +V W QE++L H A+ CF+THCGWNS
Sbjct: 301 LWVIR---PKEKGENVQV-LQEMVKE---GKGVVTEWGQQEKILSHMAISCFITHCGWNS 360
Query: 371 TLETVVAGVPVIAFPEWTDQPTNAKLLTDVFKMGVRMRK-GDNGVASSKEVERCIWEMTD 430
T+ETVV GVPV+A+P W DQP +A+LL DVF +GVRM+ +G EVERCI +T+
Sbjct: 361 TIETVVTGVPVVAYPTWIDQPLDARLLVDVFGIGVRMKNDAIDGELKVAEVERCIEAVTE 420
Query: 431 GPKAKAMAKRAVELMEAAKRAVEDGGSSHRNLDLFIADI 469
GP A M +RA EL AA+ A+ GGSS +NLD FI+DI
Sbjct: 421 GPAAADMRRRATELKHAARSAMSPGGSSAQNLDSFISDI 435
BLAST of CmaCh02G005000 vs. ExPASy Swiss-Prot
Match:
O22182 (UDP-glycosyltransferase 84B1 OS=Arabidopsis thaliana OX=3702 GN=UGT84B1 PE=2 SV=1)
HSP 1 Score: 388.7 bits (997), Expect = 9.8e-107
Identity = 213/467 (45.61%), Postives = 292/467 (62.53%), Query Frame = 0
Query: 5 QNIHVLMASAALQGHLNPMLKFAK--CLISKGIHVTIATTDLARHRMLEHTTTSDPTNPS 64
Q HVLM + QGH+NPMLK AK L SK +H+ +AT + AR + +T P P
Sbjct: 7 QETHVLMVTLPFQGHINPMLKLAKHLSLSSKNLHINLATIESARDLL---STVEKPRYP- 66
Query: 65 IQFEFFSDGLDLDFDRDANSDIFIDSLRTKASKNLSNLIAKLSQSIKFSCLILHQFVPWF 124
+ FFSDGL + + + + SL + NLS +I + ++SC+I F PW
Sbjct: 67 VDLVFFSDGLPKEDPK--APETLLKSLNKVGAMNLSKII----EEKRYSCIISSPFTPWV 126
Query: 125 IPIAKENNLPCAVLWIQPCALYSIYYRFFNKLNEFAILQNKNQPLQLPGLPLLNFEDLPT 184
+A +N+ CA+LWIQ C YS+YYR++ K N F L++ NQ ++LP LPLL DLP+
Sbjct: 127 PAVAASHNISCAILWIQACGAYSVYYRYYMKTNSFPDLEDLNQTVELPALPLLEVRDLPS 186
Query: 185 FIHLNAYLCFQKLLSEFFSFLDDVNWVLGTSFDELEEEVLRSMNGGVVRPMISPIGPLVS 244
F+ + F L++EF L V WVL SF ELE E++ SM ++P+I PIGPLVS
Sbjct: 187 FMLPSGGAHFYNLMAEFADCLRYVKWVLVNSFYELESEIIESM--ADLKPVI-PIGPLVS 246
Query: 245 PFLLGKEEKVGEGGLSADMWKADDSCLQWLDGRDMGSVVYVSFGSIIVLSQEQVDNIANG 304
PFLLG E+ G + D K+DD C++WLD + SVVY+SFGS++ + QV+ IA
Sbjct: 247 PFLLGDGEEETLDGKNLDFCKSDDCCMEWLDKQARSSVVYISFGSMLETLENQVETIAKA 306
Query: 305 LLMSGKPFLWVFKRPSPEKSTEGCTVRLPDGFLEAAGGRALVVNWCSQEQVLKHKAVGCF 364
L G PFLWV + P++ + V L + E G+ +V+ W QE++L H+A+ CF
Sbjct: 307 LKNRGLPFLWVIR---PKEKAQNVAV-LQEMVKE---GQGVVLEWSPQEKILSHEAISCF 366
Query: 365 LTHCGWNSTLETVVAGVPVIAFPEWTDQPTNAKLLTDVFKMGVRMRKGD-NGVASSKEVE 424
+THCGWNST+ETVVAGVPV+A+P WTDQP +A+LL DVF +GVRMR +G +EVE
Sbjct: 367 VTHCGWNSTMETVVAGVPVVAYPSWTDQPIDARLLVDVFGIGVRMRNDSVDGELKVEEVE 426
Query: 425 RCIWEMTDGPKAKAMAKRAVELMEAAKRAVEDGGSSHRNLDLFIADI 469
RCI +T+GP A + +RA EL A+ A+ GGSS RNLDLFI+DI
Sbjct: 427 RCIEAVTEGPAAVDIRRRAAELKRVARLALAPGGSSTRNLDLFISDI 453
BLAST of CmaCh02G005000 vs. ExPASy Swiss-Prot
Match:
V5LLZ9 (Gallate 1-beta-glucosyltransferase OS=Quercus robur OX=38942 GN=UGT84A13 PE=1 SV=1)
HSP 1 Score: 335.5 bits (859), Expect = 9.8e-91
Identity = 190/475 (40.00%), Postives = 275/475 (57.89%), Query Frame = 0
Query: 7 IHVLMASAALQGHLNPMLKFAKCLISKGIHVTIATTDLARHRMLEHTTTSDPTNP----S 66
+HV + S QGH+NP+L+ K L +KG+ VT +T + +M + + +D P
Sbjct: 7 VHVFLVSFPGQGHVNPLLRLGKRLAAKGLLVTFSTPESIGKQMRKASNITDEPAPVGEGF 66
Query: 67 IQFEFFSDGLDLDFDRDANSDIFIDSLRTKASKNLSNLIAKLSQSIK-FSCLILHQFVPW 126
I+FEFF DG D D R + D ++ L + +I K ++ + SCLI + F+PW
Sbjct: 67 IRFEFFEDGWDEDEPRRQDLDQYLPQLELIGKDIIPKMIRKNAEMGRPVSCLINNPFIPW 126
Query: 127 FIPIAKENNLPCAVLWIQPCALYSIYYRFFNKLNEFAILQNKNQPLQLPGLPLLNFEDLP 186
+A+ LP A+LW+Q CA + YY +++ L F +QLP +PLL +++ P
Sbjct: 127 VSDVAESLGLPSAMLWVQSCACFCAYYHYYHGLVPFPSEAEPFIDIQLPCMPLLKYDETP 186
Query: 187 TFIH-LNAYLCFQKLLSEFFSFLDDVNWVLGTSFDELEEEVLRSMNGGVVRPMISPIGPL 246
+F++ Y ++ + + LD +L +F ELE EV+ M+ + P I +GPL
Sbjct: 187 SFLYPTTPYPFLRRAILGQYGNLDKPFCILMDTFQELEHEVIEFMS--KICP-IKTVGPL 246
Query: 247 VSPFLLGKEEKVGEGGLSADMWKADDSCLQWLDGRDMGSVVYVSFGSIIVLSQEQVDNIA 306
+ + D KADD CL+WLD + SVVY+SFGS++ L+Q+QVD IA
Sbjct: 247 F-------KNPKAPNSVRGDFMKADD-CLEWLDSKPPQSVVYISFGSVVYLTQKQVDEIA 306
Query: 307 NGLLMSGKPFLWVFKRPSPEKSTEGCTVRLPDGFLEAAGGRALVVNWCSQEQVLKHKAVG 366
GLL SG FLWV K P + E + LPDGFLE AG VV W QEQVL H +V
Sbjct: 307 FGLLQSGVSFLWVMKPPHKDAGLE--LLVLPDGFLEKAGDNGRVVQWSPQEQVLAHPSVA 366
Query: 367 CFLTHCGWNSTLETVVAGVPVIAFPEWTDQPTNAKLLTDVFKMGVRMRKG--DNGVASSK 426
CF+THCGWNST+E++ +G+PV+AFP+W DQ T+A L DVFK GVRM +G +N V +
Sbjct: 367 CFVTHCGWNSTMESLTSGMPVVAFPQWGDQVTDAVYLVDVFKTGVRMCRGEAENRVITRD 426
Query: 427 EVERCIWEMTDGPKAKAMAKRAVELMEAAKRAVEDGGSSHRNLDLFIADICCKKV 474
EVE+C+ E T GPKA M + A + AA+ A +GGSS RN+ F+ ++ + V
Sbjct: 427 EVEKCLLEATVGPKAVEMKQNASKWKAAAEAAFSEGGSSDRNIQAFVDEVRARSV 468
BLAST of CmaCh02G005000 vs. ExPASy Swiss-Prot
Match:
A0A193AU77 (Gallate 1-beta-glucosyltransferase 84A24 OS=Punica granatum OX=22663 GN=UGT84A24 PE=1 SV=1)
HSP 1 Score: 333.6 bits (854), Expect = 3.7e-90
Identity = 188/470 (40.00%), Postives = 273/470 (58.09%), Query Frame = 0
Query: 7 IHVLMASAALQGHLNPMLKFAKCLISKGIHVTIATTDLARHRMLEHTTTSDPTNP----S 66
+HV + S QGH+NP+L+ K L SKG+ VT T + +M + + + +P
Sbjct: 7 VHVFLVSFPGQGHVNPLLRLGKRLASKGLLVTFTTPESIGKQMRKASNIGEEPSPIGDGF 66
Query: 67 IQFEFFSDGLDLDFDRDANSDIFIDSLRTKASKNLSNLIAK-LSQSIKFSCLILHQFVPW 126
I+FEFF DG D D R + D ++ L + + +I K Q+ SCLI + F+PW
Sbjct: 67 IRFEFFEDGWDEDEPRRQDLDQYLPQLEKVGKEVIPRMIKKNEEQNRPVSCLINNPFIPW 126
Query: 127 FIPIAKENNLPCAVLWIQPCALYSIYYRFFNKLNEFAILQNKNQPLQLPGLPLLNFEDLP 186
+A+ LP A+LW+Q CA ++ YY +++ L F +QLP +PLL +++P
Sbjct: 127 VSDVAESLGLPSAMLWVQSCACFAAYYHYYHGLVPFPSESAMEIDVQLPCMPLLKHDEVP 186
Query: 187 TFIH-LNAYLCFQKLLSEFFSFLDDVNWVLGTSFDELEEEVLRSMNGGVVRPMISPIGPL 246
+F++ Y ++ + + LD VL +F ELE E++ M+ + P I +GPL
Sbjct: 187 SFLYPTTPYPFLRRAIMGQYKNLDKPFCVLMDTFQELEHEIIEYMS--KICP-IKTVGPL 246
Query: 247 VSPFLLGKEEKVGEGGLSADMWKADDSCLQWLDGRDMGSVVYVSFGSIIVLSQEQVDNIA 306
K K + D KADD C+ WLD + SVVYVSFGS++ L Q+Q D IA
Sbjct: 247 F------KNPKAPNANVRGDFMKADD-CISWLDSKPPASVVYVSFGSVVYLKQDQWDEIA 306
Query: 307 NGLLMSGKPFLWVFKRPSPEKSTEGCTVRLPDGFLEAAGGRALVVNWCSQEQVLKHKAVG 366
GLL SG FLWV K P + + T LP+GFLE AG + VV W QEQVL H +V
Sbjct: 307 FGLLNSGLNFLWVMKPPHKDSGYQLLT--LPEGFLEKAGDKGKVVQWSPQEQVLAHPSVA 366
Query: 367 CFLTHCGWNSTLETVVAGVPVIAFPEWTDQPTNAKLLTDVFKMGVRMRKG--DNGVASSK 426
CF+THCGWNS++E + +G+PV+AFP+W DQ T+AK L DVFK+GVRM +G +N +
Sbjct: 367 CFVTHCGWNSSMEALSSGMPVVAFPQWGDQVTDAKYLVDVFKVGVRMCRGEAENKLIMRD 426
Query: 427 EVERCIWEMTDGPKAKAMAKRAVELMEAAKRAVEDGGSSHRNLDLFIADI 469
VE+C+ E T GPKA + + A++ AA+ AV +GGSS RN+ F+ ++
Sbjct: 427 VVEKCLLEATVGPKAAEVKENALKWKAAAEAAVAEGGSSDRNIQAFVDEV 464
BLAST of CmaCh02G005000 vs. ExPASy Swiss-Prot
Match:
Q2V6K1 (Putative UDP-glucose glucosyltransferase OS=Fragaria ananassa OX=3747 GN=GT5 PE=2 SV=1)
HSP 1 Score: 329.3 bits (843), Expect = 7.0e-89
Identity = 186/475 (39.16%), Postives = 270/475 (56.84%), Query Frame = 0
Query: 6 NIHVLMASAALQGHLNPMLKFAKCLISKGIHVTIATTDLARHRMLEHTTTSD--PT---N 65
N H+ + QGH+NPML+ K L +KG+ VT +TT+ ++M D PT N
Sbjct: 8 NTHIFLVCYPAQGHINPMLRLGKYLAAKGLLVTFSTTEDYGNKMRNANGIVDNHPTPVGN 67
Query: 66 PSIQFEFFSDGL-DLDFDRDANSDIFIDSLRTKASKNLSNLIAKLSQ--SIKFSCLILHQ 125
I+FEFF D L D D R N + ++ L + ++ +I K + + SCL+ +
Sbjct: 68 GFIRFEFFDDSLPDPDDPRRTNLEFYVPLLEKVGKELVTGMIKKHGEEGGARVSCLVNNP 127
Query: 126 FVPWFIPIAKENNLPCAVLWIQPCALYSIYYRFFNKLNEFAILQNKNQPLQLPGLPLLNF 185
F+PW +A E +PCA LWIQ CA++S Y+ + + +F +QLP PLL
Sbjct: 128 FIPWVCDVATELGIPCATLWIQSCAVFSAYFHYNAETVKFPTEAEPELDVQLPSTPLLKH 187
Query: 186 EDLPTFIH-LNAYLCFQKLLSEFFSFLDDVNWVLGTSFDELEEEVLRSMNGGVVRPMISP 245
+++P+F+H + Y + + F L +++L + ELE E++ M+ ++ P
Sbjct: 188 DEIPSFLHPFDPYAILGRAILGQFKKLSKSSYILMDTIQELEPEIVEEMSKVC---LVKP 247
Query: 246 IGPLVSPFLLGKEEKVGEGGLSADMWKADDSCLQWLDGRDMGSVVYVSFGSIIVLSQEQV 305
+GPL K + + D+ KADD CL WL + SVVY+SFGSI+ L QEQV
Sbjct: 248 VGPLF------KIPEATNTTIRGDLIKADD-CLDWLSSKPPASVVYISFGSIVYLKQEQV 307
Query: 306 DNIANGLLMSGKPFLWVFKRPSPEKSTEGCTVR-LPDGFLEAAGGRALVVNWCSQEQVLK 365
D IA+GLL SG FLWV + P + G + LP+GFLE G +V W QEQVL
Sbjct: 308 DEIAHGLLSSGVSFLWVMR---PPRKAAGVDMHVLPEGFLEKVGDNGKLVQWSPQEQVLA 367
Query: 366 HKAVGCFLTHCGWNSTLETVVAGVPVIAFPEWTDQPTNAKLLTDVFKMGVRMRKG--DNG 425
H ++ CFLTHCGWNS++E + GVPV+ FP+W DQ TNAK L DVF +G+R+ +G +N
Sbjct: 368 HPSLACFLTHCGWNSSVEALTLGVPVVTFPQWGDQVTNAKYLVDVFGVGLRLCRGVAENR 427
Query: 426 VASSKEVERCIWEMTDGPKAKAMAKRAVELMEAAKRAVEDGGSSHRNLDLFIADI 469
+ EVE+C+ E T G KA + A++ + A+ AV +GGSS RNL FI +I
Sbjct: 428 LVLRDEVEKCLLEATVGEKAVQLKHNALKWKKVAEEAVAEGGSSQRNLHDFIDEI 469
BLAST of CmaCh02G005000 vs. TAIR 10
Match:
AT2G23250.1 (UDP-glucosyl transferase 84B2 )
HSP 1 Score: 397.1 bits (1019), Expect = 2.0e-110
Identity = 215/459 (46.84%), Postives = 289/459 (62.96%), Query Frame = 0
Query: 11 MASAALQGHLNPMLKFAKCLISKGIHVTIATTDLARHRMLEHTTTSDPTNPSIQFEFFSD 70
M + A QGHLNPMLKFAK L +H T+ATT+ AR + ++T+D + + FFSD
Sbjct: 1 MVALAFQGHLNPMLKFAKHLARTNLHFTLATTEQARDLL---SSTADEPHRPVDLAFFSD 60
Query: 71 GLDLDFDRDANSDIFIDSLRTKASKNLSNLIAKLSQSIKFSCLILHQFVPWFIPIAKENN 130
GL D RD D SL+ +KNLS +I + +F C+I F PW +A +N
Sbjct: 61 GLPKDDPRD--PDTLAKSLKKDGAKNLSKII----EEKRFDCIISVPFTPWVPAVAAAHN 120
Query: 131 LPCAVLWIQPCALYSIYYRFFNKLNEFAILQNKNQPLQLPGLPLLNFEDLPTFIHLNAYL 190
+PCA+LWIQ C +S+YYR++ K N F L++ NQ ++LP LPLL DLP+ + +
Sbjct: 121 IPCAILWIQACGAFSVYYRYYMKTNPFPDLEDLNQTVELPALPLLEVRDLPSLMLPSQGA 180
Query: 191 CFQKLLSEFFSFLDDVNWVLGTSFDELEEEVLRSMNGGVVRPMISPIGPLVSPFLLGKEE 250
L++EF L DV WVL SF ELE E++ SM+ ++P+I PIGPLVSPFLLG +E
Sbjct: 181 NVNTLMAEFADCLKDVKWVLVNSFYELESEIIESMSD--LKPII-PIGPLVSPFLLGNDE 240
Query: 251 KVGEGGLSADMWKADDSCLQWLDGRDMGSVVYVSFGSIIVLSQEQVDNIANGLLMSGKPF 310
+ + DMWK DD C++WLD + SVVY+SFGSI+ + QV+ IA L G PF
Sbjct: 241 E-----KTLDMWKVDDYCMEWLDKQARSSVVYISFGSILKSLENQVETIATALKNRGVPF 300
Query: 311 LWVFKRPSPEKSTEGCTVRLPDGFLEAAGGRALVVNWCSQEQVLKHKAVGCFLTHCGWNS 370
LWV + P++ E V L + E G+ +V W QE++L H A+ CF+THCGWNS
Sbjct: 301 LWVIR---PKEKGENVQV-LQEMVKE---GKGVVTEWGQQEKILSHMAISCFITHCGWNS 360
Query: 371 TLETVVAGVPVIAFPEWTDQPTNAKLLTDVFKMGVRMRK-GDNGVASSKEVERCIWEMTD 430
T+ETVV GVPV+A+P W DQP +A+LL DVF +GVRM+ +G EVERCI +T+
Sbjct: 361 TIETVVTGVPVVAYPTWIDQPLDARLLVDVFGIGVRMKNDAIDGELKVAEVERCIEAVTE 420
Query: 431 GPKAKAMAKRAVELMEAAKRAVEDGGSSHRNLDLFIADI 469
GP A M +RA EL AA+ A+ GGSS +NLD FI+DI
Sbjct: 421 GPAAADMRRRATELKHAARSAMSPGGSSAQNLDSFISDI 435
BLAST of CmaCh02G005000 vs. TAIR 10
Match:
AT2G23260.1 (UDP-glucosyl transferase 84B1 )
HSP 1 Score: 388.7 bits (997), Expect = 6.9e-108
Identity = 213/467 (45.61%), Postives = 292/467 (62.53%), Query Frame = 0
Query: 5 QNIHVLMASAALQGHLNPMLKFAK--CLISKGIHVTIATTDLARHRMLEHTTTSDPTNPS 64
Q HVLM + QGH+NPMLK AK L SK +H+ +AT + AR + +T P P
Sbjct: 7 QETHVLMVTLPFQGHINPMLKLAKHLSLSSKNLHINLATIESARDLL---STVEKPRYP- 66
Query: 65 IQFEFFSDGLDLDFDRDANSDIFIDSLRTKASKNLSNLIAKLSQSIKFSCLILHQFVPWF 124
+ FFSDGL + + + + SL + NLS +I + ++SC+I F PW
Sbjct: 67 VDLVFFSDGLPKEDPK--APETLLKSLNKVGAMNLSKII----EEKRYSCIISSPFTPWV 126
Query: 125 IPIAKENNLPCAVLWIQPCALYSIYYRFFNKLNEFAILQNKNQPLQLPGLPLLNFEDLPT 184
+A +N+ CA+LWIQ C YS+YYR++ K N F L++ NQ ++LP LPLL DLP+
Sbjct: 127 PAVAASHNISCAILWIQACGAYSVYYRYYMKTNSFPDLEDLNQTVELPALPLLEVRDLPS 186
Query: 185 FIHLNAYLCFQKLLSEFFSFLDDVNWVLGTSFDELEEEVLRSMNGGVVRPMISPIGPLVS 244
F+ + F L++EF L V WVL SF ELE E++ SM ++P+I PIGPLVS
Sbjct: 187 FMLPSGGAHFYNLMAEFADCLRYVKWVLVNSFYELESEIIESM--ADLKPVI-PIGPLVS 246
Query: 245 PFLLGKEEKVGEGGLSADMWKADDSCLQWLDGRDMGSVVYVSFGSIIVLSQEQVDNIANG 304
PFLLG E+ G + D K+DD C++WLD + SVVY+SFGS++ + QV+ IA
Sbjct: 247 PFLLGDGEEETLDGKNLDFCKSDDCCMEWLDKQARSSVVYISFGSMLETLENQVETIAKA 306
Query: 305 LLMSGKPFLWVFKRPSPEKSTEGCTVRLPDGFLEAAGGRALVVNWCSQEQVLKHKAVGCF 364
L G PFLWV + P++ + V L + E G+ +V+ W QE++L H+A+ CF
Sbjct: 307 LKNRGLPFLWVIR---PKEKAQNVAV-LQEMVKE---GQGVVLEWSPQEKILSHEAISCF 366
Query: 365 LTHCGWNSTLETVVAGVPVIAFPEWTDQPTNAKLLTDVFKMGVRMRKGD-NGVASSKEVE 424
+THCGWNST+ETVVAGVPV+A+P WTDQP +A+LL DVF +GVRMR +G +EVE
Sbjct: 367 VTHCGWNSTMETVVAGVPVVAYPSWTDQPIDARLLVDVFGIGVRMRNDSVDGELKVEEVE 426
Query: 425 RCIWEMTDGPKAKAMAKRAVELMEAAKRAVEDGGSSHRNLDLFIADI 469
RCI +T+GP A + +RA EL A+ A+ GGSS RNLDLFI+DI
Sbjct: 427 RCIEAVTEGPAAVDIRRRAAELKRVARLALAPGGSSTRNLDLFISDI 453
BLAST of CmaCh02G005000 vs. TAIR 10
Match:
AT4G15480.1 (UDP-Glycosyltransferase superfamily protein )
HSP 1 Score: 318.9 bits (816), Expect = 6.8e-87
Identity = 181/469 (38.59%), Postives = 275/469 (58.64%), Query Frame = 0
Query: 7 IHVLMASAALQGHLNPMLKFAKCLISKGIHVTIATTDLARHRMLEHTTTSDPT-----NP 66
IHV++ S QGH+NP+L+ K + SKG+ VT TT+L +M + D +
Sbjct: 18 IHVMLVSFQGQGHVNPLLRLGKLIASKGLLVTFVTTELWGKKMRQANKIVDGELKPVGSG 77
Query: 67 SIQFEFFSDGLDLDFDRDANSDIFIDSLRTKASKNLSNLIAKLSQSIK-FSCLILHQFVP 126
SI+FEFF + D DR A+ ++I L + + +S L+ + ++ + SCLI + F+P
Sbjct: 78 SIRFEFFDEEWAEDDDRRADFSLYIAHLESVGIREVSKLVRRYEEANEPVSCLINNPFIP 137
Query: 127 WFIPIAKENNLPCAVLWIQPCALYSIYYRFFNKLNEFAILQNKNQPLQLPGLPLLNFEDL 186
W +A+E N+PCAVLW+Q CA +S YY + + F ++LP +P+L +++
Sbjct: 138 WVCHVAEEFNIPCAVLWVQSCACFSAYYHYQDGSVSFPTETEPELDVKLPCVPVLKNDEI 197
Query: 187 PTFIHLNA-YLCFQKLLSEFFSFLDDVNWVLGTSFDELEEEVLRSMNGGVVRPMISPIGP 246
P+F+H ++ + F++ + F L VL SFD LE+EV+ M+ + P + +GP
Sbjct: 198 PSFLHPSSRFTGFRQAILGQFKNLSKSFCVLIDSFDSLEQEVIDYMSS--LCP-VKTVGP 257
Query: 247 LVSPFLLGKEEKVGEGGLSADMWKADDSCLQWLDGRDMGSVVYVSFGSIIVLSQEQVDNI 306
L K + +S D+ K+ D CL+WLD R SVVY+SFG++ L QEQ++ I
Sbjct: 258 LF------KVARTVTSDVSGDICKSTDKCLEWLDSRPKSSVVYISFGTVAYLKQEQIEEI 317
Query: 307 ANGLLMSGKPFLWVFKRPSPEKSTEGCTVRLPDGFLE-AAGGRALVVNWCSQEQVLKHKA 366
A+G+L SG FLWV + P + E T LP E +A G+ ++V+WC QEQVL H +
Sbjct: 318 AHGVLKSGLSFLWVIRPPPHDLKVE--THVLPQELKESSAKGKGMIVDWCPQEQVLSHPS 377
Query: 367 VGCFLTHCGWNSTLETVVAGVPVIAFPEWTDQPTNAKLLTDVFKMGVRMRKG--DNGVAS 426
V CF+THCGWNST+E++ +GVPV+ P+W DQ T+A L DVFK GVR+ +G + V
Sbjct: 378 VACFVTHCGWNSTMESLSSGVPVVCCPQWGDQVTDAVYLIDVFKTGVRLGRGATEERVVP 437
Query: 427 SKEVERCIWEMTDGPKAKAMAKRAVELMEAAKRAVEDGGSSHRNLDLFI 466
+EV + E T G KA+ + K A++ A+ AV GGSS +N F+
Sbjct: 438 REEVAEKLLEATVGEKAEELRKNALKWKAEAEAAVAPGGSSDKNFREFV 475
BLAST of CmaCh02G005000 vs. TAIR 10
Match:
AT4G15490.1 (UDP-Glycosyltransferase superfamily protein )
HSP 1 Score: 295.4 bits (755), Expect = 8.0e-80
Identity = 171/489 (34.97%), Postives = 266/489 (54.40%), Query Frame = 0
Query: 5 QNIHVLMASAALQGHLNPMLKFAKCLISKGIHVTIATTD------LARHRMLEHTTTSDP 64
++ HV++ S QGH+NP+L+ K + SKG+ VT TT+ + + ++
Sbjct: 5 RHTHVMLVSFPGQGHVNPLLRLGKLIASKGLLVTFVTTEKPWGKKMRQANKIQDGVLKPV 64
Query: 65 TNPSIQFEFFSDGLDLDFDRDANSDIFIDSLRTKASKNLSNLIAKLSQSIKFSCLILHQF 124
I+FEFFSDG D ++ + D F L + + NL+ + ++ +CLI + F
Sbjct: 65 GLGFIRFEFFSDGFADDDEKRFDFDAFRPHLEAVGKQEIKNLVKRYNKE-PVTCLINNAF 124
Query: 125 VPWFIPIAKENNLPCAVLWIQPCALYSIYYRFFNKLNEFAILQNKNQPLQLPGLPLLNFE 184
VPW +A+E ++P AVLW+Q CA + YY + ++L +F + +++P LPLL +
Sbjct: 125 VPWVCDVAEELHIPSAVLWVQSCACLTAYYYYHHRLVKFPTKTEPDISVEIPCLPLLKHD 184
Query: 185 DLPTFIHLNA-YLCFQKLL----------SEFFSFLDDVNWVLGTSFDELEEEVLRSMNG 244
++P+F+H ++ Y F ++ F+ F+D +F ELE++++ M+
Sbjct: 185 EIPSFLHPSSPYTAFGDIILDQLKRFENHKSFYLFID--------TFRELEKDIMDHMSQ 244
Query: 245 GVVRPMISPIGPLVSPFLLGKEEKVGEGGLSADMWKADDSCLQWLDGRDMGSVVYVSFGS 304
+ +ISP+GPL K + + D+ + C++WLD R+ SVVY+SFG+
Sbjct: 245 LCPQAIISPVGPLF------KMAQTLSSDVKGDISEPASDCMEWLDSREPSSVVYISFGT 304
Query: 305 IIVLSQEQVDNIANGLLMSGKPFLWVFKRPSPEKSTEGCTVRLPDGFLEAAGGRALVVNW 364
I L QEQ++ IA+G+L SG LWV + P EG V P + +V W
Sbjct: 305 IANLKQEQMEEIAHGVLSSGLSVLWVVRPP-----MEGTFVE-PHVLPRELEEKGKIVEW 364
Query: 365 CSQEQVLKHKAVGCFLTHCGWNSTLETVVAGVPVIAFPEWTDQPTNAKLLTDVFKMGVRM 424
C QE+VL H A+ CFL+HCGWNST+E + AGVPV+ FP+W DQ T+A L DVFK GVR+
Sbjct: 365 CPQERVLAHPAIACFLSHCGWNSTMEALTAGVPVVCFPQWGDQVTDAVYLADVFKTGVRL 424
Query: 425 RKG--DNGVASSKEVERCIWEMTDGPKAKAMAKRAVELMEAAKRAVEDGGSSHRNLDLFI 475
+G + + S + V + E T G KA + + A A+ AV DGGSS N F+
Sbjct: 425 GRGAAEEMIVSREVVAEKLLEATVGEKAVELRENARRWKAEAEAAVADGGSSDMNFKEFV 472
BLAST of CmaCh02G005000 vs. TAIR 10
Match:
AT3G21560.1 (UDP-Glycosyltransferase superfamily protein )
HSP 1 Score: 290.0 bits (741), Expect = 3.4e-78
Identity = 164/477 (34.38%), Postives = 264/477 (55.35%), Query Frame = 0
Query: 8 HVLMASAALQGHLNPMLKFAKCLISKGIHVTIATTDLARHRMLEHTTTSD----PTNPS- 67
HV++ S QGH+NP+L+ K L SKG+ +T TT+ +M D P
Sbjct: 12 HVMLVSFPGQGHVNPLLRLGKLLASKGLLITFVTTESWGKKMRISNKIQDRVLKPVGKGY 71
Query: 68 IQFEFFSDGLDLDFDRD-ANSDIFIDSLRTKASKNLSNLIAKLSQSIK--FSCLILHQFV 127
++++FF DGL D + N I L + + NL+ + + K +CLI + FV
Sbjct: 72 LRYDFFDDGLPEDDEASRTNLTILRPHLELVGKREIKNLVKRYKEVTKQPVTCLINNPFV 131
Query: 128 PWFIPIAKENNLPCAVLWIQPCALYSIYYRFFNKLNEFAILQNKNQPLQLPGLPLLNFED 187
W +A++ +PCAVLW+Q CA + YY + + L +F +Q+ G+PLL ++
Sbjct: 132 SWVCDVAEDLQIPCAVLWVQSCACLAAYYYYHHNLVDFPTKTEPEIDVQISGMPLLKHDE 191
Query: 188 LPTFIHLNA-YLCFQKLLSEFFSFLDDVNWVLGTSFDELEEEVLRSMNGGVVRPMISPIG 247
+P+FIH ++ + ++++ + L + +F+ LE++++ M+ + +I P+G
Sbjct: 192 IPSFIHPSSPHSALREVIIDQIKRLHKTFSIFIDTFNSLEKDIIDHMSTLSLPGVIRPLG 251
Query: 248 PLVSPFLLGKEEKVGEGGLSADMWKADDSCLQWLDGRDMGSVVYVSFGSIIVLSQEQVDN 307
PL + V + ++ + D C++WLD + + SVVY+SFG++ L QEQ+D
Sbjct: 252 PLYK-----MAKTVAYDVVKVNISEPTDPCMEWLDSQPVSSVVYISFGTVAYLKQEQIDE 311
Query: 308 IANGLLMSGKPFLWVFKRPSPEKSTEGCTVRLPDGFLEAAGGRALVVNWCSQEQVLKHKA 367
IA G+L + FLWV ++ + E LP E G+ +V WCSQE+VL H +
Sbjct: 312 IAYGVLNADVTFLWVIRQQELGFNKEKHV--LP----EEVKGKGKIVEWCSQEKVLSHPS 371
Query: 368 VGCFLTHCGWNSTLETVVAGVPVIAFPEWTDQPTNAKLLTDVFKMGVRMRKG--DNGVAS 427
V CF+THCGWNST+E V +GVP + FP+W DQ T+A + DV+K GVR+ +G + +
Sbjct: 372 VACFVTHCGWNSTMEAVSSGVPTVCFPQWGDQVTDAVYMIDVWKTGVRLSRGEAEERLVP 431
Query: 428 SKEVERCIWEMTDGPKAKAMAKRAVELMEAAKRAVEDGGSSHRNLDLFIADICCKKV 474
+EV + E+T G KA + K A++ E A+ AV GGSS RNL+ F+ + K V
Sbjct: 432 REEVAERLREVTKGEKAIELKKNALKWKEEAEAAVARGGSSDRNLEKFVEKLGAKPV 477
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
O22183 | 2.7e-109 | 46.84 | UDP-glycosyltransferase 84B2 OS=Arabidopsis thaliana OX=3702 GN=UGT84B2 PE=3 SV=... | [more] |
O22182 | 9.8e-107 | 45.61 | UDP-glycosyltransferase 84B1 OS=Arabidopsis thaliana OX=3702 GN=UGT84B1 PE=2 SV=... | [more] |
V5LLZ9 | 9.8e-91 | 40.00 | Gallate 1-beta-glucosyltransferase OS=Quercus robur OX=38942 GN=UGT84A13 PE=1 SV... | [more] |
A0A193AU77 | 3.7e-90 | 40.00 | Gallate 1-beta-glucosyltransferase 84A24 OS=Punica granatum OX=22663 GN=UGT84A24... | [more] |
Q2V6K1 | 7.0e-89 | 39.16 | Putative UDP-glucose glucosyltransferase OS=Fragaria ananassa OX=3747 GN=GT5 PE=... | [more] |