Bhi01G000853 (gene) Wax gourd

Name: Bhi01G000853
Type: gene
Organism: Benincasa hispida (Wax gourd)
Description: Glycosyltransferase
Location: chr1 : 22888415 .. 22891398 (+)
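The Location field can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome annotations, but not stated on this page); the function name parse_location is hypothetical.

```python
import re

def parse_location(text):
    """Parse a location string such as 'chr1 : 22888415 .. 22891398 (+)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: 1-based inclusive coordinates, so both endpoints count.
        "length": end - start + 1,
    }

loc = parse_location("chr1 : 22888415 .. 22891398 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 2984 bases on the plus strand of chr1.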
The following sequences are available for this feature:

Gene sequence (with intron)

TATCTTGCCATTAAAAAGGAGAAAGGAAAAATCAACTCTTTTATAAGAAATTCCCTCTTGCCCCCACCAACAAGGAAGAGCATTATTGGAGAAGAATAATGGAGCATGGGAATTTCCTTCTAGTTTCCCAAAGCCCAAAGAGCCATCTCAACCCTACTCTCCATTTGGCCTCCACCCTCCTTTCCTTTGGCTCAAAAGTCACTCTTCTCCTCACCAACTCTGCCCTCAAAAACATCTCAAAAGACCAACTCCCTTATGGCTTGTCTCTCTCCACTTTCTCCGATGGCTTTGATGATGGTTTTACCTTCTCTGACTTCCCACGCTGGTGTTTCGAGTTCGAGCGCCTTGGTCGCCTTGCCCTCGTCAACCTCCTCTCCTCCTCCTTACAACAAGGCCTCCTTCCCTTCACTTGCATTGTCCACACCCTCCTCATCCCTTGGGTCGCTCGAGTCGCACGTGAGCTTCATGTGCCGACCGCGGTTTTATGGATTCAATCGGCCGCTGCCTTCGATGTGTATTACTATTACTTCAATGGCTATAGTGATGTGATTTGGAATGGTTATAAAGATGAGGGCTCTAACTCTTTGTTATTTAACATTTGGCTTCCGGGTTTGCCTTTGATGAATGTTTTGGACCTCCCAAGCTTTATGGTTTCTGATGATTATCATGGGCTTATTCTCAAGTCATTTCAAGAGAAGATGCAAGTTCTTGAGGAGGAGGAGAATGTCTCAATCCTTGTTAACTCGTTCGATGCGTTGGAACATGATGCCTTGAGCGCAATCGGGAAATTTAACTTGATCCCAGTTGGACCTTTGGTTTCACTCCCACTTGAATTTGAAGTTTCAACCAAACAACGAAGCACTTCATATTTTCAAGATGGTATGATATACAATCTCGTTTGAGCTTCTTTTAAAGGTTTTTGTTTTACTTTTTTACTAGAATTTCAAGATTTTTTTAATTTTGATGCCTGAACTTTCATAATATCATATTTGAGTCACTTAACTTTAAATCTTACTTTAGTTAAGTTCATGAACTTTGAAACATTTTTTATCCTTGAATGAAAAAACAAAAATCATTTAAGTCTTTACCATAAAAATGAATTAATGATTAATTTGGACATTTACTGAAGCTAGAATACACGTCAATTGTCAACTAACAAAACAAAATATTCTAATACTCGATACATACCTATATATATTCACACGCCACACTAAAATTTTTCATTGATATAATTTTTAGTTGTTGTTGGCCCCTTCATCATTCTAAAATAATGCAACATAACTTCGTAATAAATAATAGTTGTACTAAAAATTCATCCAACTAAAGTGATTATGTTTGAAGAAAAATGAGGGACAAATAATAGTGATGACTATAAGATGATAATTTATTTGATATATACGTTCAGTGGTTAAAATCGACATTTTGAAAGTCCATATATCAAAACAGTTATCCTTTTTTAAAAAAGGAAAAAAAAAATTGTTATCCTTGTAAGTTTGGAGACAAAAATAAATTTTGTAACAGTTAACCTATCAAATTGAAATGATCTAAAAAGTTCAAAGGCTTAGATTGCTCAATTTAAGTCTTTTCCTAATTTCACCAGAGAGAACGGCTGTCTTTACTTATTTATCATGATCATTTCATTCTTGTTCGCAAGGCCAACAGGCTCAAGCGGATTATATCAAATGGCTCAACTCCAAACCTGATTCATCAGTGGTCTACATAGCATTTGGGAGCATTTCAAAGTTGTCAAATAAACAAACAAAAGAGATCGTTGGTGCATTATTAGAATGCAGTTACCCATTCTTGTGGGCCCTTCGCATGGATGACATCCAAGATGAGAATTTAAGCTCATATTTTGATGATGAACTACAAGCTCAAGGGAAGATAGTGCCATGGTGCTCACAGGTAGAGGTCTTGAGCCATCACTCCGTGGGTTGCTTTGTAACGCACTGCGGATGGAACTCGACGATCGAGAGCGTGGCGGCTGGAGTGCCGATGGT
GGCATGGCCGTTGTGGGCAGACCAAGCCACCAATGCCAAGATGATGGAGGATGTATGGGAGATTGGTGTGAGAGTGAAGAAGAGTAGTGATGGTGAAGGAGTGGTTGAAGGGAAGGAGATTGCAAGGTGCTTGAGAACGGTTATGGATATGGAAGATTATGGCAAAGGAAGAGGAAAGCAACTGAGAATCAATGCTAGGAAGTGGCAGAGCTTAGCAATGGAGGCTGCAAATGGTTCTTCTTATATGAATCTTAAGGCTTTTGTAAATAAACTTAGTGATCAAGCAAACTGATTGGTGAATGATCAGCTTCTTTATGGTACTATGTAACAAACATTAGGATTAAAGCATATAATTATAAATAATATTAGCAAGACAATAGTAATACCTGTATCTCAGGTGGTATTACTATTGAGAAGACAAATGTCTACCCAAGTAAGCAAATCCAAGTAGGAAAAAAATTCACTAATAAGGCATACAACAATGGATCAATTCATTCAACAAGATCCATTTTTATTAGATTTGGATGAATTAGATTGGTCTGACCTCTTGCTCGAATGTCAAACTCCTGCTGCCAACAGATGTCCCATTCCATGTTTCGTGAACAGCTATGCCCGGAAATGAATGAACTGTGATGTATCGCCGAATAAGCCACAGCTCTATTCCTATTTTATCATGGCATAATGACAAAATGGAGGTGGAAACCATTTTTTTTAACTCAGAAAATAATCTACCCATTAAATATAGTAGTCAGACCATTATGCAAATTTTGAAATCTCTCTATAAATGACTTCAATATAACTCACAAGATAAACCAACAGAGCTATTCTTCAGCTTCTTATCCTTGAAAATCACAACAGACCCATCTTAACGTACGCATAATTCGTATGGGCATATGAACCTGAAGGATCAGGAATTTCCAGTATTCCAAGTTCATCAAAAGTTCAGATATTTTCTGACCATTAATCATTTGAAGCCATTGTTCC

mRNA sequence

TATCTTGCCATTAAAAAGGAGAAAGGAAAAATCAACTCTTTTATAAGAAATTCCCTCTTGCCCCCACCAACAAGGAAGAGCATTATTGGAGAAGAATAATGGAGCATGGGAATTTCCTTCTAGTTTCCCAAAGCCCAAAGAGCCATCTCAACCCTACTCTCCATTTGGCCTCCACCCTCCTTTCCTTTGGCTCAAAAGTCACTCTTCTCCTCACCAACTCTGCCCTCAAAAACATCTCAAAAGACCAACTCCCTTATGGCTTGTCTCTCTCCACTTTCTCCGATGGCTTTGATGATGGTTTTACCTTCTCTGACTTCCCACGCTGGTGTTTCGAGTTCGAGCGCCTTGGTCGCCTTGCCCTCGTCAACCTCCTCTCCTCCTCCTTACAACAAGGCCTCCTTCCCTTCACTTGCATTGTCCACACCCTCCTCATCCCTTGGGTCGCTCGAGTCGCACGTGAGCTTCATGTGCCGACCGCGGTTTTATGGATTCAATCGGCCGCTGCCTTCGATGTGTATTACTATTACTTCAATGGCTATAGTGATGTGATTTGGAATGGTTATAAAGATGAGGGCTCTAACTCTTTGTTATTTAACATTTGGCTTCCGGGTTTGCCTTTGATGAATGTTTTGGACCTCCCAAGCTTTATGGTTTCTGATGATTATCATGGGCTTATTCTCAAGTCATTTCAAGAGAAGATGCAAGTTCTTGAGGAGGAGGAGAATGTCTCAATCCTTGTTAACTCGTTCGATGCGTTGGAACATGATGCCTTGAGCGCAATCGGGAAATTTAACTTGATCCCAGTTGGACCTTTGGTTTCACTCCCACTTGAATTTGAAGTTTCAACCAAACAACGAAGCACTTCATATTTTCAAGATGGCCAACAGGCTCAAGCGGATTATATCAAATGGCTCAACTCCAAACCTGATTCATCAGTGGTCTACATAGCATTTGGGAGCATTTCAAAGTTGTCAAATAAACAAACAAAAGAGATCGTTGGTGCATTATTAGAATGCAGTTACCCATTCTTGTGGGCCCTTCGCATGGATGACATCCAAGATGAGAATTTAAGCTCATATTTTGATGATGAACTACAAGCTCAAGGGAAGATAGTGCCATGGTGCTCACAGGTAGAGGTCTTGAGCCATCACTCCGTGGGTTGCTTTGTAACGCACTGCGGATGGAACTCGACGATCGAGAGCGTGGCGGCTGGAGTGCCGATGGTGGCATGGCCGTTGTGGGCAGACCAAGCCACCAATGCCAAGATGATGGAGGATGTATGGGAGATTGGTGTGAGAGTGAAGAAGAGTAGTGATGGTGAAGGAGTGGTTGAAGGGAAGGAGATTGCAAGGTGCTTGAGAACGGTTATGGATATGGAAGATTATGGCAAAGGAAGAGGAAAGCAACTGAGAATCAATGCTAGGAAGTGGCAGAGCTTAGCAATGGAGGCTGCAAATGGTTCTTCTTATATGAATCTTAAGGCTTTTGTAAATAAACTTAGTGATCAAGCAAACTGATTGGTGAATGATCAGCTTCTTTATGGTACTATGTAACAAACATTAGGATTAAAGCATATAATTATAAATAATATTAGCAAGACAATAGTAATACCTGTATCTCAGGTGGTATTACTATTGAGAAGACAAATGTCTACCCAAGTAAGCAAATCCAAGTAGGAAAAAAATTCACTAATAAGGCATACAACAATGGATCAATTCATTCAACAAGATCCATTTTTATTAGATTTGGATGAATTAGATTGGTCTGACCTCTTGCTCGAATGTCAAACTCCTGCTGCCAACAGATGTCCCATTCCATGTTTCGTGAACAGCTATGCCCGGAAATGAATGAACTGTGATGTATCGCCGAATAAGCCACAGCTCTATTCCTATTTTATCATGGCATAATGACAAAATGGAGGTGGAAACCATTTTTTTTAACTCAGAAAATAATCTACCCATTAAATATAGTAGTCAGACCATTATGCAAATTTTGAAATC
TCTCTATAAATGACTTCAATATAACTCACAAGATAAACCAACAGAGCTATTCTTCAGCTTCTTATCCTTGAAAATCACAACAGACCCATCTTAACGTACGCATAATTCGTATGGGCATATGAACCTGAAGGATCAGGAATTTCCAGTATTCCAAGTTCATCAAAAGTTCAGATATTTTCTGACCATTAATCATTTGAAGCCATTGTTCC

Coding sequence (CDS)

ATGGAGCATGGGAATTTCCTTCTAGTTTCCCAAAGCCCAAAGAGCCATCTCAACCCTACTCTCCATTTGGCCTCCACCCTCCTTTCCTTTGGCTCAAAAGTCACTCTTCTCCTCACCAACTCTGCCCTCAAAAACATCTCAAAAGACCAACTCCCTTATGGCTTGTCTCTCTCCACTTTCTCCGATGGCTTTGATGATGGTTTTACCTTCTCTGACTTCCCACGCTGGTGTTTCGAGTTCGAGCGCCTTGGTCGCCTTGCCCTCGTCAACCTCCTCTCCTCCTCCTTACAACAAGGCCTCCTTCCCTTCACTTGCATTGTCCACACCCTCCTCATCCCTTGGGTCGCTCGAGTCGCACGTGAGCTTCATGTGCCGACCGCGGTTTTATGGATTCAATCGGCCGCTGCCTTCGATGTGTATTACTATTACTTCAATGGCTATAGTGATGTGATTTGGAATGGTTATAAAGATGAGGGCTCTAACTCTTTGTTATTTAACATTTGGCTTCCGGGTTTGCCTTTGATGAATGTTTTGGACCTCCCAAGCTTTATGGTTTCTGATGATTATCATGGGCTTATTCTCAAGTCATTTCAAGAGAAGATGCAAGTTCTTGAGGAGGAGGAGAATGTCTCAATCCTTGTTAACTCGTTCGATGCGTTGGAACATGATGCCTTGAGCGCAATCGGGAAATTTAACTTGATCCCAGTTGGACCTTTGGTTTCACTCCCACTTGAATTTGAAGTTTCAACCAAACAACGAAGCACTTCATATTTTCAAGATGGCCAACAGGCTCAAGCGGATTATATCAAATGGCTCAACTCCAAACCTGATTCATCAGTGGTCTACATAGCATTTGGGAGCATTTCAAAGTTGTCAAATAAACAAACAAAAGAGATCGTTGGTGCATTATTAGAATGCAGTTACCCATTCTTGTGGGCCCTTCGCATGGATGACATCCAAGATGAGAATTTAAGCTCATATTTTGATGATGAACTACAAGCTCAAGGGAAGATAGTGCCATGGTGCTCACAGGTAGAGGTCTTGAGCCATCACTCCGTGGGTTGCTTTGTAACGCACTGCGGATGGAACTCGACGATCGAGAGCGTGGCGGCTGGAGTGCCGATGGTGGCATGGCCGTTGTGGGCAGACCAAGCCACCAATGCCAAGATGATGGAGGATGTATGGGAGATTGGTGTGAGAGTGAAGAAGAGTAGTGATGGTGAAGGAGTGGTTGAAGGGAAGGAGATTGCAAGGTGCTTGAGAACGGTTATGGATATGGAAGATTATGGCAAAGGAAGAGGAAAGCAACTGAGAATCAATGCTAGGAAGTGGCAGAGCTTAGCAATGGAGGCTGCAAATGGTTCTTCTTATATGAATCTTAAGGCTTTTGTAAATAAACTTAGTGATCAAGCAAACTGA

Protein sequence

MEHGNFLLVSQSPKSHLNPTLHLASTLLSFGSKVTLLLTNSALKNISKDQLPYGLSLSTFSDGFDDGFTFSDFPRWCFEFERLGRLALVNLLSSSLQQGLLPFTCIVHTLLIPWVARVARELHVPTAVLWIQSAAAFDVYYYYFNGYSDVIWNGYKDEGSNSLLFNIWLPGLPLMNVLDLPSFMVSDDYHGLILKSFQEKMQVLEEEENVSILVNSFDALEHDALSAIGKFNLIPVGPLVSLPLEFEVSTKQRSTSYFQDGQQAQADYIKWLNSKPDSSVVYIAFGSISKLSNKQTKEIVGALLECSYPFLWALRMDDIQDENLSSYFDDELQAQGKIVPWCSQVEVLSHHSVGCFVTHCGWNSTIESVAAGVPMVAWPLWADQATNAKMMEDVWEIGVRVKKSSDGEGVVEGKEIARCLRTVMDMEDYGKGRGKQLRINARKWQSLAMEAANGSSYMNLKAFVNKLSDQAN
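
The protein sequence above corresponds to the coding sequence read codon by codon. The following is a minimal sketch, assuming the standard genetic code and reading frame 0; the helper name translate is hypothetical, and only the first and last few codons of the CDS are used so the block stays short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First six codons and last five codons of the CDS listed above.
cds_start = "ATGGAGCATGGGAATTTC"
cds_end = "GATCAAGCAAACTGA"
print(translate(cds_start))  # MEHGNF
print(translate(cds_end))    # DQAN
```

The two fragments translate to MEHGNF and DQAN, matching the beginning and the end of the protein sequence; the final TGA codon is the stop.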
BLAST of Bhi01G000853 vs. TAIR10
Match: AT4G15550.1 (indole-3-acetate beta-D-glucosyltransferase)

HSP 1 Score: 355.9 bits (912), Expect = 3.8e-98
Identity = 200/488 (40.98%), Positives = 301/488 (61.68%), Query Frame = 0

Query: 5   NFLLVSQSPKSHLNPTLHLASTLLS--FGSKVTLLLTNSAL--KNISKDQLPYGLSLSTF 64
           +FL V+   + H+NP+L LA  L     G++VT   + SA   +  S + +P  L  +T+
Sbjct: 13  HFLFVTFPAQGHINPSLELAKRLAGTISGARVTFAASISAYNRRMFSTENVPETLIFATY 72

Query: 65  SDGFDDGF---TFSDFPR------WCFEFERLGRLALVNLLSSSLQQGLLPFTCIVHTLL 124
           SDG DDGF    +SD  R      +  E  R G+  L  L+  + +Q   PFTC+V+T+L
Sbjct: 73  SDGHDDGFKSSAYSDKSRQDATGNFMSEMRRRGKETLTELIEDNRKQN-RPFTCVVYTIL 132

Query: 125 IPWVARVARELHVPTAVLWIQSAAAFDVYYYYFNGYSDVIWNGYKDEGSNSLLFNIWLPG 184
           + WVA +ARE H+P+A+LW+Q    F ++Y+YFNGY D I      E +N+   +I LP 
Sbjct: 133 LTWVAELAREFHLPSALLWVQPVTVFSIFYHYFNGYEDAI-----SEMANTPSSSIKLPS 192

Query: 185 LPLMNVLDLPSFMVSDDYHGLILKSFQEKMQVLEEEENVSILVNSFDALEHDALSAI-GK 244
           LPL+ V D+PSF+VS + +  +L +F+E++  L+EE N  IL+N+F  LE +A+S++   
Sbjct: 193 LPLLTVRDIPSFIVSSNVYAFLLPAFREQIDSLKEEINPKILINTFQELEPEAMSSVPDN 252

Query: 245 FNLIPVGPLVSLPLEFEVSTKQRSTSYFQDGQQAQADYIKWLNSKPDSSVVYIAFGSISK 304
           F ++PVGPL++L  +F                 ++ +YI+WL++K DSSV+Y++FG+++ 
Sbjct: 253 FKIVPVGPLLTLRTDF----------------SSRGEYIEWLDTKADSSVLYVSFGTLAV 312

Query: 305 LSNKQTKEIVGALLECSYPFLWAL------RMDDIQ--DENLSSYFDDELQAQGKIVPWC 364
           LS KQ  E+  AL++   PFLW +        +D Q  +E+  S F +EL   G +V WC
Sbjct: 313 LSKKQLVELCKALIQSRRPFLWVITDKSYRNKEDEQEKEEDCISSFREELDEIGMVVSWC 372

Query: 365 SQVEVLSHHSVGCFVTHCGWNSTIESVAAGVPMVAWPLWADQATNAKMMEDVWEIGVRV- 424
            Q  VL+H S+GCFVTHCGWNST+ES+ +GVP+VA+P W DQ  NAK++ED W+ GVRV 
Sbjct: 373 DQFRVLNHRSIGCFVTHCGWNSTLESLVSGVPVVAFPQWNDQMMNAKLLEDCWKTGVRVM 432

Query: 425 -KKSSDGEGVVEGKEIARCLRTVMDMEDYGKGRGKQLRINARKWQSLAMEAA--NGSSYM 467
            KK  +G  VV+ +EI RC+  VM+       + ++ R NA +W+ LA EA    GSS+ 
Sbjct: 433 EKKEEEGVVVVDSEEIRRCIEEVME------DKAEEFRGNATRWKDLAAEAVREGGSSFN 472
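
The percentages in an HSP header are the match counts divided by the alignment length. The following is a minimal sketch that recomputes them from the header of the hit above; the function name hsp_percentages is hypothetical, and the regular expression assumes the "label = matches/length" layout used on this page.

```python
import re

def hsp_percentages(header):
    """Recompute percentages from 'label = matches/length' pairs in an HSP header."""
    result = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", header):
        # 'Query Frame = 0' has no slash, so it is not picked up here.
        result[label] = f"{100 * int(matches) / int(length):.2f}"
    return result

header = "Identity = 200/488 (40.98%), Positives = 301/488 (61.68%), Query Frame = 0"
stats = hsp_percentages(header)
print(stats)
```

For this hit, 200/488 and 301/488 give 40.98 and 61.68, agreeing with the values printed in the header.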

BLAST of Bhi01G000853 vs. TAIR10
Match: AT4G14090.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 325.9 bits (834), Expect = 4.2e-89
Identity = 190/472 (40.25%), Positives = 284/472 (60.17%), Query Frame = 0

Query: 5   NFLLVSQSPKSHLNPTLHLASTLLSFGSKVTLLLTNSALKNISKDQLPYGLSLSTFSDGF 64
           ++LLV+   + H+NP L LA+ L+  G+ VT     SA + + +     GLS + F+DGF
Sbjct: 13  HYLLVTFPAQGHINPALQLANRLIHHGATVTYSTAVSAHRRMGEPPSTKGLSFAWFTDGF 72

Query: 65  DDGF-TFSDFPRWCFEFERLGRLALVNLLSSSLQ--QGLLPFTCIVHTLLIPWVARVARE 124
           DDG  +F D   +  E +R G  AL +++ ++L       P T +++++L+PWV+ VARE
Sbjct: 73  DDGLKSFEDQKIYMSELKRCGSNALRDIIKANLDATTETEPITGVIYSVLVPWVSTVARE 132

Query: 125 LHVPTAVLWIQSAAAFDVYYYYFNGYSDVIWNGYKDEGSNSLLFN---IWLPGLPLMNVL 184
            H+PT +LWI+ A   D+YYYYFN         YK       LF+   I LP LPL+   
Sbjct: 133 FHLPTTLLWIEPATVLDIYYYYFN-------TSYKH------LFDVEPIKLPKLPLITTG 192

Query: 185 DLPSFMVSDDYHGLILKSFQEKMQVLEEEENVSILVNSFDALEHDALSAIGKFNLIPVGP 244
           DLPSF+         L + +E ++ LE E N  ILVN+F ALEHDAL+++ K  +IP+GP
Sbjct: 193 DLPSFLQPSKALPSALVTLREHIEALETESNPKILVNTFSALEHDALTSVEKLKMIPIGP 252

Query: 245 LVSLPLEFEVSTKQRSTSYFQDGQQAQADYIKWLNSKPDSSVVYIAFGS-ISKLSNKQTK 304
           L        VS+ +  T  F+   +   DY KWL+SK + SV+YI+ G+    L  K  +
Sbjct: 253 L--------VSSSEGKTDLFKSSDE---DYTKWLDSKLERSVIYISLGTHADDLPEKHME 312

Query: 305 EIVGALLECSYPFLWALRMDDIQDENLSSYFD-DELQAQGKIVPWCSQVEVLSHHSVGCF 364
            +   +L  + PFLW +R  + +++  + + +      +G +V WCSQ  VL+H +VGCF
Sbjct: 313 ALTHGVLATNRPFLWIVREKNPEEKKKNRFLELIRGSDRGLVVGWCSQTAVLAHCAVGCF 372

Query: 365 VTHCGWNSTIESVAAGVPMVAWPLWADQATNAKMMEDVWEIGVRVKKSSDGEGVVEGKEI 424
           VTHCGWNST+ES+ +GVP+VA+P +ADQ T AK++ED W IGV+VK   +G+  V+G+EI
Sbjct: 373 VTHCGWNSTLESLESGVPVVAFPQFADQCTTAKLVEDTWRIGVKVKVGEEGD--VDGEEI 432

Query: 425 ARCLRTVMDMEDYGKGRGKQLRINARKWQSLAMEAA--NGSSYMNLKAFVNK 467
            RCL  VM     G    +++R NA KW+++A++AA   G S +NLK FV++
Sbjct: 433 RRCLEKVMS----GGEEAEEMRENAEKWKAMAVDAAAEGGPSDLNLKGFVDE 454

BLAST of Bhi01G000853 vs. TAIR10
Match: AT1G05530.1 (UDP-glucosyl transferase 75B2)

HSP 1 Score: 319.3 bits (817), Expect = 3.9e-87
Identity = 196/487 (40.25%), Positives = 281/487 (57.70%), Query Frame = 0

Query: 1   MEHGNFLLVSQSPKSHLNPTLHLASTLL-SFGSKVTLLLTNSALKNISKDQLP-----YG 60
           M   +FLLV+   + H+NP+L  A  L+ + G++VT     + L  I +  +P       
Sbjct: 1   MAQPHFLLVTFPAQGHVNPSLRFARRLIKTTGARVTFA---TCLSVIHRSMIPNHNNVEN 60

Query: 61  LSLSTFSDGFDDGF--TFSDFPRWCFEFERLGRLALVNLLSSSLQQGLLPFTCIVHTLLI 120
           LS  TFSDGFDDG      D       FER G  AL + + ++ Q G  P +C+++T+L 
Sbjct: 61  LSFLTFSDGFDDGVISNTDDVQNRLVHFERNGDKALSDFIEAN-QNGDSPVSCLIYTILP 120

Query: 121 PWVARVARELHVPTAVLWIQSAAAFDVYYYYFNGYSDVIWNGYKDEGSNSLLFNIWLPGL 180
            WV +VAR  H+P+  LWIQ A AFD+YY Y  G              N+ +F    P L
Sbjct: 121 NWVPKVARRFHLPSVHLWIQPAFAFDIYYNYSTG--------------NNSVFE--FPNL 180

Query: 181 PLMNVLDLPSFMVSDDYHGLILKSFQEKMQVLEEEENVSILVNSFDALEHDALSAIGKFN 240
           P + + DLPSF+   + +      +QE M  L+EE N  ILVN+FD+LE + L+AI    
Sbjct: 181 PSLEIRDLPSFLSPSNTNKAAQAVYQELMDFLKEESNPKILVNTFDSLEPEFLTAIPNIE 240

Query: 241 LIPVGPLVSLPLEFEVSTKQRSTSYFQDGQQAQADYIKWLNSKPDSSVVYIAFGSISKLS 300
           ++ VGPL    L  E+ T   S        Q+ + Y  WL+SK +SSV+Y++FG++ +LS
Sbjct: 241 MVAVGPL----LPAEIFTGSESGKDLSRDHQS-SSYTLWLDSKTESSVIYVSFGTMVELS 300

Query: 301 NKQTKEIVGALLECSYPFLWAL-----RMDDIQDENLSSY-----FDDELQAQGKIVPWC 360
            KQ +E+  AL+E   PFLW +     R   I+ E  +       F  EL+  G IV WC
Sbjct: 301 KKQIEELARALIEGGRPFLWVITDKLNREAKIEGEEETEIEKIAGFRHELEEVGMIVSWC 360

Query: 361 SQVEVLSHHSVGCFVTHCGWNSTIESVAAGVPMVAWPLWADQATNAKMMEDVWEIGVRVK 420
           SQ+EVL H ++GCF+THCGW+S++ES+  GVP+VA+P+W+DQ  NAK++E++W+ GVRV+
Sbjct: 361 SQIEVLRHRAIGCFLTHCGWSSSLESLVLGVPVVAFPMWSDQPANAKLLEEIWKTGVRVR 420

Query: 421 KSSDGEGVVEGKEIARCLRTVMDMEDYGKGRGKQLRINARKWQSLAMEAA--NGSSYMNL 468
           ++S  EG+VE  EI RCL  VM+       +  +LR NA KW+ LA EA    GSS  N+
Sbjct: 421 ENS--EGLVERGEIMRCLEAVME------AKSVELRENAEKWKRLATEAGREGGSSDKNV 454

BLAST of Bhi01G000853 vs. TAIR10
Match: AT1G05560.1 (UDP-glucosyltransferase 75B1)

HSP 1 Score: 316.2 bits (809), Expect = 3.3e-86
Identity = 189/485 (38.97%), Positives = 285/485 (58.76%), Query Frame = 0

Query: 5   NFLLVSQSPKSHLNPTLHLASTLLS-FGSKVTLLLTNSALKN--ISKDQLPYGLSLSTFS 64
           +FLLV+   + H+NP+L  A  L+   G++VT +   S   N  I+       LS  TFS
Sbjct: 5   HFLLVTFPAQGHVNPSLRFARRLIKRTGARVTFVTCVSVFHNSMIANHNKVENLSFLTFS 64

Query: 65  DGFDDG--FTFSDFPRWCFEFERLGRLALVNLLSSSLQQGLLPFTCIVHTLLIPWVARVA 124
           DGFDDG   T+ D  +     +  G  AL + + ++ + G  P TC+++T+L+ W  +VA
Sbjct: 65  DGFDDGGISTYEDRQKRSVNLKVNGDKALSDFIEAT-KNGDSPVTCLIYTILLNWAPKVA 124

Query: 125 RELHVPTAVLWIQSAAAFDVYYYYFNGYSDVIWNGYKDEGSNSLLFNIWLPGLPLMNVLD 184
           R   +P+A+LWIQ A  F++YY +F G              N  +F   LP L  + + D
Sbjct: 125 RRFQLPSALLWIQPALVFNIYYTHFMG--------------NKSVFE--LPNLSSLEIRD 184

Query: 185 LPSFMVSDDYHGLILKSFQEKMQVLEEEENVSILVNSFDALEHDALSAIGKFNLIPVGPL 244
           LPSF+   + +     +FQE M+ L +E    IL+N+FD+LE +AL+A    +++ VGPL
Sbjct: 185 LPSFLTPSNTNKGAYDAFQEMMEFLIKETKPKILINTFDSLEPEALTAFPNIDMVAVGPL 244

Query: 245 VSLPLEFEVSTKQRSTSYFQDGQQAQADYIKWLNSKPDSSVVYIAFGSISKLSNKQTKEI 304
             LP E    +  +S       +   + Y  WL+SK +SSV+Y++FG++ +LS KQ +E+
Sbjct: 245 --LPTEIFSGSTNKSV------KDQSSSYTLWLDSKTESSVIYVSFGTMVELSKKQIEEL 304

Query: 305 VGALLECSYPFLWALR-----------MDDIQDENLSSYFDDELQAQGKIVPWCSQVEVL 364
             AL+E   PFLW +             ++ + E ++  F  EL+  G IV WCSQ+EVL
Sbjct: 305 ARALIEGKRPFLWVITDKSNRETKTEGEEETEIEKIAG-FRHELEEVGMIVSWCSQIEVL 364

Query: 365 SHHSVGCFVTHCGWNSTIESVAAGVPMVAWPLWADQATNAKMMEDVWEIGVRVKKSSDGE 424
           SH +VGCFVTHCGW+ST+ES+  GVP+VA+P+W+DQ TNAK++E+ W+ GVRV+++ D  
Sbjct: 365 SHRAVGCFVTHCGWSSTLESLVLGVPVVAFPMWSDQPTNAKLLEESWKTGVRVRENKD-- 424

Query: 425 GVVEGKEIARCLRTVMDMEDYGKGRGKQLRINARKWQSLAMEAA--NGSSYMNLKAFVNK 472
           G+VE  EI RCL  VM+       +  +LR NA+KW+ LAMEA    GSS  N++AFV  
Sbjct: 425 GLVERGEIRRCLEAVME------EKSVELRENAKKWKRLAMEAGREGGSSDKNMEAFVED 455

BLAST of Bhi01G000853 vs. TAIR10
Match: AT3G21560.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 230.7 bits (587), Expect = 1.8e-60
Identity = 158/483 (32.71%), Positives = 242/483 (50.10%), Query Frame = 0

Query: 7   LLVSQSPKSHLNPTLHLASTLLSFGSKVTLLLTNS--------------ALKNISKDQLP 66
           +LVS   + H+NP L L   L S G  +T + T S               LK + K  L 
Sbjct: 14  MLVSFPGQGHVNPLLRLGKLLASKGLLITFVTTESWGKKMRISNKIQDRVLKPVGKGYLR 73

Query: 67  YGLSLSTFSDGF--DDGFTFSDFPRWCFEFERLGRLALVNLLSSSLQQGLLPFTCIVHTL 126
           Y      F DG   DD  + ++        E +G+  + NL+    +    P TC+++  
Sbjct: 74  YDF----FDDGLPEDDEASRTNLTILRPHLELVGKREIKNLVKRYKEVTKQPVTCLINNP 133

Query: 127 LIPWVARVARELHVPTAVLWIQSAAAFDVYYYYFNGYSDVIWNGYKDEGSNSLLFNIWLP 186
            + WV  VA +L +P AVLW+QS A             D      K E       ++ + 
Sbjct: 134 FVSWVCDVAEDLQIPCAVLWVQSCACLAXXXXXXXXLVDF---PTKTEPE----IDVQIS 193

Query: 187 GLPLMNVLDLPSFMVSDDYHGLILKSFQEKMQVLEEEENVSILVNSFDALEHDALSAIGK 246
           G+PL+   ++PSF+     H  + +   ++++ L   +  SI +++F++LE D +  +  
Sbjct: 194 GMPLLKHDEIPSFIHPSSPHSALREVIIDQIKRL--HKTFSIFIDTFNSLEKDIIDHMST 253

Query: 247 FNL----IPVGPLVSLPLEFEVSTKQRSTSYFQDGQQAQADYIKWLNSKPDSSVVYIAFG 306
            +L     P+GPL  +         + + S   D        ++WL+S+P SSVVYI+FG
Sbjct: 254 LSLPGVIRPLGPLYKMAKTVAYDVVKVNISEPTD------PCMEWLDSQPVSSVVYISFG 313

Query: 307 SISKLSNKQTKEIVGALLECSYPFLWALRMDDIQDENLSSYFDDELQAQGKIVPWCSQVE 366
           +++ L  +Q  EI   +L     FLW +R  ++          +E++ +GKIV WCSQ +
Sbjct: 314 TVAYLKQEQIDEIAYGVLNADVTFLWVIRQQELGFNKEKHVLPEEVKGKGKIVEWCSQEK 373

Query: 367 VLSHHSVGCFVTHCGWNSTIESVAAGVPMVAWPLWADQATNAKMMEDVWEIGVRVKKSSD 426
           VLSH SV CFVTHCGWNST+E+V++GVP V +P W DQ T+A  M DVW+ GVR+ +   
Sbjct: 374 VLSHPSVACFVTHCGWNSTMEAVSSGVPTVCFPQWGDQVTDAVYMIDVWKTGVRLSRGEA 433

Query: 427 GEGVVEGKEIARCLRTVMDMEDYGKGRGKQLRINARKW--QSLAMEAANGSSYMNLKAFV 468
            E +V  +E+A  LR V   E     +  +L+ NA KW  ++ A  A  GSS  NL+ FV
Sbjct: 434 EERLVPREEVAERLREVTKGE-----KAIELKKNALKWKEEAEAAVARGGSSDRNLEKFV 472

BLAST of Bhi01G000853 vs. Swiss-Prot
Match: sp|F8WKW0|UGT1_GARJA (Crocetin glucosyltransferase, chloroplastic OS=Gardenia jasminoides OX=114476 GN=UGT75L6 PE=1 SV=1)

HSP 1 Score: 398.3 bits (1022), Expect = 1.2e-109
Identity = 220/475 (46.32%), Positives = 297/475 (62.53%), Query Frame = 0

Query: 1   MEHGNFLLVSQSPKSHLNPTLHLASTLLSFGSKVTLLLTNSALKNISKD--QLPYGLSLS 60
           ++  + LL++   + H+NP L  A  LL  G +VTL  +  AL  + K     P GL+ +
Sbjct: 2   VQQRHVLLITYPAQGHINPALQFAQRLLRMGIQVTLATSVYALSRMKKSSGSTPKGLTFA 61

Query: 61  TFSDGFDDGFTFS--DFPRWCFEFERLGRLALVNLLSSSLQQGLLPFTCIVHTLLIPWVA 120
           TFSDG+DDGF     D   +     + G   L N++++S  QG  P TC+V+TLL+PW A
Sbjct: 62  TFSDGYDDGFRPKGVDHTEYMSSLAKQGSNTLRNVINTSADQG-CPVTCLVYTLLLPWAA 121

Query: 121 RVARELHVPTAVLWIQSAAAFDVYYYYFNGYSDVIWNGYKDEGSNSLLFNIWLPGLPLMN 180
            VARE H+P+A+LWIQ  A  D+YYYYF GY D + N      SN   ++I  PGLP M 
Sbjct: 122 TVARECHIPSALLWIQPVAVMDIYYYYFRGYEDDVKN-----NSNDPTWSIQFPGLPSMK 181

Query: 181 VLDLPSFMV--SDDYHGLILKSFQEKMQVLEEEENVSILVNSFDALEHDALSAIGKFNLI 240
             DLPSF++  SD+ +   L +F+++++ L+EEE   +LVN+FDALE  AL AI  +NLI
Sbjct: 182 AKDLPSFILPSSDNIYSFALPTFKKQLETLDEEERPKVLVNTFDALEPQALKAIESYNLI 241

Query: 241 PVGPLVSLPLEFEVSTKQRSTSYFQDGQQAQADYIKWLNSKPDSSVVYIAFGSISKLSNK 300
            +GPL   P  F        TS+  D  Q   DY +WLNS+P  SVVY++FGS+  L  +
Sbjct: 242 AIGPLT--PSAFLDGKDPSETSFSGDLFQKSKDYKEWLNSRPAGSVVYVSFGSLLTLPKQ 301

Query: 301 QTKEIVGALLECSYPFLWALRMDDIQDENLSS---YFDDELQAQGKIVPWCSQVEVLSHH 360
           Q +EI   LL+   PFLW +R  +  +E          +EL+ QG IVPWCSQ+EVL+H 
Sbjct: 302 QMEEIARGLLKSGRPFLWVIRAKENGEEEKEEDRLICMEELEEQGMIVPWCSQIEVLTHP 361

Query: 361 SVGCFVTHCGWNSTIESVAAGVPMVAWPLWADQATNAKMMEDVWEIGVRVKKSSDGEGVV 420
           S+GCFVTHCGWNST+E++  GVP+VA+P W DQ TNAK++EDVWE GVRV  + D  G V
Sbjct: 362 SLGCFVTHCGWNSTLETLVCGVPVVAFPHWTDQGTNAKLIEDVWETGVRVVPNED--GTV 421

Query: 421 EGKEIARCLRTVMDMEDYGKGRGKQLRINARKWQSLAMEA--ANGSSYMNLKAFV 465
           E  EI RC+ TVMD  +    +G +L+ NA+KW+ LA EA   +GSS  NLKAFV
Sbjct: 422 ESDEIKRCIETVMDDGE----KGVELKRNAKKWKELAREAMQEDGSSDKNLKAFV 462
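
The Swiss-Prot and TrEMBL match lines follow the UniProt FASTA header layout: database, accession, entry name, then a description with OS (organism), OX (taxonomy ID), GN (gene name), PE (protein existence level) and SV (sequence version) fields. The following is a minimal sketch of splitting such a line into its parts; the function name parse_match and the returned keys are hypothetical.

```python
import re

FIELD_KEYS = "OS|OX|GN|PE|SV"

def parse_match(line):
    """Split a 'Match: db|accession|entry (description ...)' line into fields."""
    m = re.fullmatch(r"Match:\s+(sp|tr)\|([^|]+)\|(\S+)\s+\((.*)\)\s*", line)
    if m is None:
        raise ValueError(f"unrecognised match line: {line!r}")
    db, accession, entry, desc = m.groups()
    # The protein name is everything before the first ' OS=' field.
    name = re.split(r"\s+OS=", desc, maxsplit=1)[0]
    # Each KEY=value field runs until the next KEY= or the end of the description.
    fields = dict(
        re.findall(rf"\b({FIELD_KEYS})=(.*?)(?=\s+(?:{FIELD_KEYS})=|$)", desc)
    )
    return {"db": db, "accession": accession, "entry": entry, "name": name, **fields}

hit = parse_match(
    "Match: sp|F8WKW0|UGT1_GARJA (Crocetin glucosyltransferase, chloroplastic "
    "OS=Gardenia jasminoides OX=114476 GN=UGT75L6 PE=1 SV=1)"
)
print(hit["accession"], hit["OS"], hit["GN"])
```

The TAIR10 match lines use a different layout (locus identifier plus description) and would need their own pattern.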

BLAST of Bhi01G000853 vs. Swiss-Prot
Match: sp|O23406|U75D1_ARATH (UDP-glycosyltransferase 75D1 OS=Arabidopsis thaliana OX=3702 GN=UGT75D1 PE=2 SV=2)

HSP 1 Score: 355.9 bits (912), Expect = 6.9e-97
Identity = 200/488 (40.98%), Positives = 301/488 (61.68%), Query Frame = 0

Query: 5   NFLLVSQSPKSHLNPTLHLASTLLS--FGSKVTLLLTNSAL--KNISKDQLPYGLSLSTF 64
           +FL V+   + H+NP+L LA  L     G++VT   + SA   +  S + +P  L  +T+
Sbjct: 13  HFLFVTFPAQGHINPSLELAKRLAGTISGARVTFAASISAYNRRMFSTENVPETLIFATY 72

Query: 65  SDGFDDGF---TFSDFPR------WCFEFERLGRLALVNLLSSSLQQGLLPFTCIVHTLL 124
           SDG DDGF    +SD  R      +  E  R G+  L  L+  + +Q   PFTC+V+T+L
Sbjct: 73  SDGHDDGFKSSAYSDKSRQDATGNFMSEMRRRGKETLTELIEDNRKQN-RPFTCVVYTIL 132

Query: 125 IPWVARVARELHVPTAVLWIQSAAAFDVYYYYFNGYSDVIWNGYKDEGSNSLLFNIWLPG 184
           + WVA +ARE H+P+A+LW+Q    F ++Y+YFNGY D I      E +N+   +I LP 
Sbjct: 133 LTWVAELAREFHLPSALLWVQPVTVFSIFYHYFNGYEDAI-----SEMANTPSSSIKLPS 192

Query: 185 LPLMNVLDLPSFMVSDDYHGLILKSFQEKMQVLEEEENVSILVNSFDALEHDALSAI-GK 244
           LPL+ V D+PSF+VS + +  +L +F+E++  L+EE N  IL+N+F  LE +A+S++   
Sbjct: 193 LPLLTVRDIPSFIVSSNVYAFLLPAFREQIDSLKEEINPKILINTFQELEPEAMSSVPDN 252

Query: 245 FNLIPVGPLVSLPLEFEVSTKQRSTSYFQDGQQAQADYIKWLNSKPDSSVVYIAFGSISK 304
           F ++PVGPL++L  +F                 ++ +YI+WL++K DSSV+Y++FG+++ 
Sbjct: 253 FKIVPVGPLLTLRTDF----------------SSRGEYIEWLDTKADSSVLYVSFGTLAV 312

Query: 305 LSNKQTKEIVGALLECSYPFLWAL------RMDDIQ--DENLSSYFDDELQAQGKIVPWC 364
           LS KQ  E+  AL++   PFLW +        +D Q  +E+  S F +EL   G +V WC
Sbjct: 313 LSKKQLVELCKALIQSRRPFLWVITDKSYRNKEDEQEKEEDCISSFREELDEIGMVVSWC 372

Query: 365 SQVEVLSHHSVGCFVTHCGWNSTIESVAAGVPMVAWPLWADQATNAKMMEDVWEIGVRV- 424
            Q  VL+H S+GCFVTHCGWNST+ES+ +GVP+VA+P W DQ  NAK++ED W+ GVRV 
Sbjct: 373 DQFRVLNHRSIGCFVTHCGWNSTLESLVSGVPVVAFPQWNDQMMNAKLLEDCWKTGVRVM 432

Query: 425 -KKSSDGEGVVEGKEIARCLRTVMDMEDYGKGRGKQLRINARKWQSLAMEAA--NGSSYM 467
            KK  +G  VV+ +EI RC+  VM+       + ++ R NA +W+ LA EA    GSS+ 
Sbjct: 433 EKKEEEGVVVVDSEEIRRCIEEVME------DKAEEFRGNATRWKDLAAEAVREGGSSFN 472

BLAST of Bhi01G000853 vs. Swiss-Prot
Match: sp|Q9ZR25|5GT_VERHY (Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase OS=Verbena hybrida OX=76714 GN=HGT8 PE=2 SV=1)

HSP 1 Score: 337.0 bits (863), Expect = 3.3e-91
Identity = 200/478 (41.84%), Positives = 291/478 (60.88%), Query Frame = 0

Query: 1   MEHGNFLLVSQSPKSHLNPTLHLASTLLSFGSKVTLLLTNSALKNISKDQLPYG--LSLS 60
           M   + LL +   + H+NP L  A  L +   +VT   +  A + +S+        ++  
Sbjct: 1   MSRAHVLLATFPAQGHINPALQFAKRLANADIQVTFFTSVYAWRRMSRTAAGSNGLINFV 60

Query: 61  TFSDGFDDGF-TFSDFPRWCFEFERLGRLALVN-LLSSSLQQGLLPFTCIVHTLLIPWVA 120
           +FSDG+DDG     D   +  E +  G  AL + L ++++ Q     T +V++ L  W A
Sbjct: 61  SFSDGYDDGLQPGDDGKNYMSEMKSRGIKALSDTLAANNVDQKSSKITFVVYSHLFAWAA 120

Query: 121 RVARELHVPTAVLWIQSAAAFDVYYYYFNGYSDVIWNGYKDEGSNSLLFNIWLP-GLPLM 180
           +VARE H+ +A+LWI+ A   D++Y+YFNGYSD I     D GS++    I LP GLP++
Sbjct: 121 KVAREFHLRSALLWIEPATVLDIFYFYFNGYSDEI-----DAGSDA----IHLPGGLPVL 180

Query: 181 NVLDLPSFMVSDDYHGLILKSFQEKMQVLEEEENVSILVNSFDALEHDALSAIGKFNLIP 240
              DLPSF++    H       +EK++ LE EE   +LVNSFDALE DAL AI K+ +I 
Sbjct: 181 AQRDLPSFLLPST-HERFRSLMKEKLETLEGEEKPKVLVNSFDALEPDALKAIDKYEMIA 240

Query: 241 VGPLVSLPLEF----EVSTKQRSTSYFQDGQQAQADYIKWLNSKPDSSVVYIAFGSISKL 300
           +GPL+  P  F    + S +      F+ G     D ++WL++ P SSVVY++FGS    
Sbjct: 241 IGPLI--PSAFLDGKDPSDRSFGGDLFEKGSN-DDDCLEWLSTNPRSSVVYVSFGSFVNT 300

Query: 301 SNKQTKEIVGALLECSYPFLWALRMDDIQDENLSSYFDDELQAQGKIVPWCSQVEVLSHH 360
           +  Q +EI   LL+C  PFLW +R+++ ++  +S    +EL+  GKIV WCSQ+EVL+H 
Sbjct: 301 TKSQMEEIARGLLDCGRPFLWVVRVNEGEEVLISCM--EELKRVGKIVSWCSQLEVLTHP 360

Query: 361 SVGCFVTHCGWNSTIESVAAGVPMVAWPLWADQATNAKMMEDVWEIGVRVKKSSDGEGVV 420
           S+GCFVTHCGWNST+ES++ GVPMVA+P W DQ TNAK+MEDVW  GVRV+ + +G  VV
Sbjct: 361 SLGCFVTHCGWNSTLESISFGVPMVAFPQWFDQGTNAKLMEDVWRTGVRVRANEEG-SVV 420

Query: 421 EGKEIARCLRTVMDMEDYGKGRGKQLRINARKWQSLAMEA--ANGSSYMNLKAFVNKL 468
           +G EI RC+  VMD    G  + ++LR +A KW+ LA +A   +GSS  NLK F++++
Sbjct: 421 DGDEIRRCIEEVMD----GGEKSRKLRESAGKWKDLARKAMEEDGSSVNNLKVFLDEV 458

BLAST of Bhi01G000853 vs. Swiss-Prot
Match: sp|Q0WW21|U75C1_ARATH (UDP-glycosyltransferase 75C1 OS=Arabidopsis thaliana OX=3702 GN=UGT75C1 PE=2 SV=2)

HSP 1 Score: 325.9 bits (834), Expect = 7.6e-88
Identity = 190/472 (40.25%), Positives = 284/472 (60.17%), Query Frame = 0

Query: 5   NFLLVSQSPKSHLNPTLHLASTLLSFGSKVTLLLTNSALKNISKDQLPYGLSLSTFSDGF 64
           ++LLV+   + H+NP L LA+ L+  G+ VT     SA + + +     GLS + F+DGF
Sbjct: 13  HYLLVTFPAQGHINPALQLANRLIHHGATVTYSTAVSAHRRMGEPPSTKGLSFAWFTDGF 72

Query: 65  DDGF-TFSDFPRWCFEFERLGRLALVNLLSSSLQ--QGLLPFTCIVHTLLIPWVARVARE 124
           DDG  +F D   +  E +R G  AL +++ ++L       P T +++++L+PWV+ VARE
Sbjct: 73  DDGLKSFEDQKIYMSELKRCGSNALRDIIKANLDATTETEPITGVIYSVLVPWVSTVARE 132

Query: 125 LHVPTAVLWIQSAAAFDVYYYYFNGYSDVIWNGYKDEGSNSLLFN---IWLPGLPLMNVL 184
            H+PT +LWI+ A   D+YYYYFN         YK       LF+   I LP LPL+   
Sbjct: 133 FHLPTTLLWIEPATVLDIYYYYFN-------TSYKH------LFDVEPIKLPKLPLITTG 192

Query: 185 DLPSFMVSDDYHGLILKSFQEKMQVLEEEENVSILVNSFDALEHDALSAIGKFNLIPVGP 244
           DLPSF+         L + +E ++ LE E N  ILVN+F ALEHDAL+++ K  +IP+GP
Sbjct: 193 DLPSFLQPSKALPSALVTLREHIEALETESNPKILVNTFSALEHDALTSVEKLKMIPIGP 252

Query: 245 LVSLPLEFEVSTKQRSTSYFQDGQQAQADYIKWLNSKPDSSVVYIAFGS-ISKLSNKQTK 304
           L        VS+ +  T  F+   +   DY KWL+SK + SV+YI+ G+    L  K  +
Sbjct: 253 L--------VSSSEGKTDLFKSSDE---DYTKWLDSKLERSVIYISLGTHADDLPEKHME 312

Query: 305 EIVGALLECSYPFLWALRMDDIQDENLSSYFD-DELQAQGKIVPWCSQVEVLSHHSVGCF 364
            +   +L  + PFLW +R  + +++  + + +      +G +V WCSQ  VL+H +VGCF
Sbjct: 313 ALTHGVLATNRPFLWIVREKNPEEKKKNRFLELIRGSDRGLVVGWCSQTAVLAHCAVGCF 372

Query: 365 VTHCGWNSTIESVAAGVPMVAWPLWADQATNAKMMEDVWEIGVRVKKSSDGEGVVEGKEI 424
           VTHCGWNST+ES+ +GVP+VA+P +ADQ T AK++ED W IGV+VK   +G+  V+G+EI
Sbjct: 373 VTHCGWNSTLESLESGVPVVAFPQFADQCTTAKLVEDTWRIGVKVKVGEEGD--VDGEEI 432

Query: 425 ARCLRTVMDMEDYGKGRGKQLRINARKWQSLAMEAA--NGSSYMNLKAFVNK 467
            RCL  VM     G    +++R NA KW+++A++AA   G S +NLK FV++
Sbjct: 433 RRCLEKVMS----GGEEAEEMRENAEKWKAMAVDAAAEGGPSDLNLKGFVDE 454

BLAST of Bhi01G000853 vs. Swiss-Prot
Match: sp|Q9ZVY5|U75B2_ARATH (UDP-glycosyltransferase 75B2 OS=Arabidopsis thaliana OX=3702 GN=UGT75B2 PE=2 SV=1)

HSP 1 Score: 319.3 bits (817), Expect = 7.1e-86
Identity = 196/487 (40.25%), Positives = 281/487 (57.70%), Query Frame = 0

Query: 1   MEHGNFLLVSQSPKSHLNPTLHLASTLL-SFGSKVTLLLTNSALKNISKDQLP-----YG 60
           M   +FLLV+   + H+NP+L  A  L+ + G++VT     + L  I +  +P       
Sbjct: 1   MAQPHFLLVTFPAQGHVNPSLRFARRLIKTTGARVTFA---TCLSVIHRSMIPNHNNVEN 60

Query: 61  LSLSTFSDGFDDGF--TFSDFPRWCFEFERLGRLALVNLLSSSLQQGLLPFTCIVHTLLI 120
           LS  TFSDGFDDG      D       FER G  AL + + ++ Q G  P +C+++T+L 
Sbjct: 61  LSFLTFSDGFDDGVISNTDDVQNRLVHFERNGDKALSDFIEAN-QNGDSPVSCLIYTILP 120

Query: 121 PWVARVARELHVPTAVLWIQSAAAFDVYYYYFNGYSDVIWNGYKDEGSNSLLFNIWLPGL 180
            WV +VAR  H+P+  LWIQ A AFD+YY Y  G              N+ +F    P L
Sbjct: 121 NWVPKVARRFHLPSVHLWIQPAFAFDIYYNYSTG--------------NNSVFE--FPNL 180

Query: 181 PLMNVLDLPSFMVSDDYHGLILKSFQEKMQVLEEEENVSILVNSFDALEHDALSAIGKFN 240
           P + + DLPSF+   + +      +QE M  L+EE N  ILVN+FD+LE + L+AI    
Sbjct: 181 PSLEIRDLPSFLSPSNTNKAAQAVYQELMDFLKEESNPKILVNTFDSLEPEFLTAIPNIE 240

Query: 241 LIPVGPLVSLPLEFEVSTKQRSTSYFQDGQQAQADYIKWLNSKPDSSVVYIAFGSISKLS 300
           ++ VGPL    L  E+ T   S        Q+ + Y  WL+SK +SSV+Y++FG++ +LS
Sbjct: 241 MVAVGPL----LPAEIFTGSESGKDLSRDHQS-SSYTLWLDSKTESSVIYVSFGTMVELS 300

Query: 301 NKQTKEIVGALLECSYPFLWAL-----RMDDIQDENLSSY-----FDDELQAQGKIVPWC 360
            KQ +E+  AL+E   PFLW +     R   I+ E  +       F  EL+  G IV WC
Sbjct: 301 KKQIEELARALIEGGRPFLWVITDKLNREAKIEGEEETEIEKIAGFRHELEEVGMIVSWC 360

Query: 361 SQVEVLSHHSVGCFVTHCGWNSTIESVAAGVPMVAWPLWADQATNAKMMEDVWEIGVRVK 420
           SQ+EVL H ++GCF+THCGW+S++ES+  GVP+VA+P+W+DQ  NAK++E++W+ GVRV+
Sbjct: 361 SQIEVLRHRAIGCFLTHCGWSSSLESLVLGVPVVAFPMWSDQPANAKLLEEIWKTGVRVR 420

Query: 421 KSSDGEGVVEGKEIARCLRTVMDMEDYGKGRGKQLRINARKWQSLAMEAA--NGSSYMNL 468
           ++S  EG+VE  EI RCL  VM+       +  +LR NA KW+ LA EA    GSS  N+
Sbjct: 421 ENS--EGLVERGEIMRCLEAVME------AKSVELRENAEKWKRLATEAGREGGSSDKNV 454

BLAST of Bhi01G000853 vs. TrEMBL
Match: tr|A0A1S3AYM5|A0A1S3AYM5_CUCME (crocetin glucosyltransferase, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103484205 PE=4 SV=1)

HSP 1 Score: 746.9 bits (1927), Expect = 2.8e-212
Identity = 378/472 (80.08%), Positives = 407/472 (86.23%), Query Frame = 0

Query: 1   MEHGNFLLVSQSPKSHLNPTLHLASTLLSFGSKVTLLLTNSALKNISKDQLPYGLSLSTF 60
           MEHGNFLLVSQSP SHLNPTLHLASTLLS GSKVTLL+TN ALKNISKDQLP GLSLSTF
Sbjct: 1   MEHGNFLLVSQSPTSHLNPTLHLASTLLSLGSKVTLLITNHALKNISKDQLPSGLSLSTF 60

Query: 61  SDGFDDGFTFSDFPRWCFEFERLGRLALVNLLSSSLQQGLLPFTCIVHTLLIPWVARVAR 120
           S  FD+GFT+SDF  WC EFERLGRLALV+LLSSS QQGLLP TCIV+TLLIPWVA+VAR
Sbjct: 61  SYSFDNGFTYSDFQLWCVEFERLGRLALVDLLSSSSQQGLLPITCIVNTLLIPWVAQVAR 120

Query: 121 ELHVPTAVLWIQSAAAFDVYYYYFNGYSDVIWNGYKDEGSNSLLFNIWLPGLPLMNVLDL 180
           E HV TAVLWIQS A FDVYYYYFNGY+DVI NGYK++ SN L  NIWLPGLPLMN    
Sbjct: 121 EFHVSTAVLWIQSVAVFDVYYYYFNGYNDVIRNGYKEDDSNLLSSNIWLPGLPLMN---- 180

Query: 181 PSFMVSDDYHGLILKSFQEKMQVLEEEENVSILVNSFDALEHDALSAIGKFNLIPVGPLV 240
                          SF+EKMQ+ +EE+NV ILVNSFDALEHDALSAIG FNLIP+GPLV
Sbjct: 181 ---------------SFEEKMQIFKEEDNVPILVNSFDALEHDALSAIGTFNLIPIGPLV 240

Query: 241 SLPLEFEVSTKQRSTSYFQDGQQAQADYIKWLNSKPDSSVVYIAFGSISKLSNKQTKEIV 300
           SLPL  EVSTKQ+S S FQDGQQA+ D IKWLNSKPDSSVVYIAFGSISKLS +QTKEIV
Sbjct: 241 SLPLGCEVSTKQQSISCFQDGQQAREDCIKWLNSKPDSSVVYIAFGSISKLSKEQTKEIV 300

Query: 301 GALLECSYPFLWALRMDDIQDENLSSYFDDELQAQGKIVPWCSQVEVLSHHSVGCFVTHC 360
           GA LECSYPFLW+LRMDDI+DENLSSYF+ ELQAQGKIVPWCSQVE+LSH SVGCFVTHC
Sbjct: 301 GAFLECSYPFLWSLRMDDIRDENLSSYFNVELQAQGKIVPWCSQVEILSHRSVGCFVTHC 360

Query: 361 GWNSTIESVAAGVPMVAWPLWADQATNAKMMEDVWEIGVRVKKSSDGEGVVEGKEIARCL 420
           GWN TIE VA GVP VAW LWADQATNAKMMEDVW+IGVRVKKSSDGEG+VE KEI RCL
Sbjct: 361 GWNFTIECVAVGVPTVAWLLWADQATNAKMMEDVWKIGVRVKKSSDGEGMVERKEITRCL 420

Query: 421 RTVMDMEDYGKGRGKQLRINARKWQSLAMEAANGSSYMNLKAFVNKLSDQAN 473
           R +MDMED  KG+GKQLRINA KWQ LAMEAANGSS++NLKAFVNK+ D+AN
Sbjct: 421 RMIMDMEDDSKGKGKQLRINATKWQRLAMEAANGSSFVNLKAFVNKVCDEAN 453

BLAST of Bhi01G000853 vs. TrEMBL
Match: tr|A0A0A0L890|A0A0A0L890_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G172390 PE=4 SV=1)

HSP 1 Score: 536.2 bits (1380), Expect = 7.5e-149
Identity = 277/367 (75.48%), Positives = 299/367 (81.47%), Query Frame = 0

Query: 1   MEHGNFLLVSQSPKSHLNPTLHLASTLLSFGSKVTLLLTNSALKNISKDQLPYGLSLSTF 60
           M+HGNFLLVSQSP SHLNPTLH ASTLLS GSKVTLLLTN ALKNIS+DQLP GLSLSTF
Sbjct: 1   MKHGNFLLVSQSPTSHLNPTLHFASTLLSLGSKVTLLLTNHALKNISEDQLPSGLSLSTF 60

Query: 61  SDGFDDGFTFSDFPRWCFEFERLGRLALVNLLSSSLQQGLLPFTCIVHTLLIPWVARVAR 120
           SDGFD+GFT+SD   W  EFERLGR ALVNLLSSS +QGLLP TCIV+TLLIPWVA+VAR
Sbjct: 61  SDGFDNGFTYSDLQLWFVEFERLGRAALVNLLSSSSKQGLLPITCIVNTLLIPWVAQVAR 120

Query: 121 ELHVPTAVLWIQSAAAFDVYYYYFNGYSDVIWNGYKDEGSNSLLFNIWLPGLPLMNVLDL 180
           E HV TA+LW QS A FDVYYYYFNGYS VI NGYK++ SNSL FNI LPGLPLMNVLDL
Sbjct: 121 EFHVSTAILWTQSVAVFDVYYYYFNGYSGVIRNGYKEDDSNSLSFNISLPGLPLMNVLDL 180

Query: 181 PSFMVSDDYHGLILKSFQEKMQVLEEEENVSILVNSFDALEHDALSAIGKFNLIPVGPLV 240
           PSFMVSDD+HGLI+KSF+EK+Q+L+EE+NV ILVNSFDALEHDALSAIG FNLIP+GP V
Sbjct: 181 PSFMVSDDHHGLIIKSFEEKIQILKEEDNVPILVNSFDALEHDALSAIGTFNLIPIGPSV 240

Query: 241 SLPLEFEVSTKQRSTSYFQDGQQAQADYIKWLNSKPDSSVVYIAFGSISKLSNKQTKEIV 300
            LPL  E   KQR+ SYFQDGQQAQ DYIKWLNSKPDSSVVYIAFGS SKLS +QTKE+V
Sbjct: 241 LLPLGCE---KQRNISYFQDGQQAQEDYIKWLNSKPDSSVVYIAFGSFSKLSKEQTKEMV 300

Query: 301 GALLECSYPFLWALRMDDIQDENLSSYFDDELQAQGKIVPWCSQVEVLSHHSVGCFVTHC 360
           GALLECSY                               PWCSQVEVLSH +VGCFVTHC
Sbjct: 301 GALLECSY-------------------------------PWCSQVEVLSHRAVGCFVTHC 333

Query: 361 GWNSTIE 368
           GWNSTIE
Sbjct: 361 GWNSTIE 333

BLAST of Bhi01G000853 vs. TrEMBL
Match: tr|A0A2P4IMI9|A0A2P4IMI9_QUESU (Glycosyltransferase OS=Quercus suber OX=58331 GN=CFP56_58469 PE=3 SV=1)

HSP 1 Score: 427.2 bits (1097), Expect = 4.9e-116
Identity = 233/471 (49.47%), Positives = 311/471 (66.03%), Query Frame = 0

Query: 5   NFLLVSQSPKSHLNPTLHLASTLLSFGSKVTLLLTNSALKNISKDQLPYGLSLSTFSDGF 64
           + LLV+   + H+NP+L  A  L+  G++VT  +T SA + + K   P GLS  TFSDG+
Sbjct: 3   HILLVTFPAQGHVNPSLQFAKRLIHLGAQVTFAITISAHRRMIKSPPPDGLSFVTFSDGY 62

Query: 65  DDGFTFSDFPRWCFEFERLGRLALVNLLSSSLQQGLLPFTCIVHTLLIPWVARVARELHV 124
           DDGF+  D      +F+  G   L +L+ SS  QG  P TC+V+TLL+PW A VARE+H+
Sbjct: 63  DDGFSLDDAQNHFDQFKCNGSKTLTHLIVSSANQG-RPITCLVYTLLLPWAADVAREVHL 122

Query: 125 PTAVLWIQSAAAFDVYYYYFNGYSDVIWNGYKDEGSNSLLFNIWLPGLPLMNVLDLPSFM 184
           P+ +LWIQ A   D+YYYYFNG++DVI N   D  S+S    I LPGLPL+   DLPSF+
Sbjct: 123 PSTLLWIQPAMVLDIYYYYFNGFADVIRNDNNDYPSSS----IKLPGLPLLTSRDLPSFL 182

Query: 185 VSDDYHGLILKSFQEKMQVLEEEENVSILVNSFDALEHDALSAIGKFNLIPVGPLVSLPL 244
           ++ + H   L   Q   + LE+E N  +LVN+FDALE +AL AI +FNL+ VGPL+    
Sbjct: 183 LASNTHTFALPIIQAHFEALEKENNPRVLVNTFDALEPEALKAIERFNLVAVGPLLPSDK 242

Query: 245 EFEVSTKQRSTSYFQDGQQAQADYIKWLNSKPDSSVVYIAFGSISKLSNKQTKEIVGALL 304
            F              G     DYI+WLNSKP+SSV+Y++FGS++ L  +Q +EI   LL
Sbjct: 243 SF--------------GGDVSKDYIEWLNSKPESSVIYVSFGSLAVLMKQQMEEIARGLL 302

Query: 305 ECSYPFLWALR-MDDIQDENLSSYFDDELQAQGKIVPWCSQVEVLSHHSVGCFVTHCGWN 364
            C  PFLW +R  ++ ++E LS    + L+  G IVPWCSQVEVLSH S+GCFVTHCGWN
Sbjct: 303 GCGRPFLWVIRAKENGEEEKLSC--REVLEQMGMIVPWCSQVEVLSHPSLGCFVTHCGWN 362

Query: 365 STIESVAAGVPMVAWPLWADQATNAKMMEDVWEIGVRVKKSSDGEGVVEGKEIARCLRTV 424
           ST+ES+ +GVPMVA+P W+DQ TNAK++EDVW+ GVR+  + D  G+VEG EI RCL  V
Sbjct: 363 STLESLVSGVPMVAFPQWSDQVTNAKLIEDVWKTGVRMIVNKD--GIVEGDEIKRCLELV 422

Query: 425 MDMEDYGKG-RGKQLRINARKWQSLAMEAAN--GSSYMNLKAFVNKLSDQA 472
           +     G G RG+ +R NA+KW+ LAMEAAN  GSSY NLK FV+++ + A
Sbjct: 423 V-----GDGERGEAIRRNAKKWKELAMEAANEGGSSYNNLKDFVDEIGNVA 445

BLAST of Bhi01G000853 vs. TrEMBL
Match: tr|A0A2K1ZXB2|A0A2K1ZXB2_POPTR (Glycosyltransferase OS=Populus trichocarpa OX=3694 GN=POPTR_006G055600v3 PE=3 SV=1)

HSP 1 Score: 425.2 bits (1092), Expect = 1.9e-115
Identity = 236/474 (49.79%), Positives = 315/474 (66.46%), Query Frame = 0

Query: 5   NFLLVSQSPKSHLNPTLHLASTLLSFGSKVTLLLTNSALKNISK-DQLPYGLSLSTFSDG 64
           + LLV+   + H+NP L  A  L++ G+ VT   +  A + +SK    P GLS + F DG
Sbjct: 9   HILLVTFPAQGHINPALQFAKRLVAIGAHVTFSTSMGAARRMSKTGTYPKGLSFAAFDDG 68

Query: 65  FDDGFTFS-DFPRWCFEFERLGRLALVNLLSSSLQQGLLPFTCIVHTLLIPWVARVAREL 124
            + GF  S D   +  E   +G  +L  L+++S + G  PFTC+V++ L+PWVA+VAREL
Sbjct: 69  SEHGFRPSDDIDHYFTELRLVGSKSLAELIAASSKNG-RPFTCVVYSNLVPWVAKVAREL 128

Query: 125 HVPTAVLWIQSAAAFDVYYYYFNGYSDVIWNGYKDEGSNSLLFNIWLPGLPLMNVLDLPS 184
           ++P+ +LW QS A  D++YYYFNGY D I      E  N   F++ LPGLP +   DLPS
Sbjct: 129 NLPSTLLWNQSPALLDIFYYYFNGYGDTI-----SENINDPTFSLKLPGLPPLGSRDLPS 188

Query: 185 FMVSDDYHGLILKSFQEKMQVLEEEENVSILVNSFDALEHDALSAIGKFNLIPVGPLVSL 244
           F    + H   +   +E ++VL+EE N  +LVN+FDALE +AL++IGKF L+ VGPL+  
Sbjct: 189 FFNPRNTHAFAIPVNREHIEVLDEETNPKVLVNTFDALECEALNSIGKFKLVGVGPLI-- 248

Query: 245 PLEFEVSTKQRSTSYFQDGQQAQADYIKWLNSKPDSSVVYIAFGSISKLSNKQTKEIVGA 304
           P  F        TS+  D  Q   D+I+WLNSKP+SSV+YIAFGSIS LS  Q +E+  A
Sbjct: 249 PSAFLDGEDPTDTSFGGDLFQGSKDHIEWLNSKPESSVIYIAFGSISALSKPQKEEMARA 308

Query: 305 LLECSYPFLWALRMD---DIQDENLSSYFDDELQAQGKIVPWCSQVEVLSHHSVGCFVTH 364
           LLE   PFLW +R D   + +++ LS    +EL+ QGKIVPWCSQVEVLSH S+GCFVTH
Sbjct: 309 LLETGRPFLWVIRADRGEEKEEDKLSC--KEELEKQGKIVPWCSQVEVLSHPSIGCFVTH 368

Query: 365 CGWNSTIESVAAGVPMVAWPLWADQATNAKMMEDVWEIGVRVKKSSDGEGVVEGKEIARC 424
           CGWNST ES+A+GVPMVA+P W DQ TNAKM+EDVW+ GVRV  SS+ EGVVEG+EI RC
Sbjct: 369 CGWNSTFESLASGVPMVAFPQWTDQLTNAKMVEDVWKTGVRV-TSSNKEGVVEGEEIERC 428

Query: 425 LRTVMDMEDYGKGRGKQLRINARKWQSLAMEAA--NGSSYMNLKAFVNKLSDQA 472
           L  VM     G  RG ++R NA+KW+ LA +++   GSSY NLKAFV++++  A
Sbjct: 429 LEVVMG----GGERGNEMRKNAKKWKELARQSSKEGGSSYNNLKAFVDEIAGVA 467

BLAST of Bhi01G000853 vs. TrEMBL
Match: tr|F6I4F8|F6I4F8_VITVI (Glycosyltransferase OS=Vitis vinifera OX=29760 GN=VIT_05s0062g00710 PE=3 SV=1)

HSP 1 Score: 416.4 bits (1069), Expect = 8.6e-113
Identity = 229/468 (48.93%), Positives = 309/468 (66.03%), Query Frame = 0

Query: 7   LLVSQSPKSHLNPTLHLASTLLSFGSKVTLLLTNSALKNISKDQLPYGLSLSTFSDGFDD 66
           LLV+   + H+NP+L LA  L+  G+ VT + ++SA   +SK     GL   TFSDG+D 
Sbjct: 6   LLVTYPAQGHINPSLQLAKLLIRAGAHVTFVTSSSAGTRMSKSPTLDGLEFVTFSDGYDH 65

Query: 67  GFTFSD-FPRWCFEFERLGRLALVNLLSSSLQQGLLPFTCIVHTLLIPWVARVARELHVP 126
           GF   D    +  E ERLG  AL  L+ +   +G  PFTC+++ +LIPWVA VAR LH+P
Sbjct: 66  GFDHGDGLQNFMSELERLGSPALTKLIMARANEG-RPFTCLLYGMLIPWVAEVARSLHLP 125

Query: 127 TAVLWIQSAAAFDVYYYYFNGYSDVIWNGYKDEGSNSLLFNIWLPGLPLMNVLDLPSFMV 186
           +A++W Q AA FD+YYYYFNGY ++I  G K  GS+S   +I LPGLPL++  DLPSF+V
Sbjct: 126 SALVWSQPAAVFDIYYYYFNGYGELI--GNKGNGSSS---SIELPGLPLISSSDLPSFLV 185

Query: 187 SD--DYHGLILKSFQEKMQVLEEEENVSILVNSFDALEHDALSAIGKFNLIPVGPLVSLP 246
                 H  +LK  Q++++ L  E N  +LVNSFDALE +AL AI KF L+ +GPL  LP
Sbjct: 186 PSKVSAHNFVLKLHQKQLEQLNRESNPRVLVNSFDALESEALRAINKFKLMGIGPL--LP 245

Query: 247 LEFEVSTKQRSTSYFQDGQQAQADYIKWLNSKPDSSVVYIAFGSISKLSNKQTKEIVGAL 306
             F        TS+  D  +   DYI+WLNS  +SSV+Y++FGS+S LS +Q++EI   L
Sbjct: 246 SAFLDGKDPSDTSFGGDLFRGSKDYIQWLNSNAESSVIYVSFGSLSVLSKQQSEEIARGL 305

Query: 307 LECSYPFLWALRMDDIQDENLSSYFD--DELQAQGKIVPWCSQVEVLSHHSVGCFVTHCG 366
           L+   PFLW +R  + ++E         +EL+  G IVPWCSQVEVLSH S+GCFV+HCG
Sbjct: 306 LDSGRPFLWVIRAKENEEEEKEDKLSCVEELEQLGMIVPWCSQVEVLSHPSLGCFVSHCG 365

Query: 367 WNSTIESVAAGVPMVAWPLWADQATNAKMMEDVWEIGVRVKKSSDGEGVVEGKEIARCLR 426
           WNST+ES+A+GVP+VA+P W DQ TNAK++EDVW+ G+RV  +   EG+VEG EI +CL 
Sbjct: 366 WNSTLESLASGVPVVAFPQWTDQTTNAKLIEDVWKTGLRVMVNQ--EGIVEGGEIKKCLE 425

Query: 427 TVMDMEDYGKGRGKQLRINARKWQSLAMEAA--NGSSYMNLKAFVNKL 468
            VM     G  RG+++R NA+KW+ LA EA    GSS  NLK FV+++
Sbjct: 426 LVMG----GGERGQEVRSNAKKWKDLAREAVKDGGSSDKNLKNFVDEI 459

BLAST of Bhi01G000853 vs. NCBI nr
Match: XP_008439390.2 (PREDICTED: crocetin glucosyltransferase, chloroplastic [Cucumis melo])

HSP 1 Score: 746.9 bits (1927), Expect = 4.2e-212
Identity = 378/472 (80.08%), Positives = 407/472 (86.23%), Query Frame = 0

Query: 1   MEHGNFLLVSQSPKSHLNPTLHLASTLLSFGSKVTLLLTNSALKNISKDQLPYGLSLSTF 60
           MEHGNFLLVSQSP SHLNPTLHLASTLLS GSKVTLL+TN ALKNISKDQLP GLSLSTF
Sbjct: 1   MEHGNFLLVSQSPTSHLNPTLHLASTLLSLGSKVTLLITNHALKNISKDQLPSGLSLSTF 60

Query: 61  SDGFDDGFTFSDFPRWCFEFERLGRLALVNLLSSSLQQGLLPFTCIVHTLLIPWVARVAR 120
           S  FD+GFT+SDF  WC EFERLGRLALV+LLSSS QQGLLP TCIV+TLLIPWVA+VAR
Sbjct: 61  SYSFDNGFTYSDFQLWCVEFERLGRLALVDLLSSSSQQGLLPITCIVNTLLIPWVAQVAR 120

Query: 121 ELHVPTAVLWIQSAAAFDVYYYYFNGYSDVIWNGYKDEGSNSLLFNIWLPGLPLMNVLDL 180
           E HV TAVLWIQS A FDVYYYYFNGY+DVI NGYK++ SN L  NIWLPGLPLMN    
Sbjct: 121 EFHVSTAVLWIQSVAVFDVYYYYFNGYNDVIRNGYKEDDSNLLSSNIWLPGLPLMN---- 180

Query: 181 PSFMVSDDYHGLILKSFQEKMQVLEEEENVSILVNSFDALEHDALSAIGKFNLIPVGPLV 240
                          SF+EKMQ+ +EE+NV ILVNSFDALEHDALSAIG FNLIP+GPLV
Sbjct: 181 ---------------SFEEKMQIFKEEDNVPILVNSFDALEHDALSAIGTFNLIPIGPLV 240

Query: 241 SLPLEFEVSTKQRSTSYFQDGQQAQADYIKWLNSKPDSSVVYIAFGSISKLSNKQTKEIV 300
           SLPL  EVSTKQ+S S FQDGQQA+ D IKWLNSKPDSSVVYIAFGSISKLS +QTKEIV
Sbjct: 241 SLPLGCEVSTKQQSISCFQDGQQAREDCIKWLNSKPDSSVVYIAFGSISKLSKEQTKEIV 300

Query: 301 GALLECSYPFLWALRMDDIQDENLSSYFDDELQAQGKIVPWCSQVEVLSHHSVGCFVTHC 360
           GA LECSYPFLW+LRMDDI+DENLSSYF+ ELQAQGKIVPWCSQVE+LSH SVGCFVTHC
Sbjct: 301 GAFLECSYPFLWSLRMDDIRDENLSSYFNVELQAQGKIVPWCSQVEILSHRSVGCFVTHC 360

Query: 361 GWNSTIESVAAGVPMVAWPLWADQATNAKMMEDVWEIGVRVKKSSDGEGVVEGKEIARCL 420
           GWN TIE VA GVP VAW LWADQATNAKMMEDVW+IGVRVKKSSDGEG+VE KEI RCL
Sbjct: 361 GWNFTIECVAVGVPTVAWLLWADQATNAKMMEDVWKIGVRVKKSSDGEGMVERKEITRCL 420

Query: 421 RTVMDMEDYGKGRGKQLRINARKWQSLAMEAANGSSYMNLKAFVNKLSDQAN 473
           R +MDMED  KG+GKQLRINA KWQ LAMEAANGSS++NLKAFVNK+ D+AN
Sbjct: 421 RMIMDMEDDSKGKGKQLRINATKWQRLAMEAANGSSFVNLKAFVNKVCDEAN 453

BLAST of Bhi01G000853 vs. NCBI nr
Match: XP_004147672.1 (PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Cucumis sativus] >KGN57234.1 hypothetical protein Csa_3G172390 [Cucumis sativus])

HSP 1 Score: 536.2 bits (1380), Expect = 1.1e-148
Identity = 277/367 (75.48%), Positives = 299/367 (81.47%), Query Frame = 0

Query: 1   MEHGNFLLVSQSPKSHLNPTLHLASTLLSFGSKVTLLLTNSALKNISKDQLPYGLSLSTF 60
           M+HGNFLLVSQSP SHLNPTLH ASTLLS GSKVTLLLTN ALKNIS+DQLP GLSLSTF
Sbjct: 1   MKHGNFLLVSQSPTSHLNPTLHFASTLLSLGSKVTLLLTNHALKNISEDQLPSGLSLSTF 60

Query: 61  SDGFDDGFTFSDFPRWCFEFERLGRLALVNLLSSSLQQGLLPFTCIVHTLLIPWVARVAR 120
           SDGFD+GFT+SD   W  EFERLGR ALVNLLSSS +QGLLP TCIV+TLLIPWVA+VAR
Sbjct: 61  SDGFDNGFTYSDLQLWFVEFERLGRAALVNLLSSSSKQGLLPITCIVNTLLIPWVAQVAR 120

Query: 121 ELHVPTAVLWIQSAAAFDVYYYYFNGYSDVIWNGYKDEGSNSLLFNIWLPGLPLMNVLDL 180
           E HV TA+LW QS A FDVYYYYFNGYS VI NGYK++ SNSL FNI LPGLPLMNVLDL
Sbjct: 121 EFHVSTAILWTQSVAVFDVYYYYFNGYSGVIRNGYKEDDSNSLSFNISLPGLPLMNVLDL 180

Query: 181 PSFMVSDDYHGLILKSFQEKMQVLEEEENVSILVNSFDALEHDALSAIGKFNLIPVGPLV 240
           PSFMVSDD+HGLI+KSF+EK+Q+L+EE+NV ILVNSFDALEHDALSAIG FNLIP+GP V
Sbjct: 181 PSFMVSDDHHGLIIKSFEEKIQILKEEDNVPILVNSFDALEHDALSAIGTFNLIPIGPSV 240

Query: 241 SLPLEFEVSTKQRSTSYFQDGQQAQADYIKWLNSKPDSSVVYIAFGSISKLSNKQTKEIV 300
            LPL  E   KQR+ SYFQDGQQAQ DYIKWLNSKPDSSVVYIAFGS SKLS +QTKE+V
Sbjct: 241 LLPLGCE---KQRNISYFQDGQQAQEDYIKWLNSKPDSSVVYIAFGSFSKLSKEQTKEMV 300

Query: 301 GALLECSYPFLWALRMDDIQDENLSSYFDDELQAQGKIVPWCSQVEVLSHHSVGCFVTHC 360
           GALLECSY                               PWCSQVEVLSH +VGCFVTHC
Sbjct: 301 GALLECSY-------------------------------PWCSQVEVLSHRAVGCFVTHC 333

Query: 361 GWNSTIE 368
           GWNSTIE
Sbjct: 361 GWNSTIE 333

BLAST of Bhi01G000853 vs. NCBI nr
Match: XP_023885287.1 (crocetin glucosyltransferase, chloroplastic-like [Quercus suber] >POE69741.1 crocetin glucosyltransferase, chloroplastic [Quercus suber])

HSP 1 Score: 427.2 bits (1097), Expect = 7.4e-116
Identity = 233/471 (49.47%), Positives = 311/471 (66.03%), Query Frame = 0

Query: 5   NFLLVSQSPKSHLNPTLHLASTLLSFGSKVTLLLTNSALKNISKDQLPYGLSLSTFSDGF 64
           + LLV+   + H+NP+L  A  L+  G++VT  +T SA + + K   P GLS  TFSDG+
Sbjct: 3   HILLVTFPAQGHVNPSLQFAKRLIHLGAQVTFAITISAHRRMIKSPPPDGLSFVTFSDGY 62

Query: 65  DDGFTFSDFPRWCFEFERLGRLALVNLLSSSLQQGLLPFTCIVHTLLIPWVARVARELHV 124
           DDGF+  D      +F+  G   L +L+ SS  QG  P TC+V+TLL+PW A VARE+H+
Sbjct: 63  DDGFSLDDAQNHFDQFKCNGSKTLTHLIVSSANQG-RPITCLVYTLLLPWAADVAREVHL 122

Query: 125 PTAVLWIQSAAAFDVYYYYFNGYSDVIWNGYKDEGSNSLLFNIWLPGLPLMNVLDLPSFM 184
           P+ +LWIQ A   D+YYYYFNG++DVI N   D  S+S    I LPGLPL+   DLPSF+
Sbjct: 123 PSTLLWIQPAMVLDIYYYYFNGFADVIRNDNNDYPSSS----IKLPGLPLLTSRDLPSFL 182

Query: 185 VSDDYHGLILKSFQEKMQVLEEEENVSILVNSFDALEHDALSAIGKFNLIPVGPLVSLPL 244
           ++ + H   L   Q   + LE+E N  +LVN+FDALE +AL AI +FNL+ VGPL+    
Sbjct: 183 LASNTHTFALPIIQAHFEALEKENNPRVLVNTFDALEPEALKAIERFNLVAVGPLLPSDK 242

Query: 245 EFEVSTKQRSTSYFQDGQQAQADYIKWLNSKPDSSVVYIAFGSISKLSNKQTKEIVGALL 304
            F              G     DYI+WLNSKP+SSV+Y++FGS++ L  +Q +EI   LL
Sbjct: 243 SF--------------GGDVSKDYIEWLNSKPESSVIYVSFGSLAVLMKQQMEEIARGLL 302

Query: 305 ECSYPFLWALR-MDDIQDENLSSYFDDELQAQGKIVPWCSQVEVLSHHSVGCFVTHCGWN 364
            C  PFLW +R  ++ ++E LS    + L+  G IVPWCSQVEVLSH S+GCFVTHCGWN
Sbjct: 303 GCGRPFLWVIRAKENGEEEKLSC--REVLEQMGMIVPWCSQVEVLSHPSLGCFVTHCGWN 362

Query: 365 STIESVAAGVPMVAWPLWADQATNAKMMEDVWEIGVRVKKSSDGEGVVEGKEIARCLRTV 424
           ST+ES+ +GVPMVA+P W+DQ TNAK++EDVW+ GVR+  + D  G+VEG EI RCL  V
Sbjct: 363 STLESLVSGVPMVAFPQWSDQVTNAKLIEDVWKTGVRMIVNKD--GIVEGDEIKRCLELV 422

Query: 425 MDMEDYGKG-RGKQLRINARKWQSLAMEAAN--GSSYMNLKAFVNKLSDQA 472
           +     G G RG+ +R NA+KW+ LAMEAAN  GSSY NLK FV+++ + A
Sbjct: 423 V-----GDGERGEAIRRNAKKWKELAMEAANEGGSSYNNLKDFVDEIGNVA 445

BLAST of Bhi01G000853 vs. NCBI nr
Match: XP_002308970.2 (crocetin glucosyltransferase, chloroplastic [Populus trichocarpa] >PNT29916.1 hypothetical protein POPTR_006G055600v3 [Populus trichocarpa])

HSP 1 Score: 425.2 bits (1092), Expect = 2.8e-115
Identity = 236/474 (49.79%), Positives = 315/474 (66.46%), Query Frame = 0

Query: 5   NFLLVSQSPKSHLNPTLHLASTLLSFGSKVTLLLTNSALKNISK-DQLPYGLSLSTFSDG 64
           + LLV+   + H+NP L  A  L++ G+ VT   +  A + +SK    P GLS + F DG
Sbjct: 9   HILLVTFPAQGHINPALQFAKRLVAIGAHVTFSTSMGAARRMSKTGTYPKGLSFAAFDDG 68

Query: 65  FDDGFTFS-DFPRWCFEFERLGRLALVNLLSSSLQQGLLPFTCIVHTLLIPWVARVAREL 124
            + GF  S D   +  E   +G  +L  L+++S + G  PFTC+V++ L+PWVA+VAREL
Sbjct: 69  SEHGFRPSDDIDHYFTELRLVGSKSLAELIAASSKNG-RPFTCVVYSNLVPWVAKVAREL 128

Query: 125 HVPTAVLWIQSAAAFDVYYYYFNGYSDVIWNGYKDEGSNSLLFNIWLPGLPLMNVLDLPS 184
           ++P+ +LW QS A  D++YYYFNGY D I      E  N   F++ LPGLP +   DLPS
Sbjct: 129 NLPSTLLWNQSPALLDIFYYYFNGYGDTI-----SENINDPTFSLKLPGLPPLGSRDLPS 188

Query: 185 FMVSDDYHGLILKSFQEKMQVLEEEENVSILVNSFDALEHDALSAIGKFNLIPVGPLVSL 244
           F    + H   +   +E ++VL+EE N  +LVN+FDALE +AL++IGKF L+ VGPL+  
Sbjct: 189 FFNPRNTHAFAIPVNREHIEVLDEETNPKVLVNTFDALECEALNSIGKFKLVGVGPLI-- 248

Query: 245 PLEFEVSTKQRSTSYFQDGQQAQADYIKWLNSKPDSSVVYIAFGSISKLSNKQTKEIVGA 304
           P  F        TS+  D  Q   D+I+WLNSKP+SSV+YIAFGSIS LS  Q +E+  A
Sbjct: 249 PSAFLDGEDPTDTSFGGDLFQGSKDHIEWLNSKPESSVIYIAFGSISALSKPQKEEMARA 308

Query: 305 LLECSYPFLWALRMD---DIQDENLSSYFDDELQAQGKIVPWCSQVEVLSHHSVGCFVTH 364
           LLE   PFLW +R D   + +++ LS    +EL+ QGKIVPWCSQVEVLSH S+GCFVTH
Sbjct: 309 LLETGRPFLWVIRADRGEEKEEDKLSC--KEELEKQGKIVPWCSQVEVLSHPSIGCFVTH 368

Query: 365 CGWNSTIESVAAGVPMVAWPLWADQATNAKMMEDVWEIGVRVKKSSDGEGVVEGKEIARC 424
           CGWNST ES+A+GVPMVA+P W DQ TNAKM+EDVW+ GVRV  SS+ EGVVEG+EI RC
Sbjct: 369 CGWNSTFESLASGVPMVAFPQWTDQLTNAKMVEDVWKTGVRV-TSSNKEGVVEGEEIERC 428

Query: 425 LRTVMDMEDYGKGRGKQLRINARKWQSLAMEAA--NGSSYMNLKAFVNKLSDQA 472
           L  VM     G  RG ++R NA+KW+ LA +++   GSSY NLKAFV++++  A
Sbjct: 429 LEVVMG----GGERGNEMRKNAKKWKELARQSSKEGGSSYNNLKAFVDEIAGVA 467

BLAST of Bhi01G000853 vs. NCBI nr
Match: XP_002263975.1 (PREDICTED: crocetin glucosyltransferase, chloroplastic [Vitis vinifera])

HSP 1 Score: 416.4 bits (1069), Expect = 1.3e-112
Identity = 229/468 (48.93%), Positives = 309/468 (66.03%), Query Frame = 0

Query: 7   LLVSQSPKSHLNPTLHLASTLLSFGSKVTLLLTNSALKNISKDQLPYGLSLSTFSDGFDD 66
           LLV+   + H+NP+L LA  L+  G+ VT + ++SA   +SK     GL   TFSDG+D 
Sbjct: 6   LLVTYPAQGHINPSLQLAKLLIRAGAHVTFVTSSSAGTRMSKSPTLDGLEFVTFSDGYDH 65

Query: 67  GFTFSD-FPRWCFEFERLGRLALVNLLSSSLQQGLLPFTCIVHTLLIPWVARVARELHVP 126
           GF   D    +  E ERLG  AL  L+ +   +G  PFTC+++ +LIPWVA VAR LH+P
Sbjct: 66  GFDHGDGLQNFMSELERLGSPALTKLIMARANEG-RPFTCLLYGMLIPWVAEVARSLHLP 125

Query: 127 TAVLWIQSAAAFDVYYYYFNGYSDVIWNGYKDEGSNSLLFNIWLPGLPLMNVLDLPSFMV 186
           +A++W Q AA FD+YYYYFNGY ++I  G K  GS+S   +I LPGLPL++  DLPSF+V
Sbjct: 126 SALVWSQPAAVFDIYYYYFNGYGELI--GNKGNGSSS---SIELPGLPLISSSDLPSFLV 185

Query: 187 SD--DYHGLILKSFQEKMQVLEEEENVSILVNSFDALEHDALSAIGKFNLIPVGPLVSLP 246
                 H  +LK  Q++++ L  E N  +LVNSFDALE +AL AI KF L+ +GPL  LP
Sbjct: 186 PSKVSAHNFVLKLHQKQLEQLNRESNPRVLVNSFDALESEALRAINKFKLMGIGPL--LP 245

Query: 247 LEFEVSTKQRSTSYFQDGQQAQADYIKWLNSKPDSSVVYIAFGSISKLSNKQTKEIVGAL 306
             F        TS+  D  +   DYI+WLNS  +SSV+Y++FGS+S LS +Q++EI   L
Sbjct: 246 SAFLDGKDPSDTSFGGDLFRGSKDYIQWLNSNAESSVIYVSFGSLSVLSKQQSEEIARGL 305

Query: 307 LECSYPFLWALRMDDIQDENLSSYFD--DELQAQGKIVPWCSQVEVLSHHSVGCFVTHCG 366
           L+   PFLW +R  + ++E         +EL+  G IVPWCSQVEVLSH S+GCFV+HCG
Sbjct: 306 LDSGRPFLWVIRAKENEEEEKEDKLSCVEELEQLGMIVPWCSQVEVLSHPSLGCFVSHCG 365

Query: 367 WNSTIESVAAGVPMVAWPLWADQATNAKMMEDVWEIGVRVKKSSDGEGVVEGKEIARCLR 426
           WNST+ES+A+GVP+VA+P W DQ TNAK++EDVW+ G+RV  +   EG+VEG EI +CL 
Sbjct: 366 WNSTLESLASGVPVVAFPQWTDQTTNAKLIEDVWKTGLRVMVNQ--EGIVEGGEIKKCLE 425

Query: 427 TVMDMEDYGKGRGKQLRINARKWQSLAMEAA--NGSSYMNLKAFVNKL 468
            VM     G  RG+++R NA+KW+ LA EA    GSS  NLK FV+++
Sbjct: 426 LVMG----GGERGQEVRSNAKKWKDLAREAVKDGGSSDKNLKNFVDEI 459

The following BLAST results are available for this feature:
Match Name     E-value    Identity (%)    Description
AT4G15550.1    3.8e-98    40.98           indole-3-acetate beta-D-glucosyltransferase
AT4G14090.1    4.2e-89    40.25           UDP-Glycosyltransferase superfamily protein
AT1G05530.1    3.9e-87    40.25           UDP-glucosyl transferase 75B2
AT1G05560.1    3.3e-86    38.97           UDP-glucosyltransferase 75B1
AT3G21560.1    1.8e-60    32.71           UDP-Glycosyltransferase superfamily protein

Match Name               E-value     Identity (%)    Description
sp|F8WKW0|UGT1_GARJA     1.2e-109    46.32           Crocetin glucosyltransferase, chloroplastic OS=Gardenia jasminoides OX=114476 GN...
sp|O23406|U75D1_ARATH    6.9e-97     40.98           UDP-glycosyltransferase 75D1 OS=Arabidopsis thaliana OX=3702 GN=UGT75D1 PE=2 SV=...
sp|Q9ZR25|5GT_VERHY      3.3e-91     41.84           Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase OS=Verbena hybrida OX=76714 ...
sp|Q0WW21|U75C1_ARATH    7.6e-88     40.25           UDP-glycosyltransferase 75C1 OS=Arabidopsis thaliana OX=3702 GN=UGT75C1 PE=2 SV=...
sp|Q9ZVY5|U75B2_ARATH    7.1e-86     40.25           UDP-glycosyltransferase 75B2 OS=Arabidopsis thaliana OX=3702 GN=UGT75B2 PE=2 SV=...

Match Name                        E-value     Identity (%)    Description
tr|A0A1S3AYM5|A0A1S3AYM5_CUCME    2.8e-212    80.08           crocetin glucosyltransferase, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103484...
tr|A0A0A0L890|A0A0A0L890_CUCSA    7.5e-149    75.48           Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G172390 PE=4 SV=1
tr|A0A2P4IMI9|A0A2P4IMI9_QUESU    4.9e-116    49.47           Glycosyltransferase OS=Quercus suber OX=58331 GN=CFP56_58469 PE=3 SV=1
tr|A0A2K1ZXB2|A0A2K1ZXB2_POPTR    1.9e-115    49.79           Glycosyltransferase OS=Populus trichocarpa OX=3694 GN=POPTR_006G055600v3 PE=3 SV...
tr|F6I4F8|F6I4F8_VITVI            8.6e-113    48.93           Glycosyltransferase OS=Vitis vinifera OX=29760 GN=VIT_05s0062g00710 PE=3 SV=1

Match Name        E-value     Identity (%)    Description
XP_008439390.2    4.2e-212    80.08           PREDICTED: crocetin glucosyltransferase, chloroplastic [Cucumis melo]
XP_004147672.1    1.1e-148    75.48           PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Cucumis sativus] >K...
XP_023885287.1    7.4e-116    49.47           crocetin glucosyltransferase, chloroplastic-like [Quercus suber] >POE69741.1 cro...
XP_002308970.2    2.8e-115    49.79           crocetin glucosyltransferase, chloroplastic [Populus trichocarpa] >PNT29916.1 hy...
XP_002263975.1    1.3e-112    48.93           PREDICTED: crocetin glucosyltransferase, chloroplastic [Vitis vinifera]
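
The Swiss-Prot and TrEMBL match names above follow the UniProt identifier layout `db|accession|entry_name`, where `sp` denotes Swiss-Prot and `tr` denotes TrEMBL. The following sketch splits such a name into its parts. The `UniProtId` type and the `parse_uniprot_id` helper are hypothetical names introduced for illustration only, and reading the entry-name suffix as a species mnemonic is an assumption based on the usual UniProt naming scheme.

```python
from typing import NamedTuple


class UniProtId(NamedTuple):
    database: str    # "Swiss-Prot", "TrEMBL", or the raw prefix if unknown
    accession: str   # e.g. "A0A0A0L890"
    entry_name: str  # e.g. "A0A0A0L890_CUCSA"

    @property
    def species_code(self) -> str:
        """Suffix after the last underscore (assumed species mnemonic)."""
        return self.entry_name.rsplit("_", 1)[-1]


_DATABASES = {"sp": "Swiss-Prot", "tr": "TrEMBL"}


def parse_uniprot_id(match_name: str) -> UniProtId:
    """Split 'db|accession|entry_name'; raise ValueError on other shapes."""
    parts = match_name.strip().split("|")
    if len(parts) != 3:
        raise ValueError(f"not a UniProt-style identifier: {match_name!r}")
    prefix, accession, entry_name = parts
    return UniProtId(_DATABASES.get(prefix, prefix), accession, entry_name)


hit = parse_uniprot_id("tr|A0A0A0L890|A0A0A0L890_CUCSA")
print(hit.database, hit.accession, hit.species_code)
```

Identifiers from the other two tables, such as `AT4G15550.1` or `XP_008439390.2`, do not use this layout and are rejected by the helper with a `ValueError`.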
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term          Definition
GO:0016758    transferase activity, transferring hexosyl groups
Vocabulary: Biological Process
Term          Definition
GO:0008152    metabolic process
Vocabulary: INTERPRO
Term          Definition
IPR035595     UDP_glycos_trans_CS
IPR002213     UDP_glucos_trans
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0008152        metabolic process
biological_process    GO:0009813        flavonoid biosynthetic process
biological_process    GO:0052696        flavonoid glucuronidation
cellular_component    GO:0005575        cellular_component
cellular_component    GO:0043231        intracellular membrane-bounded organelle
molecular_function    GO:0035251        UDP-glucosyltransferase activity
molecular_function    GO:0016758        transferase activity, transferring hexosyl groups
molecular_function    GO:0080043        quercetin 3-O-glucosyltransferase activity
molecular_function    GO:0080044        quercetin 7-O-glucosyltransferase activity
molecular_function    GO:0016740        transferase activity

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Bhi01M000853    Bhi01M000853    mRNA


Analysis Name: InterPro Annotations of wax gourd
Date Performed: 2019-11-17
IPR Term   IPR Description                                 Source       Source Term         Source Description                              Alignment
None       No IPR available                                GENE3D       G3DSA:3.40.50.2000  -                                               coord: 451..460, e-value: 7.5E-122, score: 409.7
                                                                                                                                            coord: 17..261, e-value: 7.5E-122, score: 409.7
None       No IPR available                                GENE3D       G3DSA:3.40.50.2000  -                                               coord: 262..450, e-value: 7.5E-122, score: 409.7
None       No IPR available                                PANTHER      PTHR11926           GLUCOSYL/GLUCURONOSYL TRANSFERASES              coord: 4..468
None       No IPR available                                PANTHER      PTHR11926:SF767     SUBFAMILY NOT NAMED                             coord: 4..468
None       No IPR available                                SUPERFAMILY  SSF53756            UDP-Glycosyltransferase/glycogen phosphorylase  coord: 2..465
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase        PFAM         PF00201             UDPGT                                           coord: 267..401, e-value: 2.0E-25, score: 89.5
IPR035595  UDP-glycosyltransferase family, conserved site  PROSITE      PS00375             UDPGT                                           coord: 341..384
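
The `coord` ranges above can be compared directly, for example to confirm that the PROSITE conserved site (PS00375, 341..384) falls inside the Pfam UDPGT domain (PF00201, 267..401). The sketch below is a minimal illustration under the assumption that coordinates are 1-based, inclusive residue positions on the predicted protein. The helper names `parse_coord`, `span_length` and `contains` are hypothetical.

```python
import re

# Matches the "start..end" part of strings such as "coord: 267..401".
_COORD = re.compile(r"(\d+)\.\.(\d+)")


def parse_coord(text: str) -> tuple[int, int]:
    """Extract (start, end) from a 'coord: start..end' string."""
    match = _COORD.search(text)
    if match is None:
        raise ValueError(f"no coordinate range in {text!r}")
    return int(match.group(1)), int(match.group(2))


def span_length(span: tuple[int, int]) -> int:
    """Number of residues covered, assuming inclusive coordinates."""
    return span[1] - span[0] + 1


def contains(outer: tuple[int, int], inner: tuple[int, int]) -> bool:
    """True if inner lies entirely within outer."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


pfam = parse_coord("coord: 267..401")      # PF00201 (UDPGT)
prosite = parse_coord("coord: 341..384")   # PS00375 (UDPGT)
print(contains(pfam, prosite), span_length(pfam), span_length(prosite))
```

The same helpers apply to the GENE3D, PANTHER and SUPERFAMILY rows, since each alignment entry carries a range in the same `start..end` form.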