Cla97C05G088810 (gene) Watermelon (97103) v2

Name: Cla97C05G088810
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: Glycosyltransferase
Location: Cla97Chr05 : 6926922 .. 6929126 (+)
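
The span covered by the location can be computed from its two coordinates. The sketch below assumes the coordinates are 1-based and inclusive, as in GFF-style annotations; the page does not state this explicitly, and the function name `feature_length` is only illustrative.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive start/end coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of this feature.
print(feature_length(6926922, 6929126))  # 2205
```

Under that assumption the gene spans 2205 bp on the plus strand of Cla97Chr05.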
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGCATGGGAATTTCCTTCTAGTTTCCCAAAGCCCAAAGAGCCATCTCAACCCTACTCTCCATTTGGCCTCCACCCTCCTCTACCTTGGCTCAAAAGTCACTCTTCTTCTCACCAACTATGCCCTCAAAAACATTTCAAAACACCAACTCCCCTATGGCTTATCTCTCTCCACTTTCTCCGATGGCTTTGACGACGGCTTTACCTCCTCCGACTTCCCACGCTGGCGGGTTGAATTTGAACGCCTCGGTCGCCTTGCCCTCGTCGACCTCCTCTCCTCACAACAACAAAGCCTTCTTTCCTTCACTTGCATTGTCCACACCCTCCTCATCCCTTGGGTCACTCGAGTTGCGCTTGAGCTTCACGTGCCGACTGCGATTTTGTGGATTCAATCGGTTGCTGCCTTTGATGTGTATTACTATTACTTCAATGGCTATAGTGATGTGATTCGGAATGGTTATAAAGATGATGGCTCTAATTCTTTGTTATTTAACATTTGGCTTCCGGGTTTGCCATTGATGAATGTTTTGGACCTTCCAAGCTTTGTGGTTTCTGATGATTATCATGGGCTTATTCTCAAGTCATTTGAAGAGAAGATGCAGGTTCTTGAGGAGGAGGAGAATGTGTCAATCCTTGTTAACTCGTTCGATGCATTGGAACATGATGCCTTTACAGCAATCGGGAAGTTTAACTTGATCCCAATTGGACCTTTGGTTTCACTTCCACCTAGATTTGAAGTTTCAACCAAACAAAGAACCACTTCATATCTTCAAGGTGGTATGAAATATACAGTCTTGTTTAGGCTTCTTCTTTTAAAGGTTTTTGTTTTATTTTTGTACCAGAATTTCAAGTTTTTTTAATTTTGATGGTTAAACTTTCAAAACATCATATTTAAGTCACTAAACGTTAAGTTTCACCTCAAGTTAGTTCATGAACTTTTAAAAATTTACTTGTCCTTGAATGAAAAAGAAAAAATCGTTTAGGTCTTACCATAAAAATGAATTATGCTTTGAGAGTATGAATAATAATGATTAACTTGAACATGTACTGAGAAGCCACAATACATGTCAATTGTCAACTTACAAAATGAAATATTCTAGTAGTTGATATATACCTACTTATGTTCACACATCACACTAAATTTTTTATTGACATAATTTTTAAATGTTGTTGAACACTCCATCATTCTAAAACAATGCAACATAGCTTTGTAATGAATGATAGTAGTACAAGAAATTTGTTCTACTAGAGTGATTATTTGCAGAAAAATTGGAGACAAAATAATAGTGATGACTATAAGATGGTTTACTGATATATAAGTTCAATGGCTAAGACATTTTAAATGTCCATATATCTAAACAGTTATACTTTTCCCACCCAAAAAAAAGTTGGTAAAGGTATCCTTGTAAGTTTGGACATTAAAATAAACATATTACCTGTTAATATATCAAATTTAAATAATCTAGAGAGTTCAAAAGCTAAAATGGCTCAATTTTAAGCCTTTTCTAATTTTACCAGAGACAATGGTTGTCTTTACTTATACATGATGATCATTTCATTCTTGTTCGCAAGGTCAACAGGCTCAAGAGGATTATATCAAATGGCTTAACTCCAAATTTGATTCGTCTGTGGTCTACATAGCATTTGGGAGCATTTCAAAGCTGTCAAATAAACAAACAAAAGAGATCGTTGGTGCATTATTAGAATGCAGTTACCCATTCTTGTGGGTCCTAAGTATGGATGACATCCAAGATGAGAATTTAAGCTTATATTTTGACGATGAACTACAAGCTCAAGGGAAGATAGTGCCATGGTGCTCACAAGTAGAGGTCTTGAGCCACCGCTCCGTGGGTTGCTTTGTAACACATTGCGGATGGAACTCAACGATCGAGAGCGTGACAGCTGGAGTGCCAACGGTGGCATGGCCATTGTGGGCGGACCAAGCCACCAACGCCAAGTTGATGCAGGATGTATGGGAGCTTGGTGTGAGAGTGAAGAA
GAGTAGTGATGGTGAAGGATTGGTGGAAGGGAAGGAGATTGCAAGGTGCTTGAGAATGGTTATGGATATGGAAGACCACGGCAGAGGAAAGCAACTGAGAATTAATGCTAGGAAGTGGCAGCTCTTAGCAATGGAGGCTGCAAATGGTTCTTCTTATATGAATATTAAGGCTTTTGTAAATAAAGTTTGTTATCAAGCAAAGTGA

mRNA sequence

ATGGAGCATGGGAATTTCCTTCTAGTTTCCCAAAGCCCAAAGAGCCATCTCAACCCTACTCTCCATTTGGCCTCCACCCTCCTCTACCTTGGCTCAAAAGTCACTCTTCTTCTCACCAACTATGCCCTCAAAAACATTTCAAAACACCAACTCCCCTATGGCTTATCTCTCTCCACTTTCTCCGATGGCTTTGACGACGGCTTTACCTCCTCCGACTTCCCACGCTGGCGGGTTGAATTTGAACGCCTCGGTCGCCTTGCCCTCGTCGACCTCCTCTCCTCACAACAACAAAGCCTTCTTTCCTTCACTTGCATTGTCCACACCCTCCTCATCCCTTGGGTCACTCGAGTTGCGCTTGAGCTTCACGTGCCGACTGCGATTTTGTGGATTCAATCGGTTGCTGCCTTTGATGTGTATTACTATTACTTCAATGGCTATAGTGATGTGATTCGGAATGGTTATAAAGATGATGGCTCTAATTCTTTGTTATTTAACATTTGGCTTCCGGGTTTGCCATTGATGAATGTTTTGGACCTTCCAAGCTTTGTGGTTTCTGATGATTATCATGGGCTTATTCTCAAGTCATTTGAAGAGAAGATGCAGGTTCTTGAGGAGGAGGAGAATGTGTCAATCCTTGTTAACTCGTTCGATGCATTGGAACATGATGCCTTTACAGCAATCGGGAAGTTTAACTTGATCCCAATTGGACCTTTGGTTTCACTTCCACCTAGATTTGAAGTTTCAACCAAACAAAGAACCACTTCATATCTTCAAGGTGGTCAACAGGCTCAAGAGGATTATATCAAATGGCTTAACTCCAAATTTGATTCGTCTGTGGTCTACATAGCATTTGGGAGCATTTCAAAGCTGTCAAATAAACAAACAAAAGAGATCGTTGGTGCATTATTAGAATGCAGTTACCCATTCTTGTGGGTCCTAAGTATGGATGACATCCAAGATGAGAATTTAAGCTTATATTTTGACGATGAACTACAAGCTCAAGGGAAGATAGTGCCATGGTGCTCACAAGTAGAGGTCTTGAGCCACCGCTCCGTGGGTTGCTTTGTAACACATTGCGGATGGAACTCAACGATCGAGAGCGTGACAGCTGGAGTGCCAACGGTGGCATGGCCATTGTGGGCGGACCAAGCCACCAACGCCAAGTTGATGCAGGATGTATGGGAGCTTGGTGTGAGAGTGAAGAAGAGTAGTGATGGTGAAGGATTGGTGGAAGGGAAGGAGATTGCAAGGTGCTTGAGAATGGTTATGGATATGGAAGACCACGGCAGAGGAAAGCAACTGAGAATTAATGCTAGGAAGTGGCAGCTCTTAGCAATGGAGGCTGCAAATGGTTCTTCTTATATGAATATTAAGGCTTTTGTAAATAAAGTTTGTTATCAAGCAAAGTGA
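
The gene sequence (with intron) and the mRNA sequence share their beginning and end, so the removed segment can be located by comparing the two. The following is a minimal sketch that assumes exactly one intron and no UTR differences; `locate_single_intron` is a hypothetical helper, and the short sequences in the example are toy data, not this gene.

```python
def locate_single_intron(gene: str, mrna: str) -> tuple[int, int]:
    """Return the 0-based, end-exclusive span of the one gene segment absent from mrna.

    Assumes the mRNA equals the gene with a single contiguous segment removed.
    If several positions fit (repetitive boundary), the leftmost one is returned.
    """
    intron_len = len(gene) - len(mrna)
    if intron_len <= 0:
        raise ValueError("gene must be longer than mRNA")
    for start in range(len(mrna) + 1):
        # Bases before the intron and after it must both match the mRNA.
        if gene[:start] == mrna[:start] and gene[start + intron_len:] == mrna[start:]:
            return start, start + intron_len
    raise ValueError("mRNA is not the gene minus one contiguous segment")


# Toy example: exon "ATGC", intron "GTAAGTTTCAG", exon "TTGA".
toy_gene = "ATGC" + "GTAAGTTTCAG" + "TTGA"
toy_mrna = "ATGC" + "TTGA"
start, end = locate_single_intron(toy_gene, toy_mrna)
print(start, end, toy_gene[start:end])
```

For real data, paste the gene and mRNA sequences as single uninterrupted strings before calling the function; a stray line break inside a sequence would make the comparison fail.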

Coding sequence (CDS)

ATGGAGCATGGGAATTTCCTTCTAGTTTCCCAAAGCCCAAAGAGCCATCTCAACCCTACTCTCCATTTGGCCTCCACCCTCCTCTACCTTGGCTCAAAAGTCACTCTTCTTCTCACCAACTATGCCCTCAAAAACATTTCAAAACACCAACTCCCCTATGGCTTATCTCTCTCCACTTTCTCCGATGGCTTTGACGACGGCTTTACCTCCTCCGACTTCCCACGCTGGCGGGTTGAATTTGAACGCCTCGGTCGCCTTGCCCTCGTCGACCTCCTCTCCTCACAACAACAAAGCCTTCTTTCCTTCACTTGCATTGTCCACACCCTCCTCATCCCTTGGGTCACTCGAGTTGCGCTTGAGCTTCACGTGCCGACTGCGATTTTGTGGATTCAATCGGTTGCTGCCTTTGATGTGTATTACTATTACTTCAATGGCTATAGTGATGTGATTCGGAATGGTTATAAAGATGATGGCTCTAATTCTTTGTTATTTAACATTTGGCTTCCGGGTTTGCCATTGATGAATGTTTTGGACCTTCCAAGCTTTGTGGTTTCTGATGATTATCATGGGCTTATTCTCAAGTCATTTGAAGAGAAGATGCAGGTTCTTGAGGAGGAGGAGAATGTGTCAATCCTTGTTAACTCGTTCGATGCATTGGAACATGATGCCTTTACAGCAATCGGGAAGTTTAACTTGATCCCAATTGGACCTTTGGTTTCACTTCCACCTAGATTTGAAGTTTCAACCAAACAAAGAACCACTTCATATCTTCAAGGTGGTCAACAGGCTCAAGAGGATTATATCAAATGGCTTAACTCCAAATTTGATTCGTCTGTGGTCTACATAGCATTTGGGAGCATTTCAAAGCTGTCAAATAAACAAACAAAAGAGATCGTTGGTGCATTATTAGAATGCAGTTACCCATTCTTGTGGGTCCTAAGTATGGATGACATCCAAGATGAGAATTTAAGCTTATATTTTGACGATGAACTACAAGCTCAAGGGAAGATAGTGCCATGGTGCTCACAAGTAGAGGTCTTGAGCCACCGCTCCGTGGGTTGCTTTGTAACACATTGCGGATGGAACTCAACGATCGAGAGCGTGACAGCTGGAGTGCCAACGGTGGCATGGCCATTGTGGGCGGACCAAGCCACCAACGCCAAGTTGATGCAGGATGTATGGGAGCTTGGTGTGAGAGTGAAGAAGAGTAGTGATGGTGAAGGATTGGTGGAAGGGAAGGAGATTGCAAGGTGCTTGAGAATGGTTATGGATATGGAAGACCACGGCAGAGGAAAGCAACTGAGAATTAATGCTAGGAAGTGGCAGCTCTTAGCAATGGAGGCTGCAAATGGTTCTTCTTATATGAATATTAAGGCTTTTGTAAATAAAGTTTGTTATCAAGCAAAGTGA

Protein sequence

MEHGNFLLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISKHQLPYGLSLSTFSDGFDDGFTSSDFPRWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLIPWVTRVALELHVPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMNVLDLPSFVVSDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPIGPLVSLPPRFEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQTKEIVGALLECSYPFLWVLSMDDIQDENLSLYFDDELQAQGKIVPWCSQVEVLSHRSVGCFVTHCGWNSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSDGEGLVEGKEIARCLRMVMDMEDHGRGKQLRINARKWQLLAMEAANGSSYMNIKAFVNKVCYQAK
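
The protein sequence appears to be the translation of the CDS under the standard genetic code (the CDS starts with ATG and ends with the stop codon TGA). A minimal sketch of that translation follows; the codon table is built from the standard code in TCAG order, and the function name `translate` is an illustrative choice rather than anything used by the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS listed on this page.
print(translate("ATGGAGCATGGGAATTTCCTTCTAGTTTCC"))  # MEHGNFLLVS
```

The output matches the first ten residues of the protein sequence shown on this page.
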
BLAST of Cla97C05G088810 vs. NCBI nr
Match: XP_008439390.2 (PREDICTED: crocetin glucosyltransferase, chloroplastic [Cucumis melo])

HSP 1 Score: 711.1 bits (1834), Expect = 2.5e-201
Identity = 366/471 (77.71%), Positives = 396/471 (84.08%), Query Frame = 0

Query: 1   MEHGNFLLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISKHQLPYGLSLSTF 60
           MEHGNFLLVSQSP SHLNPTLHLASTLL LGSKVTLL+TN+ALKNISK QLP GLSLSTF
Sbjct: 1   MEHGNFLLVSQSPTSHLNPTLHLASTLLSLGSKVTLLITNHALKNISKDQLPSGLSLSTF 60

Query: 61  SDGFDDGFTSSDFPRWRVEFERLGRLALVDLL-SSQQQSLLSFTCIVHTLLIPWVTRVAL 120
           S  FD+GFT SDF  W VEFERLGRLALVDLL SS QQ LL  TCIV+TLLIPWV +VA 
Sbjct: 61  SYSFDNGFTYSDFQLWCVEFERLGRLALVDLLSSSSQQGLLPITCIVNTLLIPWVAQVAR 120

Query: 121 ELHVPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMNVLDL 180
           E HV TA+LWIQSVA FDVYYYYFNGY+DVIRNGYK+D SN L  NIWLPGLPLMN    
Sbjct: 121 EFHVSTAVLWIQSVAVFDVYYYYFNGYNDVIRNGYKEDDSNLLSSNIWLPGLPLMN---- 180

Query: 181 PSFVVSDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPIGPLV 240
                          SFEEKMQ+ +EE+NV ILVNSFDALEHDA +AIG FNLIPIGPLV
Sbjct: 181 ---------------SFEEKMQIFKEEDNVPILVNSFDALEHDALSAIGTFNLIPIGPLV 240

Query: 241 SLPPRFEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQTKEIV 300
           SLP   EVSTKQ++ S  Q GQQA+ED IKWLNSK DSSVVYIAFGSISKLS +QTKEIV
Sbjct: 241 SLPLGCEVSTKQQSISCFQDGQQAREDCIKWLNSKPDSSVVYIAFGSISKLSKEQTKEIV 300

Query: 301 GALLECSYPFLWVLSMDDIQDENLSLYFDDELQAQGKIVPWCSQVEVLSHRSVGCFVTHC 360
           GA LECSYPFLW L MDDI+DENLS YF+ ELQAQGKIVPWCSQVE+LSHRSVGCFVTHC
Sbjct: 301 GAFLECSYPFLWSLRMDDIRDENLSSYFNVELQAQGKIVPWCSQVEILSHRSVGCFVTHC 360

Query: 361 GWNSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSDGEGLVEGKEIARCL 420
           GWN TIE V  GVPTVAW LWADQATNAK+M+DVW++GVRVKKSSDGEG+VE KEI RCL
Sbjct: 361 GWNFTIECVAVGVPTVAWLLWADQATNAKMMEDVWKIGVRVKKSSDGEGMVERKEITRCL 420

Query: 421 RMVMDMED--HGRGKQLRINARKWQLLAMEAANGSSYMNIKAFVNKVCYQA 469
           RM+MDMED   G+GKQLRINA KWQ LAMEAANGSS++N+KAFVNKVC +A
Sbjct: 421 RMIMDMEDDSKGKGKQLRINATKWQRLAMEAANGSSFVNLKAFVNKVCDEA 452
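
The percentages in an HSP summary line are the match counts divided by the second number of each pair. The sketch below recomputes them from such a line; the parsing pattern and the function name `hsp_percentages` are assumptions made for illustration and are not part of any BLAST tool.

```python
import re


def hsp_percentages(stat_line: str) -> dict[str, float]:
    """Recompute percentages from 'Label = matches/length' pairs in an HSP line."""
    result = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", stat_line):
        result[label] = round(100 * int(matches) / int(length), 2)
    return result


line = "Identity = 366/471 (77.71%), Positives = 396/471 (84.08%), Query Frame = 0"
print(hsp_percentages(line))  # {'Identity': 77.71, 'Positives': 84.08}
```

The recomputed values agree with the percentages reported for the Cucumis melo match.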

BLAST of Cla97C05G088810 vs. NCBI nr
Match: XP_004147672.1 (PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Cucumis sativus] >KGN57234.1 hypothetical protein Csa_3G172390 [Cucumis sativus])

HSP 1 Score: 513.1 bits (1320), Expect = 1.0e-141
Identity = 272/367 (74.11%), Positives = 292/367 (79.56%), Query Frame = 0

Query: 1   MEHGNFLLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISKHQLPYGLSLSTF 60
           M+HGNFLLVSQSP SHLNPTLH ASTLL LGSKVTLLLTN+ALKNIS+ QLP GLSLSTF
Sbjct: 1   MKHGNFLLVSQSPTSHLNPTLHFASTLLSLGSKVTLLLTNHALKNISEDQLPSGLSLSTF 60

Query: 61  SDGFDDGFTSSDFPRWRVEFERLGRLALVDLL-SSQQQSLLSFTCIVHTLLIPWVTRVAL 120
           SDGFD+GFT SD   W VEFERLGR ALV+LL SS +Q LL  TCIV+TLLIPWV +VA 
Sbjct: 61  SDGFDNGFTYSDLQLWFVEFERLGRAALVNLLSSSSKQGLLPITCIVNTLLIPWVAQVAR 120

Query: 121 ELHVPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMNVLDL 180
           E HV TAILW QSVA FDVYYYYFNGYS VIRNGYK+D SNSL FNI LPGLPLMNVLDL
Sbjct: 121 EFHVSTAILWTQSVAVFDVYYYYFNGYSGVIRNGYKEDDSNSLSFNISLPGLPLMNVLDL 180

Query: 181 PSFVVSDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPIGPLV 240
           PSF+VSDD+HGLI+KSFEEK+Q+L+EE+NV ILVNSFDALEHDA +AIG FNLIPIGP V
Sbjct: 181 PSFMVSDDHHGLIIKSFEEKIQILKEEDNVPILVNSFDALEHDALSAIGTFNLIPIGPSV 240

Query: 241 SLPPRFEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQTKEIV 300
            LP   E   KQR  SY Q GQQAQEDYIKWLNSK DSSVVYIAFGS SKLS +QTKE+V
Sbjct: 241 LLPLGCE---KQRNISYFQDGQQAQEDYIKWLNSKPDSSVVYIAFGSFSKLSKEQTKEMV 300

Query: 301 GALLECSYPFLWVLSMDDIQDENLSLYFDDELQAQGKIVPWCSQVEVLSHRSVGCFVTHC 360
           GALLECSY                               PWCSQVEVLSHR+VGCFVTHC
Sbjct: 301 GALLECSY-------------------------------PWCSQVEVLSHRAVGCFVTHC 333

Query: 361 GWNSTIE 367
           GWNSTIE
Sbjct: 361 GWNSTIE 333

BLAST of Cla97C05G088810 vs. NCBI nr
Match: XP_023885287.1 (crocetin glucosyltransferase, chloroplastic-like [Quercus suber] >POE69741.1 crocetin glucosyltransferase, chloroplastic [Quercus suber])

HSP 1 Score: 408.3 bits (1048), Expect = 3.5e-110
Identity = 219/463 (47.30%), Positives = 302/463 (65.23%), Query Frame = 0

Query: 5   NFLLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISKHQLPYGLSLSTFSDGF 64
           + LLV+   + H+NP+L  A  L++LG++VT  +T  A + + K   P GLS  TFSDG+
Sbjct: 3   HILLVTFPAQGHVNPSLQFAKRLIHLGAQVTFAITISAHRRMIKSPPPDGLSFVTFSDGY 62

Query: 65  DDGFTSSDFPRWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLIPWVTRVALELHVP 124
           DDGF+  D      +F+  G   L  L+ S        TC+V+TLL+PW   VA E+H+P
Sbjct: 63  DDGFSLDDAQNHFDQFKCNGSKTLTHLIVSSANQGRPITCLVYTLLLPWAADVAREVHLP 122

Query: 125 TAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMNVLDLPSFVV 184
           + +LWIQ     D+YYYYFNG++DVIRN   D  S+S    I LPGLPL+   DLPSF++
Sbjct: 123 STLLWIQPAMVLDIYYYYFNGFADVIRNDNNDYPSSS----IKLPGLPLLTSRDLPSFLL 182

Query: 185 SDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPIGPLVSLPPR 244
           + + H   L   +   + LE+E N  +LVN+FDALE +A  AI +FNL+ +GPL+     
Sbjct: 183 ASNTHTFALPIIQAHFEALEKENNPRVLVNTFDALEPEALKAIERFNLVAVGPLLPSDKS 242

Query: 245 FEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQTKEIVGALLE 304
           F              G    +DYI+WLNSK +SSV+Y++FGS++ L  +Q +EI   LL 
Sbjct: 243 F--------------GGDVSKDYIEWLNSKPESSVIYVSFGSLAVLMKQQMEEIARGLLG 302

Query: 305 CSYPFLWVL-SMDDIQDENLSLYFDDELQAQGKIVPWCSQVEVLSHRSVGCFVTHCGWNS 364
           C  PFLWV+ + ++ ++E LS    + L+  G IVPWCSQVEVLSH S+GCFVTHCGWNS
Sbjct: 303 CGRPFLWVIRAKENGEEEKLSC--REVLEQMGMIVPWCSQVEVLSHPSLGCFVTHCGWNS 362

Query: 365 TIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSDGEGLVEGKEIARCLRMVM 424
           T+ES+ +GVP VA+P W+DQ TNAKL++DVW+ GVR+  + D  G+VEG EI RCL +V+
Sbjct: 363 TLESLVSGVPMVAFPQWSDQVTNAKLIEDVWKTGVRMIVNKD--GIVEGDEIKRCLELVV 422

Query: 425 DMEDHGRGKQLRINARKWQLLAMEAAN--GSSYMNIKAFVNKV 465
              D  RG+ +R NA+KW+ LAMEAAN  GSSY N+K FV+++
Sbjct: 423 G--DGERGEAIRRNAKKWKELAMEAANEGGSSYNNLKDFVDEI 441

BLAST of Cla97C05G088810 vs. NCBI nr
Match: XP_018844047.1 (PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Juglans regia])

HSP 1 Score: 400.2 bits (1027), Expect = 9.6e-108
Identity = 217/465 (46.67%), Positives = 305/465 (65.59%), Query Frame = 0

Query: 7   LLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISKHQLPYGLSLSTFSDGFDD 66
           LLV+   + H+NP L  A  L+ LG+ VTL  +  A + ++K  +P GLS +TFSDG+DD
Sbjct: 8   LLVTFPAQGHINPGLQFAKRLIRLGAHVTLATSVSAYRRMTKTPIPQGLSFATFSDGYDD 67

Query: 67  GFT--SSDFPRWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLIPWVTRVALELHVP 126
           GF   + D   +    +R G   L DL+ S       F  +V+TLL+PW   VA ELH+P
Sbjct: 68  GFKPGTDDAEHYMSAIKRSGSKTLTDLIVSSTNEGRPFQYLVYTLLLPWAGNVAHELHLP 127

Query: 127 TAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMNVLDLPSFVV 186
           +A+LWIQ     D+YYYYFNGY D IR    D       +++ LPGLPL+   DLPSF++
Sbjct: 128 SALLWIQPATVLDIYYYYFNGYGDDIRKKGTDPS-----YSLQLPGLPLLYGRDLPSFLL 187

Query: 187 SDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPIGPLVSLPPR 246
             + +   L SF+E+++ LE+E N ++LVN+FDALE +A   I KFNL  +GPL+   P 
Sbjct: 188 DSNTYTFALPSFQEQIEALEKESNPTVLVNTFDALEPEALRVIEKFNLTAVGPLI---PS 247

Query: 247 FEVSTKQRTTSYLQGG--QQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQTKEIVGAL 306
             +  K  +     G   Q ++E YI+WLNSK +SSV+Y++FGSIS L+ +Q +E+   L
Sbjct: 248 AFLDGKDPSDKAFGGDLFQGSKEYYIEWLNSKPNSSVIYVSFGSISTLAKQQMEEMARGL 307

Query: 307 LECSYPFLWVL-SMDDIQDENLSLYFDDELQAQGKIVPWCSQVEVLSHRSVGCFVTHCGW 366
           L+C  PFLWV+ + ++ ++E LS    +EL+ +G IVPWCSQVEVLSH S+ CFVTHCGW
Sbjct: 308 LDCGRPFLWVIRAKENGEEERLSC--REELEQKGMIVPWCSQVEVLSHPSLACFVTHCGW 367

Query: 367 NSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSDGEGLVEGKEIARCLRM 426
           NS++ES+ +GVP VA+P W DQ TNAKL++DVW+ G+RV  + D  G+VE  EI RCL +
Sbjct: 368 NSSLESLVSGVPVVAFPQWTDQGTNAKLIEDVWKTGLRVTANKD--GIVESDEIKRCLEL 427

Query: 427 VMDMEDHGRGKQLRINARKWQLLAMEAA--NGSSYMNIKAFVNKV 465
           V    +  RG+++R NA+KW+ LA EAA   GSS+ N+KAFV ++
Sbjct: 428 VAGGGE--RGEEMRRNAKKWKELAREAAKEGGSSHKNLKAFVEEI 458

BLAST of Cla97C05G088810 vs. NCBI nr
Match: XP_002308970.2 (crocetin glucosyltransferase, chloroplastic [Populus trichocarpa] >PNT29916.1 hypothetical protein POPTR_006G055600v3 [Populus trichocarpa])

HSP 1 Score: 399.1 bits (1024), Expect = 2.1e-107
Identity = 218/465 (46.88%), Positives = 303/465 (65.16%), Query Frame = 0

Query: 5   NFLLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISK-HQLPYGLSLSTFSDG 64
           + LLV+   + H+NP L  A  L+ +G+ VT   +  A + +SK    P GLS + F DG
Sbjct: 9   HILLVTFPAQGHINPALQFAKRLVAIGAHVTFSTSMGAARRMSKTGTYPKGLSFAAFDDG 68

Query: 65  FDDGF-TSSDFPRWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLIPWVTRVALELH 124
            + GF  S D   +  E   +G  +L +L+++  ++   FTC+V++ L+PWV +VA EL+
Sbjct: 69  SEHGFRPSDDIDHYFTELRLVGSKSLAELIAASSKNGRPFTCVVYSNLVPWVAKVARELN 128

Query: 125 VPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMNVLDLPSF 184
           +P+ +LW QS A  D++YYYFNGY D I      +  N   F++ LPGLP +   DLPSF
Sbjct: 129 LPSTLLWNQSPALLDIFYYYFNGYGDTI-----SENINDPTFSLKLPGLPPLGSRDLPSF 188

Query: 185 VVSDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPIGPLVSLP 244
               + H   +    E ++VL+EE N  +LVN+FDALE +A  +IGKF L+ +GPL+  P
Sbjct: 189 FNPRNTHAFAIPVNREHIEVLDEETNPKVLVNTFDALECEALNSIGKFKLVGVGPLI--P 248

Query: 245 PRFEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQTKEIVGAL 304
             F        TS+     Q  +D+I+WLNSK +SSV+YIAFGSIS LS  Q +E+  AL
Sbjct: 249 SAFLDGEDPTDTSFGGDLFQGSKDHIEWLNSKPESSVIYIAFGSISALSKPQKEEMARAL 308

Query: 305 LECSYPFLWVLSMDDIQD-ENLSLYFDDELQAQGKIVPWCSQVEVLSHRSVGCFVTHCGW 364
           LE   PFLWV+  D  ++ E   L   +EL+ QGKIVPWCSQVEVLSH S+GCFVTHCGW
Sbjct: 309 LETGRPFLWVIRADRGEEKEEDKLSCKEELEKQGKIVPWCSQVEVLSHPSIGCFVTHCGW 368

Query: 365 NSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSDGEGLVEGKEIARCLRM 424
           NST ES+ +GVP VA+P W DQ TNAK+++DVW+ GVRV  SS+ EG+VEG+EI RCL +
Sbjct: 369 NSTFESLASGVPMVAFPQWTDQLTNAKMVEDVWKTGVRV-TSSNKEGVVEGEEIERCLEV 428

Query: 425 VMDMEDHGRGKQLRINARKWQLLAMEAA--NGSSYMNIKAFVNKV 465
           VM   +  RG ++R NA+KW+ LA +++   GSSY N+KAFV+++
Sbjct: 429 VMGGGE--RGNEMRKNAKKWKELARQSSKEGGSSYNNLKAFVDEI 463

BLAST of Cla97C05G088810 vs. TrEMBL
Match: tr|A0A1S3AYM5|A0A1S3AYM5_CUCME (crocetin glucosyltransferase, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103484205 PE=4 SV=1)

HSP 1 Score: 711.1 bits (1834), Expect = 1.7e-201
Identity = 366/471 (77.71%), Positives = 396/471 (84.08%), Query Frame = 0

Query: 1   MEHGNFLLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISKHQLPYGLSLSTF 60
           MEHGNFLLVSQSP SHLNPTLHLASTLL LGSKVTLL+TN+ALKNISK QLP GLSLSTF
Sbjct: 1   MEHGNFLLVSQSPTSHLNPTLHLASTLLSLGSKVTLLITNHALKNISKDQLPSGLSLSTF 60

Query: 61  SDGFDDGFTSSDFPRWRVEFERLGRLALVDLL-SSQQQSLLSFTCIVHTLLIPWVTRVAL 120
           S  FD+GFT SDF  W VEFERLGRLALVDLL SS QQ LL  TCIV+TLLIPWV +VA 
Sbjct: 61  SYSFDNGFTYSDFQLWCVEFERLGRLALVDLLSSSSQQGLLPITCIVNTLLIPWVAQVAR 120

Query: 121 ELHVPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMNVLDL 180
           E HV TA+LWIQSVA FDVYYYYFNGY+DVIRNGYK+D SN L  NIWLPGLPLMN    
Sbjct: 121 EFHVSTAVLWIQSVAVFDVYYYYFNGYNDVIRNGYKEDDSNLLSSNIWLPGLPLMN---- 180

Query: 181 PSFVVSDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPIGPLV 240
                          SFEEKMQ+ +EE+NV ILVNSFDALEHDA +AIG FNLIPIGPLV
Sbjct: 181 ---------------SFEEKMQIFKEEDNVPILVNSFDALEHDALSAIGTFNLIPIGPLV 240

Query: 241 SLPPRFEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQTKEIV 300
           SLP   EVSTKQ++ S  Q GQQA+ED IKWLNSK DSSVVYIAFGSISKLS +QTKEIV
Sbjct: 241 SLPLGCEVSTKQQSISCFQDGQQAREDCIKWLNSKPDSSVVYIAFGSISKLSKEQTKEIV 300

Query: 301 GALLECSYPFLWVLSMDDIQDENLSLYFDDELQAQGKIVPWCSQVEVLSHRSVGCFVTHC 360
           GA LECSYPFLW L MDDI+DENLS YF+ ELQAQGKIVPWCSQVE+LSHRSVGCFVTHC
Sbjct: 301 GAFLECSYPFLWSLRMDDIRDENLSSYFNVELQAQGKIVPWCSQVEILSHRSVGCFVTHC 360

Query: 361 GWNSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSDGEGLVEGKEIARCL 420
           GWN TIE V  GVPTVAW LWADQATNAK+M+DVW++GVRVKKSSDGEG+VE KEI RCL
Sbjct: 361 GWNFTIECVAVGVPTVAWLLWADQATNAKMMEDVWKIGVRVKKSSDGEGMVERKEITRCL 420

Query: 421 RMVMDMED--HGRGKQLRINARKWQLLAMEAANGSSYMNIKAFVNKVCYQA 469
           RM+MDMED   G+GKQLRINA KWQ LAMEAANGSS++N+KAFVNKVC +A
Sbjct: 421 RMIMDMEDDSKGKGKQLRINATKWQRLAMEAANGSSFVNLKAFVNKVCDEA 452

BLAST of Cla97C05G088810 vs. TrEMBL
Match: tr|A0A0A0L890|A0A0A0L890_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G172390 PE=4 SV=1)

HSP 1 Score: 513.1 bits (1320), Expect = 6.7e-142
Identity = 272/367 (74.11%), Positives = 292/367 (79.56%), Query Frame = 0

Query: 1   MEHGNFLLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISKHQLPYGLSLSTF 60
           M+HGNFLLVSQSP SHLNPTLH ASTLL LGSKVTLLLTN+ALKNIS+ QLP GLSLSTF
Sbjct: 1   MKHGNFLLVSQSPTSHLNPTLHFASTLLSLGSKVTLLLTNHALKNISEDQLPSGLSLSTF 60

Query: 61  SDGFDDGFTSSDFPRWRVEFERLGRLALVDLL-SSQQQSLLSFTCIVHTLLIPWVTRVAL 120
           SDGFD+GFT SD   W VEFERLGR ALV+LL SS +Q LL  TCIV+TLLIPWV +VA 
Sbjct: 61  SDGFDNGFTYSDLQLWFVEFERLGRAALVNLLSSSSKQGLLPITCIVNTLLIPWVAQVAR 120

Query: 121 ELHVPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMNVLDL 180
           E HV TAILW QSVA FDVYYYYFNGYS VIRNGYK+D SNSL FNI LPGLPLMNVLDL
Sbjct: 121 EFHVSTAILWTQSVAVFDVYYYYFNGYSGVIRNGYKEDDSNSLSFNISLPGLPLMNVLDL 180

Query: 181 PSFVVSDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPIGPLV 240
           PSF+VSDD+HGLI+KSFEEK+Q+L+EE+NV ILVNSFDALEHDA +AIG FNLIPIGP V
Sbjct: 181 PSFMVSDDHHGLIIKSFEEKIQILKEEDNVPILVNSFDALEHDALSAIGTFNLIPIGPSV 240

Query: 241 SLPPRFEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQTKEIV 300
            LP   E   KQR  SY Q GQQAQEDYIKWLNSK DSSVVYIAFGS SKLS +QTKE+V
Sbjct: 241 LLPLGCE---KQRNISYFQDGQQAQEDYIKWLNSKPDSSVVYIAFGSFSKLSKEQTKEMV 300

Query: 301 GALLECSYPFLWVLSMDDIQDENLSLYFDDELQAQGKIVPWCSQVEVLSHRSVGCFVTHC 360
           GALLECSY                               PWCSQVEVLSHR+VGCFVTHC
Sbjct: 301 GALLECSY-------------------------------PWCSQVEVLSHRAVGCFVTHC 333

Query: 361 GWNSTIE 367
           GWNSTIE
Sbjct: 361 GWNSTIE 333

BLAST of Cla97C05G088810 vs. TrEMBL
Match: tr|A0A2P4IMI9|A0A2P4IMI9_QUESU (Glycosyltransferase OS=Quercus suber OX=58331 GN=CFP56_58469 PE=3 SV=1)

HSP 1 Score: 408.3 bits (1048), Expect = 2.3e-110
Identity = 219/463 (47.30%), Positives = 302/463 (65.23%), Query Frame = 0

Query: 5   NFLLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISKHQLPYGLSLSTFSDGF 64
           + LLV+   + H+NP+L  A  L++LG++VT  +T  A + + K   P GLS  TFSDG+
Sbjct: 3   HILLVTFPAQGHVNPSLQFAKRLIHLGAQVTFAITISAHRRMIKSPPPDGLSFVTFSDGY 62

Query: 65  DDGFTSSDFPRWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLIPWVTRVALELHVP 124
           DDGF+  D      +F+  G   L  L+ S        TC+V+TLL+PW   VA E+H+P
Sbjct: 63  DDGFSLDDAQNHFDQFKCNGSKTLTHLIVSSANQGRPITCLVYTLLLPWAADVAREVHLP 122

Query: 125 TAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMNVLDLPSFVV 184
           + +LWIQ     D+YYYYFNG++DVIRN   D  S+S    I LPGLPL+   DLPSF++
Sbjct: 123 STLLWIQPAMVLDIYYYYFNGFADVIRNDNNDYPSSS----IKLPGLPLLTSRDLPSFLL 182

Query: 185 SDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPIGPLVSLPPR 244
           + + H   L   +   + LE+E N  +LVN+FDALE +A  AI +FNL+ +GPL+     
Sbjct: 183 ASNTHTFALPIIQAHFEALEKENNPRVLVNTFDALEPEALKAIERFNLVAVGPLLPSDKS 242

Query: 245 FEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQTKEIVGALLE 304
           F              G    +DYI+WLNSK +SSV+Y++FGS++ L  +Q +EI   LL 
Sbjct: 243 F--------------GGDVSKDYIEWLNSKPESSVIYVSFGSLAVLMKQQMEEIARGLLG 302

Query: 305 CSYPFLWVL-SMDDIQDENLSLYFDDELQAQGKIVPWCSQVEVLSHRSVGCFVTHCGWNS 364
           C  PFLWV+ + ++ ++E LS    + L+  G IVPWCSQVEVLSH S+GCFVTHCGWNS
Sbjct: 303 CGRPFLWVIRAKENGEEEKLSC--REVLEQMGMIVPWCSQVEVLSHPSLGCFVTHCGWNS 362

Query: 365 TIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSDGEGLVEGKEIARCLRMVM 424
           T+ES+ +GVP VA+P W+DQ TNAKL++DVW+ GVR+  + D  G+VEG EI RCL +V+
Sbjct: 363 TLESLVSGVPMVAFPQWSDQVTNAKLIEDVWKTGVRMIVNKD--GIVEGDEIKRCLELVV 422

Query: 425 DMEDHGRGKQLRINARKWQLLAMEAAN--GSSYMNIKAFVNKV 465
              D  RG+ +R NA+KW+ LAMEAAN  GSSY N+K FV+++
Sbjct: 423 G--DGERGEAIRRNAKKWKELAMEAANEGGSSYNNLKDFVDEI 441

BLAST of Cla97C05G088810 vs. TrEMBL
Match: tr|A0A2I4GJH4|A0A2I4GJH4_9ROSI (Glycosyltransferase OS=Juglans regia OX=51240 GN=LOC109008422 PE=3 SV=1)

HSP 1 Score: 400.2 bits (1027), Expect = 6.4e-108
Identity = 217/465 (46.67%), Positives = 305/465 (65.59%), Query Frame = 0

Query: 7   LLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISKHQLPYGLSLSTFSDGFDD 66
           LLV+   + H+NP L  A  L+ LG+ VTL  +  A + ++K  +P GLS +TFSDG+DD
Sbjct: 8   LLVTFPAQGHINPGLQFAKRLIRLGAHVTLATSVSAYRRMTKTPIPQGLSFATFSDGYDD 67

Query: 67  GFT--SSDFPRWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLIPWVTRVALELHVP 126
           GF   + D   +    +R G   L DL+ S       F  +V+TLL+PW   VA ELH+P
Sbjct: 68  GFKPGTDDAEHYMSAIKRSGSKTLTDLIVSSTNEGRPFQYLVYTLLLPWAGNVAHELHLP 127

Query: 127 TAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMNVLDLPSFVV 186
           +A+LWIQ     D+YYYYFNGY D IR    D       +++ LPGLPL+   DLPSF++
Sbjct: 128 SALLWIQPATVLDIYYYYFNGYGDDIRKKGTDPS-----YSLQLPGLPLLYGRDLPSFLL 187

Query: 187 SDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPIGPLVSLPPR 246
             + +   L SF+E+++ LE+E N ++LVN+FDALE +A   I KFNL  +GPL+   P 
Sbjct: 188 DSNTYTFALPSFQEQIEALEKESNPTVLVNTFDALEPEALRVIEKFNLTAVGPLI---PS 247

Query: 247 FEVSTKQRTTSYLQGG--QQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQTKEIVGAL 306
             +  K  +     G   Q ++E YI+WLNSK +SSV+Y++FGSIS L+ +Q +E+   L
Sbjct: 248 AFLDGKDPSDKAFGGDLFQGSKEYYIEWLNSKPNSSVIYVSFGSISTLAKQQMEEMARGL 307

Query: 307 LECSYPFLWVL-SMDDIQDENLSLYFDDELQAQGKIVPWCSQVEVLSHRSVGCFVTHCGW 366
           L+C  PFLWV+ + ++ ++E LS    +EL+ +G IVPWCSQVEVLSH S+ CFVTHCGW
Sbjct: 308 LDCGRPFLWVIRAKENGEEERLSC--REELEQKGMIVPWCSQVEVLSHPSLACFVTHCGW 367

Query: 367 NSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSDGEGLVEGKEIARCLRM 426
           NS++ES+ +GVP VA+P W DQ TNAKL++DVW+ G+RV  + D  G+VE  EI RCL +
Sbjct: 368 NSSLESLVSGVPVVAFPQWTDQGTNAKLIEDVWKTGLRVTANKD--GIVESDEIKRCLEL 427

Query: 427 VMDMEDHGRGKQLRINARKWQLLAMEAA--NGSSYMNIKAFVNKV 465
           V    +  RG+++R NA+KW+ LA EAA   GSS+ N+KAFV ++
Sbjct: 428 VAGGGE--RGEEMRRNAKKWKELAREAAKEGGSSHKNLKAFVEEI 458

BLAST of Cla97C05G088810 vs. TrEMBL
Match: tr|A0A2K1ZXB2|A0A2K1ZXB2_POPTR (Glycosyltransferase OS=Populus trichocarpa OX=3694 GN=POPTR_006G055600v3 PE=3 SV=1)

HSP 1 Score: 399.1 bits (1024), Expect = 1.4e-107
Identity = 218/465 (46.88%), Positives = 303/465 (65.16%), Query Frame = 0

Query: 5   NFLLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISK-HQLPYGLSLSTFSDG 64
           + LLV+   + H+NP L  A  L+ +G+ VT   +  A + +SK    P GLS + F DG
Sbjct: 9   HILLVTFPAQGHINPALQFAKRLVAIGAHVTFSTSMGAARRMSKTGTYPKGLSFAAFDDG 68

Query: 65  FDDGF-TSSDFPRWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLIPWVTRVALELH 124
            + GF  S D   +  E   +G  +L +L+++  ++   FTC+V++ L+PWV +VA EL+
Sbjct: 69  SEHGFRPSDDIDHYFTELRLVGSKSLAELIAASSKNGRPFTCVVYSNLVPWVAKVARELN 128

Query: 125 VPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMNVLDLPSF 184
           +P+ +LW QS A  D++YYYFNGY D I      +  N   F++ LPGLP +   DLPSF
Sbjct: 129 LPSTLLWNQSPALLDIFYYYFNGYGDTI-----SENINDPTFSLKLPGLPPLGSRDLPSF 188

Query: 185 VVSDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPIGPLVSLP 244
               + H   +    E ++VL+EE N  +LVN+FDALE +A  +IGKF L+ +GPL+  P
Sbjct: 189 FNPRNTHAFAIPVNREHIEVLDEETNPKVLVNTFDALECEALNSIGKFKLVGVGPLI--P 248

Query: 245 PRFEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQTKEIVGAL 304
             F        TS+     Q  +D+I+WLNSK +SSV+YIAFGSIS LS  Q +E+  AL
Sbjct: 249 SAFLDGEDPTDTSFGGDLFQGSKDHIEWLNSKPESSVIYIAFGSISALSKPQKEEMARAL 308

Query: 305 LECSYPFLWVLSMDDIQD-ENLSLYFDDELQAQGKIVPWCSQVEVLSHRSVGCFVTHCGW 364
           LE   PFLWV+  D  ++ E   L   +EL+ QGKIVPWCSQVEVLSH S+GCFVTHCGW
Sbjct: 309 LETGRPFLWVIRADRGEEKEEDKLSCKEELEKQGKIVPWCSQVEVLSHPSIGCFVTHCGW 368

Query: 365 NSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSDGEGLVEGKEIARCLRM 424
           NST ES+ +GVP VA+P W DQ TNAK+++DVW+ GVRV  SS+ EG+VEG+EI RCL +
Sbjct: 369 NSTFESLASGVPMVAFPQWTDQLTNAKMVEDVWKTGVRV-TSSNKEGVVEGEEIERCLEV 428

Query: 425 VMDMEDHGRGKQLRINARKWQLLAMEAA--NGSSYMNIKAFVNKV 465
           VM   +  RG ++R NA+KW+ LA +++   GSSY N+KAFV+++
Sbjct: 429 VMGGGE--RGNEMRKNAKKWKELARQSSKEGGSSYNNLKAFVDEI 463

BLAST of Cla97C05G088810 vs. Swiss-Prot
Match: sp|F8WKW0|UGT1_GARJA (Crocetin glucosyltransferase, chloroplastic OS=Gardenia jasminoides OX=114476 GN=UGT75L6 PE=1 SV=1)

HSP 1 Score: 387.1 bits (993), Expect = 2.8e-106
Identity = 213/472 (45.13%), Positives = 292/472 (61.86%), Query Frame = 0

Query: 1   MEHGNFLLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISKH--QLPYGLSLS 60
           ++  + LL++   + H+NP L  A  LL +G +VTL  + YAL  + K     P GL+ +
Sbjct: 2   VQQRHVLLITYPAQGHINPALQFAQRLLRMGIQVTLATSVYALSRMKKSSGSTPKGLTFA 61

Query: 61  TFSDGFDDGFTSS--DFPRWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLIPWVTR 120
           TFSDG+DDGF     D   +     + G   L +++++        TC+V+TLL+PW   
Sbjct: 62  TFSDGYDDGFRPKGVDHTEYMSSLAKQGSNTLRNVINTSADQGCPVTCLVYTLLLPWAAT 121

Query: 121 VALELHVPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMNV 180
           VA E H+P+A+LWIQ VA  D+YYYYF GY D ++N      SN   ++I  PGLP M  
Sbjct: 122 VARECHIPSALLWIQPVAVMDIYYYYFRGYEDDVKN-----NSNDPTWSIQFPGLPSMKA 181

Query: 181 LDLPSFVV--SDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIP 240
            DLPSF++  SD+ +   L +F+++++ L+EEE   +LVN+FDALE  A  AI  +NLI 
Sbjct: 182 KDLPSFILPSSDNIYSFALPTFKKQLETLDEEERPKVLVNTFDALEPQALKAIESYNLIA 241

Query: 241 IGPLVSLPPRFEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQ 300
           IGPL   P  F        TS+     Q  +DY +WLNS+   SVVY++FGS+  L  +Q
Sbjct: 242 IGPLT--PSAFLDGKDPSETSFSGDLFQKSKDYKEWLNSRPAGSVVYVSFGSLLTLPKQQ 301

Query: 301 TKEIVGALLECSYPFLWVLSMDDIQDENLS---LYFDDELQAQGKIVPWCSQVEVLSHRS 360
            +EI   LL+   PFLWV+   +  +E      L   +EL+ QG IVPWCSQ+EVL+H S
Sbjct: 302 MEEIARGLLKSGRPFLWVIRAKENGEEEKEEDRLICMEELEEQGMIVPWCSQIEVLTHPS 361

Query: 361 VGCFVTHCGWNSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSDGEGLVE 420
           +GCFVTHCGWNST+E++  GVP VA+P W DQ TNAKL++DVWE GVRV  + D  G VE
Sbjct: 362 LGCFVTHCGWNSTLETLVCGVPVVAFPHWTDQGTNAKLIEDVWETGVRVVPNED--GTVE 421

Query: 421 GKEIARCLRMVMDMEDHGRGKQLRINARKWQLLAMEA--ANGSSYMNIKAFV 462
             EI RC+  VMD  D  +G +L+ NA+KW+ LA EA   +GSS  N+KAFV
Sbjct: 422 SDEIKRCIETVMD--DGEKGVELKRNAKKWKELAREAMQEDGSSDKNLKAFV 462

BLAST of Cla97C05G088810 vs. Swiss-Prot
Match: sp|O23406|U75D1_ARATH (UDP-glycosyltransferase 75D1 OS=Arabidopsis thaliana OX=3702 GN=UGT75D1 PE=2 SV=2)

HSP 1 Score: 346.7 bits (888), Expect = 4.1e-94
Identity = 191/485 (39.38%), Positives = 295/485 (60.82%), Query Frame = 0

Query: 5   NFLLVSQSPKSHLNPTLHLASTL--LYLGSKVTLL--LTNYALKNISKHQLPYGLSLSTF 64
           +FL V+   + H+NP+L LA  L     G++VT    ++ Y  +  S   +P  L  +T+
Sbjct: 13  HFLFVTFPAQGHINPSLELAKRLAGTISGARVTFAASISAYNRRMFSTENVPETLIFATY 72

Query: 65  SDGFDDGFTSSDFP---------RWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLI 124
           SDG DDGF SS +           +  E  R G+  L +L+   ++    FTC+V+T+L+
Sbjct: 73  SDGHDDGFKSSAYSDKSRQDATGNFMSEMRRRGKETLTELIEDNRKQNRPFTCVVYTILL 132

Query: 125 PWVTRVALELHVPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGL 184
            WV  +A E H+P+A+LW+Q V  F ++Y+YFNGY D I      + +N+   +I LP L
Sbjct: 133 TWVAELAREFHLPSALLWVQPVTVFSIFYHYFNGYEDAI-----SEMANTPSSSIKLPSL 192

Query: 185 PLMNVLDLPSFVVSDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAI-GKF 244
           PL+ V D+PSF+VS + +  +L +F E++  L+EE N  IL+N+F  LE +A +++   F
Sbjct: 193 PLLTVRDIPSFIVSSNVYAFLLPAFREQIDSLKEEINPKILINTFQELEPEAMSSVPDNF 252

Query: 245 NLIPIGPLVSLPPRFEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKL 304
            ++P+GPL++L   F                 ++ +YI+WL++K DSSV+Y++FG+++ L
Sbjct: 253 KIVPVGPLLTLRTDF----------------SSRGEYIEWLDTKADSSVLYVSFGTLAVL 312

Query: 305 SNKQTKEIVGALLECSYPFLWVL------SMDDIQ--DENLSLYFDDELQAQGKIVPWCS 364
           S KQ  E+  AL++   PFLWV+      + +D Q  +E+    F +EL   G +V WC 
Sbjct: 313 SKKQLVELCKALIQSRRPFLWVITDKSYRNKEDEQEKEEDCISSFREELDEIGMVVSWCD 372

Query: 365 QVEVLSHRSVGCFVTHCGWNSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRV-- 424
           Q  VL+HRS+GCFVTHCGWNST+ES+ +GVP VA+P W DQ  NAKL++D W+ GVRV  
Sbjct: 373 QFRVLNHRSIGCFVTHCGWNSTLESLVSGVPVVAFPQWNDQMMNAKLLEDCWKTGVRVME 432

Query: 425 KKSSDGEGLVEGKEIARCLRMVMDMEDHGRGKQLRINARKWQLLAMEAA--NGSSYMNIK 464
           KK  +G  +V+ +EI RC+  VM+     + ++ R NA +W+ LA EA    GSS+ ++K
Sbjct: 433 KKEEEGVVVVDSEEIRRCIEEVME----DKAEEFRGNATRWKDLAAEAVREGGSSFNHLK 472

BLAST of Cla97C05G088810 vs. Swiss-Prot
Match: sp|Q9ZR25|5GT_VERHY (Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase OS=Verbena hybrida OX=76714 GN=HGT8 PE=2 SV=1)

HSP 1 Score: 322.8 bits (826), Expect = 6.4e-87
Identity = 195/477 (40.88%), Positives = 282/477 (59.12%), Query Frame = 0

Query: 1   MEHGNFLLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISKHQLPYG--LSLS 60
           M   + LL +   + H+NP L  A  L     +VT   + YA + +S+        ++  
Sbjct: 1   MSRAHVLLATFPAQGHINPALQFAKRLANADIQVTFFTSVYAWRRMSRTAAGSNGLINFV 60

Query: 61  TFSDGFDDGF-TSSDFPRWRVEFERLGRLALVDLLSSQ--QQSLLSFTCIVHTLLIPWVT 120
           +FSDG+DDG     D   +  E +  G  AL D L++    Q     T +V++ L  W  
Sbjct: 61  SFSDGYDDGLQPGDDGKNYMSEMKSRGIKALSDTLAANNVDQKSSKITFVVYSHLFAWAA 120

Query: 121 RVALELHVPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLP-GLPLM 180
           +VA E H+ +A+LWI+     D++Y+YFNGYSD I     D GS++    I LP GLP++
Sbjct: 121 KVAREFHLRSALLWIEPATVLDIFYFYFNGYSDEI-----DAGSDA----IHLPGGLPVL 180

Query: 181 NVLDLPSFVVSDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIP 240
              DLPSF++    H       +EK++ LE EE   +LVNSFDALE DA  AI K+ +I 
Sbjct: 181 AQRDLPSFLLPST-HERFRSLMKEKLETLEGEEKPKVLVNSFDALEPDALKAIDKYEMIA 240

Query: 241 IGPLVSLPPRFEVSTKQRTTSYLQGGQ-----QAQEDYIKWLNSKFDSSVVYIAFGSISK 300
           IGPL+  P  F         S+  GG         +D ++WL++   SSVVY++FGS   
Sbjct: 241 IGPLI--PSAFLDGKDPSDRSF--GGDLFEKGSNDDDCLEWLSTNPRSSVVYVSFGSFVN 300

Query: 301 LSNKQTKEIVGALLECSYPFLWVLSMDDIQDENLSLYFDDELQAQGKIVPWCSQVEVLSH 360
            +  Q +EI   LL+C  PFLWV+ +++ ++  +S    +EL+  GKIV WCSQ+EVL+H
Sbjct: 301 TTKSQMEEIARGLLDCGRPFLWVVRVNEGEEVLISCM--EELKRVGKIVSWCSQLEVLTH 360

Query: 361 RSVGCFVTHCGWNSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSDGEGL 420
            S+GCFVTHCGWNST+ES++ GVP VA+P W DQ TNAKLM+DVW  GVRV+ + +G  +
Sbjct: 361 PSLGCFVTHCGWNSTLESISFGVPMVAFPQWFDQGTNAKLMEDVWRTGVRVRANEEG-SV 420

Query: 421 VEGKEIARCLRMVMDMEDHGRGKQLRINARKWQLLAMEA--ANGSSYMNIKAFVNKV 465
           V+G EI RC+  VMD  +  + ++LR +A KW+ LA +A   +GSS  N+K F+++V
Sbjct: 421 VDGDEIRRCIEEVMDGGE--KSRKLRESAGKWKDLARKAMEEDGSSVNNLKVFLDEV 458

BLAST of Cla97C05G088810 vs. Swiss-Prot
Match: sp|Q9ZVY5|U75B2_ARATH (UDP-glycosyltransferase 75B2 OS=Arabidopsis thaliana OX=3702 GN=UGT75B2 PE=2 SV=1)

HSP 1 Score: 320.1 bits (819), Expect = 4.1e-86
Identity = 193/482 (40.04%), Positives = 277/482 (57.47%), Query Frame = 0

Query: 1   MEHGNFLLVSQSPKSHLNPTLHLASTLL-YLGSKVTL--LLTNYALKNISKHQLPYGLSL 60
           M   +FLLV+   + H+NP+L  A  L+   G++VT    L+      I  H     LS 
Sbjct: 1   MAQPHFLLVTFPAQGHVNPSLRFARRLIKTTGARVTFATCLSVIHRSMIPNHNNVENLSF 60

Query: 61  STFSDGFDDGFTSS--DFPRWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLIPWVT 120
            TFSDGFDDG  S+  D     V FER G  AL D + + Q      +C+++T+L  WV 
Sbjct: 61  LTFSDGFDDGVISNTDDVQNRLVHFERNGDKALSDFIEANQNGDSPVSCLIYTILPNWVP 120

Query: 121 RVALELHVPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMN 180
           +VA   H+P+  LWIQ   AFD+YY Y  G + V                   P LP + 
Sbjct: 121 KVARRFHLPSVHLWIQPAFAFDIYYNYSTGNNSVFE----------------FPNLPSLE 180

Query: 181 VLDLPSFVVSDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPI 240
           + DLPSF+   + +      ++E M  L+EE N  ILVN+FD+LE +  TAI    ++ +
Sbjct: 181 IRDLPSFLSPSNTNKAAQAVYQELMDFLKEESNPKILVNTFDSLEPEFLTAIPNIEMVAV 240

Query: 241 GPLVSLPPRFEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQT 300
           GPL  LP   E+ T   +   L    Q+   Y  WL+SK +SSV+Y++FG++ +LS KQ 
Sbjct: 241 GPL--LPA--EIFTGSESGKDLSRDHQS-SSYTLWLDSKTESSVIYVSFGTMVELSKKQI 300

Query: 301 KEIVGALLECSYPFLWVLS-----------MDDIQDENLSLYFDDELQAQGKIVPWCSQV 360
           +E+  AL+E   PFLWV++            ++ + E ++  F  EL+  G IV WCSQ+
Sbjct: 301 EELARALIEGGRPFLWVITDKLNREAKIEGEEETEIEKIA-GFRHELEEVGMIVSWCSQI 360

Query: 361 EVLSHRSVGCFVTHCGWNSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSS 420
           EVL HR++GCF+THCGW+S++ES+  GVP VA+P+W+DQ  NAKL++++W+ GVRV+++S
Sbjct: 361 EVLRHRAIGCFLTHCGWSSSLESLVLGVPVVAFPMWSDQPANAKLLEEIWKTGVRVRENS 420

Query: 421 DGEGLVEGKEIARCLRMVMDMEDHGRGKQLRINARKWQLLAMEAA--NGSSYMNIKAFVN 465
             EGLVE  EI RCL  VM+     +  +LR NA KW+ LA EA    GSS  N++AFV 
Sbjct: 421 --EGLVERGEIMRCLEAVME----AKSVELRENAEKWKRLATEAGREGGSSDKNVEAFVK 454

BLAST of Cla97C05G088810 vs. Swiss-Prot
Match: sp|Q9LR44|U75B1_ARATH (UDP-glycosyltransferase 75B1 OS=Arabidopsis thaliana OX=3702 GN=UGT75B1 PE=1 SV=1)

HSP 1 Score: 316.6 bits (810), Expect = 4.6e-85
Identity = 189/480 (39.38%), Positives = 277/480 (57.71%), Query Frame = 0

Query: 5   NFLLVSQSPKSHLNPTLHLASTLL-YLGSKVTLLLTNYALKN--ISKHQLPYGLSLSTFS 64
           +FLLV+   + H+NP+L  A  L+   G++VT +       N  I+ H     LS  TFS
Sbjct: 5   HFLLVTFPAQGHVNPSLRFARRLIKRTGARVTFVTCVSVFHNSMIANHNKVENLSFLTFS 64

Query: 65  DGFDDG--FTSSDFPRWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLIPWVTRVAL 124
           DGFDDG   T  D  +  V  +  G  AL D + + +      TC+++T+L+ W  +VA 
Sbjct: 65  DGFDDGGISTYEDRQKRSVNLKVNGDKALSDFIEATKNGDSPVTCLIYTILLNWAPKVAR 124

Query: 125 ELHVPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMNVLDL 184
              +P+A+LWIQ    F++YY +F G   V                  LP L  + + DL
Sbjct: 125 RFQLPSALLWIQPALVFNIYYTHFMGNKSVFE----------------LPNLSSLEIRDL 184

Query: 185 PSFVVSDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPIGPLV 244
           PSF+   + +     +F+E M+ L +E    IL+N+FD+LE +A TA    +++ +GPL 
Sbjct: 185 PSFLTPSNTNKGAYDAFQEMMEFLIKETKPKILINTFDSLEPEALTAFPNIDMVAVGPL- 244

Query: 245 SLPPR-FEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQTKEI 304
            LP   F  ST +         +     Y  WL+SK +SSV+Y++FG++ +LS KQ +E+
Sbjct: 245 -LPTEIFSGSTNKSV-------KDQSSSYTLWLDSKTESSVIYVSFGTMVELSKKQIEEL 304

Query: 305 VGALLECSYPFLWVLS-----------MDDIQDENLSLYFDDELQAQGKIVPWCSQVEVL 364
             AL+E   PFLWV++            ++ + E ++  F  EL+  G IV WCSQ+EVL
Sbjct: 305 ARALIEGKRPFLWVITDKSNRETKTEGEEETEIEKIA-GFRHELEEVGMIVSWCSQIEVL 364

Query: 365 SHRSVGCFVTHCGWNSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSDGE 424
           SHR+VGCFVTHCGW+ST+ES+  GVP VA+P+W+DQ TNAKL+++ W+ GVRV+++ D  
Sbjct: 365 SHRAVGCFVTHCGWSSTLESLVLGVPVVAFPMWSDQPTNAKLLEESWKTGVRVRENKD-- 424

Query: 425 GLVEGKEIARCLRMVMDMEDHGRGKQLRINARKWQLLAMEAA--NGSSYMNIKAFVNKVC 466
           GLVE  EI RCL  VM+     +  +LR NA+KW+ LAMEA    GSS  N++AFV  +C
Sbjct: 425 GLVERGEIRRCLEAVME----EKSVELRENAKKWKRLAMEAGREGGSSDKNMEAFVEDIC 452

BLAST of Cla97C05G088810 vs. TAIR10
Match: AT4G15550.1 (indole-3-acetate beta-D-glucosyltransferase)

HSP 1 Score: 346.7 bits (888), Expect = 2.3e-95
Identity = 191/485 (39.38%), Positives = 295/485 (60.82%), Query Frame = 0

Query: 5   NFLLVSQSPKSHLNPTLHLASTL--LYLGSKVTLL--LTNYALKNISKHQLPYGLSLSTF 64
           +FL V+   + H+NP+L LA  L     G++VT    ++ Y  +  S   +P  L  +T+
Sbjct: 13  HFLFVTFPAQGHINPSLELAKRLAGTISGARVTFAASISAYNRRMFSTENVPETLIFATY 72

Query: 65  SDGFDDGFTSSDFP---------RWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLI 124
           SDG DDGF SS +           +  E  R G+  L +L+   ++    FTC+V+T+L+
Sbjct: 73  SDGHDDGFKSSAYSDKSRQDATGNFMSEMRRRGKETLTELIEDNRKQNRPFTCVVYTILL 132

Query: 125 PWVTRVALELHVPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGL 184
            WV  +A E H+P+A+LW+Q V  F ++Y+YFNGY D I      + +N+   +I LP L
Sbjct: 133 TWVAELAREFHLPSALLWVQPVTVFSIFYHYFNGYEDAI-----SEMANTPSSSIKLPSL 192

Query: 185 PLMNVLDLPSFVVSDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAI-GKF 244
           PL+ V D+PSF+VS + +  +L +F E++  L+EE N  IL+N+F  LE +A +++   F
Sbjct: 193 PLLTVRDIPSFIVSSNVYAFLLPAFREQIDSLKEEINPKILINTFQELEPEAMSSVPDNF 252

Query: 245 NLIPIGPLVSLPPRFEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKL 304
            ++P+GPL++L   F                 ++ +YI+WL++K DSSV+Y++FG+++ L
Sbjct: 253 KIVPVGPLLTLRTDF----------------SSRGEYIEWLDTKADSSVLYVSFGTLAVL 312

Query: 305 SNKQTKEIVGALLECSYPFLWVL------SMDDIQ--DENLSLYFDDELQAQGKIVPWCS 364
           S KQ  E+  AL++   PFLWV+      + +D Q  +E+    F +EL   G +V WC 
Sbjct: 313 SKKQLVELCKALIQSRRPFLWVITDKSYRNKEDEQEKEEDCISSFREELDEIGMVVSWCD 372

Query: 365 QVEVLSHRSVGCFVTHCGWNSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRV-- 424
           Q  VL+HRS+GCFVTHCGWNST+ES+ +GVP VA+P W DQ  NAKL++D W+ GVRV  
Sbjct: 373 QFRVLNHRSIGCFVTHCGWNSTLESLVSGVPVVAFPQWNDQMMNAKLLEDCWKTGVRVME 432

Query: 425 KKSSDGEGLVEGKEIARCLRMVMDMEDHGRGKQLRINARKWQLLAMEAA--NGSSYMNIK 464
           KK  +G  +V+ +EI RC+  VM+     + ++ R NA +W+ LA EA    GSS+ ++K
Sbjct: 433 KKEEEGVVVVDSEEIRRCIEEVME----DKAEEFRGNATRWKDLAAEAVREGGSSFNHLK 472

BLAST of Cla97C05G088810 vs. TAIR10
Match: AT1G05530.1 (UDP-glucosyl transferase 75B2)

HSP 1 Score: 320.1 bits (819), Expect = 2.3e-87
Identity = 193/482 (40.04%), Positives = 277/482 (57.47%), Query Frame = 0

Query: 1   MEHGNFLLVSQSPKSHLNPTLHLASTLL-YLGSKVTL--LLTNYALKNISKHQLPYGLSL 60
           M   +FLLV+   + H+NP+L  A  L+   G++VT    L+      I  H     LS 
Sbjct: 1   MAQPHFLLVTFPAQGHVNPSLRFARRLIKTTGARVTFATCLSVIHRSMIPNHNNVENLSF 60

Query: 61  STFSDGFDDGFTSS--DFPRWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLIPWVT 120
            TFSDGFDDG  S+  D     V FER G  AL D + + Q      +C+++T+L  WV 
Sbjct: 61  LTFSDGFDDGVISNTDDVQNRLVHFERNGDKALSDFIEANQNGDSPVSCLIYTILPNWVP 120

Query: 121 RVALELHVPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMN 180
           +VA   H+P+  LWIQ   AFD+YY Y  G + V                   P LP + 
Sbjct: 121 KVARRFHLPSVHLWIQPAFAFDIYYNYSTGNNSVFE----------------FPNLPSLE 180

Query: 181 VLDLPSFVVSDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPI 240
           + DLPSF+   + +      ++E M  L+EE N  ILVN+FD+LE +  TAI    ++ +
Sbjct: 181 IRDLPSFLSPSNTNKAAQAVYQELMDFLKEESNPKILVNTFDSLEPEFLTAIPNIEMVAV 240

Query: 241 GPLVSLPPRFEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQT 300
           GPL  LP   E+ T   +   L    Q+   Y  WL+SK +SSV+Y++FG++ +LS KQ 
Sbjct: 241 GPL--LPA--EIFTGSESGKDLSRDHQS-SSYTLWLDSKTESSVIYVSFGTMVELSKKQI 300

Query: 301 KEIVGALLECSYPFLWVLS-----------MDDIQDENLSLYFDDELQAQGKIVPWCSQV 360
           +E+  AL+E   PFLWV++            ++ + E ++  F  EL+  G IV WCSQ+
Sbjct: 301 EELARALIEGGRPFLWVITDKLNREAKIEGEEETEIEKIA-GFRHELEEVGMIVSWCSQI 360

Query: 361 EVLSHRSVGCFVTHCGWNSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSS 420
           EVL HR++GCF+THCGW+S++ES+  GVP VA+P+W+DQ  NAKL++++W+ GVRV+++S
Sbjct: 361 EVLRHRAIGCFLTHCGWSSSLESLVLGVPVVAFPMWSDQPANAKLLEEIWKTGVRVRENS 420

Query: 421 DGEGLVEGKEIARCLRMVMDMEDHGRGKQLRINARKWQLLAMEAA--NGSSYMNIKAFVN 465
             EGLVE  EI RCL  VM+     +  +LR NA KW+ LA EA    GSS  N++AFV 
Sbjct: 421 --EGLVERGEIMRCLEAVME----AKSVELRENAEKWKRLATEAGREGGSSDKNVEAFVK 454

BLAST of Cla97C05G088810 vs. TAIR10
Match: AT1G05560.1 (UDP-glucosyltransferase 75B1)

HSP 1 Score: 316.6 bits (810), Expect = 2.5e-86
Identity = 189/480 (39.38%), Positives = 277/480 (57.71%), Query Frame = 0

Query: 5   NFLLVSQSPKSHLNPTLHLASTLL-YLGSKVTLLLTNYALKN--ISKHQLPYGLSLSTFS 64
           +FLLV+   + H+NP+L  A  L+   G++VT +       N  I+ H     LS  TFS
Sbjct: 5   HFLLVTFPAQGHVNPSLRFARRLIKRTGARVTFVTCVSVFHNSMIANHNKVENLSFLTFS 64

Query: 65  DGFDDG--FTSSDFPRWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLIPWVTRVAL 124
           DGFDDG   T  D  +  V  +  G  AL D + + +      TC+++T+L+ W  +VA 
Sbjct: 65  DGFDDGGISTYEDRQKRSVNLKVNGDKALSDFIEATKNGDSPVTCLIYTILLNWAPKVAR 124

Query: 125 ELHVPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMNVLDL 184
              +P+A+LWIQ    F++YY +F G   V                  LP L  + + DL
Sbjct: 125 RFQLPSALLWIQPALVFNIYYTHFMGNKSVFE----------------LPNLSSLEIRDL 184

Query: 185 PSFVVSDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPIGPLV 244
           PSF+   + +     +F+E M+ L +E    IL+N+FD+LE +A TA    +++ +GPL 
Sbjct: 185 PSFLTPSNTNKGAYDAFQEMMEFLIKETKPKILINTFDSLEPEALTAFPNIDMVAVGPL- 244

Query: 245 SLPPR-FEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGSISKLSNKQTKEI 304
            LP   F  ST +         +     Y  WL+SK +SSV+Y++FG++ +LS KQ +E+
Sbjct: 245 -LPTEIFSGSTNKSV-------KDQSSSYTLWLDSKTESSVIYVSFGTMVELSKKQIEEL 304

Query: 305 VGALLECSYPFLWVLS-----------MDDIQDENLSLYFDDELQAQGKIVPWCSQVEVL 364
             AL+E   PFLWV++            ++ + E ++  F  EL+  G IV WCSQ+EVL
Sbjct: 305 ARALIEGKRPFLWVITDKSNRETKTEGEEETEIEKIA-GFRHELEEVGMIVSWCSQIEVL 364

Query: 365 SHRSVGCFVTHCGWNSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSDGE 424
           SHR+VGCFVTHCGW+ST+ES+  GVP VA+P+W+DQ TNAKL+++ W+ GVRV+++ D  
Sbjct: 365 SHRAVGCFVTHCGWSSTLESLVLGVPVVAFPMWSDQPTNAKLLEESWKTGVRVRENKD-- 424

Query: 425 GLVEGKEIARCLRMVMDMEDHGRGKQLRINARKWQLLAMEAA--NGSSYMNIKAFVNKVC 466
           GLVE  EI RCL  VM+     +  +LR NA+KW+ LAMEA    GSS  N++AFV  +C
Sbjct: 425 GLVERGEIRRCLEAVME----EKSVELRENAKKWKRLAMEAGREGGSSDKNMEAFVEDIC 452

BLAST of Cla97C05G088810 vs. TAIR10
Match: AT4G14090.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 314.7 bits (805), Expect = 9.6e-86
Identity = 184/470 (39.15%), Positives = 278/470 (59.15%), Query Frame = 0

Query: 5   NFLLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISKHQLPYGLSLSTFSDGF 64
           ++LLV+   + H+NP L LA+ L++ G+ VT      A + + +     GLS + F+DGF
Sbjct: 13  HYLLVTFPAQGHINPALQLANRLIHHGATVTYSTAVSAHRRMGEPPSTKGLSFAWFTDGF 72

Query: 65  DDGFTS-SDFPRWRVEFERLGRLALVDLLSSQQQSLLS---FTCIVHTLLIPWVTRVALE 124
           DDG  S  D   +  E +R G  AL D++ +   +       T +++++L+PWV+ VA E
Sbjct: 73  DDGLKSFEDQKIYMSELKRCGSNALRDIIKANLDATTETEPITGVIYSVLVPWVSTVARE 132

Query: 125 LHVPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFN---IWLPGLPLMNVL 184
            H+PT +LWI+     D+YYYYFN         YK       LF+   I LP LPL+   
Sbjct: 133 FHLPTTLLWIEPATVLDIYYYYFN-------TSYKH------LFDVEPIKLPKLPLITTG 192

Query: 185 DLPSFVVSDDYHGLILKSFEEKMQVLEEEENVSILVNSFDALEHDAFTAIGKFNLIPIGP 244
           DLPSF+         L +  E ++ LE E N  ILVN+F ALEHDA T++ K  +IPIGP
Sbjct: 193 DLPSFLQPSKALPSALVTLREHIEALETESNPKILVNTFSALEHDALTSVEKLKMIPIGP 252

Query: 245 LVSLPPRFEVSTKQRTTSYLQGGQQAQEDYIKWLNSKFDSSVVYIAFGS-ISKLSNKQTK 304
           LVS       S++ +T  +    + + EDY KWL+SK + SV+YI+ G+    L  K  +
Sbjct: 253 LVS-------SSEGKTDLF----KSSDEDYTKWLDSKLERSVIYISLGTHADDLPEKHME 312

Query: 305 EIVGALLECSYPFLWVLSMDDIQDENLSLYFD-DELQAQGKIVPWCSQVEVLSHRSVGCF 364
            +   +L  + PFLW++   + +++  + + +      +G +V WCSQ  VL+H +VGCF
Sbjct: 313 ALTHGVLATNRPFLWIVREKNPEEKKKNRFLELIRGSDRGLVVGWCSQTAVLAHCAVGCF 372

Query: 365 VTHCGWNSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSDGEGLVEGKEI 424
           VTHCGWNST+ES+ +GVP VA+P +ADQ T AKL++D W +GV+VK   +G+  V+G+EI
Sbjct: 373 VTHCGWNSTLESLESGVPVVAFPQFADQCTTAKLVEDTWRIGVKVKVGEEGD--VDGEEI 432

Query: 425 ARCLRMVMDMEDHGRGKQLRINARKWQLLAMEAA--NGSSYMNIKAFVNK 464
            RCL  VM   +    +++R NA KW+ +A++AA   G S +N+K FV++
Sbjct: 433 RRCLEKVMSGGE--EAEEMRENAEKWKAMAVDAAAEGGPSDLNLKGFVDE 454

BLAST of Cla97C05G088810 vs. TAIR10
Match: AT2G43820.1 (UDP-glucosyltransferase 74F2)

HSP 1 Score: 222.2 bits (565), Expect = 6.5e-58
Identity = 157/475 (33.05%), Positives = 254/475 (53.47%), Query Frame = 0

Query: 1   MEH--GNFLLVSQSPKSHLNPTLHLASTLLYLGSKVTLLLTNYALKNISKHQLPYGLSLS 60
           MEH  G+ L V    + H+ P       L + G K TL LT +   +I+   L   +S++
Sbjct: 1   MEHKRGHVLAVPYPTQGHITPFRQFCKRLHFKGLKTTLALTTFVFNSINP-DLSGPISIA 60

Query: 61  TFSDGFDDG--FTSSDFPRWRVEFERLGRLALVDLLSSQQQSLLSFTCIVHTLLIPWVTR 120
           T SDG+D G   T+     +  +F+  G   + D++   Q S    TCIV+   +PW   
Sbjct: 61  TISDGYDHGGFETADSIDDYLKDFKTSGSKTIADIIQKHQTSDNPITCIVYDAFLPWALD 120

Query: 121 VALELHVPTAILWIQSVAAFDVYYYYFNGYSDVIRNGYKDDGSNSLLFNIWLPGLPLMNV 180
           VA E  +     + Q  A   VYY             Y ++GS      + +  LP + +
Sbjct: 121 VAREFGLVATPFFTQPCAVNYVYYL-----------SYINNGS----LQLPIEELPFLEL 180

Query: 181 LDLPSFV-VSDDYHGLILKSFEEKM-QVLEEEENVSILVNSFDALE-HDAFTAIGKFNLI 240
            DLPSF  VS  Y       FE  + Q +  E+   +LVNSF  LE H+         ++
Sbjct: 181 QDLPSFFSVSGSYPAY----FEMVLQQFINFEKADFVLVNSFQELELHENELWSKACPVL 240

Query: 241 PIGPLVSLPPRFEVSTKQRTTSYLQGGQQAQED--YIKWLNSKFDSSVVYIAFGSISKLS 300
            IGP  ++P  +     +  T Y     ++++D   I WL+++   SVVY+AFGS+++L+
Sbjct: 241 TIGP--TIPSIYLDQRIKSDTGYDLNLFESKDDSFCINWLDTRPQGSVVYVAFGSMAQLT 300

Query: 301 NKQTKEIVGALLECSYPFLWVLSMDDIQDENLSLYFDDELQAQGKIVPWCSQVEVLSHRS 360
           N Q +E+  A+   ++ FLWV+   + +++  S + +   + +  ++ W  Q++VLS+++
Sbjct: 301 NVQMEELASAV--SNFSFLWVVRSSE-EEKLPSGFLETVNKEKSLVLKWSPQLQVLSNKA 360

Query: 361 VGCFVTHCGWNSTIESVTAGVPTVAWPLWADQATNAKLMQDVWELGVRVKKSSDGEGLVE 420
           +GCF+THCGWNST+E++T GVP VA P W DQ  NAK +QDVW+ GVRVK   +  G+ +
Sbjct: 361 IGCFLTHCGWNSTMEALTFGVPMVAMPQWTDQPMNAKYIQDVWKAGVRVKTEKE-SGIAK 420

Query: 421 GKEIARCLRMVMDMEDHGRGKQLRINARKWQLLAMEAAN--GSSYMNIKAFVNKV 465
            +EI   ++ VM+ E   R K+++ N +KW+ LA+++ N  GS+  NI  FV++V
Sbjct: 421 REEIEFSIKEVMEGE---RSKEMKKNVKKWRDLAVKSLNEGGSTDTNIDTFVSRV 446
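
The percentages in each HSP header above are the identical (or positive-scoring) position counts divided by the alignment length, rounded to two decimals. The short Python sketch below re-derives them from a header line as a sanity check. The function names and the regular expression are illustrative assumptions, not part of the database or of any BLAST tool, and the pattern accepts either spelling of the "Positives" label.

```python
import re

# Hypothetical pattern for the HSP statistics line shown in the alignments above.
# "Posi?tives" tolerates both the "Postives" and "Positives" spellings.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(matches: int, length: int) -> float:
    """Return matches/length as a percentage rounded to two decimals."""
    return round(100.0 * matches / length, 2)


def hsp_stats_consistent(line: str, tol: float = 0.005) -> bool:
    """Check that the reported percentages agree with the reported counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    ident_ok = abs(percent(int(ident), int(ident_len)) - float(ident_pct)) <= tol
    pos_ok = abs(percent(int(pos), int(pos_len)) - float(pos_pct)) <= tol
    return ident_ok and pos_ok


if __name__ == "__main__":
    header = "Identity = 195/477 (40.88%), Positives = 282/477 (59.12%), Query Frame = 0"
    print(hsp_stats_consistent(header))
```

Applied to the Verbena hybrida hit, 195/477 and 282/477 give the reported identity and positives percentages; the same check can be run on the other header lines.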

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_008439390.2  2.5e-201  77.71     PREDICTED: crocetin glucosyltransferase, chloroplastic [Cucumis melo]
XP_004147672.1  1.0e-141  74.11     PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Cucumis sativus] >K...
XP_023885287.1  3.5e-110  47.30     crocetin glucosyltransferase, chloroplastic-like [Quercus suber] >POE69741.1 cro...
XP_018844047.1  9.6e-108  46.67     PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Juglans regia]
XP_002308970.2  2.1e-107  46.88     crocetin glucosyltransferase, chloroplastic [Populus trichocarpa] >PNT29916.1 hy...

Match Name                     E-value   Identity  Description
tr|A0A1S3AYM5|A0A1S3AYM5_CUCME  1.7e-201  77.71     crocetin glucosyltransferase, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103484...
tr|A0A0A0L890|A0A0A0L890_CUCSA  6.7e-142  74.11     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G172390 PE=4 SV=1
tr|A0A2P4IMI9|A0A2P4IMI9_QUESU  2.3e-110  47.30     Glycosyltransferase OS=Quercus suber OX=58331 GN=CFP56_58469 PE=3 SV=1
tr|A0A2I4GJH4|A0A2I4GJH4_9ROSI  6.4e-108  46.67     Glycosyltransferase OS=Juglans regia OX=51240 GN=LOC109008422 PE=3 SV=1
tr|A0A2K1ZXB2|A0A2K1ZXB2_POPTR  1.4e-107  46.88     Glycosyltransferase OS=Populus trichocarpa OX=3694 GN=POPTR_006G055600v3 PE=3 SV...

Match Name             E-value   Identity  Description
sp|F8WKW0|UGT1_GARJA   2.8e-106  45.13     Crocetin glucosyltransferase, chloroplastic OS=Gardenia jasminoides OX=114476 GN...
sp|O23406|U75D1_ARATH  4.1e-94   39.38     UDP-glycosyltransferase 75D1 OS=Arabidopsis thaliana OX=3702 GN=UGT75D1 PE=2 SV=...
sp|Q9ZR25|5GT_VERHY    6.4e-87   40.88     Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase OS=Verbena hybrida OX=76714 ...
sp|Q9ZVY5|U75B2_ARATH  4.1e-86   40.04     UDP-glycosyltransferase 75B2 OS=Arabidopsis thaliana OX=3702 GN=UGT75B2 PE=2 SV=...
sp|Q9LR44|U75B1_ARATH  4.6e-85   39.38     UDP-glycosyltransferase 75B1 OS=Arabidopsis thaliana OX=3702 GN=UGT75B1 PE=1 SV=...

Match Name   E-value  Identity  Description
AT4G15550.1  2.3e-95  39.38     indole-3-acetate beta-D-glucosyltransferase
AT1G05530.1  2.3e-87  40.04     UDP-glucosyl transferase 75B2
AT1G05560.1  2.5e-86  39.38     UDP-glucosyltransferase 75B1
AT4G14090.1  9.6e-86  39.15     UDP-Glycosyltransferase superfamily protein
AT2G43820.1  6.5e-58  33.05     UDP-glucosyltransferase 74F2

The following terms have been associated with this gene:

Vocabulary: Molecular Function
Term        Definition
GO:0016758  transferase activity, transferring hexosyl groups

Vocabulary: Biological Process
Term        Definition
GO:0008152  metabolic process

Vocabulary: INTERPRO
Term       Definition
IPR035595  UDP_glycos_trans_CS
IPR002213  UDP_glucos_trans

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008152      metabolic process
biological_process  GO:0009813      flavonoid biosynthetic process
biological_process  GO:0052696      flavonoid glucuronidation
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0043231      intracellular membrane-bounded organelle
molecular_function  GO:0016740      transferase activity
molecular_function  GO:0016757      transferase activity, transferring glycosyl groups
molecular_function  GO:0016758      transferase activity, transferring hexosyl groups
molecular_function  GO:0080043      quercetin 3-O-glucosyltransferase activity
molecular_function  GO:0080044      quercetin 7-O-glucosyltransferase activity
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0035251      UDP-glucosyltransferase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C05G088810.1  Cla97C05G088810.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term   IPR Description                                 Source       Source Term         Source Description                             Alignment
None       No IPR available                                GENE3D       G3DSA:3.40.50.2000  -                                              coord: 261..447; e-value: 1.3E-115; score: 389.1
None       No IPR available                                GENE3D       G3DSA:3.40.50.2000  -                                              coord: 448..457; e-value: 1.3E-115; score: 389.1
                                                                                                                                           coord: 17..260; e-value: 1.3E-115; score: 389.1
None       No IPR available                                PANTHER      PTHR11926           GLUCOSYL/GLUCURONOSYL TRANSFERASES             coord: 4..466
None       No IPR available                                PANTHER      PTHR11926:SF767     SUBFAMILY NOT NAMED                            coord: 4..466
None       No IPR available                                SUPERFAMILY  SSF53756            UDP-Glycosyltransferase/glycogen phosphorylase  coord: 2..464
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase        PFAM         PF00201             UDPGT                                          coord: 268..395; e-value: 1.4E-22; score: 80.0
IPR035595  UDP-glycosyltransferase family, conserved site  PROSITE      PS00375             UDPGT                                          coord: 340..383

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene             Organism                   Block
Cla97C05G088810  Silver-seed gourd          carwmbB0720
Cla97C05G088810  Silver-seed gourd          carwmbB1100
Cla97C05G088810  Cucumber (Gy14) v2         cgybwmbB196
Cla97C05G088810  Cucurbita maxima (Rimu)    cmawmbB263
Cla97C05G088810  Cucurbita maxima (Rimu)    cmawmbB856
Cla97C05G088810  Cucurbita moschata (Rifu)  cmowmbB243
Cla97C05G088810  Cucurbita moschata (Rifu)  cmowmbB833