Cla97C06G112540 (gene) Watermelon (97103) v2

Name: Cla97C06G112540
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: Glycosyltransferase
Location: Cla97Chr06 : 3589712 .. 3591073 (+)
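
The Location field gives the chromosome, start, end and strand of the feature. As a minimal sketch, assuming the coordinates are 1-based and inclusive (as is conventional for genome-browser records), the span can be parsed and its length computed as follows. The function names `parse_location` and `feature_length` are hypothetical helpers introduced for illustration only.

```python
import re


def parse_location(text):
    """Split 'SeqId : start .. end (strand)' into (seqid, start, end, strand)."""
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def feature_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


seqid, start, end, strand = parse_location("Cla97Chr06 : 3589712 .. 3591073 (+)")
print(seqid, strand, feature_length(start, end))  # Cla97Chr06 + 1362
```

Under that assumption the feature spans 1362 bp, which is a multiple of three (454 codons including the stop codon).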
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGAAACCACCACTTTCTTATTGTATGTTTCCCTTCTCAAGGACACATAAACCCTTCCCTTCAACTCGCCAAGCAACTTACAACCCTAAACGTTGAAGTCACCTTCGCCACCACTGTCGTTGCCGCCCGACGCATGAACATAACCCAAAAAATCCCTTCACCACCAACCTTATCTTTTGCGACTTTCTCCGACGGTTGCGACGACGAAAATCAAACCAACTCCGACTTCAACCACTACGTCTCCGAGCTCAAACGCTGCGGTTCCCAATCTCTAACCAATCTCATCACTTCAGCCCGCGGGCCCGGCCGCCGTCCATTCACTTGTGTAATCTACAGCCTCCTTCTCAATTGGGCAGCTGAACTTGCCACTTCTTTTAATATCCCTTCAGCTCTTTTCTCTGCTCAACCTGCCACTGTTTTAGCTTTGTATTACTATTACTTTCATGGCTTTGGGGATGAAATTATCAACAAACTCCAAAGAAACAACCCCTCTTTATCCATAAAATTACCAGGTTTGCCATTGTTTAAAAGTCATGATATGCCTTCTTTTTTCTCTCCCTCTGGCCGAGATGCATTCATTATCCCTCTAATGAGAGAGCAAATGGAATTTCTTGGTCAACAAAAAAGGCCAACAAAAGTTTTAGTCAACACATTTGATGCTTTGGAAAATGAAGCCTTGAGAGCAATTAATGAGTTGAAAATGGTAGGAATTGGACCGTTGATTAGTGAATTGCATGGTGATTTATTTCAATTATCGAATGAAGATTATTATATTGAATGGTTGAATTCCAAGGCTAATTCTTCGGTTGTTTATTTATCTTTTGGGTCAATTTGTGTGTTGTCTAAAGAACAAGAGGAGGAAATTTTCTATGGTTTATTAGAAAGTGAGTATCAATTTTTGTGGGTAATGAGATCCAAAAATGATGAAGAAGAGAAGAAGTGGAAGGAATTGGTAGAAGGGAAAGGAAGAATTGTGGGTTGGTGTAGACAAATTGAAGTATTGAAACATCCTTCATTGGGTTGTTTTATTACACATTGTGGTTGGAATTCAACATTAGAAAGCTTGAGTTTTGGTGTGCCAATGGTGGGTTTTCCACAACAAATAGATCAAGCTACTAATGCAAAGCTTGTAGAAGATGTGTGGAAGATGGGAGTGAGAGTGAAGGTTAATTCAGAAGGAATTGTGGAAAGGGAAGAAATTAGGAGATGTGTGGATTTGGTAATGGAGAGGAAAAATGGAGAAAAAGGAGACATTGAGAAGAATGTTAAGAAGTGGAAAGAATTGGCTTGGGAGGCTATCAATGAAGGTGGATCCTCAATTTTCAATCTTGAGAACTTTGTTGATGAGATTGATGGGTGA

mRNA sequence

ATGAGAAACCACCACTTTCTTATTGTATGTTTCCCTTCTCAAGGACACATAAACCCTTCCCTTCAACTCGCCAAGCAACTTACAACCCTAAACGTTGAAGTCACCTTCGCCACCACTGTCGTTGCCGCCCGACGCATGAACATAACCCAAAAAATCCCTTCACCACCAACCTTATCTTTTGCGACTTTCTCCGACGGTTGCGACGACGAAAATCAAACCAACTCCGACTTCAACCACTACGTCTCCGAGCTCAAACGCTGCGGTTCCCAATCTCTAACCAATCTCATCACTTCAGCCCGCGGGCCCGGCCGCCGTCCATTCACTTGTGTAATCTACAGCCTCCTTCTCAATTGGGCAGCTGAACTTGCCACTTCTTTTAATATCCCTTCAGCTCTTTTCTCTGCTCAACCTGCCACTGTTTTAGCTTTGTATTACTATTACTTTCATGGCTTTGGGGATGAAATTATCAACAAACTCCAAAGAAACAACCCCTCTTTATCCATAAAATTACCAGGTTTGCCATTGTTTAAAAGTCATGATATGCCTTCTTTTTTCTCTCCCTCTGGCCGAGATGCATTCATTATCCCTCTAATGAGAGAGCAAATGGAATTTCTTGGTCAACAAAAAAGGCCAACAAAAGTTTTAGTCAACACATTTGATGCTTTGGAAAATGAAGCCTTGAGAGCAATTAATGAGTTGAAAATGGTAGGAATTGGACCGTTGATTAGTGAATTGCATGGTGATTTATTTCAATTATCGAATGAAGATTATTATATTGAATGGTTGAATTCCAAGGCTAATTCTTCGGTTGTTTATTTATCTTTTGGGTCAATTTGTGTGTTGTCTAAAGAACAAGAGGAGGAAATTTTCTATGGTTTATTAGAAAGTGAGTATCAATTTTTGTGGGTAATGAGATCCAAAAATGATGAAGAAGAGAAGAAGTGGAAGGAATTGGTAGAAGGGAAAGGAAGAATTGTGGGTTGGTGTAGACAAATTGAAGTATTGAAACATCCTTCATTGGGTTGTTTTATTACACATTGTGGTTGGAATTCAACATTAGAAAGCTTGAGTTTTGGTGTGCCAATGGTGGGTTTTCCACAACAAATAGATCAAGCTACTAATGCAAAGCTTGTAGAAGATGTGTGGAAGATGGGAGTGAGAGTGAAGGTTAATTCAGAAGGAATTGTGGAAAGGGAAGAAATTAGGAGATGTGTGGATTTGGTAATGGAGAGGAAAAATGGAGAAAAAGGAGACATTGAGAAGAATGTTAAGAAGTGGAAAGAATTGGCTTGGGAGGCTATCAATGAAGGTGGATCCTCAATTTTCAATCTTGAGAACTTTGTTGATGAGATTGATGGGTGA

Coding sequence (CDS)

ATGAGAAACCACCACTTTCTTATTGTATGTTTCCCTTCTCAAGGACACATAAACCCTTCCCTTCAACTCGCCAAGCAACTTACAACCCTAAACGTTGAAGTCACCTTCGCCACCACTGTCGTTGCCGCCCGACGCATGAACATAACCCAAAAAATCCCTTCACCACCAACCTTATCTTTTGCGACTTTCTCCGACGGTTGCGACGACGAAAATCAAACCAACTCCGACTTCAACCACTACGTCTCCGAGCTCAAACGCTGCGGTTCCCAATCTCTAACCAATCTCATCACTTCAGCCCGCGGGCCCGGCCGCCGTCCATTCACTTGTGTAATCTACAGCCTCCTTCTCAATTGGGCAGCTGAACTTGCCACTTCTTTTAATATCCCTTCAGCTCTTTTCTCTGCTCAACCTGCCACTGTTTTAGCTTTGTATTACTATTACTTTCATGGCTTTGGGGATGAAATTATCAACAAACTCCAAAGAAACAACCCCTCTTTATCCATAAAATTACCAGGTTTGCCATTGTTTAAAAGTCATGATATGCCTTCTTTTTTCTCTCCCTCTGGCCGAGATGCATTCATTATCCCTCTAATGAGAGAGCAAATGGAATTTCTTGGTCAACAAAAAAGGCCAACAAAAGTTTTAGTCAACACATTTGATGCTTTGGAAAATGAAGCCTTGAGAGCAATTAATGAGTTGAAAATGGTAGGAATTGGACCGTTGATTAGTGAATTGCATGGTGATTTATTTCAATTATCGAATGAAGATTATTATATTGAATGGTTGAATTCCAAGGCTAATTCTTCGGTTGTTTATTTATCTTTTGGGTCAATTTGTGTGTTGTCTAAAGAACAAGAGGAGGAAATTTTCTATGGTTTATTAGAAAGTGAGTATCAATTTTTGTGGGTAATGAGATCCAAAAATGATGAAGAAGAGAAGAAGTGGAAGGAATTGGTAGAAGGGAAAGGAAGAATTGTGGGTTGGTGTAGACAAATTGAAGTATTGAAACATCCTTCATTGGGTTGTTTTATTACACATTGTGGTTGGAATTCAACATTAGAAAGCTTGAGTTTTGGTGTGCCAATGGTGGGTTTTCCACAACAAATAGATCAAGCTACTAATGCAAAGCTTGTAGAAGATGTGTGGAAGATGGGAGTGAGAGTGAAGGTTAATTCAGAAGGAATTGTGGAAAGGGAAGAAATTAGGAGATGTGTGGATTTGGTAATGGAGAGGAAAAATGGAGAAAAAGGAGACATTGAGAAGAATGTTAAGAAGTGGAAAGAATTGGCTTGGGAGGCTATCAATGAAGGTGGATCCTCAATTTTCAATCTTGAGAACTTTGTTGATGAGATTGATGGGTGA

Protein sequence

MRNHHFLIVCFPSQGHINPSLQLAKQLTTLNVEVTFATTVVAARRMNITQKIPSPPTLSFATFSDGCDDENQTNSDFNHYVSELKRCGSQSLTNLITSARGPGRRPFTCVIYSLLLNWAAELATSFNIPSALFSAQPATVLALYYYYFHGFGDEIINKLQRNNPSLSIKLPGLPLFKSHDMPSFFSPSGRDAFIIPLMREQMEFLGQQKRPTKVLVNTFDALENEALRAINELKMVGIGPLISELHGDLFQLSNEDYYIEWLNSKANSSVVYLSFGSICVLSKEQEEEIFYGLLESEYQFLWVMRSKNDEEEKKWKELVEGKGRIVGWCRQIEVLKHPSLGCFITHCGWNSTLESLSFGVPMVGFPQQIDQATNAKLVEDVWKMGVRVKVNSEGIVEREEIRRCVDLVMERKNGEKGDIEKNVKKWKELAWEAINEGGSSIFNLENFVDEIDG
BLAST of Cla97C06G112540 vs. NCBI nr
Match: XP_004140604.2 (PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Cucumis sativus] >KGN46579.1 hypothetical protein Csa_6G109740 [Cucumis sativus])

HSP 1 Score: 753.1 bits (1943), Expect = 5.7e-214
Identity = 374/456 (82.02%), Positives = 411/456 (90.13%), Query Frame = 0

Query: 1   MRNHHFLIVCFPSQGHINPSLQLAKQLTTLNVEVTFATTVVAARRMNITQKIPSPPTLSF 60
           MRNHHFLIVCFPSQG+INPSLQLA +LT+LN+EVTFATTV A+RRM ITQ+I SP TLSF
Sbjct: 1   MRNHHFLIVCFPSQGYINPSLQLANKLTSLNIEVTFATTVTASRRMKITQQISSPSTLSF 60

Query: 61  ATFSDGCDDENQTNSDFNHYVSELKRCGSQSLTNLITSARGPGRRPFTCVIYSLLLNWAA 120
           ATFSDG DDEN   SDFNH+ SELKRCGSQSLT+LITS R   RRPFT VIYSLLLNWAA
Sbjct: 61  ATFSDGFDDENHKTSDFNHFFSELKRCGSQSLTDLITSFRDRHRRPFTFVIYSLLLNWAA 120

Query: 121 ELATSFNIPSALFSAQPATVLALYYYYFHGFGDEIINKLQRNNP-SLSIKLPGLP-LFKS 180
           ++ATSFNIPSALFSAQPATVLALYYYYFHGF DEI NKLQ + P SLSI+LPGLP LFKS
Sbjct: 121 DVATSFNIPSALFSAQPATVLALYYYYFHGFEDEITNKLQNDGPSSLSIELPGLPLLFKS 180

Query: 181 HDMPSFFSPSGRDAFIIPLMREQMEFLGQQKRPTKVLVNTFDALENEALRAINELKMVGI 240
           H+MPSFFSPSG+ AFIIP MREQMEFLGQQK+P KVLVNTF ALENEALRAI+EL+M+ I
Sbjct: 181 HEMPSFFSPSGQHAFIIPWMREQMEFLGQQKQPIKVLVNTFHALENEALRAIHELEMIAI 240

Query: 241 GPLISELHGDLFQLSNEDYYIEWLNSKANSSVVYLSFGSICVLSKEQEEEIFYGLLESEY 300
           GPLIS+  GDLFQ+SNEDYY+EWLNSK+N SVVYLSFGSICVLSKEQEEEI YGL ES Y
Sbjct: 241 GPLISQFRGDLFQVSNEDYYMEWLNSKSNCSVVYLSFGSICVLSKEQEEEILYGLFESGY 300

Query: 301 QFLWVMRSKNDEEEKKWKELVEGKGRIVGWCRQIEVLKHPSLGCFITHCGWNSTLESLSF 360
            FLWVMRSK+DE+E+KWKELVEGKG+IV WCRQIEVLKHPSLGCF++HCGWNSTLESLSF
Sbjct: 301 PFLWVMRSKSDEDEEKWKELVEGKGKIVSWCRQIEVLKHPSLGCFMSHCGWNSTLESLSF 360

Query: 361 GVPMVGFPQQIDQATNAKLVEDVWKMGVRVKVNSEGIVEREEIRRCVDLVMERK--NGEK 420
           G+PMV FPQQ+DQ TNAKLVEDVWKMGVRVK N EGIVEREEIRRC+DLVM RK  NGE+
Sbjct: 361 GLPMVAFPQQVDQPTNAKLVEDVWKMGVRVKGNLEGIVEREEIRRCLDLVMNRKYINGER 420

Query: 421 GDIEKNVKKWKELAWEAINEGGSSIFNLENFVDEID 453
            + EKNV+KWK+LAWEA++EGGSSI NL NFVDEID
Sbjct: 421 EETEKNVEKWKKLAWEAMDEGGSSILNLANFVDEID 456
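
Each HSP summary reports identities and positives as a count over the alignment length, followed by a percentage. A minimal sketch of parsing that line and recomputing the percentages is given below; `parse_hsp_stats` is a hypothetical helper, and the pattern accepts either spelling of the positives label since report renderings vary.

```python
import re

# Matches the "Identity = a/n (x%), Positives = b/n (y%)" summary line.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from an HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identities, align_len, positives = int(m.group(1)), int(m.group(2)), int(m.group(4))
    return {
        "identities": identities,
        "positives": positives,
        "align_len": align_len,
        # Percentages are taken over the alignment length, gaps included.
        "pct_identity": round(100 * identities / align_len, 2),
        "pct_positives": round(100 * positives / align_len, 2),
    }


stats = parse_hsp_stats(
    "Identity = 374/456 (82.02%), Positives = 411/456 (90.13%), Query Frame = 0"
)
print(stats["pct_identity"], stats["pct_positives"])  # 82.02 90.13
```

The recomputed values match the percentages printed in the report for the Cucumis sativus hit, which confirms that the denominator is the alignment length (456 columns) rather than the query length.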

BLAST of Cla97C06G112540 vs. NCBI nr
Match: XP_008458143.1 (PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Cucumis melo])

HSP 1 Score: 743.4 bits (1918), Expect = 4.5e-211
Identity = 377/460 (81.96%), Positives = 410/460 (89.13%), Query Frame = 0

Query: 1   MRNHHFLIVCFPSQGHINPSLQLAKQLTTLNVEVTFATTVVAARRMNITQKIPSPPTLSF 60
           MRNHHFLIVCFPSQG INPSLQLA +LT+LN+EVTFATTV A+RRMNITQ+IPSP TLSF
Sbjct: 1   MRNHHFLIVCFPSQGCINPSLQLANKLTSLNIEVTFATTVTASRRMNITQQIPSPSTLSF 60

Query: 61  ATFSDGCDDENQTNSDFNHYVSELKRCGSQSLTNLITSARGP--GRRPFTCVIYSLLLNW 120
           ATFSDG DDEN   SDFNHY SELKRCGSQSLT+LI S R     RRPFT +IYSLLLNW
Sbjct: 61  ATFSDGFDDENHKTSDFNHYFSELKRCGSQSLTDLIASLRDDRHRRRPFTFLIYSLLLNW 120

Query: 121 AAELATSFNIPSALFSAQPATVLALYYYYFHGFGDEIINKLQRNNPSLSIKLPGLP-LFK 180
           AA++ATSFNIPSALFS QPATVLALYYYYFHGF DEI NKLQ + PSLSI+LPGLP LFK
Sbjct: 121 AADVATSFNIPSALFSTQPATVLALYYYYFHGFEDEITNKLQNDGPSLSIELPGLPLLFK 180

Query: 181 SHDMPSFFSPSGRDAFII-PLMREQMEFLGQQKRPTKVLVNTFDALENEALRAINELKMV 240
           SH+MPSFFSPS + A II PLMREQMEFL QQK+PTKVLVNTFDALENEALRAI+EL+M+
Sbjct: 181 SHEMPSFFSPSSQHASIITPLMREQMEFLSQQKKPTKVLVNTFDALENEALRAIHELEMI 240

Query: 241 GIGPLI-SELHGDLFQLSNEDYYIEWLNSKANSSVVYLSFGSICVLSKEQEEEIFYGLLE 300
            +GPLI +E  GDLFQ+SN DYY+EWLNSK+N SVVY+SFGSICVLSKEQEEEI YGLLE
Sbjct: 241 AVGPLINTEFRGDLFQVSNGDYYMEWLNSKSNFSVVYISFGSICVLSKEQEEEILYGLLE 300

Query: 301 SEYQFLWVMRSKNDEE-EKKWKELVEGKGRIVGWCRQIEVLKHPSLGCFITHCGWNSTLE 360
           S Y FLWV+RSKNDE+ E+KWKELVEGKGRIV WCRQIEVLKHPSLGCF++HCGWNSTLE
Sbjct: 301 SGYPFLWVIRSKNDEDREEKWKELVEGKGRIVSWCRQIEVLKHPSLGCFVSHCGWNSTLE 360

Query: 361 SLSFGVPMVGFPQQIDQATNAKLVEDVWKMGVRVKVNSEGIVEREEIRRCVDLVMERK-- 420
           SLSFG+PMV FPQQIDQ TNAKLVEDVWKMGVRVK N EGIVEREEIRRC+DLVM RK  
Sbjct: 361 SLSFGLPMVAFPQQIDQPTNAKLVEDVWKMGVRVKANLEGIVEREEIRRCLDLVMNRKDI 420

Query: 421 NGEKGDIEKNVKKWKELAWEAINEGGSSIFNLENFVDEID 453
           +GE+  IEKNV+KWKELAWEAINEGGSSI NL NFVDEID
Sbjct: 421 DGEREVIEKNVEKWKELAWEAINEGGSSILNLVNFVDEID 460

BLAST of Cla97C06G112540 vs. NCBI nr
Match: XP_023000094.1 (crocetin glucosyltransferase, chloroplastic-like [Cucurbita maxima])

HSP 1 Score: 609.0 bits (1569), Expect = 1.3e-170
Identity = 318/456 (69.74%), Positives = 368/456 (80.70%), Query Frame = 0

Query: 1   MRNHHFLIVCFPSQGHINPSLQLAKQLTTLNVEVTFATTVVAARRMNITQKIPSPPTLSF 60
           MRNHHFL+VCFPSQGHINPSLQLAK+L  LN+EVTFATT+ AARRMN  Q+ P+   LSF
Sbjct: 1   MRNHHFLLVCFPSQGHINPSLQLAKRLIHLNIEVTFATTIAAARRMNNAQQTPTTKGLSF 60

Query: 61  ATFSDGCDDEN-QTNSDFNHYVSELKRCGSQSLTNLITSARGPGRRPFTCVIYSLLLNWA 120
           ATFSDG DD+N   +++  H+ SELKRCGSQSLTNLITSA   G RPFT +IY LLLNWA
Sbjct: 61  ATFSDGFDDDNLNLSANITHFFSELKRCGSQSLTNLITSAANKG-RPFTFLIYGLLLNWA 120

Query: 121 AELATSFNIPSALFSAQPATVLALYYYYFHGFGDEIINKLQRNNPSLSIKLPGLPLFKSH 180
           A++ATSFNIPSALF AQPATVLALY++YFHG+ + I NKLQ   PS  I+LP LPLF + 
Sbjct: 121 ADIATSFNIPSALFFAQPATVLALYFHYFHGYEESICNKLQ--TPSSCIELPNLPLFTTR 180

Query: 181 DMPSFFSPSGRDAFIIPLMREQMEFLGQQKRPTKVLVNTFDALENEALRAINELKMVGIG 240
           DMPSFFSP G  AFIIP MREQ+EFLG Q +P KVLVNTFDALE +ALRAI+ELK++ IG
Sbjct: 181 DMPSFFSPCGPHAFIIPPMREQLEFLGGQTQP-KVLVNTFDALEADALRAIDELKIIAIG 240

Query: 241 PLISELH--GDLFQLSNEDYYIEWLNSKANSSVVYLSFGSICVLSKEQEEEIFYGLLESE 300
           PLI   H  G+LFQ+S+ED YI WLNSKA  SVVY+SFGSICVL +EQE+E+ +GLLES 
Sbjct: 241 PLIPSSHDGGNLFQVSSED-YIGWLNSKAERSVVYVSFGSICVLCEEQEKELLHGLLESG 300

Query: 301 YQFLWVMRSKNDEEEKKWKELVEGKGRIVGWCRQIEVLKHPSLGCFITHCGWNSTLESLS 360
             FLWV+RS  DEE  K    V  KG+IV WCRQIEVLKHPS+GCF++HCGWNST+ESLS
Sbjct: 301 RPFLWVVRSNKDEERLK---KVGMKGKIVSWCRQIEVLKHPSVGCFVSHCGWNSTIESLS 360

Query: 361 FGVPMVGFPQQIDQATNAKLVEDVWKMGVRVKVNSEGIVEREEIRRCVDLVMERKNGEKG 420
           FGV +VGFPQQIDQ TNAKLVED+WK GVRVK NSEG+VER EIRRC+DLVM  +     
Sbjct: 361 FGVAVVGFPQQIDQMTNAKLVEDLWKTGVRVKGNSEGVVERGEIRRCLDLVMGNE----- 420

Query: 421 DIEKNVKKWKELAWEAINEGGSSIFNLENFVDEIDG 454
           +IE+NVK WKEL  +A+ EGGSS  NL+ FV EIDG
Sbjct: 421 EIERNVKVWKELGRQAMEEGGSSTLNLQAFVAEIDG 443

BLAST of Cla97C06G112540 vs. NCBI nr
Match: XP_022964378.1 (crocetin glucosyltransferase, chloroplastic-like [Cucurbita moschata])

HSP 1 Score: 593.6 bits (1529), Expect = 5.7e-166
Identity = 314/458 (68.56%), Positives = 366/458 (79.91%), Query Frame = 0

Query: 1   MRNHHFLIVCFPSQGHINPSLQLAKQLTTLNVEVTFATTVVAARRM-NITQKIPSPPTLS 60
           MRN HFL+VCFPSQG+INPSLQLAK+L  LN++VTFATT+ AARRM N  Q+ P+   LS
Sbjct: 1   MRNPHFLLVCFPSQGYINPSLQLAKRLIHLNIDVTFATTIAAARRMNNNAQQTPTTQGLS 60

Query: 61  FATFSDGCDDEN-QTNSDFNHYVSELKRCGSQSLTNLITSARGPGRRPFTCVIYSLLLNW 120
           FATFSDG DD+N + +++  H+ SELKRCGSQSLT+LITSA   G RPFT +IY LLLNW
Sbjct: 61  FATFSDGFDDDNLKLSANITHFFSELKRCGSQSLTHLITSAANKG-RPFTFLIYGLLLNW 120

Query: 121 AAELATSFNIPSALFSAQPATVLALYYYYFHGFGDEIINKLQRNNPSLSIKLPGLPLFKS 180
           AA++ATSFNIPSALF AQPATVLALY++YFHG+ + I NKLQ   PS  I+LP LPLF +
Sbjct: 121 AADVATSFNIPSALFFAQPATVLALYFHYFHGYEEPICNKLQ--TPSSCIELPNLPLFTT 180

Query: 181 HDMPSFFSPSGRDAFIIPLMREQMEFLGQQKRPTKVLVNTFDALENEALRAINELKMVGI 240
           HDMPSFFSP G  AFIIP MREQ+EFLG Q R +KVLVNTFD LE +ALRAI+ELKM+ I
Sbjct: 181 HDMPSFFSPCGPHAFIIPPMREQLEFLGGQTR-SKVLVNTFDTLETDALRAIDELKMIAI 240

Query: 241 GPLISELH---GDLFQLSNEDYYIEWLNSKANSSVVYLSFGSICVLSKEQEEEIFYGLLE 300
           GPLI   H   G+LF +S+ED YI WL+SKA  SVVY+SFGSIC L +EQEEE+  GLLE
Sbjct: 241 GPLIPSSHDDGGNLFHVSSED-YIGWLSSKAERSVVYVSFGSICELCEEQEEELLNGLLE 300

Query: 301 SEYQFLWVMRSKNDEEEKKWKELVEGKGRIVGWCRQIEVLKHPSLGCFITHCGWNSTLES 360
           S   FLWV+RS +DEE  K    V  KG+IV WCRQIEVLKHPS+GCF+THCGWNST+ES
Sbjct: 301 SGRPFLWVVRSNHDEERLK---KVGMKGKIVSWCRQIEVLKHPSVGCFVTHCGWNSTIES 360

Query: 361 LSFGVPMVGFPQQIDQATNAKLVEDVWKMGVRVKVNSEGIVEREEIRRCVDLVMERKNGE 420
           LS GV +VGFPQQIDQ TNAKLVED+WK GVRVK NSEG+VER EIRRC+DLVME +   
Sbjct: 361 LSLGVAVVGFPQQIDQMTNAKLVEDLWKTGVRVKGNSEGVVERGEIRRCLDLVMENE--- 420

Query: 421 KGDIEKNVKKWKELAWEAINEGGSSIFNLENFVDEIDG 454
             +IE+NVK WKEL  +A+ EGGSS  NL+ FV EIDG
Sbjct: 421 --EIERNVKVWKELGRQAVEEGGSSTSNLQAFVAEIDG 445

BLAST of Cla97C06G112540 vs. NCBI nr
Match: XP_023514979.1 (crocetin glucosyltransferase, chloroplastic-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 590.9 bits (1522), Expect = 3.7e-165
Identity = 309/457 (67.61%), Positives = 364/457 (79.65%), Query Frame = 0

Query: 1   MRNHHFLIVCFPSQGHINPSLQLAKQLTTLNVEVTFATTVVAARRMNITQKIPSPPTLSF 60
           MRN HFL+VCFPSQG+INPSLQLAK+L  LN++VTFATT+ AARRMN  Q+ P+   LSF
Sbjct: 1   MRNPHFLLVCFPSQGYINPSLQLAKRLIHLNIDVTFATTIAAARRMNNAQQTPTTKGLSF 60

Query: 61  ATFSDGCDDEN-QTNSDFNHYVSELKRCGSQSLTNLITSARGPGRRPFTCVIYSLLLNWA 120
           ATFSDG DD+N + +++  H+ SELKRCGSQSLTNLITSA   G RPFT +IY LLLNWA
Sbjct: 61  ATFSDGFDDDNLKLSANITHFFSELKRCGSQSLTNLITSAANKG-RPFTFLIYGLLLNWA 120

Query: 121 AELATSFNIPSALFSAQPATVLALYYYYFHGFGDEIINKLQRNNPSLSIKLPGLPLFKSH 180
           A++ATSFNIPSALF AQPATVLALY++YFHG+ + I NKLQ   PS  I+LP LPLF + 
Sbjct: 121 ADIATSFNIPSALFFAQPATVLALYFHYFHGYEEPICNKLQ--TPSSCIELPNLPLFTTR 180

Query: 181 DMPSFFSPSGRDAFIIPLMREQMEFLGQQKRPTKVLVNTFDALENEALRAINELKMVGIG 240
           DMPSFFSP G  AFIIP MREQ+EFLG Q R +KVLVNTFD LE +ALRAI+ELKM+ IG
Sbjct: 181 DMPSFFSPCGPHAFIIPPMREQLEFLGGQTR-SKVLVNTFDTLETDALRAIDELKMIAIG 240

Query: 241 PLISELH---GDLFQLSNEDYYIEWLNSKANSSVVYLSFGSICVLSKEQEEEIFYGLLES 300
           PLI   H   G+LF +S+ED YI WL+SKA  SVVY+SFGSIC L +EQEEE+  GLLES
Sbjct: 241 PLIPSSHDDGGNLFHVSSED-YIGWLSSKAERSVVYVSFGSICELCEEQEEEVLNGLLES 300

Query: 301 EYQFLWVMRSKNDEEEKKWKELVEGKGRIVGWCRQIEVLKHPSLGCFITHCGWNSTLESL 360
              F+WV+RS  DEE+ K    V  KG+IV WCRQIEVLKHPS+GCF++HCGWNST+ESL
Sbjct: 301 GRPFMWVVRSNYDEEKFK---KVGMKGKIVSWCRQIEVLKHPSVGCFVSHCGWNSTIESL 360

Query: 361 SFGVPMVGFPQQIDQATNAKLVEDVWKMGVRVKVNSEGIVEREEIRRCVDLVMERKNGEK 420
           S GV +VGFPQQIDQ TNAKLVED+WK GVRVK NSEG+VER EIRRC+DLVM  +    
Sbjct: 361 SLGVAVVGFPQQIDQMTNAKLVEDLWKTGVRVKGNSEGVVERGEIRRCLDLVMGNE---- 420

Query: 421 GDIEKNVKKWKELAWEAINEGGSSIFNLENFVDEIDG 454
            +IE+NVK WK+L  +A+ EGGSS  NL+ FV +IDG
Sbjct: 421 -EIERNVKVWKQLGRQAVEEGGSSTLNLQAFVSQIDG 444

BLAST of Cla97C06G112540 vs. TrEMBL
Match: tr|A0A0A0KCM4|A0A0A0KCM4_CUCSA (Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_6G109740 PE=3 SV=1)

HSP 1 Score: 753.1 bits (1943), Expect = 3.7e-214
Identity = 374/456 (82.02%), Positives = 411/456 (90.13%), Query Frame = 0

Query: 1   MRNHHFLIVCFPSQGHINPSLQLAKQLTTLNVEVTFATTVVAARRMNITQKIPSPPTLSF 60
           MRNHHFLIVCFPSQG+INPSLQLA +LT+LN+EVTFATTV A+RRM ITQ+I SP TLSF
Sbjct: 1   MRNHHFLIVCFPSQGYINPSLQLANKLTSLNIEVTFATTVTASRRMKITQQISSPSTLSF 60

Query: 61  ATFSDGCDDENQTNSDFNHYVSELKRCGSQSLTNLITSARGPGRRPFTCVIYSLLLNWAA 120
           ATFSDG DDEN   SDFNH+ SELKRCGSQSLT+LITS R   RRPFT VIYSLLLNWAA
Sbjct: 61  ATFSDGFDDENHKTSDFNHFFSELKRCGSQSLTDLITSFRDRHRRPFTFVIYSLLLNWAA 120

Query: 121 ELATSFNIPSALFSAQPATVLALYYYYFHGFGDEIINKLQRNNP-SLSIKLPGLP-LFKS 180
           ++ATSFNIPSALFSAQPATVLALYYYYFHGF DEI NKLQ + P SLSI+LPGLP LFKS
Sbjct: 121 DVATSFNIPSALFSAQPATVLALYYYYFHGFEDEITNKLQNDGPSSLSIELPGLPLLFKS 180

Query: 181 HDMPSFFSPSGRDAFIIPLMREQMEFLGQQKRPTKVLVNTFDALENEALRAINELKMVGI 240
           H+MPSFFSPSG+ AFIIP MREQMEFLGQQK+P KVLVNTF ALENEALRAI+EL+M+ I
Sbjct: 181 HEMPSFFSPSGQHAFIIPWMREQMEFLGQQKQPIKVLVNTFHALENEALRAIHELEMIAI 240

Query: 241 GPLISELHGDLFQLSNEDYYIEWLNSKANSSVVYLSFGSICVLSKEQEEEIFYGLLESEY 300
           GPLIS+  GDLFQ+SNEDYY+EWLNSK+N SVVYLSFGSICVLSKEQEEEI YGL ES Y
Sbjct: 241 GPLISQFRGDLFQVSNEDYYMEWLNSKSNCSVVYLSFGSICVLSKEQEEEILYGLFESGY 300

Query: 301 QFLWVMRSKNDEEEKKWKELVEGKGRIVGWCRQIEVLKHPSLGCFITHCGWNSTLESLSF 360
            FLWVMRSK+DE+E+KWKELVEGKG+IV WCRQIEVLKHPSLGCF++HCGWNSTLESLSF
Sbjct: 301 PFLWVMRSKSDEDEEKWKELVEGKGKIVSWCRQIEVLKHPSLGCFMSHCGWNSTLESLSF 360

Query: 361 GVPMVGFPQQIDQATNAKLVEDVWKMGVRVKVNSEGIVEREEIRRCVDLVMERK--NGEK 420
           G+PMV FPQQ+DQ TNAKLVEDVWKMGVRVK N EGIVEREEIRRC+DLVM RK  NGE+
Sbjct: 361 GLPMVAFPQQVDQPTNAKLVEDVWKMGVRVKGNLEGIVEREEIRRCLDLVMNRKYINGER 420

Query: 421 GDIEKNVKKWKELAWEAINEGGSSIFNLENFVDEID 453
            + EKNV+KWK+LAWEA++EGGSSI NL NFVDEID
Sbjct: 421 EETEKNVEKWKKLAWEAMDEGGSSILNLANFVDEID 456
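
The TrEMBL and Swiss-Prot match lines follow the UniProt FASTA header convention: a database prefix (`tr` or `sp`), accession and entry name separated by `|`, then the protein name and `OS` (organism), `OX` (NCBI taxonomy ID), `GN` (gene name), `PE` (protein existence level) and `SV` (sequence version) fields. A minimal sketch of splitting such a line into fields follows; `parse_uniprot_match` is a hypothetical helper, and it assumes the record is wrapped in parentheses as on this page.

```python
import re

KEYS = "OS|OX|GN|PE|SV"


def parse_uniprot_match(text):
    """Parse 'db|accession|entry (name OS=... OX=... GN=... PE=... SV=...)'."""
    m = re.search(r"\b(sp|tr)\|([^|]+)\|(\S+)\s+\((.*)\)\s*$", text)
    if m is None:
        raise ValueError(f"not a UniProt-style match line: {text!r}")
    db, accession, entry_name, desc = m.groups()
    # The protein name runs up to the first ' OS=' key.
    name, _, rest = desc.partition(" OS=")
    fields = {
        "db": db,
        "accession": accession,
        "entry_name": entry_name,
        "protein_name": name,
    }
    if rest:
        # Each value extends until the next KEY= token or the end of the text.
        pattern = rf"({KEYS})=(.*?)(?= (?:{KEYS})=|$)"
        for key, value in re.findall(pattern, "OS=" + rest):
            fields[key] = value
    return fields


hit = parse_uniprot_match(
    "Match: tr|A0A0A0KCM4|A0A0A0KCM4_CUCSA (Glycosyltransferase "
    "OS=Cucumis sativus OX=3659 GN=Csa_6G109740 PE=3 SV=1)"
)
print(hit["accession"], hit["OS"], hit["PE"])  # A0A0A0KCM4 Cucumis sativus 3
```

The `PE` value is useful when weighing hits: 1 means evidence at protein level, 2 evidence at transcript level, and 3 inferred from homology.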

BLAST of Cla97C06G112540 vs. TrEMBL
Match: tr|A0A1S3C7S0|A0A1S3C7S0_CUCME (Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103497668 PE=3 SV=1)

HSP 1 Score: 743.4 bits (1918), Expect = 3.0e-211
Identity = 377/460 (81.96%), Positives = 410/460 (89.13%), Query Frame = 0

Query: 1   MRNHHFLIVCFPSQGHINPSLQLAKQLTTLNVEVTFATTVVAARRMNITQKIPSPPTLSF 60
           MRNHHFLIVCFPSQG INPSLQLA +LT+LN+EVTFATTV A+RRMNITQ+IPSP TLSF
Sbjct: 1   MRNHHFLIVCFPSQGCINPSLQLANKLTSLNIEVTFATTVTASRRMNITQQIPSPSTLSF 60

Query: 61  ATFSDGCDDENQTNSDFNHYVSELKRCGSQSLTNLITSARGP--GRRPFTCVIYSLLLNW 120
           ATFSDG DDEN   SDFNHY SELKRCGSQSLT+LI S R     RRPFT +IYSLLLNW
Sbjct: 61  ATFSDGFDDENHKTSDFNHYFSELKRCGSQSLTDLIASLRDDRHRRRPFTFLIYSLLLNW 120

Query: 121 AAELATSFNIPSALFSAQPATVLALYYYYFHGFGDEIINKLQRNNPSLSIKLPGLP-LFK 180
           AA++ATSFNIPSALFS QPATVLALYYYYFHGF DEI NKLQ + PSLSI+LPGLP LFK
Sbjct: 121 AADVATSFNIPSALFSTQPATVLALYYYYFHGFEDEITNKLQNDGPSLSIELPGLPLLFK 180

Query: 181 SHDMPSFFSPSGRDAFII-PLMREQMEFLGQQKRPTKVLVNTFDALENEALRAINELKMV 240
           SH+MPSFFSPS + A II PLMREQMEFL QQK+PTKVLVNTFDALENEALRAI+EL+M+
Sbjct: 181 SHEMPSFFSPSSQHASIITPLMREQMEFLSQQKKPTKVLVNTFDALENEALRAIHELEMI 240

Query: 241 GIGPLI-SELHGDLFQLSNEDYYIEWLNSKANSSVVYLSFGSICVLSKEQEEEIFYGLLE 300
            +GPLI +E  GDLFQ+SN DYY+EWLNSK+N SVVY+SFGSICVLSKEQEEEI YGLLE
Sbjct: 241 AVGPLINTEFRGDLFQVSNGDYYMEWLNSKSNFSVVYISFGSICVLSKEQEEEILYGLLE 300

Query: 301 SEYQFLWVMRSKNDEE-EKKWKELVEGKGRIVGWCRQIEVLKHPSLGCFITHCGWNSTLE 360
           S Y FLWV+RSKNDE+ E+KWKELVEGKGRIV WCRQIEVLKHPSLGCF++HCGWNSTLE
Sbjct: 301 SGYPFLWVIRSKNDEDREEKWKELVEGKGRIVSWCRQIEVLKHPSLGCFVSHCGWNSTLE 360

Query: 361 SLSFGVPMVGFPQQIDQATNAKLVEDVWKMGVRVKVNSEGIVEREEIRRCVDLVMERK-- 420
           SLSFG+PMV FPQQIDQ TNAKLVEDVWKMGVRVK N EGIVEREEIRRC+DLVM RK  
Sbjct: 361 SLSFGLPMVAFPQQIDQPTNAKLVEDVWKMGVRVKANLEGIVEREEIRRCLDLVMNRKDI 420

Query: 421 NGEKGDIEKNVKKWKELAWEAINEGGSSIFNLENFVDEID 453
           +GE+  IEKNV+KWKELAWEAINEGGSSI NL NFVDEID
Sbjct: 421 DGEREVIEKNVEKWKELAWEAINEGGSSILNLVNFVDEID 460

BLAST of Cla97C06G112540 vs. TrEMBL
Match: tr|A0A2N9EWS8|A0A2N9EWS8_FAGSY (Glycosyltransferase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS7145 PE=3 SV=1)

HSP 1 Score: 479.2 bits (1232), Expect = 1.0e-131
Identity = 260/472 (55.08%), Positives = 330/472 (69.92%), Query Frame = 0

Query: 1   MRNHHFLIVCFPSQGHINPSLQLAKQLTTLNVEVTFATTVVAARRMNITQKIPSPPTLSF 60
           M   HFL+V FP+QGHINP+LQ AK+L +L   VTFAT+  A RRM  T+  P P  LSF
Sbjct: 1   MVQRHFLLVTFPAQGHINPALQFAKRLISLGAHVTFATSDSAYRRM--TESSPPPNGLSF 60

Query: 61  ATFSDGCDDENQTNSDFNHYVSELKRCGSQSLTNLITSARGPGRRPFTCVIYSLLLNWAA 120
           A+FSDG DD  +   D  +Y++ LK+ GS++LT+LI S+   G RPF C++Y+LLL WAA
Sbjct: 61  ASFSDGYDDGVRPEDDPENYLNVLKQNGSKTLTDLIVSSANQG-RPFNCLVYTLLLPWAA 120

Query: 121 ELATSFNIPSALFSAQPATVLALYYYYFHGFGDEIINKLQRNNPSLSIKLPGLPLFKSHD 180
           ++A   ++PSAL   QPA VL +YYYYF+GF + I       +PS  I+LPGLPL  S D
Sbjct: 121 DVARELHVPSALLWIQPAMVLDIYYYYFNGFANVISE--DTKDPSCLIQLPGLPLLTSRD 180

Query: 181 MPSFFSPSGRDAFIIPLMREQMEFLGQQKRPTKVLVNTFDALENEALRAINELKMVGIGP 240
           +PSF   S   AF +P  + Q+E L +++   +VLVN+FDALE EAL AI +L +VG+GP
Sbjct: 181 LPSFLLASNTHAFALPTFQAQIEAL-EKETHLRVLVNSFDALEPEALTAIKKLNLVGVGP 240

Query: 241 LI------------SELHGDLFQLSNEDYYIEWLNSKANSSVVYLSFGSICVLSKEQEEE 300
           LI                GDL Q S + YYIEWLNSKA SSVVY+SFGS+ VL+K Q EE
Sbjct: 241 LIPSAFLDGKDPSDKSFGGDLIQASKDHYYIEWLNSKAESSVVYVSFGSLSVLTKPQMEE 300

Query: 301 IFYGLLESEYQFLWVMR--SKNDEEEKKWKEL-----VEGKGRIVGWCRQIEVLKHPSLG 360
           +  GLL+S Y FLWV+R   +N +EEK+ ++L     +E KG IV WC Q+EVL HPSLG
Sbjct: 301 LARGLLDSGYSFLWVIRDTDRNGKEEKEIEKLSCREELEEKGMIVTWCSQVEVLSHPSLG 360

Query: 361 CFITHCGWNSTLESLSFGVPMVGFPQQIDQATNAKLVEDVWKMGVRVKVNSEGIVEREEI 420
           CF+THCGWNS+LESL  GVP++GFPQ  DQ TNAKL++DVWK GVRV VN EGIVE +EI
Sbjct: 361 CFVTHCGWNSSLESLVSGVPVIGFPQWTDQGTNAKLIQDVWKTGVRVNVNEEGIVESDEI 420

Query: 421 RRCVDLVMERKNGEKG-DIEKNVKKWKELAWEAINEGGSSIFNLENFVDEID 453
           +RC++LVM   +GE G ++ KN KKWKELA EA  EGGSS  NL+ FVDEI+
Sbjct: 421 KRCLELVM--GDGEGGEEMRKNAKKWKELAREAAKEGGSSYNNLKAFVDEIE 464

BLAST of Cla97C06G112540 vs. TrEMBL
Match: tr|A0A2I4GXB6|A0A2I4GXB6_9ROSI (Glycosyltransferase OS=Juglans regia OX=51240 GN=LOC109011704 PE=3 SV=1)

HSP 1 Score: 476.1 bits (1224), Expect = 8.8e-131
Identity = 260/470 (55.32%), Positives = 325/470 (69.15%), Query Frame = 0

Query: 1   MRNHHFLIVCFPSQGHINPSLQLAKQLTTLNVEVTFATTVVAARRMNITQKIPSPPTLSF 60
           M NHHFL+V FP+QGHINP+LQ AK+L  L   VTFATTV A RRMN   K P+P  LSF
Sbjct: 1   MVNHHFLLVIFPAQGHINPALQFAKRLIRLGAHVTFATTVAAHRRMN---KSPTPDGLSF 60

Query: 61  ATFSDGCDDE--NQTNSDFNHYVSELKRCGSQSLTNLITSARGPGRRPFTCVIYSLLLNW 120
           ATFSDG DD      + DF  Y+SELKR GSQ+LT+L+ S+   G R FTC++YS+LL W
Sbjct: 61  ATFSDGYDDGGFKHGDHDFVDYMSELKRRGSQTLTDLVVSSANKG-RTFTCLVYSILLPW 120

Query: 121 AAELATSFNIPSALFSAQPATVLALYYYYFHGFGDEIINKLQRNNPSLSIKLPGLPLFKS 180
           A ++A   ++ SAL   QPATVL +YYYYF+G+GD I N     +PS SI+LPGLP   S
Sbjct: 121 ACDVARELHLLSALLWIQPATVLDIYYYYFNGYGDVIRN---IPDPSFSIELPGLPSLTS 180

Query: 181 HDMPSFFSPSGRDAFIIPLMREQMEFLGQQKRPTKVLVNTFDALENEALRAINELKMVGI 240
            D+PSF +      F +PL +E  E LG++  P +VLVNTFDALE EALRAI      GI
Sbjct: 181 RDLPSFMADLNTHTFALPLFQEHFEELGKESNP-RVLVNTFDALEPEALRAIERFSFTGI 240

Query: 241 GPLI------------SELHGDLFQLSNEDYYIEWLNSKANSSVVYLSFGSICVLSKEQE 300
           GPLI            +   GDLFQ + +  YIEWLNSK +SSV+Y+SFGS+ +L+K Q 
Sbjct: 241 GPLIPSAFLDGKDPSDTAFGGDLFQGATD--YIEWLNSKPSSSVIYVSFGSLSLLAKNQM 300

Query: 301 EEIFYGLLESEYQFLWVMRSKNDEEEKK-----WKELVEGKGRIVGWCRQIEVLKHPSLG 360
           EEI  GLL+    FLWV R+  + EEK+      +E +E KG+ V WC Q+EVL HPS+ 
Sbjct: 301 EEIARGLLDYGCPFLWVKRANENGEEKEEDRLSCREELEQKGKFVQWCSQVEVLSHPSVA 360

Query: 361 CFITHCGWNSTLESLSFGVPMVGFPQQIDQATNAKLVEDVWKMGVRVKVNSEGIVEREEI 420
           CF+THCGWNSTLESL  GVP+V FPQ  DQ TNAKLV+DVWK+G+RV  N +GIVE +EI
Sbjct: 361 CFVTHCGWNSTLESLVSGVPLVAFPQWTDQGTNAKLVQDVWKIGLRVTTNKDGIVEGDEI 420

Query: 421 RRCVDLVMERKNGEKG-DIEKNVKKWKELAWEAINEGGSSIFNLENFVDE 451
           +RC++LV+   NGE+G ++ KN KKWK+LA EA  EGG+S  NL+ FVDE
Sbjct: 421 KRCLELVL--GNGERGEEMRKNAKKWKDLAREAAMEGGTSYNNLKAFVDE 458

BLAST of Cla97C06G112540 vs. TrEMBL
Match: tr|F6I4F4|F6I4F4_VITVI (Glycosyltransferase OS=Vitis vinifera OX=29760 GN=VIT_05s0062g00640 PE=3 SV=1)

HSP 1 Score: 472.2 bits (1214), Expect = 1.3e-129
Identity = 249/470 (52.98%), Positives = 323/470 (68.72%), Query Frame = 0

Query: 1   MRNHHFLIVCFPSQGHINPSLQLAKQLTTLNVEVTFATTVVAARRMNITQKIPSPPTLSF 60
           M + HFL+V FP+QGHINP+LQ AK++     +V+FAT+V A RRM    K  +P  L+F
Sbjct: 1   MGSPHFLLVTFPAQGHINPALQFAKRIIRTGAQVSFATSVSAHRRM---AKRSTPEGLNF 60

Query: 61  ATFSDGCDDENQTNSDFNHYVSELKRCGSQSLTNLITSARGPGRRPFTCVIYSLLLNWAA 120
             FSDG DD  +   D  HY+SE+KR GS++L  ++      G +PFTC++Y+LLL WAA
Sbjct: 61  VPFSDGYDDGFKPTDDVQHYMSEIKRRGSETLREIVVRNADEG-QPFTCIVYTLLLPWAA 120

Query: 121 ELATSFNIPSALFSAQPATVLALYYYYFHGFGDEIINKLQRNNPSLSIKLPGLPLFKSHD 180
           E+A    +PSAL   QPATVL +YYYYF+G+GD   N    N PS S++LPGLPL  S D
Sbjct: 121 EVARGLGVPSALLWIQPATVLDIYYYYFNGYGDVFRN--ISNEPSCSVELPGLPLLSSRD 180

Query: 181 MPSFFSPSGRDAFIIPLMREQMEFLGQQKRPTKVLVNTFDALENEALRAINELKMVGIGP 240
           +PSF   S    F++P  +EQ+E L Q+  P KVLVNTFDALE E LRA+++L ++GIGP
Sbjct: 181 LPSFLVKSNAYTFVLPTFQEQLEALSQETSP-KVLVNTFDALEPEPLRAVDKLHLIGIGP 240

Query: 241 LISELH------------GDLFQLSNEDYYIEWLNSKANSSVVYLSFGSICVLSKEQEEE 300
           L+   +            GD+FQ    D Y+EWLNSK  SSVVY+SFGSI VLSK Q+E+
Sbjct: 241 LVPSAYLDGKDPSDTSFGGDMFQ--GSDDYMEWLNSKPKSSVVYVSFGSISVLSKTQKED 300

Query: 301 IFYGLLESEYQFLWVMRSKNDEEEKK------WKELVEGKGRIVGWCRQIEVLKHPSLGC 360
           I   LL+  + FLWV+R+  + EE K       +E +E KG IV WC QIEVL HPSLGC
Sbjct: 301 IARALLDCGHPFLWVIRAPENGEEVKEQDKLSCREELEQKGMIVSWCSQIEVLTHPSLGC 360

Query: 361 FITHCGWNSTLESLSFGVPMVGFPQQIDQATNAKLVEDVWKMGVRVKVNSEGIVEREEIR 420
           F++HCGWNSTLESL  GVP+V FPQ  DQ TNAKL+ED+WK+G+RV VN EGIVE +E +
Sbjct: 361 FVSHCGWNSTLESLVSGVPVVAFPQWTDQGTNAKLIEDMWKIGIRVTVNEEGIVESDEFK 420

Query: 421 RCVDLVMERKNGEKG-DIEKNVKKWKELAWEAINEGGSSIFNLENFVDEI 452
           RC+++VM    GEKG ++ +N +KWK LA EA+ +GGSS  NL+ FVDE+
Sbjct: 421 RCLEIVM--GGGEKGEEMRRNAEKWKNLAREAVKDGGSSDKNLKGFVDEV 459

BLAST of Cla97C06G112540 vs. Swiss-Prot
Match: sp|F8WKW0|UGT1_GARJA (Crocetin glucosyltransferase, chloroplastic OS=Gardenia jasminoides OX=114476 GN=UGT75L6 PE=1 SV=1)

HSP 1 Score: 426.4 bits (1095), Expect = 4.0e-118
Identity = 234/472 (49.58%), Positives = 317/472 (67.16%), Query Frame = 0

Query: 1   MRNHHFLIVCFPSQGHINPSLQLAKQLTTLNVEVTFATTVVAARRMNITQKIPSPPTLSF 60
           ++  H L++ +P+QGHINP+LQ A++L  + ++VT AT+V A  RM  +    +P  L+F
Sbjct: 2   VQQRHVLLITYPAQGHINPALQFAQRLLRMGIQVTLATSVYALSRMKKSSG-STPKGLTF 61

Query: 61  ATFSDGCDDENQTNS-DFNHYVSELKRCGSQSLTNLITSARGPGRRPFTCVIYSLLLNWA 120
           ATFSDG DD  +    D   Y+S L + GS +L N+I ++   G  P TC++Y+LLL WA
Sbjct: 62  ATFSDGYDDGFRPKGVDHTEYMSSLAKQGSNTLRNVINTSADQG-CPVTCLVYTLLLPWA 121

Query: 121 AELATSFNIPSALFSAQPATVLALYYYYFHGFGDEIINKLQRNNPSLSIKLPGLPLFKSH 180
           A +A   +IPSAL   QP  V+ +YYYYF G+ D++ N    N+P+ SI+ PGLP  K+ 
Sbjct: 122 ATVARECHIPSALLWIQPVAVMDIYYYYFRGYEDDVKN--NSNDPTWSIQFPGLPSMKAK 181

Query: 181 DMPSFFSPSGRD--AFIIPLMREQMEFLGQQKRPTKVLVNTFDALENEALRAINELKMVG 240
           D+PSF  PS  +  +F +P  ++Q+E L +++RP KVLVNTFDALE +AL+AI    ++ 
Sbjct: 182 DLPSFILPSSDNIYSFALPTFKKQLETLDEEERP-KVLVNTFDALEPQALKAIESYNLIA 241

Query: 241 IGPLI------------SELHGDLFQLSNEDYYIEWLNSKANSSVVYLSFGSICVLSKEQ 300
           IGPL             +   GDLFQ S +  Y EWLNS+   SVVY+SFGS+  L K+Q
Sbjct: 242 IGPLTPSAFLDGKDPSETSFSGDLFQKSKD--YKEWLNSRPAGSVVYVSFGSLLTLPKQQ 301

Query: 301 EEEIFYGLLESEYQFLWVMRSK-NDEEEKKWKELV-----EGKGRIVGWCRQIEVLKHPS 360
            EEI  GLL+S   FLWV+R+K N EEEK+   L+     E +G IV WC QIEVL HPS
Sbjct: 302 MEEIARGLLKSGRPFLWVIRAKENGEEEKEEDRLICMEELEEQGMIVPWCSQIEVLTHPS 361

Query: 361 LGCFITHCGWNSTLESLSFGVPMVGFPQQIDQATNAKLVEDVWKMGVRVKVNSEGIVERE 420
           LGCF+THCGWNSTLE+L  GVP+V FP   DQ TNAKL+EDVW+ GVRV  N +G VE +
Sbjct: 362 LGCFVTHCGWNSTLETLVCGVPVVAFPHWTDQGTNAKLIEDVWETGVRVVPNEDGTVESD 421

Query: 421 EIRRCVDLVMERKNGEKG-DIEKNVKKWKELAWEAINEGGSSIFNLENFVDE 451
           EI+RC++ VM+  +GEKG ++++N KKWKELA EA+ E GSS  NL+ FV++
Sbjct: 422 EIKRCIETVMD--DGEKGVELKRNAKKWKELAREAMQEDGSSDKNLKAFVED 464

BLAST of Cla97C06G112540 vs. Swiss-Prot
Match: sp|O23406|U75D1_ARATH (UDP-glycosyltransferase 75D1 OS=Arabidopsis thaliana OX=3702 GN=UGT75D1 PE=2 SV=2)

HSP 1 Score: 394.4 bits (1012), Expect = 1.7e-108
Identity = 228/473 (48.20%), Positives = 306/473 (64.69%), Query Frame = 0

Query: 5   HFLIVCFPSQGHINPSLQLAKQL--TTLNVEVTFATTVVA-ARRMNITQKIPSPPTLSFA 64
           HFL V FP+QGHINPSL+LAK+L  T     VTFA ++ A  RRM  T+ +P   TL FA
Sbjct: 13  HFLFVTFPAQGHINPSLELAKRLAGTISGARVTFAASISAYNRRMFSTENVPE--TLIFA 72

Query: 65  TFSDGCDD--------ENQTNSDFNHYVSELKRCGSQSLTNLITSARGPGRRPFTCVIYS 124
           T+SDG DD        +        +++SE++R G ++LT LI   R    RPFTCV+Y+
Sbjct: 73  TYSDGHDDGFKSSAYSDKSRQDATGNFMSEMRRRGKETLTELIEDNR-KQNRPFTCVVYT 132

Query: 125 LLLNWAAELATSFNIPSALFSAQPATVLALYYYYFHGFGDEIINKLQRNNPSLSIKLPGL 184
           +LL W AELA  F++PSAL   QP TV +++Y+YF+G+ D I      N PS SIKLP L
Sbjct: 133 ILLTWVAELAREFHLPSALLWVQPVTVFSIFYHYFNGYEDAISE--MANTPSSSIKLPSL 192

Query: 185 PLFKSHDMPSFFSPSGRDAFIIPLMREQMEFLGQQKRPTKVLVNTFDALENEALRAI-NE 244
           PL    D+PSF   S   AF++P  REQ++ L ++  P K+L+NTF  LE EA+ ++ + 
Sbjct: 193 PLLTVRDIPSFIVSSNVYAFLLPAFREQIDSLKEEINP-KILINTFQELEPEAMSSVPDN 252

Query: 245 LKMVGIGPLISELHGDLFQLSNEDYYIEWLNSKANSSVVYLSFGSICVLSKEQEEEIFYG 304
            K+V +GPL++ L  D    S+   YIEWL++KA+SSV+Y+SFG++ VLSK+Q  E+   
Sbjct: 253 FKIVPVGPLLT-LRTD---FSSRGEYIEWLDTKADSSVLYVSFGTLAVLSKKQLVELCKA 312

Query: 305 LLESEYQFLWVM-----RSKNDEEEKK------WKELVEGKGRIVGWCRQIEVLKHPSLG 364
           L++S   FLWV+     R+K DE+EK+      ++E ++  G +V WC Q  VL H S+G
Sbjct: 313 LIQSRRPFLWVITDKSYRNKEDEQEKEEDCISSFREELDEIGMVVSWCDQFRVLNHRSIG 372

Query: 365 CFITHCGWNSTLESLSFGVPMVGFPQQIDQATNAKLVEDVWKMGVRV--KVNSEG--IVE 424
           CF+THCGWNSTLESL  GVP+V FPQ  DQ  NAKL+ED WK GVRV  K   EG  +V+
Sbjct: 373 CFVTHCGWNSTLESLVSGVPVVAFPQWNDQMMNAKLLEDCWKTGVRVMEKKEEEGVVVVD 432

Query: 425 REEIRRCVDLVMERKNGEKGDIEKNVKKWKELAWEAINEGGSSIFNLENFVDE 451
            EEIRRC++ VME K  E      N  +WK+LA EA+ EGGSS  +L+ FVDE
Sbjct: 433 SEEIRRCIEEVMEDKAEE---FRGNATRWKDLAAEAVREGGSSFNHLKAFVDE 472

BLAST of Cla97C06G112540 vs. Swiss-Prot
Match: sp|Q9ZR25|5GT_VERHY (Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase OS=Verbena hybrida OX=76714 GN=HGT8 PE=2 SV=1)

HSP 1 Score: 389.8 bits (1000), Expect = 4.1e-107
Identity = 224/471 (47.56%), Positives = 309/471 (65.61%), Query Frame = 0

Query: 1   MRNHHFLIVCFPSQGHINPSLQLAKQLTTLNVEVTFATTVVAARRMNITQKIPSPPTLSF 60
           M   H L+  FP+QGHINP+LQ AK+L   +++VTF T+V A RRM+ T    S   ++F
Sbjct: 1   MSRAHVLLATFPAQGHINPALQFAKRLANADIQVTFFTSVYAWRRMSRT-AAGSNGLINF 60

Query: 61  ATFSDGCDDENQTNSDFNHYVSELKRCGSQSLTNLITSARGPGR-RPFTCVIYSLLLNWA 120
            +FSDG DD  Q   D  +Y+SE+K  G ++L++ + +     +    T V+YS L  WA
Sbjct: 61  VSFSDGYDDGLQPGDDGKNYMSEMKSRGIKALSDTLAANNVDQKSSKITFVVYSHLFAWA 120

Query: 121 AELATSFNIPSALFSAQPATVLALYYYYFHGFGDEIINKLQRNNPSLSIKLP-GLPLFKS 180
           A++A  F++ SAL   +PATVL ++Y+YF+G+ DEI      +  S +I LP GLP+   
Sbjct: 121 AKVAREFHLRSALLWIEPATVLDIFYFYFNGYSDEI------DAGSDAIHLPGGLPVLAQ 180

Query: 181 HDMPSFFSPSGRDAFIIPLMREQMEFLGQQKRPTKVLVNTFDALENEALRAINELKMVGI 240
            D+PSF  PS  + F   LM+E++E L  +++P KVLVN+FDALE +AL+AI++ +M+ I
Sbjct: 181 RDLPSFLLPSTHERF-RSLMKEKLETLEGEEKP-KVLVNSFDALEPDALKAIDKYEMIAI 240

Query: 241 GPLI------------SELHGDLFQL-SNEDYYIEWLNSKANSSVVYLSFGSICVLSKEQ 300
           GPLI                GDLF+  SN+D  +EWL++   SSVVY+SFGS    +K Q
Sbjct: 241 GPLIPSAFLDGKDPSDRSFGGDLFEKGSNDDDCLEWLSTNPRSSVVYVSFGSFVNTTKSQ 300

Query: 301 EEEIFYGLLESEYQFLWVMRSKNDEEEK-KWKELVEGKGRIVGWCRQIEVLKHPSLGCFI 360
            EEI  GLL+    FLWV+R    EE      E ++  G+IV WC Q+EVL HPSLGCF+
Sbjct: 301 MEEIARGLLDCGRPFLWVVRVNEGEEVLISCMEELKRVGKIVSWCSQLEVLTHPSLGCFV 360

Query: 361 THCGWNSTLESLSFGVPMVGFPQQIDQATNAKLVEDVWKMGVRVKVNSEG-IVEREEIRR 420
           THCGWNSTLES+SFGVPMV FPQ  DQ TNAKL+EDVW+ GVRV+ N EG +V+ +EIRR
Sbjct: 361 THCGWNSTLESISFGVPMVAFPQWFDQGTNAKLMEDVWRTGVRVRANEEGSVVDGDEIRR 420

Query: 421 CVDLVMERKNGEKG-DIEKNVKKWKELAWEAINEGGSSIFNLENFVDEIDG 454
           C++ VM+   GEK   + ++  KWK+LA +A+ E GSS+ NL+ F+DE+ G
Sbjct: 421 CIEEVMD--GGEKSRKLRESAGKWKDLARKAMEEDGSSVNNLKVFLDEVVG 460

BLAST of Cla97C06G112540 vs. Swiss-Prot
Match: sp|Q0WW21|U75C1_ARATH (UDP-glycosyltransferase 75C1 OS=Arabidopsis thaliana OX=3702 GN=UGT75C1 PE=2 SV=2)

HSP 1 Score: 382.1 bits (980), Expect = 8.6e-105
Identity = 216/459 (47.06%), Positives = 296/459 (64.49%), Query Frame = 0

Query: 2   RNHHFLIVCFPSQGHINPSLQLAKQLTTLNVEVTFATTVVAARRMNITQKIPSPPTLSFA 61
           R  H+L+V FP+QGHINP+LQLA +L      VT++T V A RRM    + PS   LSFA
Sbjct: 10  RRPHYLLVTFPAQGHINPALQLANRLIHHGATVTYSTAVSAHRRMG---EPPSTKGLSFA 69

Query: 62  TFSDGCDDENQTNSDFNHYVSELKRCGSQSLTNLITS--ARGPGRRPFTCVIYSLLLNWA 121
            F+DG DD  ++  D   Y+SELKRCGS +L ++I +         P T VIYS+L+ W 
Sbjct: 70  WFTDGFDDGLKSFEDQKIYMSELKRCGSNALRDIIKANLDATTETEPITGVIYSVLVPWV 129

Query: 122 AELATSFNIPSALFSAQPATVLALYYYYFHGFGDEIINKLQRNNPSLSIKLPGLPLFKSH 181
           + +A  F++P+ L   +PATVL +YYYYF+     + +          IKLP LPL  + 
Sbjct: 130 STVAREFHLPTTLLWIEPATVLDIYYYYFNTSYKHLFD-------VEPIKLPKLPLITTG 189

Query: 182 DMPSFFSPSGRDAFIIPLMREQMEFLGQQKRPTKVLVNTFDALENEALRAINELKMVGIG 241
           D+PSF  PS      +  +RE +E L  +  P K+LVNTF ALE++AL ++ +LKM+ IG
Sbjct: 190 DLPSFLQPSKALPSALVTLREHIEALETESNP-KILVNTFSALEHDALTSVEKLKMIPIG 249

Query: 242 PLISELHG--DLFQLSNEDYYIEWLNSKANSSVVYLSFGSIC-VLSKEQEEEIFYGLLES 301
           PL+S   G  DLF+ S+ED Y +WL+SK   SV+Y+S G+    L ++  E + +G+L +
Sbjct: 250 PLVSSSEGKTDLFKSSDED-YTKWLDSKLERSVIYISLGTHADDLPEKHMEALTHGVLAT 309

Query: 302 EYQFLWVMRSKNDEEEKK--WKELVEG--KGRIVGWCRQIEVLKHPSLGCFITHCGWNST 361
              FLW++R KN EE+KK  + EL+ G  +G +VGWC Q  VL H ++GCF+THCGWNST
Sbjct: 310 NRPFLWIVREKNPEEKKKNRFLELIRGSDRGLVVGWCSQTAVLAHCAVGCFVTHCGWNST 369

Query: 362 LESLSFGVPMVGFPQQIDQATNAKLVEDVWKMGVRVKVNSEGIVEREEIRRCVDLVMERK 421
           LESL  GVP+V FPQ  DQ T AKLVED W++GV+VKV  EG V+ EEIRRC++ VM   
Sbjct: 370 LESLESGVPVVAFPQFADQCTTAKLVEDTWRIGVKVKVGEEGDVDGEEIRRCLEKVM--S 429

Query: 422 NGEKG-DIEKNVKKWKELAWEAINEGGSSIFNLENFVDE 451
            GE+  ++ +N +KWK +A +A  EGG S  NL+ FVDE
Sbjct: 430 GGEEAEEMRENAEKWKAMAVDAAAEGGPSDLNLKGFVDE 454

BLAST of Cla97C06G112540 vs. Swiss-Prot
Match: sp|Q9ZR27|5GT1_PERFR (Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 1 OS=Perilla frutescens OX=48386 GN=PF3R4 PE=1 SV=1)

HSP 1 Score: 377.5 bits (968), Expect = 2.1e-103
Identity = 217/472 (45.97%), Positives = 290/472 (61.44%), Query Frame = 0

Query: 1   MRNHHFLIVCFPSQGHINPSLQLAKQLTTLNVEVTFATTVVAARRMNITQKIP--SPPTL 60
           M     L+  FP+QGHINP+LQ AK+L     +VTF T+V A RRM  T      +PP L
Sbjct: 1   MVRRRVLLATFPAQGHINPALQFAKRLLKAGTDVTFFTSVYAWRRMANTASAAAGNPPGL 60

Query: 61  SFATFSDGCDDENQTNSDFNHYVSELKRCGSQSLTNLITSARGPGRRPFTCVIYSLLLNW 120
            F  FSDG DD  +   D   Y+SE+K  GS++L NL+ +         T V+YS L  W
Sbjct: 61  DFVAFSDGYDDGLKPCGDGKRYMSEMKARGSEALRNLLLN-----NHDVTFVVYSHLFAW 120

Query: 121 AAELATSFNIPSALFSAQPATVLALYYYYFHGFGDEIINKLQRNNPSLSIKLPGLPLFKS 180
           AAE+A    +PSAL   +PATVL +YY+YF+G+ DEI      +  S  I+LP LP  + 
Sbjct: 121 AAEVARESQVPSALLWVEPATVLCIYYFYFNGYADEI------DAGSDEIQLPRLPPLEQ 180

Query: 181 HDMPSFFSPSGRDAFIIPLMREQMEFLGQQKRPTKVLVNTFDALENEALRAINELKMVGI 240
             +P+F  P   + F + +M+E++E L  +++  KVLVNTFDALE +AL AI+  +++GI
Sbjct: 181 RSLPTFLLPETPERFRL-MMKEKLETLDGEEK-AKVLVNTFDALEPDALTAIDRYELIGI 240

Query: 241 GPLI------------SELHGDLFQLSNEDYYIEWLNSKANSSVVYLSFGSICVLSKEQE 300
           GPLI            +   GDLF+ S E+  +EWL++K  SSVVY+SFGS+    K Q 
Sbjct: 241 GPLIPSAFLDGGDPSETSYGGDLFEKSEENNCVEWLDTKPKSSVVYVSFGSVLRFPKAQM 300

Query: 301 EEIFYGLLESEYQFLWVMR-SKNDEEEKKWKEL-----VEGKGRIVGWCRQIEVLKHPSL 360
           EEI  GLL     FLW++R  KND       EL     ++  G+IV WC Q+EVL HP+L
Sbjct: 301 EEIGKGLLACGRPFLWMIREQKNDXXXXXXXELSCIGELKKMGKIVSWCSQLEVLAHPAL 360

Query: 361 GCFITHCGWNSTLESLSFGVPMVGFPQQIDQATNAKLVEDVWKMGVRVKVNSEGIVEREE 420
           GCF+THCGWNS +ESLS GVP+V  PQ  DQ TNAKL+ED W  GVRV++N  G V+  E
Sbjct: 361 GCFVTHCGWNSAVESLSCGVPVVAVPQWFDQTTNAKLIEDAWGTGVRVRMNEGGGVDGSE 420

Query: 421 IRRCVDLVMERKNGEKGD-IEKNVKKWKELAWEAINEGGSSIFNLENFVDEI 452
           I RCV++VM+   GEK   + +N  KWK LA EA+ E GSS+ NL  F+ ++
Sbjct: 421 IERCVEMVMD--GGEKSKLVRENAIKWKTLAREAMGEDGSSLKNLNAFLHQV 457

BLAST of Cla97C06G112540 vs. TAIR10
Match: AT4G15550.1 (indole-3-acetate beta-D-glucosyltransferase)

HSP 1 Score: 394.4 bits (1012), Expect = 9.3e-110
Identity = 228/473 (48.20%), Positives = 306/473 (64.69%), Query Frame = 0

Query: 5   HFLIVCFPSQGHINPSLQLAKQL--TTLNVEVTFATTVVA-ARRMNITQKIPSPPTLSFA 64
           HFL V FP+QGHINPSL+LAK+L  T     VTFA ++ A  RRM  T+ +P   TL FA
Sbjct: 13  HFLFVTFPAQGHINPSLELAKRLAGTISGARVTFAASISAYNRRMFSTENVPE--TLIFA 72

Query: 65  TFSDGCDD--------ENQTNSDFNHYVSELKRCGSQSLTNLITSARGPGRRPFTCVIYS 124
           T+SDG DD        +        +++SE++R G ++LT LI   R    RPFTCV+Y+
Sbjct: 73  TYSDGHDDGFKSSAYSDKSRQDATGNFMSEMRRRGKETLTELIEDNR-KQNRPFTCVVYT 132

Query: 125 LLLNWAAELATSFNIPSALFSAQPATVLALYYYYFHGFGDEIINKLQRNNPSLSIKLPGL 184
           +LL W AELA  F++PSAL   QP TV +++Y+YF+G+ D I      N PS SIKLP L
Sbjct: 133 ILLTWVAELAREFHLPSALLWVQPVTVFSIFYHYFNGYEDAISE--MANTPSSSIKLPSL 192

Query: 185 PLFKSHDMPSFFSPSGRDAFIIPLMREQMEFLGQQKRPTKVLVNTFDALENEALRAI-NE 244
           PL    D+PSF   S   AF++P  REQ++ L ++  P K+L+NTF  LE EA+ ++ + 
Sbjct: 193 PLLTVRDIPSFIVSSNVYAFLLPAFREQIDSLKEEINP-KILINTFQELEPEAMSSVPDN 252

Query: 245 LKMVGIGPLISELHGDLFQLSNEDYYIEWLNSKANSSVVYLSFGSICVLSKEQEEEIFYG 304
            K+V +GPL++ L  D    S+   YIEWL++KA+SSV+Y+SFG++ VLSK+Q  E+   
Sbjct: 253 FKIVPVGPLLT-LRTD---FSSRGEYIEWLDTKADSSVLYVSFGTLAVLSKKQLVELCKA 312

Query: 305 LLESEYQFLWVM-----RSKNDEEEKK------WKELVEGKGRIVGWCRQIEVLKHPSLG 364
           L++S   FLWV+     R+K DE+EK+      ++E ++  G +V WC Q  VL H S+G
Sbjct: 313 LIQSRRPFLWVITDKSYRNKEDEQEKEEDCISSFREELDEIGMVVSWCDQFRVLNHRSIG 372

Query: 365 CFITHCGWNSTLESLSFGVPMVGFPQQIDQATNAKLVEDVWKMGVRV--KVNSEG--IVE 424
           CF+THCGWNSTLESL  GVP+V FPQ  DQ  NAKL+ED WK GVRV  K   EG  +V+
Sbjct: 373 CFVTHCGWNSTLESLVSGVPVVAFPQWNDQMMNAKLLEDCWKTGVRVMEKKEEEGVVVVD 432

Query: 425 REEIRRCVDLVMERKNGEKGDIEKNVKKWKELAWEAINEGGSSIFNLENFVDE 451
            EEIRRC++ VME K  E      N  +WK+LA EA+ EGGSS  +L+ FVDE
Sbjct: 433 SEEIRRCIEEVMEDKAEE---FRGNATRWKDLAAEAVREGGSSFNHLKAFVDE 472

BLAST of Cla97C06G112540 vs. TAIR10
Match: AT4G14090.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 382.1 bits (980), Expect = 4.8e-106
Identity = 216/459 (47.06%), Positives = 296/459 (64.49%), Query Frame = 0

Query: 2   RNHHFLIVCFPSQGHINPSLQLAKQLTTLNVEVTFATTVVAARRMNITQKIPSPPTLSFA 61
           R  H+L+V FP+QGHINP+LQLA +L      VT++T V A RRM    + PS   LSFA
Sbjct: 10  RRPHYLLVTFPAQGHINPALQLANRLIHHGATVTYSTAVSAHRRMG---EPPSTKGLSFA 69

Query: 62  TFSDGCDDENQTNSDFNHYVSELKRCGSQSLTNLITS--ARGPGRRPFTCVIYSLLLNWA 121
            F+DG DD  ++  D   Y+SELKRCGS +L ++I +         P T VIYS+L+ W 
Sbjct: 70  WFTDGFDDGLKSFEDQKIYMSELKRCGSNALRDIIKANLDATTETEPITGVIYSVLVPWV 129

Query: 122 AELATSFNIPSALFSAQPATVLALYYYYFHGFGDEIINKLQRNNPSLSIKLPGLPLFKSH 181
           + +A  F++P+ L   +PATVL +YYYYF+     + +          IKLP LPL  + 
Sbjct: 130 STVAREFHLPTTLLWIEPATVLDIYYYYFNTSYKHLFD-------VEPIKLPKLPLITTG 189

Query: 182 DMPSFFSPSGRDAFIIPLMREQMEFLGQQKRPTKVLVNTFDALENEALRAINELKMVGIG 241
           D+PSF  PS      +  +RE +E L  +  P K+LVNTF ALE++AL ++ +LKM+ IG
Sbjct: 190 DLPSFLQPSKALPSALVTLREHIEALETESNP-KILVNTFSALEHDALTSVEKLKMIPIG 249

Query: 242 PLISELHG--DLFQLSNEDYYIEWLNSKANSSVVYLSFGSIC-VLSKEQEEEIFYGLLES 301
           PL+S   G  DLF+ S+ED Y +WL+SK   SV+Y+S G+    L ++  E + +G+L +
Sbjct: 250 PLVSSSEGKTDLFKSSDED-YTKWLDSKLERSVIYISLGTHADDLPEKHMEALTHGVLAT 309

Query: 302 EYQFLWVMRSKNDEEEKK--WKELVEG--KGRIVGWCRQIEVLKHPSLGCFITHCGWNST 361
              FLW++R KN EE+KK  + EL+ G  +G +VGWC Q  VL H ++GCF+THCGWNST
Sbjct: 310 NRPFLWIVREKNPEEKKKNRFLELIRGSDRGLVVGWCSQTAVLAHCAVGCFVTHCGWNST 369

Query: 362 LESLSFGVPMVGFPQQIDQATNAKLVEDVWKMGVRVKVNSEGIVEREEIRRCVDLVMERK 421
           LESL  GVP+V FPQ  DQ T AKLVED W++GV+VKV  EG V+ EEIRRC++ VM   
Sbjct: 370 LESLESGVPVVAFPQFADQCTTAKLVEDTWRIGVKVKVGEEGDVDGEEIRRCLEKVM--S 429

Query: 422 NGEKG-DIEKNVKKWKELAWEAINEGGSSIFNLENFVDE 451
            GE+  ++ +N +KWK +A +A  EGG S  NL+ FVDE
Sbjct: 430 GGEEAEEMRENAEKWKAMAVDAAAEGGPSDLNLKGFVDE 454

BLAST of Cla97C06G112540 vs. TAIR10
Match: AT1G05560.1 (UDP-glucosyltransferase 75B1)

HSP 1 Score: 352.4 bits (903), Expect = 4.0e-97
Identity = 208/468 (44.44%), Positives = 280/468 (59.83%), Query Frame = 0

Query: 5   HFLIVCFPSQGHINPSLQLAKQL-TTLNVEVTFATTVVAARRMNITQKIPSPPTLSFATF 64
           HFL+V FP+QGH+NPSL+ A++L       VTF T V       I         LSF TF
Sbjct: 5   HFLLVTFPAQGHVNPSLRFARRLIKRTGARVTFVTCVSVFHNSMIANH-NKVENLSFLTF 64

Query: 65  SDGCDDEN-QTNSDFNHYVSELKRCGSQSLTNLITSARGPGRRPFTCVIYSLLLNWAAEL 124
           SDG DD    T  D       LK  G ++L++ I + +  G  P TC+IY++LLNWA ++
Sbjct: 65  SDGFDDGGISTYEDRQKRSVNLKVNGDKALSDFIEATKN-GDSPVTCLIYTILLNWAPKV 124

Query: 125 ATSFNIPSALFSAQPATVLALYYYYFHGFGDEIINKLQRNNPSLSIKLPGLPLFKSHDMP 184
           A  F +PSAL   QPA V  +YY +F G            N S+  +LP L   +  D+P
Sbjct: 125 ARRFQLPSALLWIQPALVFNIYYTHFMG------------NKSV-FELPNLSSLEIRDLP 184

Query: 185 SFFSPSGRDAFIIPLMREQMEFLGQQKRPTKVLVNTFDALENEALRAINELKMVGIGPLI 244
           SF +PS  +       +E MEFL ++ +P K+L+NTFD+LE EAL A   + MV +GPL+
Sbjct: 185 SFLTPSNTNKGAYDAFQEMMEFLIKETKP-KILINTFDSLEPEALTAFPNIDMVAVGPLL 244

Query: 245 -SELHGDLFQLSNEDY---YIEWLNSKANSSVVYLSFGSICVLSKEQEEEIFYGLLESEY 304
            +E+       S +D    Y  WL+SK  SSV+Y+SFG++  LSK+Q EE+   L+E + 
Sbjct: 245 PTEIFSGSTNKSVKDQSSSYTLWLDSKTESSVIYVSFGTMVELSKKQIEELARALIEGKR 304

Query: 305 QFLWVMRSKNDEEEKK-------------WKELVEGKGRIVGWCRQIEVLKHPSLGCFIT 364
            FLWV+  K++ E K              ++  +E  G IV WC QIEVL H ++GCF+T
Sbjct: 305 PFLWVITDKSNRETKTEGEEETEIEKIAGFRHELEEVGMIVSWCSQIEVLSHRAVGCFVT 364

Query: 365 HCGWNSTLESLSFGVPMVGFPQQIDQATNAKLVEDVWKMGVRVKVNSEGIVEREEIRRCV 424
           HCGW+STLESL  GVP+V FP   DQ TNAKL+E+ WK GVRV+ N +G+VER EIRRC+
Sbjct: 365 HCGWSSTLESLVLGVPVVAFPMWSDQPTNAKLLEESWKTGVRVRENKDGLVERGEIRRCL 424

Query: 425 DLVMERKNGEKGDIEKNVKKWKELAWEAINEGGSSIFNLENFVDEIDG 454
           + VME K+ E   + +N KKWK LA EA  EGGSS  N+E FV++I G
Sbjct: 425 EAVMEEKSVE---LRENAKKWKRLAMEAGREGGSSDKNMEAFVEDICG 453

BLAST of Cla97C06G112540 vs. TAIR10
Match: AT1G05530.1 (UDP-glucosyl transferase 75B2)

HSP 1 Score: 341.3 bits (874), Expect = 9.3e-94
Identity = 203/473 (42.92%), Positives = 278/473 (58.77%), Query Frame = 0

Query: 1   MRNHHFLIVCFPSQGHINPSLQLAKQL-TTLNVEVTFATTVVAARRMNITQKIPSPPTLS 60
           M   HFL+V FP+QGH+NPSL+ A++L  T    VTFAT +    R  I     +   LS
Sbjct: 1   MAQPHFLLVTFPAQGHVNPSLRFARRLIKTTGARVTFATCLSVIHRSMIPNH-NNVENLS 60

Query: 61  FATFSDGCDDENQTNS-DFNHYVSELKRCGSQSLTNLITSARGPGRRPFTCVIYSLLLNW 120
           F TFSDG DD   +N+ D  + +   +R G ++L++ I  A   G  P +C+IY++L NW
Sbjct: 61  FLTFSDGFDDGVISNTDDVQNRLVHFERNGDKALSDFI-EANQNGDSPVSCLIYTILPNW 120

Query: 121 AAELATSFNIPSALFSAQPATVLALYYYYFHGFGDEIINKLQRNNPSLSIKLPGLPLFKS 180
             ++A  F++PS     QPA    +YY Y  G           NN     + P LP  + 
Sbjct: 121 VPKVARRFHLPSVHLWIQPAFAFDIYYNYSTG-----------NNS--VFEFPNLPSLEI 180

Query: 181 HDMPSFFSPSGRDAFIIPLMREQMEFLGQQKRPTKVLVNTFDALENEALRAINELKMVGI 240
            D+PSF SPS  +     + +E M+FL ++  P K+LVNTFD+LE E L AI  ++MV +
Sbjct: 181 RDLPSFLSPSNTNKAAQAVYQELMDFLKEESNP-KILVNTFDSLEPEFLTAIPNIEMVAV 240

Query: 241 GPLI-------SELHGDLFQLSNEDYYIEWLNSKANSSVVYLSFGSICVLSKEQEEEIFY 300
           GPL+       SE   DL +      Y  WL+SK  SSV+Y+SFG++  LSK+Q EE+  
Sbjct: 241 GPLLPAEIFTGSESGKDLSRDHQSSSYTLWLDSKTESSVIYVSFGTMVELSKKQIEELAR 300

Query: 301 GLLESEYQFLWVMRSKNDEEEK-------------KWKELVEGKGRIVGWCRQIEVLKHP 360
            L+E    FLWV+  K + E K              ++  +E  G IV WC QIEVL+H 
Sbjct: 301 ALIEGGRPFLWVITDKLNREAKIEGEEETEIEKIAGFRHELEEVGMIVSWCSQIEVLRHR 360

Query: 361 SLGCFITHCGWNSTLESLSFGVPMVGFPQQIDQATNAKLVEDVWKMGVRVKVNSEGIVER 420
           ++GCF+THCGW+S+LESL  GVP+V FP   DQ  NAKL+E++WK GVRV+ NSEG+VER
Sbjct: 361 AIGCFLTHCGWSSSLESLVLGVPVVAFPMWSDQPANAKLLEEIWKTGVRVRENSEGLVER 420

Query: 421 EEIRRCVDLVMERKNGEKGDIEKNVKKWKELAWEAINEGGSSIFNLENFVDEI 452
            EI RC++ VME K+ E   + +N +KWK LA EA  EGGSS  N+E FV  +
Sbjct: 421 GEIMRCLEAVMEAKSVE---LRENAEKWKRLATEAGREGGSSDKNVEAFVKSL 454

BLAST of Cla97C06G112540 vs. TAIR10
Match: AT1G05680.1 (Uridine diphosphate glycosyltransferase 74E2)

HSP 1 Score: 286.2 bits (731), Expect = 3.5e-77
Identity = 173/465 (37.20%), Positives = 254/465 (54.62%), Query Frame = 0

Query: 5   HFLIVCFPSQGHINPSLQLAKQLTTLNVEVTFATTVVAARRMNITQKIPSPP------TL 64
           H +++ FP QGHI P  Q  K+L +  +++T            +    PSPP      ++
Sbjct: 6   HLIVLPFPGQGHITPMSQFCKRLASKGLKLTLV----------LVSDKPSPPYKTEHDSI 65

Query: 65  SFATFSDGCDDENQTNSDFNHYVSELKRCGSQSLTNLITSARGPGRRPFTCVIYSLLLNW 124
           +    S+G  +  +   D + Y+  ++     +L  L+   +  G  P   ++Y   + W
Sbjct: 66  TVFPISNGFQEGEEPLQDLDDYMERVETSIKNTLPKLVEDMKLSGNPP-RAIVYDSTMPW 125

Query: 125 AAELATSFNIPSALFSAQPATVLALYYYYFHGFGDEIINKLQRNNPSLSIKLPGLPLFKS 184
             ++A S+ +  A+F  QP  V A+YY+ F G       K      S     P  P+  +
Sbjct: 126 LLDVAHSYGLSGAVFFTQPWLVTAIYYHVFKGSFSVPSTKY---GHSTLASFPSFPMLTA 185

Query: 185 HDMPSFFSPSGRDAFIIPLMREQMEFLGQQKRPTKVLVNTFDALENEALRAINEL-KMVG 244
           +D+PSF   S     I+ ++ +Q   L    R   VL NTFD LE + L+ +  L  ++ 
Sbjct: 186 NDLPSFLCESSSYPNILRIVVDQ---LSNIDRVDIVLCNTFDKLEEKLLKWVQSLWPVLN 245

Query: 245 IGPLISELHGDLFQLSNEDYY------------IEWLNSKANSSVVYLSFGSICVLSKEQ 304
           IGP +  ++ D  +LS +  Y            +EWLNSK  +SVVYLSFGS+ +L ++Q
Sbjct: 246 IGPTVPSMYLDK-RLSEDKNYGFSLFNAKVAECMEWLNSKEPNSVVYLSFGSLVILKEDQ 305

Query: 305 EEEIFYGLLESEYQFLWVMR-SKNDEEEKKWKELVEGKGRIVGWCRQIEVLKHPSLGCFI 364
             E+  GL +S   FLWV+R ++  +  + + E +  KG IV W  Q++VL H S+GCF+
Sbjct: 306 MLELAAGLKQSGRFFLWVVRETETHKLPRNYVEEIGEKGLIVSWSPQLDVLAHKSIGCFL 365

Query: 365 THCGWNSTLESLSFGVPMVGFPQQIDQATNAKLVEDVWKMGVRVKVNSEGIVEREEIRRC 424
           THCGWNSTLE LS GVPM+G P   DQ TNAK ++DVWK+GVRVK   +G V REEI R 
Sbjct: 366 THCGWNSTLEGLSLGVPMIGMPHWTDQPTNAKFMQDVWKVGVRVKAEGDGFVRREEIMRS 425

Query: 425 VDLVMERKNGEKG-DIEKNVKKWKELAWEAINEGGSSIFNLENFV 449
           V+ VME   GEKG +I KN +KWK LA EA++EGGSS  ++  FV
Sbjct: 426 VEEVME---GEKGKEIRKNAEKWKVLAQEAVSEGGSSDKSINEFV 449

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_004140604.2  5.7e-214  82.02     PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Cucumis sativus] >K...
XP_008458143.1  4.5e-211  81.96     PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Cucumis melo]
XP_023000094.1  1.3e-170  69.74     crocetin glucosyltransferase, chloroplastic-like [Cucurbita maxima]
XP_022964378.1  5.7e-166  68.56     crocetin glucosyltransferase, chloroplastic-like [Cucurbita moschata]
XP_023514979.1  3.7e-165  67.61     crocetin glucosyltransferase, chloroplastic-like [Cucurbita pepo subsp. pepo]

Match Name                      E-value   Identity  Description
tr|A0A0A0KCM4|A0A0A0KCM4_CUCSA  3.7e-214  82.02     Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_6G109740 PE=3 SV=1
tr|A0A1S3C7S0|A0A1S3C7S0_CUCME  3.0e-211  81.96     Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103497668 PE=3 SV=1
tr|A0A2N9EWS8|A0A2N9EWS8_FAGSY  1.0e-131  55.08     Glycosyltransferase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS7145 PE=3 SV=1
tr|A0A2I4GXB6|A0A2I4GXB6_9ROSI  8.8e-131  55.32     Glycosyltransferase OS=Juglans regia OX=51240 GN=LOC109011704 PE=3 SV=1
tr|F6I4F4|F6I4F4_VITVI          1.3e-129  52.98     Glycosyltransferase OS=Vitis vinifera OX=29760 GN=VIT_05s0062g00640 PE=3 SV=1

Match Name             E-value   Identity  Description
sp|F8WKW0|UGT1_GARJA   4.0e-118  49.58     Crocetin glucosyltransferase, chloroplastic OS=Gardenia jasminoides OX=114476 GN...
sp|O23406|U75D1_ARATH  1.7e-108  48.20     UDP-glycosyltransferase 75D1 OS=Arabidopsis thaliana OX=3702 GN=UGT75D1 PE=2 SV=...
sp|Q9ZR25|5GT_VERHY    4.1e-107  47.56     Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase OS=Verbena hybrida OX=76714 ...
sp|Q0WW21|U75C1_ARATH  8.6e-105  47.06     UDP-glycosyltransferase 75C1 OS=Arabidopsis thaliana OX=3702 GN=UGT75C1 PE=2 SV=...
sp|Q9ZR27|5GT1_PERFR   2.1e-103  45.97     Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 1 OS=Perilla frutescens OX=4...

Match Name   E-value   Identity  Description
AT4G15550.1  9.3e-110  48.20     indole-3-acetate beta-D-glucosyltransferase
AT4G14090.1  4.8e-106  47.06     UDP-Glycosyltransferase superfamily protein
AT1G05560.1  4.0e-97   44.44     UDP-glucosyltransferase 75B1
AT1G05530.1  9.3e-94   42.92     UDP-glucosyl transferase 75B2
AT1G05680.1  3.5e-77   37.20     Uridine diphosphate glycosyltransferase 74E2

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0016758  transferase activity, transferring hexosyl groups
Vocabulary: Biological Process
Term        Definition
GO:0008152  metabolic process
Vocabulary: INTERPRO
Term        Definition
IPR035595   UDP_glycos_trans_CS
IPR002213   UDP_glucos_trans
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008152      metabolic process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0016740      transferase activity
molecular_function  GO:0016757      transferase activity, transferring glycosyl groups
molecular_function  GO:0016758      transferase activity, transferring hexosyl groups

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C06G112540.1  Cla97C06G112540.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term   IPR Description                                 Source       Source Term         Source Description                              Alignment
None       No IPR available                                GENE3D       G3DSA:3.40.50.2000                                                  coord: 251..432; e-value: 7.3E-133; score: 446.0
None       No IPR available                                GENE3D       G3DSA:3.40.50.2000                                                  coord: 11..250; e-value: 7.3E-133; score: 446.0
                                                                                                                                            coord: 433..444; e-value: 7.3E-133; score: 446.0
None       No IPR available                                PANTHER      PTHR11926           GLUCOSYL/GLUCURONOSYL TRANSFERASES              coord: 3..452
None       No IPR available                                PANTHER      PTHR11926:SF767     SUBFAMILY NOT NAMED                             coord: 3..452
None       No IPR available                                SUPERFAMILY  SSF53756            UDP-Glycosyltransferase/glycogen phosphorylase  coord: 2..451
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase        PFAM         PF00201             UDPGT                                           coord: 260..390; e-value: 7.1E-23; score: 81.1
IPR035595  UDP-glycosyltransferase family, conserved site  PROSITE      PS00375             UDPGT                                           coord: 328..371
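
The coordinates in the InterPro table can be compared directly to see how the annotated features nest. The sketch below assumes the `coord` values are 1-based, inclusive positions on the protein (the page does not state this). It checks whether the PROSITE conserved site falls inside the Pfam UDPGT domain, and whether that domain falls inside the second GENE3D region. The dictionary keys and the `contains` helper are hypothetical names chosen for illustration.

```python
# Coordinates copied from the InterPro table (assumed 1-based, inclusive).
domains = {
    "GENE3D_251_432": (251, 432),
    "PFAM_PF00201": (260, 390),
    "PROSITE_PS00375": (328, 371),
}

def contains(outer, inner):
    """Return True if interval `inner` lies entirely within interval `outer`."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]

site_in_pfam = contains(domains["PFAM_PF00201"], domains["PROSITE_PS00375"])
pfam_in_gene3d = contains(domains["GENE3D_251_432"], domains["PFAM_PF00201"])

print("PROSITE site inside Pfam domain:", site_in_pfam)
print("Pfam domain inside GENE3D region:", pfam_in_gene3d)
```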

The following gene(s) are orthologous to this gene:
Gene             Orthologue      Organism                      Block
Cla97C06G112540  ClCG06G003150   Watermelon (Charleston Gray)  wcgwmbB273
Cla97C06G112540  CmaCh17G004740  Cucurbita maxima (Rimu)       cmawmbB377
Cla97C06G112540  CmoCh17G004480  Cucurbita moschata (Rifu)     cmowmbB361
Cla97C06G112540  CmoCh08G008380  Cucurbita moschata (Rifu)     cmowmbB891
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene             Organism                    Block
Cla97C06G112540  Silver-seed gourd           carwmbB0584
Cla97C06G112540  Silver-seed gourd           carwmbB0628
Cla97C06G112540  Cucumber (Gy14) v2          cgybwmbB446
Cla97C06G112540  Cucumber (Gy14) v1          cgywmbB183
Cla97C06G112540  Cucurbita maxima (Rimu)     cmawmbB917
Cla97C06G112540  Wild cucumber (PI 183967)   cpiwmbB495
Cla97C06G112540  Cucumber (Chinese Long) v3  cucwmbB487
Cla97C06G112540  Cucumber (Chinese Long) v2  cuwmbB468
Cla97C06G112540  Bottle gourd (USVL1VR-Ls)   lsiwmbB026
Cla97C06G112540  Melon (DHL92) v3.6.1        medwmbB118
Cla97C06G112540  Melon (DHL92) v3.5.1        mewmbB127
Cla97C06G112540  Watermelon (97103) v1       wmwmbB408
Cla97C06G112540  Wax gourd                   wgowmbB094