Cla97C02G037890 (gene) Watermelon (97103) v2

NameCla97C02G037890
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionGlycosyltransferase
LocationCla97Chr02 : 25001347 .. 25002923 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCCATGCTACCACAAAACAAAGAATTCAGAATCACCGTCCTCCCGCTGTTCGCCTCCGGCCACATAATTCCGATCATCGACATGGCCAGGCTCTTTGCCCGCCACGGTGCCACCGTCACCATCATCGCCACCGAGTCCAACGCCTCCATTTTCCAAAACAACATCGACCATGACTTTGCCGCCGGATTCAAAATTCAGACCCACATTGTTAGCTTCCCTGGAGCCCAGGTTGGCCTCGCCCCCGGCATCGAAAACTACAGCGATGTCTCTTCTCGCCACCTCCAAGCCAAAATCTATCAAGCCTTTCTCCTTCTTGACAAACTTATAGACCAGGTTCCTTCCACCTTCTTTAATTGGTTTTTAGTTTAATGCTTATGCGTAACCTGTAAGGTCATTTTTTTCTCCGTATCGTAATTTTATGGTAGATGATCATTCCGGCAACTCGACCAGACTGCATTCTGAGCGACCTGTCGCATCCGTGGACGACGGATACGGCAGAGAGGCTCGGGGTGCCGCGGCTGGTGTTCTCGGTGTCGAATTTCATGGCATACTCTGCAGAGCACTCTGTTATGCAACATTCTCCTCACCAGAAAGTAGCCTCAGACACAGAGGAATTCGAAATCCCAGGATTACCCCACCACATTCAAATGACCAAATCCCAGCAGCCGGAATTTCTTCTCCGACGAGACCGCTTCACGGCGATGATGGAGAGTTACAAGGAAGCAGAGAGAAGAAGCTACGGAACTGTAATGAACACATTTTATGAGCTGGATGGGGTTTATTTAGAGCATTACAAAAAGATAACTGGAATCAAAGCTTGGGGATTAGGCCCAGTTTCATTGGCAGTGAACAAAAATCTGAGAGAAAAAATTGAAAGGGGAAACAAATCGGGAATGGAGAGTGAAGAGCTAGTGAAATGGTTGAATTCCAAGGAACCAAACTCTGTTTTGTTTGTGAGTTTTGGGAGTATGACTAGGTTTCCGCCGCCGCAAATGGCTGAGATTGCACATGGGCTTGAAGATTCCGGCATAAATTTCATATGGGTTATTCGAAACAAGGACAAAAACGACAGTGGAGAGGCGCCAGAGGGGCTGCCGGAGGGGTTCGAACAGATGATTAAGAATAAAAACAGGGGATTCATTGTTCGGATTTGGGCGCCGCAACTTTTGATTTTGGAGCACCCATCGACGGGGGGTTTCTTGACGCACTGTGGGTGGAATTCGTCCATTGAGGGGATCAGCGCCGGTCAGCCGATGGTGACGTGGCCGGTAAGCTCCGAGCAGTTTTATAATGAGAAGCTTCTGACGGAGGTGTTGCAGGTGGGGGTTCCGGTAGGGGCGCGGCGGTGGTGGAATATGAGCGATGAGATGAAGGAGATTGTGAGTAGAGAGAATGTGGAAAAGGGCGTGGGGTTTCTTATGGGGGCGACGGAGGAGGCGGCGGCAATTAGAGAGCGGGCGAAACAGCTCGGGGCTGCTGCGAACAGGGCAGTTCAAAGCGGCGGCTCGTCGGAGAACAATTTGATATCGTTGATGAAAGAATTGAGGTCAATTAAGGTTAACGATAAGGATTAA

mRNA sequence

ATGTCCATGCTACCACAAAACAAAGAATTCAGAATCACCGTCCTCCCGCTGTTCGCCTCCGGCCACATAATTCCGATCATCGACATGGCCAGGCTCTTTGCCCGCCACGGTGCCACCGTCACCATCATCGCCACCGAGTCCAACGCCTCCATTTTCCAAAACAACATCGACCATGACTTTGCCGCCGGATTCAAAATTCAGACCCACATTGTTAGCTTCCCTGGAGCCCAGGTTGGCCTCGCCCCCGGCATCGAAAACTACAGCGATGTCTCTTCTCGCCACCTCCAAGCCAAAATCTATCAAGCCTTTCTCCTTCTTGACAAACTTATAGACCAGATGATCATTCCGGCAACTCGACCAGACTGCATTCTGAGCGACCTGTCGCATCCGTGGACGACGGATACGGCAGAGAGGCTCGGGGTGCCGCGGCTGGTGTTCTCGGTGTCGAATTTCATGGCATACTCTGCAGAGCACTCTGTTATGCAACATTCTCCTCACCAGAAAGTAGCCTCAGACACAGAGGAATTCGAAATCCCAGGATTACCCCACCACATTCAAATGACCAAATCCCAGCAGCCGGAATTTCTTCTCCGACGAGACCGCTTCACGGCGATGATGGAGAGTTACAAGGAAGCAGAGAGAAGAAGCTACGGAACTGTAATGAACACATTTTATGAGCTGGATGGGGTTTATTTAGAGCATTACAAAAAGATAACTGGAATCAAAGCTTGGGGATTAGGCCCAGTTTCATTGGCAGTGAACAAAAATCTGAGAGAAAAAATTGAAAGGGGAAACAAATCGGGAATGGAGAGTGAAGAGCTAGTGAAATGGTTGAATTCCAAGGAACCAAACTCTGTTTTGTTTGTGAGTTTTGGGAGTATGACTAGGTTTCCGCCGCCGCAAATGGCTGAGATTGCACATGGGCTTGAAGATTCCGGCATAAATTTCATATGGGTTATTCGAAACAAGGACAAAAACGACAGTGGAGAGGCGCCAGAGGGGCTGCCGGAGGGGTTCGAACAGATGATTAAGAATAAAAACAGGGGATTCATTGTTCGGATTTGGGCGCCGCAACTTTTGATTTTGGAGCACCCATCGACGGGGGGTTTCTTGACGCACTGTGGGTGGAATTCGTCCATTGAGGGGATCAGCGCCGGTCAGCCGATGGTGACGTGGCCGGTAAGCTCCGAGCAGTTTTATAATGAGAAGCTTCTGACGGAGGTGTTGCAGGTGGGGGTTCCGGTAGGGGCGCGGCGGTGGTGGAATATGAGCGATGAGATGAAGGAGATTGTGAGTAGAGAGAATGTGGAAAAGGGCGTGGGGTTTCTTATGGGGGCGACGGAGGAGGCGGCGGCAATTAGAGAGCGGGCGAAACAGCTCGGGGCTGCTGCGAACAGGGCAGTTCAAAGCGGCGGCTCGTCGGAGAACAATTTGATATCGTTGATGAAAGAATTGAGGTCAATTAAGGTTAACGATAAGGATTAA

Coding sequence (CDS)

ATGTCCATGCTACCACAAAACAAAGAATTCAGAATCACCGTCCTCCCGCTGTTCGCCTCCGGCCACATAATTCCGATCATCGACATGGCCAGGCTCTTTGCCCGCCACGGTGCCACCGTCACCATCATCGCCACCGAGTCCAACGCCTCCATTTTCCAAAACAACATCGACCATGACTTTGCCGCCGGATTCAAAATTCAGACCCACATTGTTAGCTTCCCTGGAGCCCAGGTTGGCCTCGCCCCCGGCATCGAAAACTACAGCGATGTCTCTTCTCGCCACCTCCAAGCCAAAATCTATCAAGCCTTTCTCCTTCTTGACAAACTTATAGACCAGATGATCATTCCGGCAACTCGACCAGACTGCATTCTGAGCGACCTGTCGCATCCGTGGACGACGGATACGGCAGAGAGGCTCGGGGTGCCGCGGCTGGTGTTCTCGGTGTCGAATTTCATGGCATACTCTGCAGAGCACTCTGTTATGCAACATTCTCCTCACCAGAAAGTAGCCTCAGACACAGAGGAATTCGAAATCCCAGGATTACCCCACCACATTCAAATGACCAAATCCCAGCAGCCGGAATTTCTTCTCCGACGAGACCGCTTCACGGCGATGATGGAGAGTTACAAGGAAGCAGAGAGAAGAAGCTACGGAACTGTAATGAACACATTTTATGAGCTGGATGGGGTTTATTTAGAGCATTACAAAAAGATAACTGGAATCAAAGCTTGGGGATTAGGCCCAGTTTCATTGGCAGTGAACAAAAATCTGAGAGAAAAAATTGAAAGGGGAAACAAATCGGGAATGGAGAGTGAAGAGCTAGTGAAATGGTTGAATTCCAAGGAACCAAACTCTGTTTTGTTTGTGAGTTTTGGGAGTATGACTAGGTTTCCGCCGCCGCAAATGGCTGAGATTGCACATGGGCTTGAAGATTCCGGCATAAATTTCATATGGGTTATTCGAAACAAGGACAAAAACGACAGTGGAGAGGCGCCAGAGGGGCTGCCGGAGGGGTTCGAACAGATGATTAAGAATAAAAACAGGGGATTCATTGTTCGGATTTGGGCGCCGCAACTTTTGATTTTGGAGCACCCATCGACGGGGGGTTTCTTGACGCACTGTGGGTGGAATTCGTCCATTGAGGGGATCAGCGCCGGTCAGCCGATGGTGACGTGGCCGGTAAGCTCCGAGCAGTTTTATAATGAGAAGCTTCTGACGGAGGTGTTGCAGGTGGGGGTTCCGGTAGGGGCGCGGCGGTGGTGGAATATGAGCGATGAGATGAAGGAGATTGTGAGTAGAGAGAATGTGGAAAAGGGCGTGGGGTTTCTTATGGGGGCGACGGAGGAGGCGGCGGCAATTAGAGAGCGGGCGAAACAGCTCGGGGCTGCTGCGAACAGGGCAGTTCAAAGCGGCGGCTCGTCGGAGAACAATTTGATATCGTTGATGAAAGAATTGAGGTCAATTAAGGTTAACGATAAGGATTAA

Protein sequence

MSMLPQNKEFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGFKIQTHIVSFPGAQVGLAPGIENYSDVSSRHLQAKIYQAFLLLDKLIDQMIIPATRPDCILSDLSHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLPHHIQMTKSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIKAWGLGPVSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPPPQMAEIAHGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRWWNMSDEMKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLISLMKELRSIKVNDKD
BLAST of Cla97C02G037890 vs. NCBI nr
Match: XP_022149559.1 (soyasapogenol B glucuronide galactosyltransferase-like [Momordica charantia])

HSP 1 Score: 719.2 bits (1855), Expect = 9.9e-204
Identity = 358/490 (73.06%), Postives = 403/490 (82.24%), Query Frame = 0

Query: 3   MLPQNKEFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAA 62
           M+ +N+E RITVLPLFASGHIIPI+DMARLFARHGA VTII TESNA  FQN++  DFAA
Sbjct: 1   MVLENEELRITVLPLFASGHIIPIVDMARLFARHGAAVTIITTESNARSFQNDVARDFAA 60

Query: 63  GFKIQTHIVSFPGAQVGLAPGIENYSDVSSRHLQAKIYQAFLLLDKLIDQMIIPATRPDC 122
           G+KIQT  V FP A+VGL PGIEN+SDV SR LQ KIY+AFL+L+K IDQ+IIP TRPDC
Sbjct: 61  GYKIQTRTVPFPAAEVGLPPGIENFSDVVSRDLQGKIYRAFLILEKQIDQVIIPETRPDC 120

Query: 123 ILSDLSHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLP 182
           ILSDLS+ WTTDTA RLGVPRLVF VSNFMAYSAEHSV+QH+PHQKV SD E FE+PGLP
Sbjct: 121 ILSDLSYGWTTDTAARLGVPRLVFFVSNFMAYSAEHSVLQHAPHQKVTSDFETFELPGLP 180

Query: 183 HHIQMTKSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIK 242
           H IQMTKSQQPEFL++R +FT M+E YKEAERRSYG V NTFYELDGVYLEHYK+  GIK
Sbjct: 181 HKIQMTKSQQPEFLVQRSQFTEMIEKYKEAERRSYGIVTNTFYELDGVYLEHYKRTIGIK 240

Query: 243 AWGLGPVSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPPPQM 302
           AWGLGPVSLAVNK+L  KI+RGNKSGMES EL+ WLNSKEPNSVL+VSFGSMTRFP  Q+
Sbjct: 241 AWGLGPVSLAVNKDLIGKIDRGNKSGMESGELLDWLNSKEPNSVLYVSFGSMTRFPAAQI 300

Query: 303 AEIAHGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQ-MIKNKNRGFIVRIWAPQLLI 362
           AEIAHGLE +G NFIWVIR K++N+ GEA EGLPEGFE+ +++ K +G IVRIWAPQLLI
Sbjct: 301 AEIAHGLESAGRNFIWVIRKKNENEGGEAEEGLPEGFEERVVREKKKGLIVRIWAPQLLI 360

Query: 363 LEHPSTGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRWW 422
           LEHPSTGGFLTHCGWNSSIEG+S GQPMVTWPVSSEQFYNEKLLTEVL+VGVPVGARRWW
Sbjct: 361 LEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQFYNEKLLTEVLRVGVPVGARRWW 420

Query: 423 NMSDEMKE--IVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLI 482
           NMSDEM+E  IV RE V   VGFLMG                                ++
Sbjct: 421 NMSDEMEEEDIVGREEVAAAVGFLMGEAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVV 480

Query: 483 SLMKELRSIK 490
           S+++ELRS+K
Sbjct: 481 SVIEELRSLK 490

BLAST of Cla97C02G037890 vs. NCBI nr
Match: XP_015890275.2 (soyasapogenol B glucuronide galactosyltransferase-like [Ziziphus jujuba])

HSP 1 Score: 486.5 bits (1251), Expect = 1.1e-133
Identity = 239/489 (48.88%), Postives = 341/489 (69.73%), Query Frame = 0

Query: 6   QNKEFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGFK 65
           ++ + RI   PL A GH+IP++D ARLFA+ G  VTII T+ NAS  Q  ID +F AG K
Sbjct: 4   ESDQLRIFFFPLPAPGHMIPMVDEARLFAKRGVDVTIIITQGNASFIQKIIDQEFEAGNK 63

Query: 66  IQTHIVSFPGAQVGLAPGIENYSDVSSRHLQAKIYQAFLLLDKLIDQMIIPATRPDCILS 125
           I+THI+ FP AQVGL  GIENY+ ++S ++   ++Q   LL + I+Q ++   RPDCI+S
Sbjct: 64  IRTHILQFPSAQVGLPDGIENYNTITSMNMGVPLFQGLGLLQQPIEQ-LLHEIRPDCIVS 123

Query: 126 DLSHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLPHHI 185
           D+ +PWT D A +LG+PRL F   + ++  AEH +  H PH+   S+     +PGLPH I
Sbjct: 124 DMFYPWTLDVANKLGIPRLGFRGMSHISMCAEHCIKIHEPHKSAESNV--VLLPGLPHKI 183

Query: 186 QMTKSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIKAWG 245
           +M  SQ P++ +  + FT +M++  E+E++SYG +MN+F+EL+  Y +++K   G+KAW 
Sbjct: 184 KMLISQLPDWSITTNDFTPLMDAVVESEQKSYGMLMNSFHELENDYEQYFKTSMGLKAWS 243

Query: 246 LGPVSLAVNKNLREKIERGNKSGM--ESEELVKWLNSKEPNSVLFVSFGSMTRFPPPQMA 305
           +GPVSL VN++   K ER   +    E  E++ WL+SK+ NSVL+V FGS+T FP  Q+ 
Sbjct: 244 VGPVSLWVNRDFTHKAERHKIACEYGEEHEIIDWLDSKQDNSVLYVGFGSLTMFPATQLK 303

Query: 306 EIAHGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQLLILE 365
           EIAHGLE SG  FIWVIR   K ++ E  +  PEGFE+ ++   RGFI++ WAPQ++IL+
Sbjct: 304 EIAHGLEASGHPFIWVIR---KKETDEYKQVFPEGFEERMRGSKRGFIIKGWAPQMVILD 363

Query: 366 HPSTGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRWWNM 425
           HP+TGGFLTHCGWNS +EGI+AG PM+TWP+ +EQF+NEKL+T+V++VGV +G + W   
Sbjct: 364 HPATGGFLTHCGWNSILEGINAGLPMITWPLFAEQFFNEKLVTDVVRVGVSLGLKEWRQW 423

Query: 426 SDEMKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLISLMK 485
             E  E+V RE VEK V  LMG  EEAA +R+R  +L  AAN+AV+ GGSS+ NL+ L+ 
Sbjct: 424 GYEGTEVVKREEVEKAVRLLMGDGEEAAEMRKRVSELKDAANKAVEIGGSSQTNLMDLIN 483

Query: 486 ELRSIKVND 493
           EL+++K+ +
Sbjct: 484 ELKALKIRN 486

BLAST of Cla97C02G037890 vs. NCBI nr
Match: XP_003546674.1 (soyasapogenol B glucuronide galactosyltransferase-like [Glycine max] >KRH13189.1 hypothetical protein GLYMA_15G221300 [Glycine max])

HSP 1 Score: 485.3 bits (1248), Expect = 2.4e-133
Identity = 234/485 (48.25%), Postives = 338/485 (69.69%), Query Frame = 0

Query: 9   EFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGFKIQT 68
           + ++  LP  ++ H+IP++D+ARLFA HG  VTII T + A+IFQ++ID D   G  I+T
Sbjct: 13  KLKLVSLPFVSTSHLIPVVDIARLFAIHGVDVTIITTTATAAIFQSSIDRDRDRGHAIRT 72

Query: 69  HIVSFPGAQVGLAPGIENYSDVSSRHLQAKIYQAFLLLDKLIDQMIIPATRPDCILSDLS 128
           H+V FP  QVGL  G+E+++  + R L  KIYQ   +L     Q +    +PD + +D+ 
Sbjct: 73  HVVKFPCEQVGLPEGVESFNSNTPRDLVPKIYQGLTILQDQY-QQLFHDLQPDFLFTDMF 132

Query: 129 HPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLPHHIQMT 188
           +PWT D A +LG+PRL++    ++A+S+++++ Q SPH KV SDTE F +PGLPH ++MT
Sbjct: 133 YPWTVDAAAKLGIPRLIYVSGGYLAHSSQNTIEQFSPHTKVDSDTESFLLPGLPHELKMT 192

Query: 189 KSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIKAWGLGP 248
           + Q P++L     +T +M   K++ER+SYG+++NTFYEL+G Y EHYKK  G K+W +GP
Sbjct: 193 RLQLPDWLRAPTGYTYLMNMMKDSERKSYGSLLNTFYELEGDYEEHYKKAMGTKSWSVGP 252

Query: 249 VSLAVNKNLREKIERGN---KSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPPPQMAEI 308
           VS  VN++  +K +RG+   + G   E  + WL+SK  NSVL+VSFGSM +FP PQ+ EI
Sbjct: 253 VSFWVNQDALDKADRGHAKEEQGEGEEGWLTWLDSKTENSVLYVSFGSMNKFPTPQLVEI 312

Query: 309 AHGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQLLILEHP 368
           AH LEDS  +FIWV+R K +++ GE  + L E F++ +K  N+G+++  WAPQLLILEH 
Sbjct: 313 AHALEDSDHDFIWVVRKKGESEDGEGNDFLQE-FDKRVKASNKGYLIWGWAPQLLILEHH 372

Query: 369 STGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRWWNMSD 428
           + G  +THCGWN+ IE ++AG PM TWP+ +EQFYNEKLL EVL++GVPVGA+ W N ++
Sbjct: 373 AIGAVVTHCGWNTIIESVNAGLPMATWPLFAEQFYNEKLLAEVLRIGVPVGAKEWRNWNE 432

Query: 429 EMKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLISLMKEL 488
              E+V RE +   +G LMG  EE+  +R RAK L  AA +A+Q GGSS NNL  L++EL
Sbjct: 433 FGDEVVKREEIGNAIGVLMGG-EESIEMRRRAKALSDAAKKAIQVGGSSHNNLKELIQEL 492

Query: 489 RSIKV 491
           +S+K+
Sbjct: 493 KSLKL 494

BLAST of Cla97C02G037890 vs. NCBI nr
Match: RDX97452.1 (Soyasapogenol B glucuronide galactosyltransferase, partial [Mucuna pruriens])

HSP 1 Score: 481.1 bits (1237), Expect = 4.5e-132
Identity = 228/492 (46.34%), Postives = 336/492 (68.29%), Query Frame = 0

Query: 5   PQNKEFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGF 64
           P + + +   LP  ++ H+IP++D+ARLFA HG  VTII T +NA+IFQ++ID D A G 
Sbjct: 10  PDHNKLKAICLPFVSTSHLIPVVDIARLFAMHGVDVTIITTPANAAIFQSSIDRDAARGR 69

Query: 65  KIQTHIVSFPGAQVGLAPGIENYSDVSSRHLQAKIYQAFLLLDKLIDQMIIPATRPDCIL 124
            I+T +V FP  +VGL  G+E+++  + + + +KIYQ   +L +  ++ +    +PD + 
Sbjct: 70  SIRTRVVKFPAVEVGLPEGVESFNSNTPQEMISKIYQGLSILQEQYEE-LFKEMQPDFLF 129

Query: 125 SDLSHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLPHH 184
           +D+ +PWT D A +LG+PRL++    ++A+SA+ S+   SPH KV SDTE F +PGLPH 
Sbjct: 130 TDMFYPWTVDAAAKLGIPRLIYVSGGYLAHSAQDSIQHFSPHTKVESDTESFVLPGLPHE 189

Query: 185 IQMTKSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIKAW 244
           ++MT+ Q P++L     +T +M   K++ERRSYG++ NTFYEL+G Y EHYK+  G K+W
Sbjct: 190 LKMTRLQLPDWLRAPTGYTYLMNMMKDSERRSYGSLFNTFYELEGAYEEHYKRDMGTKSW 249

Query: 245 GLGPVSLAVNKNLREKIERGN-----KSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPP 304
            +GPVS  VN++  +K +RG+     +  +  EE + WL+SK  NSV++VSFGSM +FP 
Sbjct: 250 SVGPVSFWVNQDASDKADRGHAKEEQEGKVREEEWLTWLDSKAENSVVYVSFGSMNKFPT 309

Query: 305 PQMAEIAHGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQL 364
           PQ+ EIAH LEDSG +FIWV+R K + ++ +  + L E FE+ ++   +G ++  WAPQL
Sbjct: 310 PQLVEIAHALEDSGHDFIWVVRKKGEGENEDGSDFLQE-FEKRVRASKKGHLIWGWAPQL 369

Query: 365 LILEHPSTGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARR 424
           LILEH + G  +THCGWN+ IE ++AG PM TWP+ +EQFYNEKLL EVL++GVPVGA+ 
Sbjct: 370 LILEHSAIGAVVTHCGWNTIIESVNAGLPMATWPLFAEQFYNEKLLAEVLRIGVPVGAKE 429

Query: 425 WWNMSDEMKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLI 484
           W N ++   E+V RE + K +  LMG  EE+  +R RAK L  A   A+Q GGSS NN+ 
Sbjct: 430 WRNWNEFGDEVVKREEIGKAIAVLMGG-EESLEMRTRAKALSHACKEAIQVGGSSHNNIK 489

Query: 485 SLMKELRSIKVN 492
            L+++L S K+N
Sbjct: 490 LLIQDLNSFKLN 498

BLAST of Cla97C02G037890 vs. NCBI nr
Match: XP_020231152.1 (soyasapogenol B glucuronide galactosyltransferase-like [Cajanus cajan] >KYP51621.1 Anthocyanin 3'-O-beta-glucosyltransferase [Cajanus cajan])

HSP 1 Score: 477.2 bits (1227), Expect = 6.5e-131
Identity = 229/481 (47.61%), Postives = 335/481 (69.65%), Query Frame = 0

Query: 15  LPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGFKIQTHIVSFP 74
           LP  ++ H+IP++D+ARLFA HG  VTII T +NA+IFQ++ID + A G  I+TH+V FP
Sbjct: 20  LPFASTSHLIPLVDIARLFAMHGVDVTIITTTANATIFQSSIDRECARGHSIRTHVVKFP 79

Query: 75  GAQVGLAPGIENYSDVSSRHLQAKIYQAFLLLDKLIDQMIIPATRPDCILSDLSHPWTTD 134
             QVGL  G+E+++  + + +  K+Y+   +L K   Q +    +PD +++D+ +PWT D
Sbjct: 80  FEQVGLPQGVESFNSNTPQDMVKKVYEGLSIL-KDQYQQLFHDMQPDFLVTDMFYPWTVD 139

Query: 135 TAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLPHHIQMTKSQQPE 194
            A +LG+PRL++    + A+SA++++ Q SPH KV SD+E F IPGLPH ++MT+ Q P+
Sbjct: 140 AAAKLGIPRLIYVGGGYFAHSAQNAIEQFSPHTKVDSDSERFLIPGLPHELEMTRLQIPD 199

Query: 195 FLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIKAWGLGPVSLAVN 254
           +L     ++ +M+  K++ERRSYG++ NTFYEL+G Y EHYKK  G+K+W +GPVS  VN
Sbjct: 200 WLREPKDYSDLMKIMKDSERRSYGSLFNTFYELEGTYEEHYKKAMGVKSWSVGPVSFWVN 259

Query: 255 KNLREKIERGN-KSGME----SEELVKWLNSKEPNSVLFVSFGSMTRFPPPQMAEIAHGL 314
           ++  +K +RG+ K   E     E  + WL+SK  NSVL+VSFGSM +FP PQ+ EIAH L
Sbjct: 260 QDASDKADRGHAKEEQEGEGGGEGWLTWLDSKTENSVLYVSFGSMNKFPTPQLVEIAHAL 319

Query: 315 EDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQLLILEHPSTGG 374
           EDSG +FIWV+R K +++  +  E L E FE+ ++  N+G+++  WAPQLLILEH + G 
Sbjct: 320 EDSGHDFIWVVRKKGESEDCDGNEFLEE-FEERVRASNKGYLIWGWAPQLLILEHLAIGA 379

Query: 375 FLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRWWNMSDEMKE 434
            +THCGWN+ IE ++AG PM TWP+ +EQFYNEKLL +VL++GVPVGA+ W N ++   E
Sbjct: 380 VVTHCGWNTIIESVNAGLPMATWPLFAEQFYNEKLLADVLRIGVPVGAKEWKNWNEFGDE 439

Query: 435 IVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLISLMKELRSIK 491
           +V R+ + K +  LMG  EE   +R R K L  AA +A+Q GGSS N +  L++EL+S K
Sbjct: 440 VVKRDEIGKAIAVLMGGGEECLEMRRRVKALSDAAKKAIQVGGSSHNKMKQLIQELKSFK 498

BLAST of Cla97C02G037890 vs. TrEMBL
Match: tr|A0A2N9I9F6|A0A2N9I9F6_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS48797 PE=4 SV=1)

HSP 1 Score: 537.7 bits (1384), Expect = 2.7e-149
Identity = 262/487 (53.80%), Postives = 364/487 (74.74%), Query Frame = 0

Query: 6   QNKEFRITVLPLF-ASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGF 65
           Q  + ++  LP F   GH+IP++D ARLFARHG +VTII T +NA +FQ  ID D  +G 
Sbjct: 4   QVDKLKVIFLPFFLVPGHLIPLVDTARLFARHGVSVTIITTTANALLFQRAIDCDANSGH 63

Query: 66  KIQTHIVSFPGAQVGLAPGIENYSDVSSRHLQAKIYQAFLLLDKLIDQMIIPATRPDCIL 125
           +I TH++ FP AQVGL  GIENY+ ++S    +K+     LL K I+Q +    RPDCI+
Sbjct: 64  QINTHVLQFPSAQVGLPEGIENYNTMTSNDTNSKLLHGLSLLRKPIEQ-LFQDMRPDCIV 123

Query: 126 SDLSHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLPHH 185
           SD+ +PWT ++A RLG+PRLVF V+++ ++ AE  + Q+ PHQ V SDTE F IPGLP+ 
Sbjct: 124 SDMFYPWTVESAARLGIPRLVFHVTSYFSFCAETCIEQYKPHQSVNSDTEPFLIPGLPNK 183

Query: 186 IQMTKSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIKAW 245
           I+MT+ + P+++  +DRFT ++   KE+ERRSYG ++N+FYEL+G Y E +K+  GIKAW
Sbjct: 184 IEMTRLKLPDWVKTQDRFTQLLNIIKESERRSYGAIVNSFYELEGGYEELHKRNMGIKAW 243

Query: 246 GLGPVSLAVNKNLREKIERGNK-SGMESEELVKWLNSKEPNSVLFVSFGSMTRFPPPQMA 305
            +GPVSL VNK++ +K+ERGNK +  E +E +KWLN+KE NSVL+VSFGSMT+FP PQ+ 
Sbjct: 244 SVGPVSLWVNKDVADKVERGNKVAPPEEQEWLKWLNAKECNSVLYVSFGSMTKFPTPQLI 303

Query: 306 EIAHGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIK-NKNRGFIVRIWAPQLLIL 365
           E+AHGLE SG  FIWV+  KDK+      EG  E F++ +K +K+RG I+R WAPQLLIL
Sbjct: 304 EMAHGLEASGHQFIWVVPKKDKDQD----EGWLEDFQKRMKESKHRGLIIRGWAPQLLIL 363

Query: 366 EHPSTGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRWWN 425
           EHP+ GG +THCGWNS +EG++AG PM+TWP+ +EQFY+EKL+TEVL++GV VG + W  
Sbjct: 364 EHPAIGGQVTHCGWNSFLEGVTAGLPMITWPLFAEQFYHEKLVTEVLKIGVAVGKKEWSR 423

Query: 426 MSDEMKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLISLM 485
            ++E KE+V RE++EK V FLMG+ EEA  +++RA++LG AA RAVQ+GGSS++N + L+
Sbjct: 424 WANEAKEVVKREDIEKAVKFLMGSAEEATEMKKRARELGNAARRAVQTGGSSQSNFMDLI 483

Query: 486 KELRSIK 490
            EL+S+K
Sbjct: 484 NELKSLK 485

BLAST of Cla97C02G037890 vs. TrEMBL
Match: tr|A0A2N9FF75|A0A2N9FF75_FAGSY (Glycosyltransferase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS13306 PE=3 SV=1)

HSP 1 Score: 520.4 bits (1339), Expect = 4.4e-144
Identity = 250/484 (51.65%), Postives = 346/484 (71.49%), Query Frame = 0

Query: 6   QNKEFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGFK 65
           Q  + +   LP    GH+IP++D  RLFA HG  VTII T +NA +FQ  ID D ++G +
Sbjct: 4   QADQLKAIFLPFLVPGHMIPLVDTGRLFAMHGVNVTIITTPANALLFQKAIDRDASSGHQ 63

Query: 66  IQTHIVSFPGAQVGLAPGIENYSDVSSRHLQAKIYQAFLLLDKLIDQMIIPATRPDCILS 125
           I+THI+ FP AQV L  GIEN++ ++S  +  K+Y A  LL K I+Q +    RPDCI++
Sbjct: 64  IKTHILEFPSAQVSLPKGIENFNMITSPDMSHKLYYAVSLLQKPIEQ-LFQDMRPDCIVT 123

Query: 126 DLSHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLPHHI 185
           D+ +PWT D+A +LG+PRLVF  +++ +  A   + Q++PHQ V S+T+ F +PGLP+ I
Sbjct: 124 DMFYPWTVDSANKLGIPRLVFHGTSYFSLCAASCIKQYAPHQSVKSNTDTFLLPGLPNKI 183

Query: 186 QMTKSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIKAWG 245
           +MT SQ P ++   + +T +M+  KE+E+RSYG VMN+F+EL+  Y EHYK + GIKAW 
Sbjct: 184 EMTTSQLPRWVRTPEAYTQLMDKIKESEQRSYGAVMNSFHELESAYEEHYKSVMGIKAWS 243

Query: 246 LGPVSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPPPQMAEI 305
           +GP+SL  N +  +K+ERGNK+  E+E L  WLNSKE NSVL+VSFGS+ +F   Q+ E+
Sbjct: 244 VGPISLWANSDATDKVERGNKATTENEWL-NWLNSKECNSVLYVSFGSLNKFSTSQLIEL 303

Query: 306 AHGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQLLILEHP 365
           AHGLE S   FIWV+R K+K++     EG    FE+ IK  NRG I+  WAPQLLILEHP
Sbjct: 304 AHGLEASNHQFIWVVRLKNKDED----EGWLRDFEKRIKESNRGLIIWDWAPQLLILEHP 363

Query: 366 STGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRWWNMSD 425
           + GG +THCGWNS +EG++AG PM+TWP+ +EQFYNEKL+T+V+++GV VG + W  M +
Sbjct: 364 AIGGLVTHCGWNSILEGVTAGLPMITWPLYAEQFYNEKLVTDVIKIGVAVGVKEWRKMDE 423

Query: 426 EMKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLISLMKEL 485
           E KE V RE +EK V FLMG+  EAA ++ RA++LG AA  AVQSGGSS++NL+ L+KEL
Sbjct: 424 EAKETVKREEIEKAVTFLMGSGVEAAEMKNRARELGNAARSAVQSGGSSQSNLMGLIKEL 481

Query: 486 RSIK 490
           +S+K
Sbjct: 484 KSLK 481

BLAST of Cla97C02G037890 vs. TrEMBL
Match: tr|I1MIG3|I1MIG3_SOYBN (Glycosyltransferase OS=Glycine max OX=3847 GN=100810117 PE=3 SV=2)

HSP 1 Score: 485.3 bits (1248), Expect = 1.6e-133
Identity = 234/485 (48.25%), Postives = 338/485 (69.69%), Query Frame = 0

Query: 9   EFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGFKIQT 68
           + ++  LP  ++ H+IP++D+ARLFA HG  VTII T + A+IFQ++ID D   G  I+T
Sbjct: 13  KLKLVSLPFVSTSHLIPVVDIARLFAIHGVDVTIITTTATAAIFQSSIDRDRDRGHAIRT 72

Query: 69  HIVSFPGAQVGLAPGIENYSDVSSRHLQAKIYQAFLLLDKLIDQMIIPATRPDCILSDLS 128
           H+V FP  QVGL  G+E+++  + R L  KIYQ   +L     Q +    +PD + +D+ 
Sbjct: 73  HVVKFPCEQVGLPEGVESFNSNTPRDLVPKIYQGLTILQDQY-QQLFHDLQPDFLFTDMF 132

Query: 129 HPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLPHHIQMT 188
           +PWT D A +LG+PRL++    ++A+S+++++ Q SPH KV SDTE F +PGLPH ++MT
Sbjct: 133 YPWTVDAAAKLGIPRLIYVSGGYLAHSSQNTIEQFSPHTKVDSDTESFLLPGLPHELKMT 192

Query: 189 KSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIKAWGLGP 248
           + Q P++L     +T +M   K++ER+SYG+++NTFYEL+G Y EHYKK  G K+W +GP
Sbjct: 193 RLQLPDWLRAPTGYTYLMNMMKDSERKSYGSLLNTFYELEGDYEEHYKKAMGTKSWSVGP 252

Query: 249 VSLAVNKNLREKIERGN---KSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPPPQMAEI 308
           VS  VN++  +K +RG+   + G   E  + WL+SK  NSVL+VSFGSM +FP PQ+ EI
Sbjct: 253 VSFWVNQDALDKADRGHAKEEQGEGEEGWLTWLDSKTENSVLYVSFGSMNKFPTPQLVEI 312

Query: 309 AHGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQLLILEHP 368
           AH LEDS  +FIWV+R K +++ GE  + L E F++ +K  N+G+++  WAPQLLILEH 
Sbjct: 313 AHALEDSDHDFIWVVRKKGESEDGEGNDFLQE-FDKRVKASNKGYLIWGWAPQLLILEHH 372

Query: 369 STGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRWWNMSD 428
           + G  +THCGWN+ IE ++AG PM TWP+ +EQFYNEKLL EVL++GVPVGA+ W N ++
Sbjct: 373 AIGAVVTHCGWNTIIESVNAGLPMATWPLFAEQFYNEKLLAEVLRIGVPVGAKEWRNWNE 432

Query: 429 EMKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLISLMKEL 488
              E+V RE +   +G LMG  EE+  +R RAK L  AA +A+Q GGSS NNL  L++EL
Sbjct: 433 FGDEVVKREEIGNAIGVLMGG-EESIEMRRRAKALSDAAKKAIQVGGSSHNNLKELIQEL 492

Query: 489 RSIKV 491
           +S+K+
Sbjct: 493 KSLKL 494

BLAST of Cla97C02G037890 vs. TrEMBL
Match: tr|A0A151SA39|A0A151SA39_CAJCA (Glycosyltransferase OS=Cajanus cajan OX=3821 GN=KK1_026505 PE=3 SV=1)

HSP 1 Score: 477.2 bits (1227), Expect = 4.3e-131
Identity = 229/481 (47.61%), Postives = 335/481 (69.65%), Query Frame = 0

Query: 15  LPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGFKIQTHIVSFP 74
           LP  ++ H+IP++D+ARLFA HG  VTII T +NA+IFQ++ID + A G  I+TH+V FP
Sbjct: 20  LPFASTSHLIPLVDIARLFAMHGVDVTIITTTANATIFQSSIDRECARGHSIRTHVVKFP 79

Query: 75  GAQVGLAPGIENYSDVSSRHLQAKIYQAFLLLDKLIDQMIIPATRPDCILSDLSHPWTTD 134
             QVGL  G+E+++  + + +  K+Y+   +L K   Q +    +PD +++D+ +PWT D
Sbjct: 80  FEQVGLPQGVESFNSNTPQDMVKKVYEGLSIL-KDQYQQLFHDMQPDFLVTDMFYPWTVD 139

Query: 135 TAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLPHHIQMTKSQQPE 194
            A +LG+PRL++    + A+SA++++ Q SPH KV SD+E F IPGLPH ++MT+ Q P+
Sbjct: 140 AAAKLGIPRLIYVGGGYFAHSAQNAIEQFSPHTKVDSDSERFLIPGLPHELEMTRLQIPD 199

Query: 195 FLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIKAWGLGPVSLAVN 254
           +L     ++ +M+  K++ERRSYG++ NTFYEL+G Y EHYKK  G+K+W +GPVS  VN
Sbjct: 200 WLREPKDYSDLMKIMKDSERRSYGSLFNTFYELEGTYEEHYKKAMGVKSWSVGPVSFWVN 259

Query: 255 KNLREKIERGN-KSGME----SEELVKWLNSKEPNSVLFVSFGSMTRFPPPQMAEIAHGL 314
           ++  +K +RG+ K   E     E  + WL+SK  NSVL+VSFGSM +FP PQ+ EIAH L
Sbjct: 260 QDASDKADRGHAKEEQEGEGGGEGWLTWLDSKTENSVLYVSFGSMNKFPTPQLVEIAHAL 319

Query: 315 EDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQLLILEHPSTGG 374
           EDSG +FIWV+R K +++  +  E L E FE+ ++  N+G+++  WAPQLLILEH + G 
Sbjct: 320 EDSGHDFIWVVRKKGESEDCDGNEFLEE-FEERVRASNKGYLIWGWAPQLLILEHLAIGA 379

Query: 375 FLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRWWNMSDEMKE 434
            +THCGWN+ IE ++AG PM TWP+ +EQFYNEKLL +VL++GVPVGA+ W N ++   E
Sbjct: 380 VVTHCGWNTIIESVNAGLPMATWPLFAEQFYNEKLLADVLRIGVPVGAKEWKNWNEFGDE 439

Query: 435 IVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLISLMKELRSIK 491
           +V R+ + K +  LMG  EE   +R R K L  AA +A+Q GGSS N +  L++EL+S K
Sbjct: 440 VVKRDEIGKAIAVLMGGGEECLEMRRRVKALSDAAKKAIQVGGSSHNKMKQLIQELKSFK 498

BLAST of Cla97C02G037890 vs. TrEMBL
Match: tr|K7LN65|K7LN65_SOYBN (Glycosyltransferase OS=Glycine max OX=3847 GN=GLYMA_11G053400 PE=3 SV=1)

HSP 1 Score: 476.1 bits (1224), Expect = 9.6e-131
Identity = 227/484 (46.90%), Postives = 338/484 (69.83%), Query Frame = 0

Query: 9   EFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGFKIQT 68
           E +   LP  ++ HIIP++DMARLFA H   VTII T  NA++FQ +ID D + G  I+T
Sbjct: 7   ELKSIFLPFLSTSHIIPLVDMARLFALHDVDVTIITTAHNATVFQKSIDLDASRGRPIRT 66

Query: 69  HIVSFPGAQVGLAPGIENYSDVSSRHLQAKIYQAFLLLDKLIDQMIIPATRPDCILSDLS 128
           H+V+FP AQVGL  GIE ++  + R +  +IY    LL ++ ++ +    +PD I++D+ 
Sbjct: 67  HVVNFPAAQVGLPVGIEAFNVDTPREMTPRIYMGLSLLQQVFEK-LFHDLQPDFIVTDMF 126

Query: 129 HPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLPHHIQMT 188
           HPW+ D A +LG+PR++F  ++++A SA HSV Q++PH +   DT++F +PGLP +++MT
Sbjct: 127 HPWSVDAAAKLGIPRIMFHGASYLARSAAHSVEQYAPHLEAKFDTDKFVLPGLPDNLEMT 186

Query: 189 KSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIKAWGLGP 248
           + Q P++L   +++T +M + K++E++SYG++ N+FY+L+  Y EHYK I G K+WG+GP
Sbjct: 187 RLQLPDWLRSPNQYTELMRTIKQSEKKSYGSLFNSFYDLESAYYEHYKSIMGTKSWGIGP 246

Query: 249 VSLAVNKNLREKIERG-NKSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPPPQMAEIAH 308
           VSL  N++ ++K  RG  K   E E  +KWLNSK  +SVL+VSFGSM +FP  Q+ EIA 
Sbjct: 247 VSLWANQDAQDKAARGYAKEEEEKEGWLKWLNSKAESSVLYVSFGSMNKFPYSQLVEIAR 306

Query: 309 GLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQLLILEHPST 368
            LEDSG +FIWV+R   KND GE    L E FE+ +K  N+G+++  WAPQLLILE+P+ 
Sbjct: 307 ALEDSGHDFIWVVR---KNDGGEGDNFLEE-FEKRMKESNKGYLIWGWAPQLLILENPAI 366

Query: 369 GGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRWWNMSDEM 428
           GG +THCGWN+ +E ++AG PM TWP+ +E F+NEKL+ +VL++GVPVGA+ W N ++  
Sbjct: 367 GGLVTHCGWNTVVESVNAGLPMATWPLFAEHFFNEKLVVDVLKIGVPVGAKEWRNWNEFG 426

Query: 429 KEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLISLMKELRS 488
            E+V RE +   +  LM   EE   +R+RAK+L  AA  A++ GGSS NN+  L++EL+ 
Sbjct: 427 SEVVKREEIGNAIASLMSEEEEDGGMRKRAKELSVAAKSAIKVGGSSHNNMKELIRELKE 485

Query: 489 IKVN 492
           IK++
Sbjct: 487 IKLS 485

BLAST of Cla97C02G037890 vs. Swiss-Prot
Match: sp|D4Q9Z4|SGT2_SOYBN (Soyasapogenol B glucuronide galactosyltransferase OS=Glycine max OX=3847 GN=GmSGT2 PE=1 SV=1)

HSP 1 Score: 474.6 bits (1220), Expect = 1.4e-132
Identity = 226/484 (46.69%), Postives = 338/484 (69.83%), Query Frame = 0

Query: 9   EFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGFKIQT 68
           E +   LP  ++ HIIP++DMARLFA H   VTII T  NA++FQ +ID D + G  I+T
Sbjct: 7   ELKSIFLPFLSTSHIIPLVDMARLFALHDVDVTIITTAHNATVFQKSIDLDASRGRPIRT 66

Query: 69  HIVSFPGAQVGLAPGIENYSDVSSRHLQAKIYQAFLLLDKLIDQMIIPATRPDCILSDLS 128
           H+V+FP AQVGL  GIE ++  + R +  +IY    LL ++ ++ +    +PD I++D+ 
Sbjct: 67  HVVNFPAAQVGLPVGIEAFNVDTPREMTPRIYMGLSLLQQVFEK-LFHDLQPDFIVTDMF 126

Query: 129 HPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLPHHIQMT 188
           HPW+ D A +LG+PR++F  ++++A SA HSV Q++PH +   DT++F +PGLP +++MT
Sbjct: 127 HPWSVDAAAKLGIPRIMFHGASYLARSAAHSVEQYAPHLEAKFDTDKFVLPGLPDNLEMT 186

Query: 189 KSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIKAWGLGP 248
           + Q P++L   +++T +M + K++E++SYG++ N+FY+L+  Y EHYK I G K+WG+GP
Sbjct: 187 RLQLPDWLRSPNQYTELMRTIKQSEKKSYGSLFNSFYDLESAYYEHYKSIMGTKSWGIGP 246

Query: 249 VSLAVNKNLREKIERG-NKSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPPPQMAEIAH 308
           VSL  N++ ++K  RG  K   E E  +KWLNSK  +SVL+VSFGS+ +FP  Q+ EIA 
Sbjct: 247 VSLWANQDAQDKAARGYAKEEEEKEGWLKWLNSKAESSVLYVSFGSINKFPYSQLVEIAR 306

Query: 309 GLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQLLILEHPST 368
            LEDSG +FIWV+R   KND GE    L E FE+ +K  N+G+++  WAPQLLILE+P+ 
Sbjct: 307 ALEDSGHDFIWVVR---KNDGGEGDNFLEE-FEKRMKESNKGYLIWGWAPQLLILENPAI 366

Query: 369 GGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRWWNMSDEM 428
           GG +THCGWN+ +E ++AG PM TWP+ +E F+NEKL+ +VL++GVPVGA+ W N ++  
Sbjct: 367 GGLVTHCGWNTVVESVNAGLPMATWPLFAEHFFNEKLVVDVLKIGVPVGAKEWRNWNEFG 426

Query: 429 KEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLISLMKELRS 488
            E+V RE +   +  LM   EE   +R+RAK+L  AA  A++ GGSS NN+  L++EL+ 
Sbjct: 427 SEVVKREEIGNAIASLMSEEEEDGGMRKRAKELSVAAKSAIKVGGSSHNNMKELIRELKE 485

Query: 489 IKVN 492
           IK++
Sbjct: 487 IKLS 485

BLAST of Cla97C02G037890 vs. Swiss-Prot
Match: sp|Q9AT54|SCGT_TOBAC (Scopoletin glucosyltransferase OS=Nicotiana tabacum OX=4097 GN=TOGT1 PE=1 SV=1)

HSP 1 Score: 398.3 bits (1022), Expect = 1.3e-109
Identity = 199/472 (42.16%), Postives = 302/472 (63.98%), Query Frame = 0

Query: 16  PLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGFKIQTHIVSFPG 75
           P+ A GH+IP +DMA+LFA  G   TII T  N  +F   I  +   G +I+  ++ FP 
Sbjct: 10  PVMAHGHMIPTLDMAKLFASRGVKATIITTPLNEFVFSKAIQRNKHLGIEIEIRLIKFPA 69

Query: 76  AQVGLAPGIENYSDVSSRHLQAKIYQAFLLLDKLIDQMIIPATRPDCILSDLSHPWTTDT 135
            + GL    E    + S       ++A  ++ + ++Q +I   RPDC++SD+  PWTTDT
Sbjct: 70  VENGLPEECERLDQIPSDEKLPNFFKAVAMMQEPLEQ-LIEECRPDCLISDMFLPWTTDT 129

Query: 136 AERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLPHHIQMTKSQQPEF 195
           A +  +PR+VF  ++F A   E+SV  + P + V+SD+E F +P LPH I++T++Q   F
Sbjct: 130 AAKFNIPRIVFHGTSFFALCVENSVRLNKPFKNVSSDSETFVVPDLPHEIKLTRTQVSPF 189

Query: 196 LLRRDR--FTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIKAWGLGPVSLAV 255
               +    T M+++ +E++ +SYG V N+FYEL+  Y+EHY K+ G +AW +GP+S+  
Sbjct: 190 ERSGEETAMTRMIKTVRESDSKSYGVVFNSFYELETDYVEHYTKVLGRRAWAIGPLSMC- 249

Query: 256 NKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPPPQMAEIAHGLEDSG 315
           N+++ +K ERG KS ++  E +KWL+SK+P+SV++V FGS+  F   Q+ E+A G+E SG
Sbjct: 250 NRDIEDKAERGKKSSIDKHECLKWLDSKKPSSVVYVCFGSVANFTASQLHELAMGIEASG 309

Query: 316 INFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQLLILEHPSTGGFLTH 375
             FIWV+R +  N+     + LPEGFE+  + K +G I+R WAPQ+LIL+H S G F+TH
Sbjct: 310 QEFIWVVRTELDNE-----DWLPEGFEE--RTKEKGLIIRGWAPQVLILDHESVGAFVTH 369

Query: 376 CGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRWWNMSDEMKEIVSR 435
           CGWNS++EG+S G PMVTWPV +EQF+NEKL+TEVL+ G  VG+ +W   + E    V R
Sbjct: 370 CGWNSTLEGVSGGVPMVTWPVFAEQFFNEKLVTEVLKTGAGVGSIQWKRSASEG---VKR 429

Query: 436 ENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLISLMKEL 486
           E + K +  +M  +EEA   R RAK     A +A++ GGSS   L +L++++
Sbjct: 430 EAIAKAIKRVM-VSEEADGFRNRAKAYKEMARKAIEEGGSSYTGLTTLLEDI 468

BLAST of Cla97C02G037890 vs. Swiss-Prot
Match: sp|Q2V6J9|UFOG7_FRAAN (UDP-glucose flavonoid 3-O-glucosyltransferase 7 OS=Fragaria ananassa OX=3747 GN=GT7 PE=1 SV=1)

HSP 1 Score: 397.5 bits (1020), Expect = 2.1e-109
Identity = 213/487 (43.74%), Postives = 303/487 (62.22%), Query Frame = 0

Query: 8   KEFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGFKIQ 67
           ++  I  LP  A GH IP+ D+A+LF+ HGA  TI+ T  NA +F            +I+
Sbjct: 9   QQLHIFFLPFMARGHSIPLTDIAKLFSSHGARCTIVTTPLNAPLFSKATQRG-----EIE 68

Query: 68  THIVSFPGAQVGLAPGIENYSDVSSRHLQAKIYQAFLLLDKLIDQMIIPATRPDCILSDL 127
             ++ FP A+ GL    E+   ++++ +  K  +A  L++   ++ I+   RP C+++D 
Sbjct: 69  LVLIKFPSAEAGLPQDCESADLITTQDMLGKFVKATFLIEPHFEK-ILDEHRPHCLVADA 128

Query: 128 SHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLPHHIQM 187
              W TD A +  +PRL F  + F A  A  SVM + PH  ++SD+E F IP LP  I+M
Sbjct: 129 FFTWATDVAAKFRIPRLYFHGTGFFALCASLSVMMYQPHSNLSSDSESFVIPNLPDEIKM 188

Query: 188 TKSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIKAWGLG 247
           T+SQ P F      F  M+++  E E RSYG ++N+FYEL+  Y  HY+K+ G KAW +G
Sbjct: 189 TRSQLPVF-PDESEFMKMLKASIEIEERSYGVIVNSFYELEPAYANHYRKVFGRKAWHIG 248

Query: 248 PVSLAVNKNLREKIERGN--KSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPPPQMAEI 307
           PVS   NK + +K ERG+   S  E  E +KWL+SK+P SV++VSFGSM RF   Q+ EI
Sbjct: 249 PVSFC-NKAIEDKAERGSIKSSTAEKHECLKWLDSKKPRSVVYVSFGSMVRFADSQLLEI 308

Query: 308 AHGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQLLILEHP 367
           A GLE SG +FIWV++ + K    E  E LPEGFE+ ++ K  G I+R WAPQ+LILEH 
Sbjct: 309 ATGLEASGQDFIWVVKKEKK----EVEEWLPEGFEKRMEGK--GLIIRDWAPQVLILEHE 368

Query: 368 STGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRW----W 427
           + G F+THCGWNS +E +SAG PM+TWPV  EQFYNEKL+TE+ ++GVPVG+ +W     
Sbjct: 369 AIGAFVTHCGWNSILEAVSAGVPMITWPVFGEQFYNEKLVTEIHRIGVPVGSEKWALSFV 428

Query: 428 NMSDEMKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLISL 487
           +++ E +  V RE +E+ V  +M   +EA   R R K+LG  A RAV+ GGSS  +L +L
Sbjct: 429 DVNAETEGRVRREAIEEAVTRIM-VGDEAVETRSRVKELGENARRAVEEGGSSFLDLSAL 480

Query: 488 MKELRSI 489
           + EL  +
Sbjct: 489 VGELNDL 480

BLAST of Cla97C02G037890 vs. Swiss-Prot
Match: sp|Q7Y232|U73B4_ARATH (UDP-glycosyltransferase 73B4 OS=Arabidopsis thaliana OX=3702 GN=UGT73B4 PE=2 SV=1)

HSP 1 Score: 388.3 bits (996), Expect = 1.3e-106
Identity = 211/495 (42.63%), Postives = 311/495 (62.83%), Query Frame = 0

Query: 8   KEFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGFKIQ 67
           ++  I   P  A GH+IP++DMA+LFAR GA  T++ T  NA I +  I+      FK+Q
Sbjct: 4   EQIHILFFPFMAHGHMIPLLDMAKLFARRGAKSTLLTTPINAKILEKPIE-----AFKVQ 63

Query: 68  T-------HIVSFPGAQVGLAPGIENYSDVSS--RHLQAKIYQAFLLLDKLIDQMI---I 127
                    I++FP  ++GL  G EN   ++S  +     ++  FL   K + Q +   I
Sbjct: 64  NPDLEIGIKILNFPCVELGLPEGCENRDFINSYQKSDSFDLFLKFLFSTKYMKQQLESFI 123

Query: 128 PATRPDCILSDLSHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEE 187
             T+P  +++D+  PW T++AE++GVPRLVF  ++  A    +++  H PH+KVAS +  
Sbjct: 124 ETTKPSALVADMFFPWATESAEKIGVPRLVFHGTSSFALCCSYNMRIHKPHKKVASSSTP 183

Query: 188 FEIPGLPHHIQMTKSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHY 247
           F IPGLP  I +T+  Q         F    +  +E+E  S+G ++N+FYEL+  Y + Y
Sbjct: 184 FVIPGLPGDIVITE-DQANVTNEETPFGKFWKEVRESETSSFGVLVNSFYELESSYADFY 243

Query: 248 KKITGIKAWGLGPVSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMT 307
           +     KAW +GP+SL+ N+ + EK  RG K+ ++ +E +KWL+SK P SV+++SFGS T
Sbjct: 244 RSFVAKKAWHIGPLSLS-NRGIAEKAGRGKKANIDEQECLKWLDSKTPGSVVYLSFGSGT 303

Query: 308 RFPPPQMAEIAHGLEDSGINFIWVI-RNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRI 367
             P  Q+ EIA GLE SG NFIWV+ +N+++  +GE  + LP+GFE+  +NK +G I+R 
Sbjct: 304 GLPNEQLLEIAFGLEGSGQNFIWVVSKNENQVGTGENEDWLPKGFEE--RNKGKGLIIRG 363

Query: 368 WAPQLLILEHPSTGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVP 427
           WAPQ+LIL+H + GGF+THCGWNS++EGI+AG PMVTWP+ +EQFYNEKLLT+VL++GV 
Sbjct: 364 WAPQVLILDHKAIGGFVTHCGWNSTLEGIAAGLPMVTWPMGAEQFYNEKLLTKVLRIGVN 423

Query: 428 VGARRWWNMSDEMKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSS 487
           VGA           +++SR  VEK V  ++G  E+A   R RAK+LG  A  AV+ GGSS
Sbjct: 424 VGATELVKKG----KLISRAQVEKAVREVIGG-EKAEERRLRAKELGEMAKAAVEEGGSS 483

Query: 488 ENNLISLMKELRSIK 490
            N++   M+EL   K
Sbjct: 484 YNDVNKFMEELNGRK 484

BLAST of Cla97C02G037890 vs. Swiss-Prot
Match: sp|Q9ZQG4|U73B5_ARATH (UDP-glycosyltransferase 73B5 OS=Arabidopsis thaliana OX=3702 GN=UGT73B5 PE=2 SV=1)

HSP 1 Score: 379.8 bits (974), Expect = 4.6e-104
Identity = 206/490 (42.04%), Postives = 304/490 (62.04%), Query Frame = 0

Query: 7   NKEFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNID--HDFAAGF 66
           ++   I   P  A GH+IPI+DMA+LF+R GA  T++ T  NA IF+  I+   +     
Sbjct: 6   SERIHILFFPFMAQGHMIPILDMAKLFSRRGAKSTLLTTPINAKIFEKPIEAFKNQNPDL 65

Query: 67  KIQTHIVSFPGAQVGLAPGIENYSDVSS--RHLQAKIYQAFLLLDKLIDQMI---IPATR 126
           +I   I +FP  ++GL  G EN   ++S  +     ++  FL   K + Q +   I  T+
Sbjct: 66  EIGIKIFNFPCVELGLPEGCENADFINSYQKSDSGDLFLKFLFSTKYMKQQLESFIETTK 125

Query: 127 PDCILSDLSHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIP 186
           P  +++D+  PW T++AE+LGVPRLVF  ++F +    +++  H PH+KVA+ +  F IP
Sbjct: 126 PSALVADMFFPWATESAEKLGVPRLVFHGTSFFSLCCSYNMRIHKPHKKVATSSTPFVIP 185

Query: 187 GLPHHIQMTKSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKIT 246
           GLP  I +T+  Q             M+  +E+E  S+G ++N+FYEL+  Y + Y+   
Sbjct: 186 GLPGDIVITE-DQANVAKEETPMGKFMKEVRESETNSFGVLVNSFYELESAYADFYRSFV 245

Query: 247 GIKAWGLGPVSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPP 306
             +AW +GP+SL+ N+ L EK  RG K+ ++ +E +KWL+SK P SV+++SFGS T F  
Sbjct: 246 AKRAWHIGPLSLS-NRELGEKARRGKKANIDEQECLKWLDSKTPGSVVYLSFGSGTNFTN 305

Query: 307 PQMAEIAHGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQL 366
            Q+ EIA GLE SG +FIWV+R  +  + G+  E LPEGF++  +   +G I+  WAPQ+
Sbjct: 306 DQLLEIAFGLEGSGQSFIWVVRKNE--NQGDNEEWLPEGFKE--RTTGKGLIIPGWAPQV 365

Query: 367 LILEHPSTGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARR 426
           LIL+H + GGF+THCGWNS+IEGI+AG PMVTWP+ +EQFYNEKLLT+VL++GV VGA  
Sbjct: 366 LILDHKAIGGFVTHCGWNSAIEGIAAGLPMVTWPMGAEQFYNEKLLTKVLRIGVNVGATE 425

Query: 427 WWNMSDEMKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLI 486
                    +++SR  VEK V  ++G  E+A   R  AK+LG  A  AV+ GGSS N++ 
Sbjct: 426 LVKKG----KLISRAQVEKAVREVIGG-EKAEERRLWAKKLGEMAKAAVEEGGSSYNDVN 484

Query: 487 SLMKELRSIK 490
             M+EL   K
Sbjct: 486 KFMEELNGRK 484

BLAST of Cla97C02G037890 vs. TAIR10
Match: AT2G15490.1 (UDP-glycosyltransferase 73B4)

HSP 1 Score: 388.3 bits (996), Expect = 7.2e-108
Identity = 211/495 (42.63%), Postives = 311/495 (62.83%), Query Frame = 0

Query: 8   KEFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDHDFAAGFKIQ 67
           ++  I   P  A GH+IP++DMA+LFAR GA  T++ T  NA I +  I+      FK+Q
Sbjct: 4   EQIHILFFPFMAHGHMIPLLDMAKLFARRGAKSTLLTTPINAKILEKPIE-----AFKVQ 63

Query: 68  T-------HIVSFPGAQVGLAPGIENYSDVSS--RHLQAKIYQAFLLLDKLIDQMI---I 127
                    I++FP  ++GL  G EN   ++S  +     ++  FL   K + Q +   I
Sbjct: 64  NPDLEIGIKILNFPCVELGLPEGCENRDFINSYQKSDSFDLFLKFLFSTKYMKQQLESFI 123

Query: 128 PATRPDCILSDLSHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEE 187
             T+P  +++D+  PW T++AE++GVPRLVF  ++  A    +++  H PH+KVAS +  
Sbjct: 124 ETTKPSALVADMFFPWATESAEKIGVPRLVFHGTSSFALCCSYNMRIHKPHKKVASSSTP 183

Query: 188 FEIPGLPHHIQMTKSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHY 247
           F IPGLP  I +T+  Q         F    +  +E+E  S+G ++N+FYEL+  Y + Y
Sbjct: 184 FVIPGLPGDIVITE-DQANVTNEETPFGKFWKEVRESETSSFGVLVNSFYELESSYADFY 243

Query: 248 KKITGIKAWGLGPVSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMT 307
           +     KAW +GP+SL+ N+ + EK  RG K+ ++ +E +KWL+SK P SV+++SFGS T
Sbjct: 244 RSFVAKKAWHIGPLSLS-NRGIAEKAGRGKKANIDEQECLKWLDSKTPGSVVYLSFGSGT 303

Query: 308 RFPPPQMAEIAHGLEDSGINFIWVI-RNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRI 367
             P  Q+ EIA GLE SG NFIWV+ +N+++  +GE  + LP+GFE+  +NK +G I+R 
Sbjct: 304 GLPNEQLLEIAFGLEGSGQNFIWVVSKNENQVGTGENEDWLPKGFEE--RNKGKGLIIRG 363

Query: 368 WAPQLLILEHPSTGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVP 427
           WAPQ+LIL+H + GGF+THCGWNS++EGI+AG PMVTWP+ +EQFYNEKLLT+VL++GV 
Sbjct: 364 WAPQVLILDHKAIGGFVTHCGWNSTLEGIAAGLPMVTWPMGAEQFYNEKLLTKVLRIGVN 423

Query: 428 VGARRWWNMSDEMKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSS 487
           VGA           +++SR  VEK V  ++G  E+A   R RAK+LG  A  AV+ GGSS
Sbjct: 424 VGATELVKKG----KLISRAQVEKAVREVIGG-EKAEERRLRAKELGEMAKAAVEEGGSS 483

Query: 488 ENNLISLMKELRSIK 490
            N++   M+EL   K
Sbjct: 484 YNDVNKFMEELNGRK 484

BLAST of Cla97C02G037890 vs. TAIR10
Match: AT2G15480.1 (UDP-glucosyl transferase 73B5)

HSP 1 Score: 379.8 bits (974), Expect = 2.6e-105
Identity = 206/490 (42.04%), Postives = 304/490 (62.04%), Query Frame = 0

Query: 7   NKEFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNID--HDFAAGF 66
           ++   I   P  A GH+IPI+DMA+LF+R GA  T++ T  NA IF+  I+   +     
Sbjct: 6   SERIHILFFPFMAQGHMIPILDMAKLFSRRGAKSTLLTTPINAKIFEKPIEAFKNQNPDL 65

Query: 67  KIQTHIVSFPGAQVGLAPGIENYSDVSS--RHLQAKIYQAFLLLDKLIDQMI---IPATR 126
           +I   I +FP  ++GL  G EN   ++S  +     ++  FL   K + Q +   I  T+
Sbjct: 66  EIGIKIFNFPCVELGLPEGCENADFINSYQKSDSGDLFLKFLFSTKYMKQQLESFIETTK 125

Query: 127 PDCILSDLSHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIP 186
           P  +++D+  PW T++AE+LGVPRLVF  ++F +    +++  H PH+KVA+ +  F IP
Sbjct: 126 PSALVADMFFPWATESAEKLGVPRLVFHGTSFFSLCCSYNMRIHKPHKKVATSSTPFVIP 185

Query: 187 GLPHHIQMTKSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKIT 246
           GLP  I +T+  Q             M+  +E+E  S+G ++N+FYEL+  Y + Y+   
Sbjct: 186 GLPGDIVITE-DQANVAKEETPMGKFMKEVRESETNSFGVLVNSFYELESAYADFYRSFV 245

Query: 247 GIKAWGLGPVSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPP 306
             +AW +GP+SL+ N+ L EK  RG K+ ++ +E +KWL+SK P SV+++SFGS T F  
Sbjct: 246 AKRAWHIGPLSLS-NRELGEKARRGKKANIDEQECLKWLDSKTPGSVVYLSFGSGTNFTN 305

Query: 307 PQMAEIAHGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQL 366
            Q+ EIA GLE SG +FIWV+R  +  + G+  E LPEGF++  +   +G I+  WAPQ+
Sbjct: 306 DQLLEIAFGLEGSGQSFIWVVRKNE--NQGDNEEWLPEGFKE--RTTGKGLIIPGWAPQV 365

Query: 367 LILEHPSTGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARR 426
           LIL+H + GGF+THCGWNS+IEGI+AG PMVTWP+ +EQFYNEKLLT+VL++GV VGA  
Sbjct: 366 LILDHKAIGGFVTHCGWNSAIEGIAAGLPMVTWPMGAEQFYNEKLLTKVLRIGVNVGATE 425

Query: 427 WWNMSDEMKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLI 486
                    +++SR  VEK V  ++G  E+A   R  AK+LG  A  AV+ GGSS N++ 
Sbjct: 426 LVKKG----KLISRAQVEKAVREVIGG-EKAEERRLWAKKLGEMAKAAVEEGGSSYNDVN 484

Query: 487 SLMKELRSIK 490
             M+EL   K
Sbjct: 486 KFMEELNGRK 484

BLAST of Cla97C02G037890 vs. TAIR10
Match: AT4G34131.1 (UDP-glucosyl transferase 73B3)

HSP 1 Score: 375.9 bits (964), Expect = 3.7e-104
Identity = 210/492 (42.68%), Postives = 306/492 (62.20%), Query Frame = 0

Query: 7   NKEFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNIDH--DFAAGF 66
           +++  +   P  A GH+IP +DMA+LF+  GA  TI+ T  N+ IFQ  I+   +    F
Sbjct: 6   HRKLHVVFFPFMAYGHMIPTLDMAKLFSSRGAKSTILTTPLNSKIFQKPIERFKNLNPSF 65

Query: 67  KIQTHIVSFPGAQVGLAPGIENYSDVSS------RHLQAKIYQAFLLLDKLIDQMIIPAT 126
           +I   I  FP   +GL  G EN    +S      ++L  K +++       +++ ++  T
Sbjct: 66  EIDIQIFDFPCVDLGLPEGCENVDFFTSNNNDDRQYLTLKFFKSTRFFKDQLEK-LLETT 125

Query: 127 RPDCILSDLSHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEI 186
           RPDC+++D+  PW T+ AE+  VPRLVF  + + +  +E+ +  H+P   VAS  E F I
Sbjct: 126 RPDCLIADMFFPWATEAAEKFNVPRLVFHGTGYFSLCSEYCIRVHNPQNIVASRYEPFVI 185

Query: 187 PGLPHHIQMTKSQQPEFLLRRDRFTAM---MESYKEAERRSYGTVMNTFYELDGVYLEHY 246
           P LP +I +T+ Q    +  RD  + M   M   KE++ +S G ++N+FYEL+  Y + Y
Sbjct: 186 PDLPGNIVITQEQ----IADRDEESEMGKFMIEVKESDVKSSGVIVNSFYELEPDYADFY 245

Query: 247 KKITGIKAWGLGPVSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMT 306
           K +   +AW +GP+S+  N+   EK ERG K+ +   E +KWL+SK+P+SV+++SFGS+ 
Sbjct: 246 KSVVLKRAWHIGPLSV-YNRGFEEKAERGKKASINEVECLKWLDSKKPDSVIYISFGSVA 305

Query: 307 RFPPPQMAEIAHGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIW 366
            F   Q+ EIA GLE SG NFIWV+R   KN   E  E LPEGFE+ +K K  G I+R W
Sbjct: 306 CFKNEQLFEIAAGLETSGANFIWVVR---KNIGIEKEEWLPEGFEERVKGK--GMIIRGW 365

Query: 367 APQLLILEHPSTGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPV 426
           APQ+LIL+H +T GF+THCGWNS +EG++AG PMVTWPV++EQFYNEKL+T+VL+ GV V
Sbjct: 366 APQVLILDHQATCGFVTHCGWNSLLEGVAAGLPMVTWPVAAEQFYNEKLVTQVLRTGVSV 425

Query: 427 GARRWWNMSDEMKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSE 486
           GA++    +    + +SRE V K V  ++   EEA   RERAK+L   A  AV+ GGSS 
Sbjct: 426 GAKKNVRTTG---DFISREKVVKAVREVL-VGEEADERRERAKKLAEMAKAAVE-GGSSF 481

Query: 487 NNLISLMKELRS 488
           N+L S ++E  S
Sbjct: 486 NDLNSFIEEFTS 481

BLAST of Cla97C02G037890 vs. TAIR10
Match: AT4G34138.1 (UDP-glucosyl transferase 73B1)

HSP 1 Score: 357.1 bits (915), Expect = 1.8e-98
Identity = 197/487 (40.45%), Postives = 296/487 (60.78%), Query Frame = 0

Query: 14  VLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIF----QNNIDHDFAAGFKIQTH 73
           + P  A GH+IP +DMA+LFA  GA  TI+ T  NA +F      + + D      I   
Sbjct: 14  LFPFMAHGHMIPTLDMAKLFATKGAKSTILTTPLNAKLFFEKPIKSFNQDNPGLEDITIQ 73

Query: 74  IVSFPGAQVGLAPGIENYS------DVSSRHLQAKIYQAFLLLDKLIDQMIIPATRPDCI 133
           I++FP  ++GL  G EN        D++   L  K   A    ++ ++++++   RPDC+
Sbjct: 74  ILNFPCTELGLPDGCENTDFIFSTPDLNVGDLSQKFLLAMKYFEEPLEELLV-TMRPDCL 133

Query: 134 LSDLSHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFEIPGLPH 193
           + ++  PW+T  AE+ GVPRLVF  + + +  A H +      + VA+ +E F IP LP 
Sbjct: 134 VGNMFFPWSTKVAEKFGVPRLVFHGTGYFSLCASHCIRL---PKNVATSSEPFVIPDLPG 193

Query: 194 HIQMTKSQQPEFLLRRDRFTAMMESYKEAERRSYGTVMNTFYELDGVYLEHYKKITGIKA 253
            I +T+ Q  E           M++ +++ER S+G ++N+FYEL+  Y +++K     +A
Sbjct: 194 DILITEEQVME-TEEESVMGRFMKAIRDSERDSFGVLVNSFYELEQAYSDYFKSFVAKRA 253

Query: 254 WGLGPVSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSMTRFPPPQMA 313
           W +GP+SL  N+   EK ERG K+ ++  E +KWL+SK+ +SV++++FG+M+ F   Q+ 
Sbjct: 254 WHIGPLSLG-NRKFEEKAERGKKASIDEHECLKWLDSKKCDSVIYMAFGTMSSFKNEQLI 313

Query: 314 EIAHGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRIWAPQLLILE 373
           EIA GL+ SG +F+WV+    K    E  + LPEGFE+  K K +G I+R WAPQ+LILE
Sbjct: 314 EIAAGLDMSGHDFVWVVNR--KGSQVEKEDWLPEGFEE--KTKGKGLIIRGWAPQVLILE 373

Query: 374 HPSTGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVPVGARRWWNM 433
           H + GGFLTHCGWNS +EG++AG PMVTWPV +EQFYNEKL+T+VL+ GV VG ++   M
Sbjct: 374 HKAIGGFLTHCGWNSLLEGVAAGLPMVTWPVGAEQFYNEKLVTQVLKTGVSVGVKK---M 433

Query: 434 SDEMKEIVSRENVEKGVGFLMGATEEAAAIRERAKQLGAAANRAVQSGGSSENNLISLMK 491
              + + +SRE VE  V  +M   E     R+RAK+L   A  AV+ GGSS+  +  LM+
Sbjct: 434 MQVVGDFISREKVEGAVREVMVGEER----RKRAKELAEMAKNAVKEGGSSDLEVDRLME 483

BLAST of Cla97C02G037890 vs. TAIR10
Match: AT4G34135.1 (UDP-glucosyltransferase 73B2)

HSP 1 Score: 355.1 bits (910), Expect = 6.8e-98
Identity = 192/446 (43.05%), Postives = 276/446 (61.88%), Query Frame = 0

Query: 7   NKEFRITVLPLFASGHIIPIIDMARLFARHGATVTIIATESNASIFQNNID--HDFAAGF 66
           +++  +   P  A GH+IP +DMA+LF+  GA  TI+ T  N+ I Q  ID   +   G 
Sbjct: 7   HRKLHVMFFPFMAYGHMIPTLDMAKLFSSRGAKSTILTTSLNSKILQKPIDTFKNLNPGL 66

Query: 67  KIQTHIVSFPGAQVGLAPGIENYSDVSS-------RHLQAKIYQAFLLLDKLIDQMIIPA 126
           +I   I +FP  ++GL  G EN    +S                     D+L  + ++  
Sbjct: 67  EIDIQIFNFPCVELGLPEGCENVDFFTSXXXXXXXXXXXXXXXSTRFFKDQL--EKLLGT 126

Query: 127 TRPDCILSDLSHPWTTDTAERLGVPRLVFSVSNFMAYSAEHSVMQHSPHQKVASDTEEFE 186
           TRPDC+++D+  PW T+ A +  VPRLVF  + + +  A + +  H P ++VAS +E F 
Sbjct: 127 TRPDCLIADMFFPWATEAAGKFNVPRLVFHGTGYFSLCAGYCIGVHKPQKRVASSSEPFV 186

Query: 187 IPGLPHHIQMTKSQQPEFLLRRDRFTAM---MESYKEAERRSYGTVMNTFYELDGVYLEH 246
           IP LP +I +T+ Q    ++  D  + M   M   +E+E +S G V+N+FYEL+  Y + 
Sbjct: 187 IPELPGNIVITEEQ----IIDGDGESDMGKFMTEVRESEVKSSGVVLNSFYELEHDYADF 246

Query: 247 YKKITGIKAWGLGPVSLAVNKNLREKIERGNKSGMESEELVKWLNSKEPNSVLFVSFGSM 306
           YK     +AW +GP+S+  N+   EK ERG K+ ++  E +KWL+SK+PNSV++VSFGS+
Sbjct: 247 YKSCVQKRAWHIGPLSV-YNRGFEEKAERGKKANIDEAECLKWLDSKKPNSVIYVSFGSV 306

Query: 307 TRFPPPQMAEIAHGLEDSGINFIWVIRNKDKNDSGEAPEGLPEGFEQMIKNKNRGFIVRI 366
             F   Q+ EIA GLE SG +FIWV+R K K+D     E LPEGFE+ +K K  G I+R 
Sbjct: 307 AFFKNEQLFEIAAGLEASGTSFIWVVR-KTKDD---REEWLPEGFEERVKGK--GMIIRG 366

Query: 367 WAPQLLILEHPSTGGFLTHCGWNSSIEGISAGQPMVTWPVSSEQFYNEKLLTEVLQVGVP 426
           WAPQ+LIL+H +TGGF+THCGWNS +EG++AG PMVTWPV +EQFYNEKL+T+VL+ GV 
Sbjct: 367 WAPQVLILDHQATGGFVTHCGWNSLLEGVAAGLPMVTWPVGAEQFYNEKLVTQVLRTGVS 426

Query: 427 VGARRWWNMSDEMKEIVSRENVEKGV 441
           VGA +  +M   M + +SRE V+K V
Sbjct: 427 VGASK--HMKVMMGDFISREKVDKAV 437

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022149559.19.9e-20473.06soyasapogenol B glucuronide galactosyltransferase-like [Momordica charantia][more]
XP_015890275.21.1e-13348.88soyasapogenol B glucuronide galactosyltransferase-like [Ziziphus jujuba][more]
XP_003546674.12.4e-13348.25soyasapogenol B glucuronide galactosyltransferase-like [Glycine max] >KRH13189.1... [more]
RDX97452.14.5e-13246.34Soyasapogenol B glucuronide galactosyltransferase, partial [Mucuna pruriens][more]
XP_020231152.16.5e-13147.61soyasapogenol B glucuronide galactosyltransferase-like [Cajanus cajan] >KYP51621... [more]
Match NameE-valueIdentityDescription
tr|A0A2N9I9F6|A0A2N9I9F6_FAGSY2.7e-14953.80Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS48797 PE=4 SV=1[more]
tr|A0A2N9FF75|A0A2N9FF75_FAGSY4.4e-14451.65Glycosyltransferase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS13306 PE=3 SV=1[more]
tr|I1MIG3|I1MIG3_SOYBN1.6e-13348.25Glycosyltransferase OS=Glycine max OX=3847 GN=100810117 PE=3 SV=2[more]
tr|A0A151SA39|A0A151SA39_CAJCA4.3e-13147.61Glycosyltransferase OS=Cajanus cajan OX=3821 GN=KK1_026505 PE=3 SV=1[more]
tr|K7LN65|K7LN65_SOYBN9.6e-13146.90Glycosyltransferase OS=Glycine max OX=3847 GN=GLYMA_11G053400 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
sp|D4Q9Z4|SGT2_SOYBN1.4e-13246.69Soyasapogenol B glucuronide galactosyltransferase OS=Glycine max OX=3847 GN=GmSG... [more]
sp|Q9AT54|SCGT_TOBAC1.3e-10942.16Scopoletin glucosyltransferase OS=Nicotiana tabacum OX=4097 GN=TOGT1 PE=1 SV=1[more]
sp|Q2V6J9|UFOG7_FRAAN2.1e-10943.74UDP-glucose flavonoid 3-O-glucosyltransferase 7 OS=Fragaria ananassa OX=3747 GN=... [more]
sp|Q7Y232|U73B4_ARATH1.3e-10642.63UDP-glycosyltransferase 73B4 OS=Arabidopsis thaliana OX=3702 GN=UGT73B4 PE=2 SV=... [more]
sp|Q9ZQG4|U73B5_ARATH4.6e-10442.04UDP-glycosyltransferase 73B5 OS=Arabidopsis thaliana OX=3702 GN=UGT73B5 PE=2 SV=... [more]
Match NameE-valueIdentityDescription
AT2G15490.17.2e-10842.63UDP-glycosyltransferase 73B4[more]
AT2G15480.12.6e-10542.04UDP-glucosyl transferase 73B5[more]
AT4G34131.13.7e-10442.68UDP-glucosyl transferase 73B3[more]
AT4G34138.11.8e-9840.45UDP-glucosyl transferase 73B1[more]
AT4G34135.16.8e-9843.05UDP-glucosyltransferase 73B2[more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0016758transferase activity, transferring hexosyl groups
Vocabulary: Biological Process
TermDefinition
GO:0008152metabolic process
Vocabulary: INTERPRO
TermDefinition
IPR035595UDP_glycos_trans_CS
IPR002213UDP_glucos_trans
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0080043 quercetin 3-O-glucosyltransferase activity
molecular_function GO:0080044 quercetin 7-O-glucosyltransferase activity
molecular_function GO:0016740 transferase activity
molecular_function GO:0016757 transferase activity, transferring glycosyl groups
molecular_function GO:0016758 transferase activity, transferring hexosyl groups
molecular_function GO:0008194 UDP-glycosyltransferase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C02G037890.1Cla97C02G037890.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3DG3DSA:3.40.50.2000coord: 14..263
e-value: 3.4E-116
score: 390.8
coord: 467..475
e-value: 3.4E-116
score: 390.8
NoneNo IPR availableGENE3DG3DSA:3.40.50.2000coord: 264..466
e-value: 3.4E-116
score: 390.8
NoneNo IPR availablePANTHERPTHR11926GLUCOSYL/GLUCURONOSYL TRANSFERASEScoord: 11..491
NoneNo IPR availablePANTHERPTHR11926:SF384SUBFAMILY NOT NAMEDcoord: 11..491
NoneNo IPR availableSUPERFAMILYSSF53756UDP-Glycosyltransferase/glycogen phosphorylasecoord: 11..485
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePFAMPF00201UDPGTcoord: 270..406
e-value: 1.3E-19
score: 70.3
IPR035595UDP-glycosyltransferase family, conserved sitePROSITEPS00375UDPGTcoord: 355..398

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Cla97C02G037890Bhi10G001422Wax gourdwgowmbB373
The following gene(s) are paralogous to this gene:

None